ID Tx1-6_BF repbase; DNA; INV; 6019 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-6_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-6_BF; KW Tx1-6_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6019 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6019 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 843-843 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 220..1839 FT /product="Tx1-6_BF_1p" FT /note="ORF1." FT /translation="MPQVTPSAYTKRRTVAITFLENRDAEFDDIVELLRKH FT KVDIMRDCAGIQSKERGWAEVVFETTSALNRVAPALAADRTIDVELYGNGV FT TLVTVIGASLELDDNYVRYQLREYGTIKDGRFLTYASRGFPNLKTGTRQYK FT MLLSKHIPNSIRVGGDNVSIRYNGQPKMCHRCGGQDHFVAACQAVKCTRCH FT ELGHRAADCSAQIKCNNCKKEGHTLRSCPLSFANRLTMGTKWGSTAEWTQS FT DDQNRPGGSGAQQGNVGLKPRVELTSDDNESVKEDDDKEDNESVKEVDDKE FT EEDRDDSSDEGSDFEDARKSAPSKEEDMEVEDDEEEKGKEEERENGKEEET FT VPLKVAGDPSTGVIVGAKPTTPKVAGDAPPEVVTATKPVTPLSEPPQSGGF FT WDLEALGLPEDGANKIPDSLSEQADPQASLALALPESGDLIIDMETEGDVD FT TPKRPHSSESEDSSDGSNIPPKAATKGKKAKVLPEPQEKGFGSQIDLFPSS FT AEPSQKGKKPEAKNVTRKKSTTNSGKGKSGKPTKPGSKANPKPT" FT CDS 1891..5421 FT /product="Tx1-6_BF_2p" FT /note="endonuclease and RT." FT /translation="MDWKLLTLNVNGMKDRDKRDLVFDLCKTEQVDVACLQ FT ECHVSSIEDKKRWTRQWGYKAFWSLGTNSSRGVGILLSSRMEFVSSQTDLD FT GRVVSILVKADDTQYNVVNVYAPSTAAERKTFFEDLHQYMHPNATLIVTGD FT FNCVLDCSLDRYSDSTTTPGNDTQDVVELRNFCADLGLVDVWRDQHPDQKD FT FTWRSKATSTRSRLDRFYTSTSNLGPCVIQACSFSDHDMVSLSVPPTQDTE FT RGPGVWKCNVKLKVLEDPLFANDFEEKYKSWQESKQNCENLREWWEEIKSN FT TKDLIRQHSKRIVHSTNLVERDLLREIDILRSRLNVLRGCSDTMERYSEAK FT EKLTKLRLDRLSGQKIRSRIRAFEQDEKPTRFFFQKEREKGKKRLIHSIRN FT ATGEVVSSKNGILETFRTFYSRLYEPDQINTQDQQYFLDKLDTALSQVSRD FT ALDKPVSLSELEDALKGMANGKTPGSDGLPKEFYRRFWNLVGQDLLDVLNE FT GLEEGLLSPSQRESIISLIEKSGDPLDPANNRPISLLNVDYKILTKTLANR FT LKVVIEEVVHSNQTCGIPGRSIVDSSCLLRDIISYVNDKNLDCAFLALDQE FT KAFDRVDHAYMANVLEKLGFGQNFCKWMKVLYEDVSSQVLVNGNLTSRLSI FT GRGVRQGCPLSPLLYVLCIEPLAAAIRADPQIKGVQIPGGQGQEVKLVQYA FT DDNTCVLTDQQSIDRTFYLIGRFESGTGSKLNMGKTKALWLGGWRGRQDKP FT YPMHSWVSTHLKILGNPTGNGSRLAEEAWLQRFAKFKEKLDKWRHRKLTLI FT GKVEVVNSLAAARLWYVAPVYPLPRSVKVKIEQEIRRFIWDGKTPLVKLDT FT LALSKDKGGLGLVSVAHKADALLLKCVRKALTDPQDPGSRFAHYWLGLCLR FT RIDPESWSNTTPHSFNPPKQYEEIANLLGDIARKGIVVDWTSSTTASLYTA FT LLEAEGTVPRCVQKDPLRDWERVWKALQNRLHSNWDRMISWHVAHNSLLTN FT LKLSGWRNFHPDSACPRQGCSLDESVPHTFWECPVVNGLWDWLGGLIRRHI FT LPGFQVTKSFALFGTVPPTTPKKTRLVLEAASAIVRSLIWRTRGDSKFQRS FT HVTGPDLVDKLKRMLKDRLLFEFQRLGPAGFYSTWAEGFSWVSIDLTQCYV FT HFWWSSRTDLPILGRSET" XX SQ Sequence 6019 BP; 1641 A; 1391 C; 1637 G; 1350 T; 0 other; taaaaccggg gtaaaccggg gtttggtatc gtccgcccgg gtccattgtc ttagcccggc 60 ggcggaagtg tatttaggtc acgtcgctat ttttagcgga ggccagattc cgggggccta 120 aaggggtttc ccgggagcct tttcgggagc ccaaaagggg tttctcgagc ccttaaaccc 180 cttttttcct ctacaaaacg agcttctgag ctacgaatca tgccgcaggt tacgccctcc 240 gcgtacacga aaaggagaac agttgccatc accttcctgg agaaccggga cgcagagttc 300 gatgatatcg ttgagcttct gcgcaaacac aaggtggata ttatgagaga ttgcgctggt 360 attcaaagta aggaaagggg atgggccgaa gtggtgtttg agaccactag cgctctaaac 420 agggtggctc ctgcccttgc agccgacaga actatcgacg ttgagctgta tgggaatgga 480 gtcaccctgg tcaccgttat cggtgcttct ctggagttgg acgacaatta cgttcggtac 540 caactcaggg aatacgggac tataaaggac ggaaggtttc tgacgtatgc cagccggggg 600 ttcccgaatc tgaaaacggg aacccggcaa tataagatgt tactgtccaa gcatataccc 660 aattctatca gggtgggtgg agacaatgtc tctattaggt ataacggcca acctaagatg 720 tgccatagat gtggtggcca ggaccacttt gtggcagcgt gccaggctgt caagtgcact 780 cgctgccacg aattggggca cagggcggca gattgcagcg ctcagatcaa atgcaataac 840 tgcaagaaag agggccacac gctcagatcg tgtccactga gctttgccaa caggctgacc 900 atgggcacga agtggggcag tacagccgaa tggactcagt ctgatgacca gaatagaccc 960 gggggaagtg gtgcccagca gggtaatgta gggctcaaac cccgagtcga attgactagt 1020 gatgacaatg agtcggtcaa agaagacgat gataaggaag acaatgagtc ggtcaaagaa 1080 gtcgatgata aggaagagga ggatagagat gatagcagtg atgaaggtag tgacttcgaa 1140 gatgctagaa agagtgcccc ctctaaggag gaggatatgg aggttgagga cgatgaggaa 1200 gaaaagggaa aagaggagga aagagagaat gggaaggaag aggaaactgt gccactcaag 1260 gtggcagggg acccctccac cggggtgatt gtgggtgcca aacctacaac acccaaggtg 1320 gcgggagacg ctccacccga ggtggttacg gcgaccaaac ctgtaacacc tctttccgaa 1380 ccacctcagt ccggtggttt ttgggacctc gaggccttag gcctaccgga agacggggct 1440 aataaaatcc ccgactccct ctcagaacag gcagaccctc aggccagcct cgcactggca 1500 ctcccagagt cgggagacct tatcatagac atggaaacag agggggacgt ggacacacca 1560 aagcgccccc actcgagtga gtcggaagac tcctcggatg gttctaacat cccgccaaag 1620 gcagcaacga aaggcaagaa ggccaaagtc ctgcccgagc ctcaggaaaa agggttcggg 1680 tctcagatag acttgttccc gtcctcggct gagccctctc agaagggcaa gaagccagag 1740 gccaagaacg tcacgaggaa gaagtccacc acaaacagtg gcaaaggcaa gagtggcaaa 1800 cctaccaagc caggtagcaa agccaaccca aagccaacct aggtccggaa cggtgtacat 1860 gccaagtcaa acgatcaacc ataaacattg atggattgga aattactcac cttaaatgtt 1920 aacgggatga aagataggga caaaagggat ctggttttcg acctgtgtaa aacggagcag 1980 gtcgatgtgg cctgcctcca ggagtgccac gtatcctcta tagaagataa gaaaagatgg 2040 acccgccaat gggggtacaa agctttctgg tccctcggta caaattcatc gaggggagtc 2100 ggcatcctcc tatcctctcg gatggaattt gtgtcatccc agacagattt agatggtagg 2160 gtggtctcga tactagtgaa ggccgacgac acccaataca atgtagtaaa tgtctatgct 2220 cctagtactg cggcagaaag aaaaaccttc tttgaagacc tccaccagta catgcacccc 2280 aatgctaccc ttatagtaac tggggatttc aactgtgttt tagattgtag cctggataga 2340 tactcagact caaccaccac accggggaat gatacccagg atgtagtgga gttgagaaat 2400 ttctgtgcag acctaggtct tgtggatgtt tggagagacc aacatcctga ccagaaggac 2460 ttcacttgga ggtcgaaagc cactagcaca aggtctagat tagacagatt ttacacgtcc 2520 accagtaacc ttggtccctg tgtaatacag gcttgttcct tttctgatca tgacatggtc 2580 tcgttgtcag tgccacccac acaggacact gaacgaggcc ctggtgtatg gaagtgcaat 2640 gtcaaactca aagtcttaga agatccacta tttgccaacg attttgaaga gaaatataag 2700 agttggcagg agtctaagca aaactgtgaa aatcttaggg aatggtggga ggagataaaa 2760 tccaacacca aagacctgat taggcagcat agcaaacgaa tagtccactc gaccaacctg 2820 gtcgagaggg acttactaag agaaatagac attctaaggt cccgtctgaa tgtgttgagg 2880 gggtgttccg acacaatgga gcgttactca gaggccaaag agaagctgac caaactacgt 2940 cttgacagac tgtcgggtca aaagatccgg agtaggatca gggctttcga gcaagacgaa 3000 aagcctactc gtttcttctt tcaaaaggag cgagaaaagg ggaagaagcg tctaattcac 3060 agcatcagaa atgctacagg agaagttgta tcctccaaga atgggatctt ggagactttc 3120 cggaccttct attcgcggtt gtatgaaccc gaccagataa atacacaaga ccagcagtac 3180 ttcttagata agctggacac cgccttgtcc caggttagcc gagatgcgct agacaagcca 3240 gtctccttgt cagagcttga agatgctctg aaaggaatgg ccaatggtaa aactccgggc 3300 tctgatggct tgccaaaaga gttctaccgg cgtttctgga acctagtggg gcaagatttg 3360 ctggacgtgt taaatgaagg tttggaggag ggtcttctct ctccttccca aagggagagt 3420 attatttctt taatagaaaa gtccggggat ccactcgacc cagcaaacaa ccgaccgata 3480 agcctactaa acgtagacta taagatcctg accaaaaccc ttgcaaatag gctaaaggtg 3540 gtcattgaag aggtagttca ctcgaaccag acttgtggca tcccgggtag gtccattgtg 3600 gacagctcct gtctcctcag agacataatc tcgtatgtca atgataagaa cctggattgt 3660 gccttcctgg ctcttgatca agagaaggca tttgataggg ttgaccatgc ttacatggcc 3720 aacgtccttg aaaagcttgg cttcggacaa aacttctgta agtggatgaa ggtcttgtac 3780 gaagatgtgt cgagccaagt gcttgtaaac gggaacctga cttcccgtct atctatcggt 3840 agaggggtta gacaaggatg tcccctgtcc cccctcctat atgtgctttg tatagaacct 3900 ctggcggcgg ccatcagggc tgacccccag attaagggtg tccagatccc tggtgggcaa 3960 gggcaggagg taaagctagt gcagtacgct gatgacaaca cctgcgtgct aacagaccag 4020 caatccatcg acagaacttt ctacctcata ggcaggttcg aaagcggcac agggtcaaaa 4080 ctgaatatgg ggaaaaccaa agcgctatgg cttgggggat ggaggggacg gcaggacaaa 4140 ccgtacccca tgcacagttg ggtgtccacc caccttaaga tcttgggtaa cccgactgga 4200 aacggctctc gcctcgccga ggaggcttgg ctccaacgtt tcgccaagtt taaggagaaa 4260 ctagataaat ggaggcatag gaagctcacc ctcattggca aagtggaggt ggttaacagc 4320 ctggccgcag ccagactgtg gtatgttgct ccagtatatc cactcccgag atcagtaaag 4380 gttaagattg aacaagaaat tcgccgcttc atatgggacg gcaaaactcc gctagtcaag 4440 ttggacacac tagctctatc gaaagacaag ggaggtttgg gccttgtgtc tgtggcccat 4500 aaggcagatg cacttttgtt gaagtgtgta aggaaggcac tgacagatcc tcaggatcct 4560 ggcagtaggt tcgcgcacta ctggctggga ctttgcctcc gtcgcatcga tccagagtcg 4620 tggagcaaca ctactccaca ctctttcaat ccgcccaagc agtacgaaga gattgccaac 4680 ctcttaggag acattgctag gaaaggcata gtggtggact ggacatctag caccaccgct 4740 tccctgtata ctgcattgtt ggaagcggag ggcactgttc ccagatgtgt acagaaagac 4800 ccccttcgtg actgggagcg agtgtggaaa gcgctgcaaa accgcttaca ctcaaactgg 4860 gatcgcatga tatcgtggca tgttgcccat aactctctgc tgacgaacct gaaactcagt 4920 gggtggagaa acttccatcc agatagtgct tgtccaagac aaggatgttc tctggatgaa 4980 agtgtcccac acaccttttg ggagtgccct gtggtcaatg gcctgtggga ttggcttgga 5040 ggcttgatca gacgacacat cctccctggc ttccaagtga cgaagtcctt tgccttgttt 5100 ggtactgttc ccccaacaac cccaaagaag accaggttgg ttctcgaggc tgccagtgcg 5160 atagtccggt ctctgatctg gagaacgaga ggtgactcga agttccaacg aagtcatgtc 5220 accggtcctg acttggtgga caagttgaaa aggatgttaa aagatcgctt gctgttcgaa 5280 tttcaacgcc taggacctgc gggtttctac tccacctggg cggagggatt ctcctgggtc 5340 tccatagacc taactcagtg ttacgtccac ttttggtggt ctagtcgtac tgatctccct 5400 atattaggga ggtcagaaac ctaaaaaaga gaaaacccaa aaaaactata aaaaagaaaa 5460 cccaaaaaca gtagaatagt gggggatatg ggaaagggga tggaccaaga gtggtgggtg 5520 ggagtgaggt gctgcgttat ccgagtcgat ccgtttcaga attggtttga tctgggtccc 5580 cggaggggga tgcatcaaga gatatgtaag ccggggagtg tttatcctcc cttggtatga 5640 actgtataac aagtaaatgc tctgaactag tatacaatta cttttgtcaa aagagtatct 5700 gttcttatca gtcttaccaa ctacggttca gttactgctt gggccgggct aagatccgat 5760 gtgagtggac catatctgtc ctttgggggt atgggtgacg tctgcctgtt aggctggttt 5820 ctttatgtac tacatcggta caacgcccca agagtggtca tggcctttta gtcagctgtg 5880 cctggaaatg ccactgattt ctagattcag tttccaagtc acatggttat tttttgccgt 5940 aattttattg acagtcttgt gaggttagtt aaggcccttg agtgccctgt agcctcaagg 6000 tgtagcggct ggaataccc 6019 // ID DINOLT1 repbase; DNA; INV; 5378 BP. XX AC NW_633168; XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 16-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon from Dictyostelium discoideum. ~41%. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; DINOLT1. XX NM DINOLT1. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-5378 RA Jurka J.; RT "DINOLT1: Non-LTR retrotransposon from Dictyostelium RT discoideum."; RL Repbase Reports 6(5), 210-210 (2006). XX DR EMBL/GenBank/DDBJ; NW_633168; Positions 160330 165707. XX CC This element is present in several copies in the genome, >99% CC identical. The closest element (DRE) is ~41% identical to CC DINOLT1 at the protein CC level. XX FH Key Location/Qualifiers FT CDS 223..1584 FT /product="DINOLT1_1p" FT /translation="MNKLANTIMTESTETLADKIKKSNELRTKFKYLKNTY FT RTLDKISEAIYHTQIELKQYKFMINIPDEISNNLKAELEIHLQGDDKIIIS FT KNKFKILTDCRTQVIKYSNVFNGVLKGVHRAIIKPKKIFKTFNWEKSMDIV FT EKTLHEEGLEIIKLDNLGENLIIHSFNSIISPEHKDQLIHCKHITILFLTT FT TEETKNQRKTNKEIEDEISENSKITELPNPEIKENTRINIKSKEESNHKNT FT SESKNNNKNKIENRKENNKEINNDNIKDKSKNEKVKENEIENKKPTTIRAT FT TTTTTTETTNTINNIEKRKVLRVPPITVTYSVDSKEKRKMELTEDEEALEV FT VRKMFENEEENRIFEEKINNQKKKETPRLASIKKRQTMSPIQNPESPATPE FT TVKRKRSNSSPNPTTNHQEISTPYKTPTKITKEESDLMEAALRDKLPKVLQ FT DILPSVSFV" FT CDS 1694..5125 FT /product="DINOLT1_2p" FT /translation="MIKIMTWNCQGCSTIKSMNQTKNTINLIKPDISILTE FT TNLKKDQNLAQIKNHYSTGGKGKEARGKGVMAINHLEQIEIKNLEEKEGRI FT LKFTAILNNMEINIICIYAPASYDRRKGWFSNNISLEDLEQADIVTGDFNI FT NKYAKIKFKDGSSDKRKFTENETIEFLMMEVGLTEILPNSTTKTTFKDKLI FT DRTFLSTKMHQLNHSYKIIDKINKSDHNIVVMDFKIPNFTKIYKSPLWKMN FT SIIIKEDKTIKKIEKIIDHYQETKRKNLNITEYWIKFKKKIKKQCITTEKK FT RILERKKTLSNLANEHTKTKNKYEKETISNEITILQEEERIDRMIKSSINY FT INNKEIASNLLTSILKKRDSSSQIHRIKNPSNGEIETEQKEITDCFRKYYE FT TLFKEKECDQETHTELLSTWNPPIDKKKLTEMEDRIEEYEVKLAIEKIAEG FT KSPGEDGITSSFYKIHQHKLIPILTELFNFFLNNEIPTEFKNGILTSIYKG FT KGDVLEISNRRPITLLNVDYKIYSKIINNRILKILPNIISKYQNGFIPGRL FT LHNNIIALDLAMEKSDRNTIITFYDFEKAFDSISHKALIRTLNHLKFPRKI FT TNTILSMLTNTNIRVMVNGQLSESFRAGRGTKQGDPISPTLFAIVCECLST FT SIRKDNTIIGIKLNQNNSMKIGQYADDTVTLASNIDDADKMDMKVIKFCQA FT TAAKINDNKCVCITKNPKIKTKYKTIGAKEEERYLGFFFNRKGVISKVDDT FT VNKLENLTKCYSSVSSTLKGRITILKSYLLSQLTFQLYINEINDIKKLENV FT NANMLFKGDRWAISKERSRRDYEIGGLELWNMATRSNAQKAWIYEQYLREK FT DDQNCPPHMEVWKSEKENSLSRIHIKCWKAWKLLHHPRERITLKLNQVKPK FT YENKQKLKVIYRNMMDIKYKGWNKHQPTTGQKLIQKNINHPILPFREARSI FT TTIKGRDLAWRYLLKALPKHHGENCHSCKEEESSMHIFFECKSIKQNIDSI FT YQKVCKDSNNTYHGPWSEKVLGKLLTPFSSNLIGAIMESIWYRRNQIKFND FT NTTIITENQIIHKIKKARDAEWDRTRKIVEKQLRQELRCTDNRESINRTAS FT IKRRLEKFSHNWNSKLMTIKIPEHFIPYCSYNTNYS" XX SQ Sequence 5378 BP; 2686 A; 938 C; 730 G; 1024 T; 0 other; agtaagagaa aaaacatcaa caacaacact ttttttattt gattatttca tttttcatat 60 aaaaaaaaga cttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 120 aaaaaaaaaa aaaaaaaaaa aacctccgaa aacattcttt cagggaattt ccagaatttt 180 aaatttttaa agtcgaataa agattttaaa gaaaaaaaag aaatgaataa attggcaaat 240 acaataatga cagaatcaac agaaacattg gcagataaaa tcaagaaatc aaatgaactc 300 agaacaaagt tcaaatattt aaaaaatacg tacagaacac ttgataaaat aagcgaagca 360 atataccaca ctcaaattga actaaaacaa tataaattca tgatcaatat cccagatgag 420 atttccaaca acttaaaagc agaattggaa attcatttac aaggagatga taagataata 480 ataagtaaga ataaattcaa aatcctaact gactgcagaa ctcaagtaat caagtactct 540 aacgttttca acggagtact aaaaggagtt cacagagcaa taataaaacc aaaaaagatc 600 ttcaaaacat tcaactggga aaaatcaatg gacatagtag aaaaaacatt gcatgaagaa 660 ggactggaaa taatcaaatt agataattta ggtgaaaacc tgataataca ctcattcaac 720 tcaattatca gtccagagca caaagaccag ctaatacatt gtaaacacat aacaatacta 780 ttcctcacta caacagaaga aacaaaaaat caaagaaaaa caaataagga aatcgaagac 840 gaaatctcag aaaattcaaa aatcacggaa ctaccaaatc ccgagataaa ggaaaacaca 900 agaataaaca taaaatcaaa agaagaatcc aatcataaaa acactagcga atcaaaaaat 960 aacaacaaaa acaaaataga aaacagaaaa gaaaataata aagaaatcaa taatgacaac 1020 atcaaagaca agagcaaaaa cgaaaaagta aaggaaaacg aaattgaaaa taaaaaaccc 1080 acaacaatta gagcaactac aacaaccaca acgacagaaa cgacaaacac aataaacaat 1140 attgaaaaga gaaaagtact tcgtgtccca ccaataacag taacatatag tgttgacagc 1200 aaagagaaaa ggaaaatgga gttaactgaa gatgaagaag cattagaagt ggtaagaaag 1260 atgttcgaaa atgaagaaga gaacagaatt tttgaagaaa aaataaataa ccaaaaaaag 1320 aaggaaaccc caaggttagc ctcaatcaaa aaaagacaaa caatgtcacc tatccaaaat 1380 ccagaatctc cagcaacccc agagacagta aaaagaaaaa gatccaactc atctccaaac 1440 ccaacaacta atcaccaaga aatatcaaca ccttacaaaa ctccaactaa aataacaaag 1500 gaagaatcag atctaatgga agcagcacta agagacaaat tgcccaaagt actccaagac 1560 atacttccat ctgtctcatt tgtttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaca 1620 aaaaaaaaaa aaaaaaaaaa aaaaaacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1680 aaaaaaaaaa aaaatgataa aaataatgac ctggaattgc caaggttgct caacaatcaa 1740 aagcatgaat cagacaaaaa atacaattaa cctaataaaa cctgatatat caatactaac 1800 agaaacaaat ctaaaaaagg atcaaaatct agcgcaaatc aaaaaccatt actcaacagg 1860 aggaaaagga aaagaagcaa gaggaaaagg agtaatggca ataaatcacc tagaacaaat 1920 agaaattaaa aacttggaag aaaaagaagg cagaatctta aaatttactg caattctaaa 1980 caatatggaa atcaacataa tttgcatcta tgcaccagcc tcatatgaca gaagaaaagg 2040 atggttttca aacaatattt cccttgaaga tctagaacag gcagacatag ttactggtga 2100 ctttaacatc aacaaatatg caaaaatcaa gttcaaagat ggatcatccg ataagagaaa 2160 attcacagaa aatgaaacaa tagaattcct aatgatggaa gtaggactca ccgaaatact 2220 accaaattca acaactaaaa caaccttcaa agataaacta atcgatagaa cattcttatc 2280 aacaaagatg caccaactca atcactcgta caagataata gataaaataa acaaatcaga 2340 ccataacata gtagtgatgg atttcaaaat accaaatttc acaaaaatat acaaatcacc 2400 cctatggaaa atgaactcaa taataattaa agaagataaa acaataaaaa agatagaaaa 2460 aataatcgac cactaccagg aaaccaaaag aaaaaacctt aacataacgg aatactggat 2520 taagttcaaa aaaaaaataa aaaaacaatg cattacaaca gaaaagaaaa gaatactaga 2580 gagaaaaaaa acgctatcaa atctagcaaa tgaacacaca aaaacaaaaa acaaatatga 2640 aaaagaaaca atatcgaacg aaataacaat acttcaagaa gaagaaagaa tagacagaat 2700 gatcaaaagc tcaatcaact acatcaacaa taaagaaata gcatcaaatc tccttacatc 2760 tattcttaaa aaaagagact cgagctccca gattcacaga ataaagaacc catccaatgg 2820 agaaatcgaa acagaacaaa aagaaataac agattgcttc agaaaatatt acgagacact 2880 cttcaaagag aaagaatgcg accaagaaac acatacagaa cttctaagca cctggaaccc 2940 accaatagat aaaaaaaaat taacagagat ggaagacaga atcgaagaat atgaggttaa 3000 gctagcaata gaaaagatag cagaaggtaa atcaccagga gaagatggaa taacatcttc 3060 attctataaa atccaccaac acaaactaat tccaatcctc acagaactat tcaacttctt 3120 tttaaacaat gaaattccaa cagaattcaa gaacggtatc cttacatcaa tatacaaagg 3180 taaaggggat gtcttggaaa tctcaaacag aagaccaata acgttactca acgtagacta 3240 caaaatctac tcaaaaataa tcaacaatag aatactaaaa atcctgccaa acataatctc 3300 aaagtaccag aacggtttta taccaggaag gctactccac aacaacatta tagctttgga 3360 tttagcaatg gaaaagagtg atagaaacac gataatcacc ttttatgact tcgaaaaagc 3420 cttcgattca atatctcata aagcgttaat aaggactcta aaccatctaa agttccctcg 3480 gaaaataaca aacacaatac taagcatgtt aacgaacacc aacataaggg tcatggtaaa 3540 tggacaactc tctgaatcct tcagagcagg aagaggtaca aaacagggcg acccaatttc 3600 accaactctc ttcgctatag tatgtgaatg cttatcaaca tcaataagaa aagacaatac 3660 cataataggc ataaaactca atcagaacaa ttcgatgaag attggacaat atgcagacga 3720 cacagtcaca ctagccagta acatagatga cgcagataaa atggatatga aagttataaa 3780 attttgccaa gccacagcag caaaaataaa cgataataaa tgtgtgtgca tcactaaaaa 3840 cccaaagata aagaccaaat ataagacaat tggggcgaaa gaagaagaaa gatatcttgg 3900 ctttttcttc aaccgcaaag gtgtaataag caaagtagat gatacggtga acaaactgga 3960 aaacctcaca aaatgctatt cgagcgtgtc atcaacgcta aaaggtagaa tcacaatcct 4020 caaatcatac ctgctgtccc aactaacatt ccaactatac atcaacgaaa taaatgacat 4080 aaagaaactg gagaacgtta atgccaacat gttattcaaa ggggacaggt gggctatctc 4140 aaaagaaagg agcagaagag actatgaaat tggaggcctg gaactctgga atatggcaac 4200 cagatccaat gcccagaaag catggatata cgaacaatac ctcagagaaa aagacgacca 4260 gaactgccct ccacatatgg aagtctggaa gtcggaaaaa gaaaactcac tatcaaggat 4320 tcacatcaaa tgctggaaag cctggaagtt gttgcaccac ccaagagagc gaataactct 4380 caaactaaac caagtcaaac ccaaatacga aaacaaacaa aaattaaaag taatctatag 4440 aaacatgatg gatatcaaat acaagggatg gaacaaacac cagccaacaa caggtcaaaa 4500 actgattcaa aagaacatta atcacccaat tctaccattt agagaagcca gatcaatcac 4560 aactatcaaa ggaagagact tagcatggag atatctactc aaggcacttc caaaacacca 4620 tggggagaat tgccactcat gtaaagaaga agaatcgtct atgcacatct tcttcgaatg 4680 caaatcaata aaacagaata tcgactcaat ctatcaaaag gtctgtaaag actccaacaa 4740 cacttaccac ggtccatgga gcgaaaaagt tctcggaaaa ctactcacac cattctcatc 4800 aaatctgata ggagccatta tggaatcgat atggtacaga agaaatcaaa taaaattcaa 4860 cgataacact acaataataa cagagaacca aataattcat aaaataaaaa aagcgagaga 4920 tgccgaatgg gatagaacaa gaaagatagt agaaaaacaa ctacgtcaag aactaagatg 4980 cactgataac cgagagtcaa tcaatcggac cgcatcaata aaaagaagac tcgaaaaatt 5040 cagccacaac tggaactcga aactcatgac cataaaaatc ccggaacact tcataccata 5100 ctgctcatat aacaccaact actcataaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5160 ttataacaat aataataata ataataatat aataataata ataatataat aataataata 5220 taataataat aataataata taataataat aaaaaaaaaa aaaataagaa taacaagaaa 5280 aaaagaaaaa aaaaaaaaaa aaaaaaaaaa aaaaatatac aaataaactc tcctagaaca 5340 cactcaaatt gagtgatgtt attaaaaaaa aaaaaaaa 5378 // ID RTE-10_BF repbase; DNA; INV; 3160 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-10_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-10_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3160 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3160 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1708-1708 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 4..3144 FT /product="RTE-10_BF_1p" FT /translation="TMEAQHKQREMDGGLGTSRRCDPALPTTGVEGGAGVT FT DPRTENGVLDPNPDDGDHYGDQTERPSTDGRESTGRINLRKNIRVATWNVR FT TLHQAGRLSNVLREMERCGINVLGVAETHWTGSGFFSTTDGEMVIYSGGEI FT HRAGVAVLLSKAVAKSMISYKAVNERILLVRIKASPFNIYMVQVYAPTTDA FT PDEEVERFYEEVQQLLLECPSQDMALVLGDFNAKVGADRLEDDVCGPYGLG FT TPNDRGERLVEFCQENGLFITNTAFKHHERRRYTWLSPGCRYRNQIDFILI FT GKRWRKCVTNSRAYPGPDCGSDHNMVGASLKLKIKKCGRSKRRTLLNLEAL FT ESSAVRETFNVEVQNRFEALNLLEEERTPDEFSTSFTEAIKAAAEKVLGKA FT PRKANHPWISQDTLDLIDRRRKLKTQRNTESGDLLYREVDKQVQRQARADK FT ERWLEEQCAEMEKGLGSNSSLRKSFSIIKSLRRGFQPKQRNIRSEDNRVLT FT DLQSILQRWKQYCEQLYQDNTLHSVTDDEQDTHSEETVDEHFPEILESEVE FT FAIKRLPKNKAAGVDDLPGELLKTDNPTVTKALCNLCNKVLRTGEWPTDWV FT RSVFITIPKKPGTTDCSEHRTIALISHVSKILLRILLKRMEGVADREFAEE FT QMGFRKKVGTRDQIFNIRILMEKARESNVSLYMAFIDYKKAFDSVKHNILW FT RVMERMGVPKYITNSLRQLYRRQQAAVRVEEELSDWFEVTKGVRQGCLVSP FT ACFNFYSEAVMRESADELSWIGVNISGRLINNLRFADDIGLIATSPERLQE FT LLDLVHTVSMEYSLEISTKKTKVMAATKQPTELKIFCQGVQLEQVPTFKYL FT GAIIDGSAGSSREIQARLGAARTALGSLESIWKDRALRKSTKLRVLKALVW FT PVVTYGCEAWTLHAKDTQKIQAFEMKCYRKLLRVSWTEHRTNDSVLAELGI FT TRTLLNLVKRRKLQYFGHVTRAQNISTHILQGKINGTRSRGRPRRRWSDDI FT RDWTGSSLAVCTQTAKNREEWRRLVLETTIVPDPPDEDGTR" XX SQ Sequence 3160 BP; 957 A; 750 C; 881 G; 571 T; 1 other; tgaactatgg aagcacagca caagcaaaga gagatggacg ggggtctcgg tacctccagg 60 agatgtgacc ccgccctacc aactactgga gtagagggag gtgctggcgt tacagacccg 120 agaacagaga atggcgtgct ggacccaaac ccggatgatg gggaccacta tggcgaccag 180 acggaaaggc ccagcaccga tggaagggaa agtaccggca ggatcaacct acgaaagaac 240 ataagagtgg caacttggaa tgtacgtact ctacaccaag cagggaggct gagtaacgta 300 ctgagggaaa tggaaaggtg cggcatcaat gttctcggtg tcgccgaaac ccattggaca 360 ggatctggtt tcttctctac cacagatgga gagatggtta tctactcagg gggagaaata 420 catagagcag gagtggcggt cttgctatcc aaagcagtgg caaaaagcat gatctcctac 480 aaagcggtga atgagcgtat ccttctagtg cgcattaagg catccccatt caacatctac 540 atggtacagg tgtatgcccc aaccacagat gccccagacg aggaagtgga gaggttctac 600 gaagaagtcc aacaactact acttgagtgc ccaagccagg acatggcctt ggtgttgggg 660 gacttcaatg ctaaagtagg ggcagacagg cttgaagatg atgtctgcgg accctacggc 720 ctaggaactc ccaacgacag aggggaacgg ctggtggaat tctgccaaga gaatggtctc 780 ttcataacca acacagcctt caagcatcac gagaggaggc gctacacatg gctgtcccca 840 ggatgcaggt acaggaacca gatcgacttc atcctcatcg gtaagagatg gaggaagtgc 900 gtcacaaact cccgtgccta ccctggacct gactgtggat cagaccacaa catggtggga 960 gcaagcctca agctcaagat aaagaagtgt ggcaggagta aacgaagaac tctccttaac 1020 ctagaggcac tggagtcatc agcagtacgg gaaaccttca acgttgaagt ccaaaacaga 1080 tttgaggctc tcaacctact tgaagaggaa agaacaccgg atgagttctc aacctccttc 1140 acagaagcta tcaaagctgc agcagagaaa gtgctgggga aggcccctag gaaagcaaac 1200 cacccatgga tatcacagga cactcttgac ctgatagaca gaaggcggaa gctgaagacg 1260 cagcgcaaca cagaaagtgg agacttgctc tacagggagg tggacaaaca agtccaaaga 1320 caggcaagag cagacaagga aaggtggctg gaagagcagt gtgctgagat ggagaagggc 1380 cttggaagta acagctcact caggaagtcc ttcagcatca taaagtcgct gcggagaggt 1440 ttccaaccta agcaaaggaa catcaggagc gaggataacc gggtgctcac tgatctccag 1500 agcattttac agcgatggaa acagtactgc gaacaactgt accaagacaa tacactgcac 1560 tctgttactg atgacgaaca ggacacacac agtgaggaga cagtagatga acacttcccg 1620 gaaattctag aatctgaagt agagtttgcc atcaagcgsc taccaaagaa caaagcagct 1680 ggcgtagatg accttccagg tgaacttctg aaaactgaca acccaactgt gacaaaagcc 1740 ctgtgcaacc tctgcaacaa ggtcctgagg actggagaat ggcccacaga ctgggtacgg 1800 tcagtcttca tcaccatccc caagaagcca ggaaccacgg actgttcaga gcatcgcacc 1860 atcgccctga tatcccacgt cagcaagatt ctactgcgca tcctgctgaa gcgtatggag 1920 ggggtggcag acagggaatt tgctgaggaa cagatggggt tcaggaagaa agtgggaacc 1980 agggaccaga ttttcaacat caggatcctc atggagaaag cacgtgagag caatgtaagt 2040 ctttatatgg catttattga ctataaaaaa gcctttgact cagtaaaaca caatattctc 2100 tggagagtga tggaaagaat gggagtcccc aagtacatca ccaactcact acgacagctg 2160 tacagacgtc agcaggctgc agtgagagta gaggaggagc tgtccgactg gtttgaagtc 2220 accaaaggcg tgcggcaggg atgcctggtc tctccggcct gttttaattt ctactccgaa 2280 gcagtgatga gagaatctgc cgacgaactc tcctggattg gggtcaacat cagtggaagg 2340 ctcattaaca acctgaggtt cgcagatgac attggcctga tagcaacatc accagagcga 2400 ctgcaggagc tgttggacct agtacacaca gtgtccatgg agtacagcct ggagattagc 2460 accaagaaga ccaaggtcat ggcagcaacc aaacagccca cagaactgaa gatcttctgc 2520 cagggggtcc agctagagca agtaccaacc ttcaagtacc tgggtgccat tattgatgga 2580 tcagcgggaa gtagccgaga gatccaggca cgcttgggag cagcgcggac agcccttggt 2640 tctctggagt ccatctggaa ggacagagcc ttaaggaagt ccaccaagct gagggtactg 2700 aaggcactgg tctggcctgt agtgacctat ggctgtgaag cttggacact ccacgccaaa 2760 gacactcaga agatccaagc ctttgaaatg aagtgctaca ggaaactgct cagggtcagc 2820 tggactgaac accgcaccaa cgactcagta ctagctgaac tgggaataac acgaacactc 2880 ctaaacctgg tgaaaaggag gaagttacag tactttggcc acgtcactag ggcacagaac 2940 atctcgaccc acatactgca gggaaagatc aacggtacca gaagcagggg gagaccacga 3000 aggagatgga gtgatgacat aagggactgg acaggcagct ccctggcggt atgcacccag 3060 acggctaaga acagggagga atggcgaaga cttgtgctgg agacaaccat agtccccgac 3120 cctcctgatg aggatgggac taggtgaggt gaggtgaggt 3160 // ID BEL-598_AA-I repbase; DNA; INV; 6864 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-598_AA_; KW BEL-598_AA-LTR; Pao_Bel_Ele216; BEL-598_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6864 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5917-6474] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1014..3005 FT /product="BEL-598_AA-I_2p" FT /translation="MQSKTGTIPKLKTHVTYEQWRSRTTNLEMEKEHLKQN FT DIRRQRELELVNQLNRLEMQKDDGELRNLELIQQMKEHEQEKQRIIQQKQR FT ECIELEQQMHRQSQEHGNLIEQQQKELEHLRCVEREYRNYTQNQENRFKHP FT ASMTVNRGDGISNLRPLTGLPISSIPNSVNLREHIELPRQEDIWYISGYQQ FT KSLPAFGNISTHSSGFPVSDALQSGTLPPSHDVRFRDRSSNVPEINTPVGG FT RSNFDAECRSTDNFSVRHNQLQYPVFNSPVSAPPVLNNVVIPPQLAPTPQQ FT LAARQVVNRELPIFAGDPIDWPLFISSYNHSTQACGYSDSENLLRLQRCLK FT GSAKEAVSSFLLHPSTVAQVISTLQLLYGRPEQIVQSLIAKVRNTPAPKNE FT RLDTLVSFGLVVQNLCGHLKAIGLENHLSNPVLLNELVEKLPTAVKFNWAL FT HQRQLPVVDLSAFGEYMANVVSATSCVVSWNGGASKGIKDDRARGKDRAYV FT NAHSNPGSSEGRREYTGESRENIVAQKEPSNTSGNTPLCPSCKNRGHLIEA FT CGLFKRISIDDRWKVVKENRLCGRCLTSHTRWPCKGQVCGIDGCPRRHHRL FT LHDKNSSVPKQIDTGKNVTVSLHRQPSSFNLFRIIPVTLYGRKGKLDTYAF FT LDDGSSVSLIEDKIA" FT CDS 4279..6864 FT /product="BEL-598_AA-I_1p" FT /translation="MEVHDKAGFHIRNWISNKEDVMRAIGEVHPTVVKTLT FT LDQECGGERLLGMIWRPRDDIFSFTLNFREDLKDILAGDVIPTKRQLLRIV FT MSIYDPLGLVSTFVVHGKIMIQEVWRSKTGWDERIPTSILVRWQQWLICLQ FT GMEEIEVPRCYFSGYERCSFDTLQLHIFVDASAEAYAAAAYFRIVDRGVVR FT CALVSSKTKVVPLQPLSTPRAELLAAVLGARLRKTIEEGHSLKVHRTYFWC FT DSSTVHTWITSDLRRYRQFVALRVNEILNLSKPEEWRWVPTNQNVADEATK FT WGKGPSFTSDSRWYRAPRFLYESEEFWPGDQEFRITETDEEVRPAYLNKHS FT IMQPLVLVERFSKWERLLRCVAYVIHFVKRIGPQRKTATSNSTGNLSKEEL FT IEAERVPWRSAQNEAYPDEVDILKSNKEASREQLKGVERTSSLVKLSPMMD FT EYGVVRINSRLSAAEFLPFDTRYPIILPKDSYITRLVVDWYHRQHKHANDE FT TVINEIRQKFHISKLRGRLSFVKKNCNWCRVYKAVPIIPRMASLPPERLTP FT FVRPFSYVGIDYFGPYLVKIGRSSVKRWVALFTCLTIRAIHLEVAATLSTD FT SCKKAIRRFIARRGAPLEIFTDNGTNFVGAGKELANEIMKLNVELSSTFTD FT AHTQWRFNPPAAPHMGGCWERMVRSVKVALGSVPTLRKLDEESLSTMLAEA FT EYMVNSRPLTFVPLATESEECLTPNHFLMMSSKGVKQPVKNPVGEGAALKD FT SWSTLQMVLDHYWKRWLLEYLPTITRRSKWFQDVRPIKVGELVVIADEKLR FT NSWTRGRVIRTYAGQDGVVRKADIQTNTGVLQRAVAKLALLDVGRFNDAEA FT ELPSSSGGG" XX SQ Sequence 6864 BP; 1981 A; 1489 C; 1725 G; 1660 T; 9 other; gaaaactcga agaawttaca ttggaatgtc caacccacca cgactttcaa tgattcgcca 60 gacgcgatcg aaaacacgag ccgcgaagga gagtgtaaat cccaacctcc aagttgagac 120 ggattcactg gaggaaaaac actcgtcaaa ttcgtttgag ccgtctatca tcgatgtcga 180 gggaggtgat aagattgaat gcggtggttg taacaaaccc aataatgccg aactattcat 240 ggtccaatgc cggtgctgtt cccgttggta ccatttctct tgcgccaacg tcaagcaatc 300 gacggttcgg aaggaagcct ttgcgtgcaa attgtgcgtt ctaccagcga gttccagaac 360 aagtcgcacc actacgtcca gtgtacgaca ggcgaggatt gcccgggagc tagagcacct 420 tgaagaggag aggaagcttt tggagaggat gcacgaggaa gaattagcag gaaaaggcta 480 ggaaggataa gttcgagcgc gaaaascaat acatcgccaa aaggttcgat ttgcttcggc 540 agcaagatga gcaagaaatt gctagttcag ctagtagccg tcgtagtcag cacagtcgcc 600 cggcaagcca aaatcgaatt cagaactggg tagacgatgt cgctagggtt gcccatggag 660 tcggtggact gttgcccgga aataccgtca tcgttaatga aagggatttc gatccaaatg 720 tgattgttgc caatgaaggt cacgcatcat caacacccgt acgtacgccg ttgcctaata 780 cwgccagcgg tttggtccag ccagcaccgg tacggttacg tttgaatgat aagagccgag 840 aattagcttt cgttccgcca accacgggaa gtattgagat cggagagaca gatgaggacg 900 aatcaaaaga tatcggagct atcggtggat ccaatcgaaa aaggatctca cctgcgctat 960 cgttcgttga tgtggtagga cccctcgatg atctcttgaa caaaccggca gagatgcaat 1020 cgaaaaccgg tacgatcccg aaattgaaga cccacgtaac ctacgaacaa tggcgttctc 1080 gcaccacaaa tctcgaaatg gaaaaggaac atcttaaaca aaacgatatt cgtcgccaac 1140 gtgagctgga gttagtaaat cagttaaatc gactggagat gcaaaaagat gatggggaat 1200 tacgaaattt agagctgata caacaaatga aggaacatga gcaagagaaa cagcggataa 1260 tccaacaaaa gcaacgagaa tgcattgagt tggaacaaca gatgcatcgt cagagccaag 1320 agcacggcaa cctcatcgag cagcagcaga aggagttgga acacctacga tgtgtggagc 1380 gagaatacag aaactacacg caaaatcaag agaatcgatt taaacaccca gcgtcgatga 1440 cagtgaatcg gggcgacgga atttccaacc tgcggccgtt gactggtctt cccatctcat 1500 cgataccaaa ttcagtaaac cttcgtgaac acatcgagct tcctcgccaa gaagacatct 1560 ggtacataag tggctatcag caaaagtcgc ttcctgcatt tggtaatatc tcgactcatt 1620 ccagtggatt tccggtaagt gatgctttgc aaagtggtac tctgccgcct tcacatgatg 1680 tccgttttcg cgaccggagc tccaacgtac ctgaaatcaa cacgcctgtg ggtgggcgct 1740 caaacttcga cgcagagtgt aggagtactg ataacttttc cgttaggcac aatcaacttc 1800 aatatccagt gtttaattcc ccagtttcgg ctccccctgt attgaacaat gtagttattc 1860 ctcctcaatt ggcaccaact ccccagcagt tggcggccag acaagtggtt aatcgagaat 1920 tacctatttt tgcgggtgac ccaatagatt ggccactatt tatcagtagc tacaaccatt 1980 caactcaagc ttgtggctac tcagattctg aaaacttgct tcgtcttcaa aggtgcttga 2040 aaggaagtgc caaagaagca gttagcagct ttctgctaca tccctccact gtggcgcaag 2100 tgatttccac tttgcagctt ctgtacggaa ggccagaaca aattgtacag agcttgatag 2160 ctaaagttcg taacacaccg gcgccgaaaa acgaacggtt ggatacgcta gtgagtttcg 2220 gtttggtggt gcaaaattta tgcgggcatc tgaaagctat tgggctggaa aatcatcttt 2280 cgaatccggt tctcctaaac gaacttgttg agaaactacc aacagcggtg aagttcaatt 2340 gggctctaca tcaacgtcaa cttccagtag ttgatctgag tgctttcgga gaatacatgg 2400 caaacgttgt gtctgcaaca agctgtgttg tgtcttggaa tggaggtgca tcgaaaggga 2460 tcaaagatga tcgagctagg ggaaaagacc gggcgtacgt caacgcacat tcaaacccag 2520 gatcgtcgga agggcgacgt gaatacaccg gagagagtcg cgaaaacatt gttgctcaaa 2580 aagagccatc caatacttca ggaaatactc ctttgtgtcc atcatgtaag aatagaggcc 2640 acctgataga agcatgcgga ttgttcaaaa gaatctctat cgacgatcga tggaaggttg 2700 tgaaggaaaa tcgactgtgt ggccgatgtc tcacttccca tacacgttgg ccgtgcaaag 2760 gtcaagtatg tggaatagat ggttgcccaa ggcgtcatca tcgacttctc catgataaga 2820 actcatcagt gccaaaacag atagataccg gaaagaatgt gaccgtatcc ctccaccgtc 2880 aaccatcatc attcaacctt ttccgkataa ttcctgtgac actttacggt agaaagggaa 2940 aattggacac atatgccttc cttgacgacg gttcgtctgt ttctctgatt gaggacaaaa 3000 tagccgawgc acttggcttg gaaggcgtcg ctgagtccct ttgtgtccat tggactggtg 3060 gaattaagaa gaactattcg aatactcgga tggtttcact acaagtatct ggcgaaggaa 3120 tcgataagcg ttatcaaatg tcggaagtat acaccgtgcg caatctgggt ttacctgagc 3180 aaacaatgaa gatccgtgaa ttagccgatg agtttgagca tcttaagaac ctaccggttc 3240 gcgatttgga ttctgctgtt cctggcatac tcatcgggca caataatgtt catctactgg 3300 catgcctcaa attgagggaa ggacgagagc aggagccgat tgccacgaag acccgtttgg 3360 gctgggttgt ttatggaagt cgacgagcag cagagatcaa ctttccacac cgkcagatgt 3420 atatctgcac gaagcaagat gatcaaagct tacacgatta cgttcgtaag tttttttgcs 3480 ctggaaagct taggtgtcgc agtggttccg gaagtgaaag gatcggaaga acagcgagcg 3540 ttgtctattc tcaacgagtc aacggtgcgg acatctagtg gtcgtttcaa aactgccctt 3600 ttgtggaagt acgatgaatt cgagtttccg gacagcaaat tgatggcggt taaaagattg 3660 gagtgtttgg aacgaagact gcgcaaaaat ccagatttgt acgacaatgt tcgaaaacaa 3720 attgcggaat atcaagaaaa aggatattct cataaagcta cagaagaaga actgcaacga 3780 ttcgacccta gacgaacgtg gttcttaccc ttaggggtgg tattgaaccc caagaaaccc 3840 aacaaagtgc ggctcgttcg ggacgccgct gctacagtgg acggagtatc gctcaatacc 3900 atgttgatga aaggtcctga cctgttaacg cctctattac atgtattgtt tacctaccgt 3960 gaacgagagg tggccatcac agcggatgtg aaagaaatgt tccaccaact tctggtgcat 4020 gaagaagacc gcagtgctct gctgtatttg tggcggaatt cgactgaatc ggctttggac 4080 acgatggtta tggatgtggc aatatttggc gcctcgtgct cacccacgca agcacaattc 4140 gtcaagaacc taaatgctaw ggaatacgaa caccagtatc ccagggcggc gattgcgata 4200 actcggcgac actatgtgga tgactattta gatagtgtcg attccgtaga tgaagctgta 4260 gaactcatta cacaagtcat ggaagttcat gataaggctg gcttccatat ccggaactgg 4320 atctctaaca aggaggacgt tatgagagca ataggwgagg ttcaccctac agtagtgaag 4380 actttgactc tagaccagga atgtggcggc gaacgattgt tgggaatgat atggcgaccg 4440 agggacgata tattttcatt caccttgaat tttcgcgaag atttgaaaga cattctggcc 4500 ggtgacgtca ttccaacgaa acgacagttg cttagaatag ttatgagcat ttatgaccca 4560 ttgggactag tttcaacctt cgtagttcat ggaaagataa tgattcaaga agtatggcga 4620 agtaagacag gttgggatga aaggattcct acaagcattc tcgttcgctg gcagcaatgg 4680 ctgatttgtc tacagggaat ggaagagatt gaagtgccac gttgctattt ttccggctat 4740 gaacgctgca gctttgatac gctacaattg cacatattcg tggacgctag tgcagaagcc 4800 tacgctgcgg ctgcatattt tagaattgtc gatcgcggag ttgttcgctg cgcgttggtg 4860 tcatcaaaaa cgaaagttgt acctcttcag cctttgtcaa caccacgtgc agaattactt 4920 gcagcagttc taggagctcg cctacggaag acgattgaag aaggacactc tttgaaggta 4980 caccgaacat atttttggtg tgattcctct actgtacata cgtggatcac gtcggacctg 5040 cgacggtaca gacaatttgt ggcgctaaga gtcaatgaaa tcctgaactt gtccaagcca 5100 gaggaatgga gatgggttcc aaccaatcaa aacgtcgccg atgaagcaac taaatggggg 5160 aagggacctt cgtttacctc tgatagtcgt tggtatcgag cccctcgctt tctgtacgaa 5220 tccgaagaat tctggccagg agatcaggag ttccgtataa ctgaaacaga tgaagaagtc 5280 cgacctgctt atctaaacaa acactccata atgcaaccac tcgttctcgt tgagcggttt 5340 tctaagtggg agcgacttct cagatgcgta gcctacgtga tacattttgt caaacgaata 5400 ggacctcaac ggaaaacggc tacttcaaac tctacaggaa atctgtcaaa ggaggagctt 5460 atagaggcgg aaagagttcc gtggagatcg gcacaaaacg aagcgtatcc cgatgaggtc 5520 gacattttaa agtcaaataa ggaagcttca cgtgaacaat tgaagggagt ggagagaact 5580 agttcgctag ttaagctttc gccaatgatg gacgagtacg gagttgttcg cattaatagt 5640 cgcttgtctg ctgctgaatt tctaccattc gatacccggt atccgattat tctcccgaag 5700 gacagctata tcacccgcct tgttgtagat tggtaccacc gtcagcacaa acatgctaat 5760 gatgaaacag tcataaatga gattagacaa aagtttcaca tttcaaaatt aagaggacgg 5820 ttaagtttcg tcaaaaagaa ttgcaattgg tgccgtgtct acaaagcagt accgataata 5880 ccacggatgg cttcgcttcc tccagaaagg cttactccct ttgttcgtcc attttcctat 5940 gtaggaatag actacttcgg gccctatttg gtgaaaatag gaagatcatc agtgaaaaga 6000 tgggtggcac ttttcacctg cttgacgatc agagccattc atttggaagt cgctgctaca 6060 ttgtctacgg attcgtgcaa gaaggcaatt cgtcgattca ttgcccgacg aggtgcaccg 6120 ctggaaattt ttacggataa cgggacaaac ttcgttggag ccggcaagga actggctaac 6180 gagataatga aactcaacgt cgagcttagc agtaccttca cagatgctca tacgcagtgg 6240 agattcaacc ctccagcagc tcctcatatg ggtggatgct gggagaggat ggtgcgctcg 6300 gtcaaagtag cccttgggag cgttccgaca ttgcgaaaac ttgacgagga gtccttgtca 6360 accatgctag cggaggcaga atacatggtt aactctcggc cactaacatt tgtacccctt 6420 gccactgaat cagaagaatg tttgactcca aatcactttt tgatgatgag ttcaaaaggt 6480 gtcaaacaac ctgttaaaaa cccagttgga gaaggagctg cgttgaagga tagctggtcc 6540 acattacaaa tggttttaga tcactactgg aaacgttggt tgttggagta tctaccaacg 6600 attacaagaa ggtcaaaatg gttccaagat gtgcggccga tcaaggtagg tgaacttgtg 6660 gtgattgcag atgagaagct tcgaaatagt tggactcggg gacgtgtcat tcgaacatat 6720 gcaggacagg atggtgtagt gaggaaagcg gatattcaaa caaatacagg agtcctgcaa 6780 cgtgctgtgg cgaaactcgc gctgttggat gtcggacgat ttaatgacgc ggaagcggaa 6840 cttccgtcgt catccggggg agga 6864 // ID DNA8-66_AP repbase; DNA; INV; 578 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-66_AP. XX NM DNA8-66_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-578 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2001-2001 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 578 BP; 193 A; 72 C; 76 G; 237 T; 0 other; cagggctcgg aagttgtagc aaatgcatat ttttcttggt tcgtcctaga acatccaaag 60 taagatacaa atatgtatca tacttctctg atgtcataca cgaaatttta ttttgctatt 120 ttttgcatat ttacggtttt tattgctatt ttgctaattt agggtttata gtgctatttt 180 taggcatatt ttcttaatat acataatata tacctacata ttatacaaaa tataggaaat 240 tatcacatta tgttgaaaat aatgattaat ttggttagga aataggaaat aaatagatct 300 tataagtaca cttttacaaa catatttaag taatttaata tatattttgt taacggttta 360 taattcaact agtccagaca gtccctgatg tctatgagcc caacaatttt ataaaattgt 420 ataattacat ttaacttttt aaaataaatt taattttttc aatttaaatg catatttttt 480 gattttttgt gctatttata gcgcatattt tggagttttt aagtgcatat ttaagagcta 540 aaaagtaaca tttttagtgc tacaacttcc gagccctg 578 // ID EnSpm-N1_AAe repbase; DNA; INV; 1396 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous EnSpm DNA transposon family from Aedes aegypti. XX KW EnSpm; DNA transposon; Transposable Element; nonautonomous; KW EnSpm-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1396 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1284-1284 (2011). XX DR [2] (Consensus) XX CC 2-bp TSDs. Both termini are 66-69% identical to those of CC EnSpm-1_AA. XX SQ Sequence 1396 BP; 424 A; 242 C; 249 G; 480 T; 1 other; cccgcataat catgaaccat actagtaccg tttcccgttt tgtccataac aatttgctac 60 aatatgtgaa caatttcgta cagtatcaaa aataacaata aatctaccct accgcaaacg 120 ttgtttacag tttgatagat ggtaactgag aaaataacaa tatgaggaat aggcggttaa 180 gttatgtaac catttgaaac cgacgatttt cgcgaacaaa atttttcgtt tacagtttac 240 caaacgttag atatattatc cattttttgt acaaatcact gttatgcaaa cagtttgaaa 300 actgtataac tataaaattg gaataaaaaa aatactgtat gtgtaatgat tgattcacgg 360 ttttgtatgg tactttaacc gcttactgta ctgttctttc actgtcatac aatataataa 420 gtaataggtt catgtaggtt atttttgttt aattgtttat tattgaaaaa tatttggttt 480 tctaaaactc actttatcta ccctgaagtg ccatgcgatg gtttatagaa aatatcaaaa 540 tacagttcca acaccagtcc tataatttct cccaactcct ccgccatcca tgtgtcttag 600 atccggaaaa ggcatttttc tttcatcagc ttgatgaaaa gtgaatctac tccgtagatc 660 gtagaacttc cattgccttc gatcaccata gaaatgttgt cataccggca ccgacagtag 720 ttcgtggttc catgagggtt tttgtgctgc catgccacct ggaatgacga tgagtatcag 780 aacttcagac gggatttacg gcatatgtgg ctattcgata gagttgcggg tttaaactta 840 cctggcgcac ggaattccat gcgttttcga cgctgccgga ttgtttgatt tcacggagtt 900 tcaggtgttt cagaaaatca gcggttacta aataaaaaat aaactttggc tgttgtggaa 960 atatgacaca acgatgaaaa tgatcaacga tgcactaaat atgttcagta cgaatcgctc 1020 acaaatatca acatgtttga ctgtttgcca aattgtgaaa caaaagtaac atattgtttt 1080 aattttcgtg tgtgcgtgtg ttgccatgtt tcttgcgatt atttgaaaat ataatttgag 1140 aaaaggtgat caaattgttg cacaatttct ctcaagtttc tgtcttcgtc tcgttcttcg 1200 tatggtttta caatttcttg acacttttac atattgtaac cgttcgctaa gaactctgca 1260 aagtgttctt aatgatttta agctttttga ttgatattgt tcgacaaaat agcaactatt 1320 tgggataaat taaccktttc tgttaaaaga agcatactgt ttggaagcta aggataattg 1380 ttattttcta tgcggg 1396 // ID DNA8-19_AP repbase; DNA; INV; 373 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-19_AP. XX NM DNA8-19_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-373 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1761-1761 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 373 BP; 119 A; 68 C; 63 G; 122 T; 1 other; caagcccgga ttaagggggg gtccagaggg ggccatggcc cccgggcctc gatagttcga 60 atttattnag gggcctcaga cttgtccatt acttttttcg ttataggcta taacactgca 120 taaatcacct taaaaaaatc gatgattatt tagcgaagaa aaaagcttat acattataat 180 ttataaataa aatgatacat tatttattgc caagtaccta atatctatat atatttatta 240 taacaggtac ctaacattaa ataatatata cattataaac ataacgtaac aagctattat 300 tttttttttt tcgtacaggg gcctctttta aaactttggc ccctgggcct cgaaatgtct 360 taatccgggc ttg 373 // ID Gypsy-220_AA-LTR repbase; DNA; INV; 1917 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-220_AA_; KW Gypsy-220_AA-I; Gypsy-220_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1917 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1044-1044 (2011). XX DR [2] (Consensus) XX SQ Sequence 1917 BP; 532 A; 384 C; 449 G; 552 T; 0 other; tgtaaatatt attttaagat ttattatagt attatagaaa tgtagataca tattgttgcg 60 tatacagtca tttatttcct tcaatgtaat ttgttgttcc gtgtttaatt tatttttcca 120 tttcaaacga taattctttg ataatttact aaatccaata agttcactaa atatttgcaa 180 actcagtatc accacttcta aatgacatgc attattctca aatccgcata acttgcatgc 240 aagttgcctt caaagttata gcaggataac tatcgttcac acgccgccgt agaatctgaa 300 ttagctcgag gttcaaagtt caaatccgaa ccattctctc tttttatatg accggcaagc 360 gaagataaaa caaaacgaga caagaccatc tagaaggtat gcgtatgaaa gggatgtagt 420 tttgggcaag aacgatggga agattgatga gaatcgattg tttgaaaggt cgtcgaacta 480 cttcgggagt ttgacgcact tccggccacg aagaccgtaa cattcaaccc tatctgagga 540 tttcagggac ttctactcgt tcactttgac ttgaaccgtg gaagaggcga gagtgcgcgt 600 agttagtact taagtgtacc ttgtgtcgcg gtgtgtttaa tgtggtatgt gatttatcaa 660 acgtgagaag tctttaagat taagttattg ttcaattctt gttcctttcc gtgtagcctt 720 agtcaacgat aggtcaaaaa gttacgtgcg gtctattttt ttgtgtgcgt atagtgtata 780 aagaaggtaa attgaatcgt cttttttttg tgtggaaatc attaattaat tattaattaa 840 ttgcgtcgaa aaagccgatc gacggcttct tgtgtgaaat cgcttctagc gatagaaaac 900 ctgctcggca caccgcccgt tttcgtcaag cgcattttcg accccacgct tcaacgtcga 960 aggtcccggg tcgggggaga ttccggaagg gagacgttta tctccgacag agcgaaaggt 1020 ccaacacgct attttccgcc acgtgtggac cattgaacgg atcgacggtc cccacgtcga 1080 ttatttgcac caaccgtcac ccgcgtgcaa atcgcagcag aagtcgagaa gccaactggg 1140 agtgcaacgc gcaatacttc gtccggcgag atgctgcttg gccgacgacg ataatgcaga 1200 acacggtacc gtgagtagaa caaaaagcat gcttatttcc acacaccaac actgagctcg 1260 aagacagacg ataaacctta gcaatcgcta gaaccgttag ctaggaaacc gcacacaagt 1320 taggctagga agcagaaaga gagagagaga gagagagaga gagagagggg gagaggaaag 1380 gaaccaccat gaataaaccc tagggtacaa atccatgcaa taaatgttta aggtctcgtt 1440 ttttcctgaa taaatgtgtt tctgttagtt tagttcatgt agcaataatt ttggtccgaa 1500 ggatttctca tgtttttcgc gtcgttcttg tagttttttg ttgattgttt tactccaggt 1560 tgggctgcct gggaaatcag cctcagcact cacgtgtcgg aaacggacga gaagatttga 1620 acagtgagtt cctcccgtga tgagaaatac gtgagtggtg tttttgtggt ttttcggtag 1680 aacgacggtt agcaccagtt ttcccttcag gaccgtttgg agcatgatct ttttcagccg 1740 tactcgatag gtggtcagaa tcgttccgat attagctaat aacctaaaaa cccgccctga 1800 ggttgggttt ctgccgggca acctaaactt gacggacagt cctctttgag gtggcgctta 1860 agctgtccgt tttttcgagc tagaccacgt gtagagaacg agcggagcgt agccgca 1917 // ID Gypsy8-SM_LTR repbase; DNA; INV; 969 BP. XX AC Contig1154; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8-SM_LTR; KW Interspersed repeat; LG_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-969 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 760-760 (2007). XX DR [1] (Consensus) XX SQ Sequence 969 BP; 376 A; 137 C; 161 G; 293 T; 2 other; tgttatgtac aggctcttag cctaaaaata ttaagtcata ctgacgtcac taaacggtga 60 cgtgtaaatt aatgtgatga cgtttagtga cgtcacggaa aaaattataa aatacgtcac 120 agaaaattac taaaattgga aatacgtcat cagaaaattg ctaaatttgg atttctgatg 180 acgtggctct aaatttagta attaatttaa ttcaaaatta taaataaggt aattttatta 240 ttttttaatc aatataaata gaagagtatt tttaagtacg gggccgataa caaacaatag 300 cgaaaaactc attaagccga tttaatcaga agagaaaaac waattcaaga aaaagaaact 360 gtttgacaaa ttgattgtat tattgaaaty tacctgaaga gaaaaacaaa ttcaagaaaa 420 agagagtgtt tgacaaatcg attttattac tgaaatccac ccgaagagaa aaacaaattc 480 aagaaaccag ctgtcctact cgaaacacac aactctaacc ttaggaaata tatttaaatc 540 gctcctactg aaagaaggaa aaaaaaggaa aaaacagaac agaaaacaat tttggaagca 600 gttattgtgg aaatttaaca attggcaaaa tttggatacg aattatcaag tggaaaattt 660 tgtcaatgat atttgaggaa attaatttat ctagattttg gatactctat ccagtggcga 720 tattttggta gtttttattt tatttaatta aaataacatt gaaaaaagcg gattccctaa 780 tccggcttat aaataattat aattaatttt taaaatcaag cagtatccaa agtagttgac 840 cgctttcgga tattcggggt gccccaaaat caaaaactta cttaccctct gttttttagg 900 ataagtcgtt gcgtctgcat agtgaccatc gacttcttgc gggccatttg gcagggatca 960 tttgatcca 969 // ID Gypsy-46_AA-LTR repbase; DNA; INV; 1325 BP. XX AC supercont1.286; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-46_AA_; KW Gypsy-46_AA-I; Gypsy-46_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1325 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.286; Positions 1282780 1284104. XX SQ Sequence 1325 BP; 370 A; 310 C; 285 G; 360 T; 0 other; tgtaacctgt tcttctcgct tgatggccgt ggcgggcaat cgttgttccg tgcaccgagc 60 ccttagtggt gctctcagcc tcgccaatca acaaagaagt tttggattat taataccgtg 120 tcaaatcact acactaacca acagaaacaa cagaacaaga tcttcgggtt gatcgttatc 180 aagtgtcgat cgaccgcgcc gcatgcctca ttggatgctc cgctgggcgc aaactgctta 240 atgcacagac aaattaaaac ttgattatcc ccatagcagc attttataaa taccacccac 300 tcattccact aatagccaat caatcattga agcaccataa aaacccatgc accggtatca 360 ggaacactac ctgcactcgt caacttcaga gagatcaata ttctcccgtt atcagagatc 420 ccgatcaacc atgagcacca ctagatcgtc tatcactcac gagtccactc acttagtgcc 480 gataaaacta atgcacaaga ggaggagaac caacagttat cggttaatct ttaagcagac 540 cgcgtgaaac ccagtaaagt gtgtagcaaa agtacgatag agaagtttgg aattgttgtt 600 caattattgg tctgaccgat gtgagttggc ctattgtgtg agttgttcgt ttttatccta 660 aattcttgat tccttcttag agcgaagtga acaaacaatg gttaaaattg caaaaatgat 720 gtgcagtgtg gtgcaatggc ggtagagcga taagcttctt cgagatcggt aagccattta 780 gacaaatgaa taccctaaac agcatcccaa agggtcatca gtattaatgt ttgaatacga 840 cgccttctgt aggcctttct ttaacaccag ctatgttcat ggtggtgcgt taacggtggt 900 gatgtcgtgc ggtggtattg ctatgggcgt gctaagcagc ggggtggccg gcatggcgcc 960 ggtagtcttt cccccaaact gtccggacag gctagacgtt ttgcagacgc cgagggatac 1020 acgaagaata cactgtcatc actccaacac aatttaggat aagctgcatc taggataagt 1080 accacaaata aaatgtaagt acactctaag tctaaatgat cacctccact ttattttgtg 1140 aaactggaat ttctcttttg acttgggatg gctcgttaac attcagcgtt ggttccgctc 1200 ttctcgaacc tggttctagc tacccattaa gtatttggat cgggggatca atactcataa 1260 gaaccgttct aaattactta actttgtctc cgagtctcat gtgccctctg cggagctggc 1320 ccaca 1325 // ID I_Ele4C_AAe repbase; DNA; INV; 5674 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele4C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5674 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1372-1372 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 370..1665 FT /product="I_Ele4C_AAe_1p" FT /translation="METDGDAVSIVYGDSKESSSTIKPIRIKTYPSTFMGP FT YVVFFRKKEKPINVLLISSEIYKIYKTVKEIKKISLDKLRVIFGSREDANA FT LLESKLFLNSYRVYAPCNSCEINGVIYDESLECDEILNHGSGIFKNKAILP FT VKILECVRLSKLLFSDKGSSYTHSNCIKITFEGSVLPDFVVVDNVKFHVRL FT FYPKIMHCDRCLLFGHTSHFCSNKQKCSKCGGIHSPSDCNKISDICIHCGK FT KHNFLKECAVFIAHQKQFNLKIRNKNKLSYSEVIKTSNAFSSNNIFEPLTQ FT KVDIEDSNEEHNFVYNPPTKRKRINKTNNQNNIFDPQPSTSYEKNFPSINL FT SKSQNIPGFQKINQDYFGNKNDDIKKTTNNSHVQNDDGGSILKILEDIIEF FT LGLNDFWKKMIKKCLPFVASILEKLNSFGPLISSLFCS" FT CDS 1668..5360 FT /product="I_Ele4C_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MAQIKNSSLNILQWNCRSVIPKIERLKALIVNNDIDI FT FCLNETWLVENXFFRIPSFNIIRKDRNMAYGGVMIGIRQNIEFKFLNFSLD FT SPIEYVAITVKHNGIEFSILCLYIPPQASFSLLDLKTILNNIPSPFYILGD FT LNAHNLAWGSEFSDGRGSLIMDLIDELNLNILNDGSFTRIAVPPAHHSCID FT LSLCSSSLAIKSTWKIIDDPNGSDHLPIQISIHFPLRDQHHQEPFVPDLTK FT NIDWSKFSDLVSTALINFDYSLSPLQNYNRFSVILIDCLQKSQTKKIFLGP FT TKRRHNSFWWDHDCTVALKNKSEAFKNFRNSGTRNNYIFYRKTEAQFTRIV FT KFKKRNYWKTFIENLDSETSLNKLWSVARNLRNYNIPSTSVLEYSENWIDQ FT FSSKICPDFVPTPIIFKTHQLYNYYPDLCNEFSIEEMELALSVTKNTAPGI FT DNIKFIVLKKLPIDGKLHLLSLYNSFLFQNVFPSEWRFIKVISLLKPDKNP FT SLVESRRPISLLSCLRKLMERMILNRLELWAEKNNIFSSSQYGFRKGRGTR FT DCIALLASHIELSFNKKQDVVSTFLDVSGAYDSVLIDLLFNKMNDCKIPII FT ISNFICNLFSFKIMHFFHNGSSRMVRYSYFGLPQGSCLSPFLYNLFTRDII FT SIIPNGCYFIQFADDNVISVNGHSREVIRHFMQNCLDNLHSWAHNNGFTFS FT VQKTKFILFSRKHSPVNIDLYLSNQQIEQVIDYKYLGLWFDSKLKWNSQIK FT YIQKICSKRINFLRMITGTWWGAHPNDMITLYKTTIRSVMEYGCFTFGSAA FT QIHFSELEKIQFRCLRICLNLMNSTHTKSIEVLAGIIPLKYRFHELNCKFL FT IHCFSMDHPLIGTLKSLFEINPSNRILKSYIYCSRENIVSNVSPHFYEYSI FT AAHSFQPCVDLSLHEELKQIPNHARSRYANLLFKRKFIGVDNDQIFFTDGS FT LIENVAGFGVFNFHLAHFYKLESPCSIFTAELTAIYFTCKLIENYSPNIFM FT VCSDSLSCLQALNCISFNFKTHHIVLSIKGLLNDLYSKGFIIKLVWVPAHC FT NIYGNEQADLLAKLGVFRGIIYNCDIYYSEYFTNLKKYSMNDWQISWNTSD FT KGRYCYSICPKVKIFPWFRSLPVGRNFICSYSRLMSNHYICSNHLYRMNIT FT DSNICECNKSYEDIDHIVFECSRFNAPREKFLDRIISLSHDIPVSVRDILG FT NHSLPVMRILYKYLNEISYRV" XX SQ Sequence 5674 BP; 1821 A; 811 C; 887 G; 2152 T; 3 other; tcatcttcgg taagtaggcc ttgaccgggc wacaggtttt tttttctgct cttgtcattt 60 ttttttcttt ttcaggtgaa ttttcattcc gaagtattgc agcttgaagg cgattgttac 120 tacgtttttg atgtttgaag agaaggagaa acaagtggat tgcgaagtcg ttctgaaggt 180 tcaaattgca agactcctca caaacgttgg tgttcccggc ctttgacgtt tttcttggtt 240 gctgttgctg gtgacgctgg acttgtatgg tgtttgttaa gtattaattt tctattttaa 300 ccgttcatta ttgatattac tgtttcttgg tgttgatttg ggttttgttc accccgtcat 360 ttttccatta tggaaactga cggggatgca gtttctatag tatatgggga ttcgaaagaa 420 tcttcatcta caatcaaacc tattcgtatc aaaacttatc cttctacatt tatgggacct 480 tatgtagtgt ttttccggaa aaaggaaaaa cctatcaatg tccttttaat atcatcagaa 540 atttacaaaa tttacaaaac tgttaaggaa atcaaaaaga tttccttaga caaattgcgt 600 gtsatttttg gatctcgaga agacgctaat gcgctattag agtccaagtt atttttgaat 660 tcttatcgag tttacgctcc atgtaactcg tgtgaaatca atggtgtcat ctatgatgaa 720 tcattggaat gtgatgagat tttgaatcat ggttcaggaa tatttaaaaa taaagcaatt 780 ttgccagtta aaattttgga atgtgttcga ttatcgaaat tattattttc agataaaggt 840 tcctcatata cacattcgaa ttgcatcaaa atcacatttg aaggatctgt tcttcctgat 900 tttgttgttg ttgacaatgt aaaatttcat gttaggctat tttaccctaa aattatgcat 960 tgcgatcgtt gtcttctttt tggacacacg tcgcattttt gttccaataa acagaaatgt 1020 tcaaaatgtg gtggaattca ttctccatca gattgtaaca aaatttctga catttgtatt 1080 cattgtggca aaaaacataa ttttttgaaa gaatgtgctg tttttatagc tcatcaaaag 1140 caattcaatt tgaaaataag gaataagaac aaattatctt attctgaagt tattaaaaca 1200 tccaatgcat tttcgtcaaa taatattttt gaaccattaa ctcaaaaggt tgatattgag 1260 gattcaaatg aggaacacaa ttttgtgtat aatcctccta ccaaaagaaa aagaattaat 1320 aaaacaaaca atcaaaacaa tatttttgat cctcaaccat caacatctta tgaaaaaaat 1380 tttccttcaa ttaatttatc aaaatctcaa aacattcctg gttttcagaa aattaatcaa 1440 gattattttg ggaacaaaaa tgatgatatt aaaaagacta ccaataattc tcatgttcaa 1500 aacgatgatg ggggtagtat tttgaaaatt ttagaagata ttatagaatt tttgggttta 1560 aatgattttt ggaaaaaaat gataaagaaa tgtttaccat ttgtagctag tatactagag 1620 aaattgaatt cttttggacc cctcattagt tccttgtttt gttcctaatg gctcaaatca 1680 aaaatagtag tttgaatatt ttacaatgga attgcagaag tgttattcca aaaattgaaa 1740 gactgaaagc tttaatagtc aataatgata ttgatatatt ttgtttgaat gaaacatggt 1800 tagtggaaaa tawttttttt agaattcctt cttttaatat aattcgaaag gatcgtaata 1860 tggcatatgg aggtgtaatg atcggaattc gtcagaatat tgaatttaaa tttttaaatt 1920 tttcattgga ctcaccaatt gaatatgttg ctattactgt caaacataat ggaatagaat 1980 tttcgatttt atgtttatat attcctccac aagcaagttt ttcattatta gatttgaaaa 2040 caattctaaa taatatacct tctccatttt acatacttgg tgatctgaat gctcataact 2100 tggcttgggg tagtgagttt tctgatggca gaggttcatt aatcatggat ctgattgatg 2160 agttgaattt aaatattctg aatgatggct cattcactag aatagcagtt ccccctgctc 2220 atcattcatg cattgattta tcattgtgtt caagtagttt ggccataaaa tcaacttgga 2280 aaattataga tgatccaaat ggtagtgatc atttacctat tcaaatttct attcattttc 2340 ctcttcgtga tcaacatcac caagaaccat ttgttcctga tttaacaaaa aatattgatt 2400 ggtccaaatt ttcagattta gtttctactg ctttaatcaa tttcgattat tcactttctc 2460 ctcttcaaaa ttataacagg ttttcagtca ttctaataga ttgtttacaa aaatctcaga 2520 ctaaaaaaat atttttgggg ccaaccaaaa gaagacataa ttctttttgg tgggatcatg 2580 attgtactgt tgcacttaaa aataaatctg aagcctttaa aaattttcgt aattcaggta 2640 ctaggaacaa ttatattttt tatcgtaaaa ctgaagccca gtttacacga attgttaagt 2700 ttaaaaaaag gaattattgg aaaactttta ttgaaaatct tgattcagaa acgtcattaa 2760 ataaattatg gtctgttgct agaaatttaa ggaactataa tattccttct acctctgttt 2820 tggaatattc tgaaaattgg atagatcaat tttcttcgaa aatttgtcct gattttgttc 2880 ctacccccat catatttaaa actcatcaat tgtacaatta ttaccctgat ctttgtaatg 2940 aattttcaat tgaggaaatg gaattagcat tatctgttac taaaaatact gctccaggta 3000 ttgacaatat aaaatttatt gtgttaaaaa aattacctat tgatggtaaa ttacatttac 3060 tatccctata taattcattt ttgtttcaga acgtttttcc ttctgaatgg cgattcataa 3120 aagtaattag tttacttaaa cctgataaga atccttcatt agtagaaagt agaagaccta 3180 ttagtttatt atcatgtctt cgtaaactaa tggaaagaat gatacttaat cgtctggagt 3240 tatgggctga gaaaaataat atattttcat catctcaata tgggtttaga aaaggtcgcg 3300 gaactagaga ttgtattgct cttctagctt cacatattga attatcgttc aataaaaaac 3360 aagatgtagt ttctactttt cttgacgttt ctggtgcata tgattcagtt cttattgatt 3420 tacttttcaa caaaatgaat gattgtaaaa ttccaatcat tatctccaat ttcatatgta 3480 atttgttttc cttcaaaata atgcattttt ttcataatgg gtcttctaga atggtccgtt 3540 atagttattt tggcttacca cagggttctt gtttaagtcc ctttttatac aatttattca 3600 ccagagacat catttccatt attccaaatg gatgttattt tatacaattt gcggatgata 3660 atgtcatttc tgttaatggt catagcagag aagttattcg tcattttatg caaaattgtt 3720 tagataatct tcactcatgg gcacataata atggttttac tttttcagtt caaaaaacaa 3780 aattcatatt attttctcgt aaacattctc cagtaaatat tgatttgtac cttagtaatc 3840 aacaaattga acaagttatt gattataaat atcttggtct atggttcgat tcgaaattaa 3900 agtggaatag tcagattaaa tatattcaaa aaatttgctc gaagagaata aattttcttc 3960 ggatgataac tggaacatgg tggggtgcac atcctaatga catgatcaca ctctataaaa 4020 caactattcg ttcagtaatg gaatatggtt gttttacttt tgggagtgct gcacaaatac 4080 atttttccga actggaaaaa atacaatttc gttgcttgag aatttgttta aatttgatga 4140 attctacgca tactaaatct attgaagtcc ttgctggtat tattccactc aaatatcgct 4200 ttcatgaatt gaattgtaaa tttttgatac attgtttttc aatggatcat cctttaattg 4260 gtacactaaa atctttgttt gaaatcaatc cttctaatag aatattgaaa tcatatatct 4320 attgttctag agaaaatatt gtatcaaatg tttctccaca tttttatgaa tatagcatag 4380 ctgctcattc ttttcaacca tgtgtggatt tatctttaca tgaagaatta aaacaaattc 4440 ctaatcatgc tcgttcccgt tatgccaatt tattatttaa acgtaaattt attggggtag 4500 acaatgatca gatttttttt accgatggat ctttaattga aaatgtagca gggtttggag 4560 tgtttaattt tcatttggcc catttctata aattagaatc tccttgttca attttcacag 4620 ctgaattaac agctatatat tttacatgca aattgattga gaattattct ccaaatatat 4680 ttatggtgtg ctcagatagc cttagttgtc ttcaagctct gaattgcatt agttttaatt 4740 tcaaaaccca tcatattgtt ctatcaatta aagggttatt aaatgattta tattctaaag 4800 gatttataat taaattggtt tgggttccag ctcattgcaa tatttatggt aatgaacaag 4860 ctgatttgtt ggcaaaattg ggggtttttc gtggaataat ttataattgt gatatttatt 4920 attcggaata ttttactaat ttgaaaaaat attctatgaa tgattggcaa atttcttgga 4980 atacaagtga caaaggtcga tattgttatt ccatatgtcc gaaggtaaaa atatttcctt 5040 ggtttcgcag tcttcctgtt ggacgtaatt ttatttgctc ctattctaga ctaatgtcaa 5100 atcattacat ttgtagtaat catttatatc gcatgaatat cacagattca aatatttgtg 5160 aatgtaataa atcatatgaa gacatagacc atatcgtttt tgaatgctct cgttttaatg 5220 cacctagaga aaaattcctt gatagaataa tcagtttaag tcatgatatt cctgtgtctg 5280 ttcgggatat tctgggaaat cattctcttc ctgtcatgag aattttatac aaatatttga 5340 atgaaatttc ttatcgtgta tgatactgct cgtttgtttt ttttctttga ttttattttc 5400 agagacatca agtttggtac tcatccatct aatcgattgg cttctgaatg cgtgctagag 5460 aatattcgtg atgactgaac ttggaaaact ttggctccgc tatggatcga ttccgggaga 5520 gcctttattt aataaatatt attttataac gttattcgaa aagataaaga ggttttgtgc 5580 ctttttgaga aagatttcat tagaatatca ctcaaagggg ctttttcctc tttcaaaatt 5640 cataagttaa aaataaataa ataaataaaa taaa 5674 // ID Gypsy-17-I_HM repbase; DNA; INV; 4087 BP. XX AC . XX DT 03-FEB-2009 (Rel. 14.02, Created) DT 03-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-17-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4087 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 406-406 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 20..3475 FT /product="Gypsy-17-I_HM_1p" FT /translation="MMYLANISSFDLENDDFVEYIERFENYLLANNIQEAE FT LQKAVFLSTIGGPAYKLLRSLCENDTKNKSFTQLIKLMRDHLKPTPNFIAQ FT RFQFYKRDRKEGESVNGYITELRRLSEHCEFSEKLNDYLRDRFVCGLNNEN FT VQQKLLTIKNLTLETALDTARAYEAAYKDAKILRGTREGHIEQEEVHKMDT FT RTRFEKNRECFRCGYMGHQANNCHYRNSKCHLCGRIGHIKRRCQSEKKEFE FT GRKEKKLGVKQVEIREGSAKGNTADTDENDSDFLALYSLGEESERVNGPVM FT VNVKINGKEVGMEVDTGAAVSVMAVSAYRRVKGNKGKLRRSEVVLKTYTGE FT LVRPEGIGLVEVDYKGQCCSLPITVVKGNVPTLMGRDWIYRLNLEWADLCK FT GIKKVNICNRLDSRVEALVAKFPEVFSDNLGCLKDFKCHIPVREGAQPKFF FT KPRPVPYALRTRIEQELDRLENQGVWRRVEYSQWAAPIVPVLKNSKDPTGP FT LRICGDYKITINQAAPLDTYPIPNTTDQLATIAGGQKYTKLDLSQAYQQLE FT LDETSQEFLTINTHQGLYQPTRLQFGVHSATGIFQREMDRRLGRLPFVKVR FT VDDILISGKSDIEHLNNLESVLRILKESGLTLKASKCSFMQPEVVFCGFII FT SQEGCRPTTQNVEAVMDAPRPTXIKELRAFLGMANYYNAYLPRMASVTEPL FT HNLMRKNVFWKWSRDSEEAFQKVKTMLCNAPLLAHFDPSKKIMVHCDASPH FT GVGAVLSQQQDDGREKPISFASRTLNMAERNYAQVEKEGLALVFAVKKFHQ FT YLYGHKFTLYTDHKPLLGLFSENKELPARAAARVLRWALLLSAYDYKLLYC FT PGEKNAAADGLSRLPLDASREKSRLKTMEVAMMELVKAPITEKQLRVATYN FT DPILGVVLNKVLDGGLMMEESKVELKPYTSRFPELSTEGGCLLWGRRVVVP FT RVLRETVLEELHEVHPGVSKMKALARSYVWWPGIDLEIENKVKNCETCQRN FT QKCPLTESHPWEYPSRPWERLHIDHAGPMNGKIFLVVVDSFSKWIEVEVVG FT STEAKTTIRVLRRLFSTHGIPRVIVSDNGSGFSSEEYKQFLSSNNIKPIYA FT APYHPASNGQAERMVQTFKNSLKCFQGSDVEAQLCRFLFLNIA*" XX SQ Sequence 4087 BP; 1417 A; 631 C; 926 G; 1107 T; 6 other; attggcgacg aagataaaaa tgatgtattt ggctaatata tcgagttttg atttagaaaa 60 tgacgatttt gttgaatata ttgaaagatt tgaaaattat ttacttgcca ataacataca 120 ggaggcagaa ttacaaaaag ccgtttttct atcaactatt ggaggacctg cttacaaact 180 tcttcgtagt ttatgtgaaa atgacactaa aaataaaagt tttactcaac taataaagtt 240 aatgagagac catttaaaac caaccccaaa cttcattgca caacggtttc aattttataa 300 aagagacaga aaagagggag aatcagtgaa tggatatatt actgaattgc gcagattatc 360 agagcattgt gagtttagtg agaagttaaa tgattactta agagatagat ttgtgtgtgg 420 attaaacaac gaaaatgtac agcaaaagtt attaaccata aaaaacctta cattagagac 480 agcattagac acagcaagag catatgaagc ggcatataaa gatgctaaga ttttacgtgg 540 tactagagag ggtcatatag aacaagaaga agtgcacaag atggatactc ggacaagatt 600 tgagaaaaat agagaatgtt ttcgatgtgg ctatatgggt catcaggcaa acaattgtca 660 ttatcgcaac tcaaagtgtc acttatgtgg gagaatagga catataaaaa gaagatgtca 720 atcagaaaaa aaggaatttg agggtaggaa ggagaagaaa ttaggagtga aacaagttga 780 aattcgtgag ggttcagcta aaggaaacac tgcagacaca gatgaaaatg atagtgattt 840 tttagccttg tattcattag gtgaggagtc tgagagggtg aatgggccag ttatggttaa 900 tgtaaaaatc aatggtaaag aggtaggaat ggaagtggac acaggagctg ctgtttccgt 960 aatggctgtt tcagcttata gaagagtgaa aggaaataaa ggaaagttaa gaaggtctga 1020 agtagtgtta aagacttaca caggagaact tgtaagacca gaaggaatag gacttgttga 1080 agtagattat aaaggacaat gttgcagttt acctataact gtagttaagg gaaatgtacc 1140 tacattaatg ggaagagatt ggatttatag actaaacctg gaatgggcag atttgtgtaa 1200 agggattaaa aaagttaaca tttgcaatag gttggattcc agggtagagg cattagttgc 1260 gaaatttcca gaggtgttta gtgataattt agggtgctta aaagatttta aatgccacat 1320 acctgtacgt gagggtgcac aaccaaagtt ttttaaacca agaccagtac cgtatgcttt 1380 gaggacaagg attgaacaag agcttgatcg cttggaaaac caaggggttt ggagaagggt 1440 tgaatattca caatgggcag cacctatagt gcctgttcta aagaattcaa aagaccctac 1500 gggcccatta cgtatatgtg gggattataa aattacaata aatcaagctg cccctctgga 1560 tacctatcca atcccaaaca ccacagatca gttagccact attgcaggtg gacaaaaata 1620 caccaagctt gacttgtccc aggcatatca gcaacttgag ttggatgaaa cttcacaaga 1680 gtttctaaca ataaatacgc atcaaggttt atatcagccg actcgccttc agtttggtgt 1740 tcatagtgca actggcatct ttcaaagaga aatggataga aggttaggga ggttaccatt 1800 tgtaaaagtw cgagtggatg atatactcat ctctggaaaa tcggacatag agcatttaaa 1860 caatttagaa tccgtattaa gaattttaaa ggaatcagga ttaactctta aagcatctaa 1920 gtgctctttc atgcaacctg aagttgtatt ctgtggtttt ataatcagtc aagaaggttg 1980 cagaccaact acacaaaatg tggaggcagt tatggatgct cctcgaccca ccawtatcaa 2040 agagctaaga gcatttttag gaatggctaa ctattacaat gcctatttac ctagaatggc 2100 ttctgttaca gagccactcc ataatttgat gaggaaaaat gtattctgga agtggagtag 2160 agacagtgag gaggcttttc aaaaagttaa aactatgtta tgcaatgcac cattgttagc 2220 tcactttgat ccatcgaaaa aaattatggt tcactgcgat gccagtccac atggcgtggg 2280 agctgtgctg agtcaacagc aggatgatgg aagagaaaaa cctattagtt tcgcttcaag 2340 aacattaaat atggctgaac gaaattatgc ccaagttgaa aaggaaggat tagcattagt 2400 ttttgcagta aaaaaatttc atcagtattt atatgggcac aagtttactt tatacactga 2460 tcataaacct ttgctggggc ttttttcaga aaacaaggaa cttcctgcaa gagctgctgc 2520 aagagtattg cgctgggctc tgctactgtc agcatatgat tataaactcc tatattgtcc 2580 tggtgaaaag aacgcagctg cagatggttt aagtcgttta ccattagatg catcgagaga 2640 aaagtcacgg ttaaaaacca tggaggtggc tatgatggag ctagtaaaag cccctattac 2700 agaaaaacaa ttgagggtag ccacatacaa tgatcccata ttgggagtgg ttctaaataa 2760 agtgttagac ggaggattga tgatggaaga gagtaaagtg gaactgaaac catacacatc 2820 caggttccct gagttgtcaa cagaaggggg ttgtttgttg tgggggcgaa gagtagtggt 2880 gccaagagta ttaagagaaa cagtattaga agagttacat gaggttcatc caggagtaag 2940 taaaatgaaa gctttagcta gaagttatgt ctggtggcct ggaatcgatt tggaaattga 3000 gaacaaagta aaaaattgtg aaacctgtca aaggaatcag aagtgtccat taactgagtc 3060 tcatccatgg gaatatccaa gyagaccatg ggagagatta cacattgatc atgcaggtcc 3120 aatgaatgga aaaatattcy tggtggtggt tgatagtttc tcaaagtgga ttgaagtaga 3180 agttgtgggc agtactgaag ccaagacaac aattagggta ctcagaagat tgttttcgac 3240 ccatggtata cctcgagtta ttgttagtga taatggatct gggttttcga gtgaagaata 3300 taaacagttt ttgtcttcaa ataatattaa accaatctat gcagcaccct atcatccagc 3360 atcgaatggt caagcagaac gaatggttca aacatttaag aactcgttaa aatgttttca 3420 aggaagtgat gtcgaggcac aattgtgccg ctttcttttt ttaaatatcg cttaacacca 3480 cattctacaa ctggagtttc tccggctgaa cttttattag gtagaagatt aagaaatcca 3540 ttgtcgatgc tactccctga ggtcatcacc aaaatcaacg aaaagcagct tcccggtatt 3600 tttagtaata aaagtcgttt ctttcaacca gaagaccctg tgtacgtaag gaattatagt 3660 ggaggtgaga aatgggtctc agctattatt gtctcaaaag tcggaaatgt gaactacaaa 3720 gtattgacat tggatggtcg tatccagatt agacatgtag accaaattgt gaaacgtcat 3780 gtaaaagatt gcattgaacc tttagttaaa acagcagatg gcataattgg tcctttgtta 3840 agcaataaaa tacctgaaac atcagtcccg agtgatagca tgtatgatga atgcaaccca 3900 attaacaagg atattgaacc caacagagca cacgagcagc twgaaaataa agacgttagt 3960 cacgatggat cttgtgaaca acctgaaccg ttttctgagc ccattactac aaaaaccyga 4020 aggtcaggtc gaacctgtcg taaaccagca tatttagaac agtatgaatg atttatgggt 4080 ggaggat 4087 // ID Copia2-LTR_Dpse repbase; DNA; INV; 494 BP. XX AC Unknown_group_59; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2_Dpse; KW Copia2-I_Dpse; Copia2-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-494 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1025-1025 (2009). XX DR Genome; Unknown_group_59; Positions 73234 72741. XX SQ Sequence 494 BP; 146 A; 94 C; 95 G; 159 T; 0 other; tggagcacat cagcttgcta gaaaccccat gtttcatgca cgtacaaagt acatcgacgt 60 caacatacat catattcgag aggttctcaa gagaggtgaa gtcaacattg catatctgcc 120 aaccgatgag atgatagcag atgtcctcac aaagaactta gccaaggtta agcaccataa 180 atttgtattc gcatttggtt taaaagatcg tttataagtg cttgacatgt tattgaatat 240 tagttaagta tatttaaagt ttgtaaacat gcgttgtgtt gaggaggagt gttcaaagtg 300 accgccacga taattagata ttgatctgta acaacactgc gcatgtgccc tgctgctgtt 360 cgatgttctc atatcgatgt tctgtatatc gagtttccct tttcggtgtc tctctcttgc 420 gtatgttact cggaagcaca cataattctt tatactgata ttctgctgtc ttacaaataa 480 aaataacttt aaca 494 // ID BEL-623_AA-I repbase; DNA; INV; 6301 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-623_AA_; KW BEL-623_AA-LTR; Pao_Bel_Ele155; BEL-623_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6301 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5344-5901] - Integrase core CC 'ACGG' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 625..2190 FT /product="BEL-623_AA-I_1p" FT /translation="MAASAKLSAILNDDRTASGFNEVVQTEVALTIGAEGN FT IIDATAQGMLLNQTSVEPALGGFIDRTASMLGSIVEIDPNTRNDIPSGVRY FT STVRYPPYHQPLTTESHTLQIPKIQSSLTQRPFSSDKATLPAKPSHIIPPS FT FSRNQAVTSNSNTVVIPSGGVNPANIGQRVKSVTVNSELSHQHRSPADKPA FT YESVHSRRSQTGAAYRVDSENRERLGPQHSVLPTLPVGSIPDVQRGSERQY FT SVAATPAQTWGPTPQQLAARHVMSKELPVFSGNPEDWPLFISSYNNTTQAC FT GYSEAENLARLQRCLKGHALESVRSRLLLPQGVPYVIATLETLYGRPELLI FT HTLLQKVRGVPAPKHDRLDTLIGFGMAVQNLCDHLEAGGQEAHLKNPMLLF FT ELVEKLPANMKLDWSLYKQRCGEVSLREFAQYMSTLVRAATDVTLHYGPRP FT TPPQQRETRPEKGSRDKNFCAAHSEGPSKLKKPGKDDVMDQSTKHTPACLI FT CHDPKHRVKECTSFLTKVWMSAGRSS" FT CDS 3094..6300 FT /product="BEL-623_AA-I_2p" FT /translation="MAVKRLQCLERRMRSDPVIGDNVHRQIQEYQTKGYIH FT KATKEELQEADPRRVWYLPLGVVLNPKKPSKVRIFCDAAAKVDGISLNTML FT LKGPDLLSSLFGVLCGFREKRVALCADVKEMFHQIRIRKDDRHAQRLLWRD FT DPTQAPEIYLMDVSTFGATCSPCSAQFVKNVNAAEHAKEYPLAAEAIQRKH FT YVDDYLDSADSEKEAIHIAEEVRHVHSLGGFHLRNWLSNSKAVLARVGESD FT SATEKSLQLDRGSSTERVLGLFWRPENDDFTFSTDLVTHVELPTKRQALSI FT VMSPFDPLGFLSFFLIHGKILIQKMWRAKIKWDELIPETLRKQWTDWMEHF FT KHLDKISISRCYFPSHSVKDILSLQLHVFVDASEDAFACVAYLRAEFANEI FT DVAVVAAKSKVAPLKSLSIPRLELMAAVIGARLQSTIIEGHSLHIDQVVLW FT SDSKTVLAWIYSDHRNYRQFVACRIGEILSKTRECQWRWIPSKENVADDAT FT KWGKGPCMSPNSRWFRGPEFLCLSEENWPAGIEAEAHQTDEELRSCLVHRE FT VIVPHRMDWKRFSQWTRLWRATAYVHRYVGNLKRKVKQNELQVGPLTQTEL FT AAAETTIWRWVQNEVFPDEVATLSGERSEQPVRQAKLEPTSKLRKLSPYMD FT VNGVIRMESRIAAATFASFDTRFPIVLPKDHAVTALLIEWYHKKYLHRNGE FT TVVNEVRQRFHIPNLRTVVRKEKKKCVLCKLKLATPAIPRMAPLPAARLKA FT FERPFSYVGVDYFGPIAVRVNRSNAKRWIALFTCLTIRAVHLEIVHSLSTE FT SCKMAFRRFIARRGAPVEVYSDRGTNFVGCSNELEREKKAIDQQLAETFTN FT ANTKWIFNPPAAPHMGGSWERLVRSVKTALAAMYTSRIPTEETLATLVAEA FT ESIVNSRPLTFIPLEREQQEALTPNHFLLLSSNGVAQTSKTLVDSREACRN FT QWNLYRAMVDQFWRRWVREYMPDLTRRTKWFEDVPAIRPGKLVIIVDDQVR FT NGWIRGRVVQVVKGCDGRIRQAVLQTATGLVRRPIAKLALLDIAESKAAPE FT KPEQLTGRGDVT" XX SQ Sequence 6301 BP; 1761 A; 1493 C; 1671 G; 1374 T; 2 other; acggtgttgc tgctcaagga attccgaata cccccacgcc aacaaaacta aagaatttgc 60 cgtcatgccg atggctacac gcagtgctcc aggacgtagg ccgcgctgtc tggcttgcaa 120 cgggcctgat acgatgcgaa tggtagcatg ttgccaatgt gaaacttggt ggcactacga 180 gtgtgtgaat gttgatgaaa gcatagccgc accagatcgc acattcatct gtccgcaatg 240 tcaaagacct gcaccgaccg tcccgaatct gcaacacgcc gcgtctgaaa ccagtcatcg 300 atgcccctcc aacttgtccg gagccaggtc tgctgtttca tccactccca gcatgagagc 360 taggagagct cgccttcagt tggaaaagct cgaggcacaa aaagctctaa tggataaacg 420 tatagagcaa gcacgtcgag agcagctgat taggcatgag caggagaagc agatgcagga 480 agctgaactg gaccaaatac ggctgcaaat ggaggaaaac atcatcgagg aatcattccg 540 gtgccgagaa gaggaactgt tggaccaaga gagcgataga ggcagcattt catccgccca 600 gagcagtgtt agtaaggtgc gtgaatggca gcaagcgcaa aactatccgc gatcctcaac 660 gatgaccgaa cagctagcgg gttcaacgag gtggtgcaaa cagaagtagc tctcactatc 720 ggggcggaag gtaacattat agatgcaaca gcacaaggta tgttattgaa ccagacttca 780 gttgagccag cgttaggagg tttcatagat agaacagcta gtatgttagg gagcatagtt 840 gagatagatc ccaatactcg caatgatatt ccgagcggtg ttcgctacag cactgtgcgc 900 taccccccat atcaccaacc actaaccact gaaagccata cccttcaaat acctaaaatc 960 caatcctctt taacgcaacg ccctttttcg tctgacaaag cgacgttacc cgcgaaacct 1020 tcccatatca tacctccctc gttcagtcga aatcaagcag ttacgtcgaa cagtaacacg 1080 gtagtgattc ccagtggtgg agtaaaccca gctaatattg gccaacgggt taaatcagtt 1140 acagtgaata gtgaattatc tcatcaacac cgaagtcctg cagacaaacc ggcatatgag 1200 tcggtacatt cgcggcgttc tcagacggga gcagcatacc gagtggattc agagaatcgg 1260 gagagactgg gaccacaaca ttctgtatta ccgacactcc cagtgggctc gattcccgat 1320 gtacagcgag ggtcagaacg gcagtactct gtcgccgcca ccccagctca aacttgggga 1380 ccaacacctc agcagttggc agcaagacat gtgatgtcga aggagctacc agtcttctct 1440 ggcaacccag aggattggcc cttattcatc agctcctaca acaatacaac ccaggcatgt 1500 ggttactcgg aagcggagaa tcttgccagg ctacagcgat gccttaaagg acatgcgttg 1560 gaatcagtgc gaagccgtct attgctgcca caaggtgtac cgtacgtcat cgctacactg 1620 gaaacgctat atgggagacc ggagctgttg attcacaccc tgctgcagaa agtccgagga 1680 gttccagcgc cgaaacacga cagattagac acgctgattg gtttcggaat ggcagtgcag 1740 aacctttgtg accatttgga agctggtggt caagaagccc atctaaagaa tccaatgctt 1800 ctctttgagc tagtggaaaa actgccagcc aacatgaagt tggactggtc attgtataaa 1860 cagcgttgcg gagaggtgag ccttcgagaa ttcgcacaat acatgtccac tttggtgcgt 1920 gccgcaacag acgtaacatt gcattacggt ccgaggccaa cgccgccgca acagcgggaa 1980 acaagaccag aaaaaggcag ccgtgacaaa aatttctgcg cagcacactc agaaggtcct 2040 tcgaagctaa aaaagccggg gaaagatgat gtaatggatc aaagtacaaa acatactcca 2100 gcgtgcctca tttgccacga tccaaagcat cgagtaaagg agtgcacaag ttttctaaca 2160 aaagtgtgga tgagcgctgg aaggtcatcc tgaatttcgg gttgtgtagg aattgcctag 2220 gttctcacgg gcggcgaccg tgcaagatat acaagcggtg tgaagtcgat ggatgtcaac 2280 taaaacacca ccccctacta cattctaagc aatacaagca agaagtaaag cgagatcata 2340 aggaggaccc gatgccggag tccaaggcag tagccaatca ccattatgcg ggaaagacaa 2400 ctctcttccg cgtcattccg gtgacgcttt tcggaaacaa tcaatccgtt tctgtgtatg 2460 cttttcttga tgatgggtca gagcgaactc tggtggagca agaaatcgtg gataagctgg 2520 gagtaaccgg tgaacatctt ccactatgtc tgcaatggac agctgaggtg aaacgaacgg 2580 aaaacgagtc gcagcgagtt gcgttgcaaa tctctggggt caacggaatg aaacatgcac 2640 tgtcggacgt tcgaacggtc aaaaagttga atttaccacg acaatcatta caccatgcaa 2700 agctggcgga agcatacccg tacttgcatg ggttgcctat tcaagattac gaggaggcgt 2760 tacctcgcat cttgataggt aatgacaacg ctcatgttac atccacactc aaactgcgcg 2820 aagggaagcc cggtgagcct attgcagcaa aaacccgatt gggatggacc gtctatggaa 2880 cgaaccttaa gcaatcggaa caacaggcac actctttcca catctgtgaa tgtcggaatg 2940 aggacacact acatgactta gtgactcagt tcttcagcgt cgaatccttg ggaattacgg 3000 ccatcgcatg cccggagtct gacgaagtgc aacgagccaa cgaaattctc aaagcaacca 3060 ctaagcgagt tggcaagcgg tcgaaacaac cctatggcag tcaagcgtct gcagtgcttg 3120 gaacgacgca tgagaagcga tcctgtgata ggcgacaatg tacatcggca gatccaggag 3180 taccagacaa aaggctatat acataaggca acgaaggagg aactacagga ggccgacccg 3240 cggcgtgtat ggtacttgcc actcggagtg gttctgaacc cgaagaaacc atcgaaggtt 3300 cgcatttttt gtgacgctgc agccaaagtg gacggaatct cactaaatac gatgctcttg 3360 aaaggaccgg atctgctaag ctcgctgttt ggagttctct gtggctttcg ggaaaagcgg 3420 gtcgcacttt gcgctgacgt gaaagaaatg ttccatcaaa tacgaatacg taaagatgat 3480 cgccatgcgc aaagactact ctggagagat gacccaactc aagctccgga gatctacttg 3540 atggatgtat ctacattcgg ggctacgtgc tcgccgtgct ccgcgcaatt tgttaaaaac 3600 gtgaatgccg ctgagcacgc taaggaatat cctcttgcag cggaagcaat tcaacggaag 3660 cattatgtag atgattattt ggatagtgca gacagcgaaa aagaggcgat acatattgcc 3720 gaagaggtgc gacacgtgca ctcacttgga ggtttccatt tgcggaactg gctatccaac 3780 tccaaagccg ttcttgcacg ggtcggggag agtgactctg caaccgagaa aagcctccag 3840 ctagacagag gaagctccac ggaacgcgtt ctgggattat tttggaggcc tgaaaatgat 3900 gacttcactt tctctacgga tctcgttacg catgtagagc tgccgacgaa acgccaagct 3960 ttgagcatcg ttatgagtcc gttcgatccg ttaggattcc tttctttctt cttaatccac 4020 gggaaaatct tgattcagaa gatgtggcga gcgaagatca agtgggacga gcttataccg 4080 gagacgctgc gtaagcaatg gactgattgg atggagcatt ttaagcacct ggacaagatc 4140 agcatttcac gctgctattt tccaagtcat tctgtcaagg atattttatc actgcaactc 4200 catgtcttcg tagatgcgag tgaggatgcg tttgcttgtg tggcatactt gcgggccgag 4260 tttgcgaatg aaatcgacgt agcagtagta gctgcgaaat ccaaggtggc tcccctcaaa 4320 tctttatcca ttccaaggct ggaactgatg gcggctgtga ttggagcgcg tctacagagc 4380 actattatcg aaggacattc tttgcacatc gaccaagtcg tcttgtggag tgattctaag 4440 acggtgttag cctggatwta ctccgatcat cgcaactacc gtcaatttgt tgcttgtcgg 4500 ataggagaaa tactgtcaaa aacacgggag tgccagtggc gatggatacc atcgaaggag 4560 aatgtggcag acgacgctac aaaatgggga aaaggaccct gtatgtcacc aaatagtcgc 4620 tggttcagag gtcccgaatt tctttgtcta tcggaggaaa attggcctgc aggcatcgaa 4680 gctgaagcac accaaacgga tgaagaactt cggtcgtgtt tggtacaccg cgaagtgata 4740 gttccacatc ggatggactg gaaacgtttc tcgcagtgga cgcgtttgtg gcgcgcaaca 4800 gcgtacgttc atcgctacgt gggtaacttg aaacgcaagg tgaagcagaa cgagttgcaa 4860 gtcggtccgt tgacccagac cgaactagct gccgcagaga caacaatatg gcgttgggtt 4920 caaaatgagg tgtttccaga cgaagtagct acactttcag gagaacggag tgagcaacca 4980 gtgcgacaag ccaaattgga gccaactagc aaacttcgaa agctgtcgcc gtacatggac 5040 gtaaacggtg taatacgcat ggaaagtcga atagccgcag ctaccttcgc ttcattcgat 5100 acccgctttc ccatcgttct gccgaaagat catgccgtaa cggcgcttct aatcgaatgg 5160 taccacaaga aatatctcca taggaacggc gaaacggtgg tgaatgaggt aaggcaacgt 5220 ttccacatac ctaaccttcg cacggttgta cgcaaggaaa agaagaaatg cgtattatgc 5280 aagttgaaac tagcgacgcc ggcaattcct cggatggctc cgctgcctgc tgctagactg 5340 aaggctttcg agcgtccatt ttcgtacgtg ggagtcgatt actttggccc gatcgctgta 5400 cgtgtgaatc gaagcaatgc caagagatgg atagcgttgt tcacgtgcct gacaatccgg 5460 gctgtgcatt tggagattgt acattctttg tccaccgaat cgtgcaagat ggcctttagg 5520 agatttattg cgcgcagagg agctccagta gaagtgtata gtgatcgagg aacaaatttt 5580 gtgggatgta gcaacgagtt ggagagggag aagaaagcta tcgatcaaca gcttgccgaa 5640 acgttcacga acgcgaacac caaatggatt ttcaatccac cggccgctcc ccacatgggc 5700 ggatcatggg agcgtctagt gagatcggtg aagacagctt tagcggccat gtacacttcc 5760 agaattccaa ccgaagagac cttggcgacg ttggttgcag aagcggagag tatagtgaac 5820 tcgagaccac tgactttcat cccactcgag cgagaacagc aggaggcgct tacgccaaac 5880 cacttcctgt tgttgagttc gaatggwgtt gcgcaaactt cgaaaacatt ggtggactcc 5940 agagaagcct gcaggaacca gtggaacctg taccgagcta tggtggacca gttttggcga 6000 agatgggtac gggaatacat gccggacttg acgcgtcgaa ccaagtggtt cgaggacgtg 6060 cccgctatac gaccgggaaa actagtgatc attgtagatg accaagtacg caacggatgg 6120 atccgaggac gagtcgtaca agtggtaaag ggatgtgatg gccgtattcg tcaagctgta 6180 ctgcagaccg caacaggatt agttcgacgg ccgattgcta agttggcgct cctggacata 6240 gctgagagta aggcggcgcc ggaaaagccg gagcagctta cgggtcgggg agatgttacg 6300 g 6301 // ID Copia-16_DPu-LTR repbase; DNA; INV; 190 BP. XX AC scaffold_31; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_DPu_; KW Copia-16_DPu-LTR; Copia-16_DPu-I. XX NM Copia-16_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 696-696 (2010). XX DR Genome; scaffold_31; Positions 930572 930761. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 190 BP; 62 A; 32 C; 46 G; 50 T; 0 other; tgttgaatat tgttgaatat aacaaggcaa caaactggga agagcggcaa cgctgcacag 60 gcagagaagg gaaaaatcca gaccttttct tgtacgtaga taagacggtg tggcgtatgg 120 tactcttgag agtaatacac agattccttt tgctacagtt tactcaaagt gtgttagttg 180 ataaccaaca 190 // ID BEL-185_AA-I repbase; DNA; INV; 1948 BP. XX AC supercont1.118; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-185_AA_; KW BEL-185_AA-LTR; BEL-185_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1948 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.118; Positions 1583505 1585452. XX CC 'GGGGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 49..1878 FT /product="BEL-185_AA-I_1p" FT /translation="MEVQCKTCKSPVLLSKCLTCAKCENHYHHECAQMNDS FT TPNLNWECDVCTPSTSGSQIILELHRLQNEVTLERKAIDKRFDMLQKHINI FT IKAEIEKLPQCLPSTAHKRFASNPDSPADDEMSNNESRTTQISPESGLQKM FT FARQVIPSDLPIFTGKPEDWPIFISCYRNSTIACGFTAVENLIRLQRCLRG FT PARQAVCSKLLLPECVDQVIGTLQLLYGRPELLISSLISKMHETPAPSDDD FT LNSLIVYGLAIQNLSDHMIAAGLNSHLNNPCLLQEMVNKLPTHLKLQWASH FT KRDNQDITIATFSTFMSGLVKAASQVTSPHITLDQNNHRDEYSDDSDDDGG FT SSRSSPNNSRREKRRRCYVCKEYSHRVSSCRKFASMHVSKRWELINEIKIC FT SCCLNRHLPWPCKTVRYCGINGCQMKHHPLLHSADNIQSTSGCTVNVGTDV FT HTLLKYIPVTLHGKSRSIDTFAFIDEGSSSTMLDSKLADELCIDGKDDTLS FT LRWIDGSVKTVKTRCINVGISDIDKTNRFMLKDVHTINHLDLPVQQIDALL FT WRSDHLKDLTPYTYQNVKPRLIIGLANAHLAAPSIVRVGKSSEPIVFKCPL FT GWCIYGSTKYNE" XX SQ Sequence 1948 BP; 606 A; 403 C; 409 G; 530 T; 0 other; ctaaagaatt tctcaaattg aagcggaatt ttggaacgaa atccgatcat ggaggtccag 60 tgcaaaactt gcaagtcacc agttctccta tctaagtgct tgacgtgtgc gaaatgcgaa 120 aatcattacc atcatgaatg tgcacagatg aacgattcga cacctaacct taactgggaa 180 tgtgatgttt gtacaccatc aacatctggc tcccaaataa ttcttgagct tcatcgattg 240 caaaacgagg taactctgga gaggaaagct atcgataagc gtttcgacat gttgcagaag 300 cacatcaaca tcataaaagc agaaatcgaa aaactgcccc aatgcctacc ttcgactgca 360 cacaaacgat ttgcatccaa tccagattct ccagctgatg atgaaatgag taacaatgag 420 agtagaacca ctcagatttc accagaatca ggattgcaaa aaatgtttgc tcgtcaagtc 480 attccatcgg atctaccaat tttcacagga aagcctgaag attggccaat tttcattagt 540 tgctatagaa actctacaat tgcttgtgga tttaccgccg tcgagaatct catcagatta 600 cagcgatgtc ttcgtgggcc tgctcgtcaa gctgtttgta gtaaactgct gcttccagaa 660 tgcgtagatc aagtaatcgg tacgttgcag ttactatatg gaaggcctga gttgttgatc 720 tcctctctca tctccaaaat gcacgagaca ccagcaccga gtgatgacga tctcaattcg 780 ttgattgtgt atgggttagc cattcaaaac ttgtcagatc atatgatcgc tgctggattg 840 aactctcatt tgaacaatcc gtgtcttctc caggagatgg taaacaaact cccaacacac 900 ttaaaactgc agtgggcttc tcacaagaga gacaaccaag acatcactat tgctactttc 960 agtaccttca tgtctggact ggtcaaagca gcaagtcaag tcacatctcc tcatatcact 1020 ctggatcaaa acaatcatcg agatgaatac agcgacgata gcgatgatga cggtggatcg 1080 agtagatcat cgcctaacaa cagtcgtagg gaaaaacgta gacgttgtta tgtatgcaaa 1140 gaatacagtc atcgggtttc aagctgtaga aaatttgctt cgatgcatgt aagcaaacgt 1200 tgggagctta tcaacgaaat taagatatgt tcctgttgcc tgaatcgtca cttgccgtgg 1260 ccgtgtaaaa ctgttcgata ctgtggcatc aacggctgtc aaatgaagca tcatcctttg 1320 cttcactccg ccgataacat tcagtcaacc agtgggtgca ctgttaacgt aggtactgat 1380 gtgcacacct tgttaaagta tatcccagtc acattacatg gaaagtctag gagtattgat 1440 acttttgcct ttatcgacga aggttcatcg agcaccatgt tggacagtaa gcttgcagat 1500 gaactttgta tcgatggtaa ggatgataca ctaagtctca gatggatcga tggttcggta 1560 aagactgtta aaaccagatg cataaatgta ggtatttcag atattgataa gacgaatcga 1620 tttatgttaa aggacgtaca taccattaat catcttgatt tacctgtaca acagatcgat 1680 gcattgcttt ggagaagcga tcatctaaaa gatttgacac catatacata tcaaaatgtt 1740 aagccacgtt tgataatagg cttagccaat gctcacttag cagcaccttc gatagttcgc 1800 gtaggaaagt ctagtgaacc gatagtcttc aaatgcccct tagggtggtg catatatggc 1860 agtactaagt ataatgaatg aaagcacgaa attgacgcag ataatttgta aaacaagaat 1920 taaaattatt gtgttacggg gtggagga 1948 // ID DNA8-44_AP repbase; DNA; INV; 675 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-44_AP. XX NM DNA8-44_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-675 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1974-1974 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 675 BP; 254 A; 80 C; 63 G; 278 T; 0 other; cagtgttagg cattatctag atacattttt cgtatatttt atctcatcta gataactttt 60 aaatgattta tctattatct tatctagata ataaaaacgg tctttttttt aaattatcta 120 gataattatt tactctatag tctatttttt tttttataca atatttattc acacaatttc 180 actgcattgt aataatatga cccgtggatc tacaaaaagt taatatagag aaaatataac 240 acaatataga gaaaaaaaat atttgttatc aacaatattc tttccgataa aaattaaaaa 300 ccaaaaacta ttattatggc atacaattac taatgacaaa attctaagga aaaaagtttt 360 ttactaaaat actatatctc tattcaatca taatgtgggc ttgaaagggt taaatgtttt 420 tttttatgta accaaatgta ccatagtacc aatctgttga ctgtttatgt aacataaaat 480 aaatgaaata tcaatgtaaa tttattatat ttttttttat aatcagctat cagctaaagt 540 ttttttttat aaattatcta agtattatct agataatttt atttgaatta atctattatc 600 tcatctagat gattatattg aaaaatatct attatcttat ctagataatt atttgattat 660 ctatgcctaa cactg 675 // ID Gypsy-25_DYa-I repbase; DNA; INV; 4923 BP. XX AC chr2h_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_DYa_; KW Gypsy-25_DYa-LTR; Gypsy-25_DYa-I. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4923 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2h_random; Positions 3470017 3474939. XX CC Positions [3801-4178] - Integrase core CC 'AAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 119..1132 FT /product="Gypsy-25_DYa-I_1p" FT /translation="MNKSEIADWLKANEIEFPTSATLRQLRKLALDAGCQN FT ASEISETISEEEDTKLEEICQGIASAQINSTLEEEEEALDAAIRVAEKKKI FT LAKLMQNDSKMTDDFQMAKQLMVPFSATQKEEAFQWISDFERVCRGVNDSE FT IFQLRCVRMLMKPGTDADLFLRVDRSNSYDTFKESFLKTFGHGNSTADIVL FT LLKDTIFNPSKNTVMGYILQMEEIAMRADIDEKLTVQFIIDGFRDRSSNIA FT LLYSATTIGQLKEMARKYENLRKVPKTSTIRTGIAFAGERGQIRCFNCSAH FT GHYASSCTAPKREKGSCFRCGSLQHRIKDCQQQPKTNPRVVGATGC" FT CDS 894..4178 FT /product="Gypsy-25_DYa-I_2p" FT /translation="MKIYGRFPKRRPFARESLSLEREVKYAALIVRHMDTT FT LHLVLHQSARKDPVSVVDPFSIGSRIVNSSRKLIRESWEQLVVDQLIRDEE FT ENNTFIPIFNQVGVYFCTDKRQNSNMTFLSAMFDTGSAINLIQRTAIPFKK FT FNDVLSPTEYCGVNGSRINTFGKIFVKLNFQNIEEEISFFVVPDNYVATHV FT LLGRKFLNKFGIKLIFKDKMRGRREIDLLNIDSDPLNIDSDPLNIDSDPFN FT IDSDLLNIDSDLLNIDSDLLNIDSDPLHRLKIELNRSNLDSEPVNRFHIDR FT EPVNIFDIDSDPASRSKHCKPINKLETRQRFQKCDFDKDIEIDREYKSLPY FT IYNIDIFKDDEDFNIDPKLNNSNRSELIEIIRINYLDLHNIPAIQHSYEMR FT LRLKSEEPVRSVPRRLSYQEKKEVDCKIDELLRKGFMRESNSPYSSAIELV FT KKKSGEKRMCVDYRQLNKITIRDGYPLPLIDDCLERLEGNQYFTLLDLKSG FT YHQVKVAESSVQYTAFVAPSGQYEYTRTPFGLMNAPAVFMRFINFILQPLI FT REGNVVVYIDDIAIGSKTLSEHFVTVGRVLRILAAHRLEIKISKCQFAYKS FT IEFLGYTLSEKGISPNHEHTSTIKHLPIPKDRHGMQKCLGLFSYFRRFIPA FT YSKIAKPLQELTKAGSKYELDENCINIFEYLREKLISFPILALYNPNRETE FT LHCDASSEGFGGILLQKQDDNKFHPIAYFSQRATKLEARYESFKLETLAVV FT YSLRKFRIYLEGIPFNIVTDCNAVVLCLGKGRLNSNIARWALELANYNYTL FT KHRSGKYMTHVDTLSRYPSAFNQDFEYEPVVTIQESENKDKIVSVIDSDDI FT NLQLQITQNRDSLIVKLKEKLEHRDVKKFVLSDGIVYRLGENDIELLYVPA FT EMENHVIRRVHENLCHLGTEKCIKEIMRHYWFPDMKSKIESFIGNCLKCIM FT FTVPTHANNRTLHNIPKRPIPFDTLHVDHFGPLPSVISQKKHLLVIIDGFT FT KFVKLFPVNTTSTKEVKTCFSKYFQYYSHPRRIISDRGTCFTSLEFAKYLL FT ENNIEHVKVATASPQANGQVERVNRTVKAMLAKITEPI" XX SQ Sequence 4923 BP; 1764 A; 774 C; 958 G; 1427 T; 0 other; acttcagaag tgggatagca accttttatg cacccgccta catcaatatt tcgacgtacc 60 cgacaataca cacgtttttt tttcacaaag ttaaaatagc gtgccctacc aattggccat 120 gaataaatcg gagattgcgg attggctgaa ggccaatgaa attgagttcc caacaagcgc 180 aacgcttcga caactgcgta agcttgcatt ggacgccggt tgccaaaacg ctagcgagat 240 ttctgaaacc atatcagaag aagaagacac caaactggag gaaatatgcc aggggatcgc 300 gagcgcccag atcaatagta cattggaaga agaagaggaa gcactagacg ctgcaataag 360 ggtggccgag aagaagaaga tactggccaa gttgatgcag aacgacagca agatgacgga 420 tgacttccaa atggctaagc aactcatggt tccgttctcc gccactcaaa aggaagaagc 480 attccaatgg atttcggact tcgaaagagt ttgcagagga gtaaatgata gtgaaatttt 540 tcaacttcgc tgtgtccgaa tgctgatgaa accaggaaca gatgccgacc tgttcttgcg 600 tgttgaccgt tcgaattctt acgatacatt taaagaaagt tttctcaaga catttggcca 660 tggaaattca acagccgata tcgtattgct tctgaaggat accatcttta atcctagcaa 720 gaacaccgtc atgggctaca tacttcaaat ggaggaaatt gctatgcgtg cagatattga 780 tgaaaagtta acagtacaat tcataattga cggttttcga gatcggtcat ccaatatcgc 840 gctattgtac tcagcaacca caattggaca attgaaggag atggctcgga aatatgaaaa 900 tctacggaag gttcccaaaa cgtcgaccat tcgcacggga atcgctttcg ctggagagag 960 aggtcaaata cgctgcttta attgttcggc acatggacac tacgcttcat cttgtactgc 1020 accaaagcgc gagaaaggat cctgtttccg ttgtggatcc cttcagcata ggatcaagga 1080 ttgtcaacag cagccgaaaa ctaatccgag agtcgtggga gcaactggtt gttgaccagt 1140 tgatacgaga tgaggaggag aacaatacgt ttattcctat atttaaccag gtaggagtat 1200 atttttgcac tgataaaaga caaaatagta atatgacgtt cctttctgcc atgtttgata 1260 ccggcagcgc cattaattta atccagcgca cagcgatacc atttaaaaag tttaatgacg 1320 tattgagccc aacagaatat tgcggtgtta atggatctcg aataaataca tttggaaaaa 1380 tttttgttaa acttaatttt caaaacatag aagaagaaat atcatttttt gttgtacccg 1440 acaattatgt agcaactcat gttcttttag gtcgtaagtt cttgaacaaa tttggaataa 1500 aattaatttt taaagacaag atgagaggaa gaagagaaat tgacttattg aatattgata 1560 gtgacccatt gaatattgat agtgacccat tgaatattga tagtgaccca ttcaatattg 1620 acagtgactt attgaatatt gacagtgact tattgaatat tgacagtgat ttattgaata 1680 ttgatagtga cccattgcat agattgaaaa ttgaattgaa tagatcgaat ttggatagtg 1740 aaccagtaaa tagattccat attgataggg aaccggtgaa tatatttgat attgatagtg 1800 atccagcgag tagatcaaaa cattgtaaac ccataaataa attagaaact agacagagat 1860 ttcagaaatg tgattttgat aaggatattg aaatagatag agaatataag agcttaccat 1920 atatatataa tattgatata tttaaggatg atgaagattt taacattgac ccgaagttaa 1980 ataatagtaa tcgcagtgaa ctcattgaaa taataagaat aaattattta gatctacaca 2040 acattccagc gattcaacat agttatgaaa tgaggctaag acttaagtca gaagaacctg 2100 ttcgctctgt tccgagaaga ctttcatatc aggagaaaaa ggaagtagat tgcaaaattg 2160 atgaattgtt aagaaaagga tttatgcgag aaagtaattc accttatagt tcagctattg 2220 aactggttaa gaagaaatct ggagaaaaac ggatgtgtgt ggattataga caactcaata 2280 agataacaat ccgtgatgga tatccattgc cgctcataga tgactgttta gaaagattag 2340 aagggaatca atatttcaca cttttagatt tgaagagcgg atatcatcag gtgaaagtag 2400 cagagagttc agttcagtac acagcgtttg tagcaccctc tggtcaatac gaatatacaa 2460 gaacgccttt tggacttatg aatgcaccag cggtatttat gagatttata aattttattc 2520 tgcaaccatt gatacgtgaa ggaaatgtag tagtttatat agacgatatt gctatcggtt 2580 ctaagacatt gagtgaacat tttgtaactg taggaagagt attaagaatt ctggcagcac 2640 atagattaga aattaaaata agtaaatgtc agtttgctta taaaagcatc gaatttttag 2700 gatacacact tagtgaaaaa gggataagtc cgaatcacga acacacatcg actattaaac 2760 atcttccgat tccaaaggat cgccatggta tgcaaaagtg tttaggatta ttttcttatt 2820 ttaggcgatt tattccggca tactcgaaaa tagcaaaacc attgcaagaa ctcacaaaag 2880 ctggctctaa gtatgaactc gacgagaact gcatcaatat ttttgaatat cttcgcgaga 2940 aattaatttc ttttccgata ttagctcttt ataatcccaa cagagaaaca gaattgcatt 3000 gtgacgcaag tagtgaagga tttggtggaa ttttattaca aaaacaagat gataataagt 3060 ttcatccaat agcttatttt tcacaaagag caacaaaatt agaggcaagg tatgaaagtt 3120 ttaaattgga aacattagca gtagtttatt cattacgaaa attcagaata tatctagaag 3180 gaattccgtt taatatagta accgattgta atgcagtagt tttatgccta ggaaaaggac 3240 gccttaattc taatattgcc agatgggcct tagaattagc taactataat tataccctta 3300 aacatcgcag tggaaaatac atgacccatg tggatacgtt aagtagatat ccttcagcct 3360 ttaatcaaga ttttgaatat gaaccagtag tgacaataca agaaagtgaa aataaggata 3420 aaatagtttc agtaattgac agtgatgaca taaaccttca gcttcagata acgcaaaata 3480 gagatagtct tatagtaaaa ttaaaagaaa aattagaaca tagagatgta aaaaagttcg 3540 ttttaagcga tggaattgtt tatagattag gtgaaaatga tatagaatta ttatatgtac 3600 cagctgaaat ggaaaaccat gtaatcagga gagttcacga gaatttatgt catttaggta 3660 cagagaaatg tataaaagaa ataatgagac actattggtt tccggatatg aagtctaaga 3720 tagaatcgtt tattggaaat tgtttaaagt gtattatgtt tacagtaccc acacatgcta 3780 ataatcgaac attgcacaat ataccaaaac gaccaattcc ttttgacaca ttacatgtgg 3840 atcactttgg accactgccc tccgttattt cgcaaaagaa acatctatta gttatcatcg 3900 acggatttac taaatttgtt aaattgttcc ctgtgaacac aacgagtaca aaagaagtca 3960 aaacttgttt ttcaaaatat tttcaatatt atagtcaccc acgtagaatt attagtgata 4020 gagggacatg ttttacgtcc ttggagtttg caaaatattt gttagaaaat aatattgaac 4080 acgtgaaagt agcaactgca tcacctcaag caaacggtca ggtggagaga gttaatagaa 4140 cagtgaaagc gatgttagct aagataactg aaccaattta gcatgaagat tggagtaagc 4200 ttcttgtaaa ggcagaatat gccatcaata attcagtaca ttcaacaacg aaagaattac 4260 cttcagtatt attatttgga gttcaacaga gaggttgtaa tattgatgca ttaacggaat 4320 atttagatga tagaactgat atggcacaat gcgatttaga aaacgttcgt aagaactcct 4380 taaataaaat agaacagtgt cagcgaagaa gcgaagaaca ttttcagaaa aatcataaga 4440 gccacgtaga atttttagaa ggcgattttg ttggccacaa tcggcacaaa taagaaattt 4500 ttacctaaat ataagggccc atacgtcatt aaacgaattt taccaaatga tagatacgtt 4560 ttaacagaca tagaaaattg ccaaattagt caaattccgt atggaggaat agtagaggca 4620 tgtcggttaa gaaaatgggc agattggcgt aaccagtgcg gaaatatctc ctaatttgta 4680 gaataatatc ttaacttatt ttctctcttt atatacctta tactgtacat attttcattt 4740 ttttattatt attatttctc ttttacttat attctattca aattttaata tatatttaac 4800 taaactttaa tatttaggta gccaaaactt caagacatgt aaataattaa aatcaacaat 4860 gtacatattg taaataattg aaatcgacga tcgaggacga tctaattgtc aggatggccg 4920 agc 4923 // ID ARS_EH repbase; DNA; INV; 126 BP. XX AC M55340; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE E.histolytica autonomously replicating sequence (ARS) repeat DE region. XX KW Satellite; Simple Repeat; ARS_EH; KW Autonomously replicating sequence; Repetitive sequence; KW tandem repeat. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-126 RA Lohia A., Haider N. and Biswas B.B.; RT "Characterization of a repetitive DNA family from Entamoeba RT histolytica containing Saccharomyces cerevisiae ARS consensus RT sequences."; RL Gene 96(2), 197-203 (1990). XX DR GenBank; M55340; Positions 103 228. XX SQ Sequence 126 BP; 84 A; 0 C; 11 G; 31 T; 0 other; aataataaag ataataaaga taataaagat aataataaag ataataataa agataataat 60 aaagataata ataaagataa taataaagat aataataaag ataataataa agataataat 120 aaagat 126 // ID Gypsy-626_AAe-LTR repbase; DNA; INV; 129 BP. XX AC . XX DT 27-MAR-2011 (Rel. 16.04, Created) DT 27-MAR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-626_AAe-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-129 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(4), 1426-1426 (2011). XX DR [1] (Consensus) XX CC Solo LTR. ~93% identical to consensus. XX SQ Sequence 129 BP; 54 A; 17 C; 23 G; 35 T; 0 other; tgttgtaata aagaataata taattaatat catcaaggta ggcagtgact tagggagaca 60 tgactgaata tgtaaaagga ctgacattca ataaataacc accaaactga aaagaagttg 120 tttccttca 129 // ID SMAR26 repbase; DNA; INV; 1188 BP. XX AC . XX DT 08-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR26. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1188 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1084-1084 (2007). XX DR [1] (Consensus) XX CC Individual repeats are up to 90% identical to consensus sequence. XX FH Key Location/Qualifiers FT CDS 507..1115 FT /product="SMAR26_1p" FT /translation="MEPFLDKLITGDEKWILYENIKKRKVYCKPGTSPATL FT AKPDIHQKKVLLCLWWDRKEPVYYELLKQGQTINANVYCNQLDKLNAAIKE FT KRPALASRKGILFHHDNARPHTAMVTQQKLTALGWEVLSHPPYSPDIAPSD FT YYLFRSLQNYLMGKNFNSFEGVSKAVAEYFQSKNENFYRSGIDRLPERWQQ FT VVTNNGNYIIEKN" XX SQ Sequence 1188 BP; 408 A; 196 C; 235 G; 349 T; 0 other; tatcaggtta ttacttgttt tattatataa aattaaattt ttaatagcgt tcgttagtta 60 cttgcgatat ttaaagtgaa aatgatgtca gtggataaag tgcatttaca acattgtatt 120 ttgtatgagt tcgagaaaaa aacaacgcgt cgattgcatg tgaaaacatt tgtgcagtat 180 ttggacgagg tgttgtaaat gttcgtactt gtcaaagatg gtttagaaaa tttcgttccg 240 gagatctgag cctagaagaa gatactcgac ctggacgacc atccaagatc gataatgaag 300 ttttgcggtc tatgttggag cataatccac atctaacaac aagacaaatt ggggaaaact 360 cagaatttct catacagttg ctggagatca cattaaagca cttgggtttg tcttgaagca 420 agccgtttgg atcccacacg atttgactga aaaaaacctt gtctgaccgt ataagaatat 480 gttcatcaca tctcattaga aataatatgg agccgtttct ggataaactg attactggag 540 atgaaaagtg gattctttac gagaatataa agaagagaaa agtttactgc aaacctggaa 600 catcaccagc aactcttgca aaaccagata ttcatcaaaa gaaagttctg ctctgtttgt 660 ggtgggatag aaaagaacca gtgtattacg agttgctgaa acaaggacaa acgatcaatg 720 cgaatgtgta ctgcaaccag ctcgataagt taaatgcagc tatcaaggaa aaaaggcctg 780 cattggcatc cagaaaagga atcttgtttc accatgataa tgctagacca catactgcaa 840 tggtgacaca acaaaaactt actgcgttgg gatgggaagt tcttagtcat ccaccatatt 900 cacctgacat tgcaccatct gactattacc tgttcaggtc cctgcaaaat tatctaatgg 960 gaaaaaattt taattctttc gaaggtgttt ctaaggcagt ggctgagtat tttcagtcca 1020 agaacgagaa tttttataga agcgggatcg accggctacc tgaaagatgg caacaagtgg 1080 ttacaaataa tggaaattat attattgaaa aaaattaaat atatattttt ttaaattgat 1140 tcaaatgttt tattgaaaat acgacagaac tttccttcca acctaata 1188 // ID DNA8-34_AP repbase; DNA; INV; 194 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-34_AP. XX NM DNA8-34_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-194 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1776-1776 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. Putative hAT element. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 194 BP; 52 A; 43 C; 35 G; 64 T; 0 other; caggggtggc caaaccgcgg ctcgcgtcat agacttttgc ggctcgcatc ttaacgtaac 60 attagccaag ataaaaaaaa aaaaggtcat ataatataca ataatatgac cttttttttt 120 ttttacatct tggctcttca atttttatta atatatttgg tggctcgcga gtctcttctc 180 gctggccacc cctg 194 // ID DNA8-29_AP repbase; DNA; INV; 771 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-29_AP. XX NM DNA8-29_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-771 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1771-1771 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 771 BP; 269 A; 135 C; 136 G; 229 T; 2 other; caggggcgcc atttggrggg gagcagggta tactttggct cccctatttt tttttgtgtc 60 ttattggcct ggattaaaat ttagaatgcg ccgacggctg tcccgagccc aaattctccc 120 actctcgcac tcaatcgtgc acgacggtcg ccgatcaaag cttcccgaca agcatacccc 180 gatttcaaat aaattgtacg taaacgcgta tattatagtg gtactaattg gaactggctg 240 tatatggttc acaaaggact gatagcactg cacgcacgtc gccgtcgcat atcggtggtt 300 gttcttattg atagcggtag aaaaagaaga tgctaacaaa atagatttag atgaagcagt 360 agatagattt tcgaaactaa aaagtagaag gtataagcta gaatagcata aataattaaa 420 aaaactaatg aatactgtaa gtctgtaact gtatgaaata ttgtattatt attaaattat 480 ttattataat aagtaaataa atatattgta ttcattttat ttggaataaa ttgcattgaa 540 taaatgtaga tacctatacc tgtaccaaaa ggcaaacaaa ttattaatga ataatttgaa 600 ttcattaaaa tatacccttt aagagcytaa ccccatacaa tcgacttaca cacaataaac 660 gatattaaac gatatctgta tttactattt ttaaaaagtg gcctgacgaa aaaaaaaatt 720 ggcctgctta aatattggca cccctattat aaaactcaaa tggcgcccct g 771 // ID Gypsy-3_RP-LTR repbase; DNA; INV; 211 BP. XX AC ACPB02031233; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_RP_; KW Gypsy-3_RP-I; Gypsy-3_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-211 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02031233; Positions 11613 11403. XX SQ Sequence 211 BP; 67 A; 30 C; 41 G; 73 T; 0 other; tgtaacggag gggctcctcc aataacaact gcttatcgat ggtgaaaact attatatggg 60 aaggttacaa gaaacatata cagagagaga gtcccgcctt gccaagaccg tgtgtttttt 120 tattgcgtaa caattttatt aataaataat aagtgtattt tttgtaaata tatttcattt 180 attgaacgtt acagtgtgtt tttgcgtaac a 211 // ID Gypsy-2_OD-I repbase; DNA; INV; 11463 BP. XX AC CABV01000624; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_OD_; KW Gypsy-2_OD-LTR; Gypsy-2_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-11463 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000624; Positions 55642 67104. XX CC Positions [5242-5727] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2361..3641 FT /product="Gypsy-2_OD-I_2p" FT /translation="MAPSPYCTVLLKQYKPNSSSTVNLKFNALIDTGSTHS FT YLHKKLLNPDLILEDVNYTVGNITQDNILIIRKRIKCNIIIADGSEITNCT FT LCVIEDDSADVDGIIGMDLISNNTLKSSELFIQHQSSNLKNLSLQKHQEDD FT QILNVTTLRKTKNTAYGSVDLKNAALLHPNEQKRVKVTKFNGKELKQVDET FT LFKNWVVLAAPVEKANDIDHVTLLNKGHSALIIEAGTTILKHAQKMENPEN FT GKRDKILNFLISKAELAPAEKITFKKELQKWKEKRNELVEKVSIDDEITEA FT VQTAPADMRQELQRILAKFSWFFSRNQSDAGLNQHWAMDLCLKEENTSPVF FT SRPYKIDASLLKQIEEKLAQMCESGIIEPANSSWNSPLLTVVKKDKSIRIV FT NNYSAKTKNGSVNTNLKLPRVSAFASTNNFGKN" FT CDS 6246..8966 FT /product="Gypsy-2_OD-I_3p" FT /translation="MKLRIRNWNLLILLLLQKTSAEVSTTPQNQPKLNSTE FT NSNHPSQELVNTLVNGIALEEDIIPGYLVFDEIIDSIELALFPEILDYTIA FT ENNHCFRHVSTDLAHDQLVRINQLLGNFFNLEEDEDMFYEFQRSLLNNIIQ FT LDTYDKEMPWAFIEHTKIKGSTDVTKPETLLTYHEETTDRNLLTSLTLLEG FT CNQSKNPQLVQQFKIQIFGHADEIIIQSKKTAPGMLKWQVAVEGVSIWRNQ FT LDHPEHYLFRDSETSDFARFPLFDMPGYNSDQSMTVTIGFYFDWNIMGQEK FT TFGTKTINGENYVEACLTFEISEIYVIKNAKRYSDQGSRERRSAAEAATAI FT LSLVGYIQQNYQNYRVNQRIDDIKKRVEKDELRQIELARITENVIEIVASH FT KQELLILQRQICSSEIQLEELKFDQYTFMLFSNFIMHVERTENAIANQMPN FT THAQKLMIKVCSEINNSTEGCVQYYHRAHYELISTESTHKTTPMGQLRGVK FT FRISYSVPILRKITRVFGVLSTPVPLKKENNEYFYQSYEVPDVIGYSAESK FT KAFSMEKCEKTPHLGTRFCRMSLLTQKSSNMDCSTAILNAKPYEYCESSVY FT SSPSDCLYTGSTTSAHVLISHFNKFEIIEHTDASFPGVTLVTSSKDESQNV FT TMIERPEFAANIICKNARFWVAARSENAKPLSYIIKTVKLDQSLILRELDE FT ANFWNFKNNNVSGAFLQENIARSMIQQYASRLTKDYNKAKGYFDNRSTRLQ FT WILYSTGTLIALVLLIVLCVKYPAAGTAFWTAFTALGKGLSTLKNCCFKRE FT NKPAVPWEPKKSRKNKVFGIPLTEFNDEESQEFYPKSDSSITTSIKTKRIK FT PPRPSSRPGKSPSAPVLRKKKIVKSSFSENDRRNSDTEIGINRTDKNRRSV FT WRF" FT CDS 4156..6246 FT /product="Gypsy-2_OD-I_1p" FT /translation="MLNYYGRSIKMMQAALEPIAKGSALGKNFTLTDEMKR FT GLDEVKNQIKNGIHANHLNYVNGENGRYIFIAADTSLHRIGGCIGNATLKG FT GKVSGVTVAAYYSRKLSEQESLLSSRCRELIGIQATVKHFADLVPKTMKFL FT VFVDHRSLESVFHSTELKTSGATRTRSAYADLLDYPLMEIRYLPATDELIS FT VCDALSRNENWTEEIDKEVFHPRNFSKTDASINRLTQQSLKSIGHFAPKLE FT VSAVREAQLNHELYSKVIKNNDPGKQFYLKNIRYIIGENQILYRITDAGRK FT LAIIPKSIGREIISYLHVMTMHSGKAALESIVRDEPIWIESKTAIITEVTK FT ECLACYIQTPEKFRKEGSQLRRKPALRTLEKLYTDIIELRVEQKSFLYMTV FT LDEFSSFLMAKRVRSKKAADMVPALMIMMTSLGAQGNSLIVSDNGPEFVSK FT EFQTALRALGMDSARISPYNSGSNLVERSHRDLRAKLRLANVTTDNVDFHF FT EMAISHYNHSPKASLKNYTPMEIFANISRPTTFSSLDIDKANETVPYGIED FT FIEKLKEHQSDLAKSKIDTYFMKTPNENLKLVPGDFVVILDSRKQLGHSND FT ARGPYEVTRLKYGTCYELTEVLTGKRVLRNSKFLIKMQPSQKSKEMLRKIQ FT EKGALNITEKEALEILWGKRELVRCSLDSSELIYLNPHERKYNLRSRR" XX SQ Sequence 11463 BP; 4035 A; 2505 C; 2329 G; 2594 T; 0 other; actggtgact gaagtcataa ccactcccgt gcctccggga atatcggata gtcatccgat 60 ttaagaacag aagttcttcc gaaagcagca gctttcgatc agaaaattat ctgatcaagg 120 acagttgtcc tcttgaaagt agtcactttc aaaaacaagt tggagcttgt tcttaaagga 180 ctgtgagtcc tcaaaaaagg aaaattatct gatcaaggac agttgtcctc ttgaaagaag 240 tcactttcaa aaacaagttg gaacttgttc ttaaaggact gtgagtcctc aaaaaggcag 300 attggcccct acgacaggtc ggacctgttt ccaaaaaaca agttggtttt ggtaaaagcg 360 gtgtcttttt taaaaaagaa cttggaagtt ctaaatcagt cagttggcga cttttttaaa 420 gaacttggaa gttctaaatc agtcagttgg cgactataaa cttacgcaac gtacagatca 480 ggcacaccaa cgatatccag taacaacttg caaacaggaa ggggaaatta gctaatttct 540 taacaaattt acaaaatgat cgcgaccgat gataattctg ctgaggcttc tactggaatc 600 agcaaaatca ttgtgtcaat tgacagcata aatgtgccgg ggcttaggtt tatttgtaaa 660 acgctttacg aagacgaata tgaacagctc ggcgcgtctc caagaaagag ggagctactc 720 aggattctga agaaagatga agctgacgtc atatcatttg gaacgattgg tttcagaata 780 tgtgaaggaa aaactctgca cttaaccgtt ccactttcag cacctttcgc gtgtgacgaa 840 tctgataaag tgaggataag aaaattatca gattggaagc aagaactgga agatttagaa 900 gaggaggaaa gagctttaga atattacatc atttcattca acaacggcga atttagtgat 960 ctgggtactt caaacctgca aggcatccaa cttataaagg catattacag ctacagacct 1020 cgaattgatc aatttacaat actttatttg gaatcaaata tatccattct ggatattgca 1080 aataaaatta aagccgaaac tgacagcgcg aacaaaaaga aggttgaatt tcaaggagac 1140 acagaaaaac aggacgaacg acgagaacag agtgaaattt acaagcgatt ttcaactcca 1200 aaggcaaaat taaatttcag cggattacca aaaacctgac gatggaccag aaagctgggc 1260 ttacgaggat gatagttcag aggaagaaca ggaagattct gaaaacaagc ggaatacctt 1320 acagtggttc ggagtcgaga aagaacttgc cgaaaaagat caagcaaagt taaaagtaga 1380 cgtgaaagtc ataggttttg aagatttaaa aacagtcaga gaatggtaca ccattctgca 1440 atttcagctc gatttacagg gaatcacctg cgcaaggttg agggtaggcc agcttctcaa 1500 taatttgaaa acgaacctca aaactaccat tatcaacaaa ttgggccgga tgcgagcgtt 1560 gccatcaatg gaagacgcct ttgactgcct tcttgaatgt tctatgttta caagcataga 1620 tgcgaaaaaa tcgacggaca aattcgcgat tacgctccaa aaaccggtca tcgagcagtt 1680 ttacagactt cgggggctta tggaacgagc atggccacag aactcagaaa aagcaataat 1740 tcaaatgaca agatcgaaat tcgaagaaaa actgccaaaa gctattacca acgaaggcct 1800 cttcaagtac cgcgacgatg atattgacac ttcaacctac ctcaagctca ttcagaaaat 1860 cgccgatagc acggatgaat caaggactat gaacgtgctt aaggagattc acgtgcaatt 1920 tttgctcgaa aaagggacat aaatggaaag aatgccgaaa gaggttaaaa ggggaaagtg 1980 caaataagga tggaagcaaa tccggcccga actcaaatgc gtgcaatcat tgcggaattc 2040 ctggacataa atggatcgac tgcagaaaaa ggttgagaga acaaggagga ggtaaaacga 2100 caaccaggtc agattatttt cagaagaaag ataacggcca gaaagcagcg aatgcaaata 2160 attaccgtgg caacgctgat tttaagaaaa atgctgattc ctcaaaaaag caaatcgaat 2220 gctggaattg cggaaaaacg gggcattata aatcaaattg cagaagtcca cgaaagctga 2280 attttaccga actgacacaa gagggcgctc ctccaagcta cgagaaggag ttccgaggag 2340 aatatactca cgatcagtaa atggcgcctt ctccttattg cactgtgctg ctgaaacagt 2400 acaaacctaa ttcatcatca actgtaaatt taaaatttaa cgcactcatt gatacgggct 2460 caacacactc ttatctgcac aaaaaactgc tcaatcccga cctcattctg gaagacgtca 2520 actacacggt tgggaacata actcaggaca atattctcat cataagaaaa cgcatcaaat 2580 gcaacattat aatcgccgat ggatcggaaa ttacgaactg cactctttgc gtcattgaag 2640 acgacagcgc agatgtagac ggaatcatag ggatggatct tatcagtaac aacactctga 2700 agtcaagcga gctgttcatc cagcatcaat caagcaactt gaagaatctc agtctgcaaa 2760 aacatcagga agatgaccaa atcttaaatg tcacaacatt gaggaaaacg aaaaatacgg 2820 cttatggatc tgtggatctg aaaaacgccg ctttgctgca tccaaatgag caaaagcgag 2880 taaaggtgac aaaattcaac ggcaaagagt tgaaacaagt agatgaaaca ttatttaaaa 2940 attgggtcgt attggcagca ccggtcgaaa aagccaacga cattgatcac gtgacactgc 3000 ttaataaggg gcactcagca cttatcatag aagcaggcac gacgatcttg aaacatgcgc 3060 aaaaaatgga aaacccggaa aatggaaaaa gagacaaaat cctgaatttt ttgatctcaa 3120 aagccgaact agcaccggcg gaaaagataa cgttcaaaaa ggaactacaa aaatggaaag 3180 aaaaaagaaa cgaactggtg gaaaaagtca gcatagacga cgaaatcaca gaagcagttc 3240 aaacagcacc ggcagatatg cgacaagaac ttcaacgaat tttggccaaa ttctcctggt 3300 tctttagtcg caaccagtca gacgccggac taaatcagca ttgggcaatg gacctttgcc 3360 taaaagagga aaacacgtca cccgttttca gccgtcccta caaaattgat gcctcacttc 3420 ttaagcaaat cgaagaaaaa ctggctcaga tgtgcgaatc tggaataatt gagccagcaa 3480 attcaagctg gaatagcccg ctcctaacag ttgtcaaaaa ggacaaatca attagaattg 3540 taaacaatta ttctgcaaag acgaaaaatg gatccgtcaa caccaacctg aagttgccac 3600 gggtttccgc ttttgccagt acgaacaatt ttggcaaaaa ttagtagcgc aatcagcaga 3660 attcgctcca aaaatccgaa tgatcagatc gttttttggg gctgcgatct gcgaaacgcg 3720 tactattcag tttcaatacg agaaagtaag cgagatatca cgtcctttct tttcgcttcc 3780 cagcaattac agtattccag gatgtcacag ggcctcagca gcgctccttc aacttttcaa 3840 aattttatca tgaaagtcct ggaaaacgtg gaacgtgatg acgacagctt tgagcttata 3900 atatacatgg acgatgttaa ttgcatcaca acccgttcga aacacaacat tgtcgtcgat 3960 aaggttttaa cagttctaac agaaaacaac ctcgttatcg ctcttcgaaa aagtgaattt 4020 tttatgcgga aaatagcctt tctgggcttc atcatcaacg aaaacgggat cgaagtaaac 4080 cagaaaaaag tgggacattc ttctaaaaat cgaattatcc aaaatcggca aaagattgta 4140 tgaaaatttt ggggcatgct taactactac ggtagaagca ttaaaatgat gcaagcagct 4200 ctagaaccaa tcgcaaaagg ttcagctctt ggcaagaatt ttactctgac agatgaaatg 4260 aaaagagggc ttgacgaagt taaaaaccaa atcaaaaatg gaatccatgc caaccacctt 4320 aattacgtaa atggagaaaa cggtcgatac atcttcatag ccgcggatac ttcgctacac 4380 agaatcggag gctgcatcgg aaatgcaact ctgaaaggcg gaaaagtgtc cggagtgaca 4440 gttgcagctt attattcaag aaagctttca gaacaggaat cacttttaag ctcgcgatgt 4500 cgagagctta tcggaatcca agccacagtt aaacatttcg cagaccttgt tccaaaaact 4560 atgaaattcc ttgtctttgt ggatcatcgt tcacttgagt ctgtgttcca cagcactgag 4620 ctgaaaacga gtggagcaac acgaaccagg tctgcatatg ctgatcttct ggactacccc 4680 ttgatggaaa tccggtactt accagccact gatgagctta tatcagtatg tgacgcactt 4740 tccagaaatg aaaactggac agaagaaatt gacaaagaag tttttcatcc tcgaaatttt 4800 tcaaaaacag acgcaagcat aaacaggctc actcaacaat cgctgaaatc aatcggtcat 4860 tttgcgccaa aactggaagt aagcgcggtc cgggaagccc agttaaatca cgaattatat 4920 tcaaaagtga ttaaaaataa cgatcctggt aaacaattct atctaaaaaa cattagatat 4980 atcattggtg aaaatcaaat cctttacaga ataacagatg ctggtagaaa actagcaatc 5040 ataccgaaaa gcatcggacg agaaatcatc tcttatctgc acgttatgac aatgcacagc 5100 ggtaaggcgg cgctcgagag cattgtaaga gatgagccaa tttggatcga aagtaaaact 5160 gcaattatca ctgaagtaac caaggagtgc ctagcttgct atattcaaac tccagaaaaa 5220 ttcagaaaag aaggctcaca attacgaaga aagccagcat tgcggactct cgagaaactc 5280 tacacggata tcatcgaact aagagtcgag caaaaaagct tcctgtatat gacggtttta 5340 gacgaattca gcagctttct catggcaaaa cgggtacgtt cgaaaaaggc agcagacatg 5400 gtgccagctt tgatgataat gatgacatca ctcggagctc aaggaaattc attgattgtg 5460 tccgacaatg gacccgagtt cgtttcaaag gaatttcaaa cagcattaag ggcactgggg 5520 atggacagcg caagaatcag cccgtacaat tccggtagta acctagttga gcgctcccac 5580 cgtgatctgc gcgcaaagct gagacttgca aatgtgacta cggataacgt tgatttccac 5640 ttcgaaatgg ccataagcca ttataatcac agcccaaaag caagcttaaa aaactacacg 5700 ccaatggaaa tcttcgcgaa catttctaga ccaacaactt tcagttcact cgacatcgac 5760 aaggcaaatg agaccgttcc ttatggaata gaagatttca ttgaaaagct taaggagcac 5820 cagagcgatt tggcaaaatc gaagattgac acatatttca tgaaaactcc taatgaaaac 5880 ctgaaactgg tgccaggcga ctttgtggtc atattagatt ctcgaaaaca acttggtcac 5940 tcaaatgacg caagaggtcc ttacgaagta acaagactga aatatgggac atgctacgaa 6000 cttactgaag tccttacagg aaaacgagta cttcgaaatt caaaattcct gatcaaaatg 6060 cagccaagcc agaagtcgaa ggagatgctc agaaaaatac aagaaaaagg agctctaaat 6120 atcaccgaaa aagaagctct cgaaatttta tggggcaaaa gagagctcgt cagatgctca 6180 cttgactcgt cagaactaat ctatttaaat ccccatgaaa gaaaatacaa ccttagaagt 6240 cgaagatgaa gcttcggata cgtaactgga atctgctgat actgctattg ctacaaaaaa 6300 catctgcaga agtctcaaca acaccacaaa atcaaccaaa actgaactcc acagaaaatt 6360 caaaccatcc aagccaagaa cttgtaaata ctctcgtcaa cggaatcgcc ctcgaagaag 6420 atatcattcc aggctaccta gtctttgatg aaattatcga ctctatcgaa ctagccttgt 6480 ttccagaaat cttggactac accattgccg aaaataacca ctgcttcaga catgtaagta 6540 cggacttggc tcatgatcaa ttggtcagaa tcaaccaatt acttggaaat tttttcaacc 6600 tcgaagaaga tgaagatatg ttctacgagt tccagcgatc cttgctaaac aacatcatcc 6660 aacttgacac ttacgacaaa gaaatgccat gggccttcat agaacataca aaaatcaaag 6720 gatcaacaga cgtcacaaaa cctgaaacgc ttctaactta tcacgaagaa acaacggata 6780 gaaacttact aacaagtctt actctactgg aaggatgtaa ccaatcaaaa aacccacaac 6840 ttgtgcagca gttcaaaatc cagatcttcg gccatgctga tgaaataata atccaatcga 6900 aaaaaaccgc tccaggaatg ctcaaatggc aagttgcggt tgaaggcgtt tcaatctggc 6960 gcaaccagtt ggaccaccca gagcactacc tttttcgcga cagcgaaaca tcagattttg 7020 caagatttcc tttgttcgat atgcccgggt acaattccga tcagagcatg acggtgacca 7080 ttggattcta cttcgactgg aacataatgg gtcaagaaaa aaccttcggc acgaaaacga 7140 tcaatggaga aaattacgtt gaagcttgtc tcaccttcga aatatcagaa atttacgtaa 7200 taaaaaatgc aaaaagatac agcgatcaag gaagccgaga acgccgaagc gcagctgaag 7260 cagcaacagc gattttatca ctggtcggtt acattcagca aaactatcaa aactaccgcg 7320 tgaatcaacg aatcgatgac ataaaaaaac gtgtggaaaa agacgagctt cgccaaatcg 7380 agctcgcacg aataactgaa aacgttattg aaattgttgc gtctcacaaa caagagttgc 7440 taatcctcca gcgacaaatc tgttcaagtg aaatccagct ggaagagctc aagtttgatc 7500 aatacacctt tatgttgttc tccaacttca ttatgcacgt tgagagaacc gaaaacgcga 7560 tcgcaaacca gatgccgaat acgcacgcgc aaaaactaat gattaaagtt tgctctgaaa 7620 tcaacaattc aacagaagga tgtgttcagt actaccacag agctcattat gaacttataa 7680 gcactgaaag tacgcacaaa actacgccga tggggcaact tcgcggagtg aaattccgaa 7740 tcagctactc ggtgccaatc ttaagaaaaa taacaagggt tttcggggtc ctcagcactc 7800 cagttccgct caaaaaggaa aataatgagt atttttacca gtcttacgag gttccagacg 7860 tcattggata ctcagcggag tcaaaaaagg ccttttcaat ggaaaaatgt gaaaaaactc 7920 cgcatttagg cactagattt tgcagaatga gccttcttac tcagaaaagc agcaacatgg 7980 actgcagtac ggcaattctc aatgcaaagc cttatgaata ctgcgaaagc tcagtctact 8040 ccagtcccag tgactgtctc tatacaggtt caacaacgtc agcgcacgtg cttatctcac 8100 acttcaataa atttgaaatt attgagcaca cagatgcaag ctttcctgga gtaactctcg 8160 taacaagttc aaaagatgaa agtcaaaacg tcacgatgat tgagagacca gaatttgctg 8220 ccaacataat ttgcaaaaat gcaagattct gggtcgctgc aagaagcgaa aacgccaagc 8280 cactgagcta cataataaaa accgtcaaac tcgaccagtc actcattttg agggaattag 8340 acgaggctaa cttctggaac ttcaaaaaca acaatgtaag tggtgcgttt ctccaagaaa 8400 atatagcacg ctcaatgatc cagcaatacg catccagatt gactaaggat tacaacaagg 8460 caaaaggcta ttttgataac agatcgacga gattacagtg gatcctgtat agcactggaa 8520 cacttatagc gctcgtcttg ttaattgtcc tttgcgtgaa atatccagca gcaggaaccg 8580 ctttttggac cgcctttaca gcgctgggaa aaggactttc tacgctaaaa aactgttgct 8640 tcaaacgcga aaacaaacca gctgttccat gggaacctaa aaaatcgcga aaaaataaag 8700 tttttggaat ccctctaaca gaattcaacg acgaagagag ccaagaattt tacccaaaat 8760 ctgattcttc aatcaccacg tcaatcaaga caaaaagaat caaaccgcca cgtccctctt 8820 cacgacccgg aaaatcgcca tcagctccag ttttacgaaa aaagaaaata gttaaaagct 8880 cgttctcgga aaatgacaga agaaactctg atactgagat cggaattaac agaacagaca 8940 agaacaggcg ttcagtctgg cgcttttagc caaaacgatc tgctttcaca acttcatcga 9000 cattttgatc gccaggacag cttcgacact atatcatccg atgccactta cgaccttcca 9060 cccattcgag atctcccatc gattccaaag gagacagaac cagcaatatc aagccaacca 9120 aaatcatgga aaattcaaaa accgcccaat ttcgctccaa acaggacaac ctcgctaata 9180 aaaaatgaac aaaaacagca acaggacaca acggaaacag aatataagaa ctgtttccca 9240 tatctagtaa aagctgacgg aaccttctgc tgcccgttat tcttaaaaaa ataataagta 9300 catttattga aaaattgatt tataaaataa cgcttttttt tcattttgcg aaagcttttt 9360 gcatctatgc ctactttaat tctatttaaa tgtaaaaata ccaaaatcaa aaggaaaaaa 9420 cagtccaaaa agtccaccat aaccctaagg gggttaaaaa cgagcgcgca gcgtctaata 9480 gtcaacaacc agctcgcgac aaccagcgcg cagcacccag caaccatagc aaccaaacaa 9540 cgaagctgcg tagcggtcaa cgaccggcgc gcagcgccca gtgcgcagca ccaagcgaac 9600 atagcaacga tgagcaaaca aatttttgct ctttcgacta catcaaattt ccttaattaa 9660 cctgcttaat taaaacagct taaaaaataa aaaacaaata aaaattataa aaaaaaagta 9720 aaattaataa aaatattcaa aatttaaaaa aaaaatgaca caaaaaaaaa aattattttt 9780 cgtttttttt tattttgcag ctcagcagag caaatcataa gtattgcatg ttacgagcaa 9840 ggtagccata atcgtctccc tgctcccacc aggaaaaccc gaactgcaca gggacgaaaa 9900 cgcccggaag attcgcggcg ggctgcagaa aagaaagcgg caaagcgcag ttcagcagct 9960 aaaacaaaaa ttaaaaaagt ctacaagtct aaaatacctc ctcttcgccg gcagcgaagc 10020 gcaatttaat gcggcaccag tactggacgg cgcggttgta tccagcctgc tcctccacgc 10080 ggtccaattg gcagagcatc gccgagcgag gcatcgattt cagtatgaaa ctgaaatctg 10140 cccaatttga gccagctggc gcctcctcca aacgggtgac gcagagccag atcagatcac 10200 tggcactaaa aaacattaaa aagaaagaaa ttaaaaaaca cttacggggt agtagaccca 10260 cagctgccgg gaatcatttt catagatggc ctgtcggacc cagcggcaga gcagcggaag 10320 agcggcatgg tcccagtcca caaaaggcaa tgcaagcagg cgcgctcggc cgttgcgcaa 10380 gcgggatgcg cgtcgcggca tcagacttcc ctgggcggca gcatcggtgg atgcggcagc 10440 gacggcgcca ccatcgtcag agacagcagc ggcgccgtcg tcagcagcag cagcggcgtc 10500 gacaacctgg tcaatgttgc cttcttcgac aacttcgtcg gctgaaagtt aaagttaaaa 10560 ttgaaatccg attcaaaaat ttgaaaaata aattaaatag gtttattttt attcataaaa 10620 taatttaatt acgttctgtt atcgccgaaa gcgcgttaag aacagccgcg aagtttgggc 10680 caaatcgtat catcatggat ggatgaagat ggatcgcagc agagatcatc accttggcag 10740 cagccaaact caccgacagt ggtcgaaact gcgacagtag tcgcaaatcc gatggcagta 10800 gcctcaactt cgttgactgt agtcacaacc cctgcgacag cagtcgcaac tccgatgtca 10860 gtagtcgcat cgccgccagg cagtcagcag gcacctgtta tcgatttaac tacaagttct 10920 ggtagagcca gaataaatga agtccgagat cttctcaaag attcggaaaa ctatttcagg 10980 acctgtaccg cgatgaactt caaactgctg ctatcaacaa caacggaaat accagcacca 11040 acagaagtcg ttgcagcaat atttaacggc atcaaatcca gttcaacgaa gacaataaaa 11100 aagcaattcg cgaaaatcca cagctatcgc tggtttattc catacactgc tgaaacagaa 11160 agaatattca atgataagaa ggttgcgcag atggccagct tcttatcgcg cggaagattg 11220 acaattcgat tcatcgcaag tgagctgcga aatactgact tcgcagcgct cgtcaccaaa 11280 cctttggatt acatcacctg ggcagctttt ccagaaccac aaacgaaaat ctcgccatca 11340 gtgacggctg tggcgctcta cgccgccttg aactccaacg aagtcagacg aatcaacaac 11400 gactgcagac gaaataacat ccgacttcga ttggcaagct aggatgcaat tatgggggag 11460 gga 11463 // ID Copia-9_SI-I repbase; DNA; INV; 4381 BP. XX AC AEAQ01030799; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_SI_; KW Copia-9_SI-LTR; Copia-9_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4381 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030799; Positions 4821 441. XX CC Positions [1624-2142] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 112..2598 FT /product="Copia-9_SI-I_1p" FT /translation="MENWNLTSVEPLDDKNYFLWSEKIEGILRAKKLWKKV FT INVKPPEKPVEGTENYETKYKSWNEWDDDNYAARAVIINTMSSSQLLKYSR FT EKSADKLWSLIKNNMAAETDQLKAKFLSELSNLRMYKDESVDAYVNRAEAL FT RNQCVQLGKNIEDYELRMYILSGLRSEFDQNVRVLDTQRELTINDIRYALK FT QEESRKNKRKEERTSRDEYVRRARDKPKFDVACYNCGMKGHVSSECRNKQK FT CFNCQGLNHISADCKEPKRIASRGRGTRGNSRGRGFGRRRGQSEVTLKTAD FT ESVLTVRDSAHVSMSLCKNGESKVTKDNSNDCIWILDSGATSHMASNEVMF FT GYLEDEKRDIRLADKEGKKLISDGKGEIVLKQDSVNSRIRLENVLCVPDIN FT VNLLSVAKITDHGYNVKFDKHGAIVYRDKDEIKMRAVREDNAYYVKTTTMN FT DEKAATITDIDIWHKRLGHINKRIIEEMKREDLVIGMKETSKEIRQCEPCV FT EGKICRKAHPRLPGRRTKKIMELWHTDLIGPIKPSSRGGKRYIMTTIDDYS FT RVIFIDFLREKGEAAKRLKELIMLKENQTELKLKAIRSDNGGEFIAEELEN FT WFKSKGIKHEFSPARTPQCNGVVERANKSIIEMTRTMMMDSKLSMDFWAEA FT ASTAVYIKNRSKSTVHGKTPYEMWNGWKPNINFMRRFGCMAYVLDKEKQRR FT KFDSKSVKGIFIGYASNNTYRVYVPETGKVKADCDVKFDETRNGCELINRK FT ADKEEKNEEKLIIMGINPGNNEEWMENRMEIEEDEEDRRTESSVYEDATQE FT DLGDDTHENLENGDESEDENKLMMEGM" FT CDS 2685..4382 FT /product="Copia-9_SI-I_2p" FT /translation="MELRRVERLEEQERNIGQNDVRRSERIKNQQSALLVT FT DDEIPKNVEKAMESSDWKFWKDAIKEELTSLDKHKVWDVVPRPKDKKVIKC FT KWIFNIKEDPNTEQRRYKARLVALGCGQRPGVDYRETFAPVVRTETLRILF FT SISAQERRKIKIYDVRTAFLHGTLKEEIFMEMPDGLRSNKEQVCKLKKSIY FT GLKQAGRCWYECLTDVMQNCGMKQSVEDPCLFYVKKENNYLYCGIHVDDMV FT IISNREEFEKEYMDKIKRYIDIKDLGEANTVLGMQIEQEEGRIYVHQRKYI FT QNLLQLYGMEECNTVGSPMDVNVIMEECSNSEQADVHIYQELMGRLMYLGV FT YTRPDISFTLSNLSQFNNDPRMMHMTALKRILRYLRGTIDYRLEFGEKAVE FT GLIECEADASWDRTKDAKSYSGILVYRNGDLIHWKSKKQSKVALSSTESEL FT EAMLEGTKEVVWTARLLREIGMSNGSQTELRCDNLNAVRLANGGTFKTKSK FT LLNRRCHYIKEIVKEENVRVKHVPNEVMTADCLTKPLSKPKLLKACEEVHE FT RTRMIKPRGRLWDIVFYHL" XX SQ Sequence 4381 BP; 1680 A; 595 C; 1151 G; 955 T; 0 other; caattggtag cagagcgtgg ttgatgcaaa aagaaggttg gtgagtgtaa ttgagagacc 60 gagtgttgag agtttgtgcg tgaacgacga gagagaagag tccgattcaa gatggagaat 120 tggaatctca caagtgtgga accgcttgac gacaagaact atttcctatg gagcgaaaag 180 atagaaggaa tcctgagagc gaaaaagttg tggaaaaaag ttattaatgt gaagccacct 240 gaaaaacccg tagaaggtac ggaaaattat gagacgaaat acaagtcatg gaacgaatgg 300 gacgacgaca attacgcagc ccgtgctgtg ataataaata caatgagtag ttcacaattg 360 ctgaaataca gtcgtgaaaa aagtgcggac aagctttgga gcttgatcaa gaacaacatg 420 gcagcggaaa ccgatcaatt gaaggccaaa ttcctcagcg aactgtcgaa cctgcgaatg 480 tataaagatg agagtgtcga cgcatatgtg aacagagcgg aggcgctgcg aaatcagtgt 540 gtgcaacttg gaaaaaatat agaagactat gaactgagaa tgtatatact cagcggacta 600 agatccgagt ttgatcaaaa tgttcgagtg ctggatacac agagggagct gacgatcaat 660 gatataagat acgcactgaa acaagaagaa tcccggaaga ataagcgaaa agaagagagg 720 acatcgaggg atgaatatgt gagaagagca agagataaac caaaattcga cgttgcgtgc 780 tataattgcg gaatgaaagg ccatgtatcg agtgagtgta gaaacaagca aaaatgcttc 840 aactgccagg gattaaacca tatttcggcc gactgtaaag aaccgaagag aatcgcatca 900 cgaggaagag gaacaagagg aaactcgaga ggaagaggat tcggcagaag acgcgggcaa 960 agtgaggtta cgctgaagac agcggatgag tcggtactga cagtaagaga cagcgcgcat 1020 gtgagtatgt cgttatgcaa aaatggagaa agcaaggtaa cgaaagataa ttctaacgat 1080 tgcatatgga tattagattc aggcgcgaca agccatatgg ccagtaatga ggttatgttt 1140 gggtatctag aagacgaaaa aagagatatt cggttagcag ataaagaagg aaaaaaattg 1200 atctctgacg ggaagggaga aatagtatta aaacaagatt cagtcaacag tagaatccga 1260 ttagaaaatg tgttatgcgt tcctgatatt aatgtaaatc tattgtctgt ggcaaagatt 1320 actgatcacg gatataatgt gaaattcgat aaacatggag ctatcgttta cagagataag 1380 gacgagataa agatgagagc cgtgcgtgag gacaatgcgt actacgtaaa aactacaaca 1440 atgaatgacg aaaaagcagc gacaattaca gacatagaca tttggcacaa acgacttgga 1500 cacatcaaca agaggatcat cgaagaaatg aagagagaag accttgtcat tggaatgaag 1560 gaaaccagta aagagataag acaatgtgaa ccatgcgtgg aaggaaagat atgtcgaaag 1620 gcacatccta gattgccagg gagacgaaca aagaaaataa tggaactgtg gcacacagat 1680 ttgattggtc ctataaaacc atcgtcacga ggaggtaaga gatacataat gacgacaata 1740 gatgattact caagagtaat ttttattgat tttttgagag aaaagggaga agcggcaaaa 1800 aggttgaagg aattgataat gttgaaggaa aatcaaactg aattaaaatt gaaagcaatt 1860 cgatcagaca atggaggtga attcatagcg gaagagctag aaaactggtt caaatcgaaa 1920 ggaatcaaac atgaatttag cccagcgaga acaccgcagt gcaacggggt agtggaaagg 1980 gcaaacaaat caattattga aatgacgagg acgatgatga tggattcgaa actgtcaatg 2040 gatttttggg ctgaggcggc gagtacagca gtatacatca agaataggag caagtcaacg 2100 gttcatggaa aaacaccgta tgaaatgtgg aacggttgga aaccaaatat aaattttatg 2160 agaaggtttg gctgcatggc atatgtcctg gataaggaaa aacaaagaag gaaatttgat 2220 tcaaagtctg tgaaaggaat atttataggc tatgcaagca ataatacata tagggtgtat 2280 gttccggaaa caggaaaggt gaaagcggat tgcgatgtta aattcgacga gaccaggaat 2340 ggatgtgagt tgatcaatag aaaggcagac aaagaggaaa aaaatgaaga aaaactgata 2400 atcatgggaa tcaatcccgg aaacaacgaa gaatggatgg aaaatcgaat ggaaatagaa 2460 gaagatgaag aggataggcg aactgagagc agtgtttacg aggatgcaac acaggaagat 2520 cttggagatg atacacatga aaatctagag aatggagacg agagcgaaga tgaaaacaaa 2580 ttgatgatgg aaggaatgtg agaaatggag tgatagaaga aattcaagaa caaccaattg 2640 agattagaag tagaggaaga ccgaagggaa ccactaaagc tgtaatggaa ttacgaagag 2700 ttgagagatt agaagagcaa gaaagaaata taggacaaaa cgacgttaga agatctgaaa 2760 ggattaaaaa tcaacagtca gcattactgg tgacagacga cgaaattccc aagaatgttg 2820 agaaagcgat ggagtcgagt gattggaaat tttggaaaga tgccataaaa gaagagctga 2880 catctctaga taaacataaa gtttgggatg ttgtgccgag gccaaaggac aagaaagtta 2940 taaagtgcaa atggattttc aacattaaag aagacccgaa tacggaacaa cggagataca 3000 aggctagatt agtggcactt ggatgcggac aacgaccagg agtggattat agagaaacct 3060 ttgccccggt ggtaagaacg gaaacattaa gaatactatt cagtataagt gctcaagaaa 3120 ggagaaaaat aaaaatttat gatgtgagaa cagcgtttct acatggaaca ttaaaggaag 3180 aaatatttat ggaaatgcca gatggattac gaagtaacaa agagcaagta tgtaaattga 3240 agaagagtat ttatggactc aagcaggccg gaagatgctg gtatgaatgt ttgacagatg 3300 tgatgcagaa ttgtggaatg aaacaatcgg tggaagatcc ttgtttattt tatgtaaaaa 3360 aggagaataa ttacctgtat tgtggaatac atgtcgatga tatggtgatt atttcgaata 3420 gagaagaatt tgaaaaggag tacatggata agatcaaacg gtacattgat attaaagatc 3480 tcggagaagc gaatacagtt cttggaatgc aaatagaaca agaggaagga cgaatatatg 3540 tgcatcaaag gaagtatatt caaaatttac tgcagttata cggaatggaa gagtgtaaca 3600 ctgtcggatc accaatggat gtaaatgtaa ttatggaaga atgtagtaac agtgaacaag 3660 ctgatgttca tatttatcag gaattaatgg gacgtcttat gtacttggga gtgtacacta 3720 gaccagatat atcgttcaca ctgagtaacc tgtcgcaatt taataatgat ccaaggatga 3780 tgcatatgac agcactgaag aggatacttc gttatttgag gggaaccatt gattaccgcc 3840 tggaattcgg agaaaaggct gtggaaggac taattgaatg tgaagctgat gcatcgtggg 3900 ataggacgaa ggatgctaaa tcatattcag gaatattggt gtatagaaac ggagacttga 3960 tacactggaa gagcaaaaaa caatcgaagg ttgcactttc ttcgacggag agcgagttgg 4020 aagcgatgct ggaaggaaca aaggaagtcg tatggactgc taggctacta cgtgaaattg 4080 ggatgtcgaa tggatcgcag acggaactga gatgcgacaa tttaaatgcc gttagattgg 4140 caaatggagg aacatttaaa acaaaatcca aactattgaa cagaagatgt cattatatca 4200 aggagattgt aaaggaagaa aatgttcgtg tgaagcatgt accaaatgaa gttatgactg 4260 cggattgctt gacgaaacca ttgagtaaac caaaactgtt gaaagcatgt gaggaagttc 4320 atgagcgtac ccgaatgata aaaccaaggg gcagattgtg ggatattgtt ttttatcatc 4380 t 4381 // ID Gypsy-56_AA-LTR repbase; DNA; INV; 107 BP. XX AC AAGE02021956; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-56_AA_; KW Gypsy-56_AA-I; Gypsy-56_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-107 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021956; Positions 95197 95091. XX SQ Sequence 107 BP; 38 A; 14 C; 24 G; 31 T; 0 other; tgtagtaaga tcatagtagg cagacttagg gtaaaatata agattgacag ttagggatgt 60 tcattctgta ataaaccact gaaagctgaa gacgttgttt cactaca 107 // ID LCSAT1 repbase; DNA; INV; 190 BP. XX AC X57585; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE L.cuprina satellite DNA. XX KW SAT; Satellite; Simple Repeat; Highly repeated satellite DNA; KW LCSAT1; Repetitive sequence; satellite DNA. XX OS Lucilia cuprina OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Oestroidea; Calliphoridae; Luciliinae; Lucilia. XX RN [1] RP 1-190 RA Perkins H.; RT "LCSAT1."; RL Direct Submission to Genbank (31-JAN-1991)H. Perkins, Department RL of Biochemistry and Mol Biol, Australian National University, GPO RL Box 4, Canberra 2601, AUSTRALIA. XX RN [2] RP 1-190 RA Perkins H., Bedo G.D. and Howells J.A.; RT "Characterization and chromosomal distribution of a tandemly RT repeated DNA sequence from the Australian sheep blowfly, Lucilia RT cuprina."; RL Chromosoma 101(5-6), 358-364 (1992). XX DR GenBank; X57585; Positions 1 190. XX SQ Sequence 190 BP; 54 A; 22 C; 37 G; 77 T; 0 other; gatcggtaga attatttttc caggtttaaa aaggcggtaa tctgaggttc cgatatggtt 60 gtgaataact ttgacagttt tcatgctatg acagtcattt atgcatcatt ttaaagcatt 120 atgattgcac tttaatttgg tatatcattt gtagcataat gtgtgttttt acaaattggt 180 caatttgata 190 // ID DNA8-31_AP repbase; DNA; INV; 595 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-31_AP. XX NM DNA8-31_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-595 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1773-1773 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 595 BP; 259 A; 80 C; 67 G; 189 T; 0 other; cagggcttgc aaacgttata ataacgttat gattataacg ttatatttga caaaaaacga 60 ttgaaatttg taaacgttat tataacggaa agcgataatc aaaaataatt aaataccgca 120 aaacaaaaca taacgattaa caaaatatat tatattatat cattaaacaa aacataacgc 180 aaaacgtaat ttatagaata tcattaaacg aaaaataacg caaaacgtaa tttatagaat 240 atcattgaac gaaaaataac gcaaaacgaa atattttaaa atctatttat ttaaaaggtc 300 tattggtttc gtagaaacat ttatttcatg caaacaatct aagaattgat tatattatat 360 tccgtatccg ggccattacc tacccgcatt agttattatt tttaaattta ataggtatta 420 acactatttt aaataacgat aaacgcaaaa cttgtataac gttttaattc taaataacga 480 taaacgcaaa acttggataa cgttttaatt ctaaataacg ataaacgcaa aaattgtgta 540 acgttttaat tgtaaataac ataaaacgga attttttttt caaatgcaag ccctg 595 // ID Gypsy2-LTR_Dpse repbase; DNA; INV; 198 BP. XX AC Unknown_singleton_20; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_Dpse; KW Gypsy2-I_Dpse; Gypsy2-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-198 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1036-1036 (2009). XX DR Genome; Unknown_singleton_20; Positions 11796 11599. XX SQ Sequence 198 BP; 84 A; 21 C; 55 G; 38 T; 0 other; tgtaggaaag gcagtaacgg gagtaccatt aacatagaga gaggaaataa tcgaatagaa 60 actaagatcg agtaatcgaa aagagaagag tttgagctac gagggagaga ggttgtgttc 120 tcggtgcgcg gattaaacga aagcgagaga agagaaaatt ataaaacact gtgtaaatta 180 aatataagac gaatgaca 198 // ID Copia10-NVi_LTR repbase; DNA; INV; 293 BP. XX AC AAZX01001646; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia10-NVi; KW Copia10-NVi_I; Copia10-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-293 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1108-1108 (2007). XX DR Genome; AAZX01001646; Positions 8966 9258. XX SQ Sequence 293 BP; 89 A; 59 C; 42 G; 103 T; 0 other; tgatataatt tttcgtcaat tcatataatt aaatgtttga atatttcgcg ctctttgata 60 ctcatatgtt tatagagttg aagcgagtaa tatctaacgg atcgttaacg gttatagttg 120 tgtatataag ctccgaaacg tttataattc cttcttcttg tcccgaccac cgaccagtca 180 acaagtatag tttatactcg ccacagtaat cgctctacgt aaataaacaa tatagagtac 240 aagttaatct tctattttat tcgcaagatc tgcttccctg atccccaaca tca 293 // ID hAT-34_SM repbase; DNA; INV; 2483 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-34_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2483 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1837-1837 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 465..2264 FT /product="hAT-34_SM_1p" FT /translation="MSKKRKYDESYVKYGFCCMKESDDVEKPQCFLCGKIL FT ANASMKPAKLIEHLKSLHPENASKDLEFFTKKKAQFSKSGTLTKLGFGIPQ FT KPLVEASFRVAYRIAKSKKPHTIGETLIKPCALEMVELVCGLEQRKKLEAI FT PLSNDVIQSRIVEISCNILKQIINELKASPFPFSMQLDETTDISNCSQLLV FT FVRYVSADTIKEEFLFCEPLLQTTKAVDVLAILNVFFSKHDFDWKQKLHSL FT CTDGAPAMLGNKSGFAHLVKKEAPNVIVTHCFLHRHALATKTLPTSLKDVL FT SIVIKTVNFIRSRALNHRLFKTLCQEMNAEHEVLLYHTEVRWLSRGQVLKR FT IFKLKTEVSLFLEEKDNSLFEYFNNKDFICKLAYLADIFNHMNDINISIQG FT PDTTIMDATEKLQAFLSKMSVWKIRIQNGIYANFQMLDEFIFENGSRQDSL FT LLNNLKDAICEHLEVLQVSFEKYFNLDEITKKDELWIRNPFLCDIDCIDDM FT DLAKDELIDLKTKSLLKMDFDSKTLGEFWSSLREAYPLLVKRAMAAIIPFA FT TVYLCESGFSTLVTIKTKHRNRLNVEHDMRVALSKTIPQFNLLIKEKQQQH FT SH" XX SQ Sequence 2483 BP; 819 A; 389 C; 437 G; 838 T; 0 other; cagggtttct taacctgtgg ttcgtggacc ccttgggggt ccatggatca tattttgggg 60 gttcgccgac tgctcctaat ttctgctcct aatttttaaa atttctttaa aaaaaatttt 120 tttttgtcaa aatttgaaaa tttgaaacac tgagtacctt tgatttttat ttcacttgga 180 ataaagtgcg ttagaatcag ttccgcaaag ttcttgcaaa ttcttaaagc gaaatatttg 240 ttcgtaaagt attgtattta ctagtatttt tttttcgagt gcttttttgt atcatagttt 300 tttctagtgt ttttttgtgt tgacaacggt gttttcaagt aaaggtgagt tcagtcaagt 360 aattcatttg tgaacaattt caaattgaat tcttaataat ttcttgatat ttttttctta 420 ataatttatt gatatttaat tcttgatatt ttttgtagcc caaaatgtcc aaaaaaagga 480 agtacgatga aagttacgtt aaatatggat tctgttgcat gaaggaatcg gacgatgttg 540 aaaagccaca atgctttctg tgtggaaaaa ttttggcaaa cgccagtatg aaaccggcaa 600 aattgataga gcatctcaaa tctcttcatc ctgaaaatgc atctaaagat ctagaatttt 660 ttacaaagaa gaaagcccag ttttcgaaat ctggaacatt aaccaaactt ggatttggta 720 ttccacaaaa acctttggtt gaagcatctt tcagagttgc atatcgtatt gccaaaagta 780 agaaaccgca tactattgga gagacattaa ttaagccatg tgcgctagaa atggttgaat 840 tagtttgtgg actggaacag agaaaaaaac ttgaggctat tccattatcc aatgatgtga 900 tacagtcaag aatagttgaa atttcttgta atattttgaa acagatcatc aatgaattga 960 aagcatcgcc atttcccttt agtatgcaac tggatgaaac tacagacatc tcgaattgta 1020 gtcaacttct ggtttttgtt cgttacgtgt cagctgatac tatcaaagaa gaattcttat 1080 tttgtgagcc tcttttgcaa actacaaagg ccgttgatgt tttagcgatt ttgaatgttt 1140 tcttttccaa acacgatttc gactggaaac aaaaacttca ttcgctttgt accgatggag 1200 cacctgcaat gcttggcaat aaatctggtt ttgcacattt ggtaaaaaaa gaagctccta 1260 atgttattgt tacacattgt tttttacata gacatgcgct ggctacaaaa actcttccaa 1320 ccagcttaaa agatgtattg tcaatagtta ttaaaaccgt gaattttatt agaagtagag 1380 ctcttaatca tcgtcttttt aaaacattgt gtcaagaaat gaacgcagaa catgaggtac 1440 ttctgtacca cacggaagtg cgctggctgt ctcgagggca ggttttgaaa cgaattttca 1500 agttgaagac agaagtatca ctatttttgg aagaaaaaga taactcgctc ttcgaatatt 1560 tcaataacaa agattttata tgtaaattgg cttacctggc agatattttt aaccacatga 1620 acgatataaa tatttcaatt caggggcctg atacaacaat tatggatgcc actgaaaaac 1680 tacaagcatt tttatcgaaa atgtccgttt ggaaaattag aattcaaaat gggatatatg 1740 caaactttca aatgttggac gaattcattt ttgaaaatgg atctcggcaa gatagtttgt 1800 tattgaataa cttgaaggat gcaatctgtg agcacctgga agtacttcaa gtttcatttg 1860 aaaaatattt caatttggat gaaataacta aaaaagatga gttgtggatt cgtaatcctt 1920 ttctttgtga tattgactgt atcgacgaca tggaccttgc caaagacgag ttaattgatt 1980 tgaaaacaaa gtccttgtta aaaatggatt tcgattcaaa aactcttgga gagttttggt 2040 cttctttaag agaagcctac ccattgctag taaaacgagc tatggcagcc attataccat 2100 ttgctacggt atatctttgc gaatcaggat tttccacact cgtaacaata aaaacaaaac 2160 atcgaaatcg actgaatgtc gaacatgata tgcgcgttgc tttgtcgaaa accatccctc 2220 aattcaatct cctaatcaag gaaaagcagc aacagcattc acattgaaaa ttttattttc 2280 ttcaattaca attattattt tatatccagg aacataaaga tatctgtcaa cagttatatt 2340 attgaaataa aaatatttaa ataataaaat agaatatatt ttgtaataat tgacgtttaa 2400 acaaattttt ttcacagagg ggtccgtgac atatttaaaa tttttaaagg ggtccattgc 2460 accaaaaagg ttaagaaacc ctg 2483 // ID YOYOI repbase; DNA; INV; 7065 BP. XX AC U60529; XX DT 11-JAN-1999 (Rel. 4, Created) DT 11-JAN-1999 (Rel. 4, Last updated, Version 1) XX DE Ceratitis capitata yoyo retrotransposon gag-like, pol-like and DE env-like genes, complete cds. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; YOYO; YOYOI; YOYOLTR; env; gag; KW internal portion of LTR retrotransposon; pol; internal portion. XX OS Ceratitis capitata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Tephritoidea; Tephritidae; Ceratitis; Ceratitis. XX RN [1] RP 1-7065 RA Zhou Q. and Haymer S.D.; RT "Gypsy-like retrotransposon in the Medfly."; RL Unpublished. XX RN [2] RP 1-7065 RA Zhou Q. and Haymer S.D.; RT "YOYOI."; RL Direct Submission to Genbank (11-JUN-1996)Genetics & Molecular RL Biology, University of Hawaii, 1960 East West Rd, Honolulu, HI RL 96822, USA. XX DR GenBank; U60529; Positions 317 7381. XX SQ Sequence 7065 BP; 2883 A; 1431 C; 1034 G; 1717 T; 0 other; tggcgcccaa cgctccagtc gcggacggta tacgtagtgc ataccttcga aacgtagtgc 60 atcacgcgcg aacctaagtg aacagtgcgc gtacgtaaag gactctaccg tcaaagcaga 120 agcgacagga cgaaccaaaa cgggtaaact tcggtggaag ataacccaaa cccaccagaa 180 ggcgcatcac cccggaacca tagtagcgga agcaaactgg ataattccca atgcgtcgtg 240 ggattttaac gtaagaaata gattaagaca taaatataga taacgttgtg aaaaaaaaag 300 aaaagttaaa aaaagtacta attaaaatta agataaggaa aattatattt aaaaaggaaa 360 agagaaaatt tgtgaaaaaa agaaaagtta aaaaagtact aattaaaatc aagataagga 420 aaatattatt tgtgaaaaaa aaagaaaaat taaaaatact gattaaaatc agggaataag 480 gatcataata aaacaatttt aaagcataca agataatatt aaaggcttgt acatccttaa 540 ataaacaaac cctaaagctt tcggcttata acataaccat atctctgcga ttcactccat 600 aaagggggac cctccagaga atattaaaat aaaaagacag ttaattaaaa atatttaaaa 660 aaataaaatt aaattaaata aaatggcaca taacgacatt attagaccct cgatgcttgc 720 atccgcacct agagcagcag ctccccccct acagcctaat ccactacagc ctaatccacc 780 tacttcggca aacgccacac cagaacttag aaacataatt cgggatttaa tgagagaaat 840 attggccacc gaggaagcca atattgtaag aacatcaacg caaacaatga ccgaccaaat 900 aatcgaagaa cggtttaggg acaatttgac agacatggac aaagttcgag acgtggtaag 960 atccctacgc gagtttggcg gcaatccagc ggaatacagc agctggaaaa aaagtgtcga 1020 gcgtatcctc aaaatatacg aacctaacat gggttcacca aaatattttg ggatactcaa 1080 cgttatccgc aacaaaattg tgggtagcgc tgatgccgct ctagaatcct ataatacacc 1140 cttaaattgg aaagcgattt ccaaatgcct cacaatgcac tatgccgaca aaagagattt 1200 gagcactctc gaataccaga tgacttgctt actccaaggc aggagaacag tgcatgagtt 1260 ttacgcggag gtatactcac acctttcctt aattctaaat aaaattgcct gcatggacat 1320 caatgaagaa gccatgcgta tccttaccca cacatatcgc gataaggcac ttgatacctt 1380 cgtcggggga ttgtcaggcg atctatcaag gctacttggt atgaaagagc cagccgatct 1440 acctgaagca ctacatctgt gcatcaagtt agaaaatcaa aattttagaa caatacacgc 1500 gaataaccaa tacagtttac caagaaaacc taatttacca ctcttaccat taggaatgca 1560 tcatcaacat aaacccccaa taccacagag aaacataaat cacaacaacc agccctttta 1620 tccacaatta gcccatatac cccaagcaat aaatccaatt aatcgttttt cccagcatcc 1680 ccaataccgt caacaccaat atcaatacca tccccaatat caataccaac ctcaatatca 1740 acagcaacaa cgctttagac ccaatcaatt cttcaatact ccaccaaggc ctatacaacc 1800 aaaaccacca atcccaatgg aagttgacga atcattacaa actaggaacg ttaactacat 1860 gaacagacca aacaaaatct tcgccggaaa aagacccata gtaccttcaa accaaatacg 1920 acaacccttc aaacaaagta gaatcaataa cataatagca gtctcagcac ctcaagacga 1980 agaacagaca tctaaggatt actatgacca atactcagct caagataata attcaaacga 2040 taatattcta gaagaaaacg aatacgaatt ttccgatata cattttttag attaacgaac 2100 tccacgttac catactttca atgtaaagat agtaacggag atattttaaa atttttgatt 2160 gatacagggt caaataaaaa ttatattcag ccaaacctag taaaaaactc aattcccaat 2220 aatgatattt tttatgccac ctcagttgga ggaaaaatca aaataaccca ccacactttc 2280 attgaccttt tcggactaaa agacgaaaat ctcaaatttt tcgtactacc aacattaaaa 2340 tccttccacg gaattctcgg taacgacagc ttaaaacagt tagaagccat aatttttaca 2400 tcaaaaaacc acatgctaat aaaaaataag attaaaatag caattaaaca acaaaatgcc 2460 acatcagtaa acaacgtcgg aataagaaca acgcacctta cagactcaca agcagaaaaa 2520 cttcgaaatc tatgtcaatt gtacccaaaa ctttttttgg aaccagacga aaaattaaca 2580 tacacaacag tagtaaaagc aacaatcaga acaactacag acaacccagt ttattctcga 2640 tgttaccctt atccaatgtc tttaaaatcc gaagtcgaaa gacaaatcaa caaacttctg 2700 gaagatggca tcataagacc atctcgctct ccttataatt ctcctgtatg gatagtagac 2760 aaaaaaccag actctttagg aaataaacaa taccgacttg ttatagacta tcgaaaatta 2820 aactctgtta caattgcaga ccgttatccc attcccgaaa ttaatgaagt cctttcccac 2880 ttaggaagta atacattttt ttcagtcatc gatttaaaaa gtggttttca tcaaattccc 2940 ttaaaaaact cggacataga aaagaccgct ttctccataa ataatgaaaa atatgaattt 3000 accaggctgc catttggctt aaaaaatgcc ccttcaatat tccaaaggac gttagacgac 3060 attttaagag actacatagg acaatgctgt tacgtataca ttgacgatat aattatattt 3120 agtagaaatg aaaaagaaca ttcaacccac ctaaaaaata ttttcactac actagaaaaa 3180 gctaatatga aggtacagtt agacaaatgc aaatttttcg aaaaagaggt agaattccta 3240 ggattcatag taacaccgga aggaataaaa acaaaccctt caaaaataga ggctattcaa 3300 aatttcccaa ttccacgaaa cctcaaagaa ctcagatcct ttttaggtct gtcgggttat 3360 taccgtcgct ttgtcaaaga ctatgcgaag ctcgctaaac ccctgacagc ccttttaaga 3420 ggggaggaag gacgtgtgtc caaaagtcaa tcagctagag cgcacataac actaggcgac 3480 gaagcacttg cagctttaga aaagataaaa aacgtgttaa tttcaagaga tgttatgctg 3540 acctacccca acttaaataa agattttgag cttacaactg atgcttccaa ttacgcaata 3600 ggcgctgttc tttcgcaaga agaccgacca atcactttca tctctaggac tcttacgaaa 3660 acagaggaaa attacgcggc taacgagaaa gaaatgctcg ccatcatatg ggcattgaag 3720 tccctgcgaa attacctcta cggatcagca aaagttaaaa tctttacgga tcatcaaccc 3780 ttaacatatg cactaagcaa caaaaacaat aatagcaaaa tgaaacgatg gaaggcaatc 3840 ctcgaagaat ataactatga actaaaatat aaaccaggca aaactaatgt agtagcagac 3900 ggactatcta gaccaccaca gcaggaccaa attaattcct tgacaccgac tcaacatagc 3960 gacgaaagct caccgcaaaa cctcattcct tatacggacg ccccgattaa tgctttcaaa 4020 aaccagttat tcctaaaaac cgcagacacc tcgtcctacc aattcttaat acccttccca 4080 acctaccata ggcatatcat tgaagaaccc gaatatacag aagaaaattt aacaaatcat 4140 ctaaagcgct atttaaatcc ttcagttacc aatgcaatta ttacagacga ccatatttta 4200 ggtagaatac aaaacataca tccaattcac ttcagcagat acagaataaa atatacgcga 4260 aaaatagtaa aagacttagt acgagaatct gaacaggaat ccgaaatagt taaagaacac 4320 aaaagagcac acagaagtgc aacggaaaac aaagcccaaa ttttagaaaa tttttatttt 4380 cctcaaatga attcaaagat taataaaata attaagcaat gtaaaatatg tctcgaaaac 4440 aaatatgaaa ggcacccttc aaaactagtt ctaaaagcaa ccccagttcc taattatccc 4500 ggacatatag ttcatatcga tatttaccac accaataata gagtaatcct tacagccgta 4560 gacaaatttt cgaagtacgc gcaagctaga atagtcaaat caagagcaac agaagacata 4620 aaactaccct tacaagatct actcacatcc ttcggaattc cagaaaaaat cgtcattgac 4680 aacgaaaagt cattaaactc atcctccata gtatttatgt tagaaaacca atacagtatt 4740 gaaattttta aaacacctcc ttatactagt tcagtaaacg gtcaagtaga gcgttttcac 4800 tcaaccctta ccgaaataat gcgttgtaca aaagcagaga acacccacaa tagttttgaa 4860 gaattactaa atagatcaat atgcaaatat aaccactcaa ttcattcaac cacgaaaaag 4920 aaacccattg aaatcttctt tggtagaagc gtttatagtg acccatcgtt attagaaaag 4980 gatagactcg acaatattaa aaaaatagtt aacaaacagg aaaaagacct taccttccac 5040 aacaaaaaga ggacagaagt aaaagcatat tccagcaatg acattatcta tgtcaaaatc 5100 aataaaagaa taggcaataa actcacaccc agatataaaa aggaaatagt tctcgaagac 5160 aatggtaata cagtcacaac aaaatcagga aggtcggtgc ataaaagtca tatcaaggct 5220 tcataacaag aaccaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaactata ataataataa 5280 taaaactaat aaaataaaat ctaaactaaa taattttata atataattaa ggcaattaaa 5340 caaagaaaat actaaaggaa aaagccatta agtaagaaat aaataaaatt tagcaaatag 5400 gagataaagg aaattagttt taaaatcaat cgaccaaaat gccataggcg gtgcgatttt 5460 agattaatta atagcctaga aatcattaac caatcgaatc gtcattaaag gacatatata 5520 caaattaaat ttttcagtct cttcatctgc atgtcctatg cagacataca aatccacgac 5580 tattcctcat cccatttaat aaccattgac aatggctact ccaaaatcaa ggacggtacc 5640 ttaaatttta tccatatcat tgacattcaa gcctatcgga acgttttaga aaatattact 5700 gacatgatag aaaagagtta cccaccagaa aatcccatat atcctacctt acaacacgaa 5760 gctcaacaag cattagaaat gttggaaatt gtggaaccca tccagaatcc tagaattaga 5820 agatcctttg atatccttgg tactgcttgg aaatacctag caggaagccc tgatcacgac 5880 gacttgaata tgattaataa caacttagat aaacttataa caaataataa caatcaaatt 5940 tttataaata atgctgttaa aaaccaaata aataaattaa ctgaaataac taataacata 6000 ctacaatcga ttaaaaatga taacatgatt gcaaacgaaa tagcgttaaa tttacaaaat 6060 agattaagat tgattaaaga agaaatcata aatataaaat atgccataca atgggctaga 6120 ctaggaatcg ttaatactat cattttaaat aaaaaggaaa ttgaaacatt tgtaaaagaa 6180 tttaataaag aaaatatgcc cttcaatact gtagaggagg ctttagaatt agcagatatt 6240 aatgtattaa gtaattcaac aacaatttta tatattttga aaataccctt aacgacaagg 6300 gaagtttata ataattatat aataaaatct gtaataagaa atgatgttat gataaatttg 6360 aaatttaaca atgtacttaa aaacaaaaat acaatatatg gaattaaaaa ccaatgtaaa 6420 aaatataata aagttagtat ttgtaagcaa gaagaattgt tagatttaag taatgatttt 6480 tgtatatcga aattaataaa aagccaaaac tcttcttgtc acatggtaaa ctctcaccac 6540 ataccaagac acgagctaat cacgcccgga atcattttcc tcaacgacta tcatggtaac 6600 atcgttatga atgacaacat ccgaatcatg aacggcactt acatcattaa tttcagaaac 6660 gcatcgatta ccatcaatga gcgaacatat aagaactttg aatcaccagt aatgaaagtg 6720 atgccagcaa ttgcgcaacc aacaccaata gaagaaagca ttacaaagct tctctccctg 6780 gaagcgctca gcgagctaaa cgtaaattac cccgtagaaa tacattcact gcaaatagaa 6840 agaaccgtga atcgaatttc ttcaggaatt actcttctta tggtaatcat cctgatggcc 6900 agcgccataa actattttca taaaaaaacg aagtcgacaa aaccaacaga atccagctct 6960 gaggacaacc agcttcaaga aatcatcact gaagctccta ttcctgaaac aaggaccttt 7020 caaatttttc agacgctcga ggacaagcgt ttttgaagag ggagg 7065 // ID Academ-1_AP repbase; DNA; INV; 6967 BP. XX AC . XX DT 30-APR-2010 (Rel. 15.04, Created) DT 30-APR-2010 (Rel. 15.12, Last updated, Version 2) XX DE This family belongs to the Academ superfamily of DNA transposons. XX KW Academ; DNA transposon; Transposable Element; Academ-1_AP. XX NM Academ-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-6967 RA Kapitonov V.V. and Jurka J.; RT "Academ - a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 10(4), 643-643 (2010). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Academ is a novel superfamily of DNA transposons that populate CC genomes of metazoans, including cnidarians, insects, sea urchins, CC lancelet, and fish. The autonomous Academ transposons encode a CC ~1500-aa protein composed of a novel Academ transposase domain, CC which is not similar to transposases encoded by any other CC transposable elements reported previously, the XPG domain, and CC the putative Cys8 zinc finger. The XPG domain is structurally and CC functionally related to FEN-1; divalent metal ion-dependent exo- CC and endonuclease, and bacterial and bacteriophage 5'-3' CC exonucleases. The Cys8 zinc finger is a conserved set of eight CC cysteines: CC Cys-X-Cys-X3,4-Cys-X3,10-Cys-X-Cys-X6-Cys-X3-Cys-X1,2-Cys. CC Academ transposons generate 3-bp target site duplications and CC contain terminal inverted repeats whose length varies from 6 to CC 530 bp. Usually, Academ transposons have the 5'-TAG and CTA-3' CC termini. CC Academ-1_AP is a young family. The consensus was derived from CC multiple alignment of several copies >95% identical to it. TIRs CC are 500 bp long. XX FH Key Location/Qualifiers FT CDS 2190..6332 FT /product="Academ-1_AP" FT /note="Contains the Academ TPase, XPG nuclease, and FT Cys8 zinc finger." FT /translation="MRSSQFAFDFKNCCFICGKDADEKKENKKRKELRLKI FT SHVCTLGFEKNIIDVAKARGDEWAKAVQKRVILNTDLVAAEAKYHQNCLNK FT FNLPVPREKKSGRPANEIVAAAMEEIFSFIENNEDSQFTLNELKEAVSDYL FT PDNKTIKKKLEEKYGDRIIITTKKTGFTIISFRETHINILNQAWYEKKKID FT PNDERRRLLNTCADILRQDIHTKIYETDFYPPSTSLFNDLDENIPETLIFF FT VEKLLLKNNKGNLESKKRVCKTICHMIISALGPRSFRSPLQLGLAVYCYRQ FT YASQRLVDILSSLGVCVSYKEALLYEASSLFHPQPLVSPPEDGCFVQYVCD FT NADHNISSIDGLNTFHSMGMIKIISPYDKINDSQQIVRLSKIPTKIEMANV FT SHIPLKLYNNHGVQGLKTITIKKLNFDQIKVTSIFRNSDVLWWYAKWQADD FT VVGWSGFMEILTREMIHTKSRILFLPFINHSASNYNTIFTTLQYITNDGNK FT DGHTTCVVTLDQPLYLKTREIIATLTGEPMFSNVFVRLGGFHLLMSYLGSI FT GYIMAGSGLKEIMSIIFAPNSVDKILLGHAYSRAVRAHTLIQITLSQIIFK FT EMSFNDEQKEQYKVHLDNFNEELFENIESSKVIEELKLLFEEKIVELKNRG FT PTAKLWLQYFEMTALAKEFIRAERMGDWKMHLVCVKRMLPYFHAAGHYNYA FT KSAHLYVQDMENLENTMDNTAFQKFTNNFFSIKRSNKYFCGTWSDMIIEQS FT LMKSSKSKGGFTRGRSTNESVLNKWVDGLLTASNISEGLEHFCGLYFHSGE FT QHVDASDARIKRDAKDVKRLLDWFNFHDPFPYTESIMSIATGVTGDEKINC FT HVSWSVGMAGMDKKIGSTFGDVKFQRVDRVLSLLTVNSTVKINDTEIAVDP FT LLLFQRIIVIKRTNEELRNYLEYELAPYPLSLFDEAGMRKTTKSLFYDNFE FT KVKSPPDFQNATYVIDGGLLLRKVVWDMNRTYDDICGKYVDYVQNHFGSKA FT IVVFDGYENIRNSVKAVEQLRRSSKSSSVDICFDKQMTVTVSQEKLLSNRK FT NKTRLISMLTEKFEANNFPVKHAQDDADILIIETAIEQSENGTTVVVGEDI FT DLLVILTARTPIDKEVFLLKPGKGMIEQNIYSSRSFNHQGNIKNDVLFLHA FT FSGCDTTSALFHKGKTAALKLMKKRNDLQIAAGCFNNSNTSRETISLNGIK FT FFLAVYGASVKETSINNHRYSCFTKSVGKGKSVKLNLLPPTFEAAQQHLLR FT VYYQIQKWLGNELNPLDWGWVMRDNLLWPKTTTQPPAPDCLLHMIFCNCTK FT GCGPLCGCRRLGLHCSAVCGNCQGLSCLNTEPVEDDQLEKQTMDDTENPEL FT DETNPADFQQHIDDDEDE" XX SQ Sequence 6967 BP; 2447 A; 1021 C; 1169 G; 2330 T; 0 other; tagtccactg aaattagttg aaaaaaaggc caaactgggg aataattcat acaaaaggta 60 ttagtgtgtt tttaggtcta tttaagtgtc ctgaataact cctcaaaagt ccccttcaaa 120 aaagtgtacg cataacagtt ggcaacactg caaagatgct caaaaacggt ttttcaagat 180 tttctcagta actattgatt ttacaacaaa aatatgtcta tacaatctga aagaacaaat 240 aattttacat aaaaaaagat ctatgacttt ttttgtcaga acagtattta tcatgataaa 300 ttctaatcca attacagttg gcaccaatga aaagacgctc aaaaatggtt tttcgagatt 360 ttctcgataa ctattgattt tacgacaaaa atacttataa taaaaatgaa agaaaaaata 420 attttacata aaaaaggttt taaacatttt ttcatcagat caatatttat catgataaac 480 gccaatcaaa ttatgtaata aataaacaaa tatgtatatt gtatacattt gaatattcag 540 atatatttaa agattcgatg tactactatc taactgtatc taaccgtacg agactgtcga 600 cgtcttatcg gtaatattgt tgtcggtcag tatcgatgag ataaatatag aactcattat 660 attcgatatc ccccgtccca gccagcagtc taccatagtt ccttttcctt atctctaaac 720 ttacagtcta caaagtctaa tagttgcgtg tttttgtgaa ttatccgact tgacgagtgt 780 ctcacggcgt catattattt tcatatacgc cgcgtgtgcg cgtgtgtctt ttgctggaaa 840 gtgcgtcagt cgtggttcag cgggatacag tggtctaaaa agtaacctaa ttaagtaagt 900 atctaaatta catattattt attttattaa ctatctatat tacccatttg ttaattgatt 960 taatacaact aaagccataa atattattta actatccaaa tgtatgtaac ttgaaccata 1020 ttatctatca ccatgaacgt ttatttaatt taactttaag ctgtgattat aagcccattt 1080 cagtttttat aattattctt tttgaatttc catatacatt ggaacctaag ctgtgtctga 1140 tattgatatt cctaattgga ataattatat aacttttatg atattagaaa tttaatttca 1200 atgtttctta ttgatacttt attattgata ctattttata taattattga tatcataacc 1260 taaagattaa atatagaacg aggagacaag ataaattatt ataataaata atattaagat 1320 ttaataaatt aggtatttaa tattgtatta aatacctacg tttatataaa tgtaagtcgt 1380 attatagtac accagaattt gtatacttgt atctcctgtt aatatgaaat agtaaaacat 1440 aatttcatac aatatactta aaaattatat atttaataag ttaaagatac ttaaaagaat 1500 aaaatcactc ttggtgaatc tctgtataat atattatttc agaaattcat aatatatcca 1560 ttaatcctac atttatgtaa aacgaatata aactaaataa atattctata tcaattttaa 1620 atttaaatta aattttatta attcatggtt aatataaata aaaactaaaa aataatttaa 1680 tgtaagctaa atttacttta cagaccactg tgagctatcg gttgcatttc tacaagatcg 1740 gtgactataa tctatattat tattttgtca attgtacatt atcatatatt attgttataa 1800 ttattgttag ttttggatgt atatcagtaa ttagtgtgcg cccatccatt ttattattta 1860 gttaataatt attcgatttt ctctaaatcg actgtagact gtagaggact cgtgttacaa 1920 tgaatgtaag tgacgctcag tgttttattt gtgatcaatc gcttgttgat ggatctactg 1980 tcgaagttga gcgtggatta gataagttaa tcaatgctag tgttgaaaga ggtgatggga 2040 aacatgaaca attgcaaaaa ttaagttcaa caaaagttca tgttcaatgc cgtaaagagt 2100 atacgcgtcc atcgagtatc aacgcatata aaaaaaaaag atagaacgag gaagaagttg 2160 cagctgcaac gagtcctttg aaaaagaaaa tgaggtcaag tcaattcgct tttgatttca 2220 aaaactgttg ttttatttgt ggaaaagatg ctgacgagaa aaaggaaaat aagaaacgaa 2280 aggaacttcg tttaaaaatt tctcatgtct gtactcttgg ttttgaaaaa aatatcattg 2340 atgttgctaa agcaagagga gatgagtggg ccaaagctgt tcaaaaacgt gtaattttga 2400 atacagattt agttgcagct gaagcaaaat atcaccaaaa ctgtttgaat aagtttaatc 2460 taccagtacc tagggaaaaa aaatcgggtc gtcctgcaaa tgaaattgtt gctgcagcaa 2520 tggaggaaat tttttctttt attgaaaata atgaagactc ccagtttaca ttgaatgagt 2580 tgaaggaagc tgtgtctgac tatttacctg ataacaaaac tattaaaaaa aaacttgaag 2640 agaagtatgg agacaggatc atcataacga ctaaaaaaac aggatttaca attatttctt 2700 ttcgagaaac tcatatcaat attctaaatc aagcatggta cgaaaaaaaa aaaattgatc 2760 ccaatgatga acgccgcagg cttcttaata catgtgcaga tattttacgc caagatatac 2820 atactaaaat ctacgaaaca gatttttatc cacccagcac cagcttgttt aatgatctgg 2880 atgaaaatat tccagaaacg ttgatatttt ttgttgaaaa acttctttta aaaaataata 2940 aaggaaatct agagtcaaag aaaagggtgt gtaaaacgat ttgtcacatg attatatctg 3000 cactcggtcc tcggtcattt cgatctcctc tgcagctcgg actggccgtt tattgttacc 3060 gtcaatacgc gtcacaacgt ttggttgata ttttatcaag tctaggtgta tgtgtatcat 3120 acaaagaagc attattgtac gaagcttcta gtttatttca tccccaacca cttgtttctc 3180 caccagaaga tgggtgtttt gtgcagtatg tttgtgacaa tgcagaccac aatattagtt 3240 ctattgacgg tttgaataca tttcattcta tgggcatgat aaaaataatt tctccttacg 3300 acaagattaa tgatagtcaa caaatagtaa gattatcaaa aattccaact aaaattgaaa 3360 tggctaatgt ttctcatatc cctcttaaat tgtataacaa tcatggtgta caaggactta 3420 aaactataac cattaaaaaa ctgaattttg atcaaataaa agttacgtca atattccgga 3480 attcagatgt attgtggtgg tacgcaaagt ggcaagctga cgatgtggtt ggctggagtg 3540 gattcatgga gattttaaca agagaaatga tacacaccaa atcaaggatt ctttttctac 3600 cgtttataaa tcattcagca agtaattaca acactatttt tacaacttta caatatatta 3660 caaatgatgg gaacaaagat ggccatacaa catgtgtcgt aacgcttgac caaccacttt 3720 atttgaaaac acgagaaatc attgcaacgt tgactgggga acctatgttt tctaacgtgt 3780 tcgtcagact tggtggtttt catttactta tgtcatatct tggatcaata ggttatatca 3840 tggctgggag tggcctaaaa gaaataatga gtattatttt tgctcctaat tccgtggaca 3900 aaatattgtt aggtcatgct tactcaagag ctgtgagagc tcacacatta atccaaataa 3960 ctctttctca aattattttc aaggaaatgt cattcaatga tgaacaaaaa gagcaatata 4020 aagtccatct cgacaatttt aacgaagaat tatttgaaaa tattgaatct tcaaaggtca 4080 ttgaagaatt gaaattgttg tttgaggaaa aaattgttga attaaagaac agaggtccaa 4140 ctgctaaact gtggctacaa tactttgaaa tgacagcgct tgcaaaagaa tttatacgag 4200 cggaaagaat gggagattgg aagatgcatt tagtttgtgt taaaagaatg cttccatatt 4260 ttcatgcagc tgggcattat aattatgcaa aatcggcaca tttgtatgta caagatatgg 4320 agaatttaga aaatacaatg gataatacag cttttcaaaa gtttactaac aactttttct 4380 caataaaacg ttctaataaa tatttctgtg gaacttggag tgacatgatt atagagcaat 4440 ctctcatgaa atcgtcaaaa tcaaagggtg gctttacacg tggccgaagt acaaatgaaa 4500 gtgtattaaa taaatgggtt gatggattat tgaccgcaag caacattagc gaaggccttg 4560 aacatttttg tggactttat tttcactctg gggagcagca tgttgatgca agtgatgctc 4620 gcataaaaag agatgctaaa gatgttaaaa gattattaga ttggtttaat tttcatgatc 4680 cttttcctta tacagaaagc atcatgtcaa ttgcaacagg agttactggt gatgaaaaaa 4740 taaactgcca tgtatcttgg agtgttggga tggctggaat ggataaaaaa attggaagca 4800 ctttcggcga tgttaaattc cagcgtgtag atagagtact ttcattatta actgtcaata 4860 gtacggtaaa aattaatgat actgaaatag ctgttgatcc attattattg ttccaaagaa 4920 ttattgtcat aaaaagaact aacgaagaac ttcgtaatta tttagagtat gagctggctc 4980 catatccttt atcacttttc gatgaagctg gtatgcgtaa aactactaag tcactttttt 5040 atgacaattt tgagaaagtt aaatctccac ctgattttca aaatgccact tatgtcatcg 5100 atggtggact tctcctgcgt aaagttgtgt gggatatgaa tcgaacatat gatgacattt 5160 gtggaaaata tgttgattac gtgcaaaatc actttggctc aaaggcaatt gttgtttttg 5220 atggctatga aaatatcaga aacagcgtaa aagctgtaga acaacttcgt cggtcctcaa 5280 aatcttcatc tgttgatatt tgttttgata agcaaatgac agttacagta agccaagaaa 5340 aacttctttc caacagaaaa aataaaacta gattaatttc aatgctgaca gaaaagtttg 5400 aggcaaataa ctttcctgtt aaacatgctc aagatgatgc tgatatttta ataattgaaa 5460 cagcaataga gcaatctgag aatggtacaa cagttgttgt tggtgaagat atagatctcc 5520 ttgttattct aactgctcgt acgcctattg acaaagaagt ttttctcctc aagccaggca 5580 agggaatgat tgaacaaaat atttattcgt cacgaagttt taatcaccaa ggaaatatca 5640 aaaatgatgt gttatttttg cacgcattct ctgggtgtga tactacatcg gctctttttc 5700 ataaaggtaa aacagcagca ttgaaactca tgaaaaaacg caacgactta caaattgcag 5760 ctggttgttt taataattct aatacttccc gtgaaaccat ttccttaaat ggaattaagt 5820 ttttcctcgc agtttatggt gcatcagtta aggaaacttc tataaataac catcgttact 5880 catgtttcac taagtcagtg ggaaaaggca aatcagtaaa attgaattta ttgccaccca 5940 cttttgaagc agcacaacaa catctccttc gtgtatatta ccaaattcaa aaatggttgg 6000 gcaatgaatt aaaccctcta gactggggat gggtaatgag agacaacctg ttatggccaa 6060 aaactacaac tcaacctcct gcacctgatt gtcttttaca tatgattttt tgtaattgca 6120 caaaaggttg tgggccattg tgcggttgtc gaagattagg tctgcattgt tcggcagttt 6180 gcggaaattg ccaaggtcta tcttgtttaa atacagagcc agttgaagac gatcagttgg 6240 aaaaacaaac tatggacgac actgaaaatc cagaattgga tgagactaat ccggcagatt 6300 ttcaacagca tattgatgat gatgaagatg aatagacaaa ttaatgtata atggattaat 6360 acaattttat cattggttta aaattttaaa tattacatca tgaaatagta tattatgaaa 6420 taaataaata atatggtgat aaactgataa gtgaaataaa ctgttttttt ttttatttct 6480 tacataattt gattggcgtt tatcatgata aatattgatc tgatgaaaaa atgtttaaaa 6540 ctttttttat gtaaaattat tttttctttc atttttatta taagtatttt tgtcgtaaaa 6600 tcaatagtta tcgagaaaat ctcgaaaaac catttttgag cgtcttttca ttggtgccaa 6660 ctgtaattgg attagaattt atcatgataa atactgttct gacaaaaaaa gtcatagatc 6720 tttttttatg taaaattatt tgttctttca gattgtatag atatattttt gttgtaaaat 6780 caatagttac tgagaaaatc ttgaaaaacc gtttttgagc atttttgcag tgttgccaac 6840 tgttatgcgt acactttttt gaaggggact tttgaggagt tattcaggac acttaaatag 6900 acctaaaaac acacttatac cttttgtatg aattattccg tatcaaaatt ctaatttcag 6960 tggacta 6967 // ID pSOS_SO repbase; DNA; INV; 691 BP. XX AC D38566; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Fern sawfly repetitive DNA sequences. XX KW pSOS family; pSOS_SO. XX OS Strongylogaster osmundae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Tenthredinoidea; OC Tenthredinidae; Selandriinae; Strongylogaster. XX RN [1] RA Sonoda S.; RT "Direct Submission to Genbank."; RL Direct Submission to Genbank (20-OCT-1994)Shoji Sonoda, Faculty RL of Agriculture, Okayama University, Laboratory of Applied RL Entomology; 1-1 Tsushima-naka, Okayama, Okayama 700, Japan RL (Tel:0862-51-8324, Fax:0862-54-0714). XX DR Genbank; D38566; Positions 1 691. XX SQ Sequence 691 BP; 247 A; 69 C; 153 G; 222 T; 0 other; aattcgaata gaagtgtttt agttgaggga tcgaatacaa gtgtagtagt gattgggtgt 60 gggggctcta gtagtggtgg tgggggtaag gattcaggga aaagaaaaat cagcaggatg 120 ttacggaaag tcgtataact aaagattgtg ttcatgagat aggagattga aaggactgat 180 tgttagagta acggaaacgg tttatgatag tgaggatagg tttgagaaga ttagaaactc 240 attttcaaaa atatagaaat attaaataaa cttacagttt agagaaaata ttatatctaa 300 tacagggtta taggtcggta cattacagga tgaatgagag cttacattgc aaattaggaa 360 aaggatcttt caaatctatc ggaatgatga aactcttgat caacatcgag agataattgt 420 aatcacaatt gataaattca atgaataaat ttcaatctta cccttcttat taattgtttg 480 gaaaaatata aaatcgacgt tcattgattg ctgaggaagc atactatctg gatttatcat 540 aacaattgga tgaaaaacat gtataggtct ccgagaatgc atgtacaaga atgtttgcaa 600 tttattaaat ctgttaatat caagttttat tcattatgaa attgtttacg tttacaagtt 660 tgaattattc gtttgtagtg agttagtcga g 691 // ID Gypsy-80_CQ-I repbase; DNA; INV; 4355 BP. XX AC AAWU01003625; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-80_CQ_; KW Gypsy-80_CQ-LTR; Gypsy-80_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4355 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 539-539 (2011). XX DR Genome; AAWU01003625; Positions 46784 42430. XX CC Positions [3228-3689] - Integrase core CC 'GGCCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 204..4343 FT /product="Gypsy-80_CQ-I_1p" FT /translation="MFKEDASKLKEEAKVRLLLRKLDTTAHSRYCNYILPK FT LPSAVKFDDTVATLRKIFGSHTSIFHKRYQCLQLAKSEAEDIITYGGKVNR FT ACEDFEFQKVKIVQLQCLIFICGLKSPRYADVRARLLSRIEAETAENPVTL FT QTQIDEFQRLVNLKADTTLVERPSSSKPAVHAVQEKRSGPQQQRPAKTEGK FT LPRTPCWQCGQMHFVRDCPFSGHLCKACNRVGHKEGYCSCVSKPSSSSSKP FT DHSVEEKKKQAMKQQKAKSRGIFVNHVAANRSRRKYLPITINGVATQLQLD FT TASDITVVAKATWRKLGQPCLQPSSIQAINASGQPLGLVGEFVCDVTLNGR FT TERSRCFVTSSPGLNLLGIEWIELFELWSVPIDSICNVVEVKSVEEQLQEL FT RANHAAVFDDSLGHCTKTKVKLYLKPNAKPVFCPKRPVPFNTISLVDDELN FT RLQSLGIITPVEFSEWAAPIVAVRKPNGRVRICADFSTGLNEALEANHYPL FT PTPEEIFAQLSGSSIFSIIDLSDAYLQLEVDDDSKKLLTINTHRGLFQYNR FT LTPGVKSAPGAYQSLLDGMIAGIPGVRTFMDDVIVFGPDRKSHASSLKQLL FT QRLKEYGFHVKAEKCSFFQLQIRYLGHIVDSRGIRPDPEKLRTIAAIPAPT FT NVSELRSFLGAVNFYGRFVRNLHELRRPMDQLLKKDTKWRWTSECQQAFEK FT FKEVLQSSLLLTHYDPKLPIIVAADASSTGIGAVILHQFPDGSLKAVQHAS FT RSLTPAERNYGQPEKEALALVYAVTKFHKYLLGRPFTLQTDHKPLLSIFGS FT KKGIPLHTANRLQRWALTMLNYDFEIQHVSTSDFGCADMLSRLIDRTIQPE FT EEYVVAALTLEEDLVSVIADTIDKVPVSFAALQKATATNATLQAVVKYIRD FT GWPSSAGSVTNSEVLPYFRRRESLSLVDGCVMFHDRVVVPNQFRSQILRQF FT HRGHPGMVRMKAIARSFVYWPGLDNEIEDFVKRCNPCSIAGKAPTKTCLES FT WPTPSKPWSRIHIDYAGPVDGVYFLVVVDPFTKWPEVYATRTMTAKTTIKL FT LTQSFATFGIPEVIVSDNGTQFTGHEFKEFCIKLGIRHLRTAPFHPQSNGS FT AERFVDTLKRSLRKIRSGGETLEEALQTLLQVYRSTPTSDLDGKSPAEVMF FT GRPVRTISALLQPTKDNSPSTSARAEKQNDAFNKKHGAIQRIFKHGDAVFA FT QVHRANSWQWEAGTVIEKIGRVNYNIFLDDRRRLIRSHANQLKTRVSETTA FT SPEPTPLSIFFDGFGLADAVAAPVPATPVPVPVPVDDSDPEFSDDDSEESD FT DQEQSGSSEAEFEDAAVEPPAPAALPAERASGGQGHPVPEAPQGAPAGGGR FT GQRAIKLPGRFSNFWMK" XX SQ Sequence 4355 BP; 1053 A; 1300 C; 1159 G; 843 T; 0 other; atttggcgac gaggaaaaac cggaagcatg gcggaactac aacaggcgat cctcaagatt 60 acggaattgc tgcagaagat ggcggcaccg gcggcagcac cagataacag atcctggagt 120 cgctggccac caacatcagc gagttttctt tcgacgaaga gaacggtgtc actttcgaca 180 agtggttccg gcgttacgag gacatgttca aggaggacgc gtccaagctg aaggaggagg 240 cgaaagtgcg acttttgctc cggaagttgg acaccaccgc acacagccgc tattgtaact 300 acatcctgcc caagcttcca agtgccgtca agttcgacga caccgtggcc acgctacgga 360 aaattttcgg ctcccacacc tccatcttcc acaaacgcta ccagtgtctc cagctggcca 420 agtcggaggc ggaggatatc atcacctacg gaggtaaggt gaaccgggcg tgtgaggact 480 ttgagttcca aaaggtgaag attgtacaat tgcaatgcct catcttcatc tgcggtctta 540 agtcacctcg atacgccgat gttcgggcaa gactcctttc ccgcatcgaa gcggagaccg 600 ccgaaaaccc tgtgacgctc caaacccaga tcgacgagtt tcagcgcctc gtcaatctca 660 aggcggacac gacgctggtc gagcggccat caagctcaaa accagcagtc cacgcggtcc 720 aggagaagcg aagcggtccc cagcagcagc gtccggcgaa aaccgaaggc aagttacccc 780 gaaccccctg ctggcaatgc ggccaaatgc actttgtccg ggactgtccg ttttcggggc 840 atttgtgcaa ggcatgcaac cgagttggcc acaaggaagg ctattgcagc tgcgtttcca 900 agccttccag cagctcttcc aagccggatc attccgtgga ggagaagaaa aagcaggcga 960 tgaagcagca gaaagccaag tcgcgaggaa ttttcgtcaa ccacgtggct gccaatcgca 1020 gcaggcgtaa gtacttgccg atcaccatca acggcgttgc tacgcaactg cagctcgaca 1080 cagccagcga cataacggtg gttgccaagg caacgtggcg caagctagga cagccatgtt 1140 tgcagccatc ctccattcaa gcaatcaacg cgtccggcca gccgctcggg cttgtcggcg 1200 agttcgtgtg tgacgtcact ctcaacggca gaacagagcg cagcagatgc ttcgtgacgt 1260 catcgcctgg actcaacctt ctgggaatcg agtggatcga gctctttgag ttgtggtccg 1320 tgccgatcga ttccatttgc aacgtcgtcg aggtcaagtc ggtggaggag cagcttcaag 1380 agcttcgagc caaccacgcg gccgttttcg acgattccct gggacactgc accaagacga 1440 aggtaaagct ttatctcaaa cctaacgcca agcccgtctt ttgtcccaaa agaccagttc 1500 cgttcaacac catttccctg gtcgacgacg agctcaaccg cctccaatct ctgggaatca 1560 tcacacctgt tgaattctcg gagtgggccg ctccgatcgt cgctgtgcgc aagccgaacg 1620 gacgagttcg catctgtgcg gatttctcaa ctggtctgaa cgaagcattg gaagccaacc 1680 actacccgct gccaacaccg gaggaaattt ttgcgcagct gtctggcagt tccatcttca 1740 gcatcatcga cctctccgat gcgtaccttc agctcgaagt tgatgacgac tccaagaagc 1800 tcctcaccat aaatacgcat cgaggtctgt tccagtacaa tcgcctcact ccgggggtga 1860 agtcagcgcc gggagcttac caaagcctcc tggacggaat gattgcgggc atacctggcg 1920 tccgcacctt catggacgac gtcatcgtgt tcggaccaga ccggaaatcg cacgcatctt 1980 cactcaagca gttgctccaa cgtctcaagg agtacgggtt ccacgtcaag gccgagaaat 2040 gcagtttctt ccagctgcaa attcggtatc tgggacacat cgtggacagt cgaggcatcc 2100 gccccgatcc cgagaagctg aggacgattg ccgccattcc agcaccaacc aacgtgtccg 2160 aactgcgatc gttcctgggc gcggtgaact tctacggaag attcgtgcgc aaccttcacg 2220 agctacgccg cccaatggac cagctgctca agaaggacac caagtggcgt tggacatccg 2280 agtgccagca ggcgttcgag aagttcaagg aggtgctcca gtcaagcctg ctactaactc 2340 actacgatcc gaagctgccg atcatcgtag ccgcggacgc gtcgagcacc ggcatcgggg 2400 ccgtcatact ccaccagttc cctgatggct cactcaaggc agttcaacac gcgtcaaggt 2460 cgctcacacc cgctgagcgc aactacggac aacctgaaaa ggaagcgctc gcgttagtat 2520 atgcggtaac gaagttccac aagtacctgc tgggacgacc cttcacactg caaactgacc 2580 acaagccgct cctgtccatc ttcggctcga agaagggaat tccactgcac accgcgaatc 2640 gcctccaacg ctgggcactc acaatgctga actacgattt cgagatccag catgtgtcca 2700 cgagcgactt cgggtgtgcc gacatgttgt ctcggctcat cgaccgtacc atccaaccgg 2760 aagaggagta cgtcgtggca gcactcacgc tcgaagagga cctggtaagc gtcatcgcag 2820 acacgatcga caaagtgccg gtctctttcg cagctctcca aaaagccact gcgacgaacg 2880 ccacactcca ggccgtggtc aagtacatcc gcgacggatg gccaagcagc gcgggatcgg 2940 tcaccaactc tgaggttctt ccctacttcc gccgacggga atcgctcagt cttgtcgacg 3000 gctgcgtgat gttccacgac cgagtggtgg ttccgaacca gttccgttcc cagattttgc 3060 gacagttcca ccgtggacat cctggaatgg tccgcatgaa ggcgattgcg cgcagtttcg 3120 tctactggcc cggactagac aacgaaatag aagatttcgt taaacgatgc aacccttgct 3180 ccatcgctgg aaaagcaccc accaaaacgt gcttagaatc gtggccaact ccaagcaagc 3240 cgtggtcccg gatccacatc gactacgccg gtcccgtgga tggcgtgtac ttcctggtgg 3300 tggtcgaccc attcaccaaa tggcccgaag tgtacgcaac tcgaacgatg acagcgaaga 3360 cgaccatcaa gctgctgacc cagtcattcg ctacgttcgg tattccggag gttattgttt 3420 ccgacaacgg aacccagttc actggccacg agttcaagga gttctgcatc aagctgggaa 3480 ttcgccacct tcgaaccgct ccgttccacc cacaatcaaa cggatcggcg gaacgatttg 3540 tggacacgct gaagaggagc cttcgcaaaa ttcgctcggg gggagaaacc ctggaggaag 3600 cactgcagac gcttttgcaa gtgtaccgct cgaccccaac cagcgatctc gatggcaagt 3660 ctccggcgga agtgatgttc ggccggccag tccggaccat ttctgctctc cttcaaccga 3720 ccaaggataa ttcgccaagc accagcgcta gggcggagaa gcagaacgac gcttttaaca 3780 aaaagcacgg cgcgatccaa cggattttca agcacggcga tgccgtcttc gcgcaggtcc 3840 atcgtgccaa ctcctggcag tgggaggctg gtaccgtgat cgagaagatc ggcagagtca 3900 actacaacat attcctcgac gaccgccgac gcttgatccg ttcgcacgca aaccagttga 3960 agacacgagt ttccgagacc acagcctcgc cagagccgac gccgctctcg atctttttcg 4020 acggttttgg attggccgat gcggtagcag caccagtgcc tgcaactcca gtgccagttc 4080 ctgttccagt tgacgacagc gatccggaat tctcggacga cgactctgaa gaatctgacg 4140 atcaggagca atccggttca tccgaagcag agttcgagga cgcagctgta gaaccaccgg 4200 cacctgccgc tctcccggcg gaacgagctt ctggaggaca aggacatccg gtacctgaag 4260 cacctcaagg cgcgccagct ggtggaggac gaggacaacg cgcaatcaag cttccaggaa 4320 gattttccaa cttctggatg aaataagggg agaga 4355 // ID Crack-8_CQ repbase; DNA; INV; 3347 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3347 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 39-39 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 57..2870 FT /product="Crack-8_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MQLNIRSMNNVTKFDSIKEVLHRFDRRVDIIVLGETW FT IQQDRVCLFSIAGYKSMFSCRNESNGGLAVFYRDELDVDECCNTIKDGMHH FT IQLVLKRGKPIDFHAIYRPPNFEARRFISEMEQFVSSSKPGHDCIIVGDMN FT IAVNQPSVNTVQEYTRVLESYNTYVTNNVVTRPMSSNVLDHAICSEALLDK FT VVNETVFHDLSDHCLVLSSFNLGCSTARTTLEKDIIDHRRLNQQFHEFMIG FT LPMEMSANEKLTAVITNYNNFVKQCTKTVTVQAKIKGHCPWIRFDLWTNMR FT WKENLLKRSRRNPLDEEVAGLLSHVSRMLQTKKDRCKKDYYQRLLSNTSQK FT NAWRIVGDVLGRNGVNDAPRSVYDNGGVTTTDMNKICQLFNEFFCSVGHKL FT AATIPSDRNICRFNTLPRRPDSMFLSPTTYNEVVSLINQLDTKKCAGPDNI FT SATFIKVHYEVFAQLISDVFNEIIMTGQFPDCLKVARVVPVYKAGDKKDVN FT NYRPISTLSVLDKLIEKLIVCRVNAYVTRKTDDRPAILYSHQYGFRPGSST FT LTATCDLVEDIYSSLDSKLLAATLFIDLKKAFDTINHDLLLQKLERYGIRG FT TPLELLKSYLQGRQQYVSIGKHRSGWCPITVGVPQGSNLGPLLFLLFINDI FT GKLQLNGKTRLFADDTSVSYRGGTCEELQRQMAADIVLLNDFFRTNVLSLN FT LAKTKYMIIHSSRRRVPDHPQLIVNGQTVEEVSSYPFLGLVLDSTMSWTAH FT IRALKSKLSSLCGIFWRISSFIPYPQMKMLYFALVHSRIQYLVANWGAACK FT TDLHDLQVLQNRCLKIISRKPLLFPTTQLYSDCTDSILPVKALYELQTMML FT HRKISTDAKQHHNFALRRRESSRASRQLGDFILPRPYTEFGRKKFSYLGGK FT LYNALTTDCKKTTTIAAFKRLLTINMKQRIHLFV" XX SQ Sequence 3347 BP; 1001 A; 837 C; 729 G; 778 T; 2 other; ttgawgagtg gcttatttcc aataccaatc taccccatga gtcgtctttg agaattatgc 60 agctcaacat tcgcagcatg aacaatgtaa caaagttcga cagcataaaa gaggttctac 120 acagattcga cagaagggta gacataattg tactaggtga aacgtggatt caacaagatc 180 gcgtttgttt gttcagcatt gcggggtaca agtctatgtt ttcctgtcgg aatgaatcaa 240 atgggggcct ggcagttttt taccgagatg aactcgatgt ggatgaatgc tgcaacacaa 300 ttaaggacgg tatgcaccat atccaactgg tgctgaagcg gggtaagccg attgatttcc 360 atgcgattta tcgtccacca aatttcgaag caaggagatt tatttccgaa atggagcagt 420 ttgtgtccag cagcaaacca gggcacgatt gcataattgt tggagacatg aacatagcag 480 taaaccagcc atccgtcaac accgttcagg agtacacgcg tgttctggag tcctataaca 540 cttacgtgac aaacaatgtg gttactcgac ctatgagctc aaatgtgctg gatcatgcca 600 tctgctctga agcgctgctc gataaggttg tgaatgaaac agtctttcat gacctgagcg 660 atcattgttt ggtgctgtct tcattcaatc ttggttgcag cacagcgaga acgactctgg 720 agaaggacat catcgatcat aggcgactaa accaacaatt ccacgagttt atgatagggt 780 tgccgatgga aatgtcagcc aatgagaaac ttacagcagt aattaccaac tacaacaatt 840 tcgtcaagca atgcactaaa acagtaacgg ttcaggctaa gataaagggt cactgtcctt 900 ggattaggtt tgacctgtgg actaacatgc gctggaaaga aaatcttctt aaaagaagca 960 gacgaaatcc gctggacgaa gaagtagcag gcctgctgag tcacgtgtca cggatgctgc 1020 aaaccaaaaa ggacaggtgc aagaaagatt actatcaaag attgctttca aacaccagcc 1080 agaagaacgc gtggaggatt gtaggagacg tacttggaag gaacggtgta aatgatgcac 1140 ctcggtctgt ttacgacaac ggaggcgtga cgactacgga catgaacaaa atctgccagc 1200 tgttcaacga gttcttctgc agtgttgggc acaaacttgc ggccactatt ccaagtgatc 1260 gaaatatatg tcgcttcaac actctgccgc gccgtcccga ttctatgttc ctcagtccca 1320 ctacgtacaa tgaagtagtc tcgttgatta accagctgga caccaagaaa tgtgcgggac 1380 cggacaacat atctgcaact ttcattaagg tgcattacga ggtgtttgcg cagctgatat 1440 cagatgtctt caatgaaatc atcatgactg gacagttccc ggattgcttg aaagtggctc 1500 gagtcgtacc ggtttacaag gcaggagata agaaggacgt maacaattat cgaccgatct 1560 caaccctctc agtcttagac aaactgatag agaaattgat cgtttgccgc gttaacgcat 1620 atgtaacaag aaagaccgac gatcgccctg caattcttta ctcacaccaa tatgggttca 1680 gaccggggtc gagcacgctt acggcaacgt gtgatctagt cgaagacata tacagctcgt 1740 tggactccaa actacttgca gccaccctct tcattgacct caagaaggca tttgacacca 1800 taaaccacga tcttctccta caaaagctgg aacgctatgg gatcagagga acaccacttg 1860 agttactgaa aagttatcta caaggacgac agcagtacgt gtctattgga aaacacagaa 1920 gtgggtggtg tcccataact gttggagtgc cccaagggag taacttgggt ccgttactgt 1980 tcttgctctt catcaatgac atcggcaaac tgcagctcaa cgggaaaact cgactgtttg 2040 cggatgacac gtcagtgtca tacagaggag gaacctgcga agaattacaa cgacaaatgg 2100 cagcagacat cgtcctcctg aacgacttct tcagaacgaa cgtgctatcc cttaaccttg 2160 cgaaaaccaa atacatgatc atccactcgt caaggaggag agttccagat caccctcaac 2220 ttattgtaaa cggacagact gttgaagaag tttccagtta tccctttctt ggccttgttc 2280 ttgatagtac catgtcctgg actgctcata taagggccct caaaagcaaa ctcagctctc 2340 tgtgtggaat attttggaga atctcgtcgt tcattcctta cccacaaatg aaaatgctgt 2400 actttgcttt ggtacactcc agaatccagt acctagttgc aaactgggga gcagcctgta 2460 aaacggatct acacgactta caagttcttc aaaatcgctg cctgaaaatt atttcacgca 2520 agccattgct gtttcctaca acccaacttt actcggattg tactgattcg atccttccgg 2580 tgaaagcatt gtatgaacta caaaccatga tgctgcaccg caagatctca acggacgcca 2640 agcaacatca caactttgca ctgaggagaa gggagtcatc tcgcgcatcc cgacaactag 2700 gagatttcat cttaccacgg ccgtacacgg agtttggaag gaaaaaattc tcctacctcg 2760 gggggaagtt gtacaatgca ctgacaaccg actgcaaaaa aacgacaacc atcgccgcct 2820 tcaaacggtt actcaccatc aacatgaagc agagaatcca cctttttgtg tagaagtcaa 2880 aactaatgtt ctgttacatc tcgtttcatg ccaccaccgc ccaccgccca ccgcccaccg 2940 cccgccgacc accgcccacc gcccaccgcc caacgcccac cgcccgccgc ccaccgccca 3000 acgcccaacg cccaccgccc accgcccaac gcccaccacc caccagcaac cgcccatcgc 3060 caaacgccgc tcatcttagt caacaactgc caccaccgct cgccatcaat accaaatcat 3120 agaatgtagt tagaatataa gaaatgcact gccttaaaag agcttaggct cactggctag 3180 tgcaacagat tattgtggaa ggttaaagat caattgaaaa gaagggagcc catccccaaa 3240 cttaaaaaga cagccgagcc taaccccagg tcaatcttgg agtaggtgaa tattccagca 3300 aaatgctgag aagtgtgcaa ttcggtcttt aataaaaaaa aaaaaaa 3347 // ID Copia-124_AA-LTR repbase; DNA; INV; 229 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-124_AA_; KW Ty1_copia_Ele102; Copia-124_AA-I; Copia-124_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-229 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 229 BP; 57 A; 61 C; 45 G; 65 T; 1 other; tgttaggata ggcaatatcc gaaccctgtt cgttcgacca aacgggccga acccactaag 60 cgatgagctg tcattcaagc acaagcgcaa aagtgggctc gccttttctt agtacagttt 120 ccattcaatc gttgttgcga gaaattcgcc tcaataaata ccgatttata gtttgcaawt 180 tcgcgtgttc cgcgtttcat ttccggtatc actgcctccc taaatccca 229 // ID BEL-39_CQ-LTR repbase; DNA; INV; 821 BP. XX AC AAWU01047176; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-39_CQ_; KW BEL-39_CQ-I; BEL-39_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-821 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 232-232 (2011). XX DR Genome; AAWU01047176; Positions 4012 4832. XX SQ Sequence 821 BP; 277 A; 142 C; 176 G; 226 T; 0 other; tgttggcgct agaaaattaa ttatagtaaa aatttggctt taaaagcaaa tcatcaattt 60 aagatttcat gagcagaaga attgactaga cttaattgaa atatcataga aagtgaaagc 120 agcaaactcc gcgagatgtt tgaaatggat gcaacgacaa atcaggtaga atgaaggaag 180 gacggaaacc agtagaacgc agttgcatca gataagaaat ggtcccacat ttagggttcc 240 ttatgcgcgg tacacgcacg cgtgggtgcg tggtcagatt atataattgc tagtttagag 300 aaacactatt tgtccctttc tatctttaag gaaggataga actaaacagt acaaatgttt 360 tgaaatgcat taggaactct ttaggaccca cagaaaaact gtgtttagag ctagatctga 420 gaaaacgaaa acgatggaac cgaaagagtt acaggatgca gggaattcac accctgacaa 480 ggccacatca aatctcatcc aaattcagga gaaaaaatca tttacgacgg gactagatag 540 gttagatttt cggtttagaa attatgtaac gctaggagaa cttgtagaat taagacgaca 600 ttgtacgaat tattttcaat aaagcaagtt cggaggtcat ctcaaacctg tgtattaatc 660 atactcagta ttgaactcct gtcttcgatt tttgggaaac ctacggtttc gactcatcca 720 accttggtag cggcttcatc gagatcatcg cctaggactg cggcagtgtt cgacgtaagt 780 ttccagttat aaaaacgttt cgttttttcg acgtttttac a 821 // ID Gypsy17-LTR_Dpse repbase; DNA; INV; 1367 BP. XX AC Unknown_group_151; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy17_Dpse; KW Gypsy17-I_Dpse; Gypsy17-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1367 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1106-1106 (2009). XX DR Genome; Unknown_group_151; Positions 42093 40727. XX SQ Sequence 1367 BP; 600 A; 216 C; 219 G; 332 T; 0 other; tgaagatcag agagaaatac gtcatagatt tctatatggt tggtaaccaa cagttcctat 60 catgtattga cgtatactcg aagtttgcca ctcttgtaga agtaaaaagt cgtgactggt 120 tagaagcaaa aagagccatc atgaaagttt ttaacgagat gggcaaacca attgaaatca 180 aagccgataa agattcagct tttatgtgca cagcattgca agcatggctg aatgcggaaa 240 acgtcaaaat agagatcact tccagcaaaa atggaatatc tgacgtagaa agatttcata 300 aaacagtaaa tgaaaaatta agaataatat ctagcgagga taacacagaa gatagattca 360 cgaagtttga attaattctt tatacttata atcataaaac caaacataat acaacaaata 420 ggacaccagc tgatatattc atttatgcag gctcacctga ttatgataca caactaaata 480 aagtgaataa aatcactcaa ttaaataaaa accgcataga gtatgaagta gatacccgct 540 ataaactatc acctctagtt aagtctaagg taacaaatcc tttcaaaagg acgggtgaaa 600 ttagacaggt agacgaaaaa cattatgaag aaaagaatag gggtaggaaa ataactcatt 660 ataaatcaaa attcaagaaa aagaagaaat ctaatagaag caaatataat aattccagag 720 aaaccgagga agttaatgga atcggcgaac ttcaacacaa ttaaaatcct catcttcctc 780 ataattacca tcgtaatggc acagggccaa aacatagaaa taaattctat aaaatcccaa 840 aacggatata tgatattcaa aactggatcg atcaacatac ctatcaatta cgaataccat 900 tatctaactg ttaatgtaac taaaaccgaa gaactgtacc aaaatttgtt aaggcaagcc 960 actaaattcc aggacataat ccaaataaaa tatctagtag acaaattgca aagagaaatg 1020 aacgggctaa aaataaccaa aagaaataaa agaggcctaa tcaatatagt aggaacagcc 1080 tacaaatatc tctttggaac actagatcag gaagataaag tcgatttgga acaaaaaata 1140 gaaaatttag ccagccatag cattcaaatg aatgaactaa atttggtaat agatgcagtt 1200 aatagcggaa taaacgttat aaacaaacta aatgaagaaa aagacaggaa tcaacaaata 1260 gaaattttga tattcaatct acaacatttc acagaatata tcgaagatat agaactaggt 1320 atgcaattaa ctaggctcgg tatatttaat ccaaaattat taaaaca 1367 // ID Gypsy-64_AA-LTR repbase; DNA; INV; 193 BP. XX AC AAGE02025748; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-64_AA_; KW Gypsy-64_AA-I; Gypsy-64_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025748; Positions 24493 24685. XX SQ Sequence 193 BP; 63 A; 30 C; 35 G; 65 T; 0 other; tgttaaatag ttataataga cccctcgtca taagatctag ataaactgtg gtagatgtac 60 ttgcataaga actatcttga agataacaat tatcttgaca tgaattgttt ttccggttta 120 ctactataaa aggaattgta acgagaggct cagcctcttt ctcctgttgg attttgtgga 180 ataaagaaca tca 193 // ID Copia-127_AA-I repbase; DNA; INV; 4047 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-127_AA_; KW Copia-127_AA-LTR; Ty1_copia_Ele190; Copia-127_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4047 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1416-1943] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 30..3548 FT /product="Copia-127_AA-I_1p" FT /translation="MERSGIAKLTNENYETWKLEVEFLLVRESLWKYIVPG FT VKPEEAENGANVAELAAWDEGDQKARATIGLLLTRSQHGHIRATRTAKDVW FT ENLRKQHEKKSLTSKVHLLKRICDLKFQEGDDIEEHLAEFETLFEKLANAG FT TKLDDDLQVILVFRSLPNTFDALTTTLENRSDDELTLALVKDKIINEVQKR FT GEAVPADSAVLKVDGKGKSIVCHHCQKVGHKKRDCALWLKQNRGENSRSQQ FT AVAIDTKKVARQHAKARKAASSEENFVFSAREGPVMNWIVDSGASSHMCAN FT REFFVSVENPREGTPSFVTVADGKKAAVKGVGNCQIQCYGRAGEEKQIVLT FT GVLYVPDLDMNLVSVGKLVQKGADVKFNESGCTIASGDRIAAVARRSNGLY FT HLQMVERVGAVNERCVSKSCLHDWHRKLGHRDVKAIQELENRQLATGISIK FT KCGLVLPCETCMQGKMARLPFPKKAEKKTSAVMDLVHSDLCGPMRTVTPGG FT RRFFLTLIDDHSRYTTVYLLKRKSETFDAIQDYVQKMKTQFGKPPKVLRSD FT QGGEYRGSNLVAFLKREGIKQQFTAAYSPQQNGIAERKNRSLVEMARCMLL FT EAKMSNRYWGEAINTANYLQNMLPTKAVELTPYEIWHGAKPDVSKLQVFGT FT TAYVFVPDAKRTKLESKAEKLTFVGYSLQHKAWRFINMKTNKLTISRDARF FT LPPTDNNEEVPADEKGTLFFPLSFETEQQKQASLPGPAVAAGNEPVVSDSE FT DDFHGFEDHSDDSQYEDAGIDDDDHAVPQDEDVSQPEVELARNEQAVSTPL FT RSTRMNKGVPPQRYCADGRLAHYNCDEPRSYPEAVSGPEANLWKQAMQDEL FT KSLHERNTWELTTLPPGKKAIGCRWIFKKKADESGKVIRYKARLVAQGFTQ FT KYGVDYDEVFAPVVKQLTFRALFSVASQRNLLVKHADVKTAYPNGDLKETV FT YMRLPPGCAADGENVVRHLRKGLYGLKQAARIWNHKLDDVLKQMGFKSSKN FT DSCLYVRRSKGKLAYIAVYVDDFVIVCETEREYEDIIKKLNEFFTVTSLGD FT IKFFLGIRVTRSEKHIALNQKRYIKKLLSRFGLEDAKPSSFPLDPGHLHPK FT EEIPLPNNHQYASLIGGCCTWPCALARMLQQQFQFLAEKLVVPPRLTGWKL FT NASCDTSKVLSTTN" XX SQ Sequence 4047 BP; 1127 A; 937 C; 1138 G; 844 T; 1 other; agaggttttg ggccctgcaa aagtacaaga tggaaagatc tggaatcgcg aaactgacga 60 acgaaaacta cgaaacctgg aagcttgaag ttgagttcct gctggtgcga gaaagccttt 120 ggaagtacat cgttcccgga gtgaagccgg aagaggctga aaacggagcg aacgtggcgg 180 agctggctgc ctgggatgaa ggtgaccaga aagcacgtgc gaccatcggg ctgctgctaa 240 ccaggtctca acacggccat attcgggcaa cgcggaccgc caaggacgtg tgggagaacc 300 ttcggaagca acacgagaag aagtcgttga cgtcsaaagt acatctcctg aagcggattt 360 gcgatctcaa gttccaggaa ggtgacgata tcgaggagca cctggccgaa ttcgaaactc 420 tgttcgagaa gttggccaac gctggtacga agctcgatga cgacctacag gtaatcctag 480 ttttccgcag tttacccaat acgttcgatg cgctgaccac cactctcgag aaccggtcgg 540 atgacgagct tacgctggcg ctcgtaaagg ataaaattat caacgaggtc caaaagcgag 600 gagaagctgt tccggcggat tcggcggtgc tgaaagtgga cggaaaaggt aaatcgatcg 660 tgtgccacca ctgtcagaag gttggccaca agaagcgaga ttgtgctttg tggctgaaac 720 agaaccgtgg tgaaaattct cgaagtcaac aagcggttgc catcgacacc aagaaggttg 780 ctaggcaaca tgcgaaagca aggaaggctg cgtcgagtga agaaaatttc gtgttcagcg 840 cacgagaagg accagtgatg aattggattg tggattccgg tgccagctcg cacatgtgcg 900 ccaatcgtga atttttcgtt tccgttgaga accctcgtga aggtacaccg agctttgtga 960 cggtagctga tgggaagaag gcagcagtga aaggtgttgg caactgccaa attcaatgtt 1020 acgggcgtgc cggtgaagaa aagcagattg tgctgaccgg agtgctgtac gtcccggacc 1080 tggacatgaa ccttgtctcg gttggaaagc ttgtccagaa aggtgctgac gtgaagttca 1140 acgaaagtgg atgtaccatt gccagtggtg atcgaatcgc tgctgtggca aggcgaagca 1200 acggtctcta ccatctgcaa atggtggagc gcgtgggtgc tgtgaacgag cgatgtgttt 1260 cgaaaagctg tttgcatgat tggcatcgca aactcggtca tcgggacgtg aaggccatcc 1320 aggagctgga gaaccgacaa ctggccaccg gtataagcat caagaaatgc ggactggttt 1380 tgccatgtga aacgtgtatg caaggtaaga tggcacggct accattcccg aagaaggcag 1440 agaagaaaac cagtgctgtg atggatctgg ttcacagtga cctgtgtgga ccaatgcgca 1500 ccgtgacacc gggtggtcgt cggtttttct tgaccctcat cgatgaccac agccggtata 1560 caacggtgta cttgttgaag aggaaatccg aaacgtttga tgcaatccag gactacgtgc 1620 agaagatgaa gacacagttt ggcaaaccgc cgaaagtgct gaggtccgat caaggaggag 1680 aataccgtgg ctccaatttg gttgctttcc tgaagcgtga aggcatcaag caacaattca 1740 cagcagcata ctcgccgcaa caaaacggga tcgccgagag aaagaaccgt tcgctggtgg 1800 agatggctag gtgcatgcta ctggaggcga aaatgtccaa ccgctactgg ggcgaagcaa 1860 tcaacacggc caattatctt caaaatatgc tgccaacgaa agcggttgag ttgacgccct 1920 acgaaatttg gcatggcgcc aaaccggatg tttcgaagtt gcaagtcttc gggactacag 1980 cgtacgtttt cgttcctgat gccaaacgaa cgaaactgga atccaaggct gagaagttaa 2040 ccttcgtcgg ctattcgttg caacataaag cgtggcggtt catcaatatg aagacgaaca 2100 agttgaccat tagccgggat gctcgcttcc tgccaccgac agataacaac gaagaggttc 2160 ctgcagatga aaaaggtacg ttattctttc cattatcatt cgagactgag caacagaagc 2220 aagcaagtct gccaggaccg gcagtcgcgg ctggtaacga gccagttgtt agcgatagcg 2280 aagatgactt tcatggattc gaagatcatt ctgacgacag ccagtatgaa gacgcgggta 2340 tagacgacga cgaccacgcc gtaccgcaag acgaagatgt gtcacaacca gaagtggaac 2400 tggcacgcaa cgagcaagct gtgagtacac cattgcgatc aaccaggatg aacaaaggtg 2460 tcccccctca acggtactgt gctgacggta gactggcaca ctataattgc gatgagcctc 2520 gaagctaccc agaggcagtg tctggtccag aagcaaattt gtggaagcaa gccatgcaag 2580 atgagctgaa gtcgctgcac gaacgcaaca cgtgggagct gaccacgtta ccacccggca 2640 agaaagccat cggatgccgc tggattttca aaaagaaggc ggatgaaagc ggtaaggtca 2700 tcaggtacaa ggcgcgtcta gtcgctcaag gctttacgca gaaatatgga gtggactacg 2760 acgaagtttt tgcacctgtg gtgaagcagc taacgttccg ggctctgttc tccgttgcca 2820 gtcaacgcaa tcttttagtc aagcatgctg atgtgaagac agcttacccg aacggagatc 2880 tgaaagagac ggtatatatg cggctacctc ccggatgtgc agccgacggg gaaaacgttg 2940 tgcgccatct gcggaaggga ttatacgggc taaagcaggc cgctcgcatt tggaaccaca 3000 aattggacga cgtgctcaag caaatggggt tcaagtcatc gaagaacgac agttgtctct 3060 acgtcaggag gagcaaaggt aagctggcct atattgctgt ctacgtcgat gatttcgtca 3120 tagtgtgtga gaccgaacgg gaatacgagg acatcatcaa gaaattgaac gaatttttca 3180 cggtgacatc gctcggggac atcaaattct tcctcggaat tcgagtgacc cgttcggaga 3240 aacatatcgc gctgaaccag aaaaggtaca tcaagaagct cctgtccagg ttcggtctag 3300 aagatgcaaa accgtcaagc tttccattgg atccaggcca cctacatcca aaggaggaga 3360 tcccgctgcc caataaccac cagtatgcca gcttaattgg gggttgttgt acgtggccat 3420 gtgcactcgc ccggatgttg cagcagcagt ttcaattctt ggcagaaaaa ctagttgtcc 3480 ctcccaggct gactgggtgg aagctaaacg catcctgcga tacctcaaag gtactgtcga 3540 ccacgaactg atcctgggtg cggattccaa gcaactggaa gtatttgtag atgccgactg 3600 ggccggtgac accaaggatc gcaaatcgac cactggttat ttgatccggt acgcaggtgg 3660 tatgattggg tggtgctcca ggaagcagga atgcgtgacc ttgagcagtt cagaagccga 3720 gtacgttgca attacggaaa gctgcaagga gttatcctgg gttttacggc tattcgacga 3780 tctggacatc aagactcagc tgccagtcat tatccacgaa gataaccaga gcgcaatcaa 3840 gcaattggaa tccaattcca gtgaacgccg atcaaaacac gtggacacga gattccacta 3900 cgctcgtcag ttgaagcaac agggaatcat cagtccacaa tatcttccaa caaacgaaat 3960 gattgcagac atgatgacga agccacttcc tcgtgtcaag cagtcaagat tccgtgtcgc 4020 agctatgatt cttccgtcga ggaggag 4047 // ID I-7B_AAe repbase; DNA; INV; 6629 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-7B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6629 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1358-1358 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 18 sequences with >98% CC identity. The consensus is ~83% identical to I-7_AAe. XX FH Key Location/Qualifiers FT CDS 926..2572 FT /product="I-7B_AAe_1p" FT /translation="MEPNDKGGGQSQYEISSDDESNPNRSNVVLTSIPQNI FT TEMETNEVSPTLPITPQGYASRTNGSSSPIPSEDGESIESPSYEYLACAQV FT PMDTSSNTRNVNQPLVLSSSSGTVKEATPRLKAYPPGSKGPFLVFFRPKGK FT PLNKLQIEKDLAKSFRGIESVDAPSRDKLRVTVSDREQANKIVAYKLFSME FT YRVYLPSREIEIAGVVTEPFLSCADIKSGVGGFKNRAVPPVAILDARQMNQ FT VAPDGTKQPSNSFCVTFSGSALPDYLVIGKLRLPVRLYKPTVMHCDKCQQI FT GHTSPFCCNKPRCAKCGELHVQGACSSDPKCSCCGQAPHELTACPKFIERE FT KHQIRSLKQRSKQSYAEMLKKIAPTVVPPAHSIASNNLFTTLPDDDQGSDS FT EAGEEYNVIQTGTKRKRAIARRHRQQPSHVPVAQQNSRLSLKKSRSDGNTT FT KVTPPGFKFKPGDFPSLPGSSKTPDVPVFRPESQQTNIFTSRQEHVETSDK FT ITLSGIVEIIFEVMEVSPAIRNLISKALSFIKPLLKRLASKWPILESFISF FT DG" FT CDS 2568..6242 FT /product="I-7B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDNTSNEVGDMVEILQWNCRSISKNIEAFKYLVHSTR FT CDVFALSETWLTSDKNLSFHDFNIIRQDRGDGYGGVLLGINKLHSFYRIDF FT PPMTGTEVVACQVTIRGKSFSIASTYLPPSARISRRELTAICCAMPAPRLI FT VGDFNSHGTAWGSMYDDNRSTLIYDLCDDFNLTILNTGEATRIKPPAPPSI FT LDLSICSNSLSLDCTWKVIQDPHGSDHLPIKISISNDSCQARQIDVAYDLT FT KHIDWGKYAEAISDGEQSVDALPPLEEYQFLSELIINSAIEAQRRPIPGAK FT VRRRPPTPWWDDECTAVYREKSVAFKEYRKRGSRENYERYTSLERKFGSLV FT KAKKKGYWRNFVNGLSRETSMTTLWTVGRRMRNATAVNEDKEHSSRWIFQF FT AKKVCPDSVPVHGKVRVNSTERNEVDRPFSMLEFSLALLSCNNSAPGMDRI FT KFNLLKNLPDVAKRRLLNLFNQFLERNIVPDDWRQVRVIAIQKPGKPASDY FT NSYRPIAMLSCLRKLLEKMILHRLDKWVESNGFLSDTQFGFRRGKGTSDCL FT ALLSTEIQLAYAQKEQMGSVFLDIKGAFDSVCIDVLSDQLHDCGLSPLINS FT YLYNLLSEKRMSFSHGTSTASRISYMGLPQGSCLSPLLYNFYVRDIDDCLV FT GNCSLRQLADDAVVSVTGPGADDLQRPLQDTLDNLSTWALKLGIEFSPEKT FT ESVVFSKKHVPAQLGLQIMGKELTQGLSHMYLGVWFDSKCTWGKHIKYLYQ FT KCQQRINFMRTLTGTWWGAHPDDLIMLYRTTILSVLEYGSFCFQSAAKTHI FT LKLERIQYRCLRIALGCMNSTHTMSLEVLAGVLPLSDRFAELSLRFLIRCE FT VLNPLVIENFEKLIERNPQTKFMTLYYWYMSLEINPSLNAPNRSCFPNLSN FT STVGFDLTMRRETHGIPDLLRSEXIPLIFANKFGQVSSDRTFYTDGSKTND FT STGFGVYNEFHSAAHKLQNPCSVYVAELAAIHYALERIASLPSDRYFIFTD FT SLSSIEAIRSMRPVKHSPYFLXEIRSKLSALSNHTITLVWVPSHCSIPGNE FT KADALAKVGAMEGDIYDRQIAFEEFFTIARQQTLLSWQQKWDKGDLGRWLH FT SILPQVSKKSWFKGMNMSRDFIKIMCRLMSNHYSLGSHLYRIGLADNNRCG FT CGAGYQDINHVVWYCPEYGIARSNLCTSLRARGKPDKEDIRDVLGRLDFDY FT MLLVYKFLKQIDVFV" XX SQ Sequence 6629 BP; 1795 A; 1566 C; 1403 G; 1862 T; 3 other; attcctttcc gaacgctagc cagtatcggt tgtgttttgc gcagtttctc tatagttcga 60 agcaaattga tttcactgcg aagtgaaccg ttttttctcc tcgcgttgcg tggtttttta 120 cggtgttaca atttccagtt gaaataaccg atcatccgcg tttcggatca cggaagcagc 180 cagccttagc ctagcctatt ggttggactg tccagcttta ccgaaggcca tcccgtcgca 240 gcattgtgtc atcatcatat ttgccatcat catcaccatt atcatcatca gcaacagcag 300 cacagaagag gcctcaaacc atcaagttgc tgtgttgcct accgggtgcc tatcgtttcg 360 aacagcaagt cccggacaat acatccatcg gcggatctgc acgtgtttac gaaaaacaac 420 agcagtgcag agaagccagt cggtcacatt ggatgaatag gtatttattt cgttctgccg 480 atctctttcg tctgcccgcc tatcttcctt ctcgacgcta gccggtgatc aactcaccta 540 ccagtgtata gttgcctttt tgcgttcatt ttcttattga actttttttt tttgtaaata 600 tttttttttc ttccctcatt ttttttggtg attaattcac ctaaaggtaa atttgtctac 660 ctctgctatt actattagta ttattattat tatatttttt tttttgttca attttttttt 720 tttttttttt ttgttttgag gtgattgatt cacctgaagt gcgaacgaac ttaaccgcac 780 tttaaatttt acaataagta gttctttatt tcgttattat tattatttta agtgtaaata 840 ttgttcaaaa tctatctgct gggtgctaaa aattgattgc acatacgatt ttgaattctt 900 cctggtgggg taatctcacc ccacaatgga accaaacgat aaaggcgggg gtcagtcgca 960 atacgaaatt tcctcggatg atgaatcaaa tccaaaccgg tcaaatgtcg ttctaacatc 1020 gattccgcag aatatcacgg aaatggagac caacgaagtc tcacccaccc tccctatcac 1080 cccgcaagga tatgcatccc gtacaaacgg ttcatctagc cctatcccct cggaagatgg 1140 tgaatcgatt gaatctccgt catatgaata tttggcctgt gctcaagtgc cgatggatac 1200 ctcctccaat acgcgtaatg taaaccaacc cctggtgctc tcctcctctt ctggtactgt 1260 taaggaagcg accccccgcc taaaagccta tccacccgga tctaagggtc cctttctggt 1320 tttctttcgg cccaaaggca aaccattgaa caaattgcaa atagaaaaag atctggcaaa 1380 gtcgtttcgg ggcatcgaat cagtagatgc tcctagtcgt gacaaattgc gtgtcacggt 1440 tagtgatcgt gaacaggcca ataagattgt tgcctacaaa ctcttctcga tggagtacag 1500 agtatacctt ccaagtcgag aaattgagat cgcaggagtg gtcaccgaac cgtttttgag 1560 ttgcgctgac atcaaatctg gtgttggagg atttaaaaac cgtgccgtcc ctccggtcgc 1620 catacttgat gccagacaaa tgaaccaggt ggctcctgat ggcacaaaac agccgtcgaa 1680 ttcgttttgt gtaactttct ctggatcggc gcttccggac tacttggtga tcgggaaact 1740 tcggttgccc gttcgcctct ataaaccgac ggttatgcac tgtgataagt gccagcaaat 1800 tggtcataca tcacctttct gctgtaacaa gccacggtgc gcaaaatgcg gtgagctaca 1860 tgtgcagggt gcctgcagtt ctgatcccaa gtgttcatgc tgtggtcaag ctccacatga 1920 gctcactgca tgcccgaagt ttatagagcg ggagaaacat cagatacgct ctctaaaaca 1980 acggtcaaag caatcttacg cagaaatgct aaagaagatc gcaccgactg ttgttccacc 2040 agcccactct atcgctagta acaacctctt tactaccctg cctgatgatg accaaggctc 2100 tgactctgag gcgggtgaag aatataatgt tatacaaaca ggaacaaagc gaaagcgagc 2160 tatagcacga aggcaccgtc agcagccttc tcatgttcct gttgctcaac agaattctcg 2220 actctccctg aaaaagtcaa gaagtgacgg aaataccact aaagttaccc ccccaggttt 2280 taaattcaaa cctggagatt tcccttcact tccgggatca tctaaaaccc cagatgtccc 2340 agtttttcgc ccggaaagcc aacaaacaaa catcttcaca tctcgacaag aacatgtcga 2400 aacttccgac aaaataacgc tttctgggat tgtggaaatc atcttcgaag taatggaagt 2460 ctctcccgca ataagaaacc ttattagtaa ggcactttct ttcataaaac ctcttctgaa 2520 gcgactggct tcaaaatggc cgattcttga atcgttcata tctttcgatg gataatacat 2580 ccaatgaggt cggggatatg gtcgaaatcc tacagtggaa ttgcagaagc atttctaaaa 2640 atattgaagc gtttaagtat ttagttcaca gcacacgctg tgacgtattt gctctcagtg 2700 aaacatggct aacttctgat aagaatctct ctttccacga tttcaacatt attcgccaag 2760 atcgtgggga tggttatgga ggggtattat tggggatcaa caagctccac tccttttata 2820 gaattgattt tcccccgatg acaggcactg aagtagttgc atgtcaggtt actatacgag 2880 gaaaaagctt tagcatagca agtacgtacc taccgcctag tgccaggata tctcgcagag 2940 agcttacagc catctgctgc gctatgccag ccccacggtt gatcgtaggt gatttcaatt 3000 cacacggtac agcctggggg tcaatgtacg acgacaaccg ttcaaccttg atatacgacc 3060 tatgcgacga cttcaacttg actattttaa acactgggga agcaacacgt atcaaacctc 3120 cagctcctcc aagtatttta gacctctcaa tctgttcgaa ctcattatca ttggattgca 3180 cgtggaaagt aattcaagat ccccatggta gtgatcacct gcctatcaaa atttcaattt 3240 ccaatgattc gtgtcaggct cgccagatcg acgtagcata cgacctcacc aagcacattg 3300 actggggaaa atatgctgaa gcgatctccg atggcgagca gtcggtcgat gctcttcctc 3360 cgttggaaga gtatcagttc ttatccgagt taattatcaa cagtgctatt gaagcacagc 3420 gtcgaccaat accgggagcg aaagttcgaa gacggccacc caccccttgg tgggatgatg 3480 agtgcaccgc agtatatcgg gaaaagtccg tcgcgtttaa agaataccgg aaacgcggtt 3540 ctcgtgaaaa ctatgagcgc tatacctctc ttgaacgcaa gtttggtagc ctcgtcaaag 3600 cgaagaaaaa aggatattgg cgcaatttcg taaacgggct ttctagggaa acttccatga 3660 caacattatg gaccgtcggt agaagaatgc ggaacgcgac agcggtaaat gaagacaaag 3720 aacactcttc tcggtggatc ttccagttcg ccaagaaagt gtgtcctgat tccgttccag 3780 tacacggaaa ggttcgcgtt aactcgaccg aaagaaacga agttgataga cccttttcga 3840 tgctagaatt ctcacttgct ctcctttcat gtaacaattc tgccccagga atggatagga 3900 ttaagttcaa cttgctcaaa aacctcccag acgtcgcgaa gaggcgcttg ttgaacttat 3960 tcaatcagtt tctggaacgc aacattgttc cggatgattg gagacaagtg agagtgattg 4020 ccatccaaaa gcccggaaaa cccgcgtcgg attacaactc gtaccgccct atcgcgatgc 4080 tgtcctgttt acgcaagctg ttagagaaga tgattcttca tcggcttgac aaatgggttg 4140 aatcgaacgg ttttctatca gatacgcaat ttggtttccg cagaggcaaa ggaacgagcg 4200 actgtcttgc gcttctttcg acagaaatcc aactggccta tgctcaaaag gagcaaatgg 4260 gatcggtttt tttggacatt aagggagctt ttgattcagt atgcatagat gtcctttcag 4320 accaactcca cgactgtggg ctttcaccat taatcaacag ctatttgtac aatttgttgt 4380 ctgagaaacg tatgagcttt tctcacggta cctcaacagc ttcacgaatt agttacatgg 4440 gtctccccca gggctcatgc ttaagccccc ttctttacaa tttttatgtc agagacattg 4500 atgattgtct cgtgggaaat tgctcgctaa gacagcttgc agatgatgcc gttgtttccg 4560 taacaggacc aggggcggat gatttgcaaa gaccactgca agatactcta gacaatttgt 4620 ctacttgggc tttaaagctg ggtatcgaat tctctccgga gaaaactgag tcggttgtct 4680 tttctaagaa gcatgtccca gcacagttgg ggcttcaaat catgggtaag gaactaactc 4740 agggtctttc acatatgtat cttggggtct ggttcgattc caaatgcact tggggaaagc 4800 acattaagta tctgtatcag aaatgccaac aacgaatcaa cttcatgcgc acactgaccg 4860 gaacatggtg gggagcccac ccggacgatc tgataatgtt ataccgcact acaatcctct 4920 cggtccttga atacggtagc ttttgtttcc aatccgccgc gaaaacacac attctgaagc 4980 ttgagcgtat ccagtatcgc tgtcttcgaa tcgcgttagg ctgtatgaac tcgactcata 5040 caatgagttt agaggtatta gcaggagtcc ttcctttatc agatcgtttc gcggaattgt 5100 ccctccggtt cctcatccgc tgtgaggtat taaacccatt ggttattgaa aattttgaaa 5160 agctaatcga acgaaatcct caaacaaaat tcatgacact gtactactgg tacatgagtc 5220 tggagattaa cccttctttg aatgctccca atcgtagttg cttcccaaat ctctccaact 5280 ccactgtagg ttttgatctg accatgaggc gagaaaccca tggaattcca gatctgctcc 5340 gatcggagkt cataccacta atttttgcaa acaagttcgg ccaggtcagc agcgacagaa 5400 cgttttatac tgacgggtca aaaacaaatg attccactgg atttggtgta tacaacgaat 5460 ttcatagcgc cgcccacaaa cttcagaacc cttgttcagt atacgtcgca gaattagcgg 5520 ctatacacta tgcattagag cgaattgcct ctcttccctc tgatcgatat ttcattttta 5580 cggatagtct cagctccatc gaggctatac gttcaatgag gccggtaaag cactcmccgt 5640 attttctcmg tgaaatacga tctaaattga gtgctttgtc gaatcatacc atcaccttgg 5700 tttgggtccc ctcgcattgc tcgattccgg gcaatgagaa agcggacgca ctcgccaagg 5760 tgggcgctat ggaaggcgat atttatgatc gacaaatcgc ctttgaagaa tttttcacaa 5820 ttgctcgtca gcagaccttg ctcagttggc aacaaaaatg ggataaaggg gatttgggca 5880 ggtggttaca ttccattctc ccgcaggtgt caaagaagtc gtggtttaaa gggatgaaca 5940 tgagtcgcga ctttatcaag ataatgtgtc ggctgatgtc caaccactac tcgttaggct 6000 cgcatctcta tcgaataggg ctcgcagaca ataatcgatg tggttgtgga gctggttacc 6060 aagacatcaa ccatgttgtc tggtactgcc ccgaatacgg aattgccaga tcaaatctat 6120 gtacatctct cagggcccga ggaaaaccag acaaggaaga cattagagat gtgttgggta 6180 gactggactt tgactacatg ttacttgttt acaagttctt gaagcaaatc gacgtttttg 6240 tttgataata tgcttttttg ttccgcttgt cctcctcgta taccccttcc cgttttttgg 6300 atgtccacct tgtttccctc gaaacccgtt tgtttgtacc gttacaggtt gtcgtcatgt 6360 ccactctgcg ttgaccagca gcaataacgc ctcaacaata tgctgctgtt gaagatcaac 6420 gagtaaaccc aaaattccca tcccttccaa aaatgattgt actccttaac ctcgaccaaa 6480 ccgcgagttt tacggttccc caaaactaac ctagttctat aagaagcaaa tatgattttg 6540 taaaataaat cctttcaatc cggctccgta atgcctaaga gcgcttgagc ctgctaaata 6600 aatgattagg taaaaaaaaa aaaaaaaaa 6629 // ID BEL-205_AA-LTR repbase; DNA; INV; 535 BP. XX AC supercont1.2; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-205_AA_; KW BEL-205_AA-I; BEL-205_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-535 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2; Positions 2161677 2161143. XX SQ Sequence 535 BP; 201 A; 85 C; 91 G; 158 T; 0 other; tgttgctgcc aacactgccc agtgatgaca ttgcaattag gacggagaga tatgacagaa 60 ttattaagag agaaatatac tgagcactag aatgaatagt agtgaggtgc aaagccatgc 120 aagtttaatt tgagatagtg tgatttaata attgatctaa tttattgata tctataaatc 180 tacttacagc tacttcttaa aatactaaaa tgaattaaaa tgaattaaac taaactaatt 240 atcacagcag atatcacagt aagttagtaa aagctattga aatttaatat tctaacctag 300 aagttataat ctcaggaaaa tacatgcaga tgttacgagt gaaatgaata gttatccaag 360 attatgcctc aaaccaaatt atgtaagtaa cctataaaga aatgttcgta attgtaatcc 420 taataattga ataaattcta gctttaagct gcttcgcacc cacacaacgg agtttgcgag 480 ctgctctgaa gaatttggct gtgagccccg gaaaaccccc atttttaccg taaca 535 // ID Mariner-39_SM repbase; DNA; INV; 2408 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-39_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2408 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1888-1888 (2009). XX DR [1] (Consensus) XX CC Contains 2 overlapping ORFs (probably due to stop codons). CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1148..2158 FT /product="Mariner-39_SM_1p" FT /translation="MIKPMIINRSSTPRVMKNINKTQLPVFWRSNKKAWMT FT QDLFKDWFYNCFIPAVETYTMEKNLYFKVLLVLDNAGCHNIELDHPNVKIV FT FLPPNCTSLIQPLDQGVIQTLKMYYTRQLFQTIFDRLENSENKTLTQVWME FT FSILDCIRTVSSACAEIKPSTLNACWKPLLPQMVQAIQDDSTISLPVTEIV FT NIASRLSDEEFAVNHQDVRELVLGEETLDEEELMELIDAPTSNALVNENKE FT NEWVPNLDLQDVKKGLNLAKELELHFIQTDRSTVRSAKFKRELRNCLAPYR FT EVLVELEKKATIENQCEDEPNEEDILPIKRRRVRRLISDSEDEC*" XX SQ Sequence 2408 BP; 817 A; 420 C; 466 G; 705 T; 0 other; caaccctcct ataacgcggt actcttacag cgcggtttcc gttcaacagc atcccataag 60 ttccttccgc gcagcctatg aatgactcta gtaactagat attgtaagag gttttaatat 120 aagttttatt ttcgacgcaa aagcacagcc gagatacata aacaatatgt ttatataaac 180 atgccagctt atctcatagc cgtctacgac atcggtttca ctcgtctcag agctctttca 240 cagtacatat gttttgttgg tgaccaagtc tttctaaaaa tttttctaca ccgtccgttc 300 gttcagcttt ttgctgctta atttgtggca tataattgga caacattaca caatactgta 360 agtaattttt ttaatttgtt cgaatataat aataataaag taattaattt tactaaagta 420 attaattgcc ctagatttgc atcatgtcaa acacaaaaga aacaaagaaa ggattaaaaa 480 gaaagtttat atcgcttgaa gataaaatcc aaattttaaa tcgtttagaa ggcggggaaa 540 aaatatcgtc tgttgcaaaa tcaacaaatt tgaatgaatc tacaattcga accttaaaga 600 aaaatgcaga taatataagg aaaactgtgg cagatggctg tccgttaggt gcgaaacgtg 660 ttactcgcac acgaaattct aatatggtta aaatggaacg ggcgttaatg atctggttgg 720 aagactgtat agccaaaaag attcctatca gtggaaatct tatcaagcag aaagcactta 780 aaatctacga acatttaagg aatattgggc attcttatgc agggctagaa aatcattcgt 840 ttgtcgcgag taaaggatgg tttgaaaaac taaaaaaagg tacacattgc acaatattaa 900 gtttcaagga gaacaggcct cagcagatgc tgaggctgcc gaaaattata agctggaact 960 ggctagaatt atcaatgaag gaggttattc acctgatcag atttttaacg ctgatgagac 1020 ggcattatat tggaaaaaac taccatctag gaccttcatt ctgaaaaatc aaaggcgtgc 1080 tcaggggttt aaactttcaa aagaacgtat cacccttcat gtgtgcagca atctatcagg 1140 gagcctgatg ataaaaccaa tgataataaa tcgttcttca actccccgtg taatgaaaaa 1200 tattaataag actcaattgc ctgtattctg gcgatctaat aagaaagctt ggatgactca 1260 agatttattt aaggattggt tttataattg ttttatcccg gcagtggaaa cctatacgat 1320 ggaaaaaaat ttgtatttca aagttttatt ggttcttgat aacgctggtt gccacaacat 1380 agaactggat catccaaatg tgaaaattgt cttccttcct cctaactgca cgtcattaat 1440 acaacctttg gaccaaggag tcattcaaac tttaaaaatg tactacacac gccaactttt 1500 ccaaaccatt ttcgacagat tagaaaatag tgaaaataaa actcttactc aagtgtggat 1560 ggaattttct attttagatt gtattagaac cgtgtcatcg gcttgcgcgg aaattaaacc 1620 atcaacgctt aatgcgtgtt ggaaacctct tttacctcaa atggttcaag caatacaaga 1680 tgattctaca atttcactcc cagttacaga aatagtaaac attgcttctc gtttaagcga 1740 tgaagaattt gcagtaaacc atcaggatgt aagagaactg gtactcgggg aagaaacttt 1800 agatgaggaa gaactgatgg aattaattga tgcacccacc tcaaatgcat tggtaaacga 1860 aaataaggaa aacgaatggg tgccaaatct cgatttacaa gatgtgaaaa aagggctaaa 1920 tttggctaaa gaattagaac ttcatttcat tcaaactgat cgttcaacgg tgcgcagcgc 1980 aaagtttaaa agagagttaa gaaattgctt agctccctat agagaggttt tagtggagtt 2040 agaaaaaaaa gctacaatcg aaaaccaatg tgaagatgaa ccaaatgagg aagacatttt 2100 gcccattaaa agaagacgag ttaggcgact catatctgac agcgaggatg agtgttgaaa 2160 gattaaataa ttttacaaaa gtctttcttg tttttgttgg attatttatt ggtctccata 2220 taacgcggta cgaatgatgt gatttgtact aattccgagt ttcttatagc gcggtttcca 2280 tacaacgcgg tagacggttt ccatagaacg cgctgcgaat tagcctggtt tgtatggtta 2340 aatccgagtt tcttatagcg cggtttccat ataacgcggt acgcactcac cgcgttatag 2400 gagggttg 2408 // ID BEL-4_DPu-LTR repbase; DNA; INV; 292 BP. XX AC scaffold_26; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_DPu_; KW BEL-4_DPu-LTR; BEL-4_DPu-I. XX NM BEL-4_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-292 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 656-656 (2010). XX DR Genome; scaffold_26; Positions 947791 948082. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 292 BP; 82 A; 79 C; 57 G; 74 T; 0 other; tgttaggaat aatccacccc aacagacgac gacgagatcc gggcgccgcg taaccacacc 60 ggcgcaccta cgcgactatt gcctcggggg cccccgatag gtggccccaa cttaactcgt 120 tatttccatt ccccccaatt gttaactccc ctctgaatta ctgtaaaatc gttagtgtgt 180 taaaatatac gagaggaaaa ggacactgtg atttgtgtcc aatttatttc acatcttgaa 240 aactgtaagt aaaaggcctg acacgccgaa catctcttta caagttttta ca 292 // ID Gypsy-38_AA-LTR repbase; DNA; INV; 1046 BP. XX AC AAGE02019168; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_AA_; KW Gypsy-38_AA-I; Gypsy-38_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1046 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019168; Positions 146704 145659. XX SQ Sequence 1046 BP; 319 A; 235 C; 245 G; 247 T; 0 other; tgttaccatt ctggtacgtt caccctataa ttccaatcca gagctagtga gtaataaact 60 aggagataag agaggtatgc catactagag ttcggcatga atttgtagaa caatatgtgt 120 ataacagttt actttttgaa ttatatacta taggtaagcc actcccaatc aatcaaccgc 180 caaattgaat gtcaggaatt attgttagca cactatacat actacacaaa caacttgaca 240 atatatatca cgggatgaca gaaggataag gtgggcattg gaagtttgga atggatcaag 300 gagaagggtg gtggataagt gggtggctca gaagaggggg ggaggggctt agaggttagc 360 taaggggtct gaaggataag atagttcttt ccaacttgaa ccaccaaagt gtaaagaact 420 attcagttaa aattcagtta gtcactaaag tgccttatat agataaattg tgatatttat 480 cgtggaggat aatcgttcgt taaaaagaat tcggtgataa attaccagcg aagtcaggta 540 tggaagatat tcaaacgcca ttagtcagac ccagccctaa gacctcccat tcgtgccaca 600 ataggaccca ttgttgcttc ttagcttgag cccagggaaa atccaagccg gatttccagg 660 cactccgccg taggccctcc ccgggtccat ctacaagtag gaggcacgcc aaccggcggc 720 catcttgttg ggagacacca cgccacccgt tcatcctcga gagcccaccc atccgcatcg 780 cccgagtacc ccgtaagctg gccagcaact cacactacac catacacgcc acacagtaag 840 tgtaataaaa acctaattaa agtgataaag tggtgttgtt ttactttggg cactgatcag 900 taccacagcg aagagccgac cctgggaaaa gaggtggttg ttacccgccc cagacgactt 960 ccgtggccgt gaccagaagt caggggtacc ccggtgacaa ggcccatctg gccgagtcat 1020 cctagattcg aaataattgt ataaca 1046 // ID BMC1 repbase; DNA; INV; 5091 BP. XX AC AB018558; XX DT 13-AUG-1999 (Rel. 4.07, Created) DT 13-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Non-LTR retrotransposon BMC1 DNA, complete sequence. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; BMC1; KW Gag-like protein; ORF1 and ORF2; KW endonuclease and reverse transcriptase-like protein. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Abe H., Ohbayashi F., Shimada T., Sugasaki T., Kawai S. RA and Oshiki T.; RT "A complete full-length non-LTR retrotransposon, BMC1, on the W RT chromosome of the silkworm, Bombyx mori."; RL Genes Genet. Syst 73(6), 353-358 (1998). XX RN [2] RP 1-5091 RA Abe H.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (11-OCT-1998). Hiroaki RL Abe, Tokyo University of Agriculture and Technology, Department RL of Biological Production, Faculty of Agriculture; 3-5-8 RL Saiwai-cho, Fuchu,, Tokyo 183-8509, Japan RL (E-mail:wfem@cc.tuat.ac.jp, Tel:+81-42-367-5848, RL Fax:+81-42-360-8830). XX DR GenBank; AB018558; Positions 1 5091. XX CC ORF1: 313-1959 (putative DNA-binding domain; CC ORF2: 1963-4845 (endonuclease and reverse transcriptase). XX SQ Sequence 5091 BP; 1182 A; 1604 C; 1166 G; 1139 T; 0 other; cattccgatc caagcactcg ccgtgtgctg acgtgctcat tgttcgttcg atcgcccgac 60 ctcggttcgt ggcacatctg gcgtccaagt gttttctcgg aagttactac ctgtgctaat 120 tgcgtcgcac cgccgtttgt gaacgggcgg acccctaaat tcctggaatt cgttcggcaa 180 cttagtggcc cgtttgtaag cggacggttg tgtgccgtaa ttaatcgtga ttattataat 240 attggggcct cgccgtaagg cgagtgtgtt taaagtcccg gggtagcccc ccctgggcac 300 gtaatagtta caatggactc cacgatcgat ctgttcgcgg aattcctccg tctgaggtac 360 ccgacattat cggtcgaatt tttagaattt aaggccaatc acgtcgcgaa ctcagaagcg 420 gtctccgtcg cgcccgccgc tcccgtgtct ccaatactcg cgagcaaagc tcccgcgtcg 480 atcgctgcgg tctcggttcc agccctgcac tcatccgcag cacacgtcgc gtccacgcgt 540 tcatccgcgg cctccgtcgc gcccgcgccg tcctctgccc ccgagcgatt gtccatggcc 600 tccgtcgcga ccgtcaaccc tgcgccctct aaaacgcccg cctgcgggtc gcccgcaccc 660 tcctcttcat ccgactcgga gtcgaacatg gaggtcgatc cttcccccgc cccctcgact 720 gatggattca cagtagtcca aaaaggtaag aggcgcgctg cggaagcccg agctcccgcg 780 gccgccaaag taagcagagc cacgaacgcg tcgcgcctcc gtccaccgac ccccgttgct 840 ccccctgccc gtgctacacc gtcgccgcgt ccggtagaac aaaagaaaat ccaaacccct 900 cccccggtaa tccttcaaga gaaggcagcc tgggaacgag tttccctggc ccttaaggcc 960 aaaaatatta atttcacgaa tgcccgtaac ctcgcgaacg gcattcaaat taaggttcaa 1020 acacccgacg accatagggc cctctcttct tacctggagc gtataagttt ccatacgtat 1080 acgctccagg aggagcgcga acttcgtgcc gtcatacgtg gcatccctaa agagttggat 1140 gccgaactcg tcaaggccga ccttctggaa caaggcctac cagtaaattc agtgcaccgc 1200 atgcacaccg gccgcggtag ggagccatat aatatggttc tagtcgccct ccagcctacc 1260 cccgagggta agcaaatttt taacatacgg accgtctgta gactctccgg tatcgccgtc 1320 gaagcccccc ataaaaaagg cactcctagc cagtgccata actgtcaact gtacgggcat 1380 tcttcccgta actgtcacgc gcgcccccga tgcgtcaagt gtttaggcga tcacgctacg 1440 gccctatgca ctcgcgatca aaaaaccgcg acggaaccgc ctagctgcgt cctgtgtcaa 1500 acacagggtc accccgcaaa ttaccgcgga tgcccccgag ccccgaaaat aaatcgccgc 1560 gtcgcccgcc aaaaccgcct ccgagcttct cacccagaca ccaaagcctc ggcaccctct 1620 gtgtcgcagg ctaagccagc gttcgttccg gcgccggtgc ccagtggttc ggcctgggcg 1680 aaaccgctgc cgtacacgaa cacggctaca actccctcct ccgcgattcg tcccgccccc 1740 gcgatacgcc cctcccccgc gaattgccct ccgacagcgt ccgacaatct cgctctagcg 1800 atcgacttct ttcaatcgat taactttgag cgcgttaacg ctttaggtga cgccatccgc 1860 gccgcctcca ctgcacagca cttcatcgcc gttgtgcaag aatacgccga cgtatacgcg 1920 tcgttaaata cgtacgtcct cccctcactc cgccggtaat caatggcgta caaaagtagg 1980 ctgaaacccc tatccgtaac gataggattt tttaacgcat acggtctcgc aaatcaacgt 2040 gatcaggttt gtgatttttt gcgtgaccat caaattgaca tatttttagt acaggagact 2100 ctacttaagc ccgcgcgccg cgaccctaaa atcgcgaact ataacatggt taggaacgac 2160 aggcgcactg ctcgtggtgg tggtaccgtc atctactaca gaagagcctt gcactgcatc 2220 ccgctcgatc cccccgcgct cgttaacatc gaagcatcag tatgccgaat ctcactgacg 2280 ggacacgcgc cgatcgttat cgcgtccgtt tatcttccgc cggaaaagat cgttctaagc 2340 agtgatatcg aggcgctgct cagcatggga agctctgtca ttctggcggg cgacctaaat 2400 tgtaaacaca tcaggtggaa ctctcacacc acaaccccta atggcaggcg gctcgacgcg 2460 ttagtcgatg atctcgcctt cgatatcatc gctccgctaa ccccgactca ctacccgcta 2520 aatatcgcgc atcgcccgga tatactcgac atagcgttat taaaaaacgt aactctgcgc 2580 ttacactcga tcgaagtagt ttcagagtta gattcagacc accgtcccgt cgttatgaag 2640 ctcggtcgct ctcccgattc cgttcccgtc acgaggactg tggtggattg gcacacgctg 2700 ggcatcagcc tggctgaatc tgatccacca tcgctcccgt ttagcccgga ctctaccccg 2760 tctcctcagg ataccgctga agccatagac atcctaacgt cacacattac ctcgacatta 2820 gataggtcat cgaagcaagt tgtagcggag gacttccttc accgcttcaa attgcccgac 2880 gatattgggg aactccttag agctaagaac gcttcgatcc gcgcctacga taggtatcct 2940 accgtggaaa atcgtattcg aatgcgtgcc ctacaacgcg gcgtaaagtc tcgcatcgcc 3000 gaagtccgag atgccagatg gtctgatttc ttagaaggac tcgcgccctc tcaaaggtct 3060 tactaccgct tagctcgtac tctcaaatcg gatacggttg taactatgcc ccccctcgta 3120 ggcccctcag gtcgactcgc ggcgtttgat gatgacgaaa aagcagagct gctggccgat 3180 acattgcaaa cccagtgcac gcccagcatt caatccgcgg accctgttca tgtagaatta 3240 gtagacagtg aggtagaacg cagagcctcc ttgccaccct cggatgcgtt accacccgtc 3300 accccaatgg aagttaaaga tctgatcaaa gacctacgtc ctcgcaaggc tcccggttcc 3360 gacggtatat ctaaccgcgt tattaaactt ctacccatcc agctcatcgt gatgttggca 3420 tctattttca atgccgctat ggcgaactgt atctttcccg cggtgtggaa agaagcggac 3480 gttatcggca tacacaaacc cggtaaacca aaaaataatc cgacgagtta ccgcccgatt 3540 agcctcctca tgtctctggg caaactgtat gagcgtctgc tctacaaacg cctcagagac 3600 ttcgtctcat caaagggcat tctcatcgat gaacaattcg gattccgtgc aaatcactca 3660 tgcgtacaac aggtgcaccg cctcacggag cacattcttg taggacttaa ccgaccaaaa 3720 ccgatctaca cgggagccct cttcttcgac gtcgcaaaag cgttcgacaa agtttggcac 3780 aacggtttga tttttaaact tttcaacatg ggcgtaccgg acagtctcgt gctcatcata 3840 cgagacttct tgtctaaccg ctcttttcga tatcgagtag agggaacccg ctcctccccg 3900 cgaccactca cggctggagt cccgcaaggc tccgttctct ctccgctcct atttagttta 3960 tttattaacg atattccccg gtcgccgccg acccagctag ccttattcgc tgacgacaca 4020 accgtttact actcaagtag aaacaagtcc ctaatcgcga gtaagctcca gagtgcagcg 4080 ttaaccctcg gacagtggtt ccgaaaatgg cgcatagaca tcaacccagc gaaaagcaca 4140 gcagtgttat ttcaaagggg aaactcaccg cttatttcct cccgcattag gaggagaaat 4200 atcacacccc cgatcaccct cttcggccta cctatacctt gggctaggaa ggtcaagtac 4260 ctgggagtca ccctggatgc atccatgaca ttccgcccgc atataaaaac agtccgcgat 4320 cgtgccgcat ttattctcgg cagactctat cccatgatct gtaagcggag taaaatgtcc 4380 cttcggaaca aggtgacact ttacaaaact tgcataaggc ccgtcatgac ttacgcgagt 4440 gtggtgttcg ctcacgcggc ccgcacacac atagacaccc tccaatccct acaatcccgc 4500 ttttgcaggt tagccgtcgg ggctccgtgg ttcgtgagga acgtcgacct acacgacgac 4560 ctgggcctcg aatcgatccg caaacacatg aagtcagtgt cggaacgtta cttcgataag 4620 gctatgcgtc atgataatcg ccttatcgtt gccgccgctg actactcccc gaatcctgat 4680 cacgcaggag caagtcaccg tcgtcgccct agacacgtcc ttacggatcc atcagatcca 4740 ataactcttg cattagatac cttcagctct aacactagga gcaggcttag gagccccggt 4800 aaccgtactc gtcgaactcg acaaagaggt cgacgtgcag cctaacccat gcatcagccc 4860 gctgagtttc tcgccggatc ttctcagtgg gtcacgattc cgatccggta gtagattcat 4920 tcgcgaagca actgctcttg agtcgttagg tttccttcgg aggcgctcgg gcagttgtta 4980 gcaaatccca cccctcttgg ctgagccttt gctcgcccac ctgtcctggt gaagctggaa 5040 aggcctccgg gccaccagta atatctcaat cataaaaaaa aaaaaaaaaa a 5091 // ID Gypsy-36_NVi-LTR repbase; DNA; INV; 1294 BP. XX AC . XX DT 01-JUL-2009 (Rel. 14.07, Created) DT 01-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-36_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1294 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1384-1384 (2009). XX DR [1] (Consensus) XX SQ Sequence 1294 BP; 444 A; 317 C; 274 G; 259 T; 0 other; tgtaatggaa tccccgattc cattacgcgc ctacgacacc aatttaagta ggtcacacaa 60 tttttctacg cacgaaacaa gaaacacaca tacgtgagac actcgcgagc atcgtcatcg 120 cagcagccgc aagaagcagt atttcccacc acaacgtatt cctacaaaaa taggattaga 180 ggcgtagcga caaaccgcgc tctaggcgaa gatcatccga aagaatggct aacgccaaga 240 aaaaaaaaat gacggcgacg aaataaaaag atgtcactga acgagtcgcg atattcctca 300 cgaaaagcga cgaaaatcaa aatagctatg cgacgagcaa gcatcggtcg agcacgctct 360 cccgaaaacg aatcgggaaa agctacacat gacagtacgc aggcgacgaa aagcagcagg 420 agagctgtca caagacgctt ttcctaaaaa ctcaaggtaa taaaaatttt acgaggtcac 480 gcaaagtcaa gagaaaagtc gtgagaatat taggcactcc taatatggaa aatcgaggct 540 acagaatttc cagtatataa aggccgggag caaaagggtg caacatggtc agtttccttg 600 aactcgtcat actaaacgaa tcgctgctag tcgccagatc cagtcgatat cagttatact 660 attcaaatca gtatcgaatc agttagtcgc gagccagaac agcgcggacg aactttgtcg 720 ccacaataaa actagtgcgc actaaaccaa ttaaaaagtg cggatacgtt aaagagttcg 780 agaaagaata gcttatacaa aaactaagtg atcgtgtcaa gaataagaca agcgccactt 840 tcaccgtcct ctctcatcac ggaaagagag ccgacacatc ggagcgccca tctgtcaaca 900 cctgcttcaa agagggacaa gctaggagga ccgagacgca caccagcctg accgcactga 960 accaccgacg ccaggctcca gagcgccaag gacccacagc tgcagcttcc tgctgagaca 1020 tcgatacatc cacacacaca cacacccaca catattgtaa aaggtacttc tctctttctt 1080 ttctaatcga gtgaataaaa tcgtagttgc tcttaaaatt ctaacggctt taaagtgatt 1140 tcgttatccg gctccctttc ccggaaggca gtgaccgcgt ctgcggtagc caaaagggat 1200 ttcttttgaa taaataaaaa taataaataa tagccgaaga tagggactcg acaccgtaag 1260 tatattggtg ggttacgcac attccgacgt aaca 1294 // ID BEL2_MH-I repbase; DNA; INV; 8351 BP. XX AC ABLG01001008; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_MH; KW BEL2_MH-LTR; BEL2_MH-I. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-8351 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1518-1518 (2009). XX DR Genome; ABLG01001008; Positions 4038 12388. XX CC Positions [4503-5072] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1423..3840 FT /product="BEL2_MH-I_1p" FT /translation="MSILSEQNRCLKCLRKGHAKINCNQRYSCGVCKGDHN FT RIICPKIVEKEKKVQIKENECVNTGQIKENVLNIEDDCEENIILMKADLTL FT KERNNKLNATCFFDSGATVNFLCNELVEKLKLKKIKRMELGVVPFFKKEPV FT KLKTSCYSLKILLSNGEFEEILVYSINRNMMPRFRCADLKTNEIVCANIVP FT DMLIGMKHFWRFFKGIEPLSDYLFKVETTVGPIICGEIHNEKRTSNRKENL FT YTILSGNTLENEELNNYWSLETIGVRNDPICKDDKVALDIFNNTIKRKENG FT SYIVKWPWKTERESLPSNFSLAYKRFLNLMDKLNKNQELLNKYEEIIKSDL FT ERGVIEKAEKKGEIEHFLPHHPVVNTKKVRIVYDASAKMREGKSLNECLYR FT GPIIMPDLAGLLIRFRVPKIAMWADIEKAFHTLELEEEDREVTKFIWVKDT FT QKPRSTQNIQYFRFAKVPFGVISSPFLLSATVRHHLEKCDDPVAKIAKENS FT YVDNIFIGVESTSEAWLAYKTLKKIFKDAQMNLREFVSNDIELNEQFPVED FT RLEKNKPKILGIPWDICTDKINIVFPVVNFSEKLSKRIVLRQLASMFDPLG FT LASPSLLSAKLFFQSLWTKENKWDDKISEEKVKKWKEITDSWAINPIEINR FT RVTEIKSEKQLHVFSDASKCAYAACVYIKSQFKEKIYTQILYARNRLKPRK FT AEVTIPRMELLGVVIAKRALEFVEKQIGVEIKEKFLWCDSKPVLSWIKAGG FT KNEKFVENRIKEIRKNNGIKFGYVNTSENPADIATRGLLASDLRKCHIWWE FT AQNG" FT CDS 6834..8351 FT /product="BEL2_MH-I_4p" FT /translation="MTFQWKDIKFTLLDITPPMAPILNSEFIIGERGIAIS FT ENSISHLECHENAANFTKCELELSACNECWLEQEIEKIQCACTDFIIEKLI FT KDPRKSLPLEINRIRLRRKEDMIISESNYLPILLMIRFEDWSISSNIDITS FT CKIEPEYLKGCYSCMGAIFKFNCSTEYGSAYAEIKCPGGNTFTAKCNEKGI FT DQKIALSFDHSEINEICHVYCSGGTSNFTLKGQLEFILPEKMKVIVKEGSA FT EIFIENNENWFTNPFSNLQFLDWKNILWDLMNMQMILIIIFSFALIIFALK FT LFLKFNPLFKGYKLATSFIIIFYFFLLAGCSIHCLCIECIKIEMIQNRHSF FT VEKLKIERNLINHTNFTLFHTKIMMSINFRVSFSLKNDVIRKIEKMSMSQK FT IQLKNYINKPSFLNISYIEQKTHSLKMFLLVEISADLFRCDDIEAVWELYV FT SEAQRTKLRKCVEAAINRGSKLMVSNGKNKDLINLEMLEDWEIVENTPYLY FT ERPREFPVKD" FT CDS join(3810..5213,5217..6602) FT /product="BEL2_MH-I_2p" FT /translation="MPYLVGGPEWLKKPEEDWPNELKFEVEKENNEEDNLE FT IILNCATKKEENLFNFNNWNSWNKLIKCFMFVSIAARIWIEKGRKIIKGKF FT LSEINLENRISSSNFKKVEIFILKIAQSSFRNEKKEDIQTIIDENGLIRLK FT TRISNCEAEEYFKHPILLPKGCFAVKLILRKIHEELNHSGVDSTLANFLLK FT FWTPCARRITKIVIKECKKCQKIISPKFALPEMPLLPKERVRQSRPFEFVG FT VDYLGPSICKIEGSKVKFWIALFTCLSTRGIHLDLVTDLSALSFLNILRRF FT VALRGAPVKIISDNGNQFIVIAKVINAIIEGKWKKEKTKNDEKVFNNYLIE FT KEIEWKFIPALSPWQGGVYERLVKLVKDCFKRSLGSLILNLEELRTFIKEV FT EMSINCRPISHISVEKDGPICLRPIDFMIPSVEMAQKFSKIEDEDFRLGKM FT STAEQVQERWMATLRTLQKFWDSKHDYLIMLRDKAKWTHHNQRSDLKRGPR FT IGEIVVVHQEGQPRNVWPLGKIIELDGTPARSAKIKIGNKTFIRPINKISP FT LEIEENMIEDNSNKELENEKDLTVEDSKKVDKRNRNTHPMVTRSKVALCLQ FT FVQLIYLITIAFAQPPRYINCKGCFMKCRTFGAEANAVEYINKIELCCAGS FT CLLETNGISRLKFIIPKDKTTLEYKCRATYWGKRYMYRDWISCPAVDPCLR FT IAGVRERIFNPQCESTESLILIGIVVGLILAITLAIGLFALRILSIFGSLL FT KLIWYILKMPFGILFVQKKNKQKDVSEKLLLNKADNKKRIKRNRRLNKLSL FT LALLAIFSNSIQIISGEVEVVVIQAKSEDCQRKGGISTCKMNAINTVTLLP FT NEQINSLSLKNNKGNIMGELELSMHGLQIQCNQKILRYLRSYEVKVQSIKR FT CPAAGTCKAGRCNHIRTKDYVEELSE" XX SQ Sequence 8351 BP; 3166 A; 1053 C; 1686 G; 2446 T; 0 other; tttggtgcca cgaccgactt ttatagttgg tgccacggac ccttacgaag aggggtgaaa 60 aaaaaaaaga gagagaaata gaagagctat aggctcagat tattggaaga gctgtaaatt 120 tggaaagctc ggtaaaaaca aaaagaaaag ttaaataatt taattggaaa ttaaaagtgg 180 gatttaaata atatattatt aaaggttaaa aatgtctgga actattcgta gtactattgc 240 accttccata atacgtctag gaaggtattt ggatcaagtt aggcctatac ttgatgccga 300 tgagcgcact gaagaaggaa tagaattact tcaaaaaatg ctcattaaaa ttcgtagagc 360 tatcgcgtta gttgatgaaa aggttgagaa atgggaaaaa tatattaaga atttaacggc 420 aaatgataag gccgcagagg aagatatttt tgctagatat tccattaacg aaaggcattt 480 tttagaatgg attgatttag gaagagaagc tattgttgat atagaaattg cacttaccga 540 aggatctgat gctcatagta cgcaatcaga acattcagaa gagataggag ttggtctgaa 600 tagaacgaac gaagttcgtc ggaatatacc aaccagaaat gaagtaccag tcagaaacga 660 gcaagtactt ccacctcatc ttccgcgcac taggttgccg gaattttatg gggattcact 720 aagatggcct gaattttggc aagcttttga gcgcactgtg gattgtctga atattgattt 780 tgggctgaaa gcacattatt taatacaatg tctcagagga aaggccaaga gggcagtttt 840 aggctatagg ccaatagagg aacattatga acccttaaaa gacgcactta aaagacaatt 900 tgggaatgaa aaagcaatta gagatacgtt acatgccgag ttaatcagct taccagtggc 960 taatgaatca ataggatctt tgcgctctta ccttgaggag gttgagcgaa tttgtagatg 1020 cttattagct atggggcact tggaagatga agctatccgt aatgatggct attaagaata 1080 agttacctag aaatatcgtt taagagttat taaagatgga aaacattaga aagagtagat 1140 tggaatgtaa caaaactgag agaaggactt tcagaaatcg tgaccttaag agaagaagct 1200 caaaggtgca caaggtcatt atcatataag aataattcgc aaaatgtaca gaattttaat 1260 aatcagagga gatttacggg aaactcgaat aatcagagaa gagcagaacc tgcgcgggta 1320 ttcacagcgc cgaaaaacca gagaaaggat atttcgtgtt tgttttgcaa tggtccacat 1380 ttggctggaa attgtaaaca atttaaattt attgggcaaa gaatgagcat tttgtctgaa 1440 cagaatagat gtttaaaatg tttaagaaaa gggcacgcaa aaataaattg taatcaaaga 1500 tatagctgtg gtgtctgtaa aggtgatcat aacagaataa tatgtcctaa aatagtcgaa 1560 aaggaaaaga aagtacagat taaagaaaat gaatgtgtaa atacaggcca aataaaagaa 1620 aatgtactaa atattgaaga tgactgcgaa gagaatatta ttttaatgaa agcagattta 1680 actttaaagg aaagaaataa taaattaaat gcaacatgtt tctttgattc gggagcaact 1740 gttaattttc tatgtaatga actcgtagag aaattaaaat tgaaaaagat aaaaagaatg 1800 gaattaggag ttgtgccatt tttcaaaaaa gaacctgtaa aattaaaaac gtcttgttat 1860 tctttaaaaa ttctcctaag taatggtgaa tttgaagaaa tattagtata tagcattaat 1920 agaaatatga tgccaagatt caggtgcgca gatttaaaaa caaatgaaat tgtgtgcgca 1980 aatatagtgc cggatatgct tataggcatg aagcattttt ggaggttctt taagggaata 2040 gagccattat cggattattt atttaaagtt gaaacaacgg ttggtccaat tatttgtggc 2100 gagattcaca atgaaaaaag gacttcaaat agaaaagaaa atttgtatac aatcctaagt 2160 ggcaatacat tagaaaatga ggaacttaat aactattggt cactagagac tattggagtc 2220 aggaatgacc cgatatgtaa agatgataaa gttgctttag atatttttaa taatacaatt 2280 aaaagaaaag aaaatggaag ctatatagta aaatggcctt ggaaaactga gagggaaagc 2340 ttgccgtcaa atttttcttt agcttataaa agatttttaa atttaatgga taaattaaat 2400 aaaaatcagg aattgttaaa taaatatgaa gaaattatta aaagtgatct ggaaagggga 2460 gtaattgaaa aagctgagaa gaaaggtgaa atagagcact ttcttcccca ccatccagtc 2520 gttaatacaa agaaggtacg tatagtttac gatgcttcag caaagatgag agaaggaaaa 2580 agcttaaatg aatgcttata tcgggggcca ataatcatgc cggatttagc aggattatta 2640 atacgtttta gagtgccaaa gattgccatg tgggcagata ttgagaaagc gttccatacg 2700 ttggaattag aagaagaaga cagagaagta actaagttta tatgggtgaa ggatacacag 2760 aagcccagat caactcagaa tattcaatat tttagatttg ctaaagttcc tttcggggta 2820 atttctagcc catttttgct ttcggcaacg gtcagacatc atttagagaa atgtgatgac 2880 ccggttgcga aaattgccaa agaaaattct tatgtggata atatttttat cggagtagaa 2940 tcaactagtg aagcatggct tgcatataaa accttaaaga agatatttaa agatgcccaa 3000 atgaatttaa gggagttcgt tagtaatgat attgagctta atgaacagtt tcctgttgaa 3060 gacaggctgg agaaaaataa gcctaagatt ttgggaattc catgggatat ttgtactgat 3120 aaaattaata ttgtatttcc cgtggtaaat tttagtgaaa aattatctaa aagaattgtt 3180 cttcggcaat tagcaagtat gttcgatcct ctgggacttg cttctcccag tcttttatca 3240 gcaaaattat tctttcagtc attatggact aaggagaata aatgggatga taagattagc 3300 gaggaaaagg tgaaaaaatg gaaagagatt acggattcat gggcaataaa tcccattgag 3360 attaatagaa gagttactga aattaaaagc gaaaaacagt tgcatgtgtt ttctgacgct 3420 tccaaatgtg catatgcggc ttgtgtatat atcaaatcgc aatttaaaga aaaaatatat 3480 acacaaatat tatatgccag aaatcgtcta aaacctagaa aagctgaagt aactatacca 3540 cgcatggaac ttttaggcgt agtaattgct aaaagagctc tagaatttgt agagaaacaa 3600 attggagttg aaataaaaga aaaatttctg tggtgcgatt caaaaccggt attaagttgg 3660 ataaaggccg gaggtaaaaa tgaaaagttt gtagagaata gaataaaaga aataagaaaa 3720 aataatggaa ttaaatttgg ttatgtaaat acgagtgaaa atccggcgga tatagccact 3780 cggggattat tagcatcaga tttaagaaaa tgccatattt ggtgggaggc ccagaatggt 3840 tgaagaaacc agaggaagat tggccaaatg aacttaaatt tgaggtcgaa aaggaaaata 3900 atgaagagga taatttagaa ataatattaa attgtgcaac taaaaaagaa gaaaatttat 3960 ttaattttaa taattggaat tcatggaata agttaattaa atgctttatg tttgtatcta 4020 tagcggctcg aatttggata gaaaaaggca ggaaaattat aaaaggaaaa tttcttagtg 4080 aaataaattt agaaaatcgt atatcttcaa gtaatttcaa gaaagttgaa atatttattt 4140 taaagattgc gcaaagttcg ttcagaaatg aaaagaaaga agatatacag acaataatag 4200 atgagaatgg actaataaga ttaaaaacaa gaataagtaa ttgtgaagct gaagaatatt 4260 ttaaacatcc aattttattg cctaaaggat gtttcgctgt taaactaatt ttgagaaaaa 4320 tacatgaaga attaaatcat tcgggtgtcg attctacact cgcaaacttt ttattgaaat 4380 tttggacgcc ctgcgcacgg cgtataacaa aaattgtcat aaaagaatgt aagaagtgcc 4440 aaaagattat ttcaccaaaa tttgcactcc cagaaatgcc tttattacca aaagaaaggg 4500 taaggcaatc acgtccattt gaatttgttg gagtggatta tctaggccct tctatttgca 4560 aaatagaagg aagtaaagtt aagttttgga ttgcgctctt tacatgtttg tcaacaagag 4620 gaattcattt agatttggta accgacctat ccgcactttc gttcttaaat attctaagaa 4680 gatttgtcgc gctaaggggt gcaccagtaa agataatatc agacaatggc aatcagttta 4740 tagtcattgc taaagtaata aatgccatta ttgaaggtaa atggaagaaa gaaaagacga 4800 aaaatgatga aaaggtattt aataattatt taatagaaaa agaaattgag tggaaattca 4860 tacccgcgct gtccccgtgg cagggcggag tttatgaaag acttgtaaaa ttagttaaag 4920 attgtttcaa gagatctttg ggaagtctta tattaaattt ggaagaatta agaacattta 4980 taaaagaggt agaaatgtca ataaattgta ggccgatttc ccatatatcc gtagaaaagg 5040 acgggccaat atgcctaagg ccaatagatt ttatgatacc aagtgtggaa atggcgcaga 5100 aattttcaaa aattgaagat gaagatttta gactagggaa aatgagcaca gcagagcaag 5160 tccaggaaag atggatggcc actttgagaa cgcttcaaaa attctgggac tcatgaaagc 5220 atgattatct tataatgtta agagataagg caaagtggac ccatcataat cagagaagtg 5280 atctaaaaag aggaccaaga attggcgaaa ttgtagttgt ccatcaagag ggacaacctc 5340 gtaatgtctg gcccttaggg aagataatag agttggatgg cactcccgcg aggagcgcaa 5400 aaataaaaat cggaaataaa acctttatac gtccgattaa taaaatctct cctctagaga 5460 tagaagaaaa tatgattgaa gataactcaa ataaagaatt agaaaatgaa aaagatttga 5520 ctgtagaaga ttctaaaaaa gtggataaaa gaaatagaaa tacacatcca atggttacaa 5580 gatcaaaagt ggcattatgt ttacaatttg tgcaattaat ttacttaatt acaattgcat 5640 ttgctcagcc accaagatat ataaattgta aaggctgttt tatgaaatgc aggactttcg 5700 gtgccgaagc caacgcagta gaatatataa ataaaattga attatgttgt gctggaagtt 5760 gcctactaga aacaaatggc atttcaagat taaagtttat aattccaaag gataaaacaa 5820 cattagagta caaatgtaga gccacttatt ggggcaaaag atatatgtat agggattgga 5880 tttcatgtcc agctgttgac ccttgtctaa gaatagcagg ggtacgagaa agaatattca 5940 acccacagtg tgaatcaact gaaagcttaa tattgattgg tattgttgtg ggtcttattc 6000 tagcaattac tcttgctatt ggattatttg ccctcagaat tttgtcaata tttggaagtt 6060 tattaaaatt aatttggtat attttgaaaa tgccttttgg aatactcttc gtgcagaaga 6120 aaaataaaca aaaagatgtg tcagaaaaat tattattgaa taaagctgat aacaagaaaa 6180 ggataaagag aaatcgtaga ttaaataaat taagtttatt ggcattattg gccattttct 6240 ctaattcgat tcagataatt agtggagaag ttgaagttgt agttatacaa gctaaatctg 6300 aagattgtca aagaaaagga ggaatttcaa cttgtaaaat gaatgccata aacactgtga 6360 ctctcttgcc aaatgagcaa attaattctc tttctttaaa aaataataag ggaaatataa 6420 tgggagagct tgaattatct atgcatgggt tacaaattca gtgtaatcaa aaaatactta 6480 gatatttgag atcttatgag gttaaagtcc aatcgattaa aagatgtcca gcagcaggaa 6540 catgtaaagc tgggagatgt aatcatatca gaacaaagga ttatgttgaa gaattatcag 6600 aatgaaatga ttaccctgga aactcgtttt gtattgaagg aaagtccctt tggggatatg 6660 gatgttcatt acccggtgcg gactgttatt tttacagagt atatgcagtt ccaaagtccg 6720 atgagatttt taaacttgtg tcatgtccta cctgggaata tagtattgaa gtaaatatag 6780 gacttgtttc aaatggaact atcacaaaaa agagatttac acttaatcca ggaatgacat 6840 ttcaatggaa agatataaaa tttacactat tagatatcac acctccaatg gctccaattt 6900 taaatagtga gtttattatt ggagagaggg gaattgcaat ttcagaaaat agtatttctc 6960 atttggaatg ccatgaaaat gccgcaaatt ttacaaaatg tgaactcgaa ttgagcgcat 7020 gcaacgaatg ctggctcgaa caagaaattg agaaaataca atgcgcatgt acggatttta 7080 ttatagagaa attaataaaa gatccgcgca aatcgttacc actagaaata aacaggatca 7140 gattaagaag aaaagaggat atgattatta gtgagtcgaa ctatttgcct attcttctga 7200 tgatccgttt cgaagattgg tcgatatcca gcaatattga cattacaagt tgtaaaatag 7260 aacccgaata tttaaaaggt tgttacagct gtatgggggc tatatttaaa tttaattgct 7320 ctactgaata cggatcagct tatgctgaga ttaaatgccc tgggggtaac actttcacgg 7380 caaaatgtaa tgaaaaagga attgatcaaa aaattgctct ttcttttgat cactcggaaa 7440 taaatgaaat ttgtcatgtt tattgttcgg gaggaacttc gaattttact ttaaaaggac 7500 aattagagtt tattttgccg gaaaagatga aagttatagt aaaagaagga tcagctgaaa 7560 tatttattga aaataatgaa aattggttta caaacccttt ctccaatcta cagtttttag 7620 attggaaaaa catactttgg gatttaatga atatgcaaat gattttaatt attatttttt 7680 catttgcact tattatcttt gcacttaaat tatttttaaa gtttaatcca ttattcaaag 7740 gctataagtt agctacgtca tttattatta ttttttattt ttttctcttg gccggatgtt 7800 ctatacattg tttatgtata gagtgcatta aaatagagat gattcaaaat agacattctt 7860 ttgttgaaaa gttgaaaata gaaagaaatt taattaatca tacaaatttc actctttttc 7920 atacaaaaat aatgatgtca attaatttca gggtttcatt ttctttaaaa aatgatgtga 7980 ttagaaaaat tgaaaaaatg tcaatgagtc aaaaaattca gttaaagaac tatataaaca 8040 agccaagttt tcttaatata tcttatattg aacaaaaaac acattcttta aaaatgttct 8100 tattagttga gatttctgcg gatttgttcc gctgtgatga catagaggca gtttgggagc 8160 tctatgtgag cgaggcccaa cgtacaaaac ttcggaaatg tgttgaagct gccatcaata 8220 gaggaagcaa attgatggta agtaatggca aaaataaaga tttaataaat ttagaaatgt 8280 tagaggattg ggaaattgta gaaaatacgc cctatttata tgaaagacca agggaatttc 8340 cagttaagga c 8351 // ID BEL-33_CQ-LTR repbase; DNA; INV; 336 BP. XX AC AAWU01003985; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-33_CQ_; KW BEL-33_CQ-I; BEL-33_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-336 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 220-220 (2011). XX DR GenBank; AAWU01003985; Positions 53003 52668. XX SQ Sequence 336 BP; 99 A; 101 C; 86 G; 50 T; 0 other; tgttcggtct ggccgaacac ttgccccacc cgtgcggcga aactgactac gccgctgcgc 60 aacggcagct gtcaagctca ccgacagcaa caacggcccg cttactgcgc gcagaccagt 120 cccgaaccat ccatcgaacg cgtcgcacgt gcagccaaaa cacagcagag cagaagcaga 180 gacagaagag gaagacgaga aagtgaagtg ataagaggag gagaataaaa gaaaattaga 240 aaatctaaac cctcccagtg tttgcttccc cggacaatct tccgccgcga tacagtccac 300 ttccgcgagt ttgaagggcc accgttggtc cgaaca 336 // ID Copia-31_DPu-I repbase; DNA; INV; 4557 BP. XX AC scaffold_50; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-31_DP_; KW Copia-31_DPu-LTR; Copia-31_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4557 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_50; Positions 870584 866028. XX CC Positions [1847-2437] - Integrase core CC 'TCCCT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1895..3223,3227..4510) FT /product="Copia-31_DPu-I_1p" FT /translation="MIHADLCGPMEVATPNGSLYFALFIDDYSGWRFISFL FT KAKSEAVDCFKELIHVIRGETGNLVRTFRTDNGGEWSSHEFATWMTHKGIR FT HETSVPHTPEQDGVSERGIRTVTEGLRSCLYDTRTPLESWNTEVVNGAVNL FT IKESVLPKYLWAEAASFTVYTLNRVLSKVSPVTPFEAWHNKKPNLSHLRVF FT GSIAFIHIPKAERRKLDPKSTRCIFVGYSQTQKAYRFWNPTNRTIKISRDV FT TFDEHHRLVGIPGETPAYNNQKAHSPTLVQVQEEPLAVTNKDTTTSGQEQP FT IIHLDSDPQTHLNEEMQPNQPTEPNALDEIDVEQQTQPRRSLRGRVPIREW FT KACPVMDRPETDNSYIPLNYQDAMNCPAARNWKIATQEEYQSLIDNGTWSI FT VSCPKDRTPIKSRWTFDIKPGLNGEPQRFKARFVAKGYSQRPGIDFNETVV FT SHDTLRILLSVIAADDLEALQIDVKTAFLYGPFEEEIFMDQPEGFIIPGQE FT SKVCRLHKCIYGLKQASRVWGDLFTDFIEHHGFQRSEADPCLFSRVRGTER FT TFLTTWVDDVIVASNQQQAIDGFLIALGEKFQFRSHPLQRYVGIAIARDRE FT QRKLHISQPEYISQIVKKFHMTSCSPKSIPADPNVHLVKPTEKKEEENAFP FT YREAVGALLYLALVSRPDISFAVGQVARFVESYNLSHVKAVRQIISYIHGT FT PNHGICFAGSNRNPLVGFSDADYAGCQTTRRSTTGSVFMYNGGIIAWCSRR FT QTCVALSTTEAEYVAASETAKEAIWIRRILPIFQQDPEAPIVIRCDNQSAI FT QLFCHPDQRPKTKHIAIRYNFIRLQQENGEIKMEFTRSADQLADIMTKALP FT SPSFHSIREELSVVPGFT" XX SQ Sequence 4557 BP; 1394 A; 1197 C; 964 G; 1002 T; 0 other; ggttatgggc ccagctacgc gctacctacc tcgtggatac taaagaatga acacatccaa 60 agatgtttct cacattgcca agtttgatgg tcaaaattat tcactgtgga agctgggact 120 atgggtcctg ctcgaacaac acaacctcat cgacatagta actggagatt acaccattcc 180 tcaaatggta atagtttact acagcattat aaatatgcca atcaaacttg cagtaaagac 240 ctcatatttg caaatgatag gtggaagatg ctgaacgtga tgctcaacta gcaatcatag 300 aagaaatcca agattggaaa gaaagagatg taagagctcg aggcttcatt ctttcaacaa 360 ctgaggtatc aatctacagg atcactccta ctatatacaa tccggttgat aactttccta 420 cattatctat atttatagtt gtcacaacaa cgactcctca ttgattgcat aactgcacac 480 caaatgtgga ctacacttac ggctcaacac cttgaaaggg catctgacaa ccaacacgac 540 ctgttaatgc ggtattatga gtacaaatac caacctgaaa ccgacttgaa gaatcatatt 600 gccaacataa aatccattgc tcaccaattg ggagaagttg gacgccccat agaggaaaga 660 gagctgatca ccaagatcat tagcaccctc ccgattgact acaaagcttt cgtctcttca 720 tggcgacatg tggcaacaga aaaaacaaac tttattgact ctcacctccc tcatattaca 780 agaggagaga gaaattacga aatggaggcc gaagtcaagt gataacaagg attctgcttt 840 tcacgcccaa aatcgtgacg aatcccaggg atactcgaat cactatcaga aacgtaatta 900 tataatatag ttatcatgat tcagtttatc taaaggttcc aatattctca accgattaca 960 gaaagaggaa ggggaaagaa ccactcccaa cacactaaag gcctgagtcg aaaaagtgga 1020 caagaccgga acaaaggaga caaaccgatg cactgctcat actgtggcct gacaaaccac 1080 gaagtagaga gctgttacac caagaaacgt catgaccgaa ctgattccga acgtgctgaa 1140 ctagccaaga agatgaagat gaatgacggg tctgccaagg tggcatttgc tcaacataat 1200 gaaaaacttt aaacttgaat ctaaaaagtg ttgtattaca ttccagaaat acaaagagac 1260 tcagaactcc agctcgtttc ctatcacctc tcgtcttgaa actcgaagta ctggtgattg 1320 gtttgctgat tccggagcga cacagcacat gtcggatcag aaagaatggc tcgtggactt 1380 tgtaactgta ccagatggct cctggaatgt cgacggaatt ggttcggcca aatgttcagt 1440 tagaggctac ggagacgtca atatctggac tgaagtaaat ggcgacagga aacccgccac 1500 catcaagaaa gtcttatacg tgccagggct aggcattaac ctattttcta ttgccgcagc 1560 cacagacctt ggatggaaag tcacatttgt tgacaccatt gtccacattt cctcagagaa 1620 aaaacaccaa aatcatagtt ggagaacgag taggcagaaa tttatatctc ttagccattc 1680 aagctcgata caaggaggag ccgacaagct ttgcactggc atcctctgtc tcacctgaca 1740 tttccacgtg gcaccgtcgc ctggcacatc taaactacta caccacctag caaataaaac 1800 cattccacca gaaccttgca acggttgcgc atttggtaaa caccaaagat caccttttcc 1860 catcggccgg aatcgagcaa cttacactgg agaaatgatt cacgccgacc tctgcggccc 1920 aatggaagtg gcaacaccaa atgggtccct ctactttgcc ctattcattg atgactactc 1980 aggatggcga ttcatctcat ttctcaaggc caaatctgaa gccgtggact gtttcaaaga 2040 acttatccat gtcatccgtg gagagacagg caacctagta cgcaccttcc ggacggacaa 2100 cggcggagaa tggtccagcc acgaattcgc cacctggatg acccacaaag gcatccgcca 2160 tgagacgagt gtcccacaca cacccgaaca agacggtgtg tccgaaagag gcatcagaac 2220 agtgactgaa ggcctacgaa gttgtcttta cgacactcgt accccattag aatcatggaa 2280 caccgaagtg gttaatggag cggtaaatct tatcaaagaa agtgtactcc cgaagtatct 2340 ctgggctgag gcagcgtcat tcaccgtcta tacactcaac cgcgtgttaa gcaaagtctc 2400 acccgtgacc ccttttgaag cctggcataa caagaagccc aacttatcgc acctgcgtgt 2460 tttcggctca atcgctttca tccatattcc aaaagctgaa agacgaaagc tggacccaaa 2520 gagcactcgt tgcatcttcg ttggttacag ccaaacccag aaggcctaca gattttggaa 2580 cccgacaaat agaacaatta agatcagccg ggacgtcacg tttgacgaac atcatcgtct 2640 tgtgggtata ccaggggaaa cccccgccta caacaaccag aaagcccact caccgaccct 2700 cgtccaagtc caggaggagc cattggcggt caccaacaaa gacaccacca ccagtggtca 2760 agaacaacct atcatccacc tagatagtga ccctcagact catctgaatg aagagatgca 2820 gcccaaccag cccaccgagc ccaatgctct tgatgaaatc gacgtagaac agcaaactca 2880 gccacgtcgc tccctcagag gtcgagtccc catccgagag tggaaggcat gcccggttat 2940 ggaccgaccc gaaaccgaca actcctacat tccgctcaac taccaagatg caatgaactg 3000 cccggctgcc aggaactgga agatagcgac tcaggaggag tatcagtcgc tgatcgacaa 3060 tggaacctgg tccattgtct cctgtccgaa agaccgaacg cccatcaaat cgcgctggac 3120 atttgacatc aagcctggat taaatggtga accacaacgc ttcaaagccc gattcgtagc 3180 taaaggctac agccaacgcc ccgggattga cttcaatgag acataagtcg tctcccatga 3240 caccttaagg atcctcttgt cagttattgc cgctgatgac ctcgaagcac tccagatcga 3300 tgtcaagact gccttcctct acggcccctt tgaggaggag attttcatgg accaacctga 3360 aggattcatc atcccaggcc aagaaagcaa ggtgtgccgt ctccacaagt gcatatacgg 3420 actcaaacaa gcatctcgtg tctggggaga cctcttcact gattttattg agcatcatgg 3480 ctttcaaagg agcgaagcgg acccatgcct cttttcccga gtacgcggaa cagaaagaac 3540 ttttctaacc acctgggtag atgacgtcat tgttgccagt aatcagcagc aggccattga 3600 cggattcctg atcgcccttg gtgaaaaatt ccagtttcga tcccaccccc tacaacgata 3660 tgtgggcatc gcaatagctc gcgatcgaga acagagaaaa ctccacatct cccaaccaga 3720 atatatctcc cagatagtga agaaatttca catgacatca tgttctccca agtcaatccc 3780 ggcagaccca aatgtccacc tagtcaaacc aactgagaaa aaggaagaag aaaatgcttt 3840 cccgtaccga gaagccgtcg gagctctact ctacctagcg ctggtatcac ggcctgatat 3900 atcattcgcg gtgggtcagg tggctcgttt tgttgagagt tacaacctct cccatgtcaa 3960 agctgttcgc caaatcatct cctacattca cggaacccct aaccatggaa tatgcttcgc 4020 tggatcaaac agaaaccctc tggttgggtt ttcagatgct gactatgcgg gatgccaaac 4080 aactcgacgc tcaacaactg gaagcgtctt catgtacaac ggaggaataa ttgcatggtg 4140 cagccgccgt caaacctgtg tggccctgtc caccaccgaa gcagagtacg tggccgctag 4200 tgagaccgcg aaagaagcga tctggattcg ccgcattcta ccaattttcc aacaggaccc 4260 agaagcgcca attgtcatca ggtgcgataa tcaaagcgca atccaacttt tttgccaccc 4320 tgaccaacgc ccaaagacga agcacattgc tatccgctac aacttcattc gcctacaaca 4380 agaaaatgga gagatcaaaa tggagttcac aagatcggcg gatcaacttg ctgacatcat 4440 gacaaaggcg cttccaagcc ccagttttca ttccatccgt gaagaactaa gcgtcgtacc 4500 gggattcacc taggacagca aaaaaaaaag gatccatcat gtccacctga ggaggag 4557 // ID Gypsy-11_IS-I repbase; DNA; INV; 4092 BP. XX AC ABJB010103780; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_IS_; KW Gypsy-11_IS-LTR; Gypsy-11_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4092 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010103780; Positions 20643 24734. XX CC Positions [3174-3407] - Integrase core CC 'GAAGT' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 241..3219 FT /product="Gypsy-11_IS-I_1p" FT /translation="MGRQARKIFQTFGLSQTQSQSYDTVRKKFDGHFVATK FT NVVYESASFHRRKQESGESVDQFVTDLHVLADRCDFAQFRDRMIRDRFVVG FT LADAKLSKTLQMDASLTLESALAKARLKETVRRQQQELRPHSDAATSVVVD FT DVGQRSGRTGALGNMHSRLPSLQQQQQWRLSEQHHQLCPYCGRSNHFRSQC FT PARSARCHHCNARGYFAALCRNKVEKPKVGVSSLEWEQPTYSVGSVGASKA FT RFVQVSINGVPVSAKVDSGAEVSVVPPSFPGAPPKLTPSSAQLTGAGNEHL FT RLKGTFESDIFWKGKTTRQTLYVVESAHAILLGLPAIEALGVVKFVDVLED FT KNYEQRYPQLFSGLGAWSGEYTIRLKANAIPFAIFTPRRIPIPLKEAVRKE FT LSKMEANRVIRKVDGPAELCAGIVPVLKLSGEVRFCVDLTQLNKSVLRERF FT VLPTLDDTLGQLAGATYFSKLDANSGFHQIKLARESQTLTTFITPYGRYCF FT RRLPFGISSAPEYFQKQMGEVLDGLPGVANMMDDILIFGATKTEHDDRLRA FT ILERLSTAGVLHNRAKCSFAVREVAFLGSVVGDSGIRPDPRKVQAITEMAI FT PSSVADMRRFLGMVNYLARSVPNLAEISAPLRELLLKDKDWCWGGSQQKSF FT DSLKEVISSADSVAKFILRLPTVVSADASSFGIGAVLAQVQPDGQRRSVVY FT LSRSLTQTDQRYAQMEKEALELTWVAERLECYLKGLDFQFETDHKPLVTLL FT GKSPVDVLPPRVQRFRLRLMRFSFSICYVPGKQIVTADTLSRAPRKETDFV FT LGELSSEDVSSFVAGCVSELKLSAHLDKIRLAQEQDTSCQQLLLYCRNGWP FT RFTDVSSWLRPYWSERANLTICDGVLMYKSRLVIPQKLRQEVVASLHEGHQ FT GIVKYRATARESVSWPGLRSELASECAKLRVQRAEPMLPSETPARPWQKLG FT ADLFHFRGKEYLVVVDYFSRFPELALLLSTTSALSLCN" XX SQ Sequence 4092 BP; 976 A; 998 C; 1126 G; 992 T; 0 other; tggtgtcaga agtgggagga tgcttctcgc ccgtgttgtt ttggcggccc tcgtcttgca 60 ctcggtttgc aagatggaag tacctccagc ggtccttcct ctggcagtcc gcccaaacaa 120 ctcaacttcg accggacttc agaatggccg gagtggattc aagaattcga cgatttccgc 180 tacgccgcgg gtctcaacga gaggtcggag aaagcccagg tacggacgct gctgtatacc 240 atgggaaggc aggccaggaa aattttccag acgtttggcc tctcgcaaac gcagtctcaa 300 agttacgata cagtgaggaa gaagttcgac ggccacttcg tagccacgaa gaacgtggtt 360 tacgaaagtg cctctttcca tcgtcgcaag caagagagcg gggagagtgt cgaccagttt 420 gttaccgacc tccatgtgct cgctgaccgc tgcgacttcg cacagttcag agaccgtatg 480 atccgtgacc gcttcgtcgt gggacttgcc gatgcaaagt tatccaaaac cctccaaatg 540 gatgccagcc tcactctcga gtcggcacta gccaaggccc gcctcaagga gactgttcgt 600 cgccagcagc aggagctgcg tcctcactcg gatgcagcga cctcggtggt agtcgacgac 660 gtgggccagc ggagtggacg cactggagca ctcggcaaca tgcacagtcg gctgccttca 720 cttcagcagc agcagcagtg gcggctctct gagcagcacc accagctctg cccgtattgc 780 ggacgttcga accacttcag gtctcagtgc ccggcgcgct ctgcgcgttg ccaccactgc 840 aacgccagag gatattttgc tgcgttgtgc cgtaacaagg ttgagaagcc gaaggtaggc 900 gtttcttcgc tcgaatggga acaacctacc tattctgtag gatcagttgg cgcttcgaaa 960 gctaggtttg ttcaagtgtc tattaacggc gtgccggtgt ccgcaaaagt agactctggg 1020 gcagaagtgt cagttgttcc gcccagcttc ccaggagcac cgcctaagtt aacgccctcg 1080 tcggcacagc tgactggcgc aggaaatgag caccttcgct tgaaaggcac gtttgaatct 1140 gacatttttt ggaagggcaa aaccacacgg cagactctgt atgtcgtaga atcggcacat 1200 gcaattttgt taggtcttcc cgccatagag gcgctcggtg tggtgaaatt tgtagatgtg 1260 cttgaagata aaaattatga gcaaaggtac cctcagcttt tctctggtct tggggcatgg 1320 tctggcgagt acaccattcg gttgaaggcc aacgcgatac ctttcgctat tttcacgccc 1380 aggcgcattc ccattccttt aaaggaggct gttagaaagg agttgtctaa gatggaagca 1440 aatagggtga tacggaaagt agatggtcca gcagaattgt gtgcaggcat agtgcctgtc 1500 ctgaaactct ctggcgaagt tcgcttctgt gtagacctga cacagttgaa taagagtgta 1560 ctacgcgaac gctttgtctt gccaacactg gacgatactt tgggccagtt ggcgggagct 1620 acgtattttt cgaagctgga cgctaactct gggtttcacc aaataaagtt ggcaagggaa 1680 agccagacgt taacaacatt catcaccccg tatggccgct attgcttccg gcgattaccg 1740 tttggaatca gctcggcacc agagtatttt caaaagcaaa tgggcgaagt cttggacggg 1800 cttccgggag tagcaaacat gatggacgac attttaattt ttggagccac aaagactgag 1860 catgatgaca gattgcgcgc cattctcgaa cggttgagca ctgccggcgt cttacataac 1920 cgggctaagt gctcgtttgc ggttcgtgaa gtggcgtttt tgggctccgt tgttggtgac 1980 tcgggtatca gaccagaccc cagaaaggta caggccataa cggaaatggc cataccttcg 2040 tcagttgcag acatgcgtag atttttgggc atggttaatt acttggcgag gtctgtgccg 2100 aatctcgctg aaatatctgc accactgcgg gagttgctgt tgaaggataa agattggtgc 2160 tggggcgggt cgcagcaaaa atctttcgat tcattgaagg aggttatttc ttcggccgat 2220 tctgttgcga aatttattct gaggcttccc actgtagtat cagccgatgc atcatcgttc 2280 gggatcggcg ctgtattggc gcaggtgcag ccagacggtc aaagacggtc agttgtatat 2340 ctttctcgtt cattaacgca aaccgaccag cggtatgctc aaatggaaaa agaagcgctg 2400 gaattgacgt gggtagcaga acggttggaa tgttatctga agggtcttga ctttcagttt 2460 gaaactgatc acaagcctct ggtgacgctt cttgggaaat cgccggtaga tgtgcttccg 2520 ccgagagtac aaagatttcg cttaaggctc atgcgttttt ctttctcaat ctgttacgta 2580 cccggaaagc agattgttac tgcggacacc ctttctcggg cgccccgcaa agaaaccgat 2640 tttgttttgg gggaactgtc atctgaggac gtttctagtt ttgtagccgg ctgcgtcagt 2700 gaattgaaac tgtcggcgca cctcgacaaa ataaggttgg cacaagagca ggacacctca 2760 tgccaacaac ttttgcttta ctgtcgcaat ggttggccac gcttcacaga tgtgagtagt 2820 tggttaaggc cttattggtc agaaagagcg aaccttacta tttgtgatgg cgtgctaatg 2880 tataagtctc gccttgtgat tcctcaaaag ctcagacagg aagttgtggc ttcattgcat 2940 gaaggtcatc agggtattgt caaatatcgc gctacagctc gcgagtcggt ttcgtggccc 3000 ggtctccgct cggagctggc gtctgaatgt gcaaagctcc gtgttcaaag agcggaaccg 3060 atgctgccct cagaaacccc cgctcggccg tggcaaaagc taggtgcaga cctttttcat 3120 tttaggggaa aagagtacct cgttgtggtt gactattttt cccgtttccc tgagctcgcg 3180 ctactgttgt caacgacgtc tgcgctgtca ttgtgcaatt gaaaagttct tttgccagac 3240 atggtatacc agagcttctt gtaactgaga atggtaatca atttatatct gcagagttca 3300 aggctttctc agaaagctac cagttctgtg acgtgacgtc gagccctcgg tacccgcaat 3360 ctaacggaga agcggaacgg aactttcaga ccgtgaaaag gctgctagag aaatcggcgg 3420 atccacactt ggcacttctg gcttaccgaa gcaccccagg ccctctcgaa aaaagcccat 3480 cggaattgct catgggcaga cgcctgcgct ctacgctacc tgtgcactct aaagagttga 3540 cgccgtgcac cgcaagaggc cttgcgagcc tcggaaggaa ggatgaggcg ttccgtggca 3600 aacaaaggcg caactacgac caccgccatg cggccaggcc acttccacgg ttgctacctg 3660 gagaccaggt gtgggtgaag gactgtcgag tccgggctac ggttttgaga ccagcctccc 3720 ggccaaggtc ttatatggtg caaacagaaa caggagcaga gctagagagg aaccgccgag 3780 gcctgctata taaggaacca gcacacgaaa caggcggttc cctctatgaa attccgcagg 3840 attccgggga aggggaaaat ggagagccag aaaacgaggc gaattcgaca gagccttccg 3900 cgctcaattc tgactcagca gctcaagaaa agagcggtac acattgcacg tacgtcacgc 3960 gttttggtcg acccatgaaa aggccgcaat tttatgggtg gagtagcatt actaagtttg 4020 ttttaggtgt tctactgttt gtttttgggg gcaatatagt tcaactgttg ccttaataga 4080 aaaaaaggga aa 4092 // ID R1-2B_AP repbase; DNA; INV; 5175 BP. XX AC Contig17211; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE Non-LTR retrotransposon. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-2B_AP. XX NM R1-2B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-5175 RA Jurka J.; RT "Non-LTR retrotransposons from pea aphid."; RL Repbase Reports 9(8), 1794-1794 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC The termini are approximate. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 722..1972 FT /product="R1-2B_AP_1p" FT /translation="MKKLRCSPDPVPVAREAIKWIRYTIESSATKKTNLPA FT EIQRAMFDKLKALDTAVHDMVISNLQLSSQLEEARRSAEICVGAAAAQFGT FT ELRLRETAHEQTLEAVVARYAEREALRQQELEARGPHAERVVTQEAPVETF FT AQVARTARPPTTTASRKPDRSVSRATKRNKALKEAKQVEHIPAFVIKSCGG FT KDPKEVRNVVWKKVASQQVQPNCHSTIAKDGIIKPLNRETADVLKSLAKTS FT TLIVEDSPRWPRVIFRGVQADARPDELQSAIISQNGHLNINEDTTEEILRP FT IFKQGKRDMDTTNWVMEVNPKYYDRFEDATVYIGFMRCKAAAFEEVTQCHI FT CLKYGHPASKCQGTEPKCTHCSRAGHKAVECPAAEGDPICANCRGKHSARD FT KTCSVRTAYLLGIAKRTDYGVSQ" FT CDS 1972..4983 FT /product="R1-2B_AP_2p" FT /translation="MSLRPLKIVQLNMERAAAVNDQLLAYCQETGVDIAMV FT QEPYTNRGKLTGFETAPIRCYLSKGTRRRGAPNYLDHGAAIIVFNANLVIV FT ARCVGTIENFVSIDLDCDLEGIVTLISGYFKYRVPTAVHVEALGPLHQDTA FT QEVLITLDANAFSTRWHSRINDRRGETLVTWLDDQNLRIANKRSPHMTFDG FT PRGRTNIDVTVCSDNIYGKLRDWTVIPDATASDHNLISFTMTLRAREFIHR FT ITRFNLLRRNHITFVQEYEARATRRAATDRDLDTMATQAIEDITYAAERHA FT SRSNRRRKTLPPWWSPELTDKRKEVRAAARHKATGGRQLYNEKRNEYTMLL FT RRNKISSWRNFCTLEGVQPWGKLYRWMKGGHKSLTAIGLMRLPDGSVSNSI FT DESVSILLNALIPNDPNTQVDARPVPEEGDMDPVTEEELKVHAWGLSPNRA FT PGMDSITARMVRVLWPVLAPRLVNIVNECLKTGQFPNNWKAAQVIPILKGQ FT DRDIALPMSYRPVSLLPVMGKIAEKVINSRLTEQIRPRLTGKQYGFTQGRS FT TSDAIQRLLTWSNLCLEQYVITIFLDISGAFDNLTWPALQRDLGSLGASKH FT MRHLIADYLRGRTATMTIGGVAKTVRVTKGCPQGSILGPVLWNVTVEALLR FT TDFPGHVNIQAYADDIAIYVSGPNRRTLIERSTTALLPVLAWARDRGLTFS FT AQKSVAMITKGALVPGFTLTFGDERIVSVDNVRYLGIRLDQKKSFTPHFEA FT LKNSSETLFSRLRGTIGAGWGLKRENILILYRGVFLPKIAYGAQFWAHKLK FT TLQTIKKFGSIQRRALLGMTSAYSTTSTDALQVLAGVPPLDIEIKWMVNKA FT VVALLPAHLRHDTMTAMRETLLDEWQIRWTNSLKGRWTFRFFPDIRARLLL FT PLSLGHEVVQFLSGHGNFRAKLATFNLQPVSMCACGMGDEYAEHVLFNCFL FT DADHRAVLELAVHRSGHLWPCNPATLVSTAKLYAALVKFSKVAAYMERP" XX SQ Sequence 5175 BP; 1430 A; 1296 C; 1392 G; 1057 T; 0 other; gtgcgcgaac agaccggtgt cgattttgag tcggtctgct taaaacctcc cccgtggggt 60 ccatttttgt gtaatcccat aagcaataca cggccgcccg ccggtattgc cgagcgattt 120 ttggatttta gtcagctacg agaccaggtg tcaagagcct gacgaggtac gttcggaatc 180 gttattacgt aattcagact taacttaagt cccccaccaa gtggctggta taggtggtat 240 actaccccct ccttaaattt tcaagtggcc ataactccta gggtttaagg ggcacagccc 300 ctggaccaag gggaaaccca ttttctgggc gaaccgcata ttctcccacc gttgggaact 360 aagcccccac ccccccccga ttttgagtta agcataactc cgcggagaaa cgccgcagaa 420 ccccaggggc aacgactagt ggtaccccta gtgcagggct acagttttcc gtgataggga 480 acggggcaac cccccacccc ccattttggt ttttgacttt taggtcagat acccatccag 540 gaggcctcta gcacccgtac ccccgcgggg aaactcgggg ttttaaatcc cccacccccc 600 agagggaaga actgtgaaca atggacgaga agattcccag ccaggggagc ttccacaacc 660 agttccgggc actagccaac aggttctgag gaccccaccg tcggcaggct cctcgcctgg 720 aatgaagaaa ctcagatgct caccagaccc agtgccggtt gcccgagagg ccatcaaatg 780 gatccgctat actatcgagt cctccgccac aaaaaagacg aacctaccag ccgagataca 840 acgagctatg tttgataagc taaaggccct ggacacagct gtgcatgata tggttatcag 900 caatctacag ctctctagtc aactagaaga ggctaggaga tcggccgaga tttgcgttgg 960 agctgccgca gcgcagtttg gcacggagct caggctaagg gagaccgctc acgaacaaac 1020 ccttgaggcc gttgttgcca ggtatgcgga gagggaggca ttaaggcaac aggagttaga 1080 ggcgaggggc cctcacgctg agcgtgtcgt cacgcaggaa gcacctgtgg agacctttgc 1140 acaggtggca aggaccgcca ggccgcccac aaccacagcc agccgcaaac cggaccgctc 1200 ggtatccaga gccacaaaaa gaaacaaggc gctcaaggag gcaaagcaag tcgagcacat 1260 cccggcattc gttataaaat cgtgcggagg gaaggaccct aaggaggtaa gaaatgtggt 1320 atggaagaag gtggccagtc aacaggtaca gccaaattgc cactcaacaa ttgccaagga 1380 tggcatcatc aagcccttaa accgagagac agcggatgtg ttgaaatctt tggcaaagac 1440 ttcaacactc atcgtggaag acagcccccg atggcccagg gttatattca ggggagtcca 1500 agcagacgcc aggccagatg aacttcagag tgcgatcata agccaaaacg gccacttgaa 1560 catcaatgag gatacgacag aggagatcct taggcctatc ttcaaacagg gcaaaagaga 1620 catggatacc acaaactggg ttatggaggt aaatccgaaa tactacgaca ggtttgagga 1680 cgccacggta tacataggtt ttatgaggtg caaggccgca gcatttgagg aggtcacaca 1740 atgccacatt tgtttgaagt acgggcaccc ggcttcgaaa tgtcaaggga ccgagccgaa 1800 gtgtacgcac tgctccaggg ccggccacaa ggcagtggaa tgtcctgccg cagaaggaga 1860 ccctatctgc gcgaattgta gaggtaagca cagcgcccgc gataaaacct gctcggtgag 1920 gaccgcatac cttctgggaa ttgcgaagag gactgactac ggggtctcgc aatgagcctc 1980 agacctctca agatagtgca acttaatatg gagagagctg cggcggtcaa tgaccaactc 2040 ttggcctatt gccaagaaac gggtgtggac atagccatgg ttcaagaacc gtacacaaac 2100 aggggcaaac tgacgggttt tgagacagcc ccgataaggt gctacctctc gaagggtact 2160 cgacgaagag gtgcgcccaa ttacttggat catggcgccg caataattgt ttttaatgca 2220 aacctggtga ttgttgccag atgtgtggga actatagaaa acttcgtctc catagatctc 2280 gattgtgact tagaaggcat tgtgacccta atcagcggat attttaagta ccgagtccct 2340 acagctgtgc atgttgaagc attgggaccc cttcaccagg atacagcaca ggaggtactc 2400 atcacgcttg atgccaatgc gttttcaacc cgatggcaca gcagaataaa cgaccgaaga 2460 ggggaaactt tggtaacgtg gcttgacgac caaaatctac gcattgcaaa taagcgcagt 2520 ccacacatga ccttcgatgg accccgaggt cgtaccaata ttgatgtcac agtctgcagc 2580 gacaacatct acgggaagct cagggactgg acggttatcc cagatgcaac ggccagtgat 2640 cacaatctaa tatcctttac catgaccctg agagcaaggg aattcattca ccgtataact 2700 cgtttcaacc tcctgaggag aaaccacatt accttcgtac aggaatatga agctagggca 2760 acacgaagag ccgcaactga tcgtgatttg gataccatgg ctacccaggc aatcgaggac 2820 atcacatacg cagcagagcg gcatgcctct aggagtaacc gtagaagaaa aacactacca 2880 ccctggtggt ctccggagct gactgacaag agaaaggagg tacgtgcagc agcaagacat 2940 aaagccaccg gaggacgaca actatataat gagaaacgta atgagtacac aatgctgctg 3000 agaaggaaca aaatctcttc gtggaggaac ttctgcacac ttgaaggggt tcagccttgg 3060 ggcaagctgt atcgctggat gaagggcgga cacaagtccc tcaccgctat agggttaatg 3120 aggctcccag atggttcagt ttcgaacagc atcgatgaat cagtgtccat cctgcttaac 3180 gccctgattc cgaacgaccc aaacacccag gtggatgccc gcccagtgcc cgaagaaggg 3240 gacatggatc cggttactga ggaggaactc aaggtccacg catggggcct gtcgcctaac 3300 agagctccgg ggatggatag tataacagcc agaatggtca gagttctctg gccggtgtta 3360 gccccgagac ttgttaacat cgtaaatgag tgcctgaaaa ctggacaatt cccaaacaac 3420 tggaaggcgg ctcaggtaat accgatatta aagggacaag acagggacat agcattacca 3480 atgtcatata gaccagtgag cctgttgcct gtaatgggga aaatagcgga gaaggttatt 3540 aactccaggc tgaccgagca gataaggcca agactaacag ggaagcaata cggtttcacc 3600 caaggccgtt caacgtcaga tgcgatccaa agactgttaa catggagcaa tctctgtttg 3660 gaacagtatg tcatcactat atttctcgat atatccggtg cgtttgacaa cctaacgtgg 3720 ccggcgttac agcgcgacct gggtagcttg ggtgccagca aacacatgag acatctcatc 3780 gccgactatc tgagaggtcg aacggccacg atgaccatag gaggagttgc caaaacggta 3840 agggtaacga aggggtgtcc acaaggctca attctgggtc cagtgctgtg gaatgtgacg 3900 gtggaagcgt tgctccgaac cgatttcccg ggccatgtaa acatccaagc atatgcagat 3960 gacattgcca tatatgtatc agggccaaac aggcgaacat taatagagag gtcaacaaca 4020 gcgctactac cagtcttagc ctgggcccgg gatagggggc tgactttctc tgcccaaaaa 4080 tcagtggcaa tgatcactaa aggtgcactc gttccgggct ttacactcac cttcggtgac 4140 gagagaattg tttctgtgga taacgtcaga tacctgggca tacgtctgga ccagaaaaaa 4200 tcgtttacgc cacatttcga ggcactgaaa aattcgtctg agactctgtt ctcaaggctg 4260 cgaggaacca ttggcgcagg atggggactt aagagggaaa acatattgat tctttatcgt 4320 ggtgtattcc tgccaaagat tgcttacggg gcgcagttct gggcacacaa attaaagaca 4380 ctccagacaa tcaaaaagtt tgggtccata caaagaagag cactgcttgg catgaccagt 4440 gcatatagca ccacttcgac ggatgcatta caggtgctcg cgggagtacc accgctggat 4500 attgaaatta aatggatggt gaacaaggcc gtggtcgctc tcctcccagc acatttgagg 4560 cacgatacaa tgacagccat gagagaaacc ttattggacg agtggcaaat aagatggacg 4620 aactctttaa agggcaggtg gacctttcgg ttcttccccg acatcagagc taggctactg 4680 ctcccccttt ccttgggaca tgaagtcgtg cagtttttgt ctggacatgg gaatttcagg 4740 gccaaactcg cgaccttcaa cttgcaaccg gtttcgatgt gtgcatgcgg catgggagat 4800 gagtacgcgg aacatgtgtt gttcaactgc tttctagatg ctgatcacag ggcagtcctg 4860 gaacttgcgg tacacagatc aggtcaccta tggccgtgta acccagcaac actggtttct 4920 acagccaagc tatatgcggc gcttgtgaag ttttcgaaag tagccgctta catggaacgc 4980 ccgtgagacg gagaatttgc aggacgggca agacactctg cgcgccagag tgccaaaccc 5040 cgcatgagtt ctgacgagtg tcaaggagag agaaggcaga agcggttcac atgtggggcc 5100 gtcctgaggg gacactcgtg cgtcggggac gcgggcggcg ctgacgctga caacagtggt 5160 tcgtggacaa ccgga 5175 // ID Helitron-1_HM repbase; DNA; INV; 12699 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Helitron DNA transposon from hydra - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-12699 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2056-2056 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1786..6279 FT /product="Helitron-1_HM_1p" FT /translation="MIFLVIKFFYFIVNFRLVKKKEIISVALINMFNENDD FT IEMDDRLDVVLYLQDDVSAIRRVENNRRRNERNRLRRDEINCLQNERNRLD FT RDNVNRRQNERNRLDRDNVNRRQNERRAVGQGRMYSIARSNAIPDYNYLGE FT MNQICQHCGAKKFPDETHFLCCHNGKVALPPLSPITQALQDLFTGSYVDCN FT ANANFLKHIRNYNACLSFASFTANVVQPMNHGLPCFKICGQIFHRVGNLRP FT DQDIPPIYSQLYIYDPLAALNFRMQHYANDLCLRDLMFQLQTIIMEQSPFA FT LAFKNMAEVEDEEIRKAAIEGRSASVVKMSLLEGGDRRRYNLPSHDEVAVV FT FVGEDGAPPTSREVVIYPRGHPLKIVSSMSANLDPMVYPLFFPRGDAGWHN FT QLVHNPERATLVRNHVTLSQFYNYRLSVRQFFCSLFYGKKLFQQYAVDAYV FT KIEGQRLAFIRNNQNKLRSEQYDALHEHINNIANDRNIRPGRVVVLPSSYV FT GSPRALKENFEDAMAIIKKYGKPDLFITFTCNPKWREITENLYPGQNANDR FT PDLVTRVFKLKLNNLLNDIFKHGVLGKVVTHVQVIEFQKRGLPHAHILLHL FT ANDDKLETSQDIDNLICAEIPDPIVNCELYDIIKTCMIHGPCGILNPNSPC FT MKDGVCSKKYPKDFNANTVAVHNGYPRYRRRDNGLVINIKGNNVDNRWVVP FT YNPWLSKKYQAHINVEACMSVKAVKYLYKYIYKGHDCANVLINEQVNHDEI FT NTFLDCRYVSAPEALWRIFEYPISHMSHSIIRLKVHLPENQIVYFREGEEQ FT VALDRAAQRDTHLTAWFKLNSENEGANRYSYVDIPYHFVFDDKHCKWKVRQ FT RGGNKVIVRMYKVSPTGELFFLRLLLLQAKGAKSWEDLRTVNGIVLETFRE FT ACVFNGLLQDDTEWQNTLSEAVLTRMPKQIRQLFSIILTFCEPDDPLHLWN FT SYKAFMMEDFIHRQVPFILAEQATLLQIEKIINQSGKTLSDYNLPVVDEFI FT DFNLENLNDNVQQSIDEANRMRPLLNVNQLNVSNAVLAALNEQPCVENQHS FT RLFFMDGPAGSGKTFTYNYLIAEMSSRGVKSATAAWTGIAATLLTNGSTLH FT GLFKLPVPILDNSTCNVTPNSIQGQFLRQVSLFMLDETSMIPKHALNAIDR FT LLKDVCNNNFPFGGKVILFGGDFRQILPVVKRGRPAEVVESCIKCSLQWQW FT VQKFTLTENMRVRDGEGDFSEWLLKLGSGTIPGKEEDPFKGCIEIPQQCII FT RENESIVEKIFGDAQQDDYAKRVILTPTNVDSLSINEEVLERLHGEVKTYL FT SADQIDTDDLNEINNFPVEFLNSLTPSGMPTHCLKLKIGCVIMLLRNLDLK FT AGLCNGTRMKVCALQNNYIDAEVLTGVSEGKRVFVPRIQLAPSDSNLPFVL FT KRRQFPVRLAYSMTINKSQGQTFDRVGVYLKKPCFSHGQLYVACSRTRAFN FT SLFFKIDKHPIQGMVGEKYYTNNVIFSNVLNL*" XX SQ Sequence 12699 BP; 4239 A; 1530 C; 2012 G; 4913 T; 5 other; ctataatgaa tagcctatga atattattaa tagtgcgcat agttacgcat tgttacgcct 60 attgttcgtt ttgttaatta cggttctgca ttatgtaaaa tacactattt ctttgcctct 120 agaattttct aagcgcagtt atatattcca tgcggtcatt cgagtaaaat cagcaatcaa 180 attggagctg atacatttat tttcaagaag aattaatgtt gtttattttt taaaataaaa 240 atttaaataa ttgttagttt gctatgtata tatatatata tatgtatatc tatatgtatt 300 ttctcttagc aaaaataatt gagaatagaa aaataaaaaa aatttcttct ggaatttgtc 360 aaaagggtgt tgtgttattc ttcattttaa ggtatctaaa tatgtattta tttttcattt 420 tttagaataa gttagaacta aattgactgg taaaatagtt tttttagtac aatgtttctt 480 ttttaagaaa acagttttgt cgttagtctt tttaaaacat tttaagcagt atttgtttta 540 ttttttagag cagttaagag aagctgatcg cttttttagt aagttcaagt tattttctca 600 tacataaatt aaggttttcg atttatttca gtctatttta tatttctcta tatctacata 660 tatatattct ctaaggaaaa ataataagga ataaaaaaat taagaaaatt tcttctggaa 720 tttgtcaaaa gagtgttgtg ttatgcttca ttttaaggta tctaaatatg tatttattag 780 aaattttttt aaaccaagtt taaaaaagtt gttaaaccaa ggaaattgta gaggtttatt 840 tatcaatata tactataaat tatataataa tattggaggg agtaagttaa aaagcatttt 900 taaattttta aaaagaaata aaaatacttt tcaatatata tctgtttttt tttgaggtag 960 atttaaagtt tcaggttttc aaaattgtag atctttttaa tgtataaata gacattaagt 1020 atttctaagt agacattacg tatatataaa atcttttaaa atttaataag taattatttg 1080 tttaaataag tagttaattg ttataattta ttaagattta aattattaga aaattcatgt 1140 ttacagagtt gttattaatc attatcgagt tgttgaacca aagaaattgt agaggtttat 1200 ttatcaaata tatactatat atattgtaaa aaaaattttg gttttcatat gttttcacta 1260 ataaatagtt ctgtatatat atatatatat atatatatat atatatatat atatatatat 1320 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 1380 atatatatat atatatatat atataactat atatatcctt ggagttgcgg cagcataata 1440 gtgtgttagg agaaaaaaat ttataaaccc gcgttaataa cctgttttct atcgtacata 1500 atgcattaat aatgcattaa taagttttat tacgtgtttt gttcaatgtt ttctcccaac 1560 gcatgaacta tgacgctgtc gcaatttttt tttttttttt tattagaaat aaaataagtt 1620 ttaataagtt tttgctgatg tggttgcagg tttttcttaa cttctatcaa ataaacttaa 1680 gaaacctgta ttacttttga tgatggcaaa ttaattttat tttgcatttg tttggtttct 1740 ttatatatat ttgttttgaa aatttttttt tttaattttt cgagtatgat ttttttagtt 1800 attaagtttt tttattttat tgttaacttt agattagtaa aaaaaaaaga aataattagt 1860 gttgctttga taaatatgtt taatgaaaat gatgacattg aaatggatga taggctagat 1920 gtagttttgt atttacaaga tgatgtaagt gctataaggc gtgttgaaaa taatcgtaga 1980 cgaaatgaaa gaaatagact taggcgagat gaaattaatt gtttacaaaa tgaaagaaat 2040 aggctcgatc gagataatgt taatcgtcgc caaaatgaaa gaaataggct cgatcgagat 2100 aatgttaatc gtcgccaaaa tgaaagaaga gctgttggac aaggacgaat gtattctata 2160 gcgagaagta atgctattcc agattataat tatttaggag aaatgaatca aatttgtcag 2220 cactgtggtg ctaagaaatt tccagatgaa acacattttc tatgttgcca taatggtaag 2280 gtagctttgc ctccactttc accaattaca caagctttac aggatttgtt tacagggagt 2340 tatgtagatt gtaatgctaa tgctaatttt ttaaaacata ttcgaaatta taatgcctgc 2400 ctttcttttg cttcttttac agcaaatgta gttcaaccaa tgaatcatgg gcttccatgt 2460 tttaaaatat gtggccagat ttttcatcgt gtaggtaatc ttaggccaga tcaagatatt 2520 ccaccaatat attctcaact atatatttat gatccacttg ctgcactaaa ttttagaatg 2580 caacactatg ctaatgatct ttgtttacgt gatttaatgt ttcaattgca aactataatt 2640 atggaacaaa gtccatttgc tcttgcattt aaaaacatgg ctgaagtaga agatgaagaa 2700 attcgtaagg cggctataga aggtcgctca gcttctgttg taaaaatgtc tttacttgaa 2760 ggcggtgata gacgtcgtta taatctacct tcacatgatg aagttgcagt tgtatttgtt 2820 ggcgaagatg gtgctccccc aacttccagg gaagttgtta tttacccaag aggtcatcct 2880 ttaaaaattg tatcaagtat gtcagccaac ttggatccta tggtttatcc gctatttttt 2940 ccaagaggtg atgctggttg gcataatcaa ttggtgcata accctgagcg tgccacactg 3000 gttagaaatc atgttacatt atctcagttc tacaattata gactttctgt taggcaattt 3060 ttctgttcat tattttatgg caagaaacta tttcagcaat atgctgtaga tgcatatgta 3120 aaaattgaag gccagcgtct tgcttttatc agaaataatc aaaataagct gagaagcgag 3180 cagtacgatg ccttacatga acatattaac aacattgcca atgatcgtaa tattagacca 3240 ggacgagtag tagttttgcc atcatcttat gtaggaagtc ctagagcttt aaaagaaaac 3300 tttgaagatg ctatggcaat tattaaaaag tatggtaaac cagatctttt tataactttt 3360 acttgcaacc caaaatggcg tgaaataaca gaaaatttat atccgggtca gaatgcgaat 3420 gatagacctg atttggttac tcgagtattt aaactcaagt taaataacct cctcaatgat 3480 atattcaaac atggtgtatt aggcaaagtt gtaacgcatg ttcaggttat tgagtttcaa 3540 aagcgtggtt tgccgcatgc ccacatttta cttcatttag caaatgatga taaacttgaa 3600 acatcacagg atatcgataa tttgatttgt gcagaaattc cagatccgat agttaattgt 3660 gaactttatg atattatcaa aacttgcatg attcacggcc catgtgggat actgaatcca 3720 aattctccct gcatgaaaga tggggtgtgc agcaaaaaat atcctaaaga ttttaatgcc 3780 aacacagtag ctgttcacaa tggttatccg cgctataggc gtcgtgataa tgggttggtt 3840 atcaatatta aaggcaacaa tgtagataac cgttgggtgg taccttataa tccttggtta 3900 tcaaagaaat atcaagccca cattaatgtt gaagcttgca tgtctgtaaa ggctgttaaa 3960 tatttgtata aatacatata caaagggcat gattgtgcga atgttttgat aaatgaacag 4020 gttaatcacg atgaaataaa cactttctta gattgtcgtt atgttagtgc acctgaggca 4080 ctgtggcgaa tatttgagta tcccataagt catatgtcac actctattat ccgtcttaag 4140 gttcatcttc ctgagaatca gattgtgtat tttagagaag gagaagaaca agtggcgtta 4200 gatcgtgctg ctcaacgtga tacacacctt acggcctggt ttaaattgaa ttctgaaaat 4260 gaaggagcca atcgctattc atatgttgac attccatatc actttgtatt tgatgataaa 4320 cattgtaaat ggaaggttag acaaagaggt ggaaataagg tgatagtaag aatgtataaa 4380 gttagtccaa caggagaatt attttttctt agattgttac ttttacaagc aaaaggagca 4440 aaatcttggg aggatttgcg tactgttaat ggaatagttc ttgaaacctt tcgtgaagcg 4500 tgtgttttta atggtttgtt gcaagatgac actgaatggc agaatactct ttctgaggca 4560 gttttaacgc gaatgcctaa gcaaatcagg cagctgtttt caattatttt aaccttttgt 4620 gaacctgatg accctttgca tctttggaat tcatacaaag cttttatgat ggaggatttc 4680 attcatcgcc aagttccatt tatattagct gaacaagcta ctcttcttca aatcgaaaag 4740 attattaatc aaagtggtaa aacgctatct gattataatt tgcctgttgt tgatgaattc 4800 atagatttta atctagaaaa tttaaatgat aatgttcagc aatcaattga cgaagctaat 4860 agaatgagac cgcttcttaa tgtcaatcaa ttaaatgtaa gtaatgctgt ccttgctgca 4920 ttgaacgagc aaccatgtgt agaaaatcaa cattccagat tgttttttat ggatggacca 4980 gccggtagtg gcaaaacttt tacatataat tatttgattg ctgaaatgag cagtaggggt 5040 gttaaatctg ctacagctgc atggactggc atagcagcaa ctcttcttac aaatggatct 5100 acgctgcacg ggttatttaa acttcctgtc ccaatactgg ataacagcac atgtaatgtg 5160 actcctaact ctattcaggg acaattttta aggcaagtta gtttgtttat gcttgacgag 5220 acatccatga ttcccaaaca cgctttaaat gcaatcgatc ggttgttaaa agatgtttgc 5280 aacaacaact ttccatttgg aggaaaagtc attctttttg gtggtgactt taggcaaatt 5340 ttgcctgttg tgaaaagagg gagaccagct gaagttgtag aatcatgtat aaaatgttct 5400 ttacagtggc aatgggtgca gaagtttaca ttaactgaaa atatgagagt acgtgatgga 5460 gaaggagatt tttccgaatg gctgttaaaa cttggtagtg gaacaatacc tggcaaggag 5520 gaagatcctt ttaagggatg tattgaaata ccgcaacagt gcattattag agaaaacgaa 5580 tcaattgttg aaaaaatatt tggagatgct caacaagatg attatgctaa acgtgtcatt 5640 ttaacaccca cyaatgtgga ttcattgtca atcaatgaag aagtgcttga acgtctacat 5700 ggagaggtca aaacttattt aagtgctgat caaatagaca ctgacgatct taatgaaata 5760 aataatttcc ctgttgagtt tttgaacagc ttaactcctt caggtatgcc tactcattgt 5820 ttaaaattga aaattggttg tgtaattatg ctacttagaa atttagatct taaagctggg 5880 ctatgtaatg gaacccgaat gaaagtttgt gctctccaaa acaattatat tgatgcagag 5940 gttttgacag gtgtttctga aggtaaacgg gtatttgttc ctcgaattca gttggctcca 6000 tcagattcta atttaccttt tgttctaaaa cgtcgtcagt ttcctgtcag attagcttat 6060 tcgatgacaa tcaataaaag tcaaggtcaa acatttgata gagttggggt atatctaaaa 6120 aaaccgtgtt tctcccatgg tcaactatac gttgcatgtt caagaactag agcatttaat 6180 agtttgtttt tcaaaattga taaacatcct attcaaggta tggtaggtga aaaatactac 6240 acaaataatg ttatattttc taatgttctt aatttatagt cttaatattt tcatttctaa 6300 aatttactgt ttgtaggttg tgagcacatt ttgtatgtgt aactttaata tttttatttt 6360 tttcatagtt tttgataaat atatttgtca tttaatattg ttatatattg tgtttatatg 6420 tgttatatga agtcatattt gtttatagtt gtatatatgt ttacattatg ttgtttccat 6480 gttttcaaag tggtgtgact tggcaaaatg ttaagcaaga ttgttaacct tgccagccaa 6540 gcggtgtact tatagcagag gtgtgacttg gcaccatttt gtgccaggcg gtgttactgc 6600 tttagagcta ggttgtgttg tctagctttt tttaattatt ctggaaacac ataatgtctg 6660 gtaatgtttt gtgtaatgtg tataaaagag tatgcaacct tgatccttaa atttaaaata 6720 attgaataac taatattgaa tataaaagca aatatactat attaaagttt agtaaaagga 6780 taaatataga taactcaaaa ttggtttttt agttttgcct ttcatattct tttaaaattg 6840 gtttgttatt ttccactgga ttttagagtc aagggtgtaa atattagtta tataggtgtg 6900 aatatgtgtt aatgttatgt gttattgtta taatgttgta atgtattgtt ataatgtata 6960 atattgttat attgttgtga tgttttgtgt tgatagacag cttggcttac cttcccgttg 7020 gtgtctatcg ttaaaaatga ataatagata gtttcattat tggaacttgc tgcttttatt 7080 catttactaa aagccaagac aaaatcttgc taaaaattca atttatgtct ttatttgatt 7140 tttttttctt tagtattgtt aatcctttag tacttcgaaa aagattttat agctactttt 7200 ttctataaat gtttatatta actgcaatat caaaaggtta caaaaacagg taaatcggtt 7260 ttcagtgttt atatgtggtg tatatatata tatatatata tatatatata tatatatata 7320 tatatatata tatatatgat gttataattc cgttactttt ttttatttta gattttacca 7380 tatgacgtat taatacaata ttaaaaaaaa aaacttgatt tcctaaattt gtaaaatttt 7440 gttcgtttgt taaaaagata taaagacatt aatatatata tttttttccg atgttttccc 7500 tatcttatgc tttaataatt cgttgttaaa aacacaggga gacttgcagt ttgtcaaata 7560 ttaagttgat tagtttagtc tgcaattgca acacttttga ggaaagaaaa aattcttata 7620 tatgtaaatc gttgttttac gctttattat tttggtaaaa ggtcttcatc ataaatagat 7680 tactcttaat aaaaaaaact ctactcacat ggctaccatt aaaacaaaaa aaggggaggg 7740 gctcttaatt agcggagatg tcaaaatatt gacgaaaccg aaacttcgag taataaaagg 7800 aatatttagt aaatggcagc aaattacttt accgtttgca aaaaattatt gaaataaaga 7860 ggccatctat tcaatataga aataatcaga gtttagaagg cttaacatga ccgggtccgg 7920 ttagctttgt ttgcaaacat gagattatgt cacgttaaca acctcaaaag tcaatttaaa 7980 acatgtttag tagcaaaatg tgatatatac aagtataaat gtgatagtat agtaatactc 8040 ttttatgatt ttttagctaa taaaatggat ttaagactag attgattttt gtttcctgtt 8100 aaaccgcagt aaataaaaat tagaaggcta tttggccccg acagaatatg cgacaagttt 8160 gtattaaact catttacaca aaaatggaat taagacaagt tagctcctaa aattatctat 8220 atgtacagat atgtttattg cacttattag taaataaact gattagttgc gtaatgagtt 8280 taaattttag ccatttttta aagttgataa tcataacgta cttaattaaa catgcgtgat 8340 gtaatttgca tagctaaatt aaatgctctt ttactttagt tactgcgcta agtgttgcgc 8400 agtttggttt gtgcgcagtt tttcttctgt tctggaattt tgagattgga atttccagac 8460 ttattttaaa aagcgcgttt tcgttataaa ttttttttac atctaatcgt acctatcaag 8520 taggtcatat tcagtataat caattcaatc gttttcattt tttatgtttt aacatgtcta 8580 ttacctttat taaatttgtt tttaagcata tttgttttta ttttttatat cgaattaata 8640 gaatgaatca gattgaagat gaaggtttgt atttatttag atgtgtatta tgttacagat 8700 gtataatgtt atgttgttag ttcttaaata tatgtttatt tttagttgct gaattactaa 8760 gaagtaagtt ttctttcatt ttgttattaa caaatgttat aaattgcatt tatatgtttg 8820 ttaatttaaa ttaactttac aatattaaaa tggtttaaac gaacttaaac gaacttttgt 8880 aaatgaattt aatcctcaaa gattaaattt tttttattat tgaagaaatt aaactagatt 8940 taagaatgag tcgacttaaa tgtttgtttg tttttctatt tagatgatga caatattgaa 9000 gcaagtaagt taaaaagcat ttttaaattt ttaaaaagaa ataaaaatac ttttcaatat 9060 atatctgttt tctttgaggt agatttaaag tttcaggttt ttaaaatcat atatattata 9120 agatttaaga atgagtcgac ttaaatgttt gtttgttttt ctatttagac gttggtgata 9180 atattggaac aagtaagtta taaagcattt ttaaattttt aaaaagaaat aaaaatactt 9240 tttaatatgt atctgttttc ttcgaggtag atttaaaatt tcaggttttt aaaatcatat 9300 atattataat attaaattct tttttacttt aaaaatatcc atattatagt tacactgagt 9360 cgtagaaaaa aaattaaagt ttattaataa ggtagtagat tattagagta gattaaattg 9420 tttgatctat taatcaactt aataacgtac tattaaagtr aagatttatt ttggttgttt 9480 gttaaaattt agttagagtt tatttaaaat ttaaattctt tagttaatca agagagatct 9540 aataaaggta ttaattacat tagttgaatg tagcatttaa tatgtatttt atcgatattt 9600 taacgtctaa acattaaagt gcttttattg tttagataca gaagtgaaga aggaaataag 9660 tcagtgtttt tgctattaga tatttgttaa taggaatttg tttatttgta ctgccactgg 9720 aattttaaaa aatcaagttc tttagctttt ttttatactt tcttttaggg gagttggaac 9780 aaaaactaag tataaattat gattagttta gtttaataaa taaagcatta atatttagaa 9840 gatttaaagt ttactttagt ttatagtttt ggtttagttt atagttttgt ttttagcttt 9900 agttattttt ttcttacttt agacaacttt atggatgaaa tgagggggat gggtaaaaca 9960 attatttaga tatgtatata tttatatgtt aacattattt tatcgattat tatatattta 10020 tttaagtata tatttgtctt tagtgtcagg tggaagtata gctggtagga ctgtgtttta 10080 tattttatat attttgtatg gagaaatata tgcaataggt gaaaaaatta attaataaat 10140 atgacacttt tgaactggtt gcattattgc cctatatatt tattaaaata caaaatattg 10200 catgtttaac ccttattaat ttgctctytt tttttttttt tttagcatct tctgtttcga 10260 atgaaaatgg tgaggttatg ctaatttttt tttttaataa ttgcaaatgt ataattttta 10320 ttatactctt ctatttgtaa ctttagattc cttatcattc gaacawtacc aaaattatcg 10380 ctcccaacac ccgggacctc tctcagagag gtccttcaat aaatggaaac gtaatggcgg 10440 atggatgagc cctcctttaa atggagggct catccatacc ggggtgttgg gagacaccga 10500 agaaaaaaca gcaaccaaaa ggttgtaaat gtcttctatc agtagagata tttatgactt 10560 tttagttgct gctttttctt cgtttttttt ttttttttaa tttgtaattt tttgtttcgg 10620 tgtttttttt tttttaattt tgtaattttt ttttaaatat atattacgtt gtatatttga 10680 tttgttttaa tcttttttaa taaaattttt aattgttttc tcaccaatta aacctgactt 10740 caaatttaaa gccctcttaa atacccctcc ctaaattcta aagtcgaaag ttgaattgat 10800 tcattatcca attatcataa tataatgatg ctgcattctt ttgttaaaaa caattatttg 10860 gtactgcttt taaaatttac caaaacccaa ggcaatatat ttagctatat aaatacaatt 10920 tatttatctc gattctattc tttatctttt atgtttatag tttggctcct taaagcattt 10980 tatgttttgc tctttttatt atgaatattt taaatccgtt gtgatgtctt ttaaattaaa 11040 ctaaagttta ctaaaagcca acataaaaat tacgtttttg aacaacacca aaaagaacag 11100 ctgcatccct acagcaactg ctgcagtatc acaataggtt accaaaagca ggtatactgt 11160 tttccagttg ttgttttttt ttcatatata tctacttatt tatataatac tatagtacga 11220 ttgtttatat ctcattcatt tttttttctt tatctttaag tgttaataat tttattcttt 11280 gaatgatatt agttaaacaa gtcggctttt cattaaagat attggtttca aaattgaaca 11340 tcattggcaa aatgcttaca aaattaggta gtttttaaaa atctgttttc tatttaatac 11400 actttttttt aaatatactt ttttaatata cttttctata acaattctac taatatggaa 11460 tcacgtagct ttttaaatat gtttaattgt cgtccctatc ggtaataaat attttaaatt 11520 atttgatatg gttttcttta caataggcta tgcttggctc ctcggtgctg taaatggtgg 11580 cttcaacttt gtttttttaa gaaaaatgtg ttttatatat gcttctgttt taaaatattt 11640 ttgttacata tttaactata tatcatgtaa ataaatttct tttaaagata agtaaacttg 11700 atacttgctt ttttacggat tttcaaaaag tgaacaaagg tttaaatata ttttttaact 11760 gtcttttatg attttatcaa ttttattaaa gctgaataat tacttcaaaa atattgagta 11820 aaatccaatt gcttttatta ttaacctgat ctatacaaaa attaatgaaa taaaactaat 11880 aatctaacgc caagttaagg caaaatgtat tttataaaat tagtataaat atggattgga 11940 tgtatgctga aataaatatt gttgatacac gttagctatt atagtaaatc catatattcc 12000 tttgtttttt tataaacaag tttctgctgg attactaatt aaaaaaaagg ttggtttaaa 12060 taataatctt tatcataata tataggatcc gtatgcaggg taagcccttt tcggaattgc 12120 gatatatttt tcaacttata gtttttccgg atatccaact tctatgccta caagtttggg 12180 tgtaatattt tagatgtttt aaaaaatata ttaataacat ttaaaaggtt taggtgcgat 12240 agtgacatat tttgaagctc ttcactcagt aaaatgaaaa gtttaaaaaa taaagaaacg 12300 actcagctta aagcaaaatt aagagtagaa tatattccca atatttacct aaacaatttt 12360 acggaatttg aaaatttgac atcgattttc tgtgcatata atttaagtca ttttatggtt 12420 aactttatct ttaatgaagc taagagttct cttaaactaa gaagccaaga aaaataaaag 12480 tgagctcaac taaaaaaatg tataaataaa tttatatttc attagagaaa gtaaataaag 12540 atcaaattgt yagtaatgat ttatttacta ctaacccgta ttacgggttg cttactgtat 12600 aaataaattt atattttatt agagaaagta aataaatatc aaattgttag taatgattta 12660 tatactacta acccgtatta cgggttgctt acagctagt 12699 // ID I-57_AAe repbase; DNA; INV; 6198 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-57_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6198 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1328-1328 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 381..1733 FT /product="I-57_AAe_1p" FT /translation="MAVAAHPPPWGVVEEPPNLALLGGNQPNWMLRNDEMG FT QTLVLVMQVKRNESDVDSATTINEISRNGTRLPHAFIVGKSIEAVIGVEAA FT RAMQSIREARGLRYLLRTNSHTTYQKIQQLDTLCDGTKVEVFPHPTMNTIQ FT GIVFEPDTIDLDEQTILEFLSSQGVKGVRRITKQVGRGRQNTPLLILTFHG FT TKLPEDVYFGLLKVPTRKYYPTPMLCFNCGCFGHTKKACSNDKICLHCSQI FT HEMVDDIPCSNDAHCKNCQGCHSAFSKTCPIYVEEDKIMRYRVDQNISLGE FT ARKIFKAQKEQRTYASVVQQRLIEQSSEKDQIIQALREQLDAVKAELESLK FT KQLQKQATDEHMLPSTSSKRKKSDEKMITSEQNTKASTTTASSSFNETHEH FT QHNTRQSRKDLGNTSKENKKQKDISKMIDCSSPDRNRSRSNRRNGVELRQK FT DKKLNK" FT CDS 1786..6099 FT /product="I-57_AAe_2p" FT /note="endonuclease, reverse transcriptase, and FT ribonuclease H." FT /translation="MNPDLSTLFFQRDSIEIMDFDPSITTDFEIGNFQQHQ FT QDQTSENVRSNENYINFNGNIQHTNSFGEARHSMSTPVRNVSGSPDYLREV FT VDVACPSLQASSTSINICPHNPCRPFSTLSACSSTNINVSLPSGTPLFSGR FT MLNDCSTGARHSMSTPXRNVSGLPDYLREVVDVACPPPQASKNNFEDSSGS FT CSVSLIPEVNHMTSVANNVATNHQRDSSSSSSSSGRVSISNGTRRYQRVAI FT QWNINGLYNNLPDLETLINKDPPAVLALQEIHNCKARNLNQILRGSYRWCT FT KNGPTPYANVALAIHQSVPSTSVKINSALIVSAAIIEVPFRLTVASIYIPS FT GVTNLEQQLNDLIEQLEEPVLIMGDMNSHHYAWGSHKTDKRGEIILKICEE FT KNWIIMNDGTSTFMRGNQTSAIDVTMVSISALSKLRWKIKTDSMGSDHYPI FT EIFGSGTPTNISRRRRWKYDAADWETYEQRILAAISPEKSYTIDEITDIIF FT SAASSSIPQTSSAPGKKATFWWNEETRIAVKKRRKALRAFKRLAVDHPDKE FT NAKEELKLRRSECRSIVKNAKKESWESFIDGISSSQSSADMWRRINSLSGK FT KRIAGMSIRHGDTVSTDPDTVSNTLADYFQKISSEDLYSSEFKNLNGTAKK FT ALSDIIIPDDQGIVDYNQPFSTEELFSALDTGTGKSSGPDEVGYPMIRRLP FT FRAKLALLDAYNVVWTTGNYPDGWRRSLVTAIPKKGSKSMEPDDYRPISLT FT SCIGKVMERMINRRLSTILREKNLLDERQYAFQRGKGAGIYLANLGQVLHD FT ALSRNLHVDIATLDLAKAYNRVWNPGVLKTLIDWGITGNMVRFVKEFLSNR FT TFQVSVGNHRSQPKHEETGVPQGSVLAVTLFLVSMNSVFERLPENIFIFVY FT ADDILLVTVGKQPKTLRKKIQAAVRAVAQWSNINGFNMSLEKCAITHCCNQ FT RHHPWKSPVTVDSVPVPYRKSIRVLGVHIDRQVNFALHFENVKKASEGWMR FT IIRAISGRHQNCNRGTILRVMNATIVSRLVYGIEITCRSAISLIDKLGPVY FT NRCIRFCSGLLPSAPTVATYAEAGVLPFDLLITKIVSNRANNFLEKTYGER FT EEVFLLMEAMNLLQKYANMDFPPVAEIHRVGDRRRDHAGPKIDWTMKNHVR FT AHDPKNKVLSCFNALISDKYKNHISIYTDGSRANGQVGIGVYGTTARAARL FT PDSCSVFSAEAAAILCAISMISNRPTVIFTDSASVLAALESGRSTHPWVQR FT IESSVPDTTTLCWVPGHCGIRGNEEADRLAADGRTALFFMTEVPALDVKNT FT IAAGVQAAWTIRWRNTRNVFLRKIKGEPDKWTDRYSWREQRVMSRLRVGHT FT RLTHAHFLSKVEQPRCVPCGQALTVEHILLNCVQYEEERTFTNLPTTIRDI FT LCNDRDEEEKLLRFLRVSGLYDCI" XX SQ Sequence 6198 BP; 1959 A; 1286 C; 1381 G; 1569 T; 3 other; cagttttgat cgggttacct gcatacaaaa acagaaacct acatcgctgt tcgatttcga 60 ccgattttga gtgttatatt gaacatttag ttcggtaaat tgattcagat agttaacaga 120 tcgttacggt taacgtagat tataaaatct actagtttcg cggcgtaata taaggaaaaa 180 ctcaactttt gccccgtgag aagagcgcac ttcgtgtggg ccgtatacat agtgaaattt 240 gtattgcttt tgttcgccac atctacccga acaaacgtag ctgcagtgag tgaccgtaaa 300 gatacctttc aatccgtgct acgttcgtta aatatttgct ctgagtgttt ggtctaagac 360 aatctgagtg ggtcagctta atggcggtgg cagcacaccc cccaccttgg ggggtggtgg 420 aagagccccc aaatttggca ctacttggag gaaaccagcc aaactggatg ttaagaaacg 480 atgaaatggg tcaaaccttg gtactcgtga tgcaagtaaa acggaatgaa agtgatgttg 540 attctgccac taccatcaac gagatcagca gaaatggaac acgtttacct catgctttca 600 tcgttggcaa atcgattgaa gccgttatcg gagtagaggc agctcgtgca atgcaatcaa 660 tccgagaggc tcgtggactc cgctacctgc tgcgtacaaa ttctcacacc acataccaaa 720 aaattcaaca actggacact ttgtgtgatg gtaccaaagt tgaagtcttt ccacatccaa 780 ccatgaatac tattcaggga atagtattcg agccagacac gatagatttg gacgagcaga 840 caattctgga atttctcagc tcgcagggcg ttaaaggcgt aagaagaatc accaaacagg 900 ttggtagagg gagacagaac actcctttgc taattctgac cttccatgga accaaattgc 960 cagaggatgt gtatttcgga ttactaaagg taccaacaag gaagtattat ccgacaccaa 1020 tgctttgctt caattgtggt tgctttggac atacgaagaa agcatgctcc aacgataaga 1080 tttgcctgca ttgctctcaa attcacgaga tggtcgacga tatcccctgt tctaacgacg 1140 ctcactgcaa aaattgtcag ggttgtcatt ctgcattttc caaaacatgt ccgatctatg 1200 ttgaggaaga taagataatg cgataccgtg tagatcagaa tatcagcctc ggtgaagctc 1260 gcaaaatatt caaagctcaa aaggaacagc gaacatatgc tagtgtcgtg cagcaacgtc 1320 ttattgaaca atcctctgaa aaagaccaaa tcattcaggc cctacgtgag caacttgatg 1380 ctgtcaaagc cgaactggaa agtttgaaga aacaacttca gaagcaagca acagacgaac 1440 atatgttgcc atctaccagc agcaaacgaa agaagtctga tgagaagatg atcacatcag 1500 aacaaaacac gaaggcctcc acgacgactg catcaagctc atttaacgaa acccatgaac 1560 atcaacataa cacgcgacaa tcgaggaaag atttaggaaa cacatcgaaa gaaaataaga 1620 agcaaaaaga catcagtaaa atgatcgatt gcagcagccc tgaccggaat cgtagcagaa 1680 gtaataggag gaatggtgtc gaactacggc aaaaggacaa aaaactgaac aaatgatgtc 1740 tacsgaccaa aaaaacaaca attttccagt tattaagaac ctgctatgaa cccggacttg 1800 agtacattat tttttcaacg agactccatc gagattatgg attttgaccc cagcatcaca 1860 acggactttg agatcggaaa ttttcaacaa catcagcagg atcaaacttc tgaaaatgtc 1920 agatcaaacg aaaactacat caatttcaac ggaaacattc agcatactaa cagcttcggt 1980 gaagcaaggc attccatgtc gacccctgtg cggaacgtct caggctctcc tgattatctc 2040 cgcgaggtgg tcgatgtggc ttgcccatca ttacaggcaa gttcaacaag cataaacatt 2100 tgtccacaca atccctgtcg acccttctcc acgctttcag catgtagcag cacaaacatc 2160 aatgtgtcgt taccttctgg gacaccttta ttcagcggac gcatgctcaa cgattgctca 2220 actggagcaa ggcattccat gtcgacccct stgcggaacg tctcaggtct tcctgattat 2280 ctccgtgagg tggtcgatgt ggcttgccca ccacctcagg caagtaaaaa taattttgag 2340 gattcgagtg gttcctgttc cgtatcacta attccagagg tcaaccatat gacatctgtt 2400 gcaaataatg tggcaactaa ccatcagcgt gattcatcgt cgtcgtcgag ttccagcggc 2460 cgcgtgtcta tatcgaatgg cacccgccga taccagaggg ttgctattca atggaacatc 2520 aacggacttt acaacaatct tcctgatctc gaaactctaa tcaacaagga tccacctgcg 2580 gttcttgctc tgcaggaaat tcacaactgt aaagcaagga atctcaatca aatattacgt 2640 gggagttatc gctggtgcac gaaaaacgga ccaacaccgt atgcgaatgt tgcgttagca 2700 attcatcaat ctgtaccgag tacatccgtg aagattaact ctgcgttgat tgtgtctgca 2760 gccataattg aagtaccttt ccgtttaacc gtggcttcca tatacattcc ttcaggtgtg 2820 accaatttag aacaacagtt gaatgatctt attgaacaac tagaagaacc tgttcttatt 2880 atgggtgata tgaatagtca ccattatgca tggggatcac acaaaactga caaaagaggc 2940 gaaataatct taaaaatctg tgaggagaaa aactggatta taatgaatga tggaacttca 3000 actttcatga gaggtaacca aacgtccgca attgatgtta ctatggtcag tatttctgcg 3060 cttagtaagc tacgatggaa aatcaagaca gattctatgg gaagtgatca ttatccgatt 3120 gaaattttcg gcagtggtac tccaacaaat atttcgcgcc gtcggagatg gaaatatgat 3180 gctgcggatt gggaaacgta cgagcagcga atcctggcag ccattagtcc ggaaaaatcg 3240 tacaccattg atgaaattac tgatataata ttttcggcag cttcgtcgag catccctcaa 3300 actagtagtg ctccagggaa aaaagctacg ttctggtgga atgaggaaac cagaatagcg 3360 gtcaaaaaga gaagaaaagc tcttagagca ttcaaaagat tggctgttga tcatcccgat 3420 aaagaaaacg ctaaggaaga gttgaaactg aggcgtagtg aatgtagatc catcgtaaaa 3480 aatgcaaaaa aggaatcatg ggagagtttt atcgatggca tcagcagctc ccaaagttca 3540 gcggatatgt ggcgaagaat aaattcgctg agtggtaaga agcgcatagc gggtatgtcc 3600 atccgtcatg gcgatacagt tagtacagat cctgatacgg tttccaatac actggctgac 3660 tattttcaaa aaatctcatc cgaagattta tatagctccg aattcaaaaa tttaaacggg 3720 acggcaaaaa aagcgctgag tgacattatc attcctgatg accaaggaat tgtcgactat 3780 aaccaaccgt tttcaacgga agaattgttt tctgcattgg atactggtac cggaaaatcg 3840 tctggtcctg atgaagtagg gtatccaatg atcagaagat taccattcag agccaaactc 3900 gcactgttgg atgcatacaa cgttgtctgg actaccggta actacccaga tggatggcga 3960 agaagtctcg tcaccgctat tccaaaaaag ggctcgaagt caatggaacc tgatgattat 4020 agacccattt ctttaacaag ctgtattgga aaagtgatgg agagaatgat aaacaggcga 4080 ttatcgacaa ttttacggga aaagaatcta ttagatgaac gtcaatatgc ctttcaacga 4140 ggtaaaggag ccggaatata tttggcmaat ttaggtcaag tgctgcatga cgcgctcagc 4200 agaaacttgc atgtggacat agctacattg gatctagcga aagcttataa cagggtttgg 4260 aatccaggag tgttgaaaac attaatcgat tggggtataa ccggcaatat ggtgagattc 4320 gtcaaggaat ttctttcaaa ccgaactttc caagtatctg ttgggaatca tcgatctcaa 4380 ccaaaacatg aagaaaccgg tgttcctcag ggatcagttt tggctgtaac cctgtttctc 4440 gtttctatga acagcgtttt tgagcgacta ccagaaaata tcttcatttt tgtatacgca 4500 gatgatattt tattggtgac tgtgggcaaa caacctaaga cattgcggaa gaaaatccaa 4560 gcagcggttc gtgctgttgc acagtggtcc aacatcaatg gcttcaatat gtctttggaa 4620 aaatgcgcga ttactcactg ctgcaatcag aggcaccatc catggaaatc accagtaact 4680 gttgattccg tccctgttcc gtataggaag agcattcgag tattaggggt acacattgat 4740 aggcaggtga actttgcttt gcattttgaa aatgtaaaga aagcgtcaga aggatggatg 4800 agaatcattc gtgcaataag cggaaggcat caaaactgta acagaggcac tattttacga 4860 gtgatgaacg caaccattgt tagtcgtttg gtttatggaa tagaaataac atgtaggtca 4920 gcaatcagtt taattgataa gttaggacca gtttacaatc gttgtattag attttgctca 4980 ggcttactcc caagtgcacc aactgttgca acatatgcgg aagccggggt acttcctttc 5040 gacctactta tcactaaaat tgtctcgaac agagctaata actttctaga aaagacatat 5100 ggagagagag aggaagtctt cctcctaatg gaggccatga acctcctcca aaaatatgcc 5160 aacatggatt tccctccggt agctgagatc catcgggtgg gagataggcg tcgggaccat 5220 gctggaccca aaatagactg gacgatgaaa aaccatgtaa gggctcatga cccgaaaaac 5280 aaagtgttgt cctgctttaa tgccttaatt agtgataaat acaaaaacca cattagcatt 5340 tacactgatg gatctcgagc aaatggacaa gtcggtattg gggtctacgg tacaacagca 5400 cgtgcggcta gattaccaga ttcttgctcc gttttttcag cagaagctgc agctatactg 5460 tgcgccatat caatgatttc caacaggcct acggtcattt tcactgactc ggctagcgtt 5520 ttagcggcat tggaaagtgg tagatcgaca catccatggg tgcaacgtat tgagagttca 5580 gtccctgaca ctaccactct gtgttgggtc ccaggccact gtggtatacg cggaaacgaa 5640 gaagctgatc gtttggcagc agatggaaga acggcgttgt tttttatgac agaggtgcca 5700 gctctggacg tgaagaatac gatagcagct ggagtgcaag cagcatggac aattcgatgg 5760 agaaacacgc gtaatgtttt cttacgaaaa atcaagggtg aaccagataa atggacagac 5820 cgatatagtt ggcgtgaaca gcgtgttatg tcccgcctaa gagtagggca cactcgttta 5880 acacatgcac attttctttc aaaggttgaa caaccaagat gtgtaccttg tggacaggct 5940 cttacagtgg agcacatcct tctaaactgc gtccaatatg aagaagagag aactttcaca 6000 aacttgccta caacaataag ggatatttta tgcaacgacc gagacgaaga agaaaaacta 6060 cttagattct taagagtatc tggattgtac gattgtatat gaaaatgtag cggaacaaaa 6120 ttatatttta atatttcttg aggtgaaccg gccgccaggt tgaaagcctc tttaataaag 6180 acaaaaaaaa aaaaaaaa 6198 // ID Copia-129_AA-I repbase; DNA; INV; 4097 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-129_AA_; KW Copia-129_AA-LTR; Ty1_copia_Ele197; Copia-129_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4097 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1479-1988] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 78..4082 FT /product="Copia-129_AA-I_1p" FT /translation="MVLPRDDPNSQGGGRSTGGGTAGTGQGDGATTSGGSV FT RPAAPVRSFGNAVSLPAMEKLKGRQNYASWSFAMKMILIREGTWRAVKPAE FT DQNVDPEMCERAFATICLALEKQNYSLVKTAKTAREAWEKLQQAFQDNGLI FT RRFGLLDKLTSVKLEECETVEDYVDQLVTTSNDLSEIGFEVNDQWLASLLL FT KGLPEYYNPMIMGLQASGINLTADVVKTKIIQDVKWPLKSSGVGGALYTKP FT KSKHSKVDKKSGLCFTCNKPGHFASNCPQKSSKPKGKALCATLAVGTGSED FT CWYFDSGATAHMTRSKDGFVKQVDWVHPIDTASSHSIKSVAKGTVQLELDE FT GPIEVKEVLMVPELSANLLSISKICEKDMTVLFKAGGCEVRDENGDIVVTG FT IQQDGMYKLITKKSPKAFLMTDMELWHRRLGHLNKQFMGKLTTLVDGLEIR FT ITDMDDCVACAQGKHCRDSFHSSASRATDLLDLVHSDVCGPIEVPSIGGSR FT YFVTFVDDASHKVFLYFIESKSQVKEVYEQFKAMVERQSGRKLKVLRTDNG FT TEYVNSSLEKIMQRDGVVHQTTCPYSPEQNGTAERMNRTLVEKARSMLNDA FT GLSKKFWAEAVSTAAYLVNRSFTRAVDGKTPEEAWSQKKPNLKHLRVFGSR FT VMVHCPKQKRQKFDPKSVEGIFVGYGERSKGFRVFNPARNDVIVSRDVVFI FT NEGRPITSTATEDRSQPVEFMELHSWLDMEENQANDAGESDDVVLPGPSTE FT ESPVPAVALPPQSERSAVEQQGLRRSGRERQPTGKYTDFVCLSSTSGSDDL FT TEPIEAFPGQAQPGSTDDPISYAEVLQRSDRNRWLEAMREELEALDKNETW FT ELTELPKGKKAIRNKWIFKTKYGSDGNVDRFKARLVVKGCSQRYGIDYEEV FT YAPVVRYSTIRYLMALAVEHDLEIEQMDAVTAFLQGKLKDEEIYMDQPEGF FT VKDPSKVCKLKKALYGLKQSSRIWNLQLDEALREFGLQRSEIDPCLYFKKE FT GGRMMFVTIYVDDFLLFTNDQALKKKLKAFLNSRFKMKDLGEAKLCLGLRI FT TRDRKNGKLWLDQKQYLKDVLERFSMANCNSVSTPADPSAKLDKTMCPSDP FT EEVREMQTVPYKEAVGSLMYAAYATRPDIAFAVNTVSKFSNNPGRRHWEAV FT KRIIRYLKGTMDHRLQFSKQSNPDLTGYSDADWGGDAEERKSTTGYVFTKM FT GGAISWNSKRQDVVALSTCEAEYIALSRTAQEALWWQQLLKQLDDQQVVPI FT MCDNQSAICVAKNQGFNPKTKHIAIRYHFVRDTLNRGEVTLDYVPTKQQPA FT DGLTKPLTKQNHEQFKKMLGIVS" XX SQ Sequence 4097 BP; 1170 A; 874 C; 1169 G; 884 T; 0 other; actggttatg ggcccaggat tttggccata gcctgaagaa aagttaacaa gcgtgaagag 60 ttctgaagtg tccgaaaatg gttcttccga gggatgatcc aaattctcaa ggcggcggta 120 gaagtactgg cggaggaacg gcagggaccg gtcaaggtga tggagcaact accagtggag 180 gcagtgtgcg gccagcagct ccggttcgat cgtttgggaa cgcagtaagt cttccggcca 240 tggagaaatt aaaaggacgt cagaattacg cttcgtggtc cttcgcaatg aagatgatcc 300 tgatacgtga aggtacgtgg cgcgcagtga agcctgctga agaccagaac gtggatccag 360 agatgtgcga acgggcgttt gcgaccatat gtttagccct ggaaaagcag aactacagtt 420 tggtcaagac agcgaagaca gcccgagaag cctgggagaa gctgcagcaa gctttccaag 480 acaacggact gatccgaagg tttggacttc tcgacaagct aacatccgtc aagctagaag 540 agtgtgaaac tgttgaagat tacgtggacc agctggtgac aacatcgaac gatttgagcg 600 agattggatt tgaagtcaac gaccagtggc ttgcgtcctt gctgctcaaa ggacttcctg 660 agtactataa tccgatgatc atgggactgc aggcatccgg gatcaacctc acagctgacg 720 tagtcaagac gaagatcatt caagacgtga agtggccgtt gaaaagttcc ggtgtgggtg 780 gcgctctgta cacgaaaccg aaatcgaagc attcaaaggt ggacaagaag agcggattgt 840 gtttcacctg caacaagcct ggacatttcg cttcaaattg ccctcagaag tcatcaaaac 900 ccaaaggtaa ggcgttgtgt gcaacattgg ccgttggaac tggtagcgaa gactgctggt 960 attttgactc tggagcgacg gcacatatga cgagatcaaa agatggattt gtgaagcaag 1020 tggattgggt acatccaatt gacaccgcta gcagccatag catcaagtct gttgcaaaag 1080 gtacggtcca actggagttg gatgaaggcc cgattgaagt taaagaagtc ttgatggttc 1140 cggagttgtc ggcgaatctt ctgtctatca gcaaaatctg tgagaaggat atgacggtat 1200 tattcaaggc cggtggatgc gaggtacgcg acgagaacgg cgacatcgtt gtaaccggaa 1260 ttcagcaaga tggcatgtac aagctgatta cgaagaaatc cccgaaagcg ttcttgatga 1320 ccgatatgga actgtggcat cgtagacttg gacatctgaa taagcagttc atgggcaagc 1380 taacaacgtt ggtcgatgga ttggagatac gaataacgga catggacgat tgcgtcgcgt 1440 gtgcgcaagg taagcactgc cgagattctt ttcattctag tgcttcacgt gcgaccgatc 1500 tactggactt ggtgcactcc gacgtgtgcg gtccgattga ggtgccatcg atcggcggaa 1560 gtcgttattt cgttacgttc gttgacgacg cgagccacaa ggtgttcctg tatttcatcg 1620 agtccaagag ccaagtgaag gaggtttacg agcagttcaa agcaatggtc gagcgacagt 1680 ccggacgaaa actcaaagtt ttgcgcacgg acaacggaac cgagtatgtg aattcgtcat 1740 tggagaagat catgcaacgg gatggcgtgg ttcaccaaac aacgtgtcca tattctccag 1800 aacaaaatgg gacagccgag cgtatgaacc gtacgctggt cgagaaagcg agatcgatgc 1860 tgaacgacgc aggactttcg aagaagttct gggcggaagc cgtatcaacc gcggcgtatc 1920 tggtgaacag aagttttacc cgagccgtag atggaaagac accagaggaa gcttggagcc 1980 aaaagaagcc gaacttgaag catctacgtg tgtttggttc ccgagtaatg gtgcactgtc 2040 cgaagcagaa acgacaaaag tttgacccga aatccgttga aggcatcttt gtgggatacg 2100 gtgaacgttc aaaaggattc cgagtattca atccagcaag aaatgatgtg atcgttagtc 2160 gtgacgtggt gttcatcaac gaagggcgtc cgataacctc gacagcaact gaggaccgga 2220 gccaaccggt agaattcatg gagctacatt cttggttgga catggaggag aatcaagcca 2280 acgacgctgg tgaatccgat gatgtagtgc tacctggtcc cagcaccgaa gaatcgcccg 2340 tgcctgccgt cgcgctccca ccgcaatcag aaagatcagc tgtagagcag caaggattga 2400 ggcgcagcgg tcgggagcgc caacccacag gcaagtatac agattttgtt tgtcttagtt 2460 cgacttctgg ttccgatgat cttacagaac caattgaggc gtttcccggt caagcacaac 2520 ccggatcaac tgatgacccg atcagctacg cggaggttct acaacggtcg gatcgaaatc 2580 gatggctgga agctatgcga gaggaactag aggcgctgga caagaacgag acgtgggagc 2640 tgacagaatt accaaagggc aagaaggcga tacgaaacaa gtggatcttc aaaacaaagt 2700 acgggtccga cggaaatgtg gatcgattca aggctcgatt ggtggtgaag ggctgttccc 2760 aacgttatgg tatcgattac gaggaggtgt acgcgccagt ggttcgttac tcaacaataa 2820 gatacctcat ggccctagca gtggaacacg atttggagat agaacaaatg gacgccgtaa 2880 ccgctttcct tcaagggaag ttgaaagacg aggaaatcta catggatcaa cctgaaggct 2940 tcgtcaagga tccatcgaag gtgtgcaagt tgaaaaaggc actatacggt ttgaagcaat 3000 caagccgcat atggaatctt caactagacg aagcattgcg agaatttggt ttgcagcggt 3060 cggaaattga tccctgtttg tacttcaaga aagaaggcgg acgaatgatg ttcgtgacga 3120 tatatgtgga cgattttctc ctgttcacaa acgatcaagc tttgaagaag aaattgaagg 3180 catttctgaa tagtcgcttc aagatgaagg atttgggaga agcgaagctg tgcctaggac 3240 tccggataac acgcgaccgg aagaacggaa agctctggct ggatcagaag cagtacctga 3300 aagatgtttt ggaacgattt agcatggcaa actgtaattc tgtatcaacc ccggccgatc 3360 cttcagcaaa attggacaag acgatgtgcc catctgaccc agaagaagta cgagagatgc 3420 agacggttcc ttacaaagaa gcggtaggtt ccctgatgta cgcggcgtac gctacaaggc 3480 cggatatagc gtttgctgtg aatacggtga gcaagttcag caacaaccct ggacgtcgac 3540 actgggaggc ggtgaaacgc ataatacggt acctgaaggg gacaatggac catcgtttgc 3600 agttttcgaa gcagagcaat ccggatttga caggttactc ggatgcggat tggggaggag 3660 atgcggaaga gcgaaaatca actactggtt acgtttttac caagatggga ggagccatat 3720 cctggaactc caagcggcaa gacgttgtgg ctttgtctac gtgcgaggcc gaatatattg 3780 cgctatcgag gacggcgcag gaagcactct ggtggcagca gctactcaag cagctggacg 3840 atcaacaggt ggtgccgatc atgtgcgaca accagtcggc gatctgtgta gcgaaaaacc 3900 aaggcttcaa cccgaagacg aaacatattg cgattcgtta ccattttgtc cgagacacgc 3960 tcaatcgagg agaagttaca ttggattacg ttcccacgaa gcagcaaccg gcagatggac 4020 taacaaaacc tttaactaag cagaatcacg aacagttcaa gaagatgctt ggtatcgtat 4080 cgtaggttaa ggaggag 4097 // ID piggyBac-N1_AAe repbase; DNA; INV; 948 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous piggyBac DNA transposon family from Aedes DE aegypti. XX KW piggyBac; DNA transposon; Transposable Element; nonautonomous; KW piggyBac-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-948 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1315-1315 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >96% CC identity. CC TTAA TSDs. Both termini are ~87% identical to those of CC piggyBac-1_AAe. XX SQ Sequence 948 BP; 323 A; 169 C; 146 G; 310 T; 0 other; ccctttcctt cccatggtag cacaggtgat ccaccacttt gcgtatgtcg tatataaact 60 gtacaagtta gatcatatcc attttttcac cacatgttca catatgtttc atgaaataat 120 gtacaaagtt tcatcaaaaa taatcacttg gattttcagt taaagcacaa taaccgaagt 180 aaagtcagca ttttttctaa tttaaataaa gtggattttc aaacttctat atttagaaat 240 ataaacatga ttttcaccaa acaaaaataa aaacaagctc aattgaatta cttttccaag 300 attcagttga gtttttagaa tattctttac acaaaactat tgattttgcg gaaagttagg 360 tggatcactg gtgatccatt gggaataaaa cgacattttc gtcaattctg ttctaacctc 420 tcttttcgaa aacaaaacta tcatcatgtc ttggaaagtg attttaagtt ggaaagaaaa 480 ttgtaagacg ctgagggcga tcaacaaacc aacatactca gcagtagcta taagcacggt 540 atgacggagt agaacattgg cccgttgaat accgttctgt gcaaatatca ataatcattc 600 agttcattta aatgcactaa gtgtgatgtt tgcttgtgtc tcaaaaaaga caaaaacttc 660 ttcccagctt tttattttcc tccagaattg acgagaggtg tgcagaacac actgttcaaa 720 taaattgtat gagtaaaaac aatgaaatgg aaaaaaatat tgatgcctct ttcttaccct 780 agttgcagtc atcttcccaa tggatcacag gtgatccatc acaaaaaata tttttttcag 840 ttctattcat attttcgagt aaaactttac atcttttgtt ctggtgattt atccagatat 900 cacattttct atttttaaac tacgaaaaac ctcgtgggaa agaaaggg 948 // ID RTEX-7_BF repbase; DNA; INV; 2630 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-7_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-1_BF; KW RTEX-7_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2630 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2630 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1723-1723 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The 3' terminus is composed CC of the (CATT)n microsatellite. XX FH Key Location/Qualifiers FT CDS 2..2518 FT /product="RTEX-7_BF_2p" FT /note="RT." FT /translation="PKEEIEAFVSEFSKILKDAGTRSLSIKIFNKKRKTRR FT QNKQWFDKSCTELKREVKNLASLLEKQPSNPDIRGRFFKRKKEYRKLTKKK FT KRDFHKYIMTQLSSLKDKDPGAFWKLVNKLKPGKAQENNISDEEWLKHFSS FT VDKLFNNTNDCPVSENDSLNSDNDLSPPHATPSVLDSPITLEELTTAISNL FT KNNKSSGNDMILNEMLKSGKTILKTPLLQLFNTCLQNGYFPDEWSHSHIVP FT IHKSGDPSIPDNYRGISIMSCLGKLFSSILNTRLVDYAENNRLFKPHQAGF FT RKNFRTTDNLFVVSTLVRKYISQNSHLFACFIDFSKAFDSVWRNGLIYKLN FT KLGIGGKFLQTIRDMYSKTTNCIKHSQGLTESYVTNCGVRQGCNLSPTLFN FT LFISDITAVFDNECEPPIMHTERIPCLLYADDLVIFSESKRGLQSSLNKLE FT NYCNTWRLNVNLRKTKVIVFTKGGRLPRDCSFTYQKKPVELVTSYCYLGII FT VSSAGTFKANHKHLYKKGLKALFGINQSLDKADAPISVRNKLFDACVKPII FT LYGFEIWGAFKTSNSCPIESVHLKFCKQSLRVPRSSCSLAARAELGRYPMQ FT VEASLNSIKHFLRLRLNVPADSFQADAFSCQLELDKSGNKCWASGVRKSLE FT ECGYAYLWHFPIPHRTNTFQIVNSIGQRLKDIYFQTFLKEIHNDNKGGAAK FT NKLRSYRLFKTTYNEEKYLSNENMRDRNAATRLRISCHKLHIETGRHTRTP FT LEQRLCKHCNLDRIEDERHFVVECRLYTVERNELYKVIQNQFPYFIHLSSV FT EKFIFLMQLDKPSIIKHICSFISTITEKRGEVEFIQQ" XX SQ Sequence 2630 BP; 891 A; 522 C; 479 G; 738 T; 0 other; ccctaaagag gaaatcgaag cgtttgtgtc agaattcagc aaaatattga aagatgccgg 60 taccagatca ttatcaatca agatattcaa caagaagcgt aagactagga gacaaaacaa 120 acaatggttt gacaagagct gcactgaatt gaaacgtgag gtgaaaaact tagcctctct 180 tcttgaaaaa cagccatcaa acccagatat tagaggtagg ttctttaaaa ggaaaaagga 240 atatagaaaa cttaccaaaa agaagaaaag agattttcac aaatacataa tgactcaatt 300 atcatctctg aaagacaaag atcctggtgc cttctggaag ctagttaaca aattgaaacc 360 tgggaaagca caagaaaaca acatttcaga tgaggagtgg cttaaacact ttagcagcgt 420 agacaaactt ttcaataata caaatgactg ccctgtctct gaaaacgatt cacttaactc 480 agataacgat ttgtccccgc cccatgcaac cccctctgtc ctcgattccc caataacctt 540 ggaggaacta acaactgcaa tttctaatct taagaataac aaatctagcg gaaatgacat 600 gattctaaac gaaatgctaa aatctgggaa aactatattg aaaacaccac ttcttcagct 660 attcaacaca tgtttacaaa acggatattt tccggatgaa tggtctcata gtcacatcgt 720 cccaatccac aagtccgggg acccgtccat tccagataac tatcgtggaa tttccataat 780 gagctgttta ggtaaattat tttcttccat cttgaatacc cgtttagtag attatgcaga 840 aaacaataga ttatttaaac cacatcaggc cggctttagg aaaaatttta gaacaacaga 900 caatcttttt gttgtaagca ctctagttag gaagtacatt agtcagaact ctcatctttt 960 cgcatgtttt atagatttta gtaaagcgtt tgattccgtc tggagaaacg gtctcatata 1020 caaactgaac aagctcggta tagggggtaa atttcttcaa actatcagag acatgtactc 1080 taaaactact aactgtataa aacacagcca aggactcact gaatcttatg ttacgaactg 1140 tggtgtccga caaggctgca atttaagtcc gactttattc aatttgttta ttagtgatat 1200 cacagccgtc ttcgataatg aatgtgaacc gccaattatg catactgaac gcattccttg 1260 tttattgtac gctgacgact tagtcatttt ttctgagtcg aagcgagggt tacaatcttc 1320 attgaataag ctagaaaact actgtaacac atggagactg aatgtaaacc ttaggaaaac 1380 aaaagtcatt gttttcacta agggtggccg tttgccaaga gattgttctt ttacatacca 1440 gaaaaaacct gtagagttag ttacgtccta ttgttactta ggtatcattg ttagctctgc 1500 cggcactttc aaggcgaacc acaagcatct ctacaagaaa ggtctaaagg ctttgttcgg 1560 gattaaccaa tcactagaca aagccgatgc acccatttcc gttagaaaca aactctttga 1620 cgcatgcgtc aaacccatta ttttgtatgg tttcgaaata tggggtgcgt ttaaaacttc 1680 caattcctgt ccaattgaat ctgttcactt aaaattttgt aaacaatcat tacgcgtgcc 1740 gaggtcatct tgcagcttag ctgcaagggc ggagttaggg agatacccca tgcaagtgga 1800 ggcctcccta aactctatca aacattttct tagacttcgc ctgaacgtgc cagccgacag 1860 ttttcaagca gacgctttct cctgccagtt agagttggac aaatctggta ataaatgttg 1920 ggcgtcagga gtgcgtaagt ccttagagga gtgcgggtat gcatacttgt ggcattttcc 1980 tatcccacat agaacaaaca catttcagat tgttaactct attggccagc ggttaaaaga 2040 catatacttt caaacgtttt taaaagaaat acataatgac aacaaaggcg gagctgccaa 2100 aaataagtta agatcatata gattattcaa aactacatac aatgaagaaa aatatctaag 2160 taatgaaaat atgagggaca ggaatgctgc cacacgacta aggattagct gccataaact 2220 tcatattgaa acaggaagac atactcgcac ccctctggaa caaagactat gcaaacattg 2280 taatttagat agaatagagg acgaacgcca ttttgtagta gagtgtagat tgtataccgt 2340 agagagaaat gaactgtata aggttataca aaaccaattt ccttatttca tacacctaag 2400 ttctgtagaa aaattcatat tcctaatgca actagacaaa ccatcgatca taaaacatat 2460 ctgctctttt atctccacta taacagaaaa gcgaggagaa gttgaattta tccagcaata 2520 gtctcatttt gtatctgtga actgtatctt ttcaacatgt cttgttgtga cctgtactta 2580 gcccggtgtg gcaaaaatgt gcaataaagg tcttcattca ttcattcatt 2630 // ID BEL-3_SI-I repbase; DNA; INV; 5430 BP. XX AC AEAQ01011211; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_SI_; KW BEL-3_SI-LTR; BEL-3_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5430 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01011211; Positions 13114 7685. XX CC Positions [4190-4480] - Integrase core CC 'GATGG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 284..2932 FT /product="BEL-3_SI-I_1p" FT /translation="MKVTSRRPAIPEDGSSTRVTNTTKTSKFPKLELPKFS FT GNIKEWLPFWNQFKKIMSISSEDKFQYLLQATGPDSRANEIVKSFPPTGEN FT YVKAITSLRNRFGRDDIVEEFYVRELLGLVLQNAVKGNKKSSLGNLYDKIE FT CYIRASETLGVTTDKCAAMLYPLVESSLPEEVLRAWQRSGQRGLAEDNGQR FT ESTDRLAKLLKFLQLEVENEERIDMALTGFGLSTEQDKAKKLRNKPESSKE FT TASASILLVAKEQKKPVCIFCKSNHESQHCENARKLTLDERKDIVKKENCC FT YNCLIRGHAAQKCRSRLKCDWCTRRHVLLMCPGIFRKESVSAIKSDDNKNT FT ISDDNKKTVEDSSLATFCERYDVYLQTLRVKIYSMSREKIVTAVIDTAAQR FT SYIRTDIVKDLGYNSLGELKVTHTLFGGIKSEVEKHEMFRIHLKSLDDSYA FT CNFSAMNTNVICSTISSIKKDSWVNELRNNNINLTDIGEETGAIGILIGAD FT VAGKLMTGRKYNLENGLTAFETLLGWTVMGLPMISRRSDTAVMLTTLFVQE FT ASPSDLWRLDVIGITDTIEKMDKLEQEERIREFLQETAKLNDEDRYEVKLP FT WVDDHAPVSSNYDIARNRLKKCTQKLKAQNLFEAYDSVFQEWLSEGIIERI FT TDSEINGSGHYLPHRPVVKTHGTTKIRPVFDASASRKGYPSLNQCLETGPN FT LSELVPSVLNRFREGKIGVVSDIKKAFLQISVHKEDRDYLRFFWIVNNEMI FT IFRHCRVVFGLACSPFLLATIIEFHLSAYIKAVKGTETIIQKLKNAFYVDN FT CVASVNSEEELENFVREATSILAAGGLRGWESSGETIENESSLVLGILWNK FT KEDTISINPAVLNENSPDVITKRTILSATR" XX SQ Sequence 5430 BP; 1809 A; 978 C; 1251 G; 1392 T; 0 other; actagcgcag tcggtaggac gcgatggatg ctatcaagaa acagagaaaa atccagagga 60 tggccttcac gaaggcactc acggctttta cgacgaaaat ggacagtgat tgctcaagtg 120 aagacaagat gatggccttc caatttctcg agacaaagat gtcggaatta gatactgtac 180 attcggcata taatcaggcg ctctttcaat cagacctgga tgaagcggtt atcatcaagg 240 agttggaatc gacgcatata aaacgcagta cctaaccgca aaaatgaagg taacgtcgag 300 aaggcccgcg ataccggaag acggatcaag cacgagggta acaaatacta ccaagacctc 360 aaagtttcca aagctggaac tgccgaaatt cagtggcaat atcaaagaat ggctaccgtt 420 ctggaaccaa ttcaagaaaa ttatgtcaat aagcagtgaa gacaagttcc agtatcttct 480 acaggccacc ggtccagatt cccgagccaa tgagatcgtg aaaagttttc cgcccacggg 540 cgagaattac gtaaaggcga tcacgagctt gagaaatcgt tttggaagag acgacattgt 600 ggaggaattc tacgttcggg aacttttggg tctcgttctg caaaacgccg tgaaaggaaa 660 caaaaagtcg tcgctgggca acctatacga taaaatcgag tgttatatac gagcatcgga 720 aacgctcggg gtgacgacgg acaagtgtgc ggccatgctc tatccattgg ttgagtcgtc 780 ccttccagaa gaggtcctac gagcatggca acgtagcggt caacgaggat tggctgagga 840 caacggtcaa cgtgagtcta ccgatcgact ggccaaactg ttgaaattcc tgcaattgga 900 agtagaaaac gaggaacgta tcgacatggc gctaacagga ttcggattat cgacggagca 960 agacaaagcg aaaaagctga gaaacaaacc agagtcgtca aaggaaacgg ccagcgctag 1020 cattcttctc gtcgcgaaag agcagaaaaa accggtctgt attttttgta aatcgaatca 1080 cgaaagccaa cactgtgaaa acgcacgtaa gttaacatta gatgaacgta aagatatcgt 1140 caaaaaagaa aattgttgtt acaattgttt gatacgagga cacgcggcac aaaaatgtag 1200 aagcaggtta aaatgcgatt ggtgcacgag acgccatgtt ctcttaatgt gtccgggtat 1260 atttcgcaaa gagagcgttt ccgcgatcaa atcggacgat aataagaata cgatctcgga 1320 tgacaataag aaaacagtcg aagattctag tttagctacg ttttgtgaaa ggtatgatgt 1380 gtacttgcaa acacttcgcg taaaaatata ttcgatgtca cgggagaaaa ttgtaacagc 1440 ggtcattgat acagccgcac aacggtcata tatccgcacc gatatcgtaa aagatttagg 1500 ttataattcg cttggagaat taaaagttac acatacgttg ttcggtggaa ttaaatccga 1560 ggtcgagaaa cacgagatgt tccgaataca tttgaaaagt ctagatgatt catacgcatg 1620 caatttttct gccatgaata caaacgttat ttgcagtact atttcgagca tcaaaaaaga 1680 ctcttgggtt aacgaactac gcaacaataa tattaattta actgatatcg gagaagaaac 1740 cggtgcgata ggcattctta tcggtgcgga cgtggcaggc aaattaatga ccggtcgaaa 1800 atataatttg gaaaatggac taacagcatt tgaaacgctc ctaggttgga cagtaatggg 1860 attgccgatg atatcccgga gatccgacac cgcggtaatg cttacgacgc tgtttgtgca 1920 agaggccagt ccgtctgatc tgtggcgttt agatgttatt ggtatcacgg atacaattga 1980 aaagatggat aaattggaac aagaggaacg aattcgagag tttctgcagg aaacggcgaa 2040 gttgaacgac gaagatcgat acgaagtcaa attaccgtgg gtggacgatc acgctcctgt 2100 ttccagtaat tatgatatcg cgcgcaatag attaaagaaa tgtacgcaaa agcttaaagc 2160 ccaaaatctg ttcgaagcgt acgatagtgt ttttcaggaa tggttgtcgg aaggcattat 2220 tgaaagaata acggacagcg aaattaacgg ttccggacat tatttaccgc atcgtccagt 2280 agtaaagaca cacggcacaa caaaaatcag accggtgttt gatgcctcgg cgagcagaaa 2340 gggttatccg tcgttaaatc aatgtcttga aacgggaccc aatttaagtg aattagtacc 2400 gtcagttctg aacagatttc gagaagggaa aattggagtg gtgtcagata ttaaaaaggc 2460 gtttttgcaa atctccgtac acaaagaaga tagagattat ctacgtttct tctggattgt 2520 aaataatgaa atgataattt tccgtcactg tcgtgtagtt ttcggtttgg catgtagtcc 2580 ctttttatta gcaaccatta tagagtttca tttatcggca tacattaaag ctgtaaaggg 2640 aaccgaaact ataatacaaa aattgaaaaa cgcgttttat gtagacaact gtgtagcaag 2700 cgtaaattca gaagaagagt tagaaaactt tgtacgagaa gctacgtcga ttttagcggc 2760 aggcggatta cgtggatggg aaagctcggg tgaaacaata gaaaatgaat cgagtctcgt 2820 tttaggaatt ttatggaata aaaaggaaga tacaatatca atcaatccag cagtattaaa 2880 cgagaattca cctgatgtca taacaaaaag aacaattctg tctgcaacac gttaaagttt 2940 tcgatcctat aggttttacg tgctctgtgt ctttactacc caagctgtta tttaaagaat 3000 tttgggcgga aaaaatagac tgggataccg aaatcgaaga caatcggggc aatcaattta 3060 gaaactggct gaatgactta gataaattaa accagattaa aattccgcga aaactgggca 3120 aaagaaatct aacgttacac acattttgtg acgcaagtgg actagcatac gtggcggcgg 3180 tctttgctag agttaaggac gggaataaca caagtgttag acttctaagt gctaaatctc 3240 gtatagccct gtgaaaaact actattccgc gtttagaatt gatggcggcc accattgcag 3300 tccggctaac ggtgtctaaa tctctaacgc gaaaaatttt aaagacaacg ttttggacag 3360 attcgacgac ggtcttagcg tggattaaac gggatacgca gtggggtaca tttgtctgga 3420 atagaataaa agaaattcga acttttgcaa accaaatgat tggagacatg tacccggaga 3480 tttaaatccc acggatcttc cctcgcgtgg ctgcagtccg gcacaattag tgcattccga 3540 atgatggttt ggccgaacgt ggctctataa agtggaatcc gagtggcgca taactagtcc 3600 agacatttat agtgaagagg aaataataaa agaagcgaaa aggtcaattc taatacacac 3660 attataaact gagattttta agtttaaagc caattgtact tttcttccta cagcaaactt 3720 gttcgattct ttgcgtggat gcttagattt tttgctaact gtaggaaaaa agtcagcaat 3780 aaagaagttg caaagaagat taacattatc ttttgcagag agaaaagcag ccgaaacgaa 3840 actttttaag catttacaaa gtaatatgtt taactcagta tctaaaacta agttatcggt 3900 atttcgaaca atgaaaactg aagacgcgct atacgttttg aaaactaaga tttttaatag 3960 acgcgataat cgtaatttct tgtgccctat tcttttatcc aacgacaaag atatagttcc 4020 aatgctgatt cgggaatatc acgagtctat gggccacgcg gggacacaaa taattatgac 4080 caagctacgg gagagattct ggattatttc tgttcgtaaa attgtaagag cagttatctc 4140 tgactgcata atttgtaaaa aacaaagagt taaacgcatg gaaaccaaaa ctcctccgtt 4200 acctcgcaat cgcattcaag atgccgcgat tttcgaaata gttggagtgg attttgcagg 4260 gccactaatt tcgcgggaag ggggggggaa gagttggatc tgtattttta cgtgtgcagt 4320 gtacagagca atccatcttg aattagcatc gacgctatca acacagggtt tcttagaatg 4380 tttgcgcaga tttatagcaa ggcgcggacg tccaaaagtt atatatagtg acaatggaac 4440 taattttacg ggtgccgcta acgctctgag caagttagat tgagaaaaaa tcggcaaaca 4500 taaatcgcta actcaaattg aatggtattt caatccacca gcggcaccgt ggtggggtgg 4560 ttggtgggaa agattgatag gaattttgaa aacgttgctg cgtaagattt tgagcaaagc 4620 gagtcttcct tatgaaagct tatataccgt tctatgtgat gtcgaagcaa taataaatgg 4680 tcgcccactc acatatatct cagaagatcc ggacgatctt agaccattgt cgccttcaat 4740 gtttttgcaa gagattcgag agtacggtgt tccagattgc gacatgctct atcgtaaaaa 4800 actaaataac aagttcaaat acagacagaa aattttcgag gaccttcgga aaaaattccg 4860 cactgaatat ttaggccaac tattgttaaa aaatggaaag aaagaaacgc gaaaaatcaa 4920 agtgggtgat gtcattttgg cgtcggcgat gatacacata gacgcataaa ctggcctctt 4980 acacgaataa tagaaataat tccgggttcc gatggtcaag gcagagtact cactctaaaa 5040 attaggaatg gtgtgcttaa gagaccgata cagaggatct atccgttaga gatttcgcaa 5100 gaaaaaaaaa gatgttgtaa cagatttacg ggaaaaagca aaatccggaa aagttatgcg 5160 taaagatctc aatcgggagc cgaataattt taaaaccaat gacacgttgt taagttataa 5220 accaaaggag caagatccga agaccgttac cactagaagt ggtcgcgtag ttaagaaacc 5280 ggaacgtttt ggatgttact ttaagttagt tgtaagtttt atatgtccaa ttgtcaatgc 5340 gttcgataca aatttaagta gcgtaatagt ttattttgta acagtttatt ctgtaagaac 5400 atttcgtttt cccacaaaag gaggggagga 5430 // ID BEL-4_NVi-I repbase; DNA; INV; 5704 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5704 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1340-1340 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 453..1976 FT /product="BEL-4_NVi-I_1p" FT /translation="WRTATSCISVGTTPRNLISSGSTEKCTAFPICDESSP FT RTRYTSASDDSSGSTGAQKRDIILGSRTQRGVARSRVASYNSSQRQATRHF FT SPFPNDDYGDGLQAEQLDDDDQAEQLDGQENSEESALNLSLNENAGIESEN FT ESENDRYGEICDDFGDRFHSTECNSRLVRNHVAIQEQSSSNPANEIAEAIK FT LTLSTVRESSFYSSNENSKLAHRMSSKSLPTFTGDPIEWSRFKQVYEVSSE FT LGAYNDKENAARLYDALRDDARKAVKMLYVSGSSAEEIMKVLEMRFGNKKV FT ILLKVIREIKNLPKMGSSKSDLINFASDLRGALGAIKVLDSGYLCSTELED FT EIIKKLPDSVISNYIRFAAHDDNQKSKLEKITEFLDKEAEMMIRAGVVTDQ FT SHEFNRKKSDHSSKDSRDSHNTYRERSVCTTNQNISELPTRQNNQRCAHCG FT RKNHSTEFCHDFQKAPNYQRWKVARSNRLCFNCLREGHSRLKCTQPSCKRC FT GRRITQMKRNS*" FT CDS join(2856..3587,3591..4055,3947..4822,4689..5684) FT /product="BEL-4_NVi-I_2p" FT /translation="FHYAIKDDLRDMFLKINIQEADRDAQRFLWRGCDRTI FT EPDKYVMSSVLFGAKSSPFTALFIKNKNASLCSSTYPAAAKSIIENSYIDD FT FLESCETREEASSRVQQVVEINKHANWEMHGWASNDASVLSNSNINSNDKQ FT PINFETKDNIEEILGLRWVNSTDELIFKINIEKISQDLRTGAKKPTKRVLL FT KIIMSIFDPLGFLTPFTIEAKLILQGVWNSKIGWDECIQDTEFDQWKRWLQ FT KLDIKLLHIPRSYRSKNCQVKSAELHIFCDASSKAYAAVAYWRICLKDDSY FT HVSLIMAKSRVAPMKETTVTIPRLELQAAVLAIRVANIIAKEHDFQISRRV FT FWSDSKTVLHWINKEPREFKIFVANQLAKILRLPNGGGFRRPKTPLMMGLE FT SRQTLDFCSESISENSSTSEWRWIPSSENPADDGTRIAPDALAKDSRWFLG FT PEFLKKSEESWPVEQVSNFDKGDSERREKPVFCAFTITKRVYKLFDFTKYL FT SQFSSWLRLIYPIIRFYEAINKMRKLDINLVEIRDKAEKMCLEWSQSVAFS FT VEINALKNSLPISKKSKIAGLNPFLSKDGILCSNSRLINLEECEISTQPII FT LEGSDKITRLLVKHYHESCFHGSHETVINELRQKYWIVGLRHLLRSIVSNC FT AICKWFRSNPSTPKGSSPSSSTRLSTASLFSLRYRLFRAFECAIAQFVNGF FT AVIHLLQKAALPLARLGYRLRPFSHCGIDYFGPLNVKVGYRRREKRWGVLF FT TCMSTRAVHIELAQTLSTSSAILALKRFTGRRGTPFVIYSDNGTNFIGINN FT EIKKALKDLDRKKLNEFAAKKQIVWKFNPPTASHMGGAWERLIRSVKNALN FT VVFKDQAYCEELLQTILIEIEHCINSRPLTHVSIDPRDKEALTPNHFLLGA FT SSGEIRLGRCDKQIECSRKQWETAQYMADKFWDRWLREYLPTLIPRKKWQE FT NESPLIVGDLVLVLDNNLPRNQWKKGVVTRIFPGSDGQVRVAEVRTSSGIF FT LRPSRKLIRFTVVQNS*" XX SQ Sequence 5704 BP; 1820 A; 1105 C; 1261 G; 1518 T; 0 other; ttttggcgcc cgaacaggga cctgaggtta cagccttttt tacgcgcaga aggaaaaatg 60 ggcaagagga cgacaaccag caagaaaact gcaccaccca ataagaaaag ggctccagag 120 attttatctc gatatgaagc tgaggacgat ccggatgcgt tagaggttct tccaccttct 180 agcgcggact ctggcgaaga gcagggacat gtagacagtg gcggcctcga cggcgcagaa 240 gcaccgcctt cgaattcatc atcggggaaa aaccggtcaa cttcagcagc ggcccagaaa 300 cgcgttttac ctcctgtgac ggcgttgcaa aagcgccaac cagcttcaac agcagtggca 360 gcacagaagc gtgctccgct tcctaaacat acgacagcag cagctcaagg acgtacgtca 420 attcaatcag cgacggtgac gcgggcgctt agtggcgcac cgcaacctcc tgcatcagtg 480 tcgggacgac gccacgtaac ctcatcagca gcggaagtac ggaaaagtgc accgctttcc 540 caatctgcga cgagagtagc ccaaggacgc gctacacttc ggccagcgac gacagcagcg 600 gcagcacggg cgcgcaaaag cgcgatatca tcctcggcag ccgtacccag cgcggagtcg 660 cgagatcgcg cgtcgcatcc tacaacagca gccaacgaca agccacccgt catttttctc 720 ctttcccgaa cgacgactac ggcgatggtc tccaagcaga gcagctcgac gatgacgacc 780 aagctgagca gctcgacggt caagagaatt cagaagaatc tgcgttaaat ttaagtctga 840 atgagaatgc tggtattgaa agtgagaatg aaagcgagaa tgatcggtac ggcgagattt 900 gcgatgattt cggtgatcgt tttcacagca ctgagtgcaa ttcaaggtta gtccgaaatc 960 atgtagcgat tcaagaacaa tcaagcagca accccgcaaa tgaaatagcc gaagcgatca 1020 aattaacgtt aagcactgtt cgcgagtcaa gtttttatag ttctaatgaa aattcaaaac 1080 tagcacacag aatgtcatca aagtctttac caacatttac gggagatcca atagagtggt 1140 cgcggttcaa acaagtttac gaagtttcct ccgaactcgg agcatataat gacaaagaaa 1200 acgctgctcg attgtatgat gcgttgcgtg atgatgctag aaaggcagta aaaatgcttt 1260 acgtatctgg tagcagcgcg gaagaaataa tgaaagtgct tgagatgcgt tttggtaaca 1320 aaaaagtaat actgcttaaa gtaattagag aaatcaaaaa tttaccaaaa atgggctcaa 1380 gcaagagcga tttaattaat ttcgcttcag atctgcgagg cgctttaggg gcgattaaag 1440 tgttagatag cggttatttg tgcagtactg agcttgaaga cgaaattata aaaaagctac 1500 ctgactcggt gatcagtaat tacatccgct ttgctgctca tgatgacaat caaaaatcaa 1560 agttagaaaa aattactgag tttttagata aagaagctga aatgatgatt agagcaggtg 1620 tagtaactga tcaatcgcac gagtttaatc gcaaaaaatc tgatcactcg tcaaaggatt 1680 ctcgagattc acataatact tatcgagaaa gatcggtgtg taccactaat caaaatatat 1740 ctgagcttcc tacgagacaa aataatcagc gatgtgctca ttgcggccgt aaaaatcaca 1800 gtactgagtt ttgtcatgat tttcaaaaag ctccgaacta tcagcggtgg aaagtagcta 1860 gatcaaaccg tctctgtttt aattgtttaa gagaaggaca ttctcgatta aaatgcactc 1920 agccatcgtg caaacgttgt ggccgacgta ttactcaaat gaaacggaac agttaaaacc 1980 gactgatcaa gaatctcctt ctcaaagctc tgataaacct gctgatattt cgactttcac 2040 gcaatcaaca aatagtacca gctgactagc atcgcagacg tttttaaaac taattccggt 2100 tagaataaat ggaacgtcaa gctttaaaat tgtgtatgct cttttagatg aaggttcgac 2160 tgcgacgcta atcaactcac gaatagtaaa agaaattgga gtgcgaagta cgaaaataga 2220 tatagctata aaaggtgttg gaagcgaaat tgagagcatg ttttcaagcg aaaatgtaga 2280 cattgaggtt tctagtccgt tctcaaattt ttcattaaca aatgttttag tcgttaataa 2340 tctagcgctt cccaaacaat gtgtgccttc tggattaatc gaactttgcg caaagcttat 2400 gggaattcgt gtaagcgctt accatcaagc tcccgatttg ttgatcggcc aagatcacag 2460 tagtttaata attactcgtg aatttcgtgt tattcaaaaa gctctttacg tatcaagatg 2520 ccttttgggt tggtgcatac acggtaattt tttaaataac gaagtctctg ttcagattga 2580 aacttgaccg tgacgaacaa tatgcgaaat tgtattactg tgaaatggaa agacttatca 2640 aaaatggttt tgctaaaaaa tgtgcgaata aaccgtcagg gagtcgaatt tggtacctgc 2700 ctcacttcgg cgtatcaaac attaataaac cggggaaagt aaggttggtt ttcgatgccg 2760 cggtcgcgac ttcatctgtg agttttaata aacttttact ttccggtcct gatcttttaa 2820 aatcactttt aggcgtgttg atgcgatttc gttagtttca ctacgcgatt aaagacgatt 2880 tgcgcgacat gttcttaaaa attaatatac aagaagccga ccgcgatgct caaagatttt 2940 tatggcgagg atgcgatagg actattgaac ctgacaaata cgtaatgtct tccgtattgt 3000 ttggagcaaa atcgtctccg ttcactgcgc tattcataaa aaataaaaac gcttctctgt 3060 gttcatcaac atatcctgca gctgcaaaga gcattattga gaatagttat atcgacgatt 3120 ttttagaaag ctgcgaaaca agagaggaag cgtcttcgcg ggttcagcaa gtagtcgaaa 3180 ttaataaaca tgcgaattgg gagatgcacg gttgggcaag caatgatgcg tctgttctga 3240 gcaattcgaa cataaacagc aatgataaac agcccataaa ttttgagaca aaagacaata 3300 tcgaagaaat tttaggatta agatgggtaa actcaacgga cgaattaata tttaaaataa 3360 atatcgaaaa aatcagtcaa gacctacgta caggagccaa aaaaccaacg aaacgagttt 3420 tgttaaaaat tatcatgtcg atctttgatc ctcttggttt tttgacacct tttaccattg 3480 aagccaaatt aatattgcag ggtgtttgga attcaaaaat aggttgggac gagtgcatac 3540 aagacacaga atttgatcaa tggaaacggt ggttacagaa gttagattaa ataaaattat 3600 tgcacatacc gcgctcttat cgatctaaga attgtcaagt aaaatcggcc gagttacata 3660 tattttgcga tgctagctct aaagcgtatg ctgctgttgc atattggcgt atttgtttaa 3720 aagacgattc gtatcatgtt tcgttaatca tggctaaaag tcgagtcgcg ccaatgaaag 3780 aaactactgt cactattccc cgtctagaat tacaagcggc agtgcttgca atcagagtcg 3840 ccaatattat tgcgaaagaa catgactttc aaatctcgcg tcgggttttt tggtctgatt 3900 caaagaccgt gttacattgg atcaataaag aacctcgcga atttaagatt tttgtagcga 3960 atcaattagc gaaaattctt cgacttccga atggcggtgg attccgtcgt ccgaaaaccc 4020 cgctgatgat gggactagaa tcgcgccaga cgctttagct aaagacagca ggtggtttct 4080 cggtcctgaa ttcttaaaaa aatcggaaga aagctggcct gtcgaacaag tgtctaattt 4140 cgacaaaggg gattcagagc gccgtgaaaa acccgttttt tgcgctttca cgataactaa 4200 acgagtatat aaactttttg attttacaaa atatttatcg caattctcct catggttacg 4260 acttatttac ccgattatta gattctacga agcaattaat aaaatgagaa agttagatat 4320 taatcttgtt gaaattcgtg ataaagcgga aaagatgtgt ttggagtgga gtcaatcagt 4380 ggccttttct gttgaaatta atgctttaaa aaattcattg ccaatttcga agaagagtaa 4440 gattgctggc ctgaaccctt ttttaagcaa agacggtata ttatgttcca acagtcgtct 4500 tataaattta gaagaatgcg aaatctcaac tcaaccaata attttagagg gaagcgataa 4560 aatcacgcgt ctgttagtaa aacactatca cgaaagctgt tttcacggta gccatgagac 4620 tgtaataaat gaattaaggc aaaaatattg gattgttgga ttacggcatt tattgcggag 4680 catagtaagc aattgcgcaa tttgtaaatg gtttcgcagt aatccatcta ctccaaaagg 4740 cagctctccc tctagctcga ctaggctatc gactgcgtcc cttttctcac tgcggtatcg 4800 attatttcgg gcctttgaat gttaaagttg gatatagaag acgcgaaaaa cgctggggcg 4860 tattgtttac gtgcatgtca actagagccg tacacatcga actcgcacag acgctcagta 4920 caagttcagc gattttagct ttaaaacgtt ttacaggcag aaggggaact ccattcgtta 4980 tatacagcga taacggtaca aattttattg gaataaataa tgaaataaag aaagccctta 5040 aggatctcga tcgtaaaaag ctcaatgaat ttgcggctaa aaaacaaata gtttggaaat 5100 tcaatccacc aacggcatcg catatgggag gcgcttggga acgattaatt cgttcggtca 5160 aaaatgcttt gaacgtagtt tttaaggatc aagcgtattg cgaagagctt ttacagacaa 5220 tattgattga aatagagcat tgtattaatt caagaccttt gacacatgtc tcaatagatc 5280 cacgggataa agaagcatta actcctaatc atttcttgct cggcgcgtcg tcaggtgaaa 5340 tcagattagg tagatgcgac aaacagatag aatgttcgcg aaaacaatgg gagactgcac 5400 agtatatggc ggataaattt tgggacagat ggctgcgaga gtaccttcca acactgattc 5460 cgagaaaaaa atggcaagag aatgaatcgc ccttgattgt tggagatttg gttttagttc 5520 tggataataa tttacctcgg aatcagtgga aaaaaggtgt tgttacgcga atattccctg 5580 gatctgacgg acaagtacgc gtcgctgaag tacgaacgtc cagcggtata tttttaagac 5640 cgagtagaaa gttaattcgt tttacagtag tgcaaaactc ctgagttttg caccaggggg 5700 agaa 5704 // ID Neptune2_Ren repbase; DNA; INV; 3400 BP. XX AC . XX DT 20-DEC-2006 (Rel. 11.12, Created) DT 20-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Neptune2_Ren is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Neptune2_Ren. XX OS Reniera OC Eukaryota; Metazoa; Porifera; Demospongiae; Ceractinomorpha; OC Haplosclerida; Chalinidae. XX RN [1] RP 1-3400 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Neptune2_Ren is a Penelope-like element (PLE) from the sequenced CC genome of the sponge Reniera sp. JGI-2005. It belongs to the CC Neptune group of PLEs. Its short ORF1 is defective, and ORF2 CC contains regions homologous to reverse transcriptases and to CC GIY-YIG endonucleases. The element appears to be low-copy and CC probably inactive. It is related to Neptune3_Ren (~60% CC nucleotide sequence identity), and also ends with TC dinucleotide CC repeats. Consensus sequence was assembled from trace archives. XX FH Key Location/Qualifiers FT CDS 388..873 FT /product="Neptune2_Ren_1p" FT /translation="NHKHSISIENTPNQRKKVLILEKRIKYHKKILTNKSI FT PAKYKPQHLLESSEASQNEMNSLFIKEYNEIFFKHLKRVIEADTTVLKVKM FT ARLQSAKSYHHQPTTVTCKTHTSAKPQQANIYASQEANSRKRKNTDSQSTN FT PTKQPKISHFLEKSRHRPPNIE*" FT CDS 708..3224 FT /product="Neptune2_Ren_2p" FT /translation="NPHISKASTSKHICISRSKLKKKEKYRFTKYQPNKAT FT ENKSFFREKSSQTTKHRMTNSNTIYNLSSYKLSQSQLNILNKGLSFSPSQP FT FNLQDHLSFLNQYDAFSNSLRDLALTDNDRPQHTPDLSLDSVTQHLYREMK FT FLKGSTQPHYPPYFTTNIASVENFIESTKIEIDELLSKITSKQKENISKQE FT LKAIKSLKKSNLVIKPADKNLGIVVMNSNDFVEQCLKQLTSNTYVRVETVP FT EEEIKKKIQNTLVKFKNQFGASNNRLYCFLQPNQKYQTPNFYGLPKMHKEL FT DVNGLPPLRPIVAHNNSLLSNTAKFIDHILQPLAQAYNDYLKNSTQLISYL FT ETSTIPNDIILVTLDVKSLYTSIPQSECLESVRNEMFNHQDLIIFDPDLIT FT HLLSINLNNNYFEFSKKVFLQVIGTAMGAAFSPTVANIYMSVLIKRFLSST FT TEKPFFFKRYIDDIIMLWPKHQDLNNFFNKLNKFHPNIKFTMSQSETTINF FT LDITIYKGEDFSETNKLSIKTYQKEINRYQYLHFNSNHPIHIFKGLIIGET FT IRYVRTNSTIIEYKKQIKRFTERLISRGYKINFINTAIRSVNYKNRAKYLQ FT NNNNNTTTKQRLQQPILKCTPPPNFICIKRIILEQFHEYNLSRYSKEPLFV FT TLKSNTLKDILVHSKHRPSTKDLRKITEETSPIPQHNQPKRIVTLRTTGPN FT KCNKKRCSTCNHFNSSAIFKSTANKRQFKIEHPFTCTSSNVIYLITCLKCQ FT KQYVGKTSKTLRERICHHRSSINNNEPRYISKHFNLPGHQLSHLKVQVIDS FT PKSNDPETLDRLETHWIQSLNTIQPKGLNVLSNQQI*" XX SQ Sequence 3400 BP; 1349 A; 800 C; 432 G; 819 T; 0 other; atgggagggc taattatttt ggcccgtgta accatttgga tagtctcggt ccctttcact 60 tgttaattgg gtatatcctg aggactcaag agttaagcct gacagcgggg aacaccaacc 120 cctagagtct aggaccactt tgagtcactt aactcagttt gctgtgtcac ttcttcccta 180 tagcaccgga gtattgtgca ctaatcctat gcaatgaacc cctcaaaaca attacataat 240 ttgtcaaata aatgtaagca taaagtcaaa aaataaatct gtcaccattc taacaacact 300 tacatacaca tgtacatgta cacctgttat acacagcacc aacttgtaac acacaaaata 360 tatactacaa agataacata catgtaaaac cataaacact ctatttccat agaaaatacc 420 cccaatcaaa ggaaaaaggt actaatactg gagaaacgca ttaaatacca caaaaagatc 480 ctaaccaaca aatcaatacc agcaaaatat aaacctcagc atcttttaga gtcatcagag 540 gcatctcaaa atgaaatgaa cagcctcttc atcaaagaat acaacgagat ctttttcaaa 600 cacctcaaac gtgtcattga agctgacact acagtactaa aagtcaaaat ggcaaggctt 660 cagtcagcta aaagctatca tcatcagcca actacagtta catgtaaaac ccacacatca 720 gcaaagcctc aacaagcaaa catatatgca tctcaagaag caaactcaag aaaaaggaaa 780 aatacagatt cacaaagtac caacccaaca aagcaaccga aaataagtca ttttttagag 840 aaaagtcgtc acagaccacc aaacatagaa tgacaaacag taacacaata tacaacctca 900 gctcctataa actatcgcaa agtcagctta acattttaaa caaaggtcta tctttctcac 960 cgagtcagcc atttaatcta caagatcatc tctcctttct aaatcagtat gatgcgttca 1020 gcaactcact tagagattta gctctcacag ataatgacag accacaacat accccagatt 1080 taagtctcga cagtgtaacc cagcacttat atagagaaat gaagttcctc aaaggttcaa 1140 cacaaccaca ttacccacca tactttacta ctaatatcgc atcagttgaa aattttattg 1200 agagtacgaa aattgaaata gacgaactcc tcagtaaaat tacatcaaag cagaaagaaa 1260 atatatccaa acaagaactc aaagcgataa aatcgttaaa aaaatcaaac ctagttataa 1320 aaccggctga caagaactta ggcatagtag tgatgaatag caatgacttt gtcgagcaat 1380 gcctgaagca actgacatca aacacatacg tcagagtaga aaccgtgcca gaggaagaaa 1440 taaaaaagaa aatacaaaac actctcgtca agtttaaaaa tcaatttggt gcctcaaata 1500 acagattata ttgcttttta caaccaaacc agaaatatca aactcctaat ttctacggac 1560 ttcccaaaat gcataaagaa ctagatgtga atgggctacc tccactcaga ccaatagtcg 1620 cacacaataa ttctctattg tcaaacacag caaaatttat tgaccacatc ctacaaccac 1680 ttgcacaagc atataatgat tacctgaaga actcaactca actgatctca tatctagaaa 1740 cgtcaacaat accaaatgac atcatactag taacactaga tgtcaagagt ctctacacat 1800 ctatccctca aagcgaatgt ctagaatcag tcagaaatga aatgttcaat caccaagatc 1860 tcataatttt tgatccagat ttaataacac atcttctcag cattaatcta aataataact 1920 atttcgaatt tagtaaaaaa gtttttcttc aagtcatagg aacagcaatg ggagcagcct 1980 tctctccgac agtagccaac atctacatgt ctgtcctcat taaaaggttc ctatcttcaa 2040 ccactgaaaa gccattcttc tttaaacgct acattgatga catcataatg ctatggccca 2100 aacatcaaga ccttaataat ttttttaata aactaaacaa attccaccca aacatcaagt 2160 tcaccatgtc gcagtcagaa acaactatca attttcttga cataaccatc tacaagggtg 2220 aagatttctc tgaaacaaat aaactcagca ttaaaacata tcaaaaagaa attaatcgat 2280 accaatacct tcacttcaac tcaaaccacc caatacatat atttaaaggc ctcattattg 2340 gcgaaacaat aagatacgtt cgcaccaact ctactataat tgaatataag aaacaaataa 2400 aaagattcac tgaacgtctc atcagccgag gatacaaaat taactttatt aacacagcaa 2460 taagaagtgt gaactacaaa aatagggcta aataccttca aaacaacaac aacaacacca 2520 caacaaagca acgcctgcaa caacccatac taaaatgcac tccaccacca aacttcatct 2580 gcataaaaag aattattctt gaacaatttc atgaatacaa tttatcacga tacagcaaag 2640 aaccattatt tgtaacattg aaaagcaaca ctttgaaaga tatcctcgtc cacagcaaac 2700 atagaccatc aacaaaagat ttacgaaaaa taacagaaga aaccagcccg ataccacaac 2760 acaaccaacc aaaaagaatt gtcactttga gaacaacagg accaaacaaa tgcaataaaa 2820 aacgctgcag tacctgcaac catttcaact cgtctgcaat tttcaaaagc acagcaaaca 2880 aaagacaatt caagatagaa catcctttca cgtgcactag cagcaatgtc atatatctca 2940 tcacatgtct caaatgccag aaacaatacg ttgggaaaac atcaaaaaca ttacgtgaac 3000 gaatttgcca ccatcgatca agtattaaca acaacgaacc cagatacatc agtaagcatt 3060 tcaatctacc tggacatcaa ctttcacatc tcaaagtcca agtcattgac agtccaaaat 3120 cgaacgaccc agaaacactt gacagattag aaacacactg gattcaatct ttaaacacaa 3180 tacaacctaa aggtttaaat gttttaagca accaacagat ctaacaactc aaggtatttc 3240 aaatatacat gtaacaaaac aaaaactcta gatcttatca aaaatacatt acaattatca 3300 aaagggcggg gtccccacag aggtggggac cccgcccacg tgtcctcacc tctgtgggtt 3360 ttttctctct ctctctctct ctctttctct ctctctctct 3400 // ID Sat6_Cis repbase; DNA; INV; 120 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat6_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-120 RA Smit A.F.; RT "Sat6_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000006; The 3' end of Baggins1_Cis has turned into a common CC satellite. XX SQ Sequence 120 BP; 30 A; 38 C; 20 G; 32 T; 0 other; taatgtaatg tatcatggcc tacattcctg gtcccagtgc taccatggac ccacccatac 60 taatgtaatg tatcatggcc tacattcctg gtcccagtgc taccatggac ccacccatac 120 // ID BEL-178_AA-LTR repbase; DNA; INV; 195 BP. XX AC AAGE02028718; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-178_AA_; KW BEL-178_AA-I; BEL-178_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028718; Positions 1267 1461. XX SQ Sequence 195 BP; 63 A; 43 C; 41 G; 48 T; 0 other; tgtgacgacc aaaccctccc cgcgatggag atggcctaat gcagaaggtg ccgttttatt 60 tctatcgggt ccactgctgt caagtgttgc gagtaggaac aatagcacgg ccggattgat 120 ataaaatcat cataaaaaca actttgcaag tcgtacaaca acgtgatgaa atttgatctc 180 cctaaaataa taaca 195 // ID R1_DPe repbase; DNA; INV; 5444 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE persimilis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DPe. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5444 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 398..1840 FT /product="R1_DPe_1p" FT /translation="VTPIEVATTRRAMEIETEADASDTSVASVGASSSSSV FT PARRRGRPKSSAAKQGAKKIGLGIGLAGERPIPAERSPPTSLPFEQAPSTS FT AAAAATTAASTSAAAAATAAAPLPPRCHTAATTAATTAATCPATDIVAAGY FT SAIIAEMSAIRMAVSKAVLKGLMPPEASMEILVATNRYDELVMALAGEKVR FT LEERARMPPPPRPAAAAHTGTTAVTTAAYATAFPGLPAPSAVAAPIPKPRD FT TWSALIKSKNPEETSKELVERVKKTVVPTLGVRVHEVRELKSGGAIIRTPS FT VSELRKVVASSKFTEAGLEVKKRPETKPQVVVYDVDTSITPEEFMEELFTK FT NLEETMTAAEFKKSVHLGSKPWSVTDGATINVTLEVDAKAQEALRECVYIK FT WFRCRCRSLVRTYACHRCAGFDHKVSQCRLKENVCHRCGQNGHNVARCPNP FT VDCRNCRFKGYPAAHSMLSAACPIYGAVLARVQARH" FT CDS 1837..4956 FT /product="R1_DPe_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TLMFSFIQANCGRGRAAVMELGAIMRSSGRQFALVQE FT PYVDGAGRITGLPSGMRVFQDRRGKAAIIVDEPDAICMPMETLTTDFGVCV FT RVTGRFGSIFLCSVYCQFDTDLAPYLRYLDAVLLLGSRTPVIFGLDANAVS FT PAWFSKLSEHNRGQANYARGELLSEWITEMRAGVLNEASRVFTFDNRRGQS FT DIDVTIVNQAATMWATYDWEVKEWDTSDHNMIHVVVTTDPNDTVEPIAPVP FT SWKLSNARWRLFEEEVVREIAELPEDIAESPLDNQVSALRSVVHSVCDRVL FT GRSTPRAARKVVWWTAELHSKRREVRRLRRRLQDARRHETDAAEELVLALR FT ISSAQYKKLILRSKEDNWRRFVGENRDDPWGHVYRICRGRKKSTEIGCLRT FT DGRLVATWRDCAGVLLRNFFPVAETNAHIAIPIEAPPALEAFEVDACVARL FT KSRRSPGMDGITGAICKALWRAIPQHMTAMYSRCLETGYFPTEWKHPRVVP FT LLKGPDKDRTDPTSYRGICLLPVLGKVLEGIMVNRLKDTIPDGCRWQFGFR FT QGRCVEDAWKHVVSTVSASRSKYVLGVFVDFKGAFDNVEWNAALRRLLELG FT CREASLWRSFFSGRSASIVSRHGVATVPVTRGCPQGSISGPFIWNILMDVL FT LQRLEPLGALSAYADDLLLLIEGNARSELEQKGGELMSIVGAWGVEVGVAV FT STSKTAIMLLKGILRRPPVVRFAGASLPYKRKYRYLGITVGERLSFLPHIT FT GLRDKLTGVVGALSRVLRVDWGLSPRAKRTIYAGLMVPCALFGASVWSVVT FT TTQVVARRHLLACQRIVLIGCLPVCRTVSTMALQVLAGAPPFDLVARRLAI FT SFKLKRDYPLEESDWLYGEDLANLSWKQKMARLDEDLLCEWQRRWDDGDSP FT GRVTHRFIPNAGFVYSERRFGFTLRAGFLLTGHGSLNAFLHGRSLSDTPAC FT LCGAEREDWQHFLCACPLYTDLRDLDGLGVQYTDGDWTFSDVTASQERMRT FT LDRFAGLAFSRRQQLLNAQAAHGLGGFPIQAGRG" XX SQ Sequence 5444 BP; 1230 A; 1452 C; 1641 G; 1121 T; 0 other; cagttgcttt ttgactgcca ttcgaagcag acgtgttttt caagcggtcc gctataccgt 60 cgttcgaatt aagtaaaaag tgatttttcc aagtggaaaa tatatcaaaa ttgccgcgag 120 atcacgcgct gttcgtgagt gaaatcgggt gttaatttgg tgaaagaaca aagcagatat 180 ttgcaaatta atatttgcaa aataaacgct aaaaacaacc acgtgcggaa ccctagttcg 240 taaacaagtg cgaatagttc gacaacaagt cggaatagtt cgacagcacg ttcgagttcg 300 aagctttggt tcgaagcttt ggctcgagtt cgaagcgttc ggttcgtgtc tccgctgaga 360 cgttggttcg agtctccact aagtgccgat tagttgagtc acgccaatag aggtagccac 420 aactcgccgg gctatggaaa tagagacgga ggccgacgcc agcgacacta gcgtggctag 480 tgtaggggcg agctcgtcgt cgagcgttcc cgcccgcagg cggggtcggc cgaagtcctc 540 agccgccaaa cagggagcca aaaagatcgg gcttgggatt gggctagctg gagagcggcc 600 gatcccagcc gagagatcgc cgccaacgtc gctccccttt gagcaggctc catccacctc 660 tgccgctgcc gctgccacca ccgctgccag tacttctgcc gccgctgccg ccaccgctgc 720 cgccccgctg ccgcctcgct gccacaccgc tgccaccacc gctgccacca ccgctgccac 780 atgcccggca actgacattg tcgcggcagg ctactcggcc atcatcgcag aaatgtctgc 840 gataaggatg gcggtgagca aggctgtcct taaggggctg atgccgcccg aggcctcaat 900 ggaaatcctc gtcgcgacca accggtacga cgagttggtc atggccctgg ctggggaaaa 960 ggtgcgcctg gaggagaggg caaggatgcc gcctccaccg agacccgctg ctgctgccca 1020 cacgggcacc actgccgtca caaccgctgc gtatgcgaca gccttcccag gcctgcccgc 1080 accgagcgcc gtcgctgcac cgatcccaaa gcctagggat acctggtctg ctctaataaa 1140 gagcaaaaac ccggaggaga ccagcaagga gcttgtggag cgtgttaaga agaccgtggt 1200 gccgactctt ggagtccgcg tccacgaggt tcgcgagctg aagagcggag gagcgattat 1260 tcgcacccct tcagtgagcg aactgaggaa ggtagtggcc agcagcaaat tcaccgaagc 1320 aggattggag gtcaagaaga ggccggagac caagcctcag gtcgtggtgt acgacgtgga 1380 cacgtccata acaccggaag agttcatgga ggagctcttc acaaagaacc tggaggagac 1440 catgactgcc gcggagttca aaaagtcggt tcacctgggc agtaaaccct ggtcggtcac 1500 cgacggcgcc acgatcaacg tgacgctaga ggtcgacgca aaggcacagg aggcgttgcg 1560 cgaatgcgta tacatcaagt ggtttagatg ccgctgccgc tccttggtca gaacatacgc 1620 ctgccacaga tgtgcaggct tcgaccacaa ggtgtcgcaa tgtcgcctaa aggagaacgt 1680 gtgccaccgg tgtggacaga acggccacaa cgttgcacgg tgtcccaacc ccgtggactg 1740 ccgcaactgc cgcttcaagg ggtaccctgc agcacattcc atgctgtcag cggcgtgccc 1800 gatctacgga gcggtactgg cgagggtgca agctagacat taatgtttag cttcatccaa 1860 gctaattgtg gccgtggccg agcggctgtg atggaactcg gagccatcat gcgcagctct 1920 ggccgtcagt tcgcactggt ccaggagccg tacgtcgatg gagcagggcg gattaccggc 1980 cttccttctg gaatgcgagt tttccaggac cgccgaggaa aagctgctat catcgtcgac 2040 gaaccggacg ccatctgcat gccaatggag acccttacca cggattttgg agtatgcgtc 2100 agagttacgg gaagattcgg ctcaatcttc ctatgctccg tgtactgcca attcgacacc 2160 gacttggcgc cgtacctcag gtacttagat gcggtgctgc tgctgggcag ccgcactcct 2220 gtcatctttg ggctcgacgc gaacgcagta tccccagcgt ggtttagcaa gctctccgaa 2280 cacaatcggg ggcaagctaa ctatgcacgg ggtgagctgc tgtctgagtg gataaccgag 2340 atgagagccg gcgtgctcaa tgaagccagt cgggtgttta cattcgataa ccgtagaggc 2400 caaagcgata tcgatgtgac aatcgtcaac caagctgcga ctatgtgggc cacatacgat 2460 tgggaagtga aggaatggga caccagcgat cacaacatga tccatgttgt ggtgacgact 2520 gacccgaacg acacagttga gcccattgct cctgtgccgt catggaagct ttccaatgcg 2580 cgctggcgat tgttcgagga ggaagtggta agggagattg ccgaattacc ggaagacatc 2640 gccgaatcgc cgttggacaa ccaagtgtct gcactgcgct ctgtagtgca cagtgtgtgc 2700 gacagagtgc tgggacgcag cacaccgaga gccgcgagaa aagtagtttg gtggactgcc 2760 gaactacact ccaaacgccg agaggtcagg agactgaggc gaaggctcca ggacgctcgt 2820 cggcatgaga ccgacgcagc agaggaactt gtgctcgcgt tgaggatctc ctcagcgcag 2880 tacaagaagc tcatcctgag atcgaaggaa gacaactggc gacgcttcgt gggagagaac 2940 agagatgatc catgggggca cgtctacagg atttgccgag gccgcaaaaa gagcacggag 3000 attggatgcc ttcgaacgga tggtaggctg gtcgcaacat ggcgcgactg tgcgggtgtg 3060 ctccttcgca acttctttcc tgttgcggag acgaatgcac acattgccat cccgatcgag 3120 gctccaccgg ccctcgaagc tttcgaggtt gatgcatgcg tcgccaggtt gaagagcaga 3180 cgctctcccg gcatggacgg catcacaggt gccatttgca aggcattgtg gcgtgccatc 3240 cctcagcaca tgacagcgat gtattcccgc tgcctggaga cagggtattt cccaacggaa 3300 tggaagcatc ccagggtggt tccactcctg aagggacccg ataaggaccg gaccgatcct 3360 acctcatatc ggggtatctg tctgttgcca gtgctgggca aagtgcttga gggcatcatg 3420 gtgaatcgtc tgaaggacac cattccggat ggctgcagat ggcaatttgg ctttcgccaa 3480 ggacgttgtg tggaggatgc ttggaaacac gtcgtgagta ctgtttcggc cagccggtcg 3540 aaatacgtgc tcggagtctt cgtggatttc aagggagcct tcgacaacgt tgagtggaat 3600 gctgcactac gccgcctcct tgagctggga tgccgagaag caagcttgtg gcgaagtttc 3660 ttctccggcc ggagtgcgag catcgtcagt aggcatggag tagccactgt tccggtgaca 3720 agaggttgcc cgcaggggtc cataagtggt ccatttatat ggaacatctt aatggacgtg 3780 ttgctccagc gcttagagcc ccttggtgcg ctcagcgcgt atgctgacga cttgctcctc 3840 ctcatcgaag ggaatgcccg atcagagctc gaacaaaaag gaggtgagtt aatgtccatc 3900 gtaggcgctt ggggagttga agtcggcgtt gccgtttcaa ctagcaagac ggcgatcatg 3960 ctgctcaaag gcatacttag acggccgcca gtggtacggt ttgctggagc aagcctgcca 4020 tataagcgca agtatcggta cctaggcatc acggtcggcg agcggttgag ttttctcccg 4080 cacatcacgg gcttgcgtga taagctgacc ggagtcgtag gggcattgtc gcgcgtactg 4140 cgggtcgact ggggactcag tccccgcgca aagcggacaa tatatgccgg actcatggtg 4200 ccctgtgcac tatttggtgc ctcggtttgg tctgtcgtga cgacgacgca agtggttgcc 4260 aggaggcatc tgcttgcgtg ccaaaggatc gtcctgattg gatgcctacc ggtatgccga 4320 acagtgtcca ccatggcgct gcaagtacta gctggagccc ccccgtttga tctggttgcc 4380 agacgcctgg caatcagctt caaactaaag cgtgactacc cgctggagga gagcgattgg 4440 ctgtacggcg aagatttggc aaatcttagc tggaagcaga agatggcgcg actagacgaa 4500 gacctgttgt gcgagtggca acgcagatgg gatgatggtg actccccagg acgggtgacg 4560 caccgcttca tcccgaacgc aggcttcgtc tacagcgaac gaaggtttgg cttcacgctg 4620 cgcgctgggt tcctgctgac gggccacgga tcgctcaatg catttctgca tggaagaagc 4680 cttagcgaca cgccagcatg tctatgcggc gcagaacgcg aggattggca gcacttccta 4740 tgtgcttgtc ccctctatac agacttgcga gacctcgatg gacttggagt gcagtacacg 4800 gatggcgact ggaccttctc cgatgtgacg gcttctcagg agagaatgcg gactctcgac 4860 aggttcgccg gactggcgtt ctccaggcga caacagctgc tgaatgcgca agcggcgcac 4920 gggctgggtg gattccctat ccaggccggt cggggctaaa agggaggatc gagctgagtc 4980 cttaagatcg gtaccacggg ttgtgcagtt ccaaggctgc acattgaggt cggcccccta 5040 gtgggagtat cgtggtggct gtggttgata cccaaacgtt acgcggggag agccgctagg 5100 ctcctcgtgg agttgcgctc tcaaccgggt gccgagaccc atagatcgga agacgtgtta 5160 gatacacctc gcccctcacc aagggggatt gtatgcccga ccaggatact ttcaattggt 5220 accagaagag tcgctatgta catagctata gcttcttttt aggggcgcta actggcgcat 5280 tgtaccaggt tcctgcgtat gtagcggtgg tgcgtgatgg cgtgatatat gtaaatatat 5340 cgcactaatc gaggctaagc tgcaaataaa cattagacgc cgtggttgaa atccctccct 5400 gaggaaccgc cacgtaaaat aaagctgaga gatcagattc attc 5444 // ID Zator-1_BF repbase; DNA; INV; 5481 BP. XX AC ABEP01023904.1; XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Branchiostoma floridae. XX KW Zator; DNA transposon; Transposable Element; Zator-1_BF. XX NM Zator-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5481 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR EMBL/GenBank/DDBJ; ABEP01023904.1; Positions 20567 15087. XX FH Key Location/Qualifiers FT CDS 1730..4144 FT /product="Zator-1_BF_1p" FT /translation="MSGKELHDHAQELFKVCCVPLLQKPHLKKVKDMVVTL FT GEGMSEYSEHLEKSNRKVQEQRKQLFPVRSVQDGNSSELRVVEAKVRHDPK FT IINRYCQLEQHLADTEEYKVCFLNDFAPEDCRKRYTYISELQLPMSVELYT FT YKTGNNMGSLHFIWKVPPSKECRDFNLSNRLMHEVEKNVPVYHTREMHKKF FT NDRFALISSAKPVVLREMYQFLTNDASVEECQLSKDIRQRLKILLDSGDPD FT LIVDMRSLNEKKESYEEFWSEASKVIEEMQLASVNDRRHGQVCYMALAMSI FT PDFISQVAKRLPKETNIPCPSWVSLQFWPKNAFADKAVRYTGRLGVKYMVQ FT SRQLNHNHPDCHYAAAVFKYMKEFAVMFKDYSAFVCLDDKHSIKVVEPGYP FT VAAVDRGKSVLVAKDTVFVVADHDFTKVKLVPSVTLVCDIPESACESFYRG FT KVFVCMKDATFQPSSPIRHQTELYHILTKAGKLGLPILCQYSDGGPDHRLT FT YISTQVSLICTGMFIILDLDMLVCARTAPQNSFRNPVERVMSIINLSLQAI FT GVMREKMPPAIEKLFAGLGTMKDVRAMADKHPELKQALVASTEKTRGMMNE FT LFQRLTLKGEAVSTIKPATEQEVEEMWDLILMVDQALTKGDTTKVKVQNKK FT DLMAFMHHCCVPRHYSFSIKKCGDSDCTICKKPKLPPEVFNRLKTFPDPVL FT SGTGEHYKPFIEVYGTATSEGYRPSLKESRAKEHSMPFSPCAENTRGVVTC FT LDCGKPRTIHSQRALTTAQKKDLDSLKEDSMYTCGVSWIPDDHPLRESCYV FT SR*" XX SQ Sequence 5481 BP; 1646 A; 1095 C; 1240 G; 1500 T; 0 other; ggccatgttg atttgattat atggatgaca tccgcgtgcg caccaatttt cagccgattc 60 caaaaaaaaa aagttttcaa gcatcgttca aaccagcttc aacactagta aatatatgcc 120 gaaaatccat accaatgtcg taaaatgcat cttttgtgtc ttctcctatc gcaaaacgta 180 ctgattttgt cgtaaactat tccttgcaca tgcgcattga cgacttaacc gaaacttgcc 240 aaaataactg aaaatggcga cgaaatcaac tgatcgtgat tctcctccga attatgtaag 300 catcaaagtg acaaggaagg gcggcggtgt tgtcgtgaat tggtacgcga ttgattgtgg 360 acaaggtgac gttacatttg cccagttata tgagaaactt gttgcggcaa gcgagttcag 420 gccggtgaac ttcagcgata ttctttcaag aaccgcccat gtgcaatact tctgcaatga 480 agaaaaagtc ggaagaggtg tcgaggtcta cgctggccta aaggtcgggc atgcctgccg 540 acaattcgga tgttttgtcc gcatcgaggt ggacaagacg gccagtggaa agactgatga 600 agatccgcgg cccaagtctg cttttgaggt gatttaaatt tcattgttac cttcagtttt 660 ggattgaact actagtaggg gaggggggcc gagggtattt taaattttca cattcggtac 720 atgtatgcat aaacgactac agcactgttt tttttgtcta ttgacaggat gtctaagcta 780 tgtagtcagt ataaaggcac tgaaccgact gataacaaat tttgcactat tctattgaca 840 ggagattcct gtcagtggaa tttttgtcac cattgacaag ataccttgat accatgacag 900 gcatgaaata gagagactaa agtattatat ggtggggatt aaacccccca tccccttatt 960 acaaggtgtg ctttctatcc cctaggccat cgcaccgctt catcgtatat agtacatgag 1020 gtatactgat tcaaaggtat ttctgtgctt ccagtcaatc atcattataa gattgtttta 1080 cagtgttgca ggaacatcct gtcaatagcc acatcatatt tttgcatgag gaaagaatag 1140 tttccatcat actttgagta caaattgtga tactttgcca ttttcaattg tttaaaggtg 1200 atgatgacag cctccagaga gcagagccga gagacgttca tgtttccagc ctacgctggc 1260 agtggcaatg ctgagaacaa caatcccatg gggcacatcc gtctgtacaa tgacatcgtg 1320 gacattctca agaaacacaa cttcaggtac attttattcc tggagaatgt attgtaacaa 1380 ccaagttttg agatttacat gtatgtcata aggtttgcat gagtggtgat ttagtagagc 1440 tgatcagata gtgacactga caaaatgatg gatttatttt aggagcttga ccttaagttc 1500 gctttcttta tgtccttaaa acaggtttga aagaaacaaa gggaatgaag tttccagcaa 1560 ggaagtcctt accgctgtga gaaatgctgt gtggtatgtg ctaccacatt tgaaaactct 1620 ggctgatcgc accatatatc taccacaaga gttgaggtcg ctgtatgacg acaagacatt 1680 caaacagaca tacaataacc cgttacttta tcgtcataca ctcaaaccta tgtcaggtaa 1740 agagctacac gatcatgccc aggagttgtt caaagtttgc tgtgtgccct tgctacaaaa 1800 gccacacctt aagaaggtga aagacatggt cgtgacacta ggagaaggaa tgtctgagta 1860 cagcgagcat ttggagaaat caaataggaa agtccaagaa cagaggaagc agcttttccc 1920 tgttcgatcc gtgcaggatg ggaacagctc agaactgaga gtggttgaag caaaagtccg 1980 acatgatcca aagataatca acaggtactg ccagttagag cagcatctcg ctgatactga 2040 agaatacaag gtatgcttct tgaatgactt tgctccggaa gactgcagaa agaggtacac 2100 atacatcagt gagctgcagc tgccgatgag cgttgaactg tacacatata agactggaaa 2160 caatatggga tcattgcact tcatctggaa ggttccgccg agtaaagagt gtagagactt 2220 caacctgtca aatcggctaa tgcatgaagt agagaagaat gtccctgttt accatacacg 2280 ggaaatgcac aagaagttca atgatcgctt tgcgctgatc agcagtgcca aacctgttgt 2340 cctgagagag atgtatcagt ttctaacaaa tgatgcctct gtagaagagt gtcaattgag 2400 caaagacatt agacaaagac tgaagattct gctggattca ggggacccag acctcattgt 2460 ggatatgagg tctttgaatg agaagaagga aagttatgaa gagttctgga gtgaagcttc 2520 aaaggtgatt gaagagatgc agcttgcaag cgtgaatgac agaagacatg gacaagtatg 2580 ctacatggcg ctggcaatgt ccattccaga ttttatctca caagtagcca agaggcttcc 2640 taaggaaaca aacataccct gtccatcttg ggttagtctg caattttggc ccaaaaacgc 2700 atttgcagac aaagctgtac gctatacagg ccgactcggc gtgaagtata tggtgcagag 2760 ccgacaattg aaccacaacc atcctgattg ccactatgca gcggcggtgt tcaagtacat 2820 gaaggaattt gctgtgatgt tcaaagacta ctctgcattt gtctgcctgg acgacaaaca 2880 cagcatcaag gttgtcgaac caggatatcc cgtagcagct gtagatagag ggaagtccgt 2940 actagtggca aaagatactg tttttgtagt tgcagaccat gacttcacaa aagtgaaact 3000 ggttccaagt gtcacacttg tgtgtgatat accggaaagt gcatgtgagt ctttctatag 3060 gggtaaggtc tttgtgtgca tgaaggatgc cacatttcaa ccatcatctc ctatcagaca 3120 tcaaacagaa ctctatcaca tccttaccaa agcaggaaag cttggcttgc caatactatg 3180 tcaatacagc gacggtggtc cagaccacag gctgacctac atcagcacac aggtgtccct 3240 gatatgtacc ggtatgttca tcattctgga cttggatatg cttgtatgtg ctcgtactgc 3300 accgcaaaac agtttcagaa acccagtgga gagagtgatg agcattatca atctgtcgtt 3360 gcaagctatt ggtgtaatga gggagaagat gcctcctgca attgagaagc tttttgctgg 3420 acttggcact atgaaggatg tcagagctat ggctgataaa cacccagagc tgaagcaagc 3480 gcttgtagca agcacagaga agacacgtgg aatgatgaat gaactgttcc agcgcctaac 3540 tttgaaaggt gaagcagtct ctaccatcaa gcccgccaca gagcaagaag tagaagagat 3600 gtgggaccta atcttgatgg ttgaccaagc tctaacaaag ggtgacacaa ccaaggtcaa 3660 ggtgcaaaac aagaaagacc tcatggcatt catgcaccat tgctgtgtgc cgaggcatta 3720 cagtttcagc atcaagaaat gtggagacag tgactgcact atctgcaaaa agccaaagct 3780 cccaccagag gtattcaacc gactgaaaac cttccctgat cctgtgctaa gtggcaccgg 3840 agagcactac aagccgttca tagaagtgta tggcacagca acatctgagg gatacagacc 3900 cagtctaaag gagtcacgtg ccaaggaaca cagtatgccg ttctcacctt gtgcggaaaa 3960 caccagaggt gtagtgacct gcttagattg tggaaaacct cgaacaatcc atagtcagag 4020 agcgctcacc acagcacaga agaaggactt agacagtcta aaggaggact ccatgtacac 4080 ctgtggagta tcatggatac cagacgatca tcctctgaga gagagctgtt atgtgagccg 4140 gtgagttata gcgtacgttg agtttatgtt ggttgacaat gttgttattc tgaggctaca 4200 attgtagctc atcacaagta cttaaatagg acctaaaact agactttgat ttgaaagata 4260 caggaagaag accatcttga tatattaagc ctttttgagt gggtaagaca actgttattt 4320 ttacttttat tgtttcatag acaaatgtct ttttttcaat caagtttgat actgtattaa 4380 tgtctcattt cacacctgtt ttgtgcaggg ccctatcatg tgccgttcca gtagagccct 4440 actactattc tgcgagactc aaggctccag ctgcattgag gtcaatcctc ccactggtat 4500 gctggcagtg tggggaggaa gagacactcc ccatcccttt ggaaaaggcc aagcaattcc 4560 agacgatcca tccagtgtgt caggttggtt ttaagtcttt atttgcatgt gcatttttac 4620 aatgtacatt atatgtgaag atgatacttt gcctatagat acattcattg tgcttttatc 4680 tgaattagtg tgctttctca acataaaact tgatttatgt agtagtttat taatgaaggt 4740 taaatatcca tgtaccaata catgtgtata gatgatgtat ctttccacaa acatgtctaa 4800 aactggagct aaatacattt tgaatttcag atatttgtag tggcatccat gtattgcaaa 4860 attcaatcca taaatgatgt atcccatctt gcgttttttg taggtgtgca aatctgcagg 4920 agtagaggag aggacacgag gaaagaagaa gttaaaaagg agaagagagg cggaagatga 4980 caactaaggc aggtgtttta tttgatgtac ctttttcaat actgttttca ccagttgaaa 5040 agccttcatc ctacatgctt cattccatct gaagtgctga gcccttccat ttgcatttgt 5100 gtctatcatg tacactgtac tggaaaacta cattttgtaa actataattg aagaaattta 5160 tacattagaa gggttgatac ctaattgttg cttttcatac tgtgtttctg taggcacctg 5220 aaaaatgaga gccatgagag gaaaacaaga tgtcagagga agttagacag gacatgcagc 5280 atgccagtca aacaaaacaa aataaagttc ttgtttggta taccatactc tgtgtgttta 5340 gtcatttgtt ctaccttcct ggccacttag atttcttaat aaagagaact tttttttttt 5400 gccaaacttt ttttttcgct cgctcgcacc aattctgggg ctgccagagg atgtcatcca 5460 tataatcaaa tcaacatggc c 5481 // ID BEL-244_AA-LTR repbase; DNA; INV; 1024 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-244_AA_; KW BEL-244_AA-I; BEL-244_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1024 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 1024 BP; 208 A; 237 C; 298 G; 281 T; 0 other; tgtacgcgcc ctacgcgtgc agaataagct atgcaagcaa tccatccaaa ccagcccatc 60 ttataccagc gaataacgca acgagacggt ttccaccaac ttgtgcatcg aatgggcgga 120 gcttgggctt caacttggtt ggctgttaca gctgtagaga aaaatatata agcacattca 180 ttgtttcaat atgcattcag tcttaataaa accattcccg cgacaacatc aaaaagtaaa 240 agtcttttct gaacgagagc cgaagagtgt ggtgacagtg agccagaccg gtcaacgagg 300 cgatcggttg gcattttttt ccctttcctt tttctctgct tccacgagat cgtgagccga 360 gacggcgacg agtgtgcctg tgtctctctc cctttcgggt cagcgttggc gagtcatcgc 420 gaagagtgtg tgtgtgcgcc gctcatccga gccggcgttg tcgagcaaga cagtgaacgt 480 gttgcgtctt gttcggcgtt cctctttcct tcgttcgtca aacgattggt gtgcggcggt 540 gctgttccag tgaagcgaag ctgctgtggt ggtgctgttc ggtcgaaatt gctaatgcga 600 cgattcacag gagtgtgccg ggagttgcta agataatatt atagagtcgt tttcacgatt 660 aggctacact cctggtgatt tatcgttagc gagcaagcga tcacgacccg aaattggaag 720 gtgtgaggca caaagctgga agtgtgaacg tgttttcggc atccttgccg tggtccttcg 780 tcggagtgtt cctgtgctgg tgaccgcaac gtgtgcggtt gtcggcattc cgttggtggc 840 tgtggcgtgc acggctgctg gtggtgtgtt tcctgcgctg gtgaccgtga cgtgcacggt 900 tgtcggcaat ctcgtctgcg tcggactgct tgtggctgtg atgtgcacgg ctattggtgg 960 ctcgatcacg aggtaagtgt gacgatttcc ctcctgagtg tattgagagt ttgagtaccc 1020 aaca 1024 // ID Gypsy-615_AA-I repbase; DNA; INV; 4384 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-615_AA_; KW Gypsy-615_AA-LTR; Ty3_gypsy_Ele18; Gypsy-615_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4384 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3311-3790] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 101..4327 FT /product="Gypsy-615_AA-I_1p" FT /translation="MPVTRSQAAKKGHEAQIEEEEPTFFDVKTMADDDKSV FT SGVKREENVCVQPVSQTHTFAPHPHDLGRLIPEFSEGDGVTKWLRRIDHFR FT ILYGWTEQMCLLYGSTRLNGAAASWYRRYEEKIFSWAQFKQHLTVAFPETF FT DEADIHRQLEEMKKEKDESYESFVYRVDALAQQGDFSTSATLKYIIKGLRN FT DQVYPSLLARQYTSTLDLLHHIKWISSNLCMISSRANAVPMRISKVPTMTG FT SGASSSGSETICFNCREAGHRSIDCPKPQRRERCTKCMKVGHTAEKCMVMT FT NSTKHQGANQIASGTSGRNVVNALFGGDEDTLELNESSLVEVDVDAAGERF FT RTMALVDTGSYVNLIKRKLLSSSITLGGNVQEVVGINDAKVTVLGEFSTSL FT RIRNRDMTARCLVVLDDTMQVDMILGRSFLRENRIHYLHISEGGAYEAQKM FT GSKQDWSMFESFDESYLIDSQSNLKLDVGDSNETRCQSGRLEKILQDSYFN FT RPRPDKPLIRYQADIRLKDDKIFTATPQRLSVWEKGELDKMVADLLQQGII FT RESESAYTSRVVLTRKKNNSLRMCVNYRPLNRLVERNHFPMPIIEDQIMRL FT QGKRYFTSLDLKNGFYHVELTEDSKKYTSFVTDSGQYEFNRLPFGYANSPA FT VFVKFITKVLDPFIRDGKVVVFIDDMLIASENIDEHFQTLQDVLETLSDNH FT LELQLSKCQFIKTRVEHLGYDVQFNRIQPSDKHIQSIRDFPVPTSRKSLQR FT FLGLVNYFRKFIKGFNVQAGMLYELLKEDRDFLMTAEHVQAAENLKDALIS FT KPVLRIYSPTAETELHTDACSSGFGGVLLQRQSDDGLMHPVMYHSRKTSPT FT ESKLHSFELETLAIVYCLQRFRTYLFGLRFKIVTDCNSLKHTLEKRDINSK FT IARWSMFLEQFDFEIVHRSGARMQHADALSRVNVYMVDEDERLNESSLFEN FT ALYVAQLRDPEIAKLKDAVLSGSTKDYEIREAILYKLVGRNSLLCVPGQMI FT QSVINKFHNEMGHFGVDKVCGLIRRTYWFPRMREQVQDHIKSCVTCIAYNP FT RNKRYDGDLQIVDKPDKPFEVLHIDHLGPLEKSKGKNEYILAVVDACTKFV FT KLYPTKTTKTTEVMKSLRNYFHAYSAPKAIISDRGTAFTSHAFENFTQSHG FT IKHIKIASACPKANGQVERYNKTMMPLISKLVEETGRNWDTILTDAEFLLN FT NTVNRSTGTTASILLFGIEQRRRVDHDLTHYLNEINGAEDRDLDELRETSK FT KNTKRQQQYNKKVYDKHCSRNTYYKSGDLVMLRRVGVAGERNKLKPKYRGP FT YEVKKVLDNNRYVVGDLDEFQVSGMRFEGIYDPMNMRLYQRATKDELREHK FT SNGVEGANGENENEEVEYQDIEYLEDDQESEIEFEYVEYLEEDCDF" XX SQ Sequence 4384 BP; 1345 A; 845 C; 1097 G; 1097 T; 0 other; gttatcagaa gtgggatagc accggcagtc aaaagttaat ttcggtgacg gtgactccaa 60 cgttccagga gcacgacgat attccggaat tttcaacgcg atgccggtga ccagaagcca 120 ggcggccaag aaggggcacg aagcgcaaat tgaagaagaa gaaccgacct ttttcgacgt 180 gaaaacaatg gcggacgacg ataaatcggt gagcggagtt aagcgagagg agaatgtgtg 240 tgtgcaacca gtatctcaaa cccatacttt tgcaccgcat ccacacgatc ttggaaggct 300 gattccagag ttttcggaag gagacggcgt taccaaatgg ctgcggcgaa ttgaccattt 360 tcgcatcttg tacggttgga cggagcagat gtgtctacta tatggatcca ctcggctcaa 420 tggggcagcc gccagttggt accgacggta cgaagaaaaa atattcagtt gggcgcagtt 480 taagcagcat ttgacggtgg cgtttccgga aactttcgat gaggcggaca ttcatcgaca 540 actcgaagag atgaagaagg agaaggacga atcgtatgaa tcattcgtct atcgcgttga 600 tgctctagcg caacaaggag attttagtac ctcggctacc ctcaagtata ttatcaaggg 660 actacggaac gatcaagtgt acccgagcct gctagcaagg caatacactt cgacactgga 720 cttgctgcat cacatcaaat ggatatcatc gaatctgtgc atgatttcga gccgtgccaa 780 tgcggttccg atgcgcattt ccaaggtacc gacgatgacg ggatcgggag cgagctcatc 840 cggaagtgaa accatctgct tcaattgtcg tgaggcagga cacagatcaa ttgactgtcc 900 aaaaccacag cgcagggagc gatgcaccaa gtgtatgaag gtgggacaca cagcggaaaa 960 gtgcatggtg atgacgaact caacgaagca tcaaggagca aatcagatcg catctggaac 1020 atcgggacgg aacgtggtaa acgcgctttt tggcggagat gaagatacac tcgagttgaa 1080 tgagtcttcg ttggtggaag tggacgttga tgcagccggc gaacgtttcc gaaccatggc 1140 gctggttgac acaggtagct atgtgaacct tataaaacgt aagcttttgt catcgagtat 1200 tactcttgga ggaaatgttc aggaagttgt agggattaat gatgccaaag taactgtttt 1260 gggagagttc agtacaagct tgcgtataag aaatagagat atgactgcga gatgtttagt 1320 agttcttgat gatacgatgc aggtagacat gattttaggt agaagttttt tgagagaaaa 1380 tcgcatacat tatctgcata tctctgaggg tggtgcctac gaggcgcaga aaatgggcag 1440 taaacaagat tggtcaatgt tcgaaagctt cgatgaatcg tacctaatag acagtcaaag 1500 taatttgaaa ctggatgtgg gagacagcaa tgagactcgg tgtcagtctg ggaggctgga 1560 gaagattttg caagatagtt atttcaacag acctcgtcct gataaacctt tgattcgata 1620 ccaggcagac attcggttga aggatgataa gatatttacc gctactccac aacggctcag 1680 tgtgtgggag aagggtgaac tggataagat ggttgccgat ttgttacaac aagggataat 1740 tcgcgaaagt gaatcagctt atacttcccg cgttgttctg accaggaaga aaaataattc 1800 tttgagaatg tgcgtgaatt atcggccact caatcgactc gtggagcgga atcattttcc 1860 gatgccgatt atcgaggacc aaataatgcg cttacaagga aagcgatact ttacatcttt 1920 agatttaaag aacgggttct accacgttga attgacagaa gatagtaaaa aatacacttc 1980 gttcgtgact gatagtggcc agtacgaatt taatcgccta ccgtttggat atgcaaactc 2040 tccagcggtg ttcgtgaagt tcattacaaa agttttggac ccttttatcc gcgatggaaa 2100 agtggttgtg ttcatcgatg atatgctgat agcttcggaa aatattgatg aacatttcca 2160 aaccctacaa gacgttttgg aaacgctttc tgataaccat ttggagttac agctttcgaa 2220 atgtcagttt attaaaacaa gagttgaaca cttaggatat gacgtccagt tcaacagaat 2280 tcaaccaagt gacaaacaca tccagtccat acgtgacttc cctgtcccta ccagtcgcaa 2340 aagccttcag cgttttctag gcctggtgaa ttattttaga aagttcatta aaggcttcaa 2400 tgtccaggcg ggcatgttgt atgaactact caaggaggat agagatttcc ttatgactgc 2460 tgaacatgtt caagccgccg aaaacttgaa agacgcgcta atatcgaagc cagttttacg 2520 aatttattca cccactgcgg aaacagaact tcacaccgat gcatgttcat ctgggtttgg 2580 aggtgtccta ctacagcggc aatcggacga cggtctgatg catccggtga tgtaccacag 2640 tagaaaaact tccccaacgg agtcgaagct gcacagcttt gaattggaga cgcttgctat 2700 tgtgtattgt ctgcaacgtt ttcgaactta tcttttcggg ttgcggttca agatagttac 2760 agattgtaat tcgttgaaac acaccttgga gaaacgcgac attaattcaa aaattgcacg 2820 ctggtcaatg ttcttagaac aatttgactt cgagatagtt caccgatctg gagctaggat 2880 gcaacatgca gacgccctct cgcgtgttaa cgtttatatg gttgacgagg atgagcgact 2940 aaatgaatcg tcactttttg agaacgcact atatgtggct cagctgcgtg atccggagat 3000 agctaaactg aaagatgctg tactgtccgg ttctacgaaa gattatgaaa tccgcgaagc 3060 aatcttgtac aagttggtag ggcggaattc attgttgtgt gtgccgggtc aaatgatcca 3120 gtctgtgatt aataagttcc ataatgagat gggacatttc ggagtcgaca aagtgtgtgg 3180 attgataagg aggacgtact ggtttccgag aatgcgagaa caggtgcaag atcatattaa 3240 gtcgtgtgtt acgtgtatag cttacaatcc acggaacaaa cgatatgacg gggacctcca 3300 aatcgtcgac aaaccagaca aaccatttga agttctccac atagaccatc taggcccgct 3360 tgaaaaaagc aaaggcaaaa atgagtatat actggcagtc gtagatgcat gtacgaaatt 3420 tgttaaactg tatccgacta aaaccacgaa gacaacggaa gtaatgaaaa gcttacgtaa 3480 ctatttccac gcttattcgg ctccaaaggc aattatttcc gatcgtggaa ctgcatttac 3540 ttctcacgct tttgagaact ttactcagag ccacggtata aaacacatta agatcgcctc 3600 tgcgtgtcca aaagccaatg gacaggtgga acggtataat aaaactatga tgcctctgat 3660 aagtaagtta gtagaagaaa caggacgaaa ttgggacaca atcctaacag atgcagagtt 3720 tcttctcaac aacactgtaa accgttcaac tggaactaca gcttctattt tactctttgg 3780 gatagagcag cgacgccgtg tagatcacga tctgacgcat tatttgaatg aaatcaatgg 3840 agccgaagat cgagatctgg atgaactacg tgaaacttcc aagaagaata ctaaacggca 3900 gcagcagtat aacaagaaag tatacgataa acactgctct agaaatacgt attacaagag 3960 tggcgatttg gtgatgctca gaagagtcgg agtagcaggt gaaagaaaca agcttaagcc 4020 gaaatatcgg ggaccttatg aggtgaaaaa agttttggac aacaaccggt acgtagtggg 4080 tgacctggat gagttccaag tgtcaggtat gcgatttgag ggaatatatg atccaatgaa 4140 catgcgtctc taccagagag ccacgaagga cgaattaagg gaacataaaa gcaacggagt 4200 agaaggtgct aacggggaga atgaaaatga agaagtcgag tatcaggata tagagtattt 4260 agaagatgat caggaatcag aaatagaatt cgaatacgtt gaatatttgg aagaagattg 4320 tgacttttga tttgattttc cagatgagta attgctacca aattaaagtg caggatggcc 4380 gagc 4384 // ID CR1-85_HM repbase; DNA; INV; 3001 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-85_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3001 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 372-372 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(182..1516,1407..2879) FT /product="CR1-85_HM_1p" FT /translation="MEKILKFFDMTETVKAVGGSLILVNNNIHSVQYNLEI FT PEELQNTEITCVDIFVSSIDQKIRLXCVYHPPCAATSILRTKALCKLIYKI FT TSKCNSTCIIAGDFNLPLINWSIPISFGDSSHDLFVETCSANALDQLVSDN FT TRHNNLLDLILTNNSKNVFNVEAIAPFTSTCDHCMIVFDCNLEKSKSIKKS FT IKYNFYKGDYNAIQSDLQNINWYMLAKQNQDVEVFWQIICKYLNSSISAHV FT PVHCNRKNNFQPLAIWKLASKKRILYREYKRTGNTSIHIKYKXCSXLYDAE FT INKYCIFREKDLVDKANIAQFYSFVKAKTKCCSSVAPLLXPDGSLSTDDCE FT KSNLLNNFFSSVFVTDNGIAPVLDLPTHSNSLNNVMFPYSSVVXSLQNLPP FT KSSKSPDGYPAILLRSIANAIAIPFSILFELSMSKKCHTFNLENRYSVPHL FT DQSQMPLLFLFPYFLNCQCQKNAIPLIWKTAIVCPIYKKGNPSLTTNYRPI FT SMTCIVCKVMESIIAECITNYLLINNLITAHQFGFLKRRSTCTQMLTTLNE FT WTTAANNKKIVDIVYIDFEKAFDSVTHSKLLFKLHCYGIQYELLNWITNFL FT TNRSQCVCVGNSYSKKILVKSGIPQGTVLGPILFLVFINDVTSCIDGDCSI FT KLFADDSKLYQASNDGNNINQYLQLINTLKNFSEWSKVWQLNIATKKCFHL FT RYGNSSLPLNSYSMNIGTPIESINSIQDIGIVVSSDMKFSNHCSFIASKAH FT SXSYLLLKSFISNDFKLLIKAYKVYVQPIIESCTSVWNPHLLKDIRRLEKV FT QKSFTRSVCKRCHIPYKDYSSRLNIFNLESLECRRIKFDLVMTYKIIHNLV FT DLPFSEFFTFITFAPSYYPTRCHAQKLYSRIKCKNDTRRFFYCERIIKIWN FT SLPDYIVYSSTLSLFKSNLKKYNVHHHCTLF*" XX SQ Sequence 3001 BP; 1027 A; 545 C; 398 G; 1022 T; 9 other; tgtttaccaa ccatcttaca aaaccaacct tttaaaacca tcaaatgtgc ttacttcaat 60 gctagatcaa taaacaataa actacatcac ctacacaatc ttcttgaaaa caacaatttt 120 gacatcatac tcataaccga aacttggcta aataaaaaaa cacctgattc attattgtta 180 aatggaaaaa attttaaagt ttttcgacat gaccgaaaca gtaaaggcgg taggtggttc 240 acttatacta gttaacaata atattcattc agttcaatat aatcttgaaa taccagaaga 300 actacaaaat actgaaatta catgtgttga catttttgtc tcatccattg accaaaaaat 360 tcgactttya tgtgtttatc accctccatg tgctgccacc agtattttaa gaactaaagc 420 tttgtgcaag ttaatttata aaattacttc caagtgcaac tctacatgta tcattgcagg 480 tgactttaac ctaccactca taaactggtc aattccaatt agtttcggag attcgtccca 540 tgacctattt gtagaaacgt gttcagcaaa tgctttagac cagcttgtat ctgataatac 600 tcgacataat aacctcctag atctaatact aaccaataac tcaaaaaatg tttttaatgt 660 agaagctatt gccccattta caagcacttg cgaccactgt atgatcgttt tcgattgtaa 720 tttggagaaa tctaaatcaa ttaaaaaatc aattaaatac aatttttaca aaggtgatta 780 caacgccata caatcagatt tacaaaatat aaactggtat atgcttgcca agcaaaatca 840 agatgttgaa gtattctggc aaattatttg taaatattta aactcatcta tttccgcaca 900 tgttccagta cattgcaata gaaaaaataa cttccaacct ctagccattt ggaaattggc 960 ctctaaaaaa cgcattctct atagagaata caaaagaact ggtaatacat ccattcatat 1020 caaatataag rtttgttcty gtttatacga tgcagaaata aataaatatt gtattttcag 1080 agagaaagat ttagtcgata aagctaatat tgcccagttt tattcttttg taaaagcaaa 1140 aaccaaatgc tgtagttctg ttgcaccact attacracca gatggttctc ttagcacaga 1200 tgattgtgaa aaatcaaatc ttttaaacaa ttttttcagt tctgtttttg ttacagataa 1260 tggtattgca cctgtcttag atttgccaac acactcaaac tctttaaata atgtaatgtt 1320 tccatacagc tctgtagttt yatccctaca aaaycttcca ccaaagtcct ccaaatcacc 1380 tgatggatac ccagcaatac ttctaagatc aatcgcaaat gccattgcta ttcctttttc 1440 catacttttt gaattgtcaa tgtcaaaaaa atgccatacc tttaatctgg aaaaccgcta 1500 tagtgtgccc catctataag aaaggaaacc cgtcattaac cacaaactat agacctattt 1560 ctatgacgtg cattgtttgt aaggttatgg agtcaataat tgcagaatgc attacaaact 1620 atttactaat taataacctt attactgctc aycaatttgg ctttttaaag cgtaggtcca 1680 cctgtacaca gatgttgaca accttaaatg agtggaccac tgctgctaat aacaaaaaga 1740 tagttgatat tgtatatata gactttgaga aagcctttga ttctgtgact cactctaagt 1800 tactgttcaa actgcattgt tatggtatcc agtatgaatt gctaaactgg attaccaatt 1860 ttttaacaaa tcgttctcaa tgtgtttgcg ttggtaactc gtattcaaaa aaaattcttg 1920 taaagagtgg aatacctcaa gggactgttt taggacccat cttattttta gtttttataa 1980 atgatgtaac ttcatgtatt gatggtgatt gcagcattaa gttgtttgct gatgacagca 2040 agttatatca agcaagcaat gatggaaaca atataaacca atacttacaa ctcatcaata 2100 ccttaaaaaa tttttcagaa tggtcaaaag tttggcaact taatatagca accaaaaaat 2160 gttttcatct acgctatgga aacagttctt taccattaaa ttcatactca atgaacattg 2220 gtacacctat tgaatctatt aattcaattc aagatatcgg tatagttgta tcttcggata 2280 tgaaattttc taatcactgt tcttttattg caagcaaggc acattctygt tcctacttac 2340 ttttaaaatc cttcatcagt aatgatttca agcttctgat caaagcctat aaagtttatg 2400 ttcaacctat catagaatct tgtacctctg tgtggaatcc ccatctttta aaagatatac 2460 gccggttaga aaaggttcag aaatccttta ctagaagtgt ctgtaaaaga tgtcatattc 2520 cttataaaga ttattcttca agactcaata tcttcaatct agaatccctt gaatgccgta 2580 gaataaaatt tgatcttgta atgacctata aaattataca caaccttgtc gatctacctt 2640 tttctgagtt ttttacattt attacatttg ctccttctta ctatcctaca agatgtcatg 2700 cacaaaaact ttattcacgt attaaatgta aaaatgatac acgaagattt ttctactgtg 2760 aaagaattat caagatatgg aattctctac ctgattatat tgtttactcg tccactttgt 2820 ctttgttcaa atcaaattta aaaaagtaca atgttcatca ccattgcaca ctattttaac 2880 tagttttttt tgttgtttct ttttttgtcg gtgttacata agagtaattt tgatattacc 2940 tgtatcatct ttttgtattt cttatytata aatgcaaata aatcagaaaa taaaatgaaa 3000 t 3001 // ID Sola2-2_HM repbase; DNA; INV; 5293 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola2 DNA transposons from Hydra magnipapillata. XX KW Sola; DNA transposon; Transposable Element; Sola3; Sola2-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5293 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(982..1230,1247..1525,1540..3882) FT /product="Sola2-2_HM_1p" FT /translation="MLLYVLLYKMCLIFFLMHEVLMHKITIEHTQLKAVTW FT QTSYWVYFLSLISYIKFIILYLFIVSKLMETLVYTSKIENKYIFFRMASSS FT LDLTQCCIGKKLNENCHEGKFTKIKKIKFFEELTKKDQELVHLRCKVNPIK FT TICNHHEKIFLDRFESSYVGSFCCDPFNKHKIKIKSILNDYIKDKVFFIYL FT FSVLSYIFLESVRVISAAVAKEFDLIPGYKLCTECRKILHLRKQEHESDSC FT KNQDNDFNGIHISKENFNLSINTFGCSPLKLVAYKDRVSYAKRKINKIHTA FT TKKKIATVLDVSPDLINDEPKTKELCCKSKQDLDSLMELIKAKFDISSKPE FT KLKLLTLVPESWNINYTENYFSASQRMIKTARHLKQMHGIMAEPSKKEGRS FT ISEKVKNLVHEFYQSDEYSRMCPGKKQYVSIQVDGKRQHFQKRLLLLNIKE FT LYLEFVKISLDVKIGFSKFCELRPKWCIPVGGASGLHSVCVCEYHQNAKLL FT ALKIPDISDYKNLLTLVVCDLTNRDCMLHCCDNCPDNDTLKEYLTTLFDKH FT SIEEINFNQWQKANKKHDIVPVTLPVDDFIEKACQQIDQLRDHHYIAKNQA FT AYLQHLKITLASNHILILLDFAENYSFLIQDAVQGFHWNNSQATLHPFVIY FT FMEENKIKCKSICIISDNLQHNTNSVHCFIHEVLKHLKNLIPYFNHCIYFS FT DGASSQYKNYKNLSNLCHHKQDHDVSAEWHFFATSHGKSACDGVGGTVKRL FT VARASLQSLVDPINSSKKMYDWCRQNIDGIHFQHISNDAVTLHCMKFKLEE FT RYATCSTIPGTRNHHCFIPQSLTTLQMRRVSFDNVFTNVNISTMQQNYTIP FT LASLSPGIYIACVYDGKWFLGNIVEISEENQDILIKFMKQSISSNTFTWPH FT REDICWVPIMHVLCKISSLKVQSATGRCYELSAQEKTEIIHLYNRMFNNEI FT I*" XX SQ Sequence 5293 BP; 1976 A; 764 C; 786 G; 1767 T; 0 other; gggtcgttcc gtgccaaatg gtctaaaatt ttaggattcc aacatggacc ccctcagatt 60 ttgatgaaac ttagtgggtt ggtttatatc aatgagaaaa gcaattcctg aaaatttcag 120 cttcaaatct tatgtggttc atgagatatg gccatttgaa atttctgatt tttccaaaaa 180 taaaaacttc agtagcaaaa acatgttttg ttcatacttt gaagctctgt attttaaaaa 240 ctaaatttca gaaaaccttc tagttttttt tatcatatac aatgctaggc ttactttgtt 300 aaaactttga cattaaattt ttaaatgtca cattttatcc ataatggctg cctaaaaaac 360 cgaaaaaatt gtttgcagat atagaaagtt gatttttgat gggtcaatat acaaaactag 420 caatttaaga agtaaaaata catatctttt tagtttctat atatggtagg cttcctaatt 480 ttcagaaatt tgctgatatg atacaattca gtaagaaatt atggacccct aaaaatggct 540 ccggctagaa cttacaaaat tgtgccttcc cttaaatcca ctggtaacct atgatcaaca 600 ctgattttca tgatatactt tttttctact catttagtag actttattaa agtgatttca 660 gaaaaaaaaa tagtgggtac tcatggggaa gttacaaagt cagaacaatg aagttggcac 720 aaaatgcatt aaaaacgcaa aaaatagaga tattgatgta ctttagtatt cattatttac 780 tgcaatgcaa aaaaatgtat atatatatga aagtatatca aaatcattgc tacaaattag 840 ctttaaatta ggactgaatt aaactcaggt tttaaataac cttagttgcc cccccccccc 900 ctgtcttttt attttttaac atatatatat ctggaagaaa atacatatgt attttctccc 960 agaacttatc ttaaaagaaa tatgttattg tatgttttac tttacaaaat gtgtctgatt 1020 ttttttctaa tgcatgaagt gttaatgcac aaaataacaa ttgaacatac acagttaaaa 1080 gctgtgacat ggcaaacaag ttattgggtg tactttttat ctttaatttc gtatataaaa 1140 tttatcattc tatatttatt tattgtgtca aaacttatgg aaactttagt ttacacttca 1200 aaaattgaaa ataaatatat tttttttaga taataattaa aaataaatgg caagttcatc 1260 tttagatcta actcaatgtt gcattggaaa aaaattaaat gaaaattgtc atgaaggaaa 1320 atttaccaaa atcaagaaaa ttaaattttt tgaggaatta acaaaaaaag atcaagagct 1380 tgtgcactta agatgtaaag taaatcctat taaaactatt tgtaatcacc atgaaaagat 1440 cttcttggat agatttgagt catcgtatgt tggaagcttt tgttgcgatc cttttaacaa 1500 acacaaaatt aaaattaaaa gtatttaaaa atagattaac ttaatgatta tattaaagat 1560 aaagtcttct ttatttattt atttagtgtt ttatcatata tttttttaga atctgtcaga 1620 gtaatcagtg ctgctgttgc aaaggaattt gatttaatac ctggttataa gctttgtaca 1680 gaatgtcgta aaatattgca cttaagaaaa caagaacatg aaagcgattc atgtaaaaat 1740 caagataacg attttaacgg gattcatatt tcaaaagaaa actttaattt aagcattaac 1800 acctttggtt gttcacctct aaaactagta gcttataaag acagagtcag ctatgcaaaa 1860 agaaagatta acaagatcca tactgcaacc aaaaagaaaa tagcaactgt tttagatgtt 1920 tctccggatt taataaatga tgaaccaaaa acaaaagaac tttgttgcaa aagtaaacaa 1980 gatttggact cactaatgga attaattaaa gcaaagtttg atatttcttc aaaaccagaa 2040 aaacttaaat tgctaacttt ggttcctgag agttggaaca ttaactatac agaaaactat 2100 ttttctgctt cccaaagaat gataaaaact gcaaggcatt tgaaacaaat gcatggcata 2160 atggcggagc cgtctaaaaa agaaggaaga tctattagtg agaaagtaaa aaatctagtc 2220 catgaatttt accaatcaga tgaatactct agaatgtgtc caggaaaaaa acagtatgtc 2280 tcaatccaag ttgatggtaa aagacaacat tttcaaaaaa gactgctatt acttaatata 2340 aaagaacttt atttagagtt tgttaaaatt tccttagatg tcaagattgg attttctaag 2400 ttttgtgaac ttagaccaaa gtggtgtatt cctgttggag gtgcttcagg tctccactca 2460 gtatgcgttt gtgagtacca ccagaatgcc aaacttttag cattaaaaat accagatatt 2520 tcagactata aaaatctctt gacattagtt gtatgtgact taacaaatcg agactgcatg 2580 ctgcattgtt gtgataactg cccagataat gacacactta aagagtatct tactactttg 2640 tttgataaac atagcattga agaaatcaac tttaatcagt ggcaaaaagc taacaaaaaa 2700 catgatatag ttccggtaac tcttccagtg gacgatttta ttgaaaaagc ttgtcaacaa 2760 atagaccaat taagagacca tcactatata gccaaaaatc aagcagcata tttgcagcat 2820 ctaaaaatca cattagcaag taatcatatt ttaatattat tagactttgc tgagaattac 2880 agttttttaa ttcaagatgc agtacaaggt ttccattgga acaatagcca agcaacattg 2940 catccttttg ttatttactt tatggaagaa aacaagataa aatgtaaaag tatttgcatt 3000 atctcagata atttgcaaca caatacaaac tcagtacatt gtttcattca tgaagtatta 3060 aaacatttaa aaaacttgat accctacttt aaccactgca tttatttcag tgatggtgca 3120 agttcacaat acaaaaatta taaaaaccta tctaacttat gtcaccataa acaagatcat 3180 gacgtatctg cagaatggca tttttttgct acatcacatg gaaagagtgc gtgtgatggt 3240 gttgggggaa ctgtcaaaag acttgtagca agggcaagtt tacaatcact agttgatcct 3300 atcaattcat ctaaaaaaat gtatgattgg tgtagacaaa atattgatgg aatacatttt 3360 caacacattt caaatgatgc agttacatta cattgcatga aatttaaatt agaagaacgt 3420 tatgctactt gttcaacaat tccaggaaca agaaaccacc attgttttat accacaatca 3480 ttaactactt tacaaatgag acgggtttca ttcgacaatg tgtttactaa tgtaaatatt 3540 tcaacaatgc aacagaacta tacaattcca cttgcatctc tttcaccagg aatttatata 3600 gcatgtgttt atgatggaaa atggttttta ggtaatattg tggaaatatc tgaagaaaat 3660 caagatatcc ttataaagtt catgaagcag tctatttcct ctaacacttt tacttggcca 3720 catagagaag atatttgctg ggttccaata atgcacgttc tatgcaaaat ttcatcttta 3780 aaagtgcaat cagcaactgg tagatgctat gaactatcag ctcaagaaaa aactgaaata 3840 atacatttgt acaatagaat gtttaataac gaaattatat aataacaaag caaaattttt 3900 ataagcttga tcaatcgaaa aaatgtatta aagcatactt gaatgattcc atctttatgt 3960 tgatttttta ttaaataatg ttttttaatg cataatacgt attatataat tacataataa 4020 acttgtgtat tgcaatgaca aatcaaatta ctcttatcac ttgatataat ctgtaatcta 4080 aatttaaaca tgacaataat aaaatgggga aaatgatact aaatttaagt cagaatataa 4140 aactgtaaga taataactga tatagaaaca aaaatgtgtt gtgtctaata agcatttaca 4200 aataaataac acagctatta tttgagtatg ttcaatactt ggtgcataaa gatacttcat 4260 gcatcagata taataacaga aacactgcat aagctaaaat gtttaataaa cagtagttta 4320 aaatgttatc gagaaataag ttcttcttaa gataagaatg ttctgggcaa aaatacgcat 4380 ttaaaaaaat aaaaagacag ggtggggggg ggggcaacta aggttattta aaacctgagt 4440 ttaattcagt cctaatttaa agctaatttg tagcaatgat tttgatatac tttcatatat 4500 atatacattt ttttgcattg cagtaaataa tgaatactaa agtacatcaa tatctctatt 4560 ttttgcgttt ttaatgcatt ttgtgccaac ttcattgttc tgactttgta acttccccat 4620 gagtacccac tatttttttt tctgaaatca ctttaataaa gtctactaaa tgagtagaaa 4680 aaaagtatat catgaaaatc agtgttgatc ataggttacc agtggattta agggaaggca 4740 caattttgta agttctagcc ggagccattt ttaggggtcc ataatttctt actgaattgt 4800 atcatatcag caaatttctg aaaattagga agcctaccat atatagaaac taaaaagata 4860 tgtattttta cttcttaaat tgctagtttt gtatattgac ccatcaaaaa tcaactttct 4920 atatctgcaa acaatttttt cggtttttta ggcagccatt atggataaaa tgtgacattt 4980 aaaaatttaa tgtcaaagtt ttaacaaagt aagcctagca ttgtatatga taaaaaaaac 5040 tagaaggttt tctgaaattt agtttttaaa atacagagct tcaaagtatg aacaaaacat 5100 gtttttgcta ctgaagtttt tatttttgga aaaatcagaa atttcaaatg gccatatctc 5160 atgaaccaca taagatttga agctgaaatt ttcaggaatt gcttttctca ttgatataaa 5220 ccaacccact aagtttcatc aaaatctgag ggggtccatg tttgaccctg gttcacttgg 5280 catggaatga ccc 5293 // ID L1-41_AAe repbase; DNA; INV; 4632 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-41_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4632 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1394-1394 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 142..1209 FT /product="L1-41_AAe_1p" FT /translation="MPDYRKNSMIIDFSVVPTRPAISEIHKFVFESLKITM FT ASIKSIQLSTTKSYVYLEMESSALAEQLVAENNRKFSIKIDGGEYAIPMFI FT ADGAVEVKVCDLPPHMPNSIIAKHLAAYGEVMSIKDDVWREYFPGLPNGNR FT TVRMRIQKAIPSYLSIETETAYIRYKNQVPTCRYCSRNLHVGTKCSDVRKA FT LSKSVNERLSLASIVQGVNPILPESTQTANTTSPENTSSVSGGNNSQRLSS FT MDPPLTTDAMTVETEIANLVNADPQSSKAVVQSFDISDSEMENSNRRKQRE FT LRGATNSTERSTASTVGSMDTHDTMGDQQENWREIRSKRVGSPRLAETSKI FT EAKRQSRSRHRYK" FT CDS 1387..4578 FT /product="L1-41_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDLRSYKIATININAITNATKTEALQLFIRTAELDVI FT FLQEVANASLQLYGFQLFFNIDERKRGTAIAVRRGFRCHDVQRSLDARVIS FT VRIGQVTFVNIYAPSGTNQRRNRELFFNSTVAHYIYNSTDAVVVGGDLNSV FT INRKDAIGSNSHSPMFQRLIQAGNLIDSWEIPNGSNVEYSLVRHDSSSRID FT RIYVGKDMRTWLRTSQFVPLSFTDHKAFVIRMVLPDMGVNTGRGIWRFKPQ FT ILDDPAVMQEFSMKWSYWTRARRNYPTWMNWWVSYAKPKLVSFFKWKTSLF FT NKDFRDTMELYYTLLKSAYDDLFGNPEQLQTVNKIKAQMLLHQRKFSANQH FT SQNECFIGGENTTIYHVGQSKKRKSETWLKELEDNGRTIVAHADIKEHVTK FT YFEELYTDAELIGSQQLDFHPIRKIPDNNIWNSKLTDRATVEEVREAIKNS FT ASKKSPGIDGLPKEFYSKTWHIISSEITAVINDAISGTIPKQFMDGVIVLV FT RKKQSRPTVNGFRPISLLNVDYKIVARILKQRMVNILPLVLSSNQKCSNGK FT KNIFGATARIIDKLCELEETRGSSLLVSFDMDHAFDRVDQNFLKRTMTEMQ FT FNPNFTQLIGNIMANSFSKVLINGSLSREIQIRRSVRQGDPLSMFLFVIYL FT QPLLDKITQLFPNAIMNAYADDVSMFVQNERDLIAIMDIFSDFANEAGAIL FT NTAKTVALMIGNVNLTTDSDWLTIENEVKILGIQFSSNFTRTVELNWSSVL FT LKFQRALWMNNVRNLNLIQKVILLNTYICSKMWYMASVLPLPNKFAGKFIS FT RIGSFLWYGQNSARIPFETLTLPKNRGGLNLHSPAIKSKALLVNRMISLAE FT HLPFFNRFYALNANPPFVNGVPRKFVHIRITIQESAFLPEAMMESLSSASL FT HECFVQRLKDAKMIRNHPQRDWKSILQNLHSKIINSDRRSIWYRIMHGKII FT TNSLLFNQQQRPDPYCEQCNQEIDDIEHRIFKCISIREIWHYQRQLLIREN FT RRFRDFQLNDWLYPSLKQFSRHQKLSCIKIIATFLSFVIQTPKEEHSIANF FT KFYKMYN" XX SQ Sequence 4632 BP; 1484 A; 1012 C; 936 G; 1200 T; 0 other; cagttgacac tcagcattcg tgcgcgatag ttgtgtaaaa tattcgtctg ctgagagaaa 60 cgtttttcac cattactagc ggacttggtc cggccggttt ttttttctaa tcccggaaat 120 atcaaagcta gacaagacac gatgccagat tatcggaaaa attcgatgat catcgatttc 180 agtgtggttc caactagacc agctatttct gaaattcaca agttcgtctt cgagtccctc 240 aaaataacaa tggcctccat caaaagcatc caactgagca caacgaagtc gtatgtgtat 300 ctcgaaatgg agtcatcggc tctggcagag caactagtcg ccgagaacaa ccgaaaattt 360 tccatcaaaa tcgatggtgg cgagtacgcg atccctatgt ttatcgcaga tggtgcagtg 420 gaagtcaaag tttgcgacct tccacctcac atgccgaact ccatcattgc taaacatcta 480 gctgcttacg gtgaagtcat gtccatcaaa gacgatgtgt ggcgcgaata ttttccaggc 540 ctgccaaatg gaaatcgcac cgtccgaatg cgaatacaga aagcaatacc atcatatctg 600 tccatcgaaa ccgagacggc gtatatccgc tacaagaacc aggtacccac ctgcaggtat 660 tgttctcgca accttcacgt tggcacgaaa tgctcggacg taaggaaggc tctctccaag 720 agcgtcaatg agagattgtc tctggccagt attgttcagg gagtgaatcc aatattgccc 780 gaatccacac aaacagcaaa cactacatca cctgaaaaca catcatctgt gtccggaggc 840 aataattctc aacgtttgtc ttcgatggat cctccgttga ctacagatgc aatgaccgtt 900 gaaactgaaa tcgcaaatct tgtaaatgct gatccacaat cgtcgaaagc agttgtgcag 960 tccttcgaca tctcggacag cgagatggaa aacagcaatc gacgaaagca gcgtgagctg 1020 cgcggtgcca caaattcaac tgagagaagt actgccagta ctgttggtag tatggacact 1080 cacgacacta tgggtgacca acaggaaaat tggcgagaaa taagatcgaa acgagtgggt 1140 tctccgagat tagcagaaac atcgaaaatt gaagccaaac gacaatctcg ttccagacac 1200 cgatacaagt aaagcttaac gttttcatag gtaagagcat gtatatgttc agccgaagca 1260 ctcacagatt ctaccaatct aaggcgttat ttctccattc ttctgtatgc caccgagtac 1320 tgatgccatc ccgttcttcg tgctgtaaac cgcgtcaatc aagctttcca ttcatccgtt 1380 tcttctatgg atttaaggag ttataagata gcaaccatca atataaatgc tataactaat 1440 gctacgaaga ccgaagccct acaactattc atccgtactg cagagcttga cgtgatattt 1500 cttcaggaag ttgccaatgc gagtctacaa ctctacggtt tccagctttt cttcaatatc 1560 gatgagagga aaagaggtac agcaatagca gtcagaagag ggtttcgatg ccacgatgtt 1620 caacgaagcc tagatgcacg agtaatttca gttagaattg gccaagtaac gttcgtcaac 1680 atatacgctc cctctggtac aaaccaacgt agaaaccgtg agttgttctt caacagtacc 1740 gttgcgcact atatctacaa ttcgacggac gcagttgtag ttggaggcga tctaaattcc 1800 gtgattaata gaaaggatgc tataggatcc aactcacaca gtccaatgtt ccaacgactg 1860 atacaagccg gtaatctgat cgatagttgg gagataccga atggatcaaa cgtagagtat 1920 tcactcgtaa ggcatgattc atcatcaaga atagatagaa tatacgtagg taaggatatg 1980 cgcacttggc tacgtaccag tcagtttgta cccctttctt ttactgacca caaagctttt 2040 gttatacgca tggtattacc tgatatggga gtaaacacag gtcgtggaat ttggcgtttt 2100 aaaccacaga ttctcgatga cccagcggtt atgcaagaat tttccatgaa atggtcgtat 2160 tggacgagag caagaagaaa ttatcctaca tggatgaatt ggtgggtctc ttacgccaaa 2220 cccaaactcg tctccttctt caaatggaaa acttcgctct tcaacaagga ttttcgtgac 2280 accatggagc tctactatac ccttttgaaa tctgcatatg atgacctatt tggcaatccc 2340 gagcagttac aaactgtaaa caaaatcaag gctcaaatgc tgctccatca acgaaaattc 2400 tcggcgaatc aacactccca aaacgaatgt ttcattggtg gcgaaaacac tacgatatat 2460 catgttggtc aatcgaaaaa gcggaaatct gaaacgtggt tgaaagaact ggaagacaac 2520 ggccggacta tagttgcaca tgcggatatt aaggaacacg tgaccaaata tttcgaagag 2580 ttgtacacgg acgcagaact aatcggtagc caacaactag attttcatcc aatacggaaa 2640 ataccggaca acaacatttg gaacagtaag ctgacggaca gagcaactgt cgaagaagta 2700 cgggaagcca taaaaaatag cgcatcgaag aagtcacctg gcatcgacgg actccccaaa 2760 gagttttatt cgaaaacctg gcacatcatc tcctcagaaa tcacggcagt gataaatgat 2820 gcgataagtg gaacgatacc aaagcaattc atggacggtg taatagttct cgttagaaaa 2880 aaacaaagcc gaccaaccgt gaatggattt cgaccaattt cgctgttgaa tgtcgattat 2940 aaaattgtag ctagaattct caagcaacga atggtaaata tccttcctct cgtgctatcc 3000 agcaatcaaa aatgctcgaa cggcaagaaa aacatcttcg gtgcaacggc tagaataatc 3060 gacaaactct gtgaactaga agaaacccgt ggtagttcat tattggtttc tttcgatatg 3120 gaccacgcct ttgacagagt ggatcaaaac tttttgaagc gaaccatgac agagatgcaa 3180 tttaatccaa atttcacaca gcttattggc aacattatgg ccaatagttt ctcaaaggta 3240 cttatcaacg gttccttatc ccgggaaata cagattcggc gttcagttag acaaggtgat 3300 cctctgtcaa tgtttttatt tgtgatctac ctccagccgc ttctggacaa aatcacccaa 3360 ttatttccca acgcaatcat gaatgcatac gccgacgatg tatcaatgtt tgtgcaaaat 3420 gagagagatt tgattgctat catggatatt ttttctgact ttgcaaatga agctggagcg 3480 attttgaaca ctgcgaaaac cgttgcactt atgataggca atgtcaacct tacaacagac 3540 agtgattggt tgacgattga aaacgaggtt aagatcctag gaatacagtt ttcgagcaac 3600 tttactagga cagttgagct aaattggagc tccgtactac tcaagtttca acgagcactg 3660 tggatgaata acgtgcgaaa tttgaatctg attcaaaaag tcatactgct aaacacatac 3720 atctgctcca aaatgtggta catggcttca gttcttccac tgccaaacaa atttgctggg 3780 aaattcatat caagaatcgg atcttttcta tggtatggtc aaaactcagc aagaattcct 3840 ttcgaaacat taacgcttcc gaaaaacaga ggagggctaa atctgcactc accggcgatt 3900 aaatccaaag cattgcttgt caatcgcatg ataagtctgg cagaacacct tccattcttc 3960 aatcggttct atgcattaaa cgccaatcct cctttcgtga atggggttcc tagaaagttt 4020 gtgcatataa ggatcaccat acaggaatcc gcattcttgc ctgaagcaat gatggagtcc 4080 ctttcctcag catcgttaca tgagtgcttt gtgcagagac tgaaagatgc gaaaatgatc 4140 agaaatcatc ctcaacgaga ttggaaatct attctgcaaa atttgcattc taaaatcatc 4200 aactctgatc gacgttcgat atggtatcgg ataatgcatg gaaaaattat caccaatagt 4260 ttgctcttca accaacaaca gcgaccggac ccatactgtg aacagtgcaa ccaggaaata 4320 gatgatattg agcacaggat cttcaaatgt atatctataa gggaaatatg gcactatcaa 4380 cgacagctac tcattagaga aaacaggaga tttcgagatt ttcaactaaa tgattggctt 4440 tacccatcac taaagcagtt ttcgagacac caaaaactta gttgtattaa aataatagct 4500 actttccttt cttttgtaat tcaaacgcct aaagaggaac acagtattgc aaactttaaa 4560 ttttataaaa tgtacaacta atttagaatt ttttgacaat aaaacaagga catcgtaacc 4620 taaaaaaaaa aa 4632 // ID LIN9_SM repbase; DNA; INV; 5304 BP. XX AC . XX DT 06-MAR-2008 (Rel. 13.03, Created) DT 10-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN9_SM. XX NM LIN9_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5304 RA Tempel S., Bao W. and Jurka J.; RT "Non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 8(3), 343-343 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(228..1547,1551..4796) FT /product="LIN9_SM_1p" FT /translation="MMDSRQLNTPKIRKYQNPKMTNDIMKSYNYAVLSDVT FT PQETTQTTTHLNVDIDNETTQPKQPLTKSGKPKSKPIAVSYKFKDATFIWD FT TTPQTNPPRDCTKLIDKTRPRKTIFKKSAFQSYLKKELSNETFVEVKTFLM FT ATHKYRFKDENSRLLAYRIINRYVMETANEFKETEFDMARFAKFFTIPENW FT LKHLKPYSTATETSPADRIKVQKLVDLTCRYPFKTQEEQTSVANFLHFFTQ FT RSIIGISRDYKFQKFIPFMARKNTRPETTSTMVTTSPTEQNRLPMVIITPL FT EEPKSEHRRPEKRGASNDTIVLSDEEFPLLKRRTLPTRKSKNPTGAGNVPT FT ETECTDEVKFILNNEYQIECKECGKVWENVRNGLNHLRQKHDFPNRTDVMV FT SCVRCEVPIKGAECVNHIKNHKKDDKEESEAGSLVANTQDIPNESSLSQAA FT IEVYLRNILKMKENQERNIQYLEPSTANFLINRNLRAFYQNVKIEKLIGWE FT QVIWLIHWNKCHWIVYLANCDSKTSVILDSDNQMTLQQRCNIKAKFDKFLE FT GTFEEKTVLGTLERKVPQQPNNFDCGIYVIQYISDFLKDPQRIDYHTPDSK FT RIRKEIGELILEEMKNPASKIKNPNKEIQSLLQKFRLLQINVNDVFHWFAA FT EYQKSLPKIRTKRDGKLNKLSCSYQIQRLFGLAPKRAVKEIYFQETSTADL FT ETRVLNEHFKKDESTMKECKIKNGNHYQDWITKAQIDNKEILEALKNSTDS FT APGEDNIPLRQWIIWNNDGVLFDMFNYIKRTHDIPDMWKNYTTTLLIKPGK FT SQESNIPANWRPISILPTSYRIFMKVLNKRVLEWANRGELISKWQKAVDKA FT NGCDEHSYVIQALIEKANRSYYKNEQCHLAFLDLADAFGSIPFQVIWHTLK FT NMGMDEETINLLKEIYKDCSTKYKCGKNESEKIKITKGVRQGCPLSMTLFS FT LCIQYLIQGIAEKKKGATIAGQEVCILAYADDLVIVANTAKDMQMLLTTIE FT NLAKQADLIFKPAKCGYYRDPRDKKSMMKIYGKEISIVDEKNVYTYLGVRI FT GDTKKKDLNVRFEEVKKKTTAIFKSKLRSDQKLEAYNIFCQSKFVYILQGE FT DIAKTKIETYDEEIKKMIKEDILKLQDKSPFTDFVIYSPREKGGLGITKII FT DEQTIQTINRTAKLLNSSHRAIRAIIYEELIQVANLRGEKEINTIEEALKW FT LEGTNKYKKNSNAKTTWITRVREAFQTLEKKHKIKVRFVPKENCIGYKIKC FT DTQEKIVELDNSKELSKSLHWMIKEAYYKEWKALKCQGYIISLKTSEFMEW FT KMPRGLPDPDWRFLTKVKANMLDVNMKQANQGGRLGSTKCRKCEDKESASH FT VINHCASGNWSRVEKHNQVQNELAKELTKRNISFEKDSIPKETKESLRPDL FT VIRLKDKIMIVDIKCPFDEESAIESARNKNIDKYRELAKEIQAKTGLQTTV FT STFVVCSLGTWDKRNNELLRQMGIRYEESKEMRINMIQKAIHGSRKTYDHH FT RNFNNG" XX SQ Sequence 5304 BP; 2180 A; 1039 C; 935 G; 1148 T; 2 other; aaacgacatc atgaacgctt ggccgcaaca atccagttat ccctgcggta acattgtgga 60 actcataaga caagtactaa aagaagaatt agaaaaatta gaagaaaaaa ttgaaaataa 120 tttatttata aaatttaaaa atttaaataa atttaaaaat ttaaatttaa atttaaatga 180 agataaaaat ttatttaatc caataaataa tcaagaaaat caagaaaatg atggattcaa 240 gacaattaaa tactccaaaa ataagaaaat atcagaaccc aaaaatgaca aacgacatca 300 tgaaaagcta caactacgcg gttttgagcg atgtcacgcc tcaagaaacc actcaaacaa 360 caacccactt aaatgtcgat atagacaatg aaaccaccca accaaaacag ccacttacga 420 agtctggcaa accaaaatct aaaccaattg cggtatcata caaatttaaa gatgccacct 480 tcatctggga cactacccca caaacaaatc caccaagaga ttgcaccaaa cttattgata 540 aaacaagacc aagaaagacc atcttcaaaa aatcagcatt tcaaagctac ctcaaaaaag 600 aactgtccaa tgagacattt gtggaagtaa aaaccttcct catggcaact cacaaatatc 660 gttttaaaga cgaaaactca agactcttgg cataccgaat aattaatcgc tatgtcatgg 720 agacagcaaa tgaattcaaa gaaaccgaat ttgacatggc tcgctttgcc aaattcttca 780 caatcccaga gaattggtta aaacatctaa aaccatactc tacagctacc gaaacatcac 840 cggctgatag aataaaagta caaaaattag tggatctcac atgcagatac ccattcaaaa 900 ctcaagaaga gcaaacaagt gtagcaaact tcctacactt cttcacccaa agatcaataa 960 ttggaatctc aagagattat aaattccaaa aatttatacc atttatggca agaaaaaaca 1020 ccaggccgga gacaacctcc actatggtta cgacttctcc aacagaacaa aacagactac 1080 caatggtaat aatcacacca cttgaagaac caaaaagtga acatcgtaga ccagagaaaa 1140 gaggcgcaag caatgacaca attgtgctta gcgacgaaga gttcccacta cttaaaagga 1200 gaactcttcc aaccagaaaa tccaaaaatc ctactggtgc aggaaatgta ccwacagaaa 1260 ccgaatgcac tgatgaagtt aaattcatcc tcaacaatga ataccaaata gaatgtaaag 1320 agtgtggaaa agtgtgggaa aacgtacgaa atggattaaa ccaccttcgt caaaaacacg 1380 atttcccaaa ccgaacagat gttatggtat cttgcgtaag atgtgaagta ccgatcaaag 1440 gagcagaatg tgtaaatcac attaaaaatc acaaaaaaga tgacaaagaa gaaagtgaag 1500 csgggagtct tgtggctaac actcaagaca tcccaaatga aagtagctga ctgtcacaag 1560 ccgcaatcga agtatatctg aggaatattc tgaaaatgaa agaaaaccag gaaaggaata 1620 ttcaatatct tgaacctagt actgcgaatt tcctcataaa taggaacctc agagcatttt 1680 atcaaaacgt caaaatcgaa aagcttatcg gatgggaaca agtcatctgg cttatacatt 1740 ggaacaaatg tcattggatt gtatacctag ctaattgcga ctcaaaaacc tctgttatct 1800 tggactctga caaccaaatg acattacagc aaagatgtaa cataaaagcc aaatttgaca 1860 aattcctaga aggtaccttt gaagaaaaaa cagtgcttgg aaccctagaa agaaaagttc 1920 ctcagcaacc aaacaacttc gattgcggta tatatgtgat acaatacatc agcgactttc 1980 ttaaagaccc acaaagaata gattatcata cacccgactc caaaagaatt agaaaagaaa 2040 taggagaatt aatattagaa gaaatgaaaa accctgcctc aaaaatcaaa aatccaaaca 2100 aagaaataca atctttactc caaaaattca gactactgca aatcaatgtg aatgatgtat 2160 tccattggtt tgcggctgaa taccaaaaat ctctaccgaa gatacgtacc aaaagagatg 2220 gaaaactgaa taaactaagc tgctcctatc aaatccaaag attatttggt ctagctccta 2280 aaagagcagt caaagaaata tatttccaag aaacctctac agcagacttg gaaacaagag 2340 ttctaaatga acatttcaaa aaggatgaat caacgatgaa agaatgtaaa ataaaaaatg 2400 gaaaccatta ccaagactgg ataacaaagg cccaaattga taataaagaa atattggaag 2460 ccctaaaaaa cagtacagat tctgcccccg gagaagataa cattcctctg aggcaatgga 2520 taatctggaa caacgacggt gtcctctttg atatgtttaa ctacatcaaa aggacacacg 2580 atatcccaga tatgtggaaa aactacacca caacactact tataaaaccc ggaaaaagcc 2640 aagaaagcaa catccccgct aattggaggc caatatcgat attgccaaca agctatcgta 2700 tatttatgaa agtcctaaat aaaagagtac tagaatgggc taatagagga gaactgatat 2760 caaaatggca gaaagccgta gacaaagcta atggatgtga tgagcacagc tatgtcatac 2820 aagcgcttat cgaaaaagca aacagaagct actacaaaaa cgagcaatgt cacctcgcct 2880 tcttggattt ggcagatgct tttggaagca tcccattcca agtaatatgg cataccctaa 2940 aaaatatggg tatggatgag gaaaccatca acttgctcaa agaaatctac aaagattgct 3000 ccacaaaata taaatgtgga aagaatgagt cagaaaagat caaaattacg aaaggagtcc 3060 gacagggatg cccattgtcg atgaccctct tcagcctctg tatacaatat cttatacaag 3120 gcatagcaga aaagaaaaaa ggagcaacaa ttgcaggtca agaagtttgc atattggctt 3180 atgcggacga cctagtaatt gttgcaaaca cagcaaaaga catgcaaatg ctgttaacaa 3240 caatcgaaaa tctggcaaaa caagccgatc tcatattcaa accggcaaaa tgtggatatt 3300 acagagaccc aagagataaa aagtccatga tgaagatata tggcaaagaa atcagcatag 3360 tagacgaaaa gaatgtttac acctacctag gtgtaagaat cggtgacaca aagaaaaaag 3420 acctaaatgt cagattcgaa gaggtcaaaa agaaaacgac agcaatcttc aaatcgaaat 3480 tgcgaagtga ccaaaaacta gaggcataca acatcttttg ccaatcaaaa tttgtgtaca 3540 tcctacaagg cgaagatatc gcaaaaacca aaattgaaac ttacgacgaa gaaatcaaga 3600 aaatgataaa agaagatata ttaaaattac aagacaaaag tccgttcaca gacttcgtta 3660 tctactcccc aagagaaaaa ggggggttag gaataacaaa gataatagat gaacaaacaa 3720 ttcaaactat taatagaacg gcaaaactcc taaatagtag ccatagagca atccgggcta 3780 ttatttatga agagctaata caagtagcta acctaagagg agaaaaagaa atcaacacca 3840 ttgaagaagc actaaaatgg ttggaaggta ccaacaaata caaaaagaac tccaacgcca 3900 agaccacctg gataacaagg gttcgggagg cctttcaaac tctagaaaag aaacacaaaa 3960 tcaaggttag atttgtgccc aaagaaaact gcattggata taaaatcaaa tgcgacaccc 4020 aagaaaagat agtggagctt gataactcaa aagagttatc aaaaagctta cactggatga 4080 taaaagaggc atattataaa gaatggaaag ccctaaaatg ccaaggatat attataagcc 4140 taaaaacctc cgaatttatg gagtggaaaa tgcccagagg ccttccggac cctgattgga 4200 gattcctaac aaaagtaaag gcaaatatgt tggacgtaaa catgaaacaa gccaaccagg 4260 gaggaaggtt gggaagcaca aaatgccgaa aatgtgaaga taaagaatcg gcaagccatg 4320 ttataaacca ctgtgcctca ggtaactgga gtagagtgga aaagcacaac caggtgcaaa 4380 atgagctagc aaaagaactg acaaagcgga atatcagctt cgaaaaggac agcatcccaa 4440 aagaaacaaa agagagccta agaccagatt tggttataag actcaaagac aagataatga 4500 tagtggacat caaatgccca tttgatgagg aatctgctat cgagagtgcc agaaacaaga 4560 acatagacaa atatcgagaa ctggccaaag agatccaagc aaaaactggg ttacaaacaa 4620 cagtctcaac tttcgttgtc tgttctttgg gaacctggga taagaggaac aacgagctcc 4680 tacggcagat gggaataaga tatgaagaat ccaaagagat gaggatcaat atgatccaaa 4740 aagccatcca cgggtctaga aaaacctacg accaccacag aaattttaac aatggttaaa 4800 atggcaaaaa gatatttcaa gatgaattgt ggactcatct aaaaaatgac caccttgagt 4860 ccaaatatgc ctagctatca tggttgctga tggaaacagt aaggcacctg atagctaact 4920 tttcactgtg aatatcttca gatattcaca gtgacacgaa aggacaccac tagtaaaaac 4980 cactagtttt ttctgacacc tcttgctaca aactctgtaa aaatcaaaag gatcgatagg 5040 ccgcgctttc acggtctgta ttcgtactga aaatcaagat caaggaagct tttccccttt 5100 tagtcaacac caggtttctg tcctagttga gcttcccttg ggacatctgc gttaccattt 5160 gacagatgta ccgccccagt caaactcccc acctgacact gtcctcaaaa cagttcaatt 5220 gcatccgaag atcgcaattt tttcactaaa ataaattaac aaaagttaat tatactgctt 5280 cattgagtaa gtagaaaaac aatc 5304 // ID Copia-6_DPu-LTR repbase; DNA; INV; 186 BP. XX AC scaffold_71; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_DPu_; KW Copia-6_DPu-LTR; Copia-6_DPu-I. XX NM Copia-6_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 676-676 (2010). XX DR Genome; scaffold_71; Positions 404090 404275. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 186 BP; 44 A; 34 C; 38 G; 70 T; 0 other; tgttggaaat gcgcaaaccg agtgaggaga ggcttcactc ttctcgctca gtagtcgtcc 60 cgctattctc tcatgtcaac cttttgtgag tgtatgctat atatatgtgc atttgctgtt 120 aattatggtc agttgaatat attatttatt gtaaggccaa ttgctcacga ttgttaattt 180 cttaca 186 // ID Gypsy7-NVi_I repbase; DNA; INV; 15196 BP. XX AC AAZX01000892; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7-NVi; KW Gypsy7-NVi_I; Gypsy7-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-15196 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1137-1137 (2007). XX DR Genome; AAZX01000892; Positions 16693 1498. XX CC Positions [6166-6537] - Reverse transcriptase CC Positions [9225-9563] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 5002..6648 FT /product="Gypsy7-NVi_I_1p" FT /translation="MIDNGAIVNLIKASVLDDDMPICTADARELGEITNQT FT VKMYASIYLDIKGTPVKFQMVSDDFPIPFDGMLGRNYLKKEEAVISYYKNA FT LMISGDVMHPIPFIEHEEEHNPRNIKRGYKCKQTNEVLKVESTQSGKSESM FT NQNDENITDQIGGTVKHIIKARTRQVIQINLIKTELKEGYIPRIDVGNKHL FT FQGEGVVTNVNNTCKIMAINTSEEDVIIEVDPKELIPFDTGPDFSEFNGDM FT IVDQIKRLERVKEAIRRDHLNRKELDIVDRIIEDYLDRFLLTGDKLPCTDM FT IEHHIHLENDVPINTKQYRHPPQHKQVVRDSVEKKLRYKIIRESNSPCNSP FT IWIVPKKPDSHGNLRWRMMIDFRELNKKTIRDAYPLPNIADIMDQLGGATY FT FSIFDLASGFQQIPIAPEDCYNTAFTPINGHYEYTRMPEGLKNATATFQKL FT MEKVLRRLQNIEMLVYLDDIIVYSKDLQEHGSRIRNLMQRLREAKLVLQPD FT KIEFFRKEVGFLGHIISARGVEPNPEKVAAIAKLATPKSAKNIRKVLGMFG FT YY" FT CDS 8333..9331 FT /product="Gypsy7-NVi_I_2p" FT /translation="MPFLIPTPPDLEPLNKSRGSHQHHEESDNALTHITAK FT MIRAPHNIITVRECLTHKRDNIVNFLSADCDNTWSVARLLVETEAIDLMKI FT KNQKPKSGQVLVTPFKKHHIFTVIVKDKYFNTIKMPNLHEALINLKQILVE FT KGIKSFRVARKGDILDQLGSPTIIEMLYEHFHRSGIRVTMCYGEAEVPSEK FT HRKEIISHLHDSLTGGHKGINQTYQKIREWYYWPGMRNDVQDYIRRCANCQ FT EQKIERFKTREPMIITDTPIEAFDKVSIDTVGKLKMTPRGNCHLLTMQCNL FT TKYLIAIPIKNLHATTIADALAKYLICQFGAPRNQFLVEYS" XX SQ Sequence 15196 BP; 5924 A; 2879 C; 2696 G; 3697 T; 0 other; gattatttag ccgagattct tgtgccgaca cgatctaagt tttattctct cttgtccaac 60 tcttgatttt tgcatgaata tctaaaaatt aatatattga attcgaataa atttcgaata 120 aaaactaaaa attaaaatca gtgtccgcgt cctaaaagaa aagtttaaaa attaaattaa 180 ttgtgaagtg aaacaagagt acgcgaagtt attaaaagaa aaagaaagag aaattccttt 240 cggagtagaa gcacgcgtta tactcgcgac aaaggcgggt gtccgatttc gaactgcgcg 300 caacaaagtt gagagtatcg cttgggagct tggcgtcgcg tcatcagagc ctttgtgccg 360 gtccaacgcg cgtggagagt taaggaggcg gccatcttgc gagcttcggt tttcccaaat 420 tcgaaatcac tcgttgcttt gttcatgttc gttcaacctg atgagaagaa aaacgtgtta 480 gtaaaatatt cgaatcgaaa agaaaaatta tcgaggatct aaatttcaaa agaagaaatt 540 ttcaaatctt caaaaatttg ctgaaaataa aatcatttta gtcaaaagca tattaatgct 600 caaaaatttg gacaaaagca actttgcaat tttcaaaaaa aaaaggaacg aaatcattgc 660 ttgtgaatgg accccaaatg ataatgacta aattgtatat gattttaata taaaaaaaga 720 tatatatttc atgaacatat aaaaatttat ataatactgc tctagtgctg aaattcgatt 780 ttcacagcaa aggtgaaggt tgactgttca ggaccagatg tattcgctcc attctataaa 840 aggaaaatca aaattggtat cagaggttat caggaaagaa cgagtgcgac tgacaacaaa 900 ttggtaaagt cgcactcatt gtttgaactt gtgaaaagaa atagaatatc aaaaacgcac 960 tttgaagtat ttgtgcgagt ccctttttat gcatctgtac actaaaaagt taagttaaga 1020 taattagata tgtatggcat ttgaacattt ttctgtgagt agtgttaatt tgaagaccac 1080 gtcattctac gatacttcaa aaaggagaaa agtaaacaaa ttagcctcac cagcatttga 1140 gaacgtgaaa aggttaagaa aacacgttaa aacaggacca tttaactttt gttgttcaac 1200 aaatgtaatc aaacactcag tttcaaattg ttcattacta atacacaaac aaacactttt 1260 tactttcact atcatttata aaaaggacaa atgttaaaac gcgtcgaagt tagtttcatt 1320 tgcaaacact tgtagaaaac aaaataaaag cagatacact aaaagaaaca ctcatctggt 1380 ttattcggct ttgtaacatt gttgatcact gcgtaaaatc aaagaaatca aaacgttgat 1440 gtcgatcgtg gaagtttcaa ccagtaaatt cagtttggcc ccgtgaacaa catttattgc 1500 acaatttcct cgactctaga aggagcaaat aagtcgtaaa tacattgagt aatcgtttta 1560 atacacaatg ttcccacagt tagaaagtag aatcagaatt gtaaaagcaa gtttgaaaag 1620 gaatttttat cacatggctt aataatcaat tcagagaaag gaaattttac acggatacac 1680 ttaatgaatt gtttttaacg tatgttttaa tctattgtca atatacttta acatcaaggt 1740 cagaaaatta acaattttaa taattttaaa cgaatacttg caaatgaaca aaaatctaac 1800 ataaatagca ttgtttatgt caaagtcaac ctgtataact tattcaagca atataaaaag 1860 caaaattaca atatttaagg aaatgaagcg tgtggcaata ctcaaaactt tatacttgcg 1920 tgcatttttc tgagtcgttg caggtgaaaa gaggccattg agaaacttgc gccatcctag 1980 aataagtaaa aagttattta acacaaaaca acaagctcaa gtatcatgca aatgatttat 2040 gtttgaataa attgtcagga aaacttatct aattagtgag cagcaactgg ttgtactgct 2100 tggggtagag agtcctgggc ccacatttat atttccactt tttgccaata aacaccttcc 2160 gaaaatcact gtcacatgta aaggacacac tcgcgaacca ttagattata cgtgcgcact 2220 attttcaccg cgatttaata acaataattg aaggagaaag tacgcgacgc gacgcgggtg 2280 aggttagcca ttgcagcata gcagccgacg gcacgtgttt gttcagagcg aacgcgggcg 2340 cagacgcatg cgtgcgagcg tggttactag gcgcggctag ggactttaca gcggtgcgag 2400 attaggcgca cgcaactgac gcggtcgagt cggagtagag tcaagaagta gagagagcgc 2460 gcggcagcca tcttgagtgg gcgcggaatt agaataatta aaacgaaaga ttatcatttt 2520 taagagctct acaacaataa taatcattaa ttattaacat agattagaga tcacgcttgt 2580 ttgacctttt ttgcaaataa ataatttttt ttattttatt ttattatttt atcaagagaa 2640 attaactcct taggaagtaa caattgttgg tttagggttt tattaattaa actttttgta 2700 gatttaagaa caatattaaa atatcctttt gtttttggaa taatacttca ttattaattc 2760 ggctttgatc aagaggtact taattatttt gattatttta attatttttg ataaacatat 2820 taataattgt ttagaattcc agtttcgttt ttttttatcg tgtttatcta attcttttta 2880 tgtttaacaa aattaaataa aaccgagtga ttaaatcaga ccgactcttc atatacaatc 2940 acgtaagaac tttgaaattt ttgttaaatt attgcaacac acttaatttc ttttctcgct 3000 aattttcacg tcatatgcca aaattcgatt atttaactca atctgctcaa tcaagattta 3060 cattcatttt taaataaaca attaaatgtc atcgataatt tttagaattt taggaattct 3120 aacttctatt aggctaattg gtaacgttta agtttttttt tttggtactg gtttacggtt 3180 ttgaatttaa ttattaagat atagctatct gattttcctt tgagtataaa aatcagacaa 3240 gttattttta ggataaaggt tatgataatt tgaaaggttt taggttaaat ttacttttga 3300 aactttcatt atgatttgcg tgaattttta tcaaaacatt atttttaccc cttttacgag 3360 cattacaccc cattagcgaa gaaaagttta ttacccaata aacccctttt gtgagcatta 3420 caccccatta gcgaagaata gtttattacc caataaaccc cttttatgag atacgaaacc 3480 ttagtaattt tgttttgttt tgttttggtt catacatttt gtatgtttcc gaggcactaa 3540 cccttttgaa agttttcttt ccttttgaat cctaaagtga aatcttaaaa aaaagggaga 3600 acaaaaacaa aagtgatatt ggcgacaagt gagagtggtg gcataaatta ggcaaaattc 3660 tacagttaat taaaagtaaa tgaagcaaca agaagaaaat caaagtaacc gaaagcatac 3720 taaataactc aaggtattat aagagtacaa gagaatcaat cgacgatagg agtgattatc 3780 taggttatct agaaggcaca tagtacttat tcgagatgac ggggaacgcc tcaaacaatg 3840 atgattcagc taatggagga gcaccagatg cgagcatgct attaatgagc accatgacaa 3900 acagctttgc actcatgaat tgcataagcg gcattcaagt tttgatggaa aaaagaatct 3960 tcgagactac atacaagatt tgacgaacgc cagcgagtta gtcaatgacc cgattaagct 4020 gcaagtctta caacacgctc tacataaaat aacgggatca gcaaagaaaa gtttaaacaa 4080 taaaacaata aaaagtgtag aagatttgat acaacattta aagcaaagat ttggacccgg 4140 aagaaatttc agttactata ataataagat acagaacgtg agaatgagag aacagaacac 4200 agtaggcgat ttttatgatg taataaacgt gcttctgagc agtgctaaga atgctttgaa 4260 ggaagaaaaa gtagaagaat tcaaaaatga aatgatggca ccattggaaa ctttagcagt 4320 ggacatattt attaaaggtt tacctgcaaa tttggctgag cgagtagatt acagtaaacc 4380 caatgatttg cgagaagctt atgaagaagc agtacgatta gaaatcagga tggaagctaa 4440 aataatccca gattcaagac catactgaga tagaggccga tattacaaca atcaagacaa 4500 ttataatgga ttcgatcaca agactggaag atactatgat gcaccaccgt atcctgacag 4560 gagaaatgac actaactggc gcaataacca accgcaacat aatgatcaga cccaaagagg 4620 taatttaaac taccaagagg ctcgccacga ccccagtata gcgagccaaa accaaggtta 4680 cccgagacaa aaccaaatgc atcaaatgca gagtcaaggt acaaatcaag tccaaaacca 4740 acagatgccc agaaacacgg atcaacagca ttatcaacaa cagctacagg catacaacga 4800 gaacaaaaat gcgaacccaa cgaatccgag caacaatcag tcgaaagcac cagtaatggc 4860 aatcttgtcg aggccagaga agtatcgaga atatggttca caaactgtaa aggacctcaa 4920 aatgttaatt gggaggacga caaggccaca gcagtgagga taagaatacc agttggaacc 4980 actaaaccat gggtgggatt catgattgat aatggagcca tagtaaactt aattaaagca 5040 agtgtattgg atgatgacat gcccatctgt actgctgacg ctagagagtt aggcgaaatc 5100 acaaaccaaa ctgtgaagat gtacgcatca atatacctag acatcaaagg aaccccagtg 5160 aaatttcaaa tggtatcgga cgacttccca attccatttg atgggatgtt aggacgaaac 5220 tacctaaaga aagaagaagc tgtaatatct tattacaaga acgcgttaat gataagcgga 5280 gatgtaatgc accccatacc atttattgaa catgaagagg aacataatcc gagaaatatt 5340 aaaagaggat ataagtgtaa acaaacaaat gaagtgctaa aagtagaatc tactcaaagt 5400 ggaaaaagtg aatctatgaa ccaaaatgat gaaaatataa ccgatcaaat tggaggtacc 5460 gtaaagcaca taatcaaagc cagaacccga caagtgatcc aaatcaatct gataaaaacc 5520 gaactaaaag aaggatacat accgagaata gatgtgggaa acaaacatct atttcaagga 5580 gaaggtgtag taaccaatgt aaataatact tgcaaaatta tggccatcaa tacgagtgag 5640 gaagatgtca tcatagaagt ggatccaaaa gaacttatac catttgacac aggaccagat 5700 ttcagtgagt ttaatggcga catgattgtg gaccaaataa agagactaga gagagtaaaa 5760 gaagcaatca ggagagatca tttaaatcga aaagaattgg acatagttga tcggattatc 5820 gaagactacc tggaccgatt tttacttaca ggagacaaac ttccatgcac ggatatgatt 5880 gaacaccata tacatctaga aaacgacgta cccatcaaca ccaagcaata taggcatcct 5940 ccgcagcata agcaagttgt aagggacagc gtagaaaaga aactaagata taaaatcata 6000 agagaatcca attcaccttg taactcacca atatggatcg taccaaagaa acccgacagt 6060 catgggaatc taagatggcg gatgatgatt gattttagag aattaaataa gaaaacaatt 6120 agagacgcat acccattgcc aaacatcgca gatatcatgg accaattggg aggagcaacg 6180 tacttttcta tcttcgattt agcaagcgga tttcaacaaa taccaatagc acccgaagac 6240 tgttacaaca ctgctttcac gccaattaat ggacattatg agtacaccag aatgccagaa 6300 ggtctgaaaa atgccacagc gacctttcaa aaactaatgg aaaaggtact gagaagactt 6360 cagaacattg aaatgcttgt atatttagac gacatcattg tttatagcaa ggatctgcaa 6420 gaacatggaa gcagaattcg caacctcatg caaagactaa gagaagcgaa attagttcta 6480 caacccgaca aaattgaatt cttcagaaaa gaagtgggct tcttaggtca cataatcagt 6540 gccagaggcg tggaaccaaa tccagagaaa gttgcagcca tcgccaaatt agctacacct 6600 aaaagcgcaa agaatataag gaaagtactt ggaatgtttg gctattattg aaagtacatt 6660 aatgattttg caaaaattgc aaaaccactg aatgatctat tgaaaaagaa tgttaagttt 6720 gagtggaccg aaaaatgcga agagagcttt gaaaaattaa aacagtgcct catggaggaa 6780 tccatactac aatttcctga cttcaacaga gagagagaat tcactcacaa ttggagccgt 6840 tctaagccaa gagaaagatg gattcgatca ccccgtgcaa tatctatcga gagcattaaa 6900 caaagcggaa agaaactact ccactacaga aaaagagtgt ttagctgtat tatacgcgct 6960 acatcaattc agaccatatt tattatgtcg caaatttacc ttggtcagtg atcatgaatc 7020 attaaattgg atgcatacta gaaaggacac tggacaaaga ttaatgagat ggatgtttag 7080 attcaccggt tacgaataca catttaaata caaaccaggc aaactgaaca agaatgctga 7140 tgccttgtcg agacatcctc cagaaatgac cgaaaaggaa ataaatgaaa atctaccaaa 7200 aataaaggtc atgattatag atgaaaagaa aaaacagaac gggaataaag ccaaaagtaa 7260 tgcttcaact gatggaacaa tctttcgaaa cagagctcac tcagcatctg aaaggactac 7320 ggcaccacgg ggaagaggga gaccagtagg agcgaaaaca aacaagaatg caccaaaatt 7380 gaatcatagc gtgatagcac agagaacgag agcgaaacgc atacagatac caagttcagt 7440 aggacaattt taaacaaaga agtacaacca aaacattcaa aaccatcggt ccaaagaaac 7500 ccgtacaacc agaagcgtca accaaggttg aagaatcacc aacagaaatt acctctgtca 7560 agcccagaga gttgccagaa agttcatcca acatagattc agacacttct atgacagtga 7620 attacgagtg gaagacccag aaacatcaga ggatgaaaag caaaaaatac cagaagatga 7680 aacattctct tcagaggaca actaagacaa gactataagt aatatagcca ttgagttgag 7740 cgcagtagaa ggatctcaaa ctgaagaaac cactgatgaa gactctatgg aaaacctgtc 7800 cataaaaaac actctgacaa aacaagaagt ggaagaggct agtagaaaat ttgaagagag 7860 tttgagaaga tatgaaatgg ataaaagaat ggataccagc gaatcaggga ctgaaatagc 7920 catacagtta aaattaccct cacacttttc agaggatgat aatgtaccag acgacatgcg 7980 tgaatccctg tacctgtcgg atccaagatt cgaaaatgag actgacgtga acaacgtgtg 8040 aagaaaaggt gttcaaaatc taataaaacg agcacgggga aagttacaag agacaccagg 8100 ccaagaaaat acgggagatg aatcagatgt cgacagtgtc acatacatta taccaccacc 8160 aataagaaaa accttatctg tcacgcctac cataacaagc gatagtccca cagccaacag 8220 tacacctcga aactaactca agaacaaaag aaaactagga gtgctgagcg agatagaagc 8280 cgaagaaggg taaatcatag acataaggtt ttccaaacta cctgaaccag caatgccatt 8340 ccttatacct acaccaccgg acttggaacc actaaataaa agtcgtggtt cccaccaaca 8400 ccatgaggaa tctgacaatg ctttaaccca tataacagca aaaatgatac gagctcctca 8460 caacataatc actgttagag aatgtctcac acacaagagg gacaatattg tgaatttctt 8520 atcagcggat tgtgacaaca cgtggtccgt cgccagactg ctagtagaaa cagaagccat 8580 cgatctaatg aaaataaaga atcagaaacc aaagtcagga caagtgctag ttactccttt 8640 taagaagcat cacatattta cggtgatagt gaaagacaaa tacttcaata ccatcaaaat 8700 gccgaatctg catgaagcac tcatcaacct aaaacaaata ctcgtagaga aaggcatcaa 8760 aagctttcga gtggccagaa aaggagatat cttggaccaa ctaggatctc caaccatcat 8820 agaaatgctc tatgaacact ttcacagaag tgggataaga gtgaccatgt gctatggcga 8880 ggcagaagta ccatctgaaa agcacagaaa agagataatc agccacctcc atgatagtct 8940 aacgggagga cataaaggaa ttaaccaaac ttatcaaaag ataagagaat ggtactattg 9000 gcccggaatg aggaacgacg ttcaggacta catcaggcgg tgtgcgaatt gtcaagaaca 9060 gaaaatagaa cgatttaaga caagagaacc aatgatcatc acggacacac caatagaagc 9120 tttcgacaag gtgtcaattg ataccgtagg aaaacttaaa atgacaccaa gaggaaattg 9180 ccacttgtta acgatgcagt gcaatctaac aaagtacctt atagcaatac ctatcaaaaa 9240 tctgcacgca actacgattg cagatgcatt ggcgaaatat ctcatttgtc aattcggagc 9300 cccgaggaac cagtttcttg tcgaatatag ttgaatctct actaaagtta tttaaaataa 9360 accacttaac aacctcggga tatagacctc aaacaaatgg gtcactagaa acaagtcacg 9420 ctccgctcat agaattcatc agaatatatc atgagaaata tgatgattgt gatcacctaa 9480 caccatttgc aaccttcaca tataatacta gcatacacgc ggcaaccaac tttacccctt 9540 tcgaactggt ttatgggagg gtagcgtgtt ttcccttaag aataccatct gacgaaaaat 9600 taagaaccta taatgtttac atgcgagatc tagtactaag gttagaggag atgacacttt 9660 tagcaagcga aacccaaata gctaacaaag tcaaaacaaa agcgaggtat gacaaaaagg 9720 ttagaacgtt taagggcaaa gtgggcggat atgccaggtt aattaatgaa ccccgagtca 9780 gtaaatttga tgaatacaga aacaaaccct tgaaaatcat tgaattcgtt ggcaggaaaa 9840 acgtaatttt agaatatcct aatggcaaaa gaattcgaaa acacattgat aaattaaaac 9900 atgtagagga cgactctgat accaattctg attaattaat cgcaataagt gtaaataaat 9960 tgtaaataat atatatatat atatatatat atatataact aaattttatg aacccccact 10020 aactcacatc taaatttgca ggatgaataa attaaatacg atcatacttt ttaatataat 10080 aacaatggtt cgaagtcaat tcccatatca aatggaccca ttgaacccga accccggcat 10140 ctacttcgag agagtggacg tcatgaggac aaagagtgca aactggaaga ttcaaatcta 10200 catcgacgtg ggtgcattaa aacacggcag cattaaaaac ttgtgacata aggtatggaa 10260 ctagtgcaaa aacctaatgg aaatatttag aaaatgacca atcctggcta tactcagttg 10320 tcaaaccaga acaaatgcga atttcatgca cgaaccaagg agacacaaaa ataaaactgg 10380 agaaagccgg aataatacga ctctcacgaa cgtgcgttgg gaggaccaaa gacagcatga 10440 tatttggaca agaaaccaaa tccagtcgaa ccacatattt gtataagctc gaaattgatc 10500 taaaaataca tgacatgtac cccattctaa atgaaaagga ccaaaatctt gaccacaatg 10560 caccccaaag cattgacata attcgaccta aatctgattt taatcaagcc aaacccttaa 10620 aacaaatgat agaaaaatta agtgaattga aagaacacac aaggcgagaa tataaaacaa 10680 atacactctt atatagtagt ctaactttcc aactagtaat gatagcaagc atcatagcat 10740 taatagtggg aatgaaatgc ggaatgatga taccctgccc aaaattaaat aaacaaacat 10800 ataaagtacc tcaaactaac gagcccaatc gcgataaccc cataatcgaa acgaacctaa 10860 ccgaaacctc tagtacttct aaaacgaaat ttaataaatt aagcaaatcc aaatcatccc 10920 taatagaaac acaatttaaa aatgaagcgg aatgcaatta ttaacatata aatgtatatg 10980 caccaaacct taacaacaca tcccctaata cccaaaaata tatttataaa atatatttac 11040 agaaagtagg aagatgacga cgtcaaccac aagcgtgtca atattctgcg acatgcaagt 11100 accagtttgg accgttgtgc gagaatttga acaagttggg aaaccatggt gcatttttac 11160 cgaaaaacac cgaacagacc ccggaagcaa aaagatcaat ataacttatg atacagtaat 11220 ggagaccatg cgagccaaag acttaaaaac catactcata agagaaaata agttcggaag 11280 accagctaga gcagcaccga taccaataag aagaccccta ctcaatgaga acaaccctac 11340 tccagaattt cacccatact gcaagagaga tatcgatgtg agcctgtatg aattacatga 11400 ccaaatttgt aagaaacaag ccaaatggcc aattgccaga accctatggc agcttcctta 11460 gccgaaacag caacacctca aaatgtcccc gtgctaacca aaatcagaaa ggaagctaag 11520 gcggaagtaa ttcagcaaga agtaccaaca ttaagcaaca tcgtcaagcc ataactacca 11580 gaactaccag agttaccaaa gataccaaca tcaaataaca ccatagaagt cggaccggaa 11640 caagacacca ccatagatca agaaaaacgt ttaatttttg aaattaactc aactaattct 11700 tatttcataa cacgcgttga aaacaaaaat atccccaaaa atgtcttgga aaaaatggat 11760 gtagatgata gatctaaaga ataaataaag aaaaatccca tcgtaatata aaaacagtcg 11820 ttcctttatc acaaatctaa caaacaaccc aaccatacat tcacacacaa acacaataca 11880 agcattaaac aaacacaata gaaaggggaa cattaatgat gacaatatgt gagtcacaaa 11940 tgaccatatt aactgaaatg tccccgttaa ttctatcaac aaaggtttaa aaagttgaaa 12000 cctaactcat acaactcaca gaaacacaaa atgttttatc tccacatact tggatgtagt 12060 caactcttgc atagcgttga accatatgca acaatatcct ttaaatcaag ctgaaaccta 12120 actcacataa cccactgaaa cataaattgt tctgccacca catacgtgga tgtagaattt 12180 ttcttttcct ctattcaact cacacattca gacatgaaat attgttgttt cttgataagc 12240 catggacaac agcagccttt aggtcaaacc cacaaactaa atctcctgct cgctaaatag 12300 agcaatcaag gaaataagtg aataaggccc tttttctctt gctgttagtg gacctgttct 12360 gctacgtctc ttgaatttat tttctttctc ataactcaac ctaaaatcca cacacacacg 12420 aataaaacat aaatgatgac cagcgtttcc acaggactga tatctctcat aaatacaaat 12480 tatcctaccg tacgctataa cataatacct acaccacccg atccctataa tcaaaaatct 12540 tgttttctat ttgaattaaa aataaaatgg acctcaataa actcctaatg ataaacagac 12600 aaaatgtata acccctaaaa caaataaaca ttgcatgacc aaatatgtca gaccctaaaa 12660 caatatctta cattacaaca tgcaacgata ccttctatct ttcattcctc tttgttacag 12720 gaaaatggac cataacaaaa tagacatctt ggacgttacg gacatgggaa attccaaagt 12780 cctcataaaa ttcaaaggac aaaaagaaga cgccatgaac agcttcatga cacatggacc 12840 atgcgtaatc gctttcacac caacagccac accaaacatg tatatagcaa acatcgagta 12900 cgcaaaagaa gagaataaga tcagggtgat taaaacatac acaaagtcaa aatcacaaag 12960 cataatggag aaaatgggct acatagaagg caagggctta ggcaaagatc tacaaggtag 13020 aacggaccca gtcaaagccg taccaaatga atacagacag ggcttaggtg ctaccaacct 13080 aaattattac tcggtccaca atgacacaaa ttttgtcatt gaacaagaac aaggaccacc 13140 aaccatcata gaagaaccag gagaggtaat gagggtcagc aaaatcagtg aaaaatccat 13200 atcaataagc ataaggggaa acccagacga cataaaagca aacttaaaac aatttggacc 13260 atgtgaaata gagaaagttt atacggacca acaaggccac accaaactag tagtgagata 13320 ccagatccct ctgcataaca agaaagcagt ggagtatgca acagcaataa taaaaaaaaa 13380 accatagaat aaccaatacc agatatccct acgcagacaa caccaggagc agcaataaac 13440 ataccactgg tccttccgat attaccactg ttaccaccag cagttcgatg accgctattc 13500 gcaacgtcat taacaacatc attaccagca cgaccacaga aatagccaag cgtaaaacca 13560 ataccgatgc cagaaataat aacaggacca ctggatgaca cgacagggct aagattctgt 13620 gcccaggtac cttttggcat gaactctgaa accttcggaa ggatatttgc tgtatttggc 13680 cctctggacc acataacttt tgtcctacac aaaaagcagg aacataacac accactcagg 13740 aagtacctag ccttcgtgca gtacttcgac aagggggatg ccaagaaagc aatagaccat 13800 tgcagtgaaa gattcaaaca aaaatgggca tcgccacggc acctcaagaa gaacgagaca 13860 cagtacagaa aaaaacacag tgtatgcggc agaatgagcg attatagaaa caaagaacac 13920 catgcttatg taaaatacaa aatggagaaa cactaaaaca tcacaaagag tgtggcaaat 13980 acatacaata ccaacacatg gaagctcaca aacaacattg cataagggtc caagagaacc 14040 caaaagaaga aaatatgacc acagaagacg aattaataag cagggaccat tcaacatcac 14100 ctgagtaatc atagtggagg aaatagacct agaggaacta ccacaaggac atgatgaaac 14160 aaacccacta ggaacaacaa tggaccaaaa tattaatcca gtcagtgatc cagaacgaat 14220 actctgggta cgggaaaatg accgacaaat caacgaggaa gaagaagaca gaatgagcat 14280 ctatagttac acgcgattat cacaataaaa ttttatttca ttactttttt ttatctttct 14340 aaattttttc tattaattca ctttgtacgt ggcacaaaaa aagaaaacta gggggaccca 14400 aagaaataaa caatatattt ttgtactttt acattacacc aagtttaatt gtaaccctta 14460 tgtatgcata taaatagaat ataaaacata aaattgtaca ctaacaaaat gaagtattta 14520 gacataagac aactaattag cgcagatgat caaaagtact aacaatttag catataaggc 14580 atctaatgta gggccacgga ccacagcaac aaaagtaagt ggaatattag ttaaaaccca 14640 cgtccaatat caataaataa tactaaggga acagaatgat tcgaaaatcc ctgcaaccac 14700 ccacccgaac ataaaagcaa aacatttaaa acccatagaa gtctcgtcaa tggcgttttc 14760 tctagccaaa atcccatcaa acaaacacac tagacgtgca tgtcaagcaa aacaatcata 14820 ggctagacat attaaagaac catgagcgaa agcattaaca catgataaaa catacataga 14880 tcatgtgctg agccataact cacaagacat taacgacatt atttaaaagg gaagagaaac 14940 agcacaatag attggagacc agaacaaata acaataaata aaaaaaattt gcaataacaa 15000 aaagtaaatt aaagcccttt gcccctctga ataaatggaa tcattctgac aaaatttctt 15060 tttttctttc actcaaccac atcgaacaca agtttctttt tcttcttttc cacttacagg 15120 accacgtacc caaaaataac ggcctcagtt gtttctcggg aacgagaaac gttctcaagg 15180 gggggatgat gttacg 15196 // ID Gypsy-33_CQ-I repbase; DNA; INV; 4876 BP. XX AC AAWU01012402; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_CQ_; KW Gypsy-33_CQ-LTR; Gypsy-33_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4876 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 445-445 (2011). XX DR GenBank; AAWU01012402; Positions 333 5208. XX CC Positions [3850-4275] - Integrase core CC 'ATAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2365..4275 FT /product="Gypsy-33_CQ-I_1p" FT /translation="MEQMISRLKGSKFFTRFDIKQAFHQLVLKEDCRYITT FT FISPLGLMRYKRLVFGLSAAPELFQKAMENILCDFAWVIVFIDDILIHAPD FT KLTLERRTKIVMERLKDRNISLNEAKTLKCVTELEFLGYQLSGNGVTISNS FT KLDTITKFRIPKTAEEVRSFLGLITFVNRYIPHLSSIAKPLNNLIKKGIPF FT NWTSDHQEAFDKIKLILAKKETLAIYDTNLETFVMADASPVGLGAVLFQKN FT QNGTYQIIGYVSKTLSEVERRYAQIEREALALVWSVEKFYYYLYGKFFWLI FT TDHKPLVTIFGERAKPSARIERWILRLMSFKYQVIYQPGKSNIADPFSRLC FT QDAEICDESFDEEAERIIRMVSHDICPASLSHEEISDATCKDIELSDLIKW FT LPYPVKRWPKTLIKYKSFAKDLSTDGCIVLKNKKIIIPKILQNRVLQLAHE FT PHLGIIAMKKRLREKVWWVRIDKMIVNFVKACEGCLLVSKCSTTPMTRTPL FT PTGPWKKIAIDFTEIMNNTHLLVVVDYFSRYPEVEIMPSMTASATIYKLKI FT IFARFGYPEEIVCDNGPPFHSEEFIVFSKTCGIKINYSVPYSPFQNGMVER FT FNKSFLKTVKISATMGRDWKTDLQNFLLGEIQYTMNRDR" XX SQ Sequence 4876 BP; 1765 A; 715 C; 928 G; 1468 T; 0 other; ttggcgacga gcgggagtct tggaaacgaa gaagttgaag ttgtttactg aaaaatctaa 60 tttggaactg aagatcaatt tatttcaaaa ttcaaattgg aacttttaga atcaaattta 120 tcgaacattc aagtataaat ggcgagagta agtctgaatg tattaagaaa aatatattta 180 aaaaaatcaa aataataatt tcaaattatt agaaattttt gccaatttat ttttgtttga 240 aacggttaga ttacgctcta ccgggattgg aacaaatgaa ttgttcaatc tgatttttgt 300 cgagactccg gtctggcaag agaaacggca gtactccggt ctgccgggat cgaaacgaat 360 ccgtttgttt gattcatttt tgtcgaggct ccggcctgac aaaggaatcg acagtactca 420 ggtctgtcgg gatcggaaca aatccgtttg ttcgatctat tttgtcgagg ctacggcctg 480 acaaagaacc ggtttgactc aggttaaccg ggatcggaac gaatacgttc gttcgatcta 540 atttgttgag gttccggcct aacaaagata tcggtttgac tccggtcaac cgggattgga 600 atgatctatt cattcaatct gtgttgttca gattccggtc tgacaagtac aaataaaata 660 aattggcaaa cagaaaatct gaacaaattg agatatattt ttcactgaag tctagaaact 720 taatttattt caatatgata ctaatttaaa atatatttta gttggagtgt aacattttac 780 cgtttattga aaatgaagac gaaacaacca acgttcgatg ggaagcatgg aaagatcaat 840 ttttggctta tatggcattg aaaagaatta atgctcacga tgaaatgttt aatgctctac 900 aatgttttgg cggacctgat gttagaaaag ttattaagac atgtggagaa aatgctgttg 960 aaaatttatt tgaagctgct atggaagttt tggacaacta ttttgcaccc aaaatgtcat 1020 tacgatatga aagacatcgt tttcggcaaa tgttgttcaa tcctaaagaa aagttggatc 1080 attttgttat tcgtttacgt tctcaagcaa cactatgtgg ttttgatgat caacttgaag 1140 agagaacatt agatgaaatg ctgaaagtgc gccgtactca tgagtcagtc agagttcagg 1200 tacattttat ttattttgca gagaaattat ttataattca aataaaacta atcaattact 1260 ataggtacac gagttgcgta acaaaccagt gcatgattcc agagaaattt gcgaaattaa 1320 caggaaagat ataaaatgca gtcgatgtaa tggaaatcat ttagctaata atcctcgttg 1380 tccggctcgt gatatttctt gtcattactg taacatgacg ggccattttg cgagatgttg 1440 tatgtccaga agatctggtt ttcaaaagat gcataaaaag gatgattatg gtaacacgaa 1500 caaaaattat caacaaaatc aaaagcatca gatgaagaca caacaaaagt ttgttcggga 1560 agtggatgat gttacagaaa aagttgaaat ccgtgaatta ttccatttgg acgggaaacg 1620 atctgttgcg ctggatgtag gaggagtcga agtaaatttc attatcgaca ctggagcaga 1680 tgaagatgta ctaagcattg atgattggaa aaaactgaaa caaactggat ttaaaatgtt 1740 tgatgtgagg aaaggcagtg acaaaatttt tcgtgcttac ggtagctaca aaccattgac 1800 tgtacttgga gaagttgatg ttgaagtgag gcataaaaac caatgccatc gaaccacatt 1860 atttgttatt caagacggaa aatgttcatt attatctgga aaatcagcag aagttcttgg 1920 aattgttaaa tttttacatg ctattacaca gactccattt ccggcaataa gaggtaatga 1980 aatataatgt acataaagca gaagatataa taaaatgtaa tgattaaaat tattagatat 2040 gtttgcttca ataaaaatcg ataaaactgt tccaccagtt agacaatgtt tacgtcgtat 2100 tccaattcca ctagaggaac tcacattact taaattaaag gaattagaaa atcatgatat 2160 aatagaaaga gtcaatgaag cttcagaatg ggtctctcct atggtagtta agaggaaaag 2220 tgccaatgag gttaattttt attttaataa caaaatatta tttataagca ttatattatt 2280 taatgtaaat taaaggtgag aataataatt gatttacgag aagctaataa agcagtaatc 2340 cgagaagtac atcctttacc aacaatggaa caaatgattt ctcgattgaa aggcagcaaa 2400 tttttcacaa ggttcgacat taaacaagct tttcaccaac ttgtgctcaa agaagattgc 2460 cgatatatta caacttttat ttcacctctt ggtttaatga gatacaaacg attagttttt 2520 ggtttgtcag cagcccctga attattccag aaagcaatgg agaatatttt atgtgatttt 2580 gcgtgggtaa ttgtatttat agacgacata ttaattcacg ctcctgacaa actaacactg 2640 gaaagaagaa caaaaattgt gatggaacga ttaaaagatc gaaacatatc tttgaatgaa 2700 gctaaaacat taaaatgtgt aacagaatta gaatttcttg gatatcaact gtctggaaat 2760 ggtgttacaa tctcaaattc aaaattggat acaataacta aatttcgaat tccaaaaact 2820 gctgaagaag tacgaagttt tttaggattg attacatttg tcaatcgcta tattcctcac 2880 ctatcttcaa ttgccaaacc attaaacaat ttgatcaaga aaggcattcc ttttaattgg 2940 acatcggatc atcaagaagc atttgataaa ataaaattaa tattagctaa gaaggagacc 3000 ttagcaattt acgacacaaa cttagaaacg tttgtaatgg ctgatgcgag tccagttgga 3060 ttgggagcag tattatttca gaaaaatcag aacggaacat atcaaattat tggttacgtg 3120 tccaaaacat tatcagaagt tgaaagaaga tatgctcaaa ttgaacgtga agcgctagct 3180 ttagtgtggt ctgttgaaaa gttctattat tatctttatg gaaaattctt ttggttgata 3240 acggatcaca aacctttggt cactattttt ggagaacgag caaaaccgag tgcaagaatt 3300 gaaagatgga ttttaagatt gatgtcattt aaatatcaag tgatctatca acctggaaag 3360 tctaacattg ccgatccatt ttcgagacta tgtcaagatg cagaaatttg tgacgaaagt 3420 tttgatgaag aagcggagag aattattcga atggtttcac atgatatttg tccagcatca 3480 ttatcacatg aagaaataag tgacgccaca tgtaaagata ttgaactttc tgatcttatc 3540 aaatggttgc cgtatccagt taaaagatgg cctaaaacct taattaaata taaatctttc 3600 gctaaagatt tatcaaccga tggctgcatt gtactaaaaa acaagaagat tatcatacca 3660 aaaattttac aaaatcgtgt tctacagttg gctcatgagc cccatttagg aattattgca 3720 atgaaaaaac gattacgtga aaaagtatgg tgggttcgaa tcgataaaat gattgtaaat 3780 tttgtgaaag cttgtgaagg atgtttatta gtttctaaat gctcgaccac accaatgaca 3840 agaacaccac taccaactgg tccctggaag aaaattgcta ttgattttac ggaaattatg 3900 aataacactc acttattagt agttgttgac tatttttcac gatatcctga agttgaaata 3960 atgccatcaa tgactgcttc tgcaacaatt tataaactta agataatttt tgcaagattt 4020 ggatatccgg aagaaattgt gtgcgataat ggaccaccat ttcactcaga agagttcatt 4080 gtttttagta agacttgtgg catcaaaatt aattattctg ttccatattc accatttcaa 4140 aatggaatgg ttgaaagatt taataaatca tttttgaaaa cagtgaaaat tagtgcgact 4200 atgggcagag attggaaaac agatttacaa aattttcttt taggtgaaat tcagtataca 4260 atgaacagag ataggtaaag acttaaatcc tttattttag cttatcggtc tacaccacat 4320 tccgtaacag atgtggaacc agcaaaatta atgtttggaa gaaatattcg agataaatta 4380 ccgacaatag attctaccaa agcatcacca gattacgatg caattaaaga caaagaccga 4440 gaaaataaag aaaaaggaaa actaattgga gatagaaaga ggaaagcaaa agattcttcg 4500 attgatgtcg gcgatgaagt cattgttaaa aattttataa agaagaacaa aatgactacc 4560 aatttcaatc cacagcctca tattgtagtt aaaaggactg gaacgagatt aactgtcaag 4620 aataagcgaa ccggagttga attggataga catgttaatc atgcaaagaa actactttca 4680 gcaagtccat ttgaaaagga aataaatgat agagatgtca atccaactga ggaaaacggc 4740 agtacaacga ttggagaaga tgtcagcacg cgtacttcga gaaggatttc caagaaaccg 4800 aactacttac atgattattc attatatcag tttgatgaaa atctaaatta aagttcatgt 4860 taaaacaagg aagaga 4876 // ID I-80_AAe repbase; DNA; INV; 4839 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-80_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4839 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1352-1352 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. This consensus is 5'-truncated. XX FH Key Location/Qualifiers FT CDS 203..4492 FT /product="I-80_AAe_1p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MVSGRRGAGEPEALPQSESPGRPRHLADRNSLFIFPD FT SRGASLPEVLPQPESLGRLRRLGSDSDEELLQNYHRCPAISPGGRGPVSAD FT VSHTGTDGQPSALGGDLEEGSLAPTPATLDDVGSQDLQALGTRRALPATPM FT DDVGSEAILTQGIVQSPSPNNQSYPDDRTSRLSPLANPFFPAVGTPLHLVG FT STPGPALCGALSCSSNDNIRYQTPYANRSYSIPALPKPAKFCLQWNINGFF FT NNLGDLEILTSNDPPWILALQEINRVTIEQLNRSLGGKYMWTMKRMGNIRR FT SVALGVLKSIPFAILNIDSQLPVVGVKLQGSLPITVACVYLPCGNVPDLRR FT EVENCIHQLPEPRMIVGDLNAHHPAWGCSRSDARGNSLLDLLENNDIAILN FT NGSPTYFEGFYSTAIDVSAVTRSEVQRFRWNVNADPYGSDHHPIQISVAAE FT APTTTRRSRWLYDQADWAAYNNSINASFRTQQPSSISEFSTVLAQAAIVSI FT PKTSNKPGRKALRWWSPEVKKAIKARRKALRAVKRMEPDHPYRDATIVRYR FT FQRNECRQVIRDAKRACWEDFLDSINATQTTADLWAKVNALSGKRIAAPPT FT LIIDGNLTAEPTAIANGLGKYFANLAAFDSYDPEFIRRSGVTTDDIVNFNV FT PESNIQLSINEPFLVDELQYALSRSKGKSAGPDDIGYPMLKRLDGSGKLVL FT LELINNLWTGNTYPLEWRESLVVPIPKPGNQMRETASYRPISLTCCLSKVV FT ERMVNRRLTHHLQQHGLLDHRQHAFRPGYGTNTYFAALGEVLQEAKHKNHH FT TEMISLDISKAFNRTWTPFVLEKLVSWGLTGHILHFIRNFLTNRCFRVAIG FT DTKSASFKEETGVPQGSVISVTLFLIMMDGVFEDLPPGVQIFVYADDILIV FT VSGPTIAATRRKAQAAVSRVAKWAASVGFSMSAPKSIRCHICSSGHRISGP FT PIVINNQAIPVRKTAKVLGVIFDRGLTFKPHCEAVRANCRSRLNLLKSLSR FT PHRSNNRGIRFRVATALIDSRLFYGLETTFLAVNRLIETLSPIYNRYIRIV FT SGLLPSTPADAACVEAGLLPFRYLITATLCTKAAAFAERTAGNDRTRLLEE FT ADILLDSIAGVTLPPVAKIHWYGDRDWRHSSLKFDGRIKDNFRAGDSSTRL FT RQSVAELLRSDYTTHQRRYTDGSLSTQGVGIGIADDELAVSLGLPEQCSIF FT SAEAAAIFYAATKPTPRPIVIITDSASCFFALQSEAPRHPWIQGIIRHAPP FT GVTIMWVPGHCGVPGNSRADHLAGSGFSGRRYTTTVPLADIRRWVKSTMHL FT AWQAEWTNARGPYLRKIKGSVDAWTDTKSMKDQKVISRLRTGHTRVSHDLG FT GTRFHRICEVCETNNTVEHIVCVCPLYENSRQMNGISGSIRDALGDDPTST FT ASIISFIKEAGLYFKI" XX SQ Sequence 4839 BP; 1246 A; 1376 C; 1108 G; 1107 T; 2 other; atctcgcaaa gaatctctct cacaacagca gaaaaaatcc aaattcaaaa gcccctcgga 60 tagcttaggg accattccca agtcacggta agaccatcaa ccctacttcc ttttcacacc 120 gccaccccaa ctaaacaaga taaccatcca tacctcgccg tcccgtcacg agtcctccaa 180 tcaacagcgc agcgcagaac aaatggtttc aggtcgtcgg ggcgccggcg aaccggaagc 240 cttgccccaa tcggaatcgc cgggtcgacc ccggcatctg gccgaccgca atagtctgtt 300 cattttccct gacagtcggg gcgccagcct accggaagtt ttaccccaac cggaatcgct 360 gggccgactt cggcgtcttg gaagtgattc agatgaagag ctcctgcaaa actatcaccg 420 ctgcccagct atatcccctg gtggccgagg ccccgttagt gcggacgtct cccacaccgg 480 aactgacgga caaccctcgg cgctgggggg agacctggaa gagggaagtc tggcacctac 540 gcctgcaacc ctggacgacg tgggaagtca agacttgcaa gctctaggga cgcgccgggc 600 tctacccgct acccctatgg acgacgtggg aagtgaagct atattaacgc aaggaatcgt 660 ccaatcaccc agccccaaca accagtccta tccggatgac agaacaagtc gactatcgcc 720 tcttgccaac ccwttttttc ccgccgtcgg gacacccctg catctggtgg gttcaactcc 780 tggacctgcc ttatgtgggg ctctgtcttg ttcgagcaac gacaacatcc gttaccaaac 840 tccatacgcc aatagaagct actccattcc agcattacca aagccggcaa agttctgctt 900 acagtggaac atcaatggct tcttcaacaa cctaggtgac cttgagatcc tgacaagcaa 960 cgacccacct tggatcctgg ctctacagga gatcaatcgt gttacaatcg aacagctcaa 1020 ccgctccctc ggtggcaaat acatgtggac tatgaagcga atgggcaata tccgccgctc 1080 cgtcgctctc ggagtgctaa aatcgattcc gttcgccatt ctcaacatcg attctcaact 1140 accagtagtc ggagttaagc tccagggaag tcttccaata actgtagctt gcgtctacct 1200 accatgtggt aatgtccctg acctacgtag ggaggtagaa aattgcattc atcaactccc 1260 ggaaccaaga atgatcgtcg gtgacctcaa cgcacaccac cctgcctggg gttgctctcg 1320 ctctgacgct agaggcaata gtttgcttga cctcctggag aacaatgaca tcgctatcct 1380 taataacggc tctcccacgt atttcgaagg gttttattca acggcaatcg acgtttcagc 1440 cgttacccgg tcggaggtcc agcgattccg ttggaatgtt aatgcggacc cttacggaag 1500 cgatcaccat cctatacaaa tctctgtagc cgctgaggct cccaccacaa ctagacgatc 1560 ccgctggttg tatgaccagg cagactgggc tgcatacaat aacagcataa atgcatcttt 1620 ccgcacccaa cagccgagta gcatatccga gttctccacc gtcctcgccc aagcggcaat 1680 tgtttcaatc ccgaaaacaa gcaataagcc tggccggaag gcactccgtt ggtggtctcc 1740 agaggttaaa aaagctatca aagcgcgcag aaaagccctt cgtgctgtca aaagaatgga 1800 gcctgaccat ccttatcgag atgcgacgat tgtcagatat cgttttcagc gcaacgaatg 1860 ccggcaagtt atacgggacg ccaaaagagc ttgttgggaa gactttctcg acagcattaa 1920 cgctactcaa acaacagccg atctttgggc caaagttaat gcccttagcg gcaaacgaat 1980 cgccgctccg cccacactca ttattgatgg aaaccttact gcggaaccaa cggctatagc 2040 gaacggcctt ggcaaatatt tcgctaacct tgctgccttc gacagctacg atccggagtt 2100 tatccgacga tctggtgtta cgaccgatga catagtaaac ttcaacgtac ctgagagcaa 2160 catccaactt tccatcaacg aaccattctt ggtagacgaa ctgcaatacg ctctaagccg 2220 aagtaaagga aagtccgccg gtcctgatga cattggctac cctatgttga agaggctcga 2280 cggatcaggt aaactagtac tgctagagct gattaacaac ctgtggaccg gtaacactta 2340 cccacttgag tggcgggaaa gtctcgtcgt tcctatcccg aaacccggaa atcaaatgcg 2400 ggaaacagca agctaccggc caatctccct cacttgctgc ctttcgaagg tagtagagag 2460 aatggtaaac cgccgactaa cgcaccacct acagcaacat ggtctactcg accaccggca 2520 acacgccttc aggccggggt atggaaccaa tacctacttt gcggcccttg gcgaggtctt 2580 gcaagaagcg aagcataaga accatcacac cgagatgatc tcgttggaca tctcgaaagc 2640 ctttaaccgg acttggacgc ctttcgtact agaaaagctg gtcagctggg gccttaccgg 2700 tcatattctg cattttatcc gaaactttct gactaaccgt tgcttcaggg tagccattgg 2760 agacacaaag tctgcatcct ttaaggagga aaccggtgtt ccgcagggat ccgtcatatc 2820 ggtaaccctg ttcctgatca tgatggatgg cgttttcgag gatctccctc ctggagtcca 2880 aatatttgta tacgcggacg atattctaat agtagtgtct ggaccgacaa ttgcagctac 2940 acgcagaaaa gcccaagcag ccgtctctcg agtagccaaa tgggctgcct ccgttggttt 3000 ctccatgtcg gctccaaaga gcattcgatg ccacatatgc tcatccgggc ataggatcag 3060 tgggcctccg atcgtcatca acaaccaagc tatacctgtt cgtaagacag caaaagtcct 3120 tggagtcatc ttcgaccgcg gcctgacctt caagccccac tgtgaagcag tcagagccaa 3180 ctgtcgtagc cgcctgaacc tactgaaatc tctctcgaga ccgcaccgca gcaacaaccg 3240 aggtatccgg ttccgtgtcg ccaccgctct cattgacagt cgactgtttt atggtctaga 3300 aaccactttc cttgcggtga acagattgat tgaaacactg tctcccattt ataaccgcta 3360 catacgaata gtttcgggtc ttctaccgtc cactccagcc gacgcagcct gcgtcgaggc 3420 tggcctactt cccttccgct acttgatcac tgctacactt tgtacgaaag cagccgcttt 3480 tgcggagagg accgctggaa acgacagaac ccgcctccta gaagaggccg acatcctgct 3540 tgacagcata gccggcgtga cgctccctcc tgtcgccaag atccactggt atggtgaccg 3600 tgactggcgc cactcttctc taaaattcga cggcagaata aaggacaact tcagggcagg 3660 agactcatca acacggttgc gacaatccgt cgccgagctc ctacggtccg attatacgac 3720 acatcagcgg cgttatacag atggttccct ctcaactcaa ggggttggaa taggcattgc 3780 ggatgacgag ctcgcagtaa gcttaggtct ccccgaacag tgttcaatat tttccgcgga 3840 agcagcggct atattttacg ctgctaccaa gccaactccc cgcccgatcg tcataatcac 3900 cgactcagca agttgcttct tcgctctcca atcggaagct ccccgccatc cgtggattca 3960 ggggataatc cgacacgcac cgcccggggt tactattatg tgggttccag ggcactgtgg 4020 cgttcccggt aatagtagag ccgatcacct tgcaggttcc ggcttttccg gccgccggta 4080 tacaacaaca gtacccctgg ccgatattcg acgttgggta aaatcaacga tgcacctggc 4140 atggcaggct gagtggacta acgcgcgagg gccctacttg cggaaaatta aaggaagtgt 4200 cgatgcttgg acagatacca agtcaatgaa ggaccagaaa gtcatctccc ggctgagaac 4260 cggccatacg agggtctccc acgatcttgg agggacacgt tttcaccgaa tctgcgaggt 4320 ctgcgaaaca aacaacacag ttgaacacat agtctgtgta tgcccgcttt acgaaaattc 4380 caggcagatg aacgggattt caggcagcat ccgagacgct ctcggagacg acccaacctc 4440 gactgcatcg ataatatctt tcatcaaaga agccggactg tactttaaaa tttagttttc 4500 cttgacatcc tgagtcagtg ttttgcggca aaccttgctt taatttactt cgtgattttc 4560 aaacacctcg gatatcgtcc gttctgaact tttatttcaa ttcatgtttt ggtagtcccg 4620 tgactaccct taattgttgt gatgcaatct tttaatatgt ggctattgtc tttttaaatg 4680 tcccacgatk aagtaataac tgtgttaaga gcctaaaatt aattcaaaat actttgtagt 4740 tttttcaggc gagtccttct gactcgtcct gcttatcaga ctggggatga acctgcctac 4800 ggtagaaaat ccctcaataa aaaaaagaaa aaaaaaaaa 4839 // ID Gypsy-19_AA-LTR repbase; DNA; INV; 163 BP. XX AC supercont1.227; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_AA_; KW Gypsy-19_AA-I; Gypsy-19_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.227; Positions 677139 676977. XX SQ Sequence 163 BP; 44 A; 23 C; 38 G; 58 T; 0 other; tgttatatta ttatgtagtt tgtggttgct ccctctgagc gtaatgtttg aatgaccccg 60 acgtgatgaa agagttcagt tgttacctgg ttgatcgaca gttcggatgt acaataaagt 120 agagtttgta gcgaacggat ctttttattc ttgaaataaa aca 163 // ID BEL-183_AA-LTR repbase; DNA; INV; 746 BP. XX AC AAGE02026169; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-183_AA_; KW BEL-183_AA-I; BEL-183_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-746 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026169; Positions 7871 7126. XX SQ Sequence 746 BP; 267 A; 157 C; 119 G; 203 T; 0 other; tgttggcaag cccctggcag ccagcgcgca ccttccatac atcaaccctg ggccgtacac 60 acaccagcag cgttcacaac gatgacaggt aacgttcacc gtctgtcaac cggtgagtag 120 aaacaacatg acaagaaagc agaaaacaaa cgaagcaaac gtgtggacaa attgtagctc 180 agctagttaa actaaaatta cttattacta ttttcttaac tgaacttaac taggtaaata 240 cttaaactaa aattgaactt tacaactatg tgctattaac tatgaattat gatttgtgtg 300 cctatgatta ggatttgcat ttgaggcacc actaggtcta acctacatat actgcgagac 360 tattattttg attagtgcac agtcacggta agggaactac ataaaattta ttacaaatta 420 cctaaataac tattgattct agcgactaaa cctacgaccc gaagacatac gaaccgacat 480 ctgactattt agggcaataa acgtaagttg actagaacga tacaaacaaa cagatcttaa 540 ctcacaccat atatgttcag tacaacataa cgtatgctaa tttacttata atgtgaattc 600 taaactacgc tcttctttgt ataattactg aattaaaaca ggaatattat aatctaccgt 660 taaactacac aaagtggacg tgtttcaata cggtgattat aatcgcggaa ttaacccctt 720 aaataccgtt ggccataaac ccaaca 746 // ID Rehavkus-1_DA repbase; DNA; INV; 6401 BP. XX AC . XX DT 30-APR-2006 (Rel. 11.04, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed Rehavkus-1_DA DNA transposon - a DE consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus group; FB; FB4; NOF; Rehavkus-1_DA. XX NM Rehavkus-1_DA. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-6401 RA Kapitonov V.V., Gentles A.J. and Jurka J.; RT "Rehavkus-1_DA, a family of Rehavkus DNA transposons from the RT fruit fly Drosophila ananassae genome."; RL Repbase Reports 6(4), 191-191 (2006). XX DR [1] (Consensus) XX CC Rehavkus-1_DA belongs to the Rehavkus group of the MuDR CC superfamily of "cut and paste" DNA transposons. Transposons from CC this group are widespread in different metazoa, including CC insects, sea squirts, sea urchin and fish. The genome harbors CC several copies of this transposon. Its 1.2-kb inverted termini CC are composed of a 79-bp terminal inverted repeat and 282-bp and CC 102-bp subterminal minisatellite-like units. The transposon is CC flanked by a 9-bp target site duplication and encodes a 1036-aa CC Rehavkus-1_DA transposase. XX FH Key Location/Qualifiers FT CDS 1424..4531 FT /product="Rehavkus-1_DAp" FT /translation="MPSKTHIDVSEIVDTLCLFNIFENETGALKSRNDSVW FT KEASDRLKTKISAPTLNVYVRINRNNMLTLIKRRCGITENSINETVNSTLP FT EDDPEFQISEVHGNGPLPNLFLTLELDEEVWGSIAPIKGKNEEVKDLRPKN FT YLQPGWTDIIAKALQEKVPLPCAFNFKKAKISGKEENIWLKIEGYCSECSS FT LFKGHCLVEPDPNLGISISVTVPDTRGIPHNQKRRCTGLKRLAIGHELLHK FT KASLWRKEATKEMKDGDPEPSYIPKLPTLRKLREEAINRSLGINKGHDPIS FT ALYLKKYEGELAGCILDIGLDEFYCIYCTPAQIKTYNAQIKRVRKISVDAT FT GSVVLPIVKPHGDTSYVFLYQVVMEGEDGICPVFQMLSAKHDTASIQFWLS FT RFMSKSGIYPPEVVCDFSLALLNAVSLSFNDCRLHTYMSSCFEALSCESSR FT IPRSYIRLDIAHLIKMMCRKNVFKGKLPNVKDFYIRCIGVTTTCETKKSFK FT KILKSILIVATSESSGLNKEGENYASQNNENFLLQKIATFTAPEYDEDYEN FT SITNTPEELEPEGENEGINEFIKNILRSAEKKASKCKSSRSRPNPYFLPEL FT VKPLIKLSQYFILWTNVMKNKFTSNYNVGSSALVEAYFKDLKKSDMSIFSR FT PVRADKFVTQHIRCIEAACKLERAATKRKTMKTPAFIEDKKLKLDDKIDAG FT HLLEEENWKIKNKAKNTQQLEIEDEEKPRGKYLKKCPNLEMIYNRPYRRKK FT DAILQNGGALGPIWIEEKSINLKNTCPFDSLVEILATAYMDNIYYKRAIDE FT EKDNSIILFSKKYAIDGISGELYCDRGRILTKIFSEYNSVVYCEANIGTFI FT KKVMEGIPSLQKKNVHENGNQSCNNMINCLECLELVDAEKISSLGVHDGLI FT ATILDYFAPKNGRCKDCGVSTISHAQPGPHVIIDMEFAFDSFQRELYANAP FT GSATLQQLPEKIKLQQKTYILSGAVEYVPVSGSGIGHYIAYCRRITGLWEA FT HNDFTKQAKHFTATNTSMKIHIVFYTEADV" XX SQ Sequence 6401 BP; 2264 A; 1005 C; 1104 G; 2028 T; 0 other; agctacgaga agttgggtcg ggtaaaatta tattttttgg aaaaaaaggg tgcaatcgat 60 tgcccatcaa atgaccacaa atgaccatca atattttgaa atgaccacct cccgtttgga 120 aaatattcaa aaatgaaaat tttcgaaaaa ttttcgaaaa ttttaaaatt tgactttttg 180 attttttttt tttttaaatc gaaataaacc aaaatgacca ccaatggtca tcttttagaa 240 ttttgaaaaa acttttagtt tttgaaatat ttaagtgttc agtttatctc atttttttca 300 caggtaaaac aaaatttcaa aaaaaaaact actgaaatcg ttttccaata gtgtgataaa 360 caatgaccat caatattttc aaatgaccac ctaccgtttt gaaattattt aaaattgaac 420 atagttaaaa atttttcaaa aatttcaaaa tcgacttttt gaattttttt ttttcaatat 480 cgaaataaac gcaattgacc acaaatggac atcttttaga attttgaaaa aactttcagt 540 ttttaaaata tttaagtttt tagttttggt cattttttgg ccaggtaaaa aaaaaattca 600 aaaacaaact actgaattcg ttttccatta gtgtgataaa caatgaccat cttttaaaat 660 tttaaaataa catttacttt tcgagatact tgattctcat agtcttaaaa ataaagaaat 720 taaaaatcca aaagatgacc atcatggccc atagcctaaa acatataaat taaaaaaaaa 780 aattgatttc catcattgcc catcccttaa aacttaaaaa atgaacaaaa aactatatgc 840 ccatcatggc ccacatctta aaacttaaaa attttacaaa aaaaactatg cccttcattg 900 cccatccctt aaaacttaca aattttacat aaaaaactat gcccatcatt gcccacatcg 960 taaaacttac aaattttaca taaaaaacta tgcccttcat tgcccatccc ttaaaactta 1020 caaattttac ataaaaaatc atgcccacca ttccgcatct cttagaactt gaaaatttta 1080 caaaaaaaat ctatgctcag catggccaac agcttaaaac ttaaaaattt tacaaataaa 1140 tctatgccca ttgctaataa cttaaaaatt taaataatgt gtgagttctt gcaatttgaa 1200 cattggaagt aaaatattaa gaagggtttt gttaagaaaa atatatcatc taatttttat 1260 tacttataaa ctaggtttcg atagttccta aacagaatta tttgtctttt ccgatgatct 1320 attggtcatt cagccgtgct gagaacgacc aattgaatct tccccgtgaa aagattataa 1380 gctgtatata atttaactgt tttgaaacct ggaatacacg aagatgccat ccaagaccca 1440 cattgatgtt tctgagatag tggatactct ctgcctcttt aatatttttg aaaatgaaac 1500 tggtgcgtta aaatcaagga atgactctgt ttggaaagaa gctagcgacc gactaaaaac 1560 caaaatcagt gccccgactt taaatgtata cgttcggata aatcgaaata atatgttgac 1620 ccttataaag cgaagatgtg gtattacgga gaactcaatc aatgaaactg ttaactcaac 1680 gcttcccgaa gatgatccag aatttcagat cagcgaagta catggaaatg gacctttgcc 1740 caatctgttt ttaacattag aattggatga ggaagtttgg ggatctatcg ctcctatcaa 1800 aggaaaaaat gaagaagtaa aggatttgcg accaaaaaat tatttacaac caggatggac 1860 tgacataatc gcaaaggcct tgcaggaaaa ggttcccttg ccttgtgctt ttaattttaa 1920 aaaagcaaag ataagcggaa aagaggaaaa catctggctc aaaattgaag gttactgcag 1980 tgagtgcagt tcactgttta aaggacactg tcttgtggaa cctgatccaa atttgggaat 2040 aagtatttcg gttacagtac cagatactcg tggcatacca cataaccaaa aacggagatg 2100 cactggatta aaacgattgg caattggcca tgagttgctg cataagaaag cgtcgttgtg 2160 gagaaaggaa gccacgaagg aaatgaaaga cggcgatcct gagccaagtt atattcccaa 2220 gttaccaaca ttaagaaaat tgcgcgagga agctataaat cgaagccttg gaataaacaa 2280 aggacacgat ccaatctctg cattatattt aaaaaaatat gagggcgaac tagctggatg 2340 catactggat attggtttgg atgagttcta ctgcatctac tgcactccgg ctcaaattaa 2400 aacatataat gcgcaaatta aaagagtccg gaaaatatct gtggatgcca ctggaagcgt 2460 tgttctgcca attgtgaaac ctcatggcga taccagctac gtgttcctat accaggtggt 2520 tatggaaggg gaggacggaa tatgtcccgt atttcaaatg ctatcggcca agcatgatac 2580 ggccagcatc caattctggc tgagtcgatt tatgtcaaaa tcgggcatat atccacctga 2640 agttgtgtgc gacttttctt tggctttgtt gaacgctgtg agtttaagct tcaacgattg 2700 tcgcctccat acatatatgt cgtcctgctt cgaagctttg tcctgcgagt cgagtcgaat 2760 tccaagatca tatattcggc tcgacatcgc ccatcttata aaaatgatgt gccggaaaaa 2820 tgtattcaaa ggaaaattgc ccaacgtcaa ggatttttac ataagatgca ttggagttac 2880 aacaacctgc gaaaccaaaa aaagttttaa aaaaatatta aaatcgatcc ttattgtggc 2940 aacgagcgaa tcctcaggat taaataagga gggagaaaac tacgccagcc agaataacga 3000 aaattttctt cttcaaaaaa ttgctacatt tacggcaccg gaatatgatg aagattatga 3060 aaattctatt accaatacac cggaggaatt ggagccagaa ggagaaaatg aaggaattaa 3120 tgaatttata aaaaacattt taagatccgc agaaaaaaaa gccagtaaat gtaaatcatc 3180 cagaagtcgt ccaaacccat attttcttcc ggaattggta aagcccctca tcaaattatc 3240 gcaatatttt attttatgga caaacgttat gaaaaacaaa tttacatcga attataatgt 3300 tggatcgtct gcattggtgg aggcttactt caaagattta aaaaaatccg acatgagcat 3360 cttcagtaga cctgtgcgcg cagataaatt tgtaacacag cacatccgat gcatcgaggc 3420 agcctgcaaa ttggagcgtg cagcaacaaa aaggaaaaca atgaaaacac cggccttcat 3480 tgaggacaaa aaattaaaat tagatgataa aatcgacgca ggccatctcc tagaagagga 3540 aaattggaaa attaagaaca aagcgaagaa cactcaacaa ttagaaattg aagatgaaga 3600 aaagcccaga ggaaaatatt taaaaaaatg tccaaatctc gaaatgatat acaaccgtcc 3660 ctatagaagg aagaaggacg cgatattgca aaatggcggt gcacttgggc ccatatggat 3720 tgaagaaaaa tcaatcaatt taaaaaatac ttgccctttt gattcattag tggaaattct 3780 agcgactgca tatatggaca acatctatta caaacgggct attgacgagg aaaaggacaa 3840 ttcaataata ttattttcaa aaaaatatgc cattgatgga ataagtggag aactatattg 3900 tgatagagga cggattctta caaaaatatt ttcggaatat aactccgttg tgtactgcga 3960 agcgaatatc gggacgttta ttaaaaaggt catggaagga ataccaagcc tgcagaaaaa 4020 aaacgttcat gaaaatggca accagagctg taataatatg ataaattgcc ttgaatgctt 4080 agagctggta gatgctgaaa aaataagcag cctgggagtt catgatggac taatcgccac 4140 tattcttgat tattttgcgc caaaaaatgg tagatgcaaa gattgcggtg tgagtacgat 4200 ttcccatgca cagcctggac cacacgtcat catagacatg gaatttgcgt tcgacagctt 4260 tcaacgagaa ctatatgcca atgcgccagg atcagctaca ctgcagcagc ttcccgaaaa 4320 aattaaatta caacaaaaaa cttatatttt aagtggagct gttgaatatg ttccggtatc 4380 aggatcagga attggacatt atattgctta ctgccgcagg ataacaggat tatgggaagc 4440 gcacaatgat tttacgaagc aagcgaaaca ttttacggca acaaatacga gtatgaagat 4500 acacattgtt ttttacacgg aggcagatgt ttaatttgaa taaaagctaa taaaaaattt 4560 taaaaaaatt tataattatg tttttcttgt ataaaaagat cttatataat aatggttata 4620 ctcgtatata taggctggtt tatgatctgc taaaatatat tttatgacgc tttttttatt 4680 agtttataag ggcataacat ggaaaatctg ggaaatatcc tcatctggga aatgtttatt 4740 ggttacctgt ttgggtttta agtccttccc agttaggaaa ccccaaaaaa tttgcggtaa 4800 tatgcttttc ccaaaccgat cccttttttc caaaccgcat aattatgcaa ttgaaacttt 4860 atggtgtact gtaggtagta catacatatt tcgtatgatt ttttccattt cctgttttaa 4920 gtgttttccc caactaccgt aactatcttt tacgacctgt gtgtgtatat attctggatt 4980 tccccatgac aggtataagt tgaacatttt tcaaaaatga cgagttagac tttataagta 5040 ataaaaatta gatgatatat ttttcttaac aaaacccttc ttaatatttt acttccaatg 5100 ttcaaattgc aagaacctac acattattta aatttttaag ttattagcaa tgggcataga 5160 tttatttgta aaatttttaa gttttaagct gttggccatg ctgagcatag attttttttg 5220 taaaattttc aagttctaag agatgggcaa tggtgggcat gattttttat gtaaaatttg 5280 taagttttaa gggatgggca atgatgggca tagtttttta tgtaaaattt gtaagtttta 5340 agggatgggc aatgaagggc atagtttttt ttgtaaaatt tttaagtttt aagatgtggg 5400 ccatgatggg catatagttt tttgttcatt ttttaagttt taagggatgg gcaatgatgg 5460 aaatcaattt ttttttttaa tttatatgtt ttaggctatg ggccatgatg gtcatctttt 5520 ggatttttaa tttctttatt tttaagatgt gggccatgat gggcatatag ttttttgttc 5580 attttttaag ttttaaggga tgggcaatga tggaaatcaa tttttttttt taatttatat 5640 gttttaggct atgggccatg atggtcatct tttggatttt taatttcttt atttttaaga 5700 ctatgagaat caagtatctc gaaaagtaaa tgttatttta aaattttaaa agatggtcat 5760 tgtttatcac actaatggaa aacgaattca gtagtttgtt tttgaatttt tttttttacc 5820 tggccaaaaa atgaccaaaa ctaaaaactt aaatatttta aaaactgaaa gtttttcaaa 5880 attctaaaag atgaccattt gtggtcaatt gcgtttattt cgatattgaa aaaaaaaaat 5940 tcaaaaagtc gattttgaaa tttttgaaaa atttttaact atgttcaatt ttaaataatt 6000 tcaaaacggt aggtggtcat ttgaaaatat tgatggtcat tgtttatcac actattggaa 6060 aacgatttca gtagtttttt ttttgaaatt ttgttttacc tgtgaaaaaa atgagataaa 6120 ctgaacactt aaatatttca aaaactaaaa gttttttcaa aattctaaaa gattaccatt 6180 ggtggtcatt ttggtttatt tcgatttaaa aaaaaaaaaa tcaaaaagtc aaattttaaa 6240 attttcgaaa atttttcgaa aattttcatt tttgaatatt ttccaaacgg gaggtggtca 6300 tttcaaaata ttgatggtca tttgtggtca tttgatggac aatcgaatgc accctttttt 6360 tccaaaaaat atcattttac ccgacccaac ttctcgtagc t 6401 // ID CR1-64_HM repbase; DNA; INV; 4154 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-64_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4154 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1891-1891 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(278..982,986..4033) FT /product="CR1-64_HM_1p" FT /translation="MVNAKEFKDALDALESKLRIEYESKISNAVADLKKTI FT QSLVLTVADLTKKNEDLNNKVVNVIVPAPNEAWNIVTGRKLNKTQSQIDII FT NTVSNENKEIERREKNVIIFGLEESNKTDGADIKKDDENKVLDLLDQVSID FT KLHMVRIFRLKSKNGIAPVIVELRNKDTRNDFIKKSYRKFNGIYVNPDLTD FT AQRNLDKQLRDKCRELNKPLNLKVDWDQVKSYHVIRNKQVVLINKQLLFKS FT SQYNTHNFKHSLHTPCQYXYNQLNNPDADTGGSNHLNSSANTLLVKKMNQD FT LQKGLKTENIDYLTCWYTNATSLNNKFDEFIDDISRSKAQVIFACETWWSA FT MSTTNIPGYNLFRKDRQFSRGGGVCIYIQENIKSYEINEKYLINENIEQVW FT CLIEIGKEQILCGCMYRTGLSNSEKCIDIIKSLKQAYVLKQNNKCSGILIC FT GDFNYANIKWNDESFSEIINDSDTTASMFIESLHDCYLHQNVISPSFQVKF FT GIDTNILDLILTDNSNRVYFLEHFPPLGGIDHGHHILKFRFGYENKPNLLI FT SLQTRKKKFLWKNGNYKALDLYFNNINWINEFEGLSVNECYEKWLRKYEIG FT CEKYIPTLNLDKNSFQLKNNSLWMTSELKSMCKEKKKCWFRNRNSGFKSLL FT EVNKYKKLNKDIKKLVKKSIRNFEKNLALNSKSNPKSVYAYINNKTRFKDS FT IKAIKLHNGIITTDNQEIVNTLNEFFASVFIQDDSPEEVLDDNLLTKCSDP FT DFSISIIQKHLSNLNMYKSVGPDNVHPKVLKECSESLSSSLAIIFVKSFIT FT GSIPKLWSCANVVPLFKKGNKLDPTNYRPVSLTSIVGKVMERIIKDHMMVF FT LIENNLISKEQHGFVNNKSCITNLLETLDLITQAYEEDISVDVLFLDFKKA FT FDSVSHTKLLKKLFRLGFVSSLLNWCKSFLSDRTQRVVIGEYISSWKMVTS FT GVPQGSVLGPLLFVIFINDLCRGIIKDTKLYADDTKVISINHCYDDNKILQ FT EDINRLVKWSEDWLITFNESKCKVMYIGKKNPRYEYKLNNSVLTETMIEND FT LGIFISNNLEWKYHXNSAIGKANRKLGMIKNSFEYLDELSLKLLYKSLVRP FT HLEYGATVWSPFWKKDIDNLEIVQHRATRIESLRGKSYEERLKILELPTLF FT DRRRRGDLIQMYKISKGKDIVNFHHPPLNFELERSSRSNNCRIRRQFTKSR FT IRHHFFTNRVINDWNSLPQWILDKDNLNIFKNSIDNYFGF*" XX SQ Sequence 4154 BP; 1629 A; 518 C; 664 G; 1336 T; 7 other; tatgatgtat agtatattaw actattttaa cataattcta aaaatagagt tcaattttta 60 aaaaacatta taaactagta aaaatattta aaaatttttt ttatayacta ttttcataat 120 aaatacwcat agaaatttct gatagtcata ttgatgaaya atctaaaata tatatatata 180 tatatatata tatatatata tatatacgga aaaaaacaca tactgaaact ttttagtccc 240 tgtgtaataa ggttaactat attataacta atatattatg gttaatgcaa aagaattcaa 300 agacgctctg gatgcactgg agagtaaact aagaattgaa tatgaatcaa agattagtaa 360 tgctgtagca gatttgaaaa aaacaataca atcattagta ttaacagttg ctgatttaac 420 taaaaagaat gaagatttaa acaacaaagt tgtaaatgtt attgttcctg ctccgaatga 480 agcttggaat attgtaacag gtagaaaatt aaataaaact caatctcaga ttgatattat 540 taatacagtt tcgaatgaaa ataaagaaat tgaacgaaga gagaagaacg taattatttt 600 tggtttagaa gaatccaata aaactgatgg cgcagatata aagaaagatg acgaaaacaa 660 agtattagat ttattggatc aagtatcaat agataaactt catatggttc gtatttttag 720 attaaaatct aaaaatggta ttgcaccagt aattgtcgag ttaagaaata aggacacgag 780 aaatgacttt ataaaaaaat cataccgtaa gtttaatggt atttatgtaa atcctgattt 840 aactgatgct cagagaaact tggataaaca actcagggat aaatgtagag aattaaataa 900 accactcaac ctaaaagtcg attgggacca agttaaaagc tatcacgtaa ttagaaataa 960 acaagtagtg ttgataaaca aatagcaatt gttgttcaag tcatcacagt ataacacaca 1020 taattttaaa cattctttac atacaccttg tcaatatmta tataaccaat taaataaccc 1080 tgatgcagac actggaggaa gtaatcattt gaactcaagt gctaatacac tgctggtaaa 1140 gaaaatgaat caagaccttc aaaaaggttt aaaaacagaa aatatcgatt atttaacatg 1200 ttggtataca aatgcgacct ctttaaataa taaatttgac gaattcattg atgatatatc 1260 cagatctaag gctcaagtta tatttgcttg tgagacttgg tggtcagcaa tgtctactac 1320 taatattcct ggttataatc tgtttagaaa agatcgtcaa tttagtcgtg gtggtggtgt 1380 atgtatttat atacaagaga atattaaatc ttatgaaata aatgaaaaat atctaataaa 1440 tgaaaatatt gagcaagtat ggtgtttgat tgaaattggc aaagagcaaa tcttatgtgg 1500 ttgtatgtac cggacaggtc ttagtaatag tgaaaagtgt attgacatta taaaatcatt 1560 aaaacaagca tatgtactca aacaaaataa caaatgctct ggtattttaa tatgtggtga 1620 ctttaactat gcaaacatca agtggaacga tgaaagtttt agtgaaatta taaatgacag 1680 tgacacaaca gcaagtatgt ttattgagtc gttacatgac tgttacctgc accaaaatgt 1740 tatcagtcct agttttcaag taaagtttgg tattgatact aacattttag acctcatttt 1800 gactgataat agcaacagag tttattttct tgaacatttt ccaccgttag gtggtatcga 1860 tcatggtcat catattttga aatttcgatt tggttatgaa aataaaccta atttattaat 1920 atcactacaa acgaggaaga aaaaattttt gtggaaaaat ggaaactata aagcgttaga 1980 tttgtatttt aataacataa actggataaa tgaatttgaa ggactaagtg ttaatgaatg 2040 ttacgaaaaa tggctaagaa aatatgaaat aggatgtgaa aaatatatcc caacgctaaa 2100 tttagataaa aactcttttc aactaaaaaa taactcacta tggatgacaa gtgaacttaa 2160 aagcatgtgt aaagaaaaga aaaaatgttg gtttagaaac agaaattcag gttttaaaag 2220 tcttttagaa gtcaataagt ataaaaaatt aaataaagat ataaaaaaac ttgtcaagaa 2280 aagtattcgc aattttgaaa aaaacttggc acttaattca aaaagcaatc ctaaaagtgt 2340 ctatgcttat ataaacaaca aaacaagatt taaagactcg atcaaagcca taaagctaca 2400 taatggtatt ataactactg ataatcaaga gatagtaaac actcttaatg aattctttgc 2460 ttctgtgttt atacaagatg actcaccaga agaagtttta gacgacaatc ttttaacaaa 2520 atgttctgat ccagatttta gtatttctat tattcagaaa catttgagta atttaaacat 2580 gtacaaatca gtaggtccag ataatgttca cccaaaagtt ttaaaggaat gttcagaatc 2640 tttgtcaagt tctttagcaa taatatttgt taaatcgttt attacaggat ctattccaaa 2700 gttatggtca tgtgcaaacg ttgttccgtt attcaaaaaa ggaaataaac tagatccaac 2760 aaattataga ccggtttcat taacatcaat tgttggtaaa gttatggaga gaattattaa 2820 agatcatatg atggtgttcc tcattgaaaa caatttaatt tctaaagagc aacatggttt 2880 tgttaataac aaaagttgta taacaaacct gttagaaaca ttagatttaa tcacgcaagc 2940 gtatgaagag gatatttccg ttgatgtttt atttttggat tttaaaaaag catttgattc 3000 agtctctcac acgaaattac tgaaaaaatt attcagacta ggatttgtat catctttgtt 3060 aaattggtgt aaatcatttt tgtcagaccg aacacagagg gttgtgatag gggaatatat 3120 ttcaagttgg aagatggtta ctagtggagt accacaaggt tcagtactcg gacctctttt 3180 gtttgtaatc ttcattaatg acttatgtag agggattatt aaagacacta aattatatgc 3240 agacgatacc aaagttattt caatcaatca ttgttatgat gataataaaa tattacaaga 3300 ggatattaac aggttagtta aatggtctga agattggtta attacattta atgaatcaaa 3360 atgtaaagtc atgtatattg gcaaaaagaa tccaagatat gaatataaat tgaacaactc 3420 tgttttaaca gaaacaatga tagaaaatga tcttgggatt ttcatttcaa ataatctaga 3480 gtggaaatat catrttaatt cagctatagg taaagcaaac agaaaattag gcatgatcaa 3540 aaactctttt gagtatttgg atgaactttc attaaagtta ctatacaaat cgttagtacg 3600 tccccatttg gaatatggag caacagtttg gagtccattt tggaaaaagg acattgacaa 3660 tcttgaaatc gtgcaacaca gggctacgag aattgaatct ttaaggggga agtcgtatga 3720 agaaagatta aagatacttg aattgccaac tttgtttgat agaagaagaa gaggtgatct 3780 gattcaaatg tataagatct caaaaggaaa agatattgtt aattttcacc aycctccttt 3840 aaattttgag ctggagcgat ctagtagaag taacaattgt agaatacgca gacaatttac 3900 aaaatcaaga attcgtcacc atttctttac aaacagagtt ataaatgatt ggaattctct 3960 tccacaatgg atattagata aagataattt gaacattttt aaaaacagta tcgataatta 4020 tttcggtttc tgaacactaa tttttgtaat cggctgtcat agtcttaatt ctacgtaatt 4080 aagactctac acacctcaag tgtgtacagc accttttcta ttctattcta ttctattcta 4140 tatatatata tata 4154 // ID SMAR6 repbase; DNA; INV; 1738 BP. XX AC . XX DT 29-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR6. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1738 RA Jurka J.; RT "SMAR6: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 995-995 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 288..1580 FT /product="SMAR6_1p" FT /translation="MSTKRKSYSVEYKKGIVEDSRGVNLVAFCKEKMLDLR FT MVRKWRADYDYLCQMVDQGNEKKRKVGSGRQLLFSELEECIWEWIADRRAM FT ALVVRRADIQKFALETAIEFGIFTEEFKASKHWLDNFLQRHELSLRRSTTL FT FKLEDNEVVKRALAFKRFVDGIDFSKYELSNMIAMDETAVFLGQGAQTTVD FT HKGSSSIYVSSSGYDSVRVTCILAIRLDGKKVSPFLITKGKKDQIQRISGI FT YVIESEKAWCTQAVIRKWVDFILPPLLRGRNRGLLVWDSASTHRAKDMKNF FT LAERRIDQIMIPAGMTGYLQTLDIAINKPFKDHLRMEINDYFENRMVRNHR FT GNFVKPGLQEVVNWIKNSWDKITDSCVSNALRTGYLDKKYIFNDSYIARHE FT RFGPMIQQEMDLEENQNRNQNLIYDDVPEEDDMIVNE" XX SQ Sequence 1738 BP; 584 A; 262 C; 372 G; 520 T; 0 other; taccgtgttt ccccgaaaat aagccctatg gagatttttg gaacttttta taatataagc 60 cctcccccga aaataagccc tagttaaaat aatgtgtaaa aaagaattcg ttaaaaatgc 120 tattgattta ttaagtatgt ttaacttctt ataaataatt gttcggcagt ctaaactgag 180 cgacagcgca gatacaacgt ttgtgtattt tattcttgta agtcggatta ttttttcata 240 cgtaatttta ttttttataa agtgaataat ttttaaccca agttaagatg agtacaaagc 300 gaaagagcta ttccgttgag tacaaaaaag gaattgtgga agactccagg ggcgtaaatc 360 ttgtagcctt ttgtaaagag aagatgttgg atctacgtat ggttcgtaaa tggcgtgcag 420 actacgatta cttgtgtcaa atggtagacc aaggaaatga gaaaaaacgc aaagttggat 480 caggtcggca gctgttattt tctgagttgg aagaatgcat ttgggaatgg attgctgaca 540 ggagagcaat ggctttggtt gtgcgcaggg ctgatattca aaaatttgcc cttgaaacgg 600 caattgaatt tggtatattc acggaagaat ttaaagcatc gaaacactgg ctagacaact 660 tccttcaaag acatgaactg tctctaagga gatcaacaac attgtttaag ctggaagaca 720 atgaagttgt taaacgtgca cttgcgttta agcgttttgt tgatggcatc gatttttcga 780 aatacgaact ctccaacatg attgctatgg atgagaccgc ggtattccta ggccaaggag 840 ctcaaacgac ggttgaccac aagggttcct cttcaattta cgtttcttct agcggttacg 900 atagtgtacg tgttacctgt attttagcga ttcgtctaga tggcaagaaa gtatcgcctt 960 ttctaatcac taaaggtaaa aaagaccaga ttcaacgcat ttcaggaatt tatgtaattg 1020 aaagcgaaaa agcctggtgc actcaagcag ttataagaaa gtgggttgat tttatactgc 1080 caccattgtt gaggggtcga aatagaggtt tactagtttg ggattcagcc agtactcacc 1140 gcgctaaaga catgaaaaac tttcttgctg agaggaggat tgatcaaata atgattccgg 1200 caggaatgac tggttacctc cagacacttg acattgcaat aaacaagcca tttaaggatc 1260 atttgcggat ggaaataaat gattattttg aaaacagaat ggtgagaaat catcgaggta 1320 attttgtaaa accagggtta caagaagttg tgaattggat aaaaaattca tgggataaaa 1380 taactgatag ttgtgtttct aatgcactac gaacaggcta cttagacaag aagtatatat 1440 ttaacgacag ctatattgca agacatgaga gattcgggcc aatgattcaa caggaaatgg 1500 atttggagga aaatcagaac agaaatcaga atttgattta tgacgatgtt ccagaagaag 1560 atgacatgat tgtaaatgaa taaattttag tttctgtatt ctgtgtaaaa taaataaata 1620 ttaaataaaa atgtataaat ttatttttat ttttgtctcc cccgaaaata agccctagtg 1680 catattttgg accaaaaaag aaagtaagac agtgtcttat tttcggggaa acacggta 1738 // ID CR1-101_AAe repbase; DNA; INV; 5246 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-101_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5246 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1189-1189 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. Closely related to T1 and Q. XX FH Key Location/Qualifiers FT CDS 272..2107 FT /product="CR1-101_AAe_1p" FT /translation="MPCGVKSCSVNDDHLLWRCTYCDKKYHAACIGVQRHR FT EHIITAFMVPMCADCQDILMKEADVRKLLHQQEKLLEAVRSQNDTNQRIAA FT DLSKFSLVSVLFESIEHLLNEVNESLALSNKNNESIRNALDQHMVSCDSRV FT TSAITDINISNTKMSQKIKNFFDDSALAIEKAVNSAAEVAVNAITPTKPPT FT TPDPLPIIIDEVRNELASLAKSAAFAAIAEASTTTEHKHAEILFLSKEILE FT EVKSVSSAVETIESSTGKNSSQADKTLAEELAEDILSSVIHKSNTKTHKPS FT RAHQQIQHQQKPQPSPNKHANNVIKELFAERRAKMSRIRSVQTTESQPPNS FT GWRFINNGKKRLWLQDWSKYDATYSNKIAERGSTQNKHERDSNLNKKSNNN FT QIKPSDKQLLATAKVQFAGPPPNTDHPIAALTVPKLINFRKGETINPYRVE FT KQTQPQNNTQLPTVPTTNTAPSSTLTESPGWSLDPLRPPIVTLTQQSAEGD FT GRFLKARLRDPNIMKIVRLFLAYMKDQPANICIDGNTPTSIRMVLASEGLP FT TDPDHLLRIFSDVHQEYGVGPAEAAADLEAYRRYLSSERTNRLQQLRENSH FT KFFSPYPGPSNFRK" FT CDS 2233..5085 FT /product="CR1-101_AAe_2p" FT /translation="MKSPAKMKEIYKNILSSSFSVILATETSWDESVKSEE FT VFGSGFNVFRGDRNFSESGRKSGGGVLVALSTDFNSEVINTQHFKEFEHVW FT VKSHIAGETHLFASVYFPPDHAHKGTYENFLQCAEQILSQLPPEVKVHIYG FT DFNQRNADFIRDSENESILLPVVGDNETLQFIFDKTANLGLNQINHIKNRQ FT NCYLDFLLTNIYEDFCVTESLNPLWKNEAFHTAIEYSLFIHKNARPNDYEY FT EEAFEYKSADYEKIKCRLNSVNWQEIIRNEGSVETAAYIFNEIVLEIIKQE FT IPSKKIRRNLSTKNPIWYNNHIKNLKNRKQKAHKIYKKHQKDENLAKYLDI FT CTQLNLAISDAFEEYNSKTELEIKSCPKNFFNYVKTKLKSDNFPSVMHLDE FT NVGDNSEKICNLYANFFQEIYTTFSEEDRDRNYFAFYPEFSRDIGVNQINV FT REISDALMNLDASKGPGPDTIPPIFMKNLAKELTAPLFWLFNKSLESGIFP FT KIWKSSFLVPIFKSGRKSDIRNYRGIAIISCIPKLFESIVNEKLFLQIKNR FT ITDTQHGFFKGRSTSTNLLQFVDYSLNAMDNGNHVEALYTDFSKAFDRIDI FT PMLLFKLKKIGIEPRLLKWLESYLTDRQQIIKFKGNKSKPIQVTSGVPQGS FT HLGPLLFIMYVNDISFIFNKLKVLIYADDMKLYLEIRNEEDINVFRNEIEI FT FYIWCKKSLLELNVKKCNLISFSRKRTTPRISISLGNQNVEKCERVRDLGV FT ILDSKLSFVDHYNTIIHRANNMLGFIKRFCYNFHDPYTIKTLYIAYVRSIM FT EYCSIVWSPFSMTHEERLESVQKQFLLYALRKLGWTSFPLPPYKARCMLID FT IQTLKERREIARVSFVNDIVSQRIDSTEILSKLNFYAPSRQLRNRNLFLTN FT HHRTNYAKNGPLNQMMAIYNQHCETIDFTMSRQNLKTYFKSTRNRNT" XX SQ Sequence 5246 BP; 1849 A; 1131 C; 890 G; 1376 T; 0 other; ttctgtgacg aaccaccttt gagttcggac gcgctttcat agcgctcttg taaatatttt 60 acaactagac ttctgacccc gggcccgcgc gcgtaccggt cagtccgtat ttcaattgta 120 tatatttatt ttctactttt gacaacatag ctctgcgcgc ttctcgtgct ttgccagtaa 180 ttgcaattaa actttttttt ttaccctcga cgcgagcgta gcgaagtaac tttttttacc 240 cccgtgtgtg tagtgtggtc atatatcaaa catgccttgc ggagtgaagt cttgcagcgt 300 caacgacgac caccttctat ggaggtgtac ttattgcgac aaaaaatacc acgctgcctg 360 tattggagtc caaagacatc gggagcacat tattaccgcg tttatggtac caatgtgtgc 420 cgactgccag gacatcctta tgaaggaggc cgacgtgcgg aagctgcttc atcaacaaga 480 aaaacttctt gaagcagtgc gctcacaaaa cgacacaaat cagcgtattg ctgctgacct 540 aagtaaattc agcttagtgt cagtattatt tgaaagcatt gagcacctgc tcaatgaagt 600 aaacgaatcg ttggctttgt ccaacaaaaa caacgaaagc atacgcaatg cactcgacca 660 gcacatggtg tcgtgtgact cacgcgtgac ttcagcaatc accgacatca acatatcgaa 720 tacaaaaatg tcacaaaaaa ttaaaaactt ttttgacgat tctgcattgg ctattgaaaa 780 ggcagtcaat tcggcagcag aagtagcagt aaatgcaatc acgccaacga agcctccaac 840 cactcctgac ccacttccaa taataattga cgaggtcaga aatgaattgg cctcattagc 900 caaatcagct gcatttgcag caatagcaga agcatcaacg acgacggaac ataaacatgc 960 agaaatccta ttcttatcga aggaaatatt ggaggaagtt aagtctgtct catctgctgt 1020 cgaaactatc gagagcagca ctggaaaaaa ctcatcgcaa gccgacaaaa cattagccga 1080 agagttagcc gaagacattc tttcatcggt catccataaa tccaacacca aaacccataa 1140 accaagtcgg gctcatcaac aaattcaaca tcagcagaaa ccacagccgt cacccaataa 1200 acatgcaaat aatgtcatca aagaactttt cgctgaacgt cgagcaaaaa tgtccagaat 1260 acgatccgtc caaacgactg aatctcaacc accaaatagt ggttggcgat ttataaacaa 1320 cgggaaaaaa cgactttggc tccaagactg gtctaagtat gatgccacat actcaaataa 1380 aattgctgaa agaggatcaa cacagaataa acatgaacgg gattccaacc tcaacaagaa 1440 aagcaacaac aaccaaatca aaccatcaga caagcagctt ctagcaactg ctaaagtgca 1500 attcgcgggt ccgccaccta acaccgacca cccaatcgct gctctcacgg ttccgaaatt 1560 aatcaatttc agaaagggtg aaaccataaa cccctaccgc gttgaaaagc aaacacaacc 1620 acagaacaac acacaattgc caacagtgcc aacaacaaat actgcaccat catcaacgtt 1680 aactgaatca cctggctggt cactcgatcc cttgcgacca cccatcgtca ccttaacaca 1740 acaatcagct gagggtgatg gccgcttcct aaaagcaaga ctgcgtgatc ccaacatcat 1800 gaagatcgtc cgactgtttc tggcatacat gaaggaccag cctgcgaata tctgcattga 1860 tggcaacaca ccaaccagca ttaggatggt cctcgcgtcg gaaggattgc caacggatcc 1920 cgatcatctt cttcgaattt tttctgacgt tcatcaggaa tatggagttg gcccagctga 1980 agcagcagcg gatctggagg cgtatcgtcg gtacctatcg agcgaacgta ctaaccgact 2040 tcaacaattg cgggagaatt cgcacaaatt cttctctccc tatccaggac cgtcaaattt 2100 tcgcaagtaa ggacgcccac ttctacggaa gaacacagta taattttatc aaacgacgtg 2160 cctccaaaca cttctttgca atgtagacag agtgtgactg aaatcttgat atactgtcaa 2220 aatttcaatc gcatgaaaag cccagcaaaa atgaaggaaa tttataaaaa cattttaagc 2280 tcatcctttt cggttattct ggcaactgaa acaagctggg atgaaagcgt aaaaagcgaa 2340 gaagtctttg gaagcggctt caacgtattc agaggcgacc ggaatttttc agagtccgga 2400 agaaaatcag gcgggggagt actcgttgct ctttcaactg attttaattc tgaagtaatc 2460 aacacacaac attttaaaga attcgaacat gtctgggtaa agtcacacat agcaggtgaa 2520 acacacttgt ttgcctcagt gtattttcca cctgaccatg cgcataaagg cacgtatgaa 2580 aatttcttac aatgtgccga acaaatttta tcacaactcc ccccagaggt taaagttcac 2640 atctatggtg attttaacca acgcaatgct gatttcattc gggactctga aaacgaaagt 2700 attttacttc cagtcgttgg ggacaatgaa acattgcagt ttattttcga taaaaccgca 2760 aatttaggac taaatcaaat taatcatatt aaaaacagac aaaactgtta tttggatttc 2820 ttattgacaa acatttacga agatttctgt gtaactgagt cattgaatcc attatggaaa 2880 aatgaagcgt ttcacacagc aatcgaatat tccctattca tccataagaa tgcaagaccc 2940 aacgactatg aatacgagga agcctttgaa tacaaatcag cagactatga aaaaattaaa 3000 tgtagattaa atagtgttaa ttggcaagaa ataatcagaa atgaaggaag tgtcgaaact 3060 gctgcataca tattcaatga aattgtacta gaaatcataa aacaagaaat tccttccaaa 3120 aaaatacggc gaaacctcag tacaaaaaat ccaatatggt ataacaacca cataaaaaat 3180 ctgaaaaatc ggaagcaaaa ggcacacaaa atttataaga aacaccagaa agatgaaaac 3240 ttagcaaaat atttggacat ttgcactcaa ttgaatcttg ccatcagtga tgcttttgaa 3300 gaatataatt caaaaactga acttgaaata aagtcctgtc caaagaactt cttcaattac 3360 gtaaaaacaa aactaaaatc tgacaatttt ccatctgtta tgcaccttga tgaaaatgtg 3420 ggcgacaact cagaaaaaat ttgcaatcta tatgcaaatt ttttccaaga aatctatact 3480 acattttcgg aagaagaccg cgatcgcaat tattttgcgt tttacccgga attttctaga 3540 gatattggtg tgaatcaaat caatgtgcgg gaaatttcag acgctctaat gaatttggac 3600 gcttcaaaag gtcccggccc tgacacgatt ccaccaatat ttatgaaaaa tttagcaaaa 3660 gaacttacag cgccattgtt ttggcttttt aataaatccc ttgaatccgg aatcttccca 3720 aaaatatgga aaagctcctt tttagtgcct atctttaaat ctggccgaaa atctgacata 3780 cgtaattatc gtggtatagc cattatctct tgtattccaa aacttttcga gtcaatcgtc 3840 aacgaaaaac tatttcttca aatcaaaaat agaattactg acacacaaca tggcttcttt 3900 aaaggtcgct cgacctcaac aaatttactt caatttgttg actattcatt gaatgcaatg 3960 gacaatggaa accacgtaga agctctttat acggacttta gcaaggcatt tgatcgcata 4020 gacataccaa tgctactttt caaactaaaa aaaattggaa tcgagccaag actgctgaaa 4080 tggcttgaat cgtatttaac tgaccgccaa caaataataa aattcaaagg aaacaaatca 4140 aaaccaattc aagtcacttc gggtgtccct caaggctctc atttgggacc tcttcttttt 4200 attatgtatg taaacgacat ttccttcatt ttcaataaac tcaaagtgtt aatatatgcc 4260 gatgacatga agctctacct ggaaataaga aatgaagaag acatcaatgt attccgcaat 4320 gaaatagaaa tattctacat atggtgcaaa aaaagcctat tagaactgaa tgtaaaaaaa 4380 tgcaacctaa tatcatttag cagaaaaaga acaacaccgc gaatttcaat ttcattagga 4440 aatcaaaatg tagaaaaatg tgaaagagta agggacttag gagtaatctt agactctaag 4500 ctttctttcg tagaccacta caatacaata attcacaggg ctaataacat gctagggttc 4560 ataaaacgct tctgctacaa ctttcatgac ccatacacaa ttaaaactct atatattgca 4620 tatgtaagat caatcatgga atactgcagc attgtatggt ctcctttctc aatgacacac 4680 gaagaaagat tagaatctgt acaaaaacaa tttctactat atgctcttcg taaattaggt 4740 tggacatcat ttccacttcc accctacaaa gcacgctgca tgcttattga catacaaact 4800 ttgaaagagc ggcgtgaaat cgcaagagtt tcatttgtaa atgatatcgt ttcgcaacgt 4860 attgattcta cagaaatctt atcaaaatta aatttctacg ctccttctag gcaactacga 4920 aatcgtaact tattcttgac aaatcatcat cgaacaaatt atgccaaaaa cgggcctcta 4980 aatcaaatga tggccattta caatcaacat tgcgaaacta ttgactttac catgtctcgg 5040 caaaatctaa aaacatactt caaatccact agaaatcgca acacataaaa taaaacaaaa 5100 tgaaattaac attacttaca ctacactttt ctttttcaat attttttata aatttcataa 5160 ttttagtatt aagaaataaa acaaaaatat gtatatgtaa aatgtactag tctacgtcgt 5220 ttgacgaaat aaataaataa ataaat 5246 // ID Gypsy-100_AA-LTR repbase; DNA; INV; 255 BP. XX AC supercont1.273; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-100_AA_; KW Gypsy-100_AA-I; Gypsy-100_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-255 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.273; Positions 1459545 1459799. XX SQ Sequence 255 BP; 79 A; 65 C; 49 G; 62 T; 0 other; tgttgtatac cttgaaattg atataccttg attaatgatg tcgaatgaca gttcatttac 60 cgcgcgttcc cctacgcgcg caagcgaacc gaagggttcg agccaaaaga aaaatatacg 120 gacaagcagt ccactcagaa acacaactca gtgcagacgt tttagcagca aggaaaaccc 180 ccctaaactt cggtctttaa ccccagtatc cgcccgtgaa aattgagtcg taagtttcca 240 ctttagcttc aatca 255 // ID CR1-4_NVi repbase; DNA; INV; 4499 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-4_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4499 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(4), 751-751 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(8..451,405..1415) FT /product="CR1-4_NVi_1p" FT /translation="MAETSAGSSCAQATIYYKCHPKIEVKTVVCLICESVY FT HTSDFARLDGAIKLSSVLGICNEHHDLDLTSNISKETLSNDARHIVAQIKS FT NHNSESRKKLLDNLEIISDEEEGEVYENSLKRENSLLRELNKELTEKKQDA FT KRFTGETKKQKKNKMLRDLLEKQKSDTNTVKNKTFAEVMSYSKTSTRSKRV FT PSIIIKNKSAHTIDQINLFVAHYLNKDKAIQTKKVHKKKDEVIVNCLNEES FT TSKAFSILKKKLADECEVNKEKVENPKVKVVGINNFENLDDKKLEEDINER FT NFSKFSKKCIVLHTYKNNKTNLQTALIEMPAELYQSVRESNNKLFVGYQNC FT RVYDYCNIKPCFNCGRFGHNGLKCNNRPICFKCTGNHKTRECERQGNIMKC FT PNCIYSNSKYKTEYKTEHMATDSHECKILQAKIRKFIDFTDYTMQPILPRY FT VGMAGATTVKAAGWRNIGMSTASINTLNDNIPNKGRNKN*" FT CDS join(1587..2495,2538..4007) FT /product="CR1-4_NVi_2p" FT /translation="KVCQKSHVXIICTIICTETWIVQHPHFYQLHGYKLCY FT NESKINRADGVLVYIKNNIYEKTKVEVIDKLSVVSSDLKLQSGESLRVSAM FT YRCHDVPKTEFLNSVKKFLSKHIRTKNHCIIGDFNIDIIESNVQNNNIGAI FT SISQEFLNNFFEQEFIPYYRGVTRPSLDNESGSCIDNCFAKTRDVNIESYK FT LCTPFNDHYPLVISIDQLKVQNEQEKIENCINYRKLKEYAKKIEWNSCLHL FT QDPNLALDELIKMIHSCVGKATCNKKTKNKQLPRKKWITKAILVSCKNKEN FT LYNIWKKILQINKTIKDAKYKFENNEIRRSGDNPRQLWSIINSKMGKSNKQ FT MKSPDCIVVNNQKIVNKTNIANEMNDYFSQVGIDLSNQINNKCDDQIRLPA FT RNSESIFIQPTNVCEIHRVISEMKKKAGGVDNINASTIKCIAMFLAKPLEH FT IFNISIQQSIWPNALKRADIVPIHKSGDSTCIANYRPISLISNIAKIFEKI FT IYNRLFNFIMKHNIISERQFGFVRGRGATDALDCLADIIYKSLDKSKPIIT FT TFLDLAKAFDTVDHRILLDKLERYGIRGCALKLLTSYLSGRKQCVKINNFK FT SEYKDISIGVPQGTILGPLFFILYVNDLLIDMLKDSIMSYADDTVIISSDD FT SWSAAQDKMNFYLSKVAKWLMLNKLSLNVNKTVYITYGNYCDSVPEDINIK FT IGGNFINRVESHKYLGLHFDYNMKWNIHIDSIVKRTRYLIFVFAKIKSIMD FT RKTLMMIYHAFFQGLVNYGIIAWGGHITIVCTRYKEYKRSY*" XX SQ Sequence 4499 BP; 1871 A; 608 C; 802 G; 1216 T; 2 other; cgccaacatg gcggagacga gcgcaggctc gtcgtgtgca caagcgacaa tttattacaa 60 atgtcatcct aaaatagaag tgaaaacagt tgtgtgtcta atttgtgaaa gtgtgtacca 120 cacaagtgat tttgcaagat tagatggtgc gattaaatta agtagtgtat tgggaatatg 180 caacgaacat catgatctcg atctaacctc aaatataagt aaagaaacgc tgagtaatga 240 tgcgaggcat atagtcgctc agataaaatc aaaccataac tcggaatcta gaaagaaatt 300 gctcgataat ttagaaatca tcagtgatga agaagaagga gaagtgtatg agaatagttt 360 gaaaagagaa aattcactgt taagagaact aaataaagaa ttaacagaaa aaaaacaaga 420 tgctaagaga tttactggag aaacaaaaaa gtgacacaaa tacagtgaaa aataaaactt 480 ttgcggaagt aatgtcttat tcaaaaacta gtaccaggtc aaaaagggta cctagcataa 540 taataaaaaa caaatcagca cataccattg atcagataaa cttgtttgta gcacactatt 600 taaataagga taaggcaatt cagaccaaaa aagttcacaa gaaaaaggat gaagtgattg 660 ttaattgttt gaatgaagaa agtacaagta aagcttttag tatattaaag aaaaaactag 720 ctgatgaatg tgaagtgaac aaagaaaaag tagaaaatcc aaaagtaaaa gtagttggaa 780 ttaataattt tgaaaaccta gatgataaaa aattggaaga agatataaac gaaagaaact 840 ttagtaaatt ttcaaaaaaa tgtatagttc tacatacata taagaacaac aagactaact 900 tgcaaacagc gttaattgaa atgccagcag aattgtatca aagtgtcaga gaaagcaata 960 ataaactatt tgtaggatat caaaactgca gagtttatga ttattgcaat attaaaccat 1020 gcttcaactg tggcagattt ggacataatg gtctgaaatg caataatagg cctatctgtt 1080 ttaagtgtac gggcaaccat aaaactagag agtgtgaaag acaaggaaac ataatgaaat 1140 gtccgaattg tatatacagt aatagtaaat acaagacaga atacaagaca gaacatatgg 1200 caacagacag ccacgaatgc aaaattttgc aagccaagat aagaaagttt attgacttta 1260 ccgactatac aatgcaacct atattaccaa gatatgtcgg catggcaggg gccacgacag 1320 ttaaagcagc gggatggaga aatatcggga tgtcaacagc ctcgattaac acactcaacg 1380 ataatatacc aaataaagga agaaacaaaa attaaccatg gatcaggata ttaacattga 1440 agaagtaggc aaggcagagt gtgaagtaga aaacatcaga gaactcaaca ggaagatcat 1500 gaagatcgtg aagaagaaaa atttaatttt gtgtgtgaat ataagaagtc tcaatgcaaa 1560 ttttgaaaaa ctagaaacat ttgtagaaag tatgtcaaaa aagccatgtg tktataatat 1620 gtacgataat ctgtacggag acatggatcg tgcagcaccc tcatttctat caattacatg 1680 ggtataaatt atgctacaac gagagtaaaa ttaatagggc agatggtgtc cttgtgtata 1740 ttaaaaataa tatttatgaa aaaactaaag tagaagtaat tgataaattg agtgtggtta 1800 gctcagatct taagcttcaa tcaggtgaat cattaagagt ttcggctatg tacaggtgtc 1860 atgatgttcc caaaacagaa tttcttaata gtgtaaaaaa atttctatca aaacacatta 1920 gaaccaaaaa ccactgtata ataggagatt tcaatataga tataatagaa agtaatgtac 1980 aaaataataa tataggtgca atatcaataa gtcaggaatt cttaaataat ttctttgaac 2040 aagaattcat tccgtactat agaggtgtaa caagaccctc tctagataac gaatcgggtt 2100 cttgtattga taattgtttt gcaaagacta gagacgttaa tattgaatct tataaactat 2160 gtactccgtt taacgaccac tatccgcttg taataagtat cgatcagctt aaagtgcaga 2220 atgagcaaga aaagatagaa aattgtatta attatagaaa gttaaaggaa tatgcaaaga 2280 aaatagaatg gaattcatgt ttgcatttgc aggatccaaa tcttgctttg gatgaactaa 2340 taaaaatgat acatagctgt gttggaaaag ccacttgtaa caaaaaaact aagaataaac 2400 aattacctag gaaaaaatgg ataactaaag caatattagt atcgtgtaaa aataaagaaa 2460 atctatataa tatatggaaa aaaatcctac aaatataact ctaaagcatg aatataaaaa 2520 ttatgaaaaa attttgaaat aaaacaataa aagatgctaa atacaaattc gaaaacaatg 2580 agataagaag aagtggcgat aatccaagac agctatggag catcataaac agtaaaatgg 2640 gcaaaagcaa caaacaaatg aaatcgcctg actgtatagt ggtaaataac caaaaaatag 2700 taaacaagac aaatatagcg aatgaaatga atgattattt tagtcaagtt ggaatagact 2760 taagtaacca aataaataat aagtgcgatg atcaaataag actgccagca agaaatagcg 2820 aatccatatt tatacaacca actaatgttt gtgaaataca tagagtaata agcgaaatga 2880 agaaaaaagc tggtggggta gacaacatca atgcaagtac aatcaagtgc attgctatgt 2940 ttctggcaaa acccttggaa catattttta acataagtat acagcaatcg atttggccga 3000 atgcgctaaa acgagctgat atagtaccaa ttcacaagtc tggagatagt acatgtattg 3060 caaactatcg cccaatatca ctgatttcta atatagctaa aatatttgaa aaaattatat 3120 acaacagact tttcaatttt ataatgaagc ataatattat atcagagaga cagtttggtt 3180 tcgtcagagg cagaggagca acagatgcgc ttgattgtct tgcggatatt atatacaaaa 3240 gtctagacaa aagtaaacca ataataacga catttctaga tctagccaaa gcttttgata 3300 ctgttgatca cagaatatta ttggataagt tagaaagata tggaataaga ggatgtgcct 3360 taaaactctt aactagctat ctatcaggta ggaaacaatg cgttaagata aataatttca 3420 aaagtgaata caaagatatt tccataggtg ttccgcaagg aaccatttta ggtcctttat 3480 tttttatcct atatgtaaat gacctactga tagatatgct aaaagactct ataatgtctt 3540 atgcagacga cacagtcatc atctcgagtg atgactcgtg gtcggcagca caggacaaga 3600 tgaattttta tcttagcaag gttgccaaat ggctgatgtt aaataaacta tcgttaaatg 3660 taaataaaac tgtatacata acctatggca actactgtga tagtgttcca gaggatataa 3720 atattaaaat tggaggcaac tttataaata gggttgaaag ccataaatat ttagggttac 3780 atttcgatta caatatgaaa tggaatatac acattgactc cattgtaaaa agaacaagat 3840 atttaatttt tgtctttgca aaaattaaaa gtataatgga tagaaaaacc ttaatgatga 3900 tctaccatgc attcttccag ggcctagtca attatggaat aattgcctgg ggtgggcata 3960 taacaattgt atgtactcga tacaaggagt acaaaagaag ttattaaaaa ttattaacag 4020 aaatcatttt gaaacacaaa attccccact aacagttagg caattgttcc aaattgaagc 4080 aataacatat cactatggga aactaaggga tatttatagt aaaaatacga aaactactag 4140 gaaaaagagt atatcattgc caaaaattaa taaatctata agtaagaaaa gcagctacta 4200 catggctgtg aaaattttca atttactctc taatgatctc aaagagttaa cagttagaaa 4260 agtaactata aaaaataaat taaaggattt cataagaaat ctaatagtat gaaatgaatt 4320 agagacacta gtttaacttc tgttataagt acaagaaaga acagtaygaa aagattaatt 4380 ttagttttta gtttttaagc tatctgtata attattgtgc catacctatg tacaggcaac 4440 ttgtttgcct ctataggaag cctttaaaag gcttatgtat atggtcataa ataaataaa 4499 // ID SMAR29 repbase; DNA; INV; 1493 BP. XX AC . XX DT 22-JAN-2008 (Rel. 13.01, Created) DT 22-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR29. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1493 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(1), 17-17 (2008). XX DR [1] (Consensus) XX CC The youngest copies are ~8% divergent from consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(264..731,748..1272) FT /product="SMAR29_1p" FT /translation="FHSDEVRSIARKFKAEGKSYGEIAKLLNITKNAVVSL FT IKYENKVHKKKTGRKCIINNTEKTSVKRYINEQNKEGNKISCSSIIRNLNL FT NISRRTINNFLLRTEASYVKHSQKMILSQLHKNKRVEAIRLWIKSNIDWTK FT VVFTDEKRFSLDGPDSWFSCLMIFRSTWTYSKEKINRYRRHSGGIMIWGMV FT MANGLIAIKIVTGGFSAESYKNLMDTYALPIMRLNYQKVKMVQDNARPHVA FT KTTLKYFKEQNIDIIDWPARSPDLNLMETVWSMLSQDVYSLNQPMNKTDLE FT ERILRAVDRVNEERRHTIQHLFDSFVDRVISVLQKRKYV" XX SQ Sequence 1493 BP; 574 A; 176 C; 258 G; 483 T; 2 other; ctggtggaaa aaacaattca cacccctacg aaaaatacgt aatatcaatt tttaaaagaa 60 taaaactgaa atgaatacta tttgataaat tatgatctac taaaattgta aagttaacat 120 ttgttwaaat attattacca tataccattt atttaggcaa gtttttaaat tattaaattt 180 acagagatgg aaaaaactat tcacacagta aacaattttt agatgtttaa agatgtttta 240 caaaatataa ttttatttta tagtttcatt cggatgaagt tagatctatt gcacgaaaat 300 ttaaagctga aggtaaatct tatggtgaaa tagcaaagtt gttgaatatt acaaagaatg 360 cagttgtttc gttaattaaa tatgaaaata aggttcataa aaagaaaaca ggaagaaagt 420 gtattatcaa taatacagaa aagacgagtg ttaaaagata tattaatgag cagaacaaag 480 aaggaaataa aatatcatgt agcagcataa ttcgaaattt aaatctcaac ataagtagaa 540 gaaccattaa taacttttta ttaagaactg aagcaagcta tgtaaagcat tcacaaaaaa 600 tgattttaag ccaactacat aaaaataaaa gagttgaagc cattaggttg tggataaaaa 660 gcaacattga ctggacaaag gtggtcttta cggatgagaa aagattcagt ctcgatgggc 720 ctgatagctg gtaacatttt agattaattt agttgtttaa tgatttttag gagtacttgg 780 acatattcaa aagaaaaaat taatagatat cgtcggcata gtggaggtat tatgatttgg 840 ggcatggtga tggcaaatgg cctaattgca ataaaaattg taactggtgg attttcggcg 900 gaaagttata aaaacctgat ggatacatat gcgttaccaa ttatgcgatt aaattatcaa 960 aaggtcaaaa tggtccaaga taatgccagg ccacatgtcg caaaaacgac gttgaaatat 1020 tttaaggaac aaaacattga cataattgat tggccagcca gatctccaga tttgaattta 1080 atggagactg tttggagtat gctcagtcaa gatgtctatt ccttgaatca accaatgaat 1140 aagactgatt tggaagagag aatactcaga gcagtggatc gagttaatga agaacgaagg 1200 cacaccattc aacatctgtt cgacagtttt gtagatcgtg ttatatccgt tttgcaaaaa 1260 aggaaatatg tttaattatt aataattttt ctgtgtgaat tgttttttcc atctttgtam 1320 attcaattat tcgaaaactt tcctagataa atgttataag gtaatattta aacaaaagtt 1380 aactttacaa ttttagtaga tcataattta tcaaatagta tcattcattt tagttttatt 1440 cttttaaaaa ttgatattac gtatttttgt aggggtgttg ttttttccac cag 1493 // ID Helitron-2N1_DVir repbase; DNA; INV; 928 BP. XX AC . XX DT 31-MAR-2007 (Rel. 12.03, Created) DT 31-MAR-2007 (Rel. 12.03, Last updated, Version 1) XX DE Non-autonomous family of Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Helitron-2_DVir; Helitron-2N1_DVir. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-928 RA Kapitonov V.V. and Jurka J.; RT "Helitrons in fruit flies."; RL Repbase Reports 7(3), 131-131 (2007). XX DR [1] (Consensus) XX CC This is a consensus sequence of a family of non-autonomous CC Helitron transposons transposed in the Drosophila virilis genome CC a few million years ago (numerous copies are less than 5% CC divergent from the consensus sequence). Helitron-2N1_DVir is a CC deletion derivative of the autonomous Helitron-2_DVir. These CC transposons are usually inserted in the TTT|TTT target sites CC without the target site duplications (the insertion site is CC marked by "|"). Different families of Helitrons constitute ~5% of CC the D. virilis genome. XX SQ Sequence 928 BP; 286 A; 179 C; 157 G; 306 T; 0 other; ttataccctt gcagagggta ttataatttt gtcgtgaaat gtgtaacgca tagaaggaga 60 catctccgac cccataaagt atatatattc ttaatcagca tcaacagccg agtcgatata 120 gccatgtccg tctgtccgtc tgtctgtttc tatgcgaact agtccctcag ttttaaagct 180 atcttaatga aactttgcag aactccctct ttctgttgca cgcagcacat atgtgaaaac 240 cagctggatc ggaccactat atcatatagc tgccatagga acgatcggtc gaaaattaag 300 tttttgtatg aaaaaacatt ttgtttatca agatatcttg accaaactcg gcatttatta 360 gttttacttt actcctcata tatatgcaaa atcctattaa gatcggacca ctatatcata 420 tagctgccat aggaacgatc ggtcgaaaat taagtttttg tatgaaaaaa cattttgttt 480 ttcaagatat cttgaccaaa ctcggcattt attagtttta ctttactcct catatatatg 540 caaaatccta ttaagatcgg accactatat catatagctg ccataggaac gatcggtcga 600 aaattaagtt tttgtatgaa aaaacatttt gtttttcaag atatcttgac caaactcggc 660 atttattagt tttactatgc tcctcatata tatgcaaaat cctattaaga tcggaccact 720 atatcatata gctgccatag gaacgatcgg tcgaaaatta agttgtatga aaaaacattt 780 tgtttatcaa gatatcttga ccaaactcgg catttactat tttcccggta cttcttagat 840 aggggcaaag cactatgagc attatgaaaa ggttgggtct gcaagggtat tagatctttg 900 gcgtgccgaa gatagccctt ctttctcg 928 // ID Kiri-23_AAe repbase; DNA; INV; 4456 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-23_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4456 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 718-718 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 255..1034 FT /product="Kiri-23_AAe_1p" FT /translation="MSEEDVNSMRLRSNSVSQQRNTSTPDMTVADLAKLVK FT SQFASFQRSTKDDIQKMGDKLSSQITQLRSELTSDIDKLREETNKTVSELV FT SSVEGTKTDVSHMVDRSTKRNDLIISGVPYTQGENLCSYVQAWCRSLGYPE FT NNIPLVDVRRLSKPGVTLGAAPIILLQFAFNTQRNEFYSRYLRSHNLSLSQ FT IGFSVNKRVYINENLTPIAREIRSKALQLKKNDKLQAVYSRDGIVFVKPMN FT GGAGKAISSIHDLELFTSR" FT CDS 1472..4297 FT /product="Kiri-23_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFLTMDNLNTNDTTTNWCTPGIVMKSAFYPGKLSICC FT LNGQSICARRLSKLDELRRIVAASNVDVICINETWLNDNIDDSVIMLDGYQ FT SIRCDRVGRLGGGVMMYIKNTIRYNVIETTSCDNVARTEFILVELILVNFK FT ILLGTVYNPPDADCSLVLDELLNKYGSFYDHIHFLGDFNTNMLLDSPKAMR FT LRELLVTHHLYNIGSEPTFFHGSGSSQLDLHISNSFSEVLRFNQIEAPVFS FT NHDVIFSSLNFDTAHMDNVISHRDYSRINSDRLKHDIESINWAYFYSINSP FT DTLTNLLNSHLECLLDRHAPLKNVPLRKMYNPWFNSYIGRAMVDRDLAFKQ FT WKVTRDVTHHNLFKQLRNKVNLLIRKAKEIFYGEQLNSGLSSTELWKRLKS FT FNMSNTTPESTNNFTADEINDCFYKNFSSSIDSTPCSTGSAWGSFHFSEIS FT DVDVINSLYEIKSNAVGLDNIPLKFIKMVSPLIINPIVHLFNAILRSSQYP FT TAWKKAKIIPIKKKASINSINNLRPISILCALSKVFEKIIKKQICGYINEH FT NMLSCYQSGYRAKHSTKTVMLKVLDDIGIILDQGKPVLLVLLDFSKAFDTV FT SHSIMCRKLRQIYGFSFDAVNLIKSYLENRFQTVLNNGVFSKFLHTASGVP FT QGSILGPILFSLYINDLPSILKHCYVHIFADDVQVYIEHSDASVRKINEDL FT DSIFKWATKNKLSLNTNKSHAMFIGTSNQRQMKPDIKLNNVSLKYVETAVT FT LGFNIKSNFEWDSVILSQCGKIYAKLRTLQLKAAFLKTEVKIKLFKSLILP FT YFIACDLWLMQSSAFAVQKMRVALNNCVRFVYGLGRMSHVSHLQHRLIGCS FT FDNFCKTRCCLFMFNLVNNKMPGYLFEKLSVFRGSRSKKFVIPRHRTAKYG FT GTFFVRGVAFWNSLPNDVTMIQPSGLVNFRRRIFEHFE" XX SQ Sequence 4456 BP; 1381 A; 818 C; 796 G; 1459 T; 2 other; agtgagtgac gttggtgggc tatgacgttt gtgtgaactg cagtgcatca cagtcaacaa 60 aaagtgctat tttcgttacg taaaatcacc aaccatcgat gaatcactag attgcaacaa 120 cttcaaagtt gtaccgttga actataccgt ctcaaccacg atttacggtt tatttttgct 180 ggattcgtta catctacagc cagtctggga tatcttcagc catctttttg tgtttatcgc 240 tcgttaacat caatatgtcc gaggaagacg tgaactctat gagactcaga agtaactctg 300 tctcacaaca acggaataca agcactcccg acatgactgt cgccgatcta gcgaaacttg 360 tcaaatcaca gtttgcgtcg ttccagagga gtacgaagga tgatatacag aagatgggtg 420 acaagctctc gtcacagatc actcaactgc gctctgagct cacttctgac atagacaaac 480 ttcgcgagga aactaacaaa actgtaagtg agcttgtttc atccgtcgaa ggaacaaaaa 540 ccgatgtttc tcacatggtg gatagatcta caaagcgaaa cgatcttatc atcagcggcg 600 tcccgtatac ccaaggagaa aatctctgca gctacgttca ggcttggtgt cgttctctgg 660 gttaccctga aaataatatc ccactggttg atgtacgtcg attgtcgaaa cccggagtga 720 cattgggagc ggcaccaatt attctgctcc aatttgcgtt taacacccag agaaatgagt 780 tctattctcg ctatttgcga tctcacaacc tctctttatc gcaaatcggt ttttcggtca 840 acaagagagt ttacataaat gagaacttaa caccaattgc cagagaaatc cgctctaagg 900 ccctgcagtt gaaaaagaat gataaactcc aggcagtcta ctcaagggac ggaatcgttt 960 tcgtgaaacc gatgaatgga ggagcaggga aagcgatttc atctatacat gatctggagc 1020 tgttcacatc acgctgatta accctatcca atgctattcg ttttcctgtc actcctttcc 1080 ttgcaaatcc atgtatcctt ttccttttta tcagattatg ttatacctct ccgaaaaagc 1140 ttttcttttt cctaccaatc cgatcccaac gttattcctt gtatccttcc tgaaagttac 1200 ttcatcaaca gctgttgcta ctggatggtg cactacatgg agacaaccgt gcatggatga 1260 cgatgacttg caagcagata tcatgtgtga ttattatgct atgttgatca gtgttgctag 1320 tttgttggtg gtttttttat ttgatttgct aaagagaata ctttttgttc atgtgatttt 1380 ttgtgaataa tgtatgtttt tgtgaatatt gtttgttttt cttttatcag tatagtgctc 1440 aaaatgtttt tccttcttca gtttgctgtt catgtttttg acaatggata atctcaacac 1500 taatgacact accacgaatt ggtgtacccc aggaatagta atgaaatccg cgttttatcc 1560 tggaaaactt tcaatttgct gtttaaacgg ccagagtatt tgtgcccgta ggttatcaaa 1620 attggatgaa ctgcgtagaa tagttgctgc atctaatgtt gatgttatat gcatcaacga 1680 aacttggctc aatgataaca ttgatgacag tgttataatg cttgatggat atcagtcaat 1740 aagatgcgat agagttggtc gtttaggagg aggtgtgatg atgtatataa aaaatactat 1800 tagatataat gtgatcgaaa caactagctg tgataatgtt gcccgtactg agtttattct 1860 agtggaactt attctagtca attttaaaat acttttagga actgtatata atccaccaga 1920 cgctgattgt tctctagtat tagatgaatt gctaaataaa tatggcagtt tttatgacca 1980 tattcatttc ctaggtgatt tcaatacaaa tatgctactg gattcaccta aagctatgag 2040 actacgtgaa ttgcttgtaa cacaccatct ttacaacatt ggtagtgaac ctactttttt 2100 tcatggaagt ggctcttcgc agttagacct ccatatatca aacagttttt cagaagtttt 2160 gcgattcaat caaattgaag cacctgtttt ctcgaatcat gatgttattt tcagttcgtt 2220 gaactttgac acagcgcata tggataatgt aatatcccat cgtgattaca gtagaattaa 2280 ttctgatcgt ttaaaacacg atattgaatc cataaattgg gcttattttt attcaatcaa 2340 tagtcctgat acacttacta atttgcttaa cagccattta gaatgtcttt tagatcgcca 2400 tgcaccattg aaaaacgttc cactgcggaa aatgtataat ccgtggttca atagttatat 2460 aggcagagcg atggttgatc gtgatttggc ttttaagcaa tggaaggtta cacgcgatgt 2520 cacgcatcat aacttgttca agcaactgcg taataaagta aatcttctaa ttcgtaaagc 2580 caaagaaatt ttctatggag aacaattaaa ttctgggtta tcttcaactg agctatggaa 2640 gagacttaaa tcatttaata tgtccaatac aacacctgaa agtactaata attttactgc 2700 tgatgagatt aacgattgct tttacaaaaa tttttcaagt tcaattgatt ctaccccatg 2760 tagtactggt tcagcgtggg gttcttttca ttttagtgaa atcagtgatg ttgatgttat 2820 caactcttta tacgaaataa aatcgaacgc tgttggactc gacaatatac cccttaaatt 2880 tataaaaatg gtttcacctc tgataattaa cccaattgtt cacttgttta atgcaatctt 2940 aagatctagt caatacccca cggcatggaa aaaagctaaa ataattccta taaaaaagaa 3000 agcttcaata aactcaatta acaacttacg ccccattagc atactttgcg ccctatcaaa 3060 ggttttcgaa aaaattatta aaaaacagat ttgtggttat attaacgaac acaatatgct 3120 ttcatgttat caatctggat atcgggcaaa gcacagcaca aaaacagtga tgcttaaggt 3180 actggatgat attggtataa ttttagatca aggaaaaccg gtgctgctag tcctattgga 3240 tttttctaaa gccttcgaca ccgtttccca tagcataatg tgtagaaaac tacgacaaat 3300 atatggattt tcttttgatg cagtgaatct tataaaatca tatttagaga atagatttca 3360 aacagttctc aataatggag ttttttctaa attcctacat actgcctcag gggtacctca 3420 ggggtctatw ttgggtccaa tattattttc tctatatatt aacgatcttc catccatctt 3480 gaaacactgc tatgttcaca tttttgcgga tgatgtacaa gtatacatag aacactccga 3540 tgcttccgtt cgcaaaatca atgaagacct agattccata ttcaaatggg ctacaaaaaa 3600 taaactttcg ttgaatacta ataaatcaca tgctatgttt ataggtacca gtaatcaaag 3660 acaaatgaaa cctgatataa aactcaacaa tgtatctcta aaatatgtgg aaactgctgt 3720 gactcttggt ttcaatatta aatctaattt tgaatgggat tctgttatct tgagtcaatg 3780 tggaaaaata tatgctaaat taagaacgct ccaattaaaa gcggcattcc ttaaaacgga 3840 agttaaaata aaacttttca aaagcttaat tttaccttat tttattgcat gcgatctttg 3900 gctcatgcaa tcatctgcat tcgctgtaca gaaaatgcgt gttgcgctca ataactgtgt 3960 acgttttgtg tacggtctag gaagaatgtc tcatgtatct catttacagc atcgtttaat 4020 aggctgctcg tttgataatt tttgtaaaac gagatgctgc ttatttatgt ttaaccttgt 4080 waacaataaa atgcctggtt atttatttga gaaattgtct gttttccgag gaagtcgatc 4140 caagaaattc gttattccac gtcatcgcac tgctaaatat ggtggtacat ttttcgtgag 4200 aggtgtcgct ttctggaatt ccctacccaa tgatgtaacc atgattcaac ctagtggatt 4260 agtaaacttt agaaggagaa tttttgagca cttcgaataa gataccagta atttcattat 4320 ttgtaaatct aaatcaaaat tattcagtaa actcaatgtc aaatccacca tttacctaga 4380 atgtaacatt ataagatgtg aatcttgagt tacgtatttg aataaataat aataataata 4440 ataataataa taataa 4456 // ID BEL-631_AA-LTR repbase; DNA; INV; 681 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-631_AA_; KW Pao_Bel_Ele198; BEL-631_AA-I; BEL-631_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-681 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 681 BP; 229 A; 114 C; 135 G; 201 T; 2 other; tgttgcgacc cctcggtcga cgcggctcag cagaacgatc ttcctcaaca tcatcatatc 60 gattgaatta gcaacccgac tcagttagaa cgatgacgga tgatgaaaag cgaacgtcaa 120 aagcgatatg cgaagaatga tgatttgcaa aatagaaaag tcggtagaaa gtgtacattc 180 tcatgcttaa cgttggatag tcagaaattt tcttataatt attcttaaac tctaaatgtg 240 gtcctaaaat taattcttgg aaggtaatta tttgctgaac tatatatagg atgtaatttc 300 ttaaggatta ttattgttgt atagcttaaa acctatgaac tcgcaaagct aacctatcac 360 agatctcgga tatcttagaa cgaattacca tagtggtccg aagtaacgta agtcttgcta 420 ttgatatccg acacattaca tttattgaaa cttataawtw attaggtgaa gctggtaaag 480 ctagagtagt ttgagcggag taggagaaga aagcccatga agaacgctaa acgaactaaa 540 cgtaagttga gtataaaatt tcattgcgct gaactaatta atggaatttc aggaattttt 600 taatgctcgc taataaacat taagaaattg gaaattcgtg ctttcttctg ctatccagtt 660 ccgttggccg gagagccaac a 681 // ID Gypsy-20_RP-I repbase; DNA; INV; 2975 BP. XX AC ACPB02043182; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_RP_; KW Gypsy-20_RP-LTR; Gypsy-20_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-2975 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02043182; Positions 4821 1847. XX CC Positions [2055-2537] - Integrase core CC 'CATCC' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1053..2951 FT /product="Gypsy-20_RP-I_1p" FT /translation="MGMVNYIMKFVPRLAELTAPLRTLLQKGVAWTWGPTQ FT YQGWGNIKKAISSVPVLAGYETGLPIRVAADASTKGLGAVLEQQHIEGWKP FT VMFASRTLSETEGRYAQIEKEALAITWACEKFQDYLLGTVFELLTDHKPLV FT ELMARKPIAELSARLQRFRMRLLCFDFKVTYIPGKHFYTPDILSRDPMEQV FT GLYEDVLEEKSFVLSIIQALPCSDMKLQQIKEAQEKDMECKEIANCLKKQQ FT WQPNSCFWKYRDEFNVQNGLLLKGSRMFIPQEMRKEMLMKLHQGHLGIVKC FT RSRAKESVWWPGISEQIKSVVANCPTCREHRNNRSEPLMLSKLPDAPWEAF FT SADLLKFQGKWYLVIQDYYSRFLELVHVNKLTSEAIINRLKNIFARHGIPV FT SLRTDGGTQFTADMFKRFQEEYGFRHVISSPHFAQSNGSAEKAVDIAKRIL FT KKNKDPNLALLEYRTTPLEQGLSPAELLYGRKLRSTLPSIKNFARVSRNQM FT IKFKEKDAQLKERNKQNFDRRNGVKVLPQLQNNSSVWIPDMKKYGQVARPS FT AEPRSYIVKTPSGELRRNRKHIIVVAEDEPDENAEAKEKQAKGERVEEEET FT KQTQEEGKRKGVEEQLNHRRDTQTRISRTRHNPPT" XX SQ Sequence 2975 BP; 1027 A; 504 C; 723 G; 721 T; 0 other; tggtgtcagg tgtgtaatat atatataggt atatgttttc ttttgtgtcc gaacaagttt 60 aattgatcta tgaccaggga tagtgggatc gaagtgacgc ctaggaaaga cgcagatagg 120 aacgagaaca aagaaaggac aactatagaa gatttgtatg tagtaaaaga tcagccgatt 180 ccgttattaa gcaggcaagc attagtacaa ttagatttgg ttaaaattaa catagataat 240 gtacaacagt taccagcaat agcagctgaa tttccagaag tattttcgaa gagtggaaaa 300 atgaaagtag acggatataa aataaattta aaagaagggg cgctcccata tgcagtactg 360 actccgcgcc gtattgcctt tccttataga gaaaaagtta aagaggaact cgactggatg 420 caggcagaag gtataataga accaatagtg gagcccacag attggtgcgc acctatagta 480 gtagtacctc ggaaagagat gggtagagtc agactgtgtg tagactacac ggaactaaac 540 agaaacataa acagagagag atggcagctt ccagcagtag atgagtcctt agctcagctg 600 gagggcggtc agtatttttc cttgttggat gcctccaatg gtttttggca aattccatta 660 catgaagaaa gttggaaatt gactaccttc attactccgt ttgggaggtt tgtgttcaaa 720 aggattccgt ttggtatttg ctcagggcca gaagtctatc aaaaaagggt ttcccagatc 780 atcagtggac tggagggagt ggtgaataag gcagatgatt ttttggtagt cggtaaacta 840 gagcacaaca tgatcaaaga ttaagaaaat tattacaaag attacgtgag tataggattt 900 cgttaaactt aaataagtgt aaatttgcag tacaggaagt gactttccta gggcatttaa 960 ttaaaaaggg agagatttgc ccagacccgg aaaaaataaa ggcaatcacg caattaaatc 1020 caccaaaaga cgttaaagag cttagacatt tcatgggaat ggtgaattat atcatgaagt 1080 tcgttccaag gctagctgag ttgacagctc ccttgagaac cttgctacaa aaaggagtgg 1140 cctggacatg gggtccaacg caataccagg gctggggtaa tattaaaaag gcgatctctt 1200 cagtaccagt actggcaggt tatgagacag gacttccgat tagagtagca gcagatgcat 1260 ctacgaaagg tctcggagca gtgctggagc aacagcacat cgagggttgg aagccagtaa 1320 tgtttgcgtc tagaacgtta tcagaaacag aaggacgata tgcccaaatt gaaaaagaag 1380 cgctggcgat cacctgggcc tgcgaaaaat ttcaagacta tcttttagga accgtgttcg 1440 aattactaac cgatcataag ccattagtgg aattgatggc cagaaaaccc attgctgagc 1500 tgtctgccag attacagcgg ttcaggatga gactactctg ttttgacttt aaggttacat 1560 acatcccagg taaacatttt tatacacctg atattctatc tagagatccg atggaacaag 1620 ttgggttata tgaagatgtg ttagaagaaa aatcttttgt acttagcatt attcaggcgc 1680 taccttgctc agatatgaag ttacaacaga taaaggaggc tcaggagaag gatatggagt 1740 gcaaagagat agcgaactgc ttgaagaaac agcagtggca gcctaattct tgtttttgga 1800 agtatagaga tgagtttaat gtacagaatg gactcctgct taaaggaagc agaatgttta 1860 taccgcagga aatgagaaag gaaatgttaa tgaaactaca ccaagggcac ttgggcattg 1920 tcaagtgtag aagtagagcc aaggagtcag tttggtggcc gggtatctca gaacaaataa 1980 agtctgtagt ggcaaattgt ccaacatgta gagaacatag aaataatagg tcggaaccct 2040 taatgttgtc caaattacct gatgctccgt gggaagcgtt cagtgcggac ctgttgaaat 2100 ttcaaggaaa atggtacctc gttatacaag attattactc aagatttctg gaattagtcc 2160 atgtaaataa actaacttcg gaggctatca tcaatagact aaaaaacatt ttcgcgcgcc 2220 atggaatccc cgtatcatta aggacagacg gtggaacaca attcacagct gacatgttca 2280 agaggtttca ggaggaatac ggatttcgtc acgtaatttc gagtccccat tttgcacaat 2340 ctaatggttc tgcggagaag gcagtggata tagccaagcg catattaaaa aaaaataaag 2400 atcctaacct ggcattgcta gaatatcgga ccacacctct agaacagggc ctaagtccgg 2460 cggaattatt gtacggaaga aagctaagat cgacgctacc ttcaattaaa aactttgcgc 2520 gggttagtcg taaccagatg ataaagttta aagaaaagga tgctcagctg aaagaaagaa 2580 ataagcaaaa ctttgacaga aggaatggag taaaagttct ccctcagttg cagaacaaca 2640 gttcagtttg gattcctgat atgaaaaagt atggccaggt cgcaagacca tctgcggaac 2700 caaggtctta cattgtaaaa accccatcag gagaattgag aagaaacaga aagcatatta 2760 tcgttgtagc agaagatgaa ccggatgaaa atgcagaagc aaaagaaaag caagcgaagg 2820 gagaaagagt tgaagaagaa gaaactaaac agactcagga agaagggaaa aggaaaggag 2880 tggaagaaca attaaaccac cgaagagata ctcagactcg gatttctagg actaggcaca 2940 atccgccaac atagggttta ttaaaacggg gagga 2975 // ID LINER1-3_NVi repbase; DNA; INV; 5049 BP. XX AC . XX DT 10-APR-2009 (Rel. 14.04, Created) DT 10-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Non-LTR retrotransposon. XX KW I; Non-LTR Retrotransposon; Transposable Element; R1; LINER1; KW LINER1-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5049 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 788-788 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 485..1870 FT /product="LINER1-3_NVi_1p" FT /translation="MCWGHVSCPHRRGDPGGRTHAGMKKNAYNMGTTNELG FT ENSQENLEVETLRVPTTQDGGYPIHKSSSSTSIGSAMSAAGYRYESGTVGE FT ESQTTREWLQAHASLRDKIREVRDMTREMTKTLDKQKGVNSKIKDGLPLIG FT SMLIEAVTLLDKMEDVVKRQPLQQPRADNAKSQKTAKSKKRVRASSDGGGG FT TPAKKDKKDVSAPSTSRDSEGNGQTSHGQTQWEVAAGRKAKKKQKDKPEQT FT QPKKREEKVPRQRTDAILIKPESGKTYADILGQMKAHVKPEELNTEVKFVR FT KTRQGGVLVGVGKSNDKMKSLQKAIQDAIGQVGTIENKISKITLEIRDIDG FT LTTKEEVNSAIAAVTDCGEEDVKIHLFEPNTREQRMAVELDQTKAAALLKK FT GKIRIGWVNCRVRVRASVTRCYRCLGCGLXMHRXXXXXVFFAKTPRRLRTG FT RCTLQAPERAQSSGRX*" FT CDS 1893..4568 FT /product="LINER1-3_NVi_2p" FT /translation="MARILQGNLHRSRLAHDLLTQHVREQKVDVLLISEQY FT KNLDPPFWVSNDEKTAAIWVPGRGLANTGTGEDYVRGRMGRITFFSVYLSP FT NLTAADFTRKLGALEDAIREVPGEIVIGGDFNARATEWGMPTTNPRGRAIL FT EMAARLTLIVANEGNTTTYRRTGFGESIPDVTFASEMTIRNIRDWHVTEEY FT TASDHQYILFNINEDRQTSTRRNHSRAPRWNTRRMNRETLAELIQSANLPS FT NGIPHELGGRERAETIVERTTNLITEICDATMPRVSTSHQRRPVYWWTDDI FT AELRKENFHARRRAQRAWKQKRPEARDLSKRQDESRKILRDAIKESKRRCW FT TKIIDDVEKDPFGDGYAIVTRKMGEMKRNQSXEPXEAEKMEXVIHXLXSHT FT THQTPEXRHXSNGXREEADFXGQXHRVSVVPQPELLLEMYNQXLVQGVFSS FT RWKRARLVLIEKPKKEADTDTSYRPLSLLDTMGKTGEALVKPRILRAVREA FT GDLSDKQYGFRKGRSTIGAVREVTDTVKKVEEVRHATRDIVILVTLDVKNA FT FNSARWTDILAALESFGMPRYIMRLTEDYLRDRVIEYDTKEGXRRRPITAG FT VAQGSILGPDFWGILYDGLLKLEMPGDVMLVGYADDVAAVITARNVGIAQA FT KLNAIMSRVLWWMANHGLTLALDKTEIVLLTGKRIPTIIPMKVGGETITTK FT PSAKYLGITLDTKLNYGEHLDRVCKKATTRIAQLSRLMANVRGPRPTVRRL FT LMATTNSILLYGAEVWADAMTMNKYRKKIMAVQRRGALRIACSYRTVAAEA FT SMVIAGVIPVDLLAIERKRIYEARLASSSISIAVEEREITMRNWQTRWTDY FT PKARWTRTLIKDVGPWVDRKWGEVNFYLTQFFLRSRIL*" XX SQ Sequence 5049 BP; 1517 A; 1152 C; 1457 G; 896 T; 27 other; aataataata ataataatcc acgcggggaa gcctgacacc ccgtatgtgg cgcgtccccc 60 gggtggcgga tgggggagtg ctcgccgagg cacacacctc ggagggtaga gcagaaataa 120 aatatgctcc gtggaccaaa tcaggccgcc gcgtttgccg tgcggtgcga cccgtaagct 180 gccaaaagga gatcctggat ggagaggcga tctaacggtc tctctaagac ctacggttgc 240 gtgccgaagt gacccgaacc atctttggct cccgctgaca gcaagacggg aggccggtta 300 cctagctagt atcggtcaac cggctcgagc gagtgcgtga tattgtgtga catccgccgg 360 aaactctcca gagagccgag cacctttgtt cggagtggga ggctcgagag agtggaggtg 420 ggctagtgcg atagcgcgtg ctataccctg acatgatgta cccagtctgg caccgcctac 480 agtcatgtgt tggggccatg tcagctgccc ccatagacgg ggggatccag ggggtcggac 540 acatgccggg atgaagaaaa atgcttataa tatgggaacc acaaacgaac tcggggaaaa 600 ctcacaggag aacctggaag ttgaaacctt aagggtgcca acaacgcaag acggtggata 660 tccaatccac aaatcaagca gcagtaccag tataggtagc gccatgagtg ctgcaggata 720 ccgatacgag agtggaacgg taggtgaaga gagccagacc acaagggaat ggctacaagc 780 ccatgccagc ttgcgtgaca agataaggga ggtgagagat atgaccaggg aaatgaccaa 840 aaccctggat aagcagaagg gcgtgaactc caagattaaa gatggattac ccttaatagg 900 atcaatgttg atcgaagctg tcacgcttct tgacaagatg gargatgttg tgaagaggca 960 gccccttcaa caacctagag cggacaatgc aaagtcccaa aaaacggcaa agtcgaagaa 1020 gagggtacga gcatcatccg acggaggcgg cggtaccccc gccaaaaaag acaagaagga 1080 tgtaagtgct ccctcgacaa gtagggacag cgaggggaac ggacagactt ctcatggtca 1140 gacacaatgg gaagtagctg ctggcaggaa ggcaaaaaag aaacagaagg acaaaccgga 1200 gcagacccaa ccaaagaaga gagaagagaa ggttccacgt cagaggacgg acgccattct 1260 gataaagccg gagagtggca aaacctatgc tgacattttg ggtcagatga aggcacatgt 1320 aaagccggaa gaactgaaca cggaggtcaa gttcgtccgt aaaacaagac aaggtggcgt 1380 tctagtgggg gtaggtaaga gcaacgacaa aatgaagtca ttgcaaaaag ctatccaaga 1440 cgcaatcgga caagtaggca cgattgagaa caagatctcc aagataacac tagagatcag 1500 ggacatcgat ggacttacga caaaggaaga ggtcaacagt gctatcgcgg ccgtgacgga 1560 ctgtggcgaa gaggatgtta aaatccatct gttcgagcca aacacgagag agcagaggat 1620 ggcagttgaa ttggaccaga ccaaggccgc cgccctgctc aaaaaaggaa agattcgcat 1680 cggttgggta aactgccgag tgcgggtgcg agcgagcgtc acgcgatgct accgttgcct 1740 tggctgtgga ttggawatgc accgtcycra cmsawmargt gttttctttg caaagacgcc 1800 aagacgcctg aggacaggac gatgcacact ccaggcaccg gagcgtgcgc agtcttccgg 1860 acggcyctaa acgcaagcaa gacccaaaga taatggcgag gatactccaa ggaaacctgc 1920 acagaagtag gctcgcacat gatctgctaa cccaacacgt tagggagcag aaagtggacg 1980 tcctactgat cagcgaacaa tacaagaacc tagacccacc cttttgggtg tcaaacgacg 2040 aaaaaaccgc cgccatttgg gtgcccgggc gtgggttggc gaatacgggt acgggagaag 2100 actacgtaag gggaaggatg ggtcgcataa cattcttcag cgtatatctg tcaccgaacc 2160 taactgccgc cgactttaca aggaaactag gggcattgga ggacgctatt agagaggtcc 2220 ccggggaaat cgtgatcggg ggcgacttca acgcgagggc cacggaatgg ggcatgccga 2280 cgacgaaccc aagaggcaga gccatcctcg agatggcagc caggctgacc ttgatagtag 2340 cgaacgaggg taataccact acctaccgta gaacaggctt tggggaatcc ataccggatg 2400 taaccttcgc aagcgaaatg acgatccgca acataagaga ctggcacgtc acggaagaat 2460 atacagctag cgaccaccag tatattctat tcaatatcaa cgaagaccgt caaaccagta 2520 caaggagaaa ccatagtcga gcgccaagat ggaacacccg acgcatgaac cgagagacac 2580 tcgcggagtt aattcaaagt gcgaacctcc catccaacgg gatcccacac gaattaggcg 2640 gacgcgaaag agcagagaca atagtagaga gaactactaa tctgattact gagatctgcg 2700 atgcgaccat gcccagggtg tctaccagcc accagagaag accggtgtac tggtggacag 2760 acgacattgc ggagcttagg aaagaaaatt tccatgctag aagaagggct caaagagcct 2820 ggaaacagaa aaggcctgaa gcccgggacc tttccaaaag acaggacgaa tcgcgaaaaa 2880 ttctcagaga cgcgataaaa gagagtaagc gccgctgctg gacaaagatc atcgacgacg 2940 ttgagaaaga cccattcggc gatggatacg cgatcgttac acgaaagatg ggggaaatga 3000 aacgaaacca aagcamggaa cctatrgaag ctgaaaaaat ggaagamgtr atccatrggc 3060 tstyatcaca cacaacacat cagacgccag aaaracgcca caraagtaac ggcgkacgag 3120 aagaagctga cttcaycggg caaaytcatc gcgtmagcgt ggtccctcar cctgagctgc 3180 tcttggagat gtacaaccag tscctggtgc agggagtttt tagctcgcgg tggaaaagag 3240 cgagacttgt actgattgaa aaaccgaaaa aagaagcgga tacggacact tcctataggc 3300 cactgagtct tttagacaca atgggaaaga cgggggaagc actggttaaa ccgcgaatac 3360 taagggccgt acgcgaagcg ggagaccttt ccgataaaca gtatggcttt agaaaagggc 3420 gatccaccat aggggccgtt agagaggtaa cagacacggt aaagaaagta gaagaagtta 3480 gacacgcgac tagagacatt gtgattctag taacgttaga cgtaaaaaac gcgttcaact 3540 cggcaagatg gaccgacata cttgctgcct tagaatcatt cggtatgccg cgatacatta 3600 tgcgtcttac ggaagattat ctgagggacc gggttatcga gtatgacacg aaagaagggc 3660 rcagaagaag acctataact gcaggagtag cgcagggttc gattttgggt ccagacttct 3720 ggggtatctt gtacgacggg cttctaaaac tcgaaatgcc gggggacgtg atgctcgttg 3780 gatacgcaga cgatgtggcc gcggttataa cagcacggaa tgtgggcatt gcgcaagcga 3840 agttaaacgc gatcatgagc agagtgctat ggtggatggc gaaccacgga ttgactttag 3900 cgcttgataa gaccgaaatt gtcctattga caggaaagcg gataccgacc atcataccta 3960 tgaaggtagg cggcgagact ataacgacga aaccatcagc caagtatcta ggaataactc 4020 tagacacaaa gctgaactac ggggaacacc tagatcgcgt ctgtaaaaaa gccacgacta 4080 ggatagcgca acttagccga ctcatggcta acgtccgtgg gcctaggcca acagttaggc 4140 ggctactcat ggcgaccact aactctatcc tgttatatgg agcagaggtg tgggccgacg 4200 ctatgactat gaataagtat cggaaaaaga ttatggcggt acagcgaagg ggggccctta 4260 gaatagcatg ctcctaccgg acggtcgcgg ccgaggcatc gatggtaatc gcgggggtaa 4320 tccccgtgga tcttcttgcg atcgaacgca agcggatcta cgaggctcgc ttagcgtcga 4380 gtagcatctc gattgccgtg gaagaaagag aaataacaat gcgcaattgg cagactaggt 4440 ggaccgatta tccaaaggct agatggacta ggacccttat aaaagatgta ggtccgtggg 4500 tggacaggaa atggggggag gtcaactttt accttaccca gttttttctc cggtcacgga 4560 tactttagga gctacctgtt cgcgatgaac agggtggcaa caccggggtg taagtactgc 4620 ggtgacgaac gggacgacgt gagacacacg ttttttgact gcccccactg ggctgaaaag 4680 cgtagagtac tggagctgac gataggggcg ttcacgcccg agactgtagt cgaaacgatg 4740 cttggcagca agcagaactg ggrcgagata acggcgtacg tggagacggt tctccgcgcg 4800 aaaaacaatg acggttgttt acaagactag ttgacggcta gcacagctta agacaggcga 4860 ccccccgaag gaatgcgaaa gcggttaccg ggggggtttt cgcggcccta tgagaggtgg 4920 agttttagtg agtatgcctg gcgttgtccc gcgtcaggag agtctcacac acggggagcc 4980 tcgcacggct cctgtcatgg tgcattaagc atttctccaa ctctccggat aaacaaaaaa 5040 aaaaaaaaa 5049 // ID EHINV1 repbase; DNA; INV; 512 BP. XX AC X61182; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE E.histolytica inverted repeat downstream of rRNA genes. XX KW EHINV1; Inverted repeat; Repetitive sequence. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-512 RA Bhattacharya S.; RT "EHINV1."; RL Direct Submission to Genbank (01-AUG-1991)S. Bhattacharya, RL Jawaharlal Nehru University, School of Environmental Sciences, RL New Delhi 110067, INDIA. XX RN [2] RP 1-512 RA Mittal V., Sehgal D., Bhattacharya A. and Bhattacharya S.; RT "A second short repeat sequence detected downstream of rRNA genes RT in the Entamoeba histolytica rDNA episome."; RL Mol. Biochem. Parasitol 54(1), 97-9100 (1992). XX DR GenBank; X61182; Positions 1 512. XX SQ Sequence 512 BP; 184 A; 43 C; 39 G; 246 T; 0 other; aagctttaga gtttcttaat caatactctc gggtattgaa tgaaacgatt tgctgagact 60 taaccgtaat ttctatttaa atttatataa aattactcat tttaaaaagg ttattctttc 120 tttttttatt tatttaatat aaatttattg atatttattg ttttttatca ttaaatattt 180 aaattaattg attgtctttc ttttttatca tttaaataaa taatatttaa tatctcaaaa 240 aatatcaact aaagtataaa tattcataag gtaaatgttg ttcataatta aattctaaac 300 atttattatt ttaatattct ttaatgattg attaattgat tgtctttctt tttttatttg 360 attgtctttc ttttttatca tttaaataaa taatatttaa tatctcaaaa aatatcaact 420 aaagtataaa tattcataag gtaaatgttg ttcataatta aattctaaac atttattatt 480 ttaatattct ttaatgattg attaattgat tg 512 // ID Mariner-14_SM repbase; DNA; INV; 1690 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-14_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1690 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1863-1863 (2009). XX DR [1] (Consensus) XX CC TSD : TA. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 221..1516 FT /product="Mariner-14_SM_1p" FT /translation="MAGTRKSYSVEYKINIVEESRGKCLTTFCNEKNLDLR FT MVRRWREEYDNLIQQRDAGNLKVRRVGAGRKPLYNELEDMIFEWILQQRAE FT AFVVRRVDVQKLAIQLAPQFNISLDKFKASQHWIDGFLTRYELSLRRSTTL FT FKLDDVEIIKRALSFKTFMDNLNFAQYQPSNIIAMDETSVFFGQAVQTTID FT RKGATSISIPSTGYESTRVTCILAIDCNGKKLTPVLITKGKREMINCISGI FT YVLETEKAWCTQDVIKKWINLMFPPVLRHNQKKLLVWDSASTHRAKTMKEY FT LSNQNIDQVMIPSGMTAYLQSLDIGINKPFKNYLREEINIYIETKRERNVK FT GNYIKPKLQDVVGWVKAAWSKITNECVFNSLRAGYLDRSFSFETSAIAKHE FT RFGPLILRELESQTRPADFSILNDQDNVLEEDDMVVLE" XX SQ Sequence 1690 BP; 623 A; 241 C; 299 G; 527 T; 0 other; ccgtgtttct ccgaaattaa gccctaacac aattttttaa aaaaaattat attgtagccc 60 taacaccgaa attaagccct agcaattttg ttttttgata tatattaaat aaaatttcag 120 atcgttatat attgttaaaa attaaaaaca tagttatatt aatattaata actatttgat 180 aaaattgaat aaatttttta aaaaaattaa tccatacaaa atggcaggaa ctagaaaaag 240 ctattcagtt gaatataaaa taaatattgt agaagaatca aggggaaaat gtttaacaac 300 attttgtaat gagaagaatc ttgatttgag gatggttcga cgatggagag aagaatatga 360 caatctcatt caacaaagag atgctggaaa tttaaaagtc cggcgtgttg gtgcaggaag 420 gaaaccgtta tataatgaat tggaagatat gattttcgaa tggattttac aacaacgagc 480 cgaagctttt gttgttcgaa gagtggatgt tcaaaaactt gcaattcagt tagcaccaca 540 gtttaatatc tctttagata agttcaaagc ttcacaacac tggattgatg gatttctaac 600 cagatacgaa ctgtctttaa gaaggtcaac gacattattt aagctggatg acgtggaaat 660 catcaaacga gcattatctt ttaaaacatt catggataat ttaaattttg cacaatatca 720 accatcaaat ataattgcta tggacgaaac atcggtattt tttggccagg ctgtacaaac 780 aacaattgat cgaaaagggg ctacgtcgat cagtattcca tcaaccgggt atgaaagtac 840 aagagttaca tgtattcttg ccattgattg taatggcaag aaattaactc ctgtgctgat 900 tacgaaagga aaaagggaaa tgatcaattg tatttctgga atttacgtct tagaaactga 960 aaaagcttgg tgtacccaag atgttattaa aaaatggatc aatttgatgt ttccaccagt 1020 tttacggcac aatcaaaaga aactactggt gtgggactct gccagtacac accgagctaa 1080 aacaatgaaa gagtatctct cgaatcagaa tattgatcaa gtcatgattc catccggcat 1140 gacagcatat ttacaaagtt tggatattgg cataaataag ccatttaaga attatcttcg 1200 ggaagaaatt aacatctata ttgaaaccaa aagagaacgt aatgtgaaag gtaattatat 1260 caaacctaaa ttacaggatg ttgttggttg ggtaaaagca gcttggagta aaatcacaaa 1320 tgaatgcgtt ttcaactctt tacgtgccgg ttatttggac agatcctttt cattcgagac 1380 ttcagcaatt gctaaacatg aacgttttgg tccgttgatt ttaagggagt tagaatcgca 1440 aacaagacct gctgattttt caattttaaa tgatcaagac aatgttttag aagaagatga 1500 tatggttgta ctagaataaa aacattttaa ttgattaaat ttttattttg aatatttgta 1560 taatttttaa aagtttattt ttattccaaa aaatagaaaa cctcccgaaa ttaagcccta 1620 agtgtatctt tgccaaaatt ttttaaaaaa aatatattta agcccggggc ttaatttcgg 1680 agaaacacgg 1690 // ID Copia-9_CQ-I repbase; DNA; INV; 4019 BP. XX AC AAWU01013178; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_CQ_; KW Copia-9_CQ-LTR; Copia-9_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4019 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 333-333 (2011). XX DR GenBank; AAWU01013178; Positions 17070 13052. XX CC Positions [1400-1918] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 437..4009 FT /product="Copia-9_CQ-I_1p" FT /translation="MHLQRQILSMRFDGGRLTDHFLRFDKLVREFRGTGAV FT MDDIDAVCHLLLTLGSAFATVVTSIETMAEENLSMEFVKCRLLDEETKQKG FT LGVDSLTAGNEAAAFSGMKMQKKIRCFSCKKEGHKSVDCTEKKKRKQQYNQ FT KQHGAKANVAGTSERNSVCFVGVSGGEPQKVQRRVKWFLDSGASDHLAREK FT ELFVKLHKLSKPIEIAVAKDGEFIVAEHAGTVKMFSVVGGRSVECTVKDVL FT FVPQLRYNLFSVVRIEKAGMRVVFDGGKALIYRGDEIVVCGARRDKLYELD FT FRPFDCEKSLLSCGRVQSDSVLWHRRFGHLNERSLTDLARNDMVSGLTLTV FT DKGVDQTIVCQSCIEDDWSHFTVVFLMRSKDEVLGCFEQYEALATAKFQTK FT VSRLRCDNGGEYTGRRFKQFCRTKGIQIEFTVPYTPEQNGTSERMNRTLVE FT KARTMLLDSGLDKRFWGQAVRTAAYLLNRSPTSAVVYKQTPFELWEKKKPD FT VSKLRVFGCETFVHVPKELRKKLDPKSWRGFLVGYAANGYLVWDPVNKRIV FT TARDVDFVETKTTVKKNDQHEVEQFVVCPIPDEAEEPVAPVAPAEREGEEE FT VTSDEEFGSFDDDEEADGDQEELDAGGVPAAKEGPVGEDPEVGADADPGRP FT PRDRTAPSWHRDYDMDYAGFALCATSFLDNIPQTIAELKKRDDWCRWKSAI FT DEEIDSLGRNKTWTLTKLPAGRTPVTCKWVFCVKRGDGVVADKYKARLVAR FT GFSQKVGFDYTETYSPVAKLDTLRLVLALAHRQEMHVHQMDVKTAFLNGIL FT HEEIYMTQPDGFQQGKDLVCRLHRSLYGLKQASRAWNERFDEFIQKLKFVR FT SPNDQCLYVRGTGREQIIVVLYVDDILVVGSSLKLVKVVKKCLNGEFEMTD FT AGEVGQFLGLTIDRQPGVMRISQRRYFESLLQRFGMMECKPTSTPMENRLQ FT LEKGVEAQRTTKPYRELIGCLMYAALTTRPDLSAAVNYFSQFQACPNDEHW FT IHLKRVLRYVKGTLDVGLVYRIEKEAPLLEVFCDADWANDLVDRRSVTGCI FT FKLYGCTVNWITRKQPTVSLSSTEAELAALCTAACHTLWMVRVLRDLGLQP FT DGPISLHEDNQSTIRIAEDARDHGRLKHIDTKFHFLRDLINEGVLELKFVR FT SSAQQADMMTKGLPATSFRELCTKIGLGRCCG" XX SQ Sequence 4019 BP; 941 A; 920 C; 1286 G; 872 T; 0 other; tccgaacctc ttttaggtta tgggcccagc ccgttgctaa tccgggttgt cgtttcttag 60 taaacttgtc gggatgtcgg aggacgcggt tgtgcgcgag gatcgctacc atgtcgcgct 120 gtttgacggc aggaactttg ccgcgtggaa gtatcgagtt ctggtactcc tggaggagta 180 cgagctggac gactgcattc tacacgaggt tggagagctt cctgagctga ttccgaagac 240 ggaagattcg acgacggttt cggcggaaaa ggagaagagg cggaaggaac agctgaagag 300 aaggcggaag tgcaagcgtt tgttgatcga gcgcattcac gattcccagc tctagtacgt 360 tcaggacaag cagtcgccga aggaaatctg gacgacgctc aagaatgtgt tcgaacggaa 420 gagcgtcgcg agccgtatgc atctccagcg gcaaatcctg agcatgcgat tcgacggtgg 480 ccgtttgacg gaccacttct tgcgcttcga caagctggtg cgtgagttcc gtggaaccgg 540 ggccgtaatg gacgacattg acgcggtttg tcacctcctg ctgacgcttg gttcggcgtt 600 tgctaccgtg gtgacgtcga ttgagaccat ggcggaggag aacctgtcga tggaattcgt 660 caaatgtcgg ctgctagacg aggagacgaa gcagaagggt ttgggcgtcg attcgctcac 720 cgccgggaac gaggctgctg ctttctccgg aatgaaaatg cagaagaaga tccgctgctt 780 cagctgcaaa aaggaaggcc acaagagtgt agactgcacg gagaaaaaga agaggaaaca 840 acagtacaac caaaaacaac acggtgcaaa agcaaacgtt gctggcacga gtgagcgaaa 900 cagtgtttgt tttgtgggcg tgagcggcgg cgagccgcag aaagtgcagc ggcgggtaaa 960 gtggttcctt gactccgggg cgtcggacca tctggcgcga gagaaagagt tgttcgtcaa 1020 gctgcacaaa ctgtcgaaac cgatcgagat tgctgtggcc aaggacgggg agtttattgt 1080 ggccgaacac gctggcactg tgaagatgtt ttctgttgtg ggtggccggt ccgtcgagtg 1140 cactgtgaaa gatgttcttt ttgttccgca actgcgctac aacttgtttt ccgtggtacg 1200 gatcgagaaa gccgggatgc gggttgtttt cgacggtggt aaagcgctga tttaccgcgg 1260 cgacgaaatt gttgtgtgtg gtgctcgacg tgataagctg tacgagctgg attttcgtcc 1320 tttcgattgc gagaagtcgc tgttgtcttg tggccgcgtt cagagtgatt ctgtgttgtg 1380 gcaccgtcga tttggccacc tcaacgagag aagtttgaca gatctcgcgc gaaacgacat 1440 ggtgtcaggg ctgacactga cagtcgacaa aggagtcgac caaacgattg tctgccagtc 1500 gtgcatcgaa gacgattgga gtcactttac cgtcgtgttc ctgatgcgat cgaaggacga 1560 agtgctgggc tgtttcgagc agtacgaggc gctggccacg gcgaagttcc agacgaaggt 1620 ttcccgtttg cggtgcgaca acggtggcga gtacaccggc cggcgtttca agcagttttg 1680 ccgtacgaag gggatccaga tcgagtttac agttccgtac acaccggaac agaacgggac 1740 gagcgagcgc atgaaccgga cactggtcga aaaggccagg acaatgctgc tggattcggg 1800 actggacaag cgtttctggg ggcaagcggt gcgaacggcg gcatatttgc tgaacaggag 1860 tcctaccagc gcagttgttt acaaacaaac accgttcgaa ctttgggaga agaaaaagcc 1920 ggacgtttcg aagctgcgag tttttgggtg cgaaacgttt gtacatgtgc cgaaggaact 1980 ccgaaagaag ctggacccga agtcttggcg tggtttcctg gtcggttacg ctgcgaatgg 2040 gtatctggtc tgggatccgg tcaacaaacg aattgtcact gctcgagatg tggattttgt 2100 cgaaaccaaa acgaccgtga agaagaacga ccagcatgaa gtggagcagt tcgtggtgtg 2160 cccgattcct gacgaagctg aagaacctgt cgcccctgtc gcgccggccg agcgcgaggg 2220 agaagaagaa gtgacgtccg atgaagagtt cggcagtttc gacgacgacg aagaagctga 2280 cggcgatcaa gaagaactgg acgcgggagg agttcccgcc gcaaaagaag ggcctgtcgg 2340 agaagatcct gaagtcggtg ccgacgctga tccaggtaga ccgccgaggg atcgtacagc 2400 tccatcctgg catcgtgatt acgatatgga ttacgctggt ttcgccttgt gtgctacaag 2460 tttccttgac aacattccac aaacgattgc tgagctgaag aaacgggacg actggtgcag 2520 atggaaatct gccattgacg aggagatcga ttcgctgggg cgcaataaaa cttggacctt 2580 gacgaagctc cccgctggtc gcacgccggt tacctgcaag tgggtgtttt gtgtgaaacg 2640 cggtgacgga gttgtggccg acaaatacaa ggcgagactg gtggctagag gctttagcca 2700 gaaggtggga ttcgactaca cggaaacgta ctccccggtg gcgaagctgg acacgctgcg 2760 actggtgttg gcgctggcac atcggcaaga gatgcacgtt caccagatgg acgtaaagac 2820 ggcgtttttg aacgggatcc tgcacgaaga aatctatatg acgcagcctg acgggtttca 2880 gcaggggaag gacctcgttt gccgactgca tcgttcgctg tacgggctca aacaagcgtc 2940 gagggcgtgg aacgagcgtt tcgacgagtt catccagaag ctgaagttcg tccgtagtcc 3000 taacgaccag tgcctgtacg tacgaggaac cggaagagag cagatcatcg tagtactgta 3060 tgttgatgac attttggttg ttgggtcttc gctgaagctg gtgaaggtcg tgaaaaagtg 3120 cttaaacgga gaattcgaga tgacggacgc cggcgaggtg ggacagtttc ttggattgac 3180 catcgatcgg cagccagggg tcatgcgaat cagtcagcgg cggtatttcg agagtttgct 3240 acaacggttc ggcatgatgg agtgcaaacc gacgtcgacg ccgatggaga atcgtttgca 3300 actggagaag ggagttgaag cgcagcgtac caccaaaccg tacagagaac tcattggctg 3360 tttgatgtac gcggcactca ccacgaggcc ggacctgtct gctgcagtaa actactttag 3420 ccagttccag gcatgcccga atgatgagca ttggatccac cttaaacggg tgctcaggta 3480 tgtcaaagga acgctggatg tcgggcttgt ctatcggatc gagaaggagg cgccgttgct 3540 ggaagttttc tgtgacgctg actgggcgaa cgacctggtc gatcgacgat cggtgacagg 3600 gtgcatcttc aaactgtacg gttgcacggt gaactggatt acccggaaac aaccaaccgt 3660 ttcgttgtcc tcgacggaag ctgaacttgc agcgctgtgt accgccgcgt gccatactct 3720 ctggatggtg cgggtgctcc gcgacttggg gttgcagccg gacgggccga tttccctgca 3780 cgaggacaat caatcaacaa ttcggattgc cgaggatgcg cgtgatcacg ggaggctcaa 3840 gcacatcgac acgaagtttc atttcttgcg agatttgatc aacgagggag tgttggagct 3900 aaagttcgtg cgttcttcgg cacagcaggc cgacatgatg accaagggcc ttccggcgac 3960 ttccttcagg gagttgtgca cgaagatcgg gcttggacgg tgctgtggtt gagcagggg 4019 // ID BEL-55_AA-I repbase; DNA; INV; 6203 BP. XX AC supercont1.317; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-55_AA_; KW BEL-55_AA-LTR; BEL-55_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6203 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.317; Positions 128184 121982. XX CC 'AACTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 630..5957 FT /product="BEL-55_AA-I_1p" FT /translation="MSGLERKLRYLKTRRRSILTSLASIEKFVASYQEEAD FT EPEVPIRLEAIVTLWTEFNKVQADLETQDEAPDALEIYLKERVEFENSYYK FT VKGFLSQRCPAPSSTPTSNPPAQSHSNHVKLPDVKLPIFTGNYETWLNFHD FT LFVSLVHSSSELSSIQKFYYLRSSMAGEALKLIQTIPISATNYSVAWNLLV FT EHFQNPKVLKRNYVQALFDFPVIRKESPDELHSLVEKFEANVKVLKQLGES FT TEYWDILLIHFLSSRLDPITRRDWEEFSATRDNVVFKDLTDFIQRRVSVLQ FT QMSSKQPESQQAYQPRKPSNPRIGSHGAFQENRRKCLFCPEEHPIYMCSIF FT SRMPVEEKEKLIKRHQLCRNCFRKGHMARDCPSENCCRRCNSRHHTQLCTT FT NTSTQGRSTNAAVTTSDVRNATPSTSGATPSDPPSSSASRASTSTMNAAHS FT GVTSCNSQQPGRSKVLLATAVVIVVDDAGREHFARALLDSGSECCFATSQL FT SQMMNVRRTRVDVPVAGIGKSALKVKHQFRAVIKSRNSDYTTSVDMLILPK FT VTMDLPSTDIDVSRWDIPSNITLADPAFFQSSTVDIVLGAEIFFDLFAVSG FT RIFLGESLPSLTNSVFGWVVSGRTAQYSSQTSVRCNVATVADLHDFMEKFW FT KVEEDPTATSYSPDETACEDFFQRTVSRDSSGRYMVRLPFKEAMIQRLGDN FT HKSALHRFRLLENRLSRDASIAQQYRDFMTEYLRLGHMAPLSSVGEEIEPK FT YYLPHHPVIRDSSTTTKLRVVFDASSKTSSGISLNDALLVGPIVQDDLRSI FT VMRSRTHEIVLIADAEKMYRQVRHSSEDYAYLCIFWRISPEQPIQTFVLQT FT VTYGTASAPYLASRVLKQLASDEAANYPIAAKVVENDFYVDDLFTGAATAA FT ETIELREQIDGMLSKGGFSMRKWASNCEAVLEGIPSENRALQHSIDFDRDQ FT TIKTLGLHWEPGTDLLKYKIDLHLPLNTVLTKRLTLSYIAQLFDPLGLVGP FT VVVTAKAYMQTLWTLRDENGKIWEWDRELPVSLRDRWIAYHSDLPTLNNLR FT INRFVLLSKPLEIELHMFSDASDIGYGTCAYLRSTDSLGRIKVALLTSKSR FT IAPLKRQSTPRLELCGALLSAELYQRISTSLQFPFIAVFWTDSMTVINWLN FT ASPSTWTTFVANRVSKIQHATQKCTWNHIAGLQNPADVISRGCLASELISN FT KLWWGGPDWLEKEKDYWPVPGQQCRATDHSDSERRRSAAVFASTTTEPSFI FT DEYSGKFSSYTKMVRTTAYWRRFFAILRSPKEDRTFTFLTTNELKTAEHAL FT IRLLQQQCFPDEWKRLQSGQHVAKSSRLKWFHPTISSNENIIRIGGRLGQS FT KQHDDFKHPILLPGTHQLSTLLLSSYHQKLLHAAPQLMVNTVRLKYWLLGG FT RSAARQVVHKCVTCVRARPKLIEQFMSELPAARVTAARPFSRVGIDFWGPI FT QLQPRHRRDAPIKAYVAVFVCFATKAVHLELVANLTTAKFLQAFRRFVARR FT GLCADVYTDNGKNFVGAANELKKLLRSREHKDQVAQECAENGIRWHFNPPK FT GSHFGGLWEAAIRSAQKHFVRVLGSRLIAHDDMETLLAQIECCLNSRPLTP FT LSEDPSDLNPLTPGHFLVGSALKAVPDANVLSIPYNRLTQWQQTQKAFQDI FT WSRWHLEYLATLQPRAKWCNPPVEIQRNRLVIIKDENLPPLQWPTGRIHEL FT HPGKDGVVRVVTLQTARGFVTRPVAKLCLLPISMPAGVGSISNDGEASSNA FT EQTQQ" XX SQ Sequence 6203 BP; 1621 A; 1643 C; 1413 G; 1526 T; 0 other; tctggtcctt cgaaccggat ggtcgtcgac gcgattccga ccacccggtg aagccacggc 60 aaatggttcc gacgccattc ctgcgaatcc aagctattca atcaggatat tgccattaaa 120 tattttcccg cccgtcgcca tctttcaacg ccaagtgaag ctgcttgaga ataattgttc 180 aaggcattga caattattct aaaaggtgag tgatattcca atcgctctac aagtttatcg 240 ctcgtctacc gggtcttctc tgcttacatg ttaccattgt ccaacgggac aattatcatc 300 gttttctgtg cactgtgatc tccacctgag acattccggt cgactttccc ggaattttgc 360 gatatttcat ccgatcctgc gcacaattga ggcaagccag cgccattgct gaacggacac 420 caacctgtac ttcgtgtaag cagcccaatt agcagccatt gctgacacag ccacatccac 480 agccaaagga cgtatcgcag ttcgagaaag ggaaatttta taaatacaag gcactttatt 540 gcctttgaga aggtaattat cgctccattt cgttagtttc gccttttccg ggctacctgg 600 tctcaatttg tagatcctga ttcaccgcca tgtccggact cgaacgtaag ctgcgctact 660 tgaagacgcg acgccgcagc atcctgacgt cattggcctc catcgagaag ttcgttgcat 720 cataccaaga ggaggcagac gagcccgaag tgccgatacg actagaagca atcgttacgc 780 tgtggacgga gtttaacaag gtacaagccg atttagaaac ccaagatgaa gcccctgatg 840 cccttgaaat atacctcaaa gagagggttg agtttgaaaa ttcctactac aaggtcaagg 900 gttttctttc tcaacgttgc cctgcccctt caagcacgcc cacttctaac ccccctgctc 960 aaagccattc gaatcacgta aagctgcccg atgttaagct accaattttc actggcaact 1020 atgaaacgtg gcttaacttt cacgacctgt tcgtgtcgct cgtacactca tcatcagagt 1080 tgtcgagcat tcaaaaattt tattatcttc ggtcgtcaat ggctggagaa gcactgaagc 1140 taatccaaac gataccgatt agcgcgacta attactctgt cgcatggaat ctcctagtcg 1200 agcatttcca gaaccccaag gttctaaagc gcaattatgt tcaggcattg ttcgattttc 1260 ctgtgattcg caaggagtcg ccagatgagc ttcactcgtt agtcgagaag ttcgaagcga 1320 atgtgaaggt gcttaaacaa ttaggagaaa gcactgaata ttgggacata cttttgatcc 1380 acttcctctc aagccgtctc gaccccatca ctcggcgtga ctgggaggaa ttttccgcca 1440 ctcgtgacaa cgtggtgttc aaggacctta cggatttcat tcagcgtcga gtcagtgtcc 1500 tgcagcaaat gtcttccaaa caacccgaat cccaacaagc ttatcagcct cgtaaaccat 1560 caaaccctcg catcggtagc catggagcct tccaggagaa tcgacgaaag tgcctcttct 1620 gcccggaaga acatccaatc tacatgtgca gcatattcag tcgcatgcca gttgaagaga 1680 aggaaaaact catcaagcga catcaactgt gtcgcaactg ttttcgtaag ggtcacatgg 1740 ctcgtgactg cccatcggaa aattgttgca ggaggtgcaa ttctcgccac cacacgcaac 1800 tgtgcaccac taacacatcc acccaaggga gatcaacaaa tgcagctgta acgacgagcg 1860 acgtccgaaa cgccacccct tccacttctg gcgccacccc atctgatcca ccatcatctt 1920 cggcttcgag agcgtcaaca tccacgatga acgctgccca ctcgggagtt actagttgta 1980 attcacaaca accaggacga tcaaaagtgc ttctcgccac tgcagtggta atcgtcgtcg 2040 acgacgctgg aagggagcat tttgcacgcg ctctactgga ttctgggagt gaatgttgtt 2100 ttgctacctc tcaactatcg caaatgatga acgttagacg gacaagagtg gatgtgccgg 2160 tcgctgggat cgggaaatca gcattgaagg tgaagcatca attccgagcc gtcatcaaat 2220 ctcgtaactc agattataca actagtgtcg acatgctcat tttgcccaag gtcaccatgg 2280 accttccctc cactgatatt gatgtatctc gctgggacat tccgtctaac atcactctcg 2340 ctgacccagc attctttcaa tctagcactg tcgacattgt actgggagca gagattttct 2400 ttgacctgtt tgctgtgtca ggacgcattt ttcttggaga atctcttcca tctctgacca 2460 actcggtctt tggatgggta gtgtcgggaa gaactgccca atattcgtcc caaacttctg 2520 tacgttgcaa tgtcgctact gttgccgacc tccatgactt catggaaaaa ttttggaaag 2580 tggaggagga tccaactgcc actagctatt ccccagacga aacagcctgc gaggatttct 2640 tccagcgtac ggtttctcgt gattcgtcgg gccgatacat ggtacgtctt ccgttcaagg 2700 aggcaatgat tcagcgacta ggagacaacc ataaatcagc actgcatcgt tttcgtcttc 2760 tggagaatcg tctttctcgg gacgcttcaa ttgcccagca gtatcgggat ttcatgaccg 2820 aatatctccg attgggccac atggcgccgc ttagtagtgt tggagaagag attgaaccaa 2880 aatactacct cccacatcac ccagtaattc gagacagcag cacgaccaca aaactgcgcg 2940 tcgtattcga tgcgtccagc aaaacgagca gtggcatctc tttaaacgac gctcttctcg 3000 tcggccccat cgtgcaggac gatttgcggt cgatcgttat gcgttctcgc acccacgaga 3060 tagttctcat tgcggacgca gagaaaatgt accgccaagt gcgacactct tccgaagatt 3120 atgcatactt gtgcattttt tggcgaattt ctccagagca accaatacag acatttgttc 3180 tgcaaacggt aacgtacgga acggcatcgg ctccctactt agcatctcgc gtactaaaac 3240 agctggcaag cgatgaggct gcgaactatc caatcgcagc gaaggtagtg gaaaatgact 3300 tttacgtcga tgatcttttc accggagcag ctacagcagc agaaacaatc gagctcagag 3360 agcaaatcga cggcatgctt tccaaaggag gattttccat gagaaagtgg gcatcaaatt 3420 gtgaagccgt tctggagggc attccgtctg agaaccgggc tctacaacac tcgatcgact 3480 tcgatcgaga tcaaacaatc aagacacttg gcctgcactg ggagccaggc acagatctcc 3540 tcaaatataa aatcgatcta catttgccac tcaacaccgt tctcacgaag cgcctcacgt 3600 tgtcctacat tgctcaactc tttgatccgt tgggactggt tggaccagtc gttgtaacag 3660 ccaaggcata catgcagaca ctctggactc tcagagacga aaatggaaag atatgggaat 3720 gggatcggga actcccagtg agtttgagag atcgatggat cgcctaccac tccgatcttc 3780 ccactctcaa caacttgaga ataaatcgct ttgttcttct ctccaaacca ctagaaatag 3840 agctccatat gttttcggat gcatctgaca tcggatacgg cacctgcgcc tacttgagat 3900 ccaccgacag tctcggtcga atcaaagttg ctttgctgac ctcaaaatca agaatcgctc 3960 ctctaaaacg ccaaagtact cccaggcttg agctgtgtgg agctttgctc tccgccgagt 4020 tgtaccagcg catttccaca tctcttcaat tcccttttat tgcagttttc tggactgatt 4080 ctatgaccgt aattaactgg ctaaatgcat caccctctac gtggaccacc tttgtcgcta 4140 acagggtatc caagattcag catgcgacgc agaaatgcac ttggaaccat attgctggac 4200 tccagaaccc ggctgatgtc atatcccgtg gctgtctggc atccgagctc atcagcaaca 4260 agctttggtg gggtggccca gattggttgg agaaagagaa ggattactgg cctgttcccg 4320 gacagcagtg tcgagcgact gatcacagcg atagtgagcg acgaagatct gcagcagtgt 4380 ttgcttctac cactaccgaa ccttcattca tcgatgaata cagtggaaaa ttctccagct 4440 acacaaaaat ggtccgtacc acagcgtact ggcgccgttt ctttgcgata ttgcgatcac 4500 cgaaagaaga tcggaccttc accttcctaa caaccaacga actcaaaact gccgaacatg 4560 cgctcataag gctgctgcag caacagtgct ttccagatga gtggaagcga ttgcaaagtg 4620 gacaacacgt tgccaaaagt tcacgactga aatggtttca tcctacgatc tcctccaatg 4680 aaaacatcat tcgaatcggc ggtcgtctgg gccagtccaa acagcacgac gacttcaagc 4740 atcctatctt gctacctggc actcaccaac tatccacgtt gcttctttcg tcgtatcatc 4800 aaaaactact acacgcagcc ccacagctga tggtcaacac agttcgatta aaatattggc 4860 tcttgggggg gcggagtgcg gctaggcagg tagtgcacaa atgcgtcact tgtgtacgtg 4920 ctcgaccgaa gctcatcgag caatttatgt ccgagctccc agcagctcgt gtgacggcag 4980 ctaggccatt ctctcgtgtg ggcatagatt tctggggacc catacagcta caacctcggc 5040 ataggcgcga cgcacccatt aaagcatacg tagccgtgtt cgtatgtttt gcgaccaagg 5100 cagttcatct cgaactggtc gctaatttga ctacagccaa gtttttgcaa gcctttcggc 5160 ggttcgtggc tcgccgaggc ctctgtgctg atgtatatac ggacaacggc aagaattttg 5220 tgggagcagc gaacgaactc aagaagcttt tacgaagcag ggagcacaaa gaccaagttg 5280 ctcaagagtg tgctgaaaac ggcatacgtt ggcattttaa ccctcctaaa ggatcccatt 5340 tcggcgggct ttgggaggcg gcaataagat cggctcaaaa acatttcgtt cgagtattgg 5400 gatcaaggct gattgcgcat gacgatatgg agactctgtt ggctcaaatt gagtgctgcc 5460 ttaactctcg tcctctcact ccgttgagcg aggacccgtc cgacctaaat ccattgactc 5520 cgggtcattt tctcgtgggc tcagctttaa aggcggtacc ggatgccaat gtcctttcga 5580 ttccgtataa ccggttgacc caatggcagc aaactcaaaa ggccttccag gacatttgga 5640 gcagatggca cctggaatat ttggcaactt tgcaaccgag agccaaatgg tgcaaccccc 5700 cagtagaaat tcaacgtaat cgattggtca tcatcaagga cgagaatctg ccaccattac 5760 agtggccaac tggcagaata cacgaactcc acccggggaa ggacggcgta gtccgggtag 5820 tcaccttgca aactgcacgt ggattcgtca cccgaccggt ggcaaaatta tgtcttctac 5880 cgatttcgat gccagccggt gtcggttcaa tatccaacga cggcgaagca tcaagcaatg 5940 cggagcagac acagcaataa aatcatacaa tgcccaaatt ccttgggtcc aaggtaagga 6000 cctgtccaat attttatgtt tcccctccgg atctaccggg tcttaatcat tacatgtaca 6060 atcgacgact gaccgccatt catcgagcga cgacgaaaac atgtgaatgg atttcagtta 6120 accccagcga taccaggccc tccataacca tcagagatga ttcgtgattc gtccagaaat 6180 agttatttct gagggaggca gga 6203 // ID P-1_AAe repbase; DNA; INV; 3131 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A P DNA transposon family from Aedes aegypti. XX KW P; DNA transposon; Transposable Element; nonautonomous; P-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3131 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1288-1288 (2011). XX DR [2] (Consensus) XX CC There are 3 copies with >95% identity. Both termini are CC uncertain. XX FH Key Location/Qualifiers FT CDS 152..2239 FT /product="P-1_AAe_1p" FT /note="transposase." FT /translation="METSVKKENLKLKKLLNQEKRKTKIYSSQLKKSKALN FT SRLAEIFSKGQLRKLTDPLKRRIQWSPDDISRAISLHAAGAKAYRLLLSRG FT YPLPAVSTLKKWAGKVKLTPGVLKPVLNLLQNSKFNEKERACVLSFDEMKI FT KSCYEYDRTSDKVLKPTKYVQVAMIRGLYKNXKQVIYYNFDTAMTATIIKD FT IIEKLXNIKFRVVAIVCDMGPTNRKLWKDFEISCEKPFFTVDDQKVFTFAD FT TPHLMKLVRNHFLDSGYSWQKEETTHLITXRPVVDLVNLQRSELKICHKIT FT LTHLDVKDAARQKVKYAVQLLSNSAAQGIKRAFSLGLISSPXALITSSFLK FT NMNDWFDIFNSSAASKGLRERLKAFDHSNTVHQSIIHESNEMISQLRVPNR FT RTLLPCQKAILVNNAALIGLSVFLKTIYGXSYLLTRRLQQDDLERFFGTIR FT SKGGLHDHPTALEFTYRLRNSILGNIIDYFILIKFVLLCVKLWLSIPRYVM FT IKIYFFQIGRSDASQQVNNSNVEQEQEEPYEELQFTGSLLKDISPGQTDFF FT QAEDQDYEMEENMDESECSMEYSELEEDALEYIAGYIIKKLKLRIPPPDKS FT AFTWVDQLSEGGLTKPTFDFVTQIMLLDKIFKQQHGETFNFNVKAVESCID FT ASENIGLSVEIKKLFFRTRLYIRIRNLNKKNDNIQKTKKRKMKKIVN" XX SQ Sequence 3131 BP; 1028 A; 605 C; 578 G; 899 T; 21 other; tcccatgata aagagtcaat caatcaatca awattagaat cgaggtatac ctgtagacgt 60 tttacgtggt ctatcgctgt ggttgttcga ttttgtttac tcagatggta ctttactaga 120 caaataacaa aaaaaaactt ctagaaaacg catggaaact tctgtgaaga aggaaaattt 180 aaaactgaag aagttactta atcaagaaaa acgaaaaacc aaaatatatt cctcgcagct 240 caaaaagtca aaagcattga attctcgtct tgccgagatt ttctcaaagg gtcagctgcg 300 gaaacttacg gatccactca aacgtcgaat tcagtggtct cctgatgaca tatctcgagc 360 gatatcwctg catgctgctg gagctaaagc atatcggtta ctactgtcta gagggtatcc 420 tttgccagca gtttctacgc tgaagaaatg ggctggaaaa gttaaactta cacctggagt 480 tctcaaaccg gttttaaacc ttcttcaaaa tagtaaattc aacgaaaagg agcgtgcatg 540 tgttctctct tttgatgaga tgaaaatcaa gtcatgttat gagtacgacc gcacttccga 600 caaagttcta aaaccaacta agtacgtgca agttgcaatg atacgaggac tatacaaaaa 660 twggaaacag gtcatatact ataatttcga taccgccatg acagcgacta ttataaaaga 720 tattatcgaa aaactgaawa atatcaaatt tcgcgtagtc gcaatcgtat gtgatatggg 780 gccaaccaac agaaaacttt ggaaagattt tgagatcagc tgtgaaaaac cwtttttcac 840 cgttgatgac cagaaggttt tcacttttgc ggatacgcct cacctcatga aacttgtacg 900 taatcatttt ctggattctg gttactcatg gcaaaaggaa gaaacgactc atcttatcac 960 ctmaaggccg gttgttgatt tggtcaattt gcagaggagt gaactcaaga tttgtcacaa 1020 aatcactttg acccatctag acgttaaaga tgcagctcgt cagaaagtga aatacgctgt 1080 gcaattgctg tccaactctg ctgcacaggg catcaaaagg gccttcagcc ttggattgat 1140 ttcttcgcca gawgctttga taactagttc atttttgaaa aatatgaacg attggtttga 1200 tatcttcaat tcgtcagcgg catcgaaagg actacgagaa cggctcaaag catttgacca 1260 tagtaatacg gtgcaccagt ccataattca tgaatccaat gaaatgatct cacagcttcg 1320 tgttccaaat agaagaactc twttgccgtg ccaaaaagca atcctcgtaa ataatgcagc 1380 actcatagga ctttcagtat ttttaaaaac tatttacggw wattcgtatc tattgacacg 1440 gagattacaa caagatgacc tagagcgctt ctttgggact attcggtcaa aaggaggatt 1500 gcacgaccat cccaccgcat tggaattcac ttacagattg cgaaacagca tattaggtaa 1560 tattatagat tactttatac ttattaaatt tgtgttgtta tgtgttaaac tttggttatc 1620 tataccacgc tacgtgatga tcaaaattta tttcttccaa ataggacgat ctgatgcatc 1680 gcaacaagtg aataacagta acgtagagca ggaacaagaa gaaccgtatg aggagcttca 1740 atttacggga agcttgctaa aagacatatc tccagggcaa acagactttt tccaagccga 1800 agatcaagat tatgaaatgg aggagaatat ggatgaatcc gagtgttcca tggaatattc 1860 cgagttggaa gaggatgctc tggagtatat tgctggatat atcataaaaa agttaaagtt 1920 acggatacca ccaccagata aatcagcatt cacatgggtg gatcagctct ctgagggtgg 1980 gttgacaaaa cccacttttg attttgtaac acaaataatg ttacttgata agatatttaa 2040 acagcaacat ggcgaaacat tcaactttaa cgtgaaagct gttgaatctt gtattgatgc 2100 ttccgagaat attgggctgt ctgtggaaat caaaaaactc tttttcagaa cacgattgta 2160 catccgaatc agaaatctga acaaaaagaa tgataacatc cagaagacta aaaagagaaa 2220 aatgaagaag atcgtgaatt aaatgtatca ctttgttgtt aagttatata cataaatata 2280 tagcaaaatt ttcaacaata ttctcattcg atgttttatg tttccaaaat tttgtatcca 2340 tggcttgaac aatgatagtc cttgatctgc tgaacaactt cgtcgaagat gccatctttc 2400 taaaaaatca ggctgttgag atatctgcaa aacaaaagtt ccatgctgac taccgccatg 2460 aatttgtagt aaatttgaat taactggcat gaccgagtaa ccwacttaac aactttscct 2520 wagamgcgaa ccattatcag gatcttgaaa tatcattatt attatacatt atcacattat 2580 tgtctctcaa ggttcaaata gtgcaagtta gaaatagcac ccctaccgga aaatttatca 2640 tcaaaaagga tgatcctcaa cttctaaaga cagatcgtac tgagtactga aacttwtcca 2700 aaggcagtac tatgctatct cctmwcgmat acmtccgaca acacccccaa attgatggaa 2760 ttcgataaat cctgggtctc ctwttcgatt cattttcgac ctcacaakat caatgcccac 2820 tcgcatcaaa taacatttcg tgtcgttttt ccaaaagtgc catgtatctc catgtctcgc 2880 gtttcctaat aatcgaacat ctatttctag aatatatcta ttggctcagt tcaacagcat 2940 taattgttat ctgattaatc gtaatcatag tcaaccatcg ttgaactcat cgttgcgtca 3000 tggatacctc ggtggatatt gatgcgtgtt gattcagacc actagatcat ttcgttcacg 3060 ataggaatga cgtcatcgag caactgtcag ttttcggagt atatcacctt atcaaataca 3120 gtacaccgtg g 3131 // ID DNA8-1_CQ repbase; DNA; INV; 2157 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2157 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 78-78 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. 8bp TSDs. 863bp TIRs. XX SQ Sequence 2157 BP; 670 A; 404 C; 394 G; 688 T; 1 other; tagcctgtcc cattttgagg tcatgtcgag gaatttcagg tgctcacttc ttaaatgata 60 gattatgatg ttaggaacaa tgttttctta gacaagaaaa aataataaaa tactttctcg 120 caccccctag tcgatttact gaaaaaagtc acttttttga acaaattttc taaaatcgct 180 tggaatcaat acaaaaactg tttcgatcag gtgtgtatta tcttcaaacg ataggttttt 240 gtccatagat taagatgcac tatcaatatt ggaccaaaat ttaagttttt ggactctccc 300 aggcggattt gtgtcgaaaa aatcgcattt tttcgaaact tttttcagaa atgttcattt 360 aatttagggt agcctatttt tcccagtgaa accaatacat gcagcttgta ggaaatttca 420 tggcgaacat ttttccctct gagaaaatgc aatttcgaca ctccagagcc gagatatttg 480 agttttagtg aggaaaaaag tgccaatttt caaaatttct cagaattcag aagcaaggcc 540 tactaattac acgatgtagc aagcatatgt ctcaaaggtc agggtatgct tttttatagt 600 aattttggcg gctgaatccg aatctgaaat caaatttcgt gtaaacagtg atgttttgga 660 gctacaccct tttggaatgt tatttgcgtg tttaagaggc agtttttgta aatattgctc 720 ggtttgttct agaggtcgta tcgaggtgct ccgatttgga tgaaactttc agcgtttgtt 780 tgtctataca tgagatgaac tcatgccaaa tatgagccct ctacgacaaa gggaagtggg 840 gtaaaacggg cattgaagtt tgaggtccaa ggcaaacata acaaaatggc ttaaaattgc 900 tgccgsatgc tttagtaaca acttcgtcaa ttccaactct cttagatgca ttcgaaaagc 960 attagaagca ctgcgggtca aaatgagcca taggacgtcc aggattggtt taacttttgt 1020 ctggcatagc ttttgcaaat tactgttaaa aatggttttg gtttttaaac ctcaatatct 1080 ttttgcaaca gccagcaaca ccccctagtc aaaagttagg gaatttcatg gactataagc 1140 ctggacggta ataacttttt ggccaatcgc agtttttctc atagtttttc gatttttcta 1200 taacaaacgt tttacaacgt tagttattgt cctgaggcgt ccatagcggc actttttagt 1260 ctcccttttg tcatattcga aatcctcgga aaattcaaac ttcaatgccc gttttacccc 1320 acttcccttt gtcgtagagg gctcatattt ggcatgagtt catctcatgt atagacaaac 1380 aaacgctgaa agtttcatcc aaatcggagc acctcgatac gacctctaga acaaaccgag 1440 caatatttac aaaaactgcc tcttaaacac gcaaataaca ttccaaaagg gtgtagctcc 1500 aaaacatcac tgtttacacg aaatttgatt tcagattcgg attcagccgc caaaattact 1560 ataaaaaagc ataccctgac ctttgagaca tatgcttgct acatcgtgta attagtaggc 1620 cttgcttctg aattctgaga aattttgaaa attggcactt ttttcctcac taaaactcaa 1680 atatctcggc tctggagtgt cgaaattgca ttttctcaga gggaaaaatg ttcgccatga 1740 aatttcctac aagctgcatg tattggtttc actgggaaaa ataggctacc ctaaattaaa 1800 tgaacatttc tgaaaaaagt ttcgaaaaaa tgcgattttt tcgacacaaa tccgcctggg 1860 agagtccaaa aacttaaatt ttggtccaat attgatagtg catcttaatc tatggacaaa 1920 aacctatcgt ttgaagataa tacacacctg atcgaaacag tttttgtatt gattccaagc 1980 gattttagaa aatttgttca aaaaagtgac ttttttcagt aaatcgacta gggggtgcga 2040 gaaagtattt tattattttt tcttgtctaa gaaaacattg ttcctaacat cataatctat 2100 catttaagaa gtgagcacct gaaattcctc gacatgacct caaaatggga caggcta 2157 // ID Mbcv_LTR repbase; DNA; INV; 667 BP. XX AC . XX DT 05-JUN-2008 (Rel. 13.09, Created) DT 05-JUN-2008 (Rel. 13.09, Last updated, Version 1) XX DE LTR retrotransposon from Monosiga brevicollis: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; chromovirus; KW Mbcv; Mbcv_I; Mbcv_LTR. XX OS Monosiga brevicollis OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. XX RN [1] RP 1-667 RA Carr M., Baldauf S.L., Leadbeater B.S.C. and Nelson M.; RT "Three Families of LTR Retrotransposons are Present in the Genome RT of the Choanoflagellate Monosiga brevicollis."; RL Repbase Reports 8(9), 1180-1180 (2008). XX DR [1] (Consensus) XX CC Chromovirus identified in the genome of the choanoflagellate CC Monosiga brevicollis. Not annotated in the JGI release Version CC 1.0. XX SQ Sequence 667 BP; 119 A; 226 C; 154 G; 168 T; 0 other; tgttagcgtc aaatccgtgc ttccacgcat ttgagcactc ccactgcttt cgcacacact 60 cagcatcggt cgctatgggt cttcctattg gtatcccata gctgcgacga cctcacactg 120 acacctacac ttggaccccc tgggcctggt cctcctcgtg ggagtcacat gagtaaacac 180 cgcgtgtgac gcggtcccca cgatgcctgg gcagctgccc tggggcaaca agcttgtgac 240 tactgttgtt tccctttgtt ggggtacctg cggtcgggca cgagttgcac tttccctatc 300 ctcgggattg agcgactcta cagctattct aacggtgcac ctggtctggt cactgtgaat 360 gcttgccgac cgcaggtacg tcaagccccc tgccattctg tccccccttt gctctcctgt 420 gtgctctgct ctcttgtgac tttccctatc ctcgggattg agcgactcta cagctattct 480 aacggtgcac ctggtctggt cactgtgaat gcttgccgac cgcagctggt tcaattgacc 540 ccaagacacc accaaagctt tcgctccgtc ccgacggacg tctctcaaac ctctctcaaa 600 cgcttcctca aacctacgcg cttcctcgca acccaaagac ggcccccaga cgtgctggag 660 cctgaca 667 // ID BEL-4-LTR_HM repbase; DNA; INV; 205 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-205 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 437-437 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 205 BP; 65 A; 21 C; 29 G; 90 T; 0 other; tgttaaaagt gttttattat atacggcttt ttgttattta ttaccggttt tatctggttt 60 ttcttgatta ttcaattttt ggattgtaaa agtttttgtt acggatatat tagttagtta 120 acacgaggtt acaagaaaga gaataactta gctaaaaccc ttctcaatta atgtctctaa 180 aataattaaa ttaaagtttt ttaca 205 // ID BEL-608_AA-LTR repbase; DNA; INV; 678 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-608_AA_; KW Pao_Bel_Ele157; BEL-608_AA-I; BEL-608_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-678 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 678 BP; 209 A; 120 C; 131 G; 192 T; 26 other; tgttggcgac acagctccac gtttggcaac cctkccaaca gtgaatcgct takgcgaaac 60 ggagagaacg gagcctagat acamcacacg aaggcgacaa tagagtgtag tcgamagtsa 120 wagcgatagt gcacgtgcct agatggtcta tgggwtamat tgcsaatatt camtttatga 180 ttcttcaawt tactaaattt aacaatactk tgtaataagt accgatagtg aamwtcgaaa 240 tcagttgtag ctcagtagtg gctgstgata ctgwggtwtg ctggtsgtwt ggtcgagaka 300 wgttktattg taggttagaa wgtgaattkc ccgatggatt acagtawcct waagattaaa 360 ctatatctct atagcaatcc caggcaccag tattgagccg ttattctagc cgtacgtcag 420 cgagtatgat aaacggacta agtatgctat catttctcat ctatgtttca taaaccccta 480 actgatcatc tcatcattta tatagagtaa actaccgtaa aaataaaccc ttttgttcga 540 ggaatattgc acacgaaagt aaaccgaatt tgtaagctga aatcacgtat gtaaaattta 600 gaactatact aatttaataa atattccagc tttaggcgtt acgctaaaaa gaatcggcgt 660 ttttgttatt ctcgaaca 678 // ID DNA-8-2_HM repbase; DNA; INV; 946 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 24-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE non-autonomous DNA transposon from Hydra magnipapillata- DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-8-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-946 RA Bao W. and Jurka J.; RT "nonautonomous DNA transposon from Hydra magnipapillata."; RL Repbase Reports 8(12), 2077-2077 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 946 BP; 374 A; 110 C; 122 G; 340 T; 0 other; tagtgatttc gtcggttatc cgtttcaaaa accggataac agttctcatc aatttgcggt 60 taaccgacta ttcggtaaat tagtgacgct ttctgaatac aggatcttct aaatttcaac 120 gaacgtagcg ttttaaacta attttttaaa aattaatgtt aaagtgtaca ttttcaagac 180 agttagtgaa tacagtccat atatattttt atatacatat attatatatt tataacaaaa 240 catggaaaaa gatatatata ttttttatat tataatataa tatacctttt taattatata 300 attaaaaaag gtaggaataa tgtacaagta tagtttaaat gttttaatat cagcggatgt 360 tttaaaatca ttctctcata taaaaacatt tgatttggaa aaatgagatg ttatgaagta 420 aattttttta gcgtgtattt taaaaactta ttgttatgcg gacatagatg gaaacttgaa 480 gataaacttg aatgttctca aaacaacttg ttgcgtaaag ataatataaa gaatatttgc 540 taacactaaa ataccgccaa atctacgaat aaacaaattc tattggttca aaaaatacgc 600 tgaataagcg aattagagtt ccataatatt ctgaagaaac tttcttaaaa atgtattgct 660 tgagttgtaa aagtaataaa taaatctttg agatttctat tgtaaattgt tttaagaata 720 tattatcaaa caaaactgat taaattaaca tttaaattaa taaaagaaat atattataat 780 ctttgataaa ttaacatttg tcaaacaata atcgtcataa ttacgatgtt actttctata 840 aaatttcgta aaatttttaa gtcggttttt attattcgaa aatagatatt tgatattcga 900 acaaaatccg gttaaccgca aaaccggata accggcgaaa tcacta 946 // ID LanceleTn-1 repbase; DNA; INV; 433 BP. XX AC . XX DT 15-MAY-2006 (Rel. 11.05, Created) DT 15-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW LanceleTn-1. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-433 RA Osborne P.W., Luke G.N., Holland P.W.H. and Ferrier D.E.K.; RT "Identification and characterization of five novel miniature RT inverted-repeat transposable elements (MITEs) in amphioxus RT (Branchiostoma floridae)."; RL Int. J. Biol. Sci 2(2), 54-60 (2006). XX DR [1] (Consensus) XX SQ Sequence 433 BP; 124 A; 86 C; 105 G; 116 T; 2 other; tagggctggg tatcggtaca gcgtaccggt acaaaaccgg tttttcttat tggaccggtc 60 cagaaaaacc ggacctgaaa aaattaggtg gaccggatgt tggaccgatt rgaaaattaa 120 cagattattt tatcaggcat tcacacgttt tggcgcttgc aggtggaaga aaataacaag 180 agtgaagtag agtagagttt atagtaattt ctaccaagtt ttacagccaa tcgtacaggt 240 gcagttagcg ttgtaggatt ttaaaacgcc agtgtaagtc taatactcca ccaaacagat 300 ttctttgtag tgaaatggac cattggtatg agtcatactg aatcaggtcc aggttcaggt 360 ccggacctgg acctgatcct ytggacctga accggacctg gacctgaatt ttctgtaccg 420 gtacccagcc cta 433 // ID CR1-43_AAe repbase; DNA; INV; 4749 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-43_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4749 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1130-1130 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 325..1188 FT /product="CR1-43_AAe_1p" FT /translation="MACKRCQKKVSDDEIVVCRGYCGASFHAVCVNVDEPL FT RKQLKHYRNNVFWMCDGCANLFTNAHFRTMMTGFDDKISAMPAAIETMKGE FT IEKLHNCLNNLSAKVDGMPSTPTPFSTPNPWPAIHRINRSVKSAKRLRDID FT GNSIKVDSGSIKTGTKALDALSSVRLAPRSNDDLFWIYLSAFHRDTSESQI FT TSFVTECLELPTNIEPQVVKLVPKGKDPNTLNFVSFKVGLSDKFRDKALSC FT DSWPENIRFRQFEDNRSKNLPQVVSLTSTIHPGTEDPSERASSSMIV" FT CDS 1062..4637 FT /product="CR1-43_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KHPIPAIRGQPFKKLAPGGQSNFHDSSWDRGSFRASF FT FEYDSVSSLGPPGRTQRGIRGAPSDPVTVEPSSPAYLSNYAFDSVSCHHRR FT PGPVNGGLEGVSQPAPSGKYACSRRLSLGDEFDNFSSVPSKTIHVVNLERN FT EPSLSSTSLPGRTVESLMEAPDPSNPVEQIAVTAHHSRPGPVVGCGAGVFQ FT SVTTGEYSYNSRNGISIPDESIFRTSNGQSSTSTIQHADNDDVLIYYQNAG FT GMNSTTHDYLLATSDECYDVIVLTETWLDSRTISSQVFNSXYDVFRCDRGP FT RNSRKSTGGGVLVAVNKKRKSKAIENDQWSSIEQVWVCIEFTNRKVFLCGI FT YVPPDRTRDDDLIERHTRSIMSVVEMTSACDEIIVIGDFNLPGISWQPSHH FT GFLHPDPDRSTMHVGATKLLDCYCSATLRQINHLTNENNRSLDLCFVSAQD FT VAPTVSIAPLPLVKQARHHPPLILTLNDRHTIEMKGIVPVSYDFPNADHGG FT IAEYFASIDWLNVLDGRDADEAALTLSNIIGHVIDRHVPKKAFNGNRKPWI FT TRELSSLKTAKRAALRHYCKHRTIPLRDQYRSLNAEYKLRIRESFRHYQHG FT IQRNLQAKPKSFWSYVNSQRKESGFPSTMTLNGEVTSEPQKVSQLFADKFA FT STFCDEIISDEQVNQAASNVPLLVGSTLSSXDIDDAVIFRAASQLKSSTNL FT GPDGIPAVFVKKYIDNLIVPLRHVFNLSLSSGVFPSIWKTAIMFPVHKKGD FT RKDVNNYRGISSLSSISKLFELIVLDALMSHSKQYLSDEQHGFISGRSTTT FT NLLCLTSHVTESFANKGQTDVIYTDLSAAFDKINHAITIAKLEKYGVSGNL FT LRWLKTYLIGRKLSVTVEGFRSKDFLAASGIPQGSHLGPLIFLLYFNDVNY FT VLHGPRLSYADDLKIYSRIRSKADAIRLQQDLDNFKNWCTLNCMVVNPDKC FT SIISFARIRDPVVFDYKLHDTVIQRVDHVKDLGVVLDNQLTFKRHVSYVVE FT KASRTLGFIFRIAKDFTNVYCVKSLYCSLVRSNLEYCSVVWHPYYQNGIER FT IESVQRRFIRFALRRLPWRDPFHLPSYESRCRLIHIDTLQVRRDVSRALFV FT ADTLQGRIDCPDILRAVNLNVRPRTLRNNSLLRLPFQRTNYGQNSAIIGMQ FT RTFNRAASVFDFNLSRETVRRNFYVLFSNLNDD" XX SQ Sequence 4749 BP; 1269 A; 1118 C; 1013 G; 1346 T; 3 other; ggcaacattg tctacttgtg ctccaagttt gttattgttc gtcgtgcttt ttatcctttt 60 ttgtggtgca gtcaagtgcc ttaaaccacc gttttagtat tcaccgatta gttaatcgaa 120 ccttcgcgct ggaatataaa aatcgtgcga ttcatcgcga tatagttcgt gtactttgat 180 aatttgtcaa ctcgcagcaa caacaagtgt cggtgcccga tcgaatttgc aaccaaattt 240 tcacctaccc cgcacaccga ttttatctgc tctaagctcc atctactgtt ggatagccgc 300 acaccaaatt gatcgtttgc aaacatggcc tgtaagcgct gtcaaaagaa ggtttccgat 360 gatgaaattg tcgtatgtcg aggatactgc ggcgcatcgt ttcatgcagt ttgcgtcaat 420 gttgatgaac ccctacgcaa gcaactgaaa cactatcgta acaatgtgtt ttggatgtgc 480 gatggatgtg ccaacctctt tactaatgcg catttccgta ctatgatgac gggatttgat 540 gacaagattt ctgctatgcc agcagctatc gagaccatga aaggtgaaat cgaaaagctt 600 cataactgtc tgaacaattt atcagccaaa gtcgatggta tgccctcaac tccaacgcca 660 ttctctactc caaacccttg gcccgctatc catcgcatca atcgctccgt gaaatccgca 720 aaacgcctcc gtgatattga cggtaattcc attaaagttg acagtggatc cattaagacg 780 ggtacgaaag ctttagacgc actatcatct gttcgcctcg ccccgcgatc caatgatgat 840 ttgttttgga tatatttatc agcatttcac cgagatacgt ctgagagcca aatcacctca 900 tttgtcactg agtgcttgga gttgcctacc aacatcgaac ctcaagtggt caaattagtt 960 cctaagggaa aagatccgaa tacgctcaat ttcgtatcat tcaaagtcgg attgagcgac 1020 aaattcaggg ataaagcgct ttcatgtgat tcgtggcctg aaaacatccg attccggcaa 1080 ttcgaggaca accgttcaaa aaacttgccc caggtggtca gtctaacttc cacgattcat 1140 cctgggacag aggatccttc cgagcgagct tcttcgagta tgatagtgta agcagtttag 1200 gaccaccagg acgcacgcaa cgaggcatta ggggagctcc ttctgacccc gtaacagtcg 1260 agccatcatc tcctgcatat ctatcgaatt acgcttttga ttccgtttcc tgccatcatc 1320 gtcgtcctgg tcctgttaac ggtggtttgg aaggagtctc ccagcctgca ccttcaggca 1380 agtatgcgtg tagtagaaga ctttccctcg gtgatgaatt cgacaatttc agctctgttc 1440 cctcaaaaac cattcacgtc gtaaatcttg aacgcaacga accttcctta tcatcaactt 1500 cgctaccggg ccgcactgtg gaaagcctca tggaagctcc cgacccttct aacccagtcg 1560 agcaaatcgc agtcaccgct catcatagtc gccccggtcc tgtggttggt tgtggtgcgg 1620 gagtcttcca atccgttacc acaggcgagt attcgtacaa ttcacgtaat ggaatatcga 1680 tacccgatga atcaattttc aggacctcaa acggacagtc aagcacatcg acaattcagc 1740 acgctgacaa tgacgacgtt ctcatctatt accaaaatgc tggtgggatg aacagcacta 1800 cacacgatta tctgctggca acatccgacg agtgctatga cgtaattgtg cttaccgaaa 1860 cgtggcttga ctcacggacc atctccagtc aagtgttcaa ttccgamtac gacgtttttc 1920 gatgcgaccg cgggcctaga aatagccgta aatcgacagg tggcggtgtt ctagttgcgg 1980 ttaacaagaa gcgcaaatct aaggcgattg aaaacgacca gtggagcagt atcgaacagg 2040 tatgggtgtg catagagttc acgaatcgca aggttttctt gtgcggtatt tacgtccctc 2100 ctgatcgaac acgtgacgac gatttgatcg aacgtcatac tagatcgatc atgtcggtgg 2160 tcgaaatgac gtcggcgtgt gatgaaatca ttgtcattgg ggatttcaac ctacctggta 2220 tctcgtggca accatctcat cacggtttcc ttcaccccga tcctgaccgc tctacaatgc 2280 acgttggtgc aactaaacta ctcgactgct actgttcagc cactttacgt caaattaacc 2340 acttgaccaa cgagaacaat cgttcccttg atctctgctt tgtaagcgct caggatgtcg 2400 ctccgacagt ttcgatcgcc cccttgccgt tagtcaagca agcaaggcat caccctccgt 2460 taattcttac gttgaacgac cgccatacaa tcgaaatgaa aggaattgtt cctgtttcct 2520 atgactttcc gaatgctgac catggtggaa ttgcagaata ttttgcctcc atcgattggt 2580 taaacgttct ggatggacga gatgctgatg aggcggcttt gactctctca aacataattg 2640 ggcatgttat tgatcggcat gtgcccaaga aagctttcaa tggtaacagg aaaccatgga 2700 ttacacgtga gcttagttca ttgaaaaccg caaaaagagc agcactcagg cattattgta 2760 agcaccgcac gattccgttg cgtgatcaat atcggagttt gaacgccgaa tacaaattgc 2820 gaattcgtga gagttttagg cattaccaac acggaatcca gcgaaacctg caggcaaagc 2880 caaaatcttt ttggagttat gtgaattctc aacggaagga gtcaggtttc ccttcaacca 2940 tgactcttaa tggtgaagtt acctcagagc cacaaaaagt cagtcaactt tttgctgaca 3000 agtttgcgag tactttctgc gatgaaataa tttccgatga acaagtcaac caagctgcat 3060 ccaatgttcc tttgctggta gggagcactc tgagctcmst cgacatcgat gacgccgtga 3120 tatttagagc tgcctcgcag ttaaaatcat ctacaaacct cggaccagac ggcattccag 3180 ccgttttcgt taaaaaatac atcgataacc tcatcgtacc acttcgacat gtattcaacc 3240 tgtcactatc tagtggtgta ttcccgtcta tttggaaaac cgccatcatg tttcctgtcc 3300 acaaaaaagg agatcgaaag gacgtaaaca actaccgtgg catttcgtct ttgtcctcca 3360 tttccaagtt gtttgaactc atagtcctgg atgcattgat gtctcacagt aagcagtatc 3420 tgagcgatga acaacatggc tttatctcgg gaagatctac cactacaaat ctcctgtgcc 3480 ttacgtcgca tgtcacagag agttttgcga ataagggtca aacggatgtc atctataccg 3540 acttgtctgc ggcattcgat aagatcaatc acgccattac gattgccaaa ttggaaaaat 3600 atggtgtcag cggtaatttg ttgcgttggc tcaaaacata cctgattgga cggaaactct 3660 ccgttacagt agaaggtttt cgttcaaaag attttctcgc cgcatcggga atcccacaag 3720 gaagtcatct tggaccgcta atattcctat tatacttcaa tgatgtcaac tacgtgctcc 3780 atggccctcg cttatcctat gctgatgact tgaagattta ttcgaggatt cgttctaaag 3840 ccgatgcaat tcggcttcaa caagatttgg acaacttcaa gaactggtgt accctgaact 3900 gtatggttgt taaccctgac aagtgcagca ttatttcttt tgccagaatt cgcgatcctg 3960 tagtttttga ctacaagctg catgataccg taattcagcg agtagatcat gtgaaggatc 4020 taggtgtagt tttagacaac cagttaacat tcaaacggca tgtttcatat gtcgtagaga 4080 aggcatcccg tacgttaggg tttatctttc gcatcgcaaa ggacttcacg aatgtgtatt 4140 gtgttaaatc gttgtactgt tcgctagttc gatccaatct cgaatactgc tcggtcgtat 4200 ggcatcctta ctatcagaat ggtatagaga ggatcgaatc agtacaacgt cgcttcatac 4260 ggttcgccct tcgtcgactt ccttggcgtg atccattcca tctaccgagc tatgaaagtc 4320 gatgtcgttt aatacatatc gacacacttc aagtgcgaag ggacgtttcg agagctctat 4380 ttgtagccga tactttacaa ggccggattg actgccctga tattctacgg gcggtgaact 4440 tgaacgttcg tcctagaacg cttcgaaata attcgttatt gagactacct tttcaacgca 4500 ctaattacgg acagaacagc gcaataatag gcatgcaaag gacgttcaat agggccgctt 4560 ctgtgtttga ttttaatttg tctcgtgaaa ctgttcgtcg taatttctat gtactatttt 4620 ccaatttaaa tgacgattag tgatatcctt atgtgttata ttttttattg ttaacaaatt 4680 atgtttaggt taaggcatca ttgggacaat tgttgtctgt tggtgtaata taataaataa 4740 ataaataaa 4749 // ID Copia-21_SI-LTR repbase; DNA; INV; 228 BP. XX AC AEAQ01023582; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_SI_; KW Copia-21_SI-I; Copia-21_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023582; Positions 383 156. XX SQ Sequence 228 BP; 52 A; 63 C; 35 G; 78 T; 0 other; tgttaaaata tgtaatcgaa ctttgtgtat ttcccgcgaa atttattggc gccatctccg 60 gattcaactg cgtggcttta cgaccaccct ctcctacttc gctttcgctg gctcgtgttc 120 tttcgcgcgc taagaacagt acttgtaacg acactcatta aaccttttct tgcaatatat 180 ctcgttcact tttctctcat cgactctcag accatagcta attcaaca 228 // ID Gypsy2-I_DV repbase; DNA; INV; 4141 BP. XX AC scaffold_13324; XX DT 15-OCT-2009 (Rel. 14.12, Created) DT 15-OCT-2009 (Rel. 14.12, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_DV; KW Gypsy2-LTR_DV; Gypsy2-I_DV. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-4141 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(12), 3096-3096 (2009). XX DR Genome; scaffold_13324; Positions 25734 21594. XX CC Positions [2681-3154] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 9..911 FT /product="Gypsy2-I_DV_2p" FT /translation="MTKMKNMELEELLVSVVLAHTAQLDSRLKKLVCTTNI FT KSRNELQQQLQVYEIGQRQETSKAETTVGPERKKQKFSNIVCHYCGKTGHK FT IAFCRQQENRESQPLGSGHAGRIGMRTKEQKSNVICYKCGQAGHISTGCPS FT TGQSSEKRVNICSVMEPKGVLVHSGESADFCFDSGAECSLIKESAAGKFSG FT KRLFNVVMLKGIGNDSVCSSTQILINVTINDFILEMLFHVIMDKYLKHDVL FT IGREILSQGFGVTIESDRFCIYKVKLINVVTEPKTYEECINQNVELDEDDK FT SRLINVLKP" FT CDS 1922..3676 FT /product="Gypsy2-I_DV_1p" FT /translation="MHKINDRNHDIEYYSKCTTPAESRYNSYLLETLAVFN FT AVKYYQHYLKGIKFTVYTDCNSLKASQQKIELDDKVYRWWLYLQSFEFDIV FT YRKEKRMAHVDFFSRNPITKKLQFAVKTPEMRVDLTELSDTWLIAEQQRDL FT EMTSLISKLKNNELNDNVASTYELRLGLLYRKIQRNGRTRWLPVVPRSYRW FT AVINHIHEAIMHLGWEKTLDKVYSCYWFEGMTKYVRKFVDNYITCKLSKPQ FT SGKIQQEMHPIPKIEIPWHTVHVDITGKLSGKKDQKDYVIVFIDAFTKYVY FT LHNTLKIDAANCIVSLKVVISLFGVPSRIIADQGRCFPGSGFKEFCSAHNI FT ELHLIATGASRANGQVERVMSTLKSLLTAVETGQRSWQEALKEVQLAINCT FT VNRVTKFSPLQLLIGKDVRPLGMLSNIVDEKYDNLQNLRAQAKENIENNAY FT YDKNRSDKGKANVVKHKVGDHVLLKCEERNQTKLDPKFKGPFVTTEVLEVD FT RYCLKSLTSKRTYKYPHECLRALPVETVSAEMDEDMMSSSVETDAIGTSES FT KGVGIETDEIGTSEGTGVGVETDARDCASEGTSVETGA" XX SQ Sequence 4141 BP; 1378 A; 691 C; 933 G; 1139 T; 0 other; cttcgttaat gacgaaaatg aaaaatatgg agctggagga attgttggtg tcagttgtgc 60 ttgcgcatac tgcgcaactg gacagcagac taaaaaagct tgtgtgcacc acaaacatca 120 aatcgcgaaa cgaattacag caacaattgc aggtatatga aatcgggcag cgtcaagaga 180 cttcgaaagc agaaactact gtgggacctg aaaggaagaa gcagaagttt tcaaacatcg 240 tttgccatta ttgtggaaag acagggcaca aaatcgcatt ctgccgccag caggaaaatc 300 gcgaatcaca gccattggga agcgggcatg ctggacgtat tggtatgagg acaaaagagc 360 aaaaatccaa cgttatatgc tacaagtgtg gacaggctgg acacatttcc actgggtgcc 420 cttcgactgg acaaagctct gaaaagagag tgaacatatg ttcggtaatg gaaccaaaag 480 gtgtgctcgt tcattctggt gagtccgctg atttttgttt tgactctgga gccgagtgtt 540 cactcattaa ggaaagcgca gctggaaaat tctctggaaa aagacttttt aacgtagtaa 600 tgcttaaggg tattggaaac gattctgtat gtagctccac gcaaattttg atcaatgtaa 660 ccattaatga ctttattttg gaaatgttat tccatgtcat tatggacaag tatttgaaac 720 acgatgtttt gataggcagg gaaattttaa gtcaaggttt cggagtgacg atcgagtcag 780 acagattctg catttataaa gtaaagctga ttaatgttgt tactgaaccc aaaacatatg 840 aagaatgtat aaaccaaaat gtagagttag acgaagatga caagagccgt ttaataaacg 900 ttttgaaacc atagatggaa cattttatct ctggtgttcc aaagaaacag gttaacacag 960 gccgtttgga aattcggtta atagatccaa ataaaacggt acaaaggcga ccatataggc 1020 ttagtgtaga tgagaagcaa actgttcgcg aaaaaattaa tgagttgatg gcggcaaata 1080 ttataagacc gagctgttcc ccatttgcga gtccaatttt attggtaaga aagaaaaacg 1140 gaactgatcg cttatgtttc gactataggg agcttaacgc taacacagtt tcggataaat 1200 ttccattgcc tcttatctcc gaccaaattg caaggttacg tgggggtagg tatttttctt 1260 gtttggacat ggccagcgga ttttaccaaa tacctattca cccagattca gtagagcgta 1320 cagcatttgt aacgccagaa gggcaatttg agttttggca atgccgtttg gtttaaaaaa 1380 tgctccgttg gtgtttcagc gggcaataat aaaagcgttg ttgccacttt cgtattcgta 1440 tgttgtggtt tacatagatg atgtattggt cgtggccgat gcaatttcaa gattgcagaa 1500 agtcatagaa attcttattc aagctggatt ctcatttaat ttcgaaaaat gtacattttt 1560 aaaatcaaaa atagagtatt tgggttacga agtagaggca ggtgagctta gaactaatcc 1620 tcgaaaattg taggcattag ttgaattgcc accgccgcaa acggttacgc agctaagaca 1680 gttcattgga ttagcatcat actttagaca attcgtgccg aaattctccg aaactttaca 1740 ccctctattt tgtcttacgt ctaaaggcaa taatagtttt gattggaaag aaagtcatga 1800 aatagtgaga aaggagataa tctttgcact gacccgttgc ccggtattaa ccattttcga 1860 tccacaatat ccggttgagc tgcatacaga tgctagctct gtgggttatg gtgcaatact 1920 catgcacaaa ataaatgata gaaaccatga catcgagtat tacagtaaat gtacaacccc 1980 agcagagtcc aggtataact catatctttt ggaaaccttg gcagtattta acgcagtaaa 2040 atattatcaa cattatttaa aaggtatcaa atttacagta tatactgact gtaactcttt 2100 aaaggcaagt caacaaaaga ttgagttaga tgacaaggta tatcgatggt ggttgtactt 2160 gcaatctttt gaattcgata ttgtgtatag aaaggaaaaa cgcatggcgc atgtagactt 2220 cttctcaaga aacccaataa caaaaaagtt acagtttgct gtcaaaactc ctgaaatgcg 2280 cgtagattta actgaattaa gtgacacctg gcttatagct gaacagcagc gagatctcga 2340 aatgacatct ttaatctcta aactaaaaaa caatgaatta aacgataatg ttgccagtac 2400 gtatgagttg agactaggct tgctgtatcg taaaattcaa agaaatggta gaacccgttg 2460 gttacctgta gttccaagat cttatagatg ggctgtcatt aaccacattc atgaggctat 2520 aatgcacctt ggttgggaga agaccctcga taaagtgtac agttgttact ggtttgaagg 2580 tatgacgaaa tacgtacgaa agtttgttga taattacatc acgtgcaagc tatcgaaacc 2640 gcaatctggg aaaattcaac aagagatgca tccgattcct aaaatcgaaa taccctggca 2700 tacggtacat gtagacatca ccggtaaact aagtggtaag aaagaccaaa aagattatgt 2760 tatagtcttt atagatgcgt ttacaaagta cgtttatttg cacaatacac tgaagataga 2820 tgcagctaat tgtatagtat cactgaaggt ggttatttcc ttgtttggtg taccatcccg 2880 tatcatagca gatcaggggc gatgctttcc aggatctggg tttaaagaat tttgctctgc 2940 acataacatc gagttgcatt taattgccac aggagctagt cgagcaaatg gacaggtcga 3000 acgcgtaatg agtacactta aaagtttact aacagcggta gagacgggcc aacgttcgtg 3060 gcaagaggca ttaaaagagg tacagttagc aataaattgc acagtcaatc gggtcactaa 3120 gtttagtccg ttacaacttt taattggaaa agatgtaaga ccattaggga tgctatcgaa 3180 cattgttgat gaaaaatatg ataatctaca aaatctaaga gcgcaagcaa aagagaatat 3240 agaaaataat gcctattatg ataagaacag aagcgataaa ggcaaagcaa acgttgtaaa 3300 gcataaagtg ggtgatcatg ttctgctcaa gtgtgaagag agaaatcaga ccaaactaga 3360 tcccaaattt aaaggcccat ttgtcacaac agaagtactt gaagtagacc gatattgttt 3420 gaaatcatta acaagcaagc gaacctacaa gtatcctcat gagtgtttaa gagcgttacc 3480 ggtcgagaca gtttctgcgg agatggatga ggatatgatg agtagtagcg ttgaaacaga 3540 cgcaattggt accagtgaaa gtaaaggtgt tggcattgaa acagacgaaa ttggtaccag 3600 tgaaggtaca ggtgttggcg ttgaaacaga cgcaagagac tgtgccagtg aaggtactag 3660 cgttgaaaca ggcgcataaa tgtgtaacgt cataatcaaa aggcctcttc gagaaacata 3720 ggtagacttg gtgcgctatc tggacattca tctgtatgat ttggctatca aataagttgt 3780 aaagattatt ttaacattga tttaaggaaa ggcgaaagaa cacttgagcg gagcatatca 3840 acatgaattg ctaaatttat acacgaggac atgtaattgt cagaattggc cgtgtcgtgt 3900 tattgttatt gttattcgaa tttaagtaca gcaacatgaa tattgtatga gtcgaaatta 3960 aaagaaagaa gaatagagaa aacagagtaa ctgtcattgt cgacttgtat tttttatatc 4020 actgagagtc ggtgagagga ctaactgtta attctctcac tgagtcagac ccgtgcatag 4080 agttgttggt gagcgtggag ttcattgtaa gctgattcgt gtaaggacac gaagaaagcg 4140 t 4141 // ID Chapaev-16_HM repbase; DNA; INV; 3036 BP. XX AC . XX DT 27-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3036 RA Jurka J.; RT "Chapaev transposons from the hydra genome."; RL Repbase Reports 9(2), 362-362 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(514..1161,1004..2587) FT /product="Chapaev-16_HM_1p" FT /translation="MPNFSKNHEDCRKSVCVLCLKKCSRELTQFLVEKILK FT HYHTSLDLTDTRVPRGICDSCRTILRRKDEGKDVNLPPLFPFSSIKVRPPT FT RDQGCDCLICQVGHLKLNEKSPVDYSKPVEDCTPQRCGDCLSLVGRGLPHQ FT CTPFTFRENLKQLATADPLAAEQIATQVISTKPSTPGGTVKISHPKGGPPL FT RVKTGDTFIYTYKDGRFRGNTFNFKTLLKSSLLNLPHLVVLSKYRILKAAH FT HLELKQVILLYIHIKMAGSEGIPLILKLKITKSVNIISFLGSSSARALFPE FT PSIKTEDLVQVQLNTGLSNXGMKKLASTINRVSDTKIVETSFLAKFEAIGK FT QLSEFFTQTSIQVLSESNSTDFVVVHCKDLAELLNEVLLSRRVFQNHIVKL FT GIDGGGGFLKVSLGIMELDSHCDSRSPPRKKLLTQRTAKDSSVKRQMLVAL FT SEGLPENFENVKQILSLIKLDKVNFVVSCDMKLANILCGLQSHASTHPCSW FT CDAESKHLSQSGSLRTLGSLVARYQEFAKSGGDIKKAKSFGNVVHEPLLSG FT SENKLILELIPPMELHLLLGVVNHMYKAMTEVWPNVVKWPSALHIQQEPYH FT GGQFAGNACHKLLSNLDLLQRIAETESAFQVFGIIDALRKFKSVVTACFGM FT VLEDDFSEKIDRFRDSYLAIMNISVTPKVHAVFFHVKQFIEIKRAPLGIFS FT EQATESMHHNFSSHWQRYKRDRNHPEYAKRLLSCVVDYNSKHL*" XX SQ Sequence 3036 BP; 951 A; 557 C; 585 G; 942 T; 1 other; cactgttgta aaaaatgacc aaaaactact ttgagctgac ccgagaaaac cgctttgaca 60 gggtgaaact acaatgtttc ctgactccaa aaatgtaaaa tttgaaatct aactatttcg 120 tccaagaaaa taattttgat ttttagtttt tagtactttg cacaataggg caaaatgaca 180 gcttttcaaa ttttgcattt ttacagcatt caaaaactct ccaaaattag catttattac 240 tctaaaacat aacagtctat atagcaagta taatttgtat ttattttaac tcagaagata 300 aaattttacc tacaatttca tggttgaact atattataaa cattttgttg catcaaaatt 360 gtaaacaagc taaaaattcc gggaataggc agtgaacatg tttattttaa gttttgcatc 420 aaaattgtaa acaagctaaa aattccggga attggcagtg aacatgttta tttaaagtgt 480 ttcacttatc agttgtgtga gaaaaatttc aaaatgccaa atttttcaaa aaatcatgaa 540 gattgtcgga aatctgtctg tgtcttgtgt ctgaagaagt gctctcgaga gttgacacag 600 ttccttgttg aaaagatcct caagcattac catacaagtc ttgatttgac tgataccaga 660 gtgccaagag gcatttgtga ctcatgtcga acaattcttc ggagaaaaga tgaaggaaaa 720 gatgttaacc tgccaccttt gtttcctttc tcatctataa aagttagacc accaacaaga 780 gatcaaggtt gcgactgtct tatttgtcaa gttggacatt tgaagttgaa tgagaagagc 840 ccagtggact attctaaacc tgtggaagat tgtacccctc aaagatgtgg tgattgtttg 900 agtctagttg gaagaggtct gccacatcag tgcactccat tcacttttag agagaatctc 960 aagcagctag ctacagctga tccactcgct gcagagcaaa tagctactca agtcatctct 1020 actaaacctt ccacacctgg tggtactgtc aaaatatcgc atcctaaagg cggcccacca 1080 cttagagtta aaacaggtga tacttttata tatacatata aagatggcag gttcagaggg 1140 aataccttta attttaaaac ttaaaattac aaaaagtgtt aatataattt catttttagg 1200 aagctcaagt gcaagagctc tttttcctga acccagcatc aaaactgaag atcttgtaca 1260 agttcagcta aacacaggac tgtccaacmt tggcatgaag aaattggcat caaccatcaa 1320 cagagtcagt gacaccaaga ttgtggagac tagttttctt gctaagtttg aagccattgg 1380 caaacagttg tctgagtttt ttactcaaac cagcatccaa gttttatcag agagtaactc 1440 cactgacttt gtggttgtgc attgcaagga tcttgccgag cttttgaatg aagtcctgct 1500 ttctcgaaga gttttccaga atcatattgt caagttggga attgatggag gtggaggatt 1560 tctgaaagtt tctcttggca tcatggagtt ggattctcac tgtgattcaa ggtcaccacc 1620 acggaaaaaa cttttaactc agaggactgc caaagacagt agtgtaaagc gtcaaatgct 1680 cgtggcttta tcagaaggac taccagaaaa ctttgaaaat gtgaagcaga tcttgtcttt 1740 aatcaagtta gacaaagtca actttgttgt ctcatgcgac atgaaacttg ccaacatcct 1800 ttgtggcctt cagagccatg caagtactca cccttgctct tggtgcgacg cagaatcaaa 1860 gcatctttct caatcaggat ctttgaggac actaggctca ctcgttgccc gttatcaaga 1920 gtttgctaag tctggtggtg atatcaaaaa ggcaaaatct tttggcaatg tagtgcatga 1980 gcctttgcta tctggttctg aaaacaaact catcttggag ctcataccac caatggaact 2040 ccatcttctc cttggtgtgg tcaatcacat gtacaaagcc atgacagaag tttggcccaa 2100 tgtcgtcaag tggccctcag ctctccacat tcaacaagaa ccataccatg gtgggcagtt 2160 cgcagggaat gcctgtcaca agcttctcag caatcttgac ctccttcaaa gaattgcaga 2220 gactgagtct gcttttcaag tctttggaat catcgacgcc cttcgaaagt ttaagtctgt 2280 ggtgacagca tgttttggta tggttctgga ggatgacttt tctgaaaaaa ttgatcgctt 2340 cagagactca tatttggcaa tcatgaacat cagtgtgact ccaaaagttc atgctgtttt 2400 cttccatgtg aagcaattca ttgagatcaa aagggctccc ttggggatct ttagtgagca 2460 ggcaacagaa tcaatgcatc ataatttcag ctcacattgg cagagataca aaagagaccg 2520 caatcatcct gagtatgcaa agaggcttct ttcatgcgtt gtggactaca acagcaagca 2580 tttgtgaaga atcaaatatt gagaagaatt tattttatgt accatttgct ctttatgtgt 2640 ttagaatcaa gaggttgttt ttgtgttttg gcttagaata actaaaagaa ttatttaaga 2700 aagtgtttat tatttccatc ttgttaatga aaaactatat cagttgtgtc agaataaact 2760 attattcaag tgaattttag acagttatgt tttggagtaa taaatgctta ttttgaagag 2820 tttttgaatg ctgtaaaaat gcaaaatttg aaaagctgtc atttcgccct attgtgcaaa 2880 gtactaaaaa ctaaaaatca aaattatttt ctcggacgaa atagttagat ttcaaatttt 2940 acatttttgg agtcaggaaa cattgtagtt tcaccctgtc aaagcggttt tctcgggtca 3000 gctcaaagta gtttttggtc attttttaca acagtg 3036 // ID Copia-9_AA-I repbase; DNA; INV; 4132 BP. XX AC supercont1.25; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_AA_; KW Copia-9_AA-LTR; Copia-9_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4132 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.25; Positions 1090244 1086113. XX CC Positions [1458-1985] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 303..4031 FT /product="Copia-9_AA-I_1p" FT /translation="MEDTQLIHVVQKTTAKEMWDALRNYHERASLSSKIHV FT VRKLVSTWMPEGGNMAEHLKTMTELQLRLTALGEELKDHWFVALMLSSLPS FT SYDGLITALESRPDDDLTVDFVKGKLLDEGRRRMEKSPEREKVFVTEPARK FT NKPNSVKQKVAKDKVCHYCKKEGHFRRDCRKLAQDRQEKERVSVAAECKKE FT FEMCLAAGSVGEKGVWYLDSGATSHMTNDASFLGKLESREASICLADGNIV FT KSTGAGSGQISCVNGEGARVKVNLGNVFHVPSLAGNLLSVGKIADLGLTVL FT FEKSGCKVLKGSEVVLVGQRNGGLYHLKQFPDKALYTEVKHSDKCEHLWHR FT RLGHRDSKAVMRIVREKLGHGLEINQCNVSSVCGPCFEGKMSRDSFPKASK FT SRASEVGELVHSDLGGPMEVATPRGNRYYIVLVDDYSRYSVVYLLQHKSDA FT ESKIREYCNMIKNQFGRFPKCIRSDGGGEFSGSALRKFFADNGIVQQMSAP FT YSPQQNGVAERKNRYLKEMMRCMLVESGLEKKFWGEAINTANYLQNRCPSS FT SIVSTPYELWSKKKPSYSHLKIFGSEAYVHVPKEKRKTLDAKSVKLVFVGY FT EEGRKAYRFLDVQTGHITISRDAKFLELCSVKEAVRAEPAATGEVFEVPLS FT SPPTEPDCESDLEEENDDFSDVNSVGSEVDLNSSLYGSADDNLSADEFPRR FT SQRSTKGIPPSRLIEEIFVAGTVAGEVEPKNLKEALACKQKQEWKSAMLDE FT LKSHSENGTWDLVDLPKGRKPVGCRWVFKLKRNAAGQVVKHKARLVAQGFS FT QKFGEDYDEVFAPVTTHSTFRLLLAMASRKRMTLKHFDVKTAYLHGQLDEG FT LYMAQPPGHVVKGKEHMVCRLRRSIYGLKQSARCWNKRLDTVLKSMGFKQS FT STDSCLYTKKVNGKIVYLLVYVDDILVGCVDDSEIEVVYKQLKKYFDMTDL FT GDLSYFLGMEVQKEPNGYSLSLKGYIKSVVEKFGLRDAKGAKTPMDAGYVK FT EEDKSCALSDGKKYRSLVGALLYIAVCARPDIAVSASILGRKVSAPTEADW FT VAAKRVVRYLKATSDWRLQYSNSGGDLVGFSDADWAGDIRSRKSTTGFVYL FT YAGGAISWCSRKQSSVTLSSMESEYVALSEASQELVWLLKLLDDLGEPYDV FT PVKVMEDNQSCITFASSERTTRRSKHIETRENYVRELCHDGILKLQYCPTE FT DMVADVLTKPLGTIKQRKFSEMMGLSASGGNKG" XX SQ Sequence 4132 BP; 1060 A; 813 C; 1247 G; 1012 T; 0 other; ataggtcatg ggcccagttg cgtgaaagtg cggtagttta attgagtggt ttgttcgtcg 60 cgattcgtga attccgattt aatcgaggac agtaaagtgt acagtgccgg gaacaaaatg 120 gccgacacca aagtttcgat tgagaagctg aacgaccaga attatgccat ctggaagttc 180 aagatgcggc ttctgttgac gcgggagaag gtgctaagtg ttgtgaccga tccgaagcct 240 gaaaatgctg atgcgacctg gatctcaaac gatgagaagg cgcaggcatt ggtccaaggc 300 caatggaaga cacacagctg atccacgtgg tgcagaagac gacggcgaag gaaatgtggg 360 atgctctgcg gaactaccat gagcgagcgt ctctttccag caagattcac gtcgtgcgga 420 agctggtgtc cacgtggatg ccggaaggag gaaatatggc agagcacctc aaaacgatga 480 ctgagctgca gcttcgactt acggcattgg gagaggagct aaaagaccat tggtttgtgg 540 ctttgatgct gtccagtctt ccatcgtcgt atgatggcct gatcactgcc ctcgaaagcc 600 ggccggatga cgatctcacg gtcgatttcg tcaagggtaa gctcttggat gaaggtcgtc 660 gtcgtatgga gaagagtccc gaacgtgaga aagtgtttgt gactgagccg gctaggaaga 720 ataagccgaa tagtgtgaag cagaaggtgg caaaagacaa agtgtgccac tactgtaaga 780 aagaaggaca cttccgtcgt gattgccgaa agcttgcaca agatcggcaa gaaaaagaaa 840 gagtttctgt ggctgccgag tgtaagaagg agttcgaaat gtgtttggct gctgggagtg 900 ttggtgaaaa aggtgtttgg tatcttgatt ctggtgccac ctcccacatg acgaacgacg 960 cctctttcct tgggaaactc gagtcaaggg aagcgtctat ttgcttggcc gatgggaaca 1020 tcgtgaaatc gactggtgct ggttccggac agatttcttg cgtgaacggt gaaggtgctc 1080 gagtgaaggt gaatctcgga aacgttttcc atgtgccatc gctcgctggc aatttgctct 1140 cggtgggcaa gattgcagac cttggactca ctgtgttgtt cgagaagtcc ggatgcaagg 1200 tgcttaaagg gagtgaagtt gtgttggttg gccaacggaa tggtggcctt taccatttga 1260 agcaatttcc tgacaaggca ctttataccg aagtgaagca cagtgataag tgcgaacacc 1320 tgtggcatcg tcgtctgggg caccgtgatt caaaggctgt aatgagaata gtgcgtgaga 1380 aacttggaca tggtttagaa ataaatcagt gtaacgtctc gtcggtatgt ggtccctgtt 1440 tcgaagggaa gatgagccgc gactcgtttc cgaaggcgtc gaagagccga gccagtgaag 1500 ttggtgaact tgtccatagt gatttgggag gtccgatgga agttgcaacg ccgcggggca 1560 atcggtatta cattgtgcta gtggacgatt acagtcggta ttcggtagtg tatttgctgc 1620 agcacaagtc cgacgcggaa tcgaaaattc gtgagtactg caacatgata aagaatcagt 1680 ttggccgttt tcccaagtgt atccgttccg atgggggcgg cgagttttcc ggaagtgcgt 1740 tgaggaagtt tttcgcggac aatggtatcg ttcagcagat gtcagccccg tattcgccgc 1800 aacagaacgg cgtggctgaa cggaaaaatc gttacctcaa ggagatgatg cgttgtatgt 1860 tggttgaatc tggcttggag aagaaattct ggggtgaggc gatcaatact gcaaattatt 1920 tgcagaatcg ctgcccgtcg tcgtcgatcg tgagtacgcc gtacgagttg tggagcaaga 1980 agaagccatc gtatagccat ctgaaaatct tcggtagtga ggcatacgtg catgttccga 2040 aggagaaacg gaagactctt gatgcgaaat ccgtgaagtt agtgtttgtt ggctatgagg 2100 agggtcgaaa ggcatatcgc ttcttggatg tgcagacggg ccatattact atcagcaggg 2160 atgcgaaatt tcttgagctg tgtagtgtaa aagaagccgt ccgcgcggaa cctgcagcga 2220 caggagaagt gtttgaagtg cctctgtcgt cgccccccac tgagcctgat tgtgagtcgg 2280 atctggaaga agaaaatgat gatttttcgg acgttaatag tgtgggatct gaagtcgatt 2340 tgaattcgtc gttgtatggt agtgccgatg acaatctttc ggccgatgag tttcctcgtc 2400 gttcgcagcg ttcaacgaaa ggaattcctc cgagcaggct gattgaagaa atattcgtcg 2460 ctggaactgt ggccggtgaa gtagaaccga aaaacctcaa ggaagcgctg gcctgtaagc 2520 agaagcaaga gtggaaatca gccatgctgg atgagctgaa atcccattcg gaaaacggaa 2580 catgggatct cgtcgaccta cctaaaggac ggaagcctgt tggttgccgc tgggtgttca 2640 agctcaaacg gaatgcagct ggacaggtgg tgaagcacaa ggcgcgtttg gtagcgcagg 2700 gtttctccca gaaattcggc gaagattacg acgaggtatt tgctccggtt acaacacact 2760 cgacttttcg gcttctgttg gcgatggcaa gtagaaagcg aatgacactc aaacattttg 2820 atgtgaaaac cgcctacttg cacggccaac ttgatgaggg gttgtacatg gcgcagccac 2880 ccggtcacgt ggtgaaaggt aaggagcaca tggtgtgtcg tttgaggagg agtatttacg 2940 gactcaagca gtctgcaaga tgctggaaca agcgacttga tacagtgttg aaaagcatgg 3000 gcttcaagca aagctcaacg gattcctgtt tgtacacgaa gaaagtgaat gggaagattg 3060 tgtacctgct ggtctatgtg gatgatattc tcgtcggttg tgtggacgat tccgaaatag 3120 aagtggtgta caaacagttg aagaagtact ttgacatgac ggatctcggt gatttgagtt 3180 acttccttgg catggaagta cagaaggaac caaatggata tagtttgtcc ttgaaaggtt 3240 acattaagag tgttgtcgaa aaattcggac ttcgtgatgc caaaggtgcg aagactccga 3300 tggacgccgg atacgtgaaa gaagaggaca aaagttgtgc tttaagtgat gggaaaaagt 3360 atcgcagcct cgttggcgcc ttgctgtaca tagcggtttg tgctcggccg gatatagcag 3420 ttagtgcgtc gatactcgga cgtaaagtta gtgcacctac cgaggcggac tgggtagcag 3480 cgaagcgtgt cgttcgctat ttaaaagcga caagtgattg gcgtttgcag tacagcaatt 3540 ctggtggcga tctggtcgga ttttctgatg cggattgggc tggtgatatc cgttccagga 3600 agtcgaccac tggttttgtg tacctgtatg ctggtggtgc gatatcttgg tgcagtcgaa 3660 agcaatcgag tgtcacgttg tcttcgatgg aatcggagta cgtggcgttg agtgaagcga 3720 gccaggaact agtatggctt ctgaagttgt tagacgatct tggtgaacca tacgacgtac 3780 ctgtgaaagt gatggaggat aaccaaagct gcatcacatt tgctagctcg gagcgaacca 3840 ctcggcgctc aaagcatatt gagacgcgtg agaactacgt gcgggaattg tgccatgacg 3900 ggattctcaa actgcagtat tgtccaacgg aagatatggt agcggatgtt ctgacgaagc 3960 cccttggtac gatcaagcaa aggaagtttt cggagatgat gggtttgtct gcaagtggtg 4020 gcaacaaagg ttgaggagga gtattggaga gacaactttc gtgccacccc tattatgtgt 4080 gcgactcgcg tgcacatctc tacactcgca gcagcggtgg caaccccaat gc 4132 // ID Gypsy-10_DPu-I repbase; DNA; INV; 10125 BP. XX AC scaffold_141; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_DPu_; KW Gypsy-10_DPu-LTR; Gypsy-10_DPu-I. XX NM Gypsy-10_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-10125 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 735-735 (2010). XX DR Genome; scaffold_141; Positions 240367 230243. XX CC Positions [4871-5350] - Integrase core CC 'ATAT' target site duplication CC LTRs are 97% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 5741..6922 FT /product="Gypsy-10_DPu-I_3p" FT /translation="MKPFHVRESNDNNLSPLSSTPAVVSCPIPPVESSISN FT LQQTKQLPSSDSVPTIVPPSVSINLEESLPPPTGPDYINNRGLITTLPPRT FT RRRPDFFMAGLVYFILIVFGCFLSGSEGFIIRDTVIFKDQPGIAISESAWT FT VVTDILLSDAELAVGAIEQHLFDLSQVAIKHRKDGIQFEKTDASSAWRDTT FT SFMAADKIDKRVLLLRRTLLTSKTRLAACALSLLGANRPKRAIFEFGGTAL FT KWLFGISTNAEFLNLNSRIKSMETHDRTVVHLLERNASIVNETLQISRGNL FT VLLQELQNQSIALHNRVDSIFDYIKKYELEQIQRTQYFEELDNTFATLDHI FT LIWLQQQLEAWEIGLTALAAGHLSPQILSSSTLQQVIKEINPHLPLGWAPR FT " FT CDS join(740..3778,3782..5584) FT /product="Gypsy-10_DPu-I_1p" FT /translation="MTTKKKLSSRYPSCGRAPSRRLHPELFLSSEEVVTAS FT TIATSSDNSSTPPTPVNSAASSIQVDDSENDDNNSTSYNSAEDSDSDSSVG FT TVQHVLQLPIVVPTMAVSVQKFIAPPIFSALSSDDVLDWLERYEMAAIYNR FT WSNDDRARNFPMYLDGAARKWFLVHTHPNHWENLPIRPNPADPTLPQLPAA FT TGLKDQFTAAFKQANFSLVQESKIRQREQGNEEDVASYFNDIVNMCRMVNP FT AMSQEQQLQYLYRGLKPTLFAKIYPLQPPTTADFLELAKLHAEASILVNNK FT VLSQSLLAVSQEQPPTTSQTTTLTDSVQTNLVTQFAGILNQLNQTMQGLQR FT PQNFISSGQIGQRQNNWNLSGRNKKTAEGRYICGFCNRIGHLDIKCFQNPN FT SPLFKNLGQQYRPIPPGNSTFNGSQPRFGQPTNSRNEPQWSVNNMTVTSNE FT GDQEFPSPSINNVVLSSSPQPNPYPIIQHTDVNPTVKNDQTHEGDVPILST FT DISCLIKERVLCGTVPTIVILDTGAAISVLSPTLLQQAGYELSEWQGPPIL FT LANGTRAAPLGVTYISITHNDRTFSGKAVIMNMEGMDFLMGNDFLKQFGQL FT HIDYTNDDTQVTLGELPFAQSTVMFPTTVKASRLQAQTGCTIPAFSVMPVA FT VQSKEILSTVSLFTPSVKLMDTKHLSVGHAILPQCVQYIPVANLSAVPVWL FT DQGSTLGTIQPYSGDIHPCELDKHTNTPNSTSVMMDMTLDEQDLLLILEKQ FT INHELSPDNKNILLALLRHHIQGFAKDDNDLGYCTVAEHQINVKEGTTPVF FT QRPYGSAWKARQIIQSLSDDMLVAGLIEFSDSPWGAPVVLIRKKDGTWRFC FT VDYRGLNAVTVRNVYPLPRITNILGKLKGAEYFSIMDLQSGYHQIPLRKED FT REKTAFITADGLFKFKDLPFGLTNGPSSFQRSMDVILGGLRWTSCLVYLDD FT GVVYAPTFDSHLNRLHLVLSCLIKAGLKFKGSKCKFSMTTLKVLGHVVSKH FT GIAPDPDIEAVVNFPECDQGRTTAQIIKRIQSFLGLCSYYRRHISGFAKIA FT HPLTMLTKQNVPFVWGESQRASFFVLKKALTSAPTLAHPDYESPMIIMSDA FT CGYGIGAVLSQIKEQKEHPLAYASRLLSTSEMNYSITEKECLALIWSLHKF FT RSFVWGCKIIIVTDHEALCWLRTKKDLAGRLARWSLCLLECDIEIRYRSGK FT LHTNADCLSRFPVENPTEEIDDRCLVVATVDQLDSILSSAEADQEFQKQQR FT SLPTWKSAIEKLKDTSVVEKRFCVREGKLFRLKSRLGKVYLRLCVPQDFRA FT SILKACHDDVISGHLGIQKTLAKTTQRFYWPELTKDVVDYVRACTSCQTRK FT SPKTTPAGLLQCIKVERPFQKIGIDLLGPFTTSHSGNKMIIVVVDYLTKWV FT ELDALPTGKADVVTTFFVNHVVLRHGLPETMISDRGKCFLANLTQSTLHKL FT GVKHKTTSSYHPQTNGQVERMNHTLAMMISMYVAEDQKDWDEQLPHICFAY FT NTSRQDTTGFSPFFLLYGREAILPIDVFLGAQPNPWLDVPQANVPYADRLL FT NDLHEARNLVRIRIQRAQEKQKQIYDAQHHDVSFQKGDVVLVELVLC" XX SQ Sequence 10125 BP; 2906 A; 2133 C; 2017 G; 3069 T; 0 other; attggtggag atgcaggacg aaaacccaac ttttcgtacg ggtcgttttc tttttcccat 60 acatcttttc ttttctcttg gctttgtttt cattgatcat tagttcctct tcatttttac 120 tctccgtttt agtcaattat ttgaacttct cttttgatag catttgtgtt ctctctcttt 180 ttcccctctc tggttttctt tctttcgttg caaactatat cagcctgttt ttttcttttt 240 tggggggagg gtaatgtttt gattcttgct gattttaatt ttcgttcaaa cgcatcgcgt 300 cacgtcgtct tttatatcat acagttcatt gtttcacggg ctcgtttttt ccgtaagtcg 360 ttcgaatttt tttattagag gccaatcaaa cgtttggtct tcgcctcatt gttgggtagt 420 gacattatcc tcttttaccg cgctcgattt tgcgcatatt ttagcgtagt attttttctt 480 ttttttttgt tgtttcggtt ttctctttaa tttttcacct cctctctgtt cggttctttc 540 ttcgatctgt tctaatttca cggtgaaaat agtaaaagct ctattgtcag tggtgtctat 600 atctaattat tatcagactt tactgtgaca tttactaccc gtacattgta aacaaccttg 660 aattcatagt ttaactgagt gattcgattt tgtgacatta cgacaaacaa cttcctctgt 720 tatatacaag tgctagttca tgactacgaa aaagaaactt tcatctcgat acccgtcgtg 780 cggtagggca ccaagccgtc ggttacaccc tgaacttttt ctctcttcag aagaagtagt 840 gacagcttca acaattgcaa caagcagtga caattcaagt acgccgccta caccagtcaa 900 ctctgccgct tccagtatac aagtcgacga tagtgaaaac gacgataaca actctacctc 960 ctacaattcg gcagaagata gcgactccga ttcatccgta ggaacagttc aacacgttct 1020 gcaactccct attgtcgtac caacaatggc agtaagcgta caaaaattta ttgctcctcc 1080 tattttttcg gcactttctt ctgatgatgt tttggattgg ttggaacgat atgaaatggc 1140 tgcaatttat aatcgctggt caaacgacga tagggctcgc aattttccta tgtacttgga 1200 tggggcggca cggaaatggt ttttagttca tacgcatccg aaccattggg aaaatttacc 1260 aataagacca aatccggcag atccaacgtt gcctcagcta ccggccgcga cgggattgaa 1320 ggatcaattt actgcagcat ttaagcaagc aaatttttct ttagtccagg aaagcaagat 1380 tagacaaagg gaacagggaa acgaggaaga tgttgcctct tactttaatg acattgtgaa 1440 tatgtgtcgt atggttaatc ctgctatgtc gcaagagcaa caacttcaat acctttaccg 1500 tgggttaaaa ccgacattgt ttgcaaaaat ttacccactt caaccaccga caacagcgga 1560 ctttttagag ctagcgaaac tacatgcgga agcttcaatt ttggtgaaca acaaagtgtt 1620 gtcacagtct ttattagcgg tttctcaaga acaacctccc actacttctc agacgactac 1680 acttactgat tctgttcaga ctaatctagt tacacaattc gcgggaattt taaatcaact 1740 taatcaaacc atgcaaggtc ttcaaagacc acaaaatttc atctcttccg ggcaaattgg 1800 acaacgacaa aacaactgga atctatctgg tcggaacaaa aaaacggcgg aaggacgata 1860 tatttgtggt ttctgtaaca gaataggaca cttagatatt aaatgttttc aaaatccaaa 1920 ctcgccttta ttcaaaaatt tgggccaaca gtatcgacca attccgccgg ggaattcaac 1980 cttcaatggt agtcaaccga gattcggaca gccaacaaat tcacgtaacg aaccacaatg 2040 gtcggtcaac aacatgaccg taacatcaaa tgaaggagac caggaatttc cctcaccttc 2100 gattaacaac gtcgttttgt ctagttctcc gcaaccaaat ccctatccca tcattcaaca 2160 tacggacgta aatccgacgg ttaaaaatga ccaaacccat gaaggagatg tccccatatt 2220 atctacggat atttcctgcc taatcaaaga acgtgtgctc tgcggaacag taccgacgat 2280 tgtaattttg gatacaggtg cggcgatcag tgttttgtca cctacgctgc tacagcaagc 2340 tggatacgaa ctcagcgaat ggcaaggtcc accgattttg cttgctaatg gaactcgggc 2400 tgctccgttg ggtgtgacgt acatcagcat cacacataat gatagaacat tttcgggaaa 2460 agccgtgata atgaacatgg aaggaatgga tttcttaatg ggcaatgatt tcttaaaaca 2520 gtttggccaa ttacacattg attacactaa tgatgatact caagttacgt tgggagaact 2580 gccttttgca caatcgacgg tcatgtttcc aacaacagtc aaggcaagta ggctacaagc 2640 gcaaaccggg tgtactattc cggctttttc agtaatgcca gtggcagttc agtcaaaaga 2700 aattttgtcc acggtttcat tgtttacacc ctcagtgaaa ctgatggata cgaaacatct 2760 atcagtagga catgctatct tgcctcagtg cgtgcagtat attccggttg caaatttatc 2820 ggctgttcct gtatggttag atcaaggatc cacattagga acaattcaac cgtattcagg 2880 agatattcat ccatgcgagc tggacaaaca caccaacaca ccaaattcca cttcggtcat 2940 gatggatatg accctcgacg aacaagattt attactcatt ttggaaaaac aaatcaatca 3000 tgaactctct ccggacaata agaacatttt attggcacta ctacgccatc atattcaagg 3060 tttcgctaaa gatgacaacg acttaggata ttgcacggtg gcggagcatc agattaacgt 3120 caaagaggga accacgccgg tgtttcaacg accatatggc agcgcttgga aagcaagaca 3180 aattattcaa tccttgtcgg atgatatgtt agtagctgga ctcatcgaat tttccgacag 3240 cccttggggg gcaccagtag tactgattcg aaaaaaagac ggtacttgga gattttgcgt 3300 agattatcgg ggcctcaacg cagtcacggt gagaaacgta tatcctttac ctagaatcac 3360 taacatcttg ggaaaactaa aaggagccga atatttctca attatggact tacaatcagg 3420 atatcatcaa attccgctca gaaaagaaga tcgggagaaa acggcattta taactgcaga 3480 cggtttattt aagtttaagg atcttccatt tggtcttacg aatggaccca gttcttttca 3540 acgttcgatg gatgtcattt tgggtggtct acgatggact tcttgtcttg tttatttgga 3600 tgatggggtg gtgtacgccc cgacattcga ttctcactta aatcgccttc acttagtttt 3660 atcctgtctc attaaggcag ggttgaagtt caaaggatct aaatgcaaat tttctatgac 3720 cactttgaaa gtcctcggcc atgtagtgtc caaacacggc attgctccag atccagatta 3780 aattgaagca gtggttaatt tcccagaatg tgatcaaggg cggactacag cacaaatcat 3840 taaacgaatt caaagctttt tgggtttatg ttcctattat agaagacata tttcgggttt 3900 cgcgaaaatc gcacacccac taaccatgtt gaccaaacag aatgtgcctt ttgtctgggg 3960 agaatctcaa agggctagtt tcttcgttct caaaaaggca ttaacgtccg ctccaacatt 4020 agcacatccc gactatgaat cacctatgat cataatgtct gatgcatgtg gatatggcat 4080 tggagctgtt ctttctcaaa ttaaagaaca gaaggaacat cccttggcat acgccagccg 4140 tcttctatct acttctgaaa tgaattattc cataacggag aaagaatgct tggctcttat 4200 atggagtctt cacaaatttc gcagtttcgt ttggggttgt aaaattatca ttgtgaccga 4260 tcatgaagct ctctgctggt tacgcaccaa aaaggatctc gcaggccgtt tagcacgatg 4320 gagtttgtgt cttctggaat gcgacatcga aattcgctat cgaagcggta aattacacac 4380 aaacgctgac tgtttgtctc ggttcccagt ggaaaaccct acagaagaaa ttgatgatcg 4440 gtgtcttgta gtagcaacag ttgaccagtt ggactctata ctatcatccg cagaagcgga 4500 tcaagagttt caaaaacaac aacgatctct acctacgtgg aaatcggcta tagaaaaatt 4560 aaaagacacg tcggtagtag aaaaacgatt ttgtgtacgc gaagggaaat tgttccgact 4620 taaatctcgc ctgggcaagg tgtatcttcg tttatgtgtt ccacaagact ttcgggcatc 4680 tatactaaaa gcatgccacg atgacgtgat atccggccat ttaggcatcc agaaaacact 4740 ggcgaaaact actcaacgat tttattggcc ggaattgaca aaagatgtgg ttgattatgt 4800 tcgggcatgt actagctgtc agacaagaaa aagtcccaaa acaacaccgg ctggattatt 4860 gcaatgtata aaagttgaaa ggccgtttca aaaaataggc attgacttat taggcccttt 4920 cacaacttcc cattctggaa acaaaatgat tatagtggta gtcgattatt taacaaaatg 4980 ggttgagttg gatgctctac cgacgggaaa ggcagacgtg gtcactacat tttttgtcaa 5040 ccacgtagtg ttacgtcatg gtctgccgga aacaatgata tcagaccgag gaaaatgttt 5100 cttagctaac cttactcaaa gtacgcttca taagttggga gttaaacata aaaccacttc 5160 aagttatcat cctcaaacta atggacaagt cgaacgaatg aatcacacat tggcaatgat 5220 gatttctatg tatgtggccg aggaccaaaa agattgggac gaacaactgc ctcatatttg 5280 ttttgcgtac aacacttcgc gtcaagatac tacaggtttt tccccatttt ttcttcttta 5340 tggacgggaa gccattttac ctatagatgt ttttcttggt gctcaaccaa atccatggct 5400 agacgttccg caagcaaatg ttccatatgc cgataggtta ctcaacgact tacatgaagc 5460 cagaaattta gtgcgcattc ggatacaacg ggctcaagaa aagcaaaaac aaatctatga 5520 cgctcagcat cacgatgttt ccttccaaaa gggcgatgtt gtgttggttg agttggtgtt 5580 gtgttgagtt tataaaccga ttcgaaaaaa gggaagatcc gacaaactac tccatcgatg 5640 gattggacct tatgttgtca tacggcgcac aacgccggtt aattacgaag tgaaattgca 5700 acagggccga cataaatcag atatagttca cgttgtcgcc atgaaaccat ttcatgtacg 5760 tgaatccaac gacaacaatc tctcgcctct gtcttcaacc cctgctgtag tttcgtgtcc 5820 tattccccca gtagaatcat caatttccaa tcttcaacaa accaagcaac ttccgtcttc 5880 ggactccgtt cccaccatag taccaccttc ggtgtcaatt aatcttgaag aatcgttgcc 5940 acctccgacg ggaccggatt atatcaataa tcgaggtctc attactactc ttcctcctcg 6000 tacccggcgt cgtccggatt ttttcatggc agggttagta tactttatac tcatcgtgtt 6060 cggttgtttc ctgtcaggaa gcgaggggtt tatcattcgg gacacagtta tttttaaaga 6120 tcaaccagga atcgccatta gcgaatccgc ctggacagtg gtaacggaca tcttgctcag 6180 tgacgctgaa ttagcggttg gggctattga acaacatctc tttgacctat cgcaggtcgc 6240 gattaaacat cgcaaagatg gaatacaatt tgaaaaaacg gatgcatcct cagcatggcg 6300 cgacacaact agttttatgg ccgcagataa aatcgacaaa agagtattgt tgttacgtcg 6360 aactttactc acatcgaaaa ctcggttggc ggcgtgtgca ctttcgttac tgggtgctaa 6420 tagaccgaaa cgcgccattt tcgaatttgg gggtacggcc ctgaaatggc tatttggtat 6480 ttcaacaaac gcggaatttc taaatctcaa ttctcgtata aaatccatgg aaactcatga 6540 ccgaacagta gttcatttat tggaacgtaa cgcatcaatt gttaatgaaa cattacaaat 6600 ttcacgagga aacttggtac tgttacagga actgcaaaac caatcaattg cgttgcacaa 6660 tcgtgtggat tctatttttg attatattaa aaagtatgag ctcgagcaaa ttcaacggac 6720 acagtatttc gaagaattgg ataatacgtt cgccacattg gaccatattt tgatttggtt 6780 gcaacaacaa ttagaggcat gggaaattgg gttgaccgca ttggcggcag gccatttatc 6840 accgcaaatt ttatcttcgt caacgttgca acaagtcata aaagaaatta atccacatct 6900 gccactgggg tgggccccca gatagcacag agcagtccca aagacgtcca aaacacgtca 6960 ctttgagacg tctgagatgt acttggaaca tagcagattc ccaagtttac attccttgga 7020 cgtcttttac gtctagccaa tgtccgcttc gccgagatcc gatagcggtc cctatcccgc 7080 cccatgggat ggactttaga cgtacttggg acggaaaaag gaaatcaaga agttataaat 7140 tataaatctt tttctggact ttgaatttcg tccctttttt ctttactctt ctttttctta 7200 tgagtgtttg ttgtgccgac attttttcac gaacgctcat ccggatatcc ggacgacatg 7260 tttatttgtc aaagtatgca ttgggttacc tgtcacatta tgaatcaact gattacgaac 7320 gagatagttg aataaatatg caatcaccct gcaccgggga tcgaacccgg atcgtcgtgg 7380 tgagaaacca gtattataca gatgggccac gaattcacat ctatagtttt gaagataaaa 7440 gctaataata agattttaga atcaaacttt gccacgaaaa gacaaagaca gtttcgccga 7500 aattttctaa gtgccgagaa ggagccaaaa aataaaaacg ggacggatca gagacttcta 7560 cgagagggag taaaaaatga cgtatccgac aagatgaaaa tgagtaacgt gttcaaataa 7620 gggctcgcaa tgggatatct tagagattcc cctgtgccca ccgcaacatc cgtacggctt 7680 tggtcccgcc ttgggacaag aattcctatc tgggccattc cttcggacga tttatgggtg 7740 ctttatcgag aatctaaagt atcagttgct gtggtcgctc ataaaattcg cctgtttatt 7800 gaaattccga tattcgatca cgctcaacat tttaacttcc cgtagaaaaa gtttagctta 7860 gccgatagcg cgactgcctg gtgcaaactg gttccaagct ttaggcttgg aaccagcttg 7920 caccaagtat accccaatga agtatttgat ggacttcttt ttaatgctga aactaaagcc 7980 actccaagca tgtttttttc tgtattatat ttcccctttt tatttatgct tgcgcaaggc 8040 tatttcaagg caatagtttt taacttttta aattgattgc tcagaatgag gctattccaa 8100 gcacagcaaa catttcatgt ttgctgtgct tggaataacc ttggcgcaag cataaatatt 8160 aacaaattaa ctgattatat attataggct aattatgaaa taaatattgc atatatgttt 8220 catttgaatt ccttatatga ataaaaatat atttatactt taaagttgca ttttttcttt 8280 caagatcaac actgcacagc agcggtagga tgcttatttt agccctattt cagcacttta 8340 agattactta acacgttaag cattttggta agaattataa acaaactgaa aagcaaaatg 8400 tcatgcttct tcgaagtgta acatgcttac aaaacaataa gtatgattat aaaatgaaat 8460 taagtatgtt tacagaacaa gtacttttag caacacactt atcatagatt taacgtactt 8520 ataaagtttt aaaaggtatg ttaatgctta taagtatgtt aaataaaaaa gaacaagtcc 8580 tttaagcgac acaattatca taaacttaac gtatttactt tatttataaa cgtactccaa 8640 tgttttctct gctgcaacga aaaaatgaca gagtcctttt cattaaccat taatgtaatt 8700 actacataca atttgcctct attgataagg tctctatagc tgagtggtta gcaagctggg 8760 ttgaaatatc agtggttgat ggttcaaacc tcatcaaagt cagagctttt tttcaattat 8820 acgtaaagtt tgcgtatcac aacattaaaa taaaaggctt ggggtaacct tgaataaaca 8880 agtttggtcc aagcttaaaa acagtattat taaaagttaa agtttttaag attggaatcc 8940 gcgagcttgt gctgggcttg gtaacaagtc tagcgcaagc ctgctctaag ccgtttattt 9000 tctacgggtt atttcaaatc atcacgttgt caaaagcctt ggcgaacggt actcatggta 9060 gctgttacaa caatctaccg gactatttag ccgtctcacc ggatctcgac atttttgctg 9120 agttaactca tttagacata ctaaaatgtc gctcgtttga taaacaagtg tgcatgtttc 9180 acaccggttt tgcaaaacgt ggatcccgca agtcttgcgc agttgcccta ttcaccaatg 9240 acgaaactca tgttcgggaa tattgtcaga ctcttttcat ctcctggaat ggtccggaag 9300 cggtttattt aggaaaaaat caatgggcat tctcagcagt aggacctcat acagtggttt 9360 tttcttgccc gccggatagt aaacatctac ccccacaacg tcgtcaatta ccttcagtgg 9420 gttatcttga agtacccccg ggttgtacag cccgtacaga tcattggatt ttgccggcca 9480 gtttcgaagg atctacaatg atggaagctg tctctgtacc gttacctcaa tttccggaac 9540 ttcgaccgaa cattactgcg ggtaaaacgg caacggtagt tttctttcct tctatgaatg 9600 atacccattt agataaggtg agtgcactgt tgtctcagca aacgtctgtc gaagttcaag 9660 cgtccatgac ccataatcaa attgtgaccc tgttagagtc gaaggaattg ccttcggcac 9720 atccctgtgc ttatccatac gaatggatag ctgcgtttgc ctttcttttt ctttgttttg 9780 gaggcctttc tggatttact tggtggctgt ttcgacaact ctcggatcac attcgctgta 9840 gcagtgggat atatgacgtc gccgactacg ttccagctgt acgtcgtctg gatgaaccca 9900 tacgtgcctc ctgtgcatct ctctgaacac ttccgttatt gtgtattttg tttcttttgt 9960 ttggttactt ccattatttt tcatttcttt ttcttttctg gtcttcttta ttgttttcta 10020 ttattttatg ttttgcctat atataaatat gttttctttt tgttttgttt gtggaatgaa 10080 tatgtcatgg tcaatcgggg cgattgcctt caacaggggg ggatc 10125 // ID Gypsy-90_AA-LTR repbase; DNA; INV; 1352 BP. XX AC supercont1.249; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-90_AA_; KW Gypsy-90_AA-I; Gypsy-90_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1352 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.249; Positions 151168 152519. XX SQ Sequence 1352 BP; 383 A; 326 C; 347 G; 296 T; 0 other; tgttaccgac caatcgttac caattaaatt acccgcgtta tggagaccaa gaggctgttg 60 taaacggtag attaggagat aggagttttg tttgattgtg gcattataag aaaaagtaag 120 ggatattaga taatctagta taaacttctc tttaattata acaacccccc cccccctaac 180 tactaccctg acatttgaga acgtttcaca tctcgtctga atgattacct gctctacgaa 240 agtggtaccc ctgagggccc agtcgatact tccggatgag agtaaaagag ggaagagaga 300 gaggaggaca agagagatgg gaagagaaat tgactggggt agaaggaaag agagaaagta 360 tccagatgtc aagaaagata gacgtcgaaa tcgaactgga aaagaaaaga ttttcaaaaa 420 gttttgttcg ggccaaggag ataactcacc cgtcgaaaac gttttggggt ttccctgctg 480 gatccccgcg gtcgaatcgg caagaagaac tccgaagcag aagtgtcgtc ggacgtagag 540 gaggattaga tagttcggtg cacggaaagc actaagcgat cgtgggaagc ttgcgcacct 600 caaggacacg tggatccctg taaccaccct aagaaccatc ggtcggcgag ccgggtgctg 660 gtgaccaacg agcaacgcca tcgttgagcc acacgccacg ggacgaggta accaaagttt 720 tggaggtgag ggcggtagtg aggtccggct ggtttacccc gctcactatc gctcaatatc 780 cgatgtgttg ccgaacccgt cagcgagcac tcaggacgtc gatcaactta ccatctacga 840 gccagccacc aagaactcag ttgaacgctt tacgtccaga aagaacccac caacgaagaa 900 ggtaccacgt ggacaacata accaaccttg gaaagccttc aggagaacct ctgggcatgc 960 aggactacgg agcagcctcg tgtgccaccc aggaccggat ttagctgaca cccaaaggag 1020 tggcaagtaa aagatgtatg tgtctctctt accctcgcat gaaggccact gggccctaag 1080 tacctctagt tccggatcgg ccgcaacgaa cccccccccc cccccttttt ctcaatgtaa 1140 aatccttaag aaataaatat gttaagctta ttctatctgt gtattgtgcc actttattga 1200 atggaagaca gtcttggttt ccctatcagt ggggttgcct ctatcctgtt tcccgtctac 1260 cttggtgcca ttgggaaggt aagtgggtcc acccgaagca agcgaacgcc gaccctgagt 1320 gggtgtagcc gcggggctag ccgtaattaa ca 1352 // ID BEL-4_AA-LTR repbase; DNA; INV; 776 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_AA_; KW BEL-4_AA-I; BEL-4_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-776 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 858-858 (2011). XX DR [2] (Consensus) XX SQ Sequence 776 BP; 228 A; 158 C; 134 G; 256 T; 0 other; tgttcagaac aaaatagtat gtaagattaa gaagatgatt tacaaatttt gaattattgc 60 tgtataccag tttcccgatg aaccctgtca gtatttagca gtcgaattcg taaagaaaat 120 aaaattagcg ttctctctca taatgcatat ctcacataca ccaaaaggtt catttggact 180 ctcactgaat tgtattcctt aagccagttg aattatacct ctcgttaaaa gcagacggaa 240 tcgtgttttc ctttcgagac gatcgagaaa tacccaatta ctgtcggtct tacccaagaa 300 gtcgcgaatt gtccactttg ggtctcgcca accgaaattc atccgtcgta tcgccgcgag 360 ttcgctggtt agaaggtttc gcttctgaac agttgtattg tacaaatgct actggaaatg 420 ttgatttttt cactaaggtc gcacttttac tccttaataa tcaaaatatt gtactttgat 480 tcactgattt aatttaggaa tcgtttgcaa tcgaccccat taaatggcat aatcacacat 540 agaatctacc ttaaaaatta attgcctgtg tagtctaaaa gctattgtct cacttgtaac 600 ttcgcaaaaa gcatgaatcg cttagtgtta tcaccggcgt gtttaatcgc ttctaaaatt 660 catcaccgca ctcgacactc ttgttttgtc tactggatat tcatattgtg acctgtcatt 720 ataacgaaga gcaataatga ctctcctccg gtaggtagat tggattagct tgttca 776 // ID R2_DA repbase; DNA; INV; 1785 BP. XX AC U13640; XX DT 14-SEP-2005 (Rel. 10.09, Created) DT 14-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Drosophila ambigua transposable element R2 (partial cds). XX KW Non-LTR Retrotransposon; Transposable Element; R2_DA. XX OS Drosophila ambigua OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC obscura subgroup. XX RN [1] RP 1-1785 RA Eickbush D.G., Lathe W.C., Francino M.P. and Eickbush T.H.; RT "R1 and R2 retrotransposable elements of Drosophila evolve at RT rates similar to those of nuclear genes."; RL Genetics 139(2), 685-695 (1995). XX DR EMBL/GenBank/DDBJ; U13640; Positions 1 1785. XX FH Key Location/Qualifiers FT CDS 46..1563 FT /product="R2_DA_1p" FT /translation="MIIDRLLRVLPNEIGATVGNAITNAERFADDLVLFAE FT TPMGLQKLLDTTVDFLSSVGLTLNSDKCFTIGIKGQPKQKCTVVIPQSFCI FT GSRPCPALKRSDEWKYLGIHFTAEGRTRYSPAEDLGPKLLRLTRSPLKPQQ FT KLFALRTVLIPQLYHKLTLGSVMIGVLRKCDIVVRSFIRKWLGLPMDVSTA FT FFHAPHTCGGLGVPSVRYLAPMLRMKRLSGIKWPHLVQSEAASSFLEVELN FT KARGRTLAGENELTSRTAIETYWADKLYMSVDGSGLREARLFRPQHGWVFQ FT PTRLLTGKDYRNGIKLRINALPSRSRTTRGRHDLARQCRAGCDAPETNNHI FT LQNCYRTHGKRVARHNCVVNNLKRILEEKGHTVHVEPNLQGESAVSKPDLV FT AIRQNHAFVIDAQIVTDGLSLDQAHLPKVERYKRPDVITAVRRDFNVSGAV FT EVLSATLNWRGIWSNQSVKGLITNNLLTTSDSNVISARVVIGGLYCFRQFM FT YLAGYSRNWT" XX SQ Sequence 1785 BP; 503 A; 398 C; 451 G; 433 T; 0 other; cccttgggat cccagggtga cccgctgtct cccataataa tcaacatgat cattgatcgg 60 ttgcttaggg tccttcccaa tgagattggt gccacagtcg gaaacgccat aacaaacgcg 120 gaacgattcg cagatgatct agtcctattt gcggaaactc caatggggct tcagaaattg 180 ttggacacga ctgttgactt tttgtcttca gtcgggctta ccctaaactc ggataaatgt 240 tttacaattg gaataaaggg gcaaccaaaa cagaagtgta ctgtggtgat acctcagagc 300 ttctgcatcg ggtcgcgccc gtgtccagca ctgaagcgtt cagacgagtg gaagtattta 360 ggcatacatt tcactgcgga agggaggacc aggtacagtc cagccgaaga cctcggtcca 420 aagctgttga gattgacaag gtctccccta aaacctcaac agaaattgtt cgccctcaga 480 actgtcctta tcccacaact ctatcacaag ctaacccttg ggagtgtgat gataggtgtt 540 ctgagaaagt gtgatatcgt tgtacgttcg ttcataagaa aatggttagg acttcctatg 600 gacgtgtcaa ctgcattctt ccatgctcct cacacgtgtg ggggtctcgg ggtgccttca 660 gttcgttatt tagctccaat gctacgtatg aaaagattga gcggtattaa atggccacat 720 ctcgtacaat ccgaggcggc cagctcattc ttagaggtcg aactgaataa agcccggggc 780 agaactctgg ctggagaaaa tgagttgaca tcgaggacag ctatcgaaac gtactgggcg 840 gacaagttgt atatgtctgt tgatggtagc ggcctacgtg aagcgcgact ctttcgtccg 900 cagcacgggt gggtgtttca gcccacgcgt ttgctaacag gtaaagatta cagaaacgga 960 atcaaactgc gaataaacgc cctaccttca aggtctcgaa ccacgagggg aaggcatgac 1020 ctggctcgac aatgtcgtgc gggatgtgat gctcccgaga caaacaacca catcctgcag 1080 aattgctacc gtacgcatgg gaagcgggta gcaagacata actgtgtggt caacaacctt 1140 aagaggattc ttgaggaaaa gggccataca gtacacgtcg agccgaattt gcagggcgaa 1200 tccgcggtaa gtaaacctga cttggtggca atccgacaaa atcacgcttt cgtgattgat 1260 gcgcaaatag taacagatgg actgtccctc gaccaagcgc acctgccgaa ggttgagaga 1320 tataaaagac ctgacgttat tactgcagtg cggcgagatt ttaatgtgtc tggcgctgtc 1380 gaggtccttt ccgcgacatt aaactggcgg gggatctgga gtaatcaatc tgtcaaagga 1440 ctgattacaa acaatctcct aacaactagt gacagcaatg tcattagtgc cagagtggtg 1500 ataggcggat tgtactgctt tagacagttc atgtatctcg caggatactc tcgaaactgg 1560 acctagccga aacactatgt tggaaagaag acgcttgcta cctaggcata atgtaaaatt 1620 aggtataaac atcgcagttg taaacttgag gtggtttagt acgtaggcgt gatgatgact 1680 tgttgaagtg aaaccacgaa tcgtgctcgc tattacgttg gcccttaata gtatctatga 1740 agatttccca tcctcagcgg tcaaaaaaaa aaaaaaaaaa aaaaa 1785 // ID hAT-N15_AP repbase; DNA; INV; 585 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N15_AP. XX NM hAT-N15_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-585 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2115-2115 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 585 BP; 213 A; 70 C; 78 G; 224 T; 0 other; gggttcggga cttgttgcat ttgcatattg tttaataatg actgcagaat agtctaagta 60 agatataaat ttgttttaag tcataacgga ttgaaataca aaaattttaa ttgcatattt 120 tcgcatattt tgccgttttc tgagcaaaac gcatatttga caaattttgg cagtttaatt 180 gttatttata gcatatttac gcagttttcg ttttatgtta agtacatatt ttatttaaag 240 ttgataatat cagaaaatat cttattgtac agtcaaactt tacagataaa caattcatca 300 tttttatatt taataataat taaataatat attaaaaaaa acccaacctt ttttcattgt 360 aggagaagac gaagaaaacg aaatcgaaca tatacttcag taatataatg ttgatagtgt 420 tattgttgtc aaaaaaataa ctaattacaa aaaattttct aaatttaaat gaataaatca 480 aattttttaa tgcataattg catatttttg cgcatatttc ggttatttta agtgcatatg 540 tgcatgcata tttccaacaa tttgtgtgca ataagtcccg aaccc 585 // ID hAT-33_SM repbase; DNA; INV; 2468 BP. XX AC . XX DT 02-MAR-2008 (Rel. 13.03, Created) DT 31-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-33_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2468 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(3), 236-236 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(407..775,887..2194) FT /product="hAT-33_SM_1p" FT /translation="MSAETPTTNHTATIEVTTGTTELETTSAATSASNSIA FT TQHVEHEKTEPRDRHRQRKTEQNDSSIDIVVPFSENGNILTVSEPATTEIS FT RIYSSQISNDIGNFVGTIIDDLTKKPFNIAMETFSKGTLLQILCIIFIPPD FT CRISQTSNTSEVVKQPLQVFAKLFGKDGDLTMHEKATYHKEAVAAGRYFFK FT NYNEPAMDIINRVNSQRLQQVLENRQRLTPIVETVIFLGRQNIPFRGHRDD FT GSLLDIDDDDCDVMINEGNFREMLKFRVNCGDSNLENHLKTARASATYISK FT TTQNALIEGCGEEIRGQILSRVRESNYYAIMFDETTDASHKSQMTTILRYV FT IPDQEDFVGFVDMHDQNYQTEINXSIDNEPILDGKVIGQRIINTRFSKENL FT QCLQIYSLMPENIIKLNHKGVAETVSAISNVYGVILGLSIQNLPXPEFTSE FT VELWQCKCMRMKEEDGIKSLPTDLSVVFDLCDQVQYPITHKLFQIIKTCPV FT SVASAERSFSTLRRIKTWLRTRMTENRLVGLALLNVHRDIPVNVENVIDRF FT AKSRNRKRDFVL" XX SQ Sequence 2468 BP; 902 A; 408 C; 422 G; 734 T; 2 other; cagtggcgga tccagaccat ctagtaaggg ggcgatttag gtaatgacgt catatatagg 60 tatcccaata gtccgattta aaattcgcac cagacaattt tattaaattt catatcactt 120 ttaaacagtc tacttaataa gcctaattta cttccatttt caaacaacat gctgtgttag 180 ataaaaaaaa caataaatag taaatacaaa ttacttttaa aattaaaatt tttaaataat 240 tgtgttttaa taatagttta ttataaaaaa atattataga ttgttgctat aatcaatagt 300 attatcaatt aaacattatt aatatttttc tagacgctat gaagagaaca aatccttttc 360 ggtcacagag tagtttggat tcatttttgt ggcaacgaag aaagaaatgt ctgctgaaac 420 acctacaact aatcatactg ctactattga ggttactact ggaactactg aactcgaaac 480 cacatcagcg gcaacatcgg caagtaatag tattgctact caacatgttg aacatgagaa 540 aacagaacca cgtgacagac acagacagcg aaaaacagaa cagaatgaca gctcaataga 600 catagttgtg cctttttccg aaaatggaaa tattttaact gtgtctgagc ctgcgaccac 660 tgaaattagc cgaatatatt caagtcagat ttcaaatgac attggaaatt ttgttggaac 720 aattattgat gatttaacta aaaaaccttt taatatcgcc atggaaacct tcagctgact 780 acaaaatgcc gcattcagtt cacaagaaga aaaataatca agaaaagcgg tatcttaatc 840 attctcatct ggaaaagtac gaatggttag tgtattcaga tagtaaaaag ggactttact 900 gcaaatattg tgcattattt tcataccacc tgactgcagg atctcacaaa caagtaacac 960 ttcagaagtt gttaaacaac cattacaggt gtttgctaaa ctgtttggca aagatggaga 1020 tctaacaatg cacgaaaaag caacatatca caaggaagct gttgcagcag gaaggtattt 1080 ttttaaaaac tacaatgaac cagcaatgga tattataaac agagtaaatt ctcaacggct 1140 tcaacaggtt ttagaaaatc gtcaacgctt gacgcctatt gtggaaacag ttatattttt 1200 aggaaggcag aatattcctt tcagaggcca cagagatgac ggcagcttat tagacataga 1260 tgatgacgat tgtgatgtaa tgataaatga aggtaatttt cgagagatgc taaaatttag 1320 agtaaactgt ggcgattcaa atttagaaaa tcatttgaaa acagctagag cgtcagcaac 1380 atacattagc aaaaccacac aaaatgcatt aattgaaggc tgtggagaag aaatacgtgg 1440 acaaattttg agtcgtgtta gagaatcaaa ttattatgct attatgtttg acgaaacaac 1500 agatgcatca cataagtccc aaatgactac aatattgcgt tatgtgattc ctgatcaaga 1560 agattttgtt ggatttgttg atatgcatga tcaaaattat caaactgaaa ttaatwcatc 1620 aattgacaat gaacctattc ttgacggcaa agttattggt caaagaatta taaatacacg 1680 tttttccaaa gaaaatttgc aatgtctcca aatctattct ttgatgccag aaaatattat 1740 taaactgaat cataaaggtg ttgccgaaac tgtttctgca atcagtaatg tctatggcgt 1800 cattcttgga cttagtatcc aaaacttacc tkcacctgaa tttacatcgg aagttgaact 1860 atggcagtgc aaatgcatga ggatgaagga agaggacgga ataaaatcac tgcctactga 1920 tctttctgtt gtttttgatt tatgtgacca ggtgcaatat ccaatcactc acaagttgtt 1980 ccaaattatt aaaacatgtc ctgtaagtgt tgccagtgct gaacgtagct tctcaaccct 2040 gcgaagaatt aaaacctggc tgagaacacg aatgacagaa aacagactag tcggattggc 2100 cctacttaat gtacacagag acattccagt aaatgttgaa aacgtgattg atcgttttgc 2160 gaagtcgaga aatcgtaaac gagattttgt gttataaatt gttaaaataa taaacaaatc 2220 aaggttttaa atacgttatt agtcaaacaa ctaaactctc tagtctcaat caaataattg 2280 tatttcaata attttaaatc acttcaaaat ataatttaca acattatatt tctgcattaa 2340 atttgaacat ttttttgaca ttgcatgttt tcttaataaa gggctgcaaa caccaagtaa 2400 aagtcgcaaa aaaatttctt tcaaaataat ctaagggagg cgatgcccct tctatggatc 2460 cgccactg 2468 // ID CR1_Ele12 repbase; DNA; INV; 4503 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele12. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4503 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4503 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 140..1381 FT /product="CR1_Ele12_1p" FT /translation="MNCEICLLDSVTDSIMWSCIGRSRKFHPACVGVNIQR FT GSLRKKERRMDTSSFVLPCCSSCQTMVTASFEFKSLADQQAQLAEQINKNT FT EVIHRSIQPNSLGVIDEAIDRLEALVTDVKNQLATAKSSCFAGNVNAVKNH FT ITAMFDVALESSKSSISTAVQSIANNITDEVKGLSKDVRELASLTLDITSL FT EPMRNNPLLEIDILNELKAISANIMTNSSTEVESLDPTLDSPPSLNAELNA FT KKEDNSGWRLLGTRKVWKANWEEYDKRQLRRLDQQKQAEKARRRRKRNATR FT NSTTSRNSRLENNVNNDCNNNYTNAPRRTVFGSRRYDNDNSNKSNMNANYL FT PPDRELLAAAKNQFSRPPSNYRPTIRFEKRETLNPYQPDGPTNPQHIMTQR FT SPPSPPCEACACKHSCFRRT" FT CDS 1348..4398 FT /product="CR1_Ele12_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRMQTFVFSSDLTLSSEEVDDNLTRSSEVEYKQTTQI FT ALDSFNEINSSSSSVNSSGTSNEIITYCQNFNRMRSAAKLTEIHRKVSGCA FT YSVILGTETSWDESIRSEEIFGNNYNVYRADRNYQWSEKKSGGGVLIAVSA FT KLNSEIIVTTKFKEFEHVWAKVQLSGETHIFVSVYFPPNNACKETYDKFFS FT VAENVMSSLPPEVKIHIYGDFNQRNIDFIPDMDNDYILLPVIGESELLQFI FT FDKIANLGLNQINHVKNQQNCYLDLLLTNVSDDFCVNEALTPLWKNEVFHT FT AIEFSQFISNTDFYIDNEFEEVFDFNSANYDNIRNKLNNINWQSILRSKDD FT VNNSTEVFYNCLFKIILEEVPIIKRRCNGFSKHPVWFNKEIINLKNRKQKL FT HKIYKKHKSQENLQVYLNMSVQLNSAMRLAYESYNSKTEQEIKNCPKNFFN FT YVKSKLKSNNFPSTMYFNEKVGHTSQDICNLFSNFFQEVYTTYSEADRDYE FT YFDFHPEFPNDVGISHINVHEILNGLKNLDATKGNGPDGVPPIFMKTLAIE FT LTTPLFWLFNMSLESGVFPKMWKSSYLVPIFKSGKKSDIINYRGIALISCI FT PKLFEAIINEKVFIQIKNRITTAQHGFFKGRSTATNLLEFVNYSLSAMENG FT NHVEALYTDFSKAFDRVDIPMLIFKLQKIGIEAKLLKWIESYLTTRELIVK FT FKGKKSTPIHATSGVPQGSHLGPLLFILFVNDLSFILDKIKVLIYADDMKL FT YLEIKTAQDIHTFKHEVQIFHTWCQKSLLQLNVKKCNLISFNRKRDIPNVT FT ITLGNQNVEKCNRIRDLGIILDSKLTFIDHYNTMINKANNMLGFIKRFAYN FT FNDPYTIKTLFIAYVRSILEYCSIIWSPFSTVHDERIESVQKQFLLYALRK FT LGWTEFPLPSYKARCMLINIQTLRERREYAMISFINDIVSHKVDSPELLSK FT LNFYVPSRRLRHRELFAINYHRTNYAKFGPMNQMMTTYNKHYNSIDLTWSK FT AKLKQYFRTLS" XX SQ Sequence 4503 BP; 1561 A; 831 C; 806 G; 1305 T; 0 other; cattgcgtta ttgatcgtca atcagttcgg acgcgttttt aatcgctccc ggtcgcgcac 60 ttttcattta aattgttatt gctaacgttt cgtttttttt ttgtacgtgc tagataccgg 120 tgtaaagtgt tttgtgaaaa tgaattgtga aatttgcttg ttggactccg ttaccgactc 180 aattatgtgg tcgtgcattg ggcgctcgcg taaattccac cctgcttgcg tgggagtcaa 240 tattcaaaga ggatcgctcc ggaagaagga aagaagaatg gacacttcat cctttgtgtt 300 accatgctgc agttcatgtc aaaccatggt tactgcaagc tttgagttta agtcgcttgc 360 tgatcaacag gcgcaactag cagaacaaat taacaagaac acggaggtga tccatcgttc 420 gattcaaccg aacagtttgg gcgtgatcga tgaagccatt gaccggttgg aagctctagt 480 gaccgatgtc aaaaatcagc tcgcgacggc aaaaagcagt tgttttgctg gcaatgtaaa 540 tgctgttaaa aaccacatca ctgcgatgtt tgatgttgcg cttgagtcgt ccaaatcaag 600 catatcaacg gcagtgcagt caatagcaaa caatatcact gacgaggtga aaggtcttag 660 caaagatgta agagaattag catctttgac cttagacata acttcgttgg agcctatgag 720 aaacaaccca ttactggaaa tagatatttt gaatgagctg aaagccatat cagcaaatat 780 tatgacgaac tccagcaccg aagttgaatc actagacccc acattggatt cacctcccag 840 tttaaatgct gaattgaatg caaaaaaaga ggacaactct ggctggcgac ttttaggtac 900 gaggaaagta tggaaagcga attgggagga atacgacaaa cgccaactcc gccgcttgga 960 tcagcaaaag caggccgaga aggctcgaag gcgacgtaag cggaacgcaa ctcgaaattc 1020 tacaaccagc agaaacagcc gtttggaaaa taatgtaaac aacgattgca acaacaatta 1080 caccaatgcc ccgcgccgca cagttttcgg cagtcgtaga tacgataacg acaacagcaa 1140 taaatctaat atgaacgcaa attatcttcc acccgatcgt gaactccttg cggcggcaaa 1200 aaatcaattt tctcgaccac catctaatta tcgtccaact attcgatttg aaaaacggga 1260 aacactcaac ccttatcaac cggacggccc cacgaatcca caacacatca tgacacagag 1320 aagtccacct tcaccacctt gcgaagcatg cgcatgcaaa cattcgtgtt ttcgtcggac 1380 ttgacgctct cttcagagga agtagatgac aacttgacgc gttcttcaga ggtagagtat 1440 aagcaaacga cccagatagc attagatagt tttaatgaaa ttaactcatc gagttcttca 1500 gttaactcat caggtacttc aaatgaaatt ataacttatt gtcaaaattt taacaggatg 1560 agaagcgcag ccaaattgac tgaaattcat cgtaaagtat caggttgcgc atactctgtt 1620 attctgggaa ctgaaacgag ttgggatgag agtatacgga gtgaagaaat ttttggaaat 1680 aattacaatg tttatagagc tgatcgtaat tatcaatggt ctgaaaaaaa gtcgggtggc 1740 ggagttttaa tcgcagtttc agcaaaattg aactctgaaa ttatcgttac cactaaattt 1800 aaagaatttg agcacgtgtg ggcgaaagtg caattgtctg gcgagacaca tatttttgtc 1860 tccgtatact ttcctcctaa caatgcgtgc aaagaaacat acgataaatt ttttagtgta 1920 gcggaaaatg ttatgtctag tttaccccct gaagttaaga ttcatattta cggtgatttt 1980 aatcaacgca acattgattt tataccagat atggacaatg attatatatt actcccagtc 2040 attggagaaa gtgaattgct acagtttata tttgacaaaa ttgcaaactt aggtcttaat 2100 caaatcaatc atgtaaaaaa tcaacaaaac tgttatcttg atctcttgct gacaaacgtc 2160 tctgatgact tttgtgttaa tgaggcattg actccacttt ggaaaaatga agtgtttcat 2220 acggcaatag aattttctca gtttatttct aatacagatt tctatattga caatgaattt 2280 gaggaagtat tcgactttaa ttcagctaac tatgacaaca ttagaaataa attaaataac 2340 attaactggc aatcgatttt gagaagcaaa gatgatgtaa ataattcaac agaagttttc 2400 tacaactgtt tgttcaaaat aatattagaa gaagtaccaa tcattaaaag aaggtgtaat 2460 ggtttttcaa aacatcctgt ttggttcaat aaagaaataa taaatttgaa gaatcgtaaa 2520 cagaaactac ataaaattta caaaaagcat aaaagtcaag agaatttaca agtatatttg 2580 aatatgagcg tacagctcaa ctcagcaatg cgattggcat atgaaagcta caactccaaa 2640 actgagcaag aaatcaaaaa ctgtccaaag aatttcttca actatgttaa atctaaacta 2700 aaatccaata actttccttc tacaatgtat tttaatgaaa aggttggcca tacctcacaa 2760 gatatatgca acttattttc aaattttttt caagaagtat acacaacata ctctgaagct 2820 gatcgtgatt atgaatattt cgatttccat cctgaatttc ctaacgatgt tggtatcagt 2880 catataaatg tgcatgaaat attgaatggt ttgaaaaact tggatgcaac caaaggaaat 2940 ggaccggatg gagttccacc aatcttcatg aagactttag caattgaatt aactactccc 3000 ttattttggc tctttaatat gtcactggag tctggagtct ttccaaagat gtggaaaagc 3060 tcataccttg ttcccatttt caaatctggc aaaaaatctg atattattaa ctatcgtggc 3120 attgccctta tttcgtgcat cccgaagctc tttgaggcca tcataaatga aaaagtattc 3180 atccaaatca aaaacagaat aactacagcg cagcacggct tttttaaagg gcgatcgaca 3240 gccactaatt tgctcgaatt tgtaaactat tcattgagtg ctatggaaaa tggaaatcat 3300 gtagaagctt tatacacaga ttttagtaaa gcatttgacc gtgttgatat acccatgctt 3360 atatttaaac tgcaaaaaat tggtattgaa gccaaattgc tgaaatggat tgaatcatat 3420 ttaaccactc gcgagctaat agtgaaattc aaaggaaaaa agtcaactcc aatacatgcc 3480 acttcagggg ttccacaagg ctctcattta ggtccgcttt tgtttatact gtttgtgaac 3540 gatttgtcgt ttattcttga caaaataaaa gtattaatct atgctgacga tatgaaactt 3600 tatttggaaa taaaaactgc acaggacata catactttta aacacgaagt acaaatcttc 3660 cacacgtggt gtcagaaaag cctactacaa ctgaatgtaa aaaaatgtaa cctgatatct 3720 tttaacagga aaagagacat tcccaatgta acaataactt taggcaatca aaatgttgaa 3780 aaatgtaata gaatcagaga cttaggcata attttagact caaaacttac tttcattgat 3840 cactataaca ctatgataaa taaagctaat aatatgttag gcttcattaa acgcttcgct 3900 tataacttca atgatccata cacaataaaa acattgttca ttgcctatgt tagatcaata 3960 cttgaatatt gcagcataat ttggtcacct ttttcaacag tacatgatga aaggatagag 4020 tcggtgcaaa aacaattttt attatatgct cttcgcaaat taggttggac agaatttcca 4080 ttaccatctt acaaagctag atgtatgttg attaatattc aaacactaag agagcgtcgt 4140 gaatatgcaa tgatttcatt tataaacgat attgtatccc ataaagttga ttcaccagaa 4200 ttattatcta aattaaactt ttatgtacca tccagacgtt tacgccatcg agaactattt 4260 gcaatcaact atcaccgtac taattatgca aaatttgggc caatgaacca aatgatgact 4320 acttataata aacattacaa ttcaatcgac ttaacttggt ccaaagcgaa gctaaaacaa 4380 tacttccgca cattatctta agaagaaaaa ttaggtaatt gaatgtagaa gtgtaactag 4440 cgcataagaa actctctgta acggtctaca aacatgattg acgacaataa ataaataaat 4500 aaa 4503 // ID Gypsy-165_AA-I repbase; DNA; INV; 4099 BP. XX AC AAGE02017896; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-165_AA_; KW Gypsy-165_AA-LTR; Gypsy-165_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4099 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017896; Positions 45856 41758. XX CC Positions [3192-3668] - Integrase core CC 'GTAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 24..2600 FT /product="Gypsy-165_AA-I_1p" FT /translation="MFLRSHLRSEDEAPENLGSGLPSQDAINKPDMMNAGP FT SNSNMRRKRTIVGGSGDGHVTGQNCNRVVKLDELSAFLPFFSGSPNEDANF FT FISTVEQTRKTFNVEEEVMKLMVSKHLQNNAKVWLSSQSDIFTKSYSEVLN FT LIQETFTVTISSFEIRRKLELRMWRTEETFLEYFIEKRNLAMPLKIDEVEL FT VDYVIEGIPDRHLKNQAKMSGFTSISSLLKAFQTIRLPNNMFSSAPVCFNC FT YLPGQIAAKCMRPGNHHSRATYSGAASKRSSYDGRKPGYDRPSRIAVVQNQ FT DIVNKTDDLVHDGLVDITLTDFEIDLKALFDTGSPVCLIRLGLANCGKIYP FT YISRKKFKGIGGTGLKIVGRLVTNIKIQNLFFRINCLIVPDSAIHPHDVII FT GRDVILDSRVKTIIEKSQIRVFKKEIKNNTNEAELEVNDIVLICNLKTDEH FT VGDISDDHLSRLNHILKIYYFNFQRPICPEVDYEMKIVLKSAAPFHFKPRR FT LSQNDKIKVNKKIEELLKLGIIKESDSPFASPIVVIPKKDDDIRLCVDYRK FT LNKDTYRDNYPLPLIEDIFDKLKSKSVYTILDLKSGFHQIKIASECTKYTS FT FVTPNGQYEYLRVPFGLCNAPAVFQRFINKILKSLIDEGKIVVYIDDILIA FT SETMEEHFETLRMVLEILSRNLLELNLDKCKFCFNEIDYLGYTIDKWGRKP FT SKSHIESISNFPLPKTVKDVQRFLGLTSYFRKFIRNFASVARPLYDLLKKD FT AEFVFNEKEIQAFEMLSDKLQSSPVLAIYDPKAETQLHCDASKRNYIVMQA FT FGAILLQKHNNGNYHPVGFFSKKTDEYESKLHSFKLETLAVIHALKRFHVY FT LSGIKF" XX SQ Sequence 4099 BP; 1511 A; 587 C; 719 G; 1282 T; 0 other; aatatcagaa gtgggatatc aaaatgttcc tgcgtagcca tttgagatcg gaagatgaag 60 ctccagagaa tctcggaagt ggtcttcctt ctcaagatgc catcaataaa cccgatatga 120 tgaatgcagg accaagtaat tcgaatatga gaagaaaacg tacgattgtc ggtggatctg 180 gagatggtca tgtcactggt caaaactgta atcgtgttgt caaattggat gaactttctg 240 ctttcttgcc gttcttttcg ggatcaccaa acgaagatgc aaattttttc atcagtaccg 300 tagaacaaac aagaaaaaca ttcaacgtag aagaagaagt tatgaagtta atggtttcca 360 agcacctaca gaacaacgca aaggtttggc tgtcgtctca gtcagatatt ttcacgaaaa 420 gttactcaga agttctcaat cttattcaag aaaccttcac tgtaacaata agttcatttg 480 aaatacgaag aaaactagaa ttacggatgt ggagaacgga ggagacattt ttggaatatt 540 ttattgaaaa acgaaatctt gccatgccgt tgaaaattga tgaagttgaa ctcgttgact 600 acgtgattga agggattcct gatcgtcact tgaagaatca agccaaaatg agtggattta 660 cttcaatttc aagtttactc aaagcctttc aaacgattcg tctgcccaat aacatgtttt 720 caagtgctcc tgtttgcttc aattgctatt tacctggaca aatagctgcg aaatgtatga 780 ggccgggaaa ccaccatagt agagcaacgt atagtggagc tgcatccaaa aggagttctt 840 atgatggacg aaagcctgga tatgacagac cgtcccggat tgcagttgtt caaaatcaag 900 acattgtaaa caagacagac gacttagtgc acgacggatt ggtagatatt acgttgactg 960 attttgagat tgatttaaag gcattgtttg atacaggaag tccagtttgt ttaatacgtc 1020 ttggtttagc aaactgtgga aaaatttatc cttatattag tagaaaaaaa ttcaagggta 1080 ttggaggaac tggacttaag attgtgggta gattggttac taatataaaa attcaaaatt 1140 tgtttttcag aattaattgt ttgattgttc ctgattcggc aatacatccc catgacgtaa 1200 taattggaag agacgttatt cttgattcta gagtgaaaac aattatcgaa aagagccaaa 1260 ttcgcgtttt caaaaaagaa attaaaaaca acacaaatga agctgaattg gaggtaaacg 1320 atatagtttt aatttgtaat ttgaagactg atgagcatgt aggtgatata agtgatgatc 1380 atttgtcaag attgaatcat atcttaaaaa tatattattt taactttcaa cgacctattt 1440 gtcctgaagt agattacgaa atgaaaatag tactgaaatc tgctgcacca tttcatttta 1500 agccgagaag actttcccaa aatgataaaa ttaaagtaaa taagaaaatt gaagagctac 1560 tcaaattggg aatcattaaa gaaagtgatt ctccatttgc tagtcccatt gttgttattc 1620 ctaagaaaga cgatgacata cgattgtgcg ttgattatag aaaattaaac aaggatacat 1680 accgtgataa ttatcccttg ccgttgatag aagacatttt cgacaaatta aaaagtaaat 1740 cagtttacac aattttggat cttaaatcgg gttttcatca aatcaaaatt gcttctgaat 1800 gtaccaaata cacctctttt gtaactccga atggacagta tgaatattta agagttcctt 1860 ttggcctgtg taatgcaccc gctgtatttc aaagattcat taataaaatt ttgaaatcac 1920 ttattgatga aggaaaaata gttgtctata ttgatgacat tttaattgct tctgaaacaa 1980 tggaagagca ctttgaaact ttaaggatgg ttctagaaat tttgagtagg aatttacttg 2040 aactgaacct tgataaatgt aagttttgtt ttaatgaaat cgattattta ggttacacta 2100 tagataaatg gggaaggaaa ccaagtaaat ctcatattga gtctatttcc aattttccat 2160 taccaaaaac tgtgaaagat gttcagagat tcttaggatt aactagttat tttaggaagt 2220 ttattagaaa ttttgcatca gttgccaggc cattgtacga tttattgaag aaagatgcag 2280 aatttgtttt taatgaaaaa gagatccaag cttttgaaat gttaagcgac aaattacaat 2340 catcgccagt tttagctatc tacgatccaa aagcggaaac gcaactacat tgtgatgcaa 2400 gcaaacgcaa ctacattgtg atgcaagctt ttggtgcgat tttacttcaa aaacacaata 2460 acggaaatta tcaccctgta ggcttcttta gtaaaaaaac tgatgaatat gaaagtaagt 2520 tacatagctt caaattagaa acacttgctg taattcatgc tttaaaacga tttcatgttt 2580 atttgtctgg aatcaaattc taaattttca ctgattgtaa tgctttagct caaactttgg 2640 ctaaaaaaga aattaatcct aaaataagtc gatgggcatt gttcttagaa aattataatt 2700 acagtttaga gtatagagaa ggaatgaaaa tgcaacatgt tgatgcttta ggtagattac 2760 caatagatat agttgagaag aaatcagaaa aactgaaatt agattgctgt ccagagcaga 2820 atatatgtgt attgaatact gaagatattg aactagcaca ggaacaagat gataagatta 2880 gaaatattaa aattcactta gaatctaata cttttccaaa ctttgaaatg aaaaatggaa 2940 tattatttag gaaagattat cagagccttc tattagttgt acccaaaaca atgatagata 3000 atattatacg gatatgtcat gataagttag ggcacattgg tatagaaaaa acaatgattg 3060 aaatcaaaaa attttattgg ttttcaagta tgagaaaaat tgtgaaaaaa tatataaata 3120 attgtttaag ttgtatattt tatagtccat cagatacgaa gaaagaagga tttcttaaaa 3180 atatttatta agatcaggtt ccatttcata ctgttcattt agaccattat ggtcctattc 3240 agctaagatc atccaaaaat tttaaataca ttcttgtagt tgtagatgca ttcacaaaat 3300 ttgtaaaatt ttatcctact aaaacaacaa actctagaga agttattgat aatttacaat 3360 tatatattaa ccattatagc agaccattaa gaattgttac tgatagaggt agttgtttca 3420 cctctacatt tttcaaaaac ttctgtatgg atcgttgtat tgagcatatt aaaacagcag 3480 catatactcc tgaagcaaac ggccaagctg aacgaattaa tagaacactt actcctatgt 3540 tagccaaatt aattgatgat acaaatatta aatgggatat tttacttcca aacttagaat 3600 acgtttataa taatacattt aatagagcca ttaaaaactt tccaagtatt ttattatttg 3660 gtacacatca aaacaatata aataatgaaa ggaataatat agaaacttta gtaaaaaaca 3720 atcaactttt atcacaaaat caaaatacat tggaagttcg aactaaagca aaagaaaata 3780 taacaaaact tcaggcctat aataaagaaa tggtggacag aaaacgaaaa gctgtttgta 3840 attatagaga aggtgatttt attgtactaa aaactaatgt tgataacaaa ttatctaaaa 3900 aatttagagg tccatatatc ataaagaaaa tattgccaaa tgatcggtac tttatcacag 3960 acatagaggg ttttcaagtt tcaaatttac cattcagctc agtttgttca ccgaataata 4020 tgcaaatgtg gttatcagat tctaatttaa tttcaaatta tgacgtcgat gggtacatcg 4080 acgagtcagg ataaccgaa 4099 // ID P-1_Aplcal repbase; DNA; INV; 5081 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW P; DNA transposon; Transposable Element; P-1_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5081 BP; 1447 A; 1149 C; 1063 G; 1422 T; 0 other; catagaccta taattaaccg agactgcgcg tcacgtgact ttcgacatac acacaacatg 60 gctgccccct gtcaaaacag ttgggcaagc tgagagagcg aggtttgagc gttatcaaca 120 ggatttcgtt agatctacac gatggttcaa aattgctgtg tcgtcttttg caacaacaga 180 agtttcaagg gatgttcagt ttcatttcac aggtaaggat ttctttgtta ctgcctttta 240 tattataaat tatgggaata acaatgaatg aagaaagtga agaaaggcaa gaggacgcca 300 acaacacaaa gacaaagaca aagaaagcta agatctatct agactctaga tctagtgcct 360 ccctctcctc tctggctccc actcagagta tgaataccag atttaaaatg tccaaagcta 420 ttactattac agtgctgatc gaatcagtca gtttgaacga ggcccagttt gaaagcatcc 480 ctatcaatcc aagccttaaa taaaacaaag ttagtagagt ctacacattc aacaacatta 540 ttgattttat taatgaacat tgctttgact ctccccttag ggattatgtc gcgctaatgg 600 tcaaaacaaa agttcgttag tccgcatacc caccaagtac atatgtactt aatatcaaaa 660 caccaggctt tggtaggtca tgtaaagcag tgtagtctta ccccgcacgg atcgtacccc 720 gcacagtaca cagtgacagt gcctcggaga cataaaatgt gtgtcttgtg ctgtgggcaa 780 acctccctac cccccccccc cttcctcccg tggcagtctc agtgcctcgc ctttccagct 840 tccaacccca acatgcggtc ataaagtaat aaacagccct tacaaggatg ctatgtcaat 900 aggctcgagg tcaataaaag ttgcccagtg tgtagtgtgc acctcgcact gttcagtctt 960 tgttaggtac tcctaactcc tcttgtttca gtgtatagtt gggtgagcct acgtctcagc 1020 ttcgtgtaca ttctctcatt acacgtttca aaaggtcgac acgtttactg ctctgtccca 1080 gcacaagagt ttaggcctat tctgttgcta ttactgccct acattgtaga ggtttattct 1140 ctaagtctaa gcctcaagac cgtcatgttt cgccatggcc atgaaacaga aaaaccttag 1200 ttttgagtga gatgaaactt aaactctttg ctgatgtgta caagaaactg ctattcatgt 1260 aaaaaaagaa atcgcagcca agtacggttc taaaaaatag tgatgtattt taacagctga 1320 tctctccttg ttttttccac tttggctttg tattttactc atcgacatgt attataatta 1380 tattggcctc aaatcctctt cccaaaggat aacttgagag ttataacatt aacagtgttt 1440 aattgtgttt cctcaactga attgaactct tatatttttt tttcattctt ttcctcagct 1500 tcccccatcc catcaaagat gctgagcgga ggcaaaaatg gatctctgcc atcagcagag 1560 ctgaacctga cggcaaacct tggatgccga agaaaaacga caaagtatgc agtgaacact 1620 ttctgccaat agattatgtg gatggaaaag aaagacatac tctgaatgga aaggctgtac 1680 catctaagtt tcctggctat ccagctcaca agcaacaagc cgacaagaaa cgaccgtcac 1740 caaagaaacg taaacttccg ttaccagaag aaccatcccc aaaacgtcca actttaaacc 1800 cttcagtgtc tgttgggctt gaccacactt atcactcgcc ttctgctgaa gtcaaagtcc 1860 aacagctcca gaagaaatgg aaagaggcat gtggcaagct aaaggaaaag aacagggagg 1920 tgaaattact cagtgagaaa ttgaagagac gagagtctaa gatctcgacg ctactggaga 1980 agctgcagga gctcaagctt ataaacgagg aacactgcga aatcctgaag gaaaattttc 2040 cggaaaacac gttcttgctt attgagaatg agttggcaca acaggggaaa ctcaagtctc 2100 agttgcgata ttcagatgaa atcaaagcgt tcgctgtaac actgcactat tacagcgctg 2160 ctgcatatga gtatcttcgt acgttcctcc atcttcccca tgcagcatcg atcaggaggt 2220 gggcgtcatc gtacaacgta gagccaggat acctccgtgg cgtgttgagt ttcatggctg 2280 agcgtctaaa agttgagcca gatatgaagg atgtggtact catgtttgac tccatgagta 2340 tcaggaagga gttagtctat gtgcaaagca agggggaata cataggctgt gtaaatcatg 2400 gctacttaca gccagctagt gaggaggaac tagcaactga agcactggtc ttcatggcag 2460 tgggtatgaa gaagtttttt aaatacccag ttgcctactt cctggtagac aagatcactg 2520 ctcaacagca aagccagctt gtgaaggatt cgattacttt gctgtcagag gctggcttca 2580 caacgagggc tgtggtttgt gatagcgccc cgtctaatca agccacggct acaaaattgg 2640 gttgtgtcat cacatccaca gaggtgaacc ccacattccc ggatccaacc aacgtgaaca 2700 ggaacatcta tttcttgttt gatgcagctc acctgctgaa gaatgttaga aattgtcttg 2760 gtgaccgaaa ggttttaaaa gttgacaatc aggatgtgag gtgggaattc atttgccagc 2820 ttcataatct acagtcaaaa catactttgt ttctggctaa caagctaaga gccaagcaca 2880 ttgactatgt gaagaataaa atgaaggtga gtttggctgc tcaaacgcta agctcttcag 2940 tagctaatgc catagactac ctcagagatg acatgaatgt tcccgaattc gctgggagtg 3000 atgcgacaac cttattcatc aggacctttg acagcatttt cgacatgtgc aatgcgagca 3060 actgtctcgg aaaaggacta aaatccccaa tttgtgtgaa aaactttgaa gagaggtgcg 3120 agttagtgga caatgccata acttatattt cttcgatctg tgatgcaaat ggcaactcca 3180 tccttacagc gcgacggaag gcaggattcc ttggctttat tgccaccttg aagtctgtga 3240 tagggttgag tcgagactta cttttcgatt ccggatacag atttgtgttg cccttccgtc 3300 tttctcaaga ccacctggag acatatttta gtaaaattag aagacgtggt ggcaacaaca 3360 acaatcccaa tgccctccaa cttcagtggg ctatgagggc attactgcac agaaatgggg 3420 ttggctcctc tgctaatgcc aactgcagag caatcccaga ggtaaacaac acgctcattg 3480 aggatgcagt agcttcagat gcgacaccac aaatcaggga agaggagtct ttccataacg 3540 agagtggatc gagagagatg ccggagtcag aaagtccaca ccctgacatg gtgaatttac 3600 ttatgcgtcc tggcctctat cacaaccatg tgcttcatta tattgctggc ttcatgtgtc 3660 ggaagatcca gaagaagatt aagtgcagca catgcgcagc attcttgtcc acctcaacct 3720 cttcatcaga cgcggcaaca ttcaccagga ttcgagacag gggtggacta atttatgcca 3780 gcgacgattt gctaaatgta gtcaaaacag ctgatcggtg tctacgacga ctgctcgtca 3840 cagagaaagg aggtcccttg ttgtcagtgt cagctctcac cacacgactg gtgacacaag 3900 aagttcttga gtctgtggga ctaacatctt ttcaggacct taaagcccat tgccttgaga 3960 atcactcaat cgagcaccaa gatgaccacg ccacccagat aataaaggag ttatgtgcag 4020 tcttctgcaa cctaatgctc acacaccatg gaaggctatt taatgaacgc tttgttagga 4080 aaaacgaagg atcgcagcgt cacaagttgg caaagacaat cctcttcctt cacgtctaag 4140 tggctacccc tcccctggat tacgcttgga tgcccacaat gagctggtga gtacagaaaa 4200 taacttgtgt aacaaacaca tctctcagtc tctctctctc ctcactcttt agaactgtgt 4260 acatctttca tctttgttgt ctttataaag tcattttcct tttccttcct ctctatcttt 4320 ctcttgcccc tgaactgaag tattcttctc ctgcccaaga cagtatgcaa ccaacttgat 4380 tctttccttc cctttgcttc atgttaaaaa tgtatgccac cataatctcc ttttactaaa 4440 aaatataagt accggtttga aatgaacaat gtacgtgggt gggcaggtgt tgtgcactcc 4500 ttctttcaac tagatctgat attcatccaa gtgtcttaaa tctactgagt tctattttcg 4560 ttcttttact ttttagtttc aagtagctgt gcatctgttc ttatttgttt gttttcttgg 4620 ctattttcaa gttacaagta gatccttatg atattatgcg tttgtccata gactctactt 4680 ttaattttat gatgagatat cagcgtgcgg tgaccgaaac ttgatgtgtg ttgtcataac 4740 aacggaattt tgtattagat ccactataac tttatgtcct tacttttatc atcactgtgg 4800 tacgcgtatt gttctttata atttaaaaaa attgtggtat taatttcatc cattacatta 4860 gatacttatg tccctgttag ttacacctag agatagaaac aataaaacga acgcagtacg 4920 aatgctaagt cagggccgcg aaaggcctct ctaatctagt cagccgccca ggcagtagcg 4980 agcgattttt gcccaactgt tccgccatta ctgtctgccc tagtgcgctg gctgggaagg 5040 gagggctggc gtagcccgtg cgcagtctcg gctaattata g 5081 // ID Gypsy-3_DPu-I repbase; DNA; INV; 8217 BP. XX AC scaffold_140; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_DPu_; KW Gypsy-3_DPu-LTR; Gypsy-3_DPu-I. XX NM Gypsy-3_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-8217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 721-721 (2010). XX DR Genome; scaffold_140; Positions 135930 144146. XX CC 'ATGG' target site duplication CC LTRs are 92% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 136..1962 FT /product="Gypsy-3_DPu-I_1p" FT /translation="MTDNRQPKLEVNIKSDSPSNLVVPKGSVESKPETELR FT RSQRSSQIAGRLQQLSTRSFELKHKVLKLKEKQKSDKREAKITKLEKEIAG FT VDKEFIDIYERNKTEIPAFDKSLSSTQEDYSSEQTDTEEEMPNTPTPSPGP FT PGQQGPPGQQGPPGQPGQDGQPGQDGQPGQQGAPGQVGQQGPPGNQGPPGP FT QGPPGPQGPPGPQGSIGQIGAGGIDPGDFRTFSAAMTTGMLNLAAVNYRKD FT IPEFSGDLDDNTTFESWLKKANRVGVEAGWTEDQKLKFFQSKLRRAAAAFN FT NSLGVNIKANLAAWTTAMEAGFDDATIQDMRRAQLSKIEQKFNERIREYRH FT RIDELYKSAYGRAAAESQDAEVIQLRNAVKKEALLKGMRLTIRTNLWNSLK FT ATDTYEEVVEKAQVCEQVVDLRRLTEESEAHKKVEQQENEKNKNPELQQIM FT QAIANIQLSADSAGIAAGTVAHISEQKQEGEKQVRFSGRSRSYSPRYRERQ FT NTEDADGRTRYYPLGQRSQSDNDWRQNRRQGAQETQPQRSNSLNRTPGNWD FT NQRNGGFPGGLEDRTCFFCKNREHIKRACRKYQRWREENVRGTQAPRNPTR FT RPFNNNKNVRK" FT CDS 1878..4163 FT /product="Gypsy-3_DPu-I_2p" FT /translation="MARRKCERNSSSQKPNQETVQQQQERTEIIAFGHVST FT KQCDPSQCKIQYKPFTILVTVGSRRVAALIDTGSTTSVISSDFEKQIPQSA FT KKEKSIENLNLITACGDPMPKVKTVELQVKLHSVDSDPIKNNFHVVQNLAV FT SCILGMDLITNLEFRINTASRRISYKNNGKQYHIVAKVSEIQPCQVCVIIH FT QKLEEEIKELIEKTEIRKEDLRKVLERNKHIFAASDSDFGEAIGEEHSIPT FT VGPPVYIPPRRQPRVMLPVIEKHVEKMLANDVIEESKSPYSSPILLVKKKD FT GTIRFCVDFRFVNDVTIKDKFQIPSIEAIKDDLKGAKYFSTLDFISGYWQI FT AIKKEDRHKTAFTTQNGHYQFKRMPFGLTNAPPCFQRIMTGILRKVIGKFA FT LVYLDDVIIFSKTEEEHIKHIEEIFSLIAKSGMKLKLSKCCFFELSVLYLG FT HIISEEGSATDPKKIEVVKNYPTPENKQHVKSFLGFVGFYRKYIRNFGKIA FT HPLTELTKNDVDFVWGTEQENAFQKLKLRLTTAPILAHPDFTEEYIVQTDA FT SGYGVGVVLGQTQWVEGKRRDVVIAYGSKHLTDTQSRWSTLDKELYAIIYA FT LEKFHTYLYGPKFTVYTDCQALVKIFKNPLKNETAKVTRWVLQTQKYNVEV FT KFRPGSANANVDGLSRIPLPTKTYLTPHVTTAPICLIVESLTTEQGKDAYC FT KAARETYDKQAKRFAVREEVYQQIQGNNKGKKSGKRSTLSPPNYDLATDPQ FT ESDSEEEQFTV" XX SQ Sequence 8217 BP; 3090 A; 1769 C; 1789 G; 1569 T; 0 other; agtggtgaac gtgccaaaat attgaacctt tcaatttgta caagtgtcta aactctattg 60 tgcaaatttg aacattttgt gaactatctg gagacgacga cagttcgacc cagaaaagga 120 acatcaagga cccacatgac cgacaaccga caacccaaat tggaggtaaa tataaaatca 180 gactcacctt caaacctagt ggtcccgaaa ggcagcgtag aatcaaaacc agagacagaa 240 ctacggagat cacaaaggtc aagccaaatt gcaggacgtc tacagcaatt aagtacgaga 300 agtttcgaac tcaagcacaa agtactaaag cttaaagaaa agcaaaaaag cgataaaaga 360 gaagcaaaga tcactaaact agaaaaagaa atagctggcg tcgataaaga attcatcgac 420 atatacgaaa gaaacaaaac agaaatcccg gcattcgaca aaagcctttc atcaacccaa 480 gaagattact catccgaaca gacggataca gaagaggaaa tgcctaatac ccctacaccg 540 tcgccaggac caccaggcca acaaggaccg ccaggccaac aaggaccacc gggacaaccg 600 gggcaggatg gccaaccggg gcaagatgga caaccaggac aacaaggcgc tccagggcaa 660 gttggtcagc aaggacctcc aggcaaccaa ggaccaccag gaccccaagg cccaccagga 720 ccacaaggcc cacctggacc ccaaggatct attggacaaa taggcgccgg aggaatcgat 780 ccaggagatt tccgaacatt ttcagcagcc atgacaacgg gaatgctaaa tctagcagcc 840 gtcaattata gaaaagatat tccagaattc agtggcgacc tcgatgacaa cactacattt 900 gaaagctggc taaagaaagc aaacagagtt ggcgtagaag ctggatggac cgaagatcag 960 aagttgaaat ttttccagtc caaactaagg agagcagcag ctgcatttaa taattcgcta 1020 ggagtaaata tcaaagcgaa ccttgcagcg tggacaacag ccatggaagc gggattcgat 1080 gacgcgacga tacaagatat gagaagagca caactcagca agatagagca gaaattcaac 1140 gaacgtatac gagaatacag acaccgaatc gacgagctat acaaatcggc ttacggaaga 1200 gcagccgcag aaagccagga cgctgaagta attcaactaa gaaacgctgt taaaaaggag 1260 gctttgttga aaggaatgcg cctaaccatc aggacaaacc tatggaatag cttgaaagca 1320 acggatacat acgaagaagt agtggaaaaa gcacaagtat gcgaacaagt cgtggacttg 1380 cgacgcctga cagaggaatc agaagcccat aaaaaagtgg aacaacaaga aaacgagaaa 1440 aacaaaaatc ccgaattgca acagattatg caagctatag ctaacataca gttgtctgct 1500 gactccgcag gtatagcagc aggaacggta gcacacatat ccgaacagaa gcaagaggga 1560 gagaaacaag tccgcttttc cggaaggagc agaagttaca gcccaagata tagagaaagg 1620 caaaacacgg aggacgcgga cggacgcacg cggtattacc cgctgggaca aagatcgcaa 1680 agtgataacg actggagaca gaatcgccgt caaggagcac aagaaacaca accacaaagg 1740 agtaattcgc taaatcgtac tccaggaaat tgggacaacc aaaggaacgg agggttcccg 1800 ggaggacttg aagacagaac ctgtttcttt tgcaagaata gagaacacat aaaaagagcg 1860 tgtcgcaaat accaaagatg gcgcgaagaa aatgtgagag gaactcaagc tcccagaaac 1920 ccaaccagga gaccgttcaa caacaacaag aacgtacgga aataatcgca tttggacatg 1980 tatcaacaaa acaatgcgac ccatcccagt gtaaaataca gtataaacca tttacaatat 2040 tagtgacagt aggcagcaga cgagtagcag ctttaataga cacgggctcc acaacgagtg 2100 taatctcttc cgacttcgag aaacagatac cgcaatcagc aaagaaagaa aaatcaatag 2160 aaaacctcaa tctcataaca gcatgtgggg atccaatgcc aaaagttaaa acagtagaac 2220 tgcaagtgaa attacactcg gtggatagcg acccgattaa aaacaacttc cacgtagtac 2280 aaaatttagc ggttagttgc atcttaggaa tggatttgat aacaaatcta gagtttcgaa 2340 taaacaccgc cagcaggcga atatcatata aaaataacgg caaacagtac cacatcgtgg 2400 ccaaagtatc agaaattcaa ccatgtcaag tatgcgtcat tatacatcaa aaactcgaag 2460 aagaaataaa agaattgatc gaaaaaacag aaataaggaa agaagacctt cgaaaagtac 2520 tggaaagaaa taaacatata ttcgccgctt cagattcgga cttcggcgaa gccattggag 2580 aagaacatag cattccgaca gtgggccctc cagtatatat accaccaaga cgccagccac 2640 gagtgatgtt gccggttata gaaaaacatg tagaaaaaat gttagcaaac gatgtgattg 2700 aagaaagtaa aagtccatat agttcaccaa tcctattagt gaaaaagaag gatggaacaa 2760 tacgattctg tgtcgatttc agattcgtaa atgatgtaac aataaaggac aaattccaaa 2820 tccctagtat tgaagcaata aaagacgact tgaagggagc aaaatatttt tcgacgcttg 2880 atttcatcag tggatactgg caaatagcaa taaaaaagga ggataggcac aagacagcct 2940 tcacaacgca aaatggtcat taccaattca aaagaatgcc atttggtttg accaatgcac 3000 ctccatgctt ccaaagaatt atgactggaa ttctgaggaa agttattggg aaatttgcat 3060 tagtatactt ggatgatgtg atcatattct cgaaaacaga agaggaacac atcaagcata 3120 tcgaagaaat attttccctc atcgccaagt caggaatgaa actaaaactc tccaagtgtt 3180 gtttcttcga attgagtgta ctatacctag gacacataat ctcagaagag ggttcagcga 3240 cagacccaaa gaaaattgag gtagtaaaga attatcctac accagaaaat aaacagcatg 3300 taaaaagctt tctaggattt gttggattct acagaaagta cataagaaat ttcggcaaaa 3360 tcgcccaccc actcacagaa ctgactaaaa acgacgtcga ttttgtgtgg ggcactgaac 3420 aagaaaacgc gttccagaaa cttaagctca gattaactac ggcacccatc ctagcacacc 3480 cggatttcac ggaagaatat atagtgcaga ccgacgcgtc agggtacggc gttggagtag 3540 tattgggaca aacccaatgg gtggaaggaa aaagacgaga cgtagtaatt gcgtacggat 3600 caaaacatct gacagacacg cagtcaagat ggagcacact cgacaaagaa ctatacgcaa 3660 taatctacgc gctagaaaaa ttccacacat acctttacgg tccaaagttt acagtatata 3720 ccgattgtca ggcattagta aagatattta aaaacccatt aaaaaacgaa acggccaaag 3780 taacacggtg ggtcctacaa acacaaaagt ataatgtgga agtgaagttt cgcccaggat 3840 cggcaaacgc aaacgtagac ggattaagtc gaattccact cccaacaaag acgtatctca 3900 ctccacatgt gacaacggcg ccaatctgtc tcattgtgga gagtcttaca acagaacaag 3960 gaaaagatgc ttactgcaag gcagcccggg agacgtacga taaacaagca aagagattcg 4020 cagtaaggga ggaagtgtat caacaaatac aaggcaacaa taaaggaaag aaatctggta 4080 aaagaagtac cctcagccca cccaattacg atttggcaac agacccgcaa gaatcggatt 4140 cggaagaaga gcaatttacg gtgtaaaaca acggattatt agcaaccagt tacggaaaga 4200 tactagcccc agaagcgcta agagagaaaa tcctattcag atatcatgac gacccattgg 4260 caggccattt agggacaaag aaaacaatag gcagaatacg ctgcaggtat ttctggcctg 4320 gaatgattaa agacataaaa acgtacgtaa ttattataat ttacggagaa tcaatttcag 4380 agatcacatt gaataaacac gctgagtcgc gattttggag cctgagcaac tggaccacat 4440 cattaaaaac gagcttaata tcagcaacag tgatcatccc aatagcatca tttgcctgtc 4500 tggcattggc ctattttaga ctaaagcaga taaagcatca aaaagagctg acagagttcg 4560 ctctacggct tcaggcaaat caaatgatga gcccaccgta agcaacagac caaatgttca 4620 aatataaatg tatcgattta tcattaaaaa attcacccat caagcatcat caacacacaa 4680 gcctatcata gaaaagtcaa ggaaagaaag aaagaaaaaa agacagagtc aaagcaactg 4740 cagccgagcg gccatcacaa aaagggaggg accggccgca taaaatccca aacccaacgc 4800 cattggcccg cccacagcca atgagcatcg acggcaaagg aaaaagttaa gcccaacccc 4860 cttctgtgaa caacacacag gccatacgat cgcgccgtca aagaaggacg catatagcaa 4920 catacgatct gacataggtt aacggttgca agactcatca ttcgaagaat atatatatag 4980 cagcatatca gttgcacatc cgtagtcata tgccagataa ctaattaaca cgcagagtgt 5040 cggattcaaa tagctcgaac gcctacgcag catcaaacag ctgcaaacaa atagccaaat 5100 gcaatcaaac aagccagtca gcggaaaatc ggtatataaa gacgaacgcc tacccattag 5160 aacgatgtag taagaaagca gcagcgcgtc ggccaacaat agacattgca cagaaaagat 5220 tttgccaaaa atgatgagac gcgacccagc agaagaaaga gccgaagaag aagagctgca 5280 cggacccgga tacaacttgc ggccacgctc aaggttggca agcacggaag acgcggcacg 5340 taaaaacaaa tacaaggtat taacggtcat tcacgcacca gaaattctta agacaccaac 5400 ctatagaacg cggacgaaaa ttatggtgac acgaagagaa gaagccccaa aagagacacg 5460 gcaagacgga ggaagcagcc aagcaggccc cggaaatgaa gacctagccg aagaaataat 5520 tcaagaatgg gaagaatttt gcaggacgcc aagcaagaag aaaaatcaag aattacaaga 5580 gtcaccagat tcaccagaag gaccgctgtc cggcgtgcgc gttcagccga tcaaaatgaa 5640 aagaacatcc ggatatacaa caaacaagga aatagaagat ctgctgaaag aacaagaaaa 5700 ttgggagctg actacatcat ctccggtagt gacggtgagt cacgatcgcc cacagtcaag 5760 aagaggtgaa gaaaatcaac aagaaaagga ctacgatcca gaaaatagaa ctctgccgag 5820 ttacctaaaa caacttaggc gtatattgtt gcgcctacgg gaggcagcaa cagccgcccg 5880 ccgaaatctg gaccaaatca gacgactcgc gtacgatggc gatatcgtag aaaacggaat 5940 ccaacagctg gtaggagagc aggagatgga caccgaggac gacgagtaac caagtattgt 6000 acgtgtaatg attaaaaaaa aaaaaaaaaa aatcattgca tttcccccta tggttaaccg 6060 tagtaacaaa taagaaagga cacactattt acttgtccta ttgattcaat gcatttttca 6120 ttttcttgtc ctattgattc aatgcatttt tcattatctt gtcctattga tccatacatt 6180 ttacctcgtc ttattctagt atagcttttg tcccatttat tcccattttt tttatacacc 6240 attttaaatg ttttctgttc aaaaaaaaaa aaaaagaata ataataataa tcagacgttc 6300 cacgcgagcg taggaattta ttcaaaaaat aaaaaaataa aataaatcac aaagcgaaaa 6360 agaaaagtca gaattaaaat cttttattga tcgaaggcag gagaatacat gaagaaattt 6420 ttttttttag ccaacgaaaa aaaaagaaaa taataaaaga gttagttgtc gcgatgaggc 6480 gggagcagcc ggtactgatc atcctgatcc aaagtatcca aatgggcgtc catctcccgc 6540 cgaaacgaat agtatgcagc tatacgctcc tgcgtagtat agcctgaagt caacaaaaag 6600 caaacaaaaa gaaaatttta ttactggaaa atacaactta tagtggagaa catagaaaat 6660 acttactcgt taaaagagtg gccaaataat tggcctcaag cgaatgctgt gtttcgaagt 6720 cgtacggctc cgccgggctc gacgaacccg gagacgaaac cagaggcgac ctggagccag 6780 cagctgacga acggccggcc accagagtcg gcggccagat ggggacaagc gctggcatac 6840 tgcctacgct ttgcaccctt acggacggcg acgcgggcgg gggctgaggc accggggtag 6900 ctacaaacgc ccggaacgcc gagcggaccg ggctggttgg ccgcacaaac gactcgacga 6960 cgaacgggcc ggaaaggcca cccgacggga cacgcagagg cactccacaa ctttttcgag 7020 ccgtcggccg atatgggccg gaacggccag ccgccggagg cactacggag ccagcttggc 7080 tttgagatgg cggccgaacg aagccagaag ggccagcggc cggaacaata cgaggcgatt 7140 cgggacgaac ccgaatgacg aggggggggg aactaccgat gttgagggca acgggggatt 7200 ccccgcagct gaactagcgg aggactccgc ataaaggacg cgcttgaaac gcttagggcg 7260 cggcatttcg taggttacaa cagatagggg caacttttca ctagcaaagt aaagcgaaac 7320 gtacgcacta gggcgaggcg aaataggcaa tgaacagaaa tagtgaaatc tatgacgaaa 7380 ataaaaagac cacccgttaa accatcgtca gcaagtaagg acacccacca aattctcgtt 7440 tcaagtagct aaaagatcgt ataaataaca aacaaaagaa aagaaaaaaa aaaaaagaaa 7500 agaatacatt tacactatgc ttccccatat atatatattc aagaaacaga accaaacaaa 7560 taacatgtaa cacaatacaa aaaaaatgta ttttgtccta ctgcaaagct taagaaagaa 7620 gaaatattaa tcaaagagga tgacaatata gagcacattg tatttacgct acatacatat 7680 taccctacat tcagcttcaa taattaacct acttcccatt acgaccatgt tataacaatg 7740 ttatgtccca ttatgaccat attatgtaca tcatgaccat gttatgtccc atgatgccat 7800 tataggaaat cgaaaaatcc tgaagccatc aaaggtgaag gaacaacaaa aaaaaagaga 7860 aggcgatgtc atttaatcaa aaaggcgaag tattaaaaac aagtgctttg aacgtatccc 7920 accaaaacgc gagagaagaa aaaacaaaaa acaaaaaaca aaagagccag caccaagcca 7980 gtgtaaccaa agcgtaccag tcgcgcatac gtaacactac agtagccggc gggcagataa 8040 attccgatga aagtttcaag gccaccccat gcatatgtat tgtacagcgc cgtaaaagct 8100 gtgaaattat cgtacataac ttgtttaaat gtttcttaca atgcattcta ttaaccgatc 8160 cttagcttag cttaatattt tatttgccca ggagtgcaaa tttaaagggg gggagtg 8217 // ID Gypsy-80_AA-I repbase; DNA; INV; 5002 BP. XX AC supercont1.247; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-80_AA_; KW Gypsy-80_AA-LTR; Gypsy-80_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5002 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.247; Positions 1373986 1368985. XX CC Positions [2316-2816] - Reverse transcriptase CC Positions [3898-4368] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 747..3317 FT /product="Gypsy-80_AA-I_2p" FT /translation="MSSYVVAPNIEPYRKGQSFATWIRRLAFHFRVNKVDD FT GQRKDQMFLLGGDYLFSVAEKIYPTEALLEEVTYEVLTQKLKEVLDKTDSV FT LLQRYNFSTKVQQAGESASDFIFSLKLLAEHCEFGDQKNRLILDRILVGLA FT DGSLKHRLFTEDSTKLTLDQAEKIIATWEMAAAHTKALTNNEDVGLVASLN FT TRNPLTGGRGAVIQRMREVSQGFRGPVKSRLGVRPAEDRPHSSRVHFKSQH FT SRYRDSSAGPSYNRFRKDEGQWPLDQRYCDFCQRYGHVRRKCYKLKNDRNE FT GVNHINAQAVANETDSLTERLTMLRTGNWDSDDDDSGELQCMHVGSINKIS FT DPCLLNVLIEKIPIQMEIDSGSSVSVMGKHMFASKFNLALSNSSKQLIVIN FT GSKLKVSGKVEVNVECNGRKIKLNMLVLDCDYQFIPLLGRPWLDEFFPNWR FT HFFGNSVQVNNILENQSDALIDEVKRKFSDAFVKNFSTPIKHYEADLVLKS FT DVPIFKKAYDVPYRLREKVLRYLDKLEAEKVITPIQTSEWASPIIVVTKKN FT NGIRLVIDCKVSINKFIIPNTYPLPVPQDLFAKLAGCTVFCALDLEGAYTQ FT LSLSKRSRKFMVINTIKGLFTYNRLPQGASSSASIFQQIMDQVLHGIENVC FT VYLDDVLIAGKDVENCKEKLFIVLDRLSKVNIKINWDKCKFFVTELEYLGH FT IISGKGLTPCSSKIATIREAKIPTNVTELKSYLGLINYYNKFIPNLSSKLY FT YLYNLLKNNVKWEWDSNCDEAFKNSKNLLLETEFLEFYDPEKPIIVVTDAS FT GYGLGGVIAHVVDEIEKPISFTSFSLNAAQKKYLILHLEALALVCTIKKFH FT KYFW" FT CDS 3418..4938 FT /product="Gypsy-80_AA-I_1p" FT /translation="MGNADFCSRFPLEEAVPIEYDQEFVKSINFGDTLPID FT FLLIAKETKQDSFLQEIMNFMCQGWPKRVPKRFAIIFASQHDLEIINECLL FT FQDRVVIPQVLQNRILKLLHSNHAGVVKMKRLARRSVYWFGINAHIEEFAA FT ACDVCSSMASVPKQNITTKWTPTSRPFSRIHIDFFYFEHRTFLLLVDSFSK FT WVEVEVMNKGTKCSEVLTKLIEYFARFGLPDVVVSDNGPPFNSFSFKSFLQ FT KQGINVLNSPPYNPASNGQAERLVRTVKDVLKKFLLEPEMLKLSLEDQIIF FT FSFNYRNNNLTMEGSFPSEKVFAYTPKMLIDLVNPKKHYRQMLVPITPDVE FT SVTVSNHTNIQLDPLDALIPGDVLWYKHNVPHMHEKWIKASFLKRLSKNLL FT QIMVGSGATTTAHPTQIRVVKNGRGASDQNRTSMRVVQTGRQLPTDAPESE FT MPIENDEITVRQLPDLSDDATEVSTTRRARKRKLIEPAELTGLPRRSSRIK FT RSRLDSEFEYY" XX SQ Sequence 5002 BP; 1537 A; 864 C; 1095 G; 1506 T; 0 other; ttttagtggc gacgagagaa tttggaagcg gcggaatctg gcacgaccat ttcgaagaac 60 ggattttccc tttccgtaga acgagctgga gtaggcagcg aaaattagct ggtgacaatc 120 cggtttaagg tagaaggtaa tagtggaaaa agtggtgaaa ccgcagctat tgtttgaagg 180 tgaacagctg gcgtggtttt tcggtggtaa caaaaggaaa taaagcgcct ttgttaattg 240 gaacgccatt tgcgttcgtt ggtggcgtgg cgtcgtttca ttgtgaacaa aagaatagtt 300 tattagaact gagattttgg agtaggacta cgtttctgct tactatgaat tcttttaaac 360 atattgaaga gttaaatatc agttattaag agtgtctttt tcctttattt ttgagaggtt 420 atacatcgta acatcttgcg gcggaccttg cccttgccgt taaggatcta ctacgtttca 480 tccggtggat ttctggtacc ttcttagcat cttggtttgg ggataatctg ttcaggtgag 540 ttttgaaaca tttgacattt taaccttgtg gttgaaacgg agttgcataa attgtgcctc 600 ttccattttg agtttttttt tttgctcgaa aaaagggata aaaagatatt agattaaaat 660 tgcactattc aacgaacaat tgccaattat tgttgtgcat ccattcataa atttattatc 720 tgtgtttcgt gtagatattc gctaatatgt catcttatgt ggtggctccc aacattgagc 780 cctaccgtaa gggacagtct ttcgctacgt ggatacggcg acttgctttt cacttccgag 840 ttaataaggt cgacgatggt caaaggaagg accagatgtt cctcttggga ggcgactatt 900 tattcagcgt tgccgagaaa atttatccta cggaagcgct gttagaggaa gtgacctatg 960 aggtacttac ccagaagctc aaagaagtcc tggacaagac ggattctgtc ttgcttcaga 1020 gatacaattt tagtaccaaa gttcagcaag ctggcgagtc agcgagcgac tttatttttt 1080 ctttaaagct tcttgcggaa cactgtgaat tcggtgacca aaagaaccgc cttatacttg 1140 accgtattct cgttggtttg gctgatgggt cactcaaaca tcgccttttc acggaagata 1200 gcacaaaatt gaccctggat caagccgaaa agattatcgc gacttgggaa atggcggcgg 1260 ctcatacaaa ggctttgaca aacaatgagg atgttggctt ggtggcttcg ttgaatacta 1320 ggaacccact gactggtggt agaggagcag ttatccaaag gatgagggaa gtatcccagg 1380 gtttccgtgg ccctgtaaaa agccggctag gtgttcgacc ggcagaagat cgtccgcata 1440 gttcacgtgt acacttcaag tcgcagcatt ccagatatcg tgattcatcc gctggtccat 1500 cttataaccg atttcggaag gacgaaggac aatggccatt agatcaacgt tattgcgatt 1560 tctgtcagcg ctatggacac gtgcgaagga agtgttacaa attgaaaaat gatcggaacg 1620 aaggtgtcaa ccacatcaac gctcaggcag ttgccaacga aacagatagt ttgacggagc 1680 gactgaccat gctgagaacc gggaattggg attcggacga cgatgattca ggtgaattac 1740 aatgtatgca tgttggttct atcaataaga ttagtgatcc ctgtttgtta aatgttctca 1800 ttgagaaaat tcccatacag atggagatag acagtggttc ttctgtatct gtaatgggaa 1860 aacatatgtt tgcttcaaaa ttcaaccttg ctctatctaa cagctccaaa cagctaattg 1920 tgataaatgg gtctaaattg aaagtgtcag gtaaagtcga agttaatgtt gagtgcaatg 1980 gtagaaagat aaaattaaat atgttggtgc tggattgcga ttaccagttc atcccattgc 2040 ttggtagacc gtggttggat gaattctttc ccaactggag acattttttt ggaaattctg 2100 ttcaagttaa caatatttta gaaaatcaat cagacgcact cattgacgaa gttaaaagaa 2160 aattttctga cgcttttgtt aagaattttt ctacacccat aaaacattat gaggcagatt 2220 tggttttgaa aagtgacgtt ccaattttta aaaaagcgta tgacgttccc taccgtttaa 2280 gggagaaagt tttaagatat ttggacaagt tggaagcgga aaaagttata acccctattc 2340 aaaccagtga gtgggcttcg cctataattg tagtcacgaa gaaaaataat ggtataaggc 2400 ttgtaattga ctgcaaggtt tctattaata agttcattat tccaaacact tacccattac 2460 ctgttcctca agatcttttc gctaaattgg ctggatgtac tgttttttgt gcattggatt 2520 tagagggcgc ttatacccag ctctcattat cgaaacgatc aagaaagttt atggtaatta 2580 acacaattaa aggactgttt acctataata gattgccaca gggggcttcc tctagtgcat 2640 ctattttcca gcaaattatg gatcaggtgt tacatggtat tgaaaacgtt tgcgtctatt 2700 tggacgatgt tttgattgca ggaaaagatg ttgagaactg taaggaaaaa ctttttatcg 2760 ttttggatag gttgtctaaa gtaaatatca aaataaattg ggacaaatgc aaattctttg 2820 tgactgaatt ggaatatctt ggtcatataa tcagtggaaa gggcctaaca ccatgttcta 2880 gcaaaattgc aacaataagg gaagctaaga ttcctacaaa tgtcactgaa ctcaaatcat 2940 atttgggcct tatcaattac tataacaaat ttatccccaa tttatcttct aaattatatt 3000 atctgtataa tttgttgaag aacaacgtta aatgggagtg ggatagtaat tgtgatgaag 3060 cttttaaaaa ttctaaaaac ttattgctag aaactgaatt tttggagttt tatgatccag 3120 aaaaaccaat cattgtagtc acagatgctt ctggctatgg tttaggcggg gtgattgctc 3180 atgttgttga tgaaattgag aaaccgatat cgttcacatc cttttcgttg aatgcagcac 3240 agaaaaaata ccttatttta catttagagg cccttgcgct cgtatgcact ataaaaaaat 3300 ttcataaata tttttggtaa agagggcaaa cactcaatat ttgtaacaag aattcaacga 3360 tatatattag aattatccat atatgatttt gaaatacagt acagaccgtc tacaaaaatg 3420 gggaatgcgg atttttgttc gcggtttcca ttagaagagg cggtgccaat tgaatatgac 3480 caagagtttg taaaaagtat taattttgga gataccttgc ccattgattt tctgcttatt 3540 gcgaaagaga caaaacaaga ttcgtttcta caagaaatta tgaatttcat gtgccagggt 3600 tggccaaaaa gggtacctaa acgttttgca attatttttg caagtcagca tgatttagag 3660 attatcaatg agtgcttgtt gtttcaagat agggttgtca taccacaggt gctacaaaac 3720 aggattttga aacttttaca cagtaaccat gcaggcgtcg tcaaaatgaa aagattggct 3780 aggagatcgg tatactggtt tggaataaac gcacatattg aagaatttgc tgctgcttgt 3840 gatgtctgtt caagtatggc aagcgtcccg aaacaaaata ttactacaaa atggacaccc 3900 acttcaagac cttttagtag gattcatata gatttttttt attttgagca tcgtaccttc 3960 ttattattgg tagatagttt ctccaaatgg gtggaggtag aagtgatgaa caaaggtact 4020 aaatgttcag aagttttaac gaaacttatt gaatattttg ctcgttttgg attgccagac 4080 gttgtagtgt cggacaatgg tcctcctttt aactcgtttt cgtttaaaag tttccttcaa 4140 aagcaaggta taaatgtatt gaacagtcct ccttacaatc ctgctagtaa tggccaggcg 4200 gaaagactcg taagaacggt gaaggacgtg cttaagaaat tcctactaga accagaaatg 4260 ctaaaattga gtttggaaga ccagataatt tttttttcat ttaactatag aaataacaat 4320 ttgacgatgg aaggcagttt cccttctgaa aaggtttttg cttatacacc aaagatgttg 4380 atcgatttgg ttaacccaaa gaaacactat aggcagatgc tggtaccaat aacccccgat 4440 gttgagtctg taactgtttc aaatcatact aatatacaat tagatccgtt ggacgcacta 4500 atacctgggg acgtgctgtg gtacaagcac aacgttccac acatgcatga aaaatggata 4560 aaagcatcat tcctaaaaag actttcgaaa aatcttttac agatcatggt tggaagcggg 4620 gcaacaacaa ccgcccatcc aacgcagatt cgagtcgtta agaatggacg tggagcatct 4680 gaccagaaca ggacgtcgat gcgtgtggta cagactggtc ggcaacttcc gactgatgcc 4740 ccagaatccg agatgcccat cgagaacgac gagattactg tcaggcagtt acctgatctg 4800 agtgacgatg caacagaagt tagtacaaca agaagagcca ggaaaagaaa acttatcgaa 4860 ccggcagagc ttactggcct accgagacgt tcaagtagaa ttaagagatc gagattagac 4920 agtgaattcg aatattatta aatccaagat tatatcacta gaattgtatt tccttcaaac 4980 tttccaaagg ggaaaggact at 5002 // ID P-16_HM repbase; DNA; INV; 3258 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3258 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 362-362 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 231..2753 FT /product="P-16_HM_1p" FT /translation="MVNKCCVVGCTSNYNGGEVVPVFGFPKDDDLRKLWVR FT FVNRNFWTVTKSSVICLKHFDENLIHKGASAKRFRLLYNLKPVPTIYPASI FT ATASLPVMSVPRKSPTKRIFQEDEYQKFLSNDVIEDLNSLTVADSPTGYSF FT SKFDDCVVFHKIIQNELHIPQVAECIRIDKNLHVKLFYKNLPVSLPPWFRV FT THQCVLKRRSMLENFNSHFSIEKEKLSSILEELKSHQVKKNQIYSSNMLQF FT ALMLRYTSLQTYNLLLKEFPLPSISLLGKLKAGEIDSMKVLKLLHENGSLS FT KDLLIIFDEMYLQKCAEYSGGDIIGISEENECYRSIVSFMVVGLKENISCV FT VKAVPIIKLNSDWLKTEILNLIQSLIKNSFNVRGVVCDNHASNVSSFTKLL FT NAHGKKPDDLFINVQSQKIYLFFDTVHLIKNIRNNLLNKKRFIFPEFHFSG FT FKDEIHVQPGEISWKIFHDLLEKDSLLDASLKKAPKINNKVVHPGNYKQNV FT TIALGIFHETTAAAIKSYFPDRLDIASFLTLFNKWWIITNSKNQFSNNCLG FT NAAIINDKKPEFFLTKNKWKNTKIPNFEKYTLSPQTLSALQRTLLCHASLI FT EDLLEDNYKFILTARFQSDPLERRFGQYRQMSGGRFLVSLKDVIYSEKIIQ FT IKSLIKEGIQIFENDINAVEDTQHARSLMNQIKERDLDCVTLSDDSREVAS FT YIAGFIAKKLNKKFKICCKLLCTQHCQQELNDYNYLNILSRGGLTTPSLPL FT LEYVCDSFAMLDHVFDVIYQSRMQHRKAAKLVLGSREGYQPFLCLNHQNCG FT KEIIHTIISNIYFNNIRKRVCDSIVADNIVVFKQKQRKK" XX SQ Sequence 3258 BP; 1158 A; 458 C; 504 G; 1138 T; 0 other; catggcctac ttaaatacac ggccttctat tctgcttaaa tcgaaaaaaa atagaaggcc 60 agttgaaata acgtatattc tattatatgg acagactctc tcaaaatatt ttaaatattg 120 cgctataaat gttaagtaag taaatctgac accggataaa agtttttgcg ttagattaat 180 gccaaactgt attgttattt gtgactactt tttaattatt aataataata atggttaata 240 aatgctgtgt tgttggttgt acctctaatt ataatggtgg tgaagtagtc ccagtttttg 300 gttttccaaa agatgatgac ttaagaaagt tatgggttag atttgtaaac agaaatttct 360 ggactgtcac taaatcatct gtgatctgtt taaaacattt tgatgaaaat ttaatacaca 420 aaggagcatc agcaaaacga tttcgtttgt tgtataattt gaagcctgtt cctacaatat 480 atccggctag catagcaacg gcttcacttc cagttatgtc tgtaccaaga aaatcaccaa 540 ctaaaagaat ttttcaagaa gatgaatatc agaaattttt atcaaatgat gttattgaag 600 atcttaattc tcttacagtt gctgattcac caacaggata ttcattttcc aaatttgatg 660 attgtgttgt gttccataaa attattcaaa atgaacttca catacctcaa gttgctgaat 720 gtattcgtat tgacaaaaat cttcacgtta aactttttta taaaaatctt ccagtttctt 780 taccaccttg gtttagagta acccaccaat gtgtgttaaa gagaaggagt atgctggaaa 840 acttcaattc acattttagc atcgaaaaag agaagctgtc ttctattctt gaagaactta 900 aaagtcacca agtaaaaaaa aatcaaattt attcttcaaa tatgcttcaa tttgccttga 960 tgctgcgtta cacatcattg caaacatata atttactgct gaaggaattc ccacttccat 1020 ctatatcact gttaggcaag cttaaagcag gtgaaattga ttctatgaaa gttttaaagt 1080 tgttacatga aaatgggagc ttatcaaaag atcttttaat aatttttgat gaaatgtatt 1140 tgcaaaaatg tgctgaatat tctggtggag atataattgg gattagtgag gaaaatgaat 1200 gttatagaag cattgtttcc ttcatggtag ttggtttaaa agaaaatatc tcttgtgttg 1260 tcaaagcagt tccaattata aaattaaaca gtgattggtt gaagactgaa attcttaacc 1320 tgattcaatc tttaataaaa aatagtttta atgttcgagg tgttgtttgt gacaatcatg 1380 cttcaaatgt ttcctctttt acaaagttgc ttaatgcaca tggtaaaaaa cctgatgatt 1440 tgtttattaa tgtacaatcg caaaaaattt acctattctt tgacacagtg catctgatta 1500 aaaacattcg aaacaactta ctaaataaaa aaaggtttat ttttccagaa tttcattttt 1560 caggttttaa agatgaaatt catgttcaac caggagaaat atcatggaaa atatttcatg 1620 atttattaga aaaggacagc ctgttggatg caagtttaaa aaaagcacca aaaataaata 1680 acaaagttgt ccatcctgga aactacaagc aaaatgtcac tattgcttta ggtatcttcc 1740 atgaaactac agcagcagca ataaaatcat attttcccga ccgtttagat attgcatctt 1800 tcttaactct ttttaataaa tggtggataa ttacaaattc aaaaaatcag ttttctaata 1860 actgtcttgg aaatgctgct ataattaatg ataaaaaacc tgagtttttt ctcactaaaa 1920 ataaatggaa aaacacaaaa attccaaatt ttgagaaata tacattatct cctcaaacat 1980 tatcagcact acaaagaact ttattatgtc atgcttcatt aatagaagat ctgttagaag 2040 ataattataa atttatattg actgctcgtt ttcaaagtga tccgttagaa agaagatttg 2100 gacaataccg ccaaatgagt ggtggcaggt tccttgtcag tttgaaagat gttatttata 2160 gtgagaagat aatccaaata aaaagtttaa taaaggaagg aattcaaatt ttcgaaaatg 2220 atattaatgc tgtagaagat acacaacatg ctagatcctt gatgaatcag atcaaagaaa 2280 gagatttaga ttgtgtgact ttatcagatg attctcgtga agttgctagt tatattgctg 2340 gttttattgc caaaaaactg aataagaagt ttaaaatatg ttgcaaattg ctttgtactc 2400 aacattgtca acaggagttg aatgattata attatttaaa tatattgtcc agaggtgggc 2460 taacaactcc atcattaccc cttttagaat atgtctgtga tagttttgct atgctagatc 2520 atgtatttga tgttatttat cagtctcgaa tgcaacaccg taaagcagca aagttagttc 2580 ttggcagtag agaaggttac caaccatttt tatgtttaaa tcaccaaaat tgtggaaaag 2640 aaattataca cactattatt tcaaatattt attttaataa catcagaaag agagtctgtg 2700 attccattgt tgcagataac attgttgtct ttaaacaaaa acagagaaaa aaataagaac 2760 attttttttt tcgcattaat cagtaaattt aatcagtttg tgttcaattt tgtttatact 2820 tttaaagctt acttttgaga gtttgtgtat ttttatttct tgtacagtcc ttgaatatca 2880 caaaaattct attgacaaat ataacccatg tctttataaa atcatttaag taatatgtaa 2940 ttatgaaggt taataaattt tgtgttgttt tcaataaaca cagcatttat taatcatgat 3000 tactaattta tcatgattta ttattaaaaa gaagcctcaa taaagtatac aaagtttagt 3060 ggtttagtag atatttcgat taagccaacg caaaaatttt tatcccgtgc cagatttact 3120 tacttaaaat ttatagcgca ataattaaaa tattttgaga aagattgtcc atccgacaaa 3180 atatgcgtta ttttaactgg ccttctattt ttttccaatt taagcagaat agaaggccgt 3240 gtatttaagt aggccatg 3258 // ID Gypsy-233_AA-LTR repbase; DNA; INV; 1691 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-233_AA_; KW Gypsy-233_AA-I; Gypsy-233_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1691 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1070-1070 (2011). XX DR [1] (Consensus) XX SQ Sequence 1691 BP; 459 A; 345 C; 393 G; 491 T; 3 other; tgtaacaaaa tgaatatgca ccatagtttt gggtgtaata actgatcaaa agctacaaga 60 gcgaactctt gcgagaaaaa aatgagacta ttgaagcttt cgcaaaattc aaattgtgag 120 tgcctccact caggaactca ttctgggtag attccctgat gcgcaaaaac gcaataactg 180 cgaattcaag gcttcctatt accttccgta tgcaaattaa acgcatgaag taaaacaata 240 ccgtgtgggc gtgagaagga tgcacgatcg gaattgttac aaaagagaat tgtgttcagt 300 tcaaaataga tagccgttct attgggtaaa tgcaattatt gttcaccttt gtcgaacaag 360 ttcagtgcga ctcgttgact agagcagacg taatagtcga agtctaaaaa gtcagttgtt 420 agccacgttg catttacaag tgcattagcc aagttctgtg attgtaagtt cctttaaaat 480 tgaatggcct tttatttaga agaactagtg ttatcctttt gcagttttcg tcagttggaa 540 agcgaattwa attaaaatct acaaagtagg ttctgtgttc gaagcacgtg attgagactg 600 tccggtagtg tagattcgga caaaagacgg taaggccaac agtagggtta aataatttaa 660 gcgtgtgcta attcgttgga acccattgat cagaatactc gctgatcgat gcgggtattt 720 tcctaggttt tggaaattga gagaagaacc ctagcgctcg acctgtgagc ggagctctca 780 ccgcaagcga gaccgtcacc tcccaacccg ccacccaccg ccttttgtcg cactgtacgc 840 cgactctccg tggtgcaaca acaacaatca ccggcgtgac atcgctacca ctgtctgcgc 900 caaagttatc gctcaggagt cggtcggata caccgctggt attacaccgt tgcgcggtga 960 cacgaacccc atcgtgtgtc tgggatttcg aatttgaaat agtgcggtag tgagggagtg 1020 ctttccgtct acgtctcaag cataaccgag catagaagca atcacgatat tgggcgatat 1080 tgcatgcgaa cacgccagcg cggtgagaat acgttttcga attgcccgcc gacggatagg 1140 agcgtctcca ccgagccctt cgcttcgagg gattttgctg cmtggtttgt tcccgttgca 1200 tttaccgaaa ggtcagatct agattgtaag gctaatttta agggaaaact agatattaag 1260 gaatcattag agctaggttt tgagtgacgt caccttctaa aagcttgctg acaggcctct 1320 gtaaatattg agattttaaa tacacgaaat ttaaagcaag ctgtgattac gtctaggaag 1380 gcagacgaat caaacggttc ttgttcgaat ttaattattt cakatttaat tggctatcgt 1440 ttcgatcttg ttttcttctt tgattagtaa gtttgttgtt gggtttcata gtagattttg 1500 tccttgaatt cttggttttc aaatttttgg ttgcattttg ggttagttgg aatgtattcg 1560 agtgtttgag cagtctatta actcgcccta tttgaagagc ctctgccggg cccccaaaca 1620 gcctcagcag tctctcccct cggagagtgg cgcttaagct gctaaggtca ggaactcgtt 1680 cattcgttac a 1691 // ID Gypsy-25_DPu-I repbase; DNA; INV; 4332 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_DP_; KW Gypsy-25_DPu-LTR; Gypsy-25_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4332 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [3341-3802] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1445..4318 FT /product="Gypsy-25_DPu-I_1p" FT /translation="MRLDWNQILQLTNHPSTIPRSNMLPDCFQKKFTDLFS FT TKLGKIRGPPVHLDLKAEAVPRFHRARPIPYALRAKVKAALIKLTEAKVLK FT RVRHSNWGAPIVVVPKANGDIRVCGDYKVTVNPFLIVDQHPLPLPEDIFAT FT LEGGVLFTKLDLSQAYNQLELDEFSQELCTINTPEGLFQYTRMPFGIASAP FT GKFQRVMDDLFQDTPWVKCYLDDILIAGRTEKEHWTRVEIVLQKLQEAGVR FT LQLEKCSFGVPEIPYLGFIVSKDGLKTSPEKIKAVQDSEKPHNLTSLRAYL FT GLVNYYGKFIPKLAHVSAPLNELLKKEKPWRWETEQQDAWLEIKQLLSSAE FT VLCNYNPKWILRLACDASPFGVAAVLSHILPDGSERPISYASKSLSASEKS FT YSQLDKEALSIIFGVKRFHSYLYGRKFTLITDHKPLLAILGPKKGIPPLAA FT ARMQRWALILAAYAYELEFRKTTEHGNADALSRFPLEDPSETVGQLPELSH FT SRPELFGTALVTKDVRESTKLDPVLQEVSTRLRDGWRFSDKVSSTLATFYR FT KRTELSIKDGLILWGNRVVIPKTLQPQVLSLLHEEHAGIVRMKAVARSFVW FT WPGLDSQLTEMATSCLPCLQTRNNPKRRKEAAWPVPDQPWSRLHVDFAGPL FT PSGQYLFVLMDATSKWPEIFRLNRITSDTTISTLKTIFARFGLPSELVSDN FT GPQFTSEEFKRFMLVNGIVHHRGAPYHPQTNGLAERAVQSVKKALHKMRDQ FT PGTFDDKLQRFLTSYRNTPHKSTEKTPAEVLLGRQNRGKFDLFQPTVLKKK FT EKIEAEFQAGEKVMVRDFRPGKNKWLEGVVNHRIGSFLYEVQIGKQIMKRH FT CSQLLPRGRTEPEDTGDVMLPQVESESRDQRVVTTGIATENQPPMASPTPS FT QATAGPKDLPAVPTEAEVMEKQPAVLSPPPRRSTRACKLPNYLNDYVLKNV FT HFC" XX SQ Sequence 4332 BP; 1354 A; 914 C; 1047 G; 1009 T; 8 other; actggcgacg aggatttaag ataagaaacc acgtgttggt agtgaaggag tgactttacg 60 taagtgttat aagttatcat tctggttatt tatgttggtt atttgcggtt atgaaccgtg 120 gtaaatcaag aggtagaggt agactggtgg aatcccaaat tggacaagag caagatatag 180 agagtcaaca agaagaagag ggaagtatcc cttttgtgac agacagcatt caagagcaga 240 cagcaagatc cacagaagaa gcagtggccg atcctgtaat tccaccacca atggcgacta 300 acgtgattgg aacattggag ccgttcaaca ctgagacggg caagtggcag gtatacgaaa 360 aaagacttca acagttcttt attgttaaca gtattaccga agatgtcaag aagaaggctt 420 atttgttgac tgtgctggga gccgaaattt gtgatttgat atgggattta tttagtccca 480 cagaccccac ggctgcaacg gtaacacatg atatgatttg tacaaagtta agagagcatt 540 ttgttcctac aaaggtggag attgcagagc gttttcgttt ttttcaacat aagcaaaaac 600 cagaagaaac catagcaagt tttatggcat cgctacgaaa cctcgccaag gattgttctt 660 ttggagactt tcttaattca gccttaagag atgmtttcgt gataggcctg saagatcaac 720 ggattcaaac gaagctgttg gccgaatcgg ccttgacgtt ggactctgca ttcaagatgg 780 ctgtcagcat ggaatccgca acccagcaag ccgaammact tagacaagaa gaaccaatca 840 acgttttaag aactcgtact acgggatgct ggaggtgtgg tgaacctcat aatccggcag 900 ggtgttattt caaaacgcaa gagtgtttct actgcaagaa agaaggacac cgggcagctt 960 gctgcccaga gaagcaaaag aagaaaagtc gagagtcaga aaagtccaga gaagaaaagt 1020 cagcaatccc caagaagaaa gacagcacaa gaaaaatcga ttaaactttg cagataatac 1080 cgaagacgcc gagggattgg aagattttga tcctgacaca gattttaatt ttttctatct 1140 gccagatccg aacaggtcca caaaaccgtt ggtgacaact ctggatatag atggaagaaa 1200 ggtgaccatg gagatagaca cgggagccgg cttcaccata ttttcagaaa aggaatggak 1260 aaaccatggt agcccgaaat tggaagacac gcaggtgcgt ttacggacat acactggcca 1320 accagtggac ataaaaggaa aattcgtggc agaagtatcg gtacaggaac agagtaaaca 1380 gctaccaatc cttgtcgccg gaggcaatgg gccaccactg tgtggcagaa actggcttcg 1440 cgcgatgaga ctcgattgga accaaatact gcagctgact aaccatccta gtaccatacc 1500 acgttctaat atgttacctg actgtttcca aaagaaattt acagatttgt tttcaacaaa 1560 gcttggaaaa attcgaggac cgccagtgca tctagatctt aaggcggaag cagtgcctag 1620 atttcatcgg gctcgcccca ttccttacgc cttaagagcc aaggttaagg cagcactgat 1680 caagctgact gaagcaaaag tattgaaaag agtccgccat agcaactggg gagctcccat 1740 tgtagtcgtg cctaaagcaa atggtgacat aagagtgtgc ggagactaca aggtaactgt 1800 aaacccattt ttaatcgttg atcagcatcc gttgccactc ccagaagata tttttgcaac 1860 gttggaaggt ggtgtattat tcacgaaact agatttgtcg caggcctaca accagttgga 1920 gttggatgaa ttttcgcaag aactgtgtac aatcaacaca cctgaaggcc tgttccagta 1980 cacccgaatg ccgtttggca ttgcttccgc accaggaaaa tttcaacgag tgatggacga 2040 tctatttcaa gacacgccat gggtaaagtg ctatttagat gacatcctaa ttgcaggccg 2100 cacagaaaag gaacattgga cgcgagtgga gatagtattg cagaaactac aggaagcagg 2160 ggttcggctt caactggaaa agtgcagttt tggagtgccg gagataccct atttaggatt 2220 cattgtatcc aaggacggac ttaaaacatc accagaaaag atcaaagctg tgcaagactc 2280 cgaaaagcct cataacctta cttctctgcg ggcatattta ggattggtca actattatgg 2340 aaagttcatc ccgaaattgg ctcatgtgtc agccccgtta aacgagctgt taaagaagga 2400 gaagccttgg agatgggaga cggaacagca agacgcctgg ctggaaatca aacagctgtt 2460 aagctcagca gaagtattat gcaactacaa cccgaaatgg attctgagac tagcctgcga 2520 tgcatcacct tttggagtcg cagcggtatt gtctcacatc ttaccagacg gatcmgaacg 2580 acctatttcg tacgcctcca agagcttatc agcatctgaa aaaagttatt ctcagctgga 2640 caaagaagcg ttgtctatta tcttcggtgt taaaagattt cactcgtatt tatatgggcg 2700 maaatttact ttaattactg accacaagcc tctactagcc attttaggtc caaagaaagg 2760 aattccacca ctggctgctg ctaggatgca acgttgggcg ctcatattgg ccgcatacgc 2820 ctacgaattg gagtttcgaa aaaccaccga gcatggaaac gcagatgcat tatcccggtt 2880 tcccttagaa gatccatcgg aaacagtggg gcagctacca gaattaagtc attcccggcc 2940 ggagctgttt ggaacagcat tggtaaccaa ggacgtacgg gaatccacca aattagatcc 3000 agtactacag gaggtgtcca ccagattacg agacggatgg cgattttcgg ataaagtctc 3060 ttcaacatta gccaccttct ataggaaaag aacggaatta tccatcaagg atgggctgat 3120 cttatgggga aacagagtgg tcatcccmaa aacactgcag ccgcaggtgc tatcactatt 3180 acatgaagag catgcaggga tcgtgagaat gaaggctgta gcaagaagtt ttgtatggtg 3240 gccaggatta gacagtcagc tgacagaaat ggccacgtca tgtttacctt gccttcaaac 3300 cagaaataat ccgaagagaa gaaaagaagc tgcatggcca gttcccgatc agccatggtc 3360 acgcctccat gtggattttg caggtccgtt gccgagtggt caatatcttt ttgtgttaat 3420 ggatgccact tccaaatggc cagagatttt tcggttaaat aggattacct cggatactac 3480 aatatcgaca cttaaaacta tttttgcccg ttttggactc cccagcgaat tggtatcaga 3540 caacggacca cagtttacat cggaagaatt caaacgtttc atgttagtaa acggtattgt 3600 tcatcataga ggagcaccat atcaccctca gacgaacgga ctggcagaac gggcggttca 3660 gtcggtgaaa aaggccttac acaagatgag agatcagccc ggaacattcg acgacaaact 3720 ccagcgcttt ttgacctctt accgaaacac acctcataaa tctactgaaa aaactcctgc 3780 ggaagtacta ttgggtagac aaaatcgtgg aaaatttgat ttgtttcagc caaccgtcct 3840 aaagaaaaag gagaagatcg aagcggagtt ccaggcagga gagaaggtga tggtgaggga 3900 tttccgtccc ggaaagaata aatggctgga aggagtagtt aaccatcgta tcggaagttt 3960 tttgtatgaa gtacagattg gaaagcaaat catgaagcgt cattgcagcc aattactgcc 4020 gagaggaaga accgaaccgg aagacacagg tgacgtgatg ctaccccaag tggagtcgga 4080 atcgagagat caacgtgtgg tgacaaccgg aatcgcgacg gagaatcaac ccccaatggc 4140 ctcccctaca ccaagccaag cgacggcggg accaaaggat ctgcctgcag ttccaaccga 4200 agcagaagtt atggagaaac aaccggcagt tttgtcaccc ccaccgcgac gttcaactag 4260 agcatgcaaa ttacccaact atctaaatga ctatgtgttg aaaaatgtac atttttgtta 4320 gaacagagga gt 4332 // ID Mariner-24_SM repbase; DNA; INV; 1693 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-24_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1693 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1873-1873 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 244..1572 FT /product="Mariner-24_SM_1p" FT /translation="MDNDSEQKQKKQRRSFSSAKKKEIIELARQLGNIREV FT ARRYDINETVIRRWIAQESVIKAMNPNRRSLRGGNARYPELETKLHEWILI FT QRERELQVSVVRIRLQAQVIAKEMKIEGFAASSQWTDNFMKRKNICVRRTT FT TKQQLPKDWEFQMAKFRSTIADLKKDLPDNQIGNFDEVSVQCDMPLGYTVE FT TKGANEVRIKTTGHEKKRFTVNLCVLKDGTKLPPFVILGTKKHPKTFIPES FT KLVIAANGTGWMNHETLEIWLKKVWKSRITLPPKTPWNQDPKVPSLLLFDM FT HRSHLVSTTLNKIKKESKVAFIPAGLTSKVQPLDLTVNRSFKSKLRKKWED FT WIIKDYENIKHTKSGNMKAADWNTIFEWIVSSWEDVNLSTILNGFRVAFGE FT DDDVLEIEENEMNVNESDPNIVELLENFTFIDNEKCDGFEECDVENLNK" XX SQ Sequence 1693 BP; 574 A; 280 C; 334 G; 505 T; 0 other; ctgtaaaatc ccgagtataa gctgcagcgc gtattagctg cagatgacat ttttataaaa 60 atgtattttt tagtcgtata aaccgcagca cgtattagct gcaacatttg cgttgaaaac 120 acatgatttt aaaatcagca acctatcgtt aaaaaacaaa tttcaccgtg acaataacca 180 ttttgtttta atcgtttata ttgtttacat tttttattgc actgttttga atccttgatt 240 aaaatggata acgattctga acaaaaacaa aagaaacaac gaagatcttt tagttcagca 300 aagaaaaaag aaattattga attagctcgt cagttaggaa acatacggga ggttgcaaga 360 cgatatgata ttaatgaaac tgttattcga cgatggatcg cccaagagtc tgttatcaaa 420 gcaatgaatc caaatcgaag atctttgcgt ggtggtaacg ctagatatcc ggaattggaa 480 acgaaattac acgaatggat cctaattcaa cgtgaacgag aacttcaagt gtcggttgtt 540 cgaattcgtt tacaggcaca agtaattgcc aaagagatga aaattgaagg atttgccgcg 600 agctctcaat ggactgataa tttcatgaaa cgcaagaata tttgtgttcg acggactacc 660 acgaagcaac aattaccgaa agattgggaa tttcagatgg ccaagtttag aagcaccatc 720 gcagatttaa agaaggattt gcccgataat caaatcggta attttgatga agtgtccgtt 780 caatgtgaca tgcctttagg gtacacagtt gagacaaaag gtgccaatga ggtccgaatt 840 aaaactacag gacatgagaa aaaacgattt accgtcaatc tatgtgttct caaagatggt 900 actaaacttc cgccttttgt tattttgggc acaaagaaac atccaaagac tttcattcct 960 gaatcaaagc ttgttattgc agccaatgga actggttgga tgaatcatga aactcttgag 1020 atatggctca agaaagtatg gaagagtcga attacattgc ctccaaagac accatggaac 1080 caggatccaa aagtaccatc attgctttta tttgatatgc atcgctcaca tcttgtcagt 1140 actacactta ataaaattaa gaaggaatcc aaagtagctt tcatacctgc cggattgaca 1200 tcaaaggtgc agccattaga tcttactgtt aaccgttcgt tcaagtcaaa attgcgcaag 1260 aagtgggagg attggatcat taaagattat gaaaacatca aacatactaa gtccggaaat 1320 atgaaagcag ctgactggaa tactattttc gaatggattg tttcctcttg ggaagatgtt 1380 aacctttcca ccattttgaa tggttttcgt gttgcattcg gtgaagacga cgatgttctt 1440 gaaattgaag aaaatgaaat gaatgtgaac gagtctgatc caaacattgt agaattacta 1500 gaaaatttca cttttattga caatgaaaag tgtgatggtt ttgaggagtg tgatgttgaa 1560 aatttaaata aataatctta tcttcaattt gaattgtttt ttagttgtat aagctgcaca 1620 tcatattagc tgcaattcaa aaatgttgcc tttttgtacg tattagctgc agcttatact 1680 cgggatttta cag 1693 // ID hAT-8_HM repbase; DNA; INV; 3514 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3514 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1997-1997 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1048..1533,1638..3353) FT /product="hAT-8_HM_1p" FT /translation="MGPSSNPSNFPRDQKERKFPTYIFLEEQRNGEKVKRD FT WLVWSKAVASLFCFPCSLFGSPNLTRPGVQSSLLSWNGGIHGNWRKLSDRV FT KSHQQSDFHRNCYLQWKTATIHLKSQKSIEHQLEKQIGDETKRWKVILQCI FT LDVTIFLASRNLAFRGKNIVFKFLGSNQKLFKNGNGNFLGALELIARHNKT FT LQDHMHLIAKHQEEEKRMQAHYLSWKSQNEFIKECGKLVVREVIREIKKSI FT YFTIITDSTPDSSHSEQITFVFRFLHFNDNQLWEVKERFLKLEELEKKKGS FT DITKLILNVLEENELDIKNCRGQGYDNGANMAGIYNGVQALIRQNNPQAIF FT IPCSAHSLNLCAVHAIESSAPAKSYFGNIQKLYNLFSSSPVRWKILQEETG FT QSLHKLSVTRWSSRIESVKPLAKKPREILKSLHRLKELDLPGESSNEVHYL FT IKWMNSFEFVVFTTFWFKCLMAVNNVSLLLQSTQLTLDDEVKILNGLLMDL FT DRIRLSWDLILIEATEVAKNLKFDKIMFTVKRTRKKIISDETAGYNHEEAS FT KNFECNVFNVALDNLTSQMRSRFKVIEEECSKFSFLWSLQKSQTDQELGLK FT AENLAKLYPEDLCKNEFSDEVRHFFSVRRNILASEKPVELLNEVYAKGLHS FT IFPQVCVSLRIFLTLPVSTSEGERSFSKLAIIKDYLRSTMGQERLCYLMIL FT SIESDLAINVNYEEVISNFAAKKARKMCLSIKN*" XX SQ Sequence 3514 BP; 1176 A; 581 C; 638 G; 1119 T; 0 other; cagggccgcc aagagggggg gggggcaatt gaagtagttt gccccgggca tcacagtcta 60 tggggcatca catgccttaa aaaaaaaatt attgaaatga taaaaaattt agctgataac 120 gtatttaaag attaagcaat ttttgtgtat taatttcttt ctccttttaa actttgtttg 180 catttaaaat aggataatta ttactttaca aggttttcat tatccatttg aacgcacgcg 240 ttttcattgg ccttttgttc gcatgataga ataaaaagcg ttaaaaataa ttttttctta 300 tttaacgttt gtaaaagcct gacttctgat agtttaactt atttttgctt aaaccttgcg 360 tccctgtgga tacttttgtg catatgtatg tttttgactc gcctgcattt tgttccgact 420 atattttgac aagaaactta tcttcagtgg atattttaat tgtaagtact gactttttct 480 ctgcaattaa taacaccaaa gtattataaa gtttttaatt tttaaaattt aaaaagttaa 540 aattctttat atcgtcataa gttttagcat gaataatcaa agacctccac gattgaaaca 600 gctgtctgga gctgctggga gaaaaagagc gatgattcgg aaaaaaaacg aaagcaaaaa 660 caaacgaacg cttttagatt gcgttcgttg gacaaaaatc gaagataaat gcaagcaggt 720 agttaaacta ttttttaatc ttgtgacgtt catgaaaaaa taattagaat aattctgttt 780 aatacttctt tatagttcaa tgatattgag atcggtatca atgactctaa tgagttgcca 840 gaccataatt ccgacaaact ttttcttgcc gatgacatcg atgactccca aaaaacatca 900 tcacatttaa aaaaggatca gttggatttt gtacttgatt gtcacaatga ccaacaaatt 960 tataaaagtg atctttctat accaatctct aacgatcctg cactatggaa atcaatatca 1020 gacactgatc gaacgaatat aattctgatg ggaccttctt caaatcctag caattttcca 1080 agagatcaaa aagaacgcaa attccctact tatatctttt tagaggaaca aagaaacgga 1140 gaaaaagtta aaagagattg gctcgtttgg agtaaggctg tggcatcact cttttgtttc 1200 ccttgctccc tttttggatc gcctaactta accagacccg gtgttcaatc cagtttgttg 1260 tcttggaatg gaggtataca tggaaattgg agaaaacttt cagatcgcgt taaaagtcat 1320 caacaaagcg attttcaccg taactgttat ctccaatgga aaacagctac gattcacttg 1380 aaaagtcaaa agtccattga gcatcagtta gaaaaacaaa ttggagatga aacaaagcga 1440 tggaaagtaa ttttacaatg cattcttgac gtcacaattt ttctagcttc aagaaactta 1500 gcttttaggg gtaagaatat tgtttttaaa ttttaaatta tattttgcat tattttattt 1560 gtaatttaaa aatcaaaatt acaagtttta attggatata aatatggatt actattttga 1620 ttattttttg tttgtaatta ggatccaatc aaaagttgtt caaaaatggg aatggaaact 1680 tccttggagc gctcgaatta attgctcgcc acaacaaaac attacaagat catatgcatc 1740 ttattgctaa gcaccaagaa gaagaaaaac gcatgcaagc tcattattta tcttggaaat 1800 cacaaaatga atttatcaag gaatgcggca aattagtggt tcgagaagtt attcgagaaa 1860 tcaaaaaatc aatatatttc accatcatca cggacagcac tcccgattcg tcccacagcg 1920 agcaaataac ttttgttttt cgttttttgc atttcaatga taatcagctg tgggaggtca 1980 aagaaagatt ccttaagcta gaagaacttg aaaagaagaa gggatctgat ataacaaagt 2040 tgatattgaa tgtgttggaa gagaatgagc tcgacataaa aaattgtcgt ggtcaaggct 2100 acgacaacgg cgctaatatg gcaggtattt acaatggagt tcaagctttg ataagacaga 2160 ataatccaca agcaattttc attccttgct ctgctcacag tttgaactta tgtgcagtac 2220 acgctattga atcatcggct cctgcaaaat cctattttgg aaatattcaa aagctataca 2280 atctgttcag cagcagccca gttcgttgga aaattcttca ggaggagaca ggccaatcat 2340 tacataaact ttctgttaca cgatggagtt caaggattga atcagttaag ccactagcaa 2400 aaaagccaag agagatcttg aaatcattac atcgtctaaa agagttagac cttccaggcg 2460 aaagtagtaa tgaagtgcat tatctcatta aatggatgaa ttcttttgaa tttgttgttt 2520 tcactacttt ttggtttaaa tgtttgatgg cagttaacaa cgtgagtctt ctgcttcaat 2580 ccactcaact cactctcgat gatgaagtaa agattctaaa tggccttttg atggatctgg 2640 atcgaatccg tctgtcttgg gacctgattt tgatagaggc aacagaagtt gcaaagaact 2700 tgaaatttga taaaataatg tttactgtga agagaacgcg aaaaaagatc atctcagatg 2760 aaactgctgg ttacaaccac gaagaagcat caaaaaactt tgagtgtaat gtcttcaatg 2820 ttgcactcga caatctgact agtcaaatga gatcaagatt taaagtaata gaagaggaat 2880 gtagcaagtt ttccttcctt tggtctttac agaaatctca aacagaccaa gaactaggtt 2940 taaaagctga gaatctggct aaactttacc cagaagactt atgtaaaaac gaattttctg 3000 atgaagttcg gcattttttt tccgtaagaa gaaatattct agccagcgaa aagcctgtcg 3060 aacttctaaa tgaagtttat gcaaaggggc tgcattcaat atttccgcaa gtgtgtgttt 3120 ctcttagaat ctttctaaca ttgcccgttt caacatctga aggagaaaga tctttcagca 3180 agttagctat catcaaagat tatcttcgtt ccacaatggg ccaggagcga ttgtgctact 3240 tgatgatcct ttcaatcgaa tcagatttag ccattaacgt taattatgaa gaagttattt 3300 caaattttgc tgctaagaag gcccggaaga tgtgtttgtc aattaagaat tgattttatc 3360 atagtttgtt cttgcaaacc aaaattaaat gtgattttag gttttttatg aaagtaaact 3420 atattaattt atttgctatt ttaagacctg aagtatgata gggcatccat taaacttttg 3480 gacccgggca tcacaaaagc tctgggcggg cctg 3514 // ID Gypsy-1_SI-I repbase; DNA; INV; 4904 BP. XX AC AEAQ01029017; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_SI_; KW Gypsy-1_SI-LTR; Gypsy-1_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4904 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01029017; Positions 5428 525. XX CC Positions [2246-2668] - Reverse transcriptase CC Positions [3722-4195] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 287..4594 FT /product="Gypsy-1_SI-I_1p" FT /translation="MDRQQLEDLSHAELQTEVRRYGLEPQATREALIDQIM FT SHLERNGPLNDLRQDATTALQAAGSFGSTSTSGVQSGSSREALPQICAMFA FT EQMKQQQAFMQQLIAAVTAGQSRSQPAPSLPTPPTVVQATLPPRNTILSAV FT SPGQAIKLLASQVPEFSGAEDDDVNSWIQKVEQVSRIHGASDDITLLAATG FT KLTKIARRWFDSKTGVVNQTWPIFKQAVISRFQRETLVQVTLHKVEARRWN FT YPRETFQEYAMDKIDMLHSLNLPDRDVINYLVSGINNMAIKSAAAALPVDN FT VDDFLKRMHNITASFSEPFKKAFVPPQKIEKDKAKPINGTSVRNSPAEKPV FT KDLFCVYCKRTGHVRDDCFRLKRKEQGQRTATPSTPATVAAAKEDDPSASP FT PQVVGAVSNLDRTLEISDTHLKVVAINGRPCNLIALLDTGSPCSFVHTTVF FT ERIFDRSSSSLVKANRSLVAINGLPIKTMGTVASSIQFESLPDFTGEIEFL FT VLQNNVFTVDLVIGRDFLRTHQITVLYSPKADTEENKLELLQHVASADVVD FT ETENSILSYMASVEIDFDGETKKSLLSIIDEVENTEVAPVKDDYAVRVSLK FT DDSTYAFAPRRFAWAERQQIREITDDLLKRGIIKHSVSPYCARVVPVRKRN FT GALRLCVDLRPLNSRVNKQKYPFPIIEDCIARLGNNTIFTSLDIQDGFHNI FT PVHPEHTKYFSFATPDGQFEFTRLPFGYCEAPAEFQKRLVHILQPLIREDR FT VLVYIDDILIPSASVTDNLATLKEVLLLLKRHAFQVNYKKCHFLKTTIEYL FT GYVISPNGITLSPRHVEAVLNFPLPKNINDLQRFLGLTGYFRKFIQDYASI FT AKPLRNLLRKAIDFNFDSSCLEAFNALKNKLISYPVLRVYNPKVETELHTD FT ASAHAVAGILMQKQESGKWAPIAYYSQATNQAEAKYHSFELEMLAIVKSVE FT RFHLYLYGLDFTIVTDCNALVYAVNKAHLNPRIARWALRLQNYKFKVVHRA FT GHRMTHVDALSRISAYVGAMPIERELEYKQLLDDRLKILAEELEFAENEKF FT ELIDGLVFRKCADKPRFVIPDSMINNIIRIYHDEMAHCGVEKTVQGIGSNY FT WFPSLRKKVRSYVDNCLTCLTANSSVHTREGEMQEVDSPKRPFQIVHCDHF FT GPLTDTSQGFKYILVLVDAYTRFTWLFPAKTTSSKETIKHLLGVFNVFGFP FT NTLVSDRGTAFTSQEFLDFTNSKKIYHRLVAVAAPWSNGLVERVNKFLKSS FT LKKLTEEPKTWNIHLDKIQYVMNNTHHASLNASPSKLLLGYDQKRHADADI FT VESLNQIARVELCCNEERENMRTIAEEVSNKIRNYNKIYYDKKHAKPTKYQ FT IGDYVLIRDTASKPGEDKKLKANYKGPYVVAKILNKNRYVIKDIPGFNITS FT RPYDSILSPDRLKPWIKPVNSE" XX SQ Sequence 4904 BP; 1472 A; 1048 C; 1081 G; 1303 T; 0 other; acatatcaga agtgggattc acctcgtggt gctgtgcgga aattcgtgcc ggtcgcgttc 60 gcttttgtct cgtggtctcg tgccgttgtg ctgctcacag agcagagagg atcgaccgcg 120 tggtagctac gcgccgccag ttttttttaa aaagaaaccg attatctcgt ctcgtggtgt 180 cgtgccgttg tgctgctcat agagcagaga ggtccgaccg cgtggtacac cacgcgccgc 240 cattttaagt ttcattcaaa gcgtctccca cgtgtcacgc catacaatgg atcgacaaca 300 gttggaagat ctttcgcatg cggagttgca aacagaggtt agacgctacg ggttagagcc 360 acaagccacc cgtgaagctc ttatcgatca aattatgagc catttggaaa gaaatggccc 420 gcttaacgat ttgcgccagg atgccaccac ggctctgcaa gctgccgggt cgttcggttc 480 gacttccact agcggtgttc agtcaggctc atcgagagaa gccttgccgc agatttgcgc 540 aatgttcgcg gaacaaatga agcaacaaca agctttcatg caacaattaa tagctgcggt 600 gaccgcgggg cagagccgtt cacagcctgc tccctcgtta ccgacgccac cgaccgtcgt 660 acaagctaca cttccgccga ggaatacgat cctttcagcg gtgtccccgg gacaggcgat 720 taaactacta gcgtcacaag tgcccgaatt cagtggagct gaagatgacg acgtgaacag 780 ttggatccag aaagtcgagc aagtgtctcg tatacacgga gcctcagacg acatcacatt 840 gttagccgcc acgggtaaat taaccaaaat tgcccgtcga tggtttgatt ccaaaacagg 900 tgtcgtaaat cagacctggc ctatttttaa gcaagcggtg ataagtcgtt ttcagaggga 960 aactcttgta caagttacct tgcataaagt tgaggctcgt agatggaact atccaagaga 1020 gacgttccag gaatatgcta tggataaaat agatatgcta catagcttaa atcttccaga 1080 tcgagatgtt ataaattatc tcgttagtgg cattaacaat atggccatta agagcgcagc 1140 agcggcgcta ccagtggaca acgtagacga cttcttgaaa aggatgcata acatcacggc 1200 ttccttctcc gagccgttta agaaagcttt cgtgccgcct caaaaaatag agaaggacaa 1260 agcgaagcca atcaacggaa ccagtgtgag gaacagcccc gcagagaaac cagtcaagga 1320 tctcttctgt gtgtattgca aacgcacagg gcacgttcga gacgactgtt tcaggctgaa 1380 gcgcaaggag caaggacaga gaacagcgac tccttcaacg ccagccacgg tggcggcagc 1440 caaggaagat gacccctccg cctctccacc gcaggtcgtc ggcgcggttt cgaacttgga 1500 cagaactctt gagataagtg acacacatct gaaagtcgtc gccattaacg gtagaccttg 1560 taatttaatt gcgctattag atacaggcag tccctgttct ttcgtacaca caaccgtttt 1620 tgagagaatc tttgatcgat cgtcgtcctc ccttgttaag gctaatcgtt ctcttgtagc 1680 tattaatggc ttaccgatta aaacaatggg tacagttgcg tcgtcaatcc aattcgagtc 1740 gttacctgat ttcacagggg aaatcgaatt tctggtttta caaaataacg tatttactgt 1800 agatcttgtt atcggtcgcg attttttgcg tacacatcaa atcaccgttc tgtattctcc 1860 aaaagcagac acggaagaaa ataaattaga gttgctacag catgttgcgt ccgcggacgt 1920 ggtggacgaa acggagaact caatactttc ttatatggca agcgtggaaa ttgattttga 1980 cggggaaaca aaaaagagtc ttttgtccat aatagacgag gtcgaaaata cggaggttgc 2040 tcccgtaaag gacgattatg ccgtcagggt ttctcttaaa gacgactcaa catatgcgtt 2100 cgcccccaga cgatttgcat gggcggaacg acaacaaata agagagatta ccgatgatct 2160 ccttaaacga ggaattatta aacatagcgt atccccatac tgtgctcgag tcgtacctgt 2220 tcgcaaacgg aacggtgctt tacgtttgtg cgtagatttg cgtcctctaa actcgcgggt 2280 aaataaacaa aaatatcctt tcccgattat cgaggattgt atagcacgtc tgggcaacaa 2340 cacgatattc acttctttag acatacaaga cggttttcat aatattccag tgcatccaga 2400 acatactaaa tatttttcat ttgcaacgcc cgatgggcaa tttgaattca cacgtcttcc 2460 tttcggttat tgtgaggcac cagccgaatt tcagaaacga ctggttcaca ttttacagcc 2520 tctgataaga gaagatcgcg ttttagtata tatcgatgat attttgatac cgtcagcatc 2580 ggttacagat aaccttgcaa cattaaaaga ggtattgttg ttgctgaaac gtcacgcatt 2640 tcaggtaaat tataagaagt gccattttct gaagacaacc attgaatatc tcggttatgt 2700 tatttcgcca aacggtatca cgttgagtcc gcgacatgtc gaggcagtac taaattttcc 2760 tcttccaaaa aatattaatg acttgcagcg atttctcggg ttaacgggct atttccgtaa 2820 atttatacaa gattatgcat ctatagctaa gcccctcagg aatcttttgc gaaaggcaat 2880 tgattttaat tttgatagta gttgtcttga agcattcaac gcgcttaaaa ataaattaat 2940 ttcatatcca gtacttagag tatataatcc aaaagtagag actgaattac atacggacgc 3000 cagcgctcac gcggtagccg gaattctcat gcaaaaacaa gaatcaggta aatgggcgcc 3060 tatcgcttat tatagtcaag ctaccaacca ggcggaggct aaatatcata gcttcgagct 3120 ggagatgctt gccattgtta aatcagttga gcgatttcat ctctatttgt acggtctgga 3180 tttcacgatt gtgaccgact gtaatgccct agtgtacgcg gtaaataaag ctcacttgaa 3240 tccgcgaatc gcgcgctggg cactaagatt gcaaaattac aaattcaagg ttgttcatcg 3300 cgcggggcat cgtatgaccc atgttgacgc gcttagtcgg attagtgcct atgttggtgc 3360 tatgccaatt gaaagagagt tggaatataa acaattacta gatgatcgac ttaaaattct 3420 agcggaggag cttgagtttg ctgaaaacga gaagttcgag ctaattgacg gccttgtctt 3480 cagaaagtgc gccgataaac ctcgtttcgt gatccctgat tcgatgataa ataatattat 3540 tagaatttat cacgacgaaa tggcacactg cggtgtcgag aaaactgtac aagggatagg 3600 ctctaattat tggtttcctt cgttaagaaa aaaagttcga tcttacgtag acaattgcct 3660 gacttgcctc acggccaatt cgtcagttca cactcgagag ggggaaatgc aggaggtaga 3720 cagcccgaag cgacccttcc aaatagtcca ttgtgaccat ttcggtccgc ttacagacac 3780 ttcacaaggg tttaagtata tattagtttt agttgatgcc tatactaggt ttacgtggct 3840 atttccagcc aagacaacaa gctcaaaaga gaccattaag catttattag gagtgttcaa 3900 cgtctttggt tttccaaata ccctcgtttc ggatcgaggt acagctttca cttcacaaga 3960 atttttagat ttcacaaatt cgaagaaaat ctatcaccga ttggtagctg ttgcggcgcc 4020 atggagtaat ggattggtag aaagagtcaa taaattttta aaatcttctt tgaaaaaact 4080 tacagaagaa cctaagacgt ggaacattca tttagataag atacaatacg taatgaataa 4140 tacacatcat gcgtcgttaa acgcgtcccc gtccaaatta ttgttggggt acgaccagaa 4200 gcgtcatgct gacgccgata ttgttgaaag tttaaatcaa attgcgagag tcgaactctg 4260 ctgcaatgag gagcgtgaaa atatgcgcac gattgcagaa gaggtatcga ataaaattag 4320 aaactataat aagatttatt atgacaaaaa gcacgcgaag cctactaaat atcaaatagg 4380 agattatgta ttaattcgag acacggcgtc aaagccaggc gaggataaaa aactcaaagc 4440 gaattataaa ggcccttacg tagtggcaaa aatcctaaat aaaaatcgct atgtcataaa 4500 ggatattcca ggctttaaca taacgtctcg cccgtacgat tcaatacttt ccccagatcg 4560 cctaaaaccg tggattaagc cagttaacag cgaatagtta aaaaaaaaaa aaaaatgcgt 4620 gcggcataga ttaagatttt gttaccaatt atccaaagat tgtaatactt ccaaaggtta 4680 taatactacc aaagattata atactttgat taatttcagt aagtaacagt aaatgtattc 4740 aaacaataat gataagtaaa acctcaggcg gtattcagac tgtaaatacg tccatacaag 4800 aatttccaaa ctgtattcat attgttatat catattgtgt tatatcgagt caataattgt 4860 caatcatact attcgagacg aatatccgtc agtccggccg aagc 4904 // ID Gypsy-94_AA-LTR repbase; DNA; INV; 132 BP. XX AC supercont1.342; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-94_AA_; KW Gypsy-94_AA-I; Gypsy-94_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-132 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.342; Positions 957210 957079. XX SQ Sequence 132 BP; 32 A; 31 C; 21 G; 48 T; 0 other; tggttatcaa ctatcaactt attagctcgt gtttaaatta ttactctgaa ggaaagtcat 60 ctctaaagga ttcatctcca tttcacttcg ctgattcatt tcgtccctgg attgtccccc 120 ggctggatat ca 132 // ID hAT-23_HM repbase; DNA; INV; 3428 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-23_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3428 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2012-2012 (2008). XX DR [1] (Consensus) XX CC Average identity to consensus >96%. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 613..3237 FT /product="hAT-23_HM_1p" FT /translation="MNRAKKLSGAEFKKKRRDRLESDAKLRNCFKKWLVTN FT SSVEKQIISEDICNEVDNDSTAEVAATSLITDLNSYHPLELIEEEILIGQS FT QDEFGPNDSEAQEDRNSQLSEPIKMGNSEFPPEISEISATSGENFQHSDPA FT TWPKMTDKTRCFLIQHGPEQERREFFPNTLCDFDNRMRHFSSKWYEKIHPN FT GEKFVRYWLLYSNKKDSLFCFCCLLFSTTKTNNFSEISKGFCDWKKLNPRI FT PEHENSNEHQRCYSDWKNLEKNLKEGKTLDSDLQRVINGEMKKWRDILKVI FT VDAILFCAKNNLALRGSTEVIGEQNSGIFLNLIELISHYYPLVAEHVASVK FT AKKTTTSYFSPRIQNELIELLGQKVRNEILSNVREAKYYSVLFDCTPDASH FT KEQMTQIIRYVHITEETCTIEESFVDFIESHEKTGKGLAAEITEKLEKDGL FT SISDCRGQGYDNGANMSGKYNGVQAHIHSLNEFARFVPCAAHTLNLVGVHA FT AEVSPLMITFFGKVQAIFNFFSSSTLRWEKLMKTLTISLKGNSDTRWSAKK FT EAITPLHRQIKEVLQVLESIIHTPKTNAVSVCSAKELIIQIDFSFLCLLDF FT WCQILSLIDRENKLLQYKNVSIDIAAKKMKGLKASIQNLRDVGVDNIIKAA FT AETAIQIGIEGDFPMKRKRKVKQMALYEAEDDFRRLSPETDFGSQCNLVFD FT SILTQIEWRFEAMSAVSSDFDFLSGHSLSKSSVDELKTKAKNLSKIYKADL FT DSSDFQSEMASFKYQAAAMMENFEKSSPMDILQLIHKYSLTDAYPNTAIAI FT RIFLTLPVTVATCERSFSKLKIIKNYLRSTMGQERLSCLAVVSIEHEVANA FT LDFDDVISDFASKKARKVTLN*" XX SQ Sequence 3428 BP; 1218 A; 600 C; 648 G; 962 T; 0 other; cagggccgta ccaagggcgg ggccagccgg gccgcggcct gaggcgccaa aattgggagg 60 gcgcaaaatt ccccaaatta atttatttat tttcacatta tgaaactaat ttgcgtggcg 120 tcacttttat gttgataaga aaattaatta aaattgcgcc tcctgacgaa ctttaaaata 180 aaagctcatt taccaaacag aagagcgccc ttaaacacta aaaagttgta ttttctgaac 240 attatttttt acaatgaatc gtgcaaaaaa attatctggt gctgaattta agaacgacaa 300 ttaaaaacca ataattattt ataatataaa caaatgattc attaaggatc atataaatta 360 atttaacgca cacattaaaa attattctaa aaagaaaaaa atatagaaaa aaagaacttc 420 gaatttaaaa aaaagaacgc ttaatttaag cggcttaaat aaagaaacaa atgaaatgaa 480 agctaaagaa agtaaagtaa gcaaataata ataaaaataa agagggagaa gaaaagaaaa 540 aagaaaaaat aaataataga agagcgcaat taaaaaataa aaagttgtat tttctgaaca 600 ttatttttta caatgaatcg cgcaaaaaaa ttatctggtg ctgaatttaa gaagaaaaga 660 agagaccgtc ttgaaagtga cgcaaaacta agaaattgtt tcaagaagtg gttggtaaca 720 aactcttctg tggaaaaaca aattatttca gaggatattt gcaatgaagt tgataatgat 780 tcaacagctg aagttgctgc tacatcttta attactgact taaattcata tcatccattg 840 gaattaattg aagaggagat tttgattggc caaagtcaag acgaatttgg accaaatgat 900 tcagaagcac aagaagatag aaattctcag ctttccgaac caattaaaat gggcaacagt 960 gaatttcctc ctgaaatctc agaaatttca gcaacctcag gagaaaactt tcaacacagt 1020 gatccagcta catggcctaa aatgacagat aaaacaaggt gctttttgat tcagcatggg 1080 ccggagcaag aaagaagaga atttttccca aacacacttt gtgactttga caacagaatg 1140 cgacacttca gctcaaaatg gtatgaaaaa attcatccca atggtgaaaa atttgttcgc 1200 tactggctgc tgtacagcaa caaaaaagat tctttgtttt gtttttgctg tctgctattt 1260 tcaacgacaa aaaccaacaa tttttcagaa atttcaaaag gattttgtga ctggaaaaaa 1320 ctaaacccaa gaattccaga acacgaaaac agcaatgaac accaaagatg ctattctgat 1380 tggaaaaatc tggaaaaaaa tctcaaggaa ggaaagacct tggattctga tctgcagaga 1440 gttatcaacg gagagatgaa aaagtggaga gatatcttga aagtgattgt tgatgcaatt 1500 ttgttctgtg ccaagaacaa tcttgccctt cggggttcaa cagaagtaat tggagagcaa 1560 aacagtggca tctttctgaa cttaattgag ctgataagcc attattatcc tttagtggct 1620 gaacacgttg catccgttaa ggcaaagaaa accacaacat cctacttttc tcctcgaatt 1680 caaaatgagc tgattgaatt acttgggcag aaagtcagga atgaaatttt gtcaaacgtc 1740 agagaagcta aatactactc tgttttgttt gactgcactc cagatgcctc ccacaaagag 1800 cagatgaccc aaataatcag gtatgtccac atcactgagg aaacctgtac aattgaggaa 1860 agctttgtgg acttcattga atctcacgaa aaaaccggaa agggccttgc agctgaaatc 1920 accgaaaaac tggagaagga tggcctgagc atttctgatt gtcggggcca aggttacgac 1980 aatggagcca atatgtctgg aaaatataat ggcgtccaag ctcacatcca ttctctaaat 2040 gagtttgcaa gatttgttcc atgtgcagct cacactttaa accttgttgg tgtccatgct 2100 gctgaagttt ctccactcat gattacattc tttggaaaag ttcaagccat ctttaacttt 2160 ttctcaagtt ctaccttaag atgggaaaaa ctgatgaaaa cattgaccat ctcactaaag 2220 ggaaatagtg atacaagatg gtctgccaaa aaagaagcaa tcaccccact acacagacaa 2280 atcaaagaag ttcttcaagt tttggaatcc ataattcaca ctcccaagac aaatgctgtc 2340 tcagtttgca gtgcaaaaga gctcatcatt caaattgatt tcagttttct gtgtttgttg 2400 gatttttggt gtcaaattct ttctttaatt gaccgggaaa acaaactgct tcagtataaa 2460 aacgtttcaa ttgacattgc agcaaaaaaa atgaaaggac tgaaggcttc cattcagaat 2520 ctgagagatg ttggggtgga caacattatc aaagctgctg cagaaacagc tatccaaatt 2580 ggaattgaag gagactttcc tatgaagaga aagcgcaaag tgaagcagat ggcactttat 2640 gaagctgaag atgactttcg ccgtttgtca ccagaaacag atttcggatc ccagtgcaac 2700 cttgtgtttg acagcattct cacacaaatt gagtggcggt ttgaagctat gtctgcagta 2760 tcctcagatt ttgacttcct cagtggacat tctctctcca aaagttcagt ggatgaactt 2820 aagactaaag ccaaaaattt gtctaaaatt tacaaggcag atttagattc ttcagatttc 2880 cagtcagaaa tggcaagctt taaatatcaa gctgctgcca tgatggaaaa ctttgaaaaa 2940 tctagcccga tggacatttt gcagctcatt cataaatact cattgactga tgcttatcct 3000 aacacggcaa ttgctattcg catcttcctc accttaccag ttacagttgc cacgtgtgaa 3060 agaagtttca gtaaacttaa gatcataaaa aactatttga gatcaactat gggccaagaa 3120 cgtttgtctt gtctggcagt agtttcaatt gaacatgaag tggccaacgc acttgatttt 3180 gatgacgtca ttagtgactt tgcctccaaa aaagccagaa aagtgacctt gaactgaagt 3240 ttgtgaaaca ttggtgacaa attgttcttt taacttgatg cgataaattt tgtgatcaat 3300 aattaatttt ttttcatgta aatatatata tatatatata tataatgtgt tttttgtata 3360 tttttttggg aggggggcgc aattttattt gcttgcccga agcgcaaagt tggcctggta 3420 cggccctg 3428 // ID Gypsy-192_AA-LTR repbase; DNA; INV; 171 BP. XX AC supercont1.65; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-192_AA_; KW Gypsy-192_AA-I; Gypsy-192_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-171 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.65; Positions 2820119 2820289. XX SQ Sequence 171 BP; 62 A; 30 C; 39 G; 40 T; 0 other; tgtagtgtcg gggttaactg ggttatgtat tagtattcat cactacaaga caacacaaat 60 acacccaagg gaaaggtatg ctaaacaaag agaataaagt aaggagaact agagttgagt 120 agactaccga tcgaagcaca cgtgttttct ttgcgagcaa ccgatatcac a 171 // ID Gypsy10-NVi_I repbase; DNA; INV; 4775 BP. XX AC AAZX01006256; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10-NV; KW Gypsy10-NVi_LTR; internal portion; Gypsy10-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4775 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1147-1147 (2007). XX DR Genome; AAZX01006256; Positions 7410 2636. XX CC Positions [3509-4021] - Integrase core CC 'ATTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(122..2986,2990..4726) FT /product="Gypsy10-NV_I_1p" FT /translation="MPKFGMNDDETNKGEDHFPEICAATTLVNLGNTTGQY FT SWPTATGTTTTTSQSSRIQSPAQHQIVPTSYGTYLIDGQEYSKTSTGFVPV FT STYNGGITDGQSNIPSNTPPPTIMQVPVVNFAGNIGRLEEFKINSDWNIYH FT ERLEQYFNANFVDDARKVSVLITAIGPEVYKILRNLCDPQLPHDKPYTELC FT EILSKQFSPRISIFQQRKRFYDLQQNSGESISQWYARIKKGAVHCDFGTEL FT EGRIKDKFVTGMKEGKILDRICERSHKDALKDIYETALAKEASLAVSANSN FT QCDVNKLTQHNRGSSSKKNAGEAGRGERGKQQCATSQSQPRDGAARQQGSS FT KQQKASCNHCGGVNHVFSKCKYKTYKCNNCSKKGHISKVCKEPKNGEVKYL FT EDGPDTETNYISLHNIVDREDAIPPIQTDVFIEGKRVVMEIDSGAGVTIIP FT FEIYKKKQLTAKIEKCQKVIKTYDNRPIKPIGQIVVDIIANDVTVRDAKII FT IIKENRQVSLMGRDLMNKLNISLVGPQRPIDVKKIDLNSPDDLKKLLNKYA FT DLFDGQLGKLKGDKVHLKLSSDAKPVFLKPNSIPFAFKSQVESDLKRLQDQ FT GVIEKIDTNDWGTPLVPVLKEDNTVRTCANYKKTINPWLVDHRYPIPRIEE FT IFSALKGGEEFTKLDLEWAYNQIEVDEETSRILAWSTHMGVFKVKRLSFGP FT KTACSIFQEKIEGVVRDLQGCKNYFDDLIVTGKDRKKHLENLENVLKALKD FT HGLKLRKNKCEFMKPRVFYLGHIIDKNGLSKNPENVKAIVEIKRPTNVQEI FT QAFAGMVNYYMKFIPNLSTMLSPIYKLLKKDTPFVWSKECEQSFNEIKREL FT AADRNLVHYDPDLEVKLTCDASNVGIGSVLSHVLPNREEKPICFASRTLTK FT SERNYSVIQKEALAIYWSVKKFYQYLIGRPFILESDHKPLLAIFGEKNGLP FT MAAGRLQRWALFLSGFNYKMKYVKGKNNGGADGLSRLPLRVDDKVNEPNDE FT YINFLIEEKLPVDYKQIVKEIRTDVVLSKVYVYSKNGWPDQVADELKPFMY FT RANEISIENNVLMWGYRVLIPKKLRDRILDELHCTHMGANKMKSIARQYFW FT WPKLDSDIEQYSKNCDVCNTLMSNPKKAELIKYDQCKEVLERVHADFLGPI FT QGKMYFILTDAYSKWPEVYVMSSTNSTNTVNKLRDFCSRFGLPRKIITDNG FT PQLVSEEFESFCRSNGIKHPLIAPYHPSSNGAAENAVRSFKNGFKKALLDP FT KNSKLNEETIISRYLFAYRNSPHAYTGESPAKLMFGRQLRNCFDLLRENNT FT IQQNKERQMRNYKGNRSEYFDIDDIVWVLDYRCPTKKVWTKAKIIDCLGER FT NYICKTLNEELYWKRHLDQMRRASNFVENDIINMNDRPNNSTESVTNDIVT FT SSVSFPLNGKHKDSGMNNDIIIPNNIEDKQVIEKRHEMSSSEFRREESVIP FT SKIENNVIDRDNVRVKETNGENSIVKTTNEKTNIETVGNAGNKSQKPPITL FT NVNERPKRTIKPPKRLNL" XX SQ Sequence 4775 BP; 1758 A; 769 C; 1022 G; 1226 T; 0 other; ttggcgacga ggataaaaac tttgataaaa acacatggtg ttatagtggc aaggaataca 60 gtgcaatcgt taaatagtaa agtgttacaa taacggaagg cacaaaaggt agaggtacaa 120 gatgccgaag tttggcatga acgacgacga gacgaacaaa ggagaagatc atttcccgga 180 aatatgtgcg gctactacgc ttgtgaattt gggaaatacg acaggtcaat actcgtggcc 240 aactgcaaca ggtacaacaa caactacttc gcagagttct cgaattcagt cgccagcaca 300 acatcaaata gtcccaacgt cgtatggtac gtatctaata gacggccaag agtacagtaa 360 aacatctacg ggatttgttc ctgtttcaac gtataacggc ggtataactg atggacaatc 420 caatataccc tcgaacacac caccgcctac gatcatgcaa gtgcctgttg taaattttgc 480 cggtaatatc ggaaggttgg aagaatttaa gattaattca gattggaata tttatcatga 540 acggttagaa cagtacttta acgcgaattt cgtggatgac gcaagaaaag tgtcagtact 600 aattacggct ataggcccag aggtttataa aattttaagg aacttatgtg atccgcagtt 660 gcctcatgat aaaccgtaca cagaattatg tgaaatactg agcaaacaat tttccccacg 720 aatctccatc ttccagcaaa gaaaacgatt ctacgatctt caacaaaaca gcggtgaatc 780 gataagccaa tggtacgcac gtataaaaaa aggcgcagtg cattgtgact tcggcacgga 840 gctcgaagga agaatcaaag acaagttcgt caccggcatg aaagaaggga aaatcctcga 900 cagaatctgc gaaaggtcac ataaagacgc cctgaaggac atctatgaaa cggccttagc 960 aaaggaagca tcgttagcgg tttcggcgaa cagcaaccag tgcgatgtaa acaagttaac 1020 gcaacacaac agggggagca gttccaagaa gaacgcggga gaagccggac gaggggaacg 1080 agggaaacag cagtgcgcaa cttcacagtc ccagccgaga gatggcgcag cgaggcagca 1140 aggctcatca aagcagcaga aggcttcatg taaccattgc ggcggtgtta accacgtttt 1200 ttccaaatgt aaatataaga cttataaatg taataattgt tcaaaaaaag gacacataag 1260 taaagtatgt aaagaaccaa agaatggtga ggttaagtat cttgaggacg gtccggatac 1320 cgagactaat tatatatcgt tgcataatat agtagaccgc gaggacgcga ttcctcctat 1380 tcagacagac gtttttattg aagggaaacg ggtcgttatg gaaatagact caggcgcggg 1440 agtaacgatt ataccatttg agatttataa gaaaaaacaa ttaaccgcta aaattgagaa 1500 atgccaaaaa gtaattaaaa cgtacgataa tagaccaatt aagccaatag gccaaatagt 1560 ggtagatata atagctaatg acgttacggt aagagatgct aaaattatta ttataaaaga 1620 aaatagacaa gttagcttga tgggtagaga cttaatgaat aagcttaata tttcgttagt 1680 aggacctcaa aggccaatag acgttaaaaa aatagatctc aattcaccag atgacttgaa 1740 aaaattattg aacaagtacg cagatttatt cgatggtcaa ttaggaaaat taaaaggtga 1800 taaagttcat ctaaaattga gtagtgatgc gaaaccagtt ttccttaagc ctaattcaat 1860 cccatttgct tttaaatctc aagtagagtc agatttgaaa agattgcaag atcagggagt 1920 tattgagaaa atcgatacaa atgattgggg tactccatta gtcccggtac taaaagagga 1980 taatacagtt agaacatgcg caaattataa aaaaactatt aatccttggc ttgtagatca 2040 tagatatccc attccgagaa ttgaggaaat attttcagct cttaaaggtg gggaagagtt 2100 tacaaaattg gacctagaat gggcttataa tcaaattgaa gttgacgaag aaacgtctag 2160 aatattggct tggtctactc atatgggagt atttaaggtg aagcggttat cttttggacc 2220 aaaaacagct tgctctatat tccaagagaa aattgaagga gtagttagag atttacaggg 2280 ctgtaaaaat tactttgacg atttaatagt aacgggtaaa gataggaaaa agcaccttga 2340 gaatctagaa aatgtattga aagcgctgaa agaccatggt ctaaaattaa gaaagaacaa 2400 atgtgagttt atgaaaccaa gagtttttta tttggggcac atcattgata agaatgggtt 2460 gagtaagaat cctgaaaacg taaaagcaat cgttgagata aaacggccga cgaacgttca 2520 ggagatacaa gcttttgcag gaatggtaaa ttattacatg aaattcatac ccaacttatc 2580 aacgatgctt agcccgatat acaagttgtt aaaaaaagat acgcccttcg tatggtcaaa 2640 agagtgtgaa cagtcgttta acgagataaa aagagaacta gcggctgata ggaatctagt 2700 tcactatgat ccagatctag aggtcaaact gacctgtgat gcttctaatg tgggaatcgg 2760 atcagtacta tcacacgtct tgccaaacag ggaagagaaa cccatttgtt ttgcctcgcg 2820 aactttaacg aaatctgaga gaaattattc cgtaatacaa aaagaggcgt tggccatata 2880 ttggagcgtt aagaaatttt atcaatattt aattggcagg ccatttattc tggaatcgga 2940 tcataaaccc ttactggcaa tttttggtga gaaaaatggg cttccataaa tggctgcagg 3000 acgtctgcag cgttgggctc tctttttaag cgggtttaat tataaaatga aatatgtcaa 3060 gggtaaaaat aatgggggag ccgatgggct atctagatta ccccttcggg ttgacgataa 3120 ggtcaatgaa cccaacgatg aatatattaa ttttttgata gaagaaaaat tgccggttga 3180 ttacaaacaa attgtaaaag aaattagaac agatgtagtt ctgagtaagg tttacgtgta 3240 ttctaagaat ggatggcccg accaagttgc ggatgagtta aaacccttta tgtatagagc 3300 aaatgaaata agtatagaaa acaatgtatt gatgtggggt tacagggtat taattccaaa 3360 gaaattaaga gataggattt tagacgaatt acactgtaca cacatgggtg ctaataaaat 3420 gaaatcgatt gctagacaat acttctggtg gccaaagctg gactccgata tagaacaata 3480 ttcaaaaaat tgtgatgtat gtaatactct aatgagcaac ccaaaaaaag ccgaattaat 3540 taagtacgat caatgtaaag aagttcttga gcgagtacac gctgattttt tgggaccaat 3600 acaagggaaa atgtacttta ttttgacgga cgcttactca aaatggcccg aagtatacgt 3660 catgagttca actaattcaa caaacactgt aaataagcta agagatttct gttcgagatt 3720 tggattaccc agaaaaatca taaccgataa tggtcctcaa cttgtttcgg aggaatttga 3780 aagtttttgc agatcgaatg gtataaagca tcctttgatt gctccatacc acccttcttc 3840 gaacggcgct gccgaaaacg cggtaagatc ttttaaaaac ggttttaaaa aagcacttct 3900 tgatccaaaa aattcaaagc ttaatgaaga aacgataatc tctcgatatc tcttcgctta 3960 tcgcaattca cctcacgcat atactggaga aagtcctgcg aaattgatgt ttggtagaca 4020 gttgcgaaat tgttttgatc tcttaagaga aaataatacg attcaacaaa ataaagaaag 4080 acagatgaga aattataagg gtaaccgtag tgagtatttt gacattgatg atattgtatg 4140 ggttcttgat tatagatgtc ctaccaaaaa agtttggaca aaagcaaaaa tcattgattg 4200 tttaggcgaa agaaactata tatgtaaaac tcttaatgag gaattatatt ggaaaagaca 4260 tctagatcaa atgcgtagag cgtcaaattt tgttgaaaat gatattatca atatgaatga 4320 tcggcctaat aattcaacag aaagtgtaac aaatgatatt gtaacatcga gtgtctcctt 4380 tccattaaat ggaaagcata aagatagtgg aatgaacaat gatataatta taccaaataa 4440 tatcgaggat aaacaagtaa tcgagaagag acacgagatg agtagttccg aatttaggag 4500 ggaggaaagt gtgataccta gtaaaataga aaataatgta atagaccgag ataatgttag 4560 agtgaaggag acaaatggtg aaaacagtat agtcaaaact accaatgaaa aaactaatat 4620 cgaaactgtc ggtaacgctg gaaataaatc ccagaaacca ccgattacct taaatgtaaa 4680 cgagcggcct aaacgtacta taaaaccacc taaacgttta aatttataat aaataagagg 4740 tactaaaact taagtgttaa ctaagggcgg aggta 4775 // ID Kiri-29_AAe repbase; DNA; INV; 4250 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-29_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4250 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 724-724 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 252..1040 FT /product="Kiri-29_AAe_1p" FT /translation="MSLNVKTNKPLTRSGSTSSASSTNNELPAQRAVPEFA FT TIASLDDLWTRMQGMFERTYEKIESSKTELAERITDVEGQLGKVREECSSR FT VDKLEETLSGVRLDLDHTTEAVHRMEKNDELIISGIPFQNNENLASIFLTI FT SQSIGYSEDSSPFVELKRLARQPIARGATPPILCQFAHRLTRNEFYRKYLG FT KRNLSLRNIGFENDNRIFINENLTSKARSIRTEAIKQKKLGRIQSVTTRDG FT VVYVKFSGSGELDAIHNIQQLN" FT CDS 1227..4082 FT /product="Kiri-29_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MWWSLNSIMDNLHTDVHNALNNLSIPRAVINSALVDG FT KLNICHINVQSLCARNFSKFEELKRTIENSKLDVVCFTETWLNDTISDRMI FT CLNGFKLIRNDRNRHGGGICVFIRSNLSCRVVCKSSVSDSQSTLQRTEFLI FT LEVVVGDHCLLLAVYYNPPDVDCADILEKHITDYSLHYSSTFLIGDFNTDP FT RKHTGRAVRLNDVFANMSYDVVNHEPTFFFNSGSSLLDLLITDSATSVLRF FT NQISMPGISKHDLIFASLDFDRDSQQAGFWCRDYYNYDANALHSEFSNFDW FT NSFLQIDDPDVLLSVLNPRLTDLHERFFPMKFKKFKKNPWYSRDIEKAMID FT RDLAYRIWKRTRSSQHRADYNHLRNRVNDLISKAKCDYDRRRLNLTLPSKQ FT LWNNVKSLGVSHKKSSSQNCAHSSDVINDYFCSNFSIDNSEGSYVRGVSPG FT FEFRAVQNAEIVNAIFGIKSNSTGLDNIPIRFIRIIAPYAIPIFEHVFNRI FT IFSEKFPAVWKRTKVIPLQKKQGVNSISNLRPISLLSTISKVFERLIKIQI FT SDFIDRMNFLSPFQSGFRKCHSTETALMRVHDDIAQTVDKSGITVLLLIDF FT AKAFDSVVHRKLVNKLSTTFRFSVSATNLIRNYLSERSQAVFSNGILSPFK FT PIISGVPQGSVLGPLLFSLFINDLPSVLDFCSVHLFADDVQVYLCSNGTID FT IRNMAEKINHDLYKIVQWSRKNSLPINPSKTKAMLLHKLKSPPNPPSLNID FT GVEVEFVSKASNLGVVFKNSLEWDGHINSQCSKIYSSLKRLNLTTRHLDIP FT TKKHLFKTFLLPHFIYGDFIYSNASISSLNRLRVAFNACVRYVYNLSRFSH FT VSHLHTSMIGCPFVNFYKLRSCVTLFKIIKTKAPNFLFSKLSPLRGIRTRS FT FYVPRHSSFYYGQSFFVRGIVNYNNLPNNIKSCESTTRFRSEIISYFQ" XX SQ Sequence 4250 BP; 1293 A; 774 C; 803 G; 1379 T; 1 other; agtttctgaa gggatgtaca gcgacgagta atcggttttg tgatcgcagc caaattatct 60 ttttcttcgc ttaattgcta cctaaaaata tccgctgctg gtgattttcc aaagttgaat 120 tagtgataaa tctgtgtctg gtggccgtta ttcaaaatat tcattaaact gtgaaatttc 180 ggcataagta gttttcatca taaagatttg cttgtgatta attgttataa ttacccttcg 240 gaatcgacac catgtctcta aacgtcaaga ccaacaaacc actcactcgc tcgggttcaa 300 cgtcgtctgc atcatccacc aacaatgagt taccagcaca acgtgccgta cctgaatttg 360 caaccatcgc gagtctggac gatttatgga cgagaatgca aggaatgttt gagaggacct 420 atgaaaaaat tgagagcagt aaaaccgaac ttgctgaacg cattactgat gttgagggcc 480 aactagggaa agtgagagag gagtgttcta gtagggtcga caagcttgaa gagaccttaa 540 gtggagtacg gctagatttg gaccatacca cggaggcagt acatcgtatg gagaaaaatg 600 acgagttgat catttctggg attccgtttc aaaataatga aaatctggct agtatctttc 660 taacaatttc tcaaagcatt ggctattctg aggatagctc tccttttgta gaacttaaac 720 gcttagctcg acaacctatt gcccgtggtg cgactccccc aattctttgt cagtttgccc 780 atcgtctcac tagaaacgag ttctatcgaa agtatcttgg taaacgtaac ctcagtctgc 840 gtaacattgg tttcgagaac gataatagaa ttttcatcaa cgagaacctt actagcaagg 900 ctaggtcaat tcgtactgag gcaatcaagc agaagaagct agggcgtata cagagtgtca 960 ctacgcgaga tggggtagtt tacgttaaat ttagtggctc tggggaactt gatgctatac 1020 acaatatcca acaactgaat taatatgaaa attgttatac tatatcagtg gtgggcgttt 1080 tttcttttta tcattgaata gcaatcaccg tagctaattt gatttggctg taaattaatg 1140 tgcaaacgat attcgaacta tgattagctt taagtttatg ttctctttta ctgttatcac 1200 tacaaataca ccgtctcaca cttcaaatgt ggtggtcatt aaattcgatt atggataatc 1260 ttcatactga cgttcacaat gccttgaata atctctcaat acctcgagcc gtaattaact 1320 ctgctcttgt cgatggtaag cttaatattt gtcacattaa cgtacagagt ttatgtgctc 1380 gtaactttag caagtttgag gaactgaaaa ggactatcga aaatagtaag cttgacgttg 1440 tttgctttac cgaaacatgg ttgaatgata ctattagcga tcgaatgatt tgtttgaatg 1500 ggtttaagct tattagaaat gatcgtaaca ggcacggagg tggtatatgt gttttcattc 1560 gaagtaatct ctcttgtcgt gtagtgtgca agtcttctgt cagtgattct caaagcactt 1620 tacaaagaac agaattcttg atattggaag ttgtggttgg tgatcattgt ctacttttgg 1680 ctgtctacta taatccacca gatgttgatt gtgctgatat tctggaaaaa cacataacag 1740 actattcact wcactattca tctacttttt taattggtga tttcaataca gatccccgaa 1800 agcatacggg tagagcagtg cgcttgaacg atgtttttgc caatatgtcg tatgatgtgg 1860 tcaatcatga gcctactttc ttcttcaatt ccggtagttc attattggat ttgctcatta 1920 cggactctgc cacttcagtg ttgaggttta atcagatttc aatgccaggt atttcaaaac 1980 atgatcttat attcgcatcg ctggatttcg atcgcgattc tcaacaggct ggtttttggt 2040 gtagagatta ttataattat gacgccaacg cactccactc tgagtttagt aatttcgact 2100 ggaatagttt tctgcaaatt gatgaccctg atgttttact aagtgttctc aaccctcgtt 2160 taaccgactt gcatgaacgt ttctttccaa tgaagttcaa gaagttcaag aaaaatccct 2220 ggtatagcag agacattgag aaggctatga tagatagaga tctagcatat cgtatttgga 2280 aaagaaccag aagtagtcag catagagcag attacaatca tctgagaaac cgtgttaacg 2340 atctcatatc caaagccaaa tgcgattacg ataggagaag attgaatttg actctaccgt 2400 cgaaacaatt gtggaataac gtaaaaagtc tgggagtatc tcataaaaag tcgtcatcgc 2460 aaaattgtgc tcattcatct gatgttatta acgattattt ttgttccaat ttttctattg 2520 ataacagtga aggttcatac gttcgaggtg tttcgccagg cttcgagttt agggcggttc 2580 agaatgctga aattgttaat gcaatctttg gaatcaaatc aaattccacc ggactagaca 2640 acattccaat caggtttatc agaatcattg ccccttatgc tattcctatt ttcgagcatg 2700 ttttcaatag aataattttc tcggaaaaat ttcctgctgt atggaaacgc acgaaagtta 2760 ttcctctgca gaagaagcag ggtgtaaatt ctatttcaaa tctaagacca ataagtttgc 2820 ttagtactat ttcaaaagtt tttgaaaggc tgataaagat tcaaatatca gattttatcg 2880 acagaatgaa cttcttaagt ccttttcagt caggtttcag aaaatgtcat agtactgaaa 2940 ctgcactgat gagagttcac gacgatattg cacagacagt agataagagc ggtattactg 3000 tgttgctact gattgatttc gcaaaagcat tcgatagtgt tgtacatagg aaactcgtta 3060 acaaattaag tactacattc agatttagtg tatcagcaac taatctaatt agaaactatc 3120 tgtcagaaag atctcaagct gtattttcta atggtatatt atctcctttc aaacctataa 3180 tatctggtgt tccccaagga tcagttcttg gaccgcttct gttctcttta tttataaatg 3240 acttgccttc tgttctcgat ttttgctctg tccacctatt tgcagatgat gttcaagttt 3300 atctatgttc caacggaact attgatattc gtaatatggc cgagaagatc aaccacgact 3360 tgtataagat cgtacagtgg tcacgaaaaa attcactgcc aatcaatcca tctaaaacga 3420 aagccatgct cttgcacaaa ttgaaatctc ctccaaatcc tccttctttg aatatcgatg 3480 gggttgaagt tgagtttgtg agcaaggcta gtaacttagg agttgtattt aaaaattcgc 3540 ttgagtggga tggacatata aattcgcagt gctctaaaat atacagttcc ctcaaacgct 3600 tgaaccttac tactagacac ctagatattc caacaaaaaa acatctgttt aagacatttt 3660 tgctccctca ctttatatat ggcgacttca tttattccaa tgcgtcaatt agctctctca 3720 atagactacg agtcgccttt aatgcgtgcg taaggtatgt ttataattta tcgagatttt 3780 cacatgtttc gcatcttcat acgtcaatga ttggttgtcc atttgtaaat ttttataaat 3840 tgaggtcgtg tgtgacgctt ttcaaaataa ttaaaactaa agctccaaac ttcctatttt 3900 ctaaactatc acctcttaga ggcatacgta caagaagttt ttatgtacca cgacatagtt 3960 ccttctatta cggccagtca ttctttgtaa gaggcatcgt gaactataac aatcttccga 4020 ataatattaa atcttgtgaa tcgaccactc gttttcgaag tgagataata tcatattttc 4080 agtagatgtg tagtgatttt tgtaaagaga acattagtag ttaagttacg taagaattat 4140 tgtaaaacct attacttttt cacattgtaa caatttaaaa ggcaatgcct tacgttacga 4200 agagtcaata aataaataaa taaataaata aataaataaa taaataaata 4250 // ID Copia-16_SI-LTR repbase; DNA; INV; 341 BP. XX AC AEAQ01021270; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_SI_; KW Copia-16_SI-I; Copia-16_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-341 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01021270; Positions 3537 3197. XX SQ Sequence 341 BP; 94 A; 57 C; 72 G; 118 T; 0 other; tgttgaatat aacaatcaca caggagtatg gcgctagggt cgcggtctag ttactattga 60 tgtgatagcg cttgagtcac gatatggtta taaatcagag ttggcattat tgtctatggt 120 ctagattctt taacattttg tgtgtaccga tactcccttt ttcaaaaaca aggcttgctt 180 atcttcctct ctgtatctga tgtctcatgt ataagtcgtt acgtaataaa ggattctgtg 240 tgatattaaa gtacgtacaa atacgcgttg attggttaac ccgttcagtt ttcagtatat 300 cacggtcaag tgaagaatac aagagggctt agttttcaac a 341 // ID Copia-106_AA-LTR repbase; DNA; INV; 147 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-106_AA_; KW Ty1_copia_Ele116; Copia-106_AA-I; Copia-106_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-147 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 147 BP; 49 A; 32 C; 23 G; 43 T; 0 other; tgttagatag caaccgacca ctttgcatag caacttagga caacaagctg tagtatgcca 60 tgattagtat gtaataaaaa ttttcatcat tcctactgtt accactaacc aaaccagacg 120 tttagtctag aagctctgct gatatca 147 // ID RTEX-1_BF repbase; DNA; INV; 4611 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-1_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; CR1-9_BF; KW RTEX-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4611 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4611 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1718-1718 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The reconstructed RTEX-1_BF CC consensus sequence contains a C-terminal part of ORF1 and the CC complete ORF2. The RTEX-1_BF ORF1 protein contains the esterase CC domain. The 3' terminus is composed of the (CATTGT)n CC microsatellite. XX FH Key Location/Qualifiers FT CDS 3..701 FT /product="RTEX-1_BF_1p" FT /note="esterase." FT /translation="VIGDSNTRSIMPSILYPDKQVHKQPGMTIPEAIDIIE FT TTPFSNPKCIVYHVGTNDVKQERAASGVTENMRRLVATTHDKFPNAAIVLS FT SIPPRNDRHLMEIAKQVNAFIHILGQECGYISVADNDNMEENGSIKRVLFK FT DDGYHLNRGGVKVLAANLKEAIHPTVGLGDKLQRTSEARTRACGGTIYQGY FT NMTGLWNDGVQHTNPADGDVTLSPHSNPTVATTICTWTGFRQT" FT CDS 957..4487 FT /product="RTEX-1_BF_2p" FT /note="AP endonuclease and RT domains." FT /translation="MTMSRLQMSPHSISIGFWNVHGLGKKLEVNDFRNSLE FT KLDFFSLLETWSNDTSEVDLQHYQHFSSLRNKQSKARRNSGGIIFYFKNKL FT AKFIRRESSTSESKDILWIRVAKELFGLSRDVFIGNVYISPVNSSIHKRSE FT ESIFDVLEQDVARFTSRGFVILGGDFNSRLGSLKDFADETPIDSDVEHIPI FT LNIPRNYSDKNPPNTFGKDLTDLCYQSNLNILNGRVLGDLSGRFTCYQPNG FT CSTVDYAIASSTLFHNINIFRVHPLTEFSDHCMISLNIKTLIQESQNETPK FT INMRPLPDRFVWGPDSSEKFINALRSQALSHKLENFNLSTHLNSILNADID FT KEVHKFQEILEIAGKMSLQIKHNSKQKKLRTNKKWFNKNCFEARKELRNLQ FT YLLQRNPKNPFIRGSYFRKKKEYNKLIRKRKKDYETDCLMQLQNLQENNPT FT AFWNLLKQFKNANSTPKALEEIPPDSWVTHFKALHEKRRLERQNFDTEFEM FT KIKDNLKALEELNTEQGPLDCQITEAEISSAITALKNKKASGEDMIINEML FT KAGHSLLLKPLVALFNRTFFSGKFPQTWSKSYIVPIFKSGRKDDTNNYRGI FT SISSCLGKLFTSVLNKRLSSFLEENKILKPNQAGFRKNYRTSDNIFVLKTL FT ISKYTQSRNQLFACFVDLSKAFDSIWHEGLLFKLISKGISGRFFDMLKSLY FT SQSAASVKTNIGLSENFPVKIGVRQGCVLSPSLFNLYISDLADELNTGDFD FT APLLNTSEVSSLFYADDLVLLSTSQNGLQNAIDKLGLYCNKWKLSVNINKT FT KIIVFNKKGHMFSNLKFHILNEDIEIVSSYCYLGVVLTTGCNFTTALKTLQ FT KKALRALFSIKTTLGETNPSISVYCKLFDACVVPILTYGAEVWGKIGLNDK FT SPIEQVHLKFCKSLLGVRKNSSNLATRAELGRYPLWIEVYTRLYSYYKRLV FT KEVPRNSLQFEAFLVQQDLHSRNIPCWFSNVSKILEESGFAYLLINSDRNT FT SFAQSEVNQRLKDNFVQNFYSEMNSPGMKENQGNKLRTYRKFKLTYEREKY FT LEITNFSYRRAMTKLRISDHPLQIEAGRYTRTPPDDRICMFCNKNVRCIEN FT EMHFVLECSLYNDMRNSLVNVKRVKEKYTNITKVDMFKYLMSSTNYDILEE FT LCKFVYSCFKRRDEACRK" XX SQ Sequence 4611 BP; 1585 A; 915 C; 870 G; 1241 T; 0 other; tggttatcgg tgattcaaac acacgaagta ttatgccaag tatattatac ccagacaagc 60 aagtccacaa gcaaccggga atgaccattc ctgaagctat agacatcata gaaaccaccc 120 ctttctccaa cccaaaatgt atcgtgtacc atgtgggtac caatgatgta aagcaagaac 180 gggctgcaag cggggtgacc gagaacatga gacgacttgt tgcgacaact cacgacaaat 240 tcccaaacgc agccatagtc ttgtcctcta taccgccaag aaatgaccgc cacctgatgg 300 aaatcgcaaa gcaagttaac gccttcattc acatcctcgg ccaagaatgc ggctacataa 360 gtgttgcgga caacgacaat atggaggaaa atggctcgat caaacgagtg ctattcaagg 420 atgatggcta tcaccttaac cgcggtggtg taaaagttct agctgctaac ttaaaagagg 480 ctatacatcc aacagttggt ctgggtgaca agctccaacg gacctcagag gcgcgaacca 540 gagcatgtgg agggaccatc taccaaggct acaacatgac agggctatgg aacgatgggg 600 tccagcacac gaatccagcg gatggcgacg tgaccctttc cccccattcc aatcctaccg 660 tggctacgac gatctgtaca tggactggtt tccgtcaaac ctgaatggac aacggaaccg 720 gttccagcct gagcgctctc gtgactatcc gtggtacagg gactatgact ttgatgaata 780 ctaactcaag gtttagtatc tacttatata agacatcgta tagtacttgt taacaagtta 840 tagtagattg caaacgtagt tgtttaagta agctcttcag tgatgaagta aaagttttag 900 taaaggtagg gaataagtca gcactctgat agttttgtcg atactcttat taagccatga 960 caatgtcaag gttacaaatg tctcctcact ctatatccat aggtttttgg aatgttcatg 1020 gtttaggaaa aaaacttgaa gtcaacgatt ttagaaacag tttagaaaaa ttagatttct 1080 tctccctcct tgaaacatgg agtaatgata cgtcagaagt tgatctacaa cattatcagc 1140 acttttcatc tctaaggaac aaacaaagta aagccagacg gaactcaggt ggaattatct 1200 tttattttaa aaacaaacta gcaaaattta taagacgaga gagcagcact tcagaaagca 1260 aagatattct ctggatccgt gtagccaagg aattgtttgg cctatccaga gacgtattca 1320 taggtaatgt atatatcagt cctgtaaact cttcgatcca caaaagatct gaagaatcta 1380 tctttgatgt actagaacag gacgtggcac gttttacgtc cagaggtttt gtgattctcg 1440 gaggagactt taatagtcga ttaggttcct tgaaagactt cgcggacgaa actccaatag 1500 attctgacgt tgagcatatt ccaattctca acatcccaag aaattattct gataaaaacc 1560 cgccaaacac atttggcaaa gatttgactg atttatgtta ccaatccaac ctcaacattc 1620 taaatgggcg agtcctggga gatctcagcg gccgattcac ctgctaccag cccaacggct 1680 gtagcactgt ggattatgcc atagctagca gtaccctctt ccacaatatc aatatattta 1740 gagtacatcc attgaccgaa ttttctgatc attgtatgat ctcgctaaac attaagacac 1800 ttatacaaga atctcaaaac gaaacaccca aaataaatat gcgacctttg ccagacagat 1860 ttgtctgggg accggactcc agcgaaaagt ttataaatgc tttacgcagc caagcattat 1920 cgcacaagct tgaaaatttt aatttgtcaa cgcatctaaa ctcaatctta aatgcagaca 1980 tcgacaaaga agttcacaaa tttcaggaaa tcttagaaat agcggggaaa atgtcactac 2040 aaatcaaaca taacagtaaa caaaaaaaat tacgaacaaa taaaaagtgg ttcaacaaaa 2100 attgcttcga ggcccggaaa gagctccgta atctgcaata tttattgcaa cgaaacccaa 2160 agaacccttt tattcgaggt agctatttta ggaagaaaaa ggaatacaac aaactaattc 2220 gaaagcgcaa aaaagactat gaaactgact gtttaatgca gctccaaaat cttcaggaaa 2280 ataatccaac cgccttttgg aatctactca aacaatttaa aaacgccaat tctactccaa 2340 aagctttaga agaaattccg cctgattcat gggtaactca cttcaaagca ttacatgaaa 2400 aacgtaggct tgagaggcaa aattttgaca ctgaatttga aatgaaaatt aaagacaatc 2460 taaaggcact ggaggaactc aacacagagc aagggcctct tgattgccaa atcacggagg 2520 ctgaaataag tagtgcaata acggctctca aaaacaaaaa ggcgagcggc gaagacatga 2580 taattaacga aatgctgaaa gcgggacaca gcttactttt aaagcctctt gttgcattat 2640 tcaatcgcac cttttttagt ggaaaattcc cccaaacctg gtcaaaaagc tacatagtgc 2700 ccatattcaa atcagggaga aaagacgaca ctaacaacta ccgcgggatt tccatttcta 2760 gctgtctcgg gaaactgttt acctctgtac taaacaaaag gctctcctct tttctcgagg 2820 aaaacaaaat tcttaaacct aatcaagcag ggttccggaa gaattacaga acgtcagata 2880 acatatttgt tctgaaaacg ttgatttcca aatatacaca gtcaagaaat caactatttg 2940 cttgctttgt ggatctatcc aaggcatttg attcaatttg gcacgagggc ctattattca 3000 aactcatatc aaaaggcatt tccggtagat tttttgatat gctaaaaagt ttatatagtc 3060 aatccgcagc gagtgtgaaa acaaatatag gtttaagtga aaatttccct gtcaaaatag 3120 gagtgcggca aggttgtgtt ctaagtccct cactatttaa cctatatatc agtgacttag 3180 cagacgaact gaatacagga gattttgatg cgcccttgct aaatacatct gaagtatcaa 3240 gtttatttta tgccgacgac cttgtcctac tatcaactag ccaaaatggt cttcaaaatg 3300 caattgataa actaggtctc tattgcaaca aatggaaatt atcagttaac ataaataaga 3360 caaaaataat cgtatttaac aaaaagggtc atatgttctc aaatctcaaa tttcatatac 3420 tcaacgaaga catagagata gtttcttcat attgctattt aggagtggtg cttacaacag 3480 gttgcaactt cacaacagca ttaaaaacac tacaaaagaa ggcgctaaga gctctgttca 3540 gtattaaaac aacactagga gaaaccaatc cttcaatttc agtatattgt aaattatttg 3600 atgcctgcgt tgtcccaata ttgacttatg gtgcagaggt atggggcaaa atcggattaa 3660 atgataagtc acccattgaa caagttcatc tgaagttttg caagtcgtta ttaggcgtaa 3720 gaaaaaactc aagtaatctt gcaacaaggg cagaactagg caggtatcct ctgtggatcg 3780 aagtatatac tagattgtat tcatactaca aaagattagt taaagaagtg ccaagaaata 3840 gtctacaatt tgaagctttt cttgtacagc aggacttaca cagtcgcaac ataccatgct 3900 ggttttccaa tgttagtaaa atcctggagg agtctggttt tgcctatctt ttgattaaca 3960 gtgatcgcaa cacatctttc gctcagtccg aagtaaacca aagactgaaa gacaactttg 4020 ttcaaaattt ttactccgaa atgaattctc ctggcatgaa ggaaaaccaa ggtaataagt 4080 taagaaccta cagaaaattc aagcttactt atgaaagaga aaagtaccta gagataacaa 4140 attttagtta tcgccgagcc atgactaaat taaggatcag cgatcaccca ctgcaaattg 4200 aagctggcag atacacccga actcctcctg acgatagaat atgcatgttt tgtaataaga 4260 atgtcagatg tatagaaaat gaaatgcact ttgtactaga gtgttctctg tacaatgata 4320 tgcgtaatag ccttgttaat gttaaacgtg ttaaagagaa atatacaaac ataacaaagg 4380 tagacatgtt taaatatctg atgtcatcaa ctaactatga catattggaa gaactgtgta 4440 agttcgttta ctcttgtttt aagaggaggg acgaagcttg tagaaaatag ttattattag 4500 gatatatcat tgtttatttg ttgtcattaa ctttatatca aacttaatgc atttagcccc 4560 attggggcat gaacatgcaa taaacatcat tgtcattgtc attgtcattg t 4611 // ID CR1-23_CQ repbase; DNA; INV; 2144 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-23_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2144 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 27-27 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 2..1912 FT /product="CR1-23_CQ_1p" FT /note="reverse transcriptase." FT /translation="ALKKLKSSKRAALKKFSKCPISVNRLNLNQISNDYRR FT LNKSLFLAHQSRVQNRLKQNPKGFWKFVNEQRKESGLPSTMIRDGVEVSTS FT SEICDAFRVQFSSVFADNNLSDPDVAAAADSVPDSGLVCSDLQVSREDFNA FT ACSKLKSSTSPGPDGVPAIVIKKCANSLSEPLLRIFNLSLSLGVFPNNWKE FT SYVFPVFKKGCKQDVKNYRGIAALCAVSKLFEVIVLEFLRHNLSHVVSDDQ FT HGFFPKRSTNTNLVTYISKIQRAISDGYQVDAIYTDLSAAFDKINHRIAIA FT KLRRVGLHGSLLTWVNSYLTGRTMLVKISDNLSLPFSVTSGVPQGSHLGPF FT LFLLYINDVNFLLRCPKLCYADDFKLYAIIKRPTDMEDLQAQVDLFADWCR FT LNQMVLNPSKCSTITFTRKRCPTFFSYKILGNPLTRDSFVKDLGVLLDSKL FT SFKNHISYLCSKASKQLGMIFRMTKYFRDVNCLKILYLSLVRSTLEYGSVV FT WAPYYHNDIGRIEAIQRKFTRFALRHLGECPNYETRCSMLHLDPLSVRREA FT AKAFFVADLFRSNINCPFLTSELNVNIRRRVLRSHSFLVIPGARTNYAFNE FT PVSSMCRVFERCYPFFDFHLSRPRLKHDIXNFLREAYVF" XX SQ Sequence 2144 BP; 532 A; 485 C; 419 G; 704 T; 4 other; agccctcaaa aagctgaaat cctccaagag agctgccttg aaaaagtttt ccaagtgccc 60 tattagtgtc aatcgcttaa atttgaatca gatcagcaac gactacaggc ggttgaataa 120 gtcactgttc ctcgctcacc aatctcgtgt tcagaaccgc ctcaaacaaa atcctaaagg 180 cttctggaag ttcgtaaatg agcaacgaaa ggagtctggt cttccgtcaa ctatgatccg 240 tgacggggtg gaagtctcaa cctcatctga gatttgtgat gccttccgtg tccaattttc 300 gagcgttttt gctgacaaca acctaagcga ccctgatgtt gctgcggcag ctgactcggt 360 tcctgattcc ggtcttgtct gctccgacct tcaagtctct cgggaggact ttaatgcagc 420 gtgttcgaaa ttgaaaagtt ccacgtcccc cggacctgac ggagttcccg caatcgttat 480 caaaaagtgt gcaaatagcc tctcggagcc actccttcga atctttaatt tatcactttc 540 tcttggagtt ttccccaata actggaagga gtcatatgtg ttcccagtgt ttaaaaaggg 600 ttgcaagcaa gacgttaaaa actatcgcgg tattgccgcc ctctgtgcag tctccaagtt 660 gtttgaagtg attgttttgg agtttcttcg ccacaactta tcgcacgtcg tctccgatga 720 ccaacacggg tttttcccta aacgttccac caatacgaat cttgtcactt acatttccaa 780 gatccaacgt gcgattagcg atggctatca agtcgacgcc atctatactg acctttctgc 840 ggcatttgac aaaattaacc accgtatagc aattgcgaaa cttcgccgtg ttggcctaca 900 cggttctttg cttacgtggg ttaactccta cctgaccgga agaactatgt tggttaaaat 960 ctccgataac ttgtctttac cattttctgt tacctcgggt gttccacagg gaagccattt 1020 agggccgttt cttttcctgt tgtatatcaa tgatgtcaat ttcctgcttc gctgtcctaa 1080 actatgctac gctgacgatt ttaaattgta tgctataatt aagaggccta ctgatatgga 1140 agatttgcaa gctcaggttg atttatttgc tgactggtgc cgcttgaatc agatggttct 1200 caatccttct aaatgttcga caattacatt cacccgtaaa cgctgcccaa cgtttttcag 1260 ttacaaaatt cttggtaacc cacttactcg tgactctttt gtcaaggatt taggcgtgtt 1320 gttggattcg aagctttctt ttaaaaatca catttcctat ttgtgctcga aggcttctaa 1380 acagctagga atgattttta gaatgaccaa atattttcgc gacgtcaatt gcctaaaaat 1440 tctctatctt tccttagttc gctccaccct agagtacggt tcagttgtgt gggctccata 1500 ttatcataat gacattggac gaatcgaagc gattcaacgt aaatttacaa gatttgctct 1560 aagacatctt ggtgagtgtc caaattacga aactcgttgc tctatgctcc accttgatcc 1620 tttgagtgtt cgtagagaag ctgctaaagc cttttttgtt gctgaccttt tccgatcaaa 1680 catcaactgc ccttttctta cgagtgaact gaacgtcaac attagacgcc gggtgcttcg 1740 ctcacactct tttctcgtta tacctggagc ccgtacaaat tatgcsttca acgaacctgt 1800 ttctagtatg tgcagggttt ttgaacgttg ttaccccttt ttcgatttcc atttgtctcg 1860 tccccgtctg aagcacgata ttttkaattt tctcagggaa gcwtatgtgt tttgacaaag 1920 gtttagaccg gtttattttt agtcagtttt atttatttat ttatttattt tttacattgt 1980 gctagtttag tttgwtagct taagtttgat tgtatatatc ctctatgtca tttggattgt 2040 aaaatctgtt gataataaaa aagaggtttt gtgcctattt gagaaggacc catcggtagg 2100 cgttccactc aaacgggctt ttccctcata taaaacataa aaca 2144 // ID Copia-38_DPu-I repbase; DNA; INV; 4751 BP. XX AC ACJG01004237; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-38_DPu_; KW Copia-38_DPu-LTR; Copia-38_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4751 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004237; Positions 11795 16545. XX CC Positions [1895-2449] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 689..2581 FT /product="Copia-38_DPu-I_1p" FT /translation="MSKQFYRLDILNPCHDAMSHISAIETIAEQLRNMGEP FT QTRLQICTKIIYTLPEHLRGFISVWESLSEEEQTIPLLTAKILNEESKNAM FT FKNQDLDLGYNATSNRGRRGGQGNGRGGFSGGDRSYDGPPAAKKPRYNGSL FT CEYCQPRGFIATHPFAECRKHAHAVKRESANAAVSRTAKPDSVNAPIQPII FT DFGLPAFPLTASAYLDQSVFYADSGATAHCTNMKHLLRNIVTVPKGSWIIQ FT GLRGAQAEVEAYGDIIFEATVNGQKRVGIFKRVLYAPDTGINLISIGQVTA FT LGTNVNFSGENCEFVKKNEVEATGLRIGNTLYQLNIKAIIESPKKEHAFLG FT KQSAASLMTWHLRLSHVNCRTIQRMEETNAVEGMKITSRKLPAICEGCIYG FT KMCRRSFQCSTEPKTWEVGELIVSYIGGPMQEKSLSGALYYVAMKDKASGY FT RHAFFIKEKSEVVGKFKIYIPSFQNETGKIIKCIRTDNGTEYRGQNWIWVD FT ELGIKRQYTIAYTPQQNGVSERDNRTIVEAARSALYGRKHPVTSHVLLRLW FT AEAINYAVYTLNRTLSRTRNVTAIERYSGTKPNVSHMRQFGISCFVHVADK FT KRQKLDPKAEKGMFLGYDMTSTGFRVLILSTSL" XX SQ Sequence 4751 BP; 1446 A; 1174 C; 977 G; 1154 T; 0 other; gctggaaata tctaataatt tcaacaggtt atgaggccca gttatcgcta atatattata 60 ctcaactgaa atgtctttaa gcctggccca cgtggtcaaa tttgacggtt ccaatctacc 120 actgtggaga ttgggcctag atgttgcatt agaaaaacac gaagtacagt cagttactga 180 tggaatatgt gcttgcccac ccgaggtaaa tctttttact cttaaggtat ctagttggac 240 tcagtctcta cttaccaaga gttttatctt tcttgctcat cgatactttt tcaaactcac 300 tcctgatgag cacctgaata gaaagaacga accaactaaa cactctctgt tctattacag 360 atgagacaag taatcgagcc tgttccacct gctgaagggg gccaaattcc acttaatcca 420 gttgcacaaa cactgggacc catcatgaat gctgaggcaa tcaaagaatg gaaagtcaag 480 gactgcactg cacgacgcat tctactagct actatcgaac tcaaactgca aaacacgttg 540 gttggatgta agacagcttt ccagatttgg actcgactga attctcaaca caacaaatgt 600 gcagccaaca acaaatatgc tgtacaacgg aatttcctga actatgacta tcaaaaaggt 660 atgttctccg ttacaatgag ttacaataat gagtaaacaa ttttaccgac tagatatttt 720 aaatccatgt catgatgcca tgtcccacat ttccgctatt gaaaccatag ccgaacaact 780 tagaaacatg ggggagcctc aaactcgact ccagatttgc accaaaatca tatacacgtt 840 gccagaacac cttcgcggtt tcatctcagt atgggaatcc ctttcggaag aagaacaaac 900 gatcccgttg cttacagcca aaattcttaa cgaagaaagc aagaacgcta tgttcaaaaa 960 tcaagacctg gaccttgggt acaacgctac atctaacaga ggtcgtcgtg gcggacaagg 1020 caatggcaga ggaggtttct ctggtggaga cagaagctac gatggtccac cagctgctaa 1080 gaaacctaga tacaacgggt cactctgtga atattgtcaa cctagaggtt tcatcgccac 1140 ccatccattc gctgaatgca ggaagcacgc tcacgcagta aagagagagt cagcaaacgc 1200 agctgtctca agaacagcca agccagattc ggtcaacgca cccatccaac caatcattga 1260 tttcggtcta ccagcgttcc ctctcacggc atcagcctat cttgatcaat cagtcttcta 1320 cgcagattct ggagcaacag ctcactgtac aaacatgaaa catcttctac gcaacattgt 1380 cactgtaccc aaaggaagct ggataatcca agggctgcgt ggagctcaag ccgaggttga 1440 agcgtatgga gacatcatct tcgaggcgac ggtcaacggt cagaagcgcg tcggaatttt 1500 caagcgagta ctttatgctc cagacactgg tataaactta atatctattg gtcaagtcac 1560 tgcccttggc acaaacgtga atttctctgg tgagaactgt gagtttgtaa agaagaatga 1620 agttgaagca actggccttc gaattggaaa caccctgtac cagcttaaca tcaaagccat 1680 aattgaatct cccaaaaaag aacacgcttt cttaggaaaa caatcagcag catcactgat 1740 gacttggcac cttcgtcttt ctcacgtgaa ctgcagaacc atccagcgaa tggaagaaac 1800 caatgccgtt gaagggatga aaatcacaag ccgcaagtta ccagctatct gtgaaggatg 1860 catctacggt aaaatgtgtc gtcgctcctt ccaatgctcg acagaaccga agacgtggga 1920 ggtcggagaa cttattgttt catacatagg tggccctatg caagaaaaat ccctgagtgg 1980 ggcactgtac tacgttgcga tgaaagacaa agccagtggt taccgacatg ccttcttcat 2040 caaagaaaaa tccgaagtag ttggtaaatt taaaatttat atcccatctt ttcaaaatga 2100 aaccggaaaa ataatcaaat gtatccgcac agataatggc accgagtaca gaggacaaaa 2160 ctggatctgg gtagatgagt tgggcatcaa acgccaatat accatcgcat acaccccaca 2220 gcaaaacgga gtctccgagc gagacaacag gacaattgtt gaagccgcac gaagtgctct 2280 atatggcagg aagcatccag tcacctctca tgtattgcta cgactctggg cagaagccat 2340 aaattatgcc gtatacactc ttaaccgcac tctttctcgg actaggaacg tgacagcaat 2400 tgaaagatac tcaggtacta aacctaacgt ctcacacatg cgccaattcg gtatttcatg 2460 ttttgttcat gtcgcagata agaaacgtca aaaattggac ccaaaagcag aaaaaggaat 2520 gttcctcggt tacgacatga cctcaaccgg tttccgtgtt ctgatcttgt ctacgtcgtt 2580 ataagcgacg aagtcaaatt tgaagaagat gaagaagaag ggctgatcaa tcctgaacct 2640 tcaaccttgt cgtcgttacc agattttccc ttcgcgacgc agtcaacatt cactccagat 2700 ggtgatttag gcgaccgccc aatagctgag tctccactac ccatcacaaa ccctccagct 2760 gtcgaacctc aggtcacaga gcaaccaatt cccccatcac ttggcgagca ctcagtcgac 2820 cccttgccca ccactgaatt acaaaccaaa gctagagcac agcgcgaaat caacggagtt 2880 gacattcaca cgttgaatca acacgatttg aaaaacaaca ctcaaccgga taacgacgat 2940 ctcgacaaca tcgcgacgcc catcgaacga cgatacccac tcagaaaccg gaaggccgaa 3000 aataatccag tcgatgactt caatggattt tggcgagcag tcttccgagc caggctcttt 3060 ccgagaagca atggagtctc gcgaagcttt cctatggcaa ccctctatcc aagaagaata 3120 tgactcccat atccgtaacg gtacgtggca gctggttcct ttgccacctg gtcgtacgtc 3180 tattggcact cgctgggtat tcacggtgaa acccgggtac ctagaaactc ccaaacgcta 3240 caagtcgcgt ttcgttgcaa aaggctattc tcaagttaaa ggtattgatt ttaatgagta 3300 cgctttatat gcaccagtag taggatacac ctcgctccgt atcatcctgt ccctatgtgc 3360 aatattcgac ctcgaaatgg cacaactaga cataaaaaca gcctttttga acggtctagt 3420 ggaggaagag atctacattg accaaccaga gggcttcatc acaccggaaa gtgagcacct 3480 cgttggaaga ttggtgaagt gcatttacgg acttaaacaa gccccacatg tttggaacga 3540 gaagttcaac gatttcctga tcctgtttgg ctttacccgt agcaagcatg acccatgtgt 3600 ttaatttaga cgaagagaaa cagaaatcct cataatggct atctgggtcg acaacggcct 3660 catttgcagt aataataaga cagcaattgc cgaaataatt tcttttttgg caacacactt 3720 tgagatgcga tcgcttccta cagatcgctt cgtaggtcta gaacttacca gaaacagaaa 3780 ggaaaaaaag ctttgagtta accaacaagc gtttattgaa aaagttatct ttcgatttaa 3840 catgtctcaa tgtaacctaa aactcactcc tgcagatcca aatgtacgcc tatccaaaga 3900 gatggaaccg aagacaaaat cccaaaaaca agaagccaaa agactgccct atcaacaagc 3960 agtcggatgt ctcaactacc tggctcaaac gacacgccct gacatcggct tcgcagtcaa 4020 ccagctgtcc cgttacagca acctttttgg agaagaacac tggaaagcag ccaaacacgt 4080 tattgcttat ctgaaaggca cggtcagtta tggtctctgc tacgggggag atggcgcgaa 4140 tgtcagtccg tcactcgtcg gctactcgga ctcggattac gctggtgatc tcgataattt 4200 ccgctcaacg accggctaca tcctattttt caaccttggg ccagtgttct ggagaagtcg 4260 acttcaaccg tcaaccgctg ggtccacgat gcaggcagag taccaggctt tgagtgattt 4320 cgctaaagaa acagtcttcg ttcgttggct tcttcaggaa ctgaatttct ttggccagca 4380 gcccgccatg cttttttgtg acaacacagc agccatcaac ctagccaaca atccaagctg 4440 ccaggcgaag acgagacata ttaatgtcgc ctatcacact atccgcgatt tcatcaagga 4500 tcggtcaatc gacgtcaatc aagtgagcag caaggagaac ttggccgaca cgctcaccaa 4560 atcacttccc gctgtagctt ttgagaagat cagagccctt attggcgttc aaccactctt 4620 gctatcttca taaaaaatat tttttgtcaa aaatgctccc cctgtttttc tttactctaa 4680 cccaacgagt aactctatca aattaagaag tatcggaatc ctttttcttt caaaattctt 4740 ttgaggggga g 4751 // ID BEL5_Cis_I repbase; DNA; INV; 6411 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of BEL LTR Retrotransposon from Ciona savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; internal portion; KW BEL5_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6411 RA Smit A.F.; RT "BEL5_Cis_I - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000229, Ci000235, Ci000232, Ci000233, Ci000121 Product of ORF CC (bp 103 - 6066) closest to those of Bel_Cis4 and Catch3_DR of CC Danio rerio (~35% identity and ~55% similarity). XX SQ Sequence 6411 BP; 1726 A; 1351 C; 1639 G; 1683 T; 12 other; tacattgatt tctggttttt cgtcaagtta tttctagatt gtacgtttga ggtacaattg 60 atttctggag ctcggtttac tggtttatct ctgcatctca agatggcaac tgctgagaac 120 gcggcgccgg tagattcgct aattgacgtc actcctagcg cggcagaact aaaagccgcg 180 cgcacagctg ccttgagcac attaaccaga ctggagaata agctgcagaa gattatatcg 240 aaagaagctt cacataagga agtgtccaag gtaaaagatc attatgacaa tgcctgcgac 300 gattgcttac ggatttgccg tcaatacgtg tccactgtgt tgacggaagg cgctcgtact 360 gaagcagaag cgagagtgga gaacattatt caaaggaaac aaatcgcgaa tcaagattat 420 agacaatttg tgcaaaactg tgctgcgggc cggtctgtgg ggagtggcag taggtctaag 480 tccaaggcat cctctagtcg gcgaagttcc gctcgcatac aagcacaact gctggaaatg 540 gaagccatac aaatggaaaa cgaggccgag atccaacgaa agagtttgga gcacgaggcc 600 gagatccaac gaaagagttt ggagcacgag gccgagatcc aacgaaaaag cgtggagctt 660 gacctgcagg ttncccgagc aaaagcgaag gcaaagagag ctatcctttt ggcggaggat 720 ggaagtgaac tttccgattc tctcggcgct ttgtcaaacg ccgaggagag acctcnagaa 780 actgttttag acaaaataca gtcatttacc tgggatcaaa agagccgggc tgctgctgct 840 gatgctgctg ccccgagacc attagttcta caagaaccca atctgccgaa taggcactct 900 tcaccgttta ttcctaaccc aataccctat ggaacggggc cttccaacta tggcgttaat 960 cattgtgaac gcccagcagc cctaggtatt ggtacgatac atgatggaag gaacttaaaa 1020 gaaaccacgt tggaaaatcc gcgaggacaa tttgaggtta gcccgttcaa tgccgactcg 1080 gtcaccccgc gcttggaatt agacaccttt gacgggaaca ttttagaata ctatggattt 1140 atgagtgact ttcgcgaata tattgaggcc agagtgctga atgatggcgc tcgccttaag 1200 tttcttctna agctatgtgc aggggaagcc tatgacacaa tcaagggatg caagataatc 1260 tccgacagaa gtcgtggcta ccgcgaagct atacaccgtc taaatgctaa gtacggtgaa 1320 cgaaacttgg tggtgcaagc ttacatgaag caggttactg aagggccgct cttgaaaccg 1380 tcggactctg aaggtatgtt tttattagct gataaaatgt acaactgcta caatactcta 1440 tccgagtggg gctactcaca tgatttaaat agcacagaaa acctgggaag gatatacgac 1500 cggcttccta atcatataaa attggatttc tgtaaaatta gtcgattaaa ggaagccgaa 1560 gaccaacggc caagtttctt tgatttaatg gagttggtgc aggacagtgc gaaagcgtct 1620 cgctgttact tcaaccaacg atattacgct tcgacatcgg gaccctcaca atcaaggaaa 1680 gggaacagtg agtttaaacc taaacaaagt tttcctagaa aagtaatgtt tgccgccgct 1740 gagtctgtat ctgaacctat agaacacgca gggacgactn acaggagacc tttgtgctgt 1800 ttattttgca acgagacaca ctctatttgg agatgtaatc tctttaaagg tgaaaccgtt 1860 caggaaagaa caaattttgt cacggcacgg aagctgtgct tcaattgttt gcgtgctggc 1920 catttttccc ggcagtgtcg gtatcaagga tactgtgcga tatgtaagcg cagacataat 1980 tctttgttgc atatagatga aacccccagc ccctctggtg atcctccgcc aagtaatccc 2040 gagctgagag agaaaggtgt tgtatcttgt ggtgctcgtc agcgagatga aaggaagatg 2100 agaaccccta aaagtttcaa ggtggttcca gttaaggtgt ggaccatgga tcctggtaag 2160 agtgtgcaga cttatgcatt tttggatgat ggtgcggaca tatctatgtg tgcagtcagt 2220 ttggcagaaa agcttggtct accaagtaca aatgaacgaa tggatctgta cacacaaaat 2280 gcggtgtctc gtcatgcact tatttgcgat gacttgtttg ttcaaggatt tggggaaagc 2340 gagacattta acgtacacga aatgttgatt acagatgatc tgatagatgt cagtagcagc 2400 attcctacag aagaggtggc tgcgctttat ccccatttga gggacttgaa ctttcccaaa 2460 ctggagaatg gaaacgttga gcttttatta ggaaacaacg tcatcgatgc tttccgattg 2520 actgagcaac gcagtggaag gcacggggaa ccactggggc tccacaccgc acttggctgg 2580 acaatattcg gggctgatct cccagcagct ggtcggggca aagctgaggt gtctcatgct 2640 tgtgtgcaat tcatgagaca gcgtgaacct gcgggtgatt cgtgcaatga gattttacaa 2700 ttattttcca gggaattcag tgacctggca caggaacaga aatgtgagat gtctatagag 2760 gatagacaag caattaagat gatggaagaa acggcaacca aagttggcaa acatcatcag 2820 attggactac cttggaggca acctcgagtc acacttccga gcagccgatt gatggcgcta 2880 aagagactca agtcactaaa aaggcgctta acaggtaacc ctgaattatt ttccgagtat 2940 cgcctaaaga taagggaata catcagccag ggtcatgcca ccgttgtgcc tcgtnctgag 3000 ctggcaagta gtaacgcaat ctggtatata ccacaccact gcactggtcc gaaatttcgg 3060 gttgtctttg attgcgctgc ntgttttaac ggtgtgtgtc tgaatgacaa cttactgcaa 3120 gggcctgact tgactaataa cctggtcgga gtncttttac gttttcgcca ggagcgtgtt 3180 gctcttgtgg cagatataaa gggcatgttc catcaagtgt tggtagatcc taaggaccgt 3240 gatgccttgc gctttctatg gtggccagac gatgacatgg ataaggaacc agttgaccat 3300 cggatgaacg tgcacatttt tggagcaact tcgtcgccta gttgctcatc ctttgctctg 3360 atgaggactg ctcaggaaaa tgaaacgggg gctgatgaga taaccactcg gacggtaaga 3420 aggaattttt atgtagatga tttgttaaaa tcatgtagat ctgtngatga aatgctcaaa 3480 ctagtagcac aaatagttcc attgctggct agtggggggt ttcaccttac caagttctta 3540 actaatgaga aacgagtttt aagttcgatt ccagttgatg atatcgcttc atcgatgttg 3600 actatagacc tgcatggact tccaatngaa cgagcactcg gcgtatcctg gaacaccggg 3660 acggattgct ttgaaataaa ggtcaatgta acgttgagac cgccaacacg ccgaggaatt 3720 ctttcaatgg tgagccaggt atatgatcca ttgggttttg ctcaaccgtt tatactgccg 3780 gctaagaagt tacttcaagt tgcctgccat aatcgtttaa gctgggacga accggtctcc 3840 tctgaacaga gagaaacatg ggatgattgg ctacagaccc tggaaggatt acaggcgatc 3900 tctataccaa ggtgcttccg gccaccctat gaggtacacc aaatccagtt acacttattt 3960 tgtgatgcca gcaagctggg atatggggcc gtgggttact gtcgtatggt agatgaaaat 4020 cgcaaagtac attgttcatt tgtgatgggc aagtccagag ttgccccgat aagcccggtt 4080 agtgttccca ggcttgagct cacctccgct gttaccgccg taaggttagc tgctctgatt 4140 aaaacagagc tcgaatatga cattcatgat gttgtgtact ggacagattc taccactgta 4200 ctgcagtata tttcaaatac ggcctcaaga tttcaaacct ttgtggccaa caggttggag 4260 tttatacacc gttcaactca gccaagtcaa tggaggcacg ttagcacaaa cactaaccct 4320 gcggacatcg catccagggg tctgatgcct agtgaggttc acaaggcgaa aatatggttt 4380 acggggccgg aatttttgcg tcaacccgag gagaagtggc cagcacgtcc antggtgttg 4440 ccaccgttgt ccgaggatca cgcggagttc aggatgagga agggtatcgt gtcttccctt 4500 acccaagcgg caccagataa aaccaccttg caattgttat tcgaacgata ttctacgttt 4560 tctgccctga ggcgttctgt ggcgtggtta cttcggtttc gcaagtatct aatttggaag 4620 tcctgtcgac ctcgtattct ccacccctcg gaatttccgt ggacgggtta tctatcccac 4680 acggaattgg aagaggctct actctcaatt ataaaggtgg ttcaccaaga agtgtttgct 4740 gctgaaatga agcagctgcc caatcatgaa tcgttctcaa cgataacgct acattcggta 4800 cggcagttaa ggaagtcacc agtattggct tcgctgcaga aattgagccc gtttgttgtg 4860 aatggggtgt tatatgttgg cgggagactc cacaaatcca gcctgagtgc gcaagaaaag 4920 catcagataa tcctcccctc acgacacatt attactgatc taattatttc ttattatcat 4980 tgtagacagg gacacagtgg tactcttcat gttctctctg ctattcgaga gaggttttgg 5040 attttgcgag ggcaagcatc agttcgtcga gtactgcgtg actgccgaaa gtgcaagttc 5100 tggaacgcac gggttgggga acaaatgatg gcatccttgc ctgtagctcg ggtgagtgct 5160 ggtagcccac cattcacagc cgttggggtt gattatatgg gacctatcct gacaaaactt 5220 ggtaggagtc aggtgaaacg atatggctgc attttcacgt gcatggcgac cagagctatt 5280 catatcgaag tcgctgaatc attggagacg agcgccttca taggagcgta ccgacgtttt 5340 acctgcagaa gaattggaag accgaagcaa atatttagcg acaacggtac caatttagtg 5400 ggagcagagc gggagctacg cgaaggcatt cgaaattgga atataaagca gatgcacgat 5460 gcatttcttc aagatgagat atcatggaat ttcaatccgc ctgcagcaag ccatcaaggg 5520 ggtgtctggg aacggatgat tcgctcagtc aggaaggtaa tgcgttcaat caacggagaa 5580 aaggtgctcg atgattttgg tctgttaaca ctgatggcgg aaattgaacg gattttaaac 5640 gatagaccta ttacacctat cagcgatgac caccgggatg agctggctct cacacccagc 5700 atgttgctta aaggagcagt tgacgactcg cttccaccag acgagttttt aaagtccgat 5760 ggatacaagc gatcatggcg cgcggttcag ctttgtgctg ataagttttg ggagagatgg 5820 actaaggagt atttgccact cctgcaactc cgacagaagt ggctgcgacc caccagaaac 5880 cttgctgtgg gagacgttgt gttggtcgtg gacgagctga agaagcgcgg cgtatggccg 5940 aagggcgtaa ttgaacattg tacatatgat aaaactggcc ttgtcaggcg cgtacgtgta 6000 cgaacgatga tgtccactat ggagcgagat gtacgcaagc tgtgtctgct ggaagcacat 6060 ccttgaactg ttacccagtg tgctcgtggc cgcaaggctg taaattgtat gcttcgttta 6120 accactagtt ctagccacgg atttggctac tgtgttttat tttttgtata ctttccacct 6180 aagttgtaac tgngacgatg ttttgttatt actgcattcc tgtgtattta gaagtaggca 6240 tgttgttgcg cccattgttt atatacctct acggtagttt gtaattgcct ttattttcga 6300 acttaagtat gacgtagtgc agaagtgttt tttgggtgat tgaaatgccc gttgntgtgt 6360 gttgtaaaat tatttcattt actgttttca ataatttggg gccggtatgt t 6411 // ID Gypsy-18_CQ-I repbase; DNA; INV; 4464 BP. XX AC AAWU01022777; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_CQ_; KW Gypsy-18_CQ-LTR; Gypsy-18_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4464 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 415-415 (2011). XX DR Genome; AAWU01022777; Positions 31891 36354. XX CC Positions [3520-3969] - Integrase core CC 'GTGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 269..3562 FT /product="Gypsy-18_CQ-I_2p" FT /translation="MPDDKTNVTATAPVVEKRTKGRFPVYLAGLNVNSYLQ FT TVEIYFKLNQTPNDEKALEFITSVGQETANRIIGSFKPEKIVNKSYEQIIE FT KFQKLHEENKNVFAERYRLISRKQAVGESLDDFAIDLQDIVEHCGVKVETE FT AVLVQSIFVAGLKNDNTREIMLREGSEELDLAKLLEKAKSIETATQESRKM FT AQHEIVEVNYIDRRGASAGFSRNVVKRSSMGEATHAANRNADLYNHRGSFR FT PSSSTVCYNCYNKGHLSYECTFPKSKKPRNSAPTPPKGAGYKLAYEERINQ FT LTAAMEELKSSLQDDSDLSDKQSADGSEDDSPNLVNNVNSLLLGKLNTTPA FT FVDLTINGKNITLECDTGACATRCSLETYEKNFNKIKLIPIHKHFYVVSGD FT KVSVVGTLPVQVKLREKLLHLTIVVIKSLREFTPLLGRDWLNVIWPDWRES FT FALNSIHTDARKRWIERTVQNLKKDFPKAFDEDLTEPIKDVVVDIKIDENA FT KPFIHKPYTVAFKHREKVSKYLDDLEAKNIITKIEYAEWASPIVVVVKPNK FT TDIIICMDGSKTVNPHITTHHYPLPLIEELITNKSGAKKFVLIDLRGAYQQ FT LVVSDQSKKLLVINTHKGLYAYKRLPFGVKPAATLFQSALDKILFGIPNVQ FT AYIDDILVWAGSDEELLAKVRVVLEKLASHNVKINTDKCQWFVSHVKYLGH FT ILSEAGVSPNPEKVKAIEAVPAPQTKTQLKAFLGMITFYTKFVSKLNLILS FT PLFNLLTKDAVWEWNEDCKLAFENSKKAICSAQILTHFDPSKLITVTCDAS FT DDGIAGVLSHNVEGKERPVFFVSRRLSKAERKYPILHREALAIVFAMEKFY FT KYVIGQQVIIITDHKPLLGVFQSKKNSPMVIANRLQRYISRLSIFEFTIIH FT KPGRENHVADCLSRLPLNEKPSIADIEESKRSAPNFLNFLVEDQAFNLNAK FT LIAEQSKLDPVLVSIIRFIENGWPTYPRQKQFKNYFSKRHDLSIEANCLIF FT RNRVVIPTSLKEFALQLLHANHRGIHKMKLISRQFLYWEGIGTDIENYASS FT CKTCQLNGIDRTPKIYGNWPKATTPFERVHIDFFQNLTAHF" XX SQ Sequence 4464 BP; 1608 A; 813 C; 897 G; 1146 T; 0 other; aattggcgac gagggtgaaa atttcgcgat tggatccctt agttactttg tgtgactaag 60 gaagaagtat taagtacttg cagaagttgt gtacgacgtg cacgtggtgt acagcgtggt 120 tattattaaa gtaggataat tcgtattgaa aaccagtgac gggtcgtgca agtacccaac 180 agttaaagtt gagggattag tgcaaaaaat cgctagtgtg gggtcgtgaa agtacctaaa 240 aaagaaaaaa aaaaacaaaa aagtgatcat gccggatgat aaaaccaacg tgacggcgac 300 agcaccggtg gttgaaaaaa gaacaaaagg aagatttcct gtgtatttgg cgggtctgaa 360 cgttaactca tacctgcaaa cggtagagat ttacttcaag cttaatcaaa caccaaatga 420 tgaaaaagct ttggagttca tcacgagtgt tggccaagaa acagcgaaca gaataatagg 480 gagttttaag ccagagaaga ttgtgaacaa atcttacgaa caaattattg aaaaatttca 540 aaaactacat gaagaaaata aaaatgtgtt cgcggaacga tatcgcctca tatctcgcaa 600 acaagcggtg ggtgaatctt tagatgactt cgcgatagat ctgcaagata tagttgaaca 660 ttgtggggtg aaagtcgaga ccgaagcggt gttagtgcaa tcgatttttg tggccggttt 720 aaaaaacgat aacacgcgtg aaatcatgct gcgagaagga agcgaagagc ttgatctggc 780 aaaattgctg gaaaaagcca aatcgatcga aactgctacg caagaatcgc gtaaaatggc 840 acaacatgag atagtagaag tgaactacat cgatcgacgt ggtgcaagtg cgggattttc 900 gcgcaatgtg gtgaagagaa gcagtatggg tgaagcaaca catgctgcta acagaaatgc 960 agatttatac aaccatcggg gaagctttcg gccaagtagt tcaacagtgt gctacaactg 1020 ttacaacaaa ggtcatcttt cctacgagtg tacgttcccg aaatcgaaga aacccagaaa 1080 cagcgcacca acaccgccaa aaggagcagg atacaaacta gcttacgaag agcggatcaa 1140 ccagcttaca gcggccatgg aggagctcaa atcgtcgctg caagatgatt cggatctgtc 1200 cgataagcag tcagcagacg gatcggagga cgattcgccc aatctcgtaa ataatgtaaa 1260 tagcttgtta ttgggtaagt taaacacaac accagctttt gtggatttga caataaatgg 1320 taaaaatata acattggaat gtgatacagg tgcctgtgca actagatgct ctctcgaaac 1380 ctatgaaaaa aatttcaaca aaattaagct cattccgata cacaaacatt tttacgtggt 1440 gtctggagac aaagtttcag tggttggcac acttccagta caagtaaaat tgagagaaaa 1500 attattgcac ttaacaatag ttgtgatcaa atctctccga gaatttacac ctctcttggg 1560 tagggattgg ctaaacgtaa tttggccaga ttggagagaa agctttgctc ttaattccat 1620 acacacagat gcaagaaaac gatggataga gagaacggtt caaaatttaa aaaaggattt 1680 cccgaaagct tttgatgaag acttaactga gccgatcaaa gatgtagtag tagatatcaa 1740 gatcgacgaa aacgctaagc catttataca caaaccatac acagttgcat ttaaacacag 1800 ggaaaaagtt tccaaatact tggacgatct tgaagctaaa aacataataa ccaaaattga 1860 atacgcagaa tgggcttctc ctatcgtggt ggtggtgaaa ccaaataaaa cagatataat 1920 aatttgcatg gatggatcaa aaacagtaaa tcctcacata actactcatc attatccttt 1980 acctttaatt gaagaactaa taaccaacaa aagtggggcc aagaagtttg tactaattga 2040 tttacgtgga gcctaccaac aactggttgt gtctgatcaa tcaaaaaagt tgcttgtaat 2100 taatactcat aaagggttgt acgcatataa acgattgcca tttggtgtta aaccagccgc 2160 cacactattt caatctgctt tggataaaat tctctttggt attcccaatg tacaagctta 2220 tatcgatgat atacttgtgt gggcaggttc agatgaagag ttgctcgcaa aggtcagagt 2280 tgtattagaa aagttagctt cacacaatgt caaaatcaac accgacaaat gccagtggtt 2340 cgtttcccat gtgaaatatt tgggtcacat tctttcggag gcaggagtat cgccaaatcc 2400 ggagaaggtg aaagcaattg aggccgtgcc agcgccgcaa acaaaaactc aactaaaggc 2460 atttcttgga atgataacat tttatactaa atttgtttct aaattaaatc taatactttc 2520 tcccttgttt aacctattga caaaggatgc ggtatgggaa tggaatgaag actgcaagct 2580 tgcctttgaa aatagtaaaa aagctatttg tagtgcacaa atactaactc actttgaccc 2640 ttcaaaactc atcactgtta catgcgacgc aagtgacgac ggtatagcag gggttctcag 2700 ccacaacgta gagggaaaag aaagaccagt gtttttcgta tcgcgaagat tatctaaagc 2760 ggagagaaaa tatcctattt tacaccgaga agcactggcc atcgtttttg caatggaaaa 2820 attttataaa tacgttattg gtcaacaagt cattataata acagatcaca aaccattgtt 2880 gggggtattc caaagtaaaa agaacagccc aatggtaatt gcaaaccgtt tgcaaaggta 2940 catttcaaga ctgtcaatat ttgaattcac catcattcac aaaccaggca gagaaaatca 3000 cgttgcagat tgtctatcaa gactacctct caatgaaaaa cctagtatcg cggatataga 3060 agaaagtaaa agaagtgcac caaattttct aaattttctt gttgaagatc aagcttttaa 3120 cttaaatgca aaattaatag ccgaacaatc caaactcgat ccagttctcg tctcaattat 3180 acgattcata gaaaatggtt ggccaacata cccaagacaa aaacagttta aaaactattt 3240 ttcgaaaagg cacgacctca gcatagaagc gaattgccta attttcagaa atagggttgt 3300 cataccaact tctcttaaag aatttgctct tcagttattg catgcgaacc ataggggaat 3360 tcataaaatg aaacttattt caaggcaatt tctatattgg gaaggaatag gtacagacat 3420 agagaattat gcttcttctt gcaaaacatg tcagttaaat ggtatagaca gaacgccaaa 3480 aatatatggt aattggccaa aagcaacaac tccatttgaa cgtgtacaca tagatttttt 3540 tcaaaattta accgcacatt tttgatattg atagatgcct tttcacggtg gatagaagtc 3600 aaaagaatga gtaaaactgc agcagaagaa gttgttcaag aattggataa tattttcact 3660 atttttgggt ttccatccac tattgtcagc gacaacggtc caccattcaa tagtttaaaa 3720 tttgcacagt tttgcaaggc gagaaatatt gagcatatat tttcaccacc ttatcaccca 3780 gctagtaatg ggctagctga aagggcagtg caaaccacaa aagcggtgtt ggataaaata 3840 ataggcgagg gagattctca ttcatcttta cagatcgaca ataaaatacg gacattcata 3900 catcatcacc accaaactcc gagcacaggt gacaacatca tacccaacga acgaatattt 3960 gcatttcaac caagaacaca atttgtaaac cttaagttta aacccagcac tttcgctgag 4020 aatgatgaat tgaaaacaaa aactaatttt aaggtacaag ataaagtgat ctacacacaa 4080 aaagtaaatg gtagaagatt cagctataac gctacaataa ttaaaccatt atctgaactt 4140 gtttatctca ttgaggtgga gggctcaagc cgcaaagcgc atgcaaacca acttaagatt 4200 attccacaaa atcgttttgt gctaaaaaat agtgttgatt cagaaatctc aaaaagcact 4260 acttcatcct cttgctcaga agtatcagat gaagaagtcg acacgaaaaa taataaccca 4320 gacttagata tttcaaaaaa gagtgacaaa cgaaagtctc ttcgccgctc aaaaagaaaa 4380 cgaaactata aacatactaa cctgaatttg gaaacattag ttaagaaaaa gtgaaatcaa 4440 tatgtctaat cttaaggggg gaga 4464 // ID Gypsy-156_AA-LTR repbase; DNA; INV; 1765 BP. XX AC AAGE02024556; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-156_AA_; KW Gypsy-156_AA-I; Gypsy-156_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1765 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024556; Positions 13190 11426. XX SQ Sequence 1765 BP; 466 A; 435 C; 378 G; 486 T; 0 other; tgtaaccgtt ggttccaatt gctacttgca gcttccccgc aagacatttt tccttcccat 60 taaccccact gtgcttgtca ctcactttat gcagtcaagt ggtacccacc tgtcggcata 120 aagggtgagt accagttaga tcccaacacc aatacgcttc agagaagtaa cccatttgca 180 gcggtttgag cgcgaggaaa aaataaaagc tcgccgtacg ttcatcattt gcccacagat 240 ggagtaggat ggaagggttg ccaaattgga agagcattat aggtacttcg gcaattttcg 300 tgctcggtcg taccaattag ataattaaat acctcagctt tcatctcgga gagattattc 360 tgtccgaaac actagtcgga attgcagtgc acgccggaaa ataattatta aattcgatag 420 tgagttgttt ttcaatttta gtggtgtaga tttctaatct tagttaaatc ttttcagaat 480 agcatgcata gttaagctca aaaccgcatg cagagttagt ataatagatt tcagtgcaaa 540 taatcattat aaaaggttag tggtgttcaa agaagaaaag caattgtgtg aatagacaaa 600 aaaaaaccta ttgttcctat tgtaattata gttatccgtt gtgactattg taacgtcgcg 660 tccaaaaaga aggtcaagtg ttagattgac aagaaaaagg taattttaag actacatgaa 720 acatattgta cttgcaacgt gaaaattaag tgacgtcaca cgttagtaac catggggtag 780 gtccatctcg tgggcctttt tcttttttct accctctctt ggcggagcaa gggccgttct 840 tctactcaat aggatcgagc atcggcgatt tcctcggaca agcgcaccag aaaactcctc 900 gacacgccag tgtgggacag aattgggaaa ttcgtcgaag cgtggccctg tcgacctatt 960 gttccattgt tcggccggta ttcctggtca aaccatcacc atcgaccata ttgccacgcc 1020 acgcccggtc caaccacctc gtcattgctc ctcgccaaat ccgccacgtt gactgctggc 1080 cctatcagtc ggccgacctt atcatcacca ttgtccagtt gccccatcac gggagtataa 1140 tccgccgaag ggatttaggt gggccctggg cccctcctat acctacgacc gacaaggact 1200 cgtttcgtcg gtgactgacc agccaagcca tcaaccatca gcgcaagcct cgcctctcct 1260 ctcagcaaca gaattcgcct gtcatcatca tcaaccccca gctgctacca ccatggatcg 1320 agaacaagaa tgaaccgcaa gtaccccatt gatacgttgt gacgtcactc aaaaatgcca 1380 taattggcga ataaacaggc atgtaaattg aattgctttg taggtagcac actttattta 1440 agattcggct atccattcgt cttctccacc actcttaact ttgcttcgtt tgttcggact 1500 cataaaggag tcgccaggcc tcgcctactg tgtatcgaat gtgaattctt ctcttcgagt 1560 agcttttgag cccctcctag gtacggcact gtttgtcagt ttttccactt tgcaacatcg 1620 gtgggcacgt gtcgcgcaac acacgtgcat gtgtgtagct cccgccgtag gtgagtacct 1680 tctgtgggtg tatagtgttg agatctaagg taaacgaccc tgtaattgtc acccttccct 1740 agtctcaggg agctagcagg taaca 1765 // ID Copia-22_CQ-LTR repbase; DNA; INV; 233 BP. XX AC AAWU01004224; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_CQ_; KW Copia-22_CQ-I; Copia-22_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 360-360 (2011). XX DR GenBank; AAWU01004224; Positions 4877 5109. XX SQ Sequence 233 BP; 60 A; 61 C; 48 G; 64 T; 0 other; tgttggccgt acggtgcgaa gcccactggg cagacgcgct gtcaagcgtt ggcagcagtt 60 gctgtcacct gcgagtgacg tttattgaca ttacacattt ccgtttttta cactcgatca 120 ttcgcacatt agactttgta aaacctagac gctaaataca taaaggtaaa actgtgattc 180 cccgatttac tttaacatca cgcgactatt ccactgctga accgctgcca aca 233 // ID TEC2 repbase; DNA; INV; 5328 BP. XX AC L03360; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 12-AUG-2010 (Rel. 15.09, Last updated, Version 2) XX DE Euplotes crassus transposon-like element Tec2-1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; transposase; KW TEC2. XX NM TEC2. XX OS Moneuplotes crassus OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Spirotrichea; Hypotrichia; Euplotida; Euplotidae; Moneuplotes. XX RN [1] RP 1-5328 RA Jahn L.C., Doktor Z.S., Frels S.J., Jaraczewski W.J. RA and Krikau F.M.; RT "Structures of the Euplotes crassus Tec1 and Tec2 elements: RT identification of putative transposase coding regions."; RL Gene 133(1), 71-78 (1993). XX DR GenBank; L03360; Positions 1 5328. XX CC Complete 724-bp TIRs. XX SQ Sequence 5328 BP; 1787 A; 894 C; 986 G; 1661 T; 0 other; tagagggata aaaatattga aacaaattaa ttaataatta taattataaa tattgaaaaa 60 ttatttcctc ggctatcctt tcttacttcc tctctttccc tcctaactag tctctattac 120 caaatgtgat gatattaact ataatcatgt ctaatatctg cttaaattca gtctatattc 180 taaacattat ccattcttct aaatgcctaa attctagtca aactataagg aaatatcaca 240 tttagtgtct aaatttctca ctggttccgt gcggcgctgg ttaatttgct ctcaccccag 300 accccctaca atagggggtc ataaagacgc caccaacttc attttagagg ctttagtatc 360 ctctgaatat ggtcttagat cctatgtcta gcaccttcat tggatacaag accaagtcta 420 tatcattttg aggacttaga gcaaattttg attctaatct taaatttaat cattaaattg 480 atcaaagtct gtcataaaat ggtgagtcta gtggaatttc aacatatcct aaatactaag 540 gcataactag ttaggagctg tcggttgcag aagggtcttg acctctccag tacaacaggc 600 tgacaccatt aggcatctgc tccatcggtt atctttcgat tttcagagga attgctagag 660 tctagagtgg actctagcat ttctactaaa ccagagatat tctagcagat ttagtaaaaa 720 tttagcaatt ctattaatat gaatgttaat cattaactat accctactga ttgccatttt 780 gatgttgtgc agaaactcct tgttagccag gttctctctg cttaagttcc tcttcaaggt 840 tccaaatgta tgctcaatct tgttcagttc tggagaatat ggaggaattg taaatcagac 900 catcttcatc ccagtaattg ccttgacaac cttctcagta gaatgtattg atgcattatc 960 aaacacatat acagtcctct tcattagctg cttctttgct atcattgtct ttaagagact 1020 gttcagcttt tctaagaatg taatgaaatt ttggtctttg gtggtctcag ttttgatatg 1080 gaagatctta tgctgttcga cttgagctgc tattcagtta aatctctgat tagtgcttct 1140 aataagtttg acaggctcat ctccaatctt attccatgta tataaaggta aggctgatgc 1200 attaaatgag cattcatcaa tatacacaat agtgtagtgt cacttctcca ataacctaat 1260 gaagctctta aagatctcct tatcatcctt aagatgtgat cggagtgaac taggaggtct 1320 tgtattagat ctctgccaag aatatcctaa tgtattcttc aaacaactcc tgacttcatc 1380 atagctgtaa gatggaatgt cctcattctc cttagatttc gaattagcaa aatcctttat 1440 gctcttcaca gtcacacatc ttcccttctt gtttgcaata tatatttcaa agagcctttt 1500 gagattgcat agcctatcta ccttatctct cttcttctga agtgatttag cttcatttag 1560 tttcttccat ctgaaatatt tatcaattca gatctttaat ctcttcaatg gcaaacgcaa 1620 ccatttggca acttcctcta aactttgatt gttctcaagg agcatcttct ctactattat 1680 ccggttctga gagaacttaa tcatgtcttt gttcttgaaa ggccatgatc ttctcttaag 1740 ctttactcct ttctgaataa atgcctcata atattccaaa gtgactatga tttcctcatc 1800 ttgattatcg cttaatgatg gaaggatatc ttgctcattt ggttcctcct caattatctt 1860 ctcatacttc tttgcctctt cttctaacat agaagtagcc tcaaccattc tccttgtact 1920 tacttttgat aatgagagtc caataaagtc tttaagaagg aactgttttc cgctatatta 1980 attgagaaaa tgcgccaaac tgccatttgc taatcttggt tttagcttct tcaagttcct 2040 taactgatgc tgataatgta aaataactat accattaatg tattaggaag acagacatat 2100 aatttcatga gcatagccta gatattcaca agtaggatat ctcatcgcgc ggacttactt 2160 catgatcaaa tatataaatg attaaattag gattaatttc ataatattta atatggaaac 2220 acgtaattca gcgaagaaga aagatatcaa gaccaaagac ttctttgaat atgagatgat 2280 acctttcaaa acacatagag attcagcaat ggagaagaag agtaagaatt cttcacaagt 2340 agaaagcttc ttgaaggata atgaagatga taaatcagaa gatcttggac atggaaatag 2400 ccagacagag cagggagaaa tcatggctgt caaagattct attatggatg tgaatcaggt 2460 tctactttca caaatcctta atgcagtgaa taagaaggtt gaagtagtcc tgcctcaaga 2520 tcatatgaac atgatgcaga agatcctcag cacattagaa agaattgagc agaagattga 2580 tggcaggcac acttctgctc ccaaccaaca tgtattggag cctaactctc ctatacttga 2640 aaacgataac tctgatccgc ttggcgatat tcctcctgac catgaggtca tactagagcc 2700 agagattgta ttcccgtatg agctattctt gagagataag ttcctgttgc taccaaacaa 2760 gaaatagaaa gctatacaat aaatataaga gagagtgagg aggaaatgat ccagttgctt 2820 tctatatttt cagatctgga atatctctgg attcaacaag gagagactgt ggtagaatat 2880 ttgcaaattt tcaaaagcaa gatgtaggat tggatcataa tgatgttttt gattacttca 2940 agtttctaga tgaagagcaa attaagaaca agaatacaat caagaaggag tatggactaa 3000 tcagaagaat cttaaatgtt tcctatgcag tagacaagac caagtttcca agagccaact 3060 ttaagagtaa gaagagacta aaatcagtca ataagaatcc agcattaagt aggaaacttt 3120 tgctagaaat ctgtaatgag ctgtatgcca aggattataa gatggaggca ttagtggttc 3180 accttatgtt tgcttgtgca ttaagagctg atgaggttag atttttgatg tttaacaaca 3240 taaagaagga tagaatagga acatctatcg taatttatag gtcaaaaact gaagaagagc 3300 agacattggc tattgatcaa gatcttgaag atcgtattga agaatacaag cagatattga 3360 tcaaagccgg taaatatatt gaagagacca gatatacaac aagaggtgaa cagaagagag 3420 gcatattcat atttaaggat tcgtacattt gggtttatag actattcatc aagagattca 3480 aggaaatatt aggagatgat tttggaatca ctccagctat aatgagaagc tctgctatca 3540 gtgatacaac aggagaagga aacattatac aagctgctag acttgccaat cataaggata 3600 tgagaatcac tagaaagcat tatcttgctg ctcagaaacc atttgaagtt ggaagagaag 3660 agagaaaaga ataagcctaa tttgttgata aatattaaag attaaatttt atttgtaatt 3720 ttattttggg aagatgtcta aagctttatc agaccagact aagtttgtga agattagaag 3780 agaggaactt gaatcaaaga tccgcagaaa ggaagactta gagaacattt ttaagaattg 3840 ctgtaagaca tcatatcagc atctaggtca atattatcta cctcataaaa gtcactgccc 3900 aattaggttt ctgagagatg tattttcagg cagaaagaag gtattccata agtcttaatg 3960 aatataggca attaagcaaa gggacttgaa gcctattgtc gttcctcaat atgaggagtt 4020 atctcctaag aagatatatg agaagattaa gaagcattgt ccagagatca tcctgtatct 4080 tcctgactat ttagagaagg atgactatac tcctcccaag aagttcatgt gggatgtatt 4140 ttccacattg gattctgatc ttgcttttca gtttgtcaag tttgctattg atcaaagagc 4200 agaagaagag aacgaaggag acaagaccat cgaaatagat gaagatgttc tcagagatat 4260 gaaatctgtc aaatatttct cgaagaagaa aggcaaggca ttgtatatgc tcaaagccaa 4320 taaggattat actacagtca agagaaagag aagaagagag ttcactacct ttgacccaga 4380 taacaaggag gaggagaaag agtatcatgg caagagagtt aagagacagg aaactgaaaa 4440 tgtcatgata tctaatccat tcaagaagaa aagaaatgta ttcttggatg agaaggaggt 4500 gagaattagt agaaggaatt cagatgagag tgacatgagg atcgatgtac ctaagaaggg 4560 agctatgaca cctaatcctt ttagtcataa agattctgaa tatttaaatt tttactaaat 4620 ctgctagaat atctctggtt tagtagaaat gctagagtcc actctagact ctagcaattc 4680 ctctgaaaat cgaaagataa ccgatggagc agatgcctaa tggtgtcagc ctgttgtact 4740 ggagaggtca agacccttct gcaaccgaca gctcctaact agttatgcct tagtatttag 4800 gatatgttga aattccacta gactcaccat tttatgacag actttgatca atttaatgat 4860 taaatttaag attagaatca aaatttgctc taagtcctca aaatgatata gacttggtct 4920 tgtatccaat gaaggtgcta gacataggat ctaagaccat attcagagga tactaaagcc 4980 tctaaaatga agttggtggc gtctttatga ccccctattg tagggggtct ggggtgagag 5040 caaattaacc agcgccgcac ggaaccagtg agaaatttag acactaaatg tgatatttcc 5100 ttatagtttg actagaattt aggcatttag aagaatggat aatgtttaga atatagactg 5160 aatttaagca gatattagac atgattatag ttaatatcat cacatttggt aatagagact 5220 agttaggagg gaaagagagg aagtaagaaa ggatagccga ggaaataatt tttcaatatt 5280 tataattata attattaatt aatttgtttc aatattttta tccctcta 5328 // ID RTEX-11_BF repbase; DNA; INV; 6491 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-11_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-9_BF; KW RTEX-11_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6491 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6491 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1728-1728 (2009). XX DR [2] (Consensus) XX CC The complete RTEX-11_BF consensus sequence contains two ORFs. The CC RTEX-11_BF ORF1 protein contains the esterase domain. The 3' CC terminus is composed of the (TTG)n microsatellite. XX FH Key Location/Qualifiers FT CDS 119..2440 FT /product="RTEX-11_BF_1p" FT /note="esterase domain." FT /translation="MCGVKLRSNNLENWLQHHKDGFDRDFESNKGTSVTWK FT STQGATSLTVSTNTRKTGKGLLSVHFYQRKKGLMVQGQCTKWWIDNFYAEI FT VRLPLPDKRTRATSKVQDGVQLRVRHALDSFPIPDLETTPTPASRHNVDET FT PTLPKRTNPRSLEGLKTPDGSDSGEQMRLESKNDAKCKDATIKDGAQTDGQ FT IKDNLPTNNETIMSPLSSAQKSGPSPKHEVAPKAVDKRDLERLASNIEKSL FT TISDTAFDDLQKQFVDLTDRQNSLRDDWITKEREREEVHLKSLKELSAQVE FT ATQKSLEDHIKKVISKLQNDLLLAQKDLVSQKEAMVKERQETMNEMMVERN FT LWRERYHGLSTKVDALVASNTDLQQIIETQNAEIQALVERSKELEQMFNSC FT KCKSESIKSTMPPGMTKADRTHDNDVPLKNPEDNEMHALSDNEAIVKQQSV FT VNSVKTPAANNHTRGTTDSTEIRIFADSIFRDVDADRAFNGRSAKIHRCST FT VQAAMNTMKTTQDSTTKTVIIHLGSNDLDNTKQHSDSVSETLNNTHNLLAA FT TKTSFPNADISVSQVLQRGSNPSSSLNQNIKSYNQEMLKSSKKGTFTYIKH FT KKLTQDRSLYLGDQIHLNHRSGTKLLVADLKHSLKISPGQNVERSQPNLTH FT QPVHLTHHQWNHQPAWRGPLNPPPSPYVRGRQPAWRNPANPAQGPSAGGNQ FT SAPGSFQNARPQHNQRGTLKGQHQQRDAHTVAVTSGSKQIVKADPARVGHR FT KYPGLELKHFGKSLRKALNFLLS" FT CDS 2564..6295 FT /product="RTEX-11_BF_2p" FT /note="AP endonuclease, RT domains." FT /translation="MPSKFLTVMGKKSALILSTWNIQGGCAKKFCDVDFCS FT FFSNSDLICIQETWADNFVDFDFPGFSLFSSCRSRKKKAKRNSGGVATLFK FT SSFKKGIHKLDSSSPDTIWCKLDKHFFGLTNDIYLCNVYIPPETFPKSSEI FT DPFDTLSSDICKFSNLGDVVIVGDLNARTGSITETYFDTDSSDMVTADVMD FT VKNRNNRDVKVNNYGRCLLDLCSSADLTILNGRFAGDLKGDFTCHHYNGSS FT VVDYCITQRSMLSDIQYVHVSPLSPFSDHCHVSFSLSTKHAPLISEEKCPS FT KLNPIRFIWDENSNDLFSAALNSSETSSKLNDFCQETFHSAENAVSRFTEI FT LLDASVRSLKIRRRKQVKKTQIKKNKAWFDQCCWSLRQQLKKLSLQIKKEP FT WAHEIRIKYFSTLKDYKKLIKQCKRRYKAEILTKLENMSEKNPKEFWSLLN FT FLDNEVNGKTQKNTTDPNITSTEWTDHFKGLNNLIKNKNFDTGFEKTIAER FT IKHLDKNPSNLLDYPFTMEEVLDGVKQLKSGKACGIDSISNDMLKQGSKQL FT CQSLITLFNTILSEGSFPSNWNTSILTPIFKSGDKSNVDNYRGIALSSCLS FT KLFTRLLNTRLQNFVEDNNLLADTQFGFRKKCRTSDNVFILKSLIEKYTQK FT KKGKLFVCFVDMKKAFDSVWRDGLFYKLLHYGIDGNFFNVLKSMYMNVNYA FT VRIENGLSDSFVSTCGVRQGCNLSPLLFNLFINDLPQCFDRDQCDAATLTS FT RSINCLSWADDLALISLSKQGLQQCINTLQTYCDNWKLTVNVSKTKVLIFS FT KGNTKKISKEFFIYDSEIEVTDSYTYLGIPFTSSGKFKTARKFLKTKAMRA FT IFKLKSLLSSENISIALGKDLFDKFVKPILMYGSELTCFDHTSKTLKLIVS FT KNIPISSPKNTLSFLLKKLQIDDNIPLSIRKSTCTDATHSYIINFERRPDK FT DMFLRHASGLDIEKDGFSTDNTRLPLDIPEFDILDTRFQKFLLGVHSKSSN FT DGVRGELGTFPIIIPARIQLIKYWHRLVNLPEDSLLRDAYNTVLTDNQDWI FT NHVKDILCYHGFGHIWMKPEAYKTDYIANQLRLRLQDTYIQEWFSSIENNS FT KLSFLSKSKECYEQETYLYDINNFEIRKSITQLRISSHKLNIETGRYHNVA FT PDQRFCPFCPKQIENEFHFVMECPMYDMLRNKLFTFLCLNTTDFKILDTRS FT RFDYIFRSDSPHNAKIGKYIKDCFVIRKDSDTHV" XX SQ Sequence 6491 BP; 2203 A; 1341 C; 1205 G; 1742 T; 0 other; aacgaagaaa actccgaagt caaagacgcc gaccaccaaa tcttggaaca tcaacttgaa 60 caagcttcca ctggatgaat ctctatccat cgattttgaa gacgaagaga aaactcggat 120 gtgcggggtt aaactgcgat caaacaactt ggaaaactgg ctccagcacc ataaggatgg 180 ctttgatcga gacttcgagt ccaacaaagg aacgagcgtc acctggaaat cgacgcaggg 240 agcgacaagc ctgactgtct cgacaaacac acggaaaaca ggtaaaggac tcttatctgt 300 tcacttctat caacgaaaga aaggattgat ggtccagggc caatgtacta agtggtggat 360 tgacaatttc tatgcagaaa tcgttcgttt gcccttgccg gacaaacgaa caagggcaac 420 ttccaaggtc caagatggcg tccagttacg cgttaggcac gcactagaca gtttcccaat 480 acccgatctt gagacaacac caacaccagc aagtcggcat aatgttgatg aaacaccaac 540 tttgcctaaa cgcaccaacc ccaggagtct agaaggtcta aaaacacccg atggtagcga 600 ttctggagaa cagatgcgac ttgagtccaa aaacgacgcc aagtgtaagg atgcgacaat 660 taaagatggc gcccaaactg acggacagat caaagacaat ctaccgacaa acaatgaaac 720 aataatgtcg ccactttcgt cagcgcaaaa atcaggcccc tctcctaaac atgaagttgc 780 accaaaagcc gtcgacaagc gggatctaga acgcctggct tctaatatag aaaagtctct 840 gacgattagt gacactgcct ttgatgatct gcagaaacag tttgttgact tgacagacag 900 acaaaattca ctgagggacg actggatcac aaaagaaagg gagagagaag aggtacacct 960 gaagtcgcta aaagagctct ctgcacaagt tgaagccact cagaaatccc tagaagatca 1020 cattaaaaaa gtcatctcca agttgcagaa tgacctttta cttgcccaga aggatttggt 1080 ctcacaaaag gaagctatgg tcaaagagag gcaggaaacg atgaatgaaa tgatggtcga 1140 acgaaaccta tggagagaga gataccatgg cttgtctact aaagtcgacg ccttggtcgc 1200 ctcaaatacc gatcttcaac aaatcatcga aactcagaat gctgaaatcc aggcacttgt 1260 tgagagatct aaagaacttg aacagatgtt taactcttgc aaatgcaaat ccgaatcaat 1320 caagtcaacg atgccaccag gaatgacaaa ggccgaccgc acgcatgaca atgacgttcc 1380 actgaagaac ccagaagaca atgaaatgca tgcgctctct gataacgaag ccattgtcaa 1440 acaacagtcc gttgtaaaca gcgtgaaaac tcctgcagct aacaatcaca caaggggtac 1500 aactgacagc accgaaatca gaatcttcgc agactctatc ttcagagatg tagatgccga 1560 ccgtgctttc aatggaagat ctgctaagat acataggtgt agcactgtac aggcggccat 1620 gaatacaatg aaaaccaccc aagattccac aacaaaaaca gtcatcattc atcttgggtc 1680 caacgacctg gataatacta aacagcacag tgactccgta agtgaaactc tgaacaacac 1740 ccacaacctt ctggccgcta caaaaacatc attccccaac gccgacatat cagtatctca 1800 agtgctacaa agaggttcca atccctcatc ctcgctcaac caaaacatca agtcctataa 1860 tcaggagatg ctgaagtcat ccaagaaggg aactttcact tacataaaac acaaaaagct 1920 cacgcaagat agaagcctat acctgggaga ccaaatccac ctgaaccaca gaagtgggac 1980 aaaactgctg gttgcagact taaaacactc tctgaaaatc tcacctggac aaaacgttga 2040 gagaagccaa cccaacctga cccaccaacc tgtccacctg acccaccacc aatggaatca 2100 tcagcctgcc tggaggggtc cactgaaccc gccacccagt ccgtacgtac ggggacgtca 2160 gccggcttgg aggaatccag caaacccagc acaaggacca agcgcgggag gtaaccagtc 2220 tgccccggga agtttccaga acgctcgtcc tcaacacaac cagcgcggga ctctgaaagg 2280 tcaacaccag cagagggatg cccacaccgt cgcagtcacg agtggaagca aacagatcgt 2340 aaaggccgat ccagctagag ttggacaccg caagtatccc ggcctagaac tgaaacactt 2400 cgggaaaagt ctccgcaagg ccctgaactt ccttttatct taaatactaa gtgaacgcta 2460 gcggtcactt atggatatca tagcagttat accaaggtat cgtgctatac atttatctta 2520 tagatctgaa tactatcata atagcttggt tcctgtagtt taaatgcctt caaagtttct 2580 cactgtaatg ggaaaaaagt ccgccttaat tttgtccacc tggaatattc aaggcggttg 2640 tgctaaaaag ttttgtgatg ttgatttttg ttctttcttt agtaatagcg atttgatttg 2700 tattcaagaa acctgggcag acaactttgt agatttcgat tttccaggtt ttagtctttt 2760 tagtagttgt agaagtagaa agaagaaagc gaaaagaaat tcaggtgggg tagccacttt 2820 gtttaaaagt tcttttaaga agggtattca caaacttgac agtagctcac cagacaccat 2880 ctggtgtaaa ttagataaac atttttttgg gcttacaaat gatatatacc tttgtaatgt 2940 ttatattccc cccgaaactt ttccaaagtc ttcagaaatt gatccttttg acacattatc 3000 cagtgatatt tgcaaatttt caaatttagg agatgtagtc atcgtaggag atttaaatgc 3060 caggacaggg tcaattacag aaacatattt tgataccgat tcctccgata tggttacagc 3120 ggacgtaatg gacgttaaaa acagaaacaa tagggatgtt aaagttaata attatgggag 3180 atgtctgctt gatttatgtt cctcggcaga cttaactatt ttaaacggca ggtttgcagg 3240 tgatttaaag ggagatttta catgccatca ctataacgga tcgagtgtag tagattattg 3300 catcacacaa agatcaatgc tttccgatat tcaatatgtg cacgttagtc cattatcgcc 3360 tttttctgac cattgtcatg tttcattctc cttgtcaaca aagcacgctc cacttatctc 3420 agaagagaaa tgcccatcta aactaaaccc aataagattt atttgggacg aaaattcaaa 3480 tgatttgttc tcggcggctt tgaatagctc tgaaacaagc tcgaaattaa acgacttctg 3540 tcaggaaaca tttcattcag ctgaaaatgc tgtgtcccgc tttacggaaa ttctattaga 3600 tgcatcagta aggtcactga aaataagacg tagaaaacaa gttaagaaaa cacaaatcaa 3660 aaagaataaa gcttggtttg accagtgttg ttggtcctta cgccaacaac tgaaaaaact 3720 atctttacag ataaaaaaag aaccatgggc tcacgaaata agaatcaaat atttctcaac 3780 tctaaaagat tacaaaaaac tcattaaaca atgcaaacgg agatataaag ctgaaatatt 3840 gaccaaatta gaaaatatga gtgaaaagaa tcctaaagaa ttttggtcac tattaaattt 3900 tctagacaac gaagttaacg gaaaaacaca gaagaataca acagacccta atattaccag 3960 cactgaatgg acagatcact ttaaaggact gaataattta ataaagaata aaaactttga 4020 tacgggtttc gagaaaacaa tcgcagaacg aatcaaacat ttagataaaa acccgtctaa 4080 tttgctagat tatccattta ctatggaaga agttctagat ggcgtcaaac aattaaagtc 4140 tggaaaggcc tgtggtatag attctatttc taatgatatg ttaaaacagg gatcaaaaca 4200 gttatgtcag tcattgatta cccttttcaa tacaattctt tccgaaggtt catttccatc 4260 taactggaat actagtattc tcactccaat attcaaatca ggtgacaagt cgaatgttga 4320 caattacaga ggcattgctc tctcgagttg tctatctaaa ttatttactc gtctcctaaa 4380 cactagattg caaaactttg tagaagataa taacctattg gcggacacac aatttggatt 4440 ccggaagaag tgccgaactt ctgataatgt gtttattcta aaatccctta ttgaaaaata 4500 cacacaaaaa aagaaaggca aattgtttgt ttgttttgtg gacatgaaaa aagcatttga 4560 tagtgtatgg cgtgatggac tattctacaa attgttacat tatggtatcg atggaaactt 4620 cttcaatgta ttgaaatcta tgtacatgaa tgtcaactat gctgttagaa tagagaatgg 4680 tttatcagat tcctttgttt ccacatgtgg tgtaagacaa ggttgtaacc taagtccgtt 4740 gctatttaac ctatttatca atgatctgcc gcaatgtttt gaccgtgacc aatgtgatgc 4800 agctacttta acttctagat ctattaattg tctgtcctgg gcagacgatc ttgccttgat 4860 ttcattgtca aagcagggac ttcaacaatg catcaatact ctacaaacat attgtgataa 4920 ttggaaatta actgtcaatg tatctaaaac aaaagttctt atcttctcta aaggcaatac 4980 aaaaaaaata tcaaaagaat tcttcatata tgatagtgaa atagaagtta cagacagtta 5040 cacctacctt ggaatccctt ttacgtcctc aggtaaattt aaaacagcca gaaaatttct 5100 aaagaccaaa gctatgagag ctatctttaa attaaaatca cttttatctt ccgaaaatat 5160 ttccatagcg ctaggtaaag atcttttcga caaatttgta aaacctatcc tcatgtatgg 5220 atctgagtta acatgctttg accatacctc gaaaacactg aaacttattg tctcaaaaaa 5280 catccctata tcttctccaa aaaatactct ctctttctta ttgaagaaac ttcagatcga 5340 cgataatatt ccactgtcga ttcgaaagag cacttgcact gatgcaactc attcttatat 5400 cattaatttt gagagaagac cagacaaaga catgttctta agacatgcgt ctgggcttga 5460 tatagaaaag gatggctttt caaccgataa cacacgcctc cctttagata taccggaatt 5520 tgacatactt gacacacgtt ttcagaaatt tctacttggt gttcattcaa aatcatctaa 5580 tgacggcgtg cgaggagagt taggaacttt tcctattata atacctgcta gaattcagct 5640 cattaaatac tggcaccgtt tggtaaactt acctgaggac tctctgcttc gcgatgcata 5700 caacactgta cttactgata atcaagactg gattaatcat gttaaggaca ttctctgtta 5760 ccatggcttt ggacacatat ggatgaaacc tgaggcttat aaaacagatt atattgccaa 5820 ccaacttcgc ctccggctac aagatacata tatacaggag tggtttagtt ctattgaaaa 5880 caattcaaaa ctgtcttttc tgagcaagtc aaaagaatgt tatgaacaag aaacatacct 5940 ttatgatata aataactttg aaattcgaaa atccataaca cagttaagaa tcagtagtca 6000 taaattgaat attgaaacag gaagatacca caatgtagcc cctgaccaaa ggttctgccc 6060 attttgtcca aagcaaattg aaaatgaatt tcacttcgtc atggaatgcc ctatgtatga 6120 tatgttacgc aataagttat tcacattcct atgtctaaat actacagatt tcaagattct 6180 agataccaga agtagatttg actatatctt tagatccgac agccctcata atgctaagat 6240 aggtaaatac ataaaagatt gctttgtgat tagaaaagat agtgacactc atgtttagcc 6300 ccacctgatt gtatagaggt ttttccttat tatcctatat tgtttaatta gtagaatctt 6360 attatcctat actgtttaat tagtagaatt gtaagatata ttctgtattc tttctctttt 6420 tgtgttgtag tttatagttg accatacaaa agtaactgta ctttttgtcg tgtcaataaa 6480 gcttgttgtt g 6491 // ID Harbinger-N13_BF repbase; DNA; INV; 213 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N13_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N13_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-213 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-213 RA Kapitonov V. and Jurka J.; RT "Harbinger-N13_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 805-805 (2008). XX DR [2] (Consensus) XX CC It contains 36-bp TIRs and is flanked by TWA TSDs. XX SQ Sequence 213 BP; 56 A; 56 C; 53 G; 48 T; 0 other; gggcgcggtc acataggtcg tgcgatcgtc gtacgatttt gatgccatag gattttcaag 60 caggccgtga cagaaccaac caccgacatg gatccaaaac agcattttgt gtaccaagac 120 agttgaactt tgcaggagac gtacattttt gtcctccaaa atgcccgaag ttagcgatcg 180 tacgacgatc gcacgaccta tgtgaccgcg ccc 213 // ID Copia-3_AA-LTR repbase; DNA; INV; 246 BP. XX AC supercont1.3; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_AA_; KW Copia-3_AA-I; Copia-3_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-246 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.3; Positions 2026230 2026475. XX SQ Sequence 246 BP; 72 A; 66 C; 30 G; 78 T; 0 other; tgacggacta cacaaccctg tagcaccgtt gtcaatttga agagcgaatt atttcagtgt 60 aacattttat ctaatttatc ccatttatcc tcactcgatg tacctcacac acaccaacac 120 gcacactgca tttgttacca cactgctata catttagatt ataataaatc atcattcagt 180 agttgtcacc caagagaacc acacgtgttt tcttttattc actgctctaa ccgctactct 240 gcatca 246 // ID L2-4_NVi repbase; DNA; INV; 5386 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-4_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5386 RA Bao W. and Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(4), 754-754 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(3329..3727,3731..4930) FT /product="L2-4_NVi_2p" FT /translation="MEYQSLVTEAEERTAQARETFIQNRIFDALDNDKNIW FT NEFRNLGLITKAKNDLHGFTPDELNAHFAGVSISDLERETDLISCRKPARV FT ALLFVKLLFRMLCWQSLISRRKQRVRMGYPRVSSRSLYQFSATILLCSMLH FT YQVESFLGPGRGLVSKVLEKIVHDQISEHLESRKILDPLQTGFRSNNSTQT FT ALLKLTEDIRTGIDSDKQLLTILLLFDFSKAFDTISPSKLLRKLIGMGFSR FT AVVLWVKSFITGRNQRVVTTSNGESDWLTTNLGVPQGSILGPLLFSLYVND FT LKDVLTDFERSERPEELLTNSVAHLLYADDLQTYTQCTRDNLRAGMDRLSA FT VARAVSGWASDNALHLNTGKTKAIIFGSDYNINQLQGLNLPGIEVRDGIFV FT PFVDTVTNLGVVMDSKLTWKAQVDAVSRKVNRTLYGLRSFRSCTTEALRKQ FT LASALAVSHLDYCSVVYLDVSEELQTRLQRLQNACVRYICGAGRREHITPY FT RKKLGWLDTKERRAYFAAVLVYKAYCMGQPPYLAAL*" FT CDS join(600..1118,1018..2313,2317..2640,2644..2997) FT /product="L2-4_NVi_1p" FT /translation="MTRDSCKICKKITRTGGIICKNCSATYHNVCMAWHTN FT YICQACYVRNRSNNSIRTQKIISRVDKDKNKNKDKDTKDNKDNKDIDNSKN FT LNSDKNDRNNTVNNTVNNTADTVNGTEVCNAGDNSTDYADNVSADGNSADL FT ALSASTPTVCASKIISGYKCSALLFTAVINARPFLFLLRLQPSVPAKSSAD FT ISALHYCSLPSSMLDLFSNQFDLLERTVGDLRTLFVGKLDAISQRSDPGPA FT ASEMIDRLVDRVSSLEEKVVMQNSKLEELHDDRFLLSQENKILKDKLEELT FT RVMLKVENFVDGARSGRDDGSSNTSPSNSTGGDANNNNNNECDFDASRVFV FT SLRDIESADANGGAGASTSTDTDDFEVCKSRRRIRGRKCRGRVCEHSAGGQ FT SSSTDGLSSVESTTEIIVSNIFDGSEQERGDVAHAILSTVLPSVARDDIIS FT TRLLRANDDSRRLSPWVVRLSSGAVVKEIMRAKHRITGLNTRHINVSHMPS FT ETSNSLVQSKIFINEMLSKELFLQFNSLKSLARGLGFKYVWHRGGRFLVKM FT RDGELSHSFKTAADLHAIAASYDCAISTKNCVNSGNVRDVDAASLDSQTVT FT SEKTSNNLDRGRVTGAECXGFRAGFLNATSLYAHMGMFRQFLDTAWPSYHI FT FGVAETRFGHEVDDVFVRIDGFSIMRQDRNKRGGGVALYVRNDYKATLLCS FT SPTQIEGKPGIPELMCSVQQGTLTPIFVAVIYRPPGISLENSNFTDDLRNY FT SVGYDCRIVMGDLNANMLLASRDAEFFRELACEMNLKLVNHVGDSHTWIDV FT IFTDDDNVVLNENNSLATFPSKHNIIDV*" XX SQ Sequence 5386 BP; 1496 A; 1083 C; 1298 G; 1480 T; 29 other; aaaaatcgca ctattccgtg cgttttatga gcctagaggg gcctctccgc agactggaag 60 aatttctaac ctcaaaagaa gacagtgctg gggaaccatc attcatctca gtattttcgc 120 tgtggattta acatcaatac caagcgacat cgtcttcaaa gagacagggc cgttgtcagc 180 ttcatcgcag tcggcagacg ttcgacaatc gacttcaagt tgtcgactca tatctgaagc 240 atcgactttg aggttagatc tgtgtatata tcgcttagta agcgcacaga atcatcgcgc 300 cactgcgttc gagtttcaac tttctgccac aaaagtgcag catcatcgcg actcaaattc 360 ttcgagcatc agttggctac actgattgga ttttgcgcag gcgcaatcta ctcttcagcg 420 cgcgaatttt gaattttttc gtagggcgcg aagtttaaaa ttcgatttcg aaatttgaat 480 tctctgacaa acagtattac tattactatt gcaattatct ttctagtcat aactgtccta 540 tttattattt tacttattat atttattata taacctaggt ttactattaa ttgttaaaaa 600 tgacacgaga ttcatgcaaa atatgcaaaa agataactcg cacagggggt ataatttgta 660 aaaattgttc tgcaacatat cacaatgttt gcatggcctg gcatacaaat tacatatgcc 720 aggcgtgtta tgtaagaaat agaagcaata actctattag aacacaaaag attatttcga 780 gggtggacaa ggacaaaaac aaaaataaag ataaagatac caaagataac aaagataata 840 aagatataga taatagtaaa aatttaaata gtgacaaaaa tgatcgaaat aatactgtta 900 ataatactgt taataatact gctgatactg ttaacggcac cgaagtatgt aatgctggcg 960 ataacagtac cgattatgcg gataacgtca gcgctgacgg taatagtgct gatttagctc 1020 tttctgcttc gactccaacc gtctgtgcca gcaaaatcat cagcggatat aagtgctctg 1080 cactactgtt cactgccgtc atcaatgctc gaccttttta gcaaccagtt cgatctcctt 1140 gagaggacag tcggcgatct aaggactctt tttgtgggga agcttgatgc cattagtcag 1200 agatcagacc ctggcccagc ggcatcagaa atgatcgata gactggtgga tagggtctcg 1260 tcacttgagg agaaagttgt aatgcaaaac tctaagctgg aagagttgca tgatgatcgc 1320 tttctgctct ctcaagagaa taagatactt aaagataaat tggaggagct tacgcgagta 1380 atgctcaaag tggaaaattt tgttgatggt gctcggtctg gcagggatga tggtagttct 1440 aatactagtc ctagtaatag taccggtggt gatgcgaata ataataacaa caacgagtgt 1500 gatttcgatg catcacgagt tttcgtctcg cttagggata ttgagtccgc tgatgcgaat 1560 gggggcgcag gcgcttccac gtcgactgat acagacgatt ttgaggtctg taaaagtcga 1620 cgtcgcattc ggggtcgcaa gtgtaggggg agggtttgcg aacactctgc tggcgggcaa 1680 tcctcttcaa cggacgggct gagcagtgtt gagtctacca cagagataat tgtctccaat 1740 atttttgatg gttccgagca agagcgtgga gacgtggccc acgcaattct cagcactgtt 1800 ctgccctccg ttgcaaggga tgatattata tctacgcggt tgctgcgtgc aaatgacgac 1860 agcaggcgcc tatcaccgtg ggttgtccgt ctttcgagcg gcgccgttgt gaaggagata 1920 atgcgagcga aacacaggat cactggcctc aatacgcggc atatcaacgt ctcgcatatg 1980 ccttcggaaa cgagtaatag cctggtacag agtaaaatct ttataaatga aatgttaagt 2040 aaagaattat ttttgcaatt taatagtctg aaaagtttag ctcgcggtct tggtttcaaa 2100 tacgtttggc atagaggcgg tcgtttccta gtaaaaatgc gggatggaga attgtcgcat 2160 tcgtttaaaa ctgcagctga cctgcacgcg attgctgcat cgtatgactg tgcaataagc 2220 actaaaaatt gtgtaaatag tggtaatgtg cgagatgtag acgcggcgag cctagactcg 2280 cagacggtga ctagtgagaa gactagtaat aattgactag atcgcggacg ggtgacagga 2340 gcagagtgtr acgggtttag ggcgggtttt cttaacgcta cctctctcta tgcgcatatg 2400 gggatgttcc gtcaattcct ggacactgct tggccctcgt atcacatttt cggcgtggct 2460 gagacgcgat tcgggcatga ggtcgatgac gtgttcgttc ggatagatgg gttttcgatt 2520 atgaggcagg ataggaataa gcgcggcgga ggagtagctc tgtatgtccg gaatgactac 2580 aaagctacac tactgtgttc atcacctaca cagatcgaag gaaagccggg tattccggaa 2640 taattaatgt gtagcgttca acagggcacc ctgacgccga tattcgttgc tgtaatttac 2700 cgtccaccag gtatctctct cgaaaattca aattttaccg atgatttgcg aaactattcc 2760 gtaggatatg attgtaggat agtgatggga gatctgaacg cgaacatgtt attggcctca 2820 cgcgatgctg agtttttccg agaactcgcc tgtgaaatga atttaaagtt agtgaatcat 2880 gtcggtgact cgcatacatg gattgatgta atctttactg acgacgacaa cgtagtgctg 2940 aatgaaaaca attcgctggc gacgttccca agcaagcaca atattattga tgtttagatt 3000 aactttcaaa ctgtggaact cccggctctc aacagtttta catacaggga tttcaagtcg 3060 attaatacgg gggaactcct atcccttctt gcttcttgtg attggtcgat ggtgaattgt 3120 ccggacagtg gagttgacac ccgactcgaa catctgagtc aaaacataat aagtgtcata 3180 gaccagcttg ctccgttgaa agaatacaaa ccgaagaaag aaggtctccc tccgtgggtt 3240 gacgccgagc ttatataacc aacgggacgc ggtgaaaagg cgacacaaac aggcgcggcg 3300 ggacacacca cgacgagacg aactgtgaat ggaatatcag tctctcgtga ctgaggccga 3360 agaacgtact gcccaagcac gagagacttt tatacaaaac agaattttcg atgcacttga 3420 taatgataaa aacatatgga atgaatttcg taatcttggt ctcattacga aagcaaaaaa 3480 cgacttgcac ggttttactc cagatgaatt gaatgctcac ttcgctggtg tgtccatatc 3540 cgacctcgag cgtgagacag atctgatatc atgtcggaag ccagcgaggg tggctttact 3600 tttcgtgaag ttactttttc ggatgttgtg ctggcagtcg ctcatttctc gtcgcaagca 3660 aagagtgagg atgggatacc ccagagtgtc atcgcgaagt ctctaccagt tctcggccac 3720 catcttgtag ctttgttcaa tgcttcatta tcaagtggaa tctttcctgg ggcctggaag 3780 ggggctcgtt tctaaggtcc tggaaaagat agtacatgac cagatctcag aacatctaga 3840 gtcacggaaa attcttgacc ctttacaaac cggttttcga agcaacaact caacgcagac 3900 agcactacta aagctgacgg aagacataag gacagggatt gacagtgaca aacagttact 3960 gaccatctta ctgctttttg actttagcaa ggcgtttgac acgatatccc cttcgaagct 4020 actccgtaaa ctgatcggga tgggcttctc tagagcagta gtcctgtggg ttaagtcatt 4080 tattacagga cgtaaccagc gagtggttac tacatctaac ggagaatcag attggctaac 4140 taccaacctt ggcgtcccac agggctctat tctgggacct ctcctgttca gcctttacgt 4200 taatgatctc aaagacgtac tgactgactt tgagagatca gagagaccgg aagaactgtt 4260 gacgaatagt gtcgcgcatt tgctatacgc ggacgacctg caaacctaca ctcaatgtac 4320 aagagacaat cttcgtgccg gcatggatcg tctgtcggct gtggcgagag ctgtgtcggg 4380 gtgggcctct gataatgctt tgcatcttaa tactggcaag actaaggcta taatttttgg 4440 ttcggactat aatattaatc aactgcaggg gttgaacttg cctggtattg aggtgcggga 4500 cggcatcttt gtcccatttg tcgacacggt aactaacctt ggtgtggtca tggattctaa 4560 attaacatgg aaagcgcaag ttgatgcggt tagccgaaag gttaacagaa ccctttatgg 4620 actaaggtcc tttagatcct gtaccaccga ggcgttgcgc aagcagctgg catccgccct 4680 tgctgtttct cacctagatt actgctctgt agtgtacctt gatgtgtcag aggagctcca 4740 gacacgacta cagagattgc agaatgcatg tgtgcgctat atatgtggtg ctgggaggcg 4800 cgagcatatt actccctata gaaagaaact gggctggttg gatactaagg aaagaagggc 4860 gtactttgcg gcggtgttag tgtataaagc ttattgcatg gggcagccgc cgtaccttgc 4920 ggccctttaa aaaaaaaaca gtacaggact tccagtaggg tgtccaggga tatttcggtt 4980 ccaggatcgc gaactgatgt gggactgcat tcgtttgctg tgtacggtgc gagtctatgg 5040 aattcccttc cgcagggtgt gcgaactttg ccttcgctgg ctgaatttaa gcgagcgttg 5100 cgtaagcatc tgatgcgcaa tgatctatga tgcgataatt tatgtacctg cgatgttatt 5160 tattgtcgac tgtgatttta attgtaatga cgtgatrgaa taagtrtaaw rmaamgtgat 5220 attrrtatta taataagaaa trakamkcaw gtwatgtacr aamtgtgata kaactgytgt 5280 gwtawtcagc tattgaatgc yaattggacw rtacaawctg yratactatg tatattttrt 5340 tttttcaayc acgaaaataa aacatttcta ttctattcta ttctat 5386 // ID Ingi-2_AC repbase; DNA; INV; 4207 BP. XX AC . XX DT 28-JUL-2009 (Rel. 14.07, Created) DT 17-AUG-2010 (Rel. 15.09, Last updated, Version 3) XX DE A family of Ingi non-LTR retrotransposons from a sea slug - DE consensus sequence. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; I group; KW I-2_AC; Ingi-2_AC. XX NM I-2_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4207 RA Kapitonov V.V. and Jurka J.; RT "New families of I non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1531-1531 (2009). XX RN [2] RP 1-4207 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX CC [1] I-2_AC can be considered as a member of the Ingi clade in the CC I group. CC The consensus sequence was derived from multiple alignment of CC several copies ~98% identical to each other. The 3' terminus is CC composed of the (GAA)n microsatellite. Target site duplications CC are not present.The consensus does not contain ORF1. The 5' CC terminus is precisely defined by 6 different copies of I-2_AC. CC Therefore, it is unlikely that the consensus is incomplete due to CC 5' truncations. In addition to the APE and RT domains, the CC ORF2-encoded protein contains also the RNase H domain. CC [2] Renamed. XX FH Key Location/Qualifiers FT CDS 368..4180 FT /product="Ingi-2_AC_1p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="MTLEVGNVRYGARPVRTHQAGAVPGPKPRRGPPQRLE FT GLPSSWSARRRGRKATGNTNTKQVKKSLNIMHWNAEGISNKKTELENFLKD FT NEIHISCIQETHLQEEASFKVRGYQCFRNDRQGRKKGGVLTLVRNNLNTVE FT IKKYTGEAEYIHNRITTSRGNLDLVNYYCPNDKNLVLETIEVPDTDFIIAG FT DFNSQSQSWGYNRLDRRGEEIEAWQDENHLLLVNDPLDQPTFYSRAWHSTT FT TPDLALCTENLHRHLSRKVETQLGGSDHRPVLLTIVESLPEDSPKRPRWNY FT KKADWRLFSIRTNELTKNIAVFDKNINNAITEFNENILKSAKETIPRGVRK FT NYTPYWTPELQKTHNDLIKAREEAETHPGIENNNKLQRCKAHYLKTKLECT FT RKSWRDKTHKLNLEKDTTKLWRLVKGLNDEGNKSAKITLEVEGEMEMEKRA FT AHTFVKAYAQESNIKIPRERQKEVRQEERRKRSTNNIPEAMEKDITMEELK FT KAINQLKQRKSPGPDEITNEMLQHLGNTALQTLLDIFNLSWKIGQVPQCWK FT EATMIPILKPGKSKAQPLSYRPISLTSCVCKTMERIIACRMQWYLEAQHII FT VPEQAGFRQFRSTEDQTTYLSQVIEDSFQAQKVMLATFVDLQKAFDKVWKE FT GLLVKLLRSGIQGNMYQWTKSYLHNRRARVLVDGYQGRKMLLRQGVPQGGV FT MSPLLFILFINDLMNELPKGIHAALYADDLVLWCSEEFVTTATYRMQLALD FT KLTTWTKTWCVTVNKEKSSATLFTLSQAEPKHLTLDKTPLKFENQQKYLGI FT IFDKRLTWKQHIQNAENKARKKLNIMRKLAGTQWGANEKTLRQVYLGAIRP FT HLEYGSSAWMTAAQTHKYTLEKVQNQALRIITGAMRSTPIEKMEQTTSIVA FT LQRRWECKALIQYTKATAMEDHILHTRTARSNTARLKRSNFIKESRKIQRS FT LQDHLPATVNMLNHTPEAPSWEKKNTNISIHTSIPMFTTKDDHSDLSKKSL FT TLSMLEERYPREAWIRIYTDGSATDAVKRGGAGVYVQFPNGRRQAEAVPTG FT RHCTNYRAETEALVHAANIIKNVVDPQSQVVFLTDAMSVLQAITSGKLPRL FT RKEIEDIPCLRIMLQWIPSHCGVLGNEQADRMAKLGAQKEQPDNSVNYTEI FT KTIINTLFKAPQTPDGYHELSRQEQVIIFRLRTQHTRLRQHMYKCFKLVQS FT PKCQCGEADQTVEHILQHCKTHRALRQETWPTPLTLQEKLYGPPDELTRTA FT QFMTDTGLQV" XX SQ Sequence 4207 BP; 1509 A; 981 C; 963 G; 754 T; 0 other; ccggggtggg gcgaccatcc ccgacccggc gggtcccagg tggcggatag ggaacggcca 60 ctgttataga ggttagctgt gaatatgttg aataagcagc cccggaccaa ctggcgatgc 120 tcttaccagg acggggtgag gtggtaggtg gcactaccac gtcattaaat gccctaggag 180 tcctctgacg ggagcatcat gcggaaaagg accgcattaa gtcctggcta cgggtgggat 240 ccggtaacaa ttcctccctg tgctcccagg gtaggtcctt ggacctggag ctggagtgct 300 gaaatcggag gtattgggcc gcaaggcgca ctctagagaa ccacaatacc acgtaatact 360 gcaatatatg acacttgaag taggtaacgt tagatatggt gctcgccctg tgaggaccca 420 ccaggcaggt gcagtgccgg gccccaagcc tcgaaggggt ccaccccagc gactggaggg 480 ccttccatcg tcttggagtg ccaggagacg aggaagaaaa gcgactggta acacaaacac 540 aaaacaggta aagaaaagtc ttaatatcat gcactggaat gctgaaggaa tctcaaacaa 600 gaaaacagag ctggaaaact ttctaaaaga taatgaaata cacatcagct gtatacagga 660 aacacacctg caggaggagg caagctttaa ggtaagagga tatcagtgct tcagaaacga 720 tcgccaaggt agaaaaaaag gaggagtatt gaccctggta agaaataacc taaataccgt 780 tgaaatcaag aaatatacag gagaagctga atatatccac aacaggatca ccaccagcag 840 aggcaatctg gatctagtga actattattg tccaaacgat aaaaaccttg tgctagaaac 900 catcgaagtc ccagacacag actttatcat cgctggagat ttcaacagcc agtcccagag 960 ttggggttac aacagactag acagaagagg agaagaaata gaggcttggc aagatgaaaa 1020 ccatcttctg cttgtaaatg atcctttaga tcaacccaca ttttattctc gagcttggca 1080 ttcaactacg actccagacc tagcactctg cacagaaaac ctgcacagac atctaagcag 1140 aaaagtggaa acccagctcg ggggaagtga ccatcgccct gttcttctta ccatagtaga 1200 gagcctgcca gaggacagcc cgaaaaggcc gagatggaac tacaaaaagg cagactggag 1260 actattcagc ataagaacta atgagctgac aaagaacatc gcagtcttcg acaaaaacat 1320 aaacaatgcc atcacagagt tcaatgaaaa catccttaag tcagcaaaag aaaccatacc 1380 tagaggggta aggaaaaatt acaccccata ctggacgcca gaactccaga aaacacataa 1440 cgacctcatc aaggccagag aagaagccga aacacatcca gggatagaaa acaataacaa 1500 gctacagagg tgtaaagccc actatctcaa aaccaaattg gagtgcacca ggaaaagctg 1560 gagagacaaa acgcacaaac ttaacctgga aaaggacacc acaaaactat ggaggcttgt 1620 aaaggggcta aacgacgaag gaaataaaag tgccaagatc actctagaag ttgaaggaga 1680 gatggaaatg gagaaaaggg ctgcccatac ctttgtaaaa gcatatgctc aagaaagcaa 1740 catcaaaata ccaagagaga gacaaaaaga agtcagacaa gaggaaagaa gaaaaagaag 1800 tacaaataac atcccggaag caatggaaaa ggacatcacc atggaagagc tgaaaaaagc 1860 aataaatcaa ctgaagcaaa gaaaatcacc aggccccgat gaaattacaa acgaaatgtt 1920 acaacaccta ggcaacacag cgttacagac tctactggat atattcaact taagctggaa 1980 aatagggcaa gtacctcagt gctggaaaga agcaacgatg atcccgatcc tcaagccagg 2040 gaaaagtaag gcacaaccat taagctatag acctatcagc ctcacaagtt gtgtatgtaa 2100 gacaatggag cgcatcatag cttgcagaat gcagtggtac ctagaggcac aacacatcat 2160 agtaccagaa caagctggat tccgacagtt caggagcaca gaggaccaaa caacatactt 2220 gtcccaagta atagaggact ccttccaggc acagaaagtg atgctagcaa cttttgttga 2280 tctacaaaaa gcgtttgaca aggtgtggaa ggagggtcta ctagtcaagc tccttcgaag 2340 tggtatccaa ggcaacatgt accaatggac caaatcctat ctacacaaca gaagagcacg 2400 tgtgctggtg gatggatatc aaggacggaa aatgctgctg cgacaaggag taccccaagg 2460 tggagtcatg tcacctttgc tctttatcct cttcatcaat gacctcatga atgaactgcc 2520 caaaggaata catgccgcat tgtatgcaga cgacctggtt ctttggtgct ctgaggaatt 2580 cgtcactaca gccacatatc gcatgcaact agcattggac aaactcacta catggacaaa 2640 aacctggtgt gtaaccgtaa acaaagaaaa gtcttctgct acactcttca ctctcagcca 2700 ggctgagcca aaacacctca cattagacaa aaccccccta aagtttgaga atcaacagaa 2760 gtatcttgga ataatctttg acaagcgtct cacatggaaa caacacattc agaatgcaga 2820 aaataaagcc aggaagaaac ttaacatcat gagaaaacta gcagggactc agtggggagc 2880 aaatgagaaa acacttagac aggtctatct gggcgcaatc aggccccatc tcgagtacgg 2940 atccagtgca tggatgactg cagcacagac tcacaaatac actctggaaa aagtgcaaaa 3000 ccaggctctc cgaataatca ctggagcaat gaggtcaacg cctatagaaa agatggaaca 3060 aacaacatct atagtggccc tgcagaggag atgggaatgc aaggccctca tacagtacac 3120 caaggccaca gctatggagg atcatatact gcacacgaga acagcgagat caaacaccgc 3180 aagactcaaa agatccaact tcatcaaaga aagcagaaaa atccaaagat ccctacagga 3240 ccatctacca gccacagtta acatgctgaa tcacacccca gaggcaccat catgggaaaa 3300 gaaaaacaca aacatatcta tacacacgtc cattcctatg ttcaccacaa aagatgacca 3360 cagcgacctg agcaagaagt ccctcaccct gtccatgctt gaagaaagat acccacgaga 3420 agcatggata aggatctata cagacggatc tgccactgat gctgtgaaaa gaggaggagc 3480 cggagtgtac gtacagttcc caaatggacg acgacaagca gaggcagtcc caacagggcg 3540 acattgtaca aactacaggg cagagacgga ggctctagta catgcagcaa acatcatcaa 3600 gaatgtggta gacccccaaa gccaagtagt attcctaaca gatgccatgt cagtactgca 3660 ggcaatcact tccggcaaac taccaaggct acgaaaagaa attgaggaca tcccatgtct 3720 gaggatcatg ctgcagtgga ttccatcaca ctgtggtgtc ttgggaaatg agcaggcaga 3780 caggatggcc aaactaggag cacaaaaaga gcaaccagac aacagcgtta actacacaga 3840 aataaaaact attataaata cactatttaa agcacctcag acaccagatg gctaccatga 3900 gctgtcccgc caagaacaag tgattatctt cagactaagg acacaacaca caagactgag 3960 acaacatatg tacaaatgtt ttaaactagt acaatcacca aaatgccagt gtggagaggc 4020 ggaccaaact gttgaacaca tccttcagca ctgcaagaca cacagggccc tgagacagga 4080 aacgtggcca accccactga cgctacaaga gaagctatac gggccgccag atgaactgac 4140 caggacggcc caatttatga cagataccgg acttcaagtg tgatgcgatc gagaagaaga 4200 agaagaa 4207 // ID DNA8-101_AP repbase; DNA; INV; 567 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-101_AP. XX NM DNA8-101_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-567 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2039-2039 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 567 BP; 221 A; 45 C; 54 G; 247 T; 0 other; cagtgttggg aattatctag ataaaattat ttttttaatt atcttatcta gataaaaata 60 attaaatcat ctaattatct catctagata aatgtaatgg ttatctgtgg taatttatct 120 agataaataa tataattata agaatatttt taaatactga aatagccaat aatttacgag 180 aatcgagtag tgattccaaa taccaatatt attatagtta aaaaaagaaa tatttactga 240 aatattgtca gttttatttt gttattttat ttttattcaa gttgtattaa aaattaaaat 300 tgtatattat tttagttgaa aaataacatt aaggatttat taattaataa atattaccta 360 attagtaatt actaataaat tacattattt ataatataat ttaaaaacaa attatctatt 420 ttttttatct agataaatgt atgtgtattt ttatctttat ctagataaaa aaaaatgatg 480 ttatatttta tcttatctag ataaattttg aatgaattat ctgttatctt atccagatga 540 ttttttggtt atcttttccc aacactg 567 // ID Tx1-3_AAe repbase; DNA; INV; 4688 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Aedes aegypti. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4688 RA Kojima K.K. and Jurka J.; RT "Tx1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1459-1459 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >97% CC identity. It is positioned at the deepest branch of the Tx1 CC clade, and does not show sequence specificity. XX FH Key Location/Qualifiers FT CDS 132..1205 FT /product="Tx1-3_AAe_1p" FT /translation="METRKNTVKLVFATSARVPIHLDVLRFMMRVLKIPAT FT DVHSVYKDENDQRFYVKFIDENSFNRFSSTMEEQYWFEYADGEKIRVQLEM FT ASRQFKYVRIFNLPPETEDKDIAAVLGQFGRIRQHVRERYPADYGYQVFSG FT VRGVHIEIEKEIPANLYIGHFRARCYYEGLKNKCFFCKAEGHNKANCPKLA FT EIKEKGVSGPANSRMYSQVAANLRIATSSSENESRPSSSMTVLQVPRHPSA FT GKIQERGEKATPRETIGTCAEQVSAENVAEIQPAQIDVQDSHTNDGDNAGS FT METDDTSRAHTEENGVKRPPATTSENDSSDVEKGKGKGRGRKKKQHTGNSS FT NTSRSRSRGNSRGSF" FT CDS 1361..4606 FT /product="Tx1-3_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPYIRKIATINLNAISCRVKQHLLKEFVWNNDLDVVF FT MQEVAFENFSFVPTHTAVVNISEDGKGTGILLRNNISYTNLVMNTNGRITS FT LVIDGVNFINVYAHSGSKYKRERDNLFTDEILVHLGEFKENVVLGDFNCII FT DKKDSTGAVKNICNGLKKLVGELDLLDVELSKNSHRVFTFLRGDSKSRLDR FT LYGTKNFVERVRKVEILAVPFSDHHSVVVTVEIINSEQMNFYGRGYWKINS FT SLFADEVIKEKFKSYYAQLKTRNSFQNISFWWNNDLKRGIKIFFKGESFKM FT NQQLSREKSFYYKCFNDAAKYQALNIDVSEELKLAKSKLMNLEQSRLNRFR FT SKLKANSLHSDEKLSFFHVTSFINKSSDSKLLKLKLDDGVTSNPSRLKEEI FT FNYFYEKFKIDDNVRNGCGNTLEYLDRRVDENDKQRLIKPIEMIEIEHAIK FT EASKNSSPGPDGINYEFYSTFFDLLKDDMQTLYNSYFVDGEYPPGLFSSGI FT IALIPKKGDNFDLGNKRPISMLNSDYKIFTKLLCNRLKPIMEKLLGPGQTA FT SIENSSCINNLSLLRNIIIKANKSRKFKGIILSVDLEKAFDRVDHQYLWDI FT LKTFEIPNRFIECLQKLYKNASSKVLFNGFLTNSFPIKCSVRQGCPLSMAL FT FSLYIEPLLRMIDKHIKGVLIDDVFVRVIAYADDLNIFIRNDEEFSIALEL FT INYFSIYSKIKINFSKSHYLRLNNCSLGPQLISEARDIRILGLTFDEDYKK FT MIDSNYNELIRNINYCLSLQYKRRLNVVQKIWILNTYILSKLWYVAQIIPP FT ENKHIAVLRKKCGDFLWKGCFYKVERNELYAPALKGGLSLTCVESKAKALF FT IKNLLLSGKQATTDKFMMEQIMNNKLNRNTREWIKLAVKFEERNLKTSKQI FT YDDLITEKNIKIKQHDKQPHLQWETLYENVNQNFVCSESKSILFFVFRDII FT PCKSKLYRHRVRGIDSPTCDSCGRIDSVEHRIKNCSGSEKIWSWLNGVMKS FT KFKINVEDAIDLVQFGIKPKNCKFKAALWLVIETLTYCIKNHSTGCIADLK FT QQISVKRWNNKKLFDCQFKQYMYLL" XX SQ Sequence 4688 BP; 1624 A; 742 C; 1009 G; 1313 T; 0 other; cagtttatac tcggacgctg tagagtgaac acgtactcga gagtgcttgt caagttataa 60 acgttttgct cacacctagt gtgatttttt ttcttttgtt aaaagcggaa tctattcagt 120 gaatagatag aatggagact agaaagaata cggtaaaact ggtttttgct acgtcagcaa 180 gggttccgat acatctggac gttcttcggt tcatgatgcg cgtgttgaag attccggcga 240 ccgatgtaca ctcggtgtac aaagatgaaa atgatcagcg tttctacgta aaattcatcg 300 acgagaacag ctttaaccgg ttcagcagta caatggagga acagtactgg ttcgaatatg 360 cagatgggga aaagatccga gtgcaactcg aaatggcaag cagacagttc aagtatgtcc 420 gaattttcaa tttgccccct gaaacagagg acaaagatat cgcagcagtc ctgggacagt 480 tcggaaggat tcggcaacac gtgcgtgaga gatatcctgc agactatggc taccaggtgt 540 tcagtggcgt aagaggagtc cacattgaga tcgaaaagga aattccggcg aatttgtaca 600 tcggccactt tcgtgcaaga tgctattacg agggattgaa gaataaatgt ttcttctgca 660 aggcggaggg acacaataag gccaactgtc ctaagctagc ggaaatcaag gaaaaaggag 720 tctctggtcc agccaacagc aggatgtaca gtcaggtggc agcaaatcta cgaatagcaa 780 ctagttctag tgaaaatgag agcaggccgt cttcgtcgat gacggtgctc caagttccga 840 ggcacccaag tgcaggtaag atccaggaga gaggggaaaa agcgacgcca agggaaacaa 900 tagggacatg tgcagagcaa gtgagtgctg aaaacgtggc tgaaatccag ccggctcaaa 960 tagatgtgca agattcgcac actaacgacg gagacaatgc tggttcgatg gagacggatg 1020 atacatcacg cgcacacacg gaggagaacg gtgtgaaaag accaccggcg accaccagcg 1080 aaaacgatag ttcagatgtg gagaaaggga aaggtaaagg acgtggtcgt aaaaagaagc 1140 agcacaccgg gaattcatcg aatactagca gaagccgctc gcgtggtaac tctcgtggaa 1200 gcttctgatg gaatataaca cgactaaaat agtgacggta tgtgtgaaaa tattggacgt 1260 ttttgggtga attgtttata gccctggttt cgatggtttc ggtggagttc tatttagact 1320 gtagtcgttc tattaccgag tatcgtagtg tagagtgaac atgccataca ttcgaaaaat 1380 agctacaatt aacttgaacg ctattagctg tcgagtgaaa caacacctac tgaaggagtt 1440 tgtgtggaat aatgatcttg atgtagtatt catgcaggag gtggcatttg aaaacttttc 1500 atttgtacca acacacacag ctgtggtaaa catcagtgaa gatggtaagg gtactgggat 1560 tttgttacgc aacaacatca gttacacaaa tcttgtaatg aacactaacg gtcgaatcac 1620 atcacttgta attgacggtg taaatttcat caacgtctac gcgcattccg gttcgaagta 1680 caagcgagaa agagataatt tgttcactga tgaaatatta gtgcacttgg gcgaattcaa 1740 agagaacgtc gttctcggag acttcaattg tataattgat aaaaaggact ctacgggcgc 1800 agtgaaaaac atttgtaacg gactgaaaaa gttggttggt gaactcgatc ttttagatgt 1860 tgaactttca aaaaattcgc atcgcgtttt tacctttctg agaggtgatt cgaaatccag 1920 gcttgacaga ttgtacggta cgaaaaattt cgtagaacgc gtaagaaaag ttgaaatact 1980 tgcagttcca ttttccgacc accacagtgt agttgtgacg gtggaaataa tcaacagtga 2040 acaaatgaat ttttatggta gagggtactg gaagataaat tctagcctat ttgcggatga 2100 agttataaaa gaaaaattta aatcttatta tgcccagttg aaaaccagaa attcgtttca 2160 aaacatcagc ttttggtgga ataatgattt gaaaagaggg attaaaattt tttttaaggg 2220 tgaatcgttt aagatgaacc agcaactatc tcgtgaaaaa agtttttact acaaatgctt 2280 caatgatgca gcaaaatatc aagctttaaa catagatgtt tcagaagaat tgaaattggc 2340 aaagtctaaa ctgatgaacc tggaacagag tcgtttgaac aggttcagaa gtaaactaaa 2400 agcgaattct ttgcactcgg acgaaaagtt gtcttttttc catgttacat cattcataaa 2460 caaatcttca gactcaaaat tgttgaaatt gaaattggat gatggagtca cctcgaatcc 2520 atcgcggcta aaagaagaaa tttttaatta tttttatgaa aaatttaaga tcgatgataa 2580 tgtgcgaaat ggctgtggaa ataccctaga atatcttgac aggcgggtag acgaaaacga 2640 taaacagcgt ttgattaaac cgattgaaat gattgaaatt gaacatgcaa tcaaagaagc 2700 atccaagaac agttcacctg gtccggatgg tatcaattat gaattttatt caacgttttt 2760 tgatctactt aaagatgata tgcagacatt atacaattct tactttgttg atggagaata 2820 tcctcctgga ctattttcct ccggaattat cgctttgatt ccaaaaaaag gagataactt 2880 cgatctaggc aataaaaggc caatcagcat gctcaatagt gactataaaa tttttacaaa 2940 attattatgc aaccgtttaa aaccaataat ggagaaacta ctaggcccag ggcagacagc 3000 aagtattgaa aatagttctt gtataaataa cttgtcgtta ttgagaaaca ttattattaa 3060 agcaaacaaa tctaggaaat tcaagggtat tatattgtcc gtagatctgg aaaaggcctt 3120 cgatcgagtt gatcatcaat atctttggga tattttgaaa acttttgaga tcccgaatag 3180 atttattgag tgtctgcaaa agttatataa gaacgcaagc tccaaagttc tgttcaatgg 3240 atttttgaca aattcgtttc caattaaatg ttctgtaaga caaggatgtc cattaagcat 3300 ggcactattt tcgctgtata ttgagccttt gttgaggatg attgacaaac atattaaagg 3360 tgtacttatt gatgatgttt tcgtaagagt cattgcttat gcagatgatc tgaatatttt 3420 cattagaaat gacgaagaat tcagcattgc attggaactg ataaattact ttagtatata 3480 ttcaaaaatt aaaataaatt tcagtaaatc acactatttg agactgaaca attgttctct 3540 tggtcctcag cttataagtg aagcgagaga cattaggata cttggactga catttgatga 3600 agattataaa aaaatgatcg attcaaacta caatgaatta attaggaata tcaattactg 3660 cttatctctg cagtataaaa gaagactgaa tgttgttcaa aaaatatgga ttttaaatac 3720 ctatattctg tcaaaattgt ggtacgttgc acaaatcata ccaccggaaa ataagcatat 3780 tgcggtgttg aggaaaaaat gtggggattt tttgtggaaa ggatgttttt ataaagtgga 3840 gcggaatgaa ttatatgctc ctgctcttaa gggcggtctc tcgcttacat gtgtggaatc 3900 gaaagccaaa gcacttttta ttaagaactt actattatcc ggaaaacaag ccactactga 3960 taaatttatg atggagcaaa taatgaataa caaattgaat cgaaacacga gagagtggat 4020 taaacttgca gttaaatttg aagagcggaa tttaaaaaca agcaaacaaa tttacgatga 4080 cttaattaca gaaaaaaata taaaaattaa acagcacgat aaacagcccc atttgcaatg 4140 ggaaaccttg tatgaaaatg ttaaccaaaa ctttgtatgc agtgagtcca agtcgatttt 4200 atttttcgtg tttcgggaca taattccatg caaatctaaa ctatataggc acagagtgag 4260 agggattgat tctccaactt gcgattcttg cggaagaatc gactctgttg agcataggat 4320 caaaaattgt tcgggatctg aaaaaatatg gagctggtta aatggtgtaa tgaaatcaaa 4380 atttaaaata aatgtagaag acgcgataga tttagtgcag tttggtataa aacctaaaaa 4440 ttgtaaattc aaagcagctc tgtggttagt catagaaaca ttaacctatt gtattaaaaa 4500 tcattcaact ggttgtatag cagatttaaa acagcaaata agtgttaaac gctggaataa 4560 caaaaaactg ttcgattgtc aattcaaaca atacatgtat ttattgtaag tatttgtaaa 4620 tagtcttaat caatgtcaag aactttgtag atataatgaa ataaatagtt aaaaaaaaaa 4680 aaaaaaaa 4688 // ID MuDR5x_SM repbase; DNA; INV; 1584 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR5x_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1584 RA Jurka J.; RT "MuDR-type elements from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1904-1904 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 179..952 FT /product="MuDR5x_SM_1p" FT /translation="MKFSTTQKGNRVLLYQGFEYTKYRENANGVVTRRFRI FT YFSAKCKAFLKMHNENIIGDAPQHCHDSNPQKAQANFLKASRNIMGNVLSK FT VNMDILEHLPKQSSLSRSLFQHKETSHMPNPTTTNFLIPNKYADLILHDTG FT ADDIERILAIGNLDMLAELEKDVIYGDGTFDKAPTMFYQLYTWHAKIGNSY FT PPCIYFLLQRKNTDTYNRMFAILQQLLPNIMPQKILLDFEKACMSAAEFAF FT STSRNKRVLLSSLPECD" XX SQ Sequence 1584 BP; 551 A; 264 C; 280 G; 488 T; 1 other; cagaacatga taaaacagaa ctatgataaa acagaaaaat tctaattcaa ggaattaatt 60 agttttaatg taatactatt tgtatattat ttaataagtt gaaataaaat taaatctata 120 gcttcatttt aattcggtcg aacatttaaa gcggtaatct taaaaattta ttaatatcat 180 gaaattttca acgacacaga aaggaaatcg agttctactg taccaaggat ttgaatacac 240 caagtacaga gaaaacgcga atggagtagt tacacggaga ttcagaatct actttagtgc 300 caagtgcaag gcattcttga agatgcataa tgaaaatata attggagatg cgccacaaca 360 ttgccacgat tcaaatcctc aaaaagcaca agcgaatttt ttgaaagcga gcagaaatat 420 catgggaaat gtgttatcca aagtgaatat ggatattttg gaacatttac caaagcaatc 480 atcactttca cgaagtcttt ttcaacataa agaaaccagc catatgccaa atccaactac 540 aaccaatttt cttataccaa ataagtatgc tgacctcatt ttgcatgata ctggagctga 600 tgacatagaa agaatattgg caattgggaa tcttgatatg ctcgctgaat tagaaaagga 660 tgtcatctat ggagacggaa cttttgataa agctcctact atgttttatc agctgtatac 720 atggcacgcc aagattggta attcatatcc tccgtgcata tattttcttt tgcaaagaaa 780 aaatacggat acatacaaca gaatgtttgc tatattgcaa caattgttgc ccaatataat 840 gcctcaaaag attctgctgg acttcgaaaa agcatgtatg tccgctgctg aatttgcttt 900 ttccacaagc agaaataaaa gggtgctact ttcatctttg ccagagtgtg attagaaaaa 960 tcaatagcgt tggtttaaag acagtgtacg agtcggatat tgacctaaag ttaaagctga 1020 agtctctgcc tgctctctct tttgtgccaa taacagatgt aagaactgtt tttgatgaat 1080 tagctgccac atttccggat gaagacaatt acaacgagat tctttcatat ttctattcaa 1140 cttacattga aggagttgcc ggaagatcac ctctctttcc aattcgaatt tggaatcatt 1200 ttgatgctgc tgcagaaagg tgtcctaaga ctacaaattg ctgtgagggc ttccacaacg 1260 ctttaaattc gctgttccat tgtagtcacc caagcatttg gaacctanta gatggattga 1320 ggagggatat agcttgccag cgactaatat tagctaattt tcgaacaggt cgcccagaaa 1380 ttacaaaaaa aagtattcgg ctctttgtaa tcaagtggca agagttgttc aggactatga 1440 aaatagagaa gataagttga agtttttacg tcgaatggcc aacctacaat aattatattt 1500 atatatatta ctaaataatt aataaatcgt ttaaaaaaaa atttatgttc tgttttatca 1560 tagttctgtt ttatcatagt tctg 1584 // ID Copia-25_CQ-LTR repbase; DNA; INV; 165 BP. XX AC AAWU01017673; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-25_CQ_; KW Copia-25_CQ-I; Copia-25_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-165 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 366-366 (2011). XX DR GenBank; AAWU01017673; Positions 43263 43427. XX SQ Sequence 165 BP; 53 A; 37 C; 28 G; 47 T; 0 other; tgtaggagaa ggctttccaa tttaacgtgt caccctattt tgatttaagt tggcacccct 60 gttaacggaa gagcaactgt cactctgaaa aattctacga gcaaaacaac acgcaaaata 120 tatttgctct cacaccgtaa agtatcgcgt tttattacga ctaca 165 // ID Gypsy-42_CQ-LTR repbase; DNA; INV; 183 BP. XX AC AAWU01014973; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_CQ_; KW Gypsy-42_CQ-I; Gypsy-42_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-183 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 464-464 (2011). XX DR GenBank; AAWU01014973; Positions 38420 38602. XX SQ Sequence 183 BP; 54 A; 51 C; 32 G; 46 T; 0 other; tgttgcatac gggccttggg gcaaacaact gtactaaaaa atgtaacact aatacactca 60 caaagaaatg aataaaacat caacgcgacg gttgtcgcgc tctctttcgc tgttcaccag 120 tgcaacatac aagacgtgcg taattcttct tactcgactt gctgtccgat ctctccccac 180 cca 183 // ID CR1-54_BF repbase; DNA; INV; 2209 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-54_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-54_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2209 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2209 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1625-1625 (2009). XX DR [2] (Consensus) XX SQ Sequence 2209 BP; 590 A; 472 C; 385 G; 762 T; 0 other; aatgctaccc ttcctgattt taattctctc acagatgcga gactgtcttg cgttttcact 60 acacccgaag aagtcgaact ttacattaga agcctggaca cctctaaggc taatggatac 120 gataatattg acaattattt tcttaaactt attacccctc ttatatctga caaggtagct 180 tatgtattca atctatctct gtcatacgga atatttccca tgatctggaa acgggcaaac 240 gttgttccca tttttaagaa aggcgatcct catgataaat gtaattatcg tcctgtgtca 300 ctgcttccta acttgtcaaa ggtcctcgaa aaaattgtat acaaacatat ttataaccac 360 cttttttcaa acgaccttat ctattctctc caatctggtt ttgtacatgg tgactcaacc 420 gtgtgtcagc tagtttatat ttgtcatact atccttaagg cttttgaaga agggaaagaa 480 gttcgctccg tttttcttga tttttcgagg gcctttgaca aggtatggca ttctggtttg 540 atatttaaac ttcagcaata tggaatcgat ggtcccttgt taaattggat ttacagttat 600 ttatgtgaga gatctcaaag agtcgttata gacggacagt cttctccatg gtccaagatt 660 ggcgcagggg tccctcaagg ttccgtgtta ggccccttac tgtttattat ctatataaat 720 gatattgtaa ataacttaag ttctttaccg tttttattcg cggacgacag ttctctactt 780 gaagttgttg aagacccaac tacatctgca tatcggctca actctgattt atctacaatt 840 ctgtcctggt cccataaatg gttgatggaa ttaaatccat taaaaactga agaaatggta 900 ttttcggcta aaagatcccc tgtcaaccac ccccctcttt atcttggaac gtctgaaatt 960 aaacgagttt tattccataa acatattggg gttgttttga cttctaactt gtcttggcac 1020 aaccacatca ttcagatgct cgctaaaatt tctaaacaag tcaatgtatt tcgtggtcta 1080 aaattcaaat taactcgtaa agttttagaa accatttata aatcatttat tcgcccttgt 1140 ctcgaatatg cggatgctgt ttgggatggc tgtactgccg aagactccaa tcttattgaa 1200 cgaattcagt atgtttgctc ccttatcgta tcaggagcag ttaaaggctc ctcatacatt 1260 tctgtttgcc aagaactagg ttgggagtct ctctctagca gacgccatat tcaccgttta 1320 tctttatttt acaaaatagt ccacggccaa acacggaggt accttataga tctgatcccc 1380 cctgaaattt cccaaactac ttcctacaac ttaagaaata ggtcaaattt tcgactttcg 1440 atacaatcta ctgatagatt tggcaaatcg tttgttcctc actgtttgaa tctctggaac 1500 gacttggacc ttgttattcg ttcctcacgt tactcgctat ttcggaaata cctaattaaa 1560 tcagtacgtc ccattcgccc cccacactac gattgtggcc ctcgttacac ctgcgccctt 1620 cttgtccgac ttcgcatcgg cacctgtgct ctcaatcaaa gtttgtttat ccgaggactg 1680 tctcccagtg cctcctgcag atgcgggttt cgatgcgaat ccgtattaca ttttatgctt 1740 tattgcccat tatatatcca tcagagatcg gaattcttcg gcaatcttgc agatttactt 1800 ggtcatcgtg ttaaccttaa taatatgtct gacaatgtca aattacattt tattctcaga 1860 ggttccgcat tgctaccttc tgtttacaat tttaaaacga tgctactcac gcaaacttac 1920 attgaggaca ctaagcgctt ctagatctgc atgactgcag tcaagttgat atagaagtcc 1980 cagctctaat tttgttattt ctcttttttc atatgtaccc aattaggcca attgatacta 2040 ttatatgtct tggggtgggg ggatatggcg ccctgatatg ttgacctaga ttacttaatg 2100 ttattaattt atttgtattt tgttatgtgg cgctgttaaa ataagttgtt aaacttgagt 2160 gcagcgccac attgtccttg ttctgtgttt ttgaataaag aaaaaaaaa 2209 // ID Transib4_AA repbase; DNA; INV; 928 BP. XX AC . XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 29-JUL-2005 (Rel. 10.05, Last updated, Version 2) XX DE Transib4_AA is a DNA transposon, a partial consensus sequence. XX KW Transib; DNA transposon; Transposable Element; KW Interspersed repeat; DDE-class; TRANSIB superfamily; KW Transib4_AAp transposase; Transib4_AA. XX NM Transib4_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-928 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC Transib4_AA belongs to the Transib superfamily of DNA CC transposons. CC The consensus sequence is not complete; termini are not known. CC Transib4_AA encodes remnants of the Transib4_AAp transposase. CC The transposase is not perfectly recovered due to available CC sequence CC data. XX FH Key Location/Qualifiers FT CDS 71..862 FT /product="Transib4_AAp" FT /note="transposase" FT /translation="LLNVVLTVKWGFDGTSGVSEHKMKFEGDDGTFSDKSM FT FVTSLVPLKATANGKVLFVNKRPSSTRYCRPIRIQYIKETVAVTVQEKEHV FT IQQINELTPFNIPMHGRVIRIHCRMHLTMIDGKAINAITSTKSTQTCYVCK FT ATPKDMNNLDRIYSKVIDASTFDFGLSPLHAWINFFEFFLHLAYKLPVKKW FT RITNEYVNIVEERKKQIQRKFKIEMGLIVDQPCAGGAGTSNDGNTARRFFF FT KPGNVGQNNRAKPGTDQEMCSNT" XX SQ Sequence 928 BP; 317 A; 170 C; 204 G; 237 T; 0 other; cctggaacac acagcacaaa ggataataga agccaatgca aatttgttgt gtaattttac 60 ggatgcagaa ctattgaacg ttgtgctgac ggtaaaatgg gggttcgacg gaacatctgg 120 ggtgagcgaa cacaaaatga aatttgaagg agatgacggt actttttcgg acaaaagtat 180 gtttgtcact tcactcgtgc ccctgaaagc tactgcaaat ggaaaagttc ttttcgtaaa 240 taagcgacca tcatcaacca ggtactgtag acccatcaga attcagtaca tcaaagagac 300 agtggcggtc acagtacaag agaaggagca tgtcatacaa cagatcaatg agttaactcc 360 gttcaatatc cctatgcatg gaagagtgat cagaatacat tgtaggatgc atttgacaat 420 gattgatggt aaagctatca atgcgataac gtctactaag tccacacaaa cctgctatgt 480 ttgcaaggct actccaaaag atatgaataa tttggatcgg atctacagca aagtgataga 540 tgccagcaca ttcgactttg gactgtcacc tcttcacgca tggatcaatt tttttgagtt 600 tttcctccat ctagcgtaca aattaccggt gaaaaaatgg aggattacaa atgagtatgt 660 gaatatcgtt gaagagcgta aaaagcaaat tcaacgaaaa ttcaaaatag aaatgggact 720 cattgtggac caaccttgtg caggaggagc aggtacgtca aatgatggca atactgctcg 780 tcgttttttt tttaaacccg gaaatgtcgg ccaaaataac cgagctaaac ctggaactga 840 tcaagagatg tgcagtaata cttgaaacta tggcatcagg aatggaaatt aacatcgaaa 900 aatttgattt ctgtcgaaaa cagctgag 928 // ID Copia20-NVi_I repbase; DNA; INV; 3939 BP. XX AC NW_001818714; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 20-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia20-NV; KW Copia20-NVi_LTR; internal portion; Copia20-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3939 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1206-1206 (2007). XX DR Genome; NW_001818714; Positions 7383 11321. XX CC 'CTTTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1190..2956 FT /product="Copia20-NVi_I_1p" FT /translation="MFNREYLFINSEGEGTEYITLNHPAPYGRRLGMVVNS FT IEAMKAAMNNYSEVLEKQANAIISLQKSMVESNKVTMEMMNLIKELQNQKV FT TNPLANSTIRDINQSTNNTSLRNTNTSNTINIKRNYMLTRKMPLNEWLDYL FT KSDLKTNDLLFILEDNESLISLYDKSELEKKTSTVRDIIINHVDRYYHKRI FT LEIDDPKLILNKIRDFRRAETNVTDSAMRSKLYSLKMSKKESVNDFCDRFE FT TIVNEFEFCNTDAPLLEAEKRSAFFQAACNAYAELGQANITRLQSTGTEMS FT LDNMITYLLQTEAQNKTSDGKPSDAKANLVTQIFSQTTKCFRCNKVGHKAA FT DCPLAEYSLWYCFYCQAVANHKGDTCPNKDNPKDGYAKSLVYSNNTQNTHT FT HNNFRGIRGRGRGGYNNVRGKFRGNNKRGNFKKQVTIQTPGRKQKQAEINK FT ARLAGKYNKSNIRDDSSKLVFIADSGATEHIINKSLFLRNFKKSSGKIIRS FT ANKNKSANIIIDGKGDLYLKPTSDENNKILLTNVIAAQDISDNLISLRRFA FT DAGLSIYLDDKILKIFDKKSGSEYLTGIYEKPNWIISFEVEN" XX SQ Sequence 3939 BP; 1510 A; 669 C; 745 G; 1015 T; 0 other; ggttgtgggt cagcatgcat gacggtattt ttttcgattc agaggaactg ttcaactatt 60 caccacgatt tcagaggtac cgtttttttt cttttccaaa caaagggaaa tagaaattta 120 gaagtgaaaa gctaatcaga tgaacaaata attaaatctc tcatttgctt agtctcggga 180 gtgctttttg tttaaacgtg ccgtattcga gttgtgaaaa ataggataga cagtgcgatg 240 tctaattagc tcagaataga gaaaaacatt agaaagagag attttgtagt cagagaaact 300 aagttttgtt tccattatcc ttcagagaga gatatagaga gagaacaaag cacaggagtg 360 ctatagcgcg cgtagaaacc gcgcgaacac aaaccggcgc gagctcgcgc gggccaattc 420 tcggaatcgg agacaagcgt gcagcgacgc gcggcgtgga gagtcatgcg cccggcttgt 480 ttacgtttcg acctctcggt ttttctctca gaaacatgag gcaaacgttt gtttcctgaa 540 tatttttttc ggatgtttat ggaataatac cgtatgttta aatatttgca accagaacat 600 ttttcctctc gtgatcgttg ctttctatca gagtattttt tttctcttgg acttaaaaat 660 taaaaatcaa tcagacgaga attttaaagt aaaaagaaac acgaattaaa tagaatcctc 720 gataatacgg cattgcagta aagtcgagtc tttctataaa gaataaaact acaaatttag 780 caaactgctg tttgcagttt caccgcaaca gcatgtgcgt gtgcgatcgc ccctcacaca 840 aaattgagaa aattgcgttc gtttgctcag gttgggaggg agcacgtgta tgttgttcgg 900 cggaacaaaa catttttctc aagctagtct aacttgcaat gcggtcagac taagcaaaaa 960 ttaaaactaa aagaaacatt caacattaca gggtgaaata aaactcggaa atatcagaag 1020 tgtcaggatg tccgacaatt tagacgagga aattctacca aacatataag gtaaaagtgt 1080 ttctattaga atataaacaa ataaatcaga gacagacgat aggcctataa aaacacttat 1140 ttttatataa attaataagt aaattcagaa gtaagaaatc ggaagttgaa tgtttaatcg 1200 agaatatttg tttataaatt cagagggaga gggaacagag tatattactc taaatcaccc 1260 tgctccgtat ggcagaagat tggggatggt ggtaaacagc atcgaagcca tgaaggcagc 1320 catgaacaac tattcagagg tcctagaaaa gcaagccaac gccatcataa gtcttcagaa 1380 gtccatggta gaaagcaaca aagtaactat ggaaatgatg aatcttatca aagaattaca 1440 aaatcagaag gtaacaaatc ccctcgcaaa ctctaccatc agagatataa atcagagtac 1500 taacaacact tccttaagaa acacaaacac atcaaacaca attaacatca aacgtaacta 1560 catgcttaca cgaaaaatgc ctctgaacga gtggttggat tatttaaaat cagatctaaa 1620 aacaaatgac ttactattta ttttggaaga taatgaaagc ttaatcagtc tatacgacaa 1680 atcagaatta gaaaagaaaa cctccacagt cagagacatt atcataaatc atgtagatag 1740 atactatcac aagagaatat tagagattga cgatccaaaa ctgatattga ataaaatcag 1800 agatttcaga agggcagaaa caaacgtaac ggattctgcc atgagaagta aattatatag 1860 tttgaaaatg tcaaagaaag aatccgttaa tgatttttgt gatcgtttcg aaactattgt 1920 taatgaattt gaattttgca atactgatgc ccctctatta gaagcagaga aaagatctgc 1980 tttttttcaa gcagcttgta acgcatatgc tgaacttggt caagctaaca taacgagact 2040 tcagtcgact ggaactgaaa tgagcttgga taatatgata acgtatttgt tgcagacaga 2100 agcacaaaat aagacttcag acggaaaacc ttcagatgct aaagcaaact tagtgactca 2160 aatcttcagt cagacgacta aatgcttcag atgcaacaaa gtaggacaca aggcagcgga 2220 ttgcccctta gcagaatata gtctatggta ctgcttttac tgccaggccg tagcaaacca 2280 caagggagac acttgtccaa acaaggataa tcctaaagac gggtatgcaa aatcattagt 2340 ttactctaac aatacacaaa acacacatac acacaataac ttcagaggta ttaggggtag 2400 aggtagaggc ggatataaca atgttagagg taaattcaga ggtaataaca aacggggaaa 2460 tttcaaaaag caagttacga tccaaactcc tggccgaaaa cagaaacaag cagaaatcaa 2520 caaagcaagg ctagcaggta aatacaataa atctaatatc agagatgatt catcaaaatt 2580 agtttttatt gcggattccg gggcgacgga acacattata aataaaagcc tatttctaag 2640 aaatttcaaa aagagttcag gaaagataat cagaagtgcg aacaagaata aatcagccaa 2700 tattattata gatggaaaag gcgatctata cttaaaacct acatcagacg aaaataacaa 2760 aattttatta acaaacgtaa tagctgcaca agatatttca gacaatttaa tatctcttag 2820 acgctttgca gacgcaggtt tgagcatata tttagatgac aaaatactta aaattttcga 2880 taaaaagtca ggatcagagt acctgacagg aatttacgaa aaaccaaact ggattatttc 2940 atttgaagta gaaaattaga gagtatcaga cgatcacaac attaaataca atacatactc 3000 ctgtatggca aatatagttt ccgttgatga atttttacaa aaaatcaatg aagaaaatca 3060 aataagaaca caagctccct atagagaagc aataggtagt ttactctatc tagctggagc 3120 aaccagacct gacatctcat tttcagtaaa ttatctgtca cggagacaac tgaaccccac 3180 tgaaaatgac tgaaaaaatg taaaatgtgt tttcagatat ctcagagata cctcagatgt 3240 tggactgacg ttcaaagcag aaaaggaaga attagaagct atgtcagacg ccagcttcag 3300 agattggtac gattcatctt ccactggagg atatgtaata ttactatctg gtgatccaat 3360 tatgtggcga agttacaagc aaacacatat atcattgtca acttgccagt cagagtattt 3420 agcgatgagt aacagttgcc aaggcttaat ttcactagac aaagcaatca gaagcatcat 3480 tgggaaaaca atgtacccta ttaccatttg gtgcgacaac aattgagcag gagattgcac 3540 ccagatggaa ggaaatcaca aattgaaaaa ctttgatgac gatcttgaaa caattcagaa 3600 gaatttagag gaaagagaaa aatcaggcag caaagcttac atggcagaca cacatggaga 3660 ttatattaaa aaatgtgtat cagaaggaaa attcagagta aaatagattt ctacctcaga 3720 aaatatagca gacattatga caaaacctct atctatagaa aaacataaat acttcagaga 3780 caaaattctc aatttaaaat aagcacagta ttcataatat aattttgaat cagatgtaaa 3840 agagtttctt ttctttcaga tatgtcaaac caagacgaca atgattcaga gtgaataaat 3900 aaaaaaaaaa aaaaagaaga atcactcgca ggaagggag 3939 // ID hAT-16_SM repbase; DNA; INV; 2387 BP. XX AC . XX DT 06-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA-transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-16_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2387 RA Jurka J., Bao W. and Tempel S.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 66-66 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 313..2184 FT /product="hAT-16_SM_1p" FT /translation="MDKFLRKECSTLSTNDDEPSTSNDVKRPKIVKRQYNA FT EYIQYGFSWCGEEAAPKPECVICREQLSNEAMVPSKLIRHLKSKHPNCANK FT DRQYFQRLLNQNLKQRQFMRSSVTISDKALEASYHVAKLIARQKKPHTIGE FT TLIKPACMEIIRLMFGPNEINEVSKVSLSADTVKRRIDDMSSDILKTLITK FT LKMTENFSLQIDESTDIKNQAQLICIVRFVDEESIKEHYLFCKELPERTTG FT EEIFRVTDEFFKTYGIQWINCMSVCTDGAAAMMGYKKGFVSCVKRQNPAIQ FT ITHCCIHREALMVKNLPSELLATMNECISIINLIKSKALNSRIFGILCAEM FT GSEYQSLLLHTEVRWLSRGKVLARLFELREEVSNFLLNQNVPELHKLLQDN FT HWMTKLAYMADIFEHLNELNKKMQGRNENLLTCSDKLNGFKQKLDLWQSEL FT QQGSLEMYQRTNQTIGRIAQAGKNKQIILRLAEQHLTLLQQKFNQYFHIIN FT TDQYDWIRNPFSANAENSTVALSLQIRDEFFDLRNDGTLKIKFSDVPLDTF FT WIAVKEEYPRISEKAIEVLLPFSTTYICEESFSTLVLIKNDKRSCLKGLDQ FT ELRVALSNIEPNIKLLCSLKQAQVSH" XX SQ Sequence 2387 BP; 833 A; 400 C; 453 G; 701 T; 0 other; cagagctccg caaccggtgt gccgcgaggt gatgctaggt gtgccgcggc gcaatgccta 60 cccgagaaaa aatatagggt ttatgacgga cacgtgcgtt atgcttatat ctatcttgtt 120 tacaaataat tgcgttgttg acaaaataga agataaaaat taatgtaagt attcaaataa 180 ttatttttct aaataacttg actagcgctc aattctaaat attgaatttg aataattttt 240 aatatataaa tattacctgg gcgtgtaata ataaaattat tttattcata tttattttat 300 ataattttaa atatggacaa gtttttgcgt aaagaatgtt caactttatc gacaaatgat 360 gatgaaccca gcacgagcaa tgatgtaaaa agaccgaaga tagtgaaaag gcaatacaat 420 gcagaatata ttcaatacgg attttcatgg tgtggtgaag aagcggctcc gaaaccagaa 480 tgcgtaattt gcagagaaca gctttccaac gaggcaatgg tgccaagtaa actgattcgc 540 catctaaaat caaaacatcc caattgcgcc aacaaggaca gacaatattt tcagcggctt 600 ttaaatcaaa atttaaaaca gaggcaattt atgagatcgt ccgttacaat ttctgacaaa 660 gcgttagaag ctagttacca cgttgcaaaa ttaattgcac gtcagaaaaa accacataca 720 attggcgaaa cccttattaa accagcatgc atggaaatca ttcgattaat gtttgggcct 780 aatgaaatta atgaagtcag taaagtttcg ttatcagctg atactgttaa aagacgtatt 840 gatgatatgt caagcgatat tttaaaaaca ttaattacaa aattgaaaat gacggagaat 900 ttttctcttc agattgacga atcaaccgat attaaaaacc aagcacaact aatatgtatt 960 gtacgtttcg ttgacgagga gtccataaaa gaacattatt tgttttgcaa agaacttcca 1020 gagcggacta ccggtgagga aatttttcga gtaactgatg agttcttcaa aacctatggt 1080 attcaatgga ttaattgcat gagtgtttgc acggatggcg ctgcggcaat gatgggttat 1140 aaaaaggggt ttgtctcatg tgtgaagcga caaaatccgg caatacaaat tacacattgc 1200 tgcatacatc gagaagcttt gatggttaaa aatttacctt cggaactatt ggcaacaatg 1260 aatgaatgca tcagcattat aaatttaatt aaatcaaaag ccttgaattc gcgaattttt 1320 gggatactct gcgcagaaat gggatcggaa tatcaatcct tattacttca cacagaagta 1380 cgctggttgt cccgaggtaa agttctagca cgactatttg agcttcgcga agaggttagt 1440 aattttctat taaatcaaaa tgtgcccgaa ctacacaagc tcctacaaga taatcactgg 1500 atgaccaaac ttgcctacat ggctgacata tttgaacacc tgaatgaact taataaaaaa 1560 atgcaagggc gaaatgaaaa tttgttgaca tgctccgaca aattgaacgg gttcaaacag 1620 aaacttgatc tttggcaatc ggagttacaa caaggatcac tagaaatgta ccaaagaacc 1680 aaccagacga ttggaaggat tgcacaagca ggaaaaaaca agcagatcat attgcgtttg 1740 gccgaacaac atttaacatt gctacaacag aaatttaacc aatactttca cataattaat 1800 acagaccaat acgattggat tcgaaatcct ttttcagcga atgccgaaaa ttctacagta 1860 gctctttccc ttcaaattcg agacgaattt tttgacttga ggaacgatgg gacgctaaaa 1920 ataaaatttt cggacgttcc attggatacg ttttggattg cggttaagga agaatatcct 1980 cgaatttccg aaaaggctat tgaagttctg ctcccatttt caaccacgta catatgtgaa 2040 gaaagttttt caactctagt tttaataaaa aatgataagc ggtcgtgctt gaagggcctt 2100 gaccaagaac ttcgagttgc tctttcaaat attgagccca atataaaact attatgttcc 2160 ttaaaacaag cacaggtatc tcattagtga ttgttcatat tagttatagt tataaataaa 2220 aaatagttgt taacggcgat aataaattta tcaaataaaa aaaaatacat tttctgtttg 2280 tgcatttctt atcacgtaaa tttttaaaaa tttttgtttg tggtgtgccg cgaaatattt 2340 ttaagtcttt aggtgtgccg cgggtgagaa aaggttgcgg agctctg 2387 // ID BEL1-NVi_LTR repbase; DNA; INV; 480 BP. XX AC AAZX01006515; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1-NV; KW BEL1-NVi_I; BEL1-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-480 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1164-1164 (2007). XX DR Genome; AAZX01006515; Positions 15740 15261. XX SQ Sequence 480 BP; 142 A; 90 C; 121 G; 127 T; 0 other; tgttacggtc tcgagcattg ccaaaaagct gcttgttcgg ttactgggtt tagcctaaaa 60 taaaaagaaa aatgaatcat atgcagtcac gttgttttgg aatgcgaatt tggctgcgtg 120 cacgacaatc atggattgtt cggggccgct tttccccttc ctcgatcaaa gatcggaaat 180 agaagggcac gtggcgagga aacaaaaggt tttctcgcgg acggaaagcc gcgattcaat 240 tgttaaggga gcgaactacg gttgccgggc aagcggatgt agaaaaacaa aattgtttag 300 aaaaaaaaga cagggttgct ggagagcggg agatggtgcg agtcagtcga gtgccaagct 360 tgagcgaaaa agcacttact tgtataacat tcattataat caatcttcct caataaagtt 420 ttatttcctt atcagtctcg attaataatc tcgtgcgcat tgctcgattt gcgagctaca 480 // ID Gypsy-35_AA-I repbase; DNA; INV; 5274 BP. XX AC supercont1.299; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_AA_; KW Gypsy-35_AA-LTR; Gypsy-35_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.299; Positions 88177 82904. XX CC Positions [4250-4717] - Integrase core CC 'CTTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 991..2916 FT /product="Gypsy-35_AA-I_1p" FT /translation="MDSSRAIPPFRCNEIEKTRLHKEWQIWKRSLEYYFEA FT EDITDQKRKRAKLLHLGGPQLQAVFQSLPNSEKFAVVAVEKAYYDVAIQAF FT DEFFQPGRQDVLERHNLRRMKQLEGERFAEFIMRIRQQIEECGVDKYEPEV FT RKILTDIMLTDTIVEGCLSEELRKQILQKDRSVEEIEDIGKSLEGVQKQMK FT DFVSSREGWKQQERVYGIQARRRYVSTRPKVPGDSRNKSKNPELQCFKCGF FT YGHISTDTRCPARGKECKRCKKTGHFEARCRMQARGALPWKEETVEPKKIR FT LVERAADQSVQESNAVEADKPSASTPPKTYYAFYAGNESNTVTCEIGGVEH FT PMLVDSGAEANLITDVAWEKLKTAGIEVTSCVKGSDRVLRSYANNIPLTVL FT GTFKATVRIGARSAEAEFFVIRGGQRSLLGDTTSKRLGILKVGLEIDQVVQ FT RMEPLSKINGVSVHIHLDPDVKPIFQPMRRIPVALEEAVNNKLKELMARDI FT IEEKKGPVSWVSPLVVVGKANGEPRICLDLRRLNEAVMRERHPMPVIDDFL FT ARIGPNMIRSKLDVKDSFLQLELNEESRDAMVFLTARGLYRFKRMPFGLVS FT APEVFQKTMDRILAGCEGTWWYIDDVYIEGKDKEEHDERVAKVG" FT CDS 3446..5239 FT /product="Gypsy-35_AA-I_2p" FT /translation="MADASPIGLGGLLIQVDGSGKHRVISYASKSLTDTEK FT RYCHTEKEALALVWCVEKFYIYLYGIEFEILTDCKALIFLFTPRSRPCARI FT ERWVLRLQGFDYSITHIPGDQNQADVLSRLSTLKPVPFDQREEIMIREIMD FT FAAHAVAVPWEKMVEATKSDSELQMVLTLMDQDRKHELPTEFRVFGNELCR FT VGDILLRGDRIVVPKELRPRILKIAHEGHLGTTMMKSTLRLSVWWPKLDKE FT VENFVKNCRGCILVSAPDAPEPMTRKQMPAGPWQEIAVDFLGPLPEGQWLF FT VVVDYYSRFVEVVEMKSITAAETIRELSTIFGRFGIPTTMRADNGPQLSAE FT CRELKQFCEEYDIELVNTIPYWPQANGEVERQNRTILKRLRISQELGKDWR FT LELRKFLLAYHSTVHPTTGKSPAELMFGRRIRNKLPSIPGYKEDEEVRDRD FT RIVKEKGKEYSDNKRKARSNQIEEGDYVYAKRQRKDHKLNTDFSSERFKVV FT NRKGAEVTIKSTVSGVEYKRYVSHLKKAEDKAQEQERRSDLDNEAELLRRD FT EVCRDEGYSGESAGDEQKASNGGESTVDTGAQVSDKRTRLAPKKFQDYVAY FT " XX SQ Sequence 5274 BP; 1633 A; 930 C; 1441 G; 1270 T; 0 other; tttggcgacg aggagaagta aaaaaggtat gtaaaatatt aaagaatgaa gtggagaaca 60 acgattttat tttgcgcctc catttggctg gttgtatcaa aatggctgaa attcgttgaa 120 aagcatactc aaaagtggga gaaagcttta atgtgggggg agatgtgaac cgaaataatg 180 tgacaggtgg caaagctttc aatgacgaat gtatgctggt gtaaacagag tggaaagtgg 240 acgagaagaa ggtgaaaaga taaaaagcaa aacaagaatg atgattcgct ctcaaatgag 300 acatacaggg aacaaaaagc ttgaatgatt catattggaa ttctggtggt gatagcggag 360 cataactgtg cgctgatgat ggtatgtgta tttggcagtg aaaagaaaaa aaaaaagaaa 420 aactccagtt tgggaactgg ttgaaagctg ttgaaagtta gtctgattgg cctgattagg 480 gaatcaggtg tgtaaggtat gagctgattt gggaatcagt atgaatcagt gcaactggat 540 ggaaagcgtt gactggtggt agattctgat tttgggaaat cagtagagct gccgatttgg 600 gaatcggtat gttgtgaaaa tgattgtagg tcgaaagctc gagtcagccg gttggggaat 660 cggatgctcg aagaggatgc aaaacacaac gaaaacaaaa gttgttcacg atgagggata 720 agaaacgatg cactatagga aatgggagta taataactgg tcaaaactaa taaaaccgat 780 aataaacaaa gtgatacata agcgttttat ttattgcgta gaatctgcgt tctattttgg 840 ggttaatgtc ataaacaatt tttcagtgat tcattttttt attgcactac tactaaatat 900 aatcaacaga atattgagtt taaaaataaa aatgctcggt ctgggggagg taatatcatt 960 gttttattat cttacttctg atgtttccag atggactcat cccgtgccat accaccgttc 1020 aggtgcaacg agattgagaa gactaggttg cataaggaat ggcagatttg gaaacggtca 1080 ttggaatact attttgaagc agaagacatt accgaccaaa agcggaagcg agcaaagctt 1140 cttcacctcg gggggccaca gttgcaggca gtatttcagt cgttaccgaa cagtgaaaaa 1200 ttcgcggtgg tagctgtgga gaaggcgtac tacgatgtgg caatccaggc atttgacgag 1260 ttttttcaac cgggaagaca agatgtcttg gagcgccaca acttgaggcg aatgaaacag 1320 cttgaaggtg aacgatttgc tgaatttatt atgagaataa gacagcaaat tgaggagtgc 1380 ggcgttgaca agtacgagcc ggaagttcgt aaaatcctta ccgatatcat gttaacggac 1440 accattgtcg aaggctgtct gtctgaggag ctgcgcaagc aaatcttgca gaaagaccgt 1500 tcggtagagg aaatcgaaga catcgggaag tcgctcgaag gtgttcaaaa acagatgaag 1560 gattttgttt caagtcggga aggatggaag caacaggaac gtgtgtacgg aattcaggct 1620 agacggcgat acgtatccac gagaccgaag gtgcctgggg attcccgaaa caagtcgaaa 1680 aatcctgagt tacaatgctt caaatgtgga ttctatggac acatatcgac tgacacgcgg 1740 tgtccggcca gaggaaaaga atgcaaacga tgtaagaaga caggacattt cgaagcccga 1800 tgcaggatgc aagctagagg tgctcttccc tggaaagaag aaacggtgga acccaagaag 1860 atccgattgg tggagcgtgc tgcggatcag tcggttcaag aatcgaatgc ggttgaggct 1920 gacaaaccct cagcgagcac cccaccaaag acatattacg ctttttatgc cgggaacgag 1980 tcaaatacgg taacgtgcga aattggtgga gttgaacatc cgatgctggt ggactctggg 2040 gctgaagcaa acctgattac cgatgttgct tgggagaaac ttaaaacggc gggaattgaa 2100 gttaccagct gtgtaaaagg aagtgaccgc gtactgagga gctatgccaa caacattccc 2160 ctaacggtac tgggaacttt caaagcaaca gtacgaattg gagcgaggtc tgcggaagct 2220 gagtttttcg tgattcgggg aggtcagcgg tctctcctag gagataccac ctcaaaacga 2280 ctgggaatac tgaaggttgg tctggagatc gatcaggtgg tgcaacgtat ggaacctctt 2340 tctaaaatca acggcgtgtc cgtgcatatt catttggacc ccgatgtcaa accaatcttt 2400 caaccaatga ggagaattcc agtggcgcta gaagaagcag tcaacaacaa gcttaaggag 2460 ttgatggcga gagacataat cgaagagaag aagggcccgg tgagctgggt atctccattg 2520 gtcgtagttg ggaaggcgaa tggggaaccg aggatatgtc tcgatttgcg aaggttaaat 2580 gaagctgtca tgcgagagcg acacccaatg ccggtgatcg acgactttct ggcaagaatc 2640 ggtccgaata tgattcgcag taagttggac gtgaaggatt cttttctgca gctagagttg 2700 aacgaagaat caagagacgc gatggttttt ctgacggccc gaggattgta tagattcaag 2760 cgcatgccgt ttgggttagt atcggctcca gaggtgttcc agaaaactat ggacaggatt 2820 ctggcaggct gcgaaggtac ctggtggtat attgacgacg tgtacatcga aggcaaggac 2880 aaggaagagc atgacgagcg ggttgctaag gtagggtaat tctggacaaa gggctaatgt 2940 ttttttttta tctttttaat gaaccatttg aattggggtt gataattaaa taaactaggt 3000 tttaaggttt ctgtcgttga tgagaacttc tattgcgtat ttttgaatat attcgaaatt 3060 tattcaggtg ctagcgagac tgaaggcatg gaacgttaaa ctcaactggg aaaagtgcat 3120 attcggggtt gctgaacttg agttccttgg acataagata actaaggccg gaatagtccc 3180 atcggatgct aaagtggatg cgctagttgc ttttcggaga ccggagaact caaacgaggt 3240 gcgcagtttc ctcggcttag ccaattatct gaatcgattc ataccaaact tggcaacgat 3300 agatgccccg ttgagaaact tgactaggaa gggtactgct ttcgattggc tggtcgagca 3360 tgaaactgca ttcagcaaaa tcaaagctat tttgtcagat ccttctacac taggattctt 3420 ccgaaaagga gacagaacac tcgtgatggc cgatgccagc ccaattggat tgggaggctt 3480 actcatccaa gttgacggga gtggtaaaca tcgggtaatc agctatgctt ccaaatcctt 3540 gaccgacacg gagaagcgtt attgccatac cgaaaaggaa gcactagcat tggtttggtg 3600 tgtcgagaaa ttttacatat atttgtatgg aattgagttc gagatcttga ccgactgtaa 3660 agctttgata ttccttttta caccgcggtc acgaccatgc gctcggatcg aaaggtgggt 3720 actccggctg caaggttttg attattcgat cacgcacatt ccgggcgatc aaaatcaagc 3780 cgatgtgtta tcccgattgt ctacactcaa accagtgccc tttgatcagc gagaagaaat 3840 aatgatcaga gagataatgg actttgctgc ccatgcggtg gctgttccgt gggagaaaat 3900 ggtagaagct acaaaatcgg actctgaact acagatggtt ctaacgctaa tggatcaaga 3960 cagaaaacat gagttgccta ctgagtttcg agtgtttggt aatgagttgt gcagagtagg 4020 agacattctg ttgcggggag ataggatagt tgttcctaag gaactgcgtc caaggattct 4080 caaaattgca cacgagggac acttgggaac cactatgatg aagtcgactc tgcggttatc 4140 cgtgtggtgg cccaaattag acaaagaggt ggagaatttc gtaaagaact gccgaggttg 4200 catactggta tccgcaccgg atgctccaga acccatgacc agaaaacaga tgcctgcagg 4260 accttggcag gaaattgccg ttgactttct agggccactg ccggagggtc aatggctttt 4320 tgtggttgtt gactattaca gccggtttgt agaagtcgtc gaaatgaaaa gtatcaccgc 4380 cgctgaaaca atacgagaac tgtcgacaat atttggacgg tttggaatac ccaccacaat 4440 gagagctgac aacggaccac aattgagcgc agagtgcaga gagctcaagc agttctgcga 4500 ggaatacgac attgagttgg tgaataccat cccgtactgg cctcaagcaa acggcgaggt 4560 agagcgccag aataggacca tcctgaaacg cctacggatt tcacaggagc ttggtaagga 4620 ttggcgtttg gagttgagga aatttcttct ggcgtaccat tctacagtac acccaacaac 4680 tgggaaatcg cctgcggaat taatgttcgg gcgaagaata aggaacaagt tgccatcaat 4740 tccaggatac aaagaggatg aagaagtacg agatcgggac agaattgtca aggagaaagg 4800 gaaagagtat tcggacaaca aacggaaagc gcgtagtaat caaatcgagg agggggatta 4860 tgtgtatgct aaacggcaga gaaaagatca caaactgaac acagattttt cttcagagag 4920 gttcaaggta gttaatcgaa aaggggcaga agttacgatt aagtctacgg tatcaggtgt 4980 cgaatataag cgatatgtat cacatctaaa gaaagcagaa gataaagcgc aggagcagga 5040 gagaagatca gatctagata atgaggcaga gttattgagg agagacgaag tttgcaggga 5100 cgaaggatat tcaggagagt cagcaggtga tgagcaaaag gcttcaaacg gaggagaatc 5160 aactgtggat acaggtgctc aggtatcaga taagcgaaca cgtttggctc cgaaaaagtt 5220 ccaagattat gtagcatatt aaatctgtaa aaaaaaaata gaagtgaagt ggga 5274 // ID CR1-62_HM repbase; DNA; INV; 3867 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-62_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3867 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1889-1889 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(80..766,770..3796) FT /product="CR1-62_HM_1p" FT /translation="MPPKKEEEKKLDDLLERIKQLELKLNFKEERIKNLEE FT RIIKLEEENTSNKNNPCIIDDKAWNVVASKNSVKTPLQIDIINTVAIETAE FT RARRENNVIVFGLNHSAKSDLNERKEDDKIEVLKVIKDINLESVQIKRITR FT LRSRDMSQKSPVILELDNVHARNQVLKAAFRNRDKIPNIYFNTDMTEAERI FT LTKQLRSEVKILNGNSDMNNSSFYYGIRSNKVVKLTKKTQFEYKVANKISN FT ILMFKRTVTGIQDFKIKINDNAQKPVISTGYNSNSLKCLYTNATSFEKNKL FT NEVIAEIEISSPDIIFITETWWNDTSATIIPGYNLYRRDRMNSRGGGVCIY FT VNDLIDSFEVNNEPYNNKNIEQIWCTIKMSNECILCGCIYRPGSDDLVNFQ FT NVINSIKESYNYFSKTKIYTEVLICGDFNFPSLNSIWNSXYESVCIGEKER FT QFLECIEECYLTQHVKKPTFQLNDTKLTNILDLVFTASPFRISSLVHMDPL FT GKTKRGHHVLNFNFNYQLHNRKNESITTKRVYMKGNYHEFSKYLNKFNWSH FT EFTGLNVDQCYESFIRTIEYGCKDFIPSFNQKTTESKRSLWMNKELKHEIR FT LKRNLWYRYKASHFKNLDLKEEYKLKNKLVKKLVKITIQKYENKIATESKK FT NPKQVYAYINSKLKAKDHIRAISIENNEISTDSQVIANQLNRFFNSVFINE FT SLTNMPTCENITEKICPTPLFNEKDIKKRLLNLNVHKSPGIDRIHPMFLSK FT CADSLARPLCMIFNQSFLTGTLPTSWLKANITPIFKNGDRLNAGNYRPISL FT TSVVCKVMESIVKDTIMNHLNKNKLLIPEQHGFIKSKNCVTNLLETLDIIT FT EAINCGKCVDVAFLDFSKAFDSVPHSRLLLKLKSFGIVGKLLQWCKAFLAN FT RKQRVVLGLFESEWETVRSGVPQGSVLGPLLFIIYINDLTKKLNNKAKLYA FT DDLKIIAVLEKNKINNSLQFDLDILFNWTKRWLISFNKDKCKIMHFGRDNP FT KIKYNLKYNATETEIIQHGLIVTSLERDLGVYISDNLKWEDHIEKMVSKAN FT QKLGMLKHAILYPDETSTKLLYMSLVRPHLEYAFQVWNPYFIKDINLIEKI FT QQKATKFGYKKKKDSYEQRLEKMDLQNLSNRREKGDLIQMYKFMNKIDTIN FT WHREPEVLGRLTRGHNQRLRRQYVQNFLPRHHFFTNRIVENWNRLDSEVVN FT ANSVIDFKNKLKKFSEKQLLKSNIYSC*" XX SQ Sequence 3867 BP; 1594 A; 512 C; 616 G; 1144 T; 1 other; tttttttttt gccgttttaa agatggcaga cgtattattt taaaaaagta aacaaaaagt 60 taataaaaca acaatcacaa tgcctcctaa aaaagaagaa gaaaaaaaac tagacgattt 120 attagagaga attaaacaat tagaattaaa gctaaacttt aaagaagaaa gaattaaaaa 180 tctagaagaa agaataataa aattggaaga ggaaaacact tctaataaaa acaatccatg 240 tataatagat gataaggcgt ggaatgtcgt cgccagcaaa aatagtgtaa agactccatt 300 acaaatagat attataaata cagtagcaat tgaaacagct gagagagcta gaagagaaaa 360 caatgttatt gtttttggtt taaatcattc cgcaaaaagt gatctaaatg aacgaaaaga 420 agatgataaa atagaggtat taaaagtcat aaaagatata aatttagaaa gtgtgcaaat 480 caaaagaatt acaagattac gatcaaggga tatgtcacaa aagtccccag ttattttaga 540 acttgataac gtgcatgcaa gaaaccaagt gctaaaagca gcttttcgta acagggataa 600 aataccaaat atttacttta atactgacat gactgaagca gaaagaattt taacgaagca 660 attaagatct gaagtgaaaa ttttaaacgg aaattcagac atgaacaata gctcatttta 720 ttatggtata agaagtaata aagtagtaaa actgactaaa aaaacatagc aatttgagta 780 taaagtggca aacaaaataa gtaacattct aatgtttaaa agaactgtaa ccggtatcca 840 agattttaaa attaaaataa atgataatgc acaaaagcca gttataagta ctggatacaa 900 ctcaaatagt ttgaaatgtt tatataccaa tgctacatca tttgaaaaaa ataaattaaa 960 tgaagttata gctgaaattg aaataagctc tccagatata attttcataa cagaaacatg 1020 gtggaatgat acttcagcaa caattatacc tggttataac ctttatcgca gagaccgaat 1080 gaatagcaga ggtggtggag tctgtattta tgtaaatgac ctaattgatt catttgaagt 1140 caataatgaa ccctataata ataaaaatat tgagcagata tggtgtacca ttaaaatgag 1200 caatgaatgc atattatgtg gctgtattta tagaccagga agtgatgatt tagtaaactt 1260 tcaaaatgta atcaattcta ttaaagaaag ttacaattac ttctcaaaaa ctaaaatata 1320 tactgaggtt ttgatatgtg gtgacttcaa ttttccatcc ttgaatagta tatggaattc 1380 twgttatgaa agtgtctgca ttggagaaaa agagagacag tttctagaat gcattgaaga 1440 atgttatctc acacaacatg taaaaaaacc aacatttcaa ttaaatgaca caaaattaac 1500 aaatatatta gatttagtat ttacagcttc tccttttcgc atatccagct tggtacatat 1560 ggatcctttg gggaaaacaa aaagaggtca tcatgtttta aattttaatt ttaattatca 1620 attacataat cgcaaaaatg aaagtattac tacaaagcga gtatacatga aaggaaatta 1680 tcatgaattt tcaaaatatt taaataaatt taactggagt catgagttta ctggacttaa 1740 tgttgatcag tgttatgaaa gttttattag aactattgaa tacggctgta aagatttcat 1800 accgtcattt aatcaaaaaa caactgaaag caaaagatcc ttatggatga ataaagaatt 1860 aaaacacgaa attcgactta agcgtaacct ttggtatcga tacaaagcct ctcatttcaa 1920 aaatttagat cttaaggaag aatataaact aaaaaataag ttagttaaaa aacttgtaaa 1980 gattacaata caaaaatatg aaaataaaat tgcaacagaa tcaaaaaaaa atccaaaaca 2040 agtttatgct tatattaaca gtaaactaaa agctaaagac catataagag caatttcaat 2100 tgaaaacaat gaaatatcaa ctgacagtca agttattgct aatcagttaa atcggttttt 2160 caattctgta ttcataaacg aaagcctaac taatatgcct acatgtgaaa atataacaga 2220 aaaaatttgc ccaaccccat tgtttaatga gaaagatatt aaaaaacgat tattgaatct 2280 aaatgtgcac aagtccccag gtatcgatag aattcatcca atgtttttaa gtaagtgtgc 2340 tgacagttta gctagaccac tctgtatgat atttaaccaa tcatttctaa ctggaacatt 2400 gcctacaagc tggttaaaag caaatatcac tccaattttt aaaaatgggg ataggttgaa 2460 tgcaggtaac taccgtccaa tttcattgac ctctgttgtt tgtaaagtaa tggaaagtat 2520 tgttaaggat acaataatga accatttaaa taaaaataag ttattaatac cagaacaaca 2580 cggattcatt aaatcaaaaa attgtgtaac aaaccttctc gagacattag acataataac 2640 agaagcaata aattgtggaa aatgtgttga tgttgccttt ttagattttt ctaaggcatt 2700 tgattcagtt ccgcattcca ggctgctctt gaaacttaag tcatttggta ttgtcggaaa 2760 attgttgcaa tggtgtaaag cttttttagc aaatcgaaaa cagagagttg ttttgggact 2820 gtttgaatca gaatgggaaa cagttcgtag tggggtacca caaggatcgg tcttaggacc 2880 cttacttttt ataatttata taaatgattt aacaaaaaag ttgaataata aagcaaaact 2940 atatgcagac gatttaaaaa taattgcagt tctagaaaaa aataaaataa ataactctct 3000 acagtttgat cttgacatct tatttaactg gactaaaaga tggcttatca gttttaataa 3060 agataaatgt aaaatcatgc attttggtcg tgataatccg aaaataaagt ataatttaaa 3120 atataatgct acagaaacag agattattca acatggacta atagtaacgt cattggaaag 3180 agaccttggt gtttatattt cagacaattt aaaatgggaa gaccacattg aaaaaatggt 3240 aagtaaggcg aatcaaaaat tgggtatgtt aaagcacgca atcctctatc cagatgaaac 3300 atcaacaaaa ttgttatata tgtcactcgt tcggccacac ttagaatatg cttttcaagt 3360 gtggaaccca tattttatta aagatataaa cttaatagaa aaaattcagc aaaaagctac 3420 aaaatttggc tacaaaaaaa agaaagacag ttacgagcaa agattagaga aaatggactt 3480 acaaaatcta tcaaatagaa gggaaaaagg tgatcttatc caaatgtaca aattcatgaa 3540 caaaattgat actatcaatt ggcatcgtga accagaagta ttgggtaggc taactagagg 3600 acacaatcaa aggttgcgaa ggcagtatgt tcaaaacttt ttgccgaggc atcatttctt 3660 cacgaaccgt atagttgaaa attggaatcg tttggacagt gaagttgtta acgctaattc 3720 agtaattgat tttaaaaata aacttaaaaa gtttagcgaa aaacagttgt taaaatctaa 3780 catctacagt tgttagattc tttcttgaaa ctctcaagaa cacagcttca attattatta 3840 ttattattat tattattatt attatta 3867 // ID Gypsy-1_Cfl-I repbase; DNA; INV; 4203 BP. XX AC AEAB01004336; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_Cfl_; KW Gypsy-1_Cfl-LTR; Gypsy-1_Cfl-I. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-4203 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01004336; Positions 4533 331. XX CC Positions [3077-3607] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 834..2543 FT /product="Gypsy-1_Cfl-I_3p" FT /translation="MLVSSEVPEQRNEVPTPVSVEGGKRGESFVNAALDDD FT SRSRRIFVTDRKTKISFLVDTGADVCVYPRNRLRNTKKCEYELFAANGTPI FT ATYGTTAVDSDFSLRRSFKWRFVIADVSTSIIGVDFLSHYGLLVDPRNKRL FT IDSITSLSTIGHAASAQVVTVKTVFGKSAYHQLLSEYPELTRPPVFRGETI FT KHNVKHYIKTTPGPPVYNKPRRLAPDRFKEVKAEFDLMIEQGIIRPSKSPW FT ASPLHVVPKKDGSLRPCGDYRALNARTIPDRYSPPHIEDFAQRLHGKRIFS FT KVDLVRAYHQIPVAPEDVEKTAITTPFGLFEALNTMFGLRNAAQTCQRFVD FT EITRGLDFVYAFIDDFLIASEDETQHCEHLKVLFKRLSEYGVIINPAKCVF FT GVNEITFLGYTVNEHGIKPLAERVEAIEKFSEPATLKQLRRYLGMINFYRR FT FIPGAARILQPLNELTKGLKKGNAPITWTDRTQNAFVESKRALANATMLAH FT PVPGASISIAVDASDYAMGAALQQLVAHEWQPLGFFTKSLSPSQQKYSAYD FT RELLAMYSAVKLACSTKRALSTL" FT CDS 3011..4051 FT /product="Gypsy-1_Cfl-I_2p" FT /translation="MNKDCRNWTRQCIPCQRCKVTRHVSAPVNDFGSLAGR FT FEHVHIDLISMPYSQGYKYCLTCIDRFSRWPEVIPIIDMEAATVASAFIST FT WIARFGVPLRITSDQGRQFESDLFRELCRMLGTKHIRTTAYHPEANGMVER FT LHRQLKAAIKCHETENWTEILPIVLLGIRTAIKDDLKATAAEMIYGTGIHL FT PAEFFVPSDQEANSDFVARLRQRVNDIKPSPVTQHGTKRTFVFKELSSSPY FT VFLRHDAARGPLQPPYDGPYEVIERGDQNFVVKIKGKNVRVTIDRLKPAFV FT VSEHIDQRGHEDNIREEIIAAEPQPQPTQNENGRRFTSRAGRSIRFPERFQ FT AGFR" XX SQ Sequence 4203 BP; 1208 A; 1051 C; 997 G; 947 T; 0 other; attggacccc gacgtgatcg cttgggtaaa gaggtgaacg tgggaattag tgaaattttc 60 gcgaagacgg aagtgcgtgt gcacgtgaaa atgtcgctcc cggaaaggaa cgtaaccctt 120 gacgccagtg ccgaaaacga aacgatcaac attataagcc aaaataattt gcgcatcccg 180 caattctggc cgcaaaaagt cgctttgtgg tttcgactgt tagaagcgca gttcgcatcc 240 gcgcgtatca cgaaggacga cacgaagttc aatgtggtca tcgcgaacct cggcgagaaa 300 tatatcgagc aagtagaaga tgtagtgatg gacccgccgg cagtcgaaaa atacgaacac 360 cttaagaagg aagtcatcaa gcggttgacc gaatcagaca gttcgcgcgt gcggaagtta 420 ttggagagcg aagaaatcgg cgatcggaca ccttcgcaat ttttccgcga cctcaggaaa 480 ctcacgaccc cgtcaatacc tgatgatttc gtcataacac tgtggaagac tcgtttgccg 540 gcaagcaccc agcgggtatt ggcagcttcc gcggatactg acgttaccgc catcgtcgaa 600 atggctgacc ggatccacga aatcaaaccg gaacaagaac ggattacagc ggtagcgaag 660 gacgcgattt cgaccgagat gcaagtcgcg tttgcgcagc taacattgaa gatcgacgcg 720 atcgagaaag cgcagcggcg ttttctttcg cggacgcggg gacgcgaccg atcccgcggt 780 aaatcccgcg accggagggc cctgacgcaa gggggaagtg aattagaggg aatatgctgg 840 tatcatcaga agttccagaa caacgcaacg aagtgccgac ccccgtgtca gtggaaggcg 900 gaaaacgagg agagtcgttc gtgaacgcgg cactcgacga cgattctaga tcccgccgca 960 ttttcgtgac cgatagaaag actaagattt ctttcctcgt tgacacgggc gctgatgtgt 1020 gcgtgtatcc tcgcaatagg ttgcgaaata cgaagaaatg cgaatacgaa cttttcgcag 1080 ctaacggtac gccaatagcg acatacggaa caaccgcggt ggattcggat ttctcccttc 1140 gccgaagttt caaatggcga ttcgtgatcg cggacgtgag tacatctatt atcggcgtag 1200 acttcctgag ccattatgga ttgctcgttg acccgcgaaa caaacgttta atcgattcga 1260 tcacgagtct gtcgacgatc gggcacgcgg ctagcgctca agttgtcacg gtgaagacgg 1320 ttttcgggaa gtcggcgtat catcagcttt tgagtgaata ccctgaactc actcgaccgc 1380 ctgtgttccg aggggagacg ataaaacata acgtgaaaca ttacattaaa acaacgccag 1440 gaccgccagt ttataataaa cctcgccggc tcgcccccga tcgattcaag gaagtcaagg 1500 ccgaattcga cctcatgatc gaacaaggaa tcattcgacc ttcgaagagt ccgtgggcat 1560 ccccattgca cgtcgtaccg aagaaagacg gcagcttacg accttgtgga gactatcgtg 1620 cactcaatgc ccgcaccatt cccgaccggt attccccgcc gcacattgaa gatttcgcgc 1680 agcgcctgca tggaaaaagg attttctcca aggttgattt ggtgagagca tatcatcaga 1740 tcccggtagc gcccgaggat gttgaaaaaa cggcaatcac gaccccgttc ggtcttttcg 1800 aggcactaaa taccatgttc ggattgcgta acgctgctca aacgtgtcaa agattcgtcg 1860 acgagatcac acgcggttta gattttgtat acgcttttat cgacgatttc cttatcgcgt 1920 cggaagacga gacgcaacat tgtgaacact taaaagtcct atttaagcgt cttagcgaat 1980 acggcgtcat tataaatccc gcgaaatgcg ttttcggcgt taacgaaata actttcctcg 2040 ggtatactgt caacgaacac ggaattaaac cgctggccga gcgcgtcgaa gcaatcgaaa 2100 agttttccga acccgcgaca ttaaaacaac tccgaaggta tctaggaatg attaatttct 2160 accgacgttt cattcctgga gccgctcgaa tcttacaacc gctaaacgaa ctcacgaaag 2220 gtctgaaaaa gggcaacgcg cctataacat ggaccgatcg cacgcaaaac gctttcgtcg 2280 aatcgaaacg cgctctcgca aacgccacaa tgttggctca ccctgtaccg ggcgcgtcga 2340 ttagcatcgc tgtggatgca tctgactacg caatgggagc agctttgcaa caactcgtgg 2400 cccatgaatg gcaaccttta ggattcttta ccaaatcttt atctccttca cagcaaaaat 2460 atagcgcata cgatcgcgaa ctactcgcta tgtatagcgc agtcaaacta gcgtgctcaa 2520 ctaagcgtgc tttaagcacg ctttagaagg taggaatttt gctattttca cggatcataa 2580 accgcttatt ttcgcgttta atcagaattt agacaaatgc tcgccacgcc aatttcgata 2640 tttggattat atcggacaat tcacaacaga cgtccggtat attaaaggtt tagacaataa 2700 cgtagccgat gcattgtctc gcgtcgaaac aatctcgaaa acggttgatc acaaaactct 2760 ggaagccgcg caaaaagacg acctcgagtt aggtagaatt ttaaaatccg aaaatagtgc 2820 gttaaaactt aagaaaatac attttcccga ctgcgacagt ggcatttatt gcgacatagc 2880 gagcgacacg gcaagaccat atgtccctca acccctgcgg cgcgtcatat ttaactcgtt 2940 gcatggactt tcacatccag gtattcgagc tacgcaaaaa cttataacca aacgctttat 3000 ttggccatta atgaacaaag actgcagaaa ctggacacgg caatgtatcc cttgccagcg 3060 atgcaaagta accagacatg tgtctgcacc cgtcaatgat tttggaagtt tggccggacg 3120 attcgaacac gtgcacattg accttatttc aatgccttat tcgcaaggat ataaatattg 3180 cctcacgtgc atcgaccgtt tctcccgctg gccggaagtc atcccgatca tcgatatgga 3240 agcagcaact gtagcatcag cttttatctc cacatggatt gctcgttttg gcgtaccttt 3300 gagaatcaca tccgatcaag gacgccaatt cgaatctgat ctattcagag aattatgtag 3360 aatgctgggc actaaacata ttagaacgac agcctaccac cctgaagcaa acggcatggt 3420 agagaggctg catcgacaat tgaaagcagc tataaaatgt cacgagacag agaactggac 3480 ggagatttta cccatcgtcc ttttgggtat acgaacggcg atcaaagacg acctgaaagc 3540 gacggcagcc gaaatgatct acggcacagg aatccatcta ccggcggaat tcttcgtacc 3600 tagcgaccaa gaggcaaact cagatttcgt cgcgcgtctc agacagcgcg taaacgatat 3660 taagccaagc ccagttacac aacatggcac caagcggacc tttgttttca aagaactttc 3720 atcatcaccc tatgtattcc ttcgtcacga tgctgcaaga ggaccattgc aaccacctta 3780 cgacggacca tatgaggtaa tcgaacgagg cgaccagaat ttcgtagtca agatcaaagg 3840 aaaaaacgta agagtcacga tcgaccgact aaagcctgct ttcgtcgtat ctgaacatat 3900 agatcagaga ggccacgaag acaacatccg tgaagagatc attgcagctg aaccccaacc 3960 tcaaccgaca cagaacgaaa acggacgtcg attcacgtca cgagccggaa gatcgatccg 4020 ttttcccgag cgttttcagg ccggattcag ataagtccct ggatatatgc ccgattttgc 4080 ttaacatata agcaatggga ataatgccac cttcaccacg tacaaaatcg acgcgaaaat 4140 ttttttccga acaacagtac atcgtttgcg ttcacgagcg ttatcactgg taagggggta 4200 ctg 4203 // ID BEL-68_CQ-LTR repbase; DNA; INV; 810 BP. XX AC AAWU01019429; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-68_CQ_; KW BEL-68_CQ-I; BEL-68_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-810 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 290-290 (2011). XX DR GenBank; AAWU01019429; Positions 46687 45878. XX SQ Sequence 810 BP; 271 A; 186 C; 148 G; 205 T; 0 other; tgctaccgac gacaacaaca cctccgaggc aggcacctct cggatgaacg actcgaaagc 60 agaacccaga aacgcagcca cacccgaagc tagagaaggg atcgacgaac aagcagtgta 120 acgaaataga accaatacca ccaccaggaa taaaaggcgc tagggacaat agatgcactt 180 ttactctatt gaatctatcg tgtaaactcg cagtaggccg acgcttcact gaaaattaat 240 ttctagcttt cgtacgcatc ttatttacct agtttaagga taattgtgct tgagaaaact 300 gtttttagag cacctgcacg caggtactaa tgcgacatga atttattgcg aaattactaa 360 ttatcaacat atctctaacc aggtcgtaag cacattctat tgcccagaaa aagcgcaaca 420 aactctagcg aattctatta taattaacta ttaagtaaac tgtcgatcag gtaaagcgta 480 acttgcgatg attaactgcg ggaatctaat atctatgaat tctatagcgc accggctgac 540 acgaaccgca tgttgatcat tccggcttac gagcggacat agggcactaa acgtaagcaa 600 acaaacttat ctagataagt ttcactgaat tcctgtactg aattacctct attttccttc 660 ccaaacatgc attttaggga aaaattaata cgtcgaaata aagttgtact ccttactcgc 720 aaaattgaag tcaattcacg tagccacttc acgtacaacg gagcgtattt aatcgtctgg 780 aaatcctaag cccacagtta ccttgtaaca 810 // ID Copia-13_CQ-LTR repbase; DNA; INV; 216 BP. XX AC AAWU01014418; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_CQ_; KW Copia-13_CQ-I; Copia-13_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 342-342 (2011). XX DR GenBank; AAWU01014418; Positions 39104 39319. XX SQ Sequence 216 BP; 46 A; 59 C; 36 G; 75 T; 0 other; tgtaaacaac gactgtccta tccctcgacc gttgtcgcta ctttgacagt tgtgcttcct 60 ttcttgcttg cgagcaacaa cgccttctgc gttttgccac ttccgtttct cactctcaaa 120 gttgatctcg aactgtaaac ctcgcgctcg aatcgaaata caattaattc gtttgtaaaa 180 ctcatttctc gcgttttgtt tccgttaaac tgtgca 216 // ID Transib-7_AAe repbase; DNA; INV; 3778 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Transib DNA transposon family from Aedes DE aegypti. XX KW Transib; DNA transposon; Transposable Element; Transib-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3778 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1308-1308 (2011). XX DR [2] (Consensus) XX CC There are 3 copies >94% identical to consensus. TIRs are ~720 CC bp CC long. XX FH Key Location/Qualifiers FT CDS join(963..1646,1655..2545,2490..3056) FT /product="Transib-7_AAe_1p" FT /translation="ISERTISCRELVDLFLSGKSVGESVEVLKQKIQIDGS FT TLDIPSALRTLRLFKLKFVSNFNKCYRKKPIFLEKCSKWLDSVVGFKIASS FT KSSRHVGRPKKSFEDSSRKTKIRKIKSIRRRIPSEKLILAAASSQRAAGNR FT DLAWVIEHLVGHPEKASSVRKFLEMENLETNPRPLSINDALTFLCHNDLSK FT LQYQNIRMLSLERGANIYPTYNELREGKSLCYPEGTSTKHLLTHNIYFICT FT FIGMTICPSTAEVPLQLLTDHTIQRLFVSREDNVAPYLERTGKFSVIFKWG FT CDGSSNHSKYKMPFMEEIEGEQRWINDSHIFLMCIVPLRIVFHDDNSNKDY FT VVWNNDLPSSINLCRPLKMVFQKENAALTKTEVANVNYQISQLTPTVLLLQ FT NRQVAVESVLCMTMIDGKVLNDLTNTSSQACHLCGASGKALSNPPVVPITD FT EPTDFEIMPILHAYMRSMELLLNISYRIPLKKWRISKGNEVLKERQQTIRG FT KSRYGMFVELTLHILLLFRKIQANGSTDKHYIYYYYLGKFKLMGLRISEPL FT PSGGNSNDGNTARKFFENPSKTAEITGLDEDIIRMFSNILRALFSKISVNA FT TAYHAYAEQTRLRLTRLYPFYPLSPSVHRMLVHGASIIKHCILPIGMMSEE FT VQERGNKSIRYFREHHSRKCGRQANIEDVFKRLMLTSDPVISLNGKLXKSV FT PNEYPEEIRNLLIL" XX SQ Sequence 3778 BP; 1255 A; 676 C; 715 G; 1125 T; 7 other; cactatgggc cacggacgca atctggcggg acaaaattaa taactcgtca acaaagcatt 60 tccggcattt ggtgtcttcg gcaaagtttt ttgtaacaac gaggaccatc tactgacctt 120 aacggatagt tcgaaaatcg tccgcagagg tggcgccaca atctaacttt ttactttgac 180 gtcctagagt cttggcgtgt tcgacaaagt tgttcatttt gataaaataa acaactctct 240 cgaagacgtc aaaattccac agcctactgt ttttgagtta ttggcaaaat aagaccaaat 300 taacaaaaaa acgttttttt caccaaatct gacatttttc ttaagaatta tgtcaaatta 360 tattatttkc taatgtaaac ataaaaaaca caaactatag tatataattt tttacagtat 420 gacgcataam aattaaaaaa tagcgaaaaa acctgaaaaa twgtgcagtt tttaaacatt 480 aatatmtcaa aaagcgcaaa aaatcgcaag ctcaaatttt caggcaacat agaacaatgc 540 tagatgaaac ggatgtcaaa atttgagaaa ggtatatttt ggtgtttttg agatattttc 600 atttttttac aatgattatt aacgaattat ggaacagact tttatgaaaa attattcaaa 660 aaggccatcg tagtagccat gaaattcagt ttaagtgcgg ttccttgagt caatatcaag 720 taatggagca tcaaaacatt gaggtgtaac tgtacacagt tctgttgagt gtaaattgaa 780 aattgaacat ttatgtttta ctctaatgat acaaaatata tagtgtgcca tgaacatttt 840 gatcaagtgt cagttaatta ttgtcataat ggctcagggt gagtaattta atttatgaat 900 ttgttatttc tactacaaaa gatcaaatta ttaagcatat ccatgtcgat aactcaactt 960 aaatttcaga gcgcaccatc tcttgccggg agcttgttga tttgtttttg agcggaaaga 1020 gcgttggtga atccgtggaa gttttgaaac agaagattca aatcgatggc agcacattgg 1080 acattccttc ggctttgaga acattaaggt tgttcaagct gaaatttgtg tctaacttca 1140 acaaatgcta tagaaaaaag cctatatttc tcgaaaaatg ttcaaaatgg ctcgattccg 1200 tcgtcggctt caaaattgca tcaagtaaat catcgcgaca cgtaggacga ccaaagaaaa 1260 gcttcgaaga tagtagcaga aagacgaaga ttagaaaaat taaaagtatt cggcggcgta 1320 taccatcgga gaagctaatt ctagcagcag cgagctctca acgagcagcc ggaaacagag 1380 atttggcgtg ggtaattgag catctcgttg gtcatccaga aaaagcatca tcagtgcgca 1440 agtttctcga aatggaaaat ttagagacta acccacgccc tttgtcgatc aatgatgcac 1500 taacatttct ctgccataac gatctatcaa aacttcaata tcaaaatatc aggatgctat 1560 cactagaacg aggtgctaat atctacccga cttacaacga gctccgtgaa ggaaaatcat 1620 tgtgctatcc ggaaggtacg tcaacataaa atgaaaacat ttgttgactc ataatattta 1680 ttttatatgt acatttatag gtatgactat ttgtccatct actgctgaag ttcccttaca 1740 attactaaca gaccacacca tacaacggtt atttgtttca cgggaagata atgtagcacc 1800 atacctcgag cggactggga agttttccgt aattttcaag tggggttgtg atgggtctag 1860 caatcactcg aaatataaaa tgccctttat ggaagaaata gaaggggaac agcggtggat 1920 caatgatagc catatatttc tcatgtgcat tgtaccactt agaattgtgt tccacgacga 1980 taattctaat aaggattatg ttgtatggaa caatgactta ccatcctcca tcaacttgtg 2040 tcggcctttg aaaatggtgt tccaaaagga gaatgcagct ttgacaaaaa ctgaggtagc 2100 aaacgttaat tatcaaattt cacagctaac tccgacagtt cttctgttgc aaaatcgtca 2160 agtagctgtg gaatctgtgt tatgcatgac catgattgat ggcaaagtat tgaacgatct 2220 gacaaatacc agctctcagg catgtcatct gtgcggcgct tcagggaaag cactaagcaa 2280 ccctcccgta gtgccaatta cagacgaacc cactgatttc gaaattatgc ccatactcca 2340 cgcgtatatg agaagcatgg aattgctcct caacatctca tatcgaattc cattgaaaaa 2400 atggcgaata agcaagggaa acgaagtgct aaaggaaagg cagcaaacga tccgaggtaa 2460 atcaagatat ggtatgtttg ttgaattgac actacatata ttattattat ttaggaaaat 2520 tcaagctaat gggtctacgg ataagtgagc ctttgccatc gggtggaaat tcaaacgatg 2580 ggaacaccgc acgaaaattt tttgaaaatc cttcaaaaac ggctgagatt acagggctgg 2640 atgaggacat catacgaatg ttttcgaaca ttctaagggc tctgttttcg aaaatatcag 2700 ttaacgctac tgcatatcac gcgtatgcag agcaaacgag attgcggttg acacgactgt 2760 atccgtttta tccgctatca ccgtctgttc acaggatgtt agtacacgga gcatctataa 2820 taaagcattg cattttgcca atagggatga tgtcagaaga agttcaggag cgtggcaata 2880 agtccatccg gtattttagg gagcatcact cgcgaaaatg tggtcgacag gcaaacatcg 2940 aggatgtttt caaacgatta atgcttacat cggatccagt gatttcctta aatggaaagc 3000 ttkcaaaatc tgtaccaaac gaatatccag aggaaatccg aaaccttctt atactgtaga 3060 tataaataat aatttcatga actaataaaa atttagtttt ccactcaatt taaattatac 3120 ttaaataatt gtcacaaaaa tcaaattcgc gcaatgaaag aaaaccctcc gaaaaatgga 3180 attatctcaa aaacaccaaa atataccttt ctcaaatttt gacatccgtt tcatctagca 3240 ttgttctatg ttgcctgaaa atttgagctt gcgatttttt gcgctttttg agatattaat 3300 gtttaaaaac tgcacwattt ttcaggtttt ttcgctattt tttaattttt atgcgtcata 3360 ctgtaaaaaa wtatatacta tagtttgtgt tttttatgtt tacattagca aataatataa 3420 tttgacataa ttcttaagaa aaatgtcaga tttggtgaaa aaaaacgttt ttttgttaat 3480 ttggtcttat tttgccaata actcaaaaac agtaggctgt ggaattttga cgtcttcgag 3540 agagttgttt attttatcaa aatgaacaac tttgtcgaac acgccaagac tctaggacgt 3600 caaagtaaaa agttagattg tggcgccacc tctgcggacg attttcgaac tatccgttaa 3660 ggtcagtaga tggtcctcgt tgttacaaaa aactttgccg aagacaccaa atgccggaaa 3720 tgctttgttg acgagttatt aattttgtcc cgccagattg cgtccgtggc ccatagtg 3778 // ID DNA-4-1_HM repbase; DNA; INV; 5644 BP. XX AC . XX DT 13-JAN-2009 (Rel. 14.02, Created) DT 13-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous DNA transposon from Hydra magnipapillata - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-4-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5644 RA Bao W. and Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 376-376 (2009). XX DR [1] (Consensus) XX CC TSD is 4-bp long. This element contains some insertion sequences CC from hATw-1_HM (masked out). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 5644 BP; 2151 A; 693 C; 721 G; 2076 T; 3 other; tgtagtagaa tattagatcc gcctgaaata ttagttccca ctgcgtaaaa ctcaaaacga 60 gattgcgctt tcgggttata actgttagaa gtgtacctaa aacgtccgta ctatctttgt 120 aaacataaac aaggagaaaa aaacgcttag gacaagaatt ctagaaatta aaaaagttta 180 cataaaagta atgtaagaaa gacgatgtaa gatagagtgg tagtaataga aaaaaagcaa 240 atataaacta attagttatt tatttactaa tgcttttgtt atataactgt gtatgtaaaa 300 gtaaaattca gaccgatata atttaaatct gtctgaatta ttacctaccc cttgtaaatc 360 gctcaatgta agatggtctg atcttcatta aagaaaaatg attgaaattc tacacatata 420 aaataattaa taattagttg atagttgtat aatttctgtt tatacaaact gattaaataa 480 attagttata gttttataat ttataattat gtaaagatca tttacaaata tctgtctgaa 540 ttataccctt tccccgagta aattactaaa cttgcgtctg aattatctcc ttcctcctag 600 taaaccattc aataaaagat ggtgcaataa tggtattgag tgatgtaata aatagtaaat 660 aactataaac ttttagttat atattaaatt gaaaatgctt cacaaatacc tttttaagtt 720 tctaaattat tacctcccct tagtaaaccg tttaatgtaa gatggtgtga taatggtata 780 aagtgataat aataggagtt atgcaactat atatcaataa gttaaaaaat tatgttaaaa 840 tgatttacaa atatactgtt tgaattatca cgtccccctt agtaaatcat gcaatacaag 900 atgtatgata ctggtattga atgatataaa ttattattat attttactgt agttgggaac 960 ttttagagct ttcgatattt aggttgtagg actttttttt tctataggaa catagaataa 1020 aagtaatttc atgcttactt taattagtat agttattagt ccaacatatc tttattattt 1080 tagcctggtt taaagatatt ttactgttat attaatattt aatttttttc caaatttacg 1140 catatcaata ttttgttatg tttatatgtt tttaagttct atgtttgcta ttttttatgt 1200 tctacaaaaa ctagtattga tttttttata tttttcttaa aagggttgtt tgttttgtgc 1260 aaattggaaa atgtttcttt attaaattat tatttttttt tttagttcat tcatgagttt 1320 ctgctctact tcatttttta tgtttaattt gactactgtt gaaatatggt tttaattaag 1380 taaattttca tgaattgaaa tttactttat taatcattag ccaacagtgt aggtaataaa 1440 attgaattca tcaccattaa ggtaagaatt actttgttct ctcgtaatac cattaaacag 1500 ttttttatga ttattatcat gtcattgcta ttttatgtaa aaagattatc tttttcatgg 1560 caacattaga ttatgttata cctaaagtaa aattagaatt ttgttaattt tttgtttgct 1620 tttaggtaac atttcttact aatacttctc tcaaaaaaat ttgaattttt atacttttag 1680 aggggtatta tactaataat aagattcttc ataaataaac ctatcaaaag tttatttagt 1740 ttcagtagtg gtgtagtggt aaagtgcttg tctcataagt ccatggtagt actgcacaca 1800 acttgtttct taacgcagca gccttgtttg tcaaggtttg tgttaytgag agagggttgt 1860 aaccaaaatt aaaagttgcc tccttgaatg tagtgccctt catggccttg gaaaggtgaa 1920 ttgatatgca agaaaaaaat aagataactc attaaatgta taattactca tttttaaaca 1980 ggtttttaaa ttaggtaatt gctggttttc tatttttaaa atctgttgcc aacaagaatt 2040 gggtttaata gaaagaactc ttttttttca acaactttgt ttccaacatc tatagatatg 2100 gcaacaatat tccatacaag tacagatttt catttaacaa aatttgagta aacaaaacaa 2160 caaaagtgca taatgcaatt ttgaacgctt ctgtggaaaa tcctaagttg cctcaaactc 2220 agttgtgaaa attgtgtcat attcaaaaaa gattatggtc agaaatatgt aacatataat 2280 gcattaaata aaaagtgtac aagcagctac agtttaacaa aaaaaaaaaa aaaaaaaaaa 2340 agattttttt gggggtaaga gttggatcga atttccatat tgctgagctt ttatcctata 2400 atgataaaag aaaaacaata aagaaacctt tattgttttt cttttatcta ttatagtctt 2460 taagtctttc aaaaaatata aagctctaga ttgcaatgct aaaaaaattt atttgcaaaa 2520 agttgtgcag gtaagttgta caaatttgta atctaaaatc taaaaaaaaa aaaaaaattg 2580 taaaaaaaaa ctatttataa taaaggctga tgatacttaa ttgaaattaa atttcaaaat 2640 aatacaagga catttaaatc tcgaggaaaa tgatacagtc taattttaaa ccaataaaac 2700 taacgaaatt ccttgaaaat ataatggttt ggcaagccaa acattaatgt gaattaaaat 2760 tacaaagttt tgtaacaaat attgaaattt accaaaagga ttatctagaa aaaccttatt 2820 gcaattatgg aacacccaca atggtaagtc aatactttgc ctgagattyt aaccatcaca 2880 tcaaacccat ggtaaacttt ctctctttag gagatattaa ttattaaatc caaattatgg 2940 aacaattaaa ggtaaaataa atcaagcact gaatttggaa aaaaatataa acctctagca 3000 gagtttggtc tttaccttat gctaaatctt gtgactccaa tcaagaaaat ttttagatat 3060 tatcatacca ataaaagtat tttttttaat taaccttttg tggtagatgc caacagcttc 3120 caatttaaaa aaattaaatt ttttgagctg aaatttaatt tttttaattc aagaaaaagt 3180 gtcaaattta tataaatttg attaagaatt attttattat ttctgatatt ttgactgttt 3240 tacaggtacg aatttcaaaa akgttgctac aacccttaga aaaaaatttc atttcattct 3300 ttttatatta aaagtataaa agctgtgata gaagaatttc aataatcatt ttttgttcaa 3360 gcttctcttg aaatttttaa caaaattatc tgaaacaaaa aagtaatgga aaaaaaaagc 3420 tattttattt tcagaattat gttagcagaa ttatctttag gtatagttac acaaaatagc 3480 aatgatataa caataatcat aaaaaaatgc tttagtaaac aataacaata aaaaaaatat 3540 atatatatat aaaactgttt aatggtgtta caagagaaca aagtaattct taccttaatg 3600 gtgatgaatt caattttatt acctacactg ttggctaatg attaataaag taaatttcaa 3660 ttcatgaaaa tttacttaat taaaaccata tttcaacagt agtcaaatta aacataaaaa 3720 atgaagtaga gcagaaactc atgaatgaac taaaaaaaaa aataataata atttaataaa 3780 gaaacatttt ccaattttca caaaacaaac aaccctttta agaaaaatat aaaaaaatca 3840 atactagttt ttgtagaaca taaaaaatag caaacataga acttaaaaac atataaacat 3900 aacaaaatat tgatatgcgt aaatttggaa aaaagttaaa tattaatata acagtaaaac 3960 atctttaaat cagactaaaa taataaagat atgttggact aataactata ctaattaaag 4020 taagatgaaa ttacttttat tctatgttcc tatagaaaaa aaagttttac aacctaaata 4080 tcgaatttac agatacatag catgacttac cggtatgtca aatcaaatat ataataataa 4140 tttatatcat tcaataccag tatcatacat cttgtattgc atgatttact aagggggacg 4200 tgataattca aacagtatat ttgtaaatca ttttaacata attttttaac ttattgatat 4260 atagttgcat aactcctatt attatcactt tataccatta tcacaccatc ttacattaaa 4320 cggtttacta aggggaggta ataatttaga aacttaaaaa ggtatttgtg aagcattttc 4380 aatttaatat ataactaaaa gtttatagtt atttactatt tattacatca ctcaatacca 4440 ttattgcacc atcttttatt gaatggttta ctaggaggaa ggagataatt cagacgcaag 4500 tttagtaatt tactcgggga aagggtataa ttcagacaga tatttgtaaa tgatctttac 4560 ataattataa aactaattta atcagtttgt ataaacagaa attatacaac tatcaactaa 4620 ttattaatta ttttatatgt gtagaatttc aatcattttt ctttaatgaa gatcagacca 4680 tcttacattg ggcgatttaa aggttcaagt ggaaatttta tataataata atttatatca 4740 ttcaatacca gtatcataca tcttgtattg catgatttac taagggggac gtgataattc 4800 aaacagtata tttgtaaatc attttaacat aattttttaa cttattgata tatagttgca 4860 taactcctat tattatcact ttataccatt atcacaccat cttacattaa acggtttact 4920 aaggggaggt aataatttag aaacttaaaa aggtatttgt gaagcatttt caatttaata 4980 tataactaaa agtttatagt tatttactat ttattacatc actcaatacc attattgcac 5040 catcttttat tgaatggttt actaggagga aggagataat tcagacgcaa gtttagtaat 5100 ttactcgggg aaagggtata attcagacag atatttgtaa atgatcttta cataattata 5160 aataactatt atttaatcag tttgtataaa cagaaattat acaactatca actaattatt 5220 aattatttta tatgtgtaga atttcaatca tttttcttta atgaagatca gaccatctta 5280 cattgggcga tttacaaggg gtaggtaata attcagacag atttaaatta tatcggtctg 5340 aattttactt ttacatacac agttatataa caaaagcatt agtaaataac taattagttt 5400 atatttgctt tttttctatt acaaccactc tatcttacat cgtctttctt acattacttt 5460 tatgtaaact tttttaattt ctagaattct tgtcctaagc gtttttttct ccttgtttat 5520 gtttacaaag atagtacgga cgttttaggt acacttctaa cagttataat ccgaaagcgc 5580 aatctcgttt tgagttttac gcagtgggaa ctaatatttc aggcggatct aatattctac 5640 taca 5644 // ID DNA-ATAT-1_CQ repbase; DNA; INV; 2155 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-ATAT-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2155 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 49-49 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. TSDs are 4bp; usually ATAT. XX SQ Sequence 2155 BP; 746 A; 360 C; 395 G; 653 T; 1 other; gggtcattcc acctgaagtg tgcaagaaaa aatgcaaatt tgaaaattac catctccgat 60 tctgctcaaa tttggcagag ctgttgagac tatcaaaaca tgcaaaaatc ccgaatttca 120 tccaaatcgg accaccccct ccatttttgt accctcccaa aaaatcgact ttttggcgat 180 ttttgagcga aacccctatc ttcaaacgac gataactcag gaaccacaaa tcttagaggg 240 tcggtcttag actcaatttt gaaggaaatt ggacgtagaa tccatttccg tgatcaaaat 300 ttagattaaa atatgttttc tacctgtatt gcgcaattga aaactttaaa tggccgtatc 360 tcaaaacagc cctatttatt ttttaaattt gacctcacca tcgtattccc cgtccgattt 420 tacataagaa tcacttatcg acagaaagga atatgtttcg ttccagagat atcgaatttt 480 aaagttttta gtatttgaga ttacctacat tagcttcatt cgccgcatat gctagaagca 540 ctcgggcgcg ctgatcaata actgctacct ggttatttat tgaaataaga tttcatccca 600 aataatcatt agcaaattag gcaaaaatgt aatgtacacc gctgtaggaa actgctgtat 660 acggtgaatt tgtgtgctag cccacaggac agagcaagaa cgacacgcaa tacagggcag 720 aatggaattc ccgcagagct agcaggaaat tgtaattgat cttgtaagta acacttaaga 780 aaaagtgtac gatacattat cgtgttaaaa attatgaatt aacatgaata aaactctcgt 840 gaagcatgat taaggaggag gatttctgga aagtataaaa ctggaaaaag gtaagaaaat 900 cacaaagatt aatctgattt attatctagt tgtgttatta ttcagttgtg ttgatttcga 960 caccgtccga acaataatct ttttttttgt ttgtcaatca tttcgaaatc ttggaagacg 1020 atacagctgc tgtccacatg tatcagaagt atatggtatt gcttttgaaa ttaaatgtac 1080 cagaattgaa aaaaaaatca tcaattattc agatgaaact ggctcacaat ataaaaatcg 1140 gtttaatatg atcaatcttt caaaccgtaa ggcagatttt ggggtttttg ctgaatggca 1200 ctatttcgct accgcacatg gcaaaagctc tagaacccat gagaattatt tcatctacta 1260 cagccagctc tacatttgtt ctcgcttcaa ataaaagaag aaataaaaga aataatttga 1320 taaagtggga agaaatgagg aattaggaaa aatatgaaaa taatagaata atgataaatt 1380 ttcgaaaatt gtattgtaaa aactctgtag catttgcaaa taaatattaa aagttcaaat 1440 tttacttaat atggttcaat gacactgaga tcaggaaaaa cagtgcatta aaaattaccc 1500 aataaatatt ccttaaacaa aaaatagatt gttcaaggta ttgattatct caaaatcgta 1560 taaaattgta aaaataattc atcttacaga acaccaggta gtwattattg atcagcacga 1620 ctgagtgctt ctagcagatg cggcgagtgt agctaattta ggtaatctca aatactcaaa 1680 actttaaaat tcgatatctc tggaacgaaa catattcctt tctgtcgata agtgattctt 1740 atgtaaaatc ggacggggaa tacgatggtg aggtcaaatt taaaaaataa atagggctgt 1800 tttgagatac ggccatttaa agttttcaat tgcgcaatac aggtagaaaa catattttaa 1860 tctaaatttt gatcacggaa atggattcta cgtccaattt ccttcaaaat tgagtctaag 1920 accgaccctc taagatttgt ggttcctgag ttatcgtcgt ttgaagatag gggtttcgct 1980 caaaaatcgc caaaaagtcg attttttggg agggtacaaa aatggagggg gtggtccgat 2040 ttggatgaaa ttcgggattt ttgcatgttt tgatagtctc aacagctctg ccaaatttga 2100 gcagaatcgg agatggtaat tttcaaatgg tgttccgctt caggtggaat gaccc 2155 // ID BEL-113_AA-LTR repbase; DNA; INV; 195 BP. XX AC supercont1.256; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-113_AA_; KW BEL-113_AA-I; BEL-113_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.256; Positions 863171 862977. XX SQ Sequence 195 BP; 64 A; 38 C; 35 G; 58 T; 0 other; tgttacgcga gaaattgcta ctgcagaatc caactttaac aggaagaccg gaagataaac 60 ctttcacata cacacatgta tctttcatga ataaagttaa agtaattgtt gaactcatta 120 gaatcggacg cgttttattt tcgatttcga gtttccttaa aaatatacag tccgctcggc 180 ttatcggaag gcaca 195 // ID hAT-28_HM repbase; DNA; INV; 6865 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-28_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6865 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2017-2017 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2154..4199 FT /product="hAT-28_HM_1p" FT /translation="MSKTNKSVIKKGIRKRSFVWTYFEMDQKNQTKCSICY FT KILPYNSSTSSMQSHLSIDHCINSKISKKSTTLLNDDDISDIESGNENGFE FT ETGHKKLNRLLVNFIVGTDQPISIVRHPDFVNLVSELNKSYKMPCINTVSK FT KLIPSITDNLKSILMLELKGCRFITITCDSWSSLGKNSYLGVTCHFIDKNM FT TLVDRNLDLKFMSEMKTAEYLFNTLNEIFSEWSINNKIFALVTDSGANIFS FT AVNKFDDNVLKIPCAGHRLNLVVNDLLNARDIKVKKFKNGSKRYQVKDYDG FT NGKLINREITESEVKEIEKINEGKLEIAKVISSCRHIVGSFKHSEMLQKKL FT KEVQETLRYETKIKLVQDIAIRWNSQFDMIESILTNKTALDMISIEFQNIK FT DYLPTATEFKVLEDLCDLLHPIKELTKLFSGKKYVTVTFLFPTLYSLLNEI FT LESQPIFTELVKTFQKELLRSLKGRFKYIFANDFFLAATFLDFKFRKFEFI FT KNDIYRYECISKAKMFIKTFYEMHLKETEEIDITIETNCLNNTLNSSNSSA FT DNTIKTFQQLSTSPIANNITLVNNKRRKSINLIELVTDKSCCNLIESNDKL FT EKEINDYSLQYFHSYENSAIEFFQANQKLFPVLTQIARIIFSVPATSVPSE FT CLFSSTGLIDTELRNRLAPSLLKQLVFIKENK*" XX SQ Sequence 6865 BP; 2444 A; 1014 C; 978 G; 2429 T; 0 other; gggttgttca cgaaagccaa agttgaattt tcgattaaaa aaaaagtttt cgaaactcta 60 ttttcgtttc ggtaagttcg aaataatttt aaagtaatta attttgcatc gaaacgattt 120 ttttactttc gaagttgtaa aatttgcttc gaaatatttt tgcgatttcg aaattctgaa 180 aatattgtcg aaataatttt ttactttcga aaatgtaaaa tttgcttcga aatatatttg 240 cgatttcgaa aatcggaaaa tattaccgaa aaaatttttt actttcgaaa atgtaaaatt 300 tgcttcgaaa tatatttgcg atttcgaaaa tcggaaaata ttaccgaaaa aattttttac 360 tttcgaaatt gtaaaatttg cttcgaaata tttttgtgat atcgaaattc tgaaaatatt 420 acacaatatt actattaaat tctaaaattt gcttatattt attttattac tgaaattgta 480 caaaatttta ttaaaatatt tttataaata attaaatttt aaaaattaac ataaaataat 540 acttaaaatt tttggtgtta cggccagaga aaagggttta actacctccc actattttat 600 agaagaatct gcaattaatt taataataac tcagggatcg ttcgcaaatc acctcacata 660 tttaagccta gcgtggtttc ggattttgtg acactttggg gagctttttt atttttagta 720 acaggttttt ttaaattttg gggccctaat gactttttta gtgtaaacgc tgtatacact 780 ggtttgataa tggctctctg aaatgccagt cctggctagc cgtaaaacga gtattgctga 840 gcagcgcaag tgttattgcc gaagcgtaac atcagacatc tgaccaatat atgggtttat 900 atgtattgtt tgcactgata agtccataga cgtagagttt ttatttttcc agattgggct 960 caaacagtgg cgtcaattca gaaaaaactt ataaggaggg gccaatttgt gaaaacaatt 1020 taactctttc cccccccccc ccccttcaca aaccaaaatc ataacattga gcagcacaat 1080 tcaattttgt ttcttgggta ttttgttaaa aactttactt taaaaacttt actttaataa 1140 aactctaaat atatagaaaa taatactatt tttgctactt tttaaaagaa actgctacca 1200 aaaagcaagg atatactcac agccgtggat aggctcccca cgctcctcct taattggggg 1260 ggggggacaa atttaggggg tctttttcaa atttttgagg gccttttacc gaatttgccc 1320 catggacatc tgctgcccgg gacaaacaat accttcccct tcacctctct cgcctgatta 1380 gattaagctt catgactttt tacttttata aaattatttt tagggattat atttaccctc 1440 accaaaagct tgaatagcct ttttgatagc ttgaatagcg ttaaaaaaat aaaaagtttt 1500 ttttgcgcgt cgttttgatt gcataaaaat ttcttttgaa aatacttttt ttatttcgaa 1560 aacacaacgt ttaaaaatca aagtaaattt gaaagaaaag ttgcgtaact ttagacactt 1620 tagaagagtc tgtaaggaat tttattttta atatctgtaa atttttaaaa ataaagtaga 1680 aggtcagctt aaaaaaatta gcaaaaacta aattattttt cgaatttttc tctaaaatcc 1740 tataaagatt atctggttaa acaagaaata tagaattttg acagtccctt ctaagaacgc 1800 atagtttaca gcacgcacat aagtcattat tattcctcaa cggcaattcc cttgaggtct 1860 taaacgcatg acatgaaatc attgttgcaa tgtaaacggc taattacttt ctagtggtta 1920 tttagagatt tgcaacggtt tcaaatcgac ttatgccttt gcgatttttg tgttacataa 1980 ttctaaaaat ggtgtaattt tatattaaag ttataacaat ctcattaata tctcggaaac 2040 gtaagtgaag taacatgtgg tacaaattta ttttatcata agttactaat ttgttaatat 2100 ttgttacata cgaacgagta aaaaaactac tcgcttttta gtttgaataa acaatgtcaa 2160 aaacaaataa atccgttatt aagaaaggaa taagaaaacg aagcttcgtt tggacatatt 2220 tcgaaatgga ccaaaagaat caaacaaaat gcagcatatg ttataagatt ttaccttaca 2280 acagttcgac cagttcaatg caatcccatt tatcaattga tcattgcatt aattctaaaa 2340 tatctaaaaa atcaacaact ttactcaacg atgacgatat atccgatatt gagagcggta 2400 atgagaatgg ttttgaagag actggtcata aaaaattaaa caggttattg gtgaatttta 2460 ttgttggtac cgatcagccg atttcgatag tacgtcatcc agactttgta aacttagtta 2520 gtgagttaaa caaaagctac aaaatgcctt gcataaatac ggtgagcaag aaattaattc 2580 catcaattac ggacaatctt aaatcaattt taatgctaga attgaaggga tgtcgtttca 2640 ttacaattac ttgtgatagt tggtcatcat taggtaaaaa cagctacctc ggtgtgactt 2700 gtcattttat tgacaaaaat atgactctag ttgaccgaaa cctggatctt aagtttatgt 2760 ctgaaatgaa aacagctgaa tatttattta atacgttaaa cgaaatcttc agtgaatggt 2820 caatcaataa taagattttt gcactagtta ctgactcagg tgcaaatatt ttctctgcag 2880 taaataaatt tgatgataat gttcttaaga ttccgtgtgc tggtcatcgt ttgaatctcg 2940 tggttaatga tcttctaaac gcacgagata taaaagtaaa aaaatttaaa aatgggtcaa 3000 agcgttacca agtaaaagat tacgacggta atggaaaact aattaataga gaaattacag 3060 aaagtgaagt taaagaaatt gaaaaaatta acgaaggcaa attagaaatc gccaaggtaa 3120 tttcatcctg caggcacatt gttggttcat ttaagcattc agagatgtta caaaaaaagt 3180 taaaagaagt tcaagaaaca cttcgttatg aaactaaaat aaagttagta caggatatag 3240 ctattcgctg gaatagccaa ttcgatatga ttgaatctat attaacaaat aaaactgcat 3300 tggatatgat aagcatcgaa ttccaaaata taaaagacta tcttcccact gctactgaat 3360 ttaaggtgct ggaggatttg tgtgatttgt tacacccaat taaagagcta acaaaacttt 3420 ttagcggaaa gaaatatgtt acagtaacat tcctgtttcc aacactgtat tctctactaa 3480 atgaaatact tgaatcgcaa cctattttta cagaactagt taagactttt caaaaggaat 3540 tattaaggtc tctaaaagga cgttttaaat acatttttgc taatgatttt tttttagcag 3600 ctacttttct tgatttcaaa tttcgtaaat tcgagtttat caaaaatgat atctatagat 3660 atgaatgcat ttcaaaggca aaaatgttta taaaaacatt ttatgaaatg catctaaaag 3720 aaaccgaaga aatcgatata acaattgaaa caaattgttt aaataacact ttgaactcgt 3780 caaacagctc agctgacaat accataaaaa catttcagca attgtcaacg tcacctattg 3840 caaataatat tactttagtt aacaacaaaa gacgaaagtc aatcaactta attgaactag 3900 taacagataa atcttgttgt aatttaattg agagcaatga taaacttgaa aaagaaataa 3960 atgattacag tctgcaatac tttcatagct acgaaaatag tgccattgaa tttttccaag 4020 ctaatcaaaa attgttcccg gttttgacac aaattgcacg gataatattt tcagttccag 4080 cgacatccgt gccatcagaa tgcttattta gttcaaccgg tttaatagat acggagctac 4140 ggaacaggct tgcgccaagc ttattgaagc agttagtttt tattaaagaa aacaagtaat 4200 caaaaacatt tctttattaa aataaaaaaa attatttagt aaaagtgtat tttataaatc 4260 ttagtaattt aaagcacttg tattcttttt tttttatctg atgaactttt ttacaaacag 4320 cttaattaat gacaatatgt tttctgctac attcttttca attctggaaa aaacaccaat 4380 atccaatgat tgagctgcca tcagggcagg tgcaagtagg tgtgacgtgg tggggtgtca 4440 ttaaaatttt tgttgagtaa aaaattaggt ccttgttttt tgaagttcga cgactgacct 4500 gtctaaaagg tctacatttg ccgacttctt taaaagggtt atttttgccg actaaatgca 4560 caaaaaatgc atttaggaag gtgtgtgtgt aaatgattcg taaatttaaa taatctttcc 4620 ccgttattga atggtttaga atttgtaact aaagaggact aatttcatag cttgtagcag 4680 tagcatttac tgttatgata ttgccctttt ttaagaaatc ttgttgtcca aacaactaat 4740 tataaattgg attgttttat ctttccctaa cttgaggaaa atctttatga acaaaaaagt 4800 gattagaaga gagtcttttt ttaactttct aggagtgatt atagatgaaa atatatcatg 4860 gaaatatcat attaaatcta ttgaaattaa aatatccaaa aatacttcaa tgctatatat 4920 agttaaacct taactttatt tagcataaaa aatatataat tttcttaata cggagtcttt 4980 gaaaattata tatttttcat ttagggagag tggggcaaag tgggtcgaat ttctttcttt 5040 tttaatttga gtgaaagtta aaatatatat tttatgtttg tcaaggctaa aggattgaga 5100 atatgtcctt catgataaga aataactaaa atctaaatat atattatcta acagctcaca 5160 gaaaggttta aagtgagttt tgttaagtga ttcactgtac cccagtactg gggtacagtg 5220 aaacaattaa ctcatttgca atttagtggt agttaatggg ggcgtagcca aataattttg 5280 aggatgaggg tgtaaataca ttgtacacta tacttttggt gcacttttgg taactaatat 5340 taaataattt aaattttttc tgattgacaa aataaagaat aaattttatt gataacaaaa 5400 cttattcaac catctaaaat cacaaaaaaa aatttattta ataaaactta tttaaccaac 5460 aacacaagaa attttttaaa gtgaattttt aatttttata aacattttta tcacagtgaa 5520 ttaaccgttt cacggtaccc cgattcattg tgccccacta taaaaaataa ggttcaagtt 5580 taatttcttg gggcatgagt tttattaaag gaaaagcaaa actactaaaa tgaaccataa 5640 agatcagtat tccgaatata aaactttaac ttataaaatg attttaacca tgatctccac 5700 aacaaacata aagatggtaa gagcatttac tttagttagc taaaaacata aattacctac 5760 ctaaaaccga gagtcaaaaa ttatgtggta acacctataa ataaagatgg gtgatatatg 5820 cgcacataaa cttattacat catcgtgatg aacaaaatca taatctttga aaaaatgctt 5880 cttattcact gtgcccctca atccactgta ccctgacttc ccctattcat agttttttta 5940 tttataaact taatatatac caagttatgc tttttacgtt taaatcaaaa catgcgttat 6000 ctccttatat atttcactct tactttgcag aaatatctca ttaataccca tctgataatt 6060 ttgtaattcc aagatcttat ttaaaactca attcatttca aattcaatac cgtggaccat 6120 ttatatggaa aagtttttaa aaaaacatta ttaaaaataa aattcttcta agagcaattc 6180 taagaacatc tccttagaat aattcgaaag agaatctaaa cgttttctat aagaaaaaaa 6240 tttcaatatt tcttttaata tttcttttaa taagttcgaa attaatttag cttaccaaat 6300 atctattggt tttttctcga aattaattgc acacttcgaa actctttttt ttctttcgaa 6360 attaatttga cttttcaaaa tacacttttt tttttcgaag ttaatttgac ttttcaaaac 6420 tctttttttt ctttcgaaat taatatcact tttcgaaata catttttttt ttcgaaatta 6480 atttcacatt tcgaaattct ttttttcctt tcgaaataaa tttccttttt cgaaatccag 6540 ctttttattt cgaaattaat ttcacatttc gaaattcttt ttttcctttc gaaattattt 6600 tcctatttcg aaatccacct ttttatttcg aaattaattt cacatttcga aattctttat 6660 ttcctttcga aattaatttc ctatttcgta atccaccttt ttatttcgaa attaatttca 6720 catttctaaa ttactttttt ttttcgaaat taatatggct agttcgatta ttatcgaaag 6780 tttttttaga tttcgaaaaa aaagaagtat ttcgataacg ttttaatcga aaattttcga 6840 aaattgcaat ttcgataatg aaccc 6865 // ID EnSpm-N2_BF repbase; DNA; INV; 3172 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-N2_BF non-autonomous DNA transposon - consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N2_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3172 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3172 RA Kapitonov V. and Jurka J.; RT "EnSpm-N2_BF - a family of non-autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 791-791 (2008). XX DR [2] (Consensus) XX SQ Sequence 3172 BP; 897 A; 669 C; 669 G; 936 T; 1 other; cactagacga aaagacctaa tatgtagacg tatgctgacg gattccaatt tgcaacacct 60 gacggatgct gacgtatgct gacaaatgct gacggatccc aaaaaccatc acggatccca 120 aaaaatgacg gatcccactt gcatccgcaa cgcattcgta ggtatccgta actctcatcg 180 gtggatacaa acacggaacc cgtatactta cgtacttcgt gacggatgct aacactgcat 240 ccgtatggat acttcctaaa ttacggaccc tatacacata cgtaacgcat tcgttggtat 300 ctataagcca ctaacggatt caaacattgc attcgtatgg atactttact tttaacgtaa 360 acgaaagcat acgtggtcag gattcttgtt tatcagcaaa aagtcttcag atgcagacgw 420 ataccaagga tacgaatttg caacatcctg acggatgcta acggatacca aagattacca 480 aaaatcacgg atcccataca catccgtaac gcattcgttg gtatccgaaa ctcgctgtgg 540 atgcaaacac ggaattcgtt tgcttactta cttcgtggcg gatgctaaca cagcttccgt 600 aagaatactt actttttatg tatgcgtggc tagcactctt gtttttgagc atatatttta 660 ttttttagat tggtcatatc tttccccctt tttgtacctc ggacaccaca gtttgtaacg 720 tcaatacaag tgagacaatc cacgttacgc tagataccag gatactcgga ttcgtaagga 780 tgcgaaatgg tttttttctg cattcgtggc aagattacca acaatgcact gctccagtgg 840 attgctccag tggaggggat tccattggag cagtgcattg ttggtaatct tgggatatct 900 taaatatgcg actctgtatt tttttgcatc cgtacattgg ggcggtgtat tgtgggtaat 960 ccaatcgttg tctctctgac caatcagact gcacatcagt cagtttaaaa ataaagcgga 1020 gatagtcagc gttgtccgtg aggaccgcaa tctttttgta cctcaaactt tgagttttga 1080 aacgttgaac tcgggatatg gatttgtaca cacgtgtgtg aacttctgta gtagggtgat 1140 gttgaggcgg tgaccaaact caagtggcag tcgacggacg ggacggcgat cgagggactg 1200 aaacagccag ccctgttttg tagttcggcg tcctggagct attttacatc actgtcctgc 1260 ggatctcacg gcgaggtatg ttttatacaa tgattgtttc caaacaatac ccgctttgaa 1320 ttcgtctttg tgactacatt ccttgttggc tagtagttag tagcgctttt ggtatgcttg 1380 ggctgaacag aaacttatca gaaaacgaca taacggagcc ttacaaaaca caagtcacaa 1440 gcatgatact tgctgagtaa ctgttgactt atttgggaaa aatttgaatc atcatcaaaa 1500 cagtacttgt atacacatat gtttgtctca agcctggtat ttattcgcat aggaacattc 1560 tgttccaacg gtgtttacgt gtttgtttat atgcgacgtc tcgtacacaa tgcaaagtcc 1620 ggccggggaa gcagatcaca ccttgtcact aattatgcaa cgctagactg tttctgccgc 1680 gccctcccgg gggacttttt tggtatgata atgcggaatg taacggtgat aatggcgata 1740 ccacaatcag atgcatttgt tcctgttgta taacaagcca atcaccattt cttttcatat 1800 acatcctaat tactttgacg gacgcatcca tcaaacatgg atgctttagc tatctcaaaa 1860 tttgatggga ttattctttc tctttttcag aaggcgggtc gacattctct ggccgtgtca 1920 acagctggga gaacactgga ctacaggaag ggcacagaag gaattgagct gggagggatc 1980 agggcgtaac tagcctgaaa agaagcgaag cactgtcact agtagaaagg agactttgtg 2040 ctgttgcgag aagctgtaca tactgatact ataagtcatg ttagttgtac tttacaagct 2100 ggattcacta tctgtatcct gtagccggat gtggaaattc cttagagttg gacatcgcga 2160 agactttgat ttgtaataga aacagatatt cgtagaagca agattagcca cagactccca 2220 tgtcagcttt acataaaagt caaattgtat tcacgtttga tcctatattc tatttggaag 2280 tataagaagt catgtgatat tggcggttaa aggattcaac tttcccttaa cataattaga 2340 taacagggac tctctgtaca tacctggtat aacccattct tgttgcaata ttgtatctgt 2400 atacatatca tgtggcagac aggattcggc ctcgatgaca ttacgtcgtt tagttcttct 2460 caggactggt ttagaataag attattctgt gcattctttt ctatttcaaa tcgtgttagc 2520 cttttataag aaagtaattg ggcacacatg atattgtgat ttgtcaaatg tgcggaaaat 2580 tcattaaagt gactttgtat ctgcagaaaa tcccgttttg gattctttat gatctgatca 2640 tacgaagaaa tccctaaaaa acgacttcgc atccgcggca aatgccgcat tgggatttct 2700 taggatccga ccatgccaaa tatacgagga aaatcctata aaaacgactt cgcatccacg 2760 gcaaattcca tattgggatt ctttaggatc cgaccatgac aaatatacga ggaaaatcct 2820 ataaaaacga cttcgcatcc gcggcaaagg ccgttttggg acccgttcat tccaggtatt 2880 cgaagaaatc ctttgaaaaa cgacttcgca tccgagggaa ataccgtttg gggattcttt 2940 gcgatccgat cattccaggt attcaaagaa atccctaaaa aaaacgactt cgcatccgtg 3000 gaaaatccca ttttgggata tgtttgagtc ttttcattcc ggatcccaaa aatccaaatt 3060 tcggatattt tagggatcca actcgaatcc gtttgcattc gctagcattt gttacaaata 3120 cgcaaatatc cctgctcgga tatggctaca tccgataccc tttcccccag tg 3172 // ID POKEY_DP repbase; DNA; INV; 6631 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 28-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Daphnia pulicaria transposon Pokey - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; POKEY_DP; KW piggyBac-like; putative transposase. XX NM POKEY_DP. XX OS Daphnia pulicaria OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Penton H.E., Sullender W.B. and Crease J.T.; RT "Pokey, a new DNA transposon in Daphnia (cladocera: crustacea)."; RL J. Mol. Evol 55(6), 664-673 (2002). XX RN [2] RA Gentles A. and Jurka J.; RT "Pokey transposon from Daphnia pulicaria."; RL Direct Submission to Repbase Update (OCT-2004). XX DR [2] (Consensus) XX CC Predicted protein sequence from Ref [1]. XX FH Key Location/Qualifiers FT CDS 3957..5417 FT /product="POKEY_DP_1p" FT /note="putative transposase" FT /translation="MQKKADAKKVAQHKAVNRNWKECKVWGDVGRQSRADD FT DSPRPQFKERAIQFAPLEEPGPVLSQFSRFSSPLGVFKTMLGGEDTLYLLR FT EATNDYIAEFWNSRKTPPSRKVGGKPEGRLIGPRLPAGGFVTDHELIAFFG FT IQFLIGYHRLPELSMFWEQQPDTGLGLGIIQQAMTRERFKFISKHIACASP FT WDDVDPNSDDPDQEPERPDPIRKIRPLVKRLNESYHVCRKPPRGQSIDESM FT VKYKGRSMLRQTMKNTPIKSGFKIWSRCCLRGYTYKFEIYQGARFGEKQKR FT SRNNEAVERVVVDLCQPLTDQGFVVAFDRFFTSIALLDKLRENGVNAVGTI FT LPSRVNQPIMTKNESNLRPDEFAAKFGGEPGTCRKGIFVWRDTKAFRVASN FT YHGSDIVKVRRKQRDGSFRSKSCPKAIDDYVNNMGGVDTANQLRSYYERDR FT KAKKWWHRLLYSLLETCLVNSWICFNDMVRSQMAEIKTLNTQ" XX SQ Sequence 6631 BP; 1865 A; 1385 C; 1726 G; 1650 T; 5 other; ttaacccttt ttcgactgac gggacgtttc tttttcgacg gctttcagta gggtctggtg 60 ggacctcgcc cacagacacg agagatttca aatcggctaa attcattcct atcaaagctt 120 atatctcgaa atccatccta cagaattgaa ttgccttaca acgaaaaatt aattgaccta 180 ccgtctttca atgataaaaa ggaggagaaa ttaaaacgag tatttctcgt gttttaccga 240 gttttcggtg acgggacatt tctggcccgt caggcggctc tgatcgaagt ctacacgaga 300 accttcgatc tggactgtcg ggacatataa gtctccgcca gatatgtccc tccaggcggc 360 tgcaatttta tcaaaagccc ggccagattt gaaccaccag gtggttgggc actttttcaa 420 gtattgatca gtggtgagct cctagtgtaa cgtgcactgg gatgatctga aattcggttc 480 atttcgatag taggatcatt tcaggagtca gaattagtga tgccgcgctc agtagatatt 540 tgataaaagg gaaccaacca ccctggttag agatgtcagg aattgtgagg ttttatttgt 600 gttcctcatt aaaaaaaaca aagaagtaga gctgaccagc caggcaggcg cggtcgaaag 660 aaatgggatc gagcgatggt tggatagata tcgaagggaa ggcgaaagag gcgaaaagga 720 aagcgcgaaa tttctttagt agctttgacg caaaaacttt ggtatcgacg gggagctttg 780 ggaaatggaa cgaagacgca gaatgggaag ttgtccgatt cggcgttagt tgtagtttca 840 tttccatcgt agctgagctc ctttccgacc atcaattggt tttattttga ttccgcacgg 900 gcatatatga gttcgagagc atggcgggtg aaagaaatga attcgaagtt gtgtttggat 960 gggaagctct gttggaatgg gtcaaaaacg aagaatcaga agtgtccgat tcaaagttac 1020 tagtttcatt ccttgtggcc agagctcctg aacctacccg ccgcttcaaa tgaattcagt 1080 ttcgattccg ttgcgtcgct tctcggtctt atagtaccca cctgtctacg gccggaagat 1140 ataaatccga accgaaagat tatgagagag acgagggtat cgacccgcga aagcaatttt 1200 accctaggaa gtctcgaaag tcggttgctt aatcaggaag tgaatacctg atggtgatgg 1260 tattattaat tcttccttga ttaccgatgg aacagcaaag cgaaagattg gagacgaaaa 1320 gaggagagtt catccccaat tcgaatctta gtagttttca tctttcgtgt tctactcacc 1380 ggtccaagat cgtggaattc cacttctgta cacagttcgc gtaatggttt taaatgatgg 1440 atatggtcga agcggtcggt gaatggttcg aatcagagac gaaagcgttt tggcagataa 1500 ctcctaacac gtttatttgg gctggtcccc gtgcccattg cagtaccggg cgaatgcgcc 1560 aggcgccgag tcttgaaaaa tcgaaccttg ccgagtcccg acgacggggt agtgacgtcg 1620 ttgccgggac ttggatgaat gcgagcttgt ggaagacaaa ataatagacg aacggtgcaa 1680 ttcatcctgc tgaacttttt gttcgaatac tgatgaaact taaaaaccta gtaatctttc 1740 caatacaaga gtaactccga ttcatttgcc cgattgccgg gttttgggcc gcctagaaat 1800 gatcctgatt caaccagaaa tctttccgga tttatagtgc ccgttgggtg gtcagcaatg 1860 cctcttgtct tgtcttttct gtgcgttgat cacaaatgtg tggcaatctg tggcagatgt 1920 gaaaagcgct ggataagatg agaaacaaac cccgtattat tgaatgtgca tgcatgtccc 1980 agcatgagga gaccgtcgaa ccaaataaaa ttggatcctt ttggccaata tagaagcctt 2040 gaataaacat ggtaataaaa aataaagtcc taagagtttt ttggaagtgg cgtagagacg 2100 aatgtctcgt gaattggctc ctaatttaac ctagagccag gtgcaaattc gattaagcca 2160 gagcagactg gccggttaga gatgtcagaa attcaaagtt cagatatgtc agaaatatat 2220 tggacaacac ggatgaaaga aaaatgaagg ggtggttgta atgaattccc ctgggaatta 2280 gcgaaaacag aacgggaagt gtcagttcta aatagttttt ttccaggcgg tatttttatt 2340 ccgttccggt ccgtgtaatc gaatgaaatg aatttgaagc cggtggttgg atagatatcg 2400 aagggaaggc gaaagaggcg aaaaggaaag cgcgaaattt ctttagtagc ttcgacgcaa 2460 aaactttggt atcgatgggg agctttggga aatggaacga agacgcagaa tgggaagttg 2520 tccggttcgg cgttagttgt agtttcattt ccaccgtagc tgagctcctt tccgcccgac 2580 gctccttatt tatttttatt cctatctggg tgccgcgcga cttaagtgtc cgcgatgcct 2640 caaatacaaa taaaaaaagg gttcaagtga atttgaagcc ggtggttgga tagatatcga 2700 agggaaggcg aaagaggcga aaaggaaagc gcgaaatttc tttagtagct tcgacgcaaa 2760 aactttggta tcgatgggga gctttgggaa atggaacgaa gacgcagaat gggaagttgt 2820 ccggttcggc gttagttgta gtttcatttc catcgtagct gagctccttt ccgtccaacg 2880 gttcttttga attgatcttg aagtcttcga ggcgctatta gtgcccgtgt aattaaattt 2940 tacacaaagc ctcttgtact ttcttatccg tgcgttgatt actgctgtct cgtgaggaaa 3000 ctccgaaagg cggtgaaagg cgctggataa gaatcagccc cggtaactcc tgcatgtgca 3060 tgcttttccc ctgcatgata aaataagacc gtcgtataag gcttaggacg taaggcaatt 3120 cgttatcctt ttgccgaatt cacaaagact aatcgtgcct tttttttcga ttctgtattt 3180 tcagattgat taaaaatggc gaaaaataaa aaaagagaga cccaagatat actaagtcga 3240 gagccgtgaa aaattcagcc aagaatgccg ccgccgcagc tgtattaaaa aggaggcagc 3300 tgaaagaaga aagagaacgt ttagctggag atcaagagcc ggtaatcaat tttgatactg 3360 taccggaggc tcaagatcag gaggaggagg cggttgccgt cgaggagccg gtcgaggagc 3420 cggtcgagga gccggtcgag gagccggtcg aggagtcggt tgagaagccg gatgaggagc 3480 ctggtcagga gttggttgct tcgcagcggc cggaacttga tcaggatccg gaaatagagg 3540 atgaacaatc tgcctcagag gatgatgaac aaaaagctgg gacagaagaa gaggagccgg 3600 tggtagagcc attaaccccg aaagacgtta cagttcaact tgaacgattg gaactatcac 3660 cggcgacgct tcaaagactc aaaagagttc aaaaagcgaa aaaaagaaag gaacgtgaag 3720 acagttcaaa gagtggctcc tccccaccaa aacgaaccag aagtcagacc cgaagaccaa 3780 ctagtgcagg gtcttcttct gaaacggccg atcccgatgg tgaagtaaca cggcactaca 3840 cagggaaaaa ggtcggtctc acgaacgcga ctttgaggtg tcgtgggcct cgcgttctta 3900 cagcgcatcc gttccggctg gccaaaaaaa cgaactcaga ttcggccaag ctgaacatgc 3960 aaaagaaggc cgatgccaaa aaagtcgccc aacataaagc cgtcaaccgg aattggaaag 4020 aatgcaaagt ctggggcgac gttgggagac aatcacgggc tgatgatgat tcaccgaggc 4080 ctcagttcaa ggaaagggca attcagtttg cacccctaga agaaccaggc cccgtgttga 4140 gtcaattcag tcgtttttcg agtcccctcg gagttttcaa gacgatgctt ggtggtgagg 4200 acactttgta ccttcttcgg gaggcaacaa atgactacat cgctgaattc tggaattcaa 4260 ggaagacgcc accgtctcga aaagtgggtg gcaagcctga aggcaggtta atcggcccgc 4320 ggttgcctgc tggaggtttt gtcacagacc acgaacttat agccttcttt ggcatacagt 4380 tcttgatcgg atatcaccgt ctgcccgagc tgtcgatgtt ctgggagcag caaccagaca 4440 ctggtctcgg gctcggaata attcaacagg ccatgactcg cgagcgattc aaattcattt 4500 ctaaacacat agcttgcgca agcccatggg atgatgttga ccctaattct gacgacccgg 4560 atcaggagcc agagcgaccg gatcctatcc gtaagatccg tcctctggtc aaaagactca 4620 atgaaagcta tcatgtgtgt agaaagcctc ctaggggtca gagcatcgac gagtctatgg 4680 tcaaatacaa aggccgctca atgctgaggc agacgatgaa gaacacaccg atcaagtctg 4740 gtttcaaaat ttggagccgc tgttgccttc gtgggtatac ctataagttt gagatctatc 4800 aaggagctcg ctttggggaa aagcaaaaaa gatctcgtaa taacgaggcg gtggagagag 4860 ttgtcgttga cctttgtcaa cccctgacag atcaaggttt cgtagtggcc tttgaccggt 4920 tctttacgag cattgccctc ctggacaagc tccgcgagaa tggtgtcaac gccgttggaa 4980 cgattcttcc aagtcgggtc aaccaaccca tcatgacgaa gaatgagtcg aacctgcggc 5040 cggacgaatt tgcagccaag ttcggtggtg agcccggaac atgccgaaag ggaatcttcg 5100 tatggagaga cactaaggcg tttcgggtcg cttccaacta tcacggctca gatattgtca 5160 aggtcaggag aaagcagcgt gatggttcgt tccgatccaa gtcttgtcca aaggctattg 5220 acgattacgt caataacatg ggcggtgtag acacggcgaa ccagctgcgt tcttactacg 5280 agagagaccg taaagccaaa aaatggtggc accgtctctt atactctctc ttggagacat 5340 gtctggtgaa cagctggata tgcttcaatg atatggtgag atcacagatg gcggaaatta 5400 aaactttaaa cacacaataa tttccgtttt ttttcttgac ttaggtcgag gaaaattatc 5460 tggaggatta cgaagttcag atgccgtttc tcgagttcaa acggaacgtc acgatgggtc 5520 ttttgtccca cgccctgaac gaaaataaaa ccaaagctgg tcgagccggg cgaatgatgc 5580 caaccattca cccttccgct gaaccaggag caaagcggcg gaaaagtcga ctgagtgtaa 5640 gggacgacat tcggttaact tgtgtgggca accatctacc catctttggg gaggcgagag 5700 gcaggtgcga gtggtgccag gcaacgacac ccaaaaaatt ggaatcaaga ccattctcaa 5760 agtgcaaaca atgtaatgtg tttttgtgtc tcgggaagaa gagaaattgc tttgtcgagt 5820 ttcacgatga taattacact caatctgagg aggaggctcc gctggacgag gtggaagctg 5880 tctattccgc ttcggaggaa tcagacggcc aagcttctga ccatgagcca accaacaagg 5940 agtgttcttc ttcatccgct gattccgacg agctagaaka ttctaaccga cgtgawggct 6000 atatrattga tgattatgat gatttccaag accgtgatgg atttgtaatt gatgaatatg 6060 atgaatttca gcaataaatt tgtgttttgt ttcgtacgtc mttgtgcatt tttactcgaa 6120 agtgcggcta cttgccagag tggtgaccga cgaaggatct gactgacttt tctaattaga 6180 gatcctattc cttttcataa ggagccgtcg actggaaaga cccctggtcg atggtaaaga 6240 cctcaacgtc aggcggggcg cctttcattc tgccaggtga atgagagtca agcccgcctg 6300 acgggacaca agaggcacat taaacggcaa aagtctctcg ggacaaatat gacgaaaaaa 6360 actcgagccg tctgtcggga caaaaatttg aacaagtggt agttgcatgc aactrtcggg 6420 acaaattttt tgtcccgcta gacggctcga gtcgaaaaaa gtgtctcttt ttagagacca 6480 tgtccgcctg aagggacatt cctgtcggga caacggtggc cgaaacgcgg ttaggccgga 6540 aaaaatcgga tattccgaat ttttttttaa tgagtggtct taggaccact caatgatatt 6600 tcgatcgagc attcaatcga atccgaccat c 6631 // ID Gypsy-27-I_NVi repbase; DNA; INV; 10610 BP. XX AC . XX DT 11-MAY-2009 (Rel. 14.05, Created) DT 11-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-27-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-10610 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 990-990 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1986..4550 FT /product="Gypsy-27-I_NVi_1p" FT /translation="MDNHRKMMDALRKDREVRNLASEIRKYDKEDVEEELK FT SLKISTKGNKAILRDRLLRAEILRAGLSETVPWYEWDNGGVIPTDSEEALT FT AILNHKKSKSRTSSKRDKKEVIAESASQSEIETGKETEYEAGDESLEEIVN FT SSVRQIQALSSLTSTDDGTPGFGCPLTVSNQLQQLISARVLSALQGSSSYA FT TSQKSIDPTSLLFVNKTTRAQPEVVPVSRLASSPTLDSNEGAQKFMGIWPY FT AETIRNSVQPNTGTIKESSTLRRSISDDGKFTRQVQEWELRTPKSKNRPHV FT QPTNLKSALNETEDVERKKPTEPCAINQPRRRKITVQNPGLPWFARGASKK FT PASQPRAKKSSDSSSDSEEDSSSDESEKEESKNRALGKTPTKAIDTKSKQP FT TRESLKKEKSKKKQTKRDSSSSSSDSSDSLSSQENDFDSSSSSDRHEKRKT FT KSCKKKSSRRLEQEERLANHLYQLQIIQGRNIKFAGETRDDPEEYLSQIQE FT CKESLNLKSEEILAMIPHTLSKKASMWFRTEKEGLKTWKTFKKAFEEQFIS FT EVSDNDVIEELRRRTQGKGEKISTYIMNFKYIAAHLKRPMRLKDQLTLLTE FT RLRPEYRRAIRRKKIESYEDVKRYCRTLEEEMERDAMYTPPPTVEKSRFPA FT AAWTPTPKAKVAAAKEVEEAAGVTDANSNKQNINNNNKNKKQKQQQNPPVV FT TSASITDQPVNNNSTSEPKQTNNGPPVWRPVNVQSQRQTQQSQGRADASDA FT TQSQPLSRYNNRGSTPFVGACYHCQQVGHRASACPTVECYACHQKGHKSPV FT CPNRSRSQIQCQVCGQFGTTFQNCGNCATFRTMWGNGNAGASLGPMPPATQ FT Q*" FT CDS 4382..8857 FT /product="Gypsy-27-I_NVi_2p" FT /translation="IASLPKSQPEPDPVSSLRPIRHYLPKLWKLRYLSNDV FT GKWQRGGIPGADATRNSTITASNVVVKTSSSVTPLSEVVEVLGGGGSGRGP FT LSTEVKEEKEKLLPKANATRWPNNSTETKEEKEKLLPKANATRWPNNSTET FT KEEKEKLLPKANATRWPNISTETKEEKEKLLPKANATRWPDNSTETREEKE FT KSRPNANVTRWPNNRDEVEEENRVAAGEAVLNSSSNEEDSVSRPWRNELQS FT KVHYSQFETQNEWNKPTPLTLDSSSESDGSLLDERVKNNRSQTSANLNASK FT TRGLTLEQGTLYSQECINKVRQRVASREENEKTVVGKECEIEPLKVEKSEK FT RVSFELESEQSDLSDDSDLVEVSGTWEDTPTQVKREKIIELIEEEEDFEKL FT SKIELAVEQLQSTDIAAENQSLPETKEIAEQKKPIPPDPNVDYVSPPIQIS FT DEVEHQAPTAAVITDNRNYMQVVLSGTTYRALFDPGAVITLVGPRVAEKFE FT NRLIAAKTSIQAITGELSPVMGYLQINIELDGIDATITARAVKEIGHDVVF FT GRDFCEIFKIDTDHRGWWRANGGIWRRFNSQNPSENDKVFAECAGITELDE FT DERRQIEEIVDGTLPEYSPDVLGFTDLTEHHISLTCDTPVRQKFRRRSPKK FT IESIQKGAKELERLGIIERSASDFVSQVVMVPKQGTDEERLCVDFTDVNKF FT TKKDGYPLPQMDAILDRLRCARYLSKIDLRQAYFQVKMEESSKKYTAFAVP FT GYGLWQFTRMPFGLINAPMTLQRLVDTLFGPEDTPQIFGYLDDIVIATDSF FT EEHKIKVRYVLQKLISAGLTINPKKCQFCVSQIKYLGFVLDKDGLRTDPDK FT VAPVLNCPAPCNVRELRRVLGMIGWYARFIENSSETKIPLVKLLRKDQKWT FT WGDEQQQAFEKLKKALTVAPVLARPDFNKPFCIQCDASNVAIGAVLTQEFD FT DGEHPIVYVSRVLTAAEKNYTTTERECLALVWAIKKLRPYVEGYHFTVITD FT HSALRWLRSLKEPSGRLARWALEVQQWDFDIIHRRGANHRVPDALSRMLEP FT ELAAVAEISDPWYLRRFKEVEEFPNKFPQWRVEDGRLYRFKRDPLLDPITH FT SDEAWKLVIPAEQREKVMTVAHVEVTAGHMGIEKTYDRIAREYFWPGMYHD FT VHSFITACPVCQRYKVSQQGPQGLIGKRIVEKPWTVVAADMMEFPRSKGQN FT KYLLVFQDLFTRWVEVKPLRKADGKSVARAFEELVLFRWETPEYLLTDNGT FT EFLNKHVAETLKEYGVTHVTTPPYHPQANPVERSNRTLKTMIAMFAESHRD FT WDKHVHELRHAMNTAVQSTLKTSPAFLNFGRHPSPVKSLRTEIEGRGPKLQ FT LDPAVWQDRMKRLDALRELVAKFIDNARNKQGETYNRRRRLTNFAVGDMVL FT RRTHYQSKAAESFSAKLAPKFEGPFRITEQKSPTVYILEAEEGNSRKIAKA FT HVSDLKRYLPPRNPKTNSPNTDPTSR*" FT CDS 8981..10351 FT /product="Gypsy-27-I_NVi_3p" FT /translation="MLQIRNSVMPRRTRSRSLLTESRHVAEEEFVPPTFTA FT VPPDFVPPQWSTRHERIRDPNGPKSEETMHRELRVRDAIRRREMEEEEERE FT RELLAGSDDEARTVEPMEVAAFTLVPAETNTKVGGGRNPSSKVSPWFRLES FT AEERESREAVEELRRVVEKETGVDLPTQVEERTLDTRGLPQRPFTTRDARE FT GTTTPKKDPVCSTPEDTVRERKRGLQLKVKLSGLRYSGLKPIPTDPELDPP FT PYSCFNCWGRGHTVFNCPEPRHGKICLNCGRFGEDLNSCPRCRDRHAEYLA FT TVGSHKGVTTRTSSPKPGTSRDEAPRPTTPLSPEGATATSRTASISRHRRE FT RSITWATPLAKESPKPSPPRSPKSCSAGRPRIRRGDPYATTEPTTTSSSST FT SSRSGSPPREESPTRGVAEAVAETLRLIDEVRDLPQDIRDSILRRTFCQPK FT PRRSRSRSRKD*" XX SQ Sequence 10610 BP; 3382 A; 2325 C; 2558 G; 2345 T; 0 other; gtggctgccc aatgtgtggg gcacggtcgt ggaaaataat ttacgagtga tagtgttcag 60 tgcgcgacat tttgaaacgt ggcaccaatt tcgttcaaag ttaatcgtga gccgtgtcta 120 actaaaatca agtgtataaa taattccaat gaagcagaga gttgtatgat actgccggaa 180 aaatttaaat tttattcgtg taaccgccag agaaaaattt actcgccacg tgctgacagc 240 cgccgtgatc gcaaaaactg gggttctcaa atgtaaaatc atggtttacc tcagtgctct 300 ttttcgatta tagcctaaca aaagacgagt ttgtgcattg ctctaacagt cggttgagtg 360 tgcaaatttt tcccaacacg cagtagtgga aattctgccg aaaaccactc gccacgtgtc 420 gatagtcgct gggagtgtaa aaactggggt tctcacaagt gaaatcatgg tttacagaat 480 tacgcccttt taaactttac agtggcaata tacgggttta ttaatatttt tacggccaat 540 tgtgagtgca gaatattttc agatttaata cgaagccgaa ttaaattttt tttttttgcc 600 ttcttgcaaa attcaagttg tagtagttga gagaaaatta acgtttgacg gacccgtttt 660 aaaatttttt tttcttttgc ctcgtttaga attataaagg gattcgaaac gaagaaaata 720 aatttttttt tgtcgtgaaa atttacagcc gtaattttcc gaagaataat tggagttata 780 gaatttttaa aataccgagc agtagttgaa gacgtcgaga gttgaaaaat aatttttttt 840 ttttttcccc tttgttgatg tgtttgttga gagagtagaa gccgataaga atagtgaaga 900 aaattttctt ttgcctatag tcgcttgata attgcttaaa acaatcattt tagccgaaca 960 aaattttttt ttgggaattc aatgtcggga aaaattttgt gatagcctgt ttgacttgat 1020 tggaaagttt taaactaaga ttcaattcta gccatactgc cgactatcct taaccccgaa 1080 cgagttaata aacaaatttt gtggcccgat tgcgagttga atcaagcact agcaatgaaa 1140 gaaaattaat ttacccactt tactatttaa ttacaattga cgcaactcat gccgattgcg 1200 catgaataat tataagtatc aattgaggta atatttcaat atctataaaa tatatacaca 1260 taatctattt ataatttata aacatatata tgtatattga tatttttgta cgcattccca 1320 ttgtaaaaat tttaacattg aaatccgatc ccgctggctt cgtataacct ttctgcctgg 1380 atgaaagttt ttttttttgg ccccaaataa atcacttgtt ttagccggtt aacagtagaa 1440 cggctaaatc agttgatgaa tattacaccc ataagctagc agtcgaataa cccatttggt 1500 ttatcagtaa gacggtcaat cgttactggt ttctttttcg gtctaaccct agtaactatt 1560 tctttccttt ttttttgccc agattctaga ttcgcaaaat acgttgtact aatcgaaatt 1620 tttgaagtcg ggttgaattc gttttccagt gagtgaatga gagatagacc ctgggtgggc 1680 ccttcgacct ttacagcggt ttagctgaga aatatattga attttttttt gcttttcccc 1740 gattattcga acccaatcac caccgtgtaa agtattgcgt agagaagaaa accagtcgtt 1800 taaccggtta ttatctgcca agaattttat ttttgagaag ttcaagaagt atttacagcc 1860 tatcttatca agaaataaaa gaagaatata gagagaaaat tacagtcaga ccaccaagaa 1920 gtttagaaaa ttgtatacgc ttttccatca accaaggctt aaagggtaaa tagcagaaag 1980 taaccatgga taatcacaga aagatgatgg acgcactgcg taaagatcga gaagttcgta 2040 acttggccag cgagattaga aagtatgata aagaagacgt agaagaagaa ctcaagtcgt 2100 tgaaaattag tacgaaggga aataaggcga ttctaagaga tcgcttgtta agagccgaaa 2160 ttttacgagc cggtttgagc gagacagtgc cgtggtatga atgggacaat ggtggcgtta 2220 tcccaactga ctccgaggag gcgctgacgg cgatcctcaa ccacaagaaa agtaaatcca 2280 gaacgagtag taaacgagat aaaaaagaag tgatcgccga gtctgcctca caatcagaaa 2340 ttgagactgg caaggaaacc gagtatgaag ccggtgatga aagcctcgaa gaaattgtta 2400 actcttcagt gagacaaata caagctctgt cgagcctcac ttccacagac gatggaacgc 2460 ccggttttgg atgcccgtta acagtatcaa atcagctcca acaactcatt tctgcacgag 2520 tactcagcgc ccttcaggga tcatcgagtt atgctaccag ccagaagtcg atagatccca 2580 catctctact attcgtcaat aaaaccacta gagcccagcc agaagtcgtg ccagtgtccc 2640 gtcttgcctc cagcccaaca ctcgactcta acgagggagc tcaaaaattt atggggattt 2700 ggccttatgc agaaaccatt aggaactccg ttcagcccaa tacgggaaca attaaggaaa 2760 gtagcacgct gagaagatcg atatcagacg atgggaaatt cacgagacaa gtacaagagt 2820 gggaattaag aacgccgaag tcgaaaaaca gacctcacgt tcaaccgacc aacttaaagt 2880 cagcgctcaa cgaaactgag gatgtagaaa ggaagaagcc gacagaaccg tgtgccatta 2940 atcagcctag aagaagaaaa atcacagtgc aaaaccctgg gttgccctgg ttcgccagag 3000 gagcttctaa gaagccggca tcacagccga gagccaagaa atcatctgat agtagtagtg 3060 atagcgaaga agatagtagc agtgacgaat cggaaaaaga agagtcgaag aacagagccc 3120 taggtaagac tcctacaaag gccatagaca ccaagagcaa acaaccaact cgagagagtc 3180 tgaagaaaga gaagtccaag aagaaacaga ccaaaagaga cagctcgagt agtagctccg 3240 acagcagtga cagccttagc agccaagaga acgacttcga tagctcctca agttctgacc 3300 gccacgaaaa acggaaaaca aaaagctgca agaagaaatc gtcccgacga ttagaacagg 3360 aagagagatt ggccaatcac ttgtatcaac tccaaattat acaggggcgg aacatcaagt 3420 ttgctggtga gactagagac gacccagaag aatatttatc tcagatacaa gagtgcaagg 3480 agagtctaaa tcttaaaagc gaagagattt tggccatgat tccccatact ctatcaaaga 3540 aggctagcat gtggttcaga acagaaaaag aaggacttaa aacctggaag acctttaaga 3600 aggcattcga agaacaattc atcagcgagg tgagcgacaa tgacgtaata gaagaactac 3660 gtagacgaac gcagggtaaa ggggagaaga tatctacgta tattatgaat ttcaaatata 3720 tagcagccca tctaaagaga ccgatgagat taaaagacca gctgacatta ttgacagaga 3780 ggttacggcc ggagtatcgt agagcgatta gaaggaaaaa gatagaatcc tacgaagatg 3840 ttaagaggta ctgccgaaca ctagaagagg agatggaaag agatgcgatg tatacaccac 3900 cacccactgt tgaaaaaagc cgattcccag ccgcagcttg gacgccaaca ccgaaagcca 3960 aagtagcagc cgccaaagaa gttgaggaag cagcaggagt taccgacgcg aattcaaaca 4020 aacaaaatat taacaacaac aataaaaata agaaacagaa gcaacaacag aacccgccag 4080 tagtaacttc agcctcgatc acagatcaac cagtgaataa caactcaact agcgagccca 4140 agcagactaa caatggccca ccagtgtgga gaccagttaa cgttcaatcg caaaggcaga 4200 cccaacagtc gcaaggcaga gctgacgcca gtgatgctac ccagagtcaa ccactaagcc 4260 ggtacaataa cagaggttct acaccatttg tcggagcatg ctatcactgt caacaagtgg 4320 gccacagagc atcagcctgc ccgactgtag aatgttacgc ctgccatcag aaaggccata 4380 aatcgccagt ttgcccaaat cgcagccgga gccagatcca gtgtcaagtt tgcggccaat 4440 tcggcactac cttccaaaat tgtggaaact gcgctacctt tcgaacgatg tggggaaatg 4500 gcaacgcggg ggcatccctg gggccgatgc cacccgcaac tcaacaataa cggcctcaaa 4560 cgttgtggtg aaaacttcta gctctgtaac tccgttgtca gaggtagttg aagttttggg 4620 agggggaggt agtggccgag gaccactaag tacagaggtg aaagaagaga aagagaaatt 4680 gctgccaaaa gctaacgcaa cccgttggcc aaataacagc acagagacga aagaagagaa 4740 agagaaattg ctgccaaaag ctaacgcaac ccgttggcca aataacagca cagagacgaa 4800 agaagagaaa gagaaattgc tgccaaaagc taacgcaacc cgttggccaa atatcagcac 4860 agagacgaaa gaagagaaag agaaattgct gccaaaagct aacgcaaccc gttggccaga 4920 taacagcaca gagacgagag aagagaaaga gaagtcacgg ccaaatgcta acgtaacccg 4980 ttggccaaat aaccgagacg aagtagagga agagaacaga gtagcagccg gcgaagcagt 5040 tctaaactcc tcgagcaacg aagaggacag tgtatcccgc ccgtggagaa acgagctcca 5100 gagcaaagta cattacagcc aattcgaaac tcaaaacgag tggaataaac ctacgccgct 5160 gacactggac tcaagtagtg aatcagatgg gtcgttattg gatgagagag tgaaaaataa 5220 tagaagccaa acctctgcaa acttgaacgc atcgaagact agaggcctaa ccctagaaca 5280 gggcaccctt tactcccaag aatgcattaa taaagtcaga cagagagttg ccagtagaga 5340 agagaacgag aagacagtag ttggaaagga gtgtgaaatt gagccgttaa aagtagagaa 5400 gtcggagaag agagtctctt tcgagttaga atcggaacag tcggatctat cagatgactc 5460 agaccttgtg gaggtttcgg gaacctggga agatacgcca actcaagtca aaagagagaa 5520 gattatcgag ttgatagaag aagaagaaga ttttgaaaag ttgagtaaga ttgaattggc 5580 tgtcgagcaa ctacagtcga cagatatagc agccgaaaac caatcactac cagagactaa 5640 ggaaattgct gaacagaaga agccaatacc gccagacccc aacgtcgact acgtaagccc 5700 acccatccaa atctcagacg aagtagagca tcaagctcct acggcagctg tgataaccga 5760 taatcggaac tacatgcaag tagtcttgag cggaacaacc tatcgggcac ttttcgatcc 5820 tggcgccgta atcaccttag taggccccag agtagccgag aaatttgaaa accgactgat 5880 agccgctaaa acttctattc aggcgattac gggcgaacta tcacccgtaa tgggatattt 5940 acagatcaat attgaattgg acggcattga tgcaaccatt acagcccgcg cagttaaaga 6000 gattgggcac gacgtggtat tcggtagaga tttctgtgaa atctttaaaa tagatacgga 6060 tcacaggggt tggtggagag caaacggtgg aatatggcga agatttaata gccaaaaccc 6120 gtctgaaaat gataaagttt tcgccgaatg tgccggaata acagaattag atgaagacga 6180 aagaagacag atagaagaga tagtcgacgg aaccctgccc gaatactcgc cggatgtatt 6240 aggtttcact gatctcaccg aacatcacat cagcctaacg tgtgatacgc cagtgagaca 6300 gaaattccgc cggagatcac ccaaaaagat agagtcgatt cagaaaggag ccaaagagtt 6360 agagagacta ggtatcatag agagatcagc cagtgatttt gttagccaag tagtgatggt 6420 gccgaaacaa ggcacggacg aagagagact gtgtgtagac tttacagacg tgaataaatt 6480 tactaaaaag gatgggtatc cgttaccgca aatggacgca attcttgaca gactgaggtg 6540 tgccagatat ctgagtaaaa ttgacctacg tcaagcctat ttccaagtca agatggagga 6600 aagtagtaag aaatataccg cctttgccgt accaggatat ggattgtggc aattcactag 6660 gatgccgttt ggcctgatta atgcccccat gactctacaa cgtctagtgg acaccttatt 6720 cggccccgaa gatactcctc aaatttttgg ttatttagat gacatagtga ttgccacaga 6780 ctcttttgaa gagcataaaa taaaagttag gtatgtacta cagaaactga tttcagccgg 6840 actaactatc aaccccaaga aatgccaatt ttgtgtgagt cagatcaaat acctaggttt 6900 cgttctagat aaagacgggc ttaggactga ccccgacaaa gtcgcgccgg tattgaattg 6960 tccagcaccg tgtaatgtca gagaattaag aagagtattg ggaatgatag gttggtatgc 7020 cagattcatt gaaaacagtt cggaaactaa aatcccacta gttaaacttc tacgcaaaga 7080 ccaaaaatgg acttggggag acgagcagca acaggcgttc gaaaaactca agaaagcctt 7140 aaccgtagcg ccagtactcg ccagacccga tttcaacaag cctttctgta tccaatgtga 7200 cgccagcaac gtcgccatag gagctgtgtt aactcaggag tttgacgacg gagaacaccc 7260 gattgtgtat gtcagccgag tcctaacagc cgccgagaag aactacacta ctacagagag 7320 agagtgtcta gctttagtat gggccataaa gaagttacgg ccatatgtcg aaggatacca 7380 tttcacagtg atcacagatc acagtgcact acgctggctt cgctcgttaa aggagcccag 7440 tggaaggcta gctagatggg cacttgaagt tcagcaatgg gacttcgaca ttattcaccg 7500 tagaggcgcg aaccatagag tgccagacgc cttgtcgaga atgcttgagc cagaattagc 7560 agccgtggcg gagatctctg atccttggta tctccgacga tttaaagagg tagaagagtt 7620 tcctaataaa tttccccagt ggagagtaga agatggtcgg ctataccgat ttaagagaga 7680 tccgttgttg gaccctatta cccatagcga cgaagcctgg aaattagtca taccagccga 7740 acagagagag aaagtcatga ctgttgccca cgtggaagtt accgccggac acatggggat 7800 cgaaaagacc tacgatagaa tagcgagaga atatttttgg ccgggaatgt accacgacgt 7860 ccactccttt atcaccgcct gcccagtctg ccaaagatac aaagtgtcgc aacaaggccc 7920 acaaggcctt atcggcaaaa gaatcgtaga aaagccttgg accgtggtag ccgccgatat 7980 gatggaattc ccaagaagta aaggccagaa taaatactta ctagttttcc aagacctgtt 8040 tacccgctgg gtggaagtca agccattgcg taaagcagac ggaaaatcag tcgccagagc 8100 ttttgaagaa ctagtgttat ttcgctggga gacgccagag tatttgttga ccgataacgg 8160 aacggaattc ttgaacaagc acgttgccga gacgttaaag gaatacggtg tgacacacgt 8220 tactacacca ccataccacc ctcaagccaa tcctgtcgag aggagtaatc gcaccctcaa 8280 aacaatgata gcgatgttcg ccgagagtca cagagattgg gataaacatg tccacgagct 8340 tcgtcacgcc atgaataccg ccgtacaatc aacgctcaaa acatccccag cctttttgaa 8400 ttttggcagg cacccctcac cagtcaagag tctacgtacg gaaatcgagg gtcgaggtcc 8460 aaaattacaa ttagatcctg ccgtgtggca agatcgtatg aaacgactgg atgcgttacg 8520 agagctcgtc gccaagttta tagataatgc gagaaataag cagggggaga catataatag 8580 aaggcgtaga ttgaccaatt ttgcagtcgg ggatatggtt ttgaggagaa ctcactatca 8640 gtcgaaagcc gctgagagtt tctctgccaa gttagcaccc aaatttgagg gaccctttcg 8700 aattacagag cagaaatccc ctaccgtgta tatcttggaa gccgaggagg gtaacagtcg 8760 gaaaatcgct aaagcacacg tatcggacct gaaaagatac ttaccgccgc gaaatcctaa 8820 aactaacagc cctaacaccg atcccacgag ccggtagtga actgcagccc gtggaaggac 8880 gtacctggcg tatagacatt atatgtggtt tgtattggtg gtggagtaga aaactgtgtt 8940 gtccctatga tgagtgtgga attcgcattt aattacagta atgttacaga taaggaactc 9000 cgtgatgcca cgccgaactc gctccagaag cctcctgacg gagtcacgac atgttgccga 9060 ggaggagttc gtgccaccga cgtttactgc cgtgccgcca gattttgtgc ccccgcagtg 9120 gtccacgaga cacgagcgta ttcgtgaccc taacggcccg aagagcgagg agacgatgca 9180 ccgcgagctc agggtccggg acgctattcg tcgccgagag atggaagagg aagaggagag 9240 agagagagag ttgttagccg gatcggacga cgaggcgaga acagtcgagc cgatggaggt 9300 ggccgccttc actctggtac cggccgagac caacaccaag gtgggtgggg ggaggaaccc 9360 gagctccaaa gtctcgcctt ggtttcgcct ggagagcgcc gaggaacggg agagcagaga 9420 ggccgtggag gagttgcgcc gcgtcgtgga gaaggaaacg ggtgtagacc tccctaccca 9480 ggtcgaagag cgtactctcg acacgagagg cctaccccaa cggcccttca ccacgcgcga 9540 tgctcgcgag ggcaccacga ccccaaaaaa ggaccccgtg tgctctactc ccgaagatac 9600 ggtaagggag aggaagagag gactccagtt gaaggtgaag ctgtcaggcc tccggtacag 9660 cgggctgaag cccattccaa cggaccctga actggatcca cccccataca gttgtttcaa 9720 ctgttggggg aggggccaca cagtcttcaa ctgcccagaa ccgagacacg gcaaaatctg 9780 cctaaattgt ggtagatttg gagaggacct caactcgtgc ccgcgctgtc gggaccgcca 9840 cgctgagtat ttggcgacgg taggctctca caagggtgta acgaccagga cgtcaagccc 9900 taaacccgga acgagtcgag atgaggcacc tcgccccaca acacctctgt cacccgaagg 9960 agcgacagcc acctcccgca cggcgtccat ctcccgtcac cgccgagaga ggtccatcac 10020 ctgggccacc ccgctcgcca aggagtctcc caagccctca ccgcctcgta gtcccaaaag 10080 ctgctcggct gggcgtccgc gaattcggag gggggatcct tatgcgacaa ccgagcccac 10140 taccacctcc agttcttcta cctcgtccag gagtggatca ccgcctagag aggagtctcc 10200 cacccgtgga gtagctgagg ctgtagccga gacgttacgc ctcattgatg aggtccgaga 10260 tctgccccag gacatcaggg actcgatctt gaggcgaact ttctgccagc cgaaaccccg 10320 acgatccagg agtaggtcca ggaaggactg agttcgctct cacagccaaa ctacagccag 10380 ttacagccag tagagtgagt ttcctaatct agaccgggtt tatcggccca aaattttgtt 10440 gacctttttt tttattattc attgtataaa ctaatcatgc aatccttaca gccctgcagt 10500 ctcgagatag tagttgtgag aagaagagag agagagagtg acatttttct tccttcttag 10560 gtaaagaaaa actgaaacat tcagagtatg tttagttttg ggagggggaa 10610 // ID Gypsy-5_DWil-I repbase; DNA; INV; 7194 BP. XX AC scaffold_181075; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_DWil_; KW Gypsy-5_DWil-LTR; Gypsy-5_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-7194 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181075; Positions 153096 160289. XX CC Positions [4124-4681] - Reverse transcriptase CC Positions [5984-6472] - Integrase core CC 'ACAG' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 4346..6826 FT /product="Gypsy-5_DWil-I_1p" FT /translation="MSGFHQIELEEKSRDITSFSTSNGSYRFTRLPFGLKI FT APNSFQRMMTIAFSGPEPSQAFLYMDDLIVIGCSEQHMIKNLVDVFKLCRK FT NNLKLHPEKCSFFMHEVTFLGHKFTDKGILPDDTKFDVIKNYLVPHDADSA FT RRFVAFCNYYRRFIKNFAEYSRHITRLCKKNVPFEWTAECENAFQYLKEKL FT VQPNLLQYPDFSKEFCIITDACKQACGAVLTQNYNGLQLPVSYASRAFTKG FT ESNKSTTEQELAAIHWAIPYFRPYIYGKHFTVRTDHRPLTYLFSMVNPSSK FT LTRMRLELEEYDFTVEYLRGKDNFLADALSRITISDLIDMNKRILKVTTRY FT QSRQNNFCAENKQELLPRQNLEKASKPNVFDVIVNDEVRKVVTLRITDSST FT TFKHGRKVIARIDICDIYTNGILDLGQFFQRLKKEAGILSITKVKVAPSEK FT IFETISIESFKNMGNQILQFLRVALLTPVTLIQNDEEKEAILSTFHEDPIQ FT GGHTGINKTIAKIKRRYYWKNMTKYVKKYINKCPKCLTSKTTKHTRSPMII FT TETPQYAFDRVIVDTIGPLPRSENGNEYAITLICDLTKYLVAIPVANKSAT FT TVAKAIVESFILKYGPMKTFITDMGTEYKNSIINDLCKYLKIDNITSTAHH FT HQTLGTVERSHRTFNEYIRSYISVDKTDWDIWLQYFVYCFNTTPSVTHNYC FT PYELVFGRQSNLPIQFNSVNTIDPIYNIEDYSKEVKFRLENACKRARILLE FT KNKIKQKEINDSKVLDFKIEIGDQVYLKNETGHKLESSYIGPFEVISIEKN FT YNVVIKNRKNREQKVHKDRLKRNNK" XX SQ Sequence 7194 BP; 2845 A; 1309 C; 1176 G; 1864 T; 0 other; tggcgaccgt gacaggactc taaaggacct acaaggactc tccatgctaa caacagcaac 60 aataaaaaaa aaaccaagaa aatttcaaca aaacaaacaa aaaaaaaaag cataaaataa 120 tgtctcacaa tgtgttgcaa ctactttagc gatagagagc agcattagca gcagcagcaa 180 gcagtctggc aaccctgttc cagctctccc cccacccggc tttaaaccaa caatctttat 240 aaaaaacaaa gaattcaatt gaacgtttta agcgccaaaa acaacaaatt cccgcccccc 300 gtcaagtatc catcttttcg gcagtggcgt tcgaaatcga ccaatcaacc aacaaggagc 360 acgaatgaac agtgaagata ttccctttgc ggaatctgct cctatcgagc aaagggatgc 420 accgacgcta gaggaagcgt tagagctgta tcccaacgat ggaccgcgcc cactcacaat 480 agctgagtac ggggcccgca acacaaaatc tacaaagcca gagccaaaga agaccaaaag 540 aggaggccaa atacacaagc tactccagca gaagaggctg gaagtagact tactggccac 600 tctgaagacg gaggaagacc gacaacgctg catagagcgt attgacatcc tgaaagctga 660 acttcgccaa agagcaaaga aacgcaaggg gccgcagaaa tgcgcaagcc ttaacttgcc 720 tttaatttaa taattctaat tataagctta atagcaagtt ggagcacgaa agtgcaactt 780 gcgagctact aactttctgc taggctatta ctaacaatgt ggttggagaa tttttttttt 840 aaaacaataa aacaaatgaa atgtatacct gaattgtgag ccaaccacat tctctatatt 900 ttttccacgt ctctggctcg ctgtgcaaga cacagttgag ataaatttta tttctctctt 960 actttttttt tcccttcacg aataatttaa ataaaaaaaa aaaaaaatat gcatgaatcc 1020 ataacaattt aattttagta tttcaaatag aaatcaaatt caagcaaata tgtaccggtc 1080 aaaacatatt tacaaaaaaa attaaagtaa ttgctaaaca aaagttttta acctgtcttt 1140 atgaacttta tttcatattt tgcacacttg taataaataa aaacaaaaat cccatacgta 1200 tgttcgataa tagcataagt aaatttgatg acatatccga tatacccaca atggggttaa 1260 tcgaaacaaa atcagtggac tccactgcga atgtcataaa taaaattgaa tcatcaaatg 1320 ctgatatgag attaacgaac atactactat taataattgt cataatattg tgcgcatatg 1380 tattatataa ggcttaccta ttacacaata aatgtgtaac aaaaagagct ctaagcctct 1440 aagagctcta aaaaataaac aacaggcacc ctagcatagt ccaagactta attgaggttt 1500 ttagaagcga gggaagtacc tcacctcaac gctccataaa cttagtacaa attaggttag 1560 cttaccttga tataacgggc gaagatttcc cagtcaaagg aggcactagg gtacagatga 1620 gtttcattct cacggtacca tatgtatgct gcttttcaaa cgcagcaggc acactgcgtt 1680 tctacctaat ggaggaacca aagcagtaaa gaactaaata catccataat cagttaaata 1740 ggcttaggca gttagtgtag aatatggata aagcagaaac aaaaaagctc caaaaatatt 1800 tatctttcct tgatttcctg atacaatcag gtgaaacaaa tcacaaatac aaggaaataa 1860 ggtattggat agcccacaac tacaaatttc gaaaaaacat tgtagatctt atggaaaaat 1920 ccataaaagt agcttatagt agattattag ctaagactcc tttaagaaaa gacgtgcccg 1980 gatgggtcca cgaacttttc aaggactcaa ttgcgctgat aaacatatta gaatcgagtg 2040 tagaaccaat agatttaaca gactcagata tagaaataat agacatagaa tagtaaaacc 2100 ttacacattt tcaaaatttt ttttcttaaa ttaacaacag gtctaaacgc caaattcctt 2160 ttttgtttag taaaatgggt tttcaagaat ccaaaataga aatagaaatt aatcttaaca 2220 aactatggtt acttgggatg atataataga aaaagctaga atacacgcac aagaattaac 2280 taaatcttgt aagtgtctct cgcagaatag agaaacatcc caagacacag taaccaaaca 2340 cacgaagata gtaaccgaga atcttcaagc tattagagcg gtattaactg aacactatag 2400 caaactaaag acatggcaaa aaacagacgc aaatgcgttc tacagtgatt gcaaacaaaa 2460 agcaataact gtgctaaaca gacacaatat tgaatttaaa caaacatctg atttggaaac 2520 ttcaatagta ctggaagaaa aaacgcaact tatggctgat gaagaaataa gtcaagaaga 2580 accatcagtg ataagtcgta acaaaccaac aattcccaaa gaaaacaaca aaatggtaca 2640 aacaagggta gaattcataa atacagccac caagctgata ccatacttcg atggtgatag 2700 taaaagtcta ccaggtttca ttggagcgct tcgcgttctt gacaccataa aagaaccgca 2760 cgaagcggta gccatccaac tcataaaatc aaaattaaaa ggtcaagcgg ctaatagcct 2820 aagctcagaa aaaacgattg atgaagttat acaaagacta tcaacaatct tcaagggtga 2880 ctcagtagat cttttaacag caaaaattaa aaatctgcag agaaacccca ataactgtgt 2940 ttcatacaca aacgaaatta ctgagttgac cgatgccctt aaaggcgcat atatttcgga 3000 tggtcttcca actgaacaag caataaaata caccaccaaa gtggcagtaa atgctatggt 3060 aggtcatacc aaaagtccaa gagtccaatt attaatggaa tcaggaccga gcaaataacc 3120 aacagattgg ttataacaat accaatgtcc gagtagcaca agatactact acgagaaact 3180 tccagacacc tccagatact cagcgctaaa taaaaaaata tatacggtta acttgagtct 3240 caataatttc gtattattca aaaatgtttc aacaaataaa aatttaacat ttctgattga 3300 tacaggtgca gatatttcaa ttataaaaga aaatactgat gtctttcacg acatccatac 3360 aaataaaatt atagatattc gaggcatagg agaagggata ataaaatcca aaggactagc 3420 ctcaattgag ctacaaacag acaaatacat tattccatat aaatttcata tacttcatga 3480 agatttcgca atcccctgcg atggaataat aggactagat ttcatgaaaa gatttaattg 3540 ccaattagat tacaatagtt ccgaggattg gttaataatt agaccaccaa atttaacgta 3600 cccaatttat gttccaataa cattcaattc tggtaataac tcaacacttc taccagcaag 3660 atcccaagtg gtacgcaaaa tagaactatc ttcagcagaa gataacattt taattccgaa 3720 tcaggaaatt aaaccaggcg tttatgttgc aaacactatt gcaacagccc aaagtacttt 3780 tgtccgattt ttaaatacca cagataaatg ccaaatagta ggcgttaaca atataaaatt 3840 tgaatcactt tcgaactacg atatagttca aaacaccaaa gatataagaa aagaatctgt 3900 aattcataaa ttgaggaaaa gggtcccaat actattcaaa gacaaactcg aaaagttatg 3960 cactgggtat agtgatgtgt tcggacttga caccgaaccg atcacaacaa ataattttta 4020 taaacaaaaa tgaagactta aggatgatga accggtatat gtgaaaaatt tccggagtcc 4080 gcatagccaa ctaaacgaga tccagcgaca agttggaaag ttaattgaac agggtattgt 4140 tgaaccatct gtttcgcctt ataatagccc actcttgcta gtcccaaaga aaacactttc 4200 gggatctaaa gaaaaaaact ggcgattagt aattgactat cgtcaaataa acaaaaaatt 4260 gctgtcagat aagtttccgc tccccggaat tgatgatatt cttgatcaat tgggaagagc 4320 aaaatatttt tcttgccttg acttaatgtc aggttttcat caaattgaac tagaagaaaa 4380 atcaagagat ataacgtctt tttcaacgag caatggctca tatcgtttta cacgattacc 4440 tttcggtcta aaaatagcac ctaattcatt tcagagaatg atgacaatag cattctcggg 4500 gcctgagcca agtcaagctt tcctctatat ggatgattta atcgttattg gttgttctga 4560 acaacatatg attaaaaatt tagtagatgt ttttaaactc tgcagaaaaa ataatctaaa 4620 actgcatcca gaaaaatgct cattctttat gcatgaggtg acattcctag gtcacaaatt 4680 cacagataaa ggaatcttgc cggacgacac aaaatttgac gtcatcaaga attatctagt 4740 cccacatgat gcagatagtg ctagacgttt cgttgcattt tgcaattact atagacgttt 4800 cataaaaaat ttcgccgaat attcccggca cataacaaga ttatgtaaaa agaatgttcc 4860 attcgaatgg acagcagaat gcgaaaatgc attccaatac ttaaaagaaa agctcgttca 4920 acctaatcta ttgcagtacc cggattttag caaagaattt tgcataatca cagatgcatg 4980 caaacaagca tgtggagcag ttttaacaca aaactataat ggactccagc ttccagtttc 5040 atatgcatca agagcattca ctaaaggtga aagcaacaag agtactacag aacaagagct 5100 agcagcaata cactgggcaa taccatattt tcgaccatat atatatggaa aacatttcac 5160 agtgcgaaca gaccatagac cattaacata tttgttctcc atggtcaacc cgagttcaaa 5220 actgacacgc atgcggctag aattggaaga gtacgacttt acagtagagt atttgagagg 5280 taaagataac tttttggcag acgccctatc aaggataaca atatccgatc taattgacat 5340 gaacaaaaga atcctgaaag tcactaccag gtaccaaagt agacaaaaca atttctgcgc 5400 agaaaataaa caagaactat tgccaaggca aaatcttgaa aaagcttcca agcccaacgt 5460 atttgacgtt attgtaaatg acgaagtacg aaaagtagtg accttgcgaa taactgattc 5520 ctcgaccaca ttcaaacatg gcaggaaagt tatagcaaga attgacattt gcgatatcta 5580 taccaatgga attctcgact taggtcagtt ttttcaaagg ctcaaaaaag aagccggtat 5640 acttagtatc accaaagtca aagtggcacc gagcgaaaag atctttgaaa ctatttcaat 5700 agaatctttc aaaaatatgg gcaatcaaat tttacaattt ttaagagtag cgctactcac 5760 gccggtgacc ctcatccaaa acgatgaaga aaaagaagct atattgtcta cattccatga 5820 agatccaatt caaggaggtc atacaggcat aaataaaaca attgcaaaaa taaagagacg 5880 atattattgg aaaaacatga ctaaatatgt aaaaaagtac attaataaat gtccaaaatg 5940 cctgacgtcg aaaacgacga aacatacacg atcacctatg attataaccg aaactccaca 6000 atatgctttt gacagagtga tagtggacac aattggtcca ctaccacgtt cggaaaatgg 6060 gaatgaatac gcaatcacat taatatgtga cctgacaaag tatttagtag caattcctgt 6120 ggcaaacaaa agtgcaacaa ctgtggcaaa agccatagtc gaatctttta ttctgaagta 6180 cggtccaatg aagacgttca ttacggacat gggaacagaa tataaaaatt ccattataaa 6240 tgacttgtgc aagtatttaa aaatagacaa tataacgtct acagcacacc accaccagac 6300 attaggtacc gttgaaagaa gtcataggac cttcaacgag tacattcgtt catatatttc 6360 agtagataaa actgattggg acatatggtt acagtatttc gtgtattgtt ttaacacaac 6420 accatcagta acacataatt attgtccata tgaacttgta tttggtagac aaagtaacct 6480 accaattcaa tttaatagcg ttaatactat agatccaata tataatatag aagactattc 6540 taaggaagtt aagtttagat tagaaaacgc gtgtaagaga gctcgaattc ttttagaaaa 6600 aaataaaatt aagcaaaaag aaataaatga tagcaaagta ttagatttta aaatagaaat 6660 aggagaccaa gtttatctca agaatgaaac aggtcataaa ttagaatctt cttatatagg 6720 accgtttgag gtcattagta tagagaaaaa ttataatgta gttataaaga ataggaaaaa 6780 tagggaacaa aaagttcata aggatagact aaaaaggaat aacaaataaa acacaaaaaa 6840 atatatatat aaaaaaaaaa aaaaaaaaaa atttaataag aatatatata tatagatatc 6900 ttttaaaaaa aaaaaaaata cataaatata taatatcaaa aagtgaaaac gaaaaaaaaa 6960 aatataaata taaaattttt tttatggatt catctattat tgaaaatata ttcatatgat 7020 ttcgaaaaga aagagagaaa aagaatttgt ctgagcagac acaaactaag acacaaaaaa 7080 aaaatttttt tccccttata tacacattta ccttaccctt attttacctt taattaatta 7140 tctaatgatt attaataact ccataatatt acgttattct ctaaaaaggg gagg 7194 // ID RS3_LM repbase; DNA; INV; 385 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE L.major repetitive DNA - RS3 repeat, consensus. XX KW RS3 repeat region; RS3_LM; junction region; minichromosome; KW Repetitive DNA. XX OS Leishmania major OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania major species complex. XX RN [1] RA Ortiz G. and Segovia M.; RT "Characterization of the novel junctions of two minichromosomes RT of Leishmania major."; RL Mol. Biochem. Parasitol 82(2), 137-144 (1996). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "L.major repetitive DNA - RS3 repeat, consensus."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [2] (Consensus) XX CC Average similarity to consensus 98%. XX SQ Sequence 385 BP; 56 A; 167 C; 102 G; 60 T; 0 other; cgtgtccctt ctccgccaca gcgcaggaga tcaaacgcca ctcgagcccc gcaggcgtct 60 gccacagggg cccacaccag cgtggtgcga agcagtcgtg gcacacacgc cgttgcagca 120 gtgcgctcac tcagccatct gagagcacgc ccctcccccc gcccctgcct cgtgccctac 180 caacctctcg ctttcccctt ctctttgccc ccacccgcct ccaggtcccg cctcacagcc 240 gctcccacca cgccggtcgc cacccctggt gcatccctcc tcggggcatg gctcagcctc 300 cccgcgccgg caggcagtgc gcgagggccg ggtgagagat gcgcccgggt cgctctgaca 360 cgctgcccaa tcatatggat ggcgc 385 // ID R2A_NVi repbase; DNA; INV; 6975 BP. XX AC . XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 08-NOV-2010 (Rel. 2.02, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Nasonia DE vitripennis. XX KW R2; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Retrotransposable element R2; NVR2; R2_NV; KW R2A_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6975 RA Burke D.W., Eickbush G.D., Xiong Y., Jakubczak L.J. RA and Eickbush H.T.; RT "Sequence relationship of retrotransposable elements R1 and R2 RT within and between divergent insect species."; RL Mol. Biol. Evol 10, 163-185 (1993). XX RN [2] RP 1-6975 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [2] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. A partial sequence of this family was previously CC registered as R2_NV. XX FH Key Location/Qualifiers FT CDS 701..2164 FT /product="R2A_NVi_1p" FT /translation="DTQNPTCTDASKLRSYYSQNASKEQQLISEGHDEDLG FT MTEGLFSPPVDIGKAKDVEDEIRGHLYFLETVDLSRLSKESQAGIAKAVTG FT LTKGMDLVMYHVRTMTKEAGLAGAFSKVLQETMRKVILKEREQHLQRVQNA FT VKSVGEKVQKAIAEISEIGTGMSSNVDINALAESTANKIARGWKESEKQQM FT SKLEELKQSIDEAKTTGNNITYAQAAGNQWTTVGQKRKRILSSAGLEVTVT FT ETEDVLIKPEQERGEEYPNAAVIIKKLKATIDPDEQNITVERMIPRTNMVV FT AVVKKGDGPKLIEGITRKGIGLLAETRKKLQPRILVQDIPEEMEEEELMAR FT LKKNVSLEAQRDEVRLIRMIKTRRGNKLAVIELPARAHEDLTHLQKVKIGW FT SICRIATDIRPNQCYKCQAFGHHAARCASDAVCAKCAQNHETKTCRNKGAR FT KCANCSKACRADCNHPAFDATKCPIFRAELEKSARNIDYSYSE" FT CDS 2231..5305 FT /product="R2A_NVi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="NQIKKSNTSTGARIPKAMTNPADNFAGGQWKPPGRRS FT ARTSATGMFVCEHCLRAFTTNTGRGLHIKRAHEEQANEAITTERSRARWTN FT EEMEAVQAEIDCEGRTAINQEILRIIPYQRTIDAIKCLRKQQKYKTIRERV FT ANRRAENRARETELTRLETADEDPASQEQDNPNMSLKNWLKEVIESDDDRL FT CADLRTAIEMALAGQSPLDVCTVGCYQYTMTNLPLVPVRLGGPIYWCNAQS FT RSNPGETQRRQTIKESNNSWKKNMSKAAHIVLDGDTDACPAGLEGTEASGA FT IMRAGCPTTRHLRSRMQGEIKNLWRPISNDEIKEVEACKRTAAGPDGMTTT FT AWNSIDECIKSLFNMIMYHGQCPRRYLDSRTVLIPKEPGTMDPACFRPLSI FT ASVALRHFHRILANRIGEHGLLDTRQRAFIVADGVAENTSLLSAMIKEARM FT KIKGLYIAILDVKKAFDSVEHRSILDALRRKKLPLEMRNYIMWVYRNSKTR FT LEVVKTKGRWIRPARGVRQGDPLSPLLFNCVMDAVLRRLPENTGFLMGAEK FT IGALVFADDLVLLAETREGLQASLSRIEAGLQEQGLEMMPRKCHTLALVPS FT GKEKKIKVETHKPFTVGNQEITQLGHADQWKYLGVVYNSYGPIQVKINIAG FT DLQRVTAAPLKPQQRMAILGMFLIPRFIHKLVLGRTSNADVRKGDKIIRKT FT VRGWLRLPHDTPIGYFHAPIKEGGLGIPAFESRIPELLKSRIEALGASNMQ FT TARSLLGGDWVAERKKWINTQKIKNSEWAQKLHLTTDGKDLRDTRKAEASY FT SWIRDIHVAIPASVWIKYHHTRINALPTLMRMSRGRRTNGNALCRAGCGLP FT ETLYHVVQQCPRTHGGRVLRHDKIAEQVAIFMQEKGWLVLREAHIRTSVGL FT RKPDIIARKGQDCKIIDCQIVTTGNDIRIQHERKIQYYASNWELRRSAATM FT IGHQGQVSVEAITISWKGVWEPRSYCLLRDCGIPKVKIKGLTTRVLLGAYL FT NFNTFSKATYRTERRRTAN" XX SQ Sequence 6975 BP; 2321 A; 1496 C; 1777 G; 1381 T; 0 other; agagcggttc gatcgcacaa tggctctgtg taaaagtgtc taattagtgg tttaggagtg 60 ccggagtgga ccagaaagtg aataacgagt gctgttggtg attctgcatg tgtggtaata 120 ataattgttg caatacctgc taagtggacg cgaaaagaac agaaaatgtt aaactaacct 180 caaaatctac cacgcgccac atggccgtac attacttgaa agacgctaaa caaaccgtat 240 gtgaacctat ggaacaatat atatcaaaat gttcggaaaa aagtcgcgcg aggttgtgaa 300 ctgttagcag aatgttattc taacaatatc tcgagcaatc atcggaaaaa gtgaacacgt 360 ggactaacat aacctcaaaa agtgcgtggc catctgccgg gggatatggt actgccgtgt 420 gggctcggtt ggtaagacag tatactatct tatcgactgc aggtcaaact gtgtgtactc 480 ggtaagttat ctttgtggcc ttccgacagt gggtactgca gttgcgtagg caggttgtcg 540 caacataacc tccaccaaga cacggtgctt ctggcgatac gatggggaca ccaccaaaaa 600 ggtcaggctc ggaagattcc agaatcaatc ggaagtcaac atcaaaggat acgatgaaga 660 aaccaccaaa tgtccagaat agccaagata agaagcatga gacacccaga acccaacatg 720 tactgacgcc agtaaactca ggtcctacta ttcccagaac gccagcaagg agcagcagct 780 tatttctgag ggacacgatg aggacctcgg catgactgag gggctgttct cacccccagt 840 tgacatcgga aaggcgaagg acgtggaaga tgagattcga ggtcaccttt acttcctgga 900 aacggttgat ctctcgaggc tgagcaagga atcgcaagcg ggcatagcaa aggcagtgac 960 tggactaacc aagggtatgg acttggtaat gtaccatgtc aggacgatga cgaaagaggc 1020 cggactggca ggtgcatttt caaaggtact gcaagagacc atgagaaaag ttattctgaa 1080 agagagagag caacatctgc aaagagtaca gaatgctgtc aaatcggtag gagagaaagt 1140 gcagaaagct attgctgaaa taagcgaaat aggaacaggg atgtcatcaa acgtggacat 1200 aaatgcgcta gctgaatcca cagcaaacaa gatagccaga ggatggaagg agagcgagaa 1260 acagcagatg tcaaagctgg aggaactgaa gcagtccatc gatgaagcaa agaccaccgg 1320 aaacaacatc acgtatgctc aagctgctgg aaaccaatgg accacggtag gacagaaaag 1380 aaagagaata ctttctagcg cgggcctaga ggttacggtg accgagaccg aagatgtgct 1440 tataaaaccg gaacaggaaa gaggggaaga gtaccccaat gcagccgtga taattaagaa 1500 gctcaaagca accatcgacc cagatgaaca gaacatcacg gtggaaagga tgatcccgag 1560 aactaatatg gttgtagcag tcgtgaagaa aggagatgga ccgaaactga ttgagggtat 1620 aacccggaaa ggtattggtc ttctggctga aacaaggaag aagctacaac caagaatcct 1680 ggttcaagac atcccagaag aaatggaaga agaagaacta atggctcggt tgaaaaagaa 1740 cgtatcacta gaagcgcaaa gagatgaggt cagactaatc agaatgatca aaaccagaag 1800 aggtaacaag cttgcagtaa ttgaactgcc agccagagcg catgaagacc taacgcatct 1860 ccaaaaggtg aaaattggat ggtctatctg caggatagcg acagacataa ggccaaacca 1920 atgctataaa tgccaggcat tcgggcacca tgcagccagg tgtgcctcgg acgcggtgtg 1980 tgcaaaatgc gcccagaatc atgagaccaa aacatgcaga aataaaggcg ctaggaaatg 2040 tgccaactgc agcaaggcct gcagagctga ctgcaaccac ccggcattcg atgccactaa 2100 gtgtccaatc tttagggcag agctagagaa gagtgcgagg aacattgact actcctattc 2160 ggaatagtca ggttacgtat acaggggtga ctccctgccg caaaggcgcc gcaaggaaca 2220 cctgcgataa aaccaaatta aaaagtcaaa cacatccact ggcgcacgca tacctaaagc 2280 tatgacaaac cctgcggaca atttcgcagg gggccagtgg aaaccacccg ggcgaagatc 2340 tgcccggact agcgcgacag gcatgtttgt ctgcgaacac tgcctaagag cgttcaccac 2400 caacacggga agaggactac atataaagag agcccacgaa gaacaagcaa acgaagcaat 2460 aacaacagaa agaagcaggg caaggtggac caacgaagaa atggaagcgg tgcaagctga 2520 aattgactgc gaggggagaa cggctatcaa ccaggagatc ctaaggataa taccctacca 2580 gaggaccatc gatgccataa aatgcctaag gaaacaacag aaatacaaga ccattaggga 2640 aagagtcgca aacagaagag cggaaaacag agccagggag actgagctga caagactgga 2700 aacagcagat gaagacccag caagccagga gcaggacaat ccaaatatgt ccctgaagaa 2760 ttggctgaaa gaagtcattg agagcgacga tgacaggttg tgcgcggacc tgcggacagc 2820 catagaaatg gcactagcag gtcagtcacc acttgatgtc tgcaccgttg gctgctatca 2880 atacacaatg acgaatctgc cactggtacc agtgcgactg ggaggaccca tctactggtg 2940 caacgcccaa agccgcagca atccaggaga aacgcaaaga aggcagacta taaaagaatc 3000 caacaactct tggaagaaga acatgagcaa agcagcccac atagtacttg acggagacac 3060 tgacgcgtgt ccagctggtc ttgaaggaac agaagcatct ggtgcgatta tgagggcagg 3120 ctgtccaaca acacgacact tgcgatcaag gatgcaaggt gaaattaaga acttatggag 3180 gccaataagt aatgatgaaa tcaaggaggt tgaagcctgc aagcggactg cggctggtcc 3240 tgacggaatg acaacgacag catggaacag catagatgag tgcataaaaa gcctttttaa 3300 catgataatg taccatgggc aatgccccag gagatatctt gactcaagaa ctgtactcat 3360 cccaaaggag cctggaacaa tggacccagc atgctttagg ccgctgtcca ttgcatcagt 3420 tgcactgcga cacttccaca gaatactggc aaatagaata ggtgagcatg gactcctcga 3480 cacaagacaa agagcgttca ttgtggctga cggtgttgcg gaaaacactt cgctactatc 3540 ggccatgatc aaagaggcca gaatgaagat aaaaggctta tacatcgcta tactcgacgt 3600 aaagaaagcg tttgactccg tagagcacag gtcaatctta gatgccctga gaagaaagaa 3660 actaccactt gaaatgagga actacatcat gtgggtgtac agaaactcca aaaccaggct 3720 ggaagtagta aaaacgaagg gcagatggat tcgcccggcg aggggagtga gacagggtga 3780 cccgctctcg ccactcctgt tcaactgcgt gatggatgct gtccttcgga ggctgccaga 3840 gaatacaggc ttcttgatgg gtgcagaaaa gattggtgct ctcgtcttcg cggacgacct 3900 ggttctcctt gcagagacga gagagggtct gcaggcgtct ctaagtagga ttgaggctgg 3960 actacaggag caaggcctag aaatgatgcc aaggaaatgc cacactcttg cgctggtgcc 4020 gtccggaaaa gagaaaaaga taaaggttga aacgcataaa ccgtttactg taggcaacca 4080 ggaaataacg cagcttggac atgcggacca gtggaagtat ctaggtgtgg tgtacaactc 4140 ctacggacca attcaggtta agatcaacat cgcgggtgac cttcagagag taactgctgc 4200 cccactaaag ccacagcaga gaatggctat cctgggtatg ttcctgatac ccagatttat 4260 acacaaactc gtgcttggca ggacatcaaa tgcggacgtg cgtaaaggag acaagatcat 4320 taggaagacc gtcagagggt ggctcagact gccacatgat actccgatcg ggtacttcca 4380 cgcgccgatt aaggaaggtg gtttgggcat tccagcgttt gagtccagga ttccagagct 4440 tctaaaatca agaatagaag cacttggagc atccaacatg caaactgcaa gaagccttct 4500 tgggggcgac tgggtggccg aaaggaagaa gtggatcaac acccagaaaa tcaagaactc 4560 ggaatgggct cagaaactac acctaacaac ggatggcaaa gacctacggg acaccaggaa 4620 agcagaggcg tcatacagtt ggataaggga catacatgtt gctataccag ctagcgtctg 4680 gataaagtac caccacacca gaatcaacgc acttcccaca ctgatgagaa tgagcagagg 4740 cagacgaaca aatggaaatg ctctgtgcag agccggatgt ggacttccgg agaccctcta 4800 ccacgttgtt caacagtgcc cccgcaccca tggaggaaga gtattacgtc atgacaaaat 4860 agctgagcaa gtagccatct tcatgcaaga gaaaggttgg ctggtgctaa gagaggcaca 4920 cattagaact tcagtgggac taagaaagcc agacattatt gcacgaaaag gacaggattg 4980 taaaatcatc gactgccaaa tcgttacaac ggggaatgat atacggatac aacatgagag 5040 gaaaattcag tattacgcca gcaactggga gctgcgaaga tcagcagcaa ccatgatcgg 5100 gcatcaaggg caagtaagtg tggaggctat aacaatatcc tggaagggcg tgtgggagcc 5160 acgatcatac tgtttgctca gggattgcgg cataccaaaa gtcaagatca aagggctaac 5220 gactagagtt cttcttgggg cgtatctaaa ttttaacacc tttagcaagg cgacatatag 5280 aactgaaagg agacgaacag caaactaaga gaagtaccac accagctata aacgcctaaa 5340 atatgcatta aagaaaaatt ttgcttagac atccggaaag gccagcaccc ttgggagcat 5400 caccaggaac tggagagtgg ccacacatag taagaacgaa gaaatagcgc tcaaaaaccg 5460 caaaaactga gcaatcttag caaattataa aagacggaaa aacagcctaa ataacaaaaa 5520 taacggaaaa tcttgtttag tcatcgtaga aagtactcca ccagacgaac acatacaaac 5580 aagaaagaca tcggaaaaat aattatatca aaagttatga ctaaaagagt aaacaagtag 5640 tgcagccctg caaaggaaaa ctaagtggtg aagccctaca tgtagtcagg cagaaacccg 5700 caagtgattc tgccaactgt ggggcatata agtaaagcag tgtaagaata gggacattga 5760 aactagagga catctacagg caagaaagcg atagaaatga caactagacg taccagaagc 5820 cggagtgcag ccacgaaagt gaaaatggca acatcaggcc cagcaaagga acctgaagaa 5880 gggaaacctg acctggttaa ggcagcatta tggctaaatg gacggctttt tggcaacaag 5940 taaaggggtt tacctaccgc tggggaccaa ggatgacctc catgtatgca ttcacgaaca 6000 ccagaaccag ggagattgtt gtaagctggg acctgctaag ggagctgccg atggccgtct 6060 tcgaaggggt gctggcgcac gagatgctgc acgctttgat gttcctagaa aactatgaca 6120 ttttctggaa acgtaccagt aaggagctga tcagcgtgtg gaaaaagaga agtcaaagga 6180 agccacccag agctacgcag acttcgtcaa agaacatggt gaggtgctag acgaccatgg 6240 cccggagttc aagcgtcgat gcatcccctt gtctaaggcc ttgggagtaa atgtggcaga 6300 tgtctctgac aaggacaaaa ccgagaagaa cttccatcta aggcggtttg aatgggaatg 6360 taggtcatgc catgagatgg catacctctc cactaggcgc aagcctggct cccttaccaa 6420 gcacaaggat ggatgcaaga aggcaaattg gaaactcgtg gaggagtatg agaggccgcc 6480 aaaagtcccg aaccacgaat cctacgcaag aaacgctcgg aagtacgacc aatagtagag 6540 gagtttggac aacagggaca atgctgcagt acatgaacca cctacaccca ggcatcatga 6600 agactcaaat gcaattattt aaaatcatct ttttgttttt tttgcttatt ttattttagc 6660 cttatcaagt gaacgctatg tcgcgctagt attaattctt attattattt cttttcactg 6720 tcctaaactt ctccttcttt tctttttctt ttgcattctc tatatcttta gccttgttaa 6780 atagaaacta taaatttttt gttttctttt cttttctatc aagaaaaggg agagccttca 6840 atatattttg ttattctagt tttattttgt aaatactata attacaatat gtaaacaatg 6900 cacgaacaaa aaagtgcatt tcttttgtta gctgtacccc atgcaggagt gctatgggca 6960 ataaatcata ttatc 6975 // ID BRP1_NV repbase; DNA; INV; 214 BP. XX AC X64093; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE N.vitripennis repetitive DNA from B chromosome. XX KW SAT; Satellite; Simple Repeat; BRP1_NV; Repetitive DNA; KW satellite DNA. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-214 RA Eickbaum C.D.; RT "BRP1_NV."; RL Direct Submission to Genbank (27-DEC-1991)D.C. Eickbaum, RL University of Rochester, Dept. of Biology, Hutchison Hall 334, RL Rochester, NY 14627, USA. XX RN [2] RP 1-214 RA Eickbush G.D., Eickbush H.T. and Werren H.J.; RT "Molecular characterization of repetitive DNA sequences from a B RT chromosome."; RL Chromosoma 101, 575-583 (1992). XX DR GenBank; X64093; Positions 1 214. XX SQ Sequence 214 BP; 80 A; 40 C; 41 G; 53 T; 0 other; ggcagaagaa gagggtcaaa agtctagact ttggcttact ggcttcaatc acgccagttg 60 taactacaat ttaatacact agactccaag atatatacaa gtagcttata taaaaataat 120 acacgtaaga tcctcgcaga gggaagatga cgctcctaaa actcaaaatt attagggtcc 180 acgaaaaaaa gtcggccata gtaaggcatt tttt 214 // ID Gypsy-23_IS-LTR repbase; DNA; INV; 139 BP. XX AC ABJB010963117; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_IS_; KW Gypsy-23_IS-I; Gypsy-23_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-139 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010963117; Positions 936 1074. XX SQ Sequence 139 BP; 37 A; 41 C; 35 G; 26 T; 0 other; tgatacatag acggcgacga agacaaagac aggccatcgg cagtggctcg tgcacacagg 60 cgctgcagcg ggcagttttt aattaaaact atcgttcgga aggcaccccg ttcccatcgc 120 tccttccagc accgtaaca 139 // ID Crack-3_BF repbase; DNA; INV; 4898 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-3_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4898 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4898 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 808-808 (2009). XX DR [2] (Consensus) XX CC Its ORF1-encoded protein is similar to L1/Tx1 ORF1 proteins. XX FH Key Location/Qualifiers FT CDS 432..1190 FT /product="Crack-3_BF_1p" FT /translation="MAPRKGNTSEDDSVSLSVVKELLELQERNFKSFLDTI FT MESTHKRMDNIIREVQEIKDSLHFSQKEIDDNKLNLYRHVQKVEDIETEIT FT LLKKDITANDTKVDYLENQSRRNNILIDGVADTKDETWDQCEQKVRDLLKE FT KLKLDPKQIEIERAHRNGRFQDGGRPRPIVAKLLRFKDKDTIIKRAKYLKG FT STIYMNEDFSEKVRQKRKELIPEMKAARERGNIAYLKFDKLIVHPPGEGGA FT DRRRTRADSRH*" FT CDS 1249..4308 FT /product="Crack-3_BF_2p" FT /translation="MNSNPPQPSTLIFNPCDLNNLDNGSQISPLDPDTNFL FT NDFYTSINTTSQYHTVESFNKSCSTPCCNTLSLFHLNVRSLPCHFDELNEY FT ISSLHQHFTVIALTETWLTENIAENYSIPGYNSIHHCRKNRSGGGVSLYIR FT NDMYHQERGDLEFVIDDTGTKSISIEIIPSNPKEKKTILSCVYRPPNTDII FT EFNERIKESINNINRERATCYIIGDFNINLLNNTNQPTTDHINEMFSSGFY FT PLICRPTRITPYSATLIDNIYTNSGSNNIKVGILITDISDHFPIFLSSELN FT EPVRQADKISYRQVNTTNIQKFTQSLSEIVWSDVLNETDTEKAYNQFLETY FT GSLYEKCFPKKTTKISQHSRKKPWLSNGLLKSIRRKNKLYTKYVKNPTPLN FT HSIYKKYRNKLNHLLKTTKKDYYHKKFNEATSNIKQTWSLINEVINRNKSN FT SAGPNKIQIGTKTTENKDEICNEFNDFFINIGPTLESKIPKGTQDPLSYIT FT LQVNSNLSTFVPPTDAEITDIVKNLKDSAAGHDEISPKVLKLSLPYVIAPL FT THILSLSLQNGIVPHPLKVAKVTPIFKSSDPLNVNNYRPISVLPCISKVLE FT KLVYSRLLKHLNQNNILYKHQYGFRKGYSTSLALIHLMEKISTAIDNSEYT FT IGIFLDLSKAFDTVNHSILLNKLKRYGICDSAIKWFSSYLSNRQQYVALND FT NKSQPKIIECGVPQGSILGPLLFLIYINDLEKASSIIFFILFADDSNLFLS FT NSNFDTLVKLANEEMKKVMSWFEANKLSVNVKKTYYLIFCNRNKTYNKSAK FT IFLKSVPLSQESQAKFLGIIIDDRLTWKSHITMLGKKISKTIGIIGKLKHI FT LTQKTIITIYNSLIYPYLTYGNIVWGCAYKTTLQPLILLQKRFVRSATGAS FT FHAHSAPLFTKLKILNIFDINRLHLMLFTFKFLNCSLPETFNGLLNLNSHF FT HNYQTRQSAHFHIPLVRTSVAQMSVKYKCIINWNNLPLHLKCLTSIISFKR FT NLKNHLLSFPTDI*" XX SQ Sequence 4898 BP; 1681 A; 1031 C; 760 G; 1426 T; 0 other; aacggtccaa gatggcggca gctgcttttg tctgagctct catctttatc atatattgta 60 aaattctgtc ctaaaatctt gccatcgtcc ataaatcata ccagaatctg aaagttgaat 120 gtcttaaggc ttattctgta tgagaaaatt ggacgtttga cgaatctggg cggagatatt 180 ggacggaaaa acctggtctc ccagacaaca aaggacgctg tatccgggca acaacaaagg 240 acgccacttt ctcgccagcg gaaacaccgc agcgggctcc cttagttgga gcccccgccg 300 gtaggtttct ctctctgcaa catttacact gattttttac atagtctgtg tttgtttgtg 360 gtgatttatc accagctcgt cgctgccgag agtatctctt gtcatttcag tacccccccc 420 cccccggcag aatggcacca agaaaaggca acacttccga agacgattct gtatctctgt 480 cagttgtcaa agaactacta gaactacaag agaggaactt caaatcattc ctggacacga 540 ttatggagtc aacacacaag agaatggaca acattatacg ggaagttcaa gagattaagg 600 acagccttca cttctcacaa aaggaaatcg acgacaacaa actcaaccta tatcgacatg 660 tacaaaaggt agaggacata gaaacagaaa taaccctact caagaaagac atcaccgcaa 720 acgacaccaa agttgactac cttgaaaatc aatcgcgaag aaataatata ttgattgacg 780 gagtcgctga cacgaaggac gagacttggg accagtgtga gcagaaagta agggatctac 840 tcaaggaaaa actgaagcta gatccaaaac agatagaaat agaaagagcg cacaggaacg 900 gacgttttca ggacggagga cgaccgcgcc caatcgtagc gaaactgcta agattcaagg 960 acaaggacac aatcatcaaa cgagcaaagt atctgaaagg aagcactatc tacatgaacg 1020 aggatttttc cgagaaggtg agacagaaga ggaaggaact gatcccggaa atgaaagctg 1080 cacgtgaacg tggaaacatc gcgtacctga aatttgacaa acttattgtc cacccacctg 1140 gagagggagg agcagatcgc aggcgcacta gagccgactc cagacactga agacacggaa 1200 tttcctgaac tgttttgagc taagtcaaat aacccttaac cacccaacat gaactcaaac 1260 cccccacaac ctagcacctt gatttttaat ccatgtgacc tgaataattt agataatggg 1320 tcacaaatct ccccactgga tcctgatact aactttctga atgactttta cacttctata 1380 aatacaactt cccaatatca tactgttgaa tcctttaata aatcttgtag cacaccctgt 1440 tgtaacactc tatccttgtt ccacttaaat gtacgaagcc taccatgcca ttttgatgaa 1500 ctgaatgaat atatctcttc cctgcaccaa cattttacag ttattgcatt aacagaaaca 1560 tggctaacag aaaatatcgc ggaaaattat agtatacctg gttataacag catacaccac 1620 tgcagaaaga atagatctgg agggggtgtg tcactataca ttagaaacga catgtatcat 1680 caagaacgag gtgaccttga atttgttatc gatgacacag gcacaaagag catatccatt 1740 gagatcattc caagtaaccc gaaggagaaa aaaacaatat tgagctgtgt gtatcgccca 1800 ccaaataccg acattataga attcaatgag cgcataaagg aatctataaa caatattaat 1860 agggaaagag cgacttgtta tattattggt gatttcaata ttaatctatt aaacaatact 1920 aaccaaccca ctactgatca tataaatgaa atgttctcat ctggcttcta cccacttata 1980 tgcagaccta ccagaattac accttactct gccaccttaa tagacaacat ttatacaaac 2040 tcaggtagca acaatatcaa ggttggcatc ctcatcactg acatatcaga tcactttcca 2100 atattccttt catccgaatt aaacgagcca gtcagacagg cagacaaaat atcttatcgc 2160 caagttaaca caactaacat tcagaaattt acacaatccc tatcagaaat agtatggagc 2220 gacgtattaa acgaaacaga cacagagaaa gcatataatc agtttctcga aacatacgga 2280 tcgttatacg aaaaatgttt cccaaagaaa actacaaaaa tctcccagca ctctcgcaaa 2340 aaaccctggc ttagcaatgg attgttaaag tcaatacgca gaaaaaacaa gctctacact 2400 aaatatgtca agaaccccac gcccttaaac cacagtatct ataaaaaata ccggaataag 2460 ttaaatcatc tcctaaaaac tacaaagaag gattactacc acaaaaagtt caatgaagct 2520 accagtaaca taaaacaaac atggtccctg ataaatgaag ttataaatcg taacaaaagc 2580 aactcagctg gtccaaacaa aatacaaatt ggtaccaaaa ctaccgaaaa taaagatgaa 2640 atatgtaatg aattcaatga ttttttcatt aatattggac ctacacttga aagtaaaatc 2700 ccaaaaggaa cacaagatcc tcttagctac atcactttac aagtaaattc taacctatca 2760 acttttgtgc caccaactga tgctgaaatt acagatattg tcaaaaactt aaaagactca 2820 gctgctggtc atgacgaaat tagccctaaa gtactaaagt tgtcactacc ctatgttatt 2880 gcccctctaa cacatattct ttctctatcc ttgcaaaatg gaattgttcc ccacccacta 2940 aaagtcgcaa aagtgacgcc tatttttaaa agcagtgacc cactaaatgt taacaactat 3000 cgaccaatat ctgtccttcc ctgcatatct aaagtcttgg aaaagctagt atatagtcgt 3060 cttttgaaac atttaaatca aaataacatc ctatataaac atcaatatgg tttccgaaag 3120 ggctattcta cctctctggc actcatacac ttaatggaaa aaatttctac cgccatagat 3180 aattcagaat acacaatagg aatctttctc gatctatcaa aagcatttga tacggttaat 3240 cattccattc tactcaataa actaaaacga tacggtatat gtgactccgc aataaaatgg 3300 ttctcgtctt atctatccaa tagacaacaa tacgtagctc ttaacgataa taaatcacaa 3360 cccaaaataa tagaatgtgg tgtcccacag ggttcgatat taggacctct actttttctg 3420 atttatataa atgatctcga aaaagcatcc tcaattatat tctttatcct atttgcagat 3480 gattcaaatc tttttctttc aaactcgaat tttgatacgc tagtaaaact tgctaacgaa 3540 gaaatgaaaa aggttatgtc gtggtttgag gcaaataaat tatcagtaaa tgttaagaag 3600 acatattatc taattttctg taatagaaat aaaacctata acaaaagtgc caaaatcttt 3660 ttaaaaagtg tgcccctgtc tcaagagagt caagccaaat tcttaggtat aatcattgat 3720 gatcgactga cctggaaatc gcatataact atgttaggta aaaagatttc aaaaactatc 3780 ggtatcattg gaaaacttaa acatattctt actcaaaaaa ccattattac catatataat 3840 agtttaatat acccgtatct aacctacggg aatatagtct ggggatgcgc ctacaaaacc 3900 acactacagc ccctgatatt gctccaaaaa agatttgttc gttcagccac gggcgccagt 3960 tttcatgcac attctgcacc tttgtttaca aaacttaaga ttttgaatat ttttgatatc 4020 aacagacttc atctgatgct atttacgttt aaattcctaa actgttctct tccagaaaca 4080 tttaatggtc ttctaaatct taattctcat tttcataatt accaaactcg acaaagtgct 4140 catttccata ttcccctagt tagaacgtca gtagcccaaa tgagtgtaaa atacaaatgt 4200 ataattaatt ggaataatct ccccctccat ctaaaatgtt taacatcaat aattagtttt 4260 aaacgcaatc taaaaaatca cctactttcc tttcccactg acatttgaca tcttatctta 4320 atttctccat ttgatagctt aagttgttat ttagttgata tatttccctg tttttaataa 4380 tttaattgtt gttttaatca ttaggtattt agctgctaca tttccttgtt ttagtcattt 4440 gataatagct ttatttgtgt cactttagtt acgatatatc cttgtttttt cttaattcgt 4500 taactttagc ttctctatta tattatatag cttcatttgt gttacgatag tttcaatatt 4560 taaattgttg ccaataattt tttccttaac ttctccattt gatagcttaa gttgttattt 4620 agttgatata tttccctgct tttaataatt taattgttat ttagcttctc tatttgatgg 4680 cttttttgtg ttaatttagt tgatacattt ccttgtttta ttatcaattg tttatcacct 4740 caattatttc attagaattg catacgttat aggaggaggg ctcgcataag ccgttaggct 4800 tcctcccttc ctcccgcaca gtatttgttt tatccaatgt acatttctct gtttttgtgt 4860 tctacgtgcg aataaataaa ctcaaactca aactcaaa 4898 // ID hATm-2_AA repbase; DNA; INV; 6235 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hATm-2_AA, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; KW Autonomous DNA transposon; hAT superfamily; hATm group; KW hATm-2_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6235 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1047-1047 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM,hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-2_AA is a young family of hATm autonomous DNA transposons CC identified in the mosquito genome. The consensus sequence was CC built based on multiple alignment of 3 copies. The 5'-terminal CC portion of the transposon is not known. XX FH Key Location/Qualifiers FT CDS join(500..773,836..1036,3027..3235,3641..5017) FT /product="hATm-2_AAp" FT /note="transposase." FT /translation="MASSAVKQCLRSKVQVPLIGPVLINFSKNRIPTKREA FT LQVLFFYNYVKHLNVSMSISEAADQIMKLEIEGLAIKRKDIIQACLQRLYN FT DWRDTQRSKSKMSLNAEEKRQLFVNKLSDKLPCTVPCSESPLKVNETLESD FT KENTEKKAERNFTDRGKEKRPVAYKINLTTPKVAAAFERTGISNRNASMIL FT EALIEDNHTKSSSSTEYSINYQTIRRSRMQHRQELAEKLKQSFDPSVPLVL FT HWDGKMLPDQNGKRVDRLPIVVTGSGVSKLLSVPTVSSGTASAETAAIMEV FT ISEWGLQERLVGMSFDTTSVNTGVNNGIISRLPALLNKNMLALACRHHVAE FT IVLKHVFEIFGDKSKSSNLDCFNEFKKEFNENRGDQKLKTVLDDIRLKTLT FT EPWRDSVVEFCKDQLANQQQRGDYEELLELVIMFLGEVPPKGIRLRKAGCL FT SRARWMGRSIYLLKLWIWTDNREDQKLWDHYTTVALFIAQCFVRYWFRVNC FT SISAPRVDLEFSNEIHRFHDEKYRSAALKAFKNHLWYLSEWNAAMAFFDPA FT IDSLEKQKMVVALNSSATPLGSHAKRRKTDKVQNKIPMRATQIDENMTVSS FT YITEASKTFFDIMRIKTSFLHIDPAEWEKDEAFLLGRDRVMALEVVNDVAE FT RAVKLATDYNNKITKDPKQHNDLLQVVEYHRKLKPLKY" XX SQ Sequence 6235 BP; 2103 A; 1104 C; 1105 G; 1923 T; 0 other; gccatggttg atgatattta gggagatagc gtgcagattt atgttaataa tccacttatt 60 ggttgtatcg atagcaatta aataatctct ggattgaaat gattgccaca tgatggcgat 120 tataatgcac agcacacgat caatctttaa aataacgttc aattcgggac tacaaaatag 180 ctgatatttt catcggttta atgtgagaaa acaagagaaa aaatatgtct ttctttatta 240 tttctcacac acgatgacag tacttaacga ggtattccgt aagtgcaagt gctttgtgaa 300 atatattttt gcgtcttctt gcaaattctc ttctatcctc tccgctgttt ctctgctttt 360 cattctcttc ttatcctact ctctctgtca accgtgtcaa cccagtgaat ggcattgttg 420 acgtgattcc ggcagctact agtcgactgt ataaaaaatc aaaagtagag cgcgagctgt 480 gattgttgta tcgaaaaaca tggcaagctc ggcagtaaaa cagtgtttgc gttcaaaagt 540 ccaggtgcct ttaatcggac cagtgttgat taattttagt aagaacagaa ttcccacgaa 600 aagagaagcc ttgcaggtgt tgttttttta caattatgta aaacatctca acgtttcaat 660 gagcataagt gaagcggccg accaaataat gaaattggaa attgagggcc tggcgataaa 720 acgaaaagat atcatacaag cttgcttgca gcgattgtac aatgattggc gaggttaaat 780 tcgatattta tctttatttt tttctactga tatgacgaaa atttacattc tatagatact 840 cagcgttcaa agagcaaaat gtcgttaaat gccgaagaaa agagacaact ttttgtgaat 900 aaactttctg ataaattgcc ctgtacggtg ccatgttcgg agagtccttt aaaagtcaac 960 gaaacgctgg aatcagacaa ggaaaatact gagaaaaaag cagagcggaa cttcactgac 1020 cgcggaaaag agaagagtga gtaaaaattt cttaatattg caaattatgt tgggcagcgt 1080 ccataaatga tgtagcattt ttcggccaat ttttgacact cccctccccc gtcgtagcct 1140 attttcccat acctgataag tagcaaaatc gtaagctccc ccctccccac taaaatgcta 1200 cgacatttat ggacggtccc ttggtgtttt ctgcacttct tgatataaat atcggatatg 1260 taatgttagc acatcctggg aattaattac taagtcaatt tcttattttt tccgtattta 1320 atgcacacaa aaatactatg tattgcgaag catattcgtg tggcataaat ttgagaaaat 1380 agaaacaaaa tcagacaatt tgacaaatcg tcattatcac tacctacaat ataatggaga 1440 tacaggtcgg actcgattat ccggagtttc gatttttttt cactccggat aatcaaatca 1500 ctaaaaaaat attgaaatct tcgataaaaa aaacttaaat attatctttg ttcgctgttt 1560 tattaatatg atgcggtggc gtagccagaa attatttcta ggaacaagaa aaaaaaaatt 1620 ttttgaccag catacacaat aaaccacgtt ttcaaaaccc cctttttctt aaatacgttc 1680 aaaactgcaa aaccattcaa attgtatgtt gcaataccaa attatcttag atttattcat 1740 aaaaattgaa aaacaagaac attaaaaaga ttcggataat cgggtctaaa attccggata 1800 atcgaatccc agataatcga gccaccggtt aatcaagtct ttagataatc gagtccgacc 1860 tgtactaaat tttgatatga actgtcataa tatgtccaac tgacatggtt tatccaagca 1920 ttgtccttaa gaaggatttt tctagaagtt attctaagga ttttcccgta atttgcctgg 1980 agtagtttca acatatcatc ctaataattt ctctagatag gctgattggg ctatttcaag 2040 gaagtttcag agaattgcag ttagctgtaa cgtaaagtta gttaagaaat tttacccgta 2100 tccaagctat ccttattata atattcttaa cccctggtat tacacaaaaa catcttataa 2160 cgattcctgc aagaatttct ccagagaatt tttttttcaa gaattttgtc taaaaattca 2220 cttagcagta attctgcatt ttattttttc ctcataactc cctcaaatat taaaacagaa 2280 attcttccaa gtaaataatt taaaacaatt cgaagatttt ttcaggtttt caacaaatgc 2340 ttcatccggc ttttatttgt ataatgattt ttttctacga aaatctccaa gaacactaat 2400 acacaaaaat aacccagcag aaccaaaaaa gaaataatct agatatcttt ctttgaattc 2460 tgcaaggttt attcttagaa gcaatattcg aatttgttga taaattcaat gaattacttg 2520 aatagttgaa atagtttacg gaggaagttt tgctacaaat gattagatta acatttagct 2580 ttttagtaac ttaagctttg tgttttgaag atttactttt cataaaacac ataaccggtg 2640 gaaaaacggt aaataaatct taaaataact caaatggatt aaaggagcct tttgagatgg 2700 aatttcagaa aaatattcaa agctgcgggt gttatttctg gaagaaactt tgtttaaatg 2760 tatctcatga atccttgaag aaactataca acattaccaa gatgtgtgta agtctgctca 2820 ttgtgaatga atgaataaga tggatgtcct acctagttta agatttcatg taaaaaatgt 2880 gcaatgttgt gaagaagcag ttttctccgg aataaatgac aaatctacaa aatagcacag 2940 ctcgttaaat ttgcaagaca tcgactaaga accggtcaaa ataaacgaga acaaattgct 3000 cgaggaattt tattttcaat tttcagggcc agtcgcttat aaaattaatc tcacaacccc 3060 gaaagtggca gcagcatttg aaagaactgg aatttcgaat cgaaatgcaa gtatgatact 3120 tgaagcccta attgaggaca accatacaaa atcatccagc tcaactgaat actcgatcaa 3180 ttatcagacc attcgtcgtt ccaggatgca gcatcgccag gaactggcag aaaaagtaaa 3240 tatatatttc ttataacata ggtcgaatta ttaatcggct ttaataaaaa cagttcgact 3300 atttaaataa gaaaacataa cttccgtttt aaaagtttat aaaggcttaa caattttgat 3360 tgcttaaaat tacatacaac tttttttcaa ccaacgtgaa tcggtttttt gaattagagt 3420 gcaatgaaaa aaataaacat ttcaactgtt gatttcagtt cttgtcatag ttcttgtaat 3480 attaatattt taaaatagtt ctaaataaga aaattattaa catgcccaat cataatttct 3540 taaatcctta tcatgtcgac tggctttatt ttaaatatca tttaatcttt taatctttaa 3600 atcttttttc ctttgttttc ttactttgtg tattcatcag cttaagcaaa gttttgatcc 3660 atctgtacct ttggtacttc actgggatgg aaaaatgcta cctgaccaaa atgggaaacg 3720 tgttgaccgg cttcccatag tagtaactgg atcgggagtg tcaaaactgc tttcagtacc 3780 tacagtttcc agtggaacgg ccagtgcaga aacggctgcg ataatggaag taattagcga 3840 gtggggattg caggaaaggc tagtcggtat gagtttcgac actaccagcg ttaacaccgg 3900 cgttaacaat ggcatcataa gtcgccttcc tgcgctcctc aacaaaaaca tgctagcgtt 3960 ggcatgccgc caccacgtgg cagaaattgt actcaaacat gtcttcgaaa tcttcggcga 4020 caaatcgaaa tccagcaatc tcgactgttt taatgaattc aaaaaagaat tcaacgaaaa 4080 ccgaggtgac caaaaactta agaccgttct agacgacatc cgtttgaaga cattgactga 4140 gccttggagg gatagcgttg ttgaattttg caaggatcaa cttgcaaatc aacagcaacg 4200 aggagattac gaagagcttc ttgaattggt aattatgttt ttgggtgaag ttcctcctaa 4260 aggcataaga ttacgaaaag caggttgtct cagccgggca agatggatgg gtcgaagcat 4320 ttatctgctg aagttatgga tttggacaga taatcgagaa gatcagaaac tgtgggacca 4380 ttacacgacc gtggctctgt tcatcgccca atgttttgta aggtattggt ttcgagtgaa 4440 ctgttccatt tccgccccac gagtcgattt ggagttttcc aatgagatcc ataggttcca 4500 cgatgaaaaa tacagatcgg cggcattgaa ggcgtttaaa aatcatttgt ggtacttgtc 4560 cgaatggaac gcggccatgg cattttttga cccagcaatc gatagtttgg aaaagcaaaa 4620 aatggtggtg gccttaaaca gctcagcgac accactagga agccatgcaa agagaagaaa 4680 gacggacaag gtccagaaca aaattccaat gagggcaact caaatcgatg aaaatatgac 4740 agtttcaagc tacattacgg aggcttccaa aacttttttt gatataatga gaataaagac 4800 gagttttctt catattgatc ccgcggagtg ggaaaaagat gaagcctttc tattgggaag 4860 agatcgtgta atggctctgg aagtagtgaa cgatgtagca gagagggcag tgaaactggc 4920 aacagactat aataacaaaa taacaaaaga tccaaaacaa cataatgatc tgctacaagt 4980 agtagaatat cacagaaaat tgaagccgct gaaatactga agtttagaat aagcataatg 5040 atacgtattg ttttctatga tttgaaattg aactcaaaag aggttttgtt attgatgaat 5100 tggattaaaa acttgttgag aaataaaaaa gggaaaaata taattcaaac atgtgtttca 5160 cttataattt gaaagaagtc atggtttttc agtcaaaaat tgcaggtaac aatttgttat 5220 taaatttatt aattattttc tatacttcaa tgtattccaa atttgaaaac ttaaactcgt 5280 tataatgaac acatcgtcaa caataatttc ggttgagata tttatgaaac ctaaaacacg 5340 ggttcttctt actatatctt catagacata tactaagtgt tcaacacaat taaaaataaa 5400 cacgccccga ataactagta atgctaaaca acctttaaaa aataaagcgc atattaaaac 5460 actttaaaat cgtaaatgtc ttcgcgagtt ttcgtgtttc tttgtctttt tatttaagta 5520 gattataaac aaaaacctgc acgctatctc cctaaatatc atcaaccatg gctatatttt 5580 ttttatggaa tttgagctta tatgcgaatt tggttatgac atgtaagtca taattatgcg 5640 gctttatagc tctactgatc cttcaaaatc atatagtaaa tacgccgttc aataaataac 5700 acttggatta aataatattc acacctgtgg aaattacgcc gcccatcata ccaacaacaa 5760 atgtgaaaat ttcattcaaa tcgttcactt ttaacgcacg ctatcgggtt tctcccttgc 5820 tgcgtgaatc cctaaccata gcaaacacgc caccaccacc acccaccgac agcagaagca 5880 gcagtgctgc tagttaaatg aaataatgat ccgttcggag tattactaat tggtttatgc 5940 ttaataactt tttcggcatg cgtccgatcg atttgtggtc ttcgaaggta tgattcattg 6000 atattttttc tatacagaaa ttcaacagat ttgaattatg gggaccatgg acggcctcta 6060 gagatttttt tttggaaaac ttagttttcc catagtaaat cctatgtaaa cttcaaaccg 6120 ctggcgctaa aatatagttt cacggattca gctaaaattt tgcacagtca ttacaggacc 6180 caagaggagc acgaaaagtg gactggagct gggatttgat ttttgtccca cacta 6235 // ID BEL-111_AA-LTR repbase; DNA; INV; 370 BP. XX AC supercont1.78; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-111_AA_; KW BEL-111_AA-I; BEL-111_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-370 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.78; Positions 2573487 2573856. XX SQ Sequence 370 BP; 94 A; 72 C; 90 G; 114 T; 0 other; tgttcagtct tcgctgagtt gacagctttt atagctgatt tttttgagtg tgttactccg 60 actactgcga ttgcgtttta tgctacccct ggcaacacag caagtagtgc taccaccttt 120 gtagaaaaat aatgatttcc cgcgcgatgc ggaaggagag ttgcgtacga ttattgtaac 180 gcgataggtc gccgagtgaa gtggcgaagt ttttgttaag ttttcttttt aaataaaaca 240 agtgttaaag tgatcgaaac gtgtttttct ttccaccgga agaaacgaaa aagtgtccgt 300 agtgaatcca gtccgccggt gaagtgtcca cctcgaacta tttgtgttgg catccgtcgg 360 atgagcaaca 370 // ID Gypsy-51_CQ-LTR repbase; DNA; INV; 133 BP. XX AC AAWU01036160; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-51_CQ_; KW Gypsy-51_CQ-I; Gypsy-51_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 482-482 (2011). XX DR Genome; AAWU01036160; Positions 10745 10613. XX SQ Sequence 133 BP; 45 A; 36 C; 21 G; 31 T; 0 other; tgtagtaggc catgcctaca acagccccag aatctcactc aaatcctgac agtctacaaa 60 gtcagacgca attcaataaa tgctctagtc tagcaacgaa actgtctttc attgcctagc 120 aatagacgta aca 133 // ID Copia16-NVi_I repbase; DNA; INV; 4224 BP. XX AC AAZX01024383; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia16-NV; KW Copia16-NVi_LTR; internal portion; Copia16-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4224 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1153-1153 (2007). XX DR Genome; AAZX01024383; Positions 4856 633. XX CC Positions [1560-2093] - Integrase core CC 'TGTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 144..2897 FT /product="Copia16-NV_I_2p" FT /translation="MEGELTRIIKLRDINNWSIWKFQVKVISKSSGVWDVV FT SGTRTCPGALATTAAADAITENQKSIEKWNKDDSLAQKIIVTTVSEQAMLH FT IINCDTAKSMWDNFVTVYEQKSLATTHMLQQKWFTITKEPSDDIATHIAKL FT EDIAHRLKLAGETMNDNMIITKILLTLPPSYDHFINAWESTPETERTKANL FT TSRLINQEMRESSRDETALASHKYNKRGNFERPAADKSKNKWKPGKCHNCN FT KPGHWARECRSKKSNDKENYGTKSTESKSKPHGNALMIDAFASIQSSDNHT FT NLWYKDSGASSHMTHHKDWFLNYEPFKKPTWVRIGDGSLLEACGSGDINVL FT SYNGNSWDPSHLSNVIYVPDLKYNLFSSNVTMDKGLMLSSDNRTCKFTKGG FT KTIVIGERIGNLFVMKLKVEDASNAPHQAMVASEHSLATWHEKFAHQNYQR FT VKQILSQRNIVLKDKNEPFCDACAIGKTHRLPFHGSKSETSRIGEIIHADL FT CGPMQEASLKGSKYFLLLKDDFSKFRTVYFVTAKSDVYNCIEDYVKKAEKH FT CPKGVKIIRTDNGLEFVNQNVSKLLNKYGIEHQRTVAYTPEQNGAAERENR FT TLIEAVRSILAGKGFEKKFWAEATNTATHVMNCTGTSRIPNKTPYELWYNQ FT EPKIQDLHIFGEKVYVHIPKEKRRKLDDKGKPGLFMGYEQYVKGFRILILE FT TNKIEIARDVVFSGIAEPVNKTRIEENYDIILYLPKETDDEPEREREENLP FT DENDREENVPDEPEIGLEENLPDNHDREENVPAEPEREAEENLPVRRNLRK FT KHECSKPKRFDDYIDPDEIDIEELFITDIQEPLTFHEATNCPDADEWKRAM FT QEEINSLSKNQVWTLVDAPKKETVMESRWIFKLKRDANDNIKRFKARLVAK FT GFSAERRNRLQRNL" FT CDS 3025..4191 FT /product="Copia16-NV_I_1p" FT /translation="MKQPHGFSDGSKKVCKLLRSLYGLKQSSRCWNQKFTS FT FLSKHDLKATNADPCIFVSTNENKRLILGIYIDDGLIAAQSQAVVNELLAE FT LKQEFEITHSCVNIFLGLQIECLKDGSFFVHQSSYVKKILEVFNMNNANSV FT TIPADTNQEMCVSMHSGDQTKATNAPYREAIGKLMYLSTGTRPDITFAVNR FT ASRYMENPNKIHWNAVKRILKYLKGTQTLGLLFSPTEDDQLYAYSDADYAG FT EVETRKSTSGSLITLGNSLITWNSRKQSTVALSTTDAEYIAACETTRDIMW FT IKKLVREVCNTKHVDTILHLDNQSAIKLIKNPVFHKRTKHIDIKFHYVREK FT FEQKEFELEYINTHKQLADVFTKALAKMQFEQHRSFLLHDLQTVIN" XX SQ Sequence 4224 BP; 1603 A; 746 C; 815 G; 1060 T; 0 other; ggttatgggc ccagttacga attaaacact atattttttt tacttctcca ctattggagc 60 aaaaatagaa gggtgttcaa tcttacgttt gaggttatcc agccacgttg tgcaaaagtg 120 aaagtactca acacgtaaaa accatggaag gtgaactaac tcgaataatt aaactcagag 180 acataaataa ctggagtata tggaaattcc aagtaaaagt aatttcaaaa tccagtggag 240 tatgggatgt ggtttcgggc acaagaacgt gtccaggtgc gcttgcaaca accgcagctg 300 cagatgcaat tactgaaaac cagaaaagta ttgaaaaatg gaataaagat gattcccttg 360 cacaaaaaat catcgttaca acagtttcag agcaagcaat gctacacata ataaattgcg 420 ataccgctaa aagtatgtgg gataattttg taacagtata tgaacaaaaa tcacttgcta 480 caacacacat gttacaacaa aaatggttca caatcacaaa agaaccgtct gatgacattg 540 ctacacatat agcaaaacta gaagatatag ctcatcgcct gaaacttgct ggtgaaacta 600 tgaatgataa catgataatt acaaagattt tgttaacctt accaccaagc tatgaccact 660 ttatcaatgc atgggaatca acgccagaga ctgagcgaac caaagcaaat ttgacttcaa 720 gactgattaa tcaagaaatg agagaatcct ctcgagatga aactgcacta gcttctcata 780 aatacaacaa aagaggaaat tttgaaagac cagcggctga caaatctaaa aataaatgga 840 aaccaggtaa atgtcacaat tgcaataagc ctggtcattg ggctcgtgaa tgccgtagta 900 aaaagtcgaa tgataaagaa aattatggaa caaaatctac tgagtcaaaa agtaaaccgc 960 atggaaatgc attaatgatt gatgcttttg cgtcaattca aagtagtgac aaccatacaa 1020 atttatggta taaagattca ggagccagta gtcatatgac acaccacaag gattggtttt 1080 tgaactatga accgttcaaa aagccaactt gggtacgtat aggagatgga agcttgcttg 1140 aagcctgtgg atcaggggac ataaatgtac tatcgtacaa tggaaatagt tgggatccta 1200 gtcacttatc caatgtaata tacgtacctg atttaaagta taaccttttt tcgtcaaacg 1260 taaccatgga caaaggttta atgctctctt cagataaccg cacttgcaaa tttaccaaag 1320 gaggtaaaac catagttata ggtgaacgaa taggaaacct atttgtaatg aagctgaaag 1380 ttgaggatgc atcaaatgct ccacatcaag ccatggtagc atctgaacat tcgctagcca 1440 catggcatga gaagtttgcg caccaaaact accagcgagt caagcaaata ctaagtcaac 1500 ggaatatagt attaaaggac aaaaatgaac ctttctgtga tgcctgtgca ataggtaaga 1560 ctcacagatt gccctttcat ggtagcaaat ctgaaactag tcgcatagga gaaataatac 1620 atgctgactt gtgtggacct atgcaagaag catctctcaa agggtctaag tatttcttat 1680 tacttaaaga tgatttctcg aagtttcgca cagtatattt tgtaacagca aagtcagatg 1740 tatacaattg catagaagac tatgtcaaga aagctgaaaa acattgtcct aaaggagtca 1800 aaattatcag gacagacaat ggattagagt ttgtcaacca gaatgtttca aaattattaa 1860 ataagtatgg gattgagcat caaagaactg tggcttatac tccagagcaa aatggagctg 1920 ctgaaaggga gaaccgaacc ctaatagaag ctgtccgatc catccttgct ggaaaaggtt 1980 tcgaaaagaa attttgggca gaagccacga acactgctac tcatgttatg aactgtacag 2040 gtactagtcg aataccaaat aagactccgt atgaactttg gtataaccaa gagccaaaga 2100 ttcaagatct acatatattt ggagaaaaag tgtacgtaca cattcccaaa gaaaaaagaa 2160 gaaaattaga cgacaaagga aaaccaggat tatttatggg atatgaacaa tatgtaaagg 2220 gattcaggat attaattcta gaaacaaaca agattgaaat tgctagagat gtagtattca 2280 gtggtatagc tgaacctgta aataaaacta gaatagaaga aaattatgat ataattttat 2340 acttacccaa ggaaaccgat gatgaacctg aaagagaaag agaagaaaac ttacctgatg 2400 aaaacgatag agaagaaaac gtacctgatg aacctgaaat aggattagaa gaaaacttac 2460 ctgataacca tgacagagaa gaaaacgtac cagctgaacc tgaaagagaa gcagaagaaa 2520 acttacctgt aagaagaaac ttgaggaaga aacatgaatg cagtaaacca aaaaggttcg 2580 atgattatat agatcctgat gaaatagaca tcgaagaact ttttattact gacattcaag 2640 aacctcttac ctttcacgaa gccacaaatt gtcctgatgc tgatgaatgg aagcgtgcca 2700 tgcaggagga gattaattcc ctaagcaaaa atcaagtctg gactttagtt gatgcaccta 2760 aaaaagaaac tgtgatggag agtcgttgga tattcaagtt gaaaagagat gccaacgata 2820 atattaaacg attcaaggct cgcttagtag caaagggttt ttctgcagag agaaggaatc 2880 gactacaaag aaacctttag ccctgtcatc agatttgatt caatcagatt aatacttacc 2940 atagcagcaa aagagaggtt agtccttaga caatttgatg tgaaaacagc cttcctgtat 3000 ggcaacatcg aagaaacaat atacatgaaa cagccacatg gtttctctga cggatccaaa 3060 aaagtatgta agttacttcg aagtctatat ggtttaaagc aatcatccag atgctggaac 3120 caaaaattca catcctttct aagtaagcat gatctcaaag caacaaatgc tgatccatgc 3180 atttttgttt ctactaacga gaataaaaga cttattctag gcatatatat tgacgatgga 3240 ctcattgcag cacaaagtca agctgtagta aatgagcttc tggcagaact aaaacaagaa 3300 tttgaaataa cacatagttg tgtaaatata tttttaggat tacaaataga atgtctgaaa 3360 gatggatcat tctttgtcca tcaatcaagt tacgttaaaa aaatacttga agtgttcaat 3420 atgaataatg caaacagcgt taccatacct gctgatacta accaggagat gtgcgtttct 3480 atgcattcag gagatcaaac aaaagccacc aatgcacctt accgtgaagc tattggtaaa 3540 cttatgtatc tctctactgg gaccagaccg gatatcactt ttgctgttaa ccgagccagc 3600 agatatatgg aaaacccgaa taagatacac tggaatgctg taaaaagaat tctaaaatac 3660 ctgaaaggga cacaaactct aggattgtta ttcagcccca ctgaagacga tcaattgtat 3720 gcatatagtg atgctgacta tgcaggcgaa gtagaaactc gaaagtcaac atctggaagt 3780 cttatcacat tgggaaatag tctaatcaca tggaactccc gcaaacaatc tactgttgct 3840 ttgtctacaa ccgatgcaga atacattgca gcttgtgaaa caacaagaga catcatgtgg 3900 attaagaaac tggtcagaga agtttgtaac acaaagcacg tggacacaat tcttcatctt 3960 gataaccaaa gtgctattaa actaatcaaa aatccagtct tccataaacg tacaaaacat 4020 attgacatca agtttcacta tgtaagagaa aaattcgagc agaaggagtt tgaactggaa 4080 tatatcaaca cacacaaaca actagcagac gttttcacca aagcgctggc caagatgcag 4140 tttgaacaac acagaagctt tttattacac gatctccaga cagtgattaa ttaaaataaa 4200 catatattat agaaacaggg ggag 4224 // ID Chapaev3-5_HM repbase; DNA; INV; 2555 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE an autonomous Chapaev DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2555 RA Bao W. and Jurka J.; RT "Chapaev3 DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1830-1830 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 473..2155 FT /product="Chapaev3-5_HM_1p" FT /translation="MSSNRRSCVNHPDVFCYICGEYMLKENRKKVSDFLKR FT AYLAYFGVRLGDQDKTWAPHQVCKTCTEHLRQWTTGKRKSMKFGVPMVWRE FT PRNHFNDCYFCILNITGINRNNRSKWTYPDLLSARRPVLHSEEVPIPTFLQ FT LPELCEDEYCASNSFDTNEIDSDYEGKASIPQQFNQDELNDLTRDLNLSKE FT ASELLASRLKDKNLLEQGTKITFYRTREKDLLPFFSQEEDLVFCHEIRGLL FT EKMSLPKYFQDDWRLFIDSSKRSLKCVLLHNGNKFGSIPIAHSTKMKEEYN FT TIALILEKLKYQQHQWVICVDLKMVNFLLGQQAGYTKYPCFLCLWDSRDKT FT HHWTKKEWPKRKNMEVGKNNLINNALVERDKIIFPPLHIKLGLMKQFVKAL FT DKNGSCFGYIGEKMPQLSTEKIKAGVFDGPQIRQFINDSAFVNSMNETERK FT AWTSFVAVVRNFLGKRKDENYVKLVDDMLSSFKSLGCNMSIKVHYLHSHLD FT RFPKNLGDMSEEQGERFHQDIKTMEDRYQGRWDTHMMADYCWNLKRDCSKI FT HSRMSRKRSFWVV*" XX SQ Sequence 2555 BP; 857 A; 403 C; 490 G; 805 T; 0 other; acacaggcct acaaaaaaaa aaaagttttt tggtgcctaa gatttaataa ttgattttag 60 ctatattttt gttgctgaat ctgaatttga aaacagtttt tctctatcag ctctagtttt 120 tgagataaat ttttttaagg tgcgatttta catgaaatta acagttttaa ctccatttag 180 taatgttata actcgtatag acaatgtttg gttactgtat tctatatatt tgttatatat 240 tcagtttatt ggtctgtttg caacaggaaa atccactagg tggagtaacc atactgtaaa 300 ttaagtgaac ttcaactata attgagatta gtatgagagt tgttgccgtg cggaatagat 360 gaaagttggt gtcgcgtgat atttctattt tctgaagtgt tgttgttaag ttaacctaaa 420 tagtattgtg ttgtaaattg ttcttgcatt tttgtattta gatctagata aaatgagttc 480 aaatagaaga agttgtgtga atcaccctga tgtgttttgc tacatatgcg gagaatatat 540 gttaaaagaa aacagaaaaa aagttagtga ctttttaaag agggcttatc ttgcatactt 600 tggtgttagg cttggggatc aagataaaac ctgggcccca catcaggtct gtaaaacttg 660 tacagaacat ttacgacagt ggactaccgg gaaaagaaaa agtatgaaat tcggcgtgcc 720 aatggtttgg agagaaccac gaaaccattt caatgattgt tatttctgca tattaaatat 780 tactggaatc aatcgaaaca accgtagcaa gtggacttat cctgatttat tatctgcaag 840 acgaccggtt cttcactcag aggaagttcc aatcccaacg tttctccagc taccagagct 900 ctgcgaggat gaatattgtg cttctaattc ttttgatact aatgaaattg acagtgatta 960 tgagggaaag gcatctattc cacaacaatt caaccaagat gagttgaatg accttaccag 1020 agatctcaat ctatcaaaag aagcttctga gttacttgct tccagattga aggacaaaaa 1080 tttactggaa caaggaacca aaattacttt ctaccgtaca cgagaaaagg atctgctacc 1140 cttcttctct caagaagagg atcttgtatt ttgtcacgaa attagaggac tcctagaaaa 1200 gatgagtctt cctaaatatt ttcaagatga ctggcgtcta tttattgata gctcaaagcg 1260 aagtttgaaa tgtgttcttc tgcacaatgg aaacaaattt ggttcaatac caattgctca 1320 ctcaacaaaa atgaaagaag aatataacac cattgcttta atcttggaaa agctaaagta 1380 tcagcaacat caatgggtga tttgtgtgga cctaaaaatg gttaactttc tcctgggaca 1440 gcaagctggt tacactaaat atccttgctt tttgtgtctc tgggatagca gggacaaaac 1500 acaccactgg actaaaaaag aatggcctaa gagaaaaaac atggaagttg ggaaaaataa 1560 tttaattaac aacgccttag ttgaacgaga taaaatcatt tttcctccac tgcatatcaa 1620 gctgggcctt atgaagcagt ttgtaaaggc tcttgataaa aacggatcgt gttttggata 1680 tataggggag aagatgccac agttgagtac agaaaaaatt aaagcaggag tatttgatgg 1740 tccgcaaata agacaattta ttaatgactc tgccttcgtg aattcaatga atgaaactga 1800 gcgaaaggct tggacttcat ttgttgcagt tgtcagaaat tttctgggca aacgcaaaga 1860 tgaaaactat gttaaactgg ttgacgacat gcttagcagt ttcaaatccc tcggatgtaa 1920 tatgagtatt aaggtgcact atttacacag ccatttagat cgttttccta aaaacttagg 1980 agacatgagt gaggagcaag gtgaaagatt tcaccaagat attaaaacta tggaggatcg 2040 gtaccaagga agatgggata cgcacatgat ggcggattac tgctggaatt taaaaagaga 2100 ttgttctaaa atacattcaa gaatgtcacg gaaaagaagt ttctgggttg tttagtgact 2160 atagtttgtt tcataaatat tttttctttt gtctttacat ataataacta tatatataat 2220 agagtgtaat ataccatata gttatacatt tctatagcat aataatcaaa atttaatcaa 2280 aatttcatgt ttattttgtt aatttgatcc caatcaaaac ccttttcaaa aggataaaaa 2340 ccattttcat aaaaggtata aaagattttt gttttttagg gtaaaggagc catcgccccc 2400 cctagccccc tcattggcgc ttcttatttc attaaaatga tctcaaaaat gcgagctgat 2460 tggcaaaatc taacggtata tttgagttca gggcatcaaa ctcatataaa atttgctgca 2520 aataccccgg caccaaaata gctgttggcc tgtgt 2555 // ID I-55_AAe repbase; DNA; INV; 6046 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-55_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6046 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1326-1326 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 423..854 FT /product="I-55_AAe_1p" FT /translation="MGASTPEAYWGLVXLGKPVRLTDVPIELINKDSEPFG FT HPRRLLTGGKSHFTKTFPPKSEPTNRKKINAVPGHGTEIAAFSEIPIATTS FT KKSARTLQYTTRKIGSGACIKMQANQNPKIEAAIRLPPVSWSHRQGAKDWP FT TTLIM" FT CDS 860..4873 FT /product="I-55_AAe_2p" FT /note="endonuclease, reverse transcriptase, and FT ribonuclease H." FT /translation="MTTITNGNPENLPAEALSNANIYDHLMASRLGYSWNN FT KPGRSTDTEFANCAPGHSEKLFSGGKSHASLSRFGTPLLLPNENPSTEPFV FT LTQPPMVEHSCPQSSLTRSPSFGSIDNLNVSAISSTSPNLLRKGFTLALQW FT NINGYVNNLTNLEKIISQREPLVIALQEPHKVSPTTMNKTLGQKYVWTTHV FT GQNFYHSVAIGVSCEVPYDTIDIDAEFPIVAARIAWPFPVTIVSFYLPNNK FT LPNLKNLLNKALDDITGPILVLGDGNGHHTAWGSRSNNVRGLTIAEVAGER FT SLCILNDGSMTFVRKRIESAIDVSLASRSIINRLHWVIDDDLSGSDHLPIW FT INLADSPPEISRRPRWLYEQADWISFQENIENSLDSSKPTTMEELTKLIFD FT VALATIPRTSATPGRRALHWWTDKTKAAVKARRKALRAARRIPVGHPNREP FT AMNKYRLEHLKCQKTIREAKEKSWVNFLDEINDQQSSAELWKRVNAIQGKR FT RAKGMGLMIDGSITRDPVAVADALADYFGDLSSFNRYPADFTKKHGSANMI FT LDNIYIPQNPGKRFNQPFTLTELNFALGKANGKSAGPDEIGYPMIKKLPPM FT GKRMFLDRINEEWTAGSLPTNWKKSLVVPIPKASGINTDKSSYRPIALTSC FT LAKVMERMVNRRLTEYIEEKGLLDNRQHAFRPGFGTGTYFAALNDILDRAM FT EKREHVEIASLDLAKAYNRAWTPGILRRLLDWGVSGNLAQFVRNFLSNRSF FT QVIIGNHRSRETSEETGVPQGSVIAVTLFLIGMNGVFEVLPKGVYILVYAD FT DILIIVTGQYPKIIRRKLQTAVNAVAKWASNVGFDISANKCARLHICLDKH FT QPPKQPVMVHGSPIPLKNSWKILGVTFDRKLNFRPHLNNVKNACKTRLNLL FT KAISGKRTKSDRKSRYRVAEAVICSRLFYGLEATSRNFAELVSVLAPTYNN FT AVRIISKHLPSTPADSACVEAGVLPFRYKAAAVLGSRVASYLEKTSDDGSE FT TPLLTQANKILRSIIGIILPPVLERYRNGARSWYAKAPMVDNYIKMRFRSG FT CNSERVQAHFLDRVRTKYTIFNIRYTDGSKSLGKVGIGWCGVGVWEQRSLP FT EQCSVFTAEAAAIFQAITHQQDIIGPVLIATDSASVILATQSENNRHPWIQ FT AIQNALDTHPNTTLMWIPGHCGIAGNENADRCANSGRDSTPLTNKVPCADI FT KLWLKKKTEEAWSKQWHLERSLFTRKIKCCTEHWEDRKDRREQIVLSRLRT FT GHTRVSHDMGEGRNFRKLCGTCQEHNSVEHFLNNCPTLEHIRQMYGLGSVQ FT STLQHDRASEIVLISFLKDAKLYNNI" XX SQ Sequence 6046 BP; 1806 A; 1574 C; 1379 G; 1270 T; 17 other; agkcggtgag scccacwtcg ttaaaacatc gacwccsaaa tcwgcatcgg aaascaggaa 60 ggwaatwcac gccgtgattg gtcawggakt tcgcaacgcc gctctmagtg cagcatcaga 120 agckagaacc aacatcgagm acctccgaca ccmckttgcc acgagtagcg agggcgccgg 180 cagaccagaa gttggcgaag aactggaacg gccgaatcga ccgtcgcact cggccggtga 240 aaaggatacc cctgatagct cgggcaccag aaaaatcaaa aacaatagcg agtacccccg 300 aaacctcttt gccacgagta gcggtggcgc tggcagacca gaagtcgacg gaggaacggc 360 cagatcgact accgcactcg gcaggtgaac ataaggaaaa taagagaaat cacacggaca 420 gcatgggcgc cagcacaccg gaagcctatt ggggactggt atkgctaggc aaacctgtgc 480 gtctgacaga cgtacccata gaattaatca acaaagacag cgagcctttt gggcacccta 540 gacgcttact gaccggcggt aagtcccact tcaccaaaac atttccccca aaatctgaac 600 cgacgaacag gaagaagata aacgccgtgc ccggccatgg cacagaaatt gcagctttta 660 gcgaaatacc aatagcaacg actagcaaaa aaagcgcaag gacactgcaa tacacaacaa 720 ggaaaatcgg atctggcgcg tgcattaaaa tgcaagccaa ccagaacccc aaaatcgagg 780 ccgccatcag actccccccg gtgtcctgga gccaccgaca gggagcaaag gactggccca 840 ccactctcat catgtgacaa tgaccactat caccaacgga aacccggaga acctcccagc 900 ggaggctctt tccaatgcca atatttacga ccatctgatg gcctcgagat tagggtatag 960 ctggaacaac aagccaggca ggtccacaga taccgagttt gcaaactgtg cccctgggca 1020 ctccgagaaa ctattctcgg gcggtaagtc ccatgcgagc ctatcgcgct ttggtacacc 1080 tctcctcctt ccgaacgaaa atcccagcac agaacctttc gttcttactc agcccccaat 1140 ggttgaacat tcttgcccac agagctccct gaccagaagc ccatcctttg ggtctataga 1200 caacctaaac gtctcagcga tctcatccac ctctcctaat ctgctaagga agggatttac 1260 tttagcccta cagtggaaca tcaacggtta cgtgaacaac ctgactaacc tagaaaaaat 1320 aatcagtcaa cgtgaaccgc tggtcatcgc tctccaagaa ccacataagg tatctcccac 1380 gaccatgaac aagacactgg gccaaaagta cgtttggaca acccacgttg gacaaaactt 1440 ctaccattcc gttgctatcg gtgtctcatg cgaggtaccc tatgacacaa tagacataga 1500 cgcggagttt cccatcgtag ccgccaggat agcgtggcca ttcccagtca caattgtctc 1560 tttctacctg ccaaacaaca aattgcccaa tttgaaaaat cttctgaaca aggccttaga 1620 cgacattacg ggaccgatat tggtgctagg agacggtaat ggtcaccata cagcttgggg 1680 tagcagaagc aacaacgtca ggggtttgac tatagcggaa gtagcaggtg aaagaagtct 1740 atgtatactt aacgacggat ctatgacttt cgtccgaaaa aggattgaat ctgcgatcga 1800 cgtatccttg gcctcccgaa gcatcatcaa tcggctgcac tgggtcatcg atgacgacct 1860 ctcgggaagc gaccatctcc caatatggat caatctagcg gactcaccac ccgaaatatc 1920 cagacgaccc cgatggctct atgaacaggc cgactggatc tccttccagg aaaatatcga 1980 aaacagtttg gattcctcca agccaactac tatggaagag ctcaccaaac tcattttcga 2040 tgtagccctc gccaccatcc cgagaacaag tgctaccccc ggccgtcgag cattacattg 2100 gtggacggat aaaaccaaag ctgcagtgaa agcgagaaga aaggccctca gggcagccag 2160 gcgtatcccg gtcgggcatc caaataggga accggctatg aacaagtata gattagagca 2220 tctcaaatgt caaaaaacga tccgagaggc caaggagaaa tcctgggtca atttcctaga 2280 cgaaatcaac gaccagcaat cttcggctga attatggaaa cgagtcaacg caatccaggg 2340 caaaagacgg gccaaaggaa tgggcctgat gatagacgga tcgatcaccc gagacccagt 2400 cgcagtcgcc gatgcactag cggactattt cggtgatcta tcttccttca accgataccc 2460 agcagatttc acgaagaaac acggatccgc taatatgatt ttggacaata tctacatccc 2520 tcagaaccct ggaaaacgct tcaaccaacc tttcacattg accgaactta acttcgccct 2580 cgggaaagcg aatggtaaat ccgcaggacc ggatgaaatc ggatacccga tgatcaaaaa 2640 actacctcct atgggaaaaa gaatgttttt ggatcgcatc aacgaagaat ggacggctgg 2700 ctcgttgcca accaattgga aaaaaagtct ggtggttccg atcccgaaag cttccggaat 2760 aaatacagac aagagcagtt atcggccgat tgctttaacc agttgtttgg ccaaggttat 2820 ggaacgaatg gtcaacagaa gactcaccga atacattgag gaaaaaggac tgctagataa 2880 ccggcaacat gcttttcggc cgggtttcgg aacgggtaca tatttcgccg cgttaaatga 2940 cattcttgac cgagccatgg aaaaacgaga acacgtcgag attgcctcac ttgatttggc 3000 caaagcctat aacagagctt ggacgccggg aattttaaga cgtttgttgg attggggagt 3060 ctctggaaat ctcgcccagt tcgtaagaaa ttttctgtca aatcgtagct tccaggttat 3120 tatcggaaac caccgctccc gagaaacgag cgaagaaacc ggagtgccac aaggctccgt 3180 aattgccgtg accctgttcc taatcgggat gaacggagta ttcgaggtac tgccaaaagg 3240 ggtatacata ctcgtctatg ccgacgatat ccttattata gtcactggtc agtatcctaa 3300 aataataaga cgtaaactac aaactgcagt caacgcggtc gctaaatggg ctagtaacgt 3360 cggttttgac atctccgcca acaagtgcgc aagacttcac atatgccttg acaagcatca 3420 gccaccaaaa cagccagtga tggttcacgg tagcccgatt ccgctcaaaa actcttggaa 3480 aatattggga gttacctttg accgcaaact caattttcgg ccacatttaa acaacgtcaa 3540 gaatgcctgc aaaacacgat taaatttgct aaaggcaata tcgggtaaaa ggacgaaaag 3600 cgatcggaag tcacgctacc gggtagcgga ggccgtcatc tgtagtaggc tattctacgg 3660 actagaagca acaagtagaa actttgccga gttagtctcg gtactcgcac ctacatacaa 3720 caatgctgta aggataatat caaaacacct cccatcgaca ccggcggact cggcatgcgt 3780 ggaggctgga gtgctccctt ttcgctacaa agccgccgcg gtactgggaa gcagagtcgc 3840 tagttactta gaaaaaacta gcgatgatgg ttcagagacc cctttgttaa cccaagccaa 3900 caagatcctt cgatcgataa tcgggatcat ccttcccccg gtacttgagc gctaccggaa 3960 cggagctaga agttggtacg ctaaagcacc aatggtagat aattatatca agatgagatt 4020 caggagcgga tgcaacagcg agcgggtaca ggcacatttt ctcgaccgtg tccgtacaaa 4080 atacacaatc ttcaacattc gatataccga cggttccaag tctcttggaa aagttggaat 4140 cggttggtgt ggagtcggag tctgggaaca gaggagtctt ccagagcagt gttccgtttt 4200 cacggcagag gcagcagcca ttttccaggc cataacacac caacaagaca taattggtcc 4260 ggttttgata gccaccgatt cggcgagtgt cattctggcc acccaatcgg agaacaatcg 4320 ccacccttgg atccaagcaa tccaaaatgc attggataca cacccaaaca ccactctaat 4380 gtggataccc ggccactgcg gtatcgccgg gaacgagaat gcagaccgat gcgccaactc 4440 cggtagggat agcacacccc tgactaacaa ggtaccgtgt gcggacatca agttatggct 4500 gaagaaaaag acggaggaag cttggagcaa acaatggcac ctagaacgat ctcttttcac 4560 caggaaaatc aaatgttgca ccgaacactg ggaagaccgt aaagatagaa gagaacaaat 4620 tgtgctttcc cgccttcgaa caggacacac tagagtctcc catgatatgg gcgaaggacg 4680 caatttccga aaactctgcg gaacatgcca agaacataat tccgtcgaac atttcctaaa 4740 taattgtcca acactggaac acatccgaca aatgtatgga ttgggaagcg tccaatcaac 4800 cttgcagcac gaccgtgcca gtgaaattgt cttaattagt ttccttaagg acgctaaact 4860 ttacaacaat atctgatcat ccgtggcaaa gtaggcttgg catccggaac cgcactcagg 4920 ttgtaagacc ctctccatcg aaacagacac tcgcatgaca cccatgatac cgaagagcaa 4980 tagtggaaac atcggacatg aatctgttga cctaattgag cgtaaaagaa tcatctctaa 5040 aaattgcacg aaacaccccg cctccaactt taaaggaacg gcggttatcc actctgggct 5100 ccccgagcac cgtaactcgg gtggtaagtc ccacgtctgt ggccgtgttt taaaagctaa 5160 tctgagcgaa tcaaccgggg aattacctgt atcgcgttcc cctctaccga cgaccgctgg 5220 ttcagtccta aaaaaggaaa atttaaaaaa gacaatccaa acaaagctac agcattagga 5280 cttacctgga ctgctgttac ctccccaatc tgcctaagga cagggattgg cagcctccga 5340 cccctagggt gatacgcccc cggagcaacc tttctcagtc ccgtgcgtac cgctgctact 5400 ttcaacgaag tttgtcggta cctctacgcg gcctctttta cccgaacaaa ccaaggtgct 5460 aggggaacga ccggtaggct cttcgacaga accagaacga ttgatttcga tgcggtggtt 5520 catctctcca acaaatttcc cgcggcgata atagtcgtcc ccctcgaaga gctgctttcc 5580 gaatctgctc cctccatcga acaccagtcg cctccgctgt atttttcccc ggattctcgg 5640 cagcctccat gtggaaaact tcttcacttt tcatacggat caaatgattg acagtgaaga 5700 gaggaaatgc ttagatgtca aagtggaacg atggcaacga aacatctcaa gcttgcacgc 5760 aacattcaga ggccacctag tggcaggtag aagaacaaat gaactagaaa acctccacac 5820 tgaagaaaat gtgtttgtga ttctttgctt ttattttaat ttgtattgta aaaaattaac 5880 tctgtaatta aacgataggc gagaccccgt ggttgggctc gaaggtaggc ccctttggtc 5940 taacctttta tttctttttt gcgaggagtc cgtctgactc tctctgacga gtgaagaact 6000 agccggatgt gttaaaattc acataaataa agacataaaa aaaaaa 6046 // ID Gypsy-15-LTR_HM repbase; DNA; INV; 86 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-15-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-86 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 403-403 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 86 BP; 28 A; 14 C; 15 G; 29 T; 0 other; tgtgtcatat attatacatg tatatatacg gcccgatctt atatatcggt gccggtcttt 60 aaggagaata tattatgaac acaaca 86 // ID BEL-22_CQ-LTR repbase; DNA; INV; 255 BP. XX AC AAWU01010298; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-22_CQ_; KW BEL-22_CQ-I; BEL-22_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-255 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 198-198 (2011). XX DR GenBank; AAWU01010298; Positions 1113 1367. XX SQ Sequence 255 BP; 62 A; 64 C; 67 G; 62 T; 0 other; tgtttgcgga atacctgcgg tcgacccctc ggtgcgcggc cggtctgtca acgcgagcgt 60 cgttgcgtgc tttttgcgtt gaagtgggag aaaaacaaac aacaataaat atcagtccga 120 agaagttttg aagagagcaa gacgtgtttc atttttcctc gcgcgcgatc cttaatccgc 180 cggaaagccg tctctcacgt gtggccactt ggttccgacg aacaaagtag gccaattgta 240 tacagtccgc taaca 255 // ID DNA4-7_AP repbase; DNA; INV; 707 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-7_AP. XX NM DNA4-7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-707 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1954-1954 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 707 BP; 239 A; 114 C; 104 G; 250 T; 0 other; tcgatggtta gaaggtaaaa gggcgtaaac gccaatttcc aaattgttca ggagagccga 60 aaaacagata ttttcatatg aactgtgtta caaccttaga acatcgtata acacttcgat 120 ttgagactta atatgattta taatgagttt ttacgtattt tgtattgatt aattgttcat 180 aatcataaca tatttgtgta tttaaagcca attaaaaaaa aaaaatgact cttacgcccc 240 tttttttctg atcgtggctc ttacgcccca aagatgttct attgactata aagtataatg 300 gatagtttaa tattgttatt tgttttatat tattgattaa attcaatgaa gatttgtatt 360 aaaagtattt cgaaaattgt agatataaac gctttttaca gtgctgacaa aaccactaac 420 aaaaatgata aattgatgta accgtcaaca atgacgctta cgtattatca agtttccttg 480 aatgccgttt acatctttta attgacgttt acacccagtt ttttcattcc caccaattct 540 atgccgttta cgaccacata ttaaaacatc gattcttgga tatatacaaa tgatagcgat 600 gtttaggttc gattaatcga tgtataataa tgccgtttac aacaatatgc aaaaacatcg 660 attttgaaaa aagtggcgtt tacgcccttt taccttctaa ccatcga 707 // ID Copia-38_CQ-LTR repbase; DNA; INV; 125 BP. XX AC AAWU01006555; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-38_CQ_; KW Copia-38_CQ-I; Copia-38_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-125 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 380-380 (2011). XX DR GenBank; AAWU01006555; Positions 22371 22495. XX SQ Sequence 125 BP; 31 A; 30 C; 21 G; 43 T; 0 other; tggagatgat caagcaacct atggatttgt acttgtttgt tagttacctg gcagctcaat 60 aaaacttttc attctgttct aaaccgtaac ccacgctaga cgtgttttat ttcacctctg 120 cccca 125 // ID TFP3 repbase; DNA; INV; 831 BP. XX AC M29093; XX DT 30-APR-1998 (Rel. 3.03, Created) DT 30-APR-1998 (Rel. 3.03, Last updated, Version 1) XX DE Autographa californica nuclear polyhedrosis virus TFP3/2 DE transposable element derived from Trichoplusia ni after DE infection. XX KW DNA transposon; Transposable Element; Nonautonomous; TFP3; TIRs; KW TTAA superfamily; nonautonomous DNA transposon. XX OS Trichoplusia ni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Noctuoidea; Noctuidae; Plusiinae; Trichoplusia. XX RN [1] RP 1-831 RA Wang H., Fraser J.M. and Cary C.L.; RT "Transposon mutagenesis of Baculoviruses: analysis of tfp3 RT Lepidopteran transposon insertions at the FP locus of nuclear RT polyhedrosis viruses."; RL Gene 81, 97-108 (1989). XX DR GenBank; M29093; Positions 1 831. XX CC TTAA target site duplication. XX SQ Sequence 831 BP; 237 A; 181 C; 161 G; 252 T; 0 other; cccattcgct gccgatctaa agagtaacaa tattacccag ttgccgccat actgtactat 60 gcagcccgcc catcggcggt cccggcatct taattatgta tttttgcccg cccattggcg 120 ttttcggcat gaaactgcag attccgacat gaatcgtttg atatcgcatc caaaaaaatc 180 ctagaaagtg ctattaattt gacttaagat aacgcgaacc gataatataa gccttcattg 240 cttttatcag ccagcccttt gtattggccc ggccgaccgg cagaagtgcc ggccggctgt 300 ctgggacgtt ttaggccaaa attaaccctg ccgaagggtt ttttttaagt aagagacgtc 360 caaatgtttt ttcttagtat tttttcacaa cttaatcaga actttctgtt gatcaatgtt 420 tcgaaggttt tacaccttta tgtgaaatag atgataacac tttattttta tacacgaaac 480 tcatgcttaa gaagtagcga gcagagaagt ttttacataa cttaataata gccgttagtc 540 gttctaaaat ttacaaaagt ttgggacctc tgatataaaa tcttcggccc gcccaacggc 600 gggctcggca tgacagtata gaattgcctt ttttcaaaac ttcaaatatc agcaggccat 660 ttgagttgca aatcaatatt aataaaacaa atcattacac attatctcat tactttctga 720 cgttatttcg taaagttatt gcagtttaaa atcaaaccgc ccatgggcgg tctcggcgtg 780 aaagttaaaa aaatttttgc ccgccgatgg gcgggctcgg cactgaatgg g 831 // ID CR1-37_BF repbase; DNA; INV; 2894 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-37_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-37_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2894 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2894 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1608-1608 (2009). XX DR [2] (Consensus) XX SQ Sequence 2894 BP; 870 A; 754 C; 607 G; 663 T; 0 other; ggactgttac tagcggacat agtagataga tatggactac atcagtccca gcacgagggg 60 accaggttaa cccagacatg tactagtgta ttggacttaa tgatcttatc tgatcctgct 120 agaataaacc acatgtatac gttagcaccg ctaggtacgt cagaccactc tgtagttgta 180 tgtaaactta acttgatcct accaagtaaa tgtgtaaagc gtcatatatg gctatacaac 240 agagcaaact ttgatgtatt tagacatgag ttaagtaagg ctaattggca aagttgtaag 300 ataggttcgg tagataccag gtggaacagc tggaagcaga aattcctgaa catagcaaaa 360 aacaccattc caaacaaaat ggctaataca aactccagaa cccacaaacc ctgggttacc 420 ccagacatac tagagatgat cagacacaaa acccttcttt accatgagta caaacaatca 480 agaacacagg acagctggac agactatact acgtataaga acagtctaac atacgagctt 540 cgtaaagccg aacgtgaata ctacagcacc atcagtgaca agtttaaaac tcccgagggc 600 tcaaaactgt tatggtctgt tctcagcaat cttacaggaa agggaagtag aggtattcca 660 acactgtcct acaggggtgt tctagtggag aaagatggcg agaaagccga acttctaaat 720 gaaatatttg taaacatcac caaagactgc aaccgccctg acaggactaa gaaactacca 780 atacttacct ccaagcagtt tacatcactc caagtcactt cagaggaggt actaaccgta 840 ctacaaagtc tccaagtgaa caaagctccg ggacctgacg gcatcccaaa cagattactc 900 agggagactg cgcctgaaat ctgtgactcg ctacggtgtc tattcaacta ctctctggcg 960 actggactat tccctacgga gtggaagcag agcaatgtaa cccctgtaca caaaaagggc 1020 gacagaactg atcccttcaa ctatagacct atatccttgc taccaacagt agctaaggtg 1080 ctcgaaagac tggtccacaa ccatctgtat acctaccttg aagagaaccg ccttctgcat 1140 aaaaatcagt ctggcttccg aaaaggggac ggcaccgtcc tgcagctgat ccgacgagtc 1200 gacgactggg ccaagtccat cgaagatccg gacatctctt gcacagcagc cgtctttctc 1260 gacgtaaagc gtgcgtttga taccgtctgg catcaaggac tcacctacaa gttgactcgt 1320 tacggagtgg gagggcagct ggtccgctgg tttgacagct acttgacggg gaggcaacaa 1380 cgagtcgtca taaacggtgt cgcctcaacc tggggaacaa caacagcggg agtacctcaa 1440 ggcagtatac tgggaccgtt gttgttcctg ctgtacttaa acgatatgat gagcctccca 1500 tgcaagtcat cactaaattg ctttgcagat gacacatctc tctacaactc agccaagaca 1560 gtgcaggaag tggcagtcac cacaaatgca gatctacaac ttgtatccca ctggttttcg 1620 gactggggcc taagtctaca ccctgataag tgtaaggtgg tctgcatcaa atctaatcgt 1680 aacaacgtcc agctacctcc tatctaccta ctcgggaaga ttgtggaaca agtgcctttc 1740 tacagtcatc taggtgccac catccaccaa tccctcagat ggacagagca cgtacaagag 1800 gccacgagca aaagtagaaa actgctaggc ctcttgcgca aagtcagtgg caaacttgga 1860 agacaggcct tagaaacagc atactttgct ctagttcgac ccaagttgga gtacgcgtca 1920 gcgctactgg gtgacttggc aagctcagct agcaggatgc tagaacaggt ccagtaccaa 1980 gcaggccttc tcgtcacagg ggcgatgaag ggtaccccaa agtctaacct actgcaggaa 2040 ctggaatggg actctctagc caccagacga cagctcaact cactcaccat catgtacaaa 2100 atgacaaacg gacttgtacc cccacattta cagctgctca ctccgtcaac aagaggcgcc 2160 cagtctacaa cacgtctaca actacgcaac aacactcacc ttcatgttcc tcgttgccgg 2220 acacagacct tcaaaaacag ttttattccc ttcacgtctt cactctggaa ccagctacca 2280 cagacagtca gagacgcccg tagcctcgct gattttaaga gagtgagcaa aaagcacctg 2340 ttcaccacaa ggcaccacca aacttaccga cgactcggcc caaggcagag caacatccaa 2400 gctgccaggc tacgcatggg ttggtgtcaa ctgaacagta ccttgcacaa aatgaacatc 2460 aagagcaaac aatgtgtctg tggggcctct tctgaaacag tccaacacta cctactacat 2520 tgccccctgt actctgagca gcgtagcaaa ctcaccacca ctgttgaatg ccttgtacga 2580 caacggccta ctgtttcact actgctccac ggctcacctg accatgaccc tgtcatcaat 2640 agacagcttt cagaagcact gtgtgtacat gatatctacc aaaagattct cttccagccg 2700 tgccactgtc tagtaaattt tagctagtca ggcataagtg ttagctttac ctgcttttat 2760 tgcaacaact attttattct attactgtat atacatgttc atcttgtatt gtatatttat 2820 atgtctgtgg ccacgatacc agccatgctg cctagtgctc acagtgtatt gtcttgttac 2880 aacctgaata aaga 2894 // ID hAT-10_HM repbase; DNA; INV; 4244 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4244 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1999-1999 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 4244 BP; 1442 A; 727 C; 690 G; 1381 T; 4 other; tagacactag ataatcggtc tcgagtctcg atttctcgtc tggtatcggt gcgataccat 60 acagtatcgg tatcgaaatc acgcgatact ggtatcgaaa gtatcgttta tgttggtatc 120 gtttggtgcg ataccttaca gtatcgatat cgagatcacg cgatactggt atcgaaagta 180 tcgtttatgt tggtatcgtt tggtctcggt ctttcgtata attaatgaga tatataatgg 240 ctcatttatt ataaaaaaaa agttctctga gctggttgta tgacatcatg tttatcattt 300 aatttctcat ctggtatctt tgttttaatt taaataattg tgtggtacca cttctgtatt 360 ttaatttctc atctgttatc cttattacaa ttttttctac cgtgattaaa aggaaagaaa 420 agattttttt aacatttatt tctatattaa atttttacaa ataattaaat tatgttaaac 480 agtgtgtaag gggcgaacaa aaaggtaagg gagggtaagg tggtattcgg ggtaactacc 540 cccttcccct acctaagaaa tgtcaaactt caaaatatta ttgataaaat taaaaaaata 600 agaaaaaata caaaaaactt atataaccat tcaatatatt tattattact gactatctta 660 taacctttct aaaatcattc taaatatcaa atgaaacagt tattttattt taaaaaaagg 720 cgaaccttat tttttatatt tttttggtcc gactaccccc ctagaacaat gacctggatc 780 agcccctgca atgtgtagtt tcccatatcg ttattccaac tctatcgaat ttttattcca 840 ttcaagcraa aattttctgt gcggaattaa tctccaagga aatagttttg caatcattaa 900 agagtgtttt tcatataatt ctccaaagaa aattgcttst gatgcaatgg ttaatgatta 960 ttctaaacga gtattagaaa tggttgttca aagttttgta actatcatta aagagagctt 1020 ttcatataat tctccaaagg gaattgcttg tgatgcaatc gttaatgatt attctaaacg 1080 cttattagaa atgattgttc aaaggaattg atttcgtaac agtcattaca gttattttag 1140 tttaattaat tgtgtgatat cgcattcata ttttaatttc tcttctggta tcgttacttt 1200 attaaatctc tcgtggtatc gtaattctcg cctggtcttt gttctgtgtt caacattcaa 1260 taactaaact ttagttttaa acaattaaac ataagtatac tttagttatt aaagaaaact 1320 aatcttcact cgttatgcct ttaacaacat ctttaatctg ggattatttc aagaaaaata 1380 taggtaagca ctacttacta agtatttttt atactttatc tcaaaaggta attgccggca 1440 atgtcaggaa ataattctaa aattaattac cggcagaatt attattagat ataractart 1500 tgaactttaa aaacggtatt ttggaaacat atattgcaat ataaagttca aggaattatt 1560 ttatagtgaa ttcaagaatt gagaattaaa ctttttatac tagatataat aacttatctt 1620 caaacgcgtg agtttgcgtg gccgagcgga taaagttatg gcatataagc acggagttct 1680 aggttcaagt ctttaatgag aattaaactt tttttaattt ttttaactaa aaacgcagtt 1740 ttttttttac gcttttacgt tataactatt attgtttatt aactattagg atttaaaaaa 1800 aaaaaaaatt taattggtgg agcaataatt cataaatcgc aaagcacagt gtgctaaatt 1860 gattgccagg ttgtatgtcg gcccactctg aatttttttt tgtttcaaat aaagatacct 1920 actaactaat tgtaaaaaaa attaaattca cgcctatcaa agggtctttt tttggtaata 1980 atatttctga aattatttaa ctcctttgac attttttaga ggtaaaattt acagtaaatg 2040 cccccaaaac ttaattttgc ctcaggcccc aagtgagttc gaagcgtccc ggctctgttg 2100 tttttaataa aaataaaata aaattataat aaaaaaataa aaccatttta tttgttatta 2160 aactgtattt tacattttca ggagaatctg ctctgtgtaa aacttgtgga aataagcttg 2220 caacaaagca agcaaatact actggcttgc gaaagcatat tcaaacacaa cataggaagt 2280 tattcatcga acttcaaaaa aaagaagacg agcggagatc acaaggagat gaagaactaa 2340 acgtaatcga ggcaggtgct tctggatccg caaaagatgt agcctttaaa actctcactc 2400 tcccgaagaa ggcggttaca aaaaagtggc cacttcaaga tcggcgtcaa ttagtagttg 2460 atcatgacat cattactttg attgctactg agtgcttgcc gttcagtttt gctgattctg 2520 aaaattttaa gaggtttatg aacaaaatat taccaaatgc cacaataaaa cattcaacaa 2580 ctttctcaaa aaataaactc cctcggcttt accatacact gaaggtagta atgcatgaag 2640 ttatggaaaa agagctcata gatcttaatc aggttgccat aaccaccgac cactggacat 2700 caagggcaaa tgattcatac atgtccgtga cattacatta tatctcgcca gactttgccc 2760 taaagaaatt tacattagaa gtgtgcctat tcaaagagcg acacactggt atcaacattg 2820 caaaagcact tgataacaca ctctcatgcc ctgaatcatt aaaaataaaa tacccaataa 2880 gctgtgtgtt attgatcagg catcaaatat ggcatgtgct ctaaacaaca gtgtgtcttt 2940 aataactaaa gaagaaggct ctgttacatg cagtgatcac aaactcaaca ctgcacttca 3000 aagaacagtt gagggtaccc cggaactgaa agatgctttt cttaaagcca gctcacttac 3060 taccaggctt cataaatcga cagccttcag tgcttcgttg aaagatgctt gttccaggtt 3120 agatgtgagt tacctcaaaa taccctccac agtaaccacg agatggaaca gccattacga 3180 tatgctgcac gccatgcttc gactgaaggt tccactcata tacttacgcg atacagaagg 3240 tgatgattgg aagaaaaccg tacctacaga tgaacagttc ctactctttg aagctattgt 3300 tccagtgctt aaatctgtta aagaattatc agttttttta tcttctgaca cagagatccg 3360 aattgacatg tcactgtgga aagtaacagc actcatcaac ttcatcgaga aagaaagaaa 3420 taattatgtt actcatggaa acaacaaatt agtagaaaaa ttcttggagg acttattatc 3480 agaactaaat aagagattcc cagacaaagg ccgtaaatta cccgcatttg cactaggaca 3540 ttaccttcat ccatttttcg ggtgttgcaa gatcttggtt ggctactcca ccaacctctg 3600 catcatctga aagagcattt tctgcttctg gccttgtcat cactgatcgt agatacaatc 3660 tcgatagtga caaggcagat aagcttgtat ttataatgca gaactattcc gctcttgaaa 3720 attacatcaa aaaatggccc atcgaacaag aaaatgaaag tgaaagtgat ataggacatg 3780 aatcagactt atctttgcca atattaatag aagaaaatcc taaaatacca acgaaagtac 3840 ggcaaaggaa aaaacaacga atggacgata gccaaagtat tgcagattat taaaaaacaa 3900 taactaaatc tctaaaaagc aattgtactt gtattttgtt aataaaaaat tcaataagtc 3960 acattttaat tttttgtttt tttgttttat tgagagtctc gattcgagac catgcgatac 4020 tatacgattc tttttattat cggtatcgtt tggtatcgaa gcgatactga tagtatcgac 4080 attggaacat aagcgatacc acacgatacg agaccatcct tctcggtatc gagtggtatc 4140 gaaccgatac tgatagtatc gacattagta tataagcgat accacgcgat acgagaccat 4200 tcttctcggt atcgacttat acgataccga ttatctagtg tcta 4244 // ID Hoyak3 repbase; DNA; INV; 3271 BP. XX AC . XX DT 22-SEP-2009 (Rel. 14.09, Created) DT 22-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoyak3 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoyak3. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-3271 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 633..2333 FT /product="Hoyak3_1p" FT /translation="MAPSEVWKYFENKAVEGLAVCNICKSSLKNNRISNLK FT AHLQKQHKIHVNVISLENLETSSTSSSKIIRKKIKIEVNKKQLIRLYIGLV FT TEDSIPFNVLNSQNMRNILDPICEGLKTAEGKMMKLNAVNCKNTLEIVANN FT IRTHITAELKNTLLSLKIDSATRMSRNIFGISAQYINSAQIKSIILGMIEL FT KGAGSSGAKNLAAEVIKVLNKYNINLNQIVSITSDNGANMLKATNILSFVS FT EQNNEENDYECTNDDYLDKIDLIEKIPKVLIGNIEVCRCAAHTAQLVALDV FT NKSLDIIKYLLNCRNMTKYIRKPSNGYREIFELKKLKVPQLDCPTRWGSTY FT IMLKNLLEAKDILSKIESLKNRTDEKNFDVDASFWDFIETYCNVFSPLQKS FT IQRFQEEQLHYSNFYAQWLKLKILTEKIVNDSSHNLTKTIGNKILNSIETR FT TKTLLNNKFVVSCLFLDPRFQHILSPQQKTDAINHLKSIWDRLYSLNPTGN FT MCSTPSSSFNQIDNFFYDEEDEMLEAYLSQGVQSKVNSSIDVYTKIENIQL FT AYQRFDIDVLSYWMERKAVY" XX SQ Sequence 3271 BP; 1143 A; 568 C; 597 G; 963 T; 0 other; cagtgttcga aaagctgcag tcgaattagc agaagtcgca atatttttgc ttttcactgc 60 ttctgctgcg agcgatatca gaaagcgaac aattgcttct gcttctgctt ctgaaaaatc 120 agaagtcgca gtcaaagcat aagtaaaaat cccgcagcga cgaatatcaa aaatcagcag 180 cagtcgccgt ttttcgtttt ttcgctttct gctttgctgt tgcttttcgt cgctgctgca 240 gctaccgacg actttcggaa gtcgtcggca gtcgcagcag cataagtcgg tgccttcgac 300 tgtttgctgc ttttgatttt cgttgctatt gctgctgacc aacagtcgta gcagcataag 360 tcggtgcctt cgactgtttg ctgcttttga ttttcgttgc tattgctgct gaccaacagt 420 cgcagcagcg taagtcggtg ctttcgactg tttgctgctt ctgatttttg ctgccgagca 480 gcataaggcg acagacgcat gaggcgacaa atttcggaag tcatccgcag cgaccgcagc 540 tttcgcagcg tttatttcct catattaatt ttacttgtaa aacgtttgct gcttgattga 600 cacgtacgct cacttgcata tttctgctaa ccatggcacc aagtgaagtt tggaaatatt 660 ttgaaaataa ggcagtcgag ggcttagctg tatgtaatat atgtaaaagc tctctaaaaa 720 ataacagaat ttcaaattta aaagcacatt tacaaaaaca acacaaaatt catgttaatg 780 tcattagttt agaaaatctt gaaacttcat caacaagttc ttctaaaata ataagaaaaa 840 aaataaaaat tgaagtaaat aaaaaacaat taataagatt atatatagga ttggtaaccg 900 aagatagcat tcccttcaat gttctaaatt cacagaatat gagaaacata cttgatccaa 960 tttgtgaagg tctaaaaaca gcagaaggaa aaatgatgaa attaaacgca gtaaactgta 1020 aaaatacatt ggaaatagtg gcaaataata taagaacaca tataacagct gaacttaaaa 1080 acacgttatt atctttgaaa atagacagtg caacgcgtat gtccagaaat atttttggaa 1140 taagtgcaca gtatataaat tcggctcaaa taaaatctat aatattagga atgattgagt 1200 tgaaaggtgc aggatcaagt ggagccaaaa atcttgcagc cgaagtgatc aaagtgctta 1260 ataaatataa tataaattta aatcaaatcg tatctattac ttctgacaac ggggcaaata 1320 tgcttaaggc aactaatatt ttgtcatttg tgtctgaaca aaataatgaa gagaacgatt 1380 acgagtgcac caacgacgat tatttggata aaatagactt aatagaaaaa attcccaaag 1440 ttttaatagg aaatattgaa gtttgtcgtt gtgctgcaca cacagcacag ctggttgccc 1500 tcgatgtaaa taaatcattg gacataatta aatacctact taattgtagg aatatgacta 1560 aatatattag aaaaccatcg aatggatacc gcgaaatatt tgagcttaaa aaattaaaag 1620 tgccacaatt agactgtcca acaagatggg gatccacgta cataatgtta aaaaacttgt 1680 tggaagcgaa agatatttta agtaaaattg agtcacttaa aaacagaacg gatgaaaaga 1740 attttgatgt agatgcctcg ttttgggatt ttattgaaac atattgtaat gtttttagtc 1800 ccttgcaaaa atctatacaa agattccaag aggagcaact acattatagt aatttttacg 1860 cacaatggct taaactaaaa atattgaccg aaaaaatagt caatgattca agtcacaatt 1920 taacaaaaac tattggaaat aaaattttaa attcaataga aacacgaaca aaaacattgt 1980 taaataataa atttgttgtt tcatgcttgt ttttggatcc ccgctttcaa catatattgt 2040 ctccgcaaca aaaaaccgac gcaataaatc atttaaaatc aatatgggac agactatata 2100 gtctgaaccc tactggcaat atgtgttcaa ctcccagctc tagcttcaat caaatagaca 2160 acttctttta tgacgaggaa gacgaaatgt tagaggccta tttatcgcaa ggcgttcaat 2220 caaaagtaaa tagttcaata gacgtgtaca caaaaattga aaatatacaa cttgcatatc 2280 aaagatttga catagacgtg ctatcttact ggatggaaag gaaagcagta tactgaccag 2340 gagttatacg cagttagtaa cgtttgttat gccgtaccgc caactcaggt aaataaaaat 2400 ggcataaata atttaggttc cttcttagat agtttcttat tttataacag gtgtcgattg 2460 aacgagcatt ctccactctg aggctggtgc ttaccgacta ccgcaatcga ttaagccagg 2520 atgttcttga gaacatattg ctagtcaaat taaatccaac ctttttggac atggccattg 2580 atactttgcc actttttgaa aacgaagaac ttactgttag tggtttataa aataatagtt 2640 ataacagaaa gcaataaaaa tttatttatg tatcattttt caaaattcat atactaatat 2700 gtttttattt ttgctaattt tgtttctata atttttttaa aattgattag ttgtattatt 2760 ttataagcat tatacaaaaa gaaagtccaa aaaataaaaa agtacaagaa gtacgctata 2820 agtaatttcc aaaaaattta gtcattcgta attaaataaa agtcgcggct actgaatgaa 2880 tttgtcgcct catgcgtctg tcgccttatg ctgctcggca gcaaaaatca gaagcagcaa 2940 acagtcgaag gcgccgactt ctgctgctgc gactgttggt cagcagcaat agcaacgaaa 3000 atcaaaagca gcaaacagtc aaaggcaccg acttatgctg ctgcgactgc cgacgacttc 3060 cgaaagtcgt cggtagctgc agcagcgacg aacgtctacg atgaaaagca acagcaaagc 3120 agaaagcgaa aaacggcgat tgctgctgat ttttgatatt cgtcgctgcg ggatttttac 3180 ttatgctttg actgcgactt ctgatttttc agaagcagaa gcagaagcaa aatcatgtga 3240 aattaaactt acggcagttt tttaaacgct g 3271 // ID SINE2-2_CQ repbase; DNA; INV; 244 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A SINE family from Culex quinquefasciatus - consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW SINE2-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-244 RA Kojima K.K. and Jurka J.; RT "SINEs from the southern house mosquito."; RL Repbase Reports 11(1), 624-624 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. XX SQ Sequence 244 BP; 56 A; 63 C; 68 G; 57 T; 0 other; tgtcccgccc gtgtggatca atcggaccgc gcactggacc cacaatccag aggttgctgg 60 ttcgaatccc gcggcgggcg ctctaaaatt ctaagtgtaa atatgggtat ccggcgccgt 120 cgctccgtgc cgtactcaaa cacttaggag cccagggcgg cgaagtcctt gtagttaaaa 180 ggaagacact agtggttggt actagcaatg gtggccgaca gctataaagt caacttcgtt 240 tttt 244 // ID Gypsy-7_DPu-LTR repbase; DNA; INV; 228 BP. XX AC scaffold_126; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_DPu_; KW Gypsy-7_DPu-LTR; Gypsy-7_DPu-I. XX NM Gypsy-7_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 730-730 (2010). XX DR Genome; scaffold_126; Positions 283925 284152. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 228 BP; 46 A; 47 C; 64 G; 71 T; 0 other; tgttgtgtat ttgtgttgat catgtctgtc aagtgtgtat gcgtgattgg gttggcgcaa 60 agaatacgcc gctagaggag cggcgagcgt ggtggctgag gcggcattcg ggctcgcgtt 120 ctgatgcgaa caagttctac ttcatgtccg tctcgtttac atctcgtgtt aattcctcgt 180 gtatcgtttc aatacagtgg cgatctttaa tcgcatcaag acacaaca 228 // ID Gypsy-92_CQ-LTR repbase; DNA; INV; 204 BP. XX AC AAWU01007055; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-92_CQ_; KW Gypsy-92_CQ-I; Gypsy-92_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-204 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 564-564 (2011). XX DR GenBank; AAWU01007055; Positions 58339 58542. XX SQ Sequence 204 BP; 53 A; 43 C; 30 G; 78 T; 0 other; tgttgtatta cgttatttat taccatgcca tagtttcgta gtttatggac agtccctaca 60 aacgtcattc attcacgcgc gctttctctt tctttctcac ttttcattca ttctgtatcc 120 gagtgccgag cagtacagtc gtgtcaataa acgttagttt gttaagttaa tcaaacttgt 180 tttaattgac taccaaataa aaca 204 // ID REP-6_CQ repbase; DNA; INV; 748 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A repeat family from Culex quinquefasciatus - consensus. XX KW Repetitive element; nonautonomous; REP-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-748 RA Kojima K.K. and Jurka J.; RT "Repeats from the southern house mosquito."; RL Repbase Reports 11(1), 609-609 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. No TIRs. XX SQ Sequence 748 BP; 239 A; 145 C; 139 G; 225 T; 0 other; aaaaaacgtt acttaatcca cctttaggtg gttggtgcct tcctctcatt tatagagtga 60 ttccaacccc cctaaagtgt ccacatggtt tatggatctc ccctaacgtg atcgcagcac 120 ctaagaggtc ctaataaaaa taagtgatga gtaaaaaaaa cagactacct acagactttt 180 tgtcaagatt atacagacac cgcctttgag gaaaatggca tccctctaca cggctttttc 240 gtggcgatca gacccgacca tttgaggtta tgtaaattca gaccattttt taaactgctt 300 gtaattttag ataggtaagt cagatcttga aaattcttaa tccacaagaa aggtcatttc 360 agaacctttc taaaaatata taatatggca ggttttcatg caaaaaccac ccttttttgc 420 aaattgtgaa atttatatgc agcagttttt taagcataac ttttgatgtg ctttactaaa 480 tcttataaat ttcaataggg acttatggga ccccaagacg aatcgaatga ggccaatacg 540 gtcaaaatcg gctcagtcag tgacgagata ttccagtgac attgattcgg tacacatgtc 600 tacatacagc caaacacaca gacatttgct cagctggtga ttctgagtcg atatgtataa 660 atgacggtag gtctaggagg tctaattaaa aagttcattt ttcgagtgat tttatagcct 720 ttcctcagta aggtgaggaa ggcaaaaa 748 // ID HOMER repbase; DNA; INV; 3789 BP. XX AC AF110403; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Bactrocera tryoni transposon Homer putative transposase gene, DE complete cds. XX KW hAT; DNA transposon; Transposable Element; HOMER. XX NM HOMER. XX OS Bactrocera tryoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Tephritoidea; Tephritidae; Bactrocera; Bactrocera. XX RN [1] RP 1-3789 RA Pinkerton C.A., Whyard S., Mende A.H., Coates J.C., O'Brochta A.D. RA and Atkinson W.P.; RT "HOMER."; RL Direct Submission to Genbank (01-DEC-1998)Entomology, University RL of California, Riverside, CA 92521, USA. XX RN [2] RP 1-3789 RA Pinkerton A.C., Whyard S., Mende H.A., Coates C.J., O'Brochta D.A. RA and Atkinson P.W.; RT "The Queensland fruit fly, Bactrocera tryoni, contains multiple RT members of the hAT family of transposable elements."; RL Insect Mol Biol 8(4), 423-434 (1999). XX DR GenBank; AF110403; Positions 1 3789. XX FH Key Location/Qualifiers FT CDS 349..2208 FT /product="HOMER_1p" FT /translation="MAEKEDSIGCKIANGSYSLSGKRKGRSFVWNILSEIL FT KPDGTLLEGFVYCRTCLKVLKYYNGQTSNLIRHPCCRNIKTSQELRTVTVE FT DKNEAVKKFTSWVVEDCRPFSVVQGSGFIKLVKFFIKIGASYGEHVDVEDL FT LPSATTISRNVQKCANEKKEELKEEINNIVSSGGASATIDMWTDNYVKRNF FT LGVTFHYQKDLKFFDLVLGMKSMNYESSSGDNVLNKLSSMFKEYGVQNMSQ FT IKFITDRGSNMVKALKQNIRLNCSSHLFSNVLDKSFEETEELTDVLASCKK FT IVKYFKKANLQHLLPTSLKSQCPTRWNSNYGMIKSIIDNWFEINNIMTDNE FT QSQRLLHVNVSILKVLVSMCKDFETVFNKLQLCSSPSLCYVLPSILKIKNI FT CEFNNSDIAAISVLKENIRKKIEEIWVVNLSIWHKVATFLYPPAINMQPND FT LDDIKNFCISQIQNNAISAPTSATSAGNDSLSPTAISPTLTTHFSFNESNQ FT IDQSHSSSLCSKTGGDTFFFTYLVKRTTPQATEKPEDEVERYSREEVEMTE FT NFNLMEWWQSNQSKYPHLSKFALQIHAIPASSAAAERSFSLAGNLITEKRN FT RIAPGSVDSLLFLNTYYKNYNA" XX SQ Sequence 3789 BP; 1254 A; 651 C; 687 G; 1197 T; 0 other; cagagatctg caacggacac acaatttccg tgcacactcg aacacttacc caaaaccggc 60 ttgtatcgct ttctttgaac gagcagaagg attgctcgtt agcgactgtg cgcacgcgaa 120 cgtgctgatc acaacactcg agtgtcgtgc atacgtgttc gtttgggttc gggtgttttg 180 cgaatcaata tgttcgagtg tccgagtgta ggctcgagtg tccgagttta ggctcgagtg 240 tgcaaagcat tttttgttta tgtataatcg ttagtacagt tgattaattt gctttgtgag 300 tgttaggaat tttatatgat acatattaca tgatatatat tgaaaataat ggccgaaaaa 360 gaagattcta ttggttgcaa aattgcgaat ggcagttaca gcttgtctgg taagcgcaaa 420 ggccgaagtt tcgtatggaa tattttatca gaaattttga aaccggacgg aactttactg 480 gaaggttttg tttactgccg cacttgctta aaagtattga aatattataa tggtcagacg 540 tctaacctaa ttcgtcaccc atgctgcaga aatattaaaa catcacaaga attaagaacc 600 gtcacggtgg aagacaagaa tgaagctgta aaaaaattta catcctgggt tgttgaagac 660 tgccgaccat tttcggttgt gcaaggctca ggctttataa aacttgtgaa attcttcata 720 aagataggtg catcgtatgg cgaacacgtc gatgtggagg atttgttacc gagcgcaaca 780 acaatttccc gcaatgttca aaaatgtgca aatgaaaaaa aagaagaact taaagaagaa 840 ataaataata ttgtgagcag cggtggagct tcagccacaa tcgacatgtg gacggataac 900 tatgtgaagc ggaacttttt aggagttacc ttccactacc aaaaagactt gaaattcttc 960 gacttggttt tgggaatgaa atctatgaat tatgaaagtt cttcgggtga caatgttctt 1020 aataaactta gcagtatgtt taaagaatat ggtgttcaaa atatgagcca aattaaattc 1080 attactgaca gaggaagtaa tatggttaaa gctttaaaac aaaatatacg tcttaactgc 1140 agtagccatc tattttccaa tgtattggac aaatcattcg aggagacaga agagcttaca 1200 gatgtgttag cttcttgtaa aaaaattgtt aaatatttta agaaagcaaa tttacagcat 1260 cttctaccca cttcattgaa atcccaatgt cctactcggt ggaattctaa ttatggaatg 1320 attaaatcca taattgacaa ttggtttgaa attaacaata taatgactga taatgaacaa 1380 agccaacgat tattacacgt aaatgtatcc attcttaaag ttttggtttc aatgtgcaaa 1440 gattttgaaa ctgtattcaa taaattacaa ctttgcagtt ctccatcgct atgctacgtt 1500 cttccgtcca ttttaaaaat aaaaaacatt tgtgaattta ataatagtga tatagctgca 1560 atttcagtct taaaggagaa tataagaaaa aaaattgagg aaatatgggt tgtcaattta 1620 agtatatggc ataaggttgc aactttttta tacccacctg ctataaatat gcagccaaat 1680 gatttggatg atataaaaaa tttttgcatc tctcaaattc aaaataatgc aatatcagca 1740 ccaacttcag caacatctgc gggaaacgat tccttaagcc caaccgccat ctcacctact 1800 ttgacaacgc atttttcttt caatgaaagt aaccaaatag atcaaagcca ttcttcttct 1860 ttatgttcaa aaactggagg cgatacattc tttttcactt atttggttaa acgaaccacc 1920 ccacaagcca cagaaaagcc tgaagatgaa gttgagagat atagcagaga agaggtagag 1980 atgactgaaa attttaattt gatggagtgg tggcaaagta atcagagtaa atacccccat 2040 ttatcaaagt ttgcattgca gattcatgca atcccagcca gcagtgcagc tgcggagaga 2100 tctttttctt tagctggcaa cttaattact gaaaaacgta acagaatagc gccaggctct 2160 gttgacagtt tgctgttctt aaatacatat tataaaaact ataatgctta atggttggga 2220 gttatatcta aagtatgcgt gcaatgatag ttttataaac atattttcct ttatttatat 2280 ttaatttcaa gagtgataaa aagcatggaa tgcgtttttt ttaatgattg aatgattgaa 2340 atatcttttg tttatgcgaa gactgattta ataaaagtat attgaagaat gaaaacagaa 2400 gtttttataa ttatataagc atatttgtcg ccaagaggtt tggcgggaca ggttattgaa 2460 aacgtggtca atacccccaa aagtggacgt atgggcatcc tactctaggt tggatatttt 2520 cattcattta aaaagtatta taatctgcaa taattcagaa tcacgatcac gcaaacaaac 2580 aaagagtgtg aaactttgca ttatacactt cagaactcaa acgcattttg tacggacatt 2640 catcttaaaa ttattttaat ttcgtatgac acttttagca agcttaggca gctaagatat 2700 atacatttat atatttacat atatattcct tgcgctcgct atgattagga ataagcgaaa 2760 tcttaaattt aataaactta tactagcaaa aatcaaaata tcattgctca tataaattcg 2820 tatataaaat gtatgctaca attactatat agtagtccaa aaatatgaat tcttattttt 2880 cacagaaata cttttgctaa aaaaggatct ttttcttttc ttatttttcc cagaaataca 2940 tttgtaaaag gatcgtaatc cttttttatt ttcctcataa aagatttaaa cagttcaaga 3000 gctcaaaaca tgcgctacat cagctaccta ctaaattttt caatagctca aagcatgcgc 3060 tacatcagct cgaacctacc taccctacgc ctttgcgcaa atgattatct tatgataaca 3120 aatgcccaca tatagatttg gcaatggcaa tagcaataga tttgacagtt agcaagacat 3180 agattttatc ctgtcgagat gccccgtaag taagtatgaa tgtacatata tctatactag 3240 ccactctctt tattgattgg tttttcccaa aacacaaaat gcagttgcag tcaagtcaca 3300 ctgcttggaa acacacgtat gcatgtaaaa aagcaactat cattattctt ctctgattct 3360 taactgggtc ttatgttatg gttatgcaac tatattttta attttaaaat gcgtcattcc 3420 tttgatacat gcatacatat agtttaaatg ctgtatttca atgatattgg ctgttaaagt 3480 gtatacagca ttaataattt ctcatatttt tgctcagaaa aatgtgatac acttcttcta 3540 catccgtgtt ctgtccgagt atccgtgcaa tagttgaaca aatttcgggt tagcttcggg 3600 tgtcgagtgt ggacactcgc ttgtgatcgt tttgaagaga gtatgatcga acagactcga 3660 acacacagaa agcagctgcc gggtgtgcca ctcgatcgaa cccattgctg ccgagtgtgg 3720 cgctcgctcg tgctcgattt ttaatgaggg tgatcgaaca tgttcgaaca cacccaatgc 3780 agctctctg 3789 // ID L1-99_Cis repbase; DNA; INV; 7206 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-99_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-7206 RA Smit A.F.; RT "L1-99_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000729, Ci000099, Ci000682. Bit of an out-group; ORF2 (bp CC 3323-7135) protein closest to L1 subclass, but only 28% CC identical (48% similar) to nearest neighbor in other species CC (L1-1 in Drosophila). ORF1 (bp 1924-3345) protein has very CC faints similarity to L1-1-6 ORF1 pep zinc finger regions. 5' end CC seems to extend, even though there is already 1923 bp of it. 3' CC UTR is also very long for a subgroup and is separate in database CC as L1-99ext. XX SQ Sequence 7206 BP; 2463 A; 1442 C; 1432 G; 1839 T; 30 other; atgaaattgc aaaatttatt gtttttattg aattactacc atttctatgt ctacttgaat 60 ttcgtagacc agtcatatat tccgattttt ctatattaat tatgggtccc cttcgtattt 120 ccatattgat ttccctccgt ttattttttt tatatatttt gtttttctac gtcacgcttt 180 ttttcactgc tcttattttg aaacgacctt atattttaca aaaacaaaan tggatttttt 240 gttatcggaa tttgttatcg gaaagaaaaa gaggcctttt tgagcatttt gtgctttatt 300 gtcgatattt ccgatttaga aggggatgtt tactcaaatt tctaagtgag taaacgataa 360 acagcaaaaa gcatttaaat gacccaatgt acccggaaat ttcccaaaaa atttaaaata 420 cctaaccgaa cattaatata caatcctcgt acaaatgagg tggaaacaca aaactanata 480 tataaaaaat acgaaaacaa aacataaaat atacgaatta cacaaaaata tacaaanagt 540 anatcttctg gtggagttat agctatagca gttgatattt cggaacnggt gcggcttatt 600 cacataaaat tgcaaggaaa tttgtaggtt taacaatgat aaagttaaaa tacactgata 660 aagtgnctac atancgctaa taaacacgga aaatccgtta caatataaaa tccataacgt 720 aatatagtat attatttcaa cgtatataaa ttagcccaac ttttaaaacc attacaagta 780 tagtccatcg atgatttata ccaagcttaa acagttcagt agataatagt tatgaacttt 840 tgatatcggt aattttgaaa tantgtttat ttaattaata cacttttgta aaatactgtc 900 tttcaaacaa aaccacaaac aagacgcttt gttaaacaaa gcaatcagag cattctctta 960 ccgcggcgca ccaattaata aaaccgcggt tactaaaaat gattaccatt aagttttaat 1020 ctgagttaag gcagaaaatg ttagctaatt atgtgtagat gtgcacaaat ccgtagtaat 1080 ttccgacaaa attaaagata tcacctcaat ttattgtgtg aaaacgtgtt tatcttgccg 1140 aaaagatctg atgatgataa ttattccatg tggcccgtca cactcccagc acaagttacg 1200 tttcagtata tttccctcat cccacaatac catacggttc agatgaagta atcacgagca 1260 cctggttctg tttgccatgt tataatatgc cctgcatgtg ttctttgaag ttgctatcac 1320 tgtgtaagtt atagctcggt tctctgtttt agaantttgc catgttgatt catgtgaaac 1380 gtggtaaagc agtcggtagg gctttaaagc gaatgaacaa atccggtcat atctatcgaa 1440 ataagggcgg gaggttcaag ttgaccggga aagggcgtgg cttgaaggcc aagaaacgcc 1500 gccgtcgccg caagaagggc aggaaaggca aaaagaagcg aaagggaaag aaatccaaga 1560 agggcaaaaa gaagaaggga aagaaatcca agaaaggtaa aaagaagagg aagagacgac 1620 gatccaagaa acgtggccgc aagggaaaga agcgatcacg caaatctcct aaacgagcca 1680 accgtaaacg aaagtccaaa cgcaagaagg gacgcaaagg aaagaaaggg aagaagcgac 1740 gacgataagc gatcaatatt aaaaaatatt taaatcaaaa aaaagaaaaa tcaccgccaa 1800 taaattgcca ttcaggcgca ccaaacattg gtttctttgt accaacggtc taaccgcaag 1860 gtggcgttgt ttgtataaaa agttgtcggg gaatctcagt ggaacaacat agccattctc 1920 tttcagcagc ggtggtagca ggacgtgttt gttctgctac gccatgagcg ataaggcaaa 1980 aatgcatcaa gcagcgccac gtggcaacat gccaattaga agttacgcga gggcagcaag 2040 agccagagaa cacacaaagc aactggaaag accgaattgt ttatcaataa aaggaaaaat 2100 caggctcgac cgcccaaaaa cgctagaaat tattaaaaaa ttaaaatttc aaaaaaagat 2160 atttggaatc gccgaaatgc agggaggcat tattgatatc acctgcacct ccagagaagc 2220 agttcttaag cttcatgaaa ttttagaaaa acaagaggaa gtccagtcgg tgcgacttta 2280 ccaaaccgat aagattggca tcgcactggg ctgggtaccg attcccatgg aaaacggggt 2340 aatacaagaa gcattgaaag aattcggatg tgtacataaa attataagaa aaacagataa 2400 agatggaatg gtgactggca tgcgaattgt aataatgaac aaacaagaca ttaaagacaa 2460 cccaatacct agctacataa taattcaagg ttatgaagta tacgtaacgt acactggcca 2520 agaaaaaaca tgccgttttt gctccgaaac ggggcacgtg agactggatt gtcctcattg 2580 gcgcgacgaa ttcccngtga tgggcgaggc acgcgaaaaa tcagcccaat caggcccgac 2640 ttcccgcgaa gtcacgccgc gtgacgtcac cacgcccacg catgtagcct catccagcag 2700 cctcgctgcg gagtcgttgc taccaaaata tgatgaaggt atccttgaag caaatagcaa 2760 taaaaaaaca acagcaacat ctggaaataa aagaccactg tcaagcccgg aagaagttag 2820 tcattcaaaa ataaaatcta taagcctaag tgacaacaag ataagcgtgt ggtgcaaaac 2880 ttgcaacaag gagggtctgg ttgcagaaga aagcaccaaa tacctatgct ggggatgcaa 2940 aaacgaattt aacgtgataa agacatgctg caccctggac gaattcttct tagtctccag 3000 cgccgacaag tctgcagtct gtcccgcgtg ccgagaagga atgagactga tgccatgttg 3060 cacgacgtac caacctacaa tgcacatcgc cgatggaaca ctacaatgcc cgaactgtga 3120 aaggtattgt gcggattgca attgcggccg cttcaattat tttttaaaca aagagcttag 3180 caaaccctgt gaaaatttaa aatgtaagca cagatacatt cattgcaact gtgataaaaa 3240 atacgttaca gaaatatccc cagaacaacc gtttaaatgt aaatgtggct tcgaatatga 3300 ctacaatgtg aagaatggtg tcaagactac ttaatctttt tataatggct ttaagtgtcg 3360 caacttataa cgtagctgga attagagaca acgataaacg aaatgccgtc cttaacttta 3420 tccgacaaaa acgctttaat ataatattat tgcaagagac gcattgttca accgaaatcc 3480 aaaaacgttg ggaattagag tggggctcaa aaataatatg ggcgcatggc acgagtcgaa 3540 gtaaaggtgt ggcaatatta tttgacaaca aaacccctta taaagtaggg aagaccctta 3600 tcgatcccaa tggtcgtttt gtcatagcgg aggtcggatt cagcgcgacg acgtttgtac 3660 tcgttaatgt gtacgcgcct aataatgacg acccacagtt ttttcgagat ttgtttcaaa 3720 atatcgtttc cctcgccggt ataaatgaag taataattgg tggagacctc aatataatta 3780 tggacgctac cttagaccgc catcgncctg gagcaagcaa taaggcaaca gctactaatg 3840 tggtccgtgc gcatctgcaa acgctcgggt tggtggacgt atttcgatct cgaaacccgc 3900 agttgaagcg gttcacccgt ttccaagcga acccgtattc ggcgagcagg atagattacc 3960 tnctaacctc taaaaatatn ataccgaaaa taacaaattg caatattgtc cctagcataa 4020 agtcagatca caatatcgta tacntgaaac tggccctcga caccagtgat cgcggcagag 4080 gaatttggaa gctaaacacc tccctgctgg ataagaacgc ttttatttca accgttaaac 4140 aggcgatcac tgattatggc atcaataatc cacctggcca cgtgaatgct catgtgagat 4200 gggatgctct aaaatgcacg ctcaggggca catgtataaa cttcagcgcc cgactaaagc 4260 gatcggcgca gaaggaacaa cgacgcatcg aaaatgaaat aaaaatggcc gagaccgttc 4320 tcgccgactc gtcaatagtt gcacaatcaa atgtagaaac gttaaataag ttaaaatgtg 4380 aacttaacga tcttgtagaa cgccggtcca agggtgcaat aattcgaagt cgcgtaagat 4440 gggtggagca nggtgaaaag tgtnctaaat actttttaaa tctcgaaaaa cgaaatgcag 4500 aaaaaaaggc aatttatcga cttgagagca atggcgctat tttaacagat caagcagaaa 4560 tcgtaaacga acttgccaca ttttatgaaa gtctatataa gaaaagcgcg acgttaacac 4620 ccgtcgagga natatcttca catttgtcct cactggacat accgtgttta gatgaaaata 4680 ccgcgaagag catagaatat gccataactg aagtagaatg tcgtaaagcc ctggatacga 4740 tgccaaacaa taagtctcct ggtagcgacg gttttccagc cgaattttac aaactgtttt 4800 gggttgccat aaagtcttat tttatggacg cgctccattt aacagaaaga aacggatatc 4860 tgtcggaaac gcaaaagcac ggggtaataa cactaattcc taaacctaac aagaatttat 4920 tgctagcgac caactaccgc ccaatcactc ttttgaacgt cgattataaa ataatatcaa 4980 aagttataaa taacagaatc aagagcctcc ttcccccttt gatacatccc gatcaaaatg 5040 ggttcatgaa aggcaggtat ataggcaata acattcgact tttatttaac ttaatcgatt 5100 attgtgacgc tgaaagcata cctggggcaa tatttaacgt ggacttctat aaagcattcg 5160 atacgctaag ctggacgttt gtcaaccaaa tgttaaaacg atggggattc ggtcaacata 5220 ttcaaaattg gattagaacc ttttataccg acccaagctg ttctattatt aacaataatc 5280 acgtttccaa acaatttaaa tcccaacgtg gcgttcgtca aggtgatcct ttgtcaccta 5340 cattattcgt tttggcaatc gaaatattag cggtggaaat tcgctcgnca caaattcccg 5400 gcataacagt canggagaag gagataaaaa tttccatgtt ggcggacgat accatgataa 5460 cgctggacgg ggaggctaaa agctttaact tagcatataa aatcctggaa aactacggaa 5520 aaatgtccgg ctgcgtaata aacctacaga aatctaacgc gttttacatt ggcagtaaac 5580 ggtcgtgcga tttaaaaccc cttctaaata aaggattgcc gtggcctaac gacgagataa 5640 aatatcttgg cattgtattt cctattggtc gcaacaacaa tggtcgagat ctttttaata 5700 ttaatatgac ggaaccttta aataaaatca aaacgctgtt aaatatttgg gcctctcgaa 5760 ccttgactct actcggcaaa ntaactgtga ttaaatcctt agtcataccg acactgacgt 5820 acaaactaag tgtaatacct acgccgccac ctatatatag cgtcttctta aaaaaactta 5880 aacaaagcat ttttcacttc atatgggggt cgaaatggga acgnatcaag agaacnacgc 5940 tatgtagaga cattggcgag ggaggggcaa aaatgcttga tattgacgcc tatatcttgg 6000 cattgcaagc caaatggata anaaacttac tagactcgga ctaccatgca gcctggaaag 6060 acgtggaaaa tgtatatttt agtaaagcgg cattagcagc aacgctgcag tccaagttac 6120 gggaaacgtc caacgacatc gtccggctat tccccttgcg cgcgcttcgg accacggcag 6180 tagctgttag acgtgtatac tccagcggaa ctcgccttga tcgcattaac gaccctcgct 6240 acctctggtt gaacccaaat atcaaatcgc gcaatagaac attctttatt gtggacttgg 6300 cggaggccgg gataaccaac tactcacaaa ttattaacac cgagggtaac tatatgagct 6360 tcaaggagtt acgggaagct tattcgataa ccgatacctt agacaatttt acagagttca 6420 acaaattaat ctgcgccgtg ccggaagctt gggatnaagg cgatccacaa agcccaagta 6480 agaccgatta ttccaccaaa ttgtttgact tgatcgaaaa cttaaaagat taccaaacaa 6540 cgaaatcaat atataacgat gttatcgcag ctagcgcaac aacctccatn aaaccgcagg 6600 aaagctgggc gcgctcgctc gatattacca tcgaaaactg gggacaaatt taccgtgata 6660 attatttctg taccttagaa actcggtacc gctcctttca aattaaacta aataacagag 6720 ctatagtaac caatgataaa ttacatcgta tgggatttaa ggacactaat cggtgcggat 6780 tttgcaagat cgaaccggaa acattaaccc acctattcta tttttgcccc aaaatcaaag 6840 cttattggaa taatgtcgaa aactggttaa gcgcggcgtt tcgtagacca tncagcctaa 6900 ccccacaaaa ttgtttcttt ggtattgtta acaattgtgt catcaactgt attttattat 6960 gtgcaagatt ctgtatatac agagggaaaa caacggacac agtgcccacg attcacttac 7020 tcattagtga agtggaagcc ctnagaaaaa gggaaatgat aatagcaaaa aaannaaaca 7080 aactagantc tcatcaccgt aaatggacta ccattccaga gtgaagtgta ttaaatgtat 7140 tgctgtattt ttgtgtcgtt gcccgttcac cttcttctct aaataaacgt tcaaatgtta 7200 aaaaaa 7206 // ID DNA8-99_AP repbase; DNA; INV; 758 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-99_AP. XX NM DNA8-99_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-758 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2036-2036 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 758 BP; 321 A; 77 C; 65 G; 295 T; 0 other; cagtcttgta aaaaatactt ttgaaaaagt attaagatac taatactagt attcgaataa 60 aaaagtatta aaaatacttt taaaatacta ttataaaaat gtattttacg aatactcaca 120 aagtatttga atacaaatac aagtattttt tttaaatttt atgctatata atatatatat 180 tatattgtac accagtatag gtacctatta tgttactact taccatagat tttaacaatt 240 actattaaca tgtctgtcgt ctgactattt agagatatat acatgttcac atatttcatt 300 ctgcattcag atatatttca actaatattg attaaatttt aacgaaaaaa ataaactaaa 360 gagtggtatt aactattaat atggaactat tttaaaatta ttagattttc agttttctta 420 aaaaattcca atatagaaac catataaaat ttgatgtggt attttttaaa ttttaaacat 480 tgttattacc tacttttact atacaaaaac ataatattag ttttcaacga aaacaatatt 540 atttttattg atgtaaattt tttgtgaaaa ataataaatt aaaattaata aaacggactt 600 tatggtattt taatacaagt atttaaattt tgttaaaaat acttttaaaa agtattcaaa 660 atacttaaaa aatacattaa aaaaattgta tttagaatac catataaata cttcaaaaaa 720 gtattcaaat acttgtattc gaatacttta caagactg 758 // ID I_Ele4F_AAe repbase; DNA; INV; 5687 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-6_AAe; KW I-6B_AAe; I-6C_AAe; I-6D_AAe; I_Ele4F_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5687 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1375-1375 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >99% CC identity. CC The consensus is ~92% identical to I_Ele4E_AAe. XX FH Key Location/Qualifiers FT CDS 384..1685 FT /product="I_Ele4F_AAe_1p" FT /translation="METDGDIVSVVDKDSDKSSSTVKPIRIKTYPSTYLGP FT FVVFFRKKEKPINVLLISSEVYKLYKSVKEIKKISLDKLRVVFGSREDANA FT LLESKLFYNSYRVYAPCDSCEINGIIYDESLDCEDVLNYGSGVFKNKTISP FT VKILECVRLSKLLFSDKGSSYTHSNCIKLTFSGSVLPDYVDIDNVKFRVRL FT FYPKIMHCDRCLLFGHTSHFCSNKPKCLKCGGVHSPSECKKQSDSCIYCGK FT KHEFLKECSVYIAHQKQFNLKIKNKNKSSYSEVIKTSDVFSTKNMFEPLSQ FT NNDLNLNEEPNNFVYKPPIKRKRINKSNNHNHNLNPQPSTSYDQNFPPIKC FT SSSTQNIPGFQKTTPGFSGNHNDFNKNNNKSHNYEHDGDEGSILNILEDIV FT NFLGLNDFWKKIIEKCLPFLAKILEKLNSFGPLISSLFCS" FT CDS 1688..5380 FT /product="I_Ele4F_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASFKNSNLNILQWNCRSVIPKIDRLKALIANFDIDI FT FCLNETWLIDTKFFRIPSFNIIRKDRNIAYGGVMIGIRENIEFKFLNFIVD FT SPIEYVAILVKHKGLEFSIVCLYIPPQAKFSLQNLRTILNNVPSPFYVLGD FT LNAHNLAWGSDITDGRGELVMNLIDELNLNILNDGSFTRVAVPPANHSCID FT LSLCSNNLSIRSSWSIINDPNGSDHLPIKISIHDPSRDQFQHEPFVHDFTK FT NVDWNKFSDLVSSALINFDNSLPPLQNYNSFSKILIQCLHKSQSKKIFLGP FT SKKRPPCFWWDHDCTLALKNKSNAFKKFRKSGSRDHYIFYCKTEAQFTRVI FT KFKKRNYWKTFIENLDSETSLTKLWAVARNLRNYNIPSTSILEYSEDWIDQ FT FASKICPDFVPTPITFKKHQCYNYYPDLCNKFSIEEMELALTITNNTAPGI FT DNIKFIVLKNLPIDGKLHLLSLYNSFLFQNIFPMEWRSIKVVSLLKPDKNP FT SLVESRRPISLLSCLRKLMERMILNRLELWAEKHNIFSSSQYGFRKGRGTR FT DCIALLASQIELSFNKKQDVVSTFLDVSGAYDSVLIDLLFNKMNDCKIPII FT ISNFICNLFSFKIMHFFHNGSSRMVRYSYFGLPQGSCLSPFLYNIFTRDII FT SIIPNGCYFVQFADDNVIFINGKNREIIRHYMQNSLDNIHTWAHNNGFTFS FT AQKTKFILFSRKHSPVHIDLFLDNHQIEQVFDYKYLGIWFDSKLTWNNHIQ FT YVQKICSKRINFLRMITGTWWGAHPNDLITLYKTTIRSVMEYGCFAFGSAV FT QTHFSKLEKIQLRCLRISLKLMNSTHTKSVEILAGIIPLKNRIFELNCKFL FT IHCFSINHPLIDILKSLFEINSSNRMLKSFIYCSSENIIPNSSPGFHEYSI FT DIHSFCPYIDFSLYEELKQIPYHVHSNYANMLFKRKLIGVDNXQIFFTDGS FT LIGNVAGFGVYNFHLAHFYKLDSPCSIFTAELTAIYFACSLIKNYAPNIFV FT VCSDSFSCLQTLNSSKFHFKTHHILLSIKGLLHNLYSKGYMIKFVWVPAHC FT NIYGNEQADLLAKLGVSRGIIFNRDIQHFEYFSNLKKYTLNNWQISWNTSD FT QGRYCYSICPKVKVYPWFRYFSVGRNFICTFSRLMSNHYICKCYLYRMNII FT DSNICECNETYEDIDHIVFRCSRFNLPREQFFDRISSLGYDIPVSVRDILG FT NHNLPILKLLYQYLCGISCFV" XX SQ Sequence 5687 BP; 1797 A; 826 C; 881 G; 2182 T; 1 other; cgatcactat tatggttcat cttcggtaag taggccttga ccgggttaca ggtctttttt 60 tctgcgcttg tcattttttt tcgtttcagg tgaatttcta ccccgaagta ttgtcctttg 120 aaggtgattg cttttgcgtg ttggttgctg attagaattg cgtaaacaga ggaataagtg 180 tgctgcaaaa cttcaaatcg caagactcct ggttattgtt ggtgatacaa gccgtttgtc 240 gtttttgagt gctgttgctg gtgtcactga agtttattgg cgtttgttaa gtatttgttt 300 tcatttttag ttagattagt gtttcatttt attatttttt cggtgtttgc ttagttgaat 360 ttcgtccccg tcatttttcc attatggaaa ctgacgggga tatagtttct gtagtagata 420 aagattctga caaatcttca tctacggtta aacccattcg cattaaaacc tatccttcta 480 catatcttgg tccttttgtt gtgtttttcc gtaaaaaaga aaaacctatt aacgttcttt 540 taatttcgtc agaagtttat aaattatata aatctgtcaa agaaatcaaa aagatttctc 600 ttgacaaatt gcgggtcgtt tttggatctc gtgaggacgc taatgcgtta ttagagtcca 660 aattgtttta taattcatat cgagtctatg ctccatgcga ctcatgtgaa ataaatggta 720 taatttatga cgagtcatta gattgtgagg atgttttaaa ttatggctct ggtgtattca 780 aaaataaaac gatttcccct gtaaaaattt tagaatgtgt tcggttatct aaattacttt 840 tttccgataa aggttcctca tatactcatt cgaattgtat aaaattaaca ttttcaggat 900 ctgtccttcc tgattatgtg gacattgata atgttaaatt tcgcgttagg ctcttttacc 960 caaaaatcat gcattgtgat cgttgccttc tttttggcca tacatcacat ttttgctcca 1020 ataaaccaaa atgtttaaaa tgtggtgggg tacattctcc atctgaatgt aaaaaacaat 1080 ctgatagttg catttattgt ggcaaaaagc atgaattttt gaaagaatgt tcggtttata 1140 tagcacatca gaaacaattc aatctaaaaa taaaaaataa aaataaatca tcttattcgg 1200 aagttattaa aacttctgat gtgttttcaa ctaaaaatat gtttgaacct ttatctcaaa 1260 acaatgacct taatttgaat gaggaaccta ataattttgt atataaacca cctattaaaa 1320 gaaaaagaat aaataaatca aataaccata accataattt aaatccacaa ccttcaactt 1380 cttatgatca gaattttcct ccaatcaaat gttcaagctc aactcaaaat attcctggtt 1440 ttcagaaaac tactcctggt ttttctggta atcataatga ctttaacaag aacaataata 1500 aatctcataa ttatgagcat gatggtgatg agggtagtat tttgaatatt ttggaagata 1560 ttgtgaattt tttgggattg aatgattttt ggaaaaaaat cattgaaaaa tgtttaccct 1620 ttttagcaaa aattcttgaa aaattgaatt catttggacc cctcattagt tccttgtttt 1680 gttcctaatg gcttcattca aaaatagtaa cttgaatatt ttacaatgga attgtcgtag 1740 tgtcattcca aaaattgata gacttaaagc tttaatagca aattttgata ttgatatatt 1800 ttgtttaaat gaaacatggt tgattgatac taaatttttt cgaattcctt cattcaatat 1860 aattcgtaaa gatcgtaaca tagcttatgg gggtgttatg atcgggattc gtgaaaacat 1920 tgaatttaaa tttttgaatt ttattgttga ttcgccaatt gaatatgtgg ctattttagt 1980 caaacataaa ggtttggaat tttcaattgt atgtttgtat attcctcccc aagcaaaatt 2040 tagcttacaa aacctcagaa caatattgaa taatgttcct tcaccatttt atgtacttgg 2100 tgacctaaat gctcataatt tagcttgggg tagtgacata actgatggta gaggtgaact 2160 agttatgaat ctaattgatg aattaaattt aaatattctg aatgatggat ctttcactag 2220 agttgcggtc cctcctgcta atcattcttg tatagatttg tcgctttgtt caaataattt 2280 gtccataaga tcttcttggt ccatcattaa tgatccaaat ggtagtgatc atttacctat 2340 taaaatatca attcatgatc cttcgcgtga tcagtttcag catgaacctt ttgttcatga 2400 tttcactaaa aatgttgatt ggaataaatt ttcagattta gtttcttcag ctcttatcaa 2460 ttttgataat tcgcttccac ctcttcaaaa ttataacagt ttttcaaaaa tcttaattca 2520 atgtttacat aaatctcaga gcaaaaaaat atttttaggt ccttctaaaa aaagacctcc 2580 ttgtttttgg tgggatcatg attgtacttt agcacttaaa aataaatcaa atgcattcaa 2640 aaaatttcga aaatctggat ctagggatca ctatattttc tattgtaaaa ctgaagctca 2700 gtttactcga gttataaaat ttaaaaaaag aaattattgg aaaactttta ttgaaaatct 2760 tgattctgaa acatcattaa ctaaattatg ggcagttgct cgaaatttaa ggaattataa 2820 tattccttct acatctattt tggaatattc agaagattgg attgatcaat ttgcttctaa 2880 aatttgtcca gatttcgttc ctacccccat cacattcaaa aaacatcaat gttacaatta 2940 ttatcctgat ctctgcaata aattttcaat tgaggagatg gaattggcat taactattac 3000 taacaacact gctcctggca ttgataacat taaatttatt gttttaaaaa atttaccgat 3060 tgatggtaaa ttacatttac tttctttata taattcattt ttgtttcaga atatttttcc 3120 tatggaatgg cgttctatta aagtagttag tttacttaaa cctgataaaa atccttcatt 3180 agtagaaagt agaagaccta ttagtttatt atcgtgcctc cgtaaactga tggaaagaat 3240 gattttaaat cgtcttgaat tgtgggctga gaaacataat atattttcat catctcaata 3300 cggatttaga aaaggtcgtg gaactcgaga ttgtattgct ttacttgctt cacaaattga 3360 actatcgttc aataaaaaac aggatgtagt ttcaacattt ctggatgttt ctggtgcata 3420 tgattctgta ttgattgatt tattgtttaa taaaatgaac gattgtaaaa ttcctatcat 3480 catttccaat tttatatgta atttattttc tttcaaaata atgcattttt tccataacgg 3540 atcttcaaga atggtccgtt atagctattt cggtttacct cagggttctt gtttaagccc 3600 gtttttatat aacatattca ccagagacat catttccatt attcctaatg gatgctattt 3660 tgttcaattt gcggatgata atgtcatttt tatcaatggc aagaatagag aaattattcg 3720 tcactatatg caaaattctt tagataatat tcacacttgg gcacataata atggttttac 3780 attttcagct caaaaaacaa aatttatatt attttctcgc aaacattctc cagttcatat 3840 tgatttgttt cttgataatc atcaaattga acaagttttt gattataaat atcttggtat 3900 atggtttgat tcgaaattga cgtggaataa tcatattcaa tatgtccaaa aaatttgttc 3960 taaaagaatt aattttcttc gtatgattac tggtacatgg tggggtgctc atccaaatga 4020 tttgataaca ctttataaaa caactattcg ttcagtaatg gaatatggtt gttttgcttt 4080 tggaagtgct gtacaaacac atttttccaa acttgaaaaa attcaacttc gttgtttgag 4140 aattagttta aaattaatga attctactca taccaaatct gttgaaatac ttgctggtat 4200 tattccactg aaaaatcgta tttttgaatt gaactgtaaa tttttaatac attgtttttc 4260 aattaatcat ccgctaattg atatattaaa atccttattt gaaataaatt ctagtaatag 4320 aatgttgaaa tcgttcattt attgctcttc agaaaacatt attccaaatt cttcaccagg 4380 ttttcatgaa tatagtatag atattcattc cttttgtcct tacattgatt tttctttata 4440 cgaagaatta aaacaaattc cttatcacgt tcattcaaat tatgctaata tgttatttaa 4500 acgcaaattg attggggtgg ataatwatca aatatttttt acagatggtt ctctgattgg 4560 aaatgtagca gggtttggag tgtataattt tcatttggcc catttttata aattagattc 4620 tccctgttca attttcacag ctgaattgac tgctatatat tttgcatgca gtttaattaa 4680 aaattatgca cctaacatat tcgtggtgtg ttcagatagt tttagttgtc tgcaaacttt 4740 aaattctagt aaatttcatt ttaaaaccca tcatattctc ttgtcaataa aagggttatt 4800 gcataattta tattctaaag gatatatgat taaatttgtt tgggttccag ctcattgtaa 4860 tatttatggc aatgaacaag ctgatttatt agcgaaattg ggtgtttcgc gtggtataat 4920 attcaatcgt gatatacaac actttgaata tttttctaat ttaaaaaaat atactctaaa 4980 taattggcaa atttcatgga atacaagtga ccaagggaga tattgctatt ccatttgtcc 5040 aaaggtaaag gtatatccct ggtttagata cttttcggtt ggacgtaatt ttatttgtac 5100 cttctctaga ttaatgtcca accattacat ttgtaaatgt tatttatacc gtatgaatat 5160 tatagattca aatatttgtg aatgtaatga aacatatgaa gatatagatc atattgtatt 5220 tcgatgttct cggttcaatt tacctagaga gcaattcttt gacagaatca gtagtttggg 5280 ttatgatatt cctgtatctg tccgagacat tttgggaaat cataacctac caatattgaa 5340 attgttatat caatatttgt gtggaatttc ttgttttgtt tgatacttgc tgctttgttt 5400 ttctttttta ttttcagata taacaagttt ggcctccatt tgtgttgtcg ataggcatcg 5460 tggatacgcg tgggaggacc tatatgacga tttttccgga tgattcggct ctgtgatgga 5520 tccattccga atgagcctgt agttttaata tttattttat aacgttctat tagaaaagat 5580 aaagaggttt tgtgcccttt tgagaaagat ttcgaaagga aatcactcaa aggggctttt 5640 ccctctttca aaattattga gttaataaac aataacaata acaataa 5687 // ID BEL-32_CQ-I repbase; DNA; INV; 1601 BP. XX AC AAWU01004057; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-32_CQ_; KW BEL-32_CQ-LTR; BEL-32_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1601 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 217-217 (2011). XX DR GenBank; AAWU01004057; Positions 9463 7863. XX CC 'TAAGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 19..1599 FT /product="BEL-32_CQ-I_1p" FT /translation="MVIAESVTMEKLEHLRTVHLLRVNGVLAAAGKLAERK FT ASHSEATERLDQLHGLAGDFRWTQQRIEEILDDPEAIATARFKAEEFKAAY FT QGAKELLEKHISSTKPVRAVPRNGSTGSLLEEIRKFSIEVKEWNATCVGQF FT GAVREGNDDSNGSNSVSNRAPSSGPSTTTNKNGAKKINGQSGAVGCGDADG FT KQLPSCAPVLGPTAAPVMDSALEDYNIEPTGGIQISNWLPQSEKLLPPIEV FT STFHPGWTSWNRSKDGPETFSEDVEKASVSDNNAKMEARDSRGAQRKAHIL FT PTEAVSVRGNDAALLQASLGSKVRARLGLPHSSEKVVATGQQLQATSAHHV FT SQRENGRLIKREQEHCGSARQRPEQQDLARSIDVFRASGQWKQVKINGGVP FT SAHTVSSASNVPRNQSINSMEEQANLDKITFTQNRTGSHVGRTSRKLTLSR FT ALLENTDRKSTLQQGTTTRITWSNHSRFDDQRIIVRLLLYDSVPAPESGNV FT TTGTTQRGQARCHGRETPRKGLRLRCLKAGE" XX SQ Sequence 1601 BP; 437 A; 415 C; 480 G; 269 T; 0 other; ttggtccttc gtcgtcggat ggttatcgcg gaatcggtta ccatggagaa actggagcac 60 cttcgaacgg tgcatttact gcgtgtgaac ggggtgttgg cggcagccgg gaaactcgcc 120 gagcggaagg cttcccacag cgaggcaaca gaaaggctcg accaactcca cggacttgct 180 ggagacttcc ggtggacaca gcagcggata gaggagattc tggacgatcc ggaagcaatc 240 gccacagcgc ggttcaaagc ggaagaattc aaggcggctt accaaggagc caaggaactg 300 ctcgaaaaac acatttcgag caccaaaccg gttcgagcag ttccacggaa cggtagcacc 360 ggaagcctgc tggaggaaat ccggaagttc agcatcgagg tgaaggaatg gaacgcaacg 420 tgcgtcggcc agttcggagc cgttcgtgaa ggcaacgatg attcgaatgg cagcaattcg 480 gtgtcgaatc gtgcaccctc ctcgggtcct tcgacaacaa caaacaaaaa tggtgcgaag 540 aagatcaacg gccagtctgg agcagtggga tgcggcgatg cagatggcaa acagctgcca 600 agttgcgcgc ccgttttggg tccaacggcg gcaccggtga tggattctgc gctggaagat 660 tacaacatcg aaccaacggg aggaatccag atatcgaact ggttacccca atcggagaag 720 cttcttcccc caatcgaggt gtcgacattc caccctggtt ggacgagctg gaatcggagc 780 aaggatggtc cggagacgtt ctcagaggac gtagagaagg catccgtcag cgacaacaac 840 gcgaagatgg aggcacgaga ttcaagagga gcacagcgga aggcccacat tctgccaact 900 gaagctgtca gtgttcgggg aaacgacgcg gctcttctcc aagcatcgct gggttccaag 960 gtccgtgcca gactcggtct gccgcattcg agcgagaagg ttgtggccac cgggcagcaa 1020 ctgcaagcaa caagcgctca ccatgtgagc cagagggaaa acgggcgcct gatcaagcgg 1080 gaacaggagc actgtggttc agcgcggcaa cggccggaac agcaggacct cgcacggtct 1140 attgatgtct ttcgtgcctc aggtcagtgg aaacaagtca agatcaacgg tggtgttccg 1200 tcagcacata cagtgtcatc cgcatcaaac gtgccaagga atcaatccat caactcaatg 1260 gaagaacaag caaacttgga caagatcact tttacgcaga acagaacagg aagtcacgtc 1320 ggacggacgt cgaggaagct gactctcagc cgagcgctgc tggagaacac agatcggaaa 1380 tctacattac aacaaggaac aactactcgg atcacctggt cgaatcattc tcggttcgac 1440 gatcaacgca tcatcgtacg actgctgctc tacgattcgg tgccagcccc agaaagcgga 1500 aacgtcacga caggaacaac gcagcgcggt caagcgagat gccacggtcg agaaacgccg 1560 aggaaaggat tgaggctcag gtgcctcaag gcgggggaga a 1601 // ID TTAA3_AP repbase; DNA; INV; 429 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA3_AP. XX NM TTAA3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-429 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1783-1783 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 429 BP; 134 A; 74 C; 78 G; 140 T; 3 other; gaggacgcca caccggcatt tgttgtctcc gtcttacaag tgcgtaacat agcaaattta 60 cgctcagcag atcacgttta gctccgttag tttaaaaatt agagtgaatc gacctattat 120 gaaacttgat ggtaagaaca ttatctgtgt ttgtatgtgt tgggtttttt acgataattc 180 agtttttaag tgagttatga gcattattta atttacgcta tattatawat gmtcataact 240 tacttaaaaa ttgaactatc gtaaaaaacc macatacaaa cacagataat gttcttacca 300 tcaagtttca taataggtcg attcactcta atttttaaac taacggagct aaacgtgatc 360 tgctgagcgt aaatttgcta tgttacgcac ttgtaagacg gagacaacaa atgcgggtgt 420 agcgtcctc 429 // ID Gypsy-63_AA-I repbase; DNA; INV; 4309 BP. XX AC AAGE02026010; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-63_AA_; KW Gypsy-63_AA-LTR; Gypsy-63_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4309 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026010; Positions 44964 49272. XX CC Positions [3232-3690] - Integrase core CC 'CTAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 39..944 FT /product="Gypsy-63_AA-I_2p" FT /translation="MAENDPNAVFEGAQQGAQGAAPPAAPRLVLPAVVPAA FT VPPSFSIDPFDRHKMKWSRWVQRLEGAFVLFGIQNDVRLYMLLHYMGGETY FT DILSDKIAPELPQQRTYDQIVAVLEAHFNPEPLEILENFRFKCRKQADDRP FT EETIDEYLIALRKLAITCNFGPYLNTALRNQFVFGLKDRAIQSRLLEVRNL FT TLDRAREMAVSMELSNKGGREIQAKQGKSEVNLVENSRGGAAKGNYAKAGK FT MKSEKFERKKKGECFRCGSDAHFADKCIHIRTQCNYCKLPGHLEKVCIKKK FT KSAQGIGEEI" FT CDS 1228..4230 FT /product="Gypsy-63_AA-I_1p" FT /translation="MLKVNVLYGEKSLVLPLYVSNVKKHPLLGRQWMRAMR FT IDLNEIAYAEVNKLAVATTPALESTPAAVKSLVEKYAALFDESTGQIKGVS FT AKLCLKPNTNPVYLRARSVPFSMRAAVEEELDKLINEGVLVKVNHSSWATP FT VVPVLKANNKVRLCGDYKVTVNPNLVVDDHPLPTIEELFANVAGGEKFTKI FT DLTQAYLQLQVEEGDQEVLTLNTHKGLYRPTRLMYGIASAPAIWQRLMEQI FT LNGIPGVTVFLDDIRITGPNDRVHMERLEEVLKRLRSYNMRINLQKCQFFA FT NEIDYCGYLINKTGIHKAKKKITAIQNMPVPENKDQVRSFVGLVNYYGRFF FT PNLSSTLYPLNNLLKNDVRFEWSRQCDEAFNKIKLEMQSNRFLVHYDPTQS FT LVLATDASPYGVGAVLSHVYSDGSERPIQYASQTLNKTQQAYKQVDREAYA FT IIFGVRRFYQYLYGRRFTLLTDNKPLKQIFSDTKGLPKMSALRMHHYATFL FT ASFNYEIKFRPTSQHYNADAFSRLPLKKKPAENVIEEADYLEINIIETLPL FT TVDELAKATAADQTVKVLLQGLRNGKEVDARERFGVDQLEFTLQKGCIMRG FT IRVYVPPSLRAKVLAELHSTHFGTSRLKALARGYVWWEKIDNDIEDVIRNC FT NNCQSTRAEPAKVPTYCWETPSKPFERVHVDFAGPFKNTYFIVLVDAYTKW FT PDVRILNNITTATTIRVCREYFSTYGIPSVLVTDHGVQFTAEEFQRFLKMN FT GVYHKMGAPYHPATNGQAERFVQTFKSKLKSLQCDASTMHAELCNILLAYR FT KTIHPATGKSPSMMLYNRQIRSRLDLLVPDSAAARKQVDEKVRNLSEGTRV FT AARDYMDKTMWKYGVVSEKLGMLHYHVQLDDGRTWKRHIDQLREVGPNLPK FT GRDQTTPVLVEDVPVEVPQPNVPEPAASASVSVTPNVPSPSQEVPAESPPE FT IPGPSEQRSVGAASGLRTPAVSNQKANHESPRRSGRVIRPPNRLNL" XX SQ Sequence 4309 BP; 1167 A; 1007 C; 1162 G; 973 T; 0 other; ggtattttgg cgacgagtgg ttgtggacgc tttgagagat ggcggaaaac gatccgaatg 60 ctgttttcga gggtgcacag caaggtgcac agggagctgc tccgcccgct gctccgagat 120 tggtgctacc ggctgttgtt ccagctgcag tgccgccttc gttttcaatc gacccattcg 180 accgacacaa gatgaagtgg tcgcggtggg tgcagcggct cgaaggtgcg tttgttttgt 240 ttggcataca gaacgatgtt cgtctctaca tgttgctcca ctacatgggg ggagaaacct 300 acgatatatt gagcgacaaa attgcccccg aactaccgca gcagaggacg tacgatcaga 360 tcgttgctgt gctggaagct cacttcaatc ctgaaccact ggagattcta gagaatttcc 420 gtttcaagtg ccgcaagcaa gcagacgacc gcccggagga aacgatcgat gagtacctaa 480 ttgcactgcg aaagctcgcg atcacctgca atttcgggcc gtatctcaac actgctctgc 540 ggaatcagtt cgtgtttggg ttgaaggaca gagcgataca atcccgacta ctggaagtgc 600 gaaacctgac gctggatcga gctcgggaaa tggccgtttc aatggagctg tcgaacaaag 660 gcggacgaga aattcaagcg aaacaaggaa aatcggaagt gaatcttgtg gaaaattccc 720 gaggtggtgc tgccaaaggc aactatgcga aagcaggcaa gatgaaaagt gagaagttcg 780 agcggaagaa gaagggcgag tgttttcggt gcgggagcga tgcgcatttc gcggacaagt 840 gcattcacat tcgcacacag tgcaactact gcaaactccc cggacatttg gaaaaagtgt 900 gcataaagaa gaaaaaaagt gctcagggca taggggaaga aatctgaggc aaattagctg 960 gaagagccca acggtaacca aaatgattcg aaagtgattt cgctggacga agtatgtaag 1020 ctggaggttg cggccaagcc gccatcgacg aagttttggc tggtggttca agtgaatagt 1080 gcgaagatcc ggtttgaggt tgacacgggt gccccggtta cgattattag tgccgaagat 1140 aagaagaagc atttcgctac cgatagtctg ctgccgtcgg atctgaagtt ggtgagtttt 1200 tgcgacacga aaatccaaat tcatggaatg cttaaagtga atgtgctata cggtgaaaaa 1260 tccctggttc ttccgttgta cgtttcgaac gtgaagaaac atccgctgct ggggcgacag 1320 tggatgcggg cgatgcgaat tgacttgaac gagatcgcct atgcggaagt aaacaaactg 1380 gctgttgcta ctactcctgc attggagtcg acgcctgctg ctgttaaatc gcttgtggaa 1440 aagtacgccg ctctgttcga cgagtcgact ggccagatca aaggagtgtc tgcgaaattg 1500 tgcctcaagc caaatacgaa tccagtctac ctgcgagcca gatccgttcc gttttcgatg 1560 cgagcagctg tcgaggagga acttgataag ctgatcaatg aaggagtcct agtcaaggtc 1620 aaccacagtt cgtgggctac gcccgtagtt cctgttctca aggccaacaa taaggtgcga 1680 ctatgcggtg actacaaggt tacggtcaac ccgaacctgg tggtcgacga tcacccattg 1740 ccaacgatcg aagaactttt cgccaacgta gctggagggg aaaagttcac taagatcgac 1800 ttgactcaag cctacctaca actgcaagtc gaggagggtg atcaggaggt gctgaccctg 1860 aatacgcaca agggtctcta caggcctaca cgtttaatgt acggcattgc gtctgcaccc 1920 gctatttggc agagactgat ggagcagatc ctgaacggga tacctggcgt caccgttttc 1980 ttggacgata tccgaattac gggtccgaat gaccgggttc acatggagag actggaggaa 2040 gttctgaagc gtctccgttc atacaacatg cgcatcaatc tgcaaaagtg ccaattcttt 2100 gcgaacgaga tcgattactg cgggtaccta atcaacaaga caggtatcca taaggcgaag 2160 aagaagataa ctgcgatcca gaatatgcca gtaccggaga ataaggatca agttcgttcg 2220 tttgtgggac tagtgaacta ttacggtagg tttttcccaa atcttagttc caccttgtac 2280 cctctcaaca atttactgaa aaacgacgtg cgcttcgaat ggtcgaggca atgcgatgag 2340 gcgttcaaca agatcaagct cgagatgcag tcgaataggt tcctggtaca ctacgatccc 2400 actcaatcgt tggtacttgc caccgacgct tcgccgtatg gagtcggcgc agttctgagt 2460 cacgtctact ccgatggctc agagcgacca atacagtacg catctcagac gctgaataag 2520 acgcaacagg cctacaaaca ggtcgaccgg gaagcatacg cgatcatatt tggagttcgc 2580 cgcttctacc agtacctgta cgggagaaga tttacgctgt tgactgacaa caaaccgctt 2640 aaacagatct tttctgatac gaaaggcctt cctaagatga gcgcactgag aatgcatcac 2700 tacgccacat tcttggcatc tttcaactac gaaatcaagt tcagaccaac gagccaacac 2760 tacaatgcgg acgccttttc gcgtttgccg ctgaagaaga agccagctga gaacgtgatt 2820 gaggaggcag attatctaga aatcaatatc attgaaacgc taccgctgac tgtggacgag 2880 ctcgctaagg cgaccgcggc ggaccagacg gtgaaggtgc tgctgcaggg attgcgtaat 2940 ggaaaggagg tggatgcaag agaacgtttc ggtgttgatc aactggaatt cacgctgcaa 3000 aaagggtgta tcatgcgtgg cattcgtgtc tacgtacctc cgagtttgcg tgccaaggtt 3060 ctagccgagt tacactcaac acatttcggc accagtcgat tgaaagccct tgctcgtgga 3120 tacgtttggt gggaaaagat cgacaacgac atagaagatg taatacgcaa ctgtaacaac 3180 tgtcagtcaa caagggctga acccgcaaaa gttccgacgt attgctggga gacaccatca 3240 aagccgtttg aacgggtgca cgtagatttt gcagggccgt tcaagaacac gtacttcatc 3300 gtgttagtgg acgcctatac gaaatggccc gatgtacgca tcctgaacaa cataaccacc 3360 gctacaacga ttcgagtttg tcgagaatac ttttcaacgt acgggattcc ctccgtgctg 3420 gtgacggacc atggcgtaca gttcactgcg gaggaattcc agcgcttcct gaagatgaac 3480 ggagtttacc acaaaatggg agcgccgtac cacccggcaa cgaatgggca ggccgagaga 3540 tttgtccaaa cgttcaagag taagttgaaa agcctgcagt gcgacgcgag tacgatgcat 3600 gccgagttgt gcaacatact acttgcctat cggaagacta tccatccggc gacgggcaag 3660 tcaccgtcta tgatgctgta caacagacag atacgatcgc gacttgatct gctggttccg 3720 gatagtgctg ctgcgagaaa gcaagttgac gaaaaagttc gcaatctctc tgaggggacg 3780 agagtagcgg cgcgcgacta tatggataag accatgtgga agtacggtgt ggtttccgaa 3840 aagctcggaa tgctgcatta ccacgttcaa ctcgacgatg gcaggacatg gaagcgccat 3900 atagatcagc ttagggaagt tggtccgaac ctaccaaagg gacgagatca gacaacacca 3960 gttttggttg aggatgtgcc tgttgaagtt ccgcaaccga atgttcccga gccagctgca 4020 tcagctagtg tcagtgttac cccgaatgta ccatcgccta gtcaagaagt tcctgctgaa 4080 tccccgccgg aaattcctgg cccttcagag cagagatccg tcggagctgc ttctggttta 4140 cggacacctg ctgtctccaa ccagaaggcc aaccatgaat caccgaggcg atcgggcaga 4200 gtcattcgac cgccaaatag actcaatctt tgaaatgttc ctaacactaa atatttcatt 4260 attactattt tgtattgaag ttgaatttca tttgttaggg gggaagagc 4309 // ID L1_Ele4 repbase; DNA; INV; 4683 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele4. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4683 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4683 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >93% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 168..1406 FT /product="L1_Ele4_1p" FT /translation="MAQVKPRENTFRVDLTNFPKRPSFEDLHLFVHNVLGL FT KLEQVVRLQMNHVQNCVHVKCATLKIAQDTVDLHNGKHEITINKTKVKVRL FT VMEDGGVEVKIHDLSENIRNDDIVAYLKQYGEVITLQEQMWGDIYTFKSRP FT SGVRVAKMILRRHIKSFVTIQGEQTLVTYRGQPQTCKHCMNAIHTGISCVE FT NKKLIGQKTDLNARLNSDRQRTQGYAGVLSSPPPDQEALSMNFVDLNEHNR FT KAAAVSNTSTSTQSKEEIVVEMASDANEPQSSAGPASSDAASAEQHTENVD FT QPNDVELNKALTGASWAQLVNAESPNDVQMQLSDRPNGQSRDSSLLDEARG FT GSTKIVPTQVRTDISPTSVFKKPQDDNCGGSDYSMEISDNEGNAYQCSSEK FT EFTTVRKGRDRSKKQRVGK" FT CDS 1417..4617 FT /product="L1_Ele4_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MANRTSSYNIASINTNAISSNTKIQALRTFIRLTDLD FT IILLQEVEDPYLTIPGFNVITNVDSQKRGTAIALKSHMTCSNIQRSLDSRV FT ITVTLEGSVKIVNIYAPSGTTNYASRENFFQNIVPFYLQSTTEHLVLGGDF FT NCVVSHKDATGTSNFSPALGNVLNSLHLKDTWDLLKPNQIQYSYIRPNCAS FT RLDRLYISESLIEQLRTAEFYVTSFSDHKSYKIRCCLPNQGRNFGKGFWSI FT RAHVLTEDNMQEFEIKWNRWLRERRNFSSWMSWWVNCAKPKIRSFFRWKTN FT ESFRTFHAENEVLYQRLRTAYDNLYQNPTGMTEINKIKGKMLLLQNQFSHS FT FERLNDKLISGEKLSTFQLADRIKRKKNSTITTITDNGRVLRNLDDIESHV FT FNYFNQLYATQNHQPSTNFPSNRTIDMNCEANNEAMNEITTEDIFFAIQSA FT AGRKSPGNDGIPKEFYVKAFDIIHRQLNLILNEALQGNLPLEFLEGVIVLC FT RKNSNHSSITAYRPITLLNCDYKLLARILKLRLEKIMINNRILNENQKCSN FT SGKNIFEAVLGIKDKIAEINCKRKVGKLISFDLDHAFDRVDQNYLLSIMRE FT IRFNNDLIAILEKIMQNSRSKILINGHLSQNVPIQRSIRQGDPLSMHLFVI FT YLHPLIEKLCTICDNRLELVVAYADDISVIIVDDHKIGVVRKAFENFGLCS FT GAVLNLAKTIALNIGSNNDRSNMYEWLNIHDSVKILGITYFNSLKRTIDHN FT WTEIIRSTSRLMWLYKTRLLSLHQKIILLNTFITSKLWFAASILSIPNAIL FT ARITSQMGYFIWGRYPTRIAMDQLSLPFDKGGLNLQLPMHKCKALLINRFL FT RCKQNLPFANSFEELMSNPPHIQGIPALYPCLKVIAKEVPYLPEHIIEHLS FT AIELHKYYKSKLSTPKVMIEQSTVSWSKIWSNIRNKSLSPDEKSCYYLLVN FT GKIPHATSLYRQNRLQSPMCQICRGANEDLEHKFSVCIRVRHLWNHILPKL FT EIITGRRHKFRDFSLPELKNIEASAKRKALKLFIVYVKFIIEENVNLTTEA FT LDFVLDYKLL" XX SQ Sequence 4683 BP; 1577 A; 893 C; 927 G; 1286 T; 0 other; cagtttgcct cgtgcgttcg ccgcgttcag acgcaaaatc actgcgctga tagggttcgt 60 tcgagttttt ttttgtatac acctagggtg tagaatatct tctgttttcg cgtcagtttt 120 tctggcgaca ctctgtgtga attgcattag taaatgcaca acgcaacatg gctcaagtga 180 aaccaagaga aaatacattc agggttgacc tgaccaactt cccgaagcgt ccttcattcg 240 aggatttgca cttgtttgtg cacaacgttc tgggcttgaa gctggaacag gtggtgagac 300 tgcagatgaa ccatgtgcag aactgtgtac atgtcaaatg tgcaaccctc aaaattgcac 360 aggacacggt cgatctacat aacggtaagc acgagatcac catcaacaaa accaaggtca 420 aggttagatt ggtaatggag gatgggggcg tagaggttaa gattcacgac ctctctgaaa 480 acatccgaaa cgacgacatt gtggcgtacc tcaaacaata tggcgaggtc attaccctcc 540 aagaacagat gtggggagac atctacacat tcaaaagcag accatcggga gtccgagtag 600 ccaaaatgat cttacgcaga cacatcaagt cgttcgtaac gatacagggc gaacagacac 660 ttgttaccta cagagggcaa ccacagacgt gcaagcattg tatgaatgcc atacacactg 720 gcatttcatg cgttgaaaac aaaaagctaa ttggccaaaa aaccgatctc aatgcaaggc 780 tgaattcaga tcgtcagaga actcaagggt atgcgggagt actcagttct ccaccgcctg 840 accaagaagc gctctccatg aacttcgttg atttgaacga acacaaccgc aaagctgcgg 900 cagtttctaa tacatcgacg agtactcaat ccaaggaaga aatcgtcgtg gagatggcga 960 gtgatgcgaa tgaaccccaa tcatcagcgg gacctgcctc ctcagatgct gcgagtgctg 1020 agcaacatac tgaaaatgtg gatcaaccta acgatgtgga actcaacaag gctctcaccg 1080 gagcgagctg ggctcaactt gtcaatgcgg aatcaccgaa tgatgtacaa atgcagcttt 1140 ccgatcgccc taatggtcaa agcagagatt cgtctctact tgatgaagcc agagggggat 1200 cgacaaagat cgtacctaca caggtaagaa ctgacatcag tccgacgagc gttttcaaaa 1260 aaccgcagga tgataactgt ggtggcagtg actactccat ggaaatatcc gacaacgaag 1320 gtaatgcata tcaatgttca tctgaaaaag agttcacgac tgttaggaaa ggccgagatc 1380 gatctaagaa gcaaagagta ggtaaataga tttgcgatgg ccaatcgaac atctagttat 1440 aacatagcca gcatcaacac taacgcaatt tcaagcaaca ctaaaattca ggccttacgt 1500 acatttatca gattaacaga tttagacata atcttactac aagaagttga agatccatac 1560 cttacaatac cgggtttcaa tgtgataacc aacgttgata gccaaaaacg aggaaccgcg 1620 atagctttga aatcccacat gacatgttca aatattcaac gtagtcttga cagtcgagtg 1680 ataaccgtta ctttggaagg tagtgtaaag attgttaata tctatgctcc atcgggaacc 1740 acaaattatg ctagcagaga gaactttttc caaaatattg taccatttta cctacagtcg 1800 accacggaac acttagtgtt aggtggcgat ttcaattgtg ttgtatcaca taaggatgca 1860 accggaacta gcaacttcag tccagctttg gggaatgtgc taaacagttt gcacctaaaa 1920 gatacgtggg atctgctcaa accgaaccaa attcagtaca gttacattcg accgaactgt 1980 gcatctcgtc ttgatcgatt atatatttcg gaatctttga tagaacagct tcgaacagcg 2040 gaattttatg taacatcgtt ctctgatcat aaatcttata aaattagatg ctgcttgccg 2100 aatcagggta gaaattttgg aaagggtttt tggtcaatac gtgctcatgt tttgacagaa 2160 gataacatgc aggaattcga aatcaaatgg aatcgctggc ttcgcgaacg tagaaatttt 2220 agcagctgga tgtcttggtg ggtgaattgt gccaaaccaa aaattagaag tttttttcgg 2280 tggaaaacaa acgaatcatt tcgaacattt catgcagaaa acgaagttct gtatcaacga 2340 ctaagaacag catatgataa cctataccaa aatccaacgg gaatgacaga aattaacaaa 2400 attaagggga aaatgctctt gttgcaaaat caattttcgc attctttcga aagactgaat 2460 gataaactca tctcgggaga aaaactttca acttttcagt tagctgaccg tattaaacgg 2520 aagaagaata gtacaattac tacaattact gacaatggta gagttttgcg caatttagac 2580 gacattgaaa gtcacgtttt caattacttc aaccagcttt acgccactca gaatcatcaa 2640 ccaagtacga actttcccag caatcgaaca atagatatga attgtgaagc taataatgaa 2700 gcaatgaatg aaattacaac tgaagacata ttttttgcca ttcaatcagc ggctggcaga 2760 aaatcgcctg ggaatgatgg cattccaaaa gaattttacg ttaaagcttt tgacattatt 2820 catagacaat tgaatttaat tttaaatgaa gcgctccaag gaaacttgcc tttagagttt 2880 ttggaaggag taattgttct atgcagaaaa aactccaatc attcatcaat aacagcatat 2940 agacccatca ctttgttgaa ttgtgactac aaattattag ctcgaatttt aaagttacgt 3000 ttggaaaaaa taatgataaa taaccgaatc ctaaatgaaa atcaaaaatg ctccaactca 3060 ggtaagaata tttttgaggc agttttagga attaaagata aaatagctga aattaactgt 3120 aaacgcaagg taggaaaact aatttcattc gacttagatc acgcttttga tcgggtcgat 3180 caaaattact tactcagtat tatgagagaa attcggttca acaatgattt gatcgcaata 3240 ctggagaaaa ttatgcaaaa ttcacgatct aaaattctta taaatggaca tctatcccaa 3300 aatgttccca tccaacgatc catacgccag ggcgaccctc tgagcatgca tttatttgta 3360 atatatctgc atccgttaat agaaaagtta tgtacaatct gtgataaccg gttggaactg 3420 gttgttgctt acgcagacga catttccgtt attatagttg atgatcataa aattggagta 3480 gtaagaaaag cttttgaaaa ttttgggctc tgttcaggag cagttttaaa tttagcaaaa 3540 acgatagctc tcaacattgg atcgaacaat gatcggtcca atatgtatga atggttgaat 3600 atacacgatt ctgtaaaaat cttagggata acttatttca attcactcaa gaggacaatt 3660 gaccacaact ggactgaaat tattcgaagt acatcacgac taatgtggct gtacaaaact 3720 agacttcttt ctttgcacca aaagattatt ctcttaaaca catttataac atctaaactg 3780 tggtttgcag catcaatttt aagtattcca aatgccatat tagctaggat aacatcgcaa 3840 atgggatact ttatttgggg caggtatcct acaaggatag cgatggatca gctgtcactt 3900 ccctttgata aagggggctt aaatcttcaa ttgccgatgc acaaatgcaa ggccctttta 3960 ataaatcgat ttttgagatg caagcaaaat ttacctttcg caaattcatt tgaagagctt 4020 atgagcaacc caccccatat tcaaggaatt ccggctctat acccttgtct caaagtaatt 4080 gctaaagaag tgccttatct tccggagcac attattgaac atctatcagc aattgagcta 4140 cataaatatt ataaatctaa gttaagtaca ccgaaggtaa tgatagaaca atccactgta 4200 tcctggagca agatttggag taacataagg aataaatctt tatcaccaga tgagaaatca 4260 tgctattact tgctagtaaa tggtaaaata cctcacgcta cttcgcttta tcggcaaaat 4320 aggttacaga gtccgatgtg tcaaatctgt cgaggtgcaa atgaagatct cgagcataaa 4380 ttttcagttt gcattcgtgt tagacatctg tggaatcata ttcttccaaa attagaaata 4440 attacaggca ggaggcataa attcagagat ttttccttac cagaactaaa aaatattgaa 4500 gcaagtgcaa agcgcaaagc tcttaagtta ttcatagtat atgtgaagtt tattatagaa 4560 gaaaatgtaa atcttactac agaagcacta gattttgttt tagattataa gttgttgtaa 4620 ttatgtaata tgtgtgtatg tatttgtaaa taatgttctt aataaatgtg tttataaaaa 4680 aaa 4683 // ID CR1-44_HM repbase; DNA; INV; 3841 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-44_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3841 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1872-1872 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 144..848 FT /product="CR1-44_HM_1p" FT /translation="MVNVRKGEFDTFKEETNKKILALEDLVKKLLSKNLDL FT EKQLKFLEDNQAQIISKNNQINSKPLFSSLFHNKSLEPEETRILTAISAEN FT KEKSRREYNLILHGLDELNKSTEENIQHENEQIKKIFNYLKVDESFIKRHH FT RLKSKTDKPGIVIVELNNKEKVADILKSASNLKYNVEYKNKVYINQDLTFA FT QRESRKLLLKERDRLNNADNNQPFRYIVKNESIVKVKKSEKTKN*" FT CDS 763..3759 FT /product="CR1-44_HM_2p" FT /translation="MLIITSPFGILSKMSQLSKSRKVKKLKINSLSSVNKN FT KSDIIFPFNKKSNYLKFWYTNSTSLVNKITEFETKIMIDSPDIILISETWF FT KNDLIINISNYNCFYNNRTNKQGGGVAIFTKYDIIAYEVSVSPLYKEVEQI FT WCSIVLHKTKILVGCIYRPPNSTDLINNAINELIKTAKKEVMNGKYSNVII FT VGDFNYPNIIWNEDGTIELKQNCQTSRAFINNLQNNYLHQVVNKPTFQMAV FT DKPANILDLIICDDLDLIINIEYDAPLGQITKSHYILKWEINILNSYEKKF FT NNKRFNFRNGKYKEMNLMFYKVNWVEIFDKMNTNQCYEYFLKIYEKACKQY FT IPRHFKPTSSIKNPCINKKIKNLVRKKRSLWCRVRATNFKNTNLNNEYKLI FT NKTIKKQIKKSTFDYEINLAKDVKHNPKRFYAYAQRKNQTHTKLLALKTSD FT NKIITDKKVIANLLNCSFQSVFVKENQDLPYFSNKTSHLFECDWDQELKED FT IIFNYLLELNENKSLGCDNISPYVLKHCAEGLSKPLSLIFKKSIHSGELPE FT LWKSANITPLFKKGDKKDPLNYRPISLTSIPSKIIEKIIRKKLIIYITENN FT LLTSQQHGFIQNKGCLTNLLETFDIITTEIENGYPVDILFLDFAKAFDKVP FT HYRLIHKISNYGIKGNILNWIDSFLKNRKQRVVMGEVVSDWCEVYSGVPQG FT SVLGPLLFLLYINDLPQCISSIAKLYADDTKIISVIKEHNDLTNMQNDIDL FT LTEWSNVWLMDFNINKCSVMHIGRKNINYNYTIKHDNQSFLLKTTTKELDL FT GVIVSSNMKWNHQVQVATSKAQRILCLIRKSFTYLNTEMVRQLYTSLVRPH FT LEFAAPVWSPSNKNNINDLEKIQRRATKLAPELKHLSYTERLTKLGLTTLE FT TRRIRGDLIAFFKLTTGIDKILWYKEPNKASSQRLRGHPMKFEREFAKTTP FT RHNFFTNRVIPHWNALPEKVVSAMTVNSFKARLDKHLLTAVKV*" XX SQ Sequence 3841 BP; 1622 A; 570 C; 548 G; 1101 T; 0 other; accgcgtaaa aaagatggcc gatgtttttg ataacaaaaa gtaaacaaaa agtaatcaaa 60 atagaatatt aactacatac attaaataga gatagtaatc aagaaaaata aataaataat 120 aaatattgag tacgtaaatc aatatggtca acgttcgaaa aggagaattt gatacgttca 180 aagaagagac taacaaaaag attctagcac tggaagatct ggtaaaaaaa ttactatcta 240 aaaatcttga cttagaaaag caattgaagt tcttagaaga taatcaagct caaattatta 300 gcaaaaataa tcaaattaac agcaaaccac tttttagtag cttgtttcat aacaaatctc 360 ttgaaccaga agaaacaaga atactaaccg cgatttcagc tgaaaataaa gaaaaatcta 420 gaagagaata caatttgatt ctacacggtc ttgatgaatt aaataaatca actgaagaaa 480 atatacaaca tgaaaacgag caaataaaaa agatctttaa ttacctgaag gtcgatgaat 540 catttattaa acgacaccac cgcctaaaat caaaaaccga caaacctggt attgtaatag 600 tagaattaaa taataaagaa aaagttgcag acattttaaa gtcagcaagc aatttgaaat 660 ataatgtcga atataaaaat aaagtatata taaatcagga tttaacattt gcacaacgtg 720 aaagcagaaa attgcttcta aaagaaagag atcgtttaaa taatgctgat aataaccagc 780 cctttcggta tattgtcaaa aatgagtcaa ttgtcaaagt caagaaaagt gaaaaaacta 840 aaaattaatt cattatctag tgtaaataaa aacaaatctg acattatttt tccattcaat 900 aaaaagtcta actatcttaa gttttggtat acaaattcta cgtctttagt aaataaaata 960 acagagtttg aaacaaaaat tatgatcgac tcaccagata tcatacttat ttctgaaaca 1020 tggttcaaaa atgacttaat tataaatatt agcaactaca actgctttta caataataga 1080 acaaacaaac aaggtggagg tgttgcaatt tttacaaaat atgatataat tgcttatgaa 1140 gtatcagttt cacctttata taaagaagtt gaacaaatat ggtgtagcat tgtacttcat 1200 aaaacaaaaa tattagttgg atgtatatac agaccaccaa acagtactga cctaataaat 1260 aatgcaatta atgaattaat aaaaactgca aaaaaggaag taatgaatgg aaaatactca 1320 aatgtaataa ttgttggtga ttttaattac ccaaatatta tttggaatga agacggaact 1380 atcgagctaa agcaaaactg ccaaacaagc agagcattta ttaataactt gcaaaacaat 1440 tatttacatc aagtagttaa caagccaact tttcaaatgg ctgttgataa accagcaaat 1500 atattagatt taataatatg cgacgatcta gatttaataa ttaacattga atatgatgct 1560 ccacttggtc aaattactaa aagccattac atattaaaat gggaaattaa catactcaat 1620 tcatatgaga aaaaattcaa caataagaga tttaattttc gaaacggaaa gtacaaagaa 1680 atgaatctca tgttttataa agtaaattgg gttgaaatct ttgacaaaat gaatacaaat 1740 caatgttatg aatacttttt aaaaatttat gaaaaagctt gcaaacaata tattcctcgc 1800 cacttcaaac ccactagttc tatcaaaaat ccatgcatta ataaaaaaat caagaatcta 1860 gttagaaaaa aacgtagttt atggtgcaga gttcgtgcta caaacttcaa gaacacaaac 1920 ttaaataatg aatacaaact tatcaacaaa acaattaaaa aacaaattaa aaaatctact 1980 tttgattatg aaataaactt agcaaaagat gtaaaacata atccaaaacg cttctatgca 2040 tatgcacaaa gaaaaaacca gacacataca aaattgctag cacttaaaac tagtgacaac 2100 aaaataataa ctgataaaaa agtaattgct aatctactaa attgtagttt ccaatcagta 2160 tttgtcaaag aaaatcaaga tttaccatat ttcagcaata aaactagtca tttatttgaa 2220 tgtgactggg accaagaact aaaagaagat attatattta attatttatt agaacttaat 2280 gaaaacaagt cacttggttg tgataacatc agtccctatg tgctaaagca ttgtgcagaa 2340 ggattatcca aacctttgtc tttaatattc aagaaatcga tacattcagg agagctgcca 2400 gaattatgga aatctgcaaa cattacacca ctttttaaaa aaggagacaa gaaagatcca 2460 ctcaactaca gacctatctc cttaacatcg ataccaagta agataataga gaaaataatt 2520 agaaaaaaac tgataattta tataacagaa aacaatctgt taacgtctca acaacatgga 2580 tttatacaaa ataaagggtg cctaacaaac ctattagaaa ctttcgatat tattacaact 2640 gaaattgaaa atggttatcc agttgatata ttgttcctag attttgcaaa ggcgttcgat 2700 aaggtgcccc actatcgact tatacataaa atttctaact atggaattaa aggcaatatt 2760 ttaaattgga ttgattcttt tttaaaaaac agaaaacaaa gagtggtaat gggtgaagtt 2820 gtatcagatt ggtgcgaggt ttatagtggt gtaccacagg gctctgtatt aggtcctctt 2880 ctttttctgt tatatatcaa tgatcttccc caatgtatct cttcaatcgc aaaattatac 2940 gctgatgata ctaaaatcat atctgttata aaagaacata atgatctaac taatatgcaa 3000 aatgacattg atttgttaac tgaatggtca aatgtatggt taatggactt taacataaat 3060 aaatgcagtg ttatgcacat tggaagaaaa aatattaact ataactacac tataaagcac 3120 gacaatcaaa gttttcttct aaaaacaaca acaaaagaac tagacttggg tgtaatagtt 3180 tcatccaata tgaaatggaa tcatcaggtt caagtagcta caagtaaagc acaaagaatc 3240 ttatgtttaa taagaaaaag ctttacttat cttaacactg aaatggtaag gcaactatac 3300 acatcgttgg ttagaccaca cttggaattc gcagctccag tgtggagtcc aagtaataaa 3360 aacaatatca atgacctaga aaaaattcaa aggcgagcaa caaagctagc ccctgaacta 3420 aaacatctaa gttataccga aagattgaca aagttgggcc tcacaactct ggaaacaaga 3480 cgtataagag gagatctcat tgctttcttt aaattaacaa ctgggattga taagatatta 3540 tggtataaag agcctaataa agcttcttcc cagagacttc gaggtcatcc aatgaaattt 3600 gaaagagaat ttgcaaaaac aacaccgaga cataattttt ttacgaacag agtgatacct 3660 cattggaatg ctttacctga aaaagtggtt tctgcaatga cagtaaattc ttttaaagct 3720 agactagata aacatttgtt aacggctgtt aaagtataaa ttttaaactt tatactccgc 3780 gacaatcgtc gttacagcaa tttgtattac tattactatt actattatta ctttaacaag 3840 t 3841 // ID BEL-630_AA-I repbase; DNA; INV; 7475 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-630_AA_; KW BEL-630_AA-LTR; Pao_Bel_Ele231; BEL-630_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7475 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [6536-7093] - Integrase core CC 'ATATC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1359..4217 FT /product="BEL-630_AA-I_4p" FT /translation="MEKNPPTARSLSRRSSSSTARARILLDLKRLEDERIL FT QEKAEEKVSREKEYLAKKYDLMQAQLEEEEEGSLRSRRSMSSIQLVEQWLQ FT NEPVANSGSGIGCSGIPAGTELRTGTVHKQKPSATTTFAAASTPKSGQQAE FT LTFGLNPSLPLAQGATSEANCGQPAEPEFHAQDPLDWSLTWDVPVAHQPLP FT KGQFAVDLQEIQRKFLGVQLCPPKPYTQQVGPSSSSQPASGNHPVASVHGT FT SSSQQENIATIPATTGGPQPCQVILDPTVSQHDQSSGLEALTTKQYAPAAV FT TTVSQLCSDYIIPQNRTNALHPNATFKVPKLSSLIEHTPFGVTEPYKPPQP FT LRSDTSTFHPMYKSTPYVRPVTAPSVHFADHPVASSSGLPILPNVRTVPSN FT PTPVVSSIAVNSGHTIPIPPSDSTSALVPATEPWISAPTSQQLAARHIVPK FT ELPIFSGDPVEWPLFASCYQNTTQLCGFSNGENLMRLQRSLKGNALEAVRS FT LLLEPSSVPLIVTTLQTLYGRPDLIINSLLHKVRATPSPKPDKLETLVSFG FT LACQNLCGHLRAAGQQAHFSNPALLQELVNKLPANIKLDWALFKQKCATVD FT LGTFGDYMAQLVVAASDVASFQPSEETRFQSDKRKGKERLFVNAHSSGTPP FT ENVGRRQQAGTYESKQNQPKPCPICEKPGHKVRECDAFKKCSLEDRWKRVQ FT ELYLCRRCLIAHGKFPCKASLCGMDGCEERHHKLLHPGKPEPAKPSKPTKP FT TETLNIHNQRKMTTLFRIIPVTLFGNGRSVHTFAFLDEGSSSTLIEARVAR FT ELGVKGEVYPLCLQWTSDIERTEDKSQLIRLEISGRGTSKRHVLKAAHTVE FT KLCLPNQTLPFDQLSEQFPYLRGLPIEGYSNATPTVLIGVDNTHLKIPLKI FT REGKIGQPAATKTRLGWTIYGEIPGETSSIERFHYHLFEERRSPDDEPW" FT CDS 5257..6201 FT /product="BEL-630_AA-I_1p" FT /translation="MRKSLQICSVNGVGGSNCWDNWMQFAFRGVIFRDTIR FT KASNLLSCTYSSMLARRRTWRLRSSELWTDLKRAVLLVSSKTKVAPLKPLS FT VPRLESQAALLGVRLAKSVAENHTLVIVRRFFWSDSTTVLSWLQSDQRKYR FT QFVACRVSEILDTTKVDEWHYVPSRLNVADDATKWKDGLQINSNHRWFRGP FT SFLYDPPRTWPKPKELPGATAEELRPVHVHQEKAEQQLVQFCRFSKWERCL FT RAVAFVHRFIDQLKRKRRDEITKDPRILTREELQRAENTILRLAQFEVFAD FT EIITMAKKSVTATGSTSAFGENK" FT CDS 6251..7405 FT /product="BEL-630_AA-I_3p" FT /translation="MDSRISSFSEALYDFKYPVVLPKSHYVTQLIVDSYHR FT RYNHCNGETAVNEMRQRFHLSEMRAAFRKSRKICSWCRVYKATPSVPRTAP FT LPEARVTPYVKPFSFVGLDYFGPLLIIQGRHEVKRWVALFTCLTIRAIHLE FT LVSSLSTECCKMAIRRFIARRGAPSEIYSDRGTNFVGVSGELREQIRGINS FT DLASTFTNTITQWRFNPPASPHMGGAWERMVRSVKCALAALSAERKPNEET FT LVTLLVEAESVVNSRPLTYMPLETAEHEALTPNCFLLLSTSGANQPPSQLS FT DGKLALRSSWALYQRLLDQFWVRWVKEYLPTITKRTKWFVESKPVSAGDLV FT MIVEDRLRNGWIRGRVLVCSLDEMGDVVAQMSKLQPAFYAVQ" XX SQ Sequence 7475 BP; 1978 A; 1851 C; 1794 G; 1851 T; 1 other; aaatctttta gattttcatc tgtgaaagca acaattagct gttatcaacg caatcttcgc 60 cacgcgggtg cagcagttcg caatggcggc ggcctcattt acttgtgtat gcggtcggcc 120 aggtgaaccc gacgaagaga tgctgcagtg cagtgattgt ttacgaccaa ttcatctcgc 180 atgtgcttca tctagaccag tgtttcccaa acttttttga ctttcgcccc cttgagacca 240 atattttttg gtatcgcccc cctgatatga aatctaaaca tttgtactaa gtgttacact 300 tacgttgaag tttggatcaa tatctcatca ttagtatacc aaacattggt caaatttaaa 360 ggatgtagca gtcaatcata atgcttcaat aaaaataccg caaaagagtg tctactactt 420 gtatgtccat aaaggagtca tgtaatatct atcatatgca tcaattgcct ttgtatcaac 480 aaatctgatg tgcgatcatc aaaaaaaagt cagctcactt tctgcttttt tgtcttcttt 540 ttttaatgtt tcttaagttg ttcaaataaa cttataaaat ttcgggtaat acattattgg 600 ctgttctgga aacaattgaa tctatatgca aaatataaga aaatatcagt aaacccagac 660 ttttttcaac tcataattca ctgaaatata acgttcaatt tagatatttt aaagctttga 720 tccatgaaat tttatgtcgc tcaggcctat ttttgagtag ggtgatttat gtttgtggta 780 tttaatgatt ttatcattga aacccgattt ttgtcgacca aagtcacatg gatttgttta 840 taatatgatt tgtactataa atactattta gtgtgcgtat taacaaaata tacaaatgag 900 tactttaata aatgtgtgaa acataataaa aacttaaggt gccttattct gaagattccc 960 taataaaaag tcaacaaact ttaccttaga cttggaaata aatattacac atcgccaaaa 1020 gtttctgctt tcaaaaagca ttcaatatta tttttagcaa gaactaatta gatccgaacg 1080 agaaccgata aattagttag gtgtcaacta aattcttctt cagatgtata tgctatcatt 1140 aaaatatgag tgcctcttcc aaactttgaa tctatttttg attttcagtt gcttccactt 1200 tcgccccctt tctagcattt tcgcccccca aattcagttt tataaatttt cgcccccttg 1260 gacctgaaaa tcgcccccag gggggcgaat tcgcccactt tgggaatcac tgatctagac 1320 gttccattgg tggaagctgc gttgtgtgcg ttcggtgtat ggagaagaat cccccaacgg 1380 cacggtcgct ttcgaggaga tcgtcaagtt cgactgctcg tgcgagaatt ctcctggatc 1440 tgaagcggct ggaagacgag aggatcctgc aagaaaaggc cgaggagaag gtttcgcgwg 1500 agaaggagta tctggcgaaa aagtacgacc tgatgcaggc ccagctagag gaggaggaag 1560 aaggcagctt gagaagccgt cgctcgatgt ccagtatcca attagtggaa cagtggcttc 1620 agaacgaacc cgtcgcaaat tcaggctctg gcatcggatg cagtggcatt cccgctggca 1680 cggaattacg caccggtacc gttcacaagc agaaaccatc ggctacaacg acattcgctg 1740 cggcttcaac gccgaaatcc ggacaacaag cagagctcac cttcgggctc aatccaagcc 1800 tgccactggc ccaaggcgcg acttctgaag caaattgtgg gcaaccagct gagccggagt 1860 tccacgcgca ggatccattg gactggagcc taacttggga cgtacctgtc gcacatcaac 1920 cgctaccgaa agggcaattc gctgtggact tgcaggagat tcagcggaaa ttccttggcg 1980 tacaactatg tccgccaaag ccgtatacac agcaagtggg tccatcgtcg tcaagccaac 2040 ctgcgagtgg taaccacccg gttgccagtg tccatggaac aagttcgtcg cagcaggaga 2100 atatcgctac aataccagcg acgacaggtg gtccacaacc gtgccaggta atcttggatc 2160 caactgtctc tcagcacgac cagtcgtccg gcctcgaagc acttaccact aagcagtatg 2220 caccagcagc agttaccacg gtgagtcaat tatgctctga ttatattata ccacaaaata 2280 gaacaaatgc attgcatccg aatgcaacgt tcaaagtacc gaagctttct tcgttaattg 2340 aacacactcc cttcggcgta accgaaccat acaagccccc ccagcctctt cggagcgata 2400 catcaacgtt tcacccaatg tacaaaagca ccccctatgt ccgccccgtt actgccccct 2460 ccgtgcactt cgctgatcat ccagttgctt cctcttccgg tttgccaata ttgcccaatg 2520 tacgaacagt tccgagtaat ccgaccccag tagtgtcttc aatcgccgtc aatagtggtc 2580 atacgattcc gattccgcct tccgactcaa catctgccct cgtaccagca actgaaccgt 2640 ggattagtgc tccgacatct caacaactcg ctgccaggca catcgttccc aaggaacttc 2700 caatcttttc gggcgatcct gtcgagtggc cgttgttcgc aagttgctac cagaacacga 2760 cgcagttgtg cggtttctcc aatggggaaa atttgatgcg tttgcagcgg agtcttaaag 2820 gtaacgcact cgaagcagta aggagcttgc ttctggagcc atcgtcggtc ccattgatag 2880 ttacaactct gcagactctt tatggacggc cggatttaat tatcaactca cttctgcaca 2940 aggttcgcgc aactccgtct ccgaagccgg acaagttgga aacgttggta tcctttggcc 3000 ttgcatgcca aaacctttgt ggccatcttc gtgcagcggg tcagcaagct catttctcca 3060 atccagcttt gcttcaggag ctagtgaaca aactgccagc aaatattaag cttgattggg 3120 cgctcttcaa gcaaaagtgt gctacagtcg accttggaac cttcggtgat tacatggcac 3180 aactagtggt agctgcgagc gatgttgctt cctttcaacc ttccgaagaa actcgcttcc 3240 aatcggataa acgaaagggt aaggagagac tctttgtcaa cgcacattcg tctggcaccc 3300 cgccggaaaa cgttggaaga cgacaacaag ctggtaccta cgagagcaaa cagaatcagc 3360 caaagccgtg tcccatttgc gaaaaaccgg gccacaaggt acgagagtgc gacgctttca 3420 agaagtgcag cttagaagat cgttggaagc gcgttcaaga gctttacctt tgtcggaggt 3480 gcttgattgc gcatggaaag ttcccttgca aggcgtcgtt atgcggtatg gacggttgtg 3540 aagaacgaca ccataagctg ttgcatccag ggaagccaga gccagcgaag ccttcgaagc 3600 caaccaagcc aacagaaacg ctcaatatcc acaatcaacg caagatgaca acgcttttcc 3660 gtataatacc ggtaacgcta ttcggaaacg gaaggtctgt ccacaccttt gcgtttctgg 3720 acgaagggtc ttcttccacg ctgattgaag ctcgggttgc ccgcgagttg ggagtcaaag 3780 gagaggttta tccgctttgc ctgcaatgga caagtgacat agaacggact gaagacaagt 3840 cacagttgat tcgacttgaa atttctggtc gaggaacgtc caaacgacat gtgttgaagg 3900 cagcccacac ggtagagaag ttatgcctcc caaatcaaac tctacccttt gatcagctgt 3960 cggaacagtt tccgtacctc cgtggcctgc ccattgaagg atacagcaac gccacgccaa 4020 ccgttctcat tggtgtggac aacacgcatc tgaaaattcc tctaaagatc cgggaaggaa 4080 aaatcggtca accagcggca acgaaaacta gacttgggtg gactatatac ggcgaaatcc 4140 ctggggaaac ttcttcaata gaacgttttc actaccatct gtttgaggaa cggcgtagtc 4200 ccgacgacga gccctggtga aggagttttt ttcggtggaa aatgttggag tttctgtcgt 4260 tcctctcctg gaaagttccg atgaggtgcg atcaagaaaa atcctcgaag aaacgactgt 4320 tcgcctacct tctggacgat tccagaccgg actgctctgg aaatatgatc acatcgattt 4380 tccagacagc aaacccatgg cagaaaatcg tctgaggtca ttagaacgac gtttgttgcg 4440 caaacctgag cttttcgaga atctcaaaca gcagatactc gaatacgagg aaaagggcta 4500 tgctcataag atcacacaag atgagatcct caactctgat cccaagaggg tatggtactt 4560 gccattgggg atagttgtgc aacccaagaa acccgggaag gtgcggattg tatgggacgc 4620 cgcagcggca gtacggggcc agtcacttaa ctctgctctg ttacccggac cggatcttct 4680 gacctccctt ccgtcggtcc tctcccgata tcgacaacgc caagttgcga tcagtggtga 4740 tatacgggaa atgtttcacc agctccagat caggcccgaa gataagcagg ctcaacgttt 4800 cctttttaga agcgatccca ccaaggagcc cgatacgtac gtcatggacg ttgctacatt 4860 tggtgcaact tgctcaccct cctcagcaca gttcatcaaa aatcagaacg ccagagattt 4920 cgaggccgag taccccgaag cagctgccgc aatcgtccgc aatcactatg tggacgacta 4980 tttggatagt ctcgacacca tcgaggaggc aatagatctt gcgctgcaag tcaaaacggt 5040 gcacgccaag gccgggttcc atatacggaa ctggatgtcc agttccaagg aagtcctctc 5100 acgggtcggg gattccgcag agaacaacag aaaagcttca aaacccatct gcttcgacct 5160 cggaacgcgt tcttgggatg agctggtttc ccagttcgga cgaattcgtc ttcaacggat 5220 attttcgcga cgaaccgacg ccactttttc acggagatga gaaaatcact tcagatctgt 5280 tcagtgaatg gcgtcggtgg atcaaattgt tgggacaact ggatgcaatt cgcattccga 5340 ggtgttattt tccgggatac catccggaaa gcttcgaatc tcttgagctg cacatattcg 5400 tcgatgctag cgaggaggcg tacgtggcgt ctgcgttctt ccgaattgtg gacggatctc 5460 aagcgcgctg tgctgctggt ttcatcgaag acgaaggtgg ccccactcaa acctctttca 5520 gttcctcgtc ttgagtcgca ggcggcacta ctgggagtca gattagcaaa gtcggtggcg 5580 gaaaaccaca ctctagttat cgtccgtcga tttttctgga gtgattcaac cactgttctc 5640 tcctggctgc agtccgacca acggaaatac cggcagtttg tggcgtgtcg agtgtctgaa 5700 atcctggaca caacgaaggt ggacgaatgg cactacgtac cctcgcgatt gaacgttgca 5760 gatgatgcga cgaagtggaa agatgggctc cagatcaaca gcaaccaccg ttggtttcgt 5820 ggaccgagct tcttatacga ccctccacgc acgtggccca aaccgaagga gcttcctggt 5880 gcaacggcag aagagcttcg gcccgtccat gttcatcagg aaaaagcgga acagcagtta 5940 gtacagttct gtcgtttctc caaatgggaa cgttgtctgc gagctgtcgc ctttgttcat 6000 cggttcatcg accagcttaa acgtaaaagg cgtgacgaaa tcaccaagga tccacgcatt 6060 cttactcgag aggagctcca gcgagcggaa aacacgatat tgcggcttgc acagttcgaa 6120 gtttttgcgg acgagatcat cacgatggca aaaaaatcag ttactgccac cggatcaacg 6180 tcagcgtttg gagaaaacaa gtaaactcta caagctaact ccttttctgg acgaacgagg 6240 cgtagttcga atggacagca ggatctccag tttctctgaa gcgctctatg attttaaata 6300 cccagttgtg ctacccaaaa gccactacgt cactcaactg atcgtcgaca gctaccatcg 6360 gcggtacaac cattgcaatg gagagacagc ggtgaacgaa atgcgacaga gattccattt 6420 gtcggagatg agagcagctt ttcggaagtc tcggaagatt tgttcttggt gccgggttta 6480 caaggcaact ccatccgttc caagaacggc accgctaccg gaagcacggg tgactcctta 6540 tgtcaagccg ttcagtttcg tagggttgga ctacttcggt ccccttctga tcatccaggg 6600 tcgacacgaa gtgaagcgct gggttgccct cttcacttgt ttaaccatca gggcaatcca 6660 tctggagcta gtgtccagcc tatctacgga atgctgcaag atggctatac gacgcttcat 6720 agcacggagg ggggcacctt ccgagatcta tagcgataga gggacaaact ttgtcggagt 6780 cagcggcgaa ttgcgagagc agataagggg catcaacagt gatctggctt ccacgttcac 6840 aaataccatc acccaatggc gcttcaaccc accggcttcg ccacatatgg ggggggcttg 6900 ggaacgtatg gttcgttcgg tgaagtgtgc tttggctgcg ttgtctgccg aacgtaagcc 6960 aaacgaagaa actttggtga ctcttcttgt tgaagcagag tctgtggtaa attcgaggcc 7020 gttgacctac atgccgctag agacggcaga gcacgaggcg cttacgccaa actgttttct 7080 gctgctgagc actagtgggg cgaatcaacc accaagtcag ctttcggatg gtaagttggc 7140 cttgcgatcc agctgggcgt tgtaccaacg tctactggac caattctggg ttcgctgggt 7200 gaaagaatac ctgcccacca tcactaagcg aaccaaatgg tttgttgaga gcaaaccagt 7260 ctcggccggt gacctggtta tgattgtaga agaccgactg cggaatggat ggatcagggg 7320 acgagttctc gtgtgttccc tggacgagat gggagatgtc gtagcgcaaa tgtccaaact 7380 gcaaccggcg ttctacgccg tccagtagcc aagttagccg tccttgaagt ggcggatact 7440 gctcgtgagg acacggacca atacgggtcg gggga 7475 // ID Copia-138_AA-I repbase; DNA; INV; 4167 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-138_AA_; KW Copia-138_AA-LTR; Copia-138_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4167 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1558-2061] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 139..3594 FT /product="Copia-138_AA-I_1p" FT /translation="MASSAKKAGIVQFAGDGFDTWKFRVETQLSAHGVRDV FT LDEDAPAETATEAVKKAFKEKDEKAKALLVAFIADSHLEYVRDKATAKAMW FT KSLESTFAKKGFAAQTYIRRSLAMLRMEEGCALKDHFRDFDELVRQLKDAG FT ANLTELDQVSQLFISLPASYDVVTTAIENLEEKQLTLATVKSRLLAEEQKR FT NGRDTCSGLSNRSDGTVLAARTEKQKKGIREGKPKFAGKCYGCGAFGHKRS FT TCPKSRDQRVRAQAAEIPRDREVALIGSVPTVSSSAKSKIVWILDSGASRH FT MAKEPGVFYGMETLAEPVTIESAMDNGSMKGAQQGTIKGIVSTRSGQVRLA FT ISKVLFVPNLKFNLISLGALMNVGVDVEFMKDRAILTKDGEIIGEATKQGN FT LYYLTMDVEIVGTALTTDDGVSKQDLWHKRYGHLSHRNLLKLQKAEMVTGM FT GKLKTEEAFCETCAEAKLVRLPFNGSRPPTSRPLERVHTDVCGKITPATFD FT GYQYFVTFLDDYTHFAVVYLMKRKDQVFEYFKIYQAMATAKFGVRIAALRC FT DLGREYFSSSQLAYYEQHGIQVESTVGYTPQHNGVAERFNRTVSEKMRTLL FT LESGAPKYLWGEALYNAVYVANRSPTEALEVEKTPAEMWHGEKPNVEKLRV FT FGSLAFSWIPKQKRSKLDPTGKRCVFLGYAPCGYRLWDPKARKQFVARDIR FT CNEVEFPFKITPAAQMSGCLTEQSIPLVVPEFTESEGDGVEVPEIDPANEI FT EVAPDNITIADPQDDESEQEGEEAEDMVLEPSALPPQAVGKSSSQDGTTPR FT TSGRECKRPGWFRDFMSSYSANTAVNISTQIPMCYDDIDGHPEATRWYQAV FT NDELRALNHNQTWTLVKRPATVVPVPCKWVFAVKTDADGKPVRYKARLVAK FT GFMQRKHIDYEETFAPVAKLTTIRTILAVANTRDLEIHQMDVRTAFLNGKL FT TETVYMEVPEGVKSKPDEICLLERSLYGLKQSPRCWNNRFNNFLQDQGFVR FT SNHDYCLYVRQQKEEIIYIIVYVDDLLIVAKTRATVEEIKRILKSEFEMAD FT MGPLHHFLGIKVERDRARGVMKLSQTGLIDKMLQRFEMQDCKACKTPAEVR FT LQLTRANGDCQYPYRELIGSLMYIMMGTRPDLCYIVGYLSRFQD" XX SQ Sequence 4167 BP; 1206 A; 898 C; 1141 G; 921 T; 1 other; taactttagg ttatgggccc agtctcactt taacccctga agaccgaaga taggaaaatt 60 gttggaatct gaagccctct tggaaatcgg aatcgttatt tagaagcagc cgtttacttt 120 gtgaagaatc tgggaacgat ggcaagcagc gcgaagaaag ctggcattgt gcagttcgct 180 ggtgatgggt tcgatacttg gaagttccgg gtcgagaccc agctttcggc ccatggagta 240 cgggatgttc tcgatgaaga tgccccggca gaaacggcta ctgaagcggt caagaaggcg 300 ttcaaagaga aggacgaaaa agcaaaggcc ttattggtgg cattcatagc ggacagccac 360 ctggaatatg tacgggacaa ggccacggcc aaggccatgt ggaagtcttt agagtcgacg 420 tttgcgaaga agggatttgc ggcccaaaca tacatccgga ggtcgcttgc catgctgcgg 480 atggaagaag gatgtgccct gaaggaccac ttccgggact tcgacgaatt ggtgaggcag 540 ttgaaagatg ccggcgcaaa tctgacggag ttggaccagg tgagccaact gtttatctcg 600 ttgccagcct cgtacgacgt cgtcaccacg gcaattgaaa atttggaaga aaaacagctc 660 accctggcaa cagtgaaaag tcgtttgttg gctgaagaac agaagcgaaa cggtcgtgat 720 acatgcagcg gactatcaaa tcgatccgac ggaacggtac tggcagcaag gaccgagaag 780 caaaagaaag gcatccgtga aggtaaaccg aaatttgctg gaaagtgcta cggttgcgga 840 gcctttggcc acaagcggtc aacatgtccg aaatctagag accaacgagt acgagcgcaa 900 gctgcagaaa ttcccaggga tcgtgaagtc gcgttgattg gttccgtacc gactgtaagt 960 tcaagtgcga agagcaagat agtgtggatt cttgattcgg gtgccagccg gcacatggcc 1020 aaagaacctg gagtgttcta tggaatggag acacttgctg aacctgtcac gattgagagt 1080 gcgatggaca atggcagtat gaagggcgca cagcaaggaa ccatcaaagg tatcgtatcg 1140 acacgaagcg gccaggttcg tttggccata agcaaggttt tgttcgttcc aaatttgaaa 1200 ttcaacctga tctcgttggg agcgctgatg aacgttggag ttgatgtcga atttatgaag 1260 gatcgtgcca tcctgaccaa agacggggag atcatcggtg aggccaccaa gcaaggtaat 1320 ctctattatt tgaccatgga tgtggaaatc gttggcactg ccttgacaac cgatgatgga 1380 gtcagcaagc aggatttgtg gcacaagcgc tacggacatt taagccatcg aaatctattg 1440 aagctccaaa aagcggaaat ggtgactgga atgggtaagt tgaaaacgga ggaagccttc 1500 tgtgaaactt gcgccgaagc aaaactggtg cggctaccat tcaatggatc aagacctcct 1560 accagcaggc cgctggagag agttcacacg gacgtttgtg gaaaaattac accggccaca 1620 tttgatggtt accagtactt cgtgacgttc ctggacgatt acactcattt cgcagtggta 1680 taccttatga agcgaaagga ccaggtgttt gaatatttca agatctatca agcaatggca 1740 accgcaaaat tcggtgtccg gatagcagcc ttgagatgtg acctaggacg tgaatacttt 1800 agcagctccc agttggctta ttatgagcaa catggcatcc aggtggagtc aacggtaggt 1860 tatacaccac agcacaatgg ggtcgctgaa cgtttcaatc ggaccgtttc tgagaagatg 1920 cgaacgttgc tgcttgagtc tggggctcca aagtaccttt ggggtgaagc actgtataac 1980 gcagtatacg tcgcgaatcg aagcccgacg gaagcactgg aagttgagaa aacaccagct 2040 gagatgtggc atggcgaaaa accgaacgtt gagaagttgc gagtattcgg tagcttggcg 2100 ttttcgtgga taccaaagca aaagcgaagc aagctcgatc caaccggtaa gcgatgcgtg 2160 tttcttggat atgcaccatg cggctatcga ttgtgggatc cgaaggccag aaagcaattt 2220 gtggcacgtg atataagatg taacgaagtc gagtttccgt tcaagattac accagcggcg 2280 caaatgagtg ggtgtttaac agaacagtcg atcccacttg tagttccgga attcactgag 2340 tcagaggggg acggtgtgga ggtaccggaa atcgaccctg ccaacgaaat cgaagttgct 2400 ccggacaata ttacgattgc tgacccgcaa gatgacgagt cggagcaaga gggggaagaa 2460 gccgaagaca tggtgttaga accgtcggca ctcccaccac aagcggtagg taagagttcc 2520 tcccaggacg ggacaactcc gaggaccagt ggtcgggagt gcaagagacc aggttggttc 2580 cgagatttta tgtcttctta tagtgcaaat actgctgtaa atatttccac tcagattcct 2640 atgtgctacg acgacatcga tggccatcca gaagcaacac ggtggtatca ggctgtaaat 2700 gacgagcttc gtgcgttgaa ccacaaccaa acctggacgt tggtgaagcg accggcgact 2760 gtcgttccag ttccatgcaa gtgggtgttc gctgtgaaga ccgacgctga tggcaaacct 2820 gttcgctaca aggctcgctt ggtagccaaa ggatttatgc aaaggaagca catcgactac 2880 gaagaaacat tcgcacccgt cgccaagtta actacaatcc ggaccattct ggcggtggca 2940 aacactcggg acctcgaaat acatcagatg gatgttcgta cagcgttctt aaatggcaag 3000 ttaaccgaaa cggtatacat ggaggtacct gaaggcgtaa aatctaaacc ggacgagata 3060 tgcttgctag aacgttcact ctatggcttg aagcaatcgc caaggtgttg gaacaaccgt 3120 ttcaacaatt ttctccagga tcaaggcttc gttcggtcga atcatgatta ttgcctctac 3180 gtgcgtcaac agaaagaaga gataatttat atcatagtat atgtagatga cttgttgatc 3240 gtggcgaaga ccagagcgac ggtggaagaa atcaagcgaa tactcaagag cgaattcgag 3300 atggctgaca tgggtccact tcatcacttt ttagggatca aggtggagag agaccgcgca 3360 agaggcgtca tgaagctttc ccaaaccgga cttattgaca agatgctgca acgattcgag 3420 atgcaagatt gcaaagcgtg caaaacacct gctgaagtgc gactacagct gacacgagca 3480 aatggagatt gccaataccc gtacagagag ctgattggaa gcttgatgta cattatgatg 3540 gggacacgac cagacctgtg ctatatcgtt ggctatttat caagattcca agatgmagca 3600 ggacctgatc attggcaaca agcgaagcga gtacttcggt acttacaagg gacaaagcat 3660 catcaactgg tatacaagag aaatccacat aaatcaccga cagagggatt cgttgatgct 3720 gactatgcag gagatgaatc agatagaaag tgcgtctcag gttatctact gaaagtattt 3780 ggctgtaccg tggcttggtc ttcaaagaaa caaggaaccg ttgcgatgtc atcaacagag 3840 gccgaatacg tagcgatgag tagctgcatc agcgaggtaa tttggatgac cggtctaatg 3900 actgatctgt ctgaagccaa gaatctgcgg cctgttccgg tatttgaaga caaccagggt 3960 gccattgcca tggccaaaaa ggaggaaacc aaacgggtga aacacatcga cgttaaattc 4020 cacttcattc gggatgctgt ggccgacgga cgaattcaac tggtatacat ccaatccaaa 4080 catcaagagg cggatatttt aacaaaatcg ctagctgcac cactatttca agagctgcgg 4140 cagaagattg gactatacgt aatcaac 4167 // ID Mariner-1_BF repbase; DNA; INV; 1887 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-1_BF DNA transposon DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; TA TSDs; KW Tc1-1_BF; Mariner-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1887 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1887 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-1887 RA Kapitonov V. and Jurka J.; RT "A family of Mariner-1_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC This transposon belongs to the Tc1 group of the Mariner CC superfamily. XX FH Key Location/Qualifiers FT CDS 333..1448 FT /product="Mariner-1_BF_1p" FT /note="Mariner/Tc1 TPase." FT /translation="MLPLKTPASCRCGARNEHLWITVPPSRCRTVIHRCSF FT FGILCCCCCLTLNFTMPRLSYVDKLRAVGQVEAGAYQRDVAALFGVRQSTI FT SKLVTKFQATDDVKDMPHTGRPRATTPEQDRYLTRTTLRNRRLSSTRLQTR FT FSDRYRTRLSRQTIRNRLHAANLRARVAARKPALTEAQRQARLRWCRRHRG FT WRDREWRHVMFSDESRFSLRQLDGRIKVWRRRGERYSKACTDTVTAFGGGS FT VMVWGGITTRGKTRLVIIPGNLNAEGYRDIILNPVAISFLRDMGPHAMLQD FT DNAPPHSARIVRDHLQHTGVERMEWPSGSPDLNPIEHLWDQLGRAVHDRVT FT DRTTLADLRRLSGWWRSGTLSRSIALHDW" XX SQ Sequence 1887 BP; 489 A; 442 C; 486 G; 470 T; 0 other; cactcagcac gaaaagtttg gaatatcaac tttggttgag catatttccg ttgtttttgc 60 atcaatttca atatattata taccattgga atgcttgtgt aattttcttt tatttgatat 120 ccaactcatt gagaatgcaa attcatgaaa caagcaccgg acctactaat gtgagtaggt 180 cgcaaaaaaa ttccctggtt ggtgtccaac gcagtgttgt cctgttgaga tgggcgcggg 240 tgtcaatgat ccaatccttc gtttgttctt ggtataatgt tatttttagt acaagggaca 300 ttcgaggtta tttttagcat atcagacatc caatgttacc actgaaaaca cctgcatcct 360 gcaggtgtgg agccaggaat gaacacctgt ggataacggt gccgccgtca cggtgccgta 420 ccgtcatcca caggtgttca ttttttggca ttctttgttg ttgttgttgt cttacactaa 480 acttcaccat gccacgtcta tcctacgtgg ataagctccg tgctgtaggg caggttgaag 540 ctggcgccta tcaaagagat gttgctgcgc tatttggagt ccgccaaagc accatctcca 600 aactggttac aaagttccaa gcaaccgatg atgtaaaaga catgccgcac accggacgtc 660 ccagggccac aacacctgag caagatcggt atcttacccg tacaactctg aggaaccgca 720 ggctgtcatc cacacgtctc cagacaaggt tctcggacag gtacagaacc cgactatcgc 780 gtcaaaccat cagaaacagg ctgcacgcgg ccaacctgag agcgcgtgtg gctgccagga 840 agcctgcact gacagaagct cagcgccaag cccgcttacg gtggtgccgc agacatcgcg 900 gttggaggga ccgtgaatgg aggcatgtca tgttttctga tgaaagcagg ttctcccttc 960 ggcagttgga tggccggatc aaagtgtgga ggcggcgagg agaacgctac tccaaggctt 1020 gcacagatac agtgacagca tttggtggag gcagcgtcat ggtgtggggc ggtatcacca 1080 ccaggggaaa gacaaggctt gtcatcatcc cagggaacct caacgcagag ggatataggg 1140 acatcattct caatcctgtg gcaatctctt tcctgcgtga tatggggcca catgctatgc 1200 tgcaggatga caacgcccct ccacacagtg cgagaatcgt cagagatcat ctccagcata 1260 caggcgtaga gagaatggag tggccgtcag gcagccctga tctgaacccc atcgagcatc 1320 tctgggacca gctggggcgt gctgtccatg acagggtgac cgacagaaca acactggccg 1380 acttgcgacg actcagcggc tggtggagga gtgggacgct atcccgcagc atcgcattgc 1440 acgactggtg agaagcatga ggaggaggtg ccgggctgtg gtgacagcgc gcggcgggtc 1500 tacccgctac tgaataagac tcattcgtcg tcgacactga gactcagaca gattgtttta 1560 tttaagaatg aacttggcat tgacgaatgt gtattgcttg tttttgttca tttgtcatat 1620 taccctcgca cacaagtcaa taccaacaga acggtatgta ctgaagagag tgactacagt 1680 agtgtaattt gggccctttt tttggcggcc cgctcacgtt aatatgctta gagctcgttt 1740 catggattta caaacacaac gagttggata tcataggaaa gacaactaca caagcatttc 1800 aatgatatgt aatatactag aattggtgct gaaacgactg aaatatgagt aattaaagtt 1860 gatattccaa accttttgtg ccgagtg 1887 // ID Tx1-9_CQ repbase; DNA; INV; 4853 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4853 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 641-641 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 3..1307 FT /product="Tx1-9_CQ_1p" FT /translation="XXKXXXVSSLTDFKRDRSVSPXANAYXALCEXKSHCX FT TGIVCQGFFXCGIFRGVAPXMEPKREFTIKLRFGAEARNPNNAEIFKFFNK FT QGWTSEDLSAMYRDDHSVFVRFKTEPMMRGALTKLGPXVKFDYDNGTSLEV FT XVQAAAGTFKYVRIFGLPPEVDDRHIVTTMSKYGAIQQLVRERYSVETGFP FT IWNGVRGLHIELVSEIPAQLTIQHCKARIYYDGLQNKCFGCGALDHLKATC FT PKRNSVNKRLAAAATPAGGSFASIVANGTPAEPVGASPASGMVVLNPKPPA FT PISKPNLEDHVEPVAPEFTAPAEQLALPQGGGDEVAEESGDQSMDEDDDNQ FT QNEGEVSPNGAASMETDDAWTEAKGKGKKGKGKRGRPRKPIESDASESDAI FT GGKQFIVPAQGADLLAALALPKPTGPRTRSASEQRSPHDNK" FT CDS 1586..4747 FT /product="Tx1-9_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="FFLQEVSYEDFSFIFTHNALVNISLDRKGTAILIRKT FT FDYSDCIYDPSGRVISVIVNNVNFVNIYAHSGSNMKKERDNLFTEALTVHL FT NKPNSASTLIGGDFNCILDAADTKGPYKNFSCGLKSLIDLFAYKDIAKLRK FT ENQFTFYRGDSASRLDRFYAPSSLVENVVACSTLPLAFSDHHSVVIKIKYL FT NGNLLTKGRGYWKVNPSLLSMNNITXKFKQEYENLKXRNVYNYNFNRWWNQ FT HXKNKVKQFYKNESYNLNLTVRNSKSALLSSLNNLSKRLVEGEDVADEMSI FT VKSKLMNLEQNRLKNLGEKMPSSTIAEGEKMTVYQLSNSIHRGSSFGLRLK FT DISNGNIVTESTKIGPMVHSYFSEQYNNVSDLVDQDEADVILQNVTKNLTI FT EEKQSLEEYISVEEIAATLKLCNRKKSPGPDGLTYEFYLTHFEILKFDICK FT LFNAYLSGELIPPKEFSEGIITLIPKKGDTTLLQNQRPISLLNTDYKLFTK FT IIANRIQSKIGNLIDIGQSACIAGRSCTDNLNDVRRILTKSIESKSFKGFL FT LSLDLEKAFDKVSHNFLWIVLKRFGFSDKLISCLKCLYGKASSKILFNGFL FT TEEIRIRCSVRQGCPLSMILFVLYIEPLIRNIDQAILGVLTFNKFLRVIAF FT ADDLNVFLRNNEEFDIVLNIIDSFSQCSKIKLNKDKSCFWRINKCTGGPFM FT VKETENLNLLGVTFKSNWSNTIDYNYEKLINGIKYRINLNNFRTMDLIQKV FT WYLNTFILSKLWYLSQVLPPKNKHIASIRKISGMFLWKGCIFRVDKRQLYL FT DMDRGGLGFIDPEAKCKALFIKNILKETDDNSQDEKYLLNFNHLQRLTRNG FT REWVEESNRLINDNIQIKCTKQLYRTFIDQYKFTPRIETEFPLINWNILWN FT LMNKPYLLSHDKSKLYLLFNDLVPNKRKLLHYNIGRVENEICEVCGEIDTN FT LHRVRSCLAGKEIRDWVFGIFRKRFAVSAKQIDDLFFWRIDTESFPQRAAM FT WLAVHFVAYCVDKFPRFNLFIFQRSIREFRWNSKGLVKKYFEDYLNIC" XX SQ Sequence 4853 BP; 1552 A; 859 C; 1037 G; 1361 T; 44 other; atttmgwcaa attmaktwgg gtcagttcgc tgacagactt caagcgagat agaagtgttt 60 cwccagwsgc taacgcatat tgwgcmctct gtgaamgcaa aagccattgt ttkactggca 120 ttgtttgcca gggatttttc mtttgtggaa tttttcgggg cgtagcwccc waaatggaac 180 cgaaacggga attcaccatc aagctccggt ttggggctga ggcccggaat ccgaacaacg 240 cggagatctt caagttcttc aacaagcaag gatggaccag cgaggacttg agcgcaatgt 300 accgcgacga ccacagcgtg ttcgtgcggt tcaaaacgga accgatgatg cgcggtgccc 360 tgaccaaact ggggccgmag gtmaagttcg actacgacaa cggaacctcc ctggaagtgm 420 aagtacaggc ggccgcaggt acgttcaagt acgtccggat tttcggactc cccccggaag 480 tggacgatcg gcacatcgtt acgacgatgt ccaagtacgg tgccatccag cagttggtgc 540 gggaacggta ctcggtggaa actggtttcc cgatctggaa cggggtcagg ggcctccaca 600 tcgagctggt gtcggagatc cctgcgcagc tgacaatcca gcactgcaag gcaaggatct 660 actacgacgg attgcagaac aaatgcttcg ggtgtggggc acttgatcac ctgaaggcga 720 cctgcccgaa acggaacagt gtcaacaagc ggctggccgc ggcggckaca ccagctggcg 780 gatcctttgc tagcatcgtt gcaaacggaa cacctgcgga accggtkggc gcatcgcccg 840 catctggaat ggtggtactt aacccaaaac ccccagcacc catcagcaaa cctaacctcg 900 aagaccacgt ggagcctgtt gctccagagt ttacggctcc tgcagagcag cttgcgctgc 960 cgcagggtgg aggggacgaa gtcgcggagg agagcggcga tcagtccatg gacgaagacg 1020 acgacaacca gcaaaacgaa ggagaagttt cgccaaacgg cgcagcttcg atggagacgg 1080 acgacgcctg gactgaggcg aagggtaaag gcaagaaggg maaggggaag cgtggtcgtc 1140 cacgaaagcc catcgagtcg gacgcatccg agtcagatgc gataggagga aagcagttca 1200 tcgtgccagc ccaaggtgcc gacctgctgg cggcgttggc gctgccgaag ccgacgggac 1260 cacgaacgag atcggcttcc gagcaacgat caccacacga caacaagtaa amcatttcaw 1320 ckcwcmcamm cwccttttcc ggacatgacc cttgcccacc agtammctga ctctcttcgg 1380 acacckwtac taacacaaac gatamgccaa tactaaccma tactaactca actaacatwt 1440 actaatactg atawtactaa cccaagccac actaacacwa tgaactatcc gtattctatt 1500 gccaccataa atttaaacag caccamcacg acggttaaca agggattatt gaaggatttt 1560 gtatataacc acgacatcga tataattttt tttgcaagaa gttagttatg aggatttttc 1620 ttttattttt acgcataatg ctttagttaa tattagttta gatcgtaagg gaacagcaat 1680 acttataagg aaaacttttg attactctga ttgtatatat gatccatctg gtagagttat 1740 ctcagtaatt gtaaataatg taaattttgt gaatatttat gcgcactctg gaagtaatat 1800 gaagaaggaa cgggataatt tatttactga agctttgact gtacatttaa acaaacctaa 1860 ttctgcatca acgttgattg ggggagattt taattgtatt ttggacgcgg ccgacactaa 1920 agggccctac aagaattttt cttgtggatt aaaaagtttg attgatctat ttgcttataa 1980 agatattgct aaactcagaa aagaaaacca attcaccttt tatcgaggtg attccgcctc 2040 cagactagat aggttttatg cgccttctag tttggtggaa aatgttgtag cttgctctac 2100 gttgcccctc gctttctcgg atcaccactc tgtwgtaatt aaaataaaat atttaaatgg 2160 gaacttattg acaaaagggc gtggctattg gaaagtcaac ccctctttac tttcaatgaa 2220 taatataact gamaaattta aacaagagta cgaaaatctt aagsaaagga atgtatataa 2280 ttataacttt aacaggtggt ggaatcaaca tttwaaaaat aaagtwaaac aattttataa 2340 aaatgaaagt tacaatttaa atcttacggt caggaattct aagtcagctt tgttgagctc 2400 tctaaacaat ttatcgaagc gattggttga aggggaagat gtggcagatg aaatgagtat 2460 tgttaaatct aaactaatga atttagaaca aaatcggttg aaaaatcttg gggaaaaaat 2520 gccgtcgagc acaatcgcag agggggaaaa aatgacagtt tatcagcttt caaattcgat 2580 acatcgtggg agttctttcg gtcttcgttt gaaagacatt tcaaacggaa atattgttac 2640 tgaaagtacc aaaattggcc ccatggtgca cagttacttt tcggaacagt acaataacgt 2700 ttcagatctt gttgaccaag acgaagcaga cgttatttta caaaatgtaa caaagaattt 2760 aacgattgag gaaaaacaaa gtttagaaga gtacataagt gttgaagaaa tcgctgcaac 2820 tttaaaatta tgcaaccgta aaaagtcgcc gggcccggac gggctgactt atgaatttta 2880 tctaactcac tttgaaattt taaaatttga tatttgtaaa ctttttaatg catatcttag 2940 tggagaatta attcctccaa aagaattctc agaaggaata atcactttga tacctaaaaa 3000 aggtgataca actcttctac agaatcaaag accaatcagc ttattaaaca cagattataa 3060 actttttaca aaaattattg caaataggat acagtcaaaa attggaaatt taatagatat 3120 agggcaaagt gcatgtatag cagggcgttc atgcacagat aatcttaatg atgttcgacg 3180 tattttaact aaatcgattg aaagtaagag cttcaaaggt tttttgctta gtttggatct 3240 cgaaaaagct tttgataaag ttagtcataa ttttctatgg attgttttaa aaaggtttgg 3300 attttcagat aagttgatta gttgcttgaa gtgtttatat ggaaaagctt cttctaaaat 3360 attgtttaat ggttttctta cggaggagat tagaattagg tgttcagtcc gtcaaggctg 3420 ccctttgagt atgattttat ttgttttata tatagaacct cttattagga atattgatca 3480 ggccatactc ggggttctta cgtttaataa attcctcaga gtcattgcct tcgctgatga 3540 tttaaatgtg tttcttagaa acaatgaaga gtttgatatt gtcctcaata ttattgactc 3600 tttctctcaa tgttcaaaaa taaagttaaa caaagataaa tcatgttttt ggaggataaa 3660 taaatgtaca ggaggtccct ttatggttaa ggaaacagaa aatttaaatt tgctcggggt 3720 gacctttaag agcaattgga gcaatactat tgattataac tacgaaaaat taataaatgg 3780 gataaaatat aggatcaact taaataattt tcgaacaatg gatttaattc aaaaagtttg 3840 gtatttaaac acatttattc tttctaagtt gtggtatctc tcacaagttt taccaccaaa 3900 aaataagcac attgcttcaa ttaggaaaat atcgggaatg tttctttgga aaggttgtat 3960 ttttagagtt gacaaaaggc agctatattt agatatggac cgcggtggat tgggattcat 4020 tgatccagaa gccaagtgca aagcattatt tataaaaaat attttaaaag aaacagatga 4080 caactctcaa gatgaaaaat acttattaaa ttttaatcat ttacaaagat tgacgagaaa 4140 tggcagggaa tgggtagaag aatcaaatag gctaatcaac gacaatattc aaataaaatg 4200 tactaaacaa ctttaccgta ctttcattga tcaatacaaa tttaccccac gtattgaaac 4260 agagtttcct ttaattaact ggaacatact ttggaattta atgaataaac catatctttt 4320 atcacatgat aaatctaaac tttatttgct tttcaacgat cttgtaccaa acaagcgcaa 4380 attattacat tataatatag gaagagtgga aaatgaaatt tgtgaagttt gtggtgagat 4440 agacactaat ctgcatcgag ttcgtagttg tctagccggg aaagaaattc gcgattgggt 4500 ttttggaatt tttcgaaaaa ggtttgcagt ttcggctaag caaatagatg atcttttttt 4560 ctggcgtatt gatactgaaa gttttcccca aagggctgcg atgtggcttg ctgtacactt 4620 tgtcgcttac tgtgtggata aatttccccg tttcaatttg tttattttcc agaggagcat 4680 aagagagttt agatggaatt cgaaaggttt agtgaagaag tattttgagg attatttgaa 4740 catttgttaa aaaagcgtag ggagtagatg tcgtcgtgtg tggtgcctgc attttgtaaa 4800 gttacaatgc aatagaaaaa taaaggatgt tttagaaaaa aaaaaaaaaa aaa 4853 // ID Gypsy-27_AA-LTR repbase; DNA; INV; 105 BP. XX AC supercont1.320; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_AA_; KW Gypsy-27_AA-I; Gypsy-27_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-105 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.320; Positions 426582 426478. XX SQ Sequence 105 BP; 22 A; 27 C; 27 G; 29 T; 0 other; tgttggggtc aaatggtcac cctaacgcta tcaactaggc tctccgttgg cagcctcgac 60 tgtcaccggt agaggtcatt gttgtatgaa cgatgctcga ttcca 105 // ID CR1-5_BF repbase; DNA; INV; 3871 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-5_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3871 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3871 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1576-1576 (2009). XX DR [2] (Consensus) XX SQ Sequence 3871 BP; 1100 A; 852 C; 715 G; 1204 T; 0 other; acattttcta ccgctcttgg taattgtttc gatcttgatt aaaagcaatc gcccatagcc 60 aggctgtgat tattattgta attaagtcac agctgacctc acggcaattg tatgacgatg 120 tacctatgtt tgaactatgt tatagttagg ataagatctc attttgtgtg ttatgatgtt 180 ttatgtctta ccaccgtctt gtgtagctgg tgtccttacg attatctaat aacaagcctc 240 ccgtttgcgc ggcaaccaga ctaacctaac ctcgtcatgg gcaaattccc attagataaa 300 caaagacacc tgtctatatg ccactcaaac gtgaggagcc tgtctgccaa cgatttcatt 360 aaacttcatg agcttgaaac gattgctctt gaggatgatt ttgatgttat tgctttgtct 420 gaaacctggt gcgattccac catacccgat tcggacattt cacttgcctc ataccagtcc 480 ccgatgcgac gtgaccgaaa tcggcatggt ggtggagtcg cactttatgt caggaactca 540 ataccttcta aaagaaggag tgatttagaa tgcgatcttt ttgaaacaat ttggtgtgaa 600 attgaggcca acaattggca agtatttatc tgtgtctgct acagacctcc tggtcaaagt 660 gcgcaagaac gcacagcatt catagaacac ctgcagaatt cagcagacgt tattatgact 720 gacccaggtg gtaaaactgc agtagttatt gttggggatt ttaacaatac ttcagataat 780 tcctcagtcg acatcgacag gtttgtggct gataataact ttgtacagct aattcgagag 840 cccactcgaa ttacggagcg gacgtcaacg ttgattgacc taattatcac cgattctccc 900 agtcttttca ccgagtgtgg tacacatgca cctctatcca actgcgatca ctctcttata 960 tatggaaaaa tgtctctcag gatcccacct gttaaaacat tcaaaagaac cgtctacgat 1020 tttggaaagg cggattacga gggactgaac tccaaaattc tgactatcga ttggaatgcc 1080 atgttttcgg cgaaccttcc tgatattaat gccacaactg agtcatttat ggcgaccctc 1140 tctagcgcta tcagaatgtt tgtgcctagt aaagtgatca ctgttcgccc gagggatcag 1200 ccgtggatga cctgtgaaat taggcatatg ctgagaaaaa agaagcgact ttacttgaaa 1260 tacaaaagga caaatagtcc agatttgttt cagactttca aacatatacg caacgcgtgt 1320 acaagcctca ttaggaaagc taaattgaac catcgtctca gcctgtgtct cgatctcgcc 1380 aaccctgcta caaatcctaa aaagtggtgg tcagtttcca aagaaatgtt cttgggtaag 1440 ctggatcgag caataccatc cctgattgac aatggacgtg ttgtatcaga tgacaaggcc 1500 aaggctgagg cttttaatca atactttgcg tcacaaacgc acctaaacac agctggagcc 1560 acccttccgc aatttacacc tagaactgaa tctactaaga actgcattaa ttgttctccc 1620 cttgatgtct atacaatttt gtgcaatctc gacgttggta aggccactgg acccgatggc 1680 atcggcaaca gacttctcaa acacattgcc ccatcccttg ctgcgccact ttccaatttg 1740 tttaataggt cagttcaagg cggctgcttc ccgggaattt ggaagcgctc aaatgtaatt 1800 cccgtccata aaaatggtga caaacagaca aagacaaatt ataggcctat atctctgctg 1860 tgttgtatat caaaagtctt tgagagaatt gtatatatcc ccttgtcatc ctacttgtac 1920 gaaaacaaac tgctcaccga tcgaaactca ggtttctctc cgggagactc gacagttaac 1980 tcactgttat atctaacaca taatatgcat caagcacttg atcaaggact tgatgtacgt 2040 actgtttttt tagacatatc taaagcattt gataaagtct ggcatttggg gctaattttt 2100 aaattaaaac aattaggaat aactggtccc ctccttgcct ggattgagtc atatttaaat 2160 gatcgtgagc agcgtgttgt actcaacgga gcctgttccg aatggttacc cattggtgct 2220 ggtgtcccac agggctctat tttgggccca cttcttttcc ttgttttcat caatgatttg 2280 gtggacaact tagaaacgga tggccgctta tttgcagatg atacctcact aactgagatt 2340 gttagagacc ctattacctc agctgtaagg ctaaatcacg atcttagact agttcataac 2400 tggtcacgtc agtggctcgt tacattcaac cctcttaaaa ctcaactaat aacttttaca 2460 accaagcgaa ctaaagttgt acatccacca ctgtatttta tgggtattcg gttgactgag 2520 gtatcctctc ataaacacct cggcctacat cttacgccaa aattatcatg gtcattacat 2580 gtttcaatta ttacaactaa tgcaatgaaa cgaacaaatt tactaaagag aatttcaaga 2640 tttgctccaa ggcaaacact cgaaattctt tacaaaacta tgatacgacc actattagag 2700 tacgccgatg ttgtatggga tgaattgtcg gttaaagatt gcaacctaat tgaatctgtc 2760 cagctcactg cagctcgaat ttgcacaggt gcaatgagag gaacaagtaa cgacagtatc 2820 ttgtcagaag ttgggtggga aactactgcc agcagacgcc aaaagcacaa acttattcat 2880 ttttacaaaa tggttaacgg tttaacacct gaatatcttt gtgaaattat tccaccccag 2940 atctgtgacc gcactacata cagccttcgt gctaatcaaa attatatcca aattccttct 3000 agaactgaga agtacaggag atcattctta ccttcctctg ttctttcatg gaataaacta 3060 ccgctcacta ccagagtatc tccctctgtt gctagcttca aaaatgagtt ggagtctttt 3120 ttctatagac ctcctaccca ccttcacttc tcgcatggtc ccagataccc agcaatccat 3180 ctcactagac ttagacttaa cttcggtcag ttgaatttcc acctgttcca acataaatgt 3240 gtaaatagtc ctatgtgtaa atgtggccac cctaacgaat caactataca ttattttcta 3300 cactgtcctc tatacaatga tgcaagactt gatatgttgt catctttaac cactatgttc 3360 atgcagggta tcttaccatt tgacccaaga cacaccactg aaaatatttt atgttcaacc 3420 ttactcaggg gaaacccagt actatctacc attcagaact gtcatttatt tgatgttgta 3480 tttaagttca tcaaacagac agatcgtttt agctgaacct tatttgatta tttctaagtt 3540 tgtttatcag ccatatgtca ctctgttgtt tgattccttg tttattttca tgtagttatg 3600 tcttgtgttc aatagacagg ttgtttaagt tgatttataa ccttatttga ttacttctaa 3660 gtttgtatgc cagctacacg tcactttgtt atttgattcc ttgttgtttt atttcatgtc 3720 gttatcatta tctcttgcct ttgttattta gcgttttgta gtcctgtatg tgtgtcttta 3780 ttttgtttat gtacatgagg agaagacctc gataagcagg gtctgctttc cgtctcatcc 3840 tctttaaaga aatgtgaata aaaaaaaaaa a 3871 // ID Mariner-8_HM repbase; DNA; INV; 2796 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2796 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 225-225 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(499..1170,1239..2405) FT /product="Mariner-8_HM_1p" FT /translation="MPRNPKKKLGSRKYKDYTDETLKIAVEEVKKGSSLRK FT IAEKFAISRFALTAALKGKVNKCGRPPVLNSEEERRLVESISMAGEWGFPL FT TALDIRLIVKGYLDRCGRNEVRFKNNMPGVDFIDSFLKRNSKDLSKRLGQN FT IKRSRAAISRDIVNNYFNELEKTLKDVDPSMIVNYDETNMTDNPGRKQVIV FT RRGSRHPERIVDSSKSSTSVMFLEVQVECYCLLIIEEWFTLEIFENWFQRL FT ALPFKKFDSDAKKVLIGDNLSSHISAHIEECNKNNIKFVLLPPNSTHFTQP FT LDVAFFRPLKIKWRQTLTEWKEANRGSIPKDRFPRLLKKCLDALGEENISK FT NLKSGFKAAGIVPLDRNEILKRLPDGDATAVSEELNSSNNWTDSIKAFFEE FT SRKRETEPLKQRKKRKLNVPAGKGIEGIEDTILINDDDPQPNRSNTPNTKT FT KENHSKLKTLKHNKKLANSDSFDDSDDQYSLRDSNSDMDPETFSDLESEIP FT VDLQLEETTYDEPPKIGVSKGELKPDIYVIVELIFDKNTKKETTKQFYGVI FT LRTCFEKNEVTVKFFRRSSKSKGNDYIFPNVDDEMDIQLTDVISIVKPSKV FT NRGRYTFPYELPVKH" XX SQ Sequence 2796 BP; 965 A; 482 C; 533 G; 816 T; 0 other; taccgtagat gggggtaact aaagcctacg gggaaactaa agccattata gcaggtaatt 60 tttggatggt gatagtggca gcactgtttg gattcaacag ctgatgttgt gttcttctaa 120 taggaacttg gactgctttc caaatacttg tctggattat taaaacaata tcacggtaag 180 tgtacagtaa aactatatat ttaacctaaa aaatcgataa aaaagttacc ttgcataact 240 tctgctctga acaaatcatt tttaatctta aattctcata actctcaaca aacataacct 300 caatttatgt ttcatcacac ctttaaaggt tataacctat aacttgattc tttcttttgt 360 tgtgatgttt tttttttcag tgatacacaa catggggtaa ctaaagccac atcataaagg 420 tggcttttgt taccccatac tttataataa gtcaaaattc cttaagtaat aaattttttc 480 tcttttttag gactaaaaat gcctcgaaac ccaaagaaga agcttggtag ccggaagtat 540 aaggattaca cagatgaaac gctgaaaata gctgtagaag aagtgaagaa agggtcatct 600 ttaagaaaga ttgctgagaa atttgctata tcaagatttg ctctcactgc tgccctgaaa 660 ggtaaggtta acaaatgtgg taggcctcca gttcttaatt ctgaagaaga aagaagatta 720 gttgaatcta taagcatggc cggagagtgg ggattccctc ttacagcact agacataaga 780 ctaatagtga aaggttacct ggatcggtgt ggccgaaatg aagttaggtt caaaaacaat 840 atgcctggag ttgattttat tgacagtttt ctcaaaagaa acagtaaaga tctttccaaa 900 cggttgggtc aaaacattaa aagaagtcga gcggctatca gtagggacat cgtgaataac 960 tactttaatg aactagaaaa gacattaaag gatgttgatc catcaatgat agtaaactac 1020 gatgaaacca acatgactga caacccaggg agaaaacaag tgatcgtgag aaggggatca 1080 aggcatccag agcgtatagt agattcttct aagtctagca catcagtaat gtttttggaa 1140 gtgcaagtgg aatgctactg cctccttata taacttataa ggcagcccat ctgtatgact 1200 cttggacaga gtggtccagc aggcactgtg tataatagat cgaagagtgg ttcactctgg 1260 aaatttttga aaactggttt cagagattag ctttgccctt taaaaaattt gacagtgatg 1320 ccaagaaggt tttgatagga gacaatttgt cgagtcacat ctctgcccat attgaagaat 1380 gcaacaaaaa taacatcaag tttgtactgc tacctcctaa cagtacacac ttcacgcagc 1440 ctttagatgt ggcattcttt aggccattaa aaataaaatg gcgccagact ttgacagagt 1500 ggaaagaggc taatcgtggc tctattccaa aagacaggtt tccaagactt ctgaaaaagt 1560 gccttgatgc tttgggagag gaaaacataa gcaaaaactt aaaaagtgga tttaaggcag 1620 ctggtattgt acctttagac cgaaatgaaa ttctgaaaag gcttcctgat ggtgatgcca 1680 ctgcagtatc tgaagagctt aattcaagca acaactggac tgattctatc aaagcctttt 1740 ttgaagaaag taggaagaga gagaccgaac ctcttaaaca gcggaagaaa aggaaactta 1800 atgtcccagc aggaaagggg attgaaggga ttgaagacac aattctgatc aatgatgatg 1860 atcctcaacc aaataggtcc aatacaccca ataccaaaac aaaggaaaac cacagcaaac 1920 taaagacttt gaaacataat aagaaacttg cgaattccga ctcatttgat gatagtgatg 1980 accagtattc tttgagagac tctaattcag atatggatcc agaaacattt agtgacctgg 2040 aatcggagat accagtagat ctacaactag aagaaacaac ttatgacgaa ccccccaaaa 2100 taggagtctc taaaggagaa cttaaacctg atatttacgt cattgttgag ctcatctttg 2160 ataagaacac aaaaaaagaa accaccaaac agttctatgg agtcattctt agaacttgtt 2220 ttgagaagaa cgaagttaca gtaaagtttt tcagaagaag ctccaaatca aaaggaaatg 2280 actatatatt tccaaatgtt gatgacgaaa tggacattca gttaacagac gtcatttcaa 2340 ttgtgaaacc atccaaagta aacagaggac gttatacttt tccttatgaa ctaccggtaa 2400 agcactgatt tcaacaagac ctattttaaa taatattttt gtctcttttt aatgtttttc 2460 ttaaataata aagctttttt cttgtcaata gtggttcttt attgttacag atgcatgatt 2520 tatggattta gttaccctag cactctctaa ctaaagccaa acattgcttt ttaatggatg 2580 gcttttgtta ccccttttcc aggggtaact aaagccacca tgttttcatt ttttttatta 2640 aaaaaatgta tatttctgca atttatgaat atatttcttt aatatatgac cacagaccca 2700 agttttgcaa tatttattta acctataaat taaagggtct gacttgggca gaagtttgaa 2760 tttcgaaagt ggctttagtt acccccatct acggta 2796 // ID hATm-1_CS repbase; DNA; INV; 3057 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hATm-1_CS, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; KW Autonomous DNA transposon; hAT superfamily; hATm group; KW hATm-1_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3057 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1044-1044 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM and hAT-8_SM CC transposons also belong to the hATm group. Their putative CC classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-1_CS is a young family of hATm autonomous DNA transposons CC identified in the sea squirt genome. The consensus sequence was CC built based on 7 copies that are only 1% divergent from it. TIRs CC are 15-bp long. XX FH Key Location/Qualifiers FT CDS 439..2598 FT /product="hATm-1_CSp" FT /note="transposase." FT /translation="MATSVQTRSKTFIWLIGSPSSEVLEKRLPTKLEVFKT FT FFYHHIEEHKTVHQSAIETVNKVFHVWSNARIPTTQKHNAVKRLENLIQKY FT QKLKKNRNRVSDTQQDKEVEFLTNLEELWDIAHQNAMDKIKIEEDRQFLLC FT QRQGTGYMMGIDKKLKKKEERILKRKKEERDRYERHCKETFLDKPSTSTEI FT DSSTSSLNETSSSASSNDYSYSTSPIKVHSTTHTNKKQKIITPNLSSALDR FT VKMSDRNAMYTVVAAASSMGHDIGKLALSRSTIRRTRISQRETIANEIKTS FT FDPQTKLVVHWDGKLLCDITGNKERVERLPIIVSGQGIEKILSVPKLSQGS FT GRAMANATINTLQQWDVVSLVRGMCFDTTASNTGAYSGACSLIEKKLERKL FT LHLPCRHHIHELILANVFKGLFGCSSGPEVLLFKRFQAYWKYIDQAKFEPY FT TEDSDSGEDIVQWKTEAIVNAFKIVKETQPRDDYREVLELSLIFLGETPPR FT GTRFQAPGAFHHARWMAKMIYCIKMYLFRNQFKLTKNEVKAITEFAKFSVL FT VYVKAWFSSPLPILAPINDLNLLKTLNRYEAINTSVSALSLKVLKRHLWYL FT SEEIVLLSLFSDENIIPQAEKLKMVDSLKKKAKVRKKRYRPTNIKDIASAC FT LSDFVTEDSLSLFEILDLDQTFLEHHPSTWNEQQHSYKTAKDCVMSLRVVN FT DCAERGVALIWRVQQLSYEG" XX SQ Sequence 3057 BP; 1132 A; 515 C; 554 G; 856 T; 0 other; tagggtggtc caaaaaaaaa attttttttt ttactcaaac cttaaccctg cattttgtta 60 gaattactaa aataatcacc cagaccaaat ttcttctttt taaaataatg tttaaagggt 120 gctcaacgat atcaaagttt tgtgaaattg tgatatgtag gtcaacatgc aatatgcatt 180 aactcatatg caatgtattc taaaaacaca attagccatt gttacatcat agttatggca 240 gttctaaata tagaacaaat aaatgtttca ccatattgaa gtttaaaagc aagtttagtg 300 tttgtctgtt tgaaagatac agtgcagtaa aaagttagtt taatagaacc atgcagtgaa 360 tgccagaata atcttaattt tttacagatt ataccagtac taaaagtgaa ttagatatta 420 tataaatgtt atgttattat ggcaacttca gtacaaaccc gatcaaagac tttcatatgg 480 ctaataggca gtccttcatc agaagtttta gaaaagagac tgccaacaaa gcttgaagtt 540 tttaaaactt ttttttatca ccacattgag gaacataaaa ctgtgcatca aagtgctata 600 gaaactgtaa acaaagtgtt ccatgtgtgg agtaatgcaa gaattcccac aacacaaaag 660 cataatgcgg tcaagcgact tgaaaatctt atccagaaat accaaaaact gaaaaaaaat 720 cgcaataggg tcagtgacac tcaacaagac aaggaagtag aatttttaac aaacttagaa 780 gaactgtggg atattgcaca tcaaaatgca atggataaaa taaaaataga agaagacaga 840 caatttctac tatgtcaaag acaaggaaca ggttacatga tgggaattga caagaaatta 900 aaaaaaaaag aggagagaat tttaaaacga aagaaggaag aaagggatag atatgagaga 960 cattgtaagg agacattctt agataaacca tctacttcta cagagataga ttcgagtacc 1020 agcagcttaa atgaaacatc gagtagtgca agcagcaatg actacagtta cagtacctca 1080 ccaataaaag tccattcaac aacacatact aataaaaaac agaaaataat cacaccaaat 1140 ttgtcatcag ctttagacag agtcaagatg tcagacagaa atgccatgta tactgtagta 1200 gcggctgcca gcagcatggg ccatgatata gggaagctag cacttagcag aagcaccata 1260 cgccgcacaa gaatatctca aagagaaaca attgcaaatg aaataaaaac ctcatttgac 1320 ccacaaacaa aactagtggt acattgggat ggaaaactat tatgtgacat tactggaaac 1380 aaagaaaggg tggaacgcct acctattatt gtttctggcc aaggaataga aaaaatatta 1440 tctgttccaa agttatcgca gggaagtgga agagcgatgg ccaatgccac aattaatacc 1500 ttacaacaat gggatgtagt gtcacttgtc cgaggaatgt gtttcgatac tacagcgagc 1560 aatacaggag catactctgg cgcatgttct ttaattgaaa aaaaactaga gagaaagctg 1620 ctacatctcc cttgcagaca ccacattcat gaactgattt tagcaaatgt ttttaaagga 1680 ttatttggtt gcagctctgg ccctgaagta ctattgttca aaagatttca agcatactgg 1740 aaatatatcg accaagcaaa atttgagccg tacactgaag acagtgatag tggggaagac 1800 atcgtacaat ggaagactga agcaattgtt aatgccttta aaattgtgaa agagacccag 1860 ccaagggacg attacagaga ggtgttggag ctgtcattga tttttttagg tgaaacacca 1920 ccacgaggaa cacggtttca agctccaggt gcatttcacc atgcacgttg gatggccaag 1980 atgatatatt gcatcaaaat gtatctcttt agaaaccagt ttaagctcac taaaaatgaa 2040 gtaaaagcca taactgagtt tgcaaagttc tcagtcttag tatacgtaaa agcctggttc 2100 tcctccccac ttccaattct tgcaccaata aatgacctaa atttattgaa gaccctaaac 2160 agatatgaag ctataaacac aagtgtttct gccttatcct tgaaagtatt aaagcgtcat 2220 ctttggtatc ttagtgaaga aatagtttta ctgagcctct tttcagatga aaacatcata 2280 ccacaagcag aaaaattaaa gatggtggac agtttaaaaa aaaaagcaaa agtgagaaaa 2340 aagcgttata gaccaacaaa catcaaggat attgcttcag cttgtttgtc tgattttgtg 2400 acagaagatt ccttgtccct gtttgaaata ctagatctgg atcaaacatt tttggaacat 2460 cacccatcaa cttggaatga acagcaacat tcctacaaga cagcgaaaga ttgtgtaatg 2520 tcattgcgtg ttgtaaacga ttgtgcagaa agaggagttg ctctaatttg gagagtacaa 2580 cagctttctt acgaaggatg aggagcaaaa acaatacttg cttcaagttg tggaacaaca 2640 cagaaagaac taccctactc caaacaaatc taattacata aacaatgaat tgggcttgta 2700 gagctagagc gttttaataa tttcaaattg cattcgttta aacatgttta ctctgtagtc 2760 tatgtgtatt attctgtgtt gtacttcttt actgctatct atgattgctt aataaatcat 2820 ggtgtttggt ttttaaacct gtttactttg gttgttttat acattatgct ttcaaacaac 2880 atcacaggct taatatcaga atatggctaa gttttaaagg acttgagcaa cctctaaatc 2940 ttgtttaagg gaagtgaaat ttaacaggat gagtttttag aacattcctc acaaaaaaaa 3000 acttaagact taaacttttt taaaaaaaat ttttttcaca aattttggac cacccta 3057 // ID GYPSY1-I_CS repbase; DNA; INV; 4080 BP. XX AC . XX DT 11-APR-2005 (Rel. 10.04, Created) DT 11-APR-2005 (Rel. 10.04, Last updated, Version 1) XX DE Clonorchis sinensis LTR retrotransposon, internal sequence DE consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; gag-pol; CsRn1; Ty3/gypsy-like; GYPSY1-I_CS. XX OS Clonorchis sinensis OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis. XX RN [1] RP 1-4080 RA Bae Y.A., Moon S.Y., Kong Y., Cho S.Y. and Rhyu M.G.; RT "CsRn1, a novel active retrotransposon in a parasitic trematode, RT Clonorchis sinensis, discloses a new phylogenetic clade of RT Ty3/gypsy-like LTR retrotransposons."; RL Mol Biol Evol 18(8), 1474-1483 (2001). XX RN [2] RP 1-4080 RA Gentles A. and Jurka J.; RT "C. sinensis gypsy LTR retrotransposon."; RL Direct Submission to Repbase Update (11-APR-2005). XX DR [2] (Consensus) XX SQ Sequence 4080 BP; 969 A; 1248 C; 752 G; 1110 T; 1 other; aaattggtga gcccgtgtct ttctagattt ttttttgaaa acttttcctt tccggtcgtc 60 gtaatttttt tcttttgccc ggtcataatg gaggcacact ttgattcagc ggatttagtc 120 gagaacgcga ttagcatacg tttgccagac tataatccac gtaatcccag ggtatggttt 180 caccaggtag aggccgtttt tgctacccga cgcatcacct cccaagccac ccgattctcc 240 tatgtcgtcc agcatcttcc ttgtgatgta gcaacagaag tggaagatct gttggaagat 300 atccctaagg aaaaccccta tgactccctt cgagcggccg tcatcagtag gacaggaaag 360 tccgaaaaca agatgctacg ggacttattc accactgtcg aacttgggga cagatcacca 420 tcacaactgc ttcggcacat gcgcagtctt ctctgcggcc gtaagctagc ggtggaaatc 480 atggctcaac tatggctcga caaacttcca ccgtcaatgt cccgcgtcat cagcgctttc 540 gttgatgacc acagcttgga gcagttagca cagatggccg ataaaatcca cgaaacctac 600 cccagtaacc ccgtgaactc cgtctcccgc tctgcaccca ctgcttctac agattccgat 660 ccgattctgg cttccatatc gcaactacaa gccaagtttg atactttgac cgatcgccta 720 caacggctag agatcaacaa gcatcgacca cgttcacgat cccgatcgtt gtctcgtgcc 780 cgtccttcct ggtgctggta tcaccaaact ttcggtcctc aagcacgcaa atgccaaccg 840 ccttgttcct tcaagccgcg ttccaacacc cctttaaaca ccccggccag tcagtaacgg 900 cgacgactgt ctgtggccct cctcgcacta gtcgcctatt ccacatcacc gatcgttcaa 960 gtggtctacg tttcctcgtg gatacgggag ctgaagttag cgtgctacca cataagactc 1020 cacttcacga ttctgcctct tattcgttac aggctgctaa cggtacccgc atcgctacat 1080 acggcgaacg ttccctcacc cttgacctcg gattgcgacg tgcgttcaag tggatcttcc 1140 ttttggctga cgtccaaaca cccatcattg gtgcagactt cctgacccac tacaacttgt 1200 ctgtcgatgt tcgaaacaaa cgcctcttgg acatgctaac ttccctctca gtcaacggaa 1260 ttagtgcccc agcttcatcc acaggtatcc gagttatctt acctgattca cattttgctg 1320 acatacttag ggattttcca acccttactc atccatgtca gtacacccaa ccggtcactc 1380 attccgtcgt tcaccacatc cagacgaaag gtccacctgt ccatgccaac ccccgccgac 1440 ttcatccgga taagctacca atcgcaaaac acgagttcga acacatgctc gaactcggca 1500 tcatccgcac ttcatcgagt cattggtcgt cacccctaca tatggtaccc aagaaatcta 1560 aaggtgattg gagaccttgt ggagattacc gttctctcaa taacgctact atcccagaca 1620 ggtatcccat cccacatatt cacgatttcg caagtacctt atgccacaca aacattttct 1680 ccaaactcga tctcgtacgt gcatactacc acatacccgt agccccggat gacatcccta 1740 aaacagccat cacaacaccg tttggccttt tcgaattcac gcgtatgccc tttggtcttc 1800 gaaatgcagc tcaaacgttt caacgtttca tggacgaagt tttacgtggt ttaccttttg 1860 tgtatgctta tcttgacgat gttttaattg ctagcacctc cccaacggag catgcagctc 1920 atctcagggc agtgtttgaa cgtctttcca cttacagcat acgtttaaac attgacaaat 1980 gcttattcgg tgttactagc ctagattttc ttggccacca cattgactca acaggtattt 2040 caccattgcc cratcgcatt ttggccctag agtcttttcc catacccacc actctcactc 2100 aacttcgtcg tttcattggt atcatcaact actatcggcg gttcatccct cattgtgcag 2160 atattttgca gccactgacc gatttacttg gttgcaaaga aaaatctgtt actttgccct 2220 ccgtcgccat tgccgctttc gaaagagcca agcaagccat tgctcacgcc acaaagttat 2280 ctttcctcga tactcacgaa agcacaaagc tgattttgac aactgatgct tcgaacgctg 2340 ccgtcggcgc cgtacttcat caagttgtta acaacgcatc tcaaccacta gcctttttct 2400 cgcagaagat gcaggctgca caaacgcgtt acagtacttt tggtcgtgaa ttactcgcaa 2460 tttaccttgc tattcgccat ttccggcact tgttagaagg tagatcgttc accattcaaa 2520 ccgaccataa gccacttacg tatgccttta acgccaaacc tgatcgatat tcaccacgcg 2580 agatacgcca tcttgactac atttcacagt tcactaccga cattcggtac actccaggtt 2640 cggacaatgt cgtcgcagac gcactatccc gtcccgacat caacgctctt cacccttcca 2700 agcagttgga cctagccaag ctggcgaacc ttcaacatag tgatcctaat tttttgccca 2760 ttctatcctt cccttcattt caagtctctt caatcccctt gcctctccaa tctggatcca 2820 ttttttgtga tatgtctacc ggctcagctc gcccaatcgt ccccgaagcc tttcgacgag 2880 ttgttttcga ccacttccat ggcttttctc atccaggaat cagagcaacc aggaagttga 2940 ttagtgcacg ctttgtatgg cccttcatga ataaggatct gacatcctgg gccaaacagt 3000 gcatcgcctg tcaacgcagc aaagtaactc ggcatacgaa ctcacctatc ggttcatttg 3060 cagttcctga tgctcgattc acacatgttc atctcgacat tgttggtcca ctttccccat 3120 caaacggttt cactcatatc cttaccatga tcgatcgttt tacccgttgg ccagtagctg 3180 tacccatttc ggatacttca gctgaaacgg tcgccttttc atttttgcag cattgggtga 3240 gtaactttgg cacaccttcc atcgttacca ccgatcgcgg ttcacagttt cagtgtaatc 3300 tcttccgtga gttttccaca ctattcggtt tccatcacat ttcgaccacc gcgtatcacc 3360 cctgctccaa cggtctggtt gagcgtttcc atcgctacct caaagctgca ttgactgccc 3420 aaatgaaccc ctcttcatgg tccttttctt tacccctcat tttgcttgct attcgctcca 3480 ctatcaagga ggatcttcac tgttcaccag ccgaactagt atacggtaca actttacgtt 3540 tacccggtga actggtttcc acttctggcg cacagcccga atcgcccgtc acattcgtca 3600 cccgtctgaa acagcatatg tccgaccttc gtgcgacacc gaccagacga tcaacgcgaa 3660 aagaacacat ctcaaccgat ctctcttcta caccatttgt atttgtccga catgacgcaa 3720 cgaggaaacc cctccaaccc tgttatgatg gccccttcaa ggttatcgaa cgtcactcca 3780 aataccttgt tttagagaga agcggaaagc acgacactgt ctccattgat cgcctgaagc 3840 ctgccttcat cgaagcaccc acgaccacat ccagcaatcc tgctaaccca gcaacacccg 3900 cacctaccca agataccccc gccgccaacc tcactccttc caccagtcgt tccggacgac 3960 gcgtccgttt ccctcagcac ttagctgatt acgaaacttg aagtgagcaa cttttcgatt 4020 aattacctta attattcgag cgttttcccc cttcacggtc aacgctctag tgggggagta 4080 // ID Gypsy-7_CQ-LTR repbase; DNA; INV; 166 BP. XX AC AAWU01000353; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_CQ_; KW Gypsy-7_CQ-I; Gypsy-7_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-166 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 394-394 (2011). XX DR GenBank; AAWU01000353; Positions 15277 15442. XX SQ Sequence 166 BP; 47 A; 44 C; 29 G; 46 T; 0 other; tgttatgata ttgccagacc cctcgtcatt acactaacac tcactcacac actcgcagca 60 ctggacgctt gagtcagtct ctctctagtt ctcgccgtgc taagatcgtc taaataaagc 120 tagtattaaa acgctaataa ccggtctttt gggagaactt agaaca 166 // ID TAS_I repbase; DNA; INV; 7128 BP. XX AC Z29712; XX DT 13-MAR-1998 (Rel. 3.02, Created) DT 13-MAR-1998 (Rel. 3.02, Last updated, Version 1) XX DE Retrovirus-like element TAS; an internal sequence. XX KW LTR Retrotransposon; Transposable Element; TAS; TAS_I; TAS_LTR; KW endogenous retrovirus; env; gag; pol; internal portion. XX OS Ascaris lumbricoides OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-7128 RA Felder H., Herzceg A., de Chastonay Y., Aeby P., Tobler H. RA and Muller F.; RT "Tas, a retrotransposon from the parasitic nematode Ascaris RT lumbricoides."; RL Gene 149(2), 219-225 (1994). XX RN [2] RP 1-7128 RA Aeby P., Spicher A., de Chastonay Y., Muller F. and Tobler H.; RT "Structure and genomic organization of proretrovirus-like RT elements partially eliminated from the somatic genome of Ascaris RT lumbricoides."; RL EMBO J 5(12), 3353-3360 (1986). XX RN [3] RP 1-7128 RA Heinz H.F.; RT "TAS_I."; RL Direct Submission to Genbank (04-JAN-1994)Felder H. F. Heinz, RL Institute of Zoology, University of Fribourg, Perolles, Fribourg, RL CH-1700, Switzerland. XX DR GenBank; Z29712; Positions 253 7380. XX CC TAS has a strong structural similarity with retroviral proviruses CC and related mobile elements. CC The element is flanked by long terminal repeats (reported in CC Repbase CC as TAS_LTR) and contains three distinct regions encoding CC putative proteins typical for retroid elements. The first region CC encodes a putative Gag protein including a 'Leu zipper', a CC nucleic CC acid binding motif, as well as an aspartic protease domain. CC The second region contains an incomplete ORF with sequence CC similarities to known retroviral reverse transcriptases (RT), CC ribonucleases H and integrases. A third ORF, which is located CC adjacent to the 3' LTR, might encode an env-like protein. CC Based on amino-acid sequence analysis of the RT domain, Tas falls CC into a new subgroup of LTR-containing retrotransposons [1,2]. CC Approximately 50 Tas copies are dispersed over approximately 20 CC different chromosomal sites. Their genomic distribution varies CC between individuals, indicating that TAS elements are mobile CC in the Ascaris genome [2]. XX SQ Sequence 7128 BP; 2239 A; 1674 C; 1556 G; 1659 T; 0 other; aaaacttggc gaccacggca ggatcagatt tttctccgga ttctgcaaat ttcggattct 60 gcgaaagtta cctctgcgac tagttcatac tctttgattc tgctgaagat attcgctgag 120 aaatggcccg acgcatgaaa gccgggctgg aacctatggt tcaagagtga aaacgatggt 180 tggaaggatc ccgccaactg aggacgatgt cgacataaca caggaaagtt cttttcaaat 240 taccgctttt ataaaacgta gagactgtct gcaccattgc ttgaaaaatg taatgcgcct 300 catttcgcaa attgaaaata agcacgaaaa gtggcttgaa ttattggaca ctttaagtga 360 cccgcgagca gcagaagcag agaataaggc gtacgaagac tacgctatgc agccagacag 420 ctttctctat ctatgcgagc aagctaaaac cattgctgat caactaaacg ctaggatctc 480 taatatcact gacgttctga agactttgca atcctcggtt tcgtcaccca ctccgctaat 540 cgcgcaagtg tcctcgaatt tacaagcaat ctctgctcct tcgcctccag aacccatagg 600 cagccacgtt gctctgcccc agatccaact cagaaaattt tctggaaacc ctaaagagtg 660 gactggattc tgggaaacat ttaattctgc tgttggccac ttgccaaaaa ttcagtgctt 720 gaactatctg ctttcactac ttgaaggccc agcagcaacc gtcgctcagg gctacgctat 780 gagtgaaagc agctttgatt tggtggtaac tgcgctaaaa tcaaattacg gaaacccagc 840 cgttataata gccagcttac acaccgaatt gcaaaacctt cctagtgaca atgggtccca 900 actggtggca actgtacagg cactggaaag aatattaaga caacttgagc aagcaggcga 960 aaatctcgac tcacctttgt tagtatttat aattgagcaa aagctctcgc agcgaataaa 1020 aagacgcatt agtgaataca aggcgttaga tccagaatgg aacgtcgaaa aattacgggc 1080 gaaacttcgc gaatttatga ccgcggagtt aatctacgca cagtcgcacg gaagagctca 1140 gcaagtgaaa gctgttagca cgccgaggca gccgcaaatc aaaacatctt cccttctcgc 1200 tcagcaaaaa caacaaaaaa agccgtcaaa accatgtgca ttctttggtg aatcacactg 1260 gaatcgcgac tgcccttcgt acagcaccac agagaaaaga attcagcgag tgaataaact 1320 tcaattgtgc acaaaatgtt tgaaaagggg acattcactg caagactgca aagcaatgca 1380 acattgctac tactgccata atcgcaatca taactcactc ctatgcatgg agaggtgcaa 1440 atcgcaagaa aaggggaaga atggtacaga ccagcccctt tcacctgaag aagtgattgc 1500 agcaaatatg gcaagtggaa ataaagtact gctattctgt aaagaagtag tggtctccaa 1560 ccctcaaaaa gcaaattgcc gaacgaaagc gttaatgttt tttgattcgg gcagtcaaaa 1620 gtcgtacata acaacgcgct tagcgaatcg gctacggcta aacacaacac cgcaaaagct 1680 gagcataagc acttttggtg attgtgcgcc gcaaattatc caatcaaatc gaactgttat 1740 ttccgttcaa cttcagaacg gtacccgcaa ggctgtcaga gtaagcacaa ttaagcgaat 1800 tactagcaac ctagaagtta ttggtgacaa cattgaagct cattttggtg acctgaagcc 1860 atcagagctg ccgcgaaaaa ttcgttctcc agatattctg atcggtcatg actatttcag 1920 cgactttatt ccgcttacag gaatcaaacg cctaggaaac ggctacgaat taattcccac 1980 taaagttggc tatattttat cgtccaaaca ggccgaaaag aaagaagcag ccccggcagt 2040 ggtgtgtatt gaaaaaaaaa atgccagatg acgctaggga agaaatggaa tttgcgaacc 2100 aacattcctc agaacaggtt gagcaaatct ccaactattg gaatgttgag ctccttggta 2160 ttcgagattc accaaccgag catgaagacc aacaagcaat ggacatgttc aaggaaacgg 2220 tgaagagaat cgacggtcgc tattcagtga gcctcccatt cagagactct gtgccgcaaa 2280 ttccacagaa ttttggccta gcatatggtc gattgctttc aacttggaag cgtttgcaga 2340 atgatcgcga gcttctcgaa aaatatgacg caatcttttc tcagcagaag gactgtggag 2400 tcatcgaaga agtacctcag gaggaactaa aacccacatt caatattgtc cactatttgc 2460 cacatcaagc agtaattaat ccccataaaa cgactacaaa gatttgagta gttttcgatg 2520 gttcttccaa atcgagagga actaaaagca tcaacgaagc aattatgaga ggaccagttt 2580 tactcccgaa tttaggtggt ttactgcttc gatttcgttc aatgcctatt gcaattatag 2640 gagatttaga aaaggctttt ctacaagtgg ctctcaataa accagatcga gatgcggctc 2700 gctttctatg gctgtctgat atcaactcat cgccatctca cagcaaccta taggtctttc 2760 gctttgcacg agtcccgttt ggactcaacg caagtccctt cctccccacg gcaacaatta 2820 aaacgcatct agattcgtac ggtgacgtag cattagccga agaaatttgg aagaacattt 2880 atgtggataa cgtgctgatc agtgcacgta cgacaaaaga ggccttcaac aaattcgaga 2940 gcataaagaa actattcgaa gacgcacgaa tgaacatacg cgaatttctg agcaacgacg 3000 caaccttcaa caaaacaatt ccggaaaggg accgtcacga agcaactaac gcgaaactgc 3060 taggactgcg atggaacact acagcagact atattcagtg gaaagtaagt ccaccgaagc 3120 agccgatgat gaccaaacgg caactattgt cctatatcgc tgctcaatac gatcccaccg 3180 gacacatccc tcctttgttc actccattaa aagtaattct acaagatcta ctgagaaaga 3240 atcaaaagtg ggacgagagg ttacccgaaa atataaccaa gcagattgac agcatctgcg 3300 gtgagtggaa tgaccgcaca attgaagtac cacgattcgt cataaagagg caagctgctc 3360 tgcaacttca ctgtttcacg gatgcatcga agacagcgta cgcttcagcc atttacatcc 3420 gaacagaaat gaattcagaa attgaatgtc atctactaat atgcaaaact cgacttgagc 3480 cactcaaagg atccaccatt cctcatctgg aacttatggg agctcttatt ggaacgaggc 3540 tgcttgaata cacctcacag cagctgcaac tggaaagcgc cgacaagtat atttggactg 3600 acagtcagtg cactttacaa cgaattcgct ccagtgacgt gcatcagaaa gatcgattcg 3660 tggaaaatcg cctaaaagaa attcggaaaa cggatgccag atttgaaaac ggatcgtccc 3720 ttcagattcc aatccggccg atattcgtac gagaggtagt gacctcgacg atctgatcaa 3780 caacaaaata tggtggcatg gaccacattg gctcctctat tctagtgagc agtggccaca 3840 attggtaaat cacactgcta ggtcagaaga caaagacagt gagcagtggc ccagaagacc 3900 acttagaaac agaagcaaca acagcgaata cgcacatggc gtgtgaaata tatgatcgtt 3960 catcgactat catcaacgct atgcgtttca gccaatggtc tcgtctactg cgaacgacag 4020 catgggtatt gcgtgcagtt gcctgcttca aaggaaattg ccccgaaaag ggatcgttaa 4080 aaggaaacga aatatctcga gctgaagaac tactgattaa gcaagctcaa agagaggaca 4140 tttgtctgca agagagagaa aaatggaacg ctcatctcga caaagatggc ttgtggaaat 4200 gcgaaggtag aatgaaatat gcttcatcac cactattaat ctacttgcct gcagacaacc 4260 gcataactca gttgcttatt ttgcacatcc acgaggagct tttgcacgct gggatctctt 4320 caattctcgc gaagatgcgt gaggcgtact ggatccctcg tggccgtcag gccataaaac 4380 gagctctcaa caattgcttt cattgccgac gatggaagtc taagcccttt cagatgcccc 4440 caatgccacc ctatccgccg gaacgagtat ctaagcatcc tcctttcgaa aatacaggtg 4500 tcgactatat aggaccattt acgatacgtt cgatgaaaag gatcacacaa aacgatggat 4560 ctgtttattc acttgtttct caactagagc tgtacatctc gaagtcgcct ctgacctatc 4620 cggaccaggt ttcattcagt gtttacgttc gctttgtttc cagaagagga cttcctaaga 4680 gaatgctttc ggataacggc acacaattcg tctgggctcg atctgtactc acctctgtgt 4740 ctaaagtgca ccagacggac aacgcaattc tcgactattg cgctgcacac aacattcagt 4800 ggtcatttat tactccactc gtgccatggc aaggtggcat ctatgagaga atggtcggtc 4860 ccgtaaaagg cagtatgaag aaaacaattg gatggaaaag gctgacgcaa gaagaacttc 4920 aaacactgac gacggatatc gaagcagtgg taaactgccg tccgattatc cccctcacat 4980 cggagaatac gacagttcta cgaccagtcg attttctgct gccacacggt aattcttcac 5040 cgggagtatt cagaagcctg gagaatgatg cacgagaacc cacatatcga gaccctagtg 5100 acatccacca aaagcttttg aaatattggg aagaaacgag agcaaaactt gataactttt 5160 ggacgatatg gcgagaagaa tatctcacga tgtgaaacaa cgatatgaaa caactcacaa 5220 acagccaatc agtactgaaa agagaccgcc tgaggtcaac gaaatagtgc tgctcaacga 5280 gtcaccatgt cctcgcggca cctggccgct cggagttatc gaacccctac atcccgaacc 5340 gtcaaaggtg agaacaacaa ctattcgtct ttcaaacggc aagaaggtgc aaagaacagt 5400 caatagtctc ttcccacttg aaattcgagc caaagcagat gcaataacag caacggcagc 5460 aacagtaatc tccacgactg ttatccgaga aacactcaat aaagcggtca caagacgaaa 5520 acatacccta cccttctacc tactgccttg cttccttatc tcggggttac taactttagt 5580 cgtcgggagc gccgcagcca gcggaactca aggtatacat ctaccaattc cagaaactca 5640 gaactgtaca gtagaacggt tcagccatct tacgacgaca agagtaagcg ttttcgaaag 5700 aaaattgaaa cgagtggaag ctatcagttg tttccagatc actcataaga actgcactcg 5760 agcgtttctc cgattcagta tgtacgaaac tcttgaagaa ccaattatat cagccgtcac 5820 agctcaagca tgtgaagaga ttagaaacac caagcgcttc agcgggctcc agctcgaaag 5880 gatcggtgca aaaaggtgga agagctctca accgtcaacc gcttcttacg ctatctttgg 5940 tgaaagatgt actgctactg tgaactacga gttggatgaa ggctacgtgt tgattggcga 6000 tgatgtgcat atcgattctt gtctggcaaa cgtcacggct tgcaaagcat ccaccggaaa 6060 gtgcatggct gatggtcgaa ttgttctatg ggataaaaaa atgctcgttg aacagtgcgc 6120 atggcaccga gtgggcacat atctcgcaca ggttacagag aagaggatca tcgttgacaa 6180 acttcaagct tctttcgttc cactcgaggc atttgacgaa agctggacgc aagcagaaca 6240 gtgtcaactt cagtcgtttg cacctatgca aaacggtatt ctcatcacat ttgatgactg 6300 gaaacaacag gatgtcaccg actggtaccg aaaggcaaga gttcacgcac cgaacatatc 6360 ctctgttgaa cttggaacaa gattcgatcg agagaacgcg aagtttcaat acgcttctga 6420 taggtggctg cagcagttta aactgcacct aaataagtat gccgaaacac aatgtcaaac 6480 gagaaacaat ttccttatcc ttattcggag catcagtcga agtgaccctt cgatggcagc 6540 taagttgttg cttcatcggg gcgacgtgtc ggcaatacga gaaggcgaca aactaattgt 6600 gtttccacat aacggaatca ttgaaaagag caacgaggga acccacggtg gggaaattga 6660 gaacgaactc gtattctggt cagccaccct tcaagacacg tcaatacaac aagctcaagc 6720 gcagatcgat ctactcaaat cacgaagtgc tcgttggggc ttgagagcaa aaccgtcaac 6780 gttctctgat agccgcacgg ctctctcaga cttcgggaac gaaattgaaa cgcttgcaga 6840 agaagctaac aacaatatcc aacatatctt tgaagcgttc aagtattggg aagctttact 6900 catcgttatt gcgatcatcg cagtatctat aataattact tttgcaacaa tcaaatgcat 6960 ggccgtcgtg aagaaacttc gaaagaaagg cgaaatatcc aacccgacag ttattttcat 7020 caaggaaaat gcagcggaaa caactgtaat cgaaagcctt cagcaacaca tcgatcggat 7080 gcggttcgaa tcagttatca ttgggcgcaa ggatcgtcgc ctcgggag 7128 // ID SAT_PL repbase; DNA; INV; 375 BP. XX AC U68053; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Planococcus lilacinus satellite sequence. XX KW SAT; Satellite; Simple Repeat; SAT_PL; satellite repeat. XX OS Planococcus lilacinus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Coccoidea; Pseudococcidae; Planococcus. XX RN [1] RA Khosla S., Augustus M. and Brahmachari V.; RT "Sex-specific organization of middle repetitive DNA sequences in RT the mealybug Planococcus lilacinus."; RL Nucleic Acids Res 27(18), 3745-3751 (1999). XX DR Genbank; U68053; Positions 1 375. XX CC Satellite sequence repeated 5-6 times. XX SQ Sequence 375 BP; 99 A; 102 C; 84 G; 90 T; 0 other; gggcgagcgc gaaagcgaac tgccgctcgt tttcgcttct gcgttcacgt cgtcgttgta 60 cgagtccatt tggttgcgta cactacacta cactacacta cactacagta gaccacaaaa 120 agtaggtaca accgctacgc aacgtccacg tccacgtcca cgtccacgtc cacgtccacc 180 aggtcagctg taggtataca ggtataggta aaagtatacg tattacgtac ctactcgtat 240 acgtatgtgt ctacgtatgt aggtagaggt agaggcgcct acctattcgc aagcgaaata 300 tagaaaacgt tgaccgaatg catcgggcat cgtcgtcccg catcatttcg ccatcaatat 360 atagctagct gatcc 375 // ID Tc1_Ele13 repbase; DNA; INV; 1725 BP. XX AC . XX DT 12-OCT-2010 (Rel. 15.1, Created) DT 12-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Mariner/Tc1 DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1_Ele13. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1725 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1725 RA Kojima K.K. and Jurka J.; RT "Mariner/Tc1-type DNA transposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (12-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. ~95% identical to consensus. CC This consensus is ~97% identical to the original sequence in [1]. CC TA TSDs. 405-bp TIRs. XX FH Key Location/Qualifiers FT CDS join(393..983,968..1480) FT /product="Tc1_Ele13_1p" FT /note="transposase." FT /translation="MPKKAQISISIRNLIIEHHKTGKSYRDIAGIVQLHKD FT TVARIVQMYKKRGSILPAKRSGRPSKTTGRIDRAIVKKAELDRGLSAPKIA FT KQIESEFSIQLTSQSVRNRLHKKGLKGRVALKKPWLSAKNIKKRLNWAKDH FT ISWTSDDWNKVIWSDETKVMLFGSDGITRKWRRTGESLKKECLRPTVKHGG FT GELLYQHVTLPTLIFQPTEHDFDFFSAGRVMVWGSMATSGVGELEFIDGIM FT TKEVYLDILKRKLPASAQKLKLRHRYTFQQDGDPKHTSKLVSQWFKEKRIK FT VMEWPAQSPDLNPIEHLWSILKIKVQEQKPTSIQQLRQVISSEWDKITPET FT TAKLVESMPRRCRAVIEAKGGHTKY" XX SQ Sequence 1725 BP; 543 A; 312 C; 369 G; 501 T; 0 other; tacagtatgc aaaaaaagtt tatccacccc ctccccccta ccaatttaga attcttttgt 60 ttttctggtg aaaatcatgt caatttaatg agtgtttctc catggtttat caatttatgt 120 ttatttatca cattgccatt aaaaaaatga cgtttatata tttttttgag attttatgtg 180 atgaattatg aaaaagggta aaaaataacg caaaaaaagt ttatccaccc ccaatcggat 240 actaagattt aagtgtgtat tacagagttt tagtgtgtat tctgcaactt tctcgaatgt 300 aaacattgtg tctagcatcc attttgtgct cttgaagttt tcgtgctcaa aatttcagtt 360 tcgtgttaaa attggaggtt ttttgttgca aaatgcccaa aaaagcacaa atttcgatca 420 gcatccgaaa tttgattatt gaacatcaca aaactggcaa aagctaccgt gatattgccg 480 gaattgtgca gctccacaaa gatacggtcg ctaggattgt gcagatgtac aaaaagcgtg 540 gttctatctt accagctaag cgatcgggac gtccttcaaa gacaactggc cgaattgaca 600 gagctatcgt gaagaaggcg gagctggacc ggggactatc agctccgaaa attgcgaaac 660 aaattgaaag cgagttttcc attcagctga catcacaatc cgtacgaaat cgtcttcata 720 agaaaggtct taagggcaga gtagcactta agaagccatg gttgtccgca aaaaatatca 780 aaaaaagatt gaattgggcc aaggaccaca tttcatggac cagtgatgac tggaataagg 840 tgatttggtc agatgagacg aaagtaatgc tatttggatc tgatggtatc acaagaaaat 900 ggcggcgaac cggtgaaagt ttgaaaaagg agtgtttgcg gcctacagtc aagcatggcg 960 gtggtgagtt actttaccaa cattgatttt tcaaccaact gagcatgact ttgatttttt 1020 ttctgcaggt agagtcatgg tgtggggatc tatggctaca tccggcgttg gagaacttga 1080 gtttatcgat ggaattatga ccaaggaggt ttacttggac attttgaagc gaaaactacc 1140 agcgtctgcc caaaagctta agttaaggca cagatatacg ttccaacaag acggagatcc 1200 gaaacatacc tcaaagctgg tttcgcagtg gttcaaggaa aagcgtatta aggttatgga 1260 gtggccagca cagtctcctg accttaatcc gatagaacat ctctggtcaa tcttgaaaat 1320 taaggtccag gagcaaaaac caacaagcat ccagcaattg cggcaggtaa tctcaagtga 1380 atgggacaaa atcaccccgg aaacaactgc taagcttgtt gagtcaatgc cccggcgatg 1440 ccgagcagtt attgaagcca aaggtggtca cacaaagtac taaaactatg tgtgtcagag 1500 gggtggataa actttttttg cgtcattttt taacttattt ttaacatcag ctctcaaatg 1560 caagagaatt tgttttttta catcgaactc gcacgaattg aacttgtttt tacctaaata 1620 ataagaaact gaacaacaaa tagccgtgtt ctaatttttt agtgggattt ttggaagaaa 1680 tctaattttc cactgggggt ggataaactt tttttgcata ctgta 1725 // ID Gypsy-254_AA-LTR repbase; DNA; INV; 144 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-254_AA_; KW Gypsy-254_AA-I; Gypsy-254_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-144 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1110-1110 (2011). XX DR [1] (Consensus) XX SQ Sequence 144 BP; 47 A; 21 C; 37 G; 39 T; 0 other; tgtagtggtg gagctagcag attaccgtac cttaagcagt gaacgttgtg tatatgatac 60 ctgaaaatat aaagagactt tgcaatagaa agtggtatta ttatctgagc gacgaacagt 120 gacggttgat atcgggacac taca 144 // ID RTE-1_NVi repbase; DNA; INV; 2812 BP. XX AC . XX DT 15-FEB-2009 (Rel. 14.02, Created) DT 15-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Non-ltr retrotransposon: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2812 RA Jurka J.; RT "LINE retrotransposons from the parasitic wasp Nasonia RT vitripennis."; RL Repbase Reports 9(2), 485-485 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 743..2806 FT /product="RTE-1_NVi_1p" FT /translation="MTNRIAYGGTSKKTVKEAANKVLGKKKKPKSKPWFDE FT ECELWFERRKKAKLDSLHNRSDRTVEEYSNVRKQAGAIYRNKKREYQKNLI FT RRIETNSKENNPREMYRGINAIRKGFRSRAQLMKDENGDLVTNDNELLSLW FT KNYFDKLLNVHENSEELGDEIHTAELHVEEPSYQEVEAAIKKLKNNKAAGN FT DSIPAELLKYGGVELTFKIYKLVCAIWKNETIPENWKESIIIPIFKKGDKT FT DCNNYRGISLLATCYKVLSNVIQARLTPFAEDIVGDYQCGFRRNRSTSDQM FT FTIRQLLEKKWEFCETIHQLFIDFKKAYDSIKRSKMYQILVLLGVPKKLVR FT LIQICLNGSTGKVRVGGNVSEPFTIRDGLKQGDGLSTVLFNLTLEYVVRKM FT QVSQLGATLNGTTQILGYADDLDILGDCRETVARNAEILIKAAEYTGLEVS FT ESKTKYMIVDKLGICRGEGDLRVGNFTFEKVSEFRYLGTTINDRNEINVEI FT NKRLHSGNACFYAVSNLLKSRLLSKNVKIRIYRTIILPVVLYGCETWALTK FT QADNRFRVFENKVLRKIYGPKKDEETGEWRRLHNDELHNLYASPNINRIIK FT SRRLGWAGHVARMGDDRTAARVMKGRPMVTRPLGRPRRRWEDNVKADLVEI FT GRVGVDRRGASWVGLTQDRAAWKACVDEAMNFRVPNAT*" XX SQ Sequence 2812 BP; 1000 A; 485 C; 722 G; 605 T; 0 other; atggcggagt actagcatcg ggtaacttca cgtacctgta tgggagcggg ggatccctag 60 gcaccggatt tctcgtaagc aaaagcatca tacattcggt taaaagtttc aaatccgtca 120 atgataggct ctcgtacatc atcatcgaag gtgaatggta tagatatgta tttattaatg 180 tacactgtcc tacggaggac aaagaagaag aagctaagga tctctattac gaaactttag 240 agcaggtaat cgaccagttc gcgtcttacg acacaagaat agtattaggc gatttcaatg 300 ctaaaatagg tagggaggaa atgtttaggc ctactatagg gaaggaaagc ctgcacgaag 360 ccagcaatga taacggtatt agggtcataa atttcgcggc agcaaaagat cttataatca 420 aaactacgtg ttttaagcac aaggacatac acaaagcaac gtggacatcg ccggacgggg 480 ccacacagaa ccaaattgat cattttctca ttgaaaaaag acgtaagggc ttatagaggg 540 gcagatagcg actcggacca cttcctagta gtagccaaat taagagctag attagtagcg 600 aatcaaaata gtaagcgaac aaacaaggta gaaagcttcg atattgaaaa gctacgagat 660 agaacagagc gaattaggta ccagatagaa attaataaca ggtttcaggc acttgaagaa 720 gcaaacacat cgccggaggg gaatgacgaa ccgaatagct tatgggggga catcgaaaaa 780 aacggtaaaa gaggccgcga acaaagtact gggtaaaaag aaaaagccaa agagcaaacc 840 atggtttgac gaagagtgcg aactctggtt tgaaaggcgc aaaaaggcta aattagatag 900 cttacacaat agaagcgata ggaccgtaga agagtattct aacgtaagga aacaggcggg 960 cgcgatctac agaaataaga agcgggagta tcaaaagaat cttattagga gaatagaaac 1020 taacagtaag gaaaataacc cccgcgaaat gtacagaggg attaacgcca tcagaaaggg 1080 ttttaggagt agagcgcaat tgatgaagga cgaaaacggg gacctcgtaa caaatgacaa 1140 cgaattactg tcgctgtgga aaaactattt cgataaatta ttaaacgtgc acgaaaatag 1200 cgaagaatta ggggacgaaa ttcacactgc tgaactccac gtggaggaac cgagctacca 1260 agaagtagag gccgcgatta aaaaactaaa aaacaacaaa gcagcgggaa atgactctat 1320 accagctgag ttactcaaat atgggggcgt cgagcttact tttaaaatct ataaactagt 1380 atgtgccatc tggaaaaatg aaacaatacc cgaaaattgg aaggaatcta tcattatacc 1440 gatttttaaa aagggggata agacagactg caataactat aggggtatct cacttttagc 1500 aacgtgctac aaagttctgt caaacgtaat acaagctaga ctcactccat tcgcggaaga 1560 tatagtagga gattatcagt gcggatttcg gcgcaacaga tcgacgagcg atcaaatgtt 1620 taccataaga cagttgttag agaaaaagtg ggaattttgc gaaaccatac accaactatt 1680 tatagatttt aaaaaagcgt acgattctat taagcgaagc aaaatgtatc aaattctagt 1740 acttctcggt gtaccaaaaa aactcgtgag attgattcaa atatgtctga acggaagcac 1800 gggaaaggtc cgagtaggcg gtaatgtatc agaacccttc acgatacgcg atggtttaaa 1860 acaaggggat gggctctcta cggtgctgtt caacttaacg ttagagtatg tcgttagaaa 1920 aatgcaggtt agccagctgg gcgcaacgct taatggaaca acgcagatac taggctacgc 1980 agatgatttg gatatactgg gggattgtag ggaaacggta gcaagaaacg cggaaatcct 2040 cataaaagcg gcggagtata cagggttaga agtgagtgaa tcaaaaacaa agtacatgat 2100 tgtggataaa ctaggcatct gcagagggga gggagatctc agagttggga attttacttt 2160 tgaaaaggtt agcgaattca ggtatctggg tacgaccata aatgatagaa acgagattaa 2220 tgtcgaaata aacaagagac tccattcggg taatgcttgc ttctacgccg tgagtaattt 2280 acttaagtcg aggctgttgt ctaaaaacgt taaaataaga atatacagga caataatact 2340 gccggtggtt ctgtacgggt gcgaaacgtg ggctctcact aagcaggcgg acaaccgttt 2400 tagggtattt gaaaataaag tcttgcgaaa aatatacggg ccgaagaaag atgaggaaac 2460 cggggaatgg aggagactac acaatgatga gttacacaat ctgtacgcgt caccaaatat 2520 taacagaata ataaaatcac gcagattggg atgggcaggg cacgtagcga gaatgggaga 2580 cgaccgtacg gcagcgcgtg tcatgaaggg caggccgatg gtaacgcgac ctctaggtag 2640 acctagacgc agatgggagg acaacgtaaa agcggatcta gtagaaatag gacgggtggg 2700 tgtcgatcgg aggggtgcat cttgggtggg gttgacacaa gatagggcag cgtggaaggc 2760 ttgcgtagat gaggcgatga actttcgagt tccaaatgcc acttaaaaaa aa 2812 // ID RTEX-12_BF repbase; DNA; INV; 3780 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-12_BF autonomous non-LTR retrotransposon - DE incomplete consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-10_BF; KW RTEX-12_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3780 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3780 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1729-1729 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The incomplete RTEX-12_BF CC consensus sequence contains only the second ORF. XX FH Key Location/Qualifiers FT CDS 251..3529 FT /product="RTEX-12_BF_2p" FT /note="AP endonuclease, RT." FT /translation="MPIQKVALKFGSWNIQGGLKRKSQDEDFISYFDDFDF FT LAMQETWLKPGDHLRLESFSYCSSIRNVKKSSRCRRSSGGVLLMFRNKFSK FT GVTKLPTRHPDFLWCKLDKIFFGLEQDLFVGCIYLSPENSTSVDTRNNTLF FT DILQEDICKFSIQGSILVLGDLNARTSSLQETFIDDDCLNQEFLDIKDIRH FT RSNCDKGTNRYGTQLVETCSASNLVILNGRTPGDFNGKLTCHKYNGSSLVD FT YCIVNYDFMPKVQYLKISDISVFSDHCLLTCSISTNYSPYVQQMKPSTHPL FT PTKYIWDDDSKDKFTEALQAPLVLNDLQNFLDKSFDNVDNAANEFAAIITR FT VADKSLKKIVHNNRKKKKKKKASKPWFTSSCASLRRKVNQLATLLSKYPWK FT LDIRHSYFSNLKLYKKKIKTEKRHFKEKQWQNLSKLSKVDPKKFWREMKLQ FT RQECSNREEVDNAISPKDWRNHFENLNNLAGSSQSDIKFHQYIAQERPDIC FT QTTNNQMTTEEHTDSIISSLEKLTPNPLDLEITDGEILESLRNLKGNKACG FT SDSIINEMLKYGDTKLILPLKKLFNTVLHSGNFPHQWSRSILVPIHKGGTS FT SNPSNYRGIAISSCVGKLFTFILSRRFQHFLENDNLLSPFQSGFRKNFRTS FT DNIFVLKTLIDKQTSEPGGKLYACFVDFKKAFDSVWRNGLFYKILSSGIGG FT NFFKIIQSMYSNVEYCVKTPNGLSDFFQSTCGVRQGCNLSPLLFNLYINDL FT PATLMSPLCDPVVLKDTPISSLLWADDLVLLSKSETGLKECLKRLDDFCSL FT WKLSINRDKTKIMIFSKHGRTSKQQQSTFRTQGHTIDITKFYTYLGIDITP FT SGSFSRATKQLRLKALRASFKLKSLLASNHNPSISLALDLFHSLVKPILLY FT CAEVLGTNIQCRDLIFNGVQEYPAENIREGIYNILSPILGSNIQLLKVTRL FT GKKNDVTSPRPVLVRFKRYSDKLNILYQSELLSNQNILLTHPKFSYEVSDL FT EHVLLNYCKILLQVPRSSVNAAVRGELGTFPLYIEAQSQLIKYWLRLQNLP FT NDRIVKKAYDFAVEQNRTGPYMYKTYCADTVSKMFG" XX SQ Sequence 3780 BP; 1285 A; 791 C; 639 G; 1065 T; 0 other; actaccaata tgtgaacaga cacatacccc aggtgacaat ctttcatctt cctaacatta 60 ttaagtaata gtacccattg atttatatca tgcacatatg ttgtggtctt ataccacttc 120 agtagtctag tcttgtatct attcactcta catacatatg taaccatgct tgcattgttg 180 tatacctcga cgtcactggt tcatccgctc tcctcctatt cagtggcgca cattcggttc 240 ttacctcaga atgccaatcc aaaaggtcgc gctgaaattc ggctcctgga atattcaggg 300 cggcctgaag cgtaagagcc aagatgaaga ttttatttct tatttcgatg attttgattt 360 tttagctatg caggaaacct ggctcaagcc aggggaccac cttagattag aatcgtttag 420 ttactgtagt agtattagaa atgttaaaaa gtcctctaga tgtagaagaa gttctggcgg 480 agtacttctc atgtttcgca acaaattttc aaaaggtgta acaaagctcc ccacgagaca 540 tcccgatttc ctttggtgta aactagacaa aattttcttt ggccttgaac aggatctatt 600 tgtaggatgt atttacctat ctccggaaaa ctcaacatct gtagacacaa gaaataacac 660 actatttgat attctgcaag aagatatttg caaattttcc atccaaggca gcatattagt 720 gctaggagac ttaaacgcaa gaacgtcatc tctacaggaa acatttatag atgatgactg 780 tttaaatcaa gaatttttag atattaaaga catacgacac agatcgaact gtgacaaagg 840 gaccaacaga tatggcactc aacttgtgga aacttgttca gcatccaacc ttgtcattct 900 aaacggccgt acacccggag attttaatgg aaaattgaca tgccacaaat ataacgggtc 960 aagtctagta gattactgta ttgtcaacta cgatttcatg ccaaaagttc aatatctgaa 1020 aataagtgat atttctgtat tctctgacca ttgcctatta acttgttcga tatccaccaa 1080 ttattcacct tacgtccaac aaatgaaacc ttcaacccac ccgctgccca caaaatacat 1140 ctgggatgat gactcgaaag ataaattcac agaagcactc caagcaccac tggtactaaa 1200 cgacttacaa aacttcttgg ataagtcctt tgataatgtg gacaatgctg ccaatgaatt 1260 tgcagcgata attactcgtg tagcagacaa atctctgaaa aaaatcgtcc acaataatcg 1320 taagaaaaag aaaaagaaaa aggctagcaa accatggttc acctcctcat gcgcttctct 1380 tagacgtaaa gttaaccaat tggccactct cctatccaaa tacccatgga aacttgacat 1440 tcgacattct tatttttcta acctcaaatt atacaaaaag aaaattaaga cagaaaaaag 1500 gcactttaaa gaaaaacaat ggcaaaatct tagcaaactt agtaaagtag atcccaaaaa 1560 attttggcgg gaaatgaaat tacaaagaca ggaatgtagc aatagagagg aagtagataa 1620 tgccattagc cccaaagatt ggagaaacca ctttgaaaac ctcaataact tagcgggaag 1680 tagtcagtca gatattaaat tccaccaata catagcccaa gagagacctg atatctgtca 1740 gacaacaaac aatcagatga caacagaaga gcacacagac agcataatta gtagccttga 1800 aaagttaaca ccaaacccat tagatctcga aattacagac ggcgaaatac tagagagttt 1860 acgaaatctt aaaggcaaca aagcttgtgg ttcagactca atcataaatg aaatgttaaa 1920 atatggagat accaaactca tattaccctt gaaaaaactc tttaacactg ttcttcattc 1980 cggcaatttt ccacatcaat ggagtcgaag catcttggtt ccaattcata aaggcggtac 2040 ttcatccaac ccgtctaatt atagaggaat agccatatcc agttgtgtcg gtaaattatt 2100 cacatttata ttaagcagac gttttcaaca cttcctagag aatgacaact tactatcccc 2160 ttttcagtct ggctttcgta aaaacttccg caccagtgac aacatatttg tactcaaaac 2220 gctaattgac aagcagacat ctgaacccgg gggtaaactg tacgcctgct ttgttgactt 2280 caagaaagcg tttgattctg tttggagaaa tgggctattt tataaaatct tatcttctgg 2340 tatagggggg aatttcttta aaatcatcca atccatgtat tctaacgtag aatactgcgt 2400 caaaacgcca aatggactct cagatttctt tcagtcgaca tgtggtgtgc gccaaggatg 2460 taatttaagt ccgctcttgt tcaaccttta tataaacgat ctcccagcca ctttaatgtc 2520 acctttatgt gatcccgtcg tcctaaaaga cacccccata agtagtcttt tatgggcaga 2580 cgaccttgta ctgttgtcaa aatcagaaac aggcctcaaa gaatgtctca aaagattaga 2640 tgacttttgc tctctttgga aactatcaat taacagagat aagactaaaa tcatgatttt 2700 ctcaaaacat ggcaggacaa gtaagcaaca gcaatcaact ttccgaacac aagggcacac 2760 aatagacatc actaaatttt atacttactt aggcatagat atcaccccat cagggtcatt 2820 ctcacgggct actaaacagc tgcgactaaa agctctcagg gcctccttta aactcaaatc 2880 cttactagca tcaaaccata acccatcaat ttcccttgcc cttgatctgt tccatagttt 2940 agtcaaacct atattacttt actgtgctga ggtactaggg acaaacatcc aatgtagaga 3000 cttgatcttt aatggagtac aggaatatcc tgctgaaaat atacgggaag gaatttataa 3060 catcctttct ccgatattag gttcaaatat acaactccta aaggttacac gattaggaaa 3120 gaagaacgat gtgacatcgc ctcgtcctgt tctagtccgg tttaagagat atagtgacaa 3180 attaaatatt ctatatcaat cggaattact atcgaatcaa aatattttat tgacacatcc 3240 caaattctca tatgaagtgt ctgatctaga gcatgttctt ctcaattact gtaaaatcct 3300 acttcaagtt cccagaagtt ccgttaatgc cgccgtaagg ggtgaactcg gaacatttcc 3360 attatacatt gaagcccaat ctcaacttat taaatattgg cttagactac aaaaccttcc 3420 aaacgataga atagttaaaa aagcttacga ttttgctgtt gaacagaaca ggactgggcc 3480 gtacatgtac aagacatatt gtgcagacac ggtttccaaa atgtttggtt aaaccctgca 3540 gttgacgcca aacattttgg aaacgtgttc aaggaacgtc tgaaagacac gtttgtttct 3600 ggctggaggc aagagataag aaattatagc aaattgcaaa cctattctaa attcaagacc 3660 aacttcactc tcgaaaagta cctttcatca attcaaaata aatcatttgt tgctaacata 3720 acaaaattgc gaattagtgc acattgtttg gagatagaaa aaggtcgata taataatata 3780 // ID Gypsy-5_OD-I repbase; DNA; INV; 6046 BP. XX AC CABV01000629; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_OD_; KW Gypsy-5_OD-LTR; Gypsy-5_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6046 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000629; Positions 26826 20781. XX CC Positions [4317-4796] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 645..5717 FT /product="Gypsy-5_OD-I_1p" FT /translation="MSSISDPTSIYEAGVKISLLLEMNRSLEDQILADPGK FT ETSLRPLIASNISEKDRINGMMGNLCKSEINKSQSFHQPQANADYREEAFI FT ATTFSRFMVDTPKLDGSSHAAFTQFLGRLKIFHRAHVTGHASRLRPFINCV FT YAHLDITIERQIGTELLASDSWEVAKSLLEKNFGDKSHFYLSLSEVWDIEF FT DNEKPVAHFAAEIAQKMIEAKGAISAAFQAHRQKELEAADVFEVIEAMLLY FT IQLRARLPEVYNQISPSLSNVFNIQEMARLAQQTIQNRVDTSVSVNFQRKK FT GGQWHKKKRKKTSDAQDKPKQDDMKQMKTNIDDDLEPFRINQLKKSGPGIF FT YFASASIITDSKRFDVMMEIDSGASTSTIAVDCVPEPILKLLKPCPFTISG FT FDPTTAPKRPLGLLECDLVFNDGPKLKVKLVVTPKGFPNLLGRDILNHQAI FT NKFAIDNVQRLLTLKFNNSFTETDSQQIRLKSITDSRVKCFRVTYNERSKA FT TIISAEASSKPPLPALSPPNVDQCPVKQVMKELNIKFPEDSELSHQVQVAK FT LILEFRDIFGADGKPLGEFPFEAEIRTNGQTRAIPQYSVPQAFHEAITKQV FT SQMLQDGVVKEIQDPKGWNSPLMPVLKKDKSMRLCVNFKNTVNRCLSDKND FT QYHLPSMDAVISQLGSGNCFFSTCDLRSGYWQLRLKESDQVKTSFTWGNKT FT YCFRRLPFGYKNSGNIFSRQVNLMLDSSPSRACTHSYIDDVIVSQSTFPAY FT LKALRDFFKALRKYGARLKSTKCTFLEKSARFLGRVITQNGVRPDPDNLSA FT LRNMSPPRNLKELTSLIGSLNWVRSFLETRMGEKIATGCFSHRMQPLNQVR FT NSANETSIFKWTKEADNALRELQLRLASAPTISFPNYKETFHLYTDASHFA FT AGGVLIQEYENRAHLVAAISHTFTKAERKWNVSEKETFSILWCCERLEMIL FT KGRHFYLHTDHRSLVYLHNKRFKNSKISRWQSRLEEFDFTIIFIKGSENVF FT ADFMSRRPGVDDPTLDLATDFPVGKEYETKPGSKFRVFIPFWSENQFPDKI FT ELSRVKIHQLKVSAFSGPQEIPVVSANYDNLLSYQMEDFALSSIIRHLESS FT KEPIQIANLLKNDHRASVLMKHQKCLFVDHVTGALMVHMKTGARAIVPESV FT KSHFMYVAHDLSAHGGIPRTLERLSDLWWPDIAADVENYVSSCIPCLRRKG FT ATGKRTKPPLGQLYRGTAPGDSLSIDFAHMPNVKGFKYYLIVVCNFSRYVW FT CIPTRTDTAVACANALIKHVFLPFDFMPTHLHSDHGTHFLNSTVDALCKSN FT GIRHSISTAFHPESNSFCERNNRTVKNALFCTQNMKDGRKWVDALPHVMRC FT LNSLANKSTKITPRECWFGRKSTFTHKNGEEMSAESPRVYGLNLKSLAIAV FT EKAVKISMDAADRALELRKKNQPSPRPIPPGSCVYVKREMLSDPKRKSEKW FT IGPLVEDASGSRDIVHREHTCHASKRLDNLKFIEEFESEFIFPNLTANVKS FT KVRQTDTPQPGFSPNSARQGLSSQGERSSTPLKRGQVQTTNPDISPVRITQ FT PVSTIPDNDVTMRTDDSVISAGIGAEVTLFDTANSESILAEISDIRPRDAA FT ELEITPQNVALPKRPTPVKDKRNQSQKRPRENPPSPNSTIIEMQTKAPRTS FT SRSRTQVQPMNIKNSRSKSYFN" XX SQ Sequence 6046 BP; 1776 A; 1583 C; 1265 G; 1422 T; 0 other; gacccttata gtggtgtcta gaagaccctt ttccggtaaa aaaataatat tatctgaaat 60 tctccgacga agccgatttc tgccaatcag aatcgattaa acgaatattc tggagaattc 120 gctcgtctcc gccgctaaaa ataaatcgcc gccgagagag taattccaaa aatcttcgaa 180 ttttttgccg agaaaaaaaa aaccgttgac gccgcactcg ccgcgccgac cgttatcgcc 240 gcgagaaatt cgacgcatcc acgcgccaca caaacagaaa aacacctcac tccctcatac 300 acctccaaca cctcgctagc gccaccacgc ccgccgcgca cggcccgccg ccggaaagtg 360 acgcactgca ttcaacctac gcacgccgcg cgaaacagaa acagacaaac agaaatacga 420 aacatcgcac accggcaaat aattttcgtc cgacgatttc acgaagagaa tgacttcgac 480 gatactcacc tagagaaata cgcacactac aacaagaatc aagatcccgc gcgcaaatcg 540 gcacctcagc gtcaagtatt tcaaaccaaa cggttcgaat atttctgagg gccgtaaagg 600 taaatttact tttcgcctga gccctgattc ttgcactttt gagcatgtcg tccatatccg 660 acccaacttc catctacgaa gcaggagtga agatttcgct gctgttagaa atgaacagat 720 cattggaaga tcaaatctta gccgaccctg ggaaagaaac atctctgcgt ccgctgattg 780 cttcgaatat cagtgaaaag gatcggataa acgggatgat gggaaattta tgtaaatctg 840 aaattaacaa atcccagtca ttccatcaac cacaggccaa cgccgactac cgcgaagaag 900 cttttattgc gactacgttc agtaggttca tggtcgatac tccaaaactg gatggcagct 960 ctcatgccgc tttcacacaa ttccttggcc gactaaaaat atttcaccga gctcacgtta 1020 ccggtcacgc ctcacgactt cgaccgttta taaactgcgt ttacgctcat ctggacatca 1080 ctattgaacg acaaattgga acggagttgc tagccagcga ctcctgggaa gttgcgaagt 1140 ctcttttgga aaagaacttc ggagataaat cacatttcta tctttcactc tccgaagttt 1200 gggacataga attcgacaac gaaaaaccag tcgcccattt cgcggccgag atcgctcaaa 1260 aaatgatcga agccaagggt gctatctccg ccgcttttca agctcatcga cagaaagaac 1320 tcgaagccgc ggacgttttc gaagtcatcg aagcgatgct actatatatc cagctacgcg 1380 cacgcctacc agaagtatat aatcagatct cgccttcgct ctccaacgtc ttcaacattc 1440 aagagatggc ccgccttgca caacaaacga ttcaaaacag agtcgacact tctgtttccg 1500 tcaactttca gagaaagaag ggcggtcaat ggcataagaa gaaacgcaag aagacaagcg 1560 acgctcaaga taaaccgaag caagacgaca tgaaacaaat gaagacgaat attgacgacg 1620 atctggaacc cttccgaatt aatcaactta agaagtcggg tcctggaatc ttctactttg 1680 cttctgcttc tatcataacg gactctaaac gttttgacgt tatgatggaa atagatagcg 1740 gagcgtcaac ttcaacgatt gccgtcgatt gtgtacctga accgatacta aagttactga 1800 agccctgccc gttcacgatt tccggctttg acccgacgac ggccccgaag agaccccttg 1860 gtctcctcga atgcgactta gttttcaacg acggcccgaa attaaaagtc aagctggtgg 1920 ttacccccaa gggtttccca aatctccttg gcagagacat tctcaaccat caagcgataa 1980 ataaattcgc cattgacaat gttcagcgac tcctaacttt gaaattcaac aattctttta 2040 cggaaacgga ctcccagcag atcaggttga aaagtataac cgacagcagg gtcaaatgct 2100 tccgcgtgac atacaacgag cgatcaaaag caacgattat atccgccgaa gcaagttcga 2160 aacccccgct tccagcactt agccctccca atgtcgacca gtgcccagtg aagcaagtaa 2220 tgaaagagct caacatcaaa tttccagaag acagcgagct gtcccaccag gtacaagtgg 2280 ccaagctgat tctagaattc cgcgacattt tcggggcgga tggtaaaccc ctgggagaat 2340 ttccgtttga ggcggaaatt agaactaatg gacagactcg ggcaattcct caatacagcg 2400 taccgcaagc gttccacgag gccataacca aacaagttag tcagatgtta caggacgggg 2460 tagtcaaaga gattcaagac cccaagggat ggaacagtcc actcatgccc gtccttaaaa 2520 aagacaagtc gatgcgcctg tgcgtcaact ttaagaacac agttaaccga tgcttgagcg 2580 acaaaaacga ccagtatcat ctaccttcga tggatgctgt catctctcag ctgggcagcg 2640 gaaactgttt cttcagcacc tgcgacttac gatccggtta ttggcaactt cgtttaaaag 2700 agtcggatca agtaaaaacg tcattcactt ggggaaacaa gacgtactgc ttccgacggt 2760 tgccatttgg atacaaaaac tccggtaaca tcttttcacg ccaggtaaac ttgatgctgg 2820 actcgagccc gtcccgtgct tgtacacatt cgtatatcga cgacgtaatt gtaagtcaat 2880 ccacgttccc cgcctatctc aaggcgctcc gcgacttctt caaagccctt cgaaaatacg 2940 gggcacgtct aaaaagtaca aaatgtacat tcctcgaaaa atcggctagg ttcttaggcc 3000 gagtcataac gcagaacgga gtccggccgg atcctgataa cctgtcagct ctgcgaaata 3060 tgagtccccc acgaaaccta aaggaactta cgtcattaat tggttccctt aactgggtac 3120 ggtcatttct agaaacaagg atgggggaga aaattgcgac aggctgtttt tctcatcgca 3180 tgcagcccct caatcaagtt cgaaactcgg cgaacgaaac ctcaatcttc aaatggacaa 3240 aagaagctga taacgcgctg cgcgagctac agctgcgatt agcgagtgcg ccaacgatct 3300 ccttcccaaa ctacaaggag acatttcacc tgtatacaga tgccagccac ttcgctgccg 3360 gcggggtact tatccaggaa tatgaaaatc gggcgcattt agtggccgct atcagtcaca 3420 cgttcacaaa agctgagcgt aaatggaacg taagtgaaaa ggagactttt tcgattttgt 3480 ggtgctgtga acgcctcgag atgatattaa aagggcgtca tttctacctc cacaccgatc 3540 atcgatcact ggtctattta cacaacaaac gcttcaagaa ctcgaaaatt tctcgctggc 3600 aatcgagact tgaagagttc gatttcacca tcattttcat aaaaggcagt gagaacgttt 3660 tcgctgactt tatgtcgcga agaccaggag tcgacgatcc gaccctggac ttggcgacag 3720 actttccggt tggaaaggag tacgagacga agcccggatc aaaattccgt gttttcatcc 3780 cattttggtc ggaaaaccag tttccggaca aaatcgagct atcccgtgtc aaaatccacc 3840 agctgaaagt ttcagcattt tctggccccc aggaaattcc agtcgtctct gccaattatg 3900 ataatctact ctcttaccaa atggaagatt tcgctttatc ttccattata cgtcacttgg 3960 agtcatctaa ggagccgatc cagatcgcca atctactcaa aaacgaccac cgtgcttccg 4020 ttttaatgaa gcatcagaag tgcctcttcg tagatcacgt aacaggcgct ctgatggttc 4080 atatgaaaac cggcgcacgt gcgattgtgc ccgaaagtgt aaaatcgcac ttcatgtacg 4140 tagcacacga tttgtctgcg catggaggta ttccccgaac gctagaaaga ttatccgacc 4200 tctggtggcc agatatagcc gccgacgtcg aaaactacgt ctcttcttgt ataccatgtt 4260 tacgacggaa aggtgctact ggaaagcgca caaaaccccc acttggtcaa ctttaccgag 4320 gaactgcccc aggcgacagc ctaagtatcg attttgccca catgccaaac gtcaaaggat 4380 tcaaatatta tctcatcgtc gtgtgcaatt tttcaagata cgtctggtgc atccccactc 4440 gcactgatac ggccgtggca tgcgctaatg ccctgatcaa acatgtcttc ctccctttcg 4500 attttatgcc aactcacctt cattcagacc acggaactca ttttttaaat tctacagtgg 4560 atgcgctctg caaatctaac ggaattcgtc attctatttc gaccgccttt cacccagaat 4620 caaattcttt ttgcgaacgg aacaatcgca cagtcaaaaa cgcgctgttc tgcactcaaa 4680 atatgaaaga cggccggaaa tgggttgacg cgttacctca tgtgatgcga tgcttaaaca 4740 gtctcgcaaa caaatccacg aaaattaccc cgcgcgaatg ttggttcggg cgtaaaagta 4800 catttacgca caaaaacggc gaagaaatga gcgccgaaag cccccgcgta tatggtctta 4860 acctaaaaag tcttgccatc gctgtcgaga aagctgtcaa aatctcgatg gacgctgcag 4920 atagagcact ggaactccga aagaagaacc agccatctcc gagaccaatt ccgcccggct 4980 cttgtgtcta tgtcaagaga gaaatgctct ccgacccgaa acgtaagtca gaaaaatgga 5040 taggccccct tgtggaagat gcaagcggtt ctcgcgacat cgttcaccgc gaacacacat 5100 gccatgcttc taaacgactt gataatttga agtttatcga agaattcgaa tcagaattca 5160 ttttccctaa tcttaccgcc aacgtaaagt caaaagtacg ccaaacggat accccacagc 5220 caggtttttc tcctaactca gctcgacaag gtttgtcgtc tcagggggag agaagctcaa 5280 ctccgcttaa acgtggtcag gtacagacaa ctaaccccga tatttcgcca gttagaatta 5340 ctcaacctgt atcaacgatt cccgataacg acgttacgat gcgaactgac gatagcgtta 5400 tatccgctgg tattggagcg gaagtgacac tcttcgatac ggcgaactca gaatctatac 5460 tcgccgaaat ttcagatatc aggccgcgtg atgcggcaga attagaaatt acgcctcaga 5520 atgtcgcatt accaaaaaga ccaactccag tgaaagacaa gagaaatcaa tctcaaaagc 5580 gccctcgaga aaatccgccg tcgccaaata gtaccattat cgaaatgcag acgaaggcgc 5640 cgcggacatc atctcgctcg agaactcaag ttcaaccgat gaacatcaaa aactcccgca 5700 gcaagagtta cttcaattag gcgtatactg ctgccccttc aggtcctcac agagcttgct 5760 ctgtctcttc gaccggggcc tgaaaaggtt aggattacag aactttctca actgccgacc 5820 atatttgagc atcaaccacg atccagtcga ttaccgctcg ccgaaacatc ctatgaagag 5880 ctactcatca tatttcagcc tcgattcatg cttttttatc ccacagggtt ttttattgca 5940 gaatcaactc tgaaatgtag cgctccattt ttgctgttca aattgtgtcg attattcctg 6000 ccttatcata tttacgtgtt ctaagtctca ggggaagtat agggtt 6046 // ID Gypsy-1_DGri-I repbase; DNA; INV; 5609 BP. XX AC scaffold_15203; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DGri_; KW Gypsy-1_DGri-LTR; Gypsy-1_DGri-I. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-5609 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_15203; Positions 1877810 1883418. XX CC Positions [4698-5174] - Integrase core CC 'AGCT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 711..2042 FT /product="Gypsy-1_DGri-I_2p" FT /translation="MAVRRSAEDLRSATPIEPGLCTICNDTRSDSEFVETP FT CKHKFHRWCLLEWFLSSESCPVCRQYCSKIMIQIRESTTTDTNTPSGKDQT FT SITKASSGAIPKTGVNTRSTSRNKPNTNVRPNPNLNTKRDLPRPKVTSESN FT IISEERIQQLIANSLDTFKASMTAAMTEQLSAAVRSLSLSTQPRETRDEGH FT LDWENDIVPPRPRLSNSSNSRNNSNQRPGNCYASANVGDGVVEHPERISNV FT ISNWRIKFSGLANDIPIEDFIYRVNCLTSQCLNGNFHLLVQFAHLLFTGPA FT LSFYWRVHRSAVHLNWFELCNRLQERYKDQRTDRDIKDAIRRRKQGNNESF FT DDFLDAILSIADSLREPMLDTDLTIEVRHNLKTELKHELLHVETRDLASLR FT KECHRHEDFFRSLNSRQQPRVNVNRRLLFSTNKFQELNRKRKLMQLMKSV" FT CDS 2440..3723 FT /product="Gypsy-1_DGri-I_1p" FT /translation="MRKFWQSKRALNRCLINSIVNKSNDIRPFINIDIFGQ FT DYLALLDSGANKSVIGGYLAKRVLSENFNVDKSKGVVRTADGQTHPVAGTI FT TIPLTYNSTQADLEFLIVSSIRTDVICGMDFWKKFGISISSFGDINVSAIE FT TEKLEDDSTKLDLSSFQKSKLKAVIDFFPSFENNGLGITTLIEHTIDTAEA FT KPIKQRFYPLSPAKEELLCKEIDRMIQMDVIEEAPSSPWSSPITLHIKPGK FT VRFCLDARKLNAVTIKDAYPIPIMAGLLSRLPPVYCISKIDLKDAFWQICL FT DKESRAKTAFTVPNRPLYQFKRMPFGLTNAPQTMSRLMDLVIPYQLKSNVL FT VYLDDLLVLSDSFESHLLHLSEVATQLRKAGLTINVQKSQFCLRKVEYLGY FT IVGEGTLQVNPEKVRPLKSFRFRKLGSSSDDSWE" FT CDS 3723..5528 FT /product="Gypsy-1_DGri-I_3p" FT /translation="MTGWYQRFISRYSSIIFPLTELLGKNLFTWNDQAQTA FT FEEIKMALCSAPCLVHPQYDKPFIVQCDASSYAVGAVLAQCDESGHERPIA FT YMSKKLNKAQRNNTVTELECLAVVLGIRKFRMYIDGHRFTVVTDHASLRWL FT MDQPDLSGRLARWAISLQGYGFSIEHRKGSQNVVADALSRSFESVVEVASL FT NAEVLPEVDLQSEAFQSAEYCALRDKFKVSKMPDYQVIDGYIYHRKDFSDD FT SSYQGNDSWKLLVPESLRNSVICSAHDQPSSAHCGTAKCLERIRRHLYWPR FT MVVDVRNYISDCDTCKITKYPNRSLKPPMGARITSERPFQRLYFDFVGPFP FT RTRKGHIGIFIILDHFSKFTFLKPVKKFTSNVIIDYLSENIFPCFGVPEIV FT VSDNGTQFKSRVFDDFLLTFGIQHIFTGAYSPQSNAAERVNRSINAALRAY FT IRVDQREWDTFLSRINSSLRNSIHQSIGYSPYFMVFGQHMISHGKDYKLLR FT NLELLKDSTARLPRADEFQKFRADVDKHLKKAYDRNQRSYNLRSQTRAFEI FT GQEVIKRNFSLSKAAANFCSIGTKARIKGKLGRSIYLLEDMNGKELGNFHA FT KDIW" XX SQ Sequence 5609 BP; 1663 A; 1170 C; 1129 G; 1647 T; 0 other; caataattgg cgcccaacgt ggggcccgac cgattggtcg tttccgaaca aatcccgacc 60 gattggtttt ttttcactgt atgtatcact gctaggataa tctttattat cctaagttaa 120 ggtgaacacc ccgcgcttcc caaattacct ttctacattc cttttctcga aacaatattc 180 cacccatatt atattttgca attaaatccg tacggtactt tctgtttact tcggaaatgc 240 tggaattctg attattgtcg gagccagacc agtagacaga gtagttccag aaagttgatt 300 tcattttaga gtctctttct aacgaaagaa aatttctaga catttaatca attcttttgt 360 ttaatctcga attgcgttgg aattttagtt ctgatttcag tcggtcctaa accaggcaat 420 cgatatttaa ataataattt atattccgga cctaaattga gatttgctca gactggagtc 480 tttcggcctg ttcctcttta gattgggtaa tatacactct tatatatttt atagtaaatt 540 ctgtgtacga gtctgatatg tttatacgcc gtcaattata cctgcggtgt cagacctttg 600 gataagtcct tattaattat tcagcgttta tttcgatata gtaaatttcg tcactttata 660 tatctcactt ttccgaccac aaaaccagtt taccgacatt ttattttgct atggctgtca 720 gacgtagcgc agaagattta agatccgcaa cccctattga acctggctta tgtacgattt 780 gtaatgatac caggtcagat agtgaatttg tagaaacacc ctgtaaacac aagtttcaca 840 gatggtgtct attggagtgg tttttgagca gtgaatcatg ccccgtgtgt aggcaatact 900 gttccaaaat catgattcaa ataagagaaa gcactacgac tgataccaat accccgtctg 960 gtaaggacca gactagtatc acgaaagcga gttctggagc gattcctaaa actggtgtaa 1020 acaccagatc gacttcacgc aataaaccga atacgaacgt ccgaccaaat ccaaatttaa 1080 acacaaaacg tgatttgccc cgacctaagg ttacatccga gtctaatatt atttcggaag 1140 agcgtatcca acaattgatt gcgaattctt tagatacctt taaagcgagt atgactgcag 1200 ccatgacgga gcagttaagt gcagccgttc gaagtttgtc attgtctacc caacctcggg 1260 aaactcgtga tgagggtcat ctcgattggg agaatgacat agttccaccc cgtccacgtt 1320 taagtaattc cagcaatagt cgaaataaca gtaatcagag acctggaaat tgttacgcta 1380 gcgcaaacgt aggggatgga gtagttgaac atcccgaaag aatttcgaat gttatttcca 1440 attggcgaat caagtttagc ggattagcta atgacattcc aattgaagat tttatatatc 1500 gagtaaactg tttaacctct caatgtttga atgggaattt ccatctactg gttcagtttg 1560 ctcatctcct tttcactgga ccagccttat ctttctactg gcgcgttcat agaagtgcag 1620 ttcacttgaa ttggtttgaa ctgtgtaacc gcttgcaaga gagatataag gatcagcgaa 1680 ctgatcgcga catcaaagat gcaattcgtc gtcgcaaaca aggcaacaac gaaagctttg 1740 atgatttttt agatgcaatt ctgtcgattg ccgattcctt gcgtgagcct atgttggaca 1800 cagacctaac gatcgaagtc agacataatc tcaaaactga actaaagcac gaattgttac 1860 atgtcgaaac tcgtgattta gcaagtctgc gaaaggaatg tcatagacat gaggatttct 1920 ttcgtagtct taattcaagg cagcagcctc gagtaaatgt gaaccgacga ttactgttct 1980 ccacgaacaa attccaggag ttgaatcgga agaggaaatt aatgcaactg atgaaatctg 2040 tgtagttcga tcccccgaaa aatgtatatg ctggaattgc gaggaacagg gtcacagata 2100 ccaagattgc ctgaagcatc gtcggatttt ttgttacggt tgtggcacac ctgagacata 2160 caagcccaac tgtcaaaaat gtaatcccaa atcggaaaac tcgacgaagg atgtccgccg 2220 tgttccgaaa ggggatgtcc gatattagcc tcggatccac aaaatgttca aacagctatt 2280 tctaaacaac taccagacca atcaccttta accccggaat tacctagtct cttgcgattt 2340 aagccatacc atgtgcgagt tcatgagtac gagaagcgta cgtccgaaat atttaacgcg 2400 gatgaattac tttcatgtaa atctcgttct tctttacgta tgcgaaaatt ttggcaaagc 2460 aagcgagcac taaacagatg tttaattaac tctattgtta ataaaagcaa tgatattcga 2520 ccttttatca acattgacat ttttggtcag gattacctag ccttattaga tagcggggcc 2580 aacaaaagtg tcattggcgg atatcttgcc aagagagttt tgtcagaaaa ctttaatgtt 2640 gacaaaagca aaggcgttgt tcgcacggcc gatggccaaa cacatcctgt tgctggtaca 2700 attaccatac ccctgactta taattcaacc caagctgacc ttgagtttct tatagtttct 2760 tccattagaa cagatgttat ttgtggtatg gatttctgga agaaatttgg tatatcgatt 2820 tcatcttttg gcgatattaa tgtttcggcg atagaaactg aaaagctcga agatgactct 2880 actaaattgg atttatcgtc attccagaaa tccaaactaa aagctgtgat tgatttcttt 2940 ccatcttttg aaaataatgg acttggcata acaacgttaa tcgaacatac aatcgatact 3000 gccgaggcga aacccatcaa acagagattt tatcctcttt ctccagcaaa agaagaactc 3060 ttatgcaagg agattgacag gatgattcaa atggacgtta tagaagaggc cccaagttct 3120 ccttggtcat ctcccatcac cttacacata aaaccaggca aagtgcgctt ctgtctcgac 3180 gcacgaaaac ttaatgcagt cacaattaag gatgcgtatc ctattccaat aatggctgga 3240 ctgctgagtc gtcttccgcc tgtatactgc atttcaaaaa tcgacctcaa ggatgccttt 3300 tggcaaatct gtttagataa agaatcccgt gctaaaacag cgtttacagt cccaaataga 3360 cctctttatc aattcaagag aatgcctttt ggattgacca atgctccgca aacaatgagt 3420 cgtctaatgg acttagtaat tccatatcag ctaaaatcaa acgtattagt gtaccttgac 3480 gatttgctgg tattgtccga tagttttgag tcacatcttt tgcatctttc agaagttgct 3540 actcaattgc gaaaggctgg ccttaccata aatgttcaaa aaagccagtt ttgtcttagg 3600 aaggtagagt atctaggcta tatagtaggc gaaggaacac ttcaggttaa tcccgaaaag 3660 gtccggccgt taaagagttt ccgattccga aaactcggaa gcagctcaga cgattcttgg 3720 gaatgactgg atggtaccag cgttttattt ccagatactc ttcgatcatt tttcccctta 3780 ccgaacttct aggtaaaaat ttgttcacct ggaacgatca agctcaaact gcattcgagg 3840 agatcaaaat ggcattgtgc tcagctccat gtttggtcca tccacaatat gacaagccgt 3900 tcatagtgca gtgtgatgct tcttcgtatg cagtaggtgc tgttcttgca cagtgcgacg 3960 agtccggtca tgaacgccca atagcgtaca tgtcgaaaaa attgaacaaa gcacaacgta 4020 acaataccgt tacggaattg gagtgcttgg ctgtcgttct gggtattaga aaatttcgca 4080 tgtacatcga cggtcatcgt ttcacggtag tcaccgatca cgcaagtctt cgatggttga 4140 tggaccaacc tgaccttagt ggtagattag ctcgatgggc gataagtctc caaggttacg 4200 gcttcagtat tgaacaccgc aaaggaagcc aaaatgtggt agctgacgcc ctatcccgtt 4260 cgtttgagtc agtagtcgag gttgcgagcc taaatgctga agttctaccc gaagtggatc 4320 tgcaatccga agcatttcag tcagcagaat actgtgcttt gcgagataaa tttaaggtat 4380 cgaagatgcc agactatcaa gtaattgatg gttacattta tcatcgcaaa gatttctctg 4440 acgatagctc ctatcaaggc aatgattcat ggaaactgtt agtgcccgag tctttgcgaa 4500 acagcgtcat atgttcagct catgatcaac catcttccgc acactgtggt accgcgaaat 4560 gtcttgaacg aatacgacgt catttgtatt ggcctaggat ggtggttgat gtgcgtaatt 4620 acattagtga ttgtgatacg tgtaagataa ccaaatatcc caatcgatca cttaaacctc 4680 ctatgggtgc tcgaattacc agcgaaagac catttcagcg gttgtatttt gacttcgttg 4740 gtccgtttcc tcgcacaaga aagggtcaca ttggaatttt cataatttta gatcactttt 4800 caaaattcac ctttctaaag cccgttaaaa aattcacctc caacgtgatt atcgattacc 4860 tgagtgaaaa tatatttcca tgttttggag taccagagat agtcgtgagt gacaacggta 4920 ctcaatttaa aagccgtgtc tttgacgatt tcctcctaac cttcggtatt caacatatat 4980 ttacaggagc ctattcacca caatcgaacg ccgccgaaag agtgaatcgt tcgattaatg 5040 cagctttaag agcctatatt cgtgttgacc agcgcgaatg ggataccttc ttaagtcgca 5100 ttaacagctc ccttcgcaat tctatacatc aatcgattgg ttactctccc tattttatgg 5160 tctttggaca acacatgatt tctcatggca aagattataa attattgaga aatttagaat 5220 tgctaaaaga cagcaccgct aggctaccca gagctgacga atttcagaaa ttcagggctg 5280 atgtagataa acacctcaaa aaggcttatg acagaaacca gcgatcgtat aatttacggt 5340 cacagacacg agcatttgaa ataggtcaag aagtgatcaa aagaaatttt tctttaagca 5400 aagctgcagc taatttttgc tctatcggca ctaaggctcg aatcaaggga aaactgggcc 5460 gatcgattta cctcctagag gatatgaatg gaaaggagct aggaaatttt catgcaaaag 5520 acatttggta gccagcactt cctattcgca tccgtttgaa tttttttcag ctcttatctt 5580 tctctaatag aagtagatca gacctgtgg 5609 // ID MuDR9x_AP repbase; DNA; INV; 2125 BP. XX AC Contig13211; XX DT 25-JUN-2009 (Rel. 14.07, Created) DT 25-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR9x_AP. XX NM MuDR9x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2125 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1358-1358 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(256..876,923..1474) FT /product="MuDR9x_AP_1p" FT /translation="MSTIFDSDNVCLTAKYRSCILRILYSFTVMSFKNIEY FT LILQKNKPLVKYDGFLYRKNCENGEVSYWRCVEVEKCRGRINIKNYLVIKD FT SSFKHVHVPNPAKIKVKTLTEEMKKSALTTQHNPSKIISDIVITASSNVNQ FT GNLTSVFNLKRIIQRTRQRNQRSPPNPLTLKDLGEIPDMETCINKQTMEVS FT FCCSIITKLTVLQKIEFTYWFANGTFKSAPSIFTQIYAIHVLKYNTVILVV FT FALLRDKSTATYSYLIKTLKSHIPLNPKTIMTDFEQSAILAFKENFPNVTS FT RGCHFHSIQCVWQKIQSITTIHEKYITDAECCIQIRLLPALAYLPAYDVIA FT GFEHLCESDYYIEHEELFRSLLDYFEDTWLGKTVHKRRRGPTFSLEM*" XX SQ Sequence 2125 BP; 807 A; 303 C; 320 G; 695 T; 0 other; accgcgaatc caaaaaagcg aatttcaaaa tagtgacgga aaaatagcta agttgaaagt 60 agctaaaaac cataatagcg aatgttcaaa acagctaaag tatattaatt gtattattta 120 tctgtagtat taagtattaa gtattaagtt ttaagtatat aaagatacag ataatgacaa 180 tcagttttat agctgggtta tccaatatta ttgtccggtc ggaccaagta taataataaa 240 aaaggtggaa tagatatgag cacaatattc gacagtgaca atgtatgttt aaccgcaaag 300 tatcgcagtt gcatacttcg gatattatat agtttcacag ttatgtcgtt taaaaatatc 360 gaatatttaa ttttacaaaa aaataaacca ctcgtaaaat acgatgggtt tctgtatcgg 420 aagaattgtg aaaacggcga agtttcatac tggagatgtg tggaagttga aaaatgtcgg 480 ggtagaatta acattaaaaa ttatttggtt attaaagatt cttcattcaa acatgtacat 540 gtccccaatc cagctaaaat caaagtcaaa acattaactg aagaaatgaa aaaaagtgct 600 ttaactactc aacacaatcc aagtaaaatt attagtgata ttgtaattac ggcgtcatcg 660 aatgtaaatc aaggcaatct cacaagtgta tttaatctta aaagaattat ccagcgaact 720 aggcaacgta atcaacgctc tccgcctaat ccactaactt tgaaagattt aggagaaata 780 ccagacatgg agacatgtat aaacaaacaa acaatggaag taagtttttg ttgttcgata 840 attacgaaat taacggtcct acaaaaaata gaattttaat attttgtacg agtgaaggat 900 taaagttaat gtctgaatct gaacatattg gttcgccaac ggtacattta agagcgctcc 960 ttctatattc acacaaatat acgcaattca cgtattgaaa tataatactg taatactagt 1020 agtttttgca ttattacgtg ataaaagtac tgcaacatat tcttatttaa tcaaaacact 1080 aaaatcgcac attccattaa atcctaagac tattatgact gacttcgaac aaagtgcaat 1140 attagcattt aaggaaaatt tccccaatgt aacttctcgt ggttgccatt ttcactcgat 1200 tcaatgtgta tggcaaaaaa tacaatcgat aaccacaatc cacgaaaaat atataactga 1260 tgcagaatgt tgtattcaaa tacgattgct accagcactt gcctatttac cagcatatga 1320 cgtaatagca gggttcgagc acttatgtga aagtgattat tacatagaac acgaagaact 1380 ttttcgatca ttattggact actttgaaga tacttggctt ggaaagactg tgcacaaacg 1440 acgtagaggt ccaacatttt ctttggaaat gtgaaattgt tatgacttga tagatttgcc 1500 aaagactaat aattctgtcg aaggtaatta atttctataa ataaatatat ctaaaatata 1560 ttaagtatgt ttaatagatt tttacatatt atataggttg gaatcgtggt tttaattatc 1620 tacttggatc gtgtcaccca acactttgga aactcgtaga agggttcaaa caagaacaga 1680 ctaacacaga aatgaaagta gagcaataca taagcggaca aaaaccaata aaaaaaaaat 1740 atatcaagat actgcattaa ggattttaga aattcaaaaa caatatacaa aaagaaatat 1800 tttaaattat ttgaaaggca tagcctataa tttatcacta caagtataaa ctacaaatat 1860 aatattataa tatattttta tgtttttttt tttaaattct ttatatttaa tttaatatat 1920 gactatgttt gacgtataat attaataatt ctgaaataat atacttttta aatacctagg 1980 taatatattt ttatttttta tattaaatta ttttaaaata atatacctta gctgttttga 2040 acattcgcta ttatagcttt tagctatttt atacttagct attttgccgt cgctattttg 2100 acattcactt ttttggattc gcggt 2125 // ID Tx1-2_BF repbase; DNA; INV; 5095 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-2_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-2_BF; KW Tx1-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5095 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5095 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 839-839 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 450..1214 FT /product="Tx1-2_BF_1p" FT /note="ORF1." FT /translation="MPPKRNTNTNAVMDEEVPVSVAFMKEMLEQQKIYYKD FT LLDQQEKNFRSFVQIITDSTNQRMDNLVREMQDLKNSLNFSQGELGDMKTV FT SGKNVDKIKQLEQDILTYNNEVTEIVNKVDYIENQSRRNNLIFDGIKDDKK FT ETWEQSEVKVKEVLKTKLRLNTDAIEIERAHRNGRPGDRPRPIVVKFSRFK FT DKQSILRHAKLLKGTSIYINEDYSERIRQKRKDLLPALRAARERGQVAHIR FT YDKLVISDRGTANN" FT CDS 1305..4979 FT /product="Tx1-2_BF_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSSSNSLTSKLTFISLNVRGLNNKLKRQKIFRWIHNQ FT KTDIAFLQESFSAKENEKLWANEWGGKIVFSHGTAHGKGVMMLFNPEADID FT LEQTLLDQDGRYIMVNGRVNDNDVSLINLYSPNKRTEQVKFFKEIHKLAQE FT NASGTVIIGGDCNISLTELDKTGGRPIDDKNYAIKEMQCLLKSFNLADVWR FT EKNPNTRQYTWQNENIKCRLDYWFAPYNLVNLSECSIKVAPVPDHRAVIMT FT VKSPRYQKRGPGFWRLNNSLLEDQTFDKEMTDLLTNIKDKLKDIQQPGIQW FT EMIKMEIRSLSIQYSKNKARSRKNEEKELEEQLNSLMRETDEFDDPEKQKL FT ICETKERLESIQTHKTKGAIIRSRCNWNEQGEKSTKYFFNLEKRNYKEKTI FT SSLTLEDGSITTDPKTILLEEAEYYKKLYTSSYPDMKESTYFLDNDDVKKL FT NTEQKTSCDGEMKEVELRKALQAMSKNKNKSPGCDGITIEFYIKYWNLLKD FT YLLNTINDAYHNNLLPISLRRGVIRLIPKKGLDTSNLDNWRPISLLNVDYK FT IVSKALTLRIERLLPFLIHEDQTGFVKQRYIGTNIRRIYDTIDYYDKKKKP FT GLLMFLDFKKAYDSIEHSFILKVLETVNFGHNMIKWVKLLYTDITSCVLNN FT GYISPWFSVHRGVRQGDPLSSSLFIICIEMLASAVRADINIKGLQVYGRMQ FT KLSMYADDMTSLLLDIPSALKLLETLNLFKDCSGLDLNYGKTEAMWIGSNK FT DRQDKPLPIRWPDGPIKALGAHFGHDRHRCYKEDLDKNYTKMVQACSIWKQ FT RNLTLIGRILIVKTIGISKLIYICSVMHVPPQFVKQVNEYIFKYIWREKPP FT KVKMKTLIGKKCQGGLKMIDFNVMNKALKAIWIKRFVMDPSKEKEHLFARW FT GGFLIFRCNYATSQLDLNDMPLFYKDVLVAWEEVIGYEPKTHRQICSQVLW FT NNRFILVKNAPIFYKIWIDAGIIILNDLLYGDGSFLTNKDLHNLYGLTMDS FT RQILNYNSIKAAIPKNWKDNIKNNGKKCASKTIIEYTERIHPLTTTCKQIY FT DTLIEKQVVPPTRENEIIRKIGKDNIEKVYTTAFNVTKETNLQYFQYKIIH FT HFLPTNSWLKKTNLVKTDRCNNCKEKHTIVHLFVYCKEVKIFWKQFCLWWR FT SYENQDIVLDENLIVYGVCCHRTASTTLIYYILLAKYCIYLCYVKNMKPNF FT AHFKNRVNNSIEIAL" XX SQ Sequence 5095 BP; 1911 A; 880 C; 990 G; 1314 T; 0 other; aatcggtagg tggcgcttac ggtagaagcc aggtattcaa cgtctctccg tatgttgttc 60 caaagaatga gtttttgatg tgatttatcg catccaaagg agaacttttg agttgaagtg 120 ttgcagagac agtttgtaca acttcgaaga agatttttgg ggagacggga ccagtttctc 180 caggattttg gatcacagac ctgcataggt gggagggcgg agagtgcgtc ctttgttctt 240 gtgcagacaa cgcacgccgg tgtgtgcaat taacgtatcc aggaccacta caccgaagaa 300 acgaaatctc tccaacaaac cggtaaaatc cacttattta acgaaattgt gaactatata 360 gaaccaagaa agggctgtac tgtgtatttt gcacttagag acaagtttga gagagataat 420 tagattagtt caacctcctg agcgccacca tgccaccaaa gagaaacaca aacactaacg 480 cggtcatgga cgaggaagta cccgttagtg tagcttttat gaaagagatg ctggaacaac 540 aaaaaatcta ctacaaagat ctcctggacc aacaggaaaa aaatttccgc tcatttgtac 600 aaataataac agattcaacc aatcaaagga tggataatct cgtcagagag atgcaggatt 660 taaaaaacag tctgaacttc tcacaaggag agctcggtga catgaaaacg gtaagtggaa 720 agaacgtgga caaaataaaa cagttggaac aagacatcct gacctacaac aacgaagtga 780 ctgagattgt aaacaaagtg gactacatcg agaaccagtc acgaaggaac aatcttatct 840 tcgatgggat caaagacgac aagaaggaga catgggaaca atctgaagtc aaggttaaag 900 aagtactgaa gaccaagttg cgtctgaaca ccgacgcaat agaaatcgaa cgggctcatc 960 gtaatggcag accaggagat cgaccacgcc ccattgttgt gaagttctca cgcttcaagg 1020 acaaacagag catattaaga cacgcaaaac tgctgaaagg tacatcgatc tacattaatg 1080 aagactactc ggaaaggatc agacaaaaga ggaaagactt actaccagca ctacgggcag 1140 ccagagaacg aggtcaagtg gctcacatac ggtatgataa acttgtgatc tctgatcgcg 1200 ggactgccaa caactaggac tgtacggtat tagtttttac tatccatatc aagtcatgct 1260 cttgttaatg cttagatatt tttttttgtt ataactcttt taacatgtct tcatcaaact 1320 ctctaacgtc caaattaacg tttatctcct taaatgttag aggtttaaat aacaaactga 1380 aacgacaaaa aatatttaga tggatacata accagaaaac agacatagcc tttttacagg 1440 aatctttctc ggctaaagaa aatgaaaaac tttgggctaa tgaatggggc gggaaaatag 1500 tttttagcca tggcacagct cacgggaaag gagtaatgat gctctttaac cctgaagcag 1560 atatagactt agaacagaca cttcttgatc aagacggtag atacataatg gtaaatggta 1620 gagtaaacga taatgacgta agtttaatta acctttattc accaaacaaa aggacagaac 1680 aagtgaaatt cttcaaagaa attcacaagc tagctcaaga aaacgctagt gggacagtta 1740 tcattggcgg tgactgtaat atttccctaa cagaattaga caagacagga ggaagaccca 1800 ttgacgacaa aaattatgca atcaaagaaa tgcaatgcct attaaagtct tttaatctgg 1860 ctgatgtatg gcgggaaaaa aaccccaata ctagacaata cacttggcaa aatgaaaata 1920 taaaatgtag gctagattac tggtttgcac cttacaattt agtaaactta agtgaatgta 1980 gcattaaggt agcgcccgtc cctgaccaca gggcggtaat aatgacagtc aagtcgccta 2040 gatatcaaaa gagagggcca ggattttggc gtttaaacaa ctctttgtta gaggatcaaa 2100 cttttgataa agaaatgaca gatctactca ctaatatcaa ggacaaacta aaagatatcc 2160 aacagcctgg tatacaatgg gagatgatta aaatggaaat aagatcactt agtattcaat 2220 actctaaaaa taaagcgcgc agtcgcaaaa atgaggagaa agaattagaa gaacaattaa 2280 actccttgat gagggagaca gatgaatttg atgatcctga aaaacaaaaa ttaatctgtg 2340 aaacaaaaga aaggttagaa agtatacaaa ctcacaaaac taaaggtgct atcattagaa 2400 gtagatgcaa ttggaatgaa caaggagaga agagtacaaa atacttcttc aacctggaaa 2460 aaagaaacta caaagaaaag actatctcgt ctttgacttt ggaagacgga tcaataacaa 2520 cagaccctaa aacaatttta ttggaagaag ctgagtacta taaaaaacta tatacctctt 2580 cttacccaga catgaaggaa tcaacttatt ttttagacaa tgatgacgtt aaaaagttaa 2640 acactgaaca aaagaccagt tgtgatggtg aaatgaaaga ggtagaactg agaaaagctt 2700 tacaagctat gtcaaaaaat aaaaacaagt caccaggatg tgacggtatc acgatagaat 2760 tttatattaa gtattggaat ttactaaaag attaccttct taatacaatt aatgatgcgt 2820 accacaataa cttactaccc atctcactaa gacgtggagt aataaggcta attcccaaga 2880 aagggttaga cacaagtaat ctagacaatt ggcgaccgat aagtctatta aatgtagatt 2940 ataaaattgt cagtaaagca cttacactac gaattgaacg tttactccct ttcctaatac 3000 acgaagacca aacagggttt gtcaaacaaa gatatatagg gacaaacatt aggcgtatct 3060 atgacacaat agattattac gataaaaaga agaaaccggg attgttgatg tttctggact 3120 tcaaaaaagc ttatgatagt atcgaacatt ccttcatact aaaagtacta gaaactgtaa 3180 actttggaca caacatgatc aaatgggtta aattgctata cactgatata acaagctgtg 3240 tgctcaataa tggctatata tctccatggt ttagtgttca tagaggtgta aggcaggggg 3300 atccactgtc atctagcttg tttataatct gtattgaaat gttagcttca gcagttagag 3360 cagatataaa cataaaaggt ttacaagtgt acggtagaat gcaaaaactc tcaatgtacg 3420 cagacgacat gacatcacta ctactagata taccctctgc actgaaactg ttagaaacgt 3480 tgaacctctt taaggattgt agtggtcttg atctaaatta tggaaaaact gaagctatgt 3540 ggataggctc aaacaaagat agacaagata agccactccc tatccgttgg cctgacggtc 3600 ccattaaagc actgggcgcg cactttggcc atgacagaca tagatgctat aaggaagacc 3660 ttgataagaa ttatacaaaa atggtacaag cgtgcagcat atggaaacaa cgaaacttga 3720 ccttgattgg tcggatattg attgtgaaaa ccatcggcat ctctaagttg atctatattt 3780 gttcagtcat gcatgtccct ccccaatttg taaaacaagt aaacgaatac atattcaagt 3840 atatctggcg agagaaacct ccaaaagtaa aaatgaaaac tctgattggc aaaaaatgtc 3900 aaggtggact taaaatgatc gattttaatg taatgaacaa ggccttaaaa gccatatgga 3960 ttaaaagatt tgtaatggac ccatcaaaag aaaaagagca cttgtttgca agatggggag 4020 gtttcttgat tttccgatgc aattacgcta cttctcagtt agatcttaac gacatgcccc 4080 tattttacaa agatgtttta gttgcatggg aagaagtgat aggatacgaa cccaaaacac 4140 acagacaaat atgttcgcaa gttttgtgga acaatagatt tattttagta aagaatgcac 4200 cgatatttta caaaatctgg atagatgccg gaattatcat tctgaatgat ttgttgtacg 4260 gggatggatc ttttcttaca aacaaagacc tacacaattt atatggtttg accatggatt 4320 caagacaaat attgaattac aattctataa aagcggcaat accaaaaaat tggaaagata 4380 atataaaaaa caatggaaaa aagtgcgcct ctaaaactat cattgaatac acggagagaa 4440 tccatcccct cacaacaaca tgtaagcaga tatacgatac ccttatagaa aaacaagttg 4500 taccacccac tagagaaaac gaaatcatac gaaaaattgg caaagacaac atcgagaaag 4560 tttatactac cgcctttaat gtcactaaag aaaccaattt acaatatttt cagtacaaga 4620 taattcacca tttcctacct acaaacagct ggttaaaaaa aacaaatcta gtaaaaaccg 4680 atcgatgcaa taactgtaag gaaaaacata ccatagtaca tttatttgtg tactgcaaag 4740 aggttaaaat tttttggaag caattctgtc tttggtggag gagctacgaa aaccaagata 4800 tagtgctgga tgaaaatttg attgtgtatg gtgtatgctg tcatcgtact gccagtacca 4860 cacttatata ttatattctc ttggcgaaat actgtatata cctatgttat gtgaaaaaca 4920 tgaaaccaaa ctttgcgcat tttaagaaca gggtaaataa tagtatagaa atagccttgt 4980 aaggtgatac agtctgatca caactacttt tatgtattat aatgttatct gattgttatg 5040 tgaatgtatt gttatgtaaa tgttaataaa gtttgaaaaa aaaaaaaata aaaaa 5095 // ID ISL2EU-4_HM repbase; DNA; INV; 7117 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A family of autonomous ISL2EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-7117 RA Bao W. and Jurka J.; RT "ISL2EU-type transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2061-2061 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1501..2562 FT /product="ISL2EU-4_HM_1p" FT /translation="MSYTEKSEIDIEIPLKTSERQQLYSEINNLRLERDLF FT MSKWISLKDSNCSLGINKIRANPDKCNMLTGLNLPVLEKLLSFLCKGTPES FT SLKRFNTEDKIVFCLVKLRHNINFDMLSFMFDIKKTTALDNFWKWIDIMYV FT KLKYLIKMQDRDHIYETIPPVFKNKFPRLTSIIDCFEVFVESPSSLMARAL FT FYSQYKKHCTIKCLISCTPNGTINFISKCYGGRASDNQITRESEFASSKYH FT MPGDQILSDRGFTLQDDFAAGSCSILINPAFTKDKAQLSASEVEKSRKISS FT VRIHIERVIGLLKNRYTILQGVLPLRTVKNITDEATSVTLSNCDKIVTVCA FT ALTNLGEGIV*" FT CDS 6137..4671 FT /product="ISL2EU-4_HM_2p" FT /translation="MTSSTPTDLPCQWNQTFVKNIVGSPVSQINLYSDAAK FT EKLTGKRTRKMPVSPTNIEKYQFLSELKSVQPKTAALSLFKDFDTEFIHLE FT STVVAKLPLSLRNFFNANYKSLDNEQLNSFFDEKIKDLKLNSDDIVYVEEA FT TRNQSLCDSWYQVRVGRITGSTLYQVFHARVETPPKSLILNICSEKNITIK FT SPPIVWGRENEKNALTLYTQLYSDCNEAHNLLALSDIVIHQNLKVEKLGLC FT IDYEKPWYAASPDASVYCTCCGHGVLEIKCPFSLKDKSLKEEILKDVFYVG FT LNTDGKYFLQKQHTYFFQVQLEMRVTGVSYCDFMVWTPSEFIVLRIEPDIS FT FMESVFNKCDVFWNRFILRELVTREVESGMKLPSSTLTETINNDDNVFCIC FT KSKSSENDDTMVGCDSCDNWFHLKCISLKSVPRSSVWYCNSCKNKKKSAHK FT NVIIFYNFSSKKFFFNNTFFYYYLLRNQYINGAVLMLFSLSFS*" XX SQ Sequence 7117 BP; 2555 A; 1040 C; 1021 G; 2501 T; 0 other; ccatctagtc ggcgtgacgt cacgccgaat tgcgaaataa attaaccatc ttgaatggag 60 ggaaaacaaa catataaata tcaacagcgt tttaaacttt ttttatgaca aaaagatgcc 120 aaggagttgt tgtgttgtag gttgcacaaa caattgtaaa aaaaacaatg aacttaaatt 180 ttatcgtatt cctaagaatg aagcggtcag aagaaaatgg ttaaatgcaa ttaatcgtgc 240 tgcagatagc acttcgtcaa aacagtggtc tcctacgtct tcaaatgttt atgtttgttc 300 agcacacttt attactggta aacaactttt attaagattt gattttatat actaaatgaa 360 ttacaataaa tactacttgt tcaaaaattt caggtaaaag agagctcttt gagaaacatc 420 cagattcagt tccatcaatt tttgatactg gaatgaaagg ttctcctcaa agaaatacca 480 acaataacaa aattaaccga gtcagtagga ggataaatag acaaagtaaa tattattgtt 540 tgcttcaatt catttgtaaa taattacttt gatgttgttt aattttaaat ttttttttta 600 gaaataagtg ttcctgcatg caaaaaatta aaactatgtg ctcaatcaat tagtgaggat 660 gttgattttg tcattgaaaa aaggagtttt agtaacgaag gtaagtttga aaaaataatt 720 attaaattta ggtatttaca atttaaattt gtataatttt tgttataatt aataatgatc 780 atgtagatct tgtaaaagtt atgaaaccgc tccagtaaat ttttataaaa taaatgtatt 840 tcatactcaa cagaaatcga tatccaagtt agtactgaag attatttgca gcaatctgat 900 atcagtaatc ctatatgctc caaaaatttt ttagaacaat cttccagtca tggtattgat 960 taatttttta ttttagtttt ttacattttt gcatagtttt agatttttct ttttaacaat 1020 ttttcagaaa ttaataattc tatacttact agagctattt tagaacgttg accatcataa 1080 atttaattat aaaaataaaa tggctataca aatataagtt taaaatatgt tttctctaaa 1140 aaaatttgtg cgaattaagt tttttttagc actaaggtct tcaattactt ctaacttaac 1200 ttctatttgt attttttgtt tttgtaattt tgtttatttt ttgtttatta tctttatttt 1260 gtagaagttg aaaaataatt taataatata tttcagaatt tagtgttcct ataattaatg 1320 aagatgttta tgttcaagat tccagtaaga gttatataca aggtattaat ttaaaagtta 1380 tttaaacatc atatgcaaat ttaaaatatc tattgtattt ttttcttatt taaaggaaag 1440 agtgctcctg taaacatcaa tgttcagcaa agttcaacca atagttcttt aaagtcagaa 1500 atgagttata ctgaaaaatc tgaaattgat attgaaatac ctcttaaaac atcagaaaga 1560 caacaactat atagtgaaat aaataatctt cgattagaaa gagatttatt tatgtcgaag 1620 tggatatcat taaaagatag caattgcagt ttaggaataa ataaaattcg agccaatcct 1680 gataagtgca atatgctcac tggacttaat cttcctgtat tagaaaaact gttgagtttc 1740 ctatgcaaag gaactccaga aagttcatta aaaagattta atacagaaga taaaattgtt 1800 ttctgtttag ttaaattaag gcacaacatt aactttgata tgttatcatt tatgtttgat 1860 attaaaaaaa ctactgcctt ggataacttt tggaaatgga ttgatattat gtatgtgaaa 1920 ttaaaatatt taataaaaat gcaagatcga gatcatatat acgaaacaat tcctcctgtt 1980 ttcaaaaaca aattcccaag actaacatca attattgatt gttttgaggt ttttgtggaa 2040 tcaccatcat ctctaatggc aagagcacta ttttacagcc aatataaaaa acattgtaca 2100 ataaaatgtc tcatttcatg cacacctaat ggaacgatta actttatttc caaatgctat 2160 ggtggaagag cttctgacaa tcaaataact cgggaatcag aatttgcatc tagcaagtac 2220 catatgccag gcgatcaaat cttatcagat agaggtttta ctttacaaga cgattttgca 2280 gctggaagtt gctctatatt aataaatcca gcttttacaa aggataaagc acaactttct 2340 gcttcggagg tagaaaaatc tcgcaaaata tcatctgtta gaattcacat agaaagagtt 2400 attgggctgt tgaaaaatcg ttacaccatt cttcaaggtg tattgccttt acgaactgtt 2460 aaaaacatta ctgatgaagc tacctcagtt actctttcta attgtgacaa aattgtaaca 2520 gtttgtgctg ctttaacaaa ccttggtgaa ggaatcgttt aaattttctc aaaacggcta 2580 ttttcatagc cacatatctt aataaaagca ttatttgaca aaaaggacct caagaggtgg 2640 tacaagagtt gtactcatgg attttctttt tccggaaatt ttggaatttc ggatattatt 2700 attaacttta agtttcccca gatatcctaa aattttccag aaattaaagt tgaggtatat 2760 cttttggaaa tatatttatt ttcaagcttt ttcacttatt gctcatactg tattagcgat 2820 aagtgaaaaa gcataaaaat aaatatattt ccaaaatata tacctcagct ttaatttctg 2880 gaaaatttta ggatatctgg gaaaacttaa agttgataat ttttgtatgt catactaaaa 2940 ttaaaatgga aatcaactca ggtaatagca aaataaagtg gaaaatataa tccggattat 3000 accggatatc ttacctaatt tttaaaatat gacccggatg atcaatgtat aatcagggat 3060 tatctttttc cggatatttt atttagtttt ttcggatatc caaaaggttt tctatacata 3120 caagtttata tatatgttta gataatatat ttgattttta ggggcttttt agtcatcact 3180 cgttcaaaaa agaaaatttc aaaaaaatta agatacaatt cagctgaaag caaattatga 3240 ggaaaataag agaaaaaata ttccgcattt tttccggata ttttacgcaa agaattttcc 3300 tttttttttt taaaatatta cctggatgat ctatgtatat cgtccctggt gccccggaaa 3360 aaggagaagc ctttctgcat tacatggtac atattttcag accgcaaata catttgggat 3420 ctacttttgc tgttaacagc cctgtaaata aggattttca ctattcaata cgaaacgtat 3480 atggtatgga atacaatacg atacggaaaa tgatcatgaa aactagtcca ttctttaaag 3540 ttatttaggt gtgttgaata agtttaatag aatatttata aacctaataa atgtgcgaaa 3600 tacggattag aatatttatt aggtttgaaa aaaacgagcc acaactttat gtcgttacaa 3660 cataaaatat aaatgaataa aatatcgacc ggcgtctgaa ttaaatatga aaatacattt 3720 cgataccgct ttgtcaaaaa tgtaataact tgttaatacg attttttagt aaaataattt 3780 gttactacaa ttttatttcc taaatttgat tttgagtaaa tttgcgtaaa aaaatatcgt 3840 tggaatctat tcaatagaaa cattgttatc attagtattt tgagagttca atttacgcat 3900 tagacttttt gtataacatt gataaataag attttgttta tttaggtgct ccaagaagtg 3960 ctaagggtct ttttatagag caccgcgtaa cagtatttat ttcacgcttc cttcattact 4020 aatcttgcat acatgagccg tcatcgaaca acgatagttt acagtgatta tccaagtcat 4080 attttaaaaa tccggtaaac agttgtgtat aaacagttgt atattcggaa aaaatccgtt 4140 atatatttac tcttactttc ctatttcttt tttatattgt tgtatgaatg aaaaaccaaa 4200 taaagtatat ataaattaca aaaatacaaa caaataatat gtaggtcaga cagtttggat 4260 atccggaaaa acataaaatt tagaaaaata tccggaaaaa gattacccct gggtttaaaa 4320 caatattata acataaaata cgacaaaatg ttttcccccg aggaataagg taaaatattt 4380 ttttccaaaa aagaaaccga ttaacttaag gaaaaatacc gccttacttc ttttttattt 4440 agaccaggta atcaaactct tcctattcac cgagatttcg gggaatagga agagtttgat 4500 tatctgttaa tctgtaacag ttttttcaat attatatcat atgctttaat gtttttaata 4560 atatgtggct atgaaaatag ccgtttttta aaaatgggat attctttatt tcacctcaaa 4620 ttatgatttt tccattccat ttctcagcat aataatgtta acaacaaata tcatgaaaat 4680 gataaggaga atagcattaa tactgctcca ttaatgtatt ggtttctaag aagataataa 4740 taaaaaaatg tattattaaa aaaaaacttt ttgctgctaa aattataaaa aataattacg 4800 tttttgtgtg cagacttttt cttgttttta cagctattgc aataccatac agacgatctt 4860 ggcacagatt tcaaagatat acatttaaga tgaaaccaat tatcacatga atcacagccg 4920 accattgtat catcattttc tgagctttta gatttgcaaa tacaaaaaac attatcatca 4980 ttatttatgg tttctgttaa tgtagaggag ggcaatttca ttcctgattc aacttctctt 5040 gttactaatt ccctcaaaat aaatctattc caaaaaacat cacatttatt aaacaccgat 5100 tccataaaac taatgtcagg ttcaatacgt aaaactataa actcactggg tgtccatacc 5160 ataaaatcac aatagctcac tccagtaact ctcatttcaa gctgaacctg gaaaaaatat 5220 gtgtgctgtt tttgtaaaaa atatttccca tccgtattta atcctacata aaaaacatct 5280 ttcagtattt cttctttcaa agatttgtct tttaaactaa aagggcattt gatttctaaa 5340 actccatgtc cacaacaagt gcaatacaca gaagcatcag gagatgctgc ataccatggt 5400 ttttcatagt cgatgcaaag ccctaatttt tcaactttta aattttgatg gatgacaatg 5460 tctgaaagtg ccaataagtt atgagcttca ttacagtctg aataaagttg tgtatataaa 5520 gtcaatgcat ttttctcatt ttctcttccc cacactattg gaggactttt tattgttata 5580 ttcttttctg aacaaatatt taaaattaaa gatttaggtg gtgtctcaac tctagcatga 5640 aaaacttgat acaaagtaga accagttatt cgtccgactc gtacctgata ccaagaatca 5700 cacaaagatt gatttcgagt tgcttcttca acatagacaa tgtcatcaga atttaatttt 5760 aaatctttta ttttctcatc aaaaaaagaa tttaactgtt cattatctag agatttatag 5820 tttgcattga aaaaatttct gagagataaa ggcaattttg ctacaacagt agattctaag 5880 tgtataaatt cagtatcaaa atctttaaac aaactgagag cagctgtttt gggctgtaca 5940 cttttcaatt ctgacaaaaa ctgatatttt tcaatatttg ttggagaaac aggcattttc 6000 ctagttcttt ttcctgtaag tttctcttta gcagcatcag aatataaatt tatttgagaa 6060 actggggaac caacaatatt ttttacaaat gtttgattcc attgacatgg taaatcagtt 6120 ggtgtggatg atgtcattcc aatgcgaaca gcagcttcaa ctttatataa tgttgcagct 6180 acatgtgagc atgtttcacc aaggctgtta aaaacaataa accatagatt taaataatcc 6240 aaaaacacct ttaaataatt tcaacagatt taaataatgt aaacaaaatt tagtagttca 6300 taaaacaaga taacttttag tgttaaatta ttatagtcaa gtgctctcca gacacttacc 6360 cggccataca attacagtga gcagtcacta cattaccaag tgaaccaaga gcaatccaag 6420 gcttatgatt aggggatgtg gttctgtaag aaggtctaac atcacatttt aagatcatga 6480 tatcttgact tatcttcata tggacaacag tttgcaccca accttcttga aagaacttcc 6540 aagcctccaa tgacttgtag ttttccatag cttctggaga atagatgcct acatagtgtt 6600 gaatgtttaa aataagtgta agcaaacatt aatattcact gagcaaacta acataatttt 6660 aggatttagg aattaaatat ataataactt aataaatata taataactta taaacaacta 6720 tttagaaata tttcttacat ggagtcttaa tcaggtattg atatagatta tgataactta 6780 aggaaggcca ttgacttgga tcttcagtcc aagaatcagc cggatactta tatggacact 6840 tgtccaatcc taaaacgatc aaaatgctat gcattatttt cgtttattct gacgaaaatt 6900 atttaaaaaa gtaaattaac atgaaaaata atcacctaaa agacttaatt ttcgctgata 6960 tctttctgct gcttcttttg ataaactttc aaaataattg aacgacatta ttacgagcgc 7020 tttcttatct ttgcgttaag gtgtaatgat ttgttttccc tacattcaat atggttactt 7080 ttgttgcgca ataagtcacg tgattccgac tagatgg 7117 // ID Crack-26_BF repbase; DNA; INV; 2141 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-26_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-26_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2141 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2141 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 831-831 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..1828 FT /product="Crack-26_BF_2p" FT /translation="AAQSVIQGSDGGQAGVWKAIRSLLPNKTQNTVTYLEH FT EDQGITDSKDIANTFNNFFQKVIQDLCKTLSNTITPFSPTQFVNETDSTFS FT FTPISVQTVTEQLTMLNVKKATGLDTIDNRLLKAGAEALAPSLTNLFNKSL FT TTGEFPTKWKIAKVMPIHKKDDRTKPGNYRPVSILPSISKLLERVVHNQLY FT SYLNQNQLLSQCQSGFRKLHSCETALHSVAEVWIDSIDKKQQKNRCDILLL FT IHLSRAFDTLNHDILLSKLKAYGVNERACSWFKSYLTGRLQCTCVNSVLSD FT TSCVTCGVPQGSILGPLLFIVYINDLPNCLQSCRVAMYADDTIVYFSHRSI FT QTIQETLQNDCIRLMHWFVANKLSLNPSKCKSMLVGSQKSKATDQSLQLLL FT DDTELEQVDTFKYLGVVIDKALLWNYHFLHISKKLSKIIGIMKCLKPYLNR FT QALLTIYTTLFLPHIQYCSTIWDQGGKGGIDKLQKLQHRAGRVVLGCDHHT FT PRETILNTLGWTPVAQHHKRNKAVFMFKALNGMSPPHISQLFTTSAETHSH FT HTRHSTQGGGGVQLPKTRLQYKKKSLSFSGASLWNSLPPHLRAAASISTFK FT TIYKTEYHMV*" XX SQ Sequence 2141 BP; 650 A; 521 C; 405 G; 564 T; 1 other; tgcagctcag tcagtaattc aagggtctga cggggggcag gcaggggttt ggaaggccat 60 acgttctctt ctacccaaca agacacagaa cacagttact tacctggagc atgaggatca 120 gggtatcaca gatagtaaag atatagcaaa cacattcaat aatttcttcc agaaagtcat 180 acaggatctg tgtaagacac tttcgaacac catcacaccc ttttcaccta cacagtttgt 240 gaatgaaaca gacagcacat ttagtttcac tccaatctca gtacagactg taactgaaca 300 gttaactatg ctgaatgtaa aaaaggcaac tggccttgac acgattgaca atagacttct 360 taaggcaggt gcagaagccc ttgccccctc ccttacaaac ttgttcaaca aatctctgac 420 taccggagaa ttcccgacaa agtggaagat tgccaaagta atgcccatcc acaagaaaga 480 tgacaggacc aagccaggta actatcgccc agtgtccatc ctgcctagta tttctaaact 540 actagagcga gtggttcata accagttata cagctacctc aaccaaaacc aactactttc 600 tcaatgtcag tcaggcttca gaaagctcca ttcgtgcgag actgcactgc actctgtagc 660 cgaggtctgg atagactcaa ttgataaaaa acaacaaaaa aacaggtgtg atattctgtt 720 acttatccac ttatccagag cctttgacac tctaaaccac gacatactgt tgagtaaact 780 caaagcctac ggtgtcaacg aaagggcttg tagttggttt aaatcgtacc tcacaggtag 840 actgcaatgc acatgtgtca actcagtcct atctgacaca tcttgcgtga cttgtggagt 900 tcctcagggg tccattcttg gacccctcct ttttattgta tatatcaacg acctgccaaa 960 ctgtttacag tcatgtagag tagctatgta cgcggacgac accattgtgt acttctccca 1020 tcgcagcata cagacaattc aggagaccct acagaatgac tgcatccgcc tcatgcactg 1080 gttcgtcgcc aacaaactat ccttaaaccc ttcaaaatgc aaatcaatgt tggttgggtc 1140 acagaagtcc aaagctactg accagtcact acagctcctt ctagatgata ctgaactgga 1200 gcaggtggac acttttaagt atttaggagt tgttatagac aaggccctgc tgtggaacta 1260 ccactttctc cacattagca aaaagctgtc taagataatc ggcatcatga aatgcctaaa 1320 gccatatctg aacaggcaag cgctattgac aatttatact acactattcc tcccacacat 1380 tcaatattgt agcaccatat gggaccaggg agggaagggg ggcattgaca aactacaaaa 1440 gttgcaacac agggcgggca gggtagtgct agggtgcgac caccacactc caagagagac 1500 cattctgaac acactgggct ggacccctgt agcgcagcat cacaagagaa ataaggctgt 1560 ctttatgttc aaagcactca atggaatgtc tcctccacac atttcccaac tcttcaccac 1620 atctgctgaa acacactcac accacacccg gcacagtaca cagggggggg gtggggttca 1680 actacctaaa acacgtctac aatacaagaa aaagtctctc tcgttctccg gcgcatccct 1740 gtggaactcg ctacctccac acttaagagc agcagcatcc atctccacct tcaaaaccat 1800 atacaaaaca gaataccaca tggtctaaaa cgtaactatt atgtcttttc ttgttttgtt 1860 cacattggta tgcaaatctc tattcatttc catttatttc attttgataa tgatttcgag 1920 ttatattatg cttacgttgt ataattgatt tgttttactt tatcatttgt ataccaagtt 1980 gtgttaattt tcgttagtac tcgatatgtt accatttatg ttacttgata tcgcttttat 2040 gttgcatgta tgtttcgggg ttgcccccca gggaactttg aaaaacgcct aaatwggcga 2100 tatgtatccc tggttaaata aaaataaaca aacaaacaaa c 2141 // ID BEL-98_AA-LTR repbase; DNA; INV; 551 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-98_AA_; KW BEL-98_AA-I; BEL-98_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-551 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 872-872 (2011). XX DR [2] (Consensus) XX SQ Sequence 551 BP; 184 A; 111 C; 113 G; 143 T; 0 other; tgttcgcaca tatatggggc accatttcca cacctatcat catcatcaac gtcggtcaac 60 cgcaatccac caccatggca agtcgggcga atgcactgaa cggaaattcg tccatcatcg 120 tgactcgcta ccaatgatca tcggcggcta atataaaagg actagctgaa cgtcatcgcg 180 ccctcttttg ttgtaccagc agattcgcat cacggattaa cgccgtcata ctagtgttcg 240 ggagtcagta ctagtagtca gtgtactaga actagatttg aattttacac gctaagaaga 300 aattagtgaa gttataaatt agaaataaaa agttaagtga aattgaattc gaaataaatt 360 agagttagtg gacatttagc gagtaattta gtgagaacta aggattaata aaccaaaaag 420 aagaagaact tacctgtagt gacttatact attgtgttga actgattact cacctgtgac 480 tgtgaaaaag gaacctgaaa aagaaagtgt acccgtgaca accctcctgg ctttactgag 540 cattcggaac a 551 // ID Gypsy-137_AA-LTR repbase; DNA; INV; 202 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-137_AA_; KW Gypsy-137_AA-I; Gypsy-137_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-202 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1008-1008 (2011). XX DR [2] (Consensus) XX SQ Sequence 202 BP; 63 A; 40 C; 44 G; 55 T; 0 other; tgtaatggca tgagccatgc ggtaaaaccg ctgaacccgc tggaagggtt ctcagcctta 60 agctgagtac actgagtgta aagtagggtc ttgtgaggaa attatcagaa gaataaacag 120 ttgttatttt actgcgaact gaaacacacg tctttaccag ttaataccga aagtctttaa 180 acccggtttt ccacatatta ca 202 // ID Gypsy-211_AA-LTR repbase; DNA; INV; 161 BP. XX AC AAGE02027032; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-211_AA_; KW Gypsy-211_AA-I; Gypsy-211_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-161 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027032; Positions 943 1103. XX SQ Sequence 161 BP; 48 A; 37 C; 25 G; 51 T; 0 other; tgtaacatag tttgtaattg actgttgtat aacccctcgt tatggtaact tccttatgga 60 gcacacacac atatacactt ccttgttatg taccttgaga gaaaccgtca aataaatcag 120 tatgttcaag acacgcgtcc gagttacacc tcattcttac a 161 // ID Gypsy-10_RP-LTR repbase; DNA; INV; 207 BP. XX AC ACPB02036805; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_RP_; KW Gypsy-10_RP-I; Gypsy-10_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02036805; Positions 105404 105610. XX SQ Sequence 207 BP; 69 A; 29 C; 30 G; 79 T; 0 other; tgtggcggca gcactctaaa gaattcaact cccatatggg actatatatt tttcttctat 60 tttatacata tgataattat tattttctgt atataataat gttaaataaa taagatttag 120 gcagttcgat tcagcatttg aaagaagtaa cgccttgttt ttattaagtg taattacttt 180 ccataactgt agtatagaga cactaca 207 // ID BEL-88_CQ-I repbase; DNA; INV; 5773 BP. XX AC AAWU01005995; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-88_CQ_; KW BEL-88_CQ-LTR; BEL-88_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5773 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 309-309 (2011). XX DR GenBank; AAWU01005995; Positions 15409 9637. XX CC Positions [4815-5372] - Integrase core CC 'GGCGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 11..3859 FT /product="BEL-88_CQ-I_1p" FT /translation="MNNLTGYCCGICKAAVEKDGDLPFCDHCQQFFHYECA FT QLEKGVKDDSWRCPNCAAEMEVTQVIVKFDDLEKTLQELEAEAKRQEERIF FT AEQKLHKRRLELQQQTQQRQQEAERQMLALEQQLMEQQLAAEKQFQEQQKK FT NKEDFEAKMAQLKAEAKSPKTKKEKKIKKSKGDKKKLDASGTAGPSGISTP FT LRHPGVTIPEVPKPKPTPIPADSGEEGNGNERDESGSDTSGTETESSSSDD FT EQEKAEVEPPKQLRGPTKAQLTARQFLARKLPVFTGRCEEWPMFISSYETS FT NLACGFSNVENLARLQDSIREPALTNVRSLLLLPEKVPAAIETLRMLYGRP FT EQLLNTLLIKVRKADSPRSDRLESFIYFGVVVQQLVDHLEATHMNDHLVNP FT LLIRELVEKLPAGSKLEWIRFKKLKKHVSLRTFADFLSDIVTDASEATLFT FT EPQLQHQPQSRRDAKPKAGSRTHESYLHTHAAAENSGDTNHAGVNNKRPCR FT ICDRTDHRVRNCEKFKQLKLDDRLDAVKKWQLCKLCLNEHGSARCRMNFRC FT NVGNCKERHNALLHSDVPATVVNCNAHDVRARQPVIFRMIPVTLYNGGRSV FT NIIAFLDEGASYTLVDESITKLLNVRGAVQPLRIMWTAGVSRLEKQSENVC FT LSISARGVSTQFQIKNAHTVNELKLPEQTVCFADVVKEYNHLHDLPVADYR FT GAPKILIGLKDLHLYAPLESRIGGAGEPIAVKSKLGWTIYGPHENNAPAAG FT FVGHHASHQVSNQDLHDLLKNQYLLEETGISMALLPESDEDKRARSILEST FT TVRNGDRYETGLLWKADDTSFPDSYVMALKRLKSLERKLLKNDSLYDNVRR FT QIVEYQQKGYAHKASQQELAACDPRKVWYLPLNVVTHPKKPEKVRLVWDAA FT ATVAGVSLNSQLLKGPDMLTSLPSVIFRFRERPVGFGGDIREMYHQLRIRK FT EDTNAQRFLFRNDPTAEPEVYIMDVATFGATCSPCAAQFVKNKNAEEFSKQ FT YPAAAKAIVENHYVDDYYDSASTVDEAIQRAKEVRLIHSKAGFEIRNWVAS FT SSEVVRALGEKEADQKVHLSQNKTTGYERVLGIVWNTSTDEFLFSAEMRED FT LERYLKGELRPTKRVVSSCVMSLFDPQGFLISFTIFGRILIQDLWRAGCDW FT DKAVDDEAWEKWKRWVSRLQEVEGVRIPRYYFQGRARSELRDVGASCVLRR FT ESVRLRCCRLLLDYVVGRPNLRARVREVKGGTAEATQHPALGTAVRCGRLE FT THELRAGEPHPQDCGTVHLVRL" FT CDS 3681..5771 FT /product="BEL-88_CQ-I_2p" FT /translation="MSSVGPICALVSGKSKVAPLKLLSIPRLELQSDVVGS FT RLMNFVQENHTLKIAERFIWSDSEVALSWIHSDQRKYKQFVAFRVGEVLTL FT TKQSEWRKVPSKLNLADMLTKWDNKHSFRQDGPWSRAPEFIEEAKRLWAIS FT KQQVQLNTPEEMRASVLYHSIIVTEQIIDPQNFSRWTVMVRSVACLARFRS FT NCRRKAKGLPIEALPLSKAMKGLVKRTVPVVPVPLKREEYQMAETYLWRWA FT QAGYFPDEVTTLLKNLELPADKKLSLEKSSVLYKLTPFLDEQKVMRVDGRL FT ERATMVPFEVRFPVILPKGHPVTTKLLEHYHQKMGHAYFETAINELRQRFF FT IPNLRAELKRVMTACVKCKVEKSVPAIPRMAPLPVQRVTPYQRAFSYTGVD FT YFGPVAVTVGRRSEKRWVSLFTCLTTRAIHLEVVHSLTTQSCVMAIRRFAC FT RRGMPIEFFSDNGTNFQGASKEIVRVDAECREEFTDARTSWNFNPPSAPHM FT GGAWERLVRSVKEALKAFDDGRKLTDEVLLTSLAEAEDLINTRPLTYLSTE FT AGSTEALTPNHFLRGVTTNDRGAQRAESTSPAEALRDYYKRSQLLADCFWK FT RWIVEYLPTLNQRTKWHAEAEPIACGDVVYVVEGANRGNWVRGIVTEVFKG FT ADGRIRQAMVRTSRGELKRPVSKLAVLEIQERKTGAKEAPPTRVTGRG" XX SQ Sequence 5773 BP; 1441 A; 1554 C; 1736 G; 1042 T; 0 other; aatcttaaaa atgaataacc tcactggcta ttgctgcggg atctgcaagg ctgcggtgga 60 aaaggacggt gatctgccct tctgcgacca ctgccagcaa ttcttccact acgagtgcgc 120 gcagctcgag aagggcgtca aggacgattc ctggaggtgc cccaactgcg cggccgaaat 180 ggaagtcacc caggtcatcg tgaagtttga cgacctggaa aagaccctgc aggagctgga 240 agcagaagcg aagcgccagg aggagagaat cttcgcggaa caaaagctgc acaaacgtcg 300 cctggagctg caacagcaaa cacagcagcg tcaacaagag gccgaaaggc agatgctggc 360 cctcgagcag cagctcatgg agcaacaact cgctgccgag aagcagttcc aggaacagca 420 gaagaagaat aaggaagatt tcgaggccaa gatggcacaa ctgaaggccg aggcaaagag 480 tcctaaaaca aaaaaagaaa agaaaatcaa gaagtccaaa ggggacaaaa agaagctcga 540 tgcctcaggc acggctggcc ccagcggaat ctcaacgccg ctccgacatc cgggagtaac 600 gattccagaa gtaccaaaac caaaaccgac tccgataccg gcggactcag gtgaagaggg 660 taacggcaac gaaagagacg agtccggctc ggacactagc ggaacggaga cggaatccag 720 ttcgtcggat gatgagcagg aaaaagcgga agtagaacca cccaagcagc tacgggggcc 780 aacgaaggcg cagcttactg cccgccagtt cttggccaga aaactgcccg tattcactgg 840 acgctgcgag gagtggccaa tgttcataag cagctacgag acgtcgaacc tggcctgcgg 900 attttcgaac gtcgagaacc tcgccagact gcaggacagc atccgggagc cagcgttgac 960 aaacgtgcgc agcctactat tgctgcccga gaaggtcccg gcggcgatcg aaactctgcg 1020 catgctgtac ggccggcccg aacagctgct caacacgctg ctgatcaagg tacggaaggc 1080 cgactctccc aggtcggatc gtctggaatc gttcatatac ttcggcgtcg tggtccagca 1140 gctcgtagac cacctcgagg ccacccacat gaacgatcac ctggtcaacc cactgttgat 1200 ccgagagctg gtggagaaac tacccgccgg gtcgaaactc gaatggattc ggttcaagaa 1260 gctgaagaag cacgtctcgc tccgaacttt cgctgacttc ctatcggaca tcgtgaccga 1320 cgctagcgag gccacactgt tcaccgaacc gcagctgcaa catcagccgc agtctcgcag 1380 agacgccaaa ccaaaagctg gatccagaac gcacgaaagc taccttcaca ctcacgctgc 1440 tgcggagaat agcggagaca caaaccatgc cggtgtaaac aacaaaagac catgccggat 1500 ttgtgatcgc accgaccatc gagtccgcaa ctgtgagaag ttcaagcagc tgaaattgga 1560 cgatcgtttg gacgcagtca agaagtggca actctgcaag ctgtgcctga atgagcacgg 1620 gtctgcccgc tgccggatga acttcaggtg caacgtggga aactgcaagg aacggcacaa 1680 cgcacttctg cacagcgacg tccctgcaac ggtcgtcaac tgcaacgctc acgatgtgcg 1740 ggccagacag ccggtgatct tcaggatgat tccggttacg ctgtacaacg gcggcagatc 1800 ggtcaacatt atcgcgtttc tggacgaggg agcttcgtac acgttggtgg acgagtccat 1860 cacgaagttg ctgaacgtcc gcggcgcagt tcaaccgctg cgcattatgt ggaccgctgg 1920 agtttcaagg ctggagaaac aatcggagaa cgtctgcctg tcgatctctg ccagaggggt 1980 gtcgactcag ttccagatca agaacgccca caccgtgaac gagctgaagc ttccggaaca 2040 aaccgtgtgc ttcgccgacg tcgtcaagga gtacaatcat ctgcacgatc tcccggtggc 2100 agattaccgc ggcgcgccga agatcttgat tggcttgaag gatctgcacc tgtacgcgcc 2160 tctcgagtct cgtatcggcg gcgctggaga accaatcgct gtaaagtcga aacttggatg 2220 gacgatctac ggcccccacg agaacaacgc gccagcagct ggattcgtag gtcatcatgc 2280 tagtcaccaa gtctctaacc aagacctaca tgatctcctg aagaaccagt acctcctcga 2340 ggaaactggc atctccatgg cgctgctgcc ggagtcggac gaggacaagc gagccaggtc 2400 gattctggag tcaacgacgg tgcggaacgg agatcgatac gaaaccggcc tgttatggaa 2460 agcggacgac acaagtttcc ccgacagcta cgtgatggcg ctgaaaaggt tgaaatccct 2520 ggaaagaaag ctgctgaaga acgactcgct gtacgacaac gtgcggcgcc aaatcgtcga 2580 gtatcaacag aagggctacg cgcacaaggc ctcccagcaa gaactggctg cgtgcgatcc 2640 gcgaaaggtg tggtacttgc cgctgaacgt ggtgacgcac cccaagaagc cggagaaagt 2700 acgcctcgtg tgggacgcgg ccgctactgt ggcgggagtc tcgctcaaca gccagctgct 2760 caaggggccg gacatgctga cgtcgcttcc ttccgtgatc ttccgattcc gtgaacgtcc 2820 tgtcggattc ggtggtgata tcagggaaat gtatcaccag ctgcgcatac ggaaggagga 2880 tacgaacgcc cagaggttcc tgttccggaa cgacccaacg gccgagcctg aggtgtacat 2940 catggacgtc gccaccttcg gggcaacttg ctcgccgtgt gccgcgcagt ttgtgaaaaa 3000 caagaacgcg gaggagttct cgaagcagta tcccgcggcg gcgaaggcaa tcgtggagaa 3060 ccattatgtc gacgactact acgacagcgc gtcgacggtc gacgaagcga ttcagcgggc 3120 gaaggaggtc agactcatcc actcgaaagc gggcttcgag atccgcaatt gggtggccag 3180 ttcaagcgag gttgtccgcg cgctgggcga gaaggaagcc gatcagaaag ttcatctcag 3240 ccaaaacaag acgaccggtt acgaacgagt gctgggcatt gtgtggaaca cgagcaccga 3300 cgagttcttg ttctcggccg agatgagaga ggatctggag cggtacttga aaggagagct 3360 gcggcccacg aagcgggtgg tgtcgagctg cgttatgagc ctgttcgacc ctcaaggttt 3420 tctgatctcg ttcacgatct tcgggcggat tttgatccag gacttgtggc gggctggctg 3480 tgattgggac aaggcggtcg acgacgaagc gtgggagaaa tggaagcgct gggtgagtcg 3540 cctgcaagaa gtcgaaggcg ttcgaatccc gcggtactac ttccaggggc gcgcacgatc 3600 tgaactacga gacgttggag cttcatgtgt tctgcgacgc gagtcagtgc gcttacggtg 3660 ctgtcgcctt cttctggatt atgtcgtcgg taggcccaat ctgcgcgctc gtgtccggga 3720 agtcaaaggt ggcaccgctg aagctactca gcatcccgcg cttggaactg cagtcagatg 3780 tggtcggctc gagactcatg aacttcgtgc aggagaacca caccctcaag attgcggaac 3840 ggttcatctg gtcagactct gaagttgctt tgtcgtggat ccattctgac cagcgaaagt 3900 ataagcaatt cgttgccttt cgcgtcggcg aagttctgac gctgaccaag cagagcgagt 3960 ggcggaaggt gccatcaaag ctgaatctgg cagacatgct caccaagtgg gacaacaagc 4020 acagtttccg gcaggacggg ccatggtcac gagcgccgga attcatcgaa gaagccaaga 4080 gactgtgggc catctcgaag cagcaggtgc agctgaacac gcccgaggag atgcgagcga 4140 gtgtgctgta ccacagcatc attgtcaccg agcaaatcat cgatcctcaa aacttctctc 4200 gctggacggt gatggtaaga tccgtggcgt gcctggctcg atttcgctcc aattgtcgca 4260 gaaaagcaaa gggtttgccg atcgaagcac ttcccctttc gaaggcgatg aaggggctcg 4320 tgaaaaggac cgtgccagta gttccagtcc cgctgaagcg ggaagaatac cagatggcgg 4380 aaacatactt gtggagatgg gcgcaagcgg gctactttcc ggacgaggtg acgaccctcc 4440 ttaaaaacct cgaactcccc gcggacaaga agctatccct ggagaagagc agcgtgctgt 4500 acaagctgac cccgtttctg gatgagcaga aagtgatgcg agttgacggt cgacttgagc 4560 gagctaccat ggttccgttt gaagttcgct ttcccgtcat cctgcccaaa ggtcatcccg 4620 tgacaacgaa gctgctggag cactatcacc agaagatggg ccacgcgtac ttcgagacgg 4680 caatcaacga gcttcgacag cgtttcttca tcccgaacct acgagccgag ctgaagcggg 4740 tgatgacggc gtgcgtgaag tgcaaggtgg agaaaagcgt accggcgatc ccgagaatgg 4800 caccacttcc agtccagcgt gtgactccat atcagcgggc gttcagctac accggggtcg 4860 actacttcgg cccagtagca gtgacggtcg ggcggcgttc cgaaaagcgt tgggtcagcc 4920 tcttcacgtg tttgaccacg cgtgccatcc atctagaagt cgtgcatagc ttgaccacac 4980 agtcgtgcgt catggctatc agacgattcg cttgtcggag gggaatgccc atcgagttct 5040 tcagcgacaa cggcacgaac ttccagggcg cgagtaagga gatcgtacgc gtggacgcgg 5100 aatgcagaga agagttcacc gacgccagaa cgagctggaa cttcaaccca ccatcagcgc 5160 cgcacatggg cggggcctgg gagaggcttg tgaggtcggt caaggaagca ctgaaggcgt 5220 tcgacgacgg gaggaagctg acggacgaag tgttgctgac cagtttggcg gaagcggagg 5280 atctcatcaa cacacgtccg ctgacgtacc tgtcaaccga agcaggatca accgaagctc 5340 tcacgcccaa ccatttcttg cgaggagtga cgactaacga ccgcggagct caacgcgctg 5400 aatcgaccag tccagccgaa gctctccgag actactacaa gcggtcgcaa ctccttgctg 5460 actgtttttg gaaacgctgg attgtggagt accttccgac cctgaaccaa cggacgaagt 5520 ggcacgcgga ggcggagcct atcgcgtgtg gagatgtcgt ctacgtcgtg gaaggcgcta 5580 atcgcggtaa ctgggtccgg ggtatcgtga cggaggtctt caagggtgcc gatggacgca 5640 ttcggcaggc gatggtgagg acaagcagag gtgagctgaa gcggccggtg agcaaactgg 5700 cggtgcttga gattcaggag cgtaaaactg gtgcgaagga ggctccaccc accagagtta 5760 cggggcgggg cga 5773 // ID TRAS6_BM repbase; DNA; INV; 1908 BP. XX AC AB046671; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 30-JUN-2010 (Rel. 15.07, Last updated, Version 3) XX DE Bombyx mori TRAS6 gene, non-LTR retrotransposon, partial cds. XX KW R1; Non-LTR Retrotransposon; Transposable Element; KW endonuclease domain; reverse transcriptase domain; TRAS6_BM. XX NM TRAS6_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Kubo Y., Okazaki S., Anzai T. and Fujiwara H.; RT "Structural and phylogenetic analysis of TRAS, telomeric RT repeat-specific non-LTR retrotransposon families in Lepidopteran RT insects."; RL Mol. Biol. Evol 18(5), 848-857 (2001). XX DR Genbank; AB046671; Positions 1 1908. XX CC Experimental evidence. XX SQ Sequence 1908 BP; 623 A; 389 C; 482 G; 414 T; 0 other; gggactgtta aagctgcgat agttgtcttc aacaacgact tcaaagttat acagtatcca 60 aaactcatca ccaaaaacat catggtggtg gggatccaaa cggttgcttg ggagatcaca 120 ctagtctcct tttactttga gccggactcc cctatagaac cctacctgga gcatctgaaa 180 aggatagagc tcgagattgg gtctataaaa ttgctcgttg gaggagatac gaacgcgaaa 240 agctcgtggt ggggaagccc aataatagat cataggggtg aagacctttc tgggacgctt 300 gaggaaatgg gcctacatat actcaacgca ggtgaaactc cgactttcga ctgcgttcga 360 ggaggcaagc ggtatacaag ctatatagac atcaccgcgt gttcagtcga cctactagac 420 ttggtggacg gctggaaaat tgacgaaggt ctcacgagct cggatcacaa cggtattgta 480 tttaatattc ggctacaaag gtcaaaaggc attaatatct caagaacaac taggaaattt 540 aacacaaaaa aagcaaattg gcctaagttt catgagaagc ttagccaatt aatgcaagaa 600 aacaaactaa cagcagcaga aataaataac ataaacacta tagaacaatt ggaaaaaaca 660 ataaatacac ttaccaaaac aatagataat acatgtacaa tctcaatacc aattaaacaa 720 acaaaagaaa aacttacctt gccgtggtgg tccgagaaac tagctgggat gaagaaggaa 780 gtcgccacca ggaaacgtag agtgcgaaac gccgctccaa ttcgtaggtc caaggtcgtt 840 gaagagtacc taaaaaagaa agaagagtat gaggaggagg cagccaaagc gcagacagat 900 agctggaaag atttttgttg taggcaaggc ggggaggggg tttggagcgg aatatataga 960 gtaatatcga gaacgactac tagggaggaa gactctatac tggtaaagga cggagagttc 1020 ctggacgcga aggggtccgc aaagttacta gcggataact tctatccgga ggatctgagg 1080 accaacgata acgcctatca ccgccagatt agaagtgagg ctaatattgt gaatgttggt 1140 aaacaaactg agtattgcga cccacctttc acgatggccg aattgagaca ggcgagtgga 1200 tccttcaacc caaaaaaggc cccgggcatg gacggtttca ctgcggacat ctgctgccat 1260 accatagaag ccaatccaga actctttttg tcgttgctca ataaatgcct ggagctatat 1320 catttcccca tggcttggaa ggtagctaca gttgtaatgc tgaggaagcc aggaaaagga 1380 gactacacca ctccaaaggc atacagacca attggactac tgcctatact aggcaagatt 1440 tacgaaaaga tgctggtgac ccgcctcaaa ttccatctat taccaaggat gagtactcgc 1500 cagtacggat tcatgccaca gaggggtgcc gaagactccc tctatattct gatgcaacat 1560 atccgcaaga agctaaaaga aaagaaaata attgcattag tatcgttgga tatagaggga 1620 gccttcgaca gtgcctggtg gccagcaata agagtccgac tggctgagga aaagtgtcca 1680 gtaaatctga ggcgggtcat agacagctat cttagtaaca ggaaggtggt ggtcaagtac 1740 gctggggagg aatatgataa gggaacgaat aagggatgtg ttcaaggctc aatcgggggc 1800 ccaattttat ggaacctgct gctcgaccct ctcctgaaaa gtctcgaaaa cagtggagag 1860 tattgccagg cgttcgcgga tgatgtggtc ctggttttcg acggagac 1908 // ID Gypsy-28_CQ-I repbase; DNA; INV; 4445 BP. XX AC AAWU01011844; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_CQ_; KW Gypsy-28_CQ-LTR; Gypsy-28_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4445 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 435-435 (2011). XX DR GenBank; AAWU01011844; Positions 17509 13065. XX CC Positions [3363-3881] - Integrase core CC 'AAAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 128..2071 FT /product="Gypsy-28_CQ-I_1p" FT /translation="MDELKALLKNQNKLFEDLLKKQQAEPRAVAPGPHNVP FT LPPPLSLEGDMDENYAFFEDNWNNYATAVGMGDWPEADNPKKVSFLLSIVG FT PDALKKFSNFDLTQADRATPTTVLAAIKKKVTRTRNVIVDRLDFFSAAQSP FT VESIDEYTSRLKSLAKPAKLGAVEAELITFKLATSNKWPHLRSKMLTMADL FT SEAKAVDLCRVEEITAKHVQVLSADKLSEVNKLKASSSKARQCKFCGDWHA FT FTKGSCPAYGKKCKLCSGKNHFEKVCRKNRARMSSSKRNSVRRVKKINDES FT SEDSESDDEWSDQSGTETEIEGEIGKIFDNSKKGGNVLAEVSLKVKGKWKS FT VKCKLDTGANTSLIGHDWLCKLTGESDPELQPSPFKLQAFGGGIINVMGQV FT KLPCKCQNKKYILVLQVVDVSHRPLLSLKVCTTFGLIKFCNSVSMVPAKAT FT PEGAQDLMRIYRIEAEKIVDEFGDVFRGYGKFDGEVTLEIDESVPPVIQQP FT RQVPIALRPKLKAELDQLEKDGIIAREYSHTEWVSNILLVKRGIAGAESIR FT ICLDPIPLNKALKRPNLQFVTLDEILPELGQAKVFSTVDARKGFWHVVLDE FT QSSKLTSFWTPFGRYRWLRLPFGISPAPEIFQSQPAGDSPRIERGRVHSR" FT CDS 2103..4421 FT /product="Gypsy-28_CQ-I_2p" FT /translation="MEEALRDHNANLKNLLSRLKLNHVKLNNSKLKLCETS FT VKFYGHVLTTEGLQPDHTKISTIKHYPVPTSRTELHRFIGMVTYLSRFIPN FT LSAAFTRLRRLISEKEPWRWTREEDEDFSKVKSLVSDITTLRYYNVREPLT FT IECDASCFGLGVAVFQKDGIVGYASRTLTDTEKNYAQIEKELLAILFACVR FT FDQLIVGNPQTTVKTDHKPLINIFNKPLLTAPKRLQHMLLNLQRYHLALEF FT VTGKENVVADALSRAPHNDSTVRDEYQKLNIYKIFKQLEDCNVSGYLSISD FT NCLDTIMKATEQDAALLTVIDFIRNGWPSTIDRVPSAAKIYFKYRSELSTQ FT DGLVFRNDRILIPCALQRSMIDKVHVSHNGIESTLKLARENIFWPGMSAQI FT TDVVKECHVCAKFAASQQKPPMQSHAVPIYPWQVVSMDVFFTSYQGKRHQF FT LVTVDHYSDYIELDILKDMSARSLVETCRKNFARYGCPQIVVTDNGTNFVN FT EEMKEMAIKWNFKHSTSAPHHQQANGKAEAAVKIAKRLIQKAEETDQDVWY FT VLLHWRNIPNKIGSSPASRLFSRSTRCGVPASFEKYTPRIVQNVPEAILEN FT KRKVKYYYDRKSRNLPALETGSPVYVQAHPEISKTWTPAIVTEKLNDRSYV FT VDVNGASYRRDLTNLRPRKEPTTIPLVPAQSHPPTESSAAQTGADLTAEPE FT ASSLAAMRFDETATTTTTSPVVNQSAAATPSVRERVSQLAAINTPGKQTTD FT ATNRPTRERKIPSKFADYVLET" XX SQ Sequence 4445 BP; 1188 A; 1164 C; 1230 G; 863 T; 0 other; tggtgtcaga agtttttgcg gcataaagtt cgagaaacgt tgagaatcgt cgggcatcgt 60 acaacccggg atcgcgtgaa gagtcagtga aggaagcgat tcaaacttcc accgaaaatc 120 cacaacaatg gacgagctga aggcgctgct aaagaaccag aacaagctct tcgaagattt 180 gctcaagaag caacaggccg agccccgagc ggtagcacca gggccccaca atgtgccgct 240 ccctccacca ttgtccttgg aaggggacat ggacgagaat tacgcgttct tcgaggacaa 300 ctggaacaat tatgccacag cggtgggcat gggcgactgg ccggaagcgg acaacccgaa 360 aaaagtgagc tttttgctgt cgatcgttgg ccccgatgcg ctgaaaaagt tcagcaactt 420 cgatctgacg caagccgaca gggctacgcc gacaaccgta ttggccgcca tcaagaagaa 480 ggttactcga accagaaacg tcatcgtgga ccgactggac ttcttctcgg cagcccagtc 540 accggtggaa agtatcgacg agtacacatc ccgactgaag tcgctggcca agccggccaa 600 actcggagcc gtggaagcag aactaatcac cttcaagctg gccacctcca acaagtggcc 660 tcatctgagg tccaagatgc tcacgatggc ggatttgtcg gaagcgaagg cggtagatct 720 gtgccgtgtg gaggaaatca cggccaagca cgttcaggtg ctatcagcgg acaagctgtc 780 ggaggtgaat aagctcaaag catcgtcgtc gaaggcccgc cagtgcaagt tctgtggcga 840 ttggcatgcg ttcaccaaag gatcgtgccc agcctacgga aagaagtgca agctttgttc 900 cggcaagaat cacttcgaga aggtgtgccg gaagaatcgg gctcgaatga gttcgtcgaa 960 gcgcaacagt gtgcgccgcg tcaagaagat caacgacgaa tcctcggagg acagcgagtc 1020 ggacgacgag tggagcgatc agtccggaac agagacggag attgaaggcg aaatcggcaa 1080 gatcttcgat aactcgaaga aagggggaaa cgttcttgcg gaagtgtccc tgaaagtgaa 1140 aggcaagtgg aaaagtgtga agtgcaagct ggatacggga gcaaacacga gcttgattgg 1200 ccacgactgg ttgtgcaagc tgacgggaga gtcggacccc gaattgcagc cgtctccgtt 1260 caaactccaa gcgttcggcg gtggcatcat taacgttatg ggccaagtca agctgccgtg 1320 caagtgccag aacaagaagt acattctcgt gcttcaagtc gtcgacgtca gtcacaggcc 1380 gttgctgtcg ctgaaggtct gtaccacgtt cggcctgatc aagttctgca actccgttag 1440 catggttcca gctaaagcga cgccggaagg agctcaagat ctgatgcgaa tctaccgcat 1500 cgaagctgag aagatcgtcg acgagttcgg cgacgttttt cgtggctacg gaaaattcga 1560 cggagaagtc accctggaga tcgacgagtc ggtgccgccg gtgattcagc agcctcgcca 1620 agtcccgata gctctgcggc cgaagctgaa agccgaactg gaccagttgg agaaggatgg 1680 aatcatcgcg cgagagtaca gccacaccga atgggtgagt aacattcttc tcgtgaaacg 1740 gggtatcgcc ggagccgagt cgattcgcat ttgcctggac cccattcctc tgaacaaagc 1800 gttgaagaga ccaaatctgc aatttgtgac gctggacgag attttgccgg agttgggtca 1860 ggcaaaggtt ttctcgaccg tggatgcacg gaagggtttc tggcacgtgg tgctagacga 1920 gcagagcagc aagttgactt cgttctggac tccatttgga cgctatcggt ggctgcgact 1980 tccgtttggc atctccccgg cgccggaaat tttccagtcc caacctgcag gggatagtcc 2040 aaggattgaa cggggtcgag tgcatagcag atgacgtctt ggtgtacggc cgtggtgcaa 2100 cgatggagga agcgctacgt gaccacaacg caaatctgaa gaacctcctc tcgcgcttga 2160 agctaaacca cgtcaaactc aacaattcga aactcaagct ctgtgagaca tccgtcaagt 2220 tttacggaca cgttctcacc accgaagggc ttcaaccgga ccacaccaag atctcgacca 2280 taaagcacta cccggtgcca accagcagga cggaactgca ccgtttcatt gggatggtca 2340 catacctcag ccgattcatt cccaatttga gcgccgcctt cacaaggctg cggcggttga 2400 tttcggagaa agaaccgtgg cgttggactc gagaggagga cgaagatttc tccaaggtga 2460 agtcgctagt gtccgatatc acaacgctgc ggtactacaa cgttcgagag ccgttgacta 2520 tcgagtgcga cgcgagctgc ttcggtctcg gagtggccgt cttccagaag gacggcatcg 2580 tgggctacgc atcgagaacg ttgacggaca cggagaaaaa ctacgctcaa atcgagaaag 2640 agttgcttgc gattcttttt gcatgtgtgc gctttgacca gctgattgta ggcaaccccc 2700 aaactacggt gaaaaccgac cacaagcctt tgatcaacat tttcaacaag ccattgttga 2760 cggctccaaa gcgactgcag cacatgctcc tgaacctgca acgttaccat ctggcgttgg 2820 agtttgtcac cggaaaggag aacgttgtgg cagatgcgct ctctcgcgca cctcacaacg 2880 actcgactgt gcgtgacgag taccagaagc tgaacatcta caagatcttc aagcaactcg 2940 aagattgcaa cgtaagcggc tacctgagca tctccgacaa ctgcctggac acgatcatga 3000 aggcgactga acaagacgca gccctactga cggtgatcga tttcattcgg aacggttggc 3060 ccagcaccat cgatcgcgtt ccctcagccg ctaaaatcta cttcaagtac cgaagcgagc 3120 tgtccacgca agacggattg gtgttcagga acgatcgtat tctgatcccg tgtgcactgc 3180 agcggtccat gatcgacaaa gtgcacgtga gccacaacgg tatcgagtct acactgaagc 3240 ttgcgcgaga gaacatcttc tggcccggaa tgagcgcgca gattacggat gtggtgaagg 3300 agtgccatgt ctgcgctaaa ttcgctgcaa gtcagcagaa accgccgatg cagagtcacg 3360 cagtcccgat ctatccgtgg caagtcgttt cgatggacgt gttcttcacc agctaccagg 3420 gaaagcgaca ccagtttcta gtgacggtgg accactactc ggactacatc gaattggata 3480 tcctcaagga catgtctgcc agaagcctag tcgagacctg caggaagaac tttgcgcgtt 3540 acggatgccc ccagatcgtt gtcacggaca acgggacaaa ctttgtgaac gaggagatga 3600 aggaaatggc gatcaagtgg aacttcaagc attcgacttc ggcccctcac caccagcagg 3660 caaacggcaa agcggaagca gcggtgaaga ttgcgaagag gttgatccaa aaggcggagg 3720 aaaccgacca ggacgtttgg tacgtcttgc tgcattggag gaacatcccg aacaagatcg 3780 gcagcagtcc ggcgagcagg ctgttctcac gcagcacaag gtgtggagta cccgcttcgt 3840 tcgagaaata cactcctcga atcgtgcaga acgttccgga agcaattctg gaaaacaaga 3900 ggaaggtcaa atactactac gatcgaaaat ctcgcaacct acctgcactg gagactggct 3960 cgccggtgta cgtccaggct caccccgaaa tcagcaaaac ttggacgcca gcgattgtga 4020 ccgagaaact gaatgatcgc tcgtatgtgg tggacgtgaa tggagctagc tatcgccgag 4080 acctgaccaa cctgcgaccg cgcaaagaac caacaacaat ccctctcgtt ccagcccaaa 4140 gccatccacc aaccgaatca agtgctgcac aaactggcgc tgatttgaca gctgaacctg 4200 aagcgtcgtc gcttgctgcg atgaggttcg acgaaactgc aacaacgaca acaacatcac 4260 ccgttgtcaa ccaatcagct gctgccaccc cgtcggtgcg agagagagta agccagcttg 4320 ctgcgatcaa cacacccggc aagcaaacca cagatgcgac gaacagacca acacgcgaga 4380 gaaaaattcc aagcaagttt gcggactacg ttttggaaac ttaatttttt ataaagaaag 4440 gagga 4445 // ID Kiri-2_CQ repbase; DNA; INV; 4674 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4674 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 121-121 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >92% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 290..1114 FT /product="Kiri-2_CQ_1p" FT /translation="MATNTTTVDDKNNNKQFLQSSYVQGCQSSIDGRIGEN FT SMTLDIAVRMLMMQFTEMKLMLETFRNDFNRKIDTIKSELQGKVSVLQSDI FT TTLKAEYEGKFVNQDAALGQINHRVNHLYLNIGALENQKELIISGVPFVSD FT EDPDALFAMICRQLECSEGEELLTSTRRVYVNGLKNGDVSPLLVEFALKTT FT RDRFYSTYLRRRDLKLRHLGMPSDRRVFINENLNAGAREVKKAALLLKKVG FT KLTSVFTKGGIVHVKRRVGDPPIAVQSVKDLDDV" FT CDS 1272..4118 FT /product="Kiri-2_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MPVPHNHTNTNIPGAVLNAALLPGKLNICHGNAQSLC FT ARKNSKLEEVRAALSSHKVDIACFTESWLTASRNNRSIGVQGYSVVRNDRV FT YKRGGGIAVYYRDTMSCTKIFNTELTPESADKTECLALEFRLNGFKVLLVV FT VYNPPDNNCAPFLEEKLTSLTLRYESVFLVGDFNTDLLQASNKRTQLLSAF FT ESFSLVPYGCEPTYYHDTGCSQLDLLVTNDSDKILRFGQVDFPGLSQHDLI FT YVSLDFDATQPVTVNTYRDYVHFDADALKFATVSIPWRNFYAILDPNHSLE FT FFTERLKVVHDTCFPLRASSRRNAPNRWFTADVQRAILERDLAYKDWRTAA FT SDVKDLKRRTYKTLRNRANSVIERAKKTFLGGYLDANIPSKLLWTRVKSLG FT VGKDKSSQPCDHDPDEVNRMFLSSFTPAETRNDQHARILPSQYNFSFRNVH FT YWEVVNAICEVKSNAVGMDGLPIRFLKIVLPLVIQQITHMFNLFINTSTFP FT NLWKHAKVLPLKKKSNSNDVTNLRPISILCSLSKAFEKLLDQQMANYIDNN FT HLLTEHQAGFRKGQSVKTAALRVHDDLASTIDKRGVGVLVLLDFSKAFDTI FT PHNKLLDKLETEFNFSSTALNLMNSYLRRRKQTVFCDNRRSDCAEPTSGVP FT QGSIIGPRLFCCHINDLPNVLKHCAIQLYADDVQLYIRGVGPCARDLIRMV FT NQDLQNVADWSNRNRLLVNPAKSKALFIQSANRRAALVPDLRIEMNGEQIQ FT WTERASNLGFIFQSDLQWDGLVSQQCGKIYAGLRTLYSCTPTAPVATKLKL FT FKALVLPHFLFADVFFVRLSSSLFNRLRVALNCCVRYVYGLRRRDHVSHLQ FT KNLVGCPMLNLFAYRSCLFLRNLLSSQTPPALYQKLILSRSQRLRNLVVPA FT NRTAGYACTMFVRGVVHWNALPPAVKNSLSDATFKRGCKEHWNEV" XX SQ Sequence 4674 BP; 1286 A; 1238 C; 1079 G; 1071 T; 0 other; acgtaaacaa cactgacgag caagatgtgt tgtgagtagc gcggagtaaa caaacttcaa 60 tagtgatgta attcaaactt accgttcccg aaatcccacc cgcgaaaacc gaagtacccg 120 tacgaaaccc gtgtgtgtta tccgcggaat tcgtcgtcat ctggaccaaa taaaacaaat 180 cgaaatttgc cccgctgagc tggatccacc accgccatcg caaacaagcc gcaccgctga 240 acgccgtcac gcacactttt gctgttttcg tagtgcatac tgctgcacga tggccacgaa 300 cacaactact gtcgacgaca aaaacaacaa caagcaattc ctgcagtcgt cgtacgtcca 360 gggctgtcaa tcgtcaatcg atggaagaat tggggaaaat tcgatgacgt tggatattgc 420 tgtgaggatg ctgatgatgc agttcaccga gatgaagttg atgcttgaga cttttcgcaa 480 cgacttcaac cgtaaaatcg acacaatcaa atcggagctt caaggcaagg tgagcgttct 540 tcaaagcgac atcacgacac tgaaagcaga atacgaaggc aagtttgtga accaggatgc 600 tgcgttgggc caaataaacc atcgtgttaa tcacctctac ctgaacatcg gtgccttgga 660 gaatcagaag gagttgatca tctcaggtgt gccattcgtg agtgacgaag acccggacgc 720 acttttcgcg atgatctgtc gacagctgga atgcagcgaa ggtgaagaac tgctgacgag 780 taccagacga gtttacgtga acggattgaa gaacggggac gttagccctc tactcgtgga 840 attcgctcta aaaactacac gcgaccggtt ctacagtaca tacctgcgta ggcgggacct 900 taagctgcga catctcggca tgccgtcgga tcgacgtgtg tttatcaacg agaacctgaa 960 cgctggcgct cgtgaggtga aaaaggcggc cctcctgctg aaaaaagtgg gtaagctgac 1020 atccgttttc accaaaggag gaatagtcca cgtaaagcgc agggttggtg acccaccgat 1080 cgctgttcag tctgtgaaag atctggatga cgtgtaaccg tagtcaaagc gtcacttttg 1140 aagttatgtt gaagttatgt ttatgtttta ttgtgaaact tgtaaaattg tctttttgaa 1200 attagacgtt aagaagccct ttgaagctgc ttctctaaaa cacccccccc ccccccttac 1260 acacacacat aatgcctgtc ccacacaacc acactaacac gaacatccca ggagccgtgc 1320 taaatgctgc tcttctccct ggtaagctca acatctgcca cgggaatgcc caaagtctgt 1380 gcgcgaggaa aaactctaag ttggaagaag ttcgagctgc cctgtcgagc cacaaagttg 1440 acatcgcgtg ctttactgag tcttggttga ctgcctcaag gaataaccgc agcatcggcg 1500 tccaaggata ctccgtcgtg cgtaacgata gggtgtacaa gcgaggcggt ggtatcgctg 1560 tttactacag ggatactatg tcgtgcacta agatcttcaa caccgaacta acccccgagt 1620 ccgccgataa aaccgagtgt ctagccttag aatttcgtct gaacggattc aaagttctgc 1680 tggtagtcgt ctacaaccct cctgacaata actgtgcacc ttttctcgag gaaaagctga 1740 ctagtctcac tttgcgctac gagtccgtct ttctggttgg cgattttaac acggacttgc 1800 ttcaagcgag caataagcgt acgcagctcc tctcagcatt cgaaagtttt tcactcgttc 1860 cttacggctg cgagccgacg tattaccacg atactggatg ctcccaactg gatctgttag 1920 tgaccaacga cagtgacaaa atcctacgtt tcggtcaagt cgactttccg ggattgtcgc 1980 agcatgacct gatctacgta tctctggact ttgacgctac gcaacctgtc acggtgaaca 2040 cctatcgtga ctacgtgcat tttgatgctg atgccctgaa gttcgcaacg gtcagtattc 2100 catggagaaa cttctacgca atcttggacc ccaaccactc cttggagttc ttcacggagc 2160 gactcaaggt tgtgcacgac acctgttttc cgcttcgggc gagctctcgg cgcaatgcac 2220 cgaatagatg gtttacagct gatgttcaac gagcaattct agaacgagac ctggcgtaca 2280 aggattggcg aactgctgcg tcggacgtaa aggacctgaa gcgacgtacg tacaaaacgc 2340 tgcgaaatcg tgcgaactca gtcattgaac gggcgaaaaa gacgtttctc ggtggttacc 2400 tggacgcaaa tatcccatca aagctgctct ggactcgcgt gaagagcctt ggagttggaa 2460 aagacaaatc atctcaacca tgtgaccacg atccggatga agtaaatcgt atgtttctct 2520 cgagcttcac accagctgaa actaggaacg atcaacacgc aagaatactg ccatctcaat 2580 acaatttctc gttccgcaac gttcactact gggaggtcgt taacgccatc tgtgaagtga 2640 agtcgaacgc cgtagggatg gacggtttgc caatcaggtt tctgaaaatc gtgctgccac 2700 tagtaatcca acaaataacc cacatgttta acctgttcat caatacatct acctttccca 2760 acctttggaa gcatgctaag gttctgcccc taaagaagaa atcgaactcg aacgacgtga 2820 cgaacttgcg gccgatcagc atcctgtgct cgttgtccaa agcatttgag aagcttctcg 2880 atcagcaaat ggcgaactac atcgacaaca accatcttct aacagaacat caagcaggtt 2940 tccggaaggg ccaaagcgtc aaaactgcag cgctccgcgt ccatgacgat ttggcgtcca 3000 caatcgacaa gcgtggtgtt ggcgtactgg tcctgttgga tttctcaaaa gctttcgata 3060 ccatccccca caacaaacta ctggacaagc tggaaaccga attcaacttc tcttcgactg 3120 ccttgaacct aatgaactcc taccttcgaa gacggaagca gacagtgttc tgtgataatc 3180 gccgctcaga ttgtgctgag ccaacatcgg gcgttccgca aggatcaata atcggtcctc 3240 gattgttctg ttgccacatc aacgatctac cgaacgtcct caaacactgt gcaatccaat 3300 tgtatgcaga tgatgtccag ctctacatcc gtggtgttgg tccatgcgcg cgtgatttga 3360 tcagaatggt aaatcaggac ctccagaatg ttgccgattg gtcgaaccga aatcgtttgc 3420 tcgtaaatcc cgctaaaagc aaggcccttt tcattcaaag cgcaaaccgt agagctgccc 3480 tggttcctga tctgcgcatc gaaatgaacg gtgagcaaat tcaatggacg gagcgtgcaa 3540 gcaacctagg attcatcttt caaagcgacc tgcagtggga cggtcttgtg tcacagcagt 3600 gtggcaaaat ctatgctggc cttcgcacac tctacagctg cacaccgact gcacccgtgg 3660 cgacaaaact gaagctgttc aaagcccttg tcctgcctca ctttctgttc gccgacgttt 3720 tctttgtccg tctttcgtcg agcctgttca accgactacg cgtggccctg aactgctgcg 3780 tacgctacgt ttacggatta cggaggcgtg accacgtgag ccaccttcaa aagaacctcg 3840 ttggatgtcc gatgctaaac ctgttcgctt accgatcctg tctgtttttg agaaacctgt 3900 tgtcgtccca aacaccacca gcactctacc agaaactcat cctttcacga agccaacgtc 3960 tacggaactt ggtcgtacca gcgaatcgga cagcaggata cgcttgcact atgttcgtca 4020 gaggagtagt gcattggaac gcactccccc ccgcagtcaa aaacagttta tccgatgcaa 4080 ctttcaagag aggctgcaaa gagcattgga acgaagtgta accactgtcg cgcccggagg 4140 tcccggggcc cccctaacca caagtcaaag gaagagtgga atagggacaa ccctccggct 4200 cctccggggt gcctggtaaa cacgacacgc aaaccgacga tacgccaacc gacgaaacac 4260 cgacgaaaca ccgacaggag aatccactga ggttcaggag acccggggcc cccctaacca 4320 caagtcaaag gaagagtgga atagggacaa ccccccgggc tcctgatgcc ccctggaaga 4380 ggtcgcacaa ggggcaaacc cccatcccct cggataaacc ctggacttcg ccgcacgaca 4440 acgctacacc accagcaaca aacgaccaac aaacgacgac tacgggagga cgatttaaac 4500 ctgcgaaaat ttattaaaag tgactctagt ttttaaaatt tagtgcttca aattattaat 4560 attaaattta ggttatcaaa ttattgcatg cacgtcgatc tacaccggaa gtagcaattt 4620 taaaaggagc aatccttacg ctaccagata taataaacaa acaaacaaac aaac 4674 // ID Mariner-30_HM repbase; DNA; INV; 3690 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-30_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3690 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1964-1964 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1216..2955 FT /product="Mariner-30_HM_1p" FT /translation="MKPNTKLIRKNSLDVEDAVKSVLKGMSIRDAAEMYSL FT SKSAVGRAVKMARCQKLPDYKHIVNIGNRKIFHVSEESAIADYLQTSSNMC FT YGLTKLQTRQLAYKYGIALNSNSIPESWHQNKMAGKQWLESFLIRHPKLSL FT RKPEKVSLARATAFNEHTVSMFFDNLLEVQLKYKFKPECIFNADETGLLTV FT TDPPKIISTRGTKRVSQAVSAERGSLVTMLAFVNAMGNTVPPVFIFPRVNY FT KDFMICGAPTGSLGLCNKSGWMTSENFLVAMKHFVSHAKPSAEHPVLLLMD FT NHESHISFETITFAKENSLILLTFPPHCSHRLQPLDVSVFGPFKSYFRLAQ FT NEWLASNPGRIINIYNLPQLACKAYNMSFTIKNVCSGFSKCGIYPLNRQIF FT GESDFISSSVSDRPLLVEQPEIKQPDNQQSYEINKQVDSSESSVKIVSPES FT IHPYPKAPPRKDSKKRRKQGKSTILTKTPEKTVNKKNIFETGKITTQNLKR FT KIDLKRGKETEIFDADFLIVSSVKNAPVNQQQQNTCKICKCNWKNFVGPAW FT IQCFTCQWWICGNCNNNSKKRYYECELCEVDC*" XX SQ Sequence 3690 BP; 1295 A; 493 C; 579 G; 1323 T; 0 other; ggggagaacc ttaatgaatg ggacaattat tgaatgggac aaaatcaaat ttgcatagaa 60 gcatctacgg ttggctatcc aaactcctgt ggtaataagt gtttgacaca accattcaac 120 agaccaaatc tttatttcta ttatggtcta aacgtaatgg cattggatgt tattgtgttt 180 tgaagaacaa taagtaattt ttaaaaacaa ttttctgtgt ctgattttgc agcgattact 240 aaagttaagg aaaatgaaac tccaattttc aacacccaaa tgtctctact ttttaaattt 300 gtaaagtatt tttgtttagg tattactgtt tttaacctgt cctgtcattt gtggatatag 360 tgggaagcct attgaatggg acaaagtatt gattattgaa tgggacttgt ccaaacagcc 420 tgatatgaaa aaagtagaaa aatatatttc ggcaacttag ttatggtcac acataatttt 480 atttttaaat acgttttaat ttgatgttaa aatgtggttc taaataagtg tgaccaatgc 540 tttagctaaa attcaaaatt gtgttcatag aactaaatct ctattaaaat cttttcttta 600 gtattcttta aagttgagtc gatcaatagt ttattaaatg aaattaattt taaatattta 660 attagatatg ttgatgtatg atatttataa tatgtataaa tgtttaaata gatatgttga 720 tgtatgatat ttgtaatatg tataaatatg gataaatgtt taattagata tgttgttgat 780 aacagagtat taaaaagatg gcaaatatta tatatatata tatatatata tatatatata 840 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 900 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 960 cttataatta tttttttgta tttatttatt gtatttatat acatttatgt atatctattt 1020 acacacacac acatatgtat ttatatcata tactgtatgt atgtatatgt atcatacata 1080 tacatacata tatatgtgtg atacatatat atacgtatat ctgtgtgtat tttaattata 1140 tatttcatgt ttgtatttaa atatttgcat atggcatata tttggttttt tgtgaagttt 1200 caaattttat tttagatgaa accaaacaca aaattaatac gtaaaaactc tttagatgtt 1260 gaagatgcag taaaaagtgt tcttaagggt atgtccatac gagatgcagc tgaaatgtat 1320 agtttatcaa aatctgcagt aggtagagct gttaaaatgg caagatgtca aaagctgcca 1380 gactataaac acattgtaaa cattggaaat agaaaaatat ttcatgtgtc tgaggaaagt 1440 gctattgcag attatttgca aacgtcttca aatatgtgct atggtttaac taaattacaa 1500 actaggcaat tagcctacaa gtacggaatt gcattaaatt ctaacagtat tccagagtca 1560 tggcaccaaa ataaaatggc aggaaagcaa tggttggaaa gttttttgat acgacatcca 1620 aaattatcac tacgtaagcc agaaaaagtt agtcttgctc gtgcaactgc ttttaatgaa 1680 catacggtgt caatgttttt tgataacttg ttagaagttc aattaaaata taaattcaaa 1740 ccagaatgta tttttaacgc agatgagaca ggattattaa cagtaacaga tcctccaaag 1800 attatttcaa ctcgtggaac taaaagggtt tcacaagctg tttcagcaga aagaggttca 1860 ttggtgacaa tgttagcttt tgtgaatgct atgggtaata cagttccacc tgtatttata 1920 tttccgagag taaattacaa ggattttatg atatgtggag ccccaacagg aagcttaggg 1980 ctgtgtaaca aaagtggttg gatgacaagt gagaactttt tagttgcaat gaagcacttt 2040 gtttcccatg ctaagccatc tgcggagcat ccagtgctgt tattaatgga caaccacgaa 2100 agccacattt catttgaaac aataacattt gcaaaagaaa actctttgat tttattaaca 2160 tttccaccac actgcagtca tagattgcag cctttggatg tttcggtgtt tggtcctttt 2220 aaatcttatt ttagattggc tcagaatgaa tggttggcct ccaatccagg gagaattatt 2280 aacatatata atttacctca gctagcatgc aaagcatata atatgtcgtt cacaataaaa 2340 aatgtttgca gtgggttttc taaatgtggg atctatccac tcaatagaca aatatttggg 2400 gaaagtgatt tcatatcatc tagtgttagt gacagacctc ttcttgtcga gcaaccagaa 2460 attaaacaac cagataatca acaatcatat gaaataaata aacaagttga ttcttcagaa 2520 tcatcagtca aaatagttag tccagaatcc attcatccgt accctaaagc acccccaaga 2580 aaagattcta aaaaaagaag aaagcagggt aaaagtacta tattaactaa aactccagaa 2640 aagacagtaa ataagaagaa catttttgaa actggtaaaa ttaccacaca aaatctaaaa 2700 agaaaaattg acctcaaaag agggaaagaa acagaaatct ttgatgcaga ttttttaatt 2760 gtgtcatctg taaaaaatgc acccgtaaac caacaacaac aaaatacttg caaaatatgc 2820 aaatgtaact ggaaaaattt cgttgggcct gcttggattc agtgttttac ttgtcaatgg 2880 tggatatgtg gtaattgtaa taataacagc aagaagcgtt attatgaatg tgaactatgt 2940 gaagttgact gttgatatct tctacacttt ctttgatggt ttttatttca tgtactttgc 3000 aacaagcaat aattgttgat ttttatattt tggttaaaca taaatacaat gaatttttta 3060 aatatctaat aagatgcttt gttatttcgt tttttgaaag taaactagat actttatatg 3120 gtattaagtt gtttatttat aatttagcat agggatgctg caggcaatga ctcgtttccc 3180 tattggaaat gtgtcattac tgggaattat atccaagcat ttaaaaaacg ctttttgttt 3240 tactaaaaac aattattcgt gtttttaatt taattttaat atacatatat tttttaataa 3300 aactatttta ggtttgtcaa tatatgcaga cattcggcca tatttatagc tttttcgggt 3360 tccctaatgg tagaaactac ccttactgct tgactggctt gaaagttact taggacctct 3420 cttcttttat taatttttta gatttgtccc attcaataat tttatttgtc ccattcaatg 3480 agtagttctt aatgaatggg acaaatgcca tggtcttatt aaactctctt tttaccaata 3540 cccatatgcg gaatcgattt ttttgttttt gtttatcata gtacgagagt taacggttat 3600 caaactattt tcaaagtgat ttttaaacaa atttatcaac ctcaaaaaat gcattttacg 3660 aaaactgtcc cattcattaa ggttctcccc 3690 // ID Kolobok-2_BF repbase; DNA; INV; 6496 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6496 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 117-117 (2007). XX DR [1] (Consensus) XX CC Kolobok-2_BF is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the lancelet genome in a CC last few million years. The Kolobok-2_BF transposon is CC characterized by 343-bp terminal inverted repeats, TTAA target CC site duplications, and it encodes two proteins: (i) the 803-aa CC transposase, Kolobok-2_BF1p, composed of the THAP DNA-binding CC domain and catalytic "DDE" domain, which is conserved in all CC Kolobok transposases, and (ii) the 281-aa Kolobok-2_BF2p protein. CC The second protein is conserved in highly diverse Kolobok CC transposons present in the genomes of vertebrates (frog, fish), CC chordates (lancelet, sea urchin, sea squirt), and cnidarians CC (starlet sea anemone). See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS 6313..5471 FT /product="Kolobok-2_BF2p" FT /translation="MTSVGADSCVVCCSTRHFEGSRFTTNRPAFHNKPAGA FT QRSCGDSVGTSVYSSVQGAVWCSMEPGLEEEIAGMSLGPIAYNFHPRRSDR FT ERSGTGDGSVSVVRPRPRTPVSPPSEPELLESMTVSEWDGVDVETENWRLE FT QFTWCRCTNCQPMPSVRECVCCHDLTEAEKKGIGDGILCLVEHEDFHANNI FT NKTVLRSALLARVENLREALGDPILHRTYRMQAYRQCTYWLHERLGTHIRR FT VIPSCVVWAIRDAYPEKKREHYRGFLEADEVYDFIYEARR" FT CDS join(380..736,1235..2653,2836..3064,3361..3507, FT 3912..4168) FT /product="Kolobok-2_BF1p" FT /translation="MPVCVVCRNTNKNTKGISFHRFPRWEEDRLQKWLVAV FT RSKLRLPWTLEKIKDSIATNNNARVCSDHFSPSCCTDNQKAKYVPYKVPAK FT VISNDAVPTQFGLKSSRSSSECQREKRTRIQLLEDLLQPNSEQLKIPALPT FT SPKQSTFHCESQEPMNEAAVEAGHCGSTSGTNLSGEDPSSRLTSMFHTYSK FT PPCDQPPRFVNASTQTDLDGDTIASLLSTSRPKTCDTVTCTSHLPSTPKVI FT PRGSVSSLPSSTPILKHSDLDQSFLSSPSVVDMKDTSFHPSDFTSDSENEE FT EIDCGEDEGGDDGEDDFISGDDDSQYYVVHKDKIRERFKTCHCGEPLLITE FT QFTTGSMLTVKYECTSFHRGTWESQPKVGRMAEGNLLTSAAILFAGGSYQK FT FSDICNTLKLKLFSETYFMNVQRTFLLPAINDFYISQQEFTLDAFRNAGTE FT SPEDQIQVTLLGDGRCDTPGHNAKYCSYSLMEETSQLILDFQLVQVSETTS FT SNAMERLGFERSLDFLEEEGIKIDCIVTDRHRGVGAVLKKRRDINHQFDLF FT HFAKSITKKLTQEASRRAKKDLGPWIKFIVNHLWHISDTCQGDDVLLQEKW FT LSMLKHIGNIHVFPENSIFTRCAHPHLDRADTDDIMWLRPGSPPHQALYNI FT ATNKTLLKDLSHLTGFKHTGTVEVFHSMLLKYAPKRQHYFFHGMKGRLQLA FT VLDHNENVLREQAKTLEDEARRSAAFSKRTKRWVYRRLFEGKTYEFREYLM FT ADVVRRQASGQYKCGKVVLPDPGPDISGNIAPEERGDKQAIIDGYMTRFGL FT R" XX SQ Sequence 6496 BP; 1868 A; 1421 C; 1376 G; 1831 T; 0 other; agcaaaacac aaggcagcgt gaaacctttt attattttac gcaaagtacc cccaaaagtc 60 gatttacgag ttgttggtat aggccttcgt acctacgagc ggtgccggca gatcatattt 120 ccggctaacc gagaatcccg gggactcccc gccgcggcca ttttgcctac atccgggtac 180 ttcatgacgt cagtgggagc agattcgtgc gtcgtctgct gttccactcg ccattttgaa 240 ggatcgcgtt tcacaacaaa ccggccggcg tttcacaaca aaccggccgg cgcgcagcga 300 agctgtggcg attccgtagg cacgtcagtt tattccagcg tccgctcacg tcctgccgaa 360 acgcggaggg agggaaacca tgccggtctg cgtggtgtgt aggaacacca acaaaaacac 420 gaaggggata agcttccatc gctttccccg ctgggaagaa gacaggctac agaaatggct 480 ggtcgcggtt agatcaaagt tgcgactgcc atggactctc gaaaagatca aagatagtat 540 cgctacaaat aacaacgccc gcgtctgctc ggatcatttc tcgcccagct gttgcaccga 600 caaccagaag gccaaatatg tgccttacaa agtgccagcc aaagtgatat ccaacgatgc 660 agtacccaca cagtttggcc tgaaatcatc gcgctcctca tccgagtgtc aacgtgaaaa 720 gagaacgcga atacaggtat cctactgctt tttctgcttc acgtttattc aaattgttat 780 acatgtatat gcaaaaaatt gattgaacca gatattttgt gctgttcatt aatttaaact 840 gttagacatt gtttctggtt gaatacattt ttcatggctc aaaaaaaaaa tgtaagaatt 900 tatgatacca aatgtctgaa aacatgtttt tagcaatgta tatactatat gagacttctg 960 actgaacaat tttgtcaaga atcataagca taataaggca gatgttaatt ttagatgtta 1020 agcttgttga ggcttgaagg ctttacctcc agctcccttg gcttttctat agtgttttac 1080 cttttatgtt tacatgttgt ggggttgggg tcagaggtga tttatgaaat gaaaggttca 1140 tttagctttt gtgtaaccac agtgaatccc tatttaatat gttagtttta tcattccaca 1200 tgaaatcaat tttttgttta tgtccatatt tcagctcctg gaagatttgc ttcagccaaa 1260 ttctgaacag ctcaaaatac cagcacttcc cacaagtcca aaacaaagta catttcattg 1320 tgagagccag gaaccaatga atgaggcagc tgttgaagca gggcattgtg gcagtactag 1380 tggtacaaac ctcagtgggg aagatccttc cagcagactc accagtatgt ttcatacata 1440 cagcaagcct ccatgtgacc agcctcccag gtttgtaaat gcaagtaccc aaacagacct 1500 cgatggtgac actattgcat cactactttc cacttcacgc cccaaaacat gtgacactgt 1560 gacatgtaca tcacatcttc cttccacacc caaggtgata cctagaggct ctgtttcatc 1620 actgccttcc tcaaccccca ttctgaaaca tagtgatctg gaccaatcat tcctatcatc 1680 tccaagtgta gtagacatga aagacacttc cttccatccg tcagatttta catctgattc 1740 agaaaatgaa gaagagattg actgtggtga agatgagggt ggggatgatg gagaggatga 1800 cttcatttct ggagatgatg actcgcagta ctatgtcgtg cacaaagaca agattcgtga 1860 aaggtttaag acatgccact gtggggaacc cttattgatc acagaacagt tcacaactgg 1920 ctccatgctg actgtcaagt acgagtgtac cagttttcac agagggacct gggaatccca 1980 gccgaaggtt ggacgtatgg cagagggcaa cctcttgaca tctgcagcta tactttttgc 2040 tgggggaagt tatcagaaat tcagtgacat ctgtaatact ctaaaactga aactgttttc 2100 cgagacgtac ttcatgaatg tccaaaggac attcctccta cctgccatca atgatttcta 2160 catatcacaa caggaattca cccttgatgc attccgtaat gcaggcacag aaagtcctga 2220 agatcaaatc caggtcactc tcctaggtga tgggagatgc gacactcctg ggcacaacgc 2280 aaaatactgt tcttacagtc taatggagga aacgtcacaa ctcatcctcg atttccaact 2340 agtacaggtt agcgagacaa caagctctaa tgctatggaa aggttgggtt ttgaaaggtc 2400 gctggacttc ctggaagagg agggaatcaa gatagactgc attgtaacgg accggcaccg 2460 gggagtgggg gctgtcctga agaagagacg tgacatcaac caccaatttg atctcttcca 2520 ctttgcaaaa tccattacga agaagctaac gcaagaggcc agccgaagag caaagaaaga 2580 cctaggcccg tggataaaat tcattgtgaa tcatctgtgg catatctcag atacctgcca 2640 aggagatgat gtggtaagaa gaaaattaca attgaaaaaa tataacaata aaattgaaat 2700 tgatattgta ttgccttaga cttagtgtac ctatcaacat ggaagtcaat atttctctat 2760 tatatcattg cttgtatgca tggtttaatg tgacagaagc attcctttga attaacagtt 2820 ctttattcat tgcagctcct ccaagagaaa tggctgtcca tgttgaaaca cattggcaac 2880 atccatgtgt ttcctgaaaa cagcatcttc acccggtgtg cccaccctca tctagacaga 2940 gccgacacag atgacatcat gtggttgcgg ccagggtccc ccccacatca ggcgttgtac 3000 aacatcgcaa ctaacaaaac actgcttaaa gacttgtcac acttgacagg cttcaagcac 3060 acaggtaatt agatactatt attttatcat gtgcatttgc tctaaatgtg gactgggttt 3120 aagcatacac atttacttga acctgcacaa gaacatgaaa attgtatatt agtactagta 3180 atgtcataga tgaaattctg atttcttaca atatgagtag agatgggttt tgataattct 3240 atatcatagc attgggaatg taaaaaaatt cactttgctg tatcataatg cataaaacag 3300 gaaattgtca attcaaaaac aaaaatatac atgtacactt tctttctttt tgttctacag 3360 ggactgtgga ggtgtttcac tccatgcttc tgaagtatgc accgaaaagg cagcactact 3420 ttttccatgg aatgaagggg aggttgcagc tagcagtact tgatcacaac gaaaacgtgc 3480 tgagagagca ggcaaagaca cttgaaggta cagacataac tgtggagatt tgtcctcttt 3540 atattaagtc aacctctctt tatctgcatc acaacaattg tatgcagcta ctcctcttct 3600 aaacatttca agtgtaccat atattctata tagaatgatg gctctcacct atatattttt 3660 tgtaatatat ctattatagt ttattgctta tacatgtatc catgaattgt taattgccag 3720 gagacaaaag actcatttct tatattttgt attactatca cacacacacc ccaaaacaca 3780 ctacacatgt gtaaagcatt agagatacat gtagttgaga aacttaacaa tgctatagta 3840 taactttttt tttaaataca acagttactg ttttatgatc catataactt tcctcatttg 3900 tcatactaca gatgaggcta gacgctctgc tgccttctct aagagaacga agcgctgggt 3960 ctacagaagg ctattcgagg ggaaaacgta tgaattcagg gagtacctga tggcagatgt 4020 ggtgcgaagg caggcgagtg gccagtacaa gtgtggcaag gtggtgctgc cagatccggg 4080 ccctgacatc tctggaaaca ttgctccaga ggagcgtggt gacaagcaag caatcattga 4140 tggttacatg actcgtttcg gactccgcta gctttacata ctcagagcat aatatcacta 4200 gtactgtttt cactgtagat ggaggcaagt tccataacac tgtaacactg tactgtttat 4260 tcctcttatg gatgtccacc actggggtcc atccgagtgt atatgcataa tggtttctct 4320 gggtgtatag agcccaggat aaccctacct gcctgttttg gagtttcttc agtttgtcta 4380 tgtaagcctt gctgccttgg tcccagacag tactgcagta ctgtaagtgt ggaaggaaaa 4440 gggatgtaca gtatatattg tcagtaaggc ttgtctgttg atgaaaggct tcaggcactt 4500 catgataccg atagcttgcg agagtttctt tgtgatatga atgatatgat cattccacca 4560 tagagcactg tcaaggacta cacctaagta gttaaatgtg ttaacctagt caaggtttga 4620 ttctggtgat gcgagatgca atttaatatt tgtagcttat attataaaaa tttataggaa 4680 ggagagattt aatttacatg gttgaatagc agtagtgtat gaatattgga atgtgatgaa 4740 tatgatgatg ttagaatgtg atgatattag aataagattt tgtatgctat tattaattat 4800 tgttaaagca tgcacagaaa gtggcacaca actctgagat ccctttaata tataacattg 4860 gtactttgca tacatctttc ctactgaact tttaacactg ttaatgttac atgtattaga 4920 attagttact gtttacttat ctattctggt acattctgta ttacagaatt taaaatgcac 4980 gatgaacgac tcccttcatc ttagcaaatg tcacacaaca agaacaagaa taaataggtg 5040 gccgacataa aacttgaatg tttgtgttat atgtcatatt aaactcttag agtcaatttt 5100 acaataccaa gcgggtgttt cttgcctttt cttccaacat tttggaaatc tttataacat 5160 tctaattgta ataagtaact ttttagaact gttataacat cactgatcac atgttccaac 5220 aataacatgt acaaaaacac atagatgttt ctagatgttt ctaaattgtt ctaaataaag 5280 agatatttgt accattacct tcccaatatt tctaacatat ccagataggt tcaaacattt 5340 tcaaactttc acttttaatt gtttgaacaa tttagaactt gagtgactga aatattcgtc 5400 tatctagaat agtttaaact taatacaaat attatatcaa aaccctctcg ctgactggga 5460 attgacttta acgccgagct tcatagatga aatcatacac ctcatcggcc tctaggaagc 5520 ccctataatg ttcgcgtttc ttctctggat aggcgtctcg gatggcccag accacacacg 5580 acgggatgac tcgtctgatg tgcgtgccga gtctctcgtg cagccagtag gtgcactggc 5640 gatacgcttg catccggtag gtgcgatgaa ggatcggatc ccccagagct tccctcaggt 5700 tttccacgcg cgccagaagt gccgaacgaa gcacggtctt gttgatattg ttggcgtgga 5760 agtcctcgtg ctcaactaaa cacaaaatgc cgtcaccaat gcctttcttc tccgcctcgg 5820 tcagatcatg acagcacaca cattcccgga cggacggcat cggctggcag ttcgtgcacc 5880 ggcaccaggt gaactgctcc agccgccagt tctctgtttc tacatccacc ccgtcccact 5940 ctgaaacagt catgctctcc aacagctccg gctcagacgg cggggatacc ggggttcgag 6000 gtcgtgggcg gactacactg acgctcccgt ccccagtccc agatctttca cgatcgctgc 6060 gccgagggtg aaaattgtaa gctatggggc cgaggctcat cccagcgatt tcctcttcga 6120 gacctggctc catactgcac cacactgctc cctggacgct ggaataaact gacgtgccta 6180 cggaatcgcc acagcttcgc tgcgcgccgg ccggtttgtt gtgaaacgcc ggccggtttg 6240 ttgtgaaacg cgatccttca aaatggcgag tggaacagca gacgacgcac gaatctgctc 6300 ccactgacgt catgaagtac ccggatgtag gcaaaatggc cgcggcgggg agtccccggg 6360 attctcggtt agccggaaat atgatctgcc ggcaccgctc gtaggtacga aggcctatac 6420 caacaactcg taaatcgact tttgggggta ctttgcgtaa aataataaaa ggtttcacgc 6480 tgccttgtgt tttgct 6496 // ID CR1-17_HM repbase; DNA; INV; 4457 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-16_HM; KW CR1-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4457 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1845-1845 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(63..509,499..807,861..3302,3212..3970) FT /product="CR1-17_HM_1p" FT /translation="MLKMEITMKSIEKMISSKLEEHKKSILKETERLLKEQ FT EKTFTSIMSANLKIITDRLDVIEKDNNNNKRKISNFEKDINDIKDSLNFQE FT DKISEKLSHIKKYYDNEVNILNKKTVDLENRSRRNNLRIDGLHETPGENWD FT DCEKAVKDNDKIMIKKQLKITSEVVIERAHRIGQHKHNKPRTIVLKLLNFQ FT DKNKILNAVKHLKGTGLYVNEDFAPETTELRRKLWEEVKKLRSEGKYAILK FT YDKIFSRDFKKFLLLNLMTYKTIDFESLRFNVFETANNILNDFCDVDTQIF FT QVNNFDSPYFNIKNFKTELQTFKNNFMAIHINIRSINKNFDKLKHFLTDCN FT YSFSMICLTETWCSDESIQKNSNFQIPYYKLLSSERKADKRGGGIATYIRN FT DQAIKARKDLSISDSSCEVLTIEITNSKTKNILVSTCYRPPEGDIKKFSSY FT LEDIFLKINREQKKLFCIGDLNIDCLKYKKFPNATTKLFFDNMFQHCIFPI FT INKPTRITPNSISAIDNILTNAFQDSSLKTGIVKTDISDHFPIYFSINQDT FT RINNNSKTKIYIRKTNKISIQKFKDTLSIVNWDEVYKKCNLEDTNSAYNTF FT IDIFFNHYNSYFPIEEKIVKEKYLNCPWITSGIKKSSKTKQKLYIKYLKNR FT NEVNLSIYKQYKNLFEKIRKNSKKIYYSKLLQNTNGDIKKSWNIMKEIIGK FT KNTKTNTLPDRIVVDEKEYNDKSSIAEHFNSFFANIGPNMASKIQCPNTYF FT ETYLTNQQSELKFNGLDYDELEAAKNSLKINKAPGIDEISSNVVISVFPVI FT RKPIFEIFKSSITTGTVPDKLKIAKVVPIFKSGEQCSLNNYRPISILSVFS FT KLLERIVYNKLYKYLTINNILNKKQFGFQKQHSTEHAIIDLVNKINDSFDN FT NKFVLGIFIDLSKAFDTVDHAILLKKMEKYGIKNVALHWFESFLINRQQCV FT IMDKNTHSKLLKIKCGVPQGSILAPLLFLLYINDLPKVSKKLDAIMFADDT FT NLFYSSTSIAELFETAGIELEKLNTWFKSNKLSLSTEKTNYILFHTNQRRK FT NLYQFHYRLQIIIEYRKNQLHSISHQSKKKKSIPIPLPSLKIENKFIERTK FT ATKFLGILIDENISWKAHINLLNTKIRNNICMLYKARPMLPQKNLKLLYFS FT FIQTYYTYGNIAWASTHRSNLVALYRHQKHAARIVFNKDKFTHAEPLLKLL FT NALNVYKINIYQNLIFMLKYKLGLVPLHFSEEFFKCNRNKYETRGIGNFKV FT PYKKTKLSRFSISYRGPYLYNKLIAKNTIITKLLNHNSLKTLLKKLVLNLN FT NFTNVY*" XX SQ Sequence 4457 BP; 1851 A; 742 C; 559 G; 1305 T; 0 other; agtggaaagc ggcgtggtat gcggacgtgt tttttagtcg cagcgtaaaa agttctaact 60 ttatgttaaa aatggaaata acaatgaaga gcattgaaaa aatgatttca tcaaaacttg 120 aagaacataa aaaaagcata cttaaagaaa cagaacgttt gctaaaggag caggaaaaaa 180 ctttcacctc aataatgagt gcaaacctta aaattataac cgataggtta gacgttatag 240 aaaaagacaa taacaacaat aaaaggaaaa tatcgaattt tgaaaaagat atcaatgaca 300 ttaaagacag ccttaatttt caagaagata aaatatcaga aaaattatca cacataaaaa 360 aatactatga caatgaagta aatatattaa acaaaaaaac agtagatttg gaaaatcgat 420 cgagacgaaa caacttaaga atagacggac tacacgaaac accaggagag aattgggatg 480 attgcgaaaa agcagtaaaa gataatgatt aaaaaacaac ttaaaataac aagtgaagtg 540 gtgattgaac gagctcatcg tatcggtcaa cacaagcata acaaaccaag aacaatagtc 600 ttaaaactct taaattttca ggacaaaaac aaaattctca acgcagtaaa gcaccttaaa 660 ggaaccggcc tatacgttaa tgaagacttc gctccagaaa caactgaact ccgaagaaag 720 ctttgggaag aagtaaaaaa actacgcagc gaaggtaaat acgctatttt aaaatacgat 780 aaaatattta gtcgagattt caaaaagtag cgccgcgttc tcttttactt aaactctttt 840 ttaaacgaac gcgtatttaa tttttgcttc ttaacttaat gacttacaaa acaatagatt 900 ttgaatcgct gcgttttaat gtcttcgaaa ccgcaaataa catactcaat gatttttgtg 960 acgttgatac gcaaatcttt caagttaata actttgattc gccttatttc aacattaaaa 1020 attttaaaac tgaactccaa acttttaaaa ataactttat ggcaatacac ataaatataa 1080 gaagtataaa taaaaatttt gataaattaa aacatttctt aaccgattgc aattactcat 1140 tcagtatgat ttgtctgaca gaaacttggt gctctgatga atcaattcaa aaaaattcta 1200 actttcaaat tccatattat aaattattat cttctgaaag gaaagcagac aaaagggggg 1260 gagggattgc aacatatatt cggaatgacc aagcaataaa agcaagaaaa gacctttcga 1320 tttctgattc cagttgtgag gtacttacaa tagaaatcac taactccaaa actaaaaata 1380 tcttagtttc tacctgctat agaccacctg aaggagatat aaaaaaattt tcaagctact 1440 tggaagatat ctttcttaaa attaatagag aacaaaaaaa actattctgc ataggtgatt 1500 taaatataga ttgtttaaaa tacaaaaagt ttccgaatgc gaccaccaaa cttttttttg 1560 acaacatgtt tcaacattgt atctttccga ttatcaacaa acctacgcga ataaccccga 1620 attcaatttc cgcaatagat aacatcttga caaatgcatt tcaagattcc tctttaaaaa 1680 caggtatagt taaaaccgat atttcggatc acttcccaat ttacttttct ataaatcaag 1740 atacaagaat aaacaacaat tctaaaacca aaatttatat aagaaaaact aataaaattt 1800 ctattcaaaa atttaaagac acactatcaa tagtgaattg ggatgaagta tataaaaaat 1860 gcaaccttga ggacacaaac tccgcatata atacatttat cgatattttt tttaatcatt 1920 ataatagtta tttcccgatc gaagaaaaaa tagttaaaga aaaatatttg aactgtccat 1980 ggattactag tggaattaaa aaatcctcta aaacaaaaca aaaactctac ataaaatatt 2040 taaaaaacag aaatgaagtc aacttatcta tctacaagca atacaaaaat ctctttgaaa 2100 aaataaggaa aaactctaaa aaaatatact actcaaaact acttcaaaat accaatggcg 2160 atattaaaaa atcatggaac atcatgaaag aaataatcgg gaaaaaaaat accaaaacaa 2220 ataccttacc tgatagaatt gttgtggatg aaaaagaata caacgataaa agttctatag 2280 ccgaacattt caacagtttc tttgcaaaca tcggccctaa catggcgtcc aaaattcaat 2340 gcccaaatac ctattttgaa acctatctta caaatcaaca aagcgaacta aaattcaatg 2400 gactagacta tgatgaactc gaagctgcaa aaaattctct aaaaattaac aaagcaccag 2460 ggattgatga gatttctagc aacgtagtaa taagtgtctt cccggtgata cgcaaaccca 2520 tattcgaaat ttttaaatct tcaattacaa caggaactgt tccagacaaa ttaaaaattg 2580 caaaagttgt accaatattt aaatccggag agcaatgctc acttaataac tacagaccca 2640 tctcaatcct ttctgtcttt tctaaacttc tagaacgaat agtttacaat aaactatata 2700 aatacctaac tattaacaat attttgaata aaaaacaatt cggatttcaa aaacaacatt 2760 caaccgaaca tgcgattata gatcttgtaa ataaaataaa tgactctttt gataataaca 2820 aatttgttct aggaatcttt attgatctat cgaaagcatt cgatactgtt gatcatgcta 2880 tactacttaa aaaaatggaa aaatatggca ttaaaaatgt tgctttacat tggtttgaga 2940 gctttcttat aaacagacag caatgtgtta ttatggataa aaatacccat tcaaaattac 3000 ttaaaataaa atgcggagtt ccccaaggtt ccattctagc tcccctactc ttcttgctat 3060 acatcaatga tcttccaaaa gtctcaaaaa aacttgatgc cataatgttt gcagatgata 3120 caaacttatt ttattcatct acttctatcg cagaactctt cgaaactgca ggtattgaac 3180 ttgaaaaact taatacttgg tttaaatcta acaaattatc attgagtaca gaaaaaacca 3240 actacattct atttcacacc aatcaaagaa gaaaaaatct ataccaattc cattaccgtc 3300 tctaaaaata gaaaataaat ttatcgaaag aaccaaagca acaaaatttt taggaatact 3360 tatcgatgag aatatttcat ggaaagctca cataaactta ttaaatacca aaataagaaa 3420 caacatttgc atgctttaca aagctagacc gatgctaccc caaaaaaatt taaagctact 3480 ttatttctcc ttcattcaaa cctattatac gtatggtaat atagcatggg caagtaccca 3540 tagatccaac ctagtggcac tttatcgaca tcaaaaacat gctgcaagaa tagtatttaa 3600 taaagataaa tttactcatg ccgaaccttt gctaaagctt ctaaatgcac taaatgtcta 3660 caaaattaac atttaccaaa atttgatatt catgctcaaa tataaactcg gacttgtccc 3720 attacatttt tcagaagaat tttttaaatg caatagaaat aaatatgaaa ctcgaggaat 3780 aggaaatttt aaagtaccat ataaaaaaac aaaactatca cgtttctcca tctcgtatcg 3840 cggcccttac ctctacaaca aactaatagc taaaaatact ataattacaa aactacttaa 3900 ccataattct ctgaaaacac ttttaaaaaa acttgttcta aacttaaaca attttacaaa 3960 cgtatattaa aacaaataac taaaaacccc taaaaaatcc actttaaagc aaagtaaatg 4020 ttagatttta tttatatttc gtacatgtca atatactaag tataacaaac catgtaacta 4080 tatatttatg tgtatcaaat acagctcttt tccgtttttt gtttttgatt atctagtagc 4140 tttttatgtg tgactcttat gccaccatga gactttgaca aactgaaatt tcttaataaa 4200 ttaagtatac aaaatatcgc aactaaaagc ggtttcttga tgacaagacc aaaatggtct 4260 tctgcaagtt tcccgcgtct tttaacagtg ttactttttt atatatatta ttatttgtat 4320 taaaaaaagt tttcttatta tatgtacggt atttttttta tacaatgtaa acgaactagt 4380 ttatatttat gatcacgtat ttgtaaagct ttgataaaag acaaaaaaaa aaaaaaaaaa 4440 aaaaaaaaaa aaaaaaa 4457 // ID Sola2-3_DPu repbase; DNA; INV; 5367 BP. XX AC ACJG01003466; XX DT 17-FEB-2011 (Rel. 16.02, Created) DT 17-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE Sola2-type DNA transposon from Daphnia. XX KW Sola; DNA transposon; Transposable Element; Sola2-3_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Direct Submission to Repbase Update (09-FEB-2011). XX RN [2] RP 1-5367 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX DR EMBL/GenBank/DDBJ; ACJG01003466; Positions 44939 39573. XX FH Key Location/Qualifiers FT CDS 3007..4215 FT /product="Sola2-3_DPu_1p" FT /translation="KWESTNCTMLHNIELPSNEFVTLFFAKIDKLKSHHFI FT SKSQSRFSRELKSNFPLNKCLLQGDFTQNYSMLSQDSTQSSLVPPPPQCTL FT HTFIAYINVYGKIISHSMCVFSNDMTHNTVAVHTFLKPVIKQIFLICPTLE FT KIIHFTDGAASLYKKFKNFSNLLCHLEDFKVAGEWHFFAASHGKGPCDGIG FT GTLKRLARRASLQETANIQTPESLYNWCHTNVTNIHSFYEPSSEIEETEKL FT LENRFKSAKPIRGTQSFHSFVPVDAYSLEARVILCSDTFKSFVVIPLPTFL FT SFNFQDVQVNSVIAVAYKDGKWYLSNVVEKNDAAFEFKVHFYKPSGEDSRT FT TGFKLSKAENTAIVPIKNAIQITKSFMRSSRRDRSFKILSTEDDEIEIKFS FT NLMANGVEL" XX SQ Sequence 5367 BP; 1882 A; 854 C; 867 G; 1764 T; 0 other; ggtcagtcca cgcgaaatct gccgatcgat tgcgccgtca aaattgggat tcaattcatt 60 ttttgggttg ggtaaagatg ggtcaaaaaa agtgattctg aaattttcag aattatcgct 120 ttaatagttt aggagatatt taaatttaaa tttctttttt ttttattttg ccgtgaactg 180 aaaaaaaaaa tttgtttttc tttgattgcg actttatttg tcattataat tttattaaac 240 caaaaccaat cgacgtaaaa ttaaatttta cgtcgatctt acacatttat ttttattttt 300 aagaaatatt tttatcgcaa tttgcaaaaa aatctaaaat ggtcacactt taaaaattaa 360 tataaaataa aactgaaaat taaaaaaaaa atcgaataat tttccgatga acaagagata 420 tgtttacccc aatcaaaaaa ttgggagggt gtgttttgtt aatcgtgaga tatatagcag 480 tttagaaaca taaattctcg tgttttgtgc gtgtttttcc atcgccccaa gccatttcgc 540 tccctttcgg tcggccattt tggaaggggg agtttttctc caactcatag taaaaacgac 600 cgagagactt gagtgactag cagaacaaaa aaaaccctct gcagtgtatt cgtatatata 660 ttattcatag tatatatata tatatattat tataagtata catatatata tatataaata 720 atatatatat aaatatatat atataaataa gtataagtat atatatatta ttcatagtat 780 tcgaagtttt tcatatatat atattatata ttattataag tatacatata tatatatata 840 aataatatat atataaatat atatatataa ataagtataa gtatatatat attattcata 900 gtattcgaag tttttcattt ttcacactgt gttaagactt tagagatgta taccattcga 960 attttatgga atggctgaca aaaattcggt aaatcagtgt tcagtgggtc agcatgttaa 1020 atcacaatgt tattcttcca aatttgtgcc agaagacttc agtttatcta cgtttgacga 1080 tctttacgaa gacacgcgca aaacagttca gtttagaacc aacatgaaaa gtataacttc 1140 tatttgtctt caccacaaaa caacatacac aaatcatttt cgtgtatatt tgttataaaa 1200 gaaatctaac aaatgtagca atccactagg cattcatttg ccgaagaagc gacccaatgt 1260 ttcgtgttcc aaaacaataa catcgaattt ttgcgaaaaa tttcttcgaa gagaagtgat 1320 attcatgtct atcctggaca gcagttatgc gttacttgtt acaaacagtt attgcagatt 1380 agtaacgaaa acgaagtgaa attgtctaca acaagtgaat ccagttcaaa agtaggagat 1440 ttagagtctg ttttgcaaga aattgttagc gataatgctt tcctgtcaag aattattaaa 1500 cataaatttc gctttactta tcaatggaat atctgtgtat ttgtattatt tggtacccaa 1560 agcacattac aagaaaacat cagacatgac aaaacttact tgacagtaaa acgatgaaaa 1620 cgctcgtggt gggaattgtt gtaaacttac ttttaataag acaaaatcat agactaagtt 1680 taagtatgaa taactaaaga aaaagtccta ttaacaccat aattcaacga tcgattctta 1740 tacgctggga tgttctttat tttttaataa caatagaaaa aatttaggtc ctatagttta 1800 taagttatcg tcactttact tagggactat cgggcataag agaccgtgtt aaaccttttt 1860 ggcgccactg taaaattttt aaaaatgtaa caacttctaa attatacaac tacaataccg 1920 ttcgacagaa aacattcata tttggtgata tgatgtagcc tatcacaagg tatcatttca 1980 tgactttttt ttattttaaa tattttaaat tcagtattta attaatagtt aaatgaaatg 2040 ttttatttaa attcaatatt tttaatatta aatataatta tttcttcggg ggtttttaaa 2100 caagtattta ttaccttgat agtaataaag ctcattaaaa aaatcaattc ttgcaatttg 2160 ttctaggtta aaatataact ttactccact aaacgtcgaa atccttatat tgccaagaaa 2220 gcggagcaag tgaagaacgc gttcatatta ctccacaaaa atcacataga tgatagttta 2280 gttacacttg ttagactaca cttctcaaat gtgcgaaata aaacagaaat gcattgaact 2340 aaaaatggaa gggaaagtaa aagaagtaat tagttttttt actttagcac ctgtgtcatg 2400 gagtaaagaa gttactgctg actatttttt ctgtgaccgt ctctgaagtg aaacgtgccc 2460 gtctattaaa acaagaaaag ggaatacttg ccgtaccaga ccccaaaaga ggtagaaaga 2520 tttctccaga ggaaattgat attgtcgagg agttttatct atcagacgaa ttctccagac 2580 ttatgcctgg catgaacgat tatctgccag attgcaagat ggagagaaga aaacaaaagt 2640 tcaaaagcgg ctattattac ttaacattga agaattattc ttcaagttta aagagtatag 2700 cttgaataaa ctttgcatga agtgctgcag caactctaaa ttctatgaac tacaaccgaa 2760 gcatgttatt gaagttggag ctgctggcac ccacaacgtg tgtgtgtgcg aaagacacca 2820 aaacatgaaa ctgatgctcg acgctattca tggtaaaaca gaaaagtatt atctaatgaa 2880 tctaattgtt tgtgatgttc attactatga atgtatgctg aaaatatgtg ttaattgtcc 2940 tggtacgaat gcattgcttg aatatcttat gacaactata ccaaacgatt caataattaa 3000 atttaaaaat gggagtcaac aaattgtact atgttgcaca acatcgaact acctagtaat 3060 gagtttgtca cattattttt tgccaaaatc gataaattga aaagtcatca ttttatctcc 3120 aaatctcaat caagattcag tcgtgaatta aaatctaact ttcctttgaa taaatgcctt 3180 ttgcaagggg attttaccca aaactactca atgttgagtc aggattccac gcagtcttct 3240 ttggttcccc cgccacctca atgtacattg cataccttta ttgcgtacat aaatgtatac 3300 ggtaaaatca tttcacatag tatgtgtgtt ttctcaaatg acatgactca taacacagtt 3360 gctgtacata cttttttaaa acctgtcatc aaacaaattt ttcttatttg tcccaccctt 3420 gaaaaaataa tccattttac ggacggggct gcttctctat acaaaaaatt caagaatttc 3480 agcaacctat tatgccactt ggaagacttt aaagttgcag gggagtggca cttttttgcc 3540 gcaagccacg gaaaaggccc ttgtgatggc ataggaggaa cgttgaagag gcttgcaaga 3600 agagctagtc tccaggaaac agcaaacatt caaactccgg aatcgttgta taattggtgc 3660 catactaatg taacaaacat tcattcgttc tacgaaccat catctgagat agaagaaact 3720 gaaaaattat tggaaaacag atttaaatct gctaagccaa ttcgtggcac gcaatcattc 3780 cattcgttcg ttcctgtcga cgcatattca ttggaggctc gcgtaatatt atgttccgac 3840 acttttaaat cttttgttgt tattccactc ccaacatttt tgtcttttaa ctttcaagat 3900 gttcaagtca actctgtaat cgccgttgca tataaagatg gaaaatggta cttatcgaat 3960 gtcgttgaga aaaatgatgc tgctttcgag tttaaggtcc atttttacaa gccgtcaggt 4020 gaagactcaa gaaccactgg tttcaagcta tcaaaggcag agaacacggc tattgttcca 4080 ataaaaaatg ctatacaaat aacgaaatca tttatgagat cttcacggcg tgatcgctcc 4140 tttaaaatct tatcaacaga agatgatgaa attgaaataa aatttagtaa cttgatggcc 4200 aatggagtgg aattgtaaga caaataatta ttaaattaca acatctatct atgtgatgtg 4260 aggggatgtg gggtgtagtg gactagagcg tcaacacttt ggaaaaagga agtgaggctc 4320 tggtttcgaa ccccagctga acccgtcata aaactgtaag tagcgccctt gaggcagcat 4380 gcataacaca ggattcttga ttgctttcaa ggatatgacg ttaaatatac tgtggtgtgt 4440 tcaatcacat ctaaattgga aactgaaggc gctttgagca cataatggaa aagcgccata 4500 taaatacgta cattacatta cattacatta catgtgagtt tttcaattta cttaataatt 4560 atttctatag tgaggtgttg taattaatat gtgcatgttt tttaatgtgt ctttttctaa 4620 gaaacaacag cgcattattc atattaaccg atataaacca aggtattcat agggacgatc 4680 aaattgaagt atcaatttca aaagttttta tttcgccgcg caaaatccat aaacaaacaa 4740 gttttgctag tcaataaagg tattttttag tatgagttgg agaaaaactc ccccttccaa 4800 aatggccgac cgaaagggag cgaaatggct tggggcgatg gaaaaacacg cacaaaacac 4860 gagaatttat gtttctaaac tgctatatac ctcactatta acaaaaatac cctcccaaat 4920 cttttatatg ttatgtaaac atatctcttg ttcatcggaa aattattcga tgtattttta 4980 aattttcagt tttattttat attaattttt aaagtgtgac cattttagat tttttttttt 5040 agattgcgat aaaaatattt cttaaaaata aaaataaatg tgtaagatcg acgtaaaatt 5100 taatgttacg tcgattggtt ttggtttaat aaaattataa tgacaaataa agtcgcaatc 5160 aaagaaaaac aaagtaattt atttttttca gttcacggca aaataaaaaa aaataaattt 5220 aaatttaaat atctcctaaa ctattcaagc gataattctg aaaatttcag aatcactttt 5280 ttttacccat ctttacccaa cccaaaaaat gaatcgaatc ccaatttgga cggcgccatc 5340 gatcggcaga tttcgcgtgg aatgacc 5367 // ID Gypsy-24_CQ-I repbase; DNA; INV; 4301 BP. XX AC AAWU01011196; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_CQ_; KW Gypsy-24_CQ-LTR; Gypsy-24_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4301 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 427-427 (2011). XX DR GenBank; AAWU01011196; Positions 15930 20230. XX CC 'AAATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1035..4214 FT /product="Gypsy-24_CQ-I_1p" FT /translation="MSLTSSVEAFRKGLSFTDWADHLSYSLDANCVTNDKI FT KKAHLMNLCGPFLFTQMKHLFSPEALAKVTFDEIVTKLKQSLNKTEPDLVH FT RLRFSQLVQQSDETAEEFVEMVKLHAEFCGFGIFKDVAIQDRILAGLNDKN FT LKQTLLNEGNLTVSSMEKFIRSWVAAKLNFKSLTGKPNNKFYSGSQMHQEF FT YQHYFSNQNNYPQMRRNYTQTQSREGNSRKIQLYNANMQHNTRHNNNHNFE FT YNHINFIQTQSKHLRFNDQTQQRINDGDSYGRRFKTSDTFCDFCKQMGHSK FT QNCLKCNSINKYFADLVNIEAGVINCNGNGSNNQIIEECMKKITTTETKSA FT AKADEDGKTAIIDKNDLTQNGKFHDTAKVQNNMQIDFMIDDLNLSVLFNQN FT WDDFENDDLCQGVLFNVKQSSNEQLCTTYYTEKCDESAKSCDDFSYFENNV FT DDFDLENLFFDENVMLSSNEQKSKYIDMFDDVPNRNDDLNMLFKIDDGVDI FT LKANSDNDAGSSELKIDRYTVKNYKNLKANADVVTNFAPDWVDSVKKQILF FT FDDVSDNSAKVLFDMCSIEHKNEKIKTILNLNCFGQICWMLMIIICVKLFI FT IRIKIIEISVALCKLLRMFLTSNAKLQKWVAMSKGWTINVDNCFALMSNQY FT DWKLDDDSCLSNDNGSLLRAYQSRILKLVHVIYAIVTKTIWNNWYQNFNEI FT TESSYLHLVVNIKINVLISSTIGLTHKVDECFVKVEKLLFTWRVICSMYFS FT LNFVQEQSQMTNGEFQDFIIQFVLIFGVSFNYDDGHLSTINTKIYKWLKAW FT TQEKFNYFQMKTLLGDQLVWQMILKLNLFCSRWQIFERVKKRVISSIFFKL FT SNSLEKCFLKLVNIMQIKKNMMIKAMIVNDNRAKCSMFDQVSIKLFCILTI FT VWITVHHLKINSIVCIIKLILFDIYLKQKPCFHQSQRRCGASGSTLEQLVS FT KDKEPNIGFTSKVRVKKPSNPELKTIAGYSLRGVVPWEIPEDIKAGFCFES FT GHAFEAKLLERDMLHTKNVLIKSDFGYMIRSGISSKSRNRIFPVDISILAR FT LLS" XX SQ Sequence 4301 BP; 1447 A; 592 C; 898 G; 1364 T; 0 other; actggcgacg aggaaattac gaactcagca accggcggga caagtaattt tctctaaaaa 60 ctaaaatatt cttgttccgt gggaagcaat ttttgaccaa atttggtgag aaaaatacgc 120 tcagttctgc tcagctggtc attagtggtg tactgggtgg atgctgtagg tggtgatatc 180 gtgcagcaga aatctatcat tctttgggtg gagtaagccg gagaaaaatt gttcgcggag 240 gcgctgtacc agatccacat tcatcccgga atatcgacgg tggcatcatc cggacggttt 300 cggcaagatt ttgttcagcg tggacgctaa aaaaactggc cattttgggt ggtaagacaa 360 agaacaacaa ggtaatttta cgctgcacca ttgtgtgtta tttttcccct ttgctttggt 420 gatcttttgt tttcaacgac aaagttggtg agacgcggtg cgtttgatgg taaaactttt 480 gcaaaaaaaa aaaaaaaaaa ctgactagaa tttcgctgaa gttattgaaa ctaacgaaat 540 ctaaaaaaac agttacattt gtcatttaag gttaaatttt tttaaggtat tcttttgttt 600 aaactgtata atgaaagtaa atttattttg tttaatttgt gtgtagattc attgtgtgct 660 gccgataagc ttgattggct gcagtggttt gaaaaaagtg ttcggtggac cttccacggg 720 acctggagat tgtctctggc ggtgctttcg taaagtgaag ttggtgattt ccggactggg 780 ctttacgacg gcaagaggac aatcgggaac tttgcagcgg acggcgtaag gtggacgaac 840 tttggcgctt cttgaaggaa cgcacggtgt ttgaggtgaa gtctgttgaa agtcgttcgc 900 tggcaagggc cggacgaaga tttggtgagt cgttttaaac cattatttgg ttcgtggatt 960 tgtttgtgac caactggtca agtggaatta ctctggcagt gttaaattta atttgcatta 1020 tattatattc aaacatgagt ttaacgtctt ccgttgaagc atttcggaaa gggttatcct 1080 ttactgattg ggcggatcat ttgtcttatt ctttggatgc aaattgtgtt actaatgata 1140 aaataaagaa ggcacatttg atgaatttgt gcgggccttt tttgtttacc caaatgaaac 1200 atctctttag tccagaagct cttgcaaagg taacttttga tgaaattgta acaaaattaa 1260 aacagagttt gaacaaaact gaaccggatt tggttcatcg acttcggttt agtcaactag 1320 ttcagcaaag tgatgagaca gctgaagagt ttgtcgaaat ggttaaattg catgcagagt 1380 tctgtggttt tggcattttt aaagatgtag cgattcaaga ccgcattttg gctggactga 1440 atgataaaaa cctcaagcaa acgttgttga atgaaggaaa tttaactgta tcatcaatgg 1500 aaaagttcat caggtcttgg gttgcagcaa aattaaattt taaatcgctt actggcaaac 1560 caaataacaa attttattct ggttcgcaaa tgcatcagga attttatcaa cattattttt 1620 caaatcaaaa taactatccg caaatgcgaa ggaattatac acaaacacaa tcgcgagaag 1680 gaaatagtcg caaaattcaa ttatacaatg caaatatgca acacaacaca aggcataaca 1740 acaatcataa ttttgaatac aatcacataa attttatcca aacacaatca aaacatttga 1800 ggtttaatga ccaaacacaa cagagaataa acgatggcga tagctatgga agaaggttta 1860 aaacttctga cactttttgt gatttttgta aacagatggg acactcaaaa caaaattgtt 1920 tgaaatgtaa tagtatcaat aaatattttg ctgatttggt aaatattgaa gcaggtgtga 1980 taaattgtaa cggcaacggc tctaacaatc aaattattga ggaatgtatg aaaaaaataa 2040 caacaacaga aacaaaaagt gctgcaaagg cagatgaaga tgggaaaact gcaattattg 2100 acaaaaatga tttaacacaa aatggaaaat ttcatgacac tgcgaaggtt caaaacaaca 2160 tgcaaataga ttttatgatt gatgatttga atttgagtgt tttgtttaac caaaattggg 2220 atgattttga gaatgatgat ttatgtcagg gtgtattatt caatgttaag caaagttcaa 2280 atgaacagct atgtacgact tattacacag agaaatgtga cgagagcgca aaaagttgcg 2340 acgatttctc atattttgag aacaatgttg atgattttga cttggaaaat ttgttttttg 2400 atgaaaatgt tatgttgagt tcaaatgaac aaaaatctaa atatattgat atgtttgatg 2460 atgtcccaaa taggaatgat gatttgaata tgttatttaa aattgacgat ggtgttgata 2520 ttttgaaggc taatagtgat aacgatgcgg gttcgagtga gctgaaaatt gatagataca 2580 cagtaaaaaa ttataaaaac ttgaaagcaa atgctgatgt tgttacaaat tttgccccgg 2640 attgggtaga ttcagttaag aagcaaattt tattttttga tgatgtttct gataattcag 2700 caaaggtttt gtttgatatg tgttcaattg agcataaaaa tgaaaagatt aaaactatat 2760 taaatcttaa ttgttttggg caaatatgct ggatgttaat gataataatt tgtgtaaaat 2820 tatttataat acgaataaaa attattgaaa tttccgtggc tttatgcaaa ttgttgcgaa 2880 tgtttttgac aagtaatgca aaattgcaaa aatgggttgc tatgagtaaa ggatggacta 2940 taaatgtaga caattgcttt gctctgatgt ctaatcaata cgattggaaa ttagatgacg 3000 atagctgttt gtcaaatgac aacggttcat tactgcgtgc ttatcaaagc aggatactaa 3060 aattggtaca tgtcatctat gccatagtaa ctaaaactat ttggaacaat tggtatcaaa 3120 atttcaacga gataactgag agttcttact tgcatttagt agttaacatc aagataaatg 3180 ttttgattag ttcgacgatc ggccttacac acaaagtaga tgaatgtttt gtaaaagttg 3240 aaaaactgtt attcacttgg agagttattt gctcaatgta tttttcactg aattttgttc 3300 aggaacaaag tcaaatgaca aatggagaat tccaggattt tattatacag tttgttttga 3360 tttttggggt tagtttcaat tatgacgatg gtcatttgtc aacgattaat acaaaaatat 3420 ataaatggct aaaggcttgg actcaagaaa agttcaatta ttttcaaatg aaaacactgc 3480 ttggtgatca gttagtttgg caaatgattc tgaaactgaa tttgttttgt tcccggtggc 3540 aaatttttga gagggtgaag aagagggtga tttcatcaat ctttttcaaa ttaagcaatt 3600 ctttggaaaa atgttttcta aaactagtaa acattatgca aataaagaaa aatatgatga 3660 taaaggccat gatagtgaat gataatcgtg ccaaatgcag tatgtttgat caagtatcaa 3720 ttaaattgtt ttgtatactg acaattgttt ggataactgt ccatcattta aagattaatt 3780 ctattgtctg tataataaaa cttattttgt ttgatattta tttgaaacag aaaccatgtt 3840 ttcatcaatc tcaacggaga tgtggagcga gcggatcaac gctggagcaa ttggtgtcaa 3900 aggacaagga gccaaacatt ggtttcacat caaaagttcg cgtaaagaag ccgagcaacc 3960 ccgagctgaa aacgatagct ggatacagct tacgaggagt tgttccttgg gaaatcccag 4020 aggacatcaa ggcaggattt tgtttcgaat cgggacatgc atttgaggcc aaattgctgg 4080 aacgcgacat gctacataca aagaacgtac tcatcaaaag tgattttggt tacatgatta 4140 gaagtggaat ttcttcaaaa agcagaaatc gaatatttcc agtagatata tctatcttag 4200 cacgtttact atcgtaaagt tcaatagctt gcaatcatgt ataattatag gttttaatta 4260 aaaatacaat gttattgtaa tcttccaaag ggagggcgag t 4301 // ID hATm-7_HM repbase; DNA; INV; 3390 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3390 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 211-211 (2008). XX DR [1] (Consensus) XX CC This family is very distant from most hATm elements and is closer CC to hATx-9_SM CC than to most hATm elements. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 634..2952 FT /product="hATm-7_HM_1p" FT /translation="MSSAAVTTRKKSTCPIFGSPADLRANVLPTYKHVMLS FT YLLSMQKMRVETGGKQPSLKEISENLAHKIENLWGKASIPTVSHKRIVKMF FT LDYNQKYRNIMKSVNKTKHTDKLKSKIITFQKHAENTLFDVAACKCLDFSK FT CICEKAKKVPEKEHEFLSDQRFMRIMYMEGLDIQATLRLRRKEARKNRVNS FT PLPSNPSKQPCAITGIEAQSTLSSRIHNSSFYSISQDSDSDDTVIWNTDSE FT SDADYLDTAEGHKQKRNRKDKTEQMRSPLLSLAQACDRVGVSSRGAAFIAC FT AVLEDINIISKGDQSKVIDKNKIQRERKKNRLKMQHLDHENIQMLEGLYFD FT GRKDTTFVIDTNKGKNIRKTIKEEHVVLVQEPGSKYLGHVETASGHGYSIK FT TSIKTFLTQNNIDTSKLVAMGCDGTVVNTGFRNGVIALFELELKRPVHWFI FT CQIHANELPLRHLIQNLDGPTSGPNGWSGPIGREIKLCDTLPIVNYIPVSV FT TLPKIDKKSLSSDQKYLLDICKAISTGYCPNALANRNPGKLNHARWVTTAN FT RILRYYVGCSEPSHKLREIVEFVIRVYCQMWFNIRYDPSCTEGARHLWLTI FT NLSRYASSEIKSIIDPVIQRNAFFAHPENMLLAMISDTRPNVRKFALKQIM FT KARSIKTNKVRKFSIPEINFEATDYIDLIDWIKCELSEPPITTHISDENLI FT EMLTNGNLPDVQHFPCHTQGVERCIKLVTEASDQVCGLASRDGFIRARLAS FT RKKMPIFNTKRQFETETNESDS" XX SQ Sequence 3390 BP; 1260 A; 549 C; 566 G; 1015 T; 0 other; tagggtgtat cgaaaaacaa ctttttttga atttgaaatt gtaatagagc ggaaaagttg 60 cataatgata taaaaagtaa ttgtgccaaa ttattttgta ttttttttat gtcttcagtt 120 cgcgctaggg ttttattttt cgtagaaaaa cacgttttct gacctttttt acaaaataaa 180 aataaaaaat tctatgcata tactagagtt gtaatagagt ttaaacatta aacaatgact 240 aagttagaaa tggttacaat ttttaagaaa ttattttaat gtcttcagtt gacgctagtg 300 tgctgagttt tgacgaaaaa acactgtttt ttatttaaaa caattttaat tatattgttg 360 atgcttccac ttgtcgtcac ctgtcttcac tgttacaaca atggaacaaa agtaattaaa 420 atgtaataaa ccaaataatg ccaatggtta cagcggtgat tttaaaatga cattagtatg 480 tatgtaaaat gtcactaatt acgtcattta ttaccatcac catcagcatt attagaaaat 540 tagctaaaag cacaggttat aatttattag ttctgattta catctgttta ttaataaaat 600 aaattgaaat caaagaaaat tttaacggca aaaatgtctt ccgctgcagt gacaacaaga 660 aaaaagtcta cttgtcctat ctttggttca ccagcagact tacgagcaaa tgtcttgcca 720 acatacaaac atgtcatgct aagttatctt ttgtcaatgc agaaaatgag agtggaaact 780 ggtggaaaac aaccttcttt gaaagaaata tctgaaaatt tggcgcacaa aatagaaaat 840 ttgtggggaa aagcatcgat acctactgtg tctcataaac gcattgtaaa aatgtttctt 900 gattacaatc aaaagtatag aaatattatg aagtctgtta acaaaacgaa acatacagac 960 aaattgaagt cgaaaataat aacatttcaa aagcatgcag aaaatacttt atttgacgtt 1020 gcagcatgca aatgccttga tttttctaaa tgcatctgtg aaaaagcaaa gaaggtacca 1080 gagaaagaac atgagtttct tagcgatcag agatttatga ggattatgta tatggaagga 1140 ctcgatattc aagctacttt aagattacgg cgaaaagaag caagaaaaaa cagagtaaat 1200 agccccttgc cgtctaaccc atcaaaacag ccttgtgcta ttacaggtat tgaagctcaa 1260 tcaacattat caagccgcat tcataattca agtttctatt ctatatctca agattctgac 1320 tctgatgaca ctgtgatttg gaatactgac tcagaatcag atgctgatta tttagacaca 1380 gctgaaggcc acaaacaaaa aagaaacaga aaagataaaa ctgaacaaat gagatcaccg 1440 ttactatcct tagcacaagc ctgcgataga gttggtgttt caagcagagg tgctgcattt 1500 atagcatgtg ctgtgctgga agacataaac ataatttcaa aaggagacca atctaaagta 1560 attgataaaa ataagattca aagagaacga aagaaaaaca gacttaaaat gcaacattta 1620 gatcatgaaa atattcaaat gttagaaggt ttatattttg atggaagaaa agatacaaca 1680 tttgttattg atacaaacaa gggaaaaaat atacgaaaaa ctatcaaaga agaacatgta 1740 gttttagtac aagaaccagg gtcaaagtat ctagggcatg tagagactgc atcaggacat 1800 ggatacagca ttaagacaag tataaaaaca tttttaactc agaacaacat tgatacatcc 1860 aaattagttg ccatgggttg tgacggaaca gttgtcaata ccggatttcg taatggtgtt 1920 attgctttgt tcgaattaga actcaaaaga cctgtccact ggtttatttg tcaaatacat 1980 gctaatgaac tgcctctacg gcatctaatt caaaacctag atggacctac atcaggacct 2040 aacgggtggt ctggaccgat cggaagagaa attaaattgt gtgatacctt gccaattgtc 2100 aactacattc ctgtcagtgt cacattgcca aaaatagaca aaaaatctct tagttctgat 2160 caaaaatact tacttgatat ttgcaaagct atatctacag gttattgtcc aaacgctttg 2220 gctaaccgaa atcctggaaa acttaaccat gccaggtggg tgaccactgc aaacagaata 2280 ttgaggtatt atgttggatg tagcgaaccg tcacacaagc tgagagaaat agttgagttt 2340 gttattagag tttactgcca aatgtggttc aatataagat atgatccctc atgcacggaa 2400 ggtgctagac atttatggct aaccatcaat ttatccaggt atgccagtag tgaaataaaa 2460 agtatcatcg atccagttat acaaaggaat gctttttttg cacatccaga aaatatgctt 2520 ctggctatga tatcggatac tcgtccaaat gtgagaaaat ttgcattaaa acaaataatg 2580 aaagcaagat caatcaaaac aaacaaagtt cgaaagttct ctataccaga aatcaatttt 2640 gaagcaacag actatataga ccttattgat tggattaaat gtgaactaag tgaacctccg 2700 attaccactc acatttcaga tgaaaatcta attgaaatgt taacaaatgg taatcttcca 2760 gatgtgcagc attttccatg ccatactcaa ggagtagaga gatgcatcaa acttgtaacc 2820 gaagcttcag atcaagtttg cggtttggct agtagagacg gcttcattcg tgcaagactt 2880 gcatcgcgaa aaaaaatgcc tatattcaac acaaaaagac agtttgagac agagacaaat 2940 gaatcagata gttgaacttt attacagctc aactttaaac atttttcaaa aacattttcc 3000 actctattaa aactttaatt gtacattatg gactgctaag cttgttttcc agaaactaat 3060 gaaacagtgt tttttcgtca aaactcagca cactagcgtc aactgaagac attaaaatat 3120 tttcttaaaa attgtaacca tttctaactt agtcattgtt taatgtttaa actctattac 3180 aactctagta tatgcataga attttttatt tttattttgt aaaaaggatc agaaaacgtg 3240 tttttctacg aaaaataaaa ccctagcgcg aactgaagac ataaaaaaaa tacaaaataa 3300 tttggcacaa ttacttttta tatcattatg caacttttcc gctctattac aatttcaaat 3360 tcaaaaaaag ttgtttttcg atacacccta 3390 // ID Dbusck2cons repbase; DNA; INV; 504 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of irritans DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dbusck2cons. XX OS Drosophila busckii OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. XX RN [1] RP 1-504 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones that show less than eight percent divergence. CC Dbusck2cons. XX SQ Sequence 504 BP; 147 A; 118 C; 115 G; 124 T; 0 other; tgggtgcctc acgagctcac atttgactaa aaacaacaac gtgttgatga ttctgagcgg 60 tgtttgcagc tgttaactcg taatacaccc gattatttgt gtcgatatgt gacaatggat 120 gaaacatttc tccatcactt cactcctgag tccaatcgac agtcggttga gtggacagcg 180 accggtgaac cgactccgaa gcgtggaaag actcaaaagt ccgctggcaa agcaatggcc 240 tctgtttttt gagatacgcg tggaataatt tttatcgact atcttgagaa gggaaaaacc 300 atcaacagtg attattatat ggcgttattg gagcgtttga aggtcgaaat cgcggcaaaa 360 cggccccata tgaagaagga aaaagtgttg ttccaccaac acaacgcacc gtgccacaag 420 tcattgagaa caatggcaaa aattcacgaa ttgggcttcg aattgcttcc ccacccaccg 480 tactctccag atctggcccc aagc 504 // ID Gypsy-1-LTR_HM repbase; DNA; INV; 221 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-221 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1969-1969 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 221 BP; 81 A; 24 C; 33 G; 83 T; 0 other; tgttgtgtgt cattaataat tagcgtgcta attattgaca aaagtcacaa gttgtgattt 60 tctattatta taacgttttt ttattaattg tcgacaagag tcacaagttg tattatatgc 120 tatataaagc gaacaatttt tttttgtaaa tgtcaacttt gtatataagt aaaagaaaac 180 gttaacaaca aaaaaatggc ttttaaatgt gtcgaacaac a 221 // ID CR1_Ele40 repbase; DNA; INV; 5055 BP. XX AC . XX DT 28-SEP-2010 (Rel. 15.1, Created) DT 28-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele40. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5055 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5055 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >98% identity, and ~100% identical to the original CC sequence in [1]. Closely related to T1 and Q. XX FH Key Location/Qualifiers FT CDS 383..1240 FT /product="CR1_Ele40_1p" FT /translation="MTSACDHCAKPIKGEDESVSCMAFCDRMIHLRCSVTK FT LNKNFVNIVQTCPNLFWMCDECVKLMKCARFKATVSSFGDAINAMTERQEL FT AHAELRKEIAKQGEQIAKLSKGIALSTPTLSGSGGLVRQPPLKRRRNEGLS FT TSKPLVTGTRIVADDNVFTEVRTVPEPPEMFWLYLSRIHPSVKSESVEKLV FT KDCVHCQDPVTVVPLVKRGADTSRMSFISFKVGMDSKFRETALNSDTWPQG FT ILFREFEGTGSKNMWLPQLTTPTISITPAMERSPFVTPTTSMDQR" FT CDS 1501..4938 FT /product="CR1_Ele40_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MHQATLPGRTTESTLGVASYSSSVAPFAASVLQSRPG FT PESSCGLAASQSAISGKYNDDRELSLPDTNQIYSTSGYTSSSSPYPDLTLE FT EVLHGLAEDISLGTPTATNADSSSFDMRSNNEHRFQHPGRTILSTTGAPLS FT PSTVVTMQPALDSRPGPVVGCGEGVSQPVQSGKYTECADSILPDDLIPSSL FT RIYYQNVRGLRTKIESFFLAVTELDYDVIVLTETWLDHRILSAQLFGSHYS FT VFRTDRSALNSNKSRGGGVLIAVSSQFNCCSDPTPVNDQLEQLWVRILLPD FT RNVSIGVFYLPPDMKNDADVIRCHIESIAAVHNNLSENDLILQFGDYNQSD FT IYWAAEADNRLSIDLDRSRMCPASSALLDGFCFHGLSQVNTLTNTTNRTLD FT LVLVCDTVLPNFTLLAAAEALTTLDDHHPAIELIVMCPLPVTYEMSHDYTG FT FNFQKANFESLNVALMSVDWNHLDSIPDINEAVDFFTNSVLQAVTNNVPPR FT RPPSKPPWSNGQLRLLRRRRSAALRCYCNNRSQQTKLAFNEASREYRNYNA FT LLYARYKRRTEQNLRTNPKQFWSFINSKRKENGLPTSMYLDEQSADCASDK FT CELFAAQFQRAFNNFVAAPSQVHVALDDTTRDVFSYEMFEISEREVASAIT FT KLKPSYTPGPDGIPPALLKRCSALLIPLVKLFNRSLGCRVFPRSWKMSFLF FT PIHKKGDKRSVSNYRGITSLCVCSKVFEIIINDSLFNSCKQYISSDQHGFF FT PKRSVTTNLVEFSTQCIRAIDAGKQVDAVYLDLKAAFDRVDHGILLQKLRK FT CGVSDSFIDWFESYLTNRSLCVKIGSNESSSFTNLSGVPQGSNLGPLLFSL FT FINDASLVLPPGTRLFYADDTKVYMIVDSLDDCHHLQRLLNDFEAWCSRNC FT MTLSIEKCQVISFNRKRNPIHFQYTLSGTAIERVQRVRDLGVWLDEEFTFN FT YHFNDIVSRANKQLGFVLKVTEGFSDPLCLRSLYCALVRSILEFAAIVWCP FT FHASWITRIESVQRKFLRHALRNLPWRDPSNLPPYEDRCRLLGIDTLENRR FT CVSQAAFVAKLLQGDIDSPSLLADVSIYAPERNLRRRNFVSLGSRNTLYGQ FT HDPTRYMATKFNEIYHLFDFNITMTTLRNRFAEYFRHS" XX SQ Sequence 5055 BP; 1304 A; 1229 C; 1114 G; 1408 T; 0 other; gcgatcgtgt ttgatggttg tttacgttct gtaccgataa ctagtaccgt tttttctcgc 60 gtaaatttca ctatcggaat tgaatttcgt gtattgtgga agccgtgtga ttgaacaaca 120 cctgtcgtgt ggacgtttac gcgtagatcg tgaaaacatt gcgaaatcga ccgtcgtaaa 180 tatactgttt gctgcgaaat attagtgctg tgttttgttt tacttgctgt gctctctgtg 240 cactttttcg gctaccgtac ttctcccgtg ctgcgataaa caacgagaaa cgagacactg 300 cagaatccaa attcaagcgt aacatttgaa aagggcctaa ctggaaaacc accgttgctg 360 cctctaaagt gcctaattca tcatgacatc ggcgtgtgat cactgcgcaa agcctatcaa 420 aggagaggac gaatctgtct cctgtatggc gttttgtgat aggatgattc atttgagatg 480 ttcggttacg aagttgaaca agaatttcgt gaacattgtg caaacctgtc caaacttgtt 540 ttggatgtgc gatgaatgtg tgaaactaat gaaatgtgcg cgtttcaagg ccactgtctc 600 atcgtttggt gatgcgatta acgcaatgac cgagagacaa gaacttgctc acgccgaact 660 gagaaaggaa atcgctaagc aaggtgagca gattgctaag ctttccaaag ggattgcact 720 ttctactccc accctctctg gctccggggg actcgtgcgc caacctccgc tgaaacgtcg 780 acgtaatgag ggattgtcta ctagcaaacc attagtcact ggcacgagaa tcgtagctga 840 tgacaatgtt ttcacggaag ttcgtacagt tcccgaacct cctgaaatgt tttggctgta 900 tctctcacga atccatccta gtgttaaatc agaatcagtt gagaaactgg ttaaagactg 960 tgttcattgc caggatccag tcacggtagt gccactagta aagagaggcg ctgacaccag 1020 tcgaatgagt ttcatatcgt tcaaagtcgg aatggactcg aaatttcggg agaccgcact 1080 taactcggac acatggccgc aaggtatcct gttccgggaa tttgaaggca ccggttcaaa 1140 aaacatgtgg ctacctcagt tgaccacacc taccatttcg atcactccgg cgatggaacg 1200 ttccccattt gtgacgccca ccacctcgat ggaccaacgc taattccgca actcagacgc 1260 atcgtagaaa gctttttgga agcccctaat cgccccaatc cagtcctgcc tcctgctgcc 1320 tgcgctcatc actgtcgtct gagtcctgtg attggtggtc gaaagggggt cttccatcaa 1380 gcttgcctag gcaagtattc ttgtaattca aactattcgt tgcgtgataa aatttcctgc 1440 actcagaacg tttgctgctc agaccacaac aacttaccat cgacgcctac cacacatcta 1500 atgcatcaag caactttacc aggacgcacc acagaaagca ctttgggagt cgccagttac 1560 tctagctcag tcgcgccatt tgcagccagc gttcttcaaa gtcgtcctgg ccctgagtct 1620 agttgtggat tggcggcctc ccagtccgct atctcaggca agtacaatga tgatcgagag 1680 ctttcactgc ctgatacgaa tcaaatttat agtacctccg gctacacttc atcatcttca 1740 ccgtatcctg acttgacgtt ggaagaagta ctccatggct tagctgaaga catctctctt 1800 ggcactccta ctgcgaccaa tgctgattcg tcatcttttg acatgcgtag taacaacgag 1860 catcgttttc aacacccggg acgcacaatc cttagcacta cgggagcccc tttgtccccc 1920 tctacagtcg tgaccatgca gccagcgctc gacagccgtc ccggtcctgt agtagggtgt 1980 ggtgaagggg tctcccaacc tgtgcagtca ggcaagtaca cagaatgtgc tgacagtata 2040 ctacctgacg atctcatacc ttccagtctg cgcatctact accaaaacgt ccgtggcttg 2100 cgtacaaaaa tcgagtcctt cttcctcgct gttaccgagc tggattatga tgtgattgtt 2160 ttaaccgaaa cctggctaga ccaccgcata ttgtcagctc agctatttgg gagtcactac 2220 tccgtgttca ggacagatcg cagcgctcta aacagcaata aatcgagagg tggcggagtc 2280 ttgatagctg tttcttcgca attcaattgc tgttcggacc ctacacctgt caatgaccag 2340 cttgaacaac tttgggttag gatcctgttg cctgacagaa atgtgagcat tggggtattt 2400 tacctaccac ctgacatgaa gaacgatgcg gatgttattc gttgtcatat tgagtcaatc 2460 gcagccgttc acaacaacct ctcagaaaac gacttaattc ttcagtttgg tgattataat 2520 cagtcagaca tctactgggc tgctgaagcg gacaatcgtc tctctatcga tctagatcgc 2580 tctcgtatgt gcccagcgag ttctgcactt ttggatgggt tttgcttcca tggactctct 2640 caggtcaaca cattgacgaa cacgacaaac cgtaccctcg atctcgtcct cgtctgtgat 2700 actgtgttgc ctaatttcac acttcttgca gcggccgaag cactcacgac acttgatgac 2760 catcatcctg caattgagct gatagtcatg tgcccgttgc cagttacata cgagatgtca 2820 cacgactaca ctggatttaa cttccaaaaa gctaattttg agtcattaaa tgtcgcatta 2880 atgtctgttg actggaacca tctggattcg atccctgata ttaatgaagc tgtcgatttc 2940 ttcacgaatt ctgtgctcca agcagtcacg aataatgttc caccgcgtcg cccgcccagc 3000 aagccgcctt ggtctaacgg tcaattacgt ctgcttaggc gccgacgttc tgctgccctt 3060 cggtgctatt gcaataatcg atcgcaacag acaaaattgg cattcaatga agcaagtcgt 3120 gagtacagga attataatgc tttactgtat gctcgctata aacgaagaac ggaacagaac 3180 cttcgtacga acccaaaaca attttggtcg ttcataaatt cgaaaaggaa agaaaacggc 3240 ctaccgacat cgatgtacct agacgagcag tcagccgact gcgccagcga caaatgcgaa 3300 ctcttcgcgg cacagttcca acgtgctttc aacaatttcg ttgctgcgcc atcgcaagtt 3360 catgttgcat tggatgatac tactcgtgat gtcttcagct atgaaatgtt tgaaatctct 3420 gaacgggaag tagcaagtgc gataaccaag ctgaagccct cctacactcc tggaccagac 3480 gggattccgc cagcgctact aaaaagatgc tcagcgttgc ttataccttt ggtgaagctg 3540 ttcaatcgtt cacttggatg ccgagtattc ccacgaagtt ggaaaatgtc gttcctattt 3600 ccaattcata aaaaaggtga caaacgtagc gtgagtaatt accgtggtat tacttcacta 3660 tgtgtctgct caaaggtatt tgaaattatc atcaacgact ccttattcaa tagctgcaaa 3720 cagtacatct ccagtgatca gcatggtttc ttcccgaaaa ggtcggtcac tactaacctc 3780 gttgagtttt caacgcaatg catacgagcg atcgatgctg gtaaacaggt ggacgcagtg 3840 tatctggact tgaaagcggc gtttgatcgt gttgaccacg gaatccttct acaaaaactt 3900 cgcaaatgtg gcgtgtccga tagcttcatc gattggtttg aatcatatct caccaaccga 3960 tcgttatgcg taaaaatagg ctctaatgaa tcgagctcat tcacgaattt atccggagtg 4020 ccacaaggca gcaatcttgg gcctttgttg ttctcactat tcatcaacga cgcatctcta 4080 gtcttaccgc ccgggaccag gctgttctac gccgatgata caaaagtcta tatgatcgtc 4140 gactccttgg atgactgcca tcaccttcag agattgttga atgatttcga agcgtggtgc 4200 tcacgaaact gcatgacgtt gagcattgaa aaatgccaag ttatatcgtt taaccgaaag 4260 cggaacccta ttcatttcca atacacgcta tctggtacag cgattgaacg agtacaacgt 4320 gtccgtgatt tgggcgtttg gttggacgaa gaattcacgt tcaactatca cttcaatgac 4380 atcgtttcga gggcaaacaa acagcttggt tttgttctga aggtaacaga aggtttcagc 4440 gacccgctgt gtctgagatc cctttactgc gctctagttc gttctattct ggaatttgct 4500 gcgatagtgt ggtgtccgtt ccacgccagt tggattaccc gaattgaatc tgttcagaga 4560 aaattcctgc gccacgccct tagaaatctt ccttggcgtg atcccagtaa cctgcctcct 4620 tacgaagacc gctgtcgttt attggggatt gatactttgg agaacaggcg ttgtgtatcg 4680 caagccgctt ttgtagccaa gttgcttcaa ggggacatcg attcgccatc ccttctggct 4740 gatgttagca tctacgctcc ggagcggaat ctgcgtagac ggaattttgt cagtctcggc 4800 agtcggaaca ctctatacgg acagcatgac ccaactagat atatggcaac gaaattcaac 4860 gaaatttatc atcttttcga ttttaacatc actatgacga cgcttcgtaa tcgatttgct 4920 gaatacttta gacatagtta agtgtgcatt tcatgtaatg ttgatgtttg ttgctcgttt 4980 tgttttttag ttttacttat attcattaag acaaatttta tgtcagatgg atcaaataca 5040 aataaataaa taaat 5055 // ID EnSpm-14_HM repbase; DNA; INV; 5872 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5872 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 385-385 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 287..2305 FT /product="EnSpm-14_HM_1p" FT /translation="MSRKEYMKNWRKKDNLLRKFLYESDKDEDFLEKKFEK FT QKPCHIEDSSTPNSSLNCTTICSNHLNQLESSSQLDSSIQHYSSCELYSSS FT QLDSFSEFDSSSSSEENVVETDVSNSVLDSKETYFILNLKKWFIKNRITHT FT ASNELLSLLRCSGHSALPKSSRTLLKTQRCVLIEKKCGGEYIYLGLRNGIS FT RTFRENPLFPIENNIISLLVNIDGVPLHKSSNSQLWPILCQFSKFTPFIVA FT IFYGSQKPTNLKDFLYDFLIEFANLRACGYDKDGVNLLVEIRAFICDAPAR FT QYLKCVKGHTGYYSCERCEIKGKSELFRMILDNTNCVPRTNELFKQYLYKN FT THQISRSILIDFNVNCVDSFVLDYMHLVCLGVVKRILIFLTNGPLPYRLSS FT AQLLNISENLLKFSGKFPSEFARQSRSLYEVKRWKATEFRQFLLHTGFLVL FT KDAFTPERYQHFLSLTLAMRIFLDSNAIVRNNHLQYAKELLNYFVVKSKEF FT YGATFSVYNVHSLSHLHEDVSFFNCSLDEISSFQFENYLQVIKKFIRKPQR FT PLSQVLKRFSEIENLEINCNQPAKSHRVIISTKYKDSWFLLYTGEFAQIIE FT KLGDDQFNCVVLKNCLYKSVFTDPCDSKIFNIVFVRNIKTHLKPTRLSKKH FT FQRKAVCLAYKEGFVLSCLCHDI*" XX SQ Sequence 5872 BP; 2074 A; 756 C; 867 G; 2174 T; 1 other; cccagcaaac acacgacgtc gtgacgacgt tgaaattagg ttatttttag gttatgacgt 60 cggacgacct tgtaatgacg tcgctaggac gtcgttttaa cgacgtcttt taaagaccct 120 tgttttgacg tcgcggggac gtcagtttta cgtcgtcttt taaagaaacc atttatcaga 180 tttttttttt ttaactgttg cgttcaggtc cgaattactt tttttttact tcacacgctt 240 ataatataac tttaatggaa gctttttcta aaagaaattg agaaaaatga gcagaaaaga 300 atatatgaaa aattggagga aaaaggataa tttattaaga aaatttctat acgagagtga 360 caaagatgaa gattttcttg aaaaaaagtt tgaaaaacaa aaaccatgcc acatagaaga 420 ctcatccact ccgaatagtt ctttgaattg tactacaatt tgtagtaatc atttaaatca 480 actagagtct tcgagccagc ttgattcttc cattcaacac tactcttctt gcgaacttta 540 ttcttccagt cagcttgatt ctttcagtga atttgattct tcttcaagta gcgaagaaaa 600 tgtagtagaa acagatgtga gtaatagtgt gcttgactct aaagagacat attttatttt 660 aaacttaaaa aaatggttta ttaaaaatag aattactcat acagcaagta atgagctact 720 ttctctttta agatgcagtg gacattcagc tttacctaaa tcttctcgta ctttgttaaa 780 gactcaacgc tgtgttttga tagaaaaaaa gtgtggtggg gagtatattt atttagggtt 840 aaggaatggt attagtcgaa ctttcagaga gaatccttta ttcccaattg agaacaacat 900 catatcttta ttggtaaaca ttgatggtgt gccattgcat aaatcttcaa actcacaact 960 ttggcctata ctctgccagt ttagtaagtt tactccattt attgttgcaa ttttctatgg 1020 tagccaaaaa ccaacaaact taaaggattt tttatatgat tttttaattg agtttgcaaa 1080 tttaagagca tgtggttatg ataaagatgg agttaattta ttggttgaaa tccgtgcatt 1140 tatttgtgat gctcctgcta ggcaatatct taaatgtgtt aaaggtcaca caggttatta 1200 tagttgtgaa agatgtgaaa ttaaaggtaa gtccgagttg tttagaatga ttttagataa 1260 tacaaattgt gtgcctcgta ctaacgagtt gttcaagcag tatttgtata aaaacacaca 1320 tcaaataagc cgtagcatat taattgattt taatgttaat tgtgttgata gttttgtctt 1380 agattatatg catttagtat gtcttggcgt tgtcaaacga atacttattt ttctaaccaa 1440 tggtccttta ccatatagat taagttctgc tcaattgttg aatatatcag aaaatttgtt 1500 aaagtttagt ggtaaatttc catctgagtt tgctcgtcaa tcaagaagtt tatatgaagt 1560 aaagagatgg aaagccactg agtttcggca atttttgtta catactggct ttttagtttt 1620 aaaagatgcc tttacacctg agcgttatca gcattttctt tctttaactc tagcaatgag 1680 aatttttcta gattctaatg cgattgtccg aaataatcat cttcaatatg caaaagaatt 1740 gttgaattac tttgtagtta aatcaaaaga attttatggt gcaacttttt cggtttataa 1800 tgtacacagc ttatctcatt tacacgaaga tgttagcttt tttaactgtt ccttagatga 1860 aatatcttct ttccaatttg aaaactattt gcaagtaatt aaaaagttca ttcgcaaacc 1920 tcaaagacca ttatcgcaag ttttaaaaag attttccgag attgaaaatc ttgaaataaa 1980 ttgtaatcag cctgctaaat cacacagagt tattatttct actaaatata aagactcttg 2040 gtttctcctt tacacaggtg aatttgcaca gattattgaa aaattaggcg acgatcaatt 2100 taactgtgtt gttttaaaaa attgtttata caaaagcgtt tttacagatc catgtgactc 2160 taaaatcttt aacattgttt ttgttagaaa tataaaaact cacttgaaac ctactcgact 2220 aagcaaaaaa cattttcaaa gaaaagcagt ttgtctagct tacaaagaag ggtttgtctt 2280 gtcatgtttg tgtcatgaca tataaagttt gtttaaaatt tttactaaat agtaaacttt 2340 gaaataattt ttggaacttt tgtggcgtat tagattacat ggcgtattag attattagat 2400 tgtggcgtat tagagtaaca ctatatgtaa tatatactaa acaatgttag acatcaaata 2460 attaaattaa atttttcaaa gtttaccttt ttggtgattg tctttaatgc ctgtaagatt 2520 ttaatattaa tttttaattt ttttaaaact ataaattatt tttttttaat tattttttaa 2580 attaattata aaaatacaat tattttgttt atttaaatta ttataacaaa ggatattact 2640 aaaaattatt atttattaaa gtatttatag tagaatatta ctgtttccta tagccaaaat 2700 tgtttttata agaaattaat atatattttt aataaacaat attttaaaca gtttttcaaa 2760 taaaaactat tatatattaa aaacaaatat aaataataca aagatgtcaa tgaaagattt 2820 atgggcatta gttgtaactt caagcaaaga caaaattgga aaaattagag agacggaaac 2880 agttgttcca tttcattgga tttcctcaga tgagaaatgg ttgttttggc ctccttttcg 2940 aaaagatgaa aaatatacaa aggcaattcc caatgaagtt acttggaaga aatttgaaat 3000 tttaaaatta aaattacgag gtaatgattc tgtttttcct tatttgttgt aactatttta 3060 ttaacaattt attagttttt cttctactta ttaaaacatt taggtgaaaa agatttgtgc 3120 gaatatgtgt ttggaaacag tattctgtcg tcatcaagcg aaaaagagtg tgtagtgagt 3180 gaaataaagg atattgattt taacagtaag acaaatttaa aaatatttag aatttaaaat 3240 tttattcaaa tattttttgc aatctgacat tatttgcggt attaaaaata ttgttttttc 3300 tttgttgata attttattgt tattattata gaagtggaat tgcctaaaaa accttctgaa 3360 aatattatgt caccaatatt ttctttgtca agtaaatcta ataaaaaaca ggaagtgtgt 3420 ttacaaccaa catcttgtaa aggttattct tctgtttatt tgtcaccaat aagttctgag 3480 tcactatcac ctagccataa attaagatta cgttctcatt cacctgtctc caaaacaagg 3540 tcatgttcac tctcacccat cttaatgtct gaatcaggct cactttcgcc tatttgcaag 3600 tcaagatcaa aatctcgctc ccgttcacct gtgcatagct ctagatcata ctctaaaaca 3660 cagaatagtt cccaatcaac attttcaaag tcaaaatcta aagcaaagaa actggttcga 3720 tgtatgaata gtaatttccc aatggatgaa gcacgtaagt taacaaagtt ttgtttaaat 3780 attattgact aaatcatgac tttattacta aataataaat atattgacta aaattatcta 3840 tgaaattcat ctagaaaaaa aagctgacgt tatatgtcag ttttcccaac ctaattaaag 3900 tacttttatt tagtttactt agtttgaaaa atggtatatg aaagaataca caattgtcta 3960 aagaagaaaa ctgtctcatc gattcacttt tgaaaataaa ctattgctct atactattac 4020 ttttaatttt attttaaact gatattttct gaaatgttag tattatataa tttacatatt 4080 aattgaaaca aaattgaata tggaaattat aattttttta tattaaattt taggatttca 4140 aaaaaaaaca atatctcttt taacttgcat cgttgactta ttggagaatc gagttcacca 4200 aacaaataat cttggtacaa aagacgtgga aaaagttgat aacatggata aattcaaaga 4260 actagaggag tctttatcaa aagaagactt tttttcattt atggtaaagt gtttttaatt 4320 tttattgttt taatgcccta tacaaattta tttttgtgaa atatttaatc tattctttgt 4380 aaaacgtgct gtcaacatgt ccgtatgaca tattataagt ttaaagtaaa caattattaa 4440 ccccactaac agtcgaagtt attaatgtga tttttttttt aatgttattc aaatatgtat 4500 atgcaaatat atatatatat atatatatat atatatatat atatatatat atatatatat 4560 atatatatat atatatatat atatatatat atatatatat atatatatat atacatatat 4620 atatacatat atatatattt aggttggtca gttgaaaatg gtcggtggca agacagttcg 4680 tgaaaatgtg agaaatgtca tgaaaaggta cataccaata tacattgata tattttcaag 4740 tttatatgtc ttattccata ttaatggatt atataactga tagacaataa tatgggttaa 4800 aattaaaatg atatgttttg gaaattacat cggaacaggt tgcaaaattg actataaaaa 4860 ttggaactca acagctgaat tcataaggtt gcctctaaaa tttagaattt agcaggttga 4920 gtttgtggat ggtttttata tctaagttaa gaaataatcg tttggtaatc tttatttaat 4980 aactcttttg taagatttct acaattaatt aacaatttac tcctctacgc agaaacaatt 5040 attttatttc agtttcttat ttttttaaaa agtatttgta ttcctgataa tttcttttaa 5100 tttaggttaa tgtctcacac tgttatgtca tgttttaaca tgtttggaac aaatagacac 5160 ggacaaaaaa aactatcatt taaagaaaca aagatatgta gtgtggttgt cggtgagtta 5220 agattgacat atttatattt ttaattgata tactataaaa tgagtaaaaa taactttaaa 5280 gtatttcaac atatttcatt ttgtttagcg agtgtgttat caaatcatga gacaactgaa 5340 gtcgaaattt tggattcagt aaaaaacgtt ttaaagtatg ctccagaaaa aaaaagagat 5400 gtatagtttt taatttatag aaataaatcc aagcaaatgc attatatttt tatttgagac 5460 tttatcgtgg ccattctttc atagatagag ttagaaactg taattggtta aaacatcagc 5520 aataattaaa aaacagaaat atgttgtatt atggcgcttt agttaaatat ttaaatattt 5580 aaaacacata catgtgtgat tgttaaaaga tgttatttaa ataaataata taatttttat 5640 gaataacaaa tataaaaatt gcatttaaaa acaagagagg catatttagt cggctttccg 5700 accaataaac gacattttga aratcttttt tgcgacgtcg tatcgacgtc tatttttcac 5760 cccggatacg acgtcgtgaa attgttacac tttccgacat tacagcgacg tcaaaacgac 5820 caacctcgtg acgtcgaaac gatgtcgtac cgacgtcgtt gtgcttgctg gg 5872 // ID CR1-7_HM repbase; DNA; INV; 3687 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3687 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1835-1835 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1123..3276 FT /product="CR1-7_HM_1p" FT /translation="MMPIINKSTRITQHSCSAIDNIFTNTLFHSTFKSGII FT RSDITDHFPVFAIFKKPKDLQKKDWYFKKRIFTTRAKNMFKLELSTVNWNF FT VFLETCANNSYDLFLDKFFNLYDKHFPNKIIRIKTKGLLNPWFTKSLLKSS FT KRKQKLYIRFINSKKERDKHNYLEHKRLYERTKKIARTTYFKNKLLELKGN FT CKKTWDAIKEILGKQKIHNKALPSVITVDGTQISDVKSVATKFNEFFVNIG FT PNLASKIKHTPAHFLSYLSKSDKSLAFKKLTSKELDDSIKKLNKNKATGYD FT AISASVIKDCYELIKDVLLDIFNKSLKTGVFPDKMKVAIITPIFKSGDESL FT VTNYRPISVLPVFSKILERIMYNRIFAFLYDNNLLYYNQYGFRSNYSTEHA FT LMQLVEVISEASSHNKFTLGIFLDLSKAFDTVNHQILLEKLKYYGISHEYH FT KWFTSYLTDRKQFISLNDGFQTNKLKISCGVPQGSILGPLLFLIYVNDLYK FT ASNTMSTIMFADDTTLFYSHSNIISLFQVANNELEQISNWLIANSLSLNAD FT KTKFSLFHSKRKFQSIPKSLPMLYINNINIERSFTNKLLGILIDENLSWRD FT HINHISNKISRSIGILYKTRQILDKSTLTQLYFAFINSYLSYGNIVWGSTN FT KSKLDCLLKQQNHAARIIHFQNRFTNAKPLLKSMRALDIYELNIFQNYTIY FT VSTSVKKNAEGFRKYFP*" XX SQ Sequence 3687 BP; 1413 A; 644 C; 474 G; 1154 T; 2 other; aaacgttatc atctatggac tagaagcaac ccaatgttct tctcagacgg aaaaactaga 60 aaatgataaa tcgcaagtaa ataatttact tagtgctatc gaatgcgata gagcaaaagt 120 gagatctatt cgccgtctta aatctaaagg aactatcgat cctttgctta tcgagcttga 180 agacgttaat aacaaaaata tggttcttaa agcatcaatt aaattaagaa aaatcaaaaa 240 atatgaaaat gtgtatataa atcgcgatct aacagcaagt caaagattca ttctaaaaaa 300 taaattaata gaacgaaacg aaaaaaatgc gttaagaagt gaacaagaaa aaattaaatt 360 tcattacggt atgacaatct tactcgaatt tttcacaaat agccactagc aaccactaat 420 ttagcaacgt catcaaaagt atctccccca aaaaagacaa atattgacta tactcgaata 480 aaatctcaat ttattcaaga tccacttctc aacataaaca aactgaacac ttcttacttt 540 gattcggata aagccgctat tttccttaaa agatgtgaat tatatttttc taatcaattc 600 tcagtatttc atttaaatat tcgtagtatt actaaaaact tcgacgaact aaaagtccta 660 ctagataatc ttaactttaa ctttagctta atttgtttaa cggaaagctg gattttagat 720 acaacattta atcaaaattc taaatttata ttgcccaact acaaagccat tagttttgaa 780 cgtaagtctt caaaaacagg cggtggtata tgtgtctatg tgcttgatga atttacttat 840 caaattagat atgacctgtc gttctctgag ccggattttg aatctttcac tttagaaatc 900 tacaataagt catctaaaaa tattattgtg acaactacat accgaccacc aaacggatgt 960 ttaaataact ttcaaaatca cttaagtgat tattttgaaa cagcatgcaa aaaaaataaa 1020 gaccactcat ttttatttct tggagattat aacgtcgact atttaaaata tggtgaaaat 1080 aataaagtac aacaatttta cgatagcttt aatgagtttg gaatgatgcc tataataaat 1140 aaatcaacaa gaataacaca acactcttgt tctgcaatcg acaatatttt cacaaataca 1200 ctcttccatt ccacatttaa atcaggcatc atcagaagtg acataaccga ccattttcca 1260 gtctttgcaa ttttcaaaaa accaaaagat ttacaaaaaa aggactggta cttcaaaaaa 1320 agaatcttta caacccgagc aaaaaatatg tttaaactag agctatcaac tgtaaattgg 1380 aactttgttt ttctcgaaac atgtgctaat aacagctacg accttttctt agataaattt 1440 ttcaatttgt acgataagca ttttcctaat aaaataatcc gtattaaaac taaaggtctt 1500 cttaatcctt ggttcactaa aagcctttta aagtcttcga agcgaaaaca aaagctatat 1560 atacgcttta ttaattcaaa gaaagaacgt gataagcata attacttaga gcacaaacgt 1620 ctatatgaaa gaactaaaaa aatagccaga actacatact tcaaaaacaa gctgttagaa 1680 ctaaaaggta actgcaaaaa aacgtgggat gctattaaag agattcttgg aaaacaaaaa 1740 atacacaaca aagccttacc aagcgtgatt actgtagacg gcacacagat ctctgatgta 1800 aaatcggtgg caacaaaatt taacgagttt ttcgtaaata ttggtccaaa cttagccagc 1860 aaaattaaac acactcctgc acatttcttg agctatctat caaagtcgga taaaagctta 1920 gcatttaaaa aactcactag taaagaactt gatgattcaa taaaaaaatt aaataaaaat 1980 aaagcaacag gctatgatgc tatttcagca agcgtaatca aagactgcta tgaacttatt 2040 aaagatgttt tattagatat ttttaacaaa tcgcttaaaa caggtgtttt tccagataaa 2100 atgaaagttg caattattac acccattttt aaatcaggag atgaaagtct ggttacaaat 2160 tatcgaccta tatcggtatt acctgttttt tcaaaaatcc tcgaacgtat catgtataat 2220 aggatatttg catttttgta tgataacaat cttctttatt ataaccaata tggttttcga 2280 tcaaattact ctacggaaca tgcacttatg caacttgttg aagttatcag tgaagcatct 2340 tctcataaca agttcacctt aggtatattc ctagacttgt caaaagcatt tgacaccgtt 2400 aaccatcaga ttttattaga aaaactaaaa tattatggaa taagccacga ataccataaa 2460 tggtttacca gttacttaac tgatcgaaag cagtttattt ctcttaatga tggctttcag 2520 acaaacaaac ttaaaatatc atgtggggtt ccacaaggct ccatcttagg tccgttgctg 2580 ttcttaatct atgtcaacga tctatataaa gcatcaaata ctatgtctac tattatgttt 2640 gcagacgaca ccactttatt ctattctcac agtaatatta tctctctatt tcaagtagcg 2700 aataatgaac tagaacaaat atcaaactgg cttatcgcta acagcctatc tttaaacgct 2760 gataaaacta agttttcctt attccattct aaacgaaagt ttcaatctat tccaaaatcc 2820 ttaccaatgc tatatataaa taacatcaat attgagcgct cattcacaaa taaattactt 2880 ggtattttaa tagaygaaaa cttatcctgg agagatcata taaatcatat tagcaataaa 2940 atctcaagaa gtattggcat cttatataaa accagacaaa tcttagacaa atctacacta 3000 actcaactat attttgcatt cataaatagt tatttatctt atggcaatat tgtttggggt 3060 agtacaaata aatctaaact cgactgtctc cttaaacaac agaatcatgc agcacgcatt 3120 attcattttc aaaatcgctt cacaaatgct aaacctcttc taaaaagcat gagggcactt 3180 gatatttatg arctaaatat ttttcaaaat tacacaattt atgtttcaac atcagtcaaa 3240 aaaaacgccg aaggttttcg aaaatatttt ccataaaaca ttaaaccgat acaacacgcg 3300 atctaccgga agatttaaaa aacctttccg ttcatcaaaa cttacgagct ttgccatttc 3360 ctatcgtggc ccaatgctat ggaaccattt tacaagtctg agtggtaaac ttaaagaagc 3420 caatacctct caaaaattta aaaagaacat aatcgaactt attgccaatc acgattcttt 3480 acatgatttt ttctaacaat acacgattat ttccaatttt tattatcaaa ggttactgat 3540 gaaaagactt tatcttcttc aggttctccg gtcttgtaac tattataaat aatgaaatgt 3600 acttgtattt aatatttcta aatattttac ggaaattatc tttaattgta aatggttgct 3660 tgacaaaaat tgaaataaaa aaaaaaa 3687 // ID Copia-114_AA-I repbase; DNA; INV; 3989 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-114_AA_; KW Copia-114_AA-LTR; Ty1_copia_Ele29; Copia-114_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3989 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1412-1915] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(8..2182,2186..3979) FT /product="Copia-114_AA-I_1p" FT /translation="MGTTQKELAHGIPQFRGNGSEFQNWSYRVKLFIEAAG FT VSVVLTGEAPEVEADRAKFVEMDRKAKSLLVSFISDECLEVVREKATAKEM FT WESLEATYAKKSVASQTLIRKQLARLRMKEGNGMRSHLVEFDGLVRKLKTA FT GATLAEGDLVSQLFLTLPDSYDPLVTALENVSEEDLTLDLVKQRLLAEESK FT RSDRQEDASEDKPAAFSGSSQRKFDGKCHKCGMRGHMKKDCKQRGQDDGHS FT NGARGNAHSNGRGKREANCVKRSAVSFMADASLSKRDRTNTVAFKLDSGCS FT DHLVSQKSFFSSFETLVRPIEIKVAKEGQEMVARTIGKIDATSNKGVLFNM FT KDVLFVPDLRENLMSVKKLARAGIDVVFSDNKALLKKRGEILATGKLVGSL FT YEIVLTLENVCANVCETDPGTLWHRRLGHISQHGMDTMLREGMLQGIKKLP FT TVGFCEACVQGKQCREPFNGTRSHASRPLERIHSDVCGPIDPTAWDGSKYI FT VSFIDDYTHFAMIYLLKKKSDVFEKFREYEATVSAAFGLQISKITVDQGRE FT YCSNAQKQYYKRKGIQIESTVAYTPQQNGVAERFNRTLVERIRAVLIDSGV FT PKYLWSEAALACVYLLNRSPTVALPVGVSPAERWFGSKPSLDKLRVFGCRC FT FAWIPNQQRKKLDPKSREMIMIGYAPNGYRLWDMQARKVRIARDVKFDEDT FT FPYAQLKPVDDVPLVVNFPFEQEGESADEDLGIVPNEAESDSGGDEAAGGT FT IEDGDISTDYEDSADALPSQIERHETPSEPVSRRSERERRLPGKLFDYFVN FT YRATSGESISTDVPMRYDDIMQRDDRQQWLGAVKDELHSMAANNVWRFVKC FT PDGVKPLKAKWVFRVKDDVNGNPVRYKARLVAKGFLQKAGLDYEETFAPVA FT KLATIRTILAAGVHQGFSFHHMDVKTAFLYGELKENIYMAVPDGVKAPSNT FT VCQLLKSLYGLKQSPRCWNEKFNEQLLKLGFTRSRHDYCLYTRIQDGDEIY FT IVLYVDDLLICGRNPMTILKLKRQLTTMFAMTDCGEVSHFLGMRLDYDVRR FT GSLRLTQEANVDKLLARFDMVDCNPVKTPMEKGIQLTRTGEPTDRPYRELL FT GSLMYVMLCTRPDICYSVGFLGRFQQKPTTQHWQSLKRVVRFIKGTKTKCL FT EYKRTESAEPLVGYVDADWASDVEDRKSVSGFIFLVFGNLVSWSSKKQTTV FT ATSSSEAEYIALSSAVSEAVWLSGILDDLKLKSSNIPVTIYEDNQGCIGMA FT KNCESKRSKHIDVRHHFVRDHISNGSVKIEYVRTEDQLADLFTKPMDASRL FT GQLGRRLGIAD" XX SQ Sequence 3989 BP; 1080 A; 873 C; 1124 G; 910 T; 2 other; ataggttatg ggcaccacac agaaagagct ggcgcacgga attccgcagt tccgcgggaa 60 tggaagcgag ttccagaatt ggagttaccg ggtgaagctt ttcatcgaag cagctggcgt 120 gtcagtggtt ctcaccgggg aagccccgga agtagaagct gatcgagcga agttcgtgga 180 aatggaccgg aaggcgaaat cactgttggt gagttttatt tccgacgagt gcttggaagt 240 cgttcgggaa aaggcaaccg caaaagaaat gtgggagagt ctggaggcaa cgtacgcgaa 300 aaagtccgtg gccagccaga cgttgataag gaagcaactg gcacggctcc ggatgaaaga 360 aggcaacggt atgcggagtc atctggtgga attcgatggg cttgtacgaa agctcaaaac 420 ggccggtgca accctagcgg aaggggatct ggtgtctcaa cttttcctga cgctaccgga 480 ctcgtacgac ccgttggtga cggcgctgga aaacgtaagc gaagaggatt taacgctcga 540 tttggtgaag caacggcttt tggctgagga gtcaaaacgg tcggatcgac aagaagatgc 600 cagcgaagac aagccggccg cattttccgg aagcagtcaa cgcaaattcg atggtaagtg 660 ccacaaatgt ggcatgcgtg gtcacatgaa aaaagattgc aagcaacgtg gtcaagacga 720 tgggcatagc aacggtgcac gtggtaacgc ccatagcaac gggcgaggaa aacgcgaagc 780 caactgcgtt aaaagaagcg cggtgagctt catggcggac gcatcgttat caaaacgtga 840 ccggaccaac actgtggcgt tcaaattgga ctctggctgc agcgatcatt tggtgagtca 900 aaaatctttt ttctcgtcgt tcgaaacgct agtgcgccca atagaaataa aagtggcaaa 960 agaaggtcag gagatggttg ccaggacgat cggtaagatt gatgcaacga gcaacaaggg 1020 ggtcctgttc aacatgaagg atgtcctctt tgtcccggac ctgagagaaa atttgatgtc 1080 ggtgaaaaag ctggcacgag ctggcatcga tgtagtgttc tccgacaaca aggctctgtt 1140 gaagaaacgt ggcgaaattt tagcgaccgg caagttggta ggtagcctat acgaaatagt 1200 gctgacgttg gaaaacgttt gcgcgaatgt ctgtgagacg gatccgggca cgctgtggca 1260 tcggcggctg ggtcatatca gccaacatgg aatggacacc atgcttagag aaggtatgct 1320 gcaaggaatc aagaagctac ccacggtcgg tttttgtgaa gcttgcgtgc aagggaaaca 1380 gtgtcgtgag ccgttcaatg gaactagaag tcatgcaagt cgccccctcg agcgcattca 1440 ctccgacgtk tgtggcccaa tcgacccgac agcctgggac gggtcaaagt acatcgtttc 1500 cttcatcgac gactacacac attttgcgat gatctacctg ctcaagaaga aatccgacgt 1560 ctttgaaaaa tttcgtgagt atgaagccac agtttccgcc gcatttgggt tgcagatttc 1620 gaaaatcacc gttgaccaag ggagagaata ctgttctaac gcccagaaac agtattacaa 1680 gcgaaaaggc atccaaattg aaagcaccgt tgcgtataca cctcaacaaa acggggttgc 1740 tgagaggttc aaccgaactc tcgtagaaag aattcgtgcc gttctgattg attctggtgt 1800 accaaaatat ctttggtccg aagcggcgtt ggcgtgcgtg tacttgttga accgaagccc 1860 cacggttgcc ctaccggtcg gtgtttctcc ggctgaaaga tggtttggtt cgaagccaag 1920 tttggacaag ttacgagtat ttgggtgtcg gtgtttcgca tggattccaa atcagcagcg 1980 caaaaagttg gatccaaaga gccgtgagat gattatgatc ggctacgcac ccaacggata 2040 tcggctttgg gacatgcaag cgcgaaaggt gaggattgct cgtgacgtaa aattcgacga 2100 agatactttt ccgtatgctc agctgaagcc tgtcgacgat gttccactgg tggtgaattt 2160 ccctttcgag caagaggggg aaakctctgc agatgaagat cttggcattg tgcctaacga 2220 agctgaaagc gattctggtg gtgatgaggc tgctggtggc actattgagg atggcgacat 2280 ttcaacggac tacgaggatt cagcagacgc gctcccttcg caaattgaaa gacacgaaac 2340 tccgagtgag ccggtgtcga ggcgtagcga acgggagcgc aggcttcctg gtaagttgtt 2400 cgattacttc gtaaactata gagcaacttc tggcgaatcg atttccacag atgtacccat 2460 gcgttatgac gacatcatgc aacgtgacga ccgacaacaa tggcttggtg ccgttaagga 2520 cgagctgcat tcgatggctg caaacaacgt gtggcgtttc gtcaagtgcc cggatggcgt 2580 caaacccctt aaggccaaat gggtgttccg agtcaaggac gatgtgaatg gaaatccagt 2640 gcggtacaag gcgcgacttg tcgccaaagg ttttcttcaa aaggctgggc tggattatga 2700 agagacgttc gcccccgtgg cgaagctggc cacaattcga acgatactgg cagcgggcgt 2760 tcaccaaggt ttttcctttc accacatgga cgtcaagaca gccttcctgt acggtgagtt 2820 gaaggaaaac atctacatgg cggtcccgga tggagtcaaa gcaccgtcga acacagtatg 2880 ccaacttctc aagtctctct atggcttgaa acaatcgcca agatgctgga atgagaaatt 2940 caatgaacaa ttattgaaat tgggtttcac acgctcccgg catgattact gcctctatac 3000 taggattcag gatggtgacg aaatctatat cgttctgtat gtggatgacc tgctgatatg 3060 cggaaggaac ccaatgacca tcttgaagct gaaaaggcaa ttaaccacga tgtttgcgat 3120 gaccgattgc ggggaagtca gccatttcct tggcatgcgt ttggactacg atgttaggcg 3180 tggaagtcta cgtttaacgc aagaagctaa cgtagacaaa cttttggcgc gcttcgatat 3240 ggtggactgt aatccagtga agactccaat ggaaaaagga atccaattaa ctcgaaccgg 3300 ggaaccaacg gatcgtccgt acagagaact tctggggtcg ttgatgtacg tgatgttgtg 3360 caccaggccg gacatctgct attccgttgg attcttagga cgatttcaac agaagccaac 3420 cacacaacat tggcagagct tgaaaagggt ggtccggttc atcaaaggaa ccaaaactaa 3480 atgtttggag tacaagcgaa ccgaatctgc cgagccattg gttggttacg tcgatgcgga 3540 ctgggcttcg gacgtcgaag atcgtaagtc ggtgagtgga ttcattttcc tcgttttcgg 3600 taatctcgtc agctggtcaa gtaagaagca aactacggtg gctacttcat ccagcgaggc 3660 tgagtatatc gccctcagct ccgcggtttc cgaagcagtt tggctatctg gtatcctgga 3720 tgatctgaag ttgaagagct ccaacattcc agttacaatc tacgaggaca accaaggttg 3780 tattggaatg gcaaagaact gcgaatcaaa aaggtcgaag catatagatg taaggcatca 3840 ttttgttcgc gatcatattt ccaacggcag tgtgaagatt gaatatgtac gcaccgaaga 3900 tcaactagct gaccttttta ctaagccgat ggatgccagt cgattgggac aacttggacg 3960 tcgtctcgga atagcggatt gagaggggg 3989 // ID Chapaev-N2_AAe repbase; DNA; INV; 3829 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 2) XX DE A non-autonomous Chapaev DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; nonautonomous; KW Chapaev-N2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3829 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 833-833 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. TIRs are ~140 bp long and ~70% CC identical to those of Chapaev-1_AA. CC The region 1748-2031 is an inserted FEILAI_AA (~95% identical to CC consensus). XX SQ Sequence 3829 BP; 1205 A; 666 C; 767 G; 1191 T; 0 other; cacggtgtct ctggtagaaa acattaatat aaaaaaatca gtatcgctga aacacccacc 60 aaatgaaagc ttaggctttc ccctttcatt tgagaccggt ttgaaaatgt tctatcgggg 120 ggtcctgaac attttttttt attttagtaa gagatacgta ccataggtga tcttcttcca 180 ccaatagtat gatacaatga atcgcttgta ccatatgatg acaactcaat cgaaccgtag 240 aaggccatga caaatatttc tggacaaaca tttgaaaagg gctggagagg tcttgagcct 300 tttttcagaa acgttgctgt tgagcccttt ttagctcgcc catgcaaact aacaccacta 360 tcagaaagat tagtgttagc tcttcctgat agtagtattg gtgagcgtga tcgagttgaa 420 ttgggctcaa cagcgtctta caaagccatt gtttaccaat gtcgatgtgc cctttttaaa 480 tgtttgtcca gatttccata aaacaagctt gtttttgttt tggcagtgtt gctgttcagt 540 tgattataaa ttcagtaaat agtaaaattg tatattttta atcaaaattt atttatttat 600 acaaaactaa aatggttgta aatagtattt aaaattttaa ataaaaaaaa tgtttagaga 660 tttgaaagat tttcaataaa taacccttat ttagattcag ttgtatgcta ccgtcttaaa 720 attcaaatat attttcataa gatattgctt tttacgatta gtaataaaca aagcgtttga 780 gaaaacattt actcatggat cattacatta caaatggttt cacagattgc aacaaacaac 840 taaattggca tcaatggtag ggaaatcggc gctgcgttat aacgcttcat tgcttaagtt 900 gtcgcgttct aaggtggtat agcgaaaaat tggtgcttgt gtcgcgtgat aagtttataa 960 attcattgtt ttgggtcaac attttggttt gctgattaga ggtatgcctg ttaataatta 1020 caataaatat atcccaattt cacacctatt aataaatata tatatttaca ctacactctg 1080 tcaggagacc tgaaacatga aaccgaaaag ccacgaagac aatcgttcga agatttgcgt 1140 attgtgttta aagaatttgt gtctgcaacc gttatcagca atagtggttg atattattga 1200 acagcacatt cttcacgaca tcaggattaa agtctggtat tgtcctacaa aaatatgtcg 1260 agcttgcacg tttattcgtt cagtaaccaa tttaaacttt tgaatatagc actgtataaa 1320 tataaataca atttaaaacc taccactcga agcatgtaat gatgtgagaa caacaatctc 1380 accaaaacta aagggaaaac gtggtagaca cagtaatgca aaaagtaagt gccaattgaa 1440 ttggttggtt gtcgaatcta cacggtttac cacccaactg agaatctatt caacatctgt 1500 gcaacctaaa aaaaaaccca tttgttcgct gatgacttgt ttttgattaa aaaaaatgtg 1560 atttgagcac cagatgagcg ttgcagttga gcagctgtct aaggggtttg gatgttgcaa 1620 tcgccccaaa catcgctttc aaaatcagca gacttaacca atcagttcag gggtttttag 1680 aggtaacaca tctgcacttt gatttgaacg ataatgcaga aactgtaaat gcatataaat 1740 gcgttcaggg gccttcctta gccgagtggt tagagtccgc ggctacaaag caaagccatg 1800 cgaaggtgtc tgggttcgat tcccggtcgg tccagaatct tttcgtaaag gaaagttcct 1860 tgacttccct gggcatagag tatcatcgta cctgccacac gagatacgaa tgcgaaaatg 1920 gcaacattgg caaagaaagc tctcagttaa taactgtgga agtgctcata agaacactaa 1980 gctgagaaac aggctctgtc ccagtgagga cgttaatgcc aagaagaaga acgttcaggg 2040 gttagttgat tatgtgacgt ccaaaaaggg gggttggcct gggcagttcg atctgcaagc 2100 gatggagaag ggtcctttaa tcatttaccc tgtcgattat cgagcataat tccgacgtca 2160 tcgggactca aggccgatgt aaggatactg gcgtaaaaat actgttgctg cttgctgttg 2220 cttctggcgt tccagaaaaa tatggcaata ttgagaaaat ctggaagaca ctgttctgct 2280 atttgaaatc gtcaatgttt ttgcaaaatg gagtaatacc atagtcatca aatgatgatg 2340 tttgatcact tctgaatttg tagattgttt agaattcgaa ggaatacaag ttgttttgcc 2400 acattggaat cgtaaccgca taaggcgcat taggtttaaa tggcctagta atgcaatagc 2460 atgaaaccat atcagaatta ttttctgacg attcacctag ttgtgatatg agtgccaaaa 2520 ttcagctcgg ataacactcg attcggtaat tttagaaatt tgttcctgca tactgccata 2580 aaatgcacac ttggtatccc atttgaatcc tgttctgtaa tgctacaaat tgcaatttgg 2640 gcagtgttgc ttttgaataa tttaggtgct ccattgctgg agatttgaaa atcgtcaatc 2700 tgtttttcgt aataatggga ggtagttcaa catttccttg cccatactgt accacacaaa 2760 aaagatttat ccaatttagc tggtcatcca cgaatacttg tcacatacgc gtccgcgaaa 2820 tggaaaagtc tcggctcgga ccataaactg gcggagcagt tcttcaattg tatttactat 2880 ccactgctcc aaattgatcc aaataatgtc gaggaatggg taagcatcag cagtttgatt 2940 gtgtgattca taaagatgac cttaaaataa ttcttttatc acaggtgagg atcatcagtt 3000 gccaacgcca caacaaatgt ggattcacca gctgaaactg tcagaagctt ttgaaaaatc 3060 ggcatgtttt ggaagtgaaa tgggatccaa gtttgctagc aattgtaatt tatcgtactg 3120 ttctgtaaat tatctctttt caatgaaaaa atgtatttca ggcgcttgat cattctgatc 3180 aggtacataa aagttgtttt caaacgagtc tggatgatga ttttgcagaa aaggttgaga 3240 acttttcaaa aacgtggatt gcttacccaa tagctcgaag ttccatattt gcgctatcat 3300 atcccggagt tttgtgaaat cacaggaagt ggtttgagca aatacgggaa acaagccagt 3360 gaaaccgtgc atttcgattt tcaagttgct tggaatcatg ttaaggttcc ggagaactcc 3420 gaagcatatg atcgacagct tctttgagct gtagtaatgt acaatagtga gcatttgtaa 3480 gccgtattaa ccgtatttac tttatcgata tgtcgtgtat gtaattggtt taaatagaat 3540 ggtataatgc taaatctaaa tcctttttat taattggtta actattcgta gtaaaatttc 3600 tcatggctac ctagataagc atgaaagaaa atgactgtgt gaaccataat ttagaaatca 3660 gtactgaaaa aacggcattg tactgttttg aaaaaaatgt tcaggacccc ccgatagaac 3720 atttccaaac aggtctcaaa tgaaagggga aagcctaagc tttcatttgg tggatgtttt 3780 agcgatactg atttttttat attaatgttg ttctaccaga gacaccgtg 3829 // ID hAT-1_DP repbase; DNA; INV; 2699 BP. XX AC . XX DT 24-MAR-2005 (Rel. 10.02, Created) DT 19-MAY-2005 (Rel. 10.02, Last updated, Version 2) XX DE hAT-1_DP is a family of hAT DNA transposons - a consensus DE sequence. XX KW hAT; DNA transposon; Transposable Element; Interspersed repeat; KW Autonomous DNA transposon; hAT-1N_DP; hAT-1_DP. XX NM hAT-1_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-2699 RA Kapitonov V.V. and Jurka J.; RT "hAT-1_DP, a family of autonomous hAT transposons from Drosophila RT pseudoobscura."; RL Repbase Reports 5(2), 44-44 (2005). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 417..2204 FT /product="hAT-1_DPp" FT /translation="MSLIRKYSEIWNHFEEVGSQEAKCKYCKSILSCKSQS FT NLSRHLKSKHPASMEPVIRQNTDGPIFVQSAQPKITSFAERPVPASKAQQI FT DLQLVRMIAKGHHALRLVEEPEFKKFIHDVSHAPNYKLPTRKTLTSSLIPK FT IRDEYTGTIIEELRAATAVCLTTDGWTSITNESYLAVTVHFIDEESTMLTS FT YTIACQAFEASHTAANLCSFLEKIVNEWDLKNKVAAIASDNAHNIALAIRT FT GNWSHIRCFAHTLNLIVQKALDKMSSVRRKAKAISEYFHRSSSGLKKLKDM FT QALLKLADLKLTQDVPTRWNSTYKMFERLSILKEAVVAALSTRTDLILSPE FT DWDVIDGVLPVLKPFYEVTEEISSAEKNVTLSKIIALTGLLQRKMAQIYPT FT VKNNLVAEVINEIINEMDGRFNDFEANILYAESTVLDPRFKGRAFKSAEAF FT KKSVADINKKLAQTIRSLPEPPQEAISNKKQEEDTIWAEFDTTFQQVSQPT FT NNTAASIREMDKYLAEEYISRKDDPLVWWNQRKAQYPLLYTYMLKRLCLVA FT TSVPCERIFSSAGETIRKRRSLLKSTTVENLIFLHNNMQIGSEYNLICT" XX SQ Sequence 2699 BP; 838 A; 520 C; 547 G; 793 T; 1 other; tagtgttgtg ttgctcatga gtgagtgaac aaaaaagagt tgttcactta agtgagcaac 60 tgaacatgtt ccttccaagt tgttcttttt tgttcacttg ttctttgttg ttcacttgtt 120 cttttttgtt cacttgttct tttttgttca cttgctcttt tttgttcact tgttctttgt 180 tgttcacttg ttcttttttg ttcacttgtt ctttgttgtt cacttgttct tttttgctca 240 cttgttcttt gttgttcact tgttcatttt tactcacttg ttcattttta ctcacttttt 300 gcgcaatttt gctcctcaaa gggcattttg cgatctggta gctttgctca tatttgcgtt 360 tacttcacac tttttatatt acgggcaaag ctgtttacaa ttttgaaaaa ttcactatgt 420 cgctaattcg gaagtatagt gagatttgga accatttcga ggaagttgga agccaggaag 480 ccaagtgcaa gtattgcaaa tcaattttga gttgcaaatc tcaaagcaac ttaagccgcc 540 atttaaagag caagcacccg gcgtctatgg agcccgttat ccggcaaaac accgatggcc 600 cgatttttgt gcaaagtgcg caaccaaaaa taacaagctt tgctgaaagg cctgtaccag 660 cgagcaaagc gcagcagatt gatctgcaac ttgtacggat gatcgccaag ggtcatcacg 720 ccttgcgctt ggtcgaagaa ccagaattta aaaaatttat tcacgatgtg tcgcacgccc 780 caaactataa gcttccgacg aggaaaacat taaccagctc cttgattccc aaaattcgtg 840 acgagtacac tgggacgata atagaagaac tacgtgcggc cacagcggtg tgcttaacta 900 cagacggttg gacatctata accaacgaaa gttatctggc tgtcacagta cattttatcg 960 atgaggaatc aacgatgctg acctcctata caattgcttg tcaggccttt gaagcgtcgc 1020 acactgctgc gaatctgtgc agctttttgg aaaaaattgt gaatgaatgg gatctcaaaa 1080 acaaagtcgc agcaatagca tccgataatg cacacaacat tgctcttgct attagaaccg 1140 gcaattggag tcatatacgg tgttttgctc ataccctaaa tctaattgtc caaaaagctc 1200 tagataaaat gagcagtgtg cgcagaaaag ctaaagctat aagcgaatat tttcatcgca 1260 gctcatcagg cttaaaaaag cttaaagata tgcaagcgct gcttaagttg gctgacctca 1320 aactaacaca ggatgtgcca acacgttgga actcaacgta caaaatgttt gaacggctgt 1380 ctatcctgaa ggaggcagtc gtggcagctc tgtcaacaag gacagattta attttgtcac 1440 cagaagactg ggatgtcatt gatggagtgc ttcccgtatt aaagccattt tatgaagtta 1500 cagaagagat ctcttcagct gaaaaaaacg taacattgtc aaaaataatt gctttgactg 1560 gattacttca aagaaaaatg gctcaaatat atccaacagt taaaaataac ctagttgccg 1620 aagtaatcaa tgaaatcata aacgagatgg acggcaggtt taacgatttc gaggctaata 1680 ttttgtatgc ggagagcact gttctggatc ccaggttcaa aggacgcgca tttaaatctg 1740 cggaggcctt taaaaagtca gtagcggata tcaacaaaaa attggctcaa acgatacggt 1800 cactccctga acctccacaa gaagctataa gcaacaaaaa acaagaagag gacacaatat 1860 gggctgaatt tgacaccact tttcagcaag taagccaacc caccaacaat acggcagcat 1920 ccataagaga aatggataag tacctggctg aggagtacat cagccgaaaa gatgatccat 1980 tggtatggtg gaatcagcgg aaagcgcaat acccgctact ttatacctat atgctgaagc 2040 gcctctgctt agtcgccacc tcagtgccat gcgagcgcat attttcaagt gcaggcgaaa 2100 ccatacgtaa aaggcgatcc cttttaaagt caacaaccgt tgaaaattta atttttttgc 2160 ataacaacat gcaaatcgga agtgagtaca acctaatttg tacttgattg cttgtttgct 2220 tgattaacat ttttttctct tatagcgcta aaatgaagaa aacgaaaagt ataagaattc 2280 ctatgaaagg atctccttat ttttatgtta gtataagaat tcctatgaaa ggaaataaaa 2340 agtttgtttt tttgttatat acattttttt tttgttttta gttataaatc tttgatgaat 2400 attgaaatgt aacgtatttt atttgtktaa aagtctgtgg tgcttagggt tctgtgcatg 2460 aatagtgaac aactgagcaa gtcagcaact gagcaagtga acaactgagc aagtgaacaa 2520 ctgagcaagt gaacaagtga gtaactgagc aagtgaagaa ctgagcaagt gaacaagtga 2580 gtaactgagc aatctaagag agcaagtgag caagtgagca atatgagtga gttgactcac 2640 ttactaaaaa agaacaagtg aacatgttca ccaaaatgag caatgtttac ccaacacta 2699 // ID Gypsy-114_AA-I repbase; DNA; INV; 5086 BP. XX AC AAGE02027028; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-114_AA_; KW Gypsy-114_AA-LTR; Gypsy-114_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5086 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027028; Positions 68539 73624. XX CC Positions [4106-4573] - Integrase core CC 'ACTTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 877..2244 FT /product="Gypsy-114_AA-I_1p" FT /translation="MRAKLLHFGGKQLQRVYENLPDADKLPLVSTKSNWYD FT HAIAKLDEYFEPGRQYILERCRLRKMRQEKNERFAHFVLRIRQQLGDCGLE FT KYPADVREVLTEIYVIDVIVEGCASEELRRRILQKDITLAEVESMGAMMEG FT VEQQVNDFTCLNGNEQSREQQAEKVFRVQKRGPMKQPPVVPNRWPNRRMPQ FT SREIKCFNCGTPGHIAMSVECKARNQTCRRCKRVGHFEAVCRKRFAPADVP FT LHQKTKKVRLVEKIDQKPNDLPSTDSKLDSKSDTNGKSYYCFHFGNDTNIL FT ECKIGGVLLDVLVDSGSDVNLIHSTAWETLKQQRVVVYEMQKGGDEVIKGF FT GSKTPLNILGSFKAKIEIGNKSECAKFFVVKEGQRCILGDTTAKALGVLKI FT GVEVNQVNNVPFGKIKDVQVQIHMDPTFKPVFQPVRRVPLPYESAVNQKLD FT QLLDRDIIEVK" FT CDS 3125..5077 FT /product="Gypsy-114_AA-I_2p" FT /translation="MNKFINNLATLDEPLRRLTEQSVKFEWTSEHQASFEA FT IKNALSHSVSLGYFNVEHATSVIVDASPNALGAVLVQTDNFGEHRVICYAS FT KSLTKTEKRYCQTEKEALAAVWGIERFQMYLLGKKFDLITDCKALLYLFTS FT RSKPCARIERWVLRLQAFDYSIKHIAGEKNVADVLSRLATLVPTPFDYSEE FT LFVNEIANAAATNAAIRWEELDAICNEDDEIQELLRKLEEGRLFELPAEYR FT LIAQELCQVGNVLMRGDRIVVPKCLREKVLVLAHDGHPGTRMMKCHLRASV FT WWPKIDTDVDTFVKKCRGCMLVSTPDPPEPMSRRDLPTGPWQDVAIDFLGP FT LPEGQFLLVVVDYYSRYFEICEMTSITAESTIDELRTIFSRFGVPTTLTAD FT NAPQLSEDCEQFAEFCRSYGIKLINTIPYWPQMNGEVERQNRTILKRLQIA FT QELGQDWRIELQKFLLTYRASNHSTTGRSPAEMLFGRKIRTKLPQLSNSRY FT DDEDVRDRDAMQKEKGKEYSDNKRRAKVRDLVVGDRVLLKRMKKDNKLSTE FT YMNEEFIILRKCGSDLIVKSLVTGKEFRRNTAHVKKCEILNDAESSTGAMD FT TEQDASETNTTSPNGGSSSNLDAQQSLADQGGKRKRKEPMWFEDYMPHFIK FT KC" XX SQ Sequence 5086 BP; 1591 A; 973 C; 1222 G; 1300 T; 0 other; tctggcgacg aggataatgg aaaaactagt aagtacagcg aaaactgcaa ttggaagcat 60 attttatcac attgttttat aaggccacgc gagtcaactt tgacggggcg tagtcccgga 120 ctaggaaaaa aaagaaagaa aaatgaaggt taggttatag aacaggtgtc gcatcagtta 180 caactcacaa ggactgtttt gtatcttgaa tctggaatct tggacacgca caaatagaat 240 gaagatttaa tatattgtct ctgaggagaa tacaagaaat gaagttatgg gtacaatttt 300 atggaacttt gaggagttaa agaaaaaagt tctaaggaac tgctgttgag attactatac 360 ttaaaagatt ggttccgaac tctgaggaga ttctgaggaa cttctgtcga gaaagttaaa 420 tcaaagaata taagaaataa ttatgctgga gagcaaaagt ataactctga ggagttttgg 480 aggttatcac tttgaaaagt gtgtgagatt aggctctgag gagttttgaa ggttataact 540 ttgagaaata tgtgagatta agctctgagg agtttcttgt taactttctt gtgcgtgttc 600 cgttagatga aaatattgcg gtatatcaga gcaggcagtg atattgtcac ttttatctgc 660 aatacatgta acactacaac gagataataa tttggttttt gtgtaaaaag aactaaattg 720 atcaaaatta taaatttaca gatcgacgag tctcgtccga ttcctccatt ccgatgtgag 780 gagattggaa agaaccaatt ggccaaagag tggaagtgtt ggagatcggc tctcgagatc 840 tatttcgagg cttatgacgt ttcggaccag aaaaagatga gagcgaagtt gcttcatttc 900 ggtggcaagc aactgcaacg tgtttacgag aatctacccg acgctgataa gctacctctg 960 gtatcgacca aatcgaattg gtacgaccat gcgattgcca agctagacga atacttcgag 1020 ccagggcgcc agtatattct cgaacgttgt cgattgcgca aaatgcgtca agaaaaaaac 1080 gaacgattcg ctcattttgt cttacgcatc agacaacagc tcggagactg cggtttagaa 1140 aagtacccag ctgatgttcg cgaagtacta actgaaatct acgtaattga cgtgatcgta 1200 gaaggatgtg cttcggaaga gcttcgtcgt cgaattctgc agaaggacat cacgttggcg 1260 gaagtcgaaa gcatgggtgc catgatggaa ggagtggaac agcaggtcaa tgactttaca 1320 tgtctcaatg gtaacgagca gtcgagagaa caacaggcag aaaaggtgtt cagagttcaa 1380 aagcgaggac ccatgaaaca gccaccagta gttcctaacc gatggcccaa tcgtcgaatg 1440 cctcaaagcc gagaaattaa atgctttaac tgtggcacgc cagggcacat cgctatgtct 1500 gtggaatgca aagctcgaaa ccaaacttgt cgccggtgca agcgtgtcgg acattttgag 1560 gctgtgtgca gaaaacgctt tgcacctgcg gacgtgccat tgcatcagaa gacgaaaaag 1620 gtacgtctag ttgagaaaat cgatcaaaag ccgaacgacc tcccaagcac cgatagcaaa 1680 ctggactcta aatcggatac aaacggaaaa tcttactact gtttccattt cggaaacgac 1740 accaacattc tggaatgcaa aatcggtggt gttctgttag atgtgctcgt ggactcagga 1800 tccgacgtta atttgatcca ctcgacggct tgggaaacct taaagcaaca acgagttgtg 1860 gtttacgaaa tgcaaaaagg aggcgacgaa gttatcaaag ggtttggaag caaaacacct 1920 ttgaatattt tgggatcatt caaagctaag atcgaaatcg gaaataaaag tgaatgcgcc 1980 aagtttttcg tcgtaaagga aggtcaacgt tgtatcttag gtgatacaac cgcaaaagct 2040 cttggagttt tgaaaattgg tgtcgaagtt aatcaagtga ataatgttcc atttggcaaa 2100 ataaaggatg tacaggtaca aattcacatg gatccgacgt tcaaacctgt atttcagccg 2160 gtgcgacgtg tcccactgcc ctacgagtct gctgtgaacc aaaaattgga tcaacttctg 2220 gatcgagaca tcattgaggt gaaataattt cgtcaaatga tattcttttg ttaccatatc 2280 agtctgaata atacatgata tttactcacc tatttttgaa taaataatta ttgattcgaa 2340 tagggtaaca ctgaaatata caataagcta aatgttcttc agtaggtagt agcacacatc 2400 cttacgttga atttgttgga aacactattt taggtcaaaa caggacctac aagttgggta 2460 tcgccgctag tcgtggtagg caagtcgaac ggtgagccac gtgtatgctt agaccttaga 2520 cgcgtgaacg aggcagtttt acgagaacga tatcccatgc cggtagtcga cgagttacta 2580 gcacgcatcg gtaagggcgc aataagaagt agacttgata ttcgtgatgc atttttacaa 2640 acggagcttg cccctgagtc acgtgacgtg acgacattca tcaccagccg aggtctcttt 2700 agattcaaga gacttccatt tgggctggtt tcggctcctg agattttcca gaaggtgatg 2760 gaagaaatct tggttggatg tgagggtaag tcatgttgag ttgttactta ttaaaaaaaa 2820 aataaaataa aataaacatt gattaatagg gacggtatgc tacctggacg atatatacgt 2880 tgaaggcgaa aacttagagc aacataatgt tcgcttaaac gcagtatacg aacgactgcg 2940 agaccgtggg gtggttttga atcaagagaa gtgcgtgatc ggtgttccag aagttcaatt 3000 cattggtcat gtaatttccc ccaatggtat acgtccttca ccatccaaag tcgaagccct 3060 cgtatccttt aggcgtccgg aaaatgcgtc cgaagtgaaa agtttcctag ggcttgcaaa 3120 ttatatgaat aaatttatca ataatttagc aaccctagac gaaccattgc gaagactaac 3180 agaacaatct gtaaaatttg aatggactag cgaacaccag gcatcttttg aagccatcaa 3240 aaatgcactt tcacactccg tcagtcttgg atattttaat gtggaacatg caacgtctgt 3300 catcgtagac gccagtccta atgcattagg agcggtgcta gtgcagactg ataattttgg 3360 ggaacaccgt gtcatatgct atgcatctaa atcgctgacc aaaactgaga aacggtactg 3420 ccaaacggag aaagaggcgt tggcggccgt ttggggcatc gaaagattcc agatgtacct 3480 tctcggaaaa aagtttgact taattaccga ctgcaaagcg ttgctatatc tattcacctc 3540 acgctcaaaa ccatgcgcta gaatagaaag gtgggtgctt cgcctgcaag cctttgatta 3600 ttcgatcaaa catatcgctg gagaaaagaa cgtagctgac gttctgtcac gtctggcgac 3660 tctggtacca accccgttcg attattccga agagcttttc gttaacgaga ttgccaatgc 3720 agccgccacc aacgcggcta ttagatggga agagcttgat gccatttgta atgaagacga 3780 tgaaattcag gaactgctac gaaaactaga agagggtcgt ttgtttgagc ttccagcaga 3840 gtatcgtctt attgcacagg agctttgcca ggttggaaac gttttgatgc ggggcgacag 3900 aattgttgtt ccaaaatgct tgcgtgaaaa ggtactagtt ttggcacacg atggccatcc 3960 tggaacacgc atgatgaaat gccatcttcg tgcttccgtt tggtggccca aaatcgacac 4020 tgatgttgac acatttgtta agaaatgtcg gggatgcatg ctggtttcca cccccgatcc 4080 acccgaaccg atgagccgcc gagacttacc aactggacct tggcaggatg ttgcgataga 4140 ctttttaggt ccactcccag aaggacagtt tctactggta gtcgtggact actacagtcg 4200 atactttgaa atatgtgaga tgactagtat cacagcggaa agtacgattg acgagttgag 4260 aactatattt tctcgtttcg gcgtgccaac aacactaacg gcagataatg caccacaact 4320 gagcgaagac tgcgaacaat ttgcagaatt ttgccgcagc tacggaatca agcttatcaa 4380 caccattccg tactggcctc aaatgaacgg ggaagttgag cgtcaaaacc gaaccatcct 4440 caaacgatta caaatagcac aagagctggg acaagattgg cggattgaac tccagaaatt 4500 cctgctcaca tatcgcgctt caaaccactc cacaactggt cgctcgcctg ctgaaatgtt 4560 gtttggaagg aagattcgaa caaaacttcc gcagctgtca aactcccgtt atgatgatga 4620 agacgttcga gaccgtgatg cgatgcaaaa ggaaaaaggg aaagaataca gtgacaacaa 4680 aaggcgggct aaggtgaggg accttgttgt tggggatagg gtgcttctta aaaggatgaa 4740 aaaagataat aaactgagta cagagtatat gaacgaggag ttcattatat tgcgaaaatg 4800 tggatcggat ttgatagtta agtcgctagt tacgggaaaa gaattcagaa gaaacacggc 4860 acatgtcaag aagtgtgaga tattaaatga tgctgaatct tcgactggcg ctatggatac 4920 agagcaagat gcatcagaaa caaatactac atctccgaat ggtggttctt catcaaatct 4980 ggatgcacag caatcactgg cggaccaagg aggaaagcgg aagcgtaaag aaccaatgtg 5040 gtttgaagat tatatgccac attttattaa gaagtgttaa ggggga 5086 // ID BEL-12_DWil-LTR repbase; DNA; INV; 353 BP. XX AC scaffold_181136; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_DWil_; KW BEL-12_DWil-I; BEL-12_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-353 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181136; Positions 1748944 1749296. XX SQ Sequence 353 BP; 104 A; 58 C; 83 G; 108 T; 0 other; tgtcgcggga cgggaattga tcaccgaata attagcagcg gggtgacagt caccgtctag 60 ctatagctgt tagtatctag tattttatat ttaagttttt tggcgcttga tgaatatggc 120 catactgtga aatatgaaat tcaagtttag cttccgcatg ttctttctct cttgttgtgt 180 aagccgagaa ggacacactg cggtaagaat aggtcctaaa tgtacttgca catatctctt 240 aatattaatc tttacagaat aaaacatcca atgagttaga tttgaacgag atgcgtttga 300 atttaaacag cggttgttaa aggcaacggt cggcagaggt agagagagcg aca 353 // ID Gypsy-169_AA-I repbase; DNA; INV; 6053 BP. XX AC supercont1.339; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-169_AA_; KW Gypsy-169_AA-LTR; Gypsy-169_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6053 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.339; Positions 134932 140984. XX CC Positions [3347-3775] - Reverse transcriptase CC Positions [5045-5506] - Integrase core CC 'CTCGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2753..5935 FT /product="Gypsy-169_AA-I_1p" FT /translation="MVRNVEMFNVDEVKEEKLPVYNIADEENELIKCRIGG FT VEIEMLIDSGSTYNLIDDTTWNLLKLKDAKFAAERHDNSKRFLAYGRVPLH FT LLKVFDAVLDVRDGNETISTEATFYVIEGGQQSLLGKNTARKLGLLQVGLP FT STFGKNVNQVEATKDFPKIKDFQLIQPIDHSVPPVIQPLRRCPIPILGRVK FT SKLDELLQMGIIERVTKPASWVSPLVPILKDNGELRLCIDMRRANQAIQRL FT NHPLPVFEDMLSRFSGAKYFTTLDIKQAFHQVELAEDSRDITTFITNWGLY FT RYTRLLFGVNYAPEFFQSLMESILAECPYTVVFIDDILIWGLTEEEHDKAV FT KHTLDVLHRHGLSLNVNKCKFKQAEVRFHGHMLSANGVLPSDDKMKSLLEC FT RSPQNKEELRSFLGLVTYVSRFIPNLATESQPLRQLLKNSNNFQWLQQHKN FT CFEKLKQRIANLKHLGYYDPKDETILVTDASGVGLGAVLVQLRDNQPRVIS FT YASRSLSEAEQRYPPIEKEALGIVWGIERFKIYLIGISFTLETDHRPLEVL FT FTPNSRPTARIERWMLRLQAFRFRVIYRKGSSNIADTLSRLASHVEDHSWQ FT EDSDVYSRRSIATTLAELSEASEESDYDADIESTIRSIQETAAIDISEVTE FT ATMNDRELQAVKEAIMSNNWSSLQLKPYSAFRTEFSFANDLVLRGSKLVIP FT SVLRGRMLMLAHEGHPGQSCMKRRLRDRCWWPNMDQETVKVCEKCEGCRLV FT QVPDPPEPMQRRPLPDRPWVDIAIDFLGPLPSGEYILVVVDYFSRHVDLEV FT MTSITAKETIKRLDKIFRLWGIPRTITLDNAKQFVATEFEEYCRIKGIFLN FT HTSPYWPQANGEVERQNRSLLKRLKIANALYGSWRTEMDRYLEMYNNTPHS FT TTGKTPNELLQNRKIRSKLPDLADISTAVPSTDYADRDKIQKFAGKEREDA FT HRRATLSNITAGDVVLMRNLLQTNKLSTNFLKEKFLVVERNGSNVRVQSLE FT TGKSYERNVSHLKKISESAETTNEVAIPESDHSNSAAPRRPERSRQVPNRY FT DSGK" XX SQ Sequence 6053 BP; 1964 A; 1055 C; 1360 G; 1674 T; 0 other; agttggcgac gagggtaaaa gaaaaaaaaa atatcggagt gaaagcaagg caaagaatat 60 catttattga atatattgat aatctgtgtt tccaggtaaa ataatcgaaa taaaatttca 120 aagaatactg tgaaattgaa caaagaacgg aaagttttca tcacccacga atgtgacgcc 180 aattatgaat gtgaagaata aggctttgtg taaaaagtga aaaacggggt acgaatatgt 240 gatgccaatt acgcatgtga aaaaaaaatg aagccttgtg tagaaacaaa tatggagaaa 300 cgtgattatt gataatgaat gtttggggta cgaatatgtg attgcataaa aagtgtggta 360 atgtgaaaat gatcgaaaaa tggtagagtg ttgaatgcta ttgtatgatt atatactgtg 420 aatttttgat catttctgtt ggtaagtgag cgaatggagg ttgactggta tcttcaattc 480 aaggtatgtt gcccatatta atttgaaaag aatagaatgt atatacataa ttggaatgtc 540 tcattttcgt tgatgatttg tttgcagtac tttctttcgt actggcgtac aaaatggcgt 600 tgcagtgacc atcactaatc attcaaccaa gattagaggt ggcacaggag tatttgtgac 660 cgaaaatcac actactctgt atgcaagaga aattattgct tattcaaacg cagccgccca 720 tacaatcact tatctaagca gtgcgttctg taagtacgta tcaacgcgat agtagctata 780 acgaaaaaag aagcacgcga atatgcaagc gtgctgaatc tatattccac actaacacgt 840 gttcgcacat cgatataacc acgggaaaaa aattagcggt ttcgctttgc ggcgatgact 900 gcgcgtaggg tggagataac atgcagttgc gctcactgcc gttgggagaa aaaaaaaacc 960 cgaagcgaag agcgaaagag agaaagaaac agttaatctc gttctccgaa gaccgtacac 1020 tgctgagttg agcgaggtgt ctccagataa tgctccattc cgagtgtgga tgtgattaaa 1080 ttcactagtt tatattggca taggcaaatg agaggaatcc aataagaatg aaatgcaata 1140 ctggttgtaa gaaggacgtg attgaattca ctgaatgaaa tgaagttatg agagaaaaag 1200 agacggcttg atcttgaata accaagggtt gttgttagta tgaactgtga tgatgaatga 1260 ttgccgtagg gctgtatgga ctccaatccc taacagctgg gatgtatgta tgtatgtatg 1320 tatgtttgat tttctttttc gaatgctgct ctattttttt tttagctgta tattagggtg 1380 attggttatg agtgtatgtc aactttgtct gtttttccaa cggaataatt atatactgtg 1440 attgaattca ctggaacttg aatggaaaac tatggaaaac gtgatttaat tcactattga 1500 atatacggaa aaatagaatc gatgaatgct tttgtgcaat gatgagtcaa caaggagctt 1560 agaaactgaa agaaatgtga tttgattcac tgttgtatga atgagcttgt aaataaaaga 1620 agaatgaaat gtgaaaataa aatgtgattt aattcactga ttagctagac cataacaggc 1680 aatgtcaggc tgggctttga tgaaaatgat ttgatgattt ccattacagt tttgttagaa 1740 tgatagagct attatgtttt gttcaatagg aaaatggatt cgtctgcagc agaaataatt 1800 caagaagtac aggtgactcc agtcttcaac ccggcttcgt acaatctacc cccttttaag 1860 tattctcatc ttccgccatc tgagataaga aatgcttgga ttggatggat acgttggttt 1920 gaaaacgtga tgactgccac aaacattaat gatgggtatt gccgcaaagc tcaaatgctt 1980 gcaatgggag gcttggagtt acaaaacgta ttctacggta taccaggagc tgacgaaatc 2040 gcagttgacg accctgactt caatccatat gaaaaggcca aacagaaatt gacagaacac 2100 ttctctccaa agcaacacga aagttttgaa cgattccagt tttggtcgat gtctatgtgc 2160 gatgatgaac caattgagaa atttcttttg agggtacagc aaaaagccga gaaatgtttc 2220 tttggccata ccgaacagca atgtcgacaa atcgcaatta tggacaagat tatccagaac 2280 tcatcggaag atttgagaca gaaattatta gaaaaggaac gattaaatct tgatgatgct 2340 acgaaaataa ttaatgcgca tcaatccata aaacagcaag catcattgat gaattctggt 2400 tctaaggtgg atattaatcg tcttctggtg gataagcgaa aaattccata ttataagaat 2460 agtggaatat tagcgcataa agcaagatgt acacgttgcg gtagatttca acatcgagga 2520 agcgaaaaat gtcctgctat ggataaaact tgtcatcgtt gtcacatgaa aggacatttt 2580 cagtcaatgt gtaaatctaa ggatggacga acagtaagag attatttttt ttccgaatag 2640 gtgttatttc attattatgt aacatttttc ttcgtttatt tgtaaggtga caaaacggaa 2700 tcattcgcct tataggtttg gaagtgaatc tagctattcg aagcgaggga gaatggtgag 2760 aaatgttgag atgtttaacg ttgacgaggt taaggaggaa aagttgccag tctacaatat 2820 cgctgatgaa gaaaatgaac ttatcaaatg tcgcatagga ggagtagaaa tagaaatgct 2880 tatagattct ggttcaacgt acaacctaat cgatgataca acatggaatc tgttgaaatt 2940 gaaagatgct aagtttgcag ctgagagaca tgacaattcc aagcggtttc tcgcatatgg 3000 gcgtgtaccc ctgcaccttc taaaagtgtt cgatgcggtc ctcgatgtta gagatggaaa 3060 cgaaacaatt tcaacagagg caacattcta cgtcattgaa ggcggtcaac agtcgttgct 3120 agggaaaaat accgcaagaa agctaggtct tttgcaagtt ggtctgccaa gcacgtttgg 3180 gaagaatgtc aatcaagtgg aagcaacgaa agattttcca aaaatcaaag attttcagtt 3240 aattcaacct atcgatcact cagttcctcc cgtgattcag cctcttcgtc gttgtccgat 3300 tcccattctt ggtagagtga agtcgaagct agatgaattg ttacaaatgg gaattatcga 3360 acgtgttaca aaaccagctt catgggtgtc tcccctagta ccgattttga aagataatgg 3420 tgaattgcga ttgtgcattg acatgaggcg agcaaatcaa gctatccaac gattgaacca 3480 tcctttgccc gttttcgaag atatgctatc gagattcagt ggagcaaaat actttacgac 3540 attggacatc aaacaggcgt tccaccaggt tgaactagcg gaagatagta gggacatcac 3600 tacgttcata acgaactggg gactataccg ctacactcgt ctattattcg gtgttaatta 3660 cgccccagaa tttttccaga gcctaatgga aagcatatta gcagagtgtc cctatacagt 3720 ggttttcatt gacgacattt taatatgggg tttaacagag gaggaacatg acaaagcggt 3780 aaagcatacg cttgatgtgc tacacagaca tggcttatca ctcaacgtca ataaatgcaa 3840 gtttaagcaa gcagaagttc gctttcatgg acatatgttg tctgcgaacg gcgtgcttcc 3900 ttctgatgat aagatgaagt ctctgttaga atgccgatca ccacagaaca aggaagagct 3960 acgcagtttt cttgggcttg tcacctacgt ttcccgcttc attcccaact tggcaactga 4020 aagtcaacct cttcgacagc ttttgaaaaa ttcgaataat tttcaatggc tacagcaaca 4080 taagaactgt ttcgaaaaac tcaagcaacg gattgcaaat ttgaaacatc taggatatta 4140 cgatccaaag gacgaaacaa ttctagtgac ggatgcttca ggagtaggtc tcggggcagt 4200 tttggtccag ttgagagaca atcagccacg agtaatcagc tatgcatcgc gtagtttatc 4260 agaagcagag caacggtatc ctccgataga gaaagaggcc ctaggaatcg tttggggtat 4320 tgaaaggttc aagatctacc ttatcggaat atcctttact ttagaaactg atcatcgacc 4380 gctggaggtt ttgttcacgc caaattcacg tccaacagct agaattgaac gctggatgct 4440 gcgattgcaa gcattcaggt tcagagtcat atataggaaa ggctcgtcaa atatcgcaga 4500 tactttgtct cggttagcgt ctcatgtaga agatcattca tggcaggaag actcagacgt 4560 atacagtaga cggtcgattg ctacaacgtt agctgaacta tccgaagcat cagaagagtc 4620 agattatgac gcagacatcg agagtacgat tcgatcaatc caagagacgg cagcaattga 4680 catttccgag gtgactgagg ccactatgaa tgatagagaa ctacaggcag ttaaagaggc 4740 tattatgtcc aataattggt catccttgca attgaaacca tattcagcgt ttcgtaccga 4800 attttcgttt gctaatgatc tagttcttcg agggtcgaag ttagtcattc catcggtttt 4860 gagaggccga atgctaatgc ttgctcatga agggcaccca ggtcaatctt gtatgaaaag 4920 gcgtttacga gatagatgtt ggtggccaaa tatggaccaa gaaacggtga aagtgtgcga 4980 aaaatgtgaa ggatgtcgat tggtacaggt tccggatccc ccagaaccaa tgcagcgcag 5040 accgttaccg gatagaccgt gggtagatat agctatcgat tttcttggcc cactaccatc 5100 tggagagtac attctggtag tggtcgatta ttttagcaga catgttgatc ttgaggttat 5160 gacgagcatc actgctaagg aaactataaa aaggttagac aaaattttcc gcctttgggg 5220 tattccgcgt actattactt tagataacgc aaaacagttc gtcgctaccg aatttgaaga 5280 atactgccgg ataaaaggta tctttttaaa tcatacgtca ccatactggc ctcaagctaa 5340 tggtgaagtt gaacgccaaa accgttctct tctgaagaga ttgaagatag ctaacgcgtt 5400 gtatggttca tggcgtacag aaatggatcg atacttggag atgtataaca atacgcctca 5460 ctcgactacc ggcaagacgc caaacgagct tcttcaaaat cgcaagatac gttcaaaact 5520 cccagatctg gctgacattt caacagcagt tccttcgact gattacgctg atcgtgataa 5580 aattcaaaaa tttgcgggaa aagaacgaga ggatgcacat cgtagagcaa cgctaagtaa 5640 catcacggct ggcgatgttg tacttatgcg aaatcttctt caaacaaata aactctcgac 5700 aaacttcctc aaggagaagt ttctagttgt agaacgcaat ggttctaacg tgagagttca 5760 atcgttggaa acaggaaaat catacgaaag aaatgtttca cacctgaaga agatttcgga 5820 atcagcagag acaactaatg aagtcgcgat tcctgaaagt gatcattcta atagtgcagc 5880 accccgacga ccagaaagat cgcggcaagt acccaacaga tatgattccg gtaaataata 5940 ggattaaaac atttatttga ttatcttgtt taactctgaa gagtaaggaa ctataataaa 6000 tatgctatga ctcggtgata agtaataatg tctttgtttt ataaaaaggg gga 6053 // ID ISL2EU-1_HM repbase; DNA; INV; 4196 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A family of autonomous ISL2EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4196 RA Jurka J.; RT "ISL2EU-type transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2058-2058 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 875..1819 FT /product="ISL2EU-1_HM_1p" FT /translation="MQYRNELIEKSNEIIELKKKNARLDHNKYISYQNLTD FT EQMNFFSGLSRTTFMWLFERVKNYIKKVHSRLLLEDHLLIVLIKLKLGLLN FT KDIAFRFRLSPAVVSKIFRTIIPIFSARVVNLIVWPDRGIIRSNLPICFKK FT KFKDCVCIIDCTEIFIERPKNLTSRSQTWSNYKHNNTIKYLIGITPAGAVS FT FLSPGWGGRVSDKQITFESNFCQKLYPGDCVLADRGFNIKDELNAVGATLK FT VPAFTKGRQQLSGWEVDTSRQISNVRIHVERVIGQIKKFRIIHMTVPITQV FT DLLDDIMIIICALVNLNMSVVT*" XX SQ Sequence 4196 BP; 1441 A; 585 C; 650 G; 1515 T; 5 other; gatgagtttc aaatgtttac attctcgttt tcaaaaaaag atgcgcgcgc aaacttgtag 60 gtagaaaaac aaataatcgg aaattttcma gtaaaaacat tgagacgttt gtatgttawa 120 tttattcact gattcttatt tactcactga ttcaccgatt taacaagaaa taaattatgc 180 caaccagttg ttgtgcagtt ggttgttcaa acaaatttaa aaaagaatgt ggattaaaat 240 tttataaatt tccatcagga aaaacgcctt ttgagaaaag aaggcgaata gactggatta 300 aagctattaa tcgtaaggat tgggacagtt gggaaagtga aaarattagt aaagaacgag 360 tatgcagtgc acattttgta tcaggtatgt tcttttaatg aatgakgtta gctaataaaa 420 ctatatacgt agtaatacgt ttaataagat tctttaaatt agtaatttat aatatttggt 480 acatttatta attcaattaa atattatatt ttattataat atttaattaa attaatattt 540 aattaaatat tataattgtt atattttagg gaaacgttct gatgacccat ctgacgttga 600 ctggagtcca tctgtattta atcaccttgg agatgctacg atttctagtt tgattaaaaa 660 aaggaagaga aaagattgct ataaagcttt atgtaaaaaa cgatctttca aaaaatcaat 720 actgccatct agccaatcaa atgatataga gcttgataac tataaagata atataaataa 780 tgacataaat ccaccaaatg acaagtcagc tatttctaaa ttttgtcagc atgatactta 840 taaagtagaa gtggaaactc ttaactttca gttaatgcaa tacagaaatg aattgatcga 900 gaaatctaat gaaattattg aacttaaaaa aaagaatgcc agactagacc ataataaata 960 tatttcatac caaaacctaa cagatgaaca aatgaatttt ttttcaggct tgtctcgaac 1020 tacgtttatg tggctctttg aaagagttaa aaattatatt aaaaaagttc actcaaggtt 1080 attgttggag gaccatctct taattgtact cataaaacta aaattaggtc tattaaataa 1140 agatattgct tttcgattta gactaagccc agcagtagtt tcaaaaattt tcagaacaat 1200 aataccaatt ttttctgcta gagttgttaa tctcattgtt tggccagacc gtggaataat 1260 aaggagtaac cttcctatat gttttaaaaa aaagttcaaa gattgtgttt gtattattga 1320 ttgtaccgaa atatttattg aaaggccaaa aaatttaact tcacgatcac aaacttggtc 1380 taattacaaa cataacaata ccattaaata tcttattggt attactccag ctggggcagt 1440 tagtttttta tctcctggct ggggtggtcg tgtttcagac aaacaaataa cgtttgagtc 1500 aaacttttgt cagaagcttt atcctggcga ctgtgttctt gctgacagag gattcaatat 1560 taaagacgaa ctaaatgctg ttggggcaac gttaaaagtt ccagcattta cgaaaggtag 1620 acaacaatta tctggatggg aagtggatac atctagacag atatctaatg ttcgcataca 1680 tgtagaaaga gtgattggtc aaataaaaaa gtttcgcatt attcatatga ctgtgccaat 1740 cacycaagtg gatttgttag atgatattat gattattatt tgtgctcttg taaatctcaa 1800 tatgagtgta gttacttgat taatgtaaat gtaaaaataa atttacagag aagcaatttt 1860 tacagtttta aaatctaaat acatattaaa ggaaagtaat atttgttctg gtttattgct 1920 tggttttaca ttctttgcat aaccaattag tagtttttgg agctcttttc agtccaacac 1980 aagcgtaatg aaaccattca tatttgcagt tggtaaaatt gcatggtatc atttttccaa 2040 aacttggtct ttggcatata cagtaaactt tcctactatt tattgtcgat ttatcattac 2100 gacgtgttac aagctctggc aataaaatgt ttttgaaata cataaaaaac ttatcaatca 2160 ttttgagaat aagatctgta tctttttcaa ttgttaaaat aaaactttgt tcacctttaa 2220 ctgcagggga gaatacaaga aaatcagcct cagaacagag acatacatgt agttgtaatt 2280 gaatttggta gtaatatttg tgatcctttt ttataatgag atctttattg attggaaaat 2340 ctttatcttc ttcccatcct tcaaagccat tttgatattt ataaggacat ttgatctcta 2400 aaattttagt ttgatgacaa ctgcatctaa tcaatgcatc tggagaagct ccaatgaagg 2460 ggtattgttc tctcacatga aaaccagtat ctaaaaattg gaaatttata tgactatatt 2520 ttactttttc aatataatat tttcgtgcaa catgttccgt ttctgttcca tattgtgtgt 2580 acttattaac agaatttttt gcataatgca taattttttt gaccaatgat tcacttgaca 2640 tatagtttgt gttagaattg tatacttctt ttgcatttga agctgttata cgccctgcac 2700 ggtgcatcat ccaaacagga ttcaatgatt gtgtttttgt ttggttgcat aagttattat 2760 actgcctttg attattatca tttttataat cgttatatat actttcgcaa aaaatgatca 2820 acttatcctc agataaattt atacttgttg gatcatatat tgatgtgaga gtttcaggaa 2880 gtgttgtctc ttctgtctca gatgaagttt cagtagcact agagtcatat tcaacagaca 2940 taaattctgt aagattaata cttgtaaaaa ttgcagattt tgggttaatt ttaaaaagtt 3000 tttccatctc ttgagctgtt aaccggtgct caccatccat tggattattg ttggaaaagt 3060 atttttttaa aggtagcttt gcaaccaaat ctttttgtaa aagatttttt tgaggtcgct 3120 gaaaatttat gagctgcaat ggagcaggct gaacagattt tttgcatttc ttccaagtac 3180 acaagatact tgtaggaggt gtttcgtgta attttaaacg tgaagctgct tctaatttaa 3240 ataataatgc tgctacatgg ctacaagctg atcctagtcc agccatacat gaacaattag 3300 cagttagtac ccaaccaagt tctttctcaa gacaagtcca tacatcatac atatttttgt 3360 tgttttgtct ttgacttggc aagacctatt acaaaaagat tatttctata taaaaattat 3420 tatttattcg tctctagtaa aaatttattt tttaattata tattaaaaat aacctttgat 3480 tttataaaac agaatttgct cctattcgat attttattat aaaaacattc ttgaacatga 3540 ccacaagtga agtaatcata tgcttccaag gatttatagg ctctgattga ttccttcgtg 3600 tatatgctag gtgtatcgat taaatattga gtaacgtctt gccatgtaat gtttggcaaa 3660 tgattgagat cgttgctcca ttcttttggt tcaattataa acggatccgg taaaacttcc 3720 ccatttttaa gagttaactt ctttgtgtag tctattgctt cttctaattt tagactttta 3780 aaatattcac tcatcatatt gttaactatt ttgaaaagtt aagtgttatt gttgacttcc 3840 aaaaacagaa aaaacttata agaaaaatcg caagagcaac agctattttt aaactcaaaa 3900 tgagtttaaa atttgaacaa gaacatctgt tttaaaattt tttaaacaaa agtttttaat 3960 aacaataaaa acttttatca ccaaaattta tatgaaatat ttgtaattaa ttttaattat 4020 tattctttta ataatttttt aaagaaaaaa ttaattttag ggtacaaata ttgattttag 4080 ggtagaaaag aaatacaaga aacacaatta agttcttgcg tttttatttt ttctacctac 4140 aagtatgcgc gcgcatgttg tttacaaaaa gacgtgacgt cgcacggaaa ctcatc 4196 // ID SMAR11 repbase; DNA; INV; 2604 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR11. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2604 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1069-1069 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1133..2239 FT /product="SMAR11_1p" FT /translation="MPSRTFISKEEKSAPGFKVSKDRLTVLFCGNAAGDCK FT NKPLIIYKSENPRAFKGISKNSLPVHWVSNKKAWMTAKLFENWFLNCFCVQ FT AETYCKDKNIAFKILLIVDNAPSHPPHLAELHPNVKVIFLPPNTTSLIQPM FT DQGIIAAFKLYYLRRTFRQLIEATDGPDNLSIREYWRQYNILSSVKNIKLA FT WDEVKNSTMNAVWKKLWPESCIREAVVEMPEIVEIVTLGRTIGGDGFQNLE FT SEDITEVLVSNEENNTVDDILTQFGGNNDENDESEEDVQITEEKKFTTEKL FT SSFLKLGSQLEQQALNMDPDMERALKFRRNLTAALAPYAEIYKEKQKAGKQ FT SVLTSFFPKNIQSQPIESSSSACISD" XX SQ Sequence 2604 BP; 910 A; 354 C; 469 G; 865 T; 6 other; tacagtaaac cctccaactg cgcggttttt cattgcgcga tttcgcaatt gcgcggccaa 60 acctttaata ccaaaatttc gatctgcgcg ctaaaaattt cgtagctgcg cggttttttt 120 tgtttttgtc tattttaaca agaaaaatca tgtcaattta attttttatg gtgaatcccg 180 taaaactacg aatacatatg gtagtgaaaa tagtgattca tcgtatttcg atgcactgta 240 ttttttatat aaaaataata aaggcagttt catttttcta ctgatgtggt taggattagg 300 atgtaaaatt acatagtttt aattaataga tatttataca agtgtttatt taattaaaaa 360 tatattagta ttaaattcaa aaaaatctta ttcgtgtgca ttagatataa ccgtaataaa 420 aaattattac atcaattaaa gagtgttata aagtaagtat aaataaaatt tttgattttt 480 gatttaatac tcattttgat ttaataggtt tttaataatt tctacacctt atatataata 540 atggaaaaag ctggcaaaag aaaaatattg acgttagagg caaaacttga aattgtgaag 600 caattagaaa atggcgaaaa aatgtctgtt ttagctaaga gacacaatat gaacgaatcc 660 agcataagga ggattaaatt gaatgccgaa aaaattaaaa gttcagtagt ttgttctaca 720 tcgctagctg caaaaacact acaatgatta gaaaacgtga taacattatt gctgaaatgg 780 agaaagtgtt artgttgtgg attgaggatc aaactagtaa taacatkcca ttaagtcaag 840 ccctaattca atctaaggcc ctaactctat ttaactcaat gaaggctagc aaaaggagga 900 acctgggtca tctaatttag acgaaattaa gtttgaagct agtagaggct ggtttgtgaa 960 gtttaaagaa cgccaaaact tacataatat taagtttaca ggtgaagcag ctagttctga 1020 tatagaagct gcatatttat atccaraaam gttaaaacat atcattgawg gwtgtggata 1080 tttaccaagc caagttttta atgttgatga gactggccta ttttggaaaa ggatgcctag 1140 cagaacattt atatctaaag aggagaaatc tgctccgggg ttcaaggttt ccaaagatag 1200 attgacagtt ttgttttgcg gtaatgcagc tggcgattgt aagaacaaac cactaataat 1260 ttataaatca gaaaatcctc gtgcttttaa aggcatatca aagaactcac tgccagttca 1320 ttgggtatca aataagaaag cttggatgac tgcaaagctt tttgagaact ggtttttaaa 1380 ttgcttttgt gtacaagctg aaacttactg taaagacaaa aacattgctt ttaagattct 1440 gttgattgtt gacaatgcac caagtcatcc accacattta gctgaattgc accccaatgt 1500 taaagtaata tttctgcctc caaatacgac atctttaatt cagccgatgg accaaggcat 1560 aattgccgca tttaagctgt attatcttag aaggactttt cggcaattaa ttgaagctac 1620 ggatggccca gataatcttt caattcgtga gtattggcga cagtataata ttttatcttc 1680 cgtaaaaaat attaaattag catgggatga agttaaaaac tcaaccatga atgctgtatg 1740 gaaaaaatta tggccagaaa gctgcattag agaagctgtt gttgaaatgc cagaaattgt 1800 ggaaattgtg actttaggta gaacaattgg tggggacggt tttcaaaatt tagaatcaga 1860 agatatcacg gaagtccttg tgtcaaatga agaaaataat actgttgacg atattttgac 1920 acagttcggc ggtaataatg atgaaaatga tgaaagtgaa gaagatgtac agattacaga 1980 agagaaaaag tttactaccg aaaaactatc aagtttttta aaacttggta gtcaacttga 2040 acagcaagct cttaatatgg accctgatat ggagcgtgcc ttaaaatttc ggagaaatct 2100 gacagcagct ttagcaccat atgctgaaat ttataaagaa aaacaaaaag ctggtaagca 2160 atcagtgtta acaagctttt ttccaaaaaa tatacaatca cagccaatag aatcaagctc 2220 atcagcatgt atcagtgatt aatataacgt tacttatgtt aatttttatt tatgtttagt 2280 cgttgaattg agactgtttt ttagttcttg tattgttttt tataattttt atttatgtta 2340 aatttttatt aatttttata atagtaatat taattatgta aaagcaatga attgagactg 2400 tttttattag tttttatgtt tatataaatg taaatatgta tattagtcgt tgaataaaga 2460 cttttattat attttttaat atttttgtaa tatctggaac ctaaccccat aattacatat 2520 aaaatcgggt tcacagtgcg cggtttcggt ttgcgcgttg tttttgtgga acgtatctac 2580 cgcgcagttg gagggtttac tgta 2604 // ID HERMES repbase; DNA; INV; 2749 BP. XX AC L34807; XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 22-DEC-1995 (Rel. 1.11, Last updated, Version 1) XX DE Musca domestica Hermes transposon transposase gene, complete cds. XX KW hAT; DNA transposon; Transposable Element; Ac element; HERMES; KW Hermes transposon; Tam3 element; hAT element family; KW hobo transposon; transposase. XX OS Musca domestica OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Muscoidea; Muscidae; Musca. XX RN [1] RP 1-2749 RA Warren D.W., Atkinson W.P. and O'Brochta A.D.; RT "The Hermes transposable element from the house fly, Musca RT domestica, is a short inverted repeat-type element of the hobo, RT Ac, and Tam3 (hAT) element family."; RL Genet. Res 64(2), 87-97 (1994). XX RN [2] RP 1-2749 RA Warren D.W.; RT "HERMES."; RL Direct Submission to Genbank (21-JUL-1994)William D. Warren, RL Molecular Genetic Resource Service, CAMBIA, GPO Box 3200, RL Canberra, ACT 2601, Australia. XX DR GenBank; L34807; Positions 1 2749. XX CC repeat_unit 1..17 CC /rpt_type=terminal inverted repeat CC CAAT_signal 354..357 CC TATA_signal 394..399 CC CDS 450..2288 CC /codon_start=1 CC /product="transposase" CC polyA_signal 2300..2305 CC polyA_signal 2336..2341 CC polyA_signal 2349..2354 CC polyA_signal 2353..2358 CC repeat_unit 2733..2749 CC /rpt_type=terminal inverted repeat. XX SQ Sequence 2749 BP; 946 A; 457 C; 502 G; 844 T; 0 other; cagagaacaa caacaagtgg cttattttga tacttatgcg ccacttgcta cttatgagta 60 caattgtgct ttgccacttg aacaaaaaat tcattgattc atcgacactc gggtatgttt 120 tgtcgtgtcg ttctgcgcac tcagttaaat tttttgtctt actctcttgc tctcagcaca 180 tcaagtgttg ttacttgttg ttactcagtc gcctgcctta tgcttttgga gagcgaaagc 240 acaacgatca gaacggagaa gtaacaactt gttttgctaa caagtggctt atgcacttga 300 gtgtgtttta cacatgtttt tgagtttcac agcaaaatgt tccgatttga gcacaataat 360 tttaccgtta ttttgagttt tttagttttg aataataaat gtgatttact gttcatcctc 420 aaaagagttt aagcagtagt agagattaga tgcagaaaat ggacaatttg gaagtgaaag 480 caaaaatcaa ccaaggatta tataaaatta ctccgcgaca taaaggaaca agttttattt 540 ggaacgtttt agcggatata cagaaagaag acgatacatt ggtggaaggg tgggtgtttt 600 gccgaaaatg cgaaaaagtt ttaaaataca caactaggca gacatcaaac ttatgtcgtc 660 ataaatgctg tgcctctcta aagcaatccc gagaattaaa aactgtttca gctgattgca 720 aaaaggaagc aattgaaaaa tgtgcacaat gggtggtacg agattgtcgg cctttttcgg 780 ccgtctctgg atccggcttt atcgatatga taaaattttt tattaaagtt ggagccgaat 840 atggtgaaca tgtcaacgtt gaggaattgt taccaagtcc aataacgcta tcgagaaagg 900 taacttcgga tgcaaaagaa aaaaaagctc tgattagtcg agaaattaag tctgctgtag 960 agaaagatgg tgcatcagca acgatagatt tgtggaccga taattatata aaacggaatt 1020 ttttgggagt aacgttacac taccatgaaa acaatgaact gcgagatcta attttaggtt 1080 taaagtcctt agattttgaa agatccacag cagaaaatat ttataagaag cttaaagcca 1140 tttttttaca attcaacgtc gaagacttga gtagtataaa atttgtgaca gatagaggag 1200 ccaatgtcgt aaaatcattg gcaaataata tcagaattaa ctgcagcagc catttgcttt 1260 caaacgtgtt ggaaaattca tttgaggaga cacctgaact caatgtgcct attcttgctt 1320 gcaaaaatat tgtaaaatat ttcaagaaag ccaatctgca gcacagactt cgaagttctt 1380 taaaaagtga gtgccctaca cggtggaatt ccacatacac gatgcttcga tctattctcg 1440 acaactggga aagcgtgatt caaatattaa gtgaggcggg agagacacag agaattgttc 1500 atataaataa gtcgataatt caaacaatgg tcaacatcct cgatgggttt gaaagaattt 1560 ttaaagaatt acaaacatgc agttcaccat ctctgtgttt tgttgtgcct tccattttaa 1620 aagtaaaaga aatatgttca cctgacgttg gcgacgttgc agatatagca aaattgaaag 1680 tgaacattat aaaaaatgta agaataatat gggaagaaaa tttaagcata tggcactaca 1740 cagcattttt tttctatccg cccgccttgc atatgcaaca agagaaagtg gcacaaatta 1800 aagaattttg cttatccaaa atggaagatt tggaattaat aaaccgcatg agttccttta 1860 acgaattatc cgcaactcag cttaaccagt cggactccaa tagccacaac agtatagatt 1920 taacatccca ttcaaaagac atttcaacga caagtttctt tttcccgcaa ttaactcaga 1980 acaatagtcg tgagccacca gtgtgtccaa gcgatgaatt tgaattttat cgtaaagaaa 2040 tagttatttt aagcgaagat tttaaagtta tggaatggtg gaatcttaat tcaaaaaagt 2100 atcctaaact atctaaactg gctttgtcgt tattatcaat acctgcaagt agcgctgcat 2160 cggaaaggac attttcccta gctggaaata taataactga aaagagaaac aggattgggc 2220 aacaaactgt cgacagcttg ttatttttaa attcctttta caaaaatttt tgtaaattag 2280 atatataatt acatttttaa ataaaaagaa tattttttat aagtttgttt gttaaaataa 2340 aaaaaaaaaa taaataaatt ttggactgga aaaaatttaa gtttaaaaga agcatttttc 2400 tttttttttt taatatactt atgctctttt cctagtcttg tacagaatca tatgcaatac 2460 tacaaacaat agcacacaca cacaaccctc atgttcaatg agtatacaac acaacaagaa 2520 gtgagtataa tttgccaatt gacaaatcgc acacgtccac ttgtgagttt gtacactttt 2580 tactctctca tactctagcg gtgatcttaa catcaaacaa ctgttgttgt taagttgtga 2640 aaaaatactc gtgtataaaa aaatacttgc actcaaaagg cttgacaccc aaaacacttg 2700 tgcttatcta tgtggcttac gtttgcctgt ggcttgttga agttctctg 2749 // ID Jockey-N1_CQ repbase; DNA; INV; 1589 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1589 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 584-584 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. ~11 bp TSDs. This family encodes a protein similar to CC Jockey ORF1p but does not encode ORF2p. Thus it is a CC non-autonomous non-LTR retrotransposon derived from Jockey, CC like CC HeT-A. XX FH Key Location/Qualifiers FT CDS 75..1295 FT /product="Jockey-N1_CQ_1p" FT /translation="MPHGKKKRRSSPAGSADLKKLKNAEALPAKPGSLSKD FT AQNSSGNQFATLPVDVSEKEEFERREKLPPIFVKTSSSDSVRKWLTGFIKS FT GALRASIRLCADGLKILLPTRKDYNYVRDFLNNTKIEYYSHDDPGKRPMKQ FT VLRGLYDMDVSVLKEELKTLKLNVIEVFKMTRHNKDIKYRDQLYLVHLEKG FT STTPSELKAVRAIFNIIVTWERYRPVHRDVTQCSNCLQFGHGGRNCFIKSR FT CATCGGEHKTQACETINENIEAKCFNCGGDHSTKNRSCPKRAEFVKIRQQA FT TTRHQPNRRKTPPTFTDVDFPALASPGAGSVRVVPNLQPLPLNQRQKVAEN FT TTPPGFSQQPRENQPASTDEGSSDLFSPQELLNIFIEMTTTLRGCKTRAEQ FT VRTLGGFILKYSS" XX SQ Sequence 1589 BP; 522 A; 329 C; 359 G; 378 T; 1 other; agtctgcgat cagcagagaa gcgagtcaga cgtgcgtccc tcacaaacca aaaaagtgct 60 taaaaagtgc aaaaatgcct cacggcaaga aaaagaggcg gtcctcacca gcaggatcgg 120 cagatttgaa gaagctaaag aatgccgaag cgctacctgc aaagccaggc agtttgagca 180 aggacgctca aaattcgtct ggaaaccagt tcgctaccct ccctgtggac gtgagcgaga 240 aggaagaatt tgaacgacgg gaaaagttgc cacccatttt tgtgaaaaca tcgtcatcgg 300 attcggtgcg aaagtggctg accgggttta tcaaatctgg tgctttacga gcttccattc 360 gcttgtgtgc tgatggactc aaaattctgc tacctaccag aaaggattac aactacgttc 420 gggatttcct gaacaacaca aagattgaat actacagcca tgacgatcca ggtaaacgcc 480 ccatgaaaca ggtcctccga ggcctgtacg acatggatgt gagtgtgctg aaagaagagc 540 tcaaaactct taagttgaac gtgatcgaag tcttcaagat gacgagacac aacaaggaca 600 tcaagtatcg tgatcaactg tacctggttc atctcgagaa aggatcgaca acgccgtctg 660 agctgaaagc agttcgggca attttcaaca tcatcgtgac ttgggaacgt tatcgtccag 720 tgcaccgtga cgtgacgcag tgttcgaact gtttgcagtt tggacatggt ggaaggaact 780 gtttcatcaa gagtcgttgt gcaacctgcg gaggtgagca caaaactcaa gcttgcgaaa 840 caatcaacga gaacatcgaa gcgaaatgct tcaattgcgg cggcgaccat tcgaccaaga 900 atcgaagctg cccaaaacgw gctgagtttg tgaaaattcg gcagcaagcg acgacgaggc 960 accaaccaaa tcgtcgcaaa acaccaccaa ctttcacgga cgtggatttt cctgctttgg 1020 cgtcacctgg agcgggatct gttcgagtgg ttccaaatct gcagccattg ccgttgaatc 1080 agcggcaaaa agttgcagag aatacaacac ctcctggctt cagtcagcaa ccgagggaaa 1140 accaaccagc atcaacggat gaaggcagta gtgacctgtt ttcaccacaa gaacttctga 1200 acattttcat cgagatgaca acaacactgc gtggttgcaa aactcgtgcg gaacaagtaa 1260 gaacgcttgg aggatttatc ttaaaataca gttcgtagtt tactcgttct aatgattttg 1320 aatatttcaa tgaatttact tttttagttt taagttagga ttagttgtta aagtgatttt 1380 ttttttcttt tttacccaaa tgtattcgat tcaatgcaaa aatgtaaaac aatttgaagt 1440 aaataaatga atgtgaacag ttaataagaa aacgaaaaat ataacaaaga tgaagagcat 1500 gtagccatac cacttagaat gtaaatgaaa tgtaattatt ataagaaagt tcaataaaga 1560 catatttaat ttcaaaaaaa aaaaaaaaa 1589 // ID BEL-125_AA-I repbase; DNA; INV; 5426 BP. XX AC AAGE02023612; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-125_AA_; KW BEL-125_AA-LTR; BEL-125_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5426 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023612; Positions 266021 260596. XX CC Positions [4450-5055] - Integrase core CC 'CACAT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 41..1096 FT /product="BEL-125_AA-I_2p" FT /translation="MEKLIGRRNAQLVQVKWQFAAAEKLHERNASQGEVLD FT RLEQLRELARKFRDTQSEIEENQPDPEAIASVHDFREEFNSHFYRAKDLLE FT QYISNDGDDGDGSSGSVRGSTGSYRSVAGSSRDLRDAVQMPLETQRAMLLN FT QSSGGRNVNNVDPAGGEQVSNHAPFANVRLPAINVPTFDGDRKHWRSFKDI FT FIQTIHARGDLRDSLKMQYLISYLSGDAKRLVNSFPISDANYKEAWAALTN FT FYDKKKYTVFALVREFVEQPAIANATADNLRKLVTTSDEVVRQLNALGEEY FT NTRDPWLIHLLLEKLDKDTRSLWAQKIIDVENPSFEEFLKFLDNRTDALET FT CTASGNPCV" FT CDS 3376..5397 FT /product="BEL-125_AA-I_1p" FT /translation="MAYGAVVYSRAMDDAGNIFVNLVAAKTRVAPIKQVSL FT PRLELNAAVLLAELMQRVTQALSHITVEHWAWTDNTIVLQWLSSHPRKWKT FT YVANRTSAILDFLPRDRWDHVSSQDNPADCASRGLSPGEFVSFDLWFNGPE FT WLHLDEEFWKTEPQESIVDEDRLEARKLKALHTSPVNIRSSYVVEMELLNR FT RSSYGLVVRALAHVNRFLQAVRSYRLEATLTPSEIDSAKMQLARAAQFDVY FT KQEIEILAKGKELPTKNKLSSLHPFLDDNGTMRVGGRLQSSPYSFNVKHPI FT ILPRDHRLSELLLRELHLQNLHAGPTLLTATVNQQYWIVGLQTAVRRIVRG FT CVRRVRLKGQTANQLMGNLPVPRVMATRAFTHVGVDYAGPFKIHAMCVRGI FT KATKGYLAVFVCMATKAVHLEAASDLSSNVFIAALKRFIGRRGYPNEIWSD FT NGTNFVGTDRWLQEIQQSLKNNGKAVDHFLNNQGIKWVFNPPSAPHRGGIW FT EAAVKSAKKHIVAVLGSEPLTFEEFSTILSQVEACLNSRPLCALSSNPDSY FT EALTPGHFLVGHPLNLIPEPGVRHIPSNRLDRWQTLQKYNEEIWKRWREEY FT VATLQPRTKWRTTEDNVKVNQLVLVKNENVPPAQWELARIVKLHPDASGIV FT RTVTLRRGQTEYQRPVQKVCVLVSD" XX SQ Sequence 5426 BP; 1422 A; 1319 C; 1451 G; 1234 T; 0 other; gtttttggtc cttcttcgtc ggattacgtc gcgaaccggt atggaaaagt tgatcggtcg 60 aagaaacgca cagctagtgc aagtgaaatg gcagtttgct gcagccgaaa aactccacga 120 gcgaaacgct tcgcaggggg aagttttgga tcggctcgag cagcttagag agctcgccag 180 gaagttccgt gacacgcaaa gcgagattga ggaaaatcaa cccgacccgg aggcaatcgc 240 ctcagtgcat gacttccggg aggaattcaa ttcgcacttc tatcgtgcaa aggatttact 300 cgagcagtac atttccaatg acggcgacga tggagatggc agcagcggct ctgtcagagg 360 ttccaccggg agctacagat cggtggccgg ttccagccgt gatcttcggg atgccgtgca 420 gatgccactg gagacccaac gagcgatgtt gctgaatcag tcgtcgggcg gaagaaacgt 480 aaacaatgtt gatcctgcag gaggagaaca ggtgtcgaac catgcacctt ttgcgaatgt 540 gcgtctgcct gcaatcaacg tgccaacatt cgatggcgat cgaaagcatt ggagatcctt 600 caaggatatt ttcatccaaa cgatccacgc aagaggcgat ttgcgggatt cgttaaaaat 660 gcaatacctg atttcgtatc tgagcggcga tgcgaagcgt ctggtgaatt cattcccaat 720 atcggatgcc aactacaagg aggcgtgggc agccttaaca aacttctatg ataagaagaa 780 atacacggtc ttcgctctcg ttcgggaatt tgtcgagcaa ccggcgattg ctaatgctac 840 tgcggacaat ctgcgcaagt tggtcactac ctcggacgaa gtagttcgcc aactgaacgc 900 actgggagag gagtataata cgcgagatcc gtggttgatc cacctgttac tggagaagct 960 tgataaggac acacggtcct tgtgggcaca aaagatcatc gatgtcgaga atccttcctt 1020 cgaggagttt ttaaagtttc tggataatcg caccgacgct cttgaaacct gcacggcgtc 1080 tggaaatccg tgcgtctgag gtgtcgacag atgggcagcg caaggatact gggaagaaga 1140 tgcatgctga gaagaagatg caatcgttgc acacggctgc ggtgatacaa aagtgcgcga 1200 aatgctctag tgagcatccg attcatttgt gtgaagactt taagaagatg gacttacagt 1260 gcaaacgcga acttgtacaa aaggcgaaat tgtgctacaa ctgtctgcgg tcgtcgcatt 1320 cagtgaaaaa ctgtgcttcg aagtcggtgt gtcgcactgt gggttgcaaa cagcgccatc 1380 atacgctgtt gtgcgcaaag gcggaaaatt cgagtgacca gcgagacgaa caaggagagc 1440 cgaaagagca agtgcaacct ttgcaaggtt ctggagaagt agtcaactct ctggtagttc 1500 aagttccaaa cccactgaag gaaacgttcg tactaccgac tgcgctcatc aacgtgaaag 1560 cggtgaacgg aggattcctg aagtttcgtg cattgattga ttccggctcc ggtgcttcac 1620 tgatcaccga agcctgtgtg aacaaactcg gtattccacg tacaagcggc aaggtagcgg 1680 ttttccggac tggcgcagca gtctgccgtg acaactcgag gactcgtgaa gcttgtgatt 1740 gcaaatcggt gcagcgataa cgtgattctg cggaccaacg cgttcgttat gggcaagctt 1800 acttcaacac ttcctgcgca gtgcattgtg cccaattcca agctgcttga gaagtagatt 1860 caagagtcgt tagcggatcc ggcatacaac cagccaggtc caatcgacat aattttgggt 1920 tcggatgtct ttcttgccat actgcaacca gggcaagtca aggacgaatt cggtatccca 1980 atcgcccaga acacgatctt cggctggatc gtatctggca atcaatcgat ctacacagca 2040 ggggttccca acaatttctc catcatcaac ctacacgcag aggtggatgt caatcgtaca 2100 ttgcgccaat tctgggaaca agaggaagtg ccgaaatctc agcagcttac accagcggac 2160 caggctgcgg ttgagtgttt tcggtccacg accacacgtg acgattcggg acgcttcatg 2220 ttcgattgcc tttcgacgat tcgaagtcag cgctaggcga atcaatcgca ccaactatca 2280 aacggttgag gtccatggag aaacgcttcg aacgtgattc aagctttcgt cgacaatact 2340 cggacttcat cgcagaatac ctgacactgg gtcacatgga ggaggtacct gcggatgaga 2400 taagcgtgga agttgacaaa tgcttctacc tagcccacca tgcagtgatt aaagcagaca 2460 gttcaacaac taaactgagg gtagtttttg atgcctcgag tgcatcggga tcaggtgtct 2520 ctctcaacga ccggctctta tccggaccaa acgtgaatca agatttattc gatgttcatt 2580 tgcgtttccg gtcgaacgag gttgtattcg ctgcagatgc agaaaagatg ttccgccaag 2640 tgttggttca tccactagac cgagactacc aacgaattgt atggcgtaac gaccccagcg 2700 aacccatcaa gcattttcgt ctctgtaccg tcacatacag tacgaagcca gcgccgtttc 2760 tggcaattga agccatgcgc gaggcagcca gaagctatga aacagtgtat ccagtagctg 2820 ccgcgaggat agtgctcgac atgtacgtcg atgacttcat gtctggcgcg aagaacatcg 2880 atgaggcaaa gaagttgaaa gaccaggtgt gcgagatcct aaagtccgcg ggattcaact 2940 tacgcaagtg gacgactaat cgaccatagc tgctgggtga cagtgagtac tcggagcaaa 3000 taccaatgga gatgaagcta gaggagcagc cggatgctgt caaggctttt ggcatacatt 3060 gactacccaa ggacgacgtt ttttccttca aggtcagcct atcggcggac agtgttaaca 3120 ctaaacgaca actgctctcg gattcatcac ggctcttcga tccgtacggg tggctgtctc 3180 cggttatcgt gaagataaaa attctctttc aacaactgtg aaatttgctg ggataatccg 3240 ttgcctgcaa cagtggaggt accatggaag gacatcaagg aaagtctccg atttttggaa 3300 cagattcgta taccacgcta catcgtaaac ttcaacggga aggtgcagtt gcacggattt 3360 tcagacgcat ccgaaatggc ctacggtgcg gtggtataca gtcgagcaat ggacgacgcg 3420 ggaaacattt tcgtaaatct cgttgctgct aagactagag tcgctccaat caagcaggtt 3480 tcgcttccac gcttggagct gaatgctgca gttctactgg cagaactcat gcaacgagtt 3540 actcaagcac tctctcacat cactgtggaa cactgggcat ggacggacaa cactatcgta 3600 ctgcagtggc tgtcttcgca tccgcgcaag tggaagacgt acgtagctaa ccgtacttct 3660 gccattcttg attttctacc tcgtgaccgc tgggaccacg tgagcagcca agacaacccg 3720 gctgattgtg cgtctcgagg tctttcgcca ggtgagtttg tatcgttcga tctctggttc 3780 aacggaccgg agtggcttca tttggatgaa gaattttgga agactgagcc ccaagagtca 3840 atcgtcgatg aagatcgtct cgaagcacgc aaactaaagg ctttacacac cagtccggtt 3900 aacatcagaa gcagttacgt cgtcgagatg gaattgctta atcgacgttc aagttacggt 3960 ttggtagtgc gcgcactagc gcacgtgaat cgttttctac aagctgtaag aagctatcgc 4020 ctggaagcaa cgctaacgcc aagtgaaatt gactcagcga aaatgcagtt ggcaagagct 4080 gcacaattcg acgtgtacaa gcaagagatc gagatactcg ccaagggaaa ggagttgccg 4140 accaagaata aattgtcatc actacacccc ttcttggacg acaacgggac aatgcgcgtg 4200 ggtggtcgtc tacagagctc accctactcg ttcaacgtga agcatcccat aatacttcca 4260 cgtgaccatc ggttgtcgga actacttcta cgagaacttc accttcagaa cctacatgca 4320 ggtccaacgc tattgacagc aacagtgaac cagcaatatt ggattgttgg actgcagacc 4380 gcagttcgac gaatagttcg tggatgcgtt cgacgcgtga ggctgaaggg gcaaaccgcc 4440 aatcagctca tgggcaacct accagtacct cgtgttatgg caacgcgcgc attcactcac 4500 gtcggcgtgg attacgctgg cccattcaag atacacgcaa tgtgcgtcag gggaatcaag 4560 gctaccaagg gatatttggc cgtgtttgtg tgcatggcca ccaaggcggt gcatttagag 4620 gccgccagtg acctctcaag taacgtgttc atcgccgcac tcaagcgatt catcggaagg 4680 cgcggctacc ccaacgaaat ctggtccgac aacggtacca attttgttgg tacggatcgt 4740 tggcttcaag agatccagca atcactgaag aacaacggca aggcggtcga ccactttctc 4800 aacaatcaag gcatcaagtg ggtttttaac cctccatcgg cgccgcaccg tggcgggatt 4860 tgggaagcgg ccgtaaagag cgccaagaag cacatcgtcg ctgttcttgg ctccgaaccg 4920 cttacatttg aggaattttc gacgattttg tcccaagtgg aggcctgcct gaactccagg 4980 cccctctgcg cgctctctag caatccggac agttacgagg ccttaacgcc gggacatttt 5040 ctggtgggac atccactgaa ccttatcccg gaaccaggtg ttcggcatat cccatccaac 5100 cggctagata gatggcaaac gctgcaaaag tacaacgaag agatatggaa gcgatggcgt 5160 gaagagtatg tggctacact acagccgcgt acgaagtggc gaactacgga agataatgtg 5220 aaggtgaacc aactagtgtt ggtgaaaaac gagaacgttc cgccggctca atgggagttg 5280 gcacgcatcg tcaagctaca tccggatgct tcgggtattg tcaggactgt gaccttgcga 5340 cggggtcaga cggaatatca gcgtcctgtg cagaaggtct gcgttttggt atcggattga 5400 ggcacgtggc ctcaaggtgg ggagga 5426 // ID Gypsy-54_CQ-I repbase; DNA; INV; 4566 BP. XX AC AAWU01036387; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-54_CQ_; KW Gypsy-54_CQ-LTR; Gypsy-54_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4566 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 487-487 (2011). XX DR Genome; AAWU01036387; Positions 5447 10012. XX CC Positions [3525-3989] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS join(99..2768,2772..4553) FT /product="Gypsy-54_CQ-I_1p" FT /translation="MDGDESMNGQVPPNGQLPPNGQLPPNGQLPPNGQLPP FT NGQLPPNGQFGQQPHGFRPGADPYMQWFSQQQQVYVTEMFRQQQEVLQQQQ FT AAMQQQQRAFMEQQEQLLRNVMTRINVQVPPNPEVILDSLANNIKEFRYDP FT ENNVTFAAWYNRYDELFANDASRLDDPAKVRLLLRKLGMSEHERYTSYILP FT KTAKFFSFADTVAKLTSLFGTAESVISRRYRCLQIMKQPLEDYVTYACRIN FT KACVEFELPKLSEEQFKCLVFVCGLKSEKDAEIRTRLLCRIEERADITLEL FT ISEECQRLLNIWHDAAMIEEQVAAVQALDSKPKFPRKQFKSGPAGDRAKSG FT RSGPDTACWLCGGMHYARDCSFKAHECTDCGLVGHKEGYCRSAEKVRKKYV FT STKVVKVNTCSVEDRRKYVPVVMNRTSVRLQLDTGSDITIISTGTWRSIGS FT PPMSTTSVVAKTATGKPLEIDGEFWCEMVVGGRKSSGLVRVTSQPLNLLGS FT DTIDSFGLWSVPLDTICHLVETSTTTTTNVDALKAEFPHLFSGELGLCVKT FT KVKLELKPEKSPVFRPKRPVAYAMYQAVDEELDRLERCNIISPVDFSEWAA FT PIVVVRKASGAVRICGDYSTGLNDALQPNQYPLPLPQDIFASLASCTVFSQ FT IDLTDAYLQVEMDESSRAMLTVNTHRGLYQYNRLAPGVKPAPGAFQQVIDG FT MLSGLVRTCGYLDDVVVGGLDEEDHKKNLRAVLQRIEEFGFTIRAEKCSFG FT QPQIRYLGHLLDRQGLRPDPAKIEAIANLAVPTDVSGVRSFLGAINYYGKF FT VANMRNLRYPLDELLKDGNKFRWTPECQQAFDRFKQILSSDLMLAHYDPTQ FT EIIVSADASSVGLGATISHKYTDGSVKVVQHAARALSAEQGYSQPDREGLA FT IIFAVTKFHKMIFGRSFRLQTDHAPLLRIFGSRKGIPVYTANRLQRYALTL FT LLYDFTIEHVPTEKFGHADVLSRLISNHAKPDEDYVIASITLDNDLRSVVV FT DVASALPLSFRVVERDTQVDPLLKKIYRYLRNGWPQKHTIVDSEILRFFAR FT QDSLSTVDGCVLFGERLVIPERHRQRCLRQLHQGHPGISRMKAIARSYVYW FT PSIDDDVEALVKACKHCASAARSPPHSAPVPWPRPTGPWKRVHVDFAGPME FT GAYFLLIVDAYTKWPEVIKTNRITSVATIGMLRSVFARLGMPELLVSDNGT FT QFTSAEFLEFCTTNGIQHLTTAPFHPQSNGQAERFVDTFKRAVKKIQAGRS FT SIDEALDVFLMTYRTTPNRQVEGGVSPSEAMFGRRIRTNLELLLPPPPKPP FT SESTELKGGSRKRRFEPHDLVYAKLYSGNKWHWAPGVVCDRVGQVMYTVWV FT EDRRMLRSHANQLLSRADWDPRTDGAEKPKLPLDILLDSWNLSKPTPAADP FT TTPTVLDASLQSASPDQRSSHGVPECSPSIPRSPEQVAPPESEPASLVPGL FT RRSTRVRKKPQRFNSYQLN" XX SQ Sequence 4566 BP; 1050 A; 1215 C; 1348 G; 953 T; 0 other; aaagtggcga ctcgggttag ttttggccta atctttacgg gtccaagacg acgcccacgc 60 ggaaacgtag aagacttcgg gaaagttccg agagttcgat ggacggcgac gaatccatga 120 acggccaggt tccgccgaac ggtcagttgc cgccgaacgg acagttgccg ccgaacggac 180 agttgccgcc gaacggacag ttgccgccga acggacagtt gccgccgaac ggtcagttcg 240 ggcagcaacc acatggattc cgacctgggg cagatccgta catgcagtgg ttctcgcagc 300 agcagcaagt ctacgtgacg gagatgttcc gccagcagca ggaggttctc cagcagcaac 360 aagcggccat gcagcagcaa cagcgagcgt tcatggagca gcaggagcag cttctacgga 420 acgtcatgac gcggatcaat gtacaggtcc cgccgaaccc ggaggtcatt ctggactcgc 480 ttgccaacaa catcaaggag ttcaggtatg atccggaaaa caacgtcacc ttcgcggcct 540 ggtacaatcg ttatgacgag ctctttgcga acgacgctag ccggctggac gatccggcca 600 aggtgcgctt gctgctcagg aagctgggga tgtcggaaca cgagcggtac acgagctaca 660 tactgccgaa aacagctaag tttttctcgt tcgcggacac cgtggcaaag cttacgtcac 720 tgttcggtac ggccgaatcg gtgatcagtc gacggtaccg ctgcttgcag atcatgaagc 780 aacctttgga ggactatgtc acctatgcct gccgcatcaa caaagcttgc gttgaattcg 840 aactgcccaa gctctcggag gagcagttca agtgtcttgt ttttgtgtgt ggtctgaagt 900 cagagaaaga cgcagaaata agaacgcggt tgctgtgcag gatcgaggag cgagcagaca 960 taacactgga gctgatttcg gaagaatgcc aacgccttct caacatttgg cacgacgcgg 1020 cgatgatcga ggagcaggta gccgctgtgc aagcgttgga ttcgaagccg aagttcccga 1080 ggaagcagtt caagtctggt ccagctggtg atcgcgcgaa gtccggtaga agtggtccag 1140 acacggcctg ctggttgtgt ggtgggatgc actacgcaag ggactgctcg ttcaaggcgc 1200 acgagtgcac cgactgtgga ctggttggcc acaaagaagg ctactgccgg agcgctgaga 1260 aagtccggaa gaagtacgtc tcgacgaagg tggtgaaagt taacacgtgc agcgtggaag 1320 atcgtcggaa gtacgtgcca gtcgtgatga atcgcacctc tgtgcggctg caactcgaca 1380 ccgggtccga catcaccatc atctcaacgg gaacgtggcg cagcattgga agtccaccaa 1440 tgtcaacaac ttccgtcgtg gcgaaaactg ctacgggcaa gcccttggag atcgacggtg 1500 aattctggtg cgaaatggtg gtcgggggac gaaagtcttc tggtctggtt cgagtgacaa 1560 gtcaaccact gaatttgctg ggctctgaca caattgacag ctttgggctg tggtcagtgc 1620 cgcttgacac catctgtcat ctggttgaaa cgtcaacaac aacaacaaca aatgtcgatg 1680 ctcttaaggc ggagtttcca catttgtttt ctggagagtt gggcttgtgc gtgaagacga 1740 aggtgaaact tgagttgaag ccggaaaaat cacccgtttt tcggcccaag agaccggttg 1800 cttacgcgat gtaccaagct gtggatgaag agttggaccg actcgagcgc tgcaacataa 1860 tttctcctgt ggatttttcg gagtgggctg ctccgatcgt ggtggtacga aaggcaagcg 1920 gggcagtacg aatctgtgga gactattcca cggggttgaa cgacgcgttg cagcccaacc 1980 agtacccgtt gcctcttccc caggacattt ttgctagtct ggccagttgt accgtcttca 2040 gtcagattga cttgaccgat gcgtacctgc aggtggagat ggacgagagc tcgcgtgcca 2100 tgctcaccgt caacacgcac cggggactat accagtacaa ccgcctggca cctggagtaa 2160 agccagcacc gggagctttc cagcaagtca tcgacgggat gctgtccggg ctcgtgagga 2220 cttgtggtta tctcgacgac gtggttgtcg ggggtttgga cgaagaggac cacaagaaaa 2280 acctacgggc ggtgctgcag cgcattgagg agtttggttt cacgattcgt gcggaaaaat 2340 gttcgttcgg tcagccgcaa attcgctacc tgggccatct tcttgatcgc cagggattgc 2400 ggccggatcc agccaagatc gaagcaatcg ccaatctcgc cgttccgacc gatgtcagtg 2460 gagttcggtc atttctggga gccattaact actacggcaa gtttgtggcg aacatgcgca 2520 atttgcggta tcccctcgac gagctgctga aggatgggaa caagtttcgg tggacgccgg 2580 aatgccagca agcgttcgac cgattcaagc agattctcag ctccgacttg atgctggccc 2640 actacgatcc gacgcaggag attattgtct cggccgacgc gtcatccgtt gggctaggcg 2700 caaccatcag ccacaagtac accgacgggt cggtgaaagt cgttcagcat gcagctcggg 2760 cgctgtcgta ggcggaacag gggtacagcc agccggacag agagggtcta gccatcattt 2820 ttgccgttac caagttccac aagatgatct tcggtcgctc gttccggctc cagaccgatc 2880 acgcgccgct gttgcggatc ttcgggtcgc gtaagggcat tcccgtgtat acggcgaacc 2940 ggctgcagcg ttacgcgttg acccttttgc tgtacgattt caccatcgag cacgtcccga 3000 ctgagaagtt cggtcacgcc gacgtgctgt cacggcttat cagcaaccac gccaaacccg 3060 acgaggacta cgttattgct agcatcaccc tggacaacga tttgaggtca gtagtagttg 3120 acgtagcaag tgcccttcct ctcagtttta gggttgtgga gcgtgacacg caagtcgacc 3180 cgttgctgaa gaagatctac cgttacctgc gcaatggctg gccacagaag cacacgatcg 3240 tcgactcgga aattttgcgc ttcttcgctc ggcaggactc gctgagcact gtggacgggt 3300 gcgttttgtt tggagaaagg ctggtcatcc ctgaacggca ccggcagcgg tgtctgcgcc 3360 agctgcacca aggacacccg gggatctcac gcatgaaggc catcgcccga agctatgtct 3420 actggccttc gattgacgac gatgtggagg cacttgtgaa ggcgtgcaag cattgcgcat 3480 cagcggcacg atcccctccg cactcagcac cagttccgtg gcctagaccg acgggaccct 3540 ggaaacgtgt gcacgtggac ttcgctggtc cgatggaagg tgcgtacttt ttgctgattg 3600 tagatgccta caccaaatgg ccggaggtga ttaagacgaa tcgcatcact tcggtagcaa 3660 caatcggcat gctccgcagc gtttttgctc gtctgggtat gccagaactg ctggtcagcg 3720 ataacgggac acagtttacg agcgctgagt tcctggagtt ctgtacaact aacggcatcc 3780 agcaccttac gacggccccg tttcatccgc agtccaatgg ccaagcggag cggtttgtgg 3840 atacgtttaa acgagcagtg aagaagattc aggcggggag gagctcaatc gacgaagcgc 3900 tagacgtctt cctgatgacg tatcgaacta caccgaatcg tcaggtcgaa ggcggcgtgt 3960 caccatctga agccatgttt ggacgccgca tccggacgaa tctggaactg ctcttgccac 4020 caccaccaaa accaccaagc gagtcaacgg agttgaaagg tggttcccgg aagcgacggt 4080 ttgaaccaca cgacctggtt tacgcgaagc tgtacagcgg aaacaagtgg cactgggcgc 4140 ctggcgttgt ttgcgaccgg gttggccagg tgatgtatac cgtttgggtc gaggaccgcc 4200 gtatgctccg atcgcacgca aatcagcttc tcagtcgtgc agattgggat ccgaggacag 4260 atggagcaga gaagccgaag ttgcccctcg acatcctgct ggattcctgg aatctgtcga 4320 aaccaacacc ggctgcggac ccaacaacac cgacggtctt ggacgcgtca ctacagtcag 4380 caagtccgga tcaacgctcg tcacatggtg tcccagaatg ctcgccatcc ataccaagat 4440 cacccgagca ggttgcacca cctgaaagcg agcctgcgtc gctggtaccg gggcttcgtc 4500 gttctacacg agttcgaaag aagcctcagc ggttcaactc gtaccagctc aattgagaag 4560 ggggga 4566 // ID Proto2-2_CS1 repbase; DNA; INV; 4814 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-2_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-2_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4814 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1557-1557 (2009). XX DR [1] (Consensus) XX CC Proto2-2_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1_SK) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in Proto2 CC elements from all species mentioned above. ORF2 codes for a CC protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 369..1559 FT /product="Proto2-2_CS1_1p" FT /note="ORF1." FT /translation="MSETDATPPDDPGDQSSPVTNALLGYVFNCMHFSSVY FT NLRAVVLGHFSADAIHKAKETLAQHVASSVVGEAFFTTRRGSSARGKAKAR FT TGKAKEKEEMELDDIIEAMEAMDKAGVSVEFHIPAFELHLLPRTRPEDLLS FT VTFMENVKRLEARVAALDTHAALSTEQMSILKQKVDHLETTAHASGSSCVS FT PETIPMSSMHFPSLNAINSAGTMSSSMDPTSSAPVYSTRDKPFLEPNHANP FT PRSYSVALQKNSDGGTAIPWTKVRKTRKSRPINRTDVKNATQQLRTVTGKS FT ARDSELVGSMPAKHIYVSKLNTDITSDHISDYMRKKNIHVRQTRKVSKDEW FT LHGSFKVAINAEDLERVLDEEFWPERVCCREWLPFIAKNSNGDDPKKNYND FT DDE" FT CDS 1426..4617 FT /product="Proto2-2_CS1_2p" FT /note="ORF2: AP endonuclease, RT." FT /translation="MPRTWSESLTKNFGQKESAAGNGCHSLQRIVMEMIPK FT KIIMMMMNNEITQNLKVCTYNIAGFNRCNWLYVQQLCDNNDFVFLQELWLH FT SSEGHRITESLSNVSMHFVSGMPDDEIHVGRPYGGCCIIWNSKLKCKVSPL FT PCGNSRVCAIKIDLSDGPLLLINAYMPCDTQVAYHDNQLVFEEVLNDVADL FT IQISNIDRIVFGGDFNTALSRRASTHTRSLNQFVTNESLSFLSALSCYEVD FT YTHESDAHGTRSTLDHFIVSSQLVCSVNRVQCDHNALSSSDHTALLLSLSI FT SMPSISNGQSVPRRGPVPLWPKATDAHIANYRNSLSESTEDLTPPYEALRC FT DSPQCTAHTETILSYYNALISLCLSSGRRSIPHSPVSRSGSRRSVPFWSTH FT VRPLKDNALFWHAIWKSCDSPATGALTNIRRSTRAKYHRAIRYFKQNDQLA FT RFTRMGEYFVNQGRDDFWTEVQKMRGNNSATPTIVDDCLSEDDIALCFREK FT YNTLYNSVGYVPEEMARLRQEIEANACRHEGEMCRSHVLSIRDVTIAVSNL FT KRGKHDGHLGHYSDHLRFAPNRFLCCLSLTLNSLLVHGLVPDEMALSTVSP FT IPKNKRKSLNDSDNYRAIALSSIIGKMLDRVLIEKCSALTDTSDLQFGFKA FT RHSTNQCSFVAREIIEYYHSRDSDVYITLLDASKAFDRVEYTALFRILISK FT SVCPIVARFLLNLYTQQQIRVRWGASHTDSFASSNGVKQGGVLSPLLFSLY FT LDPLLRQLESSAFGCWIGFQYCGALAYADDVVLIAPTLRAVKEQLKVCAQY FT AQTYQVSFNATKSKLIPLCRNDPLTSTAFSVSLMDEPIECAAQDKHLGNVI FT GFFNEEEVIDNVIKDFNKRVGMIRSHFKWLAPDCAYHLFKSFCMPLYGSVL FT WNFSHRSVNRFYTAWRKAIRCILHLHPRSHSCFLAPVCNDVDVEIQLLCRN FT VRFLRSLERSNNRVVRCCFQLVQNGSRSTVSRSFSKLCDVTQKSRHHIVES FT VCHPTALFEISNDHEEIAGLIRDLLALRAELKLDQNLNAHDIEFLRDIEFS FT IQELCTA" XX SQ Sequence 4814 BP; 1228 A; 1094 C; 1070 G; 1421 T; 1 other; tgttagagta gatcaggttc ggttgagacg cattgaaagt ttaggagccc cttttttgca 60 attttctctg gattttgtga tcggattccg cggattttgt ttgcatattg tgaggtattg 120 ttctatagaa cctatctgtg ttctcgttat tttgttttct ttttgttttg tacgtctcaa 180 ccgaggttat tgtacatcat atttctctaa gtgcatgact ttgagcgatc gaggtcatcg 240 ttacgcttga ctggtcagca ttgaccggtt cgttcgtatt gtcatatttt tatctcgatt 300 tttcccctct tttacctgtg ttcagctgat tgttgcgtag ttctggtgtg catgaacaac 360 catacagcat gtccgaaact gatgctaccc cacctgatga tccgggtgat caaagttcac 420 ctgtcactaa tgcattatta ggctacgtgt tcaactgcat gcactttagt tctgtgtata 480 acttacgggc tgttgtactt ggacactttt cggctgatgc aatacacaaa gcaaaggaga 540 ctctggctca acatgttgct tcttcagttg tgggagaggc ctttttcacg actcgacgcg 600 gttcctctgc aagagggaaa gcaaaggcaa ggactggaaa agcaaaggaa aaggaagaaa 660 tggagctgga tgacatcatt gaggccatgg aggcgatgga taaagccgga gtcagcgtgg 720 aattccatat tcccgcgttt gagctgcatc ttcttccgag aacaaggccc gaggacttgc 780 tttccgtgac attcatggaa aacgttaaac gcttggaggc acgtgtggca gctcttgata 840 cccatgcggc gctgtcgacg gagcagatga gcatcctgaa gcagaaggtg gatcatttgg 900 agactacggc acatgcatct gggtcgtcct gtgtctcccc tgagaccatt cccatgtcct 960 caatgcattt cccaagcctc aacgccatca actcggcagg aacgatgtca tcctcaatgg 1020 atcccacatc ctctgcaccc gtctactcta cacgggataa gccgttcttg gaacccaatc 1080 acgcgaaccc tccccgtagt tactccgttg ctctgcagaa gaattccgat ggcggcacag 1140 caatcccatg gaccaaggtt cggaaaactc ggaaatctcg ccctattaat cggactgatg 1200 tgaagaatgc gacccagcaa ctccggacgg tcacagggaa gagcgctcgt gattcagagc 1260 ttgtaggaag catgccggcg aaacacatat acgtcagtaa actgaacaca gatatcacct 1320 ctgatcacat cagtgattac atgcgaaaaa agaatatcca cgtacgtcaa acgagaaaag 1380 ttagcaaaga tgaatggcta cacgggtcat ttaaagttgc aataaatgcc gaggacctgg 1440 agcgagtcct tgacgaagaa ttttggccag aaagagtctg ctgcagggaa tggctgccat 1500 tcattgcaaa gaatagtaat ggagatgatc ccaaaaaaaa ttataatgat gatgatgaat 1560 aacgaaataa cgcaaaactt aaaagtttgt acatataata tagcagggtt taatagatgt 1620 aactggctct atgttcagca attatgtgat aataatgatt ttgttttcct tcaggagcta 1680 tggttacaca gctcagaagg acaccgcatc actgaaagct taagtaatgt ttccatgcac 1740 tttgtttcgg gtatgccaga tgatgagata cacgtcggta ggccatatgg cggctgctgt 1800 attatctgga attccaaatt aaaatgtaaa gtatctccgt taccatgtgg caacagtaga 1860 gtttgtgcga tcaaaatcga tctaagtgat ggccccctgc tgttgattaa tgcttacatg 1920 ccatgtgata cacaggttgc ttatcacgac aaccaattag tgtttgaaga agtcctgaat 1980 gatgtcgccg atttaattca gatcagcaac atcgacagaa tcgtgttcgg tggcgatttc 2040 aacactgcat tgtctcggcg tgcttcaacg cacactcgtt ctcttaatca gtttgttact 2100 aatgagagtt tatcttttct cagtgctctt tcatgctatg aagttgacta cactcacgag 2160 agtgatgcgc atgggacgag atcaacactg gaccatttca ttgtatccag tcagttggta 2220 tgtagcgtca atcgcgttca gtgcgatcat aacgcactca gctcatctga tcatacagca 2280 cttttactat ctctctctat tagcatgccg tccattagca atgggcaatc agtccctcgg 2340 agaggtccag tgcctctctg gccgaaggcc actgatgctc acatagcaaa ctatcgcaat 2400 tctcttagcg aatccactga ggatctcaca ccaccttacg aagcacttcg atgtgattct 2460 ccccaatgta ctgctcatac ggagaccata ttgtcctact ataatgcgct aatcagtctt 2520 tgtctttctt ctggtcgcag gtctattccg cactcccccg tgtcccggtc aggtagccga 2580 cgatcagtcc ctttctggtc gacgcatgta agacccctga aagataacgc actcttctgg 2640 cacgctatct ggaagtcatg tgactctccc gccactggtg ctctcaccaa tatccgacgc 2700 tctacgcgtg caaaatatca tcgtgcaatt agatacttca aacaaaatga ccagctcgct 2760 cgtttcactc gcatgggtga atattttgtt aatcaggggc gtgatgattt ttggactgaa 2820 gtgcagaaaa tgcgtggcaa taatagtgcc actccaacta ttgttgatga ttgcctgtct 2880 gaagatgata ttgctctttg ttttcgcgag aagtacaaca ccctgtacaa cagtgttggt 2940 tatgtgccag aagaaatggc ccgtttgaga caggaaatcg aagccaatgc atgtcgtcat 3000 gagggagaga tgtgccgctc tcatgtgctc agtatccgag atgttaccat tgctgtgagt 3060 aaccttaagc ggggcaagca tgatggtcac ttggggcatt actctgatca tctccgtttt 3120 gctcctaacc gttttctttg ctgcttatca ctgactctca acagtttgct cgtgcatggt 3180 cttgttccag atgaaatggc tctatctacc gtttctccga ttccgaagaa caagcgcaaa 3240 agtttgaatg actctgataa ctaccgggct attgccttaa gcagtataat cggtaagatg 3300 ctcgacaggg ttctgatcga gaaatgttct gctctgactg atacatctga tctccagttt 3360 ggtttcaagg cacgtcattc taccaaccaa tgttcgtttg ttgccagaga gataattgaa 3420 tattatcaca gccgtgattc tgatgtctat attactttac ttgatgcctc taaggccttc 3480 gacagagttg aatacacggc actgtttcgc attctgatct caaaaagtgt ctgtccgatt 3540 gtggccagat ttctgctgaa cttgtacacg cagcagcaga tacgggtgcg ttggggtgca 3600 tcgcacactg acagctttgc ctcatcaaat ggcgtgaagc aaggaggcgt actgtctccg 3660 cttctctttt cactctatct agatcctcta ctgcgacagc tggaatcatc tgcattcgga 3720 tgttggattg gtttccagta ctgtggcgct cttgcctacg ctgatgatgt cgtactaatt 3780 gcacccactc tcagagcagt gaaggagcaa ttgaaggtat gtgctcagta cgcacagaca 3840 taccaggtat ctttcaatgc aacaaaaagt aaattgattc ctctatgccg taacgatcct 3900 ctcacctcta cagcattcag tgtgagcctg atggatgagc caatcgaatg tgctgcacag 3960 gataaacacc tggggaatgt tattgggttt tttaatgagg aagaggttat tgataatgtc 4020 atcaaggact ttaacaaaag ggtcgggatg attcggtccc actttaagtg gcttgctccc 4080 gattgtgcgt atcatctgtt caaatcattc tgtatgcctc tgtatgggtc cgttctttgg 4140 aacttttcac accggtcagt aaatcggttt tacactgcat ggcgtaaagc tattcgatgc 4200 atactgcatc tgcatccgcg ctctcactca tgttttctcg ctccagtctg caatgatgtt 4260 gatgttgaaa ttcagctact gtgccgcaac gtcaggttcc ttcggtcttt ggaaagaagc 4320 aataaccgtg ttgtgcggtg ctgctttcag ttggttcaga acggcagtcg ctctactgtt 4380 agccgctcgt tctctaagct gtgtgacgtc actcagaaat cccgtcatca tattgtggaa 4440 agtgtttgtc atccaactgc tctgttcgaa atcagcaacg atcacgaaga aatagctggc 4500 ttaatcaggg atctccttgc tctgagagct gaattaaagc tggatcagaa cctaaatgct 4560 catgatattg aattcctgcg agatattgaa ttctctatac aagagctatg taccgcctga 4620 agctgtgctt caactcattg atactttttt tccctctcat tcattatatt ctttctgttc 4680 tttttgcacc ccggtccgtt cagattgacc ctagaaattt tttttttttt ttattytcac 4740 tgaattatta ttattattat tttcactaaa atcacattga tgttcaatga agaacgaata 4800 aacatatata tata 4814 // ID CR1-89_HM repbase; DNA; INV; 3805 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-89_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3805 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1931-1931 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 180..602 FT /product="CR1-89_HM_1p" FT /translation="MKKENNILIYGITQCNNEDILEKKKFDQNEIKTIMSK FT INTEAIKIKKLTRLRSKTEGKSAPIILELESKACRNLMLKNAYNNKEKING FT IYFNVDMTEAERYLMYQLRNQTKELNTSITNTSSIYYGIRDYKIVKLKRTQ FT N*" FT CDS 609..3677 FT /product="CR1-89_HM_2p" FT /translation="MSLNISSFKPLLPNNKFNKKNIVTNTLTKYVKKSQTR FT DVIISEKRSIDITKNCLSCRYTNATSLNNKFDLFLLDIATFKPDIYMITET FT WFNENSVTHVNGYQIIHNNRSNRSSVKAHGGGVAIYVKEGLLVYSPNDSFT FT KNINIEHTWCILSIKDEQILLGCIYRPPDASEEINNYICEIIEAAKNKIDH FT NRYSGLLISGDFNYPNIIWKKNNVQTMKKYDKYSTKFVEYINDNYLHQAVM FT SPTYNADLINGNILDLILTETPERINKIHHNPTLGNTRNGHQILSWKYFLK FT NTTKKEMSNYGKSGFNYKLGNYTVINKAINENDWESLFGKNSINECYEIFL FT NIYDKLCNKYIPKKKIYFNKQPQPKPWIDREAKKAIRNKTSLWHKLLSNGF FT KCDQLKVQYILINKTIKNLIKTKRINYEKKIADSSKENPKLFYAYVNSNRK FT IKLGINIITDKNGSLQTNRDDIANILNENFHSVFVIEDPTNFPNIALKTSK FT TLELDIDSIITQDIVRIKLSELNVNKALGADGVSSYVLKKCQNSFCKPLEL FT LFKRSLQEEQIPLIWKMANVTPLHKNGDKNDPANYRPISLTSIPCKILESI FT LGNKIMDYMLANNLLNSNQHGFQKNKSCTTNLLETQDILLDAIENGWCVDV FT LYTDFSKAFDKVPHMRLMSKLISYGIVGVILNWIEAYLHNRKQRVILGDCV FT SKWLTVESGVPQGSVLGPLLFLIFINDLPDTLKNNIKLYADDSKIINIFKS FT DEDIRINKLQLDIDNIMKWSSLWLMKFNYEKCKIMHIGFKNPCITYSMFDA FT ETQQNNILAASKVERDLGVMMQSNLKWTSHIDKAVGKSNQMLALIKRTFKY FT FDSNMTKKLYTTFVRPHLEFAIQAWCPYLKQDILKLEKVQRRATKLVPAFK FT KMPYEKRLQLMKLTTLEERRLRGDLIFQYKIQNGYEKINWINDSTSQVQLK FT PYNFRSHNQKLTKAYCKSNYRYNFFSCRVVNAWNGLPPYIIESMTIDQFKN FT KLDKSDYCKNYLLNTK*" XX SQ Sequence 3805 BP; 1612 A; 532 C; 529 G; 1132 T; 0 other; tattatctcg agttgaacat ttagaacttt tagaaagaca gaataaaaat gtacaaactt 60 tagataatgt tgttgttgaa aacgaaaaag catggaacat cgtagctgga aaaaatatta 120 agaagactaa agagcaaatt aacatcatca atactattgc aaatgaaaca gcggaacgaa 180 tgaaaaaaga aaataacata ctgatttacg gaataaccca atgtaataat gaagacattt 240 tagaaaaaaa aaagtttgac caaaatgaaa taaaaacaat tatgtctaaa attaatacag 300 aagctataaa aataaaaaaa ttgacacgtc taagatctaa aaccgaaggt aaatcagctc 360 caataattct tgaattagaa agtaaagcct gtagaaattt aatgttaaaa aatgcataca 420 ataacaagga aaaaattaat ggtatttatt ttaacgttga tatgactgaa gcagaaaggt 480 accttatgta tcaactaaga aatcaaacca aagaacttaa tacttcaatt actaatacat 540 catcgatata ttatggtatc agggattaca aaatagtgaa attaaaacgt acacaaaatt 600 agcaatcaat gagtttaaat attagttcat tcaaaccact tttgcctaat aataaattca 660 ataaaaaaaa cattgtgacg aatacattga ctaaatatgt taaaaaatct cagactcgtg 720 atgtaattat ttccgaaaaa cgtagtatag atataactaa aaattgtcta agttgtagat 780 acacaaatgc tacatcatta aacaataaat ttgacttgtt cctattagat attgcaacct 840 ttaaacctga tatctacatg atcacagaaa cttggtttaa tgaaaactca gttacacatg 900 taaatggtta tcaaataatc cataataata gatcaaatag gagttcagtt aaagcacatg 960 gaggtggtgt tgctatctat gtaaaagaag gtcttttagt ttactcaccc aatgatagtt 1020 tcactaaaaa tatcaacatt gaacatactt ggtgtatttt aagtataaaa gatgaacaaa 1080 ttttacttgg ctgtatttac cgaccacctg atgcatctga agaaataaat aactatatct 1140 gtgaaataat agaagcagct aaaaataaaa ttgatcataa caggtactct ggtttactta 1200 tttcaggaga ttttaattac ccaaatataa tatggaaaaa aaataatgtt caaaccatga 1260 aaaaatatga taaatactcc accaaatttg ttgaatatat aaatgacaac tacttacatc 1320 aagcagttat gtcacctaca tataatgcag atttaattaa tggtaatatc ttagatttaa 1380 ttttaacaga aacacctgaa cgtataaata aaatccatca taacccaact ctaggcaaca 1440 ctagaaatgg acaccaaata cttagctgga aatatttttt aaaaaacact actaaaaagg 1500 aaatgtcaaa ctatgggaaa tctggattta actataaact tggtaattac acagtcataa 1560 ataaagctat caatgaaaat gattgggaaa gtctttttgg aaaaaatagt attaatgagt 1620 gctatgaaat attcttaaat atctatgata aattatgcaa caaatatatt cctaaaaaaa 1680 aaatatactt caataaacaa ccacagccca aaccttggat tgatagagaa gcaaaaaaag 1740 caataagaaa caaaacaagt ttatggcaca aactattatc taatggtttt aaatgtgacc 1800 agctgaaagt ccaatatatt ttaatcaaca aaacaataaa aaatcttatt aaaacaaaaa 1860 gaataaatta tgaaaaaaaa atagctgact catcgaaaga aaatccaaaa ctgttttatg 1920 catatgttaa ttcaaacaga aaaataaaat taggcattaa tataataaca gataaaaatg 1980 gttctttaca aacaaatcgt gatgatattg ctaatatcct caatgaaaac tttcactcag 2040 tatttgttat agaagatcca acaaattttc ctaatattgc attaaaaaca agcaagactt 2100 tagaattaga tatagactca ataataactc aagacatcgt tagaattaaa ctaagtgaac 2160 taaatgtcaa taaagcatta ggagcagatg gtgtaagttc atatgtgtta aaaaaatgtc 2220 aaaatagttt ttgcaaacct ctagagttac tattcaaaag atccttacaa gaagaacaaa 2280 ttccactcat atggaaaatg gctaatgtaa caccactaca caagaatgga gataaaaatg 2340 atcctgcaaa ctatagacca atttcattaa cgtcaatacc atgcaaaatt ctagagtcaa 2400 tcttaggtaa caaaataatg gattacatgt tagcaaacaa tttattaaat agtaaccaac 2460 atggttttca aaagaacaaa agctgtacta ctaatctact agaaacacaa gatattttac 2520 ttgatgctat tgagaatggc tggtgtgttg atgtgttgta tacagatttc tcaaaagctt 2580 ttgataaagt acctcatatg cgattgatgt caaagttaat atcatatggt atagttggag 2640 taattctaaa ttggatagaa gcctatttac ataatagaaa gcaacgagta attttagggg 2700 attgtgtatc aaaatggcta acagttgaga gtggagttcc acaaggttct gtccttgggc 2760 ctcttctgtt tcttattttt ataaacgatc tgccagacac attaaaaaat aatattaaat 2820 tgtatgcaga tgatagcaaa attataaata tttttaaatc agatgaagat atccgtataa 2880 acaaattaca actagatatt gataatatta tgaaatggtc gtcactatgg ttaatgaaat 2940 ttaactatga aaagtgtaaa attatgcaca ttggttttaa aaatccttgt ataacttatt 3000 caatgttcga cgctgaaact caacagaata atatattagc agcatcaaaa gtagaacgtg 3060 atttaggagt aatgatgcaa tctaatttaa aatggaccag tcatattgat aaagctgttg 3120 gtaaatcaaa tcaaatgctt gctcttatta aacgaacttt taaatatttc gactcaaata 3180 tgacaaaaaa attatataca acttttgtaa gaccgcatct tgagtttgca atccaagcat 3240 ggtgccctta tctgaagcaa gacatcttaa agttagaaaa agtacagaga agggcaacaa 3300 aactcgtacc agcttttaaa aaaatgccat atgaaaaaag attacagtta atgaaattaa 3360 caactctaga agaacgccgt ttaagaggag atcttatttt tcaatataaa atacaaaacg 3420 gttatgagaa aataaattgg ataaacgact cgacatctca agttcaatta aaaccttaca 3480 attttcgtag tcataaccaa aagttaacaa aggcatattg taagagcaat tatagatata 3540 atttcttttc ctgtagagtc gtaaatgctt ggaatggttt accaccatat attatcgaat 3600 caatgacaat tgaccaattc aaaaataaac ttgataaatc agattactgc aaaaattatt 3660 tattaaatac taaataatat ctatatacat tgaaaagtaa acaatgctat tataaaacat 3720 ttatagctgt tatagcacaa catcttctgt tgttgtgctc cacaattgat ttcatcagtt 3780 gtcacagtgt acattattat tatta 3805 // ID Gypsy-204_AA-I repbase; DNA; INV; 3615 BP. XX AC AAGE02024741; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-204_AA_; KW Gypsy-204_AA-LTR; Gypsy-204_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3615 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024741; Positions 4929 8543. XX CC Positions [2294-2803] - Integrase core CC 'GAAC' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 515..3178 FT /product="Gypsy-204_AA-I_1p" FT /translation="MTESGIIEEARSEWNSPVLLVPKKSCDDKKKWRMVID FT YRKVNNTLQDDRFELGNIEDIIDSLAGAKYFTHLDLSQGYYQCEIDPKSRP FT ITAFSTASGQFQMTRLPMGLKISPSTFSRLMTVAMSGLCMEKCLVYLDDII FT VFGKTLEEHNKNLISIFERLRGVNLKLNPSKCNFLKQELLYLGHFISEEGV FT RPDPSKIESIKNWPVPKTADEVKRFVAFANYYRKHIKDFSKICIPLNHLTR FT KDVTFEWTPECQQTFEMLKNEFMNPPVLDYPDFENSFKLQTDASGYALGAV FT LLNRKNGRPVGYASRALNKAELNYGTIEKELLAVVWAIKHFRPYLYGRKFD FT LETDHRPLVYLFSMKEPSSRLTKFRLALEEYDFNITYIPGKDNVLADALSR FT MSINDLREINKKVDQDVLITTRSKTVKENSDKNQDNEVTGQHSLSGLSIEI FT IFNESNTIPEVKFNAPKHQIIIRPAKTLIQLRRIVVMLREICKSKNINELV FT IKNTDPGKQFYNMVYTYNLNNGMPPFRIVDSKVKCIENKTEQQLIINDFHL FT LPTAGHAGITKTLKNIQRRYFWSTMKKDITNFIKSCEACQKNKHIKPKNIP FT QIVTTTASSAFSKIYLDLVGPLLPSNEGHAYILTTQCELTKYITATPIINK FT TTEVVAKAFVENVILNYGVPDEIATDRGSEFMSELFTKICELLKISKLNST FT AYHHQSIGALENSHKSLGNFLRIYASGTPGNWSSWIKFYQFAYNTTVHLET FT DKSPFELVFGKICKLPSSLNNSETPNPIYNLDDYSKILKFKIQNTQKEVQK FT RLVESKLERVKKLNCNARFMNYSPGQLLLVRNETGNKLSQIYEGPYPVVMD FT KGTNVEVRIRDKIDIVHKNRVKPFITRNAASNTEN" XX SQ Sequence 3615 BP; 1356 A; 560 C; 673 G; 1026 T; 0 other; tggcgatcct gccaggtatt tgtgtacccg ctaggacaga aataacatca tatatattgc 60 gaacattaag gaatctcatg tgcttttgag cgaggaagta aaaccacacg tttttatagc 120 caatgcgatt gtgcaaccta cgaatggtaa aattccgatt aaaattttga atgtctccag 180 aaaaacagtc tttttaaatg ggctgaaacc aaaattggaa aaacttgaaa aatatgaaat 240 actacaacta aatgatgtca aaggtgaacc gaatagggtg gaaaaattat tgaaagaatt 300 aaagattgat catttgcctt ctaaagagag agcaactatt cgggaaattt gtataaaata 360 taacgacata ttttgtttgg cagatgataa attaacggtt actaaaattt taacaccgac 420 aattaccgtc aaagaaaaca cgcaaccggt gtatacgaaa ccatataggt tgccacaatc 480 tcaaaaagac gagatagcga aacaaattaa aaacatgact gagagtggaa taatagagga 540 agcccggtct gaatggaata gtccggtgct ccttgtgcct aagaagtcat gtgatgacaa 600 aaagaagtgg agaatggtta ttgattaccg caaagtaaac aataccttac aagatgacag 660 gtttgaatta ggaaatatcg aagacataat agactcttta gcaggggcca agtactttac 720 gcacttggat ctctcgcaag ggtactatca atgtgagata gatcctaaga gcagaccaat 780 cacagcattt tcgacagcct caggtcaatt ccaaatgact aggctaccga tgggattgaa 840 aataagccca tcaacgtttt caagattaat gacagtcgcg atgtcgggac tgtgtatgga 900 aaaatgtcta gtttatctag acgatataat agtgtttggc aaaaccttag aggaacacaa 960 caaaaattta atttcgatct ttgagagact tagaggagtt aatcttaaat tgaatccctc 1020 taaatgcaat tttttaaagc aagagttact ttatttagga cattttatat cggaagaagg 1080 ggttcgacct gatccttcga agatagaaag tattaaaaat tggcctgttc ctaaaacggc 1140 cgatgaagtt aaacgttttg ttgctttcgc aaattattat cgaaagcaca taaaagactt 1200 ttcaaaaata tgtattccct tgaaccattt gacgaggaaa gatgtaacat ttgaatggac 1260 acctgaatgc caacaaacct tcgaaatgtt gaaaaatgaa tttatgaatc caccagtttt 1320 ggattatccg gattttgaaa attcatttaa attacaaact gacgcatcag gatatgcttt 1380 aggagcggtg ctgttgaata ggaaaaacgg aagaccagta ggatatgcgt ctagagcatt 1440 gaataaggca gaactgaatt atggaactat cgaaaaagaa ctattggctg tagtttgggc 1500 gatcaaacat ttcagaccat atctgtatgg cagaaagttt gatttagaaa ctgaccacag 1560 accattagta tacttatttt cgatgaagga accctcgagt agattaacca aattcaggtt 1620 agctctcgaa gaatatgatt ttaacataac atacattccg ggaaaagata atgttttggc 1680 agacgcctta tcgcgaatgt caatcaatga cttaagggaa attaacaaaa aagttgacca 1740 ggatgttctt attaccacac gaagtaaaac agtgaaagaa aattctgata agaatcaaga 1800 taatgaagtg actggccagc actcattgtc aggattgtct atcgaaataa tctttaatga 1860 aagcaatacg atacctgaag tgaaatttaa cgctccaaaa catcaaatta taattcgtcc 1920 agctaagact ctaatccagc tacggcggat agtagtaatg ctgagagaaa tatgtaaaag 1980 taaaaatata aacgagctcg ttattaaaaa tacggatcca gggaaacaat tttacaatat 2040 ggtatatacc tataatttaa ataatggcat gccaccattc agaatagtgg attcgaaagt 2100 taaatgtatt gaaaataaaa cggagcaaca actcataata aatgattttc atctactgcc 2160 tacagcagga catgcaggta tcacgaaaac tttaaaaaac atccaaagaa gatatttttg 2220 gtcgacaatg aagaaagaca ttaccaattt tattaagtct tgtgaagctt gtcagaaaaa 2280 taagcatata aaaccaaaaa acataccgca gattgtcact actacggcaa gtagtgcgtt 2340 tagcaaaatc tatctagatt tagttggacc attattgcct agtaatgaag gtcatgccta 2400 tattttaacg acgcagtgtg aattgactaa atacataact gctacaccta taatcaacaa 2460 aacaaccgaa gttgttgcca aagcgtttgt ggaaaatgtt attttgaatt atggagtgcc 2520 agatgaaatt gctactgatc gaggatcaga atttatgtcc gaacttttta cgaaaatttg 2580 tgaactttta aaaatttcta aattaaactc aacagcttat catcatcagt cgataggcgc 2640 attggaaaat tcacacaaaa gtttgggaaa ttttttgaga atttatgctt caggaacacc 2700 aggaaactgg tcttcatgga taaaatttta tcaatttgcg tataatacca ctgttcattt 2760 ggaaacagac aaatctccat ttgaactagt gtttggaaaa atttgcaagt taccatcaag 2820 tttaaacaat tccgagaccc caaatccaat ttataattta gatgattatt caaagatact 2880 caagttcaaa atacaaaaca ctcaaaagga agtacagaaa cggttagtag aatcaaaact 2940 agaaagagtt aaaaagttaa attgcaacgc gagatttatg aattactctc cagggcagct 3000 tttgttggtt aggaatgaaa cgggaaataa gttgagtcag atttatgaag gaccttaccc 3060 agtagttatg gataaaggaa ccaatgtaga agttagaatt agagataaaa tagatattgt 3120 acataaaaat agagtaaaac cttttattac taggaatgcg gcaagtaata cagagaatta 3180 gcttttagga cgtgtgacgg atgcaatccg tagacgtgtc aatcaattgt caattttctt 3240 tacttattat caataatcaa ttatcaattt ttagtaatca atagtcagtt atcgatgtat 3300 taatttctaa aaacttacaa tattcaataa tccgtaatat gtaaccatat atgatatcca 3360 aattaaatga gtatttctaa tgttgatttt tgtaagcctt gagaaggcag taccaaagat 3420 atgagatatc taaggtatca tcaacgtcac attagttaaa cgcatttaaa gtaattattt 3480 ttaaataaaa gaaattgaat gttttaaatt tttaactaaa gataaaaatt taaaaataaa 3540 taaatagggc gtgttgcgga tgcacccctg actacgctag tggtgtaatg gttgaagtag 3600 aactcggacg atggc 3615 // ID BEL-613_AA-I repbase; DNA; INV; 5638 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-613_AA_; KW BEL-613_AA-LTR; Pao_Bel_Ele41; BEL-613_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5638 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4691-5251] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 333..1382 FT /product="BEL-613_AA-I_2p" FT /translation="MLKNLVSQWREARRQRKRKRRLVIKAVVPDVNVPHSR FT RNDELPLDNTSKPRVKNAPTESGIVPDLRHQLASMDFNRAAENAVPNEEPS FT SHASNFRTHHESRSQDPPRNRPVENLLETELNDSHLHTRGGRQYVEEYDGP FT TNRQLAARQVMGKDLPPFSGRPEEWPLWFSNFQRSTTTCGFSDDENLIRLQ FT RCLKGEALETVRGKLLCPSSVPHVIRTLEMRYGRPSTLIRVMTERIRQLPS FT PRMNDLNSIIEFGLAVNSLVEHLQNSAQVSHLNNPSLLHDLVTKLPVDYRL FT KWAAYKSSLREPNLTAFGNFMSTMVELAYEVADDVPSEKSKPKERAFLHAH FT SESPSSK" FT CDS 1550..5638 FT /product="BEL-613_AA-I_1p" FT /translation="MALKTWNGCEFGDCRLRHHTLLHSISGSSNVAILNNH FT MGALQSINGPLFRIMPVTLYGNGCQINVFAFIDEGSQLTLLEDVVAKQLGI FT DGPTEPLGLQWTGNIKRNEAKSQRISMEVSGTGSTKRFELSDARTVGGLLL FT PSQSMRYEELAKRYPHLRGLPLQDYANVSPKILIGLDNLKLTVPLKVREGG FT WKDPIAARSRLGWSIYGCSQAPASSVICGFHFGGWTDPDRELNDLVRNFFS FT IEDAGVTSPQKILESEEDKRARMLLETTTRRVATGFETGLLWKTDDVHFPE FT SFGMAARRLRALESKLSKDPAMYENVRLQIHQYLEKGYAHIASEQELAQVP FT PDQTWYLPLGVHRNPKKPEKIRLTWDGRASVQGVSFNSALLKGPDMLSSLP FT SVLCHFRQYRYALTGDIKEMFHRIRIREEDRQFQRFLWRNAPHDPPKIYVM FT DVATFGSTSSPCSAQYVKNLNASEFTEKCPRAVYAIHHYHYVDDYLDSFSS FT RDEAITVGRQVKMIHSEGGFELRNFLSNDAQISARVGEESTENDKSLSLDG FT AESVLGMRWVPICDCFTYDLTMRSDLADIMKDGRIPTKREVLRVVMSLFDP FT LGLIAFYVVHGKILMQEIWVSGVHWDDPISQQLYDRWIQWCRLLPQLSSIR FT IPRYYFPKAESEVYRSLQVHVFVDASESAYSSVAYFRVESQHGPLVALVGA FT KTKVAPLKMLTIPRLELQAAVLGTRLLNNVISMHGLPVVQRFLWTDSATVL FT AWLHSEQRRYHQYVGFRIGEILSTTEVGEWRWIPSKLNVADVATKWGSGPQ FT IDSTNRWFTGPEFLQLPETHWPQKIKLQLSTDEELRSCNVHLPAMIQFVDV FT ARFSKWERLLRSVAYVHRFVNNIRRSQRGELLEVGALIHDELKSAEESLWR FT QAQREFYGAEIELLQRTRGRLDACHVGVPKSSNIYKLSPFIDADGILRKRS FT RLGAAFWLPYDAKNPIILPREHRISLLLTDFFHRRFRHANRETIVNEMRQR FT YEISKLRSLIARVSRDCMECRVHHTVPRSPPMAPLPKVRLTPYVRPFTFVG FT IDYFGPVLVKVGRSNSKRWIALFTCLTIRAVHLEVVHNLTTESCVMAVRRF FT VSRRGSPAEIYSDNGTNFHGADNQLKREIEERNYRMATVFTNSTTRWRFNP FT PGAPHMGGVWERMVRSVKAAIGTILETQRRPDDEVLETVMIDAEAMINCRP FT LTYIPLEYAEQEALTPNHFLLGSSSGIKQLPVEPVDFRSTLRSGWKLAQHL FT TDGFWRRWLKEYLPVISRRSKWFDEVRDLEVGDLVLVVEEAVRNQWLRGKV FT ERVVCGRDGRTRQAWVRTKNGVLRRPVVKLALLDVMDERKPVQGNALRAGE FT " XX SQ Sequence 5638 BP; 1553 A; 1258 C; 1456 G; 1369 T; 2 other; aactttagaa tttttcacat ccgttaacct cgaaaccatg gcggaaggtg ggaatgattc 60 tcaggcctat aggagctgtg ccgcttgtga tcgaccggat tcattcgatg acatggttgc 120 gtgcgattcg tgccagatat ggtgtcacta ttcgtgcgca aaagtgggtg ccgatgtcaa 180 gaatcgggag tggtactgct cgacctgtga acccctgatg agagccaaaa ctgtacagat 240 tcccaagcaa tccaaggaaa tgtggatcac ccagtccaag acaggtgacc gagcgtcgaa 300 tgagaattta aaactcacgg ttccagagaa ggatgctaaa aaatctggta agtcagtggc 360 gggaagcaag acgtcaaaga aaacgaaaaa gacgattggt gataaaagcc gtagttcccg 420 atgtgaacgt accacattct cgaagaaatg acgaactacc cctcgataat acaagcaagc 480 cccgggtcaa aaacgcacct acggaatctg gaatagttcc ggatttgagg catcagttgg 540 cgagtatgga ttttaataga gccgcagaaa atgctgttcc gaatgaagag cccagttccc 600 atgcttcaaa tttccgtaca catcacgaat ctcgctctca ggatccacct agaaatcgtc 660 ctgttgaaaa ccttctggaa actgaactga acgattctca tttgcatacc aggggtggtc 720 gtcaatatgt tgaggagtat gacggaccga cgaaccggca gttggccgca cgccaagtaa 780 tgggtaaaga ccttccacca ttttcgggta gaccagagga gtggccatta tggttcagca 840 attttcagcg ttcgactaca acttgcggct tctcagatga tgagaacctc atcaggctgc 900 agcgctgctt gaaaggcgaa gcattggaaa cagtgcgagg taaattgctt tgccccagta 960 gtgtacctca tgtcatcagg actttggaaa tgcggtatgg acgtcccagc acactgatac 1020 gtgtgatgac agagcgtatt cgtcagctgc catcccctag aatgaatgat ctgaacagta 1080 tcattgaatt cggattagca gtcaatagcc tggtagagca tctacagaat tccgcacagg 1140 tstcccattt gaacaatccg tccctgctcc acgacctagt taccaagcta cccgtggatt 1200 atcgtctaaa gtgggcggcg tacaagagtt cactgcgtga accaaatctc actgcatttg 1260 gaaactttat gtccactatg gtggaattag catacgaggt ggcggatgat gttccttccg 1320 aaaaatcgaa gcccaaggag cgtgcttttc tacatgcgca ttctgaatca ccatcaagca 1380 aataattttc gcccaacgag cacgtttcga agaaatgttg cgtagtttgc catgaagaag 1440 gacacaaggt cggaggctgt cccacattca agcgaatgga catggacgaa aggtacaaag 1500 tggtgcaaca gaacggatta tgtagaacgt gtttaaatca gcatgggcga tggccttgaa 1560 aacgtggaat gggtgtgagt tcggagactg taggctgaga caccataccc tgctacattc 1620 catcagcggt tcgtcaaacg tggccatttt gaataatcat atgggagcac ttcaatcgat 1680 caatggtcca ctcttcagaa ttatgccagt tactctgtat ggaaacggat gccagattaa 1740 cgtgttcgcg ttcatagacg aagggtctca gctgactcta ctagaggatg tcgttgccaa 1800 gcagctagga atcgatggtc ccacggaacc cctgggccta caatggacag gaaacatcaa 1860 gaggaacgaa gcaaaatccc agcgaatttc tatggaagtt tcgggaacag gatcaaccaa 1920 acgcttcgaa ctgtcagatg ctcgaactgt aggtggactt ttactgccat cgcagtctat 1980 gcggtatgag gaactagcga agaggtatcc acatttaaga ggtctaccgc tgcaagatta 2040 tgcgaacgta tcaccaaaga tcctcatcgg tcttgacaat ttgaaactca cagtaccact 2100 aaaagtcaga gaaggtggat ggaaagaccc aatcgcagcc agaagccgtc ttgggtggag 2160 tatctacggt tgttcgcagg caccggcktc gtcagtaatt tgtgggtttc attttggagg 2220 atggaccgat ccggatcgag aactgaatga tctcgtgaga aacttcttta gcattgagga 2280 tgcgggcgtc acaagccccc agaagatctt ggaatccgag gaagataaaa gagccaggat 2340 gttattggaa acaaccacac gtagggttgc aacggggttc gagacgggtc tactttggaa 2400 gacggatgat gtgcattttc ctgaaagttt tggtatggct gctcgacgac ttcgtgcact 2460 cgaaagtaag ctaagcaagg atcctgcaat gtacgagaac gttcggcttc aaatacacca 2520 atatctcgaa aagggttacg cccatatagc ttccgaacaa gaattggccc aggtaccgcc 2580 ggatcaaacg tggtatctgc cattgggtgt ccataggaac ccaaagaagc cggaaaagat 2640 tcgtctgact tgggatggca gagcatcggt ccaaggagtt tccttcaact ctgcgcttct 2700 caaaggaccg gatatgcttt cgtcgcttcc aagtgtactt tgccattttc gtcaataccg 2760 ctacgcacta acaggtgaca tcaaagagat gttccaccgc atacgaatcc gtgaagaaga 2820 tcgtcagttt caacgatttt tgtggagaaa cgctccccac gaccctccca agatttacgt 2880 catggacgta gcgacatttg ggtcgaccag ctctccgtgc tcggcccagt acgtaaaaaa 2940 tttgaatgct agtgagttca cggaaaagtg tccacgtgct gtgtacgcaa tccaccatta 3000 ccattatgtg gatgactatt tggatagttt tagttcacgc gacgaagcga tcacagttgg 3060 caggcaggtt aagatgatac acagcgaagg tggtttcgag ctcagaaatt tcctgtcgaa 3120 tgatgcgcaa atatctgctc gggttggaga ggaatcaaca gagaacgaca aatcgctgtc 3180 tttggatgga gctgagtctg tcctcggcat gagatgggtt cctatctgtg actgctttac 3240 atacgacctc acaatgcgca gcgacttagc ggacatcatg aaggacggtc gtatcccaac 3300 aaagcgagag gttctgagag tggtcatgag ccttttcgat ccactaggat taatagcctt 3360 ctacgttgtg cacggaaaaa tcctcatgca ggaaatttgg gtttctggtg tacactggga 3420 tgacccaatc agtcaacagc tgtacgatag atggattcaa tggtgcagac ttttgccaca 3480 gttaagctcc attcgaatac cgcgctacta cttcccaaaa gcagaatcgg aggtgtatcg 3540 atcactgcaa gtacacgttt tcgttgatgc tagtgagtcg gcatattcca gcgtggcata 3600 cttcagagtg gaatctcagc acggtccact agttgcactt gtgggtgcaa agaccaaagt 3660 ggcaccgcta aaaatgttga caataccgcg gttggaactt caagccgcag tattaggaac 3720 tcgtctgcta aataacgtta tttcgatgca cggtttgcca gttgtacaga gatttctttg 3780 gaccgactca gcgacggttc tagcttggct gcattcagag cagcgtcgat atcatcagta 3840 tgtgggattt cgaattgggg agatactgtc gaccacagaa gtaggagagt ggagatggat 3900 tccctcgaaa ctgaatgtag cggatgttgc caccaaatgg ggttctggac cacaaatcga 3960 ttcgaccaat cgatggttta ccggacccga atttcttcaa ctgcctgaaa ctcattggcc 4020 gcaaaagatc aaactacaac tttctacgga tgaagaacta cgttcatgta atgtacacct 4080 tcctgcaatg attcagttcg tagatgtagc tcgcttcagc aaatgggaaa ggcttctccg 4140 ttctgtggcg tatgttcatc gtttcgtgaa taacattcga cgatctcaaa gaggagaatt 4200 actggaggtt ggtgctttga tacacgacga gctgaagagt gcagaagaat ctttgtggcg 4260 acaagcgcag cgggaattct acggtgcgga aattgagcta cttcaaagga ctagaggacg 4320 cctagacgct tgtcatgttg gtgttccgaa atcgagcaac atctataaac tatcgccgtt 4380 catagacgca gacggtatac tacgcaagcg aagtcgtctc ggtgcagctt tttggttacc 4440 atatgatgct aaaaacccta taattcttcc tcgtgagcac aggataagcc ttctgctaac 4500 tgattttttc caccgacggt tccgccatgc taatcgcgaa acaatcgtca atgagatgag 4560 acaacggtat gaaatttcga aactacgttc acttatcgca agagtatccc gcgattgtat 4620 ggaatgtcgc gttcatcaca cagttcctcg ttctccacca atggctccat tacccaaagt 4680 gaggcttaca ccatatgtac gacccttcac atttgtgggg atcgattatt tcggaccagt 4740 attggtgaaa gtcggtcgaa gtaattcaaa gcgatggata gcgttattca cctgtttgac 4800 catccgggcc gtacatttag aggtggtgca caacttgacg acagagtctt gtgtaatggc 4860 agtgcgacga tttgtctccc ggagaggctc tccggcagaa atatattccg ataacgggac 4920 aaacttccat ggggcagata accagttgaa acgtgaaatc gaggaacgca attaccgtat 4980 ggcaacggtt tttaccaatt ctacaacacg ttggcggttc aacccacccg gggctccaca 5040 tatgggtgga gtctgggaac ggatggtccg ttctgtgaaa gcagcgattg ggactattct 5100 ggagactcag cgccgaccag atgacgaagt attagaaaca gtcatgattg atgctgaggc 5160 catgatcaat tgccgtccgt tgacgtacat cccacttgaa tacgctgagc aggaggctct 5220 tacaccaaac cacttcctgc ttggaagttc aagtggaatt aaacagttac cagtcgaacc 5280 agtggacttc cgctcgacac taagaagtgg ctggaaactt gcccagcatt tgacagatgg 5340 tttctggagg cgatggctga aagagtatct cccggtgatt tcacggcgtt caaagtggtt 5400 cgacgaagta agggatttgg aagtcggaga tttggtcttg gtggtggaag aagctgtgag 5460 aaatcaatgg ctgagaggaa aggtggaacg tgtggtatgc ggccgagatg ggcggacacg 5520 tcaagcgtgg gtacggacaa agaacggagt gttgcgtagg ccggttgtga agctggcgtt 5580 actcgacgtc atggatgaac gtaaaccggt ccaagggaat gctttacggg cgggggaa 5638 // ID DIRS-4_DPu repbase; DNA; INV; 4465 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE DIRS retrotransposon from Daphnia. XX KW DIRS; LTR Retrotransposon; Transposable Element; nonautonomous; KW DIRS-4_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4465 RA Jurka J.; RT "DIRS retrotransposons from Daphnia."; RL Direct Submission to RU (08-JUN-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC The ORF is truncated by mutations. XX FH Key Location/Qualifiers FT CDS 72..3230 FT /product="DIRS-4_DPu_1p" FT /translation="MHSGRARSPTSSEPFQLRPNLVMGDIPSSPSSSESTK FT RRRALRSMSGDTLTVDLGENSVNTPELPPGIPNSTDSHGAKKNKKKRRRSR FT DSSPSDSGSGSSSSSRSPDRKKSRKDGTDSGGFDEPSNVAMLPRSRVRDLR FT RWMKEGIESKTEGNTLRSAYVPKFEGDFELMAPLLDPSMARKWLRNLGESS FT DRAKLKDFWEKQLLSLQREVKDAFQPLAYMLGVMPENSDAEMPVQTAIRLL FT GHVFSRITKMRRSNAMRHCAPKFSTMLIDNRLFSSRDYRNLFGNKFIDALE FT KEAIADAKMDQIGRYGGPSNSRGGNTHFRRGGSNSGSGANFKSNGNWNFNK FT PHSSNRNKSGGQQFTGASNNNKYVSNFLPPVSSVDSSIVGGRLSLFSDAWH FT AFTDDPWILNIIEYGYSIEFTHPPVQKLIPPEVIMDKEKTSICNSEVESLK FT KKGAIVVAPSPVDSFISNQFVVPKKATGKFRPIFNLKALNQFIRYEHFKME FT CLDNVKYLICRNDWLVKLDLQDAYFVVPVASQHRKFLRFIWDGVVYEYVCL FT PFGLSSAPRIFTKLMKPIIAHLRKLGIRLLIYLDDLLILGSSPNGVLQHLR FT SVVNLLASLGFLINWEKSVVTPSQSIEFLGLEIDSRMLSFSLPKSKVDNIV FT SLCQSILQKDRVKLRELAAVLGNFSWSITSVPYAPSHYRSLQRFYIQAVGD FT FSNLNKLVTLTQEARIDIQWWVHNLKLSNGKSIIPPCPDLSIYSDASLKGW FT GSVCDGVTARGPWPLADQSRHINELELLGSLYALQIFTQNSSDISVSLLLD FT NSTAVAYVNNCGGTKSRSLSAISSQMTSWCETRNISLSASHLPGMFNSIAD FT RESRSTLDPSDWMLFTGAFKRLQDVWPMEVDLFSAAWNKQLPKFVSWLPQP FT NAIAVNAFSLNWSRLNCYAFPPFAMIPRCLTKIKKEKAVVILVCPLWPSQP FT WYPLLLEMATDIPRVFSSHPILIHSNSLEPHPLIQSKKFLLAAWKLSGDAL FT KSEAFRHQLLDYCWPAPVDLRYLPTSQPGTVGTVGAWQGTKIPCQTI" FT CDS 2867..4024 FT /product="DIRS-4_DPu_2p" FT /note="tyrosine recombinase." FT /translation="MSDKDKEGESCCHFGLPALAVSAVVPSPVGNGNGHSE FT SVQLTPDPHSFELPRATPTHSIEKIPPSRMEIIRGRLEKRGLPPSVVRLLL FT ASSRGSTLSTYQSAWNGWYRWCMAGNQDPLSNDLNTILQYLTHLFDSGLAS FT QTINLHRSMLSMTLEPVNGINIGEHPLVVQLLKGCYNSLPPRPRYNNMWNP FT DDVLKFISSLPDNSDLSLSVISYKLVTLTALSSLLRVSEIASISRTSIHCS FT ASAATFSLSRPRKTQHDGPLQSISXPRLSGRICPVDCLENYLGRSKSLCPS FT SDSLFISLKKPYRAVGSSTIARWIKKCLSDAGIDDSFSAHSTRGSGASKAA FT KIGIPIEQILKAASWSNESTFNRFYNRPLTSASVASSILSQTD" XX SQ Sequence 4465 BP; 1120 A; 1115 C; 921 G; 1307 T; 2 other; gaccttaacg gtgtagaact gatttattag ctcctccaaa gacaaaaaat aatctggcaa 60 cgcccgaccg tatgcacagt gggcgtgcca gatcccccac atcctcagaa ccttttcagc 120 tccgtccaaa cttagtcatg ggtgatattc cgtcctcgcc ttcgtcaagt gaatctacta 180 aacgtcgtcg tgccttacgt agcatgtctg gtgacacgct tactgtagat ttaggtgaaa 240 atagtgttaa tacgcctgag ttaccacctg gtattccgaa ctccacagat tcgcatggtg 300 cgaagaaaaa caagaagaag cgtcgtcgtt ctcgcgatag ctccccttct gattcaggct 360 caggatcatc gtcttcttct agatctcccg atcgtaaaaa atcccgcaag gatggcacgg 420 attcaggcgg attcgacgag ccatccaacg tcgccatgct gcctcgatca cgtgttcgtg 480 acctacgtcg atggatgaaa gaaggtattg aatccaaaac cgagggaaac accctccgat 540 ctgcttatgt tcccaaattt gagggtgact ttgaacttat ggctccttta ttggacccat 600 caatggctcg caaatggctg cgcaatcttg gtgaatcgtc tgatcgagct aagctgaaag 660 acttttggga aaaacagctc ttatcattgc agagagaagt gaaagatgcc tttcagcctc 720 ttgcctacat gttgggcgtc atgcccgaaa attccgatgc tgagatgccc gttcaaactg 780 ctatacgtct gctgggccac gtgttttctc gcatcaccaa gatgagacgc tccaacgcca 840 tgcgccactg tgcgcccaaa ttctctacga tgttgatcga taatcgtctg ttctcatctc 900 gtgattaccg taatttgttt ggcaacaaat ttattgacgc tttggagaaa gaagcgattg 960 ctgatgccaa aatggatcag atcggacgct acggtggtcc ttccaacagc cgtggaggca 1020 atactcattt ccgcagaggc ggatccaact ctggttctgg cgccaatttc aaatccaacg 1080 gtaactggaa cttcaacaaa ccacactcgt ctaaccgcaa caagtctggc ggacaacaat 1140 ttaccggcgc ttctaataat aacaagtatg tatcaaattt tctcccccct gtctcttctg 1200 tcgatagctc gattgtaggc ggtcgcctct ctcttttctc tgatgcctgg catgctttca 1260 ctgatgatcc ttggatctta aacattattg aatatgggta ttctattgaa tttactcatc 1320 cccccgtcca gaagcttatt ccgccagaag taattatgga caaggagaaa acctctattt 1380 gcaactcgga agtcgagtcc cttaaaaaga aaggagccat cgtcgtggcc ccctctcccg 1440 ttgatagttt tatcagtaat caatttgtcg tccccaaaaa agcaaccggg aagtttcgcc 1500 ccatttttaa tctcaaagcg cttaaccaat ttattcgcta tgagcacttc aaaatggaat 1560 gcttagataa tgtaaaatat ttgatttgca gaaacgattg gcttgtcaaa ctcgacttac 1620 aagatgctta ttttgtagtc cctgtggcca gtcaacatcg aaaatttttg cgtttcatat 1680 gggatggcgt cgtatacgaa tacgtatgct tgccgttcgg tttgagtagc gccccgcgga 1740 tatttactaa gctcatgaaa ccaataatcg ctcatttgcg taaacttggt attcgtctgc 1800 taatttatct tgatgatctg ttaatcttgg gcagttcccc taatggcgtt ctacaacatt 1860 tgcgctcagt agtgaattta cttgcgtccc ttggtttcct tataaattgg gaaaaatcgg 1920 tcgtaacccc gtctcagtcc atcgaatttc taggtctcga aatcgactcg cgcatgttgt 1980 cattttctct tcccaaaagt aaagttgata acattgtctc cctctgccaa tctattttgc 2040 aaaaagatcg agtaaagctt cgcgagctcg ccgcagtact cggaaatttc tcttggtcga 2100 taacatcagt tccttacgcc ccaagtcact accggtcctt acagagattt tatattcagg 2160 ctgtcggaga ttttagtaat ttgaacaaac tcgttaccct cacgcaagaa gcgcgaattg 2220 atattcaatg gtgggttcat aatcttaaac tttcaaatgg caaatcaata attccaccat 2280 gtcccgactt atcgatttat tctgatgcct cattgaaggg atggggatcg gtttgcgatg 2340 gcgttactgc aagaggcccc tggcctcttg cagaccaatc acgtcatatt aacgagctcg 2400 agttactagg ctccctttac gccctgcaaa tttttaccca aaattcatcc gatatatctg 2460 tttctctcct tttggacaat tctacagctg tagcttacgt aaataattgc ggtggtacca 2520 agtcccgttc cctctccgcc atctcttctc aaatgacttc ctggtgtgaa acacgcaaca 2580 tctctttgtc ggcttcccat ttacctggaa tgtttaattc cattgcagac cgggagtcac 2640 ggtcgacatt ggacccaagc gattggatgc tattcacagg ggcattcaaa cgtctccagg 2700 atgtgtggcc catggaggtg gatctttttt cggcggcgtg gaacaaacaa ttgccgaagt 2760 ttgtttcctg gctaccccag ccgaacgcga tagccgtgaa cgccttctcc ctcaattggt 2820 cacgcctgaa ttgttacgcg ttccccccat tcgcaatgat tccgagatgt ctgacaaaga 2880 taaagaagga gaaagctgtt gtcattttgg tttgcccgct ttggccgtct cagccgtggt 2940 accctctcct gttggaaatg gcaacggaca ttccgagagt gttcagctca cacccgatcc 3000 tcattcattc gaactccctc gagccacacc cactcattca atcgaaaaaa ttcctcctag 3060 ccgcatggaa attatcaggg gacgccttga aaagcgaggc cttccgccat cagttgttag 3120 attattgctg gccagctccc gtggatctac gttatctacc taccagtcag cctggaacgg 3180 ttggtaccgt tggtgcatgg cagggaacca agatcccttg tcaaacgatc taaatactat 3240 cttgcagtat ctcactcact tgtttgattc cggtcttgcg tcgcaaacga taaatttaca 3300 tcgatcnatg ttatcaatga ccttggagcc cgtaaacggc ataaatatcg gggaacaccc 3360 gttggtggtc cagcttttaa aaggatgcta caattcatta ccccctcgcc ctcgatataa 3420 caacatgtgg aatccagatg acgttttaaa attcatatcg tcgttacctg ataattctga 3480 tctctccctt tcagtcatct cttataaact cgttaccctg acggctctgt catctcttct 3540 tcgagtgtcc gaaatcgcgt ctatttcgag aacctcaatt cattgctccg cgtctgcagc 3600 gaccttttct ctttctcgcc cgcgcaaaac ccaacatgat gggccgttac aatctatttc 3660 tttkccccgc ctctccggtc gtatttgtcc agttgattgc cttgaaaatt atctaggacg 3720 ctccaaatca ttgtgtccat catcagattc tctgtttatc tcgcttaaga aaccttatcg 3780 agcggtgggc tcttccacta tcgccagatg gataaagaag tgtctgtcag acgctggaat 3840 agatgattca ttttctgcgc attccacaag aggctctggt gcatctaaag cagccaaaat 3900 tggcattccc attgaacaaa ttttgaaagc cgctagttgg agtaatgagt caacatttaa 3960 tcgattttac aatcgccctt tgacttccgc gtctgttgcg agttcaattc tctcacaaac 4020 agattgacat catctttaaa atcaccgtta aggtcgaagt ggctgaatga taataattgt 4080 ggattactcg agggtcgcgt agcgacccga agagtaatct agattattag aaatgaaaga 4140 acgagacctt aacggtctcg ttattccctc cctcacctcc ctgtttaatt atcatgttct 4200 tttttctttt acagaggtaa acgcttcgtt aactccaaga acggctacgg caattaacgt 4260 atccttggca agattttgtt cgcgttcctc atcaacacag aatttctcat ttttggaagt 4320 caatcatgtt tgtgtttatt tgtgtctgtg aaaagttctg aggatgtggg ggatctggca 4380 cgcccactgt gcatacggtc gggcgttgcc agattatttt ttgtctttgg aggagctaat 4440 aaatcagttc tacaccgtta aggtc 4465 // ID Gypsy-37_DPu-I repbase; DNA; INV; 5913 BP. XX AC scaffold_112; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_DP_; KW Gypsy-37_DPu-LTR; Gypsy-37_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5913 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_112; Positions 285590 291502. XX CC 'CCAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3337..5262 FT /product="Gypsy-37_DPu-I_1p" FT /translation="MRIPVTFNNLKTQALVDTGAAASFLAHRLLVCIPYNY FT VKEVKVSDPNTQLFRTVSGELVKPIGRYELCIKLARRHTFNHPFYVLSSLD FT EGCILGYDFLATHGIIINPSERSISYTHDNELRNLIIPPLPICSISITTPP FT QFDLLGVPEEQRGNLKNLLMSYFSLFTENINELGKAKSIKHSIHTTQLPDI FT MPMRRTPERLRLVVKKQIEDMLRNNIIRPSTSPFACPILLVAKKEEGQMRF FT CVDYRPLNNVTVKDRYPIPNIQLIIDSLHGAKYFSTLDLLSGYWQIEIEEK FT HKYKTAFICEYGLYEFNCMPFGLCNAPSTFQREMNTLFKDVLYEFVLVYLD FT DIIVFSKTMDEHTRHLRIVFDLLTQENLKLKLSKCDFFKTKIKYLGHIITA FT GGFHPDAGKTRSILNYPEPLNVKQLSSFLGLANYYRKFVREYAEIAHFLTE FT LTKRDAFQRIKNCLTSNPLLRYTDFTREFLVYADASGYGIGAVLAQIQYIP FT PSETDPEAIHGEHEVVIAYTSRHLDERERIWNISEKECLAILHALEHFRPY FT LYGRTFRVFTDHKPFETLMKKKDPHGRLARWAYEMQAYNMTLIHRPGQENQ FT NADGLSRQPLPVIGAILKPEIIQNDWITAQRNDEYCKKINGIVS" FT CDS join(92..1270,1274..2764) FT /product="Gypsy-37_DPu-I_2p" FT /translation="MTTPTLPSKDTSYAALNESLIGEIDLEEDSYIRSLPQ FT NILPIRERFLSQWNELIKLTKDVADRTSTRRVTRSDFLAKQKDIKSQVYNS FT LKQYTYEFYKKELGGPQYITGEAKEKVVDDIATYLVQLKGDSLVYNPLIQN FT IRTAHSKEVEFRNKAFKVFKRELIFKQLSLENYANSLEAQLETQKAKSERE FT VDNVRRDLSKKLTDCLVDNTRLTAEVENLQLSNDKQAARITELNSKVEHHQ FT ASGESKSFFNEYEAVKKDLETAEDNLSKRDEYIKNLKDTLAQLENDSKDKV FT KQLVDGYNALSIKHDSLESSCKTADSKVIELKNLNSQLNAQLNQIINDNLA FT NLTDLSDQNVQLTTQLTKLQQDVATKELSIDSLRQTNKGLNLDIAALQQQA FT QQTAPITQPPNMTPTPPTPPTAPIQNPLNQQQAPRAPSPDPNNVPLTKSHL FT RELYSQDERKSIPVYKGKRGDQLINNWLKDAERVAQSAGWSAKDKIKYFSD FT RLRGDAADWHSDYIDHAADKEDYDAWEKALISRFLTETEIENLKKQLNELK FT QMPDQSTQTYVSRLNHLYDIIHGKEIVLDETVAPPEAVVLARSLRKIRGEA FT KQKILLKGLLPKIVQVVWTRIDVNSSYEDVCEIVYAAETIVNRMEQNEDKS FT LKAAIAGISAHEDEQDVELQRQKNKLSKLEKQLAELNIDPTAQQETPAVVL FT PTVAITEAYNRHRSPSGDRRRSHSEGRIRFQQPSQQRQNTDSSYSRDRGRN FT NTYFHSRSPSGNRYNPNSDVRQRREQTPPRQNYSGSQTSRTQFPRPNNDSY FT RRRFSDNNSFQRNRNFQDQRSTPANYRPNNSNFRSAPYNGARPRTNGGYEQ FT QREVRNRRDVECYACGKRGHYARECRTNPPGAPQQQQ" XX SQ Sequence 5913 BP; 2069 A; 1206 C; 1147 G; 1491 T; 0 other; aattggtgaa cgtgccagtt tggttttcga acgcctgctc cttcaacatc tacaagcacc 60 gagaggagag aaacaccaaa gcggtatgtg aatgacgact cccactttgc cttctaaaga 120 tacatcttat gctgctttga acgagtccct gataggggag atagacttag aagaagacag 180 ttacatcaga tcccttccac aaaatatttt accaattaga gaaagatttt tatcgcagtg 240 gaacgagtta attaaactta caaaagacgt cgcagatagg acatctaccc gacgagtgac 300 aagaagcgac tttcttgcga aacagaaaga catcaaaagt caagtctaca attcattaaa 360 gcaatatact tacgaatttt ataaaaaaga attaggggga ccccaatata taaccggtga 420 ggcaaaagaa aaagtagttg acgatatagc aacatatttg gtacagttaa agggagattc 480 cttagtatat aatccattga ttcaaaatat acgaacagca cacagtaaag aagtcgaatt 540 cagaaataaa gcttttaaag tcttcaaacg tgaattaatt tttaagcaat tgtctctaga 600 aaattatgcg aatagccttg aagcacaact agaaacccaa aaagcgaaaa gtgagagaga 660 agtcgataac gtaagaagag atttgtctaa gaaattgaca gattgtcttg tcgataatac 720 tcgattaaca gcggaagtag aaaatctaca actttccaat gataaacaag cagctcggat 780 tacagaatta aatagtaaag tagaacacca tcaagcgagt ggagaaagta aatcattttt 840 taacgaatac gaagcagtaa agaaggattt agaaacagcg gaagacaatt tatcaaagag 900 ggacgagtat attaaaaatc tgaaagatac tttagcccag ctagaaaacg attcaaagga 960 taaagtaaaa caactcgttg atggctacaa cgcgttaagt attaaacacg actccctaga 1020 aagctcctgt aaaacagctg acagcaaagt tatagaatta aaaaacctaa attctcaatt 1080 gaacgcacag ttgaatcaga ttataaacga taacttagct aatttgacgg acctaagcga 1140 ccagaacgta caattaacta cccaattaac gaaattacaa caagacgtcg ccactaaaga 1200 attatcaata gattctttaa gacaaacgaa caaagggtta aacttagata tagctgcgtt 1260 acagcaacag tgagcacaac aaacagcacc aatcactcaa cccccaaaca tgactccgac 1320 tcctccaaca ccaccgaccg ctcctataca aaatccttta aaccaacagc aagcccctag 1380 agcaccttcc cccgatccaa acaatgttcc attgactaaa agtcatctcc gtgaactata 1440 ctcacaggat gaacgaaaat caattcccgt atacaaaggg aaaagaggag atcagctcat 1500 taataattgg ctaaaagacg ccgaaagagt agcgcaaagc gcaggatgga gcgccaagga 1560 taaaattaaa tatttttcag atagattaag aggtgatgcc gcggattggc acagtgacta 1620 catagatcac gctgcggata aagaagatta cgatgcgtgg gagaaagctc tcattagcag 1680 atttttaaca gaaacagaga tcgagaatct taagaaacaa ttaaatgaac taaaacagat 1740 gcctgatcag agtacccaaa catatgtttc aagacttaat catttgtatg acattattca 1800 tggaaaagaa atcgttcttg atgaaacagt tgcaccccct gaagccgtag ttctggctag 1860 atcccttaga aaaattagag gagaagcaaa acaaaagata ttgctaaaag gcttgttacc 1920 aaaaatagta caagtcgtgt ggaccagaat tgacgtaaat agctcgtatg aagatgtttg 1980 cgagatagtt tatgcagcag aaactatagt aaatagaatg gagcagaatg aggacaaaag 2040 ccttaaagcg gcgatagcag gtatttcagc tcacgaagat gagcaggacg tagagcttca 2100 gagacagaag aataaacttt cgaaacttga aaagcaattg gcagaactaa atatcgatcc 2160 caccgctcaa caggaaacac cggcagttgt cctgcctaca gtggccatta ctgaggccta 2220 taaccgacat cggtcacctt ccggtgatcg tagacgctca cattccgaag gtcgtatacg 2280 tttccagcaa ccctcccagc agcgacagaa tacggatagt agttactcgc gggacagagg 2340 taggaataac acatattttc attcacgaag cccaagcgga aataggtaca atccaaattc 2400 tgatgtgcga cagcgtagag aacagacccc accacgtcaa aattattccg gtagccaaac 2460 gagtagaaca caattcccac gtccaaataa cgattcgtat agaagacgat tttcagataa 2520 caatagtttc caacgtaata gaaattttca ggaccaacgt agtacaccag caaattaccg 2580 cccaaataat tcgaattttc gttcagcccc ttacaacggc gctagaccaa gaacgaacgg 2640 aggatacgag caacaaagag aggttagaaa tagacgtgac gtcgaatgtt atgcatgtgg 2700 taagcgtgga cattacgcta gagaatgtcg cacaaatcca ccaggagcac cccagcagca 2760 acaataggtt tactagccac ccccaatgta cagtatcctc caaaaaaatc cgaatggatt 2820 ttctttcaaa aagcccccta tccatattgc ggggaactgc gttgggcttt taatccaaat 2880 tgagggggta aggaacaaag cgaaaggtgg gaagaaagaa ccctttcccc caaaagggaa 2940 tagccttagg cccactgtcc attttttgaa caagtgacga gaaaccccaa aagcttccaa 3000 atcttccatc acaaaccggg gaaaactgga aaaactcaaa ggaaccgcct taagtcggtt 3060 caatcagttt ttcgcgccat gaaaattcca gacagtctgt ctgatttcat tcatatttag 3120 gggggaaggg agggggaaga catggccagt ggctaattca ttcttttcct ttcttcccac 3180 ctttcgcttt ctttctagcc cccccaaatt tgggctaaaa gaccaacaca gttccctgca 3240 atatggatag ggggcttttt gacaataaat cgcattcgga tttttttgga ggatactgta 3300 catacattac cccttatagc tagtgtagct caattaatga gaataccagt aacatttaat 3360 aacctaaaga cacaagcatt agtagatacc ggtgcagctg caagtttctt agcacaccgt 3420 ttgttagtat gtattccata taactatgtt aaagaagtaa aagtttctga ccctaacacg 3480 caactgtttc ggacagtttc aggagaactc gtaaaaccaa ttggccgtta tgaattatgt 3540 attaaattag ctcgtagaca cacatttaat catccatttt acgtactatc aagtttagat 3600 gaaggttgca tcttaggtta cgatttccta gcaacacatg gaattatcat taatcccagt 3660 gagcgcagta taagttacac acatgacaac gaattaagga acttgatcat accgcctttg 3720 cctatatgtt cgattagtat aacaactcca ccacaattcg acttattagg agtaccagag 3780 gaacaaagag gaaatttaaa gaacctttta atgtcgtatt tttcactatt taccgagaat 3840 attaatgagt tgggtaaagc gaaatcgatt aaacattcaa ttcataccac gcagttacca 3900 gatattatgc ccatgaggcg caccccagag agattgagat tagtagtgaa gaaacagatt 3960 gaggatatgt taagaaataa tatcattaga ccaagtacca gcccatttgc ctgtcctatc 4020 ttgctagtag cgaagaagga agaaggtcaa atgaggtttt gtgtcgacta tcgcccgcta 4080 aacaatgtaa cagttaaaga cagatatcca attcccaata ttcagttgat tatcgatagt 4140 cttcacggcg cgaaatattt ttcaactttg gatttgctca gtggatattg gcaaatcgaa 4200 atagaggaaa agcataaata caaaactgct ttcatatgcg agtatggact ttacgagttc 4260 aactgcatgc cttttggttt gtgcaatgct ccgagtactt tccagaggga aatgaacacc 4320 ttatttaaag atgtcttgta cgaatttgta cttgtttatt tggacgacat catcgttttt 4380 agtaaaacga tggatgaaca tactcgacac ttgaggatag tattcgacct gctaacacaa 4440 gaaaatctta agcttaagtt aagtaaatgt gacttcttta agacaaagat taagtaccta 4500 ggacatatta tcacagctgg tggatttcat cccgacgccg ggaaaacacg ttccatacta 4560 aactaccctg aaccgttgaa cgttaagcaa ttatcgtcct ttttaggttt ggctaattat 4620 tatcgcaaat tcgtacgcga gtacgcggaa atagctcatt tcttgacaga actaacgaaa 4680 agggacgcct ttcaacgtat caaaaattgc ctaaccagca atccacttct gaggtataca 4740 gattttaccc gggaatttct agtctacgcg gacgcctcag gatacgggat tggtgcagta 4800 ttagcgcaga tacagtatat accgccctca gaaacggatc cagaagccat acacggagaa 4860 cacgaagtag tgatagcata cacttcgcga catttagacg aacgagaacg catttggaat 4920 atttcagaga aagaatgttt agcaatttta catgcattag agcattttag accatatctt 4980 tatggtagaa cctttagggt atttacagat cataaaccgt ttgaaacgtt gatgaagaaa 5040 aaggacccac acggaagatt agcacgttgg gcgtacgaaa tgcaggcata taatatgact 5100 cttatacacc gtcctggcca ggaaaatcaa aatgcagatg gtttaagtag acaaccactc 5160 ccagtaatag gcgctatatt aaaaccagaa ataatacaaa acgactggat aacagcgcaa 5220 cgtaacgacg aatattgcaa aaaaattaat ggaattgttt cctagtactt acggtttgtg 5280 agtactaggt acagtgagta cagctcctct tacgagccac catcggccca gatggagagg 5340 atcccatcac cattccctcc gtgttatcat ccaccccaac acttccaaaa tatgtacgcc 5400 ccaccaatta tgttgcaaag tgaaagggag tactacgccc agcgggaggt actgatccag 5460 cagtacctgc gagacagctt ggatttatat tacaaatatg aattccaggc tctcaaattg 5520 aaaagggctc gggatgtact cacaaaccgt aagtactagg aaacaattcc attaattttt 5580 tttagtactt acggttcgtg aagcgttcca ttacattact tatccacatt ccattgaatg 5640 ccgcattgta gttcttatgt tattcctgta ctacgcattc cacatactta tatttagcta 5700 ttcacattat gcctatattc tattcgaaat attatgcaat attatgcaca tttataccaa 5760 ggtgaaggta ttatatttcc caatgtattc aacttaatct acatatatac acttaagtag 5820 aattagcagc attctaaaaa aaaattcttg catgtattga tacaaaagaa atagtttcct 5880 tccccgacca aacttctgcg gaagggggag gag 5913 // ID Gypsy-60_CQ-I repbase; DNA; INV; 4115 BP. XX AC AAWU01037505; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-60_CQ_; KW Gypsy-60_CQ-LTR; Gypsy-60_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4115 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 499-499 (2011). XX DR Genome; AAWU01037505; Positions 6296 2182. XX CC Positions [3134-3595] - Integrase core CC 'AAAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 124..2568 FT /product="Gypsy-60_CQ-I_1p" FT /translation="MNDVVNHSNLSAFDPGDSSTVSARWKKWKRSFQIFLE FT VNNVTIAQRKKSYLLHFVGQQVQDIFFNLQGEEEPAVPNGSDVYKEALKLL FT DNHFLPMKCLPLERHIFRNLEQGADESIEKFVLRLREQGNLCEYDEHLEDE FT IKEQIFEKGSSDDLRAKILTKPRMTLAETVEMGRSLETIAKHRKNNSVKPV FT EVNKVQKASGSSAKGECYRCGRPGHFANDNTCPAIERKCNRCGLKGHFETR FT CKTKTPRKKTRDDNRLRQVKEDPRLADSDDSDDVEYESEVSDDECVKYVFA FT AEPDCGEKVICEVGGVKIEWVVDSGAGVNVINRGTWEHLKKNKVRVKSQTT FT EVKKSLKAYGGHALNVAGVFSTDVATKQKVAEAQIYVVENGTCCLLGRKTA FT TELGILNINTAVWAVQGADERIGKIKGVVARIQVNPEVKPVQQTQCHVPIP FT LRERTEKEIQRLLSQDVIEEAPRDSPWISRLVVRPKVGEPSAVRLCVDMRD FT ANQAIVPQHYPLPTFDSIVPHLHNCKWFSKIDLNKAFHQVELAEDSREITT FT FAAHNGYFRYKRLMFGLSCASEVFQGIIERMLTGLPGVKAFIDDILVFAAT FT KKEHDEILRAVLDRLKSCGVTINSRKCEFGKNEVVFMGHRLSAEGISPTED FT KVETIKRCRDPQTSEELRSFLGLVNYLGKFIPDLATLTTPLRSLLRKQTRF FT TWGKEQKAAFSKIKAVLSHPKNLGFYSPLDKTIVIADASPTGLGAVLLQEK FT DNIKRVICYISKGLSDTEQNYAQNEKEALALVWSVERLEMYLRGLRFYLLT FT VLFIDIRYRVVHM" FT CDS 2729..4087 FT /product="Gypsy-60_CQ-I_2p" FT /translation="MLLAIVETARSEALTMGDIVNYTLKDEELAAVKKALI FT SGQWTDAIKRYVPFRNELMAVNEIVVRGERLVIPKNLRKKVLEIAHIGHPG FT IERTKQRLRAKVWWPNLDKDAETLVRSCLDCQIVGQPGPAEPLKIRELPQG FT PWSYLSMDMLGPLPSGESLLVVIDLYSRFRVVEVLRQTTTTDILKKLKPLF FT MRLGFPDTLLTDNATNFSSREMMEFCNLYGISLRHSTPYWPQANGETERQN FT RHILKTLRIAAHKGTDWKSDLDEANYVYSLTPHPATGRSPAELAFGRKFKD FT WIPQFGARDVVEDGEIRDRDHSYRTVAKEYYDKRHNVREDTVAEGDVVLMK FT NQTKQNKLSTPFMPGPGTVIKKEGNSVVVETPDGIRYRRNSSHVKKIPAHP FT TTHEEIEDDDAEPTWATPEPLAPAAVTTTQGEAAGRPQRTIRRPLRFDDYD FT LELGQGGH" XX SQ Sequence 4115 BP; 1151 A; 1021 C; 1176 G; 767 T; 0 other; ttggcgacga agatcgtact agtccgcgcg aaggcgtagt tcgaaccggc cggtcggaaa 60 ccggaagaaa atcgacggaa cgtaaacgaa tacgttgaaa ctgatttgaa tcgtgacggc 120 acgatgaacg acgtggtgaa ccactcgaac ctctccgcgt tcgatccggg agattcgtcg 180 acggtgtccg cgaggtggaa aaaatggaag cgatcgttcc aaattttcct cgaggttaac 240 aatgtaacca ttgcacaacg gaagaagtcg tatttgctcc actttgtggg ccagcaggtg 300 caggatatct ttttcaactt gcaaggggaa gaagaaccgg cagtcccaaa tggatccgac 360 gtctacaagg aggccctcaa attgttggac aaccactttc tccccatgaa atgccttccc 420 ctcgaacggc atatctttcg gaacctggag cagggagcag acgaaagcat cgagaagttt 480 gtactcaggc tgcgggagca gggaaatctc tgcgagtacg acgagcatct ggaggacgag 540 atcaaagagc agatctttga gaaggggagc tcggacgacc ttcgggctaa aattctcacc 600 aagcctcgga tgacactcgc cgaaaccgtc gagatgggcc gttccctcga aaccatcgcg 660 aagcacagga agaacaacag tgtgaaaccg gtggaagtga acaaagtgca aaaagcatcc 720 ggttcgtccg ccaagggaga gtgttaccgt tgcggtagac caggacattt tgccaacgac 780 aacacttgcc ctgcgatcga acggaagtgc aaccgctgtg gactgaaggg acacttcgag 840 acgcggtgca agacaaaaac gccgaggaag aagacccgtg acgacaaccg gttgcggcaa 900 gtgaaggaag atccgcggct agcggactcg gacgactctg atgatgtgga gtacgaaagt 960 gaagtgtcgg acgatgagtg cgttaaatac gtgtttgctg ccgaacctga ctgcggagag 1020 aaagtgattt gtgaagtggg cggagtgaag attgagtggg tggtggactc cggtgccgga 1080 gttaatgtga ttaaccgggg tacgtgggag catttgaaga aaaacaaagt gcgcgttaag 1140 tcccagacga ccgaggtgaa gaagtcactg aaagcgtacg gtggccacgc cctcaacgta 1200 gctggagtat tttcgaccga cgtggcaacc aagcagaagg tagcggaagc gcagatctac 1260 gttgtagaga acgggacgtg ctgcctgctc ggtcgtaaga cagcgacaga gttgggcatt 1320 ctgaacatca acacggctgt atgggccgtc caaggagctg acgaaagaat cggcaaaatc 1380 aaaggtgtcg tcgcgagaat ccaggtcaac cccgaggtga agccagtgca gcagacacag 1440 tgccacgtac caataccgtt gcgagagagg accgagaagg agatccaaag attgctgagc 1500 caggatgtaa tcgaggaagc tccacgggac tcgccgtgga tttcaaggtt ggtcgtcaga 1560 cccaaggtag gagaaccttc agcagtgcga ctatgcgtcg acatgcgaga cgcaaaccag 1620 gccattgtac cgcagcacta cccgctgccg acattcgaca gcatcgtgcc acacctccac 1680 aactgtaagt ggttctccaa gatcgatcta aacaaagcct tccaccaagt cgaactagcg 1740 gaggactccc gagagatcac gacgtttgct gctcacaatg gttatttccg gtacaagagg 1800 ttgatgtttg ggctgagctg tgcgtccgag gtatttcaag gcattatcga gcggatgttg 1860 acaggattgc ctggagtcaa agcgttcatc gacgacattc tcgttttcgc ggccacgaaa 1920 aaggaacacg acgaaatcct tcgagcagtc ctcgacagac tcaagtcctg tggcgtgacg 1980 atcaacagtc gcaagtgtga gttcgggaaa aacgaggttg tcttcatggg tcacagactg 2040 tcagcagaag gaatcagccc taccgaggac aaagtggaga ccatcaagcg ctgccgtgac 2100 ccgcaaacct cggaagaatt gcgcagcttt ctcgggctag tcaactacct gggaaagttt 2160 attcccgacc tggccaccct aactacaccg ctgcgttcgc ttctgcggaa gcaaacacgg 2220 ttcacgtggg gcaaggagca gaaggcagcg ttcagcaaga tcaaggccgt cctctcgcac 2280 ccaaagaacc tgggattcta ctcgccgctc gacaagacaa tcgtcatcgc cgacgccagt 2340 ccaactggcc tgggggcagt gttgctgcag gagaaggaca acatcaagag ggtcatctgc 2400 tacataagca agggactctc agacaccgag cagaattacg cgcaaaatga aaaggaagcc 2460 cttgccttgg tgtggtccgt ggagcggctg gagatgtatc tcagagggct gcggttctac 2520 ctgttaacag tactgtttat tgacataagg tatagggtgg ttcatatgta actgttacga 2580 tagagtggtt caaatacaac aggccctacg gttgcaatcc ttccggttca gaatcgtgca 2640 catcgctgga aaagcaaaca ttgccgaccc tctgtcacgg ctgcccgaat tccaggagtg 2700 cactacgtac gacgagtacg gagaatccat gctgttggcg atcgtggaaa cggcaaggtc 2760 ggaagcgctt acgatgggag acatcgtgaa ctatactctg aaggacgaag aattagcagc 2820 tgttaagaaa gcgttgattt ccggtcagtg gactgatgcg atcaagcgct acgtgccctt 2880 cagaaacgag ctgatggcag tgaacgagat agtagtcaga ggagagcggt tggtgatccc 2940 aaagaacctc cgaaagaaag tcttggagat cgcgcatatc ggacaccctg gtatcgaacg 3000 taccaaacag cgtctgcgag ctaaagtatg gtggccaaac ctggacaaag acgccgaaac 3060 tctagtacgt tcatgcctag actgccagat cgtcggacaa cctggaccag cggaacctct 3120 caaaatacgt gaactgccac aaggaccctg gagctacctc agcatggaca tgctgggacc 3180 cctgccatca ggtgaatcac tcttggttgt gatagatctg tacagccgtt tccgtgtcgt 3240 ggaggttctc cgacaaacaa caacgacgga tatcctgaag aagctcaaac cgctgttcat 3300 gaggctcggt ttcccggaca cgctgttgac ggacaacgca acgaacttct caagccgaga 3360 gatgatggag ttctgcaacc tgtacggcat ctcactgcgg cactcgactc cgtactggcc 3420 tcaagccaac ggagaaacgg aaaggcaaaa tcggcacatc ttgaaaactc tgcgaattgc 3480 tgctcacaaa ggaaccgact ggaagagtga tttggacgag gcaaactatg tctactctct 3540 aacaccacat ccagcaacag gccgctctcc ggctgaactt gctttcggga ggaagttcaa 3600 agactggatc ccgcagttcg gcgccaggga cgtggtcgag gatggagaaa tacgggacag 3660 ggatcactcg tatcgaacag tagccaaaga atactacgac aagcggcaca acgtgcgcga 3720 agacacagta gccgagggag acgtggtgct gatgaagaac cagacgaagc agaacaagtt 3780 atccacgcca ttcatgccgg gaccggggac agtgatcaag aaggaaggaa acagcgtggt 3840 cgttgaaaca cctgacggga tccggtacag acgaaattcg tctcacgtta aaaagattcc 3900 ggctcaccct acaacccacg aagaaatcga ggatgacgac gccgaaccaa cgtgggccac 3960 accggaaccg ttggcgcctg ctgcggtcac gacaacacaa ggagaagccg cgggcagacc 4020 acagcgaacc attcggcgac cgctgagatt cgatgattac gatctggagc ttggacaggg 4080 cggacactaa aaggacaacc tgagaaagga gagaa 4115 // ID Gypsy15-I_Dpse repbase; DNA; INV; 6155 BP. XX AC Unknown_group_825; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy15-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-6155 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1096-1096 (2009). XX DR EMBL/GenBank/DDBJ; Unknown_group_825; Positions 17445 23599. XX CC Positions [5196-5693] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1216..2841 FT /product="Gypsy15-I_Dpse_1p" FT /translation="MPVDRSPDYRIQPKESETVCCVCTQEVEYPGQLLATS FT CNHYFHHTCFSARSGNKKVCPVCRTALSKSTLNLAEKLEAAQASGHQPRAV FT SVTRSRSKTLKSQQNQLSAANMQSGGRSISQPEGPSVSGPMAAPDRPRGQP FT PNPAAAVVSASAARTGLPATDGSVQQTIADAVAAALAQHTQMLANAIEAGF FT QRLSLQNAAPVPTERVHPAHRQSSPVRSNQSFEQLFRLSARENAALPNSES FT ERPTNLSASPESQQSHQLRADRISQIIANWKLRFSGRSTMSIDDFLYRVEA FT LTSQTLDSNFELLSRYASNLFEGSAGEWYWRYHKSVPRVRWIDLCIALRSQ FT FSDGRTDLEIRTAITQRKQKLNEPFDSFYEAIVALADKLVQPMTEQSLLEV FT LRANLLNDIQHEILYVPIQNIVQLRQVVRKRERFIENTAKPIPATRRFVPR FT NQVNEMSIQTEGPAESDSEEADAFDIDAMTLSCWNCNKQGHRYQECEAERK FT VFCYGCGKRDTYKPSCRKCSSSKNELARAPLSSARKLVSKATNTE*" FT CDS 2880..6065 FT /product="Gypsy15-I_Dpse_2p" FT /translation="MITTTMTPLQDDDQETTPAPSKTRDRKRRKLFRTQSK FT NTRRAILSSVISNITDLRPYANVTLFGKTISGLMDTGASISCLGGKFAEEL FT ARTNPEVKPMKSAVRTADGKKQVILGRISTLICFRGNAKRICLYLVPSLSQ FT ELYLGIDFWRSFNLLPFFLMSKANNNSLAAINFDNPMQIPLSELQQKELVA FT IVQMFPSFAEKGLGKTNWLTHDIEISETRPIKQRHYAVSPAVEKCIYEELD FT RMLGLGVIEESDSPWSSPVVIVRKPGKVRLCLDSRKVNGVTVKNAYPMPLI FT DGILSRLPKAQFISSIDLKDAYWQIPLSPRAREKTAFAVPGRPLYQFTVMP FT FGLCNAPQTMSKLMDKVIPPSLRNEIFIYLDDLLVISESFEKHMSVLKALA FT DRLSQAGLTINVEKSKFCLKEVKYLGHVIGYGMIQTDPEKVTAIREFPVPR FT SVKQGRRFLGTTGWYHKFIKNYAGIAAPISDTLKKRRSFVWTNEAQRAFES FT LKDQMCGAPVLHSPDFTIPFAIHCDASHTGVGGVLMQTDSDGNDIPIAFMS FT RKLNQCQRNYSVTEKECLAAIMCVKKFQAYVEGHEFSIHTDHASLKWLMSQ FT SDLSSRLARWALKLQGFNFKIFHRKGSQNVVPDALSRVHTEDLSAFSCEGL FT VDLQSDCFKSAEYLELIRKIECNQTQMPDIKVIDNHIYRRAEHAAADQIAD FT DLCWKLWVPKELVSDVLKQSHEQSLAAHSGINKTLERVRRYYYWPSLVTDV FT RAFINSCSVCKCTKYPNRSLRPPLAPVKETQRVFQKLFVDFLGPYPRSKSG FT NIGIFVVLDHLSKYPFLKPVKIFTAEAITRFMEEDLFHCFGVPEVVVSDNG FT VQFKSHHFNSLLKKYNVQHFYTAVYAPQANASERVNRSVIAAIKAYIKPDQ FT TNWDEQLSSISCALRSSLHSALNDSPYRVAFGQHMITNGDTYQLLRNLQVL FT EDRTVSFGREDSFDVIRKTAQRTIQKQHQRNEQQYNLRSRDVAFKVGQEVY FT RRNFQQSNFIKGFSAKLAPTFVKARIKRRLGHSYYELEDPQGRLIGKYHAK FT DIKQ*" XX SQ Sequence 6155 BP; 1731 A; 1402 C; 1462 G; 1560 T; 0 other; gtaataatta ttttatttgg cgcccaacgt ggggccccga aaggcccgaa ggtcagggtt 60 gacgtaattg taaaccttac aaagcgaagt cacattaaca tatatggttg tgtgtgaccc 120 cagccaacct tgactccgaa gcagaaaaat ttttgagaaa gtccataaca cctgcgagca 180 gtgcatagtc atgcactggc ctaagacttt tgttccccgt ttcccacccc cttccttaaa 240 tcccttcctt ttgtcctggg attggatact gtgatggtgg atgctgatta gtttccgata 300 acgtttagcc cgcatcggaa taggatccat aacccctgtg cgatagtaag atgttctgga 360 accgataata gcttcaacat agctttattg ccgagcagta tatgccatag cagtgagatc 420 taaatggatt cactggtatg tggaagttgg ctggacacag ggttattcgt ggagtaaatc 480 tacgacctgc cgctatacag aagcgtatta gtgaacacat tctcccgtat cccttggaaa 540 ttgacaggta tcctgggtag ttttgtttca gttgtcgccg gagtcagcta gaagaattaa 600 cagatttttt tgttgccgct caactcgaat cggggttctt agggtgcctc cgtcttttgt 660 ttctctctct actttctagt gattggtcta ttcgtgcgcc gatttataca gcaggttatc 720 gccggccgaa cgtggtaccc ctggtgctgg tgcgtttctt cgtgctacgc ttccggcccg 780 gaccgataat ggctgcggaa aaggtatagg tcgtagcatg cataaactaa tacaaactcc 840 ccgtaaccgc ttcggtaatg tgtcttcgat tggcggctga cgagcaccgt cgtgatcctg 900 gtatgggcac gcgagcaaga aagtcgtccg tccctatgtt taggcattcg ccgcagacaa 960 gatgtggcct agagctgcta cttgggtcac ggttgccgaa gatacctttt ctttatttct 1020 ccctttatcg ctctaccacg tatagttatt ggcaaccttc tcggagggaa agggttgcat 1080 ggtttcccca aatctcttta tgaaatatcg ttttttttga gttgtccccg ttattattgt 1140 ttgtcgttaa tatttgtctg aactagattt ataggaaagt gcttaaagct agttaatata 1200 taggataact ttacgatgcc ggtagaccga agcccggatt ataggattca gccaaaagag 1260 agcgaaaccg tttgctgtgt atgtacgcaa gaagtcgaat accccggcca gctattagct 1320 accagctgta accactattt ccatcacacc tgctttagtg cacgttcagg aaataaaaaa 1380 gtctgtccag tgtgcagaac tgcgctgagc aaatcgaccc ttaatttggc agaaaaacta 1440 gaagcagctc aagcctcagg ccaccaacca agagcagtgt cagtgaccag atctcgatcg 1500 aaaacactaa aatcacagca aaaccaatta tccgctgcca atatgcagag tgggggccga 1560 agtataagcc aacccgaagg gcctagtgtt tccggtccaa tggccgctcc agatcgtcct 1620 agaggccaac cgccaaaccc tgcggcagca gtggtgtcag cttccgcagc tcgaactggt 1680 ctgccagcca cggatggtag cgtgcaacaa accattgcgg acgcagtggc agccgcacta 1740 gcacagcaca ctcaaatgct agctaacgcg attgaagcgg ggtttcaaag attgtcgctc 1800 cagaatgcag ctccagttcc aacggaaagg gtacacccag cacataggca gtcatctccc 1860 gttaggtcaa atcaatcctt tgagcaactc tttaggttat ccgccaggga aaacgcagcc 1920 ttaccaaact cagagtccga gcgaccgact aacttgtcag cttcacccga gtcgcagcaa 1980 tcacatcagt tgcgcgcgga tcggattagc caaataatcg cgaactggaa gttgagattc 2040 tcagggcgct ccactatgtc aatagatgat tttttgtatc gcgtcgaagc tctgaccagc 2100 cagacattag acagcaattt tgaattgtta tctcgttacg cgagcaacct gttcgaagga 2160 agcgcgggtg aatggtactg gaggtatcat aaaagcgtac caagagtgcg ctggattgat 2220 ctctgtatag ctctgagatc gcagttttcg gacggcagaa cagatttaga aatcagaacc 2280 gcgatcactc agaggaagca gaaactcaac gaaccgtttg acagtttcta tgaggccata 2340 gttgcgcttg cagacaagct tgtgcagccc atgaccgagc agtccttgtt agaggtgtta 2400 cgagcaaatc tgctcaatga catacaacat gagattttat atgtgcccat acagaacatt 2460 gtgcagttgc gacaagtggt acgcaagcgt gaaaggttca tcgaaaacac tgctaagcca 2520 atcccggcta ctcgacgatt tgttcctcgc aatcaggtga acgaaatgtc catccagaca 2580 gaaggtcctg cagaatccga ttcggaagaa gctgacgctt ttgatatcga cgcgatgacg 2640 ctttcttgct ggaattgcaa taaacaaggc caccgatatc aagagtgcga agcagagcgt 2700 aaggtgttct gttatggatg tggtaagcgg gacacatata aaccatcgtg ccgaaaatgt 2760 agctcgtcaa aaaacgagtt ggctcgtgca ccgctatcga gtgcacgcaa attagtgtcg 2820 aaggcgacga atacggagta accagggcaa cgtcaagctc actcccacta ctccctgaca 2880 tgataacgac cactatgaca ccattacagg acgacgatca ggaaacgacc cctgcgccta 2940 gcaaaactcg agaccgtaag cgtagaaagc tgttccgcac gcagagtaaa aatacacgca 3000 gagcaatatt atcctctgtt attagtaata taactgatct acgaccctac gccaatgtca 3060 cgctgtttgg caagacgata agtggattga tggacacagg cgcctcaatc agctgccttg 3120 ggggtaaatt tgcagaagag ttagctcgaa caaaccctga ggtcaagccg atgaagtcag 3180 ccgtgcgaac cgcagatggg aaaaagcaag ttattctagg taggatctcc acactcattt 3240 gctttcgtgg aaatgcaaaa cgaatatgtc tttatcttgt accatcgtta tcgcaggagt 3300 tatatctagg aatagacttt tggagaagct ttaatctatt gccgttcttt ttaatgagca 3360 aagcgaataa taacagtttg gcggctataa attttgacaa cccgatgcaa atacctttgt 3420 cggaattaca gcagaaggaa ctagtagcga tcgtacagat gttcccttcg tttgccgaaa 3480 aaggattagg aaagacaaac tggcttacac atgacataga aatttcagag actcgcccca 3540 taaaacaacg tcactacgca gtttcgcctg cagtcgaaaa atgtatatac gaagaattgg 3600 accgaatgtt aggccttggt gtgattgaag agtcagatag cccgtggtct tcccctgtcg 3660 tgattgtgcg taagccaggt aaggttcgcc tttgccttga tagccggaaa gtgaacggag 3720 taacagtcaa aaatgcttac cctatgcccc tcattgatgg cattttaagc agacttccaa 3780 aagcgcagtt catttcaagc atcgatctta aagatgctta ctggcagatt ccgcttagcc 3840 caagggctcg ggagaagacg gcttttgcgg tgcctgggag gccattatat caattcacgg 3900 tgatgccgtt tggcttatgc aatgccccac agaccatgtc caaactgatg gataaagtta 3960 tcccaccatc tttgcgaaat gagattttca tttatctcga cgacttattg gtgatttcgg 4020 agtcattcga gaagcacatg tccgtgttga aagcgctggc cgaccggctg tctcaagctg 4080 gtctgacgat aaacgtagag aagagcaaat tttgtctgaa ggaagtgaaa tacctgggcc 4140 acgttatagg gtatgggatg atacaaacgg atcccgaaaa agttactgcc attcgggagt 4200 ttccagtgcc gcgatcggtg aagcaagggc gacgattttt gggaaccact ggatggtacc 4260 ataaatttat taaaaactat gctggaattg cggccccaat atcagatacg ctaaagaaac 4320 gtcgttcgtt tgtatggacc aacgaagcgc aaagggcgtt tgaaagttta aaagaccaaa 4380 tgtgtggggc gccagtccta catagccccg actttaccat tccatttgcc attcattgcg 4440 atgccagcca caccggagtt ggtggcgtgt taatgcaaac tgatagcgat ggtaatgata 4500 tcccaatcgc attcatgtct cgcaagctga atcagtgtca gcgaaactat tcagtcacgg 4560 agaaagagtg tttggcagcc ataatgtgtg tgaagaagtt tcaagcttat gtggaagggc 4620 acgagttctc catccatacc gaccatgcat cattaaagtg gctaatgtca caatcggacc 4680 taagttctag gttggcaaga tgggctctca aattgcaggg gttcaacttc aaaattttcc 4740 accgcaaagg tagccaaaat gttgtcccag atgctctgtc gcgagtgcac accgaagatc 4800 tatctgcatt cagctgcgag ggattggtag acctgcagtc ggattgcttc aagtcagcag 4860 aatatttgga actgatacga aaaattgaat gcaatcagac acagatgcca gacattaaag 4920 tgatcgataa tcatatatat cggcgagcag aacacgcggc tgctgatcag atcgctgacg 4980 acctgtgttg gaagctatgg gttccaaagg agctggtgtc agatgtacta aaacagagtc 5040 acgaacagtc gttagcagct catagcggca ttaataaaac gttggagaga gttcgacgat 5100 attattattg gccaagttta gtaaccgatg tgagagcgtt cataaacagc tgttcggttt 5160 gcaagtgcac caagtacccg aatcgttcgt tacgaccccc attagctcca gtgaaggaaa 5220 ctcaacgagt ttttcagaag ttgtttgtgg atttcctggg tccgtatcca cgaagcaaat 5280 ccggaaacat cggtatcttc gttgttttgg accatctttc aaaatatccg tttttaaaac 5340 ccgtaaagat atttacagcg gaggcgataa ctcgatttat ggaagaggac ctattccatt 5400 gcttcggggt acccgaagtg gttgtgtccg acaatggagt acaatttaag tcccatcact 5460 tcaattctct gttgaaaaag tacaatgtgc agcattttta tacggctgtg tacgcacctc 5520 aagccaatgc atcggaacgc gttaaccgtt ccgttattgc agccattaaa gcttacatca 5580 agccagatca aactaattgg gatgagcagc ttagttctat tagttgtgcc ttacgatcta 5640 gtctgcattc agccctaaac gacagcccct atcgggttgc atttggacag cacatgatca 5700 ccaatggaga cacctatcag ctcttacgaa atttgcaggt tttagaagat cgtacagtga 5760 gttttggtcg ggaagattcg tttgatgtga ttcgaaaaac agcccaacga acaatacaaa 5820 aacaacacca acgaaacgaa cagcagtaca atcttcgaag tcgtgacgta gcgttcaagg 5880 ttggccagga ggtctacaga cgcaactttc agcagagcaa tttcattaaa ggtttcagcg 5940 caaaactagc tcctacgttt gtaaaagcta gaataaaacg caggcttggt cacagttatt 6000 acgaattaga agatccccaa ggtagactta tcggaaaata ccacgctaaa gacataaagc 6060 agtgaccatc ctaatacagg ttaacggttt tccgccaagc gtctatcacc ctttgatgtg 6120 attaacacca aagtgtgatc ttggccgggg ggatt 6155 // ID Gypsy-38_NVi-LTR repbase; DNA; INV; 2302 BP. XX AC . XX DT 06-JUL-2009 (Rel. 14.07, Created) DT 06-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-38_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2302 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1386-1386 (2009). XX DR [1] (Consensus) XX SQ Sequence 2302 BP; 565 A; 557 C; 623 G; 549 T; 8 other; tgcgacgtgt agcctgctcg gctacacgtg tacactagca ctaaattgtc tgcgcaatca 60 atgttgttcc ctgctgcaat ttatcatgca gcagaggaac aataacattg attgtcgtcg 120 cgccgaggaa gataacctca cctgccagcc gcatggagca acggtggctc ccctgcaggc 180 ttggcaggtg actatgccct cacggtagcg ccgccgcgga tcgccaggtg ttgtcggaca 240 ccgctgtacc gacgcagccg acacgcacct gctcctgttc gcaccaggag tcgcgccgcg 300 aaacgaagaa gcgagatcgg gacgagcgcg gaamacagca cgaggcggaa tgacgcgtgc 360 gcatcgccgc cgcgcatatg cgagcgcgcc gcggtagctc ggcggagcga gmgcgcgcga 420 gmgagagaga gagagakagc gagagagaga gagagagaga gagagagagg ggctcgcgcg 480 cgaagcccgc atgccgtcgc ggagtcggcg cgactagtgg ggggagcgcg crtacaccac 540 tagcgcgcac cactaaccga gccatgccac cgcactggcg cagtccagtg cgctagtggg 600 agtcccgtgc accatgactg cacggtgact taaatttaac tatagcgcca gatgacgcgt 660 cagagcaatc tgacgtgttc gcgctatctc tctctcgctc tctctctctc tctctctctc 720 tctctctctc tctattagac gttgatggaa gagagcgttc gtgcggttga cgctggcggt 780 gtcgcgccgc cggcaattaa tatagagggc gcggattcga gtttccgggt acgcgcagga 840 cttcgaggac gagagagact tcctcctcgt ccagaatagt ctcaaccgct cgagcgagta 900 gacgcctcta accagcctag tgcaaccccg agtttcagag taacagagag agagagagag 960 agtagagaag ccattgagaa ggagagttga ggttccgaag aagaagataa agtaagtcgt 1020 cgcatattcg gcacggttgc caataattat tgtctaattt atgtggttgt tggggtaatc 1080 gtcgcgtcgc ggaaataagt gtttttgaca ccgagcgaat actcgtttcg tagcggttga 1140 gacgcggtcg atccaasagg atcccgccat tttataataa gcgtttgtgc cgctgtcgat 1200 tgtaaatcaa tccagcatat aawtattggg gaaagagttg agccgctgta gattgcgaag 1260 caatccagca ccgttatacg gaaagtcgtt gatcgaaatc gtcaagtgta ttgcgttgag 1320 ccgctgtaga atcgctaagc gatccagcaa ttgttmaggg tattgcgttg cgccgctgtt 1380 gatcgccaag cgatccagct ttcgaaaaga agaattatcg agccgctgtt gatcaaaaga 1440 tccagcattt aagagtatcc ttaggttgag ccgccgtcga tctttaagat ccggaaaaac 1500 acgattttcc cgattaatat ctgcgttgcg tcgtcgccga tcaaaccgtg atcgaagtgg 1560 ctgtatcgag agtaatttcg cgagtcgtac tcgcgtcatc gcaccttata tatagctgtt 1620 ttccgttacg ccgtaaaaac gttccgttat tgcggtggtt tatcgcgacg ttctatgatt 1680 ttcgcgcgtt tcagcaagga tccgtacaga tacgaagtgg caccgagcgt ccgatgacca 1740 gccctaatcc aggagtaaat caaaacggaa aacgccatct tggaggacgc cgcgtccttt 1800 gatcgcgacg ccgacgccat tttgggcgaa cgaataacga atcttagagt cacgaaagcg 1860 cggtttattg tttcgcgtcc gttgatttca aggagtattt cgttaccgtg tcgtagagtc 1920 aataatcggt tattatcgtg tcaccaaatc cattgtcggt tgaccaagga tataagtctg 1980 aattgtttat tgttaaatgc gttattattt gcgtaacatc gcgccttgag tctgagttga 2040 tttattacgg ggaaatcttg taatcagacc aagtctgatc catactgggg gttgaagaat 2100 acatcgtatt gttattgttc agcataaacc gttgtgtatc aagctttatt tttgaacctc 2160 taccacgcca ggctctcctg tcctaaaacc tacacgttgt ccccctctgt agatgtaaga 2220 gtccgcggta cgaacatcaa gtagaaaata gtgtaaactg attgtttgcg tcaccgaagt 2280 cggttataaa aacgcaacat ca 2302 // ID DNAX-3B_AP repbase; DNA; INV; 138 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-3B_AP. XX NM DNAX-3B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-138 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2053-2053 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 138 BP; 21 A; 42 C; 44 G; 31 T; 0 other; tctgtccagc gagagaacgc gccgattctg cgcatccccc cgcgtcacct tgctatcgaa 60 accggttccc ctatggcagg atgttgggtg ggggaaccag ttttggtcgg tgcgcggcgc 120 gttctctcgc tggacaga 138 // ID CR1-26_HM repbase; DNA; INV; 4673 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-26_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4673 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1854-1854 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 99..935 FT /product="CR1-26_HM_1p" FT /translation="MDKKNIVVRKKSNETGNIDETQIXTNVVRDIFHEMFS FT KQQKDIFKLISGNLKITNDKINELLDEIHKSKQICDSLKKENEKVNIKLKE FT ITEKVRILEKTNKDIEESLTVTQDIQEKKVCELEKKIKNKNIIGEEEKNKL FT RQLEDRLRRNNLRIDGLPENDQETWDETEKKLLILFENELNIRNVDIERAH FT RVGKKEENKTRTIVVKILHYKDKIKVLNSSNRLKGTGIYINEDFSFETTVI FT RKKLLEEMKMHRKNGKYSSIKYDKLIVREFRKIKASA*" FT CDS join(1052..1405,1260..2063,2082..3101,3105..4232) FT /product="CR1-26_HM_2p" FT /translation="MKIMLHLTQNLIHYKIKIGIYILDNNSDPDINFYSNN FT KVFQNINTLYYDPTSEIIMQGLDVNSFSILHLNIRSFSKNFELFKQFLCEV FT KFNFKIISLSETWCNDEHIRDKYKFSATKLILGVFQKILNYLNNFYARLNL FT ILKLLVSARHGVMMNILETNTNFQLPNYKVIHQFRGSGKKGGGLCVFISNS FT LLYKLKKDFCSTTNDCESLCVEIINKTTKNIIIHVLYRPPSGSIKQFEKHI FT KNIIKNKLSKNKSIYFIGDFNLDLNMSHLNTXINNFFNVIYQKGFIPLINK FT PTRITRESATIIDQIVTNELKSKIKTGIFKCDISDHFPVFLISQKCDNIYA FT QKVKKNTRYINEKSMENFNTLLSDLNWDDLLHIEHVTSFILNTTRDLLDIE FT HADKAYDKFIDKLQQLYNSAFPVVTKCVKKKTLLNPWITPGIIKSSKKKQR FT LYNKFLKKKTFNNETNYKNYKRIFEMVIKKSKKYYYTEHLIKHKNDPKKTW FT DTIKEVIGTKQNDGNSLPIKLNIKNKTITNKSLIAETLNQYFVSVGSTLAS FT KIETTEVNFESYLTPNKTSKMDNYEITEKELLDSVYFLKLNKSVGFDNISS FT NVIKKSIKYLTIPLLHIFNLSLKLGIYPEKLKIARVIPIFKSGDIFNPENY FT RPISILPCFSKILERIMYNRILTFLNINNILYNKQFGFQPGYSTDHAIINI FT VHDIFKAFDESKFTLGVFIDLSKEFLILAKPLIQAFDTVDHNILIKKLESY FT GIKSTYLEWLKSYLNNRKQYIAHEERKTEYLTITCGVPQGSILGPLLFLIY FT VNDFHKFSNILNSVLFADDTNLFYSNGDINLLFKIVNKEFLNLAEWFKANK FT LSLNLNKTKYTFFHRLHDKENIPLKLPDLYIGNSKIVRESSLKFLGVILDE FT NMTWREHIRTIENKTSKNIGILYKAKQILNQNCLKILYFSLIHCYINYANI FT AWCSSNVSKVKKLLSRQKHAVRIISNAGRFAHSKELFKNHHILNVLQLNLY FT QILIFMYKLHNKLTPIIFNTFFNKINHVYPTRFSNYNYEQPKIHYLATKFS FT IAIRGPKLWNTLLNNQLKNCSSLSLFKQKLKQKLLNDVNELNSF*" XX SQ Sequence 4673 BP; 1893 A; 641 C; 581 G; 1552 T; 6 other; ttttgtggcg astagacgtg ttttattcag aggaaataat tactttttga aacttttttt 60 tttttttttt ttttaataac aaaaaataaa cttattaaat ggataaaaaa aatattgttg 120 ttcgaaagaa atctaacgaa acggggaata tagatgaaac acaaattyca acaaatgttg 180 ttcgagatat tttccatgaa atgttttcaa aacagcagaa agatatcttt aagcttataa 240 gtggtaacct aaaaataact aacgataaaa taaacgagct acttgatgaa atacataaat 300 caaaacaaat ttgcgactca ttaaaaaaag aaaacgaaaa ggtaaacatt aaacttaaag 360 aaattacaga gaaagtaaga atcttagaaa agacaaataa agatatcgaa gaaagcttaa 420 cagttacaca agacatccaa gaaaaaaagg tatgcgaatt agaaaagaaa attaaaaaca 480 aaaatattat aggcgaggag gaaaagaata aactacgaca gctagaagat agactcagaa 540 ggaataatct tcgcatcgac gggcttcccg aaaacgatca agaaacctgg gatgaaactg 600 aaaaaaaatt attaatttta tttgaaaacg aattaaatat taggaacgtt gatatcgaga 660 gggctcacag agttggaaaa aaagaagaaa ataaaactag aacaattgtt gttaaaatac 720 tacattacaa agacaaaata aaagtactaa attcatctaa tcgacttaaa ggaacaggaa 780 tatacatcaa cgaagacttc tcttttgaaa caacggtcat aagaaaaaaa ctcttagaag 840 aaatgaagat gcacagaaaa aatggtaaat attcttctat aaaatatgat aaacttatag 900 ttagagaatt taggaaaatc aaagcaagtg cttaattact ttatcattta tattaagaat 960 tttttttttt tttatcatta ttattattat ttttttttct ttttcagtaa aataataatt 1020 gaacagttta taaaaaatac cctactcatg aatgaagata atgttacact taactcagaa 1080 tttaattcat tacaaaataa aaataggaat atatatctta gataacaatt ctgatccgga 1140 tattaatttt tatagtaata acaaagtatt tcaaaatatt aatactttat attatgaccc 1200 tacatctgaa atcataatgc aaggactaga tgtaaattca ttttcaattt tacatttaaa 1260 tattaggagt ttttcaaaaa attttgaatt atttaaacaa tttttatgcg aggttaaatt 1320 taattttaaa attattagtc tcagcgagac atggtgtaat gatgaacata ttagagacaa 1380 atacaaattt tcagctacca aattataaag tcattcatca atttagagga tctgggaaga 1440 aaggaggagg tttgtgtgta tttataagta attctttatt atacaagcta aaaaaagatt 1500 tttgctccac cactaatgat tgtgaatcac tatgtgtgga aataataaat aaaaccacaa 1560 aaaacattat catccacgtt ttatacagac caccttccgg ytcaataaaa caattcgaaa 1620 agcacattaa aaacataatt aaaaataaat taagcaaaaa taaaagcatt tattttatag 1680 gtgattttaa ccttgacctg aacatgagtc atttaaatac aaamataaat aatttcttta 1740 acgtcatata tcagaaaggc tttattccac taattaataa acccacaaga ataactagag 1800 aaagcgcaac tattatagat caaatagtca ccaatgaatt aaaatcaaaa ataaaaacag 1860 gaatatttaa atgcgacatt tcagatcatt tccctgtatt tcttatctca caaaaatgtg 1920 ataacattta tgcacaaaaa gttaaaaaaa atacgcgata tataaatgaa aaatccatgg 1980 aaaattttaa tactctttta tcagatttaa attgggatga cctccttcat attgaacacg 2040 tgacctcctt catattgaac acgtgacctc cttcatattg aacacgtgac ctccttgata 2100 ttgaacacgc agataaggca tacgataagt ttattgataa attacaacaa ctttacaatt 2160 cggcttttcc tgtagttaca aaatgtgtga aaaaaaaaac attattaaat ccatggataa 2220 cgcctggaat tataaagtcg tcgaaaaaaa agcaacgact ttataataag tttctaaaaa 2280 aaaaaacttt caataatgaa actaattaca aaaattataa acgtattttt gagatggtta 2340 tcaaaaagtc aaagaaatat tactatactg agcatttaat aaaacataaa aatgatccaa 2400 aaaaaacttg ggacacaatt aaggaagtaa ttggcacgaa acaaaatgac ggaaatagtc 2460 ttcctataaa actaaacatt aaaaataaaa ctattacaaa taaatcttta attgctgaaa 2520 cacttaatca atattttgtc agtgttggtt ccactttagc gtcgaaaata gaaactactg 2580 aagtaaactt tgagtcatat cttactccta ataagacctc taagatggat aattatgaaa 2640 ttactgaaaa ggaactctta gattcagtat attttttaaa actaaacaaa agtgtggggt 2700 ttgataatat tagtagtaat gtaattaaaa aatcaataaa gtacttaaca atccctcttt 2760 tacatatttt taatctctct ttaaaactag gcatttatcc ggagaaacta aaaattgcaa 2820 gggtcatacc tattttcaag tcaggagaca tatttaatcc tgaaaattac agaccaattt 2880 caattcttcc ttgcttctcc aaaatcttag agcgaattat gtataaccga attttgactt 2940 ttctaaatat aaataatatt ctatataaca aacaatttgg atttcaacca gggtattcaa 3000 ctgaccatgc cattattaat attgttcacg atatatttaa agcatttgat gaaagtaagt 3060 ttacccttgg agtttttata gatcttagca aggagttttt atagatctta gcaaagcctt 3120 tgatacaagc ttttgataca gtagaccata acattcttat aaaaaaactc gaaagttatg 3180 gtattaaaag tacatattta gaatggctca aaagttatct aaacaataga aaacaataca 3240 tagcacacga agaaagaaaa actgaatact tgactattac ttgcggtgtt ccccaaggct 3300 caattctggg tccactttta tttcttattt atgtaaacga tttccataag ttttcaaaca 3360 ttttaaactc agttttattt gcagatgata caaatttatt ttattctaat ggagatatca 3420 accttttatt taaaatagta aacaaggaat ttttaaatct agcggaatgg tttaaggcaa 3480 acaaattgtc cttaaattta aataaaacta agtatacttt ttttcataga cttcacgata 3540 aagaaaatat tccattaaaa cttcctgatc tttacattgg caactctaaa atagttagag 3600 agtcatcatt aaagttttta ggagtgatac ttgacgaaaa catgacatgg agagaacaca 3660 taagaacaat tgaaaataaa acttcaaaaa atattggtat actatacaaa gctaagcaaa 3720 tcttaaacca aaactgctta aaaatcttat atttctccct catacattgt tatataaact 3780 atgcaaatat tgcatggtgc agctctaatg taagtaaagt taaaaaattg cttagtagac 3840 aaaaacacgc tgtcaggatt atttcaaatg caggtcgatt cgcacattct aaagaactat 3900 tcaaaaatca tcacatactc aatgttcttc agttaaacct ttatcaaatt cttattttta 3960 tgtacaaact acataataaa ttaactccta taatttttaa tacatttttc aataaaatta 4020 atcacgtgta ccctacaaga ttttcaaatt acaattacga gcaacctaaa atacactatt 4080 tggctactaa attttctatt gccatcagag gtcctaaatt atggaacact ttattaaata 4140 accagctgaa aaattgttct tcactttctc tatttaaaca aaaactcaaa caaaaacttc 4200 ttaacgatgt taatgaatta aattcttttt aagattcttt tatcttcttt tttaacaact 4260 atcttattaa ctagtctaac caatttaaat tttctttaat tatcctatta gcttatcttc 4320 agtttttaag tgtgattgct tttttttttt ttttttttat tttatttttt tttttttttt 4380 tttttgtttt tactattttc ttttgattat agattagtaa ctgatcttaa catatttgta 4440 tatatttgta tataattttt taaatggtac aaagtaattt tatrttttat gtttamattt 4500 taatggttat acacgcttgc ttatgaaagc acgtcggggt gtagtgataa ggcaaatatt 4560 gtcttctttt agccccggtc atgtagattt tatttatata acacggcatt gtattctttt 4620 ttttttaatg acgaaataaa aaaaaaaaaa aaaaaaaaaa aaataaaaaa aaa 4673 // ID Gypsy-226_AA-I repbase; DNA; INV; 7106 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-226_AA_; KW Gypsy-226_AA-LTR; Gypsy-226_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7106 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1055-1055 (2011). XX DR [2] (Consensus) XX CC Positions [5151-5633] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1435..2727 FT /product="Gypsy-226_AA-I_1p" FT /translation="MSEYVHFSQIEAYVKRCVEQITQQGTRYSMGQENLVN FT NLADEIAQVRFADQEAEPILRNRVGPPQSSMVREPSPPLQLSGPRPDARST FT PRQNPANQHRTFSLEGDRFVPVQFGNIPNPRATMESPGPRRYQADVDGYRV FT NSFSRRQPHQQCAIIEKWPKFTGDTNSVPVTDFLRQIDILCRSYDISKQEL FT RMHAHLLFKDNAYVWYTTYEEKFNSWETLEVYLKMRYDNPNRDRLIREEMR FT NRKQRPNELFSAFLADMEMLAQRMIRKMSEAEKFEMIVENMKLSYKRRLAL FT EPIQSIDHLAQMCYKFDALESNLYQAYNQPKSVHHIDLDDEDNEECLDPSE FT LEICALRSKMMQNRGRETQKVNLESKSSTEQKNTEMTCWNCNKAGHLWRDC FT DKRKRIFCHICGMMDTTAFKCPNQHNIRSENEEPKNE" FT CDS 2772..4388 FT /product="Gypsy-226_AA-I_2p" FT /translation="MKIPNSNFSYFQRVFQINTFVSRCPHLKVRILSEELE FT GLADTGASISIISALDLIEKLGFEIYPIPLQVKTADGTGYHCKGYVNIPFS FT IGNVTHVLPTIVVPEITKKLILGMDFLNKFGFHLTASRNEGVSSNTSSVAA FT PEVNQIDLCFMDEYFGENIETICFELQPCKVDETSKIELDESLEMPTVEIP FT ESHISTANGLLTEHHLSDSERQSLFEAVQTLPATKEGQLGRTALLKHKIEL FT VPGAKPKKFPSYRWSPSVESVIDTEVERMKKLGVIEECQQAVDFLNPLLPI FT KKANGKWRICLDSRRLNQFTKRDEFPFPNMMAILQRIPKSKYFTVIDLSES FT YYQVALDDSSKSMTAFRTAKNLYRFTVMPFGLSNAPATMARLMTRVLGHDL FT EPRVYVYLDDIIIVSNSLEEHLELIREVARRLRNAGLTINLQKSKFCQTKI FT RYLGYVLSEDGLSMDISKIQPVLEYPVPKTIKDIRRLLGLAGFYQKFIPNY FT SEVTSPISDLLKKNRKRFTWTEEADAALTKLKTLLVTAPVLAN" XX SQ Sequence 7106 BP; 2133 A; 1435 C; 1598 G; 1935 T; 5 other; ttttggcgcc caacgtgggg cccgagtttg aatagaattt gccaataatt tgaaatctgg 60 gtttgggtcg aaaagattgt atatatttag agttaagacc cttatcatta acaggtttct 120 cgaattggtt gattgaataa tcagtctgat tagaattgta aattttgaat tgtgtttatt 180 aaagttggtt agtcagttaa ttttggttag aattaagttt tgaacttgag tattgaattt 240 gttggtttag gtcggttagt ttgcatattt cttcgattca tatttgtttc taaattatta 300 ttttgttcta agttacctga attcgacaaa gagttacttc gaattgaatt tcattatctt 360 ctgctgtcaa ttttgtgtta tcgagcctac gcatttgaat cggacgtgag tcggtggctt 420 ttcttaaaac tagtgataca gtaaactaag tgcaccatgg attggtaccg tatctatgct 480 gacgacctcg atgaagaaga gcttgactat gagctggcga ttcgagcgtg tcccctcgtg 540 ggcggcmcag agactcggcg aagggaactg aggaacctcc ttcgggaccc gaattcgtcg 600 aatacgttgg tgattgtcga tcacatgatg caggctgacc ttgatattgt gccatacaag 660 ctacaagaaa tcgagaggat catgattgag gaatctaaca gagggatgct atcgagattg 720 gtacattacc atcagagaat ccgacgttac gtcccacatg atgccgcgga actggagaat 780 cgcaggcgtt tgttggagac ggttacggat ttggccagga ggtatttcca tttggacttc 840 cgtctgatgg atactccagt accacaggac gaagttatgg gcagagccac agtcaactca 900 ccacggttta cgtcgcaagc aacgagtgcc actcgtcctg gagaagatgt acaggcaatt 960 gtcgggccga tggtgaatcc agttcggtcg aactccgccg aatcttcaaa tgtagcagga 1020 gcggttggcg gcgaagtcgg aatagtttca gataatcgtc aattcgcatt gaataatcca 1080 gaacctgccg ttattaatcc agatccagac cacagtgcat ttcctctttg gcgtccgact 1140 gccgcccagc agccatatac aggggcgatt cctcgagcgg caaggccatc cattccggag 1200 tttgttcagt caccgccgca gatgatttcg tcactcaact tcaatctcca gaatcctcaa 1260 cagggcacga gatgcagtat tccttgttct gctscaatcc aagagactat cggaaatcag 1320 ccaccttttg tacgtcggca gcagggagat gagcgacgcc cgataggagt gcagcttcct 1380 cagagggccg acgaagttcg tccgtcgtcg agaacgttag ctgaaaatat caatatgagt 1440 gagtatgtgc atttctcaca aattgaggct tatgtaaaga ggtgtgtgga acaaataaca 1500 caacaaggta caagatattc gatgggacag gaaaacctgg tgaataacct ggcggatgaa 1560 atcgcgcaag tgcgttttgc agatcaagaa gcggaaccaa ttttgcgaaa tagagttggt 1620 cccccacagt cttcaatggt gcgtgaacct agtccaccgc ttcagctcag cggacctcgt 1680 ccggatgcta gaagcacacc gagacaaaac ccagcgaacc aacacagaac cttttcattg 1740 gaaggagatc gatttgttcc agttcaattt ggaaacatcc ctaatcccag ggcaactatg 1800 gaatcgccgg gtccgcgaag gtatcaagca gacgtcgatg gttatcgcgt gaattcgttt 1860 tcaagaaggc aacctcatca acagtgtgcg atcattgaga agtggccgaa atttaccgga 1920 gatacgaatt ccgttccggt gaccgacttt ttgaggcaga tcgacattct ttgtagatct 1980 tacgacatca gtaaacagga acttcgtatg cacgcccatc ttttgtttaa agataacgcg 2040 tatgtttggt acactacgta cgaagagaag ttcaattcgt gggaaacatt ggaggtctat 2100 ttaaaaatgc gatacgacaa tccaaaccga gaccgactga tacgcgaaga aatgcgaaat 2160 cgtaaacagc gcccgaatga actcttcagt gcatttttag cggatatgga aatgttagct 2220 caacgtatga taagaaaaat gtcggaagcg gaaaaatttg agatgatagt tgaaaacatg 2280 aaactctcgt acaagcggcg tctggctttg gaaccaattc aatcaataga ccacctggct 2340 cagatgtgct ataaattcga cgcgcttgag agcaatttat accaggccta caatcaaccg 2400 aagtcggtgc atcatatcga tttggatgac gaagacaatg aggaatgttt ggacccttca 2460 gagctcgaaa tctgtgcact ccgttcgaaa atgatgcaaa atagggggag agagacacag 2520 aaggtcaatt tggagtcaaa gtcgtccaca gaacagaaaa atacggaaat gacttgctgg 2580 aactgcaata aggccggcca tctttggcgc gactgcgata aaagaaagcg aatcttctgt 2640 catatctgcg gaatgatgga tacaacggcg ttcaagtgtc cgaaccaaca taacattcgt 2700 tcggagaacg aagagccaaa aaacgaatag tagaggacga ttccgggaat cattgtcctc 2760 ctccaggtga tatgaagatt cccaacagca atttttccta ttttcagcga gttttccaga 2820 tcaatacgtt cgtttcgcgt tgtccccatt tgaaagtacg aattttatcc gaggaactcg 2880 aaggattagc ggacactgga gccagtattt cgataattag tgccttggat ttgattgaga 2940 aacttggatt tgaaatttat ccaattcccc ttcaagtaaa aactgcggat ggtactggct 3000 accactgtaa gggatacgta aatattcctt tttctattgg aaatgtcact cacgttcttc 3060 ctactattgt cgttccagaa ataactaaaa agttgatatt ggggatggac tttctgaaca 3120 aatttggttt ccacttaact gcttcaagaa atgaaggcgt ttcaagcaat acttctagtg 3180 ttgcagcgcc agaagtgaac cagattgact tgtgcttcat ggatgaatac tttggcgaga 3240 acatcgaaac aatttgcttt gagttacaac cctgtaaagt agatgagacg tccaagatcg 3300 aattggacga aagcttagaa atgcctactg tggagattcc agagtcccac ataagcactg 3360 cgaatggtct actaacagaa caccatctga gtgactcgga gcgtcaatca ctgtttgagg 3420 cagttcaaac actccctgcc acaaaagaag gtcaactagg aagaacagct ttgttaaaac 3480 acaaaattga attagtccca ggagctaaac cgaaaaaatt cccatcctac cgttggtctc 3540 cgtcggtcga aagcgtaatt gacacagaag tcgaaagaat gaagaagtta ggcgttattg 3600 aggaatgcca acaagccgta gacttcctta atcctttact acccattaaa aaggccaatg 3660 ggaaatggcg tatttgcctg gactcgagaa ggttgaacca atttactaaa agagacgagt 3720 tcccttttcc gaatatgatg gccatactcc agaggatacc gaaatccaag tattttacgg 3780 taatcgatct cagcgaatct tactaccagg ttgctttaga cgattcttcg aagagtatga 3840 ctgcattccg tacggccaag aatctctacc gatttaccgt tatgccattt ggtttatcta 3900 atgctccagc aaccatggca cgcctaatga cacgagtatt gggtcacgat ttggaaccaa 3960 gggtttatgt atatctcgat gacataatta ttgtttcaaa cagcttggaa gaacatttag 4020 aactgataag agaagtagct cgaaggcttc gtaatgcggg tctcacaata aatttacaaa 4080 agagtaaatt ttgtcagacg aagatccgat atttgggata cgtactctca gaagacggct 4140 tatctatgga tatcagcaag attcagcccg ttcttgagta ccccgtaccg aagacgatca 4200 aggacataag aagacttctt ggattggctg gattttatca aaaattcata cccaattatt 4260 cggaagtaac tagtcctatc tcagatctac tgaaaaagaa ccggaaaaga ttcacttgga 4320 ctgaagaagc agatgcagca ttaacaaagc tcaaaaccct tctcgtaaca gcccccgtgc 4380 ttgcgaaccm cgacttcacg aagactttcg tgatcgagac tgatagttca gaccttgcta 4440 taggagcggt gctgacacag aatcatgaag gggaaagacg accgattgcg tacttttcaa 4500 agaagctctc gagtacccag cgtcggtata gtgccacgga aagagagtgt ctcgctgttc 4560 tattgagtat agagaacttc aaacacttcg ttgaaggatc acagtttgta gtacaaacgg 4620 atgcgatgag cctcactttt cttcgtaata tgtccattga gagcaaatca ccaagaattg 4680 ctcgatgggc actgaaactg tcaaagtatg atctcctgct acagtatcga aaaggtagcg 4740 aaaacattcc ggcagatgcc ctatcgagag ctgtgaatac cgtcgacgtt actctcaaag 4800 atccgtatat cacgcagctc aagcagatga tcgaaaagtt tccagaaaag taccgcgatt 4860 tcaacctcca agatggaaaa gtgtataagt tcattaccaa tacgtcggtt gtagaagata 4920 atagttttag atggaagtat gtggtaccct tccacgaaag accggatatt atgcgcaaaa 4980 ttcatgaaga agcacatctc ggtccggtta aaacgctggc taaaatacgg gagcggttct 5040 attggccacg aatggcggcc gaaatcaaaa ggttttgcca gcgatgcgaw atttgtcgtg 5100 aatcgaaagc tccgaacctg aacgtgactc ctcgttgtgg aaagccaaaa ctttgtacac 5160 gtccatggga actcatttcc ctggattttc tcgggccgta tccacgcagc cgaaaaggaa 5220 atgtttggat tttggtagta agcgatttct tttcgaaatt tgtgctagtg caatgtatgc 5280 gtactgccac tgcacaatca gtatgtgcat ttgttgaaaa tatggtgttc aacctgttcg 5340 gtgcgccttc tgtgtgtatc actgataatg cacaggtgtt taaaggagaa atgttcacca 5400 aattgctaca aaagtataag gtaactcact ggaatctctc cgtgtatcat ccggccccta 5460 acccgaccga acgtgtcaat cgcgtgatcg tcaccgcgat tcgatgttcg ttgaaccata 5520 aacgggacca cagggattgg gatgagtctg tccaccaaat tgcgaaagca attcgtacga 5580 acgtacacga cagtacggga tatacaccgt acttcgtcaa ttttggtaga aatatggtga 5640 gcagtggaga agaatacgag ctcctgcaac gcgtccaggg taaccgaagt tcggaacacc 5700 tgaaacgaga aacaagtaac ttgtttaata tcgttcgcga aaatctacta aaagcataca 5760 agagatatag cacgccgtac aatctgcgag ccaatgcaaa acatcacttt gctgtgggag 5820 aagtagtata caaaaaggaa atgcacttat cggacaagca gaagaacttt gtcggtaaat 5880 tcgggaataa atttagtaag gttcgagtgc gtgaagtgct cgggaccaat acttatgtgt 5940 tagaggatct gaagggaaac aggatccctg gcagttatca tggatccttt ctcaaacgcg 6000 cataaaaaca ccagctatga cggtgcatcc gccgatgcac ataaacgagc cgatatgaac 6060 aaaatactcg aagaggtgtc aggcaaaaat ccaacgtcga gatgtcactg tgtgtttcct 6120 ccgttgttga tttattggta ccattagtgc cacttggtta tttaagaaaa taaacataat 6180 cttgtagaac attgtttgat tcgttggttg tagattaaaa ttgaagtagg ttgaatgagg 6240 aatcgttgca ttcatcactt tcactttttt tttgtgtaca tacttccccc aggcgaaaag 6300 actaattatt agcattatct cattatatac gtacctcatt caatagccca gcaactcctt 6360 tttatccata gtaaagtatg ttctaacgtt agcccaaaaa aatagcagat ttctccataa 6420 ttttccaatt tttcaatagt ttgatgctga aatatcactt gttttgtttt ggttcccctc 6480 actttccctt tcaattgaca atagcccggc cacgtatcgc tcgatatgac agatccgttt 6540 gacagcattt ccacaacatc aagtaactta cagcatagat taacatttca tttggtccaa 6600 agtatatttg tggcaaagta gtgtagttaa tggaataaga atgttcagag ttgtttctga 6660 gtcgaagagg agaaaattag tgttattgaa cgaaaagtgc tcagatattt ctgaggttta 6720 tttgtggtaa cgacgattac tcttctcaga cttttctgag gtaaaagtgt cgttagattt 6780 cgttaagttg attagatgat taaaatgatc agacatttct gagttaaatt ggtttaggta 6840 aatgaaaata tatctcagat atttctgagg tttggaataa agtttagtta aggtagtgaa 6900 taaccagaag accaaagcca caacatacct gcaatagttt tcgaaataga ttacgttcaa 6960 tacgattcat tcagattctt cattcaatca accttcagtg atgttcgatt cgtagttctt 7020 aattaatatg taaaatcgcc gtgttttata aaaattttga aattttcgtt tgctcaaaat 7080 tkttgttttg ttagaatggg cgagaa 7106 // ID Tx1-1_CQ repbase; DNA; INV; 4960 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4960 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 633-633 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 143..1384 FT /product="Tx1-1_CQ_1p" FT /translation="METRKNTLKILFATGAKVPSYLEVLKFMSTGVKIPAT FT DVHSVYKDENDQKFYIKFMDENSYNRFCNSVEDQYWFRFDDGSRSPVQLEL FT ASRQFKYIRLFNLPPETEEKEIAMALSKFGKIRQHVREKYPQDLGYHVFSG FT IRGVYMEVEKEIPANMYIAHFRARVYYDGLKNRCFFCKAEGHMKVECPKLA FT SLRVNAETGGQRSYSSVTANLAIANSSTIEPDNTLTMTTLPLTSPPARPKM FT PMQPQEGDGASSAKISTPANNPFVTLTPAEVPMETPVPRPEETTNNQASAA FT TTTERPEETTNNQASAATTTETPNKPVDATATVATPKETSAAEAERSGHDG FT MIIDGEQLMVDEAIKGTAEGEGKLELKKRPIETSLSLGSNDSDLGQGKGSN FT SKKKRGAQGQGKGKGKGRGK" FT CDS 1556..4798 FT /product="Tx1-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MPFIRKFATVNLNAISSKLKLSLFKDFVWNNDLDFVF FT VQEVAVENFNFLSSHTALVNISVDGKGTGILVRKNIDYSNVIMNSNGRITS FT AEIDGINFINIYAHSGSKFKKERDKLFTEDILIHLSETKENVLLGDFNCII FT DKNDSNSSIKNVSNGLKNMTSSLTLIDIELNHNKNRIFTFVRGNSKSRLDR FT IYASKEFAENVRSVETIAVAFSDHHSVVLKLEIGHQFMNFYGRGYWKINSF FT LLNDPDIIDKFKKKYTELKLRNSYSNLNFWWNTVFKSNAKSFFKKENFLSN FT QQSAREKSFYYNCLNEIVQKQEQNIDCSNELSVVKSKLMELEQRRLSKIKF FT KLKAETLHADEKLSLFQVTSFINRSSTSNLIKLRINDEITSDIKKLKNEIY FT HHYSQKFQNQHVNNTDAETVLGYLNKKLEINDKIHLNRPIEIYEIENALKN FT ASSRSSPGPDGLSYEFYSIFFDVVKDDLLNLFNDYLVNEEYPPALFTAGVI FT TLIPKVGDKYDLQNKRPISMLNTDYKLFTKILWNRIQPMMKNVLGAEQAAC FT VEDNSCITNLRLLRNILVKANKSKKFKGAILSLDLEKAFDRVDHDFLWKVL FT KKIDFPDEFINCLKKLYKNATSTVLFNGFLTEPFQILSSVRQGCPLSMALF FT AFYIEPLIRMIDQNINGVLIDNRFLKVVAYADDINIFVKNDSEFDIILQLV FT HYYSLYSKIKLNLFKSQFLRFNSCMIGPQQIKEVDDLKILGVHFSQSFDKT FT IDQNYNGIIKFINISLLQQYSRRLNLFQKVYVLNTYILSKLWYVAQVFPPD FT NKHLASLRSICGKFLWKGHFYKVERNELYLPVLKGGIALLDVECKTKSLFI FT KNILFGKSDELDTFMINQLVNKNLTRNTREWLQEAEVLKNENHLNTSKKLY FT DYFIESKKIEIKKQKENPHLQWNTLFENTNKNFLSTTAKSYLFMASRDIIP FT CNSKLFRHGVKGTESPFCELCGAQDTVEHRFKDCVHSKIIWTWLNNIVINK FT LKLKVKDAEEVLWCDFNDQNVKLKAALWLTVESLAYCLQTFGEGTLDELKS FT RLCESRWNSREIFKKHFKHFLNVF" XX SQ Sequence 4960 BP; 1724 A; 803 C; 1024 G; 1409 T; 0 other; cagttcgctc ttggacttcg gagagtaaag tcacgtttct cgaagagtgc gcgtacaagc 60 agattattgg tggtgcacac agggtgtgtt acccatctat ctctttctaa caaagcggaa 120 ttaaaccaac ggaaagatca aaatggagac tcgaaagaac acgctgaaga tcctgtttgc 180 gaccggtgcc aaggtgccgt cctacctcga ggtcttgaaa ttcatgtcga ccggtgtaaa 240 gataccagcc actgacgtac attcggtgta caaagatgaa aacgaccaga aattctacat 300 caagtttatg gacgagaaca gctacaacag gttctgcaac tcggtagagg atcagtattg 360 gttccggttc gatgacgggt caaggtctcc ggtgcaactg gagttggcca gcaggcagtt 420 caaatacatc cggcttttca acctgccacc ggagacggag gaaaaggaaa tcgctatggc 480 gctgagtaag ttcgggaaga ttcggcaaca cgtacgggag aagtatcctc aagacctggg 540 ctaccacgtg ttcagcggca tccgtggcgt ttacatggag gtcgagaaag aaataccagc 600 caacatgtac atcgcccatt tccgagcaag ggtgtattat gacggattga aaaatcgttg 660 cttcttctgc aaagctgaag ggcacatgaa ggtcgaatgc cctaagctgg ccagtctcag 720 ggtcaacgct gaaactggtg gtcagcgttc gtacagtagc gtaacagcaa atttggcgat 780 cgcaaacagc agtacgatcg agccagacaa cacgctcacc atgacaacgc ttccgcttac 840 atcaccacca gcacgaccca agatgccaat gcagccgcaa gagggagatg gcgcaagctc 900 agcgaagatc agtacaccag caaacaatcc atttgtaaca ctaacaccag cggaagttcc 960 gatggagacg ccagtaccac gaccagagga gacgacgaac aaccaggcgt cggcggcgac 1020 gacgacggag agaccagagg agacgacgaa caaccaggcg tcggcggcga cgacgacgga 1080 gacgccgaat aagccggtgg atgcgacggc gacggtagcg acgccgaagg agacatcggc 1140 tgctgaagca gaacgatcgg gacacgatgg gatgattatc gacggcgagc agttgatggt 1200 ggacgaagcc atcaagggga cagcggaagg ggagggaaaa ctggagttga aaaagcgacc 1260 gatagaaacg agtttgtcgc tcggcagcaa tgattcggat ctagggcaag gtaagggaag 1320 caatagtaag aaaaaacgtg gcgctcaggg ccaaggtaag ggcaagggga agggaagggg 1380 gaagtgagga gggctaaatg ttgaatatct tttcacactc gacttatttc ttcctttgat 1440 ggacatctat gagtccagcc atgaatacgt tcaagaagtg ttctttgcgt ttgattacgt 1500 ttgacatgat agtcttgata ttcatgcact ctactaagat taatccatcc ttctaatgcc 1560 ttttattcga aaatttgcaa ccgttaacct gaacgctatt agcagtaaat taaaattgtc 1620 tttgtttaaa gatttcgttt ggaataacga cctcgatttt gtatttgttc aagaagttgc 1680 tgttgaaaat tttaattttt tatcttccca cactgcactt gttaacataa gtgtagatgg 1740 taaaggaact ggaattttag ttcgaaaaaa tattgactat tcaaacgtca ttatgaatag 1800 caatggcagg ataacatctg cagaaattga tggaataaat tttattaata tctatgcaca 1860 ctcgggatcc aaatttaaga aagaaagaga taagctgttc acagaagata ttttaattca 1920 tttatcagaa actaaagaaa atgtattgct gggagatttc aattgcataa ttgacaaaaa 1980 cgattcaaac agttccataa aaaatgtgag taacggactt aaaaatatga cttcatcgtt 2040 aactttgatt gacattgaat taaatcacaa taaaaataga atatttactt ttgttcgagg 2100 aaactcaaaa tctaggttag atcgcatcta tgcttcaaaa gaatttgcag aaaatgtaag 2160 atctgtagaa actattgctg ttgctttttc ggatcaccac agcgtagttt tgaaacttga 2220 aattggtcat caatttatga atttttatgg acgtggatac tggaaaatta attctttttt 2280 actaaatgac ccagatataa tagacaaatt taaaaaaaag tatactgaat tgaaactgag 2340 aaactcttat tcaaatttaa atttttggtg gaataccgtt tttaaaagta atgcaaaatc 2400 tttttttaaa aaagaaaatt ttctttcaaa tcagcagtcg gcacgagaga aaagtttcta 2460 ctataattgt ttgaatgaaa tagtacagaa gcaagaacaa aatatcgatt gttcaaatga 2520 attatcggta gtgaaatcca aattaatgga actcgaacaa cggcgtttga gtaaaattaa 2580 attcaaatta aaagcggaaa cattacatgc cgatgaaaaa ctatcactgt ttcaagttac 2640 atcttttatt aaccgttctt caacatccaa tctgattaaa cttagaatta atgatgagat 2700 aacatctgat attaaaaaat taaagaatga aatttatcac cattactctc aaaagtttca 2760 aaatcagcac gttaacaaca ctgatgcaga aacggtttta ggatatttaa ataaaaaact 2820 tgaaatcaat gacaaaatcc atctaaacag accaatagaa atttatgaaa ttgaaaatgc 2880 tttaaaaaat gcttctagca gaagttcacc aggaccagat ggcctaagtt acgaatttta 2940 ttccattttc ttcgatgttg tgaaagatga ccttttaaat ttatttaacg attatttagt 3000 taatgaggaa tatccacctg cgttgtttac agcgggggtg ataactctca tccctaaagt 3060 aggagataaa tatgatttac aaaacaaacg tcctattagt atgctgaaca ctgattacaa 3120 attgttcact aaaatattgt ggaaccggat acaacccatg atgaaaaacg ttttgggtgc 3180 ggaacaagca gcttgtgtag aagataattc ctgcattaca aatttacgtt tattgagaaa 3240 tattttagta aaagcgaata agtcaaaaaa gtttaaagga gcaattttaa gtcttgatct 3300 tgaaaaagcg tttgatcgcg tagatcatga tttcctttgg aaggtgttga agaaaattga 3360 ttttcctgat gaattcatta attgtctaaa aaaactctac aaaaatgcaa cctcaaccgt 3420 tttgttcaat ggttttttga cggagccgtt ccaaatttta agctcagtga ggcaagggtg 3480 tcctcttagt atggcgcttt ttgcttttta tatagaaccg cttattagaa tgattgatca 3540 gaatattaat ggtgtgttaa ttgataacag atttcttaaa gtagtagcat atgcagatga 3600 catcaacatt tttgttaaaa atgatagcga gtttgatatt atactgcaac ttgttcatta 3660 ttacagtttg tattcaaaga ttaaacttaa cttgttcaaa tcacagtttt taagattcaa 3720 cagttgcatg attggccctc aacaaattaa agaagtagat gatttaaaaa ttttaggagt 3780 acatttttca caaagttttg ataaaacgat cgatcaaaat tataacggaa ttataaaatt 3840 tattaatatc agcttattac aacaatattc tagaagattg aatttattcc agaaagttta 3900 tgtgctaaac acatacattt tatccaaact atggtatgta gcgcaagtat ttcctcctga 3960 taataaacat ttggcttctt tgcggtccat atgtgggaag tttttatgga aaggacattt 4020 ttacaaagtt gaaagaaatg aattatattt accggttttg aaaggaggca ttgctctttt 4080 agacgtagaa tgtaaaacaa aatcactttt tattaaaaac atattatttg gtaagtcaga 4140 tgaacttgat acgttcatga taaaccaatt agttaacaaa aatttaacgc gaaatacaag 4200 agagtggctt caagaagcgg aggttctgaa aaatgaaaat catttgaaca ctagtaagaa 4260 attatatgat tattttattg aaagtaagaa gattgaaatt aaaaagcaaa aagaaaatcc 4320 ccacttgcag tggaatacac tttttgaaaa taccaacaag aacttcttgt caactacagc 4380 gaaatcttat ttatttatgg cctcacgaga tataattcca tgcaactcaa aattatttcg 4440 tcatggagta aagggtacag aatcaccttt ttgcgaattg tgtggtgccc aggatactgt 4500 agaacacagg tttaaagact gtgtacattc aaaaataatt tggacttggt taaataatat 4560 agtaattaac aaattaaaat tgaaagttaa agatgcagaa gaggttttgt ggtgtgattt 4620 caatgatcaa aatgtaaaat tgaaagcagc gttgtggtta actgttgaaa gtttagcgta 4680 ttgtttgcag acgtttggcg aaggaacgtt ggatgagttg aaaagtagat tgtgtgaatc 4740 gcggtggaat agtagagaaa ttttcaaaaa gcacttcaaa cattttttga atgtattcta 4800 ggtagtagtg taaaataaca agttgttgag tgtggggaga tatttttgag tctgtttgag 4860 ccacagtagg ataatagaga atagatagtc atgttattgt aaaatgtatt ttcattgctg 4920 ccaataaaaa gttacttcat aaaaaaaaaa aaaaaaaaaa 4960 // ID Gypsy-8_AC-I repbase; DNA; INV; 4473 BP. XX AC AASC02007279; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_AC_; KW Gypsy-8_AC-LTR; Gypsy-8_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4473 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02007279; Positions 8154 12626. XX CC 'GTCCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1252..2802 FT /product="Gypsy-8_AC-I_1p" FT /translation="MKVKLDTGAQVNIISEQQAKQVSALIVPSKVRLTSYS FT GNKIPVSGQVHLLCFHKGKKYYVDFVVASSPDAQSILGLKTCSQMNLIQRI FT HELKSQTGHQRPPELSNILKEYSDIFGEMGCIKDTHHIKVDHNVTPVVAVA FT RRVPLAMKPKLKVKLERMQRLGVIEKVEEPTDWVNPMIMVQKPNGDMRVCI FT DPKRLNEAIRREHYQLPTLEEVTIEMKGAKYFTKLDASSGFWQVPLDHKSS FT ELCTFATPFGRFKFLRLPLGSKSAPEVFQKIIHRFFGDIEGVVSYEDDLCI FT RSQSVEEHCKRLRQVLQRARENGLKFNKTKCEFFKTELKYLGHTLTESGLK FT TDPSKVTAIKNMPAPKNRQNLMRFLGMVMYLTKYILNMSEKTASLRQLLQK FT DVEWQWLSQHKKAVKDIKEILSTSPVLAFFDPNKPVVVSADACKDGLGAVL FT LKDNKPVAFASRSMTSAEVNYAMIEKETLAIVFAMERFHQYVYGRRVTVET FT DHKPLVAIQNKAYDKCPARIQ" XX SQ Sequence 4473 BP; 1532 A; 963 C; 1006 G; 972 T; 0 other; tggtgtcaga agtgcagaac taacgtctgt ttccccattg taactggcag gtaggcttta 60 taacattgcc tttgctgtaa tcacaatttt cattgttcac atttgtattt tgggcagcgc 120 atccgcccaa gtactggtac cgtgagcgca ggtactggta ccgtgagcgc aggtactggt 180 accgtgagcg cacccacaac gtgtttcgtt cagaccacgt ggagttatct gcaggtgcag 240 actgcaggta gtttcgggaa gcgtactgac cgcagagcgc atactcccca gatttgcttt 300 ctcgatacag accaaagttg gtcaggttcg tcgccgatga ccagattggt ggcagttaag 360 gataaaatac aaaatgtccc gactaaaatc accagaggaa cttagtttct cagactccca 420 tagcctcgca gagagatgga gaatgtgatc acaagaaatg agactatata ttgatctcac 480 catggcaaaa gaaaccaaaa aagacaaatg ctcagcattt ttatatatca ttggcagatc 540 tggaagagaa atctacaata cttggactat tcctgaagaa gaacagaaca aactcgaggt 600 cttgttcaga agatttgacc aatattgcaa accaaagcaa aacgtgactc tagagagata 660 taagttcaac agtagagtac aacagcctaa tgagtctctg gacgagtttg tcacagacct 720 gaaactctta gcaaaacttt gccagtatgg aactttggag gaagacatga ttagagaccg 780 tattgtcgtg ggaataaaga aaatggtagt aaaagaacga ctgctcagag agccagacct 840 caccgttgag tctgcaatgt ctatttgcag agcggaagaa gaatccagaa aagggctcac 900 cataatggct gaggaggcag aagcagtgaa tgtggtgaag actaagaagt tcgtcaaacc 960 accaggaaaa cggaagaata aagtcaagac atcaaccttc cactgtaaca gatgcggaag 1020 tactcatgaa agtaaagcgt gtccggcgtt tggaagaact tgcaacatat gcaagaaaca 1080 gaatcacttc gccaaagtgt gcagagctaa aaaagttcag caagtagtgg aatcaagaga 1140 acaaaggcgc catgagaggc gaagtgatga agagtttttc ataggtgcag tcaataacac 1200 taacaagagg aagaaaactg gtttgttcaa ctcaactgta gaggccagca gatgaaagtc 1260 aagctggaca caggagctca ggtgaacata atctcagagc aacaggcaaa acaagttagt 1320 gctttgatag tcccatcaaa agtaagactg acgtcatact cggggaacaa aattccagtg 1380 agtggccaag ttcacctcct ttgttttcat aaaggaaaga aatattatgt ggacttcgtg 1440 gtggcgtcat caccggatgc tcagtcaatt ttgggattga aaacatgttc tcaaatgaac 1500 ctcatccaga ggatacatga actgaagtca cagacaggac atcaaagacc tccagaactg 1560 tcaaatatac taaaggagta cagcgacatt tttggagaga tgggctgcat caaggataca 1620 catcacatca aggtagacca caacgtcaca cctgttgttg ctgtggctag acgagttccc 1680 ctagctatga agccaaagtt gaaagttaaa cttgaaagaa tgcagagact tggagtcatt 1740 gaaaaggtag aagaaccaac agactgggtg aaccccatga ttatggtaca gaaaccaaat 1800 ggagacatgc gggtctgcat tgatcccaaa aggctcaacg aggctataag aagggagcac 1860 taccagttac ctacactaga ggaagtcaca atagagatga aaggtgccaa gtacttcacc 1920 aagctcgatg cctcaagtgg tttctggcaa gtaccactag atcacaaaag ctcggagctc 1980 tgtacatttg ctacaccatt tggcagattc aagtttctca ggctgccatt gggaagcaag 2040 tcagcccctg aagtcttcca gaaaataata catagatttt ttggagatat tgaaggtgtt 2100 gtgagctatg aagatgacct ttgcatcagg agccaatcag tagaagaaca ttgtaaacga 2160 ttgcgccaag tcttgcagag agcaagagaa aacggactga agttcaacaa aacaaaatgc 2220 gaattcttca agacagaact gaagtacctt ggacacacac tcacagaatc aggattaaaa 2280 acggatccat caaaggtgac agccataaaa aacatgcctg caccaaagaa cagacaaaat 2340 ctgatgagat ttcttggcat ggtaatgtat ctgaccaaat acattctcaa catgtcagaa 2400 aaaactgcgt cactccgtca gctgctacag aaagatgtag aatggcaatg gttgtcccag 2460 cacaaaaaag ctgtgaagga cataaaagag atcctctcca caagcccagt gctggcattc 2520 tttgacccta acaagcctgt ggtcgtgtca gcagacgcat gcaaagatgg actaggagca 2580 gttctgctca aggacaacaa gccagttgcc tttgcatcca ggagtatgac cagtgcagaa 2640 gtaaactatg ccatgataga gaaagagaca ctggctatag tgtttgccat ggaaaggttc 2700 catcagtatg tctacggcag aagagtgact gtggaaactg atcacaaacc tctcgtcgca 2760 atacaaaata aggcgtacga caagtgcccg gcaaggatac aatgatttat gctcaggctc 2820 caagcatacg acgttgatat caaatacaaa aaaaggaaag gaccagatac tgtcagacac 2880 tctgtctaga gctgtagaga cagaagcgaa accagaaatt ctggaaaaag aaataaatgc 2940 atttgttgac atggtgatgc aacaagctgc agtgtcacaa gagaagttgg aagaaataaa 3000 acgtctccag ctggcagatg agcagtgtca gctactggca gacctcataa agaaaggttg 3060 gcccgagaac atctcaaaag cccccttagc agcaaagcca ttttggacct tcagagaaga 3120 actggtaatt tacaaaaagc tactcctcaa aggacaacaa attgtaatac ctgcggcctt 3180 gcagaaagaa atattatcaa aaattcacca aggacacatg ggcatggaga tgtccaagca 3240 aagagcatgc caaagcgtgt actggcctgg aatgtctaaa gacattgaaa gagtggttgg 3300 gacatgcccg atatgtctga aatatagaaa aactcaacaa agacagccac tgcaaagtca 3360 cgccattcca gaaagaccat ttgaaaaggt aggggcagat cttttccact tcggaagaaa 3420 aaactacttg cttgtcgtgg attacacaac aaaatttttt taagtcagtc tcctgaacga 3480 tacaacgagt ggaacggtca tacaccatat gaaggcaata tttgcaagga atggcatacc 3540 gctccagctt gtgactgaca atggacctca gttctcgtcc acagacttta gaaagttcac 3600 agagtcatgg gagattgaac ataagacatc cagtccactg tatccaaggt ctaatggact 3660 tgcagagaga acagtgcaga ctgtgaaaca aatcctcacg aagcaaagaa cagtggcaag 3720 gatcccaatc taagcatcct caacttccga acaacagaca aaactgacca agcctcacca 3780 gctgaaatgc tcatgggaag aaagctccgg acacttcttc cgtctgcaaa caggatcaaa 3840 aagatcatta gcagtcgaat tatcaaagac aggaaggaca aacatcatcc gtacaaagag 3900 aggtttgctt cgcagaaacc gacagcacct cttgcagtca ggagagcctc atcatgacct 3960 ccaacccgac cagtgcgagg gtggttcata cagaagcaat gtccacaata atgtccccgc 4020 acccatcaca cgtggtcaag ctggacctca actgtccaga ccactggtcg agttcccaaa 4080 ctcaccacga aatgatcaag aagtgtacga acacactcct gatcgtggag gcaccccgcc 4140 ttcaccggaa caccccgaca cacacaacgt tccacattat atcactagaa gtggtcgcac 4200 agtgaggaaa ccaaacaaat atacgttgta ggcatagaca ttaaaaatgt gattcactgt 4260 tgaccacaac actcatgcaa aatacactac gcatatttta cactaaagta aaagcttaga 4320 gctgaccagt ttaatgcttg tgttaaaact agctcaccac tggatacatt tatgtgtgtg 4380 atcaagtcat gatgtcatca agaactatcg ttctcaaggt tgatgttcaa catccgttac 4440 aaaaggcaaa tgtaactttt taaaaagggg gga 4473 // ID BEL-19_CQ-LTR repbase; DNA; INV; 474 BP. XX AC AAWU01039046; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-19_CQ_; KW BEL-19_CQ-I; BEL-19_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-474 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 192-192 (2011). XX DR Genome; AAWU01039046; Positions 62270 62743. XX SQ Sequence 474 BP; 131 A; 116 C; 114 G; 113 T; 0 other; tggtgcgatc aaagttgagt gcttgatcgc taccgatatt ggtagtgctg accagcgctc 60 caatcacatt gtcaacacaa aatgtgctag accaccacgg ccctatttgg ccaaactgag 120 tttatttgtt taaacaaact cactaacagc cgagtcggca attatcaagt gccgccggat 180 tgagctgcaa caatgttgca gctcattcag tgttcaaaca ccaccaataa tgtggccctt 240 ctcaaacaaa ttctgcgccc gaagtggcga tcgaaaacac cgtcagcacc cacgaggtgc 300 tgggatacag tatcgacgcg atgatgatcg tcgtcgtcat catcattatc atcatttggc 360 gctggcgacg caatgtgcgt cgcttgcggg aattggagtc ggtggccacg cggctggccc 420 gacaagcagt atgaattagg ctaagttcat atagaagaaa ataaaagtag ttca 474 // ID RSAI repbase; DNA; INV; 151 BP. XX AC M32422; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE G.morsitans centralis RsaI repetitive sequence. XX KW RSAI; RsaI repetitive sequence. XX OS Glossina morsitans OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Hippoboscoidea; Glossinidae; Glossina. XX RN [1] RP 1-151 RA Trick M. and Dover A.G.; RT "Unexpectedly slow homogenization within a repetitive DNA family RT shared between two subspecies of Tsetse fly."; RL J. Mol. Evol 20, 322-329 (1984). XX DR GenBank; M32422; Positions 1 151. XX SQ Sequence 151 BP; 45 A; 36 C; 31 G; 39 T; 0 other; actacaggtc catgacaaac tcgcccaatg tgctgcacgt agcatttgat gaaccacagc 60 aagctattag aatgcgcggt atgctaaatg ttgcttttga tacagaacaa cgatttcaga 120 atttgttgta acggctctca gcaccacacg t 151 // ID BEL-45_AA-I repbase; DNA; INV; 6236 BP. XX AC AAGE02018048; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-45_AA_; KW BEL-45_AA-LTR; BEL-45_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6236 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018048; Positions 23479 29714. XX CC 'CTTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 55..6234 FT /product="BEL-45_AA-I_1p" FT /translation="MMQPAKVCGKCSSGGIVGQMLGCDMCDVWFHAGCVGE FT TETSLNPDSTWKCSHCSKEERREVASQHTNKSGRSSTSSRARRELLLQQLE FT EQRALKLKQRAEEDEIRRKRAEEDEAFLKQKLDLILEDDNESRSSGLSSRA FT SRKKVVDWLNDGRGGQTVVTSSSHNGVIVQHSNAQQLAPSSTGHTTEQMLP FT TSVPPNGAPASTSTPQSGKTVKALDIDVPQTSKSLQLGSISSNANIGQHNF FT EISYSTQSSLATKNMGRQSFPLNASYQIPRVGNLYQKYNPKSVADTKVTFS FT GQYGAIVPPSCQILYGPQSGAVSTNWSSSVIPPDPRSSMITPVASEATIVP FT SVSVPYANVTASTLSTAHLDAGQQAYPVNIAANPTGLESGVQQPNASQIAA FT MCQPSPSSAQLAARQVMPRELPTFSGDPQDWPLFSSSFYNSTTACGFTDAE FT NLARLQRCLKGHALDSVKSRLLMPESVPHVMETLRKLYGRPEVLIHTLLRR FT LRSVPPPRTENLQSVITFGMAVRNLVDHMFVAQLLDHLRNPMLLHELVEKL FT PSQLKMQWSWYKRTQTEVNLETFGEFMNELVNTASDVTLPMDILAQPSKPS FT QAGRDKQKLYVHAESREEQATTSTVFSKSKQISEVNSSKKSCCYCSNEEHV FT AADCPQFKALDLDGRWKAIRSKGLCRLCLIPHRKWPCRSGKECGVGGCRLR FT HHALLHSQASPEATTSQRRSNSTTDSSGSRFTVQNHHTTLNYCLFRYLPVT FT LEMNGKQVETYAFLDDGCQTTLMEAGLAADLDIAGPSESLWLGWTSNISRE FT EKGSQRVTVNISGSGMTSQYRLSNVRIVQKLQLQGQTFQYEELRKTYPHLR FT GLPMRSYVDAVPRLIIGIEHAQLLTTLKVREGRSNEPVAVKTRLGWCVYGK FT QTAEGNSAIERIHLHTEEQQIRNRELHDLMKQYFAVEEAAVATPIESVEDS FT RARTILEKTTRRIKGGFETGLLWKYDNPVFPDSYPLAFRRLQCLEKRLAKD FT QELWKKVVALIRDYEAKGYAHKITQKELESTESGQYWFLPLGVVTNPKKPE FT KVRLIWDAASRVHGISFNDTLLKGPDMLTSLFAVLLRFRQRSVAVCGDIRE FT MFHQIRIISQDKQYQRFLFRERQDETPQIYVMDVATFGATCSPCSSQFVKN FT KNAREFETKYPRAAEAIIKAHYVDDYLDSVDSVEEAVQLVKEVKHVHSQGG FT FEIRNFISNSPDVLHQLGEQNSLQKKSLDLDGSGKVERVLGMVWKPTEDLF FT TFDVVLKDDLKQFLSQRALPTKRQVLRLVMSLFDPYGFIAHFVVHGKILMQ FT HIWRSGTEWDEEISPELYKMWQDWIRLLERLREVEVPRCFFGTTGSSIRAN FT IQLHVFVDASEQAYACAAYLRIVQNGVVHCALVAAKTKVAPVKPLSIPRLE FT LQACIIGCRLMETINAALNLNVEERFFWTDSTTALAWIKSDSRRYHPYVAF FT RVGEILNSSNVDEWYHVPSKQNVADVATKWGSGPDFRPNSRWYMGEPFLHQ FT SKSEWPKQAQKQWYTDEELRAVYHLHREVPQPLIDVNRFSNWNRLLRTAAY FT VCRAAKKFRNQSNNGQLTSDDLLQAENLLWRQTQFEAFPDEYSVLKYNVQH FT PNEPAVHLERNSKLYQESPFMDDAGVIRMNSRISAAPTASFETKYPVILPK FT EHAMTRLLVDSYHRRFLHGNNNTVFNEVRQLFRIPQLRSIIKRVTGECQSC FT KIKKAIPRSPMMAPLPAVRLTPHIRAFSYTGVDYFGPIFVKQGRSLVKRWA FT ALFTCLTIRAVHMEIVYSLSTQSCVMAIRRFVSRRGSPAAFYSDNGTCFRG FT ANNLLCEQIQALHQDCAVTFTNAKTSWHFNPPSAPHMGGCWERMVRSVKTA FT MAAIADHPHHPSDEVLETVLLEAEAIINSRPLTYVPLDDADQEVLTPNHFL FT LYGSTGVVQPRSALTIEGSTLRDSWKLSQYLVDLFWRRWVREYLPTLTRRT FT KWFHPIKPLETGDVVLVVDDTKRNGWLRGRIIEVMKGKDGQTRRAVVRTKD FT GLLTRPAVKLAVLDVQANRKSCLDAGSPELHGWG" XX SQ Sequence 6236 BP; 1756 A; 1459 C; 1615 G; 1406 T; 0 other; atcttcaaag attttccggt atttgttgcg tagtaagcaa ctcgaagctt cggaatgatg 60 cagccggcaa aagtatgtgg aaaatgctcg tctggaggaa tagttgggca gatgctagga 120 tgcgacatgt gcgatgtctg gttccacgct ggatgtgtcg gagaaacgga gacaagtcta 180 aacccggata gtacgtggaa gtgtagccat tgctccaagg aagaacgcag ggaagttgcc 240 agtcagcaca ccaacaagtc ggggagaagt tcaacgagta gtagagcccg tagagaacta 300 ctgttgcaac aactggagga acagcgagcg ttgaagctca agcagcgggc agaagaggat 360 gagatccgga ggaagcgagc tgaagaagat gaagcctttc tgaagcaaaa gctggatttg 420 attctggaag acgacaacga gagtagaagc agcggactga gcagtcgggc aagcaggaag 480 aaggttgttg attggcttaa cgatggtcga ggcggccaaa cagtagtgac cagttcgtca 540 cataacggag tcatcgttca acactctaat gcccagcagt tagcaccgtc gtcaacaggt 600 cataccacgg agcaaatgct gcctacttca gtgccaccca acggagctcc cgcctcaaca 660 tcaacaccac agtcaggtaa aactgtaaaa gctctagata tagacgtccc tcagactagc 720 aaatctctgc aattaggctc catttcctcc aatgcaaaca ttggacagca caattttgag 780 attagttact caacacagtc cagtttagca acaaaaaata tgggtagaca gtcctttccc 840 ctaaacgcat cctatcaaat tccccgggta ggcaatctat atcaaaaata taatcccaag 900 tcagtcgctg acacaaaagt gactttttcg ggacagtatg gtgcaatagt tccaccctcc 960 tgtcaaattc tgtacggtcc gcagtcaggg gcagtatcta ccaattggtc gtcgtcagta 1020 ataccaccag atccgcgaag ttcgatgatt actccggtcg ccagtgaagc aacaatagtg 1080 ccaagtgtat ccgtgccata cgcgaacgta acagcatcaa cgttatcaac cgcacatcta 1140 gacgcaggtc agcaagcata tcccgtcaac atagcagcaa atccaacagg actggagtct 1200 ggtgtgcaac agcctaatgc atcacaaata gcagcaatgt gtcagccttc gcccagtagt 1260 gcgcagttgg cggcaaggca agtgatgcca cgcgagctac cgactttctc cggtgatcca 1320 caggattggc cgttgttctc tagttccttt tacaattcaa cgacagcgtg tggattcaca 1380 gatgccgaga atttagccag gttacaacgt tgtctgaaag gtcatgctct cgattcggtg 1440 aaaagtcgtt tgttgatgcc agaatcggtg ccgcatgtga tggaaacact acgaaaattg 1500 tatggaaggc cagaagtact catccacaca ctcttgcgaa ggcttcgtag cgtaccaccg 1560 ccgaggacgg agaatctcca gtcggtcatc acgttcggaa tggctgtgag gaacctggtt 1620 gatcacatgt tcgttgcgca gctcttggat catctgcgta accctatgct acttcatgaa 1680 ttggtggaga agcttccatc gcagttaaag atgcagtggt cgtggtacaa gcgtacacaa 1740 acagaggtga acctggagac atttggggag ttcatgaacg agcttgtcaa cacagcatcg 1800 gacgtgaccc ttccaatgga catattggcg caaccgtcca aaccaagcca agcgggaaga 1860 gacaagcaga agctgtacgt acatgctgaa tcaagagagg agcaagcaac gacatcgact 1920 gtgttcagca aatcaaagca aatctcggag gtgaatagta gcaagaaatc ctgttgctac 1980 tgttctaatg aggaacatgt agcagccgat tgtcctcagt tcaaggcatt agatctagat 2040 ggaagatgga aagcaatccg atccaagggc ctatgcagat tatgcttgat tccgcatcgg 2100 aaatggccct gccgttcagg caaggaatgc ggagtcggcg gttgccggtt acgccatcat 2160 gcactgctcc attcgcaggc atctccagag gccacgacta gccaaaggag gtccaacagc 2220 acaacagatt caagcggatc taggtttacc gtacaaaatc atcatacgac attgaactac 2280 tgcttatttc gttacctgcc ggtgacgcta gaaatgaatg gaaagcaggt cgaaacgtat 2340 gcttttctcg acgatggttg tcaaacaacg ctcatggaag ctggattggc tgctgattta 2400 gacattgccg gaccaagtga atcgctctgg ctaggatgga cgagtaacat ctcgcgggag 2460 gaaaaggggt cacagcgcgt aacagttaat atatcgggca gcggaatgac gagtcaatac 2520 agattgagca acgtgcgcat agtgcagaaa cttcaattgc aaggacagac gtttcagtac 2580 gaggaactaa ggaagacgta ccctcactta cgtggtcttc caatgcgcag ttacgtcgac 2640 gctgtaccaa gactcatcat cgggattgaa catgcacaac tacttacaac cctgaaagtg 2700 cgagaaggca gatcaaacga gccagtagcg gttaagactc gactaggctg gtgcgtctac 2760 ggcaagcaga cggctgaagg taattctgcc atcgaacgaa ttcatctcca cacagaggag 2820 caacaaatca ggaaccgtga gctgcatgac ttgatgaagc aatactttgc tgtggaggaa 2880 gctgcggtgg ccaccccgat cgaatccgtc gaagacagtc gagctcgcac aatcttggag 2940 aaaaccaccc gtaggattaa aggaggtttt gaaactggat tactctggaa atacgacaat 3000 ccagtctttc cagacagcta ccctctagcc tttcgaagac ttcaatgttt ggagaagcga 3060 ctggctaagg atcaagagct ttggaagaaa gtagttgcac tgattcgtga ttatgaagca 3120 aagggttacg cgcacaaaat cactcaaaaa gagttggaat ccactgaatc gggccaatac 3180 tggtttcttc cattaggggt tgtgacgaac ccaaagaagc cagagaaagt tcgcctaatc 3240 tgggatgctg cgtcgcgggt gcatggcata tccttcaatg acacgctgct gaaaggacca 3300 gacatgctga cgtcgttgtt cgctgtttta ctgaggttca gacagaggtc ggtcgcggtt 3360 tgtggagaca tccgcgaaat gtttcatcaa attcgaataa tttcgcagga caagcaatat 3420 caaagattct tgttccgcga gcgccaagat gaaacacccc agatctacgt gatggacgtg 3480 gctacgtttg gcgcaacgtg ctctccctgt tcgtctcagt tcgttaagaa caagaacgcc 3540 cgggagtttg aaacgaagta cccgagagca gcagaggcga tcatcaaagc tcactatgtt 3600 gacgattacc tcgatagtgt agattccgtg gaagaagctg tgcagctagt gaaagaggtg 3660 aagcacgtac acagtcaagg tggtttcgaa attcgaaact ttatatcgaa ctctcccgat 3720 gtcctgcacc agttaggaga acaaaatagt ctgcagaaaa agtcgctcga tctagatggc 3780 tcaggaaagg tcgagcgtgt gcttggcatg gtatggaagc cgaccgagga cttgtttacc 3840 ttcgacgtgg tactgaaaga tgatcttaaa cagtttctgt cgcagcgagc tctaccaaca 3900 aagcgacaag tgctacggct ggtaatgtca cttttcgatc cttatggatt catcgcccac 3960 tttgtcgtcc acggcaagat attgatgcaa catatctgga ggagcggaac agaatgggac 4020 gaagaaattt cccccgagct gtataaaatg tggcaagact ggatacgttt gctggagcgg 4080 ttacgggagg tggaagtccc acgatgtttc tttggtacaa ccggcagcag catacgtgca 4140 aatatccagt tacacgtatt cgtcgacgca agcgagcaag cttatgcgtg tgcagcgtat 4200 cttcgtatag tgcaaaacgg agttgtccac tgcgcgcttg tcgcggcgaa aacgaaagtg 4260 gccccagtga aaccgctttc tattcctcgg cttgaactgc aggcttgcat aatcgggtgt 4320 cgtttgatgg aaaccataaa tgcagcactg aatctcaacg ttgaggagcg attcttctgg 4380 acggactcca cgacggcact tgcgtggatt aaatccgaca gccgccgcta tcatccatac 4440 gtggcatttc gagttggcga aattcttaac agctcaaacg tcgacgaatg gtaccacgtt 4500 ccttcaaagc aaaatgtggc agacgttgcc acaaaatggg gtagtggtcc agacttccga 4560 ccgaacagtc gttggtacat gggtgagcct ttccttcacc agtccaaatc cgagtggccg 4620 aaacaggcgc agaagcagtg gtacacggat gaagagttgc gtgcagtcta tcaccttcat 4680 cgggaagtgc ctcagccgct cattgatgta aacaggttct ccaattggaa tcgtttgcta 4740 cggacggctg cgtacgtttg tcgcgctgca aagaagtttc gcaatcagag taacaatgga 4800 cagctaacga gcgatgacct gttgcaggca gaaaatcttt tgtggcgaca gacacagttt 4860 gaagctttcc cggacgagta tagcgtgctg aagtataatg ttcagcatcc gaatgaacct 4920 gcagtgcatc tggaaaggaa cagcaaactg tatcaagagt cgccgttcat ggatgatgcg 4980 ggtgtaatta gaatgaacag cagaatttca gctgcaccaa ccgcttcgtt cgagacgaaa 5040 tatccagtca ttttgccaaa agagcatgct atgactcgtc tactggttga cagttaccac 5100 cgacgctttc tgcatggtaa caataataca gttttcaacg aagttcgaca gctcttcagg 5160 attccccaac tacgttcaat catcaagcgg gttaccggag aatgccagag ttgcaaaatt 5220 aagaaagcta ttccgaggtc acccatgatg gccccgcttc cagccgtacg actaactccg 5280 cacattcggg cgttcagcta tactggtgtg gactattttg gccctatatt tgtcaaacag 5340 ggtcgcagct tagtgaagcg gtgggcagca ttattcacgt gcctcactat acgtgcagtg 5400 cacatggaaa tcgtatatag tctgtcgact caatcctgcg tgatggcaat tcgtcgtttt 5460 gttagtcgaa gaggatctcc agcagcattc tactccgaca acggaacctg tttcaggggt 5520 gccaacaatc tgctgtgtga acagattcaa gcccttcacc aggactgtgc ggtaactttc 5580 accaatgcaa agacttcgtg gcatttcaat ccgccctctg cacctcacat gggaggatgc 5640 tgggagcgta tggtacgctc cgtcaaaacg gcaatggcgg ctatagcaga tcatccacat 5700 catccaagtg acgaagtgct tgagactgtg ctactagaag ctgaagcaat cattaactcg 5760 cgaccgctca cctacgttcc actcgatgac gcagatcaag aggttttaac accgaaccat 5820 ttcctactct acgggtcgac gggagtggtt caaccaaggt cagcattaac aatagaagga 5880 tctacattgc gggacagctg gaaactctca caatacttgg tggatctgtt ctggaggcga 5940 tgggtgcggg agtacttacc aactttgacc aggcgaacaa agtggttcca cccgataaaa 6000 ccgttagaaa ccggagatgt agtgctcgta gttgatgaca ccaaacgaaa tggatggcta 6060 cgaggacgga tcattgaagt aatgaagggg aaggacggac agaccaggag ggcagtagtt 6120 cgaaccaaag acggattgtt aacgaggcct gctgttaagt tggcagtact ggacgtacag 6180 gcaaaccgga agtcgtgcct agatgcaggc tccccggaac ttcacgggtg ggggaa 6236 // ID hATm-57_HM repbase; DNA; INV; 3500 BP. XX AC . XX DT 20-JAN-2009 (Rel. 14.02, Created) DT 20-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-57_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3500 RA Bao W. and Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 391-391 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 468..2744 FT /product="hATm-57_HM_1p" FT /translation="MRNSKLRKNDEKFDNIVGKLLPTLEAKMYSQLPTNGD FT VVRHYLYLTRFELKLTGKKEVASVVWEKVYVFWKMAGIPIKQKKDIVRMIL FT HLIHLRENLIKNAKKKSYAAQLLSFTNKLSLLFDIAASNAENLIRKDTLRS FT INSQISDIDFLSDQRHLKKQFLGPLDKSYAMKVKRMEEKEKNKIKAIEKEK FT SRKENIKEKSFNVLAAFDNEGKTHNKNEESTIARQKKKIKILENPKVSLMA FT DRLNLSNRERSGIILAVAEALGHNLDEISISKSTAYRSGNKIRQQVSNDLK FT KTFISPMYSSIHWDSKLVSEVDGKVERLAIVVTGKPSAPEGKLLCVSRIPD FT TTGRSQATATIDAVKTWKLEKTLVAMIFDTTPSNSGWINGSAKLIEEHFNK FT KLLWCACRHHIFERVLSSVYFKVFGECRSPNYEPFRHFKNEVWPKLSLTSD FT TTFRKIKIEEPVLKTKKAEVIIFLSGLLNRKSRKILRDDYRECCQLALQLL FT NPKQINIQWHKPGAFHHARWMCTILYTSKMFAFADLAEYEESFIEKLSRFC FT KFTLLFYVKIWLTCTSTVDAPFNDLEFYKEMLIYKKIDNDVAVAALETFNR FT HLWYLTEEMSPLSLFSDKVSDRDKTKMAKIILKNIKENCEHDIGFPQFPLL FT KQSTCLTDLIGPKSSNILNLFGYSNEDGGWLVESPKFWKTNTNFNHMFEHI FT KSLKSVNDTAERAVKLIQDYATSITKDDEQKEFLLRAVEHHRKVLPHINKQ FT AIQKLNF*" XX SQ Sequence 3500 BP; 1319 A; 467 C; 564 G; 1150 T; 0 other; tagggtgtcc caaaaaaaaa aaattttttt tttcctttag ttttgtgtgt agttttgtcc 60 gttaaaagtg aaaataagaa gatttttgaa aaattttgaa gtttgacccc cccgcccttc 120 ttttaaagtt tttaccccct ctgccgccca ctttttcaaa attattcact tttaaactga 180 ttttaggcta aatatataaa tgtattattg caattaaata tgttttgtat caatattgat 240 cactattata attctaaact atgtatcata gctaatatta gtaaatataa tttgtttcaa 300 ttcaacgatt ctcaaaaatt attaaatatt attattcaat gcttgagatg agaaaatttt 360 ttatgatgaa tattaatgtt acatacatta ttcaaataat tttataattt atagctactt 420 ttttatctat agatataata gcatatataa aaaatatttg atcagttatg agaaatagta 480 aattgaggaa aaatgatgag aagtttgata atattgttgg taaattatta cctacccttg 540 aagcaaagat gtatagtcag ctaccaacaa atggagatgt tgtacgccat tatctttatc 600 ttacaagatt tgaattgaaa ttaactggta aaaaagaagt tgcatctgta gtttgggaaa 660 aagtttatgt tttttggaaa atggcaggaa ttccaataaa acagaagaaa gatattgtca 720 gaatgatact acatctgatt catcttagag agaacttgat taaaaatgca aaaaaaaaaa 780 gttatgcagc tcaacttctg agttttacta ataaactatc attattgttt gatattgcag 840 catcaaatgc cgaaaatttg ataagaaaag ataccttgcg tagtatcaat agtcagatct 900 cagatataga ctttctctca gatcaacgtc acttaaaaaa gcaatttttg ggcccattag 960 acaagagtta tgctatgaaa gtgaaaagaa tggaggaaaa agagaaaaat aaaatcaagg 1020 ctattgaaaa agagaaatct agaaaagaaa acataaaaga aaaaagcttc aatgtattgg 1080 cagcatttga taatgaaggc aaaactcata ataaaaatga agaaagtact attgctagac 1140 agaaaaaaaa aatcaagata ttagaaaacc ctaaagtttc cttaatggct gataggttga 1200 atctatccaa cagagaaaga tctggaatca ttttagctgt agctgaagca ttaggtcaca 1260 atttagatga aatatctatc tcaaaaagca cagcgtacag atcaggtaat aaaattcgac 1320 agcaagtttc aaatgatctt aagaaaacat tcatttcacc catgtactct tccattcact 1380 gggattcaaa gctggtatct gaggtcgatg gtaaagtgga acgtttggct attgttgtaa 1440 caggaaaacc aagtgcacct gaagggaaat tgttgtgtgt atccaggatt cctgatacaa 1500 ctgggcgaag tcaggcaaca gctaccatag atgctgtaaa gacatggaaa ctggaaaaaa 1560 cacttgtggc aatgattttt gacacaactc catcaaattc aggttggata aatggttctg 1620 caaagttaat tgaagaacac tttaacaaaa agttactttg gtgcgcctgc cgacaccata 1680 tttttgaaag agttttgtca tctgtttact tcaaagtttt tggagaatgc agatctccaa 1740 attatgagcc atttagacac tttaaaaatg aggtatggcc taagctttca ttaacttcag 1800 atactacttt tagaaaaata aagatcgaag aaccagtatt aaagaccaaa aaagcagaag 1860 taataatttt tttgtctggt ttacttaacc ggaaatctcg aaaaatatta agagatgatt 1920 atcgagaatg ttgtcaactt gctttgcaat tactcaatcc aaaacaaata aatatacagt 1980 ggcacaaacc tggagccttc catcatgcta gatggatgtg tacaatcttg tacacatcaa 2040 agatgtttgc ttttgctgac ctggcagagt atgaagaaag ttttatagag aaattgtcaa 2100 ggttttgtaa atttacttta ctattttatg tgaaaatatg gcttacgtgc acaagcacag 2160 tcgatgctcc atttaatgac ctggaattct acaaagaaat gcttatttat aaaaaaattg 2220 acaatgatgt tgctgttgct gccctggaaa catttaatcg gcatctttgg tatttaactg 2280 aagaaatgtc ccctctttca ttattttctg ataaggtatc tgatcgagat aaaactaaaa 2340 tggctaaaat aattctgaaa aatataaaag aaaactgtga acatgatata ggctttccac 2400 aatttcctct tctgaagcag agtacttgtt tgactgattt aattggacca aaatcaagta 2460 acattctaaa tttattcgga tattcaaatg aggatggcgg ttggttggta gaatccccaa 2520 agttttggaa aactaacaca aattttaatc acatgtttga gcatatcaaa tcactgaaat 2580 ctgtaaacga tacagcagaa cgagctgtaa aactaattca agactatgca acatctatta 2640 caaaagatga tgagcagaaa gaatttcttt tacgtgcagt agaacaccac cgaaaagttc 2700 ttcctcacat aaataaacaa gcaattcaaa aacttaattt ttgaaaatat gctttaaaaa 2760 atctcttttt tattatacct ttaaaatatt gttattttgt tgttataatt taaaaaaaat 2820 gagattttga acttcttagt acttattccc attgtgcttt aatattgtta acttatacca 2880 atcatttcta tccttctttg tttaataatt gagtatttat caaatttttt gttttaatta 2940 aaaattataa tttataaaat tccttattta ggcagaaaaa aatcatacga caaaggtaat 3000 attagaaagg atgggggggg gggggagggt cataaaccag gtgaaaattt atttttaata 3060 aattttcatt tttatttatt acaaaactat ttgaaattat aataacagtt aattatattg 3120 gatataaata atagatatat aaaaggagta tattaaggat tgttttaggg caaaaacaca 3180 ttaaaggggt tatgaggggg gaggggtcaa atatgaattt ttttttgaaa ataactattg 3240 attatgtttt atcaaacaaa attacacaag aaaattatat aaaatacgaa catagttgat 3300 tatcaaaaat aattttagtc ctttgtgtca tttttttaga aaaaaaacac aaaaaaaaga 3360 gcggcggggg ggtcaaatat gaaaattttt tgattttttt tttttttttg cttttagtaa 3420 ataaaaatac atatgaaact gtttgaaaat ctagaaatag ttaaaaacta aaaaaattca 3480 tgtcattttg ggacacccta 3500 // ID CR1-8_BF repbase; DNA; INV; 3827 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-8_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-8_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3827 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3827 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1579-1579 (2009). XX DR [2] (Consensus) XX SQ Sequence 3827 BP; 1107 A; 955 C; 791 G; 974 T; 0 other; aacattgaga ggtcagaggt caccgaggtg ccaggtagcc ataacttttc ttcctaacac 60 catcttctct atccactgta atctcacagt atacagccat aaactttcct aaattcacat 120 atagtgttta actacatatt aagcttcctg tttttgttga catttctgtg atttttgaaa 180 ctgttaacag tactatatca gatcagttcc tcactgttct ctgaagtggg cacggcaccc 240 cgtagatctc aggcctccga atcttatcac tccgtttgtt tgttagcgag aactaggtat 300 tatgtaagta ctattttaaa ccaattttgt gtattatacg gatagaaaaa ttgtatgtat 360 tttggagcgt ttagtatagc caagttgtta tgaaatttta agcggcagca aacagtttta 420 atggtcatgc catattgata tgtaaatttc tcaactgtct aattaaatct atagctgcat 480 gaattaccag gagataggca gtaaattcct ctaggggtat attgatacat gtatttagat 540 tctgctgttt aattgatatg taaattctca gtgtagtgat gtggccaatg gatttgtaag 600 tcatagcagg ctgcacatta acctagcaag gagtccggtc ctagacctac caattgtaat 660 ctcacaatct actggtacct ccacatgtga tactatatgt tactacatcc atcatgttta 720 ccatagcaac tacgaatgtt agaagcttga atccttttac tgagactggt caccagaaaa 780 tcacagaact tgaaagccat ctccgaacca acaacatttc tgtgatgggt ctcactgaaa 840 cttggtttaa tgaaaatatt ctttcaaatg acatcagcat ccctggatat aacataatgc 900 atagaaagga cagactgggt aggactggtg gaggggtcgc cctcctcgtt agtgacacac 960 tcacaagtag acgaagatca gacttggaat cctgcgacaa cacttgggag gacttatgga 1020 ttgagattcg cgtggctgga cagaaaatcc tagcgtcatg cgtatacaga ccaccctctg 1080 caccagactc tttttacgat cagctggaaa acagcctctc caaagctgct tccgagaaag 1140 gattaacagt cattactggg gacttcaact gccaccacac cgagtggggg gactcgtcca 1200 cggacaacca cgcactacag ctggcggaca tcatcgaaag atacggtctg caccagtccc 1260 aacaccagcc aacccgctcc actctgacgc ggcaaagtac cctggacttg atcatactgt 1320 ctgacccatc cagaatcaat gaaatgttta ctctagcacc tgtaggaaat tccgatcatg 1380 ctactgttat atgtaagctg ggcatttctg tttcaacaaa gcacatcaag aagcatgtat 1440 ggtactacaa tcgtgcagac tttgataggt ttagggctga actaggtaaa gtcaactgga 1500 ataattgtat gtgtggtcct acaattgacg agaaatggaa tagctggaag gttaagttct 1560 taaacattgc caaaaacatt gtcccaaaca aaacaaaaaa ggtggtaact gcacgaaagc 1620 cttggattac gaccgaacta ctggaggcaa tcagtcaaaa aacggaactc tacaacacct 1680 tcaagtgctc cccaacacca gagaactgga gaaactacac tcaaaccaag aaccgcctaa 1740 cgaaagacct acgccgagcg gaggctgagt actactctac tgttagtgac agactgaaaa 1800 ctgctgaagg cgcccgagcc ttctggtctg tactcaaaca agcaacaggg aagagtaaat 1860 ccggcattcc tgcactctcc gtaaatggca tcgtcctgga caaagacaaa gataaggcag 1920 aaacccttaa tgacgtcttt gtcagagtca ctaaggaagc tacgcatcca aactttacaa 1980 gcaggctccc taagttcact gatgaagtac tctctacaat tcacgtctca gaggaagaag 2040 tcctgggtgt gctgcaagga ttacatccta acaaggctcc aggccccgat ggaataacca 2100 accgactcct taaggaggga gcaccagcca tcagcgcctc cctttgtcag cttttcaact 2160 tgtcactcgc tacaggacaa ataccaaaag actggaagca gtcaaacgtg tcggcaatat 2220 acaaaaaggg tgatcggact gatccatcaa actacagacc gatcgcactt ctgcccaccg 2280 tagccaaagt gttggagaga gtcgttcaca accgcctgta cacttacctt accgtcaaca 2340 atctgttaaa cgtcaaccag tctggtttca aaaaagggga tgggacagtc ctacaactcc 2400 tgcgattggt tgacaactgg gccaagtcca ttgatgactc caatgtctcc tgtacggccg 2460 ccgtctttct ggacgtccgt agagcgttcg atactgtgtg gcacgatgga ttaacctaca 2520 agctatcgcg ttacggagtt agtggaccac tcatcagctg gtttaatcac tacctcacag 2580 gccgccagca gcgtgtggtc attaatggtg ctacgtcgtc ttggaggtac acgtacgcgg 2640 gcgtcccaca gggcagtatc ctcggcccgc tgttattctt agtctacata aacgacatca 2700 aggacttacc ttgtacatcg gacatcaaca gttttgctga tgacacgtct ttgtcaaact 2760 cgggaccgac ggccgagcag gtagccagca ccacaaactc cgacttacag cttgtttctg 2820 actggtttga tgtctggggt ctagaactgc atccagggaa gtgcaaggtt atttgcatca 2880 agtctgcacg aagcagagta gtacttcctc ctatctacat ctcaggccag ctcatcgaac 2940 aggttccctt ctactcgcat cttggtgtca ccttacacca ggctttgggc tggaatgagc 3000 acgcacagaa tacttccagt aaggccagaa aggtacttgg atacctctgg aggttgcgag 3060 gcaagttgtc caaggaagca ctggaactgg cttaccttac ccttgtgcga cctaggctgg 3120 aatatgcctc catcctcttc accaacatga gtgtagccgc ctccaaagta ctagagcgcg 3180 tacagtacca tgcaggccgc ctagttacag gtgcagcgcc aagaacaccc tactcggacg 3240 tgcttcaaga gcttggatgg gacaggctct caactaggcg agactatcac cgtctagtca 3300 tcatgtacaa actggttacc ggcactgtac caccccacct gcaaccactc atccctcaaa 3360 caagacaaac ccaaacccag cttcgtctgc gaaactcttc acacctccat atccctcgct 3420 gcagaactac aacatacagc aacagcttta ttccatatac ctccagactg tggaacaacc 3480 taccaaacca catcaccgac tccaccactt tcccacagtt caaagcccgt tgcagaagcc 3540 acctcctgtc aggattttaa tattaaaccc gttgtatgat agtatactga ccttccacct 3600 actccaatct tttaagttag ctgacaaatg gtatagatct atctgtatag ttgcatgaca 3660 ccttttaatc attgattatt ttattgtatt atattgtatg acttctttgt atgttcgtcc 3720 gtatttttgt aggttctgcc gctaccagcc caaagctgcc taggcagacc cttgtaatga 3780 catgtctcat cgtcaataaa gaaagaaaga aagaaagaaa gaaagaa 3827 // ID Hopseu2 repbase; DNA; INV; 4962 BP. XX AC . XX DT 10-OCT-2009 (Rel. 14.1, Created) DT 10-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Hopseu2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hopseu2. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4962 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1351..3798 FT /product="Hopseu2_1p" FT /translation="MAPKKREKTNVWIYYVNNTESGQATCRMCNGTLKNNR FT VSNLKTHLWNRHNIKVLKITPIETKVDESSDQDSEISEKIIRKQKTTAVHK FT KQTLRQKRNALWKYFENNIDSGQAKCKICKAPLRNRVSNLKGHLFQQHDIN FT LYYAKRQPMKKIRVNVNKLLLRNNKYDVWNYYENIIDSGIARCKTCNGTLK FT NNRVSNLKTHLLKMHNLNLSVKKIQPIESASSEADSVHSVKIIPKEILKIN FT VNRKQLLRSFIGLVTEDCIPLKVLDSPNMRNIIGPICDGLEAAAGKPMSLK FT ASSCIKHLQLVASNIRLDIKTELKNKLLSFRIDSASRLCRNIVGISAQFLS FT EAQIKSRFLGMIELRKEPPKNVAVEVINVLKRYDIKVAQMVSATSDNGEPI FT EESGDDCSNEEYLKKIELVEKSPNILLGHVRVSRCAAHSAQLCVLDVTKSS FT EIINYIFTCRNLTKYIRKPSNRYHETFEQKQLKMPQLDCPTRWGSTYAMLE FT HLKLAKDVIPKNESVRSNIQDGSYVIDSFWEFIESYCVVFGPLQKTILKFQ FT EEQLHYGNFYAQWLKCKICTEKIVKDASQTLTTTIGNIILNSIDKRTNKFM FT NSKRLVSCLYLDPRFHHTLTAEQKVQARDYLKQVWDRITEVNPEVTCSASM FT VSTPDQSVGFDEEDELLNQYLTQGLLEANVGSKSDVYTKIETLQLPFHRID FT VDVLSFWQAKENSDQELYAISKVCFAIPPTQVNSIHFYYKFKVLLNIFHSS FT QVAMERQFSTLRLMLTDDWNQHGQETLENILLVKLNPNFLESAIDQLPIFQ FT NDHDPPPIESIKDE" XX SQ Sequence 4962 BP; 1571 A; 1028 C; 1020 G; 1343 T; 0 other; cggcttcatg ttgtactcgg taagggggaa agcctcatca gagaccaatg tatatgggga 60 tacacggttc gagcgcggca gagcagacgg cgatggtacg tgaatgctgt agtcatccag 120 gctgtatcgc aaatcagagt gcagccacac accagcgtcg ccaattttgg catctgttgc 180 gacattcaca tagacgaacc tatacttggc gtcaacgatt accattagca tcatggaggg 240 ttctccattg cttggcatgg gtcttgtttt gcaatgcttt ccttaaatat cgatgctata 300 attaattata ttaaaggaat gtaattattt atgttcctta ccatctatag ctccaatgca 360 attgggaaat ccccaaatat cgtagaagtc tgaggcgatt tgtttccatt cgtctgctga 420 tgtcggaaac ggtaggtatt cttcattgag cacattgtac aaaacaggta ttgtttcctt 480 taatatcttc gtaattgtcg gtatatatgt gggttttgtt tgacgaaagg gcatacggcc 540 agtggacaaa tagaacaggg tgactttaag tcgttcctct gaggtaatgc tgtttctgaa 600 tgggctcttc ttttcctgta taaaaggccg caccttttcc aaaattatat taaactgatc 660 tggcgagatt cgcaatactc ttccatagag tacggagtct tcatctgcta tttcccttcc 720 caacttacat tcgaagcctg aaattgatta atatccatga atttatacaa tacaaaaatt 780 gttttaatgt tttactggcc ttcaaactct cgttgcaaca atcccctttt taatgcctcc 840 atcttcctcc tgcgctggtt ctgctttcgc agcatgagca atgcagtcaa ataattattt 900 tgcattaata acaatgctgc acatagtttt cttctttttt ctatttccat tttgcttgtt 960 attttgtgta ctgtttatat tttgtgcatt cttcttttca tttttctgtg tagaagctac 1020 cagcactgtt tgtgccaaca acggtgtttg ccaacacgaa cagctgttcg ttcagcccga 1080 ttttgtggtt caaaagtaaa ttctgtgcaa gcaaaccaaa aaaagttgta aattattgct 1140 tttctctgcc tcaaactgta caaaatgaac tgtgatgctt tttgcacttc ttcgaatatc 1200 agaagtcgcg gtcaaagtag aaaaactaag caaaaactgt gcaagtcgcg agtggctgcg 1260 aagattgaag aagtcgcagt cgtgacggca gtcgtcgtcg atatttgcaa gtgttttccg 1320 aggctcaata agtaactcaa tctattgaca atggcgccaa aaaaacgcga aaaaactaat 1380 gtttggattt actacgtaaa taatacagaa agcgggcagg ctacgtgcag aatgtgtaat 1440 ggcaccttaa aaaacaatag ggtgtcgaat ctaaaaacac atttatggaa tcgtcacaat 1500 ataaaagttt taaaaataac cccgatcgaa accaaagttg acgagtcatc cgatcaggat 1560 tccgaaattt ctgaaaaaat aatacgcaag cagaaaacaa cagctgtgca taagaaacaa 1620 acacttagac aaaaacgtaa tgctctttgg aaatattttg aaaataatat agacagcggt 1680 caggccaagt gcaaaatatg caaagccccg ctaagaaaca gagtttcaaa tctgaaaggt 1740 catttattcc aacagcacga tataaactta tattatgcca aaagacaacc catgaaaaaa 1800 attagagtta atgtgaacaa gctgttactt cgaaacaata aatatgatgt ttggaattac 1860 tatgaaaata ttatagacag cggtatagcc aggtgcaaga catgcaatgg cacactgaaa 1920 aacaatagag tttcaaactt aaaaacacac ttattgaaga tgcataatct aaacctatca 1980 gttaaaaaaa ttcaaccgat cgaatcagcc tcatccgagg cggattcagt gcattctgtc 2040 aaaataatac ccaaggaaat attaaaaatt aatgtgaata gaaaacagtt acttagatct 2100 tttattggtc ttgtgaccga agactgcata cccttaaagg tgttggattc accgaacatg 2160 aggaatatca ttggcccaat ttgcgacggg ctggaagctg cagctggaaa gccaatgagc 2220 ttaaaggcct caagctgcat taaacatctg caattggttg catctaacat aagacttgac 2280 atcaaaactg aactgaaaaa caagctgctg tcattcagaa tagacagtgc ctcgcggcta 2340 tgcagaaata tagtgggaat cagtgcacaa ttcctaagtg aggctcaaat aaaatcccgc 2400 tttttaggaa tgattgaact tagaaaagag ccacccaaaa atgttgccgt cgaagtgatc 2460 aatgttttaa aaaggtacga cattaaagta gcccaaatgg tgtcagctac atccgataac 2520 ggagagccca tagaagaaag cggagacgac tgttccaacg aagagtatct aaaaaaaatc 2580 gagttggtag aaaaatcacc caacatccta ttaggacatg ttcgagtctc tcgctgtgct 2640 gcacacagtg cccaactttg tgtccttgat gttacaaaat cctcggaaat aattaattac 2700 atattcactt gtcgtaattt gacgaaatat atcagaaaac catcgaaccg atatcacgaa 2760 acttttgagc aaaaacaact aaagatgccc cagctagact gccccaccag atggggctcg 2820 acctatgcga tgttggaaca tttgaagctt gctaaagacg tgatacccaa aaacgaatcg 2880 gtgagaagta acattcaaga cggaagctat gtaatagact cgttttggga attcatcgaa 2940 agctattgcg tcgtattcgg tcccttgcaa aaaacaatac tcaaattcca agaggagcaa 3000 ctgcactacg gcaattttta tgctcaatgg ttaaaatgca agatatgcac tgaaaaaatc 3060 gtaaaggatg ccagtcaaac tttgacgaca acaattggaa atataatttt gaactccatc 3120 gataagcgaa caaacaaatt tatgaatagt aaacgtttag tttcgtgttt gtatttggat 3180 cctcgattcc atcatacctt gacagcagag caaaaagtgc aagctagaga ttatttaaaa 3240 caagtttggg atagaatcac ggaagttaat cccgaggtca cttgttcagc atcgatggtt 3300 tcgactccag atcaatccgt tggctttgac gaagaagatg agttactaaa ccaatacttg 3360 actcaaggac tacttgaagc gaacgttgga agtaaaagcg atgtgtatac aaagattgag 3420 actcttcagc ttccatttca cagaattgat gtagacgttc tatcgttttg gcaagcgaaa 3480 gagaactctg atcaggagct gtatgcaata agcaaggtgt gctttgcaat accgccaact 3540 caggtaaata gtattcattt ttattataaa tttaaagttt tattaaatat ttttcattca 3600 tcacaggtgg cgatggagcg acaattttct acgctacgac tgatgcttac tgacgattgg 3660 aatcaacatg gtcaggaaac actagaaaac atattgttgg tgaaattaaa tccaaatttt 3720 cttgaatcag ccattgacca gttgccgatt ttccaaaacg accacgatcc ccccccgata 3780 gaatctataa aagatgaata attttaaatg aacaatttac aaaaaaatta aatattaaaa 3840 tattaaaaat caactggcac tttttcgctt tgacgacttg cacaccagcc gtgggccgac 3900 tgacgtcagt tttcgttact caggaaagtg gaagtgctgt gcagctttga ctgcgaattc 3960 ttatttcgtc tatttatata actacaatat gtagtctata acttcaatat taaatatgaa 4020 atagtttaag cataaacatc tatttatcta aatttatttc catgcatttt cgctcggatt 4080 tattttcccg cgacgttttg caacactaac aaccacatac atttgaatta cgaaaaagtg 4140 tgccaaagtt ttcgcgaatc tagttacaag ttgacacaaa tatgcacata agcataaaac 4200 gggctgcata tatgcggcca taaatcacgg gccacgtgga tacgcagtgc gtatgctata 4260 tgcaaatgaa tacggctaca aagtgccggc gataaaacgc tgccgactta tcgccagtgg 4320 aagcgccgcc aagtcgccgc caacaatttt tggaggtcta ggtgagacta taaattggcc 4380 gtcgtcccag tccggtccca cagaaaataa gggaacgtgg cgtaatggca aagttggaac 4440 gaattctcac atctttgttc cttctaggcg tggccggggc cctcccgatg gtggacctgg 4500 actctgagga cgtgatagac ttgtcggatc tgggggaggc tatattcggt aatccggaca 4560 gcgagaccac gggcgtcctg gtcgatgccc atgatgagca gtcccaacag aatcccgagg 4620 agctgggaac ctactacgaa ggcgacatcc tcattcctct aagttatcgc gaggcgcgtt 4680 ccaacggcac ccgcaacggc atcctcgcgc agagcttccg ctggccggga gccgtcgtgc 4740 cctacgagat caagggtccc tttacgaccc aagagctggg caacattaac cacgccttca 4800 aggagtacca caccaggacc tgcgtgcgct tcaagccgcg ctccaccgaa aaggactata 4860 tatcaattgg aagcggaaag tcgggctgct ggagcagcat tgggcgactg ggcggacgcc 4920 aggaggtcaa tctgcagtcg cccaactgcc tccggaccta tg 4962 // ID EnSpm-2_BF repbase; DNA; INV; 12364 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-2_BF autonomous DNA transposon - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-12364 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-12364 RA Kapitonov V. and Jurka J.; RT "EnSpm-2_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 789-789 (2008). XX DR [2] (Consensus) XX CC This autonomous transposon shares common termini with the CC non-autonomous EnSpm-N3_BF. It is characterized by 3-bp TSDs and CC codes for the 761-aa En/Spm transposase. XX FH Key Location/Qualifiers FT CDS 5810..8110 FT /product="EnSpm-2_BF_1p" FT /note="transposase." FT /translation="MTWKVLGLNRDNFTKYCVCPDPKCCKLFDIDKLSETV FT NGTEKPRRCDNVRMVRNRSVRCNTLLSRQVITSVSGVRKYYPFKVYCMKSV FT IETLESFVKRKGFEEKCELWRDRDMSDQELLGDVFDGQVWRDFQEWNGKDF FT LSAPRNYGLMLNLDWFQPFKRRNDYSVGAIYMTVMNLPRKERFKVENVILV FT GIMPALKSEPKCLTHFLQPIVTELQFLWQGVHFTSYMSPKFPLKFRAALLC FT CAADIPAARKLCGFLGHAAKIGCSKCKKKFEKVGKKTNYSGFDKASWHLRS FT DAQHRREGLQVQSCCNVTRRKELEKSFGTRYTPLMDLEYYQSVRFCILDPM FT HNLFLGTAKRMFKLWIDREILTKTKLNKLEKRIGTINLPSGIGRLPTRITS FT NYGSFTADEWKNWTLYYSMYCLHDLLPQEHYQCWQMFVLTCQRLCKPAVSR FT ADIDWADSAFVMFGKKVERLFGELSITPNMHLHCHLKDCILDYGPMHGFWL FT FSFERYNGLLGSETTNNREVELQIMRKFLISDFAHNLPLPSYLSDVFGPIF FT ERFSPKDKVADVPSEKERMLLKMCASRYISNFDWSADEQTIKLPPVFTRNA FT LSPDDKEFLHNTYNEMYPNKEINASNLATVVRKYGTCQVGQELFGSRIAAR FT SSRAAYVMAAWLGENGNIDSSSEHRPGVIQNFYAHSVEIGGEVKCHVFAVV FT RWYRTSPCKSDIGNGKPFTVWEKSTEVGGPASFIPVQRLYRKSAWAETVMD FT GKKVVVVCPIARKTLL" XX SQ Sequence 12364 BP; 3652 A; 2430 C; 2569 G; 3713 T; 0 other; cactgcatga aaactgctct ttaagcactc ttaaagtgtg cttaagggta ccgtattggc 60 attctttaag ttacattacg tcatgtttac gcattcggaa atattatatt taagctatga 120 ttacgtgcgt ccgtaaagga cacgttgtgg cacacattta agcaactttt aagtatcgta 180 aactatgctt aaattcgtac gtttacacag cgtttaagca ccgtaaatgc tgcttaaatt 240 gagtgtcata cgtcgactta aatagtccgt aaagagcacg tagtggcaca catttaagca 300 acttttaagt actttaaact aagcttaaat ccaaatgttt aaacattgtt taagcaccgt 360 aaatgtatct taaatcgtat atttttttgc cttacgtctg gtttaagtag gttttaagta 420 ttttaatata gtcttaaata ttttatttaa gctgctttta agaatcgtaa tgctccttaa 480 atattgtgtg ttgaatagta ttatatgccg tgtaaccgtg tatgctgcag caggcataca 540 tcaatgtaag aatgctatcg acttaacctg aaatattgac aaacagtcca caatatcaag 600 tcaacattct ttttattcta ttctcagagt tccatataca caaggcaatc acattgaaaa 660 ccagcatgtt gttgtatcag attatcttaa accgattaaa aacatagctc atttatgaat 720 gtcaaaaaaa cgatattgaa ttcgataaca atgagatatt tccgacttac aggcggttcg 780 ggtgacatta atcactcttc ttcagtttta caagggttga ttaactggta gaataaacaa 840 gtaaacatac ctggaaaaac cggttgaaca tgtaaaaaac actaactcac aggggctgta 900 attactagta gtaaattgtg tttacatagt ataaaggaca aacaaatgtt gacattaatt 960 actcaaattg gttgttgcaa acgactattc tgtggctaca aacttgactt tgaaaactct 1020 ttgagaaaaa aactgctaca ctaaactaat caaacaaaat ctgcttctaa ttgtctgtcg 1080 tcctactcgc tatccaactc cagtctcagc tcgctaacgt cagttgctgg tgctctgccg 1140 ttctgatacc ttttgctgat cgcccactcc ggggtcccct ctggtacttt tctgtccgaa 1200 agctcacgag aagggattcg aggagagccc cgaagtggca cgacagcgtc cagcgactcc 1260 ttgattttgg tcatctttcc tgactcccag gccagctttc tgaccttgaa ggttttcggc 1320 ccctccactt cactgtcgga agactccgac gacatgaagt ctttgtggag agtctcctcc 1380 gcccgtctcc tagacttctc tgaccagctg tcgcgtgcca ggttgtacgc ctttcgtcgt 1440 ctcgccagtt tctgaaatga gatatgagcc tgatgtcaaa catttctcca tggtgaagta 1500 catactatac agctaactat caattgtcac ttggttgcaa tacgatggct aaaatgtgct 1560 ttttgatgta agaataataa aaaatgttat gcatacaaat ttgtatatat tcctgtggta 1620 tggagtgaaa tactattaaa caatgcaata gataagagcc gtgcttcaag agacaatctc 1680 ccatatatta taagtatttt tttccttgat ataaagaaat accggctctc agtaacattg 1740 tggcgttatt ttcctggatt gaaaagttga ggagaacgag tcacaaaact tcagaatcac 1800 gacttcatat gttaactttt aatcctacag gtattgcgct ttgacattac atgtgtataa 1860 gactaacctg ttgctgcctc ccgtttcttt tgaccgagtg tcggtgctcc gcatacgtcc 1920 ccttctgggc ccgcacctgg tcatccctct tggatctata gtacctcttg gcagcatcta 1980 gaatgaagga aaatgaactt aaaacttgca ggcaaatcgg tttctacaac tggataagat 2040 atggagatta attccggaca tgaatttaca ttgagggcta atactgctat acatgagatg 2100 tcactgggaa aaccggtttt acctgagtca acaagactta gtcgtcacat atgcaaatgt 2160 gtttgaatag ttaatatagg ttctaaggta acgccataga aactaagatt tgtaaaagta 2220 tcgttgacac cgtgaagttt ctgctcactg gccgtagaga tttaaaagtc agtgctattt 2280 gcacaataca tgcgatttag tgctccaacc ctgctgcagt ataaccgtag ttgcatcttg 2340 cactgaattt acagtttgaa aaaaagatat ttagggagaa atcagagctt gcttcttcaa 2400 acgattaatc ttacatactt ggtgtgtaat ttacttaccc ttcagttcgg ctgatgaaaa 2460 aggacacttc tcaggccctc catgctcttt catcacctcg tctctcaaga tcttcacaat 2520 gtgctggttg gctgctgagt ccatactatg ccaaaaaatg aataaattat gttatcaatc 2580 tacaagtact agaggaattt gtttcaggat tatcaattta tcaaacagga tgttatcata 2640 gggtttcttt catggtcatt actgacggaa gggagtcata taaacatagt gaacatgtta 2700 aacggtacat ctggtactat cattatgtct tgacaatgcc atggcaagag aaacgtctta 2760 ttaaatagga gttgctatac cgttcagtaa gccgaaagcc ctgaaagtct ccttcgcctt 2820 cttccatcgc ccggtacacc ttgcgcacgg catcctgcaa aacattttaa acaaatgtat 2880 tagctgtctt tgcgcgcaag tttgtaacac tcgcatggga aaattcatga aaattcaaaa 2940 gtcattgggg aatacgtcta atttggctgt tgctggcggt ctaacaagct cgcacgctag 3000 cctggggagc tgtgttcatc ggtccgtttc ttgttaaaaa cctgctagta tgcgtgctag 3060 gtatctacat aagtggccag cattatacag atacaagcgc taaaatgtgt ttgaccaact 3120 acctgaataa tgctacggtc acattattga aaaaaaagac tcaacggcac tttacgggca 3180 cttttgggga accccttttt tcagtcgttc gggtgtgagc catacatggc cccgtgtgac 3240 gccccgtggc taacgggcat ttggggttgg ggttacaccc catacgggaa acgatgcaaa 3300 tatgaccaaa atgacaaaac tcaaaataaa tcacctaccc tgcaagtgga tggaatagag 3360 gatttcctcc gaaccctgcg gcgttggcga gcgtgccctg gcgacgggct gtcctctcgg 3420 atgtcccgcg gcggttcttg attagcctgc cggcgttcct gctcctggag ctgtgccttc 3480 agtgcctgca gttctgtggt ttgagactca aacttcctaa ccatatcctg aaggatgatg 3540 gcaaggttct cgctgtctcc tcgttccatg gggttgtcag tacgagaaat cgcggacatt 3600 ttgtaaaacg gtgaaaatgt taacgttgca cactctagaa tcgtcagact tagaatgacg 3660 ctaggtgttc agttcaaaaa ttcaaaactt gcatcatcaa cgatgattgg ccatcgtcat 3720 aaggaaactg catgtgattg gcttatttct tatcaaaatt tacatgtccg atagacacaa 3780 tgatatgtga attgtcgggc acaccttttg taaatcggcg tgggtgccaa gaccagtcat 3840 tgttacctgg cgtgggatcg tcctgccatg ccaccacaaa acaataacaa acacaacagt 3900 tcactcaaga tattcatgaa atgtcagacg tttttttgca aagtatagac atactgattg 3960 cctggcggac atttcgttga aatataatgc accgatttcc atcaaaattg ccccgcaaca 4020 gtcatcacct gcttctttta gaaagacttg aaatgttgtc tttaaacagt ttcctattgt 4080 acgaaaacga gtcacacagc cagtgaacat tttggtggca ttcggtagaa gttttccaca 4140 gaagttgcac gggctggaca tgtactccag gtaagacatt tccaaattgt atatcttagt 4200 tacattgtca gattttaaca aggtacatta tccgcggcgc ccgtacgcgt atatgccttt 4260 aacatatagg catcactgta taaaatactg aaacgagaat ctttatatta gtaggatggg 4320 cgaatatcaa gaaatatatc gattttgcct agaatgcctt gccctgttac ttttagtaaa 4380 ccaaggcttt tgaatgacgc cacttgcttc attttttcct actgtgtgac taatacataa 4440 aacttataca cctgtcccta tttcactttt acaggtaata tggacggcac tgctacaccg 4500 gttagctcca agcggaagag gtattgtggc cactgcaaag cttataagcc tgcacgtacc 4560 tatttccgac acaagaacga ctacttcaac gcggctacgg ggcagtggaa gcaggccaag 4620 gctgacgaaa cacggcccgc gcggcccctg ttccagagcc ctgtaaacgt gcctgatccg 4680 ctgctgaact cgagcagtag tagcgctgag agtgattcag acgcagaaac gaccgcaaat 4740 ggtaagggga atttaatatt aagtttacaa agtcaaaatc tattttttca ggtcccagta 4800 atgatatata cattagtcgg tgactgaata tagtcgaacg ttaccgaata tctgaacaat 4860 cgtacttttc actttgttaa aggctgtaaa cattcagtca atttatgatg attggcaaat 4920 ttaatacatg ccgtaacact cttttaccag gtgaccagtc aagttctgtc gtcggtgcga 4980 gcccttgcaa cagttcgcaa cagtcagtgt cgtccagcga tgagtccgta agtattatat 5040 tcatgctgtg gatttgccac aagtttctag tatttcgtat gtatgcgttg aaaacatttg 5100 tttcttgttt atctgagatt tatcaaaatg acctatattg accacaaccc atagaatgtt 5160 tttgactata cagccttata aataccacga actaatatca actatatcgt acctttgaaa 5220 tcaatgatga ttatgatttt tgcgacagta cttgtaccgg taaaagatgc tgttttacgc 5280 caacgggcgt gtacgccacg ggcggttaga cgggctatat agatacagac ggtataacag 5340 aatcatgttc tattatatta aagcagattt agtatttatg agtaaaaagt acatagcgct 5400 ttccgtagaa agaaaagtta gaaaagtgtt cttaaacatt tgcgatgtcg ccttttagga 5460 agctagctca tctgaagaca actcaaccga tgagagctcg tgggaagaaa gcttggacag 5520 cggtgatgag cacggggagg agatttggga gggtcaggag ttggaagacc agcttctgaa 5580 gaacctccac gcatccgccc ttgctgacag ctgtgaacgc ccgacaaaga cctccaggga 5640 tgatcctatc gttaccttgg taacatggct gcttactttt ctgttgttat ggcaagctca 5700 caaccatgtc agtgataatg gtttacagca tcttatcact tttctgtgcc cctggcttga 5760 caagctagga ataaatccta actatctagg cgggctgtcc tccctctaca tgacatggaa 5820 agtcttggga ctgaacaggg ataactttac aaagtactgt gtgtgtcctg acccgaaatg 5880 ttgcaaactg tttgatatcg acaagttaag tgaaactgtc aatgggacag agaaaccaag 5940 acggtgtgat aatgtaagaa tggtacgaaa ccgaagtgtt agatgcaata cgttgttatc 6000 taggcaagtc ataacttctg tttctggtgt aaggaagtac tatcctttca aggtatactg 6060 tatgaagagt gtgatagaaa cacttgagtc atttgtaaaa aggaaaggat ttgaggagaa 6120 atgtgagcta tggagggata gagacatgtc agatcaggaa ttactaggag atgtgtttga 6180 tggacaggtt tggcgtgatt tccaagaatg gaatgggaaa gactttctgt cagcgccgag 6240 gaattacggc ttgatgctaa atctagactg gttccaacca ttcaagcgca ggaacgacta 6300 ttctgttggg gcgatatata tgacggttat gaatctgcca cggaaagaaa ggtttaaagt 6360 tgaaaacgtc atattggttg ggataatgcc agcgctaaaa agcgaaccaa aatgtctgac 6420 acatttcctt caacctattg taacagaact acagtttctg tggcagggag ttcatttcac 6480 gtcgtatatg tcccccaagt ttcctctgaa gtttcgagct gctctccttt gctgcgctgc 6540 tgatattccg gctgctagaa aactgtgcgg ttttctaggg cacgcagcaa aaattggttg 6600 ttctaaatgt aagaagaaat ttgagaaggt tggtaagaaa acaaactatt ctggattcga 6660 caaagccagt tggcatttgc gaagcgacgc gcagcacagg agagaaggat tacaagtaca 6720 gagctgctgt aatgtgacta ggaggaaaga gcttgaaaag tctttcggta cacgctacac 6780 accattaatg gatcttgagt actaccaaag tgttcgattc tgtatcttgg atcctatgca 6840 caaccttttt cttggaacgg ctaagagaat gttcaagctt tggatagacc gtgaaattct 6900 gactaaaact aagctcaaca aattagagaa acgcatagga acaataaatc ttccaagcgg 6960 aattgggcgt cttccaacca gaattacctc aaattatggg tctttcactg ctgacgagtg 7020 gaagaattgg acactgtatt atagtatgta ttgtttgcac gacttgttgc cacaagaaca 7080 ttaccaatgt tggcaaatgt ttgtcttgac ttgtcagcgt ctgtgcaaac ccgcggttag 7140 tcgcgccgac atagactggg cagattcggc ttttgtaatg tttgggaaaa aggtggaaag 7200 gctgtttggc gaactgtcca tcacacccaa tatgcatcta cattgccatt tgaaagactg 7260 tattcttgat tatggcccta tgcatggttt ttggctgttc agctttgaga ggtacaatgg 7320 cctgttgggt tcagagacca caaacaacag ggaagttgaa ctccaaataa tgaggaaatt 7380 cctcattagt gattttgcac ataatctccc tttacctagc tacctaagtg acgtctttgg 7440 tccaatattt gaaagattta gtccaaaaga caaagtagct gatgtgccaa gtgaaaaaga 7500 aagaatgttg ttgaaaatgt gtgcctcccg ctatatctca aactttgatt ggtcagctga 7560 cgaacagaca atcaaactcc cgcccgtgtt tactagaaat gcactgtcgc cagatgacaa 7620 agagttcctg cataatacgt ataatgagat gtaccctaac aaggaaataa atgcttcaaa 7680 cctggcaaca gttgttagaa agtacggaac ttgtcaagtt gggcaggaac tgtttggctc 7740 taggatagca gctaggtcaa gtagagcagc gtatgtcatg gctgcgtggt tgggtgaaaa 7800 cggaaacatt gattcaagct ctgaacacag gcctggtgta attcagaact tttatgcaca 7860 ctctgtggaa atcggtgggg aggtgaaatg ccatgtgttt gcagtagttc gctggtatag 7920 gacatctccc tgtaagtctg acattggaaa cgggaagccg tttactgtat gggagaagtc 7980 aaccgaagtt ggcggcccgg cttcgttcat acccgtgcag agactgtatc gaaaatctgc 8040 atgggcagag actgtaatgg acgggaaaaa agttgtcgtt gtctgcccaa ttgctagaaa 8100 aactttgcta tagaatggcc gggtgaatga catctactgt tgtatataga ttgtgtatat 8160 agatcttaaa cttgtgaata tgatgtatct agaattgcca acattagaag ttgacttgtt 8220 aggcagtaag gacatgagca cattccttga taagcattag aaaatacaca ctgctgggtt 8280 attgacgata ctagtatata gggatgtata aactattgta catactgtat tttactgata 8340 cgcacatgtt cttcttctat gtatagatac ttgggtcata caaacataca agtgaggccg 8400 atcgaggctt tgtaatatac gaatcgccat attggtattt actgaacgtg tttattacct 8460 gatccaacac ggaaagaaag caacgcattg taattattgt gaatgtacct tttgactgtt 8520 tgaatttgta ctattgaacg acactataaa gctgtcaaca tggttgggtc tgacatatat 8580 tttgtgttgt tgcaatttta ataaaatcaa aagcaaatca actgtgacat cgtcctgtct 8640 gatattcgtg cgttaagaaa gtgcagatta tttcatggta gaacttgaaa ccccactggt 8700 accatatcta acaccccggg gccattccac aagaacggcg acctttatga tcagaacaaa 8760 attagcgctt gccctgcgcc accttggttt ttctaatgac tacgcctgca aattctttgg 8820 ataattaaac tgctgttttt ggaaactttc tattctagac gtcttacata agttgcatca 8880 cacttgttct tccaaatttc gtgaccgatc atactaatac ctttcaatgc cacgaataca 8940 catttttttt cacaaacaag aaatgtttcc gaaaagctta cattttattc tttcaacatt 9000 gctagaacgt cagtctcaat gtgtatatgt ctactagtag gcaatttaaa cgtattttgc 9060 aatttactag tacgtacagt gttatgtttt ctcctgaaac gtttggacca gattcaaaat 9120 ggcggctacc ttgtgagcct tgcggtgttg tttgttttgt caaatcaagg tacttggtca 9180 atttctgcct tgtttgtctc tagatatagc tcggaacacg aaatgtacag tttattcaac 9240 gtttcctcgt acagttgaaa tttttcagaa tcaagtgaag gtggctatga acaccacacc 9300 gcttgaccaa aacaaagtgc tttgataaaa gggcaaacaa actttaaaag cttcttgtta 9360 caaattcagc ggaagcggcc attttgcatt cgcggagcgc cccaagcgtg atttcccgcg 9420 acattcaaac tatgaaagag aaacaaacat ttatggattt tgaaccttct tgctagggtc 9480 aaactggaaa atgtcaaatt gttgacatga gcgtttaatg aagaaccaat agcaacaatt 9540 ttctgttgtt tctaatttta tatattgttg taaagtttag gaatgaatcg aaacaaagtt 9600 ggcgtccgac gggaaattgt caactccttg cactctgcag cgcttttatc taataattaa 9660 gagtttgcag aaagcatgaa cgttatggga tcctgtacca acgactttct gttgtatcta 9720 ccatataatt tggcatttaa cttcctttat agttaaagaa ttaatccaaa caaagacggc 9780 gtgggttaac gttgagccaa ttatagattg taaacaaaga aaaatcctgc tcggaactag 9840 tttgttttta tgaacttgct tgctctcttt tccaaattca acacgctcag ttcggtcgcc 9900 agcggctaga acgccagtca caatgtatat aggccgcatg cctaggcaat ttaaacttat 9960 tttacaaatt acgtagagtg ttacgtttcc cctagaacgt ctagacgcct caaaccagat 10020 ccaaaatggc ggcttcctcc cggcggtagt ttgttttgtc aaatcaaggt acttggtcaa 10080 actctgcctt gtttatctcc agatataact cgaaacatga aatgtacagt tttcttgtaa 10140 tgttgacatt tttcagagtt aagtgaaggt ggcaatgaac aacacaccgc ttgaccaaaa 10200 tcaagtgctt tgataaaagc tcaaacaaac tttaaaagct tcttgttaca aattcagcgg 10260 aagcggccat tttgcattcg cggagcgccc caagcgtgat ttcccgcgac attcaaacta 10320 tgaaagagaa acaaacattt atgggtttga accttcttgc tagggtcaaa ctggaaaatg 10380 tcaaattgtt gacatgagcg ttatgaagaa ccaatagcaa caattttata tattgttgta 10440 aagtttagga atgaatcgaa acaaagttgg cgtccgacgg gaaattgtca actccttgca 10500 ctctgcagcg cttttatcta atgtaatgat aaagagtttg tagaaagcat gaacgttatg 10560 ggatccttta ccaacgactt tctgttgtat ctaccatata atttggcatt taacttcctt 10620 tatagttaaa gaattaatcc aaacaaagac ggcgtgggtt aacgttgagc caattataga 10680 ttgtaaacaa agaaaaatcc tgctcggaac tagtttgttt ttatgaactt gcttgctctc 10740 ttttccaaat tcaacacgct cagttcggtc gccagcggct agaacgccag tcacaatgta 10800 tataggccgc atgcctaggc aatttaaact tattttacaa attacgtaga gtgttacgtt 10860 tcccctagaa cgtctagacg cctcaaacca gatccaaaat ggcggcgata ttccgcctcc 10920 cggcggttgt ttgttttgtc aaatcaaggt acttggtcaa attctgcctt gtttatctcc 10980 agatataact cgaaacatga aatgtacagt tttcttgtaa tgttgacatt tttcagagtt 11040 aagtgaaggt ggcaatgaac aacacaccgc ttgaccaaag tcaggtgctt tgataaaact 11100 caaacaaact ttaaaagttg ttcttgttat aaattcagcg gtagcggcca ttttgtattc 11160 gcggagtgct ccaagcgcga ttatccgcga cattcaaact acgaaaaaga aacaaaaatg 11220 tatgcatttt ggagcttctt gctagggtca aactggaaaa tgtcaaatcc acgcactgtg 11280 cagcgtctta tggtatccag tatcaacgtt ttttgttgcg tctacggtat ttgatagtgt 11340 tttcaagttt aggaatgaat tcaaacaaag atggcgtggg ttagccggtg gacttataaa 11400 caaaagaaaa ggcccactcg gaactacatg ttgtttgttt ttatggacta gtgacgctta 11460 ctctctttaa accctgctaa tcacaatttg tgagctatta ttatctgcat tggattggcc 11520 gatctttggg tattgtgcag gtatttacaa ggctgaaaac gtgccctatt ctagagaatt 11580 agttaccaca tcgttaaacc aagtaccagt agttgtatta tggtttgttt tgttttgctt 11640 agactggtag gccgggactc ctacgtgtcg cctttgaatt ggctatataa tcgatgtcct 11700 gttacgtttc aaaattgctt tgtatatatc aatagatcaa actacatttt tgcttgaaca 11760 tacagacaat gcatactagt aataggtgtc tctatttatg tgtagttttt aaaagtcgtt 11820 gtaagagcac aaagtccgct cctcctattg cctgctcagg cgtgtccgca gaagcttttg 11880 aataaactat gatgtgacca gaagtcaagg agtgtagttt tcaaaattca ttgattgaag 11940 agtacaattg ccagtaagac taatgagagt ctaccccaca ctttaagtag cttaaacgaa 12000 gcttaaataa atagctttaa actatattta aggacgctta atggtatcgt atataaacga 12060 attaaaatgc agagtaaatt gatggttatt ttagtcagta tatatctata atgtgttggc 12120 aaaactgcac aaattactgt tatatcttta atgcatagtt aagtgacctt aaggaacaag 12180 taaagttact tatatcttta agtgaactta acgattttct taaatgtgtc gtaaagattg 12240 acttttagtt gtggcatttc cgacttcgta aatgtagctt aaagataact taattatatt 12300 caaatgttta agcttcattt acgaatcaca ttaagtatgc ttaaatatcc gatttggtgc 12360 agtg 12364 // ID Gypsy-159_AA-I repbase; DNA; INV; 6613 BP. XX AC AAGE02017724; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-159_AA_; KW Gypsy-159_AA-LTR; Gypsy-159_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6613 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017724; Positions 11377 4765. XX CC Positions [4926-5402] - Integrase core CC 'ACAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 427..2631 FT /product="Gypsy-159_AA-I_1p" FT /translation="MDLTTQYHKMDVSHLSIDEVEYELFIRKISYNLNEHE FT SVKRRKLKDQLRKERALDSPVLIGSGTWSTTSDELRVIFSKLITIGSLLES FT SKLDARNREKLITRLVHFRVRISRLHSASDSHKFVAEIAKLEKEVNHICKQ FT HCPLLASVAEEEDVEVELSKVLEEVRTEIVDLNQSIAPAESEEAQSNERRN FT FQIAEKEKELNSSIRRMEEILGQLTNYENGQVEDLPNLLRAFKDFVIQTSE FT LDKKRRQREIEKEQQKMKEAEQHLQNKIRLEKLLANLNENLKSDVVAKKST FT GESLSLASIEPRKSEKVRFESNRSSSKLYATASENESSCSESSVLCVEPRY FT SKPTKTRSRKLSHAPRSRSSHKHDSERKGRKKYESTDSVTESESSRSSKSS FT SRSSLSSECSSDHHRRHRRGKSKRERSRSTRRVSVSEWRLKYDGRDEGRKL FT SEFLKEIKMRARAEDVSDRELFRCAIHLFSGRARDWFLDGYENRDFRNWSQ FT LKRELKREFLPPDLDFQLEIQATNRRQGRGERFVDYFHEMQKLFHTMTKPI FT SERRKFDIIWRNMRHEYKNAMTSANIRSLSKLKRFGRTVDENNWYLYHRSS FT DNQKSRNPQVNEISGSKDRPKKEAPSNNTRVFQNSKYATNKPQKIDSEKPR FT QPEPSKESRQETQPSEVMAGPSNALLKISPDQYKRPPNGVCYNCRKSGHHY FT SECKERLQKFCRVCGFLDVFTSACPVCSKNVESSA" FT CDS 3012..5795 FT /product="Gypsy-159_AA-I_2p" FT /translation="MIVAPKLHRRLILGAGDFWRAFNIRPTVQTTVEIDEI FT DQPENELELTDQQISELETVKQMFKIAIEGEYLDTTPLITHRIEIKDEFKN FT SPPIRINPYPTSPQMQTKINAELDKLIAQHVVERSHSDWSLSTVPVVKPNG FT EVRLCLDARRLNDRTQRDAYPLPHQDRILSRLGKSRYLSTIDLTKAFLQIS FT LEPKSRKYTAFSVLGKGLYQFTRLPFGLINSPATLSRLMDEVLGYGELEPE FT VFVYLDDIVIVSETFEAHLNSLREVARRLKNANLSVNLEKSKFCLNELPYL FT GYLLTPDGLKPNPDRIEAIVNYERPTSIRALRRYLGMANYYRRFISNFSEI FT TAPLTNLLRKKPKTIVWNEQAEKAFIDIKECLIAAPVLCNPDFSRPFQIQS FT DASDTAIAAILTQQYDEGEKVVAYFSQKLTPAQQAYAASEKEGLAVLSAIE FT KFRPYIEGTHFVVITDASALTYIMRGRWRTSSRLSRWSIELQSYDLEIKHR FT RGKENIIPDALSRAVESMDVEDSGDTWYSDLMNKVRESPEDNLDYKIEDGK FT LFKLLPTKTDVLDYTHQWKLCVPHDLRSQVLKEEHDDSLHIGFEKTLDRVR FT KLYFWPKMSASVQRHVQHCRVCKETKPANVSQHPEMGKPRLVTKPFQILSI FT DYIQSLPRSKSGNAHLLVLLDLYSKWTVLVPVRKISTSLTIKILEEQWFRR FT FSVPEILISDNASSFVSHDFKSFLERFHVKHWTNARHHSQANPVERLNRSI FT NACIRTYVKSDQRLWDTRISEVEHVINTTRHSSTGFTPYRIVFGHEIVSSG FT DQHIREVDVRDLSPEERIEQKQAADSKLADLVRRNLEKAHEHSRRNYNLRY FT NKPSPVYQVGQQVYRRNFTQSSAAESYNAKLGPTYIPCTVVARRGTNSYEL FT ADDQGKIVGIFSAADLKPGVAVSE" XX SQ Sequence 6613 BP; 2076 A; 1370 C; 1491 G; 1676 T; 0 other; aatggcgccc aactaaatca atcgaaaagc taatcagatt tcaaaaaata attggcatta 60 tttttttcac acggagagta ggagagacaa attgaacaaa tttacagcaa tacacggtaa 120 ttaaccccat acttctgata taacggaacc acgcaaaggc ggcttactag tgcgcgtacg 180 ttttctgaag agagcatttg agcagttgtt cggtgaataa gtgagcgagt gaacgaagtt 240 ggaacaagta tttcgagttt agaacaaagt tagaaacggg tcttgattcg atgatttttt 300 aggaacaaat ttagcttact gattgaacga ttctcagagg aagtgattgc gatattatcc 360 cgctagtgcg gtagcttaga ataacaaatt ttgattttct ttattggtca tagttgatag 420 ctcaacatgg atcttacaac acagtaccat aagatggacg tgtcccattt gagcattgat 480 gaagtagagt atgaattatt cattcggaaa atttcgtaca atttgaatga gcacgaaagc 540 gtgaagcgac gcaagctgaa ggaccagttg aggaaggaga gagcgttgga tagtccagtc 600 ctgataggta gtggtacttg gtccaccaca tccgatgaac ttcgagtgat tttctccaag 660 ttgatcacta tcggaagcct gttagagagt tccaaactgg atgctagaaa tagggagaaa 720 ttgattacgc gattagtaca tttcagagtg cgaatctcac gattacactc agcgtcggac 780 tctcacaagt tcgttgcaga gatagctaaa ttagagaaag aggtgaatca catctgcaag 840 cagcattgtc cgctactagc atcagtggcc gaagaggaag atgtcgaggt agaactgtcg 900 aaagtgttag aagaagttag gaccgagata gtcgatttga atcaatcgat tgctccggct 960 gagtctgaag aggctcagag taatgaacgt aggaattttc aaattgcaga gaaggagaaa 1020 gaactgaatt cgtcaatcag acggatggaa gaaatattag gtcaattaac aaattatgag 1080 aatggacagg tggaagatct cccaaattta ttgagagctt tcaaagattt tgtaatacag 1140 acgtcagagt tggacaaaaa acgtagacag agggaaattg aaaaagaaca gcagaaaatg 1200 aaggaggctg aacagcatct tcagaataaa attcgactcg agaagttgtt agcaaatctg 1260 aatgagaacc taaaatccga tgtagttgcg aagaaatcga ccggtgaatc gttgtcgtta 1320 gcatccatcg agccgaggaa atctgaaaaa gtaaggtttg agtcaaatag gtctagttcg 1380 aagttatatg ctacagctag cgagaatgaa agctcatgtt ctgagagctc tgtcttatgc 1440 gtagaacccc gatacagtaa acccaccaaa actcgatcac gaaaattatc tcacgctcca 1500 cgatctcgat ccagtcataa acatgactca gagagaaagg gtaggaagaa gtacgaaagc 1560 acggactccg tcacagagag tgagagttct aggtcgtcga aaagtagtag tagatcgtcg 1620 ttgagctccg aatgctcttc tgaccaccac cggcgacatc gtagaggaaa atctaagcga 1680 gagaggtcta ggtcaactag acgagtctca gtctcagaat ggcgactaaa atacgatggt 1740 agagatgagg gtagaaaact ttccgagttt ctcaaagaaa ttaaaatgag agctcgagcc 1800 gaggatgtga gcgataggga gcttttccga tgcgcgattc atctgttttc aggcagagcc 1860 agagactggt ttctcgatgg ttatgaaaac cgtgactttc ggaattggtc tcagttgaaa 1920 cgagaactca agagagaatt tttgcctcca gacctggact tccaattgga aatccaagcg 1980 acaaatcgtc gccaaggtcg cggtgagaga tttgtggatt attttcatga aatgcagaaa 2040 cttttccaca cgatgacgaa acctatttct gagcgtcgaa aattcgacat aatttggcgg 2100 aatatgcgcc atgaatataa aaatgcgatg accagtgcaa acattcgtag tttgtccaaa 2160 ctcaaacgtt tcggccgtac tgtagacgaa aataactggt atttatatca cagatcgtct 2220 gacaaccaga agtcaagaaa cccacaggtg aatgaaatct ccgggtccaa agaccgaccc 2280 aagaaggaag caccgtcaaa caatactaga gtgtttcaaa attccaaata tgccactaac 2340 aaaccccaga agatagattc tgaaaaacca agacagccag aaccgagtaa agagagtcgg 2400 caggagacac agccctccga ggtaatggct ggaccctcca atgctttatt gaaaatttca 2460 cccgatcaat acaagcgacc cccaaacggt gtctgttata actgtcggaa atcagggcat 2520 cactacagcg aatgtaagga gaggctacaa aaattctgcc gagtttgtgg tttcttagat 2580 gtattcacga gtgcttgtcc cgtctgttca aaaaacgtgg agagttcagc ttgagagggc 2640 aagctgaatt tcttcaagaa agtccctcca atgcagagat ttcgcaaagc ttacgagata 2700 atggctttat tcctttcgcc gaaacggatt cttataaacc accagagctg gaagagctct 2760 tcctttacaa cgatggagat gctcgacctt tcgttaaagt acagatactg gggaaagaaa 2820 taattggctt gctcgatagc ggtgcagaac gctctgtgtt aggcagagga gctgaaaaat 2880 tattgaaatc tctcaatctc aaaattcacc cctctactat cacacttgta aatgcagcag 2940 gcaataacat tccagttatt ggctgtgttg atttacccat atactttgac aaacaagtga 3000 aaataattcc tatgatcgtg gcaccaaagc tccaccgacg attgattctc ggagcaggag 3060 atttttggag agcattcaat attcgtccca cagtccagac gaccgtggag atcgacgaaa 3120 ttgaccagcc agagaatgag ctcgaactaa ccgatcagca gatcagtgaa ctcgagaccg 3180 tgaagcaaat gtttaaaatt gcgatcgaag gagagtatct ggacactaca cctttaatca 3240 ctcacagaat agagattaag gatgagttca agaactcacc accaatacgc ataaacccat 3300 acccaacatc tccccaaatg cagacgaaga taaatgccga actggacaaa ttgattgctc 3360 agcatgtagt tgaacgaagt cacagcgact ggtcactgag cacagtaccg gttgtcaaac 3420 caaacggaga ggttaggctc tgtctagatg cccgtcgttt aaatgatcgt acccaacgcg 3480 atgcctaccc gctaccccac caggaccgta tattaagccg tttaggaaag agtagatatt 3540 tgtcgactat cgatctcacg aaagcgtttc ttcaaatatc gctcgaaccc aaatcgagaa 3600 aatatacggc tttctcggtg ttgggtaagg gtttgtatca attcactcga ttacctttcg 3660 gtctcataaa tagtccagcc actctctcca ggctgatgga cgaagtattg ggatacggag 3720 aacttgaacc ggaagtcttc gtttatcttg atgacattgt catcgtaagc gaaactttcg 3780 aagctcatct caatagctta cgagaagtgg cacgtcgact taaaaacgct aacttgtccg 3840 ttaatttgga aaaatccaaa ttttgcctga atgaacttcc ttatcttggc tacttattaa 3900 ccccagatgg cttaaaaccg aatcctgatc gcatcgaagc gatcgtgaat tacgagcggc 3960 ccacctcaat tcgcgccctt cgacgctatt tggggatggc gaattattac cgccgcttca 4020 taagcaattt cagtgagata accgcgcctc tcactaattt gcttcgtaaa aagccaaaga 4080 caatcgtttg gaacgagcaa gctgaaaaag cattcattga catcaaagaa tgcctaattg 4140 ctgctccagt tctatgcaat cctgacttct ctcgaccgtt tcaaattcag agcgacgcga 4200 gtgatactgc gatagcggca atactgacgc agcagtatga cgaaggagaa aaagttgtcg 4260 catacttttc tcagaagctc actcctgctc agcaagcgta tgcagcttcg gagaaagaag 4320 gacttgctgt tctgtctgca atcgaaaagt tccgacctta catagaggga actcacttcg 4380 ttgtcattac tgacgcatcg gcgctcacat acatcatgcg tgggaggtgg cgaacttctt 4440 cacgtttaag ccgatggagt atcgagctac agagctacga cctcgagata aagcatcgac 4500 gggggaaaga gaacattatt cccgatgcac tatcaagagc cgttgagagc atggatgttg 4560 aagactcagg agatacctgg tactccgatt tgatgaacaa ggtgagagag tcccccgagg 4620 acaatctcga ctacaaaatc gaagatggaa aactcttcaa acttttgccg actaaaaccg 4680 atgttttaga ttacacacat caatggaaat tatgtgtccc tcatgattta cgttctcagg 4740 tgctaaaaga agagcacgac gattcgttac atatcggctt tgagaaaaca ttagaccgag 4800 tgagaaaatt gtatttttgg ccgaagatgt ctgcttctgt ccagaggcac gtccagcatt 4860 gtcgagtttg taaggaaacg aaaccggcta atgtttctca acaccctgaa atggggaaac 4920 ccagactagt aaccaagcca ttccaaatat tgtccattga ctatatacaa tcattgccca 4980 gaagcaaatc tggcaatgcg caccttttag tccttctgga cttgtactca aagtggacag 5040 tgttagtacc agttcgaaag atatcgacta gtctaaccat aaaaatcctt gaagaacagt 5100 ggttccgaag attttcagtt ccagagatct tgatatcgga caatgcgtca agctttgtca 5160 gccacgattt caaaagcttc cttgagcgat tccatgtgaa acactggacg aatgctcgac 5220 accacagtca ggccaatcct gtggagaggc taaatcgtag catcaatgcg tgcatcagaa 5280 cctacgtgaa atcggaccag aggttgtggg acactcgcat ttccgaggta gagcatgtga 5340 tcaacacaac tcgacactca tcgactggtt tcacgcctta caggattgtg tttggacatg 5400 agattgtctc ttctggtgac cagcacatcc gagaagtaga cgtaagagat ttgtctcccg 5460 aggaacgaat tgagcagaag caagctgctg atagcaagct cgctgatctc gttcggagaa 5520 acctcgaaaa ggcacatgag cacagtcgtc gcaattacaa tttacggtac aataaaccct 5580 ccccggtgta ccaagtaggt cagcaagtct atcgtcgaaa tttcactcag tcgtctgcag 5640 cagaatccta taatgctaaa ctaggtccaa cttacatccc ttgtacagta gtcgcacgta 5700 gaggaactaa ctcttacgag ctagctgatg atcaggggaa gatcgtcgga atcttctccg 5760 ctgctgatct taagccagga gtcgcagttt ctgaatgagt tgtcttgtct gtccatgtca 5820 agtcattgtc aggtcagttg gtgaattcca cccaatgtag tatagtcgtt ttgtccgaga 5880 taaatagaaa cgtccttagg ttcaatcttt gttgcatatt tagcaaaatt gtcgatcgat 5940 cactaaaagt aaacaaatcg ttgagagaga aaattaggta gagactcaag cttgcgtgtt 6000 aggtgtttgt gtagatgttt tttttttgtt catagcccca atccccatca accgccattg 6060 cattctaaag gagtagcgtg ctcattgaaa ttgtcgaaat gaccaaggta attaaaacag 6120 ctaaatgcag acaatcaatg cgcttagtag caattttaac cggtttagct ccgcactcaa 6180 gaagaataca atattggcca attttcaacg caatagacca ctaaatttac aaaatttccc 6240 agaaattccg caacaacact tagcactgcg aaccacacga agctatttga gaatcaaatg 6300 gctattgtgt tgcttgagag atgctctatc agtaagcgag tccgagaaag gtataggttg 6360 attttcttgc agtatacgaa taaaacaccc ctgaaagtaa tagggtaaag cgcgtaaaat 6420 ggtacttgtt cgagtttcag gaactagttt gctattttgt tgaatttcgc gttcataatg 6480 ctacttttca catgagtcga gtcaaacatg aaaggtgttg tcccaaacct gggcttattt 6540 tgtaagggaa agtttaaatg aaaataaact ttatcttcga taaagtttat tttctagagg 6600 gagtatgtgg tag 6613 // ID Gypsy-9_AA-LTR repbase; DNA; INV; 178 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_AA_; KW Gypsy-9_AA-I; Gypsy-9_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-178 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 988-988 (2011). XX DR [2] (Consensus) XX SQ Sequence 178 BP; 52 A; 31 C; 39 G; 56 T; 0 other; tggtatccta ctagtatcca agttagccat gatttatgtt aagaactaga ggaggcagtt 60 agagtttggt gtgtgaaact aacaggtgaa tatatttatc gtgcgtgcta attgtgtgta 120 caatctactg agcatccgaa tccgttttcc tgaatctccg gaaacaccag atattaca 178 // ID Gypsy-13_RP-LTR repbase; DNA; INV; 249 BP. XX AC ACPB02039362; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_RP_; KW Gypsy-13_RP-I; Gypsy-13_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-249 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02039362; Positions 11035 10787. XX SQ Sequence 249 BP; 76 A; 41 C; 43 G; 89 T; 0 other; tgtggcgacc cactgttggc acgagactgc ctcgggctcc tccgaggcga aaatgtgttt 60 tatttaattt atttctattt ttattcattt aaatattata cacatgtttt ttgtcaatat 120 gacacatgtg agtttttcat aaaagactag gacctgttac aaagcaaaaa atagattgta 180 tgtactaagg tacaagaaat acagctttta aggtacatat ttgtggttct tattttacgc 240 ataaccaca 249 // ID DNA4-5B_AP repbase; DNA; INV; 263 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-5B_AP. XX NM DNA4-5B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-263 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1952-1952 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4 bp TSD (TATA). CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 263 BP; 50 A; 77 C; 79 G; 57 T; 0 other; tactccggcc cacgagagtc catatgtcag aacgcgtccg tcgctgcgca tcggttcccc 60 ttccatacgc cgaatcatac cgatcggaaa ccggccggca acgatcgccg acagccgtcg 120 ttagccgtgt ttttatgacc cgctgtattc acgttgccga gtggtgggaa cggcactatg 180 gtgggagaaa attcggccgg cagccgtcgc gtaccgtcag tatggatgcg cggtttacgt 240 atggactctc gtgggccgga gta 263 // ID CR1_Ele7 repbase; DNA; INV; 5156 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele7. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5156 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5156 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >96% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 377..1207 FT /product="CR1_Ele7_1p" FT /translation="MAKNCGKCSKAINGIDVVICRGYCGGFFHLNECSGVT FT RAMQSYFVSNKKNLFWLCDNCAELFENSHFRMISNQVDEKSPLTSLTTAIS FT ELRTEIKQLHAKPAASPAVTPRWSTIVQKRGTKRPRIIEPNAHAQDSCRVG FT CKKAQENGNVVSVPICKPEAENLFWLYLSKIRPDVTVEAVRDMTKANLGIE FT DDPKVIKLVPKDRDIESLSFVSFKIGLDPSLEDRALNPDNWPEGLLFREFV FT DYGAPKFRPPLKFNTQSPLTLHQTTISPITPVMDLS" FT CDS 1102..5049 FT /product="CR1_Ele7_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="LRSAKISPPAEVQHAITVDTTSNNNISNHPRHGFELI FT FVIPGRNCLSISEASDSPNTVPLFLPSFISRPGPVFGCGDEVFQAISAGKY FT SCKLNISLPLKCPTSSSLTSMHIYSCSHLRHTNFSSMRNQAVDSIHHQASS FT LIKIVSTFLELPGCTPASLKEALNPLAAVELFLPAINSHPGPVGEYGEGVF FT RTSTAGKYTCNKDISSPVTSIVSSQPISSFMRTTSNFYDSPVPTRTISTFD FT QLPGCMPASLMEAPDPLITVENFPPASCSHSGPVYEFGEGVFQPVLAGKYF FT HESTISLPDENASSSVLIRNDDNILLPRPTAHPVPVGPSPSSVFPIGNSTV FT RVLEGAGHENATETSKILMYYQNLGGMNSSLAEYQLAISDGCYDIYAISET FT WLNGNTSSNQLFDDRYSVFRQDRSPSNSDKSSGGGVLLAVRSKFKSRRLVH FT PNIASVEQLWVAVTTTDATLYLCVIYFPPDRVNDNLLIENHIDSLNWIVSQ FT LGPRDNIIVLGDFNLSAISWQQDSNGSFFPTTSRSSIGSASRALLDSYSTA FT GLKQKVGVVNENNRMLDLCFVSEELQNDCKIMQAPSPLVKTCKHHPALLME FT IAVNPQLCFRNTLEDVRYDFGKADFNGMNNFLTSLDWDDILHNSDANLAAS FT TLSGILLYAIGQFVPIKPSFKPPKPIWSNPELKQLKRAKQAALRRHSKYRT FT DSTREMYVQANTEYKRLNDHLYNAHQDNLQRRLKHNPKSFWNYVNDQRKES FT GLPSTMSNGTFEADSTAGIADLFRTQFSSVFTDELVDPCVVDEATRIVPQL FT PISVQRFYVTDGMVKSAGKKLKSSTGSGPDGIPSLVLKRCVDSLASPLASI FT FNLSLSTGIFPNCWKESYVFPVFKKGCKRLVSNYRGIAALSATSKLFELIV FT LRELVHSYAHYISPQQHGFMAKRSTTTNLASFTSFVIRQIEDGQQVDAIYT FT DLSAAFDKMNHQIALAKFDKLGMDTQILSWLHSYLVDRSMSVKIGDHVSLP FT FSVCSGVPQGSHIGPFLFLLYMNDVNFVLNCLKLSYADDLKLYYTIKKAED FT ALFLQQELEVFAEWCRINRMVLNVTKCSVISFGRKHALHHFGYNLNGVQLQ FT RETTIKDLGILIDSKLTFKDHVAYVVSKASSQLGFLFRFAKKFKDIYCLKA FT LYCSIVRPILEYSSAVWSPYYQNEIQRIEAIQRKFIRFALRHLRWNNPLNL FT PSYRSRCKLINLDLLEDRRNVAKCCFIGDLLQGNIDCPTLLSLVNINIPSR FT NLRSHSFLTISPARTNYGLHEPMRSMCRVFNQCYRVFDFNMSRVANKCNFR FT RILC" XX SQ Sequence 5156 BP; 1363 A; 1242 C; 1075 G; 1476 T; 0 other; ttggcaacac tgttgatgtg tatatgcgaa tgtttacggt cttgaaattt ttcgttcgat 60 atttcgtgaa attgaatgcg taaaatcgcc gtttgactat accgtgctac ctgttcgtcg 120 tgctagctta gtgcgtgtat tttagtgtcc gtgtgacgca tcgtagtgat attttatcaa 180 atttatctag tgactcgatt tgtgtattgt tttactcggc aaaagctcac tacccccacg 240 tcgtcgtttt tttttctgaa gcgacatctg ccggcaaatt ttgtaaacta ccgctgtaaa 300 gttttgctga tctgagcttt ttagtgccat aaaacttcga aggtgcttct ggtcacatac 360 gtgtaggcgc ttcaggatgg ctaaaaattg cggaaagtgc tcaaaagcca taaacggcat 420 cgacgttgta atttgccgtg gctactgtgg tggattcttc cacttgaatg aatgctcggg 480 ggttactaga gcgatgcagt catatttcgt atcgaataaa aaaaacttgt tctggttgtg 540 cgataattgt gctgagcttt tcgagaactc ccattttcga atgatttcta atcaagttga 600 tgaaaaatcg cctcttactt cgcttacaac ggccatctca gaactgcgaa ccgaaattaa 660 gcaactacat gctaaacccg cagcatctcc agcagttacc ccacgttggt ctactatcgt 720 tcaaaaaaga ggtactaaac gtcctcgtat aattgaacct aatgcgcatg ctcaggatag 780 ttgccgtgtc ggttgcaaga aggctcagga aaatggaaat gttgtatcgg taccgatttg 840 caaacctgaa gctgagaact tgttttggct atatctgtcg aagattcgtc ctgacgtcac 900 tgtagaagcg gtacgcgaca tgacgaaagc gaaccttggc attgaagacg acccaaaagt 960 gattaaactc gtcccgaaag atagagatat cgagtcgctt tcgtttgtct cttttaaaat 1020 cggactagat ccttcacttg aagatagagc gcttaacccc gataattggc ctgaaggatt 1080 attgtttcga gaatttgttg actacggagc gccaaaattt cgccccccgc tgaagttcaa 1140 cacgcaatca ccgttgacac tacatcaaac aacaatatct ccaatcaccc ccgtcatgga 1200 tttgagctga tttttgtcat cccaggacgc aactgcttaa gtatttcgga agcctcggat 1260 tcccccaaca cagtcccgtt attcctgcct tcgttcatca gccgtcctgg tcctgtgttt 1320 gggtgcggag atgaggtctt ccaagctatt tcggcaggca agtacagttg caaattgaac 1380 atttcccttc cgttaaagtg ccctacttcc agcagcttga cgtccatgca catctattcg 1440 tgctcacatc ttcgacatac gaatttttca agcatgcgta accaagctgt agattcgatt 1500 catcatcaag catcatcgtt aatcaaaatt gtatcaacgt tcctcgagct accgggatgc 1560 acgcctgcaa gtctcaagga agcccttaat cctctcgccg cagtcgagct tttcctgcca 1620 gcgatcaaca gccatcccgg tcctgttggc gagtatggag aaggggtctt ccgaacctca 1680 accgcaggca agtatacgtg caataaggac atctcttctc cggtaacatc catcgtttct 1740 agccaaccaa tatcatcgtt catgcgtact acaagcaact tctatgactc accggtgcca 1800 accaggacta tttcaacatt tgaccaacta ccgggatgca tgcctgcaag tctgatggaa 1860 gcccctgatc ctctcatcac agtcgagaac ttcccgccag cgtcctgcag tcattccggt 1920 cctgtgtatg agttcggaga gggggtcttc cagcccgttc tagcaggcaa gtacttccat 1980 gaatcaacaa tttctctgcc tgatgagaat gcttcttcca gcgtattgat ccgaaatgac 2040 gacaacatat tgcttccccg accgaccgcc caccctgttc ccgttggacc ttccccctcc 2100 agcgtgttcc cgattggaaa cagcaccgta cgagtcctcg aaggtgccgg tcatgaaaat 2160 gcaacagaaa cctcgaaaat tctgatgtac tatcagaacc ttggtggaat gaacagctct 2220 ctggctgagt atcagttggc cattagcgac gggtgttacg acatttacgc catttctgag 2280 acttggctta atgggaatac gtcatcaaat cagctgtttg atgaccgcta ctctgttttc 2340 cgtcaggatc gatcgccttc taacagcgat aagagttctg gcggcggtgt actgcttgct 2400 gtccgttcga aatttaaatc acgtaggctt gttcatccca atatcgcgtc tgtcgagcaa 2460 ctttgggtcg cagttaccac tactgatgca acactctatt tgtgcgttat atacttccca 2520 ccagaccgag taaacgataa cctgttaatc gagaatcata ttgattcgct taactggatt 2580 gtttcacaat tgggaccaag agacaatata atcgttttag gcgatttcaa cttgagcgca 2640 atttcgtggc agcaggactc caatggttcc tttttcccga caacctctcg gtcgtccatt 2700 ggctcggcct ctcgggccct gcttgattca tacagcactg ctggacttaa gcagaaagtc 2760 ggagtcgtaa acgaaaataa ccgcatgctt gacctgtgct ttgttagcga ggaacttcaa 2820 aatgactgca aaatcatgca agctccatca ccgctcgtca aaacctgcaa acaccatcca 2880 gcgttactaa tggaaattgc agtcaatcca caattatgtt tccgcaacac tctggaagac 2940 gttcgatacg atttcggcaa agctgacttc aatggcatga acaactttct cacaagcttg 3000 gactgggatg acattctcca taatagcgat gccaatctcg ctgcttcgac gttgtccggt 3060 atcttgttgt atgccatcgg ccaatttgtc cctatcaagc ctagtttcaa gcctccgaaa 3120 ccgatatggt caaatcctga acttaagcaa ctaaaaaggg ccaaacaggc agctctccgg 3180 cgtcacagca aatatcgtac ggactctaca agagaaatgt acgttcaagc taacaccgaa 3240 tataaacggc tcaacgatca tctgtataat gcccatcagg acaatctgca aagacggctc 3300 aagcataatc ccaaaagttt ctggaattat gtcaatgatc aacggaagga atccggtttg 3360 ccatcaacga tgtccaacgg aacttttgaa gcagattcta ccgcgggtat tgccgatttg 3420 ttccgaactc aatttagtag cgtttttaca gacgaactag ttgacccatg tgtagtagac 3480 gaagcaacca ggattgtacc tcaacttccg atttccgttc agcggttcta cgtcaccgat 3540 ggaatggtaa aatcagcagg aaagaaattg aaatcatcca ccggaagtgg gccggatgga 3600 atcccatcgc ttgttttaaa acgctgcgtg gattcactag cctcaccttt agcatcgata 3660 tttaacttat cgttatctac tggtattttt ccgaactgct ggaaagaatc gtacgtcttt 3720 ccagttttta agaaaggatg caagcgattg gtttcaaatt atcgtggcat agctgcactc 3780 agcgccacct cgaaactgtt tgagttaatc gtattgcgcg agctggttca cagttacgcg 3840 cattatattt cgccacaaca gcatggattt atggccaaac gatcgactac aactaatttg 3900 gcgtcattca cctcgttcgt aattcggcag atagaagatg gccaacaggt tgacgctata 3960 tatactgacc tttccgctgc gtttgacaaa atgaaccatc aaatcgcact ggcaaaattt 4020 gacaagcttg gaatggatac tcaaatcctt agttggctcc attcttatct ggtcgatcga 4080 agcatgtccg tcaaaattgg agaccatgta tcgctaccat tttctgtttg ttccggcgtt 4140 cctcaaggca gccatatcgg cccattttta ttcttgctgt acatgaacga tgttaatttc 4200 gtgctaaatt gcttaaaact ctcgtatgcc gatgacctga agctttacta tacgatcaag 4260 aaggccgaag atgccctttt cctacaacaa gaactggaag tttttgctga gtggtgccgc 4320 attaatcgta tggtactcaa tgtaactaaa tgctcagtaa tctcgttcgg ccgcaaacat 4380 gcactccatc attttggcta taatttgaat ggtgtgcagc tacagcgcga aacaacaatc 4440 aaggatttgg gtatcctgat tgattccaaa ttaacattta aagatcatgt cgcatatgtc 4500 gttagtaaag cttcgtcgca actgggattc ttgttccgat ttgcaaaaaa atttaaagac 4560 atatattgcc tcaaagctct ttactgctcc attgtgcgac ccattctgga gtattcctct 4620 gctgtctggt cgccttacta tcagaacgaa attcaacgga tcgaggctat ccagcgcaaa 4680 ttcatccgct ttgcgttgcg tcatctcaga tggaacaatc cgctcaactt acccagctac 4740 aggagccgtt gcaagctgat aaatttggat ttactggaag atagacgcaa tgttgcaaag 4800 tgctgcttta ttggagacct cctgcaaggc aacattgatt gtccaacgct gcttagtctg 4860 gtgaatatca acatacccag ccgcaaccta cgctcgcact cgtttttgac catttctcca 4920 gccaggacga actacggatt gcatgaacct atgcgtagca tgtgccgtgt tttcaatcaa 4980 tgttatcgtg tctttgactt caatatgtcc cgcgtagcaa ataagtgtaa cttccgtaga 5040 attttgtgtt aatttttttt tctcctgttc gtaattttat tattttactt ttaatgtcgt 5100 gctaaactag tcactggggt aaatatttta cctgttggca taaataaata aataaa 5156 // ID BEL-74_AA-I repbase; DNA; INV; 6876 BP. XX AC supercont1.271; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-74_AA_; KW BEL-74_AA-LTR; BEL-74_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6876 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.271; Positions 1313141 1306266. XX CC Positions [5837-6415] - Integrase core CC 'AGCAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1487..6793 FT /product="BEL-74_AA-I_1p" FT /translation="MAPGIKKQPTLKLLMVQLKNIQTSLDDILRFVQNYEP FT TCTASAVNVRLHSAENLWEKYGTVLNDIQAHDDFEDEEGTFDQARLIYSDR FT YYDCVSFLMDKAKELQGPDDAIDISMRGNETLSAGHGMMDHVRLPQIKLQI FT FNGNIDEWISFRDLYTSLIHRKVDLPEVEKFHYLKGCLQGEPKGLIDPLKI FT TRNNYQVAWEMLLKRYDNSKQLKKRQVQALFSLPTLSKESVTDLHTLMDGF FT ERIVQNLDQVIKPEEYKDLLLVNLLTSRLDPVTRRGWEEQSSTKDQDTLAD FT LTDFMHRRICILESLPSKASDVKGSQQVQQPSRYKSSAVKASCGSVQSSGG FT RCVVCKENHPLYMCSSFQRLSVRERDGVLRSNALCRNCFKSNHQAKECPSK FT YLCRSCKGRHHTMVCFKAEKNETAKVAAVTSTPTECVETSGSTSQVVNMVA FT TESSVSGSSQQFSSQVLLATAVVIVEDDEGSQFPARALLDSGSESNFITER FT LSQRIRTQREKVDISVLGIGQVAMKVKYKIQALVCSRVSEFSRNVNFLVLP FT RVTADLPTAKVNTAGWSIPNGISLADPAFFSPGAVDMVLGIEFFFDFFESG FT RRISIGTQLPTLNESVFGWVVCGGLTKSTEGLRVNCSTASTATLEELVTRF FT WISEEVGGTKILSSEEKRCEELFQRTVRRDSNGRYFVSLPKDEDAISRMGE FT SRDIAFRRFQGTERRLARDASLKEQYHKFMAEYVQLGHMTKVDGMAESDKR FT CYLSHHPVIKEASTTTKLRVVFDASCKTSSGVSLNDSLLVGPIVQEDLRSI FT MLRSRMKQVMLVADVEKMFRQIFILPEDRHLQCILWRFDSADPVDVYELNT FT VTYGTKSAPFLATRTLNQLAMDEEHQYPLAAKAATEDTYMDDVITGADTIQ FT AACELRVQLEEMTMKGGFRLRKWASNNPTVLEGISEDNLAIRLSEGIDLDP FT DSTVKTLGLTWLPNTDQLRFKFDIPLRSFSQQLSKRQVLSVIATLFDPLGL FT LGAVITTAKTIMQLLWKFRNERDQALDWDQPLPSTVGEIWRAYYDQLPLLN FT DVRIDRCVTIPEAIKIEIHCFSDASTKAYGGCVYVRSQDGQGGLKVRLLSS FT KSKIAPLKTQTIPRLELCGALLTAQLYEKIRDSIRTDAQAFFWVDSTCVLR FT WIQASPNVWNTFVANRVAKIQAITEPSQWFHVSGKENPADLISRGIAPGDI FT VKNTFWWEGPEWLKSTREYWPLSPLESFPDEESQERRRTMVSCAITLDAEF FT NEWYLGRFSSYSDLLRKTAYWLRLMRLLRKDMRDRVVANSRLSNEELNEAE FT LIMVRRVQKEVFADEWNALTKSGTVPRRSPLRWFHPHIAEDNVIRVGGRLE FT HSAEAYDVKHPIVLPARHAFTRLILNHYHLKLLHAGPQLLLGVVRLRFWPL FT GGRNLARHVVHQCIKCFRSKPSAIQQFMGDLPSSRVTPSRPFSQTGVDYFG FT PMYIRPAPRKPAIKVYVAIFVCLCTKAVHLELVMDLTTDRFLLALRRFVAR FT RGKCQDIYSDNGTNFVGARNRMKDFLQLLKSPNHREKVSKECNEDGIRWHF FT SPPSSPHFGGLWEAAVRSAKNHLLKVMGESCPSAEELNTLLVQVEGCLNSR FT PLTPMSDDPNDLEPLTPAHFLIGTSLQALPEPNLESVALNRLDRFQLMQRI FT LQDFWKRWRMEYLCQLQGRFKRWNSPIKIEVGRMVIIRDENMPPMKWKMGR FT ICKLHPGTDGVVRVVTLKTSSGELKRAVEKLCFLPISEDQPNQHPAEQK" XX SQ Sequence 6876 BP; 1846 A; 1472 C; 1694 G; 1864 T; 0 other; tttatggtcc ttcgaaccgg attggatgga ccgggaaggg acagaaagga cctggaaaac 60 ttattgtgcg ctacttggcg ccattgcgtg gcatccgtct atggtataag gacaattaga 120 ggaagccatc gcactcaagg acaattgtcg cagtagccat caatttcgcg aacaattagc 180 ggatcgccat cactgcatgc tacaaaggat cgaggatagt gtggtctgga catccatcga 240 atggaacctt ttcataggtg agtggaacca tcactatatg tccagccgaa gcaaggcaat 300 tttcgattct aacacctgtc atcatattgc atgatacaca aaccatctgc tgtaaccgga 360 atttgaggat cgtttgcgga ttcgaggcta attggcttga ttggttggtc gaggcttatc 420 tgactgattg actggctggt acaaggactg ttgattgagc tacccggttt gatcacacca 480 ctggcacgaa tttggactac tgaaccagcg gacaatctgg ttctccaggt aaatattttt 540 aagttcagta tatgtccagt cgtcgttagg attgcatgag tctaacacct tatttcgatg 600 tttcttccct ggatcggatg gatttatcaa ccgtcgagac gatggacctt ggcgactact 660 gaagaatgta ttggaggaca atcgttcaga caatttacat cggttccatc gcatcactct 720 ggtttggatg agcttctagt acttcaacaa cccgaggcaa ctgcttgatt caccaaaatt 780 gaagaggctg tgacgtatat tttctatatt ggtgagtgct atgaactgta tatgtcaccg 840 aagcgggaca cagaaatata cacattgtaa tttgaattca tcctcttcaa cttattggcc 900 acgaatccct gagaacaaac tgttcgttac cagaacaact gtctcgattt cgctggattt 960 attcgttttc tgattggatt ggtgctgcta aggtttctgg gttagtgaac aagtatatgc 1020 cttgccgagg cttcaatctc tgtatactac atgcccgatt cctcttcttg gacctcttcc 1080 cttaaatccc gatgaaattt gctgttggtg agaaattgat cgctgttaat gagtttgctg 1140 taattgagct tggaatttgg gatcccttcg acgatggaat gcatttgaat agctgttagt 1200 tctcgtgggt gagttctaca gtatatgtct gtcgaagcaa ggactatcgg atactacatt 1260 tggctcttct cctctttccc caacgattcg atactgttgg atgagtggcg tttccgaacc 1320 gaatcaccgt gacgactgga ttcatttgga tgaggtgtag ttgatcccag tgcagcttac 1380 tattctgtca ggtaggcttc tacagtatat gtcagccgaa gccgtggact ttggatctct 1440 acaccatcat atttctcttt gccggatcga gtgacgtgtt tcaaccatgg ctcccgggat 1500 taagaaacaa cctaccctta agctgttgat ggtccagctt aaaaacatcc agacatccct 1560 ggacgacatc ttgagatttg tacagaacta cgagccgacc tgtaccgcaa gcgctgtaaa 1620 tgttcgtctg cacagcgccg agaatttgtg ggagaaatat ggcacggttt tgaatgacat 1680 ccaagctcat gacgattttg aagatgagga agggacattt gatcaggcta ggttaatcta 1740 tagtgataga tattacgatt gtgtatcatt tctcatggat aaggccaagg agttgcaggg 1800 acctgatgat gctattgata tatcgatgcg agggaacgaa acgttgtcag ctgggcatgg 1860 catgatggat catgtccgtt tgccccaaat taagctccaa atcttcaacg ggaacattga 1920 tgaatggatc agtttccgag atttgtatac ttctctcatc cacagaaagg tagacttacc 1980 tgaagtagag aagttccact acttaaaggg ttgtcttcaa ggtgaaccaa agggcctaat 2040 tgatcccttg aagataacaa ggaacaatta ccaagtagct tgggagatgt tgcttaagag 2100 gtatgacaac agtaagcagt tgaaaaaacg tcaggttcag gccttattca gtttgccaac 2160 actttccaaa gaatctgtaa cagacttgca cactcttatg gacggatttg aaagaatagt 2220 gcaaaatttg gatcaggtca tcaagcccga ggaatacaag gaccttttac tggtcaactt 2280 gttgacttca cgtctagatc cagtgactcg tagaggatgg gaggaacaat cttcaacgaa 2340 ggatcaggat acgttggcag atctaaccga tttcatgcat cgacggattt gtatcttgga 2400 atcactgcca tcgaaggctt cggacgttaa aggatctcaa caagttcaac aaccgtcaag 2460 gtacaaatcg tcagctgtga aggccagctg cggttctgtt cagtcgtctg ggggacgatg 2520 tgtggtttgc aaggagaacc atccacttta catgtgttct tcattccaac ggttgtctgt 2580 gcgggagaga gatggggtat tgagatctaa tgcactgtgc agaaattgct tcaaatcaaa 2640 tcatcaagca aaggaatgtc catcgaaata cttgtgtcgt agctgcaagg gtcgccacca 2700 taccatggtg tgcttcaagg cggagaaaaa cgaaacagcg aaggtcgctg cggttacctc 2760 tactccaacg gaatgtgtag agacttctgg ttcaacatcc caggtggtga acatggtcgc 2820 caccgaatcc tcggtttctg gttcatctca gcagttttcc tctcaagttc tattggctac 2880 tgcggtcgtt atcgtggaag atgacgaagg tagtcagttt ccggctcgcg cactactaga 2940 ctccggctct gaaagcaatt tcattaccga acgtctgagc caacgtattc gaactcagcg 3000 agagaaggtg gacatttcag ttctcggaat cggtcaagta gctatgaagg tgaaatacaa 3060 aatccaagca ttggtttgtt cccgagtatc ggaattctct cggaacgtca acttcctggt 3120 tcttccaagg gtaacagcgg atttgcctac ggcgaaggtg aatacagcag gatggtcaat 3180 acctaatgga atcagcctag cagatcctgc gtttttcagc cctggtgcag tggacatggt 3240 gttgggcatt gagtttttct tcgatttctt cgaatctggt cgaagaattt ccattgggac 3300 acaactgccg acactaaatg aatcggtatt tggatgggtg gtttgcggag gactaacgaa 3360 gtctacggaa ggtctacgtg tcaactgtag cacggcttct acggcaactt tggaagaatt 3420 agtcactcgg ttttggatta gcgaagaagt aggcggaact aagattcttt catcagagga 3480 aaaacgatgt gaagagcttt ttcaaagaac ggttcgaagg gattccaacg gtcggtattt 3540 cgtttcgtta ccaaaagatg aagatgccat ctctaggatg ggcgagtcac gggacatcgc 3600 attccggcgt tttcaaggta cagaacgcag attggcgcgg gatgccagtc tgaaggaaca 3660 gtaccacaag tttatggcag agtacgtgca gctaggacat atgaccaagg tagatggaat 3720 ggcggaatcg gataaacggt gttatttgtc acaccatcca gtgattaaag aagctagcac 3780 cactacaaag ctgcgtgttg tcttcgacgc gtcatgcaag acgtcgtctg gagtttccct 3840 caacgattcc ttgcttgtag gcccgattgt acaagaggat ttacgttcca tcatgcttcg 3900 gagtcgcatg aagcaggtaa tgttggtagc cgacgtagaa aagatgttcc gacagatctt 3960 cattctgccg gaggaccggc atttgcaatg catcttatgg cgttttgatt cagctgatcc 4020 tgtggacgtg tatgaactca acaccgtcac gtacgggacg aaatctgcac ccttcttagc 4080 aacacgcacg ttgaaccaat tggcaatgga tgaagaacat caatacccgc ttgctgcaaa 4140 ggctgccacc gaggatacgt atatggatga cgttataaca ggagcagata cgattcaagc 4200 agcttgtgaa cttcgtgtac agctggagga aatgacaatg aagggaggat tcagactgcg 4260 caaatgggca tccaataacc cgacagtact ggaaggaatt tccgaggata atctggcgat 4320 tcgactatcg gaaggcatag atttggatcc cgattcgacc gtaaaaaccc ttggattgac 4380 gtggttgccg aataccgatc aactacggtt caaattcgac attccattac gttcgttttc 4440 tcaacaactt tcgaaacggc aagtcttatc ggtgattgcc actctgtttg atccgcttgg 4500 acttcttggt gctgttatta ccactgcaaa gacaattatg cagctcttgt ggaaattcag 4560 gaatgagaga gatcaggcat tggattggga ccaacccttg ccttcaacgg tgggtgagat 4620 ttggagagcc tactatgacc aacttcctct gttgaacgat gttcgtatag accgttgtgt 4680 gaccattccg gaggcaatta agatcgaaat tcattgtttt tcggacgcat cgaccaaggc 4740 atacggagga tgtgtctacg taagaagcca agatggacag ggaggattga aagtccgact 4800 gctgtcttca aaatctaaga tagcaccctt gaagacgcag actattccga ggctagagtt 4860 atgtggcgca ctcttgaccg cacagctata cgagaagatt cgtgactcca tcagaactga 4920 tgcccaagcc ttcttttggg tcgattctac ctgtgttttg aggtggatac aagcttcgcc 4980 aaacgtgtgg aatacgttcg tagcgaaccg agtggcaaaa atccaggcta tcacagagcc 5040 gtcccagtgg tttcatgttt cgggtaagga aaacccagcc gatttgatct ccagaggaat 5100 tgccccagga gatattgtta aaaacacctt ttggtgggaa ggaccagaat ggttaaaatc 5160 aactcgtgag tattggcccc tgtcaccttt agaatcgttt ccggatgagg agagtcaaga 5220 gcgacgccgt acgatggtgt catgtgcgat tacgttagat gccgaattca acgaatggta 5280 tttgggaaga ttttcgtcct attccgactt gctccggaaa accgcgtatt ggttgcggtt 5340 aatgcgacta cttagaaagg atatgcgaga tcgagttgta gccaatagtc gattatcgaa 5400 tgaagagctt aacgaggcag aattgatcat ggttcggcgt gtgcaaaagg aagtctttgc 5460 cgatgagtgg aatgctctaa cgaagagcgg aacggttcca agacggtcgc cacttcggtg 5520 gtttcatcca catatcgccg aagacaacgt tattagagtt ggcggacggc tcgaacattc 5580 ggcagaagca tacgacgtaa aacatccaat tgtccttcct gcacgtcatg catttacacg 5640 attaatcctc aatcattacc atttgaaact cctacatgct ggaccgcagc tacttctggg 5700 agttgttcgg cttcgattct ggccattggg tggacggaac cttgcacgac atgttgttca 5760 tcaatgtatt aagtgttttc gatcgaaacc ttctgcaatt caacagttta tgggtgacct 5820 gccatcatca cgtgtcacac catcaaggcc attctcacaa actggcgtag actatttcgg 5880 gccaatgtac atcagacctg caccacgtaa gccagctatt aaggtatatg ttgcgatttt 5940 cgtgtgtctc tgcaccaagg ctgtccattt ggagttggtg atggatttga ctaccgatcg 6000 tttcttatta gctcttcgtc gtttcgttgc cagaagaggg aaatgtcaag acatctattc 6060 cgacaacgga actaactttg tcggtgcgag gaatagaatg aaggatttcc tacaactact 6120 caaaagtcct aatcatcgcg agaaggtttc taaggaatgt aatgaggacg gaattcggtg 6180 gcatttcagt ccacccagct cgccccactt tggaggactg tgggaggcgg ctgtgagatc 6240 agcaaaaaat catcttctaa aggtcatggg tgaatcgtgt ccttcggcag aagaattgaa 6300 tacgctttta gttcaagttg aaggatgtct caattcacga ccccttactc ccatgtccga 6360 tgacccaaat gatttggaac ccttgactcc agctcatttt ttaattggca cttcgctcca 6420 agcattacca gaaccaaatt tggaatccgt cgccttgaat agacttgatc gctttcagtt 6480 gatgcaaaga atactgcaag atttttggaa acgttggcgt atggagtatc tgtgtcaact 6540 tcaaggaagg ttcaaacggt ggaacagtcc gatcaagatc gaagtcggga gaatggtaat 6600 tattcgtgac gaaaacatgc cccccatgaa atggaagatg ggtaggattt gcaagcttca 6660 tcctggcacc gatggagttg tacgcgtcgt cacattgaag accagctcag gagaattgaa 6720 aagagctgta gagaagttgt gtttcttacc aatctcagag gatcagccta accagcaccc 6780 agcagaacaa aaatgaatcc caccctttcc ccatttccct tcgaagagga ttttctttta 6840 tttcagaaat cataccaatt tctgggtggg tgaggg 6876 // ID BEL-23_AA-I repbase; DNA; INV; 5409 BP. XX AC supercont1.344; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-23_AA_; KW BEL-23_AA-LTR; BEL-23_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5409 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.344; Positions 458526 453118. XX CC Positions [4426-5007] - Integrase core CC 'GCAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 75..1322 FT /product="BEL-23_AA-I_2p" FT /translation="MSPRKTPIKTPAEKQSTENNDEDDISALIHSRGQVEA FT KVTRLKSTLVQAELDETELTPPQVRLFMKKLESANAEFSSIHQRIMEKVPA FT TIREEQDECYLHFDNLHDEVAMFLETRLERFSSSPGAVPTGQHTASQPMIV FT QQPLPRAIPTFDGRYEHWEKFKMMFRDVVDKTNESDRMKLYHLEKSLVGDA FT AGLIDAKTISDGNYQRAWQLLEERFEDKRRIIHLHIKEIFNTQKLTSTSYG FT ELRKLVESFSNHVENLKFLGQQFNDASEQFLVYILSEALDDATKMHWESTV FT QRGVLPAYDDMIKFLKDRVSVLERCQHQGHDPNPKQRNKPSSSIGKMSVQK FT SNAVTTSSSSSKGPCHICGNGHFSYHCPMFKQLSASQMSDKVLQLPPQGTS FT SQRVHIEQDVSQVQQAAQHSSP" FT CDS 1366..5409 FT /product="BEL-23_AA-I_1p" FT /translation="MSGIKHPDQSQPPAVVHPSTSVVDHPVSTTCASKHVQ FT AAKTVLLLTAVVRVTDRNNRFHQCRVLLDSGSQVNFVTEEMANRLGMPKTS FT ANVSITGINALRSLARDKVIIKMQSSDGHYHANIECLVTPKVTGTIPSTRI FT EVSHWKLPQGIQLADPEFFKPGKIDMLIGGEHFFDILKPAHIYMGDNEPQL FT RDTHLGWVVAGVINEPHMLSATIQHTNLASLDSIEELIHQFWKVEEVPNAS FT PLSSEQQACEAHFQSTYQRDSTGSFIVNLPFKDNMDQLNNCRSLALKRFFM FT LEKRFERNPVLKQQYVDFIKEYEDLGHCREIKEADDPPNHCSYYLPHHAVL FT RPSSSTTKCRVVFDASAKSEPSKMLLNNVLQVGPVVQNGIFQITLRFRKHA FT YAFSADISKMYRQVLVAQPDRRFLRIFWRSDRSHPMKVLELNTITYGTACA FT PYQATRCLVQLAKDDGPEFPIGAQILTDDFYVDDTLSGANTIEQAIECQCQ FT LKALLAKGGFHIHKWCSNPEEFIQRIPIEDREQQVPLNEYGPNKVIKVLGI FT LWDPNIDTFMIANPPSIPPSSSQPTTKRIIYFEVAKLFDPLGLFALVIVIA FT KLLVQQLWRTSAGWDDPIDDEIQDRWSTLQSTLPELGRIHVPRCVTFPDTI FT AHEVHGFADASNVAYGACIYLRSIFTDGSACLRLLCSKSKVAPLHELSIPR FT KELCSALLLVNLVVQVIKTLQMPFREVVLWSDSTIVLAWLKKPLDQLQTFV FT RNRVATIQSASSEYQWRYIRSADNPADIVSRGQLPEELRKNSLWWNGPRFL FT TSANYNIEVTEEIPEFELPELKSNIAAPVVRCDPFQFFDRFSSFHKIQRIM FT GYVLRFISRCCKSRSTDLGQGTNLSVQELRRSTEAIIKVIQQKHFGDEIQR FT VISNQPCKRLGNLNPVYEDGLLRVGGRLDHSKLPYAAKHQIILPDKDPLTR FT KLIITLHEEHLHIGQAGLISAIRQRYWLLNARSTVRQVTRACVKCFRVRPG FT ETSQLMGNLPSSRVVPSPPFAVTGVDYAGPFLVKQGTHRPKIVKAYVAVFV FT CMATKAVHLEVVSDLSTDAFLASLKRFISRRGMVQEIHSDNATNFHGANNE FT LHELYRQFQDQRTVDEIQHFCQVREIQWHFIPPDAPEFGGLWEATVKSTKT FT HLKRVVGNVKLTFEELATVMAEIEAILNSRPLFSISNDPSDPQVITPAHYL FT IGRPLTAPAEPSLEDINVHRLNRWQHLQLMREHFWRAWTKDYLNSLQPRKK FT NLRKTTNLRPGLVVLLHDKTQPPLNWKLGRITAVYPGDDGLVRAVDVFASS FT TTFRRSINKVSILPIEDNLNPSLEEYVDTRSQPGGE" XX SQ Sequence 5409 BP; 1553 A; 1406 C; 1222 G; 1228 T; 0 other; tatggtccaa tcgaaccgga taaggagatt catcgtatgt ggcgccatcg tcgcttctgg 60 agaggataac aaggatgtca ccgagaaaaa ccccgatcaa gactccggct gaaaagcagt 120 caaccgagaa caacgacgaa gatgacatca gtgccctgat ccattctcgt ggtcaggtgg 180 aagcgaaagt taccaggctg aagtcaacct tggtacaagc cgagttagat gagactgagt 240 taacaccccc acaagtgaga ctgttcatga agaagttgga gtcagcaaac gcggaatttt 300 catccatcca ccagcggata atggagaagg ttcctgccac catcagggag gaacaagacg 360 aatgctatct tcacttcgac aaccttcacg atgaggtagc aatgtttctc gaaactaggc 420 tcgagagatt ttcatcatcc ccaggtgctg tgccaacagg acagcacacc gccagtcagc 480 caatgattgt gcaacagccg cttccacgtg ccatacctac gttcgatgga aggtatgagc 540 attgggagaa attcaagatg atgttccggg atgtggtgga caagaccaac gagtccgacc 600 gcatgaagct gtatcatctg gagaagtcgc tagtaggaga tgcagccggc ctgatagacg 660 caaaaactat cagcgacggc aattatcaac gagcatggca acttctggag gaacgtttcg 720 aggacaagag acgtatcatt catctgcaca tcaaagaaat cttcaacacg cagaaactga 780 ccagtaccag ctacggcgag ttacgcaaac tcgtcgaatc attttccaac cacgtggaaa 840 acctgaagtt cctaggacag cagttcaacg atgcttctga gcaattcctg gtgtacatcc 900 tatcggaggc attagacgat gccaccaaga tgcactggga gtcaacggtg cagcgtggag 960 tattaccagc atacgatgac atgatcaagt tcctgaaaga tcgtgtttcc gttttggaaa 1020 ggtgccagca tcaaggacac gatccaaatc cgaaacaacg aaacaagcca tcatcatcca 1080 tcggcaagat gtcggtgcag aaaagcaacg cagttaccac cagttccagt tcatccaaag 1140 gtccatgcca catttgtggc aacggtcatt tcagttatca ttgcccgatg tttaagcagc 1200 tatcagcatc ccagatgagc gacaaggtgc tacaattgcc tccgcaaggg acatcaagtc 1260 agagagtgca catcgagcaa gacgtgtcgc aagtgcaaca agcggcacaa cactcttctc 1320 catgaagaca gcaatcggaa caacgaccaa gaagtgaagc cagccatgtc tggtattaag 1380 catcctgatc aatcacaacc gccagctgtt gttcatccgt ccacttcggt ggtggaccat 1440 cccgtgtcca ctacatgcgc cagcaaacat gtccaagcag caaagacagt cttgttgctt 1500 accgctgtcg tccgtgttac cgatcgaaac aatcgatttc atcagtgccg tgttctgctc 1560 gacagtgggt cacaagtcaa cttcgttact gaggaaatgg ccaatcgcct cggtatgccg 1620 aagacatcag ctaacgtgtc aatcaccgga atcaacgccc tgcgaagcct tgccagagac 1680 aaagtaataa tcaagatgca atcatccgac ggacattacc acgccaacat cgagtgtctg 1740 gttacaccaa aggtaacagg tactattccc agtacacgga ttgaagtcag ccactggaag 1800 cttccgcaag gcatccaact cgccgatcca gaattcttca aacctggaaa aatcgacatg 1860 ctgatcggag gagaacactt cttcgacatc ctcaagcccg ctcacatcta catgggtgac 1920 aacgaaccgc aacttcgaga cacgcatcta ggatgggtgg tggcaggagt aatcaacgag 1980 ccacacatgt taagcgctac cattcaacac acgaacctcg catcgctcga ttccatcgaa 2040 gagttgatac atcaattttg gaaggtggag gaagtaccaa acgcttctcc actatcatcc 2100 gagcaacaag catgtgaggc gcactttcaa tcaacgtatc agcgtgattc aacgggtagc 2160 tttattgtga atctgccatt caaggacaac atggaccagc tgaacaactg tcggtcactt 2220 gctctcaaac gattctttat gttggagaaa cggtttgaac gcaatcccgt gctcaaacag 2280 caatatgtcg atttcatcaa ggagtacgag gatctcggcc actgccggga aatcaaggaa 2340 gctgatgacc ctccaaatca ctgctcctac tatttgccac atcatgcggt tcttcgtcca 2400 tcgagttcaa caactaaatg ccgagttgtc ttcgacgcca gtgccaaatc ggagccatca 2460 aaaatgttac tcaacaatgt cctacaagtc ggtccagtgg tgcaaaacgg aattttccaa 2520 ataaccctac gattcagaaa gcatgcctac gctttctcag cagacatctc gaagatgtat 2580 cgccaggtcc tcgtggctca accagacaga cgattcctgc gcatcttctg gcgatcagac 2640 cgatcgcatc caatgaaggt actcgagctg aacaccataa catatggcac agcatgtgca 2700 ccataccaag caacaaggtg tttggttcag cttgccaagg acgacggacc agagtttccc 2760 atcggcgccc agattctcac agatgacttc tacgttgacg atactttatc tggtgccaac 2820 acaatcgagc aagccatcga gtgccagtgt cagttgaaag ccttgctggc caaaggaggt 2880 tttcacatcc acaaatggtg ctccaatccc gaggaattca tccaacgcat accaatcgaa 2940 gatcgtgagc agcaagtacc actcaacgag tacggtccga acaaagtcat caaggtactc 3000 ggaattctgt gggacccaaa catcgacact tttatgatcg ctaatccacc aagcattccg 3060 ccatcatcga gtcaaccaac tacaaagcga atcatctatt tcgaggtggc aaaactattc 3120 gaccctctcg gattgttcgc tcttgttatt gtcatcgcta aactactggt acaacagctg 3180 tggcgcacgt cagccggttg ggacgatccc attgatgatg aaatccagga cagatggtca 3240 acgctgcagt caacattacc cgagctaggt cgtattcacg ttccaagatg tgtcacgttt 3300 cctgatacga tcgcacacga agtacatgga ttcgccgatg catcgaacgt tgcgtatggt 3360 gcatgcattt atctgaggag tattttcact gacggatcag catgccttcg cctcctgtgc 3420 agcaagtcaa aggttgctcc cttacacgaa ctgtccatcc cacgaaagga actttgttca 3480 gcgcttcttc tggtaaattt ggtcgtccaa gtcatcaaaa cgcttcagat gccttttcgt 3540 gaagttgtgc tttggtcgga tagcacaatc gttctggcat ggctgaaaaa acctttggat 3600 caactacaaa cgtttgtgag aaatcgtgta gccacgatac aatcagcatc cagtgagtat 3660 cagtggagat acatccgttc agctgacaat cctgccgata tcgtttcccg tggacaacta 3720 ccggaggaat taagaaaaaa cagcctctgg tggaacggcc caaggttctt gacatcagcc 3780 aattacaaca tcgaagtcac cgaagaaatt cctgaatttg aacttccgga gttgaaatcc 3840 aatatagcag caccagttgt cagatgcgat ccattccagt tcttcgaccg cttcagttcg 3900 ttccacaaaa ttcaacgcat tatgggctac gttttgcgtt tcatcagcag atgttgcaaa 3960 tcacggtcaa ccgatcttgg tcagggaacc aatctctccg tccaagagct acgtcgatca 4020 accgaagcaa tcatcaaagt aatacaacag aaacactttg gtgatgaaat tcagcgtgtg 4080 atctccaatc aaccatgcaa gcgactgggt aacctgaatc cagtatacga agacgggctg 4140 ttacgagtgg gtggacgatt ggatcactca aaattgccgt acgcagcaaa acaccagatt 4200 attctacccg acaaggaccc gctcaccaga aagctgatca ttactctgca tgaagaacat 4260 ctacacattg gacaagctgg tttaatcagt gccattcgac agaggtactg gttgctcaac 4320 gcaaggtcaa cggtgcgcca agttacacga gcatgcgtga aatgcttccg tgtaaggcct 4380 ggagaaacat cgcaactaat ggggaacctt ccatcatcac gtgttgttcc atcgcccccg 4440 tttgctgtaa ctggcgtgga ctatgccggt ccattccttg ttaagcaagg cacacatcgt 4500 ccaaagatcg tgaaagcata tgttgcagtt ttcgtttgca tggcaaccaa ggctgtgcat 4560 ctggaagtcg tgtccgattt gtcaacagat gcttttttgg cgtctttgaa gaggttcatc 4620 agtcgaagag gaatggtcca agaaatccat tcggacaacg ccacgaattt ccacggagcc 4680 aacaacgaac ttcacgagtt gtatcgtcaa ttccaagatc aacgcactgt cgatgaaata 4740 caacatttct gccaagttcg cgagatacag tggcacttca tcccaccaga tgcaccggag 4800 tttggtggtc tttgggaagc aacagtcaag tcaactaaaa cacacctcaa gcgagtcgtt 4860 ggcaacgtaa aattaacctt cgaggaattg gcaaccgtta tggccgagat tgaagcaatc 4920 ctgaattctc gcccattgtt cagcatttcc aacgatccat cagacccaca ggtcatcact 4980 ccagcacatt atctgatagg tcgcccactg actgctccag cagagccgtc cttggaagat 5040 atcaacgtac atcgcctaaa ccgttggcaa cacctgcaat tgatgcgaga gcacttttgg 5100 cgagcatgga cgaaggatta cctcaattct cttcaacctc gaaagaaaaa tctacggaaa 5160 acaacaaacc ttcgccctgg tttagttgtt ttactccatg acaagacgca accaccgctg 5220 aactggaagc taggacgcat cactgctgtg tacccaggag acgacggttt agttcgagcc 5280 gtcgatgttt ttgccagcag tacgaccttc cggcgttcaa tcaacaaggt ttccatcctt 5340 cccatcgagg acaacctcaa cccttcgctg gaagaatatg ttgatacgag gtctcaaccg 5400 gggggagaa 5409 // ID Polinton-2_SM repbase; DNA; INV; 15944 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Autonomous Polinton from Schmidtea mediterranea - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-15944 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Polinton-2_SM from the freshwater planarian genome."; RL Repbase Reports 8(12), 2244-2244 (2008). XX DR [1] (Consensus) XX CC Polinton-2_SM was reconstructed as a consensus sequence built CC from ~50 copies, which are ~98.7% identical to the consensus. The CC genome contains several hundred copies of Polinton-2_SM. This CC transposons is characterized by 6-bp TSDs and canonical TIRs. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 4246..3779 FT /product="TBPpol-2_SM" FT /note="TBP transcription DNA-binding factor." FT /translation="MLSNINYKGRCSVSHMNFPFGKPQQIVDRSGKYPIMF FT FKSGKCRIMGCKQPLKVEDLPFTIEDVQIQSITVTADIGVDINLYDLLNKI FT NCIYEPEIFPALRYVKYNPLCVNVFASGKIVILGLKTLDYHEIVGNIIDDI FT KSVIENDEILKFLLSIT" FT CDS 4338..4958 FT /product="ATP-2_SM" FT /note="Polinton ATPase." FT /translation="MELRNDAVYQIVGPSGSGKTLFVCKLLQSNMFKSKFN FT KIYWHRGADDEHGMTQDQFCKIKMIIIKGFDKNWVSRLRQGDVIIIDDLYQ FT EANKEKDFNNLFTKIARHKGVTVIFITQNLFHQGGSHRTRNLNVQYLIVFK FT NPRDATVIDFLARQAYPNNRNFLINAFQDATKIPHGYIFLDFTQQCPDDLR FT VRTDIFNKEGAMFYKQS" FT CDS 5578..6579 FT /product="INT-2_SM" FT /note="Polinton integrase." FT /translation="MEKIWTNPKDEAGFSGVEKLKKKVTNSKKETQKWLSG FT QLAYSLNKPMLKRFPTRAYKTYGINDLWQMDLMEMIPFSKINKGYKYILTC FT IDVFSRFVRAIPVKTKSANEMALAIKSMFKNGKPINLQTDLGKEFYNSKVK FT EIIKGINHYSVFSQFKAAHVERFNRTLRDKLKRYFVHQGNKTWINVLPKMI FT YSYNHSSHRGLKGLKPIDVTSKDSFIQEEQIKKPKYNVDDYVRISKISASP FT FIKNFDNNWSDEVFQISKINTAQNPIMYVIKDGENNEVSGKFYEQELQVIP FT PPTVFRIEKILKTKMVNKDKQYYIKWHGYQNPSWISSKDLVK" FT CDS 6591..7580 FT /product="PX-2_SM" FT /note="PX protein from Polintons." FT /translation="MILPSNSCPTIHPDNQASKFMVNLQNPIYLQGNWEVA FT LQDFTFVYSTFPFYSKAKIEYKEIVSKSVSSLYFIEPEKEEMPIKEFNEHS FT YMYITKGVLHIYCNFYPFQISFEDTQSAKRMGVSEPIITSPSETFQIVVPP FT NTDMNNVVKIKLTYNTVIQSVFSFRDNLYLTNLIDLYDYIVYNCPHIFTSL FT QIKDKIFTFSIHDNITEVVFDKNLVSSLGLEKQTYNVKQFESIYKAKLYDH FT HQQMFIYSNIIEPVIVGDSLVPLLKAVWIKKYEQDEVVQIVLDNPMYLSLS FT TSCINNIEINIRDDGGKLINFPKNSKTHLTLHFRKLND" FT CDS 11032..14709 FT /product="POLB-2_SM" FT /note="DNA polymerase B." FT /translation="MNHSDECHDLMKKYYRHQFNGIEIFICPACKTYHAQP FT YLDFFFDHKVHLTQFKQNKLDFFRLIFKADDFKIKKRKSDIGLNLGAPKIS FT KNEIIVISDDEENIQIGGSVDISNDPIVELDLRGSRKLLKFNIDEFNVNQI FT TLTDRIITRLKIEINKMGNVKIQFAVNIQFEKDDITKDEFISNNAEVFSDN FT FLVEGVRKLNEKIERFTQLGSGWTISRIREIHFILTKYSQINRLTGHGFIE FT TPIALKGKRAIINIKNSDNLCFIYSILALLKHDVLVNNRSNMYSYTNFLNE FT LNYDEIDDCPMRLCNISRFEEKNPGLAINVLSYNPTVINNDENENDDFVPH FT HPHLDIIHRTKVTDVDPLYLLLLENDDQYHYTAVIDLQRLMNSHINDIVSV FT RIQCQWCPRCLNGFRLQDAFNSHYALCTKNQVGTTLYTLPQNKYLEFGDYS FT KTVTPPFVIYADFESILPIDIRFHQKHLPISAGLLLINNFNNTKSYFSFIG FT LDCVLEFLKKIEEIASTIVLPYFNNYANSTMNILSFAQDRDFKTCSNCYLC FT KKPIRNRVRDHDHFSGKYLGAACRSCNISRQIRRHLPVIFHNLRGYDLHHI FT LKYGLSSFPSWGLNIIPTTTEKFISLIAHIDKVSVRFIDSYQFVSASLDNA FT VKTLSNFPLTDSVFTGSIMGCKGIFPYNFAKSLEILETTFELPPIWPEVTE FT SQYLRAQKIWTENNCNTLLDYMMIYLKLDVFLLADFFQQFRSKSIAHNGLD FT PLNFYGIPGMSWASALMTLKEPIELLTDMEMYNFYEGGIRGGITFVNKHYV FT KTSDDTELLYIDINNLYGWALSQSLPFGDFSWVINNLDSILDECMSTDIEM FT LSYGYTMEVDLEIPESIHEFLDKFPVAPEKMCPPDSKVEKLMLTHYPKKNY FT VIHWRLLKMFVSLGVKVVKIHRAIRFKQAKIFKEYIDTNTCLRADATNNLD FT KNYFKLLNNSLYGKSVENLKKRMNLRLCNSFEKMVTYSSKPSFRKCIKIDD FT DLIAAHLNKELICLDRPSYIGQTVLDLSKLRMYQLQYLELEKYRNLFNCRI FT DIVAGDTDSFFLEIKNCKLSTLLPAMVSDNLLDTSNYNISHPLYSRKLDSV FT IGKFKDESKGLGYKEWVFLRPKCYSLLGETESNKAKGITLEGTEIKHLSYL FT DCYRNDIIFSVPQTRIGTRNHQLFTFKNSKIALSNNDDKRQWVGKNKSFAF FT GHYLGINANEDIEAFI" FT CDS 7974..9191 FT /product="PY-2_SM" FT /note="PY protein from animal Polintons." FT /translation="MICGKSELCIFDRPSPQAVIEYGAFEEVFPMNSITDS FT RNDVEFYINGSQTEYLDLNDTLLTVQIKVVNVDRKNLVDTSDVKPNNYMFH FT TLFKDAILGFNHIKIEGGNNTYAHKALIETILNYNGDTKNTCLTPMGYGNS FT EERKSWIKDSKVFTMCSSLQFDFMDQPKYLLPGVNVHIRLKRSDSALSLFS FT KTGEPTCQLLDAKLMIRRVRVESSVLAGHQLGLNSKHAIYPIKTKEIVQFA FT IGKGASSFYKEQIFGDRRMPNFILVTFQSESQYNGSYTDSSSKFRHYNVTS FT LSLSKNTDYRETYTQDFENNNFCTTYMQSIVRNMGYLDKNLNCGISLDDFK FT SKYPFFTFVLAPDFDLNQSQLPQNGNLRLDIKFSKAVEEPVHVIIYGVFEN FT EIQITANRTVLV" FT CDS 7576..7973 FT /product="PW-2_SM" FT /note="PW protein from animal Polintons." FT /translation="MINPDIISLYSNSQTGGEIPYFIGKQYGSGWLRTIGR FT FALPILKRIGSFGMKTAKDVIMNDKKILPSLKSNALSEISKALPAVSTMIP FT GVANMFKSEEPQPEYTASGGSMPKRRKHGRHINNRMKGHGTIFQX" FT CDS 9252..9614 FT /product="PRO-2_SM" FT /note="adenoviral cycteine protease." FT /translation="FYLISSIQVDKIGKQLLGNEFVGVYPLDKIPYLDDNN FT AIVVNTQSSNLGGEHWIAVYVKPNKIHVFDPMGFYYPALLVSKLQTIGKSI FT VYNKHRYQNPLTTTCGQHCLVWLNSMYKRIDFK" XX SQ Sequence 15944 BP; 5775 A; 2315 C; 2383 G; 5468 T; 3 other; agtagtagga tgtgaacccg tcagatacaa ccccgcttgg cttacgtaat tattacgtaa 60 catttgacct ctgtctacgc tgttgctacc tggagattga cctctgtcca ctagttgcta 120 cctagtaatt gacctttggt cagatgacct catgattacg ttacgtgctc actggtcact 180 cactacctgg agacacactg attacctaac ccttattata ttacgtggac aaacacacat 240 attacgtcaa atggaccaac acaccgatta cctaactgtt attaagtcac gctgttgtgg 300 aacaacacac ctaccttcta gaaccttaca cacacctacc aacttccaca cacctacctt 360 ctagaacctt ccacacacct acgacattcc acacacatac caacttccac acacctacca 420 agacctaccg ttcntaccga cacacaccta tccacaccga tccgcaccga tcagcaccga 480 tcagctccca tcatcgccca acacgcctac gactaagtcg tagacttaac aaaatcactc 540 actagtacta ttgrtattat tattatactg tttacttcat ataaacgtaa gagtttcgtt 600 tacctcatag attcatgtta cttgcatgtg atatattaaa aaattatatt tacatgaaat 660 tttcaattgt taattatatt attacataaa aaattataaa ataatcatta aacataataa 720 attaacatca aactcgtctt aaaataaata ttcaatccta aattatttag cttactggct 780 attttgttag tagtatatta cgtaacaatt ttatcaaaat aatcataaaa ctttaataat 840 tttcatgaaa atttaagttt acaacaatta tagtatattg atagtttatt taattaaaaa 900 taagtcactc aaaacttaaa tttgaacact ggatttacaa aattcaatat tttaagaaaa 960 taattcaaaa agtttaaaat ttcaacttaa ctaaatgtaa aaaattaaat tatacaagat 1020 ttactttacc aatttctata aaattttaaa tttacagcaa ataaaatata cagatttata 1080 ttctattaaa aaatttaatt tttaaaataa aaaataattc gcccactata ggatttgaac 1140 ccatgactca acgaattatt accgacacca gactgttaca agtctcattt tattacccaa 1200 atttctaata ataataataa taataattac aaaacaatag aataaaaaga caaacacatt 1260 tttaagacaa catttattta aataaaagct tcaatatctt ccacatcagt ggcattgata 1320 cctaaataat gtccgaatgc aaaacttttg tttttaccta cccattgacg tttatcatcg 1380 ttgttactta aagcaatttt acaattttta aaagtaaaca actgatgatt tctcgtccct 1440 attctcgttt gaggaacaga aaaaattgta tcgtttctat aacaatcaag ataagaaaga 1500 tgctttattt cagttccttc caaagtaatt ccctttgctt tattactttc agtttcacct 1560 aaaagactat agcattttgg tcgtaagaaa acccattcct tatatccaag tcccttactc 1620 tcatctttga atttacctat tactgagtca agttttcttg aatacaaagg atgaccaata 1680 tcataattcg aagtgtccaa aagattatcc gaaaccattg caggtaaaag agtacttaat 1740 ttacagtttt taatttcaag gaagaaagag tcagtgtcgc cagcaacaat gtcaatccta 1800 caattgaaaa gattcctata tttctctaat tcaagatact ggagctgata catccgtaat 1860 ttcgataaat caagaaccgt ttgaccgata taactaggtc ggtctaaaca aattaattct 1920 ttatttagat gagctgctat tagatcatcg tcaatcttta tacactttct gaaagacggc 1980 tttgatgaat aagtaaccat cttttcaaat gaattacata atcttaaatt cattcgtttt 2040 ttaagatttt ccacagattt tccataaaga ctattattta aaagtttgaa atagttctta 2100 tcaagattat ttgtagcatc tgctctcaaa caagtgtttg tatcaatata ctctttaaaa 2160 atctttgcct gtttaaacct tatagctcta tgaatcttaa caaccttaac accaagagaa 2220 acaaacatct ttagaagtct ccaatgaata acatagtttt tcttgggata atgggtgagc 2280 attaattttt caactttaga atccggtgga cacatctttt caggcgccac agggaattta 2340 tccaaaaact catgtacact ttcaggaatt tcgagatcaa cttccatcgt gtaaccatat 2400 gaaagcgttt caatatctgt actcatacac tcatccaaaa tagaatcaag attattaatt 2460 acccaactaa aatcaccgaa aggaagactt tggctaagag cccaaccata cagatttttg 2520 gtacattaca gtttttaatt tcaagcaaga aagatgctta agttccaaag tacaataaat 2580 ataataatca caaaaacaat ttataacaca aagtaaaaag ataaacaatt tttaaatata 2640 aatctattta gttaacaata gcaattctag cattatcaat tctccatatt ggtgacataa 2700 cttcattttg atcaacttta taaccttgaa tagtaatatt aatgtaccaa ctttccattt 2760 ctttgaattg gatttcttca aaagcgtcat gatatcgagt gtttctagaa ttaatataat 2820 tgttttcact gcaacataat ttcttattga tttgatgtaa aattttattg attgaatctg 2880 catcttcctt cgtaatattc acctgaagtt ttgaaaattg agtttgactc aatttagcag 2940 ctttggaaaa tttatattct atttcattca ctaagaagta cattctatga ccgtgatata 3000 tctgaacagg cttcaaattt acatcttttc ttgttcgagt atttttagct gtttcgttct 3060 gattgtttac catcacggat ttatcgctaa aaagtccttg catctgaaaa atgaaaataa 3120 tttacaaatt tcattattta tacttaaaat aacacgtgtg cgtccaatca taatcatttt 3180 aataaaaagg tttactatat aaacaaaatt atgaaaacca ataaaaacat aaaacatata 3240 taaaaaagac aacaaaattt aaataaaaat aatttatttt taattaaaat cttcgctcca 3300 tcttgcgagc atatagagaa gatctgcttt cgccttttcc gccctccaat tggtgcaaca 3360 attgcagggt gatccgcaaa taggttctgt ttttattgga ktttctttct tttccttata 3420 ctccataaga atcttttcga cgctgatacc agtcgcctca aagattctgt atgtgatcac 3480 ttccggagcc cagaatgcca tatttttccc tagttcggga tatgagagaa ttatcctttc 3540 gataaggaat tgaatagcca tttctaaaaa cataaattaa aacatatcaa tttataataa 3600 aagaaactta ccatcagttg aattgtttct cattttgtta tttgaatgaa taactcactt 3660 gtatttatac taatttttaa cacacatcag actcatttca acgaatcaga atcattcaca 3720 caaattacaa acacatcaca gccaattaaa actatacaca actattacat ctacgttagg 3780 taatgctaag taaaaacttt aagatttcat cgttttcaat gactgactta atatcatcaa 3840 taatatttcc aacaatttca tgataatcca atgttttaag acctaaaatt actattttac 3900 cactagcaaa cacgttaaca cataaaggat tatatttaac atatcgtaaa gctgggaata 3960 tttcaggttc atatatacaa ttaattttgt tcaataaatc atataaatta atatctactc 4020 caatatcggc agttactgta attgattgaa tttgaacatc ttcaattgta aatggtaaat 4080 cttcaacttt tagaggttgc ttacatccca taattcgaca cttacctgac ttgaaaaaca 4140 taattggata cttaccactt cgatctacaa tttgttgcgg tttcccaaat ggaaaattca 4200 tgtgagaaac tgaacatctt cctttataat ttatgttact aagcatctat gacgtaatta 4260 tttttgaata ttttttattc gtgtataact attaatttaa acaaatataa acaatataat 4320 taaagttaat atttaaaatg gaattaagaa atgatgcagt atatcaaatt gttggtccaa 4380 gcggaagtgg gaaaacatta tttgtttgta aattattaca atcaaacatg tttaaatcaa 4440 agtttaataa aatatattgg cataggggag ccgatgatga acatggaatg acacaggatc 4500 aattttgtaa aataaaaatg ataattataa aagggtttga taaaaattgg gtatctagat 4560 tacggcaagg tgatgtgata attatcgatg atttatatca agaagcgaat aaagaaaagg 4620 attttaataa tttatttaca aagatagcta gacataaagg tgttacagtt atttttataa 4680 cacaaaatct ttttcaccaa ggaggatctc acagaactcg caatttgaat gtccaatatt 4740 tgattgtttt caaaaatcct agagatgcaa ctgttattga tttccttgca agacaggctt 4800 accctaataa tcgtaatttc ttgataaacg cctttcaaga tgcaaccaaa attccacatg 4860 gatatatttt tcttgatttt acacagcagt gcccagatga tttacgagtt agaactgata 4920 ttttcaataa agaaggtgca atgttttata aacaaagttg aaaatgatac tatatacatc 4980 caatcatgct acttaaatct cgtgttaaac atccaaaatt atcaaaactt cgaaaagtgt 5040 ttaaaaagac tcgtgaacca ctgattccgc caacgactag tttcacccca tctaaaaaac 5100 ttataactga aaatattcct gcacaaaatc ttatttcaca caatttaaaa acaaaatcaa 5160 ttccgaataa aaatgaaaag tctataaaat caagtcatcg tagttttccg ttttttaatg 5220 ctttactaaa ggctccaagt gtgaaaagaa tgaatatctt acaatctttt ccaaattacg 5280 ttattgacga tttacttgat attatagtca aagttgtcaa aggtaaaatt gaaatcagta 5340 aatcaagtaa aaaggtatta aacaaacaca aaaaaccact gttatcactc gtaaattcaa 5400 aaaatcgaat gcagatgcgt aaaatcgtat ataaacaaca aggtggtttt attgcggcgt 5460 tgctaccatt agcattatca ttattaggtg gagtaatagg aaattcatta tcataaacaa 5520 tgggtacaac tacttcatta cctaatctta tacttaattt taagtctacc tgttgtaatg 5580 gagaaaattt ggacaaatcc aaaagacgaa gcaggtttca gtggggtaga aaagctaaag 5640 aaaaaagtta caaactcgaa aaaagagacg caaaaatggt tatccggtca actggcttac 5700 agtttgaaca aaccgatgtt gaaaaggttt ccaacaagag catataaaac atacggaatt 5760 aatgatttat ggcaaatgga tttgatggaa atgataccat tttctaaaat taacaagggc 5820 tacaagtata ttttgacatg tattgatgtt tttagtagat ttgttcgcgc tataccagtc 5880 aaaacaaaat ctgcaaatga aatggcttta gcaataaaat caatgtttaa aaatggaaaa 5940 ccgattaatt tacaaactga tttgggtaaa gaattttaca acagtaaagt taaagaaata 6000 ataaaaggaa taaatcatta ctcagtattt tcacaattca aagcagcaca tgttgaacga 6060 tttaatcgaa cattaaggga taaacttaaa aggtatttcg tgcatcaggg taataaaaca 6120 tggataaatg ttttaccgaa aatgatttac agttacaatc attcgtctca tagaggttta 6180 aagggattga aaccgataga tgtaacttca aaagatagtt ttattcagga agaacaaatt 6240 aaaaagccaa aatataatgt cgatgattat gtaagaatat ctaaaatatc agcatcacca 6300 ttcattaaga attttgataa caattggagt gatgaagttt ttcaaatctc aaagatcaat 6360 acagctcaaa atccgatcat gtatgttata aaagatggag aaaataatga ggtatctggt 6420 aaattctacg aacaagagct acaagttatt cctccaccaa cagtatttag aattgaaaag 6480 attttgaaaa ctaaaatggt gaataaagat aaacaatatt atatcaagtg gcatggatat 6540 caaaacccat catggatatc ctcaaaagat ctagtaaaat gaatttctac atgattttac 6600 cgagcaatag ttgtccaaca attcatccag acaatcaagc cagtaaattt atggtcaatc 6660 ttcaaaaccc gatttattta cagggtaatt gggaagtagc tttgcaagat tttacatttg 6720 tgtatagtac gtttccattt tatagtaaag caaagattga atacaaagaa attgtatcaa 6780 aatcagttag cagtctttac tttattgaac cggaaaaaga agaaatgcca ataaaagaat 6840 tcaacgagca tagttacatg tacataacta aaggagtttt acatatttat tgtaatttct 6900 acccattcca aatatcattt gaagatacgc aaagtgcaaa acgaatgggt gtatctgaac 6960 caataattac aagtcccagt gagacatttc aaatagtagt tccaccaaat acagatatga 7020 ataatgtagt aaaaataaag ttaacatata acactgtaat tcaatcagtt ttctcatttc 7080 gtgataattt gtacttgaca aatcttatag acttgtatga ttatatagtg tacaactgtc 7140 ctcatatttt tacatctctt caaataaaag ataaaatatt cacattcagt attcacgata 7200 atattacaga ggttgtattc gataaaaact tggttagttc actcggatta gagaagcaaa 7260 catataatgt gaaacaattt gagagtattt ataaagcaaa gttatatgat catcatcaac 7320 aaatgttcat ctatagtaac attattgaac cagtaattgt tggcgattct ttggttccct 7380 tattgaaagc agtctggatt aaaaaatatg aacaagatga agttgtacaa attgtattgg 7440 ataatccaat gtatttatcg ctttccacgt catgtataaa caatatagaa attaatattc 7500 gtgatgatgg tgggaaactt atcaactttc caaaaaattc aaaaacacat ttgacattac 7560 attttcgtaa attaaatgat taatccagat ataatttcat tgtatagtaa tagccaaact 7620 ggtggcgaaa taccatattt tattggtaag cagtatgggt ctggttggtt acggacaatt 7680 ggaagatttg cattaccaat tcttaagcgt ataggcagtt ttggaatgaa aacagcaaag 7740 gatgtgataa tgaatgataa gaaaatatta ccatcattaa aatcaaatgc tctgtctgaa 7800 attagtaaag ctttaccggc tgtatcaaca atgatacctg gtgttgcaaa tatgtttaag 7860 agtgaggaac ctcagcctga atatactgca tcaggtggat ccatgccaaa acgtagaaaa 7920 catggtaggc atataaacaa tcgaatgaaa ggacatggaa ctatatttca aaaatgattt 7980 gtggaaaatc tgaattatgc attttcgatc gtccatcacc tcaagcagta attgaatatg 8040 gtgcatttga agaagtattt ccgatgaatt caattactga tagtagaaat gatgttgaat 8100 tttacattaa tggttcccaa actgaatatt tggatttgaa tgatacttta ttaactgttc 8160 aaattaaagt tgtaaatgtc gacagaaaaa atttggtgga tacttcagat gttaagccaa 8220 acaattacat gtttcataca ctttttaaag acgcaattct tgggtttaat catattaaaa 8280 ttgagggagg taacaatact tatgcacaca aagcattgat tgaaactata ttaaattata 8340 acggtgatac taaaaatact tgtcttacac ctatgggata tggaaattct gaagaaagaa 8400 aaagttggat aaaagactca aaggttttta caatgtgttc ctcactacag tttgatttta 8460 tggatcaacc aaaatattta cttcctggtg tcaatgttca cattagattg aaacgatctg 8520 attctgcatt atcattattt tcaaaaactg gggaaccaac atgtcaatta ctagatgcta 8580 aattaatgat tcgaagagtt cgtgttgaat cgtcagtatt agcaggtcat caacttggtc 8640 ttaattcaaa acatgcgatt tatccaatta aaacaaaaga aattgttcag tttgctattg 8700 gtaaaggtgc atcatcattc tataaggaac agatatttgg agatagaaga atgcctaatt 8760 ttattcttgt aacttttcaa agtgaatctc aatacaacgg ttcatatact gattcaagct 8820 caaaattcag acattataat gtcacttcat tatctttatc aaaaaacaca gattatcgcg 8880 aaacatatac tcaagatttc gaaaataaca atttctgtac cacatatatg caaagtattg 8940 tgagaaatat gggttattta gataaaaacc taaattgtgg aatatctctt gatgatttta 9000 aatcaaaata tccatttttc acatttgttt tggcacctga ttttgattta aatcaaagtc 9060 aattacctca aaatggaaat ttacgattgg atataaagtt ttctaaagct gttgaagaac 9120 cagtacatgt aataatatat ggcgtatttg aaaacgaaat tcaaattact gcaaatcgaa 9180 ctgtactagt ataaatatag taactctatg taatcagatt atcatgaagt aagattttaa 9240 ttaataatta attttattta attagttcta ttcaggttga taaaattgga aaacagttgt 9300 tggggaatga gtttgttggt gtatatcctt tagataaaat tccatatctt gatgataaca 9360 atgcaatagt ggttaatact caatcatcta atcttggcgg tgaacactgg attgctgtat 9420 atgtgaaacc aaataaaatc catgttttcg atccaatggg attctattat ccagctttat 9480 tagttagtaa attacaaaca ataggaaagt caattgtata caataaacat cgatatcaaa 9540 atcctttaac aacaacttgt ggtcaacatt gtcttgtttg gctaaactca atgtataaac 9600 gtattgattt taaataaatg ttttataaaa tgtctagtta ttttattgat attagtgttt 9660 ttgaaaatga atttggtgga ttagttataa aagaattatg tatgatcaat actaataata 9720 ttttaagacc attacattgg gtgttttcgg caaacgaata tcatgaagat caaatactta 9780 acggtcagaa ttttttctgt ccaacgtgta tattaagtac tatcggttct gatcgaaaaa 9840 cagctatttt ctatgtgaca gatgaaacaa aagaaaagat tttaaaaata tatttcccat 9900 ctttgagaat tgtaaagtat tcattaaaag aatatcagag ttttccaaga aacatatctt 9960 gtccatttta tgaacacgat gatgattgtg cttataaaaa ctgtttactt ggtgctttag 10020 attatctgaa atattaaaaa tatatgtgtg caatttaccg tgttttgatt ggttaattta 10080 ttgtagtagt gataacgtaa ttattcatat ataaggatat atttttcatt ttatggtcac 10140 tttttatata tacccaactt cgaacaataa tcaatatggt aaaactttta attttaattt 10200 taattttatt attattttat ttagaacagt aatagtaata ataattcatc taatatgaat 10260 aattccaaac aaataccaaa tacaaatcat atggtaaatt ttaaatttta tttgaaaatt 10320 attaagttta tttagaattc taataacttt cctaatcaaa tgtattggca gaatatgaat 10380 gctaactttg taagatttaa atttaaatta aatgtttatt aattatttat ttagaatatg 10440 ccatatacta acatgccttg gtattatcca aatttccaat ttgccaatcc aaactttcaa 10500 aacatgcaaa atgttccaca atttcaaaac atgcaaaatt ttaacaacaa tcttcaaaat 10560 tcttcatcaa atatgcccca tattaataca aaattaaatg ggaagaaaat taaatgtaag 10620 tattctttga atttttaaat tgtttttaag taatatttta aataaagaat taaattttta 10680 tattttaaat aatattactt ttaaattcta gttataatat tgatttaaaa ttaggtatat 10740 aaattagtga ttaaaaatct ttaaaaataa atagtttaga ttgataattt gattgtaaac 10800 tgtttatttt tataaatatt taattacata atttatataa ccttttacat aaggttgaaa 10860 atagtataac cttttcatat taaagggttc gaatttggta atttattaac cttttaaata 10920 acttttaatt atttaataat tacgccataa ttttcaaaaa ttattaattt tcacttataa 10980 ttaattttta tataaatagg caatttatac cctcagtact gcaattcatc catgaatcat 11040 agtgatgaat gccatgacct aatgaagaaa tattatcgtc atcagttcaa tggcattgaa 11100 atatttatat gtcccgcgtg caaaacttac catgcccaac cctatcttga tttctttttt 11160 gatcataagg ttcacttgac acagtttaaa cagaataaac tcgatttctt tcgtttgatt 11220 tttaaggctg acgattttaa aattaagaaa agaaaatctg atattggttt gaatttaggg 11280 gcgcctaaaa tatctaaaaa tgaaataatt gtaatcagtg atgatgagga aaacatccaa 11340 ataggtggtt ctgtcgacat tagtaatgac ccaattgtag aattagattt aagaggatct 11400 cgtaaattac taaaatttaa tattgatgaa tttaatgtta atcaaataac attaacagat 11460 agaattataa cccggctaaa aattgaaatt aataaaatgg ggaacgttaa aattcaattt 11520 gctgtaaata tccaatttga gaaagatgat ataactaaag atgaatttat atctaataac 11580 gccgaggtct tctcagataa ttttcttgtt gaaggagtta gaaaattaaa tgaaaaaata 11640 gaaagattta ctcaattagg tagtggatgg actatttcta gaattcgaga aattcatttt 11700 atcttgacta aatattccca aataaatagg ttaactggtc acggttttat tgaaactcct 11760 attgcgctta aaggaaaaag agcaattatt aatattaaga atagtgacaa tctttgtttt 11820 atatatagta tcttggcttt attgaagcat gacgttcttg taaataatcg tagtaatatg 11880 tatagttata ctaacttctt gaatgaatta aattatgatg agatagatga ctgtcctatg 11940 agattgtgca atataagtag gtttgaggaa aagaatccag gtcttgcaat taatgtcttg 12000 tcttataacc caactgttat taataatgat gaaaatgaaa atgacgattt tgttcctcat 12060 caccctcatt tggatattat tcatcgcact aaagtgactg acgttgatcc tttgtatctt 12120 ttacttctcg aaaatgatga ccaatatcat tatactgctg taattgactt gcaacgctta 12180 atgaattcac atattaatga tatagttagt gtacgaatac aatgtcaatg gtgtcctaga 12240 tgtcttaatg gttttcgttt gcaggatgct tttaacagtc attatgcctt atgtaccaaa 12300 aatcaggtag gaacaacttt atacactctt cctcaaaata aatatttaga gtttggtgat 12360 tactcaaaaa cagtaacccc accatttgta atttatgcag actttgagtc aattttaccc 12420 attgacataa gatttcatca gaaacatcta ccaatatccg caggattatt attaattaat 12480 aattttaata atactaaatc gtattttagt ttcattggtt tagattgtgt cttggaattt 12540 ttgaaaaaga ttgaggaaat tgcttccaca attgtcctac catatttcaa taattatgcc 12600 aactctacaa tgaacatttt atcttttgct caggatagag actttaaaac ttgttcaaat 12660 tgttatttat gtaagaaacc gataagaaat cgtgtaagag atcacgatca cttttcaggg 12720 aaatatcttg gtgctgcctg tcggtcttgt aatatatcta gacaaattcg tagacattta 12780 cctgttattt ttcataatct taggggctat gaccttcatc acattttgaa gtatggtctt 12840 agtagttttc cttcttgggg tttgaatatc attccaacta caacagaaaa gtttatatcg 12900 ttaatagcgc atatcgataa agtatcagtt aggttcattg atagttatca gtttgttagc 12960 gcatcattag acaatgcagt taaaactctt tctaactttc ctttgacaga ttcagtattt 13020 actggttcaa taatgggttg taaaggtatc tttccttata attttgcaaa gagtttggaa 13080 atcttggaaa caacttttga attaccacca atttggcctg aagtgacaga atcacaatat 13140 ttgagagctc aaaagatttg gactgagaat aattgtaata ccttgttaga ttatatgatg 13200 atatacttga aattagatgt atttcttctt gctgactttt ttcaacaatt tcgttcaaag 13260 agtattgcac ataacggttt agatccctta aatttttacg gtatacctgg aatgagttgg 13320 gcttctgctt tgatgacatt gaaagaaccc attgaactcc ttacagatat ggaaatgtat 13380 aatttctatg aaggagggat aagaggtgga ataacatttg tgaataaaca ttatgttaaa 13440 acttctgatg atacagaatt gttgtatatt gatatcaaca atctgtatgg ttgggctctt 13500 agccaaagtc ttcctttcgg tgattttagt tgggtaatta ataatcttga ttctattttg 13560 gatgagtgta tgagtacaga tattgaaatg ctttcatatg gttacacgat ggaagttgat 13620 ctcgaaattc ctgaaagtat acatgagttt ttggataaat tccctgtggc accagaaaag 13680 atgtgtccac cggattctaa agttgaaaaa ttaatgctga cccattatcc caagaaaaac 13740 tatgttattc attggagact tctaaagatg tttgtttctc ttggtgttaa ggttgttaag 13800 attcatagag ctataaggtt taaacaggca aagattttta aagagtatat tgatacaaac 13860 acttgtttga gagcagatgc tacaaataat cttgataaga actatttcaa acttttaaat 13920 aatagtcttt atggaaaatc tgtggaaaat cttaaaaaac gaatgaattt aagattatgt 13980 aattcatttg aaaagatggt tacttattca tcaaagccgt ctttcagaaa gtgtataaag 14040 attgacgatg atctaatagc agctcatcta aataaagaat taatttgttt agaccgacct 14100 agttatatcg gtcaaacggt tcttgattta tcgaaattac ggatgtatca gctccagtat 14160 cttgaattag agaaatatag gaatcttttc aattgtagga ttgacattgt tgctggcgat 14220 actgactctt tctttcttga aattaaaaac tgtaaattaa gtactctttt acctgcaatg 14280 gtttcggata atcttttgga cacttcgaat tataatatta gtcatccttt gtattcaaga 14340 aaacttgact cagtaattgg taaattcaaa gatgagagta agggacttgg atataaggaa 14400 tgggttttct tacgaccaaa atgctatagt cttttaggtg aaactgaaag taataaagca 14460 aagggaatta ctttggaagg aactgaaata aagcatcttt cttatcttga ttgttataga 14520 aacgatataa ttttttctgt tcctcaaacg agaataggga cgagaaatca tcagttgttt 14580 acttttaaaa atagtaaaat tgctttaagt aacaacgatg ataaacgtca gtgggtaggt 14640 aaaaacaaaa gttttgcatt cgggcattat ttaggtatca atgccaatga agatattgaa 14700 gcttttattt aaataaatgt tgtcttgtaa atgtgtttgt ctttttattc tattgttttg 14760 taattattat taaatgagac ttgtaacagt ctggtgtcgg taataattcg ttgagtcgtg 14820 ggttcaactc ctatagtggg cgaattattt tttattttaa aaattaaatt ttttaataga 14880 aaataaatat atatatttta tttgctgtaa atttaaaatt ttatagaaat tggtaaagta 14940 aatctggcat aatttaattt tttatattta gttgaaattt taaacttgtt gaattatttt 15000 cttaaaatat tgaattttgt aaatccagtg ataaaattta agttttgagt gacttgattt 15060 taattaaata aactatcaat atactatagt gcttttaaac ttaaattttc atgaaaatta 15120 ttaaagtttt atgattattt tgataaaatt gttatgtaat atactactaa caaaatagtc 15180 agtaagctaa ataatttaga attgaacatt tatttttaag acgagtttga tcttaattta 15240 ttatgtttaa tgattatttt ataatttttt atgtaataat ataattaaca attgaaaatt 15300 tcatgtaaat ataatttttt aatatatcac atgcaagtaa catgaatcaa tgaggtaaac 15360 gaaactcata cgtttatatg aagtaaaaag tataataata ataccaatag tactagtgag 15420 tgattttgct aagtctacga cttagtcgta gacgtagttg taggcgtgtt gggcgatgat 15480 gggtgctgat gggtacggat gggtactgat gggtacggat gggtactgtt cggtgtgtat 15540 cggtgcggat gggtgtggat gggtgtggaa cgttggttgg tatgtgtgtg tggaaggttc 15600 tagaaggtag gtgtgtggaa ggttctagaa ggtaggtgtg tggaaggttc tagaaggtag 15660 gtgtgttggt ccacaactac gtgacataat aacagttagg taactagtct ccgagtagtg 15720 accagtgaca taataagtgt gaccagtgac ataataagtg tgtactgacc agtggtcaag 15780 tgatgtaatc atgaggtcac tgaccaaagg tcaattacta ggtagcaact agtggacaaa 15840 ggtcaatctc caggtagcaa cagcgcgtgc agaggtcaaa tgttacgtaa taattacgta 15900 agccaatggg ggggttgtat ctgacgggct cacatcctac tact 15944 // ID BACSAT repbase; DNA; INV; 316 BP. XX AC . XX DT 02-AUG-1999 (Rel. 4.07, Created) DT 02-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Bacillus satellite DNA (a consensus). XX KW SAT; Satellite; Simple Repeat; BACSAT. XX OS Bacillus lynceorum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Phasmatodea; Verophasmatodea; Areolatae; OC Bacilloidea; Bacillidae; Bacillinae; Bacillini; Bacillus. XX RN [1] RA Mantovani B.; RT "Satellite sequence turnover in parthenogenetic systems: the RT apomictic triploid hybrid Bacillus lynceorum."; RL Mol. Biol. Evol 15(10), 1288-1297 (1998). XX RN [2] RP 1-316 RA Jurka J.; RT "BACSAT."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX SQ Sequence 316 BP; 84 A; 39 C; 41 G; 152 T; 0 other; agatctgtat ttgttgattt atttcatttc catttgtatc gattcagctt attttggtta 60 ttttacatta atatgatcaa gtagattcca tttccactca tttcacttat atccttttat 120 ttatagtgtt ataattcatg catttaaaca ggttaattcg gttctttaaa taacacttat 180 ttcattataa atgggttatt ttgatgcatt atattgattt ttaacttcat tttatttgtt 240 ttaacatatt tgggtagaaa tcatttcatt tctattgtaa caagtgtttt tgtttggcga 300 atcaggtttc cgaatg 316 // ID MuDR7x_AP repbase; DNA; INV; 2285 BP. XX AC Contig26943; XX DT 25-JUN-2009 (Rel. 14.07, Created) DT 25-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR7x_AP. XX NM MuDR7x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2285 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1356-1356 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 548..1807 FT /product="MuDR7x_AP_1p" FT /translation="MQKYIQKKNFDNEVLYTENEHHHEAVNENTINRQIVN FT TSCKRKAIEDVSTRPKKIILQELVHNECAKEITVNDINRIRKNMYEKRRKT FT LPANPKSISEVHQALNVLNIETKQKENFLLLNNEKENIIIFSCSTNLKFLT FT SVDTVYMDGTFDYCTRFFMQMFTIHGFQNGHYVPLIFCLLPNKLTSSYIFA FT LRSICEKCKDMNLIFLPANVTIDYEKAIINAATEVWPESNIIGCRFHLTNS FT WWKKIQSLGLSMEYKEKNSEIGKWLRHTFGLLFLDAQEVSDCFSYDFMSDR FT PVNELLTKYSDYLVDNYIGDDCSYPPSLWASLSASLTRTTNACESFHAVFK FT DHFYKRTPHIIPWITVLIENIQTNVYVTLKSINECKTPRKNMNDKRKRNEE FT LISKYQRDEISRYEFVKLISNHYAK*" XX SQ Sequence 2285 BP; 838 A; 323 C; 353 G; 771 T; 0 other; gggtgtcacc ggtttccgcc aaaaattacc tttccgtttt ccgccaatac ctttttccgc 60 caaatcagaa aaaacgtcat tcctttttcc gccaattata aattatagaa tttcaaagtt 120 tgggaaccct tacgtatcat aatactcata acgataaaca tatggtttat tttgagatat 180 aatagtctat gttatggtat ataatatctt atatggtggg ttttaaaatt ttatacctgt 240 atgcgtctct atctacgcag aaaatattat ttcatagatt tagataatat tatctaatct 300 gtcatctgat atcatgctgc aatagttggt tcgatttgtt agaaatatta cggtattgag 360 tagtgagtag tgactagtga tcaccgatca gtggataggg tttgatgtgt tttaactaca 420 ataatgtctg cactattaat tacgagtaaa cgcgaaaaac cacaaattgt gattgataat 480 tttaaattta gtaaggcata tgaaggagta aataaaattc gttggagatg cattaacaaa 540 gtatgtgatg caaaagtata tacaaaaaaa aaatttcgac aatgaagtat tgtacacgga 600 aaatgaacac catcacgaag ctgttaatga aaataccata aatcgacaaa ttgttaatac 660 ctcgtgcaaa aggaaagcga tagaagacgt aagtacaaga ccaaaaaaaa tcattttaca 720 agaattagta cataatgagt gtgccaaaga aataacagta aacgacatta atagaataag 780 aaaaaatatg tatgaaaaga gacgcaaaac cctcccagca aatccaaaat ccatttctga 840 ggtacaccaa gctttaaatg ttttaaatat tgaaacaaaa caaaaagaaa atttcctact 900 attaaataat gaaaaagaaa atattattat tttttcgtgt tccacaaatt taaaattttt 960 gacttcggta gatactgttt atatggatgg gacatttgat tattgtactc ggttttttat 1020 gcaaatgttt actatccatg gatttcaaaa cggccactat gttccactca tattttgctt 1080 attaccaaat aaactaacat cttcctatat ttttgcgctt cgttctattt gtgaaaaatg 1140 caaagatatg aatttgattt ttttacctgc aaatgtgact attgattacg aaaaagctat 1200 tataaatgca gctactgaag tttggccaga aagtaatata attggttgta gatttcattt 1260 gactaattct tggtggaaga aaatacaatc tcttggttta tcaatggaat ataaagaaaa 1320 aaattccgaa atcggcaaat ggttacgaca cacatttggt cttttgtttc ttgatgcgca 1380 ggaagtatca gattgttttt catatgattt tatgagtgat cgtcctgtta atgaactatt 1440 aacaaaatac tctgactact tagttgataa ttacatagga gacgattgtt cttatcctcc 1500 atcactgtgg gcttcgttat cggcaagttt aactaggacg acgaacgcgt gtgagtcatt 1560 tcacgctgta tttaaagatc atttttataa acgaacacca catataatac catggattac 1620 tgtacttata gaaaatattc aaactaatgt ttatgttact ttaaagagta ttaatgaatg 1680 caaaacacca agaaaaaata tgaatgataa acgtaaacga aatgaagaat taatttcaaa 1740 ataccaacga gatgaaatca gtagatatga attcgtaaaa ctgatatcaa atcattatgc 1800 gaaataaggt tcgaataaaa tatttaaaat attatatact tatattatat atatttaatt 1860 ttatttcaac tatgtttcag ttttttacaa cgtgatctca atttcaagtt tcgaccaatt 1920 aatcaagcgc tgacatactg acatattgtc atcaagacaa tgcaaatata aatatcattt 1980 ttttaatact tatatctatt atacctacct aataagtgtt atgataaaat tatatttttt 2040 gacttatttt gtcggtacac taataggtat cattacttat atataataag taataactgg 2100 aaaaaatatt attatttttt aactctatag tgtgcaatta ttaaatcttt acgtaagggt 2160 tcctaaactt taaaattcta ttatttataa ttggcggaaa aaggaatgac gttttttcta 2220 atttggcgga aaaaggtatt ggcggaaaac ggaaaggtaa tttttggcgg aaaccggtga 2280 caccc 2285 // ID Copia-7_CQ-LTR repbase; DNA; INV; 130 BP. XX AC AAWU01003031; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_CQ_; KW Copia-7_CQ-I; Copia-7_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-130 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 330-330 (2011). XX DR Genome; AAWU01003031; Positions 8342 8213. XX SQ Sequence 130 BP; 33 A; 27 C; 25 G; 45 T; 0 other; tgttggagtg cagcaagcct tgagtttggg acagatcgta gttaacctta tttgttaaat 60 cagtttgaat aaacattctc tatctgaact tcttagaaac cagtcgcctg atttctttcg 120 ctctccgaca 130 // ID Gypsy7-LTR_Dpse repbase; DNA; INV; 551 BP. XX AC Unknown_singleton_20; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7_Dpse; KW Gypsy7-I_Dpse; Gypsy7-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-551 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1066-1066 (2009). XX DR Genome; Unknown_singleton_20; Positions 10489 9939. XX SQ Sequence 551 BP; 165 A; 109 C; 157 G; 120 T; 0 other; tgagttggaa gatttgcagg gccgaatcat tgggaaatac cacgggaaag atatccgcca 60 ataacagcta caagcggttc atccttgtga gccttctgca ccccaagtct gatcttggca 120 ggggggagtt gtacggtgag gcttgtagct gctgtgagga ggcctagggt ttttggggtc 180 cggcgaaaat taggcaccgt ctaacgagag gcaagacaaa ctctctttaa cagtcggcaa 240 agagggcctg ggatcaaaga gtgggtcaga gccaggtcag acttcctcgt cgaaaggaga 300 gcgagttcga agtcgaagtc ggacaagata aaaggctata aatagatggc attggacgcc 360 attgcgggat tccaagcctg gtcgacagca tttgattgtg ttcaccaagt gtcctctata 420 gtggcagccg taattgaata ttcaaagaaa gaaaaaagag aaagaccata taaatacaaa 480 ctagtgaaaa gtagccagga agtgagacgg tgattagtga aacctcttcg atacaccctt 540 tcgacgggtc a 551 // ID Gypsy-197_AA-I repbase; DNA; INV; 5723 BP. XX AC supercont1.83; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-197_AA_; KW Gypsy-197_AA-LTR; Gypsy-197_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5723 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.83; Positions 2900896 2895174. XX CC Positions [3669-4130] - Integrase core CC 'AACGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 434..2251 FT /product="Gypsy-197_AA-I_1p" FT /translation="MILCSKRAAVTRNNCLFRMDKNNAFSNFPTFTYGVVP FT LAERRAKWFSWKRGFEICLRASKIIDADEKKDLLLAKGGFELQDIFFNIPG FT ADVVTDKEKNIDAYSVAIEKLDGYFAPQRHEAHERYIFWAMKPEPDESLEK FT FLMRAQMHASKCNFGKSSSESSGIAVIDKLLQLVPSHLREKLLLEVDLTVE FT KVIQQVSAFETTRIASEQISGQSILQQPSKSSEIVSRIPSTCKFCGRSHAS FT DGSCPAWNKTCSNCGKRGHFRAVCFSRVAVPSSSRSVDPLHGARFSKRSFG FT QAFQSNVGTSAKNVNNPQKRRPGRLHAIEDDENDEEIPELVEMVSSASDSE FT ELLCVKVGGVLIEMQIDSGVQSNIIDDQTWTSMQRSGVKTIGSIRCSDRKF FT KAYAQTDCLKVPAMFDAEVSITDGNKQLRVDAKFYVVKGGPQPLLGKVTAK FT QLGVLVVGLPSQQEALHQVDIVRPFPSVQGLKIHIPIDKSVEPVAQRLRRL FT PFATLDRVEEKLKELESRNIFEKVSEPSSWVSPLVVVVKDSGDIRLCIDMR FT QVNKAIMRETHPLPTIEDIRWKLNGAHVFSRLDIKDAFHQLELDDESKPLT FT TFITHKGM" FT CDS join(2373..3359,3363..4517) FT /product="Gypsy-197_AA-I_2p" FT /translation="MFQKVLEQILSDCPNTINFIDDIIVAGKTEREHDEAL FT EKVLQKLKQCGILLNQAKCAFKLNEVVFVGHRFNKNGMTASPDKIDAIRSF FT RSPTTGEEVRSFLGLVNYVGSFIPNLATISFPLRELTKHNVPFRWTEEEQN FT AFDFIIDQMTCVGNLAHFDPKLKTRVVADASPVGLGAILLQYFEGKPRVVL FT YASKSLTDTEKRYAQTEKEALALVWAVERFQIYLLGIRFELETDHKPLEAI FT FSPSSTPCLRIERWVLRLQAFSYNIVYRKGKSNIADPLSRLSQPTDKEIFD FT ADSDVYIRSVTEMAAVDITELEEISKEDPELSALRECDQGVWNHTVNIIKP FT YHAFRNELGKVGFMVVRGDRLIVPFGLRQRMLQLGHEGHPGKTKMQQRLRN FT TCWWPGMDDAIARTVDGCEGCRLVSLPERPEPMQRRQLPEAPWLDVAIDFL FT GPLPSGDYLLVIIDYFSRYKEVEILRKITAQETAERLERIFVRLGYPRTIT FT LNNGRQFVSREFEEFCNRRGIILNRTTPYWPQENGLVERQNRSLMKRLKIS FT QALNRDWKKDLNDYLTMYYSTPHCTTSKTPSKLMSGRNIRTKLPSLRDLSS FT SVPSTEYRDKDQQAKEKSRVAEDQRRRAKTSDLTVGDKVLMKNVLPGNKLT FT PTFGSSVMTVTARDGSRVTVQNDETCKSYTEAKISHDFHKITLMNAFFIIF FT LLVHKTLAYFIRRD" XX SQ Sequence 5723 BP; 1789 A; 1021 C; 1311 G; 1602 T; 0 other; aattggcgac gaggatggga ttattttatt tgttgtaatt gtgctaacat ttgtacgtgt 60 tctctgtgtc gaatttacgg ggtaagtagg tttttctttc tgaaataaag ctggaaatgt 120 tctaaatgag tgatttttgt agcgtctacc gctacaaagc acgaagaaga tacggccaaa 180 gagagccttt gttgaaaacg aaagaaaggg ttgtgcgtga gccctagatt tgcgacaagg 240 agcagggatt tgcgacacgg agcagggatt tgcgacttga agcagggatt tgcgacttgg 300 agcaggaaga gctaccgcgg caagaaagcg gttggtacgc gcggatcgat tctgttaaat 360 tcagaagaga aaaaagtagg tcggaggcac caacgtgtag ggtgaccaga ttcggataag 420 acaggtaaaa aaaatgattt tgtgttcgaa aagagctgct gtgacgagaa ataattgtct 480 gtttaggatg gacaagaaca acgcattttc gaattttccc acgttcacgt acggtgtggt 540 gccgttagct gaacgtcgag cgaaatggtt cagctggaaa cgcggttttg aaatttgctt 600 gagggcctcg aagattatag atgctgacga gaaaaaagat ctgttgttgg caaaaggtgg 660 tttcgaactg caagatatat tttttaacat ccccggagcg gatgttgtca ctgataaaga 720 gaagaacatc gatgcatatt cagtagctat cgaaaagctc gacggttact ttgcgccgca 780 gcggcacgaa gcacatgagc gctatatttt ctgggccatg aagccagaac ctgatgagag 840 tttggaaaag ttcttgatgc gtgctcaaat gcatgctagt aaatgtaatt ttggaaaatc 900 gtcgtcagaa agttctggta tagctgtgat cgataagttg cttcaacttg tgcctagcca 960 tttgagagaa aaacttttgt tggaggtaga tttaactgtg gagaaggtta ttcaacaagt 1020 aagcgcattt gaaactactc gaattgctag tgagcaaatc agcggacaga gcatattgca 1080 acagccaagt aagagctctg aaattgtatc ccgcattcct tcaacgtgca agttttgtgg 1140 acgttcgcat gcttccgatg gttcatgtcc tgcatggaat aaaacatgtt ctaattgcgg 1200 gaagcgtggg cattttagag cagtttgctt tagcagagtt gctgttccct caagcagtcg 1260 aagcgtagat ccacttcatg gtgccagatt ctcaaaacgg agttttgggc aggcttttca 1320 gtcgaatgtt ggtacttcag caaagaatgt taataacccg cagaagcgcc gaccaggtcg 1380 tcttcatgca atcgaagatg atgaaaacga tgaagagatt cctgagttgg tggaaatggt 1440 gtcatcggcc tctgattctg aagagctttt gtgcgtcaag gttggtggag tattaataga 1500 gatgcaaatt gattctgggg tgcaatccaa cataatcgat gatcaaactt ggacttcaat 1560 gcagcgcagc ggtgtgaaga cgatcggatc tattcggtgt tcagacagga aatttaaggc 1620 ttatgcgcag acagattgtc taaaagtacc ggccatgttt gatgctgaag tatctattac 1680 tgacgggaat aaacaactac gtgtagatgc aaagttttat gttgtaaaag gaggaccgca 1740 accgttattg ggaaaagtta cagcgaagca gctaggagta cttgtggtgg gattaccaag 1800 tcaacaggaa gcgctgcatc aggtggacat cgttagacca tttccaagcg ttcaaggttt 1860 gaagattcat attccaatcg ataaatcggt agaacctgtt gctcaacgat tgagacggtt 1920 gccttttgca acactcgacc gggttgagga gaagctaaag gaattggaat ctaggaatat 1980 attcgagaaa gtttccgaac cgagctcgtg ggtttctcca ttggtggtcg ttgttaagga 2040 tagcggagac atcagattgt gcatcgacat gcggcaggtt aataaagcga taatgcgaga 2100 aactcatccg cttccaacaa ttgaagatat tagatggaaa ctgaatggag ctcatgtgtt 2160 ttctcgactg gatatcaaag atgcttttca ccaactggag ctcgacgatg aaagtaaacc 2220 tctcacaacg ttcatcactc ataaaggtat gtaattagaa aaaacttgta ttaaaagaaa 2280 gcctgtatta aatattgtag tatgattatg tttatattta agggcttttt cgttataaac 2340 ggcttctttt tggagtgtct tgtgcaccag aaatgttcca aaaggtgttg gagcaaattc 2400 tgtcggattg tccaaacaca attaatttca tcgatgacat tattgtagca gggaaaacag 2460 agcgcgaaca tgatgaagca ttggaaaaag ttttacagaa actaaaacag tgtggaattt 2520 tgttaaatca agccaaatgc gccttcaagc tgaatgaagt tgtctttgtg ggtcaccgtt 2580 ttaataagaa tggaatgact gcttctcctg acaaaatcga tgctattaga agtttcagat 2640 caccaaccac cggtgaggaa gtgcgaagct tcttaggttt ggtaaattac gttggatcct 2700 ttattccaaa tcttgccaca atatcgtttc ctttacgcga acttacgaag cataatgttc 2760 cattccgttg gactgaagaa gagcagaatg cgtttgattt catcatagat caaatgactt 2820 gtgttggaaa tttggcgcat tttgacccta aactgaagac ccgtgtagta gctgatgctt 2880 ctccagttgg actaggagct atacttctgc aatatttcga agggaaacca agagttgttt 2940 tatatgcaag taaaagttta acggataccg aaaaacgtta tgcacaaact gagaaagaag 3000 cccttgcctt agtttgggca gtagaaagat tccagatata tctgttaggt attcgttttg 3060 agctggaaac ggatcataag ccactggaag cgatattttc accaagttcc acgccttgtc 3120 tacgtattga gcgttgggtt ttgcggttgc aagcgttctc ctacaacata gtctacagaa 3180 aagggaagtc aaacatagca gatccattgt ctagactatc gcaaccaact gataaggaaa 3240 tatttgatgc tgattccgat gtgtatatca gaagtgtgac agaaatggcg gcagtcgata 3300 taactgaact agaagaaatc tcaaaggaag atcctgaatt atcagcttta agagaatgtt 3360 aggaccaggg tgtttggaat cacacggtaa atatcattaa accgtaccat gcattccgaa 3420 atgagttagg aaaagttgga ttcatggtgg taagaggtga taggttgatt gttccttttg 3480 gtttgaggca acgaatgtta caattaggac atgaaggtca tccaggtaaa actaagatgc 3540 aacaacgact gagaaatact tgttggtggc caggaatgga tgatgctata gctcgcacag 3600 tagatggttg tgagggatgt cgtttagtga gtctaccgga acgtccagaa cctatgcaaa 3660 gaagacagct acccgaggct ccatggttag atgtagctat tgatttttta ggacctcttc 3720 cgtctggaga ttatcttttg gtaatcattg attatttcag tagatacaag gaggtagaga 3780 ttctgaggaa gataacagca caagaaacag ccgagcgatt agaaaggatt tttgtgagac 3840 ttggttaccc acggaccatt actttaaaca atgggcgaca atttgttagt cgtgaatttg 3900 aggaattttg caatcgtcga ggaattattc tgaaccgaac aacaccttat tggcctcagg 3960 aaaatggact ggtagaacga caaaaccggt cattgatgaa aagactgaaa attagccaag 4020 cccttaaccg cgattggaaa aaagacctta atgattactt aactatgtat tattccactc 4080 cacattgtac aacgtccaaa actccatcaa aactaatgag tggcaggaat atccgcacta 4140 aacttccttc attacgggac ttgtcatcat ctgtaccatc aacggaatat agggataagg 4200 accagcaagc gaaagagaaa agcagagtag cggaggatca acgtagacga gcaaaaacat 4260 cagatttaac agtcggagac aaagttctta tgaaaaacgt gctacctggt aacaagctta 4320 cacctacttt tggatcaagc gtaatgacgg tcacagcacg ggatggttca cgcgtaactg 4380 tgcaaaatga tgagacctgt aaaagctaca ctgaggcaaa aatctcgcat gattttcata 4440 agatcacact aatgaacgcg ttttttatta tttttttgtt ggttcataag acacttgcgt 4500 atttcattcg acgagattga gattcataag acaagtgtaa ttttttcata acaatcaatt 4560 gtcaatgtcg tacgaacgct tatgaacccc attgctcatc attgtgaaga accacgtgag 4620 gcaaattttc tcggtacagc catttcacta tcataaattc aatatggctt ccgaaatgga 4680 cgtgtttgct gatttgtcgc tggctgactc cgtttttaat gcatcaaagg gtaaacattt 4740 tctatttatc atcaattccg ctgtaactaa cactttcttt caatcagatt aaggaattgg 4800 atccaaagaa taagattttt ggatgaaaat tcaaaacaat ttgtatgacg cggaagaaat 4860 cacaaacccc cctcatccat taatccgccc ctaacgtaaa cctgcaccaa ttcaggcgtt 4920 cttcacaact tttctcaata atgtttcgct gctataatat acttcttcgc gttcaaaata 4980 ggttagccaa ctagagaaaa tagaattttc atccatttta tcgtatcgca aatcgattct 5040 aatgcttaca gtaaagtgct ttaatcgaat tgcgacatga caaaatggat gaaaatttga 5100 ttttcccaag ttgtttaacc tatttttcaa cgcggattag tattattcat tcattgaaat 5160 ttataatcta aagctcaaat tacacaagac agtcttatgg aaccaagaat ggatcagaac 5220 gtaattcata agactatcta tggattttgt tatcaagtca atgctggttc ataagatcat 5280 cttacgaaat tccttcgagg tgtattatgg aaacaacatc tgccgaaaac catacgatgg 5340 tcttatgaat ttcaaagatt tttcttgtgt cgtaattcca taagcaaatc ttatgacagt 5400 cttacgattg ctttcctcag tgtacgatag aaatgtgagt catctaaaga agatagtggg 5460 cccatcatta gaagaaaaag aggaagatca atctcaacag ccagcgcaga gagtggagga 5520 aaatcaggaa ctcgagcaag ttttagaacc aacaatagga gaaccaggaa cgatggacaa 5580 tgtggttcaa caggatacga gcaatgtaac actgttaccc cggaacagac cccatcgaaa 5640 cattagaaaa cctcgttatt ttgatgattg tgagctagat tattgatctt tcttaaactt 5700 aagcttctat aaagaaaggg aga 5723 // ID Gypsy5-I_Dpse repbase; DNA; INV; 6846 BP. XX AC Unknown_singleton_95; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dpse; KW Gypsy5-LTR_Dpse; Gypsy5-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-6846 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1051-1051 (2009). XX DR Genome; Unknown_singleton_95; Positions 34162 41007. XX CC Positions [4311-4814] - Reverse transcriptase CC Positions [5913-6398] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1987..3594 FT /product="Gypsy5-I_Dpse_1p" FT /translation="MVVSRSTENLALSEDTGKHCQICSEVIVYQVQLLSTS FT CGHTFHRVCFTAKSGNKRKCPTCQQDLNITSASAENLSPIGPSTSRARMPS FT VTQSLQTRSQSKRVEALSPLEMANSENANRSPISVDPQQNQATMAETIQQS FT IVDAIRVAMVEQTRILATSMADQTNRLVRAISDGLQSTASVPSPNINAARM FT QPVAELEQQTIEQLFRLSTSEQVPVPGINQSGNSSSPSNLGQTRNVGSGVS FT TGSSASSLRADRVSQIMSNWKLRFTGKAGMSIEDFIYRIEALTKQTLDGNF FT EITARYASNLFDGSASEWFWRYHKSVPEVRWADLCLALRGQFKDDRTDRNI FT RTEIELRKQRPNESFDDFYGVIAALADKLSEPMSEAALVETLRANLLADIQ FT HEILYESAATVKSLRHLVRTREIFMQSVAKTKPPLAAPRTLPRRQVNAILD FT ESIAPDSEEVGEEEDLEVSAVEFVCWNCGGKGHRYQECMAERRVFCYGCGR FT ADTYKPSCRKCNPHTAKNSLSRAPPKSARKQLVNQATNTE" FT CDS 3792..6767 FT /product="Gypsy5-I_Dpse_2p" FT /translation="MLFGKPVIGLMDTGASVSCIGGSCASDVIRNKVQYKP FT LSLKVRTADGKSQSIVGKISAKVTYRGESKDPSLTGNLYLGIDFWTAFELL FT PLHLMTQNLRIETLEDPTMRALLPSQQVLLQEVIALFPSYSKQGLGKTSVI FT THTIEVADAKPIKQRHYAVSPATEKLMYEELDRMLKLGVIQESVSSWSSPV FT VLVKKPGKVRLCIDCRKVNSVTVKDAYPMPLIDGILSRLPKAEFITSIDLK FT DAFWQIPLDQKSRDKTAFTVPGRPLYEFVVMPFGLTNAPCTLSRLMDKVIP FT AHLRNEIFIYLDDLLVISDNFEHHLEVLSLVASQLSKANLTINVEKSKFCR FT KEVKYLGHVIGGGNIRTDPDKIATVKDFPAPQSIKQVRRFLGMTGWYHKFI FT HNYAAISSPITDTLKSKRKFIWTDEAQLAFESLKSQMCSAPVLHSPNFSIP FT FSIHCDASHTGVGAVLMQENNEGADVPIAFMSKKLNKCQRQYTVTEKECYA FT AVLAVKKFRAYIEGQEFTIITDHASLKWLMSQADLSSRLARWSLKLQGFEF FT KILHRKGSDNIVPDAVSRTHSEDLSFLAIDSMLVDPDSPEFKSPEYQSIKE FT KIRLNSSRLPDLKIKDERIFRRAEHADGTKITDDSCWKLWIPPSLVKPVLK FT AAHEDPLCAHSGINKTLERLRRSYYWPNLAVDVKELINNCHICKTTKHPNQ FT PLKPPMGNTGLPSRFFQKLFVDFLGPYPRSKSGNIGIFVVLDHLSKFPFLK FT PVRKFTADSIKRFLEEDLFHCFGVPETIVSDNGVQFKSHTFNSLLDKYNIS FT HVYTAFYSPQANASERVNRSVLAAIKSYVDPDQRNWDEKLSSICCALRSSH FT HLAMSTTPYRLAFGQHMVTDASSYKLMRNLDMLEDRGVQFSREDSFDIIRS FT KAKGSIQAQHFRNARSYNLRTRQISYKDGQEVYRRNFQQSNFAKGFSAKLA FT PTFIKSRVRRKLGNCYYELEDLQGRLVGKFHTKDIKP" XX SQ Sequence 6846 BP; 1965 A; 1606 C; 1457 G; 1818 T; 0 other; agataatctg gtgctagccg tggtcatgcg tgtgcatgtg tgtgtgctct ctcttctctt 60 caaaacttac aggtattccc ctcgccgaaa gatcttttgg acgtggccgg aacatgaacg 120 tacgtaagtg aacgaactcc tagttaacgt ttataaactt gtatcataaa gtaacaaaag 180 aaaaaaacca tacatgagcc cggtcgcccg tattctttga tttttcccta atttatgttg 240 gagctttttt tgttgtaaag aggccttaag cccacgaaca gctcaagagc tttagaagct 300 ttaagcttta gcgtagatca gcggcagcag gtgcttcacc cctatgcgac ggttattagt 360 tactcttact tttgcattaa taccaaatca ccgaaaatga agaaatctta aattacaaaa 420 gttcctactt aagtattcat tattattgtt attattcatt caattattcc cagtacccta 480 ccaccggttc ttaagatcat atgttttatt cctattattt aaaattatta ttccatagtt 540 ataattcttt tattacccta gcttatactc cacccatacc tatatatcat tttccttcta 600 cccatgtatt tagctttagg cactatcgat aaaaataaat attgttaatg aattggaaat 660 gaattgctag aagatttttg tttaggagcc atgcagggat cgatgaggga gagtgaggta 720 tgacccccgg cgggaaaagg gatcatccat aagacatatt ttcttcggaa ataggatgat 780 caaggttggg agggccatgg ctgttgcatc caacaccagc acagctaccc gtcgggtcgg 840 atatgcatgg cggaagccca cttccctgtc cccttaattc ctttgtccct tctaattccc 900 ccagcctcct catcctcctt ccccaacccc ctaacaccag tcatacggtc atacgcaggg 960 tatcgcacgc gaaaaaataa taagataagg catcgacgcg agcacgtata ggaaaggacc 1020 agtataggaa cccccccata ctaccgacca tacagccgga gccagcgcgt gcaccagtcc 1080 tgcccgatac acccccgacc agctcgagtg gcaccgccac acacaacaat cgttgtatgt 1140 gatatgctac atttatctta ttttggcgcc caacgtgggg cccgatccgg cttgccccat 1200 ttaaagaaat aacttggtca aaagtaacac ctgactttta tccttgctag accgtttttt 1260 tgtggttttt ttgctcgcct ttcctctgca acttatccag tggatactgt ggatccgtgt 1320 aagctcatca tggcctacag ggcctagtat cgtacagggt atattggcgt gcagtagggc 1380 cgatacactg gaagtgttat aacttatatt ggctataact ccgtgggtca agcttagcct 1440 atcaactagt tgaagattgt tataggagct aagcgagaac actactgcag gaaaggagaa 1500 cgcattccaa atcgtttatg atgaaattct acgaagattg cccctaatta agtgggtttt 1560 ggctgtcctg aggaacccct gagaagatta ctgaggttgc aagcctcggt aagcctaggt 1620 gacatcggac tcccatatta cctgatgatt agtggaacta atccagcacc ttcatcagcc 1680 ccaatacttc tgatctgacc ttttagattt cgctggatga gggataattt atcctgccac 1740 ggacctaact tgtgaggcag atttcagtta taagtagatt accccattcc tgttccttcc 1800 ctacccgaaa acaatgcaat ccgctttact ctattcgtac ccaaaaaaac gattgcatgg 1860 ctcctaaact gttttgatat ctacatatat atatctatat ctattgactg tgactttttt 1920 gtattatttt ttttgcattt gctatatata tctaataaat tcatacttgg tggtgcccaa 1980 accgaaatgg tggtgtctcg tagtacggag aatttagccc tctcagaaga cacggggaaa 2040 cattgccaaa tttgctccga agtaatcgtt taccaggtac aactattgtc gaccagttgt 2100 gggcacacct tccacagagt gtgcttcacc gctaaaagtg gtaacaaacg gaagtgtccc 2160 acttgtcagc aagacctaaa catcacatcg gcctcagctg agaatcttag tcccattggt 2220 ccgtctacgt ctagagcacg aatgccatcg gttactcagt cccttcagac tagaagtcaa 2280 agtaaaagag tggaagcgtt aagccctctt gagatggcta actcagagaa tgccaacagg 2340 tctccaatct ccgtcgaccc tcaacagaat caggccacta tggcggaaac tatacagcag 2400 tccatagtgg acgcgattag agtagctatg gtagagcaaa cccgaatttt ggcgaccagc 2460 atggctgatc aaactaaccg gctagttcga gccatttcag atggacttca aagtacggca 2520 tctgtaccct ccccgaacat aaatgcagct cggatgcaac cggtagcaga actagaacag 2580 caaacgatcg agcaactttt tcggctgtcc actagtgaac aggtaccagt gccgggaatt 2640 aatcaatcgg gaaatagttc gtcaccctcg aatttggggc aaacccgtaa tgttggctca 2700 ggagtttcca caggttcctc cgcttcatcc cttcgggcag accgtgttag ccagataatg 2760 tctaactgga aactgcggtt tacgggaaag gctggtatgt cgattgagga cttcatttac 2820 cgtattgaag cacttacaaa acaaactttg gacggtaact ttgaaataac agctcgatac 2880 gccagtaacc tctttgatgg gagcgccagc gagtggtttt ggcggtatca taaaagtgtc 2940 cctgaagtcc gttgggcaga tctatgccta gcattgcgtg gccaatttaa agacgataga 3000 acagatagga atatcagaac agagatcgaa ttaaggaaac agcgaccaaa cgagtccttt 3060 gatgattttt atggcgtaat tgccgcattg gcagataaac tgtctgagcc catgtctgag 3120 gcagccttgg ttgagacact gcgagcaaac cttttggctg atatacagca tgaaatttta 3180 tatgagtcgg ctgcgactgt gaaatctcta cgtcacctgg tccgaacgcg agaaatattc 3240 atgcagtccg tagctaaaac caagcctccc ctggcggccc cccgaacttt accacgccgc 3300 caagtcaatg ctatattaga cgaatctatt gccccagata gtgaagaagt aggtgaagaa 3360 gaagatctgg aagtatcagc cgtagagttc gtctgttgga attgcggtgg caaaggccat 3420 cgataccaag agtgcatggc cgaacgtagg gtattttgct atggttgtgg cagggccgat 3480 acttataagc ctagttgcag gaagtgcaac ccccacacgg caaaaaactc gttgtcgcgt 3540 gcaccgccga aaagtgcacg caaacaactg gtcaaccaag cgaccaatac cgagtgacca 3600 attcttctcc ttccactgac cttattatac cagacgtgtc aaacagcgta tacgctgacc 3660 accttgctag ccctccaaat cgatcccagc gccgaagaag cgattttaag aaacattgca 3720 aacttgagaa aagactgctc ttagcttctg tagttggaaa accaaatgat ttacgacccc 3780 atgctcagat tatgcttttt ggtaaaccag tcataggact aatggatact ggagcttcag 3840 ttagctgcat agggggaagt tgcgcgtccg acgttattcg gaataaagtc caatataagc 3900 cgctgtccct aaaagttaga actgccgacg gaaagagcca aagtatagtc ggcaaaattt 3960 ccgcaaaagt cacgtataga ggggaatcca aggatccttc cttgacgggt aatctctatt 4020 taggaatcga tttttggacg gcttttgaac ttttaccgtt gcacttgatg acacaaaatt 4080 tacgtattga aaccttagaa gaccccacaa tgagggcgct tctgccttca cagcaagtat 4140 tactacagga ggtaatagct ctcttcccat cttactcaaa gcaaggtcta ggaaagacct 4200 ctgtaatcac tcacacgata gaggtagccg acgcaaagcc cattaaacaa cgccactatg 4260 cggtgtcacc ggcaacggag aagctaatgt acgaggaatt agaccgaatg ctgaagcttg 4320 gggttataca agaatcggtt agttcctggt cttcgcccgt agtgctggtt aaaaagcctg 4380 gcaaggttag attgtgtatc gattgtagga aagtaaactc ggtcacggta aaggatgcat 4440 atcccatgcc ccttatagac ggaatcctga gtagactacc gaaagcagag ttcatcacca 4500 gcatcgatct aaaggacgcg ttttggcaaa ttcctctcga tcaaaaatct cgtgataaga 4560 cagcgtttac cgtgcctggc cgtcccttat acgagtttgt cgtcatgcca ttcggcttaa 4620 ccaacgcacc ctgtaccttg tcgcgactta tggataaagt cataccagcc catctacgga 4680 atgaaatatt catatatctt gacgacctct tagttatatc cgataatttt gagcaccatc 4740 tggaggtgtt gtccctggtc gcatcccaac tttccaaggc caatcttact ataaatgtgg 4800 agaaaagcaa attttgccgg aaagaagtta aatatttagg ccacgtgatt ggtggtggga 4860 atattcgcac tgatcctgac aaaatagcca ccgttaagga ctttccagct ccccagtcga 4920 tcaaacaagt tcgtcgattt cttggaatga caggctggta ccacaagttc atccacaatt 4980 atgcagctat cagttcaccc attacagata ctctgaaatc taagcgcaag tttatttgga 5040 ctgacgaggc tcaacttgct ttcgagtcat taaagtccca aatgtgttcc gcgccagttc 5100 tacatagccc aaatttctcc ataccatttt caatacactg tgacgccagt cataccggcg 5160 taggggcagt tctaatgcaa gaaaacaacg agggcgcaga cgtgccaatt gcttttatgt 5220 cgaaaaaact taacaaatgc cagaggcagt acactgtaac tgaaaaagag tgttacgctg 5280 cggttttagc cgtaaaaaag ttcagggcct acattgaggg ccaagaattc acaatcatta 5340 cagatcatgc ctccttaaag tggctaatgt cgcaggctga tcttagctcc cgattggctc 5400 gctggtcctt aaagttgcag ggatttgaat ttaaaatctt acaccgaaaa gggagtgata 5460 acatagtccc cgatgccgtg tcccgtactc actcagaaga cttgtccttc ttggcaattg 5520 atagcatgtt ggtcgaccct gattctcctg agttcaaatc accagaatat cagtcaatta 5580 aagagaagat taggttgaac agttcccgat tgcctgatct gaaaatcaaa gatgaacgca 5640 tctttcggcg tgcggaacat gccgatggaa ccaagatcac agacgattcc tgctggaaac 5700 tttggatacc cccatccctt gtaaaacccg tcctcaaagc tgcccatgag gatcccttgt 5760 gtgcccacag tgggataaat aaaacccttg aaagactcag gcgatcttat tattggccaa 5820 acttagcggt ggatgtcaaa gaactcataa ataattgcca tatctgtaaa accacgaagc 5880 atccaaatca acccttaaag ccaccaatgg gaaatacagg attaccttct cgattctttc 5940 aaaaattatt tgtagacttt ttaggacctt atccgagaag caagtcagga aatattggca 6000 tctttgtcgt gttagaccat ctgtcaaaat ttccattttt gaagcccgta cgaaaattta 6060 cagcagattc cataaaacgc ttcctggaag aagacctctt tcactgtttc ggggtccccg 6120 aaaccatcgt aagcgataat ggggtccaat ttaaatctca cacttttaac tccttgcttg 6180 acaaatataa tatctctcat gtctatactg ccttttattc tcctcaagcc aatgcttcgg 6240 agagagtaaa tagatccgtt ctggcggcca tcaaaagtta tgttgacccg gatcaacgta 6300 attgggatga gaaattaagc agcatttgct gtgcacttcg gtctagtcat catttagcca 6360 tgagcactac cccgtatcgg cttgcttttg gacagcatat ggtaaccgac gccagtagct 6420 ataaactgat gcgtaaccta gacatgttag aagatcgcgg tgtccaattt tcgcgagaag 6480 attccttcga cataattcgc agcaaagcaa aaggctctat acaggcccag cacttccgaa 6540 atgccagatc ctataactta cgaacgagac aaatttccta taaggatgga caggaagtat 6600 atagacgtaa ctttcagcaa agcaatttcg ctaagggctt tagcgcaaag ttagcaccaa 6660 cctttataaa gtcccgagtg cgcaggaagc taggaaactg ctattacgaa cttgaagatc 6720 ttcaaggccg ccttgtaggc aaatttcata ccaaggatat aaaaccttga catgaattgt 6780 tacaagcgcg gttcccactt tgggtgtccc accccaaacg tgatttaagt gggggggatt 6840 atatag 6846 // ID piggyBac-21_SM repbase; DNA; INV; 2308 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-21_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2308 RA Jurka J.; RT "Families of autonomous piggyBac elements from planaria."; RL Repbase Reports 9(8), 1831-1831 (2009). XX DR [1] (Consensus) XX CC Contains 2 overlapping ORFs. Probably corrupted. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1246..2073 FT /product="piggyBac-21_SM_2p" FT /translation="MPLQCLVYIGQGTFAHGVIEKYTSFCEAVTMELAHSY FT LEKGHNITMDNYFTTFALADRLLGKNTTMVGTLRHNKREIPPAAKCVTNRN FT RGDSKHYYNGNRTICSFWDKSNKPVLLLSTMHGCQVDRGDGKPDIVSYYNS FT TKSGVDTLDKLVRMFRSKRKCRRWPYSIFFTCVDVAIISNMKLFGEEHYYF FT KRELGYEMVLPHIQRRKNHPKLRETVKQAMRSVGVAFPDVVRERSQSQGRC FT EFCGRQGDRKTKKSCNRRGKLICVQHTVGLCPTCS" XX SQ Sequence 2308 BP; 731 A; 418 C; 494 G; 665 T; 0 other; ccctgtgggg tggaaattac cccactacaa aatgcactat tttactatac aaatgtgact 60 attcacacta ccatggtaga gtaaatgatt gggacattca cacacacata catatatata 120 tatactactc tgtattccaa gcggattgtc ccatgacagt actcatcaga gaatcgtttt 180 agatagaaca atattgtttc atttatcaat gaacatcata ggaggatctg atgattataa 240 tggcaataga aaatttagtt aacacaggag gcaatgtaat gagtcacttt ttactgtgtc 300 tgaggaattg atgactgtct ttataaggaa cggtacataa gatgcaaaga atagtgtcct 360 ttgcaaaggt tggctggtca aagtacatat taaaagtcgc caggaatggt cgacgccatc 420 ctcagttctt tgcactttgt atgaagagat ttaaacagcg atattgtgca gggtgatgta 480 gggtggaggc aatgtgatga gtcatttttt actgtaaaca ataaataata gagacacaca 540 tgtatattca cacacataca ctactcttac acacagcttt cacagacacg tatataacga 600 cagatattta gtggctattc ataatttgat tattggccat accatgtcta aacgtcaact 660 tccccaacgc aacgctcgta ctgtggcaaa tgctttaatt tataacgttc ctgtcgaaga 720 agaagatgag tttgaaaatg aattagaaga agatgattac gtaaatgaag taattgctcc 780 gcaacaagat gacagtacag ttgatgatgt tgaagaacaa gatgatgtgc ttgatgaaga 840 gaacgtaaac gttgaatcca gtagtgacga tgattcagtg gaggagaatg tgtttcgtgc 900 taccatgagt tataaacgat tcgcccagtt aaaagaagct ttacgattcg acgatgctgc 960 tcgccgtgat cgcgatgatg ttttgagtcc aatccgcgac attatcaacc gcaagttgta 1020 cgagttctac cgacccggac ctcatctttg tatcgacgaa atgttggttg aatttcatgg 1080 acgtgtccgt tttcgccaat acatcccatc aaaacagaca tgtatatata tatatatatg 1140 tgtgtgcgtc ttgctgaata tgccattcac tgatgactat tttatctatt tacggcaggc 1200 aagtttggca tcaaaatctt ctggctcacg gacgctgaaa atgcaatgcc tctacagtgc 1260 cttgtttaca ttggccaagg tacatttgcc catggcgtga tcgaaaaata tacatcattt 1320 tgcgaagctg ttacgatgga gttggctcat tcgtatttag agaaaggcca taacatcaca 1380 atggacaatt atttcacaac cttcgcattg gctgaccgcc tcttaggaaa aaataccaca 1440 atggttggta ctttgagaca caataaaaga gagataccgc ctgctgcaaa atgtgtaaca 1500 aatcgaaatc gaggagacag caagcactac tacaatggca atcgaactat ttgttcattt 1560 tgggacaagt ccaacaagcc ggtcctgttg ctttcgacga tgcatggatg tcaggtggac 1620 agaggtgatg gcaaaccaga tattgtgtca tactataatt caacgaagtc tggcgtcgac 1680 acactagata aacttgtacg tatgtttcgg tcaaagcgca aatgtagaag gtggccttat 1740 tccatttttt tcacatgcgt ggatgtcgct attatttcca atatgaaact gtttggagag 1800 gaacattact atttcaagcg agagcttgga tatgaaatgg tgttgccaca tattcaaaga 1860 aggaaaaacc acccaaagtt gcgtgaaact gtcaaacaag caatgagatc ggtcggagtc 1920 gcatttccag atgtggtgcg ggaacgttca caaagccaag gacgatgtga attttgtggt 1980 cgacaaggag atagaaaaac gaaaaagagc tgcaatcggc gtggaaaact tatctgtgtg 2040 caacacacag tgggcttatg tcctacttgt tcgtgatcat tatttgttaa atttatcata 2100 atcatgtgac tatttatatc attttgcaaa gatgtttgga ttcagagcgt tctgatttga 2160 taataaaatt actatcttat acagcttaat tggttcaaag ttgacgggaa tataatagca 2220 gcgattttgc tatctaatat aatgaaatga acgaaaagtg cgactgaaca atatgtagtt 2280 tctgtggggt aatgtacacc ccacaggg 2308 // ID DNA4-3_CQ repbase; DNA; INV; 1108 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1108 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 73-73 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. 46 bp TIRs. 4-bp TSDs. XX SQ Sequence 1108 BP; 363 A; 216 C; 208 G; 321 T; 0 other; ctgccgtttt acggcgatat gatgtatacg ccattttgga cattttcaat tttattgtac 60 agtctgataa agaatcttaa aagctacctc ctgacgaaag aatttttcaa aaaactgctc 120 tggggcccat tttattaagc aaacaaacaa tgatgtatac gccaatttta ctcatcatga 180 tgtatgtata cattgagaat tgatagaagt caaattcaat actgtttgaa tgttgatttg 240 aggttttggg accaaatcaa gactgggcaa tattttatta cattatttcg atttaattct 300 taaggggacg tccattaggc gtcatccata aagtacgtca cgctctgagg gaagaggggg 360 gttctgagaa agcgtgacgt tgcgtgttaa aagcataggg aaatcgtgac aaagggggtg 420 gagtaaattt tggctgattt taatgtgacg tacttaatgg atgacgcctt atccacatcc 480 acgtagacac ttttttgaaa attctagacc cacccacctc ccttgctgac aattgttcat 540 gcaaaaaatg tgtttatgga gcatagacaa tcgctaaggg agcgttattt tattacgtaa 600 cgcaaaaaat cggattttta gaccccaccc cccctcgtaa caaaatttcc atacaaattt 660 taaaaaaatt gtatggagcg taacacggcc cccccctccc cccaactgcg ttacgtaata 720 aaagaacgct ccctaatacc ttcccccccc ttaaagtatc cacgtggaca atgcattacg 780 ggatccactc agctatcaaa atagctggca caaaaaataa agccagcttt gatcgagcaa 840 tgtatacata catcattggg caggaaaagt agtgattttt tattgctatt tttacggcca 900 aatgatgttt gtggtttaaa atcgtagaga ataggaaaaa acaagaaatt ttgtaggggc 960 cacattttta ttgtactttt taacagtttc tacagagcag acctcaaatt agtcatttta 1020 aattgaaaaa aatgaacaaa aacagaaaat cgtgtctaca gcaaaatcac ctcaatggcg 1080 tatacatcat atcgccgtaa aacggcag 1108 // ID Copia-4_CQ-I repbase; DNA; INV; 3454 BP. XX AC AAWU01030460; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_CQ_; KW Copia-4_CQ-LTR; Copia-4_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3454 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 323-323 (2011). XX DR Genome; AAWU01030460; Positions 4767 8220. XX CC Positions [866-1399] - Integrase core CC 'GCAA' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1073..3454 FT /product="Copia-4_CQ-I_1p" FT /translation="MATAYFGKRISRLRCDNGGEYTGHAMRRFCKSKGIQL FT ETTVPYTPEQNGSSERLNRTVVEKVRAILAASRLPRNLWGEAVYTAIYVLN FT RSPSVAVEGDVTPYEAWHGKKPDVSILRVFGSDCYAHVPKQRRKKLDSKTE FT KVKFMGYGPTGYRVWNGQKIFTARDVVFNELVFTKEKEENSASEEWVVVEK FT SVVRQVEPVRQVEPARPVEPALPEDSEAESEYNTGEESAEEDEVVPVPVLT FT PVGPSPQRDTEGREKRTIVPPAWHADFDMTVFALCAEEYVDEIPSSVGELR FT KRQDWPKWEQAIQEELTSLAKNDTWRLVAPPDGARIVDNKWVFKVKKNGDG FT TVERYKARLVARGFTQRAGFDFSETYSPVAKMATLRILLALANHEGWHLHQ FT MDVKCAFLNGSLDEDIYMCQPLGFERRKDLVCKLNRSIYGLKQSSRNWNRR FT FDDYVTKLGFKRSQHDLCLYHWKADGVVTYMLIYVDDILIAGNDLKAIRAL FT KGKLSTEFEMKDMEEAKIFLGLRIDRERKAGSMTIDQKAYTLGILKRFGME FT NCKSSPVPMQPRLKLERGDPKELTDKPYKELVGCLMYLTVTSRPDICPAVN FT YFASFQSSATEEHWVHLKRVLRNLQGTLDYKLVYQRTDSAVGVEAFADADW FT GNDPTDRKSVSGFVLKLHGQTVNWSTRKQTSVALSSTEAELMSLCQASCEV FT MWMLNLLASIEYEVPHPVTVYEDNQPCISVTADPRKLKRMKHIDVQYRFVK FT ELIDNGKLRLEYLSTEEQVADLMTKGLAAPRFKKLREMLGIAN" XX SQ Sequence 3454 BP; 870 A; 834 C; 1110 G; 640 T; 0 other; gcaaccacgc gttttattac tactgctaag aggttacggg ccccagccat aacctagaaa 60 gttcgggaaa aagttcgtgc aaaaagtttg gcggaaagtc gtgcgcgatg ggcgaaaaag 120 ttgaaaccgt ccggattcgg cagttcgatg gcagcaactt cagcaactgg agttaccggg 180 tcctccggta cttgaagacg ttgggactga ggcactgcat cgagaaggcg gccgaggagg 240 aggacttctg gaacgttccg gaaggggacg cgcaagcggc agccaaggaa gttttgaagg 300 cacggcggat gaaggaggac gaccggtgcg cgaacgcgct cgtcgagttg gtcgcggaca 360 gccaccttga gcacgtcaag ggaaaggaga gtccgaagga aatttgggac aggctgtgcg 420 cagttttcga gcggaaaagc accacgaacc ggtacctgct gaaaaagcag ctcctgacga 480 tgaagttcga cgagaaggaa cccctgcagg accacttttt gaaattcgac aagctggtcc 540 gggagctgaa gggtgctggt gcgaaaatgg acgaggagga cacgatctgc catctgatgc 600 tgacgatgcc ggattcctac gacaccgtga cgacggccat gagcgtcatg tcggaccaac 660 tgacgatgga cgtcgtccgg cggaactatc tggatttcga ggccaagcag aaaggaaagc 720 gtgcagaaca acaaacggaa gacgctgctt ttgcgggcca ttcgaagccg aagttcaagt 780 gtttttcctg cggcggtgtc gggcacaaga agaaccagtg tccgaagcgg aacgacaaga 840 agccggaaca gcggaggaag ccggagcaca aggcgaacct gggatccaat cccgtgtcgt 900 tcgtggcgga cgtgcaagaa gtcgcggcga cgtctgtggc cccgtcacac ccgtgacgtg 960 ggacggcaat gcgtatttcg taagtttcac tgacgatttt acgcatctgg ccgttgttta 1020 cctgctcaag accaaggacg aagtgctgga gaagttcgtc gagtacgaag cgatggccac 1080 ggcgtacttc ggcaagcgga tttcacgact ccggtgtgac aacggcggag agtacactgg 1140 ccatgcgatg cggagattct gcaagtcgaa ggggatccag ttggagacga cggttccgta 1200 cacgccggaa caaaacggat cgagcgagcg gctcaaccgg accgtcgtgg agaaggtgcg 1260 ggcaatactc gcggccagca gacttccgag gaatctgtgg ggcgaagctg tttacacggc 1320 gatctacgtg ctgaacagga gtcccagcgt agcggttgaa ggtgacgtca ctccttacga 1380 ggcgtggcac ggcaagaaac cggacgtgtc gatcctgcgg gttttcggca gcgactgcta 1440 cgcgcacgtg ccgaagcaac ggcgtaagaa gctggattcg aagacggaga aagtgaagtt 1500 catggggtac gggccgactg gatatcgagt atggaacggc cagaagatct tcacagcgcg 1560 agatgttgtg ttcaacgagc tagtgttcac aaaagaaaaa gaagaaaaca gtgcaagtga 1620 agagtgggtc gttgtggaga aaagtgtcgt tcgccaggtg gaaccagtgc ggcaagtcga 1680 accggcgcgg ccggtggaac cagcgctgcc agaagattcg gaggcggagt ccgagtacaa 1740 cactggtgaa gaaagtgccg aagaagacga agtggttcct gtcccggtac tgacgcctgt 1800 tggaccaagt ccgcaacgtg acactgaagg gagagaaaag cgtacgattg tgcccccggc 1860 atggcatgcc gacttcgaca tgaccgtgtt tgcgctgtgt gctgaagagt acgtcgacga 1920 gattccttcg agcgtcggtg agttgcggaa gcgacaggat tggccgaagt gggagcaagc 1980 cattcaggaa gagctcactt cactggcgaa gaacgacacg tggcgcttgg tggcgccccc 2040 tgacggagcg aggatcgtcg acaacaaatg ggtgttcaaa gtcaagaaaa acggtgacgg 2100 aactgtggag aggtacaaag caaggctcgt ggcacgtggg ttcactcagc gggcgggctt 2160 cgatttctcc gaaacgtact caccggtggc gaagatggcg acgttgcgga tactcttggc 2220 cctggcgaat cacgaaggct ggcatctcca ccaaatggat gtcaagtgcg ctttcctgaa 2280 cggttcgctg gacgaggaca tctacatgtg ccagccgctt ggctttgaga ggaggaagga 2340 ccttgtttgc aagctgaaca gatcaatcta tggcttgaag caatcctcca ggaactggaa 2400 ccgacgattc gacgactacg tgacgaagtt ggggttcaag cggagccaac acgatttgtg 2460 cttgtaccac tggaaggcgg atggtgtcgt cacgtacatg ctgatctacg tggatgacat 2520 tttgatcgcc gggaacgacc tgaaggcgat ccgggccctg aagggtaagc tgtccacgga 2580 gttcgagatg aaggatatgg aggaggccaa gatctttctc ggactgcgga tcgatcgtga 2640 aaggaaggcg ggatcgatga ccatcgatca gaaggcatac acgctgggca ttctcaagcg 2700 attcggcatg gaaaactgca agtcttctcc agtgccgatg caaccgcgct tgaagctcga 2760 gagaggtgat ccgaaggaac tcaccgacaa gccgtacaag gaactggtcg gttgcttgat 2820 gtacttgacc gtgacgtctc gaccggatat ctgcccggcg gtcaactact tcgccagttt 2880 ccaaagcagt gcaaccgagg agcactgggt ccacttgaag cgcgtactgc ggaaccttca 2940 aggtacgctg gactacaagc tggtctacca gcgaacggat tcggcggtcg gagtcgaagc 3000 ttttgcggac gctgattggg gaaacgaccc caccgacagg aagtccgtgt ccgggttcgt 3060 gctcaagctg cacgggcaaa ccgtgaactg gtcgacgcgc aagcaaactt ctgtggcgtt 3120 gtcgtcgacg gaagccgagc tgatgtcact ttgccaagca agctgcgagg tgatgtggat 3180 gctgaacctg ctggcgtcca tcgagtacga ggtgccccat ccggtcaccg tctacgagga 3240 caatcagcca tgcatctctg tcacggctga tccacgcaag ctgaagagga tgaagcacat 3300 cgacgtccag tatcggttcg tgaaagagct gatcgacaac ggtaagctgc gtctggagta 3360 cttatcaacg gaggagcaag tcgccgatct catgacaaag ggattagcgg caccacgatt 3420 caagaagctg agggagatgc taggaatagc aaat 3454 // ID Copia-130_AA-LTR repbase; DNA; INV; 172 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-130_AA_; KW Ty1_copia_Ele193; Copia-130_AA-I; Copia-130_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-172 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 172 BP; 53 A; 35 C; 35 G; 49 T; 0 other; tgtgagaggc cacgtgctca gtagcatagc atcgtagcaa cataggcgta gcaaggtgta 60 acatagtaac ctagttgact tgttatgttg aataaataag tttttcattc cctgttgata 120 aatcactcaa acaagacgtg tttcaatacg cgccagctgc ataactctat ca 172 // ID Copia-27_AA-LTR repbase; DNA; INV; 185 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-27_AA_; KW Copia-27_AA-I; Copia-27_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-185 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 954-954 (2011). XX DR [2] (Consensus) XX SQ Sequence 185 BP; 44 A; 41 C; 29 G; 71 T; 0 other; tgttgagtgt acaaccctgc tgcctattgc ccctactttc attccacacc aatgcgttat 60 gttggtatgt tgtatttttg acatttctct cattccaatg taaaatcaat ttgaataaac 120 acgttgtaat ttttcttcaa tcgtaaaacg tttgtcgcga gtttatttct gctgcggcca 180 atcca 185 // ID CR1-92_AAe repbase; DNA; INV; 5010 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-92_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5010 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1180-1180 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 74..1006 FT /product="CR1-92_AAe_1p" FT /translation="MLPTFWRYSSLRCTLHRLIFFCCVLRIPITKMSEICH FT SCTNEIHEDRVTCGGFCTAVFHLKCCRLPSGLLDEINRNKQIFWMCNACAV FT IMSDVRHKKNVKAAYEAGLEKQLSSHTEILGNLKQQILDELKIEIRSSFSK FT MSNIAFTTPITSRRSSNVPRTIGSRRLFAKKDNNVPTLIHATGESESPSLG FT RLTVQQPTKKFWIYLSRISREVTPEQIAELTKRRLSTENIEVVRLVAKNKD FT IRTMSFISYKVGVEMELKSVALSSSTWPKGMLFREFIDDRKSENFWRPQTN FT FLHTTPNSVNSDDQSPMIE" FT CDS 901..4734 FT /product="CR1-92_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="IYRRSQIGKFLAASNEFSAHYTEFGQFRRSIPNDRMM FT LLSPSPQTLQTSGRKLALSRLEVLDPLSTVELFQPAIVSRPDPVFESGDRI FT FQTQIQGKYHDVLNKSVPEMPTSFSQPPINDSCMIDIIRKPGRTYAASIME FT APHPLAAVVPFQPATSSRSGPACESGDGVFQNSTSGKYDKAPDSSLPKIYD FT AYSRSSSQSLQTASSKLTPGRTLASVIMEASNPLITVPSLLPATIRRPGPV FT FELSDGGFQTPSIGKYNRIKATPPSERKLSFSAPSSLPVSRQLWSNYDSSS FT NLRSSTAVTIGETAEILSEEFRSAPENLPTSSQHDLRVFYQNVRGLRTKID FT SFHLAVADSEYDVIVLTETWLDDSILSTQLFNGSFSVFRTDRSPLNSRKSR FT GGGVLIAVSSHLCCHVDPAPVSQSLEQLWVKIKLQTHSLSVGVLYLPPDRR FT SDISLIQQHVDSIGCVLAGLSPGDGAILLGDYNHPEMRWISSDNGGLKIDI FT HNTRLTTAGSSLYDAFCFHGLTQINHIENSNSRTLDLILVNETLLPLSRVI FT EAVEPLVMTDISHPAIEVTLSVPSPQRFVQWNDNNRFDFRRADFNALTDAL FT HALDWQFLESEVNLDSMVETFNSQMQMVIQTHVPRQQPPLKPPWTTNRLRK FT LKRLRRSALKKYSRNRSDLTKHRFVLASNEYIRHNRFLYDCYIDRTQRNLK FT SNPKQFWSFVRSKRKEEGLPSNMFFGESTANTALGQCELFAEHFKSAFSHV FT TVSATQLIEAARDVPANSFDFQIFEITEEHILNALSKLKNSYSPGPDGIPS FT SFLKKCSAALMTPLLKIFNLSLQRGNFPAGWKVSYLTPIHKRGDKCNVANY FT RGITSLSACSKLLEIIMNNALFETCKNYISSAQHGFYPKRSVTTNLVEFTS FT LCTRSMDAGLQVDAVYTDFKAAFDRVNHEILLYKLERMGVSITVVRWLKSY FT LADRSVRVRIGSTYSETFSILSGVPQGSNLGPLLFSLYINDMSRVIPRGTR FT LLFADDVKLFMIIASLDDCQVLQQNLNTFEDWSTRNCLELCVTKCCTISFS FT RRQNPMRYDYCLCGHILERKDEVKDLGVILDRHMTFRPHFDDVIAKANKQL FT GFIFRIACGFDDPHCFKSLYCALVRSILESSVLVWCPYSRNWIDRFEAIQR FT KFVRLALRNLPWQDALNLPPYEHRCMLLGIDTLEKRRHSMQAEFISKVLNS FT EIDAPAILAELQLYVPERPLRRRNFLHLPSRNSRYGQHDALRFMALRYNET FT AEHYDFYMSDQNIPLIS" XX SQ Sequence 5010 BP; 1374 A; 1141 C; 1030 G; 1465 T; 0 other; ccacacacgt tccttttctc catcgaagcg tcatctgcct gcaaaaattg aaagctacct 60 gcctgtacat cgtatgctgc ctactttttg gcgatattcg tctttgcgct gtacgctaca 120 tagattgatt tttttctgtt gcgtattgcg tatcccgatt acaaaaatga gtgaaatctg 180 ccactcttgc actaatgaaa tccatgaaga tcgtgttacg tgcggaggat tttgtactgc 240 cgtatttcat ttgaaatgtt gccgcctacc aagtggatta ctggacgaaa tcaatcgaaa 300 caagcagata ttttggatgt gtaacgcttg tgctgtcatc atgtcggatg ttcggcataa 360 gaaaaacgtt aaggcggctt acgaagccgg gttggagaag cagctaagtt cccatacaga 420 gattcttggg aacttaaaac aacaaattct tgatgaattg aaaattgaaa tccgatccag 480 cttctccaaa atgtcaaata ttgctttcac tacaccgatt acttcaagac gctccagtaa 540 tgtgccacgc accatcggaa gtcgacggtt gtttgcgaag aaggacaata atgtgccaac 600 tttaattcac gctactggag aatccgaatc tccaagtcta ggtcgattga ctgtccaaca 660 accgacaaaa aagttttgga tttacctttc acgaatttct cgagaagtta ctccagaaca 720 aatcgctgag ctcaccaaaa ggcgtttatc tactgagaac atcgaagttg tccgtttggt 780 tgcaaaaaac aaagatattc gcactatgtc gttcatatct tacaaagtgg gcgttgaaat 840 ggaactgaaa tctgtggctc tctcttcgtc cacttggcca aaagggatgc tatttcgtga 900 atttatcgac gatcgcaaat cggaaaattt ttggcggcct caaacgaatt ttctgcacac 960 tacaccgaat tcggtcaatt cagacgatca atccccaatg atagaatgat gcttttatcc 1020 ccgtctccac aaacgcttca gacatcggga cgcaaacttg ccttaagccg cttggaagtc 1080 ctggatcctc tcagcacagt cgagttattc cagccagcga tcgtcagtcg tcccgatcct 1140 gtgtttgagt ccggtgatag gatcttccaa acccaaatcc aaggcaagta tcatgatgta 1200 ttgaacaaat ccgttcctga aatgcctacc agttttagcc aacctcccat caatgattcc 1260 tgcatgattg acataattcg gaaaccggga cgcacgtatg ccgcaagtat tatggaagcc 1320 cctcatcccc tcgccgcagt cgtgcccttc cagccagcga catccagccg ttccggtcct 1380 gcgtgtgagt caggtgacgg ggtcttccaa aactcaactt caggcaagta cgataaggct 1440 ccggacagtt ctctccctaa aatctacgat gcttacagcc gatcctcatc acaatcattg 1500 caaactgcct cttcaaaatt aacaccggga cgcactcttg cctctgtaat tatggaagcc 1560 tccaatccac tcatcacagt cccgtctctc ctgccagcga ccatcagacg tcccggtcct 1620 gtgtttgagc taagtgatgg gggcttccaa actccatcaa tcggcaagta caatcgtata 1680 aaagcaactc ctccatctga aagaaaatta agtttcagtg cgccctcttc actacccgtg 1740 tctcgccagc tctggtctaa ttatgacagt agcagcaatc ttcgttcatc aactgccgtc 1800 actatcggtg aaacagccga aattctgtcc gaggagttcc gctctgcgcc agagaatttg 1860 ccgactagtt ctcaacacga tttacgtgta ttttatcaga atgttagagg gctacgtacg 1920 aaaattgact cttttcacct ggcagtagct gattcagagt atgacgtaat tgttctcacc 1980 gagacgtggc ttgatgatag catattatca actcagttgt tcaacggttc attttccgta 2040 tttagaactg atcgcagtcc tctaaacagt agaaaatccc gaggaggtgg agtactaata 2100 gccgtttcct cacacttgtg ttgtcatgtt gacccggcac ctgttagcca gtcgttggag 2160 caattatggg ttaaaattaa gctgcaaaca cactctctaa gtgttggtgt actttatctg 2220 ccaccggatc gaaggagcga tataagtcta attcagcaac atgttgactc tattggatgc 2280 gttttagccg gtttgagtcc cggtgatggt gctatccttt tgggtgatta taaccatcct 2340 gaaatgcgtt ggatttcatc ggataatggt ggtctcaaga ttgatattca caacacgcga 2400 ttaacaacag caggtagtag tttgtacgac gcattctgtt tccatggttt aacgcagatc 2460 aaccatatcg agaactctaa ctccagaact ctcgacttaa ttctggtaaa tgaaacgtta 2520 ttgccgcttt ctcgtgtaat tgaggctgtt gaaccacttg tcatgacgga catttcccat 2580 cctgccatcg aagtgacact ttccgtccct tcgcctcagc gcttcgtaca atggaacgac 2640 aataatcggt tcgattttcg tcgagcagat ttcaatgcac taactgatgc tctccatgct 2700 ttggactggc aattcttgga atcggaagta aatctagaca gtatggtaga aactttcaac 2760 agtcagatgc aaatggttat tcaaacacat gttccaagac agcagccacc attgaagcca 2820 ccgtggacaa cgaatcgcct acgtaaactg aagcgcctta gacggtcggc tttgaagaaa 2880 tatagccgta atcgctctga tcttacaaaa caccgttttg tgcttgccag caacgaatat 2940 attaggcata atcgctttct gtatgactgc tacattgacc gtacgcagag aaatctgaaa 3000 tccaatccta aacagttttg gtcatttgta cgatctaagc gtaaagaaga aggactgcca 3060 tctaacatgt ttttcggaga gtctacggcg aatacagctt tgggacaatg tgagttattc 3120 gcggagcatt ttaaaagtgc attctcacat gttacggtat ctgcaacgca gttaattgaa 3180 gccgcacgtg atgttcctgc aaacagtttc gattttcaga tcttcgaaat aaccgaggag 3240 cacatcctca atgctctcag taaactcaaa aactcgtatt cacctggtcc ggatgggatt 3300 ccgtcgtcct ttctgaaaaa atgctccgct gctttgatga cgccattact caaaatcttt 3360 aatttgtcgc tgcaacgagg aaattttcct gctggatgga aagtatcgta ccttacacct 3420 attcacaaga gaggtgataa atgcaatgtt gccaactacc gtgggatcac atccttaagt 3480 gcatgttcga agctgctcga aattattatg aataatgctc tttttgaaac ctgcaagaat 3540 tatattagta gtgctcaaca tggcttctat cctaaacgat cagttacaac caacctcgtc 3600 gaatttacat ctctatgtac gcgttccatg gatgctggtt tgcaagtgga cgctgtttat 3660 actgatttta aagcagcttt tgatagagtc aatcacgaaa ttcttctgta caagttggaa 3720 agaatgggtg tgtcgataac ggtagtacgc tggttgaaat cttacttggc agacagatct 3780 gtgagagtac gcattggttc aacatactca gaaacattct caatcttgtc cggcgttccg 3840 caggggagta atttgggtcc tctcctattc tccctttaca ttaacgacat gtctcgtgtg 3900 atacctagag gtacccgcct gttatttgcg gatgacgtta aattattcat gattatcgcc 3960 agcttagatg attgccaagt acttcagcaa aatttaaata cgttcgagga ttggagcacc 4020 cgtaattgtt tggagctgtg tgtaactaaa tgttgcacaa tttcatttag caggagacag 4080 aatcctatgc gttatgacta ctgcctctgt ggtcatattt tagagcgcaa ggacgaggtt 4140 aaagatcttg gagtgatctt ggatcgccac atgacatttc gtccgcattt cgatgatgtg 4200 atagctaaag ctaataaaca acttgggttt atattcagga ttgcctgtgg atttgatgat 4260 ccccattgct tcaagtcact ctattgcgct ttagtacgct cgatactgga atcttctgtg 4320 ctggtttggt gtccttacag cagaaactgg atcgatcgat ttgaggccat tcagcgaaaa 4380 tttgtgcgat tagcacttag gaatcttcca tggcaggatg ccctcaatct cccaccctac 4440 gaacaccgat gcatgctgct tggaatcgac accttggaaa agagaagaca tagtatgcag 4500 gcagagttca tttcgaaggt tctcaacagc gaaattgatg caccagctat tttagcggag 4560 ctgcaactct acgttcccga acgtcccctt cgtcgacgaa acttcttaca tttaccaagc 4620 cgcaacagcc gttatggaca acacgatgcc ctgagattta tggccctccg ctacaatgaa 4680 actgctgagc attacgactt ttacatgtct gaccagaata ttcctttgat aagttgaagc 4740 ttacattgtt tactagttta ttaattcttt gtatttaagt attttgtaat tttgtttata 4800 tttgtgttat aattactagt attttaattt gtattgccag acgtttatat gttatgtatg 4860 tgtaaatcat tgtcgaaaag atatggggtt tttatgccaa cttgaattat gcaccatgct 4920 agcaattcaa tttggctttt ccccccatca atcttcatta agacaatcgt gtcagatgga 4980 gaaaattaat aaaataaata aataaataaa 5010 // ID Gypsy-10_IS-LTR repbase; DNA; INV; 119 BP. XX AC ABJB010104551; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_IS_; KW Gypsy-10_IS-I; Gypsy-10_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-119 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010104551; Positions 12181 12063. XX SQ Sequence 119 BP; 26 A; 30 C; 37 G; 26 T; 0 other; tgtatcggtg ttgcgcgggc gctcgagcag cagctcggag cctggcgcgg gctcatgaca 60 taaagcacga gagtcagttc tcttctggta ccgagatgtg tggactcaca atcaataca 119 // ID MAR1_BM repbase; DNA; INV; 1310 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Bombyx mori transposon Bmmar1 mariner-like element - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MAR1_BM; KW mariner-like transposon; transposase domain. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Kumaresan G. and Mathavan S.; RT "Molecular diversity of the mariner-like transposable elements in RT the silkworm, Bombyx mori."; RL Unpublished. XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Bombyx mori transposon Bmmar1 mariner-like element - a RT consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 1310 BP; 420 A; 223 C; 276 G; 390 T; 1 other; cttagtctgg ccataaatac tgttacaatt aaaaataaac aaaatattac atttgaattt 60 ggaatctgtc atttttatat gattgctcat tgagttttct cattttggcg ccaatacatt 120 gtacaatatt ttgcgatatt aaaatggagt ggggtgataa agagaaccga atcgctgtga 180 ttgcattaca caaagtaggt atggagccaa atgcaatttt taaaactctc catacgcttg 240 gtattagtaa aatgtttgtg taccgggcta ttaataggtg caatgagacc tcctctgttt 300 gtgacagaaa aagatctggc cgtccacgta gtgttcgtac gaaaaaggtg gtcaaagcag 360 taagggaaag aattcgaaga aatcctgtcc gaaagcaaaa gattttatct cgggagatga 420 agatagcacc tagaaccatg tcgcgtattt taaaagatga cttaggactt gcagcctata 480 agagacgtac tggtcatttc ttaactgata atttaaaaga gaatagggtg gtaaaatcga 540 aacaactact gaagcggtac gcaaagggag gtcatagaaa atttttgttt acggatgaga 600 aattttttac aattgagcaa cattttaaca aacaaaatga ccgtatttat gctcaaagct 660 ctaaggaagc ttccccaatt agtcgacaga gtgcaacgtg ggcactatcc gacttcagtg 720 atggtttggt ggggtattag ctatgaagga gtgactgagc catacttttg tgaaaaaggt 780 atcaaaacat cggcacaagt gtatcaagat accattcttg agaaggtagt gaagcccctt 840 aacaacacca tgttcaataa tcaagaatgg tccttccagc aagactcggc gccaggtcat 900 aaagctcggt ctacgcagtc ttggttggaa acgaacgttt cggacttcat cagagctgaa 960 gactggccgt cgtctagtcc cgatcttaat ccgctggatt atgatttatg gtcagtttta 1020 gagagtacgg cttgctctaa acgccatgat aatttggagt ccctaaaaca atccgtacga 1080 ttggcagtga aaatttttcc catggaaaga gtgcgtgctt ctattgataa ctggcctcaa 1140 cgtttaaagg actgtattgc agccaatgga gaccacttcg aataagcttt ttatacttta 1200 aattgtttta tatttatgta ttaaactaac acactgtaaa agtaataaat gttatttgca 1260 atagaatttt ttttatttty cattgtaaca gtatttatgg ccagactaag 1310 // ID Gypsy-19_DPu-LTR repbase; DNA; INV; 193 BP. XX AC scaffold_613; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_DPu_; KW Gypsy-19_DPu-LTR; Gypsy-19_DPu-I. XX NM Gypsy-19_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 754-754 (2010). XX DR Genome; scaffold_613; Positions 4615 4423. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 193 BP; 35 A; 61 C; 47 G; 50 T; 0 other; tgttgtgtac tggacattct agacctacca cgtcgaccgc cagaggtctc ccacccttgc 60 cacgggcgat gcgagaggct tctcgtctcg accggcagtc tcttttccct tgtgctctcg 120 cgcgtatgtg tgtgaatcga gcctcacgtt aatacaatca cgccgacgta tgccggcgtc 180 gacttactta aca 193 // ID MARINER_TF repbase; DNA; INV; 1293 BP. XX AC U40493; U76904; U76905; U88160; U88161; U88164; XX DT 20-FEB-2000 (Rel. 5.01, Created) DT 20-FEB-2000 (Rel. 5.01, Last updated, Version 1) XX DE Mariner sequence from tephritid flies - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW MARINER_TF. XX OS Trirhithrum coffeae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Tephritoidea; Tephritidae; Trirhithrum. XX RN [1] RA Torti C., Gomulski M.L., Malacrida R.A., Capy P. and Gasperi G.; RT "Characterization and evolution of mariner elements from closely RT related species of fruit flies (Diptera: Tephritidae)."; RL J. Mol. Evol 46(3), 288-298 (1998). XX DR [1] (Consensus) XX CC Based on the following accession numbers: U40493, U76904, U76905, CC U88160, U88161, U88164. XX SQ Sequence 1293 BP; 403 A; 254 C; 297 G; 337 T; 2 other; ttggatgagt gcataagttc gtgcccgatt ccgctggatg ccgtwcgaat cgtttgagag 60 tggcgctcgt gaaagaaata tatacatatt atatatcgtt ggaaaggtga cagtccgaac 120 tgtaaatcaa gcataaaata acttcatttt gatttgagtt gcttagtgaa aagacgcatg 180 gaagatgrac aacgaaaaag atcatatgcg tcatattatg ttatacgaat tccgcaaagg 240 aaaaacagtg ggcgctgcaa ctaaagatat tcgcgaagtt tatttggacc gtgctccagc 300 actccgcaca gtaaagaaat ggttcgcgaa atttcgttct ggagatttta acctcgaaga 360 tcaacctcgc agtggacggc cttctgagct tgatgacgat gttctaagga ctttagttgc 420 gaataactca cgtatttcga cggaagaggt tgccagtgaa ttgaacgtca acaaatcaac 480 tgcgtttcgt cgtttaaaaa aggttgggta cactttgaag ctcgatacat gggtgccaca 540 tcagttgagt gaaaaaaaca aagtggaccg tatgtcaaca gcaatttctt tgcttcccgg 600 acggatcaaa aacgaacctt ttttggatcg gctcgtgact ggtgatgaaa aatggatcct 660 gtacaacaat gttcaacgca aaagaacatg gaaacaggca cacgaagggg cggaaccgat 720 gtcgaaaggt ggattgcatc cgatgaaggt actgctgtgc atttggtggg atatccgagg 780 cgtgatctat tttgagctct tgccagctgg agaaacgatc actgccaaca agtattgtca 840 gcaattggtc gaattgaaga aagcaattga tgaaaaacgt ccgattttgg ccaatcgcaa 900 aggagttctt ttccatcatg acaacgccag gccacatgtt gcaaaaccga ccctggccaa 960 actgaaggag attgaattgg gaaatcatgc cgcatccccc atattcaccc gacattgcac 1020 cttctgatta ttcatttgtt tcgatcgctg cagaacaatt tgaatggaaa aaaatttaaa 1080 aatgtggaag acgtcaaaaa ccaccttgac acctttttca acgagaaacc gcgcgatttc 1140 tatgaatcag gcatccgtaa attggttgaa cgttgggagt ggattgccga acatgatggc 1200 gaatacataa ttgattaata aaagcgcttt cttcaaaaaa tttcaatttt agtttgcact 1260 tgaaatcggg cacgaactta tgcactcatc caa 1293 // ID Gypsy-20_DPu-LTR repbase; DNA; INV; 333 BP. XX AC scaffold_242; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_DPu_; KW Gypsy-20_DPu-LTR; Gypsy-20_DPu-I. XX NM Gypsy-20_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-333 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 756-756 (2010). XX DR Genome; scaffold_242; Positions 56198 55866. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 333 BP; 101 A; 84 C; 53 G; 95 T; 0 other; tgtaatgagc atacacagct catacatgta aactgacaag aaaactgaag acaagaacaa 60 gttctaactc cttcttgtct gccctccttt aacgaaaaac cggaaccatc cacgcccaga 120 tttaattggg atgttcacgt atgtttccct tcccctattc gagtataaat actcgtggtt 180 ttggtacttg aagtcagagt tctctcagct ctccttcaag ctccagctcc agctccggaa 240 aacgtaagac caacctctta gactcttctt atttgtaaac catatgcatt gtacctaagt 300 tgaatacaat actagcaatt gaaggactct aca 333 // ID SAT-5_NVi repbase; DNA; INV; 110 BP. XX AC . XX DT 08-MAY-2009 (Rel. 14.06, Created) DT 08-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Nasonia vitripennis satellite repeat. XX KW SAT; Satellite; Simple Repeat; Nonautonomous; SAT-5_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-110 RA Bao W. and Jurka J.; RT "Satellite repeats from Nasonia vitripennis."; RL Repbase Reports 9(6), 1161-1161 (2009). XX DR [1] (Consensus) XX SQ Sequence 110 BP; 41 A; 31 C; 19 G; 19 T; 0 other; caaagcgctt aaacccagct accccatgct ccccgagctc aataatttat ttaatacaac 60 acctcggcac atcgaaggaa ggcgagcgaa gcgagtataa aatcaaaaac 110 // ID Gypsy-88_CQ-I repbase; DNA; INV; 5783 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-88_CQ_; KW Gypsy-88_CQ-LTR; Gypsy-88_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5783 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 555-555 (2011). XX DR [2] (Consensus) XX CC Positions [4714-5193] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 989..1969 FT /product="Gypsy-88_CQ-I_2p" FT /translation="MTSPLEMIKLVSSLVAPYDGESSKLECTLDALKTIKP FT LVTEANKAATLQSIVSRFSGRARQFVPEIDNATTIDTIMAALGGCKIKTSP FT EAALVTLNATKQTGDIKTFTREIEKHTFALEQAYRDDDMPGDTANKYASKA FT GIKALANGLKNKEAKTIIKASAGAGKILKLSDAINIALEESIDTPAEILLF FT QQSKSNKNYRNNNGNNNNTNGNNYRNRNGNNRYHGNYNNNYNNNYNNNNRN FT HNNNNRNRNDNRNNNRNNNNNYNNNNNYNNRNNNNQSNRGNGHGTNRNPHQ FT IFQLNTDPNNQQNVQTQNVNNNQTGSNRENQHFLD" FT CDS 2035..5535 FT /product="Gypsy-88_CQ-I_1p" FT /translation="MANSKCSFIIDTGADISIFKAGKLKPEQKVNTAKQYN FT LTGVTDGSIRTLAETETQLKFDNGLIVSHMFQVVPSDFPIITDGLLGRDFF FT IKYKCSINYENWLLTFNFQENTVEVPIEDNLNNSIVVPPRCEIIRRIPQLS FT VDEDSVVFSEEIQRGLFCGNTIISPNTSCVKLINTTNSPILLKKFKPKIEP FT LRNFELLILNNQNRVEDIIEKINFDEIPNYTHDPLKQLITKFSDIFCLPDE FT KLTTNNFYEQKINLTDPSPVYIPNYKTIHSQKPVIEEQVKKMINDKIIEPS FT VSNYNSPILLVPKKSSDGSKKWRLVVDFRQLNKKILADKFPLPRIDTILDQ FT LGRAKYFSTLDLTSGFHQIPLEEECRKFTAFSTDTGHFQFLRLPFGLNISP FT NSFQRMMTIAMAGLTPEVAFIYIDDIIVIGCSMRHHIKNLTTVFERLRHYN FT LKLNPSKCKFFKSEVTYLGHKVTDKGILPDDSKYESLVKYPVPQNADEVRR FT FVAFCNYYRKFVENFSTIAHPLNQLCKKNCKFDWSVQCQLAFDTLKHKLLS FT PKILQYPDFAKTFILTTDASNVGCGAILSQITSEGDRPIAYASRTFTPGEK FT NKSVILKELTAIHWAINYFKPYLYGNRFVIRTDHRPLVHLYGMKDPTSKLT FT QMRLDLGEFNFDVIYIAGKENVGADALSRIVITSEQLQNMQMLVVNTRSMQ FT KTKSMQSTQVDTSITYQAAYTVENINETRKLCALRVVNNTLQIYKNKKIMM FT KIPFQPENERPTLEMTLSRLEKELKKMRIDRISLSTEDEIFKKVTTSTFKI FT VANKTLQSLQILIYKPPMYVDNKNKIQEILRNFHNTPTGGHVGQTRMYLQI FT RDLYRWKAMKRSITLFVQACELCKRNKIVQHTKEPQIITTTPSKPFEIITI FT DTVGPLPKTNNGNRYAITIQCELTKYIVLAPIQNKEANTIARAIVDNFILI FT YGKFLEIRTDQGTEYNNEVLSQICKQLKIKQTFATAYHPQTIGALERNHRC FT LNEYLRCFVNEHQSDWDDWLKYYQFNYNTTPHTIHGYTPYELVFGVKANLP FT HSNNQRNLDPVYNLEAYHNELKFRLQKSQNLTKSILEQQKQEKTQICNKII FT NPIHIQIGDLVYLKNENRRKLDSFYQGPFRITQIRDPNCELEEPITKKKTI FT VHKNRIIK" XX SQ Sequence 5783 BP; 2274 A; 1100 C; 938 G; 1467 T; 4 other; ggtttccccg cggtagtgcc gaacgttttt gagatattat atggcgaccg tgacagggct 60 tcgcgtcaaa gttttgacag cacgccgtag tgacccgaaa tgccaccaca ctggaagcca 120 acgcctataa aaaaaaaata aaaattgtgg atttcctgaa aagtagcaga attctcccaa 180 ccattaccgg gcatgtactc atgtcaacaa cgagaggact gatgacccac catcaggcca 240 tccaggacca aaccggtgga atcatcttgg cgatcatcta ctaagacgcg atcgcaacga 300 gcaacgagtt ggaatcgaat gaggtcacca acaacagcag cagtagcagc acccagatgc 360 agccagatca aaaaaaaaaa aaaaaaaaca aacaaccgta aaggggagcg cgcttttagc 420 ggacgcatat ccttacgcgg tgtttctaca cggactcaac atcggtaatc atctagaagg 480 ggagcgtcgg ttgtgggaac ggacggttcc ascacaaccc ggaccgcaga cccgcagcaa 540 catcagtatc gagacaacaa gtaacaagaa gtaacaattg atgaaaattt ttgaagcaaa 600 tagttagtaa tatttttttt ttaatgaaaa atggccgacg aatgccatgg aagaactaga 660 tttaaacata aaaaccttgc ataagatctt agaaaccctt gaaaaaaatc cttataggaa 720 ttttctagca cacacactac tactcaaaag ggaagtagca caaaaaagtt actcgaacat 780 agaaaaatat cttcttatta acgaagagaa gatcgaatca agtaaactta atttttatat 840 aaaagcttct agaaacactt ttcaaaaaat cgaagacttt attacacttc gattaaatca 900 aaagcaaaca aagcgattgt ccttkaagac agtcgcaaaa atcacaataa ttttccawcg 960 gtcaatcttt gatataagaa gcaaagaaat gacttcaccg ttggaaatga taaaattggt 1020 ctcctccctt gtcgctcctt acgacgggga gtcgagcaaa cttgagtgca cattagatgc 1080 actcaagacc attaaacctc tggtaacgga ggcaaataaa gcagctaccc tgcaatctat 1140 tgtgtccaga ttttctggta gagcgcgtca gttcgtgcca gaaattgaca atgcaacaac 1200 tatagataca attatggcag cactgggtgg gtgcaaaata aaaaccagcc cagaagctgc 1260 cctagtaaca ttaaacgcca ctaaacaaac tggcgatata aaaaccttca ctcgggaaat 1320 cgagaaacac acgtttgctc tcgagcaagc gtatcgtgat gacgatatgc ccggtgatac 1380 kgcaaataaa tatgccagta aagctggcat aaaggcactg gcgaatggtt taaaaaataa 1440 agaagcgaaa accattataa aagccagtgc cggagctgga aaaatattga agctttctga 1500 tgcaattaac atcgcattgg aagaatcaat tgatactcct gccgaaatac tattattcca 1560 acaatctaaa tcaaataaaa attaccgcaa caataatggt aacaataata acacaaacgg 1620 taataattac cgtaatcgta atggtaacaa ccgttatcac ggcaattata acaataatta 1680 taataataat tataacaata ataatcgtaa ccataacaat aataatcgca atcgtaacga 1740 taacagaaac aacaatcgaa ataataacaa taattataac aataacaata attataataa 1800 tagaaataac aataatcagt caaatagagg taatggtcat gggactaaca gaaaccctca 1860 ccaaattttt caactaaata ctgatcccaa caatcaacaa aacgttcaaa ctcagaacgt 1920 taataacaac caaaccgggt caaacagaga aaaccaacat tttttagact agacgcaaag 1980 tccatttttt ccgtaaacaa tgcctcagcg aattttgtat ttatcactgc aggcatggcc 2040 aactcaaaat gcagttttat aattgacaca ggtgctgata tttcaatatt taaagccggt 2100 aaattgaaac ccgagcaaaa agtaaatacc gcaaagcagt acaatttaac aggagttaca 2160 gatggatcaa tacgtacatt ggccgaaact gaaactcaat tgaaatttga taatggattg 2220 atcgtcagtc atatgttcca agtcgtccct agcgattttc ccataataac agatggactc 2280 ctaggacgcg atttctttat taaatacaaa tgttcaatta actatgaaaa ttggttattg 2340 acatttaatt ttcaagagaa caccgttgag gtaccaattg aggataattt aaataactca 2400 atagtagtac cgcccaggtg cgaaataatt agaagaattc cacaacttag tgtcgatgag 2460 gattctgttg tgttttctga ggaaatccaa cgaggattat tctgcggaaa cacaataata 2520 tcccctaaca caagttgtgt gaaattaata aatacaacga attcaccaat tttactgaag 2580 aaatttaaac cgaaaattga accattaagg aattttgaat tattgatcct aaacaaccaa 2640 aacagagttg aagacataat tgaaaaaata aattttgatg aaattccaaa ctatacacac 2700 gaccctttaa aacaattaat cacaaaattt tcggatattt tctgcttacc cgacgaaaaa 2760 ctaaccacaa ataatttcta tgaacaaaaa attaacctca ctgaccctag tccagtatat 2820 atccctaatt ataaaactat acattcacaa aaacccgtaa ttgaggaaca agttaaaaaa 2880 atgataaatg acaaaattat tgaaccatct gtatcgaatt ataattcacc aattttattg 2940 gtgcccaaaa aatcaagtga tggttctaag aaatggagat tagtcgttga tttccgtcag 3000 ttaaataaaa aaatattggc agataaattt ccattgccga gaatagatac gattttagac 3060 cagctaggaa gagctaaata ttttagcacc ctagacttga catctggatt tcaccagatt 3120 cctttagagg aggaatgtag aaaatttact gcattctcca cggatactgg gcacttccaa 3180 tttttaagat taccgtttgg tttgaacatt agcccaaata gttttcaaag gatgatgacc 3240 attgcaatgg caggtctcac tcctgaagta gcttttattt atattgacga tattattgtc 3300 ataggatgct caatgagaca tcacattaaa aatttaacta ctgtttttga aagattgcga 3360 cattacaatt tgaagcttaa tccttcaaaa tgtaaatttt ttaaatctga agtaacttat 3420 ctggggcaca aagtcacaga taaaggaata ctcccagatg attctaaata tgaatcactg 3480 gttaaatatc cagtgccgca gaatgctgac gaagttcgaa gattcgtcgc attttgtaac 3540 tactatagaa aattcgttga aaatttttca acaattgctc accctttaaa ccagttatgt 3600 aaaaagaatt gcaaatttga ctggtctgtt cagtgtcaac tcgccttcga cacactgaaa 3660 cacaaactgc tttcaccaaa aattttacaa taccctgatt ttgccaaaac ttttattctc 3720 acaacggatg cttctaacgt cggttgtgga gcaattttat cccaaatcac tagtgaaggc 3780 gatcgaccga tagcctacgc tagcagaact tttacacctg gtgagaaaaa taagtctgtt 3840 attctcaaag aactaacagc aatccactgg gcaattaact acttcaaacc atatctttac 3900 ggaaataggt ttgtaatcag aactgatcac agaccacttg tacatttata cggaatgaaa 3960 gacccaacat ccaaattgac acaaatgcgt ttggatttag gagaattcaa tttcgatgta 4020 atatatatag ctggtaagga aaacgtgggt gcggacgcgc tctcacgaat agttataaca 4080 agtgaacaac tacaaaatat gcaaatgctc gtagtaaaca cgagatcaat gcagaaaacg 4140 aaaagcatgc aaagcacgca ggttgatact agtatcacct atcaagcggc ttatacagtt 4200 gaaaacatta atgaaacgag aaaattatgc gctttacgtg ttgtcaataa cactttacag 4260 atctataaaa acaaaaaaat aatgatgaaa atcccatttc agccagaaaa tgaaagacca 4320 actttagaga tgactctttc aaggcttgaa aaagaattaa agaaaatgag aatcgatcga 4380 atatctcttt caacggaaga tgaaattttc aaaaaagtaa caacttctac atttaaaatt 4440 gtagctaata aaactttgca aagtctacaa attttaatat acaaaccacc aatgtatgta 4500 gataacaaaa ataaaattca agaaatctta cgaaatttcc acaatacacc gacaggagga 4560 cacgtaggac aaactagaat gtacctacag atccgagacc tgtaccggtg gaaagcaatg 4620 aaacgttcta ttacactttt cgttcaggcc tgcgaactgt gtaaaagaaa taaaattgta 4680 caacatacta aagaaccaca aataataaca acaactccga gtaaaccttt tgaaatcatt 4740 acaatagata cagtaggacc cctaccaaaa accaacaacg gcaacagata tgcaataaca 4800 attcagtgcg aactaactaa atatatagtt cttgcaccaa tacaaaacaa agaagctaat 4860 actatagcaa gagccatagt agataacttc atcttaattt atggaaaatt tttggaaatc 4920 agaaccgacc aaggtaccga atataataac gaagttctta gtcagatttg caaacaattg 4980 aaaataaaac aaacttttgc aactgcctat cacccacaaa ccattggtgc tcttgaaaga 5040 aatcataggt gtttaaacga atacctacga tgtttcgtta acgagcatca atcggattgg 5100 gatgattggt tgaaatatta tcaatttaat tacaacacaa ctccacatac tattcatggg 5160 tacacacctt atgaattagt gttcggtgtt aaagcaaact taccacatag caataatcaa 5220 agaaatctag atccagtcta caatttagaa gcataccaca atgagttgaa atttagatta 5280 caaaagagcc aaaatttaac aaagtcaata ttagaacagc aaaaacaaga aaaaactcaa 5340 atttgtaaca aaataattaa tcctatccac atacaaatag gcgatctggt ttatctaaaa 5400 aatgaaaata ggagaaaact agattcattt tatcaaggtc catttagaat cacacaaatc 5460 agagatccga attgcgaatt agaagaacct ataacaaaaa agaaaacaat agtgcacaaa 5520 aatagaataa tcaaataaac caccgagcct agcatcaaca atgcaggtaa atttttccca 5580 gcaatagact gcggaagtga aaaattcgga tacaaaacaa catacacata cacatacaca 5640 acacaatgta acaaaacaca atgtaacaaa gcaaacacac aacaacaaaa attcataaat 5700 atatcaaata aattgtaaat ggctactgga gaatggcaaa gaataatttt acttcataaa 5760 actattcttt taaaggggga agg 5783 // ID Gypsy-17_IS-I repbase; DNA; INV; 3936 BP. XX AC ABJB010032597; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_IS_; KW Gypsy-17_IS-LTR; Gypsy-17_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-3936 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010032597; Positions 1489 5424. XX CC Positions [3071-3574] - Integrase core CC 'CTGCC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..1660 FT /product="Gypsy-17_IS-I_1p" FT /translation="MEASSNSENVTAPTTLLATGFTAPPAFDFKRPEEWPR FT WITRFNRYRRVSKLDKEDEGSQVDALIYFLGEDAEDILSASALTDTEQRQF FT EAVAQLFESHFVGKKNVIYERAKFHQKKQEPGESVENFMTSLHRLVTHFGY FT GALKEELVRDRFVVGLQDFSLSEKLQSDPNLTLEAAYTRARQKEAVHRQQA FT VVRGESQAEDMTLDALHKRMSAKEKGAKRREYTPKNAENNAARTQTCGRCG FT NSAREKSKCPAAGSICRNCGKKGHWQQVCFKKLRNEGKSTLKEVTLQDPNE FT APNWMIGSIHAAEGSAWKVKLRVNGTPATFKIDSGADVSAMPPNTVPSKVT FT LEETAARLYGANRQQLEVLDKFQATIQHNNKSIQHVIYVVNGLEEPLLGRK FT ASSELELIQLLCTVSGQTAKRYHEQYPELFQGLGEMALEYKIKLSKDATPV FT SVHTSRRVPLALMTKVQEQLASMQKDRIIRKVEEPTPWCSAMVVVPKPNGE FT VRICVDLTHLNKNIEREKLVLPSVEETLGRLSRTSLYTKLDAKSGFWQIR" FT CDS 2888..3913 FT /product="Gypsy-17_IS-I_2p" FT /translation="MRKQVLHRLHDGHQGISRCRQRAKMSVWWPGLSSQMA FT DVVTNCLSCAHERVPKTQPMIVSELPERPWQRVGTDLFELKGAQYLLVVDY FT YSRYVEITRLKQTTSEAVISHLKALFSRHGIPEVLRSDNGPQFSSKPFEGF FT CKDYKIEHVTSSPRYPQSNGEAERMVKIAKDIIKKSEDPELGLLVYRTTPG FT PSGYSPAQLLMGRQLRTTLPVLASTLLPRLPNHEEFRKKDREEKNKQAENY FT DKRHGVCRKQELDIGSEVWVRDSKATGIITGKRPEPRSYTIRTRSGYLRRN FT THPLVPTKPKPTDQTHPEASTPEFSVPPTEENIPKTRSGRTIRKPKRFGE" XX SQ Sequence 3936 BP; 1240 A; 972 C; 974 G; 750 T; 0 other; tggtgtcaga agtgctccgg aaatggaagc ctcatcgaac tcggagaacg taacggcccc 60 aacaacactg ctagcaacgg gtttcacggc gccgccagcc tttgatttca agcgcccaga 120 agaatggcca aggtggatca cccgttttaa caggtaccgt agagtttcga aactcgataa 180 agaagacgaa ggatcgcaag tagacgcatt aatctacttt ctcggtgagg atgcggaaga 240 cattttgtcc gcctcggctt tgaccgacac cgaacagcga cagttcgaag cagttgccca 300 actcttcgaa agccacttcg tcggcaagaa aaacgttatc tacgaacggg ccaaattcca 360 ccagaagaaa caagagccgg gtgaatctgt tgagaacttc atgactagcc tgcatcgcct 420 agtaacacat ttcggttacg gcgcgttgaa agaagaactc gtccgcgatc gttttgtcgt 480 tggattgcaa gatttttcgc tctcagaaaa acttcaaagc gacccaaatc tgacactgga 540 agctgcgtac actcgagctc gacagaaaga agccgtgcat cgacagcaag ccgtcgtcag 600 aggcgagtcc caagcagaag acatgacgct ggatgccctc cacaaaagaa tgtcagcaaa 660 agagaaaggc gcaaagcggc gagaatacac gccgaaaaac gctgaaaata atgcggctag 720 aacacagacg tgtggaagat gtggcaacag tgcacgcgaa aagtccaagt gccccgcagc 780 gggatcaatc tgtcgcaact gcggaaaaaa aggtcactgg caacaagttt gctttaagaa 840 gcttaggaat gagggcaaga gcaccctaaa agaggtgact ctacaggacc caaatgaagc 900 gccaaactgg atgattggaa gcatccacgc agctgaagga agtgcgtgga aagtcaaact 960 gcgagtcaac ggaacgcctg ctacgttcaa gatagactct ggtgcagacg tcagtgccat 1020 gcctccaaac actgtcccaa gcaaggtgac ccttgaggag acagctgcca ggctctacgg 1080 tgcgaacagg cagcaactcg aagtgttgga caagtttcaa gcgacaatac aacataacaa 1140 taagtctata cagcacgtga tctacgtagt gaacggcctg gaagagcctc ttttgggaag 1200 aaaagccagc agcgaattgg agctgatcca acttctgtgc acagtctcgg gccaaacagc 1260 caagaggtac catgaacaat accctgagtt gttccaaggt ctcggagaaa tggcattaga 1320 gtacaaaatc aaactctcga aagacgccac gccagtctcc gtccacacat cacggcgtgt 1380 ccccctggca ctgatgacga aagtacaaga gcagctggca agcatgcaga aagatcgaat 1440 cataaggaaa gttgaagagc ctacaccatg gtgttctgca atggttgttg tgcccaagcc 1500 gaacggagaa gtgcggatct gcgtggattt gacgcacctc aacaagaaca tagaacgcga 1560 aaagctagtt ttgccctctg tcgaagaaac actgggacgg ttgtcacgca catcactata 1620 cacaaaacta gacgccaagt ccggattttg gcaaattcgg taggcagaag agtcacaaac 1680 tatgactacc tttatcacac cttttggcag attttgtttt caacgattgc cgtttggaat 1740 ttcttccgcc ccagagtatt ttcaacgcag gatgaacgag atcctggaag gaatcccagg 1800 agtcctatgc catatggacg acatcctaat cactggtgcc aacgagaaag agcatgatga 1860 gaggctagag gaagtcctta agaggttgaa agcagcaggt gtccgactca acgacaaatg 1920 cgagttcaag caaaagtcca taaagttctt gggacacatc gtcagccagg aagggatcaa 1980 gccagaccca caaagggtca cagccatctc caatatgcca gcgccaacga cagtgaccga 2040 ggtgagacaa ttcatcggaa tggcaaacca agtaggacgc ttcatccccc atctttccga 2100 aaaattgaag cctctgcgtg accttctcaa gaaggcaagc gagtggacgt gggacaccgc 2160 acagaaaata gcattcaatg cggtgaagaa ggacatccag gaggcaacag ccctaacact 2220 gtatgatccg caacaagaaa ttatttttca gcagatgctt cgtcttacgg tttgggggca 2280 gtgctactac aaaaggacaa gaacggaaac attcgaccag tagccttcgc ttcgaaagga 2340 ctgagtacca cagaaacaaa ttacgatcag attgagaaag aagcgtacgc cgtgacatgt 2400 gcatgtgaac gattcgagaa gttcctgcta ggaacgcagt ttcacatcca aacagatcat 2460 cgcccactcg taccccttct tgggtcaaaa gaactggatg cgattccagc cagagttcag 2520 cgattcaaac tgcggctgat gagattctgc tacacaatct cacacgtgcc tggaaaagaa 2580 ctctggacgg ccgatgccct gtcaaggtca ccttttgaac ctccagacaa tatggaagaa 2640 gaaaccgagg cttttgtgca agtcgtcctt cagtcagtcc ctgcatcccg agagtacctg 2700 atcgaggtaa agaagatgca agaggaggac cccgtctgta ctaaagtgag agagtactgt 2760 gaaaaagggt ggccaactag caagcagcaa gtgccaatcg agatcacccc atatttccaa 2820 gaaagacaac ctcacggtac aagaaaacct actcgtgaga ggcaaccgcc ttgtgatacc 2880 aagtcacatg aggaagcaag tcttgcaccg actgcacgat ggccatcagg gtatctcgag 2940 atgccggcaa cgagccaaga tgtcagtttg gtggccagga ctaagctctc agatggctga 3000 tgttgtcaca aactgtctca gctgcgcaca cgagcgggta ccaaaaacac agccgatgat 3060 cgtgtctgag ctaccagaga gaccatggca acgagttggg accgatttgt tcgagttgaa 3120 aggagcccaa tatttgctcg tcgtagacta ctactcgagg tacgtggaaa taaccagact 3180 gaaacaaacc acttcagaag cagtcatatc acatctcaag gctctatttt cacgtcacgg 3240 aatacctgag gttctgcggt cagacaacgg tccacaattc agctccaagc cattcgaagg 3300 cttctgtaag gattacaaga tcgagcacgt tacaagtagt ccacgttatc cgcaaagtaa 3360 cggggaagct gaaagaatgg tcaagatagc caaagacatc atcaagaagt ctgaagatcc 3420 cgagcttgga ttactggtct acagaacaac accaggacct tctggataca gtccggcaca 3480 gcttctgatg ggaaggcagt tgcgaaccac actgccagta ttggcttcta cactcttgcc 3540 aagactgccc aaccatgagg agttccggaa aaaggatcga gaagaaaaaa ataagcaagc 3600 ggaaaactac gacaagagac acggagtgtg tcgaaaacag gagttagata tcggcagcga 3660 ggtatgggta agagacagca aagcaactgg aataattacg ggcaagagac cagaacctcg 3720 atcctacact atcaggacaa ggtccggata tctccgccgg aatacacatc ccctggttcc 3780 aaccaaaccc aaaccaaccg accaaacaca tcctgaagca agcacaccag aattcagtgt 3840 tcctccgaca gaagaaaaca tcccaaagac ccggtccgga cggaccataa gaaaaccaaa 3900 aagatttggc gaatgacttt tgcgtggaaa aagaga 3936 // ID Gypsy-243_AA-I repbase; DNA; INV; 5473 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-243_AA_; KW Gypsy-243_AA-LTR; Gypsy-243_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5473 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1087-1087 (2011). XX DR [1] (Consensus) XX CC Positions [3098-3577] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2174..3685 FT /product="Gypsy-243_AA-I_1p" FT /translation="MGLMGFYQKFIPRYSHLTAPITDLLKKSKKFKWSEDA FT EKALEGLKSVLTSAPVLANPDYSRPFIIETDASQLAVGAALLQEFDDGKRI FT IGYYSKKLSSTQRKYAATEKECLAVLMAVENFRHYIEGTTFTVITDCKSIT FT WLFSITAASANSRLLRWALKLQSYDFVLRYRKGKDNILADCLSRIEALKII FT DKDYSQIVDDILKSPHDYKNFKVAGNKIYKYVEEPGKLKDKRFDWKYFPPK FT AERLSLIESVHNTAHLGYEKTLSALKEKYFWPKMSKEVKEYCKSCLACKTS FT KGINVNPTPTMGSQKKYCDYPWQFLTLDYVGPFPTSGKGRCTCLLVVTDVF FT SKFIMVQPFRQATASSLVLFLEQTVFLLFGVPEIILTDNGSQFTSKEFASL FT LRQYGVKHWLTPSYHPQVNNTERVNKVVTTAIRATLRGNHKHWTDNLQQIA FT CAIRNSVHDSTKYSPYFVTFGRNMISNGAEYERMRNSNGIANSALNDNERK FT ELYKTIRK" XX SQ Sequence 5473 BP; 1509 A; 1029 C; 1203 G; 1635 T; 97 other; ttttcgtttt agccatgttc gtttaccgaa ggcacgaact ctcgccatat caaattagtt 60 tgggcgcaat ttttgaaaga ataggctctt tatgtgagca ggtttcttgt ctgctccaaa 120 gaagtagact cgttcatttc gcttcttaaa cggctggagt tcagaagttc gtagttcaat 180 tttaaagcta atttatttaa agtgtgttag attcgttgta actaaaaggt attgaaagtg 240 attgaagtga tttttcgttt gtactttgtt ctaattttgt tctatttgtt tgttctataa 300 ggtagtgaaa cgtttcggag tcggtgtgtc cgacaaacgc gaactgttct gttaaattta 360 taattgcgac tcaaaaaggt aatttcaatg aactcgattt tctaaattta gtttttaaaa 420 taaatttcgc gttggacttt ctagcgccct gcgctttttg ggtacaggct ggagtttttc 480 gagtgtcgac mgaagtcgaa tacggtcccc gtggcgtttc gggccaccat ttccgtccga 540 ggacgttgtt tcgagtccgt gcaaccgcca aggcccgccc ttcggggtag tcaaacggag 600 cgtggtgttt cgcctttcaa aaaggagcga aaattccccg acgatcgcca attaatcgtc 660 gtagacggtc caccaacgtt acaaccccgg accaagctaa acgagagcgc ggatcgtgaa 720 ctgcgtcgcc aattgcgcat cacgttccgg atcagaacct aaggggagga gttcatttca 780 tcacgtacgg tcagagcgcc atacgcgagg ccgtacggag gtacgttggc tctaaaccgg 840 aggaagaaga tcaaccccgg gtcgcacgac actccgtcgt agccaaagct ctctggagaa 900 aatcggaggc catccgtcca gccgccaccc gcttcgcaac tttagttcgg ccatacttgc 960 acgggctcat cgcgctgcac gccattcctg cagcgacggc gaaacgatcc ccttcgccgc 1020 catcgttacg aaggttctcg ccgccatcgc cgaggttcct ggctgtttcg agtcgtcgcc 1080 gagattcaac aagtcgacga agaaaaaccc tgcgcaggta tgtggtagtt gtagacatgg 1140 ccatgctcta gagtgtttcc ccaatccaaa cccaacccaa accgtccaac ccgaaaccga 1200 ataaattgtt gaatttgtaa ttaatagaaa atcgagtagt ttttggttta gttgagttcc 1260 gtttagtgca attgtgagga aggtcacttc aattgcgagt tttgtttcag gttgctgctt 1320 cgattttcat taagaggttt ttaagttctt cagttcagtg ggttgaacga gtctaaggcg 1380 ggattgagcg tgttgtggga gacatttcag ctttcagaga ttgcgcttct ttgtttcgaa 1440 tttgagccag cgaccgttcg agtacccctg aggtaaaagt cgaaatagtc gggcaaatca 1500 aatcgaccgg cggtctctcc gattgcgaga gtggcgctta aatcgccggt taactatcct 1560 cgaatttgtc gaacggttac agtggcgccc aacgtcaaaa gtttacgaaa cttwaattca 1620 attggattaa ttttgaatag tttcagttaa tcttgagtgc ttttatttgg attaagtgaa 1680 attgaattga attttgaaat tgttttaact ttaactttct agtgtttaag gtttgaattc 1740 gcttaagtta atttgaattg aatttgattg atttttaatt gtttagtttt atcttattga 1800 atttttaata cattgaattt tgaatttgat taggttaagt aattatgagt tgaatttttt 1860 ctttttattt ttgaattgaa ttggattttt agaagttgcc tccgtgcttg aatttgattt 1920 ttctaggata aaggaatagt tagctctctc atattttttc tttagactga atcttggatt 1980 gaattaaatt tttgaattga atttataata ttttgtcacg tctttgagtt ctgaattttt 2040 gaaattctgt agaaagcaag ttaagtatct agggtacttg ttaacagaaa agggtctatc 2100 aatcgatagt gctaaattag agcctatcct caattatcct cgtccaaaaa ccatcagaga 2160 tgtaagaagg ttgatgggtc taatgggctt ctaccagaaa tttataccaa ggtacagcca 2220 tttgactgcc ccaatcacag atttacttaa gaaatctaag aaatttaaat ggtcagaaga 2280 tgcggagaag gcgttagaag ggcttaaatc ggttttgact tcagctccag ttttggcaaa 2340 tcccgattat tcccgtccct ttattataga gacagatgct tcacaattgg cagtcggagc 2400 agcgttattg caagagtttg atgacggtaa acgtattatt ggatactaca gtaagaaact 2460 gtcaagcact caaaggaaat atgcagctac tgagaaggag tgcttagccg ttcttatggc 2520 tgttgagaat ttcaggcatt atatagaggg aacaacattt actgtgatta ccgactgcaa 2580 aagcataact tggctgtttt caatcacagc tgcaagcgct aattcgcgat tattacgttg 2640 ggctctsaaa ttgcagtctt acgattttgt gctaagatac aggaaaggaa aagataacat 2700 tttagctgac tgtttgtcac gtatagaagc attaaaaatt attgacaaag attactcgca 2760 aatagttgac gatattttaa aaagtcctca cgactacaag aattttaaag ttgcgggtaa 2820 taagatttat aagtatgtag aggaacctgg aaagctgaaa gacaaacgtt tcgactggaa 2880 atattttcca ccaaaggcag agcgcttatc actgatcgag tcagtccata atacggcaca 2940 tttgggatat gaaaagactc tcagtgcgct taaagaaaaa tatttctggc caaaaatgtc 3000 caaagaagta aaggagtact gtaagagctg tttagcctgt aaaacatcaa agggaataaa 3060 tgtgaaccca accccaacaa tgggatctca gaaaaaatac tgcgattacc catggcagtt 3120 cttaacccta gactatgttg gcccttttcc cacttcaggg aagggaagat gcacctgttt 3180 acttgtagta acagatgtgt tctctaagtt catcatggtc caaccattta ggcaggccac 3240 ggcttcatca ttagtactgt ttttagaaca gaccgtattt ttactttttg gagtaccgga 3300 gattatattg acagataacg gtagtcaatt tacctcaaaa gagtttgcct cacttctcag 3360 gcaatacggg gtgaagcatt ggctgacacc ctcataccac ccacaagtca ataacaccga 3420 gagggtgaat aaagtagtca ccacagcaat aagggctacg cttcgaggca atcacaaaca 3480 ttggaccgac aatttgcagc aaatagcttg tgcaattagg aattcggtcc acgattctac 3540 caaatattcc ccatatttcg taacatttgg tagaaatatg atttcaaatg gagctgagta 3600 tgagagaatg agaaattcga atggaattgc aaattcagct ctcaacgata atgaacggaa 3660 agaattatac aaaacaatta ggaaaaawtt agccgaagct tatgaaaaac aagcaaaata 3720 ttataattta cgttcaaata aacgcgcacc cacttaccaa gtaggagaaa aggtgctaaa 3780 gaaaaataca gttttatcta acaaatcaaa agatttctgt gcaaaacttg atgcaaaata 3840 tgttgtagca tatgtaaata gagttttagg tgatagttat gaactcatag atgagaaagg 3900 caattcctta ggcatatttc atgccagttt tttaaagaaa ttctagtcaa ttcgagaaca 3960 agctatgaca ttttcccgtt aaggaaaatg tacaaaaatc atcactagga tgaaaggaaa 4020 atgcgtcgca taaatgggtt ggttagcgaa aattctctta cctaatgtta gtttacaatc 4080 tgacctagcc ktacaatgtt tgcaaatcta ctwaatctat attagtttta atcctaggtt 4140 agtggaaagm taatcggaac cgactcaccc gktgatmcgg tcctamcgcg gtggamacgk 4200 ccakgggwwc actccctsgc cgkgagggwc aggtccctts gtcgswcgck kcccttmgtc 4260 ggtmggccga tgtagwcatm cgtacgwgmw gctcackcgt cgatagkcgm cgcstttcgw 4320 ctcccgtcsg attccgakta cttgtcwscw gtmgctssta cmcwcggagc gkcasgmmtm 4380 mcgtccktcg wttccgacgt aktgscagka aattgccggc gtcgggwgcw ccgtggtcgt 4440 cgwtcgkgga mgkktcagsa ccggtwwttg tcggawgcmg tcctmatcaa amatkccwgg 4500 atcgsgtast cgaccwgctc mgttagcgat ggttaccgct kggacgaswc cgcgtggtcs 4560 ttcacgtgtk wmttttccma awagttttcc cgaccggtgt cggtattgga taatcccacg 4620 acgagtttcc tgaaaataac gaagaaaaga ttaatctgat gcatawttag tataawtacc 4680 tmatactccc ccggtttatc gacgattaag gatgtttttt ccggtcccac gacgtcctcc 4740 agcagcttgt ccgacacttt ctaacaggaa actgaacttk gtgttmgttt tcctgttgtt 4800 ttcgttcwtg tttatcattg acactgacag tgcgatgttt ttcttcgcag atgtcactcg 4860 cagtgacagc gcgatgtttt tcgcagttgt cagtgagcgc ttgacaggca gtgagttgca 4920 tgcaagttcg tatgctgtca gtttsstttg aamgcaaata gggttgcgcg ctatgccgtg 4980 tatgagttct agactggtca gaaagaattt ctgattctag aattcagttt ttagtttcgc 5040 ttgtacggat gtcaaattgc tgwgtcagac gaagtgttct gatgaatttg attatccagk 5100 ttwcaatttt cgtttacaga tatcagtagt aagagtcaga cgaagcgttc tgatagtaac 5160 tgattatcta gttttgagtt tgtgttgact gcggttgggt caaatacgat ttgatgatgc 5220 agttaaccag tttagttaga gtaacgattg gcgtaattga tggacaagtt ccattgtgct 5280 gatcgtaaga tgcagtttag ctgagttagt ggtagtaagc gctctgtttg tgatcctaga 5340 aactggtgag atgattatta atgttcaact gcaggaggta atttctatta atgttttcac 5400 gtagatcatt ttgaaagtta tgaaaatttg atcagtgcgc ctgatcaaat tttcataaaa 5460 cctggtggtg tga 5473 // ID Proto2-1_BM repbase; DNA; INV; 1110 BP. XX AC . XX DT 29-APR-2010 (Rel. 15.07, Created) DT 29-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1110 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1048-1048 (2010). XX DR [1] (Consensus) XX CC ~99% identical to consensus. XX FH Key Location/Qualifiers FT CDS 17..1006 FT /product="Proto2-1_BM_1p" FT /translation="GTYATPVYACFLDLSKAFDLVSYDLLWKKLENIKLPT FT EINNILKFWYGNQINNVRWAGALSLPYRLECGVRQGGLSSPTLFNLYMNEL FT IGELSRTRIGCFIDGVCVNNISYADDMVLLSASICGLRQLVSLCEGYAKSH FT GLVYNCKKSEIMVFETRGGTHDKIPPVVLNGTPLTRVYKFKYLGHVLTPDL FT KDDEDIERERRALSVRANMIARRFARCSLKVKVTLFRAYCTNFYTCSLWAG FT YTQRSYNALRVQYNNAFRVLVVLPRFCSASGMFADAHIDCFYATMRKRCAS FT LVNRVRASSNSILSMIASRLDCIYMGRCGAISHGMLPQ" XX SQ Sequence 1110 BP; 292 A; 221 C; 282 G; 315 T; 0 other; gtgtcatgtt ctatgaggca cgtacgcgac acctgtttat gcctgctttc ttgatctgtc 60 caaggcgttt gatttggttt cctatgacct cttgtggaaa aagttggaaa atattaaact 120 acccacggaa ataaataaca ttctaaagtt ttggtatggg aaccagatca acaatgtgcg 180 atgggctggg gctttgtctc ttccgtatag gttggagtgc ggggtgagac aaggcgggtt 240 gagctcgcct acgctcttca acctgtatat gaacgagctg atcggcgagc tcagtaggac 300 caggattggt tgtttcatag acggggtgtg cgtaaacaac atcagctacg cagacgatat 360 ggtgctgctg agcgcgtcta tctgtggcct aaggcaactt gtttctttgt gtgagggata 420 cgccaaatca cacggtttgg tgtacaactg taaaaagtcg gagatcatgg tctttgagac 480 caggggtggg acgcatgata aaatacctcc cgtagtactc aacggaacgc cattgacaag 540 agtttataaa tttaaatatc tcggccatgt gttgacccct gatctcaaag atgatgaaga 600 tattgaacga gagcggagag cgttgtcggt tagagcaaac atgatagctc gcaggtttgc 660 acggtgttca cttaaagtca aggtcaccct ctttagggcc tactgtacta acttttacac 720 gtgcagcttg tgggctggat atactcaaag atcgtacaac gctctccgtg ttcaatacaa 780 taacgcgttt agggtgctgg tggtgctgcc ccgcttttgt agcgcatcag ggatgtttgc 840 agacgcgcac atagattgtt tttacgctac catgcgtaag aggtgtgcat ctctggtaaa 900 cagagtgagg gccagctcca acagtatcct gagcatgatc gcaagcaggc tggactgtat 960 atacatgggt cgctgtggtg ccatctccca tgggatgtta ccacagtaat tcattgtgat 1020 gtaaatataa ctactaacat aggtctaagt cttattaaca atttatatga gtcatggata 1080 cttgaaataa aaacattatt attattatta 1110 // ID BEL-70_CQ-LTR repbase; DNA; INV; 265 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-70_CQ_; KW BEL-70_CQ-I; BEL-70_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-265 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX SQ Sequence 265 BP; 69 A; 59 C; 38 G; 98 T; 1 other; tgttaaagat aagaacccta ttgctaactg ttatatttaa gattctctcg cggcacaatt 60 gcgtttactc tttccgctca gatttatctc tcgatttcgc tatatattgc ccagcaaacc 120 acattcagtc ctcttagtac gaaatatatc gctataccag acggagatca tctccatttt 180 cgttttcgtc gggaataaaa ttttagtagt tttttwtact ctacattcct cgcgtgtgat 240 ttttagttta atcctaatcc gaaca 265 // ID DNA8-55_AP repbase; DNA; INV; 327 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-55_AP. XX NM DNA8-55_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-327 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1988-1988 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 327 BP; 102 A; 75 C; 57 G; 93 T; 0 other; cagtggtttt caaccttttt gggtccacgg ctccttttga ttttgaaatc aaatatacgg 60 ctcccatacg caaatttaaa acaaacaatt aatttacatt attcaatcca actttattca 120 atgacgaaat ctgtcgatcg gccgatctat ctctcacata cccaagtgcg tacaacatgt 180 atagggcgtg cggacttaca cggcagcgaa aagtagataa caaaaactat aaattattac 240 aatttttttt tagttttcga acgcggctca tttgataata accgacgacg cccctgggag 300 ccgcgacgca caggttgaaa accactg 327 // ID hAT-5_BF repbase; DNA; INV; 2115 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-5_BF autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2115 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2115 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 926-926 (2008). XX DR [2] (Consensus) XX CC The transposon is incomplete: it contains only the CC transposase-encoding region. XX SQ Sequence 2115 BP; 595 A; 535 C; 536 G; 449 T; 0 other; atgaacaggt actgccagag ggaatggtta gagacattcg agttcgtgtc ctactctaca 60 gaacaggatg gtctgtactg tctcccctgt gttttgtttc ccacctcttc aacgcgtgaa 120 caacccaccc ttctagtgaa gcggccgttt agaaactgga aggacgctaa aaacgatctg 180 caagtccatg gactcttgga gaaccaccga aactcagaag cgaggatgaa ggcgtttaag 240 gacacaatgc agaatccagg gcgtcgtatt gacctgtcca tgacatcgga gtcgattaat 300 cgcacaaaga agaaccgcgc ttttctcctt tcagtcatca agtgcctgga ggtatgtggc 360 cggcaaggat tcgcccttcg cggacaccgt gacgatagta cagcggaccc cttatctaac 420 aaagggaatt tccacgcatt actacagtta agggtggatg ctggagacgt ggcacttgca 480 gagcacctgg agacatgcgc acgcaatgca acctacattt caaaaacgtc acagaatcag 540 ctcctagaat gtattaaaca atacattcta gagacaattg taaaggacat aactagccaa 600 cctttcggtc cacactatgg cattatggcg gacgaggtta ctgacgtaag caactgggag 660 cagttgggcc tactagttcg ctacacaaaa gatcaaaaac ctgtggaaag gttgctcatg 720 ttttcagagt gtgagaagat aacaggaaga gccctgtgcg atacgatcgc tgctgacctc 780 gccggggtga acctggaccc agcaaactgc agggcgcagt gtttcgatgg tgctgggaac 840 atggcgggag tacggaatgg ctgcgctgca aacttcaagg agatcgccac cagagcaccg 900 tacttccact gtgcaagcca tgatctcaac ctggcccttt gcaaagcctg caaggtacag 960 gatatccact gcatgatgga aactctgaag gcagtgggca tcttcttcaa gtactcaccc 1020 aaaagaacac gagagctgga aaaggccgtt gagaatgaga acgaagaaag gcgtgccgcc 1080 ggccgtacac aactcctgaa aaccaaagtc aagccgatgt gcgagacacg ctgggtagag 1140 aagcacacct gcctggagga cttccacgaa cttcacgagc ccctcacgga ttgtctggaa 1200 gcaatcacat cagaggaggg atgggacgca aaagcacgca cggaagcagg tgggttattg 1260 tctcaactga agaggcctgc gttcctgtgt gccttcgagt gtgctctgta catctttggc 1320 tacacaaaat ctctcagtac tcttctacaa ggaagcacga tggacataat caaagcgtac 1380 gaacaagtgg accttgttcg caaagagttg cggtccatca gagagagagc tgatgaggag 1440 ttccatcaag tgtacctaag tgcaagcgag atggcgcaga ggcttggcca agacatggaa 1500 gtgccgcgtc ggtgtagacg acagacccaa aggtgcaatg ttgaggccga cacgcctgaa 1560 gtttacttca gaaggtcaat ctttgtgcct tttgtggacg gtatggtgga gcagcttaca 1620 tcacgcttcc cacagctaac agcccatgct tgtcgcgctc ttatgctcat cccgtcaaac 1680 gtacacagcc ttcaacaaga ccacatccag cagctacgcg aatactatga accggaccta 1740 ccatctccgc tatctttaca acaggagctt cgtctgtgga aggcacagtg gcaaggtgta 1800 attgttactg aaaagccaga ctcacttgac gcaaccatca gccatcaata cagcaaccca 1860 gcaatcttcc ccaatgtgtg tatgatgttg caactcctga tgctgacgcc agtgaccact 1920 gctggagtag aaagagcaaa ttccgcactc aggtacatca aggatgtgta caggagcacc 1980 atgggtgagg atcgcctcaa cgcgctgata ctcctctttg tccacaggga cattccgctt 2040 gattataaca gaatcataga tatctacgct acagggcatc cacgcagaat gatgttcatc 2100 aacccactag cagag 2115 // ID Gypsy18-I_Dya repbase; DNA; INV; 7092 BP. XX AC chrU; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy18_Dya; KW Gypsy18-LTR_Dya; Gypsy18-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-7092 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1109-1109 (2009). XX DR Genome; chrU; Positions 4988903 4995994. XX CC Positions [6139-6648] - Integrase core CC LTRs are 89% similar to each other. XX FH Key Location/Qualifiers FT CDS 3772..5433 FT /product="Gypsy18-I_Dya_2p" FT /translation="MYLLRSKKPARACHKGTQTDGFEIDDDKLNDFNQTSS FT SVSTTFEVDQVQQKLEVTDTSRSKVRREKRKKLRKEERKLLLSAMVGQLRD FT VRPYAQVQMLGKTVVGLLDTGASVSCICGSLATELPGLNILCQKFTADVRT FT ADGQSQKITGRVTTEVSIRNENKKITFYVVPSLAGDLYLGIDFWKHFQLLP FT EAFCGNNQVVAALEELSARELTQSQQKTLLQVVQLFPSYALKGLGRTELIT FT HSIDVAQAKPVKQRHYAVSPATEKLMYAELDRMLKLGVIEESSSAWSSPVV FT LVQKPGKVRLCIDSRKLNEVTVKDAYPMPLINGILSRLPKAEFITTLDLKD FT AFWQIPLDQSSKDKTARPLYQFVVMPFGLSNSPQTMCRLMNKVIPADLRNE FT VFVYLDDLLIVSDTFERHIEVLQEVASYLQKAKLTINVEKSNFCIKEVKYR FT HVIGNGTVGTDPDKISAIVEFPSPRSMKQLRRFLGMTGLYNKFIKNFAALA FT SPITDTLKNKRRFVWTKEAQEAFEGLKEKMCTAPVLHSPNFDAPFSIHCDA FT IQELVQS" XX SQ Sequence 7092 BP; 2217 A; 1464 C; 1527 G; 1884 T; 0 other; acccctagag ggtaagtttt gtgcgtgtgt gaagaatatt atatttcaaa aacaaaaaat 60 caccattgag gccagtgaga ccagtgaggg aagcataaat tgccgaccag gaggatttct 120 ggatttcatc agtcagcagc gccaccgtta agcgcaaaaa catgatttaa ttgaattcac 180 acctgaccga ctgcttgtta agcagatcgg caaccagaca agcagatcag cataagttat 240 tccgatagaa aaaataaatt ccgccttatg ctaggaaatt aatttaagtt atgttgtaag 300 agaaatagta ttcatttagg gcaattcctt tcttttcttt tctttccttc tccctttatc 360 ccattttcct tcctcttccc acataagcaa aagcccgcct aaactggcgt gagcacaaga 420 atgaagtgaa aagagaagaa agagtaccgc agtaggagag cggctcaaaa gagcagatca 480 aaaagaaata gctctcttag gattaaaagg cggtgtaagc tgatttctct ttcacaggca 540 accacttgaa gcgcttcatc gtgggaaatg cagagtgagt aatataaatg tttttattta 600 aaggagaaag cagctgccgg aagaaaaatc aaataattag ctactgcggc gctacttgtt 660 tccaagtcca ttgtttacca ataacaaatt caatggaaat aaaaattgct catgtttaaa 720 atttaccctt gccccaatac tcgcttttct aaaatcttaa aagaataaaa ataagaaaaa 780 tttaaaaatt aaaatacaca tcccatatac cctacttttc ctattcttaa tgccctgccg 840 agcgtaagta ttcctttttt taagcaaaca aaaaacttta catttaattt aaatctagat 900 gtttaaaatg tgtaggttag ggtaatcatt taggatatgt tgtctaaacc tgttccaatt 960 aaaacattgt cagtttttaa gtttattcca tgctaaaaat taaattttat aatgtttaaa 1020 tccgagtaaa tctctctagg ggtgggtgcc atgcggcgtc ggaaggtgta tgtcgtgtaa 1080 aatgtagcat acttgtaggg atcaccggtt atgtcaaatg gttagtggtg gtcaaggtag 1140 ggtgggcgtc gacagcttaa cagctggtgt cgtacgagtg tgctatgtcc ttgtttgtcc 1200 ttagtcctgt cgtccgatag tcctttagtg tttcttgtgg agatcaattc ctaagcacgc 1260 gaaataaata agtgtgtgat aggagacgag caagtagtcg cgatgtcaaa tattcccccc 1320 ctttactcta ccgagaatta gtagccggaa ccagcgcgtg cttgatcgcc caactgcaat 1380 ttaccatctg ctcgggaaag ttatactcgt agttcgctac agtatactta atatttatca 1440 taatcttatt ttggcgccca acgtggggcc cgattctgtc atggtgacag atcgatcaaa 1500 ccgtcaataa catctgacga tcgtccccaa gatctccaca tgttcatcaa caaaagtatg 1560 tggtcaaagc aaaaaaaaat cgcgatctag aaatccgaaa attgcttaaa tcaggcagca 1620 ggaaaagtat tagtcatttg gctcttacgc cgctactgaa tcagaacaca tattgtttta 1680 ttactattat aatattaaat tgtcctatta tctgctggaa gagtttcagc tatttagctt 1740 taacgccgcg tcataattag tgcatttaat gttttaataa tttattttgt tatttttatt 1800 cctttttaat tttgttttgt tcgttagtct tttgtataaa attaaaaaaa ttaaatgtat 1860 aaattgtaca agccaaatct tgtcttttgc cttttgtgtt ttgtataatt tgtcatatat 1920 taatgaaatt caaaaggtta atgtacatag gatgttaaac ataggactag tgaaaagttc 1980 actaggattt caaagaatca aggaaccact tgttgccttt ggccttgtgt tgagccaaga 2040 tatttgtttt ctatgtctaa atacaacatc cttcaccttc cattctagta attttctctt 2100 ttaaatcagt agtgtcaata gcgtacattt aaaccacgtc aggactttag taacaaggaa 2160 ttttacacaa cgatacctga aaacgattgc atggcacccc gatttttatt tgtttattta 2220 ttttctttac cttatttttg tgttctcaat taattattta ataggataat tttttttatt 2280 aatagtccac attatttatt tatttatttt attatatgtt cttgagtgta tagtagaggt 2340 cagagtaaag cataggttca ttagagatgc cactagcaca cagcaccgaa gacttatcta 2400 gaattgtcga aaatatatgc cacatctgcg ctagaactat ttcttatcct agccaaattt 2460 taaccaccag ttgcggacac gaatttcacc gaacttgctt tactgcaaaa tcggggggtc 2520 tgctctagtt ccctttcatt accgtccacc gaaaacttgt ccggggacct agaaaaccaa 2580 ccctgagggt caaaactcaa aactccaaga gcacaaccga acgttctgac taggtccaaa 2640 accagtcaag aaaggctaca acaaaaaacg acaatgtcca acttaaaaaa ccgagaacag 2700 ccttcacctc cgactgaacc cacaaccgat tttcagacag caatggctga aagcataaaa 2760 tcggctatct cgggaaccat tacggcagcc ttggtccaac aaacaaaatg ttttgcagac 2820 gctttgggcc aacacacaaa taatctgacc caagcgatca ctgcgggctt tcaagctcag 2880 gcatcaagcc tttcgtctaa tcaagtattc ctagacccat tatacgaccg atctaaagga 2940 gttccacaag gctcaccaaa tataggtcca acccgtagcg ttaaccagtc acctgcatca 3000 gcaatgacct ccttaagacc ggataagatc gcccagataa tttttaactg gaaactacgt 3060 ttctcaggaa aaggtccaat ggctgtcgag gactttattt accgagtcga agctttagcc 3120 ggtcaaacat tggacggtaa tcttcaactc gtggctcaaa atgccagtac gttattcgag 3180 ggtattgcga gcgagtgcta ttggcgttac cacaaaagtg taaccgttga gcgctggtat 3240 gaattatgcg ccgcactcaa agggcaattt aaggacgacc gaactgacag aaatatcagg 3300 gccgagatag accagagaaa gcaacgcgga agcgaatcat tcgatgattt ctaccgggta 3360 atcgcaactc ttgcggatag actatgtcag aaggagctat ggttgaaacc ttacgagcca 3420 atctccaacc cgatattcag cacgaaattt tatacgagcg tgccgatacc gttcccgaat 3480 gaagagggtt ggttaggact cgggaaatct taatgcaaag cgtagggaag cctcaggggc 3540 tcaggggcca caacgccagg tacccaggcg acaggtgtgc gaggtccaat gccagatcga 3600 gtcagaatcg gaactcgaag atctagagga agacctagag ttagcggcca tggaattgca 3660 gtgctggaac tgccaggaaa agggtcacag gtatcaggag tgcaccgcag acagacgtgt 3720 cttctgttat tgttgcggta aagcagatac gtataagccg tcatgcagta aatgtacttg 3780 ttacgctcca aaaaaccagc aagggcgtgc cacaagggca cgcagacaga tggtttcgaa 3840 atcgacgatg acaaactaaa cgatttcaac caaacaagta gcagcgtatc tacaaccttt 3900 gaggtggacc aggtccaaca aaagctagag gtcacagata caagccgatc aaaagttcgc 3960 agggagaaaa gaaaaaaact aagaaaggaa gaacggaagc tgcttctctc agccatggtg 4020 gggcaacttc gcgacgttag gccgtatgct caggtacaaa tgctaggaaa aacagtagtg 4080 ggtttattgg atacaggggc atctgttagc tgcatatgtg gtagtctggc cacagagctg 4140 ccaggtctaa atattttatg ccaaaaattt accgcagatg ttagaacagc cgatggacaa 4200 tctcaaaaaa ttaccggccg agtaaccact gaagtgtcta tccgaaacga gaataaaaaa 4260 atcaccttct acgtagttcc atctttagca ggggatctct atttaggaat agatttttgg 4320 aagcactttc agctactacc tgaagctttc tgtggcaaca accaagtagt ggcagcctta 4380 gaggagcttt cggctcgaga gcttactcaa agccagcaga aaacgctgct tcaggttgtt 4440 caacttttcc cgtcgtatgc cttaaaagga ttaggaagga ccgaattgat tacccactcc 4500 attgatgttg ctcaagcaaa gccagtaaag caaagacact atgccgtatc gcctgcaaca 4560 gaaaaactga tgtacgccga gttggatagg atgctcaagt taggggttat cgaagagtca 4620 tccagcgctt ggtcctcgcc agtagtcctg gtacaaaaac caggaaaagt taggctttgt 4680 atcgatagcc gaaagttaaa tgaggtgacg gtaaaagatg cataccccat gcccttaata 4740 aatggaatat tgagtagact ccctaaagct gaattcatta ctaccttaga ccttaaggat 4800 gcattctggc agatcccgct agaccaaagc tccaaggaca aaacagcgag acccctttac 4860 caatttgtgg ttatgccttt tgggctatca aattctcctc agactatgtg caggctgatg 4920 aataaggtga tacccgcaga cttgagaaac gaggtgttcg tttacctgga cgacctacta 4980 atagtctcag acacgttcga aagacacata gaagttttgc aggaggttgc ctcttatcta 5040 caaaaagcca aactaaccat aaatgtggag aagagcaact tttgcatcaa agaagtaaaa 5100 tatagacatg tgattggaaa tggaactgta gggacggacc cagacaaaat atccgccata 5160 gtcgagtttc caagcccgcg atcaatgaag caactacgtc gatttttagg gatgactggg 5220 ttgtacaata agttcataaa gaacttcgca gcgctagcct ctccaattac agataccctt 5280 aaaaataaac ggagatttgt ttggacgaag gaagctcagg aagcctttga ggggctaaag 5340 gaaaaaatgt gcactgcccc tgttttacac agtccgaatt ttgacgcacc tttttccatt 5400 cattgcgatg caatacagga gttggtgcag tcctgatgca gcaagactcg gaaggaaacg 5460 aggttctgat ctcctttatg tctcggaagc tgaaaaggtg ccaatcaaac tacacgatga 5520 ccgaaaagga gtgtctggca gccgtgttgg cagtaaaaaa aatttagggc atatgtggag 5580 ggtcaagaat tcagtattat cacggatcac gcttctttaa aatggcttat gtcccaggca 5640 gacctcagtt cccgtctggc ccgctggtcc ctaaaattac aagggtttaa gtttaatata 5700 agccatagaa aaggcagtca gaacgtggtg ccagacgccc tgtcaaggac ctttacagac 5760 gacctatccg ctttcgaagt agcaagttta gtggatatgg aggccaaagc attccgggga 5820 atagagtacg aaaatttaaa gtcagctgtc tcttcaggaa caccaagatt gttggatgtc 5880 aggattagca acgggtactt atatcgccgc gcggaacacg ctgcagacga aaaggttgga 5940 gatgagtctt gctggcttca caaagaaata gttcccgact tgctaaaaag agatcctgaa 6000 gatctcatgg cagcccactg cggagtaggt aaaatgttgg agaaactaag aagatatttt 6060 tattggccaa atttagcagc agatgtcaag gaatttgtca acaagtgagc ggtgtgtaaa 6120 actaccaaac accccagtca aatcttaagg ccacccttag gaaatactgg tcgttcaaac 6180 aggatcttcc aaagacttta tgtcgatttt ttaggaccgt atccgcgtag ctcagaaggg 6240 aatataggat tgttcatagt cgtcgatcat gcttcaaagt tcccatttct caaagccgta 6300 aaaaagttta caacggaggc tatccttccc ttcctggaaa aagaaatctt ccactgtttt 6360 ggcgttccgg aaacaatcat ttccgacaat ggggtacaat ttaaatccgt ggctttcaat 6420 tccctgttag aaaaatatgg cgtggcgcat tcttatgccg ctatttatgc accgcaagcc 6480 aacgcttcag aaagggtcaa cagatccgtt ttagctgcta ttaaagctta catagaccca 6540 aaccaaaaga actgggatag taagctcagt agcatctgtt gcgcgttgag attgagttta 6600 cactcagcca gtaaaacaac tccctacaga atggtgtttg ggcagcacat gatagaagac 6660 gggatgtcgt atcgaattat tagagaactc agactgctgg aagatagaac tatgcagttt 6720 gataaagacg actcgaggga tataatcatg cgtaccgccc aggaggcgat ggacgtccaa 6780 aggcaaagaa acgagcggtc ctacaatttg agaagccgag aggcttatag acctggacag 6840 gaagtgtatc gacggaactt ccagcaaagc aatttttgca aagggtttag ctcaaaactg 6900 gctccggtct tcataaaatc aagagttcgg aagaagctag gagctgctta ttacgagatc 6960 gagaatctcc aaggaaaggt tgtaggcaag taccatgcta aagacctacg gcaatagaaa 7020 gtatccgccc gacaagcttg aatcttcctt gtgtaatact ttaccccaag ttagattttg 7080 gtagggggga at 7092 // ID Ginger-1_TriAdh repbase; DNA; INV; 5030 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Ginger2/TDD; DNA transposon; Transposable Element; KW Ginger-1_TriAdh. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5030 BP; 1772 A; 782 C; 813 G; 1663 T; 0 other; tgtatttaca atgttgcatt actatgcatt attgtcatac cgatcacatt ttctctacct 60 aatacatctg gtggacctta catcacttct tgtggcttga ttttgtattg cacaactact 120 actatttgta tcttcttcat tcctaaggta attataggat atttgatgtt cgcctaaatc 180 taagatagca atagacaatt attgctcatt acattaaatc gacgtagtat agattctgat 240 agctaccgta gggtattatt catttttcta taggtcagac ataacaaaat tggtcctgct 300 cagccttatg aatgaattga tgaaattggc ttttgataaa agatcagtta taggtgtaca 360 cacctgataa tgtttatcaa ctaatataac gtccaacaaa atagatattc tttgtattag 420 taatgttact gaaaggaaaa gattgccaaa acaattttcg taactactaa gtataaacta 480 ctgttaaatt attcattact gaatatcatc agaggggaca taattgacat gcaggcccag 540 cctgatggag gctacaaata tattttgaat taccaggacc atctcaccaa gttttgtgtg 600 ttacgtcctt taaaatctaa gtagcaagag gtagtgagta gggaaatatt cgaaattttt 660 ggtttactag gagctcctac catcttgcat accgataatg gtcgtgaatt ttcgaataag 720 gtattgcaat tcaagtttaa aatctttaga acaattagag gctcgaatga caagttattc 780 attaatacta aatattgtta cgaaatgctg ttttctcgat tttatactac ctattcttag 840 aaatgtcatg cttctctaaa aagtatttac atattagtat ttaacagaaa tcttgacaat 900 ttggccaaat tgtaaacaag tacatggcaa gccgcgacat tctcagtcgc aaggctccgt 960 agagcgagcc aaccaagatg tagaggtaat ccttgcatgt tggaagaggg aacatagtat 1020 tgtaaattag gctagccatt taccgttgat tcagtggcag aaaaattcaa ggtttcattc 1080 tggtgagtta gccatttata cataaaatag ttttcgtaat attgtgcgtc atttatgtaa 1140 tcaaaaaata tatcgaaatt tcatatttag ggattgggag gtcaccttac gaggcaatgt 1200 ttggtaaaaa gcctagggtc gatattgacg gcagaaaatt ttctaaaagt gtgctagaca 1260 agataggtta catacaaggt acagatgaag ataatatcga atcacagccg tttgaaaata 1320 taccagattc agaagatgag ctaagcagcc aagaatcttc cttcgacgat ttagaatcga 1380 atgctgattt atgcgtcagt tgccaaagac catctagtag agcacataag tgtcgtattt 1440 gcaaaaacta ttgccatgca ataccgccct gttgtgcttc tacggatatt gaagaagggt 1500 ttggtagtct ggttacctgt cgactatgtg aatcagaaga gcacacacgt aacttgaggc 1560 agcaatcgaa gagaaaattg cagcaacaag cagaaacaat gctgcagaaa tcggtaaaaa 1620 ggctaaagtc aggcatagtt ggtgacaccg tgctgattcc attgccagaa cctgatagag 1680 agaaaagcca attaaggaac gtgcaagcct gtattttttc tgtatcaaaa gataattatt 1740 accagttagg aacatggcat ggcctaataa ataggttaca tgatgccacc cagcttaccc 1800 tatgtaagga atcgttctta gcaaaagatg atataaattg taatgttaaa attagtctta 1860 gagaaactgc caaattagaa tctgttatca ctggccaaaa atctcatttc tgtggttgct 1920 atacatcttg cattactagc cgatgcaaat gctttaaggc taatgttaag tgcaacagta 1980 aatgtcatca tcagagaacc tgtaaaaata aataattata ataataatca gaacataata 2040 aattctttaa taactactta actgataata atatctcagg tttatttggt aaagttactg 2100 caagaataaa gattattata atcaacaaag aaatgtgact caggtttagg tagtaaagtt 2160 attgcaataa taaagattat tataatcaac aaagaaatat atctctggct aattagtaat 2220 attactgcat gaataaagat tattataatc aacaaaaaat atatctctgg tttatataat 2280 aatgtttctg tatgaataaa gagtattaca atcaacaaag aaatatatct gggcattgac 2340 taggcatttc gcgaactgac taggcatttc gcgaattgac taggcatttc gcgaattgac 2400 taggcaattc gcgaaatcac tggatttcgt acttgcacgg taacatcgcc ttcctaaatt 2460 aactgattaa tcaatggtaa tataatctaa tactctttta actcttttaa taatcaccaa 2520 attttaactc aattttatgt taggatttgg gttaaagtat tattgacttc agaaaatcgg 2580 tgtgaaactc cagttgtgag ctacagtatt agctacatat tttaattaaa cagcaactga 2640 acagttacct ttatcacgta gatgaggtaa accaaactta aataatattt gctaatccaa 2700 aattacaaca catggtgata cgcctacatc ttttttatgt acttcattgc ttaagctgta 2760 cgctactcta tgacaattaa atggcctgta agttaggaaa tcacttcata tatccgatat 2820 taaatgaaaa tttaatggac agctatagct aaaatagtta aagctatcat atagacttga 2880 ttactaaact tcgtttaata aggtgtaaac tttaacgaaa tggtaaatag ataaatcatt 2940 aatgccctct tctgaacgtg ctactgaatt acagcataca gtatattaat tattattcta 3000 aattaataag tatattctac attaaggtca gcaatattag gacatactgt gccagataac 3060 agttcattag gtcactttgc attctatact cagtaataca atctatactt ttaatattta 3120 actgatctat gtataattgt aatcatgctg tattgattat agttttgtac gaagatatag 3180 taactgtcga tactcattct tatactaaat ttagtaagca ataattggtg gtattcaata 3240 attcttgctt atgatcgtca aattatgaac tagttaggca tataagttgt gaaaattacc 3300 atagtcagtt ttgtcgttga aattgcaatt cacagttgta aaaattttct tagcgagaac 3360 tcataataga cattagtcta ctccaaatct tttgtgattg catttgtggg gaaaattcga 3420 agtaggttta ttacaatagt cgtaatcttt gatactctat aatttaaaca gacgtatatt 3480 aagtctttaa cctaactata agcatcaatt tatagtgcga taaggcaaaa agaataattt 3540 tccactattg atgtacatat ttattttgtc aggatatctt tcgaatttaa gatacattca 3600 ggccaaaata ttttaacaaa attttttccc gtactatatc aaatccatgt actgaacatc 3660 aaaatggtgt tttataagta taaagcagta agtaccatgt taccatgaat ttaaataaga 3720 gaattctaaa aagctaaact aacgcaagga gaagagataa ttgtaacgat atctgacata 3780 ttaagcgtac attcagttta tacatgtcgg aataacacgg atcaatagca gtttaattaa 3840 gaaacgcagt aatacaatga agttatgaaa ggggaaagga taaatgcaat tatctctcgt 3900 atacagtaat tgagtataca tccaattcga ttatatggtg gctgtatggt ttcatacaga 3960 ataccttgct aagaatatga ttttgcaatg atgctgtaaa gtaaaaatgg tgatgttgta 4020 atttctgtga ctggtgaata caaagtaaaa atcttttaag taccggtccc atattgactt 4080 aaactagctt ttacttcacg taatatgcat aactaagaat agtgaagata gaacttttat 4140 tagcactttc tgttactcga attattccca ttattataga agtatgttac ttgtaaacaa 4200 cccttagtct agttatcctt tattacagta cgtttcttta accataatat cctaatatag 4260 ttattctatt agcaaatatt ttaatgtatt acggtgctca tacgtaaacg taactgccgc 4320 aagactattg cctagattaa ctaacctgac ttaggattaa agcatctcat tactttctac 4380 acgtaatagc aataaatacc aaatatataa cagccaccct aaatgaaaat tacgacaaga 4440 catttatcta tatatcatat tttgtacaac aattgcagac tttactataa tagctacgat 4500 gataaagtta tttaaatgct caggtaacat cgtaatagct aattacagtt gttcattaac 4560 tgataccttg attaatggtg ctgctttgga gtaacttttg cgtagtaggc ctagcacagt 4620 tattattaga taacatttag gaattactaa aactgatttt atgcataaaa ttttactttt 4680 ttcgacagat aactacgaat tcttcgaata ttatcaagta ctgacaacta atattcgtgt 4740 aatgtagcaa tataacaaat cggttgattt tattttccag taaaattttt gaatttatag 4800 tttaatctat gcgagtatca aaagatgctg tctactttac aaatatgtga aactccgaat 4860 gataatatca ccaatgaaaa gcaaactata ttatctggca aagataagaa tctctgtaaa 4920 aaattatttt gagacactaa tcttcctatt ttatcactta gagtcgtaat aaggaattta 4980 taaccactga tatatttagt tactaatcct aatgcaacat gataactaca 5030 // ID Gypsy-239_AA-LTR repbase; DNA; INV; 131 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-239_AA_; KW Gypsy-239_AA-I; Gypsy-239_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-131 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1080-1080 (2011). XX DR [1] (Consensus) XX SQ Sequence 131 BP; 52 A; 25 C; 18 G; 36 T; 0 other; tgtagtattc agatattaag taattacatt acactaagct actcccaatc ttaagtaggc 60 catgacttag acaagacctg acattagact gattaaggac cataactcaa taaacgttaa 120 aactgataac a 131 // ID BEL-218_AA-LTR repbase; DNA; INV; 374 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-218_AA_; KW BEL-218_AA-I; BEL-218_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-374 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 894-894 (2011). XX DR [1] (Consensus) XX SQ Sequence 374 BP; 128 A; 66 C; 61 G; 119 T; 0 other; tgttcgatca catcgattta tatttatccc atcatatctg tcgatctact acaacaacaa 60 atttgtgcta taaatcttcc tatatctgcg ttgtgtttgg aagaagcaca atataatagc 120 cgtgtgctcc aataaatgca aaggcaataa gcatagagag gaggaatgac aacctagaca 180 taacaacaac tcacggaaga caaaatcgca tccacgtatt ttagatttta actatgtatt 240 ccaatctgtt ttgtatcgaa agtgttttta tgtgtccgat taaatgttat tttaagtata 300 aatatagtgt ttgtgtgaac agaactacaa aaccgactat tcttcgagtg aactgattcg 360 caagctatta aaca 374 // ID BEL-1_IS-LTR repbase; DNA; INV; 232 BP. XX AC ABJB010932093; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_IS_; KW BEL-1_IS-I; BEL-1_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-232 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010932093; Positions 4465 4696. XX SQ Sequence 232 BP; 75 A; 44 C; 56 G; 57 T; 0 other; tgttgtgaat tgtaaattgg aatcctcctc acgggaaata gccgcacagg gctcgtgtaa 60 atatgtggtt aacgagaacg aggaagacga aggaacggtt tgaaaaaagg cagatcgaag 120 aagatgtacg gaacgactgc gatgaccgac acccgtagat taacttcgcc atctaaatag 180 taaattcatt ttgtcgctta ccgtaagtct ttgacgttca attactgcaa ca 232 // ID Outcast-16_AAe repbase; DNA; INV; 4897 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Outcast non-LTR retrotransposon from Aedes aegypti. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; KW Outcast-16_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4897 RA Kojima K.K. and Jurka J.; RT "Outcast clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1430-1430 (2011). XX DR [2] (Consensus) XX CC There are 3 complete sequences with >99% identity in the genome CC of Aedes aegypti. XX FH Key Location/Qualifiers FT CDS 20..985 FT /product="Outcast-16_AAe_1p" FT /translation="MLLETTGIIKNVPISLTEKDIYHNTISAKKITRIERI FT KKRSEPKDPNSKLVDTRSIKITFEGPELPETVCIYGVLEKPEIYIYPVKIC FT TKCWRLGHKEIACKSKKTCIKCGEYVTENHTHDEEHLTPKCRNCGGDHFPT FT NKDCPERIRMEHINVAMTVNKMTFPEAEQLYPKTNIKTSNRFALLESTVEF FT PQLEQPSRTNAQFRRINQTNTTKVDYRKIINNMTNDPRPKPSKTRPEIQKN FT IFPTPETYPEQVHVIENNPHKVTDTEKLKTEQENLTKQLQHIFEKITDKNN FT DNKIESDVLLIEIASMIQNLIQLTNSHNKK" FT CDS 1012..4659 FT /product="Outcast-16_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MGNCFLGDLIISQLNIHSIRPLHKRETIKTYLKTKNI FT HIFLLQETWLKPDEKYKFLNYNFIKNCRLDGYGGTGILVHPTLVYEELTFP FT NIDLELTAIIIKNTKHPIIFISIYIPPDTPFNEVKEPLEKLFDFINNSTTP FT VFLGGDLNAHHPTWDNVSNKIDRRGELISELIENHDIILLNTGVATRWEDL FT NYNPSAIDLTITTPELGPLTNWETSTDDLGSDHKLIECSITSYNKFDSDTP FT RTIISKKQAIENINEIDFRQLTSVDELNSEVNKAINKATFTIRPNSKKKVK FT PYWNENIKQLYETKNKKHIEFRNNLTLENKHEFKKAERNFKKALKKEVYEY FT RNKMLDEINEQTSVNEMWKIVKCISGNFKYKDNSELINNQDLANKFIELNF FT KNETNNDISRIGTNFDVVEEKYIAPLNIKNMINTIKGKKDTSAAGQNKLSY FT YILKNINSNLLIKILELTNKVWMDELIPEEWLLIKIIPILKPGKDKMKETS FT YRPIALININLKIINCEIKERLNSFLDQHNLIPDLSYGFKKGYSAINCVNH FT INSIITEARRQKQQVATIFLDLTKAFDSVDINILLKQLEGLRIPRKIINWF FT ELYLKNRKIVMQTTQGEITQSTNKGLPQGCPLSPILFNVYTRDLHNITTDK FT VIFIQFADDFSITIIGDSLEEIELTSNIVLETLREQLASLKIEINPDKSAV FT VLFNSKMNNKINVKINNTPIEQKEHHKYLGYIVDNKLSHKPHINYVNTKIK FT KRLNIMKMICKKNSGAHPKTALKISKAIIRSQVDYGLTLYGGTAKTNLLKL FT NSSFNSALRTSLRLLKSTPINVLYSEAGEIPIYQRAKWLTKKEAINTFANN FT KPITQILSKFLELENLTKYFTFLELTVFENNYLIIQTHNASQQIESNINLK FT EIIQNEIPNIKLKTMNNETVKRITLNYLNENYGNFQKIYTDGSKTTDACGI FT GVWCEDNNEKLKCQVNQAMSIMNVELMAIKTAIEIIETKEQQTFIICTDSK FT SALTSIKNKNIKNNFIIQDIINKLNKTNKTIVLQWVPGHKGITGNEKADIL FT SKEGCTSEHKIHTKIPLNDTLNLAKAETLQEWIQEYTTTSTQKGTKHFNLM FT QIPSFKPWFHKLDLNTQQIVTIGRLRTYHIITKEKLHMWNLVQDDKCDHCG FT VKEDSNHIFLQCSKHSQSRRKYKILTQHNDMESVLNNATTKDYRQISEFVK FT TNNITI" XX SQ Sequence 4897 BP; 2164 A; 966 C; 681 G; 1085 T; 1 other; caaagctttc attccgcgaa tgttgctcga aactaccgga ataatcaaaa acgtaccaat 60 atcgctaacg gaaaaagaca tataccacaa cacaatttca gccaaaaaaa tcacgcgaat 120 cgaacgaatt aaaaaacgta gtgaaccaaa agatccgaat tccaaactag tagacacaag 180 atcgataaaa atcacttttg agggaccaga acttcccgaa acagtatgca tctacggtgt 240 actcgaaaag ccagaaatct acatctaccc agtgaaaatt tgtaccaaat gttggagact 300 cggtcacaaa gaaatagcct gtaaaagtaa aaaaacatgt attaaatgcg gagaatatgt 360 aacagagaac catacacacg acgaagaaca cctaacacca aaatgtagaa attgtggagg 420 agatcacttc cccacaaaca aagactgccc agaacgaatc agaatggaac acattaatgt 480 agccatgacc gttaacaaaa tgactttccc cgaagctgaa caactctatc caaaaacgaa 540 catcaaaaca tctaacagat ttgctttact cgagtctacc gttgaattcc cacagttgga 600 acaacccagt aggactaacg cacaatttag acgaattaat caaacaaata cgacgaaagt 660 ggactacagg aaaatcatta acaacatgac aaacgaccct agacccaaac cctcaaagac 720 caggccagaa atacaaaaaa acatcttccc aacaccagaa acgtatcctg aacaagtcca 780 tgtgatagaa aacaatccac ataaagtcac agacacagaa aaactaaaaa cagaacaaga 840 aaatctaacg aaacaactac aacacatctt cgaaaaaatc acagacaaga ataacgataa 900 caaaattgaa tcagacgtac tactaatcga gatagcgtca atgatacaga atttaattca 960 actaactaat tcacacaata aaaagtaata atccttagtt aagacacgat tatgggtaat 1020 tgctttttag gtgatctcat catatcccaa ctaaacatcc atagtatcag accattacat 1080 aaaagagaaa caatcaaaac atacctaaaa acaaaaaaca ttcacatctt tcttcttcaa 1140 gaaacttggc ttaagccaga cgagaaatac aaatttctca actacaattt cataaaaaac 1200 tgtagactag acggttatgg gggaactgga atcctagtcc accctacact ggtttatgaa 1260 gaactaacgt ttcctaacat agacttagaa ctaacagcaa taataattaa aaacaccaaa 1320 catccaataa tattcatttc aatatacatt ccaccagata cgccatttaa cgaagttaaa 1380 gaaccattag aaaaactttt tgacttcata aacaacagca caaccccagt attcctggga 1440 ggagatttaa atgcacatca tccaacttgg gataatgtat ctaacaaaat agacagaaga 1500 ggagaactaa ttagtgaact catagaaaac catgacataa tactattaaa caccggagta 1560 gcgacaaggt gggaagactt aaactataac ccatccgcca tagatctaac tatcactaca 1620 ccagaattag gaccattaac aaactgggaa acaagtactg acgacctagg atcagaccac 1680 aagctcattg aatgcagcat aacctcttat aacaaatttg atagtgacac accaagaacg 1740 ataattagca aaaaacaagc aatagaaaat attaacgaaa ttgacttcag acaattaaca 1800 agcgtagatg aactaaattc agaagtaaat aaagcaatca acaaggccac atttacaata 1860 cgtcctaaca gtaagaaaaa agtcaaacca tattggaacg aaaacataaa gcaactctac 1920 gaaactaaaa acaaaaaaca catagaattc agaaataatt taacattaga aaataaacac 1980 gaattcaaaa aagctgaacg taatttcaaa aaagctttaa aaaaagaagt atatgagtac 2040 aggaataaaa tgctcgatga aattaacgaa cagacgagcg tcaacgaaat gtggaaaatt 2100 gtaaaatgta taagcggtaa tttcaaatac aaagacaact cagaactaat caataatcaa 2160 gatttagcta ataaattcat tgagctcaat tttaaaaatg aaacaaataa tgatatttca 2220 agaataggaa ccaatttcga tgtggttgaa gagaaataca tagcaccatt gaatattaaa 2280 aacatgataa atacaataaa agggaaaaaa gacacatctg cggccggaca aaataaatta 2340 tcatattaca tactcaaaaa catcaattcg aacctactga ttaaaatctt agaattaacc 2400 aataaggtat ggatggacga actaatacca gaagaatggc ttcttatcaa gatcatacca 2460 atcttaaaac cgggtaagga taaaatgaaa gaaacatcgt atcgaccaat agcactaata 2520 aacataaatt tgaaaataat taactgcgaa attaaagaaa ggctcaattc attcttagat 2580 caacacaatc ttatacccga cttgtcatac ggattcaaga aaggatattc tgccattaac 2640 tgcgtaaacc atattaattc aattataaca gaagccagac gtcaaaaaca gcaagtagca 2700 acaatattct tggacctcac taaagctttt gattccgtag atattaacat tttgcttaaa 2760 caactagaag gactaagaat cccgagaaaa atcataaact ggtttgaatt atacctaaaa 2820 aacagaaaaa ttgtcatgca aaccacccaa ggcgaaataa cacaaagcac caataaagga 2880 ctaccacaag gttgcccact ctcccctata ttgttcaacg tctatacgag agacttacac 2940 aacataacaa cagacaaagt gatttttatc caatttgccg acgatttctc aataacaatc 3000 ataggggact cattggaaga aatagaatta acatcgaaca ttgttttaga aacattaaga 3060 gaacaactag catccttaaa aatagaaata aacccggaca aatcagcagt agttttgttc 3120 aattctaaaa tgaacaataa aattaatgtt aaaatcaaca acactccaat agaacaaaaa 3180 gaacaccaca aatatctagg gtacatcgta gataataagc tctcccacaa accacatata 3240 aactacgtca acaccaaaat taaaaaaaga ttaaacatta tgaaaatgat ttgtaaaaaa 3300 aatagtggag cacacccaaa aaccgcactg aaaattagta aagcaataat cagatctcag 3360 gtcgattacg gtctgacact gtatggagga acagcaaaaa ctaatttatt gaaactaaat 3420 tcatctttca actcagcgct tcgaacaagt ctaagacttt taaaatccac cccgataaac 3480 gttttgtact cagaagcagg agaaattccc atctaccaaa gagctaaatg gttgactaaa 3540 aaggaggcca tcaacacttt cgctaataat aaaccaataa cccaaatctt atcaaaattt 3600 ttagaattag aaaatctaac taaatacttc acattcctag aattaacagt attcgaaaac 3660 aattatctaa taattcaaac gcataatgcc tcacaacaaa tcgaaagcaa cataaatctt 3720 aaggaaatca tacaaaatga aatacccaac attaaactaa agaccatgaa caacgaaaca 3780 gtaaaaagaa taacattaaa ctatttaaac gaaaactatg gaaattttca aaaaatttac 3840 accgacggat cwaaaactac cgacgcctgc ggaataggtg tatggtgcga agacaataat 3900 gaaaaactaa aatgtcaagt aaaccaagca atgtcaataa tgaatgtaga attaatggca 3960 ataaaaacgg ccatcgagat aattgaaact aaagaacaac aaacatttat aatttgcaca 4020 gactccaaat ctgccctaac aagtattaaa aataaaaata ttaaaaacaa ttttataata 4080 caagatataa taaataaact caataaaaca aacaaaacaa tagtactcca atgggtacca 4140 ggtcacaaag gaataacagg taacgaaaaa gcagacatac tttccaaaga gggatgtact 4200 agtgaacaca aaattcatac taaaatccca ttgaatgata cactaaacct agccaaagct 4260 gaaacactcc aagaatggat acaagaatat actacaacat caacacagaa aggcacaaaa 4320 cacttcaacc ttatgcaaat cccatcattc aaaccatggt tccataaact ggatctaaac 4380 acacaacaaa tagtaacaat tggccgtctg agaacatatc acataataac taaagaaaaa 4440 ttacacatgt ggaatctagt tcaagatgat aaatgtgacc actgtggggt aaaagaggac 4500 tccaaccata tctttctaca atgctcaaaa cattctcaat caagaagaaa atacaaaata 4560 cttacacaac acaatgacat ggaatctgta cttaataatg ctactaccaa agactaccgt 4620 cagatatcag aatttgtaaa aacaaacaac attacaattt agactaacac tacagaaaca 4680 caaaatcact cgtggtttca aataaacaaa aatgaccgat ctaatgaatt cggtcccatt 4740 attaattaca cgaaaggaaa agatcaatag agatcgcgaa cgattttaga attgagcgat 4800 agagttgctc taacacactt cacgacttga cattccttgg cagtatagac tgacagtaca 4860 gcagtctcgg ccacccagag gagaagaaga agaagaa 4897 // ID DNA-3_PPac repbase; DNA; INV; 588 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Non-autonomous DNA transposon from the Pristionchus pacificus DE genome. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-3_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-588 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 955-955 (2010). XX DR [1] (Consensus) XX CC >86% identical to consensus. XX SQ Sequence 588 BP; 146 A; 151 C; 145 G; 137 T; 9 other; taggggccac aacgcagata gccganttcg cgcgatgggg tcaaaagtgt gttcaaacta 60 tccctcgtgg tcccaataac gtcatacttt ttgccgtttg tcgatatctt gtctctgagc 120 cgcgctagag cccgagaagt acgcgcgcaa cncactgggg gagaggaggg aaaggcgaga 180 cgagagggaa agggggcgga gcctaattag cgtgtgtctc tctccctccc cccgccctct 240 cgcacgcctc tccccgccca aaagncacaa gccggggaag catgctaaaa atctgaantt 300 ttctctaaaa ttcctaaaat attcgaatta gtcagataag ttcggtccga gaagaatcgt 360 caaactggaa catgccaaaa cgacctattc ccttcnctta natgcgaatt tgatctactt 420 agcgatgnta tgcatatctg ggaccgngct cgatcgaaag aggaatgtca cagctatcca 480 tagcgctgtc gaacgtccga tttggtttag aacgaacctt gtgactttac tttctcgctt 540 ttncgcgaaa tcgtgtccgc acaaagtggg cggagcttgt ggccccta 588 // ID Gypsy-75_CQ-I repbase; DNA; INV; 1942 BP. XX AC AAWU01021168; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-75_CQ_; KW Gypsy-75_CQ-LTR; Gypsy-75_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1942 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 529-529 (2011). XX DR GenBank; AAWU01021168; Positions 11887 13828. XX CC 'GTATT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 225..1928 FT /product="Gypsy-75_CQ-I_1p" FT /translation="MDAKQFAKFMASMEKLLGSFKSMAGQAAAGTSTAGGS FT AIGGSVTIPFIPLPPPLELEGDMERNYNFFENGWKNYTSAVGMDAWPVERN FT KQKTSVLLSVIGQAALKKYFNFELTEVQQGDPALALSAIKAKVVRERNPII FT DWTEFFSMEQWEEESVDDFVGRLKALAKLCKFGVLEAEMVKFKIVTSNKWS FT HLRATLLTAQNLTEAIVVDLCRAEEVSERHRKAVSSSSFGVNKVRRMARKC FT KFCGGRHEFVKGVCPALGKKCNRCGGKNHFEKACKNDQKKRSKKKVKDVYG FT RSGDSAQTSEESDSEPESSDAANIGQIYDKSSYGGHVQADLDLFVGETWQS FT VQCELDTGANASLVGRDWLIRVGGQNSPQLLPSRCRLQSFGGGHIPVLGEV FT KIPCHRNGRKYNLSLQVVDVEHGPLLSAQVCKKLGFVKFCNSVSASAPHYP FT NDLIAIHRIKAQDSQLIDSGKEYRQKDVPSRMQQKPVHVCKQPRKEPQTSK FT QNLDQSTTQIGKAVVSVADQEAVRVWTNGVRSAEEPNTLKLPVPVEMPNRL FT KRNSVVFPVFEHKVQENFIFV" XX SQ Sequence 1942 BP; 511 A; 416 C; 579 G; 436 T; 0 other; tggtgtcaga agtagtcgtg gtttagtgac agttttccgg cgtcatttta catcgcggaa 60 gttactgtgc ggaaaaaaac tttgttcgag tggcgaaaca atcattcgcg aaagttccgg 120 aatagcgtta cgtgattcgt tcagcggacg tgttcggact tgttcggcgg ccatatttgt 180 attcgtttcc acgttgaaaa agtgtctttc cggaagtgaa aaaaatggat gcaaaacaat 240 ttgctaagtt tatggcgtcc atggagaaac ttctaggctc gtttaagagc atggccgggc 300 aagctgcggc cggtacgagt acagctggtg gaagcgcgat tggtggaagt gttaccattc 360 cgtttattcc tcttccgccc ccgttggagc tagagggcga catggaaagg aactacaatt 420 tctttgagaa cggttggaag aattatacga gtgccgtcgg aatggatgcg tggccagttg 480 agcggaacaa gcagaagaca agtgtgttgc tgtctgtgat tggccaagcg gcgctcaaga 540 agtattttaa cttcgagctg accgaagtgc aacagggtga cccggcgctg gcgctgtctg 600 cgatcaaagc gaaggtggtt cgagaacgaa atccgatcat cgattggact gagtttttct 660 cgatggagca gtgggaggag gagagcgttg acgatttcgt gggacgattg aaggcattgg 720 caaagctctg taagtttggc gtgctggagg cggaaatggt gaaattcaaa attgtcacat 780 ccaacaaatg gtcccatttg cgtgcgacgc tgctcacggc ccagaacctt acagaggcga 840 tagtagtgga cctgtgtcgc gctgaagaag tttcggaaag gcacagaaaa gcagtgagca 900 gttcgagctt cggcgtcaac aaggtgaggc gaatggcgag aaagtgcaag ttctgcggcg 960 gtcggcatga gtttgtcaag ggtgtctgtc cagctctggg gaagaagtgt aatcggtgcg 1020 gcgggaagaa tcactttgag aaggcatgca agaacgatca gaagaagcgg tcaaagaaga 1080 aggtgaagga tgtgtatgga cgcagcggtg attcggcaca aaccagcgag gaaagtgact 1140 cagagccgga aagcagcgat gctgcgaaca tcgggcagat ctacgacaag tccagttatg 1200 gtggccacgt gcaagcggat ctggatctct tcgtcggtga gacatggcag tctgttcaat 1260 gtgagttgga caccggtgcg aacgccagtc tggttggccg agattggcta atcagagttg 1320 gcggacaaaa cagcccacag ttgctgccat cgagatgccg cctacaaagc tttggtggtg 1380 gccacattcc tgtcctggga gaggtgaaga tcccgtgcca tcgcaacgga cgtaagtaca 1440 atctctcgtt acaagtggtt gacgtcgagc atggccctct cctgtctgcg caagtctgca 1500 agaaattggg attcgtaaag ttctgcaatt cagtgtccgc atctgcaccg cactatccca 1560 atgacctgat cgccatccac cgcataaaag ctcaagattc tcagctgatc gactccggca 1620 aagaataccg ccagaaggat gtaccaagcc ggatgcaaca gaagccagtt cacgtctgca 1680 agcagccacg taaggagccc caaacgtcca agcagaatct cgaccagtca acgacgcaga 1740 ttggaaaagc tgttgtatcg gttgcagatc aagaagcggt cagggtatgg accaacggtg 1800 tccgatcagc ggaagagcca aacacgctta agctgccagt tcccgtggag atgccgaaca 1860 ggctgaagag aaattcagtg gtttttccag ttttcgaaca taaggttcag gagaatttta 1920 tttttgttta aagcagggaa ga 1942 // ID Gypsy-20_IS-I repbase; DNA; INV; 4012 BP. XX AC ABJB010875357; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_IS_; KW Gypsy-20_IS-LTR; Gypsy-20_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4012 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010875357; Positions 7972 11983. XX CC Positions [1525-1974] - Reverse transcriptase CC Positions [3094-3576] - Integrase core CC 'AACCT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS join(43..1071,1075..2427) FT /product="Gypsy-20_IS-I_1p" FT /translation="MSDEDQSDSSGQKPSASTQPKIMSSVQQPPPFNFDNV FT AEWPAWLDEFEDYMFASGLSERTQEAQVRSLLYCMGRKAREILKTFNLTDA FT EFKNYELVKGKYNAHFVHSKNVVYESACFNRRVQEAEETVDQFVTELHTLA FT ARCKLGELKERLIRDRFIVGLRDAKLSETLKMDAELTVDTALAKARLRETV FT RKQQNDLAKTEPSRRSEDAALNADSVSKPPRKRDVRAPKIEAGPPTQACEN FT CGYGPHQKRDCPARNKACMKCKKVGHFARKCRRALSNLSKTHVAEVHASES FT GSFLGSVCQSGPNARLAVVTVSGHPLRAKIDSGADATVVGEDFAGKPPRLL FT AEDLKGPNKKSLDVLGKFNAVVEWQGKVSEETIYVVRDLQMPLLGMPAIES FT LGMVRFLNNVSQNKSKYESMCPEVFQRLGTIQGEHRIQLKPDAEPFAVSTP FT RRIPFPMRPAVEKELQRLEATGIIRKVHGPTDWCAPIVTIQKPSGDIRICV FT DLTKLNVSVRRERHVMPPVEEVLANIGEAKFFSKLDANSGFHQIVLSPESQ FT ELTTFITPFGRYCYKRLPFGITSAPEVFQRKMTEILDGLQGIQNLMDDVLV FT YGATREERDNRLRAALQQLARSGVTLNLDKCQLSVTEVSFLGVILDKDGIR FT ADPKKVKAIQLLPAPTDVSGLRRVLGMVNHVARFLPTLSQVTAPLRELLQK FT NNEWIWGLEQQRALDKVKQMITSERCLAKYDSSARTIVSADSSYHGLGAVL FT LQEQRDGELRAVAFASRSLTSTEQRYAQIEKEALALTWAAERF" XX SQ Sequence 4012 BP; 1090 A; 1035 C; 1087 G; 800 T; 0 other; tggtgtcaga agtgacaact cccaccgccg ttgccgaaaa gcatgtctga cgaggaccag 60 tcagactcca gcggtcaaaa gccttccgct tccacgcaac cgaaaatcat gagcagcgtc 120 caacagccgc cccccttcaa cttcgacaac gtggccgagt ggccggcgtg gctggacgag 180 ttcgaagact acatgttcgc atccgggctc agcgaaagaa cccaggaggc acaggtcaga 240 tccctactat attgcatggg acgaaaagca agagaaattc tgaagacctt taacctcacc 300 gacgctgagt tcaagaacta tgagctcgta aaggggaagt acaacgccca ttttgtgcat 360 tcaaagaacg tcgtctacga aagcgcctgt ttcaaccgac gtgtacagga agcggaggag 420 acagtggacc agtttgttac cgaactgcac accctggcgg cacggtgcaa gttaggcgag 480 ttgaaagaaa gactcatccg cgaccggttt attgtgggcc tccgagacgc gaagctgtca 540 gaaacgctta aaatggatgc cgagttaacg gtggacacag ctttggcaaa agcccgactc 600 agagagaccg tgaggaaaca gcaaaatgac ctggcgaaaa ctgagccttc gcgacgcagc 660 gaggacgcgg ccttgaatgc agactccgtt tccaagccgc cgcgcaagag agatgtacgc 720 gcaccgaaga tcgaggccgg gccgccgaca caagcatgcg agaattgtgg gtacggtccc 780 caccagaaac gagactgccc ggcaagaaac aaagcgtgta tgaaatgcaa gaaagttgga 840 catttcgcca ggaaatgccg aagagcgcta tcaaacctga gtaaaacgca cgttgcagaa 900 gtgcatgcgt ctgagagcgg ttcattcctc ggaagcgtat gccaaagtgg accgaacgcg 960 cgtttggctg tggtgactgt gagcggacac ccgctccgag caaagattga ctcaggcgct 1020 gacgccaccg tcgtcggaga agacttcgct ggcaagccgc caagacttct gtaagccgag 1080 gatttgaaag gaccgaacaa gaaaagcctg gacgtactgg ggaagtttaa tgctgtggtc 1140 gagtggcagg gcaaggtttc agaagagact atctacgtgg ttcgagatct ccaaatgcca 1200 ctgctgggaa tgccagcaat cgagtccctc gggatggttc gctttctcaa caacgtctcc 1260 cagaacaagt ccaagtacga aagtatgtgt ccggaagtgt tccaaaggct tggaacgatc 1320 caaggggaac acagaattca gctcaaacct gacgcggaac catttgcagt cagcacgcca 1380 agacgaattc catttccgat gagaccagca gtcgaaaagg agctgcagag gctcgaggcc 1440 actggaatca tcagaaaagt tcacggccct accgactggt gtgctcctat tgtaaccatt 1500 cagaagccat cgggcgacat cagaatctgc gtcgacctta caaaactgaa cgtctctgtg 1560 cgtcgagagc gacacgtcat gccccccgtg gaggaagttc tggcgaacat cggtgaagca 1620 aagtttttct ccaaactaga cgcaaattct ggatttcacc agattgtctt gtcccccgag 1680 tcacaagagc tgaccacgtt tataacacca tttgggagat attgctacaa gaggctgccc 1740 tttggtataa cctcggcacc tgaagttttc cagagaaaga tgacagagat tctggacgga 1800 cttcaaggta tccagaattt aatggacgat gttttggttt acggagccac gcgagaagaa 1860 cgcgacaacc gactgagagc agccctgcag caactcgccc ggtccggtgt gacgctcaac 1920 ctcgacaagt gtcagcttag tgtcactgag gtgtcgtttt taggagtgat cctcgacaaa 1980 gatggcatcc gtgcagatcc aaagaaggtc aaggcaatcc agcttctacc tgcccctaca 2040 gatgtatctg gactccgacg ggtgttgggt atggtgaacc atgttgcacg ttttttgccc 2100 accctgtcgc aagtgacagc accacttcgt gaactacttc agaagaacaa tgagtggatt 2160 tggggattgg agcaacaacg agcccttgac aaggtcaagc aaatgatcac gtcagagaga 2220 tgccttgcaa agtatgacag ttctgctcgt accatcgttt cagccgactc ctcataccat 2280 ggcctgggag cggtgctcct tcaagaacaa cgagatggtg aactccgtgc agttgcgttc 2340 gcttctcgtt ctcttacttc aacggaacag cgctatgcgc agatcgaaaa agaagccctc 2400 gcactgacat gggcggctga aagattttaa gagtatctcc gaggactgga gttcgttttt 2460 cagacagacc ataagcccct ggtcccactg ctgggccagc ataacctgga tatgctgccc 2520 ccacgggtgc aacggtttcg aatgcgcctg atgcgatatt cgtacaagat cgaatacatc 2580 ccaggaaaga acatggcaac ggctgacacg ctctcgagag cgccctgcga agatgcgtca 2640 tctgagggca tgttgaaccc tgccgaggtt tcgccatttg ttttgggagc aatcagcacg 2700 ctgccggcgt cggaggacct tcttcagagg atccggaagc tccagaggga ggaccaagac 2760 tgtgcaagct tgttcaagta ctgtgaggag ggctggcctt cgaaaagcaa actttcatgg 2820 aacctgaagc cgttctttgg agagcaaggc gacataacgg tgtgccaagg cctactgttg 2880 aaaggacccc ggcttatcat tcccaagtgt cttcgagcag agatgctaac gcggatacac 2940 gagggacatc aagggattgt acgatgtcaa gaacgggcaa gggagtccgt ttggtgaccg 3000 agaattagca gggccatagt agacatcata gacaagtgtg cagaatgtgc gatgtacaag 3060 aatcaagctc gagaaccgat gttgtcgttt caaacgccag aattgccctg gcagaaagtc 3120 gccgctgacc tgttccagct cgatggtgtg aactatctcc tgctggttga ttaccgttcc 3180 aggtatcctg aagtagcgat tcttcagtcc tccacttcag caaaggctgt tattgagcga 3240 atgaagagta tattcgcccg acatgggata ccggaggtcc tagttacgga caatggtccg 3300 caattttccg cgacagagtt caagaacttc gcccagcatt acgggtttca acatgtcaca 3360 acaagtccga ggtaccccca agccaatggg gaaattgaac gaatggtgaa gacaatcaag 3420 agctttgtga aaaagagtgc agatgcttat ctggcgctgc ttaactaccg caacacgcca 3480 ggccctacag ggctcagtcc cgcccaactg ctcatgggaa gaaggctccg cagcagactt 3540 cccatgtatc ctgatcaact gaagccccga accttcaacc agtcagcatg gaaaaggaag 3600 gatcagaaac agagacaaag gcagaagcgg aatcatgaca ggcgccacag agcaagacca 3660 ctggcacccc tcaaagagga cgatccggtt tggcttccag agcttcgcac gtcgggcacg 3720 gttgtgggcc atgcacaaac gcccagatcc ctgtacgtcg aaacaagttc tggaaccctg 3780 agaagaaacc gcagccaggt gattcccctg acgcagccgc aggactcgcc ctgggagacc 3840 tctgatgaaa cgtgcagtgc caaccagaac acagcgagat ctcctgtcgg actccagcat 3900 acgcggtcgg gtcgaatcgt gagaccaccc aaaagactgg gatttgatga ctgaacgtcg 3960 agctgactag ttcagttgta cattagggcc atagaacttt aaagggggaa ga 4012 // ID Chapaev-21_HM repbase; DNA; INV; 2812 BP. XX AC . XX DT 16-SEP-2009 (Rel. 14.09, Created) DT 16-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Chapaev-type DNA transposon - a consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-21_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2812 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1915-1915 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(326..508,528..1028,1205..2446) FT /product="Chapaev-21_HM_1p" FT /translation="MSYFCGVFFKNFWEKACISKNKQFSQHTRTYKLQKCL FT TKPKTTKRIEKLFVFCAFRREIVSESAKVLKVELDFSDNRIPSGICERCRV FT AIRRKDEGENPPLPSLYDFKSISVRPATRESPCDCLICQIARNKLNIPHPL FT SASKPKNEGSIEKRCSDCFSVIGRGLPHNCTKGTMRQNLVAVASKDAVAAE FT MVATMVISNKPPSPHGTVRLSQAKGGNSFPVVPGMIQFIKPLLIRLLKKIC FT DKFESLGRTLAEQFTSNHIQDSTKKDSTRVVVHCKDVSDIVNHVSKVRNIP FT GEPLIKVGIDGGGGFLKVSLGVIEANREDSSPPSKRVAKDTGVKRQLLIAV FT SENLPESYENLKSVLNLLQLQKISFVVSCDVKVANLITGLQSHASAHPCTW FT CDADSKNLHICGKPRTFGSIQADFDAFQRSGSILRRAKEFNNVVHKPIICL FT DNQTLLLDFIPPMELHLLLGVVNHLYKNLCQTWPDANQWPAALHIREQPFH FT GGQFAGNDCQKLLKHTDVLQNLAEQSSSFNVMPFVETFRCFERVVNSCFGN FT NLSPNFADKIKEFQSSFLKLPNTSITPKVHAVFFHVKHFIDRHKSSLGVFS FT EQATESLHHNFNVHWQRYKRDPSHPEYEKKLLNCVVDLNSKHSF" XX SQ Sequence 2812 BP; 898 A; 525 C; 531 G; 858 T; 0 other; cactgtgtac ccaaaccatg atcaacccta aactcatgtc cccagaaaac ccttttgggg 60 gatgatagcc catatatagt agaacattta gatataaatt tggaaatcaa actattacgt 120 ccgagaaaga tattttgatt ttaaattttg aagctaatag gcctaaaaaa ttgcgaaatg 180 acaggggatt ttgcattttt acagaactta ttactcaact tggttatggg gaaaacatct 240 agaagtgatt ttaagtgaaa acattagtgt ttctgaaaat tattttggtt gtttttatca 300 ttcccaaaac aaatacctta aaataatgtc ttatttttgt ggcgtttttt tcaaaaattt 360 ctgggaaaaa gcatgcattt caaaaaataa acaattttca caacacacta gaacatataa 420 gttgcaaaaa tgcctaacaa agccaaagac cacgaagaga atagaaaagc tgtttgtttt 480 ttgtgccttc agaagggaaa tcgtgagcta acaacttttc tgattgagaa agcgcaaaag 540 ttctcaaagt tgagctggat ttttctgata acaggattcc ttcaggtatt tgtgaaagat 600 gtcgtgttgc catcagaaga aaagatgagg gtgaaaatcc tcctcttcct agcctttacg 660 attttaagag catctcggtt cgaccagcta ccagagaatc tccttgtgat tgtttgatct 720 gtcaaattgc cagaaataag ctgaacattc ctcatccact gtcagcaagt aaacctaaga 780 acgaaggaag cattgaaaaa agatgttcag actgtttttc tgtgattggg cgagggctgc 840 cccacaattg tacaaaagga acaatgcgac aaaatttggt tgcagttgcc tcaaaagatg 900 cagtggctgc agagatggtt gcaactatgg tcatctcaaa caaacctcct tctcctcatg 960 gcactgtccg tctcagtcaa gcaaagggag gcaacagttt tccagttgtt ccaggtatga 1020 tacagtttta aactatcata agcattaatc ccctgtgtct aatagcatag tcttttctta 1080 gggccatcca gggcaagaga actctttcct gaggagaaaa agctaacagc tatagatatg 1140 ctcaaggttc aaaatgaaac tggactttct aatcgtggca tgaaaaaact tgcatcatca 1200 ctaaatcaag ccactcctca tcagattgtt gaaaaaaatt tgtgacaagt ttgagtcact 1260 tggaaggact cttgcagaac agttcacttc aaaccacatt caagactcaa caaagaaaga 1320 ttcaaccaga gttgttgttc actgcaaaga tgtgtctgat atagtcaacc atgtttcaaa 1380 agtcagaaat attcctggag aaccattgat caaagtggga attgatggtg gaggtggctt 1440 tttgaaagta tccttgggag ttattgaagc aaacagagag gattcttctc ctccctcaaa 1500 gcgtgttgca aaagacacag gtgtgaagcg ccagttatta attgctgttt ctgaaaacct 1560 accagagagt tatgaaaact tgaaaagcgt tttgaatttg ctccagttgc agaaaatttc 1620 ttttgttgtt tcatgtgacg tgaaagtagc gaatctcata actggacttc agtctcatgc 1680 cagtgctcac ccttgtacat ggtgtgatgc agattcaaaa aatcttcaca tctgcgggaa 1740 acccagaaca tttggttcca ttcaagctga ctttgatgca tttcaaagga gtgggtcaat 1800 tttgagaaga gcaaaagagt tcaacaatgt tgttcacaaa ccaattattt gcttggacaa 1860 tcaaacactc ttattggact ttatccctcc aatggagcta catcttcttc ttggagtagt 1920 aaatcatctc tacaaaaact tgtgtcaaac atggcctgat gccaatcaat ggcctgctgc 1980 tcttcatatt cgggaacagc cttttcatgg gggccaattt gctggaaatg attgtcaaaa 2040 gctattgaag catacagatg ttctacaaaa tctggctgaa caaagctctt cattcaatgt 2100 catgcctttt gttgaaactt tcagatgttt tgaaagggtt gtgaattctt gctttggaaa 2160 taatctttct cccaactttg cagacaagat caaggagttc cagagttcct tcttgaagtt 2220 gccaaacacc tccatcactc caaaggtgca tgcagtcttt tttcatgtga agcattttat 2280 tgaccgccac aagtcaagcc ttggagtatt ttctgaacag gccacagaat ctttgcatca 2340 caacttcaat gtacattggc aacgatacaa acgtgatccc agtcatccag aatatgaaaa 2400 gaagcttctg aattgtgttg tagacctcaa cagcaagcat tcattttagt tgaaagggtc 2460 tttttcaaac caaaattaag aatactgtaa tcttaaagaa ttatttcccg atgtgatatt 2520 ttcattcaga aatgttgttt tatatacata agaatgtttc ttttgttgtc acaactatga 2580 aatgactttt aaaaaagaat aataaatgct gtaaaaatgc aaaatcccct gtcattttgc 2640 aattttttag gcctattagc ttcaaaattt aaaatcaaaa tatctttctc ggacgtaata 2700 gtttgatttc caaatttata tctaaatgtt ctactatata tgggctatca tcccccaaaa 2760 gggttttctg gggacatgag tttagggttg atcatggttt gggtacacag tg 2812 // ID I-5_AAe repbase; DNA; INV; 5827 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5827 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1356-1356 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 523..1827 FT /product="I-5_AAe_1p" FT /translation="METDDGEGSKENVLSNDSSSSNKSFRVKVYPSSFPGP FT FVVYFRKKEKPINVLLISSEIYKLYTSVKEIKKISLDKLRVIFGSRDDANA FT LLESKLFFDSYRVYAPSNACEINGIIYDESLDCEDIKIHGSGVFKNKSISP FT VKVLDCVRLSKLFFAGNDSKYMHSNCIKITFSGSVLPDYVMIDNVKFRVRL FT YYPKLMHCDKCLLFGHTSKFCSNKQKCSKCGEPHQSSECNKISDLCFYCKQ FT KHNSLKECVVYIDHQSKFNQKIRNKNILSYAEITKSTDSFSSPNSFDILHD FT DVGNEPSQNLNNYVYKPPSKRKRISNKSLNNQNLFFEPQPSTSFDTNFPPL FT GASVPSNNIPGFQRVDSTHSTNKNDQNSNNISSDKSNESSNQSILNILEEI FT VDFLGLSDFWKKIIKLILPLLASLLDKLNSFVPLISSLFSS" FT CDS 1830..5519 FT /product="I-5_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASINSKKLHILQWNCRSIIPKIDRLKVLLSNFDIDV FT FCLNETWLGTARFFRIPNYNIIRKDRDTSFGGVLIGIRDNIEFKYLNVSFQ FT TQIEYVAVTIRNGKNEFSIICFYIPPNASFSLSEIKSILNEIPSPFYILGD FT FNAHNFAWGSEKTDGRGSLIMDLIDDLNLNILNDGSITRIAVPPNNHSCVD FT LSLCSNNLSMQSSWRTIHDPNGSDHLPILIDMQTSEDNEVFYEASTPDLCK FT NVNWSKFSDLVSISLINIVNSDTPIDSYNTFSKLLIDCLQKSQHNKPSSKT FT TKKNRPSFWWDNECSISLRNKSEAFKLFRRSGSRNDYFLYCKAEAQFTRLT FT KFKKRNYWKNFVENLDRETSLSTLWSVARNLRNYNSSPPTILEYSEKWMDD FT FSSKICPDFVPPSIDYKTNSINYFPELCNPFSIAEFDMALSITKNTAPGID FT NIKFIVLNNLPYEGKLHLLSIYNALFCQNIIPFEWRFIKVINIVKPGRDPA FT XADSRRPISLLSCLRKLLERMILNRLXLWAEDNNIFSSSQFGFRRGRGTRD FT CIALLASQIQLSFNKKQDVVSTFLDVSGAYDSVLIDLLYNKMHNLKIPNLI FT SNFVCNLFSFKIMNFYHNGKSKIIRYSYFGLPQGSCLSPFLYNLFTSDIAS FT IIPNGCYLLQFADDNVISISGKNREIIRHFMQCALNNIGMWAHENGFSFSV FT QKTKYIIFSRKYSTSLINLYLNGIEIEQVDEYKYLGIWFDFKLNWNAHIKY FT IQKVCSKRINFLRAITGTWWGANPSDLITLYKTTIRSVMEYGCFTFGSAAR FT TYFAKLEKIQFRCLRICLKLMNSTHTQSIEVLAGILPLKIRFQELNCKFVM FT NCFSNNHPLIDFLKSLFDINPTSKMLDSFVHCSTENIEPIPSYAFHNSKTS FT IHSYKPLIDLSLFHEMKQIPKFHHSHFANILFERKFNGITPAQFFYTDGSL FT IEGIAGFGVYNLFSGHFFKLQSPCSVFSAELIALYFACNLIKDCAPNIYIV FT CSDSLSCLQALNTINFNFKTHHIILIMKKILLELSSRGFIIKFVWIPAHCK FT IYGNEQADTLAKFGVHRGITYSRIILPSEFYPKLKVKSLLDWQTCWNSSDK FT GRWCHSICPKVNRFPWFKNMPASRNLICSFSRLMSNHYNCNSHLYRINIKD FT SNLCDCGEFYEDIDHIVFQCDKFITPRNKLINNIVDLGQPIPVSVRDILGT FT KWYPVMKLLFQFLNEISYLV" XX SQ Sequence 5827 BP; 1806 A; 902 C; 950 G; 2161 T; 8 other; cattgttgta gcaggccttg aacgggcagg acgttgattt ttccaacgac tgacattttt 60 ttctcttttc taggtgagaa aattttcaag acctacggag ttacggatcc ggtgtgttgc 120 ggaagaagaa caagattgcc tgwgcggtgc gtttagaatc tccgtggtcg cggtagcggt 180 cgattccctg tgtggcgcgg tagtagaaga ttccctgtgt ggcgcggtag gagtacttat 240 tcgagggcgt taaccatttg tgtggcctct gaaaggacgt cggagaacat aattcctgtg 300 tggtgcggaa gtagtaagga tttcctgtgt ggcgcggaag gagtatattg tcaagaggac 360 gttattcgcc ttggattttt gaaaattgaa gccaccttgg tattggttgc agtttggttc 420 agctgaagtt tgttaagtat ttgtttttat tttattttta twtttttcta ttcttttaag 480 tattacataa tacttctaat tactaattgt tgagtttcca tcatggaaac tgatgatgga 540 gaggggtcaa aagaaaacgt tctttctaat gattcctctt catcgaataa atcctttcga 600 gtaaaagttt atccgtctag ttttcctgga ccttttgtag tttactttcg taagaaagag 660 aaacctatca atgttttatt gatctcttct gagatatata aattatacac gtccgttaaa 720 gaaattaaaa aaatttctct ggacaaattg cgtgtaattt ttggatcaag agatgatgct 780 aacgctcttc ttgaatccaa attatttttt gattcctatc gcgtttatgc tccttctaat 840 gcatgcgaaa ttaatggaat tatatacgat gaatctttgg attgtgaaga tatcaaaatt 900 catggttcag gcgtttttaa gaacaaatcc atttctcctg ttaaggtttt ggattgtgtt 960 cgtttatcga aattattctt tgctggtaat gattcaaaat acatgcattc aaattgtata 1020 aaaataacgt tttccggttc agttcttccg gattacgtta tgattgataa tgtaaaattt 1080 cgcgtgagac tttattatcc aaagctcatg cattgtgata aatgtcttct tttcggacac 1140 acttcaaagt tttgttcaaa caaacagaag tgttctaaat gtggagaacc tcatcagtca 1200 tccgagtgta ataaaatctc ggatctatgt ttttattgta aacaaaagca caattctttg 1260 aaagaatgtg ttgtttacat tgaccatcaa tccaaattca atcaaaaaat taggaataaa 1320 aatatattgt cctacgctga aataacaaaa tccactgata gtttttcatc tcccaattca 1380 tttgatattt tgcatgatga tgttgggaat gaaccatcac aaaatttgaa taattatgtt 1440 tataaacctc ctagtaagag gaaaagaatt tctaataaat ctttaaataa ccaaaatttg 1500 ttttttgaac ctcaaccatc tacttctttt gatacgaatt tccctcccct tggtgcatcc 1560 gttccctcca ataatattcc tggatttcaa agagttgatt ctacccattc aactaacaaa 1620 aatgatcaaa attcgaacaa tatttccagt gacaaaagca atgaaagctc taatcaatca 1680 attttgaata ttttggaaga aattgtagat ttcttgggat tgagtgattt ttggaaaaaa 1740 ataatcaaat tgattttacc attattggcc tcccttttag ataaattgaa ttcatttgtt 1800 ccccttattt cttctctatt ttcgtcctaa tggcttcgat caattccaag aaattacaca 1860 ttttacaatg gaattgtcga agtattattc caaaaattga tagattaaaa gttttattga 1920 gtaatttcga tatagatgtg ttttgtttga atgaaacgtg gttaggtaca gctagatttt 1980 tccgtatccc taattataat attattcgaa aagatagaga tacatcattt ggcggagttt 2040 taattggaat tcgcgacaat attgaattta aatacttaaa tgtatccttc caaacgcaaa 2100 tagagtatgt ggctgttacg attagaaatg gaaaaaatga gttttcaatt atatgttttt 2160 acatccctcc aaatgcaagt tttagtttgt cagaaattaa aagtatctta aatgaaatcc 2220 catcaccatt ttatatacta ggagatttca atgcgcataa tttcgcctgg ggtagtgaaa 2280 aaactgatgg tagaggttca ttaattatgg atttgattga tgatttgaat ttaaatattc 2340 ttaatgatgg atcgattact aggattgctg tgcctcctaa taatcattcc tgtgttgatt 2400 tatctttatg ttcaaataat ttatcaatgc aatcatcttg gagaactatt catgatccaa 2460 atggtagtga tcatttacca attttgattg atatgcaaac ttctgaagac aatgaagtgt 2520 tttatgaagc ttctactccc gatctttgta aaaatgttaa ttggagtaaa ttttctgatc 2580 ttgtttccat ttcattaatc aatattgtca attcagatac tccaattgat agttacaata 2640 ctttttcaaa attgttaata gattgcttgc aaaaatctca acataataaa ccatcttcaa 2700 aaactacaaa aaaaaatcgt ccttcgttct ggtgggataa tgagtgttcc atttctttaa 2760 gaaacaaatc tgaagccttc aaactatttc gcagatcagg ctccagaaat gattactttc 2820 tatattgcaa agctgaagca cagtttacca gacttactaa atttaaaaag agaaattatt 2880 ggaaaaattt tgttgaaaat ttggacagag agacttcatt gtcaactttg tggtccgtag 2940 cacgtaattt gagaaattac aactcttctc cacctactat tttagaatat tcagaaaaat 3000 ggatggatga tttttcttcg aaaatttgtc ctgattttgt tcctccgtcc attgattaca 3060 aaactaattc aatcaattac tttcctgaac tctgtaatcc attttcaata gctgaatttg 3120 acatggcttt atcaataacc aaaaatactg ctccaggtat tgacaacatt aaatttattg 3180 twcttaataa tctaccttat gaaggaaaac ttcatttact ttcgatatat aatgctttat 3240 tttgtcaaaa tattattcct tttgaatggc gtttcatcaa agttataaat attgtcaaac 3300 ctggtagaga tccagcatkc gctgatagta gaagacccat tagtttattg tcatgtttgc 3360 ggaaattatt agaacgtatg attttgaatc gccttgamtt atgggctgag gacaataaca 3420 ttttttcatc ttctcaattt ggatttagaa gaggtcgagg tactcgtgat tgtattgcmc 3480 ttttagcttc acaaattcag ctttcattca ataaaaagca ggatgttgtg tcaacattcc 3540 tggatgtttc tggagcttat gattctgttt taattgattt attatacaat aaaatgcata 3600 atttgaaaat tccaaattta atttcaaatt ttgtatgcaa tttattttct ttcaaaatca 3660 tgaattttta tcataatgga aaatcgaaaa taattcgtta tagttatttc ggtctaccac 3720 agggatcttg tttaagccct ttcttgtata atttattcac aagtgacatt gcttcgatta 3780 ttcctaatgg atgttattta ttacaatttg ctgatgataa cgtcatttct atcagtggca 3840 aaaatagaga aattataaga cattttatgc aatgtgccct gaacaatatt ggtatgtggg 3900 ctcatgaaaa tggtttttct ttttctgttc aaaaaacaaa atatataata ttttcgagaa 3960 aatattcgac ttctttgatt aatttatatt taaatggtat tgaaattgaa caagttgatg 4020 aatataaata tcttggtata tggtttgatt ttaaactgaa ctggaatgct catattaaat 4080 atattcagaa agtttgttca aaaagaataa actttcttcg tgcaattaca gggacttggt 4140 ggggcgcaaa tccttctgat ttaattacac tttacaaaac tactattcgt tcagtaatgg 4200 aatatggttg tttcacattt gggagcgctg cacgtacata ttttgccaaa cttgaaaaaa 4260 tacagtttcg ttgcttgaga atttgtttga aacttatgaa ttcaacacat actcaatcaa 4320 tagaagttct tgctggcata ttaccactga aaattcgttt tcaagaattg aattgcaaat 4380 ttgtgatgaa ttgtttttca aacaatcatc ctcttattga ttttttgaaa tctttatttg 4440 atattaatcc tacgagcaaa atgttagatt ccttcgttca ttgctctaca gaaaatattg 4500 aaccaatccc ttcgtatgcc tttcataatt caaaaacttc cattcattca tacaaaccat 4560 taattgacct gtctctattc catgaaatga aacaaattcc aaaatttcac cattctcatt 4620 ttgctaacat tttgttcgag cgcaaattca atggaattac tcccgcacaa tttttttata 4680 ctgatggttc attgattgaa ggaatcgcag gttttggtgt ctataattta ttttctggtc 4740 atttttttaa attgcaatct ccttgttctg ttttctcagc agaattaatt gctctttact 4800 ttgcgtgcaa tttaattaaa gattgtgcac caaatattta tattgtttgt tcggatagtt 4860 tgagttgtct ccaggctttg aatacaatta atttcaactt caaaactcat catattatat 4920 taattatgaa aaaaatattg cttgaactca gttctcgagg atttatcatc aaatttgttt 4980 ggattcctgc tcattgtaaa atttatggta acgaacaggc agatacgtta gctaaatttg 5040 gtgttcatcg cggaataact tacagccgta tcatactacc ttctgaattt tatccaaaat 5100 tgaaagttaa atctcttttg gattggcaaa cgtgttggaa ttcaagtgac aaaggacgtt 5160 ggtgtcattc tatttgtcct aaggtcaatc gttttccatg gtttaaaaat atgccagcaa 5220 gtagaaattt gatttgctct ttttcgagat tgatgtcaaa tcattataat tgtaacagcc 5280 atttgtatcg catcaacata aaagattcca atttatgtga ttgtggggaa ttttatgaag 5340 acatcgatca tattgtattt cagtgtgaca aattcattac tcctagaaat aaattaatca 5400 acaatatagt agatttgggt caacctattc ctgtatctgt acgtgatata cttggtacaa 5460 aatggtatcc tgtaatgaaa cttttatttc agtttttgaa tgagatctca tatttggttt 5520 gatactcttt gttttttttt tacttttatt ttcagatgtc acgttggatt cgattttgac 5580 tctctccctc actggcttcc gatgacmctc ctaacttaac gccgatkgtt atgtttcaag 5640 attacggctc tgttatggtt tgttaccgtg tgagccttta gttttaagtt tgatttgtaa 5700 cgtttattga aaagataaag aggttttgtg cctctttgag aaagatttcc aattgaaatc 5760 actcaaaggg gtttttccct ctttcaaaat tttagttaaa aataaataat aataataata 5820 ataataa 5827 // ID Copia16-NVi_LTR repbase; DNA; INV; 293 BP. XX AC AAZX01024383; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia16-NV; KW Copia16-NVi_I; Copia16-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-293 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1154-1154 (2007). XX DR Genome; AAZX01024383; Positions 632 340. XX SQ Sequence 293 BP; 95 A; 50 C; 33 G; 115 T; 0 other; tgttaaaata acaatataag tttcaataat atctaattca actattatct tataattgtt 60 atatgaatat gtaaaactaa gaggtagtct gcactttgta gtgctaccaa ccccttttgt 120 ttttcttaag ttactacctc ttatgaatat aggttagtct tcacgcgctc ccactttttt 180 ttttttgttc tgttcctaca cacgaacaaa tatgtaaatc ttagagataa ataaatcttt 240 tatttactgg aacatcgcaa gttaagagtt aattattcac ccaattctta aca 293 // ID BEL-193_AA-I repbase; DNA; INV; 7241 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-193_AA_; KW BEL-193_AA-LTR; BEL-193_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7241 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 877-877 (2011). XX DR [2] (Consensus) XX CC Positions [5163-5747] - Integrase core CC 'GCATT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 720..6116 FT /product="BEL-193_AA-I_1p" FT /translation="MPAADLRALVKQERHAWVTLDNIQEFLRTYDDQRDKN FT ELQFRMQRLDEVYNKYCEVRVKIEVITDDLDTVEVEGAVGAEGVDDAPVDQ FT AAMAEARQRENEEIFKEFENKYFRLKQQLHTKMIGNQGVIVERQAGAEVCQ FT PSRMKYPELRLPTFTGKHSDWINFRDNFRSLIHANNQLNMIDKFNYLRSSL FT KDEALLQINQIQVTAANYTLAWETLEAKYENHELIAQEHLRALFAVAPMKT FT ESFQGLNHLLTTFRVNLQQLEKLGERPNDWSTLLAFMLSQRLDDDTIRHWE FT THHSSKDIPTYQRMVEFLENHCSILQSTSARKSSDFKRPVKPPVVHAAVSP FT SGNCQICNGGAHSIEQCRRFGKMKVIDRKATVRRLGLCLNCLRPGHYVVDC FT SRSSCGKCGQNHHHLLHPFVASVQQEQTSSQPSSQGPPKRPQSANPPPQHP FT NQAQNRNTSHNTPQSTQTQSTSSQNAPPTNTPTVHATATPFETLRQSNIAL FT LSTAIVKLGDRYGNTVLARASLDNGSQICVMSESLSQRLNFQRLRENLPVD FT GVGGCSSVSKQAVLASVSSRSSSYVSCELKFYVLPKVTSNLPQQNIDVSSW FT NMPERITLADPSFNESGAVDVILGNIIFYDLLLNAQRRISDSGLILRNTRL FT GWIVAGGLPETSVVSYSAVASSPVTTEELYEELAKFWELESCRTKSCLSIE FT ESTCEAIFQKTTTRDSDGKFRVTLPKKKNMLEKLGESKAIAKKRFLAMERR FT LNGNPEMKALYTAFMHEYLSMGHMEEVASDDEEPGPEYYVPHHAVIKPDST FT TTKLRVVFDASCSTDTGVSLNDVLMVGPVVQDDLRSILLRFRLFKYAVVGD FT AEKMYRMVWQHKSDQQLHKIFWRDSIDEPLRAYKLTTVTYGTASAPYLATR FT CLNKCAEDGAERYPSGSVVVKKSFYVDDMLAGADTVEESQQLCKEVLELLK FT GSGFNLRKWNSNNPEVLKEIPLHLRDDRVKLDLDEKATVKTLGLTWEPATD FT TLWMKVPDWRPGGPITHRIVLSEIARLFDPYGLVGPVIVLGKLFLQDLWRG FT KYSWDEPLSGELQPKWLEFRTNLRELDAISVPRWINFGKDVVSCEFHGFSD FT ASDKAYGAAIYLRCVNQDGQVTINLLMAKSRIAPLEDLSKRKKKQNIPRLE FT LSAALLLAHLFESVSSSMMIAAKSFFWTDSTIVKCWISSHPSRWQPFVANR FT VSEIQHATTEGIWNHVPGIENPADIISRGAAPAEIANNSLWWNGPEWLRKE FT THYWPKNAQVHEQQFDSVTLEERPLVSAVLQILPPSEIFKLRSSLSNLVRP FT TAWLRRFSFNCKPQNRQQRRSGIITVAEHEEALLVLVKLAQSECFPLELSD FT LAATSAVRPASKLRTLNPKMIDGVLCVGGRLGNAPVSRGRRHPIVLENRHP FT LTKLIMIDYHHRLLHGGPQLMISCVRERFWPLNVRNLARQVTHECVKCFRA FT KPRAHEQLMGDLPFERVSPAPAFQRVGIDYCGPFELQHARRKGAPLKCYIC FT LFVCMVSKAVHIEVVTDLTADAFLAALRRFVARRGRPEQIFCDNATNFVGA FT RRQLSELNRMFRQQQFQERLSTEAARNSITFNFIPAKSPNFGGLWEAAVKS FT LKGHMRKVIGTRRLSQDEMQTVVIQIEACLNSRPLTPLSNDPADLQVLTPG FT HFLIQRPLTAIPEPNLPDLSEGRLSRWQKVQRYSQMIWKHWSTDYLSNLQQ FT RNKWTRQRNNLNVGTMVLLKEDNIPPLKWKLGRVMEIYPGQDGNVRVVAVR FT TQDGRFQRAISKICILPIRDNQTEEDQD" XX SQ Sequence 7241 BP; 1906 A; 1714 C; 1866 G; 1753 T; 2 other; tttggtcctg acgacccgga tttacgccaa acccgttaat agcctttcgc gtcgcttggg 60 ttcgataccg gtcatcgaac tgccggacaa tagacaagtg cattgtgttt attcgctttg 120 tggtgccggt gtggggacgt cgcttgaaaa gtgaatgtga ttgcccgggc agtgtggtga 180 taatagaaga aaattcaaaa cgtcgccttt gctgaggatt ctttgccgtc gcttctgcgg 240 ccgttggatt tgcacgtgtg gttgctggag acgatttaga ggatcggaga catcgctcgc 300 tgttcctgct gaccatcatt gctgtgccca aggcccaaag tggacatctg gagatcgcca 360 aaggacgtca gataagtaca gtgtgtgcta gaggttgagt ggttgaggta cctggcttgg 420 attgcattgt gcgcctgcca gtggtgtgct tgctctacct agggtagaca aatacaaaac 480 gctcctgatg agcatataaa tcgcatgcaa gattgtacaa tttaagtaga tattgtggag 540 atcgtgtttg agattctctg gttttccgat ttattgcctt tgaaggcaca atttgagatt 600 gctggaaagt gattaattca ctttgatmat tttggcctgt cttctgtcaa ttcgcgcggt 660 cgtttgaggt ctcgggagtg agttttgcta acagtgattg tgtagtgctt ccagtcatca 720 tgccagctgc ggatcttaga gcacttgtga agcaggaacg tcatgcctgg gttacgttgg 780 acaacataca ggagtttttg agaacgtacg atgaccaacg cgacaagaat gaactacagt 840 ttcgaatgca gagattagac gaagtgtaca ataaatattg tgaagtgcga gtgaaaatcg 900 aagttattac tgatgacttg gatactgttg aagtggaagg tgctgttggt gctgaaggtg 960 tagatgatgc tccagtcgat caagcagcca tggccgaggc tcgacagagg gagaacgaag 1020 agatcttcaa ggaattcgag aacaagtatt ttcgcttgaa gcagcaactc catacgaaga 1080 tgatcggaaa ccaaggggtg atagttgaac gacaagcagg agccgaggtt tgtcagccgt 1140 cgaggatgaa gtatccggaa ttaagactcc caacctttac tggaaaacac tcggattgga 1200 tcaatttccg ggacaacttc aggtcgttga tacacgccaa caaccaacta aatatgattg 1260 ataagttcaa ctacctacga agttcattga aggacgaagc gctgctacaa atcaatcaga 1320 tccaagtcac tgctgcaaac tacacacttg cgtgggagac tctagaagcg aagtacgaga 1380 atcacgaact gattgctcag gagcatttga gagctctctt cgctgtcgcg ccaatgaaga 1440 ctgaaagctt tcagggactc aatcatcttc tgacgacgtt cagggtgaac cttcaacaat 1500 tggagaagtt aggagagagg cctaacgatt ggagtactct gttagcgttc atgctttccc 1560 aaaggctgga cgacgatacg attcgtcact gggaaactca ccatagttca aaggacattc 1620 ctacttatca gcgcatggtg gaatttctgg agaatcactg ttccatactg cagtccacgt 1680 ctgcacgaaa aagcagtgac ttcaagcgcc cggtcaagcc tccagtggtt cacgctgctg 1740 tatcgcccag cggaaactgt caaatctgta acggtggagc acattcgatc gaacagtgta 1800 ggcgtttcgg gaaaatgaag gtcatcgata ggaaggcgac cgttcgacgg cttggattgt 1860 gcttgaactg tctacggcca gggcactacg tcgtggattg ttcgagatct tcgtgtggca 1920 agtgtggtca aaaccatcat cacttgcttc atccgttcgt ggcttcagtt caacaagagc 1980 aaacttcgag tcagccctcc tcccaaggcc caccgaaaag acctcaaagc gcgaatccgc 2040 caccccagca tccgaatcag gcccaaaacc ggaacacgtc tcataacact ccacagtcta 2100 cacaaacaca aagtacatct tcgcaaaatg cacctcccac caatactcca actgtacacg 2160 ccactgctac accattcgaa acgctccgtc agtcaaacat cgcgcttcta tctaccgcta 2220 tcgtgaagct cggtgatcgt tacggaaaca ccgttcttgc acgcgcttcg ttggataacg 2280 gatctcagat ctgtgttatg tctgaatcgc tgtcacaaag gttgaatttt caacgtctac 2340 gagaaaattt gccggtcgac ggagtgggtg gttgttcatc tgtcagcaaa caagcagtgt 2400 tagcatcagt ttcgtcgcgc agctcatcgt acgtgtcgtg tgagctgaaa ttctacgtgc 2460 ttccgaaggt gacgtcaaat ttgccacagc aaaacattga cgtgtcttcc tggaatatgc 2520 cagaaagaat caccctagcg gatccatcct tcaacgaatc cggagccgtc gatgtcattc 2580 ttggaaacat catcttctac gaccttctgc tgaacgcaca gcggaggatt tctgattcag 2640 gattgattct acgtaataca aggctgggat ggattgtcgc cggcggtctt cctgaaacgt 2700 ccgtggtcag ttacagcgct gtwgcttcct cgccagtgac tactgaagag ctatacgagg 2760 agctagccaa attctgggag ttggaatcat gtcgcacgaa aagttgcctg tcgattgagg 2820 agtccacatg cgaagcgatc ttccagaaga cgactaccag ggattcagat gggaagttca 2880 gagtaacgct accgaagaag aagaacatgc tggaaaaact tggagagtcg aaggcgattg 2940 cgaagaaacg tttcctagct atggaaaggc ggttaaatgg aaacccggaa atgaaggcgt 3000 tgtatacggc attcatgcac gagtaccttt ccatgggtca tatggaagag gtagccagcg 3060 atgatgaaga gccaggtccg gaatactatg ttccacacca cgcggtgatc aaaccagaca 3120 gcactactac aaaactaaga gtagttttcg acgcatcctg ttcaacggac acaggtgtgt 3180 ctctgaacga cgttctgatg gtgggacctg ttgttcagga cgacctccga agcatattgt 3240 tgcggttccg gttgttcaag tatgcagtag taggagacgc ggaaaaaatg taccggatgg 3300 tatggcagca caaatccgat cagcagttac acaaaatatt ttggagagac tcgattgatg 3360 aaccattgcg agcgtacaag ctcaccaccg taacatacgg aacagcatca gcgccttacc 3420 tcgccacgcg gtgcctgaac aagtgtgctg aagatggagc agagcgatat ccatcgggat 3480 cagtggtggt caagaagagc ttctatgttg acgacatgct tgcaggtgcg gacacagttg 3540 aggaaagtca acagctctgt aaggaagttc tggaacttct gaagggttct gggttcaatc 3600 tgcggaaatg gaactctaac aatccggaag ttctgaagga aatccctttg catcttcgag 3660 acgatcgagt gaagctggat ttggatgaga aggcaactgt gaagacacta gggctcacgt 3720 gggagccggc tacggatacg ctgtggatga aggttccaga ttggaggcca ggtgggccaa 3780 ttacgcatcg tattgtgctg tcggagatag cacgcctttt cgatccctac ggcctagttg 3840 gtccagtgat agtcctcgga aaactttttc ttcaagatct ttggagagga aaatactcct 3900 gggatgaacc actaagcggt gagctccagc cgaaatggct ggagttccga acgaatctgc 3960 gtgagttgga tgccatatct gttcctcggt ggatcaactt cggaaaagac gtagtctcgt 4020 gcgagttcca tggattcagc gatgcaagtg acaaggcgta cggagctgcg atatatttgc 4080 gttgcgtgaa tcaagacggt caagtcacaa tcaatcttct aatggccaag tcaaggattg 4140 cgccgttgga agacctcagc aaaagaaaga agaagcagaa tatcccacgg ctggagctat 4200 cagcagcgct attgttggcg catctctttg aatctgtctc ctccagcatg atgatagctg 4260 cgaagtcttt cttctggacg gactcgacaa ttgtaaaatg ctggatatct tcgcatccgt 4320 ctagatggca gccatttgtg gccaatcgtg tatcggagat acagcatgcc acgacggagg 4380 gaatatggaa ccacgttcca ggaatcgaga accccgcgga tataatttcg cgaggcgccg 4440 ctccggctga gatagcgaat aattcgctat ggtggaatgg tccggaatgg ctgagaaagg 4500 aaactcacta ttggccgaag aacgcgcagg ttcatgaaca gcagttcgat tcagttactc 4560 tggaggagag accgctagtg tcagcagttc tgcagattct gcctccaagt gagatattca 4620 agttacgttc gagcctttcg aacctcgttc gtccgactgc ttggttacgc cggttttcct 4680 tcaactgtaa accccagaac aggcagcaac ggagatcggg aatcatcaca gtcgcggaac 4740 acgaagaagc cttactagtt ctagtaaaac tagctcagtc ggaatgcttc ccattggagt 4800 tatcggatct tgcagcaacg agtgctgtta ggccagcttc gaaacttcgt accctgaacc 4860 cgaagatgat cgatggagta ctttgcgtcg gcggccggct cggtaatgct ccagtctccc 4920 gcgggcgtag acacccgatc gtactggaga accgacatcc actgacgaag ttgataatga 4980 tcgattacca ccatcgactg ctgcacggag gaccgcagtt gatgatatcc tgtgttcgcg 5040 aaaggttttg gccactgaac gttcgaaacc tggctaggca ggtgactcac gagtgcgtga 5100 agtgtttcag agcgaaacca cgagctcacg agcagttgat gggtgattta ccgttcgaga 5160 gagtatcgcc agcccctgca tttcaacgtg tgggaattga ttactgcgga ccgttcgagt 5220 tgcagcatgc gagacgaaag ggcgctccgt taaaatgcta catctgcctg ttcgtctgca 5280 tggtcagcaa ggccgtccac attgaagtgg ttacggacct tacagctgat gcatttcttg 5340 ctgctctacg gaggtttgtt gcccgacgtg gaaggccgga acaaatattc tgcgacaatg 5400 cgacaaactt cgtgggagcg cgtaggcagc taagcgagct caatcgcatg tttagacagc 5460 agcagttcca agaacgactg tctacagaag cagcgcgaaa ctccatcacc ttcaacttca 5520 ttccggccaa gtcacccaat ttcggtggct tatgggaagc cgcagtcaag tcgctaaagg 5580 gacatatgcg gaaggtcatc ggcactagaa ggctaagtca agacgagatg cagacagttg 5640 tgatacagat agaggcatgc ctcaattcga ggcctctgac tccgctgagc aacgatccag 5700 ccgacttgca ggtgctaacc ccaggacact tcctgatcca acgtccactt actgcgatac 5760 ccgagccaaa tcttccggac ctatcagaag gtagactctc cagatggcag aaagtgcagc 5820 gctacagtca gatgatctgg aaacactggt ctactgacta tttgtcaaac ctccaacaac 5880 gcaacaaatg gacaaggcag cgaaacaacc tgaacgtcgg gactatggtt ctgttgaaag 5940 aggacaacat tccaccgctc aagtggaagc tgggacgagt gatggagatc taccctggcc 6000 aagacgggaa cgtgagggtt gtcgccgttc gcacacagga cggtcgcttt caacgggcaa 6060 tatcgaaaat ctgtattctc ccgatacgcg acaaccagac cgaggaagac caggactaag 6120 tatttacgcc cccttatggg gcatctttag gcatcctgcg gtatagaata tcgctactgc 6180 ctgggggcaa ccctacagtt acgtattcta tcatttgtgt gtccgggacc attctgatgc 6240 gttctccctc gacagtgttc atctacatta cgaatcctgc gtggagccat catctgttct 6300 ggcttatatc gtcgccctat cctgtaaatc tcatcgaagc taagtacgcc cccatttggg 6360 gcatccgttg gcaatccgtg gcatcctagc cacatccgcc gagggggcga cacctaagtt 6420 atgtttcatt catccaatct tgaattccgg atcccaatcg ctgttattct ttggcagata 6480 cggatcaact ctaccaattc gtttgcggtc atcgctgaga tcactggaga tcaacacagc 6540 taagtatggc cccatttggg gcaacaattg gcgattcacg acggttaacg tcgtatatgc 6600 caggggccgc agcacaagtt atgtttatta gcgttaagta ttatttcgga agccaatcga 6660 atttccttcc atcagcatca acccacggta cgcaaactcg ccggacaagt actcttccac 6720 ggatcatcat caatctctgg cagtcaacga taatcggaga tgccttcaag ctcgttctac 6780 cactacaact tccgctcaat cgtttcacgc ccccatttgg ggcacacttc ggcaaatcag 6840 atagtttcat ctggtatcgc cagggggcga ctcctaagtt acgtccctca tctttcaaca 6900 ttagcgtcgg acttaagcca tcttattctc caacaggaaa cgaggatacg gggatcaact 6960 ctagtggtca aggtcatcaa gattgttcca ggcagcggaa gtttgtgggt agtgaagcca 7020 ccaaaggtca tcatcaggaa atcaacagat caggtcgaag ttatcgctga ctttcagacc 7080 ggctgaatgg cccaatcgca aagcgagtgg atgcatcagc tcgggtagag ttcgacgaga 7140 attgaaggaa ccgtgaatta tctagtacca cattttgaat tgtttcttta cctttagaat 7200 atttgaatta attgttttga gtgaactcaa ggcggccggt a 7241 // ID BEL-99_AA-LTR repbase; DNA; INV; 487 BP. XX AC AAGE02034270; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-99_AA_; KW BEL-99_AA-I; BEL-99_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-487 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02034270; Positions 4622 4136. XX SQ Sequence 487 BP; 158 A; 114 C; 82 G; 133 T; 0 other; tgttgaggcc accagcaagc cgccggggtg ttttcataca ccactcaggc ggacgatcat 60 gcctcatgcc ggcagcctca caaacacacc acaacactac atactcacca catttttcac 120 ctggatccat ctagaaatac ctaaaagtag aaaaagaaca ttaatgctat tgaaatcata 180 cttaccttta tgccttcaga acacactaga actacctatt gtttttgttt tgctttgatt 240 gtttgacagt atgctagaat gttcgaagat tcaggctagg ttacactttg ctttcgcaaa 300 cacgctaata ataagtaagg caacaataga gtattgaacc aatggaaaat tccacacgct 360 tattgtaaac gagcagtata aataacaccc agttggcaaa ggtcaagtca gtctgaaata 420 aaccgtgaat tcaaacgtat ccactttgtg aaaattattt cttccctagt tcgttaccgc 480 tcgaaca 487 // ID hATm-50_HM repbase; DNA; INV; 3721 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-50_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3721 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1944-1944 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1267..2778 FT /product="hATm-50_HM_1p" FT /translation="MRLSLPSLSLACDRHGLSDRSAAAVASAVLEDIGIIS FT YDNKTQVIDRSKIRRERIKTRDNQKVGEIQVKALYFDGKKDRTLKFDSNKR FT TIVMEEHISLIQEPGGGYLGHCTPLSGTAESLKFAIIQYSKEHNIILDNLQ FT VLGCDGTNVNTGNKGGVIRLIELELQRPLQWVICLLHANELPLRHLFAELD FT GPTSGPRCYSGLIGKSLETCEVLKVVDFQPINHNVEMPEVDRKVLSTDQLY FT LFDICQAVLSGICSTNLEKRSPGKLAHSRWLTTANRILRIYVSTSSPSSNL FT VILSTFIMRVYAPMWFIIKKESSIFHGSRHLYSTIAKSRYLPAPLRKVVDK FT VIQTNAFFGHQENILLAMVSDDQPHIRELGVRRIQRARKERIMKANQSSVR FT QFKVPTLNFMANNTWELIDWSNSQLSEPPLTMQLTDEQLEEVKEHRILLSD FT LLDKLPCHTQAVERCVKMVTEVSASVCGEINRDGMIRVRLQSRKNMPIFNT FT KAHYNTN*" XX SQ Sequence 3721 BP; 1283 A; 641 C; 654 G; 1142 T; 1 other; ttagggtgga gtgaaaacaa tttttttaga aattcatatt ttctccctga tttctataat 60 aattatagga aaactaaaaa aatgagacca agatgagcaa aattggacaa gaattacccc 120 cagcaccaat ttttgaccaa ttacttatta ttgaaagcgc aaacttaaaa tttttttaaa 180 tatttttttt gaagaagctt tgaagttata acatattagg tatgtttgta ttatccatta 240 gtattatgac caaaatactc acttttgcat tgaaaaggtt attstaatac ttttaactca 300 tttaaaagat attttttgaa ataaagttaa tatttagcat ataataaaat ccacgctcta 360 ctactttccc caaaccatat ttttaaaggt tagtattcaa ttaataaata ataatattaa 420 ttataatatc tataatactt gttatcattt agtcagtaat aataattatc accatttata 480 tattttgagt tttgttttaa atttcagtaa ataaaatgtt aagatctaaa acagatcatc 540 cattctttgg gtattcaaag ccactctctg atgtgcaact gccaacagga ctagatgtta 600 tgcggcatat ggtgtattta aaagaaaatg taagcaagct atattacata tcgtttgtca 660 aataataata caattatatg taaacatata tatatatata tatatatata tatattataa 720 atttacagaa acctgctgga aaaaccaaca aaaagaatga agcaggttca accgcaatag 780 aggtcttaag aatttgggga aaatgtaaca taccaactgt gtcatcaatt ccatctttga 840 caaagagagt ttttaacctt tatgagcaga acataaagct gcaaaaaact ccaacaaaca 900 ggaaaaataa aaaaaacaaa tttcagctct acaaggaaaa actagcaaaa ctgtttgata 960 tatctgcttg ccaatgcaaa gatctaaaga aatgctgctg tgccaaagtt aatatggtac 1020 cagctatgga gcaccctttc ctggaagatc aaaggagtga acgcaaaatg gctataggtt 1080 caatagacaa aaaaattaca gcaaaactgc agaatatcca aagcaaacag agctcaaacc 1140 ttaattatgc agtgtcatta gattctttca tttctaccaa acctattact tcaaaacggc 1200 ctgctatcga aacggcctgc tatcgttgaa aacacccagg catcaacacc atcatcttca 1260 aagcagatgc gattgtctct cccttctctt tcattagcct gcgaccgcca tggactctca 1320 gacagatcgg ctgcagctgt ggcatcagct gtacttgagg atattggcat tatttcatat 1380 gataacaaaa cacaagttat tgaccgcagc aagatcaggc gtgagcgcat aaaaacaagg 1440 gataaccaga aagttggtga aatccaagta aaggcgttgt actttgatgg caaaaaagac 1500 cgaacactca aatttgacag taataagaga accatagtga tggaggagca catatccttg 1560 attcaggaac ctggaggagg ttacttggga cactgcactc ctcttagcgg aacagctgaa 1620 tcactaaaat ttgcaattat acagtattct aaagagcaca atattatttt ggataatttg 1680 caagtgctgg gctgtgatgg aacaaatgta aacacgggaa acaagggtgg tgttatcaga 1740 ttgattgagt tagaacttca acggcctcta caatgggtaa tttgtttgct acatgcaaac 1800 gagcttccac tacgccattt gtttgcagag ttggatggtc caacatctgg tcctagatgc 1860 tattcaggct taattggaaa gtcccttgaa acttgtgagg tcttaaaggt tgttgatttt 1920 caacctataa accataatgt agaaatgcca gaggtggaca gaaaagtgct cagtacagat 1980 caactttatc tttttgacat ctgccaagca gtattaagtg gtatttgttc taccaacctg 2040 gagaagcgca gccctgggaa actggctcat tctaggtggc tcaccactgc caataggatc 2100 ctcagaatat atgtttctac cagcagtccc tcgtcaaatt tggtaatttt atcgacgttc 2160 attatgcgag tttatgcccc tatgtggttc atcatcaaga aagagtcttc tatctttcat 2220 ggcagcaggc atctgtacag cacaattgca aaatcacgat atctgccagc tccattacgt 2280 aaagtagtag acaaagtaat tcaaacaaat gcattctttg gtcatcaaga gaatattctt 2340 ctggcaatgg tatcagatga tcagccacat atccgagaat taggtgttag gcgtatacag 2400 agagctagaa aagaaagaat tatgaaagct aaccaatcaa gtgtgcgtca gttcaaagtt 2460 cctactctga attttatggc aaataatacg tgggagttga ttgattggag taattcgcag 2520 ctgtcggagc caccattaac aatgcagctc actgacgagc agctggagga ggtaaaagag 2580 catcgcattc tactatctga tcttttggat aagttaccgt gccacacgca agcagtggaa 2640 cgttgtgtaa agatggtgac agaagtttcc gcatcagttt gtggcgagat aaatagagat 2700 ggtatgatac gtgtccgtct acaatctagg aagaatatgc caatatttaa taccaaggca 2760 cattataaca ctaattagta caatttttga ctctataaat gtaaatattt tgtattatat 2820 gcacctgttc aattctaaaa ctctcttaaa ataacctgtt tataaattaa tataaatcca 2880 tttgaaagaa atgatactaa taatcattat ttaaattaat aaatattaaa tcagtggcgg 2940 atccagcgtt ttttgttgtt agagatagct aacttccaga catggagcaa actttaaaca 3000 ttcatattta tacttacgtc aaatagggat gctttgcaaa aaattgcaat gattggggcc 3060 cgggaacaaa aatgccccaa acaaattatc ccctacccca aatttgaggg caactagttt 3120 gcaacatatt ttttttactt ttctcaactc aattagaatt catcgataaa tttcaaattt 3180 cgtgaattaa tctcttattg ttcaaaatta tggcaacata aaattgcgac cccctcaaat 3240 tgagggggat caactttata cggcataact tttgaatgct aagagattat ttcatgaaat 3300 ttgaaatcta tcgatgaatt ctaattgagt tgagaaaagt aaaaaaaata tgttgcaaac 3360 tagttgccct caaatttggg gtagggggat aatttgtttg gggcattttt gtttacgggc 3420 cccaatcatt gcaatttttt gtaaagcatc cctatttgac gtaagtataa acatataaat 3480 gtttaaagtt tgctccatgt ctgttagcta tctaacacca aaaaacgctg gatccgcgcc 3540 tgtattaaat gtagaaaatt tggaaacttt tggtttagaa tagtccaaaa attggtgctg 3600 ggggtaattc ttgtccaatt ttgctcattt tggtctcatt tttttagttt tcctataatt 3660 attatagaaa tcagggagaa aatatgaatt tctaaaaaaa gttgttttca ctccacccta 3720 a 3721 // ID L1-34_AAe repbase; DNA; INV; 4426 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-34_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4426 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1387-1387 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 142..1179 FT /product="L1-34_AAe_1p" FT /translation="MSVRRENTFRIDYANVPKKPTFEELHDFVGSTLGLQY FT EQVVRLQPSRALGCAFVKVVDLELAQKIVADHDNKHETEVDGKIYKLRITL FT EDGTVEVKLTDLSEDVSNERIAEFLSAYGEVLSVTDQVWDSKFRFAGLPTG FT TRVARMLLKRNIESYVTIDGQVTNVVFFGQLQTCKYCSEFVHNGISCVQNK FT KLLVQKTYANVAKQTVTNPSAAKPTATKQKTPFSKLFRSKSKETKQQSQNQ FT PTKMTAAESAVAAPKQTMPITSDQTTSSSPLTKTNVAEKSGSLMAPPAAPQ FT SVQTASRMTTRQASDGNETDISSASTSSKRRCGRPPGKKLRHNHDDDVNAE FT LDD" FT CDS 1163..4357 FT /product="L1-34_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRSWTINPMAFNSYNIASININTITNPTKINALCSFL FT RSSEIDIAFLQEVENEQLQIPGFNVVCNVDHNRRGTAIAMKDYIQFSNVEK FT SIDGRIIALRIQNTTLCNVYAPSGTALRAERERFYSNTLAFYLRHNTDNTL FT LAGDFNCVLRQRDATGHNHSPALLATIQQLQLVDVWEKLCPNMPGYTYMTH FT NSSSRLDRVYVSQSLRSHLRTIDIHVCSFSDHKAITARICLPYLGREPGRG FT FWSLRPHLLTTENIQEFHCRWQYWTRQRRNYNSWMQWWLSYAKPKTTSFFR FT WKSKAAFNDFQREYQRLYTQLRQAYDGYFQNPAMLSTINRVKAQMLTHQRN FT FTQTFMRVNESYVAGEPISSFQLGDRRRKRTTITQLRDDNDHLINTPEAIE FT RHMFVFYRSLYAAGQTDRQVADTFECERVIPENDPSNEHCMSEITPADVFS FT AIKSSSPNKSPGGDSLPREFYLKTFDIIWREITLVMNDALAGNFPADFVNG FT VIVLVKKKGNDQTAHAYRPISLLNCDYKIFSRILKQRLENVMRAHQVISDG FT QKCSNFGRNIFQATLALKDRIAQLKKRKQRGLLASFDLRHAFDLVDREFLS FT RNMRSLGFHSDLVRLLNKIGELSTSRLLVNGRLSEAFPIERSVRQGDPLSM FT HLFVLYLHPLIKRLEQIGGPDLVVAYADDVSVISTCRARLERMRETFRCFE FT RVSGARLSLEKSSSISVGYTDDMPLTVPWLRNENTVRILGITFANSVRLMS FT KLNWDELVGKFARLVYLHAQRTLTLHQKVTLLNTFITSKLWYLASVLSASA FT AQTAKLTATIGTFLWRGITARVPIQQLARIKKEGGMNLHLPALKFKALILN FT RHLQEIDSLPFYKSFFTQTNPQPPADCPCLKIILTEFPSLPPVLQDSPCAG FT SIYSTYLERTDPPKISQKYPAANWPQIWQNISLRQLSSYQRSSLYLVVNAK FT LEHRSLMFRMNRADSENCLHCGGARETIEHKLSGCVRVAAAWRMLQQKINS FT VLNGWRRLTIEELLRPQLQGIEKLKKIRILKMFHEYIIFIMNCNNAIDVLS FT LEFEIDCV" XX SQ Sequence 4426 BP; 1258 A; 1096 C; 999 G; 1073 T; 0 other; cagttcgcgc tcaacttcca tgctgagcag tcgtgttctc tgctggaagc cacacgcaag 60 ctattgttta ttttttttct ttcggattgt atccaaaatc tcgtttcgcg cttcgttttc 120 gtcgcgagtt gtgctgccgc gatgagcgtt cgacgcgaaa acacgtttcg tatcgattac 180 gcgaacgtgc caaagaagcc aacattcgaa gagcttcatg atttcgtcgg ttccacattg 240 ggcctacaat acgaacaagt ggttcgtcta cagccaagta gggcacttgg atgcgccttt 300 gtgaaggtcg tcgacctgga gctggcgcaa aaaatcgttg cagaccacga taacaagcac 360 gaaacggagg tggatggaaa aatctacaag cttcggatta cgctcgagga cgggacggta 420 gaggtaaagc taaccgacct atccgaggat gtatcgaacg agcgaatcgc tgagtttctc 480 agcgcctacg gagaagttct ctctgtgacc gaccaagtat gggacagcaa atttcgcttc 540 gctggtctcc caactggtac tcgtgttgcc cgaatgttgc taaagcgcaa catagagagc 600 tacgtcacaa tcgatggcca agtcacgaac gtggtatttt ttgggcagct acaaacgtgc 660 aaatattgca gcgaatttgt acacaacggg atctcctgtg tccaaaataa gaaacttctg 720 gtacagaaga cttatgctaa tgtcgcaaag caaactgtga caaatcccag tgccgcaaag 780 cccactgcta caaagcagaa gacaccgttt tcgaagctgt tcagatcgaa gtccaaagag 840 accaagcaac aaagccaaaa tcaaccgacg aagatgactg ctgccgaatc tgcagttgct 900 gctccgaagc aaactatgcc gatcactagc gatcaaacga cgagttcctc accgttgacg 960 aaaacgaatg tagccgaaaa aagcggcagc ctcatggccc caccggccgc gccgcaaagc 1020 gtgcaaacag caagccgtat gacaactcga caagcaagcg atgggaatga aaccgatatc 1080 tcttcagcat caacaagcag caaacgccga tgtggccgcc cgcccggcaa gaagcttcga 1140 cacaaccacg atgacgatgt gaatgcggag ttggacgatt aatccaatgg ccttcaacag 1200 ctataacatt gcctctatca acataaatac tatcacgaat ccgaccaaaa ttaatgccct 1260 ctgttctttc cttcgttcgt cggagatcga tatagcgttt ctgcaggagg tagagaacga 1320 gcagcttcag atacccggct ttaacgttgt gtgcaatgtg gaccacaata ggagaggtac 1380 ggcaatagct atgaaagact acatacagtt ctctaacgtc gaaaagagta tagatggccg 1440 aataattgcc ctgagaatcc aaaacacaac actatgcaac gtatacgctc catctggtac 1500 ggccctgcgt gccgagagag agcgttttta cagcaacacg ctcgctttct accttcgtca 1560 caatacggat aacacattat tagctggtga tttcaactgt gtgttgcggc agcgtgatgc 1620 gacgggccac aatcacagtc ccgctctcct tgcaaccata cagcaacttc agctggtcga 1680 tgtgtgggaa aaactatgcc cgaacatgcc tggttatacg tatatgacgc acaactcctc 1740 ttctcgccta gatcgtgtgt acgttagcca aagtctgcgt agccacctgc gaacaattga 1800 tattcatgtt tgctcgttct ctgaccacaa ggccatcaca gcacgtatat gtctgcccta 1860 ccttggccga gaaccgggtc gcggcttttg gtctctccga ccccatttgt tgacaacgga 1920 aaacatccaa gagttccatt gccgctggca gtattggacc cgccagagaa gaaattacaa 1980 ctcatggatg caatggtggc tgtcgtatgc caaaccaaaa acaacttcct tctttcggtg 2040 gaaatcgaaa gccgctttca atgattttca gcgtgaatac cagcggctct acacgcagtt 2100 acggcaagca tacgatggat attttcaaaa ccctgccatg ctgtcgacaa tcaatcgtgt 2160 gaaggcgcaa atgttgacac atcagcgtaa ctttacccaa acgtttatgc gagtaaacga 2220 atcatacgtt gccggagaac caatctcgtc gttccagcta ggcgataggc ggcgaaaaag 2280 aacaacgatc acccagctgc gcgatgacaa cgaccatctg attaacactc ctgaagcgat 2340 cgaaagacac atgtttgtgt tttaccgttc gctttacgct gctggtcaaa cagatcgaca 2400 agtagcagac acattcgaat gcgagagggt aattcctgaa aatgatccct ctaacgaaca 2460 ctgcatgagc gaaatcaccc cggcggacgt gttctccgcc ataaagtcca gcagcccgaa 2520 taaatccccc ggcggggact ctcttcctcg tgagttctat ctgaaaacgt tcgatataat 2580 ctggagggag atcactttgg tgatgaatga tgcgcttgcc ggaaactttc cggcagattt 2640 tgtcaatggt gtgatagtgc tcgtaaagaa gaaggggaac gaccaaactg ctcacgcata 2700 ccggccaatt tcactgctca actgtgatta taaaatattt tcacgtatct tgaagcagag 2760 actggaaaat gtaatgcgtg ctcatcaagt gataagtgac ggtcaaaagt gctccaattt 2820 cggtcgcaat atatttcaag caactcttgc tttaaaagac cgaattgcgc agttgaaaaa 2880 acgaaaacaa cgtggtctac ttgcttcatt cgatctgcga catgcttttg atctggtgga 2940 tcgtgagttc ctctcgcgga acatgcgctc actcggtttc cactcggatc tcgtgagact 3000 gctcaacaaa attggagagt tgtctacatc gcgcctgttg gttaatgggc gactatctga 3060 agctttccct atcgagagat cggttcggca aggggaccct ctctcgatgc atctgttcgt 3120 gctttatctc catcctctaa tcaaacgact cgaacagata ggaggacctg atctagtggt 3180 ggcatatgct gatgacgtgt cggtgatctc aacctgcaga gcaagactcg aacggatgcg 3240 agagactttt cgttgcttcg agcgtgtctc tggtgcgagg ctgagtttag aaaaatcctc 3300 gtcaatatcg gtggggtaca ctgatgacat gccgctcacc gtaccgtggc tacggaacga 3360 aaacactgtg cgaatattag gaatcacgtt tgcgaattcc gtacgtctta tgagcaaact 3420 aaattgggat gagcttgttg gtaaattcgc tcgtttagtg tacctccatg ctcagcgcac 3480 actcactttg caccaaaaag taacgttgct caacacgttc atcacatcca aactgtggta 3540 tctcgcttcc gtactctcag catcagctgc tcaaacggca aaactgactg caacgatcgg 3600 aacgtttctg tggcgaggca taacagcaag agttccaata cagcagctag cacgtatcaa 3660 gaaagaaggc ggaatgaact tacacttgcc tgcattgaaa ttcaaagccc tgattttgaa 3720 ccggcatttg caagaaatcg actccctccc gttctacaaa tccttcttta cccaaaccaa 3780 tcctcaacca cctgcagact gcccgtgtct caaaattatc ctgaccgaat tcccatcact 3840 cccgcctgta ttacaagatt ccccctgcgc tggaagcatc tacagtacgt accttgagcg 3900 aacagaccca ccgaaaattt cgcaaaaata tccggcagca aattggcctc aaatttggca 3960 gaacatttcg ctgagacaac tttcatcgta tcaacgaagc tctctctacc tggtggtgaa 4020 cgcaaaactt gagcaccgtt ctttgatgtt tcgtatgaac agagccgaca gcgaaaactg 4080 tctgcattgc ggtggtgcac gtgagacgat agaacacaag ctaagtggat gcgtacgagt 4140 cgccgcagct tggcggatgc tacaacaaaa aattaattct gttctcaatg gatggcgcag 4200 actgacgata gaagagctct tgcgcccgca actgcaaggc atagaaaaac tgaagaaaat 4260 tcgaattctc aaaatgtttc atgaatatat catctttatt atgaattgta ataatgcaat 4320 tgatgtattg tcgctagagt ttgaaattga ttgtgtataa ataattatat tgtaattact 4380 ttttacgcta cggtcaaata aactaaattt tacaaaaaaa aaaaaa 4426 // ID BEL-158_AA-LTR repbase; DNA; INV; 456 BP. XX AC supercont1.323; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-158_AA_; KW BEL-158_AA-I; BEL-158_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-456 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.323; Positions 431064 430609. XX SQ Sequence 456 BP; 168 A; 84 C; 76 G; 128 T; 0 other; tgttgccgct gggtacactg cttacgctca cattatagct acaatcggac aatgacagta 60 gctgacaata gtcagctgga tcgaagatat gcagacagtt cagataggag aaaacattgt 120 tcttcctaca atagaataat tttatctaaa gtgaatttaa taacacaatt caattcatag 180 ttcttcttaa atctaatcgg cgaaagtgag ttaaaattca tgcatcttaa atagtacaat 240 taaaaactaa agtattcaca gatcattgtt gcgagaggag atatggtagg aatcataaca 300 aggattcacc gaaagtgtaa gaaacactat aatatgaact atatatgtgc taataaaaat 360 ctcattgcag tttaaagctg caccaaacag aaaaacaact tttcttcgtg ttgctcacaa 420 cccggtaatc aaatcctacg tatttcgccg ccaaca 456 // ID Gypsy-27_DWil-I repbase; DNA; INV; 5332 BP. XX AC scaffold_181141; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_DWil_; KW Gypsy-27_DWil-LTR; Gypsy-27_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5332 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181141; Positions 564664 569995. XX CC Positions [3585-4064] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 2652..4298 FT /product="Gypsy-27_DWil-I_1p" FT /translation="MYPEYKKPFDLTTDASADGIGTVLSQNGRPVTMISRT FT LKPCEINYATNERELLAIVWSLNKLRQYLYGVKDINVFTDHQPLTFAVSDS FT NPNAKIKRWKARIDETGAKILYKPGRENFVADALSRQNINALNDDDCESDV FT ATIHSEASLSYTITSSDKPVNCYRNQLILEQARFPLKRQFILFGQQIRHLI FT NFSNNETLLGEAKDAINPNLVNAIHCDLPTLARVQHELVKTFPATKLYYCK FT QIVADIFDSNESREIFITEHNRAHRAAQENVKQIMSDYYFLKMTQIANEVV FT TNCKICSIAKYKRHPKRQELGSTPIPGYVGEMLHVDIFSTNKKYFLTCVDK FT FSKFAVVQPIVSRTIEDLKAPLFQLMSLFPKAKTIYCDNELSLNSMTIKTL FT LQNNFGVEIANTPPMHSSSNGQVERFHSTIAEIARCIKLQRGNVDTVDLVL FT LATVEYNHTIYSVTQKTPSQIFCPLSEASKCVRDMINKAQVTQRERVNRNR FT QSRVWNVGDKVFVKVNRRLGNKLSPLYVEETVEADMGTCLLIKGRMVHKDN FT LR" XX SQ Sequence 5332 BP; 1744 A; 1140 C; 1069 G; 1379 T; 0 other; actaaatcaa cccaatactt taaccagggt aagcaattaa tgcacaattt gttgttgttt 60 tttttttgag aaaacgttaa atttgttttt tctgattctc ttttgcgcta tgttttctgc 120 tgtgttttgt tcactttcga acgtgccggt gtcttaaata agaatcgaaa tttaaaatac 180 aaatcaaaat ttaagtgatt ttgattgact gtgtgttaat gaatatcatt ggacatatag 240 agtgctgtag tgaagacctt aagttgcagg atttgattaa agatattgaa ggcttaaaag 300 taacccagaa acaaaataag aggggcggca aaccacacct ttgcttcatt tgatacagtc 360 tttaatttca gagcaataat agcgagactt gactttactt acgctgacca gacccaaggg 420 gagtcattca gcataagctt gccactgtcc ggcaaggcga tcgtaccttg ctagaatatt 480 acgatgaggt agaaagaacg ttgtctcttt taacaaataa gaatgtaatg agcaatgagg 540 aagccgcagc ggcgatccta aatcaaagga ctagggaaga agcattagaa gcttttatat 600 cgggcctaaa aaaatcttta agattagccg cgtttccttc tcggccaaag gacttaccaa 660 cagctttagc cttagcgcag gccgccgaga ctagagaaga tagaagcgtg ctcgctgcca 720 attacgcgaa aaccatggga gataagaacc aaaaattttc aacttaaaaa tcccaagaca 780 accgtcagtc cggtttttgg tctagtaagt cccaaagtgc ccatcagggg aatagtcccc 840 actatcagaa taggaagccc taacaaaata gttccgccaa acaaaggttg tataataaag 900 aaaccacaca ccaggcccca gaaccaatgg atatcgatcc atcttcgtcc aggctaaggc 960 aactgacagc ctatcggagt cgggcggact ccttcagatc tggatatcaa agcggaaagc 1020 tgaaaaactc gtccgaacgc atttcgggtc aaggggcgca aaggatcaat ttgataaaat 1080 aatttgattc aaaaaagtgg cacgagtgaa aactacgatt gcatcgcaaa tggcagcgct 1140 gctgccaccg actgtcaagg cgacatcgag tatgcaaccg aaaacgattc cgaattcgaa 1200 gtagacacag acaatgtcaa tgttttgaag ggaagatcac ttctcccgtt cgtgcaaaaa 1260 aaggtagcag gcggcatctt aaggttttta atagacacgg gggctgctaa gaactatgtt 1320 aagcctctat cttggctaaa aggtgttagc ccggttgaga aacccttttc cgttacgtcc 1380 attcacggct caaccgtaat agaccaaaag tgcaaaattt acatatatgg aataaattca 1440 accttttttg ttttaccgga tctgtcaaac ttcgacggaa tcattggcct cgatctgttg 1500 aaacaggcaa atgccacatt agaccttaag tcgaatagaa ttacagtcgg ctcagaatcg 1560 gaactgcttc tttatgctaa atgtccgaat gtcaatttta ctaacgtagt tggcgagcaa 1620 atatcaccca aatataaaaa cgaatttatt aaaatgttgc agcattcgcg agcccaaacg 1680 aaaaactccc ctacaacacc gctgtcgttg ccaccattag aaccacgaca gataatcata 1740 ttttctcaaa actctatccg tatcccatgg ttgtttcaga ctttgtagcc gcagaggtaa 1800 acgacctttt aaagaatgac atcataagaa agtctagatc cccgtataac aacccgattt 1860 gggtaggaca aaaaatgcac tgacgaaaag ggaaacagga aaaagcgact agttatagat 1920 tttagaaagc ttaatgagaa aactatttcg gatacctatc ccatgcccag cattcatatg 1980 atattagcaa atttgtggaa gtctaaattt tttacaactc tcgatttaaa gtcggggtat 2040 caccaagtaa ctttggctga gagagaccgc gaaaagacag cattctctgt aaatgggggc 2100 aaatttgagt tctgtcggct tccattcggg ctgaaaaatg caggcagtat ttttcagcga 2160 actattgacg acgtcttaag agaacaaatt ggcaaaactt gttatgttta tgttgatgat 2220 gtcattattt tttcagaatc cgagacggct cacattaaac atgtagattg ggctttaaac 2280 cagctatttg aagcgaacat gcgtgtggcc caagaaaaat ctcagttttt taggacaagc 2340 gtcgagtacc ttgggtttat tgtcactatc gagggcgcga gaacagaccc cgaaaaaatc 2400 agaaccataa cagaataccc cgagccgaaa ctctattcgg tgtcagatca ttcttaggcc 2460 tcgcaagtta ttacagatgt tttataaaag acttcgcctc aatagcaaga ccgatctcaa 2520 acatcctgaa gggaaaaaac gggtcaatta gtaaatacaa atcaaaaagc atccctatac 2580 agttcaatga actacaatct cactcttttc ataaactaaa gaacattctg gcgtccgaag 2640 acgttacctt gatgtatcca gaatacaaga aacccttcga cttgacaact gacgcatctg 2700 ccgatggcat aggcacagtc ttgtctcaga acggtcgtcc cgttacaatg atttccagaa 2760 cgctcaagcc gtgcgaaata aactatgcaa caaacgagcg ggagcttcta gccattgttt 2820 ggtccttgaa caaattgcga cagtaccttt acggggtaaa agacatcaac gtgtttaccg 2880 accaccagcc gctaaccttt gcggtatctg atagcaatcc taatgcaaaa atcaagcgct 2940 ggaaggcccg tatcgatgaa acgggagcaa aaattttata taagccaggt agggaaaatt 3000 tcgttgcaga cgcgctgtcg cgacaaaata ttaatgcact aaacgacgat gactgtgagt 3060 ctgacgtggc cacaattcat agcgaggcgt cgctgtctta cacaattact tcatcagaca 3120 agcctgtaaa ttgctataga aaccaattaa ttttagagca agcgcgcttc ccgctaaaaa 3180 ggcaatttat tctattcggt caacaaatca gacaccttat caatttttcg aacaacgaaa 3240 ccctcctcgg agaggcaaaa gacgctatta atcctaactt agtaaacgca atccattgtg 3300 atttgcctac cttggctcgc gtccagcacg agttggttaa aacttttcct gccacaaaat 3360 tgtattattg taaacagatc gttgctgaca tttttgatag caacgagagc agggaaatct 3420 tcataacaga gcataaccgc gctcacaggg ccgcacaaga aaatgtaaaa cagatcatgt 3480 ccgattatta ttttcttaaa atgacgcaaa tcgctaacga agttgtcacc aattgtaaaa 3540 tttgctcaat tgctaagtat aaaagacacc caaagagaca ggaactcggc agtacaccta 3600 ttccaggcta tgttggcgaa atgttgcacg ttgacatttt ttccactaat aagaaatact 3660 ttttaacctg tgtcgacaaa ttttccaagt ttgctgtcgt acagcccatt gtctcgcgca 3720 ctattgagga tttgaaagcc cccctgtttc agttgatgag tctcttccct aaagcaaaaa 3780 ctatctattg cgacaatgaa ctttcattaa attctatgac cataaaaacc cttctacaaa 3840 acaatttcgg cgtagagata gctaacacgc cgccgatgca cagctcatcc aacggccaag 3900 tggagcgatt ccacagcacg atcgctgaaa tcgctagatg cattaagctt caaaggggaa 3960 atgtcgacac tgtcgatttg gttttgttag ccacagtaga gtataaccat acaatttatt 4020 cggtaacaca aaaaacaccg tcccagattt tttgtccttt atcggaagct agtaaatgtg 4080 ttagggatat gataaacaag gcacaggtga cacagcgcga aagggtcaac agaaataggc 4140 aaagcagggt ttggaatgtc ggagacaagg ttttcgtaaa ggtcaacagg agattaggta 4200 acaaattatc acccttgtat gttgaagaaa ccgttgaggc agacatgggg acatgccttc 4260 ttattaaggg gaggatggtg cacaaagaca acctcagata gaatcccaaa tggaaagcag 4320 tacatttctt tttcatggct taaatattta agcttacatt ttcattagtt atagtcgaca 4380 ctctatctta atccaacaat acgaaaatta ttattaacac atgagttcaa taaataaatt 4440 aacaacactc gatctttatg aattgtaaca aaacaaacaa attgccgaag tgaattgaaa 4500 aacagcatga aagcaacaca agatccgaga ttttcttgtc aatgaatgaa tataaaacct 4560 atatagaatt acaagaatta ttaatttcta tccctcacgt aaaaagcgaa ttgtaattgc 4620 cacagttttg ggctatttag ataaaaataa cttccaagta tccaactcac agggagatag 4680 atcaaaatat gtcaatagta ttaagattcc aaacgcaatt attgttccat tactaataat 4740 gtcctcgagt caaaatgtgt tccaggttga gcaagtccaa ttcacagagt agtcgacacc 4800 gacgagtaac cctgtctctc acggctcgcc ggaactacca tcaccaaatc atgcgagctg 4860 atgtccaagt actcatctat ccagttagga ccaggcgatg cagctcatac caaggccaaa 4920 tccaaccaac tggtggctgt caaggtgacc tccagccagt ccagttatat caagaactaa 4980 ctcgaccaac tggtgggacg tcctgggatc agcgacaaga tatgttcggc atctgcatcg 5040 taacaacccg aaatttataa gcacgaacca cggtaaagaa acaccgttcg tcatacttcc 5100 aactgctgtc aacgaggaat cttaggcacg tcttcaactt caagacgaat tgtcgacaag 5160 atgtaacttg aggcaaggcg gtccggattg actttaatcg gcatcgtcgg cggtctcgcc 5220 tcccttggat catgtaagga cccaaccaga ctgtcagacc tgatgtcagc aaatcgtaag 5280 gcgaccgagg actgtcgaat tttagagggg gaatagttaa ctcgtattac tt 5332 // ID CR1-38_BF repbase; DNA; INV; 1147 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-38_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-38_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1147 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1147 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1609-1609 (2009). XX DR [2] (Consensus) XX SQ Sequence 1147 BP; 335 A; 311 C; 230 G; 271 T; 0 other; ggttgtggat ggggagagat cgaccagtgc caaggtcaag tcaggggtgc cgcaggggac 60 agtactgggc ccccttctat tcttattata catcaatgac atcgataaaa atatctcctc 120 ccaactccga ctttttgctg acgattgtct aatctaccat cccatacaaa cagaacagga 180 ccagctagac ctacaaaaag acctagacac acttactgaa tggtccaaca catggcaact 240 caagttcaat gtgaaaaagt gctgtattat ccatgttcac aaatccaaac gctcaccaca 300 attcatgtac acaatgtctg cagaaccgct tgaggtcagc gcccaccacc cctacctggg 360 gatcatctta tcacaggacc tcaagtggtc aacacacatc aactctgtca cagccaaggc 420 caacagtaca ctgggattcc tacgaagaaa cataaggggg gctaccaaag aggtaagggc 480 caaagcttac acatccttag ttagacccaa acttgagtac gcatccacca tatgggaccc 540 acagaagcgt caatatgcca attcaaatgt acttatagat aaaatagaat ctgtacaaaa 600 acgagctgcc aggtttgtct ttaacgacta caggaacacc accagcgtgt catacctgca 660 gaacaaactg ggctggccga ccctgcaaga aaggaggagt caggctcgcc tggtgttgtt 720 ctataaggcc tttcacggac tggtatccat acccttacac cattacatct ctaggaatac 780 aaaccccacc agaggacacc cactacgctt taacacgctc tcctgcagga ccgaaattta 840 cagaaacagc tacctcccaa caacagtaag tgcctggaat aagttaccat cacatattgt 900 ttcggctcca acacttgcta gttttaagtt ggggttggtc caacaaatgg gccaccccaa 960 ctcccagatg taaatcggga tttcctgcgg ccatgtacat agtcacacac gtcttttttt 1020 gtttttggtt tctgcacgtt gcactttatt tttcctcacg ccaccggccg cacggacctg 1080 cacctcacct ggatgtgatg ctctaattaa gggctttgtc cagtacccta ttgagattga 1140 gattgag 1147 // ID Pr_Helitron1 repbase; DNA; INV; 5373 BP. XX AC DQ138288; XX DT 18-AUG-2005 (Rel. 10.08, Created) DT 18-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE Helitron-type DNA transposon from Philodina roseola. XX KW Helitron; DNA transposon; Transposable Element; KW Interspersed repeat; Pr_Helitron; Pr_Helitron1. XX OS Philodina roseola OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Philodinida; OC Philodinidae; Philodina. XX RN [1] RP 1-5373 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR Genbank; DQ138288; Positions 1 5373. XX CC It was derived from Genbank entry DQ138288 after excision of CC another insertion. Therefore it is a modified sequence labeled as CC "consensus.". XX FH Key Location/Qualifiers FT CDS 1417..2037 FT /product="Pr_Helitron_1p" FT /translation="MPPKGRISLGRSTPASRRMTAARAAETPEQVESRRGD FT DRTRRAVSRAARWGFMEGEAFRYDPANSYDSHPQLYIGQMNNICPHCNALK FT WPGEAAGMCCSGGKVKLQPLRCPPEPLESLLSGNTPISKHFLENIRKYNSC FT YQMTSFGVTNEVCESGFIPTFKVQGQVYHCVGSLLPPENDQHRFLQIYFMG FT DALQEAKQRCSNLPGVR" FT CDS 2808..3881 FT /product="Pr_Helitron_2p" FT /translation="MRYKRSKSTTNGSCLTVHYSLRTFQAHINVEYCNSVK FT SIKYICKYVNKGSDQAMFGLEKDGRAIDQVQRCQLGRYISSNEAIWRILDF FT PIHIRYPIVVHLAVHLENGQRVYFTKDNLHERVSEPPQTTLTAFLQLCQSD FT DFAKNLLYCDVPKYYTWNASEKAFKRRVQGSAVPGHHNIRETDALGRVYTV FT HPNNFECFFLRLLLHTVRGPTSFEDLRTVNGRMCATFQEACQLRGLLENDA FT QWDLTMAEAATVQSPGKLRNLFVILLITCGPANPRQLWESYKESMTEDILQ FT QARRRNPGINSNYTPDTYVQSNTNFSGRQSTGDGWQRPYTTRPSHSRKKSW FT RSIEQRNAQRNKLQR" FT CDS 3736..4389 FT /product="Pr_Helitron_3p" FT /translation="MFNQTLIFLEDKALGMAGKDLTQLGLPTPERNHGDQL FT SREMLRETSYNVDELREYVLENEPLLVSDRRAAYNAILERIDRKTGGIIFL FT DASGGTGETFVINLLLAKIRKQSKIAVAVASSGIAARLLSGGHTAHSTLKL FT SLNLTHCDALCNISKGTGEAKVLQECELIVWDECTMSHRQALEALDRTLQD FT LRGNGKQMGGGGGEFWYFSLVIFVKHCR" XX SQ Sequence 5373 BP; 1706 A; 1101 C; 1123 G; 1443 T; 0 other; atcgaacact ctcttgaatt gttgttgaat ttacgaaggg atttcaagca ttgccaacac 60 tagtttcatt tcagcgacgg atctcatcgg ttggctttga ctcttctata tcttttcaat 120 tgattcttta ttcaaagcga atcgaaatta aaaaaaacaa cctgcaaata tctatcaaat 180 tttaaagagg aaaaaaatct tcaatgttaa atctatatat gtaaagaagg atgtctgtct 240 gtttgtttgt ttgttcgcta tgcgtttcca taccgtgcaa acgatttcga tgaaactttc 300 cagggatgat ctctagaccc aggaggaggt cgacatctat ttggttcgac aaaaaaacca 360 atcctaacca tggtgtagac aacctttgca aatgaccaat aggattgcag tatcccacat 420 tgtaaaaaga attggtagcg aaggcggccg aaggccgtct gagcgctcat tcagaatgca 480 gcttatgttt tacttatctg gagcatccct gtgattgctg agataacttg gacgactgat 540 caataggatt gcagcgtttc aaattgtaga aaaaccgtaa cgaagaacat atgaagggac 600 atctgctaca agctatcaat ctaatcatct aaccatatct atgtatataa agaagaatat 660 ctattcatct gttcgctatg tattcatgcc tgtgacagat acaaccatta cgctttacgc 720 ggtgctttcc aaggtaccaa gaaagaaaag aacgagatcg gcgtgattga aaaaggagtt 780 tgaaaaagat tttcaagaga actttcgaaa aaccgaaagt aatttcaatg cgtcatagaa 840 tcattcatga cactttccta ggtcaaagaa gaagttcacg aagaatttgc aagacctaag 900 tcgacttttt ctattttgtc agaaaaatgt tttttttgtc caaaatctac ttgccttcct 960 tccgaagaat gacatgtgct gtagaatcaa tatgcaaatg aaccttctat tatatggaaa 1020 taacgacgta ttaattctta aatcagttga atttgttatg cattccatga tgtattacat 1080 tctttctgac tcacttacat tttcactatg agttgtgagt cagaatatgg ggatcacacg 1140 atgggaaatt ttctgatgtt tctcaatagc aaagctatat agttacttaa atatttcata 1200 tctctctaaa cgaaaggttt atcttacaag tggtattcat acaagaacaa gtaagtgtgg 1260 aatttccaat caaaagtctt tgagactact ggattatcaa agtatattgt cgtgtgttat 1320 ataaagcgct aattaaaaag gtaccttggt acaaattaaa caatcttgaa aaaatgcctg 1380 cgattttact attcatttag gaaatactag aacaaaatgc ctccgaaagg aagaatctct 1440 ttgggaagaa gcacgccagc ttcaagaaga atgacagctg caagagcagc tgaaacccca 1500 gaacaagtag aaagtcgacg tggagacgat cggacaagaa gggcagtctc aagagcagca 1560 cggtggggat tcatggaagg cgaagcgttc cgatacgatc cagccaacag ctatgacagc 1620 catccacagc tctacatcgg acagatgaac aacatttgcc cacattgcaa tgccctgaaa 1680 tggcccggag aggcagcagg catgtgttgc tctggtggca aagtaaagct tcaacctctt 1740 cggtgtcctc ctgaaccact agaatcatta ttatctggaa acacacccat atcgaagcat 1800 tttttggaga atatcaggaa gtacaactcc tgctatcaaa tgacgtcttt cggcgtaaca 1860 aatgaagtgt gtgagtcagg ctttatacct acatttaagg tgcagggtca agtctaccac 1920 tgtgttggat ctctattacc accagagaac gaccaacata gattcttaca aatttatttc 1980 atgggtgatg cactgcaaga agcgaagcaa cgctgcagta atcttcctgg tgtacgatag 2040 gaaatcgtca tggacttaca acaaatgttg catcagtgta acagctacgt ccatattttc 2100 aagtctccac tacaaagaat gccttctgat gcatacaaag tggtgattcg cgcggacaag 2160 aaacctattg gagagcacgc aggacgcttc aatgcaccta caacagataa agtagctgtt 2220 gttattgttg gcaacgagtt agatcgacga gagattatct tggagaaaag aaacaatcag 2280 ctgcgatacc ctcttctatt tccagaaggt gaagatgcat atcactttct catcatgcag 2340 acgaatccca caacaggcat gcctatcgaa ggtaagaagg tatcagccat ggatttctat 2400 ggttacagga taatgatgcg atctggaact ggcaatcata tttggaccaa cacgatgctg 2460 gatgtattct gctgaatggc agaagagagg actaccacat gctcacatct tggtatggtt 2520 aagagacaaa atcaaatcag atcaaataga cagcgtcatt aatggagaat taccagaccc 2580 tcaacgagac ccacggcttc ttgaaataat cgtgaaaaat atggttcatg gaccctttgg 2640 taatgtcaat ccaaactctc tctgcatgaa agatggaaaa tgtacgaaga gatatccaag 2700 acaacttctg caggatactc aaactggcga gaatggatac ccactatacc ggagaagacg 2760 cccggacgat ggaggtttca aaaccaaaat taatatgaag attggtaatg cggtacaaga 2820 gatcgaaatc gacaacaaat ggatcgtgcc ttactgtcca ttactctctc aggacctttc 2880 aagcgcatat taatgtagag tattgcaact cagtcaagtc aataaaatac atctgcaagt 2940 acgtcaacaa gggtagtgac caagcaatgt ttggtcttga gaaagatggg agagctatcg 3000 accaagttca acgttgtcag ttgggtcgat atatcagcag taacgaagca atatggcgaa 3060 tcttagattt tccaattcat atacgatatc caatagtagt acatttagct gtacacttgg 3120 aaaatggaca acgagtttac ttcacgaaag ataacttaca tgaaagagta agtgaaccac 3180 cgcagacaac actaacggca ttcttgcaac tttgtcaaag cgacgacttt gcaaaaaacc 3240 ttctgtattg tgatgtacca aaatattaca cttggaatgc gtcagagaaa gctttcaagc 3300 gccgtgttca aggaagtgca gtccctggcc accacaacat tcgagaaact gatgctttgg 3360 gtcgtgtcta tacggtacat cctaacaact tcgaatgttt ttttcttcga cttctactgc 3420 acacagtcag aggtccgact tcgtttgaag atcttagaac ggtgaatggt cgaatgtgtg 3480 caacgttcca ggaagcgtgc caattacgag gcttgcttga aaacgatgcg caatgggatt 3540 tgacgatggc tgaggcagct acggttcaat cccccggaaa gctcagaaac ttattcgtaa 3600 tcttattgat cacttgcggt ccagctaatc cgcgacaatt gtgggaatca tacaaagaaa 3660 gcatgactga agatattcta caacaagctc gcagacgaaa tcctggaatc aactcaaatt 3720 acactcctga tacatatgtt caatcaaaca ctaatttttc tggaagacaa agcactgggg 3780 atggctggca aagaccttac acaactaggc cttcccactc ccgaaagaaa tcatggagat 3840 caattgagca gagaaatgct cagagaaaca agttacaacg ttgatgaatt gagagagtat 3900 gtgttagaaa atgaaccttt actggtgagt gatcgaaggg ctgcttacaa tgccatactg 3960 gaaagaattg atagaaaaac tggtggaata atttttctcg acgcatctgg tggaactgga 4020 gagacctttg tcatcaacct acttttagct aaaattcgaa aacagtcgaa aattgctgtt 4080 gcagtggctt cttccggaat agctgctagg ttgcttagcg gtggtcatac tgctcactca 4140 acactgaaac tgtctcttaa tcttacacat tgtgatgctc tttgcaacat cagcaaagga 4200 actggcgaag ccaaggttct acaagaatgt gaacttattg tgtgggacga gtgtaccatg 4260 tcacacaggc aagcgttgga agctctggat cgtactcttc aagatctgcg tggcaacgga 4320 aaacagatgg ggggaggggg gggggagttt tggtacttct cgctggtgat tttcgtcaaa 4380 cactgccggt gattccgaga gggacgatgg ccgatgagat taaagcttgc ttgaaatctt 4440 cgtatctttg gagacatgtc attacgcttc agctgaaaac aaatatgcgg gttcatttgc 4500 agagtgacgt atctgatggc cagttcgctc aacagctcct cgctatcgga gatggaaaat 4560 ttccagttga tgctataagt ggactgatta aaattccgga caacttttgc aacgtcgtcg 4620 aatcgatcga aaaattgaag aacagcgtgt tccctaatat tcagcaccac ttcaatgatc 4680 acaaatggct ctgtgaacgc gcaatattgg ctccgaaaaa caacagcttc cgggggaaac 4740 taaatcttac aagtcagtcg acactgttac agatgtaaat gaagcagttc agtatctcac 4800 tgaatttctc aattcactcg aacctcctgg gatgccacca cacaacttag tactgaaagt 4860 tagttccccc atcatgtttt taaggaatct ggatgcacca agactttgca acgatacaag 4920 actatctgtg aaacagctaa tgcctcatgt tatcgaagcg acaatcctta caggatgcgc 4980 aaagggtgag gacatattcg ttccaagaat acccatgatt ccaacggata tgccatttga 5040 attcaagcgt cttcaatttc cagtacaaat tgccttcgcc atagccatca acaaagctca 5100 aggacaatct cttaaagttg cgggtatcaa tttagagtct ccttgctttt ctcatgctca 5160 actctacata gcgtgttcca gagtcggagc tggcaaaaac ctttatgtct ttgcacccga 5220 tgcaaaaact agaaacgtcg tttatcaaac agtcttacaa tagaaagttg ttcgaattga 5280 gtactctttg taataaaaca attgaaaaac ttttctctca cgctcacgat gtgctaccca 5340 ctcatcttga ccgacatcgg gtacttcccc tag 5373 // ID Gypsy-11_AA-LTR repbase; DNA; INV; 248 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_AA_; KW Gypsy-11_AA-I; Gypsy-11_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-248 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 992-992 (2011). XX DR [2] (Consensus) XX SQ Sequence 248 BP; 83 A; 58 C; 37 G; 70 T; 0 other; tgtagtgtac cttatagata gtatccttgc taatatcaaa ataatatcgt ttactcgtca 60 gtaggtatgg caaaggtcaa agctaacaat cccctgacca tgtaattgtt acccatataa 120 aagatcgatc aaccattgta tctttctctt tattcgctca ctcctcgagc ctacaacatc 180 agcacggaat aaatagtaag tagaatccta aaccagtgtt tgaacccaac ccacgtgatc 240 cgaaaaca 248 // ID MSAT-4_AAe repbase; DNA; INV; 176 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Minisatellite-type sequence: consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-176 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1450-1450 (2011). XX DR [1] (Consensus) XX CC 22-bp unit. XX SQ Sequence 176 BP; 64 A; 24 C; 40 G; 48 T; 0 other; tggaatcttc caataggaag attggaatct tccaatagga agattggaat cttccaatag 60 gaagattgga atcttccaat aggaagattg gaatcttcca ataggaagat tggaatcttc 120 caataggaag attggaatct tccaatagga agattggaat cttccaatag gaagat 176 // ID TTAA6_AP repbase; DNA; INV; 431 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA6_AP. XX NM TTAA6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-431 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1786-1786 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 431 BP; 132 A; 72 C; 83 G; 144 T; 0 other; gaggatgtca cacccgcatg tgttgtctcc gtcttacaaa tgtacaacat agcaaaaact 60 gttttgcgcg ggacaacttt tatctttctg tgtttgtagt agtggctcca aaatttctga 120 acatataggg tggaaaatta tctatgcaac agcgtgccca tattttagat aacataaata 180 caataaaagt tatttgcttc taaacacatt tggttttttg tttttttcat ttgcttataa 240 aaaattgatt ggatagtgct attaaaaaaa aaagcacgct gttgcacaga taaagttctt 300 ctttacgtgt tgtgaaaatt tggtagttct acaacaaata tggtgcgaga aaagttgtcc 360 cgcgcaaaac agtttttgct atgttgtaca tttgtaagac ggacgacaac acatgcgggt 420 gtgacatcct c 431 // ID Mariner-20_HM repbase; DNA; INV; 2587 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 2) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-20_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2587 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1954-1954 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(297..863,857..1252) FT /product="Mariner-20_HM_1p" FT /translation="MFFYMPIGTYLPIRNMRIYKRKDGSRAYRSHSETQLS FT AALQCCRKGTMSLLKASKKFKIPYGTLHNRMKGKHVKKKLDIPVALSLNCK FT SHLLKVQINVLMTDWKMPVDGLDIRLMVKHYLDNTGVKNRQFNNNLPGIDW FT MLXFIERHKLTTRIADNVKPSRAEVCGQTVNAYFDELAKTLEGVEPCNYLL FT FITMMRLIFRTTLAQNRSYADEDLGKESNEKCNXQSSCVSIMYCGNAIGVF FT LPPMVVYKSQNVYQEWTRGGPEGTCYDNTKSGWFDSRTFERWFLEIFLSNV FT SSKIMDQKSSLVTILPLIFLFKSLRPLN*" XX SQ Sequence 2587 BP; 940 A; 415 C; 463 G; 760 T; 9 other; ccgtcaactg gggttacttt gatacccggg gtaacattga taattttttt agaattactt 60 tttcagtata ttatcgtart atatttttaa tactatttta ctatcttttc tattttcagg 120 ttaaaaataa ataaacacta atcaaaagtt tacaaagatt tcaccaaatt gatttgaaag 180 gtaaaaaaca tagtaaaatt gttaaaaata aaataaaatt tacaaagttt tttgtaactt 240 ttttgtaaaa aagtgtaaaa aacgtattct aaacagtaat aaaaaactat aaaaacatgt 300 ttttttatat gcctataggt acatacctac ctatccgtaa tatgaggatc tacaagagga 360 aggacgggag tcgtgcttat cgttcacatt cagaaactca gttgtcagcg gcccttcaat 420 gttgccggaa gggtactatg agtttgctaa aagcatctaa aaaatttaaa attccatatg 480 gaacattgca caacagaatg aaaggtaaac atgtgaaaaa aaagttggac ataccagttg 540 cactttcttt aaattgcaag agccaccttt taaaggtcca aataaacgtt cttatgactg 600 attggaaaat gcctgttgat ggattagata tacgtttaat ggtcaaacat tatcttgata 660 acacgggtgt taaaaacaga caatttaaca acaatttacc wgggatcgat tggatgctaw 720 gttttatcga acgccataag ctgacaacac gcattgctga caatgtaaaa ccttcacgcg 780 cagaggtttg tggacaaact gtgaacgctt aytttgatga attagcaaag acattagaag 840 gcgttgaacc ctgtaactat ttataactat gatgagacta atctttcgga cgaccctggc 900 tcaaaatcga tcatatgcag acgaggactt aggcaaagag tcgaacgaaa aatgcaacay 960 tcaaagcagt tgtgtaagca tcatgtattg tggaaatgct attggagtat tcctaccacc 1020 tatggtagta tataagtctc aaaatgttta tcaagaatgg accagaggtg gcccagaagg 1080 aacttgttat gacaacacaa agtcaggctg gtttgattca agaacctttg aacgctggtt 1140 tctagaaata tttctttcta acgttagttc aaagatcatg gaccaaaagt cgtcattggt 1200 gacgatcttg cctctcattt ttctcttcaa gtcattgagg ccgctcaact gaagccacga 1260 ttcaacataa cattcgtttt acctgtttgg taccaaatgc aagtcatctc cttcaaccac 1320 tggatgttgc agtttttaga tccatgaaga aagagtggaa gagaattctt agaatgttgg 1380 agaaaagaaa gtcgcatcaa aggtacaata ccaaagacaa cattttcctg ggttattgtt 1440 gaagttgcat cgaagactaa aagaacaaaa cctttgattt ctggctttta agcttgcgga 1500 atatatccac taaacagaga tgaagtcttg aaaagactkc caaatgagaa ccaagaygtt 1560 ggtggtgaaa aggttaaaga aatctttggt acggttgttc aggaaatgct aaaagatcat 1620 tgtggattca ataaaccgga caaacaaaca cagaagagag gaaaaaagat aattcctggt 1680 aaagaaataa catcaaatga tctcagcgtg atcaaagaaa caaaaaaacg aacacagtgt 1740 agtggtactg gtacaaaatt ggttgtcaaa aaaaaaaacg atcaaataaa aaaatccagt 1800 cttcaacatc ttcagataat gattctgaaa ttgaaatgga acttcaagat gacagtgaag 1860 actctttgga atcagctgat aaaacaactg acattccatt tgaaccaaag caatgggtag 1920 ttgtagccta caccgggaag aaatctgttg tcaattacat tggtcaagtt gttgaagtgt 1980 tcgagagaat aaaactgctt catattaaat ttttgaaaaa acaaccggcg agtaaattct 2040 tcaaatttac ggaaagtaaa gatgatatgg atgaagctgt tccatattct gctgtggttg 2100 aaagaatttc agatgaaccg agaaaaacaa gggatcaata tcggttccca tacctgtctc 2160 accaagtcat caactaaaat caaraacttt tgataattta taattttgag tttgatawag 2220 tttgatattt tttgacattt tttcaaccaa ctttgtaatt ttttggatag tattaggtgt 2280 actaggtcta taaatctttt ttataagcct aaaaaactat caatgaaacc ctgtgatgag 2340 ttaacttgat atttgattga aataacattt actatcaaag taaccccgac atgcggggtt 2400 acattgataa ctgtttgggc tttaaaaaca aatacaagac aagttctgat atttctaaaa 2460 agcatttttt cctgttaagg atagatcatt tggtatattc aaaccaaaaa caacagcaat 2520 tcattaaaat tctatcagca cttttaaaaa aagcacaaaa acctatcaaa gtaaccccag 2580 ttgacgg 2587 // ID Crack-31_AAe repbase; DNA; INV; 5283 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-31_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5283 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1247-1247 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 922..1989 FT /product="Crack-31_AAe_1p" FT /translation="MDTDDVYICNICSKKDTDGSKFLTCMYCLTSSHFKCK FT NIVGNAVRRMRDTEYFCSSDCSAIYQRIVTMQNTNHSLMSSFAAEMKATVS FT ASIAKEMLSVKADVKQITTSIEKSQEFLSAKFDEIVTDFKDLKLENEKLKE FT EIDHLKKSQHQLQGMVHKLEANVDRSDRASLVNKAVVWGIPTVPGENVLQL FT VEKILLYLGLREPSELVASAERIFVNTKASNELVPIRIVFHDSESKEAVFN FT KKKQFGKLLSTAIDKTFNVNGKPMNVTLRDELSPLSLELLKELREHQEMLN FT VKYVWAGRGGVVLVKKDDGSKPEIVKTREDLSRIMNRFLLSTHGPDDANAS FT GHSPSPKRRKNVQ" FT CDS 2036..4939 FT /product="Crack-31_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFKMDTLPNMIVNKYYNSLNAFNQNCGMNNNCLNVIQ FT WNVRGMNNFEKFDELIYFVNECKIDIDVIIVGESWLKKENICVYNIPGYRA FT FFSCRDSSSGGLAVFIRTSLEYKLIDNKTIDGLHYVHLEIKFRGNFLNVVG FT VYRPPSFDYNEFQNILEGWLSTSTSSKPFFLFGDVNVPINMQNNNIVIRYK FT NLLESYGYLCANTIPTRPISNNILDHCICPMDLASQFQNYTIFSDMSDHLP FT VISSLPFGVGKRQQELKIKMVDHEKLRTQFSILLNNLQITDDVDATLYTLV FT TSYNSLLLECTKTVTKIVKIKGVHCPWMTYGLWRLIQIKNNYLEKHKINPN FT DRRNTEMLAHVSKKVKNAKIRCKRVYYENILKSSSHANVWKTLNQIFGRSK FT ETDQIKLLKDGRLLDSEYDVAEVFNSFFSSIGNNLADKLPNDNYDFLKTVK FT HVSNTIFLRPTTTNEVRILISRLDGRKSKGHDNFPADLLKANIEPFSVILA FT KLFNKMITTGSYPNILKIAKVTPVFKSGDKQDPSNYRPISTLSVLNKILEK FT LLTCRLVSFLCANNVFYKFQYGFREGCGTTTAIIELIDELISQIDQKNIVG FT GLFIDLKKAFDTINHEILLKKLDRCGIRGIANNLIRSYLSNRKQFVVINGV FT RSSLRPIDIGVPQGSNIGPLLFLVFINDLGNLRLRGIPRLFADDTALFYPQ FT ANVQLIVNDIEKDLESLIEYFNGNRLSLNLAKTKYMVFHSSRKSIPQHPDP FT CVRNVHLEKVSSFKYLGLHLDPCLSWDCHIRNVSGKMASLCGLVKRVRSFV FT HKDVLMQFYYACIHSILQYLIIVWGHAAKSKIRRLQVLQNRCLKVIFNLSP FT FFPTFSLYSNLNHKVIPVLGLRDCQTIMYVHNTLNNRSFHHNISFPYRLNI FT HNTRHAFELSRSRAYTNLGLERITSYGPLKFNSLPNDLKNIINNIHFKSKL FT KQHLLRNVNDYLL" XX SQ Sequence 5283 BP; 1649 A; 887 C; 981 G; 1760 T; 6 other; ctggcaacac tgttgtatgg ttgacacgca atctttasct tcccgcgttt taaattcagt 60 ttaattgtgt ataaacttaa attaattctt gatttttgtg cgattgtaaa gtgataacgt 120 tgttgtgcaa gctaaatggt gttatggttg tggctactat ttgaatacta ctgaaaaaac 180 agattgttat gcccattagt gatcttgata aaagggtcta ttgttttttt gtgtgattca 240 ttctgaacgc gctcctgctt gttgttgctg ctgttgttat gctgtcgatg ctgtattgaa 300 acgaaaaaaa ttgtcggtga aaatataacg aattgagtag aagttaattg twagtacacc 360 tatagataca ctttgctagt gtcgtcgatt ctttcaaagg tttatcgtgc tattctttgc 420 cgcccaaacg gccacttgac aagcgaaaac atttgttagt tatctgattg atgctcattg 480 tggtctcgct ccctagacaa aaaaagcttg tatttgtgat acttcttgcc gcccaaacgg 540 ctacttgacg agcaaagtgc tttggtcctg gttgagcggt ttagtctggt aggctggaaa 600 acaaacttca acgttcgttt tacagactag ccgtggatac tagagttcat gttcagcatt 660 ggttgtcgtt attgttgtgg ttattattca cgcacaaaaa aaccgtagga ctttttatta 720 aatgtttgta tctggcaaca cctatctgtc gttattctag tacttcaccc cattsaacca 780 aatagcttgt gtgtttcaat tttgttctga ttttacgttt tagtattagt tcgtgttgct 840 cctcatatag attgtttatt tacattcctg tttgttgata tttttttgct tcagcttgct 900 ttttcatcta aamtttgcat aatggataca gatgacgttt atatctgcaa tatttgttca 960 aaaaaagata cggatggctc caaattttta acgtgtatgt actgtttaac gagttctcat 1020 tttaaatgta aaaacattgt tggaaatgct gtgcgtcgta tgagggatac cgaatatttt 1080 tgttcatctg attgttctgc tatttatcaa cgaatcgtaa ctatgcaaaa tactaatcac 1140 tccttgatgt cttcgtttgc tgctgaaatg aaagcaactg tttcagcatc tattgctaag 1200 gaaatgctgt cggttaaagc agacgttaaa caaatcacaa cttcaatcga aaaatcccaa 1260 gaatttttat cwgctaaatt tgatgagatt gtcaccgatt ttaaggatct aaagctagaa 1320 aatgaaaagt tgaaagagga aatagaccat ttgaaaaaat cgcagcatca gttacagggt 1380 atggtacata agttagaagc taatgtggat aggtctgaca gagcatcctt ggttaacaaa 1440 gctgttgtgt ggggtattcc tactgttcct ggtgaaaatg tactgcagtt agtggaaaaa 1500 atactcttgt acttaggact cagagagccc tctgaacttg tagcatctgc tgagcgtatt 1560 tttgtcaaca ccaaagctag taacgagctg gttcctatta gaattgtatt ccatgatagt 1620 gaatcaaaag aggcagtgtt caataagaaa aaacagtttg gaaagttact ttcaacagcc 1680 atcgataaaa cgttcaatgt taatgggaaa ccgatgaacg taacattacg agatgagttg 1740 tctcctctct ctttggagct tttgaaagag cttagagagc accaggaaat gctgaatgtt 1800 aagtacgttt gggctggtag gggcggagtt gtgttggtta agaaagatga tggcagtaaa 1860 ccagagattg ttaaaaccag agaagatctt agtcgtatta tgaatcgctt cttgctgtct 1920 acacacgggc ctgatgacgc taatgcttct gggcattctc cgtcacctaa gcgtagaaaa 1980 aatgtacagt aaaataattt caagaataat ttcgattagt agtataaata cctaaatgtt 2040 taaaatggat accttaccga atatgattgt aaataagtat tataatagtt taaatgcttt 2100 taatcaaaat tgtgggatga acaacaactg tttgaatgtg atccagtgga atgtaagagg 2160 aatgaataat tttgagaaat ttgatgaatt gatttatttt gtgaatgagt gtaaaataga 2220 tattgatgtc ataattgtcg gtgaatcgtg gttgaaaaag gagaatattt gtgtttataa 2280 cattcctggt taccgtgctt ttttctcatg ccgtgatagt tcgtctggtg ggttagctgt 2340 ttttataaga acatctttag aatataaatt gatagacaac aaaactattg atggcttgca 2400 ctacgtgcac ttagaaatca aatttcgtgg caactttctc aatgttgtcg gtgtttatcg 2460 tccaccatca tttgattaca atgaatttca aaatatttta gaaggctggt tatcaacctc 2520 cacgtcgtct aaaccatttt ttctttttgg cgatgtaaac gttccaatca acatgcaaaa 2580 caacaacata gtaattcgtt ataaaaattt attagaatct tacggatatt tatgtgcaaa 2640 tactattcca acgcgcccaa taagtaacaa tatcttagac cactgcattt gtccaatgga 2700 tcttgcttca caattccaaa attacactat atttagcgat atgagtgatc atcttcctgt 2760 gatttcgtct cttcctttcg gtgttggtaa aaggcaacaa gaacttaaaa taaaaatggt 2820 agatcacgag aaactgcgaa cccaattttc aatcttgtta aataatcttc aaatcactga 2880 cgatgttgat gctactttgt acactcttgt cacaagttat aattcattat tgcttgagtg 2940 tactaaaacc gtaactaaaa ttgtcaaaat taaaggagtw cattgcccat ggatgactta 3000 tggcctctgg cgattgatac aaataaaaaa taattatctt gaaaaacaca aaattaatcc 3060 caatgataga cgtaatactg aaatgctggc tcatgtttct aaaaaagtta aaaatgcgaa 3120 aatacgatgc aaaagagtat attacgaaaa cattttgaaa agttctagcc atgccaatgt 3180 gtggaaaacc ctcaatcaaa tatttggccg ttctaaagag accgatcaaa taaagttact 3240 taaagatgga cggttattag attcagagta tgatgttgcc gaagtattca atagcttttt 3300 ctccagcatt ggaaacaatt tggccgataa attgccaaat gacaattatg attttttaaa 3360 aactgtaaaa catgttagca acactatatt tctcagacca acaaccacga acgaagttcg 3420 tattttgata agccggcttg atgggagaaa aagcaaaggg catgacaact ttccagcaga 3480 tttgttgaaa gctaacatag agccattttc agtaatcttg gctaagctct tcaataaaat 3540 gataacaact ggatcttatc ccaatattct aaaaatcgca aaagttacac ctgtattcaa 3600 atctggagac aaacaggatc cctctaacta tcgcccaatt tctactttgt cagtcctgaa 3660 caaaattttg gagaaacttc taacatgccg gcttgtaagt tttttgtgtg ccaacaatgt 3720 attttataaa tttcaatacg gctttcggga aggttgtgga actactactg caattattga 3780 gcttattgat gagttgatca gtcaaataga tcaaaaaaac attgttggag gtcttttcat 3840 tgatttgaaa aaagcttttg acacaataaa ccacgaaatt cttctgaaaa aactcgacag 3900 atgtggtata agaggaatcg caaacaatct tataagaagt tatctttcaa ataggaagca 3960 atttgttgta ataaatggcg ttcgtagtag tttacgccca attgatattg gagtgcccca 4020 agggagtaac attggtcctc tattgttttt agtttttatt aatgatttgg gtaacttaag 4080 acttcgagga attccaagac tttttgcgga tgacactgca ttgttttacc cgcaggccaa 4140 tgttcaacta attgtcaacg atattgaaaa ggatcttgag agtctcatag aatacttcaa 4200 cggcaatcgt ctttcgttaa atctggcaaa gacaaagtat atggtttttc attcttcccg 4260 taaatcaata ccgcaacatc ctgacccatg cgtcagaaat gttcatcttg aaaaagtttc 4320 atctttcaaa tatttaggtt tgcatttgga tccttgcctt tcgtgggatt gccacataag 4380 aaatgtatca ggtaaaatgg cgtcgttatg tggactagtg aaaagagtac gttcttttgt 4440 gcacaaagat gtactaatgc aattttatta tgcttgtatc cattcaattc ttcagtatct 4500 cattattgtc tggggtcatg ccgcaaaatc gaaaattagg agattacagg ttttgcagaa 4560 cagatgcttg aaagtcattt ttaatttatc tccattcttc ccaacatttt cgctctacag 4620 taatttgaat cacaaagtca tccccgtact tggattgcgc gattgccaaa caattatgta 4680 tgtacacaat actttgaaca accgcagttt ccatcacaac atttcttttc catatagatt 4740 aaatatccat aatactagac atgcatttga gctttcgaga agcagagcat acacgaatct 4800 ggggctagaa cgtatcacca gttacggccc tttaaagttt aattcacttc caaatgattt 4860 aaaaaacata attaacaaca ttcattttaa aagcaagctc aagcaacatt tactgaggaa 4920 tgtaaatgat tacttactgt aacgtgtcaa ccatttaata atgtagggtt tttctggctc 4980 atgctagttt ttacctgcat aattttctta ctcgtagtta tttacttttt gttatattaa 5040 gccacaatcg ttatcactgt agtttgtaat atgtttatca atctgttata tattccttat 5100 gtctgtgaat cccttcaaag gaacaatttt ccactgggat ttcacactgt ttgtatatgt 5160 tattcaaacc tccgcgtatt agtttaaaat tgttttatac aagtaagtgt ccactaccag 5220 ggggctctct gtgtgagctc tttggtgtgg gggtcagtgg tgggtcttaa aaaaaaaaaa 5280 aaa 5283 // ID Copia-7_SI-LTR repbase; DNA; INV; 284 BP. XX AC AEAQ01011831; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_SI_; KW Copia-7_SI-I; Copia-7_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-284 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01011831; Positions 447 164. XX SQ Sequence 284 BP; 59 A; 61 C; 55 G; 109 T; 0 other; tgttgagaat aattgttaag tatgttcatt tgtatgtttg gcgccataag cgtgtagtct 60 gcgtgcgata gtgtcgctag tgatcgacgc gcatgcgttg ccctagtcgt tcgttttttt 120 cgtttttttt ctctctctcc tctctgacac acgcttgcct cagaacgctg ccttttgtgt 180 acaactctct attctctggt tgttggaata aacgcttttc tatttgctct aataaagtgt 240 gtctaattaa aaacctctga ccattcgata agcataatct atca 284 // ID Gypsy-3_AC-I repbase; DNA; INV; 4948 BP. XX AC AASC02058326; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_AC_; KW Gypsy-3_AC-LTR; Gypsy-3_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4948 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02058326; Positions 4808 9755. XX CC Positions [1614-1988] - Reverse transcriptase CC Positions [3247-3756] - Integrase core CC 'CTCCC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 330..3161 FT /product="Gypsy-3_AC-I_2p" FT /translation="MAEAKFRLPSPLQMSEDNLADHFRKWKRQLDVYMEAS FT GNTSKPAKTQTAIILHCAGPEAREVFDQFEFDNPEDKDDPQQVLLKLQQYC FT NPRSNEVLQRYRFWNLPMSTPFDKFSTELKAQAEKCNFQEKDKMIKDKIIF FT SVPQHLKEKLLREPDITLKKTIDICQAYEQTANYLREMKGEQKIDKVYKGK FT KQSFKLSGGPSRKKDGRPTTSTTSKDAECRFCGKRHELNKAKCPAWGKTCS FT NCGGRNHFRIKCKKIHSLVETDDEAWINAVGDGTRRATACLKINDCKVRFQ FT LDTAADVNTIQQRFVSKNQVTRTSHTLVMWNGTKTAPLGETRLLVTNEKTG FT QTLTVKFTVVKNNLNCLLGLTTCQDLGLVTINHDKFITKVEEEPGHLGTAN FT LKTDPEIRPQVLPCRKIQFALEKKVKEELDSLIQRGILVPMTEPTDWVSQM FT AIVEKENGSLRICIDPRPLNKALKREHYKLPTLDDVLPKFRDTKMFTKLDV FT KEAFWHIKLDEESSKLTTMITPFGRVRWTRLPFGLNVSSEIFQRHLATALE FT DLPGIVNVADDILVIGNGLTKEKAQKDHDRKYDLLKERCKEKCIKLNQEKA FT RERQEEVRFMGHKIDKRGIEPDARKVEAIIKMPTPKDVHDVRRFCGLVQYL FT AKFLEKLSDRLQPLRDLTRKDTPFVWSKECQTAFSEIKEMIATTPVLRFYD FT PNKPLEVQVDSSKDGLGACLMQEGQPIEFASRTLSETEQRWAQIEKEMIAV FT VFGLERFDQYTFGRKVHVTTDHKPLETIIRKPLSEAPKRLQRLIMRSNRYD FT FDLTWARGSSLLIADTLSRAVVSNIISSENETEVQAKSNIPDAMIEKLRIE FT TSKDEDMQTLAETIRTGWPDSKSELPPSTRPFFDYRETMSIENGLITRGEK FT VLVPRSMREEIKRRLHAAHLATESMLRRARRTVFLARHDR" FT CDS 3154..4293 FT /product="Gypsy-3_AC-I_1p" FT /translation="MTAEIKQMADRCEVCQHSKPRNQRETLVQHSIGRAPW FT EKLGSDIFEIKGRQYLILVDYFFNFIEVDYLPTSTSGTVITKMKSQFARYG FT VPKMLVTDCGSQYTASTFQQFTQHWGITHVRSSPGHHQANGKAESAVKIIK FT TMVKRFVQNHEDQYEALLELRNTPRQDGTTPTQMMFQREIRTLLPQMRKEN FT YVPENDMKRLLAPKMKVKRQFDRKARDMNPLQKRQTIYYQNPDKPGWNAGR FT IKEKLNNRSYRIEGETGGTYVWNRVHLRPKSTPFHERKDGPDDNVMLTPAE FT SPAVKDHSLQRRPNTTVDRPDQEVSAPDGSSRCSPTVVPVESTSQQPTTSW FT PTSLKSNLTPCPSHNKAQNKHYITRSGRIVKPNTKDT" XX SQ Sequence 4948 BP; 1768 A; 980 C; 1075 G; 1125 T; 0 other; tgctggcagc ggtaaaagtt tcataaaagt gagactctag tcttgaaaat gtctggtgtt 60 tccatacata tttctatatc ttgtatgaac atgtcttgaa aattcaggtg ttatatatat 120 attctacgaa gcctatattc tatttagatt ctagagatcg gatctgaaca gaagtttggc 180 catgaggcta gccaaggaaa aaagtgtttg tagaaaataa caaaactgta agtcaagtat 240 atttttcctt gcaacaatta ttttgcactg agcagacgac gacccagata gtgttaattt 300 ctgcccgatt tatctgcaac ctgacgaaca tggcagaggc taaatttcgc cttccctcac 360 cactccaaat gagtgaggat aatttagcag atcattttag aaaatggaag agacagctag 420 atgtttatat ggaagcctct gggaacacca gcaagccagc taaaacccag acagccataa 480 ttctgcactg tgcaggccca gaagcccgag aagtttttga tcagtttgaa tttgataatc 540 ctgaagacaa ggatgacccg caacaagtac tacttaaatt gcagcagtat tgcaacccaa 600 gaagtaacga ggttctgcaa agatatagat tttggaactt gcctatgtca acgcctttcg 660 acaagttttc aactgaattg aaggctcagg ctgagaaatg caactttcaa gagaaagaca 720 agatgatcaa agacaaaatt attttcagtg ttcctcaaca tctcaaggaa aagttgttga 780 gagaacctga tatcacttta aagaagacaa tcgacatttg ccaggcatac gagcagactg 840 caaactacct gagagaaatg aaaggtgaac agaaaattga taaggtctac aagggcaaga 900 agcaatcatt taaacttagt ggaggaccat caagaaagaa ggatggaaga cctaccacga 960 gtacgaccag caaagatgct gaatgcagat tctgcgggaa acgccatgag ctcaacaaag 1020 ccaaatgccc agcctggggg aaaacctgta gcaattgtgg aggaagaaac cactttagga 1080 taaaatgcaa gaaaatacac tcacttgtag aaacagatga cgaagcatgg ataaatgctg 1140 tcggcgacgg aacaagaagg gctacagcat gtttgaaaat taacgactgc aaggtcagat 1200 tccagctaga tacggccgca gacgtcaata caatacaaca gagatttgtt agtaaaaatc 1260 aagtaacaag aaccagtcat acactcgtca tgtggaacgg cactaagacg gcacctttag 1320 gagaaacaag attgttagtt accaatgaga agacaggaca gacgctcaca gtgaaattta 1380 cagttgtcaa aaacaatcta aactgtctac ttggtttaac gacatgtcaa gatttgggat 1440 tagtcaccat caatcatgac aagttcatca caaaggtaga agaggaacca ggtcatctgg 1500 gaacagcgaa tctgaagaca gacccggaga taagaccaca agttctgcct tgcaggaaga 1560 tacaatttgc cctagaaaaa aaggtcaaag aagagttgga ctctctcata caaagaggaa 1620 tcttggtacc catgacagaa ccgacagatt gggtgagtca aatggccatc gtggagaaag 1680 aaaatggatc tctacgcata tgcattgatc cccgtccttt gaataaagct cttaaaaggg 1740 aacactataa actccctacg ttggatgatg tcttaccaaa atttagagac accaagatgt 1800 tcacgaagtt ggatgtaaag gaagcatttt ggcacatcaa actggatgaa gagtcgagca 1860 agctcaccac gatgatcaca ccatttggtc gagtgagatg gacaagacta ccatttggac 1920 tgaacgtgtc cagcgaaatt ttccagagac accttgcaac cgcacttgaa gacctacccg 1980 gcatagtgaa tgtcgcggat gacatcctcg tgatcggaaa cggcctcaca aaagagaaag 2040 cacagaaaga ccatgatagg aaatatgacc ttctcaagga aagatgtaaa gaaaaatgca 2100 tcaaactcaa ccaagaaaaa gccagagaaa gacaggaaga agttagattt atgggtcaca 2160 agatagacaa aagaggaata gaacctgatg ctcgcaaagt agaagccatc ataaagatgc 2220 caaccccaaa agatgttcac gacgtgagaa gattttgtgg actggttcaa tacctagcca 2280 aattcctcga aaagctctca gacagattac agcctctcag agatctgaca agaaaagaca 2340 caccattcgt ctggtccaaa gaatgtcaga cagctttctc cgaaatcaaa gaaatgatag 2400 cgacgacacc agtcttacga ttttatgatc caaacaagcc tttagaggtg caagtggaca 2460 gcagcaaaga cggtttaggc gcatgcctga tgcaggaggg ccagccgatt gaatttgctt 2520 ccagaacgct ttcagaaacc gagcaaagat gggcacagat agagaaagag atgatagccg 2580 tcgtatttgg tcttgagaga ttcgaccaat acacctttgg gcggaaagtt catgtgacaa 2640 cagatcacaa gcccctggaa acaataataa ggaagccatt gagtgaagct ccgaagcgac 2700 ttcaacgact cattatgaga agcaaccgct atgattttga tttgacatgg gcgagaggaa 2760 gctcactact aattgcagat acgttatcgc gagcagtcgt cagcaatatc atttcatcag 2820 agaacgagac ggaagttcaa gcaaagagca acataccaga tgcaatgata gagaaactga 2880 ggatagaaac gtcaaaagat gaggacatgc aaaccctggc agaaacaatc agaactgggt 2940 ggccagacag caaatctgaa ctaccaccgt caaccagacc attttttgac tacagggaaa 3000 ctatgagtat tgagaatggg ctgattacaa gaggagaaaa agtcctcgta ccaaggtcaa 3060 tgagagagga gataaagaga agactacatg cagctcacct agcaacagag agtatgctga 3120 ggagggcgag gcggactgtt tttttggcca ggcatgaccg ctgaaataaa acaaatggct 3180 gacaggtgtg aagtttgtca gcacagcaag ccaagaaacc agagagaaac actggtacaa 3240 cacagtattg gacgagcacc atgggaaaag cttggatcgg acatttttga aataaaaggt 3300 agacagtacc ttattctggt agactatttt ttcaatttca ttgaggtcga ctatttgcca 3360 acatcgacat ctggaacagt aattaccaaa atgaagtcac agtttgctag atatggagtg 3420 ccaaaaatgc ttgtcactga ctgtgggtca cagtataccg cttcaacgtt ccaacagttc 3480 acacaacact ggggtatcac tcatgttcga tcatctccag gccatcacca agccaatgga 3540 aaagctgaat cggcggtaaa aataataaaa acaatggtga aacgctttgt gcagaatcac 3600 gaagatcaat acgaagcact gttagaactt agaaacaccc caagacaaga tgggacgact 3660 ccaacgcaga tgatgtttca gagagaaata cgaacgctct tgcctcaaat gagaaaagaa 3720 aattatgttc cagaaaatga catgaaaaga ttattagctc ccaagatgaa agtcaagaga 3780 caatttgaca gaaaggccag agatatgaat cctctgcaaa agcgtcaaac tatttattac 3840 caaaaccctg ataaacccgg ttggaacgca ggcagaatta aagaaaagct caacaacaga 3900 tcttatagaa tagaaggaga aactggcggc acctacgttt ggaatagggt gcatttgaga 3960 ccaaaatcaa cacctttcca cgaaaggaag gatgggcctg acgacaacgt catgttgacc 4020 ccagcagaaa gtcctgcagt aaaagaccac agtttgcaaa gacgacccaa caccaccgtt 4080 gatcggcctg atcaagaagt gtcagctcct gacgggtcga gcagatgcag tccgactgtt 4140 gttcccgttg aatctacaag tcaacaacca acgacttcat ggcccacaag cttgaagtca 4200 aacctgactc cctgtccaag tcataacaaa gcacaaaaca aacactacat aaccagaagt 4260 ggacgcattg tcaaacccaa cacaaaagac acatgaagtt gagatcatcg attgcaccat 4320 ttgaatatat ctatggcctg tttttaagat tgtgatacgg acaagtgtgt tagtgtgcaa 4380 catgcggtct acaacaacat caaaacaaag cagtacatat tgtctacaca attgaacatt 4440 tagtcacata tgcacagaca ccagtcaccc atcagaacag tagagttctg tttagaaatc 4500 agtcatactc tttaaaagac aattttccat tctcatcacc gctgagaact gacgagcaga 4560 cgaccagacc cgaagaagtt atcgaaaagg attttgacaa gacaattaac agaaagtttt 4620 gatttgtgtt attagaatat aaatgtaaat ggactatcac cttttagtat acatcggata 4680 caatgtggaa aatcgcagtt tgatttaaaa gatttacagt tatgtaaaat cagtagtaat 4740 ttcttgggaa tgggattcgt cagatatata ttatgctttc tattttgtgt cggtaacgct 4800 caaatgatat tagtagtaga attacttata attaaacttg gtaataaaga atatttctta 4860 taaagggttt tgaactaatt gactaataag atgttgaaat gttacgttta aggaatgcac 4920 ataaagcatc ttgctaaaga agggtgga 4948 // ID Copia-1_AA-I repbase; DNA; INV; 2552 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_AA_; KW Copia-1_AA-LTR; Copia-1_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2552 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 937-937 (2011). XX DR [2] (Consensus) XX CC Positions [1371-1874] - Integrase core CC 'TTTGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 102..2492 FT /product="Copia-1_AA-I_1p" FT /translation="MAEQRLPGFPLTKLNNHNFQSWKFKMEMMLVREELWH FT VIAEERPDPVTDTWKKADAKARATIGLCMEDNQTSLIRSCTCAKDAWKSLK FT DYHDKGSEVYLLKKLTRLELSEDGDMEQHLQSFTDLVQRIADVGDEVPPKL FT QVALLLCSLPDSYDSLATVLEQRPRNELTIDLVKSKLLSEAEKRREKLGNT FT AGSSEKALRADYWKNRGGDREVGKDTRACFHCKKPGHIKRNCRILQREISS FT SQDSSRKDEELCSGLQQKETAKAKQAQSVMEGPIAFMTGQGELSDWFVDSG FT ASRHMTGNESFFAELKSTIGVSVLLANGEKAEAKGIGSGCIVGIDSVGAPV FT DIQLNDVLFVPNLTSGLISVSALARKKFSVDFSANQCEIKNKAGIVVVVAD FT RCGSLYRLCTPDRMLCDQGKGRKINSRKSRVTNKVEKAQKRLDVVHISLSD FT LMVNTPSGNKHYLTFVDEFTGMTFVYFLRKKSDVCFKMMDFIDFCKTKCGE FT MPKMILQDGGKEFMDTDLQRVLDNEGISVSFSPPNSAVAQKKSRYLKKIAE FT CMLIDSKLDRKYWGEAISTATHIQNRSTSKFKSISSFERWFGQKPNIGNFR FT RFGSKALVRISATMQEKALIFVGYSNQNQAYRFLDLKSGGIFVSRSVKFME FT GAGIRRGHPPDECLQESIVILEPLIPDEERLVRQEELSQQRVRVKQVADES FT VHPEDDLEYQADDTLSFESCEEVLYSDDQVQTGTEKAIEPQPDSSRPGRST FT KGVLPTRYKDYVVSPFPKKQQKPVVLHDDFKACSVDEIWPSTAQSN" XX SQ Sequence 2552 BP; 764 A; 449 C; 683 G; 656 T; 0 other; ggttatgggc gccagtatta attctcgaaa ccattagtaa tttgcggtcg aaaatatcgt 60 gattgttcga acagtgatta cagcgagtgg aagaaaataa gatggcggaa caacgattac 120 caggattccc actcacgaag ctcaataacc ataatttcca atcatggaaa tttaagatgg 180 agatgatgtt agtgcgtgag gagctatggc acgtgatagc ggaggaaagg cctgatccag 240 tcactgatac ctggaagaag gcggatgcga aagcccgagc cacaataggt ctttgtatgg 300 aggataatca gacaagcctg atacggagtt gtacctgcgc aaaggatgca tggaagtctt 360 tgaaagatta ccatgacaaa ggatcagaag tctacctgtt gaaaaagctt actcgtcttg 420 agctatcgga ggacggagac atggagcagc atcttcagag ttttacggat ctagtgcaaa 480 ggattgcgga tgtaggtgat gaggttccac cgaaattgca agtagctctt ttgttgtgtt 540 cacttccgga ttcctatgat tccttagcaa ccgttttgga acagaggcca agaaacgagc 600 ttaccatcga tttggtgaag tcgaaactgc tttcggaagc cgagaaacga cgtgaaaagt 660 tgggaaacac tgctggatca agtgaaaagg cactgcgagc ggattactgg aaaaaccgtg 720 gaggtgatcg agaagttggc aaggatacac gggcatgttt ccattgcaaa aagccaggtc 780 acattaagcg gaattgcagg attcttcaga gagagatttc gtctagtcag gattccagtc 840 gaaaggatga agaactatgt tcgggattgc agcagaaaga aacggcgaaa gcgaagcagg 900 cacaaagcgt aatggaaggt ccaattgctt tcatgactgg tcaaggagaa ctgtctgact 960 ggtttgttga tagtggcgca tctcgtcaca tgacaggtaa cgaaagcttt ttcgctgagt 1020 tgaaatccac gattggtgtg agtgtgttgt tagccaacgg agagaaggca gaagctaagg 1080 gaattggcag tggttgtatt gtcggaatag acagtgttgg tgctccagtt gacatacagt 1140 tgaatgacgt tctatttgtt ccaaatctga ccagcggatt gataagtgtt agtgcactag 1200 caagaaagaa attctccgtc gatttcagtg caaatcaatg tgagatcaaa aacaaagcag 1260 gcattgttgt ggttgttgct gatcgttgtg gaagcttgta tcgtttgtgt accccggatc 1320 gaatgctgtg tgatcaagga aaaggaagga aaataaactc gaggaaaagt cgtgttacga 1380 ataaggttga aaaggcacag aaaagactgg atgtggtgca tattagtttg agtgacttga 1440 tggtaaacac gcctagtggt aacaaacact atcttacttt tgttgacgag ttcaccggaa 1500 tgacttttgt ttactttctc cggaagaaat cagacgtgtg tttcaagatg atggatttta 1560 ttgacttttg caagacgaag tgtggtgaaa tgccgaaaat gatcctgcag gatggtggaa 1620 aagagttcat ggataccgat ctacaacgtg ttctggacaa cgaaggaata tctgtatcat 1680 tttcacctcc gaatagtgct gtggcgcaaa agaagagtcg ttatttgaag aagatcgcgg 1740 agtgtatgtt gattgattct aaacttgatc ggaagtattg gggtgaagct atatcgacag 1800 caacgcacat tcagaatcga tcgacttcca aattcaagag cattagctct ttcgaaagat 1860 ggttcggtca gaaaccgaat attggaaatt tcaggagatt tggttcaaaa gcgttggttc 1920 gcatttcggc caccatgcaa gaaaaagcgt taatttttgt gggatattcc aaccagaatc 1980 aagcatatcg atttctggat ctgaagagtg gaggcatatt tgtcagtcgt agcgtcaaat 2040 tcatggaggg agctggaatc aggcgtggtc atcctcctga tgaatgtctg caggaatcga 2100 ttgtcatatt ggagcctttg attcctgatg aagagcgatt agtgagacag gaggagctat 2160 cacagcaacg agttcgggtg aagcaggtgg cagacgaatc tgtccaccca gaggatgatc 2220 tggaatacca agcagatgat acattgagtt tcgaatcatg tgaagaagtt ctctatagtg 2280 acgatcaggt gcaaactgga actgagaagg cgattgaacc ccaacccgat tcgtctcgtc 2340 caggtagatc tacaaaggga gttttaccaa cacggtacaa agattacgtc gtcagtccgt 2400 ttcctaagaa acaacagaaa ccagttgtac ttcatgatga ttttaaggcg tgctctgtgg 2460 atgaaatctg gccatctaca gcacaatcca actgactgca ttttttcgaa tgatctgtag 2520 gatcaacgca gggatgtttt cgcgaggagg ag 2552 // ID BEL3-LTR_Dpse repbase; DNA; INV; 319 BP. XX AC Unknown_group_180; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL3_Dpse; KW BEL3-I_Dpse; BEL3-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-319 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1017-1017 (2009). XX DR Genome; Unknown_group_180; Positions 5990 6308. XX SQ Sequence 319 BP; 119 A; 41 C; 68 G; 91 T; 0 other; tgacgcccct taatagaata taggattaaa ttacttaata atggttaaca atttgaagat 60 ttgaattaat atgtaaatag tactaaaatt gtgaattaat ttgaagttgg cgcgttacat 120 tatgtatgta cgtataaaga agcgcaatcg atagtatcaa tggtatcgag ccaaaacgat 180 ctgaatgatg atcactagat ttaagatata tatacgaaat aaacgaacga gcaaggaact 240 acatcgagcg gattaaggaa ggtctgaatt gtgtagacgg attccactgg aaaactgttg 300 gagggactta gctcgtaca 319 // ID Kolobok-6_HM repbase; DNA; INV; 2199 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2199 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2064-2064 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(471..1004,935..1411) FT /product="Kolobok-6_HM_1p" FT /translation="MKTDALRRRSIGASRVFKKRRRTSRTSSSSIFPQTEN FT TFASRELPLIESNDLSILPNFTRSTSNVVANNEAVPQPQSNKNESNYILID FT LDIIVQIITLVGTCPDCESKSINLNIDHKNKKGLSSLLILNCSECLWKTKF FT YTSKKVSSKTFEKSFDINLRSVIAMFIKKNQVCLLLLGTYKFTFCNCNVYK FT EKPGLPTVIRDIIKPIFIDLSDENLLKRCLHGKTQNNNEALNAIIWKRCPK FT DIYVGQTALEIGAASAVINFNEGLAGILNVYLELGMNPGKNCITFCNQRDN FT QKIKNIAKKLTDKVKQRRKQIRAKRKGYSDQDEEREGIAYPGSF*" XX SQ Sequence 2199 BP; 754 A; 277 C; 338 G; 830 T; 0 other; ggtggtagta tcgtcagaaa tttcaaattt tttgattttt tgaatataac ttttcctggc 60 attttcattc atgctgaaaa cgatggaact attatttttt caatataatc tatataaagc 120 cttctatttt tcattttaaa aaaggtacta tgaattattg ctccttagca actccatagc 180 aacaaaagta aacatcatgt attggttctt attgagctgc cttaaacaag tattttcttc 240 agtaagtgtt tagaaaatac taagtgatat ttggcttctt tttttgcctt aaaaattaat 300 aaaccagcat ttataatcac tttatttctt ggcgtgattg taaagtggct ttaaagcaaa 360 tattatttca gttattaatg ttgaatatag cctgagttta attgcttagt attgttattt 420 ttagttattt agtgtatatt aaaataaata aattacttct aatcgttaaa atgaaaactg 480 atgctttgag gagaagaagt attggtgctt ccagagtttt taaaaaaaga agaagaactt 540 caagaacatc aagttcgtct atttttcccc aaactgaaaa tacttttgca agcagagaat 600 tgccattaat tgagagtaat gatttaagca ttttacctaa ttttacaaga tcaaccagca 660 atgttgttgc taataatgaa gctgttccac aacctcaaag taataaaaac gagtctaact 720 atattttaat tgatcttgat attattgtac aaattattac cttagttgga acatgccctg 780 actgtgagag caagtctatc aacttaaata tagatcataa aaataaaaaa ggtttaagtt 840 cattgttgat attgaactgc agtgagtgtt tatggaaaac aaagttctat accagtaaaa 900 aagtttcaag taaaactttt gaaaaatctt ttgatataaa tttacgttct gtaattgcaa 960 tgtttataaa gaaaaaccag gtctgcctac tgttattagg gacataatca aaccaatatt 1020 tattgactta tctgatgaaa acctgttaaa aaggtgcctt catggaaaaa ctcaaaacaa 1080 caatgaagcc ttgaatgcaa taatttggaa acggtgtcca aaagatattt atgtggggca 1140 aactgctttg gagataggtg cagcatctgc tgttattaat ttcaatgaag gattagctgg 1200 tatactaaat gtttatttag agcttggtat gaatcctggt aaaaactgta ttacattttg 1260 caatcaaaga gataatcaaa aaattaaaaa tatagcaaaa aaattgactg ataaggtaaa 1320 gcaacgtcgg aaacaaatta gggcaaaacg aaaaggttat agcgatcaag atgaagaaag 1380 agaaggaata gcttatcccg gaagttttta gttgttttgc tttatttcaa ttgatataac 1440 tatttaattt aaaattataa attcggttgt cgtaattttt acttttatat attttttatt 1500 ttattttata tttttcattt tttcattaca aggatataac cttctatatg gtacttaaaa 1560 aagtaattaa aattaaagat aaacttagtt tattgctttt aatacattga ggagtgatta 1620 caaagtattt agaaatataa cgagaactta tttgatattt taacttgttt tttcgatgtt 1680 aattttatta aattgctgtt ttctcagatt gccatttttt agcacggggc gataaataac 1740 ttgggaacca cttgttatgg tgctttaaaa ttttatacag ttgtaataac atatatgatg 1800 agctcattga agcagtatat ttttttaata ttattttttt gcagagttat gggcttctaa 1860 agggggctgt tttggggtaa attttgaccc actagttata tatgctgtat ctccttattg 1920 atttaagcta actttacgat tttgcttcca tgagttcatt aatatattat aagtaaacac 1980 tgaaagtttg agttttatag catctaccat tgtctcatta tttatcgcag cgtcttttga 2040 cttttttagg tttcgcgacc taaaacttgg gtccaaatgt aaaattaatt tttttttttt 2100 tgttacttca ttatttttgc aattttttaa tactatccaa aaaaatcaat agtctttgtt 2160 caaaattaaa gattttagaa ttctgaggat actaccacc 2199 // ID Merlin-1_HM repbase; DNA; INV; 2198 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Merlin; DNA transposon; Transposable Element; Merlin-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2198 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2198 BP; 784 A; 288 C; 349 G; 777 T; 0 other; ggcggacttt aaaacggcgc accccaaact cggtaaacaa cgaaaatttt cgttacgaaa 60 attaaaagtc gattttcatt ggctaataaa ttaaattaga cttaagcacg ttttcaactt 120 ttcaagttca cttcagtaca cgacaacttc aagcaagact acttctttgg cagaatttat 180 tgaaagcatt aaagaaaaaa ggctaataaa ttttctccat caaaacagac taatccatgt 240 tcgcaaatct tgtatctgtg gtcagcaaat gactattcaa agatttgagc ggtgtctagt 300 aagtaaatat catatttaat catatcatgt ttcatgttat atgtattata ttcgtattat 360 atcataaaga ccttataaaa tcggaaaata ataactcaaa tgaaacttaa gcttgtttac 420 aatttaaata ctaatcagat tagacttggc ttatattaaa aaagcaaaca agctaaaatg 480 ttattatgag ttcaaataag ttattatgtt attatgatta ttgattatga attattatga 540 attatgagtt attatgttat tatgttatca aatgagttat tatattatta tgagttatta 600 atccttataa aggattaaat tacttacata aaatctaatc ctgttaaatt ggattttgtt 660 taaacttttt ctgatgcttt ttttctttaa ggataaacac atttggcgat gcagcagctg 720 taagaagaca aagactatac gaagtggttt ttttgaatgt tcaacatctt taagtgtaag 780 tgatatactt aaacttatgt tgttatgggt ttcagaggtc cctgctgctg ctgctgctga 840 attagtaggg ataagccaac caacttcagt tcaatggtat cagtatttca gagacatctg 900 ctcattcaaa gttttagatg ttgcagctgt aaaccagctg ggaggtccag gacacatagt 960 tcaaattaat gaatctttat ttttcaaacg caagtataat gtgggtcaca acgttgaaaa 1020 gcattggatt ttcggggctt atgatacaac aacttaaaaa agggtatttg actagggtag 1080 aggatcgcac tgcaaataca ttagtacctt tgattcaaac ttggattgct cctggttcaa 1140 ctatacaatc agatgaatgg gcttcgtaca ataatttaag caacttaggc tatattcatt 1200 caactgttaa ccataccaca aactttgttg accctgaaac agggacaaca acaaaccaca 1260 ttgagtcctt ttggtatagg atgaagttta aattaaagtt tgtttttggt tatcaaggtg 1320 atatgaaatg gtctaggttg gatgaggcag tatataggga atattatgag tttaaaaata 1380 aagatttatg gcaaaatttc aatatgttcc tgggtcatat tcacgataag tatcctcttt 1440 aaattacaca tttattaata ttttaattct attttaattt atgaatttat tttatgaatt 1500 tttggagcaa atactcgaat tttctcaaat gatcacatac tgtgttacac tgatgacgag 1560 aagatacaca tactgatgat gagaagaata tacatagagg aaaagcttaa aacgacgaga 1620 ttaagaaaaa aagacatttg aagattgaaa gatagtgaag attttgaata aaaaaaagtt 1680 aatgaaagaa aagtaaactg aagaatatca gaacatttat tttaaaaagc aataatatat 1740 tgtaattatt tatttgagat ttaaaactgt taacacctat acataacttt ttttaaagat 1800 tttttgtaaa tttttaactt attacaaaac atttctcaag taagcctttg tgaaatagta 1860 aaaccttact taaaataata aagccttaaa aaaaacttaa ttaaattaat aaaacttcta 1920 cttattttaa ttctaatatc agtagtatag gtgtaaatgt tttaatttta taattgtaaa 1980 tgtttatatt tgatttgtaa atatttaaat taaagtttta aattttaaaa atggttttaa 2040 aaagtttgtt ttttttcttt cgaggaagcc attttggcac ttgtcgtaac ggcgcggttt 2100 taccattctg taaaatactc agtatattta ccgagaattt tcgtaacgaa agctttcgtt 2160 gtttaccgag tttggggtgc gccgttttaa agtccgcc 2198 // ID Gypsy-590_AA-LTR repbase; DNA; INV; 273 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-590_AA_; KW Ty3_gypsy_Ele50; Gypsy-590_AA-I; Gypsy-590_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-273 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 273 BP; 76 A; 72 C; 55 G; 70 T; 0 other; tgaaccagcc accacaggtg caacgtcaca tgcaccgtca tcacctttgg atcggaccac 60 tgccacgtgc tatgcaacgt caacaccttt atgcccaaca gccggaattg ctaggtcagc 120 agttcaattc tacaaaacca ccggactgac cgtgaccggt gttcagttta tttttatccg 180 tgtttcattg cgtttcggct taagtaataa agtgatttag tgaggaaaat cagttaagtg 240 ttttttaaaa agcccgaacc gccagcaaac cca 273 // ID CR1-51_BF repbase; DNA; INV; 1509 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-51_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-51_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1509 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1509 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1622-1622 (2009). XX DR [2] (Consensus) XX SQ Sequence 1509 BP; 399 A; 359 C; 247 G; 504 T; 0 other; atacatcaat gatattgtaa atgatctcgt ttcgcaaccc tacctcttcg ccgacgacag 60 ttctcttctc gagatagtta agaacccttt tgattctgca gcgatcctaa actctgacct 120 gtctaaaatt cactcatggg cctgtcagtg gctcatggag ttgaacccac agaaaactga 180 agaaatgtgt atttctgtca aaagacatcc cattgttcac ccacctcttt atctcgacaa 240 ttgtgtgatc caatctgtta tcacccataa acatatcgga gtgattttaa gttcaaatat 300 gtcttgggat gcacatgtca atcatattgt ttccaaagtt tcaaaaagtg ttgatctctt 360 cagtggtctg aaatttaggc tcccccgcaa tgtcctggaa acgttatata agtcatacat 420 tcgtccatgt ctcgaatacg ctgacgtcat ttggcacggc acaacaagtg agcaatcaaa 480 tctcattgaa cgtatccaat atcactgctc aattgtcgtg acaggtgcca taagggggtc 540 gtcgtatttg tcatgtcgtc aggagctggg ttgggaaacg ttatcagata gaagacatgt 600 ccatcgtctc tgcttgttct tcaagattgt taacggttta actcgtgatt atctagtaga 660 atttgtccct ccagctattt cttctgaatg tacatacgat ctacgtaaca agctaaacct 720 aaggtctacc aatctctcga caaaccgctc tctaaagtcc ttctatccat actgtatatc 780 acactggaac aacctcgacc cctcaacccg ctccttaaac ttttgccagt ttaaaacctt 840 tattttcaaa cagttccgcc ccgctaaatg taaatatctt agtcatggcc cccgctatcc 900 ctgtcttctg ctgactcgtc ttcggatcgg cacctgcagc ttgaactcca gtctcttatc 960 ccgaggccta cgtaacagcc cagcatgtag ctgcgggtgc cggtccgaaa cagtcgccca 1020 ttacctactt cactgcccaa attacaacaa ccaaaggtta cttctcctcg gcaaactttc 1080 tcggctttta ggacagtcat ttaacttatt tacacattcc gaaaatatgc atatttcttt 1140 attacttaat ggaagtcctt ctttctcata ttccttaaat tccaaaattc ttattttaac 1200 ccaaaacttt attacaatat ctaaacgttt tatatcctga tttatacttt ttagttctct 1260 gcaaatctga tcttttgtta ctggggtagg ggatccggca tgcgctcata ccaacttttg 1320 ttacacatca tttaattttt acttgatgtt agtattattt atatcttcct taattttgta 1380 tttcgcttag ttcagttatg tctatgtctt tgtatatatg tatagtggtg gcgtgaatat 1440 aagttcccaa cttgagtgcg ccaccactat gtctgtcttc tatgtcttat ttgtctgaat 1500 aaaaaaaaa 1509 // ID Gypsy-22_SI-I repbase; DNA; INV; 5160 BP. XX AC AEAQ01030800; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_SI_; KW Gypsy-22_SI-LTR; Gypsy-22_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5160 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030800; Positions 5824 665. XX CC Positions [4038-4511] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1115..4108 FT /product="Gypsy-22_SI-I_1p" FT /translation="MRLWVQRVTQVARIHGAPEDVTLLAATSRLTKAARKW FT YDHGKGPMLASWEAFKEAILKRFERRTLHYLALRKVEARRWNHQKESFHEY FT ASEKLALMYDLELSERETIHLLIGGIASRLLRGTASALKVEAVDDFLEEMS FT WITAATAEAAERSDGGRKAPMKPKEQCKGCGRRGHASQDCRSREVTCFGCK FT EKGHYQTDCPKRKKERATGAPGTSTSSAVAAVEDVSTVAALVSRSKTLYVS FT GAPIKIVALNQQACSLFALIDTGSPVSFVRRDVFAKYFRSGNAKVESVGQT FT YYAVNDTPVEICGTITAAIEFERLPGQAFHAKLNILKTSRTSAHLIIGRDF FT LNDQSIEVIYKPNVNKVEEGREDFPAELLQIGVMEAAGENSLLDEVETDFD FT AEVTGELKRTLADVEQSPTSIVADEYFVTVKLKDDSTYAYAPRKFAHSERK FT QIQQITDDLLTRRIIQKSISPYCARIVPVRKKNGNLRLCVDLRPLNDRVVK FT QNFPFPLIEDCIARLAGKTVLSFLDIRDGFLQIKIHPEHYKYFSFATPDGQ FT FEFTRLPFGYCEAPAEFQKRLAQILERFIRNDEVIVYMDDILIATTTVEQN FT LKILKEVLLELKSHNFEINYDKCQFLRRKIEYLGYMLADGKITLSDRHTQS FT IRNFTKPRNEFEVQRFLGLTSYFRKFIRNFATKAQPLYNLLKKTSKFEFDA FT QCEETFETLKRELTIYPILRIYNPAVETQLHTDASSVGLGAILLQKQGGDG FT WFPVAYFSQATKAEANYHSFELEMLAIVRAVERFNIYLAGIEFTVVTDCNA FT LVYAVNKANLNPRIARWTLQLQNYRFKLIHRAGKQMTHVDALSRQVGYLEA FT LPLERELEVRQFQDPQLKEIMEQLEFRDNEKFELIDGLLYRKGSDRPRFVV FT PSSMTNNIIRIYHDEIDGPLRGGKNSPRYIQVVLVFLNEKAYSRIHRKLCN FT LPNGRQIPAFEGGGNAGDLRRDRPMRNFTRRSFRSAGSDRGGI" FT CDS 3876..4898 FT /product="Gypsy-22_SI-I_2p" FT /translation="MAHCGAEKTHHGIYKSYWFSSMRKRIREYIENCVICL FT MADKSPHSREGEMQVTSGATGPCEILHADHFDPLEVTGGGFRHILLIVDTF FT TRFTWLFAVKSTGSKEAIKHLKSIFQTFGKPKEIVTDRGTSFTSNEFAGFL FT EKHKIKQRKVAVASPWANGIVERVNRFLKSSLKKLIDNPADWRTKLSRAQY FT VINNTAHSGIRGTPSKLLLGYEQREHSDAALVSLINKLAEIDRNFLEERET FT ARSNEEEATEKLRQYNKEYYDKRHGKPTMYHTGQFVMIRDRQPKPGENRKL FT KPEYKGPYAIAKVLRNHRYVVEEIPGLNVPPRPHNSVLSPDKIKPWIKM" XX SQ Sequence 5160 BP; 1499 A; 1231 C; 1460 G; 970 T; 0 other; gcgagaagct gccagggggt cgagcccgag cgagcgagca ataacgatta tatacccggt 60 catccgacaa cctgtatcat atatatcgga gaataaaaga ctacttcaac ctaacaccaa 120 atactctccg tcatcatttt aattatccta cagctcagaa gtgggattcg aacgcgtatc 180 ctccgcgcga agaatcgaga gcatcgaacc gggcaaagtt cgcgtaacgg gcaggagtga 240 gtccggaata atctcgcgtt tagagcgatc cgtgtggtaa ccggaatagt gtaccgcgat 300 tcggggcagc tacggctgag agaggtcggc gatcttgcga ggagatccgc gtggcgagat 360 aacgtgcggc cgcgatcgcg cgtaacgaaa gagacgccac gaggtacgcg tgtgaccacg 420 cgtaggacgg tacgtgatcg cgcgctgaag tgtagcggtt cccgcgcgag gaaaaagtga 480 aagaagatcg agaagatgcc acgagcgacc aacgccagta aaagggcggg acaggggtcg 540 tcggacgata ttcccgaaga agacgaggtg gacgccaacg gagcgctcga cgtggaaatg 600 gagcgcttaa gggcggaggc acaagctctc gggctacccg cgacagggag tcgcggagaa 660 ttgctcgcag caatagcggc gcgcttggcg agacgcgggt cgagggacgg actctcggtg 720 gccggcagta gccgggccga atccgtagat cagccgttgc cagagttcgg cgcggggcac 780 atggagcagc caagtgccgc cgcggccgac gggcaaagcc ccaccgcgaa tcagaatgcg 840 cggccggtcg agctggcgca agtgctcgcc ctcatgcaag gccagatgca acagcaggct 900 cagatgttgc aacaactgat gctcaccatg gcgggcggga cgaggatccc ggggactccc 960 ccgatggcag aaatcagtga ggagcggcag gttagccgtc agacgagcgg cagccggatg 1020 ccctcgcgag cgacagcggt gtcaacggcg aacccggggc aggcagctgc tatcgccgca 1080 aatcccggag ttcgccggaa cggagcaaga caacatgcgg ctgtgggtgc agcgggtgac 1140 gcaggtggcg cggatacacg gagcgcccga ggacgtcacg ctgttggccg ccaccagccg 1200 gcttaccaaa gcggccagga aatggtacga ccatgggaaa ggtcccatgt tagcctcttg 1260 ggaggcgttc aaggaagcga ttttaaagcg ctttgaacgg agaacgctcc actacctcgc 1320 gctcaggaag gtcgaggcca ggaggtggaa tcatcagaaa gaatcattcc acgagtacgc 1380 gtcagagaaa ctggcgctga tgtacgacct ggaattgtca gaaagggaga ccatacatct 1440 cctgatcggg ggcattgcca gccgattgtt acgaggaact gcatcagccc tgaaggtgga 1500 ggcagtggat gacttcctcg aggagatgtc gtggatcacc gccgcgaccg cggaggccgc 1560 ggagagaagc gacggcggca ggaaggcgcc gatgaagccg aaggagcagt gcaaaggttg 1620 cggcaggaga ggccacgcca gccaggactg cagatccaga gaggtgacct gcttcggctg 1680 caaagagaag ggtcactacc aaacggactg cccgaagagg aaaaaggagc gggcaacggg 1740 cgcaccaggg acatccacat catcggcggt agcggccgtg gaagatgtgt caacggtagc 1800 agcgctcgtg agtcgatcaa aaactttata cgtaagtggc gcaccgatta aaattgtcgc 1860 gttgaatcag caggcgtgct cattgtttgc gctgatagat accggcagtc cggtctcgtt 1920 cgttaggcgg gacgtgttcg cgaaatattt ccggagcggg aatgcgaagg tcgaatccgt 1980 ggggcaaacg tactacgcgg taaacgacac gccggtggaa atatgcggca cgataacagc 2040 agcgatcgag ttcgaacggt tgcccgggca agcattccac gcgaagttga atattttgaa 2100 aacaagtcga acaagcgcac acttgataat aggtcgcgat ttcctcaatg accagagcat 2160 agaggtgatt tataaaccga acgtaaacaa ggtcgaggaa gggcgagaag attttcccgc 2220 ggaattattg cagatcggag taatggaagc tgcgggcgaa aatagtctcc tagatgaagt 2280 agagacagac ttcgacgccg aagtcaccgg cgagctaaag agaacgctgg ccgacgtaga 2340 gcagagtccg acttcgatcg ttgctgacga atatttcgta acagtaaaac taaaagacga 2400 ttcgacatac gcgtacgcgc ctaggaaatt cgcgcattca gaacgaaaac aaatacagca 2460 gatcacggac gacctgctga cgcgaagaat tatccaaaag agtatttcac cgtattgcgc 2520 gcggatcgtg ccggtgcgga agaagaatgg caatctgcgg ctatgcgtgg acctgcggcc 2580 attgaacgat cgagtggtga agcagaattt tcccttcccg ctgatcgaag attgcatcgc 2640 cagactggcg ggcaaaacag tattatcttt tctcgatatt cgggacggat ttctacaaat 2700 aaaaatccac ccggaacatt acaaatattt ttcgttcgcg acaccggacg gtcaattcga 2760 gtttacaaga ttgccattcg gatactgcga ggcgccggcc gagttccaaa agagattggc 2820 acaaattctg gaacgcttta taagaaacga cgaggttatc gtctacatgg acgatatcct 2880 catcgccacc accacagttg agcaaaatct gaaaatctta aaagaagtac tgctagaatt 2940 gaaaagccat aatttcgaga taaattacga caagtgccaa tttctacgac ggaagattga 3000 gtacctcggt tacatgcttg ccgacggcaa gataactttg agcgaccgac atactcagtc 3060 tatccggaat ttcacaaaac cgagaaacga gttcgaggta cagcggttcc tcgggctcac 3120 cagctacttc agaaagttca ttcggaactt tgcgactaag gcacagccgt tgtacaacct 3180 gttaaagaaa acgtcgaagt tcgagttcga cgcgcaatgc gaggagacat tcgaaacatt 3240 gaaaagagag ctgacgatat atccgattct tcggatatat aacccagcgg ttgagacaca 3300 gctgcatacc gacgcgagct ccgtcgggct aggcgcaatc ctgctgcaaa aacagggcgg 3360 cgatggttgg tttccagtag cgtattttag tcaggcgact aaagcggaag ccaactatca 3420 cagcttcgag ctggaaatgt tggccatcgt cagggccgtc gagcggttca acatttacct 3480 cgcaggcatc gagttcacgg tggttacgga ttgtaacgcg cttgtctacg ccgtgaacaa 3540 agccaacttg aaccctcgca tcgcgcgctg gacgcttcaa ctacagaact atagatttaa 3600 actgatacac cgagcgggca agcagatgac gcacgttgac gctctgagcc gtcaggtagg 3660 gtacctggag gcgctgcccc tcgagagaga gctcgaggtc aggcaattcc aggacccaca 3720 attaaaggaa ataatggaac agctcgagtt ccgggataac gaaaagttcg agctgatcga 3780 cgggctgcta taccggaaag gctcggaccg tccacgtttc gtcgtgccca gctcaatgac 3840 gaacaacata atccgaatat accatgatga gatagatggc ccactgcggg gcggaaaaaa 3900 ctcaccacgg tatatacaag tcgtactggt tttcctcaat gagaaagcgt attcgagaat 3960 acatagaaaa ttgtgtaatc tgcctaatgg cagacaaatc cccgcattcg agggaggggg 4020 aaatgcaggt gacctcaggc gcgaccggcc catgcgaaat tttacacgca gatcatttcg 4080 atccgctgga agtgacaggg gggggattta gacacatcct gttgatagtt gacactttta 4140 cacgcttcac gtggctcttt gcagtcaagt cgaccggaag caaagaggca atcaagcact 4200 tgaaatctat ctttcaaaca ttcggtaaac cgaaagagat cgtaacggac agaggcacct 4260 cgtttacctc caacgaattt gccgggtttc tagaaaagca taaaatcaag caacgcaaag 4320 tagccgtagc ttcgccatgg gccaacggca tagtagaaag ggtcaatcga tttcttaagt 4380 cgtcccttaa gaaactgatc gacaacccgg ccgactggag gacgaaactt agcagggcgc 4440 aatacgtaat aaataatacc gcgcactcgg ggatccgagg caccccatcg aagttgctac 4500 tcggatacga gcagcgagaa cactccgatg cggccttagt aagcctaata aataaattag 4560 cagaaatcga ccgaaatttc ctcgaggagc gagaaaccgc gcgaagtaac gaggaggagg 4620 caacggaaaa actaaggcaa tataacaaag aatattatga taagagacat ggtaaaccca 4680 ccatgtatca cacgggtcaa tttgtcatga tacgcgaccg gcagccaaaa ccgggggaaa 4740 accgcaaact aaaaccggag tacaaagggc catacgccat cgcgaaagtt ttgagaaacc 4800 ataggtacgt ggttgaggag atacctggtc tcaatgttcc accgagacca cataattcgg 4860 tactatcacc cgacaaaata aagccatgga tcaaaatgta agcgcgtggc ttagaggtaa 4920 acacttagac ttaattacat aagttagcat taattagaat aagcgttaca atttaatagt 4980 ttaaaggtaa atacttagac ttaattacat aagttagcat taaataaaat aaatgttaca 5040 atcagttagt agtaaagaag aaaaaaaagg ggagagggaa aagaaataat aagatgtata 5100 tatgtatatg cgtttcggtc acacagactc gagacgagcg taaagtcagg ctggccgagc 5160 // ID Jockey-N3_CQ repbase; DNA; INV; 1612 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1612 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 586-586 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. ~10 bp TSDs. This family encodes a protein similar to CC Jockey ORF1p but does not encode ORF2p. Thus it is a CC non-autonomous non-LTR retrotransposon derived from Jockey, CC like CC HeT-A. XX FH Key Location/Qualifiers FT CDS 534..1484 FT /product="Jockey-N3_CQ_1p" FT /translation="MSTCDVQPEHKLTRYGTKITCFSSDDFDTVQAHLKKN FT KVQFYTHGKRSARPHRVVMRGLPNAEPDYVKELLKTEHQLDVLAVHAIRRK FT QQLPAIDETPFIVLFPKGHTSLKELSSKVKKVGPVVVRWVAYRNKEPHVTQ FT CKNCLQFGHGTSNCHLKPRCSSCGGAHSTEKCKAEETEAKKCVNCSGSHEG FT LDRSCPKRAQFIQSRQQASKPKPPAWKKDKQTPAGAAFTAADFPPLPGAVP FT SVGTEEKHPRPAGRSRENTGAGAGATPKEQQGERKLYSESELWAIYREYRV FT RLRQCKTPEEQIDVIAHLLTHGAGK" XX SQ Sequence 1612 BP; 438 A; 415 C; 487 G; 272 T; 0 other; cattccagtt tgacagcttg cccagagcag acgtgtgtcc ttttagcgcg tttttgacgg 60 tgtttttttt aactttcgcc gacggccatg gcggcggccg cgacggcgga gagtgagtgg 120 acggagcaca gggctgagga tggaaggccg ttctactgga acgcggcctt gaagaaaagt 180 gtgtgggaga aaccggacgg gttccaaccc aaaacgcagg cggcgacggg catggaggtg 240 gaagactccg aaggaggagg agaggacggg ggcgaatttc gtcccgtgat caaggttcgg 300 cgcaggaagg ctaaggagga gatcaacgtc ggcgatggcg cagcgaagtg gagaaactgc 360 tcagcaacaa caagttcagc ccgctagcgg agaagagcaa caacaacaac aatgcgaacc 420 cagcagcaga aaaaaccacc cccgtggtac cagcacccgg caagtcggcc ggggagaaga 480 agcagccgcc gctggtggtg aaagaaacga gcttcgcccg ccttgcgaag gtgatgtcga 540 catgtgatgt ccagccggaa cacaaactga cgcggtacgg caccaaaatt acgtgcttct 600 cgagcgacga cttcgacacg gtgcaagccc acctgaagaa gaacaaggtg cagttctaca 660 cgcacggaaa gcgcagtgcg agaccgcacc gggtagtaat gcgaggtctc ccgaacgcgg 720 aaccggatta cgtcaaggag ctgctcaaga cggaacatca gctggacgtc ttggcagtac 780 acgccatcag gcggaagcag cagcttcccg ccatcgatga gacgcctttc atcgtgctct 840 tccccaaggg gcacaccagt ctgaaggagt tgagtagcaa ggtaaagaaa gtgggaccag 900 tcgtcgtccg gtgggtggcc taccggaaca aagaaccgca cgtgacccag tgcaagaact 960 gcttgcagtt cggtcacgga accagtaact gccatctcaa gcccaggtgc agcagctgtg 1020 ggggtgccca cagcacggaa aagtgcaaag cagaagaaac ggaagccaag aagtgtgtca 1080 actgctctgg atctcacgag ggtctggacc gcagctgtcc caaacgtgcg cagttcatcc 1140 agtcgaggca gcaggcgtcc aagccgaagc cgccagcatg gaagaaggac aaacagactc 1200 cggcaggagc cgcgttcacc gcggcggatt ttcctccgct acctggagcg gtgccgtccg 1260 tcgggacgga agaaaaacat cctcgtcccg caggaagaag tcgagaaaat actggcgccg 1320 gcgccggtgc aaccccgaag gagcaacaag gcgaacggaa gctgtacagc gagtcggagc 1380 tgtgggccat ttaccgagaa tacagagttc gcttgaggca gtgcaagaca cccgaggaac 1440 agattgacgt gatcgcacac ttgctgacac atggcgcggg aaaatgatca tctattcagt 1500 tacggttttt ttatttttct tttattattt tttgtactat tgatcctcgg tcctaacctg 1560 gtcacagcac ctaaaaggac ctaataaaaa taagttatga aaaaaaaaaa aa 1612 // ID Copia-23_NVi-LTR repbase; DNA; INV; 345 BP. XX AC . XX DT 01-JUL-2009 (Rel. 14.07, Created) DT 01-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Copia-23_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-345 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(7), 1515-1515 (2009). XX DR [1] (Consensus) XX SQ Sequence 345 BP; 80 A; 82 C; 82 G; 101 T; 0 other; tgttgagtat aagcgagcag aattcgtccc gccagcgact ccatgcgtgc aaggcatgat 60 gggactgcgc tctccccctc tcaccggcct cgacggccag cgtcgacgtc gcggctgcgg 120 cggcggcgct cacttctctg tgtgctctca tgcgagaaag acgcgttcgc attcttatac 180 gttttgaata tacttatttg tgttttatat aagctggatc tttgtaataa catactctgt 240 ttataacaga tcgaaggtaa gctcgatgtg aataaagaat tgtcatttgc cataaagtgt 300 gttagttatt ttcccaagac acgccgggtc gattaatttt caaca 345 // ID Gypsy-11_RP-LTR repbase; DNA; INV; 319 BP. XX AC ACPB02037242; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_RP_; KW Gypsy-11_RP-I; Gypsy-11_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-319 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02037242; Positions 30345 30663. XX SQ Sequence 319 BP; 83 A; 31 C; 71 G; 134 T; 0 other; tgtagtaaat ttagatgtat tggaatgtta gatatttggc aaccctgttg agggtgtaag 60 agaagggctg accgttaggg gtagctagag gtagcaggta gttggagaca ttccttagtg 120 gagtagagtt gagtttggtc actctttatg gaagttatgc cttatgaatg ttatgtctta 180 tgaattatgt gaataaacta tatttaggtt aattagtgtt tattcaaccc ttatgaaatt 240 agtaaattat tattatgttt tatgtgtctt agttatttgt gttttccccc attttgttat 300 ttttatcttt tattttaca 319 // ID Gypsy-123_AA-LTR repbase; DNA; INV; 154 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-123_AA_; KW Gypsy-123_AA-I; Gypsy-123_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-154 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 426707 426554. XX SQ Sequence 154 BP; 53 A; 32 C; 31 G; 38 T; 0 other; tgcgcactcc cctgcgcatt cgaaataaaa cgtcaatgat cgtttttact tcgggagttc 60 gatctgcgat cgaacggtta cctatgacta tgaaataaaa cagagaggca gaaagttgtg 120 agacactgaa caaacaacaa gtaatatttc taca 154 // ID R1B_NVi repbase; DNA; INV; 6140 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia DE vitripennis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1B_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6140 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 541..2376 FT /product="R1B_NVi_1p" FT /translation="TVPRTERPPYGLALQGGRYELDVMTNKNQASDKTKEG FT KGGKENVRRSRRRVSSASSASSGDRNRTPYRGPRSRSQSRSGTGLRDEPSD FT ALDPKSKKEHQRREKAREKDLRDQELERAILKERGMEAGKVTEERGMETEE FT GVMETEDIPEPTAQPENEVIVEEKEEYAVKNKRTLKRKTDAEEKRLSDEDK FT KMKKMKNRQKRAEARAEITAKLKEEAKDRQRKKVDKRDEKEKSLQLLKELK FT EATEKLAGMKDVNLIETESDGESEEEDGEGRVKESVHASALETVGQLANEI FT RLFMLSQLNVNKSVCVTVFEKLGKMEQIMNGLVCENEGLKSKLECMGAKEG FT ANANVGASSASAAVPRAAPATPAVRPSYAVIVRGKENESSDMVRKKIVDEV FT SKNVDVRVKAVRKVRDGVVVETVSVNECNRLLNEKQFERVGLQVTEARRFG FT PKVILYDVPCSMTDETLLTEVYQKNMDGCVSERDFHESVKVLFRSGKKGMD FT VSNVILEVSPRVKMRLLGQSRVYVKWMSFRVNEFERINRCFNCFDFGHVAR FT QCKRERVCVRCCKSGHVGSECREQFNCGNCARRGMPAGHPVTSSECPGYKM FT RLRWQKERTLQDGE" FT CDS 2342..5506 FT /product="R1B_NVi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="GGKKKGHSKMANREARFLQLNCQRSMNVMSDLSECIV FT NEGIDVCLLQEPYFAHGKVCGLPSGMRVFWSRSGLSAIVVNDLNCVCMLIE FT ELVSENSVCVSVKGEYGEIFMISLYSKYDGEIENDIEYLEKVFECLRGKRV FT LVCMDANASSELWFSKGVYVGKPRGNRGEKLCEWIAGSEWSVDLLNEPSEW FT YTFDGPRGQSDIDITLAGGFENECEFTWSVMCEWSLSDHNPIMIRMRMNGG FT RENEANVDLRWRMKTNKWSKYTKRLREIAYEFGYGEYVELCAREKVKKMDE FT WITRVNDECFERVSVHNNKVVWWNKELERMKKNVCKLRKTYQKMRCKNDER FT MEESRNEWKKCETQYARKLKEEKNDDWKKYMSTKGNQKPWKARNICIGKRR FT EELASLRKAGGGLTTSWAETADLLLNEFFPPDDGVPAPEVQELFLDVNDRE FT YEWNEISMAVKKMNMRKSPGMDGIMNEMILRVHRAIPGFMKEMYDSCLRES FT YFPNEWKRAKVVVLLKSVDKDKASPRSYRPISLLPGLGKVLERMMVERLTK FT IVNGRWSEYQYGFRNGRSCEDAWNKMKESVNASEHKYVCGVFVDFKGAFDN FT LLWRTIMATLEEYGCGGMDLEMWKSYFTDRRVCMKNSMEEVWKQVSRGCPQ FT GSIGGPILWNLSMNGMLIELAGSGVSVCAFADDVAILAEGNTRKEVERVIN FT EKMEIVYKWGEKMGVSVSEEKTVCMLLKGKYVTSNRAVRVSKTINVYKRIK FT YVSCFKYLGVNVTEGMGFGEHVRGMKVKVAKGVQKLKRVLRRDWGLRRAAS FT HLVLRGTFLPQVSYCASAWYEVLKYEYGRNALNAAMRYVMYACLNVCRTVS FT TEAMQVLTGWLPWDLECLKRANVYKVRKGLSMNEMDVVKNEEVESMSVIKL FT ERLVDERVYEIWQERWDVSMKGRVTARFIKDVKYAGRSRSFEPDRWVVNIL FT TGHGTLNAFLHKRGLSESASCMCGDVSEDWEHVLCRCSMYESFRNLDEMGV FT TVCVNGEYEFTGVLSSNVTYGKMCEFINRAYEMRESVRRRLNLELDVNG" XX SQ Sequence 6140 BP; 1797 A; 970 C; 2000 G; 1373 T; 0 other; ccccctcacg atcactggct tgtcgcgccg agggggcccg agttcaggag tttgcgcaag 60 cgaactcctg ttccgctaaa gtgatccaaa cccaggaagt ggtggaggat ccggtgcttt 120 ggctagccga ccccactctc aggaaccgat ccgcgaaagc ggaaaggaga ccctgagaag 180 tgtgggaacg caaaccgaag cgccggatcg gaagccaccg ggcttgacaa ccccggggtt 240 cctagtccca agcgatgagc gccgtcgctt ggcccgtaaa ccttgcaccg tcttcaacaa 300 ccttcgcagg gcgtggcgtt ggtacgttgg cggtgtgggg agcgggggct cggcttaacc 360 gcccggccgc gaggtggggt cctctataaa acccccccaa tcctacgctc aggtgggtgg 420 ctatgggatg aatggctcct gggtgaggag tcccaagccc accaaccccc gttggctgtg 480 gcggctgacg ggatgtactt gccttaccgg ggtagggccg taaccccggt ggacctatag 540 accgtgccta gaactgaacg ccccccgtac gggcttgcct tacaaggggg gcggtacgaa 600 ctagatgtca tgacaaataa aaatcaagcc tcggataaaa caaaggaggg taaaggagga 660 aaggagaacg tgaggaggtc taggaggagg gtttcctcag ctagttccgc ctcatcgggt 720 gatcgaaacc gtacgcccta tcggggaccc cgttcacgat ctcaatctcg atcgggaacg 780 ggtcttcgag atgagccgag tgatgctctg gacccaaaat ccaaaaagga gcatcagaga 840 agagagaagg cacgagagaa ggacttgaga gatcaggagc ttgagagagc cattctgaag 900 gaaaggggta tggaagcggg taaggtaacg gaagaaaggg gtatggaaac ggaggaaggg 960 gttatggaaa cggaggacat ccccgaaccg acggcgcagc cggaaaatga ggtgatagtg 1020 gaagaaaagg aagagtatgc tgtgaagaac aaaagaacgc ttaaacggaa aactgatgca 1080 gaggaaaaaa gactgagtga tgaagataag aagatgaaga aaatgaaaaa tagacaaaaa 1140 agggcagaag caagagcaga aataacggca aaactgaagg aggaggcaaa agatagacag 1200 agaaagaaag tagacaagag ggacgagaaa gagaaatcct tgcaactctt gaaggagctg 1260 aaggaagcga cagagaaatt ggccggcatg aaagatgtga acctgatcga aactgagagt 1320 gatggagaaa gtgaggagga ggacggcgaa ggaagagtga aagagagtgt gcatgcgagt 1380 gcgctcgaga cggtgggaca gctggcaaac gagattcgtc tgtttatgct gtcccaactg 1440 aatgtgaata aatctgtgtg tgtgactgtg ttcgaaaaac tagggaaaat ggagcaaata 1500 atgaatggtt tggtatgcga gaacgaagga ctgaaaagca agctcgaatg tatgggtgcg 1560 aaggaagggg cgaatgcaaa tgttggtgcg tcaagtgcaa gtgcggcagt cccgcgtgct 1620 gcgcctgcta ctccagcagt aaggccaagc tacgcggtga ttgtaagggg gaaagagaat 1680 gaaagcagcg atatggtacg aaagaagatt gtggatgaag tcagtaagaa cgtagatgta 1740 agggtgaaag ctgtgagaaa ggtaagagat ggtgtggttg tcgaaacggt gagtgttaat 1800 gaatgcaata ggttgttgaa tgagaagcag tttgagagag tgggtttgca ggtcactgag 1860 gccaggaggt ttggtccaaa ggtcatatta tatgatgtac catgtagcat gacggatgag 1920 accctcctga ctgaggtgta ccagaaaaat atggatggat gtgtaagtga acgcgacttt 1980 catgagagtg tgaaagtact gttcagaagt ggaaagaagg gtatggatgt gagtaatgtt 2040 atactggaag tcagtccacg agtgaaaatg agattgctag gtcagagtag agtgtatgtg 2100 aaatggatgt cttttagagt gaatgagttt gaaagaatta acaggtgttt caattgcttc 2160 gactttgggc acgtggccag acaatgtaag agagaaagag tatgtgtgag atgttgtaag 2220 agtggacatg tgggaagtga gtgtagagag cagttcaatt gcggtaactg cgcgaggaga 2280 ggtatgcctg caggtcatcc cgtgacatcg tcggagtgtc caggatacaa aatgcgactg 2340 aggtggcaaa aagaaaggac actccaagat ggcgaataga gaagcgcggt ttctgcagct 2400 gaactgccag cgtagtatga atgtgatgag tgacctgagt gagtgtattg tgaatgaggg 2460 aatagatgta tgtctgctcc aggaaccata ctttgcccac ggtaaggtat gtggccttcc 2520 atcgggcatg agggtctttt ggagtagaag cggattgtct gcaattgttg tgaatgacct 2580 aaattgtgtg tgtatgttga ttgaggaact ggtgagtgag aatagtgtgt gcgtgagtgt 2640 gaaaggggaa tatggggaga tctttatgat ctctctatat agcaaatatg atggtgaaat 2700 tgaaaatgat attgagtacc tggaaaaagt gtttgagtgt ctgagaggca aacgggtgtt 2760 ggtttgtatg gacgcgaatg cctcgagtga actgtggttt agcaaaggtg tgtacgtggg 2820 caagccgaga gggaacagag gggaaaaact gtgcgaatgg attgctggat ctgaatggag 2880 tgtggatctg ctaaatgagc cgagtgaatg gtatacgttc gatgggccca gggggcagag 2940 tgatatagat atcactctgg ccgggggctt tgagaacgag tgtgaattta catggagtgt 3000 gatgtgtgaa tggagcctga gtgaccacaa ccccattatg attcgcatga ggatgaatgg 3060 tgggagggag aatgaagcga acgtggattt gagatggcgt atgaaaacaa ataaatggtc 3120 taagtatacg aaaaggttga gagaaatagc gtatgaattt ggatacggcg aatatgtgga 3180 actgtgtgcg agggaaaaag tcaaaaaaat ggatgaatgg attaccagag taaatgatga 3240 gtgctttgaa agagtgagtg tgcataataa taaggttgtg tggtggaaca aggagctgga 3300 gagaatgaaa aagaatgtgt gtaaactgag aaaaacttat cagaagatgc gttgtaagaa 3360 tgatgagcgt atggaggaat caaggaatga atggaaaaag tgtgaaacac agtatgcaag 3420 aaagttaaag gaagagaaaa atgatgactg gaagaagtac atgagcacca aagggaacca 3480 gaaaccctgg aaagccagaa atatttgcat tggaaagagg agggaggagc tcgcgagttt 3540 gaggaaggct ggtggaggtc tcacgacgtc gtgggctgaa acagccgacc ttcttctaaa 3600 tgagttcttc cctccggatg atggggtccc ggcccctgaa gtgcaggaat tgtttttgga 3660 cgtgaatgat cgagagtatg aatggaatga aataagtatg gctgtgaaga aaatgaatat 3720 gaggaaatcc ccgggtatgg atggtattat gaacgaaatg attttgagag tacatcgagc 3780 aattcctgga tttatgaaag agatgtatga ttcttgcctg cgtgaaagtt atttcccgaa 3840 tgaatggaag agagcaaaag tggttgtttt actaaaaagt gtggacaagg ataaagcttc 3900 ccccaggtcc tacagaccaa taagtctttt gccgggcctg ggaaaagtgc tagagcgaat 3960 gatggtagag aggttgacga agatagtgaa tggaagatgg agtgaatatc agtatgggtt 4020 tagaaacggt aggtcgtgtg aggatgcttg gaataaaatg aaagagagtg tgaacgcatc 4080 agagcataag tacgtctgcg gtgtgtttgt ggacttcaag ggggcctttg ataatctctt 4140 atggagaaca atcatggcaa ccttggagga gtatggatgt ggtggcatgg atctggaaat 4200 gtggaaaagc tattttacgg ataggagggt gtgcatgaag aatagtatgg aagaagtatg 4260 gaagcaggtt tccagagggt gcccccaagg ttctattgga ggtccaatac tctggaacct 4320 aagtatgaat ggaatgttga ttgagttggc tggaagtggt gtaagtgtgt gtgcatttgc 4380 ggatgatgtt gcgatactgg ctgaaggaaa cacgaggaaa gaggttgaga gggtcattaa 4440 tgagaaaatg gaaatcgtat ataaatgggg ggagaaaatg ggtgtgagtg tctcagaaga 4500 gaaaactgtg tgcatgttgt taaaaggaaa atatgtgacg agtaacagag ctgttagagt 4560 aagcaagacc ataaatgtgt ataaaaggat aaagtatgtg tcgtgtttta aatatttagg 4620 tgtgaatgta actgagggta tgggctttgg agagcatgta cgagggatga aagtaaaagt 4680 ggccaagggg gtgcaaaaat taaagcgtgt gcttagaaga gactggggat tgaggcgtgc 4740 cgcgtcgcac ttggtgctaa gagggacgtt tttgccccag gtctcctatt gtgcgtctgc 4800 atggtatgaa gtactgaagt atgaatatgg aaggaatgcg ttgaatgctg cgatgagata 4860 tgtgatgtat gcgtgtttga atgtatgtag gacagtctcc acggaggcaa tgcaggtctt 4920 aactggatgg cttccgtggg atttggagtg cctaaagaga gcaaatgtgt acaaggtgag 4980 aaagggatta agcatgaacg aaatggatgt ggtaaagaat gaggaagttg agagtatgag 5040 cgtgattaaa ttagaaagat tggtggatga acgagtatat gagatatggc aagaaagatg 5100 ggatgttagt atgaaaggac gtgtgactgc gagatttatt aaggatgtta aatacgcggg 5160 gaggagtagg agctttgagc ccgatcgatg ggtagtaaat atcctgaccg ggcacggaac 5220 cctaaatgcc ttcctgcaca agagaggttt gagtgagagc gcttcgtgta tgtgtggtga 5280 tgtaagtgag gactgggagc atgttctatg tagatgtagc atgtatgaat catttaggaa 5340 tttggatgaa atgggtgtga ctgtatgtgt aaatggtgag tatgaattta ctggtgtatt 5400 aagctcgaac gtgacgtatg gaaaaatgtg tgaatttata aatagagcgt atgaaatgag 5460 agagagtgtt agaagaagat tgaatttaga gcttgatgtg aatggatgag gtgtctctat 5520 atgaggggca cctaaaacgc agggaggtac gttctgtgcg tgcacggagc ggacctttgg 5580 gtttggcgca cccacgtacc cgagctggtt tgccggtgcc gaggtggcaa gacctctacc 5640 ggccctcata ccagaggcta aacggtacca cgggcggccg ggaggcccgc gggggggact 5700 acgtccgccc cgtcccggtc actgggtttg gactcgtggt ggcagtggtt acagcccata 5760 tcgctcaggg ctagaggtcg gggttgagtg aaaggctcct taggtgctct gcaccatcgg 5820 agtctggaac tcttctctgg cctcgacgtg tgagttgcgg tctcaactcg gggagcggcc 5880 tgcttagccg ttagggattg gatgggtccc ggcccccacc gagggttccc tgcggccttg 5940 ctagccaagg ggagaatcgg tagtcgcggc gaagctaagg gtgcggtttc gggtgaaatg 6000 cttgtgcatt cctccttaaa ccatgctcaa ggtggagccg cgacgcgttg gccgagcgta 6060 tctcggcccc gcgccccatg gggggccgtg tgggtgagca caagcaggaa ccgcacgtta 6120 aatcatctag ggcttctcaa 6140 // ID Shinagawa-1_AAe repbase; DNA; INV; 2255 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; Shinagawa-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2255 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 838-838 (2011). XX DR [2] (Consensus) XX CC ~92% identical to consensus. 8-bp TSDs. TIRs are ~130 bp long, CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. XX SQ Sequence 2255 BP; 699 A; 416 C; 405 G; 734 T; 1 other; gattgtgcga tatttgccag aaaaccattc gccagaaaac tattcgccag aatgacatct 60 gccagaaaac cattcgccag aatgtaccat ttaccagaaa accatttgcc agaatgtacc 120 attcgccaga aagtaccatt ccccagaatt atttttcttg tcgattttga tcaattaatg 180 catggtgaaa tatctcggaa tccaaagatg gccgtcacaa tattcgactt gtgccacttt 240 tcacatcttc aagagcacta aattgaacta atagtacggt aacaatctcg ttcttctcat 300 tttaagcttt ctgctagtgc acagagcgga gacaatataa cactgttatt ctacatctct 360 cgcgagattt ttgcctatga ggttttgata caatattcta agctgtttta cctagtttta 420 acttcgagaa aattgcattt cctttcaccg cgagctgttc ttcatcagaa tattgtattg 480 tacagaattt ctcgtacaac taccagataa tttttgcttt tcaattgaca gtttattggg 540 gttgtgctac tttctcccga atatcgatac cccgaatgtc gtttccctga ataccccgtt 600 ttctcaagtg gcctatttcc tcgaaaatgt tttggcactc ataattatca taacttattg 660 tatatttcag ggtgatgaac ggttcatatg atatgcaccc ttcttttttg atcagcggtt 720 ctttaaagtt tctccgaact cagcattttt tcttgttttc tttatcgttc tttctagttc 780 accactctgt ctctgaagat tgtttcttat ccattttgaa ctgcatctat tttggggccg 840 tccatgaacc acgtggtcgt tcaagagggg gtagaagggg gtggttctat cctaatacca 900 cgatccatac aaattttggg tcattcggga gaatggattg cggggaaaag tgtcattcgg 960 gtaactggcg ttcggggaaa tgatattcga gggaacaaca ttcgggtaaa aatagaccaa 1020 aaggttttta gtgataatgg tcacctaaat ttcagtacat gaaaaagtat tcaaaaaaga 1080 gtcattgcct atcgctactc gacatggttt ggggcaaata agaattctcc attctccata 1140 tataggatgt tcttattaat tttacaactg tcttatttgt ctgaaagctc ctgaaagttg 1200 ttatagtaaa tgtaactttg gtttgcgagt ggaggcatta attacttcat cgaacagacg 1260 gcacaaatag ttgcaacgta tatcgaaatc ttgtaaatat ccaatgaact agcccttaag 1320 aagcaacaaa actttgaaga acagcttatg atgaaaagaa gggagaacta cttgtgaaat 1380 attccaaacc ataattccct gtaaaactta ataatataaa aaaaaattaa ttgaaactga 1440 aaaattcaga aagaaaatcg tatggaattt actaattata aaatagcata atctcatgct 1500 ataacatctg tataacaata gttgcaacgt atattgaaat ctcgtaaata ttcaatgaat 1560 tagtccataa gaagcaacaa aactttgaag aacagcttat gctgaaaaga agggcgaatc 1620 tcttgtgaag ccttccaaac cataatacca ttttacaggg acaacgtata ttaaaaaaaa 1680 acttactatg ctatcagtca tgctgagtat taaatcttta catcagtaag cagtatcttt 1740 catacatagt ggagcagctt ttaatgaatg caactgttat tctattgtat gttctttcta 1800 gttccacggt cccccagkat taagtgctgc cttttcaatt catatttata aaacttcaaa 1860 tgttgatgtt gatcgttgaa tgctcgagaa atcgatattt tcataaggaa gtcatgtgta 1920 cctagtccaa caattagaat acggatattc gaagctggct cactaagtca acggtagatt 1980 gtgtccaaga gaccctgtac ttttttgagt atactaacca aaataatctc acacattttt 2040 taaggctcta caattgcaaa cttgatttac ctgaaaataa aataaaaaaa ttctagggga 2100 tggtaaattc tggggaatgg tacattctgg cgaatggttt tctggcgaat ggtactttct 2160 ggcaaatggt tttctggcat atggtacttt ctggcggatg tcattctggc gaatagtttt 2220 ctggcgaatg gttttctggc aaatatcgca caatc 2255 // ID Gypsy-154_AA-I repbase; DNA; INV; 5625 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-154_AA_; KW Gypsy-154_AA-LTR; Gypsy-154_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5625 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1033-1033 (2011). XX DR [2] (Consensus) XX CC Positions [2437-3051] - Reverse transcriptase CC Positions [4593-5060] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 906..2270 FT /product="Gypsy-154_AA-I_1p" FT /translation="MVFEGKQLSFEPFNDKVDTHDLRREWEEWHRAFELTL FT ELRNLESQHEKLVLLLASGGRGLQRIFYNLRPSSDEIYPGPVKVPLMPVEV FT PEYDNAIKRLNKFFIGKRNERIELEVFRSMKQTGDETFNQFLLRLRGQAAR FT CDFSDREEKEVLQQITMGAQDERVRDKGLEDVMDLDELINYAINREILMKQ FT KGKSKLLGDEQPGSSVAAVHQNWNKRQNYNDANQVKHRRNLEVENRQTVGK FT RDQSECGRCGSSNHKTESPTCYARNARCNGCGRLGHYMRKCRYLGRNVTRQ FT RAGSDTRYRRGRPNADLNAVNRYEEWTKETKPLTDEEPRPKVKTLIEPDVC FT SNGMVECSIDNYPVTFLVDSGSAINAVTEEVWNSLVRNNATVYKTKLQCNR FT TFTAYASQQPLRVRAMFEAWISVNDSKPSCYAEFFVIEGATKSLLSKTTAE FT LLKILKMWSR" FT CDS 3660..5597 FT /product="Gypsy-154_AA-I_2p" FT /translation="MMYLPSRYLGTVFGEEQTKAFDDLRCELANSARKLGY FT FDSADETELYVDASPVGLGAVLVQRYRGSPRIICFASKGLTAAERVYPQTQ FT REALAVVWAVEKFYLYLFGLRFTIFTDHKTLEFIYGNKHVEGKRALSRAEG FT WALRLQPFDFEVKHIAGLMNISDPLSRLCSQIDAPFDDNADHYLFAVGEGP FT TAISLEEIRDETKRDEILIAVKLALESGNWHPDLFRFQAFSKELGISNGII FT IRNERIVLPKKLRSRALDIAHRGHPGIVAMRRNLREKVWWPCMDRDVIEKI FT EECAGCAAVRSQMPPEPMFRKEMPDRAWQQIAIDFFSAKECATFLVVIDYY FT SRFLQVVEMKSTTATKTVDALEAIFLDHTYPECIRSDNGPPFSSEEFANYC FT SIKNIKLVRTIPYWPQMNGLVERVNQGILRALRIAKATNTDWRKALKEHVY FT IYNTTPHSVTEKAPMELLTGRPVKDLLPSLRTDPAWNRDESVRDKDAIRKM FT QGKIYADQRRHAKTSKIEVGDEVMIKNFNSGKLEPTFRLEKFKVVSKSGND FT TTVKGEDGVTLRRSVTHLRKWPKSPSASFQQTSSELTPASPSSLTSSNQGV FT DGTSLHGCSNEGKRKNGNAHGEQATPAKRPMRNKKVPKRYVQVINTVE" XX SQ Sequence 5625 BP; 1697 A; 1147 C; 1354 G; 1427 T; 0 other; ttttggcgca ccgtcggtag gatatgattc cgcaagctat aaaattataa ttacgattat 60 aataaaaaaa attgcagcag cgaaaaaaaa aacataacca aagaacaaat gtttaatgcg 120 agtccggttt tgtaccgaag cgcaccaata ccagtctgta tgctgtgaaa ttagcgagtc 180 cggtcttgca ccgaagcgca ctagcaccat ttgcaaatgt gatttaaaaa aagcgagtcc 240 ggctcagtac cgaagcgcac caatactgtt cagcgagtcc ggtttttgta ccgaagcgca 300 gagacatatg acaatagtat agtcagcgag tccggccttg taccgaagcg caccactaca 360 aatcagccca gcccatacac tgggagcaaa tatgagtccg gtacacacac ccaagcgcac 420 cattacaatc ttagctcacc agcttatctt tggcttgcaa aaaagagaaa aaaaaaaaca 480 ccagaaaaaa aaagttttga aattccatgt cttgatttgt ttttttttaa tgtgttttat 540 tacgtattat tctcaatgag ttttaaatat ctaacggtat tgacgtcctt cctgattgtt 600 aaatcatcgg catgttaacg atttcaattt aattcacgat tatttcaaac ttcctagacc 660 tcacgctgaa tattaaagta atgatatatg ttcataaact aggtcgatcg tttctttcgg 720 tgtacctatc gaactgtcat ctgtttacag aacgatgaac ataaacattg agtgtcggtt 780 ttgtttttat ctgctggtgt taagtttgct cataaccgtc gtctacgtga aactattgaa 840 gtaaattgca aagaacttaa gaaaatcaag tgctataatt acattcttat gaccaacatt 900 tacagatggt tttcgaagga aagcagttat cgtttgaacc tttcaatgat aaggtcgaca 960 cccacgattt aaggcgcgag tgggaggagt ggcacagagc tttcgagttg acactggagc 1020 tacggaatct cgagtcgcaa catgagaaac tcgttctgtt gctagctagc ggcggaagag 1080 gtctacagcg tatattttat aacctgaggc cgtcgtcgga tgaaatctac cccggacctg 1140 tgaaagtgcc attgatgcct gttgaagttc ccgagtacga taacgccata aagaggctaa 1200 ataagttctt tattggcaaa cggaatgaac ggattgagtt ggaagtgttc cgttcgatga 1260 aacagacagg tgatgaaact tttaatcaat tcctgttgag gcttcgtgga caagcagcga 1320 gatgcgattt ttccgaccga gaagaaaaag aggtacttca acaaatcacc atgggcgcac 1380 aggatgaaag agtgcgcgac aaggggcttg aggatgttat ggacctcgac gaattgatca 1440 attacgccat caatcgtgaa atcctcatga agcaaaaagg aaaatcgaag ctacttggag 1500 atgagcaacc gggatcatca gtagctgcag tgcatcagaa ctggaataag cggcaaaatt 1560 ataacgatgc gaaccaggtc aaacatcgta gaaatttgga ggtcgagaac cgacaaaccg 1620 ttggtaaacg ggatcagtcc gaatgcggcc gatgcggatc atcaaaccac aagaccgaat 1680 ctccaacctg ttacgcccgg aatgcacgtt gcaatgggtg cggaagactt gggcactaca 1740 tgaggaaatg tcggtacctt ggtcggaacg tgaccagaca acgcgctggc tccgatactc 1800 gatacagacg aggtcgcccg aatgctgatc tgaatgctgt taaccgttac gaagagtgga 1860 cgaaggaaac aaaaccgcta actgatgagg agccgagacc aaaggttaaa actctaattg 1920 aaccggatgt ttgtagtaat ggaatggtag agtgttcgat agacaattac ccggtgactt 1980 tcttagttga ttcgggatca gcaattaacg ctgtcacgga ggaggtatgg aacagtctcg 2040 ttagaaataa tgcaactgta tacaaaacaa aactacaatg caatcgaaca tttacggcat 2100 acgcgagcca acaaccgtta cgagtaagag ctatgttcga agcatggatt tcagttaatg 2160 attcgaaacc cagctgttac gcagagtttt ttgtgataga aggagcaacc aagtcactcc 2220 tgagcaaaac taccgccgaa ctattgaaaa tactcaaaat gtggtctcga tgactgaaca 2280 atgttgatgt gtcgaccgaa cattttccca aatttcccaa cgtacttgta taagctgtcc 2340 attgataagt cgattgttcc tcggaaaatt gcttacctta aagtacccga agcactgaaa 2400 gagaaagtgg atatgaagat tatggatatg ttgcgttctg acgtgattga accagccaac 2460 ggtccagcgg aatggatttc acctatggtc gttgtgccaa aaggcaagga cgacatccgg 2520 ctctgtataa atatgagaca ccccaacgaa gcgattcagc gcgagcattt tccattaccg 2580 attatcgaaa ctatttttaa ataaattgcg gggtgctacg tatttttcga agctcgatat 2640 aacgtcagca tatcatcacg tggagctgca tccagagtca cgaagcatca ctacgtttat 2700 gacggataga ggtctaatgc ggtttaaaag gctgatgttt ggcataaact gtgcccctga 2760 gatctttcag cggatcatga ctcagatgct tgcaggtacg tttttatgtt ccttatgctt 2820 ttaaggggtt gggttaaatt acatgatgta atacaatact ctgttttaac tcaggtgttg 2880 aaggggtaat cgtttacatt gacgacattg tcgtatttgg tagaaatcag caggagcatg 2940 atgaacgact gcgacaggtc ctggagattc ttgagaaaaa caacgctaaa ctcaacagag 3000 acaagtgtgt atttggcgtg ggtgagctcg aaatccttgg tttcaaggtg agtgctgcgg 3060 gtatcagtcc aacggaagag aaaattgaag ctatcaagaa cttcagaaaa cctgaaagca 3120 aggaggaagt tagaagtttt ctggggcttg taaacttcgt aggtcatttt atccctcatc 3180 tttcgactag aacagaaccc ttacgtcagt ttattcgtgg tgatgctact gtccaagatt 3240 gattgagggg aggcaggatc aggcagtcgg tgggtgcgaa gtgaggtcgg ttacagggga 3300 ggtgaaagcg acggctgctg gcgagtatgg tttcgtgtat cgcttcatgg gaacgttagg 3360 ggttaccaca ggacggaccc tactggtaag tgaaaatttt ctcccgcatt tctgggcgac 3420 gtttacctct tcgtgtgttt cagaggtaaa aattgcggtg ttttatgata aggtgaggaa 3480 aagctactag gatgctacat ttcttctatg actggtgaaa cgaaaagccg aacgaaacga 3540 gagggagtgt ggcaatgaaa aatgtttgga ggtacgtctt gagttagcac cctcgaacca 3600 aaacaaactc gtcagctgtc acctggtatt tcaaaatggc gaattcaaca tgctgacata 3660 tgatgtacct cccttccaga taccttggta ctgtctttgg cgaggagcaa acaaaggcat 3720 ttgatgacct gcgatgtgaa ctggctaaca gcgcaagaaa actaggctac tttgattcgg 3780 ctgatgagac cgaattatac gtagatgcgt ctcccgttgg cctcggggct gtgctggtac 3840 agcgatatcg aggttccccc agaatcattt gcttcgcgtc aaagggttta actgcggccg 3900 aacgtgtgta tcctcaaaca caacgtgaag cgctagcagt tgtatgggcc gtcgagaaat 3960 tttacttgta cctcttcggt ctacgcttca ccatattcac ggaccataaa acgcttgagt 4020 tcatttacgg aaacaagcac gtagaaggga aaagagctct ttcgagggcc gaaggctggg 4080 ctctgcgcct tcaaccgttt gattttgagg ttaaacacat tgcagggctg atgaacatat 4140 ctgatccgtt atccagattg tgctcccaaa tagatgctcc gtttgacgat aacgctgatc 4200 attacctatt cgccgtcgga gaaggaccca cggcaatatc gctcgaggaa attagagacg 4260 aaaccaaacg ggacgaaata ttgatcgcag tcaaactggc acttgaatct ggaaattggc 4320 accctgattt attccgcttc caggcatttt ctaaggagtt ggggatttcc aatggtataa 4380 ttattcggaa tgaaagaatc gtacttccga aaaaacttcg ttccagggct ctggatattg 4440 ctcatagggg ccatcccgga atagtagcca tgcgtagaaa tttgagggaa aaggtgtggt 4500 ggccgtgtat ggaccgtgac gttatcgaga aaattgagga gtgtgctggc tgcgcagctg 4560 tcagaagtca aatgccaccc gaaccgatgt tcagaaagga aatgccggat cgtgcttggc 4620 agcagatagc tatcgacttc ttttcggcga aagaatgtgc tacgttttta gtagtcattg 4680 attattatag cagatttttg caagtcgtcg aaatgaaaag tacaacggct actaaaacag 4740 ttgatgcatt ggaagcaatt ttcttggatc acacgtaccc ggagtgtata cgcagcgaca 4800 acggcccgcc attttcgagt gaagaatttg ctaattactg ttcgattaaa aacatcaagc 4860 tcgttcgcac aataccgtac tggccccaaa tgaacgggct cgtagaaagg gtgaatcagg 4920 gtattcttcg tgcactcagg atcgctaaag cgactaacac ggattggcga aaagcattga 4980 aggagcatgt gtacatctat aatacgactc ctcattcggt tactgagaaa gcaccgatgg 5040 aacttctaac aggacgtccg gtaaaagatc tactaccgtc cttgcgaacg gacccagctt 5100 ggaaccgcga cgaaagcgtg cgtgataaag atgccattag gaaaatgcag ggtaaaatct 5160 atgctgatca gcgtcgacat gccaaaacct ccaagataga agtaggagat gaagtgatga 5220 tcaaaaactt caatagcgga aaactggagc ctactttcag actggaaaaa tttaaagtcg 5280 tcagtaagtc tggaaacgac actaccgtta aaggcgaaga cggggtcacg ttgcgtcgct 5340 ctgttaccca tcttaggaaa tggcccaaat caccatctgc atcatttcaa caaacttcat 5400 cagaactaac gccggcctct ccatcatcgc tcacatcatc taatcaagga gtggacggaa 5460 catctttaca cggatgctca aatgaaggaa agcggaagaa tggtaatgca catggggaac 5520 aagccacgcc tgcaaaacgt cccatgagga acaaaaaagt gcctaaaaga tatgtacaag 5580 ttatcaatac agtcgaatga tttttttttt tgagtaggag aggga 5625 // ID TCRP1 repbase; DNA; INV; 204 BP. XX AC M21330; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.cruzi antigen DNA repetitive element. XX KW Repetitive element; TCRP1; tandem repeat. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-204 RA Ibanez F.C., Affranchino L.J., Macina A.R., Reyes B.M., RA Leguizamon S., Camargo E.M., Aslund L., Pettersson U. et al.; RT "Multiple Trypanosoma cruzi antigens containing tandemly repeated RT amino acid sequence motifs."; RL Mol. Biochem. Parasitol 30(1), 27-33 (1988). XX DR GenBank; M21330; Positions 1 204. XX SQ Sequence 204 BP; 46 A; 57 C; 76 G; 25 T; 0 other; agcatgaatg cccgcgcaca ggagctggcg cgcgagaaga agcttgccga ccgcgcgttc 60 cttgaccaga agccggaggg cgtgccgctg cgagagctgc cgctcgacga cgacagcgac 120 tttgttgcga tggagcagga gcgcagacag cagctcgaga aggacccgcg caggaacgcg 180 aaggagattg ctgcgcttga ggag 204 // ID DNA8-42_AP repbase; DNA; INV; 692 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-42_AP. XX NM DNA8-42_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-692 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1972-1972 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 692 BP; 209 A; 117 C; 125 G; 241 T; 0 other; cagggccgga ctgggcaggc gggataccgg gagtttcccg gtgggccgct tgtactcgta 60 gggacgtttt ttttttaggt ttatcggtta aggctcatcc tgtcgacaga catagacgca 120 ttttctaatg cgattgtatt agaaaactat ttagagttaa ggaaattcct caaaatagat 180 atgaaaaaat gtataaattg tcaaaaatag atgcttgtgt gttgtgtcat attattatta 240 tttaatgtct tcgttaatcg gtaatcccta ccttatatcg taacgcacgt ctgcaccgcc 300 taaaatgtgc gacaattatc atcacacaaa ctcccaccta ttccggtgcc gccgtagtgg 360 ctgaactaag gtataataat gtgctcggtg gacacaattt cgcaaaagag cctcaaacct 420 taacaaattg cttaatatgt attaatgtgt attctggact ctagagtcta aacatatgta 480 ttttaaataa aaatgataac atttctttgt actaaccaat ttttaaattt ttaaaatttt 540 taataataat attaagtgtt caaattctca aaataatata caatcatttt atatgaaaat 600 atatgttttt tagggttgtt tttataggta tacctaagtg ggccgatttt ttttattttc 660 ccggtagaaa tttttgaccc agtccggccc tg 692 // ID Copia-11_DPu-LTR repbase; DNA; INV; 319 BP. XX AC scaffold_233; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_DPu_; KW Copia-11_DPu-LTR; Copia-11_DPu-I. XX NM Copia-11_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-319 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 686-686 (2010). XX DR Genome; scaffold_233; Positions 12042 11724. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 319 BP; 85 A; 57 C; 73 G; 104 T; 0 other; tgttgaggtt acattccaca accaactatt tgcgcttggg ttgctgaaac cgaggtggag 60 tttcaggcct gaggtgatcg tcgtctgtat gacgttgact cggaagaaag aaactcagac 120 tacaagtatt gcgttccaga ttactgtgtg tgtgtaggac gagtaagcat cgtcctcgat 180 cttgctattt gctatagttt aatacaagtt aaactaacct ggactttatg taagcatatt 240 tcattcactt tattattgtg tgcagttttg tatgtgttac ctctagaaag agtacgtgag 300 ctgatacaga acttcaaca 319 // ID Copia-6_DWil-I repbase; DNA; INV; 4258 BP. XX AC scaffold_181155; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_DWil_; KW Copia-6_DWil-LTR; Copia-6_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4258 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181155; Positions 2209922 2214179. XX CC Positions [1588-2115] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 265..3453 FT /product="Copia-6_DWil-I_1p" FT /translation="MSVASVRIEPLGKENFDTWKIQMEALLIKNDTWKYVN FT NTCVKTEQNAAQWVKEDAKAKSDLILAMATTELKQIKNCETSNEVWKKLHS FT IYQSTGPARKATLLKSLILLKLGPGADMRDHVDKFFDIVDKINETEVININ FT DLISILLLYSISEEYETFRIAIESQDNLITPEALKVKLLDEYEARNRNKGE FT SSPGALMINKRFNNGAPGSSKAGGFKYKCYNCGKVGHKAAKCPKKFPKNSG FT ENSESYSSMYIQEANLTRTSLKGWCLDPGATSHMCSEKERFNKTYAEKSQE FT LKLANGESTAINGIGSVTFKPNCKFSANLLNTLYVPDIRENLLSIAKICDH FT GFNVLFKKNFAEIIQVSNGQVVFVAQRKHDLYHVEEMGEESHLAVGNKTSI FT REWHERFGHLNEKDLKDVIQKGRVHGACVNINESLPICEACIKGKQSRKPF FT PASDSKSKELLELIHSDVCGPMRVSSHGGSRYFVTFIDDFSKWCELYTIKS FT KSEVFNKFKEFKVYVELKTGKKIKRFRSDNGTEYTNNEFNKYLKAAGIKHE FT HSVEYTPQQNGTAERKNRTLVEMARCMMIQAVLPPSYWADAINTANYLRNR FT CPSKSLRGEVPFTLWNHRIASIKHLRTFGQIVHSLDKRAKSKFSPRSKRCI FT FIGYCNDAKAYRLYDPEAKMVLRSRDVIFTNIFGNDHEFEEFSELSAVNAE FT YQEEHSQEDDFQEERVVPEENTPEENADIEENIQEEPVLQRRGPGRPRKKH FT TGKPGRPKLIYNEVQIQSTIGKEPELESDEEDEYYEASSLISIGDESSTRA FT ALSGPNAIEWKNALISEYKALQKNKTWETVKRERNQKVIETKWVLRTKYKP FT NGEVDSRKARLVAKGFTQREGIDYNETYSPVARIGSIRTLIAIAVELDLDL FT HQFDFNSAYLNGDIEEELYIKVPPEFTEILTEKEKQRYTDDKVCKLKKALY FT GLKQSGRQWYKKLDYKLKQMNLKPLAADPCVYMKKQGNGIIIVSLYVDDLI FT VATNNNKLLTQLKAELTRNFEMKDLGKLSYCLGIEFLQNKNKGECSLGQSR FT YVKDIL" XX SQ Sequence 4258 BP; 1534 A; 775 C; 937 G; 1012 T; 0 other; ggttatgggc ccagcaacct tccactgcgc caataattcg atggattcag aatagtgaaa 60 ataaaataaa aagtgtaaaa gttgtggcga aaaagttcta gaaagatttt tgtgtctcgc 120 ctgtgtggcg ccgaactgcc gtttttttta cgttgcttgt gtagcttcgc ggtcggcttt 180 tgcttgtgta gctaagtcaa gaaaaaacgt gcgagagaga acacaataga aaacagttaa 240 ataaagagaa acagaacata aaaaatgagt gtcgcaagtg tacgtataga gccgttaggc 300 aaagaaaatt ttgatacttg gaaaattcaa atggaagcac ttttgatcaa aaatgatacg 360 tggaaatatg tgaacaatac atgtgtaaag acagaacaaa atgctgcaca gtgggtgaaa 420 gaagatgcta aggcaaaatc tgacttaatt ctggccatgg cgacaacaga attaaaacaa 480 ataaaaaact gtgaaacttc aaacgaagta tggaaaaagt tgcatagcat ctaccagtcc 540 acgggtcccg cacgcaaagc caccctactg aagtctttga ttttgctgaa attaggcccc 600 ggcgccgaca tgcgagatca cgtcgataag ttttttgaca ttgtagacaa aataaatgaa 660 actgaagtaa taaacataaa tgatttaatc tcgatcttgc tactgtacag tatatcagaa 720 gaatacgaaa catttcgtat agcaatcgag tcgcaggaca atttaattac acctgaagca 780 ctaaaggtaa aattattgga tgaatacgaa gcgcgaaatc gaaacaaagg agaatcatca 840 ccaggggcac tgatgataaa taagagattc aataatggtg caccaggatc gtccaaggca 900 ggtggtttta aatacaaatg ttataattgt ggaaaggtgg ggcacaaggc ggcgaaatgt 960 cccaagaaat ttccgaaaaa ctccggagaa aacagtgagt catattcaag tatgtacata 1020 caagaagcaa atcttacacg cacaagccta aagggatggt gcctggaccc tggagcaacg 1080 tcacatatgt gttcagagaa ggaaaggttt aataaaacgt atgcagaaaa atcgcaagag 1140 ttaaaactag caaacggcga aagtacagca attaatggaa tcgggtcggt aacttttaaa 1200 ccaaactgta agttttctgc aaaccttttg aacacactgt atgtaccaga tattagagaa 1260 aatctactgt caatagcgaa gatatgtgac cacggtttta atgtgttatt taagaagaat 1320 tttgcagaaa taatacaggt atccaatggg caagtagttt ttgtcgcaca gcgaaaacat 1380 gacctatacc atgtcgagga aatgggtgaa gaaagccatt tagcagtcgg aaacaaaaca 1440 agtatcaggg agtggcatga acgttttggt cacttaaacg aaaaggactt aaaagatgtc 1500 atccaaaaag gaagagtaca tggtgcatgc gtaaatataa atgagagtct accaatctgc 1560 gaagcatgca ttaaggggaa gcaatcacga aaaccgtttc ctgcatccga ctccaaatca 1620 aaagaactac tagaactaat ccatagcgat gtatgtggac caatgcgagt aagtagtcat 1680 ggaggtagtc gatactttgt tacatttata gatgattttt caaaatggtg cgaattgtat 1740 actatcaaaa gtaaatcaga agtttttaac aaattcaagg agtttaaggt atacgtcgaa 1800 ctcaaaactg gaaaaaagat taaacgtttt agatcagaca atggaaccga gtataccaat 1860 aatgagttta ataagtattt aaaagccgca ggtataaagc acgagcattc agtggaatac 1920 accccacaac aaaatggaac ggctgaaagg aagaatcgaa ctcttgtgga aatggctcgc 1980 tgcatgatga tccaagccgt actaccacct agctactggg ctgacgcaat aaacactgca 2040 aattacttac ggaatagatg cccatccaaa agtctacggg gagaagtccc ttttacatta 2100 tggaatcatc gtattgcatc tatcaaacat ttgcgcacat ttggtcaaat tgtacactcg 2160 cttgacaaac gtgcaaagag taaattcagt ccaagatcta aacgttgtat atttataggt 2220 tactgtaatg atgccaaagc ctacagacta tatgatcccg aagccaaaat ggtattaaga 2280 agtcgtgacg tcatattcac aaatattttt ggcaacgatc atgaatttga agaattctca 2340 gagctgagtg cagtcaatgc ggaatatcaa gaggaacata gtcaggaaga tgattttcaa 2400 gaagagcgtg tcgttcccga ggaaaatacc ccagaagaga atgccgatat tgaagaaaat 2460 atccaagaag aaccagtact gcaaagacgt ggacctggaa gacccagaaa aaagcataca 2520 ggaaaacctg gtcgtcccaa gctcatctat aatgaagtcc aaatccagtc aaccatcgga 2580 aaagagcctg aactggaatc tgatgaggaa gacgaatatt atgaggcatc gtcactaata 2640 agcatcggag atgaatcgtc aacgcgcgca gctctaagtg gtccaaatgc aattgagtgg 2700 aaaaatgcct tgataagtga gtataaagcg ctacaaaaga ataagacttg ggaaacagta 2760 aagcgagaga ggaatcagaa agtcatagaa accaagtggg ttctccgcac aaagtataaa 2820 ccaaacggcg aagtggatag ccgcaaggct cgtctagtag caaaaggttt cactcaaaga 2880 gagggaattg attataatga aacttattcc cctgtagctc gtataggatc aatacgaaca 2940 ctgatagcaa tcgccgtgga actcgacctc gacctccatc aatttgattt taacagcgca 3000 tatctaaatg gagacataga ggaagaactg tatataaagg ttccaccaga attcaccgaa 3060 atacttacgg aaaaggaaaa gcagagatac accgacgata aagtatgtaa attaaagaaa 3120 gctttgtatg gtttaaagca gagtggacgc cagtggtata agaaactgga ttacaaattg 3180 aagcaaatga acttgaagcc acttgcagcc gatccttgtg tgtatatgaa aaaacaaggt 3240 aatgggatta ttatcgtttc actgtatgtc gatgatctca tagtcgcaac gaataataac 3300 aagctactaa cacagctgaa agccgaatta acacgtaatt ttgaaatgaa ggatctagga 3360 aaactgtctt attgcctggg tatcgaattt cttcaaaata agaacaaagg tgaatgttct 3420 ttgggtcaat caaggtatgt aaaggatata ctatagaaat tcaacatgga ggactgtaaa 3480 ccagtgagca cacctatgaa cgccagcgag aaaatttcaa atcaaatgtg cccaaagaca 3540 aaaccggacc tggatgaagt atcaggcgtc ccatatcaaa gcttgatagg tgctctaatg 3600 tggcttgcag tgtcgacgcg ccctgacatt gcacataccg taagtcttct tagccagtat 3660 aataattgtt acgggaaaca acactgggtt gcagccaaaa gggtacttcg ttatttgaag 3720 ggtaatcaag accatggact aatatataga aaaaccggaa aagatttggt tggctatgct 3780 gatgccgact gggctgctaa tattgatgat cgtagatctt ttacaggttt cgcatttaag 3840 atggcaaacg caccaataag ttgggaatcc cgtaagcagc gaactgtcgc tctttctagt 3900 acagaggcgg aatacatggc cctatctgag acgtctaaag aggctgtaca tctaagatct 3960 ttcttacaag agatttttgg ctcgctccgt tccaccataa tctacaacga caattaagga 4020 gctggccaac tcacacgaaa tcctctattc cacaagagga caaagcatgt cgacatccgt 4080 catcacttta taagggaact ggtggacaaa ggggaccttc gcattgatta catctcaaca 4140 agtgaaatgc ccgcagacgt tttaaccaaa ggactaggaa cagcaaagca tgagtcctgt 4200 gtgacactgt tgggagtaca aaaaatagag aaataaggcg ccatctgagg ggaagtat 4258 // ID CZAR repbase; DNA; INV; 7291 BP. XX AC M62862; XX DT 26-APR-2005 (Rel. 10.04, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 3) XX DE T. cruzi SL-RNA-associated non-LTR retrotransposon. XX KW CRE; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; CZAR; SLACS_TC. XX NM SLACS_TC. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-7291 RA Villanueva M.S., Williams S.P., Beard C.B., Richards F.F. RA and Aksoy S.; RT "A new member of a family of site-specific retrotransposons is RT present in the spliced leader RNA genes of Trypanosoma cruzi."; RL Mol Cell Biol 11(12), 6139-6148 (1991). XX DR Genbank; M62862; Positions 1 7291. XX CC Putative endonuclease domain; reverse transcriptase in ORF 2. CC Reported 22 bp target site duplication. XX SQ Sequence 7291 BP; 1940 A; 1931 C; 2046 G; 1374 T; 0 other; aactaacgct attattgata cagtttctgt acgacattga tacaggttct gtgttcaaac 60 cctgttcttg ttaaaacttt tcccattttt gtcttagtca tttttcggat atattgtccg 120 acgaggttgc gcctcgtcgg aaaaaaagta gttcctagga gagtgctgct ccgacgcggt 180 tcgaacccta cttgtgcaaa aaattttttt ttttgtctta gtcatttttt caaaaaaaaa 240 aaatttttca aaaaattttt tcccttgact agggttattt ggacgcgagg gcaaaaccta 300 cttggcaaaa gaattgtttt tgactttatc atcgttttca aaaaataaaa aataaaaata 360 tataaaataa taacatatct atatatatag ttatttgttt tatttttgac actgtatcac 420 cctacttgat aacaactatt gtggttggac tctaaatacg taatataaaa ggacacaaaa 480 gctcctttta ctcagctgag atcggcgtac gtgtttggtt tgtgataata caactattgt 540 aaataagaac aaaaaagaaa aaggaaaaac aaaatcaaca gaagcaaaat tagcagaact 600 aaaatacagt tcattaggca caaagagatc ctcatagctg taaaacggat ccatttggta 660 ggaaattaat acttgtcgtg gtctttgaat actcttctaa taaattttag gtaaaagaac 720 taccctcttg tcgtagttca ccacaaaaaa gaagaaacag gtattgacgc gtattgtttg 780 tcagagatcg atattatttt gcaacaattt gagagcgagt gagtaacagg agaagtaccc 840 cttctggaaa aataaacatt tttgagggtc tatctgggac caggtttctc gtatttttaa 900 ggaaagaaat atcattgcgg atattaaacg atattggaca gagagggcat cgatactcat 960 cggttgggga gatcgatatt atttggtaaa cagttcagat cgagtgagta acaggagaag 1020 taccccttct ggaaaataac catttttgag ggtctatctg tgaccaggtt tctcgtattt 1080 ttacggaaag aaaatattat tgcggatatt aaaagatatt ggacagagag gatattgacg 1140 attattggtt gttggagatc gttgttaata ctcagcgaag aatcgaagaa gtaataaaga 1200 agagaattga cgactgcgtc atttggcaga acacggatag tggacagcgt gttggatcgg 1260 gagtgacggg tctttgggtt cgatccattg acacggctcg ttatacgttg ttacacggac 1320 aattactaca ttcaggatac taattatcac ggctgggcaa accaggactc ggcagataca 1380 tctttacgta ggtaagtatt atggcctact ccaatcgagc atgccaatgc cttttgactt 1440 tcgcaacggg aggacggaca gtgactggac gataggtggg aagcggcagc aaccaccacg 1500 gtatatgaca cggtcagcgg ctgcgtcaag aactccgttt atccaccgag acagcgtgga 1560 tccatctcag cttcaggata tcattacggc ggcaacgaca gaggctatca aacggctcac 1620 gtcccctact aatgggcgat acacggataa caatgggaat cgcagctgga ggtcaaggag 1680 tgggcggccg aacagaagag acgcagagtt actctggcga tctcaaccac cccagggagg 1740 gccgcaaatg atgcaaggag gaaaccgttc ctggcagtac agagaccaac agcccagacg 1800 gtacagatca caatcaagct tggaaagaca taacacgagc cacaggcgcc gcgaaggact 1860 caacaggagg ttcgttcggg gtcccagggg gtccgacgag gggcctcaca cgaacagcgg 1920 ggaccacagt caagcaccgc aaccccatcc tcccgggaag aagggaggtc gcgcaggggt 1980 tgccacacgc aggttttcgg ggacaactaa ccagaccaaa aaacccgcat cgtgcaatag 2040 cagcgctagc ggagctcgag ctgcagccta ccaggaaaag caccagcggt acataggagc 2100 cactaagact ctagtgccat ggtggacggc cttgtaccaa agtggagcac aaaattacca 2160 ctgcccactg tgctcgttta accgacctgg ggagcacgac gtgttttacc actgtcgcca 2220 ggctcaccca acatacgaca agtgctaccc gtaccgacta catctaaacg ggcactgcac 2280 cttccccgtc gagaaatcct gctcgctcaa tgcagtgctg gcggtcctgt ctcattacga 2340 ggacgagagc cagttgtcgc aggaagtacg aagtacctac acggtgccat gcagagagaa 2400 tacggaacgg atcatacggg caatggacgt gaacctgcgc tggcaccggt atcggctttg 2460 gagctcctca ttaaaaacga ttcgaagtta cggtgtatgt tttccacaac ggcggtggcc 2520 ggaatccact gcgaggcatg cgggtggacc ctgccaatgg cggatgcgta tccgtcctac 2580 tcggaaacac tgcaggacga acccgccgtc atcaccttgc agccgggaaa gaaggcaccc 2640 atacaactca cacagaagtt gttgatgcaa caataccggt ccacgtgggt aacggaagaa 2700 cacgtgtgca ccggcgagaa gcgggaacgc atcacactcc atggcacatg acagccacca 2760 aatcacggaa attcgaagac gcggtggcac tggagttcgg ccactgggac aagcaagcga 2820 tgggggtgga ggacatccca tttgtgatcc tcctaccgca tcaaaacggc acggcggaaa 2880 acggactggt ggcctagtgg ccaccaacgc cccaaaccat gtcgtcgcct acataccaag 2940 cgctctacgt aaggacacgg agtgggtcat gatcgatggt atggtccaaa aggtgcagcg 3000 aaagcccctc aacacccaca aggtaatcct ctgcctctac cggcgcctta tgccggagac 3060 gggaccggag gaagatgagg aggacgatga gcaagaaccg caggcgcgta acagggagca 3120 ggacggccgc acgctagcca ctgcgcccaa gggctcacgc cgcccccaat cacaaaaatg 3180 ggaagaggcc ccagacgact ccagtatggt ggacgggttt ggtaagaatg gatatgggca 3240 accgccgggc agtgcaactc attccagtcc gccagcctcg tcgtcacctt ccacccagac 3300 gatcacacag gaccccttcc tcactcctat cgccccacga aagcgaggca ggcgggggga 3360 ggaccaggag gaggaatccg aggcagacgg caacaaaggc ggggggattg tctgcgaaga 3420 ggacgaggag ggactcttac aaccgtcagt tcccccaaca tcgtcccaac caaaccagca 3480 ggcaccacaa gttcacttcc tagatgagga ggaagaagaa ataacactga ggcggacaca 3540 acctcaagag acaccccaca ctaataagga tgacacccca ccacaccaac aatcctcact 3600 gccccaggaa gaagtggaga tggaggagga aggtgtggga ggcgacgagg gggactcaga 3660 aacaccgaga gacactacgg gccaggaaga cggtggacac ccattttcac acgacaagac 3720 aacccggttg ctggatcccg tgtggtgtct ggcaggtggg tgccaccaca agttctcggg 3780 accgcgtcgt ggggaacacc tacgatcgca tattcatgca gtacaccgta agcaggaacg 3840 gatggacatc actaatgaag cgctcatctc ccaaggactg gttcggtgcg acgcatgcgg 3900 ggaagtttgc tcggccagcc tgcgagcacg ggcagcacat cgcccacgct gtggccagta 3960 cacatgtcgt aaggaaaaca tggcgacaca gcgggaggag taccgtgcgt cggttaccgg 4020 atcacactat accaagacgg cggcgtttct ggaacggaca cctgccgtgg aatggccgac 4080 gacaccggcc acagacccgc ggcaggaccc gtggctgcag gaaagagtgc cgacgagacg 4140 ttacctccac aagcgtgagt ggcccaactg gctggatgta tgccgcacag tcatgttggg 4200 atacaacgcc tcggcgccgg aggaacgcag ccgcaaacaa gtggccatca tggacctggt 4260 gcgacaacac ctacgactgc cggagaaccc gcgtagtcgg cgccaagcca cccgcaccaa 4320 ccatgacaag cagcatacaa ccgatcctcc acccgaccac tgcaacacca cgacagtggt 4380 gcggggtgct atggagacca cgaaagacgg ggaggagtcg gtcagcgatg agactgagca 4440 acaggacaca acagcacacc agccagagcg catactcacc gcgtccgaca tatacaagac 4500 gcgacgcgta gagacattgt gcacactgca ggcaacggga cgcgcagcgc tcctcacggc 4560 ggcagaggcg gagccggtgg tcttctcccc cgagctggta caaagcctgg acgacctcta 4620 cccacaggag gacacaagcc tgtaccccga gccggccgtc agtgcacccc tggtcacgtt 4680 cgacagcaag gagctggcca agataattgg gagtcgcctt acgcggggcg cagcacccgg 4740 tcttgacggc tggacacggg agctgctgta ccccctcacg aaggacaagg cgctcctcat 4800 ggagatcacc gccattttga cggacatggc caacggaaac gtggccccgg aggtggcaca 4860 ccgcctcagg gcaaccaatc tcacggtgct ccggaagcca aacaagaagt tcccgccgat 4920 cggcgccgag tgtgtatggg caaaagccat atcactcatg gcggtggacg cggtcatgcc 4980 agccctcaaa acctgcttta agaacctgca gtatggggtc ggcaacaaca tcgaattggc 5040 gatccagaag attcggcggg acttccacct caagggcagt gtggccatgc tggacggccg 5100 aaacgcgtac aacgccatca gccgcacggc catcctgtcc gccgtgtacg ggaacaccgc 5160 ctggagcccg ctgtggcggg tcacacgact gctcctgggc acggaggggc tggtgggctt 5220 ctacgaaaag ggccaactgg tccactcgtg gaagtcgacc cgtggagtgc gccaaggcat 5280 ggtgctggga ccggtcctat tctccatcgg caccatcgcc accctccgcc aactggaaag 5340 cagcttttcc aacgccagct tcacggcgta cctggacgac gtgacggtgg cggcaccacc 5400 gggcatgctg gggaaggtgt gcgaggcgac ctcccgggcg atgcgtgccc tgggcattga 5460 gacaaacgag gacaagacgg aggtcctcaa caaaggaggg cccgtggaca tgcccacgga 5520 gtacattcgg ccgtttgccc gcgtgctcgg tgcgggagta gcaaacgacc cagagagcga 5580 gctgattacg cagtttgtgc aacgcaaggc ggaggaaacc gaccgcctgt tccgagccat 5640 tgtggagctg ccattcgcaa agcacacgca ggtgcggctc ctctcggtgt cggcgctgcc 5700 acgcgtgacg ttcctgctac ggacgcatgc ccccgcacac acgcgggcag cggcgagtgg 5760 ttcgacgacc gcgtcaccgg cgtcctgggc gtcatcatgg acggccccgt cacgaagcgc 5820 gcacgcacat tgcggccatc ccggtgcgcc gggagggtgc ggtctccgac ggcagaggga 5880 gattgcggag tttgcctacg cgtgtcttgg cgagaagggg aagcaacgcg ccatgaccga 5940 cgagttggat gcaaagcacc agagcgacct ttacgaaacc ctgcagggtc ctgatcgcaa 6000 ggtgttcgtg tccaacacgg cggccggcgc tggcagaccc ctcacggacc cgcaggtgca 6060 tgcggacgac agaggtttct ccacctacct acgggaacga ctactgatgc gcgtgctgcc 6120 ggagggacag aagtgcgtat gtggggcaga cgcatccaac gagcacgtgc atacgtgcac 6180 gcgactccaa caaaacccgc ggaccacacg ccacgacatg atcaacatga cgttcgcaaa 6240 tgggttgcgg ctgtgcggat tccaatgcgg catggaaccg cgcctgacgg aggcaagccg 6300 acggcgaccg gacatcctca tcgtcggact cgacacgtac gcgatcaccg acgtgaccgt 6360 cacgtacgcc gggcgggtca ctgcctatgt gtcggaggag tccatggaag aggcagaccc 6420 actacgcgcg gcacgggatc gcctcacgca gaaacgacaa aagtaccgtc actgggccct 6480 ggccaacggg ctggactttg agccattcgt catgctgacc aacggggcaa tccacccggc 6540 aagtcggcgg tggctgcggc ggatcctggg caaccaggac caccgactca ccatcacgaa 6600 tgcgtacgat atgatagtgg cggacaccct ggcggccatg ctgcgtggaa acgtgcacgt 6660 cttcaacgca gcgtgcgccg cacgggccgg gtaataccgc cccgggtagg cgagtgcccg 6720 ggcgacccct cgggtagcag agaggcaatg aacacagagg tcaaggaaac tacggagacg 6780 gaaaatcaca agtgaatact cgccaacaca gaaagcagca ggaaaacatc cacggaacgg 6840 caagcaatgc agaggcaata tcaaacgttc ggcatacgcg gaaggtcgtt ttttgtttgt 6900 tattcgagtc accgtaccaa agccaaagga ttcaagaggg gcttcttctc tatattgtgg 6960 aatatgacgc ccaagagggt actggaagac tgcttttttc ttgcttctta ttccctcccg 7020 cccgggttgg ttcggatctt tactttgttt ttatttcttt ggcgggtagg gcatgttttg 7080 tactttgggt atgggttttt aactattctt tactttttct ggtggggtgt agaggggtgt 7140 tggtgtggaa atggaggcca tgtaggtggt cacagaatcg gtgcgatcag gtcggggcac 7200 cattcacggt cgcggtggcc aactttaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 7260 aaaaaaaaaa ttattgatac agtttctgta c 7291 // ID BEL-16_DPu-LTR repbase; DNA; INV; 1201 BP. XX AC ACJG01007345; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_DPu_; KW BEL-16_DPu-I; BEL-16_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-1201 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01007345; Positions 2214 1014. XX SQ Sequence 1201 BP; 344 A; 292 C; 288 G; 277 T; 0 other; tgtgcccgac ctttcactta aaatgggccg tcaaaccagg acattatgcg gcggaacatt 60 agctactttc ctatataggg aaggcacccc ttttataaaa aattcttaga gtgtggactt 120 agaaatattt cgacgactct atcgaaatac taccccggag caagagagca aaccggggta 180 gcaaatgact atcaactcgg taagatagtc ggagtagtag agagtctggt gtagtacatc 240 cagcaaagct ggatggccag actagagtaa aatagccgga gagatagaga gcacaccggc 300 ggatggcatc cgtagtccca ggatgaccgg aagattgcag cgtagaagaa gaagcgcgtg 360 cggattttat ccagtgggac ttagaagaaa taagagataa tggacttaga agaaagccat 420 tactctagaa acccagaggc tacccatacc aagagggcca gccgcgaaga agataaagag 480 gcgagactcg cgcctcctat agtccgcatt aattaaatgt tataaattga caatgttgta 540 taaccatgcc tccctttgta tgtaatgtca ggggcaaatt gtggactaat aaatcactca 600 ttcacaagca ctcaaacagt aagcgtctca tttatttgat tctcttccta gggtcttaga 660 gtcatcccct cggcctcgaa gcgccgttgt attaaacact gttcaggttg ccagcttcag 720 tcgagaagcc agcctttgta aaacttgtaa aagttgggac ttagaatcgc tctcgattac 780 gggcatccta agtgtcggga ctcggacaca tccgaaggac tgactggaaa gagcacacgc 840 attgccgtgt cttcgagtag cacggcgctc gctctctttc tcgggctcct agtacgaccg 900 accgtcgaac ggacccatcc cgggactaac ccacctctac tcccggccaa cttgcagtcg 960 actctgcgag cagttggatc ccactcgtct attctgatta gcgcccgaag tcactaatca 1020 gtctgattta tctttgcagg cctgtacgag cctattcata ttcctcagcc gaagcagaga 1080 atacttctcc cgaaagagat tacaagtaag taccgattga gcgatcgata gcttgaggtc 1140 cagccgaaca aactgggaga aatctcagac gaggtgtcgc cgagctcagc ggcatttcac 1200 a 1201 // ID CR1-16_CQ repbase; DNA; INV; 5651 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-16_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5651 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 20-20 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 376..1413 FT /product="CR1-16_CQ_1p" FT /translation="MSATSTSNISKNTILCHACAEDVLDEYIPCQGFCNAV FT FHPNCSGVKPALLKEIASHRQIFWVCKSCTNLMVDLRHRRSIQCAYEAGQE FT LSLSHHNRIVEQLKLELLTVLKAELSANFAKLVASNSLTPRSSSLNKGGFR FT GSGSRRLFDNNPAPPLGPNPIPPKPVTGKPGGPADVPERIVAKGFGAPAGD FT DSPRFWLYLSRVSRNVTEEQITKLAIDRLGVSEIKAKRLVAKGKDVSRMRF FT ISFKVGMNFDLKDKALESSTWPDGLLVREFEERSGEIFWEPTNAVNFQQNP FT TPPVPPQDPRNTPTPKSTPVSMPLESKEMMTNPKDTEPTSPGSQGCSPMDL FT WTK" FT CDS 1485..5537 FT /product="CR1-16_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MMSSFSPGRAVESLVRVPRLPDATTSSPRYHLRTPAS FT RSHQSRPGAGVGVGGGVSQPAPFGKYPSRLQHFLPDVRPTSSPCTITTAAQ FT TAIVLQLSARAIQSLPGRTVESLVRAPRLPDATTSTLRHHMRTPASRSHQS FT RPGAGVGVGGGVSQPAPFGKYPSRLQHFLPDVRPTSSPCTITTAAQTAIVS FT QLSARAIQSLPGRTVESLVRAPRLPDATTSTLRHHMRTPASRSHQSRPGAG FT VGAGGGVSQPAPFGKYQLSSFHTRPDARSAFSATVPSALQTSPAAPGHRNA FT TSVAVRSVRLVSPCPASSITANHHTDEPRTSGRTHLGSEECPRPPYTVESL FT PTSSPGVSSSQQRRPDPVYGQGSGGFRLQPSGEYNEANDSGPALNRRHFQQ FT PNGPLSFITVYYQNAGGIRTKTKQFYLALASSDYDVIALSETWLQDDIVDA FT ELSSNYHIFRQDRSALTSDRRRGGGVLVAVKKSHGVTSTRVLSRGYEHLEQ FT VAVRVKVRNHIVYVCCIYIRPNGPPEVYASHGTAVQELLDLSSHDDSIIVT FT GDYNLPHLSWTFDDDVNGFIPLNASSEQELALTENVVATGLLQICSLMNAN FT GRILDLAFVNDAHSVELIEPPSSILRTDRHHKPFVLRVDLLDVPDEATGNY FT EMEPDFRRCDFDLAAEALSIIDWDAILLGRDTNATTTILYDVLYEIVRQFV FT PVRRIPCNRTEKLPWWNADLRNRRNILRKARKKLFRASTPENRATVEHLEA FT EYESLQDSSFREYLNRVQVDLKENPSSFWRYVRSRKRSSVLPARISFNNTT FT AENPADAANIFADFFSNVYETTAPAASADYLNTLPTHDLQYSQPEFSQVDV FT QTALDVVDPSKGAGPDRLPPAFIKRLSAQLAKPVSIIFNRSLSEGVFPDEW FT KLAAITPIHKSGSTMAADNYRPISILSCLPKVFEVLIHEGMYSAAQVVISE FT FQHGFVKKRSTVSNLMAYVSALNENLEKFKQTDAIYFDFAKAFDKVPHTLI FT IAKLDRLGFPRWLTAWIRSYLSGRFGYVRLGGAQSRKFPIPSGVPQGSHLG FT PLIFILFVNDICQQLQSQKLLYADDLKIFRTVTSAIDCVALQQDIDKILEW FT CSLNGMKVNPSKCKCISFTRSSAPIRYNYSFDRHELDRVSSIRDLGLLIDR FT KLNFSEHVSATAAKAFAMLGFLRRNTAEFENINALKTLYITMVRSILEYAV FT QVWAPHHANQRDRLEKVQRRFTLFALRRLPWRNGVWWSSYSDRCALLRMQS FT LEQRRVLLQRMFVFDVLSDRIDSQQIRDEINVYRPVRQLRNLPMLRIPVHR FT TLYGHNRPIDRCCRFFNEVSNEFSPGLPRKNFKLKIINL" XX SQ Sequence 5651 BP; 1327 A; 1609 C; 1338 G; 1377 T; 0 other; tttttttttt tcctccgtgt aatttacagc agtttgttta ctctttctca gtgctattat 60 attttgtcgg ttttagtgcg caacacaatt gttttcgtcg cgaaacctcg gccgatccac 120 ttatcggacg cctatattga cggaactatt ggagtgtacc tgattatccg cgagttaatt 180 tcgggaaagt gattaagagt ttagttctgt gtacaaagtg ccattggaga cacacagctt 240 ctctacgtcc cccccaacac gcagttgttt ttaatcagcg ccatctattg gtgaatagcg 300 gagttacagc tatctctgtc gtcccggttg cttacttgtt ggcgattttt cggattgtgg 360 agcaacatct tcgtaatgtc tgcaacatcg actagcaaca tctccaagaa caccatcctg 420 tgccatgcgt gcgccgagga tgttctggac gagtacattc cctgtcaggg gttctgtaac 480 gcggtgttcc atccgaattg cagcggcgta aaaccagcgt tactgaagga gattgcatca 540 caccggcaaa ttttttgggt ttgcaaatca tgcactaacc ttatggtgga cctgcgtcat 600 cgtcgttcga tccaatgcgc atatgaagca ggccaggagt tgtcactgag ccaccataac 660 cgcatcgtcg agcagctcaa attggagctc ttaaccgtgc tgaaagctga gctgagtgca 720 aacttcgcca aacttgttgc gtcaaactca ctgactccca ggtcttcaag tctgaataaa 780 ggtggttttc gcggttctgg aagtcgccga ctatttgaca acaatcctgc tccacccctt 840 ggaccgaacc ctatcccacc caagcctgtc accggcaaac ccggagggcc tgccgatgta 900 ccggaaagga tagtggcgaa agggtttggg gcacctgctg gtgatgattc acctcgcttc 960 tggctttacc tttcccgcgt ctcccggaac gtgactgagg agcaaattac caagctcgct 1020 atcgatcgct tgggagttag cgaaatcaaa gcgaagaggt tggttgccaa aggcaaagat 1080 gtcagcagaa tgagattcat ctctttcaaa gttggtatga acttcgattt gaaggacaaa 1140 gccctcgaat cctccacgtg gcccgacggt ctcctggtcc gtgaattcga agagcgttct 1200 ggtgaaattt tttgggagcc cacaaacgcc gtcaactttc agcagaatcc aacgccacct 1260 gtcccaccac aagatccaag gaatacgcca acgccgaagt ctacgccagt atccatgccg 1320 ctggagtcaa aggaaatgat gacaaatccg aaggatacag agccaacttc accggggagc 1380 cagggttgtt cacctatgga tctctggaca aaataacgcg cacgtccgcg ccgtttgccc 1440 gatcttccct acactcaacc atatcgccat ccacttgccc acgcatgatg tcatcgtttt 1500 caccaggacg cgccgtagaa agccttgtga gagtccctcg cctgcccgac gccaccacgt 1560 cttcgccccg ctaccacttg agaacccctg ctagccgcag ccatcagagc cgtcctggtg 1620 ctggtgtcgg tgtcggggga ggggtctctc aaccggcgcc cttcggcaag tatccttctc 1680 gattgcaaca tttcctgcct gatgtccgtc caacttccag cccatgtacc atcacaactg 1740 cggctcagac tgcaatcgtc ttacagctgt cagctcgagc catccagtct ttgccaggac 1800 gcaccgtaga aagccttgtg agagcccctc gcctgcccga cgccaccacg tctacgctcc 1860 gccaccacat gagaacccct gctagccgca gccatcagag ccgtcctggt gctggtgtcg 1920 gtgtcggggg aggggtctct caaccggcgc ccttcggcaa gtatccttct cgattgcaac 1980 atttcctgcc tgatgtccgt ccaacttcca gcccatgtac catcacaact gcggctcaga 2040 ctgcaatcgt ctcacagctg tcagctcgag ccatccagtc tttgccagga cgcaccgtag 2100 aaagccttgt gagagcccct cgcctgcccg acgccaccac gtctacgctc cgccaccaca 2160 tgagaacccc tgctagccgc agccatcaga gccgtcctgg tgctggtgtc ggtgccgggg 2220 gaggggtctc tcaaccggcg cccttcggca agtatcaact cagcagtttt cataccagac 2280 ctgatgctcg ttcagctttc agtgcaactg ttccgtcggc gctccaaact tctcctgctg 2340 caccgggtca ccgaaacgcc acctcagtcg ccgtgcggtc ggttcgtttg gtcagcccct 2400 gccccgcgtc ttcaatcact gctaatcacc acactgacga accaaggacg tcgggacgca 2460 cgcacctagg tagtgaggaa tgcccccgcc ccccgtacac agtcgagtca ttacctactt 2520 ccagtccggg tgttagttcg agccaacaga gacgtcccga ccctgtgtac ggacaaggca 2580 gcgggggctt ccgacttcaa ccctcaggcg agtacaatga agctaacgat agtggtcctg 2640 cactgaatcg tcgtcatttt cagcaaccca acggcccgct tagttttatc accgtctact 2700 accaaaatgc cggtggcatc cgcactaaaa cgaagcagtt ctacttagcc ttggccagca 2760 gcgattatga cgtcatcgcg ctttcggaaa cgtggttgca ggatgatatc gtggacgccg 2820 agctgtcgtc gaactaccac atatttcggc aagatcgcag cgccctgacc agcgatcgcc 2880 gcaggggagg cggcgtactc gttgcagtca agaagtcaca cggggtcacg agtacacgtg 2940 ttctttcgag gggttacgag catctggagc aggttgcggt tcgagtgaag gtccggaatc 3000 atatcgtcta cgtgtgctgc atttacatac gccccaacgg cccaccagaa gtatacgcct 3060 cgcatggaac tgccgtccag gagctcttgg atctttcttc ccatgatgac tcgatcattg 3120 tcaccggtga ttataaccta ccccatctct cgtggacctt cgacgacgac gtgaacggtt 3180 ttatcccgct gaacgcctct tcggagcaag agctggcatt gactgaaaac gttgtcgcaa 3240 ccggactcct gcagatctgc tcgttaatga acgcgaatgg aaggatcttg gatctagcat 3300 tcgtaaacga cgcgcactcc gttgaactca tcgagccacc ttcctcaatc ctcagaaccg 3360 accgacatca caagcctttt gttcttcgag ttgatcttct tgacgtcccg gacgaggcaa 3420 caggaaacta cgaaatggag ccggatttcc gccgctgtga ttttgacctt gccgccgaag 3480 ccctcagtat tattgactgg gatgctatcc tactaggtcg agacacaaat gctacgacta 3540 cgatcctcta cgacgttctt tacgaaatcg tgcgacagtt tgtacccgtt aggcgtattc 3600 cctgcaaccg gaccgaaaag ctcccgtggt ggaacgctga cctgagaaat cggcgaaaca 3660 tcctcaggaa agcgcggaaa aagctgttcc gtgccagtac acccgaaaac agggccactg 3720 ttgagcattt ggaagcggag tacgaatcgc tgcaagactc atcttttcga gaatacctga 3780 atcgtgttca agtagacctg aaagaaaacc catcgtcatt ctggagatat gttagaagca 3840 gaaagcgctc cagcgtactt ccagcaagga tctcgttcaa caacactact gctgagaatc 3900 cagcggatgc tgcaaacatt ttcgccgact tcttcagtaa tgtttacgag acaacggcgc 3960 ccgctgcatc ggccgactac ttgaacacgc taccaactca cgatctacaa tattcccaac 4020 cggagttctc gcaagtcgac gtacaaaccg ctcttgatgt tgttgaccct tctaaaggcg 4080 ccggacctga ccgtcttcca cccgccttca tcaagcggct ttctgcgcaa ctggccaaac 4140 ccgtcagcat aatctttaat cgctctctat ccgaaggtgt ttttcctgac gaatggaaac 4200 tagcagcaat cacgcccatt cacaaatctg gtagtaccat ggctgcggat aactataggc 4260 caatctctat cctgtcctgt ctacccaagg tttttgaggt gcttatccac gagggaatgt 4320 actctgccgc acaggtagta atttccgagt tccagcacgg ttttgtcaaa aaacggtcca 4380 cggtatccaa cctgatggcg tacgtcagcg cactgaatga aaacctcgaa aagttcaagc 4440 aaactgacgc gatctacttc gacttcgcca aagcgttcga caaggtaccc cacacgctta 4500 ttatcgctaa gctagaccgg ctggggttcc caaggtggct gacagcatgg atccgttcgt 4560 acctttcggg acggtttggc tacgtccgcc tcggtggcgc tcaatcaagg aaattcccca 4620 ttccatccgg tgtcccccaa ggcagccatc tgggtccgct gattttcatt cttttcgtca 4680 acgatatttg tcaacaactc cagtcacaaa aattgttgta cgcggatgac ttgaagatct 4740 ttcgtaccgt aacctccgcc attgattgcg tggcgcttca gcaggacatc gacaaaattc 4800 tggagtggtg tagccttaac ggaatgaaag ttaatcctag caagtgcaaa tgcatttcgt 4860 tcacacggtc gtccgcccca attcgttaca attacagttt cgaccgtcac gaactggacc 4920 gcgtcagttc catcagggat ctcggtttgt tgatcgacag aaaactaaac ttttccgagc 4980 acgtttctgc gacggcggcg aaggcgtttg caatgcttgg gtttctccgc cgtaatactg 5040 ctgagtttga gaacatcaac gcactgaaaa cactctacat cacaatggtt agaagcatat 5100 tggagtacgc tgtacaggtg tgggcgcctc accacgctaa tcaacgtgac cgtttggaga 5160 aagtccaacg tcgtttcacc ctcttcgccc ttcgtcgtct accttggaga aatggtgttt 5220 ggtggtcaag ctacagcgac aggtgtgcac tacttcggat gcaatcgctc gaacaacgcc 5280 gtgtcttact tcagcgaatg tttgtctttg atgtcctctc ggatcgcatc gactcccagc 5340 agatccgcga tgaaatcaac gtgtacagac cagtgcgaca gctgaggaac ctgcccatgc 5400 tgagaattcc agtccatcgc acgctctacg gccacaatcg accaattgac aggtgttgcc 5460 gctttttcaa cgaggtgtcc aacgagttct cacctggcct gccgaggaag aattttaagc 5520 tgaaaatcat caatctgtga tatatttgtt gtcaatgttt tgtatgattt tattttttaa 5580 gtttagtttt aaggtattca gtctgtgcga ttaactcgaa gacggtgtaa taaataataa 5640 taataaataa t 5651 // ID Gypsy-261_AA-LTR repbase; DNA; INV; 146 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-261_AA_; KW Gypsy-261_AA-I; Gypsy-261_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-146 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 146 BP; 38 A; 31 C; 20 G; 57 T; 0 other; tgttatatta tcaataccta ttttgatgtg taccttgtga tgaatttact tgccaacctt 60 tcattttgta ccttgatacg ttgtacctta cactcgcaca tccttgccaa taaaaagtca 120 ttctttgttg aactgtacac gagtca 146 // ID Copia-12_SI-I repbase; DNA; INV; 4404 BP. XX AC AEAQ01030007; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_SI_; KW Copia-12_SI-LTR; Copia-12_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4404 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030007; Positions 1629 6032. XX CC Positions [1618-2118] - Integrase core CC 'TATAA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 79..4266 FT /product="Copia-12_SI-I_1p" FT /translation="MEINIDEDARYRVPLFDGTNYSNWKFRLEVLLDEKDL FT LRYVETPLQEILAQFQIAAGDTSEVRAQKEKSREAAQKSAKRCRGLIIQKI FT AESHLEYIKDKTSAYDTWETLKQIFERKSVANQIRLKRKLLCMKFNPTQEN FT LENHILKFDRIVKDLKSSGSTMEENDIICHLLLTMPSEYDAVETAIETVSQ FT DQLTLSFVKGRLFDEEAKRGTRKPKPKNEFQPNSVFVAQTKTRNSEYQQGN FT FKGQKGNAGSKSFTFTCHNCGLAGHKRSQCKKPQQPYKNFNTSTQAHVTQD FT NDNQPKEKSSSKAAFTFLSCENSQEIEKISKERWYLDSGASEHLSSQESKM FT YNIRNLEVPVKIKVAKSGSYLIAKKRGDLKASFCVNGQYNDIIISNILIVP FT DLETNLISVRKLEIKGFKIIFENGMGKIFYKGNIAAIAYRKQKLYELEFSN FT TAEEKALNSEVANNIELWHKRLAHLSYDGIKKLKTLVDGIQVEKVNSGPCK FT ICMEGKQAKLPHKGLRNKTTRPLELVHSDVMGPISPTSYDNMRYIVSFIDD FT FTHFTVIYLMETKDEVFKYFKMYQAMATAHFNLKISRFRCDNGREYISRKL FT KEEFESKGIQFEYTIRYTPENNGVAERMNRNIMEKARCLLLESGLQKIFWS FT EAVMNAVYIINRCPTAALENSVPAEKWYGKRQNAQKIKLFGSLVYLHVPKE FT VRRGKLDSRSERCLMLGYCSNGYRLWNLKENKLTTGRDVVFDEEKTAKTLF FT KKQDFYQGTNIDDNEEKDTQEDAQEELPVEEKTTSEDYEENSNTGINSEME FT YNMKEKQGISGKNQSPTLSKNKIESSDVHDSEELGRGRRQKWAPKRLEDFV FT TNLWPSWNNEEEEDYMAYALSAEEFVDNVPQCYKEIEEREDKEDWKQAVHE FT ELTSLIENGTWSLTPLPAGKKAIDSKWVFKVKRDEDGNPLRYKARLVIKGC FT AQKKGLDYKETYAPVAKISTVRILLSIINYKDFIAHQMDVKNAFLQGNVHE FT DIYMKPPDGFDIKDKSVVCKLNRPLYGLKQAPREWNFRFDSFVKELEFKQS FT KNDECLYIYRKGTVTMYLLLYVDDFIITSNNLEFLRIVKEKLMKEFKMTDL FT GELKFFLGIKIERKTQGMYLSQRQYIINLLSKFNMSNCKPVKTPIEEGFPG FT EDVLKEKLITDKPFKELIGCLMYLMNTSRPDLSLSVNKLSRFQRCPTESLW FT KGVKRILRYLQSTVDLALLYPKNSKLKEQLVGFADADWAGDTFDRKSTSGF FT MVKLFGATVTWTTRKQNTVALSSTEAELVSLCELICDLLWIVRILLDLDLV FT IDFPITLFEDNQSCIRAALSNNFNKRLKHVDVKYHFICDLIKKKEIFEVKY FT LSTNAQTADILTKPLGGTKFCIFRESLGLSLID" XX SQ Sequence 4404 BP; 1652 A; 643 C; 887 G; 1222 T; 0 other; ggttatgggc ccagcccacg ttgcttgaaa taaaaacact cttggaaaca cattggaatt 60 tttggaagaa tcccggaaat ggaaatcaat attgacgaag atgctcgata ccgtgttcct 120 cttttcgacg gaaccaatta tagtaattgg aagtttcgtt tggaagtctt gttagacgag 180 aaagatctgc tgcgttatgt tgaaacgcct ctacaggaaa ttcttgctca atttcaaatt 240 gctgcaggcg atacatctga ggtaagagca caaaaagaaa aatctaggga agcagcacag 300 aaatctgcaa aacgttgcag agggcttatt attcagaaga tagcggaatc tcatctcgaa 360 tacatcaagg acaaaacatc tgcatatgat acatgggaaa cgctaaaaca gatctttgaa 420 aggaaaagcg ttgcaaatca gattcgttta aaaaggaaat tattatgcat gaaattcaat 480 cccacgcagg agaacttaga aaaccatatt ttaaaatttg atcgaatcgt aaaggattta 540 aagtcatcag gctcaactat ggaagaaaac gacattattt gccatctgct tttaactatg 600 ccatccgaat atgatgcggt agagacagcc atagaaacag taagtcaaga ccaattaact 660 ttatcttttg ttaaaggtag actgtttgac gaggaagcta aacgaggaac gaggaagccc 720 aagccgaaga atgaatttca accaaattca gtatttgtag cacaaactaa aacaaggaat 780 tctgaatatc aacaaggaaa tttcaaaggt caaaagggca acgcaggaag taaatcattc 840 actttcacgt gtcataattg tggtttagca ggccataaaa gatcacagtg taaaaaacca 900 cagcagcctt acaaaaattt taatacttcc actcaagctc atgttacaca ggacaacgat 960 aatcagccta aggaaaagtc atcgtccaaa gcagctttta ccttcttgtc atgtgaaaat 1020 tcgcaagaaa ttgaaaaaat ctccaaagaa agatggtatt tagattctgg agccagtgaa 1080 cacttatcat ctcaggaaag caagatgtat aatatcagaa atttggaagt acctgtcaag 1140 ataaaagtag caaaatcagg aagctacttg attgccaaga aaagaggtga tttaaaggca 1200 agtttttgtg taaacggtca atataatgat attattatta gcaacattct aattgtgcca 1260 gatttggaaa ccaatctaat ttcggtaaga aaattggaga taaagggttt taaaatcatc 1320 tttgaaaacg gaatgggaaa gatcttctat aaaggtaata ttgcagcaat agcttataga 1380 aaacaaaaat tatatgaatt agaattttct aacactgcgg aggaaaaagc cttgaatagt 1440 gaggttgcaa ataacattga gctgtggcac aaacgcttag cacatctaag ttatgatgga 1500 attaaaaaat taaaaactct agtagatgga atacaagtag agaaggttaa ttcaggtcct 1560 tgtaaaattt gcatggaagg aaaacaagct aagctacctc ataaaggtct gagaaacaaa 1620 actacaagac ctctggaatt agttcacagt gatgtcatgg gaccaataag tccaacgtcc 1680 tatgataata tgcgatacat tgtttcattt atagacgact ttactcattt taccgtaatc 1740 tatttaatgg aaaccaaaga tgaagttttt aaatatttta aaatgtatca agcaatggcg 1800 actgcccatt tcaatttaaa aatcagtcga tttagatgtg ataacggcag agagtacatc 1860 tcgaggaaac tcaaagaaga atttgaatct aaaggaatcc aatttgaata taccattaga 1920 tatacaccag aaaacaacgg agtagcagaa aggatgaata ggaatataat ggaaaaagca 1980 agatgtctat tattggaatc aggactacaa aagatttttt ggtctgaagc agttatgaat 2040 gctgtttaca taatcaatag atgtcccact gcagctttgg aaaattctgt acctgcagaa 2100 aaatggtatg gaaagaggca aaatgctcaa aaaataaaat tatttggaag tctggtgtat 2160 cttcacgtgc caaaagaagt aagaagagga aaacttgata gtcgatctga aagatgtcta 2220 atgctaggct actgttctaa tggatataga ctgtggaatt taaaggaaaa caaactgact 2280 acaggaagag atgttgtttt tgacgaggaa aaaacggcga agactctttt caagaaacaa 2340 gatttttatc aaggtactaa cattgatgat aacgaagaaa aggatactca agaggatgct 2400 caagaggaat tacctgtaga agaaaaaaca acgtcagaag attatgaaga aaacagtaac 2460 acaggaataa attcggaaat ggaatataac atgaaagaaa aacaaggaat atcaggaaag 2520 aatcaaagtc ccacactatc gaaaaataaa attgaatcat cagatgtaca tgatagcgaa 2580 gaattaggaa gaggaagaag acagaagtgg gctcctaaac gactagagga ttttgttaca 2640 aacctttggc caagctggaa taatgaggaa gaagaagatt acatggccta tgctttgtct 2700 gcagaagaat ttgtagacaa tgttcctcag tgttataaag agatagaaga aagagaagac 2760 aaagaagatt ggaaacaagc tgtacatgaa gagttgacat ccctgattga aaacggaact 2820 tggagcctaa caccactgcc tgcaggaaag aaagcaatcg acagcaaatg ggtatttaag 2880 gtaaaaaggg acgaagatgg aaatccctta aggtacaaag ccagattagt aattaaaggt 2940 tgcgcgcaaa agaaagggct cgattataaa gaaacgtatg cacctgttgc taaaatatct 3000 actgttagaa tcttactttc gattattaat tataaagatt ttattgctca tcaaatggat 3060 gttaaaaatg ctttcttgca aggtaatgta catgaagata tttatatgaa gcccccagat 3120 ggatttgata ttaaagataa aagtgtagtt tgtaaattaa atagaccatt atacgggtta 3180 aaacaagctc ctagagagtg gaattttcgt tttgatagct ttgttaaaga gttagagttt 3240 aaacaatcta aaaatgatga gtgtttatat atttacagga aaggaacagt aaccatgtac 3300 ttattactct atgtagatga ttttattatt actagcaata acttagaatt tttaaggatt 3360 gttaaagaga aattaatgaa agaatttaag atgacagatt taggagaatt aaaatttttc 3420 ttaggaatta aaatagaaag aaagacacaa ggaatgtatc tgtctcagcg tcagtatatt 3480 ataaatttgt taagtaaatt taacatgagc aactgtaaac cggttaaaac tcccattgag 3540 gagggctttc caggcgagga tgtcttgaag gaaaaattaa taaccgataa gccttttaaa 3600 gaattaattg gttgtcttat gtatttgatg aacacatcaa gacctgatct gtctttgtct 3660 gttaataagt taagcaggtt tcaaagatgt cccacggaaa gtctctggaa aggagtcaag 3720 cgaattttac gatatctgca aagcactgtt gacttagccc tgttatatcc taagaattct 3780 aaattgaaag agcaattggt tgggtttgcg gatgcggact gggctggcga tacgtttgac 3840 cgaaaatcaa cgtcaggatt tatggtaaaa ctttttggag ctactgttac ctggacaact 3900 agaaagcaaa atacagttgc tttatcctcc acggaagcgg aacttgtgtc actttgtgaa 3960 ctaatttgtg atttactttg gattgtaaga attctattag atttagattt agtaattgat 4020 tttccgatca cactgtttga agacaatcag tcttgtatac gcgctgcatt atctaataat 4080 ttcaataagc gactaaaaca tgtggatgtt aaatatcatt tcatttgtga tctaattaaa 4140 aaaaaggaaa tttttgaagt taaatatctc tcgaccaatg ctcaaacagc tgatatatta 4200 accaagccac ttggtggaac aaagttttgt attttcagag agagcttagg attaagttta 4260 attgattaat agcttaagaa tgtatgtaat acgtttcttt taactattct gtgtaacttg 4320 ccagaaatag gttgacaaag tttattcact gttgatgcca actaggatag taagttttgt 4380 ttgtaatgac aaattgaggg cggg 4404 // ID BEL-36_AA-I repbase; DNA; INV; 6348 BP. XX AC supercont1.382; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-36_AA_; KW BEL-36_AA-LTR; BEL-36_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6348 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.382; Positions 639666 633319. XX CC Positions [5417-5977] - Integrase core CC 'ATGGG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 60..3620 FT /product="BEL-36_AA-I_3p" FT /translation="MDQSNPHQNETVRHCNLCDRPDSADNLVQCDRCDQYA FT HFSCAGVNDSISDPDRSFVCQSCTANDDLASVLSKRSSRSSHSSRRSARLA FT LNLQLLEEENRIRMKQLEEAEKFRQLKIEAEQEMLRKKFELLQMELESEDD FT DGASRKSRMSKRIVQKNTEKWVQKEGQTDGANKTATTSSNSNSQQRSVSDN FT DRAQNAAAAPELVSLDQVPPTTSASSTPKSDLTTGVPSGSPAGVAIVPSER FT TTVGNVRVINTTSAATRKAPLSEVIVSPQLFSNTTMISKAPVVPICSTMIE FT LPPTSLQHGIYPNTTLPPPSSLVSPPDFLSIAKHAPAPTTSLLPPPGSDLP FT VMPSGRDNMMSNRMIQTRYGEPMNSLPTSAPHGSGQNYSGAVYTTQVPQQS FT IPSLRQPTGSVSHHQAVTISNPIPNNATVTSSLPQPSVSDGLAPNYPVATF FT GPNSIQLAARQVIPRELPVFSGDPQDWPLFFSAFRNSTAVCGYGDAENLAR FT LQRCLRGHALEAVRSRLLIPESVPHVIATLQRLYGRPEVIINALLKNLRSV FT PSPRTDDLKTLIKFGMAVQNLVEHMIVAEQQHHLSNPMLLFELVERLPSHF FT KLEWASYKYGLVDVNLATFSEFMARLVTMASDVTLHVDTGYSPNSKPEKYK FT REKTSAEKLFLHHAEVNSKHEKKPQVERSSPESTSTPLVKTCSYCHRTGHK FT IADCDAFKALDCDGRWKAVREKGLCRICLVPHKSWPCRSKRDCGYEDCRLR FT HNVLLHSKAPKNNPVTGIQNIASTSNIAHQNHHFLKSSTLLRYVPVTLYGN FT GRTVSVFAFLDDGSSSTMMDVELAEDLGVEGISEPLTLTWTSDVSRVEGKS FT QRLQLTISGADKDQKHPLVVRTVNQLKLPSQTMDYEHMCQMHPYLKNLPLE FT SYSNVIPRIIIGVDNVKLISVLKVREGGQGTLVAAKTRLGWSVYGKYSNME FT AVEHVNFHQHQVKCTIDNDQLHDMMKHFFNIEESCIAIKPESEENQRAVQI FT LERTSVRVPSGFQVGLLWKKDEFNLPDSYPLALRRLKCLEQRLLRKPELYA FT KVNELIQNYINNGYAHVASDVELESADPRRTWYLPLGVVQNPRKPSKVRLI FT WDAAASVKGLCFNDLVLKGPDLTTSLIAVLIRFRQRNIAIGGDLREMFHQI FT AIRPEDRQFQRFCGEDIKMNRRKFWLWT" FT CDS join(3323..4198,4202..6346) FT /product="BEL-36_AA-I_1p" FT /translation="MVLTSRGCTKSQETKQSSLDLGRGGIRERSVLQRSCT FT QRTRPYDIIDSRADQISPKEYCHWWRSERDVSSDRHSTRGPTIPKILWRRH FT QNESPQVLVMDVAIFGASCSPCIAQFIKNKNASEFLDEYPRAAQAIIDNHY FT VDDYLDSVDTVGEAVELMEQVKYIHANGGFEIRQFVSNSTEVLRNLGVEES FT MADKNLNLETESSRVLGMFWLPSRDVFTFTTPSNSEPLLLETHSSSPTKRQ FT VLRVIMSVFDPLGLIAHFVIHGKILMQHIWRAGTNWDEEIPAHLRQSWTTW FT NYLPNLKEIQIPRCLLGFLRSEKSITSELHVFVDASREAYACAIYLRTMGT FT SGVECSLIVAKSKVAPLKPTSIPRLELQAALIGVRLMEHVQQSLTIRITRR FT IFWSDSSTVLSWLQTDGFRYHQYVALRVGEILSSSKANEWRYVPSGLNVAD FT DATKWKTGPDLSITSIWFNGPMFLRQSESSWPVQPRVQQTNEEQIQHVCLH FT REKIGESIIDISRFSNWNRLHRAAAYVCRAVESFKGRRATSVCAMLTTSEL FT AAAENLLWRQVQREVFQMEYHIFSSRNDESVEVVVDKSSTLYKLSPFMDKN FT GVIRMGSRIGAASFAPYEAKYPLILPKQHHLTFLLVDSYHRRLLHANNETV FT LNDMRQRFYVPALRRLVRRGAAECQQCKVRKASPQAPRMASLPRARVTPFV FT KPFTYVGVDYFGPMMVKVGRSQVKRWVALFTCLSIRAVHMEVVHSLSTNSC FT IMAFRRFVARRGAPLEVYSDNGTCFVGANRRLKKEIQEINEKCAATFTNAR FT TSWIFNPPAAPHMGGLWERMVRSVKTAMHAIGDGLRHPNEETFETIVLEAE FT AIVNSRPLTYIGLDSGDQEALTPSHFLLYGVQGVNQPAVQPIEYRATLRDS FT WKLAQWMIDEFWRRWIREYLPVITRRSKWFEQVKPIQLGDVVLVIDDSMRN FT SWIKGRIVEVYPGKDGQVRRAKVQTAKGIIIRPAVKLAVLDVLGSTRSHES FT G" XX SQ Sequence 6348 BP; 1827 A; 1442 C; 1539 G; 1540 T; 0 other; ttcttcaaag atttcatctg tttccaccgg attgaaggca gtagtttcat ccggagagaa 60 tggatcagtc gaatccacac caaaatgaga ccgtgcggca ctgtaatttg tgtgacagac 120 cggattccgc agataatctg gtccagtgtg atcgatgtga tcagtatgct catttctcgt 180 gcgccggggt gaatgattcc atatctgatc cggaccgtag ttttgtgtgt cagagctgta 240 cggcaaacga tgaccttgca tcagtacttt ccaagcgatc gagtaggagt tctcatagta 300 gccgtagatc agcgcgactc gctttaaact tgcagctact ggaagaggaa aaccgaatcc 360 gcatgaaaca attggaagaa gcggaaaagt ttcgccagtt gaagatcgag gcagagcaag 420 agatgcttcg aaagaaattt gagctgctac agatggaatt ggaaagcgaa gacgacgacg 480 gagcaagtag gaagagcaga atgagcaaac gaatagttca gaagaatact gagaagtggg 540 tccagaaaga aggtcagaca gatggagcga acaaaacagc taccacgtcg tccaacagta 600 actcacaaca gcgttctgtg agcgacaatg atcgtgcgca aaatgccgct gccgctcctg 660 aacttgttag ccttgatcaa gttccgccta caacctcagc aagtagtact ccaaagagtg 720 accttactac cggtgtacca tccggaagtc cagccggcgt cgcgatcgtt ccttcagaaa 780 gaacaacagt gggcaatgtg agagtgatta atacgacgag tgctgctaca cgtaaggctc 840 ctctttcgga agtcatcgtc agcccgcagt tgttctcgaa tacaacgatg atttcgaaag 900 ctcccgtggt accaatatgt agtacgatga ttgaattacc accaacatca cttcagcatg 960 gcatatatcc aaatacgacg ttaccacctc cttcgtcttt agtgtcgcct ccggatttcc 1020 tttcgattgc aaaacatgct cccgcaccaa caacaagctt gctaccgccg ccagggtctg 1080 atttaccagt catgcctagt ggccgggaca acatgatgtc aaataggatg atccagacac 1140 gttatggaga accaatgaat tcgctgccaa catcagcccc tcacgggtcg gggcagaact 1200 atagcggtgc ggtgtatacg actcaagtcc cccaacaatc gattccgtcg ctaagacaac 1260 ctacgggctc agtatcgcac catcaagcag tcaccattag taatccgatt ccgaacaatg 1320 caactgtgac ttccagcttg ccacaaccat cagtatccga tggattagcg ccgaattatc 1380 cagtcgccac atttggaccg aattcgattc aacttgccgc tcggcaagtg attccaagag 1440 agctaccagt attttctggt gatccgcaag actggcctct ctttttcagt gcctttcgga 1500 attcaactgc agtctgcgga tacggcgatg ccgaaaattt agctcgttta caacgatgct 1560 taagagggca cgcattagaa gctgtacgca gtaggttgct gattccagaa tccgtacccc 1620 atgtgattgc caccttacaa cgtttatacg gtaggccaga agtaattatc aatgccttgc 1680 tgaaaaatct tcggagcgtc ccttccccca gaacagatga tttgaaaaca ttgattaaat 1740 ttgggatggc ggtgcaaaat cttgtcgagc atatgatcgt agctgagcaa cagcatcatc 1800 tctccaaccc tatgttgcta ttcgaactgg tggagcgttt accctctcac ttcaagctgg 1860 aatgggcaag ctataagtac ggattggtgg atgtgaatct agcaaccttc agtgaattta 1920 tggctcgatt agtcacaatg gcgagtgatg ttacattgca tgtcgataca ggttattctc 1980 ctaactcaaa gccggagaaa tacaaacgag aaaaaacttc agcggaaaaa ttattccttc 2040 atcatgctga ggtgaactca aagcatgaga agaaacctca agtagagaga tcctcaccag 2100 agtcgacaag tacgcctttg gtgaaaactt gctcgtattg tcatcgcact ggtcataaaa 2160 ttgcagattg tgacgcgttc aaagctctag attgtgatgg tcgatggaag gccgttcgag 2220 agaaaggatt atgccgaata tgtctagtgc ctcacaagtc ctggccctgt cgatccaaaa 2280 gagattgcgg atatgaagat tgtcggttgc gtcacaacgt gctgttgcat tcgaaggcac 2340 caaaaaataa cccggtcacc ggtatccaga acatcgcaag caccagcaac attgcgcacc 2400 agaatcatca cttcctaaag tcaagtacac ttttacgtta cgtcccggtt acgctttacg 2460 gaaatggaag aacggtaagc gtgttcgcct ttttagatga cggatcatct tcaaccatga 2520 tggacgtaga gcttgcggaa gaccttggag tagaaggcat aagcgaaccg ttaacgctca 2580 cctggacgag cgatgtatcg agggtcgaag gaaagtcaca acgcctacag ttgacgatct 2640 ctggagctga taaagatcaa aaacatccat tggtagttcg aactgtgaat caacttaaac 2700 ttccgagcca aacaatggac tacgaacata tgtgtcaaat gcacccgtat ctcaagaatt 2760 taccgcttga aagttacagt aatgtaattc cacggatcat cataggagtg gataacgtta 2820 agttgataag tgtactaaaa gtacgcgaag gaggccaagg gacacttgta gctgcaaaaa 2880 ctagacttgg ctggagcgtg tatggaaaat acagtaatat ggaagccgtc gaacacgtta 2940 atttccacca gcatcaagta aaatgcacca ttgataacga tcagctgcat gatatgatga 3000 aacacttctt caacatagaa gagtcgtgca tcgccattaa gccggagtcc gaagagaatc 3060 agcgagcagt tcaaattctt gaacgaactt cggttcgggt accttccggg ttccaagtgg 3120 gcctcttgtg gaaaaaggac gaatttaacc ttcctgactc ctatccgttg gcactccgcc 3180 gacttaaatg cctggaacag cgccttctgc ggaagcctga gttatatgca aaggtgaatg 3240 agctcattca gaattacata aacaatggat acgctcacgt ggcatcagat gtggaactgg 3300 aatcggcaga tccgagacgc acatggtact tacctctagg ggttgtacaa aatcccagga 3360 aaccaagcaa agttcgcttg atctgggacg cggcggcatc cgtgaaaggt ctgtgcttca 3420 acgatcttgt actcaaagga cccgacctta cgacatcatt gatagccgtg ctgatcagat 3480 ttcgccaaag gaatattgcc attggtggag atctgagaga gatgtttcat cagatcgcca 3540 ttcgacccga ggaccgacaa ttccaaagat tttgtggaga agacatcaaa atgaatcgcc 3600 gcaagttttg gttatggacg tagccatttt tggtgcatct tgctccccat gcatagcaca 3660 attcataaaa aataaaaatg cttctgagtt cttggatgaa tatccacgag cagcgcaggc 3720 gatcatcgac aaccattatg tcgatgatta cctcgacagt gtggatactg ttggtgaggc 3780 ggtcgaatta atggaacaag tgaagtacat ccacgccaac ggtgggtttg aaatcagaca 3840 gttcgtatcg aattcaacgg aagttttacg gaacctggga gtcgaagaaa gtatggccga 3900 taaaaacctg aacttggaaa cagaatcatc gcgtgtgctc ggaatgttct ggcttccatc 3960 tcgagacgta ttcaccttca ccacaccttc aaactctgaa cctttgctgt tggaaacgca 4020 ttcatcatcc ccaacaaaac gacaagtact ccgtgtcatt atgagcgtct ttgatccgct 4080 tggactaata gcccattttg ttatccacgg aaaaatcctg atgcaacata tttggcgtgc 4140 aggaacaaat tgggacgaag agataccagc gcatcttcgg caaagttgga caacttggta 4200 gaattacttg ccaaatttga aggaaatcca gattccacga tgtctcttag gtttcttgcg 4260 ttctgaaaaa tcgatcacat ccgagttgca cgttttcgtg gatgctagtc gagaagctta 4320 tgcttgtgca atttatctgc gaacgatggg aacatctgga gttgagtgct ctcttattgt 4380 agccaaaagt aaagttgcgc cgttgaaacc tacatcaata ccgagattag aactacaggc 4440 agcattaata ggagtaagat tgatggaaca cgtacagcag agcctcacta ttcgaattac 4500 cagaagaatt ttctggtcag actcatcaac cgttctgtcc tggcttcaaa cagatggatt 4560 tcgatatcat caatacgtgg cattgcgggt aggagagatt ttgtcttcgt ccaaagcaaa 4620 cgagtggaga tatgtgcctt cgggactaaa cgtggcggac gacgccacaa agtggaagac 4680 cggaccagac ctcagtataa ccagtatctg gttcaacggc cccatgttcc ttcgtcaatc 4740 ggagtcgtcc tggccagttc aacctcgagt tcaacaaact aacgaagaac agattcaaca 4800 cgtgtgtcta catcgcgaaa aaattggtga atcaatcatt gacatcagcc gtttctccaa 4860 ttggaatcga ctacatcgag cggcggcata cgtttgcaga gccgtagaat cattcaaagg 4920 tagaagagcg acatcagttt gtgcgatgct gactactagt gaacttgcag cggcagagaa 4980 cctgttgtgg cgccaggtac agagagaagt tttccagatg gagtatcata tttttagttc 5040 tcgaaatgac gaaagtgtcg aagtggtggt ggacaaatct agtacgctgt ataaattgtc 5100 cccatttatg gacaagaatg gagtcatacg tatgggcagt cgcattggag ccgcttcgtt 5160 cgctccgtat gaagcaaagt accccctcat tttgcctaaa cagcatcatt tgacgttcct 5220 tttggtcgat agctatcacc gtcgtctgtt acatgcaaac aatgaaaccg ttttgaacga 5280 catgagacaa cggttctacg ttcccgcctt acgacgcctc gttaggaggg gggcagcaga 5340 atgtcagcaa tgcaaagtcc gcaaagccag ccctcaggct cctcggatgg cttcactacc 5400 cagggcaaga gtcacacctt tcgtaaaacc tttcacttat gtaggtgttg attattttgg 5460 accgatgatg gtaaaagtgg gacgaagcca agttaagcga tgggtcgcat tatttacctg 5520 tctttcgatc agggcggtgc atatggaggt tgtccacagt ctgtctacca attcttgcat 5580 aatggccttc aggagattcg tcgcgcgacg tggggctcca ctggaagtat acagcgataa 5640 tggaacttgt tttgttggag ccaaccgacg attgaagaag gagattcagg agataaatga 5700 gaaatgtgca gcaacgttca ccaacgcaag aactagctgg atattcaacc ccccagccgc 5760 acctcacatg ggcggactgt gggaacgaat ggtgcgttcc gtgaaaaccg ctatgcacgc 5820 catcggcgat ggacttcggc accctaacga agagacgttc gaaaccattg tattggaagc 5880 tgaagcgata gttaattcac gtccgctgac gtatattggc ttggactcag gagatcaaga 5940 agcgttaact cctagccact ttcttctcta tggcgtacaa ggtgtgaatc agcccgcagt 6000 ccaacctata gaatatcgag ccacattgcg agacagttgg aagttggcgc agtggatgat 6060 cgacgaattt tggcggcgat ggatacggga atatctacca gtgataacga gacgatccaa 6120 atggtttgaa caagttaagc caatacaatt gggagatgta gttctggtga tcgatgactc 6180 catgaggaac agctggatta aaggaagaat tgtggaagtg tatccaggaa aagatggaca 6240 ggttcgacga gcaaaggtgc aaacggcgaa gggtatcatt attcgtccag cagtgaagct 6300 ggctgtcctg gatgtcctcg gaagtaccag gtcacacgag tcggggaa 6348 // ID Gypsy-7_DVir-I repbase; DNA; INV; 6469 BP. XX AC scaffold_10188; XX DT 10-MAR-2011 (Rel. 16.03, Created) DT 10-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_DVir_; KW Gypsy-7_DVir-LTR; Gypsy-7_DVir-I. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-6469 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (09-MAR-2011). XX DR Genome; scaffold_10188; Positions 8353 1885. XX CC Positions [5460-5948] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1487..2686 FT /product="Gypsy-7_DVir-I_2p" FT /translation="MEWSELAKKINNFKDKFEKSYKWVNKNAPIRVETLIH FT HINILVLNYNNIVQLYLSKENRLTEQHVNHQCKKIIKSLGTRISNISTRHN FT IVIDIPTDLKLMATFDPDIEADSDIEPLELTEPNMAVNQVALDREYVRQVS FT STIPEFDGKKLSLARFLTALRLVDRTKGTQEDLAVEVIKSKIIGPILYKVQ FT NETSIIGTINKLNTNIKGESTDVIKAKLLNIKQKGKSASQYTTENDSLRKQ FT LEAAYIDDGLDFDNAEKFSTKETISAMTKNCEYEKLRIILETGNFSNFNDA FT MGKYIQCSTEMTGCPSTILYYKGRNDRGNYSRNNQRGRDKSRGNSNFHYNG FT NNNNRGNFRGNYRGGNRGRNNSNRGGYQNQNSNNRGNVRVTQGNSENQQTP FT SDVQSQ" FT CDS 2725..3870 FT /product="Gypsy-7_DVir-I_1p" FT /translation="MFVKIKNERTGKSITLLVDTGADISILKENTDIFENV FT NETITTQISGIGEGIIPSKGLAFIELTTGKYIIPHNFHLVEQNFPIPSDGI FT LGIDFIKQYNCQLDFTQSGDSLILRPNKLNYPIRLKILHTCDNNATVLPAR FT SEVVREIQIPSIQNQILILNQEIDEGIYIANTISNHKNTYVRNINTTNANK FT IVNLNKIKYETLNNYDIVKETNEGRDNFVMEKLIKNFPPLFKNQLKDLCTR FT YTDIFGLETESITTNNFYKQKLRLKDDEPVYIKNYRSPHSQVQEIQAQVQK FT LIDDKIVEPSASEYNSLLLLVPKKSLPDSDKNKWRLVIDYRQINKKLLSDK FT FPLPRIDDILDQLGRAKYFSCLDLMSAFTKSNSKKVLEI" FT CDS 4386..6302 FT /product="Gypsy-7_DVir-I_3p" FT /translation="MALINPTLLQYPDFAKEFCIITDASKQACGAVLTQNH FT NGLQLPIAYASRSFTKGESNKSSTEQELAAIHWAITHFRPYIYGNHFTVKT FT DHRPLTYLFSMVNPSSKLTRMRLELEEYNFTVEYLKGKDNFVADALSRIAI FT KQLQEITGTIHKVTTRYQSRQNSCAGKYQVELPRQSTEKASQPNVYEVINN FT DEVRKVVTLHIKDTLCFFKHGKKIIARINIGDLYTNGNIDLGQLFPRLENE FT AGILKISQLKMAPWENIFDTISIDNFKNMGNKILTKLRVALLNPVTTITNN FT KEKEAILSTLHDDPKQGGHTGITKTLAKVKRHYFWKGMTRDSTEYIRKCHK FT CQKAKITKHSKTPLIITDTPVHAFDRVIVDTIGPLPKSDNGNEYAITLICD FT LTKYLVAIPIPNKNASTVAKGIFQSFILKYGPMKTFITDMGTEYKNSIIND FT LCKFMKIDNITSTAHHHQTVGTVERSHRTFNEYIRSYISVDKTDWDVWLQY FT FVYCFNTTPSTAHNYCPYELVFGKTCNLPKYFNSIANIEPIYNIDDYAKES FT KYRLEVAYKRARNMLEFNKMNQKKRYDLNSKELKLSIGDKVLLKNEVGHKL FT DFKYTGPYRVEKIEDRDNIVISGNKNKTQTVHKDRLKIFNS" XX SQ Sequence 6469 BP; 2595 A; 1153 C; 1099 G; 1622 T; 0 other; tggcgaccgt gacaggacgt cacaaaaaca tgaaaacgtt aagatacata acttcaactc 60 taattttaaa ccagtcataa aagtaattaa agaaagtaaa ttaaaaggaa ataacactca 120 aaaacaactg aaacaacgag gcatctggaa cggcacacac cctggagaat caaaaattaa 180 ctgagtgagt acagggggca tgagctctaa aaatacttta aagctaaaaa ggcgcagtgg 240 cattaaagag cggatcaggc aaagaaagtc tgagatggta agactcttat ataagtccca 300 aacccaatac cttgggaaac tgcagtcaag ctacatgacg aactcaaaca tcttgagtcg 360 ttaaacaaag taaagaaccc tccctgtccc tcagaggaaa aacttattaa tccagataca 420 catcgaccac agccacaacc gatcccgaga aggcaacgtg aatgcccagc agactgcgag 480 ggaccgcttt acagccaaca ttgcctgggc acatgtcagc ggccacccac gagtaaaaac 540 tagtacgaga ggcaaaccgc cccttgtacc aacgctggaa caagaagaag tacgagaggc 600 agaccgcccc ttgtaccaac gctggaaaac accaagggtg aactgaccga cagcgccaca 660 gagggcgcac agcaaaaaaa agtttataat tataattatt attaaaatat ttgtaaaact 720 attgtaatgc ccgacgacaa ttgagtcgac cggcaagacg tttggcccgt aaagccatcg 780 aaatactagg caggacaaaa tgtacattca gtaaaaactg aattaatatt aaatttctcc 840 ccgcccacgg gcgtaaatac aaaaaatttt aaatataaac tcaaactaca aaccttataa 900 agttaatgaa tagcgttaac aaaactataa gttcatatat aaatcaagta tatattaaga 960 ataacaaccc caaatatggg tttggtagac gttaagactg ttgactcgac ggctaacgta 1020 ataaataaag ttgaggtcag caacacagat gtaaaattga taacaatttt tgtcgttatg 1080 atatttgtca tattagtagg acatctattg tacaaattat acacgtatca taataaatgc 1140 ataaaaaaga gataccagag tcacgcaaat gacctggata aaatctaaat ttttcttgaa 1200 caatgataag cgacaaaata aaataaataa ataaaatgaa atttcgagaa atatatatat 1260 attcttatta ccagaaatgc ctaataataa gtttgcgata gaagtatact tagaagcaat 1320 atataatagc gacgcagact ttagaattcg tgcattgaat attctacaac aataagataa 1380 aaggcataat tttaaataca ctgccatatg ttggcattaa agaaatagtg gtccagtgga 1440 cggcgagtgc ccaattataa tggtaaaact tcataaatag agaactatgg aatggagtga 1500 attagcgaaa aaaataaata atttcaaaga taaattcgaa aaatcttata agtgggttaa 1560 taagaacgca cctataaggg ttgaaacgct aattcaccac ataaacatat tagtgctaaa 1620 ctataataac atagttcaac tttatttaag taaagaaaat aggctcacag agcagcatgt 1680 aaatcatcaa tgtaagaaaa taatcaaatc acttgggact agaatatcga atatcagcac 1740 taggcacaac atagtaatag atatcccaac agatttaaaa ttaatggcaa cctttgaccc 1800 tgatattgaa gctgatagcg acattgagcc acttgaatta acagaaccaa acatggcagt 1860 aaatcaagtg gccttagacc gcgaatatgt aagacaggta tcgtctacaa ttccggaatt 1920 tgatggcaaa aagctatcac tggccagatt tctaacggct ttaagactag tcgatcgtac 1980 taaaggtaca caagaagact tagccgtaga agtcataaag tctaaaataa ttggcccaat 2040 tttgtacaaa gtacaaaatg aaacatccat tattggtact ataaataaat taaataccaa 2100 tatcaaaggc gagtcaactg acgttatcaa agctaagttg cttaacataa agcagaaagg 2160 caagtcagcc tcgcaatata caactgaaaa tgacagtttg cgtaagcaac tggaagcagc 2220 ctatattgac gatggactag attttgataa cgcagagaaa ttttctacta aagaaacaat 2280 atctgccatg acaaagaatt gcgaatacga aaagcttcgc ataatattag aaacgggcaa 2340 tttcagcaat ttcaatgatg ctatgggtaa atatatccaa tgcagcactg aaatgactgg 2400 atgcccaagt acaatattgt actacaaagg cagaaacgat cgtggaaatt acagccgtaa 2460 caaccaaaga ggtagagaca aaagccgcgg caatagtaat ttccactata atggtaataa 2520 caacaatagg ggtaatttca gaggaaatta ccgaggtggc aaccgaggta gaaacaattc 2580 aaaccgaggt ggctaccaaa accagaacag taacaataga ggtaatgtcc gagtaactca 2640 aggtaactcg gaaaaccaac agaccccctc agatgttcaa agccaataaa tggctcaata 2700 aaaaccatca accttaatct tagcatgttt gtaaaaatta agaatgaaag aacaggtaaa 2760 tcaattacac tgttagtaga tactggtgca gatatatcca tattaaaaga aaatacagat 2820 atattcgaaa atgtaaacga aacaattact acccaaatat cgggtattgg cgaaggcata 2880 attccttcaa aaggcttagc ctttatagaa ctcactacag gcaaatatat aattccacat 2940 aactttcatt tagtcgaaca aaacttccca ataccgagtg atggcatact aggcatagat 3000 tttataaaac aatataattg ccaactagat ttcactcaaa gtggcgactc acttattctt 3060 cgcccaaaca agttaaatta tccaattcga ttaaaaatac ttcatacttg tgacaataat 3120 gccacagtac tcccagctcg ttccgaagtt gtccgtgaaa tacaaattcc ctcaatacaa 3180 aatcaaattc ttatattaaa tcaagaaatt gatgaaggaa tttacatcgc aaacacgata 3240 tccaatcaca aaaatacata tgttcgtaat ataaatacaa caaatgccaa taagatagta 3300 aacttaaata aaataaaata tgaaactttg aacaattatg atatagtaaa agaaaccaac 3360 gaaggaagag acaacttcgt tatggaaaaa ttaatcaaaa actttccacc tttatttaaa 3420 aatcaattaa aagatttatg caccagatat accgacatat tcggattaga aaccgaatcg 3480 ataacaacta ataattttta caagcaaaag cttagattga aggatgatga gcccgtatac 3540 ataaaaaatt atagaagtcc tcacagccaa gtacaagaaa ttcaagccca agtgcaaaaa 3600 ctaatagacg acaaaatagt cgaaccatca gcctcagagt ataatagcct acttttatta 3660 gttccgaaga agtcccttcc ggactctgac aagaacaaat ggcgattagt aattgactat 3720 cgtcaaatta ataaaaagct attgtccgat aaatttccgc taccgagaat tgatgatatt 3780 ctagatcagc taggtagagc caaatatttt tcatgcctag acttaatgtc ggctttcacc 3840 aaatcgaact cgaaaaaagt tctagagata taacgtcatt ttcaacgagc aatggctcgt 3900 atcgctttac gcgattacca tttggcttga aaatagcccc aaaatccttt caaagaatga 3960 tgacaatagc attttctggt cccggcctgg ttcaggcctt cctgtatatg gatgatctga 4020 tagtaatcgg ttgttccgaa aaacatatgg ttaaaaattt aactgaagtt tttgacaaat 4080 gtaggaaatt caacctgaaa ctacatccag aaaaatgttc atttttcatg catgaagtca 4140 catttttggg acataaatgc actgataaag gaatcttgcc agacgataaa aaatacgacg 4200 tcataaagaa ctacccagtc ccacatgatg cggacagcgc taggcgattc gttgcatttt 4260 gcaattacta taggcgcttt attaaaaatt ttgccgacta ttcacgtcac ataacaagat 4320 tatgttaaaa agacgttaag ttcgaatgga catctgaatg ccagaatgcc ttcgatcacc 4380 tcagaatggc tcttataaac ccaaccttgt tacaatatcc agatttcgct aaagaattct 4440 gcataataac tgatgcaagt aagcaagcgt gtggagcggt tttaactcaa aaccataatg 4500 gactccaact tccaattgca tacgcatcaa gatcctttac taaaggagaa agtaacaaga 4560 gttctactga gcaagaatta gcggctattc attgggcaat aacgcatttc cgaccatata 4620 tttatggtaa ccatttcacg gttaaaactg accatagacc gctaacctac ttattttcaa 4680 tggtaaatcc aagttcaaag ttaacacgta tgaggcttga attagaggaa tataatttta 4740 cggtagaata tctgaagggt aaagataatt tcgtagccga tgcgttatca agaatagcca 4800 taaaacagct acaggaaata acagggacaa tccacaaagt cactacaaga tatcaaagta 4860 gacaaaattc ctgcgcagga aaatatcagg tagagttgcc aaggcaatct acagaaaaag 4920 cttcacagcc caacgtatac gaagtcatta acaatgacga ggtacgcaaa gtagtgacct 4980 tgcatataaa agatacgctt tgtttcttta aacatggcaa gaaaattatt gcaagaatta 5040 acattggtga tctatataca aatggaaata ttgacttagg tcaactattc ccaaggctcg 5100 aaaatgaagc cggtatattg aaaataagcc aattgaaaat ggcaccgtgg gaaaatatct 5160 ttgatacaat ttcaatagat aatttcaaaa atatgggcaa taaaatattg actaaattaa 5220 gggtagcgct actcaacccg gtgaccacaa taacaaataa taaagaaaaa gaagctatat 5280 tgtctacact acatgacgac ccaaaacaag gaggtcatac tggcattaca aaaaccttgg 5340 ccaaggtcaa acgacattac ttttggaaag gtatgactcg agatagtact gagtacatac 5400 gtaaatgtca caaatgccag aaagctaaaa taactaagca tagtaagacc ccattaatta 5460 taactgatac acctgtacat gcgttcgata gggttatagt ggacacaatt ggtccgctac 5520 ccaaatctga taatggtaat gagtacgcaa tcacattaat ttgtgacttg actaaatacc 5580 tagtcgcgat accaatacca aacaaaaacg ctagtacagt agccaaaggt atattccaat 5640 cttttattct aaagtacggt ccaatgaaga cgttcattac ggacatggga acagaataca 5700 agaactcaat tataaatgat ttgtgcaagt ttatgaaaat agacaacata acatcaactg 5760 cacatcatca ccagacagtt ggaacagtcg aaagaagtca cagaactttt aacgaataca 5820 tacgatctta tatatcagtt gataaaactg attgggacgt atggctccaa tactttgtct 5880 attgttttaa tacgacccct tctacggcac ataattattg tccgtatgaa ttagtatttg 5940 gtaagacatg taatctacca aaatacttca atagcatagc aaacatagaa ccaatatata 6000 acatagatga ttatgctaag gagagtaagt atagattaga agtagcatac aaaagagcaa 6060 gaaatatgct agaatttaat aagatgaatc aaaagaagcg atatgactta aatagtaagg 6120 aattaaaatt atccatagga gataaggttt tattaaaaaa tgaagtagga cataaactag 6180 attttaaata cacagggccc tatagagtag aaaaaataga agatagggat aatatagtta 6240 tttctggaaa taagaataaa acgcaaacag tccataagga cagattaaaa atttttaatt 6300 cataatcaag catgcaaaaa tacaaaaaaa aatatatata tataaaaaaa aaaaaaaaaa 6360 caacaaaata aatatatata aaaaaaaaaa aattttaaaa taataaaaat aaataattat 6420 taaggaaaat aactccattg tattacgtta ttttttaaaa ggagggaga 6469 // ID Copia-2_SI-I repbase; DNA; INV; 4215 BP. XX AC AEAQ01001041; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_SI_; KW Copia-2_SI-LTR; Copia-2_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01001041; Positions 1141 5355. XX CC Positions [1718-2218] - Integrase core CC 'CTGTA' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 254..2647 FT /product="Copia-2_SI-I_1p" FT /translation="MMADSLVDLKNVTKFDGQNFQLWKFQMKAIFIAHDLL FT DIVNGTAVKPEATGQPQIAWIQRNAKATVLISAAMQPSQLEHLITCDSAAE FT MWNKLSAIHEQTSATSKLLLTTRFHEYRMKLGDSVSQHITKVENMARQLKD FT VGEAVSDVTIMAKILGSLPPKYNAFVTAWDSVEEANQTIFNLTQRLIKEEN FT QLTATDEAASALAAVNLKHHKKGPRRGKLRQPQESDAEHGKYQNFECHFCH FT NKGHIARYCKKKRNVQKGQDEKKRGANDDRKSDLADHDAFVASSDISVTEC FT TDTDDAWFSDSGASQHMSFRKEWFAEFESCNGITVSLGDNSLLKAEGRGTI FT FIKKLVNGEWQDGKLVNVLYVPLLRKNLFSIGACTAKGHVFVIRDNGIDIY FT SKQNKLIAHGIRQGNNLFRLEFTAIVKREANAVVRSTLKLWHDRLGHVNNK FT YLQAAIENKLIDGVKVSEISDFFCENCQYGKQHRLAFKSKSPRKMLKPGEF FT IHTDLCGPMQTPSIGGAKYFILFKDDCTGFRYVHFLRHKDDVFDVFKEYEA FT MIFNKFSQRIQTIRSDNGTEYSSHRFRDHLKSRGIQLECSAPYTPQQNGSS FT EREMRTIVESARTMLLAKKLPTRLWAEAVNTAVYVLNRTPSSRSKLTTPYE FT VWTGKKPNLSHAKIFGCDAYVHVPEQQRKKWDSKSKKLVFVGYQGDSRNYR FT LIDPETDRITISRDVSFNECAEPIFECPAASLPLPNERAVLDNNEQPEQID FT DCPRREADQCENPHHEVEPDVINSRPKLRNRNLLRALPIRSLFCFIH" FT CDS 3033..4145 FT /product="Copia-2_SI-I_2p" FT /translation="MRVPKGLNVPNDNSICKLNKALYGLKQAGRCWNKKFD FT AFFKKFKFARGSADRCVYFGLINNDKVYLALYVDDGLLMASKIETINLILN FT ALKDNFDVTASEADCFVGMQIERDRVNKKICIHQSKYIDSILHRFAMCDAH FT VISVPADPHVTLEKCLSNDELHDIPYREAVGSLLFVSLVSRPDITYAVGLV FT SRYLERHSNPHWQAVKRIFRYLKGTKNLGIIYTNSGSKLNLVGFSDSDYAG FT NKDTRRSTTGYLFELANGPITWCSKRQSTVSLSTTEAELIAASEAAREAIW FT LRKLLNDVGHPCETPSLLYVDNQSAIRLTKNPEFHRRTKHIEVRHHFIRER FT YESGEINIRRLHTFKRSKNGPVNQTDSA" XX SQ Sequence 4215 BP; 1291 A; 927 C; 1002 G; 995 T; 0 other; ggttatgggc ccagatcacg gacattttcg agtgaaataa gatttcgaag aatacgtgag 60 aaggtatagt gtgccgcttg ttccttcggc gttttttttt tctgattgca cgaggcacag 120 tcgaacacgt cgggaacttt ccgacgcctc gcagtttttt tctgttctcg tggtatacac 180 gtgttaagtc gtcgcccatt tgacgcgcgt gttgtgatcc tgtctggact gaaaaggatt 240 gtgaaacgag aaaatgatgg cagactccct tgtagacctg aaaaacgtga cgaagttcga 300 tggacagaat ttccagctct ggaaattcca gatgaaggcg atcttcatag ctcacgattt 360 gctggacata gtcaacggaa ccgcagtgaa accggaagcg acgggacagc cacaaatcgc 420 ctggatacaa cggaacgcaa aggcgacggt gctcatatcg gccgctatgc agccctcgca 480 gttggaacac ctaattacgt gtgatagtgc cgctgagatg tggaacaagc tatcggcgat 540 ccacgaacag accagcgcaa caagcaagtt actgctcacg acgaggtttc atgagtaccg 600 catgaaactc ggcgactcgg tctcgcagca tatcacaaaa gttgagaaca tggcaaggca 660 actcaaggac gttggtgaag cggtctccga tgtcacaatc atggcaaaaa tccttggatc 720 gctaccgcca aaatacaatg cttttgtgac cgcttgggac agcgtggaag aggcgaacca 780 gaccattttc aacctgacgc agaggctcat caaggaggaa aaccagctca ctgcaaccga 840 cgaggcggcg agcgcactag cagccgtcaa cttgaagcat cacaagaagg gtccgagaag 900 agggaaacta cgccaaccac aggagagcga tgccgaacac ggaaagtatc aaaattttga 960 atgtcatttt tgtcataata aagggcacat tgcccgctat tgtaaaaaaa agcgtaacgt 1020 acagaaaggc caagacgaaa agaaacgcgg tgcaaatgac gatcgcaaaa gcgacctcgc 1080 ggatcacgac gcgtttgtcg cgagcagcga catttctgta accgagtgca cggacacaga 1140 cgacgcatgg ttctctgaca gtggcgcgtc gcagcacatg agttttcgga aagaatggtt 1200 cgcggaattc gagtcgtgta acggtatcac tgtaagtctc ggtgataact cacttctaaa 1260 agccgaaggc cgcggtacga tttttataaa gaaattagtt aatggcgaat ggcaagacgg 1320 taaattagtg aatgttctat atgttccgct gctgcgaaaa aatttgtttt cgatcggcgc 1380 ctgtaccgcg aaaggacatg tatttgtcat ccgagacaac ggtatcgaca tctattcgaa 1440 gcaaaacaag ctgatcgcgc acggtatcag gcagggcaat aatttattta gattggaatt 1500 caccgcgata gtgaaacgcg aagcgaacgc cgtggtaagg agcactctga agctctggca 1560 cgaccgtcta ggccacgtca ataataaata tctacaggca gcaattgaga ataagctaat 1620 cgacggtgta aaggtttccg aaatctccga tttcttttgc gagaactgtc agtacggtaa 1680 acaacatcgc cttgcattca aaagtaaaag cccaagaaaa atgttaaagc ctggtgaatt 1740 cattcacact gatctgtgtg gaccgatgca aactccatcg atcggaggcg caaagtattt 1800 cattttgttt aaggacgatt gtaccggttt tcgttatgta cattttctac gacataaaga 1860 tgacgtattt gatgttttca aagaatacga ggcgatgatc tttaacaaat ttagtcaaag 1920 aatccaaacg atacgttctg acaatggcac agagtattcg agtcatcgat tccgcgatca 1980 tttaaagtca cgcggcattc aattggaatg ctccgcaccg tacaccccgc agcaaaatgg 2040 ctcttcggaa cgcgagatgc gtactatcgt agaaagtgcc cgaacaatgt tgttagcgaa 2100 aaagttacct acccggctct gggcagaagc cgttaatacg gctgtttacg tgttaaatag 2160 aaccccatca tctcgatcaa aattgacaac accatatgaa gtatggacgg gtaaaaaacc 2220 gaatttgtcg catgcaaaaa tcttcgggtg tgatgcatac gttcatgtcc cagaacagca 2280 gagaaagaaa tgggattcga agtctaaaaa acttgtcttt gtcggttacc aaggcgattc 2340 gagaaattat cgtctgatag acccagagac ggataggata acaatctcgc gggatgttag 2400 ttttaacgaa tgcgccgaac caatattcga gtgtcctgct gcaagtttac ccttaccaaa 2460 tgagagagct gtattagaca ataacgaaca acccgaacaa atagacgact gtcctcgacg 2520 cgaagcggat caatgtgaaa acccgcatca tgaagtcgaa ccggatgtta ttaattcgcg 2580 accaaagcta aggaatcgca acctgttacg cgccttgccg atacgaagct tgttttgttt 2640 catacactga atcgaataca tatgacgaag ccgtgaccgg aaaagactca gaaaagtgga 2700 cacaggctat ccgcgaagag ttagcagccc atgagcggaa tcaaacatgg gagatagtgc 2760 ctttgccaca cgaccgtaaa gccatagggc acaagtgggt tttcaaagta aaaaccacgt 2820 cgtcgggtga gattacgcgt tataaggcgc gtttatgcgc ccaaggtttc tcgcagaaag 2880 ccggagtaga ttacgacgag attttctccc ctgttgtgcg atacgattct gttcgaacgc 2940 ttctgtcaat cgctgcggcg aacgatttgg aaatttatca gttcgatgtt aaaaccgcat 3000 atttgaataa taatttaaac gaagaaattt acatgcgtgt tcccaaaggt ctaaatgttc 3060 cgaatgataa ctcaatctgt aagcttaata aagcattata tggcctaaaa caagccggtc 3120 gatgctggaa caaaaaattt gatgcatttt ttaaaaaatt taagttcgct cggggcagcg 3180 ctgacagatg cgtctatttc ggtttgataa ataatgacaa agtgtacctc gcattatatg 3240 tcgacgatgg cctattaatg gcctcgaaaa tcgagaccat aaatttaatc ttaaatgcat 3300 taaaagataa ttttgacgtc actgctagtg aggctgactg tttcgtggga atgcaaatag 3360 agagagatcg cgttaacaaa aagatttgca tccaccaaag taaatatata gattccatct 3420 tacacagatt cgcgatgtgt gacgcgcacg tcataagcgt gcctgcagat ccccacgtta 3480 ccctcgaaaa gtgtctcagt aacgacgagt tgcacgacat tccgtatcgt gaggccgtcg 3540 gctcactttt gttcgtgtca ctggtctcaa gacccgacat aacgtacgct gtcggacttg 3600 taagtagata tttggaaaga catagtaatc ctcactggca ggccgtcaaa cgaatatttc 3660 gatacctaaa aggcactaaa aacctgggca taatatacac aaacagtggg agcaaactta 3720 atttagtagg cttctcagat tccgactatg ccggtaataa ggacactagg cgctccacaa 3780 ctggctacct cttcgaatta gcaaacgggc ctatcacgtg gtgctcaaag cgccaaagta 3840 ccgttagtct tagtaccacc gaggcggaac tcattgcggc gagcgaagct gcgcgagagg 3900 caatttggct gcgcaagcta ctgaatgatg tcggacatcc gtgtgaaacg ccttctcttt 3960 tgtacgtaga taatcagagt gcgatcagat tgacaaaaaa cccagagttc catcgacgca 4020 cgaaacacat tgaggtccga catcacttta taagagaaag atacgagagc ggagagatta 4080 acatacgtcg actacatacg ttcaaaagat caaaaaacgg acctgttaac caaaccgata 4140 gcgcatgatc atttccaaga tttacggcgg aaaattaatg taatcgatgt atcgaaaacg 4200 ctcaatcagt gggag 4215 // ID BEL-184_AA-LTR repbase; DNA; INV; 420 BP. XX AC supercont1.151; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-184_AA_; KW BEL-184_AA-I; BEL-184_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-420 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.151; Positions 1273595 1274014. XX SQ Sequence 420 BP; 142 A; 87 C; 78 G; 113 T; 0 other; tgtacgagac gcaaacaact agtcccgagt gagatgacca atttcagcgt gcatcaaaaa 60 tgtaggtaaa tttttctgtt cctaattaga cgactcccga ggaataaaga cgaaacatta 120 gatagagata gatatagacg ttttaccttt gcacaaggta tataagctag ggtaattagg 180 aatcaagatc tccttttttg acctctcttt gacttaatac cactcgaccc aacagaagtg 240 taacgcaaat cagttaattc aagttattcg ttccgagaga ataaagttca attgtcgttc 300 cgagaataaa ccggtgttaa agtgtttcga tctgagtgaa gacgcaaatt accccttcaa 360 ctatccggga agcagtgacc tattccaaaa actacagtcc acttaacaac gttccgaaca 420 // ID DNA8-25_AP repbase; DNA; INV; 674 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-25_AP. XX NM DNA8-25_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-674 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1767-1767 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 674 BP; 220 A; 102 C; 102 G; 250 T; 0 other; gccggttgtc gtttgcgcat cactattgtt atttttgacc ttttttgtaa gacagttgcg 60 catcgttcga tttttgggtt atacatcgtt tgcgcatcga tcgtttaaaa ggtcgtacat 120 catttgcgca tcaatattat ttaacaatag aactgaaaaa aaggtgtata cccaaaagcg 180 tttagctttt ctcaaactat atgtatattt gtattataat tataactatg actgtaatat 240 tcctaattga actgaacaaa atttagttga attaattttt actcaaacta cctatatgtt 300 atattatata tattttttac tataacaatg cccatgcaat gtacctaatt aaactgaata 360 ttaaattttt aactacaaaa taatttgcaa attgttgagc tttttataaa tttggtcaac 420 atttgaactt taaatgctta ttgaacaaaa acttgtgcct atgaatatta aattattcat 480 tatctatata atattatatg tgtgtacaat tatttttata cccttctatt gtttagaatt 540 ttgatgcgca tatgatgtac gataatttgg tgcaggtaaa cagacgatgc gcaaacgata 600 atctgccgcc gatgcgcaaa cgatgtacga tattttggca caggtaaaca gacgatgcgc 660 aaacgatata cggc 674 // ID Helitron-6_NVi repbase; DNA; INV; 3746 BP. XX AC . XX DT 22-APR-2009 (Rel. 14.04, Created) DT 22-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Helitron DNA transposon - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-6_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3746 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 767-767 (2009). XX DR [1] (Consensus) XX CC The consensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS join(2..1672,1647..2159,2113..2325,2277..3386) FT /product="Helitron-6_NVi_1p" FT /translation="RNCKPKNIIMSITRNDELDKNRYNEVAMIFVGEDGHP FT PIERDIVIYSTHDKPISIPLISKHTDPMTYPLIYPNGGYKWMPNMKCKFGK FT NNISNLQFYNYRLSVRNEFNPYLNLGKVSQQAIIDAFVKVESSRLYFIRKN FT QNILRSDCYKGVMDYLHKKGDAENIKIGKMFILPSNFTGSPRHLKQNYIDS FT MSLVNRYGKPDLFLTMTCNPNWKEITENLQKYENSIDRPDIVAKVFQQKVK FT EFKELVIKDGILEKSIAYTYVIEFQKRGLPHMHMLLFLDENYKLDTPEKVD FT NLINAEIPNKNFYPRLYSIVKQFMIHGPCGVINKNSPSMDKNTEKCTKNYN FT NETSFQATGYPIYKRRNDGIKINYKNKNSQRTADNRFVVPYNPFLLLKFNC FT HINVEVCSTVQCIKYLFKYCYKGHDCAFVEIRNADINFEEDNNNKNQTLST FT EEDEYKYDEINQYLNTRYLSPPEAMYRLLEYCLHEISHVIKRLAIHDKDQQ FT YVYFEKGKEEKIKNKIIETTLTAFFKLNQTDLDARNYLYSDIPIHYTFDEK FT LKTWNKRKKIKHGTKGKKYNKPILSRMYLVNPKDRERYFIRLLLLHVKGPT FT SFEDLRTVNGVLYPTFYEAAIAKQLVNEDEEWDKCLQEAIQIQFPTVLCEL FT FAFICIFHNPINARELYEKYKSNFYHPKYEECAGENIALNAIEKVMQANGL FT TLSDFNLPNITMNIEEFDNATFLYKEKNTLKNLIMQHFCIKKKTNMLNELQ FT KKIASLNINQKNIYDAVLESLNYKQDKCIFIDGPGARREWKKLFLKKVRTR FT XQERVEKVILKESLNYKQDKCIFIDGPGGSGKSYLLNVLIDYLKLHNIPFL FT CVAWTGIAANLLSNGRTVHTTFKLPLNINDCTTCNVKPNSVNGRKLKDVQI FT IIWDEISMTSKFAFEAVDRLFQDLCDVTVIFGGKSMLVSGDFRQTLPVVRH FT GSRIEVIENSVKYSYLWTNFRRMTLNQNLRVSNNDDNFKQWLLNVGDGKRL FT NLYEEENELFEIPVNMISKGDIITEIFGNNIKMNDETLKDKVILAPKNNDV FT IDINNKILDIMEGHFVEYLSIDTAEDDNGENLDVMLPTEFLNSLTPNGLPP FT HKLKLKIGAIVILLRNLNINEGLCNGTRLIVKQFLKYSIQAEILNGKNIGK FT LVLIPRITI*" XX SQ Sequence 3746 BP; 1510 A; 456 C; 598 G; 1179 T; 3 other; aaggaattgc aaaccaaaaa atattataat gtcaattaca agaaatgatg aattagataa 60 aaatagatat aatgaagttg caatgatatt tgttggtgaa gatggccatc caccaatcga 120 acgtgatatt gttatatatt caacgcatga taaaccgatt agcatacctc ttataagtaa 180 acacactgat ccaatgacat atccacttat atatccaaat ggtggttata aatggatgcc 240 taatatgaag tgtaaatttg gtaaaaataa tatttctaat ttgcagtttt ataattatag 300 acttagtgta agaaatgaat ttaatccgta tcttaattta ggcaaggtat ctcaacaggc 360 aataatagat gcgtttgtta aagtagaaag ctccagatta tattttataa gaaaaaatca 420 aaatatttta cgaagcgatt gctataaagg tgtgatggat tatttgcaca aaaaaggaga 480 tgctgaaaat ataaaaatcg ggaaaatgtt cattttacca tcaaatttca ctggaagtcc 540 taggcattta aaacaaaatt atattgattc tatgtcactg gttaatcgtt atggaaaacc 600 tgatctattt ttgacgatga catgtaatcc gaattggaaa gaaattacag aaaacttaca 660 aaaatatgaa aattcaattg atcgaccaga tattgtggct aaagtatttc aacaaaaagt 720 gaaagaattt aaagaattag ttataaaaga tggaatttta gaaaaaagta ttgcatatac 780 ttacgttatc gaatttcaaa aacgaggtct acctcatatg catatgttac tatttttaga 840 tgaaaattat aaacttgata ctcctgaaaa agtagataat ttaataaatg cagaaattcc 900 aaacaaaaat ttttatccgc gcttgtatag cattgtaaaa cagtttatga ttcatgggcc 960 ttgtggagta ataaataaaa attctcctag tatggataaa aatacagaaa aatgtacaaa 1020 aaattataat aatgagacaa gtttccaagc aacaggatat ccaatataca aacgaagaaa 1080 tgatggtatt aaaattaatt ataaaaacaa aaattctcaa agaacagctg ataatcgatt 1140 tgttgtacct tataatcctt ttttattatt aaagtttaac tgtcatatta atgtagaagt 1200 atgttcaact gtccaatgta ttaaatactt atttaaatac tgttataaag gtcatgattg 1260 tgcttttgtt gaaattcgta atgcagatat taattttgaa gaagataata acaataagaa 1320 tcaaacttta agtacggaag aagacgaata caaatatgat gaaatcaatc aatatcttaa 1380 tacaagatat ttaagtcctc ctgaagctat gtacagatta ttagaatatt gtttacacga 1440 aatttcgcat gttataaaaa gattagctat tcacgataaa gatcaacaat atgtatattt 1500 cgaaaaaggg aaggaagaaa aaattaagaa taaaataata gaaacgacat taaccgcatt 1560 ttttaaatta aatcaaactg atcttgatgc aagaaattat ttgtattctg atataccaat 1620 acattacaca tttgacgaaa aactaaaaac atggaacaaa aggaaaaaaa tataataaac 1680 caatattaag cagaatgtat ttagtaaatc cgaaagaccg agaacgatac tttatcagat 1740 tattactttt acatgttaaa gggccaacct catttgaaga tttacgtact gttaacggag 1800 ttttatatcc tacattttat gaagctgcta tagcaaaaca gcttgtcaat gaagatgaag 1860 agtgggataa atgtttacaa gaagcaatac aaatacagtt tccaacagta ctatgtgaat 1920 tgtttgcttt tatctgtatt tttcataacc ctattaatgc acgagaatta tatgagaaat 1980 ataagagtaa tttttatcat cctaaatatg aagaatgtgc aggtgaaaat atagcactaa 2040 atgcaattga aaaagtaatg caagcaaacg gtttgacgtt aagtgatttt aatttgccta 2100 atataacaat gaacattgaa gaatttgata atgcaacatt tttgtataaa gaaaaaaact 2160 aatatgctaa atgaattaca aaagaaaatt gcatccctta acattaatca aaaaaatata 2220 tatgatgcag ttttagaaag tttaaattat aaacaggata agtgtatttt tattgacgga 2280 ccaggrgcca ggagagagtg gaaaaagtta ttcttaaaga aagtttaaat tataaacagg 2340 ataagtgtat ttttattgac ggaccaggcg ggagtggaaa aagttattta ctyaatgttt 2400 taatagatta cttaaaatta cataatatac catttttatg tgttgcttgg acaggtattg 2460 cagcgaactt acttagtaat ggaagaacag tacatacaac atttaaatta ccactgaata 2520 ttaatgactg tacaacatgt aatgttaaac ctaattctgt aaatggaaga aaacttaaag 2580 atgtacaaat tataatatgg gatgaaatat caatgacatc gaaatttgct tttgaagcag 2640 ttgatcgtct ttttcaagat ttatgtgacg taacagtaat atttggtggt aaatctatgc 2700 ttgtttcagg agattttaga caaacattac cggtagttcg acacggtagt cgaattgaag 2760 taatcgaaaa ttcagtgaag tatagctatt tatggacgaa ttttagacgc atgacattga 2820 atcaaaacct tcgtgtatcc aacaatgatg acaattttaa acaatggcta ttaaatgttg 2880 gtgatggaaa acgtttaaat ttatatgaag aagaaaatga attatttgaa atacctgtta 2940 acatgatatc aaaaggagac atcattacag aaatctttgg aaataacata aaaatgaatg 3000 acgaaacatt aaaggataaa gtaattcttg ctcctaaaaa caatgatgta attgatatta 3060 ataataaaat actggatatt atggaaggac atttcgttga atatttaagt atagatacag 3120 cagaagatga taatggagaa aacttagatg ttatgttacc tacagaattt ctgaattctt 3180 taactcctaa tggacttcct ccacataaat taaaattgaa aattggagct attgtaattt 3240 tattaagaaa tttaaatatt aatgaaggtc tttgtaatgg aacacgatta atagttaaac 3300 aatttttraa gtattcgatc caagcagaaa ttttgaatgg aaagaatatt ggaaaattag 3360 ttttgatacc aagaatcacc atctaaagaa gaacttccat ttaatatgcg tagaaaacaa 3420 tttcctatta gattaggttt cgcattgaca ataaataaat cacaaggtca gtcatataat 3480 aaagttggcg tctatctgcc atcaccagtt tttagtcatg gtcagttata tgtggctcta 3540 tctagagtaa aagacagaca atagtaaaca tacaaaaatt tacactaaga atattgtata 3600 caaggaaata ttataaacag tacaatatta taacagctgc actttttttt taaatgcagc 3660 gctggaaata tgagaatcgg ctgcagcatt ttttaaatgc agcgctggaa atatgagaat 3720 cggctgcagc attttttaaa tgcagc 3746 // ID hAT-30_SM repbase; DNA; INV; 3369 BP. XX AC . XX DT 14-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-30_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3369 RA Bao W. and Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 79-79 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 664..3117 FT /product="hAT-30_SM_1p" FT /translation="MSKRKQTTLLNFFAKKQSRSSESEKIAELEPEKANNE FT FIAVADDQIMHVDHQAQPSNSQNLENRNDNDNDVHNVLSRHSNLNILKDEE FT KIYYLNNFWKPPPNFCFPGTKIQNENFKRKFNYSWLLNYPWLVYSKSIDAV FT YCKYCCLFATNNPADRATQALGQLVKDPFKTWRRATRTFNKHQITNYHKYS FT VLRADDLQTVVDGKRQSVIDLIDTCRVKQAKENRLKLTPIIKTLLFCGRNG FT FPLRGHRDTDTFNMNIDEIDKSTIRQEGNFRQLIRFRLESGDQALKEHLET FT SSKNANYLSWKIQNEIINACNQIILKRIVERANQSECFSVLADETTDMSTQ FT EQFSICIRFIDKDKKMNESFLQFVPVVSTSSKSLANTLVQFLSSIGLNLSY FT LKGQGYDGAAAMSGRFNGVQAKIMDLYPSALYVHCASHSLNLALSDVCQIQ FT DIRNCMGVVEKCYSFMNTPKRDAVLQKKISDICPDAKRTKLKKLCPTRWVE FT RHDSIMIMVELLNPLILALEEIETWKDKESSSGAHILLCAIKSTNFFLSIL FT TVEKIFSYTLPLSKILQTESLDLVTAIKICEDVVGQFEKFRKSAVTVFNEI FT YKRAEVLLREMVDTNYAISIPRINKRQIHRCNISSQSPEEYYRISLFIPFL FT DATVTQLKERFLKHKAILKAFNILIPSNTNTSHHEYSTNLEILLQKYNTDL FT NCTINMLIAEYELWQTKFSLNPIQKNASVLQYLCECSEEIFPNIFKLLKIL FT ATLPVTTSTVERSFSTLKYLKNYLRSTTAEDRLNGLALLYIYRDMPITEEE FT IFNILQEKKRKLDIVL" XX SQ Sequence 3369 BP; 1176 A; 609 C; 607 G; 977 T; 0 other; agtggcgcaa tcagggtgta cgctggtacg ctaccgcgca ccccttcgac caacaaaaaa 60 acaaaataaa aaatccgaaa aagctcgatc ataaaatctg ttagaaactg aagaagtgag 120 aagtggtatg atgcaaaccg gcaacccgag cgtgtgcttt gtgctatgtg gccttgactt 180 ttggcaggca cagacacctc aggcgcaggt ctgagcaggc cagcaggctc agcagattga 240 gattgagaga ctgagcacag gggcgacaag caacccgcga tcggcaacgg cacacggcac 300 ctaggaccct agcagtcggc agttcgccat gtggatgcgc gttgttcgtc cgtttgtata 360 ataacacata agacgcatta aggtataagg tacatccaca atccacatct acatacacaa 420 acagctaatc taaatacaga gtgagactgc tactactact aatactacta cactactgca 480 tatataacaa tcaacaattc attttcattc attacgataa ggataacatc tttacacaaa 540 ttgcaaaata aattatgaat taaaaagtaa gttattatca attttctagt aaattcataa 600 tttattatat tctcattctt tattagttaa ttcattcaac attcatgcat ataaagcatt 660 aagatgagta aaagaaagca aacaacttta ttaaattttt ttgcaaagaa acaatcaaga 720 tcaagtgaaa gcgaaaagat tgcagaatta gaaccggaaa aggcaaataa tgaatttatt 780 gccgttgctg atgatcagat tatgcatgtt gaccaccagg ctcagccttc taatagtcaa 840 aatttggaaa atagaaatga taatgataat gatgtgcaca acgttctgtc cagacattcg 900 aatttgaata ttctcaagga cgaggagaaa atttactatc taaataattt ttggaaaccg 960 ccaccaaatt tttgtttccc tggcaccaaa atacagaacg aaaattttaa gaggaaattc 1020 aattactctt ggcttttaaa ttatccatgg ctagtttatt ccaaatcaat tgacgcggtc 1080 tattgcaagt attgctgctt gtttgctaca aacaaccctg cagatcgtgc aactcaagct 1140 cttggtcagt tggtgaaaga ccctttcaag acttggcgta gggcaacaag aacatttaat 1200 aaacaccaaa ttaccaatta tcataaatat tcagttttgc gtgcagacga tttgcaaacg 1260 gttgttgacg gaaagcggca atcggttatt gacctcattg atacttgcag ggtcaaacag 1320 gctaaggaaa atcgtttaaa gctaacgcca ataataaaaa cccttttgtt ttgtggtaga 1380 aatggttttc cacttcgggg acatcgagat acggacacct ttaatatgaa tattgatgaa 1440 attgacaaaa gtacaatacg ccaagaagga aactttcgtc aattaataag atttcgtctt 1500 gaatcaggcg atcaggctct taaagaacat ttagaaacct cttcaaaaaa tgccaattat 1560 ttgtcttgga agattcagaa tgaaatcata aatgcctgca atcaaataat actaaaacgt 1620 attgtcgaaa gagcaaatca aagtgaatgt ttttctgtgc ttgccgatga aaccacagat 1680 atgtcgacgc aagagcaatt ttctatttgc atacggttca tcgacaaaga caaaaaaatg 1740 aacgagtcct ttttgcagtt tgtgcctgta gtaagcacat caagtaaaag cctggcaaat 1800 acattggtgc agtttttgag tagcattggg ttaaatctat cttatttaaa aggacaaggc 1860 tacgatggcg cagcagcaat gtctggcaga tttaacggag ttcaagctaa aataatggat 1920 ttataccctt ctgcattata cgttcattgt gcttcacact cgttaaattt agctttgtct 1980 gatgtatgtc aaatacagga catcagaaat tgtatgggtg ttgtggaaaa atgttacagt 2040 ttcatgaata caccgaaaag agatgcagtt cttcaaaaga aaatttctga tatatgccca 2100 gatgcaaaac gaactaaact gaaaaaatta tgtccaacac gttgggttga aagacatgat 2160 tcaatcatga taatggttga attacttaat ccacttattc tcgccctaga agaaattgaa 2220 acttggaaag acaaagaatc gtcttcgggt gcacatattt tactctgtgc tatcaaaagt 2280 accaattttt tcttatcaat tttgactgtg gaaaaaatat tttcttacac tctaccgcta 2340 agcaaaatat tgcaaactga gtcacttgac cttgtaacag ccattaaaat ttgcgaagat 2400 gtagttggac agtttgagaa atttcgtaaa tcagcagtta ctgttttcaa tgaaatatac 2460 aaaagagctg aagtattatt gagagaaatg gtagatacca actatgctat ttcaattccg 2520 cgcataaata aacgacaaat ccacagatgc aatattagtt ctcaatcccc ggaagaatat 2580 tatcgcattt cattatttat cccatttttg gacgcaacag taacccaatt aaaggagagg 2640 tttctgaagc acaaggcaat attgaaagca tttaatattt taattccatc aaatacaaat 2700 actagtcacc atgagtatag cacaaattta gagatattat tgcaaaaata taacaccgat 2760 ttaaattgca caataaatat gttaatagcc gaatacgaac tgtggcaaac aaagttcagt 2820 ttaaatccaa tacaaaaaaa tgcaagtgtt ttgcaatacc tctgcgaatg cagtgaagaa 2880 atatttccca acattttcaa attgcttaaa atcttggcca cacttcccgt gacaacaagt 2940 acggttgaaa ggtcattttc aactttaaaa tatttaaaga actatttgag aagcaccacg 3000 gcagaagacc ggctaaatgg tttggctttg ctctatatat atcgagacat gcctataacg 3060 gaagaagaaa tatttaatat attgcaggaa aaaaagagga aattagatat tgttctgtaa 3120 gggtcgattt ataagtaata ttactccttt gttgtttttg cattttgcac aaatttatcg 3180 atttgttttg taagttttta tgctctgctt tataatttat caaattttta taataaatca 3240 aatattaaat attaaaaaag ctctggccag tgtagcatat gtgggtcctt tagtgcaagg 3300 cccgcacttg accctccttg ggccgatact gacagcgtac cccattcaaa attcctggtt 3360 gcgccactg 3369 // ID Gypsy-1_PPc-LTR repbase; DNA; INV; 251 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_PPc_; KW Gypsy-1_PPc-I; Gypsy-1_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-251 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 995-995 (2010). XX DR Genome; chrUn; Positions 152252936 152253186. XX SQ Sequence 251 BP; 73 A; 62 C; 53 G; 63 T; 0 other; tgtgacgctc tattgccgcg gaacagccga tctagagaga gaagagactc gagagagaga 60 tagtggccac ccccgatcaa caaccaacca gtcttcatgt aatcttcatc tagtctattc 120 atctagtctt catgtagttc tcaaaccctg aatagctgat aaaagatagt ccataaatca 180 tgtagctgag tacactatag tcccgttagc aagaggctgg acctacgggg tcctccagca 240 ttcttgttac a 251 // ID Gypsy-39_DPu-LTR repbase; DNA; INV; 405 BP. XX AC ACJG01004895; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_DPu_; KW Gypsy-39_DPu-I; Gypsy-39_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-405 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004895; Positions 6125 6529. XX SQ Sequence 405 BP; 61 A; 91 C; 104 G; 149 T; 0 other; tgtcacgttc agtccctttt cgtgtggtct atcatgtttc ctttcttcca ttgtgtctgc 60 caggtggccg ctagatggtg tcggttgctt ggctgcctag ttaaaccggg ttcgtctttc 120 tttttcttta gagtttcgcc tgctgttgtc cggtccggtc gcaagttgac ttgctctgct 180 cttttctgtg tgtgtgcgtt ttgctttatt gaggtgtgtg cttctcgtgc tagttttaga 240 cccactaggg gcaccatcgg agtagttaat ctagaatagt gtagaccgat ggtgtgattc 300 actgtcccgt actgtgccag tgtcgacggc cagcgtttca gtgatttaca ttgtggcgtc 360 acattcagca agtgtattaa ctgccatttg aatttacccg tgaca 405 // ID Gypsy-56_CQ-LTR repbase; DNA; INV; 235 BP. XX AC AAWU01017208; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-56_CQ_; KW Gypsy-56_CQ-I; Gypsy-56_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-235 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 492-492 (2011). XX DR GenBank; AAWU01017208; Positions 19623 19857. XX SQ Sequence 235 BP; 56 A; 53 C; 55 G; 71 T; 0 other; tgttggagga ccaccctgac gatgccacga atcgtctgcg atggctcact gcgtgacagc 60 tgtcatgctc gtcgatgctg acgtggatcg agagagagga gcaagtcatc gtcgatcact 120 tcctttgagg ccactgcccg aacaggccac gtttttctta acattttttg gaagcagttt 180 taataaagtt ttaatagttt tgttgtgatt aattcacctc gtttatatca caaca 235 // ID BEL-5_DWil-I repbase; DNA; INV; 5821 BP. XX AC scaffold_181039; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_DWil_; KW BEL-5_DWil-LTR; BEL-5_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5821 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181039; Positions 415654 409834. XX CC 'GGTAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 377..1423 FT /product="BEL-5_DWil-I_2p" FT /translation="MATKDLSSGIYHCKKCMQEDNDRMVQCNRCCSWYHFD FT CVGVIDEKSENSWNCDCVNVSSEVITTILSNALQNLGNTTRAGEPQATSSP FT AKPTGPLPEPDRDFRLCSPKIFPDTARTAVPIFTSAKQKAMITHSAVRDTW FT HYDHEAPQLPFLTSNNTRATTREQPHLSATPMAPWNFTNTQNRAASYYPES FT KVRMDIDLNTAIPSAAQLAARQAIPKELPTFDGNPQNWPLFYSSFQTSTEI FT AGYTNAENLMRLQASLKGKARELVQSKLLLPAMVPEIIETLRMCFGRPEHI FT LHNLLEKTRQLPAIKDKLEPLIEYALCVRNIYSTMEACGLTGYMQNPMPPK FT TPYKSY" FT CDS 1363..5820 FT /product="BEL-5_DWil-I_1p" FT /translation="MWTHRLHAKPNAPQNPVQELLEKLPTQLKLQWAMFPK FT DTKVPSMESFSKWIYDIAEAASQVTSPLYHKKSGSLNHHTEAPKEQVKEIT FT CVVCNEPGHKIHSCPSFKATPHQRKMSFLKNHKLCQQCLNRHHQKCQREKV FT CGIRSCRSKHHPLIHEMQPMPEAAKSSSPNRPAMVTNVHTPEGKHTQRSRE FT SGSTYFRVIPIQIINKTKVTNTFAFLDEGSALTLIDNKVFKQLGLKGVPDP FT LCLKWTNDTTREENASERVTLTVANPHNGNQFYLEEVHSVDTLDLPVQSID FT IRALAKEFPHLAGLPIASYTNARPSILIGTNNWNLAVPLKIKEGAWHQPIA FT SKTRLGWALQSSTPSRATRANVSVHMCGCRDADAKLHEIIKQAFSLDATTA FT KPLSSPEETQALTRLESTTVLKNGRYEVGLMWKDPEVLLPDSYANALKRLK FT CLQKQFSRQPQLKEQITKQIENLVEKGYARMQTPEEIAAPNRRTWYLPTFI FT TKNPNKPEKVRLVWDAAAQSGGKALNDYIWSGPDLLNSLFDLLLSFRVGRV FT AICGDIAEMFHQIRVRPADTHAQRFLWFDSKSERQEPNVYVMEALTFGINC FT APCIAHFIRDKNADRFCQQYPQAAQAMKDYHYVDDFIYSGDNYVEVGNIAT FT QVRDIHAAGGFHIRNWSSNSKEVLRILKSDSLLPEAVEITATEKVLGLYWM FT PNSDVFTFICKFARLKRNVLDSDKTPTKRELLQVLMSMYDPLGFISCFTIE FT LKILLQEVWRSGIGWDTGLPDALLPKWIRWKRILTTIAGLTIPRCYFNSSD FT QIHDVQLHTFVDASELAYAAVCYLRIRQGERTYLSFVASKAKVAPLSPLSI FT PRMELQAAVIGAKLSNRIQRNPRLSINSSCYWSDSKTVLKWLRMDPRKFQQ FT FVMHRVGEILEFTNVSQWNWVPTNLNPADLATKTNNANKYKTWLHGPDYLL FT QDQHEWPKCDDLGPPNNAEVRHNILFIDNSPMELKLNAEYFSDWRRLYRAL FT ARFILYIEKLKAKQKKASPPTEVSYEMIQQARTLLLRHAQSSEFNPEIRNL FT TRGKPLKTNSKLICLNIFLDENQLLRTRGRAENLNGQNQILLPYGHHITRL FT IVNWYHQKMHHTSHETCINRIRGMFYIPRLRVLYKGMRKSCQRCKNESAMP FT VPPQMAPLPIARLAAYQRPFTYVGIDYFGPFLVAVGRRSEKRWGVIFTCLT FT IRAVHIELACSLDTASCIMCIRNFINRRGTPREIYSDNGTNFKAAEKIICD FT KAQTINFNAVQPAFDDIKWKFNPPAAPHMGGAWERLVRSVKTVLYAICPAR FT KFTSEGLQSALWEVEFILNSRPLTFVSLDSKDDEAITPNHLLLGSASGYKP FT VFETTHTVQHMWQAVQEFADQFWRRWVREYVPDLARRGKWFTKRPPIAAGD FT VVVILDETLPRNRWPKGIVEQTILAKDNQVRRVIIRIANGTLQRPVAKVAV FT LDVGCFGAKQPPEESSFTGGG" XX SQ Sequence 5821 BP; 1832 A; 1501 C; 1243 G; 1245 T; 0 other; attctataaa ttaatggcag gaattaattc tataaattaa ttctataaat tatgccagga 60 attaattcta taaattaatt ctataaatta cgccataaag gcaatgaatt aattctaaaa 120 attaattcta cagattaatt ccataaatta atcttacagt ggtaaaccac taaatgaacc 180 ggaaaagtta attatttaac tcaactcgac gggtcaaccc acggttctac acgccaatca 240 tatagaggtt aacctcgatg attgcataac caatcactaa attgtgatac ataacctcaa 300 cggtcaaccg tttacgacct acataggtca cataactaat caacatcgct taaaagtata 360 aactcattgg taaaccatgg caacaaaaga cttatcaagt ggaatatatc attgcaagaa 420 atgcatgcag gaagacaatg accgcatggt gcaatgcaat cgatgctgtt cctggtatca 480 tttcgactgc gttggagtaa tagacgagaa atctgagaac agctggaact gcgactgcgt 540 caacgtgtca tcggaagtca tcacaaccat attgagcaac gccctgcaga acctcggaaa 600 taccacacga gcaggcgaac ctcaggctac ttcgtcacca gcaaaaccaa cggggccgct 660 accagaaccg gatcgggact ttagactatg ctcgccaaaa atattcccgg acacagctag 720 gacggctgtt cctattttta cttctgccaa acaaaaagcg atgatcacac actcagctgt 780 gcgtgacacg tggcattacg atcacgaggc accgcagctg ccgttcctaa cctcaaacaa 840 cacgcgtgct acgacacgcg aacaaccgca cctaagtgct acgccgatgg ctccctggaa 900 tttcacaaat actcaaaaca gggctgcatc gtactatcca gaaagtaaag tgcgaatgga 960 catagaccta aacacagcaa taccaagcgc ggcacaactt gcagcccgtc aagcgatacc 1020 gaaagagctt ccaacatttg atggcaatcc tcagaactgg cctctattct acagtagttt 1080 tcagacaagt acggaaatcg ctgggtatac gaatgcagaa aatcttatgc ggctgcaagc 1140 tagcctgaag ggcaaagcac gtgagttagt tcaatccaaa ctcctgttac cagcaatggt 1200 ccctgaaatc atagaaacat tacgtatgtg tttcggtcgg ccagagcata ttttgcacaa 1260 cttactggaa aaaacgcggc aactacccgc aataaaagat aagcttgaac cacttatcga 1320 atatgcactg tgtgtacgaa acatctactc aacgatggaa gcatgtggac tcacaggtta 1380 catgcaaaac ccaatgcccc ccaaaacccc gtacaagagc tactagagaa actccccaca 1440 caattgaagc tgcagtgggc aatgtttccc aaagacacta aggtaccatc aatggaatct 1500 tttagcaaat ggatctacga catagctgaa gcagcaagcc aggtaacatc gccgctttac 1560 cataagaaga gcggaagctt aaatcatcac acagaggcac ctaaagagca ggtcaaggaa 1620 attacgtgcg tcgtatgtaa cgaacctggg cacaaaatac acagctgccc atcgttcaaa 1680 gccacgccac atcaacgaaa gatgtctttt ctgaaaaatc acaagctctg ccaacaatgc 1740 ctaaaccgtc atcaccaaaa gtgccaacgt gaaaaggttt gcggcatacg aagctgccga 1800 agcaaacatc acccattgat ccatgagatg cagccgatgc cagaggctgc gaaatccagc 1860 agcccaaaca gaccagctat ggtcaccaac gtgcatacgc cggagggcaa gcacacccaa 1920 cgatccagag agtctggctc aacatatttc cgtgtcattc caatacaaat catcaataaa 1980 acgaaagtca ctaacacttt tgcatttcta gatgagggat cggcactaac gctcatcgac 2040 aacaaggtct ttaagcaact cggcttaaaa ggcgttccag atccactgtg cctaaaatgg 2100 accaacgata caacacgcga agaaaatgca tcggaacgtg tcacgcttac ggttgccaat 2160 ccacataatg gcaatcaatt ctacttggaa gaggttcaca gtgtagatac cctggatcta 2220 cctgtacagt ctatcgacat aagggctctc gccaaagaat tcccccacct agcgggacta 2280 ccgatagcat catacaccaa cgcacgccca agcatcctaa tcggcacaaa taactggaac 2340 ctcgccgtac cccttaaaat caaagagggg gcatggcacc aaccaatagc gtcaaagacg 2400 cgactgggat gggcactaca aagctccacg ccatcaaggg ccactagagc taatgtcagc 2460 gttcacatgt gtggatgccg agacgcagac gccaagctgc atgagatcat caaacaagcg 2520 ttctcattag acgctaccac agcaaaaccg ctatcatcac cagaagaaac tcaagctcta 2580 accagacttg aatctactac cgtcttgaag aacggcaggt acgaggtcgg tctaatgtgg 2640 aaagaccctg aagttttatt acctgatagc tacgcaaacg cactaaaacg gcttaaatgc 2700 ctacaaaagc aattttcccg acaaccacag ttgaaggagc aaatcaccaa gcaaatcgaa 2760 aacctcgtcg aaaagggcta tgctcgcatg cagacgccgg aggaaatagc tgcaccaaac 2820 agacggacat ggtatctacc aaccttcata actaaaaatc caaataaacc agagaaggtc 2880 cggcttgtat gggacgcagc cgcccaatcc ggcggcaagg ccctgaatga ctacatctgg 2940 agtggtccag atctcctaaa ctccctgttt gacctcttac tctcattccg agtgggacga 3000 gtggcgatct gtggcgatat cgccgagatg ttccaccaga tccgagtgag accagcagac 3060 acccatgcgc aaaggtttct gtggttcgac agcaaatccg agaggcaaga gcctaacgtc 3120 tatgttatgg aagcgcttac tttcggaatc aattgcgcac cctgcattgc acactttatt 3180 cgtgacaaaa atgcagatcg attctgccaa cagtaccctc aagcagcaca agccatgaag 3240 gactaccact acgtggatga ttttatatac agcggcgaca actacgtgga agtcggaaac 3300 atcgcaactc aagtcagaga catccatgca gcaggtggtt tccacatacg caattggtct 3360 tcgaactcaa aggaggtcct acgaatacta aagagcgatt cgctactccc cgaagctgtc 3420 gaaattaccg caacagaaaa ggtactcggc ttatattgga tgccgaactc cgacgtattc 3480 acattcatct gcaagttcgc caggctgaag cgcaacgttt tagacagcga caaaacacca 3540 accaaacgcg agctcttgca agtacttatg tcaatgtacg acccactcgg cttcatctca 3600 tgctttacaa tagagctgaa aatcctactg caagaagtct ggagaagcgg cattggttgg 3660 gatactggtt tgcccgacgc attgttaccc aaatggatac gatggaaacg gatccttaca 3720 acaatcgccg gattgaccat cccgagatgt tatttcaaca gcagcgatca aatccatgat 3780 gtccagttac acacgttcgt tgacgccagc gaattggcat acgcagcagt ctgctacctt 3840 cggatacgcc agggggaaag aacctacctc agcttcgtgg cctccaaggc gaaagtggca 3900 ccgttgagtc cactatctat accaaggatg gaactgcagg ccgcagttat cggagcaaag 3960 ctgagtaacc gaatccaacg caatccaaga ttgtcaatta actcgagttg ttattggtcc 4020 gactcaaaga ctgttctcaa atggcttcgt atggatccac gaaagtttca gcaattcgtc 4080 atgcaccgcg tgggcgaaat actggaattc acaaacgtta gccaatggaa ctgggttcca 4140 accaacctta accctgcaga tcttgctact aaaacaaaca acgccaataa atataaaact 4200 tggctacacg gtccagacta tttgctgcag gatcaacacg agtggccaaa atgcgatgat 4260 ttggggccac caaacaatgc tgaagtcagg cataacatac tcttcatcga caattcgccg 4320 atggagctaa aacttaacgc ggaatatttt tccgactggc gcaggctata tcgagcttta 4380 gcgcgcttta ttctctacat tgagaaactg aaagcgaaac aaaagaaggc atcgccacct 4440 acagaggtgt cttacgagat gattcagcag gcacgcacac tcctactgcg gcatgcgcag 4500 tcatctgaat tcaacccaga aattcgcaac ttaacgagag gtaaaccttt aaaaacaaat 4560 tctaaactca tatgtttgaa tatctttcta gatgaaaacc agctcctacg tacccgaggg 4620 cgagcagaaa acctaaacgg ccaaaaccaa atactgctcc catatgggca ccacattact 4680 cggctcatag tgaactggta tcatcagaaa atgcaccaca catcccatga gacgtgtatt 4740 aacaggattc gaggcatgtt ttacatacca cgtctacgag tcttatacaa aggcatgcgg 4800 aagtcttgcc aacgctgtaa aaatgagagc gccatgcctg ttccaccgca aatggctccc 4860 ttgcccatcg ccaggctggc tgcctaccaa cgtcccttca cttacgtcgg tatcgactac 4920 ttcgggccct tcttggttgc tgtaggacgg cgaagcgaaa aaagatgggg cgttattttc 4980 acctgcttaa ccatacgggc agtgcatata gaactggcgt gctctttgga cactgcatcg 5040 tgcataatgt gcatccgaaa cttcatcaat cggcgcggga ccccgagaga aatttacagc 5100 gacaatggca ctaacttcaa agctgctgaa aagattatct gtgacaaagc gcagacgatc 5160 aacttcaacg ccgtgcaacc ggcgttcgat gacatcaaat ggaaatttaa tccaccagcc 5220 gctcctcata tggggggagc atgggagcga cttgttcgtt ccgtcaagac tgtactctac 5280 gcaatctgcc cagcgagaaa attcaccagc gagggcctac aaagcgcact ctgggaagtg 5340 gaattcatcc taaattccag accgcttact tttgtctctc tggacagcaa agacgacgag 5400 gctatcaccc ctaaccatct actactagga tcagcaagcg gctataaacc agtcttcgaa 5460 accacacata ccgtacagca catgtggcaa gctgtacaag aattcgccga tcaattctgg 5520 cgacgatggg tacgcgaata cgttccagat ttagccaggc gaggaaaatg gtttacgaag 5580 agaccaccaa tagcagctgg agacgtcgtt gtaatattgg acgagactct gccccgaaat 5640 agatggccaa aaggtatagt cgagcaaaca atcctggcca aagacaacca ggtgcgtcga 5700 gtgatcatca gaattgccaa cgggactctt caacggcccg tggctaaagt agctgtatta 5760 gacgtcggct gttttggagc aaagcaaccc cctgaagaga gcagctttac tgggggggga 5820 a 5821 // ID DNA3-2_AP repbase; DNA; INV; 126 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-2_AP. XX NM DNA3-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-126 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1943-1943 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 126 BP; 35 A; 32 C; 26 G; 33 T; 0 other; gggccagtat taccatctcc cgttaaattt aaccgactcc taaccggccg aattcggccg 60 gtaataaaat ctgttatcag tcggttaagc gtaatcgaag cataccgatg attgtaatac 120 tggccc 126 // ID Gypsy-51_AA-LTR repbase; DNA; INV; 1160 BP. XX AC supercont1.286; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-51_AA_; KW Gypsy-51_AA-I; Gypsy-51_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1160 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.286; Positions 1349026 1347867. XX SQ Sequence 1160 BP; 274 A; 345 C; 285 G; 256 T; 0 other; tgtaaccagg aaatggttac agtttctaca gctcaacatg ctgagagaat atttgtccaa 60 atgcgatttc accaccttga tggattatgc aaagtggggg aatgttttag agccctatta 120 ctccgggagg tgctaccaac cagtgggtag ctgccacgaa atctcctaca ggtggctagg 180 tggctatggg tggagcaatg acagtgacaa gagaacggat gtggggagaa cggaacgaac 240 gaaagtcgaa ccgggtcgga catgagggtt tggatcagat tttttggcgc gatcgaggat 300 tttaatcgag accacagaac gagacacatc gacgagcata cgaactctga gcaagtgaag 360 tctgcggtgg ttaccattaa ccgcgaagaa cccaagtcgc cagataatcg attttcccag 420 ccgagcctcg atttgtcgag tggacattct ggttcaggtc cagtttagaa gccccttggg 480 ctctagtaga agaagccttg cctgccattc cagttcgtcg gagccacagc attcatctac 540 accgcgtggc ccgcattgtt cagtgccaag aacctacgtc gctcacggca ttacacccga 600 agcttcgccg ctaaaccgcc ttcgtcatct gcgaccgtca cctccatagg tcgaccgccg 660 gattcccgcc atcgctcatc ctgggagtct gttgaggcat accaccttcc cacgtcgttc 720 atccaccacc catctactca ggccggccgc ctcgccacgt cgtccaccct actcccaaca 780 attcaagact accgcctcgg tcagccgtcc atcaccatcc cgctgtgacg ctatcggggc 840 accctccgtc gcccacattc cacggcctcc gccgttggtg gtttcctcgc tgccttggag 900 agacagcgta tattctgtac cgtgagtcaa tacaaaactt taaaactaac cctaatttgt 960 atgctaacaa ggtccgaaaa cctccccact acctcctcta acgaccctgg gactcgagga 1020 ccccaaaaga ggccttttga gcccgccccg ctgactgccg ttggtcagac cagcagtctg 1080 gggttttgac cggttcccca tctgggcttc ggtgtccgat cctcgagggc taagagagcc 1140 gtaccagaaa aacggtaaca 1160 // ID PiggyBac-3_HM repbase; DNA; INV; 2883 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE PiggyBac-type family: consensus. XX KW piggyBac; DNA transposon; Transposable Element; PiggyBac-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2883 RA Bao W. and Jurka J.; RT "PiggyBac families from Hydra magnipapillata."; RL Repbase Reports 9(2), 452-452 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 224..2389 FT /product="PiggyBac-3_HM_1p" FT /translation="MASYKRTYSNMEEILKIVMDESSDDMDLGELSADQSE FT PETDWEYEEEDIEFVRAPNLPLSNDFQNKNCSSLPEFCQINHVYKIDIDEE FT EEEEIEQNSSEETYPNENKETLFSDQLFDSTRENGIAERKRVKVRGGIQKN FT SVNNKNRILQVTNRGKEKKGNNNRGCRGRPWGRRGQSNARVLKINQQGIDL FT ISTWERVKIDNTADPIIPFFELEGLRVRMTHENPSPLDFVELYLTPSIIEL FT IVIETNRYAQDFIDKNPNKASNEYVGNWEPVTVLEIKKYLALVLLMGIIYK FT PDIHMYWSTSEFFSTPIFSKIMSRTRFQLILKFLHFNNNNDPNYDVNDENR FT DRLHKVRPLIELIRDRCKNVFCPGQNLSVDESLVLFKGRLHFKQYIRTKRA FT RFGIKLYELTTSSGITLDFLVYCGKGMFYEDGNEDMPSTERIPAALMEPFL FT DKGHILFTDNYYTSPSLATFLLGRNTHLCGTVKSNRKKYCKEIAEFALEKG FT EAVFYRSTIDSRVIACKYRAVKNKSGNKQKVVYMLSTCHSAKMIETGKTYA FT NDSAIYKPSIVRSYNSHMGGVDRVDQQLHSLRTLRKSYKWYKKLAFRMISQ FT VILNAHKVFQHETGKSKITFLDFLHETISSLALINPAIPNNQILDDTLSRL FT TGRHFPSLKVASPDAKDKRPTKRCRVCYARGKLSAKCQPLKTTYVCRFCPS FT EPGLHPDTCFEAYHTQVNYLL*" XX SQ Sequence 2883 BP; 1017 A; 439 C; 510 G; 917 T; 0 other; ccctttcact ccctgagacc cgcacagcgg gtcgtaacta atatcgcgat tttccctgtg 60 atccgctgtg cgggtcattt caacttctac gatttaaaca cggttttata gcgttttttg 120 ttgtgttctt tacagatatg gaaaatagaa tagttattac atcataggct gaaaaccgtt 180 cgtcaaaatt tatcttagtt aaaaagatat agcattttta aaaatggcta gttacaaaag 240 aacttattca aatatggaag aaattttaaa aatagttatg gatgaaagca gcgatgatat 300 ggaccttggt gaattatcag ccgatcaaag cgaacctgaa acagattggg aatacgaaga 360 ggaagatata gagtttgtgc gagctcctaa tttaccttta tctaatgatt ttcagaataa 420 aaactgttct tctttaccag aattttgtca aataaatcat gtttataaaa tcgatatcga 480 tgaggaggag gaagaagaaa ttgaacaaaa tagtagtgag gaaacatacc ctaatgagaa 540 caaggagacc ctattctctg atcaattatt tgattccacc agagaaaatg gtatagcaga 600 acgaaaaaga gtaaaagttc gcggtggtat tcaaaaaaac agtgtcaata ataaaaatcg 660 aatacttcaa gttactaaca gaggtaaaga aaaaaaaggg aataacaatc gaggttgtag 720 aggaaggcca tggggtagaa gaggacaatc caatgctaga gttttaaaaa ttaatcaaca 780 gggaattgat ttaatatcaa catgggaacg tgttaaaatt gataatactg cagatccaat 840 tatacctttt tttgaactag agggacttag agtaagaatg acgcatgaaa atccttctcc 900 tttagatttt gttgaactat atctaacacc tagtattatt gaacttattg taattgaaac 960 aaatcgatat gctcaagatt ttatagataa gaacccgaat aaagcaagta acgagtatgt 1020 aggaaattgg gaaccggtga ctgttttaga aataaagaag tatcttgctc ttgttttgct 1080 tatgggaatt atttataaac ctgatattca tatgtattgg tctacatctg aattttttag 1140 cactccaata ttctcaaaaa ttatgtcacg cactcgtttt caactgattt tgaagttttt 1200 acacttcaac aataataatg atccaaatta tgatgtaaat gacgaaaacc gtgatcgatt 1260 acacaaagta cgaccattaa ttgaactaat tagagaccga tgtaaaaatg tattctgccc 1320 cggacaaaat cttagtgttg acgaatcact tgttcttttt aagggacggt tgcattttaa 1380 gcaatatatt cgtacaaaaa gagcacgatt tggcataaaa ctatatgaat tgaccacatc 1440 aagtggaata acactagatt ttttggtgta ttgtggtaaa ggtatgtttt atgaagacgg 1500 aaatgaggat atgcccagca ctgagagaat accagctgca cttatggaac cttttttaga 1560 taaaggacat atactgttta cagataacta ctacacaagc ccttcattag caacttttct 1620 tttaggaaga aatacacacc tttgtgggac agttaaatca aatagaaaaa agtactgtaa 1680 agaaattgcg gagtttgcgt tagaaaaagg ggaggctgtg ttctatcgat ccacaattga 1740 ttcacgtgtt attgcatgta aatatcgagc tgtcaaaaat aaatcaggaa ataaacagaa 1800 ggttgtatat atgttgtcaa cttgccattc agcaaaaatg attgaaactg gtaaaaccta 1860 tgcaaatgac tctgcaatat acaaaccatc aatagtgcga tcatataaca gtcacatggg 1920 aggggttgac agagtggatc aacagttaca tagtttacga acattgcgta aaagttataa 1980 gtggtacaaa aaattagcat ttagaatgat ttcacaggtg atactaaatg cacataaagt 2040 ttttcaacac gaaactggaa aatctaaaat aacattcctt gattttttgc atgaaactat 2100 ttcatcttta gcactaatca atccagctat accaaacaac caaatactag atgatacatt 2160 atctcgtctt actggcagac attttccttc gctcaaagta gcttcaccag atgcaaaaga 2220 taaaagacca accaaaagat gtagagtgtg ctatgcaaga ggaaaactat cagctaaatg 2280 tcaaccacta aaaactacat atgtttgtcg attttgccca tctgaaccag ggcttcaccc 2340 agacacttgt tttgaagctt accacactca agttaattat ttattgtagt ttgttttgtt 2400 tgtattggca gacttaatct ttcttttttt tttttcattc tacaaacata ataaaaaatt 2460 taacttaaaa ataacggttt tttcaaaatt cttatatttt tagaatctct cctttttcca 2520 gagtttttgt gttaaacctt ttattttttt gctatgaact agtaaaatga ttgctaaact 2580 gtttttactt acgtttcatt tgtttataac gtaatatttt ggtttatttt tggattttac 2640 tccaaaaaag caaaattttg attaatgtca aaggccgttt tttgagtcat gtgacgtctt 2700 ataagtgtca taaatatata ttttttccat attttgttag attaatgtat tagtaaaaaa 2760 ggggccaagt ttcacataga attgctcatt attttagtca ctataagctt ttaaagtgtt 2820 tgagtcagaa acgggaattt tgactctggg aatatagcgg aaaatatccc gggagagaaa 2880 ggg 2883 // ID BEL-5_SI-I repbase; DNA; INV; 5407 BP. XX AC AEAQ01019002; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_SI_; KW BEL-5_SI-LTR; BEL-5_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5407 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01019002; Positions 6337 931. XX CC Positions [4392-4973] - Integrase core CC 'ACGGG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2292..3206,3210..5063) FT /product="BEL-5_SI-I_1p" FT /translation="MDEYLQKGHMELVPLNTIEEKNACFIPHQPVLRPDNI FT TTKLRVVFDASARTDNNNSLNDLLMAGPNLQAELLHILLRFCIYNFVITGD FT IAMMFRQITVAKADRSLQLILWRKEEEGSLEAFSLSTVTYGTKCVPFLVMR FT CLKQLAEENGDQFPLAQRALVSDFYMDDVLTGSDKLEEAISLQGQLTDLLA FT KGQFPLNKWRSNDNRILKHLAQEGLTEKLLVIHKDAPLKTLGVLWNHREDV FT LQYNARETTSSRITKRSVLSEISQTYDPLGLLGPVMIVAKIMMQQLWTLNL FT QWDESLPQELHSKKNYQASWGQLRDLKIPRQSKDRGTTSRIIKHGFSDALE FT RAYGACLYAVTRTLDGKLQSQLLCAKSRVAPLKIITIAKLELCAALLLAKL FT YKTVQETLGNKIADVRFWSGSTIVLGWIRTCPSTLKTFVANRVSQIQILGS FT QYVWHHVPSAENPADILSKGTTIEELQTNRLWWHGPWWLQEENSWPEQPSS FT VRELPEKKTVVSFVTTIQASEVLPSVSTFKRLFRIVAYCYRFGARHNSKVE FT TGALTVAELTKAETAVVKVVQREAFAQEYRDLQKGQPVKDNSQLAALDPFL FT DSRDLICVGDRLQHSRLTEEAKHPIVLPSKHHVTRLIFKDEHVRLHHCGAE FT QLLASVRQKYWVLSGRKVARKITRSCVHCFRWRPSSLQVKMGNLPEARVAN FT YVRPFAITGVDYAGHIKIKESRRRGRVHSSKAYIALFICFHTKAVHLELVT FT DLTTESFLAALPRFTGRRGICSQLHSDNATNFVGAERELREIYDFLRGQNE FT VIQAELANQRIEWYFIPPTVPNFGGLWESNIKSMKKHFYVVTKGLTLTFEE FT CYTLLVGIEAVLNSRSLTPISSDVKDLSVLTPSHFLIGERLSQPVERDHLQ FT EPDNRLKIWQHLQKVRQDF" XX SQ Sequence 5407 BP; 1701 A; 1075 C; 1318 G; 1313 T; 0 other; tttttctggc gcccaacgtg gggcgcgagt ggaatcactg gaattcgtaa aagcattgga 60 ggattcgcaa gcagaatcgt agcaaccgct acataacctc gcatcgtaga attgcataac 120 ggcaagatgg cagagttggg acagagatga gtaggttgaa ggccacacac gctcaactga 180 aaggccaggt aactcgagtc aattaattcc tgacaggaaa tcaagaaata acttgtgagc 240 aagcacaagc gcgattggac aagctgcaag agctgtggca cgcgttcaat gagaatcaga 300 cgcagatcga aatcctcaag gcaggagagg atggagatgt ggaagcaata gcacaacaag 360 aactcgcaga acaagagatt ttcgagtggg catattacag agcagtggat gaggctcgcg 420 cgataatagc ggcgacccag gcggctgcgc aaccagcagc actgctattg caagacgcac 480 gacagggcgt acagtaagac attgaatcgc aaggtaatat taatattaaa ttacctacat 540 tgaagttgcc tatctttgca ggcgaatacg atcagtggat gttattcaag gacgcgtttc 600 aatccttgat tcatgataat cgcaagctgt cggatgtaca gaagtttcag taccttcgaa 660 gcacattgaa ggacgaggcg ttgcaagtca ttagtggact gaatacgtca acggaaaatt 720 accttgttgc atgggactta ctgaagagtc attacgagaa caaaaaacta atcataaata 780 gtcatttatc gaagttatta gagttccctg cggtaacaaa agataagcat gtatccctca 840 agcagtttat tatgcatctt cgcatgcatc taaaggcctt acaagtttta ggccagccaa 900 cggaccaatg ggatacgatc atcatattct tagcaagtta gattattcgt ctcaacgagc 960 ctgggaagaa gagattggac aacaagagca ggatcacatg cctacaattg acgaattcct 1020 caaatttctc aacgagaggt gtcgtacttt ggaaatgtta gatactaata gcaaccggcg 1080 tgagtcagtt ccaaaattta atgtaaacaa aaagatagac aaaagagtag cactggcaac 1140 aatgtcacat gcatgttcaa tgtgtaaaga atctcatagt ttgtttaatt gttcggagtt 1200 tctaaaatta tcggttcaaa atcggctcgc ggctgttaag ggaaaacagt tatgcataaa 1260 ttgctttaaa tctggtcatt acgcaagaga atgtagagca tcaaaatgtc gtaagtgctc 1320 aaagccccac aactcgttat tgcacttcga acacgaagat tcttcatcaa aagaacctcc 1380 gaataaacct gcggaaaaga aggaagcggt tgttatgcac tgtgtgcaaa ggggcaagca 1440 agattcaaaa acgcaaacaa gatcgaagcg ctcaaaggaa aacagctcag ggtgttttgg 1500 ctacagcgca agtgtatatt tgggacggac aaggtaaaag gcagacttgt cgcgcactgt 1560 tggatccagg atcacaatcc cacttcataa cggaagattt ggtaaggaga ttgcaattgc 1620 tttgcaaggc aacatcgttt tgtattacta gcctcgagcg caatacaacg aaaatagagc 1680 agaccgctca aatccagata gaatcgcaga atactgcgtt caaggccgca ttgaaatgtt 1740 tggtggtcca gaaaatagcg gagagaatac ctctgttcaa gattgacaag aaactcgtcg 1800 gcattccaga aaacctcaaa ttggtggatc cttcatttga tcaaccaagg ccggtagaca 1860 ttctaatagg ttgcggtctg ttctggtctt tattaagcgt tggtcaaatt acaaacggca 1920 gaaggcatcc tgtctggcaa aagactcagc tcggttgggt cttcagtgga gaatttgtcg 1980 gggagcaaac agcttcgaca ggttcagtac catgtttggt aacaaatcaa aacttaaatg 2040 aaagtcttga gagattttga actcaagagg aggtaccgga gagcagacag cttagtggtc 2100 cggaagctta ttgtgaagat tatttcaaaa gcactacgac aagagatatt acaggcagat 2160 ttgttgtgcg actaccaaaa aaagaaggag tcgtgctagg ggagtcaagg cagcaagcag 2220 ttagaagatt tcgtgcgtta gaacgtcgtt ttcgaagaca gccttccctt aagagagaat 2280 atacaaaatt catggacgaa tatttgcaga aaggccacat ggagttagtg ccattaaaca 2340 caatagaaga gaaaaatgca tgctttattc cacatcaacc agtgttgaga cctgacaaca 2400 ttacgacaaa actcagagtt gtattcgatg cctccgcgag gacggacaac aacaactcat 2460 taaacgattt gctaatggct ggacctaatt tacaggctga gttgttacat attctcttaa 2520 gattttgtat ttacaatttt gtcattacag gtgacatcgc tatgatgttt cgtcagataa 2580 ctgttgcgaa agcggatcgt agtctgcagc ttatcctttg gagaaaggaa gaagaagggt 2640 cattggaagc attttcgttg agcaccgtaa catacggaac caagtgcgtt ccatttttgg 2700 tgatgcgctg cttgaagcag ttggcagaag agaacggcga tcaattccca ttggctcaaa 2760 gggcgttagt atcggacttc tacatggatg atgtcctaac tggtagcgac aaattggaag 2820 aggccatcag cttacaagga cagcttactg atctgctggc aaagggtcag tttcccttaa 2880 acaaatggcg atccaacgat aatcgaattt taaagcattt ggcgcaagaa ggcctaaccg 2940 aaaagttgct agtaatacac aaggatgcgc cgttgaaaac attaggagtt ttatggaacc 3000 atcgagagga tgttctacaa tacaacgcga gagagacgac atcaagtcgt ataacgaaac 3060 gcagcgtact ttctgaaatc tcacaaacat acgatccatt aggattactt gggcctgtca 3120 tgatcgttgc aaaaatcatg atgcaacaat tgtggacgct caacttgcaa tgggatgaaa 3180 gcctgccgca agaacttcat agtaaatgaa aaaattatca agcatcgtgg ggtcagctcc 3240 gcgatttgaa gatacctcgt cagagtaaag acaggggtac cacatcaaga atcatcaaac 3300 atggcttcag tgatgcattg gagcgtgcgt acggagcatg tctgtatgca gtgactcgca 3360 ctttagatgg taaacttcaa tcgcagctgc tctgcgccaa gtcgagagta gccccgttga 3420 agatcataac aatcgctaag ttagaattgt gtgccgcatt gttgctagct aagctgtaca 3480 agacagtgca ggaaactctt ggaaataaaa tagcagacgt acgattttgg agcggctcaa 3540 ccattgtgtt aggttggatt cgcacgtgcc ctagcacctt aaaaacattt gtagcgaaca 3600 gggtttcgca aatacaaatt cttggatcgc aatacgtttg gcatcatgtg ccgtcagcag 3660 aaaatcctgc ggatatttta tccaagggaa caacgattga agaactacag actaatcgat 3720 tatggtggca tggcccgtgg tggctgcaag aagagaattc atggcctgag caaccaagca 3780 gcgtgaggga gttaccagag aagaaaactg tagtcagttt cgtgacaacg atccaagcct 3840 cagaagtgtt accttcagtc tcaacattca agcggctgtt tcgaattgtt gcatattgtt 3900 atcgatttgg agctcgacat aactcaaagg tcgaaaccgg agcattgaca gtagcagagc 3960 ttacaaaagc agaaacagca gttgtgaaag tagttcagag ggaagccttc gctcaagaat 4020 acagagattt acaaaaaggt caaccagtca aagataacag tcaactagca gcattggatc 4080 ctttcttaga cagcagagat ctcatatgtg ttggagatcg gttgcagcat tcaaggctaa 4140 cggaggaagc gaagcatccg atcgtgttgc catcgaagca tcacgttacc agattaattt 4200 tcaaagatga acacgtgcgt ttacatcact gtggagcaga gcagttgttg gcatcagttc 4260 gacagaaata ctgggttctt tcgggaagaa aggtggctcg gaagataacc cgctcttgtg 4320 tacattgttt tcgatggcga ccaagtagcc tgcaggtcaa aatgggcaat ttgcctgagg 4380 caagagtcgc taattacgtt cgtccctttg ccataaccgg agtggattat gccggccata 4440 tcaagatcaa ggagagtagg cgtcgtggga gagtgcatag ttccaaggcc tacatagcat 4500 tgttcatctg cttccacact aaagcagtgc accttgaatt agtgacagat ctaacgactg 4560 aatcattttt ggctgctttg cctcggttca cagggcgcag aggtatttgt tcgcaattac 4620 attcagacaa tgcgacaaac tttgtcggag cagaaaggga gcttcgtgag atctacgatt 4680 tcttgcgagg gcagaatgaa gtaatacagg cggagctcgc caatcaaagg atcgagtggt 4740 actttatacc accgacagta cccaatttcg gaggactctg ggagtcaaac ataaagagca 4800 tgaagaaaca tttctacgtc gtgaccaaag gactcacatt gacatttgaa gagtgctaca 4860 ctttactggt aggaattgaa gctgttttga attcgaggtc gcttactcct atatccagcg 4920 acgttaaaga cttgtcagtc ctaactccat cgcatttctt gattggagaa cgtttatcac 4980 aaccagtgga aagagaccat ttacaagagc cagataacag actaaagatt tggcaacatt 5040 tacaaaaggt tcgtcaagat ttttagcgac gttggcaaag agagtaccta gtggagtagt 5100 aacgcaggaa caagtggata aatggcagag agaatttgta accaggggtc ctagtcttgt 5160 tgaaggaaga taatgtgccg cctttgcagt gggctctggg aagagttact gaggtccatc 5220 cagctagcga caacgtcgtt cgtgtcgtta cggtacaaac ggccagtggg aaattcaaga 5280 gagcagcaag gaatttatgc cctctacctt atgaagactg tataggttga cacataaatt 5340 cacacaacat aacacatacc caacacgcga taaggtttga aagggagcct ttcaaggcgg 5400 gagggat 5407 // ID BEL-629_AA-LTR repbase; DNA; INV; 743 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-629_AA_; KW Pao_Bel_Ele167; BEL-629_AA-I; BEL-629_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-743 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 743 BP; 251 A; 134 C; 145 G; 213 T; 0 other; tgttgcgtac actttaagtg ttgcgccgca cctgatgtct acagcactta ttgaacgctg 60 acctgtcgtt tcaagtactt tgcgcctgcg gaactgacgt acgtgtgacg gttaacgatg 120 acatgggaga aatgtcagag atgacaacaa gaatataaaa aaacgaggat tggcaaatct 180 acagaccggg cgtgaacaca ttatagttag tcacagttag ccttttttcg ttgttctata 240 aagttattgt ttctataatt ctataaagtt atttgttata ctagtttcta tttatgctta 300 cactaatttg tatcggtatc taaagtgaaa aggcaagatg tgaaaccttc tggcttatga 360 gaagtggaag agaaatctaa agagaaaaat ctaaattata ggttagcaaa ctaggaaatc 420 tgttccagtc gcctgcgaag aatcgcgctc aaatacagtt cgtgaactaa ggtaaattac 480 taaaaactac aaacgagtag tctacgacta atttttgtac aaaaatagga ttattcactc 540 ccacaccact ggtcactact gatcgattcg ttgcaagaaa aaaggaaaat gcacaaacgt 600 gagtaaaata aaattaacct aaaaactatg cctaaatgat actttatgta acaggaaaat 660 tttaatcgct gcgctcggaa ttgcgccaca aatatacggt tcaaagtttc ggagaagtcg 720 tttcccttgt tgcatccgca aca 743 // ID Gypsy-2_DWil-LTR repbase; DNA; INV; 2521 BP. XX AC scaffold_180697; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DWil_; KW Gypsy-2_DWil-I; Gypsy-2_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2521 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180697; Positions 2657987 2660507. XX SQ Sequence 2521 BP; 760 A; 414 C; 470 G; 877 T; 0 other; tgtagtagcg cttgtaaccg tttgtattta ggaagaaaat aaaaaaaaat gagaaagaat 60 gatttttttt gtctataact aaaacgacat ataaacgttt tatccctaaa agcgagcaat 120 ataagtttca tatagtaacc gtgttcactg ctttcactgc ttacactgct tgcgctgctt 180 gtattgttag ttagcgttat actaactaac agaatgttag tctactaggt aaaatttggg 240 aaaggtagga gctgcgtttt cgagattcgt ttttttttgg ttgtgttcag aaaattttct 300 tcgaattttt caatccgatt actttctctt gcgtttttgt ggcgccccct aggctcctct 360 aaagttggga tcagggaagt ttaggggtaa tcgaggtgaa gagaggggtc caaagaggac 420 taatggatag agagttaacc aagtatgtcg gcccccatgg gaatgacttt tctttttcaa 480 gttatcttca agtagtgcgg tcacaaaaaa gaaaaaaaaa ttaaagtgat agtgaaaacg 540 aattttattg aaaaaaaaaa gtctgactgt gaaaaagtcg gcataaattg tgaattcgaa 600 gcagttttct gctaaggagt atagtatgtg tcttaagaaa gtgaatgcac atgtcagcct 660 gaattagatt gctaagagaa aggcatatat acatattttt aatccggtgc caaatcaact 720 gaattggcac ctagttttgg tctaaaccta gttttggtct aaacctagtt ttggtctaaa 780 cctagttttg gtctaaacct agttttggtc taaacctagt cctggttcaa acctagtcca 840 cagactagat ataatggtat aagtctaaca gcgaacttaa gagaggcgca ggagaggcac 900 gcccgtcgaa acggactgaa gcccacggga gcagcagact ctcgagaagg agtattttgg 960 cgcagacatt tggccagaag agatctgatg cgacaggcgt aaaaagagaa aaaaaatata 1020 aagagtatat gattctctag gcccccccca ctacccttgc tttgcgtgat aagcaaaagc 1080 tttctttaga tgattatcag ccagcgaagc tttcatgtgt agccgcttgg cttgcgtttt 1140 gttttgtttt ttttggttac tggtgattgc cgcatagcct ccaaccacaa aaaaaaaaaa 1200 agaaaaatta aatatcttcg ccaggcccta atgatgaatg taactcttta attttgggca 1260 caaataattt atgcccttga ttttttttta tttttttttt atatataatg ttatcttaaa 1320 gttaatcttt ttaaatcgta aaattctgat gcattggcgt tctccaaagc caattttatt 1380 tttttttctt ttatttcatt ttgcttttgt aatttttttt atgtaatttt ttttggtcat 1440 gattgaactt cgatgaaaat ttttatttaa cccctttttc ggcacgcccc tggcaaaagc 1500 ttatcatgtg gtttaattct ctctcaagcc agcaacaaca actctcccta tatttaatgc 1560 aattgcttac agctgtcacc agaaagatct ttctccgtgg aatgcttaag tgagtactct 1620 tttgttttaa tatcattttt ttttgctgtt ggtatatgat ggtaaagtaa agagagagtt 1680 aaagcagttt tcgttactct cgtttaaata acataaaaaa aattaaattt tggtaaagta 1740 aagagagagt taaagcagtt ttcgttactc tcgtttaaat aacataaaaa aaattaaatt 1800 ttggtaaagt aaagagagag ttaaagcagt tttcgttgct ctcgtttaaa taacataaaa 1860 aaaattaaat tttggtaaag tcatcagaag tttaacttat gctcaaaaaa aaaaataatt 1920 tttctttcct tgaatttcaa aatgctttct atataaacat tgtcatacct atatatatat 1980 tatcatatca cctttatatc attatctata tttttgatct ttgaaatgcc ttattatatt 2040 gcttattgtt attaaatata agtttttgaa atgaatttgg tctgagacgt tcgtgagggg 2100 gggaatcatg cgaagtaagt taggttgaaa aattggtttg gcatagggat cgctgtgtaa 2160 ttttatattt tggctttagt acagcgatca tggtagggtg ggcgaatttt tcttttgggc 2220 gtcactgaca tctgacccct ttctttttca gcagctgcgc taaaaacgac ggtctttctg 2280 ttctcttctc atccttttct ctgttctttt ctttttttgt acgaaacgtt aaaccgagta 2340 taagtagtct tagtacgcga atttattaat aaaattttct actagcagag cgggtagtct 2400 atccattccc ccatccccca tttctcaccg actaatttga gccgaaacca gcgcgtacac 2460 ttgataggta ctaccagttt ccaacagctc gcagtaaact tgtagcaggt tatatgttac 2520 a 2521 // ID Gypsy-4-LTR_HM repbase; DNA; INV; 206 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-206 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1975-1975 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 206 BP; 74 A; 20 C; 32 G; 80 T; 0 other; tgttaatagc taatttgtaa acgctattat aatttaatag cgtttacaaa tccttagtgt 60 aaatatgtcg taatttcttt tttacttttt gtttgtacat aattatccca agggagcttt 120 tgttgttgac gaaaaatgta taaaaattgt aaatagtatg aaaaagaaca gagttcataa 180 aaagtaaaga tagttgtcgt tttaca 206 // ID Gypsy-14_AA-LTR repbase; DNA; INV; 256 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_AA_; KW Gypsy-14_AA-I; Gypsy-14_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-256 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 998-998 (2011). XX DR [2] (Consensus) XX SQ Sequence 256 BP; 72 A; 40 C; 73 G; 71 T; 0 other; tgtaacgatg tgtcaaatat cgttactggc tatacaaagg ttattatata agttaatggt 60 gacacaaggg ttattgtata acagtcagcc tatctgctag ttggaaacgg gctatgattg 120 tgtttgtgtt gggatcaaca gtagagcgat gtgaaaacgg ggattcgtgt gtaggctatt 180 gacggagaca gacgtgtcta cggagctcgc ggcgaacgag tagttcaagc atagaaacgg 240 cccgtcggat tttaca 256 // ID DNA8-82_AP repbase; DNA; INV; 749 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-82_AP. XX NM DNA8-82_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-749 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2018-2018 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 749 BP; 324 A; 98 C; 101 G; 226 T; 0 other; gggttcggat ttgaaggcaa tataatcaat aaaaaaagca tttaaaatgt aaaaaaaggc 60 aaaaaaaggc atttaattta tataaaaagg caattcataa atatatacta caaaataaaa 120 tagtgaacat tttttaaaca ctttcaaatg cacacatttt gagaaaaata caaattaaaa 180 atgagcatga atactaaact acattaagtt tatgctattt tgtataaata gattaccttt 240 ttctttaacc aggtttgtcc taagaattag ttaaaaacaa ttttaaaaat taacttactt 300 gagttatacc agttaattta atgcaaaata aactagttgg tattaaaaaa aaaattgact 360 accatgtatt ttgataagct gtcttcagaa aaacggttac ggtttgggct caaaatattt 420 ttatacatcg aaaaggatct ttctacgtcc accgaagtta tgggcgtacg ggtaatttta 480 tggccatacc aaatactcga taagacgata tcgataaata ttgcgaagaa taaaacccat 540 ttgtagcgat aatacgtaag ctgtgtataa aattataaaa tcaactgaaa attaaacaca 600 aaaaaggcat ttataagtac aaaaggcaaa aaaaggcaaa aaaaaatgca tccatcgtct 660 aagaaattaa atgaaacaca tttttgtata aaaattattt ttctattatg attaataaaa 720 gaaaggcatt tgccttcaaa tccgaaccc 749 // ID DNA8-4_AP repbase; DNA; INV; 224 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-4_AP. XX NM DNA8-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-224 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1746-1746 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 224 BP; 75 A; 40 C; 31 G; 78 T; 0 other; cacagatatg tataataact agatacccga ctccatatac aacaacatta ttaaattatg 60 atgcccgacg ccatttgcaa ctttggcaat gtttacattt attcatatac atatttactt 120 atatatacat atatactgat aatactattt tgccctagtt gtaaatggct gccagcgtca 180 ttgataagaa gaagtcgggt atctagttat tatacatatc tgtg 224 // ID Gypsy-110_AA-I repbase; DNA; INV; 4966 BP. XX AC AAGE02027584; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-110_AA_; KW Gypsy-110_AA-LTR; Gypsy-110_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4966 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027584; Positions 112090 107125. XX CC Positions [3980-4456] - Integrase core CC 'AGCGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 731..2680 FT /product="Gypsy-110_AA-I_1p" FT /translation="MDLLQVDDSRPVQQFRCDDIEKHRLYNEWRIWKGALE FT CYFEAYDIVDQRKKRAKLLHLGGPQLQRIFQNLPNRESFPLVSLEKQWYDV FT AINALDDFFQPCRQDCLERHKLKQMKQKEGERFADFMLRLRQQASDCGFDK FT YSNEIKDVLTEIFLTDAVIEGCASMELRRRILQQDRSLSEIETLGVALESI FT DIQVKDLNGKSKEIAGGQQVYKISAGYTEDRNKMQFAQRGARRDSKKFSDP FT RDNKFSQKIYRCYNCGRRDHISTDRRCPARNIVCHGCKVTGHFESCCRLRN FT SRKKSVVRSAGDRDRINVVEEEASTPELPVEQQSGKTYYAFYSGNDTNVLT FT SEVGGIKLDLLVDSGSDANLIPDTVWEKLKHDNVNVRQSTKGSSRILKGYA FT SELPLPILGTFVADVKAGNCLVAAEFYVVKGGQRSLLGDKTSKELGILKVG FT LDVYPVTSETKPFSKIKDVQVQIHMDPSIKPVFQPIRRVPIPLEDAVNRKL FT EQLLIKDIIEEKQGPASWVSPLVVVGKNNGEPRICLDLRRVNEAVLRERHP FT MPVVDEYLARLGKGRYWSKLDVKDAFLQVELAPESRDVTVFITNKGLFRFK FT RLPFGLVTAPELFQKVMDQILAGCEGTSWYLDDVIIEGSTREEHDERLEKV FT C" FT CDS 3056..4936 FT /product="Gypsy-110_AA-I_2p" FT /translation="MLTKKDSKFEWTGLHQKSFDSIKAAMTDVLRLGFFNK FT EHRTTVMADASPTGLGAILIQTDPSGNSRVVTCASKSLTDTERRYCQTEKE FT ALGIVWSVERFQVYLYGRQFDILTDCKALVYLFTERSRPCARIERWVLRLQ FT AFDYLVCHIPGERNLSDVLSRLSTVAPVPFDQREEMFVKQVVLSSATSVAL FT RWDEIVAASLEDSKIQEVLDMIDKGNIAELPLSFRVIASELCRFGDVLLRT FT DRIVIPNSLRERVIKIAHEGHLGMRTMKSHLRAAVWWPKMDMAIESFVKKC FT RGCLLVSTPDPPQPMIRKELPNGPWEDIAIDFLGPLPNGETLLVVVDYYSR FT YVEICEMHKITAKLTIEELGKIFCRYGVPITIRADNGPQFNAGCEEFKNFC FT EEFGIRAVNTIPYWPQQNGEVERQNRSILKRLRIAQELGQDWRSVLCQYIL FT SYHATPHPTTGRSLSELLFGRRIRTKLPQLPHSSHTDEAVRDKDRFQKEKG FT RIAADSKRRARSSMIEVGDRVLVKRMRKENKLSSTFAPEEFVVVRKSGADT FT TVRSTSDGKDYRRNIADLKRIEPEEQQQNAPDDIAEACTLHHEELQPQQSH FT IVPVASEDQTSGSKRVRKEPVKFSDYIPH" XX SQ Sequence 4966 BP; 1476 A; 905 C; 1243 G; 1342 T; 0 other; atttggcgac gaggaaatta agaatggtaa gtggaaatat ttataataat tcaagtgcac 60 gaggcttcgg aaattgagtt tgtttcgttt gaaaatggcg gagctaagga atgttgacct 120 gcatgtgtat tagtgtgttc atatatgaca aataccatgc tccgctaggt gtggaattcc 180 aataagaatt gacatgttta tgttcatgga ccaaccgttt gcgatgagca tagcaatgca 240 cagtgttgct tgtgatgaag atagtgaatg gaaatgattt cattaatcca gccaagccaa 300 tgatagttta aagtaaatat gtacctatag agtcttctat aggtggtgtt cagatgagtt 360 gaacacgggt agattttcag gaaggaccac tagttgttgc tagaggaggt gattatgtta 420 atcacgggat caaatatgta agtactttag ttgttgctat aggaggtggt caacgcattg 480 atcacgggtt ggttttgttc agtacctgta gtcgtacagg aggtggtcgg ataaaccaac 540 cacgggcatg tattatggaa tgcgaaccga acacaaaatg aaaaatgatg tctatcaggg 600 tttgaaatta gaaaacaaat aaaaagatga tggaattaaa attgaaagtt tgttttgatg 660 cataaaatta tatggtttgg aaagatttat ttacgtgcat atacaatgtg tctgattggg 720 gttttgttgg atggatttgt tacaggtgga tgatagccga cctgtacagc agtttcgatg 780 tgatgacatc gaaaaacatc gtttgtacaa tgagtggagg atctggaaag gtgcgttgga 840 atgctacttc gaagcatacg atattgtgga ccaaaggaaa aagagagcca agcttttgca 900 tttgggtggt ccacagcttc agcgaatctt ccagaatctt ccaaatcgtg aaagtttccc 960 actcgtgtca ttggagaaac aatggtacga tgtggctatc aacgcattag atgatttttt 1020 ccaaccgtgt cggcaagatt gcctagagcg ccataagcta aaacaaatga agcaaaagga 1080 gggagaaaga tttgctgatt ttatgctgag attacgccaa caggcttcgg actgtgggtt 1140 tgataagtat tcgaatgaga tcaaagacgt tctgacagag atattcctca cggatgcggt 1200 aattgaaggc tgtgcgtcta tggaattacg gcgccgaatc ctgcagcagg atcgttcact 1260 tagcgagatt gaaactctgg gagtagcgtt ggaatccatt gacattcaag tgaaagatct 1320 caacgggaag tcgaaagaaa ttgctggcgg acaacaagtc tacaaaatct ccgctgggta 1380 taccgaagat cgaaacaaga tgcagtttgc acaacgaggt gcacgacgcg attcgaagaa 1440 gttcagcgat ccacgggata acaagtttag tcagaaaatc taccggtgct acaactgtgg 1500 ccgccgtgac cacatctcta ccgatcgaag gtgtccggca cgtaacatcg tgtgccacgg 1560 atgtaaagtg actggtcatt ttgagtcttg ttgccggttg cgaaattcaa gaaagaagtc 1620 tgtagttagg tcagcaggtg atagagatcg catcaatgtc gtagaagagg aagcatctac 1680 tcctgaactc cctgtggaac agcaatctgg taaaacatac tacgcgtttt attccggcaa 1740 tgataccaac gttctgacga gtgaagttgg tggaataaaa ttagatcttc tggttgattc 1800 tggttctgac gccaatctaa ttccggatac cgtttgggaa aagttgaagc acgataatgt 1860 caatgttcgc cagagtacca agggtagttc tcgtatccta aaaggttacg caagtgagtt 1920 gccgctacct atcttaggca ctttcgttgc tgatgtcaaa gctgggaatt gtttggttgc 1980 agcagaattt tacgtcgtta aaggtggtca acgatcctta cttggagaca aaacctctaa 2040 agagttgggc atcctgaaag tcggtttgga tgtctatcca gtcacgagcg aaacaaaacc 2100 gttttctaag attaaagatg tccaagttca gattcatatg gatccgagta tcaagccagt 2160 gttccagccc attcggcggg ttccaatacc attagaagat gcagtaaacc gaaagctaga 2220 gcaactgctt atcaaggaca tcatcgaaga gaagcaaggg cctgcttctt gggtttcgcc 2280 ccttgttgtc gtcggtaaga acaatgggga accaagaatc tgcctggatc tacgtcgtgt 2340 caacgaagcg gttctacgtg aaagacatcc aatgccagtc gtagatgagt acctagctag 2400 actaggcaaa ggaaggtatt ggagtaagct agatgtgaag gatgcctttc tacaggttga 2460 acttgcacca gagtcacgtg acgttaccgt gttcattacg aataagggat tgtttcgttt 2520 taaacgactt cccttcggtc tagttaccgc acccgagcta ttccagaagg tgatggacca 2580 gatactggcg ggctgcgagg gaacttcatg gtacctggat gacgtgatca tcgaaggtag 2640 cacgcgagag gaacatgatg agcggctgga aaaggtatgt taaaactgta ggctttgatt 2700 tgtttgattt gagaagtgaa ttgaaacatt ttggtttttt tttgtaacac ataatttgac 2760 attttcaaga taataaagac ttttgtttga tgtatcatgt tgtcaggtgc tgaaacgatt 2820 tgaggaacgc agagtcgaac taaattggga aaaatgcgta tttggggtaa ccaaagtgga 2880 ctttctggga catcaaattt ctcctgatgg aattgttcca gcaaatgaca aagtggtcgc 2940 agtccagtca ttccgacagc cggaaaatga aggagaggtt cgaagttttt tgggtttagc 3000 caactatcta aacaagttta tacctaattt ggctacactg gatgaaccac ttcgaatgtt 3060 aacaaaaaaa gattctaagt ttgaatggac tggtttacat cagaaatcgt ttgatagcat 3120 taaggctgcg atgaccgatg ttttaaggct tgggtttttc aacaaagaac accgaactac 3180 agttatggca gatgcaagtc ctacgggatt aggtgcaata ttgattcaaa ctgacccatc 3240 aggtaatagc cgagttgtta cctgtgcatc gaagtcgctt accgacaccg agagacggta 3300 ctgccaaacg gagaaggagg ccttaggcat cgtttggagt gtggagcgtt ttcaggttta 3360 tttgtatggc cgccagttcg atatcctaac agactgtaaa gcattggttt atttgttcac 3420 ggaaagatca cgcccatgcg caagaataga acggtgggtt cttcggctac aagcatttga 3480 ttatctagta tgccatatac caggtgaaag gaatttgtca gatgttcttt cccgattgag 3540 taccgtagct ccagtaccat ttgatcaaag agaggaaatg tttgttaaac aagttgtctt 3600 atcatcggca acgtctgttg ctttaagatg ggacgaaatc gttgcagcat cgttagaaga 3660 ctctaaaatt caggaagtac tagacatgat tgataaagga aatattgcag aattgccgct 3720 ttcttttcga gtgatagcca gtgaactgtg tcgatttgga gacgttctgc tacggactga 3780 tcgaatagtc attcctaatt cgctacgtga gagggtaata aagatcgccc acgaaggaca 3840 tttggggatg cgaacgatga aatcgcatct acgcgctgct gtgtggtggc cgaagatgga 3900 tatggcgata gaatcttttg tgaagaagtg ccgaggatgt cttttagttt caactcctga 3960 tccacctcaa cccatgattc gaaaagagtt gccgaacggt ccttgggaag acatagcgat 4020 cgattttctt ggaccacttc caaatggcga aacgttactg gtcgttgtag attactacag 4080 ccggtatgtt gaaatttgtg aaatgcataa aatcactgca aaactaacaa ttgaggagtt 4140 aggaaagatt ttctgtcgct atggtgtacc aatcacaatt cgggctgata acggtccaca 4200 gtttaatgcc ggctgtgaag agttcaaaaa tttctgcgag gaatttggca tccgagctgt 4260 caacaccatt ccttattggc ctcaacaaaa tggggaggtt gaaaggcaaa atcgttccat 4320 cctgaagcgg ttaagaattg cacaagaact aggccaggac tggaggagtg tactgtgtca 4380 atacattttg tcgtaccacg caacacccca tcctactaca ggtcgatctc tctcggaatt 4440 gttgtttggt cgaagaattc gcacaaagct tccacaactc cctcacagtt ctcatactga 4500 tgaagctgtt cgagacaaag atagattcca gaaggaaaag gggagaattg cagctgactc 4560 aaagcgtcgt gcccgttcaa gtatgattga agttggtgat cgagttttgg tgaagcgaat 4620 gcgtaaagag aataagctga gctctacatt tgcaccagaa gagtttgttg ttgttcgcaa 4680 aagtggggct gacacaacgg tgcgttcgac atcagacgga aaagattata gacggaacat 4740 agcagacctt aagagaattg aaccagaaga acagcaacaa aatgctccag acgacatcgc 4800 ggaggcttgt acgttacatc atgaagagtt acaacctcaa caaagccaca ttgtaccagt 4860 tgcttcagaa gaccagacgt cgggttcaaa acgagttcga aaggaaccgg ttaagttttc 4920 ggattatatt ccacattaac aatgtttaaa taaaaccaag gggggt 4966 // ID BEL-591_AA-LTR repbase; DNA; INV; 572 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-591_AA_; KW Pao_Bel_Ele161; BEL-591_AA-I; BEL-591_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-572 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 572 BP; 205 A; 104 C; 89 G; 172 T; 2 other; tgttgatact ggcagcactt ctggtcacac ktcaatgacg acaggagtsa accgttcaac 60 ggtctgtcaa accgatggca tgacaaagag acaatgcaat gatctgctat gtcattctta 120 tcattgatag ctagagagca aaccagatag caagtaattt gtaaaaggtg atttctaaaa 180 ctttgattat ttgttattct aaactacaac taaatttgaa cttacagtta ttctcaagct 240 gtttctaaac agtaaatttg ttccctatta taaaagttca tgcacacaca aattgtaagt 300 tgttatatct aatttgttaa aacttaacct aacttatagt aaatctacag attacacgtg 360 tgagaattcg ttagtacggt tggattgtgt caaaataccc tcttataccg atttttgtaa 420 gtaccatgca acaattcaat taccttaaaa ctaacgataa aataaaatta cagctaaagc 480 tacatttaca acataaaaca cggtcgttgt aaattgctca aagagatcga acaggggttc 540 acaatctaca ccgataaatc acaacgggaa ca 572 // ID BEL-11_AA-I repbase; DNA; INV; 5258 BP. XX AC supercont1.187; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_AA_; KW BEL-11_AA-LTR; BEL-11_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5258 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.187; Positions 553140 558397. XX CC Positions [4308-4856] - Integrase core CC 'ATGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 25..1479 FT /product="BEL-11_AA-I_1p" FT /translation="MEGKMVDRTPSPRSHTPRNCQSCDRPDHAEEEMVQCS FT ICKLWEHFGCAGVDEQVRQHDVKYICKRCEAKQGTSENLIPQLPTDGTRLS FT KGAKAASRAGSKRGKKVPDPPKSTTSSMRVALLEEQLKLVEEELRLKEQEL FT LEQDEMKQRQLKEEERQLEEKRKIAEEERNIRERKLKEELALKIKQQQIRR FT ESLEKRQQIIRQQAEMSSRGGSIPDVGERVGNWLDSQAKGSEGGQKMTCDD FT SQSIDSNLGDVSNEPVVIQPHAAELMGTPKTTRPQHIDSRAQEVSPFNNRL FT TQVQIAARHVLGKDLPIFSGNPEEWPVFISNFEQSTATCGYSDAENLVRLQ FT RCLKGQALESVRSRLLLPASVPHVIKTLRMLYGRPELIIRSLMNKIHQVPP FT PRHDRLETLIHFGLSVQNLVDHLKAASQHNHLSNPALMHELVEKLPGSMRL FT DWAIYKNKNQPATLETFGDFMSGLVDAASEVSFEVSTCTG" FT CDS 1962..5258 FT /product="BEL-11_AA-I_2p" FT /translation="MDLVMKRDYFGKPTILNFLTATHALKEDQNLSLEGLE FT RRLEREPDLKKRVHEQILDYERKGYAHKATLAELTSVDVDRVWYLPLGVVI FT NPRKPDKIRLIWDAAAKVGNTSFNSRLLKGPDLLSPLPRVLYKFRQHPVAV FT CGDIMEMFHQIRIRSPDCQSQRFLFREHSSDSPQVYVMDVATFGSTCSPAS FT AQFIKNLNADEFASEYPRAAVAIKFHHYVDDYLDSFESVTEAVTVVNEVKL FT VHSKGGFTLRRFLSNEAEVLQKIGEFSEQAPKNLDLERGGKSESVLGIKWM FT PFEDLFMYAFGFRGDLQYILNEGHIPTKREIARVVMSFFDPLGFIAFFLVH FT GKAILQDTWVKGTEWDQKIPEDLNERWRQWSGLFHQLNQLHIPRCYFRSSF FT PKNLDALQLHLFVDASEAAYAGVVYFRLESEKGVQVSLVGAKVKVAPLKTL FT SIPRLELKAAVLGIRLQEAIQSQHTFPISRRFCWSDSGTVLSWIRSQDHRR FT YHKFVAVRVGEILSSTESSEWRWVPSKLNVADLATKWGNGPQLNMDSPWFQ FT GPNFLREDKENWPIQQKIDSTEVELRPVRCHFSHFTPIMDFTRFNGWIKLH FT RVTAYVLRFVENILRKKKGQQLELGVLTSNELRCAEETLWKGAQGEAFPDE FT VAVLTKSKGPPEKRHSIVQKSSVIYTKWPFLDERGVLRSRGRIGAAPHAPT FT EAKFPVILPKDHLITFYIVDWFHRRYRHANRETIFNEIRQRFDIPTLRRLL FT DKVASKCAWCRITKALPKPPAMAPLPEMRLKSFIRPFTYTGLDYFGPVLVK FT VGRSHAKRWVALFTCLTIRAIHMEVVHSLSTESCIMAVQRFVSRRGIPTEF FT WTDNATTFQGTSNEIKATKQALAQKFTTTQTTWKFIPPASPHMGGAWERLV FT RSVKQTIGTVLDESRKPDDETLETVLLEAEAMINSRPLTFIPLESADEEAL FT TPNHFLLGNSSGTKFLPTGPIDNSSTLRSSWKLARFITDEFWRRWIKEYLP FT VITRRCKWFQETKDLEVGDLVLIVGGAARNQWIRGRIEEVFPGTDGRIRQA FT LVRTSMGILRRPAVKLAVLDVEEKCKPGSIASEPDQGLRAGV" XX SQ Sequence 5258 BP; 1460 A; 1273 C; 1310 G; 1215 T; 0 other; ttctttaaga tttttgttcg ggttatggaa ggtaagatgg tagatcgcac cccttccccg 60 cgatctcata ctcctcgtaa ttgccaatca tgcgaccgcc ccgatcatgc cgaagaagag 120 atggtccaat gcagcatatg taagctttgg gaacacttcg gatgtgccgg tgtggatgag 180 caggtgagac agcacgatgt gaaatacatc tgcaagcggt gtgaagcgaa acaaggaaca 240 tccgaaaacc taattcccca acttccgacg gatgggacac gtttgtcaaa gggagcgaaa 300 gcagcgtcaa gagcggggtc aaagaggggg aaaaaggtac ccgatccacc gaaaagtacc 360 acatctagta tgcgcgtggc tctcctagag gaacagttga agttagttga ggaagagctg 420 cggctaaagg aacaggagct tcttgagcaa gacgagatga agcaacggca actgaaagaa 480 gaagaacgtc agctagaaga aaaaagaaaa attgcggaag aggagcgtaa tattcgcgag 540 cgcaagctta aggaagagtt ggcactgaaa ataaaacagc agcagatacg aagagaatcc 600 ttagaaaagc ggcagcaaat catccgtcag caagcagaaa tgagcagcag aggtggttca 660 atcccagatg ttggcgagag agtgggaaac tggctggatt cccaagctaa aggatcggag 720 gggggtcaaa aaatgacctg cgatgacagt caatctatcg actcaaacct cggagacgtc 780 tctaatgagc cagtagtgat ccagccccat gccgcggaac ttatggggac accaaagaca 840 acacgacccc aacatattga ttctcgagct caagaagttt ctccattcaa caatcgtctc 900 acacaagtgc agattgcggc acgacacgtc ctggggaaag acctgccgat attcagcgga 960 aacccagagg agtggcctgt cttcataagc aattttgagc aatcaacggc cacctgcggg 1020 tattcggatg cggaaaatct agttcgcctc cagcgatgct tgaagggtca agcattggaa 1080 tcggtaagaa gtcgcctact tctacccgca agtgtacccc acgtcattaa gacgcttcgc 1140 atgctttatg gtaggccaga gctcattatt cgatcgctga tgaataaaat ccatcaagtt 1200 ccacccccca ggcacgaccg gttggaaaca ctgatccatt tcggcctctc cgtacaaaac 1260 ctagtggacc acctcaaggc agctagtcag cacaaccatt tgtcgaatcc agctttaatg 1320 catgagctag tagaaaaact accaggatct atgcgactag actgggccat ctataaaaac 1380 aagaatcagc cagcgactct cgaaaccttt ggagatttca tgtctgggct cgtagacgcg 1440 gccagcgagg tatcctttga agtctcaacg tgtacaggtt aacattgctg gcaaggactg 1500 tgacactcaa tataaactcg ttgacgcccg tacagtcagt cggctggtac tgccgtcaca 1560 aaccctgcag tatggagttc tggcacaacg cttccctcat ctacgaggtc ttcccctaca 1620 agactacgaa ctagttcaac ccaaacttct gatagggctc gataatctga gattgtgcgt 1680 tccactgaag ctgcgggaag gaaggccatc agaaccgata ggcgcaaaat gtcgactggg 1740 atggagcatt tacggatgca ttcctggtca atcgtcccag aaggcgatcg tcaatctaca 1800 tgtagctgca gtttgtgacc cagaccgaga gctgaacgaa cagttgcgtg attttttcac 1860 cctagagtgt gctggtgttt cgggttcacg cgaggcacct gaatcatgtg acgaaaggcg 1920 tgctagagag ttgctgacga gcacaacacg gcgtgtttca catggacttg gttatgaaac 1980 gggactactt tggaaaaccg acaatcctca atttcctgac agctacccac gccctcaaag 2040 aggaccagaa cttgagtctc gaaggactgg agcgaaggct cgagagagaa ccagacttaa 2100 agaaaagagt tcatgagcag atcttggatt atgagcgcaa gggatacgca cacaaagcca 2160 ccttagctga attgacgtcc gttgatgtcg atcgggtatg gtaccttcct ctcggtgttg 2220 ttatcaatcc tcgaaagcct gacaagatac ggttgatctg ggacgctgca gccaaggtgg 2280 ggaatacctc cttcaactca aggttattga aaggacccga tctactttct ccccttccaa 2340 gagttctcta caagttccgt caacatccgg tggccgtctg tggagatata atggagatgt 2400 ttcaccagat aagaattcga tctcctgatt gccaatctca acggttcctc tttcgcgagc 2460 actcgtcgga ttcgcctcaa gtgtacgtaa tggatgtggc cacattcggc tctacttgtt 2520 cacctgcatc ggcacagttt atcaaaaatc tcaacgccga cgagtttgca agtgagtatc 2580 cgcgagccgc tgttgctatc aagtttcatc actacgtaga cgattactta gacagctttg 2640 agtcggttac tgaggctgtt acggtggtga acgaagtcaa gctggttcac tccaagggtg 2700 gctttactct gcgacgattc ctgtccaatg aagctgaggt tctacaaaaa attggagaat 2760 ttagcgaaca agctcccaaa aatcttgatc tggaacgtgg aggaaaatca gaatcggtcc 2820 ttgggataaa gtggatgccg tttgaggatc tgttcatgta tgcctttgga ttcaggggcg 2880 atctccaata catacttaac gaaggacaca ttcccaccaa gcgagagatt gcacgagtgg 2940 tcatgagctt ctttgatcca cttggcttta tcgccttttt tcttgtgcat ggcaaagcta 3000 tccttcaaga tacttgggtg aaggggaccg aatgggacca aaaaatacca gaagacctga 3060 acgaaagatg gcgacaatgg tcaggtctat tccatcaact gaaccaactt cacataccaa 3120 gatgttactt ccggtcttca ttccccaaga acctcgacgc acttcaactt catttattcg 3180 tagatgccag cgaagcagcg tatgccggcg tggtttattt ccgcttggaa tcagagaagg 3240 gcgtgcaggt gtctctggtc ggggcgaagg tcaaggtagc cccccttaag actctttcaa 3300 ttccaagact cgagttgaaa gctgccgttc ttggaattcg tctacaggaa gcaatccaaa 3360 gtcaacacac cttcccgata agtcgtcgtt tctgttggag tgactctgga accgtactct 3420 catggattcg ttctcaagat cacaggcgct atcacaaatt cgttgccgtt cgtgtgggcg 3480 aaattctgtc ttccacagaa tcgagcgaat ggaggtgggt cccatccaaa ctaaacgtcg 3540 cggacctcgc aacgaagtgg ggcaacggcc ctcagctcaa tatggatagt ccgtggtttc 3600 aagggccgaa tttcctacgt gaagacaagg agaattggcc aatacagcaa aagatcgatt 3660 caacagaagt agagttgaga cccgttcgat gccacttctc acactttact cctattatgg 3720 acttcactcg tttcaacggt tggattaaac tacatcgtgt gactgcatat gtgttacgtt 3780 ttgtagaaaa tatactacga aagaaaaagg gtcaacagtt agagcttggc gtactcacta 3840 gcaacgaact tcggtgcgcc gaagaaacac tgtggaaagg tgcccagggt gaagcgtttc 3900 cagacgaggt ggctgtcctt accaaatcaa agggtccacc cgaaaaacgt cacagcatcg 3960 tacagaagtc cagtgttata tacacaaagt ggcctttcct cgatgaaaga ggagttctac 4020 gaagccgtgg cagaatcggt gctgcacccc acgcaccgac cgaagctaag ttccctgtca 4080 ttcttccaaa ggatcattta atcacctttt atattgtaga ctggttccat cgccgttacc 4140 gccacgccaa cagagagacc atcttcaacg aaattcgcca acgatttgac ataccaacgc 4200 taagacgact tttggataaa gtagctagca aatgtgcttg gtgccgcatt acgaaggcct 4260 taccaaaacc acccgctatg gccccgctcc cggaaatgcg cttaaaatct ttcattcgcc 4320 cctttacata tactggtttg gattacttcg ggccagtact tgtcaaggtt ggcagaagcc 4380 atgccaagcg ttgggtggca ctttttacct gcctcaccat cagggccatc catatggagg 4440 tagtgcattc attgagcaca gaatcgtgca ttatggccgt acagcgattt gtatcccggc 4500 gtggtattcc cacggagttt tggaccgaca atgcgacaac cttccaaggt accagtaatg 4560 aaattaaagc aacaaagcaa gcgctagctc aaaaatttac caccactcaa acaacgtgga 4620 agttcattcc accagcatca cctcacatgg gcggagcttg ggagagactc gtccgttccg 4680 tcaaacagac aatcggaact gttttggacg aatctcgcaa accggacgat gagaccctag 4740 aaacggtctt attagaggct gaagccatga tcaattctcg gcctcttact ttcattccgc 4800 tagagtcggc agacgaagaa gccctcacgc caaaccactt cttgttgggc aactcatctg 4860 gcacgaaatt ccttcccaca gggccgatag acaactcatc aaccctaaga agcagctgga 4920 aactggccag gttcatcacc gacgagtttt ggcgtagatg gataaaagaa tatctcccgg 4980 ttatcacccg taggtgcaaa tggtttcagg agacgaagga cttggaagtt ggagatttgg 5040 tcttgattgt tggtggtgcg gctaggaacc agtggatcag agggcgtatc gaagaagtgt 5100 ttcctggaac agatggacga attcgccagg cattggtacg gacctcgatg gggatcctgc 5160 gaagaccagc cgtgaagctg gctgttttgg acgtcgagga gaagtgcaaa cctggttcca 5220 ttgcttcaga gcctgaccaa ggtttacggg cgggggta 5258 // ID BEL4_Cis_I repbase; DNA; INV; 5609 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of BEL LTR Retrotransposon from Ciona savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; internal portion; KW BEL4_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5609 RA Smit A.F.; RT "BEL4_Cis_I - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000710, Ci000626, Ci000646 ORF from 27-2287 (peptidase) and bp CC 2187-5127 (RVT, int). Products 57% identical (73% similar) to CC BEL3_Cis protein. XX SQ Sequence 5609 BP; 1790 A; 1102 C; 1286 G; 1410 T; 21 other; gtaacttaac aacgacgtcg gcgttgattg gcctctatag tgactggaac cagataatcc 60 acagagtgaa ctaataatgg atgacacaat tacggacaaa attnatctgg cggtaattga 120 tgagatgaat gtgacagtgc gtcggcatgt aaatctgaat agaaagntaa ttgcccaaga 180 agcaaagttt anttattcaa tcgaaaaggt gcaggaatcc gtgatcgtgt ctgatgaaca 240 cgaaccaata caacgggcag taaccaattt atttaaacan aaaaggaatt ttcgaatctt 300 aatgaccgac cttgagaatg agggtgatcc gtccatgttc agtaagtggt ccgaaaaagc 360 aggacgaatg cagagtttat tcaatgagac cagagaccta gctcaaaacg caattgacaa 420 gcggcgacaa accgaattat gtgacaaggc gactaggtca tcccctgcaa gctacgctct 480 accaataaaa ttagaagttt ttacgggtaa tccccttcat tttccatcct ggaataccgc 540 gttcgacgcg ctcgtagatt cgaacactaa tacgtgtgct gcacagaagc tgaatttgct 600 gcagcaatac ctttcggggg accccagatc ggtggtggaa ggctacgttc tgttgcagtc 660 ggaggaggcg tacgcacaag caagaaaaat attaaaagaa cggtatggaa acgatagttt 720 agtcagtaaa tccttcacgg ataagctaaa tgcttggccc aaagtaaatg taaatgatgc 780 taagggtcta aggagattct ccgactttct caatcaaatt gtggccgcca aaagccaaat 840 caccgaatta ggggttcttg acttctccaa tgaaagtaca aaaattgttg cacggttacc 900 gccgtttatt attaacaaat ggcgtgacat tgtactcgca tacaaattat ctgaacatgg 960 aacttatcca ccattcgaaa aattagcaga atttgtgtct ttgcaatcgg agcgggagaa 1020 tatcccggaa ttgcaaggna ttggcttgac ctctgatcgc agagggaact tacaaagcaa 1080 accgaccact ccattntaca acgcgtcaag tgaaccgtgg aggtatcaac aacaggttaa 1140 tagatcgtcg ccaaatttga actgcacgta ttgtaaaagt tataaccatg gactaaacga 1200 atgcaatata tttatgaaat tgaatttatg tgatagaaag gattttctgc gaataaataa 1260 tttgtgttat gggtgtggtt caagctatac tcacattgcg agagagtgca cgcatcgggc 1320 tacatgcaaa atatgcaagc gttctcactt gacctcgtta cacgtttatt ggaccccaca 1380 attntattgc aatcgcctga aatctactcc aactgacata acccgcagac aagaaagttc 1440 gatgatagtt ccggtatggg taagggatat aagtcaacca caatattcaa tcctttgtta 1500 cagcattttg gatggtcaat cgaacacaac gtttatatcg gaggagatat gtcgacgatt 1560 gaataagaac ggaactccaa ctcacctgga tttaactacc atggctggac gagggcagat 1620 tgcacgcagt cgtcgaatan ccaatatcga ggtactgagt tatgacagaa aagtgaaatt 1680 taaaatagac caatcgtaca cctgtccaga aatttcacng gatcgttctc aaataccagt 1740 accgagtcac gctgaaaaat ggtcacattt aaaacagatt gcaaaattta tattacccct 1800 gcaagcgaac gcaccaatcg gaatgctaat tggaactaat gttcctggag ccatcagacc 1860 acgtgaaatt gtagcagggg gagaaaatga gccatatggt caaaaatctg ctctaggctg 1920 gggaattgtg ggtagctgtg taccaggaag cgataaaaaa aatttgcatc ggtgctatgc 1980 caccgacaat agcgcaagat tggccatacg gttaaacgaa ggatcgaaga cgaaagagat 2040 tattagtcca tcccgactga gacgatggat ggaaaccgaa tttgttgaac agcggtgtac 2100 cgacacaaaa agatactccc aacacgacag aatatttata aaatctatta gcacaaatat 2160 cacaaagcgg gaagatggac actatgaaat gccacttcca ttaatatatc aagatattac 2220 tatgaccaaa aaccgacctt tagccctgaa acgactatgg caacaaaaaa gacgtttcat 2280 aaaatagccg caatatgcaa aagattacat cgcttttcat gcaagacgtc attaaaaact 2340 gcnctgagaa gctttaaaaa ctggatcaac tatataccgc acactggaat atacaaccaa 2400 cataagcctg ggaaaatcag agtagttttt gattgcgctg cccgcttcgg tggtttatgt 2460 ttgaatgacg tattgctcca gggacctgac ttgatgaatg atttagtggg aatattatgt 2520 cgttttcgca ggcatgatat agcggtgact gcagatattc aaggtatgtt tcaccagttt 2580 ttcgtggagg cgaaatatcg ggatctacta cgatttttgt ggtgggaaga tggcaatata 2640 gataacccta ttactgaata tcgtatgaaa gtgcatattt ttggcgctgt tagttcgcct 2700 gcgtgtgcta attttggact acgacgagct gctgacgatg gagaactaaa atatagaaaa 2760 gacgctgcgg atttcgtaag aaatgacttt tatgtggacg atggaattac ctcggtaagt 2820 gaccgagcaa ccgcaatttc tctcataaaa cgaagcaaaa gtttgtgtgc agaggcggga 2880 ctgcgtttgc acaaatttac atccaatgat cgcatcgtat tggatagtat accagaagag 2940 gacagagcaa acagcgtaca gaggatagac atttacaaga atccgcttcc tgttgaaagg 3000 atattaggaa tatattggtg cattgaaaac gattcgttcc aattccgact tataatgaag 3060 gacgttcctc tcacttgccg tggcattctg tcaactataa gctcggttta cgaccctttg 3120 tcttttatag gtccagtaat gctagaaggt aaacttgtgc tacaacaact ctgccgggat 3180 aaagcggatt gggatgatat ccttccgagc gatttacaag tacggtggga gaaatggaga 3240 acagaaatgc agagtttgga gcaaatccaa atcaaacgtg tctttcatcc aaaagggttt 3300 ggtaaagtac aaaaattgga aatgcatcat ttctcggatg cgagtaacgt aggttgtggt 3360 cagtgctctt acgtacgttt ggtgaatgag aatcacgatg tgtgttgtgc attagttatg 3420 ggaaagtcca gagtagcccc cctaaaacct gtcaccattc cgcgactgga gttggtcgct 3480 gccgtgttgt cngctaaagt aagtaaatat cttggcgagc atttgccatg ggaaaaaaga 3540 aaggattatt tttggacgga cagcaaagtg gtacttggat atttatctaa cgatgcaaag 3600 aaattcgaag tttttgtagc aaaccgaaaa caggcgataa gagagattac tgacaaacgg 3660 agctggaatt atataccgtc aaagcaaaat acgagtgatt gtgccagtcg aggattacga 3720 ccaagtcaat tgacaagcga ttmaacctgg ctacgagcac cttcgttctt gtggaaaggg 3780 gaactggtca aatttgatgt cggagtggat gacgacaccc ttacaaatca acaaatcgcg 3840 acggaaagta aaaattcggt agttctgatc acagacggaa gggatgtcga cttggatttg 3900 gaagaacgat tgttccacct ttctacctgg tatagggcta aaagggcaat tgccaacgta 3960 ctactgttta aacataagtt gcaacgaacc ctggaaatac cgttttttac gcgaacacta 4020 ctgctggagg ctgaggcttt gatcttgaaa aatctgcagt gtcatagttt ttctaatgac 4080 ataactatcg tttcaggaat atccacagaa agagatttag atcgacacac atccaaacta 4140 aggaaacacc aattgaagaa aagcagttca ttatttcagc tcgacccatt tctcgatcag 4200 catggagtac ttcganttgg tggnagattg ggatatacta atattaacga taaacttaaa 4260 caccccataa ttattccaag aaatacacac atcactgagc tagtgatacg acactgccat 4320 gaaaaggttc accacatggg tcgaacgtca actcacaact atgtccgtca gagtggtttc 4380 tggatagtaa atggttcgtc tgctgtggcc cattatataa gtaaatgtgt ggaatgtcgt 4440 cgcctccgtg gatctttgca aactcaaaag atggccaatc ttccgggcga canattatat 4500 caaggtccac cgttcaccta ttgcggagta gattactttg gacccttttt aattaaggac 4560 aagagatcta tagtnaaaag ttatggagtt ttatttacat gtatgtcatc ccgagctata 4620 catatcgaaa cggcaattag tatggataca agttcgtgta taaatgcgct tcgacgtttt 4680 ctggccaggc ggggaccggt ggtacaaatc cgctgcgatt gcggcaccaa cctagttgga 4740 gctaacgatg aactgacagc caattgcatc atcaatcaac aagaactaag gcagttcctt 4800 ctcaggcaag agtgcgatat ggttgaattt aacttcaacg ccccacacgc cagtcatatg 4860 ggtggagtgt gggaaagaca gatccgaacc gtgcgtagcg tacttggcac cttattgttt 4920 cagtacggcc agcagttgga tgacgagacg ttccgaacct tcatgacgga agtggagaat 4980 atcgtcaact ctanaccatt gacgatagcg agtatgaccg atgctggatg tccagagccg 5040 ttaactccaa accacatctt aacaatgaag aagagaccgc ttttgagccc accgggagca 5100 tttgtaaaaa aacgatttgt acgttaagcg acgctggaga agtgtgcaat acctaaccga 5160 tcaattctgg actcgctgga gaaaagagtt cntgcacact ttgcaaccga gaacgaagcg 5220 gcaaagggag caaacgaaaa aagcaagtgg gggatgtagt aattatgaag aatgatgaca 5280 gcccaagaaa tcgatggcca cttgcacgaa tactggagng ttatgcaagt agcgacggac 5340 tcgtacggaa ggtgaaagtg ttagtggggt aacacgacag agatgacctt ggacgaagaa 5400 ttaatgcacc atcggtcatg ttaaggccaa tacacaaact tgttttattg tgcanggccg 5460 ataccgggga ttcccgaacg gggagccaag attaaaggtc actcagacta gacaatgata 5520 tcgtaaattg gatgttgaca tgtgcgaaca aaatataccg actacgcccc acggctttaa 5580 aaaaatctgt gtanattttt ggagagcca 5609 // ID Gypsy-264_AA-I repbase; DNA; INV; 5418 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-264_AA_; KW Gypsy-264_AA-LTR; Gypsy-264_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5418 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [2385-2885] - Reverse transcriptase CC Positions [4020-4487] - Integrase core CC 'ATCAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2169..4076,4080..5315) FT /product="Gypsy-264_AA-I_1p" FT /translation="MLVNHTGVSKEDRVVEDIKSKFSKVFDNDFSVPITSF FT EADLVLKEDKPVFKRAYDVPYRLRDQVVEHIENLEKDGVITPIEASEWASP FT VVIVVKKDKGIRMVVDCKVSINKLIVPNTYPLPVPQDLFASLSGSKVFCSL FT DLAGAYTQLRLSKKSRRFMVINTIKGLYSYNRLPQGASSSASIFQKVMDQV FT LKGLDGVFCYLDDVLIAGKDFEDCESRLYLVLDRLAKFNIKVHLKKCKFFV FT SSLPYLGHVLTEQGLLPCPEKVETIRRAKPPNNVTELKAFLGLINYYGRFI FT PNLSSRLSPLYSLLRKDTKYVWNDDCNEAFEDSKKLLLKPNFLEFYDPDKP FT IVVVSDACGYGLGGVIAHVVGKEERPISFTSFSLNDAQKSYPILHLEALAV FT VSTIKKFHKFLYGKKFTVFTDHKPLIGIFGKDGKNSIFVTRLQRYILELSI FT YDFEIVYRPSTKMGNADYCSRFPLPQEVPRALQREYIKSINVSGDIPLDYR FT TIAKETQHDEFLQQILSFMKTGWPKKLSQRFKDIYSNHQDLEELDGCLIFQ FT DRIVIPTKLQNSVLKLLHRNHSGITKFKQLARRSVYWFGINSDIESFVKHC FT RVCSQMAVVAKNSTQTSWIPTNRPFSRIHADFFYFQQKFLVVVDSHSKWLE FT VDYMRYGTDATKVKSKFMAIFARFGLPDVIVTDGGPPFNSKYLIDFWSQQG FT IQVLKSPPYHPSSNGQAERMVRAVKEVLKKFLLDPELKGLDIEDQISYFLF FT NYRNICSEDNRFPSEKLLKFTPKTTLDLVNPKSSFKKHLTTHTCEDSLIVK FT NDKPAVPDKFSQLRHGEPIYFKNFRATEVKRWLDAKFLKRLSPNVFQVSVG FT GRIYSAHRDQLKLKPRPPKTLVCGWKPSRKRQREDDEEYSDDSNSSDFYGF FT LADSFINEPQQQQTVEVGSSLSIPSTSDNSLSIPSSSSNSLSKHVTGNSLS FT VDEMASSGSSLLPSSGVQMPMSANSCIPTSSREDGAFEVSSGDYRSSSHGS FT SVPVDHRKSVDHQKASISRNVVQPRRSRRTKRQKQNSEFYYY" XX SQ Sequence 5418 BP; 1512 A; 993 C; 1271 G; 1597 T; 45 other; gtggcgaacg aggtgaaaat tttgatttta aaacgggttt cgctaagcat tttccgcgac 60 tcgcgagtga tatcgtttta attcgatccc cgtggtgtgc agtgagtttg gagcaattag 120 tggtgcaaag attttgccgc caaagtgcaa tattaatcta tcgcttacag atcacctcaa 180 gtgtggttga tcacgtgtgg tgcgataatc gttttattgc ctaattgctc tgcagaaggt 240 ttgagattat tgaaaatcgt ttgtgctatt tccagcaagt ttaatacagc acacatcacg 300 tgtttgaaaa ggtttagtga tctgctagtt tgtgctgagt gtgattcagg tatccaccat 360 tgttttttga tcaccaaaag gacgctaaag ttgtttgacc tgtgtgaacg accaacaaag 420 agcattcagg aggtcagttc gggtcagtag tgtgcactgg agaacaaata gatagtgaaa 480 aaaagcgttt tgttctacag cagtttcaac gttgtaagtg gcttgggtgg atcagccaat 540 ttctgcaatt gtttgtttgg acaagttttg tggatcatcg ttgagagttt tggcctacag 600 ctaccaggta tgttttaaat tgttttattc gtttttggtt taatttgttt tgttgtgatt 660 gcggttgtgc gtgtttgttt ttttttccga agttatttaa aaacttgagc gatctaagtt 720 tgttttcttc atcatttgtg tctctctggg agtagtttga aatttatgct agtacaacgt 780 attatgtgtt tgtagttgat tttktttttc gtaaaatatc aaaaattata ctaaatttag 840 aagttgatct tttttttcgt ttctccgttt gtttctaatc tttattaaag cgacaatgcc 900 aattatgagt acaattgaac cgttccatcc agatggaagc ataactttct cgcaatatct 960 ggatcaactg gagtggattt acttgcatca aaaagtagca gatggggata agaagactac 1020 attccttgca tcgtgtggta ccgctgtcta cagtgagctc aggttgctgt accctggccg 1080 aaatcttgaa agatattgaa tacaaagatt tgacagattc gctacgtaaa cgtttcgaca 1140 aaagtgaaag tgatttagtc caacgtatca aattttatgc gagggtgcag aaacctggcg 1200 aaaaagccgt agattttgtc ctggcggtaa agcagctggc agagtactgt aacttcggca 1260 cgtttaagga gacggcaatt cgtgacaggt tgctgtgcgg cgtttcaagc aagcagctac 1320 aggagcggct cctagatgaa gaggatttaa cccttgctag ggccgagcgg atcatagcca 1380 accgcgagga ggctgctgag cgattgacat caatcgggct agtctgatga atgaaccaga 1440 aacaggtaga gttagtgcca ttgaacgttt tggaggtagt aagagggttg gttttcgtcg 1500 agatggccca ccgccgcttc ttcacgtgct aggagtaggg gtcgtgagaa tcacgacckk 1560 cgtaaccgtt tgaacggcag ccgsagtmgt agcaggagcc kttctgmccc gaggaagagt 1620 ggtaaaaagg tctactwctg cacktactgc cgcaggaagg gtcacacccg caagtactgc 1680 tacgacctga agcaccakaa gccatcsgtw aagtckgtmg ccgtcgwgmc gaagaaacca 1740 gacctscckg asmggtttca agaagagcag ctaagtcgtc ggactckgaa gaggamatgg 1800 acgttctsat ggtktctgct agtcggcgwt swagcaccag cgaaccwtgc ctgatcgawg 1860 cwwkcgtsgg cggtgaaamc ttgcggatgg agatmgacag cggatctgcw gtctccgtga 1920 tcagcaggca ggcgtacggg magcgcttca aagacwascc gwtgggkasg tgtmgcmtta 1980 agctggcagt cgtcgatgga gcgcgtctgt tggtcgcggg tcgaatcgca gttttggcga 2040 aggttaacgg acgtcgcgga aaacttcctt tagtggtgct ggagagcgca aaggagatca 2100 caccgctgct aggcagggac tggcttgacg ttttcttccc agcctggcga gatgcattcc 2160 gaggaccgat gctggtgaac catactggcg tttcaaagga ggacagggtc gttgaagata 2220 taaaaagtaa gttttcgaag gtttttgaca atgacttctc cgttcccatt acaagttttg 2280 aagcagattt ggtactcaag gaagataaac ctgtttttaa acgggcttat gatgttcctt 2340 atcgtttgcg tgaccaggtt gttgaacata tagaaaattt ggagaaagat ggagtaatta 2400 ctccaatcga agcaagtgag tgggcttcac ctgttgtcat cgtcgtcaag aaagataagg 2460 gaatccgtat ggtggttgac tgtaaagttt ccatcaataa gctcatagta ccaaacacat 2520 acccgttgcc tgtgcctcaa gacttgtttg cttcgctgtc cggttcaaaa gttttttgct 2580 ctcttgatct tgcaggagca tatacccaat tgcgtctttc caagaaatcc agaagattta 2640 tggtgatcaa tacgatcaag ggtttatatt cgtacaaccg tttgcctcaa ggagcatcta 2700 gcagtgcttc tatttttcaa aaagtgatgg atcaagtttt aaaaggtttg gatggtgtat 2760 tttgctattt agacgatgtg ctaatagccg ggaaggactt tgaagattgt gagagtagac 2820 tttatttggt cttggatcgt ttggccaagt tcaatataaa agttcattta aaaaaatgca 2880 aattttttgt ttccagtttg ccttatttgg gacatgtgtt gacagagcaa ggtttgttgc 2940 cttgtcctga aaaagttgag acaattcgta gagccaagcc tccgaataac gtcacggagt 3000 tgaaggcatt tttgggtttg atcaactatt atggtcgctt cattcccaat ttatcctccc 3060 gccttagtcc actttacagt ttgctaagga aagatactaa atacgtttgg aacgatgatt 3120 gtaacgaagc ttttgaagat tccaaaaaac ttttattgaa accaaacttt ttagaatttt 3180 acgatcctga caaaccaata gtcgttgttt ccgatgcctg tggatatggt ttaggtggtg 3240 taattgcgca tgttgttggt aaagaagaaa gaccaataag ttttacttcg ttttcgctga 3300 acgatgcgca gaaatcgtat ccaattttgc atttggaggc acttgctgtg gtaagtacga 3360 taaaaaagtt ccataaattc ttgtatggaa agaaattcac agtttttaca gatcacaagc 3420 ctttgattgg aatttttgga aaagacggca aaaattcaat ttttgtaact cgtttacaac 3480 ggtacatttt ggagttatcg atttacgatt ttgaaatcgt ttatcgtcca tcaacaaaaa 3540 tgggtaatgc cgattactgc tcccggtttc cgttgcctca agaagtgcca cgtgctttgc 3600 agagagaata catcaagtca attaacgttt ccggtgatat tcctctggat tacaggacta 3660 tagcgaagga aacacagcat gacgagtttc ttcagcaaat tttatctttc atgaaaacag 3720 gttggccgaa gaaacttagt caaagattca aggatatcta ttcaaatcac caagatttgg 3780 aggaactgga tggttgttta atattccaag atcgtatagt aattccaacg aagttacaga 3840 attcagtttt aaaacttctt catagaaacc attcaggaat cacaaagttt aaacagttag 3900 ctagaaggtc ggtttactgg tttggaatca atagcgatat tgaaagtttc gttaaacatt 3960 gcagagtttg tagtcaaatg gctgtggttg caaaaaactc aacacaaacc agctggattc 4020 caacaaatcg cccttttagc aggattcatg cagatttttt ctattttcaa cagaaattkt 4080 tccttgtcgt agtcgacagc cattcgaaat ggttggaggt ggattacatg aggtatggta 4140 cggatgcaac aaaggtaaaa agcaaattta tggccatatt tgcaaggttc gggcttcctg 4200 acgttattgt gactgatgga ggccctccgt ttaactccaa atatctgatt gacttctgga 4260 gtcaacaagg catccaagtt ttgaaaagtc cgccatacca cccctccagt aatggccagg 4320 ccgaacgaat ggttcgcgca gtaaaggagg ttttgaagaa gttcctgcta gatcctgaat 4380 taaaaggttt agatattgaa gatcagattt catacttttt gtttaattat cgaaacattt 4440 gctcagaaga caaccggttt ccttctgaaa aattgcttaa atttacaccg aaaacgacat 4500 tggatttagt caatccaaag agcagcttta agaaacattt gacaactcac acttgtgagg 4560 attctcttat agttaagaat gacaaacctg ctgtaccaga taaattttct cagctacgac 4620 atggagaacc aatctacttc aaaaacttta gggctaccga agttaagagg tggttggacg 4680 ccaagttttt gaaaaggctg tctccaaatg tatttcaggt ttcggtcggt ggtcgaatct 4740 attcggctca tcgcgaccag ctgaagttaa agccaagacc gcccaaaaca ctcgtttgtg 4800 ggtggaagcc tagtagaaaa cgacagagag aagacgatga agaatacagc gatgattcga 4860 acagcagcga tttctacggt tttttggcag actccttcat caatgagcca cagcagcaac 4920 aaacagttga agtcggcagt agtttgtcga ttccttcaac tagcgacaat agtttgtcga 4980 ttccttcaag tagtagcaat agtttgtcca agcatgtaac aggcaatagt ttgtctgtag 5040 acgagatggc gtcgagtggc agtagtttgc taccttcatc aggcgttcaa atgcctatga 5100 gcgcaaattc ttgtatacca acaagttcga gagaggatgg cgcgttcgaa gtgtcgtccg 5160 gcgattatag atcttcaagt catggctcaa gtgtgccagt tgaccatcgt aaatccgttg 5220 atcatcagaa agcatcaatt tctcggaacg tcgtccagcc aaggcgatct cgtcgaacaa 5280 aacgtcagaa gcaaaacagt gaattttatt attattaatg tttaagtcta tatttgaatt 5340 tttaccagtt aagcttgtaa tatcgaattt taatggaatt gtaagcaacg tcttaagagg 5400 agagaactgt tgtatcag 5418 // ID Galileo_DM repbase; DNA; INV; 5989 BP. XX AC BK006357; XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 28-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Drosophila mojavensis transposon Galileo, complete. XX KW P; DNA transposon; Transposable Element; Galileo_DM. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5989 RA Marzo M., Puig M. and Ruiz A.; RT "The Foldback-like element Galileo belongs to the P superfamily RT of DNA transposons and is widespread within the Drosophila RT genus."; RL Proc Natl Acad Sci U S A 105(8), 2957-2962 (2008). XX DR EMBL/GenBank/DDBJ; BK006357; Positions 1 5989. XX FH Key Location/Qualifiers FT CDS join(1249..1569,1853..2626,2508..3167,3101..4579) FT /product="Galileo_DM_1p" FT /translation="MQTKCSVSMDIRGKGMKVLMRTKNMISRQACTHTCTL FT STCMTKCMTVREALAASCTHKEYIFPTSMHTHIYSMGFCLQPSRNSILSQY FT VRQMSVVVPFYMYLLLCMCQRNGGKCIIETSGASYRNSSIHSPVSLFAFPS FT KEYYLNKWIIACNLPNNFDRKRARICNRHFERKYIGKRYLKANAVPTLHLG FT NSNLISNNNADVSDDIYSLDIQEEAITPHSYQKKCLNKVVDDFMKPSDTEQ FT QHQSSNIQTQSDGEENLENFLSFDNSLQNQLWQDLSASRSSFCSNFLKREQ FT NEVYYRKKYYEMKVALQNVQNKNYNLKKRYSFLRNAHRQRNIYQRARKEKN FT NLPHVSDQSKVLCKMLFKKSGLEMLIGKETFTKEPERRKTIYLMFQTNQRS FT YVKCFLKNQVYNSAEKVISQNMHFYSARAYDYLRDVLNLRLPCKKQLNRWA FT ILKNLVPGFNPELLENLKDIVGKMSAKEKYTVLVCDEIKVKRGLQYNSSLD FT EIQGFENDGIKRTKFLGQQVCVFLVRRLFDNWKYVLSYTVSARGINHTDLK FT KKFEENIGLSQALGLKSKPLFAIRDPIIKQCLIDGVLIVKAVVCDQGSNNK FT AVFNRWGIDLNNHSFEVNGEKFFAIFDAPHLVKSLRNILLKNNILAPEGTV FT SWGIIRRLYETETKNLTRLCPKLTLKHVSPNCFEKMKVKFATQIFSHSVDA FT AIRTVVETGGFADCKDSAVATAIFIDTINNLFDCLNSHVLFDSNPYRCALR FT ENNNVHEYLQEMRDYFLNLHYPHKVYCIDGMLITISSVIALAEDIWNNNTD FT IFFVATSKLNQDPLENLFYLIRCRGGTNSNPTVFEFNTIISKMLSMKMLTS FT ASVSGNCIPDEDLMLANIIKDSGSQLSVFHEQCNSCHTPTEIEPLEDDLEI FT ELSLDTTIANIKNDFNENALRYFAGYLLHKLLQNTDCEVCTNLLKSSDEMQ FT CSSEYLILNKNFHYINRYLKLKAPSDHFYNLIKLHFESFRKIFEKKPYIAR FT IKEKIVLYCMHATAKSSLDNEWFSPTHPCFEHRKSILNQFVLILIRKNCKW FT QTEKIVGKTSISKRKLKIIHQ" XX SQ Sequence 5989 BP; 1964 A; 1022 C; 1118 G; 1885 T; 0 other; cactgaccag agaacacata gactagacat gtagtacaaa gtttttcaac accgaaaaat 60 ctgtctccta gggcctatca atttcgtatt tgctcggctc ttcacttttt aggttcgggc 120 tacgaaaata tgcgaaagta caatcattcc gcttcatgcg ctcgaccgag atggtcaaat 180 gttgaataaa agatgcctct atgctcaatt tttgttcttg ctctgtcgct ctctctgctc 240 tctgctctct caatatgtat atgtatgtat gcatgtgagc acaagcagca gctcacgaaa 300 atcgaaattt cttcgcgtta gactgccgtt tcttccctta cgtgaagtgt attccgaact 360 cttcaagccg tttgccatgc gttttcgtta ttatttgtgg gaagggtatg aaagcgttgg 420 tgcgcataat gaaaatgttt tcgcgattag catgcataca cactctcgct tgctacatgt 480 actatgagca tttgtatgcc tgtgtgcgag gctcttgccg cctgtagcaa ttttacagtc 540 cagcatttgc atttggtcga tcttgtgaag tcctcacata catatgtatg tatgtatgta 600 tatacatttg attgtgtgta cgcatgggat gtgctgttca caattatggg atatcacttg 660 tagaagcatt tgtatgccta tgtgccttta catgtgcgtg tgagtatatc tttgtttgta 720 tgcttgggag atgaacattg gcgtgtgcct acctcaactt ttaatggtgt gaacgatttt 780 gggttttaat tgaagaggcg cgcttatatt taaatagcgg actaactaaa gtgacaggtg 840 cacttcccat tacttttgat acgtttttta cttgcatgca cctgtatatg tatgtacata 900 tatgaacggg tacaggggat cgaaaatttt atcccatttt ttccactgag tttaatcttt 960 aaaaatgcat tctgataggc aaacccacag tcattgaaat aaaatcaggt tcttaaagat 1020 aagcgcgagt tacccagtca ctataccaca cacatatatg tacatacatg tgtgtatgag 1080 ttactcgccg gcccgttcta tgcgctctgc gttttttgtc cttctctctc tctctttatg 1140 ggctctcaaa tcatcaagct cagcatcgca caaaaatgaa atttgttcgc gacgatcccc 1200 catttgagtg aagattacac cgaaatttta aaagcgtttg ccacgcccat gcaaacaaaa 1260 tgtagcgttt cgatggatat tcgtgggaag ggtatgaaag ttttgatgcg cacaaagaat 1320 atgatttcac gacaagcatg cacacacact tgcacactta gtacatgtat gaccaaatgt 1380 atgactgtga gggaagctct tgctgccagt tgtacacata aagaatacat tttcccgacg 1440 agcatgcaca cacacattta tagtatgggc ttttgtctgc agccgtcgcg aaactcaatt 1500 ctctctcaat acgttcgtca aatgtccgtt gtagtcccgt tttatatgta tttgctatta 1560 tgtatgtgtt agtgtgggaa acgattctta gtaagaattt cttactgcct gtggtatatt 1620 gtaagtgtgt atgttcataa tacgtgaaca ttggcgtgtg gctgacacgg gcttttaatt 1680 gtgcgagcga tttcgaatta aatctgcaaa agggggtgct aaccgacctt ttattatcac 1740 ttcagagttt tctattgtta attctgccaa aatttatata agtttttttt ttttaactat 1800 ggctgaaaca agtgctataa aaaaatcaac cgttggtgtc aaaaaatgtt aacagagaaa 1860 tggcgggaaa tgcattattg aaacgagtgg cgcgagttat aggaacagtt ccattcattc 1920 tccggttagc ttatttgcgt ttccgtctaa agaatattat ttaaacaagt ggattatagc 1980 atgcaattta cctaataatt tcgaccggaa aagggctaga atatgcaaca ggcattttga 2040 acgaaaatat attgggaaac gttatttaaa agccaatgca gttccaactt tacatttggg 2100 caattcaaat ttgatttcaa ataataatgc tgacgtaagt gatgacattt attcattgga 2160 catacaggag gaagctatta ctcctcatag ctatcaaaag aaatgcttaa ataaagtagt 2220 ggatgacttt atgaagccat ctgacacaga acagcagcat cagagcagca atattcaaac 2280 tcaaagtgac ggggaagaaa atttagagaa ttttctgagc tttgacaata gtttacaaaa 2340 tcaattgtgg caagatttaa gcgctagtag gtcttctttt tgttcaaatt ttttgaaaag 2400 agagcaaaat gaggtatatt atcgtaaaaa atattatgaa atgaaagtag ctcttcagaa 2460 tgttcagaat aagaattata atttgaaaaa aaggtattca tttttaagaa atgctcatag 2520 gcaaagaaac atttaccaaa gagccagaaa ggagaaaaac aatttacctc atgtttcaga 2580 ccaatcaaag gtcttatgta aaatgctttt taaaaaatca ggtttataac agcgctgaaa 2640 aagttatttc gcaaaatatg catttctact cagctagggc atatgattac cttagggatg 2700 ttcttaattt aagactcccg tgtaagaaac aattaaatcg ctgggcgatt ttaaaaaatt 2760 tagttcctgg atttaatcca gagctattag aaaacttgaa ggacattgtc ggaaaaatga 2820 gcgccaagga aaaatatacc gtattagttt gcgatgaaat taaagtaaag agaggtcttc 2880 agtataattc atctcttgac gagatacagg gatttgaaaa cgatggcata aaaaggacta 2940 agtttttagg acagcaagtc tgtgttttcc ttgttaggag gctgtttgac aattggaagt 3000 atgttttaag ctatacggtt tcagcccgtg gcataaatca tactgaccta aaaaagaaat 3060 ttgaagaaaa tattggacta tcgcaagcat tggggcttaa gtcaaagccg ttgtttgcga 3120 tcagggatcc aataataaag cagtgtttaa tagatggggt attgatttaa acaatcatag 3180 ctttgaggtt aatggtgaaa aattttttgc gatattcgat gcaccgcatc ttgtaaaatc 3240 ccttagaaat attcttttaa agaataatat tttagcacca gaaggtacag tttcatgggg 3300 aataattaga agattatatg aaactgagac caagaattta acacgtttat gtccgaaatt 3360 gaccttaaaa catgtcagtc cgaattgctt tgaaaaaatg aaagtgaaat ttgccacaca 3420 aatctttagc cacagtgtag atgctgcgat acgcacagtt gtcgaaacag gtgggtttgc 3480 cgattgcaaa gatagtgcag ttgcaacggc aatatttata gatacaatta ataatctttt 3540 tgattgctta aatagtcatg tattatttga tagcaatcct tatagatgtg cccttaggga 3600 gaataataat gtccatgagt atcttcagga aatgcgtgat tatttcctaa atctgcatta 3660 tccacacaaa gtatattgca ttgatgggat gttgattaca atctcatcgg taattgcttt 3720 ggcggaagat atttggaata acaacactga tattttcttt gttgccacat ccaaattaaa 3780 tcaggacccg ttggagaatt tattttattt gattagatgt cggggaggaa caaatagtaa 3840 tccaacagta ttcgaattca atacaataat atctaaaatg ctatccatga aaatgttgac 3900 atcagcttcg gtatctggaa attgtattcc agatgaagat ttgatgttgg ccaatatcat 3960 aaaagatagc gggtcccaat tatctgtttt tcatgaacaa tgtaattcat gtcacactcc 4020 aaccgaaatt gaacccttag aagatgattt agaaatagaa ttgtcattag atacaaccat 4080 tgccaatatt aaaaatgatt ttaatgaaaa tgcattgcga tattttgctg ggtatcttct 4140 acacaaactg ttgcaaaaca ctgattgcga agtttgtaca aatcttctaa aatcatcaga 4200 cgaaatgcag tgttcttctg agtatttgat actcaataaa aattttcatt atataaatag 4260 atatcttaaa ttgaaggccc cttctgacca tttttacaat ttgataaagc tgcatttcga 4320 atcatttaga aaaattttcg aaaagaaacc atacatcgct cgcattaaag agaaaattgt 4380 tctgtactgc atgcacgcta cagcaaaatc atctttagat aatgaatggt tttctcccac 4440 ccacccttgt tttgagcata gaaaatctat tcttaatcag tttgtcttaa ttctaattag 4500 aaagaattgt aaatggcaaa ctgagaaaat tgttggtaaa accagcataa gtaaaagaaa 4560 attaaaaata atacatcaat agggaatttg actttatatt aattttggca caattctcag 4620 cagtagaaaa cagtgaagtg ataataaaag gtcggttagc attctccttt tcagatttaa 4680 ttcgaaatcg ctcgcacaat taaaaaccct tggtaggcat tcgccaatgt tcacgtatta 4740 tgaacataca tacatacata catatgtatg catgcgcata cataatatac atacatacgt 4800 acttaaattt ttcacaggag gcatgactag agattcgcac atatgtacat acatacatat 4860 gtaattggat gtgtatgcat gtaagtaaaa acatactcat tatgcacagt acaaataata 4920 acaataatct taaaatctga cattatcatt gtaaactctc aacagtatgg caatggagga 4980 aatgacgacg aacccatcgt cgccaagaag tcataatttt tgaacgcaac tggcaaacac 5040 acatagaaat ttatatttct aaattgatta ttaatttatc ttgttaatgt tattgaatta 5100 taaacttatt gaattatctt gttaatttta ttaaattatc cttgttttta ttgatttctt 5160 tatattaaat ttaatgtttt ctatttttat atttattaaa ttagtgaaat aatattaaac 5220 agattcctat ttaaaacaac tgtttttatg aattgttaat agaaaataaa aggtacttga 5280 atacaagcgc gcatcttcaa ttaaaaccct aaatcgttca caccattaaa agttgaggta 5340 ggcacacgcc aatgttcatg tcccatgcgt acacacaatc aaatgtatat acatacatac 5400 atacatatgt atgtgaggac ttcacaagat cgaccaaatg caaatgctgg actgtaaaat 5460 tgctacaggc ggcaagagct tcgcacacag gcatacaaat gctcatagta catgtagcaa 5520 ccgagagtgt gtatacatgc taatcgcgaa aacattttca ttatgcgcac caacgctttc 5580 atacccttcc cacaaataat aacgaaaacg catggcaaac ggcttgaaga gttcggaata 5640 cacttcacgt aagggaagaa acggcagtct aacgcgaaga aatttcgatt ttcgtgagct 5700 gctgcttgtg ctcacatgca tacatacata tacatattga gagagcagag agcagagaga 5760 gcgacagagc aagaacaaaa attgagcata gaggcatctt ttattcaaca tttgaccatc 5820 tcggtcgagc gcatgaagcg gaatgattgt actttcgcat attttcgtag cccgaaccta 5880 aaaagtgaag agccgagcaa atacgaaatt gataggcctt aggagacaga tttttcggtg 5940 ttgaaaaact ttgtactaca tgtctagtct atgtgttctc tggtcagtg 5989 // ID hAT-25_SM repbase; DNA; INV; 2988 BP. XX AC . XX DT 13-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-25_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2988 RA Bao W. and Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 74-74 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 691..2607 FT /product="hAT-25_SM_1p" FT /translation="MPHMKGKEWDHVIVLSETANNFQVKCIYCSKVFWGSG FT NRIRAHMGIETATGVAKCVKVPVEILECFKAADAVKQSEKQENARKRNLDK FT AVTSTSLSSSSAKQPKLTNMFKNIEKAEVDESIARMMYSTGISFNVVNNKH FT FREMCSQIGKFGPAYKPPSDHPIRTTLLDKEYAKVQNRVQTSIFTDLNLKM FT GTIVSDGWSDAQQKPLLNILLVTSSGSTFIESIDTTGNTKDSGYIAQVIMS FT SIEKIGPELILQVITDSAANCKAAWQIIAKKFPKIVCGPCSAHCLDLLLED FT WGKLIWISSILKDVIAVVKFIKGHDGSRAMLKKHSPNKFLLHPAETRFGTC FT VIMTQRLVELKDALQEMIVSREYKAWLSNKSYKIAGEEVSVSVLSEPFWKK FT CQLYLDINKPVFELLRLVDGDAPVTGKIYFKMFTIQESINNFPNISQAQRK FT ELYDSFARRWAMMHTTLHAAGFLLDPEYVNMAQHANDEVMTGFYQLVELLH FT PDVEEQVIITNQLNNFRSSQGIFARPVAKAAASTMPAHQWWHSFGSGVPEL FT QRFAIRVLSQTATSSAAERNWSLFGFFQNKRRSRLNPKTIEKMVYIHANTR FT LMDKVEEVDYVEENVKWNEPNDDISDTSEDSVQEESDSPDD" XX SQ Sequence 2988 BP; 965 A; 496 C; 596 G; 931 T; 0 other; aaagttcgac cgtacgttag agcagaagat agcgtacgcg tacggtacgt tcgtattccg 60 tacgtatgag cagcctttaa tattagttgc ttctcgtgac gtcacatttt tttctcgtca 120 ttttctcgtc ttttttttct cgtctttatt acgtcacttt agagttttta tttgtagttt 180 cgtttcagcg aagaggtaaa ttattatttt ggttttattt tgtataattt gattatttta 240 tctacagaaa taaaacaggc tgacggattc atagtttttt tgcatgggtt agtacggccg 300 cgtggcaata acgcacatgg cgagtgcagg ctgcgccttg taaaggaata atacacgatg 360 ttggccgcag tcaatgtgtt aaggcacttg aaaattagcg gagttgcggc aatttaggtt 420 tcgtgtctgc agtgttgaag tttggtgtcg ttgttctaac ttccgcggat gcatcatgca 480 atcttatcat gtataataat aatagtatat cattttaagg ggtaatcata atttaatgta 540 aatttttgta atttataaat attttttatt tgatttgtgt tataactaat aacttaataa 600 gcatttaaaa gaattttatt ctatggattt tcagaaaatc attttctcta tttaatattg 660 tttacaactt tacattacag aaaataaaaa atgccccaca tgaaagggaa agagtgggat 720 cacgtcattg ttctttcaga gaccgccaat aactttcaag tgaaatgcat atactgttcg 780 aaagtatttt ggggtagtgg taatcgtatc cgagcacaca tgggtattga aactgcaaca 840 ggcgtagcca aatgtgtcaa ggtgccagtt gaaattctgg aatgcttcaa agcagcagat 900 gctgttaaac aatctgaaaa acaagaaaac gcaagaaaac gaaatctaga taaagcggtt 960 acatccacat cgttgtcctc atcgtccgca aagcaaccga aactaaccaa tatgtttaaa 1020 aatattgaga aagctgaagt ggatgaatct attgcaagaa tgatgtatag cacaggtatt 1080 agtttcaatg ttgttaacaa caaacatttt agagagatgt gctcacaaat cggaaaattt 1140 ggaccagcat acaagccacc atctgatcat cctatacgaa cgacattgct tgacaaagaa 1200 tatgctaaag tacaaaatcg agtccaaacg tctatcttta ctgatttaaa tctcaagatg 1260 ggtactattg taagtgatgg ttggtctgat gcacaacaga agccattatt gaacattttg 1320 ttggtgacct catctgggtc aacatttata gaaagtattg atacaactgg taacacaaag 1380 gattctggtt acattgccca agtaatcatg tcatcaatag aaaaaattgg acccgagttg 1440 attcttcaag ttatcacaga cagtgcagct aactgtaaag cagcatggca gataattgca 1500 aaaaagtttc caaaaattgt ttgtggtcca tgttctgccc actgccttga ccttttactc 1560 gaagactggg gtaaattaat ttggatttct tcaattttga aagacgtgat tgctgttgta 1620 aagtttataa aaggacatga tggctcacga gctatgttga agaaacattc tcctaataag 1680 tttttgctcc atccagctga gacacgcttt ggcacttgtg ttatcatgac tcaacgttta 1740 gtggaactta aagatgcttt gcaggaaatg attgtgagca gagaatacaa ggcgtggtta 1800 agcaacaaga gctacaaaat agcgggtgaa gaagtatctg tttcagttct gtctgaacca 1860 ttttggaaaa agtgtcagtt atacttggat ataaacaaac cagtttttga actgctacga 1920 cttgttgatg gtgatgcccc tgttacagga aagatttatt tcaaaatgtt tacaatccaa 1980 gaatctatta acaactttcc aaatatatca caagctcagc gtaaagagtt gtacgattca 2040 tttgctagaa gatgggccat gatgcacaca acgcttcatg ccgctggatt ccttttagat 2100 ccggagtatg tgaacatggc tcagcatgca aatgatgaag tcatgactgg attctatcaa 2160 ctagtagaac tattgcatcc tgatgtggag gagcaagtca ttataacaaa ccaactaaat 2220 aactttagat caagtcaggg aatatttgca cgcccagtag caaaagctgc agcaagtacc 2280 atgccagcac atcaatggtg gcacagtttt ggatcgggtg taccggagct ccagagattt 2340 gctatacgcg tattaagcca aaccgcaact tcgagtgctg ctgaaaggaa ctggagctta 2400 tttggtttct tccagaataa aaggaggtct cgactgaatc cgaagaccat tgagaagatg 2460 gtctatatcc atgccaacac acgtttaatg gacaaagtgg aagaagtgga ttatgttgaa 2520 gagaacgtga aatggaatga accgaatgat gatatcagtg atacgtcaga agactcagta 2580 caagaagagt ctgattctcc agatgattag aatttagata cagttttgca actttataca 2640 ttcattcttt ataatactgt tgatattata gtgataaaga tgaatatttt aataaaattc 2700 aaaagtcact gctggaattt aatcaattga tgaccgtagt ttattgttag aaaattggaa 2760 taaaaataca caattgttca cggaattgtt ttaaaacgtt ttccattctt taatttttaa 2820 ttttgttatt ccttgttaat tctataataa aaaaaatatt tcgactaaca cgtaatgcag 2880 aaaaattagc gtacgctatt tattaattta gccaaaaaat agcgtacgcg tacggtacgc 2940 atttcagcag aaaaaatagc gtacgttccg tatgtacggt cgaacttt 2988 // ID Copia-17_DPu-I repbase; DNA; INV; 4542 BP. XX AC scaffold_44; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_DPu_; KW Copia-17_DPu-LTR; Copia-17_DPu-I. XX NM Copia-17_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4542 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 697-697 (2010). XX DR Genome; scaffold_44; Positions 996399 991858. XX CC Positions [1857-2276] - Integrase core CC 'CTTTC' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 3247..4476 FT /product="Copia-17_DPu-I_1p" FT /translation="MEVAQFDIKTAFLYAKLDKIIYTEGFVVKGRENDVCL FT LEKSLYGLRQAPLLFFKENDKVMHKFGLESCDGDRCIYIRRTGDELTIVIT FT HVDDSFVASTRKEVLVDIATHMSNNFTISLVPPTRYVGLNIHRDRATRRIF FT LSQSHMVEKLCKRFKMSDLPPKSIPADPSVHLASDNVSKGEGEKTAFPYPY FT REAVGALFYVALMTRPDIAYDVGQVSKYCQNPNQSHWNAVSQIFSYLVGTK FT DYGIWLGGSQSGLVGYTDANFAGNRDDCRSTSGGIFFLHGGPVAWFSKKQP FT NIAQSTTESEYIALCEGTKSIVFQCRLQEDFTGVEQLKIPIMCDNQSAVRL FT SYNPEYHHRTKHILVRYHYTRQKVNEGKIEIKYIPTEDQLADILTKPLPGP FT RFTKLRYRIGVRKHTD" FT CDS join(42..1853,1857..2954) FT /product="Copia-17_DPu-I_2p" FT /translation="MAENSEGKRDFTKFNGVNFPQWKFAVMLILKQKKLYG FT IVLGYDRKPNEVCQEMEVERVKVTVITNQNEIDTWIDKDISAQGIIFYNIE FT PSFQVALEGSGSSNEMWNRLILQYAQVAIANADFLLGKFQLYKMDPDHSVM FT AHINRLKMMADELNSIECHVSEQALIVRMLQTLPPSFRHFLSVWDSVPSAE FT RNLVNLTARLVTEELRSKSLNNGQANPVDVAFLASHPNRVQKEAVANAAHS FT KDLKRDYNPRNNNSNYRDNRQNYRDDSRCQSSRHRDNRPNQRNYHDNHRGG FT RSGNRDRGCFICGRTNHKAINCWKRKNDERREARGNKFDRNRDSKHDDNNN FT DDSKESFAALSSVCFLARKPSYWYADTGASHHMTNQRSYFTSFEEISDSWT FT VNGIGGVQVKALGVGTVPIVSYINGESKAGTLHEVLFVPDLGTNLFSVGIA FT TDHGIEAHFAKDGATFVKNGIEVMSGKRLGKSLYHLNVTATNSFEDCSTAI FT VATRSNRLLPLSLIHQRLAHLNNAAILKMSRSNAVIGLNLDPSTKHTPCEG FT CIFGKSCRTPFCSSTTKSTGVGHIIHSDIGEVPVPTPNNEIYYVVFKYDFS FT NWTSVVMKKKSDATKLLMKFIAFVKRATDRNVKIVRTDGGKEYDNDAINDF FT FASEGIVHQITNPYTPQQNGVSERLNRTAMESARSMMHIRTNKFTNLFKKA FT DHSILELWGEFLRSAVYVLNRTLSCSSSSNSSSKMPHELFFDQKPDISHLR FT IIGCHGYPLIPKAKHRKLDPKGIPCWLVGYGEETKGWRLWNPVTRKIILSR FT DVTFDENLLISDFKEDSDPHATRQSQPNLINPYLLASKILGLPTECNSQLN FT TPAEKEQHVSENPATELMEQDLHQQDPAQEDTAPADENAELGLPQEPSATA FT PEPQPTAPQRNDAENLRRTTRSHKYTARYEEFRRSLGLTATINEFKNGHFS FT ALFTESFKPQATRRH" XX SQ Sequence 4542 BP; 1443 A; 1189 C; 954 G; 956 T; 0 other; ggttatgggc ccagctgacg cccacacaat taaatacaaa aatggcagag aattccgagg 60 gtaaacgtga cttcaccaaa ttcaatggag tgaattttcc gcaatggaaa tttgcagtca 120 tgctgatact gaagcaaaag aaactgtatg ggatagtcct gggttacgac aggaaaccaa 180 atgaggtatg tcaagaaatg gaggtagaaa gagtgaaggt cactgttatt accaaccaga 240 atgaaatcga cacatggata gacaaggaca ttagcgcaca aggaatcatt ttctacaaca 300 tcgaaccgtc tttccaagtt gcactcgaag ggtcaggcag ctctaacgag atgtggaaca 360 ggctcatact acaatatgct caagtggcca tagcaaatgc tgatttcctg ctcggaaagt 420 ttcagctgta caaaatggac cctgatcact ctgttatggc acacataaac agactgaaaa 480 tgatggcaga cgaattaaac agcatagaat gccacgtatc tgaacaggca ctgattgtga 540 gaatgctaca aacgttgccg ccaagtttca ggcatttctt gtcagtgtgg gacagcgttc 600 catcggccga gaggaatctg gtaaacttaa ctgccagatt agtcactgaa gaacttagat 660 cgaaatcact caacaacggt caggccaacc ccgttgatgt tgcgttcttg gcatcccacc 720 cgaacagagt gcagaaagag gcagtagcca atgctgccca cagcaaagac ctgaaacgag 780 actacaaccc acgtaacaac aactccaact atcgagacaa cagacaaaac taccgcgatg 840 attctcgatg tcaatcaagc cgtcatcgag acaaccgacc aaatcaacgc aactaccacg 900 acaaccaccg aggaggaagg agcggaaata gagaccgagg ctgtttcatc tgtggtagga 960 ccaaccacaa agccatcaac tgctggaaaa ggaaaaatga cgagaggaga gaagccagag 1020 gcaacaaatt tgaccgaaac cgtgactcca aacacgacga caacaacaac gacgactcaa 1080 aggagtcgtt tgcagcccta tcctccgttt gttttctcgc tcgaaagcct agctactggt 1140 acgccgacac aggtgcatca caccacatga caaaccaacg ctcctacttc acctcatttg 1200 aggaaatctc tgactcctgg accgtaaacg gaatcggcgg cgtccaagta aaggcccttg 1260 gtgtcgggac cgttccgata gtctcctaca tcaatggaga aagcaaagcc ggcacactac 1320 atgaagtcct ttttgtccca gacctgggaa ctaacctttt ttcggttggc atcgcaactg 1380 atcacggtat cgaagcccat tttgcaaaag atggagcaac tttcgtgaag aacggcattg 1440 aagtcatgtc tgggaagaga cttggaaaat cattatacca ccttaatgta actgccacaa 1500 actcctttga agactgttca acagccatcg tcgccacaag gtcgaatcgt cttctcccgc 1560 tgtcactcat ccaccaacgg ttagcccacc tgaacaatgc agctattctc aaaatgtcaa 1620 gatccaatgc cgtcatcggc ctcaatcttg acccgagcac caaacatact ccctgtgaag 1680 gatgtatatt cgggaaatca tgtagaacgc cattctgctc cagcacaaca aaatcaaccg 1740 gcgttggcca catcatccac tctgacatag gagaagtacc ggttccaact cccaacaacg 1800 agatctatta cgtcgtgttc aagtacgact tctcaaactg gacatctgtc gtttgaatga 1860 aaaagaaatc cgacgccact aaacttctca tgaaatttat tgcattcgtc aaacgagcca 1920 ccgatcgaaa cgtaaaaatc gtcagaacag acggtggaaa agaatacgat aacgacgcca 1980 tcaacgattt cttcgcttca gaagggattg tgcatcagat caccaaccca tacacccccc 2040 agcagaatgg agtttccgag cgccttaatc gcacagcgat ggaatctgcc cgaagcatga 2100 tgcatattag aacgaacaaa tttacaaacc tattcaagaa agctgatcac tcaatcctag 2160 aactgtgggg agagtttttg agaagtgctg tctatgtcct gaatcgcacc ctgtcctgtt 2220 catcatcatc caattcgtca tcaaaaatgc cgcacgagct attctttgac cagaagccgg 2280 acatcagtca tctccgcatc attggttgtc atggataccc actcatcccc aaagccaaac 2340 acagaaagct agatccaaaa ggtattccgt gctggctagt tggatacggc gaagaaacca 2400 aaggatggag gctgtggaat ccagtcacga gaaaaataat cctaagccgc gacgttacgt 2460 ttgacgaaaa tttactcatc agtgacttta aagaggactc cgatcctcac gcaacaaggc 2520 agtctcaacc caatctaatt aatccatatc tactagcatc gaaaattctc ggtttaccaa 2580 cggaatgcaa tagtcaattg aacactcccg ctgaaaaaga gcaacacgtg tcagaaaatc 2640 cagccacaga actgatggaa caagatctcc accaacaaga tccagctcaa gaagacactg 2700 caccagcaga cgaaaatgca gagcttggtt tgccccaaga accttctgct acagctcctg 2760 agccccaacc gactgctcca caacgcaacg atgccgaaaa cctccgtcga acgactcgtt 2820 cgcacaaata caccgcaagg tatgaagaat tccgaagatc acttggcctc accgcaacaa 2880 tcaacgagtt caaaaatggc catttctcag cgttgttcac cgaatcattc aagccccaag 2940 ctacaaggag gcactaaaat ccgatcaagt ggagaagtgg atgaaagcct ttgaggaaga 3000 gtacagctcc ctaattgaaa acaaaacctg gagactagtt accctccccc aggatgcacc 3060 acattgaact gtaaatggat tggaaaagtt aagcccgtgt atgacaacat ccaagaaaga 3120 tacaaaggaa tactagtagc agtcggctgt gcacaacaac cagggagaga ctacgaccaa 3180 ctctattctc cagtaccgca ccacgaatca attaaagcag ccctcaccga gatcgtcttc 3240 cggaacatgg aagttgcaca attcgacatc aagaccgcgt tcctatacgc caaactagac 3300 aagatcatct atacagaagg tttcgtcgtg aaaggaaggg agaacgatgt ctgtctactt 3360 gaaaaatccc tctacggact cagacaagcc cctcttctat tttttaaaga aaacgacaaa 3420 gtaatgcaca aatttggact agaaagctgt gacggagatc ggtgcatata catccggcga 3480 actggagatg agttaactat agtcataact cacgtcgacg acagttttgt cgcaagtacc 3540 agaaaggaag ttctcgtaga catcgcaacg cacatgagca acaatttcac gatcagtttg 3600 gtaccaccaa ctcgctatgt tggcttgaac atccaccgcg accgcgcaac gaggaggatt 3660 ttcctgtccc aatcacacat ggttgagaaa ctgtgtaaac gcttcaaaat gtcggacctt 3720 cccccgaagt cgatcccggc cgacccaagt gtccatctcg catcagacaa cgtctcaaag 3780 ggcgagggag agaagactgc ctttccgtac ccctaccgtg aagcagtcgg cgccctcttc 3840 tacgtcgcac tgatgactag accggacatt gcatacgacg tcggacaggt atcaaaatat 3900 tgccagaacc caaatcagtc ccactggaac gctgtgtctc aaatcttctc ctacctcgtc 3960 ggcaccaagg actatggaat ctggctaggt ggaagccaaa gtggactcgt cggatacacc 4020 gacgctaact ttgccggcaa tcgagatgac tgccgttcaa cctctggagg aatctttttt 4080 ctgcatggtg gcccagtagc gtggttcagc aagaaacagc ccaatatcgc acaatcgacc 4140 acagaatccg agtacattgc cctgtgcgaa ggcacaaaat ccatcgtctt ccaatgccga 4200 cttcaagaag atttcaccgg cgtcgaacaa ctaaagattc caattatgtg cgacaatcaa 4260 agcgcagtac ggctgtccta caaccctgaa taccaccatc gcacaaaaca catcctcgtc 4320 agataccact acacacgcca gaaggtcaac gaaggtaaaa ttgaaataaa gtacataccc 4380 actgaagatc aactcgcgga catactgacc aaaccccttc ctggccctag gttcaccaaa 4440 ctgcgctaca gaattggagt gagaaaacac accgattgag tagcaaaatc tcgacccctt 4500 tttgaataaa tctatcttcc gtttgtttgt tttgagggag ag 4542 // ID Gypsy-3_DPer-I repbase; DNA; INV; 5851 BP. XX AC super_615; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_DPer_; KW Gypsy-3_DPer-LTR; Gypsy-3_DPer-I. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5851 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_615; Positions 2259 8109. XX CC Positions [4897-5406] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 996..2552 FT /product="Gypsy-3_DPer-I_2p" FT /translation="MSLDWSFDEIDQNDHSDIICCICVRQVEYRTQLLRTS FT CNHVFHLVCFNTRRGEKTVCPACNKPLQSSAPSLAEELAVAQASASNVESS FT QLRMRTRSQRVTAGPAANTENLPSVGPTEVGVPIMPVQSSEPANEASPGLA FT QMAQALTDAMGQIGLLSQAMTQMTQTMEIGFQRLASPTVQPPNDQAAGSSQ FT PSGENQTFEQLFQIPTRESVGAPINQANDSNVSVTPWGSQQIAGLRPDRIG FT QIIAGWKVRFSGNSSMSVEDFIYRIEALTKQTLEGNLDLVARNAIYLFEAS FT ASEWFWRYHKSVPSVTWPQLSTALKTRFRDGRTDLEIRTAIVQRKQKPHEL FT FDTFYEAIVTLSDKLVNPMSEMSLLEVLKSNLLADIQHEILYIPILSVAQL FT RNIVRTRERFLQSVAKPMGIAEPKRNPIRRQVHEIVVQTDKEGDEVDETEQ FT LDIAAVTLSCWNCGCQGHRYQDCVGDRTVFCYGCGKRDTYKPSCSRCNDPK FT NSSSRAQSTGARKPNSSKATNTI" FT CDS 2803..5775 FT /product="Gypsy-3_DPer-I_1p" FT /translation="MDTGASISCIGGKFAADLVQNPNQIKPVRAAVRTAHG FT QSQPIIGKVTTEVGFQGQKKVLKLFVVPSLAQDLYLGIDFWSRFNLLPAFL FT ANKSSSPEVSSLAFDSTLHVCLSTHQQAQLAETVNLFPSFAKQGLGKTGWL FT SHDIEVGSGKAIKQRHYAVSPAVEKLMYVEVDRMLALGVIEESDSAWSSPV FT VLVRKPGKVRLCIDSRKLNEVTVKNAYPMPLIDGILSRLPRAEFISSIDLK FT DAYWQIPLEVSAREKTAFTIPGRPLYQYTVMPFGLCNAPQTMSKLMDKVIP FT PSLRNEIFIYLDDLLVISESFEQHIQVLNVLSARLREAGLTINVEKSKFCL FT KEIKYLGHIIGNGTIQTDVEKVQAIQEFPVPRSVKQVRRFLGLTGWYHKFI FT KNYAGIAAPISDTLKSKRKFEWTDEAQNAFDSLKSQMCNAPVLHSPDFTIP FT FSIHCDASHTGVGGVLMQSDEDGNEVPIAFMSRKLNQCQRNYSVTEKECLA FT AVMCVKKFRAYVEGHQFSIITDHASLKWLMSQADLNSRLARWALKLQGFTF FT KIFHRKGSQNIVPDALSRVHTEDLSYIGIDSLVDLQSEAFKSTDYQELIAK FT IQANPDRLPDVKVMDNLLYRRAEHATGEQISDDMCWKLWVPRALVTEVLKQ FT GHEHPLASHSGINKTLERVRRFYYWPNLAGDVKAFINNCEVCKCCKHPNRP FT LRPPMGKAGETFRLFQKLYVDFLGPYPRSRSGNIGIFVVLDHFSKFPFLKV FT VKKFTADAVIRFMEEDLFHCFGVPEVIVSDNGVQFKSHKFNDLLAKYQIRH FT SYTAAYAPQTNASERVNRSVLAAIKAYIKPDQSNWDEQISFITCALRSSVH FT AAVKSSPYRVAFGQNMITGGSTYQLLRNLSMLEDRTVVFDRDDSFELIRKT FT AAEAKKQQHVRNENIYNLRSREVNFQVGQEVYRRNFDQSSFVKGFNAKLAP FT TFRKARIKKRLGNSYYELEDFQGRFIGKYHAKDIKQ" XX SQ Sequence 5851 BP; 1642 A; 1282 C; 1392 G; 1535 T; 0 other; ggtaaagaca aattttaaat ggcaattggt tcgaaaagtc tgtcacgatt ttatggcagg 60 gtttcttaaa gaatttgtaa atttggcgcc caacgtgggg cccttcttcg gcttgccccg 120 accaagaaag ttttataaag tcatcaacac caatgcgagc tgcatctcaa tgcagtgtca 180 caagacgtat ttcttcatcc attaattccc aacagatata catggtcgtg cattgtctgg 240 gtgattgact tcgagtagct ggattgcgta gcagtttcta aaagacttta aagccccgcg 300 ctcgaggtta atcaaattac taggcatagt aacgaacctg tagtatctgg ccggataggc 360 cactccgcta aatagcgcgt tctaatgttt cggtcggatt taggtttcaa agtggattag 420 ctctactgcc tacccaccaa gatgggagcg tgttgtttgg gggatgaact tgtccatacc 480 gttgctgatc attgtctgcc gttttggcct ggatcaggtt agtcagtggc ttagagctag 540 ggcctgcccg cgaacgaagc caatagaatc ggagatggta tatttcctgg gtcagcataa 600 ccttgtcttc ttcttcagtt tttctattgt tgttccgttt ttttttttgt ttgacagacg 660 taggtggaat ggctctactg tattgtggtt ctggacgacc tacagttttt gcgacctaca 720 gtataagtag agggccatcg gtttgtcagg gtttcggatt gctcggatat ccgtcagtgg 780 tgtagaacta ccgaacggtc ccagttcatc cgatgactca ttgtactgat tgttgattgt 840 taataatatc accagagtca gtgggggaca cactcctatt agacgaatgt tgcatggttt 900 ccgaattttg tggatacaga gtagatatat cgatcgttaa aagttaataa tcgagagtag 960 aaaagaaaat catatattag taaaattagg ataggatgtc attggattgg agttttgacg 1020 agattgatca aaacgaccat tccgatatta tctgttgtat ttgcgtaaga caggtagaat 1080 atcggacaca gcttttaaga acaagttgta accacgtatt tcatttagtt tgttttaata 1140 cgagaagagg tgaaaagacc gtgtgcccgg cgtgcaacaa gcccctacaa tcatccgctc 1200 ctagcttagc tgaggaatta gccgtagcgc aggcgtccgc ctcgaacgta gagtcttctc 1260 agcttcgaat gagaaccaga tcacagcgtg tgacagcagg tccagctgcg aatactgaaa 1320 atttaccttc cgttggtcca actgaagtcg gtgtaccaat catgccagtt cagagtagcg 1380 agccagcgaa tgaagcaagc ccaggcttgg cgcaaatggc acaggcgttg acggacgcta 1440 tgggtcaaat cgggttgttg tctcaagcca tgactcaaat gactcagact atggaaatag 1500 gattccaacg tttggcaagt ccaacggtac agccgccgaa tgaccaagct gctggatctt 1560 cacagcctag tggagaaaat caaaccttcg aacagttgtt ccaaatacca actcgagaat 1620 ccgtaggtgc cccgataaat caggctaatg attctaatgt ctcggtaacc ccatggggat 1680 cccaacagat agcgggacta cgaccagata ggattggaca aataatcgca ggatggaagg 1740 tgcggttttc cggaaattcg tccatgtcgg tggaagattt catatatcgg atagaagcgc 1800 taaccaagca gaccttagag ggtaatctcg atctagtggc cagaaacgcc atttatttat 1860 ttgaagccag tgcgagcgag tggttttggc gttaccataa aagcgtacca tccgtgacct 1920 ggccgcagct aagtacagcc ttaaaaactc gatttagaga tgggcgtacc gatctggaaa 1980 ttcgtacggc aatagtgcaa cgaaagcaga agccgcacga actatttgat acattttacg 2040 aagccattgt aaccttgtcc gacaaactag tcaaccccat gtccgaaatg tcgctgttgg 2100 aagtgctaaa gtcgaatctc ttggccgata tccaacatga aattttatac attcccattc 2160 tgtcagtagc acaattgcgc aacatagtgc gtactcgcga acggtttttg caatcagtgg 2220 ctaagcctat gggcatcgct gagccgaagc gtaatcctat acgacgacag gtgcatgaaa 2280 ttgtggtaca gacagacaag gaaggcgatg aggtcgatga aacagaacag ctagatatcg 2340 ctgcagtaac cctttcctgt tggaactgtg gttgtcaggg gcatcggtat caagactgtg 2400 taggtgatcg caccgtcttt tgttacggct gtggcaaacg ggatacgtat aaaccgtcgt 2460 gctctaggtg caacgatcca aaaaactcgt caagccgtgc acaatcgaca ggtgcacgca 2520 aaccaaacag ttcgaaagca acgaacacca tttaatggac gtttcgacag ttccattact 2580 ccctgactta aaaaaactcc cagacattcc ttcacccact tgtattttac ccccaagtga 2640 ggaccgaatt ccattaccga ctcgatctcg aaagcgattg ctccaatatc ggcaaactgt 2700 caaaccgagc gctctaacgt attggcgtca gtcgttaata cgtctaagga ctggcgaccc 2760 tatgcggaag tcaccgtact gggaaggacg ctcagtggac tgatggacac cggtgcatcc 2820 ataagttgca taggtgggaa gttcgcggcc gacctggttc agaatccgaa ccagatcaag 2880 cctgtgagag cagcagtacg tacggcgcat gggcagagcc agccaataat tggcaaggtc 2940 accaccgaag tgggttttca aggccagaag aaagtcttga aattgtttgt agtcccatcc 3000 ttggcgcaag acctatatct aggcatagac ttttggtctc gttttaattt gttacctgct 3060 tttctagcca ataaatcatc cagtccagag gtgtcaagct tagcatttga tagtacgtta 3120 catgtgtgtt tgtcgacgca tcagcaagct cagttagcag agacggttaa cttgtttccc 3180 tcctttgcga agcaaggtct gggaaaaacg ggctggttgt cgcacgatat tgaagtgggc 3240 agcggaaagg caataaagca acgacattat gctgtgtctc cagccgttga aaagctaatg 3300 tacgtagagg tagaccgaat gttagctctt ggagtaatcg aagaatcaga tagcgcatgg 3360 tcgtcacccg tggtcctcgt tcgtaaaccg ggaaaggtgc gattatgcat cgacagtcgg 3420 aaactgaacg aagtcactgt taagaacgct tatcccatgc cattgatcga tggcattctc 3480 agtcggcttc ctcgagctga gttcatttca agcatcgatt tgaaggatgc ctattggcaa 3540 attccgctgg aagtaagcgc tcgtgaaaaa acggcgttta ccatcccagg gcgcccgttg 3600 tatcagtaca cggtcatgcc ctttgggttg tgtaatgcac cccaaacgat gtccaagttg 3660 atggataaag tgataccgcc atccctacga aatgagatat ttatttatct agacgatttg 3720 ttggtgatct ctgaatcctt tgagcagcat attcaagtcc tcaatgtcct gtctgcccga 3780 ttacgcgaag ctggtctgac cataaacgtg gagaagtcca aattttgcct aaaggagatc 3840 aaatacctag gacatattat aggcaacgga acgatccaga cagatgttga gaaggtccaa 3900 gcgattcaag agtttccggt cccccgatcc gtgaaacaag tccgtaggtt cttaggtctc 3960 actggttggt accacaaatt cataaaaaat tatgcgggta tcgcagcacc aatctcagat 4020 acacttaaga gcaagcgaaa atttgaatgg accgatgaag cacagaacgc gttcgattcc 4080 ttaaaatcac agatgtgtaa cgctcctgtg ttgcacagtc ccgattttac cattcctttt 4140 tcaatccatt gtgatgcgag tcacacagga gtgggtggtg ttctgatgca atcagatgaa 4200 gatggcaacg aggtacccat cgcttttatg tcacgcaaat tgaaccaatg tcaacgtaat 4260 tattccgtaa ccgaaaaaga gtgtttggca gcggtcatgt gcgtcaagaa atttagagcg 4320 tatgtagaag ggcatcagtt ttccattatt accgatcatg cctcattaaa atggctgatg 4380 tcacaggctg acctaaactc gagactagct cgatgggccc tgaaactgca aggcttcacc 4440 tttaagattt ttcatagaaa gggtagtcag aacatcgttc cggatgcctt atcacgagtg 4500 cataccgaag acttatcgta tattggcatt gatagcctag tggaccttca atcggaagcg 4560 tttaaatcca ctgactatca ggaactcatt gcgaaaattc aagctaatcc agatcgttta 4620 cctgatgtaa aagtaatgga taaccttctg tatcgtcgag ctgaacacgc cactggtgag 4680 caaattagtg acgacatgtg ctggaaactg tgggttccca gagctctggt gacagaagtg 4740 cttaaacaag gccacgaaca tcccttagct tctcacagtg ggatcaataa gacgctagaa 4800 cgagttcgac gcttctatta ttggccaaat ttagcgggcg atgtaaaagc ttttatcaac 4860 aattgcgaag tgtgcaaatg ttgtaagcac cccaatcgtc ctctacgtcc tccaatgggc 4920 aaagcaggag agacctttcg gctatttcag aaactttacg tcgattttct aggtccgtac 4980 ccccgaagtc gatctggaaa tataggcatt tttgtagtac tcgatcattt ttctaaattt 5040 cctttcctaa aagtggtaaa gaaatttacg gcggatgccg taattaggtt catggaggaa 5100 gacctgtttc attgctttgg agtcccggaa gtgatagtgt cagataacgg tgttcaattc 5160 aagtcacaca agtttaatga tctgttagcg aaataccaaa ttcgacattc gtacaccgcc 5220 gcctacgctc cccaaaccaa tgcctcagag cgggtaaatc gctcagtact cgctgctatc 5280 aaagcgtaca ttaagcccga tcaatccaat tgggatgagc agattagctt cattacttgt 5340 gcgttacggt ccagtgtgca tgcagccgta aaaagtagcc catatcgtgt tgcttttggc 5400 caaaacatga ttactggcgg ttccacttat caactcctac gaaacctcag tatgttggag 5460 gaccgaactg tggtatttga tcgcgacgat tccttcgagc taattcgcaa gaccgctgca 5520 gaagcaaaga aacagcagca tgttcgaaac gaaaatatct ataatcttcg aagccgagaa 5580 gtaaactttc aagtgggtca agaagtttat agacgaaatt ttgatcagag tagttttgtg 5640 aaaggcttta atgcgaaact ggcacctacg tttcggaaag cccgcattaa gaaaagactc 5700 gggaacagct actatgagct ggaagatttc cagggtcgct ttataggtaa atatcacgca 5760 aaagatatta aacaataaat cgtatcaagc actccaagtg tggatcacgc tttggttagc 5820 aagaccccaa agtgtgattt cagcgggggg g 5851 // ID BEL-50_CQ-LTR repbase; DNA; INV; 385 BP. XX AC AAWU01014601; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-50_CQ_; KW BEL-50_CQ-I; BEL-50_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-385 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 254-254 (2011). XX DR GenBank; AAWU01014601; Positions 22293 21909. XX SQ Sequence 385 BP; 112 A; 92 C; 102 G; 79 T; 0 other; tgtcagctat gcttccgtct tccgagaatt cttgatgaca aatgaaaacc cagctcgatc 60 ggcaacactg gttgctgcga tcgacacacc caacgcgaca cccctcgtcg ccgcctgcga 120 tgacagcaag gaacgatcga gaaggtggga attggcgcgc aacggatgcg cgcgcaatcg 180 agccgaagga ggagagaaag agagagagaa gtggagattt ttcactccag ttcgcgacgc 240 gttcaagtaa agacgtgttt tgtacatagt gatcagaata aaggttttgt taggaaaagt 300 gaattccgca aagtgttttg atcacccgga gaaaatccct aatacactcc acccgaagat 360 ccagttggga gatttggccc caaca 385 // ID AVMAR1 repbase; DNA; INV; 1195 BP. XX AC . XX DT 11-AUG-2005 (Rel. 10.08, Created) DT 18-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE Mariner-type element from Bdelloidea - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; AVMAR1. XX OS Adineta vaga OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Adinetida; Adinetidae; OC Adineta. XX RN [1] RP 1-1195 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 90..1145 FT /product="AVMAR1_1p" FT /translation="MIKKIVDMDVDVESKINLSHREVRVLLLHEFRLGHKA FT TEAASNICGTMGQGLVSTRTAQRWFNHFKNGDLELDDLPRSGRPMEVDVDF FT LKQLIEEDPRLTLRCLAEQLGCSHTTVEKHLNELGKTWKYGVWIPHELSAH FT QLQQRVDACMDLITSHRNYQWLSNLITGDEKWVLYVNYTRRRQWLSAGQTG FT VATPKADLHPKKLMLSVWWGIKGVIHWEVLPNGYTITADLYCQQLDRVAEK FT LKGKQDRVYFLHDNARPHVAKSTREKLLKLGWITIPHPPYSPDLAPTDYHL FT FRSLSNDLRDKKFDDESDVKTELVKFFDEKSQDFYERGIMSLPERWQQVVD FT SNGKYISEN" XX SQ Sequence 1195 BP; 365 A; 233 C; 274 G; 323 T; 0 other; gtttttttct aaaatcttgt agaacaacca agaaagttct agaagtttcc atactaaggg 60 tatatatata ggacatgaat gtcttttgaa tgataaaaaa aatcgttgat atggacgttg 120 atgtcgagtc aaagataaat ctttctcata gagaggttcg agtgctttta cttcatgaat 180 ttcgtttggg gcacaaagca acggaagcag ccagcaacat atgcggcacg atgggtcagg 240 gtctagtctc cactcgtacc gcgcaacgtt ggttcaatca tttcaagaat ggcgatttag 300 aactcgacga cttacctcga tctggtagac caatggaagt ggatgtggac tttttaaagc 360 agcttatcga agaagatcct cgactgactt tacggtgttt agcagagcag ctcgggtgct 420 ctcatactac ggtggaaaaa catctgaacg aattaggcaa gacctggaaa tatggggtat 480 ggatacctca tgaattatcg gcacatcagc tgcaacagag agttgatgca tgtatggatt 540 taatcacatc tcaccgcaac taccaatggt tatccaatct gattactggt gatgagaagt 600 gggtgttgta tgttaactac acgcgtcgac gtcagtggct gagcgctggt caaaccggcg 660 tagcaacacc gaaggctgat ctccatccca agaaattgat gctgagcgtc tggtggggta 720 tcaaaggagt tatccactgg gaagttctac caaatggtta caccatcact gctgatctgt 780 actgtcaaca attggatcgg gttgcagaaa agctcaaggg aaagcaggat cgagtttatt 840 ttttacatga caacgctaga ccacatgtag caaagtcgac ccgtgaaaaa ttattgaagc 900 tcggctggat taccattcct catccacctt attcccctga cttggctcca acagactacc 960 atttgtttcg atctctttcg aatgatttac gtgataaaaa gttcgacgac gaaagcgatg 1020 tcaaaacgga gcttgtcaag ttctttgatg aaaaatccca ggacttctac gagcgcggga 1080 tcatgtctct gccagagcgt tggcaacaag tggtagatag taatggtaaa tatatatctg 1140 aaaactagtt gtacttttga agtgaaaaaa caaaaataaa atttttgaaa aaaac 1195 // ID Gypsy-8_CQ-I repbase; DNA; INV; 4444 BP. XX AC AAWU01000706; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_CQ_; KW Gypsy-8_CQ-LTR; Gypsy-8_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4444 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 395-395 (2011). XX DR GenBank; AAWU01000706; Positions 28397 23954. XX CC Positions [3285-3743] - Integrase core CC 'TTTTG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 36..4433 FT /product="Gypsy-8_CQ-I_1p" FT /translation="MSPEFEANLLKILENQSKLLQELAAARITEAQPADNG FT GGDGSLPANQQRPVHPTRPNQTEFVIESLSSGIREFCYDPEAGVTFEAWFA FT KYEDLFAEDAQNLDDPAKVRLLLRNLNTVAHKKYLSYILPNKPKDVDFKET FT VKTLKSIFGRHTSLFNSRYQCLQLQKNPSDDYFTYAGIVNEKCEEFQLDKI FT DADQFKCLLFVCGLSSSRDADVRTSLLSKIESANPAAPMTLRSLAEECQRL FT LNLKRDTAMLERTKPTVSAVKQPPRTKNAPPAQQPAETPNSPCWRCGEMHY FT SKNCPFIQHECKSCKKVGHREGYCACFNSKSGRKKKKQTTAKAQGIYSVNQ FT VSIADRRKFVTVELNGMPARLQLDSAADISVISADVYKQLGCPAGDPPSIN FT VVNASGDDMGLELEIICTITLNDVVKQGRCFVSRTNDMNLFGAEWIELFGL FT WDVPFNAVCNQVSMKTYPKSAELVNRFKEKYSTIFSESLGLCTKKKVSFTL FT KPGAKPIFRPKRPVPYASVEKIEAELDRLQSLGIITPVSYSEWAAPIVAVR FT KPNGKVRICADYSTGLNDALEPNHHPLPLPQDIFAKLADKKIFTQIDLSDA FT YLQVDVDEESRKMLTINTHKGLFQFNRLSPGVKPAPGEFQHIVDSMIADLD FT DTSCYLDDIIVASNSLDEHIEQLHRLFARIQEYGFHLKIEKSNFFMQQIKY FT LGFIVDSDGIRPDPEKVKPIVNMPAPHDVPTLRSFLGAINYYGKFVKSMHE FT LRHPLDALLKKDAKWNWSPACQRSFEEFKRILQSELLLTHYDPNQEMIVAA FT DASQKGIGAVLLHRFSDGTIKAVCHASRTLTDAEKNYAQGEKEGLALVFAC FT TKFHRMIFGRRFTLQTDHKPLLGIFGTKKGIPVHTANRLQRWALTLLLYDF FT QIEFKATDIFGYADVLSRLIGEHTKPDEDYVIASLQLEKDVRSIQAESVAA FT LPVTHQLILQETKRDKTLQAVLKQLRDGWSTKASSGEFLQFYKRRESLYEV FT DGCVMFLDRIAIPEKLRAPVLKQLHAGHPGMQRMKSIARSYVYWPNIDAEI FT EDYVRKCSRCAAAAKAPVKTTLSSWPIPVQPWTRLHLDYAGPIQGKFFLVI FT VDAHSKWPEIFAVNNSTASTTVSKLRECFARFGCPVSVVTDNGTQFDSELF FT AKFCRELGIEHIKTPPFHPQSNGQAERFVDTLKRALLKIGGEDIDAALQTF FT LQTYRYTPNAVLPDNKSPAEALLGRKVRTVFDLMTKPTPDMPLINQRQNEQ FT FNRKHGAKKRTFAAGEEVYAEIHIRNEKYWAKGVVIERKGQVVYNVLLDDP FT RRRGLIRSHANQLRSRAAADEPVVQAETDLPLNVLLEEFGVNRTGVETHAD FT PVGGELEEMPLEQTTVAEDLLSVPEEVMAVPGPSRPPNPVPVEAKAVTGPS FT RPRYVCVPRPNVDVPAPRPIVRKRGIEPVSPGRGPSGRKRRLPSHFHHYDL FT F" XX SQ Sequence 4444 BP; 1148 A; 1215 C; 1175 G; 906 T; 0 other; attggcgacg agaaaataca agcctacgca agaagatgtc tccggagttt gaagcgaacc 60 ttttgaaaat tttggagaac cagtcaaagc ttcttcaaga acttgcggct gctcgaatta 120 cggaagctca accggcggac aacggcggtg gggacggtag tttgccagca aaccagcaga 180 gaccggtcca tcctacccgt ccgaaccaga ccgagttcgt aatcgagtcc ctgtcaagcg 240 ggatccggga attttgctac gacccggaag caggagtcac gtttgaagcc tggttcgcca 300 aatacgaaga tcttttcgcg gaggacgctc aaaatctgga cgacccggca aaagtcagac 360 tcttgctacg caacctcaac accgtggctc acaagaagta tctgagctac attttaccaa 420 acaaaccgaa agatgtcgat ttcaaagaaa ccgtcaagac cctgaagtcc atcttcggcc 480 gtcacacgtc gctcttcaat tcacggtacc agtgcctgca gctacagaag aatccgtccg 540 acgattactt cacgtacgct ggcatcgtga acgagaagtg cgaggagttc cagctggaca 600 agatcgacgc ggaccagttc aagtgcctgt tgttcgtgtg cggcctgagt tcatcaagag 660 atgcggacgt tcgtacgtca ctgctgtcca agatcgagag tgccaatcct gctgcgccga 720 tgaccctgcg ttcgttggcc gaggaatgtc agcgactctt aaatctgaag cgggacaccg 780 ccatgctcga gaggaccaag cccaccgtat cagctgtcaa acagccacct aggaccaaga 840 acgccccgcc tgctcagcag ccagccgaaa ctccgaactc accgtgttgg agatgtgggg 900 aaatgcacta ttcgaagaat tgcccattca tccaacacga gtgcaagtcc tgcaagaagg 960 tcggacatcg tgaaggatac tgcgcctgct tcaattcaaa atccggacgg aagaaaaaga 1020 agcagactac agccaaggcg caggggatct actccgtcaa ccaggtcagc attgcagaca 1080 gaaggaagtt cgtcacggtg gagctgaacg gaatgccagc acgactgcag ttggatagtg 1140 ccgcggacat cagcgtgatc tcagcggacg tctacaagca actcggttgt ccagccggcg 1200 atccgccgtc catcaacgtg gtcaacgctt ctggagatga catgggactg gagttggaga 1260 tcatctgtac cataaccctc aacgacgttg tcaagcaagg gaggtgcttc gtatctcgga 1320 ctaacgatat gaacctgttt ggagctgaat ggatcgagct tttcggactg tgggacgtac 1380 cgttcaacgc ggtgtgcaac caggtctcca tgaaaacgta tcccaagtcg gcagaactgg 1440 tcaaccgttt caaggagaag tacagcacca tcttcagcga gagtctggga ctctgcacca 1500 agaagaaagt gtcgttcacc ctcaagcctg gtgccaagcc gatttttcgc ccgaagcgac 1560 ccgttccgta cgcatcggtt gagaaaattg aagctgaact ggaccgtctt caaagtcttg 1620 ggatcatcac accagtctcg tactcggagt gggccgctcc gatcgtggct gtacgcaagc 1680 ccaacggcaa ggtgaggatc tgtgcagact attccactgg actcaacgac gcgctggagc 1740 cgaatcacca tccgctaccc ctgcctcaag atatcttcgc caagctggcc gacaagaaga 1800 ttttcaccca aatcgatctg tcggacgcct acctgcaagt cgatgtcgat gaggagtcgc 1860 ggaagatgct gacaattaac acgcacaagg gtctgttcca gttcaatcgc ctgtcgccgg 1920 gagtcaagcc tgcacctggc gagttccaac acatcgtgga cagcatgatt gcggatctgg 1980 acgatacgag ctgttatctg gacgacatca tcgtggcgag caactctctg gacgagcaca 2040 tcgagcagct tcatcgcctg ttcgcccgca tccaggagta tgggttccat ttgaagatcg 2100 agaagagcaa tttcttcatg cagcagatca agtacctggg cttcatcgtc gattccgatg 2160 gaattcgacc tgatccagag aaagtcaagc cgatcgtcaa catgcctgcc ccacacgacg 2220 tcccgacact gcgctcattc cttggtgcga tcaattatta cggcaagttc gtcaagtcca 2280 tgcacgagct tcgccatccg ttggacgctc tgctcaagaa agatgccaag tggaactgga 2340 gtcctgcctg tcagcggtcg ttcgaggagt tcaagcggat tctacaatca gagttgctgt 2400 tgacccacta cgacccaaac caggagatga tagttgccgc tgacgcctcg cagaaaggaa 2460 ttggtgcggt tcttctacac cgtttctctg acggtacgat caaagcagtg tgccatgctt 2520 cccggacact cacggacgct gagaagaact atgcccaagg cgagaaagag ggccttgccc 2580 tggtcttcgc ctgcaccaag tttcatcgga tgattttcgg cagaagattc acgctacaaa 2640 cggatcacaa gccgctgctg ggaatttttg gaacgaaaaa gggcattcca gttcacacgg 2700 cgaatcgact ccagcgatgg gcactcacac tgctgctgta cgacttccag atcgagttca 2760 aggcaacgga catctttggg tacgctgatg ttctgtcacg tttgatcggc gagcacacca 2820 agccggacga ggactacgtc atcgccagcc tgcagctcga gaaagacgtc cgaagcatcc 2880 aggcggaatc cgtggcagct ctaccagtta ctcatcagct gatcctgcaa gagacgaagc 2940 gcgacaagac actccaggcc gtactcaagc aacttcgtga cggatggtca accaaagcat 3000 catctggaga gttcctgcag ttctacaaac gacgggaatc gctatacgaa gtggacggct 3060 gcgtgatgtt cttggaccga atcgccattc ctgagaagtt acgagctcct gtcctgaagc 3120 agttgcacgc tggtcacccg ggaatgcagc ggatgaagtc gatcgcgaga agctacgtct 3180 actggccaaa catcgacgct gagattgaag actacgtgcg gaaatgttcg cgctgtgctg 3240 ctgcggcaaa agctccagtc aagacaactc tttcctcgtg gccgattcca gttcaaccat 3300 ggacgcgctt gcacctggac tatgctggac ccatccaagg aaagttcttt ctggtgatcg 3360 tcgacgcaca ctcgaaatgg ccagagatct tcgcagtcaa caactcgacg gcttcaacca 3420 ctgtcagcaa gctacgagag tgtttcgcaa gattcggatg tccagtctcc gtggtcacgg 3480 acaacggaac gcagttcgac tccgagcttt tcgccaagtt ctgccgcgag cttggtatcg 3540 agcacatcaa gacaccacca ttccacccgc agtctaacgg acaagcagag aggtttgtcg 3600 acacactgaa gcgagctctt ttgaagattg gcggtgagga cattgatgct gctctacaga 3660 ctttcctgca aacctatcgc tacacaccaa atgcggtttt acccgacaac aagtcacctg 3720 ctgaagcttt gttggggagg aaagtgcgca ccgtcttcga cctgatgacg aaaccaacac 3780 ctgacatgcc gctcatcaac caacgccaga acgaacagtt caatcgcaaa catggagcca 3840 agaaacgcac gtttgcggca ggtgaggaag tttatgctga aatccacatc cgcaacgaaa 3900 agtattgggc caaaggtgtg gtgatcgagc gcaaaggaca ggtggtgtac aacgtgctac 3960 tggacgatcc acgacgtcga ggattgattc ggtcgcatgc aaaccaactg cgatcgcgag 4020 cagcagctga cgaaccagta gttcaagccg agaccgacct accactaaac gtgctgttgg 4080 aagagtttgg cgtgaaccgg actggagttg aaacacatgc tgatcctgtt ggaggggagc 4140 ttgaagagat gccgctcgag cagacaacag tcgctgagga tctactttcg gttcctgagg 4200 aggtgatggc cgttccagga ccgtcacgtc caccaaaccc ggtacctgtg gaagcgaaag 4260 ctgttacagg gccatccaga ccgcgatacg tttgcgttcc aagaccaaac gtagacgtgc 4320 ctgctccaag gccgatcgtg cggaagagag gaattgaacc tgtaagccca ggacgaggac 4380 catccggcag aaaacgacgc cttccatcgc acttccatca ctacgatctc ttctaggggg 4440 gaga 4444 // ID Copia7-NVi_LTR repbase; DNA; INV; 287 BP. XX AC AAZX01002465; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia7-NVi; KW Copia7-NVi_I; Copia7-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-287 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1113-1113 (2007). XX DR Genome; AAZX01002465; Positions 18578 18292. XX SQ Sequence 287 BP; 92 A; 62 C; 39 G; 94 T; 0 other; tgttgtgata ttagagtcaa ttaatcattt aaatggtcgc gcgcttatac acttataata 60 gactaacgta gtctctcgta tcatactaga atcttctagg caacttataa agtaccagaa 120 actagctgac tcgttcactt gagttcacaa cgcgatcaag acaagtacag ttttgctctc 180 gccctctcat ctcaactgtt ataaataata aatctattat acatatttag aagataaact 240 actggctcgt ttatttcact cgccctccga aatacctaat tatatca 287 // ID Gypsy-13_AA-LTR repbase; DNA; INV; 133 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_AA_; KW Gypsy-13_AA-I; Gypsy-13_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-133 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 996-996 (2011). XX DR [2] (Consensus) XX SQ Sequence 133 BP; 51 A; 16 C; 28 G; 38 T; 0 other; tgaagtgtat tagcttaccc tattatgata gaaactctct tgaaagcgtt atacgtatta 60 taaagtatat tgaaaagata tctagataag aatgatgaac agggacatgg aggaagaatg 120 aacttccggt aca 133 // ID Crack-15_BF repbase; DNA; INV; 3181 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-15_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-15_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3181 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3181 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 820-820 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..2976 FT /product="Crack-15_BF_2p" FT /translation="MARMCGFLVVRASCLLHSVLVCDHCVTVFVVILLLLC FT GDIHNNPGPARLNLPQKGLHIGHLNICSWSSKTCDLRELLDDNGFHVFGLS FT ETHLDPSISDQELGADGWTVLRKDRNRHGGGVAFLVRDGLPWKHRTDLEKN FT NTEMLWIEVRLPQTRPILLSCIYRPPRTGMQYLNDILESVDRATDTNSEVF FT VMGDINIDWSNQLCPMREHLADVATSCNLSQVVKVPTRISVNKNNRRTATC FT IDLVFTNCKDRCTPARSVPVGFSDHNIVYISRKTKVPRAQARVVHKRSFRR FT FCPEDFLNDIELAPWHLVQDEQDVDEALHLFTVMFNEIADQHAPMKKQQQK FT SNPVAWLDEELRELMRLRDEARRESVISGLQSDIQVYKKLRNAVVKLNRKK FT KATYYKEKLEENKNEPKAMWKTLNGILGKGSKRATGVVEQGGTYLTKPKDI FT AEHFNAFFLHKVNTLRQGMEKQPDNTVLHLIEENIMAGKDCNFEFRQVNRE FT EVYQLLLALPEGKAAGLDNMDNKLLRIAAEHVSTPLCYIINLSFVTSVYPS FT EWKKAKVVPIPKSTTEPFCGANSRPISLLPTTSKIMERIVCKQVSDYFSKN FT SLMSENQHAYRKNHSTCTALLHMVDDWYHSIDQGKLVGAIFLDFSAAFDLV FT DHNCLLSKLACYGFDESTTKWMASYLTGREQCVHINGTNSSFKELPCGVPQ FT GSCLGPLLFTIYTNDLPLAITQATADMYADDTSAYICAPSIETISHNLQTE FT VNNICSWVRVNRLFLNTSKTKCIVLGSKPKMSAKPKLTLTANGKVIEQVSE FT VKLLGSTVDECLTWNTHTKLTSKKMARSLGMIKRCAYLMDQTLLKIVIECL FT VLSHIDYCSPVWSCTTKTNINLLQRMQNKATRLFQSKPQWKPVHTRLLINT FT MTTFKRITLTNEPICISRQLRSVNNLHSYRTRFAAKQSFLLEKPRTNALMK FT CFMYRAVRVWNNLPIVMQSTQSLRVFKDGLHKYFHQ*" XX SQ Sequence 3181 BP; 975 A; 711 C; 695 G; 800 T; 0 other; atggcgcgca tgtgtgggtt tctggtcgtt cgtgcgtcgt gtttgctgca ctcagtgtta 60 gtgtgcgatc actgcgtgac tgtgtttgtc gtcattcttc tccttctttg cggggacata 120 cacaacaacc cgggccctgc tcggctcaac cttccacaga aagggctaca cataggacac 180 ctaaacatct gcagctggag cagcaagacc tgcgatctga gagagctgct cgacgacaac 240 ggcttccacg ttttcggact ctctgaaacc cacttagatc caagcatcag tgatcaagaa 300 ctgggtgcag atggctggac agttctcaga aaggaccgaa acaggcatgg aggcggagtg 360 gctttcttgg ttcgggacgg actgccatgg aagcatagga cagatctgga gaagaacaac 420 actgagatgc tgtggattga ggtacgctta ccacaaacta gaccgatatt gctaagctgt 480 atctataggc cgccaagaac aggaatgcaa tatcttaatg acatccttga atctgttgac 540 agagctactg ataccaattc agaagtcttt gttatgggcg atattaacat tgactggtct 600 aatcaactct gccctatgag ggaacaccta gcagatgtag caacaagttg taacttgtcc 660 caagtagtga aagtaccaac gagaataagt gtaaacaaga ataaccgtcg gactgctact 720 tgcattgatc ttgtcttcac aaactgtaaa gacagatgta ctccagctag gtctgttcct 780 gtcggttttt cagatcacaa catagtctac atctcgagaa aaaccaaggt tcccagagca 840 caggcaaggg tagtacacaa gaggtccttt cggaggttct gccctgagga ctttctgaat 900 gacatcgagt tggcaccatg gcatcttgta caggatgaac aagatgtcga tgaagcactt 960 catctcttca ctgtcatgtt caacgagatt gcagaccagc acgctcccat gaaaaaacaa 1020 caacagaagt cgaatccagt ggcatggctg gacgaggagc tcagggaact gatgcgtctt 1080 cgggacgagg ccaggaggga atccgtcatc tcgggtctac agtccgacat ccaggtctac 1140 aagaaactcc gaaacgcagt tgtgaagctt aataggaaaa agaaggcaac atactacaaa 1200 gaaaagttag aagaaaataa aaacgaaccc aaagcaatgt ggaaaacctt aaacggcatt 1260 ctcggaaagg gatctaaacg cgcaaccggt gtggttgaac aagggggaac ttaccttaca 1320 aaacccaagg acattgcaga acatttcaac gccttctttc tccacaaagt caacaccttg 1380 cgacaaggaa tggaaaaaca acccgacaac accgtgcttc acctgattga agagaacatc 1440 atggcgggca aggactgcaa ctttgaattc agacaagtga acagagagga agtgtaccaa 1500 ctcctactgg ccctgccaga aggaaaagca gctggactgg ataacatgga caataagcta 1560 ctgagaattg cagcagagca tgtttctact cccctctgtt acattattaa tttatcattt 1620 gtaacctcag tctatccaag tgaatggaag aaggctaaag tggtacctat tcctaagtcc 1680 actacagaac ccttctgcgg cgctaacagc agacctatta gccttcttcc taccactagt 1740 aagataatgg aacgtattgt ttgcaaacaa gtgtcagact atttctcgaa gaattcactt 1800 atgtctgaga atcagcatgc atacaggaag aaccattcaa cctgtactgc actgctgcac 1860 atggtagatg attggtacca cagtatagat caggggaaac tggtaggcgc catcttcctc 1920 gatttctctg cagctttcga ccttgttgat cacaactgtc tactttcgaa attagcatgt 1980 tatggctttg atgaatctac caccaaatgg atggcaagtt acctgacggg aagagaacaa 2040 tgtgttcaca tcaatggaac aaactcttcc tttaaggaac tgccttgcgg tgtgccccag 2100 gggagctgtt tagggcccct tctttttacc atttatacta atgatctacc gcttgccatc 2160 actcaggcca cagcggatat gtatgctgat gacacatcag cctacatatg cgctccttct 2220 attgaaacta tctcacataa tctacagaca gaggtaaaca acatctgtag ctgggttaga 2280 gtgaaccggc ttttcttgaa cacttcaaaa acaaaatgta ttgtactggg gagcaagccc 2340 aaaatgtctg ctaaacccaa actcacacta actgcaaatg gtaaggttat cgaacaggtc 2400 tcagaggtaa aactacttgg ttcgacagta gatgagtgcc ttacctggaa tactcacaca 2460 aaattaacat ctaagaaaat ggctagatca ctggggatga tcaaaagatg cgcatacctt 2520 atggatcaaa cattgttaaa aattgtaata gaatgccttg tactatcaca catcgattat 2580 tgcagcccag tttggtcatg cacaactaaa actaacataa accttctgca acgaatgcag 2640 aacaaggcaa ctcgtctgtt tcagtcgaag ccacagtgga aacctgtgca cacgagattg 2700 ttaataaaca ccatgacaac atttaaaagg ataacactca caaacgaacc tatttgtatc 2760 agtaggcaac tgcgttcggt caacaacttg catagttaca gaacccgttt tgctgctaaa 2820 caatcttttc tgttagagaa acctagaact aacgcactga tgaagtgttt catgtacagg 2880 gctgtcagag tgtggaataa ccttccaatt gtcatgcaat ccacacaatc cctgagagtg 2940 tttaaggatg gtttgcacaa atacttccac cagtagttac aagcactgtg cgacaggatt 3000 gtcaattttt aaattatgta tattgtaaat atgttattga tttagatgtg tttgtgactt 3060 gtctagttgc caaccatgtt tgtataatgt attgtctctg tgaatgaata ccaggaagac 3120 tagcggtgga ccctagggtc caccgctaat ggttattcta ataaagtttc aaagtttcaa 3180 a 3181 // ID BEL-607_AA-LTR repbase; DNA; INV; 601 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-607_AA_; KW Pao_Bel_Ele58; BEL-607_AA-I; BEL-607_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-601 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 601 BP; 185 A; 143 C; 120 G; 152 T; 1 other; tgtaaatatt tgtaaatagt ttaaattgta aatagtccta gagttaagta atktgaattc 60 agaatcaaca ctgaatttga aaccagcaac cagtaactac aattgtatga cacactgtac 120 ggtcaaattg tccacacacc acacaccacc tcacccattc aatcacatgt agaagaagca 180 agaactgtat tacgtaacac acccaatacc cagcatcaag caaacagtac caaaatagac 240 aaccaaaacg atcggtagcg tttgttcttt tcctcgtccg ttccgcgcaa taaatcaagt 300 gcaagtgcaa tttaatcaat ctctatctcg aatccgaaac tatatcttcg gctgtggcca 360 acggtggcaa ttaatcaacc cttgagaaga gatcgcgcgc gaacattatt ctgcggcaac 420 caggtttcgt gcacctggtg tgaaatttag tgcaaattgt ggctcctgtt tcggtggata 480 accgccacct gtgaagtgct gtggtggtga atttcacgat cccgtgattt ggtgcgccgc 540 tattgacata gcgggtagaa aaacctccgg tggccggcaa ttagagctcc acgctcgaac 600 a 601 // ID Copia-121_AA-I repbase; DNA; INV; 3932 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-121_AA_; KW Copia-121_AA-LTR; Ty1_copia_Ele105; Copia-121_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3932 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1364-1867] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 8..3931 FT /product="Copia-121_AA-I_1p" FT /translation="MGSSKEGIPQLTGPNYENWRFRVKLHMDAVEVSSVLS FT EPAPAVGDAGRAKWEQTDRKAKSVLVGFVADEILEVVREKETASEMWKALE FT ETFAKRSVSSQTLLRKQLARLRKKEGTSMRAHFNVFDDLVRQLKSAGAKLE FT EGDLVCQLFLTLPDSYDPLVTALENLAEKDLKLETVKQRLLAEESKREDRL FT DDSSEDNGAAFVGGKKKNQKKFTGKCHRCGKLGHMKKDCRSKKLEGNANAA FT VGSKAVAFMANGDQTKSEAGKIHFCVDSGCSDHLVNDAKHLRSMRKLKDPF FT VIDVAENDVTLVGKFAGDIKGMSNKGIEFQIKNVVLLPDLRENLLSVKKLS FT QAGIDVLFTGRGGQDRAEFKKNGELIGVAYLRGNLYQLELDVGSVANISVA FT GMSKLWHRRMGHASQQALNTLVKHEMATGFTKKLETIGFCDTCVMAKQCRE FT SFDGVRERATRPLERVHSDVCGPIDPPAWDGSRYFVSFIDDWTHFAVIYPI FT KRKSEVYRCFKEYEAMATTQWQTRICKFTVDQGREFFSNEQRSYYKKRGIQ FT VQPTVAYSPQQNGVAERFNRTLVEKVRSMLIDSKAPKNLWSEAALTATYLL FT NRCPTVAVPENVTPAERWTNAKPNLDKIRIFGCKAMAWIPSQQRKKLDPKS FT RETVMIGYAPNGYRLWDRASRKIIIARDVKFNEECFPYAEQTDESSQVPLV FT VPTLYEPEGELLNNPGEEEPHTDVVDHESDEETIFEEVEEEANPGALPSQH FT RGDGCSESNTRRSERERRLPGKFFDFLTGFRAISAADNSDSSDPPSSYEEI FT DGRNDRDCWLAAVRDELRSMDTNQVWRLVKRPSGVKPLQPKWVFCVKEDAD FT GNVVRHKARLVVKGFLQKSGIDYQETYAPVAKLTTIRVALAVALQRGMVIH FT QMDVQTAFLHGELQETIYMAIPEGVEADAETVCLLKKSLYGLKQAPKCWNE FT KLNQVLLKLGFKRSKHDYCVYTRIDERGNDDVYIVIYVDDLLIMGVRAKTV FT DDIKKKLSQFFKMTDCGELQHFLGMKIAYDRVLGTMCLSQEASIEKVLKKF FT GMDDCNPTKTPMEKGLQLTRSNTGMIGEPYRELLGSLMYIMMSTRPDVCFP FT VGFLGRFQQQPEQQHWTALKRVVRYMKGTKNICLEYSRNKEAEPLVGFADA FT DWATDTVDRKSVSGFLFQVYGNTISWSSKKQTTVATSSSEAEYIALSAGVA FT ESIWISGLLTDLGVKMTKPVTIYEDNRGCIGMAQNLECKRAKHIDIKHHFI FT RDHIAAGRIRVEPVCTDRQLADIFTKALDIARFENLRRTIGLHNREG" XX SQ Sequence 3932 BP; 1147 A; 803 C; 1095 G; 887 T; 0 other; ataggttatg ggctcgtcaa aggaaggaat cccgcagctg accggaccga attacgagaa 60 ttggcgcttc cgggtcaagc ttcacatgga tgcagtggaa gtatcgtccg tgttgtcgga 120 accagctccg gctgtaggcg atgcaggtcg tgcaaaatgg gagcaaacgg acaggaaggc 180 taaatcggtg ctggtgggat tcgtggcgga tgaaattctt gaagttgtcc gggaaaagga 240 aacggcgtct gaaatgtgga aggctctgga agaaaccttt gcgaagcggt cagtttcaag 300 ccaaacgttg ctgaggaaac aattggcccg tttacgtaag aaggagggaa cttcaatgcg 360 agctcatttc aatgtattcg atgaccttgt ccggcaactc aagtccgcag gagccaaact 420 ggaggaaggt gatttggtat gtcaattatt cttaaccctg cccgatagtt atgacccctt 480 ggtgacggca ctcgagaatc tggcggagaa ggacctgaag ctagagacag tcaagcagcg 540 gctgctggct gaagaaagca agcgtgaaga ccgtctggat gactcgagtg aagacaacgg 600 ggcagcgttc gttggtggca agaagaagaa ccagaagaag tttactggca agtgccatcg 660 gtgtggaaaa cttggccaca tgaagaagga ctgtcggtca aagaagctgg aaggtaacgc 720 aaatgcagca gtcggcagta aggcggtcgc attcatggca aacggtgacc agacgaagtc 780 tgaagcaggg aagatccact tttgcgtaga ctcgggatgc agtgatcatc tcgtgaatga 840 tgcaaaacat ctgcgatcca tgcggaaatt gaaggatccc ttcgtaatcg atgtagccga 900 gaatgatgtg acactcgtcg ggaagttcgc aggagatata aaaggtatgt ccaacaaagg 960 aattgaattt cagataaaaa atgttgtgct cctgccggat ctccgagaaa acctgctctc 1020 ggtgaagaag ctctcgcagg ctggaatcga tgttctattc actggtcgtg gaggtcaaga 1080 tcgtgcggag ttcaagaaaa atggagagct tattggcgtg gcataccttc gcggaaactt 1140 gtaccaactg gaattggatg tcggatctgt agctaatatc agcgtggctg ggatgagcaa 1200 gttgtggcat cgtcgtatgg gacatgcaag tcagcaagca ttgaacactc tcgttaagca 1260 tgaaatggcc acggggttta cgaagaagct tgaaacaatt ggtttttgcg atacatgcgt 1320 gatggccaag cagtgccggg aatcattcga tggagttcga gaacgtgcca cacgtccact 1380 ggaacgagtt cattcagacg tgtgtggccc tattgatcca ccggcctggg acggatcgcg 1440 gtacttcgta tcgttcatag acgactggac gcattttgct gtaatctatc caatcaagcg 1500 caagtctgag gtataccgat gttttaagga gtatgaagcg atggctacaa ctcaatggca 1560 gacaagaata tgcaagttta ctgtggacca aggccgggaa ttcttttcca acgaacaaag 1620 aagctactac aagaagcgag gcattcaggt acaacccaca gtagcttatt cccctcaaca 1680 aaatggagta gctgaacggt tcaaccgaac tttggtggag aaggtaagat caatgctcat 1740 tgattcaaaa gctccgaaga acctgtggtc agaggcggca ctgactgcga cgtacctcct 1800 gaaccgttgt ccgacggtgg cagtgcctga aaacgttact cctgcagaaa gatggacgaa 1860 tgcaaaaccg aaccttgaca agataaggat tttcggatgc aaggccatgg cgtggatccc 1920 gagccaacaa aggaagaaat tggaccccaa aagccgtgag acggtgatga ttggatacgc 1980 tccaaacgga taccggcttt gggacagagc atcaagaaaa ataataatag cgagagacgt 2040 gaagttcaac gaagaatgtt ttccatacgc agagcaaact gatgaatcaa gtcaagttcc 2100 gttggtagtc cccacgctat acgagccaga gggggagctg ttgaataatc ctggtgaaga 2160 ggaaccacat actgatgttg tcgatcatga atctgatgaa gaaaccatat ttgaagaagt 2220 tgaagaggaa gcaaatcctg gggcgctccc ttcgcaacat agaggtgacg gctgttcaga 2280 gtcaaacacc aggcgcagcg aacgggagcg caggctccca ggtaagtttt tcgatttttt 2340 aactggattc agagcaatct ctgctgctga caattccgat tcttcagatc cgccgtcctc 2400 ctatgaggag atcgatggac gcaatgatcg agactgttgg ttggcggctg tccgggacga 2460 gctgcgttcg atggatacca accaggtgtg gcgactagtg aagcgaccct ccggagtgaa 2520 accattgcag ccaaaatggg tattctgcgt taaagaagat gctgatggaa atgtcgttag 2580 gcataaggca agattggtcg taaagggctt tcttcaaaaa tctggcatcg attatcaaga 2640 aacctatgcg ccggtagcca aactgacgac gattcgagtg gcattagctg tggcgctgca 2700 aagaggcatg gttatacacc aaatggatgt gcagacggcc ttcctgcatg gtgagctgca 2760 agagacaatt tatatggcaa tacccgaagg tgtcgaagcg gatgctgaaa cagtctgcct 2820 gttgaaaaaa tcgttatatg gtctgaaaca agctccgaag tgctggaacg agaaattgaa 2880 tcaggtacta ttgaagcttg ggttcaagag atcgaagcat gactattgtg tatatacccg 2940 catcgacgag agggggaacg atgatgtcta cattgtcata tacgtcgatg atttgctaat 3000 catgggagta cgcgccaaaa cggtggacga tatcaaaaag aagttatccc agttcttcaa 3060 gatgactgat tgtggtgagt tacaacactt ccttgggatg aagatcgcct atgaccgagt 3120 attgggaaca atgtgccttt cgcaagaagc tagtattgaa aaagttttga agaaattcgg 3180 tatggatgat tgcaatccaa caaaaactcc catggagaag gggctacagc tgacacgaag 3240 caataccgga atgattggcg aaccgtatcg tgaacttctc ggaagcttaa tgtatataat 3300 gatgtcaact cgaccggatg tttgtttccc agtaggattt cttggacgtt ttcagcaaca 3360 accagaacag caacactgga cggccttgaa acgagtagta cgatacatga aagggaccaa 3420 gaacatatgt ttggaatact ctcgaaataa ggaagcggaa cctttggtcg gatttgctga 3480 tgcggactgg gcgacggata cagtggacag aaaatcggtg agtggattct tgttccaggt 3540 ttatggcaat acgatctcct ggtcaagcaa gaaacaaacg actgtggcca cctcgtccag 3600 tgaagccgag tatatagcgt tgagcgcagg tgtggcagaa tcgatttgga tatctggact 3660 cttaactgat cttggagtaa agatgacgaa gccggtgacc atctatgagg acaaccgtgg 3720 ttgcataggg atggcccaaa acctggagtg caaacgggca aagcatattg atattaaaca 3780 tcattttatt cgcgaccaca ttgcagccgg acgcattcgg gtggaaccgg tttgcacaga 3840 caggcagtta gcagatattt ttacaaaagc gttggacatt gcaagatttg aaaacctacg 3900 acgaaccatt ggactccaca atcgagaggg gg 3932 // ID Vingi-1_HR repbase; DNA; INV; 3214 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-1_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3214 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 167..3184 FT /product="Vingi-1_HR_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="MKNKQEEIQQHLINHNIHIAALQESKLTKGSKNPVFK FT EYAIYRKDRGIGRGGGLILLIHNSIPYSIKSLPDVPTIESQAITINASDTD FT INIINIYIPPQSTCPTGFTASISQFLLIPNVVLLGDVNAHDPLWYSSLEDT FT RGEALASEIENSGCGTLNLDTPTRLPASGQPSSPDISVASDTLIINMEWST FT MTSLSSDHLPIHISLQTTIQHINPPKIRFINFAKANWDGFTKESEAGFSNQ FT VFPPNLSAAEKSFRRILQDASDHNIPSGRRISYTPGLSKEITDLMKRRDDI FT RATDPTSKDIATTSTIIQHKINKLKQSRWKEFLGTFNRITNTKRLWSTIKA FT LSGKPTQYPNIAINFNNRSLTNNLQIANSFNKIFTSATKHSSNKLNRITNK FT AVKKFPLDDYPTLTPNQVTEAIKTSKPSKATGPDNLTIFHLKHLGPKGISF FT LTNIFNESLRTCRIPEIYKKSKIIPLLKPGKPVNDAKSYRPISLLCPAAKV FT FEKCILPIISNHLTLASHQHGFRPLHSTTTALTTMVTDIANGLNQKRPAHR FT TILTALDLTKAFDTVNHRILISDLMSSSLPRSIVRWLSNYLHGRSASTEFR FT RQFSKQRTIRSGVPQGSVLSPTLFNYYVSKIPSPPPDVKVISYADDFSIYT FT TGPNADDLTLRLNCYLPKLYDFFNSRNLEISTSKSTVTLFTTHTSQYHYHP FT AIKINNNLLPLEHNPKILGLTLDTMLSFSQHHKETASKASIRNNILKALSG FT TTYGQDKETLLQTYKTIGRSTIEYACPAWASLTSDSSFSRLQRSQNAALRL FT ITGCHVIASERHLHSECQMLTVRQHSELIASQFLLLCHNPSHPCHHIITQQ FT HPHRNIKTTLLSKWASTTHAVLPTTINSDSLKSALKTLHNNTIKAATETYQ FT SPLLLLDGWPPPTPKIDPTEESLPRAIRRTLAQLRSGHSILLNSYKNRIDS FT GHLNICPFCNNHTDDVPHLFSCPQNTTKLQPIDLWRNPIAVAVWLRPRLEP FT " XX SQ Sequence 3214 BP; 1046 A; 946 C; 505 G; 717 T; 0 other; ctgtcctcac ccgtcctcaa actcaagatg cttgttggcg tctgctggcg cctcacggcg 60 gccaacttca acatcatcat tcaacagctc agcccgcaac aaaccatcaa catcacaagc 120 agtaagtgcc ggaaagttaa atttccttca actaaactgt aacggcatga aaaacaaaca 180 agaggagatt cagcaacacc tcataaatca caacatccac atcgcagccc ttcaggagtc 240 caaactaaca aaaggctcaa aaaatccggt cttcaaagaa tacgctatct accgcaaaga 300 ccgaggaata ggtcgaggag gtggtctcat actactcatc cacaacagca ttccctacag 360 catcaagtcg ttaccagatg ttccaacaat tgagagtcag gccatcacaa tcaacgcaag 420 cgacacagat atcaacatta tcaacattta catcccccca caatccacct gcccaacagg 480 tttcacagca tccatatccc aatttttatt aatcccaaat gtcgtcttgt tgggagatgt 540 caatgcacac gacccactgt ggtattcaag tctagaggac acacgaggag aagccctagc 600 cagtgaaatc gaaaattctg gctgcggcac actcaacttg gataccccaa cgcgtctccc 660 agcatccggc cagccttcat ctccagatat atctgtcgcc tctgatacgt tgatcatcaa 720 catggaatgg tccacgatga cctcactctc atcagatcat ctacccattc acatcagctt 780 acaaacaaca attcaacata ttaacccccc aaaaatcaga ttcataaatt tcgcgaaagc 840 taactgggat ggtttcacca aagaatccga agccggattt tccaatcaag tcttccctcc 900 caacctatca gcggccgaga aatcgtttcg tcgtattcta caggacgcaa gtgaccacaa 960 cataccttcc ggtagacgaa tttcatatac tccaggtcta tcgaaagaaa tcaccgacct 1020 aatgaaaaga cgagatgaca tcagagctac agatccaaca tctaaggaca tagcaaccac 1080 atcaacaata attcaacaca aaatcaacaa attaaaacaa tcgcgttgga aggagttcct 1140 gggcactttc aacaggataa ccaacaccaa gcgcctctgg agcaccataa aagctcttag 1200 cggcaaacca actcaatacc ccaacatcgc tattaatttc aacaacaggt cactaaccaa 1260 caacctgcaa atagccaact cattcaataa aatttttaca tcagctacga aacattcatc 1320 aaacaaacta aatagaatca ccaacaaagc cgttaagaaa tttccgcttg acgactatcc 1380 tacactaacc cccaatcagg ttacagaagc tattaaaact tccaaaccgt ccaaggcaac 1440 aggaccagac aacctgacaa tatttcacct aaaacacctc ggcccaaagg gaatttcctt 1500 cctaacaaat atttttaacg aatcactacg aacctgtagg attcctgaaa tttacaagaa 1560 atcaaaaatc attcccctac taaaacccgg caaaccagtt aacgacgcga aatcataccg 1620 tcccatatct cttctatgtc cggcagctaa agtatttgaa aaatgcattc tgccgatcat 1680 ctcgaatcac ctaaccttag cgagccatca gcatggtttt cgcccacttc attctacaac 1740 aacagctctc accacgatgg tcaccgacat cgcaaacggc ctaaatcaga agaggccagc 1800 tcaccgaaca atcttaacag cattggacct cacaaaggcc ttcgataccg ttaaccatcg 1860 catcctaatc tccgacctca tgtcatcatc gcttccacgg tcaattgtcc gctggttgtc 1920 aaactatctg catggacgct ctgcatcaac agagttccgt cgacaatttt caaaacaacg 1980 caccatccgc tcaggagtgc ctcaaggttc tgtattgtca ccaactctgt ttaactacta 2040 cgtctccaaa ataccaagcc ctcctccaga cgtcaaagtt atctcttatg cagacgattt 2100 ttcaatctac actacaggtc cgaatgcaga tgatttgaca cttcggctca attgctacct 2160 tccgaagctt tacgacttct tcaatagcag aaacttggag atttccacct ccaagtccac 2220 agtcaccctc ttcaccactc acacttcaca gtatcactat catccagcaa taaaaataaa 2280 taacaattta ttaccactcg aacataatcc aaaaatatta ggcctcacgt tagacacgat 2340 gctgtccttc tcacaacacc ataaagaaac agcatccaag gcctcaataa ggaataatat 2400 tttaaaggca ctaagtggta caacctatgg ccaggacaag gaaactcttt tacagaccta 2460 caaaaccatc ggtcgttcaa caatcgagta tgcctgtcct gcctgggcat cattgacatc 2520 agactcaagc ttctcacgtc ttcaacgctc ccagaatgcc gccctgcgtc tgattacagg 2580 ttgccatgtg atcgcttcag aacggcacct tcattcagaa tgtcagatgt taacggtaag 2640 gcagcattcg gagctaatag catcacaatt cctactccta tgccataacc ctagccaccc 2700 gtgccatcat atcatcacgc aacaacaccc gcaccgcaac ataaagacca ccctcctctc 2760 caaatgggct tcaacaaccc atgcggttct accaacaacc ataaatagcg attctctgaa 2820 gtccgctcta aaaacacttc acaataacac catcaaagcc gccacagaga cgtatcagtc 2880 gccgttgttg ttgcttgacg gatggcctcc acctacacca aaaatagatc caacagaaga 2940 atcacttccc agggcaatca gacggaccct agcccaactc cgttctggcc atagtattct 3000 tctaaacagc tacaaaaaca gaatcgacag tgggcacctc aacatctgtc ctttctgcaa 3060 caatcataca gatgatgtac cgcatctgtt tagctgtcca caaaacacca caaaactaca 3120 accaatagac ctctggagaa acccaattgc agtggcggtt tggctccggc ctcgattgga 3180 accctagggg tcccccagaa tggggcaaca acaa 3214 // ID BEL2-I_Dmoj repbase; DNA; INV; 5850 BP. XX AC scaffold_6541; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_Dmoj; KW BEL2-LTR_Dmoj; BEL2-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-5850 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1010-1010 (2009). XX DR Genome; scaffold_6541; Positions 203701 197852. XX CC Positions [4863-5471] - Integrase core CC 'CCTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 609..5828 FT /product="BEL2-I_Dmoj_1p" FT /translation="MANLDASAVAPEDRVRFYKLKVQSVFERVKKLQSKVD FT SDALADHSDSTLNVLLEHIDKLSHSFSKAHESLEELDFTEMSSNLRTDFDD FT LIMVMQSTIMSEVQSRTAQVHHSSTFSRPSDPPRASAPRLQAAPALPPLKL FT PTFSGGYANWADFYSMFSTIIESRPEISNMERLQHLRSCLDGAALDTVRSL FT EISNDNYTIAINLLKQRYDNRRLIFQAHIVKILGLKRVENGSITKLRELSD FT SFNVHMRALSSLGTTEQIASAMIAQILLQKLDEASQAKWEEKLDNSETALQ FT IPRYEEVSSFLELRCRTLESMKFALTNYSSSKPMNSCSKVATSRSAFLVTG FT HSTHSCNFCNDLEHTIYKCPRFANLSPSLRLNEVRKAGLCLNCLKGGHQSR FT QCGSRSCRVCGVKHHTLLHLGHSSTSQAAIPPHQQSSTSRVQTIPQPSAPA FT TTLLSKDRHSDVVLLATAVVLARNRFGELVPCRALLDSGSQLHLITARFAN FT LLQLKRTKSVASVCGVGDSNVAMDGSSICLTLQAHASEYTTSITAMVATNI FT TGRQPSSNVNTSNWSIPQNIVLADPAFHRPQRVDLLIGASLFYDLLCVGQI FT KLMPGLPLLQKTRLGWVVSGGEARNHNSVLIASKTPLDPITDSCADIKLDS FT LVRRFWEVERCSDSIVKFSKEDSDCEAHFARHYNRLASGQYTVRLPVKLSC FT ELLGDSYEQARHRFLNLEKKLSRLPHIKSQYSAFLKEYLELGHMSRVPLNS FT FHLCRYFLPHHCVLKDDSTTTKLRVVFDGSASTTTGHSLNDILMSGPVIQP FT KLVDILLRFRSYPVALTGDICKMYRCVKVPEPDSYLQCILWRNSPDDELEV FT FRLDTVTYGTKPASFLSVRAMHQLAVDERNAFPVGSDVLLRDFYVDDLISG FT GNSVEEARLVMQETAGILACGKFKLRKWCSSHPMVLDGVTDCDKESLIRFN FT DGSDITKTLGLAWDPASDQLLFSYLALNSASKPSRRSVLSSIARFYDPLGL FT VGPVITKAKIFLQQLWRHKLDWDESLPSACHTAWLDICSNFGNGLSSAFPR FT LSLIPGSKVEVHGFCDASIEAYGACIYIVSRGQHTISRLLCSKSRVAPLKT FT LTVPKLELCGAELLARLMNEVARLGIFAGEFHCWSDSTVALSWINDEPGRF FT NVFVSNRIATIQDLTSAMEWHYIPTTLNPADILSRGALPSDLNASLHWFNG FT PGFLCKPRSQWPVSALPERTSIERRRTVLLVKDAVHDVSMDCKFVNSFGRL FT QRTFAYVSKFIKKSHGPGITLADIRNGTHILLRMIQRTHLSDDINALRNKG FT MVPSSSSLASLSPFLDEIGLLRVDGRLKNSTLNFDSRHPIILPKVHPVTRA FT IIMHFHQRNLHAGPRAVLALVRSQYWPIGGRKTVASVIQRCVICFRARPRL FT VEHIMADLPKDRVDGYRVFGVTGVDFCGPFFYKPESRNKAPVKCYVCVFIC FT FATKAIWLELVKDLTTAAFLHALQRFICTRGRPSQIWSDNATNFLGARNQL FT QELKRLFLSDTHQKAVKDFCLADSIEWHFIPPRSPHFGGLWEAAVKTAKHH FT FYRCVGSSVLTYDELRTLVCNIAAVVNSRPLVSISEHPSDIDVLTPAHFIN FT GGPPSSFSEPNLTGLNFNRLDAWQRVSYLQQVFWTRWRDEYLNLLQQRAKW FT RTSKPGLAVGNIVLVKDENLPSLKWPLARIIELIPGSDGVARVALLKTEAG FT EIRRATNKLCLLPLKDSVED" XX SQ Sequence 5850 BP; 1518 A; 1376 C; 1313 G; 1643 T; 0 other; tttggtgacc ccgacgtgat tctctttcgt ttctttacat atttgcatat atatattgac 60 agcatcacat acatacatac acccgtgaac ataagcattg tatatagcga ttagttgcgt 120 gccgttactt tgccaattcg ccagacactc tttcgttcgg tctgcaatac actggcgccc 180 gtaaaagtag tacacatgcg aaaacttctg caaatgacaa tctaataaag aatcaactaa 240 tttctgaact atgtctatgg ttagtgactg cgtgttctat agcttcaatc tataagtgat 300 taagtttaca taagcacagt gtaattacat gcttgccgtt attcttcgtt gacattgcac 360 tctttcacca cggctgcgat acgagcgcgt ccataataat cgtaccggtg tgcaacaatt 420 catacgtgta tattatttga gtaccataat ataattcttt gccagtttat taaagttgca 480 agtactctgc tcgacaaata cgacttattc caagtctgcg ctatagcata tatacattta 540 cttactcttg caatcgcgct cgcaccaacg actttgaact ctatcgtgtt gctcttgctg 600 tactcgtcat ggcgaattta gatgctagtg cagttgctcc cgaagatcgc gtgcggtttt 660 ataagctcaa ggtacagtcg gtatttgaac gcgtaaaaaa gctacaatcc aaggtcgatt 720 cagatgctct tgccgatcac agtgacagca ctctaaatgt tctactggag catatcgaca 780 agctaagcca ctcctttagc aaggcccatg aaagtctaga ggagctcgac tttactgaga 840 tgtccagcaa tttgcgcact gactttgatg atttaataat ggtcatgcag tcaacaataa 900 tgtccgaagt gcaaagtcgt actgctcaag tgcatcatag ttccacgttt agccgtccaa 960 gtgaccctcc tcgcgcatcc gctcccagat tgcaggctgc acctgcgttg ccgccattaa 1020 agctacctac gttcagcggt ggatatgcta attgggctga tttctactca atgttttcta 1080 caatcattga aagtcggcca gaaatcagca atatggaaag acttcagcac ctccggtcct 1140 gcctcgatgg cgcagcgttg gatacggttc gctctttgga aatttctaat gataattata 1200 caattgccat aaatttatta aagcagagat atgataaccg acgtttaatc ttccaggccc 1260 acatcgtaaa gatcttaggt ctcaaaagag tagaaaacgg atccataacc aagctgcgtg 1320 agctatctga cagcttcaat gtccatatgc gtgcgctcag tagtctgggc acaacagagc 1380 agatagccag tgccatgatt gctcagatac tgctacagaa gcttgatgag gcctcacaag 1440 ctaaatggga agagaaattg gacaactcag agacggctct tcagataccc agatatgaag 1500 aagtctcctc atttcttgag cttcgttgcc gtactttgga atcgatgaaa tttgctctta 1560 caaactattc gtcaagtaag ccgatgaatt cttgcagtaa ggttgctacc agtagatcag 1620 catttctcgt aaccggccac agtactcaca gttgcaattt ttgtaatgat cttgagcaca 1680 caatctacaa atgcccaaga ttcgcaaact tgtctcctag tcttcgcctg aatgaggtcc 1740 gaaaggctgg cttatgtttg aattgtctca agggaggtca tcagtcgcga caatgtggct 1800 cgcgcagttg tcgggtctgt ggagtcaagc atcatacgct tcttcatctc ggccactcca 1860 gcacatcgca ggcagcaatt ccgccccatc aacaatcttc aacatcaaga gtacagacca 1920 tccctcagcc ctcagcacca gccactactc tcttatccaa ggatcggcac agcgatgttg 1980 tgctcctggc taccgcagtg gttctggcaa ggaatcggtt tggcgagctt gttccttgtc 2040 gtgcactttt agattcaggc tctcaactac atctcattac agccagattt gcgaatctcc 2100 tgcaacttaa gagaacaaaa tcggtggcat ctgtatgcgg cgtaggcgac tccaatgttg 2160 ccatggatgg cagcagcatt tgcctcactc ttcaagctca tgcatctgaa tatacaacta 2220 gcataacggc tatggtagct acaaatataa ctggcaggca gcccagttct aatgtgaata 2280 cgagcaattg gagcataccg caaaatatag tgctggcaga tccagctttc cataggccgc 2340 aacgagtaga tctgcttatc ggagctagcc tgttttatga tttgctctgt gtgggacaaa 2400 tcaagttgat gcctggactt ccattgttac agaagactcg tcttggttgg gttgtatcag 2460 gtggcgaggc tcgcaaccac aactccgtgc taatagcttc aaagacgcct ttggatccga 2520 ttacagactc ctgtgctgat atcaagttgg acagtctcgt tcgtcgcttt tgggaagttg 2580 agcgctgcag cgattccatt gtaaagttca gcaaggaaga ctcagattgc gaagcgcatt 2640 ttgcgaggca ttacaatcga ttggctagtg gccaatatac agtacgcctg cctgttaagc 2700 ttagttgcga gcttcttgga gattcttatg agcaagcgcg acatcggttt ctcaatttgg 2760 agaagaaatt gagtcgtttg ccgcatataa agtctcaata ctcagcattt ctgaaagagt 2820 atcttgaact tggtcacatg tcacgcgttc ctctaaattc attccatcta tgcagatatt 2880 ttcttcctca tcactgcgtt ctgaaggatg atagcacaac taccaagctt cgcgtcgtct 2940 ttgatggatc tgcatctaca acaactgggc attcgttgaa tgacattcta atgtcaggtc 3000 ccgtcattca gccaaaatta gtcgacatat tattgcgctt ccgatcttat ccagttgcac 3060 tcaccggcga tatttgcaaa atgtatcgtt gcgtgaaagt acctgagccg gacagctatc 3120 tacaatgcat tttatggagg aattcgccag atgatgagtt ggaagttttc aggcttgata 3180 cggtaacata tggaacaaaa ccagcgtcat ttttgtctgt gcgcgccatg catcagcttg 3240 ccgtagatga aaggaacgca ttcccagtcg ggtccgacgt actgcttcgc gatttctatg 3300 ttgatgattt gattagtgga ggaaattcag tcgaagaagc aagacttgtt atgcaagaga 3360 cagctggaat cttggcatgt gggaagttta aattgaggaa gtggtgttca agccatccca 3420 tggtactgga cggcgtaaca gactgcgaca aggaatcgct cattaggttc aacgatggca 3480 gcgacattac aaagacactt ggccttgcat gggatccggc atctgatcaa cttcttttct 3540 catatctagc tctgaattca gcctctaagc cctccaggcg ctctgttctc tcgtctattg 3600 ctaggtttta tgatcccctc ggcctggtcg gcccagtaat aacgaaagct aaaatcttct 3660 tgcaacaact ctggaggcat aaacttgatt gggatgagag tctcccgtct gcatgtcaca 3720 ctgcatggct agacatttgc agcaactttg gcaatggtct tagtagtgca tttcccaggc 3780 tatcattaat accaggatcg aaggttgagg tccatggatt ctgcgatgcc agcatcgagg 3840 catacggagc ttgcatttat attgtatcca gaggtcagca taccatcagc aggttgcttt 3900 gctctaagtc tcgagttgcg cctctaaaaa cgctcacggt tcccaagctg gaactatgcg 3960 gggcagagtt gcttgcgcgt ctaatgaatg aagtggctcg cttaggcatt ttcgcaggcg 4020 agtttcattg ttggtctgac tctactgttg cgctttcttg gatcaatgac gagcccggac 4080 gatttaacgt ttttgtttca aatcgcatcg ccacgataca agatctcaca tcagcaatgg 4140 aatggcacta tatacctaca acactgaatc ccgcggatat attatctcgg ggtgcattgc 4200 cctcggattt aaatgcatca ttacattggt tcaacggtcc aggtttcctc tgtaagccca 4260 gatctcagtg gccagtatca gcacttccag aaagaacttc aattgagcgt cgaagaactg 4320 tgcttcttgt aaaggacgca gtccatgatg tctctatgga ctgcaagttt gtgaattctt 4380 tcggccgctt gcagcgtacg tttgcatacg tatcaaagtt tattaaaaaa agtcatggtc 4440 caggaataac cctagcagac atccggaacg gcactcatat actgttaaga atgattcaac 4500 gaacacatct ctcagacgac ataaatgctc ttcgaaataa aggaatggtt ccgtcatcca 4560 gcagtctagc ctcgctgtca ccgttccttg atgagattgg gctgctacgt gtcgatggtc 4620 gtctcaaaaa ctcaacttta aatttcgata gtcgccatcc aattattttg cccaaagttc 4680 atccagtaac tagagccatc ataatgcatt tccatcagcg aaacttgcac gctggtcccc 4740 gagcagtgct agcgttggtt cgttcccaat attggccaat tggaggtaga aaaactgtag 4800 ccagtgtaat tcagagatgc gtaatttgct ttcgtgcaag gccccgactc gttgaacaca 4860 taatggcaga tctacccaag gatcgagtcg acggctatcg ggtctttgga gtgactggag 4920 tcgacttttg cggacccttc ttttacaagc ctgagtcacg aaacaaggcg cctgtgaaat 4980 gctacgtttg tgtgtttatt tgttttgcta cgaaggctat ttggttggag ttagtcaagg 5040 atttaactac cgcggcattt ttgcatgcac ttcaacgttt catatgcacc agaggaaggc 5100 caagtcaaat ctggtcggat aacgccacaa actttcttgg tgcccgcaat cagcttcagg 5160 agttgaagcg tctatttctg tcagacaccc accagaaggc tgtgaaggat ttctgtttgg 5220 cagactcaat cgaatggcat ttcatccctc cacgatcacc tcactttggt ggcttgtggg 5280 aggctgcagt gaagacagcg aagcatcatt tttaccgatg tgttgggtcc tccgtcctaa 5340 cttacgatga gcttcgcaca ctggtttgca acatcgctgc tgtcgttaat tcacgccctt 5400 tagtttcgat ttcagagcac ccatcagaca tagacgtgtt gactcctgcg catttcatca 5460 acggtggccc tccttcgtcc ttcagtgagc caaacctgac agggctcaat tttaatcgtt 5520 tggatgcttg gcagcgcgta agctatctcc agcaggtgtt ctggactcgt tggcgggacg 5580 aatatctcaa cttgttacaa cagcgagcga aatggagaac ctccaaacct ggcttggctg 5640 ttggcaacat agttcttgtg aaggatgaga atctaccttc acttaagtgg ccactagcca 5700 gaataatcga acttatccct ggaagtgatg gcgttgcacg ggtagcccta ctaaaaactg 5760 aagctggcga gataagaaga gctacaaata agctgtgctt gctgcccctt aaggattctg 5820 ttgaagacta agtcttcaac ggggggagga 5850 // ID BEL-198_AA-I repbase; DNA; INV; 5561 BP. XX AC supercont1.1432; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-198_AA_; KW BEL-198_AA-LTR; BEL-198_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5561 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1432; Positions 74888 69328. XX CC Positions [4569-5153] - Integrase core CC 'GGCAC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 208..3621 FT /product="BEL-198_AA-I_1p" FT /translation="MDKLIRERKSLEPRLKRISDTVDKIRPEEAEEVDIQI FT ELDALSDVWAAFCSVHKKILDNSEDDEAYDDAVHRQGRFEACYRALKNRLL FT KMLKVVKDRDGVVSQQIPPNDVIRQLADQQAEFLRLMSSNMAAGPSSSAVV FT HSNAPASPLSDLKLPRMNMPIFSGNYLEWQSFYDLFDSLVHQNPSLKDSQK FT LYFLKTNLAGEAASLISHLKIEDANYQSALQKLKSRYDKPREIANQHIKRF FT LAQPALTSSSSQGLRSLHDVSDEVIRALKAMNREDRDTWLLFILSEKVDPD FT TKHLWCQKMAEMDEANINLQCFLKFVESRSFALQAAQPSKPKISVPFKQPL FT KAAPQNRGANAFVATNPPFCNVCNKQSHHLYQCGKFIHMNYEDRLAHVTRM FT RLCNNCLKKHSGESYQSGTCRKCNLPHHTLLHPVSSFASSQPSTSSSVRAP FT ANPGTAAQSLISALDSPTCLDASSVLLATVAINVLDCHGRPYACRAVLDCA FT SQVSFISHDFCRQLGLKTSAANMDLEGISSTPAHADKCAEIIIASRCTDYR FT TSVSCMVLERITKMLPCKPANIDDWPIPGSIHLADPLFHRPGKVEVLLGIE FT LFFQLLEPGKIALSPDDSLPTLQNTKLGWVVAGRYHDSNTSIRSHASTCLL FT TSTDDDLSQQLRKFWELEEYAPTSNHLTEEEQRCEQHFADHTIRDETGKFI FT VRLPFLMDPNQLGESRLIAEKRFRHIERKLDRNPPLRSEYHAFIREYIDLG FT HMSLVEDTTTASKSVYLPHHCVVKSTSSTTKCRVVFDASAKTTSGLSLNDV FT LMCGPVVQDSLINILVRFRFPPIVLVGDAKQMYRMVWLHELDRDFLKILWR FT WSKDDPIEEYRLNTVTFGTKSASYLATKCVQQLLNSYREQYPAAVEKAEKG FT IYVDDVLIGADSEEEARLLRDQLIEMFGAGGFHLRKWASNSAAVLEGVPVA FT DLEMKIPIEESGSCTIKALGMQWQPCSDEFQFSYQPTEILQPTKRSILSQI FT ASLFDPLGLLAPIIVKAKLVMQRMWELKVAWDATPPGELTNDWLVLVQKFS FT LLNSFQIPQRVIDMRNWTRLYTATATHLMWPWALASIFELWATTVVLRPIY FT SALNRSWHLLATVERLHPGLNCVRQLFSHD" FT CDS 3471..5561 FT /product="BEL-198_AA-I_2p" FT /translation="MGACIYIRAVGNDGSTSSHLLCAKSKLAPIGNGRTTT FT PRLELCAAVILARLITNVRAALSTTIFYEVRAFSDSKVVLAWLAGGAARWK FT TFVANRVAEISTHLPSINWTHVGTLHNPADLISRGAFPEQLHTNTLWWHGP FT AWDPSESKNETVPNSDLDTIDRRHVDREQRTTVVVCFTVYENRFLDDMMGR FT YYPDLKILLRVTARILRFGHPEFRDSTRLSPDEITSALRKYLQHTQQQNFS FT GELSRLQKGLSVDRSSSLRQLNPFLDEHGLVRVGGRLQESDLSYDNKHPIV FT LPRHSILTSLILLHEHQEHLHCGPQSLLAATRTRFWILRGSSAARKICRDC FT VKCNLVQPARIHQQMGQLPADRLKPLPPFAITGVDYAGPVNIVGRRTRGAV FT PSKGYISLFVCLGTRAVHLEAVSDLSAPSFLAALTRFTSRYGVPSKMYSDN FT ATNFRAAARTLRELYQQIDATEHSDEVNDFFTDKQVQWLFIPARSPHHGGL FT WEAGIKVAKTFLNKIGGNYNYTFEELSTLLAQVAACMNSRPISPISDDPAD FT PQPLTPAHFLIGRPLDALPEVNHLEQQVSSLSRWKYVQRVAQDFRARWQSE FT YVLSLQKLSKWQQSSPNIAEGDFVLLVDDNEKPQQWPLGRVQELFPGNDGH FT VRVVAVKTAKGVFRRDVRKLRRFPLDNDEYVPGRHGVEIPTRNLVGGL" XX SQ Sequence 5561 BP; 1403 A; 1489 C; 1341 G; 1328 T; 0 other; tatggtcctt cgagccggat cgcgaagtct tcgccgcgaa ccgcgtagtc tgtgaccgct 60 agtgcccgta aattggcccc gtcgccgccg tatcggcccc gagtgcttct gccatttggc 120 ataatagtgg tgcagaacaa agcagcaagt gaggattgtg tcggacagtg acttttgttg 180 aacgacaagt gcgtgaaatc caccaggatg gataagctaa tccgcgaacg taaatcgctc 240 gaaccaaggt tgaaacgtat ttcggatacg gtggacaaaa taaggccaga agaagccgaa 300 gaagtcgata ttcaaatcga gctggacgct ttgagtgatg tgtgggctgc gttctgttca 360 gtgcataaga agattctgga caatagtgaa gatgacgaag cttacgatga cgcagtgcac 420 cgccaagggc gtttcgaagc atgctacaga gcattgaaga atcgtttatt gaaaatgcta 480 aaggtagtga aagatcgtga tggtgtcgtt tctcagcaaa ttccgccgaa cgacgtcatc 540 agacagctag ccgatcagca ggccgagttt cttcgactga tgtcttccaa tatggctgcc 600 ggccctagtt cctcggccgt cgtgcacagc aacgctccag catctcctct ctcggatctc 660 aagctgccta ggatgaatat gcccattttc agcgggaatt acctagagtg gcaatccttc 720 tacgacttgt tcgacagctt ggtgcatcaa aatccgtcac tgaaggacag ccagaagctg 780 tacttcctga aaaccaacct cgccggtgaa gctgcatctt tgatttccca cctgaaaatt 840 gaagatgcca attatcaatc ggcactgcag aagctgaaat ccaggtatga caaaccacgt 900 gagatcgcta atcaacacat caagcgattt ttagctcagc cagctctaac gtcatcctca 960 tcacagggct tgcgatcgct tcatgacgtg tctgatgagg ttattcgggc cctcaaagcg 1020 atgaataggg aagaccgcga cacgtggctt ctgttcatac tgagcgaaaa ggtggatccc 1080 gataccaagc atttgtggtg ccagaagatg gccgaaatgg atgaggcgaa catcaacctc 1140 cagtgcttcc tcaagttcgt cgagtcccgc agtttcgctc tccaagccgc tcaaccgagt 1200 aagccgaaaa tcagtgtgcc cttcaagcaa cctttaaaag ctgcacctca aaaccgagga 1260 gctaacgcat tcgtcgccac gaaccctccg ttttgcaacg tttgcaacaa gcaaagccat 1320 catctctacc aatgtggtaa gttcattcac atgaactacg aagatcgact cgctcacgtc 1380 actagaatga ggctgtgcaa caattgcctg aaaaaacatt ccggtgaaag ctaccaatca 1440 ggtacgtgta ggaaatgcaa cttacctcac cataccttgc tgcacccggt ttcgtcattc 1500 gcaagttcac aaccttcaac atcctcatca gtcagggccc ctgcaaatcc tggaactgca 1560 gctcagtcct taatttcggc cctcgactca ccaacctgcc tcgatgcgtc aagcgtccta 1620 ctagcaaccg ttgccatcaa tgtcttggac tgccatggtc ggccctacgc ctgtcgtgcc 1680 gttctagatt gcgcgtcgca agtcagcttc atcagccatg atttttgccg acaacttggt 1740 ctcaaaacat cagcagccaa catggatctc gaaggcattt cttccacccc agcacacgcc 1800 gataaatgcg ccgaaatcat cattgcttct cgctgtacgg attatcgcac ctccgtatcg 1860 tgcatggtgt tagaaagaat taccaaaatg cttccctgca agccagcgaa tatcgatgat 1920 tggcctatac caggatcaat tcatctggcc gatccactct tccatcgccc gggtaaggta 1980 gaagttttac tgggcattga attattcttt cagctgctcg agcctggcaa aatcgccctc 2040 agtcccgacg acagcttgcc aacactacaa aacaccaagc tcggctgggt ggtcgctggt 2100 cgttatcacg attcgaacac ttcgattaga tctcacgctt cgacctgcct cttgacttcc 2160 accgacgacg atctctctca gcagttgcgt aagttttggg aactcgagga gtacgctcca 2220 acgtcgaacc atctcaccga agaagagcaa cggtgcgaac aacactttgc tgaccacacc 2280 attcgcgatg agacgggaaa atttattgtg cgtctcccat tcttgatgga tcccaatcag 2340 ctaggtgagt cccgactaat tgcagagaaa cgctttcgcc atatagagag aaagcttgac 2400 cggaacccac cgctcagaag tgagtaccat gcgttcatcc gggaatacat tgatcttggg 2460 cacatgtcac tcgttgagga tactaccacc gcaagcaaat cagtttatct cccacaccat 2520 tgcgtggtca aatcaacaag ttcgaccacc aaatgccgag tggtgttcga tgcgtcggca 2580 aaaactacca gtggtctctc tctgaatgat gtgctgatgt gtggccccgt agttcaagac 2640 tccctgatca atattttggt tcgatttcgc ttcccaccaa tcgttcttgt tggtgatgcg 2700 aagcaaatgt atcgcatggt atggctgcat gagctcgatc gcgacttcct caaaattctg 2760 tggaggtgga gcaaggatga tcctatagag gagtatcgtc taaataccgt tacctttggg 2820 accaagagcg catcctattt agcgacgaaa tgcgtgcagc aacttttgaa ctcctatcga 2880 gagcaatatc ctgcagccgt tgagaaggcg gaaaagggca tctacgtcga tgacgttctc 2940 atcggcgcag actctgaaga agaagcaagg ttgttacgag atcagctcat tgaaatgttt 3000 ggcgccggcg gattccacct ccggaaatgg gcttcgaaca gcgcggcagt attagaagga 3060 gttcctgttg ctgacctgga aatgaaaatt ccgattgagg agagcggtag ctgtacaatc 3120 aaggccctcg gaatgcagtg gcagccgtgc agtgacgagt tccagttctc ctaccaacca 3180 accgagatcc ttcagcccac aaaacgcagc atcctatcgc agatcgctag tctcttcgac 3240 cccctcggcc ttctagcacc aatcatagtg aaggcaaaat tggtgatgca gcgtatgtgg 3300 gagctgaagg tggcctggga tgcaactccc cccggtgagt taaccaacga ttggttagtt 3360 ttggtgcaaa agttctcgct tctcaactcg tttcagattc cccaacgtgt aatcgacatg 3420 cgcaactgga ctcgtctcta cacggctact gcgacgcatc tgatgtggcc atgggcgctt 3480 gcatctatat tcgagctgtg ggcaacgacg gtagtacttc gtcccatcta ctctgcgcta 3540 aatcgaagct ggcacctatt ggcaacggta gaacgactac acccaggctt gaattgtgtg 3600 cggcagttat tctcgcacga ttgatcacca acgtcagagc tgctctttcc acgacgatct 3660 tctatgaggt gcgagcgttc tccgacagca aggttgtctt ggcatggctg gctggcggcg 3720 ctgcgaggtg gaaaaccttc gtggctaatc gtgtcgcaga gatatccact caccttccgt 3780 ccatcaactg gactcacgta ggtactcttc acaatccggc ggatttaatc tctcggggag 3840 cctttcccga acagcttcac actaatactc tttggtggca tggaccagcc tgggatccat 3900 ctgagagtaa aaacgaaaca gtaccaaaca gcgatttgga cactattgat cgacggcacg 3960 tagacagaga gcaacgaacc acagtcgtgg tttgcttcac tgtgtatgag aatcgttttt 4020 tggacgacat gatgggcagg tactacccgg atctcaaaat ccttctccga gtcacagctc 4080 gcattcttcg ttttggacac cctgagttcc gagactctac ccggctctct ccagatgaaa 4140 tcacatcggc tctgagaaag tacctgcagc acacacaaca acaaaatttc tccggggagc 4200 tcagtcgatt gcagaaaggt cttagcgtcg atcgtagtag ttcgctacga caactcaatc 4260 cattcctgga tgagcacggc ctcgtcaggg tgggcggcag gctccaagaa tcggatctca 4320 gttacgacaa caagcatccg atcgtgctgc cccgacattc gatattgaca tcgttaattc 4380 ttctgcatga gcaccaagag catctccact gtggccctca gtcgttatta gctgctacac 4440 ggactcggtt ctggatacta cgcggcagca gtgccgcccg aaagatctgt cgtgattgtg 4500 tgaaatgcaa tctagtccaa cctgctcgca ttcaccaaca aatgggacag ctacctgccg 4560 acagattgaa gccactgcct ccgtttgcca tcacaggcgt tgattacgcc ggtccggtga 4620 acatcgtcgg acgaagaaca cgtggagcag taccatcaaa aggatacatt tcgctgttcg 4680 tttgcctggg aacacgagca gtgcacctag aagcagtatc ggatctaagt gccccttcgt 4740 ttctcgctgc cctcacccgg ttcaccagtc ggtacggagt gcccagcaag atgtattccg 4800 ataatgcgac gaatttccga gccgcagcta ggacactccg tgagctatac cagcaaattg 4860 atgcaactga acacagcgat gaagtcaacg actttttcac cgacaaacag gttcaatggc 4920 tcttcatacc cgctcgatct ccacatcacg gtgggctgtg ggaggctgga ataaaggtgg 4980 cgaagacctt cctcaacaaa atcggaggta actataatta cacctttgaa gaattaagca 5040 ctcttcttgc ccaggttgct gcgtgcatga actctcgtcc gatttctcca atttcggatg 5100 accctgcgga tcctcagccg ctcacgcccg cccatttttt aatcggtcgc cccttggacg 5160 ctttgcctga ggtcaaccat ctcgaacaac aagttagttc cttatcaagg tggaaatatg 5220 ttcagcgagt cgcccaggat tttagagctc ggtggcagtc cgagtacgtt ctgtcactgc 5280 agaagctatc aaaatggcaa caatcatccc ccaacattgc cgaaggtgat tttgtgctcc 5340 tggttgacga caacgagaag cctcagcagt ggccgttggg ccgtgttcaa gagcttttcc 5400 ccggaaatga tggacacgtc agagtggttg cggtcaagac agctaaggga gttttccgtc 5460 gagacgtcag gaagcttaga cggttccctc tcgacaacga cgagtacgtt cctggaagac 5520 atggagtgga aattcctact cgtaatttgg tgggcggatt a 5561 // ID Gypsy-227_AA-I repbase; DNA; INV; 3020 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-227_AA_; KW Gypsy-227_AA-LTR; Gypsy-227_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3020 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1057-1057 (2011). XX DR [2] (Consensus) XX CC 'TACA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 422..2188 FT /product="Gypsy-227_AA-I_1p" FT /translation="MATMQDVNVWPFQPISGDAPSASNPQLQFALSENGIQ FT FNAIDNTFMRLVDQLKFSNTAVADLTQELSVMRSEMNSMRAETSRLSSVIE FT KIRSGSTVPQQASTPNGGRCDEGGHVRVEQMRRGATSENIAMNSKLAVGNN FT AMSDGYGRGHDNVLRRYVRDVNEKANNNLVGDGPNNKFVTRDLQNLFADRK FT CTYENVEFGKDRNSRKTVIKSCCGSEAYGGLRGDYHLEAHAHDIGSRIEQQ FT LSVSDAECAFSEFSGTDFYPVRKWIDDFEELSDSIGLTELRKFVIAKRKLT FT GLAKISLNSTNNVSNWRTLKDILINEFEFSENSAVVHERLRNRKRKLGENV FT LKYFLQMREIGAKANVDISSIITYTINGINDNGPDKTILYGSKSMDEFREK FT LREYKRIKDNKYCRDYKSKSSASERFFSSGRPNGQKNFENRNDILPKTVKN FT YNCGKGGRVPTKYPKNIGSNYVNLCYSDPSSLKRNSLSIFVHTIQCQALFD FT SGSSVSLLREGWTIKMNFKINRNDRKRLMTFKGVIWTLGSVALNIVIENTP FT LNITFDVIRDKDMENNIVLGKNLLIFGVVNITGIAQKTKFYA" XX SQ Sequence 3020 BP; 1004 A; 449 C; 698 G; 845 T; 24 other; aatttggggg ctcgtccggg aacgcgtgat gaaactgtcg tgatattcgt tagtgtgcaa 60 gattgagcgt gaagctgaaa tttgctgtga aatagcattg taaagaaact tttattttga 120 gtgttctgaa tttcaatagt tttttagata gatatgaaat aaataattta gtgttaagac 180 attgagtaag cttgattagt gttatttagc gataggaaaa ttgaattcgg attttttttt 240 agcatactgt aaataaaaca aactgccact gattttttcc cgccgcgaaa aaaaactact 300 agtgaactgt tgtaaaagtg gattgcattg aatttcagat ttgacctggg gttaagggtc 360 attgtaccaa aatagataac gcgtgtgatt tggtgcggta aaacggaaca gaaaaaatag 420 aatggcgaca atgcaagacg tgaacgtttg gccttttcag cccatcagcg gtgatgcgcc 480 ttctgcttct aatccccaac tacagtttgc cttgtctgaa aatggcatac aatttaatgc 540 aattgacaat acgttcatgc gtttagtgga tcagctaaaa ttttctaata cggctgtggc 600 ggacctaacg caagagctta gtgtgatgcg tagtgagatg aactctatgc gggcagaaac 660 atctcgttta agttcagtta ttgaaaaaat acgttctggc tctacggtac cacagcaagc 720 aagcactcca aatggaggta gatgtgatga aggtggacat gtaagagtcg aacaaatgag 780 acgtggtgcg acaagtgaaa atattgcgat gaacagtaag ctggctgttg gaaacaacgc 840 aatgagtgac ggctatggtc gtggtcatga taatgtgttg agaagatatg ttcgagatgt 900 gaatgaaaaa gcgaacaata atctggtggg tgatggcccg aataataaat ttgttactag 960 agatttgcag aatctatttg ctgatcgtaa atgtacgtat gagaatgtcg agtttggaaa 1020 agaccgaaat agtagaaaaa cagtgattaa gagctgctgt ggctctgagg cgtacggtgg 1080 cttgcgtggt gattaccact tggaggctca tgcgcatgat attggaagta gaatagaaca 1140 acaattatcg gtaagtgatg ctgagtgcgc attttctgaa ttttctggaa ctgattttta 1200 tcctgttcga aaatggatcg atgattttga ggaactgtcg gattccattg gcttgacaga 1260 gttacgcaaa tttgtaattg caaaacgaaa attaaccgga ttagctaaaa tttcattaaa 1320 ctccactaat aatgtctcta actggagaac gttaaaggat attttaataa atgagtttga 1380 gttttcagaa aatagtgctg ttgtgcatga gagattacgg aatcgcaaaa gaaaattggg 1440 cgagaatgtc ctaaagtatt ttctgcaaat gcgggaaata ggggcaaagg ccaatgttga 1500 catttcatca attattacgt acacaatcaa cggaataaat gataatggtc cagacaagac 1560 tatattatat ggatcaaaaa gtatggatga gttcagagaa aagcttcgtg aatataaaag 1620 aataaaagat aacaaatatt gtagagatta taagtcaaag tcttcagctt cagaaagatt 1680 ttttagtagc ggaaggccaa atggccagaa gaattttgaa aaccgaaatg acatcctacc 1740 caagactgtg aaaaattaca attgtggaaa aggaggtcgt gttccaacca aatatccgaa 1800 aaatattgga agcaattacg taaatctctg ttattctgat ccttcttcgt taaaaagaaa 1860 tagtttgtca atattcgttc atacaattca atgtcaagca ttatttgatt ccggttccag 1920 tgtttctttg ctgagagagg ggtggacaat aaaaatgaat tttaagataa atcggaatga 1980 tcggaaacgt ttaatgacgt tcaagggagt catatggact ttaggtagtg ttgcgttaaa 2040 tatagtgatt gaaaatacac ctttgaatat tacatttgat gttataagag ataaagatat 2100 ggagaacaat attgttttag ggaaaaattt gttaattttt ggcgtggtaa atattacagg 2160 gattgcccag aaaactaaat tttatgccta acgaaataac ctggaggctt catatgtgca 2220 aaatttcact tataaaatgt cgtgaaaagg ccttctacgt acaaaagtgg aggcgatatg 2280 cgtccgtaaa tcctgtcatt tataatgacg tgcaatgcga gcgctagatt tctaccgaat 2340 caacctgcga kcttatgcgg cttaacttgt gataacgaat stcggtttsm cggctkctac 2400 ggaccgatgc gactcasctt gtacgagtgt acgagacgaa acaaaatgtm caggtkagct 2460 cagmaamgaa gcgcgattta cgcgakgaac tcgtcaaact gacgmtttta tccgckggtt 2520 tscsggtttg atgtaattag tatgaataaa cgaattgcaa cgtgaacttt catgcgattt 2580 ctggtcgkct gkgcgagatc tmggttttat cctaaaaata kccaatttaa gagattsggm 2640 accstgaatc ctgtmatttg kgcgtgtata attgtacgca gttcagaaat caatcaaaag 2700 actctggaaa gtatttcgaa atgcttgata acaccaagaa gaagctaaat tcaaaatgtg 2760 acgtcaatgc gatcaaaatt gggaacaatt taaggaaata tggtaacaga catcaagtag 2820 cacagaatat aaagatttgg atgcagccaa cgatgaagat atggaggact acatccaaat 2880 caaggagtaa ggctaagcaa agcgaagaag attgggagca aaacgacgaa agtgagagat 2940 tgaagctagg caaggaaaga caacatgttg gacgatctga tagcaccatc cgggccggat 3000 ggtggtcagg atggccgatt 3020 // ID BEL-3_BMa-LTR repbase; DNA; INV; 496 BP. XX AC AAQA01001266; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Brugia malayi genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_BMa_; KW BEL-3_BMa-I; BEL-3_BMa-LTR. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-496 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Brugia malayi genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AAQA01001266; Positions 2005 1510. XX SQ Sequence 496 BP; 147 A; 107 C; 60 G; 182 T; 0 other; tgtcgcggat tttaatatcg cgatctaaaa atatgagttg caaattgtag taaagatatc 60 tacgtaattt ccgcaaataa tgcttatact tcggactctt tgtttccaaa tttatgcttc 120 tacatattga aacaatttac atcgttcttt gttagcaatc cttacaatca tgaggatatc 180 tacattgttc tttgttagct atcataatca tgcggttatc taataagctc tttattagca 240 actggtgccc atgcaagcac ctcctctcct ttgtttcgat cattgtctaa catcaatgta 300 gatttataac catactattg tcaattagat ctgtttttaa atcatccaat cctccgaatc 360 agtttttatt caattgttgc ccaaatattt agctggcatc caacacatct ccctgacatc 420 aaacttctcc cccgaataac aatttaacca aatcataaag gtttgtttcc aaaaacattt 480 ttttgcttcc actaca 496 // ID L1-48_AAe repbase; DNA; INV; 5766 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE L1-type non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-48_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5766 RA Jurka J.; RT "L1 non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1403-1403 (2011). XX DR [2] (Consensus) XX CC >99% identical to consensus. 5' end undetermined. Low-copy CC number. XX FH Key Location/Qualifiers FT CDS 239..1882 FT /product="L1-48_AAe_1p" FT /translation="MPVNHKLTPLPHRCPTLLPPTPQVSLSPPRMAATAPL FT TSQHHAPPPAPSRSPTSPLLPAFPALPPRHAPSTTSAHPPPVGVTPISSLS FT ETVECASLTSSVRQGSLEVFLSLGRHDVAAAFAFVRITPQLAVSAPAPFIR FT EAIRAAFPLITFEVLPSSFGSLLVRFQSAGDCTAAVSRSPLLHEDVAIFFE FT RHDGRSVKQPVDVLAVLSVHGFPLTFWHEEFIRQAFQVFCSVVEVDWRCTS FT GLDYSSVRVVVRVEEVTSIPADLCIHDSSSGDSFVCQVKLVDSWPLPVDYG FT QHGTLRPYFGWPPHPVLPPPAPTPHRTTPLQPLRHHLSRNLARQLTVVPLA FT TLQPDQSLVTCLARTLLALVLNQPHRLLLPCLPSFPIIIHKNILPLHPAPA FT FSAPFPPFPLTSCSTVAPAIADVEVLSPSPLPPSSLQPDPDHESSARKSRA FT RTKRTADLAAKNKKIAPASRHSVRLRDKEPANYVDMSSKATNLKALKNELA FT NCSNNLKAKVKRHGLLRKCTLNAADLRALAGTIALPALATAELEQALESST FT C" FT CDS 2055..5609 FT /product="L1-48_AAe_2p" FT /translation="MSSNRLITVISWNTRGLGEDNKCVDVRDVFSSCCPHI FT ACIQETKLSEISAQKFKSFLPASLSGHSFLPADGSRGGIATAWNAAHFTLV FT STSSTTHSLSTVLSYNASDLTFTLTNVYAPADHRYTAEFLGELRSLANLIS FT GPWLVTGDFNLVREPSDKNNDNFNVCLASAFNSAIHDMSLIELPLLDRLFT FT WSNKRTLPVLARLDRTFINGDFESAVPNTSLTSLPHATSDHTPLKITIDTS FT IPKKHCFRFENAWLHDPSFLPAIEPVWSAVTNLAGGATGAFVARLKGTRRV FT AKAWSKNKHTPFLINNCKFIIKLLDLHEELHFLTTGETVLRSLCREKLILQ FT IRARAAYWKQRGKFKVLKEADENTRFHHARASQRLRQNQIRALELDGVRYT FT SHASKAMVLDNHFTALLGAAMDTVWDFDMATLYRGLPKVNLVPLIEPFTEA FT EAWAAIKAMNHNSAPGPDGFGPSFYRASWHIVKSTVMQFLASFHHGDADME FT RINRAYIVLIPKIVGAVAPGSFRPISLQGCPVKIVGKILTSRLQQQVTSLV FT DLDQTGFLKGRSISENFVYATELVQCCYKRKAPTVVLKLDFAKAFDSVSWD FT ALLTVLQARGFPPLWCDWIKLLQVSAKSAVLLNGVPGRWISCKKGLRQGDP FT LSPYMFILVADVLQQLLARDSTIRHPLTADRSCTILQYADDTLIVARADEL FT AMLQLKSILQSFTRATGLDINYNKSTLVPMHVPAPDVTRFVNVLGCAEGAF FT PQTYLGLPLSNEKLNLAAFAPIIASADRYLSGWWASLLNHHGRLTLVNAVL FT DSLPVYAMGALALPPGVIEAIDSRRRAFLWAGEETVSGAKCLVNWERACLP FT RKDGGLGVRDLRLQNTCLLLKLLHRAHNSRDSAWARWLEVEFGGPMSAPDN FT TAAGTHWAALRRLLPDYRLLTTVEVGDGRSTAFWHDCWLSTGPLVDAMPAL FT YSHARRKETSVCNVLSTALRLAFVPRLSTVAARELEQLEALLAGVSLSATP FT DLRLCPWEDMAHKLNSSTVYQAVVTNGQGCEYYKFIWENRAPPKVKFFGWL FT LVQNRIQTKDNLLKKHCVENDICELCLSAVESSVHLISGCPFAVGFWARLG FT ISLTDEDASHLWCVRSPGHLPAAHFNVFLLLCCWRLWKHRHDVVFRSLSLR FT AMSVLWLAVVMMQNCGLVGSPLMIVM" XX SQ Sequence 5766 BP; 1170 A; 1724 C; 1290 G; 1582 T; 0 other; cagagtgcac ttagaaatat aaaatccaaa tttattattc cctctcgtcc gggacgtgct 60 attgctcggc gctgttgaag cccgctcatc cacctctccc tctcgtcgct cccccacaaa 120 aataaaacag ttcgcctgct tccgctgctt cgccactgat caccaaattg ctgcttgccg 180 ggatcccatc cgctgtcgct cttgtcgacg ccagggtcac cgttgtcgtg agtgccccat 240 gccagttaac cataaactaa cccccttgcc ccaccgctgc cctacgcttc tccctcctac 300 gccgcaggtc tctctgtcgc cgcctcgtat ggctgctaca gctcctctaa cctcccaaca 360 ccatgcccca ccaccggctc cctcccgctc ccctacttcg cctcttctcc ctgcattccc 420 tgctttacca ccgcgccacg ccccttccac cacctccgcc catcctcctc ctgttggtgt 480 tacccccatc agttccttgt cggaaactgt cgagtgtgcc tccctgacta gctcggtccg 540 ccagggatcc ctcgaagttt ttctctctct cggccggcac gacgtggccg ctgcattcgc 600 atttgttcgt ataactccgc aattggcagt gtctgccccg gcaccattta ttcgggaggc 660 gatccgtgct gctttcccgc tcatcacctt cgaggttctc ccctcgtcgt tcggatcgtt 720 gctggtgcgc tttcagtcgg caggtgattg tactgctgct gtgagtcgct cccccctatt 780 gcatgaggat gtggcgatct tcttcgagcg ccacgacggg aggagtgtca aacagccggt 840 tgatgtgctt gcggtcttgt ctgttcatgg gttccctctg accttctggc acgaggagtt 900 catccgccag gcgttccaag tattttgctc agtcgtggaa gtcgactggc gttgcacctc 960 gggcctcgac tactcgtctg ttcgggttgt ggtccgcgtg gaggaagtga caagcattcc 1020 agcagacctt tgcattcatg attcttcatc tggggattcc ttcgtttgtc aggtgaagct 1080 ggttgattcg tggccgctgc cggtggatta cggccagcat ggcacgctcc gcccctactt 1140 cggatggcct ccccatcctg tgctgccgcc acctgccccc acgccccacc gcacaacgcc 1200 cctgcagccc ctccgccacc acctatctcg caaccttgcc cgccagctca cagtcgtccc 1260 cctcgccacc ctacagcccg atcagtcgct tgtgacgtgc ctcgcccgca cactactggc 1320 tctggtcctt aaccagcccc accgccttct gctcccctgc ctaccatcct tccccatcat 1380 aatccacaaa aatatcctcc ccctgcatcc tgcacccgcc ttctcagctc cattccctcc 1440 ctttcccctc acatcatgct cgactgttgc gcctgcgatc gctgacgttg aagtgctctc 1500 cccctccccc ctccccccct cctccctgca acctgaccca gaccatgaat cttcagcaag 1560 aaaatctcgc gcacgcacca aacgtactgc tgacctcgca gccaaaaaca agaaaatagc 1620 tccagcttca cgccacagtg tccgtctgcg ggataaggaa cccgcgaact acgtggacat 1680 gtcctctaag gcgaccaatc tgaaagctct gaagaacgag ctggcaaact gctccaataa 1740 tctcaaggca aaggtcaagc gccatggtct gctgcgcaaa tgcactctga atgctgctga 1800 cctgagggcg ctcgctggca ccatagcgtt gccggctctg gctactgctg aactggagca 1860 ggcgctggaa agttccacat gttagaattc aacttctgtg ctatggtgag gctaggctca 1920 catcctccta gcaagttatc cccctgtatt tcctattatg tttgtttctc ctcagactta 1980 atgtatgtcg ctcttttacc tccagttgct cttcacattc ctcttgttat ctggatgttc 2040 tgctggtgtt aatcatgagt tcaaatcgtc ttataactgt catctcctgg aatactcgtg 2100 gcctcggtga ggacaataag tgtgtcgacg tgcgtgatgt tttttcctct tgttgtcccc 2160 atatcgcttg cattcaggaa acgaaactca gtgaaatctc tgctcaaaag tttaaatcat 2220 ttcttcctgc ctccctttct gggcacagtt tcctgccagc tgatggctct cgtgggggga 2280 ttgcaactgc ctggaatgct gctcacttta ccctcgtatc cacttccagc accacacact 2340 ctctctcaac agtcctctcc tataacgctt ctgatcttac tttcacgctc actaacgtgt 2400 atgctcccgc tgatcaccgc tacaccgccg aattccttgg tgaacttcgt tcccttgcca 2460 acttgatctc aggcccctgg ttggtcactg gcgattttaa tcttgttcgt gagccaagtg 2520 acaaaaataa tgacaacttc aatgtctgct tggcgagtgc cttcaacagt gcaattcatg 2580 acatgtcgct cattgagctt cccctccttg atcgcctctt cacttggtcg aacaagcgca 2640 cccttcctgt tcttgcgcgt cttgatcgca ctttcataaa tggtgatttt gaatcagctg 2700 tccccaacac ttccctcact tctctccctc acgctacctc tgaccacact cctcttaaaa 2760 tcaccataga cacttctatt ccaaaaaaac attgctttcg gtttgagaat gcttggcttc 2820 atgatccctc cttcctccct gctatagaac cggtctggtc tgcggtcact aatcttgcag 2880 gtggtgccac aggtgcgttc gttgcaaggc taaagggcac acgtcgtgtc gctaaggcat 2940 ggtcaaaaaa caaacacaca ccattcctta ttaacaattg caagtttatt attaaactac 3000 ttgatctgca tgaggagttg catttcttga ccacaggaga gacagtcctc cgctctcttt 3060 gtcgtgagaa gctcatactt cagatcagag ctcgtgctgc ctactggaaa cagcgtggga 3120 aattcaaggt gcttaaagaa gctgatgaaa atacaagatt ccatcatgcg cgggcttcac 3180 aaaggctacg tcaaaatcag ataagggctc ttgagctgga tggtgtccgc tacacaagtc 3240 atgcaagtaa agcaatggta ctggacaatc atttcactgc cctcctcggt gcagcgatgg 3300 atactgtgtg ggactttgat atggcaactc tgtaccgtgg cctcccaaag gttaatctcg 3360 tgcctctgat tgaaccgttt actgaagctg aggcctgggc agctatcaag gcaatgaacc 3420 ataatagtgc ccccggtccg gatggcttcg gtcctagttt ttacagagct tcctggcata 3480 tagtgaagtc cactgtcatg caattcttag cttccttcca ccatggagat gctgatatgg 3540 aaagaattaa tagagcatac attgtgttga tcccaaaaat tgttggtgca gttgcgcctg 3600 gatcctttcg tccgatttca ctccaaggtt gtccagtaaa gattgtgggt aaaattttga 3660 catctcgcct ccaacagcaa gtcacatcgc ttgttgattt ggaccagact gggttcttga 3720 aaggacgctc aatttctgaa aattttgtct atgctactga gcttgtccaa tgctgttaca 3780 aaaggaaagc tccaactgtt gttctgaaac ttgactttgc caaagctttt gatagtgtaa 3840 gctgggatgc tctactcact gttctacagg cacgtggctt ccctcctctt tggtgtgact 3900 ggattaagct actgcaagtc tctgctaagt cggctgtcct tttgaatggt gttccgggca 3960 gatggatttc ttgcaagaaa ggccttcgtc aaggtgaccc gctgtcaccc tacatgttca 4020 tactagttgc tgatgtctta caacagctgc tggccagaga tagcaccatt cgacaccccc 4080 ttacagctga ccgttcatgc acaattctcc agtacgcaga cgatacactg attgtggcaa 4140 gagctgacga gctggctatg ctgcagctaa agtcgattct tcaaagcttc acccgtgcaa 4200 caggtctgga tataaactac aacaaaagta cgctggttcc aatgcatgtg cctgctcctg 4260 atgtcactcg attcgttaac gttcttgggt gtgcagaagg tgcgttcccg cagacctacc 4320 ttggtctccc tctctctaat gagaaattga accttgcagc ctttgctccc atcattgcta 4380 gcgctgaccg ttatttatct ggatggtggg cgagtcttct taatcaccat ggcagactta 4440 ctcttgttaa tgcagtcttg gacagcttgc ctgtttatgc aatgggggct ctcgccctcc 4500 ctccaggcgt gattgaggcc atcgactcta gacgtcgggc tttcctatgg gccggggaag 4560 aaacagtctc tggtgcgaag tgtcttgtga attgggaacg ggcatgtctc ccgagaaaag 4620 atggcggact gggtgttcgt gatctccgct tgcagaatac ctgtctgcta ctgaagctcc 4680 tccatcgtgc acacaactca cgtgactcag cttgggcccg ctggttagaa gtggaatttg 4740 gaggccccat gtctgctcca gacaacacgg cagctggtac tcattgggct gccctccgac 4800 gccttctgcc tgactatcgc ttacttacca cggtggaggt gggcgatggt cgctctacag 4860 ccttttggca cgactgctgg ttgtctacgg ggcctcttgt ggacgccatg ccggcgctct 4920 actcgcacgc ccgacgcaag gaaacctcgg tctgcaatgt cctgtctaca gcccttcgcc 4980 ttgcatttgt ccctcgcctc tccaccgtgg ctgctagaga attggagcag ctggaggcgc 5040 tccttgctgg tgtatccttg tctgcgactc ctgatctccg gttgtgcccg tgggaagaca 5100 tggcacataa actcaattca tcgactgtct accaagctgt tgtgactaat ggccagggat 5160 gcgagtacta caagtttatc tgggaaaacc gtgctccccc taaagttaag ttctttggct 5220 ggctattggt gcaaaacagg attcaaacca aagataatct tctcaaaaaa cactgcgtgg 5280 agaatgatat ctgtgaactt tgcttgtcgg ccgtggagag ctcagtgcat ctgatttcgg 5340 gctgcccttt tgctgttggc ttctgggcgc gactaggaat atctctcaca gacgaagacg 5400 cgtcacatct ttggtgcgtt cggtccccgg gtcacttgcc cgctgcacac tttaatgttt 5460 ttctcctctt gtgttgctgg cgtctctgga agcaccgcca cgatgttgtt tttcgctctc 5520 tctctctccg agctatgagc gtcttatggc tggctgtcgt gatgatgcag aattgtggtc 5580 ttgtaggctc ccctctaatg atcgttatgt aacccttgca tgggtttcga tattctcttc 5640 gtcaatcccc tctgtaacaa cttaaccgac atgtaactaa ctcatgtatg agcctttatg 5700 ttggcaatga aaaggtgggg tatctcgccc cccccccccc ccccccccga atttctcaaa 5760 aaaaaa 5766 // ID DNA8-10_AP repbase; DNA; INV; 244 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-10_AP. XX NM DNA8-10_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-244 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1752-1752 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 244 BP; 66 A; 46 C; 43 G; 89 T; 0 other; catgacaaaa taaataggta tgctatcaac ttcgtgatac gcaactgtgt gttctcccac 60 cacaaaaact acgtcatccg ttctccgata ttatatacca ttattattat atttattatt 120 attattatta ttatttctgc tggtgagaac ggcttacgtc acagttcttg gcgaagttcg 180 ggaggggtgc gcgttctcga ccaatattag ttatgttgat agcataccta tttattttgt 240 catg 244 // ID Kiri-16_AAe repbase; DNA; INV; 4205 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-16_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4205 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 711-711 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 267..1073 FT /product="Kiri-16_AAe_1p" FT /translation="MTTKRTPPSNINNVVKRPRTGEDGETDGNSLTLDSAV FT RMLMKQFNETKLLIDEMRTEINSKIDAVKNELEGKLTVVSNDINSLKAECA FT SKFQTSNAMINGLTVRVDEISSAMDNLGNRNELILSGIPYLRGEDLMKHFS FT AICMQLGIDERSTPSVDIRRLKSGAMNDGDISLVLVQFALRNLRDDFYSAY FT LRKRDLQLNHLGINSTRRVYVNENLAAVPRKIKVAAVRLKKAGKLASVYTK FT NGTVIVKSTTAAQPIVINSEEQLNQFSN" FT CDS 1209..4064 FT /product="Kiri-16_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPSLNNHTLTNIPGAVMNAVLLPGKLNVCHGNAQSIC FT ARRSTKMEEVSNMLLNSVVSIACFTESWLSPKIADRSIAIPGFKIARNDRL FT YMRGGGIVIYYKEHLRCHKVFETVLQRDSEDKTECLALVFHFGTETLLLMA FT VYNPPDNDCSRFLADKLSDLSAHYDTVLLIGDFNTDLSSRSNKRACFEAVL FT QSFAMTSIGEEPTFFHNNGCSQLDLFITSCNQKVLRFNQVSFPALSQHDLL FT FSSLDFDATPIPKIKIYRDYVNFNAPALEVAIQLVPWDNMHSIDDPDELLR FT FFNDHLKRIHDHYIPLRTSCGRRQNNTWFDYDIRKAILERDLAYRDWLNAP FT SDSKTQVKRRYKLLRNRTNTMIKQKKALHLSGFLDSRLPAKTLWQRIKLVG FT AAGKENASADCEFDPNDVNRMFLASYISNTPSDTSTFLSQDHPQSFSFRPV FT YDWEIVNAVWDIKSNATGLDGLPIRFIKIVLPLILQQVTYIFNKFIESSCF FT PSCWKHSKILPLRKKPHLNTLENLRPISILCALSKVFEKLMAQQISSFIAE FT NHLLSDCQAGFRQGQSIKTATLRVYDDLAATIDKRGSAILLLLDFSKAFDT FT ISHRKMCKKLERQFNFSHSAVRLVESYLNGRTQAVYCGDQVSEIGVVTSGV FT PQGSVIGPLLFCCYINDLPTVLRWCSIQIYADDVQLYIKRLGPCSRELIRM FT INEDLVRVAEWSDRNELRVNHSKSKALFIKGRRRNTILTDSLPTITIKGQP FT IEWTDKSMNLGFVFQSDLEWDGLVNQQCGKIYGCLRTLYSTTSSARTDVRL FT KLFKALILPHFLFGDALHVTPSAYCMDRLRVALNSCVRYVYGLNRYSRVSH FT LQKNLLGCSIQHFYAYRSCIFLWSLAKSQSPIALYQKLIPSQSRRSKNFIV FT PLNNTTSYASTMFVRGVVNWNSMPPVIKHSTSEANFKRGCLDFWNCRQ" XX SQ Sequence 4205 BP; 1218 A; 940 C; 894 G; 1152 T; 1 other; acaccggcta gcagaaatgt gtagctaata cgagcgcgca ttaaacaacc ctgaaagtgg 60 tgttatccgt cgcgtttttt tcccgaagtt atatccgtag accctgttga ttgtccgtac 120 gsgttatcct gctgatgtgt gtccggattg gttcagaata accgaaacaa aattccgtca 180 aatcctcacg cggtgttgat atcgttcact attgtttgcg atttactcat ctagtcccct 240 tcgttacaca tacatacgga gcaacaatga cgaccaaacg cactccgccg tcaaacatca 300 acaacgttgt gaagcgtcca agaaccggcg aggatggcga aaccgatggg aactcgctca 360 cactggactc tgctgttcgc atgctgatga agcaattcaa tgaaacaaaa ttgctgatcg 420 atgaaatgcg caccgaaata aattctaaaa ttgacgcagt gaaaaatgag cttgagggga 480 aactgactgt tgtttcgaac gatatcaact cactaaaagc agagtgtgca tcaaaatttc 540 aaaccagcaa tgccatgatt aatggtttga ccgttagggt tgacgaaata tcctcggcaa 600 tggataatct gggtaacagg aacgagctca ttctcagtgg aatcccgtat ctgcgaggtg 660 aggacctgat gaagcacttc tcggccatat gtatgcagct tggaatagac gaaagatcga 720 caccatcggt agacattagg cgtctcaaat ccggagcaat gaacgacgga gacattagcc 780 tggtgttggt acaatttgca ctgcgaaacc ttcgagatga cttctacagt gcataccttc 840 ggaaacgtga tcttcagctg aatcatctgg ggataaactc tacgcgacga gtgtacgtga 900 acgaaaattt ggcggcagtc ccgcgtaaga taaaagtcgc agctgtgcgt ctgaaaaaag 960 cgggtaaact ggcctctgtg tataccaaga acggaactgt catcgtgaag tctactaccg 1020 cggctcagcc catcgttatc aattccgagg aacaactgaa tcaattttcg aactaagtat 1080 tttaatctgt attccactat tagagtttgt tttatgttgt attattgtga agattgtgta 1140 atgttgataa ttaaaaaaag tgttctcttg cagcccccta aaaattgctt cctcctacgt 1200 ttttctctat gccctcgttg aacaaccaca ccctgacgaa catccctggt gcagtgatga 1260 atgcggtact tctcccggga aaattgaacg tgtgtcatgg taatgcgcag agtatctgcg 1320 ccagaagatc gaccaaaatg gaagaagtga gcaatatgct actaaattct gtagtcagca 1380 tagcatgttt taccgagtct tggttgtctc caaagattgc cgatcggagt attgccattc 1440 ctggattcaa aatagcacgt aatgatcgac tgtatatgcg tggcggggga atcgtaatct 1500 attacaagga acatctacga tgccataaag tctttgaaac cgtattacaa cgcgattctg 1560 aagataaaac agaatgcttg gcgctagttt tccatttcgg aactgaaaca ctgctgctca 1620 tggccgttta caatcctcca gacaacgact gttcacgttt cctcgccgac aaactgtctg 1680 acctctctgc tcattacgac accgttttgc tcattgggga ttttaacaca gatttatcat 1740 ctcggagcaa taaacgagca tgtttcgaag cagtacttca gagctttgct atgacgtcga 1800 ttggtgagga accaactttc tttcataaca atggatgctc gcagctagat ttgttcatta 1860 ccagctgcaa tcagaaagtt ttacgtttca atcaggtaag ctttcctgca ctttcgcaac 1920 acgatctgct cttcagctct ctggattttg atgctacgcc cataccgaaa attaagatct 1980 atcgtgatta cgtgaacttc aatgcaccag ctttggaggt cgccattcaa ttagttccct 2040 gggataacat gcattccatt gacgatcctg atgagctact tcgattcttc aacgatcatc 2100 tcaaacggat tcacgaccac tatattccac ttcgcaccag ctgtggtcgc aggcagaaca 2160 atacatggtt tgattacgac attaggaagg ccattctgga acgtgatttg gcgtatcgag 2220 attggttaaa tgcaccgtcg gactcaaaaa cacaagtgaa acgccgatac aaattactaa 2280 gaaaccgaac caacacgatg attaaacaga aaaaagcact tcaccttagt ggcttcctgg 2340 acagcaggct accagccaaa actctctggc agcgcatcaa attggtaggt gcagctggta 2400 aggaaaacgc ttcagcggat tgcgagttcg atccaaatga cgttaatcgc atgtttttgg 2460 caagctacat cagcaataca cccagtgata cttcaacttt cctatcacag gatcaccccc 2520 agagtttctc cttccgtcct gtgtatgact gggagatagt gaatgcagta tgggatatca 2580 agtcaaatgc aacgggtctc gacggtttgc caatacgatt cataaaaatc gtgcttcctt 2640 tgattttaca gcaagtcacc tatatcttca acaagtttat cgaatcgtca tgttttcctt 2700 cttgttggaa gcattcaaag atactgccat tacggaaaaa gcctcacttg aacacgcttg 2760 aaaatctcag gccaattagc atcttatgcg cattatcaaa agtttttgaa aaacttatgg 2820 cgcaacaaat atcgtctttc atcgcagaga atcatcttct gtcagattgt caagctggct 2880 ttcgccaggg ccaaagtatt aaaactgcaa ctctacgcgt ttatgatgat ttggccgcca 2940 ctattgataa gagaggatct gctattctgc tactgctcga tttttccaaa gcgttcgata 3000 caatctcaca tcgcaagatg tgcaagaaac tagaaaggca attcaacttt tctcacagtg 3060 ccgtgcgttt ggttgaatcg tatctaaatg gaagaactca agctgtttat tgtggagatc 3120 aagtttccga aattggggtg gtaacttcgg gtgtgccgca aggttctgta atcggtccac 3180 tgttgttctg ctgctacatc aacgatctcc caacggtttt aaggtggtgc tcgatacaaa 3240 tctatgctga cgatgtccaa ctctacataa aacggttagg accatgttca cgtgaattga 3300 taaggatgat taacgaagat ctagttagag tcgctgaatg gtccgatcgt aacgagctcc 3360 gtgtgaatca ttcgaaaagc aaagcactgt ttatcaaggg tcgtcgtcgc aacactattc 3420 tcaccgattc attacctact ataacaatta agggacagcc tattgaatgg acggataaat 3480 cgatgaatct tggttttgtg ttccagtctg atctggaatg ggatggcctc gtcaaccaac 3540 aatgtggtaa aatctacggt tgtctgcgca cgctctacag taccacctca tctgcacgca 3600 cggacgtgcg actcaaacta ttcaaagcgc ttatactgcc acatttctta tttggagatg 3660 ctttgcatgt caccccaagt gcatactgca tggacagact acgagttgca ctcaacagct 3720 gtgtacgata tgtttacggc ctcaaccgtt atagcagagt gagtcatcta cagaaaaact 3780 tgcttggatg ttcaatacag cacttttacg cataccgatc ctgcattttc ttgtggagtc 3840 tagctaaatc tcaatcccca atagcgctgt atcagaaact aatcccatct caaagccgcc 3900 gatcgaagaa tttcatcgtt ccgttgaaca acacgactag ttacgcaagt acaatgttcg 3960 tcagaggtgt ggtgaattgg aattccatgc ctcccgtgat caaacactca acatcggaag 4020 caaattttaa gaggggctgc cttgattttt ggaattgtag acagtaagtt aagaaaagat 4080 aattttaata atgttagtag acataagtat tgtatgtaat ggaacgtttt acgacacgga 4140 ggtagtattt gaaaagggct tcccttacgc tgcctaatga ataaacaaac aaacaaacaa 4200 acaaa 4205 // ID hAT-83_HMa repbase; DNA; INV; 3972 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-83_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3972 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 796-796 (2010). XX DR [1] (Consensus) XX CC >96% identical to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 875..3253 FT /product="hAT-83_HMa_1p" FT /translation="MNKIEKKAYFKEEWLNKDIYPHFDSWLMKGKNNTQAR FT CKLCNKIIELSNMGIQAIKSHEKGKKHVSVASNFSCFFKSSGSKNTNSPNS FT VEESNLKTTDSFSTGQMKQQSLELIVSNSNKICAEIMWALHCCLNRISNNS FT NKNVTNLFQAMFPDSEIAKSFQMGPNKVGYSITHGLGPYFKGLLKQQMSLS FT PWLVVSFDESLNKKTQTCQMDLLIRYWNEEKMQVEVRYWDSSFMGHSTSLD FT LVNHFNEKISDINISKIVQVSMDGPSVNLKFHRDIQSKREEHELSKLIDIG FT CCSLHIIHGAFKTGVESTDWELKKTFKSCFTLFHDSPARRSDFISITEGSV FT FPLSFCATRWVEDKKVADRLISIWPSIIKIVNYWESLPKSKRPSCKSYKFV FT VNAVKKELSLVKFQFFSYLASMFEPFLKLYQTDAPMLPHMNGDLVQLIKSI FT LRMFIKSETIEATSKLTKIDLQNKENHLKPSEIDIGFAAELSLSKLRKNDA FT VILKDMKEFKIQCMHILISAAEKLFKRCPLDSVIVNTSTCLLPNFYAQITP FT EKNRNQMKILLHHLISLGILTASYSDKVISQFSKFISSEFAXNQDVFISYD FT RNNKRLDDFYFKNIDIKKYPELSSVIKLVLTLSHGQASVERGFSVNKDVVT FT DNISTDGIVGRRLVRDFMLTNNLNPHSVQITSEMKVAFKSAHQKYQIMLEE FT EKSKNKKHAMLDQKAIILSEIEELKSKMGFLTKTSLMLEKEFVLTVEQAEK FT ENNLSLVSKANALKRKSEEKMKDVAKLQETLLILEEKRKKLN" XX SQ Sequence 3972 BP; 1418 A; 556 C; 625 G; 1372 T; 1 other; cagggttcgt actgtacctg gaaaacctgg aaaatcagcc aaatttggat aaagtcatgg 60 aaaacctgga aaagtcaggg aaattttttt ttttctaggt agtcagggaa aagtcaggga 120 attctgattg agaaactact cgttttacaa ataagctatt atgtttctgt tggtgcttcc 180 catttatttt gacaattaaa aaatacccta cagtgtttca caggaaggca tgtgatggca 240 cacctttacc acgcatataa tattttattt ttaaaaattt gaccacggct ttttaacttt 300 tttaaaactt gaatattttt aaatgtattt ttaataggca aaacgctctt attactaaca 360 attgttttaa aatattaatc aataaaaata ttttatttaa attgtctatg gttattattt 420 tatttttaca ataatagtgt atgtttcttt tttttattta ttataaaaaa aataattcct 480 tttttttatt tattataaaa aaactaattt tttatgatca atttcaatcg tttaaaacta 540 tttctgtaat ttattctata aacaacagca tctttaactt taaacatttt tgtgtcttta 600 aagcgaaaaa cttctttttt tattttaagt tacataagtt acagaaagtt tcaaagattt 660 tcaactttgt tgaaaacttt tgaaagtttt tattaaaaaa gtattggttt ctataagcat 720 tacaatgact attagattaa aatccttgtg aaaatgaatt ttgaaaatta aaaaaaggtt 780 tttaaagtta atacaatgaa atttgttaca gtgccctatg ttataaatat aaatcatttt 840 gaaatatttt gtttccattt ttagaattaa aataatgaat aaaatagaaa aaaaagctta 900 ttttaaagaa gaatggttaa ataaagacat ttaccctcat tttgactctt ggctaatgaa 960 agggaaaaat aacacccagg caagatgcaa attatgcaat aaaataattg aactttcaaa 1020 catgggtatt caagcaataa aaagtcacga gaaaggaaaa aaacatgttt cagttgcaag 1080 caatttctct tgttttttta aatcatctgg ctctaaaaat actaattcac ctaattcagt 1140 agaagagtct aatttaaaga caactgatag tttctcaaca ggacaaatga agcagcaatc 1200 tttagaactc atcgtatcaa attctaataa aatctgtgca gaaattatgt gggcattgca 1260 ttgttgctta aatagaattt ctaacaattc aaacaaaaat gtgacaaatt tgtttcaagc 1320 aatgtttcct gacagtgaga ttgctaaatc ttttcaaatg ggaccaaata aagttggata 1380 tagcatcact catggtcttg ggccttattt taaaggttta cttaaacaac aaatgagcct 1440 gtctccttgg cttgtcgtat cctttgatga atccctcaac aaaaaaactc aaacttgtca 1500 aatggattta ctcatccggt attggaatga ggaaaaaatg caagttgaag tgaggtattg 1560 ggattcctca tttatgggac atagtactag tcttgatttg gtaaatcatt ttaatgaaaa 1620 gataagtgat attaatatca gcaaaattgt tcaggtctct atggatggac ccagtgtcaa 1680 cttgaaattt cacagagaca ttcaaagcaa acgtgaagag cacgaattat ctaaacttat 1740 tgacataggc tgctgttccc tacacataat tcatggtgct ttcaaaacag gagtagaatc 1800 aactgattgg gaacttaaaa agacatttaa aagctgcttt accttatttc atgattctcc 1860 tgccagaaga agtgacttta tcagtatcac tgaaggttct gtgttcccgc tttcattttg 1920 cgcaaccaga tgggttgaag acaaaaaggt agcggataga ttaattagta tttggccatc 1980 tattataaaa attgttaatt attgggagtc tctgcctaaa agtaaacgac catcttgcaa 2040 atcatataag tttgttgtaa atgctgttaa aaaagaactt tcccttgtaa aattccagtt 2100 ttttagctac ctggcgagca tgtttgaacc gtttctgaag ctctatcaga cagacgctcc 2160 aatgttgcct cacatgaatg gtgatcttgt gcagttgatt aagagtattc ttaggatgtt 2220 cattaaatct gaaactattg aggctacctc aaaacttacc aaaattgatc tgcaaaataa 2280 agagaaccat ttgaaaccca gtgaaattga tattgggttt gctgctgaat tatcactgtc 2340 aaagctcaga aaaaatgatg cagttatttt gaaggatatg aaagaattta aaattcaatg 2400 catgcacatt ttgatttcag ctgctgaaaa gttatttaaa agatgtcccc ttgattccgt 2460 gattgttaac acgagtactt gtttgcttcc aaatttttat gctcagatca cacctgaaaa 2520 aaataggaat caaatgaaaa tattgctgca tcatcttatt tcacttggaa tattaactgc 2580 ttcttattct gacaaagtta tcagccagtt ctcaaagttt atttcatcag aatttgctmt 2640 gaatcaggat gttttcatta gttatgatcg aaataataaa cgtcttgatg atttttactt 2700 caaaaacatt gacataaaaa agtatccaga gttgtcatct gtcattaagt tggttttgac 2760 tctgagtcat ggacaggcat cagttgaacg ggggtttagt gtcaataagg atgtggttac 2820 cgataatata tcaacagatg gtattgttgg acggcgtctt gtgcgtgatt tcatgcttac 2880 caacaacctc aacccccact ctgtccagat tacatcagaa atgaaagttg cctttaaatc 2940 tgcacatcaa aaataccaga tcatgcttga ggaagagaag tcaaagaata aaaaacatgc 3000 aatgttagat cagaaagcca ttattttgtc tgaaattgaa gagctaaaat ctaaaatggg 3060 tttccttaca aaaacttcat tgatgctaga aaaagagttt gtcttaactg tggaacaagc 3120 ggaaaaagaa aacaatttat cattggtcag caaagcgaat gctttaaaaa ggaaaagcga 3180 ggagaaaatg aaagacgtag ctaagttgca ggaaacatta ttgatattag aagagaagag 3240 aaagaaactg aattaacaat agttgctttt tgacaatttg tttgtttata taaaatatta 3300 tttatgttga catttatgca aatatggcta attaaaaaac tttaattata ttttcaatat 3360 aagattttgc ttaaaaaata tatgcaattt aaaattaatt aaaagttttt tttttaatgt 3420 ttctaagcat gttttgttat actgtgctat ttttatctaa aggtattatt gaattagcga 3480 aggtacagct acaaatagtg ataaaaaaca actaacccaa agtagtctct gttatagacc 3540 aaatattgct caaaatgttg aaagatatct acatagacat tttagtttat ttttgtgtta 3600 gtgaaaaata attatatata aaatatttta tgtgcataat atttagattt tcttgaagat 3660 catctgaaac taaattttat gtgcataata tttagatttt ttaagaagct cacctgcttg 3720 tttattctta gtcatcctta ataatatgca taataaaatc tatactaaca tccaacttaa 3780 ctaatagcta agttttataa gttttgtcat tctaagcatg tacattttag cattttcatt 3840 acagtagtaa tttttaacat tttataaaaa ccattgcaaa tgttaaaaat ggtcagggaa 3900 attcatctga tttttcaaaa aagtcaggga aaagtcaggg aaatttttat aaatatgtgg 3960 ctacgaaccc tg 3972 // ID MARINER_HC repbase; DNA; INV; 1255 BP. XX AC M63844; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE H.cecropia mariner-like repetitive element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; HCMARINER; KW highly-repetitive sequence; MARINER_HC; mariner-like element. XX OS Hyalophora cecropia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Saturniidae; Saturniinae; Attacini; Hyalophora. XX RN [1] RP 1-1255 RA Lidholm A.D., Gudmundsson H.G. and Boman G.H.; RT "A highly repetitive, mariner-like element in the genome of RT Hyalophora cecropia."; RL J. Biol. Chem 266(18), 11518-11521 (1991). XX DR GenBank; M63844; Positions 1 1255. XX SQ Sequence 1255 BP; 405 A; 262 C; 234 G; 354 T; 0 other; ttaatattag gtccttacat atgaaattag cgttttgtcg tactgaccac tttgatctga 60 aatatctcct ctttggttaa gaattccaat tccaattcat ttagctggta catttgagat 120 ttgaaaacag tcattcattt gttttttcct cattattttt tccttaggca tcgtttatta 180 ccgcacaatg gcaaacatga aatatcggta tatttacgag tacgagttct accgtggcac 240 cagtgctgca gaaacagctc gaaggattaa taatgtatac ggtgctggtg ccgcaaaaga 300 aagcaaggta cgtttttggt ttcaacgttt tcgttctgga attttcgacc ttcagaacca 360 accccgtgga cggccggaga tcaaagtgga aaatgaagaa taaaggctat tgtgtaagcg 420 gatccatcac aaagcacttc agagatagct gcaggcttcg gggtaagtga taaaactgta 480 ttaatctact tgaagcaaat cggaaaagta aaaaaacttg aatgatgggt acctcatgaa 540 ttgagtgaat cgaacttgca aacacgcgtc gactgctatg ttactttgct caaccgacac 600 aataatgaac gaaaaatgta cgataatcgg aagggctcgt tgcaatggct gaaccctgga 660 gacccagcca aatcctgccc taaacgataa ttgactcaga aaaagttact tgtgagtgtt 720 ttggtggact agcgccggtg tcattcacta cagctttcta aaatgtggcc aaacgattac 780 agtagatatc tattatcagc aactgcaagc catgaaggaa gaactagctg ctaaacatcc 840 gagattggtc aatcgctcta ggtcactgct gcttcacgac aacgcaagac cacacactgc 900 taaacaaaca accactaagt taaataagct acaattggaa tgtctgcgac atccaccgta 960 ctccccggac cttgctccaa tagattacca ttttttccga aatttggaca acttcttaca 1020 tggaaaaaaa ttcaactcct attcggtagt ccaaaccgcc ttcaaagagt ttattgatcg 1080 tcgcccccat gcttttttta ataaagggat caatgaacta cctgtaagat ggcaaaagtg 1140 cataaataac aacggtgcat actttgataa attaaatata ttttacaaaa aaataattga 1200 cttcttattc ctcccttaca aaacgccaat ttcatatgta aggacccaat attaa 1255 // ID CR1-1_DWil repbase; DNA; INV; 4718 BP. XX AC . XX DT 23-FEB-2007 (Rel. 12.02, Created) DT 23-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of CR1 non-LTR retrotransposons - a consensus sequence. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_DWil. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4718 RA Kapitonov V.V. and Jurka J.; RT "CR1-1_DWil, a family of young CR1 retrotransposons from the RT Drosophila willistoni genome."; RL Repbase Reports 7(2), 110-110 (2007). XX DR [1] (Consensus) XX CC This is a very young family of CR1 non-LTR retrotransposons. CC CR1-1_DWil copies are less than 1% divergent from the consensus CC sequence. XX FH Key Location/Qualifiers FT CDS 641..1648 FT /product="CR1-1_DWil_1p" FT /translation="MVVCSKNNCIVATNAVDSHDYILCWLCDNAVHVKCAG FT YTGRIKESIAKGGGLKYGCDACREVENEMRSFMRRTRDSFDTLRAGFNKLQ FT AEFTAVETQFRSLSLMNESPKRKRTSPWGVSVSVDPPASLIVPDRSHAPST FT PNVQQLISFKSPVVNAISNELPVSPGNDLLKTMPIVVSDVSGQLPIPMEIA FT DSSKSIEGPVNKLTAAVGDLSNVTAAADASGPRPLVGIPPKKHIFVSRLDP FT VVTSDDVIAYIRKKVNVSNIAVEKFKFSYSRDISSFKISVSANIFDTICRE FT DFWPKHIIVKEYTVKKKNRQPIRLPTTTTNTIPATSASAGVSKN" FT CDS 1606..4482 FT /product="CR1-1_DWil_2p" FT /translation="YYSCHISICWCVKKLASSLNLGYQNVRGLKSKLPNLY FT VDSLSFEHNILAFTETWLKPDIADASVFSTNFSIFRRDRISRIGGGVLIAV FT DAALSSEMIQFSSTQEIEFVGVKVNFKSFNAFITCSYIPPLSDMVVYLNHL FT EAIQFVSSLVSSQDMLIVVGDFNLPALAWSLTDESTTLLPSLSHDFIDGLL FT GLSLSQVNPVLNSNGKSLDLIFVLDSSYCEVSRIEPFVLPEDKFHPTLNLK FT IDLPLLSPLLGSNELPLTRCFRKTDFASLNENIATTDWSYLYNCSDMNSAV FT RFFYDTLNSLFDICVPLGRLERLNKPPWFSRELSNLKNVKTRYYKKFQKTR FT RSSDYALYTVARSKFMMLNTQCYKNYLQRCKLQFSSDPKQFYNFVNSKRKT FT SVHPSFLSFQNKKASTDQAIADMFASFFKTTYSSTEYRASPYPYHLKKANC FT IFNPVIDESSILTELKLVKPVYSPGPDGIPGCILRFCANALCKPILKLFLL FT SVESSSFPSIWKESYIIPLHKNGKKSEASNYRGISKLSAIPKAFEKIITSQ FT LQHFCSSIISPSQHGFVKRRSTTTNLLEFTSIIIKGFKNGKQTDVIYTDFS FT KAFDSVNHTLLLSKLSDIGFPPLFLSWVREYLTNRKQKVLFKSSFSVSISV FT TSGVPQGSHLGPLLFTLFINDLPSVIVHSRVLMYADDVKLCLTYKDTDSFS FT RLQADLKNFQSWCQFNLLNLNTTKCKVMTFFRNSPQLVTYILNNSPLERLN FT NMNDLGVLMDHKLNFNTHISTTVAKAMSVLGFIKRWSKEFDDPYTTKVLFT FT SLVRPILEYGSCIWSPQYESHQIRLESVQKQFLLFALRGLSWDRNVNLPSY FT SSRLLLINLPSLTNRRIMLGVIFMHKLLIGDIDSPELLAQVNLSVPCRRSR FT HPLIPLSLSRCSSNYAMHEPFRVLCSDYNLLSPVIGSEFSINMLKTSILSH FT LVNR" XX SQ Sequence 4718 BP; 1254 A; 951 C; 846 G; 1667 T; 0 other; aaaaactttc actcgttgca gaggacggac gtgttttttt tgcgcttatc aacttttatt 60 ttttctcgcg attaaatact tttatataac tcttaaatta tcggtttaat taataacaat 120 tacaacatcc gaattttacg attaaatact tttatataac tcttaaatta tcggtttaat 180 taataacaat tacaacatcc gaattttatt ttcatttcgc gaattcttgc tgtcgttttt 240 gtttaaattt aaataaaatc aaaacttaca acaatacttg cattagcggt gttgttgttt 300 tgtttatctc ttttgctttt gtgtctttac cgttttaact aactctcgcg ctcttgcgta 360 caaattgttg ttgcatgtgt tcaatttgac gcgctctccc gctctttcct attcttgatt 420 gatttgcgct ctcgtctttt gcctacccgc gactctgtga tacgtgtttt tgtttttgtg 480 tttttcattt ctttttattc tttagttgtg aaggcttact ctgcacatta acccttgtgt 540 cagaactggt cagtatattc ttatacacag atttttaacc tttagaattt ttattttttt 600 ttgtatttcc tttttcaatt aaattaaact ttttgataaa atggttgtct gctcaaaaaa 660 taattgcata gttgccacta atgcggttga ctcccatgat tatattcttt gttggttgtg 720 tgacaatgcg gttcacgtca agtgtgcagg ttacacaggc agaatcaagg agtctattgc 780 taaaggtggt ggcttaaaat atggatgtga tgcttgccgg gaggtcgaga atgagatgcg 840 ctctttcatg aggcggacta gagatagttt tgacacttta agggcaggtt ttaataaact 900 ccaagcggag tttactgcgg tggagactca gttcagaagt ttatcgctta tgaatgagtc 960 tccgaaacgt aaaaggacca gtccttgggg tgtttctgta tctgtagatc cgccagcgtc 1020 tcttatcgtc cccgaccggt ctcatgcacc gtccactcct aatgtacagc aattgatatc 1080 gtttaagagc ccggttgtga atgctattag taatgaatta ccggtatcac ctgggaatga 1140 tctcttaaaa acgatgccta ttgtagtgtc cgatgtgtct gggcaattac ccattccaat 1200 ggaaattgcc gatagttcta aaagtatcga gggccccgtg aacaagttga ctgctgcagt 1260 gggtgacttg tccaacgtta ctgctgcggc tgacgcttcg ggaccgcggc ctctcgttgg 1320 cataccacca aagaaacata tatttgtttc taggctagac cctgttgtca catctgatga 1380 tgtaatagcc tatattcgga agaaagttaa cgtctcgaat attgcggtgg agaagtttaa 1440 attttcttat tctcgggaca tttcgtcctt taaaattagt gtttctgcca atatcttcga 1500 cactatatgt cgagaagatt tttggcctaa acatataata gttaaggaat atactgtaaa 1560 aaagaaaaat cggcaaccga ttcgcttgcc tacaacaaca actaatacta ttcctgccac 1620 atcagcatct gctggtgtgt caaaaaacta gcttcgtccc ttaacctggg ttaccagaac 1680 gttagaggac tgaaatctaa acttcctaat ctctatgtag atagtctttc ttttgagcat 1740 aacattctag ctttcacaga gacgtggcta aaacctgata ttgctgatgc atcggttttt 1800 tccacaaact tttctatttt cagacgtgac cggatttctc gcataggtgg tggcgttctg 1860 atagctgtgg atgctgcttt atcatctgaa atgattcaat tttctagtac ccaggagatt 1920 gagtttgtcg gtgttaaagt aaattttaaa tctttcaatg ctttcattac ttgttcctat 1980 attccaccat tgtcagacat ggtggtatat ttaaatcatt tagaggcaat tcagtttgtc 2040 tcctctttag tttcgagcca agacatgctc atagttgtag gtgattttaa tttacccgct 2100 ttagcatggt cactaacaga tgaatctacc acgttgctgc cctctttgtc acatgacttt 2160 attgatggcc ttctaggctt atctttatcc caagtcaatc ctgtcttaaa ttctaatgga 2220 aaatcgttag atcttatttt tgtattggat tcttcttatt gtgaggtttc taggatcgaa 2280 ccatttgttc ttccagaaga caaatttcat cctacattaa atttaaaaat tgatttgccc 2340 ttgctttcac cattacttgg ttcaaatgag ttaccactaa ctaggtgctt tcgtaagact 2400 gactttgcca gccttaatga aaacattgct actaccgatt ggtcttatct gtacaattgt 2460 agtgatatga actcggctgt tcgttttttc tatgatactc tgaattcact ctttgatatt 2520 tgtgtgcctc ttggtaggct ggagcggctt aacaaacctc cctggttctc tcgcgagcta 2580 tccaatttga aaaatgtgaa gacacgatat tataagaagt tccaaaaaac acgtcgttca 2640 tcagattatg ctctgtatac agtcgctcgt tcaaaattca tgatgcttaa tacacaatgc 2700 tataagaatt atcttcagcg gtgtaagctt cagttttctt ctgaccctaa acaattttat 2760 aattttgtta attctaaacg taagacctct gtgcacccat cttttttgtc ttttcaaaat 2820 aaaaaagcga gcactgatca ggcaattgcc gatatgtttg cgagtttttt caaaacaact 2880 tattcctcaa cggaatatcg agcaagtccg tatccttatc atttaaaaaa ggctaattgc 2940 attttcaatc cagtgattga tgaaagttct attcttaccg aacttaaatt agttaaacct 3000 gtttattctc cggggccaga cggtattcct ggatgtatac tcaggttctg tgcaaacgca 3060 ttatgtaaac ccattttaaa attgtttcta ttgtctgtag aatcttcctc ttttccatct 3120 atttggaagg agtcgtatat tattcctctg cataaaaatg gcaaaaagtc cgaagcctct 3180 aactataggg gtatctcaaa attgtccgct attccgaaag cttttgagaa aattataact 3240 tctcaattgc aacatttttg cagctctatt atttcgccat cccaacatgg gtttgtgaaa 3300 cgtagatcta caaccactaa ccttttagaa tttacatcaa taattatcaa ggggttcaaa 3360 aatggtaaac aaactgatgt catatacact gacttcagca aagcctttga ttccgttaat 3420 cataccctat tactttctaa gctgagcgac attgggtttc cacccctttt cctttcttgg 3480 gttcgagaat atttaacaaa tagaaaacag aaggtacttt ttaagtcttc tttttccgtt 3540 tccatttctg tgacttctgg tgtacctcaa ggtagtcatc ttggccctct gctcttcaca 3600 ctatttatta acgatcttcc atcagtcatt gttcattcgc gtgtgcttat gtatgctgac 3660 gatgtcaagc tttgtcttac ttataaagat acagattcat ttagtcggct acaagctgat 3720 cttaaaaact ttcaatcctg gtgccagttt aatctactaa atcttaacac taccaaatgt 3780 aaggtaatga ctttttttcg taactctcct caactggtca cttatattct aaacaacagc 3840 cctttagaac gtctaaataa tatgaatgat ttgggcgtac ttatggacca taagttaaat 3900 tttaacaccc atatttccac cacggtcgca aaggccatga gtgtcctggg tttcattaaa 3960 agatggtcaa aagaatttga tgacccctat acaactaaag ttctttttac ttcgcttgtc 4020 cgtcctattt tagagtatgg atcttgcatt tggtcacctc aatatgagtc gcaccagatt 4080 agactagaat ccgttcaaaa gcagttcttg ctatttgctc ttcgtggctt aagctgggat 4140 cgcaatgtca atttgccttc gtattctagt agacttctct tgattaacct tccgtcccta 4200 actaaccgta gaattatgct tggtgtcatt tttatgcaca agctcctaat tggtgatatc 4260 gactcgcctg agttattagc tcaggtgaat ctgtcggtac catgtagacg gagtagacat 4320 cctttaatac ctttatccct tagtcgttgt tcttctaact atgctatgca tgaacctttt 4380 agggtcctct gctctgatta caatcttcta tcccctgtaa tcggctcaga gttttctatt 4440 aatatgctta aaacatctat cctatcccat ttagttaata gatagctata gtattatgtg 4500 tttgtctgtc ttttctattg tgtattttgt ccacgcgatt cgtgccgtgc gttatacggc 4560 tgcacccctc ggtcggtcgg gtgggaggtg ggcagttatc tgcttgggct cgcgcgtaac 4620 aggctttgtc ctggtgtcgt aggggccact tgaacgtact gcgcatagta tcgtcaacgt 4680 ccgctaataa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 4718 // ID Crack-18_AAe repbase; DNA; INV; 4135 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-18_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4135 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1234-1234 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >95% CC identity. CC Closely related to Crack elements in Culex pipiens (Crack-1_CP CC to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 60..860 FT /product="Crack-18_AAe_1p" FT /translation="MMQNNNNSMVSSLASEMRATISASVAKEMRTVTTEVR FT QIIYAIEKSQEFLSXKFDDIVIEFNRLKCENDRLVAEIEKLKKSQSDLREM FT VYTLECNADKADKVALDNNAILWGVPTATEENVPRLVEKLLVSVGLQGNSD FT HIKSAERLFSNGRXXTMVPIRIVFNNKKSKEMVFNKKKQYGKLYSTVIDDK FT LKVNGKATSVTLRDELSPLSLDLLKKLRESQEIMGVRYVWAGRGGTILVKR FT DNDSKPELVKNRVDLDRVMCHFVRSG" FT CDS 911..3808 FT /product="Crack-18_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDLTTIMIQNKYHESLFDFNQKCENVVENSFNVVQWN FT VRGINDFEKFDELVYFVTECRMNIDVVVVGETWIKKENSNIYNLPGYRAVF FT SCRESSSGGLAVFIKNSIDYKLINIKSIEGLHYVHLEVKPNGQFYNVIGIY FT RPPSFDYAEFQNLMEGWLTHASPSKPILIVGDVNVPINLQHNNVVTRYKNL FT LESYGYVCTNTFATRPASNNILDHFICPIDLAAKLQNHTIFNDLSDHLPIF FT SSIPLSNEKKQQELSMSIVDRATLQHKFSLFLTDFQVTEDVEASLKSLIET FT HNRLLTECTRSVTKTVRVKNDHCPWMTYSLWRLMQIKNNYLAKSKKDPNNA FT QVKEMLTHVSKKVKAAKTRCKREYYENILKSSNHSNVWKKMNQIFGRSKVR FT EQIKLKIDGETVHSDRDVVEIFNTYFANIGHNLANKLPKNNFDVLKHVKTV FT NNSIFLRPTNVSEVSILISHLDVKKSRGHDNISADLIKANMGPFSEIIAKL FT FNKIIATGVYPNVLKIAKVTPVFKTGDAYDPSNYRPISTLTVLNKIFEKLL FT TSRLVNFLTANNVFYKFQYGFREGCGTSTAIIELLDDLYCEIDRKKIVGGL FT FIDLKKAFDTIDHGILLKKLDRYGIRGVANDLIKSYLSDRKQYVALNGTRS FT SLCSIDIGVPQGSNIGPLLFLMFVNDLGNLKLRGTPRLFADDTALFYPHSN FT ISSIVEDIESDLVSLLHYFNGNHLSLNLTKTKYMIFHSSRKKILQHSDPYV FT ENLQIEKVFSFKYLGLYLDSTLSWDAHIKHVSNKVSSLCGIMKRVRSFVPN FT EALLRFYYACIHSIIQYLIIVWGHAAKSKLKKVQTLQNRCVKVIFNLPPLY FT STVSLYTRLQHTIIPILGLRDTQTIMFVHNILCNRSFHHNIVFPTRVTIQH FT TRQANELLRSNACTNMGLVRIKNYGPLKYNGLPSDLKSISNIILFKSKLKQ FT YILRNVNEYIL" XX SQ Sequence 4135 BP; 1338 A; 724 C; 767 G; 1300 T; 6 other; tgakaagtwc tgattacttc tgctcctctc aatgttctgc aatctaccaa cggattgtta 60 tgatgcaaaa caataataat tccatggtgt cttctttagc ttcagaaatg agagcaacga 120 tctcagcctc cgttgctaag gaaatgagaa cagttacaac tgaggtcaga cagattattt 180 atgctatcga aaaatcccag gaatttttgt ccwcaaagtt tgatgacatt gtcattgaat 240 ttaatcgtct caaatgtgag aacgatagat tagtggccga aatagaaaaa ttgaagaaat 300 cgcaatctga tttacgggaa atggtttaca cacttgaatg taatgctgac aaggctgata 360 aggtagcgct ggataataat gcgatattgt ggggtgttcc aacagcgaca gaagaaaatg 420 tacccagatt agttgaaaag ctacttgtct ccgttgggct gcaagggaac agtgatcata 480 tcaaatctgc tgagcgactg tttagtaatg gacgtaktkk taccatggtc cctatcagga 540 ttgtgttcaa caacaaaaaa tcaaaagaga tggtcttcaa taaaaagaag caatatggga 600 aattgtattc taccgtgatt gatgataaat tgaaggttaa tggaaaagct acaagcgtga 660 cgttgcgcga tgagttgtct ccattatcct tggatttgct gaaaaagctt agagaatccc 720 aagaaattat gggtgttcgg tacgtttggg caggtagagg tggcaccata ttggttaaaa 780 gagataatga tagcaaacca gaattggtaa agaatagagt ggatcttgat cgcgttatgt 840 gtcactttgt acgcagtggt taatgccaat gcggtagtat atttcaagtt ttttctatat 900 aactttaaaa atggatttaa caactataat gattcaaaac aagtaccatg aaagtttgtt 960 tgatttcaat caaaaatgtg aaaacgttgt tgaaaatagc ttcaatgtag tccagtggaa 1020 tgttagaggg ataaacgatt tcgaaaagtt tgatgaattg gtttattttg taactgaatg 1080 tagaatgaac attgatgttg ttgttgttgg cgagacttgg attaagaaag aaaacagtaa 1140 tatttataat cttcctggat atcgtgctgt attttcttgc cgcgaaagct cctctggtgg 1200 gctagcggtt ttcataaaga actctataga ttataaattg ataaatatta aatctattga 1260 aggattgcat tacgttcatc tagaagtaaa acctaatggc caattttata atgttatcgg 1320 catctaccgt ccaccttcgt ttgactacgc tgagttccag aatcttatgg aaggctggct 1380 aacacacgct tctccgtcaa aaccgattct tatagttggt gacgtgaatg tacccattaa 1440 tttacagcac aataatgtgg taactaggta caaaaaccta ttagaatcct acggatacgt 1500 gtgcaccaat acttttgcaa ctcgtcctgc cagcaataat atattagatc attttatttg 1560 tcccattgat cttgctgcaa agctccaaaa tcatactata tttaacgatt taagtgacca 1620 tcttcctatt ttttcgtcca tacctctaag taatgaaaag aaacaacaag aattgagtat 1680 gagtatagta gatcgtgcta cattgcaaca caaattttcc ctttttttga ctgacttcca 1740 agtcaccgaa gatgttgaag cttcgttgaa atctcttatc gaaacccaca atagactttt 1800 aaccgagtgc acgagatctg tcacaaaaac ggtccgagtt aaaaacgatc attgtccatg 1860 gatgacttac agtctctggc gattgatgca aatcaaaaac aattaccttg cgaaatccaa 1920 aaaggatccc aataacgctc aggtcaagga aatgttgaca cacgtttcaa aaaaagttaa 1980 agctgcaaaa actcgctgta aacgggaata ttatgaaaat attcttaaga gttctaatca 2040 ctccaatgtg tggaaaaaaa tgaatcaaat atttggtcgt tcaaaagtca gagaacaaat 2100 caaattaaaa attgacggtg aaactgtaca ctctgataga gatgttgttg aaatttttaa 2160 cacctacttt gctaacatcg gacacaatct ggctaataaa cttcctaaaa ataatttcga 2220 tgttttgaaa catgttaaaa ctgtcaataa ctctatattt ctgcgtccca ctaacgtaag 2280 tgaagtaagt atcctaataa gtcacctaga tgttaaaaaa agtagaggac acgataatat 2340 ttccgctgat ctgatcaaag ctaatatggg ccctttctca gagattattg caaaactttt 2400 taataaaata atagcaaccg gtgtgtaccc aaatgtttta aaaattgcta aagtaacgcc 2460 agttttcaag actggtgacg cttacgaccc ttcaaactat cggccaatct caaccctcac 2520 agttttgaac aaaatcttcg aaaaactttt aacgagtcgg cttgtaaact ttttgaccgc 2580 taacaacgtt ttttataaat tccagtatgg attccgggaa ggttgtggta catcaactgc 2640 aatcatcgag cttttagatg atctgtattg cgaaatagac cgcaaaaaga ttgtgggggg 2700 cttattcatt gatttgaaga aggctttcga cactatcgat cacggcattt tacttaaaaa 2760 acttgatcga tatggaataa gaggagtggc caatgaccta ataaaaagtt atctgtcaga 2820 tcgtaagcag tacgtagcgc ttaatggtac ccgtagtagt ttatgctcta ttgatatagg 2880 agtgccacag gggagcaaca ttggccctct attatttttg atgttcgtaa atgacttggg 2940 taatctgaaa ctacgtggaa caccaaggct ctttgcagac gatacggcat tgttctatcc 3000 tcactccaat atatcatcaa tagttgaaga tattgaatca gaccttgtaa gcctcttgca 3060 ttattttaac ggcaatcatc tctctttgaa cctcacaaag actaaatata tgatcttcca 3120 ctcgtctcgg aaaaaaatac tgcaacactc tgatccgtac gttgaaaatt tacaaattga 3180 aaaagtgttt tctttcaaat atttagggtt gtacttagat tcgacccttt cctgggacgc 3240 acatataaag catgtttcaa ataaagtctc ctcgttatgc ggtattatga aaagagttcg 3300 ctcatttgta cccaatgaag cacttctgcg attttattat gcgtgtatcc actctataat 3360 tcaatattta attatcgttt ggggtcatgc cgcaaaatcg aaacttaaaa aagttcaaac 3420 actgcagaac agatgtgtaa aagtaatctt taatttacca ccattatatt cgacagtttc 3480 actttatact agactacagc atacaattat acccattctt ggtttgcggg atacacaaac 3540 tatcatgttt gtacataata tcttgtgcaa tcgtagtttt caccacaata ttgtttttcc 3600 gaccagagtt accatacaac atactagaca agcaaatgaa ctattgagaa gcaatgcatg 3660 cacgaacatg ggtcttgtgc gtatcaaaaa ttacgggcca ttgaagtata acggacttcc 3720 ttccgactta aagtcaattt caaatatcat actatttaaa agcaagttaa aacagtacat 3780 attgaggaac gtaaatgaat atattctttg atcgtgttaa ccttttgttt atgtaggttt 3840 tttctggctt ctgctggttt ttccctgcat gtttagtttt tagtttagtt tattcgttaa 3900 agccacaaaa cctatcacta gttaacttat ttcggattta aatttagttg ttcattcagt 3960 aaatcccttc aaaggaacaa agttccactg ggatttcatt ttgctattgt tgtttttcct 4020 tatctccgcg ttttgtaatt aattattagt ttttctattt ttgtaagtgt ccattaccag 4080 ggagctcagt gtgagctttt tggtatgggg gtcagtggag ggtattaaaa aaaaa 4135 // ID Gypsy-6_SI-LTR repbase; DNA; INV; 221 BP. XX AC AEAQ01015719; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_SI_; KW Gypsy-6_SI-I; Gypsy-6_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01015719; Positions 26 246. XX SQ Sequence 221 BP; 58 A; 39 C; 64 G; 60 T; 0 other; tgtatgtatt gtaggatcgg agtttagata gcgttttagt tttgtgttgc gttagatacg 60 tagttagttt ttattttcgt tatagcagaa gcaattaggc gcacggtgat ttcgatgcga 120 gcgcccgtaa agacattcgg cgcgaacgcc cgtaaagaca ttcggcgcga ttcggcaaag 180 ttcggcgcgc aacgtgaaaa agaggaagac tcacggagac a 221 // ID DNA8-59_AP repbase; DNA; INV; 217 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-59_AP. XX NM DNA8-59_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-217 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1993-1993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 217 BP; 56 A; 43 C; 55 G; 62 T; 1 other; catagacctt atactaatag acggaacatc caggcgggat gcgggtgggt ggtgcggttt 60 gcgttacagt gatatatata tattatatac agtaatatac agagtattgt atcatctagt 120 gcgcatgtgc ggtgtacgtt gtactacctt gatcagcgcc gtcacgcttg gacgccggaa 180 ctacaaccac gtnccgtcta ttagtataag gtctatg 217 // ID AeBuster1 repbase; DNA; INV; 2650 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A hAT DNA transposon family from Aedes aegypti. XX KW hAT; DNA transposon; Transposable Element; AeBuster1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2650 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2650 RA Kojima K.K. and Jurka J.; RT "hAT-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 5 CC sequences with >99% identity, and is ~100% identical to the CC original sequence in [1]. 8-bp TSDs. XX FH Key Location/Qualifiers FT CDS 493..2409 FT /product="AeBuster1_1p" FT /note="transposase." FT /translation="MDKWLLKKPKSEDLEAEKKNNADIVAGPSGGGTRRKL FT EEELPSTSAKKKTRQYDSDYLQYGFMEDLKDKFPRPQCVICLEVLANESMR FT PNKLIRHLETKHPEFKDKTLDFFTAKVKALNCSRDTIQAYTKVSDKAMVAS FT YHLSLKIAKSGKAHTIGENLILPALKETVSIMFGPKFAQEIESIPLSNNTV FT ARRIEDLSGWVENELVERIKASTHFSLQLDESTDVEGLSQLIVFVRYFWKE FT DVHEEMLFCEPIMRGTSDEIFDKLNSYIKANGLYWENCIGVCTDGARAMCG FT KNSGVVTRILKLSPNASWTHCSLHREALVAKTLCDDFKNVLTTTVKIVNFV FT KTKPLQSRLFEKLCEDMGSNFTSLLLHTEVRWLSRGKVLTRVVELREELAT FT FLEGKENFSTLLRDKDFSLKIAYLSDIFQKLNHLNTYLQGSTSMDIFDVHD FT KIRGFMRKLDLWSRNLKNGNYDCFDSLQTLIVEKQFKPSSTIINGILGHLK FT SLKEKFGEYFDGEMKKTEKNLWIVDPFNQNEASTNISTKADEELIDLSENS FT TLKINYDRKNILRFWISVRHMYPCLYEEAVKFLLPFTTSYLCEAGFSEMVA FT IKTKYRNKLRLSPSLRLKLTGIEIDVSQVIDNNRKQSHPSH" XX SQ Sequence 2650 BP; 833 A; 480 C; 556 G; 781 T; 0 other; catagattcc caaactgtgg gtcgcgaccc cctggggggt cgcgagacag ttcctggtgg 60 gtcgcgagaa aaaaaaaatc aagtttgatt tttcacctcc actttcaaac agttctgtac 120 attttggcaa tcaatttata tattccaatt ccattacaca ccaattcatt cattctttat 180 ttttcgagta atgtaattgt gtttgtacca tgtacaatct atcatcactg aactgcgacc 240 gctttgtttt cataaaagtg tgtgtgaaag caagcagcgt tattgttgaa taccaacata 300 ccaatctagt tcactgtgtg ctttctttca gtgcccatat tcattcattt tgcgttgata 360 ccaacaggtg gcttcattga gtcagttaat tgttcggatt ttactttgtt tctaaacgaa 420 aattaatcaa caagtaagtt ttatttacat cggaaatcag aatatgcgat tcattatttg 480 ataatttcag atatggataa atggttgttg aagaagccca agagtgagga tttagaagca 540 gaaaagaaaa acaatgccga catagtagca ggtccttcgg gtggcggaac gcgaagaaaa 600 ttagaggaag agttgcccag taccagtgct aaaaagaaaa cgcgacaata cgattcagat 660 tatttgcagt atggtttcat ggaggatctt aaagataaat ttccacggcc tcagtgtgtt 720 atttgtcttg aagtgctggc aaatgaaagc atgcgtccga acaaattaat ccgtcacctc 780 gaaactaagc acccagagtt taaggataaa actcttgatt tttttactgc aaaagtgaag 840 gcgctcaatt gttccaggga tactattcaa gcatacacca aagtatccga taaagctatg 900 gtggcatctt atcacttaag cttgaaaatt gcgaaatctg gtaaggctca cacaattggt 960 gaaaatttga ttcttccggc gctcaaggaa acggtgtcaa tcatgttcgg tccaaaattc 1020 gcacaagaga tagaatcaat tccgttgtca aacaacactg tcgcaaggcg tattgaagat 1080 ttatctggtt gggttgagaa cgagctggtt gaaagaataa aggcaagtac acatttttca 1140 ctacagttgg acgagtcgac agatgttgaa gggttatcgc agctgatcgt ttttgttcgg 1200 tatttctgga aagaggatgt tcatgaagaa atgttgtttt gcgaaccgat aatgcgagga 1260 acaagtgacg aaatatttga caagttgaac tcttacatca aggccaatgg actttattgg 1320 gaaaattgca ttggagtttg cacagacggt gcacgtgcta tgtgcggcaa aaacagcggt 1380 gttgttacta gaattctgaa gcttagcccc aacgcttctt ggactcactg tagtctgcat 1440 cgggaagctt tagttgccaa gacactttgt gatgatttca agaatgtatt gacaaccacg 1500 gtaaaaatag taaatttcgt taaaaccaag cctctccagt ctcgcttatt cgagaaactt 1560 tgtgaagata tgggtagcaa tttcacttca ttgttgctgc ataccgaagt ccgatggcta 1620 tcacgaggaa aagttcttac gagagtggtt gagcttcgtg aagagttggc tacatttttg 1680 gagggaaaag aaaacttttc tacgctttta cgagataaag acttttctct gaaaatcgct 1740 tatttgtcgg acattttcca aaagttgaac catctgaaca cttatctcca aggatccact 1800 tcaatggata tattcgacgt tcacgacaaa attcgaggat ttatgcgaaa acttgatttg 1860 tggtctcgca atttgaaaaa tggaaactac gattgtttcg attctctcca aacattaatc 1920 gttgagaaac agttcaaacc atcatctacc attatcaatg gtattctggg ccacctgaaa 1980 agtttgaaag aaaagtttgg tgagtatttc gacggagaaa tgaaaaaaac cgagaaaaat 2040 ctgtggatag ttgacccttt caaccaaaac gaagccagta ccaatatatc tacaaaagca 2100 gatgaagaat tgatcgactt gtcagaaaac agcacactca aaatcaatta tgatcggaaa 2160 aacattctgc ggttctggat cagcgtgagg catatgtatc cttgcctata tgaagaagct 2220 gtgaagttcc ttttgccctt tacaacttca tatttgtgtg aggcaggttt ctcagaaatg 2280 gttgccatta aaacgaagta tagaaacaaa cttcgattat ctccatcgct acggttgaag 2340 ttgacaggaa ttgagatcga cgtaagccag gtcatcgaca ataatcgcaa gcaaagccat 2400 ccatcacact aatgcgcatc gaacaacatt tttagtgaga aaaagaaatc tcgtgtttta 2460 ttactacaac ttgtataatt tgtttaatag catttatgtg gaaaataaac atttttattg 2520 ttattcaaca gttgtttttg atatcgaatt aaagagtatt gcgtatatta tccattttga 2580 ggaggggggt cgccaaaaat acatcactcg gctagggggg tcgcgcatcc gaaaagtttg 2640 ggaacctatg 2650 // ID MARP1 repbase; DNA; INV; 342 BP. XX AC Z11862; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.brucei Marp1 repeat protein. XX KW Satellite; Simple Repeat; MARP-1 repeat; MARP1; KW microtubule-associated protein MARP-1; Repetitive sequence. XX OS Trypanosoma brucei OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma. XX RN [1] RP 1-342 RA Affolter M., Hemphill A., Roditi I., Mueller N. and Seebeck T.; RT "The repetitive microtubule-associated proteins MARP-1 and MARP-2 RT of Trypanosoma brucei."; RL Unpublished. XX RN [2] RP 1-342 RA Hemphill A., Affolter M. and Seebeck T.; RT "A novel microtubule-binding motif identified in a high molecular RT weight microtubule-associated protein from Trypanosoma brucei."; RL J. Cell Biol 117(1), 95-103 (1992). XX RN [3] RP 1-342 RA Schneider A., Hemphill A., Wyler T. and Seebeck T.; RT "Large microtubule-associated protein of T. brucei has tandemly RT repeated, near-identical sequences."; RL Science 241(4864), 459-462 (1988). XX RN [4] RP 1-342 RA Seebeck T.; RT "MARP1."; RL Direct Submission to Genbank (23-MAR-1992)Seebeck T., University RL of Bern, Allgemeine Mikrobiologie, Baltzerstrasse 4, Bern, RL Switzerland, CH-3012. XX DR GenBank; Z11862; Positions 1029 1370. XX SQ Sequence 342 BP; 103 A; 92 C; 87 G; 60 T; 0 other; gaagaagttg caaccgatat gcgccacgtg gatgagagcc acttcctgac cacgacacat 60 gaggcataca aacccattga ccccagtgag tatcgtcaga agcgtaccgt tggggaagaa 120 gttacaaccg atatgcgcca cgtggatgag agccacttcc tgaccacgac acatgaggca 180 tacaaaccca ttgaccccag tgaatatcgt cagaagcgta ccgttgggga agaagttaca 240 accgatatgc ggcacgtgga tgagagccac ttcctgacca cgacacatga ggcatacaaa 300 cccattgacc ccagtgaata tcgtcagaag cgtaccgttg gg 342 // ID Gypsy-68_CQ-I repbase; DNA; INV; 4346 BP. XX AC AAWU01020187; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-68_CQ_; KW Gypsy-68_CQ-LTR; Gypsy-68_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4346 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 515-515 (2011). XX DR GenBank; AAWU01020187; Positions 24367 20022. XX CC Positions [3358-3825] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2032..4335 FT /product="Gypsy-68_CQ-I_1p" FT /translation="MVAGLDGVEVYLDDVLVHGRTPEEHRSRLLKVLERIQ FT EWGFTLRIEKCSFFMPEITYLGFIVNNRGIRPDPSKTSAICSMPPPHDVSS FT LRSFLGAINFYGKFVRNMHDLRHPLDALLRKDAKWDWNENCQESFEQFKNL FT LRSDLLLAHYNPELETIVAADASNYGVGACLMHRYDDGSIKVVCHASRTLT FT SAEQNYGQVEKEALALIFGVTKFHRFIWGRKFTLHTDHKPLVAVFGSKKGI FT PVHTSNRLQRWALILLSYEFDIEHISTQDFGYADMLSRLIDARIKPDEDFV FT IAAIQLEDELAAAMDEALNILPVTFKHLQVATSNDKLLQDVIKHVQSRWPG FT AIKLVDAELRPFYARKDALSVVQGCLMLADRVVVPQQYQQQVLKALHRGHP FT GQERMKKVIRSHVYWPGVDQDVDDFVRSCHGCAAVAKTPTKTLLSSWPLAE FT HPWQRLHVDFAGPVEGHYFLVIVDSYSKWPEIFKTPTITARTTIDLLFETF FT ARYGLPETIVSDNGTQFTSSLFKEFCESLGIIHIRTAPYHPQSNGQAERFV FT DTLKRSLRKIMPGAEHDIGRSLQIFLSAYRATPNKSAPDGLSPAEVLMGRK FT HRTTLDLLKPPSEHRAALQRNQRQEEQFNKRHGARKREFGKGDSVHASTFV FT HGKLVWQTGTVIERVGNVMYNVWLNDKKRLIRSHTNQLRARGGSKVEAYPE FT EAESEVKLPLHILLEDFQIIEDPVVAEDEQQAPPAPSQRRARVRPPSILRR FT STRIRRMPERYNPYFYT" XX SQ Sequence 4346 BP; 1034 A; 1272 C; 1162 G; 878 T; 0 other; ttttggcgac gaggaaagtg aaattacccc gcaatttcac ccgcgaaacg aacgcgcagc 60 acgatggaca aggagtttaa ggcgatgttc cagcagctga tggagtcgca gcagaagctc 120 atcaccgaac tggccatgtc ccggacggca ccaccgacga cgcccgcgcc tggtacgtcg 180 acgaccgatg cccgtatgga agctctagcc aactccatga cggagttcca ctacgatccg 240 gaatctaacc tcacattcga gaactggtac gcccgccatg aggataccct gaagtgtgac 300 gctgccggtt tggacgatgc tgcccgtgtt cgtctcctgc ttcgtaagct cagcgacgcc 360 acccacgcag agtacatgaa ccacatcctg ccgaagatga cgagggacta caagttcagc 420 gaaactgtgg acaacctgac gaagctgttc ggcgctcacg tctctctctt ctccaagcgg 480 taccactgcc tcacgctgac caagagggcc tcggttgact tccagtcgta cgccgggcag 540 gtgaaccgct gctgcgagga tttcggtgtg gcccagtgca gcgtcgacgc attcaagtgc 600 ctcatattcg tgtgtggcct acacaacaag caggacgctg agattcgcac atctctgctg 660 gcgaagctgg aaggaaacgc gaacatgacc ctggacaacc tcactgcaga atgtcaccgc 720 atcatcaatc tccgctaaga tgcgtctatt gtggaggtct caagtgacaa gtcatcggtg 780 aatgcgatca cccacggccg ttctgacagc acacgacact catcagctga tcagaagccc 840 actgcaacac gatcgcgctc ccctcacaac tccaacaaca acaatccaca cattagcagc 900 aacagtcgtg gtggcagcag caaaaacaat aacaaaccac gaacaccgtg ctggaagtgt 960 ggcgagctgc actacgtgga gttttgtcag tacctgaacc atcgctgccg ggattgcaac 1020 cgggtcggcc acaaggacgg gtactgccag tgtgtgcggt cgaatccagg taaaaagcgt 1080 cgcgagaagt ccagccagcc agtgcacacc agggggatct acgtgagaca gctagctgtt 1140 caccgcaacc gaaagtttgt gaccgtggtg attaatcacg tcgagcttca actccagttg 1200 gactgtgctt ccgacatcac cgtcctcacg gaagccagct ggagacgcat cggacgacca 1260 ccattggagc caccgtcgca gcaagctcgg actgcttccg gacagccgct ggacctgatc 1320 ggcgagatcg tctgtgacgt cacgctgaaa ggagtgcatc gttcaggtaa agtttacgtg 1380 accaacatta acgatctgaa ccttttaggc ctcgacttca tcgatctctt caacctgtgg 1440 gatgtccccc tgagcacggt ttgcaacctg gtgacgtcat cgaaggacaa cgtggagtgg 1500 ctgaaagcgt cgttcccgca gctgttttcc gactcactcg gatgctgcaa gaaggctgaa 1560 gtgaaacttt acgttttgcc agacgttcag ccagtgttcc gtgccaaaag acccgtgccg 1620 tttgccgccc tgcagccgat ccaaaccgag ctagaacgcc tccagaagct ggacatcatc 1680 tcgccggtgg agttttcgga ctgggctgcc cccatcgttg ccgtcaagaa gaagtccgtc 1740 aacggtgagc caaacaaagt gcgagtgtgc gcggactact cgactggcct aaacagcctc 1800 atccagccga accaacatcc gctgccattg cccgaggaga tttttgccaa gcttacgggt 1860 agcaagatct tcacgcacat cgacctgtcg gacgcgtacc tgcaggtgcc cgtcgagaag 1920 gagtccaggc agtacctgac gatcaacact cacctgggcc tgttcgagtt caaccgcttg 1980 tcgcccggtg tcaagtcagc ccctggcgcc ttccagaaga tcgtcgagtc gatggttgct 2040 ggcctcgacg gagttgaggt ttacctcgac gatgtgctcg tccatggcag aacaccggaa 2100 gagcaccgct cacgtttgct gaaggtcctg gaacgcattc aagagtgggg tttcaccctg 2160 cgcattgaga agtgctcgtt tttcatgccg gagatcacct atttgggatt catcgtcaac 2220 aaccgaggaa tcaggccgga cccgtccaag acgtccgcca tctgcagcat gccacccccg 2280 catgacgtca gcagtttgcg ttccttcctc ggtgcgatca atttctacgg aaagtttgtt 2340 cgcaacatgc acgacctccg acacccactc gatgctctgc tgcgcaagga tgccaagtgg 2400 gactggaacg aaaactgcca ggagtcgttc gagcaattca agaacctgct ccggtcggac 2460 ctgcttttgg cgcactacaa ccctgagctg gaaactatcg tcgccgctga cgcgtccaac 2520 tacggtgtcg gcgcatgttt gatgcatcga tacgatgacg gctcgatcaa ggtggtgtgc 2580 cacgcgtccc gaacgttgac gtcggccgag cagaactacg gtcaagttga gaaggaggct 2640 ttggcgctga tcttcggagt taccaagttt catcgcttca tctggggacg gaagttcaca 2700 cttcacaccg accacaagcc gctggtcgcc gttttcgggt ccaagaaggg aatcccggtt 2760 cacacctcca accgtcttca gcgatgggcg ctgatcctgc tgtcctacga gtttgacatc 2820 gagcacatct caacgcaaga ctttggctat gctgacatgc tctcgagact gatcgacgcg 2880 cggatcaaac cggacgagga ttttgtgatc gccgcaatcc agctcgagga cgaacttgca 2940 gcggccatgg acgaagcact caacatcctg cctgttacct tcaagcacct acaagtggcc 3000 acaagcaacg acaagctact acaagacgtc atcaagcatg tccaatctcg ctggcccggg 3060 gcaatcaagc tggtcgatgc tgaacttcgc ccgttctacg cgcgaaaaga cgctctctcc 3120 gttgttcaag ggtgtctcat gctggctgat cgcgttgtcg ttccccagca gtaccagcag 3180 caggtgctca aggctctcca tcgaggacac ccaggtcaag agcgcatgaa gaaggtgata 3240 cgaagtcacg tgtactggcc aggtgtggat caagacgtgg acgacttcgt tcgttcgtgt 3300 cacggctgcg ctgcggtggc aaagaccccc accaaaactc tgctgtcgtc gtggccccta 3360 gcagagcatc cttggcagcg cctccacgtg gatttcgctg gtcctgtcga gggtcactac 3420 tttctggtga tagtggactc gtacagcaag tggcctgaaa tattcaagac accaaccatc 3480 acagcgagga caacaatcga tctgcttttc gaaacgttcg ctcgctacgg gctcccagag 3540 acgattgttt ctgacaacgg tacgcagttt actagctcgc tcttcaagga attctgcgaa 3600 tcgctcggca tcatccacat ccgcactgcc ccgtatcatc cccagtcaaa cgggcaggcc 3660 gaaaggtttg tggatacgct aaaaagaagt ctgaggaaaa tcatgccagg agctgagcat 3720 gacattggac gatcgctgca aatctttttg tcagcctacc gtgccactcc taacaagtca 3780 gcacctgatg gactgtcacc tgctgaagtc ttgatgggga ggaaacatcg cacaactttg 3840 gacctgctca aaccaccgtc ggaacaccgg gctgcgctgc agaggaacca gcggcaagaa 3900 gagcagttca acaaacggca cggggcacga aagcgtgaat ttggcaaagg cgattcggtt 3960 cacgcaagca cattcgttca cggcaagctg gtctggcaaa ctggcacagt aatcgaacgc 4020 gttggcaacg tgatgtacaa cgtctggctg aacgacaaga agcggctcat caggtctcac 4080 acgaaccaac tccgtgctcg ggggggctca aaagtcgagg cctatccgga agaagctgag 4140 tcggaggtca agctacctct gcacatcctg ctcgaggact tccagatcat cgaggatcca 4200 gtagtagccg aggatgaaca gcaggcgccg cctgcaccat cgcaacgtcg agccagggtt 4260 cgaccaccgt cgattcttcg ccggtcaaca aggattcgtc gtatgccgga gaggtacaac 4320 ccgtacttct acacctaaag gggagg 4346 // ID CR1-14_BF repbase; DNA; INV; 4727 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-14_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-14_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4727 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4727 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1585-1585 (2009). XX DR [2] (Consensus) XX SQ Sequence 4727 BP; 1225 A; 1125 C; 1081 G; 1296 T; 0 other; atggcgacct acgcgtctgt taccaccttg acgacgcggt cggcagaatc caccatgccg 60 atggtcaagc cggcctacat gaagtacagc gaggccatcg ggagagaaga tcggtggctc 120 agtacgaggg aaatctgtgg tgcagccgag aagatcgccg gctacgagac catggaggga 180 gcccagctgg tgggagggct gtggcgtctt tacccaaaga caaaggcggc acggacagcc 240 cttctggccg aagggctgac gctgagaggg aagaagatcc ccctctacga ccacaacccc 300 tacatcgttg gtggtcaggg agaaaacaac aaccccacga caagggtgac gatcgagggg 360 attcctttgt ctgccgacga tcgcgacatt gtgaaaaccc ttgagcgctt cggctgcaag 420 ctcagaagcg ccgtcatgta cgagaaagac agggatgaag acaagaagct gacaagatgg 480 aagaacggca acaggttcgt gttcatcgat atcccggccg aacctctgcc tgtcaagatg 540 acgttcggaa acttccgtgg tgctgtaaaa catcgagagc agtaccacaa gaagccagaa 600 gacatgacct gcgggaaatg tcttctcaag gggcactaca gcaagatctg ccccaacgac 660 atcgtctgca gggactgcgg catgtcaggg cacaagcgag gtgaccctgg ctgccattac 720 atcacggcag tacagcagat ctcttcacca tcacctgaag atgtcatcga ggaaagtttc 780 cggctctacg tggagccagc ggcaacaaca tcgcccgaca cttccgacaa caaagacact 840 tccggaaaca acactgaggt cagccagagt acttccgggg taccagctga cgagagtgtc 900 agcgctccaa agtcaccact gccgaagcgt gacagcaaga ccaggggccg gaaaagatcc 960 aaggcccatg ctcggcagca gagagacagc gagcgagacg acgccctgac gaggctgatg 1020 aaggaggcaa ggatgcgggc ggttccagcc agcatgccga ggaagacctc aggctctcca 1080 accttacatt cttcttgata tgctagacta gatttgtgta caagtgcact tgagtcatgt 1140 tattttaacc ttagcccttg tgtagaattt aattctttgc aacatttgcg ataatgtagt 1200 aatgatcttt ctatctgagc atgttttgta caagatcaat agtggctcat ttgtcataat 1260 ctgtgcaata tcaagtgtgc agcgagggtg tacttttcac tgtagcattt gaattttata 1320 tactgttaag tcacgatctt ttaaatgtat gaaaccaatg cgttttagct gtagaattta 1380 gatctgcctt agtagccaat tgatatgcaa attcccattg tggtctgagc acatgtatca 1440 ggaagccctc cactcatcag tgccaaattg atttgcaaat cagtagaagt atgttaccac 1500 atgtgtgctt cctgttgttc ttcattttaa tgatgtcatc caattttctt aatgctatat 1560 atctcaatgc taggagtgtg aaatctgtta accagagtag gaacaagttg gtacagctca 1620 ccaacctgtt acaccttaat gctccggatc ttctcgcaat cactgagact tggttgactt 1680 ctgatgtgca tgatttagag gttattcccc ctgaatttgt gacctatcgc aaggatcgtc 1740 atgttactca caataataag tctggtgggg ggattctact tgctgttaaa tcgtcaatat 1800 gtagtagtcg tagacttgac ttggagcctc aggatgagat tcttgtatgt gaaatccgtc 1860 ctccagctat gggcaaaatt gctgttattc tttgctacag acctccctct ggtaacttgt 1920 catcttttac gcacaacctg ggctcgactt tagagagcgt gaacaaagag tataacatgt 1980 gctgtgtcct cggggacttc aatctgcctc gaatcgactg ggccaattgc gtagtgttaa 2040 acgagggtag ggaagctgac ttttgcaaca tgattaataa ctacttttta aaacaaatca 2100 atcatgttcc ctcaaactcc actcacaacc tgttagatct tgtgtttacg gacttccctg 2160 agaggttctc tacgatcagg gaacttcccg ctgactttga taccgaccac actgttctcg 2220 agttctctct ccgctgtcgt atccaaacta ggcaggagct tcctcgtaaa gtgtacaact 2280 ttacacgagc cgactggctt ggtctcagtg ctcacctaga gtcactgagg ctatcagagt 2340 cagttactca ccaccatgac attgactctg cctgggaagc atggtcttca gccgttcagt 2400 cggctgttga catgttcgtg cccagtagga agctcaaggt atcaacatca cctccttgga 2460 tcgacggcga ggtacgcaac ttgcagaaca agaagcggac agcgtggaaa agggccaagc 2520 gttccgactc cccctctcac tgggagaagt tccgtaaact aagaaacaaa ctgaagaact 2580 tgttatcagc taagtataac aaatatcttg ttagcctctc ttccacttta caggagtcgc 2640 cgaagagatt ctgggctttt gtacgagcaa agtcaaaatc caggtctctt ccgactgacg 2700 ttcatctcaa tggcacggtc gttcagtctg cacccgataa ggccaatttg ttcaatgact 2760 actttttctc taccttttct gcacctgacc ctaatgttat caagcccgac atcgacgtga 2820 tagttgatga aagattgtgt aatttgctac ttgctgtaga atccgtggaa aatgtgttag 2880 ttaatctcaa cactagtaag gctgtgggtc cggataacat ctctgcacat gttcttaagg 2940 gctgtgccaa gactcttgct ccttccttga ccttgctgtt caatagatca ctatccgctg 3000 gttgtgtacc gagtagatgg aaggatgcaa acgtactacc agtgcataag aagggagaca 3060 aggagaatgt ggccaattat aggccagtat ctttgttatc cctggtaagc aagttgatgg 3120 aacgctgtat gtacaatcag ttgattccaa ttttgagagg gtcgcttcac gaattccaac 3180 acgggtttat tgccggccgt tctactacaa ctcaactagt agaaacctac catcaagtag 3240 gatccatact cgataagggg gggcaggtag atatgttatt cctggacttc gcgaaagcgt 3300 tcgattcagt ttcccacact ttgctaatcc acaaactgca gatgtttggt ttcggtggta 3360 acctcctggc gtggtttacc tcctacctaa ccgatcgtcg tcagcgtgtg gtagtggaag 3420 gttgccagtc caaatggctc ccggtgactt caggtgttcc tcagggctcc attcttgggc 3480 cctttctctt ccttctgttt atcaatgact taccttgtgc agcaactcaa tctactgtag 3540 cactcttcgc tgacgactct aaatgtttta gggaagtcag caagctaaat gactgtgtta 3600 agctgcaaaa tgacatcact gccatgtaca actggagtct tagttggaag atgtcttttc 3660 atccttccaa gtgtaaactt atccgtttaa ctcgctctaa gagtcctatt gtgtactctt 3720 atcacatgtc tggtactaca ataacatctg tagacagtat gcctgacctt ggggtcattg 3780 tccagtcgga ccttcagtgg aacagacaca tcttaaagat ggtagccagg gccaattcta 3840 tgactggttt catcaaaaga acaatcgggt tcaactctag cactgatgta cgcaaggcac 3900 tgtacaccac attggtgaga agcatcttag aatactgctc acctttgtgg tcccctcagt 3960 cccgcaaact catgtccctt gtagaagggg tgcaacggcg ggcgactaaa ttcattctag 4020 gagcccactc gaacaccctg acctacaagg aacgtctctc cacccttggt ctgttgcctc 4080 ttacgcacag gcgcgaagta tgcgacgtca tggtgtttat caagtctaaa ccaattgttc 4140 acaagtatgt atccttccct tccagaccct ctagatatct cagatctcac atgcttaccc 4200 ccattagagt aaggacatca gccttcagct cttcttacat accaaggctg gtcaccatct 4260 ggaatcagct gagtcctgtc ctccgtaata ttggggttca agccactgaa aagtctaaca 4320 tcttgacatt caagcgatcg cttgtgagag ccacgtttgt gagatttgac acccagttca 4380 atatcgacgt cccctgctct tggtccacat cttgtgggtg tgcctcttgc actgctttaa 4440 ggctacatta gtattataat tagcttttta ctctgtatat aatgatattt gataattttg 4500 tatatacatg taatcaattc tgtgtataat tgtcaccgaa ttgtatattc atgtctactg 4560 acttttgatg tctaatgttt attgtttgat tttggtttcg tattgttttc tattatcatt 4620 tcaggtttgg gaggacaact tgtaaaggtg ttgatacacc tgttttgtcc tcccttccac 4680 ttgtttttgc atgtggtaaa ttcaaataaa gaaataaaga aataaag 4727 // ID DNA-2-1_NVi repbase; DNA; INV; 920 BP. XX AC . XX DT 10-APR-2009 (Rel. 14.04, Created) DT 10-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 2-bp; KW DNA-2-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-920 RA Bao W. and Jurka J.; RT "DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 760-760 (2009). XX DR [1] (Consensus) XX SQ Sequence 920 BP; 244 A; 188 C; 203 G; 285 T; 0 other; ccctagtgga aatggattgt gtgccaaagt gtgtcaaagt gtgccaaagt gtgccaaagt 60 gtgccatagt gtgccataat gttccaaagt gttcaaattc ggaaatttat tacactttgg 120 cataccttgg tatactttgg cacactctgg cacactttgg cacattttga cacacatggg 180 tttttcggaa attgaggact tagaaaattt ttgcacgaaa tctggaaaac ttgtattccc 240 gacggattcc tctgttgagg catgtttttc caacatttac ggcggacagc agctttaagg 300 caattacggg caattgttaa ttaatattgt aaaagtacta agtatctaaa gctgcttgaa 360 gtacatgagt ttaaattttt gtattaactg taacgagtac ttgtggctca gtggtagacc 420 ttcagcttac taacccaaag gatccgggtt caagtcttac cgacgctaga attttttttt 480 ttttttttat taaattattt attgttacgt tctataactg ggacttcaag ctccacgcgt 540 gccgtcgata tgacggcatc gtgcagtggc aatgttttgg cgcgcgggcg gcgggatata 600 tctcactgta ccgaagtgtg ccaaagtgtg ccaaagtgtg ccaaagtgtg ccaaagtgtg 660 ccaaagtgtg ccatgctgtg ccataccgtg ccaaactgtg ctatactgtg ccatactttg 720 tcaaaaaaga gatgtttgtt tccgaaaaac ccatgtgtgt caaagtgtgc caaagtgcag 780 caaatttccg aatttgaaca ctttggcaca ctttggcaca gtatggcaca ctttggcaca 840 ctttggcaca ctttggcaca ctttgacaca ctttagcata ctttggcaca ctttggcaca 900 cccaattatt tccactaggg 920 // ID CRE1 repbase; DNA; INV; 3483 BP. XX AC M33009; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.fasciculata retrotransposable element (CRE1). XX KW CRE; Non-LTR Retrotransposon; Transposable Element; CRE1; KW integrase; retrotransposable element; reverse transcriptase. XX OS Crithidia fasciculata OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Crithidia. XX RN [1] RP 1-3483 RA Gabriel A., Yen J.T., Schwartz C.D., Smith L.C., Boeke D.J., RA Sollner-Webb B. and Cleveland W.D.; RT "A rapidly rearranging retrotransposon within the miniexon gene RT locus of Crithidia fasciculata."; RL Mol. Cell. Biol 10(2), 615-624 (1990). XX DR GenBank; M33009; Positions 416 3898. XX SQ Sequence 3483 BP; 776 A; 923 C; 1227 G; 557 T; 0 other; aacggcattc ggtctagtgg gtccactccc aaccttctcc tccttggtct cgggttcgat 60 tccggtcggg cacgaaactc tcttcctctc taaactccac atatacatcc acaataacca 120 ctctcataac tgttctggtg cggggccatt tcgaaccata ttgacatttg gcgcaccaaa 180 ctcacacata ctaaccaaaa atggcagagc cacccggggc gacggtccga gcgtggcgga 240 gcacctcaag cgccggcggg tcgaaggatc tgagccggtc gtggtggggt ctcgccagga 300 gggtctctcc ggtgagtcag cagtcgaagc gattgttgtg gaaagtggca gcgaggctga 360 cgaagaatcc acggcgaggg ggctgggagc ctctcgacaa gcgccaggtg atgactcggg 420 cgggccggtg cccagggcgg aggaggacct cgccccggtt gggtacctgt atcccgagaa 480 cctggccgtc ccacaggggg cggcgtgtcc ggtggtgggt tgtgggtacc gaccgaacac 540 ccgggtggga ccgagactgg tggaacacct gaacacggtg caccgggata tcctgggcaa 600 cattcctgtc gacgcctggc gacgtcaggg actcgtccgc tgcctacggt gtggctcggc 660 cttgaccgcg tccggtcatg gtcggggtgc acacggtggg aagtgtggtc cgtacagaag 720 cagaaacgcg gccattaggg ctcggacgca gagtttcttt ggtggaattt cgcagaattc 780 tgacacactt acggcgggaa acggggtctc agaagctggt ggtctagtgg aagtgggtag 840 ggtagaagac ccaaccacag acccgtggta cagggttcga acccccttga agcgacaaat 900 ttaccgtacg gacttccggg tgtggcaggg cttggcccgg ccagtactcc tggggtactc 960 tctggcggat acagcaggga aggagggacg ccttctcgcc ctgctcaatc tccccaggga 1020 ccatctggag gtccaggtaa atgctaagag gggaatccaa ccacagccgg ctgaaatcga 1080 ggcccaggtt cgaaggaagg tagtggagtt ggcagggata ggagcggtag gacgagcgat 1140 ggcagtgatg actcgggggc ggctggtcga ggtgcccctg gagcgggtga tggagcagct 1200 cgaggagctc cacccgcagg aggacccgcg gggttatccg gcagcaccag atacgagcga 1260 ggtgctgcgg gccaaggagc agaaggtgcg gcgggcgatc gcggcacgga tggggagggg 1320 tacggcgccc ggcctcgatg gctggacgcg ggagctcctc ctccccctcg cagaagaccc 1380 ggccctgcta cacgagatca cgtcggtggt ctcggatatc atgcagggga aggtggccga 1440 ggtggtggcg cggaggctgc ggagcagcgc cgtcaccccg attccgaagg acgaggcggg 1500 gacgaagata cgcccgatcg tgccggagtc ggcctggctg aagctggcct cgctggtggc 1560 gatggcggag ataccatcca gcttcaagga gaccttcaag gggtggcagt acggggtctg 1620 gggggacgtc gccaaagcag tggcgaagat ccgccgggac agcgaggagc acgagtacct 1680 ggtggcactc gacggggtca atgcatacaa tacgatgagc agggcccaca tcctccaagc 1740 cgtgtacgcc gagcagcgcc tgaagccgat ctggggggtg gtgaaggtgg cgcttggggg 1800 gccggggttc ctgggagtat acagggacgg ctgcctcaag ggcaacctgt ggtccaccaa 1860 gggaatccgg cagggcatgg tgctgggccc cctcctgtac gcgaccggga tggcagcggc 1920 catcgggccg gtacggcagc gtatccccgg ggtccccgtg acggcctaca tcgacgacat 1980 caccctcgcg gccagcgggg cggagggagc cagggcagcc gaggcatacg cagacgccct 2040 cgagacggtc ggggtggtca ccaacgccag gaagtcgatg gtggtggggc cagaaggcac 2100 ccgggtgggg atcgggggtg tagacctgcc ggtggtggcc gaggcccgga tcctgggggc 2160 ccacttccga gcaaggggga caccagaggc ccgtaccatc gagtggctgc aggcggccgt 2220 cgagaagtgg cgacccatcc accagaagct gcggcaggac atcatcccga agaacattgc 2280 gatgatgatg acccgcatca gcctggggtc caagatgacc ttcctcctcc agacccactc 2340 gccacaggaa ctggagaccg cagcgaagac ggcagacgac gaggtcgagc agaccctcca 2400 gcaccttatg gggcaggtag agatcacgcc ccgagcaagg ctgctggcac aactcccgat 2460 cagagagggg gggctgggtc tccggcgaag cagcgagatt gcgaagttcg cacaggcaga 2520 cgtggggcaa ggcgaggccc accaggcaca cacaaaggca ctagatgaag ggatcaagca 2580 ccagttacaa ccactcctct cggagtccga ggtgcagatc ctgaagtcga acgctggaat 2640 gggggccggg cgggtgctga cagatagtag cctgaggatc ccagacgtgg cagcaacaat 2700 cgcgctgagg gagagactcc tgctcagggt gctcccggag ggatgcagtg tgtgtgtggg 2760 ggggacgcga cgaactacca tgtacacacg tgctccaaca tacccaccaa gccccggacc 2820 cgacgacacg acggggtggt ggatgagctg gtggccctgg ccaggaagat ggggtacgag 2880 cccagcaagg agccgagggc ggacgttgac gagtagggcg aggccggacc tgtacatcac 2940 aggaagcctg aagccggcgg cgacggatgt aacaataacc tacccgggca ggcaggcgag 3000 gggagcacac tcccgttgca gcaggcctac cggaataaga tgggggcctg ggaggcatgg 3060 gggaacctgc gaggggtgga catgcagccg gtggtcctcg ggacgaacgc ggagatacac 3120 ccggagagtg cgaatggata cgaaggttga cctcggtcga agacaaagac aaaatacata 3180 ccagttacaa cgaggtgacg ggacgaatcg tggagacggt gttggttggg aacgtggagc 3240 tgttcaacgc agtgacgaac ctggcgctgg tcagggagtt gatgtaggaa ggtacgggtg 3300 ggtagtagta gaatatcgaa cggcgactag gaagttggac cattgttact tgttacttgt 3360 gattctgacg acgacgaaag attgttcctt gttacttgtt accctgaaat ttgatattac 3420 ttgaattgat atgatatgat tgatacaaaa aaatttaaaa aaaaaaaaaa aaaaaaaaaa 3480 aaa 3483 // ID CVG repbase; DNA; INV; 486 BP. XX AC . XX DT 04-OCT-2002 (Rel. 7.09, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE CvG: putative non-autonomous DNA transposon element from oysters DE (Crassostrea virginica) - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CvG; KW nonautonomous DNA transposon. XX NM CVG. XX OS Crassostrea virginica OC Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia; Ostreoida; OC Ostreoidea; Ostreidae; Crassostrea. XX RN [1] RP 1-486 RA Gaffney P.M., Pierce J.C., Mackinley A.G., Titchen D.A. RA and Glenn W.K.; RT "Pearl, a novel family of putative transposable elements in RT bivalve mollusks."; RL J Mol Evol 56(3), 308-316 (2003). XX DR [1] (Consensus) XX CC Uncommon, structurally similar to CvA. Modular organization CC includes CC subterminal inverted repeats (nt 27-37, 474-484), perfect CC inverted CC repeats (nt 76-83/108-115), self-complementary regions (nt 29-43, CC 438-453, CC 457-484) and an (ACGG)n microsatellite region (nt 420-435). CC Putative CC target site duplication AA. Individual CvE elements contain CC several copies CC of a 161 nt core repeat unit, the first copy being truncated at CC the 5m CC end (nt 94-252 and 253-413). XX SQ Sequence 486 BP; 151 A; 95 C; 104 G; 136 T; 0 other; aacaagtgag acctaatgat gtctggacct gcgtcgcttg cagttatttc atttataaaa 60 aaaattgaca taaaaaactt agattgttag ttgatttacg aatattctct aagttccaag 120 gggctaactc ccacaaaaat gaggcgatca aaatttcctg ccgatatgca taacaccata 180 tgatgtccta tctacatacc aagtttcatg ataattggat cagtagtttc agaggagttg 240 cgatgacaag gtcatttatg agtattttct aagttccaag gggcataact cccacaaaaa 300 tgaggggatc aaaatttcct gccgatatgc ataacaccat atgatgtcct atctacatac 360 caagtttcat gataattgga tcagtagttt cagaggagtt gcgatgacaa ggttttggga 420 cggacggacg gacggacgga gtgactgcac tcctagacct gctttcgcct tcggcgacgc 480 aggtaa 486 // ID DNA8-53B_AP repbase; DNA; INV; 410 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-53B_AP. XX NM DNA8-53B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-410 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1984-1984 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 410 BP; 155 A; 62 C; 51 G; 142 T; 0 other; catagataat ataagaatgg ataggacata agaacttatc actagaaaat ttagtgatac 60 gtttttaatg ttatatgggg caaatattga tatgataatt gataaatgat aataaataac 120 attaatgatt aattagtaat aattaacgat ctatattatt ctatattcta attacaatta 180 attatgtacg caaagtttag agtaatttga gtattccaat ttccattttc aggcacaaca 240 atcaaaatac aaatttgaaa ttgacaaata ttgtaaacag tttttggaca taaccttaaa 300 aaaagcccca tatcgtcttt ctctctttac gttctctctt ccccaaaaaa gcacacggcc 360 ccatatgaaa ctggtatagg catgtcctat ccattcttat attatctatg 410 // ID DNA-TA-5_CQ repbase; DNA; INV; 1419 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1419 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 55-55 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. TSDs are TA. XX SQ Sequence 1419 BP; 493 A; 222 C; 243 G; 461 T; 0 other; ggggagtttg gggtaatatg gacagtgggg gtaatttgga caccccttta aaaatcacgt 60 tttcttcgga tataaagtga aaacggcaat ttaagtgata aggctagtac tagtaatggc 120 ctaggagtat ggacaaacca aaaaagttgg aataggttaa gtagttttgg tataatgtgt 180 aaaagtttcc aaaggccggt tggcactgcc ttaaattcta atatttgaag gttaggaaat 240 aaggttttgc aggggttata tggcaaaaaa gtggacattt tttgtttaag ggggttgctt 300 ggacgatgcc tgagatatgt acgtataaaa attgtgcatt ttatccattt ttatcatcaa 360 aaatcgcaat ttatgataga atatgctatg ggggtaattt ggacatggct tgtggggtaa 420 tttggacata cctgaaagtc tacctgtcag gcaacaaaaa agtaactttc atggttggat 480 gagtttttaa agggatttag ataaatttag agggatttaa tatgtcaaaa caatcaaaaa 540 tgaatgtgag caatgagaac tttcacaaaa cccctatttt ttaatttaga taaatttatt 600 ccaatattcg aattgatgaa aatctgatta ctgtacattt ggcgttcaac aagactctaa 660 tgttgattgc ttttaattaa tttctttgac atttctaatt tctatgctgg tttacaataa 720 tatttaaatt tgaagggcta cagaatgtaa aatttaaaaa aaatgatatt ttcaggctaa 780 taaattctat ataaagattg atttatataa acataaacga taccttaatc actaaaaaca 840 agtatagtgt gcctgatcca gaatcaatgt ttaaatcatt tcagtattaa tgattaaatt 900 atgtatacta cactgaacta tttgttatct tcctattgaa ttatagttct atcacaaaat 960 cattatttaa acctttgatg aaatattgtg agcaacactt caaaatataa agcaattaca 1020 aaaccaggtt gaaggataaa aaacatgctc aaaaaatcaa atgttaaact tattcttcaa 1080 catgtgtcca aattacccca caaaaggtgt tcatattacc ccacaaggta tgtccatatt 1140 accccacaag gcatgtccaa attaccccac aaggcatgtc caaattaccc cataggggtg 1200 ttcatattac ccccacacaa aaatgtgggg gtaatttgga caccttttaa cttttgcgat 1260 aaaaatgggt ttttcatcaa aatttatcga aatgaatgac atttttccat caaatatggt 1320 ctctattatc ctctggtgtc atccacattc ctttaaaata tccatacagc ccgtaacaga 1380 gcaatttcct tagggtgtcc aaattacccc acactcccc 1419 // ID Gypsy-15_DWil-I repbase; DNA; INV; 6323 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_DWil_; KW Gypsy-15_DWil-LTR; Gypsy-15_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6323 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 8567940 8574262. XX CC Positions [4117-4596] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2095..4833 FT /product="Gypsy-15_DWil-I_2p" FT /translation="MLHKCADVNFIEINDEDVPLAIKEKFKKMIKLRASSF FT ADPKIALPFNINTVATIRTQGEPVFSRPYPYPLGVSDFVNTEIKQLLKDKI FT IRPSKSPYNNPIWVVDKKGVDELGHKKKRMVIDFRKLNQVTVSDKYPIPSI FT TTILSNMGKSQYFSTLDLKSGFHQIELAEGDREKTAFSVNSGKYEFCRLPF FT GLKNAPSIFQRAIDDVLREHIGKICHVYVDDVIIFSETKEDHVKHIDIILQ FT CLGDAGMRVSQEKSKFFKKSVEYLGFVVTRGGIKTSLEKVQAIQSIKPPTS FT LYELRSFLGLSSYYRCFIKNYAAIARPLTEILKGDNGKVSTNHSRKVKVEL FT TATQLQTFDRLKNILASEDVMLRYPDYKKPFDLTTDASGHGLGAVLSQDGR FT PITMISRTLRDNEPDLATNERELLAIVWALKTLRNYIYGVRDLNIYTDHQP FT LTFAVSDRNTNAKIKRWKSIIDDHNAKLIYKPGKENKVADALSRQNINALQ FT SQESDIATIHSEESLTYTIDSTDKPVNCFRNQIIIEEADFPSVRTFLLFQT FT KTRHVIQVSDRSTLMQTLQDVVNADVVNAIHCELPVLAFIQHSLVEMFPTT FT TFRYSKYLVSDIPDRTEQREILTTEHNRAHRAAQENVKQVLRDYFFPNMNR FT LATEIAANCKICSVAKYDRHPKKHILGVTPIPSFAGEILHVDVFSTDKKLF FT LTCLDKFSKFAVVQPIASRAIVDIKAPLLQLINLFPKIKTIYCDNERSLNS FT ETIKSLLSNNFDIHIANAPPLHSTSNGQVERFHSTLTEIARCKKLSSQTNE FT TVDVILMATIEYNRTIHSVTNKQPIEIVHTSTAEMEATIRKKLEKTQIKQL FT EQGNKKRAEKRYRVGDKVWVKSNKRLGNKLTPLYTEGTVEADLGTSVLIKG FT RVVHKDNLR" FT CDS 5405..6322 FT /product="Gypsy-15_DWil-I_3p" FT /translation="MVISDLQNLILTITLAKANLISPVILNDLDIEEIKNK FT HLTNISVTDILEVASIKAFQNNEILHFLIKYPNPKLACKKINVYPVQRKNI FT VLDFEKGSIVADCGTESYAVTGCDVAVSTTFCRRSPTPNCAQNLISGTPAR FT CGTRSSHLDPLTVVDDGIIIINDVTATITNRVNQTQVVAGTYLATFDDEVT FT INGTLFINNYGALKKKPMVAAISQINITSHREKLSLPLLHELSLTNLQNIG FT QLRSNLVSGSIISSGATLAISAVAIFIIYEVRRRWKLKHQAQLITSLKTAL FT KKSEDVHHLAEGGV" XX SQ Sequence 6323 BP; 2256 A; 1236 C; 1205 G; 1626 T; 0 other; acttaaataa ctggcgccca acggcataag aaaaatattt aatatttaaa ataaatatat 60 taaaataaat aatcgaaact ccgtgggtga aagtctacgt aaaccaaacg cgaatccacg 120 aaacacaacg cgtgtgaata acaacaattc aaaaaatttt ttgtggaaac ataaaaccat 180 taggagagct aaaggcgaaa aactacaaat ccaaacatcg aacaacgtca tcaccgcccg 240 tcacaccgac atctgactga cacttgggga atttgttata actaaaataa ggaggaaata 300 tcttaaattg cagtcaatgt atttcatgta agtgagattt ttgttcttga tttccatttt 360 tttccatttt ttaataaaaa aaggaataaa taataaataa aaaaatatat taaaatttta 420 gttataaaac aaaatttgct ctaataaagt gtaataggaa aaaaattcct tatgataagt 480 gattgcagtg acgaagagtt ggtagagctg tgtaaagaaa taaaatcgag tgctgaaagt 540 gaagagaata aaacaaaaat gaatgcccaa gaagtattta atttaatgca agaagcagtg 600 aaatctgctt taaatgctca gtctcaacaa ttttcaacac atttagaaga aaaaatagga 660 gatttaacca gacaagtgaa tgagttaaaa atgtccacac cggaagtgga agtgtttgaa 720 acagtaaaaa tagtaccagg agttgagtgc aaagaaccac ttgacatagt gagatcagtc 780 cctgaattta gtggtagtca agcagagtat gttgcatggc ggtcagcagc aacatttgct 840 tatgatttgt tccgcccata caatgggagt tcgacacact accaggcagt gggaataatc 900 agaaacaaaa ttaggggtac agcaagttcg actctttcct cttacaacac cgcgttaaac 960 tttgacgcca taatttcaag attagacttt acatatgcag acaaaactcc agctcgtgtt 1020 ataaaacaga aattgggtat acttagacag ggagaactat ctcttttgga gtattacgac 1080 gaaatcgaaa agaccttaac tctcttgaca aacaaaaaac agaaattggg tatacttaga 1140 cagggagaac tatctctttt ggagtattac gacgaaatcg aaaagacctt aactctcttg 1200 acaaacaaaa cgttaatgac gtacgataca tcgacggctt taacccttat gacacatcgc 1260 gaggatgcgt tacattcttt tgtttcggga ctgggaaaag tccctcaaag cacatgtcct 1320 accggcaaag gttgcagacg acaaaaacaa aaattcggtt aacaaccaag aacaatcaaa 1380 aaaatcacag taccatcata gaggtaactc aaattcgggt catacaaaaa acccacattt 1440 cactaaaaaa cagaactacc atggcaaagg gaatcaaagt gccagacaat ataatagctc 1500 aaagtccaag tttcaaggca gaaatccata tcagccgttt gagccaatgg atgttgatcc 1560 atctacatcg cgctttaaac agccaactgc atatcaacaa agaaagcctt tcgcaaattc 1620 gacgcaaagc cgtcaaaatg tgaatcataa tcaagttgtg ccagacggtg acgaagatta 1680 tgaatcaaaa gcagaagatg ctgtaacaca tttcgaggat gagcaatccg acattgaagc 1740 ctgtaatttt ttaggccagc atccttcctc ccgtacatca ttaaacaatt gcacgggaga 1800 acaataaaaa ttctagtaga cactggggca tcaaaaaatt acattaaacc attgtccgaa 1860 ttagaacaca tagtcccagt gcagagtccg tttcaagtta aatctatcca tggctcctct 1920 tctgttacca aaaaatgtaa aatcagcctc ttcggagaaa aatctttttt cttcatttta 1980 tccactttag ctgactttga cgctattata gggttagatc ttttaagaaa agtaaacgcc 2040 actatcaatt ttacgaaaag taaaattttt tttaatgggg gctctgagga tattatgttg 2100 cacaaatgtg ctgacgtaaa cttcattgaa ataaatgatg aagatgttcc tcttgctata 2160 aaggaaaaat ttaaaaaaat gattaagcta agagcttctt ccttcgcgga ccccaaaata 2220 gctttacctt ttaatataaa cacagtggca accattcgaa cacagggaga acctgtattc 2280 tcaagaccat acccataccc attgggggta tctgatttcg ttaacacgga aataaagcaa 2340 cttctaaaag ataaaattat acgaccatcg aaatcgccat ataataatcc catttgggtt 2400 gtagataaaa aaggcgttga tgaactcggc cataaaaaga aaagaatggt cattgatttt 2460 cgaaaactaa accaggtaac tgtgtcggac aaatatccga ttccctctat tacgacaata 2520 ctgtcgaaca tgggaaaatc acagtatttc tcgactttgg acctaaagtc gggattccac 2580 cagatcgagc tggcagaggg ggatagggag aaaaccgcct tctctgtcaa cagtgggaag 2640 tacgaatttt gtcgactccc atttggacta aaaaacgccc caagtatttt tcagagggcc 2700 attgatgatg ttctacgaga acacatcggt aaaatatgtc acgtctacgt ggacgatgtc 2760 attatatttt ccgaaacaaa agaagatcat gttaaacaca ttgacataat tcttcagtgt 2820 cttggtgatg cgggcatgag agtttcccag gaaaaatcga agtttttcaa aaaaagcgtc 2880 gaatacctcg ggtttgttgt tactagggga ggtataaaaa cttcactaga aaaggttcag 2940 gctatacagt ctattaaacc accaacatct ttatacgagc taaggtcatt tttaggccta 3000 tccagctatt acaggtgttt tataaaaaac tatgccgcaa ttgctagacc acttacggaa 3060 attcttaaag gagacaatgg gaaggttagc accaaccact ccagaaaagt caaagtcgaa 3120 cttacagcta cacagctgca aacattcgat agacttaaaa atatcttagc ctctgaagat 3180 gtcatgttaa gatacccgga ttacaaaaag ccatttgacc taactacaga tgcctcgggg 3240 cacggcctgg gagcagtatt gtcgcaggat ggaagaccca taacaatgat ctcgagaaca 3300 ttgcgggaca acgaacctga cctggcgact aatgaaaggg aactcttggc aattgtttgg 3360 gctttgaaaa cacttcgaaa ttacatttac ggagttaggg acttaaacat ttataccgat 3420 catcaacccc tcaccttcgc agtgtccgat cgaaatacaa acgcaaaaat aaaacgatgg 3480 aaatctatta tagatgatca caacgcgaag cttatttata agccagggaa agaaaataaa 3540 gtcgccgacg cgctgtcgag gcaaaatata aatgctttac agtctcaaga atcggacatc 3600 gctacaatac atagcgaaga atcactgacc tacaccattg actcaacgga caaaccggtc 3660 aattgcttta ggaatcaaat tatcattgaa gaagcagatt tcccttcggt aagaacattc 3720 ttactttttc agacaaaaac aagacatgtt attcaagtgt ctgataggag cacattaatg 3780 caaacgttgc aagatgtggt aaatgcggat gtagtcaatg caattcactg cgaacttcca 3840 gtgcttgcct tcattcagca tagtctcgta gaaatgttcc caactacaac ttttaggtat 3900 tcgaagtacc tagtcagcga tatccctgac cgaacggagc aaagggaaat attaaccact 3960 gagcataaca gagctcatcg agcagctcag gaaaatgtta agcaagtact tcgcgattat 4020 ttttttccaa atatgaacag gctagcaaca gagattgctg caaattgcaa aatttgttca 4080 gttgcaaaat atgacagaca ccctaagaaa cacatattag gtgtcacacc cattccttct 4140 tttgcaggag aaattctaca tgtcgacgtc ttttcaacag acaagaagct ctttcttaca 4200 tgtcttgaca aattttccaa gttcgctgta gtacagccca ttgcttcccg agctattgtc 4260 gacataaaag ctccattact ccagctaatt aacctctttc cgaaaattaa aactatttat 4320 tgtgacaacg aacggtcact aaattctgag acaataaaat cattgttgtc aaataatttt 4380 gatattcata ttgcaaatgc acctccattg catagcactt cgaatgggca ggtggagaga 4440 ttccatagca ccctgacgga gatcgcaaga tgcaaaaaac ttagttctca aaccaacgag 4500 acagttgatg ttatactgat ggcaacaata gaatataaca gaaccatcca ttcggtgaca 4560 aacaagcaac cgatagaaat agtccataca tcaaccgcag aaatggaggc gactattcga 4620 aaaaaactcg aaaagacgca gatcaaacaa ttggaacagg ggaataaaaa aagggctgaa 4680 aaaagatatc gagtaggaga caaggtatgg gtgaaatcta acaaacgatt aggtaacaaa 4740 ctgaccccac tctatacaga gggtacggtt gaggcagacc tgggaacgtc tgttctcata 4800 aaagggaggg tggtccataa ggacaacctt cgataaaata tttttcatta ttttctaatt 4860 tttaatattt catttaggat taaacttttg ttgccgatac taacggcttt attggtggct 4920 tcggtgaaat tgacggatta ctcgatgtcc aattacatac ctgtaatgga cggagatgta 4980 actatttggg gactacgcgt accttggaca tacaacaaac gtgacgtcat accaaaccta 5040 cgccgaagaa acgagagtgt tggtggagtc gttcaaacaa gaacatgtga ggaaagtaat 5100 actagcagat atcgatagaa ttagtacact aatagggaca ttaagagttc atcataggta 5160 tgctaggagc ctaaacatat taggcactgc tcttaaagta gttgcaggta cacccgattt 5220 tgatgattgg gaacaagtaa aatttagaca aaatcagtta agcgaacaag gaaatagaca 5280 gattgaaata aatagcaaac tccaagatag attaaataag ctcactaact ctttaaacga 5340 aataagcaaa attgatgaat taaatattga acacttattt gaaactgtcc tggcaaaaaa 5400 tagaatggta atttcagacc ttcaaaattt gatattaact attacacttg ctaaagcaaa 5460 tttaatcagc cccgtaattc ttaatgacct tgatattgag gaaatcaaaa acaaacatct 5520 cactaacatt agtgtaactg atatcctaga ggtggctagc attaaggctt tccaaaataa 5580 cgaaatccta cactttctaa ttaaatatcc taatccaaaa ttagcatgta agaaaattaa 5640 tgtttatccc gttcaaagaa aaaatatagt tttagatttt gaaaagggta gcatagttgc 5700 ggactgtggt acagagtcgt atgcggttac tgggtgtgat gtggcggtga gcacaacatt 5760 ctgcagaaga tcaccaacac caaattgcgc gcagaacctc atatctggga caccggcgag 5820 atgcggcact cgttccagtc atctggatcc actgacagtg gtagatgatg gaataataat 5880 catcaatgat gtcaccgcaa ccatcaccaa cagggtcaat caaacacaag tggttgctgg 5940 cacataccta gcaacattcg atgatgaggt aaccattaac ggcaccctgt tcattaacaa 6000 ctatggggca ttgaagaaga aaccaatggt tgcagctatc tcccaaatca acataacaag 6060 ccatcgggag aaactaagtc tgccacttct acatgagcta agcttgacca acctacaaaa 6120 cataggtcaa cttcgctcca acctggtatc tggttccatc atcagtagtg gtgctacttt 6180 ggccataagt gcggtggcaa tcttcatcat ctacgaggtt cgtcgccgtt ggaaattaaa 6240 acatcaagcc cagctgatta caagccttaa gaccgctttg aagaagtccg aggacgtcca 6300 tcacttagct gagggaggag tta 6323 // ID BEL-44_CQ-LTR repbase; DNA; INV; 272 BP. XX AC AAWU01003709; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-44_CQ_; KW BEL-44_CQ-I; BEL-44_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 242-242 (2011). XX DR Genome; AAWU01003709; Positions 19244 19515. XX SQ Sequence 272 BP; 69 A; 67 C; 70 G; 66 T; 0 other; tgtttgcaga agaaaagtga cccgcccctt gggctcggct ggtctaccaa acgcaaacgt 60 cattgacgcc gttgcgcgct ttttgccgtt gcggcaggag aaagggagga agaagaataa 120 aagttagtca ggaaggaggc ttttcagaga aagacgtctt tttttatttt ctcgtgcgcg 180 aagtcgctaa tccgccggaa agtcttcctt caatcccgcg ccactgtttc cgacgacgga 240 acttggccaa tctacgatac agtccattaa ca 272 // ID RTE-1_CQ repbase; DNA; INV; 3052 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An RTE non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3052 RA Kojima K.K. and Jurka J.; RT "RTE non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 611-611 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 25..3039 FT /product="RTE-1_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MEPESGLRATETARRKTDWRNGTTLGMKTRTRIGTWN FT VLTLAQAGKLETLGREAVRLKLEILGLSEVRWPDSAEHKMPTGQVLLYSGL FT RSENAQRGFLLSPNARSALITWKPINERIIVARFRTRVRNLTFVQVYAPTD FT AADLQEKEEFYSQLSXVVNEIPKGDIRIYAGDFNAKIGSDNTDLERIMGPH FT GXGEMSENGELFTEFCGNQDMVIGGSLFPHRSVHKVTWVSRDGKTENQIDH FT ICISRKWRRSLLDVRNMRSADIASDHHLVVGEIRLRVARVVRQEEKVGCRF FT NVQRLSNPEVXRAFVGELQARALEIPDSTVEEEWQFIKDAFAVTGENTLGM FT LRTQRKEWISDATWDKIEERKQAKAAIVSARTRATKARTRQTYAELEKAVK FT RSCRRDKRAWTDSLADEGEKAANNGDMRLLYDISRRLCGARTNTRIPVKDR FT SGQLITDPADQLKRWFEHFEQLLQISHPRPSPQYHPQNVRRINRVNSSPPT FT LREIEEAVKGMKSGKAPGIDRISAEMLKADAHLSARMLHRLFSKIWDTATF FT PVDWMQGILVKVPKKGDLTNCDNWRGITLLCIALKVLCKVILCRIKSKIDE FT SLRRQQAGFRSGRSCVDHIVTLRVILEQVNEFQDSLHLVFVDYEKAFDRLD FT HENLWEALGRKGVPERIINLIKAQYEAFVCRVVHNGVLSDPIRVVAGVRQG FT CILSPLLFLIVIDEVLAGALDRVQNRGLLWQPIRMEHLNDLDFADDIVLFS FT QRGNDMQNKLDDLAACSSAAGLKVNVSKTKSMAVNTERAQGFTVAGENVEK FT VDTFQYLGSQIAPDGGTKLDISTRIKKARSAFAGLRTLWRSNQISLRTKLR FT IFNSNVKSVLLYGSETWCVSSQNTSKLQVFVNRCLRNILRAWWPRNWVSNA FT ELHRRCQQRPIHVEIRERKWRWIGHTLRKDAHEICREALDWNPQGQRGRGR FT PRGSWRRSLHQEISAVDGTLSWRQVKAKAENRQQWKSFNAALCSTGEQXA" XX SQ Sequence 3052 BP; 779 A; 753 C; 908 G; 605 T; 7 other; tagggctcct taggctagcc acttatggag cccgaaagtg gactgagggc tactgaaacc 60 gcaagaagga aaacagactg gaggaatgga acaacccttg gcatgaaaac acggacacga 120 atcggaacat ggaatgtact gacgcttgcc caggcgggta agttggaaac gctagggagg 180 gaggcagttc ggctsaagct agaaatcttg ggtctgagcg aagtccgttg gccggactca 240 gcagagcaca agatgcctac cgggcaggtt ctgctgtact ctggcttgcg aagcgaaaac 300 gctcaacgtg gcttcctgtt gagcccaaat gcacgttcgg cgctcattac gtggaagcca 360 ataaacgagc ggataatcgt ggctagattc aggacgcggg tcagaaacct gaccttcgtt 420 caagtttacg cgccgacgga tgctgccgac ttgcaggaaa aggaggagtt ttacagtcaa 480 ctgagcgkag ttgtcaacga gatccckaag ggtgacatcc gaatctacgc aggcgacttc 540 aacgcgaaga taggctctga caacacagac ctggagcgca tcatggggcc ccatggtcwt 600 ggagaaatga gcgagaacgg tgagctgttt acagagttct gtggtaacca agacatggtg 660 attgggggat cgctctttcc ccatcgctca gtgcacaaag tcacgtgggt gtcacgtgat 720 ggaaaaacgg agaatcagat cgaccacatc tgcatcagcc gcaagtggag aagaagcctg 780 cttgatgtgc ggaatatgcg cagcgccgac attgcatccg accaccacct cgttgtcggc 840 gaaatacgac tacgggtagc gcgtgtcgta cggcaagagg agaaagtcgg ttgtaggttc 900 aacgtgcaac gtctgtcgaa tcccgaagta sscagggctt ttgtcggcga gctccaagcc 960 cgagccttgg agattccgga cagcactgtt gaagaggagt ggcagttcat caaggacgcc 1020 ttcgctgtta ccggcgagaa tactctggga atgctgcgaa ctcagaggaa ggagtggatt 1080 tcggatgcca cctgggataa gatagaggag aggaagcagg ctaaggctgc cattgttagc 1140 gcaaggacga gagcgacgaa ggcacgtacc cgccaaacgt acgccgaact ggaaaaggct 1200 gttaagcgct cttgcaggcg ggacaagaga gcctggacgg attccctcgc cgacgaggga 1260 gagaaagctg ctaacaatgg cgacatgcgc ctcctctacg acatttcgcg acgtttgtgt 1320 ggtgccagga cgaatacgcg cattccggtg aaagacagga gcggtcagct gattaccgat 1380 ccagctgatc agcttaaacg atggttcgag cattttgagc agctgctaca aatctcgcat 1440 ccacgtccaa gccctcagta ccatccgcaa aatgttaggc ggatcaaccg ggtgaactca 1500 agtccaccta cgctgaggga gattgaggaa gccgtgaaag gtatgaagtc tggaaaagcg 1560 ccagggatag atcggatctc cgcagaaatg ctcaaagctg acgctcattt gtcagctcgg 1620 atgctgcatc ggcttttcag caaaatctgg gacaccgcga cttttccggt cgactggatg 1680 cagggcattt tggtgaaggt tcccaaaaag ggtgacctca ccaactgcga caactggcgt 1740 ggcatcacgc tgctgtgcat agctctgaag gtcctatgca aagtgatctt gtgcaggatc 1800 aaatcgaaga tcgacgagtc tcttcgacgg cagcaagcag ggttccgcag tggccgatca 1860 tgcgtggatc acattgtgac actccgtgtc atcctcgaac aggtcaacga attccaggat 1920 tctctccatc tggtcttcgt tgattatgaa aaagcgtttg accgtctcga ccacgaaaac 1980 ctgtgggaag cccttgggcg caagggagtc ccagagagga tcatcaacct catcaaggca 2040 cagtacgagg ctttcgtttg ccgtgttgtg cataacggag tcttgtcgga cccaatccgg 2100 gtggtagctg gggtgaggca aggatgcatc ctgtcgccgt tactgttcct catcgtcatc 2160 gatgaggtgt tagctggagc gttggaccgt gtgcaaaacc gtgggctgct ttggcagccg 2220 atcaggatgg agcatctcaa cgacctcgac tttgcggatg acattgtgct gttctcgcag 2280 agaggaaacg acatgcagaa caagctggac gacctagctg catgctcctc ggcggcgggt 2340 ctgaaagtca atgtctccaa aaccaagtcg atggcagtga acacggaacg cgctcaaggc 2400 ttcacggtag ctggtgaaaa cgttgagaag gtcgacactt tccagtacct tggtagccag 2460 attgcgcctg atggtggtac caagctcgac atatccacgc ggatcaagaa ggccagaagt 2520 gctttcgcgg gtctgcgcac cttgtggcga tcaaaccaaa tcagcttgcg cacaaaacta 2580 cggatcttca actcgaacgt caagtcggtg ttgctctacg ggagtgagac ctggtgcgta 2640 tcgagccaga acacgtcgaa gctacaagtc tttgtgaacc gctgcctacg caatatcctt 2700 cgcgcctggt ggccccgaaa ctgggtatcg aatgcggagt tgcatcggcg gtgtcagcag 2760 aggcccattc acgttgagat ccgagagcgc aagtggagat ggatcggcca cacgctacgg 2820 aaggatgcgc acgagatctg ccgagaagcg ctggactgga accctcaagg acaacgcggt 2880 cggggcagac caagaggctc atggcggaga agcctccacc aagaaatcag tgctgttgac 2940 gggacgttaa gctggcgcca agtgaaggcg aaagcagaga atcgtcaaca gtggaaatcc 3000 ttcaacgcgg ccctatgctc cactggggag caggamgcat aagtaagtaa gt 3052 // ID BEL-603_AA-LTR repbase; DNA; INV; 487 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-603_AA_; KW Pao_Bel_Ele73; BEL-603_AA-I; BEL-603_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-487 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 487 BP; 183 A; 93 C; 74 G; 137 T; 0 other; tgtgcgcagc ggtagaagac ccctcgtcac cggttaacac agccaactct cggttggcaa 60 atcggtagac ttcaaccgtt tcagctatga cagactagcc ttttcatgca atctgacatc 120 gtcaattttg cttatcacta tatctatacc aaacttcaca aaacacgtta aaaagttgaa 180 tttcctaaag tttatttctt gaatttgtta gtttgcctag taagtattaa aataaattaa 240 aaattgaaca taaaaacact aattaaataa atagtttaca aggtcacact tatctacaaa 300 aaaaagcttg ttatacgaag acgaattgtg aacgaagaag actagttaat gtaagtaacc 360 taaaaacata tgaatctatg aactaaaaac taaacaaata ataaaattta cagcttaaag 420 cttactccga actaaaagga cgagtttgac tttgagaggt ccgaacgact tcgccgtctc 480 cgtaaca 487 // ID Copia-5_AA-LTR repbase; DNA; INV; 189 BP. XX AC supercont1.211; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_AA_; KW Copia-5_AA-I; Copia-5_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.211; Positions 1838294 1838482. XX SQ Sequence 189 BP; 68 A; 32 C; 35 G; 54 T; 0 other; tgtagaagga gttgagtagg cgacgaaaaa cttgtacata gcaaccaagt cttgttgatc 60 aagttaacac acacaatgtc atgtatgagt ataaatagag gagagctcaa taaagttatt 120 cattcgttgt ttgaccattg aacagttaac aagttgtttc aattcgatct acaacatcta 180 aacctacca 189 // ID Mariner-23_SM repbase; DNA; INV; 2287 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-23_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2287 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1872-1872 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 320..1234 FT /product="Mariner-23_SM_1p" FT /translation="MAEKRTGEPQAHDGKQSRKSLTLEMKMNVIRRIENGE FT RQSEVSSRLGLAGSTIRTILKNTDNIKKSICTTNSLSAKKVTRKRNAQVEE FT MEKLLTFWIDDQTQRNMPLSQGIIMEKAKILYLELKNRTEECSTAESFVAS FT RGWFEKFKLRANLHSISLSGESASADKESAKVFKKSLDDIIKQKNYPRQLI FT FNFDETGLFWKKMPSKTFISKEEKSASGFKAAKDRLTLLLGGNSTGDFRMK FT PLLIYHSENPRAMKGISKQSLPVIWMANRKSWMTAILFKKWYQEHFCPAVK FT SYCERNGLEKKHC" XX SQ Sequence 2287 BP; 853 A; 349 C; 394 G; 691 T; 0 other; caggcatacc tcggttaacg ctacctcttt tagcgctgtt tcgctacagc gctctttttt 60 tcgaaaaaaa aattatttta actttaaaat attttcaaaa aggctttctt tttcaaattt 120 cgcagaaaaa ttaatttata attgtggaaa tccttctttt tgaaaaaaag tcaaaaataa 180 ctataattcg gggaatccca attgtgaaaa tttgcattaa gttgttgatt cgaaataatg 240 atatggtttg atatagcacc agtaatttta ttaaataaat aaaattgatt aataaaaaaa 300 atcaattgct tataaaaata tggctgagaa aagaacgggt gaaccacagg cccacgatgg 360 taaacaaagt agaaaaagct tgacgttgga gatgaaaatg aatgttatta ggcggattga 420 gaacggtgaa cgacaatccg aagtgtcttc cagactgggt ttggccggat caaccattag 480 aaccattctt aaaaataccg ataatattaa gaaatccatt tgtacaacaa attcattatc 540 tgccaagaaa gtaacacgca aaagaaatgc tcaagtggaa gaaatggaga aattattaac 600 attttggatt gatgatcaaa cacagagaaa catgccgctc agccagggca tcataatgga 660 aaaggctaag atattatatt tggagcttaa aaatcgaact gaagaatgct ccacagccga 720 gagttttgtt gcaagccgag gctggtttga aaagtttaaa ctacgtgcaa acttacacag 780 cattagtctt tccggagaat ctgccagtgc tgataaagaa tctgcaaagg tatttaagaa 840 gagtttggat gatattataa aacaaaaaaa ttatcctcgg cagcttattt tcaattttga 900 tgagaccggt ttattttgga agaaaatgcc atcaaaaaca ttcatctcca aagaagaaaa 960 atctgcatca ggttttaaag ccgcgaaaga tcggttaacc ttattattag gtggaaattc 1020 aactggtgat ttcaggatga aaccgctgct aatttatcat tctgaaaatc ccagggcgat 1080 gaaagggatt tctaaacaat cactgcctgt catttggatg gcaaaccgaa aatcatggat 1140 gacagcaatt ttatttaaaa aatggtatca agagcatttc tgtcctgctg taaaatcgta 1200 ttgcgaaaga aatggattgg aaaaaaagca ttgctgattc tcgacaatgc tcctggacac 1260 cctcaaaatt tatcagaatt tcaaacttgt ttgcctgtag aaattatttt tacacctcca 1320 aataccacat caattttgca accgatggac caaggagtta ttgcaacttt caaggcatat 1380 tatattcgaa aaacctttaa gcaacttttt caagcagttg acagcaatca atgttccatc 1440 aaagaatttt ggaagaattt taatatcatg aatgccatag aaaatatcgg ttcatcatgg 1500 caagaagtac aacaaagttg tctcaaagga tgttgggact gcttgtttat taaaacgatt 1560 gaagatgaaa taattgatga actaccacaa gttattactg aaatttcgga aatagcaaat 1620 tatattggat ttgcggaggt tgatgaggga aacatccaag aactaattga aagtgatgac 1680 catggtatct ccaatgagga tcttattgct agtttatcac aaattcagcc agaaattgcg 1740 gacaatacgt tagaaacaga agatataaca gaaaatattt taacgaaaaa atgtttggaa 1800 cataatctag ggataatcca aaatgcctta gaagatttaa gtgaaaatga tccgaatttt 1860 gaaaggtcct gcgctataaa acgaggagta atgaacactt tatcacccta catggaaatt 1920 ttaaaagaaa aaagaaaact atttaagcaa ggaaccttag acaagttttt taaaaaatca 1980 taattaaata tgaatttaaa cctttatttt gttaaatgta cttttatatt tttttgtttt 2040 tattggaatt atttttttat attgaatcct actttatttt ctaattttta aatgaagaaa 2100 aaatgaagca aaaaaatcat attaaaataa aaataccatc ttttttattt ttgccttcaa 2160 agtaccaaaa aaaattaatt aggaacgtaa ccccatattt taatgtaaat aatacctctt 2220 ttaacgctgt ttccgttaac gctcaatttt caggaacgca ttatgagcgc taaacgaggt 2280 atgcctg 2287 // ID Gypsy-26_OD-LTR repbase; DNA; INV; 379 BP. XX AC CABV01003036; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_OD_; KW Gypsy-26_OD-I; Gypsy-26_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-379 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003036; Positions 26796 27174. XX SQ Sequence 379 BP; 105 A; 93 C; 56 G; 125 T; 0 other; tgatacgact caagaaaaaa atgaatttcc tactgtatat ttaacccatc cgttaatttc 60 cgctttttat tataccacgt ttatttctca ttttctgaat tcaaaaaagt tttgaattgg 120 cgaaacttct atttccgacc catcaaacag ttattccgcc attttctgcc tttacttgat 180 ccgtaagaac tcaggcgcag cgggaattga cacgccctac tgctctgatt gacaactgac 240 ctgtttctct gatttatcac cagctgtgtg accgggatcc agcccaatat aaggggcctc 300 gaaactgaac tctttacact ttacaataca ccatcactca gaatccgaat ccgtttttga 360 tattttctga atcgtatca 379 // ID CR1_Ele24 repbase; DNA; INV; 5131 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele24. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5131 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5131 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 20 sequences with >91% identity, and ~98% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 354..1184 FT /product="CR1_Ele24_1p" FT /translation="MDSKCYHCAKQVKTIEFLQCGLCNQAVHIKCVGLKRQ FT DMEFVNEHKNVIWFCDKCIDQLQHVKDNPIKSTADVVSEISDSIKVSLDEL FT KTELRETKELTKSLAEKKQPVESSGFYRNRPWPSIKRTREAATRDTPKSRP FT DVKLLSGTKSVEADNIIVETIAKPAEKFWIYLSRIARHVTEADISELVKTC FT LQIKDEVVVRKLVRKDADLKQFAFISFKVGINKDLRDTALDPSIWPKGILF FT REFENLQSERDFWGPTKVPRIDDRTPAMETSPNLLQ" FT CDS 1088..5062 FT /product="CR1_Ele24_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="EPAIRTGFLGTYESSQNRRQDPSDGNLSKSTAINSHS FT SLLTTERMSKSTMDAPKPPDTVEPCRCSSIAACSSRQSRSVPVFGTGDGAI FT HLACPGKYYDISSFSFPKSPPACRNSSRRYRTSPTSPFNSRAACSHPQGTE FT LDRRAEHNPRCSVDGFFETCSSSQTAMSAEHTRLVSVTSNSGLILKQPDDA FT SSTPGRTVAGIMEGLHPPSTVAPQQPPLISRPGPVSGTGGGLFRAVNTGKF FT DLSVCRSILDPFTSFSMPFAPQRSSSSIGRILGRTTSSNMEAPEPPTAVQL FT FQPTTDSRPSPVRGCGEGVFQLTVNGKYAPSSNNSLADVVAASSSSSSEQR FT RPSTSRGRFQAPSRETNASGPVLATNPSPIVKLPTSVYYQNVRGLRTKTAK FT LRLALSSCDYDILAFTETWLRPDISNAEFSSGYTIFRCDRSALSSNFLRGG FT GVLIAVKKSFQCELVICTGCEHIEQITVRIKLQGCWLIICTIYLPPNSGPD FT LYSACSSVIQQITESMSESDIFLLLGDFNLPNLRWQLDEDFNGYLPSNAST FT EQELALVEATIANGLQQMNNIVNANGNLLDLIFVNLPEHLDLVEPPLPLLA FT VDTHHMPFVVLLSTNFPDVSDDQSEDIGFNFLRCNFDVLNAALSSISWEIH FT PDSSVDYALTTFYNKLKQVILEYVPRKHRATNSVYNKPWWTTELRNLRNVL FT RKSRTRYFSSRSAANKSHLRDVEIRYNTLLLATHNDYIARIQSYAKHNPSS FT FWNFIKRQRSNNCIPANVEYSGHSASTNSEAADLFAAFFESVFTRVSPVPR FT HDRFQNIPSFDISIPLPQFSIEEVSQALNDLDVKKGSGDDNLPPLLFKRCA FT STLAGPITDVFNRSLAERKFPCAWKNAIIVPIHKSGSRNRVENYRGITLLC FT CLSKVLEKLMYNILYNAALPIISNSQHGFMRNRSTTTNLMCYVTALSREVE FT AKQQVDSIYIDFAKAFDTVPHLLTVEKMSHLGFPGWTTQWLFSYLSNRTAY FT VTVNSASSSAFEITSGVPQGSVLGPLIFIIFVNDLYQLLSSCNLSFADDLK FT LYRVIRSALDCIAMQADVDAIIVWCSDNGMRINSNKCKVISFTRRNHQYRH FT EYTAGSAVIERVQSICDLGVTIDSKIRFNEHVAIITAKAWSALGFVRRHAS FT DFTDIYALKSLFCALVRSILEYAAPVWSPYHVTQTLRIERVQKKFVHFALR FT HLPWIDPTNLPHYPDRCRLINLETLSARRTQLQQLFIFDTICGNLDCPTLL FT GEVSFNVPPRRFRSSSFLTIPYHRTAYGQNNSFSLCVKAFNDVSDRFDFNL FT SKVMFKTRIKNLG" XX SQ Sequence 5131 BP; 1356 A; 1343 C; 1086 G; 1344 T; 2 other; gtttgcggac ctctgcttcg tactgtagtc gcgttatcat cgctattgtt ttgatgtgta 60 aawtccccta gtgckgtttc gaaacaaatc gtgaagcgta cttttctcta gttgtttata 120 ctccatctgc tagttcaact acctgcgagt ggatgtttct cgtgtaaagt gcttatttag 180 tgccaaaaac tatatcgcta gtgaatcgtt tgtcctaaat aaagtgttga tcttctcctc 240 acgtgacgtc atcatcatct gaattgaatc cagcgccatc tattggtcgg tgacacatat 300 ccagcaaagc tttcttttcg gatcccgtcc ctctatcatc taacaccagc agtatggatt 360 caaaatgtta tcattgcgcc aagcaggtga aaacaatcga gtttttacaa tgcggtcttt 420 gcaaccaagc agttcacatc aaatgcgtcg gtctcaaaag gcaggatatg gaatttgtga 480 acgagcacaa aaatgtgata tggttctgcg acaaatgcat tgatcagctg caacatgtaa 540 aggacaatcc cataaagtcg actgctgatg tcgtcagtga aatttccgat tcaatcaagg 600 tctcactaga cgaattgaag actgaacttc gtgaaaccaa ggaactaacc aagtcacttg 660 ctgagaagaa acaacctgtt gaatcttctg ggttctatcg aaaccgacca tggccaagta 720 taaaacgaac tcgtgaggct gctacacggg atacaccaaa atctcgtcca gatgtcaagc 780 tgctcagcgg cacgaagtca gttgaggcgg ataatattat tgtcgagacc attgcgaaac 840 cagctgagaa attctggata tacttgtcta ggattgctcg acatgtaacc gaagcggata 900 tttcagagct agtcaaaacc tgtctacaga tcaaagacga ggttgttgtt cgaaagctcg 960 tgcgcaagga cgccgatttg aaacaatttg cgttcatctc gttcaaagtt ggcattaaca 1020 aagacctgag agatactgct ctggatccat ccatctggcc aaagggtatt ttatttagag 1080 aatttgagaa cctgcaatcc gaacgggatt tttggggacc tacgaaagtt cccagaatcg 1140 acgacaggac cccagcgatg gaaacctctc caaatctact gcaataaatt cacactccag 1200 tttgctaaca acggaacgca tgtcaaaaag cacgatggat gcccctaagc cacccgacac 1260 agtcgagccc tgcagatgtt ccagtattgc tgcttgttcc agccgtcaaa gtcgttccgt 1320 tcctgtgttc gggaccggcg acggggctat ccaccttgct tgtccaggca agtattatga 1380 cattagttct ttttcgttcc ctaaatcccc tcccgcttgc agaaattcct cacgcagata 1440 tagaacgagc ccaacaagtc ccttcaacag tcgtgcagca tgcagccacc cacagggtac 1500 tgaacttgat agaagagctg agcacaatcc gcgatgcagc gtagacggct tctttgagac 1560 ctgttcatcc agccagactg caatgagtgc cgaacacaca cgactagtgt cagtcacatc 1620 caattctggc cttatcctca aacaaccgga cgatgcctca tcgacaccgg gacgcactgt 1680 agctggtatt atggaaggcc ttcacccacc cagcacagtc gcgcctcagc agccaccgct 1740 catcagtcgt cccggccctg tgtctgggac tgggggaggg ctcttccgag ccgttaacac 1800 aggcaagttt gatttaagcg tatgccgttc gattcttgat ccgttcacca gtttcagcat 1860 gccatttgct ccacaacggt cttcatcttc aatcggccgt atactgggac gcacaacatc 1920 cagtaacatg gaagccccag aaccccccac agcagtccag cttttccagc caacgaccga 1980 cagtcgtccc agtcctgtac gtgggtgtgg tgaaggggtc ttccaactca cagttaatgg 2040 caagtatgct ccctcatcga acaattcact cgctgatgtt gttgccgctt ctagctcatc 2100 gtcatccgag cagcgtcggc cctctacatc gcgaggtcgg tttcaagcac cctcacgcga 2160 aacgaacgcg tctgggccag tcctagcaac gaatccatcc ccgatagtga agcttcctac 2220 gtcagtctac tatcaaaacg tcaggggtct acgcaccaag actgctaagc tgcgtttggc 2280 actatcaagc tgtgactacg acatcctcgc cttcaccgaa acatggctta gaccggacat 2340 cagtaacgct gagttttcat ctggttacac catattccgt tgcgaccgca gtgcattatc 2400 cagtaatttt cttcgaggtg gaggagttct tatcgccgtc aagaaatcat tccagtgcga 2460 gctggtaata tgcactggtt gcgagcatat cgagcagatt actgttcgca taaaactgca 2520 aggttgttgg ctgatcatct gtacaatata ccttccgcca aattctggac ctgatctgta 2580 ctccgcttgc tcatcagtca ttcagcaaat tacggagtcc atgtcggagt cagatatttt 2640 cctgctgcta ggcgatttca atcttccaaa tctccgctgg caattggacg aagatttcaa 2700 cggctacctt ccttcaaacg cttccactga gcaagagctg gcactcgtcg aggccactat 2760 tgcaaacggt ctccagcaga tgaacaacat cgtgaacgcc aacggtaatc tactagatct 2820 aatctttgtt aatttaccgg agcacctcga tctagtcgag ccaccactcc cacttttagc 2880 agtggacaca catcatatgc cattcgtcgt gcttctcagc acaaactttc ccgatgtctc 2940 tgacgaccaa tctgaagaca ttggctttaa cttcctgcgc tgcaacttcg acgttttaaa 3000 tgctgctcta tcctctatca gctgggagat ccatccagac agttcggtgg attacgctct 3060 cacgacattc tacaacaaac taaagcaggt aatcctagaa tatgttcctc gtaagcatcg 3120 cgctactaac tccgtctaca ataagccatg gtggaccaca gagctacgaa accttcgcaa 3180 tgtcctccga aagagcagga ccagatactt ctcgtccaga tctgctgcca acaaaagtca 3240 tctccgtgat gtagagatcc gttacaacac gctgcttctt gccacccaca atgactatat 3300 cgctcgaatc caatcctacg caaaacataa cccctcgtcc ttctggaact tcatcaaacg 3360 ccagcgctct aacaactgca ttcctgcaaa cgtagagtat tcaggacatt cagccagcac 3420 aaattctgaa gcagccgact tgttcgcagc gttctttgaa agcgtcttta ctcgagtctc 3480 tccggtgccc cgccatgaca gattccagaa cataccctcg ttcgacatta gcatcccatt 3540 acctcaattt tcaatagaag aagtttcgca ggccttgaac gacctcgacg tcaagaaggg 3600 ttctggagac gacaatcttc ccccgttgtt gtttaaacga tgcgcgtcca cactcgcagg 3660 tccaatcacg gatgtcttca atcgatctct cgccgagagg aaatttcctt gcgcgtggaa 3720 aaatgctatt attgtaccaa tccacaaatc tggcagccgt aaccgtgttg aaaactaccg 3780 aggtatcacg ctgttatgct gcttgagtaa ggtgctcgag aagctcatgt acaacatcct 3840 gtacaacgcc gcattgccga tcatatcgaa tagtcagcac gggttcatga ggaatagatc 3900 gactacaacg aacctcatgt gttatgtcac cgcattatct cgcgaggtgg aagcgaaaca 3960 gcaggtagat tcaatctaca tagattttgc gaaggctttc gacactgtac cgcatctttt 4020 aaccgtcgaa aagatgtcgc acctcggatt tccaggatgg acaactcaat ggctgttttc 4080 gtatctctcc aatcgaacgg catacgtgac cgtcaactcc gcaagctcta gcgcgttcga 4140 aataacttcg ggagtaccgc aaggaagcgt cctcggcccg ctgatcttca taatttttgt 4200 gaacgatctg tatcagctgc tctcgtcttg caatttatcc ttcgcagacg atcttaagct 4260 atatcgggta attcgctctg ctttggactg catagctatg caagcggatg tcgacgccat 4320 tattgtgtgg tgcagtgaca acggaatgcg aatcaatagc aacaaatgca aagtgatatc 4380 tttcactcgc cgcaatcatc aatacaggca tgaatatacc gcagggtctg ccgtaattga 4440 acgcgttcaa tcaatctgcg atctcggagt tacaatcgac tcaaaaatca gatttaacga 4500 acacgtcgca attatcactg ctaaggcttg gtcagcgttg ggattcgtta gacgacacgc 4560 ctcggacttc accgacatct acgcgctgaa atccctattt tgtgcactag tccggagcat 4620 actagagtac gcagctcccg tgtggtcacc ataccatgtg acgcaaacgt tgaggattga 4680 acgtgttcag aagaaatttg tgcattttgc attgcgccac ctaccgtgga ttgatcctac 4740 aaatcttcct cattacccag accgctgccg actcatcaat ttggaaactc tgtcggcaag 4800 gaggacccag ctgcagcaat tgtttatctt tgacaccata tgtggcaacc tcgactgtcc 4860 tactcttctc ggtgaagtct ctttcaacgt tcctccacgg cgattccgta gttcctcttt 4920 cttgacaatt ccttatcaca ggactgctta tggtcaaaat aactctttta gtttgtgtgt 4980 taaagctttt aatgacgtta gtgatcggtt tgattttaat ttgtcgaaag tgatgttcaa 5040 aactaggatt aaaaatttag gttaatcaaa ttttaattaa ctttaagaca gtctgtacgt 5100 cgtagacgaa gacggtggaa tataataata a 5131 // ID Copia-35_DPu-LTR repbase; DNA; INV; 354 BP. XX AC ACJG01003352; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-35_DPu_; KW Copia-35_DPu-I; Copia-35_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-354 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01003352; Positions 8 361. XX SQ Sequence 354 BP; 97 A; 64 C; 67 G; 126 T; 0 other; tgttgagttt tattttctat aaatcaaacc aaataaacat tcgcgccgac ggagatgctg 60 tttagcatct catgtcaaat aaacatgcgc gccgacggag atactgttta gtatctgatg 120 tcagtcttcg ttgactttca ttcgtctgct gttcgggttc tctgtgttca actgtcatct 180 ggtaaagagg taaagtctat tgcttgtaaa ataattgtaa gaaattatct tgtgtgaatt 240 attgcactgt attgcttgct atatgtccca gcagtgaaaa cctcccaggt tacaatacag 300 tattggatta atgagactaa ctctttctgt gtacttgtat atttaatctc aaca 354 // ID Copia21-NVi_LTR repbase; DNA; INV; 180 BP. XX AC DS265683; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 02-JAN-2008 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia21-NV; KW Copia21-NVi_I; Copia21-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-180 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1212-1212 (2007). XX DR Genome; DS265683; Positions 397635 397814. XX SQ Sequence 180 BP; 37 A; 52 C; 44 G; 47 T; 0 other; tgttggcaat cgccgagcct cccagggacc ttagcgccat ctgcacacaa aagtttacaa 60 cggagagcgt cgtgaggcgc gctctcactt gctcgctatc acttgagtcg tgcgcaagtc 120 cgcccgagtg ctgaagagta tttttctatt actgagttcc ttcttagcct agcttagcca 180 // ID Sola1-1_Dpulex repbase; DNA; INV; 744 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; KW Sola1-1_Dpulex. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-744 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC incomplete. XX SQ Sequence 744 BP; 154 A; 212 C; 171 G; 207 T; 0 other; tacctcaccg ctgcttcggc cttcgccttc cgaacgtccg ccattccacg gacgctcagc 60 atcttcgccg atgggtggga cagcaagaag ctcagcttcc cgtccgtgtt caacaacccc 120 aagtacttcg acggactgcc ttcatttcac tgccgcgtca tcggggtgct tgttcacgga 180 tggtactatt tctgtcatgt cgatccgcac aaccgccatg gcaccaacat caacgtcgtc 240 gccatcatgt acgccatcac ccgtatgcgg gaacggtttg gcggattcct cccccccacc 300 ctcaacctcc aactcgacaa cgccgcagac aacaagtgca aatacttttt gcacttttgt 360 tatttccttg tcttgagcgg tgcgtttcgc aaggtcaaac tcgggtactg taaaccgggc 420 cacacgcatt cggatggcga cgggcgagct actggccctg gtcagatggc ccggcgcgag 480 ggggtggagg attttgagga tgtcctccgc atcgttggct caagtgagtg tgtcttgtat 540 aaggaatgtg aactgctttg ttcgaatttc tgcagtctat ttttgtgtcc taaagttgac 600 ttgtgcctcg tccgataaac ttcccaaaat tgtatcatgc caaatcattt ctgtagtcta 660 tttttgtttc ctaaagttga catgtttctc gtccgataaa cttcccaaaa tggcatcatg 720 ccaaatcatg ccagattgtc tttt 744 // ID BEL-19_DPu-I repbase; DNA; INV; 5565 BP. XX AC ACJG01001375; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-19_DPu_; KW BEL-19_DPu-LTR; BEL-19_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5565 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (09-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01001375; Positions 9342 3778. XX CC Positions [4485-5069] - Integrase core CC 'GTATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 541..3813 FT /product="BEL-19_DPu-I_1p" FT /translation="MPKTVKGANAENILTEETDEPFVDVDLARKKRRSIRS FT QLTSTSRALTEDIKKAGSRGAMIGLVQHLQDLLKQAATIHTELLIAEELEE FT NEKQEEIHLRYVQEAGEIIAKVDQHLDSRKNEAPSVIQPRCGGRRQAKREE FT ELQAAQQRADDARSQAEEAHSRAEELRNQQEEAEEALLNLQLGDDDADDHL FT TSVSQHSHVSQLATNWKLRQRQENVAPDAWIDSYAAGKLKPTNPNNSRSSI FT KADLEPYSGRALDWFAWVDLFRALVHDTAKAPGEKLALLKRHLRGDCLDVV FT HGLGGGEGAYIEALVRLKKSCGRRDVMRAAHLQAIEKLELKNDPAVFKRYA FT EKIRTHFFDLSRIGETATADLIEKVCLRLQLNDGRKGGLETRSLNAFGVWL FT CDRAAAYQNAYSTAADQLQSTPKPVRFAARTNQVSSKQHSGPSTHSKPTTR FT NFCFKCEGEHKLETCGDFKNLYVGNRVAFCARHRLCFGCLKAKHSVRFCSQ FT RKPCSQPDCTHFHHPLLHEATPSSVDTSVTARPSILHIESRKSGRVAMGMM FT RLQVRKADGNWTLANVFVDEGSDSTLMRQGFANYLNLRGARHLLTIIGAGN FT VINRYPSQRITFDMRDTNGETISIPCSTLPSVVSDTPVTDWPSLKPRWKHL FT SDLPVTATGGRVDILIGTDLSHLVAALESRVGGDYEPTATLNRFGWLVRGV FT VQEGTMVTAVRAHTVTGSFQLAQLTEEMKQFCETENYGTEYQLPGMSADEK FT RAVSILDDGTRKLDLGYEVPITWREGEPDLINNRRMVEDRFRSLLRRFERD FT PQFEADYRAAMKKTLDQGYASRLAGPTADEARYFLAHHGVYKGPKLRVVFD FT AAAPFKGKCLNDAILSGPALQPSLPAVLIQFREGEVAWASDVEAMFSRFRL FT RPADANFFCFLWKEPDSPDYIVCRMDRLPFGATCSPFIAIHTSRRATIDAG FT AREKIVEAVKGKLYVDDYLSSSSSVEKGLEEAVAVERVLSSADLHLQGWIS FT NSPEFTQAIMKDKPAKPVVNSPGCYNLSSKEFDKNARARVAYQDGRYGIPS FT GQSGQHRVHPRRNHSQGRQRVRSIGNSRTFNREG" FT CDS 4458..5537 FT /product="BEL-19_DPu-I_2p" FT /translation="MGDLPSFRLDSYSPPFAHVAVDYFGPLETSPGRNRVL FT KRYGVLFTCLVTRAVHLEMAESLSTEDFLLVFRRFISLYTKPLTVHSDNGT FT NFVGAENVLNSLLHDMPKDPSFQRFNKEKNIDWKFQPPRAPHFGGAHESLV FT RSTKKALYRALEIEKEGLRYPTDEMLRTLLAEIGGMLNARPLTYVSTDPSD FT FRPLTPNDFLNRPPACDLPPGSFSDALPRERFRYVQRTAQLFWDLWTKIYL FT PSLVPRKKWKVEQPNLAVGDVVLMIDPNQPRGQWKIGHIIQTFPGEDGLVR FT VVKIQTETGVYSRAIHRLCLLERASTICAPTAADPAIDKNVPIRAARCNHL FT RLLYSNRPRLDSSITSV" XX SQ Sequence 5565 BP; 1446 A; 1506 C; 1366 G; 1247 T; 0 other; tttggtcctt cgaaccggag ctctgtgtaa tcttgagttg ctgactcaac agtcttgtca 60 gagagacacc tcttcatcac ctagtcgatt actgctcaca cagatctctt tgctgtgcag 120 catcaagtca tcttctgtcg agctctcctt gctgtgcagc atcaagtcat cttctgtcga 180 gctctccctg tgcagcatca agtcatcttc tgtcgagctc tccttgctgt gcagcatcaa 240 gtcatcttct gtcgagctcc ttgctgtgca gcatcaagtc atcctctgtc gagctcttct 300 cgtacatgct gcctcattgc tgtgcagtat taaagtcatc ttctgtcgag ctctcctgta 360 cccgctggcc tcactgctgt gcagcattca aggtcatctt ccgtcgagct tcccctgtac 420 acgcagacct catcaccgtg cagtcttcaa gacatccgtt ggtcccattc atgtgcaggc 480 tctagcacgt ccgtctcatc taccattttg aaatcgtctg agtcctcgac ttatttcaac 540 atgccaaaaa cggtgaaagg agctaacgct gaaaacattt tgaccgaaga gaccgatgag 600 cctttcgtag acgtggatct cgcaaggaag aaaaggcgat ccatacgttc acagcttact 660 agcaccagcc gcgcactcac cgaggacatc aaaaaagctg gttcaagagg agcaatgatt 720 ggcctcgtac aacatctcca agatctactg aaacaagccg cgaccatcca tacagaactg 780 ctgatcgcgg aagaactaga agaaaacgaa aaacaggaag aaattcatct tagatatgtc 840 caagaggccg gtgagatcat agccaaggtc gaccaacatc tcgactccag gaaaaatgaa 900 gccccgtcag tcatccagcc cagatgtggt ggtcgtcgcc aggcaaaacg agaggaggag 960 cttcaagcag cccaacaacg agcagacgac gctcgaagcc aagcagaaga agcgcattct 1020 cgggccgagg agctgcgcaa tcaacaggaa gaagcagagg aggcactcct aaatctacaa 1080 ctgggcgatg atgatgcaga cgatcatcta acttcagtca gccagcatag tcatgtgtca 1140 cagctagcca ctaattggaa gcttcgacag cggcaagaga acgtagctcc tgacgcctgg 1200 atcgacagtt atgcagccgg aaagctgaaa ccgaccaatc ccaacaattc taggtcctct 1260 atcaaagcgg atctcgagcc atacagcggc agagcgttgg attggtttgc ctgggtggac 1320 ctgttcagag ccttagttca cgacacagct aaagcccctg gtgaaaagtt agccctgctt 1380 aaacgtcatt taagaggaga ttgcctggat gtcgttcacg gacttggagg aggagaagga 1440 gcctacatcg aagccttggt tcggctgaag aaatcatgcg gacggcgtga cgtgatgcgg 1500 gcagcgcacc tccaagccat cgaaaagctc gagttgaaga atgatccagc ggtcttcaag 1560 cggtacgccg agaaaattcg gacacatttc ttcgacctct ctcgcattgg ggagacggcc 1620 acagccgatc tgatagagaa agtttgcctc cgactacaac tcaacgacgg ccgaaaagga 1680 ggattggaga ctcggagcct gaacgcgttt ggagtctggc tttgtgatcg tgctgcagcc 1740 tatcagaatg cttacagcac agccgccgac caacttcaat caacacctaa acccgtacgt 1800 tttgccgccc gaacgaacca ggtctcttcg aagcaacatt ctggtccttc cacgcactcg 1860 aagccaacca ctcgtaattt ctgcttcaag tgcgaagggg agcacaaact cgaaacctgt 1920 ggagatttta aaaatcttta cgtcggcaac cgcgtcgcgt tttgcgcacg acaccgcttg 1980 tgttttggtt gtcttaaagc caaacactcc gtccgcttct gttctcagag gaagccctgc 2040 agccaaccgg attgcaccca tttccatcat ccattgctgc atgaagccac tcccagcagc 2100 gttgacactt ccgtcaccgc caggccgtcc attctgcaca tcgagtcaag aaaatccgga 2160 cgtgtggcga tggggatgat gcggcttcaa gttcgaaagg ctgatggaaa ctggacactg 2220 gccaacgtat ttgtagacga aggcagtgat tcgactctaa tgcgtcaagg ttttgcaaat 2280 tacctcaacc tacgcggagc tcggcacctt ttaacaatta ttggagctgg taacgtgatt 2340 aatcgttatc cttctcagcg aatcaccttc gacatgcgag acaccaacgg cgaaaccatc 2400 tccattccat gttccactct accatcggtg gtcagcgaca ccccggttac cgattggccg 2460 tctctgaaac cgcgttggaa gcatctttca gaccttccag ttacagcaac cggcggacga 2520 gtcgacatct taattggaac agacctttct catctagtcg ccgccttaga gtcaagagtg 2580 ggaggagact acgagccgac ggcaactctg aacagattcg gatggcttgt cagaggagtg 2640 gtgcaagagg gcaccatggt taccgcagtt cgtgcacaca ccgtcaccgg atcgtttcaa 2700 ttggcccagc tgaccgaaga aatgaaacaa ttctgtgaaa ccgagaacta cgggaccgaa 2760 taccaattgc caggcatgtc agcggatgag aaacgggcgg tctcaatcct ggacgacgga 2820 acgcgaaagc tggacctagg atatgaagta ccgatcacat ggcgtgaagg ggagccggac 2880 ctgatcaata atcgtcggat ggttgaagat cgattcagaa gccttctacg cagattcgaa 2940 cgggatcccc aatttgaagc agattatcgt gccgcgatga agaaaactct cgaccaaggt 3000 tatgcgtccc gtctggccgg accaaccgca gatgaagcaa gatatttcct cgcccatcat 3060 ggagtctaca aagggccgaa attgagagtc gtcttcgacg cagcagcacc tttcaaaggg 3120 aaatgcctaa acgacgcgat cctgagcggt cccgctttac agccatcact tcctgccgta 3180 ttaatccagt ttcgtgaagg agaagtggcc tgggcctccg atgtggaagc gatgttcagt 3240 cgttttcgac tacgcccagc ggatgctaac ttcttttgtt ttctatggaa ggaacctgat 3300 tcgccagatt atatcgtctg ccgcatggat cgactcccgt ttggagccac ttgttcacct 3360 tttatcgcga tccacaccag tcggagagcc acgatcgatg ccggagcaag agagaaaatc 3420 gtcgaagcag taaaaggaaa gttatacgtc gatgattatc taagttcatc cagttccgta 3480 gaaaagggct tagaagaagc cgtcgcggtc gagagagtct tatccagcgc cgatctacat 3540 ctacaagggt ggatatcgaa ctcacccgag tttacccaag ccatcatgaa ggataagcct 3600 gcaaaaccag tcgtcaactc acccggatgc tacaatctct ccagcaaaga gttcgacaaa 3660 aatgctcggg ctcgtgtggc ataccaagac ggacgctatg ggattccgag tggacaatct 3720 ggacaacatc gagtacaccc gcgtcggaat cacagtcaag gtcgccagcg tgttcgatcc 3780 attgggaaca gccgcacctt taatcgtgaa ggctaagata agattgaggg cgctcgggct 3840 gaaagcatcc agctggacgg aagcagtcga cgaagctgat cagacttggt ggaccgcctg 3900 gtttgacgtc gtccgaaaat tatcggctac tacagtggaa cgatgtttaa ttcccgaaga 3960 ctcccagata gaagagtctc agcttcacgt attctgtgac gcctccgagg aagcttacgc 4020 ggccgtcgtc tatgtgcgga acagttatcg tgacggacgg atcggcgtgc atcaaattaa 4080 ggccagcaac aagttagcgc cgaagaagac ggtgtcggtg cccaaactgg aactaaacgc 4140 agcccttcta ggatccagat tagcgcgatt cgtcggtagc tgtttaaaca agaaaatcca 4200 gagtcgtttc ttatggaccg atagtagtac ggtccgtaat tggatccgtg ccaccgcatc 4260 gtattaccag gtctacgtct ccaaccgtgt aggcgaaatc cagacgatca ccgagccgga 4320 agagtggcgc ttcgtgccgg gcaagatgaa tccagcggac gaagcgcagg atagttcgag 4380 gacgtgaagc ggtgaagaaa atccgaagac tgtgcccaat ctgtattcga gaacgagcca 4440 ctcccgccag tcaacttatg ggagatctac cctctttccg actggattct tattcgccac 4500 cgtttgccca tgtagctgtc gattatttcg gaccgttgga gaccagcccg ggccgaaata 4560 gagttttgaa acgctacggt gtgctcttca catgcctagt aactcgtgcc gtgcatttgg 4620 agatggccga gtctctttct acagaggatt tccttctagt cttccggcgc ttcatcagtc 4680 tctacacaaa accacttaca gtacattccg acaacggcac caatttcgtt ggtgctgaga 4740 atgtgcttaa ttctctcctt catgatatgc cgaaagatcc gtcatttcaa cgtttcaaca 4800 aggagaaaaa catcgactgg aagtttcaac ccccccgagc tccgcatttt ggtggcgcac 4860 acgagagcct cgtccgatcg accaagaagg ccctttaccg agccctggaa attgaaaagg 4920 aaggactcag gtatccgacc gacgagatgc tccgcacgct gctggccgag atcggtggaa 4980 tgttgaacgc ccgaccctta acctatgtca gtacggaccc gtcagatttc cggccactaa 5040 cgccaaatga tttcctcaac cgtcctccag cttgcgacct tccaccagga tctttctcag 5100 atgccttacc acgtgaaaga ttccggtacg tgcaacgcac tgctcaatta ttctgggacc 5160 tttggacgaa aatctacctt ccgtcactcg ttccgcgcaa gaaatggaaa gtggagcaac 5220 caaatttggc agtcggagac gtcgtactaa tgattgatcc caatcaaccc cgtggccaat 5280 ggaaaatcgg tcacatcatt cagaccttcc caggagaaga tggacttgtc agagtcgtga 5340 agattcaaac cgagacgggt gtctattcca gagcgatcca tcgtctatgt ctgctggaac 5400 gcgcatccac catctgcgca ccaaccgcag ccgacccagc tatcgacaag aacgtcccga 5460 ttcgagccgc gcgttgcaat catctccgac ttctatattc aaatcgcccc agattggact 5520 catccatcac ctctgtttaa attcagctct tgattcgggg gagaa 5565 // ID DENSOV_HM repbase; DNA; INV; 5133 BP. XX AC . XX DT 26-MAR-2008 (Rel. 13.03, Created) DT 01-APR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Integrated DNA virus. XX KW DNA Virus; Integrated Virus; DENSOV_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5133 RA Jurka J.; RT "Densovirus-like endogenous DNA virus."; RL Repbase Reports 8(3), 182-182 (2008). XX DR [1] (Consensus) XX CC This virus is present in >400 copies in the genome. The CC replication protein on the opposite strand is weakly similar to CC Papillomavirus helicase. The virus has ~60 bp long imperfect CC TIRs, and no obvious TSDs. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 3895..2804 FT /product="DENSOV_HM_2p" FT /note="Replication protein." FT /translation="MCRHYTKTVRTPLPRRVTQEFRLSTLLPLLTTRMLLS FT TCLTVISYVKAHITFDLDNIQDETIITKRQPLKQQQKQNFANYLFKHNIMN FT SRHYDDELARSEELRQLEFTIQLKDLEIIKKTIFRQVYLQQRNIDVLTPRI FT VVSNNKSRLGALMFFLSNIALQNQISVNRLKQLLCNFIFNLNHKKRGLILC FT GPSDSGKTFLANLLYSNFKPHEIGYFNCPTGPNPSAFLLQALSNCLAYRCD FT EMVFEHLGVIQLMKQLLEGSNTLTTDVKYKDAMPIDPHPTLITMNSTSKFD FT ILKWHPSEYQAFENRCTILFLGTPLLNIFNDKELQEINTCGPELLYMLSMH FT KQDKDTXTAGVLDKFQDYIFL" FT CDS join(191..1159,1159..2754) FT /product="DENSOV_HM_1p" FT /translation="MNVLIIIYLHAIFGIYVGALKLFNFQMGDDYYFQIPE FT GFNKNQAQFKQNKVNPKKSLQVAPGWDNQIFGVNEKNFKNKEVARIAESVV FT TGLTGAINPKSVVSAVKNVKAVAGLAKETVPYMHMPNVRKILKSALLPKNN FT ISKTEYIALKEMGKTRPTNIFEDMLYSDSDWSRSSTSNTSSVYSNSTKVGN FT VTDMSTVSESVEGWNYDPDYQPVPQDDLPDYMRLKPGNVDIYDPKEIPYIK FT EANIAKQLHKYSFEPYQFPKFYDELPKYAMPWSVMKESQTILEKLFGNYSF FT DVAAAYGADTQKNIRSSKRKKRLVAERLSRNLMAEQPPNHEVSASPDFVIE FT RGFDCSKGYYDEFCVEQSIYWYVDCTNDTVQNIYYFPVDYPNLWTNTDQYF FT MATGGIYYVDKSVECDVMDVRVENQLKTTQNSSVPLEMSNIGLRFFDIEGD FT QLISDVLYPLADSDDYAKIFDLKLSSADKKPSPLPTVKLLENSLFFNDKTG FT KPVTTRFANVVSTKDPRFFEKNNVAEKVIHSIVWKNDIGLRKPIKLLVNDP FT FAXRSFKFTDTSFDPGGQFIDSTASIERQKANGFFDENWSIINTYPCDYRF FT GHIHMPNYGIKNLIARSIVSEGDDKRKNIRYPVQLDRWYLQRQGISPYEQY FT NTGNIGXIYNAITPYCADSRDPYNIEQCFDVYLQPYDEFDHSSVLFKNRSN FT YDICAGDKSXESVNNTYWNNLPTVMPFSGNKNCKRLYFCIDPAYIGKDVMP FT TTVRGRLRLRHRIQIFERDWTYDRTRVDPVNELYKRPTIKQSRFGKDATYT FT TVTMANPFVKNNPKITFVNCYQKIFYPAFSTMFIGKAEXLDNDEPEEKKQK FT LL" XX SQ Sequence 5133 BP; 1469 A; 715 C; 945 G; 1977 T; 27 other; tggttggaag gatggtgggt tgatagtgaa tttataaatg tttcgtatta ctgctgcttt 60 gtaatccgat tataaaaggt atataaggcg tggttataat tatatattaa aatgtttttg 120 atgttgttag cgttggtatc ggttacacat tgcgattggt gtggccgtac tgcttgtaag 180 tcattgtata atgaatgtgt tgatcataat ttatttgcat gcgatttttg gaatatatgt 240 gggtgcactg aaactgttca attttcagat gggcgacgat tattactttc aaataccgga 300 agggttcaac aagaatcaag ctcagtttaa acaaaataaa gttaacccta aaaagtcgtt 360 acaggtygcc cctggatggg ataatcaaat atttggtgtt aatgaaaaga attttaaaaa 420 taaagaagtt gctcgtattg ctgaatcagt tgttactggt ttaactggtg ctattaatcc 480 taaaagtgtt gttagtgcag ttaaaaatgt taaagctgtt gctggtttag ctaaggaaac 540 ggtaccttat atgcatatgc ctaatgttcg taagatatta aaatcagctt tgttacctaa 600 aaataatatt agtaaaactg agtatattgc tttaaaggaa atgggtaaaa caagacctac 660 taatatattt gaagatatgt tgtattctga ttctgattgg tctcgttcta gtacttctaa 720 tactagctct gtttattcta attcaactaa agttggtaat gttactgata tgagtactgt 780 ttctgaaagt gttgaaggtt ggaattatga tcctgattat caacctgttc cacaagatga 840 tttacctgat tatatgagac ttaaacctgg aaatgttgat atatatgatc ctaaagagat 900 tccttatatt aaagaagcta atatagctaa acagttgcat aaatatagtt ttgaacctta 960 tcaatttcct aagttttatg atgaattacc taaatatgca atgccatgga gtgttatgaa 1020 ggagagtcag actatattag agaaattgtt tggaaattat tcatttgatg tagcagctgc 1080 ctatggtgcg gacacacaga agaatattcg ttcatccaaa aggaagaaaa ggttggtcgc 1140 agagaggtta tcacgtaatt aatggcagag caacccccga atcacgaagt cagcgcgtca 1200 cccgatttcg ttatcgaaag aggatttgat tgcagtaaag gttattatga tgaattttgt 1260 gttgaacaat ctatatattg gtatgttgat tgtactaatg atactgttca aaatatttat 1320 tattttcctg ttgattaccc taatttatgg actaataccg atcaatattt tatggctact 1380 ggtggaatat attatgttga taagtcagtt gaatgtgatg ttatggatgt tcgcgttgaa 1440 aatcaattga aaactacgca gaattcctct gtacctttgg aaatgtcaaa tattggtttg 1500 agattttttg atattgaagg tgatcaatta atatctgatg ttttatatcc tttagctgat 1560 tcagatgatt atgctaaaat atttgatttg aaacttagtt ctgctgataa gaaaccttct 1620 cctttaccta ctgttaaatt gttggaaaat tctttgtttt ttaatgataa aactggtaaa 1680 cctgttacta ctcgatttgc taatgttgtt agtactaaag atcctagatt ttttgaaaag 1740 aataatgttg ctgaaaaagt tattcatagt attgtttgga aaaatgatat tggtttacgt 1800 aaacctatta agttattggt taatgatccg tttgctwgta ggtcatttaa gtttacagat 1860 acctcatttg atccaggtgg tcaatttatt gattctactg cytctattga rcgtcagaaa 1920 gctaatggat tttttgatga aaactggtct attattaata cttatccttg tgattatcgt 1980 tttggtcata ttcatatgcc taattatgga attaaaaatc ttattgctcg ktctattgtt 2040 tcagaaggtg atgataagcg taagaatata cgttatcctg ttcaattaga tagatggtat 2100 ttgcagaggc agggtatatc tccttatgaa caatataata ctggtaatat aggttstata 2160 tataatgcta ttacacctta ttgtgctgat tcacgagatc cttataatat tgagcaatgt 2220 tttgatgttt atttgcaacc ttatgatgag tttgatcatt cttctgtatt atttaaaaat 2280 cgttctaatt atgatatatg tgcaggtgat aaatctrcwg aaagcgttaa taatacctat 2340 tggaataatt tgcctactgt tatgccattt agtggtaata aaaattgtaa gcgtttatat 2400 ttttgtattg atcctgctta tattggtaaa gatgttatgc ctactactgt tagaggtcgt 2460 ttacgactta gacatcgaat tcaaatattt gagagagatt ggacatatga tcgtactcgc 2520 gttgatcctg ttaacgaatt gtataaacgt cctactatta agcaatctcg ttttggtaaa 2580 gatgctactt atactactgt tactatggct aatccttttg ttaaaaacaa tcctaaaata 2640 acatttgtta attgttatca gaaaatattt tatcctgcct ttagtactat gtttattggt 2700 aaagctgaac wgttagataa tgacgaaccc gaagaaaaaa aacaaaagtt attgtaaatt 2760 atatatttta ataaaattat ttacaaatat attgttatta ttataaaaaa atatagtctt 2820 gaaatttgtc taatacccct gcagtgtwtg tgtctttgtc ttgtttgtgc atgcttaaca 2880 tatacaataa ctcaggacca caggtgttta tttcctgtag ttctttgtca ttgaatatgt 2940 taagtagtgg tgtmcctagg aagagtatcg tacacctatt ttcaaatgct tgatactcag 3000 aaggatgcca ttttaatatg tcgaattttg atgtagagtt catggtgata agagttggat 3060 gtggatcgat tggcatcgca tctttgtatt tgacatctgt tgtaagtgta tttgaccctt 3120 ctaatagttg tttcattaac tgaatcacgc ctaggtgctc aaaaaccatt tcgtcgcatc 3180 tataagctaa acaattactt aatgcctgta gtaggaaagc tgatggatta ggtccagtgg 3240 gacagttgaa gtacccaatt tcatgaggtt taaagttaga atataataag ttagcaagaa 3300 acgttttacc tgaatcagat ggtccacaaa gaatgagtcc tcgtttttta tggtttaggt 3360 tgaagataaa gttgcaaagt aattgtttta gcctattgac ggagatttgg ttttgtagkg 3420 cgatgttaga tagaaaaaac atcagagcac ctaaacgtga tttgttattt gaaacaacta 3480 ttctaggtgt tagtacgtct atgtttcgtt gttgtaggta tacttgtctg aatatagttt 3540 tttttattat ttccaagtct tttagttgta ttgtaaattc tagttgtctt aattcttcac 3600 ttcttgctag ttcgtcgtcg taatgtcttg aattcattat gttatgttta aataaataat 3660 ttgcaaagtt ttgtttttgt tgttgtttta gtggttgtcg ttttgttatt attgtttcgt 3720 cttgtatatt gtctaagtca aatgtaatat gtgcttttac gtagctaatc acggtyaagc 3780 acgtagataa cagcattctt gtagtaagaa gcggtagtaa cgtgcttaac cgaaattctt 3840 gagttacacg tcgtggcagt ggtgtccgta ccgttttcgt gtaatgtctg cacatcctct 3900 gcagaggtat tgtcgtgctc tgaagaatgc tcttctaatt cgkagttgtt ctccctctgg 3960 gcttctttct gatttntcga ggacatakga tgcaaggatg gagtgtgtgt gtagttgtcc 4020 tgaatattcg ctttctttac catcgaagta tttgcatatt gtatgggttt tagtgcatag 4080 gtcgagggca tcacatttcg tttctttttg gttgtaccag atagatgtgy gttggctttc 4140 ctgtatgaat aacatgaaaa atatcaaatc ctckttttaa atctttrtta aaacacagta 4200 gttcctaaaa ttatatattt attatttaag tttttaaata aagttatata tattaatttt 4260 aatatatcct atatatttat aatattttct taccgaattg tctaatttca attctgcgca 4320 tctgtttgtt cttgtgatgt tgtcacagca tccggcgatt tcttcttctt cgatttcttc 4380 racccattct tcgtccgaag atatttcgat aatattyttt ccgcttgttt tagttgtctc 4440 tkcttcgttt tcaacagcat cttcgcactc tgattgaata rttgttgatc cttcttcact 4500 atccagctca gatccttata tgwtagattt tccaggtttt gttccggcat cattaatatt 4560 atcttctcct cgaagcattt tgctagakct tgctctgttg tcatttgttt cgtcctgttg 4620 ttgtgatatc tcgatttcag tatttgaatc gctttgtcgt aatgggtatt cgtttcctgt 4680 gattgtatag tattcgttga gcagctctct gtattcgtat tcgttgtgtc ctttttgtag 4740 ttgttgttcg atatattcga gttttcgttt gattgttggt gagtctakgt taattcgtcg 4800 ttttggagct ctttcttttc ctcttggaag attcttttct ratgtttggc attttaattt 4860 aatattccaa taaacgtcgt ttgcgtaatc tttatcttga tgaaatttta tttcgcttgt 4920 ttcttcgctt tctaaactca taactgcttt attaaaatta agtttggttt ttattataaa 4980 aaatgattga cgttgctatg gtaaataata gagaaaaaaa caaaaagatt atccttttat 5040 aataggggaa attcccaggt taagtcccta ttgtttaaag tgacagtaat acgaaacatt 5100 tataaattct tctccaacct accatcccaa cca 5133 // ID BEL-82_CQ-I repbase; DNA; INV; 5636 BP. XX AC AAWU01005025; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-82_CQ_; KW BEL-82_CQ-LTR; BEL-82_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5636 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 305-305 (2011). XX DR GenBank; AAWU01005025; Positions 6237 602. XX CC Positions [4628-5218] - Integrase core CC 'AATAC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 200..5605 FT /product="BEL-82_CQ-I_1p" FT /translation="MHQPRARETEEQKEEKKTFYSPIREQILEHKRKELKR FT KMEEKVKALVRQRGAVKSKLTRISTALVDVVEGRQNPNLKNIPFLRQNSKN FT VDACYREYNTIQNEIVALPLSDDARTEQENRYIEFEHLYNEVSIKLETLLE FT TAVKREQRLSLASANQLVPANAAAAVVQPPPLSVPLPSFDGNYENWKSFKC FT MFTTIMARYQTESPAIKLYHLKDCMKGKAMGFIDEETIKNNDYDAAWASLE FT ERFEDNRLIIDKHIEALFNLPKIARESAEELRKLLDTCTKNVNALKNLDLP FT VVGLGEQMLLNVLSAKMDKDTRKTWETRQKAGQLPAYAATIAFLKERCKIL FT EKIEVNTKSNSEATKPSRAVARSNTLVSTTEQKCAVCKSDHELWKCDQFKN FT YSAKEKYSVLRKSGSCYNCLQRGHRLSECSSSSCQKCGKRHHTLLHVGDKK FT PADPSSKNEEADLCRSKEPQVRTEQATGSSSTRSETTATLCAQVERSVNVT FT LLSTAVVLAYGSGGVSYPCRVLLDSASQTHFVTEQFAKLLALERQPASFLV FT CGLNGSDTRIRSKIEVRIKSRVNEFSLALETLIAPKITGELPSYSIDVRKW FT PLPPGTELADPAFFKRNKIDMLIGADVFWDLIKSEQIEMGPNLPTLRDTKL FT GWIVGGSVSSASPAKLRTLCTVTENDRLSDILKQFWTIEGSDELLAPGPGD FT TASNAECLEHFRRTHQRDGEGRYIVRLPFNERKNQLGDSKQMAMKRFLTVE FT RRLDREPELKQQYAAFMQEYENLGHMREVRTEDADNGGPAYYLPHHCVIKP FT TSTTTKLRVVFDGSARTTSGVSLNDALMTGPTVQNDLMAILLNFRCYRFAL FT TLDIPKMYRQVRINPDDARFQRVFWRDDKNQPLKIFDLLTVTYGLASSPFQ FT ATMTLNQLADDHGAEFPRAAAVLQSSCYIDDVLTGAQSLDDALQLQREIVG FT LLRCGGFAAHKWCSNAPSILEAIPEAQQGAKLNVAELDLNALVKTLGVAWQ FT PASDTFSFDVVDFESASNERLTRRKVASQVAKIFDPLGFIGPVLTAAKLIL FT RGVGSLKTDWDELVPPNMGRQWQSFRRELPVLKKLRLPRWILYKEMQFVEL FT HGYCDASDQAYGACIYTRLTRLDGSTVMKLVCSKSRLLPKATKKKKEISTP FT RGELLAALLLARLVTKVLSSTSTTFASVNLWSDSQIVLSWIRKPPQHLQLF FT VANRVAEIQRLTGAFRWGYICTDSNPADLISRGTTPNKLLKSPIWWDGPPA FT VPTTTDAGPMIPDERLPELKSAVALALTAVERLKAFDDVSDFFKLRRAMAY FT VARFAEFIISKRKKVTKGWLTGKEIKRAEGIIVKLVQAEAFHTEIAALLDD FT KRARHRLCGLNPFLDVDGLLRVGGRIKHAFIPYDSRHQMVLPAKHPVTELL FT IRHLHEENLHLGQRRLLAVVRQRFWPLNVKTTIRKVVRSCITCFRVNPTKT FT SQLMGDLPSYRVQPAPTFARTGIDFAGPFWIKSNAAVRKPTITKGYVCLFV FT CLCTRAIHLELVSNLSSDAFLAALRRMASRRGVPSTILSDNGTNLVGGNNE FT LEELARLFQNELHQKNLDSFCSSKNIEWRFIPPRSPHFGGIWEAGVKSMKY FT HLKRIVGERRLTFEELYTTLTQIEAVLNSRPLTQSSDDPNDFTAITPAHFL FT IGRELQAIPEPSYLALKESTLSRWQLVQTLQQHFWKRWTKEYLPELQNRQK FT WYKKTTIKPGALVLIIDRNTPPMQWPLARIVALHPGKDNVTRVVTLRTPKG FT ELKRAVHEICLLPLDQEVQPEERPEEQLAKN" XX SQ Sequence 5636 BP; 1430 A; 1541 C; 1561 G; 1104 T; 0 other; taaatttctg gtccttctgt gccgaatgaa gtagtgaacc ggttaaagtt acgggaaagt 60 ggattttcga cggaaaatcg cgaaaatgag cattcggttc ggctaaagtg tcggtcgagt 120 gtagcggcgc catcttgaat tcgcgagagc ccgcttcagt gaaaaaaaaa aaagatcagt 180 gttcccagca tggaaaacga tgcaccagcc tcgcgctcgg gaaactgagg aacaaaagga 240 agaaaagaag acgttctaca gccccatcag ggagcagatt ttggagcaca agcgaaaaga 300 gctgaaaagg aagatggaag aaaaagtgaa agctctcgtc cgtcagcgag gggcagtgaa 360 gtcgaaattg acgcgcatca gcacggcttt ggtcgacgta gtggagggaa ggcaaaatcc 420 taacctcaaa aacattccgt ttttgcgtca aaactcgaaa aatgtggacg cttgttatcg 480 tgagtacaac actatccaaa atgaaattgt cgcgttgccc ctctctgacg atgcgagaac 540 ggaacaagag aaccgttaca ttgagttcga gcacctgtac aacgaggtca gcatcaagct 600 ggagaccttg ttagaaaccg ccgtcaaacg ggagcagcgc ctgtcgttgg cgtcggcgaa 660 tcagctcgtc ccggcgaatg cggctgccgc tgtggtccag ccgccgccgt tgagtgttcc 720 gcttccgagt tttgatggaa actacgagaa ttggaagtcc ttcaagtgta tgttcacgac 780 gatcatggcg cgctaccaga cggagtctcc ggctatcaaa ctctatcacc tcaaggattg 840 tatgaaggga aaggcgatgg ggttcatcga tgaggagacc attaaaaaca atgattatga 900 tgctgcctgg gccagtctgg aggagagatt cgaggacaac cgcctcatca tcgataaaca 960 catcgaggcg ctcttcaacc tacccaagat cgccagggag agtgccgaag aactccggaa 1020 gctactggat acgtgcacca agaacgtgaa cgccctcaag aatttggacc taccagtcgt 1080 gggtctagga gaacagatgc tgctcaacgt tctgtctgcc aagatggaca aagacaccag 1140 gaagacctgg gagactcgcc agaaagctgg tcagcttcca gcgtacgcag cgacgatcgc 1200 ctttctcaag gaacgctgca agattctgga gaagatcgag gtcaacacga agtcaaactc 1260 ggaagcgacg aagccatccc gtgcggtggc tagaagcaac acgttggtgt caacgacgga 1320 gcaaaagtgt gcagtgtgta aaagtgacca cgagttgtgg aagtgtgatc agttcaagaa 1380 ctacagtgcc aaagaaaagt acagtgtttt gagaaaaagt ggttcctgtt acaactgcct 1440 acaacgtgga caccgcctca gcgagtgttc atcgagttcc tgccagaaat gtggcaaacg 1500 ccaccacact ctgctgcacg tcggagacaa gaagccggca gatccgtcgt ccaagaacga 1560 agaagcggat ttgtgccggt cgaaagagcc gcaggttcgg acggaacagg caactggttc 1620 tagcagcacc cgatcggaga cgaccgcaac gctttgtgcc caggtcgaga gatccgtgaa 1680 cgtcacgctc ctgtcaactg ccgtcgtcct ggcctacggc agtggaggcg tcagctatcc 1740 gtgcagagtc ctcctagact cagcgtccca aacgcacttc gtgactgaac agttcgcaaa 1800 attactcgct ctcgaaagac aacccgccag tttcctcgtc tgcggattga acggttcgga 1860 cacgcgcatt cgatccaaga tcgaagtccg tatcaagtca cgagtcaacg aattcagctt 1920 ggcactcgag acgctcatcg caccgaagat caccggggaa ctaccgtcct actcaatcga 1980 cgttcgcaag tggccgctcc cgccagggac cgaactcgcg gaccctgcgt tcttcaagcg 2040 taacaaaatc gacatgctga tcggtgcgga tgttttctgg gacctcatca agagcgagca 2100 aatcgagatg gggccgaact tgccaacgct gcgtgacacg aaacttggct ggatcgtcgg 2160 aggatcggtg tccagtgcaa gtccagcgaa actacgaacc ctctgcactg tgactgagaa 2220 tgaccgcctc agcgacatcc tcaagcagtt ctggacgatc gaaggcagcg acgagctgct 2280 ggcccccgga cccggcgaca cggcatcgaa tgccgaatgc cttgagcatt ttcgccgtac 2340 tcaccagagg gacggcgaag gacgatacat cgtcagactc ccgttcaacg agcggaagaa 2400 ccagttgggc gattcgaagc agatggccat gaaacgcttc ctgaccgtcg agcggcgact 2460 cgacagagaa ccggaactca aacaacagta cgctgcattc atgcaggaat acgagaacct 2520 gggccatatg cgtgaagtcc gcacggaaga tgcagataac ggtggaccgg cgtattatct 2580 cccgcaccac tgtgtcatca aaccgacgag cacgacgacc aagctgcggg tggtgttcga 2640 tggctccgcc aggaccacgt ccggcgtctc actaaacgat gccctgatga ccggaccgac 2700 cgtccaaaac gatctgatgg cgatactcct gaacttccgg tgctaccggt tcgcgcttac 2760 cctggacatc ccaaaaatgt atcgccaagt gagaatcaac cccgacgacg cacgatttca 2820 gcgagtgttt tggagagacg acaagaacca gccgctgaag atctttgatt tgctgaccgt 2880 aacgtacggt ctcgcctcgt caccgttcca agcaacgatg accctgaacc agctcgccga 2940 cgaccacgga gcagaatttc cacgcgctgc cgctgtgtta caaagctctt gctacatcga 3000 cgacgttttg acgggtgcgc agtctctgga cgacgcactt caactgcaac gagagatcgt 3060 cgggttgctg cgatgcggtg gtttcgccgc acacaagtgg tgctcaaacg cgccctcgat 3120 tctggaagcc attccggaag cacaacaagg agccaaactg aacgttgctg agctcgattt 3180 gaacgccctc gtaaaaacac tcggagttgc ctggcaaccg gcgagcgaca cgttctcttt 3240 cgacgtggtg gatttcgagt cagctagcaa cgagcgactc actaggcgga aggtcgccag 3300 ccaggttgct aagatcttcg acccgttagg gttcatcggg ccggtcctga ccgcggccaa 3360 attgatcctg cgcggagtgg ggtctctcaa gacggactgg gacgaactcg ttcccccgaa 3420 catgggtcga caatggcagt cattccggcg tgagctgccg gttctgaaga aactccgcct 3480 accgaggtgg attctctaca aggagatgca gttcgtggag ctgcacgggt actgcgacgc 3540 gtcggaccaa gcctacggcg cctgcatcta cactaggttg actcgactgg acggatcgac 3600 cgtcatgaag ctggtatgca gcaagtcgag attgctcccc aaggccacca agaagaagaa 3660 agagatttca acaccgcgcg gcgaactgct ggcggcgcta ctgctcgctc gtttggtaac 3720 caaggtgctg tcgtcgacaa gtacgacgtt cgcttcagtc aacttgtgga gcgactccca 3780 aattgtactc agctggattc gaaagccacc gcaacatctg cagttgtttg tggccaatcg 3840 agtagcagaa attcagcgac tgactggagc ttttcgctgg ggctatattt gcactgactc 3900 caatcctgct gatttgattt cccgaggcac aactccgaac aagctgctga agagccccat 3960 ctggtgggat ggcccgcctg cagtgccgac gacaactgac gccggaccga tgattcccga 4020 tgaaagactc ccagaactga aatcagcggt tgcgctcgct ttgacagctg ttgaacgatt 4080 gaaagccttc gacgacgtga gcgacttctt caagctgcga cgcgccatgg cctacgtcgc 4140 tcgatttgca gagttcatca tttcaaagcg caagaaggtg accaaggggt ggctcactgg 4200 taaggagatc aagcgtgcgg aaggaattat cgtgaagctg gttcaagccg aggcgttcca 4260 caccgagatt gctgcactac tggacgacaa gagagcacgg caccggctgt gcggactcaa 4320 cccgttcctc gacgtcgatg gcctcctgcg ggtcggtggg cgcatcaaac acgccttcat 4380 cccgtacgac agtcgccatc agatggtgtt gccggccaag catccggtaa ccgagctcct 4440 cattcggcac ctgcacgagg agaacttgca cctcggccaa cggagacttc tcgccgtcgt 4500 cagacagagg ttctggccgc tcaatgtgaa gacgacgatc aggaaggtgg tccgcagctg 4560 catcacctgt ttccgggtca acccaacgaa gacgtcgcag ctgatggggg atttgccctc 4620 ataccgagtc cagccagcac caacgttcgc cagaaccgga atcgattttg ctggcccgtt 4680 ctggatcaaa tccaacgctg ctgtgaggaa accaacgata acgaagggat acgtctgtct 4740 cttcgtttgc ctgtgtacaa gagctatcca cctagagctt gtatcaaacc tttcgtccga 4800 cgccttcctc gctgcgctgc ggcgcatggc gagccgtcgt ggtgtgccca gcaccattct 4860 ctcggacaac ggaaccaacc tggttggtgg caacaacgaa cttgaagagt tggctcgtct 4920 cttccagaat gagctacacc agaagaacct ggacagtttc tgcagcagca agaatatcga 4980 gtggcgattc atccctcccc gcagcccaca ctttggagga atttgggagg ctggggtgaa 5040 gtcgatgaaa taccacctca agcggatcgt tggagagcgc cgtctcactt tcgaagaact 5100 ttacacgact ctcacccaaa tcgaagcagt gctcaactcg cggccgttaa cgcaatcgtc 5160 ggacgacccg aatgacttca ctgccatcac cccggcccac tttttgattg gaagagagtt 5220 acaagctatt ccggaaccgt cgtacctcgc cctgaaggaa tccacgctgt ccaggtggca 5280 actggttcag acgctgcaac aacatttttg gaagaggtgg acgaaggagt accttccaga 5340 attacagaac cgtcaaaagt ggtacaagaa aacgacgatc aaaccgggcg ccctggtgct 5400 gatcatcgac cgcaacacgc caccgatgca gtggcctctc gctcggattg tggcgctgca 5460 tccgggaaag gacaacgtga ccagagtcgt gacgctgagg acgcccaagg gagagctgaa 5520 gcgtgcagtg cacgagattt gtcttctacc gttggaccag gaggtccagc cggaggagcg 5580 accggaggag cagctggcga agaattgaaa tggttcattt caacgcccgg ggagaa 5636 // ID DNA8-81_AP repbase; DNA; INV; 932 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-81_AP. XX NM DNA8-81_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-932 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2017-2017 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 932 BP; 315 A; 155 C; 130 G; 330 T; 2 other; caggggcgtg ctcacggggg ggggggtcat ggggttcaac cccccccccc ccccattgac 60 cgtattattt ttatttttta ttataaaaca aataatgcat ctttataatg tacttgtcga 120 attatattat cataaaattt cccaattcgt attttatcgt taaatgcttc caacttcgaa 180 tatgtacact gccccgcagt tgccgatact tttaatatac ttttaatagg tactcaaaat 240 ataaatttgt tttttaggct ttcagtgttt attttctgga acttattcag aaaactttga 300 tgaacttgtt gaattttacc tcgatactga taagctaaca gttaaaagct taaagctgaa 360 ctacatatat ggtataccaa acttaataag catgtaaaat atccaaaaaa ttgtttagaa 420 gcgttgagac agtgtgataa agaaatattt cccaatatac attttntatt aaaaatacta 480 tgcactttac cngtttcaac atccaccccc gagagaactt tttcttgtct caaaagattg 540 aaatcttatc tacgaaacac aatgactgag gtattttctg gatattaaaa taaaaattta 600 tatcaatttt tttaacattt ttttggtttt agactcgact taatggctta gctatgttag 660 ctgttcataa agaaattcca ctaacagcag aagaagtttt aaatgaattg agcaaaaaat 720 caagaaaact agattttgtt ctttaatttt tgaatgtata ttgtatatta gtataatagt 780 ttctatatta gcaatataat atttaaatct ataaataatt gaatttgttg ttgattaaaa 840 aaaaaaaaaa aatggtcgtt aggtaaaaga ctagctcctt caattttact tgaacccccc 900 ccccccacat aaattttctg agcacgcccc tg 932 // ID BEL-620_AA-LTR repbase; DNA; INV; 488 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-620_AA_; KW Pao_Bel_Ele68; BEL-620_AA-I; BEL-620_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-488 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 488 BP; 129 A; 113 C; 105 G; 138 T; 3 other; tgtcggagtt caaggtaagc tgcattgagc ctcccctcga tcattggcta ccaatctaac 60 gatcgttacc actcacagcg atcgttgggt ttctcgcttg sgcgggaata tgmgagaaga 120 gattttgtat tatacaatag actggccttt tgccaaaccc caaattgccc aatttaccag 180 ttctttagaa atttgcaatc cgggcagtcg cgtcgaatac tttgtattga agttaatttt 240 gtagttaatt atgtaactag tttaagcagt ttagttaaat aaatcacagt tatagtacgg 300 tgtktgttta aagtaaaacc cgatttctat tgtgctacgc cgaaagaccg cgaaaacccg 360 gtgataaact gcccactttc caaacgtgtt ggctgccaac gactccaccc ccgcgaggta 420 ccccaggatt gcgctgcgtt gagatttcgg gaggttcaca cccacggaac cagtcttgct 480 atcgaaca 488 // ID CHAPKA1_EM repbase; DNA; INV; 2548 BP. XX AC CHAPKA1_EM; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 01-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Chapka-Em1, a new member of the hAT DNA transposon superfamily DE from the single-celled free living eukaryote Entamoeba DE moshkovskii. XX KW hAT; DNA transposon; Transposable Element; Interspersed repeat; KW Chapka-Em1; CHAPKA1_EM. XX OS Entamoeba moshkovskii OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-2548 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; CHAPKA1_EM; Positions 1 2548. XX CC Chapka-Em1 is a member of the hAT superfamily. The TIRs are CC 21-bp long and are flanked by 8-bp TSD. The element contains a CC large ORF, which can potentially encode a 697-aa protein 39% CC similar to the Homer transposase from Bactrocera tyroni. There CC are several elements closely related to Chapka-Em1 in the E. CC moshkovskii, E. invadens and the E. terripinae genomes and CC multiple divergent lineages of transposase. XX FH Key Location/Qualifiers FT CDS 286..2377 FT /product="CHAPKA1_ORF" FT /translation="MEESLSTISFVEEEEDILKQISERRKNIPKNSVVWCV FT FDLEENTKTLDGIKTKIWRIDEIFYEEKPTQTKRMFRKSFVEALNRSDSSF FT KLKCKLCYYEHCYYKSSKSSSAVLKHIEMCHPEIIDEHQRNHRRPYASITG FT SIDPFNTKANVLLVSFIILNGLPYSLVDSPMFNNFIGHINERYVVPKRNSV FT SDKLIPAMVKMAKQFILNQMGNCQSICCTIDGWTTKFQQIPFFSFTAHYYC FT DNKLVSRVLKLSPXFERKTKDNISTFIXXSIKEWQLEKYNSFPIICDNAND FT VMSGVQESGMIKIGCCCHRMNLVMNNVLDSCLIFNNLVKKCXSIESKFKNS FT SSLNSILNEIENNYFGSTLNTLVQVETRWFSNLTMLKRIFEIHNVLEGIIQ FT AVSENYPSNVRNLFLDSYLLTKNELILIEFFIGHSSKLNSICNDFSSQEIG FT TLSIVIPTIKNVVAELKFEKEQILATESSVISISNYSIVTNEGNITVNNFK FT DIEECIDSVLTIESNESNYDIKEIKKNFIDQLIISIEEKFENRNNIFKNET FT FLIATTLNPRFKIDFFNDEELSQVKDIINRMIGDNSPHHEIHELGRVRRAS FT SINEFDRYVAEECIKSNDFNDVINYWERSKTIFPNLYPLAMKHVSYLCSST FT ASERLFSNASNNFINKRTGLLCYHLEELCILKSFIENDGITLFENLSIX" XX SQ Sequence 2548 BP; 944 A; 356 C; 371 G; 868 T; 9 other; tagagttgtg taaactgttt tcctcagttc gcagaaaaaa atttgcgaac aaattaattt 60 tgcagaattt taatttccca aaatcagttt aattttccac gaaattttgt ttgcgaaatt 120 agtcagtccg taaaaaaatg ttattttttt gcagtttttt tgttattttt ksmaaaatca 180 ttcgcagctt ttatgatttt taatcatttt tggaatgtaa taaactggaa attgtgaaat 240 aaagttgaac aaagttttga gttaatctta taattaaata ccataatgga agaatcactt 300 tcaactattt cctttgttga agaagaggaa gacattttga aacaaatttc agagagaaga 360 aaaaatattc caaagaacag tgttgtttgg tgtgtttttg atctcgaaga aaacaccaaa 420 acacttgatg ggattaaaac caaaatatgg cgcattgatg agatttttta tgaagaaaaa 480 cccacccaaa ccaaaagaat gtttaggaaa tcatttgttg aagccctcaa cagatccgat 540 tccagtttca aattaaaatg caagttgtgt tactatgaac actgctatta caaatcatct 600 aaatcaagtt ccgcagtatt aaaacatatt gaaatgtgtc accctgaaat tattgatgaa 660 catcaaagaa accatagaag gccatatgca tcaattactg gatctataga tcctttcaac 720 acaaaagcga acgttcttct tgtttcattt ataatattaa atggacttcc ttacagcctt 780 gttgactctc caatgttcaa taacttcatt ggtcatataa atgaaagata tgtagttcca 840 aagagaaact ctgtaagtga caaattgatt cctgcaatgg ttaaaatggc aaaacaattt 900 attttgaatc aaatgggtaa ttgccagtca atttgttgca ccattgatgg gtggacaact 960 aaatttcaac agatcccatt tttcagtttt actgcacatt attactgcga taataaatta 1020 gtatctcgag tattaaaact atcaccatyt ttygagcgga agacaaaaga taacatttca 1080 acattcatam yawattcaat caaagaatgg caacttgaaa aatataattc atttcccata 1140 atttgtgata atgcaaatga tgtaatgtct ggagttcaag agtctggaat gatcaaaatt 1200 ggatgttgtt gtcacagaat gaatttagta atgaacaatg tcttggacag ttgtttaata 1260 ttcaacaact tggtcaagaa atgtcrgtca attgagtcaa aatttaaaaa ttcgtcatcc 1320 cttaactcaa ttttaaatga aatagaaaat aattactttg gatctacttt aaatacttta 1380 gttcaagtag aaactagatg gttttcgaat ttgacaatgc taaaaagaat atttgaaatt 1440 cataatgttt tagaaggaat tattcaagct gtatctgaaa attatccttc aaatgttcga 1500 aatttgtttc tagacagtta tttgttaaca aaaaatgaat taattcttat tgaatttttc 1560 attggtcatt cttcaaaact taattcaata tgtaacgatt tttcttctca agaaattgga 1620 acattatcta ttgtcatacc aacaataaaa aatgtagtag ctgaattaaa atttgaaaaa 1680 gaacaaatat tggcaactga atcaagtgta atttcaatat cgaattactc aatagttaca 1740 aatgaaggga atattactgt taataatttt aaggacattg aagaatgtat tgattcagta 1800 ttaactattg aatcaaatga atcaaattat gacattaaag aaataaaaaa gaattttatt 1860 gaccaattaa tcatttccat tgaagaaaaa tttgaaaatc gtaacaacat tttcaaaaat 1920 gaaacatttt taattgcaac cactctcaac ccccgattta aaattgattt ttttaatgat 1980 gaagaattgt ctcaagttaa agacattatc aatagaatga ttggagataa ttctcctcat 2040 cacgaaattc atgaattggg aagagtaaga agggcaagtt caataaatga atttgacaga 2100 tatgtggcag aggagtgcat aaaatcaaat gattttaacg atgtcattaa ttattgggaa 2160 agaagcaaaa caatatttcc taatctatac ccacttgcaa tgaaacatgt atcgtattta 2220 tgttcttcaa ctgcttcgga aagacttttt tcaaatgctt caaataattt tatcaacaaa 2280 agaactgggt tgctgtgcta tcacttggaa gaattgtgca ttttgaagtc tttcattgaa 2340 aatgatggca ttactttgtt cgaaaattta tccatctaat tcatttttat ttgttgaaat 2400 ctttcatttt ccgtggattt tattttttgc aaccttatat tttaaaaaaa ttattttgta 2460 aactttctac ggaatccacg aaaagtcaaa atttcgcgga aattttttcg cggaatccac 2520 aactataaaa aaaatttaca caactcta 2548 // ID BEL-180_AA-LTR repbase; DNA; INV; 194 BP. XX AC supercont1.7; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-180_AA_; KW BEL-180_AA-I; BEL-180_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-194 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.7; Positions 3241623 3241816. XX SQ Sequence 194 BP; 62 A; 42 C; 42 G; 48 T; 0 other; tgatgacgcg tgtagagtac catcaggcga ccccccgtcg ttcttgcgct tggcgcgagc 60 atcgtgtgga gagaaaaaga aaaagtacag tcaacatcag tttgaataaa aacgtaaaac 120 ggatcgcacg cgtttcattc gattcatttg caaattgtct gaaaaatcca attcctaaac 180 atttcggtta aaca 194 // ID Gypsy-34_NVi-I repbase; DNA; INV; 12855 BP. XX AC . XX DT 29-JUN-2009 (Rel. 14.07, Created) DT 29-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Gypsy LTR-retrotransposon from Nasonia vitripennis, interanl DE region, consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-12855 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1387-1387 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(10353..11144,11113..12003) FT /product="Gypsy-34_NVi-I_2p" FT /translation="MLEKHTALVAHACGPHCEIGELQQAVKEAVRQAGRVL FT DLLLVHNIGDEPRRARRSLLPFIGTIHKFLFGTLTEADEAEIQEAVRAIAN FT DTKITAALLANQTEIIDRALSNLDTKLTRLEATTAILINKSITSDNEIAIR FT SAVQTVKDNLLQFKMDTEVLTDAILFATQGMVHPRIMPPETLLYAAKTAAN FT TVANAKFPSPDGNFSAIPILKISKVTVLLANSHLVYQIAIPLLDIQKFNLF FT KASPLPSVQRALNISNIAAYIWPIFQISRPIFGPEFHFFAVSESNRSYMPI FT PPERIHKLRKLGDLLIAVNPEPIREIRSNVACEIKIASGHALGKAEHCDIR FT IKQLRDTVWLRLHKTNAWIFSVSNAENIYIQCQRAEQITSEISGTGVLELR FT PGCSAHTANAHLAASRSLTSYANASVFDIVVFNVSAMLTEIGSIEGKAIEL FT QQAIEIEAKNRAENSHGVKFETLETGTALRDITTKAREIADRKEKSFELEN FT LSSFTSKFSYSSWTAVIIIIVIISGVWIIKKRRAGQPMRKLIKQQEKQLEL FT QTLREMNRARSRD*" FT CDS join(840..3575,3536..4879,4852..7575,7542..9245) FT /product="Gypsy-34_NVi-I_1p" FT /translation="YLGVPSNLTIKESTRGIKISLPVSYIPGPSYTHTVVN FT RQLSKTVMAPTGILGAARGVLGLGARKKQLDTGAKSEPEWDSEQMRYARNV FT YRNTQKMTPRTRRVVRKHISKDCLLSSDSENEPRDARMAVNWIEARKNLLD FT EHSRPPKSYELPKQSDDSGEKLIVLDDENQARNEADASAEAPESNVADPSN FT TDANAELAGAAAANRENAVRVVDVAGVQEKFGRLNLRTNICDKMEVPLAAK FT ERLAINEKCVSGVSSLFEIKNIALGNGLSQILPKTGIDDRPWAGRSQTSIG FT PGHRSYGGNSQNTAQRKIAPIEIPAAQRPVCKFELSREELEVILRMREDRL FT LKESRINPKPIIDQFSSDESESEHRSVHKAVVHKTQLENKTTRPRLPRNRD FT AGRLNAVVNQVTNPFSEGEPRETARRHHYGENYNRDEFSPDERRPMSRTGN FT ERAGRTSALNKYSSRYDDSRDRGRAYPSDSDSDEDPRDRRHPQRRAPDRDR FT EPPRRGPNRRNHRNPAGFWPSGNDPEDDPSEPSDDNIVAINNGNPRRDVRP FT RGGMPFHQAVNFIPQFGGDPDDLNQFCNSVRKVLYSFGREYEDYLLMYIAN FT KLKGKAADGYRARTTSYLSVEQLLNDLTLQYGNIGIADEVQAQIKVLKQKS FT GEAVGDYGLRMQRLHNRLLTIIESAPDLSSFDRKARRRYADDDALQQFTFG FT LVSPLDHQVRSERPRTLNEAIKIALEFEGKQSARRIMYGDIEASIPAMLAL FT PAPARVRRAVAEEAKPPDNQAVETNANNQQSKYCNYCDMKNHALKDCGVLI FT KHAQKNIIMNPNRKNFDNNSNNNGNNSGNNSNNNGNNNNNNNTNGGQRNYR FT KNNRYNKNNHSNNNNNNNSNNNNGNNGNNNSNNNNDSSNNHGVNGNANGAG FT DSNSQNLNRERRRRQQFPEFKLDGRSPRSSSVERRYIEPESKQKTESANIR FT FLTAGRQPPIVNIECPQLINKKGKFYADSGADISIVKIGKLAPGYPIDTHK FT IIKVQGVTPGTAHTLGQAVIQLQGLECDVQVVPNDFPIENAGIIGWDTIDR FT HKGCVDAANKRLKFGDVSVPFETDERITVPPRVKMVISARVRDSSVKIGWV FT PLMDLHPNLLFGNFVAENRNGRIFAECINIGDTEVSVASPIVDLQECETIA FT AHPLYQADGDGSPDRVAEFSASLRRMFDANKLEQKYREVNTLNEKMRANNK FT ARLERVEKISQLADLEGCNAEEVEYIREIIDEYSGVFGLEGEPLPATHLLQ FT HKIILKSNKPVKCQRFRFPPALKEHMIRELQKLREQDIVVPSNSDYSSMLW FT IVPKKPDANGNRKFRLVTDFRAINEISEGSCHPLPFTSDILALHPPTIYSC FT LASANYITVMDLKQGYHQIEMHPDSAHLTAFYAPDGRHGNQLLQFNRMAMG FT LKEATITFTRAMSLAMAGLQGDEVEIYLDDLMIFSETLDEHRVRLRRVLKR FT LLDANMTVEPRKCQFLKREAHVLGHIVGGGFIKTDPQKIRAMAEYAVPTNP FT KKLKQALGLFSYYRRFIENFSKVAYPLFKLLKKDTKFVWGAEEQAAFDELK FT MLMCKEPVLKTPDLEQPFIVTTDASDFALGAILSQGKLGKDQPCAYASRCL FT KGSELKYPTYDKELLAIVFAKEQFRHYLYGRKFTIVTDHEALKHFHTSKKP FT DLRFNRLKAALVGYDFDIIYRPGNKNANADALSRNPMLKPGETNPELPRLE FT LYELAEKQIEADPDEEAGAPPGRIFRARAIKQLGVNKNQKAKQAISDSDES FT RKTSEIKKKCNDAALIYKAGETLAVRNEFEDFYLCRALHDIYHGDDLIKIQ FT WFTEIENCKFTYALDYHDSTRFGCILTNTSLKKAGKNRYTLSKTERERIRK FT ILAESIKFRASPRIVQSSTAAMDIGKEPDSDATMSTVQSVVSNSSSCGVIL FT PPGAKEYLNYVAPLSVVTRKAKASQSQISRKPYSPSNTSSSDESDVIQASG FT TALEIEPLQSFKFPAKPVIVDEASIRSKTSSPESPRRMHAGNPDQREASVT FT HTPRVDANKVTAPRNEHAPRPHGTPVKTRTPEIWQVCNGKFQKKLVMNIEE FT KEIKGRDGMQINIYEAEPSVASEKGKTNDSSVIETPEIPNDNARASNALEK FT SKTPRARKTSAAAGMEGQISRNGDPQTREKTPEPGSGTKALAHQSADESEF FT GDVGPFEDFAGIPRNYDRKPDPSAENSSISSGNVLMRTYIRKKRCAEPYVE FT LVRSDDIPSSPPGNCLFFSLIKAAKLNISAIDLRKHLQYLGNRPKETFAIN FT SCGEPSETARILKSKNEYGTVDCAYLFAQEFNFNICIHYDLPNKHRILCHV FT LSSEAKEFLHLNLTGQHFTPYIRVNVPPPTTKPTKRQRDPSPTSDGSHSSA FT KSLVFGKKKAEKRKQRFKPDSSKQEARKGRDRRGAESAMSVGSNIIIATCT FT SPDPSAQPTQAANESPNISNVSLVLNPATSIASPPRTRISSFDNSEKSCAD FT APPHNQSHPNFPESPPQINSGDNCEDAPILASPNLRASTSAAKRIRDAGNS FT IGTSDDEKTAAAVTAALSHEKMKRYVVAGARPPREKPPWAIPEGQEPQPFV FT HLQSLSEHPFRYKENLVYLTSSDNYLSTEVQEALVERGYVNPEDLENKKFG FT IGEINVIDCKGFSVIGVYIKAHFDDRPLRVDLIKCLRNLKNVMIARGLNSV FT ALIRDLEMLTPVEWTKLIELFDDTFMNKRVAAILYKNNLPVPPVNERFKLI FT QEYHIAAMGGHRGMTKTYSKIANDFYWRNMRPDIKQFVARCATCQSNKLVR FT VKTRLPMLISNTPSTPFTQIALDFYGPLENSRRGKSTSYPSKTC*" XX SQ Sequence 12855 BP; 4127 A; 3277 C; 2962 G; 2488 T; 1 other; ttttggcatc ccttgtggcg cgaggattcg cagcagaaga gacaactgca tcataaatta 60 tctgcagtca cgagaaatcc agatcgatag aaggaaatta atcctcgtaa gaagaaaagt 120 tcgtcacaga acacaccgtg ccgcataaaa agcacaataa taaaataaag aagtagtagt 180 gaggagctct agcagtcgcc gaggctcgtg taaatcaaag cacgagctca aagaaatttt 240 tttttgtctc acgggagccg ctcgagcaat cagctcaaaa atcgccttaa agaagtgagc 300 ggcaatccaa caccaggcaa ccgccggcga gcgttgccaa aatcctcaaa gtccatctgg 360 accacctaag cccacctggg ccaacgaata aagctgcaca accatcggcg cggaccagcc 420 tccacgcgca gcacagtgtc gacgcggacc agccgtcgat acgcgcgact acaaaacctc 480 cagcccagcc agcagccacc ggttccacat cagtcatcat cagtcatcag ctacaggaat 540 accatgcgag ccgtctacgc agcgctattc atcgcaatgt aagtcgtttt ttttactaga 600 cgcaatttgg cttgttcgcg tgctttcgta ctataatacc tcggaggccg tccaacttgt 660 aaaagagtca acgcgcggaa taaagattag tcttcccgtc tttacattcc ctcctataca 720 accaaccatc agcttcggaa ataccatgcg agccgtctac gcagcgctac tcatcgcaat 780 gtaagtcgtt tttttactag acgcaatttg gcttgttcgc gtgccctttc gtactataat 840 acctcggagt gccgtccaac ttgacaataa aagagtcaac gcgcggaata aagattagtc 900 ttcccgtctc ttacattcct gggccctcct atacacacac agtagtcaat agacaattgt 960 caaaaacagt catggctcct acggggatac tcggcgcggc gcggggcgta ctcgggctag 1020 gcgcgagaaa gaaacagcta gataccggag cgaaaagcga accagaatgg gatagcgaac 1080 aaatgcgata cgcgcggaac gtatatagaa acacccaaaa gatgactccg cgcacgcgta 1140 gggtagtgcg aaagcatatc agtaaagatt gtttactatc aagcgattcc gaaaatgagc 1200 ctcgggacgc gagaatggcg gtcaactgga tcgaggcgcg aaagaaccta ttagacgaac 1260 acagccgacc gccaaaatca tacgaactgc ctaaacaatc cgacgatagc ggggaaaagc 1320 taatcgtatt agacgacgag aaccaagccc ggaacgaagc ggacgcgagc gcagaggccc 1380 ccgaatcaaa cgtagcagac cctagtaaca ccgacgcgaa cgcagaactt gcaggagctg 1440 cagccgcgaa tcgggaaaat gcagtaagag tagtagacgt agcgggagta caagaaaaat 1500 tcggtcggct aaatttaaga actaacattt gcgataagat ggaggtccca ctcgcggcaa 1560 aggaacggct cgcaatcaac gaaaaatgcg tgtccggcgt atcaagtctt ttcgaaataa 1620 aaaatatcgc gcttggcaat ggtctaagcc aaatcctgcc gaaaaccggg atagacgata 1680 ggccttgggc aggtagatct caaaccagca tcggccccgg gcatagaagc tacggcggca 1740 actcgcaaaa tacagctcaa agaaaaatcg cacctatcga aattcccgca gctcagcgac 1800 ccgtgtgcaa gttcgaactg tcccgggaag aactcgaagt aatattgaga atgcgcgaag 1860 accggctact caaagaaagt cgcataaatc cgaagccaat catagaccaa ttttccagtg 1920 atgaaagcga gagcgagcac cggagcgtac acaaggcggt agtacataaa acgcaacttg 1980 aaaataaaac caccagaccc cgtctcccca gaaacagaga cgccgggcgt cttaatgctg 2040 tagtcaacca ggtgaccaac ccgtttagcg agggcgaacc ccgggaaacc gcgcggcgtc 2100 atcactacgg cgaaaattat aaccgcgacg agttcagccc cgacgagcgt cggccaatga 2160 gtcgaaccgg aaacgaaaga gcaggaagga ccagtgcatt aaataaatac tcatcgagat 2220 acgatgatag ccgagaccgc ggcagagcct acccgtcgga cagcgattca gacgaggatc 2280 cgcgcgatcg cagacacccg caaaggcgtg caccggaccg cgatcgcgag ccaccgcgtc 2340 gaggaccgaa ccgccgaaat caccgaaatc ctgccgggtt ctggccatca ggcaacgatc 2400 cagaggatga tccaagcgag ccgagcgacg acaatatcgt agcgatcaat aatggcaatc 2460 ccaggcgcga cgttcgcccc aggggcggca tgcccttcca tcaggcggta aatttcatcc 2520 cgcaattcgg cggcgacccc gacgatttaa atcaattttg caattccgtc agaaaagttt 2580 tatattcctt cggcagagag tacgaagact acctgctaat gtacatcgcg aacaagttga 2640 aaggcaaagc cgccgacgga taccgcgctc gtacaacaag ttatctatcg gtcgaacaac 2700 tgttaaatga tctaacgttg caatacggaa atatcggcat cgcagacgag gtacaagcgc 2760 aaataaaagt tttaaaacaa aaatcaggcg aagccgtcgg agattacggt ctgcggatgc 2820 aaagactgca caacagactg ctcacgatta tcgaatcggc cccagacctg tcgtcgttcg 2880 acaggaaagc gagaagacgt tatgcagatg acgacgcgct gcagcaattt acgttcggtc 2940 tcgtatcacc tctcgatcat caagttcgga gtgagcgtcc acgcaccctc aacgaagcaa 3000 ttaagatagc attagaattt gagggaaagc agagcgcgcg acgtataatg tacggcgata 3060 ttgaagcctc tatcccagca atgctcgctc tgcctgcccc agcgcgggta cgtagggccg 3120 tagcagaaga agctaaacca ccggacaatc aggcagtcga gacgaacgcg aataatcagc 3180 aaagtaaata ttgtaattac tgcgatatga aaaatcacgc acttaaagat tgcggtgtcc 3240 taataaaaca cgcgcagaaa aacataataa tgaacccaaa cagaaagaac ttcgacaata 3300 actcgaacaa taacgggaac aactccggca ataattcaaa caataacgga aataacaata 3360 acaataacaa cacgaatgga ggacaacgca actaccgcaa aaataatcgt tataacaaaa 3420 ataaccacag taataacaac aataacaaca atagcaacaa caacaatggt aacaacggca 3480 ataataacag taataacaat aacgatagtt caaataacca cggcgttaac ggtaacgcga 3540 acggcgcagg agacagcaat tcccagaatt taaactagat gggcgctcgc cgcgctcctc 3600 aagcgtcgag cgtcgataca tagagccgga atccaagcaa aaaaccgagt cagcaaatat 3660 aagattcctg actgctggcc ggcaaccccc aatagtaaat atagaatgcc ctcaattaat 3720 aaacaagaag ggtaaatttt acgcagattc gggagcagac atatccatag ttaaaatcgg 3780 gaaattagca cccggttatc caattgacac gcacaaaata ataaaagtgc agggagtaac 3840 ccccggcaca gcgcacacgc tcgggcaagc ggtaatacaa ctgcagggcc tcgaatgcga 3900 tgtccaagtc gtaccaaacg atttcccgat agaaaacgcg ggcataatcg gctgggacac 3960 gatcgacaga cacaaaggtt gcgtcgatgc ggcaaacaaa cgcctgaaat tcggcgacgt 4020 atcggtgcca ttcgagaccg acgagcgcat taccgtacca ccacgggtaa aaatggtaat 4080 tagcgcacgt gtgcgagata gtagcgtgaa aataggatgg gtcccactaa tggacctcca 4140 ccctaacctg ctcttcggga attttgtcgc ggaaaatcga aacgggcgca tattcgcaga 4200 atgcataaac attggggata cggaagtatc ggttgccagc ccgatagtgg atttacaaga 4260 atgcgaaaca atcgcagcac atccactata tcaagccgat ggggacggct cacctgaccg 4320 ggttgcagaa ttttcggcaa gtctaagacg catgttcgac gcgaataaat tagaacaaaa 4380 atatagagaa gtcaatacat taaacgaaaa aatgcgcgca aacaataagg ccaggctcga 4440 acgagtcgaa aagatttcac aattggccga tttggaggga tgtaacgcgg aagaagttga 4500 atatattcgc gaaataatcg acgaatactc gggcgtattt ggtctcgaag gagaaccgct 4560 accggctaca caccttctac agcataaaat catattaaag tcaaacaaac cggtaaaatg 4620 ccaacgcttt agattcccgc ccgcgctcaa agaacacatg atccgcgagt tgcagaagct 4680 tcgcgaacaa gatatcgtcg tgccttcaaa ttcggactat tcctcaatgc tttggattgt 4740 cccaaaaaag ccggacgcga acggtaaccg aaaattcagg ctagtaaccg acttccgggc 4800 gataaacgaa atttcagagg gaagctgcca tcccttaccc tttactagtg atattcttgc 4860 cttgcatccg ccaactatat aacggtgatg gacctaaaac aaggctatca ccagatagaa 4920 atgcatcccg attccgcgca tttaaccgca ttctacgcgc cggacggcag acacgggaat 4980 caattattac agtttaatag aatggctatg gggctgaagg aagccaccat aacgttcaca 5040 cgcgctatgt ctctggcaat ggccggactc caaggagacg aggtcgaaat atacctagat 5100 gatcttatga tattcagcga gactctcgac gaacatagag tacgtctccg ccgggtattg 5160 aaaagattac tagacgccaa catgacggtg gagccgagga agtgccagtt tctaaagaga 5220 gaggcgcatg tactcggcca catcgtaggc ggaggcttca ttaaaaccga tccccagaaa 5280 ataagggcaa tggcggaata cgcggtacct actaacccca aaaagctcaa gcaggcccta 5340 ggcctgttca gctactatag gaggtttata gagaacttct cgaaagtggc atatccgctg 5400 ttcaagctcc taaagaaaga cacgaaattt gtatggggag cagaagagca agccgcgttc 5460 gacgaattaa aaatgttaat gtgcaaggaa ccggtcctaa agacgccaga tctcgagcaa 5520 ccctttatcg taacaaccga cgcgagcgac tttgccctag gcgccatcct aagccaggga 5580 aagttgggaa aagaccagcc atgcgcgtac gcttctcgct gtcttaaggg cagtgaatta 5640 aaatatccta cttacgataa agagttactc gcgatcgtgt ttgccaaaga gcaatttcgg 5700 cattacttat acggccgaaa attcacgata gtcacggacc acgaggcgct caaacacttc 5760 cacacatcga aaaaaccaga cctaagattt aaccgtctaa aagcggctct tgtcggatat 5820 gatttcgaca taatataccg cccgggcaac aaaaacgcga acgccgacgc gctttcgcgc 5880 aatccaatgc tcaaaccggg tgaaacaaat cccgaattac cacgactaga attatacgaa 5940 ttagcagaaa aacaaatcga agcggacccg gacgaggaag ccggcgcacc gccgggtaga 6000 attttccgcg cgcgagcgat caaacagcta ggcgtgaata aaaatcaaaa agcgaaacaa 6060 gcgatatcgg actccgatga gtcaaggaaa acaagcgaaa taaagaaaaa atgcaacgac 6120 gccgcgctga tttacaaagc cggagaaacg ttagccgtaa gaaacgaatt tgaagacttt 6180 tacttatgcc gcgcgcttca cgacatttac cacggcgacg atttgatcaa aattcaatgg 6240 tttaccgaaa ttgaaaactg taaattcaca tacgcgttgg actatcatga ttccacgaga 6300 ttcggatgca ttttaactaa tacttcgtta aagaaagctg gtaaaaatag atatacattg 6360 tcgaaaaccg agcgcgagcg cataaggaaa atactcgcag aatcaatcaa attcagggcg 6420 tcgccgcgaa tcgtgcaatc atcaacggcc gcaatggaca tcggaaaaga gccagactcc 6480 gatgcaacta tgagtacagt gcaatcggtc gtttcaaaca gctcgagctg cggagtaatt 6540 ctaccccctg gggcaaaaga atatttaaat tacgtagcgc cactgagcgt cgtgacgcgc 6600 aaggcaaaag ccagccaaag ccaaatatcg cgcaagccct attcgcccag caacacgagc 6660 agcagtgacg aaagcgacgt aatacaagca tcaggaaccg cgctagaaat agagccactc 6720 caatcattta aattccccgc taaacccgtg atcgttgacg aggcatcaat acggagcaag 6780 acaagctcgc ccgaatctcc tcgacgtatg cacgcgggaa atccggacca acgcgaggcg 6840 agcgtaacgc acacgcccag agtcgacgct aataaagtca cggcgccgcg caacgaacac 6900 gcgccgcgtc cacacggcac tccggtcaaa acccgtactc cagaaatatg gcaagtatgc 6960 aacggtaaat tccaaaagaa actcgtcatg aatatcgaag aaaaggagat caagggtagg 7020 gacgggatgc aaattaatat atacgaggca gaaccatccg tagcgtccga aaaaggaaaa 7080 acaaacgaca gctcggtaat tgaaacgccg gaaatcccga acgacaacgc gcgagcgtcg 7140 aatgctttag aaaaaagcaa aacgccgcgc gcgcgaaaaa cgagcgcagc tgccggaatg 7200 gaaggtcaga tttcgagaaa cggagatcca cagacgcgag agaagacacc cgaaccagga 7260 agcggaacga aggcgctcgc gcatcagagc gctgatgagt cagaattcgg agacgtcggc 7320 ccattcgagg atttcgcggg cataccacga aattacgatc gtaaacccga tccatccgcc 7380 gaaaactcgt cgattagctc gggaaacgtg cttatgcgta catatatacg gaaaaagcgt 7440 tgcgccgaac cgtacgtaga gctggtacgc tcggatgata taccaagctc gccgccaggg 7500 aactgcctgt tcttttcgct gatcaaagca gcgaagctta atatctcggc aatagaccta 7560 aggaaacatt tgcaataaat tcctgtggtg aaccttccga aaccgcgagg atacttaaat 7620 ccaaaaatga atacggcact gtcgattgcg cctacctgtt cgcgcaagaa tttaatttta 7680 atatatgcat tcattatgat ctaccaaaca aacaccgtat actctgtcac gtgttgtcga 7740 gcgaagcaaa agaattttta caccttaatt tgacgggtca gcatttcacg ccgtacatcc 7800 gcgtaaacgt tccgccgcca accacgaagc ccacgaagcg acagcgcgat ccatcgccca 7860 ctagtgacgg cagccattcg agcgcaaagt cgctggtatt cggaaagaaa aaagcggaaa 7920 aacggaaaca gcgctttaag ccggattcat cgaaacaaga ggcccgcaag ggacgcgaca 7980 ggagaggcgc cgaatcggcc atgagcgttg ggagcaacat tataatcgcg acatgcacga 8040 gcccagaccc ttctgcccag ccaactcaag ctgcgaacga atctccaaac ataagcaacg 8100 tgtcgctagt cctcaaccca gccacatcta tcgcctcacc accgagaact agaatttcga 8160 gcttcgataa ttccgaaaaa tcatgcgcag acgcccctcc gcacaaccaa agccacccaa 8220 actttccaga aagcccgcca cagataaact cgggggacaa ctgcgaggac gcgcctatac 8280 ttgcaagccc gaacttgaga gcaagcacgt ccgcggcgaa gcgtattaga gacgcgggca 8340 atagcatcgg cacaagcgat gacgaaaaaa cagccgccgc tgtaacygca gcgttatcgc 8400 acgaaaaaat gaaacggtac gtcgtggccg gggccaggcc tccacgcgag aaaccacctt 8460 gggctattcc cgagggccaa gaaccccagc ccttcgttca cttgcaatcc ttgagcgagc 8520 accccttccg gtataaagaa aatttagttt acctaacctc atccgacaat tatttatcga 8580 cggaagtaca ggaggcgctc gtcgaacgag gctatgttaa cccggaagac ttagaaaaca 8640 aaaagttcgg aatcggcgag ataaacgtga tcgactgcaa aggatttagc gtaatcgggg 8700 tttatatcaa agcgcacttc gacgatcgtc ccctacgcgt agatctgata aaatgcttgc 8760 gaaacctaaa gaacgtcatg atagctaggg gcttgaacag cgtagcgtta atccgagacc 8820 tcgagatgct gacaccggtc gaatggacaa aattaataga attatttgac gacacattca 8880 tgaacaagcg tgttgccgcg atactttaca aaaataacct gccggtaccc ccggttaatg 8940 agcgctttaa attaatccaa gaataccata tagcggcgat gggtggacac cgtggaatga 9000 cgaagacgta cagcaagatc gcgaacgatt tttactggcg caatatgcgc ccagacatta 9060 agcaattcgt cgctcgatgc gccacctgtc aaagcaataa gcttgtccgc gtaaaaactc 9120 gcttgccaat gctgatcagc aacacgccat cgaccccatt cacgcagata gcgttagatt 9180 tttatgggcc cttggagaat tcgaggcgcg gaaaaagtac atcttatcca tccaagacat 9240 gttgacaaag tacataattt taatccctac aaagcatgca agcgccgacg aagtggcgcg 9300 agccctaacc gaaaaggtca tttgcgtatt cgggccacca gccgctatcg taacggacca 9360 gggctcacac ttccagaatc gggtactgga aaagctcgcg aaaatttttc aaataaaaaa 9420 gttctgcacc acagcttacc acccgcaatc gaatgggtca atagagcgca tgcatcacac 9480 gctcacggag tacttacgca aatacgtgaa agacactacg cgatgggacg agtggacagc 9540 gatttgccaa catgcataca actgcacgga gcacgagagt acccgttact cgcctcacga 9600 gctgcttttt ggtacaaagc ctcgcactcc gtccagcttc acgccaagca tcgacgacgt 9660 tacgtacaac caatacatag acgaaatgac aactaattta acggcgctac agacgaccgc 9720 ggccatgaat ctggtccagt cgaaataccg gtcgaaatac tactacgatc ggaaacttaa 9780 cacaaaacac ttccgggagg gcgaaaccgt cttcctacta aaagaaccca aaaagggaaa 9840 gcttgaagcg atcgagtacc tcggtccgtt cgaaatcacc gacattaaca gaaaaacgca 9900 caacgtgaca atacgcaacg acgacattac caagaccgta cacataaaca aattaaagcg 9960 gccgagtgaa ctagccaggc agatgaacgc gtcggaagaa gagtaggagt aggacgtttt 10020 ttttctgttt ttttttcgcg gcaccggacg tgccgctcaa aaaaaaaacg taacatgtat 10080 accccagaga gattagtctc cggcgtcttg ctacaagcac cagcgggcta ctggcaataa 10140 tcactaacgc caactaatat tttagggtag ccgcgggaag ccgccaggag aacctcaacg 10200 cgatccctga ccccggatcg gccccattca ttcgcctatt ggaagaaaac gtaggcctaa 10260 tcaccgagaa aatttcaccg ctcgccacgt ccagcacaga ctggaaaata atccagaaag 10320 tggacctacg gccgtacttc aaagcgagcg aaatgctaga gaagcatact gcgttggtgg 10380 cgcacgcatg cggcccgcac tgcgaaatcg gggagctgca gcaagcggtt aaagaagccg 10440 tacgccaggc gggaagggtg ctagatttgt tgctcgtaca taacatcggg gatgaaccac 10500 gccgcgcacg aagatcgcta ctcccattta tcgggacaat acacaaattc ctgtttggca 10560 cgctaacaga agccgacgaa gcagaaatcc aagaggccgt aagagccata gcaaatgata 10620 ccaaaatcac tgcggcgctg ttggcaaacc aaacagaaat tatcgatcgc gccttatcca 10680 acctggatac aaagttgaca aggctagaag ctacaacagc catcctaata aacaaatcaa 10740 tcacatcgga caacgagatc gcgatacgca gtgcggtaca aactgtaaag gacaacctac 10800 tgcagtttaa aatggacacg gaagtcctga cagatgcaat tctcttcgcg acacagggca 10860 tggttcatcc gcgaattatg ccaccagaaa cgctattata cgcggcgaaa acagcagcca 10920 acactgtcgc gaatgcgaaa ttcccctcgc cagacggcaa cttctccgcc ataccaatcc 10980 ttaaaatatc caaagtaacg gtacttctgg ccaacagcca cctggtgtac cagatagcca 11040 tcccgctctt ggatatccaa aaatttaatt tatttaaagc ctcgccgctg ccatctgtgc 11100 agcgcgcgct aaatatttca aatatcgcgg cctatatttg gccctgaatt ccattttttt 11160 gcagtaagcg aatctaacag atcgtacatg ccgatacccc cagaaagaat ccataagcta 11220 cgcaagctcg gtgacctact gattgcagtt aaccctgaac caatcagaga aattcgaagt 11280 aacgtcgcat gcgaaatcaa aatcgcgtca gggcacgcgc tgggaaaggc cgaacactgc 11340 gatattcgaa taaaacagct acgcgacacg gtgtggctca gactccataa aacaaatgcc 11400 tggatttttt ccgtcagcaa cgcagagaac atttacatac agtgccagcg agccgaacaa 11460 ataacatctg aaataagtgg cacaggggta ctcgagttac gcccagggtg ctcagcgcac 11520 actgcgaacg cgcacttggc cgcctcgcgc tcgctaacat cttacgcaaa cgcatcggtt 11580 ttcgatatag tcgtattcaa cgtctccgca atgctgaccg aaataggtag tatcgaaggc 11640 aaagcgatcg aactgcaaca agcaatcgag atcgaagcca aaaatcgagc ggaaaattcg 11700 catggcgtaa aattcgaaac tttggaaacc ggtacggctc ttcgcgatat aacgacaaag 11760 gcgcgggaaa tcgcggaccg caaagaaaag tcattcgagc tagagaattt gtcaagcttc 11820 acgtcaaaat tcagctactc aagctggacg gctgtaataa taataattgt aatcataagc 11880 ggcgtctgga ttatcaaaaa gcgtcgcgcc ggtcagccga tgcgcaaact aattaagcag 11940 caggaaaaac agctagagct gcaaaccttg agggagatga accgggctag gagccgtgat 12000 tagaactgcc atctacgcat cgttcacaaa taatttcttt ttctctctgt acaccattca 12060 ccagcaagag aatcctcctg cccgcagttg gatacttctc cgttcataga ggtacggagg 12120 cttcgaagcc ggataaatgg aactaagtta cgcgcagtga aacaatgatt gcttcaatat 12180 attatgagtt taaaaaaaaa aaaaaaagcc aaattgcgac tagtactaaa acgacactag 12240 ccgcgcagaa gcttaaaaag ctctctacca taataataat gaatgtttct ttttcaggaa 12300 ttagaaaatc aaaatgttat cgttaaataa ttcactatgt aacttacgtc attatatatt 12360 agcttctcat gggttttata tattactata tttgactatt tcaggggtac tactattagt 12420 attaatattt gggcgtaaaa caaaataacg attagtcaat ttttcataaa atatcattta 12480 tctcataaaa taatcagcgg aattattatt cagtctaccg cacatctagt attaaggaat 12540 agaattatgt acatacagga gctccgttcc atacaaattg tgaagcagca gcggccgcta 12600 aggaacaaca tccgcgttca cacgcacaca ctcacacata cacacacttg actagcgtct 12660 cgcacgcaga agacggagga ggacgaaccc acacactctc ataatcctac tgtaagccag 12720 tcaaaaaagg ggacacacat aaatacgcat acattcacac acacatgcac ttttcggcgc 12780 ggtgttctga ctattttttg cataagttct gtttttttcg ggcgacgggg acgtctgccc 12840 acgcgacctc ggacg 12855 // ID Gypsy-4_SI-I repbase; DNA; INV; 4106 BP. XX AC AEAQ01007762; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_SI_; KW Gypsy-4_SI-LTR; Gypsy-4_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4106 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01007762; Positions 4391 286. XX CC Positions [3072-3542] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 75..3992 FT /product="Gypsy-4_SI-I_1p" FT /translation="MEGIGRLPEPMKFEGNLDENFKKFYQSFELYLVATEK FT DQKADTVKTALLLNTIGSEGIEIYNTFRLTSTQKSDYKVVVNEFKKYCAPR FT KNRTYERFVFNSRNQQVDEPFDKFFGDLKKLIQSCEYADQEDTLLVDRIIL FT GTNDLKVQEKLLNIQNVTLETAVETCRNYEATRRHLQNVRNKEEAAVDVVR FT RQKRHNKQKEERGQASTEENSGKHFKCNKCDKVHGYRECAAYGKTCYKCGK FT MNHFSNVCKTKLKNVKDVTKDEESESEEDDLYVSSIVRVGELSKKTKSIWT FT EIIEVNGKALKFKIDTGSEVNIIPLVIYKSIAADGSQRIRQTRTLLQAYGG FT GKIKPIGKVKLRCKNTDRDEVLEFIVVDLNVKLILGLPSIQKLGYIKHINN FT IQINSEKEKFIQSNIDIFTGLGTFKDTCTITLKKNSEPVARPCRRVPLTIK FT SKLKSKLEQLEKQKIIAKVNGASEWVNSLVIIEKPNKTLRLCLDPQELNKC FT IEREFFEIPSFEEINSKLSGKKYFSVLDFKNGFYQVKLDSESSKFTVFSTP FT FGCYKFLRLPFGIKTAPEIFQKINQKNFGDIDNVIIYFDDLLIAANSKEEH FT DKILHKVIDRAREIGVKFNKEKVQYRKTEVKYMGHIFNREGMRIDKDRIRA FT IDELKTPTCKKELQRLLGLVNYVRKFIPKLGEIASPICDLLRKDVEFQWLE FT AHDKALSTIKSEINKNTVLMNFDPTKQITIQTDASLNGIGCCLMQEGRPVA FT YASRGLNETEKKYAVIEKEMLSIVYATQRFHNYIYGRQIKVITDHKPNVCV FT LNKRICQVHSPRLQRLKLKLLKYDIKLEYLPGKYLYVADCLSRNYGNEVGK FT LDKDMNEIVHSVEKHLRMSENRKEEFRSHTKRDVILSDVCKYCMNGWKCKK FT YEGELKIYYQIRNDLHVYRDMIFYNDRVVVPKRLRERMLQLLHEGHFGINR FT TYDRAREILFWPGMSTDIKNMIQNCKVCERYRYANVKEPLVAHEIPKLPFQ FT KIASDILEFGGKSYLVIVDYLTKWLEIILLRSKQSCDIIETFKKVFATHGI FT PDIVIADNMPYSSYECQRFAKEYDFEFQTSSPGYPKSNGLAERFVQTAKNI FT LRKSEDLNGALMEYRNTPITSLKRSPAELLYQRKLKTKLPVVEKVNTKIRK FT FRDELQNRNKIIKKYYDRHAKPRDDFKKGDTIVYKNKKQWQPAVVVSKHKS FT PRSYLINTGSNILRRNSNHLKKSVIKHETSDTEIHNELPETSINQEINNNN FT NNIQNDSNRDNVASDNEAKDEISKESVRPVRARKVPSKFNNYVLY" XX SQ Sequence 4106 BP; 1707 A; 522 C; 801 G; 1076 T; 0 other; tggcgcagtc gtggaagaac attgaaaaca aaataaagta atttgtttaa agaactgcaa 60 cataaaaaca aataatggaa ggaattggaa ggttaccaga accgatgaaa tttgagggaa 120 atctcgatga gaatttcaag aaattttatc aaagttttga actgtatctg gtagcaacag 180 aaaaggatca aaaagcagat acagtaaaaa cagcattatt gctaaacact ataggcagtg 240 aaggcatcga aatatataac acatttaggt taaccagcac acagaaaagc gattataagg 300 tagtggtaaa tgaattcaag aaatattgtg ccccaagaaa aaacagaaca tatgaaaggt 360 ttgtgttcaa cagtcgaaat caacaagtgg atgaaccttt tgacaaattc ttcggtgatc 420 tcaaaaagtt aatacagtca tgtgagtatg cggaccaaga agacacatta ctagtagata 480 gaataatact aggcacaaat gatttaaagg ttcaagaaaa actacttaac atacaaaatg 540 ttacattgga gactgcagta gaaacatgca gaaattatga agcaaccagg agacacctgc 600 agaatgtaag aaataaagaa gaagctgcag ttgatgttgt tagaagacag aagcgacata 660 acaaacaaaa agaagaaaga gggcaagcaa gcactgaaga aaatagcgga aaacacttca 720 aatgcaataa gtgtgataaa gttcatggat acagagaatg cgcagcatat ggaaaaacat 780 gttacaagtg tggaaagatg aatcatttta gtaatgtatg taaaacaaaa ctgaagaatg 840 tgaaggacgt cacaaaagat gaagaaagtg agtctgaaga agatgatctg tatgtaagca 900 gcatagtcag agttggagag ttgtcaaaga agaccaagtc aatttggacg gaaatcatag 960 aagtcaatgg gaaagcattg aagttcaaaa tagatacggg atctgaagtc aacataatac 1020 cactcgtgat atacaagtca atagcagcgg atggaagtca acggatacga caaacacgaa 1080 cattactaca agcatatgga ggaggaaaaa tcaagccaat aggaaaagtt aaacttagat 1140 gtaaaaatac agatagggac gaagtattag aatttattgt tgttgattta aatgtaaaac 1200 tgatactcgg gttgcctagt attcaaaagt taggttacat taaacacatt aataacattc 1260 agataaatag tgaaaaagaa aagttcatac agtcgaatat tgatatattt acaggtttag 1320 gtacatttaa agatacttgt acaataacat taaagaaaaa cagtgaacct gttgctagac 1380 catgtagacg tgtaccttta acaattaaaa gtaaactaaa atcaaaactt gaacagttag 1440 aaaaacaaaa aataatcgct aaagtcaatg gtgctagtga atgggtaaac tccttagtaa 1500 ttatagagaa accaaataaa accttgcggc tctgtttgga tccacaagaa ctgaataaat 1560 gtattgaaag agaatttttt gaaattccta gttttgaaga aataaatagt aaacttagtg 1620 gaaagaaata cttttcagta ctagatttta agaatggatt ttaccaagtt aaattagata 1680 gtgaatccag taaatttaca gtattttcca ctccatttgg atgttataaa tttttaagat 1740 taccatttgg tattaaaaca gcaccagaaa tatttcaaaa aataaatcaa aagaattttg 1800 gagatatcga caatgtaatt atatattttg atgacttact aatagctgct aacagtaaag 1860 aagaacacga taagatatta cataaagtta tagatagagc tagggaaata ggcgttaaat 1920 ttaacaaaga gaaagtacag tacagaaaaa ctgaagtaaa atatatggga cacatcttca 1980 atagagaagg aatgcgtata gataaggata gaataagagc aattgatgag ttaaaaaccc 2040 caacatgtaa gaaagaatta caaaggctgt taggattagt taactatgta agaaaattta 2100 tacctaagtt aggtgaaatt gcaagcccaa tatgtgatct tctgagaaaa gatgtagagt 2160 ttcaatggct ggaagcacat gacaaagcgt taagcactat taaaagtgaa ataaataaaa 2220 atactgtatt aatgaatttt gatcctacaa aacagattac aatacaaaca gatgcttctt 2280 tgaatggtat tggatgttgt ttaatgcagg aaggaagacc tgtagcgtat gcttctagag 2340 gacttaatga aacagaaaag aaatatgcgg tgatagaaaa agaaatgctt agtatcgtat 2400 atgcaactca aagatttcac aattatatat atggtagaca gattaaagtt ataacagatc 2460 acaagccgaa tgtatgtgta ttaaacaaac gcatttgtca ggtgcattca ccaagattgc 2520 aaagattaaa attaaaatta ctaaaatatg acataaaatt agaatactta cctggtaaat 2580 acttatatgt agctgactgt ttgtctcgaa attacggcaa tgaagtagga aaattagata 2640 aagatatgaa tgaaatagta cattccgtgg aaaaacattt aagaatgagc gaaaacagaa 2700 aggaggagtt tagatcacat accaaaagag atgtaatatt aagtgatgta tgtaagtatt 2760 gtatgaatgg ttggaaatgt aaaaaatatg aaggagagtt aaagatatat tatcagattc 2820 gaaatgacct tcatgtatac agggatatga tattctataa tgacagggta gtagtaccga 2880 agagattgcg tgaaagaatg ttacaattat tgcatgaagg acatttcggg ataaaccgta 2940 catatgatag agcacgcgaa attctatttt ggccagggat gtcaacggat ataaaaaata 3000 tgattcaaaa ttgtaaagta tgtgaacgat ataggtatgc taatgttaag gaacctttgg 3060 tagcacacga aataccaaag ctgccatttc agaaaatagc ttcagatata ctcgaatttg 3120 gcggaaagtc gtacttagtt attgttgatt acttaacgaa gtggctcgaa ataatattac 3180 ttagaagtaa acaaagttgt gacattatag agacatttaa aaaagtattt gcaacgcatg 3240 gtattccaga cattgtcata gcagacaata tgccatactc ttcttatgaa tgtcagcgct 3300 ttgcaaaaga gtatgatttc gaatttcaaa caagcagtcc aggatatccg aaatccaacg 3360 gacttgctga aagatttgta caaacagcta agaatatcct gaggaaatct gaggatttaa 3420 atggagcact tatggaatat cgaaatacgc ctattacaag tcttaaaaga tctccagcag 3480 aactattata tcagagaaaa ttaaaaacga aactacctgt agtggagaaa gtaaatacaa 3540 aaataagaaa gtttagggat gagctgcaga ataggaataa aataataaag aagtattatg 3600 acagacatgc gaaaccacga gatgatttta aaaagggaga tactatagtg tataagaata 3660 aaaagcaatg gcaaccggca gtggtagtaa gtaagcacaa gtctccgcgt tcatacttga 3720 ttaatactgg ttcaaacata ttaagaagaa atagtaatca tcttaaaaaa tcagtaataa 3780 aacatgagac aagtgacaca gaaatacata atgagttacc agaaaccagt atcaaccaag 3840 agataaataa taacaacaat aacattcaga atgattctaa tagagataat gttgcaagtg 3900 ataatgaggc caaagatgag attagtaagg aaagcgtaag acccgtccga gcgagaaaag 3960 taccgtctaa atttaataat tacgtattat actaaatgtt aaatttacaa acttatttgg 4020 tgtgaagcag atattatttt ctagttttgt ttgtattaat ttcatttaaa gttttggtat 4080 tatatcactt tcgaataagg gaaagg 4106 // ID BEL-638_AA-I repbase; DNA; INV; 6011 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-638_AA_; KW BEL-638_AA-LTR; Pao_Bel_Ele78; BEL-638_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6011 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5016-5606] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 489..5969 FT /product="BEL-638_AA-I_1p" FT /translation="MSKMSMTRTPIVTRSGKKKSSALEDTVCEDKTTGGED FT SVNGNATPVKASNQNSDQVSISEQKKNEKNKEAMKKMAAEYKVLKNQMSAL FT KRKLVRVKAAMEADTDDPNQNLDCKQFLQLQLKVVESVNDDYNTHQQRAHG FT LDIGDDDRDELETLCIDFEELYGELYIVLNKLLETSSKKDVISVQLPASPT FT ANMSQLPPLKVPLPTFDGSYENWYAFKSMFETIMARYSSESPAIKLYHLRN FT SLVGKASGIIDQEVINNNDYDAAWATLTERFEDRRLIIDKHIDALFDLPKL FT TDENATGMRKLLDTCTKNVDALKNLGLPVSGLGETMLVNRIALKLDHETRK FT AWELEQVSTTLPTYESTLDFLRERCRVLEKIRPLSRPPVKTPSKPPLRSAV FT GSQVRASSLVTTTEICPQCSGSHELWKCDVFKKASIADRYDTLRRVGACFN FT CLQKGHRTTDCSSKNSCKKCNKRHHTTLHPADRRQRKAEDSTPETPQPRPE FT KKPTPTPSTQQIAETNDTEVQTALCTHSKGEGKQVLLSTAVVLVYGKADVY FT PCRVLLDSGSHSNFVSEHFATLLSVKKDSANVSICGLNDINTTVRLKIHTK FT IKSRISDFTACLDFFVVPRITGHLPPSRVNCSNITIPNDIQLADPNFSIPD FT RVDMLLGAEVFFEMLKSGRMRIPNTSAVLQDTQFGWVLSGPIPSKALEQQQ FT SFFITAEENLDEIVRNFWQIESCCEPEVKTGSDDEICVDHFRETHQRTAEG FT RYVVRLPFNDAKDQLGDSKTMAEKRFFGLERRLDKAPEVKTQYVNFLREYE FT ALGHMTQIDEEEGDDFRPAYYLPHHYVLKPSSTTTRLRVVFDGSAPSDTGI FT SINETQLVGPSVQNDLISILLNFRSYKFAFSADIPKMYRQVEVHEDDVRYQ FT RILWRERQDQPIRIYDLRTVTYGLASSPFLATMALRQLAEDEKDKYPLAAE FT AVIKSFYIDDVLTGASSLEEAIELKDQIIGLLKRGGFEAHKFCTNSEALLA FT TVAEDQRESCVDITDPSVNAIMKTLGVSWSPKEDRFTFVVPDNTNQAVQLT FT KRSILSQIARIFDPLGFVGPAITAAKLILRELWSMNLDWDQPVPHELAKLW FT TDYRDQLHSLNNVKIDRWLYSDGVHAYELCGFADASDMAYGACLYARTLRT FT DGTANMILVCSKSRILPRKKNKQKEITTPKAELLAALLLSQLTAKVLEAID FT TNFNSVRLWSDSQIVLCWLQKSPDSLTVFVGNRVKQIQQLTVGYTWQYVAS FT NENPADLISRGIAPSKLGNQKLWWHGPPRLASSQPFYDQPPALSEDQLPEL FT RRTALVSAPEHSRLPIFDRISRYMIIQRAMAYVIRFCDYIRSSRSQLTKGL FT PTVSEMQRASTLITRLVQAEVFHKEIQAMKDGKQIKLPFQNMNPFIDADDG FT VLRVGGRLRNADIPYQHRHQAFLPEKHPLTINLIRYLHHTNLHLGQRSLLG FT VVRQQYWPLKARNVIRKILHHCIPCFRMRPKKSTQLMGDLPDYRVKPSPVF FT SHTGLDFAGPFNLRSSAISRHPTTTKGYVCVFVCMATRAIHLEAVSDLSSD FT SFLAALQRFVSRRGLVQKLYSDNATNFEGANNQMRRLAELFHDEQHIRAVN FT EYCAPRGIEWSFIPPRSPHFGGIWEAGVKSVKSHLKLILAENRLTFEALST FT VLAQIEAILNSRPLTPASDDPNDLNAITPAHFIIGREFQAIPEPSYASIPQ FT GRLSRLQFIQDMKQKFWRRWMNDYLHELQKRQRDLKVTEFKVGAMVVIIDE FT NAPPLKWALARIIELHPGKDGNTRVVSLRTENGTTKRAVKKICLLPLDDEK FT DDQNM" XX SQ Sequence 6011 BP; 1684 A; 1432 C; 1473 G; 1421 T; 1 other; ttttgtggtc cgatcgatcc ggattgtttc ggttttagat caattgcttc ggttgcagtt 60 gaattcgcga aaaagtggtt tgaaacacgc atccgtcgcg cagtgaacgc tcgtagcagc 120 ttgccgatca gtgacgccat taccgtggca aaacggctgc cctcggatta agtgtttgca 180 gtttttggac gtgctggcgt cgatgctagc agcgttttcg aagaaaagtg cctctgggaa 240 taagcgaatt cccacggatt aagtgctttt ccgaattgat ttgattcgcc tagtgatacg 300 atcacaacgt gtacgcgcat gctcatgtgc cgagaattga gtgaaaacaa aatggccgct 360 atacatgaag attgaaaaag ctatagtgtt tttcgaaacc tgcttccctc gagaggaacg 420 caagtgattt tgttaagggc ggtattcagt gtcgtggaat tgaatccgca tatktggtgc 480 gaatatgaat gagcaagatg agcatgacac gtacaccgat cgtgacgcga agtggaaaaa 540 agaagagcag tgcccttgaa gacaccgtct gcgaagataa aaccactggc ggtgaagatt 600 ccgtcaatgg aaatgctacg cccgtgaaag cgagcaacca gaacagtgac caagtgtcga 660 ttagtgagca aaagaaaaac gaaaagaata aagaagcgat gaaaaagatg gctgctgaat 720 acaaagtgtt gaaaaatcaa atgagtgctt taaaaagaaa gttagtgcgt gtgaaagcgg 780 cgatggaagc tgataccgac gatcccaacc agaacctgga ttgcaaacag tttttgcagc 840 tccagctaaa ggttgtggaa agcgtgaacg acgattacaa cacccatcaa caacgagctc 900 acggtcttga catcggcgat gatgaccgtg acgagctaga gacgctttgc atcgatttcg 960 aggagcttta tggtgagttg tatattgtcc ttaataaatt gttggaaacg tcttcgaaaa 1020 aggacgtgat ttcggttcag ctacccgctt cacccactgc gaacatgtcc cagctgccac 1080 cgttaaaagt accgttgcct acgttcgatg gatcgtacga gaactggtac gcattcaagt 1140 cgatgtttga gaccatcatg gcccgctaca gctccgagtc accagcaata aaactctatc 1200 acctccgtaa ttccttagtg ggcaaggctt cgggtatcat cgatcaggaa gtgataaata 1260 acaacgatta cgacgcggct tgggcaacgt tgacagagcg tttcgaggac cgtcgactga 1320 taattgataa gcatatcgat gcccttttcg atttgccgaa gctcaccgac gaaaacgcga 1380 ctgggatgag aaagctactg gatacttgta cgaagaacgt cgacgccttg aaaaacctag 1440 gtctaccagt ttcaggtctt ggggaaacga tgttggtgaa ccgcatcgct ttgaaactcg 1500 accatgagac tcgtaaggct tgggaattgg agcaagtttc gacgactttg cccacctacg 1560 aatcaacatt ggatttcttg cgcgagagat gtcgcgtgct tgaaaagatc cgtcccctct 1620 ccaggccacc ggtgaaaact ccatcgaaac ccccacttcg aagtgccgta ggatcgcagg 1680 tccgtgccag ctccttagtc actacaacgg agatttgtcc acaatgttcc gggagtcacg 1740 agctttggaa atgtgatgtc ttcaagaagg caagcatcgc tgaccggtac gatacactcc 1800 gacgggtcgg cgcctgcttc aattgcttgc aaaaggggca tcgcacgacg gactgttcat 1860 cgaagaattc gtgcaaaaaa tgcaacaagc gtcaccatac aacgttgcat ccagcagacc 1920 gacggcaaag gaaggcggag gattctacac ctgaaactcc gcaaccgagg cctgagaaga 1980 agccaacacc gacaccatcg acacagcaaa tagcagaaac caacgatacc gaagtgcaaa 2040 ccgccctttg tactcattcg aaaggagaag ggaaacaagt tctgctttcg acagcagtag 2100 ttcttgttta cggaaaggct gatgtttacc catgcagagt ccttttggat tcgggttccc 2160 attcgaactt cgtttcggaa catttcgcta ctttgttgtc tgtgaaaaag gattctgcca 2220 acgtatcaat ctgtgggctg aatgacatca acacaacggt tcgattgaag attcatacga 2280 agatcaaatc tcgtatcagc gatttcacgg cgtgcctcga cttctttgtc gtaccccgaa 2340 taacaggaca tttacccccg tcgagagtga actgttccaa cattactatt ccgaacgata 2400 ttcagctcgc agatccgaac ttcagcattc ccgaccgtgt agacatgctc ctcggtgctg 2460 aggtattttt cgagatgttg aaatcaggcc gtatgcggat tccaaacact tctgcggttc 2520 ttcaggacac ccaatttgga tgggtgttga gcggtcctat tccaagtaag gctttggagc 2580 aacaacaatc gttcttcatc accgctgaag agaacctcga cgaaatcgtg cgaaatttct 2640 ggcaaattga atcctgttgc gaacccgagg tgaaaactgg atctgatgac gagatttgcg 2700 tcgaccattt ccgtgaaaca caccaacgta ccgccgaagg aagatacgtc gtccgccttc 2760 cttttaacga tgcgaaggat cagctcggcg attcgaagac tatggcggag aagcggttct 2820 tcggcctgga aaggcggctt gataaggctc cagaagtaaa aacgcaatac gttaatttcc 2880 tgagggaata tgaagcgctt ggacatatga cacagatcga tgaggaggaa ggcgatgatt 2940 tccgaccagc gtattatctg ccgcatcatt atgtcctgaa accaagcagc acaactactc 3000 gacttcgtgt tgtgttcgat ggttcggctc cgagtgacac tggtattagt atcaacgaaa 3060 cccaattggt cggcccgtcg gtgcagaacg atctgatatc gatattgcta aactttcggt 3120 catacaagtt cgctttctct gccgacattc caaaaatgta tcggcaggtg gaggtacatg 3180 aagacgatgt ccgttaccaa aggatactct ggcgtgaaag gcaggatcaa ccgatccgca 3240 tttatgatct gcgaaccgta acatacggac tcgcatcttc cccatttctg gcaacgatgg 3300 cactccggca acttgcagag gacgagaagg acaaataccc gcttgctgcc gaggcggtta 3360 taaaatcctt ctacatcgac gacgtactta ctggagcaag ctctttggaa gaggcgattg 3420 aactaaagga tcaaatcatt ggactgctaa agcgaggagg tttcgaggcg cacaaattct 3480 gtacgaattc cgaagcattg ttggctacag ttgctgaaga tcaacgggag agctgtgtag 3540 acataacaga tccctctgta aacgcgataa tgaaaacatt gggcgtctcg tggagtccca 3600 aagaagacag gttcacgttc gtcgtgccag ataacacgaa ccaagccgtg caactgacca 3660 aacgatcgat tctcagccaa attgcaagaa ttttcgatcc gttaggattc gtaggacccg 3720 caatcaccgc tgcgaagctc attttgcggg aactctggag tatgaatttg gattgggacc 3780 agccggttcc gcatgaattg gcgaagttgt ggacagatta tcgtgaccaa ctacacagct 3840 tgaacaacgt gaagattgac cgatggctct atagtgacgg agtacatgcg tacgaacttt 3900 gcggctttgc tgatgcgtca gacatggcat atggcgcctg tctgtatgct cgcacattac 3960 gtaccgatgg aacagcaaac atgattcttg tatgcagcaa atcacggatt cttccacgaa 4020 agaagaacaa acagaaagag atcacaacac cgaaggccga actgttagca gcgttattgc 4080 tttcgcaact cactgcaaag gtgttggaag ctatcgacac taatttcaat tctgttagac 4140 tgtggtcgga ttcacagata gttctttgtt ggctgcaaaa atctccagat tcactgactg 4200 tgtttgtggg aaaccgagtc aaacagatcc aacaattgac tgtcggttac acttggcaat 4260 atgttgcctc gaatgaaaat cccgcggatt tgatttcccg tggcatagca ccctcgaaac 4320 tgggaaacca gaagctttgg tggcacgggc ctccaagatt ggccagctca cagccattct 4380 atgatcaacc accagcactt tcggaagatc aactgccaga gctcagaagg acagcacttg 4440 tttctgcacc cgaacattcg agattaccga tattcgatcg aatcagtcgg tatatgatta 4500 tacagcgagc aatggcatat gtgattcgtt tttgtgatta catcagaagc agtcgaagcc 4560 aattgaccaa aggattaccc acggtgtccg aaatgcagcg agcatccacc ttgattacgc 4620 gcttggtgca ggccgaagtg ttccacaaag agatacaggc gatgaaggac ggaaagcaaa 4680 tcaagctacc gttccagaat atgaatccgt tcattgatgc ggatgacgga gtactacgcg 4740 tcggcggacg cttgaggaat gcagacatac cctatcaaca tcgtcatcaa gcattccttc 4800 cagagaaaca tccactcacg atcaatctga tacgttactt gcaccacacc aatttacatc 4860 ttggccaacg aagtctactt ggagttgtcc ggcagcagta ttggccgctc aaggcaagga 4920 acgtcattag aaagatattg caccactgca ttccgtgttt tcgcatgagg cccaagaaat 4980 ccacgcagct aatgggtgat cttccagatt atcgcgtcaa accatcgccg gtgttctcgc 5040 ataccggtct ggacttcgca ggaccgttca atcttcgatc gagcgcaatt tccagacacc 5100 caacaaccac caagggatat gtttgtgtat tcgtgtgcat ggccactcgt gcaatacacc 5160 tcgaagctgt ttcggatctg agctcagact ccttcttagc cgcactccag cgatttgtta 5220 gtcggcgagg gcttgtgcag aagctatatt ccgacaacgc aaccaatttt gaaggagcca 5280 acaatcaaat gcgtcgctta gctgagctat tccacgacga gcagcacatt cgtgctgtga 5340 atgaatactg cgcaccaagg ggaattgaat ggtcattcat tcctccacga agcccgcatt 5400 tcggtggaat ctgggaggcg ggagttaaat cggtaaaatc gcatctgaag ttaatccttg 5460 ccgaaaatcg tctgacgttt gaagcactgt caactgtcct tgcacagatt gaagcaattc 5520 tgaactcccg tcctctcaca cctgcttccg acgaccccaa tgatttgaac gccattactc 5580 cggcacattt cattatcggg agggaattcc aagccatacc ggaaccctcg tacgccagta 5640 ttccacaagg tagattgtca cgtttgcaat tcatccaaga catgaagcag aagttctgga 5700 gaaggtggat gaatgattat ctgcacgaac tccagaaacg ccagcgagat ctcaaggtta 5760 ccgaattcaa ggttggagcc atggtagtca tcatcgacga gaacgcaccg ccgctgaagt 5820 gggctcttgc ccgcataatt gaactgcatc caggcaagga cgggaacaca cgtgtagtga 5880 gcttgcggac tgagaacggc acaactaagc gagccgtgaa gaagatctgc ttgctacctc 5940 tagacgacga gaaagatgat cagaatatgt agttggaatc ttgaaatttc caatttcgac 6000 gcccgggaag a 6011 // ID I-48_AAe repbase; DNA; INV; 5673 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-48_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5673 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1319-1319 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 2..772 FT /product="I-48_AAe_1p" FT /translation="LGLIRKPTRPFYPRPTRCFNCLDYGHIGRSCKKARRC FT PNCCNAWHDEECSLPTFCLHCNENHSIYSSFCKKFKAEQEIVKIRIDRDIP FT FPQARRLFYEQRGSYSQVMQQQANKPCGCRCSCSTPFQASTNAKTTANNNT FT ETMDLTQTQTDDRNPITQNTISISGQTADHSKTDQNYTTQPDPFVDPPSQI FT PGPSNHSKKSGKKNRNNTPVGYQSRDSSRGTGVDSPPRKKTLPSSCHSDAD FT INHPHEHESRRTRQSK" FT CDS 738..4388 FT /product="I-48_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNMNQEERDNLNKSGRSILQWNLNGLKRHYNELRQII FT SKYNVYALALQETHTREDFRIPHFQSYYCHDTGHARGTWGGVAIAVHDDYS FT HHCIPLKTNLSVIAIQIHFPFTLTLVSLYLPHGVRDKNALTTLFKQLPQPY FT LVLGDLNAHHSAWGSSKISPGGEIILQVIDEENLVLLNDGSHTHMTASTGN FT TDAIDLSLGSLDLGTKLSWEVIDDTHDSDHYPILLKVETTDLTLRTRKKWK FT INAANWNVFEDHITNNLPPGRESNMQSITRAIISAAEVAIPQSSGLCRPNT FT VPWWNPELAEAIKNRRKALREFKRVPVGHPDKQAKKEEFNCAKKLARKLMW FT DSKKRSWEEFVSSINKDTPVTTLWRKVRSLQGNYRPFKIPSLLSENRLVTE FT PTDVANTLANHFERVSSNAGYSAEFTRNKIRSESRNPTFPPRLSADYNKPF FT EMHELLHVIDTVRGSSPGPDGIHYEMIRRLPFHAKHTLLHEINRIWETGDF FT PDAWRESLLIPIPKPGKDLKEPSNYRPISLTSCTSKIMERMVNRRLSHQIE FT NQGLLDERQHGFRKGKSIYSHLTVLEGEFRDSISNNSHTEIVSLDISKAYD FT ITWRRLIIDTMINWGLGGNLVKYVINFLQNRRFRVLVGGQQSEEKNQANGV FT PQGSVISVTLFLIAINSLFSAVPKFVKVIIYADDIIIYISGKSHKAIRKRL FT QQTLESLEKWSLKTGFRISHEKTTSMHVCRKRSHQDTPALKLEGFEIKQVQ FT TMRILGMVFDSRLNWNAHIDKIKADVEPRINILRSISGNRLGGHPKVMLNI FT YKAIIQSKLLFGAPIYNGASDAQLKKLDACQNKAIRISIGAFATSPIDSIM FT SISGILPLKYLIWETSVKNLISLSSVKFAENRSFAGLERARDIAMSKLDFE FT IPNIETYSTHPKSWAYKRPQICWKIKHAGGRTLERSIARQTFLQLRNKDFV FT TAEFVYTDGSKVTGKVGAGIITENETYSIGLPADLSIFSAEAYAILEAIRL FT PKNRESQRVICTDSESVVSAVENGHSKHPWVQEIEELLSLDNNTVLCWLPG FT HMGIRGNELADAAAKLGLGMRPKEISIPAADLKREIKLKVRLKWENYWNEL FT RDNKLREIKNTTSVWIPTGNRQEQTALTRLRIGHTRATHKHLFDKSDQVCE FT TCGTFLDVKHILIDCHKYTEERKNFHIQGSLQEILSVDQSKELNLRKFLKA FT SKLIEHI" XX SQ Sequence 5673 BP; 1994 A; 1162 C; 1222 G; 1295 T; 0 other; actcggtctt atcagaaaac caacccgacc attctacccc cgtcctactc gctgcttcaa 60 ttgtctcgac tatggacata ttggcagaag ttgtaaaaaa gcccgcagat gcccaaactg 120 ctgcaatgcg tggcatgatg aagaatgctc tcttccaacc ttttgtctac actgtaacga 180 aaatcattct atctactctt cgttttgcaa aaagttcaaa gcagaacaag agatagtaaa 240 aatcagaata gatcgagata tcccctttcc acaagctcgc cgcttatttt acgagcagcg 300 tggctcatac tcacaagtga tgcagcaaca ggcaaacaaa ccgtgcggtt gtagatgctc 360 atgtagcaca cctttccaag cctcaacgaa cgcaaaaaca acagctaaca ataacacaga 420 aacaatggac ctaacgcaga cccaaactga cgaccgaaac ccgataacac agaacactat 480 ctcgatctca ggccaaacgg ccgaccatag caaaactgat caaaactaca caacccagcc 540 cgaccctttc gttgacccac cctctcaaat acccggacca tcaaatcatt cgaaaaaatc 600 aggaaagaaa aatagaaaca acacacccgt tggttatcaa tctagagatt ccagtagggg 660 aactggggta gactcacctc cacggaaaaa aacgttaccg tcttcgtgcc actcggacgc 720 tgatataaac caccctcatg aacatgaatc aagaagaacg cgacaatcta aataagtcag 780 gaagatccat cctccagtgg aatctaaatg gcttgaaaag acattacaac gaactacgac 840 aaattatttc caagtataat gtgtacgctt tggcgctaca agaaacacat acgcgtgaag 900 attttagaat accacatttt caatcctatt attgccacga cacaggacat gccagaggaa 960 cttggggagg tgttgccatt gcagtgcatg acgactattc ccaccactgt attccactta 1020 aaacaaattt atccgttatc gccattcaaa tacattttcc gttcaccctg actctcgttt 1080 cattatacct accccatggt gtgcgcgata aaaacgcact aactactctg ttcaaacaac 1140 ttccgcagcc ttaccttgtg ctaggcgatc ttaacgccca ccactcagcc tggggaagca 1200 gtaaaatttc tcccggcggc gagatcattc ttcaggtcat tgatgaagaa aacctcgtac 1260 ttttaaatga tggatctcat acacatatga cagcaagtac aggcaataca gatgcaatcg 1320 atctatcact aggctcttta gatcttggta ccaaactttc atgggaagta atagatgata 1380 cacacgatag tgatcactat cctattttgc ttaaagttga aacaacagat ctaacactga 1440 gaaccaggaa aaagtggaaa attaatgcag ccaactggaa cgtctttgaa gatcacatca 1500 ctaataactt acccccaggc cgtgaaagca atatgcaatc aattacccgt gcgattatca 1560 gcgctgcaga agttgctatc cctcaatcaa gtggtctttg tcgcccgaac accgttcctt 1620 ggtggaaccc cgaattggcg gaagcaataa aaaatcgtcg taaagcttta agagaattca 1680 agagagtacc tgttggacat cctgataaac aggcaaaaaa agaagaattc aattgtgcta 1740 aaaaacttgc tcgcaagcta atgtgggatt ctaagaaaag atcttgggaa gaatttgtca 1800 gtagtataaa caaggatacc ccagtaacaa cactttggag gaaagttcgt tccctgcagg 1860 gtaactatcg accatttaag attccgagtc ttttgtctga aaaccgtctc gtaaccgaac 1920 caacagatgt tgcaaatacg ttagcaaacc acttcgaacg tgtttcttcg aacgctggtt 1980 actctgctga gtttacaaga aacaaaatac gatctgagtc tcgcaaccca acattccctc 2040 ctcgactatc cgcagactac aataaacctt ttgaaatgca tgaactacta catgtaatag 2100 acactgtacg tggttcaagt ccggggccag acggaataca ctatgaaatg attcgtagat 2160 tgccttttca tgcgaaacac actttactgc atgaaataaa tcggatatgg gagacgggag 2220 attttcctga tgcatggagg gagtccttac tcattccaat tccaaagcca ggtaaagatt 2280 tgaaggaacc ctctaattat cgcccaatat ccctcaccag ttgcaccagt aagattatgg 2340 agcgtatggt gaatcgtagg ttgtctcacc agatcgagaa tcaaggattg ctagacgaac 2400 gacaacacgg attcaggaag ggaaaatcta tctacagcca ccttaccgtt cttgaaggag 2460 aatttcgaga ttcaatatca aacaatagcc acacagaaat cgtatcattg gatatatcca 2520 aggcatatga tataacttgg aggcgcctta tcattgatac catgatcaac tggggattag 2580 gaggaaattt agtgaaatac gtcattaact ttcttcaaaa cagacgcttt cgtgtccttg 2640 ttgggggcca acaaagtgag gagaaaaatc aagcgaatgg tgttccacag ggatccgtaa 2700 taagtgtgac ccttttctta atagctatta actctttatt cagtgctgtg ccgaaatttg 2760 tcaaagtgat aatttacgct gacgacataa ttatctatat ttcgggtaaa agccacaaag 2820 ccattcggaa acgattgcaa caaacgcttg aatcactgga aaaatggtca ttaaaaactg 2880 gtttcagaat ttcccacgaa aaaacaacaa gtatgcatgt gtgtcggaag cgatctcacc 2940 aggatacacc agcattgaaa ttggagggct tcgaaataaa acaagttcaa acgatgagaa 3000 tactggggat ggtttttgat agcagattga actggaacgc gcatattgac aaaatcaagg 3060 cagatgtgga gcctagaatc aacatactgc gttctatatc cggaaatcga ctaggtggtc 3120 acccaaaagt tatgctcaac atctataagg caatcatcca gtctaaactc ctatttggtg 3180 cgccaatata taatggagct agtgacgctc aacttaagaa gttggacgcc tgccaaaata 3240 aggcaatacg catttccata ggtgccttcg ctacaagtcc catagatagc attatgtcca 3300 tcagcggtat tttaccactt aagtatttga tctgggaaac ttctgtaaaa aacctaatca 3360 gtttatcctc agtaaaattc gcagaaaaca gatcttttgc tggcttggaa cgagcaaggg 3420 atattgctat gagcaaactt gacttcgaaa ttccaaatat cgaaacatat tcaacacacc 3480 caaagagttg ggcgtataag cgacctcaaa tatgctggaa aattaaacac gctggtggga 3540 ggactttaga aagatcaatc gcacgtcaaa catttcttca actgcgaaat aaggacttcg 3600 tcacagctga atttgtctac acagatggat ctaaagtaac cggaaaagtg ggagctggaa 3660 taattacgga gaatgaaaca tacagtatag gattaccagc tgatctttca atattttcag 3720 ctgaggctta tgctatttta gaagcaatac gtttgccaaa gaatagagaa agtcaaagag 3780 tgatctgcac cgactctgaa agtgttgttt cagcagttga aaatgggcac tctaaacatc 3840 cttgggtgca agaaatagaa gaactactct ccctagacaa caatacagtc ctctgttggt 3900 taccgggaca tatgggcatc cgtgggaatg aactggcaga tgcagcagca aaattaggtt 3960 tgggtatgcg tccaaaggaa atttcgattc cagcagccga tctcaaaaga gaaataaaat 4020 taaaagtccg tttaaagtgg gagaactatt ggaacgaatt gagagataac aagttacgtg 4080 aaatcaaaaa caccactagt gtgtggattc cgacgggtaa taggcaagag caaactgcac 4140 taactcgtct acgaataggg cacactagag ccacacataa gcatcttttt gataaaagtg 4200 accaagtttg tgaaacctgt ggaacctttc ttgatgtcaa gcatattcta atagattgcc 4260 acaaatacac tgaggagcgc aagaatttcc acatacaagg atcactacag gaaattttga 4320 gtgtagatca atccaaggaa ttgaacttga ggaaattttt gaaggcaagc aaactaatcg 4380 aacacatctg aagcataaaa acccacgagg gtggccgtgc gggcgtagaa gtcatgttcg 4440 tcgctacacc cttatcggta cacgtctcga tggtatgctg atgttcatgc tttttgatac 4500 tcacgagggt tggctgtgcg gaagaggacg ttatgatcgt cgccacttcc ttatcagtac 4560 acgtctcgct gagcaaaagc ttaattcgaa tcacggaaat caggatgagg agcaaagaag 4620 aaggcaaaag gaaatcgtcg tggaaggaaa ctgtaggaaa gcaagccgga agtggacaat 4680 aaggaaaaca tgccggaagg aagcagacag aagaagaaaa acgcaagaaa tgaaggaatt 4740 ggaatcaaac ggaggagaac tgtagcaaga atttgaaaag gaagcaaaca aggatttgat 4800 gcggcaatta aaggaagaaa cagtccgcct gaagaagaaa agtgatgaaa agtgagaaaa 4860 tgaagaagag agaggaagac gaaaggaaaa gaagtataaa aacccacgag ggtggccgtg 4920 cgggcgtaga agtcatgttc gtcgctacac cctgatcggt acacgtctcg atggtatgct 4980 gatgttcatg ctttcttgaa actcacgagg gttggctgtg cggaagagga cgttatgttc 5040 gtcgccactt ccttatcagt acacgtctcg ctgagcaaga gcataactcg aatcacggaa 5100 aacaggatga ggagcgaaga agaaggcaaa aagaaatcgt cgtgggagga aactatagga 5160 aagcaagctg gtagtggaca ataaggaaaa catgcaggaa ggaagcagac agaagaagaa 5220 acggaaagaa atggaggaat tggaaccaac gagaagaata gaagcaggag ttcgacgaag 5280 aaaaaaaaat aaggatttga tgcaggaatt aaaggaacaa gagccgaagg aagccgtcac 5340 cgaacgaaac cgaaggagga ccgcatgaag aagaagagtg aagataagtg atgaaaagtg 5400 agaaaatgaa gagaggttga agaaaaggaa aagaagtatg agttgtaaga gaaaatatgg 5460 aaaagaaaag aaaagcaaga gatgacaaat gaacaaatgt gaagtgaaaa cattagaaac 5520 atagatatcg aaaacgacga gaaggaaaac tcagatggac gaaaacaaaa tacctaagat 5580 aagtacaaaa ttttatcatt tttagtatta agcttattta aatcgaatgc cgcctcaggc 5640 ggtaaagata aataaaaaca acaacaacaa caa 5673 // ID BEL-5_SI-LTR repbase; DNA; INV; 297 BP. XX AC AEAQ01019002; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_SI_; KW BEL-5_SI-I; BEL-5_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-297 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01019002; Positions 930 634. XX SQ Sequence 297 BP; 88 A; 67 C; 47 G; 95 T; 0 other; tggtaaggct agataactat tgattaataa tttatttaat tttgtctgtt ttatattgca 60 taatttgttc aggtataaaa tttaatttgc gcgtacgcga cctcgccacg ttgaatagag 120 actgcgcgcc ctctcttaga tcgcttacgc gactcttaca caagcaaccg ttcacacaca 180 cacttactta cactctaagt ctcgatgtac cacgttttcg tgccttgaaa taaataattt 240 tggtaaaaac aaatactcgc aattcactga gggctaacaa gtacgctctc ctaacca 297 // ID Gypsy-131_AA-I repbase; DNA; INV; 3922 BP. XX AC AAGE02027875; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-131_AA_; KW Gypsy-131_AA-LTR; Gypsy-131_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3922 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027875; Positions 111491 107570. XX CC Positions [2968-3435] - Integrase core CC 'GTTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1459..3906 FT /product="Gypsy-131_AA-I_1p" FT /translation="MPVVEDVFAKLNGGKFFSKLDLKSAYNQLLLDEESKK FT MLAWSTHKGIYYVNRLPFGTKPACAIFQEILEKLLQDCPGSVNFLDDVLVT FT GATISEHLNNLSKVLEKLLDAGFRLNRNKCEFFKERLQFLGHIVDGDGLHK FT DPEKVRAIMDVPRPANVKLLREFLGMVTYYSKFIPNVSAILSPLYRLLKKE FT EKYEWSPECETAFQDIKLKICSDNVLIPYDPALPVVLVTDASGKGIGAALF FT QVCSDGNQRPVTFISRVLKNHEKEYSSLDMEALAVYYAVRRLSNYLLGRNF FT TIWTDHQPLVSLFGRKGIPDMVFGKLQRWAVFLANYDYEIKYIKGVNNKVA FT DFLSRSPVLYHEDDDINDEEVMFLQFIEAETKSLVERKQLIVETRRDPVVS FT RVVNYVKTGWPTNVQDSELKKFFVRRSELIVEEEVLLWGYRIVAPAKLRPF FT LLQELHSTHMGIVKMKSLARNYFWWPNIDKDIEDMGKRCEQCIQCRPELGN FT VPISPWKLCSRPFERVHIDHLFLKNKNFLIITDSYTKWVEAYVVNTLSTKE FT TIEKLEDCFGRFGNCDVLVSDNGRSFTAADFQEFCKEKGIRHLTSAPYSPC FT SNGAAENAVKTFKNALKKMLAESSSRGKSLASTMNRYLQMYRATPHCTTNE FT TPFKLMFGREMRTRFDSLRKDNFVQQTVKIWKHNAKRKDTSFIAGEIAYAR FT DYRDPDRQRWIKTKIMKRKGENLYECYAAEIGTFIRRTHQLLKYEYDDFED FT DRSSVQETIVGNNTTNVQLPDDEYLSAEDEAIEDPSSVQQQRGYYRDDGVY FT VSRSNRMVRAPERF" XX SQ Sequence 3922 BP; 1244 A; 675 C; 937 G; 1066 T; 0 other; attggcgacg attggagttt aaacctgcgt taagatggaa ggttactgtc cgccgagatt 60 tgccttggga gatcaatggg aattgtacca ggaacgtttg gagcaatact ttgtagctgt 120 tgatacggaa gaggagcgta aatccgctgt actcctaaca tcaatttcgc tcgaggtgta 180 ccagacggtg aagaatattg tcatcctgcg aaacccaatt ctaagaccta tgatgagctg 240 tgtgtgttgc ttaaaggaag attttgtcca acggttgtta cgtatcgtga gagagcattg 300 ttttatagag cccgccaaga cctggaagag agcgtcctgg aatggtatgt gcggttgaag 360 cgattgtcgt tgaattgcga gttcggagaa catctggatc acacgttgaa agatattttt 420 gttacgggac ttcgacctgg accaatattt gaaagattat gtgaagaaga agagtctgta 480 tcattggaga atttgatcaa aatagcatcg aaacgggaag ctgcgctgaa gaaccgagcg 540 gtattggagg tgaacaagat agttgagaag aaagctaatt atgttccgaa gttgacgaag 600 aaacctggat ctgcaacttg ctatgcttgt ggtaaaggta atcatgattt tcgcaactgt 660 caatatcgaa gttacgtttg caagatttgt aaagaaaaag gacatattgc tgttgtttgc 720 cagaagaaaa ctaaagagaa tttcggaagg aaggagaaaa acgttaatca tttggagatc 780 aacaacgtgg tctatagcga cccttacttt gtggaagttt ctgtgaatgg acgatgtacg 840 aagttcgagg tagacactgg atcgccgatt actgttattt cagaagaatt ttatcagtca 900 cactttaacg aatttgccct gcacccattt cgaggaaaac tagtattcta cactggaggt 960 gaagccacgc ctaaaggtgc attcgatgcg gtattgaaat atcaaggtcg acaagcgatt 1020 ggcgaattgg tcgttgttga aggcggaagg aatccgttga ttggaagaga tttcattggt 1080 gagttactga acattcaatt caacaagatc gatacaaaga agtcagaaaa gtttgatgga 1140 caactgaaaa gcgttttgga caattatgaa gagttgttcg acgattcgtt agggtgctat 1200 aagtattcaa aagtgaatct gcaactgaag cccgatgctg ttccaaagtt tgtgaaacca 1260 aggaaaattc cgatttcttt tcaacccaaa gtcgaagaag aattggagag gttggagaat 1320 actggaataa tttcaaaagc tgaaaatgct gattggggga cacctttggt tcctgtattg 1380 aagaaggaca attcgatccg cttgtgtgct gattatcgcg ttacagtgaa tccatttctg 1440 gaagataaac ggtatccgat gcctgtagtg gaagatgtgt ttgccaaact caatggcggc 1500 aaattttttt cgaaactgga tttgaaatca gcatacaacc aactactact tgatgaagaa 1560 tcgaagaaga tgttggcgtg gagtactcat aaaggaattt actatgtgaa cagattgccc 1620 tttggaacca aaccagcatg cgctattttt caagagattt tggaaaaact gttacaagat 1680 tgccctggat ctgttaactt tttggacgat gttctagtta ctggagcaac aatcagtgaa 1740 cacctcaaca atttatcgaa agtattggag aagcttttgg acgctggttt tcggctcaac 1800 cgaaataaat gtgagttttt caaagaacga ttacaatttc tgggacacat tgtggacggc 1860 gacggtctcc ataaggatcc ggaaaaggta cgagcaataa tggatgttcc gaggccagca 1920 aacgtaaaac tgttacgaga atttttggga atggttacgt actactcaaa atttattccg 1980 aatgtctcag ccattttaag cccactgtac cggctgttga agaaagaaga aaaatatgaa 2040 tggtcgcctg aatgtgaaac tgctttccag gatatcaagc ttaaaatttg ttcagataat 2100 gtattaattc cttatgatcc tgctttaccg gtagtactgg taacggacgc ttctggcaaa 2160 ggtattggtg ctgcgctatt tcaagtttgt tccgatggaa accaaagacc ggtgactttt 2220 atttcaagag ttctcaaaaa tcatgaaaaa gaatactctt cgttagacat ggaagcattg 2280 gcagtatact atgcagtacg aagattaagc aattacttac taggaagaaa ttttaccatc 2340 tggacggatc atcaacccct tgtatcactt tttggacgta aaggcatccc tgacatggtt 2400 ttcggtaagc ttcagcgatg ggccgttttc ttggccaatt atgattacga gatcaaatac 2460 atcaaaggag ttaacaataa agtagccgat ttcctgtcgc gatcaccagt attgtaccat 2520 gaagacgatg atatcaacga tgaagaagtc atgttcctac agtttataga agctgaaacg 2580 aaatcattag tagaacggaa gcaattgatt gttgaaactc gacgagatcc tgtcgtaagt 2640 cgtgttgtga actatgtcaa aactggatgg cctaccaatg tgcaagattc tgagttgaaa 2700 aagttcttcg ttcgtagatc agagctgatt gtggaggaag aagtgctcct gtggggctat 2760 cggatcgtag cgcctgcaaa acttcggccg tttttactac aagaattgca ctcaacgcac 2820 atgggaattg tgaaaatgaa atcgttagca agaaattatt tctggtggcc taacatcgat 2880 aaagacattg aggacatggg taagcgatgc gaacagtgta tccagtgtag accagaattg 2940 ggaaatgttc caatatctcc atggaagcta tgttctcggc cattcgagcg agtacacatc 3000 gatcacctgt ttttgaaaaa taagaatttc ctgattataa ccgatagcta tacaaaatgg 3060 gtagaagcct atgtggtgaa tacattatca acaaaggaaa ctattgagaa gttagaagat 3120 tgtttcggac gtttcgggaa ttgcgatgtt ttggtgtccg acaatggtag atcgtttact 3180 gcggcagact ttcaagaatt ttgcaaggag aaaggaatac gacatctcac ttcagcacca 3240 tacagtccgt gctctaatgg tgcagctgaa aatgcagtga agacattcaa gaatgcactt 3300 aaaaaaatgc ttgcggaaag ctctagccgt ggtaagtcac ttgcttcaac gatgaaccgt 3360 tatcttcaaa tgtatcgtgc tacaccacat tgcacaacaa atgagacacc attcaaactt 3420 atgtttggac gtgagatgag gactagattc gattcgctcc ggaaggacaa ttttgtacaa 3480 caaacggtta aaatatggaa gcataatgcg aaaaggaagg atacatcttt cattgcgggt 3540 gagatagcat atgcaagaga ctatcgagat cccgacagac agcgatggat taagacaaaa 3600 attatgaaac gaaaaggaga aaatctgtac gaatgctatg cagctgaaat tggtacattt 3660 ataagaagaa ctcatcaact gctgaaatat gaatatgacg atttcgagga cgatagaagt 3720 agcgttcaag aaacgattgt tggaaataat actaccaatg ttcaactacc agacgacgaa 3780 tatctatcgg cagaagatga ggctatcgaa gatccaagtt cagttcaaca gcagagagga 3840 tactatcgag atgatggggt atatgtttct agaagtaacc gaatggtacg tgctccagag 3900 cggttctaac ttacggagga ga 3922 // ID Gypsy-241_AA-I repbase; DNA; INV; 6428 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-241_AA_; KW Gypsy-241_AA-LTR; Gypsy-241_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6428 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1083-1083 (2011). XX DR [1] (Consensus) XX CC Positions [5450-5917] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 657..1886 FT /product="Gypsy-241_AA-I_2p" FT /translation="MEFEMRPLPPFRCEQIERSKLSREWRSWRNALECYFE FT AHSIFDQRIKRAKLLFLGGPQLQRVFEHLADTDKIPLVAVKETWYDVAIEK FT LNEYFQPARQHTLERHRLREMKQMKDERFAQFVMRLKQQSADCGFDKYSTE FT ISKILTEITLIDVIVQGCTSNELRRRILKEDHSLAEIEALGAMFECVDEQV FT KSLSNVTANTQEKIFKVTESKKVEDKPDTNGQLSCYRCGNTGHFSKSPNCP FT ARNKTCRRCKRIGHFESVCRGFSKRPAQKLNDNITAKKVRAIETVDSETEL FT MLPGKDSKKDEQTPSKTYYAFYTGNETNMIECHIGGIKWKVIIDSGSECNL FT ITKNAWEKMKTCNIAVQSSTKGCDRVLKAYGSNVPLKVNFKIRIRYDCTIS FT IINTFMFYIGCWFIHG" FT CDS 4469..6400 FT /product="Gypsy-241_AA-I_1p" FT /translation="MNKFIYNLATIDEPLRRLTLKKVPFEWTEEHTNAFEK FT IKQEMRIAGKLGYFNLNDRTVVTTDASPVGLGAILAQYDDNDQIRVISYAS FT KSLTDTEARYCQTEKEALALVWAVEKFQRYLIGRSFELVTDCKALQYLFTT FT RSRPCARIERWVLRLQAFDYKVIHVAGECNVADALSRLATLNPLPFDQNEE FT LMIREVAVSAANAMALDWQQIEDAPKEDEEIRNIIALLDSNRQRELSVPYR FT VIANELCHIGDVLMRVDRLVVPSKLRTVVLNLAHDGHPGSRMMKSHLRSSV FT WWPKLDQQVEEFVRNCRGCCLVSAPDAPEPMIRKPLPSRPWEDIAVDFMGP FT LPEGQYLLVVVDYYSRFIEVKEMNTISANDTIQELSVVFSRYGFPNTLRAD FT NGPQLSEHCEELKLFCRENGIKLVNTIPYWPQQNGEVERQNRSILKRLKIS FT QELGKDWKKELSQYLLVYHSTNHSTTGKSPAELMFGRRIRNKLPHVPMYRL FT DDEEVRENDLVQKEKGKEYADGKRKAKQSEIVVGDIVLMKRMKRTNKLDTD FT FCKEEYIVKRREGSDCTVESMESGKQYRRNVSHLKRVGKKTESNNVTEASE FT ENVDLQVQDGNNKTDEIQLELNEDENQRTRKRIRIEPHHFKDYIAH" XX SQ Sequence 6428 BP; 2110 A; 1131 C; 1437 G; 1750 T; 0 other; acctggcgac gaggatgaac tccggtaaga tttcattacc acaataacga aaaaacgtga 60 gcttacatag taaaaaaaaa aaactgaaaa taatgccaga actgggagag ttcactttga 120 gaagatgctt aaatgcggag ttttatgtga aagaaccagt aaaactcgga tcgggagggt 180 ccactgaagt ttataaaaaa aactgtcttc tggcagtgcc cgggctggga gagtccacta 240 tcaggttact gtaagtaaaa aaaaaaaaaa aaaacgggcc gggagggtcc actgaagttg 300 acaaatttgc tgctagttgg caattcctgg actgggagag tccactaaat gattatgaat 360 atcttgaaat tttctgaatt tgacgaatat acaggacgat cgtttttgaa atagtgaaaa 420 aaaaaaatct tactcaacat tgtaagtgtc atccagaatg ttcgaaaaaa aagagttgct 480 atggagacaa gcacaagtgc acacgagtcg atgactactt tgattttttt taagaatgga 540 tgacgttttt aaaaattaca aattaattga actctatgta agctcatgtc aacgttaagt 600 gaataatata ataaaaaaat aataataaat aataataata aacatatatt gcagcgatgg 660 aatttgaaat gagaccattg cccccgttcc gttgtgaaca aatagagaga tcgaagctaa 720 gccgagagtg gaggtcgtgg agaaacgcgc ttgaatgtta tttcgaagcg cattccattt 780 ttgaccaaag aataaaacga gcaaaacttt tatttcttgg tggaccacaa ctccagaggg 840 tgtttgagca tcttgctgac acggataaaa ttccgttggt agccgtaaag gaaacgtggt 900 acgacgttgc gattgaaaaa ttgaacgagt attttcaacc agctcgtcaa catactttag 960 aaagacatcg tcttcgtgag atgaaacaaa tgaaagatga acgtttcgct caatttgtaa 1020 tgcgcttgaa gcagcagtct gcggattgtg gctttgacaa gtactcaaca gaaatctcca 1080 agattctgac cgagattact ttgatagatg taatagtcca aggatgtacg tcgaatgagt 1140 tacgtcgtcg tatattgaaa gaagaccatt ctttggctga gatcgaagcc cttggggcga 1200 tgtttgaatg cgtagatgag caggtgaaga gtctgagtaa cgttaccgca aatacgcaag 1260 aaaagatttt taaagtgacc gaaagcaaaa aggttgagga taaacccgat acaaatggcc 1320 aactgtcgtg ttatcggtgt ggaaacacag ggcatttttc gaaatctcct aattgccctg 1380 cccggaataa aacatgcaga cgatgtaaac gtattggcca tttcgaatca gtatgccgtg 1440 gattctcgaa acgaccagca cagaaactga atgacaatat aacagcgaaa aaggttcgag 1500 ccattgagac cgttgacagc gagactgagt tgatgttacc aggtaaagat tccaagaagg 1560 atgaacagac tccatcgaaa acatactatg ctttttacac gggtaacgaa acgaacatga 1620 ttgagtgcca cattggagga attaaatgga aagtgatcat tgattctggg tcagaatgca 1680 atttgatcac taagaatgcg tgggaaaaaa tgaagacttg caacatcgcg gtacagtcat 1740 ctacgaaggg ttgcgatcgg gtattgaaag catatggtag taatgtacca ttaaaggtaa 1800 acttcaaaat taggattaga tatgattgta ctatttctat aatcaataca tttatgtttt 1860 atataggttg ttggttcatt cacggctgaa gtgcgaatta acgaacgaca tgtggttgct 1920 gaattcattg ttgtggatgg tggccagcag tgcttgctag gtgatacaac ttcaaaagag 1980 ctcggtgttc taaaggtagg tctggacatc aacaatgttg aagagttgaa gccgttcaac 2040 aagattatcg gattacaagt acaaattcat atggatacat cgatcaaacc ggtatttcaa 2100 ccactgcgca aagtacctgt tcctcttgaa gctgctgtga acaaaaagct ggagcaacta 2160 cttcaaaggg acataattga ggtaaagact ggaacaacgc agtgggtgtc acccatggtc 2220 gaggggggcg acacatccaa atcaccgatt agatgcactt ttggataaaa ttagattttt 2280 tagatcagtg tatcattcta gattttgacg tgcttcatgt gctgacatct tctgctaaac 2340 gatgcgcgta gattgatgag atgcatgaag atgtgaatgg gtggaaggag aatcaatgat 2400 gatgtagatt atggtgtgca tagtgctaca gtttttctgt aaacagtttt gatttattta 2460 tgcttttaat tttcattatt tttgctttaa ttacttatgt tacttgaatt caattgcaat 2520 acaagttaac tgggtgaaaa gatagtggaa aactgtgagc ctgtgtgtaa aaatttagta 2580 aatggtaact atttgctacc agaattgcac gtaagaggct taattctggg aaatcttaag 2640 aaaactattt ttgtttgtat ataggtaatt tttctttaca tgcgagaaaa cgcgttactt 2700 atacaaatcc gtcgactatg attacggcac tggattccgg cttccagaag gggtaggtaa 2760 atgtatactg ttgctatcta caatttcact aatcatcatt ctattattta aggagcagca 2820 tattaaacaa cactgcagaa cgatattgtc cgtaaatgtg atagcccaat cctcagcagg 2880 aaacccgaaa atatgcgatt taatctatgg aatttccagc aatgagcttt gttaatgtaa 2940 taatattatg attaatttaa aacccaattt aataaccaat aaaatatttc ggagttgtga 3000 gtaagggcat atacagagaa accagggaca atttattaat tcacaacatg ccccaacccc 3060 ttccccccca ttgaaaaaaa taaactggtc aacgaacgcc aactgtgttc cgtccatctc 3120 cacccacaaa gtactttgat caccgacatg atccagatca tcttttcctt cagccacatg 3180 ccgcttctgg tcatcttccg tcaagtttga cagcaactct tgcacctggc gccgaagacg 3240 atgcttctta tctcgaagta aatcaccgtt ttgatcatgc aggaaccgct tctgtcggaa 3300 aacctggcca acacatccta tagacgactg ttctgcttct tgggcagtcc gtgtatcacg 3360 cctccgttat tttatccgtc gtatccatcc tttccggcca catcacggaa ccgataatca 3420 gcatcctgct gaatccgggc caatctccgg accactagca agcccaggag aacatctaaa 3480 tggtgttgta gaaccggctg ggattggaat tcgcttgatg tttagacaca ttccaccgat 3540 tatgcagctc cgtgcttcct ctttgttgtt gatcaacgac aactgagtca atccgctcat 3600 gtaagacgcg caaagatgag ctagatttgt gaagatggat aaagatgttt tgcatgatac 3660 cgcagatgaa acaagataat atactgcaag aagatgaatg gtcgtcgtaa agattttttc 3720 aaaggtataa ggtaatctaa aatatctgga tgtgtcgtcc ccctcgccca tggtagtagt 3780 tggcaaagca aatggcgaac cgagactttg tttggattta cgtagagtaa atgaagccgt 3840 attacgcgaa cggtatccca tgcctgtcgt ggatgattac ttggctagaa tgggcaagaa 3900 tatgattcgt agcaagcttg acattcgcga agcattcctt caagttgagc tagagccaaa 3960 ctcacgagac atcaccacat tcatcaccag taaaggattg ttccggttta agaggatgcc 4020 ttttggtttg gtttccgctc ctgaaacatt ccaaagagtg atggacgaga tattgactgg 4080 ctgtgaagga gtatactggt atcttgatga tgtcatggtc gaaggtgcaa ccgtcgaaga 4140 gcatgatgca cgactcaatg aggtataact tgatggaatt ttattaaatg cgattttatt 4200 tgcattgcat ttgaatataa gtttgctgaa atagaaaaca ataatacatt atttttttta 4260 ataataggtt ttaaatcgtt tcaagaatcg cggtgttcaa ttaaattggg aaaaatgcgt 4320 ttttagggaa caagaacttg aattcctggg acatgtagtc tcacctgaag gaatttttcc 4380 agtaaaatct aaaacagatg ccattcagac cttccgaatt ccaaagaacg aaactgaaat 4440 taagagtttt cttggactag ctaattatat gaacaagttt atttataatt tggctaccat 4500 tgatgaaccg ttgcgtaggt tgactttgaa gaaggtgcct tttgagtgga ctgaagaaca 4560 tactaatgct tttgaaaaaa ttaagcaaga aatgagaata gctgggaaat tagggtattt 4620 caacctaaac gaccgcacgg tggtgacaac tgatgcaagc ccggttggac ttggggctat 4680 tcttgcgcaa tatgatgaca acgatcaaat acgtgtcata agctacgcgt ccaaatctct 4740 gactgacaca gaggcaagat attgccaaac tgagaaagaa gcgcttgcgc tagtatgggc 4800 tgtagagaaa tttcaaagat accttatagg gagatctttt gaattggtaa ccgactgtaa 4860 agcattgcaa tatctattta ctaccagatc ccgcccttgt gcacgcatcg agcgatgggt 4920 tttgcgccta caagcgtttg actataaagt tattcatgtt gctggtgaat gtaacgttgc 4980 agatgcactg tcgcgtttgg ccactttaaa cccacttcct ttcgatcaaa acgaagaact 5040 tatgattcga gaagtagcag tatctgcagc aaatgcaatg gctttagatt ggcagcaaat 5100 agaagatgcc cctaaagaag atgaagaaat tcgtaatata atagcgcttc ttgattccaa 5160 ccggcaacga gaactatctg ttccatatcg ggttatagcg aacgaattgt gccacattgg 5220 agacgttctt atgcgagtag atagattagt tgttccctcg aaacttcgaa ctgtggtact 5280 gaatctagcc catgatggtc atccgggcag ccgcatgatg aaaagccatt tgcgtagttc 5340 agtatggtgg cctaagttag atcagcaagt tgaagagttt gtacgaaatt gtcgcggatg 5400 ttgtttagta tctgcaccgg acgcacctga gccaatgata agaaaacccc taccatctag 5460 accctgggag gatattgcgg tagatttcat gggtcctcta ccggaaggtc aatatttgtt 5520 ggtagtagtg gattattact cacgattcat agaagtgaaa gagatgaata ctatttctgc 5580 aaatgatacc atccaagagc taagtgtagt tttcagcaga tatggatttc cgaatacact 5640 tagagctgac aacggacctc aacttagtga acactgtgaa gagctgaaac tattttgtcg 5700 tgaaaatgga ataaagctag taaacacaat tccgtattgg ccgcagcaaa acggtgaggt 5760 agagagacag aaccgttcta tattaaaaag gttgaaaata tctcaggaac taggcaaaga 5820 ttggaagaaa gagctaagtc aatatctact ggtgtatcat tcaacaaatc attcgacaac 5880 aggaaagtct ccggccgaac tgatgtttgg tagacgcatc cgaaacaaac tgcctcatgt 5940 tccgatgtac cgtttggatg atgaagaagt tcgagaaaat gatttggttc aaaaagaaaa 6000 aggaaaggag tatgctgatg gcaagagaaa ggcaaagcaa agcgaaattg tggtaggaga 6060 tattgtgcta atgaaaagaa tgaaaagaac gaataagttg gatacagatt tctgtaagga 6120 ggaatatatt gtaaaacgta gagaaggtag tgattgtacg gtagaatcta tggaatcagg 6180 aaaacaatat agaaggaacg ttagtcatct caaacgagtg ggaaagaaaa cggaaagcaa 6240 taatgttaca gaagcttctg aggaaaatgt agacttgcaa gtacaggatg ggaataataa 6300 aacagacgaa atacagttag aattaaacga agatgaaaat caaaggacta ggaagaggat 6360 tcgcatagaa ccgcatcatt tcaaagatta tattgcacac taatccgata aaaaagcaaa 6420 agtgggat 6428 // ID hATN-4_SM repbase; DNA; INV; 402 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hATN-4_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-402 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1852-1852 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 402 BP; 129 A; 57 C; 67 G; 149 T; 0 other; cagtggttct taaccagggg tgcgatctag tatacacggg ggtgcgagtt tacaatttta 60 caaattcact gtaattagta ggtacttatg cacttaagta cctcccgggt tcgaatccaa 120 aaaaatacaa aatactaacc ctaactggat tttttttggt ccagaactga tctaacccta 180 tacctaaccc tgttttattt tatttaattg attattttag aatctaaatt ttagataata 240 gttaagattt ttaaaaaatt gttatgcaca taatattaat aaaaaatata ttgtgcattg 300 ttttttttca atttatataa ttccttattt ttggtaaggg gtgcgagaat tttgtttgaa 360 cactttaggg gtgctagtag ctaaaaaggt taagaaccac tg 402 // ID BEL-12_AA-LTR repbase; DNA; INV; 625 BP. XX AC supercont1.141; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_AA_; KW BEL-12_AA-I; BEL-12_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-625 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.141; Positions 1028930 1029554. XX SQ Sequence 625 BP; 214 A; 103 C; 131 G; 177 T; 0 other; tgttgcggtg catcgcgctg aactcagttg gcaacacggg tatcactata ttatcgatag 60 ggccaggagg aactaatagc tacctgaaga acaatagggt tgttttgacg cgatgattca 120 ttgtacggta tttgctacat aaagttaagc tacaaaagaa gatttgattg gattaaagct 180 aaattaacga agtaaagaat tcagataaat aaaatttgtt gaactgattg tgcatctgaa 240 tcgtaagtag gcattattat cattaagaca taagttctct tattccgcta ttattcagta 300 ggagacagct ggataaactt ggtgttcgcc ttgaagcgat actattgctc ctgagtaggt 360 tagccatagc gtaagtaaac cgataaatcg tgtaagaatg catgattgaa atctattgaa 420 actttcatat acagaaatta taagtcggca ctctaagaag accattcgga ctgactaaga 480 aaagcttgga gattagggag accaatgtaa gatatttgga atgaacaatt tataaaatgc 540 gctaattaat atatcagctt ttagcatatc tgcactacaa accgacggtg tacgtggctc 600 aaaagacccc gaaagctcct caaca 625 // ID Copia-136_AA-LTR repbase; DNA; INV; 124 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-136_AA_; KW Ty1_copia_Ele126; Copia-136_AA-I; Copia-136_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-124 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 124 BP; 38 A; 28 C; 19 G; 39 T; 0 other; tgttagagat tacgaatcgt agcaaccatt gaattatatt gaatgtaacg attactcgaa 60 taaattagtc tttagtcaac cgtcaatccg ttcagtcgaa cttctcttgc cctccgaata 120 ccca 124 // ID CR1-116_AAe repbase; DNA; INV; 4792 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-116_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4792 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1204-1204 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 1617..4757 FT /product="CR1-116_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="PSSYLPFSPDHVVHRTPLENNSPVLINVNDLQPLRDS FT PPLHIRDDLLPPRQTSEYPRSTLTEEQCTSTDSIQPLRSSCCAHTADRNES FT HQTNYSIYYQNVRGLRTKTRDLYLALVECDYDIVVFTETWLHAGIASAEFS FT ANYSVYRCDRSSHTSQHSRGGGVLIAVKPHVRCKEVFFKHSGIEQVCIEVK FT LNDCSLFISSVYMSPSLSTDSYERYIKCMNELLERITDKDTLLLMGDYNLP FT NVEWQFDDDLCCFLPLNQFDENSTGVLTKESCIISGMFDGGLHQINGIRNV FT NDRLLDLAFVNTMDCWDLSEAVVPIMPLDAHHPALLLNIMCSTQTLNLLDL FT EGAFDFRSCNFAEVNASLSEIDWNTLARSLSVDDAVDAFYRTIYFTIGNHV FT QRRRPLVTKKVPWWNSELSLLKNRLRKARKKYSRSPSVESKTWLRSCEAEY FT QVLHRNVYRTYITDIQRKLKTDPAHFWSYIRSRRGTNGLPISLSLGDEKSE FT SSQEAVNLCADYFKSTYVHHEPLTTDEHLRFVEQFDINIPHLEVSVQDIEE FT ALARINEQKGPGPDGLSPTFLKNCRHTLALPVSILFNMSLSTSRFPSAWKI FT ASMVPVHKSGSTHKVENYRGISILNCLAKLLERFVYDLIKSTTDHIVSECQ FT HGFVGKRSTVTNMMVFVPFVSQSLTERYQVDAIYIDFAKAFDRLSHDIIVS FT KMHHLGFPLWLCTWLRSYLVDRKAFVKCGTVSSDTFAIKSGVPQGSHLGPL FT LFTLSINDICVIIRSEKILFADDLKIYRKIKDSVDCVALQNDLQSLNEWCK FT VNRMQVNVKKTKVISFYRSEGTTIHQYTLESTVIDRVTSIRDLGIIVDKSL FT TFKEHLSVTISKAITTLGFIIRNTKDFDDVYVLKTLYCSLVRSILEYAIQI FT WAPRQIGELRLLESVQKRFLRYALRSLPWNDPIRLPPYLSRCQLIRLESLG FT GRSEMLRRLFIFDLLQGNINCSSLLAAVPIHAPTRSLREFEFLRIPNYGHR FT TFFNPFVECCRLFNDVYYVFDFNVSKTMFKDRIRNNAQSER" FT CDS join(349..651,627..1613) FT /product="CR1-116_AAe_1p" FT /translation="MVVCELLACPDPEKVARVIQCDGRCRKTFHIACVGVT FT GPHLKAHQENDGVLWLCSDCRRNRGDALLEKXDANRIGLADLAKRMETYFE FT VVVTKLDTHLTLRARYSLNVALIRELASTTATSNALNVELITRTSSNATFN FT FEDSLSAVPDKICPSASENPYDEAIQRSSETAVQVSTSQGGNENDMQRNSS FT PIPVELQSGNPKSSASRIQSTNAKIVDSRSCSHGSATAHQANIPTSNANSV FT SPFSSNPFVINAPPSAYPACYYAPWTPGASGLFCHPYPMATALPRTQQSYF FT GFAHPADSANKRSNVKRKKNNIRGSNVEATFEPKLTAAQNLPSGSHSDQTI FT TYCVRNLSPDTNAAQVQRYIASKGRIPETAISCSSVAKKGKHISFQTFRVI FT IPQCYAGVVSSDDFWPDGVTISVFGRPKAVSLPIEI" XX SQ Sequence 4792 BP; 1413 A; 954 C; 986 G; 1438 T; 1 other; acgtttcgct tggctgttgt tgctgttgct tgtcgctgtt gctgttcacc tcgttctcat 60 tggtttaaca atcccaaaaa ttcctaaaac ataacagtct gttggcggaa tacctttttt 120 gtccggatgt catgaaatgt tcagaataag tggcgaattt tgagataaat tcgtgttttt 180 gtggcaaata aaacagcatt tcacagcatt gtttggctgt gatcaagttt tttattcaat 240 tgcatatatc cgttaaacga ttgttgagct ctcacacgat catcttcaat cacagttcgt 300 tttctgcact caccgttcgg catatactgg tggtatactc tcgttgccat ggtggtatgc 360 gagcttctag cttgtcctga tcctgaaaag gttgcaagag tgatacaatg tgatggacgt 420 tgtaggaaaa catttcacat cgcatgtgtt ggtgttactg ggcctcacct gaaagcacac 480 caggaaaatg atggagtttt atggctgtgc tccgattgcc gccgtaatag aggtgatgca 540 ctgctggaaa aaawtgatgc gaacaggatt ggattagcag acttggcaaa acgcatggag 600 acatatttcg aagttgtggt aactaagctc gatactcact taacgttgcg ttgattagag 660 agttggcatc caccacggca acttcaaatg ctttgaatgt tgagcttatc acccgtacgt 720 cttcgaatgc aactttcaat ttcgaggatt cactcagcgc tgtccccgat aaaatttgtc 780 catctgcaag tgaaaacccc tatgatgaag ctattcaacg atcaagtgaa actgcagtac 840 aagtttcaac cagtcaagga ggcaacgaga atgatatgca gcgaaattcc tcacctattc 900 cagtagaatt gcaatctggc aatccaaaat cctctgcttc aaggatacaa agcacgaatg 960 caaaaatcgt cgattcacgg agttgtagtc acggcagtgc aacggcacac caagctaaca 1020 ttcctacaag taatgcaaac tctgttagtc catttagttc caatccgttt gttatcaatg 1080 ctcctccaag tgcttaccca gcttgttact acgccccatg gacacctggt gcttctggtc 1140 tgttttgcca tccttatcca atggctacgg cgcttcctag gactcagcag tcatattttg 1200 gctttgccca tcctgcagat agtgccaata agagaagcaa tgtcaagcgt aagaaaaaca 1260 acattcgtgg aagtaacgtg gaagcaactt ttgaaccgaa gttaacagct gctcaaaatt 1320 tgccatcggg ttcacattcg gatcaaacta ttacttactg tgtacgtaat ttatcccctg 1380 acaccaacgc tgcccaagtt cagaggtata tagcatcaaa aggcaggatt cctgagacgg 1440 cgatttcttg ctcaagtgtg gctaaaaaag gaaaacatat aagttttcaa acatttagag 1500 tgataattcc gcaatgctac gctggtgttg tatcatcgga tgacttttgg cccgacggtg 1560 ttactatatc cgtttttggg agaccaaaag cagtttcctt gccaatcgag atatagccat 1620 catcttactt accgttttca cctgatcatg tcgtacacag aacaccattg gaaaacaact 1680 ctccagtact catcaacgtt aacgatctgc aacctcttcg cgactctcct cctctacata 1740 taagagatga tttattaccg cctcgacaaa ccagtgagta tcctcgttct acacttactg 1800 aagaacagtg tacatccacc gatagcatac aaccactacg cagttcatgc tgtgctcata 1860 ctgctgatcg taatgaatcg caccaaacaa attattcgat atattatcaa aatgttcgtg 1920 gcctccgcac caagacaaga gatttatatt tggctttggt agaatgtgat tatgacattg 1980 ttgttttcac tgaaacctgg ttgcatgctg gtattgctag cgcagaattt tctgccaatt 2040 attcagtcta cagatgtgat cgtagctccc atacaagcca acactcgaga ggtggcggtg 2100 tattaatagc cgttaaacca catgtacgct gtaaagaagt gttttttaaa cattctggaa 2160 ttgagcaagt atgcatagag gtaaagctca acgattgctc gctgtttata tcttctgtat 2220 acatgagccc aagcttgagc acagacagct atgaacgata tatcaaatgc atgaatgaat 2280 tactcgagcg gataactgat aaggacacat tgttgttgat gggtgattac aatctgccaa 2340 atgttgaatg gcaatttgac gacgatctgt gttgtttcct tccattgaat caattcgatg 2400 aaaatagcac gggagtctta acgaaggaat catgtattat ttctggaatg tttgatggcg 2460 gtctgcatca aattaatgga ataagaaatg tgaacgatag attactggac cttgcattcg 2520 tgaacaccat ggactgttgg gatctatctg aagcagttgt tccaattatg ccacttgatg 2580 cgcaccatcc agctctactt ctaaatatca tgtgtagcac gcagacactc aatctgttgg 2640 atttagaagg cgcttttgat tttcgttctt gcaacttcgc cgaagtaaat gcatctttaa 2700 gtgagataga ttggaacact ctcgcgcgct cattgtctgt tgatgatgct gtagatgcat 2760 tttacaggac tatatacttc acaatcggaa atcatgttca gcgcagacga cctctagtga 2820 ctaaaaaagt tccgtggtgg aatagcgagc tttcattgtt gaaaaatcgc ctgagaaagg 2880 cccgtaagaa atacagccgt tctccatctg tagaatcaaa aacgtggtta cggtcatgtg 2940 aagctgaata tcaagttctg caccggaatg tgtatagaac ctatattaca gacattcaac 3000 gaaaattgaa aactgatcca gcacactttt ggagttatat tcgtagcaga agaggaacca 3060 acggcctacc aataagttta tcgcttggag atgaaaaatc tgaatctagt caggaggcag 3120 taaacttatg tgcggattat ttcaagtcaa cttatgtaca tcatgaacct cttaccacag 3180 atgagcacct caggtttgtg gagcaattcg acattaatat accgcatctg gaggtttctg 3240 tccaagacat cgaggaagcg ttagcaagga tcaatgaaca aaagggtccg ggtcctgatg 3300 gtttatctcc aacatttctg aagaattgca gacatacgtt agcgctacct gttagtatat 3360 tgtttaatat gtcactatct acgtccagat ttccgtcagc gtggaagatt gcttcaatgg 3420 ttccagtaca taaaagtgga agtacacata aggtagagaa ctacagagga atatctatat 3480 tgaactgtct cgcaaagttg ctcgaacgtt tcgtttatga tttgattaaa tctacaactg 3540 accatatagt atctgagtgt cagcacggtt ttgtaggaaa aaggtcaact gtcaccaata 3600 tgatggtttt tgtacctttt gtaagccaat cacttacgga acgttaccaa gtcgacgcta 3660 tatatattga ctttgcaaag gcgtttgata ggctatcgca cgatataatc gttagcaaaa 3720 tgcatcatct tgggttccct ttatggttat gcacatggct tcgttcatac cttgttgatc 3780 gtaaagcgtt tgtcaaatgt ggcactgttt catcagacac ttttgctatt aagtcaggag 3840 tacctcaagg cagccattta gggccattgt tatttacatt gtccattaat gatatctgtg 3900 taataattag atcagagaaa attttgtttg ctgacgatct caaaatttat agaaaaatca 3960 aggatagtgt agattgtgtt gctctccaaa atgatttaca gtcattaaat gaatggtgca 4020 aagtgaacag aatgcaggtc aatgtgaaga aaactaaagt catctccttt taccgtagtg 4080 aaggcaccac cattcatcaa tacactctcg aaagtactgt aattgataga gtaacttcaa 4140 ttcgcgacct gggaataatt gttgataaaa gtcttacatt caaagaacat ctctctgtaa 4200 caatttcaaa ggccattaca acgcttggat ttattattag gaacactaaa gactttgatg 4260 atgtatatgt acttaaaaca ctatattgtt cgcttgtaag aagtatattg gagtacgcta 4320 ttcaaatttg ggcaccacgt caaattggtg aacttagatt gcttgaaagc gttcagaaaa 4380 ggttcctaag atatgcgttg cgctcccttc catggaatga cccgatacga ctcccgccat 4440 atcttagtag atgtcagctg attagactag aatctttagg tggtagatct gaaatgctgc 4500 gtagattatt tatttttgat ttactgcagg gcaatataaa ctgtagctca cttctagcag 4560 ctgtgccaat tcatgctcca accagatcat tgagagaatt tgaattctta agaatcccga 4620 actacggtca taggactttt tttaatccgt ttgtagaatg ttgtaggtta ttcaatgatg 4680 tttattatgt gtttgatttt aatgttagta aaactatgtt taaggataga ataaggaata 4740 atgcgcagtc tgagcgatag aaatatctta gacggtggtc aataaacaat aa 4792 // ID Mariner-35_HM repbase; DNA; INV; 3616 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-35_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3616 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 393-393 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1747..2859 FT /product="Mariner-35_HM_1p" FT /translation="MSKWLLTQDGPNSATYGVTQSGWMEGPAFAGWFEKTF FT VPHCRKLEGSKVLFLDGHASHVTLELIHLAKEENIIIFKLPAHTSHVLQPL FT DVGVFRTVKSKWRGILNKHFIQNGFKDLTKPRYPGLLKQLIDQSFLSDNVR FT SGFKATGIFPLCREKMTNEKLAIGSIFAQSTQALPTLPIPVTTIPEIPSSI FT QKHRSSSTQTSSSSLKQIPPLTCCTTACEHLKTSILNALHVGLPKASVSES FT KLNVSNNNMRLMKSYIMTSEEALIELTNKQTMAQNKKNEIQARKVLREKTK FT QEANSIKTQKRLAKEAGLFNKPTIKKRKRQLVGVIETPIMSESEATSIYQH FT CKSDGSLNSKKCQACDIFFGDETAAQQ*" XX SQ Sequence 3616 BP; 1292 A; 555 C; 587 G; 1181 T; 1 other; ccgtaaaacc gcctaagttc ggtcaccatg taacttcggt catttaagaa aaaatataaa 60 aaaagcatgc aaaactcaaa ttttacaaac ttttttttct caaataatat ttgtaatcag 120 ttcgtaaata taaaaaaaaa aaaaataata tttatatgca acctgctgaa ataattcttt 180 agcaaaacaa caatttgaaa aaatgcatac tattaaaatc taaaataagc aaactattaa 240 aataaatgat catatttaat cttagcatta ttaagaatca ccaaattaat tatatttaca 300 aaaaaaaaat aattaatttt taaaaattaa ctactttttt taataaaaat agtgaaattt 360 tgaaatactc taacttcggt cattcatgga aaaacaacga aggaaacaat tagcacgaat 420 gttgtaaaat gttagtatgt taaaaatgtg acttagtgaa tatgaacttg tgttgcaatt 480 gcgtaaattt acaggtaact ttacctgtaa atttaccagt aagttttgac atttttataa 540 tggatactca ccatggtatt ttatttgaat aaagaatttg tttataattt taaaattacc 600 ctgttttaaa acaaaattta ctggtcaatt tgcaggtaaa ttttgatttc aaaaaatact 660 ggtaaattta aatcactgaa tatgaagtaa ttacaaaaca ataatatata ttgcaacgca 720 ttgaaatagt tttttgtcaa gttaatacat ctatgtcatc tatggattta cgtgagaaag 780 ctcggatggc tcgtggtttg aaggctgtgg ataaaggtat gagtaaaagg aaagcagcta 840 taacatttgg ctttaaaaga tcaactttca tcgacagaag taacggcaaa catactaaaa 900 aaaatggcaa accaactaag ttgagcgaaa acatcgaggg cattttagta aaaattttaa 960 tatttatgag tgatattggt tttagtttga ctagacttga aatactgtgt gttgttaaca 1020 actatttggt taattcaaat caaatgtgtt ttaacggcgg cgaagttaca cttgcctggt 1080 attactcatt tattagacgt cattgtgaag agctatcgct cagaacttca aataatatgc 1140 cgtcaaaccg tgcaatgtcg actcagccag gcactattga tcgatggtat ttattaaaat 1200 tatatgttta tatgtaaata tatttcataa ttaattgtta tttaataaaa aaaaaaagtg 1260 caatttttaa aaggtaaatt tcaatttgat tgtaatgttt acaaattgtc ctttttttta 1320 taaactgctc aatttatagc attttaataa cctactttat aattttttat ttatttctta 1380 aatcaaggtt tgcacaagtg tctgatatgt acaatataca tggattcgca actaagccgt 1440 tccacatttt taattgcgat gaatctggtc ttcaatgtga tcaaggcaaa tttaaggttg 1500 tgtgtcggaa aggttcaaaa gggcctaaga aactttgcag tagcaacgaa aaggcttcag 1560 tcacaatact cgtttgttgt gatgcttttg gaacattctt gcctgttcat gttttattcc 1620 aaggccaaag gtatttatga tgaatttggg tatgtatttt aagaaggatt taaaacagat 1680 gtgccaataa tccggaaaac tctgcatttt aaaaactact atttaattat ttgaaatttt 1740 agattgatgt ccaaatggct tctaactcag gacgggccta acagtgctac ctatggtgtc 1800 acacaatctg gctggatgga aggacctgct tttgctggat ggtttgaaaa gacatttgtc 1860 ccacattgtc gcaagcttga aggatccaaa gtcctttttc tagatggcca tgcctcacat 1920 gtaacgctgg aactaattca cttagcaaaa gaggagaaca ttattatttt taagctaccg 1980 gctcatacat ctcacgtgct acagccactt gatgtaggcg tatttaggac ggtgaaaagt 2040 aaatggcgcg gaatcctaaa caaacatttc attcaaaacg gttttaaaga tctcacaaaa 2100 cctagatatc ctgggttgct caaacagctt attgaccaaa gcttcttgag cgacaatgtg 2160 cgttcgggct tcaaagcaac aggtattttt cctttgtgtc gggaaaaaat gactaatgaa 2220 aagttagcta ttggatcgat ttttgcgcaa agtactcaag ccttaccaac actgcctata 2280 ccagtcacca caattccaga aattccttca agtatacaaa agcatagaag ttcaagcaca 2340 caaacttctt cgagctcatt aaaacaaata cctcctttaa catgctgcac aactgcttgt 2400 gagcatttaa agacatcaat tttaaatgct ttacatgttg gcctgcccaa agcttccgta 2460 tcagagtcaa agctaaatgt ttctaataat aacatgcgcc tcatgaaaag ttacattatg 2520 acttctgaag aagctttgat agaacttaca aataagcaaa caatggcaca aaataagaag 2580 aacgaaatac aagctagaaa agttttgcgt gaaaaaacaa aacaggaggc taattcaatt 2640 aagactcaaa aacgtttggc aaaggaagct ggtttgttta acaaacctac tattaaaaag 2700 cggaaacgac aactagtggg agttatcgaa actccratta tgtcagaatc cgaagcgact 2760 tccatatatc aacactgtaa atctgatggt tctttgaatt caaaaaaatg tcaagcttgt 2820 gacatttttt ttggagatga aactgcagct caacaataat taaggatgcg ttgcaaatcc 2880 tgtgacaatc ggatcttgga tggatcttgc cgttctaatt tggtgaacga tatgtgcaag 2940 aattgccaat aaactagtat ttttttataa aaaaaaaatt tctttttaac tttattgaaa 3000 aatatctttt taattttatt aaccagcggc gtagcgaacg tagcacgtag atgcccccaa 3060 gaagtcctta tggacttttc tgtgataaga ccataaggac ttcttagggc acttgaacaa 3120 caaaaaaaag gggataaaac ccaaagtaat aaaatagttg actaattata ataaaaataa 3180 aaaataatac tatttatact taaaaaggcc tttttttatc tgctccatgt tgttttcttt 3240 ggcaacccaa aagacttttt tttagagtcc gggacgacgc cctccttggt tactggtatt 3300 acctgatatt atatcatcat taaacaataa aacaacattt aaaaattatt ttttattatt 3360 cgtgacgtga tgcgttatta ggatttatat agcgaccgaa cttatgagat ggtattgaaa 3420 gttcggtcaa agtagatgtt ctcatttgaa attaaatatt gagtactagt ttttattttt 3480 cattttttta tttttacaat atcgattatc aatataattt tctgtatgtt tcagaaaaaa 3540 aaattgaaaa tttttaattt atgcgtgata aataaatgtt caaagcttaa gtgaccgaac 3600 ttaggcgatt ttacgg 3616 // ID Gypsy-70_CQ-I repbase; DNA; INV; 4457 BP. XX AC AAWU01039681; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-70_CQ_; KW Gypsy-70_CQ-LTR; Gypsy-70_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4457 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 519-519 (2011). XX DR Genome; AAWU01039681; Positions 11609 7153. XX CC Positions [3439-3795] - Integrase core CC 'CTGGA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 226..3795 FT /product="Gypsy-70_CQ-I_1p" FT /translation="MPDDEKKPGVSSASPSGADKRKGKFPVFVPGHNVNSY FT MQTVDMYFTLNKTPEDEKKLEFITSVGQDTANHISGSFKPTKIVDKTYDQI FT IEKFKQLYEEKKNVFAERYRLITRKQALGESLDDFAIDLQDIVEHCGVKTD FT TEAILVQSIFVAGLKNDSTREAMLRDGNEKLDLAQLLEKAKGLEVATRESR FT KMSQNEFLEVNYVDRGGMHVGHSRNVVKRGKFGEAAASTSKNSDLYSHRGT FT FKPSSATVCYNCYNKGHLSYDCPFPKAKKPKSSVPPQRNAGYRKRAFEERI FT NQLTAEMEQLKSSLQDDLELSEKLSETESEEDSTNFVNNILLGESSTAPAF FT VKLKINGKIHTMECDTGACATICSFQTYYENFSKQTLSPLNKTFNVISGDK FT VSVLGTLPVKVQLKDKIVELTLVVIDSQKEFVPLIGRDWLNIIWPNWRKSF FT ALNSVKSESREEWTKHVVNNLKREFYKAFDEDLTQPIKDVVVDIKIEDNAK FT PFIHKPYTVAFKHKDKVAKYLDDLETKGIITKVEYAEWASPIVVVVKPNKT FT DIRICMDGSKTVNPHITTHHYPLPLIEELITNKSGARVFALIDLRGAYQQL FT VVSEQTKKLLVINTHKGLYAFNRLPFGVKPAATLFQCAMDNILNGIPNVQA FT YIDDILVWANSDNELLGRLKVVLKKLAEHNVKINTDKCKWFVPQVKYLGHI FT LSEAGVSPNPEKVEAITAVPVPETKTQLKAFLGMITFYNKFVSKLNLLLTP FT LYNLLTKDAIWNWTTQCQTAFENSKRAICSAQILTHFDPTKLITVTCDASD FT EGISGVLSHNIMGKEKPVFFISRRLSKAERKYPILHREALAIVFSMEKFYK FT YVLGQKVLIVTDHKPLVGVFDNKKNAPIVIANRLERYFSRLSIFEFTIIYK FT PGKDNHVADCLSRLPIQQQICQADLDESKRSSPNCLNFLVENHEFNLNAKL FT IAEQTKHDPILSQIIQFIENGWPSRISCKSVKNFYSKKHELNIEANCLIFR FT NRVVLPTSLKMYALQLLHANHRGIQKMKQLSRQFVYWEGIGVDIEVFANSC FT KICQLKGIDRKPQIYGNWPLASTTFERVHIDFFQKYNRTFLILVDAHSRWV FT EIIRMNKTTAEEVAQELDTIFATFGFPGSMVSDNGPPFSSQKFSSFCKARN FT IEHIFSPPYHPASNGLAERAVQTTKAFHRYR" XX SQ Sequence 4457 BP; 1592 A; 828 C; 890 G; 1147 T; 0 other; ttctggcgac gaggattaaa atcaaccgag tgtgataagc tcaactgaat ttagataagt 60 tgcgtgcttt gaaagtgggg gataagagta acctgtggtt ccacgtggtg gttgattaac 120 agtgcacgtg aacgccagtg aaaagcgtta gctacgtgac cagcgagaaa ggggcgtgaa 180 agtttcccgg aaaattaaaa aaaaaagtgt aaaattgatt taaatatgcc agatgacgag 240 aaaaagcccg gagtgtccag cgcgtcacct tcgggagcag acaaaagaaa aggaaaattc 300 cctgtgtttg tcccggggca taatgtgaac tcatacatgc agaccgtgga catgtacttc 360 actttgaata aaactccgga agatgaaaaa aaattagaat ttattacaag tgtgggtcaa 420 gatacggcaa atcacataag tggtagtttc aagccgacaa aaattgtgga taaaacgtat 480 gaccaaataa ttgaaaagtt caaacaatta tatgaggaaa agaaaaacgt gtttgccgag 540 cgatatcgat taattacgcg aaagcaagca ttgggcgaat cgttggacga tttcgctatt 600 gatctacaag acattgtgga gcattgtggt gttaaaacag acaccgaagc aatcctagtg 660 caatctatct ttgtggcggg gttgaagaac gatagcactc gggaggcaat gttacgcgac 720 ggtaacgaaa agcttgattt agcacaattg ctagaaaagg ctaaagggct agaagttgcg 780 acacgtgagt cgaggaaaat gtcgcaaaac gaatttctgg aagtgaacta tgtggatcgc 840 gggggcatgc acgtagggca ttcacggaac gtggtgaaaa gaggaaaatt tggggaagcg 900 gctgcttcca ccagcaagaa tagtgacctg tacagccacc gtgggacgtt caaacctagc 960 agtgcaactg tatgctacaa ttgctacaat aagggccatt tgtcgtatga ttgcccgttc 1020 ccgaaggcca agaaacctaa gtccagtgta ccaccacaac gaaatgcggg gtacaggaag 1080 agagcattcg aggagcgaat taatcaactc acggcggaga tggagcaact caaatcatct 1140 cttcaagacg acctggagct ttcagagaag ctttcggaga cggagagcga ggaggattct 1200 acaaactttg ttaacaatat acttttgggt gagtcaagta cagcaccagc ttttgtgaaa 1260 ttaaaaataa atggaaaaat tcataccatg gagtgcgata ctggtgcttg tgctacaata 1320 tgttcatttc aaacttatta tgaaaatttt tccaaacaaa cattatcacc actaaataaa 1380 acattcaacg ttatatcagg tgacaaagta tcggtattag gtaccttgcc tgtaaaagtc 1440 caattaaagg acaaaattgt tgaattaacc ctagtagtga tcgattccca aaaagaattt 1500 gtgcctttaa ttggccgaga ctggctgaac attatttggc caaactggag aaaaagcttc 1560 gctctcaact cagttaaatc tgaatccaga gaggaatgga ccaaacatgt ggtgaacaat 1620 ttaaaaagag agttttataa agcttttgat gaggatctca cgcagcccat caaagatgtt 1680 gtagttgata ttaaaattga agataacgct aaaccattta tccacaaacc ctatacagtg 1740 gcattcaaac acaaagataa agtagctaaa tacttagatg atcttgaaac aaaaggaatt 1800 atcacaaaag tagaatatgc cgagtgggct tctcccatag tagtggtggt gaaaccaaac 1860 aaaactgaca tcagaatttg tatggacggt tctaaaactg ttaaccctca tataacaaca 1920 catcattatc ccttgccgct catagaagaa ctgattacta ataaaagtgg agcaagagtt 1980 tttgcattga ttgatttacg aggcgcatac caacaacttg tggtttcaga acaaacaaaa 2040 aaattattag taattaacac ccacaaaggt ctttatgcgt tcaaccgact acctttcggg 2100 gttaaacctg cagctactct tttccaatgt gcaatggata atattctcaa tggaattcca 2160 aatgttcagg cttacataga tgacatcctt gtatgggcaa attcagataa tgagttactt 2220 ggtaggctta aagttgtgtt gaaaaaatta gctgaacata atgtaaaaat caacaccgat 2280 aaatgcaaat ggtttgtgcc ccaggtgaaa tatttgggcc acattctctc ggaggcagga 2340 gtttcgccaa atcccgagaa agtggaagcg ataacggccg tgccagtgcc agaaaccaaa 2400 acacaactaa aggcatttct cggcatgata acattttaca ataaatttgt gtcaaaactt 2460 aacttattac taacaccact atataacttg cttacaaaag atgcgatatg gaattggaca 2520 acacagtgcc aaaccgcatt tgaaaacagc aagagagcta tctgtagtgc acaaatactt 2580 acacacttcg atcctaccaa gcttattact gtaacgtgtg atgcaagtga tgagggcatc 2640 tctggggttt taagccacaa cataatgggg aaagaaaagc cagttttctt tatttcgaga 2700 agattatcca aagcagaaag aaagtatccc attttgcaca gggaggctct cgctattgta 2760 ttttccatgg aaaaatttta caagtatgtt cttggtcaaa aagttttgat agtcactgat 2820 cataaacctt tagtgggtgt tttcgataat aaaaagaatg caccgatagt aattgcaaac 2880 cggttagagc gctatttctc acgattgtcg atttttgagt tcaccatcat ctacaaacca 2940 ggcaaagata atcatgtagc agattgtctc tctaggttac caatacaaca acaaatatgt 3000 caagctgatt tagatgaaag taaacgaagc tctccaaatt gtttgaattt tttagtggaa 3060 aaccacgagt ttaacctgaa tgccaaatta attgcagagc agacgaagca tgatccaatt 3120 ctctcccaaa ttatccaatt tatcgaaaat ggatggccat ctagaatatc ttgtaaatct 3180 gttaaaaatt tctattcaaa aaaacatgaa ttaaatatcg aagcaaattg tttgatattc 3240 aggaacaggg ttgtattacc aacatcatta aaaatgtatg cacttcaact actgcatgca 3300 aatcacagag gaattcaaaa aatgaaacaa ttatcaagac aatttgttta ttgggaggga 3360 attggcgtag atattgaggt ttttgctaac tcttgtaaaa tttgccaatt aaaaggaatt 3420 gatagaaaac cacaaatata cggcaattgg ccattagctt caacaacatt tgagcgagtg 3480 catattgact ttttccagaa atacaatcgc actttcttaa tattagtaga cgcacactca 3540 cgatgggttg aaataataag aatgaacaaa acaacagcag aagaagtggc ccaggaactt 3600 gacacaattt ttgccacctt tggttttcct ggctccatgg tgagtgacaa tgggcctcca 3660 tttagtagcc aaaaattctc atcattttgt aaggcaagaa atattgaaca tattttttca 3720 ccaccgtacc acccagccag caacggactt gctgaaagag cggttcaaac cacaaaagcc 3780 tttcatcgtt acagatagac aataatattc gcacatttct acatcatcat caccaaactc 3840 ccacaactgg agaccgaatt attccaaacg agagagtttt caactttatt ccacgttctg 3900 aacttataaa tttaagagaa aagaaaacag ttttctgcac gaattacgaa aatgaaaaca 3960 ctagaaaatt taagaaaaac gataaggtaa tttacactta taagatgaat ggtaagaagt 4020 tcactcatga agctgtaatc ataaagccaa tgtcagaatt aatttttctc attcaaattc 4080 agggaaacat tcgcaaagct catgtaaaca aattaaaaag aatcccacaa gttccattta 4140 cattaaaaat caggccagat aattctactc ccatacaccc aaacaacaca tccccaacat 4200 tgtcaacaaa cagttctgat gaagcagaca acacaaaaga ttcgtcagct gatgtagagg 4260 caaacaataa agaatcatca actgaagagt cactttcatc agctgacgaa gaaacagaac 4320 caaaacaaaa agacaaacca attcggcgat ccaaaagaag caaaagttct aggcacagct 4380 ctctcaattt gaaagttctt ggtaaaaaga aatgatcaag gttgtcgaat tagggttttg 4440 tcctaaaagg ggggaga 4457 // ID Chapaev-17_HM repbase; DNA; INV; 3265 BP. XX AC . XX DT 11-MAR-2009 (Rel. 14.03, Created) DT 11-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3265 RA Jurka J.; RT "Chapaev transposons from the hydra genome."; RL Repbase Reports 9(3), 651-651 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(750..998,992..1510,1847..2896) FT /product="Chapaev-17_HM_1p" FT /translation="MVYKSLYFRYNKKCVFSVFLNCFIFPDRMSLFKPTCT FT HEENRRKVRLYCAKKTNKPNDIDVPWIKNLVNFKYNIKNSFYPKAISYLRI FT LQKQYKKRKRALHVKWNKMKIPVQTRSVFVCKCXICIIGSTQGRPKKFKXG FT RICNAQPPFNSLTCQTCFSIISKGITHKSSLAVLKCNIIQKLANNVKTIEQ FT IANSLIKDKIDKKGKAILATGGKKVSFTIKRNHDTSNIQQLSINSMIDIQS FT TLELSNTKILKLAKENIQVCLNIVSNPSETIKNKRRKYGEIKSPNFKEGSV FT KASFIIALVPDAPETNHNVKILLNKLCLDKINFTCVADIKLCQVMCGMSPS FT SMATHPCYICTWNRTEVFKNDCTELRTLGSLRLSSKLYKLSGSKKIRAKEY FT FNCIDEVLLIGDDNDLVVNHLSPPELHLMEGATNHIFKTINKLADPVLLKK FT IENDLNISKSIRNGSYNGNGCQKILKNWVKYYNLFPVLLKPLAEAFRLLDN FT LVHSCFGVNLRPTFEKDIRLFKNHYSTLVEDYEMTVTVKLHVIFQHVGPFC FT KDKDIGLGFFSEQAGEALHYSFLKEAWERFKVSDNHQKYAKYLKKAVVSFN FT GKHLNNV*" XX SQ Sequence 3265 BP; 1206 A; 474 C; 535 G; 1040 T; 10 other; cacacccacg ctgaaccaga ttttagtttg atgaaatgac atagggttgt gtcatacctt 60 tttgaaattg ttagaaccat accattaagg gaaagttagt aaaaaaaatc ataaaactaa 120 tctttaaaaa atatttgggt ttatactggg gtcaaaacgg ttcccgaact ttggcccccc 180 tctggttgat cgtgagaccg acccgagggg ggaaarcaac tgttttatat tttgttaaat 240 aatataacta gttattagtt gccccttcta gatggtctaa aaaacawccc gaggggcacc 300 aaagttcatc acctattttg acccgtgtgt atataatcat tattaaactt tataaataag 360 tttataaaga acgaataata tattttactt waaacaaaat atcgaaaatc actttatcaa 420 tctagtcaaa tatgttttca ttgaaaaaat aaaataaaaa agmatacaaa tattgtacag 480 tagktttata ctgaacaata gttttatact gaacaataca aaacagaccg caactcatca 540 ttaccacacg tgtaatcaaa cttcttaatt ctgcatattt tcattttgtt ttaactttaa 600 aaattgaagt ataataaaca acgcttgtaa gtaatgttaa atggtattta ataaggaacc 660 cgtataccgt aatagtcagc atttatattt gtattatatt atgttatatt tgtatagtgt 720 gtttgaaaca ttttttaaag aattattata tggtatataa atctttatat tttaggtata 780 ataaaaagtg tgttttctct gtgttcttaa attgttttat ttttccggat agaatgagtt 840 tatttaaacc racctgtact catgaggaaa acaggagaaa agtgcgttta tattgtgcca 900 aaaaaacaaa taaacctaac gatatcgatg taccgtggat taaaaattta gtgaatttta 960 aatacaacat taaaaattca ttctatccta aagctatttg agaatcttgc aaaagcagta 1020 taagaagagg aaaagagctt tacacgtaaa gtggaataaa atgaaaatac cagttcaaac 1080 tagaagtgtt tttgtatgta aatgcwaaat atgtataatt ggatctactc aaggaagacc 1140 taaaaaattc aaaargggac gtatatgtaa cgcgcaacct ccatttaata gtctaacttg 1200 tcaaacttgc ttttcaatta ttagcaaagg aattacccac aaaagttctt tagctgtact 1260 taaatgtaat ataatacaaa agcttgcaaa taatgtcaag acaattgagc aaatagctaa 1320 ttctttaata aaagacaaaa tagacaaaaa gggtaaagcg atacttgcta ctggtggtaa 1380 aaaagtaagt tttacaatta agagaaacca tgatactagt aatatacaac aactgtctat 1440 aaacagtatg attgacatac aatcaacttt agaattaagt aacacaaaaa ttttaaagct 1500 agcaaaagaa taacgattca aagcacaaaa cagaaaaatt gtacaatcag ggttagcagc 1560 agagctcaaa cgtattgacc agcaacttga tgattgcttt gttgaggatt ttgaactttt 1620 taatattaaa tctgtagagg aaaaaagttt gtttaaactc catttatcag aacaacagat 1680 gtcaatgagt ttctgtggca aataatagat aaaaaaagga gcctatcctc aaaaccagta 1740 tatacaagtt tccatttacg gaggtcgggg atttcttaag gtaatttgat tttataatgt 1800 gccacttttt tttctaatta aaaataataa gtgtaattta ttataaaata ttcaggtttg 1860 cttaaatatt gtatcaaatc ctagtgaaac aattaaaaat aaaaggagaa aatatggaga 1920 aattaaatca ccaaatttca aggaaggatc tgtgaaagca tcatttatta tagcactggt 1980 accagatgcg ccagagacaa accataatgt taaaattctg cttaacaaac tctgtttgga 2040 taaaataaac tttacttgtg tggcagatat caaactctgt caagtcatgt gtggtatgtc 2100 tccgagttcc atggcaactc atccgtgtta tatttgtaca tggaacagaa cagaggtttt 2160 taaaaatgat tgcactgagc tacgaacact tgggtcacta agattaagca gtaaattgta 2220 taaattgagc ggatcaaaga aaattagagc aaaagaatat tttaactgta ttgatgaagt 2280 tcttttaatt ggtgatgata atgatctagt tgttaatcat ctttctcctc cagaacttca 2340 tttgatggaa ggggcaacaa accatatctt taaaacaata aacaaattag ctgatccagt 2400 actattaaaa aagatcgaaa atgacttaaa tatttctaaa tcaataagaa atgggtcata 2460 taatggtaac ggttgtcaaa aaatattaaa aaattgggtg aagtactata acttgtttcc 2520 agtcctatta aaaccattgg ctgaagcgtt caggcttctg gataaccttg ttcactcttg 2580 ttttggtgta aatttaaggc caacatttga aaaagacatc aggttattta agaatcatta 2640 ttcaacctta gtggaagatt atgaaatgac agttactgtg aagttacacg tcatatttca 2700 acatgtaggt cctttttgta aagataagga cattggattg ggtttcttta gtgaacaagc 2760 aggggaagct cttcactact catttctaaa ggaagcatgg gaacgtttca aagtgtctga 2820 taaccatcaa aagtatgcaa agtatttgaa aaaagcagta gttagtttca atggaaaaca 2880 tttaaataat gtttaataaa tattttctaa aaataatgtt tttttttccc ttttaatttt 2940 aagaagaaaa aagtacyygg cattggcgcc cctcagactg tttttaagac ccatctgagg 3000 gggacatata acgactagtc atattttctg acaaaatgta caacaattgc atgcacccct 3060 gtggtgggtc ttgatatcag ttagaggggc gcctaagtcc gggaccctat ttgacccctg 3120 tataaaccca aatatttttt aaagattagt tttatgattt ttttcactaa ctttccctga 3180 atggtatggt tctaacaatt tcaaaaaggt atgacacaac cctatgtcat ttcatcaaac 3240 taaaatctgg ttcagcgtgg gtgtg 3265 // ID Gypsy15-SM_LTR repbase; DNA; INV; 108 BP. XX AC . XX DT 02-MAR-2008 (Rel. 13.03, Created) DT 02-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE LTR retrotransposon from freshwater planarian: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy15-SM; KW Gypsy15-SM_I; Gypsy15-SM_LTR. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-108 RA Jurka J.; RT "LTR retrotransposon from freshwater planarian."; RL Repbase Reports 8(3), 246-246 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 108 BP; 48 A; 3 C; 14 G; 43 T; 0 other; tgtataataa agaattaatt agaaaagaaa gttgtttaaa taattaaatg ttttagaaaa 60 tagagatttt gaaattagta aattcttgtt gttaattatt atcataca 108 // ID Kolobok-12_HM repbase; DNA; INV; 2787 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2787 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 421-421 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 439..2223 FT /product="Kolobok-12_HM_1p" FT /translation="MPNKQNSTRKKSRKWNGFKPSDILKSKPADHEISCVS FT SRKLDVSFSSDKCDSDDYFILINFLLLKKLIAHTVCPNCFIGNLLIKDNVN FT SRMGFCHLLEMKCTKCNFSKNFRTSPKSKNSLNTNEFPIDTVFPIYKKELS FT SLESPYEINLRAVIGLREIGAGHESMKTFSFCMNLYCLSPNGFNKLNKTAM FT LVYKIAAEESMQKAAFETKSINNSLEMKEVRISIDGSWQKRGHNSLNGVVT FT AVCSDKCIDAEIFTKHCNGCKMWRSKKGTPEYQCWLVDHQCESNHKGSSGS FT MESVGAVAMFKRSLQKNKLIYKEYLGDGDTSSFNDVIEADPYKEYGIMPVK FT LECVGHVQKRLGTRLRNLVKAQKGTKKPISGRGKLTENCINSMQNFYGLAI FT RSNVGNLYSMKKAVYAILFHFTNFPDSYTRHQFCPRGKYSWCKYWALNQKN FT YKPKSTIPLWIKDLILPIIINLQSDELLSKCLHGNTQNANEALNGLIWARV FT PKRVFVSKSTLEMGTYSAILAYNDGAKGVISVFRHFGLHGKVSNISATSKN FT KKRISQMRCKSSEKGKKRRKTLRAIKKGHLDEEKAKECNDSYVSGGF*" XX SQ Sequence 2787 BP; 973 A; 388 C; 465 G; 961 T; 0 other; ggtggtacaa caccaaaaaa actagaaaac taaaaaattt tttgagtttt tcaattttgc 60 ataaaatatg tgtcaaatta atcaaagaac gtttagccaa agtttcagag ctaaataatt 120 tatctttctt tagatatgag cattttatta gaacactgtg tatctttttt tccttagcaa 180 cggagatact aaatatttaa catggttgtt agggcaattt ctccctaaaa ataagctcaa 240 ttttactgga gtcttcaagc tttgtaagct tgatgtcact tctatgtttt ccgaattgaa 300 attcaattaa aagtcattgt tcttaaccac taaacacgtt ttgtaacagc taacagattt 360 gaaagatttg gagtgttatt gtttgtttac cctttggatt tatattatat atatattata 420 taaatatatt taaaagaaat gccaaataaa caaaatagta ctagaaagaa gagcagaaaa 480 tggaatgggt ttaaaccatc tgatattctt aaatcaaaac ctgcagatca tgagatttct 540 tgtgttagct caagaaagtt agatgtatcg ttcagctcgg ataaatgtga ctctgacgac 600 tattttatac ttattaactt tttattgtta aagaaattaa ttgcacacac tgtttgcccc 660 aattgcttta ttggaaattt gttaatcaaa gataacgtta attctcgcat gggcttttgt 720 catttacttg agatgaagtg tacaaagtgt aacttttcta aaaattttag aacttcacca 780 aagtcaaaga attctttaaa cacaaatgag tttcctattg atactgtatt tcctatttat 840 aaaaaagaac tttcatctct agaatcacca tatgagatca acctgcgagc tgtcatcgga 900 ttgcgagaaa taggtgctgg ccatgagtcg atgaaaactt tttccttctg tatgaatctt 960 tattgtttgt cgcctaatgg ctttaataag ttaaataaaa ctgccatgtt agtatacaaa 1020 attgctgctg aggaaagtat gcaaaaagct gcttttgaaa ccaaatcaat aaacaactct 1080 cttgaaatga aagaagttag aatttctatt gatggttcat ggcaaaagcg aggtcataat 1140 tcgttgaatg gtgttgtaac tgctgtttgt agtgataaat gtattgatgc agagatattt 1200 accaaacatt gcaatggatg taaaatgtgg agatctaaaa aaggaactcc agagtatcag 1260 tgttggttgg ttgatcacca gtgtgaatcg aatcacaagg gttcatcggg tagtatggaa 1320 tctgtaggag ctgtggctat gtttaagaga tctttacaaa aaaataagtt gatatataaa 1380 gaatatctag gagatgggga tacatcttct tttaatgatg taatagaagc agatccttac 1440 aaagagtatg gtattatgcc tgtaaaactg gaatgtgttg gacatgttca aaaacggtta 1500 ggaacacgtc tacgaaacct tgtcaaagcc caaaaaggta ctaaaaaacc aatatcaggc 1560 agaggtaaac tcactgaaaa ctgcattaat tcaatgcaaa atttctatgg tcttgcaatc 1620 agatctaatg tgggaaactt gtattcaatg aaaaaagcag tttatgccat acttttccat 1680 ttcaccaatt ttcctgattc ctacacacgc caccagtttt gtcctcgtgg aaaatatagt 1740 tggtgcaaat actgggctct aaatcaaaag aattataaac ccaaatctac catacctctg 1800 tggataaaag atttaattct accaataatt ataaatttac aatcagatga attgttatca 1860 aaatgtttac atggaaacac acaaaatgct aacgaggcac ttaatggact tatttgggct 1920 cgtgtaccaa aaagagtttt tgtttctaaa tcaacactgg aaatgggaac gtattcagct 1980 attttagcat acaatgatgg tgcaaagggt gttattagtg tttttagaca ttttggttta 2040 catggtaagg tatcaaatat ttcagcgact tctaaaaata aaaaaaggat tagtcaaatg 2100 aggtgcaagt catcagagaa gggaaaaaaa agaagaaaaa ctcttagagc cattaaaaaa 2160 ggtcacctag atgaggagaa agcaaaagaa tgtaatgaca gttatgtttc aggaggattt 2220 taaattaata ttgtaaatat tatatcaaga atttttcaat gctgtttatc tcagttcgcg 2280 tttttttatt tttgcttaaa cttcaaaaca cagtttcttg agttagcatt gttcatttga 2340 tctgaaattt tcagtacttg ctcgaaatac ctataaaatt catttaacag atggattttt 2400 ttttatgatc agtaattttt ttttttttta gtcagatctc tctaaaacgt caaaaaataa 2460 ataaaatatg aggaaaaact acataactca tcatttactt tgatatttaa aaaatctatc 2520 tgttaaatgg aataaaatag tgtttagagt aagtgtacaa aatttcaggt catataatga 2580 tggctagcct tagatattat gttttgaaat ttagtccaaa aacgcatttt ttaatagttt 2640 ttccgccatt atggtcctaa tgctaattaa aaaaaaaatt tttttttttg ctttttttta 2700 tttaaatcat ttgagtaaac tgtatttaaa agcattttta ttttggaggt ttatttcatt 2760 tttttattta tttggtgttg taccacc 2787 // ID DNA8-4_CQ repbase; DNA; INV; 4916 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4916 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 81-81 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >99% CC identity. CC 8bp TSDs. XX SQ Sequence 4916 BP; 1769 A; 768 C; 776 G; 1599 T; 4 other; caggggtgtg tttgatgggc ttgagctagg tagatgcgca tttggctgag ggcgcctaca 60 gtgcatatat gatttttagg gcgctgacag tcactaactg atatgtgagg cgcttatagt 120 ggaaacgtat ttttaaactt aaaaataaaa caaaatccta aagtacctaa acatacacat 180 ggttcttgat ggttaataaa caaagcgtag attaaattaa aaattaaaaa aaaawtatgg 240 taaatttacc tattttaaca aagtagtaca tatataactt aagtttaatt tttgggatta 300 aagattacta tttttgcata aattttattt ttggtacgat ttctgctgat attttttttc 360 tgtgattttg aatcaatatt attagaaatt aaaaaaagtc ttgtcgcaaa aagaacgata 420 cacgcctgtg aatccgttgc cgaaaccgca aggtttaaga gggctagatt ataaggtgtt 480 acgtcgattc aacattatca gaaataaaag ttattttatg tttttgattt tttttttttt 540 cattcaaaaa tgtaagcacc ttcccgcaaa tttattttag gaaaacataa taaaaagaat 600 ttcaaaaatt ctttaacggt tagtaaggta attcgcttgg ctaaacactt gcttgaaagt 660 gtcaaagatt tcaagaacct ataggtttga aaacattatt tagtaaactt caaatataag 720 atcaaaacag acaaacagtg acttttgccc ttaacataca caaacttcac attatttttg 780 aaaattcata aaattgactc taaacctagt cgtaaaccct ttctttgtcc cgtcctactc 840 gataatgaaa atgcccatcc cccaacaaat aaaaatgttt tgtctgcagc gtggaaaaac 900 gcaacccatt gcaatgaaac aatagaatct tttagttatt taaaattttt aaggtcgcac 960 ctatcataaa agataactct atctacacat aattaaattt gatccatttt ggaattgata 1020 ataaaaactg actctcacgt attcaatcsa gaagtcagaa ttatatatta acagggtatc 1080 cagatccacg gtaagtttcg gagaacaaat ctaagataaa agtgaatatt ttttatcgat 1140 tttaatgtat atttctatgg aaaaatcaca tgaaattgat agaaacacgt acattcatat 1200 ttatcaaatt tgtcatgaac atgcctggaa tgacttgaac gggaccatcc ataaaccacg 1260 tggtcacttt aggagggtat ggtgattgtc cacgatccat acaaaaaagt tttttttgta 1320 tggacaattg tccacaaggg gaggggggag ggtggtggag gtaacagatt cccaaaaaag 1380 tgtccacgtg gtttatggat ggtcccaact tgaaaataac atataaaaaa tccttattca 1440 gctatccgaa aaacacaatt acttaataag gaaaggaggc agaattctta aaataaagga 1500 atttcgtact ttttttttat ttcagtgtat ggcttttgaa ctttaacttt gcaatcaaat 1560 cggagaaaaa agattgacta gctctcaata gaataatctg atgcgattta gtattacata 1620 aagcgtccaa tttcccaggg ttacaaaatt cccgggaaac ggtaaatttt aaatttatcc 1680 aaaattgttc ttatattggt tctggttaat attctttaac aaaattgtat aaaatagcaa 1740 cataaatgat caaattaagt gcgagggtca attaatgtct taacttcttg caaaaacatt 1800 ttcaagaaaa tatattgata ttctaaatct aaagactatt tctaaattta ctagtatttt 1860 tttttttttg cagtgatatt tttccaatca cagtcttttg acggcgagta aaatgaatcc 1920 atgcttcaaa accataacat tgctatacat tatggttctg tgaacagttt gagagaatat 1980 gataaagaat aatattgagc cgacgttgaa atgaaaataa aaaataatga agaaaatatg 2040 gatttgttgg caaatctttt ttcttgaaca aatcttactt ctttgagctc ataaataaat 2100 agataagcat tggattcacc ttaaatatca gctttttcac ttgaaatatc attttcataa 2160 aatcaccttc tacataatac ggagtagttt tttttaatga ttcagcttaa taaatgattt 2220 cgtctaaaag aattttcata gcaataaaag aagttcagtt catattaaat ctcttacgcc 2280 cttgttagct cgagtttttt tttcttgata ttaaactgtt aacacttttt aacaaatttc 2340 ccaatttttt tttatttgga atgaccctct ctgagacatc gattgatatt tttttcaatt 2400 ttcatccaat ttggccgtgg taaacaaaac cgtaaaaaaa cataatttga tcttggcaga 2460 acaggatacc taagtttcaa aggatataac aacaattttt gtttaaaaag ctaaccagaa 2520 ttataatagt taattttcat ttcataatat cgtaattttt gttgctcaac gaataaacaa 2580 gatcccagtt gtgaaagctt cagaaactaa tattttttga aatatttata ctttgatatg 2640 aaattttcag gcgaactgaa tggttataag aacaataaat cgaaatgaat aaaataaaat 2700 taagtttttc attactattt ttttgaaaat ttaaaaaatg tgccaaaaag ttaagaaaat 2760 tatgaaacta tattatgtgc aattatcgtt actattaacg gcattgcttt attgagacaa 2820 aaaattgata tcaaaatttt atttaatttc tttaaaaaaa taaatatgag cattcaaaat 2880 gtctcagtta acgcttccta ttgtgtaact aagcgaaatt cgcatatctc agctagtaac 2940 gaacaaatac tgatatttta ttatgagaac ttgttccgaa aattgttctt agagatctaa 3000 ttgataaagt ctagattata caaagtggaa tcaaattaga taatttagaa gaatatgtgc 3060 aatttgcata gattatgtat atgtattagg atgtaacaaa aattacattt tggcggacat 3120 tcagaggttt gttccggtgg tcatactgag ctcaaattcc aaatatgagc ttgattggat 3180 gtaacaggag ctggtgcttt gcatttgaat tttaaatggg atttaacccg taaaaagatt 3240 ttttttttca aaaatgtaat tttttaaggc acttggccac tgaagcgctt attttcaaca 3300 tcactggcgt gtaggcccga ttcttgcaca tcttttggta tatataacat tgatttttgg 3360 ggcaccccgg agctcggtac agaccttaaa agtttggcat tttttcgaaa aatcgatccc 3420 agtataatca aatagtatct cggacgccgc gagatgcgac gtatatcttt tttacgattc 3480 agtagccaaa gtgcttaama aatataataa gtatattgtt tcaatcaatt ttcagtgact 3540 ttccaaagaa aaatcttata agatcgttat aaaattctga gattcttaga agaagttgca 3600 cttggcgcta aatattatat ttcaatttaa ccctcaaaat ttcgggaaat agactttctt 3660 acaaggtgca aaatatcaga cctaccttat ttatctagga gaactctaaa taatcacacc 3720 gtgtacattg tacactgtat gacttaataa aaatcaaact atgtttcatt tcccatatca 3780 gttctcgatc tataaagtaa acgaaatcat ttagtaaagt cctctactag aataaagttg 3840 ccacatagag cgatttattt tgagtcccta tcacttcaaa aaaactttaa ccagaagcga 3900 agatctcttt catcaaaaaa taaaacagca ataaaaaaat taatattaat ggtactgatt 3960 tcaaaagtat tctttttttc aaagcttmgg ggtaaatgtt ccgaaaaact ctacccgtca 4020 aaaatcatcc tctgttagag gattcaaaga gtaagtcagc ctacacaagt catctgttgg 4080 aaacatgttg acaaagcgat gcataaagtt tgcatgccct gtgaggctgt tgctgcgaag 4140 tggttgttca agaatgctac accgttaaat actgtaagca cagaattaaa ggatgaatag 4200 cacgtattta caatatgaaa gatgataatg ttctcataaa cattgatgat cgtgccaaaa 4260 aaataaaacg cattacgcaa atgaatgacg aattagattt tgcgcacttc caaacaaata 4320 aaaaataaat caatgcgatt gtgcttttta aaacgagctc caacattttt accaattcgg 4380 agtatcgaat cacctgaagg gtaggcttac tccttcaatc tgagaaagtc atttgaaatc 4440 attggtctaa atttgataca ttttggtgag cgagtcggct tctctcgctc ttggggaaac 4500 atatagttgc gtgcgcacaa caaccaaaaa tcatttagtc aaataaaaaa aaaacaacca 4560 aaccacaaaa agaaacttat gcatcgcaag agtgacaaag ctaaatttaa ataaaacaaa 4620 caacaaacca tcgcgcagcg atcgcgcatc gacttagcga ctttgcgact gactgcgaca 4680 gcactacctt ggaacattga aatgtttggt tgactttcca gaaagagtgc atatgacttg 4740 aaaaaaagaa aagggcgctt caaaattgag ccaagaaatt ggtgaaactt ggtgcaagcc 4800 cttttatggt caaaaatatc gtttaacttg aatatttttc cttaaaaaat aattaaattt 4860 aagcaaacag ctaagtagat gtcatctact caaatctacc cataagcaca cccctg 4916 // ID piggyBac-2_SM repbase; DNA; INV; 2426 BP. XX AC . XX DT 29-MAY-2008 (Rel. 13.05, Created) DT 29-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-2_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2426 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 521-521 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-2_SM is a young family of piggyBac transposons, CC characterized by 13-bp TIRs and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of ten copies (they are ~96% identical to the CC consensus). XX FH Key Location/Qualifiers FT CDS 548..2185 FT /product="piggyBac-2_SMp" FT /note="piggyBac transposase." FT /translation="MPEAATDQDSDASDDEVMCDPGHLPRRILLSKVLNND FT VDIDQSEIEFNPPSTSTGRGSKRRKKVVYMWHTDKSKVANLVPDFSETREV FT VDAPKTNPLEFFEERFWTEDWINVLCEQSKNYAHQKGIPCSDVNADNMKLF FT LAILILSGYNKVPNRRLYWSESPDTQNKLVMNSMRRDTFDQIMRCLHINDN FT MKMDGDRFYKVRPLFEHLNNSNKDEKCGEYYSIDEIMVPYYGRHGDKQYIR FT GKPVRFGFKLWAICTSDGFLHNVEPYCGSHTRISDRGFGLGGNVVLEMIER FT VNIKPGQHIVFDNFFGSVALLKELSQKKIAATCTLREDRLSGAPLRPRKVL FT EKEERGTSDEAFTSCISVVKWKDNKVVSVASNKLRSDPAKKAKRWNRVDKK FT HVEVDLPHSIHVYNQHMGGVDVFDQQVAAYRCRIRSKKWWWPLFAWTVNAQ FT VVNGWKLFREHEKKITLLEFSRQLVISTLTKYGKPKAKPGPVPMPVVNTVR FT YNSQHHWTYKGKSPYNRCRECGSRTIYFCSVCNIPVHPECMKTFHTKKET" XX SQ Sequence 2426 BP; 775 A; 449 C; 513 G; 689 T; 0 other; cccattaatg ccgcttggca agtatacttg ccacacaaaa aaatgtgtaa ttttcaatac 60 tatgaatgtg agggtttgga ttgaagtcta acggtaacga gacgagtact gtacgtgttc 120 agaaccagtt acagcggtcg gctgtgcttc acgcctgctc agttgacctt ggactttgct 180 gctgtgaaca atggcgttac tacgtcagaa aaaaacgtaa gttactgctt atatctgtta 240 gctatgattt actactatac tgaaaactgt tgttaaaaag ttgttctaat gatatatgta 300 tgcacttaca tgattctaat ctcttacgac cactatattt tactatctat atttgtggca 360 agtgtacttg ccactcggta aaaaaattat ggaatcgcta tttttccatt atatcacttg 420 acgattattg gtaatatatt ttaaccctta tttcaggttg tcacttgacg aaattcatga 480 tttcctgtta gaagaagatg aagaagagca taatgaagtc cgagtagcaa tagagccccc 540 tgaagagatg cctgaagctg caacagacca ggatagtgat gcttcagatg acgaagttat 600 gtgcgatcca gggcaccttc ctaggagaat tttactctct aaggttttga ataatgatgt 660 ggacatcgat cagtctgaaa ttgagttcaa tccaccctca acatcaactg gccgtggatc 720 aaaacgacga aaaaaggttg tctacatgtg gcatactgat aaaagtaagg ttgcaaatct 780 agtaccagat ttcagcgaaa cacgtgaggt agttgacgcc ccaaaaacca atcctttaga 840 attttttgag gaacggtttt ggacagaaga ttggatcaat gttttgtgtg aacagagtaa 900 aaactatgct caccagaaag gaattccttg tagcgatgta aatgcagaca acatgaagtt 960 atttctagcc attctcattc tctctgggta taataaggtt cctaacagac ggctgtactg 1020 gagtgagtcg cctgacaccc aaaataaact tgtgatgaat tcaatgagac gtgatacctt 1080 tgatcaaata atgagatgct tgcacatcaa cgataacatg aaaatggatg gtgatcgatt 1140 ctacaaggtg cgtccgcttt ttgagcactt gaacaattcc aataaagatg agaaatgtgg 1200 tgaatactac agcattgatg aaatcatggt tccatattat ggaaggcatg gtgataagca 1260 gtatattaga gggaaacccg tgagatttgg gttcaaactt tgggcaatct gcacatctga 1320 tgggttttta cataacgttg agccatattg tggtagtcat accagaattt ctgatagggg 1380 ttttggtctt ggtgggaacg tcgtgctaga aatgattgaa agagtgaaca taaaacccgg 1440 tcagcacatt gtatttgaca atttctttgg ctcagttgca ttactcaagg aactttcaca 1500 gaaaaaaatc gcagccactt gtactctaag agaagaccga ctatctggtg cccctctaag 1560 acccagaaaa gtcttggaaa aagaagaacg aggaactagt gatgaagcgt tcacaagctg 1620 catttcagtt gtaaagtgga aagacaacaa agtggtaagc gttgcctcca acaaacttcg 1680 ttcagatcct gcaaaaaaag cgaaacggtg gaatcgagtt gacaagaaac acgttgaagt 1740 tgacctaccg cactcaatcc acgtttataa ccagcacatg ggtggcgtgg acgtatttga 1800 tcaacaagta gcagcatacc gatgcagaat aagatcaaag aaatggtggt ggcctctttt 1860 tgcctggact gtcaacgccc aagttgtaaa tggatggaaa cttttcagag aacatgaaaa 1920 gaaaattacc ttattagaat tctcacgaca actagtgata tcaactctga ctaaatatgg 1980 gaaacccaag gcgaaaccag gtccagttcc tatgcccgtt gtgaatacag tgcggtacaa 2040 cagtcagcat cattggacct ataaaggcaa aagtccgtac aacagatgcc gtgaatgtgg 2100 ctcccgtacc atttattttt gcagtgtgtg caacattcct gtacacccag agtgtatgaa 2160 aacctttcac accaaaaaag aaacctagga aagacaatat atggactgat tctagaagaa 2220 aatccctcac tgtaactttg atttctcatc aactgaatat tattatttta ttgccgggtg 2280 gcaagtatac ttgccggtct gtatttcggt aacaaataca atatttcatt ttttttcttc 2340 ataattactt aaaggagact cgaataaatc agtttagttg aaaattgcgt tttattaagc 2400 tttgtagaat tatcggcatt aatggg 2426 // ID Transib4_DP repbase; DNA; INV; 1334 BP. XX AC AADE01007081; XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 13-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Transib4_DP is a family of autonomous DNA transposons - a partial DE fossilized copy. XX KW Transib; DNA transposon; Transposable Element; KW Interspersed repeat; transposase; Transib4_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1334 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR GenBank; AADE01007081; Positions 1 1334. XX CC Transib4_DP is a family of autonomous DNA transposons that CC belongs to CC the Transib superfamily. The complete Transib4_DP is not obtained CC yet. CC Its partial copy encodes remnants of the Transib4_DPp CC transposase. XX FH Key Location/Qualifiers FT CDS 1..1305 FT /product="Transib4_DPp" FT /translation="VKFLTADLTLSYGFDGSSGQSNYNQGYSGDEAFDMDS FT SILATTVTPLRLVDSSGNILWNNRTPQSIRFCRPLRLAFMKETKECILKEN FT SDLKEEIKHLRPCKMIMEDGKFVYISFNLQLTAIDGKVLNALTGTKSTHAC FT PICGALPSDFINNKNFQCEKFLPILSNLKYGLSPLHCWIRMFEFILHAGYK FT CNIKKWKICNQSDKMEVYERKRHIQELFWEQLSLKVDRPKTGGFGSTNDGN FT TARRAFSDPNIFSKITGIDEQLISNLRIVLICLSCQLPINVSKFKQFCYKT FT AEIIIDKYPWLPMTATVHKILVHSSDIMESTVLPVGYFGEEGAESRNKLYK FT SDRLHHARKCSRMHNFTDVFHRAMDTSDPIISSVNLDRRIQRQNRINLPTE FT VLEMLSSEGSPLEDHGRNETDYDDNFGLQWSMEVENEEHEA" XX SQ Sequence 1334 BP; 465 A; 222 C; 262 G; 385 T; 0 other; gtaaagtttt tgactgcaga tttgactttg agttatggtt tcgatggtag ctctggacaa 60 tcaaattaca accaaggtta ttctggagat gaagcatttg atatggactc cagcatttta 120 gccacaactg ttacaccatt aaggctagta gattcatccg gaaacatact gtggaacaac 180 cgaacacctc aatctattag attttgtcgg cccttaagac tagcctttat gaaagagacc 240 aaggagtgca tattaaaaga aaattcagac ttaaaagaag agataaaaca cttgcgaccc 300 tgcaaaatga ttatggaaga cggaaaattt gtttatataa gtttcaattt acaactaaca 360 gcaatcgatg gcaaagtgct taacgccttg acaggcacaa agtcaacaca cgcctgtcca 420 atatgtggtg cattaccttc cgattttatt aataataaaa atttccaatg tgaaaagttt 480 ttgcccattt taagtaattt gaaatatgga ttaagccctc tacactgctg gattcgcatg 540 tttgaattta ttttacacgc tggatacaag tgcaatatta aaaagtggaa aatatgcaat 600 caaagtgata aaatggaagt ctatgaaaga aaaaggcata ttcaagaatt gttttgggaa 660 caactttctc taaaagttga tagaccgaaa actggcggat ttggcagtac aaatgatgga 720 aatacagccc gccgtgcctt tagtgaccca aatatatttt caaaaatcac tggcattgat 780 gaacagctta ttagcaactt aaggattgtg ttgatatgtc tttcatgcca attgccaata 840 aatgtttcaa aatttaaaca gttctgctat aaaactgcgg aaataattat tgacaaatac 900 ccttggttac caatgactgc aacagttcac aaaattctgg ttcattcaag tgatattatg 960 gaaagcactg ttcttcctgt aggatatttt ggagaggagg gcgctgaatc tagaaataaa 1020 ctatataaat ctgatcggtt acatcatgca cggaagtgca gccgaatgca taattttaca 1080 gacgtgttcc accgagcaat ggatacatca gatcctatta tttcatcggt aaatctagat 1140 agacgtattc agagacaaaa cagaataaac cttccaactg aagttttgga aatgttatca 1200 agcgaaggta gcccattaga agaccacggt cggaatgaaa ctgattatga tgataatttt 1260 ggattgcaat ggagtatgga agtggaaaac gaagagcatg aggcttaaaa acatgtaggt 1320 ttgtataatc tggt 1334 // ID Gypsy-4_OD-LTR repbase; DNA; INV; 169 BP. XX AC CABV01000256; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_OD_; KW Gypsy-4_OD-I; Gypsy-4_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000256; Positions 2448 2280. XX SQ Sequence 169 BP; 36 A; 51 C; 23 G; 59 T; 0 other; tgactaccgc attcagattc tgcgtgccaa gtttggcttg gccccattat gtttccttcc 60 tttctgtcca cctctcctca atttctgaat ctctgttcac tctgctacaa accacgaaat 120 acacaacatc tcacttttac ttcgacttga gtctttttat cggcaagca 169 // ID TMSATE1 repbase; DNA; INV; 142 BP. XX AC M30656; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE T.molitor satellite repeat E1. XX KW SAT; Satellite; Simple Repeat; Satellite repetitive element; KW TMSAT1; TMSATE1. XX OS Tenebrio molitor OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tenebrio. XX RN [1] RP 1-142 RA Ugarkovic D., Plohl M. and Gamulin V.; RT "Sequence variability of satellite DNA from the mealworm Tenebrio RT molitor."; RL Gene 83, 181-183 (1989). XX DR GenBank; M30656; Positions 1 142. XX SQ Sequence 142 BP; 41 A; 26 C; 30 G; 45 T; 0 other; gaattctgta attcttgcgt cgttttactt cgaaatgtac aagttccacg acgaaattcc 60 gattcgcact tagtttttcg tgattctaca cagttgcgag cgaaaaaacg tatttagagg 120 aaagttagcg tcttggaaac ag 142 // ID BEL2-I_Dya repbase; DNA; INV; 7139 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_Dya; KW BEL2-LTR_Dya; BEL2-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-7139 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1012-1012 (2009). XX DR Genome; chrU; Positions 7629380 7636518. XX CC 'ATTAG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 2443..5997 FT /product="BEL2-I_Dya_1p" FT /translation="MGAVVEIPTWDEINSFLEERYRPLEAIEEVRLHTSNS FT QHTKEPRNPQPKRVKSVKSQVGPRVRSCDLCSRESHQVRQCPRFLQIKQKR FT LCLNCFAKGHNLRECTSRHNCFTCKGRHNTLLHKESPATSTSAAQVHNPDQ FT PIANTSVHFASNRQGILLGTAIVQVCHQGENFPARALIDSGSEGTFISEKL FT ATQIKLPCQAVRTTITGLNQTSTGVSQRMCHFQMGTAAKPMLKIDTTALVL FT PNLAGNLPTSSIDRSILGRLPNIPLADPLFFQPSQLDLLIGEDILPSVLLS FT GLKPNLCGSLLGQDTIFGWILTGPVSTATSRVSSFTTKISIESEPTMETIL FT TKFWEVEDLPCKVVKESDNLCEENFARTTIRSRDGKYVVSLPFKDAEHINL FT GHSRSFALAQFLRNETRLKRDPILKDTYDSVIKEYIDLGHMKPVPPSTDTV FT NFYLPHHAVFKPESTTTKVRVVFNASSPSSNGNSLNDILHPGPVLQSDLTL FT QILKWRTFQFVFNADITNMYRQILLNPAHTAFQRILFRDDRGDIRDYELKT FT VTFGVNCAPFLALRTLRQLSEDVHALYPRSSRIISDYMYVDDVLAGAHTKV FT EATLAIQELRTALSSAGFPLRKWTANERTLLKALPSDQLLSGDFLDIEEVS FT TTKTLGIRWNAKADEFNFVPTELVVNANYSKRNVLSQIAKLFDPAGWLSPF FT VVQAKILMQDIWLAGIGWDEFLPAELRQRWHDFLRSHSSLHKVRVPRWVQF FT RPGAQVQIHGFCDAFQKAYGAVIYVRIQYDGGISLSLLTSKTKVAPVKTVS FT LPRLELCGADLLADLWAAILPHIPFGRIETFFWTDSTIVLAWLNKPPCQWT FT TFVANRVTKIALNTDASQWSHVRSEHNPADLASRGVPAEELASSDLWWHGP FT SWLIQPQVSWPTSHEAVSDLELERRSVRCHLACPGWDLLVRFSNFNRMLRV FT MAFVQRFINRCRRIPTPSSTDLSSKELAHIQLMLIRQTQRADYPEEYNILQ FT SKGQLPSSSCILNLNPFLDADGIMRSCGRLTASDSLSYEERHPILLAYHSK FT LAELLVTFTHPISIHGGNQLMVRLIRTRYWIPKLRNLIKRVTTTCKVCVIH FT RKKVQSQLMGKLPQSRATYSRPFTHTGLDFAGPFDVKSYSGRACRITKGYV FT CVFVCFNTKAIHLEATSDLTTEKLGG" XX SQ Sequence 7139 BP; 1761 A; 1846 C; 1582 G; 1950 T; 0 other; ttttgtggtc cttcgtcagc cggagtactg cacgccattt tgttcaattg taagccttgg 60 ccggcagaag tttttacatc gttccctgca tgtgtgtaca tatgtgtgat agttctaatt 120 ctacatttgc ttccaaaact ccaagtgtca tcaatttcct acattctttt attcttggca 180 agtgatcgtt tttttttttt gtttttgcat aaacctacat acatacatta cacatttggt 240 tccagaattc caattttccg attccgaatt ccgattttct acattcttct attctttgca 300 agtgatcggt ttttttttgt gcataaccct acaaacatac atacattaga catttggttc 360 cgaaattcca agtgcatcga ttgtctacag taatttattt cttttgcaac tgatcgattt 420 gcaattaata gttcccaatt gtccatatta acatataaac atacataaac atacatacat 480 acattagaca tttggtttct aaaattccaa gtgtcatcga ttgtctacat taatttattt 540 tgttcaactg atcgtttttt tgcataaaca aatattaatt tgtgcaatta atagttccca 600 attgtccata gaaacataaa catacataaa catacaacac atacacattt acatattcct 660 acatttactt tggccacagt aacagtgtcc ttacactgcg ttacatttct acatttcgca 720 tttctggtgg acggtactgc tacatttggt ttctaaaatt ccaagtgtca tagattgtct 780 acattaattt cttttgttca actgatcgtt tgtttgcata aacaaatatt aatttgtgca 840 attaatagtt cccaattgtc catataaaca taaacataca taaacataca acacatacac 900 atttacatat tcctacattt acttcggcca cagtaacagt gtcttcacac tgcgttacat 960 ttctacattt cgcgtttctg gtggacgata ctgctacatt ctaaattttt tcggacggta 1020 ctcaaatcac cgtatggttt cggttttccg tcagtgcaca tcagtaccat cccgcatttc 1080 ctacattccg catttctggt ggacggtact actatattct gaattttttc ggacggtatt 1140 caaatcaccg tattgtttcg gttttccgtc agttcacatc agtaccattc cgcttttctc 1200 cattccgcat ttctggtgga cggtacttct acattctgga ttttttttgg acggtactca 1260 ggtctgcata tttgttttac ggtttccgca aagtaaactt tgcgtcagat caaacgaccg 1320 tttcggatag ttgcatctcg acgttagtta actgtatttg gatcgtttta gtggcttgag 1380 tacagagatt aacacgtgtc tactattcca ttcctgcagc aaatccttgc taataatcct 1440 aaaataatta caagcatcat tctccagcca atttattcgg aagaatcaag tcaccagttt 1500 acacggttgt cctctattcc tttttctggt attaattttg cgcttattct caaacaacta 1560 taatgatgtc tgaagcaggc cagaagcggg atcctagccc aaggtacgca gaagccggac 1620 ccgctgggaa tttgagatca aaaactagaa cctccttaga cgatgcggtg cttgatttaa 1680 taaccaccag cgaccggctt agcaaatttg aaagccagat ttgtgcggca tcctcgtcct 1740 ctgccgagcc taatgctttt acactaaaga tccgcctaga ccaaatccag tccctctggg 1800 acaaggttga gcgggaatac gaagcagtct cccgtctagc ccttcacgaa ggaagtggcg 1860 aaagcggcag cttcagagca aatatgagaa ttgctacttg gtttatgaga ggtgtgcttc 1920 ccaattaaac gaacatattg cccgtctgtc cgcccctagt tctcagggga gttcaaatcc 1980 gtccgttgag acgttctcga agggttgcag gcttccgcca tgtgacacgg aaatcttcac 2040 tggagattgt ctgcggtggc ccaccttccg ggacctattc acagcgatct acatcgataa 2100 ttccaggctg actccagtgg aaaaattatt ccacttaaac gccaaaacta gcggcgaagc 2160 tcacgctatt gttagcaagt ccccccttac aaattctggg ttccaatccg catggtctgc 2220 ttttcaagag cgttttgaaa acaagcgact cttagtaaat agccagttga aggttctgtt 2280 cataagcata agcaccgaat caggaacggc cttgaaggat cttcagagca cgattcaagg 2340 gtgtttgatt gccatggaac actccaatat ctccacagaa aattgggatt gcattctgat 2400 tttcttctgc tcttccaagt tgccaacagt cactctctca ttatgggagc agtcgttgaa 2460 attcctacct gggacgagat taattcgttt ctcgaagaga gatatcgccc gctagaagcg 2520 attgaagagg ttcgacttca tacttccaac tcacaacata ctaaggagcc caggaaccca 2580 cagccgaaaa gggtaaagtc ggtaaagtcg caagtcggcc cacgtgtacg ctcctgtgat 2640 ctctgctcca gagagtcaca tcaagttcgg cagtgtcctc gtttcctcca gatcaagcag 2700 aagcgattat gccttaattg cttcgccaaa ggacacaatc ttcgagagtg cacaagtaga 2760 cacaattgtt tcacgtgcaa gggccgccac aacacgctgc ttcataagga aagtcctgct 2820 acttccacca gcgcagcgca ggttcataat ccagatcagc ccattgcgaa cacttctgtt 2880 catttcgcct ccaatcggca gggcatcctt ttaggaacgg ccattgtcca agtctgtcac 2940 cagggagaaa actttccggc acgggctttg atagactcag gctcagaggg caccttcatc 3000 tccgagaagc ttgcaacaca aatcaagctt ccttgtcagg ccgtgagaac tacaatcacg 3060 gggctgaacc aaaccagcac aggtgtctca caaaggatgt gtcatttcca gatgggcacg 3120 gcggcgaagc cgatgctcaa gatagacacg actgcgttgg tgttgcccaa tctggccggg 3180 aatctgccga cgagttcgat cgatcggagc attctcggaa gacttcccaa tattccgttg 3240 gctgatccac tctttttcca gccctcccag ctcgaccttc taatagggga ggatattctt 3300 ccgtctgtcc tcctctcagg cttgaagcca aacttatgtg ggtcactgct agggcaagat 3360 accattttcg ggtggatttt aaccggccca gtttctacag ccactagtag ggtttcgtcc 3420 ttcacaacaa agatatccat agaatcggaa cctaccatgg agacaattct cacaaaattt 3480 tgggaggtgg aggatcttcc atgtaaagtg gtaaaggagt cggacaactt gtgcgaagaa 3540 aacttcgctc gaacgactat caggtctcgg gacgggaaat atgtagtatc cctacccttc 3600 aaggatgctg agcacatcaa cttggggcat tccagatcat tcgctctcgc acaattccta 3660 aggaatgaga ctcgcctgaa aagggatcct attctcaaag atacctatga ctcagtgatc 3720 aaagagtata tagatcttgg ccacatgaag cctgtgcctc cgagtacgga cacggtgaat 3780 ttttatcttc cccaccacgc ggttttcaag cccgaaagta ctaccacaaa ggtccgtgtg 3840 gtattcaacg catccagtcc ctcgtcaaat gggaacagcc tgaatgacat cttgcatcca 3900 ggaccagtac ttcagtccga tctcaccctc cagatcctaa agtggcgaac attccaattt 3960 gtctttaatg ccgatatcac caatatgtat aggcagattc tcctaaaccc tgcacacacc 4020 gcttttcaga gaattctctt tcgagatgat cgtggggaca tcagagacta tgagcttaag 4080 actgtcacgt ttggagttaa ttgtgccccg tttctagctt tacgaacgtt gcggcagctg 4140 tccgaggatg tccacgccct atatccacgg tcgtctcgaa taatttcaga ttatatgtat 4200 gtggacgatg tgcttgcagg agcccacacg aaggtcgagg ctactttagc cattcaggag 4260 ttgaggacag ctctgagctc cgcaggattt cctctccgca agtggacagc aaatgagcga 4320 actctcctaa aggcacttcc ttcagaccaa ctgctctcag gtgacttcct cgacatcgaa 4380 gaggtgagca ccacgaaaac gttgggaata cggtggaacg ccaaggcgga tgagttcaat 4440 tttgtcccga ctgagcttgt tgtcaatgcg aattattcca agcgcaacgt cctatcgcaa 4500 atcgcaaaac tgttcgatcc cgctggatgg ttgtccccat ttgtagtcca ggccaagatc 4560 ctaatgcagg atatatggtt agccggcatt ggatgggacg agtttctccc tgcagagcta 4620 cggcaacgct ggcatgattt ccttcgcagc catagttccc tccacaaggt tcgcgtcccg 4680 cgttgggtac aattccgacc gggagcacag gttcagatcc acggcttctg tgatgcattt 4740 cagaaggcct atggggcagt gatttatgtt cgcatccaat acgacggagg catttcgttg 4800 agtctgctca cgtcaaagac taaagtcgcc cctgtcaaaa ccgtttcact cccgcgctta 4860 gaattgtgtg gtgcagatct tcttgctgac ctgtgggctg caattcttcc tcacatcccg 4920 tttggtcgaa tagagacgtt tttctggact gattccacca ttgtgttggc atggttgaac 4980 aagccaccat gtcaatggac caccttcgta gcgaatcgag tgactaagat tgctctaaat 5040 acagatgcca gccaatggtc acatgtgcgt tccgagcaca atccagcaga tctagccagt 5100 cgtggtgtcc cggctgaaga gctggcaagt agtgacctct ggtggcatgg cccgtcgtgg 5160 ctgattcagc cgcaagtttc ctggcccacg tcgcacgaag cggtatcgga ccttgaactc 5220 gaaagacgat ccgtccggtg ccatttagcg tgtccaggat gggatctcct agtgcgtttt 5280 tcaaacttca atcgcatgct gcgagtcatg gcttttgtgc agcggttcat caaccgctgc 5340 agacgcattc ctacgccttc ttctacagac ttaagcagta aagaactcgc ccatatacaa 5400 ctcatgctta ttaggcaaac tcaacgggcc gactatccag aggaatacaa catcctgcaa 5460 tccaaaggac agcttccctc gtccagttgt atcctcaacc ttaatccgtt cctggatgct 5520 gacggcatca tgaggtcatg cggtcggttg actgcctctg actcgttatc ctacgaggaa 5580 cgtcatccta tccttttggc gtaccattcc aaactagcag agcttctggt cacatttact 5640 caccctatct ccatacacgg gggaaaccag cttatggttc gtctcatccg gacgagatac 5700 tggattccga agctgcggaa cctgatcaaa cgtgtgacca cgacctgcaa ggtgtgtgtc 5760 attcaccgga agaaggtgca gtcccagcta atgggtaaac tacctcagtc cagagcgacc 5820 tattcgcgac cctttacaca cacaggactc gacttcgcgg gaccgtttga tgtcaaaagc 5880 tattcaggtc gtgcctgtcg aatcacgaag ggctatgtct gcgtcttcgt gtgcttcaat 5940 acgaaggcca tccatttgga agccacttcg gacttgacca cggagaaatt agggggttaa 6000 ggggtgttca aagggtttat cctggctctg atgacaaagt tcgcgtcgta gacgtccgca 6060 cggcccgtgg tatcctcgaa gggccaattt ccaaattagt tttccttccc gtagacaaac 6120 atgtccaatg tctatgcgtt cattgtccat aactgtgaga gtaaagcgta tccactaatt 6180 attatttttt ttgttctatc ctccagtcgt tctttttctc tctgcttccc tcatctcttt 6240 gcgccagcca tgccgtccca tcgtaaccaa cgtcgccagc tcatcgacag atgttctcgg 6300 ggaactcgat cgttccgata ccgagtctgt caggggattc atccgcttcg gacgtgttcg 6360 cggtttctcc ggctagatgc agagcggcgg ctccgtgcgg tcctatcaat aaatattgcc 6420 cgaactgctt ggcgcgccag cattccggag gctcgtgccg gagtgagtgt cgctgtcgag 6480 tatgtgggga cgctcatcac accttgctac atctccatcg gcgggaaaag cgagagcggt 6540 ctgctagcgc ggacaggccc atcgagacgc cagagccgaa ggccaacatc gagcaacccc 6600 gcctctcggc agtgctctcc cacactgcaa cggctatcct gcccaccgtc aatctgcggt 6660 tcgacatcgg ggacaagcgc ttcgatgtgc gggccatggt agatgcgtgc tctacttcga 6720 gccgcattga cggatcactg gccgaggcca tggccctccc tatcctggga gtcggtgatg 6780 agagggtgtg cagagccact ctcatcccga ttcatacaga gacgccacgg attgaggtgg 6840 tgttccacgt cgaggagcag ctgcgaatgc gaacttctgc ccgagagctg cctgacgccg 6900 ccaaaacagt gttcaccaac ctcatcctgt cggaccccag tttccacagg ccggctggta 6960 cctccgtcgt gctaggtgcg gatgtgtatc ccacgctcat ccaacctggt gtcatgcaca 7020 gccagaacgg gcgcatcgta gcacagagca cggttctggg atggatgctc tccggaacat 7080 acacccactg atgcatcgcg gattcttttg caatctgtgg cattgcaagg cgggcggga 7139 // ID MSAT-6_CQ repbase; DNA; INV; 140 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A satellite repetitive sequence family from Culex DE quinquefasciatus - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-140 RA Kojima K.K. and Jurka J.; RT "Satellite sequences from the southern house mosquito."; RL Repbase Reports 11(1), 618-618 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. XX SQ Sequence 140 BP; 39 A; 38 C; 29 G; 34 T; 0 other; ttttcattac cagtatcgaa aaccaagaag tttgataccc atattgccca aaatcgtatg 60 gttcgtaaat gtcctcccgg gagaacctcc ctgagggcac cggccactcc aggttgtggc 120 caatactgtc aaaatggcca 140 // ID CR1-11_HM repbase; DNA; INV; 4334 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4334 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1839-1839 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(751..1635,2093..3847) FT /product="CR1-11_HM_1p" FT /translation="MDSNEPVNFESTLFNFFQTNDFLLDSDSDPDLNYFSE FT AGALQNKCSYFYNHEIKEFLDRDNFNVIHINIRSLKKNFENFRNXIEETFN FT IFSIICLTETWCDSDDVNFNSNFHLPGFNLISLARKTNKRGGGVXIYVKEX FT LXFFTRLDMSISDGDKEVLTIEILTKKTKNIIISCCYRPPSGVIESFNSFL FT RNDIIKKSNHEKKFNYLIGDLNLNCFEYHVNNNIKRFYNDIFENGAIPLIN FT KPTRITQSTSSLIDNIITTDVFNESLKKGIIKSDISDHFPIFFSINTNNEX FT VTKTHSLCLKTIKFNDKFLYDKKEIANELNKYFVSVGPNLAKKIPKAINPI FT NNLFIPNTSQFNSFELSFEEFESAFKMLKPDKAVGPDDINGNIVINSYDII FT KDILFKIFSCSIKQGIFPDQLKIAKVIPIFKGGDLTNVSNYRPISILSVFS FT KVLERISYNKIYNHLSVNNLLYSNQYGFKKNNSTEHAILQFTRSITDSFEN FT SQFTLGVFIDLAKAFDTIDHKILFKKLKYYGITGNVLNWLKSYLNNRKQFV FT YVDVVSPVNLFDITCGVPQGSILGPLLFLIYVNDLHKASNLMTIMFADDTN FT LFLSHNNITTLFQNMNIELVKISDWFKLNKLSLNIDKTKWIFFHPNRKKHL FT LPSEMPLLFIDNILIKRVTVTNFLGIYIDENLSWKNHIENLCSKISKSIGI FT LYKARSVLNKHTLIQLYYSFIHCHINYANIAWGSTNKSKLXPLYRQQKHVA FT RLINFKDRFTHAKPLLFEMKVFNIYELNIFNVLCFMFKCKTNISPVSFHNL FT YXKKEKNKYILRNDNFIRQPVFQTNFGKFCISFRGAFLWNKIVLKNFKDFS FT QECNYRSFKQKLKKVIFSIDDISIYF*" XX SQ Sequence 4334 BP; 1718 A; 528 C; 530 G; 1523 T; 35 other; ggctaaagga ttatcattta ctcaaattaa agaattactg gaaattcatg aaaacacaat 60 attaaaaatm tttarcgata aatttgataa aatggaaaac aaaatgtaag tacttaaaag 120 mgaaaataaa attttaaaaa atgaaatagc agatattaaa agtgcaatga cttttcataa 180 atgaaaagta tgaaaaagag actaaagaat tatacgattt taaagaacat cttaaacaca 240 acatgaaagt aaatgaaaaa gctttaaaca atgatgaatt taaagacgaa cttgckgaac 300 ttgaagacag aagtcgaaga aataatttaa gatttaatgg aattgaagaa aaggaaaatg 360 aaacgtggga agaaagcgaa aaaaaaatta aagaantctt aaaakaaaaa cttggtatta 420 cgaacgaaat ataaattgaa agrgctcata gaacaggaaa aaagaatgaa gttgaaatga 480 aaaaacgaag aactattatt gtaaaatttt taaactataa agacaaagaa accgttttar 540 aaaaataccg attactcaaa ttatggaacg aaaaactata cataaatgaa gattttagcg 600 aaagaacaat ggaaatacga aaaaatytwt ttaaagaagc aaaagattta agagccaaag 660 gtaagtatgc taaagttgta tataataaat tagttacacg cgatttttaa aagaattaat 720 ttttttattt ttggatttcg aataactaaa atggattcaa acgaaccagt taattttgaa 780 tcaactttat ttaacttttt tcaaactaat gattttttac tggacagcga ttctgatcct 840 gatctaaatt attttagtga agctggtgct ttgcaaaaca agtgcagtta tttttataac 900 catgaaataa aagaatttct tgatcgggat aattttaacg ttattcatat taatataaga 960 agtttaaaaa aaaactttga aaattttcgt aatwttattg aagaaacttt taatattttt 1020 agtataattt gcttaacaga aacatggtgc gattctgatg acgttaattt taattctaat 1080 tttcacctgc cwggttttaa tttaatttca ttagcgcgta aaacaaataa acgtggtggt 1140 ggtgtkmtta tttacgtaaa agaaarcttg cratttttta ccaggcttga catgagtatc 1200 tctgatggcg ataaagaggt tttaacaatt gaaattttaa ctaaaaaaac taaaaatata 1260 attataagtt gttgttatcg cccaccatct ggtgtaattg aaagttttaa ttcattttta 1320 cgtaatgata taattaaaaa aagtaatcat gaaaagaaat ttaactactt aattggtgat 1380 ttaaatttaa attgttttga atatcacgtt aataacaaca ttaaaaggtt ttataatgac 1440 atttttgaaa atggtgcgat tccactaatm aacaaaccaa ccagaataac tcaatcaaca 1500 tcttccttaa ttgataatat aataacaact gatgttttta acgaatctct taaaaaaggc 1560 ataattaaaa gtgacatttc tgatcacttc cctattttct tttccattaa tactaataac 1620 gaaaamgtta ccaaataaaa atcaaatttt cgttaaacgt atttttaatg aagagaattt 1680 ggaatctttt aaggaacaat tatctctact tcattggagg cacattacca attcagatga 1740 tgctaactta gcttacaaca cattttttaa aactttttat gatatatatg atgttaattt 1800 tccaaaaatt aaaatacatt gtaaagcttt aaaaagtata aaaatttcac gtcaccatgg 1860 attactaaag gtttaagaaa atcwtcaaaa attaaacaaa aattgtacat taaatactta 1920 aaatcaaaaa cagatgaaag taaaactata tataaaaatt acgctaaaca atttgaaagt 1980 caaagaaaaa atcttaaaaa aaattattac aatgatttac tagagaaata taaattaaat 2040 tctaagcgca catggcaaat tataagggaa attacaggca gtawtaaaat gaacacactc 2100 actttgccta aaaaccatta aatttaatga taaatttttg tatgataaaa aagagatagc 2160 taacgaactg aataaatatt ttgtttctgt tggaccaaat ttagcaaaaa aaattcctaa 2220 agcaattaat ccaataaata atctatttat ccctaataca tctcaattta attcttttga 2280 attatctttt gaagagtttg aaagtgcttt taaaatgcta aaacccgaca aagcagtagg 2340 tcctgatgat atcaatggaa atattgttat taattcatat gatatcataa aagatatact 2400 ttttaaaatt ttttcatgtt cwattaaaca aggaattttt cctgatcaac taaaaatagc 2460 aaaagtgata ccaatattta aaggaggtga cttaactaat gttagtaact atcgtccaat 2520 ttctattcta tctgtatttt caaaagtttt agagagaatt tcgtacaata aaatttataa 2580 tcatctttct gtaaacaatt tactatacag caatcaatat ggattcaaaa aaaataactc 2640 tactgaacac gcaattctcc artttacaag aagtattact gactcttttg aaaaytccca 2700 atttacttta ggcgttttta ttgacttagc taaagctttc gatacaatag atcacaaaat 2760 tctttttaaa aaactaaagt attacggtat macyggaaat gttttaaatt ggttaaaaag 2820 ttatttaaat aatcgaaagc aatttgttta tgttgatgta gtttcaccag taaacttgtt 2880 tgatataacc tgtggagttc ctcagggatc watattaggg cctcttctat ttcttattta 2940 cgttaacgat ctccataaag cctcaaattt aatgacaatt atgtttgctg atgacacgaa 3000 cttattttta tctcataata acattacaac gttatttcag aatatgaaca ttgaactagt 3060 aaaaatttct gaytggttta agttaaataa actktctctt aatattgata aaactaaatg 3120 gatttttttt catccaaatc gcaaaaaaca tcttctacca agtgaaatgc ctcttctttt 3180 tattgataat atacttataa aaagagtaac agttacaaac tttctaggta tttatattga 3240 tgaaaatctg tcatggaaaa atcacattga aaatttgtgt agtaaaattt caaaaagcat 3300 aggaatttta tacaaagcaa gaagcgtttt aaataaacat acattaattc aactatatta 3360 ttcgttcatt cattgtcata ttaactatgc kaacattgck tggggtagta ctaataaaag 3420 taaactgraa cctctttatc gtcaacagaa gcatgttgca cgccttatwa attttaagga 3480 tcgttttact catgccaagc ctcttttatt tgaaatgaaa gtttttaata tatatgagct 3540 taatattttt aatgttcttt gttttatgtt taaatgtaaa accaatattt ctcctgtttc 3600 ttttcataat ttgtattmca aaaaagaaaa aaataaatac attttacgaa atgataattt 3660 tattcgtcaa ccagtttttc aaactaattt tggaaaattt tgtatttcat ttcgaggagc 3720 atttttatgg aataaaatag ttttaaaaaa ttttaaagat ttttctcaag aatgtaacta 3780 tcgttctttt aaacaaaaac ttaaaaaagt cattttttca attgacgata tatctatata 3840 cttttaattt tttgtttctt tttttaactt tatatttaac actgcattaa taaattttat 3900 aatttaacaa tttataaaca gttgtttaac acttttgtta aaggtatctt aaaactgtat 3960 aacaaaacct ttaacaaaaa gtttaacaat tgtgctaaag atatcttaaa actgtataac 4020 gcatctatgt atttataacg gatttctttt tctttttttc tttaaatttg tttatgtttt 4080 caattcttaa ttcttaatwa tttgttatct tcatgtatgt taagtatata ttcctattga 4140 aagtttattt attttgagaa tattttatta atacgaaata ttaacacgaa aattttttag 4200 cggttctcga cgacaagacc ttacggtctt ctacgagtwc ccgcgttctt ttatttctta 4260 atatcattta tattattata tcattgttaa cgaatattta ttaaatacaa agttgtaata 4320 aagaacaaaa aaaa 4334 // ID Gypsy-78_CQ-LTR repbase; DNA; INV; 243 BP. XX AC AAWU01003709; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-78_CQ_; KW Gypsy-78_CQ-I; Gypsy-78_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-243 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 536-536 (2011). XX DR Genome; AAWU01003709; Positions 41452 41694. XX SQ Sequence 243 BP; 54 A; 56 C; 61 G; 72 T; 0 other; tgttgggaac gaatggatca ccctgttcac caacggcatc gtagcaacac cgctgggctc 60 actcgtggca gttcagctgt caccggtcga tgcaggaaac agctcaactc ttagtggcgt 120 cattatctgg tggattgtct taccgcgtgg aacgcggact gttttaatcc ggctaaataa 180 agttttttta atgtgtagta atttacgttg cgcgttgtat tattgccacc ttaattgcgg 240 cca 243 // ID I-74_AAe repbase; DNA; INV; 8380 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-74_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-8380 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1345-1345 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 393..2072 FT /product="I-74_AAe_1p" FT /translation="ASMELAYGFPPLPGDPGGPIGGSGGISNRVNGEYTGP FT RYPGFMDRDGTAGPLQFLKMQAVSGSIPQDPFLLRLSVEKHLGGPIEGAYK FT ENRGISYVLKVRSQTQFNRLLKMKTLCDGTAIEIKEHDQLNQRKCVVSNYD FT SVGLTDDYLKVQLAAQGVKEIHRIKRKNADGVWQNTPTIILTIAGTVIPPH FT IDFGWSRCKTRNYYPSPMLCFHCWEYGHTGKRCPQQFRICGRCSQTHPEDR FT TINSSPGISTEAGNIPGVNTSATTNSVRPLCTEDPYCKHCKSSDHQVSSRK FT CPAYLREIDIQHIRVDEGTSYPQARKVYEARLASGSSKTAYSGVINASKDA FT ENAELRSVIKQLQEDAKLKDKRMEEMERRLRNTSVNDRIETTKKHGTIDDL FT IRQVAELSSTVNRLERALTKKDEIIEAQNKEIAKLRAADSDSNPSAFSIPE FT SEVSEEHINHKDLRLNPISKQIDEWFIGNSKNTGNDKETLKKGKSSATGKL FT DTMNESGHETDSSMKSADSRQSTTTINKRTHASGNSSGESSTGSPKTKRAT FT NTRKGKGGAKKKN" FT CDS 2127..7910 FT /product="I-74_AAe_2p" FT /note="endonuclease, 2 reverse transcriptases, and FT ribonuclease H." FT /translation="VQIFYIVFTPQSSVEIVKRKESTEQTQTASLELPSFQ FT TEDSTSGARDSRGPTSADATPQPELADNPRHPGATVVAGTHKGHCACTPEF FT PTYYPDSQGPAGAEVLAVPEPTDNLWHPESVGRGTQKGLMSYSPEDNVDIN FT IGVKPAPRRNPFRSCRKPNVYSPLTEAEATNAVCYDILHRTRYGQSVLITP FT GTSTECRYRTSIQQQRSSLPKRNMQKHHGQPNPAGKYNKTSTFTTHDRFAA FT RETAANSTPGHPGNPNPGGKSHPDPQLLSHSQSISAISHGDECHSQLVDGR FT QQPVRENTLSFSTHSQQRETASTFAIQWNINGYFHNLPDLEMLVHSMQPVV FT IALQEIHRATPSIMNNTLGKKYQWFSKSGSNIYHSVGIGVSADLAVHQINV FT DTDFPIVAVRLQWPFPISVVSIYLPNGKLHNLESQFNEVLNQIPEPMIILG FT DVNSYHRAWGSSSNNVRGSIIANIASHRNLTILNDGSPTFFRGHSESAIDI FT SLVSASITNRFLWSVDTDFRGSDHAPIYIVLENTICPETTRRPRWLFDRAD FT WSGFQTELANELDALSPASISDLTNLIYRAASNNIPRTSPTPGRRALHWWN FT QDTRNAVKARRKSLRNIKRLRKKLPEGHPDRALALETYQAARNTCRQVIRD FT AKGESWTKFLDGINENQSSSELWRRVNSILGKRRHKGVAIEIDGTTTRDPH FT RISNALANYFYGISSFQRYPNTFLKRHPSPETAIQQFQVPADRGQVFNSPF FT TMNELKYALKKANGKSVGPDDIGYPMLNNLPPSGKTTLLAMINKEWISGTL FT PADWKHSYIVPVPKNSGPSNIVSSYRPIALTSCTAKIMERMVNRRLMEFLE FT EHHELDNRQHAFRPGFGTGTYMASLGXILNDALKEGHHIEIXSLDLAKAYN FT RAWTPGVLKKLASWGITGSLLTFVKNFLDGRSAQVLVGNCKSNSFDEETGV FT PQGSVIAVTLFLVAMSGVFLVIPKGVFILVYADDILLIVTGMHPRSVRRKL FT QTAVTAVAKWAQEAGFEISAEKCARLHVCNIKHAPPKIPNTCRQVIRDAKG FT ESWTKFLDGINENQSSSELWRRVNSILGKRRHKGVAIEIDGTTTRDPHRIS FT NALANYFYGISSFQRYPNTFLKRHPSPETAIQQFQVPADRGQVFNSPFTMN FT ELKYALKKANGKSVGPDDIGYPMLNNLPPSGKTTLLAMINKEWISGTLPAD FT WKHSYIVPIPKNSGPSNIVSSYRPIALTSCTAKIMERMVNRRLMEFLEEHH FT KLDNRQHAFRPGFGTGTYMASLGQILNDALKEGHHIEIASLDLAKAYNRAW FT TPGVLKKLASWGITGSLLTFVKNFLDGRSAQVLVGNCKSNSFDEETGVPQG FT SVIAVTLFLVAMSGVFLVIPKGVFILVYADDILLIVTGMHPRSVRRKLQTA FT VTAVAKWAQEAGFEISAEKCARLHVCNIKHAPPKIPIKINKCPIPTKKSIR FT ILGVNIDRHLTFQDHFAKVKEACKNRVCMIRNITNRRTRSCRATRRRVADA FT VINSRLFYGIEITCREMENLVSTLSPIYNNTVRALSGLLPSTPASSACVEA FT GMLSFRHRATMTLCHRAINYLEKTRSGGQVCFLTEQANRALNDVASSTLPP FT VARVHRVGPRSWKARGPEIDMAMKSLFRKGGNPMQAKLLFIERIRSRYSNA FT DIRYSDGSKAKGTVGVGVFSPVWKAQYSLPGQCSVFSAEAAALFKAIAKPS FT DKPILVATDSASVLSALESPTAKHPWIQAIQNTINDRNRRITLTWVPGHTG FT IHGNEQADILANLGRSSRRLYQKVPAADAKIWIKHVIENAWAEEWRANQFL FT FLRKIKASTEAWEDRTNWKEQVVLSRLRTGHTRASHDFSSNSSFRRTCETC FT GTRNTVEHILYECPTLEHLRTLYQMGSIRNVLQNDQASETKLICFLKDAGL FT FALI" XX SQ Sequence 8380 BP; 2493 A; 2123 C; 1866 G; 1893 T; 5 other; cagtcgacag ctaactgtgt tcgcgaacgg tcgtgataca tttacactcg tccgctatca 60 ccgggtagtt ttcgcgcgtt agtgtttttc gacgcattcg aatcgtctaa gaaggcagat 120 tgaaaaagat ttgaaagtaa atcgataktg gccagtgaaa acctgcagtt ctgcatattg 180 ccctgaaagt gcttgatttc cgaacctgac gcttttcagg ctccgtacag ggcccctttt 240 tgaagtgagg ttagtgaacc acgtgtttta gctgcgaggt cgaaaagctg ctaccgcttc 300 gacgtaagag aagccgagat tgagagaagc tgagagagaa acccgtgagt aatttgagag 360 tacatataag gtaatcccct acccgccgct gagccagcat ggagctggct tacggcttcc 420 cgcctcttcc aggggaccct gggggcccca taggaggaag cggaggtata tcgaaccgtg 480 tgaatggtga atacactgga ccacgctacc ctggtttcat ggatcgcgac ggcactgctg 540 gaccactgca gttcttgaaa atgcaagcag tttctggttc aattccwcaa gacccgtttc 600 tattacgact ctccgttgag aaacatcttg gaggaccaat cgaaggagca tacaaggaga 660 atcggggcat ctcgtatgta ctcaaagtgc gaagtcagac acagtttaac cggctgctaa 720 agatgaaaac tttgtgtgat ggcacggcaa tcgagataaa agagcacgat cagttgaacc 780 agcgaaagtg cgtcgtctcg aactacgact ccgttggtct aaccgatgac tacctcaagg 840 tccaactggc tgcccaaggc gtcaaggaaa tccaccgaat caaaagaaaa aatgcagatg 900 gtgtatggca aaatactccc acgatcattc ttacgatcgc tggtacggta attccgcccc 960 atattgattt tggatggagc cgatgcaaaa cgaggaatta ctacccgtct ccaatgctat 1020 gtttccactg ctgggaatat ggacacacgg gaaagcggtg ccctcagcaa ttcagaattt 1080 gtggacgttg cagtcaaacc cacccggaag atcgaacaat taattcctct ccaggcatct 1140 ccacggaagc tggcaacatc cccggtgtaa atacctcggc tacgacaaat agcgtgcgac 1200 ctttatgcac agaggatcct tattgcaaac actgcaaaag tagcgatcat caggtttcta 1260 gccgaaagtg tccggcatat cttcgggaaa tcgatattca acacatccgc gtggacgaag 1320 gcacgtccta cccacaggct cgcaaggtct acgaagctcg tttggcatct ggcagcagca 1380 aaaccgcata tagtggcgta attaacgcca gcaaagatgc ggagaatgct gagttgagat 1440 ctgttatcaa acaattacag gaggatgcaa aactgaagga caagcgaatg gaggaaatgg 1500 aacgcaggct gaggaacaca agcgtcaacg accgaataga aacaacaaaa aaacatggga 1560 ctattgatga tcttattcgg caagttgctg aactctcttc tacagtcaac cgccttgaac 1620 gagccctgac gaaaaaggac gaaatcatag aagcacaaaa caaggaaatt gcgaaactgc 1680 gtgctgctga ctctgactcc aatccctccg cgttttcgat ccctgaatca gaagtgtccg 1740 aggaacatat caaccataag gatctgcgtc taaatccaat ttccaagcaa atcgatgagt 1800 ggttcatcgg taactccaag aataccggta acgataaaga aaccctgaag aaaggaaagt 1860 catcagcgac tggaaaactt gacaccatga atgaatcagg tcacgaaacg gattcaagta 1920 tgaaatccgc cgactccaga cagagtacca ctaccatcaa caaacgaacc catgcaagtg 1980 gaaactctag cggcgaatct tcaactggct ctccaaagac aaagcgggca accaataccc 2040 gcaaagggaa gggtggagcc aagaaaaaga actaagttct ccctacatat tgctcagtcc 2100 tccacaatac ggccaatacc acgtgagtcc aaattttcta catcgtcttc actccgcaat 2160 ccagtgttga gatagtaaaa agaaaggagt cgactgagca aactcaaacc gcatcattag 2220 aactaccaag ctttcaaaca gaagactcga catcaggggc cagagacagt cggggcccca 2280 ccagtgcgga cgctacaccc caaccggaac tggcggacaa ccctcgacat cctggcgcaa 2340 ctgttgtcgc cgggacgcat aagggacact gcgcctgtac cccggaattc ccaacatact 2400 atccggacag ccagggcccc gccggtgcgg aagttttggc tgtaccggaa ccgacggaca 2460 acctctggca tccggaaagt gttggaagag ggacgcaaaa gggacttatg tcctattccc 2520 ctgaggataa cgtggacatc aacatcggag tcaaaccagc ccctagaagg aatccattta 2580 gaagctgtcg caaaccaaat gtgtatagcc ctctcacgga agcggaagca accaacgcgg 2640 tctgctatga catccttcac agaacgcgct acggccaatc cgtcttgata acacctggta 2700 catcaaccga gtgccgatac cgtaccagta ttcaacagca aagatcttca cttccgaaac 2760 gaaacatgca gaagcatcac gggcaaccga acccagccgg taagtacaac aaaacttcca 2820 cttttacaac tcatgatcga tttgccgcga gagaaactgc agctaattct actcctgggc 2880 accccgggaa cccaaacccg ggcggtaagt cccatccgga ccctcaactg ctttcacata 2940 gtcagtcaat ttctgccatt tctcatggcg atgaatgcca ctcgcaattg gtggatggca 3000 ggcaacagcc agtcagagaa aataccctat ccttttcgac tcactcacaa caacgagaaa 3060 ctgcctccac tttcgctata cagtggaata tcaatggata tttccacaat cttcctgatc 3120 tagagatgct tgttcattcc atgcaacccg tggtaatagc actacaggaa attcatcgtg 3180 caacacccag catcatgaac aacactctag gtaagaaata ccaatggttc tccaaaagtg 3240 gctcaaacat atatcattca gttggtatcg gtgtctcagc cgacttagcc gttcaccaaa 3300 tcaatgtaga taccgatttt cccatagtcg cmgtccgact gcaatggcct tttccgatct 3360 cggtggtttc gatttactta ccgaatggaa agttacataa cctggagagt caatttaatg 3420 aggtacttaa tcaaattcct gagccgatga taattctcgg tgacgtcaac agctaccatc 3480 gagcatgggg aagtagcagc aataacgtgc gaggttctat tattgccaac attgccagcc 3540 atagaaacct caccatactc aatgatggat cacctacgtt tttccgtggc cattctgaat 3600 ccgcaattga catctctctt gtatcggctt ccatcacaaa tcggttttta tggtccgttg 3660 ataccgactt tcgtggaagt gatcacgctc caatctatat agtgttagaa aatactattt 3720 gtccagagac aacgcggcgg cctcggtggt tattcgatag agccgactgg tctgggtttc 3780 agacagaact tgctaacgaa ctggacgcac tatccccagc ttcaatctcg gacctcacca 3840 acctgatcta tagagcagca tccaacaata tcccgcgcac cagtcccact ccgggacgtc 3900 gtgcactcca ctggtggaac caggacactc gaaacgcagt gaaagctaga cgaaaatccc 3960 ttcggaatat taagcgattg aggaaaaaac taccagaagg tcatcctgac agggccctcg 4020 ctctggaaac ttaccaagcc gcccgcaata catgtagaca ggtgattcgt gatgctaaag 4080 gagaatcttg gacaaaattc ctcgatggca taaacgaaaa tcaatcatca tctgagcttt 4140 ggcgccgagt caatagcatc ttgggaaaaa gacgccacaa aggagtggct attgagatcg 4200 atggcacaac tacaagggac ccacatcgta tctcaaacgc tctagcgaac tatttctatg 4260 gcatttcttc atttcagcga tatcccaata cttttctcaa gcgacaccca tcccctgaaa 4320 cggcgataca acagttccag gttccagctg acagagggca agtttttaac tcaccattta 4380 caatgaacga gttgaaatac gctctaaaaa aggccaatgg taaatcggtt ggtccagacg 4440 acattggtta cccgatgctc aataatcttc ctccgagcgg aaagaccact ctcctcgcta 4500 tgatcaacaa agaatggatc tctggtaccc tccccgctga ttggaaacac agctacatag 4560 tacccgttcc gaaaaactct ggcccctcca atatcgtaag cagctatcga ccgatagcat 4620 tgacaagttg cacagcaaaa ataatggaga gaatggtcaa tagaaggctc atggaatttc 4680 ttgaggagca tcacgaatta gataaccggc aacacgcttt ccgtcccgga tttggaaccg 4740 gaacatacat ggcctctctc ggtcakattc tcaacgacgc actcaaggaa ggacatcaca 4800 ttgaaatagm atcgctggac ttggcaaaag cctataatag ggcatggaca cccggcgtgc 4860 tgaaaaaatt ggctagctgg ggcatcaccg gcagtctgct cacgttcgta aaaaacttcc 4920 tcgacgggcg aagcgcacag gtgcttgttg gaaactgcaa atccaattca tttgatgaag 4980 aaactggcgt cccacaagga tcagtgattg cggtcacgct atttcttgtg gccatgagcg 5040 gtgttttctt ggttattccc aagggagtct ttattctagt gtatgcggat gacatattgc 5100 taatcgtcac tggcatgcac ccgaggagcg ttagacgaaa actgcaaaca gccgttactg 5160 ctgtcgccaa atgggcacaa gaagccggct tcgaaatttc agcagagaaa tgtgctagac 5220 tccatgtatg taatattaaa catgctccac caaagattcc caatacatgt agacaggtga 5280 ttcgtgatgc taaaggagaa tcttggacaa aattcctcga tggcataaac gaaaatcaat 5340 catcatctga gctttggcgc cgagtcaata gcatcttggg aaaaagacgc cacaaaggag 5400 tggctattga gatcgatggc acaactacaa gggacccaca tcgtatctca aacgctctag 5460 cgaactattt ctatggcatt tcttcatttc agcgatatcc caatactttt ctcaagcgac 5520 acccatcccc tgaaacggcg atacaacagt tccaggttcc agctgacaga gggcaagttt 5580 ttaactcacc atttacaatg aacgagttga aatacgctct aaaaaaggcc aatggtaaat 5640 cggttggtcc agacgacatt ggttacccga tgctcaataa tcttcctccg agcggaaaga 5700 ccactctcct cgctatgatc aacaaagaat ggatctctgg taccctcccc gctgattgga 5760 aacacagcta catagtaccc attccgaaaa actctggccc ctccaatatc gtaagcagct 5820 atcgaccgat agcattgaca agttgcacag caaaaataat ggagagaatg gtcaatagaa 5880 ggctcatgga atttcttgag gagcatcata aattagataa ccggcaacac gctttccgtc 5940 ccggatttgg aaccggaaca tacatggcct ctctcggtca gattctcaac gacgcactca 6000 aggaaggaca tcacattgaa atagcatcgc tggacttggc aaaagcctat aatagggcat 6060 ggacacccgg cgtgctgaaa aaattggcta gctggggcat caccggcagt ctgctcacgt 6120 tcgtaaaaaa cttcctcgac gggcgaagcg cacaggtgct tgttggaaac tgcaaatcca 6180 attcatttga tgaagaaact ggcgtcccac aaggatcagt gattgcggtc acgctatttc 6240 ttgtggccat gagcggtgtt ttcttggtta ttcccaaggg agtctttatt ctagtgtatg 6300 cggatgacat attgctaatc gtcactggca tgcacccgag gagcgttaga cgaaaactgc 6360 aaacagccgt tactgctgtc gccaaatggg cacaagaagc cggcttcgaa atttcagcag 6420 agaaatgtgc tagactccat gtatgtaata ttaaacatgc tccaccaaag attcccatca 6480 agataaataa atgtccaata cctactaaga agagcattag aattcttggc gttaacatcg 6540 atcgacactt aacttttcaa gaccattttg ccaaggtcaa ggaggcttgt aagaacaggg 6600 tgtgtatgat ccgcaacatt acaaaccggc gtactcgaag ctgtagggca actcgtcgcc 6660 gagtcgcaga tgccgtaatt aacagtagac tgttttatgg gattgaaatc acctgtcggg 6720 aaatggaaaa tctggtctct acactgtccc cgatctataa taatactgtc agagcactgt 6780 cgggccttct cccctctaca cctgcttcct ctgcctgtgt agaagccggc atgctctcgt 6840 ttcgtcacag ggctacaatg actttatgcc accgagcaat caactacctc gaaaaaacta 6900 ggagtggtgg gcaagtttgc tttctcactg agcaggcgaa ccgcgcccta aacgatgtgg 6960 ccagttccac gctccccccg gtggcgagag ttcaccgtgt tgggcctcga agctggaagg 7020 ccagaggacc cgaaatcgac atggccatga agtctctatt tcgtaagggt ggaaacccga 7080 tgcaggccaa gctgctgttc atcgaaagaa ttagaagccg ttactctaac gcggatatcc 7140 gatactccga tggctctaag gcaaaaggta ccgtaggagt aggcgtattc agtcctgtct 7200 ggaaagctca atacagtttg cctggccaat gctccgtatt ttccgctgag gcagcagcac 7260 tgttcaaagc aattgcgaaa ccaagtgata agcccatttt agttgctacg gattcagcca 7320 gtgtcctgtc agctctggaa tcaccaacgg caaagcaccc ctggattcaa gcgatacaga 7380 acaccataaa cgacagaaat agaagaatca ctttgacttg ggtcccgggc cacactggaa 7440 tacacggtaa cgaacaggca gacatactgg ccaaccttgg tcggtcaagt cgccgtctat 7500 atcagaaagt acctgcagcg gatgccaaaa tatggatcaa acatgttatt gagaatgcct 7560 gggcagaaga atggcgtgcc aaccaattcc tgtttctccg aaaaattaaa gcaagtacgg 7620 aagcctggga agacagaacc aactggaagg agcaggtcgt gctttcacga cttcgtacag 7680 gacatacgag ggcctcacac gacttctcca gtaattcatc gtttcgcaga acctgtgaga 7740 catgcggaac tcgtaataca gtggagcaca ttctatatga gtgccccact ctcgagcatc 7800 ttagaaccct ctatcagatg ggaagcatca ggaatgtgct ccaaaatgac caggcaagtg 7860 aaaccaaact tatttgtttt ctaaaggatg caggcttatt tgcactaatt tgagtctcct 7920 taaactattt cccaaacagc aatcaaacta cccaaccccg ccgagtgcag tcccccgctg 7980 ggacatgcat ttcgtgaggg caatgacgaa aatagcaatc agacaacggt caccggcccc 8040 aggcatgaaa atggatcatc aaacggaaaa ctactataac ccaacattct caaaccgtat 8100 caccgaccca agcagaaaac aggctatcga gctggttatg acttaaagtt ctatgcaaac 8160 tccattacac ccctaaccca aacctattct cctaaatctt ctacctcaat atcctttcac 8220 ccaatatgaa tgccagcatt gttataattt tttgtaatac taaactctgt aacagactag 8280 agaataccct taaggtacct ctagcgtgca atattcgtca accgagatga accagcctca 8340 ggctgaaagt ctcgttaata aagataataa taataataat 8380 // ID Copia-7_DPu-LTR repbase; DNA; INV; 278 BP. XX AC scaffold_25; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_DPu_; KW Copia-7_DPu-LTR; Copia-7_DPu-I. XX NM Copia-7_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-278 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 678-678 (2010). XX DR Genome; scaffold_25; Positions 205555 205832. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 278 BP; 59 A; 64 C; 40 G; 115 T; 0 other; tgttgtgttt ttcaattcaa gcgttacccc aactctgttg ctgaacccca ttgttgccaa 60 aacctatctc tcccaattct tgtcaaagtt tccttcgtct gctatgtgac gttctacact 120 tcttcttgtt tctactccag ccgcgtttcg actttcgacg tataagtatt gtctcaagtg 180 atcatttgtg aagtgtgtgt tactttggta ttaaactccc tttttttcta tcaaatgcat 240 gcgttcatta ttcttagttc taatatatat tatctaca 278 // ID Crack-20_AAe repbase; DNA; INV; 5409 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-20_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5409 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1236-1236 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >95% CC identity. CC Both termini are uncertain. XX FH Key Location/Qualifiers FT CDS 3..461 FT /product="Crack-20_AAe_1p" FT /translation="ENSVCSARRLLGKNRNTQGAPILVTFGSASIKEKLFE FT LKRAYGPLELSKLGDSFRGFSTRVVIRDELTAFGRELYQEAKELQSSMGYK FT YVWPGRNGKILIKRQDGGKIEEIGSKKQIEDLKKMSAKRLLNSSSKERSTS FT SSPVQEPASKRLQM" FT CDS 513..3410 FT /product="Crack-20_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFSNYVNKNFFYYTIAELNKNVNVNVDNKQLKILQIN FT ARGMNCMGKFDKIRDILSLYIGNVDIVVVGETWLKENHTNLYNIDGFRSFF FT SCRPDSSSGGIAVYVKQTIKLSLQKNEHDDGQHHIHLVIHAESSPFHVHAV FT YRPPSYDVTRFMAMIDGICMGHGVDASCVIVGDVNIPVNHQSSSVVRQYNN FT LLRCYNFEVTNTYTTRPASNNILDHVVCSDVVRRNVVNETILSDISDHYPI FT LSTFKLHRAIEKRTLEKIIVNTRGLGEAFETAVEELVYESAEERLCKVIDL FT YKSLKERFSKKVAVQAKIKGYCPWMNFDLWKLLRIKENVLASCKRRPSESG FT PRELLARVSKLVQRAKDKCKKEYYGNLFRSSNHKQNWSHLNKVLGRSADDS FT EEIKLMMDNQITNGRPVADAFNKFFSTIGPQLASTINSRQDINKYNSLSPL FT NSSIFLRPASEQEVVVEIRNLDVNKSSGPDGLPIKFVKDHHRTFSLLLRDV FT FNQAISTGVFPDCLKVARVTPIHKSGSKTDINNYRPISVLSVLSKLFEKLL FT ATRLLSFLRTHNIMYSHQYGFRTGSSTLTATSELIDEVYEAMDSRELVGVL FT YLDLKKAFDTIDHDILLKKLEYYGIRGTPNNLIRSYLTGRTQFVYVNGTIS FT SQRGLTVGVPQGSNLGPLLFLLYVNDLPNLNLHGKPRLFADDTSLSYKNTN FT PNDIVRHMKEDLMKLQDYFNESLLSLNLSKTKYMILHSPRRNISLHGDLMV FT RGTTIEKVSCFKHLGLTIDSTLTWSDHINILQTQTSSTCGLLWKVSKFLPR FT KSLLAMYHAFVQSKLQYLVSIWGAAAKSRLKPLQTIQNRCLKTVFKLPRLH FT STVDLYRNSPSSVLPIPALRELQCLLQIHNNMFNREAHHNQNIERVSHRYS FT LRNATVLVTSRANTELAKRSVSYFSRKSFNSLPVALKLEQNARKFKLAVKS FT KIRTVVDRYII" XX SQ Sequence 5409 BP; 1554 A; 1307 C; 1200 G; 1346 T; 2 other; atgaaaacag cgtgtgtagt gctcgtcgtc tccttgggaa gaaccgtaat actcaaggtg 60 ccccgattct agtaacgttt ggctctgcgt caattaagga aaagctgttt gagctgaaac 120 gagcttacgg gccgctagaa ctgtcgaagc taggtgattc ttttcgtgga ttcagtactc 180 gagtagtaat cagagacgaa ctgacagcgt ttggacgaga gttatatcag gaagcaaagg 240 aactgcagtc atcaatgggt tacaagtatg tgtggccagg aagaaacgga aagatactca 300 tcaagcggca ggacggtgga aaaattgagg aaattggttc gaagaaacag attgaagatt 360 tgaagaaaat gtcagcgaag cggttgctca actcgtcgtc caaagaaaga tcaacatctt 420 catctcctgt acaagaaccg gcctcgaaac gtctgcagat gtaaagcgtt aaaaaatttt 480 gtgtgtgtca tatctgagtt ttttatatta taatgtttag taactatgta aacaagaact 540 ttttttatta tacaattgcg gagctaaata aaaatgtgaa tgtaaacgta gataacaaac 600 aactaaaaat attgcaaata aatgctagag gaatgaattg tatgggtaaa tttgataaaa 660 ttagagatat tctttcatta tacatcggaa atgtggatat agttgttgtt ggagaaacgt 720 ggctaaaaga gaatcacact aatctgtaca atatcgacgg atttcgtagt tttttttctt 780 gtcgcccgga ctcgtccagt ggtgggattg ctgtttatgt aaaacaaacg atcaaactga 840 gtttacagaa aaatgagcat gatgacggcc aacaccacat ccatctggtc atccacgcag 900 agtcttcgcc attccatgtg catgcagttt ataggccacc gtcttacgat gtgacgcgtt 960 tcatggcaat gattgacgga atatgcatgg gacatggagt cgacgcatcc tgtgtcattg 1020 taggtgatgt aaacattccg gtcaatcatc agagcagttc ggtagtccga cagtacaata 1080 atcttttaag atgttataat tttgaagtca ccaatacata cacgacgaga ccggccagta 1140 acaacatctt agatcacgta gtttgctcag atgtagttag gcgcaatgta gtcaatgaga 1200 ctattctttc ggatataagt gaccactacc caatcctatc gacctttaaa ctacataggg 1260 caattgaaaa gcgaacattg gagaagatca tcgtgaacac ccgagggcta ggtgaagcct 1320 ttgaaactgc ggtcgaagaa ctggtctatg aatctgctga agaaagacta tgcaaagtga 1380 tcgatttgta caaaagtcta aaggagcgat tttcgaagaa agtggctgtg caagccaaaa 1440 ttaaaggcta ttgcccatgg atgaacttcg atctatggaa gttacttcgc atcaaggaaa 1500 acgttctggc cagctgcaag cgacgccctt ctgaaagtgg accaagggaa cttctagcac 1560 gtgtgtctaa gttggtacaa agagctaagg acaaatgtaa aaaggagtac tatggcaacc 1620 tatttcgtag cagtaaccat aaacaaaatt ggagtcattt gaacaaagtt ctgggaagat 1680 cagctgatga ttcggaagag attaagctga tgatggataa tcagataaca aatgggcgtc 1740 cagtggcgga tgcgttcaac aagttcttca gtactattgg accccagctt gcctcaacaa 1800 ttaacagtcg ccaggacata aacaaataca attcgcttag tcctctcaat tcttcaatct 1860 tcttgcgtcc ggcaagtgag caagaggtcg tggtcgagat aagaaatttg gatgtcaata 1920 aaagcagtgg accagatggt ttaccgataa aattcgtcaa agatcatcat cggactttct 1980 cattattgct tcgcgatgtc ttcaatcaag ctattagcac aggagtcttt cccgattgct 2040 taaaggtagc acgtgtgacg cctatccaca aatcaggatc caaaactgat atcaataact 2100 accgaccaat atctgtgctg tctgtcttga gcaagttatt tgaaaaacta cttgcaacga 2160 ggttgctaag ttttttgcgt acgcacaata tcatgtacag ccatcagtac ggttttagga 2220 caggttccag taccctcaca gcaactagtg agttgatcga cgaggtctac gaagctatgg 2280 attctcggga actggttggg gtactctacc ttgacttaaa gaaggctttc gacactatcg 2340 accatgacat attgcttaaa aaattggaat actacggaat tcgaggaaca cctaacaact 2400 tgatacgcag ctatctcacc ggaagaactc agttcgtgta tgtaaatggg acaattagct 2460 cacaacgtgg attaacggtt ggcgttccac aaggcagcaa tctcggtcct cttttgtttc 2520 ttctatacgt aaacgactta ccaaacctga atttgcacgg gaagccacgg ttattcgcgg 2580 atgatacctc actatcgtat aaaaatacaa acccaaatga catagtgcgc cacatgaagg 2640 aggatttgat gaaacttcag gattatttca atgaaagttt gttgtcattg aacctatcca 2700 aaacgaaata tatgatattg cactcacctc gccggaacat ttcgctacat ggagatctaa 2760 tggttcgagg gactacgata gaaaaagttt cgtgtttcaa gcatcttggt ctaacaattg 2820 attccacgct tacatggagc gatcatataa acatactgca aacacaaact agctcaacgt 2880 gcggtttact atggaaggtc agcaagttct taccacggaa atcgttgttg gccatgtacc 2940 atgcctttgt ccagtctaaa ctgcaatacc ttgtttcgat ttggggagca gcggcaaagt 3000 ctcgactgaa accattacaa acgatccaga accgttgtct gaaaactgta ttcaagttac 3060 caaggctcca ctcgaccgtg gatttatata gaaacagtcc gtcatcagta ttaccaatcc 3120 ctgccttgcg agaattacag tgtttgctcc aaattcacaa caacatgttt aacagagagg 3180 cccatcataa tcagaacata gaacgagttt cgcatagata ctcgctacgg aacgcaacag 3240 ttttggttac ctcgcgtgct aatacagagt tggccaaaag atcggtttcc tacttcagta 3300 ggaaaagttt caattcgtta ccagttgctc tgaagcttga gcaaaacgcg aggaaattca 3360 aacttgcagt gaagtccaaa atacgtactg tagtggacag atacataata tgagaaacac 3420 aacattatgg aaaaccgtca aagaacatgt gccggaaact gccgattgta gcatatttgg 3480 ccgcgcagcg ccgcgctcat cctacggcga aagtagaatc cagctcttca gggtttgagg 3540 ccagtgtgct tcaaacagtc atccgtgcac cgctcattat tcggctgaaa gcagaatcca 3600 gctcttccag aggcgcatcg accgccttcc cgccaaccga gaggagaatc tagccttctg 3660 attcatccgt ggaccgcaca tcccttgtct aaaagcagaa tccagctctt ccagtagatc 3720 gaccgcttcc cccgccaacc acaacgtgct ccgaacagtc gtccgtggac cacacatctt 3780 tcggctgaga gcaaaatcca gctcttccag gtaagagact ggtatatgtc tagccgaagt 3840 cagtgatgct gcgcgcctaa tcagtttgac taactgttcc agaggcacat cgaccgcctt 3900 cccgccaacc gagaggagaa tctagccatc tgattcatcc gtggaccgca catcctttgt 3960 ctgaaagcag aatccagctc ttccagaata tcaaccgctt ctcccgccaa ccaaaacgtg 4020 ctccgaacag tcgtccgtgg accactcatc tttcggctaa gaggcatgaa tccagctctt 4080 ccagtacatc aaccgcttcc cccgccaacc acaacgtgct gcgaacagtc gtccgtggac 4140 cgctcatctt tcggctgaga gcagaaccca gctcttccag gtaagagact ggtatatgtc 4200 tagccgaagt cagtgatgct gcgcgcctaa tcagtttgac caactgttcc agaggcgcat 4260 cgaccgcctt cccgccaacc gagaggagaa tctagccttc tgattcatcc gtggaccgca 4320 catcctttgt ctgaaagcag aatccagctc ttccagtata tcgaccgctt cccccgccaa 4380 ccataacgtg ctccgaacag tcgtccgtgg accgctcatc tttcggctga gagcagaatc 4440 cagctcttcc aggtaagaga ctggtatatg tctagccgaa gtcagtgatg ctgcgcgcct 4500 aatcagtttg accagctgtt ccagaggcgc atcgaccgcc ttcccgccaa ccgagaggag 4560 aatctagcct tctgattcat ccgtggaccg cwcatccttc gkctgagagc agaatccagc 4620 tcttccagta tatcgaccgc ttctcccgcc aaccacaacg tgctgcgaac agtcgtccgt 4680 ggaccaccta attatatttc ggctgagagc agaatccagc tcttccagag gcacatcgac 4740 cgcctgcctg ccaaccgaga ggagaatcta gccttctgat tcatccgtgg accgcacatc 4800 ctttgtctga atgcagaatc cagctcttcc agtatatcga ccgcttctcc cgccaatcat 4860 aacgtgctgc gaacagtcgt ccgaggacca ctcattttcg gctgaggcag aatccagctc 4920 ttccaggtaa gagactggta tatgtctagc cgaagtcagt gatgctgcgc gcctaatcag 4980 tttgaccaac tgttccagag gcgcatcgac cgcctgcccg ccaaccgaga tgagaatcta 5040 gccttctgat tcatccgtgg accgctcatc ctttgtctga gagtagaatc cagctcttcc 5100 tgtatatcga ccgcttcccc cgccaaccat aacgtgctgc gaacagtcgt ccgtggaccg 5160 ctcatctatc ggctgagagc agaacccagc tcttccagtg gcgcaacgac cgctggcccg 5220 ctaaccgaga ggagaatcta gcccttcatg acaagacctc atccgtggac cgctcatctt 5280 tcggctgaga ggaaaattca gctcttccag cgaaaacgtc cacattcagc cgtctataaa 5340 ccgtttgtcc atcggactgt aggagaggtg atctagagga agtatcagac ccagccgttt 5400 ctcgacaca 5409 // ID DNA-TA-8_AAe repbase; DNA; INV; 192 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-192 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1277-1277 (2011). XX DR [2] (Consensus) XX CC ~94% identical to consensus. TA TSDs. TIRs are ~54 bp long. CC This CC family is inserted as a part of DNA-TA-5_AAe, Gypsy-78_AA, CC Shinagawa-3_AAe, Shinagawa-8_AAe, and TE-1_AAe. XX SQ Sequence 192 BP; 52 A; 47 C; 46 G; 47 T; 0 other; gggtgcagaa ccacttgggc acttccagat tcactttggc atggggggtt tttctcggcc 60 gaattttctg aaactttgca ataagaagcg cttcagtacg acgcacattg tggccaaata 120 tgagctctgt agctttcaaa aaaccccact gccgaagtga atcaaaagtg ccaagaatag 180 gatccggctc cc 192 // ID Gypsy-195_AA-LTR repbase; DNA; INV; 1223 BP. XX AC supercont1.68; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-195_AA_; KW Gypsy-195_AA-I; Gypsy-195_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1223 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.68; Positions 2009172 2010394. XX SQ Sequence 1223 BP; 327 A; 284 C; 297 G; 315 T; 0 other; tgtaaccgtt tggttacaaa atctcatgat tgttttttaa tttgtgcatg ttgttgagaa 60 actgtagtgt ttttatagat tttgttagtg ttgggttagt agaattagta ggataaaatg 120 tttgtgatag taggataagc agatggtaat tacatgtttt gtggaacata gtggggctcg 180 atggtagaaa agggcgaaga aatcaactgt ggtaaaatgg gcgaaaatag aactggatgg 240 gtggtgagcg acctacaagg aaggataatg gagggataag gaacgattgg aaaactggaa 300 ggaaaaggag ggctttgtaa gacttgggga gtcaaccatt gctcattagc aatccggcgt 360 cagtgatagt aagtcgtgag tgagttgaac catttgtgaa acggcagtga acgaccagta 420 gtgccagcca tagttcgttc aggtaaatct gtgaaggcga aatacctgga gactttattt 480 taatatgcct ctggtcttca ggacctcttt tttcttgctt ttgcgtaatc tttgattacc 540 acagcctcac gctgcacgat aggtcaagct gagtcttgaa cctgtgctct ccggctccgt 600 atccgccatt gagcttcttc tcgaaccgca cggttctaat cgcaaaggag aacccttctc 660 ggcgtggcca atacggtctc gcccgtggtc cgtcgatagc gtcaacgaag ggaacgccgt 720 agacccccaa gctacgttag tagccgtaca gtggcaagcc gccattgtac ccggccagag 780 caatccgcta ttgctttgtg cgtgcccaag ccaaccgcca tcttccagaa gaccaagaaa 840 accgaaccaa ctcaccagaa catcaagcca cggagccatc tgcgagcgct cgccaacgtc 900 gtccagcaac cccggtacgt accgaacaac ggaaatcatc tagccctaga attgcccttt 960 tagaccacaa tagcccaata aatccctact tgtaaagttc aataaatccc tttagtttca 1020 acccacattc aagttcatcc taatcttcaa gaattccata gtttctgcac ccgttgttcc 1080 gcgtagcccc tccgaattac ccagtagcat gccgaccctg cgacacagtt caggggtctc 1140 atgctactga gctagttaac cgggagtctt ggttctgccg tgagcctata gagcagcatc 1200 agcgtgagtc taagtgagtc tca 1223 // ID BEL-162_AA-LTR repbase; DNA; INV; 443 BP. XX AC AAGE02022918; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-162_AA_; KW BEL-162_AA-I; BEL-162_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-443 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022918; Positions 83347 82905. XX SQ Sequence 443 BP; 170 A; 82 C; 84 G; 107 T; 0 other; tgttccggac gacgaatatc cagcaccgcg gtgcgaaagg gtcactcgat tcacatgcag 60 caggagatca tagtcgacaa gtaacgtcaa cacatacgac gatgtggaat gaaatgtcat 120 gccaataaat tgaattgtct acctaaatac gggcaaaaca caacacgata aaaattgctt 180 ataactagaa cattgagttg gtagaatttc ccttaaaatt agttaaaatc tagctaaaat 240 tactgaaatt gttaaaataa attgtggaac agtggacaag ataccaatag aggattagaa 300 atgtaagtaa cacgcagttt aaaatgaaat tatgaactaa gcgaattaaa attaattgca 360 gcttaaaagc tgactcatgt gccaacataa aacgagtcgt gctacaagat cgtccgaaat 420 acctttccgc agctgtccca aca 443 // ID DNA8-14_AP repbase; DNA; INV; 937 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-14_AP. XX NM DNA8-14_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-937 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1756-1756 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 937 BP; 273 A; 122 C; 159 G; 381 T; 2 other; cagggatcga aaccgtttca aattttaacg gttacggttt tagttcttga gaacggtttt 60 ctaaattttc tggttttggt ttaggtttta agatcggttt ttaaaatttt ttggttttgg 120 tttcggtttt aaaaaaggtt tttcaaatat tttatcggtt taaagaacgg ttttagaaaa 180 ttttcggttt tggtttttgg aacggttttt aaaaatntcc attttatttt cctgtctaat 240 tactgtagac tcgattttta ttatctatta actattataa gggccgagtc taaagccctg 300 aaatatggct actttgtata attagtgtac aatatacgat ataacaaatt tacgttcagc 360 agaactaatt tcgtgttatt aacttgaata tttcgtacct aatttaggtg cctcgtgcct 420 acttttgcat tcgacataat attatgttgt catattatgg ttgagatcat tattcgatta 480 cattaatatt taattgtata ttacaataac atcacactat attatttgta tttgcaattt 540 tgaatcataa gtttagtaat aatatatata tattaatatt tcgtaatact cagtaattgc 600 atcgggattt ttatcaataa atccgaacat aataatttct tcgtataaaa atagtagaat 660 tacaaataat ataattaact ttatgaaatt caaagaaacc gacattagtt tcaaatttta 720 atcggttttt caagaacggt acgagaacgg tttttcaatt ctatggtttc ggttcgggtt 780 ttgaaaacgg ttttttaagg tcttaggttt cggttacggt tttttaaccg gtgtttnaaa 840 gcttttggtt ccggttccgg ttccggtgtt cagaaatgag aaaaccgcgg ttttaaccgg 900 ttttttggaa aaccggttaa accggtttcg atccctg 937 // ID Gypsy-3_IS-I repbase; DNA; INV; 4046 BP. XX AC ABJB010044547; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_IS_; KW Gypsy-3_IS-LTR; Gypsy-3_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4046 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010044547; Positions 10868 6823. XX CC Positions [3164-3499] - Integrase core CC 'GGAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 114..1451 FT /product="Gypsy-3_IS-I_2p" FT /translation="MAQAQALYAAPIQYDPAVENWTNYVERLEAYFDASGV FT KDDGRKRSILISALSPEVYARLRSLVAPNKPRDETFDTIVKVLTQHLSPEP FT SEIYETFRFQSRVQNEGESVADYLAELRRIADHCGFGDALERNLRDRFVIG FT LREKTVQRILLAKPKKLTLKEALDTALAAELAMRNADRLPGSSSLPTTAGN FT VNALRFKKHKGRQQKTSGTKTAQPCIRCGDSSHDPPNCPHKKTTCFTCNKQ FT RHLASRCFRTQSTPAVNRKKQQANLLTEEAQLGQAEETFHLFTVHSPKATV FT DPVNEVLQWGTVEVTMQVDTGSPVCVISRDTYSKNAKVWPPLQGTTKELHC FT YLGKLPVLGVLAMPVRRGDKKVQGQLYVVDCTGPSLCGRDVIQALNLLEEH FT LIGSMQTSDNQVQDTVKKLKEDFKDLFQPGTVLMKGPAAHINLKKEVNPKF FT C" FT CDS 2177..4018 FT /product="Gypsy-3_IS-I_1p" FT /translation="MSTLLAPLHNLLRSEVKWKWGREEQAAFDGVKTTLQK FT SNLLVHYDPSRELILECDASPEGLGAVLSHCVNGVDRPIAFRSRTLTKAEK FT NYSQLEKEALALIFGVTKFRDYLLCRSFTLKTDHQPLVGLFKESKSIPSMA FT AARIQRWALTLGAYKYVIKYKPGSQNLNADALSRLPVATAPDAAETEEAQE FT MGVFTVDMLDASPVSRRQLEDLTGKDPTMRSLRGTISRGWPKRLLDSRLQP FT YWHHRNALTLEGAFICMGGRVVIPECARRAFLLELHDTHMGVNATKALPRT FT LVWWPGIDREVEELLHRCTVCQQAAPMPPSREPLAWPATKQPWTRVHVDYA FT GPVEGKMILVVVDSFSGWIEALPTTTATSATTVELLRTIFARFGLPTCLVS FT DNGTSFTGAEFQDFVKKNGIRHIRTAPIQPQSNGLAERAVRSVKEGLKRVS FT GGSLTTRLARWLHMHRRMPSGGRPKSPAEMLLAYQPKVRLGLLTARAESHK FT PAGPANQSPTWSGGQSVFARNFGKGPRWLSGNITEELGNRMALVQTPGGMI FT RRHHDQLRLRHEVNTPDPLHQPSERNEDLMLEWGLSSAAPETSRATLEPSK FT EQRPLRQRRPPVRLDL" XX SQ Sequence 4046 BP; 1048 A; 1035 C; 1189 G; 774 T; 0 other; atggcgacga ggctcgataa ggagaggaca accgaacgcc ggtacgtgaa cgtcaagtgg 60 tgttagtcgg tgcgtcggtg ggacccggga acaaaggaga agcggtggct gccatggcgc 120 aagctcaagc cctctacgcc gcgccgatcc agtatgaccc ggcggtggag aattggacga 180 actacgtgga gcgactggag gcgtacttcg acgccagcgg ggtgaaggac gacggtagga 240 aacgttcgat cctaatatcc gctctcagcc cggaggtcta cgcacgactg cgaagcttgg 300 tggcgccaaa caaaccgcga gacgaaacct tcgacacgat cgtgaaagtg cttacgcagc 360 acctcagtcc ggaaccgtcc gagatctacg agacgttcag gttccagtca cgagtgcaaa 420 atgagggaga aagcgtcgcg gactatctgg ccgaattacg gagaatcgca gaccactgcg 480 gttttgggga cgcactagag cgcaacctaa gggaccgatt cgtcatcgga ctgcgcgaga 540 agactgtgca gaggatcctc ctggcgaaac caaagaaact tacgctcaaa gaggcgctgg 600 acacagcgtt ggccgcggag ttagcaatgc ggaatgccga ccggcttccc ggctcaagtt 660 cgctacccac tactgcgggg aacgtgaatg ccttgcggtt taagaagcac aagggacgtc 720 agcagaaaac gtctgggacg aagactgcgc agccctgcat tcgttgcgga gacagctctc 780 atgacccgcc gaactgcccg cacaagaaaa cgacgtgctt cacgtgtaac aagcaacgac 840 atttggcgag ccggtgcttt cgcactcagt cgacaccggc ggtcaaccga aagaagcaac 900 aggccaacct gctcacggaa gaagcacagc taggacaggc agaggaaact ttccacttgt 960 tcacagtgca ctctccaaaa gctacggttg atccagtcaa cgaagtgctc cagtggggaa 1020 cagtagaggt gaccatgcag gttgacactg gctcccctgt gtgcgtgatt tctagggaca 1080 catactccaa gaatgctaag gtctggccac ccctacaggg aactaccaag gagctccact 1140 gctacctggg gaaactacca gtgctgggag tcctggcgat gccagtaagg cgtggagaca 1200 aaaaagtgca aggacaactg tacgttgtgg attgtacggg tccatccttg tgtggtcgag 1260 acgtcatcca agcgttgaac ctcctggaag agcatctaat cgggagcatg caaacaagtg 1320 acaaccaagt acaggacacg gtcaagaagc tgaaagaaga cttcaaggat ctcttccagc 1380 cggggacggt gttgatgaag ggacctgcgg cccacatcaa cttaaagaaa gaggtaaacc 1440 ccaagttctg ctaggcccgg ttccttccct gtgccctacg tgacagggtg ggagaagaac 1500 tggaccgaat gtgcaaagag gggatattgt ctccggtatc ctacagtgag tgggcaacgc 1560 cagtagtgcc agtgaccaag gccaattgag gtctcaggat ctgtggagac tttaaagtga 1620 ccctgaacat ggcatgcgac atcgaacaat acccacttcc caaagtggag gacatctttg 1680 cgtctctgaa ggatggccat tggttctcga aattggacct tcgggaagcc tactgccatg 1740 tggccttaga tgaagaatcc agacaggccg cagtgctcaa cacccacaag ggtctcttct 1800 gctacaaccg gctgccatac ggcatcgctt cagcaccagc gatcttccaa aggaaaatgg 1860 aggcgttgtt gaaggacatt ccgggaacgc aggttttctt ggatgatgcc ttggtggcag 1920 agggcaaaga cgaattcggg cagacgctgc ggaaggttct gcagagattc agggaaaatg 1980 gcatccgact gcgggaagaa aaatgtgtgt ttggtgaaga ggaggttacc tacctcggac 2040 ataggatcga ccgacatggg ctgcatccat ccgaaaagaa agtggaggcc atcgtgaagg 2100 ctcctgcacc tcgcaacctc caagaacttc ggtccttttg ggactcgtca catactacag 2160 gagtttcctg cctaggatgt ctacacttct agctccgctc cacaacctgc tgaggagtga 2220 ggtgaaatgg aaatggggaa gagaagaaca ggctgcattc gacggggtga aaactacctt 2280 gcagaagtcc aaccttcttg tgcactatga cccaagtagg gaactcattt tggaatgtga 2340 tgcgtcaccc gaaggcctag gagcagtgct gtctcactgc gtgaatggcg tcgacaggcc 2400 tattgccttt cggtcaagaa cccttaccaa ggctgagaag aactattcgc agctagagaa 2460 ggaagcgctt gcgctcattt ttggggtaac gaagtttagg gattaccttc tctgcaggag 2520 cttcacccta aagacggacc accaaccctt ggtgggattg ttcaaagaaa gcaaatcgat 2580 tccaagcatg gctgcggcac gcatccagag gtgggcacta acgcttggag cgtataagta 2640 tgtgatcaag tacaagcctg ggtcacaaaa cctgaacgca gacgctctca gtcgtctgcc 2700 ggtggcaact gccccagatg cagcagaaac cgaagaggcc caagagatgg gggtcttcac 2760 agtggacatg ttggacgcca gcccagtgtc cagacgacag ctagaagacc tgacggggaa 2820 ggacccaacc atgcggagcc tacgaggcac gatcagcaga ggttggccaa aaaggctttt 2880 ggacagtcgc cttcagccgt actggcatca tcgtaatgca ctcacgctgg aaggggcctt 2940 catctgcatg ggtggaagag tggtgattcc agagtgtgca cggagggcat ttctattgga 3000 gcttcatgat actcacatgg gagtcaatgc aactaaggca ctgccacgca cgctggtgtg 3060 gtggccaggc attgaccgtg aggtggaaga gttgcttcac cggtgcacag tgtgccaaca 3120 ggcggctccc atgccgccga gtagggagcc actggcatgg ccagcaacta aacagccgtg 3180 gacaagagtt catgtggatt atgccggacc agtagagggg aagatgatcc tggtggtggt 3240 cgacagtttt tctggctgga tagaggcatt gccaacgaca acagctacat cggcaacgac 3300 agtggaactg ctcagaacca tctttgctcg atttggcctg cccacatgtt tggtgtcgga 3360 caatggtact tcattcacag gagccgaatt ccaggacttc gtcaagaaaa atggaatacg 3420 tcacatacgt accgcaccga ttcaacctca gtctaacggc ctggcagaaa gggccgtgcg 3480 ctcggtgaag gaaggactaa agagggtgtc tggtgggtcc ctcacaactc gcttggcgag 3540 atggctccac atgcatcgca gaatgccatc gggtggtcgt cccaagtctc cagcagaaat 3600 gcttctggcc tatcagccaa aagttcgtct gggcttgctg acagcccggg cagagagtca 3660 taagccggca gggccagcca accagtcgcc tacatggagc ggaggtcagt ctgtttttgc 3720 aaggaacttt ggtaagggac ctcgctggct gagtggaaac atcacggaag aactgggaaa 3780 ccgcatggcg ctggtgcaga cacctggtgg gatgatccgc agacatcatg accagctacg 3840 cctgaggcac gaggtcaaca cacctgaccc tctgcatcaa cccagtgagc ggaatgagga 3900 tttaatgtta gaatggggcc tatcctcagc ggctccagag acgtcaaggg ccaccctgga 3960 gccaagcaag gagcagcgtc ctctgcgaca acgccgacca cccgtgaggc tggacttgta 4020 gttccacata ctaagtgtgg gggaag 4046 // ID Gypsy-20_DWil-I repbase; DNA; INV; 6632 BP. XX AC scaffold_180958; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_DWil_; KW Gypsy-20_DWil-LTR; Gypsy-20_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6632 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180958; Positions 33827 40458. XX CC Positions [4498-4977] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 750..2165 FT /product="Gypsy-20_DWil-I_4p" FT /translation="MWRDNQKNSDSSDEELLKVIEGNSEQVKDTYTNNQKN FT SDSSDEELSKVIEGKSELVKETHKMPTTEEVMKIVQVAVQTALDHQAAQMQ FT STFTQQLNLEMESLRTQIESLKPVAPEVEVYRNAEIIPGIECKEPLDIVRS FT LPDFDGNQNEYISWRSAASFAYELFRPYRGSSAHYQAVGIIRNKVKGAASS FT TLASYNTVKNVDAIIARLDFTYADKTPIRVIQQKLGTLRQGEQSLSQYYDE FT VEKTLTLLTNKTIMTHEAASANVLNEQFRSDALHSFVSGLKKSLRAVVLPA FT KPKDLPTALALAHEAEVSHDYAAFAASFARATEEKAQGPFEQKNQNFRNNA FT NQNKSSENSQGAYKKTPHYTRVQEKPNNKGKPPQNPGAQAFSTPEPMEVDS FT SSKYKQPTRYSENKSTKSGFRSKKNQQVHFTSQDSPEEEAYQTKAEASALA FT ASDDLSDSEESCNFLGNTPFYRSSREQ" FT CDS 2228..4039 FT /product="Gypsy-20_DWil-I_3p" FT /translation="MAVLKFITPAEKPFNVRSIHGTSKVTEKCPIHIFGLK FT AHFFLINIPEDFDGIIGFDLLKKVGATINLLNNQIVTKNGSEAINFTRCPQ FT VNFIYIDDTDVPAAVLKSFNKMMDERVGAFADPDEALPYNINTVATIRTNG FT EPVYSRLYPLPMGVTDFVNAEVKQLLAHGIIRPSKSPYNNPTWVVDKKGTD FT ENGFHKKRLVIDFRKLNQNTTDDKYPIPMISTILSNMGKAQYFTTLDLKSG FT FHQITLAEKDREKTAFSVNNGKYEFCRLPFGLKNAPSIFQRAIDDVLREKI FT SRTCYVYIDDVIIFSETKEDHVRDIEWVLKSLQDAGMRVSQEKSKFFKKSV FT EYLGFIVSRDGITTSPEKVQAIKNFPMPSTLFDLRSFLGLASYYRCFIKDF FT AKIAGPLTDILKGDNGKVSKNMFRKIKIEFSLEQSQTFSKLKDILASEEVF FT LAYPDYRQPFDLTTDASGDGLGGVLSQKGRPITMISRKLRNNERNFATNER FT ELLAIVWSLKSLRNYLYGVRDLNIFTDHQPLTFAVSDKNSNAKLIRWKAYI FT DGHGAKIIYKPGKENFVADALSRQNINALEAPDAESDRATCHSEESLSYTI FT DVSDKP" XX SQ Sequence 6632 BP; 2445 A; 1352 C; 1284 G; 1551 T; 0 other; tttatataac tggcgcccaa ctaaaattaa aaagtgccca aaacttaaaa aacaattcta 60 agtaaaatag aattgaaata aaaaaacaaa aaacacaaaa agtctaaaca ttgtagtgaa 120 caaattatca ctacaaaaat aaaggaaact aaaagtgtcc aaaacttgaa aaacaattct 180 aagtaaaata gaattgaaat aaaaaaaaca aaaaacacaa aaagtctaaa cattgtagtg 240 aacaaattat cactacaaaa ataaaggaaa ctaaaagtgc ccaaaacata aaaaacaatt 300 ctaagtaaaa atagaattga aataaaaaac taaaaacaca aaaaagttta aacattgtag 360 tgaacaaatt atcactgcaa aaacaaagaa aaaaaaaaac aacaacatag tcacgtcact 420 gggctggtga cggcagcggc gaagaaaaaa gaaattgaag acgagaagca gcaacaacaa 480 acgaaaaaaa actccaccag aataccgtcg accacaatcc agcaataaga gggcatagaa 540 atttatcaac agaaaaatga aaatgtaagt gagatttttg ttgtctaaat aaataaatct 600 gggattgtga aaatattcaa taaatcgctt ggcattagag gtggttgtta aaaaaaaaaa 660 aaaaaaaatt tgtgaagttg acattctctg ttttaattga atcaaaaaca ttttacacta 720 ttaaacgaaa tatttgttgc cgtctgttca tgtggagaga taaccaaaaa aatagcgata 780 gtagtgacga agaattacta aaggttatag aaggcaacag tgagcaagtg aaagatacgt 840 acaccaataa tcagaaaaat agtgatagta gtgacgagga gttgtcaaag gttatagaag 900 gcaaaagcga gctagtgaaa gaaacacaca aaatgccaac aacggaggaa gtgatgaaaa 960 tagtgcaagt ggcagtgcag actgcgcttg atcaccaagc tgcacagatg cagtctactt 1020 tcacacaaca gttaaacctc gaaatggagt ctttaaggac tcagatagaa agtttaaaac 1080 cagtcgcacc ggaggttgag gtttatagga atgccgaaat catcccagga atagagtgta 1140 aggaaccatt agatattgtt agatccttgc cagatttcga tggcaatcaa aacgaataca 1200 tttcatggcg ctccgcagct tccttcgcat acgaactatt tcgaccttat aggggcagtt 1260 ccgcccacta ccaggcagta ggcataatca ggaataaggt taaaggtgcc gctagttcca 1320 ctctagcatc gtacaacact gtaaaaaatg tcgatgcgat catagctcga ctagacttca 1380 cctatgctga caaaacgccc attcgcgtta ttcagcaaaa gttgggaacc ttacgacagg 1440 gggaacaatc gctgtctcag tactatgacg aggtggaaaa aacattgacc ctcctcacca 1500 ataaaacgat aatgacacat gaggcagcgt cagcgaacgt gcttaatgag cagtttaggt 1560 ccgatgcgct acactccttt gtgtcggggc ttaagaaatc cctcagggcg gtagtactac 1620 cagcaaaacc caaagattta cccactgcgc tagcattagc gcacgaagct gaagtaagcc 1680 acgactacgc agccttcgct gcaagctttg cgcgggctac tgaggaaaaa gctcagggcc 1740 cctttgaaca aaagaaccaa aattttcgaa acaatgctaa ccagaataag agctcagaaa 1800 atagccaagg ggcatacaag aagactcctc attatactag agttcaggaa aaaccaaata 1860 acaaagggaa gccaccacaa aatcctgggg ctcaggcgtt ctcaacacca gaacccatgg 1920 aggttgactc ctcttcaaag tataagcaac caaccagata ttcggaaaat aagtccacaa 1980 aatctggatt ccgctcgaaa aagaatcagc aggttcattt cacgtctcag gattcccctg 2040 aggaagaagc ctatcagaca aaagcagagg cgagcgctct ggcagctagt gatgaccttt 2100 cagatagtga agaatcttgt aattttttag ggaacactcc cttctaccgt tcatcacgag 2160 aacagtagga ggaaaaacaa ttaaattact tatcgatacg ggggcctcaa aaaattatat 2220 tcgtccaatg gcggtactaa aattcatcac ccccgcagaa aaaccattta atgtcagatc 2280 catacatggg acctcaaagg taacggaaaa atgcccaata catatttttg gcctcaaagc 2340 acatttcttc ttgataaata tcccggaaga tttcgacggc ataattggtt tcgacctttt 2400 aaaaaaagtc ggagcaacta tcaatctgtt aaacaaccaa atagttacaa aaaatggatc 2460 tgaagccatt aattttaccc gttgcccaca agtaaatttc atttatatcg acgatacaga 2520 cgtcccggct gctgtactta agtctttcaa caagatgatg gacgaaagag taggagcgtt 2580 tgcagaccct gacgaggctc tgccatataa cataaatact gtagcaacta ttcgaacaaa 2640 tggggaaccg gtatactcca gattataccc gctcccaatg ggcgttacgg actttgtcaa 2700 tgcagaagtt aaacaacttc tagcccacgg cataattcga ccctcaaaat ctccctacaa 2760 taatccgacc tgggtggtcg acaaaaaagg aacagatgaa aacggcttcc ataaaaagag 2820 gctagtaatt gactttagga aactaaacca aaatactacc gatgacaagt accctattcc 2880 aatgatttca acaatcttgt cgaacatggg aaaggcccaa tattttacaa cattggacct 2940 taaatcggga tttcatcaaa taacactagc ggaaaaagac agggagaaaa ctgctttttc 3000 tgtcaataat gggaagtacg aattctgcag acttcccttc ggtctcaaga atgcacccag 3060 tattttccag agggccatcg atgacgtcct tagggaaaag ataagtagaa catgctacgt 3120 ctatatagac gatgttatta ttttttctga aacaaaggaa gaccatgtta gggacattga 3180 atgggtcctt aaaagccttc aggacgcggg gatgcgagta tcacaggaaa aatccaaatt 3240 ttttaaaaaa agtgtggagt atctgggatt tatcgtatct agggatggta tcacaacttc 3300 acccgaaaaa gttcaggcaa tcaaaaattt tccaatgcca agtacccttt ttgatctcag 3360 atcatttctc gggcttgcca gctattatcg ctgcttcata aaagacttcg cgaaaatcgc 3420 aggtcccctc acagatattc taaagggcga taatggaaaa gttagcaaga acatgtttcg 3480 gaaaataaaa atcgagttta gtttagaaca atcgcagaca ttctccaagc taaaagatat 3540 tttagcatca gaggaagttt ttttagccta cccagattat cgacaaccat tcgatcttac 3600 cacagatgca tctggtgatg ggctaggcgg tgtcctttca caaaaaggca gaccaatcac 3660 catgatatcc cgtaaattac gtaacaacga aagaaatttt gcgactaacg aaagagagct 3720 tcttgccata gtgtggtcac taaaaagctt gagaaactac ctttacgggg taagagacct 3780 aaatatattt accgaccatc agccattaac ctttgccgta tcagacaaaa attcgaatgc 3840 gaaactaata cgctggaagg catatattga tggacatggc gcaaaaataa tttataagcc 3900 aggaaaggaa aacttcgtag cagacgccct ttcgcgtcag aacattaacg ctctcgaagc 3960 accagatgcg gaatccgata gggcaacctg ccatagtgag gagtctctgt cctacactat 4020 tgatgtatcc gataagcctt aaattgtttt cgaaaccaga ttgtgattga acaagcagag 4080 tttccttcgg taaggacatt cgttctcttc cagagtaagt ccaggcacat aatacgcaca 4140 tcaaaccccg gttcactttt acagacactg caagatgtaa taaatgctga agtggtaagc 4200 gcaatacatt gcgaactgcc aactcttgcc ctcattcagc ataaattggt cgaactgtat 4260 ccggcaacaa aattctggta ctgcaaacac atcgttagag atgtaccaga caaatcggaa 4320 caaaaagaaa ttgtagtgac cgaacataac agggcgcacc gtgcagccca ggaaaacatg 4380 aaacaaattc tgcaagacta ttttttcccg aacatgggaa aattgaccac agaaattgca 4440 gccaattgca agatctgctc tgtcgcaaaa tatgacaggc atccaaaaaa acataagttg 4500 ggtaccacac cagtaccagc ttacgcagga gaattcttac acatcgacgt tttttccaca 4560 gataaaagat tcttcctgac ctgcatagat aagttttcta aatttgcgat agtccaacct 4620 atagcatcga gagccatagt tgacgtaaag ccggccattc tacagttagt caattacttt 4680 gctaatgtta agacaatata ttgcgataat gaaaggtcaa taaactctga aacaataagg 4740 tccctccttt taaatgagtt tggaatacaa atcgtcaacg ccccaccctt acacagtaca 4800 tctaatggcc aagtagaaag attccacagc acgttacttg agatcgcacg atgcgacaaa 4860 ataacaaccc aagctagcga cactatcgat gttatattac tagcaacagt aaagtataac 4920 aggtcaatcc attcagtctt gaacaaaaaa ccaattgaag cagttcatga aactacggta 4980 gaagagggat tgcaaattgc caaaaagata aaagacgctc agagcaaaca gatggaacag 5040 gaaaacagaa ccagaataga taaaaagtac caacggggtg acaaagtctt ggtaaagtca 5100 aacaaacggc ttggtaacaa acttactcct ttgtacaagg aaggaattgt agaggctgac 5160 ataggaacta aagtccttat aaaggggagg gaggtccata aggacaattt aaaatagaac 5220 ctctaaaatt ggtttctaaa attaggttta cacacttcta taactactag agtaataaga 5280 caattttata atatcagaaa ttacagaacg gactttatac ttatactgac aatactgaga 5340 acacacgcga caatgaccat gaaaatcacg gactatacca aagcagactt tatacagata 5400 ctggacggag acgtgacagt ttggaaagag tacagctacc tcggacatac gacaaacatt 5460 acctcgtaca ggacatacgc ggacgagacc agaaaaacta ttgaattctt taaggaagat 5520 catattaaaa gaattatttt gacagatatt agggaaatcg acgcactact agaaacaatg 5580 acggttcatc atagaggcgc tagaagccta aatattttag gcactgcatt gaaagtagtg 5640 gccggaactc ctgattttga cgagcaatta atagagtcag aaaataaaca agtcaagatc 5700 aacaacaaat tacaggaaaa actgaatgca ataacgaact ccataaacca aataagtaaa 5760 acagaaaatg tcaatcacga acacttttat gaaaccatcc tagcaaaaaa cagaatggtc 5820 atttcagacc ttgaaaattt aattctaaca accactttgg caaaagcgaa tttgttgaac 5880 ccaataatcc tcgatgactt ggatgtcaat gaaatgacta acaaaaatct cacaaatatt 5940 agtgtgtcag atttattaga tgtaggtagt attaaggttt ttcgaaattt taacattttg 6000 tactttttaa tcaaatatcc tcacccagaa tctgtctgta agaaaatcag cctttacccg 6060 gtagagcggg ataacatcat tttagatttt aataacagaa acgtagttgc tgattgtggt 6120 atggagctat tctcagtcga gggttgccaa ttggcggtaa ccacaacatt ctgcaaaaga 6180 tggccacgaa tggctgcacg caacaattag tagccgcaac tcaagcccat tgtagctcca 6240 gaccaggtca cctggaaccc ttaatagttg tgaacgaggg aatggttatc atcaacgacg 6300 ccaccatgga catcatagac gaagtcggca acgaaaaaag ggtgaacggc acctacctgg 6360 tgacattcac cggcaaaatt accatcaacc aaacaacgtt tgtcaacaag tatgatgtgc 6420 tcagaaagaa gccataaccg gactacgctc aaaaataata tcgggccctg tattgagcag 6480 cagtgtcact attgggatag tcgtaatctg gttcgcaatt tatctggcca ttcgcagact 6540 ccagagacgc aacctagtaa caactcttca aaagtctctt gacctacaga agtccgggga 6600 cggcctccac ttaagcgagg gaggagttaa ca 6632 // ID Loner_Ele3 repbase; DNA; INV; 5945 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Loner non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; Loner; KW Loner_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5945 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5945 RA Kojima K.K. and Jurka J.; RT "Loner non-LTR retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (07-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >98% identical to consensus. ~10 bp TSDs. CC This consensus is ~99% identical to the original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 366..1811 FT /product="Loner_Ele3_1p" FT /translation="MFSSRDGNIPTDTFPQQEIACGNVDMASHTGTISKHG FT SSLTQPSKETSIQHSHSKQCDVENENVHFSQQPRIRQYVDDPNRERWVVFI FT RRVNKKIRVKQVTSDLIKHYPSVIFITEVNRDKLRVVFTNYKEANAVVNDD FT RFRLEYRVYVPARVVEIDGVVTSDSDLSPNDFKEAVGVFKNPNIPPVKIID FT CQQLKKVIKNGSNSQYVPSGSFRITFEGSALPHYVTMNKILRLRVRLYVPK FT VMSCTNCKQLGHTKSFCSNKAKCLKCEGQHEEDACQKNVEKCVLCGLDPHD FT KKDCPKFQNYSSKLKYSVKERSKRSYAELVKKTLVSNMFLELSDSDDDSDS FT DTEYDSLPSQKGKRGRKHIPSSVMSKNKQKVARIEKVHTPEKYNLVPRDSV FT EFNKSFPLLSGNAHRDTSSTEPKFQSQKENTMFTGFLKLSTIIQWVLDILK FT APKPIQDLLFGCIPMIANFGKQMTIQWPILSLIDFDG" FT CDS 1807..5487 FT /product="Loner_Ele3_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MARYANQLTDHVSVLQWNCRSVMANINSLNLLISETN FT CDIFALCETWLIPELTVDLIDFNIIRQDRSDGYGGVLIGIKKCHSFYRIGV FT SNKSKIEVIACHVNMKGKXFSLVSFYIPPKVTFSYNDLETILDTIPAPWVI FT VGDFNCHSTAWGSDHDEKRAMPIHDMLDCYNLCILNTGNVTRIAGPQKKDA FT VLDLSFCSSSMVYDCSWIVIDNPHGSDHLPIVISIHNDNQRMREYVNIEYD FT LTKNIDWKKYSSIISEALENLADLCLEDEYDFIVNLIIDSAIKAQNKPIPN FT KKIVNRIPNPWWDKECNTVVKEKSAAFKVYRKVRSRQSYEKYISLERKFKN FT LVRAKKKSYWRKFVGELSHETSMTALWKAAKNMRNKSSVNEDKEYSHEWIK FT KFAKKICPDIVSENQICFDTSEDSVPMFTMIEFSLALLSGRNNSPGKDGIK FT FDLLKHLPDVAKKRLLNLFNWFLDHNVIPNDWRKVKVIAVQKPGKPPSDHN FT SYRPISMLSCIRKLMEKMILNRLDNWAESNKLISETQFGFRKGKGTTDCLG FT LLTTEIDIAFGSKQQMTSVFLDIKGAFDSVSLGILTQKLHRKGLSPKLNNF FT VNNLLREKHMYFSSGSISEYRVSYMGLPQGSCLSPLLYNIYVSEIDNCLAE FT GCTLRQLADDCVISVIGKDREDMQAPLQATLDSLSSWATDLGIEFSPEKTE FT MVVFSKKHHPAKFELQLSGKKINQSLSYKYLGIWLDSKCTWKKHVQYIKSK FT CEQRINFIRSITGSWWGAHPEDLIRLYKTTVRSVIEYGSFCFYPAARTHIL FT VLERIQYRCLRLALGSMNSTHTKSLEVLAGVPPLMIRFYELGSRFLIRSEI FT MNPIIIANFEKLLQLMVQSRRMPLYYWHMSEDISPAIIDSEQLNMNFLQLN FT NDKVTYDISMKQEINNIPDYLRSKIIPSIFSSKYGRLSSDKCFYTDGSSID FT NSVGYGVFNENVSTYHQLKNPCSVFVAELAAIYNALIIILNLSVGSYYIFS FT DSLSSIEALKSIKPQNATYFIIEIRKLLNTLTMRSFSITFIWIPSHCLING FT NEKADILAKKGAIVGEIFERKIDFREFFQTCRRKAIAEWQRSWDEGSMGRY FT LYGLAPKISKKSWFKDYNLTRDFIKVISRLMSNHYRSKAHLKRIGLSDTNH FT CDCGEGFEDIEHVVWVCSKYEVKRQELIDSLPVLWKPPNATVRDALFSRNP FT NFLNPIYKFLKAIDFPI" XX SQ Sequence 5945 BP; 1960 A; 971 C; 1096 G; 1917 T; 1 other; ttcattggga tcgcagtagt cgtaacgatc ggacgtgcgc ttactcgctc tggcagtgat 60 ttgacagcag attttttcgt ggtaaagaaa attgccgtgt cgtgcaggaa aaaaagtggt 120 ggcagctgga attgtaggaa ttttcaagtc gtcggtttgc tggctggcac atctttcgtt 180 gttcaatagc agataagtgt ttttcattgt tccttttatt attagcggta tctaaaagcc 240 ttaaattcca cccttcttgt tttatccatt atattgttat ttattttttc tttcattagt 300 ttctgtttta ccatttcgtt ttattcaaag ttttttttta actcttttca ataaagtcaa 360 aagggatgtt ttcgtcgaga gacggtaaca tccctactga cacatttccg cagcaagaaa 420 ttgcctgcgg taatgttgat atggcctctc atactggaac tatatctaaa cacggatcaa 480 gcctaacaca accatctaag gaaacatcca ttcaacattc tcattctaaa cagtgtgatg 540 ttgagaatga gaacgttcat ttttcacaac aaccgagaat acgacaatat gtagatgatc 600 cgaaccgcga acgttgggtg gttttcattc ggcgtgttaa caaaaagata agagtgaaac 660 aagttaccag tgatctgatc aaacattatc cttcagtgat ttttattact gaagtcaatc 720 gtgataaact tagggtggtt ttcacaaact ataaagaggc caatgctgtt gtcaatgatg 780 atcgctttcg tctagaatac cgtgtatatg ttccagcaag agttgtagaa atcgatggtg 840 tagttaccag tgacagtgac ctgtctccaa atgattttaa ggaagctgtg ggtgtgttca 900 aaaacccaaa tataccccca gtaaaaatta tagattgtca acaactgaag aaggtaatta 960 aaaatggatc taactcacag tatgtaccat caggatcatt tcgaattact tttgaaggtt 1020 ctgcgcttcc acactatgtt accatgaaca aaatcttacg tttacgtgtt cgattgtatg 1080 ttccaaaggt tatgagttgc acaaattgta aacagcttgg acatacaaaa agcttttgtt 1140 caaataaagc aaaatgtctc aagtgtgagg gccaacatga agaagatgct tgccagaaaa 1200 atgtagagaa atgtgttttg tgcggtctag atcctcatga taaaaaagat tgccctaaat 1260 ttcaaaatta ctcatcaaag ctaaagtact ctgtcaagga acgttctaag cgatcctatg 1320 cggaattggt aaaaaaaacc ttagtttcaa atatgttttt ggaactatca gattctgatg 1380 atgactctga ttcagatact gaatatgatt ccttaccatc gcagaagggc aaaagaggta 1440 gaaagcatat tccctcctct gttatgtcaa agaacaagca aaaagtagct aggattgaga 1500 aagttcatac tccagaaaaa tacaatcttg ttcctcgaga ttctgttgaa ttcaacaaaa 1560 gcttcccatt actttctggt aatgctcatc gtgatacatc ctcaactgaa ccaaaatttc 1620 aatcacagaa agaaaatact atgttcactg gttttttgaa actttcaact attattcagt 1680 gggttttgga tattttaaaa gcaccaaaac ctattcaaga tttattgttt ggatgtatac 1740 cgatgatagc taactttggt aaacagatga ctattcagtg gcccatttta tctttaattg 1800 atttcgatgg ctagatacgc aaatcaattg actgatcacg tttcagtatt acaatggaat 1860 tgtaggagtg ttatggctaa tatcaactca ttgaatttat taatatcaga aactaactgt 1920 gatatttttg cgttatgtga aacatggttg attcctgaat tgacggttga tttgattgat 1980 ttcaatatta ttagacagga ccgaagcgat ggctacggag gggttttaat tggtattaag 2040 aaatgtcatt ccttttatag aattggtgtt tctaataaat ccaaaataga agttatagca 2100 tgtcatgtga atatgaaagg taaggamttt tctctagttt ccttttatat acctccaaaa 2160 gtaactttta gctacaatga tcttgaaaca atacttgata ccattcctgc gccttgggta 2220 atcgttggtg attttaattg tcatagtact gcatggggtt ctgaccacga tgagaaaaga 2280 gcgatgccaa tccatgacat gttagattgt tacaatttat gtattctgaa tacaggaaat 2340 gtaacaagaa ttgcaggtcc acaaaaaaag gatgctgttt tagatttatc attctgttca 2400 tcttcaatgg tttatgattg ttcatggata gtaattgaca atccgcatgg aagtgatcat 2460 ctaccaatcg ttatttcaat tcataatgat aatcaaagaa tgagggaata tgtgaacatt 2520 gaatatgatt taactaaaaa cattgactgg aaaaaatata gttcgataat ttcagaggct 2580 cttgagaatc tggcagattt atgtttagag gatgaatatg attttattgt gaatttgatt 2640 atagatagtg ctattaaagc tcaaaataaa ccgattccaa ataaaaaaat agtaaatcgc 2700 ataccaaatc catggtggga taaagaatgc aatactgttg taaaagaaaa atctgctgct 2760 tttaaagttt atcgaaaagt gcgttcgaga caatcttatg agaaatatat ttcacttgaa 2820 cgtaagttta aaaatttagt gagagcaaag aaaaaatcat attggagaaa gtttgtcgga 2880 gaattatcac atgagacctc tatgactgct ctttggaaag ctgctaaaaa tatgagaaat 2940 aaatcttcag ttaatgagga taaagagtat tcacatgaat ggatcaaaaa gtttgctaaa 3000 aaaatttgcc cggatattgt ttcagaaaac caaatttgtt ttgatacttc tgaggatagt 3060 gtaccaatgt tcacaatgat tgagttttca ttagctcttc tatcagggag gaacaattct 3120 ccagggaaag atggtataaa atttgatcta ttaaaacatc ttccagatgt agctaaaaaa 3180 agattattga atttatttaa ttggttccta gaccataatg ttattccaaa tgattggaga 3240 aaagtcaagg taattgcggt ccaaaagccg ggtaagccac cttctgatca caactcttat 3300 cgtcccattt ctatgttgtc atgtatacga aaattaatgg aaaaaatgat tctgaaccgt 3360 ttagacaatt gggctgaatc aaataagttg atttcagaaa cacaatttgg ttttcgtaaa 3420 ggaaaaggga ctaccgattg tcttggtttg ttgacaactg aaattgatat agcgtttggt 3480 agtaaacaac aaatgacctc tgtatttcta gatataaaag gggcctttga ttcagtttcc 3540 cttggcattc ttactcaaaa attgcatagg aaaggacttt ctccaaaact caataatttt 3600 gtgaataatt tattaaggga aaagcacatg tatttttcta gtggctcaat ctcagaatac 3660 cgagtaagtt atatgggttt gcctcagggc tcttgcctga gtccacttct gtataacata 3720 tatgtgagtg agattgataa ttgcttggct gaaggatgta ctctcagaca attagctgat 3780 gattgtgtta tttctgtaat aggtaaagat agagaagata tgcaagctcc gcttcaagct 3840 acattagata gtttgtcttc atgggcaact gatttaggaa tcgaattttc accagaaaag 3900 actgaaatgg ttgttttttc taaaaaacat catccagcaa aatttgaact acagctatca 3960 gggaaaaaaa ttaatcaatc tttatcctac aaatacttgg gtatttggtt agattccaag 4020 tgtacatgga aaaaacatgt tcaatatata aaatctaagt gtgagcaaag aataaatttt 4080 atcaggtcca ttactggaag ttggtgggga gctcatccag aagacttaat taggttgtac 4140 aagacaacgg tacgatctgt gatagagtat ggaagttttt gcttttatcc agctgcacgt 4200 acacatatcc ttgttctaga aaggattcag tatcgatgtc ttcgtttagc attgggatct 4260 atgaattcta ctcatactaa aagtttggaa gtcctagcgg gtgttcctcc tttaatgatt 4320 aggttctatg agcttgggag tcggtttcta attcgcagtg aaatcatgaa tcctattatc 4380 attgcaaact ttgaaaaact tctgcaatta atggtacagt ccagacgaat gccactttat 4440 tattggcaca tgtctgaaga tatttcacct gctataatag attcagaaca gctcaatatg 4500 aatttccttc agttaaacaa cgataaagtt acatatgaca tttctatgaa acaagaaata 4560 aataatattc cagattactt gagatccaaa ataatacctt caatattttc atcgaaatat 4620 ggaagacttt cttctgacaa atgtttttac actgatggct ctagcataga taatagtgta 4680 ggttacggtg ttttcaatga aaacgtttct acatatcatc agcttaagaa tccatgttca 4740 gtatttgttg cggagttggc agcgatatat aatgctttaa ttatcatcct taatttatcg 4800 gttggcagct attacatatt ttccgacagt cttagctcta ttgaagcttt aaaatcaata 4860 aaaccacaaa atgcaacata ttttattatt gaaataagaa aactattgaa tacattaaca 4920 atgcgctctt tcagcatcac tttcatatgg ataccatcgc attgcctcat caatggtaat 4980 gaaaaggcag atattttagc caaaaaaggt gccatagtag gtgaaatttt tgaaagaaaa 5040 attgatttta gggagttttt tcaaacatgt cgtcggaaag caattgctga atggcaaaga 5100 tcttgggatg aaggatctat gggacgttac ttatacggtc ttgcaccaaa aatatcaaaa 5160 aaatcatggt ttaaagatta taatctgact agggacttta ttaaggttat ttctagacta 5220 atgtcgaatc attatagatc taaggctcac cttaagcgta ttggtttaag tgatactaat 5280 cattgtgact gtggtgaagg atttgaagat atagaacacg tcgtttgggt ttgttctaaa 5340 tatgaagtaa aaagacaaga attgattgat tcccttcctg ttttatggaa accacccaac 5400 gcaacagtcc gcgatgctct gttttctcgc aatcccaatt tcctaaatcc tatatataaa 5460 tttctaaaag ctattgactt tcctatctaa tttttgtctt taacagtgaa ttactattac 5520 ctgcgacttg agtgattgac ggatgtctta agccacatat caaaccgatt ccgtttcctc 5580 aaattttcct caagccacca ttttgaactg cacgacgcct ccaataatac cattgagctg 5640 atcatcacca tctcatctga gcaacttcac aaatccattg agaagatgga gtgacaagag 5700 aaccgtccta cgatgcatcc caagccccga gtttagaact tctaacttgt cgtgaaagta 5760 tgacaagatg tagaagctaa acaaaattaa tacaaacatt gtatataacg ttattgaatc 5820 ggcccctaat actttatggt aagggcctaa ataaagatag gtttgagggt tttatgccta 5880 taggagaaga gatttttttt tcactcctac aggctttccc tcattccaaa aaaaaaaaaa 5940 aaaaa 5945 // ID R2_AM repbase; DNA; INV; 4277 BP. XX AC AF015815; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Anurida maritima retrotransposon R2, complete sequence. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_AM. XX OS Anurida maritima OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Collembola; OC Poduromorpha; Poduroidea; Neanuridae; Pseudachorutinae; Anurida. XX RN [1] RP 1-4277 RA Burke D.W., Malik S.H., Lathe C.W. and Eickbush H.T.; RT "Are retrotransposons long-term hitchhikers?."; RL Nature 392(6672), 141-142 (1998). XX RN [2] RP 1-4277 RA Burke D.W., Malik S.H., Jones P.J. and Eickbush H.T.; RT "The domain structure and retrotransposition mechanism of R2 RT elements are conserved throughout arthropods."; RL Mol. Biol. Evol 16(4), 502-511 (1999). XX RN [3] RP 1-4277 RA Burke D.W. and Eickbush H.T.; RT "R2_AM."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX RN [4] RP 1-4277 RA Burke D.W. and Eickbush H.T.; RT "R2_AM."; RL Direct Submission to Genbank (09-SEP-1998)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015815; Positions 1 4277. XX SQ Sequence 4277 BP; 1116 A; 919 C; 1195 G; 1047 T; 0 other; tcagttggca cttctatatt agttgtgtac ccaagagata gttactgtgt aggacatagg 60 gggttctgct taccctctat gttgctactt acggccaatc atatggtcta gctgtacgtc 120 gccttgtcgt ggtcccgccg ctcaagatcc ttcccatggg atcttgctaa agacgtacgt 180 acggtgaacg gggttacatg ttgaacattg tgatcgttca acaatcccca aaaaggcgaa 240 agctaaagcg cttgggtgac ttagtacgtt aaattggaag tttggcacta tctccattag 300 gacctcacgg aacatttaat cgaaaagtca catggcggcc taaaagcccc accaatgcgc 360 tacgccactg gtgatcctcc agaaatgggt aaggattagt acggctacag tgggaccccc 420 cagatctccg gatctgactg cattagatgg agaatgatgc agtgaccgag tacgttgaat 480 ctttgccttg gtctacggat aacgtacttt aggttggggt acagggtgaa gtaacctccc 540 tccgtctctt gtgccaacaa gattcggagg tttatgcaac cacaatgaac aggaaaaata 600 attacacccg tggtaaccgg ttcagttccg gaagtcccgg caattttgtg gttgtaaggg 660 atcccgcaca ggatctcccg ttcaaatgtg cattttgcga acggacattt acaacgtcca 720 atgggaaggg actccatgaa ctgagaagtc acccgaagga gtataacatg cgtgttccgg 780 ttgcgaaaaa gcgggcgagg tggtcggagg aggagttaag tcagcttgct gaagctgaac 840 atgacctcaa aagtaagaag caatatgctt ccgaactcga cctaagccgg gatctggagg 900 gttgtatggt ggggagatct ctggaatcca tcaggggaca gaggaagctc ccacggtatg 960 ttgagatctt taaccgactg tctaatagcc gagcatcaat cccagaaatt ggtgaaggtg 1020 acgaggaagc taattcagtc ggttccgatg aggtgtttca tgcgtccggt catgggacag 1080 ggattaccga ggcactagag gtgctggttt cgaaacggcc tggtgaggcg tttcgggagg 1140 aagtcctgaa tggaatagta cgagcgaaac ttgagggtag cgaagtcttt gtacgacttg 1200 agcaatattt gtcccggatg tttgtgggac aaagctcgtt ggcttctaca ccttgtagtt 1260 cgggtgaggg gaaattgccc ttgacatgtt ctggctccaa caaaaaggaa cagaggtctg 1320 atctgttctc ccggttccct gaaaagggtc cgagtgtatc agtttcagcg ccctctgtct 1380 ccgacatggg acgtaaacgc cgcaaggcgt tatctcgtaa cgaggcaact caagtacgag 1440 aggaatttgt tgagtttgcc agaaatgtcg atccaatacc ccgacgcaag tgtgctagag 1500 ttaacgaagg cctacaaggt aaccagaggt tacctaaaga aaagccgatg accgctaggg 1560 caaaaatgcg gcaccttcgg cttctaagat accgtaggct gcaagagttg tacaaaaaag 1620 accgttccct cgcagcaaaa caagtgctgc aggacatgct tgattccaaa ccaggacgga 1680 atcccgaagc ggttaagtac tgggctgaaa ccatgggtaa agaatccacc ggtattgatg 1740 tgtccgtaat gaccggacgt ccgagatacc gagacaatgt atggagtcca atatacccgg 1800 gggaggtttc agcagctgta aagctgatgg atagtagcgg ggctacaggt cccgatggtt 1860 tcagtgtgag atctctgaag tgtacaccca gcagggttct ggcgaaggtc tttaaccttt 1920 ttcttttgga ggaaaagctt ccggcattcc tgatgacctc aagaacggtc ttagtcccga 1980 aagttaagga gcctaaggct cctacggact acagaccaat ctcagtttcg tccaccttgg 2040 tccggctttt tcacaaaatc ctggcgagaa gactaacctt ggcgagcggg ttagacagca 2100 ggcaacgcgg cttcgttcct gttgatggat gtgccgagaa cctcgtggtt ctcgaaagtg 2160 ccatcaggag cgcgaagaac tataagcgtt ctctattcgt agcttctatg gatattaaga 2220 atgcttttgg ttcggtagcg catgaagcca ttttcgaagc gttgtctaag tcgggtgcgc 2280 ctgactcttt tgtgacatat gtgcgaaact gttacgacgg tttcgcgagt gttgtaaaac 2340 ttggtcggga cactgcccaa actactgtga gacaaggcgt cttacagggt gaccccttgt 2400 cacccattct gtttaatctg gttatagatc agataataag gtctcttcct gagacggtgg 2460 gtgttcaact cgatgcgaac acgaaactta actctatggc atttgctgat gacctaatcc 2520 tgctgagttc atcggaggcg ggtatgaggc ggatgttggg agtcctggct ggagtgagtt 2580 caaaatttgg acttattttc catccaggta agtgcaagta cctagctatg atatgggcgg 2640 ggaaacagaa aaagatgaag atcgcaacgg acttgagctt tgagataggt ggcggattta 2700 tgacgccggt gggagtaact gagacatgga agtatttggg ggcttatttg ggacagattg 2760 ggattcaacc agcaaggctc tctttgcaga cttttctgga gagaatcgct aagtccccgt 2820 tgaagccgca acagaagttg tacctaattc gggtccattt acttcctaag ctcatatacc 2880 cactggtaat ggcgcctatt agggcctcaa tgctcaataa gttggatcga atggttcgtg 2940 ttgccctaac tggcaaggat ggtattctgc acttacccca gtcagtgcct tctgccttct 3000 tctatgcgcc gataggtgag ggtggtcttg ggctaatgga gttgaggaca tctataccag 3060 caatggtaaa ggcgagattc gagagaatga tgaactccac ttgtcatcat gtgagagctg 3120 cagctaaggg cgctgcaaac tcgaacagaa tcgctttggc aaatcgattc cttcggaaga 3180 ccgcagatgg tattcccgtg actagtgcaa agctggttaa ggagtaccag gcggcaaagt 3240 tgcacggctc atttgatggg aaaccattga gtgaggctgg gcgggttaaa ggaatccact 3300 catggacatg tgatggtagg atggtgatga caggacaggc tttctgtgag gcccttaaga 3360 tccgcatcaa tgctcttcca tgtctctcta ggtataatcg gggaaccgag aagcctagag 3420 aatgtagggc gggatgcaaa accacagagt ccctgaacca tgtcttacaa gtttgtccga 3480 gaacacatga catgagagtt gcgcgacatg acaagctagt caatagacta ggtggatatt 3540 tgagccagaa agggttcgaa atccacacgg aaccaagaat cattacgtct cttggattga 3600 ggaagcctga tatcattgcg atcaaaggag agaagggagt ggtacttgat gcgcaaatcg 3660 gaggagctgc aaatttaaat gcagctcatg acgctaaaat gtgttactac tcctcctctc 3720 cggagatcaa agaatgggta acgggaaagg gagcgccgga cgtgtcctac ggcgcatgca 3780 ttgtttcgcc acaaggcata atgtcggagg agtcctggaa aaccctgcgg gggttaggat 3840 tctcaaaggg gatgctgaac tccttggtcg tgactgtcat ggagcaaagc acatatgttt 3900 ggcatgtgtt caaccggagc actgcctcat atgggtggaa gcgcaggcga aagcgcaagt 3960 gggattagct cgtttttagc aggaacggcg aagcctgaag gcaaatgttt gtttgagcgg 4020 gaccctccct atgaagctct aaacaagctg atggccttgg aggcacgggg tctcgagcaa 4080 tcacgtgttg atgtgaagtg gtgatcacta ccaccgctgg ggccaattag atgaacatgt 4140 aagacccaac tggtaaggat tgaaagtagg agtcgacgta cagactctga atgtagggca 4200 taacgtcaat ttgcctgatc gctggtgaat ctcgcttgcg agcggtgtga aatgtgccac 4260 cttatcagtg gctatag 4277 // ID Rehavkus-2_HR repbase; DNA; INV; 5880 BP. XX AC . XX DT 31-MAR-2008 (Rel. 13.03, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed Rehavkus-2_HR DNA transposon - a DE conceptual consensus. XX KW MuDR; DNA transposon; Transposable Element; Rehavkus group; KW Rehavkus-2_HR. XX NM Rehavkus-2_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-5880 RA Kapitonov V.V. and Jurka J.; RT "Rehavkus DNA transposons from the leech genome."; RL Repbase Reports 8(3), 377-377 (2008). XX DR [1] (Consensus) XX CC Rehavkus-2_HR belongs to the Rehavkus group of the MuDR CC superfamily of "cut and paste" DNA transposons. Transposons from CC this group are widespread in different metazoa, including CC insects, sea squirts, sea urchin and fish. The genome harbors CC only three copies of this transposon, which are ~5% divergent CC from each other. One copy contains both termini and is flanked by CC the AGGGATAGG 9-bp target site duplication. Its 5'-terminal A is CC deleted; however, the T-3' terminus is intact. The The 822-kb CC inverted termini are composed of a 335-bp terminal inverted CC repeat and 178-bp subterminal minisatellite-like units. This CC transposon encodes a 1097-aa Rehavkus-2_HR transposase (pos. CC 1198-4488, contains 3 stop codons), which is composed of the CC transposase core, C48 cysteine protease, and PHD finger. This CC transposon contains an insertion of another transposable element CC (pos. 4680-4805). CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX SQ Sequence 5880 BP; 2147 A; 898 C; 962 G; 1873 T; 0 other; actggtctga atctatccga cgcaagataa cattgaataa agtctaacaa atatatggaa 60 attctatcag acgcaatgta tatcatatat actatatata tatactataa aatattgatt 120 agcgctttac tttgcaaatt aataatctaa tttagactta atatatgcta tttaatttct 180 aataatttta atatattttg aaatttttgg taatcatttt taattgttgt tcataataaa 240 tttttcactt aattttcaac aatattggac atggaattga ttagcgcttt actttccgaa 300 ttaataatct taagatacat atatatacaa gtacaagaac aaagttaact caaaatatag 360 taaaataata aatagaaata tctttattct agacatatac aaaaatacat tttcattaaa 420 atatctcggt catattgaca cactgatttt taacagaatt atttgtaata ttttcgatca 480 ataattgggc atgaggctga tcatcgccac agccgacaga acaaaattat ctcaaaatct 540 agtaaaataa taaatatatc tttattctag acatatacaa gaatacattt tcattaacat 600 atctcggtta tattgacaca ctgattttta acagaattat ttgtaatatt ttgaatcaat 660 aataggacat gaggcagatc atcgccacag ccgacagaac aaaattatct caaaatctag 720 taaaataata aatatatatt tattatagac atatacaaga atacattttt attaaaatat 780 ctcggttata ttgacacacg gatttttaac agaattattt gttaaacttt gatcgttaaa 840 actttatcga caaatgacaa agaacttttt tgcatatgac aaagaacttt attctcaggt 900 ttttatataa atacattact ttttcattat tactattatt attatttgaa taaagctatt 960 gcatagttgt ttaactactt gaaaattcat taaatctacg aaatcgatca aaaaatgaaa 1020 gggggaaagg aaaagaatac cgtttatttt aaagggttta aaaacgttag taaaaatttt 1080 attaaaattt acattatttt ataaaaaaaa attgatacca taatgcttta ataaacaatt 1140 taaataaatt agtcaaccag gaagatagaa ttgcagaaag tggaagagac aataaacatg 1200 gttttattag atacagcact ttcaactgag aagaacaacc gtgtttggga ggaatgttgt 1260 ttgaggctag caattgctcg aactaaaaaa aactgtacac atttgtacca ttgttgcaga 1320 tataaattat gctcaaaaaa aatagaatac gttaatgtag aagacaaagt gggtaagaca 1380 ttaatgaaca acgtggttca aacaactaca attgataatg tcttgacgtc taatcttatc 1440 cagaatgaag acgatataat tttgaaatgc agtgcagatt taatatatga acaagtcgaa 1500 ctatcaacag tttctataac accaactatt actgttggtg aagaaattgt tccctcgcat 1560 tctcaagtga ctgctttggt agaagacaac atggtttttg aaatgaatgt tgatcggcat 1620 gagtcattga atgtttacga tgtcgaacct gccgcattag ataatgatac ctgcagcgat 1680 agtgattcgg catctttcac agaccaggaa ctagatgcca acaacatatg gaccaagaat 1740 ttacaaaaaa cggaattttc gtttagactt gaagttggtg aatggcaact actacggcaa 1800 tcacaacaaa atcatttgtt taaaaattta gactggatta atataatgac aagaaatttg 1860 aaaaaattta atccatattg ctgcttatgc tttttgtggc atagggttaa gaaagtggat 1920 tctattaggc gcgaaggtta tctttttaat gctcagggat attgtaaatt cacggggtgt 1980 cccataacat tcaaaataat tatttctgac gaagatcttc ttgatgctca tgtgaaattt 2040 tctggcataa cttgcataca tagattgtcg gaaataatgg ctagaccggt tagggcgagt 2100 aaaagagggg aaattgcttc ttcgatgaaa aatattattc ctcgccaaaa acacctagac 2160 tgcattaaag agttgtgccc tgaagtacga gcagctggca acagagatga tgctccttct 2220 gtagcggtcc tttcgcaaat tgtatgccag tacaacaaaa cgaaacgaaa gcatccaaac 2280 gaaatggaaa gtctgagatt attgaaggaa gattttgaaa aacaacacca aagctatgcc 2340 tgtctgcaat atattttcac tgaaccaccc agtgtcatgc tctggtccaa agaaagttta 2400 aaaattttac gccaacgatg catcgatgat gtggtgtaca tagacgcaac aggaagtatt 2460 gtaaagaaaa ggaaacatcc attttatatc tatgaaatgg ttgtcagaag ttcgatgaaa 2520 ggaggatcac cgtttcctgt cgctacatac ctgacaaacc ttcattcaac tgcctcaatt 2580 ttgtcatttt taaacttcct gtacagcgac tactgtaaat tgtttgggaa aagtagtagg 2640 gtaagcccgc ggtattttat ttgtgatggg tccatcgtct tgctgcaagc attggctcgg 2700 acattctgct acgagctact ggaatcagtt atggaaagat actacaggat agtggttggt 2760 cacggaaaca ctaaggactt ccaaacacca atcatacaca gatgtctgag ccacattatg 2820 aagaacgcaa aggtattatg tttaaagaac attccaaaaa attttacctt tgcgatgcac 2880 ctcatcggac gagtggctaa tgtcacaatg attaacgaac tgaacagtat cttaatatca 2940 atggaaggag tttttgggag tgagtttgaa ggaccacagg tgacacaata tctgcttcaa 3000 ctccaacaag ccataaatga taacgaattt ggtttagagg aaaacggagc aacagcaact 3060 acagaattgg cggaaacgga agatgtaagt ttacaattgt attaatttgt taaatttatg 3120 tcattatatt tttctaacat ataatttatt taactctttt taaaggacgt ttgccagcct 3180 ggatctttta tgaaagcatg cgaaaacaat atggctcttg ctaaaataac agacaccgga 3240 attaaaagaa atatgtatta ttccccagaa tttttatcat catttaagaa agatcttttg 3300 cccagtgctc ttctttggac aaatttacta accggtaatc ttgggagaca tggcacatca 3360 gataattata aaaatcaaaa cgctatatat ttagaattga ccctaaaaga tactcaaaac 3420 attacaaaag actatacgac acagggcatt atggaaaaga gccaatggga tttaaagcac 3480 ataagaatga cgagacagtt aaatagaata gatgattttg ttatcacata ccaggaaatc 3540 cacctagcga tgatcacaga atttgctgat aatcaaaaaa gactgtttaa aaaacgaaaa 3600 atagatctgg aagtttggaa aaaaaggaaa cagaggaaaa aggaaattta tgtctctaac 3660 ccaaaaaaac ccttaatttt aattaaagac aagattcagg aaaaccagtc acttgtcact 3720 aaaccagtta ctcatataca gcaacaacca ataattttaa attcattttg gtcagatcta 3780 caaggcacta gttggctgtc atcagatcac atcaatctcg ccatgtttat tttaagtttg 3840 caataccccc acattgatgg gctatctgac actgactttc tcacacatat agggttaaac 3900 agcaacttac ccacgccaca tggaaaattt atacaaatat taaacaaagg gggaaaccat 3960 tggattaccg tcagcaactt tttttctgaa caaaatataa ttgacatata tgacagtaga 4020 cccacaaaaa tgttgccaga ggatgaacgg gtgatttgtt ccattatgaa gaatttcatt 4080 cttccagaac aaaagaatgt gatcatgcat ctacacaaag ttcaggaaca aatcggctcg 4140 aatgattgcg gggccctagc aatagctttt gcatcatcat tagcttgtgg gctaaatccc 4200 acatgttgtt catacaaaca agaaaactta cgtgcagagt tggcaaaagt attcacctcc 4260 aaattttttc atccgttaga ggagaagatt accaacagaa agagtaagcc aaaaaacata 4320 gagataaatc tgtactgcac ctgccgtttg aaccacattc ctcacaagac cgatgttaag 4380 gagaacatga tcacttgtga tcgatgtgga gaatggtttc atagaggttg tcatcaaatt 4440 ccacaatcag cctttgattt aaaatcctac atttgcgaaa aatgcaatta aattaaagcg 4500 ttagtgttag tattccaccc tttaattgtt tgttttattt tataattatt tttttagtcc 4560 gcggctagag tgcgatgtca tcaattgata aattgatagg tttgttgttt gttgtctcca 4620 gcttaaatct aatctatttt ttgacatttt atacagcgcc aaatacgtgt cgctcaacaa 4680 tgttcctaga tgaacctctc aaacgcgagt gatttggaag aagtaagaga agaaggcaga 4740 cagttccaga taagagggcc ggaaaaggca aaagcagact tggcaaaaga atgtctaaca 4800 gataagacaa agaaattatt agaaaaagag gactggagat cagctcacat tggttttatt 4860 ctgttaaaaa tcagtgtgtc aatataaccg agatatttta atgaaaatgt attcttgtat 4920 atgtctagaa taaagatata tttattattt tactatattt tgagttaact ttgttctgtc 4980 ggctgtggcg atgatcagcc tcatgtccaa ttattgatca aaaatattac aaataattct 5040 gttaaaaatc cgtgtgtcaa tctaaccaag atattttaat gaaaatgtat tcttgtatat 5100 gtctagaata aatatatatt tattatttta ctagattttg agataatttt gttctgtcgg 5160 ctgtggcaat gatctgcctc atgcccaatt attgatcgaa aatattacaa ataattctgt 5220 taaaaatcag tgtgtcaata taacagagat attttaatga aaatgtattc ttttatatgt 5280 ctagaataaa gatatattta ttattttact atattttgag ttaattttgt tctgtcggct 5340 gtggcgatga tcagcctcat gtccaattat tgatccaaaa tattacaaat gatcctgtta 5400 aaaatccgtg tgtcaatata accgagatat tttaatataa atgtattcgt atgtatgtct 5460 agaataaaga tatttctatt attagattat tacaaagcaa tatacaattt ttgtatatat 5520 atgtatctta agattattaa ttcggaaagt aaagcgctaa ttcaatttca tgtccaaaat 5580 tgttgaaaat tgagtgaaaa atttattatg aacaacaatt aaaaatgatt accaaaaatt 5640 tcaaaatata ttaaaattat tagaaattaa atagtatata taaagtctaa attaaattat 5700 taatttgcaa agtaaagcgc taatcaattt attctatata tatatatata tataattcta 5760 tatatatata tatatatata tatatatagt atatttgata tacattgcgt ctgatagaat 5820 ttccatatat ttgttaggct ttattcaatg ttatcttgcg tcggatagat tcagaccagt 5880 // ID Sola1-N4_AAe repbase; DNA; INV; 1247 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1247 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1294-1294 (2011). XX DR [2] (Consensus) XX CC ~94% identical to consensus. 4-bp TSDs. TIRs are 27 bp long. CC Both termini are 70-80% identical to those of Sola1-3_AA. XX SQ Sequence 1247 BP; 395 A; 227 C; 238 G; 387 T; 0 other; ctgcccatac tcgcaataca gtcccatcag gaaaatcatc atgtcgagaa aaacgcaatt 60 gaacattatt gcaaaggttt tcgtaacgcc gctgcaggca gcagaaaatt ctagtggcat 120 tactcaacac attatcatta gagtacaaga aaatgaacaa ctactaaaat attgtgaatg 180 cggttgtttt ttatatacaa atccaagata cttgccaatg ggactcgtta tccgagtact 240 ttcctcttca aattcaaata tacccgcata ccagtcccat tacttgggac tcgcttactt 300 agttatcgcg catcttgaca catttattgc caagtttatc atggattcca agggactttt 360 ccttttgcta ctatagtgct ttcatatttt aattgcgata cttattatac aatataaaat 420 aatatggtgg tgatagtgac agtcctcaag acattcctgg gtttgatgaa tgcgttgcgg 480 ccggccatat aggaaaagta gttaggattc acctgcagca tgttgtagta tctgacacac 540 aggtactaag gcgggtaagt tttgattagc ctgatttcga aaaagctgtt ggacatttct 600 tagatatcca taatgctggc ttttcattgg accttcttaa ttatcaccat tagtgtcgct 660 aaacgtactg cttgaccagt cgacgatacc agtggtggta tgctaattcc catgttagcc 720 caaaaagcct cccctccatg tcgttagaaa gatgaaccta ccggagattt aggtaattgg 780 tcagaagaca aaagttatga cagctatagg tttgacttag agggaagcta tttatcttat 840 ttttaattta gactctcata aggactagat atgtttcaat cactcatcgc accagaagca 900 ttctagacaa aaatgaaaaa caacatttgt tatggagtgc agatagttta aaacactgta 960 atttgtagca tttctttttc ttatgggatg aaaactacga tgcaagatta cagaaatcat 1020 aataggaata agggtacatg aatttcgtcg taatctacta gtgggactgt tatgcgggta 1080 ctttcaatat gggacgcaca tgatttggta tttttttcgc attttgtcca tacaaaaata 1140 caattttaag tatgatacca tgaagtacac taacaataaa catgattcat gaagaaaact 1200 caaaagtgat gaaaattgaa atgggactgt tttgcgagta tgggcag 1247 // ID Gypsy-13_SI-LTR repbase; DNA; INV; 222 BP. XX AC AEAQ01024017; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_SI_; KW Gypsy-13_SI-I; Gypsy-13_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024017; Positions 390 169. XX SQ Sequence 222 BP; 56 A; 49 C; 33 G; 84 T; 0 other; tgtgatatta tgcgctagta attttattat agttatttta gttagtccct atgcagcgtt 60 tggcgcctat gtgttcccgc caagaaccga ctctcttttt cgttgctact aaccaacgtg 120 cgactgctcc cttgcgttac taatattcat ttggcatttt cagtataata aatatagttt 180 cgttcacata tattacttat ttaccaccca taagacatca ca 222 // ID L2B-5_AAe repbase; DNA; INV; 4567 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4567 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1410-1410 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >97% CC identity. Closely related to CR1-1_AG and L2B-1_CP. XX FH Key Location/Qualifiers FT CDS 290..1687 FT /product="L2B-5_AAe_1p" FT /translation="MSIQCTVCAKVINAVNDRVFCFGGCGQVLHAKCADLT FT NVQATSLRENVSLKYMCHDCRKKQVCLNTMSGKCEEILRAINEIKIRVERI FT ESDCERNXVCDAIKKSEESLKVFIQEXIRVQFTNLVSSEDKCLRAENQDAI FT ISDVTTPGEGLAGSRGTSLISYASXAKRNRRISANESVLRSGRVRQRSDIS FT TPKTGGKGHEQMNNAENVQSDARGKSSQKTLDCMVRIKPAVQQTNQQTKKE FT VRNKINPSQMGIKSVRNGMNGSIVVECGTKNEAEGLLDKVRNELGENYMAA FT IEQPKRPRFKIIGVEDEYDSDELRNIFKNQNNIENVQFLKILKTLKHRRGV FT YTEFTLICESDSSTFEQIMRKGRLYIDFDSCRVTESVDIFRCFKCCGYAHK FT SQDCKNGLHCARCAGNHDIKECSSEQEKCINCAVSNKDRKTRLDVNHTAWS FT SECPIYLRRLRLSKQYIDYNK" FT CDS 1691..4498 FT /product="L2B-5_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSKDVADIILLNIAGITTHFDELELLVDSKKPKLVML FT TETHLTTDIDIAAFSIRNYKICCCYSNSRHTGGVMMYIHDSIKFHEINNLV FT KGMNWFLAVKIVKGLKVGVYGLLYRSPNGNKNEFLEYLELNWLENTLDDNQ FT MNLIAGDFNINWKDPSDSESLRNITQCFNLNQRVEDATRRTRQSETMIDLV FT FCNNDDVKVSIEENSKISDHETLRIDLGQNDGCVEDVLTIKCWRNYSKDAL FT LALLRNKMSHESARLLDEKADVLATVLKESVNELVIVRKIKCRSRKKWYTI FT ELKKLKHSRDVAYKIASQSWDDVEWQNYTILRNEYSNSLRKAKAEYTQKQI FT NLNRGNSKQLWKTLKKLWKNKENSANCISFNGSSVEDGNEICNKFNSYFVD FT SVQEINSSIENVPDTYNDQNEHLSDQLSVFRQISLDDLRKVVYSIGNSSGV FT ENVNLQVLKDSFEVTGEYLLNIVNESLEIGKCPQNWKQSVVVPIPKIPGTT FT KAEEYRPINMLPIYEKVLEVIVKNQLLDYLKSQKILIEEQSGFRQNHSCES FT ALNLLLYKWKRMIEEKKSIIVLFLDLKRAFETISRPIMLNTLRRYGVAGKV FT LSWFESYLSSRTQICQYNGSNSLPRVVPLGVPQGSVLGPLLFILYINDMKK FT AVKYCDINLFADDTVLFIGDKDPLSAVRKIRADILTLTNWLKFKKLKLNVQ FT KTKLMIITNKKQIELDEIKLEIDGLEIERVNVFKYLGVYIDDRLTFKEHID FT NVVKKVARKYGMLVRLKSQLTFWSKLFLYKSLIAPHIDYCSSVLFLASDIH FT LRRLQKLQNKFMRLILNCDRYTPIQTMLDTLQWQSVKQRIIFNVLLLIYKL FT TNNKLPDYLNDIIVRGRNIHQHRTRRIDDLRVVPFTMSTNQKSIYYNGIRI FT YNQLPLDVKNARSIQEFRRKCSGWVKNSFSIR" XX SQ Sequence 4567 BP; 1706 A; 641 C; 920 G; 1294 T; 6 other; tctactacta tcgcagtgtg cagacgtgaa atwatcagcc gaataaacct caatctaaac 60 ctgtgaaata ttaatcaatg cgtaataaat catacatcat aagacgttgt gagcaaaatc 120 cagttgaatg gtgtacatta cgtcggagta cwcgcaaagt gaagtgaatt atcatcaatt 180 taatttcatc gcggtagtga aattacgatt cgctttcaac tgaaagtgcc tacagaacaa 240 agcggccgtt ggccgtgcaa caatacaata cagagagcac gcatcaaaaa tgtctatcca 300 gtgtactgtt tgtgcgaaag tgataaatgc agtgaatgat cgagttttct gttttggtgg 360 atgcggtcaa gtactacacg cgaaatgtgc ggatttaact aatgtgcaag caacttcttt 420 acgtgaaaat gtttccttga agtatatgtg ccacgactgt agaaaaaaac aagtgtgcct 480 caatacaatg tcggggaaat gtgaagaaat actgcgggca atcaatgaga ttaagatccg 540 tgtggaaaga attgaatcag attgcgaaag aaacawtgtt tgtgacgcaa taaaaaagag 600 cgaggaaagc ttaaaagtgt tcatacaaga amatattaga gtacaattta ctaatttagt 660 gagctcggaa gataaatgtc ttagagccga aaatcaagat gctataattt ctgatgttac 720 cactccgggc gaaggcctag ctggtagtag aggaacgtca ttaattagtt atgcatcgst 780 agcgaaaagg aatagacgaa taagtgcgaa tgaatctgta ctacgttcag gtcgtgttcg 840 acagagaagt gatatttcaa cccccaaaac tggtggcaaa ggtcatgagc aaatgaataa 900 tgcggaaaac gttcaaagtg acgcaagagg aaaaagttcc caaaaaacac tggactgtat 960 ggtgcgaatt aaaccggcgg tacagcaaac taatcagcaa acaaaaaaag aggtaagaaa 1020 taaaattaat ccgtcccaaa tgggtatcaa gagtgtgcga aatggcatga atggatcaat 1080 tgtggttgaa tgtggaacaa aaaatgaagc cgaagggcta ttggacaaag twagaaatga 1140 actgggagaa aactatatgg cagcaattga gcaaccaaaa agaccaaggt ttaaaattat 1200 tggtgttgaa gacgaatatg attctgatga gctgagaaat attttcaaaa atcagaacaa 1260 tattgaaaat gttcaatttt tgaaaatttt gaagacacta aagcatcgta ggggtgttta 1320 tacggaattc acgcttattt gtgaatctga ctcaagtact tttgagcaaa ttatgcgtaa 1380 aggaagactt tacatcgatt ttgatagttg tagagtcaca gaaagtgttg atatatttcg 1440 ttgtttcaaa tgttgtggtt acgctcacaa atcacaagat tgtaaaaatg gtttacattg 1500 cgcaagatgt gccgggaatc atgatataaa agagtgttcg tctgaacaag aaaaatgtat 1560 aaactgcgca gtttcaaata aggacaggaa aacaagacta gatgttaatc atactgcatg 1620 gagctctgag tgtccaatct acttgagaag gttaagacta tcaaaacagt atattgatta 1680 taataaatag caatcaaagg atgtagcgga cattattttg ttgaatattg ctggaataac 1740 tacacatttt gacgaattgg aacttcttgt agattccaag aaaccaaaat tggttatgct 1800 tactgaaaca cacttgacaa ctgatataga tattgcggca tttagcataa gaaattacaa 1860 aatttgttgc tgttattcaa attcgagaca cactggtgga gttatgatgt atattcatga 1920 ctctataaaa tttcatgaaa tcaataattt ggtgaaagga atgaattggt tcttagcagt 1980 taagatagtc aaaggactaa aagtaggagt atatggtttg ctctatcgtt ccccaaatgg 2040 caataaaaat gaatttttag aatatttgga actaaattgg ttagaaaata ctctagacga 2100 taatcagatg aacttgattg cgggagactt caatataaat tggaaggacc caagtgacag 2160 tgagagcctg cgcaacatta cacagtgttt caatttgaat cagagagtag aagatgcaac 2220 aagacgtaca agacagtccg aaacgatgat agatttagtt ttttgtaata atgacgatgt 2280 taaagtttca attgaagaaa acagcaaaat atctgatcat gaaacattgc gaattgatct 2340 tggtcagaat gatggatgtg ttgaagatgt tttgacaata aaatgctgga gaaattattc 2400 taaagatgca ttactagcat tgctgagaaa taaaatgtct catgagagtg caagattgtt 2460 agatgaaaaa gctgacgttt tagctactgt tcttaaggag agcgtgaacg aacttgttat 2520 agtgagaaag attaaatgtc gatcccgaaa gaaatggtat acaatagaac taaaaaaact 2580 gaaacattca agagatgtag cttataaaat agcaagccaa agttgggatg atgttgaatg 2640 gcaaaattat actatactga ggaatgaata ttctaattct cttagaaagg cgaaggcaga 2700 atacactcaa aaacagataa atttaaatag aggcaatagt aaacagctat ggaagacgct 2760 taaaaaatta tggaaaaata aagagaattc agctaattgt ataagtttca atgggtcaag 2820 cgtagaagat ggtaatgaaa tttgcaataa atttaattcg tactttgttg atagcgttca 2880 agaaataaat agcagcatag aaaacgttcc tgatacctat aatgatcaaa atgaacattt 2940 atcagatcaa ttgagtgtat ttcgacagat ttctctggat gatctcagaa aagtagtata 3000 tagcataggt aactcatctg gtgttgaaaa cgtgaatctt caagtactga aggactcgtt 3060 tgaggtaact ggcgaatatt tactaaacat tgttaatgaa agtctcgaaa ttggcaaatg 3120 cccacaaaat tggaaacaat cagttgttgt accaattccg aaaatacctg gaacaactaa 3180 agcggaagaa tatagaccaa ttaatatgct acccatctat gaaaaagtgt tggaagtcat 3240 cgtaaaaaat cagttgttgg attatttgaa aagtcaaaag atattaattg aagaacaatc 3300 tgggtttaga caaaaccact cttgcgaatc agctcttaat ttacttttgt acaaatggaa 3360 aaggatgatc gaagaaaaaa agtcaataat tgtgttgttt ctggatctta aacgagcatt 3420 cgaaacaatt tcacgcccta tcatgttaaa cactttgcgt agatatggtg ttgcaggaaa 3480 agtcctaagt tggtttgaat catatttatc gagtcggaca caaatatgcc aatataacgg 3540 atcaaattca ttgcccagag tagttccttt aggagtacca caaggtagtg tccttggacc 3600 attattattc attctgtata taaatgacat gaaaaaggct gtcaagtatt gtgatattaa 3660 cctgtttgca gacgacactg ttttattcat tggtgacaag gatcctttat cagcagttag 3720 aaaaattaga gcggatattt taacattgac caactggcta aaatttaaaa aattgaaatt 3780 aaatgtgcaa aaaaccaaat tgatgattat aacgaataaa aagcaaatag aacttgacga 3840 aataaaatta gaaatcgacg gattggaaat agaaagagta aatgtattca agtatctagg 3900 tgtttatatt gacgatagac taaccttcaa agaacatatt gataatgtag tgaaaaaagt 3960 ggcaagaaaa tatggtatgt tagtacgtct gaaaagtcag cttacatttt ggagtaaatt 4020 gtttttgtat aagtctttaa ttgcgcctca tatagattac tgttcatctg ttctttttct 4080 ggcgagtgat atacatctga ggagactaca gaaacttcaa aataaattta tgcgattaat 4140 tcttaactgt gatagatata ctccaatcca aacaatgtta gatacactac agtggcaatc 4200 tgttaagcag cgtataatat tcaatgttct tttgctcatt tacaaattga cgaacaataa 4260 gttaccagac tacctcaatg atattattgt aagaggaaga aatatacacc aacatagaac 4320 gagacgaatt gatgatttgc gtgttgtacc cttcacaatg tctacgaacc aaaaatcgat 4380 atactataac ggaatacgaa tttataatca gctaccattg gatgtaaaaa acgcaagatc 4440 gattcaagaa tttagaagaa aatgttctgg atgggtgaag aatagtttta gtataagatg 4500 aagaatgtat gaatttgtaa acttgtaaat tatcataaga taaataaatg aactattatt 4560 attatta 4567 // ID Rehavkus-N1_NV repbase; DNA; INV; 6708 BP. XX AC . XX DT 27-MAY-2008 (Rel. 13.05, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE A family of non-autonomous Rehavkus transposons - a consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Rehavkus; DNA-5-9_NV; former DNA-8-2_NV; KW Rehavkus-N1_NV. XX NM Rehavkus-N1_NV. XX OS Nematostella OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Actiniaria; OC Edwardsiidae. XX RN [1] RP 1-6708 RA Kapitonov V.V. and Jurka J.; RT "Rehavkus transposons in the starlet sea anemone genome."; RL Repbase Reports 8(5), 611-611 (2008). XX DR [1] (Consensus) XX CC Rehavkus-N1_NV is a consensus sequence of a family of CC non-autonomous Rehavkus transposons. The Rehavkus-N1_NV CC transposon is characterized by 3000-bp TIRs and 9-bp target site CC duplications. Classification is based on a structure of termini CC conserved in all known Rehavkuses. A minor fraction of CC Rehavkus-N1_NV elements are flanked by 8-bp TSDs, probably, due CC to mutations. Rehavkus-N1_NV is a composite transposon: it CC harbors an inserted copy of DNA-5-9_NV, which is masked by Ns.The CC 3000-bp TIR is composed of two unique sequences (pos. 1-576 and CC 740-1352) and a 118-bp minisatellite. XX SQ Sequence 6708 BP; 2111 A; 1239 C; 1202 G; 2020 T; 136 other; accatagtga agctggcagg caactcagca ggcaactcgg tgaagttagc aggcaactca 60 aatgtgatga tttgtgtaat atctcggcag gtatatggcc gattgtcacg aaattttgac 120 acgattgtac aaaaagtaca acaaacatat tcatgcatgt atgatatcaa atattattaa 180 tcacgtgact gtgtgacgtt gaacgttcca gcaggcaagt cgttacatca caattgtcgc 240 gatatctacg tcccattttc accgatagtc atcaaacttt aacctaagtg tacaaatgta 300 ataacgaaga caacgccgta ttagttattc tcatgacgtc atcaatcacg tgaccgtggg 360 aaaaaacatt ttctcgagtt cctattgaga aaacattacg aaactttcga aatatgacaa 420 atagcacatg ggattttaca atcatgaact tttaaaatat gacgtcatca atcacgtgac 480 cgtgggaaaa aaaagtcgat atctagagtt ctgactgaaa taacagtgaa aaactttcga 540 agtacgacaa attatgtatg tcgttattaa agcctaactt ttttggaatg ttttcgtgat 600 ttacttttat aaaaacatat agcaggcaac ttgacagtac agcaggtaac tcgtgatgca 660 ctgggaatat cgtattcgtc gttattaaag cttcactttt ttggcatttg taagtgatta 720 actcgaatag aaacatatac tggaaatatc gtattcgtcg ttgttaaatg ttcacttttg 780 gcacttgcac gtgatttaca catcttaacc tgcactgtaa ttcctaccgt ttctctgccg 840 tatctacgcg tattattttc gctcgatatg ggccatcgct taagctcatc gattatcagt 900 cactaaccta gagcgcgaat cctaacaaac tctctctcta cacgacacaa tataaaaacc 960 catactcaag gagccaactg tccaatactt atataaccac tatctatctt ctatttgcca 1020 tcattgattt gtcccctatc tgcacgcttt cagatattat tcctcaccgg tcttgttatt 1080 gacttgattt ttcacaagct cacattacac tcacaggaaa ttatcaacag gtaattatca 1140 ccctcgttcc aaggtcaaat acactgttct gccctaactc caagcccgtt ttctacattt 1200 catttatagc cacaactctt aactaacatc cccaatgcaa tacaccatat ataatagcca 1260 taattgcaca ctttaaccca cgcactaacg tctatcacca tacattttat aaccccacac 1320 agccgcctcg gtacgaatac cagatttaaa tgttcacttt ttggcatttg tacgtaattt 1380 actcgaacaa aaacatataa caggcaactc gacagtacag caggtaactc gttatacact 1440 gggaatattg tattcgttgt tgttacagct tcactttttg gcatttatac gtgatttact 1500 ttaataaaaa catatagcag gcaactcaat agtacagcag ctaactcgtt atgcgctggg 1560 aatatcgtat tcgtcgtcat tcaagcttca cttttttgga atttgtacgt gatttacttt 1620 aataaaaaca tatagcaggc aactcgacag tacagcaggt aactcgttat gcactgggaa 1680 tatcgtattc gtcgttattg aagcttcact ttttcgtatt tgtacgtgat ttactttaat 1740 aaaaacatat agcaggcaac tcgacagtac agcaggtaac tcgttatgcg ctgggaatat 1800 cgtattcgtc gtcattcaag cttcactttt ttggaatttg tacgtgattt actttaataa 1860 aaacatatag caggcaactc gacagtacag caggtaactc gttatgcact gggaatatcg 1920 tattcgccgt tgtttaagct tcactttttt ggtatttgta cgtgatttac tttaataaaa 1980 acatatagca ggcaactcga cagtacagca ggtaactcgt tatgcgctgg gaatatcgta 2040 ttcgtcgtta ttgaagcttc actttttcgt atttgtacat gatttacttt aataaaaaca 2100 tatagcaggc aactcgacag tacagcaggt aactcgttat gcactgggaa tatcgtattc 2160 gtcgtcattc aagcttcact tttttggaat ttgtacgtga tttactttaa taaaaacata 2220 tagcaggcaa ctcgacagta cagcaggtaa ctcgttatgc gctgggaata tcgtattcgt 2280 cgtcattcaa gcttcacttt tttggaattt gtacgtgatt tactttaata aaaacatata 2340 gcaggcaact cgacagtaca gcaggtaact cgttatgcgc tgggaatatc gtattcgtcg 2400 ttgttaaatg ttcacttttt ggtatttata cgtgatttac tttaataaaa acatatagca 2460 ggcaactcga cagtacagca ggtaactcgt tatgcgctgg gaatatcgta ttcgtcgtca 2520 ttcaagcttc acttttttgg aatttgtacg tgatttactt taataaaaac atatagcagg 2580 caactcgaca gtacagcagg taactcgtta tgcgctggga atatcgtatt cgtcgttgtt 2640 aaagcttcac tttttaggta tttrtacgtg atttacttta ataaaaacat atagcaggca 2700 actcgacagt acagcaggta actcgttatg cactgggaat atcgtattcg tcgtcattca 2760 agcttcactt tttggatttg tacgtgattt actttaataa aaacatatag caggcaactc 2820 gacagtacag caggtaactc gttatgcact tggaatatcg tattctccgt tgtttaagct 2880 tcactttttt ggcatttgta cgtgatttac tctgataaaa acatatagca ggcaactcga 2940 cagtacagca ggtaactcgt tatgcgcttg gaatatccag aagatacaca attacaacag 3000 gatttgcgaa aaggaccagc ctcgcccatg ttgtttctca gcccctgaca gccccactaa 3060 caatggattg ttgagaaacc ttaatacgct attaagcaca aacgtgctta atagcgtatt 3120 ttctaaaaaa gaaatgacac aagaattact tctcaagtac tacatgacgc aaacttattt 3180 tgaaatttta aaacaccagt atgaagagaa tacaaaaccc cgtaatcatg ttggaaagta 3240 gatacaatac agcaacactg taccatgtcc gctatatcta ctaatgttaa aattgtgcaa 3300 atacaataga aagaaagcta tagcaaagac aaataataca atattcatca taattnnnnn 3360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3420 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3480 nnnnnnnnnt gtattagaga aaatcactta caaatactac aaagtgagga ttgaactgta 3540 ctgtcgagtt gcctgctata tgtttttatt aaagtaaatc acgtacaaat gccaaaaagt 3600 gaacatttaa taacgacgaa tacgatattc ccagtgcata acgagttacc tgctgtactg 3660 tcgagttgcc tgctatatgt ttttattaaa gtaaatcacg tacaaatgcc aaaaaagtga 3720 agcttgaatg acgacgaata cgatattccc agcgcataac gagttacctg ctgtactgtc 3780 gagttgcctg ctatatgttt ttattaaagt aaatcacgta caaataccaa aaagtgaaca 3840 tttaayaacg acgaatacga tattcccagc gcataacgag ttacctgctg tactgtcgag 3900 ttgcctgcta tatgttttta ttaaagtaaa tcatgtacaa ataccaaaaa gtgaacattt 3960 aacaacgacg aatacgatat tcccagtgca taacgagtta cctgctgtac tgtcgagttg 4020 cctgctatat gtttttatta aagtaaatca cgtacaaata ccaaaaagtg aagcttcaat 4080 aacgacgaat acgatattcc cagcgcataa cgagttacct gctgtactgt cgagttgcct 4140 gctatatgtt tttattaaag taaatcacgt acaaatacca aaaagtgaac atttaataac 4200 gacgaatacg atattcccag tgcataacga gttacctgct gtactgtcga gttgcctgct 4260 atatgttttt attaaagtaa atcacgtaca aataccaaaa aagtgaacat ttaacaacga 4320 cgaatacgat attcccagcg cataacgagt tacctgctgt actgtcgagt tgcctgctat 4380 atgtttttat taaagtaaat cacgtacaaa ttacaaaaaa gtgaacattt aatgacgacg 4440 aatacgatat tcccagcgca taacgagtta cctgctgtac tgtcgagttg cctgctatat 4500 gtttttatta aagtaaatca cgtacaaata ccaaaaagtg aacatttaat aacgacgaat 4560 acgatattcc cagtgcataa cgagttacct gctgtactgt cgagttgcct gctatatgtt 4620 tttattaaag taaatcacgt acaaattcca caaaagtgaa gcttgaatga cgacgaatac 4680 gatattccca gtgcataacg agttacctgc tgtactgtcg agttgcctgc tatatgtttt 4740 tattaaagta aatcacgtat aattactaaa aagtgaacat ttaacaacga cgaatacgat 4800 atttccagtg cataacgagt tacctgctgt actgtcgagt tgcctgctat atgtttttat 4860 taaagtaaat catgtacaaa tacgaaaaaa tgaagctcca ataacgacgg atacgatatt 4920 caaagcgcat aacgagttac ctgctgtact gtcgagttgc ctgctatatg tttttattaa 4980 agtaaatcac gtgcaaattc caaaaaagtg aacattgaac aacgacgaat acgatattcc 5040 cagtgcataa cgagttacct gctgactgtc gagttgcctg ctattgtttt tattaaagta 5100 aatcagtaca aatcaaaaag gaagcttgaa tgacgacgaa tacgatattc ccagcgcata 5160 acgagttacc tgctgtacta tcgagttgcc tgctatatgt ttttattaaa gtaaatcacg 5220 tataaatacc aaaaagtgaa gcttcaacaa cgacgaatac gatattccca gtgcataacg 5280 agttacctgc tgtactgtcg agttgcctgc tatatgtttt tattaaagta aatcacgtac 5340 aaatgccaaa aagtgaacat ttaaatctgg tattcgtacc gaggcggctg tgtggggtta 5400 taaaatgtat ggtgatagac gttagtgcgt gggttaaagt gtgcaattat ggctattata 5460 tatggtgtat tgcattgggg atgttagtta agagttgtgg ctataaatga aatgtagaaa 5520 acgggcttgg agttagggca gaacagtgta tttgaccttg gaacgagggt gataattacc 5580 tgttgataat ttcctgtgag tgtaatgtga gcttgtgaaa aatcaagtca ataacaagac 5640 cggtgaggaa taatatctga aagcgtgcag ataggggaca aatcaatgat ggcaaataga 5700 agatagatag tggttatata agtgttggac agttggctcc ttgagtatgg gtttttatat 5760 tgtgtcgtgt agagagagag tttgttagga ttcgcgctct aggttagtga ctgataatcg 5820 atgagcttaa gcgatggccc atatcgagcg aaaataatac gcgtagatac ggcagagaaa 5880 cggtaggaat tacagtgcag gttaagatgt gtaaatcacg tgcaagtgcc aaaagtgaac 5940 atttaacaac gacgaatacg atatttccag tatatgtttc tattcgagtt aatcacttac 6000 aaatgccaaa aaagtgaagc tttaataacg acgaatacga tattcccagt gcatcacgag 6060 ttacctgctg tactgtcaag ttgcctgcta aaagttttta taaaagtaaa tcacgaaaac 6120 attccaaaaa agttaggctt taataacgac atacataatt tgtcgtattt cgaaagtttt 6180 gcactgttat ttcagtcaga actctagata tcgacttttt ttcccacggt cacgtgattg 6240 atgacgtcat attttaaaag ttcatgattg taaaatccca tgtgctattt gtcatatttc 6300 gaaagtttcg taatgttttc tcaataggaa ctcgagaaaa tgttttttcc cacggtcacg 6360 tgattgatga cgtcatgaga ataactaata cggcgttgtc ttcgttatta catttgtaca 6420 cttaggttaa agtttgatga ctatcggtaa aaaggggacg tagatatcgc gacaattgtg 6480 atgtaacgac ttgcctgctg gaacgttcaa cgtcacacag tcacgtgatt aataatattt 6540 gatatcatac atgcatgaat atgtttgttg tactttttgt acaatcgtgt caaaatttcg 6600 tgacaatcgg ccatatacct gccgagatat tacacaaatc atcacatttg agttgcctgc 6660 taacttcacc gagttgcctg ctgagttgcc tgccagcttc actatggt 6708 // ID Gyp1c_Cis_I repbase; DNA; INV; 1529 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gyp1c_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-1529 RA Smit A.F.; RT "Gyp1c_Cis_I - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000388, Ci000091. Non-autonomous derivative of Gyp_Cis1. CC Gyp_Cis1c-LTRs. XX SQ Sequence 1529 BP; 459 A; 307 C; 279 G; 482 T; 2 other; tatttggtga ccctgtttct gagcaaattt tgaggtggca ttgatgattg gaaagccgga 60 aaaagaagaa tttcaacwac attgctctgc tccttaaagt ttcgaggtac atttctgcat 120 caactgaaaa cgtcgcattc gccatcaaga acaaacgttg gacatttccg ttctattgtt 180 cgtggaggtg tcaagttcgc acatcggttc atagaagcga gattcatttg gaggattcta 240 cagacgacga tcgcatctac tacgacggaa acgagatcaa cttcaccgga caattcaatt 300 gagttctaaa ttattgtttt gctatagtga atttatttga aactagaagt tatggattcc 360 aagaaagagc cttaacaaga tcaaaaaaac tctcaggcaa gacgttggat atggcgtata 420 gcgtcttgtc taatactact aaatcgccat ctttactgtt tatcacggat tatgaccttt 480 aagtgttaga tctgacctct ttattctaca tcatttctgg attactttcg ccatcggamc 540 ttcgcgtcag tcttcataca gattcggaga agtcaaccac aggttaacta cctgtttaaa 600 ttctacaaga atggtggtta agcaatcaat ccatagacga tatattcttc gaatgtcaac 660 tgcaagtata tctctgttct ctatttaatc tatttaagaa actggaagac tgtttcacag 720 ttaaagtgct atcagccaaa atgggtaatt cccgagctgc aataactatt ggaaatactg 780 aaattgtatg tgagcaaata aatgccacat tacgccacag tctccaatat aaatcctact 840 tctccattcg tcctctcgtt agctttacca atagtaatgg taccttacga attggtcaag 900 taatgaaaaa cgatgtggtc tatgaaggtg taagattagt tgagcactat acacccggaa 960 ggacatttaa atttcgtgta aacgataagt tctacttgta tgataactat actttagaac 1020 atgctgatgt ccatgttcgg cataaatttt ctctaactcc gattgaagaa ccactccgga 1080 cgatttggtt tcgcttgcga aaaaaattcc cttgtcgtcg atgggattgg agcatttcag 1140 cgcaatcctt ggttcaatac atcaatcgca agttacaacg gacaacatca ttcaactcca 1200 gaatgaaaga tgctaacgag acagcagatg cagatcccca tacattcgtt ggaaagtcta 1260 tttgatcact ttgccaaaca cgcactcttg atcgatccta caccaattta agaacatttt 1320 tacggtcaat cgtggggaca tttttaaatc ttgtgtcgcc acaaatacgt gataatcatt 1380 ttattattat tttttcgcac catttcttga gttcgacaat ttttcagttt atttcctcgt 1440 tctctcacat aatatctttt acgaagtatg gccatttttc gttggttact ctaaactaaa 1500 atgaccatac ctctgagggg ggaagtgta 1529 // ID ITmD37D_Ele3 repbase; DNA; INV; 1304 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37D DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37D_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1304 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1304 RA Kojima K.K. and Jurka J.; RT "ITmD37D-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~97% identical to consensus. This consensus CC is ~98% identical to the original sequence in [1]. TA TSDs. 26-bp CC TIRs. XX FH Key Location/Qualifiers FT CDS 163..1185 FT /product="ITmD37D_Ele3_1p" FT /note="transposase." FT /translation="MEAKRELVINLFLKQYRPCAILRKLKPFGINERFIFR FT TIKRYKCTGSVKTAPRSGRKRTVRSPEAIKRVRERIRRNPGKSGRKMAKEL FT GMSQSTLGNVLKHDLGLKAYKKQRVHGLTEKQKIARVERCRKLLKRHGDCE FT ILFSDEKMFLLQDHHNQQNDRVYAQYLSDIPRDKLAVQRFQNVSRVMVWGC FT FSKNMKVPLLFIEPGVKINSKYYIDNVLKNHLLPHVKKHYKNVPYCFQQDS FT APSHQAKVTQAWCETNFPDFISSKEWPASSPDLNPLDFFAWGHMLSQLGNT FT KGLNLETFKQRLVKIWDEMPQDIVRASCEAFQQRMRLVIKAKGERFELD" XX SQ Sequence 1304 BP; 408 A; 246 C; 284 G; 365 T; 1 other; cagggtgatt actaagtcct gtcagtgttt tatttaggtt tgtagttttt ggtaggttaa 60 ttacatacat gtctgtatat ttttcattta gagtgtattc aaatgtcaac acggcttgca 120 actttgtttt gaaaattcat tcgtttaaga cgcgtaaaaa ttatggaagc aaaaagggaa 180 cttgtgataa atctgtttct gaaacagtat cgaccgtgtg ctatcttaag gaagttgaaa 240 ccattcggaa taaacgaaag attcatattc cgcactatca aacgatacaa gtgcaccggt 300 tctgtgaaaa ccgctcccag atctggacgt aaacgcactg tcaggagtcc ggaagccatc 360 aaacgtgtga gagagcgaat tcggcggaat cctggtaagt cgggtcgtaa aatggccaaa 420 gaactgggaa tgtcgcaatc aacgctcggg aatgttttga aacatgatct cggattgaaa 480 gcatacaaaa aacagcgagt gcacggtttg actgaaaagc agaagattgc tagagtcgaa 540 agatgccgga agcttctcaa gcggcacggt gattgtgaaa ttttgttttc ggacgagaaa 600 atgttcctgc tacaggatca tcacaaccag caaaatgatc gggtatacgc acagtacctt 660 tctgatattc cacgcgacaa attggctgtt caacggtttc aaaatgtttc cagagttatg 720 gtttggggat gcttctccaa aaatatgaag gttccattgc tattcattga gccaggtgtc 780 aaaatcaaca gcaaatatta catcgataat gttttgaaga accatttgct tccccacgtt 840 aagaaacact ataagaatgt accatactgc ttccaacaag actctgcgcc gtctcatcag 900 gcaaaagtca cacaggcttg gtgtgagacg aatttcccag attttatttc gtcaaaagag 960 tggcctgctt cgtcacctga tttaaatcct ctggacttct tcgcatgggg tcacatgctg 1020 agtcaactag ggaatactaa aggactgaac cttgaaactt ttaagcaacg tttggtgaaa 1080 atttgggatg aaatgcctca agatatcgtg cgtgccagct gcgaagcctt tcaacaacga 1140 atgcggctgg taattaaagc aaaaggagaa agatttgaac tggattaggt taagttaaat 1200 gtaacagata tactaaacct gaaatacact aaaagttgaa wtccatcaat gtttctagtt 1260 tttatgacca tttactttac tgacaggaat tagtaatcac cctg 1304 // ID Gypsy-1_DFa-LTR repbase; DNA; INV; 300 BP. XX AC ADHC01000033; XX DT 21-APR-2011 (Rel. 16.04, Created) DT 21-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Dictyostelium fasciculatum genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DFa_; KW Gypsy-1_DFa-I; Gypsy-1_DFa-LTR. XX OS Dictyostelium fasciculatum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-300 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Dictyostelium fasciculatum RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; ADHC01000033; Positions 282115 281816. XX SQ Sequence 300 BP; 89 A; 52 C; 58 G; 101 T; 0 other; tgaaggaagg cgtggtacgc tgcatgcctt tggtgtcact gcgagagtgt cagagaacaa 60 ggttcaccct gcattccgaa aatacataaa atatgaaaaa cacaaataaa tagaagtcgc 120 gacgagcttg gtggatcatt cttgtttccc caattcgtgt agctagttat aattgtatat 180 ctatcgtgta gagtcttgtt cagtctagtt tattgtatct tgtcagttat ctagttatct 240 tgtttatcat cattgtcatt attagcagta gatattatat acaccataca caatattaca 300 // ID Helitron-1_AAe repbase; DNA; INV; 8288 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Helitron family from Aedes aegypti. XX KW Helitron; DNA transposon; Transposable Element; nonautonomous; KW Helitron-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-8288 RA Kojima K.K. and Jurka J.; RT "Helitrons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1316-1316 (2011). XX DR [2] (Consensus) XX CC There are 6 sequences with >98% identity. Both termini are CC uncertain. XX FH Key Location/Qualifiers FT CDS join(1272..5303,5273..6118) FT /product="Helitron-1_AAe_1p" FT /translation="MAPNKRKCIGRVQTKAKKVKEQRKQESSLTRNKRLAA FT LRERARASRASETTAEREVRLKGVAFRTLRTRDKETIQCREERLKNAAERA FT AAARAVETQQHRELRLQAMAQRATTSRAEETDQQRKSRLQDMALRSATSRS FT EETVQQRETRLHDMALYATASREEETVQQREDRLYNQRERTVLLRTTETVE FT QHATRVQEMRCMARHSRTIRHINLALEAFRYDPQKNYMEHNDVTIGKMEVV FT CTYCQAKKFKNEAPGICCKNGKVNLSQLEPPPQALLDYMSGNTPESKHFLK FT FIRKYNACFQMTSFGATLVVEQAGFPSTFTVQGQIYHKAGSLLPLPEQPPK FT FLQLYFIGDEQMETDRRCSYISGTRRQIVQHLQQMFQEHNDLVRVFKSALD FT RMPSDEFKVIIRADRRPPDEHERRFNAPQVDEVAVVVTGDAYDQRDIVIQK FT RGESLQRISETHRPYDALQYPIMFPHGEDGYHFNLKQIDPTTGESTTKKVS FT AMDFYAYRLMIRGDAYNHLLNCRQLFQQFVVDMYVKIESERLLYIRLNQRK FT LRVDDYVHLKDAIANDGNVDNLGALVILPATFTGSPRHMHEYTQDAMTYVR FT MYGRPDLFITFTCNPAWPEITEXLTNGQSVNERHDLVARVFRQKQLKLIEA FT ITKGHVFGAPRCWMYTVEWQKRGLPHSHNLIWLVNKLQPTQIDDVISAELP FT DPVVDPELHAVITKNMIHGPCGNLNLSSPCMRDGKCTKKYPRNFVEQTQTG FT NDGYPLYRRRSTENGGMSVKLKVRNTEVEIDNRWVVPYSPLLSKMFQAHIN FT VEYCNSVKSIKYVCKYVNKGSDMAVYGISSENTNDEVMQYQLGRYVSSNEA FT VWRIFGFPIHERHPTVVHLSVHLENGQRVYFTPENASRFADHPPNTTLTAF FT FQLCQEDPFARTLLYPEVPRYYTWDAAKKIFHRRKVGKPVLGHDIFASDAL FT GRVYTVHPNNAECFFLRMLLHSIRGPRSFVELRTVNQTVCQTFREACQVLG FT LLEGDRHWDDTLQEAALLCLPDQIRTLFAIILTTCAPSNPYALWEKYKEAL FT SEDVLMNVRRSIPTVQITYNTDIFNKALIMLEDKCVAMANKTLLQLGLPAP FT IRNEERMIDADYIRETAYNIIDLTDYVVEKESLLNSDQKIAFELIMDEVVK FT QSGGVIFLNASGGTGKTFLTNLILAKIRAQGEIALAVASSGIAATLMEGGR FT TAHSAFKLPLDLTRQENPTCNISKTSGKAQVLKLCKLIIWDECTMAHKKAL FT EALDTTLKDIRGNNKLMGGALLLLCGDFRQTLPVIPKSTPVDEINACLKYS FT AIWKHVTKVSFNIYNDWIIRVNILFLFFFRLPYFIFIFLQVTLKTNMRICL FT GDEHAQTFAKQLLEIGEGRVAIGSNGYIKLAPNFCNLVSSAAELIDVIYPE FT IADNYRNMNWLRERAILAAKNEDVNDMNNQILAKVPGAMMEYKSIDTVEEA FT DDAVNFPPEFLNSLDPAGMPPHRLLIKEGCPIILLRNLDPPKLCNGTRMCV FT RKMLANVIEAVILTGKGQGETVFIPRIPLVPSDLPFSFKRLQFPVRLAFSI FT TINKSQGQSIEHCGVDLRSPCFSHGQLYVACSRVGSPKNLNILAPSDETKN FT VVYKQVLS" XX SQ Sequence 8288 BP; 2775 A; 1518 C; 1700 G; 2289 T; 6 other; atggcagccg tgaaatagtg gatcaaagac tgaaatcatg taaaatggat atctttcaat 60 cggtgtttat ataatggaaa ttaatcttca tctggaaatc caagatggct taccttaatt 120 taaatttgcg actctgaatt aatgcctgaa attatgtaaa tcctagaaag agactgaact 180 atgaaaatcc tagaaagagt aaaaatcatg cggaataaac attcttctat cgagttcgat 240 gatataagat cacaaaaata aatatatagc ggcatttgga aatccgagat aatggaccgg 300 gattcaaaat ggaagccgta ggataaagtt tttttttcga ttcacgatgt tattttggat 360 tcacgaaatc aatatttggc gcccaagata gagaatctga attctaaatg gctttcatga 420 cagtgaattg agtcctgaaa ccatacaaaa tggatggaac atggcggccg tgaaataatg 480 aatgaaagcc tggaaccatg gtgaaaggat taatttcctt cggatttaga tagtatatca 540 tgaacatgaa tatttggtcg atcacattct atttaaattt gacatggttt ttcagtgtaa 600 ttaaaaaaaa aaataaagtc aaaattaagt gaacaaactt taaaaatata aaataaaaaa 660 caacacaaag tgagataaac tcatttcaat tcgaataatc gatagagatg gaagatagag 720 atagacatag agaagataga gatggaaagc acttttatta ggtactagct aacattaccc 780 ggctttacga tcattgtttg aagatttcaa tcgcgattaa gcgtacctac tatatgttga 840 agatcaggta cttcacacac cctgcagtgg agatgtcaat tggtaagaat tgtagaggta 900 acgcgtagcg gataatattg tacccagaca tgaaataaaa gagaggagct ggcagatagc 960 ggtatcagtc agtagcctgt ggagcagaag ccttgagttt atgtttatta ttttcaacac 1020 tatattccat tacaatcaat cagtttcgtg ttctatcaaa aaagtgtctt tagtaaagca 1080 aaagtgagaa ttacggagat aaagataata attatcgaag gaagtgagta aaagtgttaa 1140 ttctattcca ttaattaatt actaattcca ttagtaatga tataaaccct catattgatt 1200 catttattaa caaaatttag ttttcagatt aaatagaatt tattcaagtg ttattctctg 1260 caatcgatac aatggctccg aataaacgaa agtgcattgg tcgagtgcaa acaaaagcga 1320 aaaaagtgaa agaacagcgc aaacaagaaa gttcattgac aaggaataaa cgtcttgccg 1380 cattgcgtga acgggcaagg gcttctagag cttcagaaac gacagcagaa cgggaggtca 1440 ggttgaaagg tgttgcgttc agaacgttaa gaactagaga taaggaaaca attcagtgtc 1500 gcgaagaacg gcttaaaaat gcggcagaaa gagctgcggc agcaagagct gtggagacac 1560 aacaacatcg tgaattacga ctgcaggcta tggcacagag agccacgact tcgagagcag 1620 aagagacaga tcaacaacgt aaatcccgac tacaagatat ggccctgcga tccgcgacat 1680 ctagatcaga agaaacagta caacaacgtg aaacacggct gcatgacatg gccctatatg 1740 caacggcatc cagagaagaa gagactgttc aacagcgtga agatcgtcta tacaaccaaa 1800 gagagaggac agtgcttttg agaacaacag aaactgtcga gcagcatgca actcgagtac 1860 aggaaatgag gtgtatggca agacactcac gaaccatacg acatatcaat ctggcattgg 1920 aagcgtttcg ttatgatcca cagaagaact acatggaaca taatgacgta actataggaa 1980 aaatggaggt agtttgcacc tattgccaag ctaagaagtt caaaaatgaa gctccgggaa 2040 tttgttgtaa aaatggaaaa gtgaatctta gccagttaga accaccgcca caagcacttt 2100 tggactacat gtcaggaaat acgccagaat ccaaacattt tctcaaattc ataagaaaat 2160 acaatgcatg ctttcaaatg acatcctttg gcgcaacatt agttgttgaa caagctggtt 2220 ttccatcgac attcacagtt caaggacaga tttatcataa ggctggatca ttgcttccgt 2280 taccagagca gccgccaaag tttctgcagc tgtatttcat cggagatgaa caaatggaaa 2340 cagatcgcag atgcagctac atatcaggaa ctagacgaca gatagtgcaa catttgcaac 2400 agatgtttca ggagcacaac gatttggtca gagtttttaa atctgcattg gatcgtatgc 2460 cttcggatga attcaaggtg atcataaggg ctgatcgaag acctcctgat gaacatgagc 2520 gtcgcttcaa tgcaccacaa gtagatgagg ttgcagttgt tgtaactggt gatgcttatg 2580 atcaacgtga tatcgtcatt caaaaacgag gagaatcatt acaaaggatt tcggaaactc 2640 atagacccta cgatgctttg cagtacccaa taatgttccc tcatggtgaa gatggatatc 2700 acttcaatct aaaacaaatc gatccgacga cgggtgaatc taccaccaaa aaagtttcag 2760 ccatggattt ttatgcctac cggctaatga ttcggggcga tgcttacaac cacttactca 2820 attgcaggca actattccaa cagttcgtcg ttgatatgta cgtcaaaatc gaaagtgaac 2880 gccttcttta cattcgcttg aatcaacgaa aactaagagt tgatgactat gtccacctca 2940 aggatgcaat agcgaacgat ggtaacgttg acaaccttgg cgcgttagtt attctgccag 3000 ctacatttac aggtagccct agacatatgc acgaatacac tcaagatgcc atgacgtacg 3060 tcaggatgta tgggagacct gatctgttca ttaccttcac atgcaaccca gcttggcctg 3120 agatcacgga gwtgcttacc aatggacaat cagtaaacga acgtcacgat ttggtggcga 3180 gagttttcag acaaaagcaa ctaaaactta ttgaagccat aacgaagggt catgtttttg 3240 gagcgccaag gtgttggatg tatacagttg aatggcaaaa gcgaggattg ccacattctc 3300 acaatttgat atggttggta aacaaacttc aaccaacaca aatcgatgac gtcatatccg 3360 ctgagctacc tgatccggtw gtagatcctg aacttcatgc tgttataaca aaaaatatga 3420 tccacggtcc atgcggcaac ctcaatttaa gttcaccatg tatgagagac ggaaaatgta 3480 ccaaaaaata tcccagaaat ttcgttgaac aaactcaaac tggaaacgat ggctatccat 3540 tatatagacg acgtagtaca gaaaatggtg gaatgtcggt gaaattgaaa gtacgcaata 3600 cagaagtaga aattgataat agatgggttg taccatattc gccattgctg tcgaaaatgt 3660 tccaagcaca catcaacgtg gaatactgta acagtgtaaa atccattaaa tacgtwtgca 3720 agtacgtgaa taagggaagc gatatggctg tttatggaat aagcagtgaa aacacaaatg 3780 acgaagtaat gcagtatcaa ttgggaagat acgttagtag taacgaagca gtatggcgaa 3840 tttttgggtt tccgatmcat gaaagacatc ctactgttgt tcatctcagc gttcatttgg 3900 aaaacggcca gcgagtttac ttcacacctg agaatgcttc tagatttgct gatcatccgc 3960 caaatacaac attgactgca tttttccagc tatgtcaaga agatcctttt gcaagaactt 4020 tgttgtatcc tgaagttcca cgttactata cgtgggacgc agcgaagaag attttccaca 4080 ggaggaaagt gggaaagccc gttcttggac atgatatctt tgcaagtgac gctttgggtc 4140 gagtttatac cgttcatccc aacaatgctg aatgtttttt tctgaggatg cttctacatt 4200 cgattcgagg accaaggtct ttcgtggaac tcagaaccgt aaaccaaaca gtgtgtcaaa 4260 catttcgaga ggcctgccag gtgttaggat tgttggaagg tgatcggcat tgggatgata 4320 cattgcaaga agctgcatta ttgtgccttc cagatcaaat aagaactctt tttgccataa 4380 tattgaccac atgtgctcca tccaacccat atgcattgtg ggaaaaatat aaagaggcac 4440 ttagtgaaga tgtactcatg aacgtccgta gatcaattcc cacagtacaa atcacgtata 4500 atacagatat tttcaacaaa gctctgatca tgcttgagga taaatgtgtg gccatggcta 4560 ataaaacgct gttacagctt ggtttgcctg caccgatacg aaacgaagaa agaatgatag 4620 atgcagatta catacgtgaa acagcctaca atattataga tctaacagac tacgtcgttg 4680 aaaaggaatc attgctgaat tcagatcaga aaatagcttt cgaactgatc atggatgaag 4740 tagtaaagca aagcggtggt gttatatttt tgaatgcttc tggaggcacc ggcaaaacat 4800 ttttgaccaa cttgattctc gcaaaaatca gagcccaagg agaaatcgct cttgcagtag 4860 cgtcttccgg aatagcggca acgttgatgg aaggaggcag gactgctcat tctgcgttca 4920 agctaccact tgatttaacc aggcaggaaa accctacctg taatataagc aaaacaagcg 4980 gtaaggcaca agtgctcaaa ttgtgcaagc tgatcatatg ggatgaatgt actatggctc 5040 ataaaaaagc tctagaagcc ctggacacca cattgaaaga catacgcggt aataataaat 5100 taatgggagg agcgctgtta ctgctgtgtg gagactttcg tcaaacgcta cccgtcattc 5160 cgaagtctac tcccgttgat gagattaatg cttgcttgaa gtattcagct atatggaaac 5220 atgttaccaa ggtaagtttt aatatatata atgactggat tatacgagtt aatattttat 5280 ttttattttt cttcaggtta ccttgaaaac aaatatgaga atttgtttgg gcgatgaaca 5340 tgctcaaaca ttcgccaaac aacttctgga aatcggagaa ggccgagttg caattggttc 5400 gaatgggtac atcaaattgg cacctaattt ttgtaacctg gtgtcatctg cagcagaact 5460 gatagatgtg atctatccag aaattgctga caactatcgt aatatgaact ggttgcggga 5520 aagggcaatc cttgcagcca aaaatgaaga cgtcaacgac atgaataatc aaatacttgc 5580 caaggttcca ggtgcaatga tggaatataa atcaatcgac actgttgaag aagcggatga 5640 tgctgtcaat ttcccaccgg aattcctgaa ttcgcttgat cctgctggaa tgccaccaca 5700 ccgtctactt atcaaagaag gatgtcccat catactgctg cgaaatcttg atcctcctaa 5760 actgtgtaac ggaacgagaa tgtgtgtgag aaagatgcta gctaatgtta tagaagcagt 5820 catcctaaca ggaaaaggac aaggtgaaac tgttttcatt ccaaggatac cactggtgcc 5880 ttcagatctt ccattttctt tcaaaagact gcaatttcca gtgaggcttg cgttctcgat 5940 aacaatcaat aaatcgcagg gtcagtcgat cgagcactgt ggggtggatt taaggtcccc 6000 atgtttttct catggtcaac tttacgttgc atgttcgagg gttggttcac cgaaaaattt 6060 gaacattctt gccccaagtg atgaaacgaa gaacgtggtt tacaagcaag ttttgtcttg 6120 aaggcagaag aaattattgg cattgaatgt gttgacgtga aattcgaggt aaaagtgcat 6180 aataaactgt aggggaaagt gccatataaa atgtagataa gaaataaaaa gtgaaataaa 6240 agtcatattc aaatggttaa catgaatttc attatgaagg ctgtgtattt tttttaatat 6300 aaatcttaag atatttacta tgattgacaa aataccaaga agagaacatc tatcgtaagt 6360 ttagtaaaat gagtttaatt tcaagctcat gtaaatctat aacacattac ggagtaagag 6420 gaaaatggct attttttaca aggagtaaaa ctgtagagct gttcacagtg ttaaaggagt 6480 catatgcaat cttcagtggc gaaaaaaggt gttggttaca aaagcagtta tgctcttttt 6540 aaaatgttgc agaaatgaaa ttaaaaaaaa aaaaagaatt gccctgcaaa aaagggtttt 6600 tttgtttgca catagtaaca aacgctaaaa atgatattgc actccttgat ttgattatcg 6660 acagtatccg cttatcttta catttccacc ttctaaaatt ttgactacgt atgcacaaac 6720 ttggagcaga aaaaaataca tttttaacac ttccggccga aacaatttta gttcaagtca 6780 ttttactacg tatgattttt ttgtggcgac acgtcattac cgttaaaaaa tttatacgac 6840 agatccgctt tacggataga gaaactgaag cagcagaatg ccgccgtagt cttattttcc 6900 aacgagtgca agtttttata cgtgagtcac acatcacaca atctataaca caagtactga 6960 gaaaaaatga tctgtttata aaattatact atgttattac ttacagttct acatggatca 7020 aattttgcat ttgttagttg gttgaatgac cggaataaca tatcttatat cacaataact 7080 tttgtagaaa gaatgacatc caaagtctat cattaatttc cttttagttt attctccttc 7140 ttggcatttc gtccccattg gtatagaggc tgcttttcag ccacaacacg atagtattgc 7200 ttcgttagag cgtattacac cgagatacct taagcttcag gggcgaagcc tcatgaagac 7260 ccgtattccg aagtgttagt gttgcatcaa atcgattgta ggcaaatgtc cttgcaagcg 7320 ttcgcccgct taatgcaaaa aagattgttt tccctttttt taggtccgac gagcgatagc 7380 gagtaaggac agcaagcgtc tgcccggtaa aatgcaatca tagttttaat cgacagttat 7440 ttgtaaaaca ctacaaatca ctcaactggg aacaacttca gttctcaaaa attaaataaa 7500 acttagaatt cttatagtga ttatgttatg accaatattt atatttttat tctgagcaaa 7560 tttgctcctt tcttcttctt ggtattagtt tgcatctcag cctaatgttt tatgaatgct 7620 tgcgcattcc ttaaccgagt tgctaaagtt gcatgttttt cactcttata tcatgtaaca 7680 ggtacaataa tactttattc ccataaaagt cccaaaaatt tcttttaaaa aagattcggg 7740 accatccgga atcgaaccta ggttacttct tcaaagcttt gctttacagc cccgggcgtt 7800 accacatggc tacggawgaa ctcatttcac acaataggga tcgcccatat ataacgccta 7860 tgtacctact gctatcctat catgcaatag aatgtcggtt ccagagacac attactcgcc 7920 aatcgcaaga tttgtgaaaa gcttaatgta tagaagggga tgcttaatgt atagaagaga 7980 tgctccaaaa agatctcaac atattttatt ttgtacaacc atgtgaaaag gccgcttttc 8040 gtacacagca tcagcgcggt ggttttggcg cagtgagatt gcgtaacaag caaaatgtta 8100 agtttacaat caattccaga aacggctgct gttctttttg gtatgtaatg caataagcat 8160 gaagcagcag aaatatttag tgtaattaat taaaagcttt tacagaaatt acgactaaaa 8220 attgaagtga ctccaacatc aaattacatc ccgagcaagg ccgggtaata tcagctagta 8280 atawtata 8288 // ID Gypsy2-NVi_LTR repbase; DNA; INV; 304 BP. XX AC AAZX01004373; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2-NVi; KW Gypsy2-NVi_I; Gypsy2-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-304 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1119-1119 (2007). XX DR Genome; AAZX01004373; Positions 4698 5001. XX SQ Sequence 304 BP; 69 A; 67 C; 70 G; 98 T; 0 other; tgacgatcaa gagtagtgct acgatgcaat aatactcgtg ccgtactaca accgtttgta 60 gtcgagagta gcgctctagt gcggcctcgc ggcaagtctc ttcgagcgac tccgagcggg 120 tcagactttg tcagaccgcc gagcggtgca tttggttttc tgggcagcta gctgcgagtt 180 cataactttt aatgtaacct tccattttat ttgattatta aacttattta cacgttttta 240 aaagagttac ggtcattttg cgtgaatgtg tcctcttttg cccctttgaa taggtcactt 300 aaca 304 // ID DNA8-93_AP repbase; DNA; INV; 478 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-93_AP. XX NM DNA8-93_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-478 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2030-2030 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 478 BP; 194 A; 59 C; 51 G; 174 T; 0 other; cagacttggg taaaatactt ttaaaaagta tttcaaatac atattccgaa tacttaaaaa 60 aaaagtattc agaatatgta ttcgaatact ttatgataat aatattccta agtatttgga 120 atacttcaca aaaagtattc caaatatttt tcgaatactt ttttttagta gatttagatt 180 tcaatagcaa aaatattact ttttcaattc tgtgtttata aatgaacatt attataatct 240 tgataaaaca cacctatctg gctatctata atctattagt tataaaatat tatattaaaa 300 ttaatattgg atgttaaaaa agtattccga atactgtact cagaatactt gtaaaaagta 360 ttcaaaatac cgtattgaat actttcaaaa aaaagtattt aaaaagtatt cggaatactt 420 aaaaagtatt tcgaatacag tattccgaat actgtattcg gaatactacc caagtctg 478 // ID Gypsy-3-I_HM repbase; DNA; INV; 3719 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-3-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3719 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1972-1972 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(17..727,688..3699) FT /product="Gypsy-3-I_HM_1p" FT /translation="MPPVFDGNHQRFVRQLRTYITAMDVPESKRKTIILTC FT LPPQIYSALEDLCLPAYPEDDSVTYEDIEENIMKLFKPKSSLILRFEFATI FT KKASNENVTEFSRRITRAAEGCKFTDRDDRMRDQFITGFNDGVTIKRLLLE FT SEELTFEKAVEIAVTLERVTQEARQLDGSENVMIGATSLLSRTSGASDINK FT VKCFNCGKLGHFARDCEEKCKTCSHDHQAKNCFKRLQGLKGKGRIWKTIAR FT FKRKRENLEIMASSILPEMTLPSLKVKINGRLRTALIDSGCSITVVHAESA FT HGLVLKSQKLITTLGGTVQVLGEMVINLEIDGIHRMVNSLVVKEKPFGFDL FT IFGMNAIKKFGGVIIYGNKNCNVRMLACDKAKCGAVTTTRKNENEINELKI FT DVINTTRKSENEVQKIKCDVKNTTRNEENERNEIKHQDFTAKFDGKKWTAS FT WNWNKEPVIDNTICEYKMNQEHERSYRKEIREWIDSGILVPYNELILGPPK FT VLIPLMCVVQENKDKKVRPVGDFRALNEFVNCHTANADVCQEKLRCWRKIK FT NAKILDLKKAYLQIHIDKKLWPYQTVIINGQRYCFTRMCFGINVAPTIMTT FT ILRHVLNLNPTVKKACDNYIDDIFVDESVETAKNVAKHLLKFGLQCKPPQD FT LEQGRVLGLRVERNKNGELIWKRDNVVQFESLNAVTRSQLFSQLGKLIGHY FT PVAGSLRLQASYIKRLAGNIPWCDFISEDCKEKLKELLYRVEKEDPVGGKW FT LVPVNGKAELYCDASSIGLGVVLLIGGVVVEDAAWLRPASDTHHINISELD FT SILRGLNLASKWGIKELTVYSDSKSTVGWLNAVLKEEYRVKTRGISQLLVQ FT RRLNTIKEVAKDIQIAVQWIPSEINLADRLSRIPQKWCRTVCAAGLLEQKD FT IWKIHSIGHFGVARTLQLCLRNNQNVSRKEVSAVIANCNQCNSIDPYSVKW FT KNGQLSVEKNWNRVAIDVTHVGAELYLSMIDCGPSRYTIWRKLATKAAAEV FT VGRLEEIFSLFGPPVCLLMDNEFRSNTLTSFAEEWHVNLEYRAAHRAQGNG FT IVERIHRTIKRTVARSHCSIALAVLLYNETISSNCTESPSKQFKKRSTCFI FT PGVDTRRKVDEKGVSKKFKVGDNVWVKPTQETKCDQPWIPGVVRKQNTLSF FT DVETANRVYPRHISHLRRRTDVQSSNESVNDVFYRLNDKTDNVKGSYNDDV FT AQVQNDSQELNVLQVTRSGRTVKPPQRLDL*" XX SQ Sequence 3719 BP; 1300 A; 652 C; 819 G; 948 T; 0 other; agctatttcg gccattatgc caccggtatt cgatggcaac catcagcggt tcgtccgcca 60 actacgaaca tacataactg cgatggacgt acccgaatcc aaaagaaaaa caataatatt 120 aacatgtttg ccaccacaga tttattccgc actagaagat ctatgcttgc ctgcataccc 180 tgaagatgac tcagttacct acgaagacat cgaggaaaac atcatgaagc tttttaaacc 240 gaagtcatcg ttaatactac gctttgaatt tgcaactata aaaaaagcaa gtaacgaaaa 300 tgtgacggaa ttttcacgac ggattacaag agctgctgaa ggatgcaaat tcacggatag 360 agatgatcga atgcgcgacc aattcatcac gggttttaac gatggtgtta caatcaaacg 420 gttactactg gaatctgaag aactaacttt tgaaaaggct gtggaaatag ctgttaccct 480 tgaacgagta acccaggaag ctcgacaatt agatggttcc gagaatgtaa tgattggcgc 540 aacatcgtta ctttcacgga cgtctggtgc gtccgacata aacaaagtta aatgtttcaa 600 ttgtggaaaa cttggacatt ttgcaaggga ctgcgaagaa aaatgcaaga cttgttcaca 660 cgatcatcaa gctaaaaact gctttaaacg attgcaaggt ttaaaaggaa aagggagaat 720 ctggaaataa tggcatccag tattctccca gaaatgacgt tgccatcatt aaaagtgaaa 780 ataaatggaa ggctccgaac agcattaatt gattctggat gctccattac tgtggttcat 840 gctgagtccg cgcatgggct tgtgttaaaa agtcaaaaac ttatcacaac tttgggagga 900 actgtgcaag tgctgggtga aatggtaatc aatctggaaa ttgatggaat acaccggatg 960 gttaattctc tggttgtaaa agaaaaacca tttgggtttg atttgatttt tggaatgaat 1020 gcaattaaaa aatttggagg agttatcatt tatggtaata aaaactgtaa cgtccgaatg 1080 ttagcatgtg acaaagcgaa atgtggtgcc gttactacga cgagaaaaaa cgaaaacgaa 1140 ataaacgaat taaaaattga tgttattaat acgacaagaa aaagcgaaaa cgaagtacaa 1200 aaaataaaat gtgatgtcaa aaatacaaca agaaatgaag agaacgaaag aaacgaaata 1260 aaacatcagg acttcacagc aaagttcgac ggaaaaaaat ggacggcatc atggaactgg 1320 aataaagagc ctgtaattga taataccatt tgtgaataca agatgaacca agagcacgaa 1380 cgtagttacc gtaaagaaat acgtgaatgg attgactctg gtattcttgt accatataac 1440 gaactaatat tgggaccacc aaaagtattg ataccgctta tgtgcgtagt gcaagaaaac 1500 aaggacaaaa aggtacgtcc agtgggagat ttccgagctc tcaacgaatt tgtaaattgt 1560 catacggcaa atgcggacgt ttgtcaagaa aaactgcgtt gctggagaaa aataaaaaat 1620 gcaaaaattc tcgatcttaa aaaagcctac ctccaaattc atattgataa aaaactttgg 1680 ccatatcaaa cggtgattat aaatgggcaa cggtattgtt ttaccaggat gtgttttggc 1740 atcaatgtgg ctccaacaat aatgaccaca attcttcggc atgtgctcaa tttaaatcca 1800 accgtaaaga aagcatgtga taactacatt gatgatatat ttgtggatga aagtgtggaa 1860 actgcgaaaa atgtagcgaa gcatctgctt aaattcggat tacaatgtaa acctccacaa 1920 gatcttgaac aaggacgtgt cttaggattg agagtcgaaa gaaataaaaa tggggaactg 1980 atatggaaac gagacaatgt ggtacaattc gaatctttaa atgcagtcac acgatcacag 2040 cttttcagcc agttaggcaa acttattgga cattacccag tggcaggaag tttgcgttta 2100 caagcttcat acatcaaacg tttagctgga aatatacctt ggtgtgattt tatttctgaa 2160 gattgtaaag aaaaactgaa agagttgctt taccgtgtgg aaaaagagga tccggttggt 2220 ggaaaatggc tggtaccagt taatggaaaa gctgagcttt attgtgatgc atcatctata 2280 ggacttgggg tagttctgct cattggtgga gtagttgttg aagacgcagc atggctacgg 2340 ccagcgagtg acacacacca tatcaatata agtgaactgg attctatact tcgaggacta 2400 aacttagcta gtaagtgggg cataaaagag ttaacagtat acagtgatag caagagcacc 2460 gtaggatggt taaacgcagt tttgaaggaa gaatatcgcg taaaaactcg gggaatttct 2520 caattgttag tacaaagacg attgaatact attaaggaag tggcaaaaga tattcagatc 2580 gcagtacaat ggattccatc tgagataaat ttggctgatc gactctcacg aatccctcaa 2640 aaatggtgca gaacagtgtg tgcagctgga ttacttgaac aaaaagacat atggaaaatt 2700 cattcaattg gtcattttgg tgttgctcga acattacaac tttgcttgcg aaataatcag 2760 aatgtatcac gtaaggaggt ttcagcagta attgcgaatt gtaaccagtg taactccatt 2820 gatccttact ccgtaaagtg gaagaatggc caactaagcg tagaaaaaaa ttggaatcgg 2880 gttgccattg acgtgacgca tgtaggtgct gaactgtact tatcaatgat tgactgtggt 2940 ccatcacggt atacaatatg gcgtaaactt gcaactaaag cagctgctga agttgttggt 3000 agactcgaag aaatattctc tttatttggt ccgccagtat gtcttctaat ggacaacgaa 3060 tttcggtcca acacgttaac aagttttgct gaagaatggc atgttaattt ggaatatcga 3120 gcagcacatc gagctcaagg taatggtata gttgaacgta tacacagaac gataaaaaga 3180 acggtcgctc gctcacattg ttcaatcgct ctggcagtct tgctgtacaa cgagacaata 3240 tcatccaatt gtactgaaag tccatccaag caattcaaga aaagatcgac atgtttcatc 3300 ccaggcgtcg atactcggag gaaggtagac gagaaaggcg taagcaagaa attcaaagtg 3360 ggtgacaatg tttgggtaaa accaactcaa gaaacaaaat gtgaccagcc gtggatacca 3420 ggcgtagtac gaaagcaaaa tactttgtcg tttgacgtgg aaactgcgaa tcgggtatat 3480 ccgcgtcaca tttcacacct gcgtcgccgt acagacgtac agagttcaaa tgaatcagtc 3540 aacgatgttt tttatagact aaacgataaa actgataacg ttaaaggaag ttataatgat 3600 gatgtcgctc aagttcaaaa tgattctcaa gaactcaacg tgctgcaagt aacacgctcc 3660 ggacgaacag taaagccccc acaaagatta gacttatgag ttaattcacc aagagggca 3719 // ID DNA8-3_AP repbase; DNA; INV; 225 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-3_AP. XX NM DNA8-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-225 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1745-1745 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 225 BP; 66 A; 29 C; 61 G; 69 T; 0 other; cagtggcgtg ccgaacagtc attttttgag gcaattgcct caagggaaat ttgggtgggt 60 gggtgggtgt gttagtatag atgtgcaact tatacctaaa aaaacttaat aatatttatt 120 attttgaata ataaaaaaaa atagtgaaaa aactgttggt ggtggggagg gggatcaaat 180 tgtatagttt gcctcaggtc tgcttgtcag ttcggcacgc cactg 225 // ID DNA-9B_AAe repbase; DNA; INV; 1276 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-9B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1276 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1263-1263 (2011). XX DR [2] (Consensus) XX CC ~90% identical to consensus. Present in >6800 copies in the CC genome. CC TA TSD. XX SQ Sequence 1276 BP; 434 A; 192 C; 187 G; 463 T; 0 other; ggactgtcca ttttatagag tggacagctt gtttatgcta tatctttttt atttattgat 60 aaaatcgtaa tcggttttct gtgcatcgtt caactattat tccacaatgt tatgaaaata 120 taaaatctta caaaatgctt ttggttgaag aaattaatgg tttttccaaa aactcctaag 180 aaaaactgtt cgtcaaagtt aagcattatt tttcgcacaa caaaaatcct tatttttatg 240 aacaaaattg tgtgttggta tccattacta ttctaatata ggtagagctt gaaaaaatca 300 ataatattcc accattctct gacactagat cgctctaggg gtctatcctt tatggccaaa 360 tttgctgata caccatcctt ttactcccag aaaaaaagta aagtagaaag cgtgtttaaa 420 attggtttag attactaaag aaactatatt tgtataaata agagccaaat cgtgagtttt 480 aaccgcgttc ttaatataaa ttttactctt tatggtcgtt tcattgaaaa atctcatact 540 tttaaaataa tttacgcata tttatcgatc tacagcaaat tgtagacgtt tttctccaaa 600 tattttgata ggaaatctcg gaataacata ttatgaaaat aatacgtgga aaataaattc 660 gatttcatta cagcagcgta atgccaattt tgatgattgt cattgtgctt tgagaggtct 720 tctgaaaata aaacatttgt ttttctattg tactttaaat gtgaggtggg gactaaataa 780 gcaactctat gaaggcatag gttatttctc atattaatac tatatttgca cttccgtcag 840 gacgtacttt gaacctaaaa attatactca tatcagttac ctttcaattt aaaatatttt 900 tcttcaaaaa aatcggtgaa aaatgatttt atgatatctg gtaaattgtt gctccaaaaa 960 taacttgata ttttgcagca ggcttttgat tgatgatgct tccgaagaac aagccttttt 1020 cgaaaataaa aaatatagat tgtcgaattt tccagaattt ttctaataat cattgatttt 1080 gataacctcc aaaaatcgac cgttctaaac gttgtgcctt attcgtgcga aatttaatta 1140 ttttggtggc ttttttatta tcacaatatt gtacaacatt agtcgaacga tgtacagaaa 1200 accgattgcg atttcattaa taaatataaa aagatattgc atgtacaagg tgtccacttt 1260 ataaaatgaa cagtcc 1276 // ID DESMAR1 repbase; DNA; INV; 1292 BP. XX AC U24436; XX DT 15-SEP-2005 (Rel. 10.09, Created) DT 15-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Mayetiola destructor Desmar1 mariner DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; DESMAR1. XX OS Mayetiola destructor OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Sciaroidea; OC Cecidomyiidae; Mayetiola. XX RN [1] RP 1-1292 RA Russell V.W. and Shukle R.H.; RT "Direct Submission."; RL Direct Submission to EMBL/GenBank/DDBJ (10-MAY-1995). XX DR EMBL/GenBank/DDBJ; U24436; Positions 99 1390. XX FH Key Location/Qualifiers FT CDS 170..1210 FT /product="DESMAR1_1p" FT /translation="MENFENWRKRRHLREVLLGHFFAKKTAAESHRLLVEV FT YGEHALAKTQCFEWFQRFKSGDFDTEDKERPGQPKKFEDEELEALLDEDCC FT QTQEELAKSLGVTQQAISKRLKAAGYIQKQGNWVPHELKPRDVERRFCMSE FT MLLQRHKKKSFLSRIITGDEKWIHYDNSKRKKSYVKRGGRAKSTPKSNLHG FT AKVMLCIWWDQRGVLYYELLEPGQTITGDLYRTQLIRLKQALAEKRPEYAK FT RHGAVIFHHDNARPHVALPVKNYLENSGWEVLPHPPYSPDLAPSDYHLFRS FT MQNDLAGKRFTSEQGIRKWLDSFLAAKPAKFFEKGIHELSERWEKVIASDG FT QYFE" XX SQ Sequence 1292 BP; 418 A; 251 C; 289 G; 334 T; 0 other; tattgggtgt acaacttaaa aaccggaatt ggacgctaga tgtccacact aacgaatagt 60 gtaaaagcac aaatttcata tatacgtcat tttgaaggta catttgacag ctatcaaaat 120 cagtcaataa aactattcta tctgtgtgca tcatattttt ttattaacta tggaaaattt 180 tgaaaactgg cgaaaaagac ggcatttgcg ggaagttttg ttgggccact tttttgcaaa 240 aaaaactgct gcagaaagtc accgtttgct tgtggaagtt tacggcgaac atgccttagc 300 aaaaacacag tgtttcgaat ggtttcaacg cttcaaaagt ggtgactttg atacggaaga 360 caaagaacgt cctggtcagc caaaaaagtt tgaagacgaa gaactggagg cattactcga 420 tgaagattgt tgtcaaacgc aagaagaact tgcaaaatct ttaggagtga cacaacaagc 480 catttcaaaa cggctaaagg cagctggata cattcaaaag caaggaaatt gggtcccaca 540 cgaattgaag ccgagagacg ttgaacggcg attttgcatg tcggaaatgc ttcttcaacg 600 ccacaaaaag aagtcatttt tgagtcgaat tattactgga gatgaaaaat ggattcacta 660 cgataattcc aagcgcaaaa aatcatatgt gaagcgcggc ggacgagcca aatcaacacc 720 aaagtcgaat ctccatggcg ccaaggtaat gctctgtatt tggtgggatc agaggggtgt 780 tttgtattat gagctattgg aaccgggtca gacgatcaca ggggacctct accgaacaca 840 attgatccgt ttgaagcaag cattggccga aaaacgcccg gaatatgcga aaagacacgg 900 ggcggtaata ttccatcatg acaacgctcg gccacatgtt gctttaccgg ttaagaacta 960 tttggaaaac agtggatggg aagttttacc ccacccgcct tatagcccag accttgcccc 1020 ttctgactac catttgttcc ggtcgatgca gaatgacctt gcgggaaaac gcttcacttc 1080 agagcagggt atccgaaaat ggcttgattc attcttggcc gccaagccgg cgaagttttt 1140 tgagaaggga atccatgaat tgtcagaaag atgggaaaaa gtcatagctt cagatgggca 1200 atactttgaa taatgcattc attcattttg ttatcgaaat aaagcattaa ttttcactaa 1260 aaaattccgg tttttaagtt gtacacccaa ta 1292 // ID hAT-39_SM repbase; DNA; INV; 3077 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-39_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3077 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1842-1842 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1168..2871 FT /product="hAT-39_SM_1p" FT /translation="MCVLCXKILANSSLAPAKLRRHLETTHTDYKGKDMQF FT FKRQRDSLEKSKLQMITMAKTENENATEASYRVSYRIALNGEAHTIGETLI FT KPCAKEMVSCVLNEQATKKIDAIQLSNNTVSRRIKDLRTSIEKELIRRLGL FT CSEYALQMDESTDVAGLAVLLVFVRYTYDKSIEEDLLTCAYLHGKTTGEDI FT FNCIDDYITKQEIGWEKCXDXCTDGARAMVGKMKGVVSRIXKVATRATSSH FT CILHRXALLTKKIPTNLKNVMDEAVKMINYVKSRPLQSRLFKILCDEMGSG FT HKALLLHTEVRWLSRGXALLRLYELRNELLVFLKEQNDSRHNFVEFLXNPS FT WLIXLAYLANIFSKCNDVSLSLQGKSITIFNTRDKIESFDRKIQFWISSVE FT SNNFDCFPVLDECLNALNFQVDKEISKVFLEHLYNLKTAISEYFPKTSKDD FT SWVRNPFSVTVKPVGLSARDYEHLIDLINDSDLKEKFKDQPLNDFWASLTE FT EFPNVSKQAVVCLLPFSTTYFCEVGFSKYFATKTKYRNRLDATADMRIQLT FT NIVPDIKKICNLRTQNHQSH*" XX SQ Sequence 3077 BP; 1081 A; 475 C; 531 G; 970 T; 20 other; cagggtttcc caaactgtgt tccgctaaac ctgaataggg gttccgcgac aatttgtcgg 60 aagaatttaa tttaatttaa tataatatac acatattcat atttcaacac cgctgatatt 120 attcacaaga aattattatt tgtaaaaata tttcacaata gttcttaagt taggtactag 180 acccttccag gtagtgagac taagctttat ggaagttgaa aagagccgct tatgctatta 240 ttgcagtaaa tcttcatttt acatcctaaa aaaatctttg aaatttatta ctgcgataag 300 agtcaataat atgaacatct gtgcgatgtg gaaacaaaaa caacgggata acggatttgc 360 acgtgtttca cgaaatttct gttcccaacg aattcaacca tatttttttc gaaataatgt 420 aaaaatcaac cagtcttact acatccttgt tagcaattag tcgaactata tacgcgcatg 480 gtgccataaa tgttggtgaa aaaatcgtgt gaaattagtt gtggaaaata atccttatca 540 atcaaatatc gaacgtctaa aggagaaccc aaacttcata tataatacat atatgtatat 600 gaatctacat ttggaaatcg atttgtacgt gaattttata caagattttt aaatcttggg 660 atccaatctg tcaaatttac aaaaggttat tgcggacaat ttttcacgcg tttttattat 720 acatgttaat tcctgaaact aatttattgc gttaaaacag gattgggtat atacttttgg 780 aaatgtaaat gtattaaata aaatttgcat aaaaatgaat cgattttctt gtacattaat 840 ttagtccggt tgcttacaat gtcaaaaatt tgttcctatt ttgacaaaat ttaaatatta 900 ttncagaatt tacatattat ttaaaaacta tccatggatc tctggttgaa gacaggaaat 960 tcgacgaaag aaaggccaaa tacttctact gatacgtcta tcatcgaaga agaaaataat 1020 ccaaaaactg tgatttcttc aaaatcgaat attaaagaac aaatgtcatc tacgaatgtc 1080 atctagaaca gaccaaaacg gaagtacgat gaaagttatt taaatttngg atttacgtct 1140 actggcaatt cagatgctcc tgatgcaatg tgtgttttgt gtcanaaaat actcgcaaac 1200 agttctttgg ctccagcnaa actccgtagg cacctcgaaa cgacgcacac tgattacaaa 1260 ggaaaagaca tgcaattttt taagagacaa cgtgactcac tagaaaagtc caaacttcaa 1320 atgattacaa tggctaaaac tgaaaatgaa aatgcaacag aggcgtctta tcgcgtgagt 1380 tatcgcattg cacttaatgg tgaagctcat acaatcggag agacgctcat taagccatgt 1440 gcgaaagaaa tggtgtcatg tgttcttaac gagcaggcta ctaaaaaaat agatgcaata 1500 caattatcaa ataataccgt ttcacggcgc attaaagatc ttcgtactag catagagaaa 1560 gagttaattc gtcgncttgg actttgcagt gaatatgcat tgcaaatgga tgaatctaca 1620 gatgtcgcag gactagctgt actactggta tttgttcgtt acacgtatga taaaagtatt 1680 gaagaagatt tactcacgtg cgcatatcta catggaaaaa ctacgggaga agatatattc 1740 aactgtattg atgattacat tacaaaacaa gagataggct gggaaaagtg tatngatntn 1800 tgtacagatg gtgctcgngc aatggtagga aagatgaaag gtgttgtatc acgtatccng 1860 aaagtagcta caagagccac tagcagtcac tgtattttgc ataggcangc actgttgact 1920 aaaaaaattc caacaaattt aaaaaatgtt atggatgaag ccgtgaaaat gataaactat 1980 gtaaaatcna ggccacttca atccagatta tttaaaattt tgtgcgatga aatgggnagc 2040 ggacacaaag cacttttatt acatactgaa gtgagatggc tatccagagg aaangcacta 2100 ttaagacttt atgaacttcg aaatgaatta ttggtcttct tgaaggaaca aaatgattcg 2160 cggcacaatt ttgtcgaatt tttancaaat ccttcctggt tgatacanct tgcatatctg 2220 gcgaatattt tttcaaaatg caatgatgtt agcttatcat tacaaggaaa aagtattaca 2280 atttttaata cgagagataa aattgaatct tttgatagaa aaatacaatt ttggatttca 2340 tcagttgagt ctaataattt tgattgtttt cctgtcttag atgaatgttt gaatgcattg 2400 aattttcaag tngataaaga aatttcaaaa gtatttttag aacacctata taacttgaaa 2460 acagctattt cggaatattt tccgaaaaca agtaaggacg attcctgggt tcggaatcca 2520 ttttccgtga cagttaaacc agtgggctta agcgcaaggg attatgagca tttgattgac 2580 ttaattaatg actctgattt gaaagaaaaa tttaaagatc agccattaaa tgatttctgg 2640 gcgagtctga ctgaagagtt ccccaatgtc tcaaaacaag ccgtggtttg tttacttccg 2700 ttttccacga cgtatttctg cgaagtggga ttttctaaat attttgcaac aaaaacaaaa 2760 tacagaaata gactcgacgc cacggcagat atgagaatac aactgaccaa tattgtgcct 2820 gatataaaaa aaatttgtaa tttaagaacc caaaatcatc aaagtcatta ggagttttat 2880 ttatttgtat ttttatgttt atattanata tatntatatt tcgtttatat attaaataaa 2940 taatatttct ttaataacat tttttatttt tatttatact attatgaatt acccaaccta 3000 aaaaataagc taagtgttcc gtcaaaattt tcggatttgc caagtgttcc gtcgctgaaa 3060 aagtttggna aaccctg 3077 // ID GYPSM1_I repbase; DNA; INV; 25856 BP. XX AC AAWT01004332; XX DT 30-JUN-2007 (Rel. 12.06, Created) DT 14-JUL-2007 (Rel. 12.06, Last updated, Version 3) XX DE Gypsy-type LTR retrotransposon - internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; GYPSM1_I. XX NM GYPSM1_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-25856 RA Jurka J.; RT "GYPSM1: Gypsy-type sequence from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(6), 186-186 (2007). XX RN [2] RP 1-25856 RA Jurka J.; RT "Consensus sequence."; RL Direct Submission to Repbase Update (13-JUL-2007). XX DR [2] (Consensus) XX CC This element has clear hallmarks of a Gypsy transposons, but CC carries several very interesting protein motifs. It might have CC played a role as an "evolutionary tinkerer" in a relatively CC recent evolutionary history. The youngest copies are ~5% CC divergent from the consensus. LTR consensuses are ~96% identical CC to each other. The consensus sequence includes a PiggyBac-like CC insertion at the 3'-end, which was removed during curation. XX FH Key Location/Qualifiers FT CDS 3929..18547 FT /product="GYPSM1_I_1p" FT /note="Matches several motifs: AIR1, Smc, FT MATH_TRAF_C, reverse transcriptase and ankyrin FT repeats." FT /translation="MTTSINSNMTSIPDANTMLNNNNVIVGEFREKFRGEP FT FDGSLNKLDTFIKEFEVYKNICHWTDQKVKERLPLYLKGSAQDVFVEEARD FT VTKLTTWTEIKEFLIKIFGIEKKGNRKILDFLHRKQRRDESNAVYACELKK FT LCKEAFGEADLPEDKMVDVFIRGIKREDIRANLGCLAPETLDKAVAIANRC FT EAHLGLGYQVAVLATEPTTSTVNANNQNNNTNPAMNQFIPRAQQMFKPPAT FT RNPEDRGNNQVSYNNNFRNNPNANEKCSTCQRMGHRSENCFRNYNCQKCGR FT RGHTEKICRQLNSKACYNCGQQGHIARRCNIRGEPNQSRPNNNPMGNPQIN FT VMQEIDMLRETLQRMPMELKEMMQQMPTNIQQQRINVMRSNEEIQELRTRE FT YQEQQQAEWQRLQTEEAQEIQQRWENRKQTPKRINMMRMEIPDEIHTAEQC FT VENEGGGNMITDNEKTVELNTMEKIEAIKEFTEEEKQRRRQLKETTTEKPD FT EPETQEIKRAEMMITEDQNEETLMYEIDELENEIMTQKIESTLEKIKQEEA FT LEQEEDQLKEKEAILKQEKAELRKKALILKIDGALKHLEQEQALDQKIEVL FT AKKVLSQEMEEPKLELQKDEQEASTSQEKEKISEPIITKNRIQRILLQKIE FT TRIRKEKPTWSEAQIENRIKEIWFQTKGPTKIMKEEEQLHINMLRQIPDDQ FT TDEDESEISSLGGEIFQSCEENKPYKDESKSELEEMETNVPREDETEILKE FT INEIVEEVDRNQMFLNLRDTTAYSRIMKLIAKGYQQDNGTFTASWKDMIKK FT YQENLKEYDREEYQVIIDFPEIPRQIIATRGESTNELYDWVVKKLGPSYKN FT KPLYYQGRKIKRDIHLADALIWPKGINVVTLNKNETEETTAEIWQKIQILY FT NKEIKPGLEKERQKIMRQNQIKMTEQAIDQARKYINRLSLTRPISMKIFVE FT AVRDTDIMRFDNERTDELIAKYGEHQRDMRGDEPIKIKIKLPNDEIKEILV FT DPDCQMHELYRRINREIIQPIIESSDDGMVEDVAIMYHKRQISPRERTPIY FT ELGIWEDGQTINIIAMDEPERKNTTKREQQIQEFQEEMKRKMDQILTIYQL FT WRQDAKLSTKLQPIDDSRCRSTPWKKVYDEKTKEWKWKMERTEPKWRKYIP FT GEAPEDAANETTLELYEDEPYENLPEEMINEMEWQPEEDEEINDRARLAPY FT TWKIGGWAEKVKSAEEEKIIGITSHQFYTEIGGYRMKLRVYPNGDGVGKGK FT YLSVYFLLARGRYDGKLEWPFNRKVTISIAKPGKWEERHSEKLGVDPRSMI FT SQKPKSTENVAGGVPQFLLKKYVTPQFVKDDTIYITIEVETEREMKLRIQQ FT EQARPRISVLEKRIPGIKYLPTDIDDPVEEGLTRLTETQKCRELAAILRTG FT AKNQRRTVTDLAKWIIDSCNRVENLRNIVKEIEEYRRMYAWYPEQFREIIR FT KWKHTEWAAETIEKYSLKIMTPDFEPIEREFYITAKVSEVKKRIPYEPPFH FT LQYEGRSLDDDLEIGVTQMQLGLINILTVAKIAGRELNKREIKTKTPIGRQ FT ESLNTPEVPPAADEVIVITTGDEEILNDLNDIVDEETTRRIPESRDLFMTE FT NENDLITDEPDELVCGQRQPVEIIEPDTEEENKKKERRRRSSTGDEWDNWI FT NERGATTSKETADEQPKRKETGTAMEKKSRGRPKIDKPAPERKTKEQMQMD FT QLTTKLNKLSQVTKQLQGKLKGQEQGAKWINRLITDMLGLANPEIKRNLKL FT NNIGAEAKQLWAAIKPIINKIMDNKVRTTTKDPEELLLIINHLTKRSDDPN FT RYLVGMAYLMYNTIVIYRKRTPRYNPNGYEFAITDIIRKVIEPFILTQGGW FT QAFINLVSKDDQTEMMQTKLKELLTEFETTINASDPKTARKGMNAIITTIL FT IWMMCVKGVEPFVVYDCDNIKIGDKYSLKETEECKAANPGKLQTTATAVSY FT NVYQEVDFIKTEIKECSVTRKIVAFHCGHHSHSTIIEAGTETTVVSVTRNE FT CEEGFTTGRISIAGRVNVAAEEGKIKTTRVYAAGRITAGDGTCQGGEYTLL FT NQLVKGVVVIESYQVKLEKYQGYFDPDTQAMKKYPQCLATDRSCNTGMSTI FT VYHADTRTCRLTLLKSAIFDEVQGRVHSETETNTQKSALDELRRTGRIGKP FT TITTTIGSEVTPTVVISTDTTIGMRFIKGRMVSKCDEMVASTNYKGIFLSR FT TAIPNAKAQIDPKDVKLYLYINNKMDFLYHKGLASTEKIYYDLVKNDCILN FT REILKTKLAMAITNPDNAIPLLPLQEGYFGRIVGEVMYTYKCEKTIAKLPE FT NSTIPRDKCINELEIMHKGKIRFAQPVTRMINPEKFVPNMFSCSNVYGPLF FT EIRDGSWLQFPTRVIVAPPKIFGLTELASEAEFKPVDISQSGIYQTKDLER FT AREHLLFPAQRQAILSNIVTITGGNNYGEKPNYELLLSPDHFQHATANMMK FT AMWGRFLIFGQMMAGILGIILIIQIVRVIITQMLACFDIYKRERKINWKMA FT IGFLPFLAKTMVLHGHSKDIHKIKRLVGILTKMNGTDEEVGRKFKAIMRLE FT AEKTRMKRQKTSNWKRWINFKDDSSDNSDNDENQGGTQMLTQEQLKQWENR FT KTRHSGHYNHPPPVYECLKKAASSVERNMPVERHVPIPRSTTLRNIYPVPY FT EPNWADLVPRTSQRSRSVDQDPETRTIAMLQTLAGPTLTVPIEINGVEIKA FT MIDTGANISAIKMDKIPGRLWPCIEEANMDVITCSKESVAVIGKIWSIIEY FT KNVKINTYLAVIRRLSADCIIGTDLMPELLKEIIIDLGSMELRDKTGRICM FT LKSEVRCSEKIVVPGRTQMICYVKVNEETRGEMIFEPNHKFEKKKELPLAR FT EIVYVNDNSEIPINITNFDEEDKVIYENERLGKLTPMMDIELPTTKTEGTE FT EELWTIDSQLLNETEKEQLQKLLMEFKDIFAASDLDLGTSDVTQHTIPLTD FT DTPITLRPYRLAEAQKSEAEKEVQKMLDAGVIEPSCSPWQFPVVMVKKKDG FT TQRFCIDYRRLNAVTKRDTYPLPDINEMLQTLHGAAYFTSLDLKSGYWQIK FT VKEEDREKTAFTIGKDLYQFKKMPFGLTNAPASFQRCMNFVTHGIKQCMVY FT IDDLLIYSSTFEQHLKDIRNVFVRLRQWKLKLKPSKCDWAKEKVTFLGHIV FT SAKGKEPDPRNIEKIKNCPAPKTVTQVQEFLGLCGYYRKFIKSYATIAKPI FT QELTKKDTPFIWEKEQQTAFETLRDKLISAPILVHPDFKKPFLLATDASGY FT ASGAVLGQWDDEKRERVIGYYSRTFKKHEKNYSVTEREALAIIQAIKHFKY FT LLWGHEIYITTDHQPLVWLGQHKEASSRLMRWAMQLQEYSPYIKFKSGKAN FT ANADCMSRFVFEELMDHDVRRICMIIAEDIDFTKLRNEQKEDEELKTIIEF FT METNDMKIFDNNSKLKEHMEKHRHRYTMEKGWLLLMDGEKRLMCLPESRRK FT EVLMQYHDGKLGGHMSAKKTEARLRQKYYWPNIGEDVKGWIKNCLICATRK FT STGSKLKAPLKPMPIPPEPMTMIAMDVVGPLPETNDGNIYILVVTDYLSKF FT PEAFAIPDQKARTIARILVEEICCRYGTPKQILTDQGTNFMSEIIEEVTNY FT FRIAKLRTSPYHPQTNGQTERFNGILIEMMSNYVSRHQKDWDRYINLCLMA FT YRMSIHSATKMSPFKLMYGRECNMPIDLEYQPPISQYMEIDDYVTGFKERM FT QEIWKTAGMNIRFNQESYKELHDKKRDVREHNFQVGDWVIIATPEIIPGVA FT RKLQRCGKGPYEIMYVNENNARVKLVNNPLTRPIYVNVSRCKAIPRTITRE FT PENRGMQTRSQTQQLSINGLTMSRKIEATSNNTVEEPKPDESKITLNATQD FT NVDKMDIIVKAIREIMPEWVEGKGWMRIVNKAGKIAMDYAKGARSPPEMDR FT IKESIPMIFTMADVEGNVPLHIAVQHRNTQRIMDIIQLMQKCGHTSNIPNE FT EGQTPLMFAVAADASTTVLRELLKTGSSFKEKDNGENNVLHYALKSRQASA FT IKFITSNATIREREMMGKTIQTLWEAIDAKQDEATANLMKWTYEGSPLKVF FT NLNYMVRNRNVLLKETIYAHYPFPKTIKAMLKWEGLKFTNHPKEAIRIGDR FT KEQLLMIIHRWHRKQIRFQASKQEIKEMDDIKARIIKVYEEIIEIINVEDD FT ISKETPDGNNNGEGGGDNSKNQDDQNTRDENKSAQQEKKESSNEERQETSN FT SNNEGKRNKGMMLHNKKILQQRKNQNRPTLLQKIMILITSIKLTEADENIT FT KEIKSFRIRTINDKGISSKGICLQLTMNTSISCQFDSIGKPGKMEIRSENS FT SSCQTTKDVCRDATEDRNVKIQVKPGIGKLRIINRGIETIEQVCIQNKIID FT CWKMNMKAGKKCEWDSLEGNAEYIICANNDIIITCTLAKTLGEKFTTPKEE FT PYQRRARENKEIIIKVESDPKSGEEIIVENMGKEPITKVCVFETKKNGTIN FT ERIKCQSNEDIKNQKYRIPTKRKRGKYIVHVQKGRDDYWKSITRKLLESIT FT TKRPEKETKEEPYQRQIRESKAVIINIESNAANEEEILIENTGEKPITKIC FT VYEPKRNEPVNKRIQCETSKTIKNQKYRIPTKRKLGKYVINIQKGTHEHWR FT TITRKPPKSNATDQPIIKQSERKPRNETWKKGMIMDDSHLSIHEALQLQKA FT LKNVQPKEGIMETTNARENTTMQPNEKNYMIKMIWITTAVISSVVMLIIIQ FT LVRTTKCRKKPAKKRLWPAVEKFNYDIELGQYDAERTEEYEIIGNSSTSGL FT WLKEPPIIEAQQWSNESDVYQIMNEKYGDIRTLRIESPETMV" FT CDS 19246..20652 FT /product="GYPSM1_I_2p" FT /note="This protein contains a domain similar to FT MATH_TRAF_C." FT /translation="MQTIIERIRGSENPSETTRRIMKEKIRVVNDIKERLK FT CCKCKELLIGARQTKCGCRICTDCTIEVIETKTIKCPGCDETFERDEDPTT FT MDKATNKEINNLLVQCRVKECEWKGLMKEAESHANECEYNEILCEKCQTIT FT TKKTMKEHHKVCIKREEKCEFCGVIDEWIKIEMEHRNPKEEQICDLYEGIC FT ANNCDRKDKVKLKEHWNECIGKPVKCPLELFNCQAQLPIAEMQKHMIENSI FT EHTEIMMKTIGMLAWEVKSLRTENDAQKALIREILESRKNFERSQGNTLKK FT MAEMEKKMQQMTNSIKELETNSSIARKHPTAPFIWKITEFAEKLQRAKDGT FT QRILDSPPIYSWENGYKLGIRLYPNGDGNATGTHISVFIRVGRGDYDELLT FT WPFKQNMILTMKGQEDYRNPIRTGNEPQMGRPTTDWNTAIGTPRFYPQQRL FT RGEGFLREDNLYIKLEVEKPDKTL" FT CDS 22137..23063 FT /product="GYPSM1_I_4p" FT /translation="MNPNEMINQKIEKVVTEAEESKPAIKDQSESIKVTNE FT EVELWRQRTLQLKEMLLIAEEELVQETLKTKMMKEKWEKQLEQKITYPEGK FT EKTYQEREEEMMKEFMQNQTRIKNQRKEIRTLEQEKQDQQSEIERLREVIK FT DKEAEIREEGEETIKIIQQMQEEIGQISTQYRIEKTKRTHLEQAKEWAKRQ FT ARYWKTKWTEDREHSRSNWEARIRHYIQQIAEEEDRTRAYKAKWENAKMEI FT NKLRNQGPKPSKKIREPRDDRQERMLTQITDQARKWRIECRILENANAKQC FT KHIVKLEQEIEYGNKPE" FT CDS 20778..21965 FT /product="GYPSM1_I_3p" FT /note="Shows significan similarity to BIR: FT Baculoviral inhibition of apoptosis protein repeat FT domain; Found in inhibitors of apoptosis proteins FT (IAPs) and other proteins." FT /translation="MASGISEEITIQMREGPDSIWTRLWKKPEEEEKVNTL FT AAAIIYYKDYDGTRRPTQIIKAIYSQMRVKDEELRKEEQRYLTFKDNKVFT FT DRQSQEMAAQGFYFTGTPELPDEVTCIYCKGKLAKWEKGETAKREHLRKFP FT TCPFRINIDVRNQAMNRLEGLRQSELDKDEKGGEKKITIIKSFEDPAELIW FT EKETINRTFEERIKTFKGNPRAAEFASQGYCQTARGIMCEGCKNQQPIPDQ FT NVKKLKHKGKCMYKQEKESKNIEIIITNQQIEDILDRPKTRTIMLLSGVDT FT QKMRKIIKERLANNLEWYDSTQAIIRAIKRYQEKPKENRDKSMIKEAILEQ FT YQCVTCWQKHVEIIYIPCGHMVTCEACGLKVDICPVCRKPINGRIKGFRFA FT DQ" XX SQ Sequence 25856 BP; 11272 A; 4139 C; 5266 G; 5163 T; 16 other; tacaaatttg ggggctcagc cgggattgaa gctttgttat ttggaaagac caaaaccatc 60 aagttaagaa acaacagaga attctgtcag tacaccctca gccgggataa acataacttt 120 gttggaaaag gaccaaaaga aaccgtaata aaattaatgt cgggacgaca ttttrccgaa 180 ccccggagag atgaaatata gagtttaaat attgtaaaga tcgattgtaa agatcgatga 240 gaaattgtaa agatcgatga aaagcgatca acctgactac tagatacaaa tttgggggct 300 cataaaatga ctgtttttac aaagaaccca aggaatataa rcaaacgtgt aaggtttaaa 360 cagaaatatt gggacgacat tttgcaaaac cccagagaga tgagaacgtc atgaaacaaa 420 aacccagaga acaactgatg aaaaacgaac aacccgacta caagataacg atataaacaa 480 tgaaaaacaa ccacgctgac tacatctaaa acaaccaacc cattgataca acgattgttt 540 cacaaaaccc caaggaaaca agcggatatg aaagttacat agatagcaac gccaaaacat 600 tcaacatcca tcaatacttg acgacaccac acccttgtaa gataacaatg acaagacgca 660 taagcaaaca acatataatt aaaagggcaa aattcgggat aaaatgaaat ggggtaagta 720 aggggagtaa tgataaacaa gagaaagcaa cggaaccaaa tagaggaagc caacggtaat 780 agaataaagc acaacaggta atgagtaaaa tagtagttga taaaatgaat caaaatatat 840 taaactaaga gttgaatgaa tttagttaat tagtatttaa ataaatttaa agggtaatta 900 aaagagattt aataatttta gaaatggaac caacaaaaat tatagaaaaa ctaaagggga 960 tcctaaacat cgaaaatgac aatcttgagg aactaattaa taaacaaaag ttaattaaac 1020 gagctaaaga ggatatatca acgatttcaa acctgtactc cgacgaacat tgcatattaa 1080 taaaattgat cgatattgat aaagatgtaa caaccgaaag aatatatgaa attctatcca 1140 tgagattgga aataaccaaa tttgggaaaa gctggaacaa gttaatggaa aaagtcaaaa 1200 atattgaaga cgagattgca acaacaatga aagagcaaac caaggcttat ttggaatgtt 1260 atgcgtgcaa aacagaggaa gcacgagtaa tggtgatacc ttgcggacac ctgagttatt 1320 gtcatggatg cgcacaagag tcgcactgta tgttatgcca gcgaaagata aaggaaagac 1380 taacggttta ctttccatga attgttatga taaaatgaaa taaatgtgta ttatgcatta 1440 cttgaattat tattagatga taagacgaat ataaaagaaa tatgcaatgt ttgtaaataa 1500 taaaaccacc cgacaacgat aaataaaggt aatagaccaa atacatgaaa gccgaattag 1560 ggaaataatt aaaacacatg cacagagaca caaaatattg gacggaaaaa tttcgacaac 1620 cgagttttcc tagtgtataa ttctttttgt tttattgata ttatagtttt atcgatatta 1680 tattgagata ggacgaggaa aaatggttaa cctgagctat ctaatgatag tacgctcacc 1740 caagggaaac cctgatcaac cctactgaga aaggaactgg taggccaagt cgcgtatgtc 1800 agcaagtaac cggcgagaga ttgatcacct cttttgggga gcttgctgat gcgattacca 1860 caaattttga aggggggagt gaagagatag accgagaggt gagagaaaga gtgtacacgg 1920 aacagaatta atagttttga ttaaaggtac ctagagatga cttctttttt ccctagctgg 1980 tggtgaggca ccagaggaca gctacggact gtcgtgttca ggttgttatc agccaaggga 2040 ctgatggcgc tgagccgttc gatacaaaaa cccggatcac acaccgagta tcgaacgatt 2100 cttttttttc gctgggagga actcgataca acaacccgga tcccacaccg agtatcgggg 2160 gccaaacggt aggtgagatg tgaactaacc ggtaaatggt tcatactcaa aatgtcctac 2220 taaggccaga taagggcggc cgaattgagc cgggaaatct gacrggtcag ataagggcgg 2280 ccgaattgag cctggaaatc tgatkggaga ggcgggcaga gattagatag tttattttcc 2340 gcgcccatat ttaatcgctt ccttttcttt tcactattcg ggaaatcaag taaaccgctt 2400 ttggaacttg attgaacgtc taggcatttc catactttta tattctccga ggtacacaac 2460 ccggatccca caccgagtac cgagggaatg ttttttttat tgcccagcga ggagtttggt 2520 acaataaccc ggatcccaca ccgagtacca aatatccgaa tggcggtgag taaagcgtaa 2580 ccggaagaat gcgcggccaa aagacgccat ggcttgagag tcagtgtgag gcctcaaaca 2640 tacccaaaag tcactgatga gctcaagtcc ggtgaaacaa tgagagataa tatgaataag 2700 atgaatatga atatgagtaa agatgaaatt gatagaatat gattatgatt agatattaga 2760 ggcaatagag tgccggatga gatatgatta tgattagata ttagaggcca tagagtgcca 2820 gatgagatat gattatgatt agatattaga ggccatagag tgccggatga gatatgatta 2880 tgattagata ytagaggcca taagaatgat atgattatga ttagatayta gaggccatag 2940 agataagata tgatatgagt aaagatgaaa tatgattatg aatagatatt aaaggtcaca 3000 gatgaccagt aaaagagtta aaggccgaag gccgaagagc caacggaaat tatagagggc 3060 atagagcgcc agcgattgag gattaagtaa atgaattttt cgggagacat aggaaaaata 3120 attaatggat tgaatgagag tatgaatatg atgattttat attattgata ttatgatata 3180 gataattgat acgactatat agataactga taagactata tagataattg atacgagatt 3240 aatagataat tattacgagg tatagataat taaaaataga cttaaaatga ggggggtgag 3300 tcagataaag aagaccacca cgataaatgc gaacaaggga tgataacgta tatagtatgg 3360 taaacggagg agaaattagt atagagtata gaatattgga gtgttattta tttttaggaa 3420 tcaatagtca aacaaagcaa tccaagtcaa gtgcaaagca aagcaaagga gaaaatattt 3480 cagagcaaac gacaaagcaa agtgataata aagaaacttg aataaacgag cagaaaagaa 3540 tttggaataa acgtcagagt agttacaagc agataagcaa ccagccagaa atttcaaagc 3600 aaccaaagca agagaagaga aacgataaaa ycagtcaagc agacgaataa ccgtaaacat 3660 tgatagcatc agcaaccaca cgagaagcaa aagccaagag aatagagaaa aaagtttatt 3720 aaaagcagaa gaaaagaaaa aagttcgcaa cagtagaagt gtcaatacca gagtatcaac 3780 tacagcagca acaggaaccg gcacagatag cggattaacc gattttgaat ttaacagcag 3840 cgaagagaac gaattacgta atatcgaaaa ccagttggca acaatagaat cagtgaacga 3900 agaagagaat ctagacaaca atccaaacat gacaacttcc ataaattcca acatgacttc 3960 aattccggat gcaaatacaa tgttaaataa caataatgtg atagtaggag aattcagaga 4020 aaagtttaga ggagaaccgt ttgacggtag tctcaacaaa ttggatacat ttataaagga 4080 atttgaagtg tataaaaata tctgtcactg gacagatcaa aaagtaaaag agcgactacc 4140 cttgtatttg aaagggagcg cacaagatgt atttgtggaa gaagcgagag atgtaacgaa 4200 attgacaacg tggacagaga ttaaagagtt cttaataaag atattcggaa tagagaaaaa 4260 gggaaatagg aagatattgg acttcctaca tcgaaaacaa agacgagatg aatcaaatgc 4320 agtgtacgca tgtgaattaa agaaattgtg taaagaagcc ttcggagaag cagatttgcc 4380 cgaagataaa atggtagacg tcttcataag aggaatcaaa agggaagaca tacgagccaa 4440 ccttggatgc ttggcaccag aaacattaga taaagcagta gcaatcgcaa atagatgtga 4500 agcccatctg ggactgggat atcaagtggc agtcttggcc acagaaccaa caacatcaac 4560 agtcaatgcc aacaatcaaa acaacaatac caatccagct atgaatcaat tcattccaag 4620 agcacagcag atgttcaagc caccagcaac gagaaatccg gaagatagag gtaataacca 4680 agtaagttat aataataatt tcaggaacaa tccaaatgca aacgaaaaat gctccacatg 4740 tcaaagaatg ggacatcggt cagaaaattg tttccgaaat tacaactgcc agaaatgtgg 4800 tagacgaggg catactgaga aaatatgtag acaacttaat agcaaagcat gttacaattg 4860 tggtcaacaa ggacacatcg caaggcgatg taacataaga ggagaaccaa accaatcaag 4920 accaaacaat aaccctatgg gaaatccaca aattaacgtg atgcaagaaa tagacatgct 4980 acgagaaaca ttacaacgaa tgcccatgga attaaaagag atgatgcaac agatgcccac 5040 caatatacaa caacaacgaa tcaacgtaat gagaagcaat gaggagattc aagagctgcg 5100 aactagagaa tatcaagagc aacaacaagc tgaatggcaa agactacaaa cagaggaagc 5160 ccaggaaatc cagcaaagat gggaaaatcg aaaacaaacc ccgaaaagaa ttaatatgat 5220 gagaatggaa attccagatg aaattcacac tgcggagcag tgtgttgaga acgaaggggg 5280 aggaaatatg ataactgata acgagaaaac agttgaatta aatacgatgg agaaaataga 5340 ggccatcaaa gaatttacag aggaggaaaa acaaagacgt cgacaattaa aagaaacgac 5400 aacggagaaa ccggacgaac cagaaactca agagattaaa cgagcagaga tgatgataac 5460 ggaggatcaa aacgaggaaa cattaatgta cgagatagac gaactcgaaa atgaaattat 5520 gacacaaaag atagagagta cactcgagaa aataaagcaa gaggaagctt tagaacaaga 5580 ggaggatcaa ttaaaagaaa aagaagcaat cctaaaacaa gagaaagcgg aactaagaaa 5640 gaaggcacta atattaaaaa tagacggagc gttaaaacat ttagaacaag aacaagcact 5700 agatcaaaaa atagaagtgc tggcaaagaa agtgctatct caagaaatgg aagagccaaa 5760 attagaacta caaaaagatg agcaagaagc atcaacgtca caagaaaaag aaaagatatc 5820 agaaccaata ataacaaaaa atcgaatcca gaggatactg ctacaaaaaa tagaaacaag 5880 aatcagaaaa gaaaagccca cgtggagtga agctcaaatt gagaatagaa taaaagagat 5940 atggttccaa acaaaagggc caacaaagat aatgaaagag gaggaacaac tacatatcaa 6000 tatgctgaga caaataccag acgatcaaac agatgaagat gaaagtgaaa taagcagtct 6060 gggcggagaa atatttcaat catgcgagga aaataaaccg tataaagatg aatcaaaatc 6120 cgaattagag gaaatggaaa cgaacgtgcc aagagaagat gaaaccgaga tattaaaaga 6180 aattaacgag attgtagaag aagtagatcg aaatcaaatg ttcctaaacc tcagagatac 6240 aaccgcatac agcagaataa tgaaactaat tgcgaaagga tatcaacaag ataatggtac 6300 atttacggca tcatggaaag acatgataaa aaagtaccaa gagaatctga aagaatatga 6360 tagagaagag tatcaggtaa ttattgattt tccagagata ccaagacaga taatagcaac 6420 aagaggcgaa agcacaaacg agttgtatga ttgggtagtg aaaaagctag gacccagtta 6480 taaaaataaa cctttatatt atcaaggaag gaaaataaag cgagatatac acctagcgga 6540 tgctctaata tggccaaaag gaataaatgt ggtaacgtta aacaaaaatg aaaccgagga 6600 aacgacagca gaaatatggc agaaaattca aatattgtat aacaaagaaa taaaacccgg 6660 attagaaaaa gagagacaaa agataatgag gcaaaaccag ataaaaatga cagaacaagc 6720 aatagaccaa gctagaaaat atattaaccg attatcgtta actagaccga tctcaatgaa 6780 aatatttgta gaagctgtac gagatacaga tattatgaga ttcgataacg agagaacaga 6840 tgagttgata gccaaatatg gagagcacca aagagatatg agaggagatg aaccaataaa 6900 gataaaaata aagctaccaa atgatgaaat aaaagaaata ttggtagatc ccgattgtca 6960 aatgcacgaa ttataccgac gaataaaccg agaaattata caaccaataa tcgaaagtag 7020 tgatgatggt atggtagaag acgtggcaat aatgtaccac aaacgacaaa taagtccgcg 7080 agaacgaacc ccgatatacg aattaggtat atgggaagac ggacaaacca ttaatatcat 7140 agccatggat gaaccagaga gaaaaaatac aacaaaaagg gaacaacaga tccaagaatt 7200 tcaagaggaa atgaaaagaa agatggacca aatattaacc atctaccagt tatggagaca 7260 agatgcgaag ttgagcacaa aattacaacc tatagacgac tcgagatgcc gatcaacacc 7320 atggaaaaag gtgtatgatg aaaagacaaa ggagtggaaa tggaaaatgg agagaaccga 7380 accaaaatgg agaaaatata taccaggaga agcgccagaa gatgcagcta atgaaacaac 7440 cttagagtta tatgaagatg aaccatatga aaatctacca gaggaaatga taaacgaaat 7500 ggaatggcaa ccagaagaag acgaggaaat aaacgataga gctcgactag ccccatacac 7560 atggaaaata ggtggatggg ctgaaaaagt aaaatcggct gaagaagaga aaatcatcgg 7620 tattacaagt catcaattct atacagaaat cggaggatac cggatgaaat tgagagtata 7680 cccaaacgga gatggggtag gaaaaggaaa atatctatca gtgtattttc tattagcaag 7740 aggcagatat gacggaaagt tagaatggcc atttaatcga aaagtcacaa tttcaatcgc 7800 taaacccgga aaatgggaag agcgacacag cgaaaaattg ggagtagacc caagatctat 7860 gataagtcag aagccaaaat ccactgaaaa tgttgcagga ggagtaccac aatttctgtt 7920 gaaaaaatat gtaacacctc aatttgtaaa agatgatacg atatacatta caatcgaagt 7980 agaaacagaa cgagaaatga aattgcgaat ccaacaagaa caagcgagac cgcgaatcag 8040 tgtactcgaa aaacgaattc caggaataaa atatttgccc acagatatag acgatcctgt 8100 ggaggaagga ttaacacgat taacggaaac gcaaaaatgt agagagttag cagcgatatt 8160 gagaaccgga gcaaagaatc aaagacgaac tgtaacagat ttagcaaaat ggataatcga 8220 ttcttgtaac agagtagaga atctgcgaaa tatcgtaaag gaaatagagg aatatcggag 8280 aatgtatgcg tggtacccag agcaattccg agaaattatc cgaaaatgga aacacaccga 8340 atgggcagcc gaaacgatag agaagtatag tttaaaaatt atgacgccag atttcgaacc 8400 aattgaaaga gaattttata tcactgccaa agtaagtgaa gtgaaaaaga gaataccgta 8460 tgaaccgcca tttcatttac aatacgaagg acgatcatta gatgacgatc tcgaaatcgg 8520 agttacgcaa atgcaattgg gattaataaa tatattaaca gtagcaaaga tcgcgggtag 8580 agaactcaat aaaagagaaa tcaaaacgaa aacgccaata gggcgacaag aaagtctaaa 8640 tacgccagaa gttccacctg cagcagatga agtaatcgtc atcacaacag gagatgaaga 8700 gatactaaat gatctaaatg atattgtgga tgaggaaaca acccggagaa taccggaatc 8760 aagagatttg ttcatgacag aaaatgaaaa cgatctgata acagatgaac cagacgagct 8820 agtatgtggt caaagacaac cagtcgaaat tatagagcca gataccgaag aggaaaataa 8880 aaagaaagag agaagacgcc gaagctcgac aggagacgaa tgggataact ggataaatga 8940 acgaggagct acaacatcca aagaaacggc agacgaacaa cccaaacgaa aagaaacagg 9000 tacggcaatg gaaaagaaat ccagaggacg accaaaaatt gataaaccag ctccagaaag 9060 gaaaactaaa gaacaaatgc agatggatca attgacaacg aaattaaaca agcttagtca 9120 agtaacaaag cagttacaag gaaaactgaa aggccaagaa caaggagcaa agtggatcaa 9180 cagattaata actgatatgc tgggattagc aaatccagaa ataaagagaa atttaaaact 9240 taataatatc ggagcagaag cgaaacagct ttgggcagca ataaagccaa taataaataa 9300 aatcatggac aataaagtac gaaccacaac caaagatcca gaggaattac tgttgataat 9360 aaatcactta acaaaacgat ctgatgaccc aaatagatat ttggtaggaa tggcctatct 9420 gatgtacaat acgatagtga tatacagaaa gcgaactcca agatacaacc caaacggata 9480 tgaatttgca ataacagaca ttatcaggaa agtaatcgaa ccattcatcc tgactcaagg 9540 tggatggcaa gcctttataa atttggtaag taaagatgat caaactgaga tgatgcaaac 9600 aaagttgaaa gagttattaa cggaattcga aacaacgatt aatgcaagcg atccaaaaac 9660 agctagaaaa gggatgaatg cgataatcac tacaatactc atatggatga tgtgcgtaaa 9720 aggagtagaa ccgtttgtag tatatgattg tgataacata aagattggtg ataaatactc 9780 actaaaagaa acggaagagt gcaaagcagc aaaccccggt aaattacaga caacggcaac 9840 agcagtatcg tataatgtat atcaagaagt agatttcatt aaaacggaaa tcaaagaatg 9900 ctcagtaaca aggaaaattg tagcatttca ttgcggacat cactcacatt caacaattat 9960 agaagctgga acggaaacaa cagtggtatc agtaacgcgg aacgaatgtg aagaaggatt 10020 cacgacagga cgcatctcaa tagctggacg agtgaatgta gcagccgagg aaggaaaaat 10080 aaaaactacc cgagtatacg ctgcgggaag aataacggct ggagacggaa cgtgtcaagg 10140 aggagagtac accctattga atcaattagt caaaggagta gtagtgattg aaagctatca 10200 agtaaagttg gagaaatacc aaggatattt tgatccagat acgcaagcaa tgaaaaagta 10260 ccctcaatgt ctagcaacag atagatcatg caatacggga atgtcaacga tcgtatatca 10320 cgccgatacc agaacttgta gactcacctt attgaaaagc gcaatatttg atgaagtgca 10380 aggacgagtg cacagtgaaa cagaaactaa cacgcaaaaa agcgcattag atgaattacg 10440 gagaaccgga agaatcggaa aaccaacgat aaccacaact atcggatcag aggtcacgcc 10500 aacggtagta atatcaactg ataccacaat cggaatgaga tttataaaag gacgaatggt 10560 gagtaaatgt gacgaaatgg tagccagtac caattataaa ggaatattct tatcgcgaac 10620 agcaataccg aacgcaaaag cgcaaataga ccccaaagat gtgaaactat atctctatat 10680 aaataacaag atggatttct tatatcacaa aggattggca tcaacagaga aaatatacta 10740 cgatttagtt aaaaatgatt gtatcttgaa cagagaaata ttaaaaacaa aattggcaat 10800 ggcaatcacg aatccagata atgcaattcc attattacca ctccaagaag gatattttgg 10860 aagaatcgtt ggagaagtta tgtatacata caaatgtgaa aagacaattg cgaaactgcc 10920 agagaattca acaattccaa gagataaatg cataaacgaa ttagagataa tgcataaagg 10980 aaaaatcaga ttcgctcaac cagtaacaag gatgataaat ccagagaaat tcgtacctaa 11040 tatgttcagt tgcagtaatg tatatggacc attattcgaa ataagagatg gaagttggtt 11100 acaatttcca actcgagtaa tagtagcacc accaaaaata tttggactaa cagaattggc 11160 cagcgaagca gaattcaaac cagtagatat atcacagagt gggatatacc aaacgaaaga 11220 tctggaacga gccagagaac atttattatt tccagcacaa aggcaagcaa tattatccaa 11280 cattgtaact ataacgggag gtaacaacta tggagagaaa cccaactacg aattgttact 11340 atcaccagac cattttcaac atgctacagc aaatatgatg aaagccatgt ggggaagatt 11400 cttaatcttc ggacaaatga tggcaggaat tttaggaata attctcatca ttcaaatcgt 11460 aagagtaatc ataacacaaa tgttagcatg tttcgatatt tacaaaagag aaagaaaaat 11520 caactggaaa atggcaattg gattcttacc gtttttagca aagacaatgg tattacatgg 11580 acactcaaaa gatattcaca agataaaaag attagtagga attctaacta aaatgaatgg 11640 taccgatgaa gaagtcggac gaaaatttaa agcaataatg agattagaag ccgagaaaac 11700 cagaatgaag cgacaaaaaa cgagcaactg gaaacgatgg ataaatttca aagatgatag 11760 cagcgataac agcgacaacg acgaaaacca aggaggaaca cagatgctaa ctcaagagca 11820 actcaaacaa tgggaaaatc gaaagacaag acatagtgga cattataatc atccaccacc 11880 agtatatgaa tgtctaaaga aagcagcatc atcagtagag agaaacatgc cagtagaaag 11940 acacgtaccg ataccaagat caacaacatt acgaaatatt tatccagtac catacgaacc 12000 aaattgggca gacttagtac caagaaccag ccaaagatcg agaagtgtgg atcaggatcc 12060 agaaactcga actatagcta tgttacaaac attagctgga ccaacattaa cggtacccat 12120 agaaataaat ggggtagaaa taaaagcaat gattgatacc ggagcaaaca tctcagcaat 12180 aaaaatggat aaaatacccg gaagattgtg gccatgtata gaagaagcta acatggatgt 12240 aataacttgc agcaaagaat cggtagcagt catcggaaaa atctggtcga taatcgaata 12300 taaaaatgta aaaatcaata cgtatttagc agttatcagg agattgagcg cagattgtat 12360 aattggaaca gatctcatgc cagaattatt aaaagagata ataatagatc taggatcaat 12420 ggagctgaga gataaaaccg gacgaatatg tatgctgaaa tcagaagtga gatgctcaga 12480 gaaaatagta gtgccagggc gcacacaaat gatttgttac gtgaaagtga acgaggaaac 12540 tagaggtgaa atgattttcg aacctaatca caaatttgag aaaaagaaag aactgccatt 12600 agccagagag atagtgtatg taaacgacaa cagtgaaatt ccaatcaaca ttacaaattt 12660 tgacgaggaa gataaagtca tttacgaaaa cgaaagattg ggaaaactaa caccgatgat 12720 ggatattgaa cttccaacaa caaaaactga aggaactgag gaagaattat ggacaatcga 12780 cagtcagcta ctaaacgaaa ccgagaaaga acaattacaa aaactgctaa tggaatttaa 12840 agatatattt gcagcatctg acttagatct cggtacaagc gatgtaacgc aacacacgat 12900 accattaaca gatgataccc caataacact acggccatat cgattggcag aagctcaaaa 12960 atcagaagca gaaaaagaag ttcaaaagat gttagacgca ggagtaatcg aaccaagttg 13020 ctcgccatgg caattcccag tggtcatggt aaagaaaaaa gacggaacac aacggttctg 13080 tatagattat cgacgactca acgcagtaac aaaacgggat acatacccgt taccagatat 13140 aaatgagatg ttacaaacgt tacacggagc agcttatttt acatcgctag atttaaaaag 13200 tggttattgg caaataaaag tcaaagagga ggacagagag aaaactgctt ttacaatcgg 13260 aaaagatcta taccaattca agaaaatgcc tttcggatta actaacgcac cagcttcatt 13320 ccagagatgt atgaactttg taactcacgg cattaaacaa tgtatggtat atattgatga 13380 tttattaatt tattctagca ccttcgagca acatttaaaa gatatccgta atgtattcgt 13440 tcgactgaga caatggaaat taaaattgaa accctccaaa tgcgattggg caaaggaaaa 13500 agttaccttc ttaggacata ttgtgtcagc aaaaggcaaa gaaccagatc caagaaacat 13560 cgaaaagata aagaattgcc cagcaccaaa gacagtcacg caagtgcaag aattcttagg 13620 cttatgtgga tattatcgaa agttcatcaa gagttatgca accattgcga aaccaataca 13680 agaattaacc aaaaaagata ccccgtttat atgggaaaag gaacaacaaa cagcctttga 13740 aacactcaga gataaactga tatcagcacc aatactcgta cacccagatt ttaagaaacc 13800 gtttttgtta gcaacagacg caagtggata tgcatctgga gcagtcctag gacaatggga 13860 tgacgagaaa agagaacgag ttattggata ttatagtcga acattcaaga aacatgaaaa 13920 gaattactct gtgactgaaa gagaggcatt agcaataatc caagctatta aacatttcaa 13980 atatttatta tggggtcacg agatatatat cactacagat catcaaccat tggtatggtt 14040 aggccaacac aaagaagcct cgagtagatt aatgagatgg gcaatgcagt tgcaagaata 14100 ctcaccgtat ataaaattca aatcaggaaa agccaacgcc aacgcagatt gtatgtcaag 14160 atttgtattt gaggaactaa tggatcatga tgtaagacga atatgcatga tcatcgcaga 14220 agatatcgat tttacaaaac tgagaaatga gcagaaagaa gacgaggagt taaaaacaat 14280 aatcgaattc atggaaacta atgatatgaa aatcttcgac aacaacagta aactaaaaga 14340 gcatatggaa aaacacagac atcgatatac catggagaaa ggatggttat tattaatgga 14400 tggcgagaaa cgacttatgt gtttaccaga aagcagacga aaagaggtac tgatgcaata 14460 tcacgatgga aaacttggag gacacatgtc agcaaaaaag acggaagcaa gactgcgtca 14520 aaaatattat tggccaaata tcggagagga cgtaaaagga tggataaaga attgtctgat 14580 atgtgcaacc agaaaaagca caggaagtaa attgaaagca ccgttaaagc caatgccaat 14640 tccaccagaa ccaatgacga tgatcgcaat ggatgtagta ggaccattac cagaaaccaa 14700 cgatggaaat atctatatat tagtagtgac agattattta tcaaaattcc cggaagcatt 14760 cgcaatacca gatcaaaaag ctagaacaat agcgagaatc ctagtagagg aaatatgctg 14820 tagatacgga acacccaaac agatcctaac tgatcaagga accaacttta tgagcgaaat 14880 catagaggaa gtaacgaatt acttcagaat tgcgaagttg agaacatcac cataccatcc 14940 acaaacaaat ggtcaaaccg aacgattcaa cggcatatta atagaaatga tgtcaaatta 15000 tgtaagcaga caccaaaaag attgggatag atatataaat ctatgcctta tggcttatag 15060 aatgtcaata cattcagcaa caaaaatgag tccattcaaa ttaatgtacg gcagagaatg 15120 taacatgccg atagacttag aatatcaacc accaatatca cagtatatgg aaattgatga 15180 ttatgtaaca ggattcaaag agcgaatgca agagatctgg aaaacagcag gtatgaatat 15240 tagatttaat caagaaagct ataaagaatt acatgataaa aagcgagacg tacgcgagca 15300 taactttcag gttggagact gggtaataat cgcaacgcca gaaattatcc caggagtagc 15360 cagaaaatta caaagatgtg gtaaaggacc ttatgagata atgtatgtaa acgagaacaa 15420 tgctcgagtt aaattggtca ataacccact cactagacct atttacgtaa acgtgagcag 15480 atgtaaagcg ataccgagaa caataacaag agaaccagag aatcgaggta tgcaaaccag 15540 atcacaaact caacaactga gtataaacgg actaacgatg agtcgaaaaa tagaagcgac 15600 atcaaataat accgtagagg aacctaaacc ggatgaatcc aaaataactc taaacgcgac 15660 ccaagataat gtagacaaga tggatataat agtaaaagcc atcagagaaa taatgccaga 15720 atgggtcgaa ggaaaaggat ggatgagaat agtaaacaaa gcgggaaaaa tcgcaatgga 15780 ttacgctaaa ggagctagaa gtccaccaga aatggatcgt atcaaagaaa gtatacccat 15840 gatcttcact atggcagatg tagaaggaaa cgtaccttta catattgcag tacaacacag 15900 aaacactcaa aggataatgg atataatcca acttatgcag aaatgcggac acacctcaaa 15960 tataccaaac gaagagggac aaacaccatt aatgttcgca gttgcagcag atgcttctac 16020 aacagttctg agagaactac taaaaacagg atcatcgttc aaagaaaaag ataatggaga 16080 aaataatgta ctacattatg cactgaaatc cagacaggca agtgcaataa aatttataac 16140 cagtaatgcg acaattagag agagagaaat gatgggtaaa acaatccaaa cattatggga 16200 agctatcgac gccaaacaag acgaagcaac agccaatctg atgaaatgga cctacgaagg 16260 ttcaccactt aaagtattca atttaaacta tatggtaaga aacagaaacg ttctattaaa 16320 agaaacaata tatgctcact atccatttcc aaagacaata aaagcaatgt tgaaatggga 16380 aggactgaaa ttcaccaatc atccaaaaga agcaatacga atcggagata gaaaggaaca 16440 gttattgatg ataatacaca gatggcatcg aaagcaaata agatttcaag cgagcaaaca 16500 agagataaaa gagatggacg atataaaagc aagaataata aaagtatacg aggaaataat 16560 tgaaataata aatgtcgagg acgacatttc aaaggaaacc ccggatggaa ataataatgg 16620 agaaggaggw ggtgacaaca gcaaaaacca agatgatcaa aatactagag acgaaaataa 16680 atccgcacaa caagagaaaa aggaatcatc caacgaggaa cgtcaagaaa cgagcaacag 16740 caataacgag ggaaaacgaa acaaagggat gatgttacat aataaaaaga tactacaaca 16800 aaggaaaaat caaaaccgac cgaccctgct acaaaaaata atgatattga ttacgtccat 16860 caagttaacg gaagctgacg aaaatatcac gaaggaaata aaatcgttca gaatacgaac 16920 aataaatgat aaaggaattt caagcaaagg aatatgctta caactgacaa tgaatacaag 16980 cataagttgt caattcgatt caataggaaa accaggaaag atggaaatac gatccgaaaa 17040 ctcatcatca tgtcaaacca ccaaagatgt atgtagagat gctacagaag atagaaacgt 17100 aaaaatccaa gtaaagccag ggataggcaa attacgaata ataaacagag gaatagaaac 17160 aattgaacaa gtatgtatac aaaacaaaat tatcgattgt tggaaaatga atatgaaagc 17220 aggaaagaaa tgtgaatggg atagcctaga aggaaatgcg gaatatataa tatgtgccaa 17280 caatgatata ataataacct gcacattagc caaaaccctg ggagaaaaat tcacaacacc 17340 aaaagaagag ccttatcaac gtcgagcacg agaaaataaa gaaataataa taaaagtaga 17400 atcagaccca aagagtggag aagagataat agtcgaaaat atgggcaaag aacccattac 17460 aaaagtttgt gtattcgaaa ctaaaaagaa cggaacgata aatgaaagaa tcaaatgtca 17520 atcaaatgaa gatatcaaaa atcaaaaata ccgcattcca actaaaagaa aacgtggaaa 17580 gtatatagta cacgttcaga aaggaagaga tgattactgg aaatcaataa caagaaagtt 17640 actagaatca attacgacaa aacgtccrga aaaagaaaca aaagaggaac cgtatcaacg 17700 acagatcaga gaaagtaaag cagtaataat aaacatagaa tcaaatgcgg caaatgaaga 17760 ggaaatatta attgagaaca caggtgaaaa acctatcaca aaaatctgcg tatacgaacc 17820 taaaagaaac gaaccagtaa acaaaagaat ccaatgtgaa acaagcaaaa ctataaagaa 17880 tcaaaagtat cgcatcccaa cgaaaagaaa actsggaaaa tatgtaataa atatccaaaa 17940 gggaactcat gaacattgga gaacgataac aaggaaacca cccaagtcaa atgcgacaga 18000 tcaacctata ataaaacaat ctgagagaaa accaagaaac gaaacatgga aaaaaggaat 18060 gatcatggac gacagccacc tttcaattca cgaagcatta caactccaga aagcacttaa 18120 aaatgtacaa ccaaaagaag gaataatgga gacaaccaat gcaagagaaa acacaaccat 18180 gcagcctaat gaaaagaatt acatgattaa aatgatctgg ataacaacag cagtaataag 18240 cagcgtagta atgctgataa taattcaact agtaagaact accaaatgca gaaagaagcc 18300 agcaaagaaa agattatggc cagcagtaga aaaattcaat tacgatatcg aattaggaca 18360 atacgatgca gaaagaactg aagaatatga aatcatagga aattctagta caagcggact 18420 ctggttaaaa gaaccgccca ttatcgaagc acaacaatgg tcaaacgaat ctgatgtata 18480 ccagataatg aatgagaaat atggggatat acggacattg cgtatagagt ccccggagac 18540 gatggtgtaa aacgaaaaag agacgttgta tatagtgtaa aatattatgt agtataatag 18600 tagtaatagt atgtatattt atatataaat attgtacaaa atgcgttaga aacagtaatc 18660 aaacagaaat gaatgccgat gaattaatga tgtattataa ccaacagatg gatccgaata 18720 acgtaaatca acaaaatcaa gtgagaatac caccgttcac agaaggaata caacaaccag 18780 cagctgccaa tgtacctata gagattgtag aattaaatca aggaatatac caagatcaag 18840 gaatcgaggc aacatggaga cgacaatgcg aattaatgac actaataaca aacaatgcgc 18900 aaactcagtg gacaaatact catctagtat taacggaaag tgtgaacaga ggtctacaac 18960 aagccataag actagaagag caacaaagaa ctatactaca attacagaga ataaatgaac 19020 aattagcaga tcaagttaga tggcaagctg acgaaattag aagactacgt aacaatcaac 19080 agcacgtaga tccagcaagt tgggtggaat ggatagcatc aatgccagaa ttcaccccag 19140 aaccggattc aacagtagag taaaaataaa gcgtaccgtg taaatatgta taaataaaac 19200 gatagaagta aagcaacgtc aagtaatcag aaaatttttg tagatatgca gacaatcatc 19260 gagagaatta gaggaagcga gaatccatcc gaaactacga gacgaataat gaaagaaaaa 19320 atcagagtgg tgaacgatat aaaagaaaga ctaaaatgtt gtaaatgtaa agaattactg 19380 attggagcga gacagacgaa atgtggatgc agaatatgta cagattgtac gatagaagtg 19440 atagaaacaa aaacaataaa atgccctgga tgcgacgaaa cgtttgaaag agatgaagat 19500 cctacaacaa tggataaagc caccaacaaa gaaatcaata atctattagt tcaatgccga 19560 gtaaaagaat gtgaatggaa aggacttatg aaggaagccg aaagtcacgc aaacgaatgc 19620 gaatataacg agatattatg cgaaaaatgt caaaccatca caacaaaaaa gacgatgaaa 19680 gaacatcaca aagtgtgcat caagagagag gaaaaatgcg aattctgtgg agtcatcgac 19740 gaatggataa aaatagaaat ggaacatcgg aatccaaaag aggaacaaat atgtgatcta 19800 tatgaaggaa tatgcgcaaa caactgcgac aggaaagata aagtcaaatt aaaagaacat 19860 tggaacgaat gtatcggcaa accagtcaaa tgtccgttag aattattcaa ttgccaagca 19920 caactaccaa ttgcggaaat gcaaaagcac atgatagaaa atagtatcga gcataccgaa 19980 atcatgatga aaacgatagg aatgttggca tgggaagtaa aaagcctacg aacagaaaat 20040 gatgcacaaa aggcactaat aagagaaata ttagaatcca gaaagaattt cgagagaagc 20100 caaggaaaya cactcaagaa aatggccgaa atggaaaaga aaatgcaaca gatgaccaac 20160 agcataaagg aactagaaac caacagttct atagccagaa aacatcctac agcaccattt 20220 atatggaaaa taaccgaatt cgcagaaaag ctccagagag caaaagatgg aacacagaga 20280 attctagaca gtccaccaat atacagttgg gaaaatggat acaaattggg cattagacta 20340 tatccaaatg gagatgggaa tgcaacaggc acccatatca gtgtcttcat cagagtagga 20400 agaggagatt atgatgaatt attaacatgg ccttttaaac aaaatatgat attaacaatg 20460 aaaggacaag aggattatcg caacccaatc agaacaggaa atgaaccaca gatgggaaga 20520 ccaacaaccg actggaatac agcaattggt acaccacgtt tctatccaca acaacgattg 20580 agaggagaag gattcttgcg agaagataat ctgtatataa aattagaagt agagaagcca 20640 gacaagacgt tgtaaagaga aaaagaaagt gtaaataaaa tgtaaataaa ataataaaaa 20700 tcaaataaaa atcaaaaaat tacaaataat cagaaaaatc agaaaaacaa aaattcaaaa 20760 ataatgacga tatttagatg gcaagcggaa tcagcgaaga gataacaata cagatgagag 20820 aaggaccaga ttcgatatgg acgagactat ggaaaaaacc agaagaggaa gagaaagtta 20880 atacactagc agcagcaatt atatactata aagactacga cggaacaaga cggccaacgc 20940 aaatcataaa agcgatatac tcacaaatga gagtgaaaga cgaggagctg agaaaagagg 21000 aacaaagata tttaacgttt aaagataata aagtatttac cgaccggcaa agtcaagaaa 21060 tggccgcaca aggattttat tttacgggaa cgccagaact accagacgaa gtaacatgca 21120 tctattgtaa aggaaaactc gcaaaatggg aaaaaggaga gacagcaaaa cgagaacacc 21180 tgaggaaatt tccaacctgc ccatttagaa ttaatattga tgtcagaaac caagcgatga 21240 acagattaga aggacttcga caatcagaat tagacaaaga tgaaaaggga ggcgagaaaa 21300 agataacgat aataaagagt ttcgaagatc cagcagaatt aatatgggaa aaagagacta 21360 taaatagaac attcgaagag agaattaaaa ctttcaaagg aaacccaaga gcagcagaat 21420 ttgcgagtca aggatattgc caaacagcca gaggaataat gtgcgaagga tgtaaaaatc 21480 aacaaccaat accagatcag aatgtcaaaa agctaaaaca taaaggaaaa tgtatgtata 21540 aacaagaaaa agagtcaaag aatatcgaaa taataattac caaccagcaa atagaagata 21600 tattagatcg accaaaaacg agaaccataa tgctattatc aggagtagat actcaaaaaa 21660 tgaggaaaat aatcaaagaa aggttagcta ataatttaga atggtacgac tcgacacaag 21720 caataatccg agcaatcaag cgatatcaag agaaacctaa agaaaatcgt gacaaatcaa 21780 tgataaaaga agctattttg gagcagtatc aatgtgtgac atgttggcaa aaacatgtag 21840 aaataattta tattccttgt ggacatatgg tcacatgtga agcatgtggg ttaaaagtcg 21900 atatatgtcc agtatgtaga aagccgatca acgggcggat caaaggattc cgtttcgctg 21960 atcagtaaat agaaaacgaa aacaatgtgt aaaaagtgta aataaagata aacgtaaaaa 22020 caaaaaatta taaaaactca aaaatcacaa aaatacgaga tgatgtaaat tatgataaaa 22080 ttttgagatg attgaaaaga gagaataatc gattattaat atttagaaat acaattatga 22140 atccaaacga gatgataaac caaaagatag agaaagttgt aacagaggct gaagagtcaa 22200 aaccagcaat caaagatcaa tcagaatcaa taaaagtaac aaatgaggaa gtagaattat 22260 ggagacaaag aacgttrcaa ttaaaagaaa tgctactaat cgcagaagaa gaattagtgc 22320 aagagactct aaaaacgaaa atgatgaaag aaaagtggga aaaacaacta gaacaaaaaa 22380 taacttaccc agaaggaaaa gagaaaacgt accaagagcg agaagaggaa atgatgaagg 22440 aatttatgca gaaccaaact agaatcaaaa accaaaggaa agagatacga accctagagc 22500 aagagaagca ggatcaacaa agtgaaatcg aaagattgcg agaagtgata aaagataaag 22560 aagcagagat acgagaagaa ggagaggaga cgataaaaat tatacaacaa atgcaggagg 22620 aaattgggca aattagcacc caatatcgaa tagagaagac aaagagaacc catttggaac 22680 aagccaaaga atgggcaaaa cgacaagcac gatactggaa aaccaaatgg acagaagata 22740 gagaacattc aagaagtaat tgggaagcac gcattagaca ttatattcag caaatagccg 22800 aagaggaaga tagaacaaga gcgtacaaag ctaaatggga gaacgcgaaa atggaaatca 22860 acaaactgag aaatcaaggg ccaaaaccat caaagaaaat tagagagccc agagacgata 22920 gacaagaaag aatgctgact cagataactg atcaagcaag aaaatggcga atagagtgta 22980 gaatattgga aaacgccaac gcaaaacaat gtaaacatat agtgaaatta gagcaggaaa 23040 ttgagtatgg caacaaacca gaataaaaga gatgcaaatc gagcaggacg aagctcaccg 23100 acaaagacaa atacaacaaa taaaaacgga aaacaaagaa aattggatca aattagaagc 23160 aagaaaatgg aaggaccagt accaagaagc caagaaagag ataagactgt tacccaaaaa 23220 gatattgaat acggaagatt gggagctatg ggaacaagtc gaagagcaga aacaaacaat 23280 cgasgaacta aacgagcaaa tccaaacatt agaattccac gaacgaatca gacaraatac 23340 aataagaagg ctcagaaaaa ccgaaaagga ccacagaaaa tggattgatc agttaaatca 23400 gaagataccg gaaaccttaa acgaacaaac ggatgaaaga cgacaaaacc aacaattaaa 23460 cgagaggata acagaactcc agcaggaggt gaacggacac cgaggagttg cagtatttaa 23520 agaaattcag ttgagacatc aacacgaagt caacgaacag ttgaaagaac aagttgaagc 23580 agagataaaa acgataaagc aaggatatga ggaaaacgaa cagaatctga gaaaagagat 23640 tgaaaagtta aaagcgttgg aaaacacgca tcgattggaa acgcaagtag aagaattaac 23700 agaaaataca gaatatcttg cctatctctt agacgaagag atacaaatat caaacgcctt 23760 gcgtgaagag ataaatacct tgcaaaaccg accaccaatc ctgcttatgc cgttataccc 23820 aactgaaatt ccaacccaag tacccgccga acccgatgaa ggattcgggg agacggacga 23880 aacgatagtg taaayctaaa gaagaaaaat tgtaaaaagt gtaaaaagat aataaaaata 23940 ataaaaaatg ataaaaacaa aaaattgtaa tcgatgtata tttagcatga attcgaatga 24000 aatagaaatc cgaatgaatc tagaagacgg agaatacgta ccaagcccaa ttcaattagt 24060 agaaccaatc gagcaagacc cactgaggta ttatagtaaa aggtggagag ccgaaagaaa 24120 agaatggcaa gagttagtgg aggagcaaga ttcagaaatc gcccggataa atcaacagtt 24180 aatacaatca aaacaactag tcaatatgat aaaacgaaaa ctaacatcca gcatgggaga 24240 aaacagagat caaaagagag aaaatgaacg attgagacga gaattagccg aaataaaaac 24300 tcgtgtaatc ctggaagatt attggaaaac cgaagcagaa cgaatgagag acagcagaaa 24360 aagcgaacag agatatgtaa tacaactaga agacagttgt aaagaagcca gagagataaa 24420 agaaagtcat caacaaactc aggacgaaat taaaacatta acagaacagc tccaggaagc 24480 gagacgaaag ctggaaaaac aaacaaccaa acgagtcaat tataaacgtc aactcaagag 24540 aaaacaaaca aaagaaagaa atcttcaaat gaaaatctat aatcaacaaa gaataatcaa 24600 agaaaacaaa gaacgactaa aagaactccc aaaactaacc gagcgattaa taaaagccca 24660 agtcctgata atgagattct cagaaaataa aattaaagag gagtgcgaga gacactcaac 24720 cccagaaaaa gagaaagaat aaaaagggct aaaactccac tgtatatagt tgaaataaaa 24780 ttgtaaataa atataaaaat tgattgatat atattagaaa tgaaaactat gaacccaaag 24840 ctatggatca caataataat gatgataata atgagaaagt gtaatactga aaacccgtta 24900 gaaaaatgcg agaaatccca acccttcaga agagaagaca gcacctgtgc cgatttcaaa 24960 gaaatttgta gtaaccccga taatgaagaa tgtcaacaaa taaacctcca atgtatacaa 25020 tgccccacaa aagaaaacaa ggacgagttg aaacagaata tcgagaaaac ccaaataatg 25080 gtacaaaccc tactgaaaaa taatgaagag atagccaata taaaaaatga gaaccgagag 25140 ataaagaaaa ccctagaaac cctcatcaga caaattcaaa gaaaagccaa agcaaatccc 25200 aataataaac caatccaaga aattcaaaaa atcgtcctat caatcaaaaa gcaacaagga 25260 aaacaaatgg gacagaaaca gaaaaatgaa cccagtacaa caagtctatc aatcgaactt 25320 atactaccga ttagtataac atctaacata atactcctaa tatgctgtct catcctcggc 25380 ctaacaaaaa acaagcaaag gcaaaatacc caaattcaaa aattcagaag aatgttcacc 25440 caaattcgaa aaattaacaa aaccaacgaa acgacaagac tctcgcagat ttccatcaat 25500 cacccttcaa gatcaaccca gctcgaaagc gaaacccaag ccgtcagtga ttcattcgaa 25560 aacaaatata actgagttac agtaactgat attgatttga actgattttc ctaactcgta 25620 ctgatctaca ttgatttgaa ctgatattat aaaacttgca ctgatattaa aactgaacta 25680 tccgatttta tttgattatg tattgtagcc ggaccttaag tccatcttta tcttgatgat 25740 ttaaccttac cgtagaaact ttttatgaat gtgaatattg aaaacggaat tgtatgtaty 25800 attttgaaat caattgaaat tgcgggtcgc aattttacgg acccccgagt gtttta 25856 // ID Gypsy-153_AA-LTR repbase; DNA; INV; 1505 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-153_AA_; KW Gypsy-153_AA-I; Gypsy-153_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1505 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1032-1032 (2011). XX DR [2] (Consensus) XX SQ Sequence 1505 BP; 471 A; 295 C; 342 G; 396 T; 1 other; tgtaacgagg ttacagtatt ttaatgtaat ctttaattac aaaaaccaaa acagtgcgaa 60 atgcaatatt atttccgtgt agccactttt cgcgtaccaa gtctctgatt ccacaagttg 120 cgatgcaaca gtagagtcct tatcaaaact gaaaacttgc gttcttaaag gaagtacacg 180 aaaaacgacg aacatttacg aactcgggaa ccgccggtac agggtaatag cccaaaattt 240 ctgaaattgc catcctcttt aatctaaacc gtcgaacagt taagacgctc tttgcaaaaa 300 gttaaatata aaattaaaag tcaatcagtg attttataca atctagttaa gagtgaaagg 360 tgtgtacgca gttgaaaagt aagatttagt agaagtagaa atttgccaag tttgttaaaa 420 taaatattaa ttcccattga acagtttttt tgaaaacgtg ctctgaacaa aagtttagat 480 tagttctggt aatttcgtgc gtgtgtgcaa tcaggtaact cttgtaatac aacgagaaac 540 aatgttagaa tctaactcca aatactgtct cgcttgaaga actttccttc aagtgaaggt 600 ccacccttag gacggggcga atagtccaaa gcctccaccg tcgacggtat tcatcgacgt 660 ggtgtacgtt gggccgatga gcatagtgcc agtgcagcat aatcagcctc gtgtcgatgt 720 agcgagtgac agatcgccgt agccaggacc accatcataa ttatttcggc caacaacaac 780 aacctcaaat tcaccacgct tccgagaacg tcacagagcg agcatgattg tgacgctcag 840 aacgtaacgc actgtcttcg ctagcggccg ctagagggcc gtgagaattt gaaacgaacg 900 cacgtgtacg ctacccgacc aaaaaggaat tcgaatcgag gagattggaa cggagaaagg 960 attcggagtg taaaggaaca ggattcgttg gatcaggaaa ggaaggataa ggaataagga 1020 caatgggaaa ggatcaagta agcgctacct aaaattgtaa atactaaccc cctacatttt 1080 aagatgaaga ttataaagcc atgagcaaaa agtgaattga ataaataggt acctaatgaa 1140 ttttcaagcc ttggcaagat tttagttttt gcacttttag ataaaagaga actggtattt 1200 tcagatttcg gttttcggga cgagtagaat aggctctggg taggaaaacc tacaaaggaa 1260 acctacccac cctsaagtgt tgaaatcgtt tcttgatagt ttgtttttat tcgttggagg 1320 tcgtgtacgg atgtttcggt cgtgggtcga cggtcgcgat taaggactcg ccctgaggtc 1380 tcgcagccgg gtcgggattc tttgtttgag acaactcgct ccttcgcggc aagataaaac 1440 ttgcctggcg cttaagcgga ggtagtatta catccgaggt ttttgccgcc cctagtcccg 1500 ttaca 1505 // ID Gypsy-2_BT-I repbase; DNA; INV; 4108 BP. XX AC AELG01001160; XX DT 15-JAN-2011 (Rel. 16.02, Created) DT 15-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the buff-tailed bumblebee: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_BT_; KW Gypsy-2_BT-LTR; Gypsy-2_BT-I. XX OS Bombus terrestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Bombus; Bombus. XX RN [1] RP 1-4108 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the buff-tailed bumblebee."; RL Direct Submission to RU (15-JAN-2011). XX DR Genome; AELG01001160; Positions 121038 116931. XX CC Positions [3031-3561] - Integrase core CC 'TCAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1096..4026 FT /product="Gypsy-2_BT-I_1p" FT /translation="MELNLSLRRAFKWHFTFADVQTPIIGLDFLSHYGLLV FT DPRKRRLLDTTTQLTTRGYAANCEQHMIKTVNGDSIYHQLLAKFPDLTRPP FT AFGREKIRHGVEHHIETTPGPPVYCKPRQLAPDRLKQIKAEFATLIEQGVM FT RPSKSSWASPLHVVLKKDGSLRPCGDYRALNARTVPDRYSPPHIQDFAQQL FT HGRRIFSKIDLVRAYHQIPIAPEDVKKTAIATPFGLFEATNMMFGLRNAAQ FT TCQRFVDEITRGLDFVYAYVDDFLVASETEEQHREHLRILFERLNHYGVVI FT NLAKCEFGVDEITFLGHTVNAQGIKPLADRVKAVNEAPLPANIKALRRYLG FT MIIFYRRFIPGAAKILQPLNDLLQGAKKGNMPITWSEEAKTSFSESKRALA FT NATMLAHPIPDAPISLAVDASDFAIGAVLQQRANDTWQPLGFLTKPLNSAQ FT RKYSAYDRELLAMYTAVKRFRHAVEGRNFIIFTDHKPLTYAFSKNPDKCSP FT RQFRQLDYIGQFTTDIRHIKGLNNNADALSRIEAIGKSVDHKTLAAAQEND FT EELRDIVNSDTSALQLKKIRFPDYDAEIYCDVSGDIVRPYVPKSLRRDVFN FT ALHGLSHPGIRATQKLVKSRFVWPALNKDCQIWTRQCIPCQQCKITRHVST FT PSGTFVELAGRFEHIHLDIIVMQYSQGCRYCLTCIDRFSRWPEAIPIPDME FT ATTVASALLSTWIVRFGVPTKITTDQGRQFESNLFKKLCRMLGIKHLRTTA FT YHPASNGMVERFHRQLKAAIKCHNSSNWVESLPIVLLGIRTAIKEDLSATA FT AEMIYGTGIRLPAEFFVPAKQPTNAEFANRLRERMSEIRPLPTSQHGEKKT FT FMFHDLQTSPYVFVRHDAIEGPLQPPYDGPYQVIQRREKTFTIRVKDKSVK FT VSVDRLKAAFIVSEDIEHLERSAQTHDVFVPRKIFRPQSSEQAQQSDGDAR FT AHYTTRSGRRVRFPDRFQAGLN" XX SQ Sequence 4108 BP; 1156 A; 1130 C; 985 G; 837 T; 0 other; actggtgacc ccgacgtgat tttttagtgt aagaggtgac agtgatagac tagtcgcagt 60 gtgaacagta ttgtccgcat ctcacacgat ggaaaaggag agtaaaccaa ctccagccgg 120 cgcatcgttc cgcgtgccgc catttatgcc tacgaagacc gggatctggt tttcgattct 180 agagaagtat ttcggcgtag ccggcattag ccacgacgac gagaaagcct taaccctcat 240 gggattcctc gacccagaat acctcgcaaa gatagaggat accgtgacca acctccccgc 300 caccggccag tacgataaat tgaagagcga gctgatccgc gccctgacag aatcggacag 360 cgccaaggtt gaaaggttag tcgaacgcga agagatgggc gacagaaagc cctcgcaatt 420 ttacaacgat ttaaagaaat tagccagctc tctcgtgtcc gatgaattca tcctaaactt 480 gtggaagaat ctcctcccag atcgtatcag ggaagtcctg gctgaagtgg gcgatacgag 540 cattgagaaa ctgatcagga ccgcggataa aattgacgag gcgtacaatc gtatcgggca 600 gcgcccgcac aaggtggtca cactcgccga gccaccggca ccgcacgtag acaaaaacga 660 cgcgctggcc gccgaggtca gcagactgcg gaaggaaatt aaggctttga aaatcggcga 720 gcatcgtcga ccgcgaagac gctcgaactc cctcactcga ccccgccgcc gttcccgctc 780 gtgagacaac ccgcgtcagg acggaatttg cttttatcat acgagattcg gcggtcgcgc 840 cacgaaatgc accatcccgt gtaagtggaa gtcgggaaac gaccccagcc gtccgtaaaa 900 gcggcagacg tcgatggctt gcgatctcgc cgcatcttta tcgtggatca gagaacgaaa 960 acttcgttct tagtggacac cggtgccgac atcagcgttt atccccgaag caggctatcc 1020 agggacgtaa agaaaacagc gtacaaacta ttcgctgcca atgggtagcg catcgcaacg 1080 tacggcactt tagcgatgga gctgaacctg tccctcagac gtgccttcaa atggcacttc 1140 acgttcgccg atgtgcaaac ccccatcatc ggcctagact tcctaagtca ttacgggttg 1200 cttgtggacc caagaaaaag acgtcttctt gacacaacaa cccagctaac gaccagggga 1260 tacgccgcca actgcgaaca acatatgatt aagaccgtaa atggggattc gatctaccac 1320 caactgttgg caaaatttcc agatctaacg cgcccaccgg cctttgggcg agagaaaatc 1380 cggcacggcg tggagcacca catagaaacc acacccggcc cacccgtata ttgcaaacct 1440 cgccagctcg caccggaccg tctcaagcaa atcaaggcgg aatttgcaac acttatcgag 1500 caaggagtca tgcgcccatc gaagagttcc tgggcatctc cactacacgt cgtcctaaag 1560 aaggatggaa gtctgcgacc ttgcggtgac tacagggcgc tgaacgcccg cactgttccc 1620 gacaggtact ctccacctca catacaagac ttcgcgcagc agctacacgg taggcgaatt 1680 ttctcaaaaa ttgacttagt gcgcgcctac catcaaatcc caatcgcgcc tgaggacgtt 1740 aaaaaaaccg cgattgcgac accattcggc cttttcgagg caaccaacat gatgtttggg 1800 cttcgaaacg ccgcgcaaac gtgtcaacgt tttgtcgatg aaatcacgcg aggcttagat 1860 tttgtatacg cgtacgtaga tgacttcctg gtagcttcgg aaaccgagga acagcaccgc 1920 gagcacttgc ggattttgtt cgaacgtcta aatcattacg gcgtagtaat caaccttgca 1980 aagtgcgagt tcggcgttga tgaaataacc ttcctaggcc atacggtaaa cgcgcagggt 2040 ataaaaccgc tcgccgatcg agtaaaagcc gtaaatgaag caccgctccc cgcgaacatc 2100 aaggctctcc gcaggtacct cggtatgatt atcttttacc gacgtttcat accgggagca 2160 gccaaaatct tacaacccct caacgatttg ctgcaagggg cgaaaaaggg caacatgccc 2220 attacctggt cggaggaagc caaaacgagc tttagcgagt caaaacgcgc cctagctaac 2280 gctacgatgt tagcgcatcc tatacctgac gcgcccatca gcctcgctgt agacgcgtcc 2340 gactttgcaa taggagcggt gttgcagcaa cgcgcgaatg acacttggca accgttgggt 2400 ttcttaacta aacccttaaa ttccgcgcag cgaaaatata gcgcgtatga ccgcgaattg 2460 ctcgctatgt ataccgcggt caagcgattt agacacgccg tagaggggag aaattttatt 2520 attttcactg accataaacc cctcacttat gcgttcagta aaaatcccga caagtgttcg 2580 ccgcgacaat tccggcagct agattatatc ggtcagttca cgaccgatat aaggcatata 2640 aaagggctga acaataatgc cgacgcccta tcgcgcatcg aagccatagg aaaatccgtg 2700 gatcacaaaa ccctcgccgc cgcgcaagaa aacgatgagg aactccgcga catcgttaac 2760 tccgatacat ccgcgttaca gcttaagaaa attcgttttc cggattacga cgcggaaata 2820 tactgtgacg tgtcaggcga catcgtacga ccgtacgtac cgaaatcctt gcggcgcgac 2880 gtattcaatg cacttcacgg actttcccac ccagggatac gcgcgactca aaaactcgta 2940 aaatcgcgtt ttgtttggcc cgcgctaaac aaagattgcc aaatatggac acggcaatgc 3000 atcccgtgtc aacaatgtaa aataacgaga cacgtatcaa cgccgtccgg aacattcgta 3060 gagctcgcag gacgattcga acatatacat ttagacatta tcgtcatgca atactcacaa 3120 ggctgccgat attgtctaac atgtatcgat cgtttttcac gttggcccga agcgattcct 3180 ataccagata tggaagcaac gaccgttgcc tctgctcttc tttccacttg gatagtgcgg 3240 tttggcgtcc caacaaaaat aacaacagac caaggacgcc aattcgaatc caatcttttc 3300 aaaaaactat gccggatgct cggcattaag catttgcgta ccacagccta tcaccccgcg 3360 tccaacggga tggtagagcg cttccaccgg cagcttaaag ccgcaatcaa atgccacaat 3420 tcgagcaact gggtcgaaag ccttcccata gttcttttag gcattagaac cgctatcaag 3480 gaagacctga gcgcaacggc agccgaaatg atctatggta cgggcatacg attgccagcg 3540 gaattttttg taccagcaaa acaaccgacc aacgcggaat tcgcaaaccg tctgcgagag 3600 cgaatgagcg agatcagacc tctgccgact tcacaacacg gcgaaaagaa aacattcatg 3660 ttccacgact tacaaacatc gccgtacgtt ttcgtacgtc acgacgcaat cgaaggccca 3720 ctgcaaccgc cgtacgatgg cccctatcag gttatacagc gtagggaaaa aacctttacc 3780 atacgggtga aggacaaaag cgtgaaagtc tccgtagacc gtctaaaagc cgcctttatc 3840 gtgtcggaag atattgagca cctagaaagg agtgcacaaa cccacgacgt gtttgtacct 3900 aggaaaatct tccgaccgca aagcagcgag caagcgcaac aaagcgacgg agacgcaaga 3960 gcccattaca ccacgcgttc gggcaggaga gttcgttttc ctgaccgctt tcaagcgggt 4020 ttgaattaag cggttagcaa gggcaccccc cgctaactta agtaagccaa cataaaagga 4080 agtaacgtta acactggcag gggggtac 4108 // ID Copia-1-I_DY repbase; DNA; INV; 3967 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon Ty1-copia like, internal portion from DE Drosophila yakuba - consensus. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1-I_DY. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-3967 RA Bao W. and Jurka J.; RT "Copia LTR-retrotransposons from Drosophila yakuba."; RL Repbase Reports 9(2), 475-475 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 312..3932 FT /product="Copia-1-I_DY_1p" FT /translation="MEQIEGRTSAKRACTKGTRSSSTNGWSSMSTYISEFV FT EILDGLAGVGIELNDELRTIVLLSSLPEQFEHFVVAMETRDQLPTFEILTI FT KLKEESERKGIAEERIDNTKAFAATQRQNWQTKTYGQKKRKNIVCFRCGEQ FT GHIKSQCRSASEGDSNRTVPKNLVNRQSSLLHACDVNNLQNSMWCLDSGAT FT SHMCCARELFIKFEKHTEQIGLADAGFLQAEGIGDVKIQTELCMLTLKNVL FT YVPKMTGNFMSVSSAVQNRCKVTFDLEKAMVVQDGECIVKAKKIGNLYLFE FT GGRSGCFAMSAVDGILLHKRYGHINFSSMKELVSKGMVRGIENIGLPENIN FT CKTCMVSKIHVQPFPRTTCNRAKELLELIHADVCGPFPTLSLGGSKYFLTF FT IDDKSRRIFVYFLRGKNEVLSKFMEFKNLVERQTGKQLKCLRSDNGGEFVN FT KQFDEYLRKYGIARQLTIPHTPQQNGVAERANRTLVEMSRCLLVQAGLGDS FT FWAEAVNTSAYLRNRSPTSAVKSMTPMEAWTGKKPCVKHLKVFGSLAVALD FT KGPRKGSKFQPKGKEYIMVGYSVAAKGYRLYDPAARQLVEKRDVLFDEHHA FT AASKDDVVLINLEELDEPCQRDGAEVDETVSDISVDTGSADESIDQYESAD FT DNEELVLEEARRGPGRPKIVRTGKPGRPKKQYNILGALVSENVPIPLTCKE FT AVESSHAMEWREAMAKEYDSLVANQTWQIADLPKNQRAIGCKWVFNVKRDK FT DGQIERFKARLVAKGCSQQFGVNYLETFSPVCRLESVRLMIALAAELKLYL FT HQMDVCTAYLNSDLRETVYMKQPEGYIDEKNPDKVLVLRKAIYGLKQSGRA FT WNEKLNSVLVDMGFISCNNEPCLYKQSGQGNLSLILVYVDDILIACQSHED FT LVEIKNSISQSFECVDKGPLSHFLGMEIEREGDLGAITLGHSQYIKDLLRT FT HGSELCKSAKTPLDAGFQIDCTSDKCKKVDPTVYQSVVGELMWLALTTRPD FT ILHSVSKLAQRNQNPHAEHMAGIKHILRYLSSTVDVKLHYQQCGQSFCGYV FT DADWAGDRLDRKSYTGYIYLLAGGPISWRSEKQRSVALSSTEAEYMALSAA FT FKEAIVMRRLILEIGCGDDNTPIVVYGDNLSAQALTKNPVNHSXTKHIDIR FT YHFVRDVVKEGKVLLKYKCTTEMIADILTKNLPKTKHELFTEMLNLY*" XX SQ Sequence 3967 BP; 1181 A; 749 C; 1075 G; 960 T; 2 other; ggttatgggc ccaggcatta gtgaatttat tttggttaag cggtgtgtga ctcacgatca 60 agaacaatga gtgctcttta tcaaattgaa aagttagagg agaaaaacta tgatacgtgg 120 aaaatacaga tgcgatctgt attggtgcat tccgggttat ggagtgtgac gtcaggagag 180 ctaacagcaa cgaacattcc agagggacac agctttgtgg ctttggacaa taaggcgtta 240 gcaacaataa cactgagtgt aaagacatct cagttggcct acatcaagaa ttgctcgact 300 gctgtggagg catggaacaa attgagggac gtacatcagc caagcgggcc tgtacgaaag 360 gtacaagaag ctcctcaaca aacggatgga gcagtatgtc gacatacata agcgagtttg 420 tggaaattct ggatggtcta gctggagttg gaatagagct aaacgatgag ttacgaacta 480 tcgttttgct gtcgagtctt cctgaacaat tcgaacattt cgtggtggcc atggaaacac 540 gagatcagtt gcctactttc gagattctga ccataaagtt gaaggaagag agcgaacgaa 600 aaggaatagc agaagagcga attgacaata ctaaggcatt cgctgcaaca cagaggcaaa 660 attggcagac aaagacgtat gggcagaaga aacgaaaaaa tatcgtttgc tttaggtgcg 720 gtgagcaagg gcacattaag tcccagtgtc ggagtgcgag cgaaggagat agcaatcgca 780 cggtgccaaa aaatttggtg aatcgacaaa gcagtctgct gcatgcttgt gacgtcaaca 840 acttacagaa ctcgatgtgg tgcttagata gcggtgcaac aagccatatg tgttgtgcac 900 gggagctttt cataaagttt gagaagcaca cagagcagat cggactggcc gatgctggat 960 ttcttcaagc agaaggtata ggtgacgtca agattcagac agaactgtgc atgttgactc 1020 ttaagaatgt tctctacgtc ccgaagatga caggaaattt catgtccgtg agcagtgctg 1080 tgcagaacag atgcaaggtg acatttgacc tagagaaggc catggtggtg caagatggag 1140 aatgcatcgt gaaagccaaa aagattggca atttgtattt gtttgaaggt ggtcgaagcg 1200 ggtgtttcgc catgtcagca gttgatggga ttttactgca caagagatac ggacacatta 1260 atttctccag catgaaagaa ctcgtttcca aagggatggt gcgcggcata gaaaatattg 1320 gtttgccaga aaatattaac tgcaaaactt gtatggttag caagatccat gttcaaccat 1380 tcccaaggac aacgtgtaat cgggcaaagg agcttttgga gctgatacac gcggatgtgt 1440 gtggaccttt cccaacactt tctctgggag gttcaaagta ttttcttact ttcattgacg 1500 acaaatccag gaggattttt gtttattttc tgcgtgggaa gaacgaggtg ctgtcaaagt 1560 tcatggagtt caagaatctg gtggaacggc agacaggcaa acagctgaaa tgcctgcgga 1620 gtgacaatgg cggagagttt gtcaataagc agtttgatga atacctcagg aagtatggga 1680 ttgccaggca gctaacaata ccgcatacgc cgcagcaaaa tggcgtggcg gaaagggcaa 1740 atcggacgct ggtggaaatg tcaagatgtc ttctggtcca ggcggggcta ggagatagtt 1800 tttgggcaga agctgtcaac acttcggctt atctgcgtaa ccgatctccg accagcgcgg 1860 taaagagtat gactcctatg gaagcgtgga ctggtaagaa gccttgcgtg aaacatctaa 1920 aagtatttgg atctcttgca gttgctctgg acaaaggtcc gaggaagggc agcaagtttc 1980 aacccaaagg caaggagtat ataatggttg ggtactctgt ggcagccaaa ggttatcggc 2040 tgtatgaccc agcagctcga caactggtcg agaaacggga tgttttgttt gatgaacatc 2100 atgctgcggc atccaaagat gatgttgtgc tgatcaactt ggaggaactc gacgagccct 2160 gccagcgaga tggggccgaa gtcgatgaaa ccgtatctga catatccgtc gatactggca 2220 gcgcggatga gagcatagat cagtatgaga gtgctgatga taatgaagag ctggtcctag 2280 aagaggcacg cagaggtcct ggtcgcccca agattgtccg aactggaaaa cctggacgtc 2340 caaagaaaca atataacatc ttgggcgctc ttgtttcgga aaatgttccg atccctttga 2400 cctgcaaaga ggcggtggaa agttcgcatg ccatggaatg gcgcgaagct atggctaagg 2460 aatatgattc cctggttgcc aatcaaactt ggcagattgc tgatttaccg aagaatcaga 2520 gagcaattgg ttgcaaatgg gttttcaacg tgaaacgaga taaggatggg caaatagaac 2580 gtttcaaagc tcgtctggtt gcaaagggat gctcccagca gtttggcgta aattacctgg 2640 aaactttttc gccggtatgc cgactggaga gtgtgcgtct aatgatagca ctggcggcgg 2700 agcttaagct atatcttcat cagatggatg tatgcacggc gtatctcaac agcgatctaa 2760 gagagacggt ttacatgaag cagccggagg gatatattga cgagaagaat cctgacaagg 2820 tattggtgct gaggaaagca atatacggct taaagcagtc aggaagggcc tggaacgaaa 2880 agctaaatag cgttttggtg gatatgggat ttatctcctg caacaacgag ccatgtttat 2940 acaaacagag cggacaaggt aacctttcac taattcttgt atatgttgac gatatactta 3000 tagcttgcca atcgcacgag gatttggtgg aaattaaaaa cagcatttca caatcgtttg 3060 agtgcgttga caagggtccg ttgagccatt tcctgggaat ggaaatcgag cgagaaggcg 3120 accttggagc cattactttg ggacattcgc aatatataaa ggatttattg cgaacccatg 3180 gaagcgaatt atgcaagtca gcaaaaactc ccctggatgc aggtttccag atagattgca 3240 ccagcgacaa gtgcaaaaag gtggatccca cggtctatca atctgtggtg ggagagctta 3300 tgtggctagc tctgacaaca agaccggaca tactgcattc agtgtccaaa ttggctcaac 3360 gaaaccaaaa tccgcatgct gagcacatgg ctggcataaa gcacatccta aggtatctct 3420 cgtccacagt tgacgtgaaa ttacattatc agcaatgtgg tcaatcgttc tgtggatacg 3480 tggatgcgga ttgggcagga gataggcttg acaggaagtc ttacactggc tacatctact 3540 tgttggccgg tggtccaatt tcttggcgtt cagaaaagca gcgaagcgtg gcgttaagca 3600 gtactgaggc ggagtacatg gcactgtcag cggcttttaa ggaggcgata gtcatgcgaa 3660 ggctaatttt ggaaattggc tgcggagacg acaacacccc gatcgtcgta tatggcgata 3720 atctgagcgc gcaggcgtta accaagaatc ctgttaacca ttccaraacg aagcatattg 3780 acatacgtta tcattttgtt agagatgtcg taaaggaagg gaaagttttg ttaaaatata 3840 aatgtacaac ygaaatgata gcagatattt tgaccaagaa ccttccgaag accaaacacg 3900 aattgtttac agaaatgcta aatttgtatt aaatataatt tgtaaacatg catgcattga 3960 ggaaggg 3967 // ID Zator-N1_CQ repbase; DNA; INV; 471 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Zator DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Zator; DNA transposon; Transposable Element; nonautonomous; KW Zator-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-471 RA Kojima K.K. and Jurka J.; RT "Zator DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 651-651 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >79% CC identity. 3-bp TSDs are usually TWA. XX SQ Sequence 471 BP; 151 A; 84 C; 62 G; 174 T; 0 other; ggctggtaca aatattttta aaagtttttg tcaccccccc ccccccttca aaattggccc 60 gaaaaatcag ggggcaaaaa aaatattttt acaataaact tcaaaatttc aatgaaaatt 120 caagtgcaac cagctgaaat caaattaaaa tacattctcc tgcgtttaaa atcattttta 180 gcatgtttgg gtttattaaa aaatcttaag attttttgaa aattttcgat gcaaaatctt 240 tttttttcga tacaattttt gtttttgtca gatcttagat tttttgaaaa ctaatgattg 300 caaaacaact gaactagtgt aaaatgcatt ttaaaacact tttttcattt aaatgtgaag 360 actatggctt gttatttaaa tttttatatt tttttatttt tttgcccccc cccccccttg 420 acctcggcca gggccgaggg acaaaaactt ttttaaatat ttgcatcggc c 471 // ID Gypsy-13_DPu-LTR repbase; DNA; INV; 194 BP. XX AC scaffold_35; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_DPu_; KW Gypsy-13_DPu-LTR; Gypsy-13_DPu-I. XX NM Gypsy-13_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-194 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 742-742 (2010). XX DR Genome; scaffold_35; Positions 991420 991227. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 194 BP; 44 A; 63 C; 45 G; 42 T; 0 other; tgttgtaacc tcgcccgtca gagggcacac acacccgacc caccatctcg gctgacagag 60 gagacggtcg ggcagtctcc acgcgactag ctcagcagac ggtcgcagcg tagacccatc 120 tcactcccgt gctaatttgt attctctctc tcgcctcgtg taagttgaat aaacgtgtat 180 cacgagtctc aaca 194 // ID Proto2-1_CS1 repbase; DNA; INV; 4564 BP. XX AC . XX DT 10-JUL-2009 (Rel. 14.07, Created) DT 10-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-1_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-1_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4564 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1555-1555 (2009). XX DR [1] (Consensus) XX CC Proto2-1_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1CS1) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in elements CC from all species mentioned above. ORF2 codes for a protein CC composed from the AP endonuclease and reverse transcriptase CC domains. It appears that the Proto2 clade is a clade ancestral to CC the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 396..1463 FT /product="Proto2-1_CS1_1p" FT /note="ORF1." FT /translation="MTRTPSFLGEESAMKLIPNELLCYAFNYADNSTVSSL FT SACLCEFYQPQEISSAREILWKEYEQVISTLTKKTRRSQAPTDAVSAKPFA FT EDIGVWINHVKNNTSASIMTTFCAVNIRKIPNCPPEEINLFSIVARLGALE FT KKLEDSEPTVHRVQTREDIAAPLAPVADDQLQTAVNEPPQKTSGVRPLVAQ FT AEWTSVVKKRMNKKKKIRIAAKELTTVVGSASDCSIKASEPVRHVFINKVD FT KQCTNENIHDYIKKKGITPKDVRRTSKEGWLSTSFKISVPSVNFNDLLKAE FT FWPSGIRCREWLSYIPRKRIGSANADLNENDAFLSDGLSEEEEDAHDSFAT FT SDAAAAHHSHHG*" FT CDS 1402..4284 FT /product="Proto2-1_CS1_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MLMTPSRLLMLLLLTILIMDSALRLCTWNSRGHGADR FT LIYINKLLQQCDILLVQETWLHDWELNKLSDDDAISILGVSGMDPQQLLVG FT RPHGGCAVLFHSSLKISIASVQTNSCRMFACVVNVINSIKFLIFNVYMPCD FT SVTNVDAYMDIMNEIESIISVHQDIDNVIIGGDMNVDLRRTRSANLRALEN FT LCFNQSLLFCQSFDQCHVDFTYENEATGARSIIDHFCVSENIFNSIVHYLA FT VHEGDNLSDHAPVFLDIKLNLCNVTQSVSTSQNRVSWHRASSRDILAYKEM FT LSVCLEGINVPYEALHCVNCSNFNNDCLIHTHSVDKYYDDIVRAMRSSAEV FT TIPVCKKRGKAGWSTHVKQFQEDAIFWNRIWVENGRPTTGSLSNIRRSTRA FT QYRRASRWVVRNQDKLSADRMAQALASNNSRDLWGEVKKKANKVRDKPIIV FT DDADGELEVCEMFKVKYESLYSSVSFNENDMNEFIDQVTDRISTTCAAGSC FT YHNHFFSVDNVKSAVKMLKSGKSDSECNLSSNNYINGCDSLFVHLSILFNL FT MFARFSSPNEMLKSFLIPIPKNRKKSLSDSNNFRSIAISSIMGKILDHIVL FT KVHSNVLSTSDLQFGFKSNHSTTQCTYVLNEVIEYYNSRHTPVYTTLLDAS FT RAFDRVHFIKLFNLLLFRGMCPALIKFLVHMYIRQTLLVRWQNTVSAPFSC FT RNGIKQGGVLSPILFCVYMDELLMRLKKSKIGCYVGHVYCGALTYADDLTL FT IAPTHSAMSHLIEICERFSLEYSVKFNSTKSHVVLFNNSSRTHCCQPFYLH FT GQPIEYVTNALHLGTLIGKDTHAKNLSKLSKDMIIKTNVLYSHYKHCSPDV FT LCSLFKSYCTSFYGSPLWSLSNINDFIVCYKKCIKRLLNLNLRTRSKYVYL FT LMHQPDLDTQLMYRFSSFWNNCFKSDNVLIQLCANLSLTSNSVVAKNLKKN FT ISIPTM" XX SQ Sequence 4564 BP; 1357 A; 817 C; 891 G; 1499 T; 0 other; tgtgagtaga tcaggtttcg attaaggcgg ttttttgtat agctcctgag aatttgctat 60 ttcttgagga ttttttgctt tttgtgtgtg gatcttttct cattatgtga ggtatttgtg 120 tcctatttaa tttgatatat ttattcgcct tattcgaggt gttttgttct atttatttca 180 tctttttcac tgaactctta ttcaattttc aaatttagag catattgaac tgctcgctgg 240 tatagaaggc tattttacct tcttagccag cgataatcag gttatttcat tgttctctca 300 cgtttatttc cttgtgttaa ctcatttgtg cacagttatt atattttgtg tgtgtatttt 360 gtgtgtgtat ttttattttt aaatggtcgt cagctatgac gagaacgcca tccttcctcg 420 gcgaagagag tgcaatgaag ctcattccca atgagctttt gtgctatgca tttaactatg 480 ctgacaatag cacagtgtca agtttgtctg catgtctatg tgaattttat caaccacaag 540 agatatcatc tgcgcgtgaa atcctgtgga aagaatatga acaagtgata tctactttaa 600 cgaagaagac acgtcgatct caagcaccaa cggatgcagt aagtgcgaaa ccgtttgccg 660 aggatattgg tgtatggata aaccatgtga agaacaacac gagtgcctca ataatgacaa 720 ccttttgtgc ggtcaatatc cggaagatcc caaactgtcc tccggaggag ataaacctat 780 tctcgattgt tgcgagatta ggtgcccttg agaagaaact cgaggactca gaaccaaccg 840 tacatagagt acaaacacgt gaagacatcg cagcgccact ggcccccgtt gctgatgacc 900 agctgcagac cgctgtgaac gaacccccac agaagacatc aggtgtaaga ccactggtcg 960 ctcaagcaga gtggacatcc gttgtgaaga aaaggatgaa caaaaagaaa aagatccgta 1020 ttgcagcaaa agaactcaca actgtggttg gttctgccag tgattgctct atcaaggcat 1080 cagaacctgt tcggcacgtg tttatcaaca aagtggacaa acaatgtaca aatgaaaaca 1140 tacatgatta cataaagaag aaaggcatca ctccaaagga cgtacgacgt acttcaaagg 1200 agggttggct cagtacatcc ttcaagatat ccgtaccttc agtcaacttc aacgatttgc 1260 ttaaagctga gttctggcca tctgggatca gatgtcggga atggctctct tacattccca 1320 ggaaacgtat cgggtctgct aacgcggatt taaacgaaaa cgacgctttc ttatctgacg 1380 gcttgagcga agaggaagag gatgctcatg actccttcgc gacttctgat gctgctgctg 1440 ctcaccattc tcatcatgga tagtgcccta agactatgta cttggaacag taggggccat 1500 ggtgccgaca gacttatata catcaataaa ctgctacaac agtgtgatat tctcttagta 1560 caagagacgt ggttgcatga ctgggaattg aataaactca gtgacgatga cgccatatct 1620 atattaggag tctccggaat ggaccctcag cagctcctcg ttggtcgccc gcatgggggc 1680 tgtgctgtat tgtttcactc tagtttgaaa atctcaatcg ccagtgtgca gacaaactct 1740 tgtagaatgt ttgcttgcgt ggtgaacgtg attaattcta tcaaatttct tatttttaat 1800 gtatatatgc cgtgtgacag tgtgacaaat gttgatgctt acatggatat tatgaatgag 1860 attgaatcta taatttcagt tcatcaagat attgataatg tcatcatagg tggtgatatg 1920 aatgtagact tgagaagaac aagatcagct aacttaagag cacttgaaaa tctatgtttc 1980 aatcaaagtt tactcttttg tcagtctttt gatcagtgtc atgtcgattt cacctacgaa 2040 aacgaggcca ctggagcacg ctctattatt gatcatttct gtgtctctga aaatatattt 2100 aactcgattg tgcattactt agcagttcat gaaggtgaca acctttcaga tcacgcacca 2160 gtatttcttg atatcaaact caatttatgt aatgttactc aatccgtatc tacttcacag 2220 aatcgtgttt catggcatag agcatctagt cgagatattc tagcgtacaa agaaatgtta 2280 agtgtatgtc tagaaggtat taatgttcct tacgaagctt tacattgtgt taattgttct 2340 aactttaata atgattgtct cattcatact cactcagtcg acaaatatta tgacgatatt 2400 gtaagggcca tgcgatcatc agcggaagta acgatcccag tatgtaaaaa aagaggaaag 2460 gctggttggt caacccatgt taaacaattt caagaggatg ctattttctg gaaccgcatt 2520 tgggtggaaa atggtagacc aacaactggt tcattaagta atattagacg gagtacaaga 2580 gcgcaatatc gaagagcttc aagatgggtt gtaagaaacc aagataaatt atctgctgat 2640 cgcatggctc aagcgttagc tagtaataat tctcgtgatc tttggggcga ggtaaaaaag 2700 aaagctaata aggtacgaga taaaccgatt attgttgacg atgcagatgg agagcttgaa 2760 gtatgtgaaa tgtttaaagt caaatatgaa tctctgtaca gtagtgtctc gtttaacgaa 2820 aacgacatga atgagtttat tgaccaggtc acggacagaa tttcgacgac gtgtgctgct 2880 gggtcgtgtt atcacaacca tttttttagc gttgataatg taaaatcagc tgtaaagatg 2940 ctcaaatcag gaaagagcga ctctgaatgt aatttatcat ctaataatta tatcaacggt 3000 tgtgatagtt tatttgtaca tttatctatt ctgttcaatt taatgtttgc tcgtttttct 3060 tcaccaaatg aaatgctgaa atctttcctc attccgattc ctaagaatag aaagaaatcg 3120 ctcagtgaca gtaataactt tcgttcaata gccattagca gcattatggg taaaattctt 3180 gatcacatag ttttaaaagt tcactctaat gttttgtcaa ctagcgattt acaatttgga 3240 tttaagtcaa accattctac aacccaatgt acatatgttt taaatgaggt cattgaatac 3300 tataacagtc gtcatacgcc tgtatatact actttgctcg atgcttctcg tgcgtttgat 3360 agggtgcact ttatcaagtt gtttaattta ctgttgttta ggggcatgtg cccagctcta 3420 attaagttcc ttgttcacat gtatatacgt cagactcttc ttgttcggtg gcaaaatacc 3480 gtttcggctc cgtttagttg ccgcaatgga attaagcagg gtggagtgct gtctccaatt 3540 ttattttgtg tttatatgga tgaactttta atgcgtttga aaaaatcaaa aattggatgt 3600 tatgtaggtc atgtttattg tggagcttta acttatgctg atgatcttac tcttatagca 3660 ccgactcact cagccatgtc ccacttaata gaaatttgtg agcggttttc acttgaatat 3720 tctgtcaaat tcaatagcac caaaagtcac gttgtcctgt ttaataactc ttcaagaact 3780 cattgttgtc aaccatttta tctccacggc caacccatag aatatgtcac taatgcttta 3840 catttaggaa ctttaattgg aaaagataca catgctaaaa atctgtctaa attatctaag 3900 gatatgatta taaaaactaa tgttttgtac tctcattaca agcactgttc acctgatgtt 3960 ctatgttcat tattcaaatc ctattgcaca agtttttatg gttctcctct ttggtcgcta 4020 agtaatataa atgatttcat tgtttgttat aaaaaatgta ttaagcgatt acttaatttg 4080 aatctacgca cgagatcaaa atatgtttac ctgttaatgc accaaccaga cttagataca 4140 cagcttatgt acagattttc ttctttttgg aataattgtt ttaaaagtga taatgttttg 4200 attcaattgt gtgcaaactt atctcttact tctaattctg ttgtggctaa aaatttaaaa 4260 aaaaacatca gtattcctac aatgtgatca gcaaaattta gtcatgccga aaaactgttt 4320 gaaaaatgta cttttcgcaa aatattttag cgccatttca gagcacgata ttgcaaatgt 4380 tactgtttta aaagaattac ttgaagctcg gtgtggctca atagactgca tattgagtca 4440 tatcgaaatt aatgcactac tttattattt atgtattact taattaagta ctgtttgcag 4500 ttctgttaaa cttactgttt ttgtttttca tataaaatgt acaaacatga gtttgtgaat 4560 aaaa 4564 // ID Gypsy24-I_Dpse repbase; DNA; INV; 6428 BP. XX AC Unknown_group_154; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy24_Dpse; KW Gypsy24-LTR_Dpse; Gypsy24-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-6428 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1126-1126 (2009). XX DR Genome; Unknown_group_154; Positions 21028 14601. XX CC Positions [2884-3360] - Integrase core CC 'GCCG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 547..3732 FT /product="Gypsy24-I_Dpse_1p" FT /translation="MRALSVRVRKARSRFKQRRAIRRQVIASVKRWCKEDP FT RVFAEISIMGESVRGLLDSGATHSMLGCNGQEFLEKLGVEARRYSSIVKVA FT NGEDRAIVGRVELPVKYKGEVKRLTFFVCPFLEQEIYLGVDFWRVFSLAPE FT VMGEGPVGASHRLVEEVLADQVAHYVEGPEKEIVQEEWELSEEQKAALDQV FT KAEFLTFEAVGLGRTSRETHKIQLVEKAEPVKDRHYPLSPAMQEVVCGEVD FT KMLALGVIEYSDSPWSNRTTVVRRPGKNRFCLDARKLNKVTIKDAYPLPSI FT EGILSRLDQTIYISSVDLKFAFWQIELDEQSRPYTAFTVPGRPLYQFRMMP FT FGLCNAAQRLCRLVDKVIPVEMRSNVYAYLDDLLIIAEDFQTHVKILRQVA FT ERLREANLTIGLSKSHFCYKNIKYLGFIVGGGALKMDPDRVSAILRIPEPR FT SVRELRSFLGTAGWYRRFMKNYAEMAGPLTDALKKSGNGKFKLTEEAKRAM FT GKLKAALTSAPVLVHADFKRHFYIQCDASHYGVGAVLFQRDDEQNERPIAF FT FSAKLNCHQKNYSVTEKECLAALMAILKFRPYVELMPFTVITDHASLKWLM FT TMKNLDGRLARWSLQLQAFDFGIEHRKGADNVVADTLSRSIEELEVDPSSI FT LGFETVEFESVEYQELRKEITENQDRLPDLQIVDGMIFKRMRFERLDDQLE FT GAVWKLWVPSSLTAALIDKAHSEEKTAHGGIAKTLHLLRRQFYWPNMAIEV FT RDYIRRCTVCKESKVPNYRMQVGMGQEVVTERPFQKLYIDFLGKYPRSKNG FT QAWIFIVVDHFSKFTFLKTMTKATAKEVVRFLVHEVFYKFGVPEMIHSDNG FT AQFVSKTFKEMTDAFGVTHMRTPVYSPQSNAAERVNRTVLSAIRAYLEDDH FT RDWDVHLPEIELAIRNSVHEATGVTPFFAVFGQQMYLHGSCYKLAKRLRSI FT GEHDISLLEGQDKRQLIQERIRASLHQAYERSRVRYNQRARVFYARPGQEV FT FRRNFVLSDFGKGFNAKFARKFIRCRVVKPVGENAYALEDLNGRPIGVYHA FT KDLRM" XX SQ Sequence 6428 BP; 1733 A; 1314 C; 1874 G; 1507 T; 0 other; cggcgtccga gttcgaccgc atgagagaac tgcgtctacg gcttacgggt ttattcctta 60 gaccaactgc gggatgagtg cgtggaaata gaggcacatc tgaagaggaa ggctcgatct 120 tcgtggcgag gagcacccaa agccacaaac tatggtcgcg gggcgaatgc gaacgtcagc 180 gaggtcgcgc aggatcggtc gattagcgat gaagaggcag aggtcattga ggaagcgcaa 240 gtgacacagc gtgttccaca gaagttgatt tgctggaact gcgggcagac cgatcatggc 300 tttagggatt gtatggcatc ggagaggcga attttctgct accggtgtgg aaagtccgat 360 gcatattgtc caacctgtcc gaattgcgcg ggaaacgtga agaggggtgc gatccaagca 420 ggcgaatcac gctcacagaa ggggcctgcg cagagaaatt agaagaaaat aaaaatgaat 480 tagatagacg taaattgcat gctaggttaa ggaaatagaa tattttctga actacgagtg 540 gatggtatgc gagcactgtc ggtgcgggtg agaaaggcga gatcccgttt taagcaaagg 600 cgagctataa ggaggcaggt gattgcttca gtaaagcgtt ggtgcaagga agacccgaga 660 gttttcgcag aaatctctat aatgggcgag tccgtgcgag gcctgttaga ttcaggcgct 720 acacacagca tgctagggtg taatggtcag gaattcctgg agaagctggg cgtggaggct 780 cgtcgatatt cttctatagt gaaggtagcg aatggggaag atcgcgcaat cgttggccga 840 gtggaattac cggtgaaata caagggagag gtaaaaagat taacgttttt cgtatgccct 900 ttcttggaac aggaaattta cttgggggta gacttttggc gggtattctc attggctcct 960 gaggtgatgg gagaaggtcc agtaggggca tctcacagat tagtggaaga agtgctggct 1020 gatcaagtcg ctcactatgt ggaaggcccg gagaaggaga tagtgcagga agaatgggag 1080 ttgtcagaag agcaaaaagc agccttggat caggtgaagg cagagtttct gactttcgaa 1140 gccgtaggcc tggggaggac gtcgagagag actcacaaaa ttcaattggt ggagaaggcc 1200 gagcctgtga aggatcgaca ttacccgctg tcgccagcga tgcaggaagt ggtgtgtggt 1260 gaggtggaca aaatgttggc tttgggagtc atcgagtaca gtgatagtcc gtggagcaac 1320 cgtaccaccg tggtgcgaag gccgggcaaa aatcgtttct gtctcgatgc ccgaaagctc 1380 aacaaagtaa ccatcaagga cgcttaccca cttcctagca tcgaggggat tttgtcgcgt 1440 ctcgatcaga cgatatatat ctccagcgtc gacttaaaat tcgccttctg gcaaatcgag 1500 ctggacgagc agagtcgacc atatactgcg ttcacggtgc cgggacgccc tctgtatcag 1560 ttccgtatga tgccctttgg cctctgcaac gccgcacaac gcctatgtag actggtcgat 1620 aaggtaattc cggttgagat gcgttcaaac gtctatgcgt atctcgacga cttgctgatt 1680 atagctgagg acttccaaac gcacgtgaag attctgcgac aggttgccga gcgtctacga 1740 gaagcaaatt taacgatagg gctgagcaaa tctcattttt gctataaaaa cattaaatac 1800 ctgggattta tagtgggagg aggagcgctg aagatggatc cggatcgtgt gtcggccatt 1860 ttgcgcattc ccgagccgcg atctgttcga gaattgcgta gcttcctggg aactgcgggc 1920 tggtaccggc gctttatgaa aaactatgct gagatggccg gtccgctcac agacgccctg 1980 aaaaagtcgg gaaatgggaa gtttaaattg acggaggagg caaaacgagc catggggaag 2040 ctaaaagctg cactcacctc ggcaccagtc ctggtacacg cggatttcaa gcggcacttc 2100 tacatacagt gcgacgcctc acattatggg gtaggagccg tactcttcca acgggatgac 2160 gagcagaacg agaggcccat agcgtttttc tctgccaaac tgaattgcca ccaaaaaaac 2220 tactcagtga cagagaagga gtgtctggca gccctcatgg cgatccttaa gttccgtccg 2280 tacgtagagc ttatgccgtt tacagtcata actgaccacg ccagtctcaa atggctcatg 2340 accatgaaga acctggatgg tcgcttagct cgttggtccc ttcaactgca ggccttcgat 2400 tttggcatag aacaccggaa aggagcggac aatgtagtgg cagatacatt gtcacgcagt 2460 attgaagagc tggaagtgga tccgagtagt atcctgggtt ttgaaacggt ggagtttgaa 2520 tctgtagaat atcaggagtt aaggaaggaa ataacagaga accaagaccg cttacctgat 2580 ctccagattg ttgacggcat gatattcaag cgcatgcgct ttgaacgatt ggatgatcaa 2640 ctggagggcg ccgtctggaa attgtgggtt ccgagttcgc tgacggccgc gttgatagac 2700 aaagcacatt cggaagagaa gacagcgcac ggaggaatcg cgaaaacctt gcacttattg 2760 cgtaggcagt tttactggcc gaacatggcg atagaggtac gggattatat ccgacggtgc 2820 accgtgtgca aggagtcgaa agtgccgaat tatcggatgc aggtcgggat ggggcaagaa 2880 gtggtcactg agcgcccgtt ccaaaaactt tatattgatt ttttgggtaa atatccgcgg 2940 tcaaagaacg gtcaggcgtg gatatttatt gtagttgacc atttctcgaa gttcacgttc 3000 ctaaagacta tgaccaaggc cacggcaaag gaagttgtaa gattcctcgt tcacgaggtg 3060 ttttacaagt ttggagtgcc ggagatgatt cactcggaca atggggctca gttcgtctcg 3120 aaaactttta aggagatgac cgatgcattc ggagtcactc atatgaggac gccagtgtat 3180 tctccgcaga gtaacgcagc cgaacgcgtg aatcgcacag tactcagcgc gattcgagcg 3240 tatctggaag acgatcaccg ggactgggat gtgcatctgc cggagataga actggctatt 3300 aggaactcgg ttcacgaggc gacaggcgta acgccattct ttgcagtctt cgggcagcaa 3360 atgtatttgc atgggtcatg ctacaagctg gcaaagcgac tgcggtcaat cggcgagcat 3420 gacatatcgc tgctggaggg tcaggacaag cgacaactca tccaagagag gattcgcgct 3480 agtttacatc aggcatacga gcgcagtagg gtaaggtaca accagcgagc tcgggtgttt 3540 tacgcgaggc cgggacaaga ggtattccgc cgaaattttg tccttagtga cttcggaaaa 3600 ggctttaatg ccaagttcgc gcgcaaattc atccgatgcc gggtagtgaa gccagtagga 3660 gagaacgcgt atgctctgga ggaccttaac ggccgtccca taggagtgta ccacgccaaa 3720 gacctgagaa tgtagtgagg tgtgggtgtg gcctacgttg ggcgcggcag aagaatacag 3780 ggcacgccgg cgggtgctaa atagtgttcc ggaatccggt gggcatgccc acgcgtatgt 3840 tgtccctcct gggactgacg acatactcgt ggtggtgttc cagtcccacc tgaacgatag 3900 gtggccaggg ctgagggcga cgcgccatct agcggaagtc cagggaggct cgatggtcga 3960 tggggacatc ggggggagct ggagccgtta ctggggatat aaggaagaga ataggattag 4020 ttatatgtat tagtaaggga aaaagggata aataggagtg aaaagattag aaaacgtgga 4080 ttgggttatg aaaatcgtgg agcgtggacg gagtcacgtc gggatccgaa attttgtgaa 4140 ttaaacaaac agtttgaagt gagtaccgaa aaagcgggca agaaagtagg tagtaatgca 4200 aaattttaca aacagtttga accggcaagg aatacagcca ctagaaccgg gccggaagat 4260 ttcctggaga ggaacactgg agcccgagaa agtcgactgc agagaagaag acgatacaac 4320 gggacctgag tgagttcgtt ccagttgcgc gtgggtgagc caagtagcgc gggacgtgtg 4380 aagggagggg gtgaaagagg tcgagtgagc ggccagggcg attctagcca caccgcacga 4440 tcgttgcatc gccgtccagt gcgtgccaca cgtttgggct ctgttaccaa gtgaaaacct 4500 cgaaatccta ttcacacaca cacacacaca cacgcgcaca cgtatacacg aacctagata 4560 aatttagggc tgaccggcca cggccagcga tcagcgagcc cacgttattg aatttaaaga 4620 aaaacaaata atccgcactc caaacaccgt tccactgtaa atgttatttt gttccgtgtg 4680 ttgtccttta ttctgtattt agtataagct caaaacgttt aaagtaaaat cttggccagt 4740 gaaaaaaaat atgtatgtaa aaacaccaac acctttcaca aacgtaaatc tgcgatcagc 4800 tggtcaacgc cagaagatta acccttagcg ggtgacgcgg gggtcccatc agtccaccgt 4860 atttgggcga gtgaattgtg gaaaatattg gttattgttt atgtcccggg gacccttaac 4920 aagagctctt ggtaggtcag gtctccgtag gggcccgagt tgaattaact cccgtaagac 4980 ggatccagtc cccactcatc tacacatagt ccagtacgaa tgggtgaggg tatccctgta 5040 ggttaaacga aatttcagca gttatttccg tgtttgtctt tgtgtcgaac gttttggtgt 5100 ttctttttgg ttagggataa ttttgagtat agctatttat taataagaaa tatgaagttg 5160 tcaatttata attattaaag cttggatata gactgaagta tgatgcgtta aatttaaaat 5220 agagggagga cgcgtggcgt aagccatgga ttaggccggc cgttaccgtt ccagcagagg 5280 gtggccaaaa tgaacgttgt gtttggtcac aggtagggcg ggcgcgcgaa gtggtaacac 5340 ccaagcttca cccacttgat agccgtctgt ttatgatttt ctttgttgtt gttaggttgt 5400 tgttagtttt gatgtcccga gaagtagtat gtctcctttt ttgtctacat ttggcgggca 5460 aggggaacta cccagccctg gggtcgacag ctagccggca ttgccctctg ggaaacaagc 5520 taaatgtaac aaacggcgcc caaactattt tcgaattcgc atgcatctgg actgacgcag 5580 gttcgagtgg tattacaaca aatgtcactc gggctggtaa aggcagatgc gggggaatga 5640 aaaaaaaaat ataaatcttg tggagaagac gacagctcgc gaagtgcagt ttagctgtcc 5700 cggcagaagt cgctgctgca ggcttaaggt gagcgtaaag ctggtcctta atggcgcggg 5760 gtgattgatg ctagggatgg atagacggag gcagttaacg gaaatataaa atatgtgata 5820 gggacttacg aattacagtt gaacattttg tcgctattcg atgtcacaag tgaacgtaag 5880 tcactgccgt tcaacaaaaa aatgaacagg gagtttgcga cggacacatt gcctggtgga 5940 ctgtgatatc gaagaaatct gtgatggaat gtgatattgg aataagttag agtaagtcat 6000 gtaggtaaat gttcattgta gtttcgcacc tgtttgttta aaacaaagcc gctttcgtgt 6060 ttttggacct gttgtctgtc tgccgagagg tggtgaggag taggtggccc gggagtcgca 6120 aagacaaccc gcgatgcatc tgttgtcatg gcagatcggc atgatggatg acaggaacaa 6180 atcacaagca aagtgagctc atgcaccgtg aggcttctcg tttatcggcc atcgcgtaga 6240 gcgatgacta gagtagatga caggccacac aaaggtggtg ttcgtaattg taggatatcc 6300 gtagggcgac tgtcaagtgg agcgcagaaa ctggagcggt taccggtccc ggctagtcac 6360 gtgtccccct cttaatgaat ctagtaagaa gaagaagaac tgttaggaat ttttgttgtt 6420 aggttttg 6428 // ID CR1-121_AAe repbase; DNA; INV; 2415 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-121_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2415 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1209-1209 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 19 sequences with >96% CC identity. This consensus is 5'-truncated. XX FH Key Location/Qualifiers FT CDS 3..2300 FT /product="CR1-121_AAe_1p" FT /note="reverse transcriptase." FT /translation="VNDVVNENGRCLDLCFVSTQDVAPSIAIAPSALVKNV FT SHHPPLVLLLADSEPCEPLTISPVSYDFNRANQQGIVEFLASVNWLEVLDT FT NDVENAALTFTHILGHVIERHVPKKVHGTAGPPWQTQELRRLKTSKRAALR FT LYSKRGTQSHRNHYVSVNNKYKSTSRRCFRRYQQHIQQNLKTKPKSFWRYV FT NQQRKESGLPSTVKLDDEVASDRNQACRLFAKKFSSVFCDEMLSEEYVALA FT ASNVPLVMETLNAFSVDDGMIARAATQLKSSINPGPDGIPSSFVKSYIEQL FT LLPIRHVFALSLTSGTFPSVWKNAIIFPVHKKGDRKDVSNYRGISSLCAIS FT KLFELVVMEPLIAHCKQFISNEQHGFIPGRSTTTNLLSLTSHVSDSFTERV FT QTDVIYTDLTAAFDKINHLITIAKLERLGISGLMLRWFKSYLTERQLTVGL FT EGHFSETFTATSGIPQGSHLGPVIFLLYFNDVNYVIKGPRLSYADDLKMYL FT RVRSICDAITLQNDLDAFAGWCSLNRMVVNPEKCNVISFSRRKEPVYFNYQ FT LFGKILQRVDHVKDLGVILDAQLTFKHHMSYVIAKASRTLGFIFRIAKEFT FT DVYCLKSLYCSLVRATLEYCSTVWHPHYQNGVQRIESVQRRFIRFALRRLP FT WRDPHRLPSYHSRCQLIHLETLQSRRDINRAMIVADLLQGRIDCPAILGQV FT DISARHRNLRANSMLRPPLRRTNFGQNVAINGLQRVFNRVAPAFDFHLPRH FT TVRRNFAEILTRRIV" XX SQ Sequence 2415 BP; 673 A; 532 C; 505 G; 705 T; 0 other; aagtcaatga cgtagtgaat gaaaacggac gatgtttaga tctctgcttt gttagcacac 60 aggatgttgc accttcaata gcaatcgccc catcagcttt ggttaagaac gtttcacatc 120 acccgccgtt ggttctcctg ctcgccgata gtgaaccatg tgaaccgcta accatatctc 180 cggtttcata cgatttcaat agagcaaatc aacagggaat agttgaattt ttagcctctg 240 taaactggct tgaggttctt gatacgaatg acgttgaaaa tgctgcccta acttttacgc 300 atatcttagg gcatgtaatt gaaagacatg tacccaaaaa agtccatggg accgctggtc 360 caccttggca aactcaagag ctccgtcgcc tcaagacttc caagagagca gccctcaggc 420 tatactccaa gcgtggaacg cagtctcacc gtaatcacta tgtgagtgtt aacaacaaat 480 ataaaagcac tagccggcgt tgtttcagac gatatcagca gcacattcag caaaatctca 540 aaaccaaacc taaatcattt tggcggtacg tgaaccaaca acgaaaggaa tccggattgc 600 cttctactgt gaaactagac gatgaagtgg cctctgatag aaatcaagct tgtcggcttt 660 ttgccaaaaa attttctagc gttttttgtg acgagatgtt atcagaggag tatgtagcac 720 ttgctgctag taacgttccg ttggtcatgg agacattgaa cgcatttagt gtcgatgatg 780 gtatgattgc tagagctgcc acccaattga aatcttcgat caatcccggc cccgatggaa 840 ttccttcttc cttcgtgaaa tcgtatatcg agcagttgct cttacctatc cgtcatgtgt 900 ttgctttgtc cctaacgagt ggaacgttcc catcggtttg gaagaatgct atcatttttc 960 cagttcacaa aaaaggcgac cgcaaggatg ttagtaatta tcgtgggatc tcgtcacttt 1020 gtgctatttc aaaactcttt gagttagtag taatggaacc gctaatagct cactgtaagc 1080 agttcataag taatgagcaa catggtttca ttcctgggag atctactacc accaatctct 1140 tgagtcttac atcccacgtt tccgatagct tcactgaacg tgttcaaaca gatgttattt 1200 atacggatct gactgctgca tttgacaaga tcaaccattt gataacgatc gcaaaactgg 1260 aaaggctcgg aatttccgga ctaatgctac gttggtttaa gtcctacctt acagagagac 1320 agctgacggt cggactggaa ggacattttt cagaaacatt cacggctact tctggaatac 1380 ctcaaggaag tcatttgggt cctgtgatct ttttattgta ctttaacgat gtgaactacg 1440 taattaaagg gccacgatta agctatgccg acgatttaaa aatgtatctc cgtgtccgtt 1500 ccatctgtga tgctataact ctacagaacg atttggatgc ttttgcagga tggtgtagtt 1560 tgaatcgcat ggtggtgaat cctgaaaagt gtaatgttat atcgttctca cggagaaaag 1620 agcctgtcta tttcaactac cagttattcg gaaaaatact tcaacgagtt gatcatgtga 1680 aggatttagg tgtcattctg gatgcccaac taacgttcaa gcaccatatg tcctacgtca 1740 tcgcgaaagc ttccaggact ttaggattta tcttccgtat tgctaaggaa ttcaccgacg 1800 tatactgcct caagtcactc tactgttcgc ttgtacgtgc gactttggaa tattgttcga 1860 cggtttggca cccacactat cagaacggcg ttcaaaggat tgaatcggtt caacgtcgtt 1920 ttattcgttt cgcactccgc cgacttcctt ggcgcgatcc acaccgatta cctagctatc 1980 atagtagatg ccaattgatc catcttgaga cgttacagtc acgccgggac ataaacagag 2040 cgatgatagt tgctgacctg ttgcaaggga gaatcgactg ccccgctatt cttggacagg 2100 tagatataag tgctcgccac cggaatttgc gagctaattc tatgttacga ccacctctac 2160 gacgaactaa ttttggacaa aatgtagcta taaatgggct gcaacgcgta ttcaacagag 2220 tagcaccagc gttcgacttt catcttccaa gacatacagt ccgccgtaat tttgccgaaa 2280 tacttactcg aagaattgtt tgatttttat tgtgtacttg tttttatata tcttgaccta 2340 ttttgtcgta aatttaagta accacacatc attgggacta attgtctgtt ggtgtaaaca 2400 aataaacaaa aaaaa 2415 // ID Gypsy-65_CQ-I repbase; DNA; INV; 5949 BP. XX AC AAWU01038188; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-65_CQ_; KW Gypsy-65_CQ-LTR; Gypsy-65_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5949 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 509-509 (2011). XX DR Genome; AAWU01038188; Positions 8033 2085. XX CC Positions [2218-2721] - Reverse transcriptase CC Positions [3790-4266] - Integrase core CC 'ATTT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 143..2047 FT /product="Gypsy-65_CQ-I_2p" FT /translation="MMEVNPIDLSNEELNFELELRGVAGLGPMTHRIKCTT FT LEKVMQDDLKSGRVYASSSHVVDGSSEIRICHSRIKRLLPQIITGIERNNI FT AFLEPIVSRLNHYYNRLTLVSPPSADMREEWTGVRDTIARTKRQIFRILYP FT ANQSTNQYSVSSVSNSTAPAHNSTAVSSGSARQINQLSLAGAVGGDSSGME FT RGSSASARGTSTGAIPKRTSDQSVNDLLLLSGQASAQARNAAAVAHGGQGL FT PVDPRPHSLGRGRGRPIFNPNHPQRDPPPLPPNLGRDITPPPRLQNDDAEY FT NELRDRVLCDLLRRERQPAEGARIPKAIHNWPFKFRGTKDSTNLNTFLDRV FT ESFAFSEGVNNAMLLRAIKHLLQDDALDWYSRALNEGELDTWENFKLMIRQ FT EYLSSHYGQLLREEASSRYQGANEKFQKYYRDISALFRFVQPPMSQEDKFF FT ILKKNMNPEYATILASARPQTIQDMVEVCNAYDDTRTLLTHQRRTPIPHSS FT LLEPNLATPSGSQRSQANNAGWQRFGGRVNAIDSREENQDRRGRYEAASAD FT QRESLDEDDWTDQIEQLTQQICAIRTQFQKRSNEQYRRGGNYAGQRWQSHQ FT QQPVAEEQQPMLARNNHQQPNRIRNPCENSTSRNSGS" FT CDS 2014..4638 FT /product="Gypsy-65_CQ-I_1p" FT /translation="MRELNKQEQWQLNKAIAKFPCSREGMIGRTTKYVHRI FT DIGEAKPKKQRVYVMSKYVLDEVNKEVDRMLALDVIEEAMFSPWNNPLVAV FT KKKTGQYRVCLGARYLNSIMQNEGHPIPQIANIINNLGGCRYISSIDLKDA FT FWQLPLEKESRPMTAFTVPGRGHFQFKVVPFGLCTASQALARLMTHLFADL FT EPHVFHYLDDVIICSRNFDEHLRLLNIVAERLRDANLTISPDKSKFCRSEM FT KYLGYVLNENGWKVDEGKVACIVKFPRPENRKDVQRFLGLCGWYRRFIANF FT SRIAVPLTELTKAKVKFKWTPAAEDAFVKLKSLLVSAPVLAMPDYSKPFNI FT ACDASDTAIGAVLAQEIDGEEHPIAYFSQKLSASERNYSVTERECLAVIRA FT IEHFRGYVEGVRFTGYCDHSALTYLRSIKNPTALMCRWILRLNAFDFNIEY FT RKGSCNIVPDALSRIVASLVFAVKAMDDSWYKRLKQSVDKQPDKFPDFKIA FT SGELYKNCRCKDEFGNTVHKWKKAVPLGERADVISKFHDSVTGAHLGFQKT FT WQKLQNFYYWPKMQQDVGRYVRSCAVCKASKAPNTKMMPTMGKLKPARVPW FT ELISIDFVGPLPRSKDGNTVLLVIVDWVTKYVIAHPMRSADTCKMVAFLEK FT EVFLRYSRPRIVLSDNGKQFVSASFKSLLARHKIEHMTTAFYCPMVNNAER FT VNRVLVTCIRALLDEDHRSWDENLPSIVAAINSAKHEATGVSPHFANFGRD FT LILHTDLYKQQDLNAPDDPKLAQDLRLAKLKRIHEFVFQRIKNNHEKSKQR FT YNLRTRVVSFKVGELVWRKLFSLSSKADHVNQKLNPKYVPAIVKAVLGHNL FT YELEDISSGKRGRYHAKDLKSD" XX SQ Sequence 5949 BP; 1626 A; 1508 C; 1624 G; 1191 T; 0 other; aaatggcgcc caactaaaaa acccacacta ggcctaggct agaattgtaa aaccattgcg 60 tgcttattcg tagaatatat gttttttgtt gttcttggga taaatttttg cttttttgaa 120 gacttttgag agttcagcga agatgatgga ggtcaatccc attgatctga gcaatgaaga 180 actcaacttt gagttagagc ttcgtggtgt agcaggcctt gggccgatga cgcacagaat 240 taaatgtaca acgttagaga aagttatgca agatgatcta aagtcgggga gagtttacgc 300 ttcctcgtct cacgtggtgg atggctcgtc ggaaataaga atatgtcatt cgcggatcaa 360 gcgattgttg ccacaaatta tcaccgggat agaacggaac aacatcgctt ttctcgaacc 420 aatcgtgtcg cgcttgaacc attactacaa tcggctcacc ctggtttcgc cgccgtccgc 480 ggacatgcga gaagaatgga cgggagtgcg agacacgatc gcgcgtacga aacggcagat 540 attccgaatt ctttacccgg ccaaccagtc aacaaatcag tacagtgtca gttcggtttc 600 aaactccaca gcgccggcgc acaacagcac agctgtcagc agcggcagtg ctcggcagat 660 caatcagcta tcgctagcag gtgcagttgg gggcgattcg tcgggcatgg aaagaggaag 720 ttctgcaagt gcgcgaggaa catctaccgg agccatcccg aaaaggacat cggaccaaag 780 cgtgaatgat ctgctactcc tgtcgggtca agcttcggcg caagcccgca acgcagcagc 840 agtagcacat ggaggacaag gcttaccggt cgacccaaga ccgcattcgc taggtcgagg 900 acgaggacgg ccaatcttca acccgaatca tcctcagcgc gatcctccac cacttccacc 960 gaatctagga cgagacatta ctccacctcc aagactgcag aacgatgacg cagaatacaa 1020 cgaactcagg gatcgggtac tgtgtgacct gcttcgccgc gaacgtcaac cagccgaagg 1080 cgcgcgaatt ccgaaggcaa tccacaactg gccattcaag tttcggggaa cgaaagactc 1140 gaccaacctt aacacgttcc tggatcgagt ggaatcattc gctttttcgg aaggagtcaa 1200 taacgcgatg ttgctgcggg cgatcaaaca tctgctgcag gacgatgcgc tggattggta 1260 cagtcgtgcg ctgaacgaag gagaattgga tacctgggag aacttcaagc tgatgatccg 1320 ccaagagtac ctgtcgagcc actacggcca gctgttgcgt gaagaagcgt cgtcgagata 1380 ccaaggagca aacgaaaagt tccaaaagta ctaccgggac atctctgcgc ttttccgctt 1440 cgtgcaaccg ccgatgtcgc aggaggacaa gtttttcata ctgaaaaaga acatgaatcc 1500 agagtacgca accatcctag cgtcagcaag gccgcagaca atccaagaca tggtcgaagt 1560 ttgtaacgcc tacgacgaca cgcgtacgct tctcacgcac cagcgtcgaa cgccaatccc 1620 gcacagctcg cttctcgagc caaatctggc cacacccagc ggttcgcagc gctcacaagc 1680 aaacaacgca ggatggcaga ggtttggcgg cagggtcaac gcgatcgaca gtcgggaaga 1740 gaaccaggac aggcgaggga gatacgaagc agcaagcgcg gaccagcggg aaagcctgga 1800 cgaggacgat tggacggatc agatagaaca actgacgcag cagatttgtg cgatccgtac 1860 acagtttcag aaacgcagca acgaacagta cagaagagga ggaaactatg cgggacaacg 1920 ttggcaatcc catcaacaac aaccggtagc agaagaacag cagccgatgt tggcaagaaa 1980 taaccatcag caaccgaacc ggatccggaa cccatgcgag aactcaacaa gcaggaacag 2040 tggcagctga acaaggcgat cgcgaagttt ccgtgtagcc gggaaggcat gatcggacgc 2100 actacaaagt acgtgcaccg aattgacatt ggagaggcga aaccgaagaa gcagagggtc 2160 tacgttatgt cgaagtacgt gctggatgag gtcaacaagg aggtcgaccg gatgctcgcg 2220 ctggacgtga tcgaagaagc aatgttcagt ccctggaaca acccgctggt ggcggtgaag 2280 aagaaaacgg gccagtatag agtgtgtctg ggcgcgcgct acctgaactc gatcatgcag 2340 aacgaagggc acccgattcc gcagatcgcc aacatcatca acaacctggg cggctgcagg 2400 tacatctcgt cgatcgactt gaaggacgca ttctggcagt tgccgctgga aaaggagtca 2460 agaccaatga cagcgttcac tgtaccggga cgtggccact tccagttcaa ggttgtaccg 2520 tttgggttgt gcacagcgag ccaagccctg gcgcgcttga tgacgcatct gttcgccgat 2580 ctcgagccgc atgtcttcca ctacctagac gacgttatca tctgttccag gaactttgac 2640 gaacacctta ggctgctgaa catcgttgcc gaacgactga gggacgcaaa ccttacgata 2700 tctccggaca agtcgaaatt ctgccgcagc gagatgaagt acttggggta cgtcctaaac 2760 gaaaacggtt ggaaggtcga cgaaggcaaa gtagcgtgca ttgtgaagtt tccgagaccg 2820 gagaacagaa aggacgtcca acgtttctta ggtctttgcg gttggtatcg tcgctttatc 2880 gccaacttct cgcgcatcgc agtgccgctc accgagctga cgaaagcgaa agttaagttc 2940 aagtggactc cggccgccga ggacgcattc gtgaagctga agtcgttgct agtgtcggcg 3000 cccgttctgg ccatgccgga ctactcgaag ccgttcaaca tagcgtgtga cgcaagcgac 3060 acggccatcg gtgctgtact cgcgcaagag atcgacggag aggaacaccc gattgcatac 3120 ttctcgcaga agctgtccgc ttcggagcgc aactactcag tgacggaaag ggaatgccta 3180 gcggtcatca gagcaataga acactttcgt ggctacgtag aaggagtgag atttacgggg 3240 tactgcgacc actcagctct gacgtacttg cgatcgatca agaatcctac cgcgctgatg 3300 tgcagatgga tcttgcggct aaacgcgttc gatttcaaca tcgaataccg aaaaggatcg 3360 tgcaacatcg ttcccgacgc gctctcaagg attgttgcgt cgctggtgtt tgcggtgaag 3420 gcgatggacg acagctggta caagcgactg aagcagagcg tggataaaca gccggacaag 3480 tttccggact tcaaaatcgc tagcggtgaa ctgtacaaga attgtcgctg caaggacgag 3540 tttgggaaca cggtgcacaa gtggaagaaa gcggttccgt tgggcgaaag agcagacgtg 3600 atcagcaagt tccacgattc ggtgaccggt gcgcacttgg gattccagaa aacgtggcaa 3660 aagctgcaga acttctacta ctggccaaag atgcagcaag acgttggtcg ttacgttcgc 3720 tcctgtgcgg tctgtaaggc cagcaaagcg ccaaacacca agatgatgcc gacgatgggg 3780 aagctcaagc cagcgcgcgt gccctgggaa ctcatctcga tcgattttgt aggtccactg 3840 ccgaggtcga aagacgggaa cacggttctg ctcgtaattg tagactgggt aacgaagtac 3900 gtgatcgcgc accccatgcg aagcgctgac acgtgcaaga tggtggcgtt cttggagaaa 3960 gaagtctttt tgcgttattc caggccaagg atcgtgctca gcgacaacgg aaagcagttt 4020 gtgtcagcat cgttcaaatc gcttctcgca cggcacaaga tcgagcacat gaccaccgca 4080 ttctactgcc cgatggtcaa caacgccgag cgcgtcaaca gagtgctggt cacctgcatc 4140 cgggcactgc tcgatgagga tcaccgttcc tgggacgaga acttgccgtc gattgtagcg 4200 gcgataaaca gcgcgaagca cgaagccacc ggggtaagcc cgcactttgc gaatttcgga 4260 agggacctga ttctgcatac cgacctgtac aaacagcaag atctgaacgc gccggacgat 4320 ccaaagctcg cacaggactt gcggctggca aagctcaagc gaattcacga gttcgtcttc 4380 cagaggatca agaacaacca cgagaagtcc aaacagcgat acaacctgcg cactcgagtc 4440 gtgtcgttca aagtgggaga acttgtttgg cggaagctgt tctcgttgtc gtcaaaagca 4500 gatcacgtca accagaagct caacccgaag tacgttccag cgatcgtgaa ggcggtgctc 4560 ggccacaact tgtacgagct ggaggacatc tcgagtggca agcgagggcg ctaccacgcc 4620 aaggacctga aatccgactg atcctagtat cccaaagcta tgtctgcaac caaaaggttg 4680 cgaaccagat cgacaagaac aacaaacacg caaagcggta aacatgtcgt ctgaggaacg 4740 aggaggagaa aaagtcatca agctgttttg cagcggtatg acccagccag ctatgtagcc 4800 ttcacaccac gagaggagga caagaccaaa caacacacgg tgccaagtac agcacaagca 4860 accgggatgg tgaggaacca gcaggagagt ccggaacgtt gaagcgcgcc gaggacaaaa 4920 gtagcagtac tggctgatcg tgtagtgcag tgccgaacga gacaagcctc gtgcagaggt 4980 tctggacgca gcacgcaccg aacgcaggaa gacagcaact ggagcagctt tgacgcagga 5040 cagccgctgg agaactgact atccgggaac gctgacgcgg gaagaagagc ctggacaaca 5100 aaacaccact accaccaagc acaagcgcac aatgggccct gagcgcaagc ttcggtgaga 5160 aacaaccaac cagatcaaac aattcttgct atgagcgtac cagctccaaa cttctcaaaa 5220 caccttcggg gtacaaacat tctctcgcct ggaccggtac attcgtccag tgttgtcacg 5280 atctagctca ctcattggtc ttcagattgg tcgttgaggc atcaccatga cacttccatc 5340 aaaagcaacg cgaatctagg gcgcggacgt tcagccgggc aaccgctgaa tttaaccctt 5400 agccggactc atggtgatag gagatgggga aatcctctca aacctccgga aatcgaccac 5460 tctccgacac tgtcgacatg tcgtttgaga gcgtagtgcg gatcgcttcc cgattcagtc 5520 actagaatta gcaaagttat cccatagtta gtattacttg ttacccgttt ttcgtcatat 5580 cgatatctat tagtcgttag atttcagtta ataagtaaaa cattagctgt aagtctggat 5640 cgatgtgcca ttgtagcaag agttggaaca acaaaaaact gagctggaga cactttgggc 5700 tggtaaagtt agccagctgg tcactttctg tgtagagtaa gtctacagca tcgggttcgt 5760 ttatgtttgt aagctgtaca tgttttggag agccgcgggc tttcgtgtct tagttagctt 5820 tgagtttagg gcattgtgtc catgaccagt aaagtctcca gcacactttt tagggaaagt 5880 gaagttttaa gaattttccg tgtgggtctg ccatcccaca cggaaaattc tgccgcaatg 5940 gcagtgttg 5949 // ID BEL-10_AA-LTR repbase; DNA; INV; 371 BP. XX AC supercont1.13; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_AA_; KW BEL-10_AA-I; BEL-10_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-371 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.13; Positions 1060498 1060868. XX SQ Sequence 371 BP; 95 A; 87 C; 97 G; 92 T; 0 other; tgagtgtccc acaatcagcg tcatcgttat cgtggcactc attgcatggt tcatccggga 60 gctcatcgta cagtggcagt gacggtagca gcacacgaat cgcgacgctt ccacgcgctt 120 ctagaagctt ccgcgtgttt ctagattgcg agcgcggtgc tatatatacg cgagccgtat 180 gcgtagcaga ctcagtatta ttttaaccgc cgcgtgatta tgatcgacag tgaaagtgaa 240 gtgtaatagt aaagtagtga agtagtgaag tgaataaagt gcagtgttta gtaaaccagt 300 ggtatcgaat tctttcatcc gaaatacagt ccaccccgac ccgaccggag ttggccacgg 360 ttaacgcgcc a 371 // ID GYPSY1-LTR_CS repbase; DNA; INV; 471 BP. XX AC . XX DT 11-APR-2005 (Rel. 10.04, Created) DT 11-APR-2005 (Rel. 10.04, Last updated, Version 1) XX DE Clonorchis sinensis LTR retrotransposon, LTR sequence consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CsRn1; Ty3/gypsy-like; GYPSY1-LTR_CS. XX OS Clonorchis sinensis OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis. XX RN [1] RP 1-471 RA Bae Y.A., Moon S.Y., Kong Y., Cho S.Y. and Rhyu M.G.; RT "CsRn1, a novel active retrotransposon in a parasitic trematode, RT Clonorchis sinensis, discloses a new phylogenetic clade of RT Ty3/gypsy-like LTR retrotransposons."; RL Mol Biol Evol 18(8), 1474-1483 (2001). XX RN [2] RP 1-471 RA Gentles A. and Jurka J.; RT "C. sinensis gypsy LTR retrotransposon."; RL Direct Submission to Repbase Update (11-APR-2005). XX DR [2] (Consensus) XX SQ Sequence 471 BP; 124 A; 105 C; 79 G; 163 T; 0 other; tgtggcgttt tgtaccttta ttttgctttc attataacac tacgtatttc atcaattgca 60 taatcgtttt ttttgtttga atacgaaaaa ctacaaaaac gcgcacttgt acaatttcac 120 gcaatacaaa cgtggtcacg tgcttcgact cttctcttct gatttcgctc gcgcttgcag 180 cggtcatttg cgagtctcat gtgctaactc tggtatcaaa acatttcaga ccatcttcgg 240 aatctattat acgtgggaac tgttcagaag aagacccctg gaacaccata attggattat 300 tgtttatcct ctgacctggg ccaacgacga actattttaa agagaacaat tattagtgag 360 tctgttccat ctgacctctt tattattgta tacccaacct agcgttatta tttttatttc 420 tttgccacac actcctggtg aagtaactta ttgggtagcc agaccactac a 471 // ID DNAX-8_AP repbase; DNA; INV; 152 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-8_AP. XX NM DNAX-8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-152 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2064-2064 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TA or TAA TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 152 BP; 40 A; 43 C; 38 G; 31 T; 0 other; ctgcggccca cgagagagta ttactgtcgc ccgacgaaaa cacgttcgat tccgcgcatc 60 gccacgtatt tgacatacac taaatggcac aaagtggggg aatgcctaca gtctgcatgc 120 gcaaaactgt aatactctct cgtgggccgc ag 152 // ID Gypsy-142_AA-I repbase; DNA; INV; 7343 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-142_AA_; KW Gypsy-142_AA-LTR; Gypsy-142_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7343 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1017-1017 (2011). XX DR [2] (Consensus) XX CC Positions [5059-5541] - Integrase core CC 'AACT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 401..2638 FT /product="Gypsy-142_AA-I_2p" FT /translation="MEWYKINACYLDESELNYELLLRDYSTRGSLEERRRD FT LRALLRDPQSERLVRVTEHMMVDDLCTVPRKLKEIADILFRGSDPSCLSRL FT VHYHQRIRRYQPRTRRQQEDVQYLLDLIARITMHYYGLNFDDTVRSVPIAE FT STVRVPRSDGNTAGAPFSVIIGPQSISENTAVEHEITQRNRESPWLMTWNE FT AEEAVGGNMERSDDTTGVIPALVENLLNFRNQDRTQTTGAVPRQANPPDAS FT PEQCGNPSSIAHGDGPTFDHAPNVHGVQTGAVTEPNVFRPTIPLRGMSATI FT DRSRSGCSTPVPNPIRSSPVSNAVPENINLQNNEYVHVSEISTYVQRCFDQ FT LVRQRFSTVRLDPAVDALSNNLAEMNFHPRTQALRDNRNTSVNQTFRFPDP FT SPPLQLSGPTLEQRSTPVPNPSGWNTMNSTSRPRNDFRSAANSGVGNVTFA FT PMNQMNSGRSSASGYPRRLPHQQCSIIEKWPKFTGDTNSVPVTDFLRQIDI FT LCRSYDITKPELRMHAHLLFKDGAYVWYTTYEEKFTSWEMLESYLKMRYDN FT PNRDRIIREEMRARKQRPTELFSAYLTEMEMLAQRMMKKMTEAEKFEVIVE FT NMKQSYKRRLALEPIHSIEHLAQLCFKFDALESNLYSTAAHSRPAINQLDC FT DENYEDEQRELEETEELNALKARFNKRFPVTKPTPGDSKDKTKQTMCWNCQ FT GVGHMWRECDKRKTIFCHVCGLADTTAYRCPNKHELGQKEELPKNE" FT CDS 3109..5910 FT /product="Gypsy-142_AA-I_1p" FT /translation="MPFADDYFGDRSKEVCFQIVPSSDNNLSDPTENIDES FT LEMPTVELPQKTFETLDDLSTEHLLSDTERKALFEAIKQLPETHEGSLGRT FT QLIQHRIDLLPDAKPKRIAYYRWSPNVESVIDAEVERMEKLGVIEECHGPV FT DFLNPLLPIKKANGKWRICLDSRRLNQCTKKDDFPFPNMMGILQRIPKSKY FT FSVIDLSESYYQVPLEASAKDKTAFRTNKGLYRYTVMPFGLTNAPATMARL FT MTRVLGHDLEPYVYVYLDDIIIVSNSIDEHIRLIQIVAERLRRAGLTINLQ FT KSKFCQKKIKYLGYVLSEEGLSMDVSKIQPVLDYPAPKTVKDIRRLLGLAG FT FYQKFLPNYSEITTPITNLLRKGMKKFQWTEEADAALQKLKTALVSAPILA FT NADFSLPFIIETDSSDLAVGAVLAQVHDGVRKPIAYYSKKLSSTQRRYSAT FT ERECLAVLLSIENFKHFIEGSQFIIQTDAMSLTFLKTMSIESKSPRIARWA FT LKLSKYDMLLQYKKGSDNVPADALSRAVNTIDVTSSSDPYITQLIKMVQTV FT PERYPDFRISDGKVYKYIIGSTVSEDPSFRWKYVVPVVERRDIIRRIHSEA FT HLGFVKTLSKIRERHYWPRLASDVKRFCSQCEICRESKTPNINVQPICGKP FT KHCSRPWELISMDFLGPYPRSKRGNVWLLVVSDFFSKFVMVQCLRSATAAS FT TCNFVENMIFNVFGAPSICITDNATVFKSELFQKMLQRFSVTHWPLSVYHP FT SPNPAERVNRVIVTAIRCSLNREKDHRDWDKDVHQIAKAIRTNVHESTGFS FT PYFLNFGRNMISSGSEYEALRESESPKSPQEVSKDMELLYSKVKENLLKAY FT QKYSHPYNLRANRKHQFEKGDVVYKKTMYLSDKSRNFVGKFANKFEKVRVK FT EVVGTNTYVLERLDGQRIAGSYHGSFLKRV" XX SQ Sequence 7343 BP; 2172 A; 1520 C; 1586 G; 2063 T; 2 other; ttttggcgcc caacgtgggg ccttcgaaaa gcaatagttt cacatttcaa attaaattaa 60 ttatttttct ttgactggct ttgtacatat taaattgaat tcattgctta attgaactct 120 tacatagtat tttcagttac ttttcaacct attctgtcaa tttcattttg tttcttcagt 180 acctgcgcgg ctaggtttaa gtttgattga taatttcggt aattagtgat taagttcggt 240 cggttattag ttttggtttt cggttattag tgtcggtttt cggttataag tattatcggt 300 ttagtgtaaa attttcagtg aattgtgttt gaatttgtta aaaaaaaaat ctttaattta 360 agtttttctt cttttgaatt gaattttggt ttgtttgaca atggagtggt ataaaataaa 420 cgcttgttac ttggacgaaa gtgaacttaa ttatgaactt ttgttacgtg actattctac 480 gcgaggatcg ctcgaagaac gtagacgaga tctccgtgcg ttacttcgtg atccacaatc 540 ggaacgatta gtgcgagtaa cggagcacat gatggttgat gacctctgca cagtaccaag 600 gaagttgaag gagattgcgg acatcctgtt tcgtggatct gatccttcat gcctttctcg 660 cttggtgcac taccaccaaa ggattcgacg ctaccagcct cgtactcgtc gtcagcaaga 720 agatgtgcag tacttgctcg atctgattgc caggataacc atgcattatt atggccttaa 780 ctttgacgat acagtgcgtt cagtaccaat tgctgaatcc acagttcggg tacctcggtc 840 agacggaaat accgcaggtg caccgttcag tgtgatcatt ggacctcaat ccatttccga 900 aaataccgct gttgaacatg agattacgca gcgtaatcga gagtctccct ggttgatgac 960 ctggaatgag gcagaagaag cggttggagg aaacatggaa cgatcggatg atacgacggg 1020 tgtgatacca gcactcgttg aaaatctttt gaattttcgg aaccaggatc gaacacagac 1080 caccggagcg gttccgcgtc aagctaatcc accggatgcc tccccagagc agtgtggtaa 1140 tccttcctca atagcacatg gagatggtcc tacmtttgac catgcaccga atgtccacgg 1200 agtgcagact ggagccgtaa ctgaaccaaa cgtgttccgt cctacaattc cattgcgggg 1260 aatgtcggcg accatcgatc gttcaaggtc tggttgttcg acgccagttc cgaaccctat 1320 ccggtctagt ccggtatcca atgccgtacc tgaaaatata aatttgcaga ataatgagta 1380 tgtccatgta tccgagattt caacgtacgt ccagagatgc ttcgatcagt tggttcgcca 1440 gaggttctca acggttcggt tggacccagc cgttgatgca ctttcgaata atctggccga 1500 aatgaatttc catcctcgga ctcaagctct acgtgacaat cggaatacca gtgtgaatca 1560 gacattcaga ttccctgatc cttcgccacc gttgcagtta tctggtccta ctttggaaca 1620 acgtagtacc cctgtkccaa atccaagtgg ctggaatacg atgaattcaa ctagtagacc 1680 aagaaatgat ttccgatccg cagctaactc tggtgtagga aacgtcacgt ttgctccgat 1740 gaaccagatg aattctggca ggtcgtctgc tagcggttac cccagacgtt tgccgcatca 1800 gcagtgcagc ataatagaga agtggccgaa gttcactggc gataccaatt cggtccctgt 1860 cacggatttt ctgcgacaaa tagacatttt gtgtagatcg tatgatatta caaaaccaga 1920 gttacgaatg cacgcccacc tacttttcaa agatggcgcg tatgtgtggt acacaacgta 1980 tgaggaaaag tttacgtcgt gggaaatgtt agaatcttat cttaagatgc ggtatgacaa 2040 tcccaaccgg gatcgcataa ttcgcgaaga aatgcgtgca agaaaacaaa gacctacgga 2100 gttattcagc gcctatctta cggaaatgga aatgctagcc caaagaatga tgaaaaagat 2160 gactgaagct gagaagtttg aagtcattgt ggaaaatatg aagcaatcgt acaaacgtcg 2220 acttgcgtta gaaccaattc actcgattga gcatctggcg caattatgct tcaaatttga 2280 cgctctcgag tcgaatttgt actcaactgc agcacattcg aggccagcaa tcaatcaact 2340 agattgtgac gagaattacg aagatgaaca acgtgaactt gaggagacgg aagaattgaa 2400 cgctctcaag gcgagattca ataaacggtt tccagtgacc aaaccaaccc ccggagatag 2460 caaggataaa actaagcaaa cgatgtgttg gaattgtcaa ggcgttggtc acatgtggcg 2520 agaatgcgat aaaagaaaga ctatcttctg tcacgtctgt ggattagccg atactacggc 2580 ctataggtgt ccaaacaaac acgagttagg ccagaaagag gaattgccaa aaaacgagta 2640 ggcgtgggca attctgggaa tccttgccct caacgcaacg atccagaggt tcccaaaccc 2700 ttctataaca tcttcaataa tgttcatcaa atcaacacaa agtttcatcg ttgtccacac 2760 ttgaaagtca agatactctc agaagaaatc gaaggccttg cagatactgg cgcaagccta 2820 acgataatta gttcggttga cttagttaac aaattgggtt tgaaaattca tccgatagct 2880 gttaaaatct cgacagctga cggtacagca taccgatgtc taggcttcgt caatgcacct 2940 ttcacatatc aggacaaaac tcatgtgatt cagacagtta ttgttccaga agtttcaaaa 3000 tgcttgatat tgggcgtgga cttcttgaat aaatttggat ttcgacttct tcctccatct 3060 ccagcaacag tagcaagagg tcaagatgaa acaagtagtt caaacaccat gccatttgca 3120 gatgactact ttggagatcg cagtaaagaa gtttgcttcc agatcgtgcc tagttcggat 3180 aataatttgt cggatcctac ggaaaacatc gatgagagtt tggagatgcc aaccgtagaa 3240 ttaccacaga agacctttga aaccttagat gacctaagta cagaacatct cctttcggac 3300 acggaaagaa aagccttgtt tgaagcgatc aagcagttac cagaaactca tgaaggcagc 3360 ttgggtagga cgcagttgat acagcaccgt attgatctgt taccagatgc caaacccaaa 3420 cgtattgcat attatcgctg gtcacctaat gtcgaaagtg tcatcgatgc cgaagtggaa 3480 cgaatggaaa agttaggagt tattgaggag tgccatggtc cagttgattt cctcaatccg 3540 cttttaccca taaaaaaggc aaacggtaaa tggcgtatat gcctggactc ccgcaggctg 3600 aatcaatgta ccaagaaaga tgactttcca ttcccgaaca tgatgggaat tctccagaga 3660 attcccaaat caaaatactt ttcggtgatt gacttgtcgg agtcatacta ccaagtgccg 3720 ctcgaggctt cagcgaagga caagactgcg ttccgtacga acaaaggttt ataccgatat 3780 actgtcatgc cattcgggtt gacaaacgcc ccagcaacca tggcccgatt gatgacgcgt 3840 gtattaggtc atgatttaga gccatatgta tacgtttatt tggacgatat tattatcgtg 3900 tccaacagta tcgacgagca tataaggctt attcagatcg tggcagagag actacgtaga 3960 gcaggattaa cgataaacct ccaaaaaagc aagttctgcc aaaagaagat caaatatcta 4020 ggctacgtat tatcggaaga aggtctgtcc atggacgtaa gcaaaattca accggttctc 4080 gattacccag ctccgaaaac ggttaaagat attcgccggt tattaggact tgccggattt 4140 tatcagaaat tcttgccaaa ctattcggag attacaaccc ctattacgaa tctgctgaga 4200 aagggaatga aaaagtttca gtggactgaa gaggcagacg ctgctctcca aaaacttaaa 4260 acggccttag tatctgctcc catattggca aatgcagatt tttcattgcc attcataatt 4320 gagactgaca gctctgactt ggctgtaggc gcagtgctag ctcaagttca tgacggagta 4380 cgcaaaccta ttgcgtacta ctccaaaaaa ctttcgagta ctcagcgacg ctatagtgcc 4440 acagagcgtg aatgcctagc agttctcttg agcattgaaa actttaaaca tttcattgaa 4500 gggtcgcaat tcatcatcca aactgacgcg atgagtctaa cgttcctaaa gactatgtcc 4560 atcgagagta agtctccaag aattgctcga tgggcgttaa aactctcaaa atacgacatg 4620 ctcttgcagt ataagaaagg gagcgacaac gtacctgccg atgcactctc tcgtgcagtt 4680 aacactattg atgtgacatc tagttcggac ccgtacatca cccaactaat taagatggtt 4740 caaaccgtac ccgaacgtta tcccgatttt cgtatatcag acggaaaggt ctataagtat 4800 ataattggtt caactgtatc tgaagatcca tccttccgct ggaaatacgt tgttccagtt 4860 gttgaaagga gagacatcat cagaagaatt cacagcgaag cccacctagg gttcgtcaag 4920 acactttcca aaattcgcga aaggcattat tggccacgct tagcttcaga cgttaagcgg 4980 ttttgcagcc aatgcgagat atgccgtgag tcaaaaacac ccaacattaa cgttcaacca 5040 atctgcggaa agcccaagca ttgctctcgc ccttgggaac taatatcaat ggacttctta 5100 ggcccatatc caaggtccaa gagaggtaat gtttggttgc ttgtggttag cgactttttc 5160 tcaaaattcg tcatggtaca atgcttgcgt tctgcgacag ctgcgtccac ttgcaatttc 5220 gttgagaata tgattttcaa tgtattcggc gcaccatcaa tttgcattac agataatgca 5280 acagttttca aatctgagct gttccagaaa atgttgcaaa gattctccgt gacccattgg 5340 cccctgtccg tctaccaccc cagccctaat ccggctgagc gtgtcaatag agtcatcgtg 5400 acagccataa gatgctcgtt gaacagagaa aaagaccacc gcgattggga taaggatgta 5460 caccaaatcg cgaaagctat acggactaat gttcacgaaa gtacgggttt cagcccctac 5520 ttccttaatt ttgggcgaaa tatgataagt tctggtagtg agtacgaagc cctgcgagaa 5580 agcgaatcac ctaaaagtcc ccaagaagtc agcaaagaca tggaacttct ttacagcaaa 5640 gttaaggaaa acttgcttaa ggcgtaccaa aagtacagcc atccatataa cttgcgtgct 5700 aatcggaaac accagttcga gaaaggagac gtggtgtaca agaagaccat gtatctctca 5760 gacaagtctc gaaatttcgt aggcaagttt gctaacaaat tcgagaaagt tcgcgtgaaa 5820 gaagtcgttg gtacgaacac ctacgtctta gaacgtctgg atggccaacg aatagcagga 5880 agctatcacg ggtcattctt gaaacgagtg taatacgtaa agctatgtcg gtccttcgcg 5940 agaaggtaca taaacaacga aactattact gaaaatactc aatgaggttc ctaaactacg 6000 tccaatgttc gagatgtcag ctagttctgt ttcctcacat tgtcgactaa aaatattctg 6060 tccaaaaaca caagctatga ctgcattacg gccgcgtaat gcatatacta cttcaaatct 6120 aaaatacaca tctgtggtgc aacgatgcag tccgatctga gacgtcctag tgttccctcg 6180 atcgttgacg aactgataag caactttaaa acaacaagct atgacggtgc atcctaatag 6240 gcacataaac caaaagtttg caatgaaaca ctctttgagg tgaaaaatac atattgccga 6300 tgttcgagaa gtcatcataa atgtttcctc acatcgtaga ccactttcat gcagttgcat 6360 aagtcgaaca aacctagaat agaaatccaa ttagaatctt gtttatatcg ttccgtttat 6420 gtttgtgttg agctgtgttg cattgcttgc aacttatata ggtcaatgat gcacttcgtt 6480 gaccttgacc caaatcgaat aatttcactt tgtttacaaa ttgcaatagc ccggaatcac 6540 aaaatctgtt caatatttcc aatctttaac catagatttc atccacagtt cactctatag 6600 ttagcataaa atcgtttcac agtagtttcc gtcgtcgaat cgcagttaaa atcattaatt 6660 ttgaagaaat taggtagaat tagtttacct ctatcccatc agcacagctc gcgttcgttt 6720 tttgtgttga tcgatttcgt gtttcgtttt tgacagatcg tcaaaaagct ttcgtgttag 6780 aacccggtac tgaaagttcg ggatatgtgt gaagcgtttg tcgagatgtt tatgcgttcg 6840 tgattcgtgt ttttgctatg acattttgaa atgaatgatt gcggttaatt aattaatgag 6900 tgatttcgcg tatgataagt tattaagtaa tcatgtatat tttcagacca ttgttctgaa 6960 gtacgttggt tattcggaag aaaatatgct cagacatttc tgagggtaat tcggttttag 7020 tcagtaattc gttagttgaa gatgctcaga catttctgag gtaaattcgg ttttaggtat 7080 tagttttctc agacgtctct gaggtcattt cggttaagat aagttggttg aatgagtacg 7140 gttgtttgta aagcatgtga gagttcaata tggcatagca ggaccaatca tttgtttgta 7200 tttgtttagt ttactagttc agaattcttg tatgttcttc gatgtagttt gaattaaatt 7260 agagtttgat gtttaatctg tgaaattaat aaaaattttg aaattaattt caaaattttt 7320 attattatta gtgtaggcga ata 7343 // ID hAT-N6_BF repbase; DNA; INV; 228 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N6_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N6_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-228 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-228 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 916-916 (2008). XX DR [2] (Consensus) XX SQ Sequence 228 BP; 78 A; 51 C; 48 G; 51 T; 0 other; cagggctgtc tccaggaccc gtccctccgt cctgggacgg aaatttgctt gttgggacgg 60 acaaaaattt caccccatcc gtcccaaaaa tctgaacttc agtacataaa attgatgaaa 120 atgtgttttc ataagcttaa aatgacagat ttaacaagga aaatcaacaa catgggctcc 180 caaacaatga gtgggacgga aaaaaattta aagctggaga cagccctg 228 // ID Copia-109_AA-I repbase; DNA; INV; 4189 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-109_AA_; KW Copia-109_AA-LTR; Ty1_copia_Ele76; Copia-109_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4189 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1622-2125] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(986..2524,2528..4189) FT /product="Copia-109_AA-I_1p" FT /translation="MLAGEKKKESRKVSFAVDSGASSHMVNDEKLLQRAVK FT LATPVTISTAKSGETLQAVKKGKVTLKSVVGLNKIKRIELYDVLFIPDLEE FT NLLSVRKINSMGKRVTFQDKEVRIEDKGEVIAEGKNKNGLYCVDMFLEHTV FT DQAALFSQRKVSLTDWHKRLGHLSFSGLEKLLKNNMVDGIDISSNDLVKQY FT EVCESCLAGKQSCKKFLSMQLPRSSRPLELVHTDVCGYMEQSTYDGYRYFV FT TFTDDFTHFTVVYLIKRKSDVFDRFKEYEAMATAHFGQKLSKMRCDNGREY FT LGKDFQDYCRNKGIRMIFTVPYTPQQNGVSERINRTLMEKVRALLHQSELS FT KEMWGEALYVAAYVTNRSPTNALVELKTPFEMWNGRKPDLSNLKMFGSHAY FT KQIPKEKRHKLDSKTKRLTFVGYANNGYRLWNKNNRVIEISRNVVFDESIV FT RSNTTTEDVHLPIVVREQKHCENEDKEEEISICNEDESITNKDDEEEVIRL FT PEPEEEVEAEEVVIIETDEVFADAEEDEGIDEQQSELRRGTRKRKVPQRFQ FT PSSAKIASATFDEDAPETVAELKKREDWENWKIAIDEELNSLHENKTWTVV FT NCLPRNQKAIHSKWVFSVKDDGRYKARLVAKGCSQRPGFDYQDTFAPVARM FT ESVRTILSVANKNGLFIHQMDVKTAFLYGDLEETIFMKLPNDEIGNSQIVR FT LQKSLYGLKQASRSWNRKFDDEIKKLGFIPLKSDCCVYTSRAKDLILILYV FT DDILILGKQESQLNWIKSELGKLFQMKDLKEVKHFLGMDIHRDYQNQIMEI FT SQAGYAEKILKRFGMSESKPVSTPLDPNVRWVKSNDDELTSHPFKELLGCL FT QYLALVSRPDISAAVNILSKYQASPSNAHWTGLKRILRYLRGTMNTKIVFS FT NREHSNVLLGFADADFANDIDDRKSISGNSFLVFGNLVSWSTKRQQTVSLS FT STEAELISLCQAAKEGLWLTNLLNELGIESSPFIIMEDNIPCINFTEEPRS FT HQRMKHLDLKFMFIRELVRSRKLKIEYISTVDQPADAFTKGLPAAQHRKLL FT NILNVRIEGK" XX SQ Sequence 4189 BP; 1364 A; 751 C; 1011 G; 1056 T; 7 other; ggttatgggc ccagtagtca actgagcaag atttattttt cttcaacgat caagtttttt 60 tgaagaaatc gaaagatgtc tgtgaacgat gtcgaagtca aggcaggtgg tcaacctgca 120 aaccagcgac aggaggttga gaatcggacc gagagaggag ctttcgaacg gcagctgccg 180 tataccgccg ttgctgtacg agaccgtgcc catatcttcg atgggtcgga gaggatgttt 240 gcggcatgga agtggaggat ggagcaccac ctctctatga tggacctcat ccatacgttg 300 gttcgtactc cgtacgaaga agaatacgcc actcccgtgg cgggtaattc ggaagcccaa 360 gaggaagtga gaaaggcaaa gctgcgtaag agaatcaacg acgacatgaa cgctctggat 420 gaaattgtta tgtgggtgaa caatgacgtc cttaacaagc taatcggagt ctcatacgca 480 aaggaggcta tggatctatt ggtgaaaacc taccaaaaag ttggaacggg cgttatgatt 540 gccatgcgtg atcgattgtt ttcgatcaaa aatagaaaac atacttctat ttcggccttg 600 ttcgacgagt acgasccgat aatscgggag ctagatcgga tgggtactcg aatkgatcaa 660 tcggagaaga tggccgctct gwtgatagcc ataccggagc gatatcagca cgtcaaggga 720 gcattgacsg ttttgccgwg cgatgaactc tgcaagaagc cgttagtgga gataaagcgg 780 atgctgttgg atgcggagca aagcgagacg atggaggtcg atgatgagcc agtggtgcca 840 gacgtggcgc taaaaacgac cagaagagta attgagtgtt ttgggtgtgg aaagccggga 900 cactacaaaa ataaatgccc tctaatgaag gcgttcgtgg caaagaagaa gaagaagcgg 960 tttgttcata cgccaaggtc atgcgatgct agcaggagaa aagaagaagg aatctcggaa 1020 ggtatcgttt gctgtggact ctggagcttc aagccacatg gtgaacgacg aaaaattgtt 1080 acagagagcg gtgaagctgg caacaccggt taccatcagc acagcaaagt cgggggagac 1140 cctacaggct gtgaagaaag gaaaggtaac attgaaatcg gttgttggct tgaacaaaat 1200 aaaacgtatt gaattgtacg atgttctttt cattcccgat ctcgaagaaa acctcctttc 1260 agtcagaaag attaattcca tggggaaaag ggtcactttt caggataaag aggttcgaat 1320 tgaagataaa ggcgaggtta tcgctgaggg gaagaataaa aatggtttgt attgcgttga 1380 tatgtttttg gagcatactg tcgaccaagc agcgttgttt agtcaaagga aggtaagtct 1440 aaccgattgg cataaacggc tcggtcatct tagtttttcg gggcttgaaa aactgctcaa 1500 aaacaatatg gttgatggaa tagatatttc atcaaatgat ttagtcaaac agtatgaagt 1560 ttgcgagagt tgtttagcag gcaaacaatc atgcaagaag tttctttcta tgcaattacc 1620 cagatcaagt cggccacttg aattagtgca tacagatgta tgcgggtata tggagcaatc 1680 cacctacgat gggtatcggt atttcgtcac ctttaccgat gattttaccc atttcactgt 1740 tgtttatctc attaaaagaa aaagtgacgt attcgatcgt tttaaagaat acgaagcaat 1800 ggcaacagct cactttggtc aaaagttgtc taaaatgcgt tgcgataacg gtagagaata 1860 tcttggcaaa gatttccaag actactgtag gaataaaggt atcagaatga tttttaccgt 1920 tccttatact cctcagcaga acggagtaag tgaaaggatt aatcgaacgc taatggagaa 1980 ggttcgagct ttactccatc aaagtgaact ttcgaaagaa atgtggggag aagcattgta 2040 tgtggctgct tacgtcacta acaggtctcc gacaaatgct cttgttgaac tcaagacgcc 2100 ttttgaaatg tggaatggca ggaaaccaga ccttagtaat ctaaagatgt ttggtagcca 2160 tgcttacaag caaattccga aagagaaacg tcacaagctt gattcgaaga caaaacgatt 2220 gacattcgta ggttatgcga ataacggata tcggttgtgg aataaaaaca accgagtaat 2280 tgaaatatcc agaaacgttg tctttgatga atcaattgtt cgcagtaata caacaaccga 2340 agatgttcat ctaccaattg ttgtgaggga gcagaaacac tgcgaaaatg aagataagga 2400 ggaagaaatt tcaatctgca acgaagatga gagcataaca aacaaagatg atgaagaaga 2460 ggtgattcga cttcctgagc ctgaggaaga agtggaagca gaagaagttg taataatcga 2520 gacgtmtgat gaggttttcg cggatgctga ggaggatgag ggcatcgatg agcagcaatc 2580 agagctaaga cgtggtacac ggaaaagaaa agttccccaa agattccaac cgagttctgc 2640 caagatagct tctgcaacat tcgatgaaga tgctccagaa acagtagccg aattgaagaa 2700 acgggaggac tgggagaatt ggaaaattgc catcgatgaa gaattaaatt ctcttcatga 2760 aaataaaacc tggacagtgg tcaactgttt accgaggaac caaaaagcta ttcattcgaa 2820 atgggttttt tcggtgaaag acgatggccg ctataaagca cggctagttg caaaaggttg 2880 ctcacagaga cctggttttg attaccagga tacttttgct ccagtagcca gaatggaaag 2940 cgttaggaca attttgtcag tcgcaaataa gaatggactt tttattcatc aaatggatgt 3000 taagacagct tttctttatg gagatttaga ggaaacgatc ttcatgaagc ttccaaacga 3060 tgagataggt aattctcaaa tcgtacgtct acagaaaagt ttgtacggtt taaaacaagc 3120 tagtagatct tggaacagga aattcgatga cgaaattaaa aagctgggtt ttattccatt 3180 aaagagtgac tgttgtgtgt atacgtctcg agcaaaagat ttgattttga ttttatatgt 3240 agacgatata ctcattcttg gaaaacagga atcacaacta aactggataa aatctgaact 3300 tgggaagcta ttccaaatga aagatttaaa agaagtgaaa cactttttgg gtatggacat 3360 acaccgagat tatcagaacc aaatcatgga aatatctcaa gcaggttatg cagagaaaat 3420 cctaaaaaga tttggaatgt ccgaatcaaa accagttagc actccactag atccgaatgt 3480 acgatgggtc aagtcaaatg atgatgaatt aacatcacat ccattcaaag aattgcttgg 3540 ctgcttgcaa tatctagcat tagtatcacg ccctgatatc agtgctgcag taaatatcct 3600 aagcaaatac caagcttccc cttcaaatgc acattggaca ggattgaaac gcattttgcg 3660 ttatttgcga ggaaccatga acacaaaaat cgtattcagc aatagagaac actcaaacgt 3720 tttattgggt tttgctgatg ccgattttgc caacgacatc gatgatcgaa aatcaatttc 3780 tggaaattcg tttcttgttt tcggcaatct tgtgtcttgg tcaaccaaaa gacaacaaac 3840 ggttagtttg tcgtcaacag aagctgaatt aatttcactc tgtcaagctg ctaaagaagg 3900 attgtggtta acaaaccttc tgaatgaact gggcatagaa agttctccat tcataatcat 3960 ggaagacaac attccatgca taaacttcac tgaagaacct cgaagtcacc aacgaatgaa 4020 acaccttgat ttgaagttca tgttcatccg ggaactcgtc aggagcagga agctaaagat 4080 agaatatatc tcgacggtgg atcaaccagc tgatgcattc accaaaggac ttccagctgc 4140 tcaacatagg aaactgctga acatccttaa tgtccggatt gaggggaaa 4189 // ID Gypsy-621_AA-I repbase; DNA; INV; 5159 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-621_AA_; KW Gypsy-621_AA-LTR; Ty3_gypsy_Ele43; Gypsy-621_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5159 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3937-4404] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 358..1473 FT /product="Gypsy-621_AA-I_1p" FT /translation="MRLSAGTDQVSESGFNQAGSSSSTMRVDSSMLSTMNN FT WTLGTLNIPECAPSSGETEIDKQAFEYWKDILVSSLQLINAVDEQTKFGVF FT KIKSGPKLREIFQATTSSPGMPDERTEPFSNAIARLDEYYGSRTYTLSQRG FT KLMMMSQMDSESSINFVRRVGTAAKLCSYGPDEEMEAVVRVLTKNANDPRV FT RVLAHRNWVKQGSMKDLIDLVRDREIEKSNEEEFQRTRQTLRVSAISQERS FT EFRGHRESFNSNWHPRGNYRGNRRYGRGGRGAVRGNLRQQAATNCWRCGSA FT FHRAPACFAIDKECRVCGRLGHIARACSARGSSNQFENQTRGLKRIADQED FT SSVEKKIAAIEDTKAEPFDQKVREEDDLE" FT CDS 1629..3848 FT /product="Gypsy-621_AA-I_3p" FT /translation="MYATLQKSRFGSMKSKHAQPSMNALQGNQLINDNGAI FT IEAKVAGVRVQFLIDSGAHVNTITKTTFDEIMSEKNAASRIVGMNFKTDKP FT LKAYATNAQITVIANFFAELYISEDRPVMVEKFYVVDEKRALLGFNTAIRY FT CVLDIGLDVPIRHHRLMSSRCEANSINYAQQVSLSEQFPKFNIPAVSLKYD FT KEMPPARNVYTHIPAAFKEATKQKLKDLLSTGIIEEVTSDMDRSFCSSLLV FT VPKGKEDIRLVIDLRGPNRCIYRTPFKMPTFESIIMDLHGAKYFSTIDLTS FT AFHHIELNEDSRHLTNFFSGDGLYRYCRLPFGLTNAPDIFQEVLQTVVLAG FT CEGVVNYLDDVMIFGSTKEEHDENLARVLECFKNHNVMINQAKCAFGKRSV FT DFLGFVVSDKGWKIEDEKISAIQNFRTPETLAEVKSFLGLINFVEKFIPQR FT ADKTRRLRELAKADVFYWNQELEDEFEFVRTKAWKAIQTLGYFKREDQTEL FT YVDASPYGLGAVLVQFDENSKPRVIACASKSLTDTERKYPQTQKEALAMVW FT GVERFSMYLMSINFIVRTDAESNEFIFGGLHRIGKRAVTRAESWALRLQPY FT NFKVCSIPGTMNVADALSRLVQQAQSTESFDEEADEKHLLFYIDAGAMEIC FT WDEIELLSEDDDELTRVRTSIETDQWETGLKKYESEAKGLTTLGCMVFKGD FT KIVLPDALRTKAIRIGSSGSHGRWFNQKDNTAAFLVARHE" FT CDS 3790..4983 FT /product="Gypsy-621_AA-I_2p" FT /translation="MGVGSTKRIIRQHFWWPGMSRAVENYIKNCETCLLLS FT KKNPPVPLTSRDLPNGPWEILQIDFFSNNEFGFGEFLVVIDIYSRYLHVLE FT MRRIDADSTIEALNKIFAVWGYPLTLQSDNGPPFQSDKFVETWENRGVKIR FT KSIPFSPQSNGAVERQNEGIKKALAASKLDNMNWKIALNNYVHMHNKVRPL FT SRLGVTPFELLVGWKHRGTFPGLWKIDSQSEVDREDVREKDALSKLISKNH FT ADFRRGAKHSDLSVGDKVVLSQIKRNKSDPTFGSEKFTIIARDGSKLIVQS FT DRGVIYSRNVADAKRAIDHITDEQTSQSQDASTSTQITLASKFFFLIFSLA FT HDLVLTINVYVDEPQPDPKRNTVCPIDGNRILNETNETGCTRNGKVQLNAL FT GDVF" XX SQ Sequence 5159 BP; 1639 A; 1005 C; 1199 G; 1316 T; 0 other; atggtgcggt ggacgaatca gctcccccaa agtaagtttt atctaaaaaa aaaaaaaagg 60 cgaactctga tggagatagg cagagtttcg cgtaatatag aaggggctcc ggtggagata 120 gacggagcac ccatgaagtt taatagctaa agcatgctct ggtggagata gacagagtat 180 ttgcaagagg agctccgatg gaaatagacg gggctcatat aatgttttta ttaaagattt 240 attctctgac agaatattgt tttctgcttc gtttttcaca ggaaaatctc cagaaaacgt 300 attctacgag aactgaaaga agctgtcgaa aggaatacta cattgcgaga acaaattatg 360 cgactttctg ctggtacgga tcaagtttcc gagagtggat ttaaccaagc tggctctagt 420 tccagcacca tgcgagtaga ctcgtcgatg ttgtccacaa tgaacaattg gacactcggc 480 acgctcaaca taccggaatg tgctccgtct tcaggcgaaa cggaaattga taaacaagca 540 tttgaatatt ggaaggacat tcttgtttct tctctgcagc ttattaacgc agttgatgaa 600 caaaccaaat ttggtgtgtt caagataaaa tcaggaccaa aactgcgaga aatattccaa 660 gcgactacct catctccggg aatgcctgat gagcggactg agccattttc aaatgcaatc 720 gctcgtctcg atgaatacta cggatcaaga acttacacct tatcccagcg tggtaaactc 780 atgatgatga gccaaatgga ttcggaaagt agcataaatt tcgttcggcg cgtaggaaca 840 gctgcgaaat tgtgtagtta cgggccagat gaggagatgg aggctgtggt acgagtactt 900 acgaaaaacg ctaatgaccc acgagtacga gtactagcgc accgcaactg ggtaaaacaa 960 ggctcaatga aggacctcat tgatcttgtc cgtgatcggg agatcgagaa atcaaatgaa 1020 gaggaatttc aaagaactcg tcagactttg agagtatcgg caatttcgca agaacgatca 1080 gagtttcgcg gacatcgtga atctttcaac tctaactggc atcccagagg aaactatcgt 1140 ggcaaccgtc gctatggaag aggtggtcgt ggagcagtaa gaggaaatct gcgtcaacaa 1200 gcggctacaa attgctggcg ttgcggcagc gctttccatc gggctcctgc ttgcttcgcc 1260 atcgataagg aatgtcgtgt ttgcggacgc cttggccata ttgcgagagc ttgttcagct 1320 cgtggctcgt ctaatcagtt tgaaaaccaa accagaggac tgaaacggat tgcggaccag 1380 gaagattcaa gtgttgagaa aaagattgct gccatcgagg atacaaaagc tgagcctttc 1440 gatcaaaagg tacgcgaaga agatgacctt gaatgagcaa taaaattaat aattccttga 1500 tattccttta acatgtatgt gctgtatcta ataagaactt tttgctttgt atgaacattt 1560 gaattgaatt ttctattgtt aacattgact gatttttata ataaactgat tggaaactta 1620 ataaaattat gtacgctaca ttacagaaat cacgtttcgg atcaatgaaa tcaaagcacg 1680 ctcagccatc aatgaatgca ttgcaaggaa accaattaat aaacgataat ggggcaataa 1740 ttgaggccaa agttgcaggt gtgcgggttc agtttctaat cgactctggt gcccatgtaa 1800 atacgataac aaagacgaca tttgatgaaa tcatgtctga aaaaaacgct gcatctcgga 1860 ttgtaggcat gaatttcaag accgacaaac ccttgaaagc atatgccacg aatgcccaaa 1920 ttacagtaat cgccaatttc tttgctgaac tgtatatctc agaggacaga ccagttatgg 1980 ttgaaaagtt ttatgtagtg gacgagaaaa gagcacttct cggcttcaac actgcaatca 2040 gatattgcgt tcttgacatt ggcttagatg ttcctattcg ccatcatcga ctgatgtcca 2100 gcagatgtga agcaaactcg atcaattacg cgcaacaagt atcactttct gaacagtttc 2160 cgaaattcaa tatacctgct gtctcgctaa aatacgacaa agaaatgcct ccagcccgaa 2220 atgtctacac gcatataccg gcggcattca aagaagcaac aaaacagaaa ctaaaagatc 2280 tgttatcaac tggtatcatt gaggaagtaa caagtgatat ggaccgatcc ttttgctctt 2340 cgctattggt agtacctaaa ggcaaagaag atattcggct agtcattgac ctacgtggac 2400 cgaatcgttg tatctaccgc acgcccttta aaatgcctac tttcgaatcg attattatgg 2460 atctgcatgg tgctaagtat ttttctacaa tagatcttac aagcgcgttt caccatattg 2520 agctaaatga agactcacgt cacctcacaa actttttctc tggagatgga ctgtacaggt 2580 actgcaggct accttttggc ctcactaacg cccccgatat attccaggag gtgctacaga 2640 cagtcgtcct tgctggatgc gaaggggtcg ttaattatct agatgatgtc atgatcttcg 2700 ggagcacaaa ggaagagcac gacgaaaatt tggctcgagt attggagtgc ttcaagaatc 2760 acaatgtaat gattaatcaa gcaaaatgcg catttggaaa acgttccgtt gattttttgg 2820 gattcgtagt gtccgataaa ggatggaaga tagaagacga gaaaatctca gcgatacaga 2880 acttcagaac cccggaaact ttagctgagg ttaaaagctt cctaggtcta attaactttg 2940 tggaaaagtt tataccgcaa agggcagaca agactcggag acttcgtgag ctggccaaag 3000 cagatgtatt ttactggaat caagaattgg aggatgagtt tgaatttgtc aggacaaaag 3060 cgtggaaagc aattcaaacc ttgggttact tcaaaagaga agaccagacg gaactttacg 3120 tagatgcatc tccttatggc ttgggagccg tgctggtaca gtttgacgaa aattctaagc 3180 ctagggtgat agcatgtgct tccaaaagtt tgacggatac ggaacgaaaa tacccccaga 3240 cccaaaagga agcattagca atggtctggg gagtagaaag attttccatg tacctcatga 3300 gtataaactt cattgtgagg acagatgcag agtcaaatga gtttattttt ggtggactgc 3360 ataggatcgg caaaagggcg gtaacacgtg ctgagtcatg ggcgttacgg ttacaaccgt 3420 ataatttcaa ggtatgcagc attccaggga ccatgaatgt agcagatgcg ctatcaagat 3480 tggttcaaca ggcacaatca actgagtcgt ttgatgaaga agcggatgag aaacatttac 3540 tgttctacat tgatgctggt gcaatggaaa tatgttggga tgaaatagaa ctcctatcag 3600 aggatgacga tgaattgaca agagttagga cgtcgattga aacagatcag tgggagactg 3660 gactcaaaaa gtacgaatca gaggcaaaag gactaacaac acttggatgc atggtgttta 3720 aaggtgataa aattgtgctc cctgatgcct taaggactaa ggcaatacga atcggctcat 3780 cagggtcaca tgggcgttgg ttcaaccaaa aggataatac ggcagcattt ctggtggcca 3840 ggcatgagta gagctgtaga aaactacatt aaaaattgtg agacgtgcct tttgctatcg 3900 aaaaagaacc cgccagttcc tctcaccagc agggatctcc caaacggtcc atgggagatt 3960 ctacaaattg atttttttag taacaacgag tttggattcg gagagtttct tgtggtaata 4020 gatatctact cacgttatct gcacgttttg gaaatgcgcc gcattgatgc agactccact 4080 atcgaggcat tgaacaaaat atttgctgtt tggggttacc cgctaaccct gcaaagcgac 4140 aacggtcctc ccttccaaag cgacaaattt gtggagacct gggagaacag aggcgtcaag 4200 attcggaagt caattccctt cagcccacaa tccaatggcg ctgtcgaacg ccaaaacgaa 4260 ggcataaaga aagcactggc agcctctaaa ttggataata tgaattggaa aattgctttg 4320 aataattatg ttcatatgca caataaagtg cggcccctat caagactcgg cgtaacgcca 4380 tttgagttac tcgtgggctg gaaacataga ggaacgtttc ctggcctgtg gaaaattgat 4440 agtcaatcgg aagtagatcg tgaagacgtc agagagaaag atgctttgtc caaacttatc 4500 agtaagaatc atgcggattt tcgaagagga gctaaacact ccgatctctc ggttggagac 4560 aaagtagttc tgtcacaaat caagcgaaac aaatcagatc ccactttcgg atctgaaaag 4620 tttacgatca tagcgagaga tggatctaaa ttgatcgttc aaagcgaccg gggagtaata 4680 tactcaagaa acgtagctga tgcaaagaga gcaattgacc atataactga tgagcaaact 4740 tcgcaaagcc aggatgcatc aacaagtacc cagataactc ttgccagtaa gttttttttt 4800 ttaatttttt ctctagctca tgatcttgta ttaacgataa atgtttatgt agatgaaccg 4860 cagccagacc ctaaacggaa cacagtctgt cccatagacg gaaaccgtat attgaacgaa 4920 actaacgaaa caggatgcac gagaaatggt aaggttcaat tgaatgcgtt aggcgatgtt 4980 ttctaataat aatgtatttt ctacacagat atctctccac ccaaacaaca aacgcagaaa 5040 cgagccgggt atggtgattc cactagacaa cgttcaagac gagaaataag gattccggag 5100 aagctgaaag atatggtttt gtataacatt tatgaatgaa gagtagaggc gggatgaat 5159 // ID Gypsy-175_AA-LTR repbase; DNA; INV; 179 BP. XX AC AAGE02026183; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-175_AA_; KW Gypsy-175_AA-I; Gypsy-175_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026183; Positions 52795 52973. XX SQ Sequence 179 BP; 49 A; 31 C; 37 G; 62 T; 0 other; tgttatatct gtgaattcga cctaaacgcc gattcctgtg gtctgagtat tcatttcatt 60 gtgagtgcct tttgggctat cgaatagtat acaggaataa atgagtttag ttttgactcg 120 agccgtaagc tacaagcaac ggtgttctat atttgatatc tgaaacgcaa tacatttca 179 // ID Vingi-2_BF repbase; DNA; INV; 2855 BP. XX AC . XX DT 28-JUL-2009 (Rel. 14.07, Created) DT 17-AUG-2010 (Rel. 15.09, Last updated, Version 2) XX DE Amphioxus Vingi-2_BF autonomous non-LTR Retrotransposon - DE consensus. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; INGI; KW I group; Ingi-2_BF; Vingi-2_BF. XX NM Ingi-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2855 RA Kapitonov V.V. and Jurka J.; RT "New families of I non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1534-1534 (2009). XX RN [2] RP 1-2855 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX CC Originally classified as Ingi [1] and re-classified as Vingi [2]. CC [1] Ingi-2_BF is a consensus sequence of the young Ingi-2_BF CC family of non-LTR retrotransposons that belongs likely to the CC Ingi clade from the I group (based on the RT domain phylogeny). CC However, Ingi-2_BF, analogously to other Ingi-like elements from CC lancelet, sea urchin, sea slug and middle-African hedgehog, CC including Ingi-1_BF, I-1_AC, I-2_AC, I-1_AAl, Jockey1_SP and CC Jockey2_SP, does not code for the ribonuclease H domain. All CC known Ingi-like non-LTR retrotransposons contain only one ORF. CC [2] All related non-LTR retrotransposons described above are CC re-classified as Vingi. XX FH Key Location/Qualifiers FT CDS 61..2844 FT /product="Vingi-2_BF_1p" FT /note="APE and RT domains." FT /translation="MDLAIYHPSPVHGSAVYVRDKSIIVKTKDLSAHGIEI FT LMVETLDTKVFSVYKPPPTAFVWPQECDLTGKACLVLGDFNSHSTRWGYQD FT TNPDGEAVEEWASNNDLSLLHKAKDKPSFISGRWRRGYNPDLVFVPSRLTQ FT NFEKTVENPIPRSQHRPITVNTKPVIRPVKPRNPIPRFNFRKANWESFTME FT LDDRIQTLEPHPNIYDQFQSLVWEISKRHIPRGCRKSYIPCLSEDSKDLYE FT EYVKAYEVDPFAEETIAMGESVLASVSEERRERWKELITNVDMTQNSKKAW FT STIKKLNSDKNPQASLAAVTPNEVATQLLLNGKPLNKERGHRKKMKEEMEN FT VMLESEDEFADFTISELEDAIKLLKPGKAAGLDGITTEVIRNFGPKAKAWL FT LQMFNTCATTLSIPKVWRKAKVVALLKPNKDPSNAKSYRPVSLLCILFKLY FT ERMILARIGPIIDELLSPDQAGFRPGRSCCGQVLNITQHIEDGFETGKVTG FT AVFVDLTAAYDTVNHRLLLLKLAKMVRNTSTVRIIQSLLENRRFFVEMDGK FT RSRWRSQTNGLPQGSVLAPTLFNVYTNDLPQFNNIRRFIYADDLGLTTQHK FT SFEVIERRLTAALNSLSVYYKNNFLNANPSKTQVCAFHLNNHQAKRQLNIV FT WNGKKLANEKFPVYLGVTMDRTLSYREHVRKLKEKVASRNNLLNNLTTLNW FT GADANTLRSTALALCYSTAEYCSPAWERSCHASKVDAELNTSCRIITGALR FT PTPLPALYRLAGIAPPPIRREALSRTQKFVQEQDSRHPLHYYHDVRRRLKS FT RKSFMTIQSLNPKEAAKYRVDRWKASDTLLPNEALQTPCESLPHGTHLPRR FT EWVDLNRARSGVGRTRDNLLKWGRVNSAECRCGHPTQTMGHITKDCVLGPS FT ISNQDLLEANEKAVAWLQYWHGTL" XX SQ Sequence 2855 BP; 849 A; 727 C; 660 G; 619 T; 0 other; gcagatattc tgtgcattca agaaactcac aaggaagctg tgccatataa aatccctggc 60 atggatctgg cgatctacca tccaagccct gtacatggca gtgcagtcta tgtgcgggat 120 aagtccataa ttgttaaaac caaggacctg tctgcccatg gtattgaaat actgatggtg 180 gaaaccttag acaccaaggt gttctcagtg tacaaacccc ctccaactgc atttgtttgg 240 cctcaggaat gtgacctaac tggcaaagcg tgcctagtcc taggggactt caacagccac 300 agcactcgct ggggctatca agacaccaac ccagacggtg aggcggttga agaatgggca 360 tcaaacaatg atctgtcctt actacacaag gcgaaagaca aaccttcctt tataagtgga 420 cgctggcgtc gtgggtacaa cccagacctg gtgtttgtgc cttccagact gacacaaaat 480 tttgaaaaaa cagtggaaaa ccctatacca aggtcacaac acaggccaat cacagtcaac 540 accaagccgg tgatccgacc tgtaaaacca aggaacccca tacctaggtt taactttcgt 600 aaagcaaatt gggagagttt cactatggaa cttgacgata gaatccagac gttggaacct 660 catcccaaca tctatgatca gttccaaagc cttgtatggg aaatatcaaa gagacatatt 720 ccaagaggat gtcgcaagtc ttatatacct tgcttaagtg aagacagcaa agatctctac 780 gaggaatatg taaaagcata cgaagtcgac ccatttgcag aggaaacgat agcaatggga 840 gaatctgtgc tggcttcagt ttctgaagag cggagagaaa gatggaaaga gcttatcact 900 aatgtggata tgactcagaa cagcaagaaa gcctggtcca caatcaagaa gttaaactct 960 gacaagaatc cacaggcaag tctagctgcc gtgactccta atgaagtggc tacgcagcta 1020 ctgcttaatg gcaaacctct caacaaagaa agaggtcatc ggaagaagat gaaagaggag 1080 atggagaatg tgatgttgga aagcgaggac gagttcgcag atttcaccat ctctgaactg 1140 gaagatgcca tcaaacttct gaaacccggg aaagctgctg gccttgatgg tatcacgact 1200 gaagtcattc ggaactttgg acctaaagct aaagcctggc tactacaaat gttcaatacc 1260 tgtgccacca ccctttccat ccctaaggtt tggcggaaag caaaagtggt cgctctcctg 1320 aaacccaaca aggacccttc caacgccaaa agctaccgac ctgtgtcatt attgtgtatc 1380 ctcttcaagt tgtatgagag aatgatactt gcgcgtattg gtcccataat agatgagcta 1440 ctatctcccg accaagcggg tttcagacca ggaaggtcct gctgtggcca agtcctgaac 1500 attacccagc acatcgagga tggttttgaa accgggaagg tcacaggggc ggtttttgtt 1560 gacctgactg ctgcatacga taccgtcaac catagactgc tcttactaaa actggcaaaa 1620 atggttcgga acacaagcac tgtcaggatc atccagtcgc tacttgaaaa ccgacggttc 1680 ttcgttgaga tggatgggaa aaggagccgc tggcgttctc agacgaatgg cctcccacaa 1740 ggctcggtgc ttgcaccaac cctgttcaac gtctacacca acgaccttcc ccagttcaac 1800 aacatccgcc ggtttatcta cgctgacgac ctaggtctaa ccacccagca caagtctttc 1860 gaggtcatcg agcgcagact gaccgctgct ctgaacagtc tgtcagtcta ctacaagaac 1920 aatttcttaa atgcgaaccc tagcaagact caagtgtgtg cattccattt gaataaccac 1980 caggccaaac gacagctgaa catagtctgg aatggtaaga aactagccaa cgaaaagttc 2040 ccggtttacc ttggagtaac catggacaga actttgtctt atagagagca tgtcaggaaa 2100 ctcaaggaga aggttgcttc caggaacaac ctcctcaaca acctcacaac tctgaactgg 2160 ggagctgatg caaacacctt acgatctact gccttggcgt tatgttattc cacagcagaa 2220 tactgttctc ctgcctggga aaggtcatgc catgccagta aagtggatgc tgagctcaac 2280 acttcttgtc gcatcataac aggagcgctg aggcccacac ccctgccagc actgtaccgg 2340 ctagcaggta tcgcaccacc tcccatccga cgggaagctc tctccagaac tcagaaattt 2400 gtacaagagc aggactctcg acaccctcta cactactacc atgatgttag aagaagactc 2460 aaatctagaa agagcttcat gaccatacag agtctgaacc ctaaggaggc tgccaagtac 2520 agagtcgatc gctggaaagc ttccgacacc ctcctaccaa atgaagctct ccagacccct 2580 tgtgagtccc ttcctcatgg cacccatcta cccagaaggg agtgggtgga cctgaacaga 2640 gcaaggtccg gagtaggtag gaccagagac aacctactga aatggggacg ggtgaactca 2700 gctgagtgtc gttgtggcca cccaacccaa accatgggac acattacgaa ggactgtgta 2760 ttaggcccca gcatctccaa ccaggaccta ctagaagcta acgagaaggc cgtggcctgg 2820 ctgcagtact ggcacggaac gctatgatga tgatg 2855 // ID Copia-35_CQ-I repbase; DNA; INV; 4025 BP. XX AC AAWU01005677; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-35_CQ_; KW Copia-35_CQ-LTR; Copia-35_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4025 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 373-373 (2011). XX DR GenBank; AAWU01005677; Positions 9090 5066. XX CC Positions [1376-1879] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 17..4024 FT /product="Copia-35_CQ-I_1p" FT /translation="MGTMKDGIPLLTGQNFENWSYRVQLHFEGAELAEVFT FT ADVPAAAAGNQDREKFLKLDRKAKTQLVGFISEECLGIVKGKETTKLMWQA FT LEGHFLKKSVASQTIIRMQLARLRMKEGTGMRSHFQCFDDLVRQLKSVGAK FT LEEGDLIAQLFLSLPESFDPLVTALENLDEKDISLELCKQRLLAEEAKRIN FT RMEESPEVTTAAFVGQKKKFSGKCRRCGKKGHMAKDCRVKLSEGQANAVVK FT ERPVSFMVNQGQPGRDSTKVVFYVDSGCSDHLVNNLACLKAVRKLKHPFIV FT DVAKDGVSLTGSYEGVLEGVTMAGVPVEMKEVVFLPELRGNLMSVKKLSKA FT GVDVLFTGSRGQEKAVMKFNGAVIAVAYLRNNLYELELEVNMSRGTANLCA FT AELGKLWHRRLGHASQQAMDNLVKHEMVTGINTPLGAVGFCDTCVLGKQAR FT EPFSGRREPVTRPLERIHSDVCGPIDPPGWDGSRYFVSFIDDYTHFAMVYT FT IKKKSEVFESFKEYEALVTTQLEKKICKMTVDQGREYCSNQQLDYYKRRGI FT QLQPTVAYSPQQNGVSERFNRTVVEKSRTMMIDAKVPKNLWPEAVQTATYI FT LNRSSTAAIVEKETPVERWTNIKPDLEKLRVFGCQAFAWIPNQQRKKLDPK FT SRETVMIGYAPNGYRLWDQRDRKVVIARDVKFNEAHFPFAEDGDAQSAPPL FT MIQVQHDVPEGDQRVADFDVPVQDAAEVEVPPIGNEADDDTDDAEFCDADG FT GAYPADNDAGGAGQFSSEEDNPGALPSQQRGDGFSESNTRRSDRERKFPGK FT FLDFLTYGAHCAVKNSELADPPLTFDEINTRADGHLWKAAVDDELRSLDAN FT AVWKLVKCPSGIKPLKSKWVLRVKEDESGNPVRYKARLVVKGFLQKAGLDY FT HETYAPVAKLVTIRTALAVGVQRRCYFHQMDVKTAFLHGKLQETIFMAVPD FT GVQAEPGTACQLLKSLYGLKQSPRCWNERFNQVLVQLGFTRSKYDYCLYTR FT FNERGNDAVILVLYVDDLLIIGKRQETIQKLKMDLAQTFEMTDCGEVKFFL FT GMKIDYNWQQGRMKLSQAASIDKVLKNFGMENCNPTKTPMEKGLMLERQVT FT EAAEEPYRELLGSLMYMMMSTRPDICFSVGYLGRFQQQPGQQHWTALKRVV FT RYMKGTKQMNLEINRCEDANCLLGFADADWATDTEDRKSVSGFLFQVYGNT FT VSWSSKKQTTVATSSSEAEYVALSAAASEAIWLRGLLGDLGEAITGPVTIF FT EDNHGCIGMAKNLESKRAKHIDIKHHFIRDHVADGKLNIQPIGTTNQLADI FT FTKALDPGRFGDLRRQLGLRDREG" XX SQ Sequence 4025 BP; 1046 A; 873 C; 1188 G; 918 T; 0 other; agaattcctt taggttatgg gcactatgaa ggacggaatc ccgctgctga ccggccagaa 60 ttttgaaaac tggagctatc gcgtgcagct gcactttgaa ggggcggagt tggcagaagt 120 tttcacggcc gacgttcctg ctgctgcagc tggaaatcaa gatcgcgaga agtttcttaa 180 gttggaccgg aaggccaaga cgcagcttgt tggattcata tccgaagaat gcctcgggat 240 tgtcaagggt aaggagacga cgaagctgat gtggcaagcg ttggaaggcc acttcctgaa 300 gaagtccgtg gctagccaga ccatcatccg gatgcagctg gccagattgc ggatgaagga 360 ggggaccggg atgcggagcc acttccagtg tttcgatgac ctcgttcggc agctcaagtc 420 cgttggagca aagctggagg agggtgatct catagcccag ttgttcctgt cgttgccgga 480 gagtttcgac cctctggtta ctgctttgga gaacctggac gagaaggaca tctccctgga 540 attgtgtaag caacggttgc ttgctgagga ggctaagcgg atcaaccgta tggaagagtc 600 gccggaagtc acgactgcag cgtttgttgg acagaagaaa aagttcagcg gtaagtgtcg 660 tcggtgcggc aagaagggcc acatggccaa ggactgccgg gtgaagctgt ctgaaggtca 720 agcgaatgca gttgtcaagg agagaccggt gtctttcatg gtcaaccagg gtcagccagg 780 aagagactct acgaaggttg tgttctacgt ggattctgga tgcagtgacc atctggtgaa 840 caacctggcc tgtttgaagg ctgttcggaa gttgaagcac ccgtttattg tggacgtcgc 900 caaggatggc gtttcgttga ccggttctta cgaaggtgta ctggagggcg tcaccatggc 960 gggagttcct gtggagatga aagaagtcgt cttcctgccg gagctacgcg ggaacctgat 1020 gtctgtgaag aagctgtcga aggctggtgt tgacgttctt ttcaccggta gccgtggaca 1080 ggagaaggca gtcatgaagt ttaacggtgc tgtgattgct gtggcgtatc tacggaacaa 1140 tctctacgaa ttggaactag aagtgaatat gtcaagaggt acagcgaacc tgtgtgctgc 1200 cgagttgggt aaactgtggc atcgcagatt gggtcatgca agtcaacaag cgatggacaa 1260 tctggtgaaa cacgaaatgg tcaccgggat aaatacgccg ctaggagcag tcggattctg 1320 tgacacttgt gtgttgggta agcaagcccg tgagccgttc agtggtagac gtgaacctgt 1380 tacgaggcca ctggaacgta tacactcgga tgtgtgtgga ccgattgatc cgccaggatg 1440 ggatggatct cgttatttcg tgtcgttcat cgatgattac acacattttg cgatggttta 1500 cactatcaag aagaagtcgg aagttttcga gagtttcaag gagtacgagg ccctcgtgac 1560 gacccagttg gagaagaaga tatgcaagat gacggtggac caaggtcgcg agtattgttc 1620 gaaccagcag ttggattatt acaaacgtcg tggcattcaa ctccagccga cagtggcata 1680 ctccccgcag cagaatggtg tctcggaacg attcaacagg actgttgtgg agaagtctcg 1740 aacgatgatg attgacgcga aggtaccgaa gaatttgtgg ccagaagcgg tgcaaaccgc 1800 aacgtatatc ttgaacagaa gttcaacggc tgcgattgtt gagaaggaaa ctcctgtaga 1860 acgttggacc aacatcaagc cggatctcga gaagctgcgt gtttttggat gtcaagcctt 1920 tgcgtggatt ccaaatcagc agaggaagaa gctggacccc aagagccgag agacagtgat 1980 gattgggtac gcgcccaacg ggtaccgttt gtgggaccaa agagatcgta aagttgtcat 2040 tgcacgagac gtgaagttca acgaagcaca tttcccgttt gctgaggatg gagacgccca 2100 atctgcaccc ccgttgatga tacaagttca acatgatgtg ccagaggggg atcagcgtgt 2160 agctgatttt gatgttccag tccaggatgc ggctgaggtt gaagtacctc cgattggaaa 2220 cgaagctgat gacgacacgg atgatgctga attctgcgac gctgatggag gtgcatatcc 2280 tgccgacaac gacgcgggcg gtgccggcca gtttagttcc gaagaggata atcctggggc 2340 gctcccatcg caacaaagag gtgacggctt ttcagagtca aacactaggc gcagcgatag 2400 ggagcgcaag ttcccaggta agttcctaga ttttctcact tatggagcac actgcgctgt 2460 taagaattcc gaacttgcag atccaccgct tacatttgac gaaatcaata cccgtgctga 2520 tggtcatttg tggaaagccg cagtggacga tgaactacga tcccttgatg ccaatgctgt 2580 ttggaagcta gtgaaatgtc cttctggtat caagcccctg aaatcaaaat gggttctgcg 2640 agtcaaggag gacgaaagcg gtaatcctgt gcgctacaag gcaagactcg tcgtgaaggg 2700 attcctgcag aaggctggtc tggactatca cgagacgtat gcaccggtag caaagctggt 2760 caccattcgg acggcactgg ctgttggcgt gcagcgaaga tgctatttcc atcaaatgga 2820 tgtgaagacg gcctttttgc acggaaaact gcaggaaact attttcatgg ctgttccgga 2880 tggcgtgcaa gcagaaccag ggacagcgtg tcaactttta aagtcattgt acggattgaa 2940 gcaatctcca cggtgctgga acgagcggtt caatcaagtt ttggttcagc ttggatttac 3000 ccggtctaaa tacgattatt gtctctacac acggtttaac gagaggggga atgacgctgt 3060 tatccttgtg ctgtatgttg acgacctgct aatcatcggt aaacggcaag agacaattca 3120 aaagctgaaa atggacttgg cgcaaacctt cgagatgacg gactgcggag aagtcaagtt 3180 tttcctaggg atgaagatcg actacaactg gcagcaaggt cgaatgaagc tgtcccaagc 3240 agctagcatc gataaggtcc tgaagaactt tggaatggaa aattgtaacc cgacgaaaac 3300 tcccatggaa aaaggactga tgcttgagcg acaagttacg gaagctgccg aagaacctta 3360 ccgtgagctt ctgggtagtc tcatgtacat gatgatgtct accagaccgg atatttgctt 3420 ttcggttgga tacttgggac gattccaaca acaaccagga cagcagcact ggacggcgct 3480 gaaacgtgtt gtacgataca tgaaaggtac caagcagatg aacctggaga tcaaccgttg 3540 tgaggacgcc aactgcttgc tcgggtttgc tgacgcggac tgggcaacag acactgaaga 3600 ccggaagtca gtcagcggat tcctgttcca agtatacggt aacacggtct cgtggtccag 3660 taagaaacag acgactgtgg ccacatcttc gagtgaagct gagtacgttg ccttgagtgc 3720 tgctgcatcg gaagcaatct ggttgcgtgg tcttttggga gatcttggag aagccatcac 3780 tggaccggta acaatttttg aagataacca cggatgcatc gggatggcga agaatttgga 3840 gtcaaagcga gcgaagcata ttgacatcaa acatcacttc atccgggacc acgttgctga 3900 cgggaagctc aacattcaac ctatcgggac aaccaaccag ctagcagaca tcttcacgaa 3960 ggcgttggac cctggacgtt ttggagacct gcgacgacaa ctaggactgc gcgatcgaga 4020 ggggg 4025 // ID BEL-2_CQ-I repbase; DNA; INV; 6449 BP. XX AC AAWU01001625; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_CQ_; KW BEL-2_CQ-LTR; BEL-2_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6449 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 157-157 (2011). XX DR Genome; AAWU01001625; Positions 2980 9428. XX CC Positions [5496-6056] - Integrase core CC 'GATTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 33..6449 FT /product="BEL-2_CQ-I_1p" FT /translation="MPPKMDPNRPRLRVSKRKRNCFECEQLDTKEMVQCDT FT CDSWYHYECAGVTSAIIDVDWDCKWCGENNETIPRDPENSIAPQLPYLLEP FT EPPHQKQVSEDQRSVRSVGVGSMAASSSGRSRSARMRELELKRLEEDYRLR FT RNYLDKRFAIMRAYDCEESDDDSLVEKGSQVAEWIQKTDQIGSRNKPGSAL FT PAFPEQPPSNSQPAEQQASGQGQQAGAFQNPPAVLNQSSIIRASQTPLGHL FT IGNTRDLTLEERPPRYLFHSTGYNGVPGFQVANTTEPQTNPQLATSAAQGY FT TQLAGHGGVLFQAPIVTSNPPAPSRGLPIARNQLLEPAIQRSNPPIGQRTY FT PDWSRIQASTNGNRQIIPETRSTPNPPCYQSQINQQPNLPNNTLDLAGQGL FT RQEPSSFQYSERASDQIQQGLMGTTQVRPNVLGAMSGQANGRTAYTGTNSQ FT QAGLLNQGGNRNAVSDRLVEEPFQENPPRYLNESLDNRGDETVYLLSRSQM FT AARQAVPKDLPDFSGLPHEWPLFYAMYNSSTQMCGFSNEENMLRLRKCLKG FT RALTQVGSELLHPNSVASVMSTLKMLYGKPETIIQAVIENVRGLPSPRVEK FT LETLVDFALSVKNLCATIQACEIGDYMYSSSLRYELVQRLPPTLQLEWAKA FT SRGILTPTLFQFNTWLYHMAEDASAVMRAQIGSYKTDSKPRHKKESLLHVH FT VESDDGEIADAQTGSNTLPTCPVCKGKCSLTAKCPRFVERSVDARWETIRE FT LKLCRKCLQQHNNFCKQKRQCGVDGCTYLHHPLLHANKKAALTSTVTSPPA FT ECENSNVNVHQERTSEVLFRFVPVVLYGPSKEVHTYAFIDDGSELTLLEQG FT LADELGVSGETNPLCLKWTGDTKRIEPKSQKVNLEVSGTSATSKKFSISNA FT RTVDKLKLRPQTLVVDDLRNRFHHLQGLPIESYRDAAPRILIGVDHAKIGH FT VRKSREGKTKEPIAIKTCLGWCVYGCIATNLSGTSTVNHHALDICQCNQEN FT GDYLHTAMKEYFSLDSMGVLTPERLLMSTEDERSLQLLKNLTRRRGDRYES FT GLLWKYDGVRLPNSRAMALKRWECLERRMLNDPTLGEALNSKIADYLDKGY FT IRRLTDAELVIERPRVWFLPVFPVVNPNKPAKTRLVWDAAAKAHDVSLNSV FT LLKGPDYLTSLLSVLIQFREHRIAVCGDLREMYHQVLMRDEDQHCQRFFWG FT RKDGTPEPNTYVMQVMTFGACCSPSTAQYVKNQHAKQYENEFPAAVRAIVD FT QHYVDDMLVSIETEEEAATLAHDVKWIHKQGGFEMRNWLSNSKTVLATLQE FT SPQAEKNFNVSDGLPTEKILGMWWDTQQDCFTFKLSTRCEEELLSGRRRPT FT KREALRMLMLMYDPCGLIAHFLMYLKVLLQEIWRSGIGWNDKIGEKEFEKW FT LLWIKVLPGVRNVRVQRCYRKHTSINAEVQLHTFVDASENGFAAVVYLRFQ FT EGENIECAIVAAKTRVAPLKFLSIPRSELQAAVLGVRLANTVQRSLNIKVS FT QRFFHTDSRDVCCWLNSDHRRYSQFVAFRVSEILESSDAKEWHWISTKQNV FT ADLGTKWKGLPDLNPNSPWFRGPSFLWKPESEWPAASRQYGSTDEELRPHL FT LAHIVNLESLLQPEEFSTWRTLKRRTAMLFRQANNWRRRASKQQPVRGPLT FT QKEHLEAEHHLYRVAQADVFADEVASLSSSVSTKKKLSKSSSIYRACPFLD FT ERNVLRVQGRTGACKYVDVDATNPVILPREHAITKLVVAQFHRNFHHQNHE FT TVINELRQHYYIPQLKAVYRKVRKDCQQCKNDGAKPCPPIMADLPESRLAA FT FTRPFTHMGVDYFGPVTVSIGRRTEKRWGVLATCLTTRAIHLQVAYSLSTD FT SCIMALRNIFSRRGTPSVIYSDRGTNFQGTAKMVEEVVRKIDQDKLVAEFT FT SNHTEWKFNPPASPHMGGAWERLVRTVKQNLNKVLGTRVVSAEVLENALNE FT VENIVNSRPLTNIPVDGDLSPVLTPNHFLVQSSNGLKPFVLYDDSSRALRN FT NYELSQILANYFWKQWVRDYLPTITRRTKWFTSTKPIEVDDIVIIADPRSP FT RNCWPKGRVIATKIAADGQVRSATVQTASGGIYERPAVSLAVLDVGVEANA FT HRMSLTHSGAE" XX SQ Sequence 6449 BP; 1788 A; 1707 C; 1615 G; 1339 T; 0 other; aattacaaat ttcgtttaac cgtacctaca cgatgccgcc gaagatggat ccaaatcgac 60 cccgcctgcg ggtttctaaa agaaaacgga actgttttga gtgcgagcaa ctagacacca 120 aggaaatggt ccagtgcgat acctgtgaca gctggtacca ctacgagtgc gcgggagtga 180 cgtcggccat catcgatgtg gattgggact gcaagtggtg tggggagaac aacgaaacca 240 tccctcgtga tcccgagaac tccattgcac ctcaactccc gtacttgctg gagccagagc 300 cgcctcacca gaaacaagtt tctgaagatc aacgctccgt ccgatctgta ggagttggct 360 ccatggccgc gtcctcgtct ggaaggagca ggtcagcaag aatgcgtgaa ctggagctga 420 aaaggctgga ggaggactat cgtctacgga gaaactatct tgacaaaaga tttgccatta 480 tgagggctta tgattgtgaa gaaagcgatg atgattcttt agttgaaaag gggtcacaag 540 tcgcagagtg gatccagaaa actgatcaga tcggtagcag aaacaaacct ggttctgcgc 600 ttccagcgtt ccctgaacag ccgccgagca actcgcagcc tgcggaacaa caagcttcag 660 gtcaaggaca gcaagctgga gctttccaaa acccaccggc agttttgaac cagtcttcaa 720 taattcgtgc atcacaaact ccgttaggcc atctcatcgg caatacaaga gaccttaccc 780 tagaggaacg tccgccgcgt tatctctttc attctaccgg ttacaatggt gtccccggtt 840 tccaagttgc gaacaccacc gaaccacaga ccaaccccca acttgccaca agtgctgctc 900 aagggtacac acaactcgct ggacatggtg gcgtgttgtt tcaagcccca atcgtcactt 960 caaacccgcc tgcaccatca agagggctcc ctatagcccg gaaccagcta cttgaacccg 1020 cgatacaacg ctcaaatccg ccgattggac agcgtaccta tccggactgg tcccgcatcc 1080 aagcctctac aaacggaaac cgccaaatca tcccagagac acgttcgaca cctaaccccc 1140 cgtgttatca gagccagatc aaccagcagc cgaaccttcc caacaacaca ctcgatctgg 1200 ccggccaggg tctcaggcag gagccaagca gtttccagta cagcgaacgt gcttcggatc 1260 aaattcagca aggactgatg ggcacgacac aagtaaggcc gaacgtcctc ggagccatgt 1320 ctggtcaagc aaacggtcgt acagcttata ccggtaccaa cagtcagcaa gcaggattac 1380 taaatcaagg agggaaccga aacgctgtta gcgatcgact tgttgaagag ccgtttcagg 1440 aaaacccccc gcgctattta aacgaatccc tggataacag gggtgacgaa acagtatacc 1500 tgctgagcag aagtcaaatg gccgcccgtc aggcagtccc gaaggatctt ccggatttta 1560 gcggcttacc ccacgaatgg ccgctgttct atgccatgta caattcctca acgcagatgt 1620 gtggattctc caatgaagaa aatatgctcc gcttgcgaaa gtgcctaaag ggaagagcac 1680 tcactcaagt aggatctgaa ctgttgcacc ctaacagtgt agcaagcgtg atgtcgaccc 1740 tcaagatgct gtacggcaaa cctgaaacaa ttatccaagc ggttattgag aacgtgagag 1800 gcctcccatc gcccagagta gagaaacttg aaactttagt tgacttcgct ctctctgtca 1860 aaaatctttg tgctacaatc caagcttgcg aaataggtga ctacatgtac agctcatcgc 1920 tgcggtacga gcttgtgcaa cgtttaccac caactctaca actagaatgg gccaaagctt 1980 cgcgcggaat tctgactccg actctgtttc aatttaacac gtggctctac cacatggccg 2040 aagatgctag cgcggtcatg agagctcaga ttggcagtta caaaactgac tcgaaaccac 2100 gccacaagaa ggagagcctt ctgcacgtgc acgtggaatc agacgatgga gagatagctg 2160 acgctcaaac tggctcaaac acactaccca cgtgccccgt ttgcaaagga aagtgctctt 2220 taactgctaa gtgtccacga ttcgttgaac gcagtgtgga tgccaggtgg gaaaccattc 2280 gtgagctgaa attgtgcaga aaatgtcttc agcagcacaa taacttctgt aagcagaaaa 2340 ggcagtgtgg agtcgacgga tgtacttatt tgcatcaccc actgctacac gccaacaaaa 2400 aggcagcctt gaccagtacg gttacctccc ctccggcaga gtgtgaaaac tctaacgtca 2460 acgtccacca ggaaaggaca agcgaggtac tgttccggtt cgttccggtg gtcttgtacg 2520 gtccgtcgaa ggaagttcat acgtacgcct tcattgatga cggctctgaa ctgacacttc 2580 tagaacaggg cttggcagac gaactaggag tatccggaga aacaaaccct ctatgcttga 2640 aatggaccgg agatacgaag aggatcgagc ccaagtcaca aaaagttaac ttggaggttt 2700 ccggaacatc cgcaacatca aagaagttct caatctccaa cgcccgtacc gtcgacaaac 2760 tcaagctgcg gccacaaacg ttggtcgtcg atgacttgcg gaatcgtttc caccacttgc 2820 aaggacttcc tatcgagtcg tatcgagatg cagcaccacg aatcctgatt ggtgtagacc 2880 acgccaagat cggacatgtt aggaagagtc gggagggtaa gaccaaggaa cccatagcga 2940 tcaaaacgtg cctgggttgg tgcgtgtacg gatgcatagc tacaaacttg tctggaacca 3000 gtaccgtcaa ccaccacgcg cttgatattt gccagtgcaa tcaagaaaac ggagattatt 3060 tacacaccgc tatgaaggaa tacttctcgc tcgacagtat gggagttctc acacctgaac 3120 ggctgctgat gtcgacagaa gatgagcgat ccctgcagtt gctgaaaaat ctaacacgca 3180 gaagaggtga ccgctacgag tctggtctac tctggaagta cgacggtgtt cgtcttccga 3240 acagtagagc tatggcgttg aaacgctggg agtgtctgga gcgccggatg ctcaacgacc 3300 caaccctagg tgaagcactc aacagcaaga ttgccgatta ccttgacaaa ggctacatcc 3360 gtcgacttac cgatgccgag ctcgtcattg aacgcccaag agtttggttc ctcccagttt 3420 ttcctgtcgt aaaccctaac aagccggcga aaaccagatt agtctgggat gcggctgcca 3480 aagctcacga tgtctcgctc aactccgtcc ttttgaaagg acccgattac ctgacgtcgc 3540 tactaagtgt gctgattcaa tttcgagagc atcggattgc cgtttgcggt gacttaaggg 3600 agatgtacca tcaggtgctc atgcgagatg aagaccagca ttgccagcgc ttcttttggg 3660 gacgaaagga cggaacaccc gaaccaaaca cctacgtcat gcaggtaatg accttcggag 3720 cctgctgctc acctagcacg gcgcaatacg tcaaaaatca acacgcaaag caatacgaaa 3780 acgagtttcc agcagccgtt cgcgcgatcg tagatcaaca ctatgtagat gacatgctcg 3840 tcagcataga aacagaggaa gaagctgcaa cactcgcaca cgacgtcaag tggattcaca 3900 aacaaggcgg ctttgagatg aggaattggc tgtcaaactc aaaaaccgtg ctggccacgc 3960 ttcaagagag tcctcaagcc gagaagaact tcaacgtcag cgatggtttg ccaacggaaa 4020 aaattctcgg catgtggtgg gatacgcaac aggactgctt tacgttcaag ctatcaacaa 4080 ggtgcgagga ggagctgctg tccggacggc gacgaccaac caaacgagag gctctccgaa 4140 tgttgatgct gatgtatgat ccttgcgggc ttatcgcgca ctttctgatg tacctgaaag 4200 ttctgctgca ggaaatttgg cggtcaggta tcggctggaa cgacaagatt ggggaaaaag 4260 agttcgagaa gtggttgctt tggattaaag ttctacccgg agtgcgaaat gtcagagttc 4320 agcgatgcta ccgaaaacac acgtccatca acgccgaagt ccagctacat acgtttgtcg 4380 acgcaagcga gaacggattt gccgccgtag tttacctacg atttcaagaa ggagaaaaca 4440 tcgaatgcgc gatcgtggcc gcaaagacga gggtggcccc actgaagttc ctgtcgattc 4500 cgcggtctga actacaagca gctgtcctcg gcgtgcggct ggcaaacaca gttcagcgtt 4560 cattgaacat taaggtgagt cagcgctttt ttcataccga ctccagagac gtgtgctgct 4620 ggttgaactc tgaccaccga aggtacagcc agttcgtcgc cttccgcgtt agcgaaatct 4680 tggagagcag cgatgcgaag gaatggcatt ggatatcaac caagcagaac gtagcagatc 4740 taggaacgaa gtggaaagga ctaccggatc tcaacccaaa cagcccctgg ttccgcggac 4800 catcgttcct gtggaagccc gaatcagaat ggcctgctgc gtcgcgacaa tacggatcaa 4860 cagacgaaga actccgcccc cacctgttgg cgcacatagt aaatcttgaa tcgctactac 4920 aaccggagga gttttcaacc tggagaacat tgaaacggcg aacggcgatg ctctttcgac 4980 aagccaacaa ttggagacgg cgcgcttcaa aacaacagcc agtgcgcgga ccgctcacac 5040 aaaaggaaca cttggaagct gaacatcatc tgtatcgcgt ggctcaagct gacgtatttg 5100 cggatgaagt agccagtctt agttcaagcg tgagtacaaa gaaaaaattg tcgaaatcca 5160 gcagtatcta tcgcgcctgt ccatttctgg acgaaagaaa tgtccttcgg gttcaaggcc 5220 gaacgggtgc ctgcaagtac gtcgacgtag atgcaaccaa cccggtgatt ctgccacgag 5280 agcacgcgat cacgaaactt gtcgtagccc aatttcatcg caactttcac catcaaaacc 5340 acgaaaccgt aattaatgag ctacggcaac actactacat cccgcagttg aaagcagttt 5400 accggaaggt gcgaaaagac tgccaacaat gcaaaaatga tggagcaaag ccctgcccac 5460 cgataatggc cgacctcccg gaatcgcgac tagctgcatt cacccggccg ttcacccaca 5520 tgggcgtgga ctacttcggc ccggtaacgg tatctatcgg ccgccgcacg gagaagcgat 5580 ggggcgttct tgctacgtgc ctaactacgc gggctatcca tctccaggtt gcctactcac 5640 tttccacgga ctcttgcata atggctttgc ggaacatttt cagcagacgg ggaactccat 5700 cggttatcta cagcgatcgc gggaccaact ttcaaggcac cgcaaagatg gtagaagagg 5760 ttgttcgaaa aatcgaccaa gacaagctcg tggccgaatt cacctcgaat catactgaat 5820 ggaagttcaa ccctcccgca tcccctcaca tgggaggagc gtgggaacgc cttgtacgaa 5880 ctgtgaagca gaacttgaac aaggtgctag gtactagagt cgtgtccgca gaagtactcg 5940 agaacgcgtt gaacgaggtg gaaaatattg tcaattcgcg accgttgacg aacatcccgg 6000 tagacggtga tctctcccca gtgctgacac ccaaccactt tcttgttcag tcctcaaacg 6060 gcctgaaacc attcgtactc tacgatgata gctcccgggc tttacgaaac aattacgagc 6120 tatcgcaaat tctggccaac tacttctgga aacagtgggt tcgggattat ctgccaacta 6180 taacacggag aactaaatgg ttcacatcga caaaacccat cgaggtcgat gacatcgtca 6240 tcatcgccga tccgaggagc ccacgtaact gttggcccaa aggcagagtc atagctacca 6300 agatcgctgc agatggacaa gtacgatcag cgacagtgca gacggccagc ggcgggatct 6360 acgagagacc agctgtgagc cttgctgtac tcgacgttgg cgttgaagcg aatgcgcatc 6420 ggatgtcctt aacgcattcc ggggcggag 6449 // ID R1_DSi repbase; DNA; INV; 5434 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE simulans. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DSi. XX OS Drosophila simulans OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5434 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 293..1804 FT /product="R1_DSi_1p" FT /translation="CYQTLPMGGARVPEQLPFVSEQTSIRLLALTVGGATI FT VVMPMESDSSASALSGSSASRKSKRGRRRSHLAPGSAPTQAKLVALASNGV FT PEPVGVLKEPFSSLEDARAATANAAIDAAPPPHAAATAVDPTAAPAAAPAA FT DHTAVSAVSTAAKIVATTATAATAAARAGQAAMMAKLSATQRMVRSSFRRL FT GEVDTEELSYAISRYDELVLALMLRCGELETRLAMPPPPPPSSNPLKNTAA FT NAPQMQQVAPTAAPRTTKVRETWSAVVKCDDPALSGKAIAEKVRTMVAPSL FT GVRVHEVRELRRGGGAIIRTPSVGELQKVVASKRFTEVGLNVARNAAEKPK FT VVVYDVDTAIGPEEFMKELHENNFDSEINLSQFKKSVHLVTKAWSVADGAT FT VNVTLEVDDRAMAKLDVGRVYIKWFSFRCRSQVRTYACHRCVGFDHTVSEC FT RQKDSVCRQCGQQGHTAAKCQNPVDCRNCRHRGQPSGHYMLSSVCPIYGAL FT LARVQARH" FT CDS 1801..4863 FT /product="R1_DSi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TLMFSFIQANCGRGRAATIELGVRLRRSESMFALVQE FT PYLGGDGMDVLPEGMRIFIDRRGKAAILVDHQEAICMPVETLTTDYGVCLV FT VKGSFGSIFLCAAYCQFDAPLEPYLRYMDAVLLQASRTPAILGLDANAVSP FT MWLSKLSRHAEGQANYRRGELLSEWMLEARVAALNQSTEVYTFDNHRATSD FT IDVTIVNEAASMWATYEWRVDEWELSDHNIITVVAEPTTARAVESIAPVPS FT WNFSNARWRLFKEEMVSRTAELPENFSESPLDQQVSTLRSIVHNVCDIALG FT RKSIRSPNRRARWWTADLCDARREVRRLRRLLQDGRRRDDGAAIERVVAEL FT RRASANYKKLIWRAKMDDWKRFVGDHADDPWGRVYKICRGRRKCTEIGCLR FT VNGEMITDWGDCARVLLRNFFPVAESEAPTAIAEEVPPALEVFEVDACVAR FT LKSRRSPGLDGINGTICKAVWRAIPEHLASLFSRCIRLGYFPAEWKCPRVV FT SLLKGPDKDKCEPSSYRGICLLPVFGKVLEAIMVNRVREVLPEGCRWQFGF FT RQGRCVEDAWRHVKSSVGASPAQYVLGTFVDFKGAFDNVEWSAALRRLADL FT GCREMGLWQSFFSGRRAVIRSSSGTVDVPVTRGCPQGSISGPFIWDILMDV FT LLQRLQPYCQLSAYADDLLLLVEGNSRAMLEEKGAQLMSIVEAWGAEVGVA FT VSTSKTVIMLLKGALRRAPTVRFAGANLPYVRSCRYLGITVSEGMKFLTHI FT ASLRQRMTGVVGALARVLRADWGFSPRARRTIYDGLMAPCVLFGAPVWYDT FT AXQVAARRRLASCQRLILLGCLSVCRTVSTVALQVLGGAPPLDLAAKFLAV FT KYKLKRGYPLEENDWLYDEDTTCLSWKQRKTRLEECLLQNWQNRWDDDSEP FT GRVTHKFIPYVTLAYRNPSFGFSMRTSYLLTGHGSFNAFLQGRALSDTTAC FT ACGDPYEDWMHILCACPLYADLRNLDGLGVQRLGENWTFDRILKDQQRTQR FT LAVFADEVFRRRRGV" XX SQ Sequence 5434 BP; 1203 A; 1380 C; 1656 G; 1193 T; 2 other; cggacgtgtt ttcgttgcgc tcgtgtacag attgcgaaga acttggtttt ccgtgtttgg 60 aaagtaataa aatcggtgaa ttagtgctcc gcgaaagtcg tgtgctaatt ttcgtgtgtt 120 ataaacaagc ggtttggaag taattagcga ttaattattg caaattttcc attcagcacg 180 tgctgcgagg cgagcttacg agtaagtttg tcgagtaagc tttttgatca gctgtcacaa 240 agcttattgg tggcagtcac tgctaaggtt tgtgtctagg gagagttttt agtgctacca 300 gactctaccg atgggtggag ctcgagttcc agagcaacta ccttttgtca gcgagcaaac 360 gagcatacgg ttgctggcgt tgacggtagg tggagccacc atcgtagtta tgccgatgga 420 gagcgacagc agcgcgagtg ccttgagcgg aagtagtgcc tcgaggaagt ccaaacgagg 480 caggcgcaga agccacctgg cacctggctc ggcgccaacg caggcgaaat tggttgccct 540 ggcatcgaat ggagtgccag aacccgttgg ggtactraag gagccgtttt cgtcgctgga 600 ggacgcccgg gcggctacgg caaacgctgc catcgatgct gccccccccc cccacgctgc 660 tgccaccgct gttgatccta ctgctgcccc tgctgctgcc cctgctgccg accacactgc 720 tgtctccgcc gtttccactg ctgctaaaat tgttgccacc actgccactg ccgccaccgc 780 tgccgcccgt gctgggcaag cagccatgat ggcaaagctg tcggccacac agcgcatggt 840 gagaagcagc ttccgcagac taggagaagt ggacaccgaa gagctctcgt atgctatcag 900 ccgctacgat gagctggtct tggcgttaat gctccggtgt ggagagctgg agacgcggct 960 tgctatgccg ccaccgccgc cgccgtcgtc gaatccgttg aaaaatacgg ccgccaatgc 1020 tccccagatg cagcaggttg cacccaccgc tgccccgcgg accaccaagg tccgcgagac 1080 gtggtcagcg gtggtgaagt gcgacgaccc tgcgctatcg gggaaagcca tagcggaaaa 1140 ggtgcgaacg atggttgcac cctctctcgg agtcagagta cacgaggtcc gtgagctccg 1200 tcgaggtggt ggtgcgatca ttcgtactcc ttcggttgga gagctgcaga aggtggtagc 1260 ttcaaaaaga ttcaccgagg ttgggctaaa tgtggcacgg aatgcggccg agaagccgaa 1320 ggtcgtcgtc tatgacgtcg acacagctat cggccctgag gagtttatga aggagctcca 1380 cgagaacaac ttcgacagcg aaattaatct gtcccagttt aagaagtccg tgcacctggt 1440 gaccaaggcg tggtcggtag ctgacggcgc cacagtaaat gtgacgctag aggtagacga 1500 ccgggcgatg gcgaagcttg atgtaggtcg tgtctacatc aagtggtttt cattccgatg 1560 ccgatcacag gtccgcacat atgcctgcca cagatgcgtg ggtttcgacc acacggtcag 1620 cgaatgccgg cagaaggaca gtgtctgccg tcagtgcggg caacaaggcc acactgcggc 1680 aaagtgccaa aacccggtgg actgccggaa ctgccgccac agagggcaac cctcggggca 1740 ttacatgctc tcgagcgttt gcccgatata cggggcgttg ctggcgaggg tgcaagctag 1800 acactaatgt ttagcttcat ccaagcgaac tgtggccgag gccgagctgc gaccatcgag 1860 ctcggagtcc gactcaggag atcggagtct atgttcgcgc tggtgcagga gccgtatctc 1920 ggcggggacg gaatggatgt gctgcctgaa ggaatgagaa ttttcatcga tcggcgaggg 1980 aaggcagcca tcctagtgga tcaccaggaa gccatctgta tgccagtgga gaccctcacc 2040 acagattatg gcgtatgtct ggtcgttaaa gggagttttg gctccatctt cctttgcgcc 2100 gcatactgcc agttcgatgc acctctggaa ccgtacctcc ggtacatgga tgcggtcctg 2160 ctgcaggcca gcagaacccc cgcaatcctg ggcctcgacg cgaatgcagt gtcccccatg 2220 tggcttagca aactctctcg tcatgccgag gggcaagcta actacagacg gggtgagctg 2280 ctgtcagagt ggatgctgga ggcaagagtc gccgccctaa accagtcaac agaggtgtac 2340 acgttcgata atcacagagc tacaagcgat atcgacgtga caatcgtcaa tgaggcagca 2400 tctatgtggg ccacatatga gtggagagtg gacgagtggg aattgagtga ccacaacatc 2460 attactgttg tggccgaacc aactaccgcg cgcgcagttg agagcatagc tcctgtgccg 2520 tcctggaact tctccaatgc acgttggcga ttgttcaagg aggaaatggt gagtagaaca 2580 gccgaacttc cggaaaactt ctcagagtcg ccgttggacc agcaagtttc gaccctgcgc 2640 agtatagtac ataatgtatg tgatattgcg ctgggaagaa agtcaattcg atcgcccaac 2700 aggagagcac gttggtggac tgccgacctc tgcgatgcaa ggcgcgaagt ccggagactt 2760 cgtcgcctac tccaagatgg aaggcgtcgt gatgacggtg ccgctataga gcgtgtagtg 2820 gccgagctga ggcgggcctc agccaactac aagaagctca tttggagggc gaaaatggat 2880 gactggaaac gcttcgtggg agatcatgcc gacgacccat gggggcgcgt ctacaagatt 2940 tgccgaggcc gcaggaagtg cacggagatt gggtgcctcc gcgtgaatgg cgagatgatc 3000 actgattggg gtgactgtgc acgagtgctc ctccgcaatt tcttcccagt tgcggagtcc 3060 gaagcaccga ctgccatcgc ggaggaagtc ccaccggccc tcgaagtatt cgaggttgat 3120 gcatgtgttg cccggttgaa gagcaggcgc tctcccggct tggacggcat caatggcact 3180 atctgcaagg cagtctggcg cgccataccc gagcaccttg catcgttgtt ttcccgatgc 3240 atccgattag gatattttcc cgccgaatgg aagtgcccac gagttgtctc gctgctcaaa 3300 gggccagata aggacaagtg tgagccctcc tcatatagag gaatatgctt gctaccagtc 3360 tttggtaagg tgctcgaggc catcatggtg aatcgtgtga gagaagttct tccggaaggc 3420 tgcagatggc agtttggatt tcgccaagga cgatgtgtgg aggatgcttg gaggcacgtg 3480 aagagcagtg ttggtgccag cccggcgcaa tacgtgctcg gcacattcgt ggacttcaaa 3540 ggagcattcg acaacgtcga atggagtgct gcactccgcc gactagccga cttgggatgc 3600 cgggagatgg gcttgtggca gagctttttc tccggccgaa gagcagtgat ccgaagcagt 3660 tccggtactg tggatgtacc ggtaactaga ggctgcccgc agggatcaat cagtggccca 3720 tttatctggg acatactgat ggatgtactg cttcagcgtc tccagccgta ttgccagctg 3780 agtgcatacg cggatgactt gctgcttctc gtcgagggaa attcccgagc tatgctagag 3840 gaaaaaggtg cacagctgat gtccatcgta gaagcgtggg gagcggaagt tggcgttgcc 3900 gtctcgacca gcaagacggt aataatgctg ctgaaaggtg ccttgagacg tgcgcctacg 3960 gtgaggtttg ctggagcgaa ccttccgtat gtgcgtagct gtcggtacct tggcatcacg 4020 gtcagtgaag gaatgaaatt cctcacgcac atagcttcgc ttcgccagcg gatgaccgga 4080 gtcgttggag cattggcgcg tgtgcttcga gccgactggg gcttcagtcc tcgagccagg 4140 cggaccatat atgacggact catggcacct tgtgtgctgt ttggtgcccc ggtatggtat 4200 gacaccgccg adcaagtagc cgcccggaga agactagcct cctgccagag gctaatcctg 4260 cttggatgcc tttcggtgtg ccgaactgtg tccacagtgg cactgcaggt acttggtgga 4320 gctcccccgc ttgacttggc tgctaagttt ctagcggtca aatacaagct gaagcgtgga 4380 tacccgctgg aggagaacga ctggctatac gacgaggaca ctacgtgtct aagctggaag 4440 cagaggaaga cgcgcctaga agagtgctta ctgcagaatt ggcagaatag atgggatgac 4500 gacagcgagc caggacgggt gacgcataag ttcatcccat acgtcactct tgcctatcgg 4560 aatccaagct ttggattctc gatgaggacg tcttacctgc tgacaggtca cgggtcgttt 4620 aacgcatttt tgcaagggag agccctcagc gataccactg cttgcgcttg tggagatcca 4680 tatgaggatt ggatgcacat attgtgcgct tgccccctat atgcagattt gcggaacctc 4740 gatggacttg gagtgcagcg ccttggcgaa aactggacct tcgatagaat cctgaaagat 4800 caacagagga ctcaacggct ggcagtgttt gcggacgaag tgttccgtag gaggaggggt 4860 gtttagccca acaacttcgc cgtgtggtta gcgggcgaga atacttccac agcccgctat 4920 tgcttgtcgt aagaggcgac taatatagcg ataggttcct ctaaccgtgc ttgtcggagc 4980 aaaaggagga ggcccaccga gcctctcttt cggtaccacg ggttgtgcag ctatccaaga 5040 ctgcacattg aggtaggccc cctggtggga gtatcgtggt ggctgtggtt ggtacccata 5100 tcgcgggtag agccttcgtg ttcgacgttt gagttgcggt gctggttgcg caaaactcgg 5160 gtgctgtgac ccagagatca gtagagattt taggtagatc tcgctcctca gcaaggggga 5220 gtgcttgccc ggcaagcaag tactcgaatt gctaccgggg tggtcgctat gtacatagct 5280 atagcttcca gtccgggacg cttgtctggc gtatccagac tcatgcacca tgttgataca 5340 tgcaccacta gtgggtgttc agggtgtcgt ggttgtaatc ccttcagtgt ggaacacgcc 5400 acgtaaaaca agttcggaga ggtccgaaag tcac 5434 // ID BEL1_Cis_I repbase; DNA; INV; 6020 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of BEL LTR Retrotransposon from Ciona savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; internal portion; KW BEL1_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6020 RA Smit A.F.; RT "BEL1_Cis_I - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000143, Ci000029, Ci000555, Ci000725 ORF from bp 55 to 5871 CC encodes proteins closest related to those of Catch3 in Danio CC rerio (56% similarity). XX SQ Sequence 6020 BP; 1676 A; 1567 C; 1434 G; 1341 T; 2 other; tcgttttcaa actgagcaaa ctgatcaaac ccatcgtaac cgtacctctc gaccatggat 60 aaccaaccag aacaccgcgt agaacaatgg ctggacaaca ttgaagaagc caagcagcca 120 ggcattgaag aagccaagca gataccagca ccacctaatt caaccacacc actcgatgaa 180 aagcacgaat cagtaagtgt tgtaagcact acaagctcga tgcgcagctt taagctaaaa 240 gatagcaagg taaagcttga acttgctagg cttgcgcgac aacaagactt agagagaaaa 300 gaagaaagca agctgctcgc tgaaacagag gaacataaaa gaaaactgct cgctgaagca 360 gctgaaagag aaagaaaatt acaagaggaa ctcgccgacc gtgaatcccg gcgaaaacta 420 gaattggctg aagctgaagc tagggcatgg gaggaagcat ccmttcctga ggccagacac 480 aacgaagttc aaacccgcca atatgctggc tcaacgacac cgcttctaga gcagactaca 540 tccactgaac aatcacgcaa cgagataaca ccgaccgcag cgcagccacc acaacgttct 600 actctgtacg ccccgcccgt tgtgcgcccg aaggacagcg ctctgtcaca tcttccctta 660 accgaacgtc atttgccacc gtttccgtcc gacatccggg tgaacgacga aagagaacac 720 gtgacaacac agcatcagaa ccgaccgtcc cgatatgctg atcgctcccc aatgcgcaat 780 caacaacgac gtacacagct atcaccgact cgtatccgac gagattcacc actccgtcac 840 gtaaatcacc aagaggtagg tgaacgcttt ctacccaaac caattgtaca aaaatttaat 900 ggagatccga tggactactg gtccttcgtc aaccgatacc agctacatat tgccgacaaa 960 gtagactcat acgacctaaa attagtatac ctactccaac attgcacaga gcccgtctgt 1020 gagaaaatta agcataactc cgtagataat tacgaaaatg atggctacga aacggccatg 1080 caagaattat tcgaaagata tggtcaaccc catataatag cacgacggtg cgagagcttg 1140 ttgcaggaat tcccgagggt gcccgcagat gaccccgaag cgctggagaa attggccgtt 1200 ctgctccgaa actgctgcgc gtccctggaa aagacaaggg taaaatcgac ggtagactct 1260 gttggcttca tatcgacgct ggtcaacaag ctgccattgg aactgagaaa gaaatggata 1320 tccaagtcgc tgagtataca ggacaaaaca ggtgcggtag ctgtttttaa agactttgct 1380 aacttcgtta gagaacagag tcgcgaagct aactctgtgt atggaaagtg tttactgcta 1440 aagtccacaa accctgtagt aactaccgca cgaaaaccga aagtgtcggc ttataacgtg 1500 aacgtgactg ccgaatttcc cactgctgcc gctcccactc cgaaaccgtc cgggctgcag 1560 tgccctgtct gcaagaaacc acatttgatt ggcgcctgcc gcgacttcaa ggataaatcg 1620 tacgagcaac gtaagcgcgt aacccgggaa tgcggcctat gctttaaatg cctaaaggca 1680 ggccattccg tgcgagaatg cccacataaa atcaaatgca gtgtagaagg ttgcaccggc 1740 aacttccacc acaccttgct ccatgacccg agatggatca tcagacccgc agaccgaatt 1800 gagagcaaca cgaccgttaa agcatgcatg cactcgctta gcgctaagaa aaacaccgtg 1860 gtaccgactt ccgcatattt agacattgtc cccgtacgcg ttagaagcgg aggcaaagag 1920 acgagagtgt acgctcttct cgattccgga gcgaaccgca cattctgtga ccgtcgactg 1980 gccgatcagt tacaagctac gccgactggt cgcgccgcga actttgacct gtgtacaatg 2040 actacttccg aaccgatcaa tgttaaaacg acttccgtct cgttaactgt tcttccgctt 2100 gctggcggcg aagagcttaa cctgcccgac gttgcgctca ttaaggaaat tcccgctaca 2160 ccgagcaaga ttccagagcg caataaactg aaccgatata agtaccttcg tggagtggac 2220 ctgccacaaa ttcccgatgg tgaagtgatg ctgttgatcg gcaacgacaa cgcaatggca 2280 caccggtgcc tggaaagtcg cttcgcacca aatccacgcg agacgcctga cgctatccgg 2340 acccccctag gctggctgtt aaaaggaccg tcgctggaca gcttgccgaa cgaagagggt 2400 gcagccggat tcttacttca agccacagcg ggctcaggaa gtgacgaggc gataaaggac 2460 ttactcatca atgacgttgg tgatgtgttc ccctcctccg atactgctgc aaccgacgac 2520 gttgaaacat tgatgtgctg gcttcgaagc cacaaggagg cgtcggagtt tggcatgaag 2580 tactcaaaag aagacgtcat ggcttacgat ataatgacgc gccgaattga aagagttgaa 2640 ggccactacc aacttccgct gccatggaaa gacgacagca ccgttttacc ggatagcctg 2700 ccaatggcaa ggagcagact atcaagtttg aagaagcgac tggcgagaga tgtcgatttt 2760 tccgtaaagt atacgaaata catgacgatg atgttggaca atgggtacgc agaagtagtg 2820 ccagaggacg aagtgaatac taccaacaag ttatggtacc tcccgcatca ccctgtaatt 2880 aacccaaaga aaccaagtaa ggttcgtatc gtgtatgact gtgcggcgaa atgcgaagga 2940 caatctataa atgacaagct gatgaagggc cctgatttag tgaacccgct ggtaggggtg 3000 ttgatgagat tccgcagaga acgcgtggct atcgtatccg acatagaagc aatgtttcat 3060 caagttctgg ttgcccaaaa ggatcgtgac gctctgagat tcttgtggtg gccaggaggc 3120 gacctgacca aatcacccgt tccgcaccgc atgaaagtgc acctgtttgg cgcccgctcg 3180 tcaccgtgct gcgcaacttt ctgcctgcgc gaaactgcca gagaatttgg caagtttttt 3240 catccacgag tggccgaagt tgtaaagaat aacttttatg tggacgactg cctcgtaacc 3300 gccgttgatg atcaagccgg catccaactg gtgaaggatt tacgtaatct actttcaatg 3360 ggtggcttta aactgacgaa gtggctatcc actagtgtcg ctgtgatgga atccgttccg 3420 gccgatgatc gagcaaaggt gattcaagat attccgctag gaggagaggt acatgaacgt 3480 gttctgggaa tcaattggtg tgtccaagac gacgagttca gattcgacat cactatcccg 3540 acaaaaccgc tgacacgacg aggtatgctt tccgttacca attcactttt cgatccatta 3600 ggctttgttg cccccgttgt attggaggcc cgacttattt acagagatct ctgcttgatg 3660 aaaaccgact gggacgagga ggtcactggt accgagctcg agcgatggaa agcttggtgc 3720 aacagcctta atgaactgac ccgcattaga attcctcggt gcttcaaacc acaagtccgt 3780 catccgaccc ccggagtaca gctccatgta tttgcagacg catctaccgc cgcccgtggt 3840 tgcgtatgct accttagaac taccttcgag gatgacaaca cgacatgttc gttcgtgatg 3900 ggaaagtctc ttctctgcga ttctggtaag cacacaattc ctagattgga gctcgaagct 3960 gcacttgacg cagtaaaatt atcgcgaacc gtgaaaaggg aactcggact ggacgattgc 4020 ccgtgtatct actggacgga ctcgacaatt gttctacaaa gccttttctc tgactgcaag 4080 aactttccgg ttttctctcg aaaccgcctc tctcaaatcg aagcatacac aagcgtgcac 4140 gactggagac atgttcctac gaaactgaac ccagctgatc aagcgtcacg tggaacgacg 4200 gcagggacca tgatgaagac aggaacgtgg cattccggcc ctaagttcct acaccgtccc 4260 gctaaagact ggcctactac gtttttacgg ccacaaccaa cggagcaaat ctaccgaagt 4320 tttgacctgc caaaaccaga atctgtattt ctgacagcta ccagctgctc acccacggac 4380 aaattcatcc gttatttctc gtccctgcat cgattgaaga taagcacgtc ctggattctc 4440 cgtttttgtg catacctgaa gaacaaggta aagaaggaca ccgcaaaaat tccgacatcg 4500 ccgattgacg ccgaggagct tcgatctgct gagagcgccc tcgtgcggta cgtacaacgt 4560 cgggagttcc ccgattggat gcgcatgcaa tcgacccgac gtccgccktc tacgtcaccg 4620 atctacaaac tgaaccctat tctcaatgat ggcattctac gcgttggagg tagattggac 4680 aatgcacgtc ttgactacaa agcacgccac cctgccatta tccccgataa ctgccacctt 4740 actgacctac ttatcgatca ctgccacagt gttgatgctg cacactctgg tatgaatcac 4800 actttaaaca ttttatttca gcgctactgg gtgcaaaatg ctagagttgc aacccgacgt 4860 atcctgtccc gatgcctact gtgccagcga agagctgcaa ggcccgaaag acagatcatg 4920 gctgatctcc cgccgtcaag actgcaaatt gacgaaccac cgttcactca tacaggagtg 4980 gacttcttcg gcccgctact gaccaagtta aggagaagtg aagtaaaacg gtatgggtgt 5040 ctgtttacat gtatgactac tcgcgccatc catctagaag tcgctacgga cttgtctacc 5100 gacgcgttta taaatgccct gcgacgtttt acggcccgac gaggtcccgt cactcacctg 5160 tattccgaca atggcactaa cttcgtcggc tgcgagcgga tacttcgaga atgtatagaa 5220 gaatggaatc gcaatcaagt tggcgactac ctgagacaga aaaatataat ctggaaattt 5280 aacccaccag ctgcaagcca ctttggtggg gcttgggaac gattgatccg ttccgtgaaa 5340 tttattttgc aggcattatt gcagggcaag ccgctcgatg atgaccttct acacaccctg 5400 cttgttgaag tggaggacat tgtcaattcc agaccgctta cacaagtacc gctggagatc 5460 ggagaagacc tgccgctgac accgaatcac cttctgaaaa tcaaccctac aattactccg 5520 ccaccgacag tgacacgtag tacagattgc tacgcccgac gacgctaccg agttgtccag 5580 tatctcgctg acgagttctg gaaacgctgg atccaagagt accctcgatc aataatctct 5640 cgacagaaat ggcacgagca aaaggacaat gttgctgaag gagatgttgt actccttgtg 5700 gacaattccg cccctcgagg acaatggcct ctaggccgcg tgaacagctt gtttcatgac 5760 aaacatggga tcgttagaag cgtcgaagtg aagactgccg ctgggttaat gaagcggcct 5820 atcgcaaaac tttgcgtgat cgtgagcgag cgagagagga gcgccccatg acctggcacg 5880 gcgttacgcc ttattgtata tattgtgtat attttttttg ttttttattc ttcgttaaaa 5940 atgtgcctcg aatcttttct ttgtgtaaaa aaattgaaac tgctttgatg acgcattttt 6000 ccatttttgg cgggggagga 6020 // ID Sola2-N1B_CQ repbase; DNA; INV; 1934 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola2-N1_CQ; Sola2-N1B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1934 RA Kojima K.K. and Jurka J.; RT "Sola DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 627-627 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. 4bp TSDs. ~90bp TIRs. ~80% identical to Sola2-N1_CQ. XX SQ Sequence 1934 BP; 671 A; 302 C; 305 G; 656 T; 0 other; gagcaattct ctaccaaaac cggaaatgga ttttatttgt attttttgat ttggctcaaa 60 ctttgtgggg gccttcccta tgaccaaata tgctattttg tgtcattggt tcacccatac 120 aagtctccat acaattttgg cagctgtcca tacaaaaatg gtatgtaaat attcaaacag 180 ctgtaacttt tgagtgaatt ttctgatcaa tttggtgtct tcggcaaagt tgtaggtatt 240 gttgaggact tttgagaaaa aaataggtac acggaaaaaa aaatttgcag atttttttat 300 caactttttt ttcactaaaa ctcaatttcc caaaatacgt attttttgat tttcgagatt 360 ttttgatatg ttttagggga caaaaatccg caacttttga gccatagaga aacatggtca 420 aaaaatctgc cgccgagtta tgaatttttg aaaaaatagt gatttttgga aaaaatcgaa 480 gtttcatgca aaaacaagtt tgacattatt ttttaatgca aaattgaatt tgcaatcgaa 540 aagtacttta cagatttttt gataaagggc tccgtttaca agatatagcc accgaaagtt 600 tgattttagc gaaatatttg cagtttttca atttttaaaa atagtgacca tgagtgacca 660 tttctaaaaa tatttttttt ttgaaaagtt cagaaaattt gctataaaat tgtctaagag 720 acattgaaga ttggacctct ggttgctgag atacagcggc ttaaagaaaa agaaacacga 780 aaattgaagt tttctaagtc tcacccaaac agcccaccat tttctaatga cgatatctca 840 gcaactaatg gtccgatttt caatgttaat acatgaaaca ttcgtgaaat tttccgatct 900 cttcgaaaaa aatattttga aattttttaa atcaagacta acatttgaaa agggccaaat 960 attgaatatt acgcccattt aaaatgctag tctagattta aaaattttca aaatattttt 1020 ttcgaaaaga tcggaaaatt tcacgaatgt ttcatgtatt aacattgaaa atcggaccat 1080 taattgctga gatatcgtca ttagaaaatg gtgggctgtt tgggtgagac ttagaaaact 1140 tcaattttcg tgtttctttt tctttaagcc gctgtatctc agcaaccaga ggtccaatct 1200 tcaatgtctc ttagacaatt ttatagcaaa ttttctgaac ttttcaaaaa aaatattttt 1260 agaaatggtc actcatggtc actattttta aaaattgaaa aactgcaaat atttcgctaa 1320 aatcaaactt tcggtggcta tatcttgaaa acggagccct taatcaaaaa atctgtaaag 1380 tacttttcga ttgcaaattc aattttgcat taaaaaataa tgtcaaactt gtttttgcat 1440 gaaacttcga ttttttccaa aaatcactat tttttcaaaa attcataact cggcggcaga 1500 ttttttgacc atgtttctct atggctcaaa agttgcggat ttttgtcccc taaaacatat 1560 caaaaaatct cgaaaatcaa aaaatacgta ttttgggaaa ttgagtttta gtgaaaaaaa 1620 agttgataaa aaaatctgca aatttttttt tccgtgtacc tatttttttc tcaaaagtcc 1680 tcaacaatac ctacaacttt gccgaagaca ccaaattgat cagaaaattc actcaaaagt 1740 tacagctgtt tgaatattta cataccattt ttgtatggac agctgccaaa attgtatgga 1800 gacttgtatg ggtgaaccaa tgacgcaaaa tagcttattt ggtcataggg aaggccccca 1860 caaagtttga gtcaaataaa aaaatacaaa aaataaaaat ggtcgaaatc ggccgatttc 1920 gtagagagtt gctc 1934 // ID Vingi-1_BM repbase; DNA; INV; 3206 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3206 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 155..3163 FT /product="Vingi-1_BM_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="MPGCARRNASTNQCQMTKDIGPSVRVCSLNIEGITQA FT KTECMLRIAIDNKIDIILLQETHTADLAQLARRGTIPGFRMICATFHPQYG FT TATYIREALLPSASIICTSTDTENIAMSTIQVGEMIVCNLYKPPNSTWPQP FT GLPNYGHPSIVVGDFNSHHTSWGYKNNDANGESVVKWADESSYFLVHCNKD FT KGTVKSARWGREYNLDLAFVSLDCKRRPLQVNRTVLQDFPHSQHRPVLLDI FT GLKILIIQSYPAPRWNFTKADWERYTKTLDDSIRWIPPLRENYERFLKLVL FT STAKKCIPRGYRKQYIPCWTTKTEALWQEFITTSDRNVAHELIESLNEARQ FT AKWESTVQDMNFTHSSRKAWHLLKRFTSGTTGQKSEPKVCPSKIADRIVKL FT SKAPSHKKHTREVRKDLKSLKTDTASDPYFSRPVTADEITASIRAIKGNKA FT CGEDKIFPEFIKDSGPHTRVWLAKFLTNIIDSGKIPLAMKRAKITAILKPG FT KPADSADSYRPIALLSVMYKLLERIILNRISPTIEKHIPQEQAGFRPGRSC FT SDQVLALTTLLERGFNENLKSSTVFVDLTAAYDTVWRQGLLYKLLKLIPCI FT KIYNVIEDALTNRPFRVHLGDKSSLSRVLNNGLPQGSVLAPTLFNVYTHDV FT PETLSRKFAYADDLALVAQGRSIETTECHLENDMATLDNYFIRWRLCPNAS FT KTEVCCFHLNNRQANKKPNVTFRGSTLPFNPCPKYLGVTLDRSLTYSAHLN FT NVAAKLRTRNNILQKLTGTTWGASANCLRTSALALVYTTAEYCAPAWLNSS FT HTTVVDTQLNHTMRLITGCLKPTPLHWLPALSSIAPPHLRRTHHLYAEIEK FT ALRRPELPINRDLANFKPTRLKSRNPPALQAHQPHSNWNTNEAWVKEWSAR FT CIPPELFELNPHSRPPGFKEKRPIWCMLNHLRTGVGHCNQLRKRWGWTENE FT SCECGCPEQTIRHIVMECPLTRFAGQVEDLKNVTEEAIDYFTNCKFKL" XX SQ Sequence 3206 BP; 982 A; 898 C; 663 G; 663 T; 0 other; gggtcgaagt tgcataacgt ccgtaggaga ttaccccacc gctaccctac acgtgacccc 60 tgggtggtgc taatagtgaa acaacctggc aaccgaactg tcctcctgag gatctcccgg 120 ttccaactac actgcgtcaa ttataaaccc taatatgcct ggatgcgctc gaagaaatgc 180 aagcaccaat cagtgccaaa tgactaaaga cataggacct tccgttcgtg tctgttcttt 240 aaacatagaa ggcattaccc aggccaaaac tgaatgcatg ctcagaattg ccatagataa 300 taagattgac atcattctgc tacaagagac ccacactgct gacctcgcgc aactcgctag 360 aagaggaacg attcccggct tccgcatgat ctgcgctaca ttccatcccc aatacggcac 420 tgccacatac atacgagagg ccctgctgcc ttcagcttca attatatgca cctccacaga 480 caccgaaaac atcgctatgt caacgatcca agtgggagaa atgatagttt gtaacctgta 540 caaacctccg aatagcacat ggccccaacc aggacttcca aactatgggc acccgtccat 600 tgtagtaggc gactttaata gccaccacac cagctgggga tataaaaata atgatgctaa 660 cggagaatct gttgtcaagt gggcagatga atccagctac ttcctggtcc actgtaacaa 720 agacaaaggc accgtcaaat ccgcacgctg gggtcgtgaa tataacctgg acctcgcctt 780 cgtctccttg gactgcaaac gcagacccct ccaggtgaac agaactgtac tacaagactt 840 tccacacagc caacaccgcc cagttcttct cgacatcggc ctgaaaatac ttatcataca 900 atcatatccc gcaccacgct ggaacttcac taaagctgac tgggaacgct atacgaagac 960 actcgatgat tccattaggt ggataccacc ccttcgggaa aactatgaaa gattcctaaa 1020 gctagtactt tccacagcta aaaaatgcat tccaagggga taccggaagc aatatatacc 1080 atgctggact accaaaactg aagctctatg gcaagagttc ataaccacta gcgaccggaa 1140 tgttgctcac gaactcattg agtctctcaa tgaagcacga caagctaaat gggaaagcac 1200 agtccaagat atgaacttca cacactcaag ccgtaaagcc tggcacctac ttaaacgctt 1260 cacttctgga actaccggcc agaagagtga accaaaagta tgccccagta aaattgcgga 1320 taggattgta aaactgtcta aagcacccag ccacaaaaaa cacacccgtg aggtgcgtaa 1380 ggatcttaaa tctcttaaga cagacactgc ttcagaccct tattttagcc gtccagttac 1440 agcagacgag ataactgcca gtatcagagc tattaaggga aacaaagctt gtggagaaga 1500 caaaatcttt ccagagttca taaaagacag tggaccccac acccgtgtgt ggctcgcgaa 1560 attcctcacg aatatcattg actctggaaa aatcccactt gcgatgaaac gggcgaagat 1620 aacggcaata ctcaaacctg gtaagccggc tgacagtgca gatagctatc gccccattgc 1680 ccttttgagc gtaatgtata agctacttga gcgaatcata cttaaccgca tttcaccaac 1740 tatagaaaaa catattcctc aagagcaagc tggattccgt ccaggacgca gctgcagcga 1800 tcaagtgtta gcactgacca cactcttgga gaggggcttt aacgaaaacc tcaagtcgtc 1860 aactgtattc gtggacctaa cagctgctta cgacacagtg tggcgccagg gactgctgta 1920 caaactactc aagctgatcc cctgtataaa gatctacaat gttattgagg acgcccttac 1980 caacagacca ttccgtgtcc atctgggaga caagtctagc ttatccaggg tcctgaacaa 2040 cggcctcccg cagggttcag tcctcgcacc aacactcttc aacgtgtaca cccacgatgt 2100 gccagagacg ctctccagga aatttgcgta cgccgacgat ctagctctag tcgcacaagg 2160 ccgcagtata gagacaacgg aatgtcacct tgaaaacgac atggccacgc ttgataacta 2220 cttcattcga tggcgtttgt gtccaaatgc ttccaagacg gaagtctgct gtttccacct 2280 aaataaccga caggctaaca agaaacccaa cgtcaccttt agaggctcca ctctcccctt 2340 taacccctgt cccaaatatc tgggtgtcac cctagaccgg tctctcacat acagtgccca 2400 cctgaacaac gtcgcagcga aactgagaac caggaacaat atcctacaaa agctgacagg 2460 gacaacgtgg ggagcctcag ccaactgtct gaggacatcc gccctggccc tggtatatac 2520 gacggcagaa tactgcgccc ctgcatggtt aaacagctcc cacactactg tagtcgatac 2580 gcaactaaac cacaccatgc gcctaatcac ggggtgctta aaaccaaccc ctctgcactg 2640 gcttccggct ctgagcagca ttgccccacc ccaccttcgc cgcacccacc acctatacgc 2700 agagatagaa aaggcgctta gacgacccga gctgcctatc aacagggacc tagccaactt 2760 caaaccgacg agattgaagt cacgaaaccc accggcatta caggcacacc aaccacactc 2820 caattggaac acaaacgagg cctgggttaa agaatggtca gccagatgta tcccacctga 2880 actgttcgag ctgaatccac actcccgccc acctggtttc aaagaaaaac gcccaatctg 2940 gtgcatgctt aaccacctga gaacgggagt tggtcactgt aaccaactgc ggaagagatg 3000 gggctggacg gaaaacgaaa gctgtgaatg cggatgccca gagcaaacta tacgccacat 3060 tgtcatggaa tgcccactaa cgagattcgc aggccaagtg gaagatctca aaaatgtaac 3120 agaagaagct atcgattatt tcacaaactg taaatttaaa ttataaccac gcagtgtaaa 3180 cgaatgccat acgataaata aataaa 3206 // ID Gypsy-9_OD-LTR repbase; DNA; INV; 186 BP. XX AC CABV01000587; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_OD_; KW Gypsy-9_OD-I; Gypsy-9_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000587; Positions 13847 13662. XX SQ Sequence 186 BP; 46 A; 44 C; 35 G; 61 T; 0 other; tgtggcggag aacttgttgt tcactttacc cgcaacggtt atgagaacgc tgcactgcga 60 tgacatcact gataacggct acgccgttct tctcttactg cttttcactc actcacttta 120 gagccacgaa taaaacagcc agattttcgg attttgattg ttatattata ttgagatttt 180 ccacca 186 // ID SK repbase; DNA; INV; 256 BP. XX AC D14862; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE SK repeat family. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SK; KW SK repetitive sequence; short interspersed repeated sequence. XX OS Loligo bleekeri OC Eukaryota; Metazoa; Mollusca; Cephalopoda; Coleoidea; OC Neocoleoidea; Decapodiformes; Teuthida; Myopsina; Loliginidae; OC Loligo. XX RN [1] RP 1-256 RA Ohshima K., Koishi R., Matsuo M. and Okada N.; RT "Several short interspersed repetitive elements (SINEs) in RT distant species may have originated from a common ancestral RT retrovirus: characterization of a squid SINE and a possible RT mechanism for generation of tRNA-derived retroposons."; RL Proc. Natl. Acad. Sci. U.S.A 90, 6260-6264 (1993). XX DR GenBank; D14862; Positions 56 311. XX SQ Sequence 256 BP; 61 A; 68 C; 68 G; 59 T; 0 other; cggtttagct cagtcggtag agcatccgcc tcgtaacgac aaggttccgg gttcgatacc 60 ggttcccgcc aactcagcgt atgagtagca cggcttggat ctgtgcggta aagtggggag 120 ctgatggtag gagtggcgct ccacctcaga cgcaaaaaac aatatccctg gtctagcgct 180 agcgaaaagg gaatatacta cctcgctgtt cggctgatac cgacaatacg ccacttcgcc 240 tacgtctaat ttactt 256 // ID Gypsy-32_OD-I repbase; DNA; INV; 4918 BP. XX AC CABV01001277; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_OD_; KW Gypsy-32_OD-LTR; Gypsy-32_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-4918 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001277; Positions 9387 14304. XX CC Positions [3705-4178] - Integrase core CC 'ATAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1110..2897,2901..4670) FT /product="Gypsy-32_OD-I_1p" FT /translation="MSITTSQRATTAHCTKLTFTINEFKFTRSTLVDSGSV FT INLIPRDAIPNNITHLITPRNSTVTGVGERKIIGTINADIFTTDGHLLARR FT ISFSVVESPFPIILGTPFFNCSDFVTKSYRISHDYLEVTTKRDNVKHFIPH FT NMDNVRAYTVSAPTEKLSNEQKTQWLLKSKEITIPSACEDGSNHHRLANLL FT YDNSDVFCSSSEDIGTYPHEVSIHTEPGRAKYVKQHPIPAQYKEEVSNEIR FT RMLQKGIIEECRDNRGWHSPIHCVPKKSGRLRIVCNFKPTVNKLTTQDNDA FT CEIPKIDLIMSRIGTRTKYFSCLDVASGYWMLPIKEADRHKTCFSWDNQQL FT QFKRLPFGLSFAGYSFVKAISKALNTITNRDQITTYIDDILIASNSYDEHL FT ETLRQLLCAFRSFGIRLGSSKCKFFQKEVTFMGRLLTPEGVKLDPKNYDAI FT MNMSAPTTRNGLQSALGNFTWIKSWLSANYNENVAENSCSQLLKELNNCMK FT GSKKKFKMTKEAQVAFATAKKRISSDKVYAYADFDKPFILVCDASAVAMGA FT LLIQIHDKKQKIVAAASKTFTECEQRWSTSERECYCIVTMCERFDYYLRPK FT AFTVLTDHRPLTALDVKSFGSPKLARWQLRLQCYRFVTQYIKGSQNVWADL FT LSRPFDRAPSKLIEKNTVMGSYYKLDDDSSMEIYIPSWVQKTDLPKELLLK FT KSHVTACHALGHCFTSGFEKLPISEFRIIEGAQAEDPTIGIIRDLIRNDVD FT VDKWSAPDNAYEWKKMKKFANFLFIDKTTNLLLIKLEGNVPKVVLPRALRT FT RYIRDAHEKGHFGVARTIEIMHWCWWSNKRDDISDFISTCEYCARRKGSYI FT QPATPPLKHVLKGTRPFEIIYCDYVHMPTGKTGKRYILTIQDGFSRFLYTI FT PQARNRAIDAARGLYTFMLQYGFPSVISSDKGRHFISQVIEDFANLLKIRQ FT NLHCSFRPQSTGTLERVHRVLKNSLWGVANDQGCDWEDALPSVTSWINRGY FT CKSIKSTPWKIIFGYDYNDSGLALPINSTAAKTANQYAKQIRNTLSIAYEM FT VKLAQSEADAAVEAKKPYFAPVAIASGDQILLRREVSAEAKKTHQPYIGPF FT KVLKSNDCILLIDRDGATEFVHRGHVIKYKPRTGQQDDDILELMDTGPDDA FT RPTPTTELRRSTRLKKPIARLGIHSSA" XX SQ Sequence 4918 BP; 1562 A; 1141 C; 1099 G; 1116 T; 0 other; gtggtaccag aagtaaaaca aaaatggatc gttacattgc agatatgaag aagctggaga 60 tggcgatcaa agagtctgaa gacgacacaa ccaccaaaat tctcgagaag gcgatgaagg 120 atcttaaacc gacggattac gtaatagtta agtcagaagc cacgcagcta gtcgcactca 180 gaaaatcgtg cgaaaacgcg ctcgacaaac tcgaattaga ggctcgcgat tacgaggaag 240 tctctcgatt ttgtgcctcg gtagacaaaa ttcgagctag ctatgagtca attagtgacg 300 gttagtttac gaatgacggt tcttcgattt gggacagacg cgtacgcgag atacgaatct 360 agcgagagaa atcacggcga atggaagaag ttgagagatt ggattctcgt tgagttcagc 420 agcgggctct cgattctgca gctgattgga cgagcttttg acgcaccgtt tgaaacagcg 480 cgaggatgga aacacttttc aatcttagtc gagaaccgat tacagtgcgc tgaagcggcg 540 attttgaaga agatacgaga gaaaaagata gcggaaggaa agactaaaga agaagcgcac 600 aaatacaatc cgacggcgag agaaataatt cagtattttg gagccgccat tgtttcggat 660 cgcattaaag gtgaagataa cgacttattc aacaaaatgg cgtctcagtg gaaggattgc 720 tgctctgcta acgaggttgg aaacagtgct caatatttac acgcacagag cgttagtaca 780 gaaagctacg cgataagaag atctcgattc gagaagagaa gcccagacaa ttcgaacagc 840 cctcgaaaag attgtcgaca cggggcggaa tgtcgatttt tgcgcaacgg ccactgcaaa 900 atgcgacata caaaagaaga ggtaattgcg gctaaaaacg cgaaatcaca gggcgcgtac 960 agaaacgacg atcgtcgcgg agaaaataga cgcaccttcc acaccgatga tcagaatgaa 1020 aacggcgcag aaagcacgcg cgaagccgac aattttcccc gagaaagcgc aagttcgatt 1080 tttcatcagt gatcgacaag ccgcacgcga tgtcgatcac aactagccag cgtgcgacaa 1140 ccgcgcattg tacaaaactc actttcacaa tcaacgaatt caaatttaca cgatctacac 1200 ttgtcgattc cggttcagtc atcaatctta ttccacgcga cgcaataccg aacaacataa 1260 cgcatcttat tactcctcgc aactcgacag taaccggagt aggagaacga aaaattattg 1320 gaacgattaa cgcggatatt ttcacaaccg acggtcatct tttagcacga cgcatatctt 1380 tctctgttgt tgaatctcct tttcccatca tacttggaac gccattcttt aattgcagcg 1440 attttgttac gaaatcatat cgtatttcac acgattatct agaagttacg acgaagcgag 1500 acaatgtgaa gcattttata ccacacaaca tggacaacgt acgagcttac acagtctccg 1560 cacctactga aaaattatct aacgagcaga aaacacagtg gctactaaaa tcgaaggaga 1620 ttactatccc ttcggcctgt gaagatggta gtaaccacca caggttagcc aacctactat 1680 acgacaatag cgacgtattc tgctcgtcat cagaagatat cggaacctac ccgcacgaag 1740 tgtcaataca cacggagcct ggcagagcga aatacgtcaa gcagcacccc attccagcgc 1800 agtacaagga agaagtaagt aacgaaataa gaagaatgct gcaaaaggga atcattgaag 1860 agtgccggga caaccgaggg tggcatagcc cgatccactg cgtccctaaa aaatcaggtc 1920 gactcagaat cgtctgtaac ttcaaaccca cggttaacaa actcacgacg caagataacg 1980 acgcatgcga aataccgaaa atcgatctga taatgtcaag aataggcacg cgcacaaaat 2040 atttttcctg cttagatgtt gcgagtggat attggatgct cccgatcaag gaggcagatc 2100 gtcacaagac ctgtttcagc tgggataatc agcagctaca gtttaaaaga ctgccattcg 2160 gactttcatt cgctggttat agtttcgtga aggcaatttc aaaagctcta aacacaataa 2220 cgaaccgaga tcaaataacc acatatatcg atgatattct aatcgctagt aattcatacg 2280 atgagcattt agaaacactt cggcagctac tttgcgcttt ccggtcattt ggaattcgac 2340 ttgggtcatc gaaatgtaag ttctttcaaa aagaggtaac gttcatggga agactgctca 2400 cgcccgaagg tgttaaactc gatccgaaaa actacgatgc tattatgaac atgagtgcac 2460 caacaacacg caatgggctc cagtcagctc ttggaaattt tacgtggata aaatcatggc 2520 tatcagcaaa ctacaacgaa aacgtcgccg agaacagctg ctcacaactt ttaaaggagc 2580 taaataactg catgaagggc agtaaaaaga aattcaaaat gacgaaggaa gcgcaagtag 2640 cgttcgcaac cgccaagaaa agaatttcgt ctgacaaggt atacgcatat gctgatttcg 2700 acaagccctt tatactcgtt tgtgatgcat cagctgtagc tatgggcgca ttattaattc 2760 aaattcacga taaaaaacag aagatcgtcg ctgcggcatc aaagacgttc acagaatgtg 2820 aacaacgatg gagtacgagc gaacgcgaat gttactgtat agtgacgatg tgcgagcgct 2880 tcgattatta tctacgctga ccaaaggcct tcacagtact aaccgaccat cgtccactca 2940 cagcactgga cgtaaaatca tttggatcgc cgaaattagc aagatggcag ttacgactac 3000 aatgttaccg gttcgtaaca cagtatataa aaggatcgca aaatgtctgg gcagaccttt 3060 tgtcccggcc attcgaccga gctccaagta agcttattga aaaaaatacg gtaatgggga 3120 gctactacaa acttgatgac gacagctcta tggaaatata tatcccatct tgggtccaaa 3180 aaacggactt gccgaaagag ctcttgctca aaaagagtca cgtaacagcg tgtcacgccc 3240 taggccattg cttcacctct ggtttcgaaa aactgccgat tagcgaattc agaataatcg 3300 agggggcgca agcggaggat ccaacaattg gaattataag agaccttatc cggaacgatg 3360 ttgacgttga caaatggtcc gcaccagata atgcttacga atggaaaaag atgaaaaaat 3420 tcgcgaattt cctgtttatc gacaaaacga cgaatctcct cctcattaaa ctggagggca 3480 atgtcccaaa agttgtactg ccgcgcgcac ttcgaacaag gtacattcga gacgctcatg 3540 aaaaaggaca tttcggagtt gcacgaacaa tcgaaatcat gcactggtgt tggtggagca 3600 acaaacgcga cgacatctca gatttcattt cgacatgtga atattgcgca cgaagaaagg 3660 gttcatatat tcaacctgcg actccgccgc tcaaacatgt tctaaaggga acccggcctt 3720 tcgagataat atattgcgat tacgtccaca tgccgacagg aaaaactggt aagcgctata 3780 tactgactat acaggacgga ttctcgcgat tcttgtatac aataccacag gcgagaaacc 3840 gagcgattga cgcagctcga gggctataca cctttatgct ccaatacgga ttcccgagtg 3900 taatctcgtc ggataagggt cgccatttta tatcacaggt tattgaggat tttgctaatc 3960 ttctaaaaat acgtcaaaat ctacactgca gctttcgacc acagtcaaca gggactctag 4020 aaagagtcca tagagtactc aaaaacagtc tctggggcgt cgcgaacgat caggggtgcg 4080 actgggaaga cgcgcttccg tctgttacgt catggataaa ccgaggatat tgtaagtcta 4140 taaagtcaac gccatggaag atcatatttg gatatgatta taacgattcg ggactcgctc 4200 ttccgataaa ctcaaccgca gctaaaaccg cgaaccaata cgctaagcaa atacgaaata 4260 cgctaagcat tgcttatgaa atggtaaaac tcgcacagtc agaagcagac gccgctgttg 4320 aggctaaaaa accgtatttt gctccagtgg cgatcgcttc aggagatcag atacttctca 4380 ggcgcgaagt cagcgcggaa gcgaaaaaga cgcatcagcc atacatcggc cccttcaagg 4440 tgctaaaatc aaacgactgt attttgctca ttgacagaga cggcgccacc gaatttgtac 4500 atcgcggcca cgtgatcaaa tacaaaccgc gcactggcca gcaggacgac gatattctcg 4560 agcttatgga cactggacca gatgacgcga gaccgacacc aaccacagag ctacgcagga 4620 gcacaagact caaaaagcct attgcccgcc tcggaatcca ctcatccgct taatagtcga 4680 ggtagtcata ttgctttccg tttctcatca tgttttgtgc gcatcaattc tcatccgcat 4740 cttatggaat tcatccttgg acgatctcac gctcagcgac ttaattgtcg aggtatgatt 4800 ccacttttca tattttcatc gaatttatgc tcacaacttc accgccgacg tcattgccta 4860 cttcacaact ttcaactccc agcagcttcc gacgactcgc ctgcgctgac cggagggc 4918 // ID BEL-236_AA-LTR repbase; DNA; INV; 726 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-236_AA_; KW BEL-236_AA-I; BEL-236_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-726 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 928-928 (2011). XX DR [1] (Consensus) XX SQ Sequence 726 BP; 210 A; 198 C; 160 G; 158 T; 0 other; tgtacgaata tcccccagaa gctacaaccc tgacctgcag tatgcagtgt tgcactttcg 60 ctttcgtttg acccaaacaa acaacgttat tgttcgcaac tacggcctgg ctggatagca 120 gaacaatacc catcatacac cgtctctgat ccagtatctg ttggcacagc atccacacat 180 agctacgagg ctatcatcat gcgagcggat ctgatgtcga gatgagagga gggagaaccg 240 aatgcgatct aaatcccgat ggacagccga tcgtcacagg agattctcta cactggccat 300 gcacagagaa gaccgcgggt caactgatga ctagaaagag tatctgccgc aagttccaat 360 ccgatcgccg ccgggaatag ataaccgaaa agtagtgatt agttactctg tgttttagtt 420 tgcccaataa atgtagcatg ttcagtgtag tgtttattta aataaagtgt ttgttagttt 480 atcgatgttg gagtgttcaa ttgtgtacca gcgaactccc tggccaccaa ggacgctccc 540 aaggacccaa gcccacggaa ataagcccac aacccgcgga aacaagccac caacactcca 600 cggcagacaa gtcacccagc ttacccacca cccctacccc taagcaacat caaccaaccg 660 gtcccatcag tgttggagtc ggatggaaat ctccactgat cgaaggtaag ggccgactat 720 ccaaca 726 // ID Troyka-1-I_BF repbase; DNA; INV; 5556 BP. XX AC . XX DT 29-APR-2008 (Rel. 13.04, Created) DT 29-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE Internal portion of the amphioxus Troyka-1_BF autonomous LTR DE retrotransposon - consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Troyka-1-I_BF; KW Troyka-1-LTR_BF; Troyka-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5556 RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., RA Salamov A., Terry A., Shapiro H., Lindquist E. et al.; RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire RT and genomic organization."; RL Science 317(5834), 86-94 (2007). XX RN [2] RP 1-5556 RA Kapitonov V.V. and Jurka J.; RT "Troyka - a distinctive group of gypsy-like LTR retrotransposons RT inducing 3-bp target-site duplications."; RL Repbase Reports 8(4), 510-510 (2008). XX DR [2] (Consensus) XX CC Troyka is a distinctive group of gypsy LTR retrotransposons. CC Troyka retrotransposons differ from canonical gypsy and other LTR CC retrotransposons by the length of target site duplications (3-bp CC in Troyka) and by the unusual 5'-CG and CG-3' termini of LTRs. CC The ~1600-aa polyprotein encoded by Troyka elements is composed CC of the aspartyl protease, reverse transcriptase, and integrase CC domains. The polyproteins encoded by different families of Troyka CC present in the cnidarian and amphioxus genomes are ~50% identical CC to each other. However, there proteins are only 20-30% identical CC to the pro, RT and INT domains encoded by canonical gypsy LTR CC retrotransposons. CC Troyka-1_BF is a very young family of Troyka LTR retrotransposons CC present in the amphioxus genome. Some copies of Troyka-1_BF are CC flanked by 100% identical LTRs. XX FH Key Location/Qualifiers FT CDS 85..4932 FT /product="Troyka-1-I_BFp" FT /note="Gypsy polyprotein." FT /translation="MASAAPRIPKQWQLTKTETINSFESWRANLIYSLSLD FT RNFAPFLVNGVTWQKASVENRGLQADGEAVPEASRRTAAQKAASLEMMLGQ FT IAIWCPVISKNTIVKASTSLSHIWQAIRLHFGFQSTGGYFLDIANIKQDTN FT ERPEDLYQRLTAAIEYNLLSVEGGITHHGEVVSADEEVSPSLENMIVLLWL FT QLLHPNLPALVKQRYGSELRNKTLASIKPEISLALASLLDELNTTEDIRSL FT RLSQARKPSPRFRPPPRPQRPKPKAQYKACPLCQQAGRPGYNHFLSECRYL FT PEQDKQYIIRSRVVEVDEDEHFDEPDYTDQDEPQVLVAETVTPPKSIPSQV FT SRVTVRPSPYMDAFHHGQSVHILLDSGAESSLIREDEAHRLGLQIVPNYSQ FT TPSQADGGRMSGILGECKTTFSRHTLPLQFDALVVKDLASPIIAGSPFLET FT NGFTIDFKHKRIQLPDGSTTDYSPVRKPSQPRIHRISTNLLRMQDRTTTVW FT PGDFVDLALPVEITDYQDVVLEPRYDSLSPEQMKAFEQGTYRNVAGHIRVP FT NVSSAPVHVAKNMHLFNAVPVVPVEDYPLDGQVYDKSPTPAHPPTPFSDSI FT QLDPDHTLSSTQRYQLSSVLRTYDQVFDTAFPGYNDAVGPIRAVVNMGPAQ FT PPQRKGRLPLYGRNRLVELQQKLDELETRGVIATPESADTVVEYLNPSFLI FT NKSSGGHKLVTAFTDVAKYCKPQPSLLPDVNQVMRQIACWKYIIVSDLTSA FT YHQIPLHVNSRKYCGIVTPFKGIRVYCRSAMGMPGSETALEELMSRTLGHL FT LMRGAVAKIADDLYCGGNTFEELLINWRDVLTALRKCDLRLSPSKTIICPR FT STTILGWHWNQGQISASQHTLCTLATCPAPQTVTALRSYIGSYKALSRVIP FT GTSVLLGPLDNIIAGQSSKDKVQWTDETMLAFRQAQRGLQAHQAVVLPRPE FT DELWLVTDAAVRPTGIGGTLYIRRNNTTKVAGFFSAKLKPHQRRWLPCELE FT ALAISAALKHFKPYFIQTTKPAFVLTDSKPCVEAVGKLRRGEFSYSPRVTT FT FLSAVSQFPIDVKHISGKHNISTDFASRHPLECHEDKCQICSFIHAISEEP FT VINTSTLPQGPDLNSPIYTSRQAWSNIQSSCRSLRRVFAHLRQGTRPSKKE FT TDISDIKAYLSKVSITTGGLLVVPNRDMFGVTRDRIVVPRKVVHGLATAIH FT LKLNHPSRHQLKTVFSRYFFALGMDTVIQKVTEACDQCASLKCQIPVPPIF FT TTEAPPQTVFTSFSADVMKRARQNILVVRENSTSYTCAHIIPDETSTALRD FT GLISLCTPLRLLDGSPAVIRCDPAPGFQALVNDPWLKDNRLQIEIGEPKNI FT NKNPVAERAIQEVRAELSKSDPLGQPVSPAELAVVMARVNAKIRTNGLSSR FT EYLQQRDQFSGEQLPMSDRTLLKEQHHRRRSNHKPSIKSKSPRTKAHATTP FT SIAVGDLVYLNTDRDKTKGRNRYLVTNVEQNWIHVSKFTGHQLRSKSYKVR FT PNQCYLVPNQVTLQNRRPTHDEADSDSTDTDEVRPCPEVTDFPVPDPPPDF FT LVPPDPPPVPEPPPAAEPHLQPATPEPHPPPRHSKRTVRPPPYLQDYVM" XX SQ Sequence 5556 BP; 1546 A; 1664 C; 1151 G; 1195 T; 0 other; tggtggcagc gtcgggtgtt cagtaaaccc acagcttatc tacagcaccg catccggcta 60 taggaacacg ctagacgacc gagaatggcg agcgccgcac cgaggatccc caagcaatgg 120 cagctcacta agactgagac catcaactcc ttcgagagct ggcgggccaa cctcatctac 180 agcttgtcgc tcgaccgtaa cttcgcaccc ttccttgtga acggcgtcac gtggcagaaa 240 gcgtctgtgg aaaacagagg tctccaagca gacggcgagg ctgtgccaga agccagccga 300 cggacggcgg cccagaaagc agcctccctg gagatgatgt tggggcagat cgcgatctgg 360 tgtcccgtca tttccaaaaa cacaatcgtc aaggcaagta cctctctttc ccacatatgg 420 caggccattc gccttcattt cgggttccaa tcgactgggg ggtacttctt ggacattgca 480 aatatcaaac aggacacaaa tgaacgccct gaagatttgt accagaggtt gacagctgcc 540 atagaataca acttactatc cgtagaaggg ggcatcaccc atcacgggga agtcgtctct 600 gcagacgagg aagtctcccc gtctctcgag aacatgatcg tgttgctctg gctacagctc 660 ctacacccga acctccccgc tctcgtcaag caacgctatg gctctgaact ccgtaacaaa 720 actctagcgt ccattaaacc agaaatctct cttgccctgg catccctgtt ggatgaactc 780 aacaccacag aagatatcag atcccttcgc ttaagccaag ctcgtaaacc gtccccacgt 840 tttcgtccac cacccagacc tcaacgccct aaaccaaagg cccagtacaa agcgtgccct 900 ttgtgtcaac aagcgggtcg cccaggatac aatcacttcc taagtgaatg caggtacctt 960 ccagagcaag ataagcagta cattattcgc tccagagtgg tagaagtgga cgaagacgaa 1020 cactttgacg agcctgacta cactgaccag gacgaacccc aggtcctggt agcagaaacc 1080 gtaactccac cgaagtccat cccatctcag gtttcgaggg ttactgtgcg tccctcaccc 1140 tacatggacg cattccatca tggtcagtcg gtacacatcc tactggactc gggtgctgaa 1200 tctagcttga ttcgtgagga tgaagcccac cgcctgggat tacaaattgt tcccaactac 1260 tctcagacac caagccaagc tgatggcggc cgcatgtctg gcatcctcgg tgaatgtaag 1320 accaccttct ccagacatac actacctctg cagtttgatg ccctggtagt taaggactta 1380 gcctcaccta tcatcgctgg cagtccattc cttgaaacta atggcttcac tatagacttc 1440 aaacacaaac gaatccagct cccagacggc tctaccacgg attattcccc ggttagaaag 1500 ccttcccagc cacgcataca tcgcatatcc actaacctcc tcagaatgca ggacagaaca 1560 accacggttt ggcctggtga cttcgtcgac ctagctcttc ctgtcgagat aaccgactac 1620 caagatgtcg tcttagaacc caggtatgac tccctctcac cggagcagat gaaagctttc 1680 gaacagggca cataccgcaa tgttgctggc cacatccgag tccctaatgt ctcttctgca 1740 cccgttcatg tcgccaagaa catgcacctt ttcaatgctg tgccagtggt tccagtagaa 1800 gattacccac ttgacggaca ggtgtatgac aagtccccga cacccgccca cccgccaaca 1860 cctttctccg actctatcca actcgatcca gaccatacac tcagctccac ccagcgttac 1920 caactatcgt ctgtcctccg cacctatgac caggttttcg acacagcgtt tccagggtac 1980 aatgatgctg tgggccccat ccgagcagta gtaaacatgg gaccagcaca accgccacag 2040 cgtaaaggcc gtctcccact gtacggcaga aaccgcctag tggaactaca gcagaaacta 2100 gacgagctgg aaacgagagg agtgattgca acaccagaat cagcagatac agtagtggaa 2160 tacttaaacc cttcattctt aatcaacaag tcaagcggcg gtcacaaact ggtaactgcc 2220 ttcactgacg tcgctaaata ttgtaaaccg caaccttccc tcctccctga cgtaaaccaa 2280 gtaatgcgac agatcgcctg ctggaagtac atcattgtct ctgatcttac ttcagcatat 2340 catcaaatcc cccttcatgt gaactctcgc aagtactgcg gcatagttac accgttcaag 2400 ggtattcgtg tctactgcag aagcgccatg ggtatgcctg gcagcgaaac agctctcgag 2460 gaactcatgt cccgcacact tgggcatcta ctgatgcgcg gagctgttgc aaagattgcc 2520 gacgatctgt actgtggggg gaacaccttt gaggagctcc tcatcaattg gagagacgtt 2580 ttgacggccc tgcgcaaatg tgacctacgc ttgtctccct ccaaaaccat catctgccct 2640 agaagcacga caatcctggg ctggcactgg aaccaaggcc agatctctgc ttcccaacac 2700 accttatgca cactagccac gtgtcctgca ccgcagacag taacagctct gagatcctac 2760 atcggttcct acaaagccct ttcccgggtt atccctggca catcagttct cctaggtcct 2820 ctggacaaca tcattgcggg tcaatcttcc aaagacaagg tccagtggac ggatgaaacc 2880 atgttagcct tccgtcaggc tcaacgtggc ctacaagcac accaggctgt ggtcctcccg 2940 agaccagaag atgaactctg gctagtaaca gatgcagcag tccgccccac gggcattggt 3000 ggaacactgt atatccgacg caacaacacc actaaagtcg ccggtttctt tagtgctaaa 3060 ctgaagcccc accagcggcg ctggctgccc tgtgagctgg aagcactcgc tatctccgca 3120 gcactgaagc acttcaagcc ttatttcatc caaaccacca agcctgcatt tgtgcttact 3180 gactcaaaac cgtgtgtgga agcagtaggc aagctacgca ggggagagtt ctcctacagc 3240 cctcgagtca ctaccttcct atctgctgtc agccagttcc ctatcgatgt caaacacatc 3300 agtggcaaac acaacatatc aacagacttt gcaagtcgtc acccattaga atgccatgaa 3360 gataaatgcc aaatttgctc attcatacac gcaatcagtg aggaaccagt catcaataca 3420 tcaaccctcc cccaagggcc agacttgaac agcccaatat acaccagccg acaagcatgg 3480 agtaatattc agtcgtcctg ccgatcattg cgtagagtat tcgcccacct ccgacaaggc 3540 accagacctt ccaaaaagga aacagacatt agcgacatca aagcctatct ctccaaagtc 3600 tccataacca ccggcggcct actggttgtg cccaacagag acatgttcgg cgtcacaaga 3660 gaccgaattg tcgttccgcg taaggttgta catggtctgg ccacggccat ccacctcaag 3720 cttaaccatc cttcccgaca tcaactcaag acggtcttct ccagatattt cttcgccttg 3780 ggcatggata ccgtcatcca gaaggtaaca gaagcttgtg accagtgcgc ctccctgaaa 3840 tgccaaatcc ctgtccctcc aatcttcact actgaggctc cgccccagac tgtcttcacg 3900 tcattctccg cggatgttat gaaacgagcc cgccaaaaca ttcttgtcgt cagagagaac 3960 agcacttcct acacgtgcgc ccacatcatc ccagatgaaa catcaacagc tctccgagac 4020 ggcttgatct ccttatgcac tccacttcgc ctgttagatg gctccccagc cgtcatccgc 4080 tgcgatcctg ccccaggatt ccaggcgttg gtcaacgacc cctggctgaa agacaacaga 4140 ctccagattg aaatcggcga gcctaaaaac atcaacaaaa acccagtcgc agagagagcc 4200 atccaagaag tccgggcaga actcagcaag tctgacccgt taggccaacc cgtgtctccc 4260 gcagagctcg ctgtcgtcat ggcaagagtt aacgccaaaa ttcgtaccaa cggcctctcc 4320 tcccgtgagt acctgcaaca gcgagaccag ttctccgggg aacagctacc aatgtcagac 4380 agaacccttc tgaaggagca acatcacaga cgacgaagca accacaagcc cagcatcaaa 4440 tccaagtcgc cccgaacaaa agcccatgca accaccccct ctatcgccgt tggtgatctt 4500 gtgtacctga acacagacag agataagaca aaaggtcgga acagatacct cgtcacgaac 4560 gttgaacaaa actggatcca tgtgtccaaa ttcaccggtc atcagttacg ctccaagtca 4620 tacaaagtac gcccaaacca atgctatctg gtccctaacc aggtgacact gcaaaaccgt 4680 cgcccaacac acgacgaggc tgattccgac agtaccgaca ccgacgaggt ccggccctgt 4740 cccgaggtga cagacttccc ggtcccagat ccaccaccag actttctggt cccaccagac 4800 ccaccacctg taccagagcc accacctgcc gcagagccac acctacaacc tgctactcca 4860 gagccacacc cacctccacg acattccaaa agaaccgtcc gcccaccacc gtacttgcag 4920 gactacgtca tgtaaacaca aactctatac ctcagtttct cctatccaga atacttaaac 4980 ttcaacttac tacacagact cttgtatcag aactcaccat accaaagaac tgtactcgaa 5040 catcattata cctagcaagt acctggtagc caccaagact acacatcgca cttcaatttg 5100 ccatgttcct agtataaggt tgttataaca aacgatagcg tacaagttct ttagaagttg 5160 aaagacttat gtttattcac gacaataata gcataaaagt tatttagcaa aagaaagttg 5220 tttcactaca acaacaagca cgaaaaataa tttctttaga agtagaaaga gttgtgttct 5280 tttagaagta gaaagagttg tttcatacca acaatgctgg cataaaagtt ctttagaagt 5340 agaaagagtt gtttcgtacc aacaatgatg gcataaaagt tctttagaag tagaaagagt 5400 tgtttcgtac caccaatgat ggcataaaag ttctttagaa gtagaaagac ttgtattagc 5460 cccaaagtta gattacatat tttcatcatt gtattctagt cgtaactctt acacaatcgt 5520 aatcatacag aacgaaggaa gaaagaaagg taatca 5556 // ID Gypsy-37_DWil-I repbase; DNA; INV; 5489 BP. XX AC scaffold_181148; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_DWil_; KW Gypsy-37_DWil-LTR; Gypsy-37_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5489 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181148; Positions 180652 175164. XX CC Positions [4538-5029] - Integrase core CC 'GAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 918..2216 FT /product="Gypsy-37_DWil-I_2p" FT /translation="MSDEDTGNINIRETPIDTAPSTSAGMRTRSQTQLLQE FT QAARSQSFVSERSNVETGLTSIGENIIDHDQFRTLQNELMTALTAQMARLI FT ETNVTACMQKFNPVAPRRTSPLYQIDQLDLNNWNMSGEDQRRGDYGQRSSA FT SRSVESDLLQRPDKVSHIMNGWKLKFTGKGLSIDHFIYRVEALTHQTLEGN FT FDIMCRNASSLFEGKANEFYWRYHKSVGQIRWSTLCTAFKKQFKDLRDDGD FT IEEDMRRRKQDVNETFDQFYDAIITLSDKMDHPMSNNRLVRILKNNLRPEI FT RHEILNIEITSVSELREVCRKRESFLADVKRSHGYYKKTPFKKEVSELLPS FT EQDLEGSDIEVEDEMEIDALSLICWNCRREGHRYQDCLEERRIFCYGCGLD FT NTYKPNCAKCAKNGYKSAQNVGHRPKVMRAPKPNSQSNK" FT CDS 2126..5398 FT /product="Gypsy-37_DWil-I_1p" FT /translation="MCKKRVQERAERGTQTKSDACSQTEFPIEQVNHSTAR FT CPTTNNLVSETLQKNISSSQSRSNKRQKSFWAEIKTVQRSMDSICALDQGS FT HDPRPFITVGILDRAISGLLDSGAGVSCIGGELAKQIISGGMPYKKLKVNV FT TTADGSPQNVVGKMQTNISYNGQTKEVQIYLVPTLRLDLYLGIDFWKAFEL FT LPSLNIAELDLTTIEHNLNHQQMGQLKKVINQFPSFAQEGLGRTNLISHSI FT DVGNAKPVKQRHFPVSPAVEKLMYVEVDRMIELGIIEESVSAWSSPVVLVQ FT KPGKVRLCLDSRKVNEFTIKDAYPLPHIGGILSRLPKAEFITSLDLKDAFW FT QIPLELSSREKTAFTVPGRPLYQFKVMPFGLCNAPQTMSRLMDKVVPAKLR FT NEVFVYLDDLLIVSDTLECHLDVLRELALCITRAGLTINIGKSKFCMKQVK FT YLGHIVGEGGIRTDPEKVAAITDFPVPKSIKALRSFLGLCGWYRKFIENFA FT SLTAPLTDLLRNKRKFVFDEAAEQAFQNIKKKLSKAPVLISADFKKPFYIH FT CDASKTGVGGVLVQLSDEGDECPISFVSKKLNKAQKNYSVTEQECLAALVC FT LKNFRAYVEGHEFTIITDHASLKWLMTQTDLSSRLARWALKLQGYTFKIEH FT RKGKLNVVPDVLSRAHSEEIAAINAEDGLYVDLDSEHFKSKSYLELCDRIK FT GMEQSLPDLKVIDGYVYRLAEHTNGDQLHDSQVWKLWLPSELVPEILKKAH FT DSPLASHGGIHKTIERIRRHYFWPGLVGDVKSFIGECETCKTTKAPNFIMR FT PLMGKTPESQRFFQKLYVDFLGPYRRSRRGHIGIFIVLDHLTKFVFLKAVK FT KLTADVTIQYLESELFHAYGVPETIVSDNGSQFRSEAFKKLLNRYNVTHTL FT TAVHAPQANASERVNRSIIAAIKAYVRDDQKDWDEYLSQINCALRSAQHSA FT IGTSPYYLVFGQHMITSGKTYSLIRKLELLSDRSLRLNQPDSLDLARSKAC FT AVMQKQQQKNGKAYNLRARQINYRVGQEVFRRNFKQSNFAQGYCSKLGARF FT IKARVRKKIGTSNYELEDLQGHLIGNYHAKDIRP" XX SQ Sequence 5489 BP; 1785 A; 967 C; 1180 G; 1557 T; 0 other; gttataatta tatcattatt ttattatttg gcgcccaatc gtgaggcctc attacaacat 60 ttagatgaag tcaatgaaac tcgcaaatat tcaattttca aagaataaga aatctctatt 120 tcctttcact tttatgacgc ctttagatta gctattgagt tcggaacaca gaaagcaaaa 180 tacaagtatt agcttctgtc catcaactta gcaaactatt tgagtcaact ttagaactaa 240 ataatgagac gagaaatttt gctggaattt aagcagaaat taaaactaag gtccaagcgc 300 ttaaccgctt ttaaaacgtt gattttattt aggttcttac atgtatttat gaatacatct 360 ttggtttacg ctcgctcagt cgcataggaa ttttaatcaa tactctccgg aaatgcgatt 420 taaccacgat tgagttggtg tttgctaaga tctataaatt gaaaatttgc ccattgtaag 480 cgcataggat tcgtcgggag aaccatcaaa agaagttttc gatgaccacg atttcaatga 540 gtgaggatat tttcaattct aattctattt taaactatct atcctatttc tattttatct 600 aaatagtctg tttatgctaa taggcagttt gtatgcttta ctgtgggaga gtgtagcatg 660 gcatgaattc atttttttat gatttatttt ggttactttt tgtttcgatt gtttactgaa 720 ttggactcga ttcatttgtc ttttattcat tgtctttcta attactgtcg atttttaata 780 tattttatta gattcttata aatacaatac ttactaaata taatagcagt tagttaaaat 840 aatttccatc ggtagtaggg tcaaataaca cttatagtaa gatagggtaa actgtatgaa 900 attaaattta attcacaatg agtgacgaag ataccggcaa cataaatatc agggaaactc 960 ccatcgatac tgctccttct acgagcgcag gtatgaggac aaggtcccaa acacagttgc 1020 tacaagaaca agcagctaga agtcagtcct ttgtttcgga aaggtcgaac gtagaaactg 1080 ggcttacttc aattggggaa aacataattg atcatgacca attcagaacg ctgcaaaacg 1140 aattaatgac tgcgctaact gcgcaaatgg ctcgtcttat tgagacaaat gtgacggcct 1200 gcatgcaaaa gtttaatcca gtagcaccaa ggcgaacatc cccgttatat cagatagatc 1260 agctggatct aaataactgg aatatgtctg gtgaggatca gcgaaggggc gattatggtc 1320 agagatcatc tgccagcaga agtgtagagt cagacttgtt acaaaggcca gataaagttt 1380 ctcatattat gaatgggtgg aaacttaagt ttactggtaa aggactatca atagatcact 1440 ttatttatcg agtagaagct ttgactcacc aaacgcttga aggaaatttt gatattatgt 1500 gccgaaacgc cagttcatta tttgaaggaa aggctaatga gttctattgg cggtatcata 1560 aatcagtagg gcaaattaga tggagcacct tgtgtacggc atttaaaaag cagtttaaag 1620 atttgcgaga tgatggggac attgaagagg acatgaggcg tagaaaacag gatgtcaatg 1680 aaacttttga tcaattttat gacgctataa tcactttgag cgataagatg gatcacccca 1740 tgagcaacaa cagacttgtg agaattttaa aaaataattt gcgaccggaa atacgacacg 1800 aaattttaaa tattgagatc acctcagtct ctgaactacg ggaggtatgt agaaagcgtg 1860 agtccttttt agctgatgtg aaacgtagtc atggctatta taaaaaaact ccatttaaaa 1920 aggaagtatc agagctattg ccgtcagagc aggatttgga aggatcagat attgaagtcg 1980 aagacgaaat ggaaatagat gcattatctt taatttgttg gaattgtaga agggaagggc 2040 atcggtacca agattgtctg gaagaaagga gaatcttctg ctatggctgc ggcctggata 2100 atacatataa acctaactgt gccaaatgtg caaaaaacgg gtacaagagc gcgcagaacg 2160 tgggacacag accaaaagtg atgcgtgctc ccaaaccgaa ttcccaatcg aacaagtaaa 2220 ccacagtacc gcgagatgtc caaccaccaa caacttagtc tctgaaaccc tccagaaaaa 2280 tatatccagc tcacaatctc gtagcaacaa acgtcaaaaa tccttctggg cggaaataaa 2340 aactgttcaa agaagtatgg atagtatttg tgctttagat cagggcagtc atgatccgcg 2400 gccatttatt acggtgggca tattagatcg cgcaattagt ggattgttag actcgggggc 2460 tggagtatct tgtattggtg gtgagttggc caaacaaata atatccggtg gaatgccata 2520 caagaagctc aaggtgaacg tgaccaccgc tgatggcagt cctcagaatg ttgttggcaa 2580 aatgcagaca aatatttcat ataatgggca aacaaaggaa gttcaaatat atttagtacc 2640 tactcttaga ttagatttat atttaggcat cgatttctgg aaagcattcg aattgttacc 2700 atctcttaac atagcagaat tagatctaac aactattgaa cataatttaa atcaccagca 2760 gatggggcaa ctaaaaaagg ttatcaatca attcccatcc tttgctcaag aaggtttagg 2820 aagaactaat ctaatttcgc attccataga cgtgggcaac gcaaagccag ttaagcaacg 2880 gcatttccca gtatcgccag ctgtagagaa attaatgtat gtagaagtgg atcgcatgat 2940 cgaattaggc attatagaag agtcggttag cgcctggtct tcaccagtgg tgctggtaca 3000 aaaaccaggg aaggttcgac tatgtttgga tagccgaaaa gtaaacgagt tcaccataaa 3060 ggatgcgtat ccccttcctc acattggcgg gatcttgagt agacttccga aggcagaatt 3120 tataacaagt ttagatctaa aagatgcgtt ctggcaaatc cctttagagt taagttctcg 3180 ggaaaaaaca gctttcacag tgcctggcag gcctctttac caatttaaag ttatgccgtt 3240 tgggctctgt aacgctcctc aaacaatgtc tagattgatg gataaagtgg tgccagctaa 3300 attaaggaac gaagtcttcg tttaccttga tgatctctta atcgtttccg acaccttgga 3360 atgtcattta gatgttttac gggaattggc tttatgcata acgagggctg gtttgaccat 3420 aaacatcggt aaaagtaaat tttgcatgaa gcaggttaaa tatcttgggc atatagttgg 3480 tgaaggtggt attagaacag atcccgaaaa ggtggctgcc attacagatt tcccagttcc 3540 gaagtcaatt aaagctttaa gaagtttctt gggtctgtgt ggctggtacc ggaaattcat 3600 tgaaaatttc gcttctttaa cagcaccatt gacagatttg ctaagaaaca aaagaaaatt 3660 tgttttcgat gaggccgctg aacaggcgtt ccaaaatata aagaaaaaat tgagcaaagc 3720 tccagtctta attagcgcgg atttcaagaa acctttctat atacattgtg acgccagtaa 3780 aacgggggtc ggaggtgttc tagtacaatt gtccgatgaa ggagacgagt gtccaatatc 3840 cttcgtatca aaaaaactga ataaagctca gaaaaattat tctgtaaccg agcaagaatg 3900 tttggctgca ctggtgtgtc taaaaaattt tagggcttat gtggagggcc atgaatttac 3960 catcatcacc gatcatgcct cattaaaatg gctaatgact cagacagacc ttagttctcg 4020 tctagcaaga tgggccttaa aattacaggg ctatactttc aagatcgagc ataggaaagg 4080 aaagctcaat gtggtcccag acgtgctgtc tcgtgcacat agtgaggaaa tagcagcaat 4140 caatgcagaa gatggattat atgtggattt ggactcagag cactttaagt caaaaagcta 4200 tttagaatta tgtgatcgaa ttaaaggcat ggaacagtca cttccagatc ttaaagtcat 4260 agacggatac gtgtatagac ttgctgaaca caccaatggg gatcagctac atgatagtca 4320 ggtttggaaa ttatggttgc ccagtgagtt agtaccagaa atcttgaaga aagcgcacga 4380 tagtccatta gcctcacatg gcggaattca caaaaccatt gaaagaatca ggcgtcatta 4440 cttttggcca gggttagtgg gcgatgttaa atcatttatt ggtgaatgtg aaacttgcaa 4500 aacaacgaaa gctccaaatt ttataatgcg tccactgatg ggaaagactc cggaatccca 4560 aaggttcttt caaaaattgt acgttgattt tctaggtccg tatcggcgat cacggcgggg 4620 gcatattggt atttttatcg ttctcgatca tctcacgaag tttgtgtttt taaaagctgt 4680 aaagaaattg acggcagatg tcacaattca atatttggaa tctgaactgt ttcatgcata 4740 tggagttcca gaaactattg tctcagataa tggatcccag tttcgctctg aagcctttaa 4800 gaaattattg aatcgctata atgtgacaca cacattgacg gcggttcatg ctcctcaggc 4860 aaatgcatca gaaagagtta accgctcgat catagcagcc attaaagcct atgtcaggga 4920 cgaccagaaa gattgggatg aatatctgag ccagataaat tgcgccctaa ggtcagcaca 4980 gcattcggcc attggaacta gtccttatta tttagtattc ggtcaacata tgataacgtc 5040 agggaaaacg tactcattga taaggaaatt ggagcttctc agtgatcgat ccttacgtct 5100 taatcagcca gattcattag atttagctcg gagtaaggca tgtgcagtta tgcaaaaaca 5160 acaacagaaa aacggaaaag cttataattt gcgagcccgg cagataaatt atagagtggg 5220 tcaagaagtt tttagaagaa atttcaaaca aagcaatttt gcccaaggct attgctcaaa 5280 attaggtgcc agatttataa aggcaagggt acggaaaaag attgggacat ccaactatga 5340 attggaggac ttacaagggc atttgatcgg caattatcat gcaaaagaca ttcgacccta 5400 atgggggaaa aaaagaaaaa aaacagcaac aagcggaatc agctttggtt tgctttgcaa 5460 cccgtagtct gatttttcca ggtgggatt 5489 // ID Gypsy-607_AA-I repbase; DNA; INV; 5993 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-607_AA_; KW Gypsy-607_AA-LTR; Gypsy-607_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5993 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3196-3762] - Reverse transcriptase CC Positions [5074-5553] - Integrase core CC 'AAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 933..2288 FT /product="Gypsy-607_AA-I_3p" FT /translation="MNRLRDIEEETKREHQNLKKSINRARTLQGILQKSEK FT LKDFRNEYKTILKSYEYRLTPKVWDLEVTNYEKLKLIFEECHTILDRSKIL FT PTFKAIAKAAIICNRLSLKNQEVNVTVTTTTMPLDIKLATAIVQPYDGSPN FT GLDTFIDSANLLKESVDGTNLQTAIKFLRTRLTGKARLGLPDNLTTIDALI FT DNVKSRCEEIVTPESVLAKLKTIKQKGDIDSLCNDVDSLCSQLRATYIASG FT VPDNVAKKMATKAGVDALATGVTNHETRIILKAGNFTDVKQAIQKVAENAN FT SNTNNSQILNFHTNRQFNRNRSVTNRSRGNYNYNGNYNYNDNYNSNYNNNY FT NNNRREFRNNDFRRQQPQERENFGRNYQRGNGSSHHPQNVQSNRNYHGRGG FT HNSFRRIYATNIETDQNQPQMQQQFTNSAQIPQSQPVTNNNSFLGNLASQA FT NLGQYMH" FT CDS join(3172..4077,4081..5904) FT /product="Gypsy-607_AA-I_1p" FT /translation="MEAQVQKMLDDEIIEPSISSYNSPILLVPKKSDNDTK FT KWRLVVDFRQLNKKVLADKFPLPRIDAILDQLGRAKYFSTLDLMAGFHQIT FT LDEDSRKYTAFSSASGHFQFTRLPFGLNISPNSFQRMMNIAMAGLTPECAF FT VYIDDIVVVGCSVDHHFDNLRKIFERLRHYNLKLNPNKCNFFRTEVTYLGH FT KITGNGILPDDSKFEAVKNFPTPTNADEVRRFVAFCNYYRKFVPNFANIAK FT PLNDLLRKDTKFEWTSVRHEAFNKLKKYLMSPQILRYPDFTKEFIVTTDAS FT DVACGAILSQYEEGDLPVAYASKSFTKGERNKPIIEKELTAIHWAIDYFRP FT YLYGRRFKVRTDHRPLVYLFNMKKPTSKLTRMRLDLEEFEFDIEFVPGKAN FT VGADALSRIVLNSEELKQLSILFVNTRSMAKKQNLGQDKVTPQKLVEPHLQ FT TDHLVVRETENPSEVQKLLKLDCVVHHNQLTIILLNHNSRKILKQFTTKFT FT RDNGSQALEFALLELMKYMKFYKRDKLALSMENEIFKFISLHTLKEIANNA FT ISTYEIILYKPPTFIENEEEIIKILSQYHMTPTGGHVGQHRLYLKLREIYK FT FKNMKNLISNFVRNCEKCKINKVTRHTKEPEVVTTTPSKPFEVLSVDTAGP FT LTRTNNGNRYILTIQCNLTKYVVLVPVPTKEAAVLARALVYNFILIYGTFL FT ELRTDQGSEYNNEILERICKLLEIKQNFSTAYHPQSIGALERNHRCLNEYL FT RCFTNAHQTDWDDWVKFYEFSFNTTPHTEHGYTPYELIFGRKARLPQESKN FT LSLEVEPMYNFEQYDAELKFRLQNSNKIARNNLIKNKHKRQNTANTNVNPI FT ALNAGDKVYITNENRKKLDPFYIGPYEIVKLEEPNCFIKHTVTNKVTKVHK FT NRLIKA" XX SQ Sequence 5993 BP; 2304 A; 1202 C; 1025 G; 1453 T; 9 other; aatggcgacc tgtgacttga gcctcttctc aacacagctc gaggaatagt gaagtgaagt 60 gaaaatcacg actttgaaca attatcgtcg caacaataaa ctcccaactc cagtgatttg 120 tgtgcaaaga aagtcaaaca caatgggatg gttcacgtcg gataatttca tttcgaacac 180 cgtcgaaata ccgacaacag aaatccgaat tatggtcggt gttgccatta ccatcagtgt 240 cgtgcttgtt ggatacctga tcctacgtct atacaacaaa tatcagaaga aatcgttaca 300 aaacactatg caacaagaaa tccgcatgaa caatatggaa acgaagtggc aggctaaagt 360 ttaaagtgac caaacccgta aaatacgcaa atgttcgaaa tagtgtgtgg aatgagtggc 420 tgctggccaa tttctgcaac gacgacaacg acgacaacgg cgataccaac aacatcgtgg 480 agcagcacct cgatcgaaac gaccacagtg cacgaaattg aatcgaacat taacagtgag 540 cttgtgctaa cggtaataat agtgatagca ctagcttgtt tagcaattgc gctagtatac 600 tatgacagac aaagaagaaa gaagcaaagt gcgaacagga aattaatgca gatcatcatc 660 gcgacagcgc atcgatcgat tatcgcagat gtcacaaacg agcagcagaa cgagcaataa 720 gcaaaccaga cgagacaaac agaaacaggc atgttttaac cagctagcag aggaatgagc 780 aatgagcaac ccggatgaga ctggcagaaa acggtatgtt ttattaaacg gattgaataa 840 ctattagtat tataacaaaa aaaaaaaaaa cgacaataat aattattgca aactagccta 900 acgagaaact cagtaaatta aaagcctctt gaatgaatag gttaagagat attgaagaag 960 aaacaaagag ggaacaccaa aacctaaaga aaagtattaa cagggctaga actctccaag 1020 gaatcttaca aaaatccgaa aaattaaaag attttcgtaa cgagtacaaa accattttaa 1080 agtcatacga atatagacta acaccaaaag tttgggacct cgaagtaaca aactacgaga 1140 aactaaagtt aatatttgag gaatgtcaca caattctaga cagatccaaa atactaccaa 1200 catttaaggc aatagccaaa gcagcaatta tttgcaatcg attgagcctc aaaaatcaag 1260 aagttaacgt aacggttact accaccacca tgcctctcga tatcaaattg gctaccgcca 1320 ttgtacagcc gtatgatggt tcgcccaatg gactagatac attcatagac tcggccaatc 1380 tattaaaaga atcggtcgac ggaacaaatt tacaaacagc aataaaattt ttacgtacca 1440 gactaactgg caaggcaaga ctaggtctgc ctgacaatct tacaaccatc gatgctttaa 1500 ttgacaacgt aaagtcaagg tgcgaagaaa tagtaacacc tgaaagtgtt ttggcaaaac 1560 taaaaacaat aaaacaaaaa ggggatatcg acagtctttg taatgatgtc gattccctct 1620 gcagtcaact ccgtgcgact tacatagcaa gcggagttcc cgacaacgta gcaaagaaaa 1680 tggctacaaa agcaggcgta gatgcactcg caaccggagt aaccaaccac gagaccagaa 1740 ttattttaaa agccggaaat ttcacagacg tgaaacaggc aatacagaaa gttgctgaaa 1800 atgcaaacag caacactaac aactcccaaa tattaaactt tcataccaat cggcagttca 1860 ataggaaccg gagtgtcact aatcggagcc gaggtaatta caattataac ggcaattaca 1920 attataatga taattacaat tctaactaca acaataatta caacaataac agacgtgaat 1980 ttcgaaataa cgattttaga cgccaacaac ctcaagaaag ggaaaatttt ggcagaaatt 2040 atcaaagagg taatggatcc tcacatcacc cacaaaacgt acaatccaat cgcaattatc 2100 acggcagagg cggacataac tccttccgtc ggatatacgc cacaaacata gaaacagatc 2160 aaaatcaacc acaaatgcaa caacaattta caaattctgc acaaatcccg caatctcaac 2220 ctgtaacaaa caacaactct tttttaggga atttggcatc ccaagccaat ctgggacaat 2280 atatgcacta aatctaagcg cagcaaattt cgtcaacgta tttgttggca tgaccaattc 2340 aacatgttca tttatagtcg atactggagc agatatatca atcatgaaag tagacaaact 2400 cctaccacac caaatagtgg acagaagcaa aaactatcga ataaccggaa ttaccgatgg 2460 cgtaaaagag acattagccg aagccattac gcccctaaaa ttcaataatg gattaatagt 2520 taaccacaat ttccagttag tagaaccgac ttccccattc caaccgatgg matactcgga 2580 cgtgatttcc tgacaacatt taagtgtacc attgactacg agcattggct gctaaatttc 2640 aaatttcaac aatcggaaat atcgattccc attgaagaca actacaacgg aaanatactt 2700 ataccagcaa ggtgcgaagt agtacgaaaa attccaaacc tcaatgtcaa agaggacacc 2760 ktagtgcatt cacaagagat acttcccggt gttttctgtg gaaacactat agtctcccaa 2820 aattcaccat gtgtaaaatt cattaatact actgagaaas aagtactaat acaaaatttt 2880 aaaccaaaaa ttgaaccgtt aaatcattat gtgaccgtta aacacaatca tttgagtaaa 2940 acgcagaact cagatagagt acaggaagtt atctcacaaa taaatttcga taaagtacca 3000 caatttgcaa ggaaaaatct ccaagcgctc atcacagaat tccaagacat attcagtcta 3060 cccgatgaaa aactacccac gaacaatttt tacagccaaa gtatacatgt taacgataaa 3120 atccctgtgt atatcccmaa ctacaaaaat atacactcac aaaatggcga aatggaagcc 3180 caagttcaga aaatgttgga tgatgaaata atagaaccat ctatttcatc ttacaactct 3240 ccaattttat tagttccgaa gaaatcggac aatgatacaa aaaagtggag acttgttgtc 3300 gatttccgac agttgaacaa aaaagtctta gcagataaat ttccacttcc acgtattgac 3360 gcaatacttg accaattggg aagagccaag tatttcagta cactagattt gatggcgggt 3420 tttcatcaaa taacactaga tgaagactct cgcaaatata cagccttttc atcagcatcc 3480 ggacatttcc aatttacaag gttaccgttt ggactgaata ttagtccaaa tagcttccaa 3540 cggatgatga acatagcaat ggcaggctta acgccagaat gcgcatttgt atacattgac 3600 gacattgttg tcgtaggatg ttctgtcgat caccattttg acaatttgag aaaaatattc 3660 gaacgattga ggcattacaa cctcaagctg aaccckaaca aatgcaattt tttccgaacg 3720 gaagtgacat atctaggtca caaaataaca ggcaatggca ttctaccaga cgattccaaa 3780 ttcgaagccg tgaaaaactt tcccacgcct acgaatgcag acgaagttcg tcgatttgta 3840 gctttctgca attattaccg maaattcgtc ccgaattttg caaacatagc aaaaccactg 3900 aacgatctgt tacgaaaaga cacaaaattt gaatggactt cagtacgcca cgaagckttc 3960 aataaactaa aaaagtatct aatgtcaccc caaatattac gatacccgga tttcactaaa 4020 gaattcatag tgaccaccga tgcttcagac gtggcgtgcg gagcaatcct ctcgcaaags 4080 tacgaagaag gagacttacc cgtagcgtac gccagtaaaa gttttacaaa aggcgaaaga 4140 aataaaccaa ttatagaaaa agaattaacc gccattcatt gggcaatcga ttactttaga 4200 ccttatttat atggacgaag atttaaggtc cgtacagacc atcgtccatt ggtttattta 4260 tttaatatga aaaaacctac atctaagctg actagaatgc gattagattt ggaggaattt 4320 gaattcgata tagaattcgt tcctggaaaa gcaaacgtgg gtgcagatgc actttcacga 4380 attgtcctaa actccgaaga acttaagcaa ctatccattt tatttgtaaa tactagatcc 4440 atggctaaga aacaaaattt agggcaagat aaagtgacac ctcaaaaatt agtggaacca 4500 catttacaga ctgatcacct ggttgtgcgt gaaactgaaa atccttctga ggtccagaaa 4560 cttttaaaac ttgattgtgt agtgcaccat aatcaattaa cgataatatt attaaatcat 4620 aacagcagaa agatactcaa gcaatttaca acaaaattta cgcgcgataa tggaagtcaa 4680 gccttagagt ttgctcttct agaattaatg aaatatatga aattttacaa aagagacaag 4740 cttgcattgt caatggaaaa cgaaatcttc aaattcatct ctttacatac actaaaggaa 4800 atcgcaaaca atgccatttc cacgtatgag ataattcttt ataaacctcc gacgtttata 4860 gagaacgaag aagaaattat taaaatctta tcacagtatc acatgacacc tacgggaggt 4920 catgtaggcc aacacaggct ctatttaaaa ctaagagaaa tttacaaatt taaaaacatg 4980 aaaaatttaa tatccaattt cgtcagaaat tgcgaaaaat gcaaaatcaa caaagtaaca 5040 agacacacca aggaaccaga agttgtcact acaactccat ctaaaccatt tgaagtactt 5100 tcagtggaca ctgcaggacc actgacaaga acaaataatg gtaaccgata tattttaaca 5160 atacaatgca atctaaccaa gtacgttgtg ctagtaccgg taccaaccaa agaagcagca 5220 gtattagcta gagcattggt atacaatttt attttgattt atgggacatt cttggagctc 5280 agaaccgatc aaggatcaga atataacaac gagattcttg aacggatttg taaactccta 5340 gaaataaaac aaaatttttc caccgcatat cacccacagt ctattggagc attagaaaga 5400 aatcacagat gtttaaacga atatctgaga tgttttacta acgctcacca aacagactgg 5460 gatgattggg tcaaattcta tgaattttca tttaatacaa cacctcacac agaacatggt 5520 tatactcctt atgaattaat ttttggaaga aaggcaagat taccgcaaga gtctaaaaac 5580 ttaagtttag aagtcgaacc aatgtataac ttcgaacaat atgatgcaga actaaaattt 5640 agacttcaaa attcaaacaa aatagcaaga aacaatttaa ttaaaaataa acacaaaagg 5700 caaaatacag caaatacaaa cgtaaacccc atcgcattaa atgcaggaga taaagtatac 5760 atcacaaacg aaaacaggaa gaagctagat cccttttata taggtccata cgaaattgta 5820 aaattagaag agccaaattg cttcatcaaa catactgtaa cgaataaagt caccaaagtc 5880 cataaaaata gattaataaa agcttaaaag ctagttaaat acttttaccg atacattaag 5940 actaatattt tcttttttcc gaatggttac accattctcc taaagggggg agg 5993 // ID BEL-118_AA-I repbase; DNA; INV; 2776 BP. XX AC supercont1.120; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-118_AA_; KW BEL-118_AA-LTR; BEL-118_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2776 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.120; Positions 1281625 1284400. XX CC 'ATGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 248..2776 FT /product="BEL-118_AA-I_1p" FT /translation="MSQGRSPKTKSASKAAAEAVKQKQNRDAEQHNWEITA FT DAETLKELSSLSRQRDQVNLKLARVHRALATAQNQLSLPQLKTFLKRVDDA FT YAEFSVVHTKLVAIIPNEAFRQQEEIYIAFEERYDYARTAIEELILDHEPN FT TIKTPTTQPQVIVQQQPLKAPIPTFDGSYANWPKFKAIFQDLMANSGDSDA FT IKLYHLDKALVGNAAGVLDAKVISEGNYQQAWAILTERYENKRIIVETHIR FT GLFDINKMASGSCKELRRLHDECSRHVESLKYLEQEFSGVSDLFLVHIVTS FT AMDQPTRMAWEATQKKGELPTYAQTISFLQTRCQMLENCEIAFPKSSSPLQ FT QRSNPKGQASTKPANVKVYAAGTEPRKKDSCDFCQGPHRNYQCDTISNMPF FT EKRMEKVRASGVCFNCLRKGHSARDCSSTKTCQKCQKRHHTQLHNEEVELE FT PKPSVSLTAQEPKPVTPQRMVFASASTRDSPIMPLFCNYADSSKTVFLLTA FT IVHTLDENDQPYPCRVLLDSGSQVNFVTADLANRLGSTKRRVNIPIRGIND FT VKTVSYDKVTIKFRSRVSDYETQVQCLVTPNVTGIIPAAKIDVSSWSIPFG FT VQLADPEFYKPEKIDMLLGAELFLQLLRPGYIKLNDDFPELRETTLGWVVA FT GVFREKPVASKTQHSLVASIDDTEEIPQQSNIDEVANPNPLPKQICKQCEK FT WKQRRKIRSECDNPNQRNLPPQDAIFRPSSLPYTLRVSDASLKTTAASNPL FT NEPSCTANCEQGSRRTTGFRPVLNVLKNTLRTSRDQKVGRMTREYAEPGKV FT ARSLDAHSNGPIYRRGSIPTKNNESRSPKKPVETSCQPGGV" XX SQ Sequence 2776 BP; 790 A; 773 C; 665 G; 548 T; 0 other; tagtggtcct tcgaaccgga tccagtgccg ccccaccgcc gttggaaagg attacgcacc 60 accgctaccc gacacgtttg tgtggcgctc cagccagaac gtgtaccgcc caaggatttg 120 atcagtcact accgtcgccg ccgcttgtga attgctgcat accgaccaca taccgcctag 180 ggagtactct gcaccgccgt tccgacccga cactcgctgc tggtgattct tccgaccaaa 240 attcgacatg tcgcaaggac gctcaccgaa gaccaagtcg gcttctaagg cagcagctga 300 agccgtcaag cagaaacaga atagagacgc cgagcaacac aactgggaaa tcactgccga 360 tgccgaaacg ctgaaagaac tatcatcgct gtcccgccaa cgtgaccaag tgaacctgaa 420 gcttgcgcgt gtgcacaggg cacttgccac cgctcagaac cagctcagcc tgccccagtt 480 gaagacgttc ctgaaaagag tggacgatgc gtacgccgag ttcagcgtcg tgcacaccaa 540 gctggttgcc ataatcccaa atgaggcctt tcgacagcag gaggagatct acatcgcttt 600 tgaggaacga tacgactacg ctcgcacggc catagaagaa cttattctgg atcacgaacc 660 gaacaccatc aagactccaa cgacgcaacc gcaagtgata gtgcaacagc aaccattgaa 720 ggctccaatc cctaccttcg atgggtctta cgccaactgg ccgaaattca aggccatctt 780 ccaggacttg atggccaact caggagattc ggacgccatc aagttgtacc atctcgacaa 840 ggcacttgtt ggtaacgcgg ctggagtttt agatgcgaaa gtgatcagcg aagggaacta 900 ccagcaagca tgggccatcc tcaccgaaag gtacgaaaac aagaggatca tcgtggagac 960 gcacattcgt ggcctatttg atatcaacaa aatggcttct ggttcgtgca aggagcttcg 1020 acggcttcac gacgaatgta gccgtcatgt cgagagcctg aaatatctcg agcaggagtt 1080 ctccggtgtt tccgacctct ttctggttca cattgtgacg tctgcgatgg atcagccaac 1140 gaggatggca tgggaggcaa cacaaaagaa aggcgagcta ccaacgtatg cccagacaat 1200 ctcatttctt cagacaaggt gccagatgct ggagaattgt gaaattgcgt tcccgaaatc 1260 ctcctccccg cttcaacaga gatcgaatcc caagggtcaa gcatcaacga agccagccaa 1320 cgtgaaggtc tatgctgctg gaactgaacc aaggaagaag gatagctgcg atttctgcca 1380 aggtccgcat cgcaactatc aatgcgacac aatcagtaac atgccgttcg agaaaaggat 1440 ggagaaggta cgagcatctg gagtctgttt caactgtctc cgtaagggac acagcgccag 1500 agactgctca tcaacgaaaa cgtgtcaaaa gtgtcagaag cgacaccata cacagctcca 1560 caacgaggaa gtagagcttg agccaaaacc cagtgtgtcc cttacagcac aggaacctaa 1620 gcctgtgact ccccaaagaa tggtattcgc ttcggcttcc accagagact ctccgatcat 1680 gccgctcttt tgcaactacg ctgactcttc gaagactgtg ttcttgctaa ccgctattgt 1740 ccacacctta gatgagaacg accaaccgta tccctgccgt gtgctcctgg acagtggctc 1800 acaggtaaac ttcgtcacag ctgatcttgc gaaccgtctt ggaagtacga aacgacgagt 1860 taatatcccg atcagaggaa tcaacgatgt gaaaaccgtc tcatacgaca aagtgacgat 1920 caaattccga tcccgtgtgt ccgactacga aacccaggtg caatgcctcg ttacaccgaa 1980 cgtgaccgga atcattccag cagcgaaaat cgacgtctca tcttggagca tcccctttgg 2040 tgttcaactc gctgatccag agttctacaa gccagagaag atcgacatgc ttctcggtgc 2100 agaattattc ttgcagctgc tacgacctgg ttacatcaaa ctcaacgacg attttccaga 2160 gcttcgcgaa accactctag gatgggtggt agccggagtg ttccgagaga aacccgttgc 2220 cagcaagact cagcattcgt tagtcgcatc gatagacgac accgaggaga ttccacaaca 2280 atcgaacatc gatgaggtag caaatccaaa cccgttaccc aagcagatat gtaagcaatg 2340 cgaaaagtgg aaacaacgcc gaaaaatacg aagcgaatgc gacaacccga atcaacgaaa 2400 ccttcctccg caagacgcca tcttccgacc ttcaagtttg ccctacacac ttcgggtatc 2460 cgatgctagc ctgaagacaa ccgccgctag caatccgctc aacgagccat catgtactgc 2520 caattgcgaa caaggatcga ggagaacgac aggatttcgc cctgtattga acgtgctgaa 2580 aaatacacta cgaacatcgc gcgaccagaa ggtaggacgc atgacacgag agtatgctga 2640 accaggcaaa gtagcacgct cactcgatgc gcattcgaat ggacctatct atcgacgagg 2700 aagcattccg accaagaaca acgagtcacg atcaccgaag aaacctgttg aaacttcgtg 2760 tcaaccgggg ggagta 2776 // ID Gypsy-244_AA-LTR repbase; DNA; INV; 202 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-244_AA_; KW Gypsy-244_AA-I; Gypsy-244_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-202 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1090-1090 (2011). XX DR [1] (Consensus) XX SQ Sequence 202 BP; 65 A; 27 C; 49 G; 61 T; 0 other; tggtgcctgg cgcactgtaa atggttctac cttaaataaa gatacgttga tatgttgaag 60 tgttaatctt tgggttgatg acttgggatt gaagagagta aacaaagctg gcatttacga 120 cgtgcggaat acaacaatca tattattcga atattagtgg aatattgggg aatataaata 180 attcggaggt caagtctcca ca 202 // ID Copia-2_DWil-I repbase; DNA; INV; 3873 BP. XX AC scaffold_180723; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_DWil_; KW Copia-2_DWil-LTR; Copia-2_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3873 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180723; Positions 25760 21888. XX CC 'ACAGAA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 796..3837 FT /product="Copia-2_DWil-I_1p" FT /translation="MCCNRDLFSTFVKHSEKIVLADNNFLQAEGKGEVKLN FT MVSRILTLKNVLYVPKMRGNFVSVSKAVEHGSIIEFGQKYARVIQKGECIL FT KADKVGRLYIFGTAPSECYAIEQSYEDLWHKRYGHLNYGSLAEIARKDLAR FT GMGTFDSTSLEPCKICMVSKITVQPFPDSTKNKATEILELVHSDVCGPFDT FT KSLGGIRYFLTFIDDRSRRIFVYFLKGKDEVFGKFVEFKNMAERQTGKKVK FT CIRSDNGREYVNKSFDEFLKSNGILRQLTVPYTPQQNGVAERVNRTLVEMS FT RCLLIQSGLHQSLWAEAVYTAAYLRNRSPTRALVNMTPMEAWNGKKPSISH FT LKVFGSIAVALDKGHHKGKFDPKGKEYRMVVYSRNAKGYRLYDAETRQVVE FT KRDVLFDERFGNGDSNGDIVKFDFPYTDDQDSGSVDAADDFSSTDDSDAEV FT SEKGSSSEDFDMAVEQKVAEPVVEQRVGPGRPKLIRTGRPGRPKKQFNILS FT ALITSEVLIPETYEEALSSPFASHWREAMQKEYDSLMSNHTWQLSTLPEGY FT KPVGCKWVYSVKRDAKGEVERFKARLVAKGCSQRYGVNYNETYSPVCRLES FT VRFILALAAQLKLYLHQMDVCTAYLNSDLGDTVYIRQPEEYATEDKSKVLL FT LKKALYGLKQSGREWNSKLDSVLRANGFIPCDNEPCLYKQQAKGNLSLILV FT YVDDLLIGCQEKSDLDRIKSAIMAAFECVDKGPVRLFLGMEVYRDGELGEI FT SLGHSQYIKDLLERYGYAQCRPAATPLEAGYQVACDKMECKRVDATLYQSM FT IGELMWLALTTRPDILHSSKLAQRNKDPHNEHLAAIKHVLRYLASTVGFKI FT HYKQSDAAFTGFVDADWAGDRIDRKYTGYVYFLAGGPISWKSEKQRSVALS FT STEAEYMALSTACKEAIVLRRLIIEIGCGEEDTSTVLYGDNLSAQQLAKNP FT VHHSRMKHVDIRYHFVRQVVNDGQIVLKYRSTNEMIADILTKNLAKKKHVE FT FTNLLSIR" XX SQ Sequence 3873 BP; 1150 A; 704 C; 1012 G; 1007 T; 0 other; ttctgtttta ttataacaat aggttatgga cccaggcaac gtgcctatgt ggttgctgtg 60 gtaatttcta agaagaaatt tctaattcga agaaattaga tttcgcgctg tgttctagac 120 ggttttttca caaagtatat ataaacgagt ataatgtcgg cgttatatca gattgataaa 180 ctcgaggaca gtaattatga ctcgtggtgc atccagatga aaagtgtgct ggtgcatgcg 240 gatttgtgga atgtggcgtc aggcgtattg gcaaggccaa ctacggggga gaacaccgga 300 tggacgacgc tggatcagaa ggcattggca atgatcacct tgagcgtcaa gacttcacag 360 ttgggacatg tgaagcactg tgcaagtgca catgaggctt ggaagaagct tcaggaagct 420 catcaaccca gaggaaccgt tagaaaagta gctcttttca agaaattgct aagtaaacgc 480 aggagggtca aagcatatcg agttatttgt cggatttcaa ggaaatcttg gacaaacttg 540 ctggcgttgg aattgaaata ttggacgagc ttgtaacgat tattttgcta tccagtcttc 600 cagacgaatt cgacaatttc gttgtcaaag gccggcaaag gccattttaa agcaaattgt 660 cccagtgtaa acggtcggaa agttgcatcg aatccgaacg aggcagtaca gtcatcgtta 720 ggtttgctga acgcgttgga cgcgaattgt ttggaacgat cgaactggtg tctggacagt 780 ggcgccacca gccatatgtg ttgtaacaga gatttgtttt caacatttgt gaaacattcg 840 gaaaaaattg ttttggctga taataacttt cttcaggctg aaggaaaggg cgaggtaaag 900 ctgaatatgg tttcacgtat acttacatta aagaatgtat tgtatgtacc caaaatgaga 960 ggtaatttcg tgtcggtgag caaagcagtt gaacatggaa gcatcattga atttggacaa 1020 aagtatgcac gagtaatcca aaagggcgaa tgcattctga aagcggacaa agttggccgt 1080 ttgtatattt ttggaacagc accgagcgaa tgctatgcca ttgaacagtc ttacgaagat 1140 ttgtggcata aaaggtacgg acatctcaac tatggcagtt tggctgaaat tgcacgaaag 1200 gatttggccc gtggcatggg aacgtttgac tcaacgtcgt tggaaccctg caagatctgc 1260 atggtaagca agatcacagt acagccattt ccagattcga ccaagaacaa agctacagaa 1320 atcttggaac tggtgcattc agatgtttgc ggtccgttcg atacaaaaag ccttggaggc 1380 atcagatact ttttgacatt tatcgatgac aggtcacgtc gcatatttgt atatttcctg 1440 aagggaaagg atgaagtatt tggaaagttt gtcgagttta aaaacatggc agagcggcaa 1500 actggaaaga aggtaaagtg cattcggagc gacaatggtc gcgagtacgt caataaatcg 1560 tttgacgagt ttctcaagag caatggaata ttgcggcagc tcacggtacc atacacgccg 1620 cagcagaacg gtgtagcaga gagagttaac agaactcttg tggagatgtc aagatgcttg 1680 ctcatccagt ctggtctaca tcaatctctt tgggctgagg cagtttatac agctgcttat 1740 ctgcggaata ggtcaccaac tcgagccctg gttaacatga ctccaatgga agcatggaat 1800 ggtaagaaac catcaataag tcatttaaaa gtttttgggt ctattgcagt ggcactggac 1860 aaaggacatc ataagggcaa gtttgatcca aagggcaaag aataccgaat ggttgtttat 1920 tctcgaaatg cgaaagggta tcggttgtac gacgccgaaa ccaggcaagt tgttgagaag 1980 cgagacgttt tgttcgacga gcgtttcggc aatggcgatt cgaatggtga tatcgtcaag 2040 tttgattttc catatacgga tgatcaggat tcaggatctg ttgatgcagc agatgacttc 2100 agcagtacag acgactctga tgcagaggtc agcgagaagg gcagcagtag cgaagatttc 2160 gacatggcag ttgaacaaaa agtggcagaa cctgtggttg aacaacgtgt tggacccggc 2220 aggccgaagc tgattcgaac tggaaggcct ggcagaccaa agaagcagtt caatatttta 2280 agcgcattga ttacaagtga ggtgctcata ccagagacct atgaggaggc gcttagtagt 2340 ccgtttgctt cgcattggcg tgaggctatg caaaaggagt atgattcctt gatgtcaaat 2400 cacacttggc agcttagtac gctaccggaa gggtataaac ctgttggctg caagtgggtc 2460 tacagtgtaa agcgtgatgc aaagggagaa gtcgagcgtt ttaaggcgag actagtagca 2520 aaaggatgtt cacagcgcta tggagtgaac tacaatgaaa catactcgcc tgtatgtcgc 2580 ttggagagtg ttcgattcat cttggcccta gcagcgcagt taaaattgta tctacaccag 2640 atggatgttt gcacggctta cttaaacagt gatttgggcg ataccgtata catcaggcag 2700 cccgaagaat acgcaactga agacaaaagt aaggtacttc tgttgaaaaa ggcgttatat 2760 ggactcaaac agtctggacg ggagtggaac tccaaactcg atagtgtgtt gcgagctaat 2820 gggttcatac cttgcgacaa tgaaccatgt ctatacaaac agcaagcaaa aggtaatcta 2880 tcactaatcc ttgtctatgt agacgatctt cttataggat gccaggagaa atcggatctt 2940 gacaggatta agtcagctat aatggcagcg tttgaatgtg ttgacaaggg acctgtgcgt 3000 ctgttccttg gcatggaggt ttaccgtgat ggcgaacttg gcgaaatatc attgggtcat 3060 tcacagtaca tcaaggatct actggagcga tatggctatg ctcagtgcag accagctgca 3120 acaccgttag aagcagggta tcaggtagcc tgcgataaaa tggagtgtaa aagggttgat 3180 gcaactcttt atcagtcaat gataggcgag ctaatgtggc ttgcgttgac tactagacca 3240 gatatacttc actcatcgaa gttggctcaa cgtaacaagg acccccataa tgaacacctt 3300 gcagccataa agcatgtttt acgatatttg gcttcgacag ttggcttcaa gattcattac 3360 aagcagtcgg atgcggcgtt tactggtttc gtggacgcag attgggcagg cgataggatt 3420 gatcgcaaat atacaggata tgtatatttt ctggctggcg gaccgatatc atggaaatcg 3480 gagaagcagc ggagtgttgc tcttagcagc accgaagcag agtacatggc gttatcaaca 3540 gcgtgcaaag aagccattgt attacgtcga ttaatcatcg agattggttg cggagaagaa 3600 gatacatcta ccgttttgta tggcgataac ctcagcgcgc agcaattggc taagaacccg 3660 gtacaccatt caaggatgaa gcacgtggac attagatatc attttgttag acaagtagtt 3720 aatgatggtc agattgtttt aaagtatagg tctactaatg aaatgatagc tgatatccta 3780 accaagaatc ttgcgaagaa gaagcatgta gagtttacga atttgttaag tataagataa 3840 atttgtggac atacttgcat tgaggaaggg tgt 3873 // ID Gypsy-67_CQ-LTR repbase; DNA; INV; 1591 BP. XX AC AAWU01019862; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-67_CQ_; KW Gypsy-67_CQ-I; Gypsy-67_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1591 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 514-514 (2011). XX DR GenBank; AAWU01019862; Positions 77496 75906. XX SQ Sequence 1591 BP; 401 A; 406 C; 408 G; 376 T; 0 other; tgtaacctgg ttacaggaac cacctatcga ttcgttgtat gtgcactcca tacgaggcga 60 aagttaccta aactacccta acgaacatgg aaagcagcgt tgcattatct ttcccttcgg 120 caccttcagt gcctctggcg gcattcgctg gaacttctca ctctaaccga gtcacagaac 180 agtactcatc gttcgccgaa tacttaacac atataatgtc cataaaaccc gaagggcacc 240 tcgtctgtat ttgtgacggg gggaccttgt ccttggaacg tagcgggccc aaggttaagg 300 caacgacagg tcttttcaat ttgaccggca agctgtgcac acgcgtgtgt ccaagtggaa 360 gtacaatcag cgaagcggcc gcagagtttg ccgatcgtca tagtgagtgg caagaagaag 420 attaggttca acttgtgatc taattgtacc ttcctaaatt ttagttcaaa ttgaaacctg 480 tggttttgaa cgttttacta ggcccaacga gccgataggg agtgtgcttt gtgagtagat 540 agattatttg tagctcggtt caaaccttta aaacatgtgc actctgtagc ttaggagagc 600 cgaccgagtg acgtcaacct aacacgcggt catcccaacc caaccgtaat acgcttgggc 660 gaggccaggt aattttcttg gatctgaaga tcgtattgta accatgtgtg aaaagtttgg 720 taactcgtgc gtgaatttct ttattgaagg actattcctt caattgaagg tgttagtctc 780 gggcaggtgt aggattcggc gaccaccacg ccggctctgc gggacggtcc ggagacccgg 840 agggtccgcg acccgaaacc cgcgtgcgca cctgtgggcc aggaggtcac cacaacctct 900 gggacggagg gactcggtag tcaattgccg aacttccatc cccggcctcg aaccaagcca 960 ccccggggaa gttgaacgcg gcccagccac actggccgtc gagcacgtga cgcgtgcaac 1020 ccatctgcgt aagcctgatc agctgcattt agtacaacaa gccaagtcac caacccagca 1080 ccagccgaac agccgccgac gccgacgccg gcgccttcgc cgcctcgtca ccaccagcat 1140 gcagaatcca ggtcagaccc accctttgtt attctaaaaa tagacaccag tgcgatgcat 1200 gaaccaaagc gttacgggaa aaggttagac taatgttagg gaataagaat tagcaggtca 1260 atgtccggac aaataaacta tcgagctaca cgtcgtcgtg tttccgtttg ccctaaaaat 1320 taaagggttg ttgttgtttt acacgcgaaa ctcctggata aggtctccta gttttaagga 1380 tattctatgt ggagggttgt cgttgagcga tgttgttggg gttcgaatag tggcttcggg 1440 acaagaaaat gaactcgccc tgagcgagct agaccgggac gtaacgaaca cgctgctttg 1500 catgtacctt gggaactcgg tacttggcgc ttaagcggag tagcagaatc tccctgaacc 1560 cttgccgttc agcagcgacg cgcttgttac a 1591 // ID Gypsy1-LTR_Dya repbase; DNA; INV; 425 BP. XX AC chr2L; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_Dya; KW Gypsy1-I_Dya; Gypsy1-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-425 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1031-1031 (2009). XX DR Genome; chr2L; Positions 21281727 21281303. XX SQ Sequence 425 BP; 185 A; 68 C; 110 G; 62 T; 0 other; tgtaagacgg ggtattggga cttctcaaca aaagcagtac tcgagtagat ttaagaagag 60 atgcagaaaa agtattaaga cagcagagca acagcagagc gacagaagag agcagtagaa 120 acagcagagc gacagaagag agcagtagaa acagcagagc gacagaagag aatagcgaca 180 gcggagaagc aggagaagcg acagcagagc agcagcagag cgacagcaga acagcagcag 240 agcgagagaa gaggattcgg agagtagaag tcagcagata cggacgtcag agaacgagtt 300 aagaaagcag aaaaatcaaa agcagaaaac ttaaactaaa cttgtaacat tcatcgttaa 360 taataatact gatatttaga tacctataat aataaaaacc aaagttgagt aaatctcatc 420 ttaca 425 // ID Gypsy-34_AA-LTR repbase; DNA; INV; 744 BP. XX AC AAGE02026431; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_AA_; KW Gypsy-34_AA-I; Gypsy-34_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-744 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026431; Positions 172457 173200. XX SQ Sequence 744 BP; 245 A; 92 C; 150 G; 257 T; 0 other; tgtagtgaat aagagcacta gcgataaaat aatgtaaata attactcacc tgtttccaaa 60 tatatttcgc ttcctcactt cggacttgtt tttttatttt ggttttgacg aaatgcaaat 120 gtaaataaga attctactat tggatttgta tgctattcca attagaatat gttggtattg 180 gtttcggatg atctgtcaaa tgaatgcacg gaaaaagaag ttgtgagata atcaatttcg 240 aaaaatcgaa cacggaaatt tattgttgtt acacgctaat ttatttccac tctttgtgat 300 aacttgaaaa attaacgaaa attgttggaa aatgtatttg tgttttggag tatcttgatt 360 tctgtagaat ttggctcatt aaaatagatt attgtaatat aatgtattac aagtgatgtg 420 aacataggtt attaccaact tcattctgag ggtaaataaa aattcatttg tttgcgtttt 480 acatgtaaaa tttgttgagt gtattctgaa aacgcaaact gttgaggggg tatgaaagat 540 atggaacgca gagagaaaaa cgcaaactat aaaacgcaaa aggatgagag gaagtgagtc 600 tggttggagg gaaactggat gctaacgaat tgatcgtcat tctgtgaacg atttttgaag 660 cgtacggttc gacctttttt tgtaataacg tttttttata ctattttcgg ccttgagtgg 720 aaaaatcaac ccggagatac taca 744 // ID BEL-115_AA-LTR repbase; DNA; INV; 807 BP. XX AC supercont1.79; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-115_AA_; KW BEL-115_AA-I; BEL-115_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-807 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.79; Positions 1249315 1250121. XX SQ Sequence 807 BP; 189 A; 172 C; 218 G; 228 T; 0 other; tgtttgggcg catcgccccc tagacacgcc agtggaagtt tctgctgttt tgctcatcca 60 gacagctcgc ccaaaaccca gacggtattg ttcaaccgat ggagtagtcg acggaaggac 120 aataggtcgt cgcagtccat gctatgccta gacggctacg gatggattga tggacagacg 180 actcttatct ccatcatgcc catgcgattg ttgctgctgt ccttgtccag cgtattgcga 240 tgcggagaac atttatccga ggcttggatg cggcgatgat agacaaccaa cgatggctgg 300 ggtgctctcg cactgaccaa gccgatcgtt gtttctctca gtggatgacg tcactgatat 360 cgatcatcag agcacggtgg atggatggct tgctctgatg atgatatcaa acgcatccac 420 ttctcagttc cactttagct acacggcgcg atcggtcgct tatttaaatt gcatacccgt 480 gaatacagtt aggtttatta cgtagaatat agtaagtagt gatgagtgag aactattggt 540 tttatttgaa cggtacgaac tgaaattccc gcgttcgctc ggattggctc agggtgtctg 600 tggttctgct gcgactaaat tgaaggtgat ttgggggcta aaaagagtgt cgtattgtgg 660 acgatttact ggaatttagt ggaattgtac gattgagtgt cggtgattgc gccaatttgt 720 gaccagaatt ttactgtcgg ctactggagc ctaatcgaaa tcaaggactt ttcggctatt 780 aatcgaaggt gggttccgga tccgaca 807 // ID Proto1-3_NG repbase; DNA; INV; 5441 BP. XX AC . XX DT 21-MAY-2009 (Rel. 14.06, Created) DT 21-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Proto1-3_NG is a non-LTR retrotranspsoson from the Naegleria DE gruberi amoeboflagellate genome - a consensus sequence. XX KW Proto1; Non-LTR Retrotransposon; Transposable Element; KW Proto1-3_NG. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-5441 RA Kapitonov V.V. and Jurka J.; RT "Proto1 non-LTR retrotransposons from the Naegleria gruberi RT amoeboflagellate genome."; RL Repbase Reports 9(6), 1146-1146 (2009). XX DR [1] (Consensus) XX CC Proto1-3_NG is a very young familiy of non-LTR retrotransposons CC that belongs to the Proto1 clade of non-LTR retrotransposons. CC This clade includes also the Proto1-1_NG, Proto1-2_NG, CC Proto1-4_NG and Proto1-5_NG families from the the Naegleria CC gruberi amoeboflagellate genome. The Proto1 elements code for two CC ORFs. The ORF2-encoded proteins are composed of the apurinic CC endonuclease, reverse transcriptase and ribonuclease H domains. CC It is likely that the Proto1 clade is a sister clade of the L1 CC clade. Proto1 retrotransposons are characterized by 15-18 bp long CC target site duplications and by a weak target site preference: CC 5'-CATTTTTTTNNNNNNNN-retrotransposon-ATTTTTTTNNNNNNNN-3'. XX FH Key Location/Qualifiers FT CDS 6..1703 FT /product="Proto1-3_NG_1p" FT /note="Proto1-specific protein of unknown FT function." FT /translation="MKKIPQQDHPPDTEGSNLLADRIFVLERAIETYQKEN FT NNLRDYLQQLVNICNGHPLLNNNNNNPIKPVNNNLINTQPSENENLFITRN FT PVNVRNLPTSRNLINGGNFLTETVQICDSNPKNRTVSYADITRLAPIKAQP FT SIKKTIAKQKSTTNKMKRNMATNTRPNKRKSNYNLNQIINVICYKDVTNEF FT VEQITDLLGDSVIIRVNRYCNNIINVICPDETSYKECFNQLVQGSHGGKFI FT LECFPSPAIFNPKPLALSHTTSSKTTHLKMLENIISKYILEDNISVEELYT FT NYSHNQFIIRISFEDQTICSQMAQNLGLNQYKTIEQIRNETKETTSQCSIP FT INYTNDEIKKGIIQAFTRINKTLQENNLTITDTTNRSGTTKYKLIKIVHND FT MDEMNEFCKKSISLKDKQRNHSGNGFKFKPSIQILDSKIQRKSKLHPYLPQ FT LELSPIDKLVFIDEPIRQLPTTTIVNNNNNNNNNNNNITNANNRIDKIEKS FT IIDLTQQLMLTNNNINNIVEILKQANIIRPNNDNKMLNNPNNKMLEDSNNS FT NNNMLDDELSSSSNPSTLP" FT CDS 1802..5386 FT /product="Proto1-3_NG_2p" FT /note="contains the APE endonuclease, reverse FT transcriptase and ribonuclease H domains." FT /translation="MCLTHGIHFMSLTETHLNTPELEIEVKNSGSGYLLFN FT SLMNTRKGCKGTAAMHFLNNYRKRNITNNNLVPGTLQWTRFESRGIPINIF FT TIYLSGRSNDYDNEEEALNSFINAALTIKDEHIILSGDINIDTRNPTSKRE FT KDWLWILNLLRLTELQTSNNTWIRRVQSNIFESKPDHVFVSKDIRIVKEVL FT MPPLTTNDHIPFYIDLEFKSNLTWRPYFSKSKRKKIYEEINKSPIDNFKEL FT NQAILTQINKYGSKTDRLGVGRISTAYKEEIQDLEEEILHKLNNETISQSE FT ISELQTLLKETHIKNGKHIKEKLIEEFNDRTSSKMYQFDRMVHSKEIKWKD FT INFEEEEILEYFTNKFTSTNQLPTFIPSRSTIKEGPNWTNSIHIEELEKAL FT KRMKSNTPGPDLISLDIIKHLTEKNQKILLNEFNNCLRIGDISEDWKQGWV FT KLIPKRDITSLSDIRPITILPTFYRILFNIIAFRLRSWASSHINMRQQAFI FT TDRNTLNHGVLLSSLAMKTKKRTFILVNLDIEGAYDAVELSVIKMALDHCK FT FPNELTQFIINAYSNHELSLEIDNHLSTKFTKTRGIPQGCPLAPLIYDCIT FT QLIIDKAIDKWKIPIKPNKLCANDIALCCFADDMNVVCDRYAKYNDRLDNI FT HDWLGQLLFKLNAKKSVATLLPKTSKVSPKIKGTPIPKQKNLRILGHYPWD FT DILVRQDIQNKIEKFTKSLKFLPLFKIEPDNLKQIIHAKAISLFTHLSKIN FT LIPLKKAQQIDIAIRGAIRRKLYMDRKTPTSLFHLPLEKGGLGLPSIVEFS FT ERMNLRTMTLIANSNNLLVKKAFKHGIKHCEETKDNIYSKWLLVLKEYNLT FT LKKNEMKFKLKKIKTGNNFDIHTDGSRLESKTGMGINIYDCSNMNTPVRSL FT SLHINDQYSNNVAEICSIITAAKVIPRDSKIKIHTDSNVAIEVLKERYKGN FT FLPLKKAFFKTIKDRKLEYEFKKVEAHKDKENIKVDLLAKNATLKPKIFDI FT RKLLFNQYILTKNDIIIFDYKRLTTSQHLRLMLDKIESHPNHIPNLDWNSI FT NVSYLKSHLSPKSKYYIWRNSSNAHINFGEGKSFCHDCNTESDLNHYIHEC FT PALDSARYFCLSKISDILDQRECKLTHIQFRPEKHHHVLFFHHSGTTFFED FT GYVQQYPDIAEKWPEIQGAISNFVGRSYRYYYIDSM" XX SQ Sequence 5441 BP; 2175 A; 928 C; 751 G; 1587 T; 0 other; cgacgatgaa aaaaatacct caacaagacc atcctccgga tacggaaggg agtaatttgc 60 ttgctgacag gatttttgtg ttagagagag caattgaaac ctaccaaaag gaaaacaata 120 atctccgtga ctatcttcaa caacttgtaa atatatgtaa tggtcacccg ttattgaata 180 ataataataa taatcccatc aaacctgtaa ataataatct gatcaatacc caaccatcag 240 aaaatgaaaa tctatttatt actagaaatc cagtaaatgt tagaaatcta ccaactagta 300 gaaatttaat taatggtgga aatttcttga ctgagacagt tcaaatatgc gatagtaatc 360 ctaaaaatag aactgtcagt tatgctgata ttacaagatt agctcctatt aaagctcagc 420 catctattaa gaaaacaata gcaaaacaaa aatcaacaac taataagatg aaaagaaata 480 tggcaactaa tactagacct aacaaacgta aatctaatta caatcttaat caaataatta 540 atgtaatttg ctacaaagat gttactaacg aatttgttga acaaattaca gacctactcg 600 gtgactctgt aatcattaga gttaatcgat actgtaataa tattattaat gtaatttgcc 660 ctgatgaaac cagttataaa gagtgtttca atcaattagt gcaaggaagt catggaggaa 720 aattcatatt agaatgcttt ccatctccag caatctttaa tcctaaacct ttagcacttt 780 cacatactac ttccagtaaa actactcacc ttaaaatgct agaaaatatt attagcaaat 840 atatattgga agataatatt tctgttgagg aactctacac taactattct cataaccaat 900 ttataattag aatatcattt gaggatcaaa ctatttgctc tcaaatggca caaaatcttg 960 gtttaaatca atataaaact attgaacaga ttagaaatga aaccaaggag acaacatctc 1020 aatgctctat tccaattaat tatactaatg atgaaattaa aaaaggaatt atccaagcat 1080 ttaccagaat taataagact ctccaagaaa acaatcttac cattactgat acaactaacc 1140 gctctggaac aactaaatac aaattaatta aaattgtaca caatgatatg gatgaaatga 1200 atgaattttg caagaaatct atctccctaa aggataaaca aagaaatcat agtggaaatg 1260 gatttaaatt taagccatct attcaaatat tggattctaa aatccaaagg aaaagtaaat 1320 tacacccata tcttcctcaa ttagagttat cacctattga taaactcgta tttattgatg 1380 aaccaattag acaactacca actactacta ttgtaaataa taataataat aataataata 1440 ataataataa tatcacaaat gcaaataata gaattgataa aattgaaaaa agtattatag 1500 atttaactca acagctcatg ctcactaata acaatattaa taatatagtt gaaatactta 1560 aacaagcaaa tattattaga cctaataatg ataacaaaat gttaaataat cctaataata 1620 agatgttaga ggattctaat aatagtaata ataatatgtt agatgatgaa ttaagttcat 1680 catctaatcc atctactcta ccttgagtag aaccacaaca agtttaaggc tagcgtgcta 1740 taacgtaaat ggtttatata agcacgcaag ccataaatca gttactcctg aattgaaatc 1800 catgtgttta acacatggaa ttcactttat gagtttaact gaaactcact taaatacccc 1860 agaactagaa atagaagtca aaaactctgg atcagggtac ctattattta actcattaat 1920 gaatacaaga aaaggttgca aaggtactgc agctatgcac tttctaaata attatagaaa 1980 gcgcaatatt acaaataata atctcgtacc tggaacatta caatggacac gattcgaaag 2040 tcgtggaatt ccaattaaca tcttcacaat ttatcttagt ggaagatcta atgattatga 2100 caatgaggaa gaagctctta atagctttat taatgctgcc ctcacaatta aagatgaaca 2160 tattatttta agtggtgata ttaatattga tacaagaaat cccactagta aaagagaaaa 2220 agactggcta tggatactta atctactcag actaaccgaa ttacaaacat ccaataatac 2280 ttggataaga agagttcaat ctaatatttt tgaatcgaaa cctgatcatg tctttgtgtc 2340 aaaagacata agaatagtta aagaagtgct catgcctcca ttaaccacta atgatcacat 2400 tcctttctat atagatctcg aatttaaatc taacttaacc tggagacctt actttagtaa 2460 gagtaagaga aagaaaatat atgaagaaat taataagtct cctatagata atttcaaaga 2520 gttaaatcaa gctatcttga ctcaaattaa caaatatggc tccaaaactg atagattagg 2580 tgttggtcga atctcaacag cctataaaga agaaattcaa gatttggaag aagaaattct 2640 tcacaaatta aataatgaaa ctatttctca atcagaaata tctgaattac aaacattact 2700 gaaagaaact catattaaaa atggaaaaca tattaaggaa aaattgatag aagaattcaa 2760 tgatagaact tcatctaaaa tgtatcaatt tgatagaatg gttcattcca aagaaattaa 2820 atggaaagat attaatttcg aagaagagga aattttagaa tactttacaa ataagtttac 2880 ctcgactaat caacttccaa cattcattcc aagtagatca actattaaag aaggaccaaa 2940 ttggaccaac tcgatccata tagaagaatt ggaaaaagct ttaaaacgta tgaaatcaaa 3000 tacccctggt cctgatctaa tttctctgga tattattaag catctcacag aaaaaaatca 3060 gaaaatttta ttgaacgaat ttaataattg cctaagaata ggtgatatct ctgaagattg 3120 gaagcaagga tgggtgaaac tcattcctaa aagagatatc acttctctca gtgacattcg 3180 tccaatcact attttaccta cattttatag aattttattt aatattatag cctttcgact 3240 tagatcttgg gcatcatctc atataaatat gagacaacaa gcatttatta cagacagaaa 3300 cactttaaat catggtgtcc ttctatcatc attagctatg aaaactaaga aaagaacctt 3360 cattcttgtt aacctagata ttgaaggagc ttatgatgct gtagaattgt cagtgattaa 3420 aatggcacta gaccattgta aatttcctaa cgaactcact caatttatta ttaatgccta 3480 tagcaaccat gaattgagcc tagaaattga taatcatcta tccacaaaat ttacaaaaac 3540 tcgaggaatc ccacaaggat gcccattggc tcctcttata tacgattgta tcacacaatt 3600 aattattgat aaagcaatag ataaatggaa aataccaatc aaaccaaaca aactatgtgc 3660 taatgatatt gcactctgtt gctttgcaga tgatatgaat gttgtctgtg acagatatgc 3720 caaatacaat gacaggctag ataacataca tgactggcta ggtcaactac tttttaaact 3780 aaatgcaaag aaatcagttg caactctact acctaaaaca agcaaagtat ctcccaaaat 3840 taagggtaca cctattccta aacaaaaaaa ccttaggata cttggacatt atccatggga 3900 tgatattctg gttcgacaag atatccaaaa caaaattgaa aaatttacca aatctttgaa 3960 atttctccca ttattcaaaa ttgaaccaga taaccttaaa caaattattc acgctaaagc 4020 tattagctta tttacacacc tttctaaaat taacttaatt cctctgaaaa aggcacaaca 4080 aattgatata gccattagag gagcaattag aagaaaacta tatatggaca ggaaaacacc 4140 aaccagctta ttccacttac cattagaaaa aggtggatta ggactaccat ctattgtaga 4200 attttctgaa agaatgaacc taagaaccat gaccttaatt gctaacagta acaatttact 4260 cgtcaagaaa gctttcaaac atggtatcaa gcactgtgag gaaacgaaag acaatattta 4320 ttcgaaatgg ctactagtac taaaagaata caacctcaca cttaagaaaa atgaaatgaa 4380 attcaagttg aaaaaaatca aaacaggaaa taattttgat attcacactg acggatcaag 4440 actcgaatca aagactggaa tggggattaa tatctatgat tgctccaata tgaacactcc 4500 agttagatct ttatccctcc atattaatga tcaatattca aataatgtag ctgaaatatg 4560 ctctattatt acagcagcta aagttatccc aagagactcc aagattaaaa tccatactga 4620 tagtaatgtg gcaattgaag ttctcaaaga acgatataag ggaaactttc taccactaaa 4680 aaaagccttc tttaaaacca ttaaggatag aaaacttgaa tatgaattca agaaagttga 4740 agctcataag gataaggaaa atatcaaagt tgatttattg gcaaagaatg caactctgaa 4800 acctaaaatc tttgatatcc gtaaactact ctttaatcaa tatatcctca caaagaacga 4860 tattataatc tttgactata agaggttaac tacctcacaa catttaaggc taatgcttga 4920 caagatagag agtcacccca atcacatacc taacctagat tggaactcta tcaacgtcag 4980 ctaccttaaa tcccatttgt ctcctaaatc taaatattat atttggagaa actcttcaaa 5040 tgcccacatc aattttggtg agggtaaaag tttctgtcat gattgtaaca cagaatctga 5100 ccttaatcac tatattcatg aatgccctgc tctggatagt gccagatact tttgcttgtc 5160 aaaaatatct gatattcttg atcagagaga gtgtaaactc actcatattc aattcagacc 5220 agaaaagcat caccatgtac tcttcttcca ccattctgga accacattct tcgaagatgg 5280 atacgtacaa caatacccag acattgcaga gaaatggcct gagatccagg gagccatttc 5340 taactttgtc ggccgctcct atagatatta ttatattgac tcaatgtaaa tttacaaatt 5400 caaagagaat ctgaatagat tcgttattat tgcaataaaa g 5441 // ID BEL-77_CQ-LTR repbase; DNA; INV; 597 BP. XX AC AAWU01021738; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-77_CQ_; KW BEL-77_CQ-I; BEL-77_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-597 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 296-296 (2011). XX DR GenBank; AAWU01021738; Positions 10298 9702. XX SQ Sequence 597 BP; 153 A; 144 C; 153 G; 147 T; 0 other; tgttcgggga tacaaccctg gatcaccgcg agctattgtg ccagtccatc gagcgacgag 60 ctctggaccg cgattgcgtg gttgctgcag cgggaacgag atggccagcg tcatcgatag 120 tcatcgacgg accagcagca gcagatcggg ggagcgagtt gatgtcgccg attctcagag 180 aggatcgacg aggatgagct ttgcatgacc cgtatcaaca tgacccaatt attatataag 240 ctaagaaatt acgtaactaa aactctcagt cattcctggt caccagatga aaaaaggtca 300 agctaggccc tcgcacaaat tttaacgtgt ggttcaataa gaacatagtt tgttaggtgt 360 tagaaaaata aatgtagttt gtcgaaatat aagtggtttc ctgtctaaaa gtgttgtacg 420 tgattttgtt tgttcaaacc gttcaccaac ctggaacctg gcccggttgg aacccgggta 480 atctgccttg acccgttgga agtccctcac cgaattgtcc atcatcggtg ctcacccgtt 540 cccgtcaacc agtggtcgat gtggagtgct ggctggaccg ttgccaacac ccccaca 597 // ID Chapaev-2_BF repbase; DNA; INV; 6411 BP. XX AC scaffold_896; XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 30-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6411 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 778-778 (2007). XX DR JGI v1.0; scaffold_896; Positions 21320 14910. XX CC Chapaev-2_BF belongs to the Chapaev superfamily. Hallmarks of the CC Chapaev transposons are 4-bp target-site duplications, terminal CC inverted repeats with the conserved '5-CAC and GTG-3' termini, CC and the Chapaev transposase. The Chapaev transposase is CC characterized by the conserved D-x(60-80)-D-x(220-290)-E CC catalytic triad. Chapaev transposons populate genomes of CC different animals, including sea urchin Strongylocentrotus CC purpuratus, amphioxus Branchiostoma floridae, starlet sea anemone CC Nematostella vectensis, sea hare mollusc Aplysia californica, CC mosquitoes Aedes aegypti and Culex pipiens, and nematode CC Caenorhabditis elegans. The N-terminal portion of Chapaev CC transposase in Chapaev-1_ACa, Chapaev-2_ACa, Chapaev-3_ACa, CC Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, Chapaev-2_NV, CC Chapaev-3_NV, and Chapaev-1_SP is similar to the N-terminal CC portion of RAG1 (100-370 aa in the human RAG1). It includes a CC novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS join(811..2001,2213..2496,2603..2821,2936..3172, FT 3284..3372,3468..3676,3770..4141,4248..4451, FT 4536..4740,4893..5203) FT /product="Chapaev-2_BFp" FT /note="transposase." FT /translation="MAMSANLGLQDHIENLSKLCRFCGESVLTKADKARFI FT KPLVCIEYAKQIEELWQINVHRDKSYQHPPSFCHRCRLKLYQPRPKTITPT FT RVSWYKHPFGKPCNHCEKVLSISKGGKKPKKKSKAGRPRLQTNDKGAKVEQ FT EQSTPEYSDGNLTEETFAHLLDLLEDLPFDETKFEPICNLDAYKCGICQSF FT LYLPVLTPCAHIFCFECIQKSFTISERNMIDCAICKTQILQNDLKPVHRTW FT VSCYTQLTFNCKNCSTQLRLEQLKHHLDCSSTTEDESTGAGPPAPPPSVTL FT TSLQPVQVVYVPIATPVLSTPPNNTAAQTSTFTLTNTAGSSAASTSTPSTS FT STTSCHCEPGMIPFDPNEDGPLNSQQKKVALSYVHRMMAEAPDKRSFQVPT FT GGQPLQFVHVPRPRKKTSETTSSTTRKRAKMMQDVRDKVSGGAAAAQMKAE FT LQLHPRDELQQMLQELKLDRIVIPNGHLLAAKVGVGMSWSQIRKLKRWLGK FT YNIKLPSEKISREIAAEQISGFDITAEKLPFSVRENRKDPFTVQLRPCAYV FT TSLKDTIFSYLDKNKEANMLTWHGKIPEDEVWVKLGGDHGGESFKMIFQVL FT NRDHPNSKDNTNVFCIFNAKDSRENLTLALQRYTEEIRDLQVSKWTSDGKE FT YKLKILATGDYAFLCTWYGLSGACGFHPCLWCYITLHQIPEDRENRPLRIP FT KRTLDSLAADHQRFVQEGMGKLKKAKEYNNAIAPVMFNVPIDQVMVPGLHI FT GLGLYKKLFEHLEADLQDIDLKLQSYLESVLAEGEVTKDVLLADEHLGKFK FT SFVAAIDEARALDDAADVLEDQIEEQESQLAWLAYRDGVEDSMAEVVFNEA FT CSMVQDLFQQKETLRAKADAVRNKASVKTGKGPLTSQLDPKLKEFKVRRQE FT YHGKSFIGNHVHKMLKASTVLVSHLNFLTLQENAINELTSIVVTTINEILE FT KFPDLPLSLVPKAHATAEKHKQLFTLFAQCHKKYSHADLMDAEAINELDNA FT ITAFMGYYRQHVPHGTVPIKMHMLEDHVVDCVRSWGFGLGFLGEQGIEHLH FT AIFNNINATFRGMMKDEVVQLKAALKDHHIQNSPGHEGGVPDPVPRKKND" XX SQ Sequence 6411 BP; 1893 A; 1442 C; 1316 G; 1760 T; 0 other; caccgctgtt tcagatatgg gcttccgggt tgtttagcct gggcctcatt atagcctcat 60 tatcatactc tcgcgagaga tgccccggcg tggcagcgcg agatcctgtt acagaaccac 120 agcaaccagt tgtttatgcc ttgtaactag ggtgtttggg gagcctagga tcattaatca 180 ctcctaaaca gctgtctttc caagaattta agcccttccc cctgacctga cagcggcctc 240 atgagtaact tccagcagat tagttttcca ggcaggcatt ttcagggcat gacctggctg 300 ggtgtgttca tgctgacatc tgattttcct caacttcaag tgctctctac acttttccct 360 ctcaattcac atggtaatct ttacattcct cttcaacctt gtacttttct ctacttattg 420 ctatatttct ggtcaacttt ggaatttttc tattctttcc atgtgtcttt gaatcactat 480 gacccaacac gtgggtatac atgactgtag tgtccggtat ctgggttaca catcaaattg 540 atgtcagaat cacttaaaat tatacagaac tttatcaaac tgtacattat tattcaatct 600 gcaaattaag ataagcaaga cattcatatt tttatatata ttcatccttt attctggtaa 660 aagaagagga agtgcccccc tccccacccc aaaccccatc tcatacatgg tagctccaag 720 tgcatacaaa tcagttttct agaatattaa gtcattctaa taactagtat atcatagaca 780 cagtggtatc taatagttct tttccaacct atggctatgt ctgccaacct tggtcttcag 840 gatcatatag agaatttatc aaagctatgt cggttttgtg gagagtcagt cttgactaag 900 gccgacaaag caaggtttat caaaccactc gtttgtattg agtatgctaa gcaaattgaa 960 gagctatggc agatcaatgt tcacagggac aaaagttacc agcatcctcc ctccttctgt 1020 cacagatgta gattgaaatt gtaccagcca aggcccaaga ccatcacacc tacacgcgtg 1080 tcttggtaca aacacccatt tggcaagcct tgtaaccact gtgaaaaggt cttgtctatt 1140 tccaaaggag gtaaaaaacc caaaaagaag tctaaagctg gtagaccaag gctgcagaca 1200 aatgataagg gggcaaaggt tgaacaagaa caaagtacac ccgagtacag tgatggaaac 1260 ttgactgaag agacgtttgc acatttgctg gatttgctcg aggacctgcc attcgatgag 1320 accaagtttg agcccatttg taaccttgat gcatacaaat gtggcatctg ccagtcattc 1380 ctatacctac ctgtgctaac cccctgtgct cacattttct gttttgagtg catacaaaaa 1440 tcattcacta tttctgaaag aaacatgatc gactgtgcca tttgtaaaac acagatatta 1500 cagaatgacc tcaaacctgt acacagaact tgggtatctt gttacaccca gttaacattc 1560 aactgcaaaa actgttcgac tcagttgagg ttggaacaat tgaagcacca ccttgactgt 1620 agctcaacga cagaagatga aagtacaggt gcaggtcccc ctgcccctcc cccttcagtt 1680 acacttacat cactgcaacc tgtccaggtt gtctatgtcc ccattgcaac tccagtgctc 1740 tctacaccac ctaacaacac tgctgcccaa acttctactt tcactttaac taacacagca 1800 ggctcatctg ctgcttccac tagcactccc tccacttcat caaccacttc atgtcattgc 1860 gagccaggca tgattccttt tgatccgaat gaagatggtc ccctgaattc gcagcagaaa 1920 aaagttgctc tatcatacgt tcatcgcatg atggctgaag ctcctgacaa acggagtttt 1980 caagtgccaa caggaggaca ggtatggaac atttttcttt cttgttggaa tataatttat 2040 cacaataaat atttcaaaat tttacatcaa tatactttga tgcttacaaa tctttctgtg 2100 ttaaaatgta tgctgcattc aaaagcacat gggtgtctgt tattttaaca tatgacatca 2160 ctacatactg ttgatacatc aatgaatctt gatttttttt tttactttgc agcccctaca 2220 gttcgtacat gtcccacggc ctaggaagaa gaccagtgaa acaacatcat ccactactag 2280 gaaacgggcc aagatgatgc aggatgtccg tgacaaagtg agcggtggtg ctgctgctgc 2340 acagatgaag gcagaactcc agcttcaccc tcgtgacgag ctgcaacaga tgttgcagga 2400 gttgaagctg gatcggatcg tcatccccaa tggtcacctg ttggcagcaa aagttggggt 2460 tggaatgagc tggagtcaaa taagaaagtt gaaaaggtac agttggcaaa ataaaacaac 2520 tttgtctgtg tacaaaccct tatcttgcat ttagttgaaa aagggtgtac atatatttca 2580 aaaccattct attttgttac aggtggctgg gcaaatataa catcaagctc cctagtgaga 2640 agatttcgag ggagatagct gcagaacaga tttctgggtt tgacatcact gcagaaaagt 2700 tacctttttc tgtaagggag aaccgcaagg acccattcac agtacagctc cggccttgtg 2760 cttacgtcac gtcattgaaa gacaccatct tctcctacct ggacaagaac aaggaagcaa 2820 agtaagggtc taccattcct tgtcagactt acactgatta cacaggatgg aaataataca 2880 gtaaaatgcc aacaatgttg acggcaatta actccactac atttgtctct tacagcatgc 2940 tgacctggca cggcaaaatc ccagaagacg aagtttgggt gaaacttggt ggggaccatg 3000 gtggggaatc attcaagatg atcttccaag tcctcaacag ggatcaccca aactccaagg 3060 acaacaccaa cgtcttttgc atttttaatg caaaagacag ccgtgaaaac ctgaccctgg 3120 cgcttcagcg atacactgaa gagatcagag accttcaggt gtctaagtgg acgtaagtaa 3180 acatattttc ttttggattt gaacaatggt gtggacataa aaatcagttt tacaaacatt 3240 tacaatatta aatgttaatc aaaacaacgt tcttcaaatg caggtccgac gggaaagaat 3300 acaagctgaa aatccttgct acaggagact acgccttcct gtgcacgtgg tatgggctgt 3360 ctggagcttg cggtgaggga atttaaattt gaacactaac atgaatatat ctagtgatac 3420 tttgttcaga gttagcttga cactgtgctg ttgtattttt actacaggat tccacccctg 3480 tctatggtgc tatatcaccc tgcaccagat tccagaggac agggagaaca gacctctcag 3540 aatccctaaa aggactctgg actccttggc tgcagaccac cagaggtttg tccaagaggg 3600 catgggcaaa ctcaaaaagg ccaaggagta caacaatgcc attgcaccag tgatgttcaa 3660 tgtgcctata gaccaggtaa tttcagacat ctccaactaa actttcacct acttagtcat 3720 tatgtatatc agactcttaa tctaagctgc tgcctctatt tatttacagg ttatggtgcc 3780 tggtctgcac atcggcctcg gcctgtataa gaagcttttt gaacatctag aggctgattt 3840 gcaggacata gacctgaagc tacagtcata ccttgagtct gtcctagctg aaggcgaggt 3900 gacgaaggat gttcttctcg ccgatgaaca tctgggcaag ttcaagtcct tcgtcgctgc 3960 cattgacgag gcgcgagctt tggatgatgc agcagatgtc cttgaagacc aaatagaaga 4020 acaggagagc cagctggcct ggcttgccta cagagatggc gtggaggact ccatggctga 4080 agtcgtcttc aacgaggcct gcagcatggt tcaggacctg ttccagcaga aggaaactct 4140 ggtaagtaca acattcaaac tattcaaatg tgacagctat attctaactt ctaaatgatt 4200 atcactggtc tatgtccatt ccatgtttat cattcttatt catttagagg gcaaaggctg 4260 atgctgtcag gaataaagca tcagtgaaga ccggcaaggg ccccttgacg tcacagttgg 4320 atcctaaact gaaagagttt aaagtccgca ggcaggagta ccatggcaaa tccttcatcg 4380 gcaaccatgt acataagatg ctcaaggcaa gtaccgtact tgtttctcat ttgaatttct 4440 tgacattgca ggtctggtac actgttgcaa tctaaaacca tctaccactt cagaatttct 4500 catctgctgt tgtgatattt tctttatgta cacaggaaaa tgcaataaac gagctaacat 4560 ctatcgtggt cacaaccata aatgaaatcc tggagaagtt tcctgacctg cccctgtcac 4620 tggtgccaaa ggcacatgca acggctgaga agcacaagca gctcttcacg ctgtttgccc 4680 agtgccataa gaagtactcc cacgctgatc tgatggatgc ggaagctata aatgagctgg 4740 gtaagaattc ccataaaaat aacattcttt gtgtattgtt cactcatttt tcctcacccc 4800 agcatgcctg ggatctttga tgtgtattta gctatggatg taacttaaaa tggcatgtaa 4860 attgatttct atgtaaacat ttacatttac agacaatgca atcacagcat tcatgggata 4920 ttatcggcaa catgtgccac atgggacagt acctataaag atgcacatgt tggaggatca 4980 tgtggttgac tgtgtcagga gttgggggtt tgggcttggc ttcctgggcg agcagggcat 5040 cgaacacctc catgccatct tcaacaacat caatgccaca ttccgcggca tgatgaagga 5100 tgaggtggtc cagctcaagg ctgccctgaa agaccatcac atccagaaca gccctggcca 5160 cgagggcggt gttccagacc ctgtaccaag aaagaaaaat gactagcatt actattgttg 5220 ttaattctat tgttattggt atcaaatgta tgagtaatgt tgcatgtaat ttcagacttg 5280 catgctaaga gaaaggtgta aaaacatttg actcctacaa agactggagt ataatgtagt 5340 gttctatatt ataattcata gacttcatta gtatatgcac taatgcaaca attatatctg 5400 ggatacttta ataattatca taaactgcca ccaaacaagt gtatcagtac tgtgtgatgc 5460 aaattacagt aacttgtatg acaaatgaaa acgagtgagg ccattgtaca aaaaaatatt 5520 gttattgtgt tttcaatatt tgtgatcaat gttcataagt tcatcataac atactggccc 5580 attttggcat tgtaactcta gcacactgta accatgaaaa tatttattca ctatgttatt 5640 gaaatggtca caaataaatc cttttactcc atcttacaac atacaacaca acactgatga 5700 tttccaaacg tgatcataga atactgggtt tggtctcaag gggggagggg ggcggtatta 5760 ctctatggga cgaaatggaa cttcgggaac cacaacgctt tcaaaacaaa tgtaaacaaa 5820 tgtttccgaa gtcaattcac caagttaacg tgacagaaac taatattttc cgataatatt 5880 cgataaaacg tgccattcag aatatgcaaa tgaggtagca tggcggaaaa cagagaaaac 5940 atacctttgt ttcgcgaact cctgaacggt acaaaaaatt ccaaagttgt caggctgcct 6000 tctggagggt taccgaccga tcgatccaac tatttccagt gttctcggcc gttggaattg 6060 agacacattc gtaaaatccc atgctaaagt ctgtgattgc ttcttgacct ctccacaccg 6120 acgcagcaac ttagaaccga ccgatggcgg cgaaatcccg gcagaaacgc gagtaagcag 6180 cttcagcctc tagccaacaa agatggctac cgtctctatc aagttggagg caccacgtga 6240 ccgaaactat gagctatgat tggtctaaag atggatatcc agttctgagc ctgcagtgcg 6300 cccacgcggc cgcatggtgg cgcaggcgta tgtgtacctc attatgcatg cattacatca 6360 ccagataggc cacacccata caaactaaag tgccaaagcc aaacagcggt g 6411 // ID Gypsy-249_AA-I repbase; DNA; INV; 7680 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-249_AA_; KW Gypsy-249_AA-LTR; Gypsy-249_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7680 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1099-1099 (2011). XX DR [1] (Consensus) XX CC Positions [3681-4106] - Reverse transcriptase CC Positions [5166-5642] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2787..6020 FT /product="Gypsy-249_AA-I_2p" FT /translation="MQAPERGTVRKLRFSRLSVKQLSILRKKRVGSEPKPP FT PAEPKLLSVNSSSYFQPASENLYPETPASYTITYPRANDNRPYISVDIYGI FT TVSALLDSGSNLTLINDAVYNTVKPKKLFPLRDPPNLRTASGEALTVRGKV FT YLPFSWNGVVRVVPTLVIPNLAINCICGMDFWKTFRIQPTVVGCATVEYTG FT AEITPSPLRTPSVLSASEQKTIDHIKSLFIPARPGKLSTTPLAEHKIELAE FT EWRKKPPVRQFPYVMSPKTQGLVSVELQRLLDAGIIEPSNSDWSLNCVPVV FT KPHKVRLCLDARKINERTVRDAYPLPHPCRILGQLPKAKYLSTIDLSEAFL FT QVPLEKASRKYTAFSVQGKGLFQYTRMPFGLVNSPATLARLMDRVLGHGVL FT EPYVFVYLDDIVVVTETFEHHVQLLQEIARRLNRANLMINLEKSQFGVPEI FT PFLGYLLSSEGLRGNPDKIRPIVEYERPTTITKLRRFLGMANYYRRFIPNF FT SETCGPLSDLLKSKTKIIGWNEAAEGAFCRIKEQLISAPILGSPDFSREFT FT IQTDASDVAVAGVLTQQQEDGERVISYFSRKLTTPQKNYHAAEKEALAALL FT SIEAFRGYIEGYHFTLVTDSSALTHILKTKWKVGSRCSRWSLDFQQFDMTV FT VHRKGKENVVPDALSRSIAVVQDSPTSVWYTSLMEKVTRRPDDYVDFKVED FT GKLYKFIAVKNVPYDSRFEWKQVLPTEEVSGVLKQAHDDKFHPGFDKTLAL FT VQQRYYWPKMAKDIRAYIQACSTCKEVKPSYVQTTPEMGKMRCASRPWQII FT SMDFIGPLPRSRKGNQHILVICDYFSKWVMIQPVKKLDSSAMCVILKNQWF FT YRNSVPEIIITDNGSSFVSKEFKELIDHFKITHWLNSRYHSQSNPVERVNR FT SINAAVRSYVQEDQRIWDTKIPEVELMLNTSVHSSTGFTPYFITHGQELAE FT QGSDHRLSRHGEELSAEDREERRKQMFTKIHEIVRKNLEKSHEVSRDRYNL FT RNRTFAKSFTVGQLVYRKNMKQSSAAENYNAKLGRQYLPCKIKAKIGSSSY FT ELEDLSGKNLGVWPAVHLKPG" XX SQ Sequence 7680 BP; 2080 A; 1733 C; 1699 G; 2153 T; 15 other; attggcgccc aacagaaata ataagaaaag gtttccactt cactattgaa gtggaaccct 60 tttcagataa gctccgggag gaattggaat cctttattcc aattgttctw ttatacagga 120 tttattgtat tgagttgtat cgggataaga aagggataag tgaaagtgtt ttcttttcgt 180 ggaatcgttc ggatgatgat aactgctgct gtaagtcgtt cttttccttt caattcatct 240 ttacaaatta gcctttttat ctctttatag ctgaagaata aattaatctg aatagtagag 300 ttaaataaat aattagtgaa atttatcttc atwattgcac agtggttcaa aattattatt 360 tttttaattt ttattatttt ttttttattt ttaaatgttt ttgtgtctta cacagtggtc 420 caaaattata attttttaat ttttatttta tttatttttt tttaaattta ttagtgccta 480 gcttaatgat aaagcatwaa tatatgctta atttatttga atttttttgc tttttgtaca 540 ttttttaaat tttttttagg aataagcttt gatttattgt tacttttttt gcatttttaa 600 gcaaaaagta gcttttgatt tagtttatgg taggtttaga atttttgtaa atattttttt 660 tgattgaatt tttattttta agcatttttt ttgattttgt tgtttattct caagtgagaa 720 atggctgaat cacttgaaat ccaattcaca cacgctgtgg aaaagattcg ggagtggtac 780 cacagcttaa gggtggacca cttgtcacct gaggagttgg actttgaatt aaaaattcgc 840 tcgatcgtta ttcgggatga tcccacctac tctcgaagac gcaggagcct tcgggatctg 900 atgaaaatag agaaagagga caaaaatcaa attgaagtag aattacaggt gaactgggag 960 gagacaatcc agttctgtcg acaaaatttc gacgaattgt cggcaggcct acagcagaat 1020 gacaaaaact taagggtgag gggacaagca aagcttttgc atttcggtca tcggatagtc 1080 cttcttttga atcaggttag ggattcgggg gaaacagaaa gcttcctcaa ggaaatgctt 1140 gttgatgtgg tgaacctcct gaagcagtat ttccacactg aaacaactga acctgaatct 1200 cataatcgag agggactgga ggatcaaacc ttaatggact tgttccgtac tgagcttgtt 1260 cctgatgccc atccccgatc gattgcagat tcaggtacag tagtttctgg tgtgaatact 1320 acttcagaaa atacttcggt ggcttctcgg tttgaaggca gtgactgtgt gcgctcttta 1380 ctagctcgca ttaaagagtt agaagcggag atggctagaa tgggaagaca gtgtgaggaa 1440 gagtctcaac gcaggsaatc cgagctccct agtggaagtc gacagccagt ggtgagtgtg 1500 ccgctggcta gtcaagctcc tccaactctt cattcctctt tcccctctct tccgatatac 1560 tcgaaccttt cacagggtca gagttcttcg ggggcggtcg atccttctag gccgatcact 1620 agtgtaacct tctcgttacc gcaaagccat ctctctacac agccgcagcc tccttcaacc 1680 tggcatagct atctgcagaa caatccttcc attcatccgc aaaccaccaa tccgtggtta 1740 ggatatccag ctgcaagttc agtaccattt tcaaatccga gcctcgggac ttcggtcttg 1800 aatagtgggc agttttcgaa tccgtggcta gggaatccga ttccaagcgt cagtcagcca 1860 gcttatccat ggaacaatcc tatagggact tcggcttcaa ctacaactgc ctttccttca 1920 tctagcttcg ttcctttcaa tccagttccg tctggcaatc aggtgcgtca caccttgccg 1980 gtgtccaagt ggacaatcgc taaatatgat ggtgaagatc aagggctgaa gttgaaccag 2040 ttccttggaa tggttcatgc gatggctgtt gctgaacatg cttcggaagc tgaactgttt 2100 gattcagcga ttcacctttt caaaggtcca gcactgcaat ggtacatgac catgcgcgct 2160 acaggccggt tagtcaactg gcagcatttg gtgctagagc ttaggcggac gtttatgcat 2220 cccgacctag acaccttaat aaaaatgaag gtttatcagc gtcagcaagc acgccatgaa 2280 accttccttc aattctacca cggtatggaa gagctttttg ggacaatgag tgttcctttg 2340 ccggaacacg aaaaggtcca aatccttctt cagaatatga ggattgatta taaaaagcag 2400 ctcaatttca tccctattgc tgaccttccg acacttgtgt ccgcaggcca gaaaatagat 2460 gccgtaaact tctcagtcta caacaaagtg tttggacctg agaagtcggt taacgcggtt 2520 gagttttctg aaccgaaaaa gggtacatct caggataatc ccaacaagcc acccactcca 2580 catcaacaag cctcgtcgag aaattcgaag gcccacactc ctcctacaca gaccaatcag 2640 agtcgtaagg gttcccaaac cggaaccgat actcctcgcg gaccagcacg ggcaccaacc 2700 actttggaag acctgatcga gtctcatcaa ccattatcaa gccgacattg cttcaattgc 2760 ggcaagtttg ggcatcggat ggaccaatgc aggctcccga gaggggtact gtgcgaaaac 2820 tgcggttttc gaggctatcc gtcaaacaac tgtccatact gcgtaaaaaa cgcgttggta 2880 gcgagccaaa accgccgccc gctgaaccca aactactaag cgtaaactct tcttcgtatt 2940 ttcaaccggc ttcggagaat ctttatccgg agactccggc gtcgtacacc attacgtacc 3000 cacgtgcgaa cgacaaccgt ccttacatat cggttgacat ttatgggatt actgtttctg 3060 cccttctaga tagtggaagt aatctcactc tcatcaatga tgcagtgtac aatacagtca 3120 aaccaaaaaa actgtttcca ctgcgagacc cacccaacct gcgaacggcg agcggagaag 3180 cgctcacggt ccgtgggaag gtctatctac cattttcatg gaacggtgtg gtccgggttg 3240 tgccgactct cgttattcca aacctagcca tcaattgcat ttgcggaatg gatttctgga 3300 agaccttccg gatccaaccc accgttgtag ggtgcgctac ggttgaatat accggggcag 3360 aaatcactcc gtctccgttg cgtacaccat ctgtcctctc tgcttccgag cagaagacaa 3420 tcgatcacat caaatctctt ttcattcccg ctcgtccggg gaaactgtct accactccac 3480 tggcggagca caaaattgag ctcgccgaag aatggcggaa gaagcctccg gttagacagt 3540 tcccgtacgt catgtctcca aaaacacaag gtttggtgtc cgtagagctg caaagattgc 3600 tcgacgcggg tatcatcgaa cccagtaact ctgattggtc actaaactgt gtaccagtcg 3660 tcaaacctca taaggttcgg ctctgtttgg atgcgcgcaa gattaacgaa agaactgttc 3720 gtgatgctta ccccttgccg cacccctgtc gaatcttagg ccaacttcca aaggcgaagt 3780 acttatcgac gattgatttg tcggaagctt tccttcaagt tccacttgaa aaggcttcgc 3840 ggaagtatac ggcctttagc gtgcaaggta aggggttgtt tcaatacacc agaatgccct 3900 ttggccttgt caatagtccg gccacattgg ctagactgat ggaccgggtt ctgggccacg 3960 gtgtattgga gccttatgtc ttcgtttacc tcgacgatat cgtcgtggta acggagacat 4020 ttgagcatca cgtccagcta cttcaggaaa ttgcacgcag attaaacaga gccaacctaa 4080 tgataaacct ggagaagtct cagtttgggg tgccggaaat tcccttcctt gggtatcttt 4140 tgagttctga gggtcttcgt gggaatcctg acaagattcg acccatcgtc gagtacgaaa 4200 ggcccaccac gatcactaaa ctcagaagat tcctagggat ggcgaattat taccggcggt 4260 tcatcccaaa cttcagcgag acttgtgggc ctttgtcgga tctcctgaag tcgaaaacca 4320 aaatcatcgg ctggaatgaa gccgcagagg gggcgttctg tcgcatcaag gagcagctca 4380 tcagtgcacc gattctgggt agcccggact tctcccggga gtttactatc cagacggatg 4440 ctagcgatgt agctgtcgcc ggagtcctca ctcagcagca ggaggatggt gagagggtga 4500 tatcctactt ctcccggaag ctcaccaccc cgcagaagaa ctaccacgct gctgaaaaag 4560 aggctctggc cgctctcctc tccatcgaag cattccgggg ctacatcgag ggctaccact 4620 tcaccttggt gacggattcg tccgccctca cccacattct taaaacgaag tggaaggtcg 4680 gttctcgatg tagtcggtgg agcttggact ttcagcagtt tgacatgaca gtggtacacc 4740 ggaaaggaaa ggagaatgtc gttccggatg ccctgtcgcg tagcattgct gtagttcaag 4800 attcaccaac ttcagtatgg tacacttctc tgatggagaa ggttacccgc cggcctgacg 4860 actatgtcga tttcaaggtc gaagacggca aactttacaa gtttatcgcc gtcaaaaatg 4920 tgccatacga ctctcggttt gagtggaagc aagttctgcc aaccgaggaa gtttccggag 4980 tgcttaaaca ggctcatgat gataagtttc atcccggttt cgacaagacg ctagctcttg 5040 tccaacagcg ctattactgg cccaaaatgg ccaaagatat tagagcctat atacaggcct 5100 gttcaacctg taaggaggtc aagccctctt atgtccaaac taccccggaa atgggtaaaa 5160 tgcgctgtgc ttcccgaccg tggcagatta tttccatgga cttcattggt ccactgccac 5220 gtagtcgaaa gggaaatcag catatcttag tcatttgtga ctatttttcc aagtgggtaa 5280 tgattcagcc ggtgaagaag ctggatagtt cagccatgtg tgtgattttg aaaaatcagt 5340 ggttctatag gaattctgtc cctgaaatca taatcacaga caacggcagc tctttcgttt 5400 cgaaagagtt caaggaactt atagaccact tcaagatcac gcactggctg aactcccgct 5460 accattccca gtcgaatccc gtggagcgcg tgaaccgtag tataaacgcg gccgtccgca 5520 gttacgtcca agaggaccag cgaatatggg acaccaagat cccggaggtg gagctgatgc 5580 tcaacacaag tgttcactcc tccacgggat tcacaccgta tttcattacc cacggacaag 5640 agctagcgga acagggatcc gaccatcgtc tttctcgaca cggtgaggaa ctttctgcgg 5700 aagacaggga ggaaagacgc aagcagatgt tcaccaaaat ccatgagatt gttcgcaaga 5760 accttgaaaa atctcatgag gtttccaggg atcgctataa cctgcgaaac cgtactttcg 5820 ccaagtcgtt cacggtaggt caattggttt accgaaagaa catgaagcag tcttcggctg 5880 ctgaaaacta caatgccaaa ttagggcggc agtatttacc ctgtaaaatc aaggccaaga 5940 ttggctcgtc ttcctacgag ctggaagacc tcagtgggaa aaatctgggg gtttggccag 6000 ccgtccatct caaacccggg tgaacatctg catcgccttt cttccctgac ctccaatctc 6060 gaatccccta acaaggattc aataaatcgg tgaaaatatc cgaatactca cctgtttgct 6120 ttgttktgtt tctsattcat tcatgcctgt ccgtcgtcgt cgtcagaacc ataatatttt 6180 gattktcctc ctcgtcgaca twgttcgcat catccgcttc tttttcctct ttcttccgtt 6240 ccccatgttt acccttttaa caagggtcga actaagggga atgtttaccw ttttcataac 6300 gcctcccmgg tggcggattt cgatcgatta gttcgtcggt ccaaccgagg ggtctacagc 6360 gatatcgcta ggcttcctca aaattattta gaatagggta ccctagaatc ttagtttagc 6420 atatatagcg tgtgaatgaa ttgtatgttg tcttgggaaa tmaagtttta tcccagacaa 6480 ccgtatgaat gagatgaatg tatgaatgaa tttgtatgaa aggtgggggg aatgaatgtg 6540 tgaatgaatg cgtactttta atcccgagaa aatctcggtg atgattmtwa ggaagagatt 6600 tccgaaaagt ttattttctc tcctaatagt agattagttt taaaataggt tataattgag 6660 ttaatttaaa ataaaagttt aatttataaa gatttgatga tgagaggatg gatmagaacg 6720 cttttattct gacgaacctg tattctggaa ggctgtattt tccagattca cttattgttt 6780 cgatacaact caatcattgt cctcttattt cagcttatgg taattagaga gtacagtggt 6840 caggctcgct ctgcaatttt atgtaatttg aaggaaaaga gcctcctttc gtgatcaatc 6900 attgcttcaa attatcactt aaattgcagg ggagagagcc actgtaggag aactagcttt 6960 cagtgaatct atcacgaatt cgggaagtag gactcctgct tctctaattc gatcgtggat 7020 cgtggaagtt cgcctctccc ggtaattcct ttgatcttaa acagtgacta ctaatctcga 7080 aaagaagagc tcccaccgag gaawcaatca cttttccaga ttggaagtca ccaacaagag 7140 aaaaaggttt actgcgaact aattacccta agaggaaata agattcgtag tttggcagcc 7200 agactctatg gctgaaagtt aggtgatttg gtggaaaaga gctcctctct gcacggaaaa 7260 atcattccac taaatcaaaa taaccttcag ctaaaagaga gctgccaaaa taagaactcc 7320 ttacttgacc ttctccttct agtggacagt ctctattatt aagtgttaag tgattttgtg 7380 gaaaagagct cctcttcatg gacaaatcat tgcacaaaat caatctaaca cttaacaaga 7440 ggagggccac actgaccttt ttccctgact gatttcctct ccagtggaca gtctctatca 7500 ttgaatgtaa agtgatttcg cggaaaagag ctcctgtacg gaaaaatcat ttcgcaaaat 7560 caaattacat tcaatgagag gagggccact aaaatccatt atcctaaatt cccattccat 7620 acttctctta taaaaaaaaa aaactacaag tttttttttt attccgaagt gggggatgaa 7680 // ID CR1-3_HM repbase; DNA; INV; 4733 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4733 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1831-1831 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 931..4194 FT /product="CR1-3_HM_1p" FT /translation="MKICHHCKFCNKKIKDNHRAIQCDCCSFWIHAKCIPV FT DKHAYNMLSNDSSEWFCPNCISNNMPFGVLSDLELKSTLSCKNPSNKIKFH FT EPPDYLKNLFKTMNNVFNPTTKCKYLDVNELNKSINPGTEIYLHLNISSLP FT FHINDLSSFVGTINTPPMVIGITESNLYANDSNITDITIQGYNIEHCPTES FT KKGGALLYLNSNLNYIVRSDLQIYATKFLESIFVEVIYPLKSNTIFGCIYR FT HPSLNITEFLSIHFNPLLEKLSHEAKNIVLMGDFNIDLLNYRESQVISNYF FT ESLCSHSLFPTIILPTRVTAKTKTLIDNIFMNSFPTDIVSGNLTISISDHM FT AQFVCIPNNPPIKKKVKMFKRSFKKFDSDSFIQEISDINWELLIKDDDNIN FT ESINFFLKTFEKILDRYAPYKELTKKQIKLQSKPWITSGILKSISNKNILY FT KKFTRSKNVNTKNILFTKFKLYRNKISNLLRYSKKLYYASFFNNNINNVKN FT TWKGIKEIINIKPSTSTKSFNLKINENVISDHTSVANIFNNYFTTIQDELL FT KNTVPSKCAFGNFLKIPSINSFFIEPVTENEVANLIKDTLKNNKSLGPNSL FT PTFLLKLVSHIISKPLCTMMNNSFKNGIFPEAFKVAKVIPIHKMGSYLDYS FT NYRPISLLSNLSKLFEKAMFQRLEHFLEKHKFIYKHQYGFRNKHSTTHSLI FT EITEKIRQAIDNKHFACGVFIDIRKAFDTVEHTILLEKLKHYGIRGIPFLW FT FSSYLCNRTQFVSINGINSGLAKSFNGVPQGSVLGPLLFLIFINDLNVSLK FT FSTAYHFADDTNLLLINKSLKKLNKNMNHDLANVVQWLRSNKLSLNSKKTE FT IIIFKSAKTKINKQLNFRLSGQKINPVNSIKYLGIKIDSNLSFASHLQDLA FT LKLSRSNGILAKIRHFVNHETLLNLYHAIFHSHLRYACQVWGQSKQLAFLR FT LTYLQNKALKLIYFQHTNSNCSILYFLSKVLKLCDLIQLSNCLFVWNQNHN FT NLPLTFINFFSYRENCKYILRSALNFKLSVPKYRTVHYGYESIQHKSIQTW FT NNLPSQLKSLKSFSKFKTALFNHFLEKYSL*" XX SQ Sequence 4733 BP; 1699 A; 744 C; 479 G; 1808 T; 3 other; aggtcatcgc aaatggcgga cgtatttttc taacggtaaa aaataaataa aagaaaaagg 60 ataagataaa cagttaattt ttcattttca attcgctaaa aagttttttt aaattgttaa 120 aaatctgatt taaatcaaac gcttaaaaat aaaaatcaaa taaaaaaaag aacttcgaat 180 tattaaaaaa aaagaacact ttaaacaaag aaaagaacga agaaaaaaaa aaaaaaaaga 240 aagaaagaaa gagaaagtaa acaaagaaga agaaaaaaag aagaaataag taaacaaaca 300 agaaagagaa gaacaaaacc aattttaaag agaataaaga agattggtta ttcatttatt 360 ttcgcgattt aatttgtaat tattattttt taatataatt atttaactta tttgttattt 420 tattattatt attattattt tttttttttt tcactttttc acatattttt taattattta 480 tttatcatta attatattca ttagttctag ttaataaggt tttcccgtta tctatctaag 540 ttgtattatc acatattatt tatttattta tcattagatt atctatttag gaagttttgt 600 tatatatttt attatattta aattatttta ctatatataa ttttattata ttttttgatc 660 attattataa ctttatcatt attatatatt cttatacatt tatttattat ggatttatat 720 tattttaatt ttatctattg tttattattt tatcttagtt attaattaat attataaatc 780 tatactcttt atttataatt tcgataataa ttaactacgt actactactg attactgttg 840 ctatttatcc atttactgtt tattgtttga ttattttctc ttttaatttt ctatattttc 900 tgttctataa acggctttac catctatatc atgaaaattt gtcaccattg caagttttgc 960 aataaaaaaa taaaagataa tcacagagcc atccaatgcg attgttgctc cttttggatt 1020 catgcaaaat gtattccagt tgacaaacat gcttacaata tgctttccaa tgactcttct 1080 gaatggtttt gccctaactg catatcaaac aatatgccat ttggcgtact ctccgatctg 1140 gaacttaaat caactctttc ctgcaagaat ccttctaata aaataaaatt tcatgaaccc 1200 cctgattatc tcaaaaactt atttaaaacc atgaacaatg tttttaatcc aacaactaaa 1260 tgcaaatacc ttgacgttaa tgaactaaat aaatctatta atcctggtac agaaatttat 1320 cttcatctaa atatttcttc tcttccattt cacataaatg atctctccag ctttgttggt 1380 accattaata ctcctcccat ggtaatagga attactgagt ctaacttata tgcaaatgat 1440 tctaatataa ctgacattac catacaagga tataacattg agcactgtcc aacagaatca 1500 aaaaaaggtg gtgcacttct ttacctaaat tcaaatctta attacatagt tcgtagtgat 1560 ctccaaattt atgccaccaa atttctagag tctatatttg ttgaagttat atatccactt 1620 aaaagtaata ccatttttgg atgtatatat cggcatccct ctttaaacat cactgaattt 1680 ttatctatcc attttaatcc tctccttgaa aagttaagtc atgaagcaaa aaatattgta 1740 ctaatgggcg atttcaacat agatttattg aattatagag aatcacaagt catttctaat 1800 tattttgaat ctttgtgctc tcattcctta tttccaacca taattcttcc aacacgtgta 1860 actgcaaaaa ctaaaaccct tattgataac atcttcatga attcatttcc tactgatata 1920 gtctctggta atcttacaat ttcaatctct gaccacatgg cacaattcgt ctgtattcct 1980 aataacccac ccattaaaaa aaaagtaaaa atgtttaaaa gaagctttaa aaaatttgat 2040 tctgattctt ttatccaaga aatttccgat attaactggg agcttctcat aaaagacgat 2100 gataacataa acgaatctat aaattttttt cttaaaacat tcgaaaaaat attagaccga 2160 tatgctcctt acaaagaact aacaaaaaaa caaattaaac ttcaatctaa accctggata 2220 acttctggta ttcttaaatc aatttctaat aaaaacatac tctataaaaa atttactaga 2280 tcaaaaaatg tgaatacaaa aaatatatta tttactaaat ttaagttata cagaaataaa 2340 attagtaatt tgttaaggta ttctaaaaaa ttgtactacg catctttttt caataacaac 2400 atcaataatg taaagaatac ctggaaagga attaaggaaa tcataaacat caaaccatct 2460 acttctacta agtcttttaa cctcaaaatc aatgaaaatg tcatctcaga tcataccagt 2520 gttgctaata tatttaataa ttattttacg actattcaag acgaactact taaaaatact 2580 gttccctcta aatgtgcttt tggcaacttt cttaaaatac ctagtattaa ttcttttttc 2640 attgagcccg tgactgaaaa tgaggtggcg aatctcataa aagatactct taaaaataat 2700 aaaagccttg gtcccaatag tctgccaaca tttttactaa agctggtatc tcacataatc 2760 tccaaacctc tatgtactat gatgaataat tctttcaaaa atggtatttt tccagaagct 2820 tttaaggtag ccaaagtaat tcctatacac aaaatgggtt cttatctaga ctactccaat 2880 tatcgaccta tttcacttct ttctaactta agtaaactat ttgaaaaagc aatgttccaa 2940 agactcgagc attttttgga aaaacataaa tttatttaca aacaccagta tggttttcgc 3000 aacaaacact caacaactca ttcattaata gaaataacag agaaaattag acaagcmatc 3060 gataataaac actttgcgtg tggagtgttt attgacataa gaaaagcatt tgatacagtt 3120 gagcacacaa tcctcttaga aaaattgaaa cactacggta taagaggtat tccttttctc 3180 tggttttcct cttatctttg taatagaaca caatttgttt caattaatgg tatcaactca 3240 ggactagcta aatcttttaa tggtgttccg caaggctcgg ttttaggtcc attacttttt 3300 cttattttta tcaatgattt aaatgtttct cttaaatttt ctactgctta tcacttcgcg 3360 gatgacacca acttactcct aattaataaa tcacttaaaa aacttaacaa aaatatgaat 3420 cacgatctag ctaatgttgt tcaatggctc cgctccaata aattatctct taactctaaa 3480 aaaacggaaa ttattatatt taaatctgct aaaacaaaaa taaataagca attaaatttc 3540 agattaagtg gccaaaaaat aaatccagtt aactctatta aatatcttgg aatcaaaata 3600 gactctaatt tatcttttgc atcgcatctt caagacttag cattgaaatt aagcagatcg 3660 aatggtattt tagctaaaat tcgtcatttt gttaaccatg aaactttgct taatttatat 3720 catgctattt ttcactctca tctcagatat gcttgccaag tttggggaca atctaagcag 3780 cttgcttttt taagactaac ttatctccaa aacaaagctc taaaattaat ttactttcaa 3840 catactaatt cgaactgtag catactctat tttttatcta aagtccttaa attgtgtgac 3900 cttattcagc tatcaaattg tctatttgtt tggaaccaaa atcataataa cttacctctt 3960 acatttatca acttttttag ttatagggaa aactgtaaat atattctacg atctgctctc 4020 aactttaaat tatctgtacc taaataccga actgttcatt acggctacga atccatacaa 4080 cataaaagca tacaaacctg gaacaatctc ccttcccaat taaaatctct taaatcattc 4140 tcaaaattta aaactgctct ctttaaccat tttttagaaa aatactcttt ataaatctta 4200 taccttttta caacttatta tttataaccc taacgcagtg cttctattca tcttttaatt 4260 tttctcattt ctatcgtgat atattactaa ttttaatatc aattattaaa tttattatta 4320 taaactgata tgtttctata gtattctctt atcactgttg ttgttattat tactatattt 4380 tatatcgtaa tgaatattat tgtaattgtt attattgttg ttgttattat ttttgttgtt 4440 attattgcta ttattattgy tattattatt attattatta ttattattat tattattatt 4500 attattatta ttattatagt atattctttt tataatcgct actattattt tttttatcac 4560 tataatcact actattactt ttattattcc tacttgtatt ataattatta ttattactat 4620 ctccatctta tttttattta ttttgacagg tgtttacttt atattagtac aaattactat 4680 ttgtaaatat ccttgatgta acaactttat tcagaatata twtgatttga ttt 4733 // ID Gypsy-11_DPu-I repbase; DNA; INV; 5458 BP. XX AC scaffold_48; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_DPu_; KW Gypsy-11_DPu-LTR; Gypsy-11_DPu-I. XX NM Gypsy-11_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5458 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 737-737 (2010). XX DR Genome; scaffold_48; Positions 547943 542486. XX CC Positions [4059-4538] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 347..1393 FT /product="Gypsy-11_DPu-I_1p" FT /translation="MATRGARGGGRGRGRGANIPNLDPADVVPGPNPPANI FT AQPAYPLRHKDPPIFHGKADEDVVNWIFRFEQIAEFNQWGPDQRLRHIGMC FT FEGVAEKWYCSLMTRVPPPLTFDEFRQELLRAFKPVNYEEHLETKLRSRTQ FT GNEEPFIDYFHDVLFMCSRIDPGMSERSKIGHLFRGLLPATVRGVYRFITP FT NSTTNDLFREAQIFLQGEDIAAKGEKRSESIPPAQPVLHVKKEESTNSLPS FT NSLSPYTPKETNAVSREELMQMEKRILDNVAKAIEQASRQGTERNQGNRDR FT PAARYGTNKRTRDGRPICNDCGHPGHIARFCGKDKDSRGKDEPSTKIKPES FT GPSGSN" FT CDS 2628..4847 FT /product="Gypsy-11_DPu-I_2p" FT /translation="MDMQSGYWQVEVRPEDREKTAFITPDGLYQFKVMPFG FT LSNAPATFQRMMDVLLSGLKWNTCLVYLDDIVVFSKTFSEHLSRLDEVLAR FT IQRANLKLKISKCSFFATSLKVLGYVVSGKGLSPDASKVLAVRNFPVPQNV FT KDVQSFLGFCTYYRRFICDFANIARPLSDLTKKNNPFVWASEQQNSFEALK FT SALQSSPILGHPNYELPMEIHCDASGYGLGAVLVQQQESGERVICYASRLL FT NKAETNYSVTERECLALIFAIQRFRAYIWGARIKVVTDHHALCWLMKKKDL FT SGRLARWSLQLQELDIEIFHRSGKLHSDADVLSRNPVDQPGTMEDIPTLII FT LSTDADRIINGQKTSGWWKPILERLRSEEDSATTRRLKNKYVIRDGILFRR FT VSWEGREYFRLCVPEGLITEILLACHDDVTAGHLGITRTLDKIRKRFFWPR FT FTQKVIRYVRSCVDCQTKKRSTEIRAGMMKSIESTQPFEKVGIDLIGPFPL FT TKTGNKYIVVAVDYLTKWVIAQPVPHAGTREVVDFFVRRIVLQHGAPVTLI FT SDRGKCLTSGFAEEMYRALQSNHSVTTAYHPQCNGLVERFNHTFAEMLSMY FT VSSCHDDWDDVVDFVVFAYNTSRQESTGATPFYLLYGREAVLPIDVCLGNN FT PNPVCKTNSESSANLRELSGRLTVIREKVKRRLIAVRAKQKRRFDRQRRQV FT RFVIGDLVWVHRPLRKKGRSQKLLHPFFWTIQNRGKNKRS" XX SQ Sequence 5458 BP; 1542 A; 1244 C; 1376 G; 1296 T; 0 other; tttggtggag atgcagggta atacgggtct actattcacg tgtggcctga tcaattgaat 60 tcttcgagtg tggcgtgata cgggcgagcc tactgtagcg attcatccgg atcgcgattc 120 ttcttgtctc cgagtcttca ttttcgtgag aggacggctg gagctgggcg ttgcgattcc 180 agcatcgcat ctggacttta accgacggca gcagccgtgc gccgtggagc ataagttttc 240 ttcttccttt attttatgtc cccttgtttt tggaaattgt tttattggtc gtttctattg 300 tcattctcac ggccgcataa actatcgtta gtctaagtga tacaaaatgg cgactcgggg 360 ggctcgagga ggcgggcggg gccgcggaag aggggctaat attccgaatc tagaccctgc 420 tgatgtcgta ccagggccga atcctccggc taatatcgca cagccagctt atccacttcg 480 tcataaggac ccacccattt ttcacgggaa ggctgacgaa gatgttgtaa attggatttt 540 tcgattcgaa caaatagctg agtttaatca atggggccct gaccaacgtc ttagacacat 600 cggaatgtgt tttgaggggg tagcggaaaa atggtattgt agcctaatga cccgggtacc 660 accacccctt acatttgacg aatttcgaca ggaacttctc cgagcgttca aaccagttaa 720 ttacgaggaa caccttgaaa caaagcttcg ctctagaaca caggggaatg aggaaccatt 780 cattgactat tttcacgatg ttttgttcat gtgctctcga attgatccag gcatgtcaga 840 aaggagtaaa attggacact tgtttcgcgg actgctaccg gccacagtcc gaggcgtata 900 ccgtttcata acacctaata gtaccaccaa tgacctattc agggaggccc aaatctttct 960 acaaggggag gatattgcgg caaaagggga aaaacgatca gaatccatcc caccggcgca 1020 acccgttctt cacgtgaaaa aagaggagtc taccaattct ttaccctcaa actctctttc 1080 cccttatact cctaaagaga caaacgccgt ttcaagggag gaattgatgc aaatggaaaa 1140 gagaatcttg gacaacgtgg cgaaggctat agagcaagca agtcgacagg ggacagaacg 1200 taatcaggga aatagggacc gaccagccgc taggtacgga acgaataagc gaactaggga 1260 cgggcgaccc atttgtaatg attgcggaca tcccggtcat atcgctcggt tttgtgggaa 1320 ggacaaggat tcaagaggaa aggacgaacc atccacgaaa atcaaaccag aatcagggcc 1380 ttccggatcc aattaaaaaa actagactcg gtggtgaaaa ggggggaatc caccgttgtc 1440 gtttttaaca tgcccccacc agatcgatct aatctcgttt tttctaaggt aatttgcaga 1500 aataattatg ttgaagctat tatagataca ggggcgggaa ttactgtaat ctcgcccgaa 1560 ttttgtaaat atctgaatct tgtacctaaa aaacaatggg agggacccaa acttttgatg 1620 gcgaatggct caatcgtgtg tccagaaggc tccgtaaccc tcgaaattat tataacggga 1680 atcccagtct acgtcgaagc cgcggtgttg cctataaatg ggtataaact cttgctgggg 1740 aacgacgcat tacgtcaact tgagtcgatc tcgataattt atggggaagg tggcgaggcg 1800 gtattctcag tagcaccgga ggcggaagtc gatacagaaa aggggaaaga tgagctaggg 1860 tctatagtta gccaggaatc gtgtgtcata ccagcatatt cggtggttac gattaccgca 1920 gaattaaaca acgttcagat agggccatcg gggacgcgcc taatggttga gccagtaaaa 1980 aaaaatacta attgataaag gtttttcggt tgggcacctt ctgctcccca ctgaagacca 2040 tcatgactgt ttaaaaggaa ttcagctggt gaatttttcc cgtcaagatc aatggctcag 2100 caaaggaacc gtgttaggaa aaattgtgcc agtcgaagta gtatcagaat cggaaaagca 2160 agacggcagc acaggcgatt ctttttccca aaataccaca atgcagttcg agagcgtgat 2220 aaacgaggaa ctcgcaccgg aagagcgaga tgcggcggag agactcttac gaaaacgcgc 2280 tgggtgtttt gccacttccg attgtgacct gggacagtcc aatatagttc agcatagtat 2340 agacaccgcc acgcacaaac cgattcatca ggccccctat aaaagcgctt ggaaggaaag 2400 ggaacttacg caaaatcaag tccagcatat gcaaaacata ggggcgatag agccatccag 2460 cagcccgtgg gcggctcccg tcgttctagt taaaaaaaaa ggatgggtct tggcgtttct 2520 gtgtggacta tcggaaactt aatgcaatca ccacccgaga tgtctacccg ctccctcgaa 2580 ttgaagatgc gctgagtagg ttcgaagggt ctcgttattt ctcgattatg gatatgcaat 2640 cgggatactg gcaggtcgaa gttaggccag aggatcgaga aaaaacggcg ttcattaccc 2700 cggatggctt atatcagttt aaagtaatgc ccttcggctt gtcaaatgct ccagcgacgt 2760 tccagcggat gatggacgta ctcctttccg gtttaaaatg gaatacctgc ctcgtctatc 2820 tcgacgacat agtggtcttt tcaaaaactt tttccgaaca tttatccagg ctcgatgaag 2880 tgttggcgcg gattcaaaga gcgaatttga aactaaaaat ctctaagtgt tcgttttttg 2940 ccacctccct caaagtactg gggtatgtcg ttagcggaaa gggactatca cccgatgcgt 3000 cgaaagttct tgcggtcagg aattttccag tgccacaaaa cgtcaaagac gtccagagtt 3060 tcctcgggtt ttgtacatac tatcgacgat ttatatgtga ttttgcaaac atcgcgcgtc 3120 cgctttcgga tctcacaaaa aagaacaatc ctttcgtctg ggcgagcgag caacaaaata 3180 gcttcgaagc ccttaagagt gccttacaat cgtctcctat cctcggccat cccaattatg 3240 agttacccat ggaaattcac tgtgatgcga gtgggtacgg gttgggagcg gtcttggtcc 3300 aacaacaaga aagtggggaa agggttatct gttatgcgag tcgtctatta aataaagcgg 3360 aaactaatta ttcggtaacc gaacgggagt gtttagcttt gattttcgcc atacaacggt 3420 tccgagcata tatttggggg gcgagaatca aagtagtaac cgaccatcac gcgttatgtt 3480 ggctgatgaa aaaaaaggac ctctccggtc gccttgcacg ctggagtttg cagcttcaag 3540 aattggacat tgaaattttc caccggagtg gaaaactgca ttctgatgcc gacgttctct 3600 cacgaaatcc cgtcgatcag cctggaacaa tggaggacat ccccacgcta ataatactgt 3660 ctaccgacgc ggacagaatt ataaacggac aaaaaacatc tgggtggtgg aagcccatct 3720 tagaaaggtt gcgatcggaa gaagattctg ctacaacgcg tcgcctcaaa aataaatacg 3780 taatacggga tgggattcta tttcgccgtg tgagctggga ggggcgtgaa tatttccgtc 3840 tttgcgtgcc tgaaggacta ataacagaaa ttttactggc ctgtcacgac gatgtgacag 3900 ccgggcatct gggcatcacc cgtactttag ataaaatccg caaacgcttc ttttggccaa 3960 ggtttactca gaaggtcatt cgctatgtga gaagttgcgt cgattgtcaa acaaaaaaga 4020 gatctacaga gatacgcgca gggatgatga aaagtataga gtctacccaa ccatttgaaa 4080 aagtgggaat tgatcttata ggcccattcc ccttgacaaa aacaggaaac aagtatatag 4140 tcgtcgcggt ggattatttg actaagtggg taatagccca gccagtccca catgcaggga 4200 ctagggaggt ggtcgatttt ttcgtacgta gaattgtgtt acagcatggc gcacccgtga 4260 ctctcatctc ggacagaggg aaatgtttga catctggatt cgctgaggaa atgtaccgag 4320 cccttcagtc aaaccattcc gtgacgaccg cataccaccc ccaatgcaac ggcctggtcg 4380 agaggttcaa ccatactttc gcggagatgc tttccatgta tgtgagctcc tgccacgatg 4440 actgggacga cgttgtggac tttgttgttt tcgcctacaa caccagtcgg caggagtcaa 4500 caggcgcgac cccattctac ctgctatacg gacgtgaagc tgtgctccca atcgatgtgt 4560 gtctgggcaa caacccaaac ccggtgtgta aaaccaacag tgaaagcagt gccaatctac 4620 gcgaactttc aggtcggcta actgtcataa gggagaaagt gaaaagaaga ttaatagctg 4680 tacgggcaaa acaaaagaga cggtttgatc gtcagcgtcg acaagtgcgt tttgtgatcg 4740 gtgaccttgt gtgggtgcat cgccccttgc ggaaaaaagg acgctcacag aaactactgc 4800 atcctttttt ttggaccatt caaaatcgtg gaaagaacaa acgatcttaa ctatgtggtt 4860 gttccgttta acggcaagaa gaagacccga gatcgagttc acgtgaccaa tctaaaacct 4920 tactacgctc gccatgaagt tggtaccaga acaaagtgtc cattgaaccc aggaccaaac 4980 gaagggacga gcaacgttca gtcgggtaac ggcccgggag gtagactggg gaaaaaacga 5040 ctccaacgga gtgaagctcc aactacgcca gagtcaaaca ggaaagaaaa ctctcccaag 5100 tgatggtcgt gtcggagcag agtgcagagc gcggagcaag tcgcagagcg cagagcacaa 5160 gcgtgcagag cgcagagcgc aaagcagggc gcagagtaca gagcacgcac gagccctcga 5220 ccaggaatat ggcgaaaaca ttccccggag tgaagttcct agaaaagtgg tcacgtcgga 5280 agaatctgaa gtcgctccat ttcaagaaat agtagagatc gggcgaccag ttcaaaccct 5340 tccctctacc attacgagat ctcacagaca cctcctccga ccccaatcta cccttagaca 5400 tcctgactat tttcattcct aaggatcggg tcgatcctcc cactggaccg gggggaaa 5458 // ID Ginger1-1_AC repbase; DNA; INV; 4336 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Aplysia californica. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-1_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4336 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 140-bp long. Tpase contains 2 introns: 833-1122, CC 2042-2434. XX FH Key Location/Qualifiers FT CDS join(716..832,1123..2041,2435..2697,2701..3384) FT /product="Ginger1-1_AC_1p" FT /translation="MFLVETEKVLHCQGLTMTAFIHMHSTASRKQTNHFPI FT LNMNRFHLSTAKYLRVITDRTEQIRIIRATHGGLGETVESRSLGGHLGWDK FT TEGRLSKSVWWPGVRKDVRNYIQHCDRCQRRGPSLDKGQQLHPVKIPPKPW FT SQIGVDTCSLPKSKDGFTCMVVAVDYFTKWMEAEPLVAKTAEGVANFLYQC FT MCRHGCAGIQINDRGREFVNSVSRALHKLSGVEQRVTSAYHPQANDLVERE FT NRTIQGMMVKMLTEVSCVEDWPRALPGTLFALRTSKHATTKYSPFFLLYGR FT EPKLPIDVESDSKKKNETESSENLETNTDVSSTLSRTIEAEGELNQNNGNE FT SSENVMKNIRKAQERQSKQYNKRRAGQTFEVGDSVLLYNLRRADRKGGGGA FT NCWTGPYEINRQLSNGTYEIKQNGLVLKNKANAQNLKPYKKVKNENESEDE FT VRNNRTKQPEASVKQWDERRESNSITSEDDVIIDSVEDGEAPKFNPTDHVW FT RLQKTAELGLPSPKPMPDREPSNHLGSPNNTNSVRGDGNCFFRAMIVELTG FT LEDYHQDLRSFTVSFMRANAHSFAGYLGQNVNAYLNDSNMEVVSTWSTDAE FT IYAMATLLDTVIYIYTLFGPTRRWIPFKPGFSSGNLSQSKSENPALYLTNL FT CAHYERVVGVEH" XX SQ Sequence 4336 BP; 1349 A; 923 C; 910 G; 1154 T; 0 other; tgtagcacgc caaactgact cccacgccag aggcactccc ccgtgcgttt ggcgtggcgg 60 gaagtcgctt tggcgtgaac ccacgcgaaa tctacttcca cgcgaaacgc actccctcca 120 cgccaaacgc actcccctaa gggaaacaac tctgaatcta ttattagcgc agagtgactt 180 cccggaaaaa tggatgaatt gggagaaatc ttcgactaca ttgtctccaa aagatatcct 240 aattccgtaa aaacaaaagg acaaaaagtc aattttagaa gaagagctgg caagttcact 300 gttgaggaag gggtcctgaa gcgagcagtc aaagacgatg taaaaaacgg tgagtttcac 360 ttcatgattt aatctagatc tagatatcta gatctagtca ttcattctct cccccccccc 420 tcgctctccc tccctccctc tctccatctc cctctccctc cgtccccctc tctccctttg 480 ccccctccct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct 540 ctctctctct ctctctctct ctttgcaaat gtgtgtctta gatctaatca gcttgcaata 600 attgtatgta taagcttgag agagcatatt tttgtctgtg tgctgcgtta gtttttgcac 660 atgcacacat tggttgaaaa attgctagat tcagagaaag ggaacagcag cagatatgtt 720 tctggtagaa actgagaaag tgttgcactg tcagggctta acaatgactg cgttcatcca 780 catgcacagt acggcaagta gaaaacaaac aaaccacttt ccaatactga atgtgaactt 840 tcccgaaatg caaaaatgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 900 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 960 tgtgtgtgtg tgtgtgtgtg tggataaaga cccccccccc cccttttttt ctgaaatgtt 1020 acatatgcta caaaatggtt tagaacctta ggttgataat ttgatggatc gactgactga 1080 ctgattgact gactgattga ttggtttatt gacgattgat tgatgaatcg atttcatttg 1140 tctacagcca agtacctcag agtgatcact gacagaacgg aacaaattcg aattattcga 1200 gcaacacacg gaggccttgg tgaaaccgtg gaatcaagat cgctgggagg tcaccttgga 1260 tgggataaaa ctgaaggacg gctatctaaa tcagtgtggt ggccaggtgt tcgcaaggat 1320 gtgaggaatt atatccagca ctgtgatcgc tgccagcgtc gtggaccaag tctcgataaa 1380 ggacaacaac tgcacccagt aaaaataccc ccaaaaccat ggtctcagat cggggttgac 1440 acatgttcct tgccgaaatc taaagatggt tttacgtgta tggtggttgc tgtcgactac 1500 ttcactaagt ggatggaagc tgaaccactt gttgcaaaaa cagctgaagg agtggctaac 1560 tttttatacc aatgtatgtg cagacatggc tgtgctggta tacaaataaa cgacaggggt 1620 cgggaatttg taaacagtgt gtccagggct ctacacaaac tgagtggtgt tgaacagaga 1680 gtcacaagtg cctaccaccc ccaagcaaac gatcttgttg aaagagaaaa cagaactata 1740 cagggaatga tggtcaaaat gctgacagaa gtctcgtgtg ttgaagactg gccaagagct 1800 cttccaggaa ctctttttgc tcttcgtacc agtaaacatg ctacaacgaa atattcgccg 1860 ttttttcttc tttatggacg ggagcctaaa ctgcctattg atgtggaaag tgattcaaag 1920 aaaaagaacg aaacagagtc atctgaaaac ctggagacaa acacagatgt aagttcaact 1980 ttgtcccgaa ctattgaggc ggagggcgaa ctcaatcaga acaatggaaa cgaatcttct 2040 ggtagaagca cgtgtaccac cagctcaact tccctctaca ccattacttt caacggtgaa 2100 ctgaaacaga atgaaaagcc agatttttat ggaagaaaca caacaaatac agatctaggt 2160 tcaacaactt cgtttgaagc tgttggttgt gaagatgaac tcaatagcaa tggaagtgtc 2220 aacccaaaca tagtcaaact ggaagacaca gcagaatctt ctgactccga gatattagat 2280 tttgaagaca tgctaaagca aacagccgca tctaattcac atacatgtat tagcaatgca 2340 agcacgtgct cagactctgt ttattttcac ccaaacaaaa cagctatcaa aagacgcata 2400 catatcatgg aagacataag aaagggtaca acagaaaatg tcatgaaaaa cattcgcaag 2460 gctcaggaac gacagtcaaa acagtacaac aaacggagag ctgggcagac gtttgaagtt 2520 ggtgattctg tgttgctgta caatttgagg cgagctgaca gaaagggggg ggggggggct 2580 aattgctgga ctggacccta tgaaattaac agacagttgt caaacggtac gtatgaaatc 2640 aaacaaaatg gacttgtatt aaagaacaag gcaaacgcgc agaatttaaa accctattaa 2700 aaaaaggtca agaatgaaaa tgaatcagaa gatgaagtca gaaacaacag gacgaaacaa 2760 cctgaagcgt ccgtgaaaca gtgggatgag agaagagaaa gcaattccat tacgtctgaa 2820 gatgacgtga taattgacag tgtagaagat ggtgaagctc caaaatttaa tccaacagat 2880 catgtgtgga gattgcagaa aacagctgaa ctaggtcttc catctccaaa acccatgcca 2940 gatcgagagc caagcaacca tcttggttcc ccaaataaca caaactctgt tcgtggagat 3000 ggaaactgct tttttcgagc aatgattgta gaactaacgg gacttgagga ctatcaccaa 3060 gaccttcgat cattcacggt gtcattcatg agagcaaacg cacactcatt tgctggctac 3120 ttaggccaaa acgtcaatgc ttacttgaat gactcaaaca tggaagttgt cagcacctgg 3180 tccacagatg cggaaatata tgcaatggca acgttgcttg acacagtgat ttacatatac 3240 actctcttcg gaccaacacg acgttggatt cccttcaagc caggattttc gagcggcaac 3300 ttaagtcaaa gcaaatccga aaatccagca ctttatctga ctaacctctg tgctcattat 3360 gaaagggtag taggcgttga gcactgaaag tagacagata attgctgggt tttgggattt 3420 gcttccactt ctgtttgaag aaacgtttac atattaaatt aaaagtgttt caggggtaaa 3480 acacatactt tatctaaagt agagtattct ttatttcctt gtcctccgct ttgtcgttgt 3540 taactgtttc tgcttatttc gttgtttgtt tttgcatttt tctacgtcat ggtctctcct 3600 gttcgcgcta ccaaaaaagg tttaataatt cgagtgtaat tcactgcata taagtctgat 3660 ggccttaaaa aagacaaccc aacccccccc ccccccaaca aaaaaaccca caaaaaaaac 3720 aaacaaaaaa caacaacaac aaaagcaaga caaacaaaca aaaataacta caacaaacaa 3780 acaaaaaaat tcaacaaaaa caacaacaac caaacaaaaa caacagcaac aacaacgcat 3840 cactattatc attattatca ttattattat tatcattatt attattatta ttattattat 3900 tattattatt attattatta ttattatact caacagaaaa caaacaacaa cattggcaaa 3960 aacatgtcta tgtttaaaat acaatccatg acaagtcttt cctgaaaaac aacaagaagc 4020 accacattta ttaatgatcc tttaatttag atacaacaat aaccgtgttt ttctttcaaa 4080 tgcaggcatt taatatcttc tgggggaaaa tactgaatga attactgagt gatctcatcc 4140 tttgggctga gtaactcgat ctttctggta acagagttac ttcccttaac cgtttttcag 4200 gggagtgcgt ttggcgtgga gggagtgcgt ttcgcgtgga agtagatttc gcgtgggtac 4260 acgccaaagc gacttcccgc cacgccaaac gcacggggga gtgcctctgg cgtgggagtc 4320 agtttggcgt gctaca 4336 // ID BEL-177_AA-I repbase; DNA; INV; 6922 BP. XX AC supercont1.8; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-177_AA_; KW BEL-177_AA-LTR; BEL-177_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6922 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.8; Positions 2744433 2751354. XX CC Positions [5975-6532] - Integrase core CC 'CCGAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 29..6922 FT /product="BEL-177_AA-I_1p" FT /translation="MNLQSNVRQTRSRTRAFQQENADPTPPGEDDKVSNAS FT SDVSFVPSMLDNDGGELRCCVGCDRPNNAEKYMVQCQKCTLWYHFSCANVS FT TATVRMTTFECKKCRPSGDPLVPARTVSVASGISSSSSACRARIDRELKRL FT DEEKKLMEDLSRERIEMERALQERRLQEKLDRERQFIARKHELLSQQDGEE FT GSVRSMRSSRSSTKRTEDWVRQSVGRTAENSGRNVVEPITSIATISDLPDD FT VRSENPQIVQHSSTPLKATTSTVPPLTGNAIRDAGSEAKSIARTLGSITIG FT DESEVSLEGAVGGDADNEDKGPSNLPFVDLQPYTDLLKVEEVPPAGVDPRI FT NRFGSHCYKRWSVQTGELRSQIALQQQQHTEAERRAMQELAVKHQVDIDER FT RQREVDLVHRIKSLQLQHNAELKLVRNSEEGLRVQLNKRDSEWLDLKNQIQ FT ILEKRLVEEKERMCTSEKDLHDQLMESKRECEALRLQVAELESEIQCLREA FT ELQLQSQIDSSRQRENEAIRLRRVAEKEYWDLHDEVQQIINRNEDHSTSGD FT CSPPLPPPPLSWFESMNASNTDTSSLPHPPPPPICVDDGLPYDRRETIANV FT GAFGERINIPAPIPFTSTTVHPEVHPQLATHHLVPPYVSNQCGPSPQQIAA FT RQVVTKELPVFSGDPIDWPLFISSYQHSTDTCGYTNSENLLRLQRSLRGSA FT KDSVSSFLLHPSTVPQVLSTLQQLYGRPEQIVNNMIAKVRATPPPKPDRLE FT TLVSFGLVVQNLCGHLKAVGLERHLANPILLQELVDKLPATVKFSWALYQE FT QVPVVDLNVFSEYMAKISSAASGVTQLASFPQKAPKDERSRQKERSFVNTH FT VSMNQPKANRREEVNKSVTNNEKQNERDAGNINVKSCSMCNAGSHQIEHCT FT SFKSLDLDGRWKAVKANKLCARCLTSHACWPCKGEVCGINDCPKRHNRLLH FT FEPPAVAKATNAVVTVHRQLSSSTLFRILPVTLFGRNGKVDTFAFLDDGSS FT VTLVERSIAEALGVNGEIETLHIEWTGGINKTIVGAEVVAMEISEAGGNKR FT YKLSEVYTVDNLGLPRQTTDYTELAARFAHLKKLPVKSFRSAEPGILIGQS FT NSHLLATLKLREGKLDEPIATKTRIGWAVCGSLRNAQSNTTHRQLHIAEPT FT LGDLHEYVRRFFDIESLGVAIVPAVIGTEEQRALHILEATTRRLPGDKFET FT GLLWKHDYVEFPDSWPMAERRFKCLEKRLAQKQDLYESVRRQIADFKAKGY FT IHEATATEMEGFDLRRTWFLPIGVVVNEKKPGKVRVIWDAAAKVDGVSLNS FT MLLKGPDLLTPLLSVLFPYRERQVAVSADIKEMFLQILIRQQDRSALLFPY FT RESPQLPMSIMVSDVAIFGAACSPAHSQYIKNLNATEQEVELPRGSAAVKK FT KHYVDDYVDSFDTSEEAFEVAKEVIEVHRRAGFYIRNWMSSDKSVLEKLGE FT VSQEPSKAMLPEKDISFERVLGLAWMPEEDVFTFSLKFCEKVQALLESSQI FT PTKRELLRLVMSIYDPLGLVASFVIQGKILIQEVWRTETGWDCQIPLEIAP FT RWTEWISVLKKMDGLRIPRCYFPGYDPESFKNLELHVFVDASAQAFAAVAY FT FRIIDRGQIRVALVSSKTKVAPLRALSIPRLELMAALLGVRLRKTVEKNHS FT LKIQKTFFWSDSSTVCSWIKSDTRRYRQFVAFRVDEILSLSSVDEWRWIST FT KVNVADEATKWGKGPTYNVESRWFLGPAFLYSNDEDWTNNPAEGIDESDRE FT LREAYVCSHLIRQPLVNAERFSRFDRMLRSVAYVHHFVECLRTTKSRSADI FT VGLTSSDIQKAERTLWTLAQSDAFPDEVAILKKNLERSRENRKQIETSSSL FT SKLSPFVDEFGTLRVGSRGTEAQVLAYDTKYPIILPRSHRITDLLLDFYHR FT KYGHANDETIVNEVRQKFHVPRLRVEVRLARKRCMWCRVYKARPVAPKMGP FT LPAIRFEPGVRPFTYVGVDLFGPYLVKVGRSVAKRWVCLFTCLTIRAIHLE FT VVTSLSTDACKKAIRRFIARRGSPLEIYSDNGTNFVGASRELQDEIKKIHT FT ELGSTFTNVQTQWRFNPPAAPHMGGCWERMVRSVKSALGSVPVERKLDDES FT FATVLAEAESMINSRPLTFIPLETADQESLTPNHFLLLSSSGVREPEKFPM FT DAGMALRSSWNLVKHTLDNFWRRWLIEYLPTIIRRTKWFKDVRPIEVGDLV FT LVADENVRNRWIRGRVIRTIPGKDGVVRQAEVRTMGGILKRPATKLAVLDV FT VGTGDANPELEATREGG" XX SQ Sequence 6922 BP; 1957 A; 1522 C; 1844 G; 1599 T; 0 other; gattctcaaa gattttaatc ggaccataat gaacctccag agtaacgtgc gacagactcg 60 gtcgagaacg agggctttcc agcaggagaa tgctgaccct actccgccgg gtgaagacga 120 taaggtgtcg aatgcgtcgt cggatgtttc gttcgttcca tccatgttag acaacgatgg 180 aggtgagtta cggtgttgtg tcggttgtga tcggccgaac aacgcagaaa aatacatggt 240 acaatgtcag aagtgtactc tgtggtacca tttctcctgt gcgaatgtca gcacggctac 300 ggtacgtatg actactttcg agtgtaagaa atgtaggcca agcggtgatc cgcttgtacc 360 ggcgagaacg gtgagcgtag caagtggaat ttctagctca tctagtgcat gcagggccag 420 aatcgatcgt gagctgaaac gtctggacga agaaaagaag cttatggagg atttgagtcg 480 tgaaaggatc gagatggagc gagctctgca agaacgtcga cttcaggaga aattggaccg 540 agaaagacaa ttcatcgcac ggaaacatga gttgttaagc cagcaagatg gtgaagaggg 600 tagcgtacgt agtatgcgta gtagccggag tagtaccaag cgcacggagg attgggtcag 660 gcagtcagtt ggtagaacgg ccgagaattc cggccggaat gtcgtagaac cgattacgtc 720 aatcgctacc atctctgacc taccggacga cgtacgatca gaaaatccgc agattgtaca 780 acattcctca acaccactga aggctaccac atccacagtt ccaccactca ctggtaacgc 840 tatacgggat gcaggatcag aggcaaaatc gatcgcacgc acgctcggaa gcattacgat 900 tggagatgaa tcagaagtgt ctctggaagg cgccgttgga ggagatgcgg acaatgaaga 960 caaaggacca tccaatttgc ctttcgttga tctgcagcca tacactgatc tgttgaaggt 1020 tgaagaagta ccgccagccg gcgtagatcc gaggattaat cgtttcggct cccattgcta 1080 caagcgttgg agcgttcaaa ctggtgagct ccgaagtcag attgcgttac agcaacaaca 1140 gcacacggaa gcggaacgcc gggcaatgca ggaacttgca gtgaaacatc aggttgacat 1200 cgacgaaaga cggcaaagag aagtcgatct ggtgcatcgg atcaaaagtt tgcagttgca 1260 acataacgcc gagttgaagc tagttcggaa ttcggaggaa ggattacgtg ttcagctgaa 1320 caagcgagac agtgaatggt tagatttgaa gaaccaaatt caaattctcg agaagcgact 1380 tgtcgaagaa aaggagcgaa tgtgcacttc ggaaaaagat ttgcacgatc agctcatgga 1440 aagcaagaga gaatgcgaag ctcttcggct tcaagtggca gaactggaga gtgaaatcca 1500 gtgcctgcgt gaggcggagc tgcagctaca atctcaaatt gattcaagtc gtcagcgtga 1560 aaacgaagcc atccgtctgc ggcgtgtagc agagaaagaa tactgggacc tgcacgacga 1620 ggttcaacag atcatcaatc gaaacgaaga tcattcaaca agcggcgatt gttctcctcc 1680 gttaccaccc cctccgttgt cgtggtttga atccatgaat gcgtctaata cagatacttc 1740 ttctcttcct catcctcctc ctcctccaat ttgtgttgat gatggtttac cttacgatcg 1800 acgagaaacc atcgcaaatg ttggagcctt tggagaaaga atcaatattc ctgctccaat 1860 tccattcact tccaccaccg ttcatcccga ggttcacccg caactagcaa cccaccatct 1920 agtgcctcca tatgtaagca atcagtgtgg accatcgcct caacaaattg ccgctagaca 1980 agtggtcacc aaagagcttc cggtcttctc cggggatcca attgattggc cgctgtttat 2040 tagcagctat cagcactcaa cggatacatg tgggtacact aattcggaaa atcttctgcg 2100 gctacagcgc agtttgaggg gtagtgcgaa agactccgtc agtagcttct tacttcatcc 2160 gtcgactgta ccacaagttt tgtccactct acaacaattg tatggacgac cggaacagat 2220 cgttaacaat atgattgcta aggttcgtgc gacaccacca ccgaagcctg atcgtttgga 2280 gacgcttgtt agtttcggct tggtggtcca aaacctttgt gggcatttga aagcggttgg 2340 cctagaaaga catcttgcaa acccgattct gctacaagag ttggtcgata agctgccggc 2400 tacagtaaaa ttcagctggg cactctatca ggaacaagtt ccagtggtgg acttaaatgt 2460 gttcagcgaa tacatggcga agatatcctc agcggctagt ggcgtcaccc agcttgcgag 2520 cttcccacag aaagcaccaa aagatgagcg aagtcgtcaa aaagagcgat cgtttgtgaa 2580 cacacacgtc tcaatgaatc aaccaaaagc gaatcgaaga gaagaggtca ataaatcagt 2640 aacaaacaac gaaaagcaga acgaaagaga tgctggcaat atcaatgtca aatcgtgctc 2700 gatgtgcaac gcagggagcc accagattga gcattgtaca tctttcaaaa gcctggattt 2760 ggatggcagg tggaaggcag tgaaggcgaa taaactttgt gctcgatgcc tcacttccca 2820 tgcctgttgg ccttgcaaag gcgaagtttg cgggatcaac gactgcccga agcgacataa 2880 tcgtttgctg cactttgagc cgccagcggt ggccaaggct accaatgcag tggtaacggt 2940 acatcgtcaa ttgtcttcat caacgctttt ccgcattctg ccagtcacgt tgttcgggag 3000 aaacgggaag gtggacacgt tcgcgttcct cgacgacgga tcgtcagtga cactcgtcga 3060 acgatcgatt gcggaagctc ttggagtcaa cggagagata gaaacgctac atattgaatg 3120 gaccggcggt atcaacaaga caattgtagg agccgaggtc gtagcgatgg aaatatccga 3180 agccggagga aacaagcgct acaagttgtc agaggtgtac actgtagaca atctcggttt 3240 accaaggcag actacagact acacagagct tgcagcgagg tttgcccatc tcaagaaact 3300 accagtgaaa agtttcagat ccgctgagcc cggaatcctg atcggacaga gcaactctca 3360 tctgttagct acattgaagt tgcgcgaggg aaaactggac gagccgattg ctacaaagac 3420 gagaattgga tgggctgttt gtggtagcct gcgaaatgca caatcaaata cgacgcacag 3480 acagcttcac attgcagagc caacactagg ggatctgcac gagtatgtcc gccgcttttt 3540 cgacatcgaa agcttgggcg tagcaattgt gccagccgtg ataggcacag aggaacaacg 3600 ggcgctgcac attcttgaag cgactacacg acgattacct ggtgataagt tcgagacggg 3660 cttgctctgg aagcatgatt atgtggagtt tccagacagc tggccgatgg cagaacgacg 3720 tttcaagtgc ctggaaaaac gtttagccca gaaacaagac ctttatgaaa gcgttcgtcg 3780 tcaaatagca gacttcaagg ccaaaggata tatccacgaa gcgacagcaa cagaaatgga 3840 aggattcgac ttgcggcgaa cctggttttt gccgattgga gtcgtcgtga atgagaagaa 3900 gccaggaaag gttagagtta tttgggatgc tgcagcgaaa gttgacggcg tttcgttgaa 3960 ctcaatgttg ctgaagggcc ccgatttgct tactccactg ctgtctgtac tgtttccata 4020 ccgagaacgt caggtggctg tgtcggcaga catcaaagag atgtttctgc aaattttgat 4080 tcgtcagcag gatcgcagcg ctttgttatt cccttatagg gaatccccgc aactgccgat 4140 gagtatcatg gtgtctgatg tggcgatttt tggcgcagca tgctcgccag cacactcgca 4200 gtacataaag aatttaaacg ctacagaaca agaagtagag cttccacgag gttccgcggc 4260 agttaaaaag aagcactatg tagacgatta cgtagacagt ttcgatacgt cggaagaagc 4320 atttgaagtg gcaaaagaag taatcgaggt gcatcggcga gcaggattct acattcggaa 4380 ttggatgtcg agtgacaaaa gtgtactgga gaagctagga gaagtaagtc aggaaccgtc 4440 aaaggccatg ctacctgaaa aagacattag ttttgaacga gtgttggggc tggcatggat 4500 gcctgaggag gacgtcttta ccttttcgtt gaagttttgt gaaaaagtgc aagccttact 4560 ggaaagtagt cagattccca cgaaaaggga actgcttcga ttggtgatga gtatttacga 4620 ccctttaggg ttagtagcat cgttcgtcat ccaaggaaaa atcctcatcc aagaagtatg 4680 gcgtacggaa acaggttggg actgccagat cccattagaa atagcgcctc gctggaccga 4740 gtggatatcg gtactcaaga agatggacgg gttgcgtatt cctcggtgct attttccggg 4800 ctatgatcca gaaagtttta aaaacctgga gctacacgtg ttcgtggacg cgagtgctca 4860 agcatttgca gcagtggcat acttccgcat aatagatcgg ggacagatca gagtcgcact 4920 tgtttcctct aaaacaaaag ttgctccgct tcgggcactt tcgattcctc gactggaact 4980 gatggcagcc ttgctgggcg ttcgattgcg gaaaactgtc gagaaaaacc attcattgaa 5040 aatccagaaa accttttttt ggagtgattc atcgaccgtt tgctcatgga tcaagtcgga 5100 cacgcggcga tacagacagt tcgtcgcttt cagagtcgac gagatattga gcctctcatc 5160 tgtcgacgaa tggcgatgga tttcaacgaa ggtgaacgtt gccgatgaag ccaccaagtg 5220 gggcaaaggt ccaacgtaca acgtagaaag tcgctggttc ctcgggcccg cttttctcta 5280 cagcaacgat gaagattgga cgaacaaccc agcagaaggt atagacgaga gtgatagaga 5340 attacgcgaa gcgtacgttt gcagccatct aataagacaa ccgttggtga acgctgaaag 5400 attttcaagg tttgacagaa tgctacgttc agttgcgtat gtccatcatt tcgtggagtg 5460 tttgcggaca acgaagagtc gatccgcaga tattgttggg ctgactagtt cggatatcca 5520 aaaagctgaa agaacgttat ggacgttagc gcagtcagat gcgtttccgg acgaagttgc 5580 tatcctcaag aagaacttgg agcgtagccg agaaaaccgg aagcaaatcg aaacttcaag 5640 ctctctttcg aaattatcac catttgtaga cgagtttgga acgttgcgag ttggcagcag 5700 gggaacagaa gcacaagtac tagcctatga tactaagtat cccatcattc tgccgagaag 5760 ccatcggatc acagatttgc tgttggattt ctatcatcga aagtatggcc atgctaacga 5820 cgaaactatt gtcaacgagg tgcgccaaaa atttcatgta ccaaggctgc gggtggaagt 5880 tcgattagca agaaagcgat gtatgtggtg tcgagtttac aaagcaaggc cggtagctcc 5940 aaaaatggga cctcttccgg caatacggtt tgaaccaggt gtgcggccat tcacctacgt 6000 gggcgtagat cttttcggtc cgtatttagt gaaagtcgga cgaagcgttg caaaacgttg 6060 ggtatgcctt ttcacctgcc ttacgatcag agctatccat ctagaggtag ttacaagttt 6120 gtcgacggat gcgtgtaaaa aggcaatcag aaggttcata gctagaaggg gctccccatt 6180 ggaaatctac tccgataatg ggacaaactt cgtcggagca agtcgagagc tgcaagacga 6240 aatcaagaag atccatacgg aattaggcag tactttcaca aacgttcaaa cccagtggcg 6300 gttcaaccct cctgcagctc ctcatatggg gggttgttgg gaaaggatgg ttcgttctgt 6360 gaaatcagct ttaggatctg ttccagtaga aagaaagcta gatgacgaat cgtttgctac 6420 agtacttgct gaagcggaga gtatgatcaa ttcccggcca ttaactttca tccctctgga 6480 aaccgcggac caagaatctt taacaccgaa tcactttttg ttgctaagtt caagtggcgt 6540 gcgtgaacca gaaaagtttc caatggatgc agggatggct ctacgaagca gttggaactt 6600 ggtgaaacat acactggata acttctggcg acgatggttg atagagtact tgcccactat 6660 tatccgtcgc acaaaatggt tcaaggacgt acgaccaatc gaagttggag atctagtgct 6720 cgtcgccgac gaaaatgtcc ggaataggtg gattcgtgga cgagtgatcc gaaccattcc 6780 gggaaaagac ggagtggtac gtcaagcgga agtaaggact atgggaggaa ttttgaaaag 6840 acctgctacg aagttggctg tgctggacgt tgtaggaact ggtgacgcca atccggagtt 6900 agaggcgaca cgggagggag ga 6922 // ID Copia-1_SI-I repbase; DNA; INV; 4245 BP. XX AC AEAQ01003789; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_SI_; KW Copia-1_SI-LTR; Copia-1_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01003789; Positions 5947 1703. XX CC Positions [1722-2225] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 306..3236 FT /product="Copia-1_SI-I_1p" FT /translation="MRTLFMAHRLWDIVNGSTPMPEAGAARETWSVQNANA FT MYLLSSTLEPEQKRPMLTCENACEMWKKLARIHEQKSASNKLLLSTRLYEY FT RMSASDSITQHITKVQNMAAQLLDVGETVTDTVIMAKILGSLTPKFAKFQT FT AWDNMPPDMQTLQNLEERLLREEARMTENDESESAFVVSKKSKEKSKSRVS FT SREAKNKKDGKDIRDRKDKKKVRCFRCQGLGHFARECKKDKKEGNRGTSES FT SDCAFVVESGKSGAGKSGFGEIPTDLAREILKADKKDCWISDSGASQHMTF FT RREWLRDFRELPMGEHEVALGDDKVCYALGVGTVDIEKFVNGKWIASYIKT FT VFYVPELKKNLFSVGACARKKIGVYFFDEYVKFDNRGFTAGYGVIQTNNLY FT RMLFKTKCAEDENEASVAKTSFLAWHERLGHVNVRAMRKLVSEKLIDGIEI FT TDGAEDFCGACQEGKSQRKAFKKTRVRAGTEPGEVIHSDVCGPMSVESIGG FT SRFMVVFKDDATNFRYVYFLKHKNEVLEKFKELDRLIKNRFGRNIRVLRSD FT NGLEYKNRAFDEYTDKRGIEREYTAPYTPEQNGKAERDNRTLVESARTMLL FT AKGLPKNLWAEACNTAVYLANRAGASSMRVGATPYELWMGVKPNLHHLKIF FT GSEAYVNVPQIKRTKLDARAKKMIFVGYDSTSSNYRIFDPVSKRVSVSRDV FT TFRETTGSVKLSKPKDAREVILPKVEREALVPAEGDSFDEEEEDDKVFENA FT VDDEQRDEGRQQVNEPVRKLRDRSLIKKPSRYDAEAAECDIPITYREAVES FT KDAGKWREAIDSELESHEKNGTWHLVKRDPKMKPIDSKWVFKKLKNEKGEI FT RRFKARLCARGFMQEKNIDYTETFSPVVRYDSLRVLLVASKNLEIAQFDVQ FT TAFLYGTLDEEIYMETPEGLNMDEETREEQVCRLVKSLYSSSHRDAGTKSS FT VRFCESSGSKKHPRISVYSWER" XX SQ Sequence 4245 BP; 1234 A; 860 C; 1306 G; 845 T; 0 other; acgggttatg ggcccagtca cgctgactgc gaggcgtttg cggaaacgag tgtgaatcgt 60 tgagttgtcg agcgtgcgaa tgcgtcgcgc gaaagagaaa aaaatcgcga gtgagtgaaa 120 acgtggtgcg cgcgaagcgc cggtagagac acgtgagact gcagggggcg gccatattgg 180 catcgtgttc gcgagtgaat tcaaagcgct gtgcgagcga aaagtaatcc ccgatagctg 240 aagcgatctc agtaaagaat atcacgaaat tcgatggtac ggattaccag agctggaaat 300 ttgagatgcg gacgttattt atggcccatc ggttatggga tatcgtaaat ggctcgacgc 360 cgatgccgga agcgggggcc gcgcgtgaaa cgtggtcagt acagaacgcg aacgcgatgt 420 atttgttgag ctcgacgctc gagccggaac aaaagaggcc catgttaacg tgtgagaacg 480 cgtgtgaaat gtggaaaaaa ttagcgcgaa tacatgaaca gaaatcagca tcgaataagc 540 tgttgctgtc gacgcgttta tacgagtatc gcatgagtgc gagcgactca ataacgcagc 600 atattacaaa ggtgcaaaat atggcagcgc agttgctcga cgtgggcgaa acggtgaccg 660 atacggtgat catggcgaag atcctgggga gtctgacgcc gaagttcgcg aaatttcaga 720 cagcgtggga caacatgccg ccggatatgc agacgttgca gaatttagaa gagcggttgc 780 ttcgagaaga agcaagaatg acagaaaacg acgaaagtga aagtgcgttc gtggtttcaa 840 agaaatcgaa agaaaagtcg aaatcgagag tgagttcgcg agaagcgaaa aataaaaagg 900 acggtaaaga cattcgagac agaaaagata aaaagaaagt gcggtgtttt cggtgtcagg 960 ggctcggaca ttttgcgcgc gagtgcaaaa aggacaaaaa agaggggaat cgtggtacga 1020 gtgaatcgag cgattgtgcc tttgttgtcg agagcgggaa aagcggtgcc ggtaaaagcg 1080 ggttcggcga gataccgaca gacttggcgc gcgagatact gaaagccgat aagaaagact 1140 gctggatttc cgacagcgga gcgtcacagc acatgacgtt ccgacgcgag tggttgcgag 1200 actttcgcga gttaccgatg ggagagcacg aagtcgccct cggagacgat aaagtatgtt 1260 atgcattggg agttggtaca gtcgacatcg agaagttcgt gaacgggaag tggatagcat 1320 cgtatatcaa gacagttttt tacgtaccag aactgaagaa gaacctattt tcggtcggcg 1380 cgtgcgcgag aaagaaaatc ggggtgtatt tcttcgatga atacgtgaag ttcgacaacc 1440 gaggtttcac cgccggttac ggtgtgatac aaacaaacaa tttgtatcgg atgctattca 1500 aaacaaagtg cgcggaggac gaaaacgaag cgagcgttgc gaaaacgagt tttctggcgt 1560 ggcacgagcg actcggacac gtgaacgtac gagcgatgcg taaactggtg agcgagaagc 1620 ttatcgacgg catcgagatt actgatgggg cggaagattt ctgcggagcc tgtcaggagg 1680 gcaaatccca gcggaaggcg ttcaagaaga cgcgcgtgcg tgctggcaca gagcccggcg 1740 aggtcataca ttcggatgtg tgcgggccga tgtcagtgga atcgatcgga ggatctcgat 1800 tcatggtcgt gttcaaggac gatgcgacga attttcggta cgtgtatttt ctcaaacaca 1860 agaacgaagt cctagagaag ttcaaggaac tcgatcggct cataaagaac aggtttggaa 1920 gaaacatacg tgtgttacga tcggacaacg gtcttgagta caagaatcga gcattcgatg 1980 agtacacgga caagagaggt atcgagcgag agtatacggc cccgtatacg cccgaacaga 2040 acgggaaagc tgaacgggac aatcgcacgc tagttgagag cgcgcgtacg atgctgttgg 2100 cgaagggact tccgaagaac ttgtgggcag aggcttgcaa tacagcggtg tatttagcca 2160 accgagccgg tgcatcgagc atgagagtgg gtgcaacccc gtacgagctg tggatgggcg 2220 tgaagccgaa tctgcatcac ttgaagatat ttggttcaga ggcatacgtg aatgtgccgc 2280 aaataaagag gaccaaatta gacgcacgag cgaagaagat gatcttcgtg ggatacgata 2340 gcacttcgtc aaactaccgt atatttgacc cggtgtcgaa gagagtgtcg gtgtcacgcg 2400 atgtgacgtt ccgggagacg accggaagtg tgaagctgtc aaaaccaaaa gatgcacgag 2460 aagtgatctt gccgaaggtc gagcgagaag ctctagttcc ggcggaaggc gacagcttcg 2520 acgaggaaga agaggacgac aaagtctttg agaatgctgt ggacgacgag cagcgcgatg 2580 aaggtcgaca acaggtcaat gagccagtgc gaaagctccg agacaggagt ttgatcaaga 2640 agccgtcgag gtatgatgcc gaagcagcgg aatgcgatat tccgatcacg taccgtgagg 2700 ctgtcgagtc caaggatgca ggaaaatggc gagaagccat agactccgag ctagagtcgc 2760 atgagaagaa cggtacgtgg catttagtca agagggatcc gaagatgaaa cctatcgatt 2820 cgaagtgggt cttcaagaag ctcaagaacg agaaaggaga gattcgtcgg tttaaggcac 2880 ggctgtgcgc caggggcttc atgcaggaga agaacatcga ttacacggag acgttctcac 2940 cggttgtgag atacgattcg ttgcgagtgc ttctcgtggc gtcgaagaat ctcgagattg 3000 cacaattcga cgtgcaaacg gcgttcttgt acggaactct ggacgaggaa atctacatgg 3060 agacacccga gggactcaac atggatgaag aaactcgaga agagcaagtg tgtcgacttg 3120 taaagtcact gtactcaagc agtcaccgag atgctggaac caaaagttca gttcgtttct 3180 gcgagagttc cggttccaag aaacatccgc ggataagtgt atattcgtgg gagaggtaaa 3240 tggcgagacg gtttatctcg ctctcttcgt cgatgacggg ctagtcatcg gaagaacgaa 3300 aggagtcatc gaagtcgtgt ttcggtactt gaaacgagca ttcaagatca tggtcggaga 3360 tggaagtcta tttgtcggga caaaaataga gagagatcga aacaagaggt ccatcttcgt 3420 gcaccagact ctgtatgcga agaagatttt cgagagattc gggatgtccg aggcaaagtc 3480 ggtgagtgta ccgagcgacc cgaatacaac gttgctgtcg gtgtcggaga acgacgaggg 3540 gatcgagaag gttccatact gagaagctgt tggatcgctg atgttcttgt cgattgtgtc 3600 gaggccagat ctggcatttg ctgtaaactc ggtgagcaaa ttcctaaata aacacaatcg 3660 ggaacattga caagctgtaa aacgcatcct atcgtacgtc gcagggacga tggactacgg 3720 aatcgagtac cgaggaagcg aaaacgacga aataaccctg gagggttatt cggactccga 3780 ctacgcacgc aacgtagaaa ccaggaggtc aacgaccggc tatgtgttcc aagtggcagg 3840 tgggcctgtc acatgggcga gtcaacgtca aaagctcgtg acgttgagta ccacggaggc 3900 agagtacgtg gcggcgtcca ttaccgcacg tgaagctgtg tggctgagaa agctacttga 3960 agaagtcaac tgtctcagcc agggggcaac gacagtctac gtggacaatc aaagttcgat 4020 tcgtctggtg aggaacccag agtttcataa gagaacgaag catatcgacg tccgatacca 4080 ctttgtccga gagcgagttc agaagaaaga actcgaggtc aactatatcg agtcggcaaa 4140 gcagaaagcc gatatcttca cgaaggcgtt cccaaagaat cgtttctgtg agctgagaga 4200 agaaatcggt gtacgagagc gaccacgcac gaacggcgga agtat 4245 // ID BEL-651_AA-LTR repbase; DNA; INV; 502 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-651_AA_; KW Pao_Bel_Ele228; BEL-651_AA-I; BEL-651_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-502 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 502 BP; 197 A; 81 C; 103 G; 121 T; 0 other; tgttcgcgaa ccaactggga accctgcgca tactaggtga gtcccggttt cgccctttcg 60 caatgtacga aaatcacgtt gacagggcgt acattgtcaa cgggtggaaa ataaataaaa 120 ctaagaaact aaaaggtaga agtaaaacga gaaaagaaat taaaaagcca ggtgttcaga 180 aagtagcgga aaactaaaat aaactgtgaa ttatttgatt taaatagcgt aacggaaggt 240 aaagaagtga aataataaaa agtgaattat acttataatt atctggtcta acaggaatta 300 ggttgctggg agaatttgag aattctagtt ggaaggagtt agccagcgag gtacaccaaa 360 atgtaagacc cattattaat aatctaacaa atgaaacaaa aattattaaa atcaaaaata 420 aatttcagct tgaagctgcg tttaccgttg ctgaaaagag gtgttaatcc atcgccagaa 480 tcccgaaatt ctccccccaa ca 502 // ID TEC2B repbase; DNA; INV; 705 BP. XX AC M73024; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Euplotes crassus transposon like element Tec2-2, inverted repeat DE region. XX KW Mariner/Tc1; DNA transposon; Transposable Element; EUPTLED; TEC2B; KW transposon-like element. XX OS Moneuplotes crassus OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Spirotrichea; Hypotrichia; Euplotida; Euplotidae; Moneuplotes. XX RN [1] RP 1-705 RA Krikau F.M. and Jahn L.C.; RT "Tec2, a second transposon-like element demonstrating RT developmentally programmed excision Euplotes crassus."; RL Mol. Cell. Biol 11, 4751-4769 (1991). XX DR GenBank; M73024; Positions 1 705. XX SQ Sequence 705 BP; 235 A; 146 C; 89 G; 235 T; 0 other; tagagggata ataatattga aacaaattaa ttaataatta taattataaa tattgaaaaa 60 attatttcca aactatcctt tctaacttac cctcttccac tcctaactat tctctattac 120 caaatgtgat gatattaact ataatcatgt ctaatagctg attaacttca actatgtttc 180 aaacattaaa tacccttcta aatacctaaa ttctacccaa actataagga aatatcacat 240 ttagactata aattcttcac tggttcgcct gcggcttgtt aatttactct caccccagac 300 cccctacagt agcgtagtaa aggcccacca acttcatttt agaggcttta gtatcctctg 360 aatatggcct tagatcctat gtctaacacc tacattggat acaaggcaaa gtatatggca 420 ttctgaggac ttatagtaaa tttcgatttt aatcttaaat ttaataatta atctaaatta 480 gactacttta aaatagggag atttgcctat tttcaactca tcataactac taaggcagaa 540 ctaactagga gctgtcagtt gcagaatggt cttgcccttg ccaacttaac agactgacac 600 cattagacat cttgtcctct ggcaatcttt gaattattag atcatatggc cgagtcctca 660 gtggactctc ccttttcaat tcaaacagtg aaattctagc taatt 705 // ID Gypsy-5_BM-I repbase; DNA; INV; 4020 BP. XX AC nscaf3063; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_BM_; KW Gypsy-5_BM-LTR; Gypsy-5_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4020 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 985-985 (2010). XX DR Genome; nscaf3063; Positions 960604 956585. XX CC Positions [3063-3581] - Integrase core CC 'GGCAA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2244..3803 FT /product="Gypsy-5_BM-I_1p" FT /translation="MTPEQQEAFNSCKKSLSEAVMLAHPGPAAQLAIFTDA FT SDTSMGAALHQKCKDSWKPLGFFSRRLSSAEKKYSPYDRELLAIYNAIRHF FT RHMVEARPFTVFTDHKPLSFAFSHNREKCSPRQFRYLDYISQFTTDIQYIQ FT GSQNVVADALSRIEEISRIIDYTALAASQDSDLELQQLLQKGSALDLKKIK FT IPDSDKEIYCDISTPNPRPYMTVAFRKLIFDALHKLSHPGAKATARMVTRR FT FIWPGAQKDCRQWVSECTDCQKNKVHRHTQSPTSTFPVPSQRFSHVHMDIV FT GPLPISGGFRYCLTVIDRFTRWPEAYPLEDITAEACARAFVAGWVSRFGCP FT HRVTTDRGRQFQSELFRSVTNILGTQHRPTTAYHPACNGMVERLHRQLKAA FT IRCHQNSSWTEVLPLVLLGIRSAWKEDIKASAAELVYGEPLRLPGEFFVPS FT NAIPALDITDFASRLRLQMAKLSPEPASRHGQKTFYIPKDLATAEYVFLRQ FT DAVRRSLEAPYTGPYKVIERGNKT" XX SQ Sequence 4020 BP; 1185 A; 1043 C; 884 G; 908 T; 0 other; attggtgacc ccgacgtgat ctgatagaag atgagtcgtg agaagggaca acttccatcg 60 ggaagcagag acgcttctgg agatcgtaca gaaggaggca tgggtgaaac cgtaggaatc 120 tacaaagttg gagtgaaaat accgccattt tggcccgagg agcccggatt gtggtttgcc 180 caggttgagg gtcaattttt aatttccggc atcacaacag atgcgacgaa attctaccat 240 gttgtcgcac aactcgacca gcaatatgca gcggaggtaa aagatattat tatttcccca 300 ccagctgaaa ataaatacat aaaattaaaa actgaactga caaaaagatt gtcggcgtca 360 caagaaaaaa aagtcaaaca gttattgatg cacgaagaat taggagatcg aaaaccctcg 420 caattcctgc gacatttaca aacattagca ggacctaacg tacctgattc gttcctacgc 480 acgctatggt cgagccgtct gccacacaat attcaaaccg taatagcgtc acaaacggat 540 gtaccactcg agactgttgc agacctagca gatcgcattt atgaaattgt tccgataccg 600 gctgcacctg tgatagcagc agctgatggc agtagtgtta taaacaacat ggcaaaacaa 660 gttgaagcat taactaaaga agtggcatca ttgaaggcac ggctgtctcg cggtccaatt 720 tggcatcggc gtaatcagcg aagccagtct caagagaatc atcgttctcg ctctcgctct 780 cgctcacgat ccggagattg gaatataagc tactgttggt accacaaccg atttggaggt 840 aaagccacac gatgtacaac accttgcaaa ttcaacgcga cttcgggaaa cgcacccggc 900 agccagaggt agcggctaac ggctgcccat ctaagccagg ccgcttattc attacggata 960 gaaacagcaa gttgtcattt ctcgtcgata caggatctga cctttgtgtg tatccaagat 1020 ctgcgctgag tcaacctcgt actaaaaccg actaccaact ctatgccgca aatggttctc 1080 cgatcgctac ttacggttgg atccacttac aactcaacat tggtctacgc cgcgcctaca 1140 gatggcgttt cgtggtggcc gatgtttcta aaccaataat tggtgctgat ttcctccgtt 1200 tttatgattt gctcgtagac gtgaggaacc accgcctggt cgacgggttg acatctctct 1260 gtgctacagc tacaccagaa aaagaggcaa atgacattgc atcaattaaa gtgatatctg 1320 gcgactccat atatcaccag ctattgcgtg aatatcctgc aattacacgt ccagctggaa 1380 cccaacgaga aatcaaacac ggaacagagc accacatccg cacgacacca ggtcctcccg 1440 tttccagtcg accgagaaga cttccacctg accgactaaa gattgcgaag caggagttca 1500 acgaaatgaa tacgaaacgg tacggcgcgt aggtccgaga gttcgtgggc atcaccgctt 1560 cacctcgccc ccaaaaaaga caacggctgg cgaccttgtg gggactatcg ggcactgaac 1620 gcgcgtacaa tcccggataa atatccaatc cgccacatcc aagacttctc gcagcaacta 1680 gcagggacga aggtctactc aaaaattgac ttagttaaag cttaccatca gataccgatt 1740 cacaaggccg atatcccaaa ggcagcaata actacaccgt tcggattatt tgaattccca 1800 tatatgacct tcggactgcg aaacgcggca cagacctttc aaagattcgt agacgaaatg 1860 cttaaggatc tgccattttg ttacgggttc ttagatgata ttttacttgc ttctccagat 1920 gagacctccc acctccaaca ccttcgacag ctgttccagt gcttagcaga atatggcatg 1980 ctcattaaca ctaacaagtg tgaatttggt aaggcttgta tcaccttcct gggccatgag 2040 gtaagcgcag aaggcattaa gccgctacca gaaaaagtgc aggcaatcat gaactaccca 2100 ccaccgacaa cagtaaaaga acttcgccga tttttaggaa tgtacaattt ttaccgtcga 2160 tttgtaccta atgcggccaa atttcaaagt ccattaaacg acatactagc ggacctaaaa 2220 cgaaaggaac acagagcatt acaatgactc ctgaacaaca ggaagctttc aatagctgca 2280 agaaaagtct ttccgaagca gtgatgctcg cacatccagg acctgctgca cagttggcca 2340 tattcacaga cgcgtccgat acgtccatgg gagccgcact tcaccaaaag tgtaaagaca 2400 gttggaagcc actggggttt ttttcacgac gcctctcttc tgcagaaaag aagtatagcc 2460 catatgatcg tgagctgctg gctatataca acgcaatacg ccattttcgc catatggtag 2520 aagcaagacc attcacagtt tttacggatc ataagcccct gtcatttgct ttctcacata 2580 atcgtgaaaa atgctctcca aggcaatttc ggtatctgga ctatatatcg caatttacaa 2640 cggatatcca gtatatccaa ggaagccaaa acgttgtagc tgacgcgctc tcccgcatag 2700 aggaaattag tcgcattata gactacaccg cactcgcagc atcacaagac tcggatctag 2760 aattacaaca gctgctacag aaaggttcag cactagatct gaagaaaatt aaaataccag 2820 actccgacaa ggaaatttac tgcgatattt ccacaccaaa tccacgacca tatatgacag 2880 tggctttcag gaaacttatt tttgacgcgc ttcataaact gagccatcca ggtgcaaaag 2940 ccactgcaag gatggttacg aggaggttta tttggcctgg agcacagaaa gattgcagac 3000 agtgggtcag tgaatgcaca gattgtcaga aaaacaaagt ccaccgccat acccaatcac 3060 ccacttctac ttttcctgtt ccttcccaac gcttttctca tgtccatatg gacatcgtgg 3120 gaccattacc gatttcaggt ggctttaggt actgtctcac agtcatcgac cgttttactc 3180 gttggcctga ggcataccct ctcgaagaca tcacggcgga agcatgtgcc cgcgccttcg 3240 tcgctggctg ggtgtcccgc ttcggctgcc cacatcgcgt cacgacggac agaggacgtc 3300 agtttcaaag cgagctcttc cgcagcgtca caaacattct gggtacacag catcgtccta 3360 ccacagccta tcacccagcc tgcaatggga tggtggaacg gctccatcga cagttgaagg 3420 cggccattag atgtcaccag aactccagtt ggacggaagt tctacctctc gtactcctgg 3480 ggatccgtag cgcatggaag gaagacatca aagcttcagc agcggagttg gtttacgggg 3540 aacccttacg actgccaggc gaattcttcg taccatccaa cgccatacca gcactcgaca 3600 taaccgactt cgcatcgcga ctcagactcc aaatggcaaa gctgtctcca gaacctgcaa 3660 gtagacacgg tcagaaaaca ttctacatac caaaagacct ggcaacggca gagtacgtgt 3720 ttctccggca agatgcagtc aggcgttcac tggaagctcc atacaccggc ccctataaag 3780 tcattgaaag aggaaacaaa acttaaaatt cttacaaacg aaagagaaac gactgtttcc 3840 attgaccgcc tgaaagcagc acacatggcc aaagaggatg ccaaaacggg gaacaccaac 3900 atcagcacta ttccacctgc agttgatcca accccttcaa tgaagacgag atcaggccgg 3960 gccgttcact ttccagacta ctatcgacct caataacctt cggtctcagc gggggagtag 4020 // ID CENSAT_CC repbase; DNA; INV; 379 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Cicindela campestris centromeric satellite sequence - a DE consensus. XX KW SAT; Satellite; Simple Repeat; CENSAT_CC; Centromeric repeat; KW satellite sequence. XX OS Cicindela campestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Adephaga; Caraboidea; OC Cicindelidae; Cicindela; Cicindela. XX RN [1] RA Gallin J. and Vogler P.A.; RT "Evolutionary dynamics of a satellite DNA in the tiger beetle RT species pair Cicindela campestris and C. maroccana."; RL Genome 46(2), 213-223 (2003). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Cicindela campestris centromeric satellite sequence - a RT consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 379 BP; 124 A; 75 C; 54 G; 126 T; 0 other; taagtccaaa tggtgcaccc ctcgctaact ataaaaggta gacggttgaa atttacaata 60 tacgtcgaaa gtaaaattct cttttgaatt cagttttatt gccgcagcaa cgtctcaaat 120 tttgataaaa ctggttccaa tgcaaatcat gtcgtgtata acaatttcat aaatcaccaa 180 cttccttcta attttcgacg agaaaaaagt ttcgattttc ccaactttta gtaagtattg 240 ctgtagagca atgacaactc tacacaaacc gtaaatttca acctcgtcac cctcatagtt 300 tccgagaaaa tcgactttat tttatcgttc atttttaagt gtatatttag aaattagcaa 360 gcttaaacgg cgtatttct 379 // ID Gypsy-172_AA-I repbase; DNA; INV; 5154 BP. XX AC supercont1.188; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-172_AA_; KW Gypsy-172_AA-LTR; Gypsy-172_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5154 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.188; Positions 1621300 1616147. XX CC 'ACAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 820..3999 FT /product="Gypsy-172_AA-I_1p" FT /translation="MLQIEEDVGQLIRLANNLKKSINTKYRIETLKAKLKY FT ANELYADVEQRLIQHEEEIPLTSLNFLIKSSRDANRIITSIINDKIAQNSK FT KELINKENLTLTNGTQTASNMALPNGNQPVPNMAQFDIKTASSLLKQYDGS FT PENFHAFEDAVNMLVDLIEADYHPILLRFIKTRLTGKARNGLPEHCATIPD FT LVANVKTRCEEKINAESVIAKLKSVKVKENMETFCSEIETLTNKLQSIYVG FT QQIPEQVASKMATKVGVDALISNINNQETKIILKVGTFGNIKDAIQKVQEN FT VLPNSHIFQLKRTNNTNQDYRNNNSRGQGNRSNQRNNTGRQIHSSFQTLTC FT FNKGHTVEMLIEDNLRGDIVLPPRCEVFRQIYVTLDKGDYLFPSSEILPGI FT FNANTIINSTNSVVKFINTTNEVIKLNKNFSNHCVPLNNYNIFTFTKNQSK FT DRAQQLLEELNLSHVETDVKSKLMKLCNKYSNLFALKTDTLTCNNFYTQQI FT HLNDPTPVYIKNYRTPEAHLAEINSQVNKMLNDGIIQPSISPYNSPILLVP FT KKSQSGDKKWRLVVDFRQLNKKIIADKFPLPRIDEILDHLGRAKYFSTLDL FT ASGFHQIELDENSKKLTAFSTSSGHYEFNRLPFGLNISPNSFQRMMTIALS FT GLPPECAFLYIDDIIVIGCSINHHLMNLEKVFEKLSKYNLKLNPSKCIFFC FT ADVTYLGHHISNVGIQPDKSKYNVISNFPIPKNADDVRRFVAFCNYYRRFI FT PYFSELASPLNALLKKNVKFEWTDACQYAFENMKMKLLSPQILKFPDFSKK FT FILSTDASKIACGAVLTQDYDNIEMPIAYASKAFSKCEKNKSTIEQELIAI FT HWAITHFKPYLYGRRFTVKTDHRPLVYLFSMKDPSSKLTHMRLDLEEFCFD FT IEYVKGKNNVGADALSRIIINSDELQSLSILPVQTRSVTKKLNNQRNETTI FT SDSSNTEIDHLRVYESVNNLDAFDIPKTIFDIRNNKMNLQIFTKNLKRVIA FT HATLFYKNGSINIAECLQTINEMAKKTKIQELAMKRDDTIFTMIPTSSGIQ FT TSV" FT CDS 4067..5092 FT /product="Gypsy-172_AA-I_2p" FT /translation="MIQKLISENHDTPTGGHVGINRLLNKLRSKYYWPNMK FT NTITKYIKNCFKCIQNKHTIKTNEIFSKTTTPSKCFDTISIDTIGPFTRST FT KGNRYALTIQCDLSKFVIIAPIPDKQAKTLAKALVENCLLIFGCPVIIKSD FT LGTEYKNEIFNNICNLLEIQQKFSTSYHPQTIGSLERNHRCLNEYLRHFIN FT EQHDDWDSWLMYYTFSYNTTPHSEHLFTPFELIFGKQANLPSQFMNNVDIE FT PVYNYESYLSELKFKLQTVCAKAKILLDKSKTSRNQIQSQSANPLEKTIGD FT TVWLKVENRRKLDPVYSGPFKIINIEHPNVIIENIISRDKQKVHKNRIIK" XX SQ Sequence 5154 BP; 1995 A; 908 C; 856 G; 1395 T; 0 other; tggcgaccga cagtaaaaac aatgaaagta ccaaataacg taataattgt gttaatagtt 60 tcgctgatag gattttgcct gatagtatat atttattatc aactactggt tgaacttaag 120 gcaatcacaa gaagtgcttc attaatcaac cagctaagtg attaaaagtg aataagtgaa 180 tacctacagc aagaaaatgg gatctggaca atccaagcca gaggagatag tgattcaaca 240 acagcagcag cagcagcagc agcaacagcc attaacagac tcaggaatcg gacctatgca 300 tgttattgca acctgcatga tagtgctgtt aatagcaata tttttaaaag tggtgtggcg 360 cataattatg cgtgaagttg aaacacgtcc acgcagggcg cctagtcttg ccaatattgt 420 ctaaaaactg ttaagcgagt ttaaatgaat gtggtgacta cccaacagtg ggtgaaatag 480 acaaaataaa ctggacagaa gaacaaaagt gcgcgaaaca gggtgaaggt aaaaatcccc 540 gtcggaagaa ttcgagatgc agtaaaagtg aaaccccaaa gtcgggaaaa atcgtcaaaa 600 gtgctggtgg tgatcgtcgc gagaagcaga tgagccagaa aaaggaaaaa tcgacaacat 660 caagcgtgcg ttgaggagtg cgcctccgca gctaggaatt acaacaagcg tgcgttgagg 720 agtgcgcctc cgccgcaagg gaccacaatg gaattgaaaa caagataagt acagttaaaa 780 agttaattag ataagtaaag ttaagctagt gagtttttaa tgttacaaat tgaagaggac 840 gttggccagt tgatcagact agccaacaac ctaaaaaaat ctataaatac aaaatataga 900 attgaaaccc taaaagctaa actaaaatac gcgaatgaac tctatgccga tgtggaacaa 960 aggcttatac agcatgaaga ggaaattcct ttaacaagtc taaatttttt gattaaatcc 1020 tcaagagatg ctaatcgaat aataacttcc attataaatg ataaaattgc acaaaatagc 1080 aaaaaggaat taataaacaa agaaaaccta acattgacaa acggaactca aactgcctca 1140 aacatggctc tcccaaacgg aaatcaacct gtcccaaata tggcacaatt tgacataaaa 1200 acggcgtcca gtctacttaa acagtatgac ggatcgccgg aaaattttca cgcttttgaa 1260 gacgcagtaa atatgttggt agatctaatt gaagcagact accatccaat tctgctaaga 1320 ttcatcaaaa cgaggttgac gggaaaagcc agaaatggtt tacccgaaca ttgcgcaacc 1380 attccagacc tagttgccaa tgtcaaaacc cggtgtgaag aaaaaattaa cgctgaaagc 1440 gttatcgcga aattaaaatc agttaaggtt aaggagaata tggaaacatt ttgctcagaa 1500 attgaaactt taacaaataa actacaatcg atttatgtgg gacaacaaat cccagaacaa 1560 gtggcttcta aaatggccac aaaagttggt gtagacgctt taatttctaa catcaataac 1620 caagaaacga aaattattct caaggttggc accttcggaa acatcaagga tgccatccaa 1680 aaggtacaag aaaacgttct gccaaactcg catatttttc aattaaaacg caccaataat 1740 acaaatcaag attacagaaa caacaattcc agaggacaag gcaaccggag caatcaaaga 1800 aataataccg gtagacagat tcacagttct ttccagacac ttacgtgttt taacaaagga 1860 cacacagtgg aaatgttaat agaagataat ctcagaggtg atatagtttt acctccgaga 1920 tgcgaagttt tcagacaaat atatgttaca ttagacaaag gtgattatct ttttccttct 1980 agtgaaatac tcccaggaat atttaacgca aataccatta taaattcgac aaattcagtt 2040 gtgaaattta ttaacacaac taatgaagta attaaactga ataaaaattt ttctaatcat 2100 tgcgtacctt tgaacaatta taatattttt acctttacaa aaaatcagtc aaaagataga 2160 gcacaacaac ttttggaaga attaaactta tcacacgttg aaaccgacgt taaaagtaaa 2220 ttgatgaaat tatgcaataa atacagtaat ctattcgcat tgaaaacgga cacattaaca 2280 tgtaataatt tctacactca acaaattcac ctcaatgacc caactccagt atatatcaaa 2340 aattaccgaa ctccagaggc acacttagcc gaaattaact cccaggtgaa taagatgtta 2400 aatgatggca taatacaacc ttcaatatcg ccatataact ctccgatcct tttagttccg 2460 aaaaagtcgc aatcaggaga taaaaaatgg cgtttagttg tcgattttag acaactaaat 2520 aagaaaatta tcgctgataa atttcctctt ccaagaatcg atgaaatttt agatcatctt 2580 ggaagagcta aatatttttc aacccttgac ttagcctcag gttttcatca aattgaactt 2640 gatgaaaatt cgaaaaagtt aacggccttc tccacatcat caggtcatta tgaatttaac 2700 aggttaccgt ttggtttgaa catatcacca aacagtttcc agaggatgat gactattgct 2760 ttaagcggtc tccctccaga atgtgcattt ttgtacattg atgatataat agtaattggt 2820 tgttctatta atcatcattt gatgaattta gaaaaagtct ttgaaaaatt atcaaaatac 2880 aatttaaaat tgaacccgtc gaaatgtatt tttttctgcg cagatgttac ttatttagga 2940 catcatatca gcaatgtggg tatccaacca gacaaatcga agtacaatgt catttcgaat 3000 tttccaattc ctaaaaatgc ggacgatgta agacgattcg tcgcattttg taattattac 3060 cgacgtttta taccgtattt ttcagaatta gctagcccac ttaacgcatt actaaagaaa 3120 aatgttaaat ttgagtggac tgatgcctgt cagtacgctt ttgaaaatat gaaaatgaag 3180 cttttatcac cacaaatatt gaaatttcct gatttctcta aaaaattcat tctaagtact 3240 gatgcttcca aaatagcatg cggagcagta ctaactcaag attatgacaa catagaaatg 3300 ccaatagcat acgctagcaa agcattttcc aaatgcgaaa aaaacaagtc aactattgag 3360 caagaattaa ttgccataca ttgggcaatt actcacttta aaccatacct atacggcaga 3420 cggtttacag tgaaaactga ccaccgtccg ttggtatatt tattttctat gaaggatcca 3480 tcgtctaaac tgacacatat gcgattggac ttagaagaat tctgttttga catagaatac 3540 gttaagggta aaaacaacgt tggtgccgat gctctctctc gtattattat caattcagat 3600 gaactacaaa gcttatcgat attaccagtt caaacaagat ctgttacaaa gaagctaaat 3660 aatcaacgaa atgaaacaac tatttctgat tcatctaaca ccgagattga tcacctcaga 3720 gtttatgagt cagttaataa cctggatgcg ttcgacatcc cgaaaactat ttttgacata 3780 agaaataata aaatgaattt acaaatattt accaagaact tgaaaagagt aattgcacat 3840 gcaacattat tctataaaaa tggttcaatt aatatagcag aatgcctaca aacaataaac 3900 gaaatggcaa agaaaacaaa gatacaagag ctggcaatga aacgagatga tacaatattc 3960 acgatgatac ctacctctag cggtattcaa acaagcgtgt aatgaaacat taaaagatat 4020 aaaaattgtt atatatgtac cggtacaaat agttcaagaa aatgaaatga tacaaaaatt 4080 gatatctgaa aaccacgaca ctcctacggg aggtcatgtt ggtataaata gattacttaa 4140 taaactaagg tcaaaatatt attggcccaa tatgaaaaat actataacta aatatattaa 4200 aaattgtttt aagtgtattc aaaataagca taccattaaa acaaatgaaa ttttttcaaa 4260 aactacaaca ccatcaaagt gttttgatac tatttcaata gatacaatag gaccttttac 4320 taggtcaaca aaaggaaata gatacgcatt gactattcaa tgtgatctat caaaattcgt 4380 cataatagct cctataccgg acaaacaagc taaaacacta gcgaaagcgc tggtagagaa 4440 ctgtttactc atatttggtt gtcctgtaat aataaaatct gatttaggaa cagaatacaa 4500 aaacgagatc ttcaataaca tatgcaattt attggagata cagcaaaaat tttcaacttc 4560 atatcatcca caaaccatag gaagcctcga acgaaatcat cgttgtctta atgagtatct 4620 tagacacttt attaacgaac aacatgatga ttgggattct tggctaatgt actatacttt 4680 tagttacaac acaactccac actccgagca tttatttaca ccgtttgagc ttatctttgg 4740 gaagcaggca aatttaccaa gtcaatttat gaataatgta gatattgaac cagtatacaa 4800 ctatgaatca tatctttccg aattgaaatt caaattacaa actgtttgtg caaaagcaaa 4860 aattttatta gataaatcaa aaacttctag aaaccaaatt caatcacaat cagcaaatcc 4920 attggaaaaa acaataggtg acacagtatg gctaaaagta gaaaatagaa gaaaactcga 4980 tcctgtatat tcaggaccat ttaaaattat aaacatagag catccaaatg tcataataga 5040 aaacattata tcgagagata aacagaaagt tcacaaaaat agaataatca aataaattca 5100 aattgtaaac gaataatctt actttgtaat atcattcttt tctgaggggg aagg 5154 // ID Copia-101_AA-LTR repbase; DNA; INV; 162 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-101_AA_; KW Ty1_copia_Ele55; Copia-101_AA-I; Copia-101_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-162 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 162 BP; 33 A; 40 C; 38 G; 49 T; 2 other; tgtacccgat atacgaaaga ataaagacgt ttagagtaaa cggtcccgcg agttaaaakt 60 ttttttccct ctatcacgat acgcattgtc cggaktgttc cggtgttccg ctggcctgtt 120 gggtgtccgt attcgtcggt cggtccactc tgttgcccta ca 162 // ID Polinton-8_NVi repbase; DNA; INV; 16996 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.06, Created) DT 15-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-8_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-16996 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(6), 1157-1157 (2009). XX DR [1] (Consensus) XX CC TIR is 122-bp long. XX FH Key Location/Qualifiers FT CDS 1482..637 FT /product="Polinton-8_NVi_1p" FT /translation="MAEAILLANQEKQDVVAEEFNSERAKNHSRPLLFAST FT YLLNKSGSKKLYIGLEYDDSTQKYQPIIELKNTQYSRGLRIDIESWLELEN FT KLEDISTYFKSNSVALCIQQQERIKLKKMDIILTTSYGMKSVMFVLQQQLQ FT QTQESLYVQKKKRALMPSITMQKRTFDGLLNIVVCVNERYRRLERVLNEIN FT ECKNLLRDELAEILTLYDYHNTERSYDIIKSIVIDKRKYLKENILKKLSPK FT NSCFLDNYFEMIFTEMIVLHSDVFHDEVYNIFCNQNAQKI*" FT CDS 14795..13707 FT /product="Polinton-8_NVi_6p" FT /translation="MMQEIEKQYFDPGHAVGFTGARNLISANKKKIPASRI FT KDWLTRHDTYTLHKPIRKKFPRLYYDVLGLDHVWEADLIVLSSLKSHNDNY FT SYLLVVIDVLSKYTFVEPLRDKTVKEVTKAFKQILNKNKNRCPYMLQTDAG FT KEFVGKEFQKFLSETGIKFRVARNPDVKAAVVERFNRTLKERMFRYFTYKN FT TKRYIDVLQAIVQSYNNSIHSTIKMKPAAVTVYNAQEARKNMLEKSLKQQV FT IRKRPRYKVDQYVRISRERNIFEKGYEKGWSEEIFKIVKVKKRQNLFIYEL FT VDLEGEEIEGFFYPEELSLVHSERVSREEYKINEILKTRGKGAKKEYFVSW FT IGYPDKFNSWILASDVTKL*" FT CDS 12054..11023 FT /product="Polinton-8_NVi_4p" FT /translation="SSGRFFFNQKLVTPSNTLYPYRAYIETLLNYGPDAKT FT SHLSTSLWGADTAGKMDEKPATDCANKGLLSRQKFTSDGKVVDLLGHLHID FT VCNQDRFLLNGVEMRVRLVRAKDAFCLMDASDKNYSVHIKEASLLVRRAKI FT NPGVLVAHAKTLATTTAKYPLTRVEVKAFSMHNGILGECLDNVILGQLPKR FT LIIGFVNNKAFNGDRLLNPFNFQNFGINYLSLYIDGQQVPGKPLQPSFSAA FT APLYVDCYHTLYSGTGIHFANEENDIDRTEYPDGFCLFAFDLTPDMSAHCG FT THWSLVRNGNVRIEVRFDKPLTHTVNCIVYAEYDNILEIDSMRQVILDYNS FT *" FT CDS 16935..16111 FT /product="Polinton-8_NVi_8p" FT /translation="MARRGKSSWVSRWGVMTQERVSPGGAPWGGDRGGQEG FT SWTAYSTFFVSVSLYAKMDTCFMHPWTMICAGPTSAGKTVFVQRFLKHLDV FT LANVKFDRIIFYYSEWQESYRVENVEIEFREGLPRNEDYSNDPEKRKLLII FT DDLMRESANNGTIIDLFSKGSHHRNASVIYICQNIFHKGQRDISLNANYMV FT LFKNPRDRAQIQHLARQLYPEDSKFLQEAYYDSTIKPYSYLVIDLKQSTPE FT ELRFRSCIFPDDEYHYVYIPKYSSSQYSKLNVV*" FT CDS join(10819..9977,10007..9441) FT /product="Polinton-8_NVi_3p" FT /translation="MDFIIDIQAFHDKEGLFLPKEIAVIAVDRNFIAHWIV FT KPPCDYNELPKGIISTNSYLSCYHHGIEWYDGEAALEDIYPSLRYIAREAH FT TIYTRGYQKANLLERILGRQITNLEEYSCPSFRNLSSVDDHICMYHGGKSE FT HLTCALAYTFRVRSWLRKALYGHTEIDMNEQRKKKKSPIEKPPRRALVDGA FT TSSSSSSSSSGSSVAPRKISVNNNNISKITTSVTATPAEVIESCFAVIVEE FT EEAAAAASRARGTGGGGSGANRKTKRGRSRRRALLLETLRKKSPTTRNLTM FT NSLQIQQALRGISAQTVGVFPADRLPRVWTRPVAIIANTQEHHLPGAHWIA FT IFVDKNGHGVYFDSYGIQPTVSYHLDGLRRNCTLFTWSTSRLQSFDSDVCG FT NYCVDFLYFMCRYNCAMDFSKLFCGDTRSNDKLCMVIFRKIMKSIKKKKFK FT SVSNSCSYCIQSCKSESCKKKKRI*" FT CDS join(16078..15695,15691..14867) FT /product="Polinton-8_NVi_7p" FT /translation="MAKRASRESRALIRKKNNASPFEEKYFVLLQALRHSN FT KEQRLALLRTADEKLVKYICECALNVLKGVVCLNVTEKNKLKKYKTILRNL FT TKVTKRKKNSWKSKKRIIVQKGGSFLGFLLPPILDLFLNAKMERARKMVLI FT PQESLQKMQASVKTASINSLPAAAGVQGVPEQEKKLTSESIQTPGDKFSRL FT DSEMRQILDSGKYTTDSDKYRDFLQVLQRYLFFVDEKRRFDDKKNLADGKS FT DEKDQMSDEYILESVPKLYKNKTKLLLNHLKNNSNRISWSATGVVKIDGEK FT IKNSNIIDLVNDVARARKNIKAAGRQQFGELLFTTQIPREFIGNSEFFNLN FT GSLSRSGFENGLEKSWKKSRREEDEDDDEEEIDDFQNSKSDSSLIMDTIVA FT KKAKRRQQ*" FT CDS join(13704..12925,12934..12377) FT /product="Polinton-8_NVi_5p" FT /translation="MNQSEFTVVLPSNSSMTFFPDNTTTSFSVRLPRAIDL FT HEKWQVGLSEIHIPCTTLHLRYKDTLISPAEGKDNFHFQHGVYKSVQGLID FT AINEGFKEYHDNQTAKELLYDEKGGYVTLREYPELVKKKSRLGKPVQRILG FT FGNGTFRITLESETKPYRDYVASQPATLAHAIPDQLFIYSNICEPCIVGDA FT HAPLLRIVNVEARDYKYGSTIVKRYSPVNYIPLLNSRFQIIDIDIRDQFGS FT SIPFEFGTLTVALHFRREFKRILKMNYYDTYYSAQSGGGGGGIASVYVGSR FT GQRGHGIGSFLGGLFRRALPFLAKGARAVGREALRAGINVLDDVTENNMSF FT KDSLNNRLAESGLKLKRKANEKISELMEGSGYKGMVLKRASQLQYVSRARK FT TANKKKKRRTSKKKTKAVKRLKSGRKKKKNTRGKKVKSAVNKKRRKSTKKN FT *" FT CDS join(3484..3936,3861..7637) FT /product="Polinton-8_NVi_2p" FT /translation="MDISNVSDDESNTPSHATPASPPSSSSPSTSPPASPP FT APTQLDPHYQVGRGIDRGDLKKVVILNESTRRARNFFTAARDVEFGFIITP FT NIDNPIVYLRSALLEIYRYLVSGLNENILIGVTVNSDRFLLEATHGFQCER FT SAISTLTIFGTVIFARGDAWFPVRKIRDFDFNDLWNRIGSVTQSNEAFFID FT ETLRVNVTYSDLPRGGNGLQESKLVDALNRQSILTIKNNDNLCLPRAIACG FT LVFLEKAESERTIWEKNWRKIAFNSMAHQRTCAIEISQKAGVVVPEEGCGY FT TEFQKFQLYLLDYGVALMVYYIESLGRGEGALFNGKAFFDGVNIKYTKKIN FT IAFHRDINHFNLILNLTALCVKKHQRYYCEDCNTVYTRPFHHKCENLCISC FT NTSPACQALGAAITCDDCHREFKNPQCFANHKAPGSYTMSKSKNARRSICQ FT FVKYCTFCHYVYDIDSNKHNCKNMYCTNCKAVHEKNANCYMQIRPLKKNSR FT VVTTTTNFHPETLFVFYDFETTQNTVLREEGENSVYLHVVNLCVAQFACSS FT CLNENDVSQFCNYCGLRQHVFTKEPISSFLNLVLRKKTNFKNIICIAHNAK FT AFDSQFVLRDIVENRIVKESGSVPSLIMNGRKIILISFGKTKFIDSINFMP FT MKLAALTASFGLPEEAKKGYFPHLFNTDDNADYEGPYPAAEYYAPDAMSSK FT EREHFFNWYNEVSIREEFNMRREIVSYCVQDVTVLRLACLKFRQIFLDCAN FT LCPFTESVTLAGSCSLAYARNFLKDNLIGLISNEGYVRVDRHSQKAVEWLL FT WVEQQVGREIIHAGRGREYTLFGRIKVDGFLADLQNPETGTVYQFHGCYFH FT GCPICYPTRRDAPLLDRTTLNQKYESTLAISAKIRSNGFILVEMWEHDFNS FT NIATLNAFKRLIENHPIAHNEILNPRDAFYGGRTGNIATLYKVEPGEKIKY FT VDVCSLYPFVCKTGKFPIKHPRVYNSTDCLELTGPNFEYFDRVEGLVKCTI FT LPPRKLFHPVLPYRLHNRLLFPLCRTCCEILSKVPCTHVTPEARALSGTWV FT SDEIKKSLQLGYTILNISVIWQYEITQYDPKAGNQDDTTCGLFTDYINLFL FT KLKQESSGWPRNCTDEASKAAYIAYYKEKEGIDLDPQKIEKNPGMRSVAKL FT CLNSLWGKFGQRGNLPNTEIINSSEQFLQLLTSPEKEVGRIVLVNENVIYA FT ASKLKTEVATSARNTNVVIAAYTSAQGRLKLYEYLENLGERVLYYDTDSVI FT YVSKGERDEYEPETGSLLGQLTDELEEYGEGSYIETFVSGGPKFYAYRVRA FT PNGETFDTCKVKGIRLNYENSQKINFDSVLEMMENHEFDIIDAEEEEEGEE FT EVNEDRGANNCIYVRNFHIGRTSVNDIISFKQTKKCRAVLKKKTIC*" XX SQ Sequence 16996 BP; 4983 A; 3409 C; 3388 G; 5210 T; 6 other; agtagagagg ggcaagccca ggcagacagg taggggaggt aaaagagggg aggggtggcg 60 ccgtggctcg tcggggcaag tctagttggg cccctctggt gggaggtgat gacgtaagag 120 cgtcgcacgt gcggtggcgc gtccccggcg gggggtactg cagggcccgt gtgggtgtag 180 ggatcgcacc cgcgctacag gaggtcattt gcacaatgaa aaggacaata agtgcgttta 240 tctgttggct agctacgaca tgcgcattga cccttttttt tatatagcat accgagctat 300 gtaagtggcg tkaaagcaat gagctgcagc agaatttttt tcctacagga tgtgtagagg 360 gatcgaccct ttttttgaca aggtgaaatt tcgtgcgatt tggaggaaag ctacgagaaa 420 aagcagtgag agaaatgcta ttatgcgtgt ttctcggaga aataaaactg aaggtttttt 480 tattttacaa gctttgttaa aatgaatttt tgtactaaag attgacgtat atagtatgtt 540 ttcaatagaa cattttttaa ataatcgggt agttgtgagg aggatggtga aggttaaaaa 600 tttttaaaag caaaattatg taaataaatt ttttatttat attttttgag cattttgatt 660 acagaaaata ttgtaaactt catcgtgaaa aacgtcacta tgaagtacta tcatttcggt 720 gaaaatcatt tcaaaataat tatcaagaaa acaagaattt ttcggtgaga gtttttttaa 780 aatgttttct tttaaatatt ttcttttatc aataactatg cttttaataa tatcatagct 840 tcgttcagta ttatggtagt catacaacgt taaaatttca gcaagttcgt cgcgtaacaa 900 atttttgcac tcattaattt cattcaatac gcgttctaaa cgccgataac gctcatttac 960 acacactaca atgttcaata atccatcaaa tgtccgcttc tgcattgtta ttgatggcat 1020 caaggccctc ttcttctttt ggacgtaaag agattcttga gtttgttgta gctgttgttg 1080 gagcacgaac atcactgact tcatcccgta tgaagtagtt agaataatgt ccattttttt 1140 cagttttata cgttcttgtt gttgaataca taacgcaaca ctgtttgatt tgaaatacgt 1200 cgaaatatcc tctagtttat tttctagttc aagccagctt tcgatatcga ttctgagccc 1260 tctggaatat tgagtattct ttagttcaat aatcggttgg tatttttgtg tggaatcatc 1320 gtattccaag ccgatataaa gtttcttcga tcccgacttg tttagaagat acgtagaggc 1380 aaacagaagc ggacgactgt gattttttgc tctttcactg ttgaattctt cggcaacaac 1440 atcttgcttt tcctggtttg caagtaagat agcttcagcc atgttgttgt tgaataaagt 1500 aaataaaaca ctgtcttgaa atgcacaacg ttttgttcaa ttagttagac gttgttgtta 1560 aataaagcac tatcttgaac tgcacagtga atttacagcg atgcacaagg taaaaaaatc 1620 gagagttttt ccagttagga cgtcaatgaa aaaaaaaatt ccacaaagat cgagtaactg 1680 gatagagcga gaggtagact gtgtatggat gctcacggtc aagtgtaccc tttttttata 1740 gcggtttggt gtcactgcga gaaaatcccg gctrctggtg gtggtggtaa acggttcaga 1800 gtagagggtg gggggtgcgt gcggcccaat acaagggatg acctgactcg gcgtgagagt 1860 ggtggtggta gcgtaggtga gccacaggtc cagaagaggg gatactcagg gcatccctgt 1920 attatgtaaa aaaagggcac tgatgcacct gtattgtgga aagtgtatga aaaaaaatgt 1980 caactgttct acactgagga agaaaattct ttcgagggtg gacgagatct cgggaaaatc 2040 tagtggtcac tctgagtttc gcttgcgcag tgcaaactac gttccgtgtc taaggctaaa 2100 aatgtctctc cgaaaaattg tgtggcgtct cgttcactat ttagtggaca acaattatga 2160 cgtggagaga ttgtttgtgc aaaaaaagtg gcaaaagtgc gtgaaacaag caagcgagcg 2220 tggcttgctg aaaatcgcgg aaaaagtgtt tagccacgca cggccacgac aaataggccg 2280 gtgctacccg ctgtatagga aaaatgaaga gctggttcga ctagccgggc aagtgtgtga 2340 agcttgtcac cattgggact acacacgtaa tcgaggtcca ttacgtgtgc ttgcagagcg 2400 aaaccgtaag taaactctat gattctggat aaacttttct tttgttttgt tttttaaacg 2460 tgtggcacat tttaaaatat ttgtggttta caggccgtgt tgcactatcg ccgccagtcg 2520 cagcagcacc aacaccgccg ccggctgtgc aagcattacc agtgccggta tcagcagcag 2580 cagaagcaac tgcaacaatt gaggcgtgca tggaattact ggggactaat gccgatgatt 2640 gggtcgagtg gaatggcgaa tcttgctctc caacatcttc gcaagtcccg gtggcggata 2700 ggactataac gagcgaagag agagaagtcg aactaagcat acaaggtacg acggatgcgt 2760 ggatcgactg gagtggtgac atgctccagg tgccgccgcc gccgatcgat gtacagccct 2820 cgtcagcatc gtcaccgtct cttagaccgt cttcttctgc gccgatctcg gaagtagaaa 2880 atggtaagtt tcaaatttaa tatacattgc acgcacacac acacacaacg tagtaatgaa 2940 aaatatttaa aaataataat aacattttaa tttatatttt cagtcttcga agatctccag 3000 ggttattttg acgacgacaa cgaggatgat aacgatgtaa ttatcctgtc cgatcaaccg 3060 gcttttcctg accttggaaa tgtcaccctt gaaggggtga tgtttgaggc tgccgagatg 3120 ttcgagacgg ggtcagctgc gtacgctgat atcctcaatg caggactaga accgcaggcg 3180 caatcacagg gtcctcttta cgaggacata agtgacgaag aagccttgag catcatattc 3240 gactcgcaat tggtgcccga tgacagcatc gcttggctcg aggcggctgc tgctaatgcc 3300 gcygctgctg ctgctgcagg tccttcacag ataacatgtc tcgacgttca aggtgtgtaa 3360 tttatataaa taaaaaaaaa agaaattttg tcaacgatat tctgtaatta atttatttat 3420 agtattataa taaatgaaac aaattaaact aaaaaaaatt cttgtattta tttaaaaaaa 3480 gatatggaca tctccaacgt cagcgatgac gagtccaaca cacccagcca tgccactcca 3540 gcatcacctc catcatcatc atcaccgtca acatcacctc cagcatcacc tccagcacca 3600 acacaactag atcctcatta ccaagtaggc agaggaatag atagaggtga tttaaaaaaa 3660 gtcgttattt taaatgagag tactcgtcgt gctcgtaatt ttttcaccgc ggctcgcgac 3720 gtagaattcg gttttataat aacgccgaat atcgataacc cgattgttta cctgcgttcg 3780 gctttattag aaatttatcg atatttagta tcgggtttaa atgaaaatat tttaatcggc 3840 gttacagtaa atagtgatag atttttgctc gaggcgacgc atggtttcca gtgcgaaaga 3900 tccgcgattt cgactttaac gatctttgga accgtatagg tagcgttaca cagagtaacg 3960 aagcattttt tatagacgag actctccgcg taaatgtgac ttattcagac ttaccacgcg 4020 gtggtaatgg tctgcaggag tcaaagttgg ttgacgcgtt aaatcgtcaa tcaattttga 4080 ctattaaaaa taacgataat ttatgcctac cgcgggctat cgcatgcggg ttagtttttc 4140 ttgagaaagc ggagtccgaa cgcacgatct gggaaaaaaa ttggcgaaaa atcgccttta 4200 attcaatggc gcaccaaaga acatgtgcaa ttgaaatttc tcaaaaagct ggagtagtcg 4260 tgcctgaaga aggctgtggc tataccgagt ttcagaaatt ccaattatac ttgttggatt 4320 acggagttgc gctaatggta tattatatag agagtttagg ccgtggagag ggtgcgttat 4380 ttaacgggaa agcatttttt gacggagtaa atataaaata tactaaaaaa attaatatcg 4440 cgttccatag ggatattaat cattttaatt taattttaaa cttgactgcg ttatgtgtga 4500 aaaagcatca gcgatattac tgtgaagatt gtaacacagt atatacgcgg ccttttcatc 4560 ataaatgtga aaatctttgc atatcgtgca atacttcacc agcgtgtcag gctctcggtg 4620 cggccataac ttgtgacgat tgccaccgtg agtttaaaaa cccacagtgc ttcgcaaatc 4680 acaaggctcc tggctcgtac acgatgagta aatcgaaaaa tgcgcgacgc tcaatttgcc 4740 aattcgtaaa gtattgtact ttttgtcatt atgtttatga tattgactcg aataaacata 4800 attgcaaaaa tatgtactgt acgaattgta aagcagtgca tgagaaaaat gccaactgct 4860 acatgcaaat tagacctcta aaaaaaaatt cacgtgtagt tacgacaaca acaaattttc 4920 acccggaaac tcttttcgtc ttctacgatt tcgagacgac gcaaaacacc gttctaagag 4980 aagagggtga aaatagcgtt tacctacacg tggtaaattt atgcgtggcg cagtttgcat 5040 gctcgtcgtg tttaaatgaa aacgatgtat cgcaattttg caattattgt ggccttcgcc 5100 aacacgtttt cacaaaagag ccgatttcta gctttttaaa tttagttttg cggaaaaaaa 5160 ctaattttaa aaacatcatc tgcattgcgc ataacgcaaa agcatttgat tcacaattcg 5220 ttctacgtga cattgtagag aatcgaattg tgaaagaatc cggcagcgtt ccgagcttaa 5280 ttatgaacgg acgaaaaata attttaataa gtttcggtaa aacgaaattc atagatagca 5340 taaattttat gcctatgaaa cttgccgcat tgacagcctc gttcggtctg ccggaagaag 5400 caaagaaagg atatttcccg catctgttta atacagatga taacgcggat tatgaaggtc 5460 cttatccggc agccgaatat tatgcaccgg atgcgatgag ctccaaggag cgtgaacact 5520 tttttaattg gtacaatgaa gtatcaatta gagaagagtt caatatgcga agagaaatag 5580 tctcttattg cgtgcaagat gtaactgtct tgcgattagc ttgtttaaaa tttagacaga 5640 tattcttaga ttgcgctaat ctgtgtccat tcacggaaag cgtgactttg gctggctcgt 5700 gctctcttgc ttatgcccgt aatttcctaa aagataattt gataggtctt ataagcaacg 5760 aaggttatgt gcgcgtcgat agacatagcc aaaaggccgt agagtggttg ttatgggttg 5820 agcagcaggt aggcagggaa atcattcatg ccgggcgtgg acgggagtac acccttttcg 5880 gtcgtataaa agtcgacggg tttttagcgg atcttcaaaa tccagaaacc ggtacagtgt 5940 accagtttca tggatgttat tttcatggct gtccgatctg ctatccaacg aggcgcgatg 6000 cgcctcttct agatcggact actctcaatc aaaagtatga gagtacccta gcaatttcag 6060 ctaaaataag atcaaatggt tttattttag ttgaaatgtg ggagcatgac tttaattcga 6120 atatagctac tttaaacgct tttaagcgct taatcgaaaa tcacccgatc gcgcataatg 6180 aaattctcaa tccgcgagat gctttttacg gtggtcgcac gggtaacatt gcgaccttat 6240 ataaagtcga accgggtgaa aaaataaaat atgtcgatgt ttgctcttta tacccctttg 6300 tttgtaaaac gggcaagttt ccgataaaac acccgcgcgt atataacagc accgactgtc 6360 tcgaactaac tgggcctaat tttgaatatt ttgaccgtgt tgagggctta gttaaatgta 6420 cgattctacc gccgcgtaaa ctctttcatc ctgtgctacc gtatcgtctg cataatcggt 6480 tactttttcc cttatgcaga acttgttgtg aaattttgag caaagtccca tgtacacacg 6540 taacgccgga agctcgcgcc ttaagcggta catgggtatc ggacgagatt aaaaaaagtt 6600 tacaactcgg ttatacaatt ttaaatatca gcgttatctg gcaatacgag ataacgcagt 6660 acgaccctaa agccggtaat caagacgaca cgacatgcgg attgtttacc gattacatta 6720 acttattttt aaagttaaaa caggagagct cgggctggcc acgtaattgc acggatgagg 6780 ctagtaaagc ggcttacatc gcctattata aagaaaagga gggtatagat ctagatcctc 6840 aaaaaataga gaaaaatccg ggcatgcgct cggttgcaaa actttgttta aatagtttat 6900 gggggaaatt tggacaacga ggtaatttgc caaacaccga aataattaat tcgagtgagc 6960 agtttttaca attattaact agcccggaaa aggaagtagg tagaatagtt ttagttaacg 7020 agaatgtaat atatgcagcc agtaagttaa aaacagaagt agctacttcg gcgcgaaata 7080 cgaatgtagt gatagccgct tacacgtctg cgcaaggacg tttaaaattg tatgaatatc 7140 tcgaaaatct aggcgagcgg gtactgtatt atgataccga ttccgtcata tacgtatcaa 7200 agggtgagcg tgatgagtac gaaccagaaa ccgggtctct gctaggccaa ttaacagatg 7260 aactggaaga gtacggagaa ggctcttaca tcgagacctt tgtttcgggc ggtccgaaat 7320 tctacgccta tagagtgcgt gcacccaacg gcgaaacctt tgatacgtgt aaagtaaagg 7380 gcatccgact caactatgag aatagtcaaa aaataaattt cgactctgtt ttagagatga 7440 tggaaaatca cgaattcgat attatagacg cagaggagga ggaggaggga gaggaggagg 7500 taaatgagga tcggggcgcg aataattgta tatatgttag gaatttccat ataggtcgaa 7560 cgtcagtcaa tgacattatt agttttaagc aaacgaaaaa atgtcgcgct gttttgaaaa 7620 aaaagacaat ttgttaatca tcgatattct gtaccatatg gttatataca tgaataaaaa 7680 ataaaaataa aaattatatc ctattttatt tactcaaccc cttaaatcct ataaaataat 7740 agcatccctc aggcggcggc ggcgcggcgc ggtggcttgc tgatcagcgg gaccggaccc 7800 cccccccccc ccccggcacg cgctctagta ggctagtagc gagggcgcgt gcacgcgcgc 7860 aggtaaaaaa agacctagca gggccgctgg gtaggacttt tactcgacgc gcaggtaagc 7920 ggtttaaccc cggcgcaggt gaatgaccag cggtgtcgct tctcgcactg cagttgacct 7980 atattttaca aggagacatg cgccactata gaagcaacac ctgtagtggc gctctgaacc 8040 gaaaattcgt atacagcatg gatgattttc ttcactattt cctcctcctc ctcctyywtt 8100 cctctctcac cccacttatt tttttcgcta tagcaacact agatgtagag agagagagag 8160 agagagagag agagagagag agagagagag agagagagag agaggggaga aaccaaattt 8220 tctttttgct ctagtaaaat ctgtcatacg tagacagttt gtagaggtag caactcctgc 8280 atcatgaatc ctcgtggagt tcttgcacgg gacatggatc ctttgattca tacagttcat 8340 gaagtgagag tgtctgcgtt tcaaacacca tcttgtttcc atatacaatt aataaaagaa 8400 gatgccgcgt acgtgaaatt ggaggcagaa atgttaaatt attatagtac acagtcgttg 8460 ttatattttt gtgaagaaag ttttacaatt tcaatggaaa aatctgattc accagtgtta 8520 cacaaggaaa aactccacga gttaaaggta agaaaaagaa gaagaagaag aagaagaaga 8580 atgaggaatc caaatttatt ttttacgata atatagtgac agaatttttt tttcatttta 8640 gggaaacacg ataattgcag tgaaaataat gggaagatgg ttacgaggag taacagaaga 8700 aggatgcccg caacgatgcc gactaataga taatggtaca tgtgtgactc tgattgagtc 8760 aaacgaggaa agtggcattt atctgctaga agcccgattt cttgccttac ctccacgagc 8820 aattttagct ggacttggtt tcgtcatgcc catatcgaca aatggtctcc actggtcacc 8880 tcgttctgct gaaatatttc ttatggtcac cgtaaaccgt attctaagat gcgtttttgc 8940 acaaaaagga ttattcgaca attcagctta cgtatttcta agcagtgtgg gcaaggaagg 9000 tggaggtatg gttcatgtga atccaatgct tgtcgcattt gacacagcca tgaatgatct 9060 caccggaatg catcatccaa tctggccatc agatactttg cgttaataat atataaattt 9120 catataaaaa taataataat aaaagtaaac gaaaaatcta tcaaagttgt ttttctcttt 9180 ctgtctttta acctttttct tgacaagaat attaataaaa tttaagacgg aggaggagga 9240 ggaaataaat aaacatacgt atttcacgta aaaataattt gaaacacaac tacaattttt 9300 tcttttaaaa aaatttatta aatcagttga ttgttaaaaa aaatatacgt aatacatgag 9360 aaattcttaa atctagaata aatgcggtta tataattcac tcataaattt tgaatacgta 9420 gtaatgtaaa ttaacaaaag tcagattctt ttctttttct tacaagattc ggatttacaa 9480 gattgaatgc aataagaaca tgagtttgat acacttttaa attttttctt ttttatactt 9540 ttcataattt ttcggaaaat taccatgcat aatttgtcgt tactccttgt gtcgccacaa 9600 aagagtttag aaaaatccat agcacaatta tagcgacaca taaaatacaa aaaatccaca 9660 caataattgc cgcacacatc ggaatcgaaa ctctgcaacc gacttgtaga ccacgtgaat 9720 agtgtacaat ttcgtcgcaa tccgtcaagg tgatacgaca cggttggttg tataccgtag 9780 gagtcgaaat aaacgccatg accgtttttg tcgacaaaaa ttgctatcca gtgagcacca 9840 ggtagatgat gttcttgagt attagcaata atagcgaccg gtctcgtcca caccctgggt 9900 aaccggtctg ctgggaagac tccgacggtc tgtgctgata tgcccctcag tgcctgctga 9960 atctgtagtg agttcatcgt aaggtttcga gtagtagggc tcttcttcta cttctccctc 10020 tctttgtttt tctatttgcc ccactaccac cgccaccggt accacgagca cgactagcag 10080 cagcagcagc ttcttcttct tcaacaatga cagcaaaaca agattcgata acctcggcag 10140 gtgtggcagt aactgaagta gtaattttgc tgatgttgtt gttgttgaca ctgattttac 10200 gaggtgctac agacgaccct gatgacgacg acgacgacga cgacgatgta gcaccatcca 10260 cgagtgctct tcgaggaggt ttttcaattg ggctcttctt cttctttctc tgctcgttca 10320 tgtcgatttc ggtgtgaccg tataaggcct ttctcaacca tgagcgcact ctgaatgtgt 10380 aagccaatgc acacgtaagg tgttcagatt ttcctccgtg atacatgcaa atatgatcgt 10440 ccacactgga taagtttcta aaggatggac agctgtattc ttcaaggttt gtaatctgtc 10500 ttcccaaaat acgctctagt agattagctt tctggtagcc acgtgtgtaa attgtgtgtg 10560 cttctctggc aatatagcgt aacgacgggt aaatgtcctc caacgcggcc tcaccatcgt 10620 accattctat accgtgatga taacacgaaa gataagaatt agtcgaaatt attcccttgg 10680 gtaattcgtt gtaatcacac ggtggtttca caatccaatg agcgataaaa ttacggtcaa 10740 ctgcaattac agcaatctcc ttgggtaaaa ataacccttc tttatcgtga aaagcttgga 10800 tatcaatgat aaaatccatt gtagctatcg actgtttctt tctcaaatac tagagatgaa 10860 ctaacagccc tcaaaatccg gagaggttat ataggcaaac cctctccctt ctttacggtt 10920 tctccctctc tccctcctgc atacaacgca gaactttaac gaaaaagagg gaggaggcag 10980 agttctgtga ttcgttcgtt cgttcttact gttacgcaca tcttatgaat tgtaatctaa 11040 aataacttga cgcatagaat cgatctctaa aatattgtca tattcagcat aaacgataca 11100 atttacggtg tgtgttaatg gtttatcgaa acgcacttcg atacgcacat ttccattacg 11160 gactaaggac caatgcgtac cgcaatgagc agacatgtcg ggggtaagat cgaatgcaaa 11220 caaacaaaag ccatcgggat attctgttct atcgatatca ttttcctcat ttgcaaagtg 11280 tattccagta ccagaataaa gagtgtggta acagtctacg taaagaggcg cagctgcgga 11340 gaagcttggc tgtaacggtt tacccggcac ttgttgacca tcgatataaa gtgataaata 11400 attaataccg aaattttgaa agttgaatgg atttagtaat ctatcgccgt taaaagcttt 11460 attattgaca aatcctatga tgagtctttt cggtaattgc ccgagaatta cattatctaa 11520 acactcgccg agtataccgt tatgcatgct gaatgcttta acctcgacgc gtgtcagagg 11580 gtattttgca gttgttgtcg ctaatgtttt agcgtgtgcc actaaaacac ctgggtttat 11640 ttttgctcgt cgtacaagta aactagcttc ttttatatgt acactataat ttttatcgga 11700 ggcatccatt aaacaaaaag cgtcctttgc acgtaccaaa cggactctca tttctacacc 11760 atttaaaaga aatctatcct ggttacatac gtcaatgtgc agatgtccta aaagatcaac 11820 gacttttcca tcgctcgtaa atttttgacg acttaaaaga cctttattcg cacaatcagt 11880 ggcaggcttc tcatccattt ttcctgctgt atccgctccc cataacgacg tcgagagatg 11940 cgaagttttt gcatcaggtc cgtaattcaa taaagtttca atgtacgctc tataaggata 12000 cagtgtatta gacggcgtca caagtttttg attgaaaaaa aaacgtccac ttgactaaac 12060 attgagtgca agaaattgtt cacaggtgcc acactcgcat ccgtttcctt gttgtggggt 12120 gttatttgaa catttagttt gatcatagta tgagctaaat ctatatactc ttcgcctgct 12180 gcggcgacta aaaattctaa aggtgcttga tcggagagtg atgaaacagg cttgtattcc 12240 acacatgatg atgattcgat agtggtctgg gtcattggta gtgaaaacaa atcaagctcc 12300 gattttaagc actcgcagct tgaatgatgt aagaaagaca ttttaaattt ttgatttaat 12360 cgaaaatatc caaaacttaa ttttttttag tagatttcct acgctttttg ttcactgcac 12420 ttttcacttt ttttcctctt gtattcttct tcttctttct accacttttt agtcttttta 12480 cagcttttgt ttttttcttg ctagttcgcc tcttcttctt cttattagcg gtcttgcgtg 12540 cacgactgac gtactgcaac tgtgacgctc gtttgagcac catcccttta tagccggagc 12600 cttccataag ttcgcttatt ttctcattag ccttacgttt tagtttcagt ccagactcgg 12660 ctaggcgatt gttgagggaa tccttaaatg acatattatt ttctgtcaca tcgtcgagga 12720 catttatacc tgctctcaag gcttctcggc cgacagctcg ggctcctttc gccaaaaacg 12780 gtaatgctcg tcgaaatagt ccacccagga aggaaccaat accatgaccg cgctgccctc 12840 gcgagcctac atacacactc gcaataccgc cccccccacc accactctgt gcactataat 12900 atgtgtcgta gtaattcatt tttaaaattc tcttctaaaa tgaagtgcaa cggtcaacgt 12960 tccaaactcg aatggtattg aactcccaaa ttgatctctt atatcaatat ctataatttg 13020 gaaacgactg ttcaaaagtg gtatataatt aacaggtgaa tatcgtttaa caatagtact 13080 accgtattta taatcccttg cttcaacatt cacaattctt aacagaggcg catgtgcgtc 13140 gcctacgata cagggttcac aaatgtttga ataaataaaa agttggtcgg gtatagcgtg 13200 agcgagagta gcaggttggc tagctacata atctctataa ggtttagttt ccgactccag 13260 tgtaattcta aaggtaccat ttccaaaacc aagaatacgt tgtacaggct ttccaagtct 13320 gcttttcttt tttacaagct ccggatactc tcttagagta acgtaacctc ctttttcatc 13380 gtaaagcaat tctttagccg tttgattgtc gtgatattct ttaaagcctt cgtttattgc 13440 atcaatcaaa ccctgtacgg atttatatac accatgctgg aaatgaaaat tatccttccc 13500 ttctgcaggt gatattaacg tatctttata acgtaaatga agcgtagtgc agggtatatg 13560 aatttctgac agtccaactt gccacttttc atgtaaatcg attgcacgag gtaatcttac 13620 tgaaaaacta gtcgtcgtgt tgtccggaaa aaatgtcatg ctactattac tagggagcac 13680 tactgtgaat tccgattgat tcatttttac agctttgtta cgtcacttgc gagaatccaa 13740 gagttaaatt tgtcgggata gccgatccag cttacgaaat actccttttt agctccttta 13800 ccgcgtgttt ttaagatttc atttatttta tattcttctc gcgatactct ttcactgtgc 13860 accaaactca attcttctgg gtagaaaaat ccttcaattt cctcaccttc taaatcaacc 13920 agttcataga tgaacaagtt ttgacgtttt tttactttaa caattttaaa aatctcttcg 13980 ctccagcctt tttcgtaacc tttttcaaag atgttacgct ctctgctaat tcgaacatac 14040 tgatccactt tatatctcgg tcttttccgt ataacctgtt gttttaaaga tttttccaac 14100 atattttttc tagcttcctg tgcattgtaa actgtcacgg cagcaggttt catttttatc 14160 gttgaatgta tactattgtt ataactctga actatagctt gcagtacatc aatataacgt 14220 ttcgtatttt tataggtaaa atagcgaaac attcgttcct ttagcgttct gttgaatctt 14280 tccacaactg cggcttttac atcaggatta cgcgctacac gaaattttat acccgtttca 14340 cttaaaaact tttgaaattc tttaccaaca aattcttttc ccgcatccgt ttgcaacata 14400 taaggacagc gatttttatt tttatttaga atttgtttaa atgctttagt cacttctttt 14460 acggttttgt cacgcaacgg ctcgacaaaa gtatatttac ttaaaacatc aatcactact 14520 agcaaatatg aataattatc gttgtgactc ttcagagatg acagaactat gagatcggct 14580 tcccatacgt gatccaatcc caaaacatcg tagtacaaac gtggaaattt ttttcgaata 14640 ggtttgtgaa gtgtataagt gtcgtgtctt gtcaaccagt ccttgattcg tgatgctggt 14700 atctttttct tattcgcact tattaaattt ctcgctcctg taaagccaac cgcatgcccc 14760 ggatcaaaat attgtttctc aatttcttgc atcattatta tgtcatttca accgtaattc 14820 attccaacgt ttagttatag gagtagatgt tcgagaacta ttttttttat tgctgacgtc 14880 gttttgcttt ttttgctaca attgtatcca ttattaacga agaatcactt ttactgtttt 14940 gaaaatcatc gatttcctct tcgtcatcgt cttcgtcttc ttctcgtcga ctttttttcc 15000 aacttttttc aagtccgttt tcaaaaccac tgcgagacag agaaccattg agattaaaaa 15060 attcactatt tccaatgaac tccctcggaa tctgggttgt aaaaagtaat tcaccaaact 15120 gttgtcgacc cgcggcttta atatttttac gtgctctggc tacatcattt accaagtcaa 15180 ttatgtttga attttttatt ttttctccgt caatttttac aacaccagtt gcactccaac 15240 ttatccggtt actattattt ttcaaatgat ttaataaaag ttttgtttta tttttgtata 15300 attttggcac actttcaaga atgtactcat cgctcatttg atctttttca tcacttttac 15360 catctgctaa atttttctta tcgtcaaacc ggcgtttttc atcgacaaaa aatagatatc 15420 tctgtaaaac ttgtagaaaa tctctgtact tgtcactatc tgtcgtatat tttcctgaat 15480 ccaagatctg tcgcatttcg ctatcaagcc gtgaaaattt atctccaggt gtttgaatag 15540 attcactcgt taattttttc tcctgttctg gaactccttg tacgccggct gctgctggca 15600 aactatttat tgatgctgtt tttacacttg cttgcatttt ttgtaaactt tcttgaggaa 15660 tcaaaaccat ttttcgtgct ctttccattt tttatgcgtt tagaaaaaga tccaaaatag 15720 gtggcaacag aaagcctaga aaactccctc ctttttgtac aataatgcgc tttttactct 15780 tccaagaatt tttttttcgt tttgtaactt ttgttaaatt acgcagaatt gttttatact 15840 ttttcagctt atttttttcc gtgacgttta aacatacaac accttttaat acgttcaatg 15900 cacactcaca aatgtacttt acaagttttt catcggcagt acgcaataac gctagtcgtt 15960 gttctttatt agaatgtctc aatgcctgca gcaacacaaa atatttttct tcaaaaggcg 16020 atgcgttatt cttctttcta attaacgctc gactctcacg agatgctcgc ttcgccattg 16080 tcggattata actgaaccga tattttctct ttacacaaca tttaacttgc tgtactgact 16140 cgacgaatat ttgggaatat aaacataatg atattcatca tcaggaaata tacaactacg 16200 gaaacgtaac tcctcaggtg tactctgctt caaatcaata actaaataag aatacggctt 16260 tatagtacta tcataataag cttcttgcaa aaacttggaa tcttcggggt atagttgacg 16320 agccaaatgc tgaatctgtg cacggtcccg aggattctta aataaaacca tataattcgc 16380 attcagcgat atatcgcgct gccctttgtg aaatatattc tgacatatgt aaataaccga 16440 tgcattacga tgatgacttc ccttactaaa caaatctatt attgtaccat tattcgctga 16500 ctcgcgcatc aaatcatcta ttattaacaa ttttcgtttt tccggatcgt tcgaatagtc 16560 ctcgttccgc ggtaatcctt cacgaaattc aatctctaca ttctccactc tataactttc 16620 ttgccattca ctataataaa atattattct atcaaacttc acattagcca acacatcaag 16680 atgctttaaa aatctctgta caaacactgt tttacctgcg gaggtaggac cagcgcaaat 16740 catagtccac gggtgcataa aacaagtatc cattttagca tataatgaca cactaacaaa 16800 aaaagtacta tacgcggtcc aagacccttc ttgaccgcca cgatccccac cccagggggc 16860 cccgccggga ctcactcgct cttgcgtcat caccccccac cgcgagaccc aactagactt 16920 gccccgacga gccatggcgc caccctcccc tcctttacct cccctacctg tctgcctggg 16980 cttgcccctc tctact 16996 // ID Harbinger-N3_BF repbase; DNA; INV; 1329 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N3_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N3_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1329 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1329 RA Kapitonov V. and Jurka J.; RT "Harbinger-N3_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 816-816 (2008). XX DR [2] (Consensus) XX CC This transposon has imperfect 15-bp TIRs and TWA TSDs. XX SQ Sequence 1329 BP; 281 A; 356 C; 387 G; 305 T; 0 other; ggcctacgtc acattacata gggggtccgc gcggtgtcta accgatgtag gccccggccg 60 ggctctgaag ccgggtagac atcggctcac atagtcacaa atcatttttt cttcacgcgt 120 cccgttagag gtcccggggg gctccggtgg gcattccggc ggctttcgtg caatagtcgt 180 atgctttaac cctttatcgc tcgatgttcg gcaggcaacc gtgagctgcc ctgtggatgt 240 ccacgggtac cccgtagggg tctggcagtg ggtcggccga ttcacccctt gtagataact 300 gtatggtccc tgtcgtacgc cccgcgaacc acgaccatca tgtttttttt tttggggggg 360 ggggggggat ataagcattg gttggctcgt cttcgttgac ttttgcaact gtgccacgag 420 gtactaacac tagtagaagc ggaaaatgcc actgcacttc gactgacagg gtgagcacat 480 gccgcttgct cagctcaggc tgagggtggc gacggctcgt caccgtttca tccttgttct 540 tgctacatgg gtagtggaag aggagagaca agcggaacgg cagtgccaga gaagaatatg 600 ttgggttagg ctccgaacca tgttttgggg tctacgatac gctgatgagc aagctataaa 660 tgcgagaaca ttcttgcgac tttaagtcct ttatgcgcct tatgcgtgtg ggaccttcca 720 ttgagaagag tagaaagtga gttttttact gatgatattt ccctaataat ttccccgccg 780 ggaccccggt tcattttaag tccgacttaa aaccaaccgg ggccccgccc gagaccaaaa 840 ataacccggc agccgctggg agaccagcag gaccccgacc ggaagagcca aaaaacagtc 900 cgtaaattac ccgccagggt ccctatgtgg aaatgtgacc atggcccaaa ttgtcctgtt 960 tgatgaccga gggaccccgt cagcggccct ggcggtgttc cgggatgaca caacggatgc 1020 aatagtttgt aatgcaagaa ggttgggcct gtaccgtcga gtaccgtcgg ggtaccgggc 1080 gggtaccggg caggtactgt aggggaccct gccgatgtgt tcacggttag ttgacggctt 1140 tgattcggcg tattttcctg cccgggcccc gattgatttt aagtccgact tagaatcaac 1200 cggggccccg cccgagcccc aaaatagccc ggcagccgcc gggggtctca cggggacccg 1260 cccgaaagtg caacaattca gcccctaacc tacccggccg gacccctttg tgcaaatgtg 1320 acgttagca 1329 // ID Penelope-5_HM repbase; DNA; INV; 2357 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2357 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2095-2095 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(97..1026,1008..2018) FT /product="Penelope-5_HM_1p" FT /translation="MCAFESELIELIKNIKFRKVYSKFQKKLKEDISNIRN FT CNLTYTFADKTSNLYNLSKQDYNHLLTNAVTSTYKKSDPKVKNDINKEGKR FT ILQNHEIFNKIDINTSSNCFITLKDHKENFANNPTVRLLNPAKNEIGRISK FT VILSNINSELVKRLHLNQWKKTQDVICWFKSISDPNLYKFLIFDIVDFYPS FT ISEKLLKNAISFAEQRLIIDPEQKSIIYHARKSLLFDSDQVWIKKSSGLFD FT VSMGAFDGAEVCELVGIFLLFQITQFYNKNDFGLYRDDGLAIFKNISGPHM FT EKIKKHVVKIFKSNGTIFIKWNDLLISTQCNIKIVNYLDITLNLNDHTYQP FT YSKPGNSLNYIHAESNHPSSILKQLPHSIELRLSANSSSEEIFQKSTPMYN FT EALKKSGFNYNLSYQPNLILSHTKNRKRNIIWFNPPFSKNVSTNIGKIFLT FT LIDKHFPKNHKLHKIFNRNTLKVSYSCMPNMKSVLNAHNRHVINNKKESAE FT NNCNCIDKNTCPLSNQCLTKNIVYQANVCSNNLVQTPINKFYIGISETSFK FT IRYANHIKSFKIPKYKNDTELSKEVWRLKDENLNPVVRWKIIKQCDAYNPV FT TKRCNLCLNEKYEILFFKHPNLLNKKSEILSKCRHKNKFLLAQFDTGD*" XX SQ Sequence 2357 BP; 898 A; 385 C; 304 G; 766 T; 4 other; tttttcttta ttaacgacga tcaaaatagt aagaacaatt tttattataa taattatgga 60 ataaaaacac caaattaccc taaacaaatt actgaaatgt gtgcatttga aagcgaatta 120 atagaattaa taaaaaacat aaaatttcgt aaagtttata gcaaattcca gaaaaaatta 180 aaagaagata tatctaatat acgcaattgt aatcttacct acacttttgc agataaaaca 240 tctaatttat ataacttatc taaacaagac tacaatcatt tattaacmaa tgccgtaact 300 tcaacttaca aaaaatcgga tccaaaagta aaaaatgata taaataaaga aggtaaacgc 360 attttacaga atcatgaaat ttttaataaa atygatatta atacatcatc taactgcttt 420 ataacattaa aagaccacaa agaaaatttt gcaaacaatc caacagttcg attattaaac 480 cctgccaaaa atgaaatcgg cagaattagt aaagttattc tttcaaacat aaattctgaa 540 cttgtaaaaa gactccatct caatcagtgg aaaaaaacac aagatgttat atgctggttc 600 aagtccatca gtgatccaaa cctttataag tttctcattt ttgatatcgt tgatttttat 660 ccctcaataa gtgaaaaact acttaaaaat gctattagtt ttgctgagca gcgtctgatt 720 atagatcctg agcaaaaatc gattatttac cacgccagaa agtcactgtt gtttgatagc 780 gatcaagttt ggattaagaa gtcaagcgga ttattcgatg tttcgatggg agcatttgat 840 ggcgccgagg tgtgtgaact tgtggggatt tttcttttat ttcagattac acagttttat 900 aataagaatg atttcgggtt ataccgtgat gacggtttgg cgatttttaa aaacatcagt 960 ggtccccata tggaaaaaat taaaaaacat gttgttaaaa tatttaaatc aaatggaacg 1020 atcttctgat atcaacacag tgcaacataa aaatcgtaaa ttatctggac attacattaa 1080 acctcaacga ccacacctac caaccttact ctaaacccgg aaactcttta aattacattc 1140 atgctgaatc aaaccaccca tccagtattt taaagcaact cccacattct attgaattaa 1200 gattatcagc aaattcgtca agcgaggaaa tctttcaaaa atccactcca atgtataatg 1260 aagcattaaa aaagtccggt tttaattaca atctttctta ccaacctaac ttaatattat 1320 ctcacactaa gaaccgtaaa cgtaacatca tttggttcaa cccgcctttt agcaaaaacg 1380 tcagcacgaa tataggaaaa atatttttaa cattaattga taagcacttc cctaaaaatc 1440 acaagttgca caaaattttt aacagaaaca cattaaaagt aagttatagc tgtatgccta 1500 atatgaaatc cgttttaaac gcccacaatc gccatgttat aaacaacaag aaagaatcag 1560 ctgaaaataa ttgcaattgc atagataaaa acacctgccc tttatcaaac caatgcttaa 1620 caaaaaatat agtatatcaa gctaatgtat gctcaaacaa tcttgttcaa actcccatca 1680 ataaatttta cattggcata agcgaaacat cttttaaaat tagatatgct aatcacataa 1740 aatcttttaa aattcccaaa tacaaaaatg acactgaatt atcaaaggag gtttggagat 1800 taaaagatga aaacttaaac ccggttgtta ggtggaaaat aattaaacag tgcgatgcct 1860 ataaccctgt aacaaaacga tgtaacttat gcttaaacga aaagtacgaa attttatttt 1920 ttaaacaccc aaatttgtta aataaaaaga gtgaaatatt atctaagtgc cggcataaaa 1980 acaaatttct tcttgcccaa tttgacactg gcgattagac agttttacaa ctatgcttct 2040 tttttattcg ttttattcgt tttgttttga cgtcggaact cattctactg ctttgttatt 2100 attttttytg ttagtttttg tattttgtat tttttaacgg ttttttcttg atttgtataa 2160 atatatatgg ctgaagattg ccgataggca tgaaactttt agttccatta taaagttgta 2220 tttttctatt taattacact ttatttatat atatatatat attgatctat tttgtaaaaa 2280 ttatttttac attatattga tcactctcaa gtaaacaaga ggattttgta tctcgayctg 2340 atacaataaa acaacaa 2357 // ID CR1-6_CQ repbase; DNA; INV; 3709 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3709 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 8-8 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 1..738 FT /product="CR1-6_CQ_1p" FT /translation="SKRQTQASTIPLFTNVTSSTLMPACILATIGRRPYIA FT IYRAAATATRHQVGGRQTLLTRDARTPASWKTPTLPSQSRHRSAATYLVPD FT IKVVPALCTVPEKGSSKRQTQASTIPLFTNVTSSTLMPACILATIGRRPYI FT AIYRAAATATRHQVGGRRTLLTRDARTPASWKTPTLPSQSRHRSAATYLVP FT DIKVVPALCTALENGSSKTRFQACIRVFPFSRRSLMLHNLPAPSNPPTHRS FT PTLAR" FT CDS 198..3644 FT /product="CR1-6_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MEDPDSPVTVAPPLCCDVSSSGHQSRPGPVYGSRERV FT FQTPNAGKYNPAFHERDVFDPDARLHSSDDRTTPVHRDLPSCSYRNPSPSR FT WPSDTADPGRTNPRIMEDPDSPVTVAPPLCCDVSSSGHQSRPGPVYGFRER FT VFQNALPGMYSRFSVQSTLPDASQPSSSQQSSDAPLTHPRSLARVSSRIQQ FT CHLYYQNAGGMNGNLDDYLHASSGECFDFIALTETWLKENTLSSQVFGPAY FT EVFRCNRGPNNSRKVEGGGVLVAARRCFKPRRIDQDAWKAVEQVWVSLKLS FT DRVLFLCVVYFPPDRIYDKDLYKIHLDSIASIAERTRPIDDIVVVGDFNLP FT KLTWTPSRNGFLYPDPDRSQFHPCARDLLAGYNLATLQQINHIENEDGRRL FT DLCFVSAQDFAPTLIVAPAPLAKIVHRHPALVVTIDGCHSSAVRDRPDTVC FT YNFRGADYDEMSLALRNIDWDSVLDSVDVDAAVDTFSSILRDHIDHFVPKV FT RKRNSTHLPWQTPELCNLKTQKRAAFKIFSRCGTLSLREYYLRMNSKYQRL FT SRSCMASYQRKKQRELKANPKKFWKFVDENRKESGLPSSMRLANEEAESTA FT EICRLFAKKFSSVFTNEHITDDEIQIAANSVPRCDRSLPMIDIDDDAVSAA FT ITKLKHSSSPGPDGIPSTLLKRCSSALVSPLLHLFRLSLASGKFPCAWKQA FT FMFPVHKKGDRRNIENYRGISALCAASKLFELVVIGPIFAHCRQELSSDQH FT GFFPKRSTATNLLCFIEFVIDSFDNRSQTDAVYTDLSAAFDKINHSIAIAK FT LEKLGFCGSLLTWFRSYLSGRSLRVKIEDTLSESFDATSGIGQGSHLGPLV FT FLFYFNDANFTLRGPHLSYADDLKLFAKIDCLEDAEALQRELDKFADWCEA FT NHMVLNPGKCQVITFSRKHSPILFNYHLGDTLVERVDHVKDLGVILDVKLT FT FKQHVSFITAKASRQLGLVIRMTRHFTDIHCLKTLFCSLVRSSLEYCSSVW FT TPHYNNAVYQLERIQRRFVRYALRLLPWRNPQQLPPYEDRCQLMHLDTLQL FT RRDLARALTVSDVLTDRIDCPSLREKITLTAPARQLRHTPIMQIPFRRTNY FT SANGAIVGLKRAFNKVSSVFDVSLSRDVLKSKFLSVLRRMF" XX SQ Sequence 3709 BP; 855 A; 1118 C; 876 G; 860 T; 0 other; tccaaacgcc aaacgcaggc aagtacaatc ccgcttttca cgaacgtgac gtcttcgacc 60 ctgatgcccg cctgcattct agcgacgatc ggacgacgcc cgtacatcgc gatctaccga 120 gctgcagcta ccgcaacccg tcaccaagta ggtggccgtc agacactgct gacccgggac 180 gcacgaaccc ccgcatcatg gaagaccccg actctcccgt cacagtcgcg ccaccgctct 240 gctgcgacgt atctagttcc ggacatcaaa gtcgtcccgg ccctgtgtac ggttccagag 300 aaagggtctt ccaaacgcca aacgcaggca agtacaatcc cgcttttcac gaacgtgacg 360 tcttcgaccc tgatgcccgc ctgcattcta gcgacgatcg gacgacgccc gtacatcgcg 420 atctaccgag ctgcagctac cgcaacccgt caccaagtag gtggccgtcg gacactgctg 480 acccgggacg cacgaacccc cgcatcatgg aagaccccga ctctcccgtc acagtcgcgc 540 caccgctctg ctgcgacgta tctagttccg gacatcaaag tcgtcccggc cctgtgtacg 600 gctttagaga acgggtcttc caaaacgcgc ttccaggcat gtattcgcgt ttttccgttc 660 agtcgacgct ccctgatgct tcacaacctt ccagctccca gcaatcctcc gacgcaccgc 720 tcacccaccc tcgctcgtta gctagggttt cctcgaggat tcagcaatgc catctttatt 780 accaaaatgc tggtggcatg aacggcaacc ttgatgacta ccttcacgcc agctcaggag 840 aatgcttcga cttcatcgcc ctgacggaaa cttggctgaa agagaacacg ctttcgtcgc 900 aggtctttgg accggcttac gaagttttcc gctgcaatcg tggaccaaac aacagcagga 960 aagtggaagg aggaggagtc ctcgtcgcgg cacgccgctg tttcaagcca cgccggattg 1020 accaagacgc ttggaaggca gtcgagcagg tctgggtgtc tctgaagttg tctgatcgtg 1080 tccttttcct ctgtgtggta tacttccccc cggatcgcat ttacgacaag gacctgtata 1140 agatccatct cgattcgata gcatctattg cagaacgcac gcgcccgatc gacgacatcg 1200 ttgtcgtagg ggacttcaac ttgccgaaac tgacctggac accttcccgt aacggtttct 1260 tgtacccgga tcctgatcgc tcacaattcc atccctgcgc gcgtgatctt ttggccggat 1320 ataacctggc cactctgcag cagatcaacc atattgagaa cgaagacggt cgccgtctcg 1380 atctttgctt tgtgagcgca caagatttcg ccccaacgct aatcgttgcc ccggccccgc 1440 ttgctaagat cgtccatcgc caccccgcgc tcgttgtcac catcgacggt tgtcacagtt 1500 ctgctgttcg agaccgaccg gacaccgtat gctacaactt ccgaggtgct gattacgatg 1560 aaatgtctct ggcactgcgc aacatcgact gggatagtgt attggactct gttgacgtcg 1620 acgccgcggt cgatacattc tcatcaatcc tgcgagatca catcgaccat ttcgttccta 1680 aggtcaggaa gcgtaattct actcacctac cgtggcaaac gccggagctt tgcaacctca 1740 aaacccagaa gcgagccgct ttcaagatct tctccaggtg tggaacactt tcccttcgtg 1800 agtactacct gagaatgaac agcaaatacc agcgtctgag tcgcagttgt atggcaagct 1860 accagcgcaa gaagcagcgt gagctgaagg ccaacccgaa gaagttttgg aaattcgtcg 1920 acgagaaccg caaagaatcc gggctgcctt cgtccatgcg attggccaac gaggaagctg 1980 agagcactgc agaaatctgc aggctgttcg ccaaaaagtt ttcgagtgtt tttacaaacg 2040 aacatatcac cgacgacgag atccaaattg ctgccaacag cgttccccgt tgcgatcgat 2100 ctcttcccat gattgacatc gatgacgacg ccgtatcagc tgcgataact aaactgaaac 2160 attccagctc ccctggaccg gacggaatac cctcgacact gctgaagcga tgttcgtccg 2220 cgctggtatc gcccctcttg cacctgttcc ggctatccct cgcctccgga aagttcccgt 2280 gtgcatggaa gcaggcattc atgttccccg tgcataaaaa aggcgatcga aggaacatcg 2340 agaactacag aggtatctct gccctctgtg ctgcctcgaa actgttcgag ttggtggtga 2400 ttggtccaat ctttgcccat tgccgccagg aattatccag cgatcagcat ggattctttc 2460 caaaacggtc cactgccacg aatctgctgt gcttcatcga gttcgttatc gatagcttcg 2520 acaatcgctc tcaaactgac gccgtttata cagatttgtc ggcagctttc gacaagatta 2580 accacagtat tgctatcgcc aaactcgaga agctcggatt ctgtggcagc ctgttgacat 2640 ggttcaggtc gtatctgtcg ggacgctcgc tgcgggtgaa aatcgaggac acactctccg 2700 aaagtttcga cgctacctct ggaatcggtc aaggcagcca cctcggtccg ttggtgttct 2760 tgttctattt caacgacgcc aacttcaccc tgagaggacc ccatctttcc tacgcagacg 2820 acctcaaact gttcgccaag atcgactgtc tcgaagatgc cgaagccctg caacgcgaac 2880 tggataagtt cgctgattgg tgtgaggcga atcacatggt ccttaatccc ggaaaatgcc 2940 aggtgataac gtttagccgg aagcactctc cgatactctt caactaccat cttggcgaca 3000 cactagtgga aagagttgat cacgttaagg atctcggtgt catactggac gttaaactga 3060 ccttcaagca acacgtgtcc ttcatcactg ccaaagcatc ccgtcagctc ggtctggtga 3120 tccgtatgac acgccacttc accgacatcc actgcttgaa aacgctgttc tgctcgttgg 3180 tccggtcctc gctcgaatat tgctcgtcgg tttggactcc gcactacaac aacgccgttt 3240 atcagctaga gagaattcaa cgcaggtttg tccgatacgc cctgagactg ctgccctgga 3300 gaaatcccca gcagctgccg ccatacgaag accgctgcca gctcatgcat ctcgacactc 3360 tccagctgcg tcgtgacctg gcccgtgcgt tgacagtttc ggacgttctg accgacagga 3420 tcgattgccc ctctctccga gaaaagatca ccctgactgc acctgcccgc cagctacgcc 3480 acacgccaat catgcagatc cctttccgcc gtaccaatta cagcgccaac ggagccatcg 3540 tcggactgaa gcgagccttc aataaagtgt cctctgtttt tgacgtcagt ttgtctcgcg 3600 atgtgttaaa gtccaagttt ttatcagtgt taagacgaat gttctagttt tagtcttagt 3660 tttaagttca tttgggcaat aagtgcctgt tgagaaataa acaaacaaa 3709 // ID Gypsy-32_DPu-LTR repbase; DNA; INV; 112 BP. XX AC scaffold_53; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_DP_; KW Gypsy-32_DPu-I; Gypsy-32_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_53; Positions 725454 725565. XX SQ Sequence 112 BP; 30 A; 31 C; 22 G; 29 T; 0 other; tgttgtagaa tctagtgctg accgctagat gtcatgccac ctagagcgag catatccccg 60 ctatgcagag tcccaataca gcagtcttcg acacacactt gtctttataa ca 112 // ID BEL-615_AA-I repbase; DNA; INV; 6183 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-615_AA_; KW BEL-615_AA-LTR; Pao_Bel_Ele51; BEL-615_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6183 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5212-5772] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 4813..6183 FT /product="BEL-615_AA-I_1p" FT /translation="MKVLRLTQGPPDARHEAVKKSSPLFKICPYLDEQGIL FT RMRGRIAACRFASFEAKFPCILPRRHRITQLIVDWYHHHYRHANRETVVNE FT LRQRFEIAKLRAVVERVEKDCMKCRIKKASVRSPLMAPLPRARLEAYIKPF FT TSVGLDYFGPIPVKVGRSQVKRWVALFTCLSIRAVHLEVVHSLTTESCIMA FT VRRFVARRGPPAEVYSDNGTCFHGANRELQLQMRETNEAMALTFTSAQTRW FT IFNPPSAPHMGGVWERLVRSIKSATNVLLESPRNPDDETLLTILYEAECMV FT NSRPLTYVPLESADDEALTPNHFIFGCSNGTKVLPTEPVDIRRTLRSSWKL FT AQHIMDQFWCRWIKEFLPVITRRCKWFKECEDVKVGDLVLIVGESTRNHWI FT RGRVEQVIVGKDHRVRQALVRTSTGILRRPALKLAVLDILESSKPDLDDVS FT NTDLDQGLREGE" FT CDS join(470..2407,2411..4072) FT /product="BEL-615_AA-I_2p" FT /translation="MQRRLEERKSLLEAERQLRERQLVMDKEVCARQQQIR FT KESLEKKTALIRQIAESSSKGGSLGGSVTNSNDKVTKWLETAAEGPERASK FT DWRASNSIRSTPPCNPHQHGERLPSPRRQTGPASLPAMHHLEATLEQLSVN FT ETGLPPANAHQQQPALQEHNVNIFQQAANIGPASLRHIASPHPREHVVSNP FT RENSYTEPAPRAAGGLTFPRRTPEALPQQGLQPPVQLQGVNIQARQSPLQP FT QTTAAIHHRVLTSEQIAARQVLGKELPVFNGNPEDWPLFVSSFEQSTIACG FT YTDAENLIRLQRCLRGHALEAVRSRLLLPTNVPYVIQTLRTLYGRPELLIR FT SLLAKIQRVPPPRQDRLETLMEFGLAVQNLVDHLVGAQQENHLSNPVLLQE FT LVEKLPGSLKLNWAVYKRQDVIVTLATFGSFMSELVSAASEVTMEIPVFSR FT APRNEHQRSREKGFLQTHSSRPLPSGSSTTVNNGKDGQKPGKTCSICTKEG FT HRVGDCTQYNLMTVEERWSVVQKKGLCRTCLNNHGKWPCKSWKGCGVSGCT FT MKHHTSLHIPSAGPPSVNVSTSHFSSNSEALPLFRIQPVVLFANGLSLTVF FT AFIDEGSSLTLLEADIAEQLGVCGPKEPLTLLWTGNVSREESKAQRIEEIA FT SAGSQSPSHLMQTHTVNSLVLPTQTLRYQDLSARFPHLRGLPIEEYENVQP FT KLLIGINNLHLSVPLKIREGAPRDPIAAKCRLGWGIYGCAPRCSPRVVPIN FT FHIAAASDSDRELNEQLRDYFAVESIGTPNSSTCVLESEENRRARTIMEET FT TRRTSSGFETGLLWRMDDVEFPESHSMAVRRLQSLERKLEQDSFLRDSVRQ FT LIWDYERKGYAHKATMRELDSMDSNRMWFLPLGVVRNPKKPQKIRLIWDAA FT AKSHGVSFNSKLLKGPDLLTPLVAVLSRFRQYPVAVCGDIREMFHQISIIP FT QDRSSQCFLWRDEPTDRIQTYVMDVATFGSTCSPTSAQFIKNRNAEEFAKE FT FPRAAAAIVDNHYVDDYLDSFRTVQEAIEVTNDVRLIHSRGGFELRNFSSN FT SEKLLQGIGETPDRTGKALVLMRGEVESVLGMQWLPMEDLFTYTFNMRDDL FT QPILANDHIPTKREVLKVVMSLFDPLGLISFYLVHGKVLIQDIWAAGCSWD FT EAIGDELYAKWRQWINLFPMLQSIKIPRCYFNKPLPEDLKE" XX SQ Sequence 6183 BP; 1747 A; 1442 C; 1491 G; 1500 T; 3 other; caaatacgga gtttgctatc gagactttgg tcgaatatcc cccacaattc ttcaagaaat 60 tttgtttgtg ggaagcaaaa tggagagatc tcgcaccaac gatgggctca cctacacctg 120 ccaatcctgc gatcagcctg atcaagcaga acctcgcatg atagcctgcg acaactgccg 180 caaatggcag catctgagct gcgccggagt tgatcctact atcgaaaact acgctaatct 240 aaaatttact tgcaatggat gcaagttgaa gatttcttcc cttcggaaaa tccctcctac 300 ggacaggtcg acacgatcct ccgttaagaa aggagtaaac gttcctagtc acgtctcaag 360 tgaaagatct gttctcgaag cacacctgaa agttgtggag gagcaacaac agctcgagga 420 acaggwgttg aaagagcaga ctgagattca acagcgtgaa atagatgaga tgcaacgtcg 480 cctcgaagaa cgtaaaagtt tgttggaagc ggaaaggcag cttagagaga gacagctggt 540 tatggacaaa gaagtgtgtg ctcgccaaca acagattagg aaggaatccc ttgagaaaaa 600 aaccgcgttg attcgtcaga tagctgagtc gagtagtaaa ggtggttcat tgggtggttc 660 ggttactaat tccaatgata aagtaacgaa atggttagaa acggcagccg aggggcctga 720 acgtgcatcc aaagactgga gggccagcaa ttcgattcgg tcaacacctc cgtgcaatcc 780 gcatcagcat ggagaacgac tcccatcacc acgccgacag acaggtcctg cctctttacc 840 cgctatgcat catctcgaag ctacgctaga gcaattgtcg gtgaatgaaa cgggtttacc 900 gccggcaaac gcacatcaac agcagccagc tttacaagag cacaacgtta atatttttca 960 acaagcagca aatattggcc cagcctccct acgtcatatt gcatcaccac accctcgaga 1020 gcacgtcgtc tcgaatccaa gggaaaatag ctacacagag ccagcacctc gtgctgccgg 1080 tgggttaaca ttccctcgtc gaacgccgga agcgctgcca cagcagggac ttcaaccgcc 1140 ggttcaatta caaggagtga atattcaagc aaggcagtcg ccgctacagc cccaaacgac 1200 tgcagcaatt catcatagag tactaacgtc agaacagatc gccgctcgac aggtgcttgg 1260 aaaagagctt ccagttttca acggaaatcc tgaagactgg ccactcttcg tgagcagttt 1320 cgagcaatcc acaattgctt gtggatacac ggatgctgag aatttgatac gcctccaaag 1380 atgtttgcga ggccatgcct tagaggccgt cagaagccgc cttctgttac ccacaaatgt 1440 gccatacgtt attcaaacgc tgcgcacgtt atacggaaga cccgagttgc tgattcgttc 1500 cttattagcg aagattcaac gagtaccacc accccgtcaa gatcgactgg aaacgctgat 1560 ggagtttggc cttgctgtgc aaaatctcgt cgatcatctt gttggggccc aacaagagaa 1620 tcatctctca aacccagttc tccttcaaga gcttgtcgag aagctaccag gatcacttaa 1680 actcaattgg gcggtctaca agagacaaga tgtaatcgtc accttagcca ccttcggaag 1740 ctttatgtcc gaattagtat cggctgcaag tgaggtcaca atggagattc ccgtcttcag 1800 cagagcgccg cgaaatgagc atcaaaggag tcgagaaaaa gggtttcttc aaacacattc 1860 ctcaagaccg ctgccatcgg gttctagtac gacggtgaac aatggtaaag atggtcaaaa 1920 accaggtaag acgtgttcca tatgcaccaa agaaggtcat cgcgttgggg actgcacaca 1980 atacaatttg atgaccgtag aagagaggtg gtcagttgtt caaaaaaagg gtctctgtcg 2040 gacatgtctc aataaccacg gtaaatggcc gtgcaaatct tggaaaggtt gcggcgtaag 2100 cggatgcacc atgaaacacc acacctcgtt gcatattcca agcgctggcc cgccgtcggt 2160 gaatgtgtcc actagtcatt tcagttccaa tagtgaagca cttcccttgt tccgaattca 2220 acccgtggtc ctttttgcaa atgggctctc gttaaccgtc tttgccttca tcgacgaagg 2280 ttcgtcattg actttgttag aggccgacat agccgaacaa ctaggtgtat gcggacctaa 2340 agagcctcta acgttgctgt ggactggaaa tgtgagtcgc gaagaatcca aggctcaacg 2400 gattgaamtc gaaattgcca gcgctggttc gcagtctcca tcgcatttaa tgcaaactca 2460 tacagtcaat agcctggttc tcccaacaca aacattgcga tatcaagacc tgtccgctcg 2520 attcccgcat ctgcgaggat tgcctataga agaatatgaa aatgttcaac caaagcttct 2580 cataggaatc aataatttgc acttatcagt tccactcaaa atacgtgaag gagcgcctag 2640 ggatcctatc gctgcgaaat gccgactcgg ttggggcatc tatggatgtg ctccacgatg 2700 ttctcctagg gtagttccaa ttaacttcca tattgcggct gcttcggatt ccgaccggga 2760 gctaaacgaa caacttcgag attattttgc cgtggaatca ataggaacac cgaattcttc 2820 tacctgcgtt ttagaatcag aagagaatag gagagcgcga acaattatgg aagagacaac 2880 tcgtcgaacg tcttctggtt ttgaaactgg acttctttgg agaatggacg acgtggagtt 2940 cccggaaagc cactcgatgg cggtacgccg tcttcagtcg ctggagcgaa agcttgagca 3000 ggatagtttc ttgcgggatt ctgttcgaca gttgatttgg gattacgaac ggaagggata 3060 cgcacataag gctacaatgc gtgaattgga ttccatggat tcgaatcgta tgtggttcct 3120 acctctgggt gtcgttagaa atccaaagaa gccacaaaaa ataaggctga tttgggatgc 3180 tgctgccaaa tctcacggag tttccttcaa ctcaaaacta cttaaggggc cagacctatt 3240 gactcccttg gtcgcagtat tgagtcggtt tcgtcaatat ccggtggcgg tgtgtggaga 3300 tatccgtgag atgtttcatc agatctccat cataccccaa gaccgttcat ctcaatgttt 3360 tctctggcgt gacgaaccaa cagaccgaat ccaaacctat gtcatggatg ttgctacatt 3420 cgggtctact tgctccccca catctgctca gttcatcaag aatcgaaatg ctgaagaatt 3480 tgcgaaggag ttcccccgag ctgcagctgc tatagtagac aaccactatg tggacgatta 3540 cctggacagc ttcagaactg tccaggaagc tattgaagtg acaaacgatg tgaggttaat 3600 ccattcccga ggtggattcg aattacgaaa tttttcttcg aattccgaga agttgcttca 3660 aggaattggt gaaacccctg atagaacggg caaagcattg gttcttatgc gaggtgaggt 3720 agaatcagtg cttggaatgc aatggctgcc tatggaagat ttgttcacct atacgttcaa 3780 catgcgagac gatcttcagc cgatactggc aaatgaccac atacctacca aaagggaggt 3840 tctaaaagtg gtcatgagtc tatttgaccc actaggtctg atctcgtttt accttgtaca 3900 tgggaaagtt ctaatacaag atatctgggc ggctggctgt tcatgggacg aagcgatcgg 3960 agatgaactc tacgctaagt ggcgacaatg gataaatctt tttccaatgc tgcagagcat 4020 caaaattcca cgctgctatt tcaataagcc acttccagag gatttgaagg aatagagctt 4080 cacgtattcg tagatgcaag ctcttcggcg tatgcatgct ctgcctactt tcgactgcct 4140 agcgagactg gaattcaggt aaccctagtt accgcaaaaa ctaaagttgc tcccatcaaa 4200 acgctgtcta ttccccgact tgaactgaaa gcagcagttc ttggagttcg tttaacggag 4260 tcagttcaga gtcatcataa gttctcaatt tcacgcagga ttttctggac tgattcttca 4320 acagtattag cttggattcg ttctgatcat cgccgatatc agaagtttgt agccgtacga 4380 attggagaaa ttctaagttc cagtgaccag gatgagtggc gctgggtgcc ctctgaaatg 4440 aactcagctg ataaggcaac taaatggaac ggcggtcctg acttcaatgt agacaactct 4500 tggttccgcg gtccattgtt cttgcgaatg gaggagggac attggccgaa atttcccaaa 4560 ccattcacca ctagggaaga attacgtcca aaccattctc actggatctc agcaccgcta 4620 gttgacatat tcagattcag ccggtggacg agactgcacc gtatgatggc ctatgtatac 4680 cgctttatcg gtaatctasg gcggaagaaa aatggtcagc atcttcaaaa gggaccgttg 4740 actcaacatg aactgaaatt agctgagata ggactatgga gaattgcgca gaaggatgcc 4800 ttttctgatg agatgaaggt tctacggttg acgcaaggac caccagatgc acggcatgaa 4860 gcagtaaaga aatctagccc gctgttcaaa atatgtccct atctcgatga acaaggaata 4920 ttacgaatgc gaggccgcat agctgcctgt cgcttcgcat cctttgaagc aaaatttcct 4980 tgtatcttgc ctcgtagaca ccgtattaca cagctgatcg ttgattggta ccatcaccat 5040 taccgtcatg caaatcgcga aacggtggta aacgagctga ggcagcgatt tgaaatagcc 5100 aaactccgag cggtggtgga aagagttgaa aaagactgca tgaaatgtcg tataaagaag 5160 gcgtctgtcc gttcaccatt aatggcacct ttgccacgtg caagactgga ggcgtacatc 5220 aaaccgttta catcagttgg tctcgattat ttcggcccaa tccctgtgaa agtcggaagg 5280 agtcaagtga aacgttgggt agcgctcttt acatgtctat ccatcagagc agttcatcta 5340 gaggtcgtcc atagtctaac tacggaatca tgcataatgg cagtacgcag atttgtagct 5400 cgccgaggac caccggcgga agtatacagc gacaacggaa cttgcttcca cggagcaaac 5460 cgggagctgc agttacaaat gcgagaaacc aacgaagcaa tggcactaac ctttactagc 5520 gctcaaacac gttggatttt taatccacca agcgcccccc acatgggtgg cgtatgggaa 5580 cgattagttc gttcaataaa atcagccact aatgttctcc ttgaatctcc aagaaatcca 5640 gacgatgaaa cactactgac cattttatac gaagccgaat gtatggtcaa ttcaaggcct 5700 ctaacttacg ttcccctgga atcagcagac gacgaggcct taacgccaaa tcattttatt 5760 tttgggtgtt ctaatgggac aaaggtgctc ccaacggaac ctgttgacat ccgtagaacg 5820 cttcgaagta gctggaaatt ggcgcagcac attatggatc agttctggtg ccgatggatt 5880 aaggaatttc taccagtgat aacaagacgg tgtaaatggt ttaaagaatg cgaagatgtg 5940 aaagtcggag atcttgtttt aatagttggc gaatcaacta gaaaccattg gattcgcgga 6000 agagtggaac aggtgatcgt cggtaaagac catagagttc gtcaggcact ggtacgtacg 6060 tcaaccggca tactacgcag gccagcattg aagctagcag tgctggacat cttggagtct 6120 agtaaacctg atctggatga cgtcagcaat acggatcttg accaaggttt acgggaaggg 6180 gag 6183 // ID Gypsy-1_BM-LTR repbase; DNA; INV; 433 BP. XX AC nscaf1299; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_BM_; KW Gypsy-1_BM-I; Gypsy-1_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-433 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 978-978 (2010). XX DR Genome; nscaf1299; Positions 204946 204514. XX SQ Sequence 433 BP; 157 A; 79 C; 82 G; 115 T; 0 other; tgctttagga gcagtacttt cccaaggcaa tgtgggctca gataaaccag tagcctatgc 60 ctcaaggaca ttatcagata ctgagatccg ttactctacc atagagaaag aactgttagg 120 gatagtatgg gcaattaagt attttagacc ttacttatat ggccgtaaat ttacaattta 180 tacggaccat agacccctta catggttaat gagtctaaag gaccctaact ctaaattaac 240 acgatggaaa ctaaagttgg cagagtatga ttacaaagtt gtttataaaa agggcaaaca 300 aaacactaac gcagatgcac tatcccgagc aaaaattttt cataatagta tagattctct 360 agctgttaat gttgatgaca atagtgacga caacataata aatagaatat tcgaaaacgc 420 ccgtagacag gca 433 // ID Gypsy-617_AA-I repbase; DNA; INV; 6938 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-617_AA_; KW Gypsy-617_AA-LTR; Ty3_gypsy_Ele49; Gypsy-617_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6938 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4327-4782] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 533..1693 FT /product="Gypsy-617_AA-I_2p" FT /translation="MPKVKSALKKTRELFRPDIDSSSDSDSSVKSEEYSHV FT PLKNLTIPIESLNMAEQNALEAMARQMEAMMQAINTLTTKQMQYETQIGAS FT TSRASDTRETGNTFTLMGDPFRIPDPIKTLPTFNGNKKQLNWWLETAEKTL FT KNYETLVSPEIFQIYLTAVSNKIEGHAKDVLCTNGNPTTFDEIKQILVAAL FT GDKQDLSTYNCQLWHNKMDGSVSKHYQRTKQLVHNIKSLAKQNPKYNDHWE FT AVNDFIDEYSLAAYVSGLQKPYFGYVQAAEPKNIESAYAFICKFTSNETDR FT ETTLYHQNRKDNFTRSNDQDKQSKPGFKKPFDPQNRARNENKVQKPEPMEV FT GSTRSRLTLNNVNNIETNSDNEEEEDEAFQDTNFCVLNESDSDT" FT CDS 2554..5067 FT /product="Gypsy-617_AA-I_1p" FT /translation="MLENGIITHSDSPWNAPIWVVPKKPDASGKKKFRVVI FT DYRKLNEVTVDDKFPIPQIEDILDNLGKSTYFTTLDLKSGFHQIEMDENSR FT QKTAFSTDLGHYEFLRMPFGLKNAPATFQRAMNSILGELIGTVCYVYLDDV FT IVFGRSLKEHITNLGKVLEKLENANLKIQLDKCEFLKKNCEFLGHVITPEG FT VKPNPDKIKEIQTWELPTTEKQIKQFLGLVGYYRKFIRDYAKLTKPMTKYL FT KKDVKVNTDDPDFKKSFEKCKTMLISDPILAYPDYSKPFVLTTDASNYALG FT AVLSQINEGKDQPVAFASRTLNKREINYSTTEKEALAIIWAIEKFKPYLYG FT QKFTLVTDHKPLVFIKNANKNGKLIRWRLELENYDYTVIYKTGKTNVVADA FT LSRKVETNVNEISDPSDIETVHSASSSEEFFIHFTERPINNYRNQIIFRTT FT SFNSILMFDIFPGYKRTTICDSNFTADKITDYLKRYRNGRQTAIHAPESLL FT NMIQVSYRNHFSHAHLVITQNMVEDIISEERQDNIIRQEHDRAHRGIQEVE FT QQIKRSYFFPEMVKKIRRFNNACQICSGHKYDRKPYNIKISPRPIEIAPFH FT RIHIDIFGMDKINYLTIICAFSKHLQAIKIETRNTIDIQNALNQYFSNFAI FT PRSIVCDHEAAFTSIQMADYLANLGTQMDFASSSESNGQIEKTHCTLIELY FT NTNKHKFPNMSSEQIMGIIVALYNNTVHSATGFTPNEIIFNRTDAISPAEI FT DNNANRIFRKVTENLQKAQQKMERYNDEKEDPPVIEEGQTIFVKRGTRKKL FT DPRFNTTKCLENKEKTIKIPRNTKRNKNKIKRIRKP" FT CDS 5420..6937 FT /product="Gypsy-617_AA-I_3p" FT /translation="MTITQFVLLVLCVISNVHPQSLNIRNLYNEPLLLLKF FT KDCRIQKSFIKIVHPINLTSIETEVIRIVKLASQLDQSLQISQLTIQKSRM FT LINNLRQLKPIHHRRVRRWDTIGKAWKWLAGTPDADDLRIINSTINQLITQ FT SNNQVIINQMITKRISGMTSVVNELINKHNMENRIILKEYDALTLILYIDS FT MNHIIEEIQDTVLRTKIMLPNNKLLTLKEITLVESLLVEQGINVRFPEEAL FT EYATPKVAIREDCLLYILQIPKLYDQNAEVIEIIPLTTNNYIIPEAPKYVI FT KMQNSWFLTHYPEKTIQKESDLSTMNDDCIHAILKGAVSHCKTIENHDITI FT KQVSRNKILINNSKQSYIGSNCGPHNRSLTGNYLITFHNCTVVINNKVYAP FT EEVIDNITYDFQGAFPNLKINWSFTEHYNISTLKNSTEFNREHMELMKLEQ FT SQHKSWIITLAGGLSTTTLTIIGIFLYVCLRRRKLIIKIKNPGRINQKLED FT DLSLPPRGVT" XX SQ Sequence 6938 BP; 2655 A; 1441 C; 1142 G; 1700 T; 0 other; gccgtccgga gtcggctgag tgactatttg tgataacagt ggaaaattca cgaaccaaaa 60 acgtgaataa agtgatcccg cggcgttcaa cgcgaattgg acaaggcatc agtgacaagt 120 agacggtaca cgtcgtcacg cgactcgagg atccccctgc aaatagcatc tggacaacca 180 gcaacgtcag ccagacctac gaggacacga ctacaaaaaa ccgtaagtaa tttttactcc 240 attttcactc ccccaaaaat tacttgcgag gagatttgcg aactactaaa taggcgcata 300 atctctacaa agaaatcgca ctatcagata gacgatttct ttgccgcaac ttgatttact 360 accagatagg aatcaggcca agcctctcct tacccccttc ttagcgtact accagatagg 420 cgcacactct ttacgacgtg tcgctctacc aaatagacga cccgttcgtt gcaactcaat 480 tcactaccag atagaaatag agcgaagtct cccccccccc ctagttaatt taatgccaaa 540 agttaaaagt gccttaaaaa aaactcgaga gctctttcgg cccgacatcg atagcagctc 600 tgattcggat agtagtgtta aatcggagga atattcccac gtaccgttaa aaaacctgac 660 tatcccaatc gaatccctta acatggcaga gcaaaacgca cttgaagcca tggctcgcca 720 aatggaagct atgatgcaag ccataaatac cctaactaca aaacagatgc aatacgaaac 780 tcaaatcggt gcatcaacca gccgggctag tgacacccgc gagaccggca acacatttac 840 actgatgggt gatccattca ggatacccga tccaataaaa accttaccaa ctttcaacgg 900 caataaaaaa cagttgaatt ggtggctgga aacggccgaa aaaacattga agaattatga 960 aactctagta tcacccgaaa tattccaaat ttatttgact gcagttagca acaaaattga 1020 gggtcacgcc aaggatgtat tatgcacgaa cggcaacccc acaacttttg atgaaattaa 1080 acaaatttta gtagctgctc ttggtgacaa acaagacctg tccacataca attgtcaact 1140 ctggcataat aaaatggacg gtagcgtgag caaacactac cagcgaacaa aacagctggt 1200 ccataatata aaatcgttgg ctaagcagaa cccaaagtat aacgaccatt gggaagcggt 1260 caatgatttc attgatgaat atagcctcgc agcgtatgtc agtgggctgc aaaaaccata 1320 ctttggctac gtacaagctg ccgagccaaa gaatatcgaa agcgcttatg catttatttg 1380 taagttcact tcgaacgaaa ccgatagaga gacgacgctt tatcatcaaa acaggaaaga 1440 taattttacc agatctaacg atcaagacaa acaatccaaa ccaggattta aaaaaccttt 1500 cgatcctcaa aacagggcaa ggaacgaaaa taaagttcaa aaacccgaac caatggaggt 1560 cggctcgaca cgcagtcgtc taacgttaaa taatgtaaat aatatagaaa ccaattctga 1620 caacgaagag gaagaggacg aagcttttca ggacacaaat ttttgcgttc tgaacgagtc 1680 cgactcggac acatagaaca tgatttttta ccgtacatca aaatcaggac tccaaaaaaa 1740 caaattttca aaattttaat agatacaggt gccaataaga accttctgag accaggcata 1800 ttaaagaata catccaaaat agaaagcacc caatccaaaa acgtaagagg tacgtttaaa 1860 atatccacta aaggaaaaat taaattactc ggtcccgact tcccagaatt aactttctac 1920 gaacttgatt tccatccatc atttgacgga ctgattggct ctgaaaccct ttctaggtat 1980 aaagccgaaa tcgactacga gaaaaaaact atcgttttta atagtaaaac gataccgttt 2040 ttcaaacatt tcattaaaaa ggaaaaaaca ttcaatcaca ttgttactct caaaagtaat 2100 aaagatggcg attggttggt aggtgaaccc accgaacttt ataaaaacat cttgatagaa 2160 ccaggaatct ataaatcagt cgacaacaaa acatcgtttc ttgtacggac aaatagcaaa 2220 gaacccccta aattaaaaaa aaatattatt catatcaaag tcaataattt cgaaacgcgg 2280 aacttcttct cgaacgataa acatccccct atccaacatg tagagaaact agaaaattta 2340 ataagaactg atcatttgtc cattctcgaa aagaagtctc tcattgaact aattaacaaa 2400 cataagaatg tcatactgaa agaaaacgaa aagttgacat ctaccacgaa aattaaacat 2460 aaaattcaaa catcccacga aaaaccaatt tatacaaaaa catacagata cccccatgcc 2520 cacaagcaaa ccgtgaaaga tcaaatagag gatatgcttg aaaatggcat aatcacgcat 2580 tcggattctc cctggaatgc accaatatgg gtagtaccaa aaaaacctga cgcctctgga 2640 aaaaagaagt ttcgagtcgt tatcgactac cggaaactta acgaagtaac agttgatgac 2700 aaatttccaa ttcctcaaat tgaagacatt ttagacaatt tgggaaaatc aacatatttt 2760 acgactcttg atttgaaatc agggtttcac caaatagaaa tggatgaaaa ctcaaggcag 2820 aaaacggctt tcagcaccga ccttggccat tatgagtttt taaggatgcc attcgggctt 2880 aagaatgcgc cagctacttt tcaaagagcc atgaattcca tccttggaga actaattgga 2940 acagtatgct atgtctatct agatgatgtg attgtgtttg gtagatcatt aaaggaacac 3000 attacaaatt taggaaaagt actagaaaag ttagaaaacg ctaatctcaa aattcaactc 3060 gataaatgcg agttcctaaa gaaaaattgt gaatttctag gacatgtaat cacaccagag 3120 ggagttaagc ctaatcccga caaaattaaa gaaatccaaa cctgggagtt accaacaaca 3180 gaaaaacaaa taaaacaatt tttgggacta gttggatact atcgaaaatt tattcgagat 3240 tacgccaagc taactaaacc tatgacaaaa tacttgaaga aggatgttaa agtaaatact 3300 gacgatccag atttcaaaaa atctttcgaa aagtgcaaaa ccatgcttat ttcagaccct 3360 attttagcct acccagatta ttcgaaacca ttcgtactga ccacggatgc gtctaattat 3420 gcactagggg cagtattatc ccaaattaac gaaggtaaag accaacccgt agcattcgca 3480 tcaagaaccc tcaataaacg tgaaataaat tactcaacta cagaaaaaga agctttggca 3540 attatttggg caatcgaaaa atttaaacca tacctctacg gtcagaaatt tactctagtt 3600 actgatcata aaccattagt tttcatcaag aatgcaaata aaaacggaaa acttataaga 3660 tggcgattag aactcgaaaa ttatgattat acagtaattt acaagacggg caaaacaaat 3720 gtagtggctg atgcgcttag caggaaagtc gaaactaatg taaatgaaat atctgatcct 3780 tcagacatcg aaacagttca ttctgcatca agttcagaag aattttttat tcatttcacg 3840 gaaaggccca ttaataatta ccgaaaccaa ataattttta gaacaacttc atttaactcc 3900 atacttatgt tcgacatctt ccctgggtat aaaagaacaa caatttgtga cagtaatttc 3960 acagcagata aaattactga ttatcttaaa agatatcgga atggtagaca gacggctatt 4020 cacgcgccag aaagtttact gaacatgatc caagtgtcct atagaaacca cttctcgcat 4080 gcacaccttg ttattaccca aaacatggta gaagatataa tcagcgaaga acgtcaagat 4140 aatatcatta ggcaggaaca cgatagagcc catagaggaa tccaagaagt cgaacaacag 4200 ataaaaagat cctacttttt ccctgaaatg gtcaaaaaga taagacggtt taacaatgct 4260 tgtcaaatat gttccgggca caaatatgat agaaaaccat ataacataaa aatttcacca 4320 agacccattg aaatagcacc ttttcataga atacatattg atatttttgg aatggacaaa 4380 attaactatt tgactataat ttgcgcattt tcaaaacatc tgcaggctat taaaattgaa 4440 acaagaaata ctattgatat ccaaaatgcc cttaatcaat atttttcaaa ctttgcaata 4500 ccacgatcca tagtgtgcga ccatgaagcc gcatttacga gtatacagat ggcagattac 4560 cttgcaaatt taggaactca aatggatttt gcatcatcat ccgagtccaa cgggcaaatc 4620 gaaaaaacgc attgtacttt aattgagctt tacaacacaa acaaacacaa atttccgaac 4680 atgtcttctg aacagattat gggaattata gtggcattgt acaacaacac agtacactcc 4740 gctacgggat tcaccccaaa tgaaataata tttaacagaa cagatgcgat aagtcctgcg 4800 gagatcgata ataacgcaaa cagaattttc agaaaagtaa cagaaaactt acagaaagcc 4860 caacagaaaa tggaacgtta caatgacgaa aaagaagacc ccccagtaat agaagaggga 4920 cagacaatat ttgttaaacg cggtaccaga aaaaaattgg atcctagatt caacacgacg 4980 aaatgcttgg aaaataaaga aaaaactatt aaaattccta gaaatacaaa gcggaataaa 5040 aacaagatta aaagaataag gaaaccataa tatttgttct ttattaatca atactttact 5100 tttcatttca gagaacccac ttttatttta aacttatgca acccggacag acatcgttgt 5160 gaatatcacg tacattttta acaatcaaaa ctaaaagatc aattcattta tagaatcaac 5220 caaccattgg gaggaacctt tcataccttg acgacataca taaccagcca ggaagccaaa 5280 atttgccgta taatatcaca aattgcacat tttcagacaa agcatatttg atttaacgga 5340 taaaacacag catacggtca acctatacat ttgatttatc taattcaaaa attaaacttt 5400 aagttagtta aaagcaacta tgactattac gcaatttgta ctacttgtgt tatgtgtaat 5460 ttcaaacgtt cacccacaat cgttaaacat tagaaacctt tataatgaac cacttttatt 5520 attgaaattt aaagactgta gaattcaaaa atcttttata aaaatagtac atccaatcaa 5580 tcttacatct atcgaaacag aggttatacg tatagttaaa ttagccagtc agttagatca 5640 atctttacag atttctcaat taacgattca gaaaagccga atgttgatca acaatcttcg 5700 acagctgaaa ccaatccatc ataggagagt acgacgctgg gacaccattg gaaaagcctg 5760 gaagtggcta gccggaaccc ctgacgcaga cgatctcagg atcataaaca gcactatcaa 5820 ccaacttata acgcaaagca ataaccaggt aatcataaac cagatgataa cgaaacgaat 5880 aagcggaatg acaagcgttg taaacgaact tataaacaaa cacaacatgg agaacagaat 5940 aattcttaag gaatacgatg ccctaacact catactgtat atagacagca tgaaccatat 6000 tatcgaggaa atccaagata ccgttttgcg taccaaaata atgctcccta acaacaaatt 6060 actaacgctc aaggaaataa cattagtcga atcgcttctc gtcgaacagg gaataaacgt 6120 acggtttccc gaagaagcac tcgagtatgc tacaccaaaa gttgccatca gagaagattg 6180 tttgctctat atcctacaga ttccaaaact ctacgaccaa aatgcggaag tcatcgaaat 6240 tattcctttg actacgaata actacatcat cccagaagca ccaaaatacg tgataaaaat 6300 gcagaacagt tggtttctaa cacactatcc agagaaaacc atccaaaagg aatcagactt 6360 aagcacaatg aatgacgatt gcatccacgc tatactcaaa ggtgcggtaa gtcattgtaa 6420 aaccatagaa aatcatgaca tcaccattaa acaggtgtca agaaataaaa tcttaatcaa 6480 caatagcaaa caatcctata ttggatcgaa ctgtggccca cataatcgat cactaacagg 6540 taactactta attacattcc ataattgtac ggtagttata aataacaaag tatacgcacc 6600 agaagaagtg atcgacaata taacgtacga tttccaagga gcttttccca atttaaaaat 6660 caattggagc tttacagagc attacaatat ttcaacactc aaaaattcaa cagaattcaa 6720 cagagagcac atggagctga tgaaactgga acaaagtcaa cataaatcat ggatcatcac 6780 ccttgccgga ggactctcaa ctacaacgct aacaataatt ggaatatttt tatacgtttg 6840 tctccgtaga agaaaactta tcatcaaaat caaaaacccc ggaagaatta accaaaagct 6900 cgaggacgac ctttctctac ccccccgagg agttaccg 6938 // ID BEL-83_AA-I repbase; DNA; INV; 5530 BP. XX AC supercont1.342; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-83_AA_; KW BEL-83_AA-LTR; BEL-83_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5530 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.342; Positions 586984 592513. XX CC Positions [4535-5122] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 179..5530 FT /product="BEL-83_AA-I_1p" FT /translation="MEKLFRERKSLDPRIKRVSEMVGKLKPETAEETEVQT FT ELEVLGDIWASYSMVHKRILDASEDDELYEDAVQQQCRFESVYITLKNRLL FT KVLKVIKNRDSSHEERSSSQNDALSQLAQQQAELLRLMSGRMSAGASTSTA FT IAEGFPNLTPLSDLKLPRMNLPVFSGDYLEWQSFIDLFRSMVDQNPSLKDS FT QKLYFLKTNLSGEAASLISHLKIEDVNYTPALQKLKSRYDKPLEIAHKHIE FT RFLNQPALTSPSAQGLRSLHDVSDEVVLALQAMQREDRDTWLLFILIEKLD FT HETKQLWYQKAADMSEANVTLQTFFAFIDSRSFALQSAQTIKQRASTTYKP FT LMKPQYRGATTFVATNTFSNCSVCDKSPHPLYQCGKFIHMSADERMSAVNS FT QRLCQNCLKGHAGEICRSGNCRKCGQLHHTLLHYAIMPTNVHQGPSTTYVV FT SGDGSAAQSLISALDSNTTFDATNTLLATVSINVMDKQGRPHTCRAVLDSA FT SQVSFISESFCKELGLDLLEADMDLEGISSTPAHVNKCAQIVLASRCTDYT FT TTVPCMVLETITKSLPVKPANIDGWPIPKSINLADPLFHRPGRINLLLGIE FT LFFQLLEPGKINLGPDDNFPMLQNTKLGWVVAGRYRDANIPSYATSPTCFL FT ATSDEGLDQQLRRFWELEEYATPSPHLNEEEKRCEDHFYKHTIRDISGKFT FT VRLPFLHSPTQLGDSRQMAERRLSHIERKLDRNPQLKHEYHSFLREYCELG FT HMTLADGPAPIGSVYLPHHCVVKEASSTTKCRVVFDASAKTTSGKSLNDVL FT MIGPVLQDSIVNILLRFRFPAIVVAGDIKQMYRMVDIHEADRDFQRILWRW FT SSDRPIDEYRLNTVTYGMKSASYLATKCVQQLLESHRDQYPTTVERAEKGT FT YVDDVLTGADTEEEAMLLRRQLTTIFTAGGFHLRKWASNSATVLNDIPDQD FT REVKLPIELNDTRTIKALGIHWQPCSDEFQFSYSPTKISQPTKRNVLSQIA FT SIFDPLGLLAPIIVKAKLAMQQLWELRVDWDETLPGEFVNNWLLFEQNLSD FT LQYLQVPRRVIGVEGAHRVCLHGYSDASERAMGACVYIRAADKQGNTSSHL FT LCAKSKTAPIGNGRTTLPRLELCAAVILSRLMANVMAAISLPINEVQAFSD FT STVALAWIYGGASRWKTFVANRVAEITTHLPAINWRHIDTHTNPADLISRG FT ALPDQIIGNPLWWHGPEWNGSNSTHQYPSISALDINQQRHVEREQRSVAVA FT YLAIYENPFLDAMLARYYPNLQVLLRITARILRFSHREFRLTSRLTPPEID FT CAMEIYVRHIQLQHYWKEINQLKQNRDVHRGSSLHQLKPFLDANDLLRVGG FT RLQLWDLSYDMKHPILLPSHSRLTALILHHEHHEQLHCGPQSLLASIRRKF FT WIVRGASAARKTYRTCVECVRTRSSPLHQLMGQLPADRLKPNPPFSITGID FT YAGPINIISRRTRGVVPSKGYIALFVCFTTRAVHLEAVSDLSTSAFIAAFT FT RFSSRYGLPNKVYSDNATNLRGAARKFREVYQAINTTITADDVADFLTNKS FT IEWLFIPARSPHHGGLWEAGIKVAKGFLSKFDDNVRFTFEELSTVLAQVAA FT CMNSRPITPLSDDPNEPQPLTPAHFLIGRSLNAVPEINQLERRVGSLNRWE FT YVQRIGQEFRLRWQSEYVLSLQRMTKWQRSAPNVSVGDFVLLIADNEKPKQ FT WPIGRILDVFPGQDGHVRVVSVKTANGVTRRDVRRIRRIPLEDDEYVPGRN FT GAEIPRGNLVGGL" XX SQ Sequence 5530 BP; 1515 A; 1399 C; 1275 G; 1341 T; 0 other; tatggtcctt cgagccgaat tcgcgaaagt ttcgtgaaaa gtgcagtatt gattaactag 60 tgtcacgtgc attggccttg tagtccgact agtcctttct gtgctactgc caatccgtga 120 tagttatcga gcaaaactct ctgcgggaag gaagtgccta gtgaactgtg aacccaagat 180 ggagaaactt tttcgcgaac gtaaatctct ggatccgcgg atcaaacgtg tgtctgaaat 240 ggtgggcaaa ttgaagccag aaacagcaga agagactgag gtgcaaacag aactagaagt 300 gcttggtgat atatgggcct catatagcat ggtgcataaa agaattctgg acgccagcga 360 ggacgatgag ttatatgagg atgcagtgca gcaacaatgt cggttcgaaa gtgtatatat 420 taccctgaaa aaccgtttgc tgaaagtgtt gaaagtgata aagaatcgcg atagcagtca 480 tgaagagcga tcgtcatccc aaaacgatgc attgagtcaa ctcgcacagc agcaagccga 540 actgcttcga ttgatgtctg gaagaatgtc tgccggtgct agtacttcta ctgcaatagc 600 tgagggtttc ccaaacctaa caccactgtc cgatctcaaa ttaccacgaa tgaatctgcc 660 tgtttttagt ggagactatc tcgagtggca gtcattcatt gacttgtttc gaagcatggt 720 tgatcagaat ccttcactaa aggatagtca aaaattatat tttctgaaaa caaacctctc 780 tggtgaggca gcatcgctta tttcgcactt gaaaatcgag gacgtgaatt acactcctgc 840 tctgcaaaag ttaaagtcgc gatacgacaa accactcgag atagcccaca aacacattga 900 acgcttccta aaccagccag ctctgacgtc accatctgca caagggttac gatccctgca 960 cgatgtctcc gatgaagttg ttctcgcact tcaagctatg cagagagaag atcgcgacac 1020 ttggttactt ttcattctca tcgaaaaatt ggaccatgaa accaagcagc tgtggtacca 1080 aaaggcggca gacatgtccg aagccaacgt cactctccaa acgttcttcg cgttcattga 1140 ctctcgcagt tttgccttgc aatctgcaca aaccatcaaa caacgagctt ccactacgta 1200 caaacctcta atgaagccgc aatatagagg agcaacaacg ttcgtcgcca ctaatacatt 1260 ttcgaactgc agcgtttgtg ataaatctcc ccatccgctc tatcagtgcg gcaagttcat 1320 tcatatgagc gctgatgaga gaatgagtgc tgttaactct cagaggctat gccaaaactg 1380 cctgaaaggg catgcgggtg aaatatgcag gtcgggaaac tgtagaaaat gtggccaact 1440 acaccacact ctgctacatt acgctattat gcctaccaac gttcatcaag gtccatcaac 1500 aacgtatgtc gtatctggtg atggttcggc cgcccaatcc ctgatttcgg cccttgattc 1560 caacaccact ttcgacgcca ccaatacgtt actcgccacc gtctccatca acgtcatgga 1620 caagcaagga aggccccata cttgtcgagc ggttctagat tccgcttctc aagtaagctt 1680 catcagtgaa agcttctgta aggagcttgg tctcgactta ttagaagcag acatggacct 1740 cgaaggcatt tcatctactc cagcgcatgt aaacaaatgc gcacaaatcg tcttagcatc 1800 ccgatgcaca gactacacta caacagttcc gtgtatggtg ctcgagacga tcaccaaatc 1860 gctcccagtt aaaccggcga atattgacgg ctggcccatt cccaaatcta ttaatttggc 1920 cgacccattg ttccatcgtc caggtagaat caacctactg cttggtatcg aactattctt 1980 ccagcttctt gagccaggaa aaatcaatct tggtcccgac gacaattttc ccatgctgca 2040 gaataccaaa ttggggtggg tggtagctgg tcgctatcgt gacgccaaca tcccgtcata 2100 tgcaacttct ccaacgtgtt ttttggcgac ttccgatgaa ggccttgatc aacaattgcg 2160 acgtttttgg gagctggaag aatatgctac accttcaccc cacctaaacg aagaggagaa 2220 acggtgcgaa gatcacttct ataagcacac cattcgagac atctccggaa aatttaccgt 2280 gcgattgcca tttctacatt ccccgactca actaggcgat tcgaggcaaa tggctgaaag 2340 gcgattaagc catatcgagc gaaaactgga tcgaaaccca cagctaaagc atgaatacca 2400 ctcttttctc cgtgagtact gtgaactggg acacatgaca ctcgctgatg gacccgcacc 2460 tataggttcg gtttacctgc ctcatcattg tgtggtgaag gaagccagtt ccacaacgaa 2520 atgtcgcgtc gtcttcgatg cctccgcaaa aaccaccagc ggaaaatctc taaacgatgt 2580 gctcatgata ggtccagtac tgcaggactc aatagtcaac attcttttgc ggttccgatt 2640 tccggcgatt gtggtggccg gtgacataaa acaaatgtac cgcatggtgg acattcacga 2700 agccgatcgt gatttccaga ggattctttg gcggtggtcg agtgatcggc caatcgatga 2760 atatcgacta aatacggtca cgtatggaat gaaaagcgca tcatatctcg ccacaaaatg 2820 cgtgcagcag ctattagaat cgcatagaga tcaatatcca acgacggttg aaagggcaga 2880 aaagggcact tatgtggacg atgtgctgac cggcgctgac accgaggaag aagccatgtt 2940 attgcgacgg cagctgacaa caatatttac ggccggtggg ttccacctgc ggaagtgggc 3000 ttcaaacagc gccacagtac tgaacgatat acctgaccag gacagagaag taaagctacc 3060 aatcgagctc aacgacactc gcaccataaa ggccctcggc atccactggc agccgtgcag 3120 cgacgagttc cagttctcct actcaccaac caagatctct cagcccacaa agcggaacgt 3180 gttgtcgcaa atcgccagta ttttcgaccc gcttggactg ttggccccaa taattgtcaa 3240 ggcaaaattg gcaatgcaac agttgtggga gctgagggtg gactgggatg aaactctccc 3300 cggtgagttt gttaacaatt ggttgttgtt cgagcaaaat ctctccgatc tccaatactt 3360 gcaggtacct cgccgagtga tcggtgttga aggcgcacac cgcgtctgtc tgcacggcta 3420 cagtgatgca tccgagcgag caatgggtgc ctgcgtctat attcgcgcag cagataagca 3480 aggcaataca tcatcgcatc tgttgtgtgc gaaatccaag acggcaccta tcggcaatgg 3540 acgaacaaca ctaccacgat tagaactgtg tgcggcggtg atcttatcac gactcatggc 3600 aaacgttatg gcagcaatct cacttcccat caacgaagtt caagcttttt cggattcaac 3660 ggtggctctg gcatggatct acggtggagc atcgcgatgg aagacattcg ttgccaatcg 3720 ggtggccgaa attactactc atctacctgc aattaattgg cggcacatcg acactcacac 3780 caaccctgct gatctaatct cgcgcggtgc gctacccgat cagatcatcg gtaaccctct 3840 ctggtggcac ggacccgagt ggaatggatc gaacagcact catcaatatc cctccatctc 3900 cgctctcgac atcaatcaac agcgacatgt ggaaagagaa caacgttcgg tagcagtggc 3960 gtatctagca atttatgaaa acccctttct tgatgcgatg cttgccagat actacccaaa 4020 ccttcaagtt ttacttcgaa ttacggctcg aatattacgc ttctcgcacc gtgaatttag 4080 gctgactagt cgtctgactc ctccagaaat cgattgcgcc atggaaatct acgtccggca 4140 catccagctt cagcactatt ggaaagaaat taatcagcta aagcaaaacc gcgacgttca 4200 tcgtggcagc tcgcttcatc agttgaaacc ttttctggat gcgaatgatc tgctcagggt 4260 gggcggtagg ctgcaactat gggacctcag ctatgatatg aaacatccga ttctattgcc 4320 tagtcactca cgtctcaccg cactgattct tcatcacgag caccacgaac aactacactg 4380 tggaccacaa tcattgctcg cgtctatacg acgaaaattt tggattgtcc gtggagcaag 4440 tgcagctcgc aagacatacc gaacttgcgt tgaatgtgtt cgaacgagat cgtcaccact 4500 acatcagcta atgggccagc tccctgcaga tcggctaaag cctaatccac ccttttcgat 4560 aaccggtatt gattatgccg gaccaatcaa catcattagc cggcgaacgc gaggtgttgt 4620 acccagcaaa ggctacattg ctctatttgt ctgcttcact acacgcgcag tccatctcga 4680 agcagtgtca gatctcagta cgtccgcatt tattgctgcc tttacccgct tcagtagccg 4740 atatgggtta ccgaacaagg tttactcaga caacgcaact aatctccgag gagcagccag 4800 gaagtttaga gaagtgtatc aagccatcaa tacaactatt actgctgatg atgtcgccga 4860 tttcctcacc aacaaaagca ttgagtggct gttcatccca gcccgatcgc cacaccatgg 4920 tggactttgg gaagcgggta taaaggtggc gaaaggtttt ttgagcaagt tcgatgataa 4980 tgttcggttc actttcgaag agctcagcac agtcttggca caggtagcag cctgtatgaa 5040 ctcacgaccg attactcctc tctccgacga tccaaatgaa cctcagccgc tcactccagc 5100 acattttctc atcggacgtt ccctaaatgc tgtgccagag ataaatcagt tggaacgtcg 5160 cgtcggatcc ttaaacaggt gggaatacgt acaacgcatc ggtcaagaat ttcgattgcg 5220 gtggcaatcc gagtacgttt tatctctcca aagaatgaca aagtggcagc gttcagctcc 5280 aaatgtgtcg gtcggtgatt ttgttctgtt gatcgcagat aacgaaaaac ctaaacaatg 5340 gcccataggt cgtatactgg atgttttccc aggacaagat ggtcatgtca gggtggtttc 5400 cgtcaaaact gccaacggtg ttacacgacg agatgtccgg cggattagga gaattccatt 5460 ggaagacgat gaatacgttc cagggcgtaa tggagcagaa attccacgag gtaatttggt 5520 gggcggctta 5530 // ID piggyBac-N5_BF repbase; DNA; INV; 709 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-N5_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW piggyBac-N5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-709 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-709 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-709 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-N5_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX SQ Sequence 709 BP; 179 A; 153 C; 158 G; 219 T; 0 other; cccgtaaaag acgatcggct acggtcgtgg ctttccggaa aaagacgatc ggccacggtc 60 gtgccacttc ggggctctct cgggtcaact tcgattatat ccgctaattc tcggtaatct 120 ccggggaatc cccggcatac ttcggggaac agcctcgttt tcatgtacac agccggaaga 180 catttttcta gaaggttatc agcgatttct ttgtgacttt tgcagctgaa atttagcgcg 240 ctatgtccgc gctaccatga aggcgctacc ggagacgata gttgaatgat cgtttcacaa 300 tcgacgtgga ctttgatgag tgtaactacg ataaaagcta caatctagcc atgttttggc 360 ctcgtattta caatgtaagg catggggaca ggtctgtaca agttttctga aaaaatattt 420 gacgtgattt ggtcttgtca gatgatcgat gttgtgttga ctttccggac tgagcatttc 480 catgtatcat tgctggaaat tcatatatta gacctggaat aatcttgtct ttgatttatt 540 tgtattttac atgatctcag cttttcaaaa atgtttagat ttacctatct agcgcttttg 600 tagagcaagt acaagcccca aaacagtacg gaggtcgttg tgctaatatc cacagtttac 660 ctacgtataa tcctggaaca accctgaatt ttatcccgtc ctcaacggg 709 // ID Sola1-1_AC repbase; DNA; INV; 4097 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aplysia californica. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-1_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4097 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1262..3322 FT /product="Sola1-1_AC_1p" FT /translation="MINTSLRMRTRLLGVGDVDDSDTDPSFKPGKCEVKKC FT KTEVFAACTSCLVLLCYDHFLEDPESCSQHEKTWQSTNTESITHNDFETDE FT TQSSNERLPESFEVEGSARQASPKKRKRINKKNDAKRKRQSGEEYVSAGTK FT KVVPARMMRKGCVGMECTRAGRQCSGITETDRREIFQDFYKLADLQKEREF FT VARHIETTEPKVGKPNSRRSKTHAYYLTVNGKRLKVCRNTFLATLSISEKF FT VRVVLSKITETGVVEEDKRGGRQSEQKKTREEQNREKISRHIDRFPRVESH FT YCRASSSKEYLSADLTLSKMHDMFVKENPYNDISSFYTYRRVFKTKSLAFH FT SPKKDQCSLCNSFRKGDEAQKNELREIYNRHEAEKQKVRQLKEDCKKMAQA FT DHSILAGNFDLQQVIYLPISHEDALFYKRRLSNFNLTFYNLGDHTCDCFIW FT HEGQSRRGSSEISTAVCTALREYDQRGIKTAYLFADGCPGQNKNTIMPTMM FT LHMINTSAHTEELSLRYFESFHGQCKGDSAHSAISTALGNAGDVFLPSQLF FT PIIALARRKQPYKVHPLEHGDFLNYKKLSEDLKVLNIRKDNQDSGVPVNWN FT KVMEVRVNKAHPTTIFFLNSHLEEQYRSISLKRQLSHLIHCEVKQLNEEPN FT KISKEKYGDLMSLCSGNTPVIRNAEHKAFFCQLPHSE*" XX SQ Sequence 4097 BP; 1332 A; 922 C; 913 G; 930 T; 0 other; cgtcgcctgt cacacgaaag ggcgtattcc agattttggc agttttgagt tgttactact 60 agtagatagc gctgcctcag atctatgttc acacgaaacg gcgtatccct agctttaaaa 120 taacagtact agcagtaaaa aaacgaaaag ggcgtacgct aacactcgac gcgaccgaat 180 cagacgaaac ggcgtacgca ggcttgcgcc ctttcgtgtg caaaaaaatt gatacaaagt 240 tgtatcaaat ctattcgcac gaaaggacgt ataccttggt tacgcccctt cgtgtgaacg 300 accgttacac ttttgtcttt ttttttctgg cataggccaa ggccgtatga cgtaatatag 360 taatactggc cctcctctaa ttgtacaata ggcctgtaca cagcgcatgc actgtgttgc 420 agcgaatcca gcttgtgcaa gagagaaaac aaaaccggcc ttattgtgtt gttccagttg 480 cttactggtg cttgcggctt gttgtcacat acaacttgtc tttcaaactg acctgcttac 540 cttctgacga ttggaaacca ttaaaaaaag caaacttggg tttgatcaaa atttaaataa 600 gatctacttg ttgtccgacg attcacatga ctattgctaa gctatatgtt cttcctttag 660 cgtattatgc tacactaatt aactagaagg ctacgctatc tagacacccc ccccccccca 720 ccccaccccc ggcagcgcta ctgactgaag ggataaaagc atcaggactt tgcctttgtt 780 atattattga tctgaagaca tgtctacaaa gtgagccctg tcagtagcac cagaatcttc 840 ataatttgca cacacactca cacacacaca cacacacaca cacacacaca cacacacaca 900 cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 960 cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca catacccata 1020 tggagggcgc acgcccacac acacacacaa gcacacacac acacacactc acactcacaa 1080 aagaaactca acctaaattg ttctttagct tgtactcaaa ctattgggga tttttcttgt 1140 gttttaggcc ggagggcagg tcgctttcat ggaagtgact ccgacaaaga agatttctcg 1200 ttggcagatg acattccgtt agctaaactt agatgtaggc ctacaatgac aagtgaattc 1260 catgatcaac acgagtctga gaatgaggac acgtctcttg ggcgtgggtg acgtcgatga 1320 ttcggacacg gacccatctt tcaaaccagg caaatgtgaa gtcaaaaaat gcaaaactga 1380 agtattcgca gcatgcacca gctgtttagt tctcctttgc tatgatcatt tcctagaaga 1440 tccagaatct tgtagtcaac atgaaaagac atggcaatca acaaacactg aatccataac 1500 gcacaatgat tttgagaccg atgagaccca gagtagcaac gaacggctac cagaatcatt 1560 tgaagttgaa ggatcagccc gccaagcgtc accaaagaaa cggaagagaa tcaacaagaa 1620 gaacgatgca aaaagaaagc gacagtctgg cgaagagtat gtgagcgccg gcacgaaaaa 1680 ggttgttcca gcacgaatga tgagaaaggg ctgcgttgga atggaatgta cgagagctgg 1740 ccgtcagtgt tctggaatca cagaaacgga taggagggag atatttcagg acttttataa 1800 actggctgac ctgcagaagg agagagagtt cgttgcgagg catattgaga cgactgagcc 1860 caaggtgggg aaaccaaact ctcgacgcag caaaacgcat gcttattatc tcacagtcaa 1920 tggaaagcgc ttgaaagtgt gtagaaacac attccttgca accttgagta tttcagagaa 1980 atttgtccga gttgttttgt caaagattac ggagaccgga gttgtggaag aagacaagcg 2040 agggggcaga caatcggagc agaagaagac aagagaagaa caaaacaggg agaaaatcag 2100 tcgacacatt gacagatttc cacgcgtgga atcgcactac tgtagggcct cttccagcaa 2160 agagtatctc agcgcagacc tcacattgtc caaaatgcac gacatgttcg tgaaagaaaa 2220 cccatacaat gacatctcaa gtttttatac atacagacga gtgttcaaga cgaaaagcct 2280 ggcctttcat agtcctaaaa aggaccaatg ctcactctgc aactctttcc gaaaagggga 2340 cgaagctcag aaaaatgaac tcagggaaat ttacaacagg cacgaagcag aaaaacaaaa 2400 ggtgcgacaa cttaaggagg attgcaagaa gatggctcaa gctgaccaca gcattctggc 2460 agggaacttt gatctccaac aggtcatata cttgccaatt tcacatgaag atgctttgtt 2520 ctacaagagg agactttcca atttcaattt gacattttat aatttgggtg accacacatg 2580 tgattgcttc atttggcacg aaggccaaag cagaaggggc agctcagaga tatcaactgc 2640 ggtgtgcacg gcactgaggg aatatgacca aagaggcatc aagactgcct acttgttcgc 2700 cgatggctgc ccgggacaaa ataaaaacac aatcatgccc accatgatgc tccacatgat 2760 caacacttca gcacacacgg aggaactgtc actcaggtac tttgaatcgt tccatgggca 2820 gtgcaaaggt gactcagcac acagtgctat aagcacggcc ttgggaaatg ctggggatgt 2880 gtttctccca tcacagctgt ttcctatcat tgcattggcc cgacgcaaac agccatacaa 2940 ggtacacccg ctagagcacg gtgactttct aaactacaaa aaactgtcag aagatctgaa 3000 agttctcaac atccgcaagg acaatcagga ttcaggtgtt cccgtgaact ggaacaaggt 3060 catggaagtg agagtgaaca aagcccaccc aacaacaata ttttttttaa acagtcacct 3120 ggaagagcag tacaggtcca tttcgctgaa gcgccaactt tcacacctaa ttcattgtga 3180 agttaaacaa ctaaatgaag aaccaaacaa aattagcaag gagaaatacg gcgatttgat 3240 gtctctgtgc tctgggaaca cacccgtcat ccggaatgcc gagcacaaag ctttcttctg 3300 tcaactacca cacagcgaat gagatgtctc acaaaattca agctgagaaa tacaaacatg 3360 tttgatcaag gatagcctac caaagcatga tggcatccat cactaatttg tttgttttta 3420 aaaaataata aatttggaat cggaacgaaa ggaaactgct tgtctttcta aaacgtgacg 3480 cctgtacctg agagtatgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 3540 tgtgtgtgtg cgcgtgcgtg cgtgtgtgtg tatatgtgtg tgtgtgagtg agagagagag 3600 agagaaagat acagacggac tgacagacag acagacagat agacagacag acagacagac 3660 ggacgaacag agaggtgtgg gaaggaaaag agaaagggga agatttggcg ggtggatgtg 3720 ggagtgagga ggaattttat cctctactgg tcagtctgaa gactgaaatt atttgtgtga 3780 acactcaaac acagggcgta tacatgggcg tatacactga tttaggacag gacgaaacgt 3840 cgaacaatca cacgaaaggg cgtatgccca cagatggcac attttcgcag tgaaattatt 3900 aaaaaaagtg caaaactaat cttgctgatt gtttgatatc ttatcatgca cacaatatcc 3960 aaatatcgcg atggtacgat gcaaattcct ataatgacaa gggaaaagga taaagcaacc 4020 taagtctgaa aatctttaaa tcgttctcct gaaaaagtaa gaaaacaggg atacgccctt 4080 tggtgtgata gacgacg 4097 // ID CR1-102_AAe repbase; DNA; INV; 4527 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-102_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4527 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1190-1190 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 99..1388 FT /product="CR1-102_AAe_1p" FT /translation="MQCSVKTCNISDDRFLWKCEFCSKNYHAACIGVQRHQ FT ECFILSYMVPLCDDCQHNLKTGIDTRKVLHQQQQLINSIRAQTDANLRIAA FT DLKKLSAMGELFDQIELQLKESVLCINSSTSSSVSNAVSALSRVAEKYSGN FT TNDMPSNDLIAIKNHLTGLLDISMKSSKQHIEDFVEALTVDLTDELKKICT FT EVQSLSSLTIEMAAHCNEHNASIGTARFVENESPPSLLQELAETADLNSGT FT AGWRLLGTKKVWRSDWTEYDKRQKRRALQQKAKEKAQKRKKAKKSFNTSKS FT TTNTNMNNKEKPLNVLPNFMNYNDSHFRGRSNGKCDHDNSQKPKVFINRCR FT KGNNFLPPDRELLAVAKDRYSRPPTNYPPIQFQRGETLNPYPVCNGQQRHV FT PATNQPTNWMEPSCPSTSTASCKSCGCTRYSCFQRN" FT CDS 1367..4426 FT /product="CR1-102_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="VLVFSTKLTLSLDEENNNDVLDLNISSNLTSTELQDE FT EVTNLSKSDCSTTFKTSSQLGTEVLVYCQNFNRMKSASKMKEIHSRILTCP FT YTVILATETSWDETVNSEEVFGNNYDVFRNDRDCQTSEKKSGGGVLIAVST FT LFSAEIIETTKFKEFEHVWVKVHIAGETHVFVSVYFPPKNARKIVYEKFYN FT IVEHVIGGFPPETKVHIYGDFNQRDIDFFPDVDNESILLPIFGENEALQFI FT CDTSASLGLNHINHVKNQQNCYLDLLMTNTDEDFCVVESLTPLWKNELFHT FT AIEFSLFVHGNNRPKDCDFEEVHDYRSANYDNIRQKINQVSWQNVLRNEEN FT VESAVKMFYKILNDIIHDDIPLIKRRRNHNSKYPIWFSREIKNLKNRKQKA FT HKIYKKCKNDANLINYLNICDQLNLAIDTAFEEYNERTERELKSCPKNFFN FT YVKTKTKSNNFPSEMHLDDKIGVNPEESCNLFATFFQDVYTTYSENNRDRE FT YFSFLPEIPNDICVNQIKVHDIVDALKSLDVSKGAGPDGIPPVFLKMLSIE FT LASPLFWLYNMSLQSGSFPETWKSSFLVPIFKSGKKSDIRNYRGIAIISCI FT PKIFEAIINKILFDMVKNRITHTQHGFFKGRSTCTNLLEFIHYSLTAMDNG FT NHVEALYTDFSKAFDRIDIPMLLFKLEKIGFQPDLLKWVESYLTNRQQIVR FT YKGVKSIPIKVTSGVPQGSHLGPLFFILYVNDISFILKKVKALIYADDMKL FT FLEIVNDEDIHTFQNEIDMFYKWCNKSLLQLNVKKCNSISFSRKRSIPNTT FT VYLGNQLVEKCVRIRDLGVILDSKVTFIDHYNTIINKANKMLAFIKRFSYN FT FHDPYTIKTLYITYVRSILEYCSIVWSPFSSTHENRIESVQKQFLLFALRK FT LGWTRLPLPSYEARCMLINIQTLKKRREFAMVSFVNDIVSHRIDSMTLLSK FT LNFYAPSRQLRSRSIFYTNHHRTNYAKFEPLNQIMSVYNKHCETIDFTMTK FT SELKQKFMSYQNSN" XX SQ Sequence 4527 BP; 1592 A; 812 C; 812 G; 1311 T; 0 other; tcagtcattc gtcggaggta gacgcgtttt ttaagtgctc cgtaattatt tttatttaat 60 ttttttttat tgacaaacat cgcgcgcgac cggcaaaaat gcagtgtagt gtgaaaacct 120 gtaatatcag tgacgatcgg ttcctatgga agtgcgagtt ttgctccaaa aactaccacg 180 cggcgtgcat tggagtacaa cggcaccaag agtgttttat cttgtcctac atggtacctc 240 tatgtgacga ttgtcaacat aatttgaaaa ccggcattga cacccgcaaa gtgctgcatc 300 aacagcagca gctaattaat tcaattaggg cgcaaactga cgccaatctt cgaatagctg 360 ctgatctgaa aaagctcagc gcaatgggtg aattatttga ccaaattgag cttcagctca 420 aggaatcggt gctgtgcatt aacagcagca catcgtcgag cgtatcaaat gctgtatcgg 480 cattatcgcg tgtggccgaa aaatattcag gaaatacaaa tgatatgccc agcaacgatt 540 tgattgctat aaaaaaccat ttaactggcc ttttagatat ttcaatgaaa tcttcaaaac 600 aacatattga agattttgtt gaggcgttaa cggtggatct gactgatgaa ttgaaaaaga 660 tctgtaccga ggttcagtcg ttgagcagcc taaccattga aatggctgct cattgcaatg 720 aacataacgc tagcattggt actgcccgat ttgtggaaaa tgagtcaccg ccaagcctat 780 tacaagagtt ggctgaaact gctgatttaa actctggtac cgcaggttgg cgtttacttg 840 gaactaaaaa agtttggcgt tcggactgga cggaatatga caaacgtcag aaacgccgtg 900 cactacagca aaaagccaaa gaaaaggccc aaaagcggaa gaaagcaaag aaaagcttca 960 acacttcaaa gagcacgaca aatactaata tgaacaacaa agaaaagccg ttaaatgttt 1020 taccaaactt tatgaactat aacgacagtc acttcagggg cagaagtaac ggaaagtgtg 1080 atcatgacaa cagccaaaaa ccaaaggtgt tcataaacag atgtcgaaaa ggaaacaatt 1140 tccttccacc tgacagggag cttctggctg tagcaaagga tcgatactcg agaccgccaa 1200 cgaactaccc acctatacaa ttccaaaggg gggaaactct aaacccctat ccagtctgca 1260 atggtcaaca acgtcacgtt cccgccacca accaaccaac aaactggatg gagccgagct 1320 gtccttcaac atcaactgca tcatgcaaat cgtgtggatg tactaggtac tcgtgttttc 1380 aacgaaactg acgctctccc tagatgaaga aaacaacaat gacgtgttag atctaaatat 1440 atcttcaaat ttaacaagta ccgaattgca agatgaagag gtaactaatc taagtaaatc 1500 agattgttct actactttta aaacttcttc acagttaggt acagaagttt tggtttattg 1560 ccaaaacttt aaccgtatga aaagcgcttc taagatgaaa gaaatccata gtagaatttt 1620 aacatgtcct tacacagtta ttttggcaac agaaactagc tgggatgaaa ctgtgaatag 1680 tgaagaagtt tttggaaata attatgatgt ttttaggaat gatcgtgact gtcaaacgtc 1740 tgagaagaag tcaggaggag gagtcttaat tgctgtttct acattgttta gtgcagaaat 1800 catagaaacc actaaattta aagagtttga gcatgtatgg gtgaaagtac acatagcagg 1860 cgaaacgcat gtgtttgtgt ctgtgtattt tccgccgaaa aacgctcgta aaatagttta 1920 tgagaaattt tacaatattg ttgaacatgt tattggcgga tttcctcccg aaacaaaagt 1980 gcatatatat ggagatttca atcaacgtga tattgatttc ttcccagacg tcgacaacga 2040 gagcatcctg ctgccaatat ttggagaaaa tgaggctttg caatttatat gtgatacatc 2100 ggcaagtcta ggtctaaacc atattaacca cgttaaaaat cagcagaact gttatttaga 2160 tctcttaatg accaatactg atgaagattt ctgtgtggta gaatcattga ctcccttatg 2220 gaaaaacgaa ctttttcata cagctattga gttttcttta ttcgtgcatg gaaataacag 2280 acctaaagat tgcgactttg aagaagttca tgattatcgt tcagcaaatt atgacaatat 2340 aagacagaaa atcaatcaag taagttggca aaatgtgtta agaaacgaag aaaatgttga 2400 atctgccgtt aaaatgtttt acaaaatttt aaacgacata atacacgacg atataccctt 2460 aataaaaaga agaaggaatc ataactcaaa gtatcctata tggtttagta gagaaatcaa 2520 aaatttaaag aatcgtaagc aaaaggcaca taaaatatac aaaaaatgta aaaatgatgc 2580 aaatttaatt aactatttaa atatttgcga tcaattaaat cttgccattg ataccgcatt 2640 tgaagagtat aatgaacgaa ctgagcgtga attgaaatca tgtcctaaaa atttctttaa 2700 ttacgttaaa acaaaaacca aatctaacaa ctttccatct gaaatgcacc ttgatgataa 2760 aattggtgtg aaccctgaag aaagctgcaa tctgtttgca acattttttc aagacgttta 2820 cactacttat tcggaaaata atcgagatcg tgaatatttc tcatttcttc cggaaatccc 2880 aaatgacatt tgcgtcaatc aaatcaaagt acatgatatt gtagatgcac taaaaagtct 2940 agatgtctcc aaaggtgctg gacctgacgg gattccacct gtatttttga aaatgttatc 3000 aatcgaactt gcatctccgc tattttggct ctacaatatg tctctgcaat caggtagttt 3060 ccctgaaaca tggaaaagct catttttagt gccgatcttt aaaagtggca aaaaatctga 3120 catacgtaat tatcgtggga tagccattat ctcttgtatt ccaaagattt tcgaggcaat 3180 aataaataaa attttattcg atatggttaa aaacagaata actcatactc agcatggttt 3240 ttttaaggga cgctcaacat gcacaaactt gctggagttc attcattatt ctttgacagc 3300 catggataat ggtaatcacg tggaagctct ctatacagat ttcagtaaag ctttcgatcg 3360 cattgacata ccaatgttgc tttttaagtt agaaaaaata ggatttcaac cggaccttct 3420 taaatgggta gagtcttatc ttactaatcg ccaacaaata gttagataca aaggagtaaa 3480 atctattcca attaaagtta catcgggagt tcctcaaggg tctcatttag ggccattatt 3540 ctttatttta tacgttaacg acatttcctt cattctgaaa aaagtgaaag ctctcatata 3600 tgctgatgac atgaaactgt ttcttgaaat agtaaacgat gaagatatac atacttttca 3660 aaatgaaatt gatatgtttt acaaatggtg caataaaagc cttcttcaat tgaacgtgaa 3720 aaaatgtaat tcaatatcct ttagcagaaa gcgtagtata cctaacacaa cagtctacct 3780 agggaatcaa ctagtagaaa aatgcgtaag aataagagat ttaggagtga ttttagattc 3840 caaagttact tttatagatc actataatac aattatcaac aaggcaaaca agatgttagc 3900 atttataaaa cgatttagct ataattttca cgatccatac actataaaaa cattatacat 3960 aacctatgtg cgttcaattt tagaatactg tagcatagtt tggtcaccat tttcatcaac 4020 ccatgaaaat cgtatagaat cagttcaaaa acaatttctg ttgttcgcac ttcgaaaatt 4080 aggttggaca aggttacctc taccatctta tgaagcacga tgcatgctca ttaatattca 4140 aacattgaag aaacgtcgcg aatttgcaat ggtctcgttc gtaaacgata tagtttcgca 4200 tcgtattgac tcgatgacac ttttatcaaa attaaatttt tatgctcctt caagacaact 4260 gcgaagtcgt agtatttttt atacaaacca tcaccgcaca aattatgcca aatttgagcc 4320 tttaaatcaa ataatgtctg tttacaataa acattgcgaa accattgact ttactatgac 4380 gaaatcggaa ctaaaacaaa aatttatgtc atatcaaaat tcgaactaga ttaagaaaat 4440 aacgttttca aactatataa tattaagtgt aaaattgtaa caaaatggtc tacttctgat 4500 tgacgacatg aaataaataa ataaata 4527 // ID Gypsy-10_CQ-I repbase; DNA; INV; 3796 BP. XX AC AAWU01008510; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_CQ_; KW Gypsy-10_CQ-LTR; Gypsy-10_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3796 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 399-399 (2011). XX DR GenBank; AAWU01008510; Positions 121324 125119. XX CC Positions [2511-2981] - Integrase core CC 'CTTGC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 45..3773 FT /product="Gypsy-10_CQ-I_1p" FT /translation="MAKGLSFRRFDEGDESREFPQSRSQRRRIFSEEETSS FT AGEADRDTRCCNLGHLEEDCRVKNIRCWKCRADGHVRATCPTRRSHFPDCQ FT LPGAQGASKAIYSAAGKLAVDKPMTLAVGIGGRCHVMQLDTGAAISAVSVA FT YYGESLKDWKLQQSSLKLNGYGGERLHVRGVIEPTLSYGDSSKKVAFAVIE FT NGGPPLLGRNFVRAFNLGVSSIYSMETDADNIVESMIKCHEELFSDGLGAY FT RYATIKLEMEEDALPVFRKPRTVAYKFVDKVGAELDAMEKDGIISKCDRCS FT WGTPLVPVIKGDGSIRLCGDYKTTVNLYVKDVIHPLPTVDEVFSKLNGGKR FT FSKLDLSKCYNQFLLDEESREVCAISTTKGVYKMNRLPFGVRPASGIVQRV FT LEQLLCGIPGVQNFLDDVLVTGRTDKEHLENLSRVFDVLEKAGLKLNRKKC FT QFFKKEVTYLGHVINANGLCKTDERVKSIRMTKEPRNVQEVRAFAGLVNHY FT SRFVKNIAEMMSPMYRLLRKGTKFVWSKECREAFTRVKEAICEDVMLAHFD FT PGAKLLLVCDASMEGVGAVLMQKVGSEPERPVAFASRVLHAAERNYSVLDR FT EGLAIMFGLNKFFHYLVGNIFTVRTDHKPLISILNPRKGIPAIAASRMQRW FT ANFLGGFSYRIEHVSSEGNIADYPSRAPFESWTLWKEDDTYLNFINTSSTK FT VLDDDVVRVELADDPELAMLKDCLLKGKVGGDLQRGPYGKVFNELSLENGL FT IMRGVRVLVPGTLRKAVLEQAHRSHLGVGKCKTVLRSFVWWPGIDKDLEVH FT IKSCHACLVNRPSPEKAKLIPWEPPKSVWSRVHLDFAGPVKGWSFLIVVDA FT LSKWVEVFPTQKCDTEFVLEKLVDCIARFGLMHEIVSDNGTQFTSARFKNF FT LEANGIRQVLTSPGHPATNGQAENSVKTFKASLMKSFASGSTDVKEIVANF FT LLGYRSAAHCSTGSSPAQLMLGRQPRSTLDLLRNNNRCVASEQETKAREVV FT LARQSKQVDNYKGKREEQFNLNERVMVRDYTNPNKAAWTRAVVTGIVGKRN FT YVVKLSSTGRELKRHLDQMIADTTASTAGGGKSGQKTAPLRQREQQPDRVT FT VYRTMPKRVVELPSASDVAEVDPVPVDEFRPEVPTDSDLPEGDQETPTEEN FT AAQADSGLPEGDQSQPETPSVEVATEPNSDPAEGATSEDDDDDNFEDPQTD FT VTKSDDSDAGLEFAGFDRGKWKFRDYLGRLLRK" XX SQ Sequence 3796 BP; 967 A; 858 C; 1173 G; 798 T; 0 other; ttggcgacga gtttaaatcg gatttacgcg attagagtcg ggagatggcg aaagggcttt 60 ctttccgacg gttcgacgaa ggggacgagt cgcgggagtt tccgcagagc aggagtcaga 120 gacgacggat tttctccgaa gaggagacgt caagcgccgg agaggcggat cgtgacacac 180 ggtgctgcaa tctgggacat ttggaggagg actgccgggt caagaatatc cggtgctgga 240 agtgccgtgc agatgggcat gtgcgagcta cgtgcccgac aaggcggagc cacttcccgg 300 actgtcagtt gcccggggcg cagggtgctt cgaaagcaat ctactctgcg gcgggaaagc 360 ttgcggtgga caaaccgatg acgctagcag taggaatcgg cggccgatgc cacgtcatgc 420 agttggatac gggcgcggcc atatcggcgg tatccgtagc atattacggc gagagcttga 480 aggattggaa attacaacag tcgtcgctga agttgaatgg ctacggcggg gagcggctgc 540 acgtgagagg agtgattgaa ccaaccctgt cctacggtga ttccagcaaa aaagtggcct 600 tcgcggttat agaaaacgga ggcccgccgt tgttgggaag gaatttcgta cgcgctttca 660 acctcggcgt ttcttcaatc tactcgatgg agacggatgc ggataacatc gttgaatcga 720 tgatcaagtg tcacgaagag ctgtttagtg acggcctggg tgcgtacagg tacgccacca 780 ttaagctgga gatggaggag gacgcgctgc cggttttcag gaagccaagg acggtggcgt 840 acaagttcgt ggacaaggtt ggggcggagt tggatgctat ggaaaaggac ggcataatat 900 cgaagtgcga ccgttgcagt tggggtaccc cgctagttcc agtgatcaaa ggcgacggaa 960 gtatacggtt gtgcggagac tacaagacca cggtgaacct gtacgtgaag gacgtcatcc 1020 atcccttacc gacagtagac gaggtcttca gcaagcttaa cggtggcaag agattcagca 1080 agctagacct gtccaagtgt tacaaccagt tcttgctgga tgaggaatct cgagaggtct 1140 gcgctatttc tacgacgaaa ggagtctaca aaatgaaccg gcttccattc ggcgtgcgcc 1200 ctgcttccgg aattgtgcag cgagtgttgg agcagttgct ttgcggcatt cccggcgtgc 1260 agaatttctt ggatgatgta ctggtcacgg gaagaaccga caaggagcat ctggagaact 1320 tgtcgagggt gtttgacgtg ctggaaaaag caggtttgaa gttgaaccgt aagaagtgtc 1380 agttttttaa aaaggaagtg acatacctgg gacacgttat caacgccaac ggactgtgca 1440 agactgacga gcgagtgaaa tcgatccgga tgaccaaaga accaaggaat gtccaagaag 1500 ttcgagcttt tgcaggactg gtcaaccatt attctcgttt cgtcaagaac atcgccgaga 1560 tgatgagccc catgtaccga ctgctgagga aaggaaccaa gttcgtgtgg tccaaggagt 1620 gccgggaagc attcacgagg gtaaaggagg cgatttgtga ggacgtcatg ttggcacact 1680 ttgacccggg tgccaagctt ctcctcgttt gtgatgcgtc catggaaggt gttggtgctg 1740 ttctgatgca aaaggttgga agtgagccgg aaagaccggt ggcgtttgca tcgcgagtgc 1800 tgcacgcagc ggagcggaac tacagcgttt tggatcgtga ggggttggcg attatgtttg 1860 gactgaacaa gtttttccat tacctggtgg ggaacatctt cacggtccgt accgaccata 1920 aaccgctgat ttcgatccta aacccacgaa aaggcatacc cgcgattgca gcaagccgca 1980 tgcagcgatg ggcgaacttc ctggggggtt tcagctatcg catcgagcac gtcagctccg 2040 agggaaacat tgcggactat ccctcacgag ccccgttcga atcttggacc ttgtggaaag 2100 aggatgacac gtacctcaac ttcataaaca ccagcagtac caaagtgctt gacgacgatg 2160 ttgtccgcgt cgaactggcc gatgatccgg agttggcgat gctgaaagat tgcctactta 2220 agggcaaggt gggtggtgac ttgcaacgtg gcccgtacgg aaaagtcttc aacgaactgt 2280 cgctggaaaa cggactaata atgcgaggag ttcgagtgct ggtgcccgga acgctgagga 2340 aagcagtatt ggagcaagcc caccgttcac atctcggagt tggaaagtgc aagacggtac 2400 tgcgaagctt tgtttggtgg cccggaatcg acaaagatct ggaggtacac attaagagct 2460 gccatgcctg cctggtgaac cggccgtctc cggagaaagc caaattgatt ccgtgggaac 2520 caccaaagtc tgtctggagc agagttcatc tcgatttcgc gggcccggtg aaaggttgga 2580 gtttcctgat cgtcgtggat gccttgtcta agtgggtgga agtgttcccg acgcagaagt 2640 gcgataccga attcgtcctg gaaaagctgg tagactgtat cgctcggttt ggcctgatgc 2700 acgagattgt tagtgataac ggcacacaat ttacttctgc gaggttcaaa aatttcctcg 2760 aagcgaacgg catccggcag gttctgacaa gtccagggca tccagctacg aacggccagg 2820 cggaaaactc tgtgaaaacg ttcaaggcct ctttgatgaa gtcgttcgca agtggatcca 2880 ccgacgtcaa ggaaatcgtg gccaatttct tgttgggata cagatctgcg gcacattgct 2940 caacgggatc ttcaccggcc caactgatgc ttgggagaca gccacgttcg acgttggatt 3000 tgctgaggaa caacaatcgt tgcgttgcga gcgaacaaga aacgaaggcc agagaagttg 3060 tcctcgcacg gcaatccaag caggttgaca actacaaggg caagcgagag gagcaattca 3120 atcttaacga acgagtcatg gtacgagact acacaaaccc taacaaggcc gcatggacac 3180 gggcagtcgt aactgggatt gtgggaaagc gcaactacgt agttaaactc agctctacgg 3240 gtcgagaact caagcggcac ttggaccaga tgattgcaga caccacggct tctacggcgg 3300 gaggtggtaa gagtggtcag aagactgctc ctttacgtca gcgggaacag cagcctgatc 3360 gagttacggt ttaccggacg atgcccaaga gagtggtgga gctgccgagt gccagcgacg 3420 tagctgaggt cgatccagtc ccggttgatg agtttcgacc ggaagtgcct acggacagtg 3480 atctaccaga gggtgaccag gaaacaccga cggaggaaaa cgctgcgcag gcggacagtg 3540 gccttccaga gggtgaccag tctcaaccgg agacgcccag tgtggaagtc gctacagaac 3600 cgaacagtga tccagcagaa ggagcaactt ccgaagatga cgacgatgac aacttcgagg 3660 atccacaaac agatgtaact aaaagtgacg atagcgatgc aggacttgag tttgcggggt 3720 ttgatagggg gaagtggaaa ttccgcgatt atttgggtag attactcaga aagtgaagct 3780 tagcttagtg cgggga 3796 // ID CR1-124_AAe repbase; DNA; INV; 5458 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-124_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5458 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1212-1212 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 301..1728 FT /product="CR1-124_AAe_1p" FT /translation="MTTVTCSACTQIIVAETDRVYCFGGCEQILHLRCSEL FT RPSDSSTLRNNIALKYMCFACRKKQICLNDLRSKYSELLERMDDVCVKVSK FT YESILNQLETNLLGKIEASLMSTVGSKIESLFALRSAMNDCPTYATVARGS FT DNRSVNTNAAAIVSSMPTSKKIKPGKSSLIANENVDDGWILRSGKRRAKST FT ANGNYATAKNTSDNEQLTTAKRSSNLPSTTSSAGKKFEQSVIIKPKENQQA FT DVTQRHIREKIDPIDFSVKGIRSKENGDVIVRCETSTHAQKLVSAAVDILS FT DDYEVSILKPLKPRLKVIGLSENLDASEFVSTLKKQNGLPDSSDVTLIHMR FT KIEKWKQFPFIAVLETDAQTFETLIQRQRVNIRWDRCQVTENVNVYRCFKC FT SRYGHKAATCVNPTCCPLCAGDHAVQECDATFEKCINCELKNKQRKLPYDE FT LLNINHSAWSSECPVYQKRLKAVRKMVDYSA" FT CDS 1732..5079 FT /product="CR1-124_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSVNYCAIYASIGKTNTSICPAGASIDSARNQMDLSP FT QMTRSTDVSSDTQVMDARSTNSDDCLVDNPACSGNFRERICHAEARQDAAL FT LNSRSVLQTDSSTAIFNGCTPDNLAVVPGNDGSICLAEEVGDTAPPESSSR FT CNSSSDNSVPQIIDQVATRHKLSFFYQNVRGLRTKIDDFFIAVSACQYDVI FT VLTETWLDEVILSPQLFGNSYTVFRNDRNQRNSSKSRGGGVLIAITSRISC FT CRESSPVNDSLEQIWVKLKLSSFNVSVGVIYLPPDRKTSVTCIKNHMESIE FT SVYSNLDAHDHAFMFGDYNQSSLLWKTTSTKSTVDIMRSTVSTACSSLLDG FT FCLNGLTQINHVFNRNGRLLDLVLVNEAAFGNCTLSEAIEPLTALDNDHPA FT LEVEAYLPDPIKFDEATPVPGLDFRKADFDGLNEALLRYDWSFLETVNSID FT DAIENFTRICNATMSNYVPLRRPPAKPSWGNFRLKHLKRKRSKALRKYCRT FT HCMLAKQALIRASNKYRLYNRLLYKRYTSRMQENLRKNPKLFWNFVKSKRN FT ETGLPTEMFLGDAFASSDLSKCNLFARHFQNAFNDTASTDSQVEIACKETP FT LNVFELAISRITVRQVASAINKMKYSTADSPDGIPSCVLKRCCDAISPVLA FT ILFNISLQQRKFPESWKLSIMFPVYKKGDKRNIENYRGITSLCASSKVFEI FT VVYDSIFTACKNYISIEQHGFYPRRSVSTNLVHFVSNCIRAMDSGLQTDVI FT YTDFKSAFDRVDHNILLKRLAILGISSDFVCWLKSYLMDRKLCVMIGSESS FT DTFSNISGVPQGSNLGPLLFSLFINDLSRLLPSGCKLFFADDVKIFMIIES FT FNDCEKLQQMVDKFSHWSSRNMLTLSINKCSVISFHHKQRPIVYDYAINNI FT QLQRVYQVRDLGVTLDSALSFRIHFEDIISRANRQLGFIFKICGEFTDPLC FT LRSLYCALVRSLLESNVVVWCPYQLTWINRIETIQRRFVRRALRNLPWRDS FT MNLPPYADRCRLLGIDTLVNRRFIQQAAFVAKVLTGELDSPEILSRLNIYA FT PQRILRQRDFLTITPSNTVYGQNDPISAMSTVFNEVYVYFDFNVPAATFSR FT RLQQLNRS" XX SQ Sequence 5458 BP; 1605 A; 1155 C; 1112 G; 1585 T; 1 other; ctatatgcta tatttaccat actgtgctga tgtgaaagtg atgttctcac gttaaatatt 60 cgttttttgt ctgcaaaatc cgtgatttgt gtgataagtc ttacattagc gagtagagaa 120 ttcaatgctt aacccacgca tggtgaacga ttgtacattg catttagtgt gtccgtttat 180 tttgtgaaaa agttatagca gtttctcgat ttcgatcaac atcggtagaa aaacaacagt 240 atcacaaaag ccagcttctg tttcgttgga ggggggacat gcgtgtccaa cgtcgctccc 300 atgacgacgg tcacatgcag tgcgtgcact cagatcatcg ttgccgaaac tgatagagtc 360 tattgtttcg gtgggtgcga gcaaattttg caccttcggt gttctgagct tcgtccgtct 420 gattctagca ctttgagaaa taacatcgct ctgaagtaca tgtgctttgc atgccgcaaa 480 aaacagatat gtttgaatga tctaagatcg aaatactctg agctgctgga gagaatggac 540 gatgtctgtg taaaagtatc caaatatgaa tcgattttga atcagctgga gacaaatctt 600 cttggtaaaa tcgaagcaag cctaatgtca accgtcggta gcaagatcga atctctgttc 660 gccttgcgta gcgcgatgaa tgattgccca acttacgcaa ctgtagcacg tggttctgac 720 aatcgttctg taaataccaa tgctgccgct atcgttagct cgatgccgac tagtaaaaaa 780 attaagcctg gcaaaagttc tcttattgcc aatgagaatg tggatgatgg ttggattctg 840 cgatctggga aacgtcgtgc aaaatctact gcgaatggaa attatgcaac tgcgaaaaac 900 acgtccgata atgaacagct tactactgca aaacggtcaa gtaacttgcc atcgacaacg 960 tctagcgctg gtaaaaagtt tgaacagtct gttataatca aaccgaagga aaatcaacaa 1020 gcagatgtaa cacaaaggca tattcgagaa aaaatcgacc ctatcgattt ttccgtgaaa 1080 ggcatcagat caaaagaaaa cggtgatgtt attgtgcgat gcgaaactag cacccacgcc 1140 caaaaacttg taagcgcagc tgttgatatt ttatcggatg actatgaagt ttccatactc 1200 aagcctctca aacccagatt aaaagtcatc ggcttgtccg agaatctaga tgcctcagaa 1260 tttgtatcca ctctgaagaa gcaaaatgga ttgccagatt catctgatgt aacattaatt 1320 cacatgcgca aaatcgaaaa atggaaacaa tttccattca ttgctgttct agagactgac 1380 gctcagacat tcgaaacact gattcaacga cagcgtgtga atattcgttg ggataggtgt 1440 caagttactg aaaatgtaaa cgtttatcga tgcttcaaat gctcaaggta tggtcacaaa 1500 gctgctactt gtgtgaatcc tacttgctgt cccctgtgcg ctggagacca cgctgttcag 1560 gaatgtgatg ccacttttga gaaatgtatc aactgcgaac tcaaaaacaa gcagaggaag 1620 ttgccatatg acgagctgtt aaatatcaat cattccgcct ggagttcaga atgtccagtt 1680 tatcagaagc gtttaaaggc tgtgagaaaa atggtggatt attctgctta gcaatcagtt 1740 aactattgtg ctatatatgc ttctattggt aaaactaaca ctagtatatg tcctgccgga 1800 gcttcaatag attctgcacg aaatcaaatg gatctttcac cacagatgac cagatctact 1860 gacgttagca gcgataccca ggtgatggat gcaaggtcaa ccaattctga tgattgtctt 1920 gtggacaatc ctgcttgctc aggtaatttt agagaacgta tatgtcatgc cgaagctaga 1980 caagatgctg ccctccttaa tagtcgttcg gttttacaga ctgactcttc tactgcgata 2040 ttcaatggat gcaccccaga taacttagct gttgttccag gtaacgatgg cagtatatgt 2100 cttgccgaag aagtaggtga tactgcaccc cctgaatcaa gctctcggtg taattccagc 2160 tccgacaact ccgtcccgca gatcattgac caagtagcaa cacggcacaa gctttcgttt 2220 ttctatcaaa acgttcgcgg actacgcacc aagattgacg acttcttcat tgcagtatcg 2280 gcatgccagt atgacgttat agtgctaaca gaaacttggc tcgatgaggt aatactgtct 2340 cctcaacttt ttggaaactc ctacacagtc ttcagaaatg acagaaacca gcgtaacagc 2400 agtaaatccc gtggtggcgg tgttctaatt gccattacat cgagaataag ttgctgtcgc 2460 gaatcttctc cagtaaacga ttctctagag cagatttggg tgaaactcaa attatccagt 2520 ttcaatgtta gcgttggagt catatacctt cctcccgatc gtaaaacgag cgtaacttgt 2580 atcaaaaatc acatggaatc tattgaatca gtttattcca atttggatgc acatgatcat 2640 gccttcatgt tcggcgatta taatcaatcc agtttattgt ggaaaacaac atcgaccaaa 2700 tcaactgtgg atattatgcg atccactgta tcaactgctt gcagtagtct tcttgatggg 2760 ttctgtttga atgggcttac tcaaatcaac cacgttttca atcggaatgg tcgtctcctt 2820 gatttagttc tggtgaatga agcagcattc ggaaattgca ctctttcaga agccattgag 2880 cctctcacag ctctcgacaa tgatcatcct gccttggaag ttgaagcata cttgcctgac 2940 cccatcaaat ttgacgaagc cacacctgtt cctggtcttg acttccgtaa agcagatttc 3000 gatggtttaa atgaagcttt gcttcgatat gattggagtt tcttggagac agttaatagt 3060 attgacgatg ctattgaaaa ctttacacgt atttgtaatg ctaccatgtc caactatgtt 3120 cctttacgta gacctcccgc taagccttca tggggtaatt ttcgtctgaa acacctcaag 3180 cgaaaaagat caaaagcact tcggaaatac tgcagaaccc attgcatgct tgccaagcag 3240 gcattaatca gagctagcaa caaatatcgg ttatacaatc ggcttctata caaacgttat 3300 acttcacgaa tgcaagaaaa tttacggaaa aacccaaagc tgttctggaa ttttgtgaaa 3360 tctaagcgca acgaaacagg cctacccacg gaaatgtttt tgggagatgc tttcgcttct 3420 tcagacctaa gtaaatgcaa tctatttgct agacattttc agaatgcatt caacgacact 3480 gcatctactg attcccaggt ggaaattgcc tgcaaagaga ctccgcttaa cgtctttgag 3540 cttgcgatca gtcgcataac ggtgcgacaa gttgcttccg caataaataa aatgaaatac 3600 tcaaccgcgg atagccctga tggtattccg tcctgcgtat tgaaaagatg ctgtgatgct 3660 ataagtcctg ttctggcaat tttgttcaac atctcgctac agcaacgcaa gtttccagaa 3720 agttggaaat tatccattat gtttcctgtg tataaaaaag gagacaagag aaacatagag 3780 aactatcgtg gaatcacatc tttatgtgcc agctctaagg tattcgagat agtagtctac 3840 gactcaatat ttactgcttg taaaaactat atatcgatcg aacagcatgg tttttaccca 3900 aggagatcag taagcacaaa tcttgttcac ttcgtttcca actgcatacg ggccatggac 3960 agtggcttac aaacggatgt gatatatacc gatttcaaat ctgcatttga ccgtgtagat 4020 cacaacattt tgttaaagcg cctcgcaata ttaggcatct cttcagactt tgtttgctgg 4080 ttgaaatcgt acctgatgga tcggaaactc tgcgtaatga ttggctctga atcttctgat 4140 actttctcaa acatatcagg cgttccacaa ggaagcaatc ttggtccact gctattctcg 4200 ctttttatta acgacctctc ccgcctatta ccatctggat gcaagttgtt ctttgccgac 4260 gacgtgaaaa tcttcatgat cattgaaagt ttcaacgact gcgaaaaact tcagcaaatg 4320 gtagacaaat tttcccattg gagttcgaga aacatgctta ccttaagtat caacaaatgc 4380 agcgtcatat cgtttcatca caagcaaaga cctattgtct atgattatgc aatcaataac 4440 attcagctac agcgagtcta tcaagtacga gatttaggcg ttacgctgga ttcagcactc 4500 agcttccgga ttcacttcga ggatatcatc tctagagcta acagacagtt aggcttcatc 4560 ttcaaaattt gtggcgaatt cacggaccca ctttgcctcc ggtcattata ttgtgcactc 4620 gtgagatctt tattagaatc aaatgttgtt gtatggtgcc cctatcaact gacgtggata 4680 aataggatag agacaattca aagacgattt gtaagacgtg ctctacgtaa tctaccgtgg 4740 cgtgattcga tgaacttgcc accatacgcg gaccgatgtc gccttcttgg cattgatacc 4800 cttgtcaaca gaagatttat ccaacaagcg gcttttgtgg caaaagtact caccggagag 4860 ttggattcgc ctgagattct gtcccgtttg aatatctacg cacctcaacg aattttgcgg 4920 caacgtgatt ttttaacaat tacacccagt aatacggttt atggccaaaa tgatccaata 4980 agtgccatgt ccaccgtttt caacgaagtt tacgtatatt ttgacttcaa cgtgcctgct 5040 gcaacatttt caagaaggct tcaacaactc aaccgatcat gatgtcagga cgaaaaatca 5100 ggcgagttcc gttcctaaag tttattttgg aaagaattaa gctacccatt tttcattttc 5160 agcagtgttt ccatcttgga cagccagaga gtgtgacgtc gttgttatcg tccatttgtt 5220 gaaaaactaa ctatttttaa gtaatgtgct ttgttttaat gttgactttt agcttgtaat 5280 tggtgcaaaa ttgtaaagca ttgaattgta aatgtaattt tgaaaagatg cggggttttt 5340 acgccctttg gagaatggca attacagcgc ttgtcaactc caacgggctt ttcctcacca 5400 ccttcattga gccaaaaggc asatgaagtg aaaagataaa taaacaaata aacaaata 5458 // ID BEL-647_AA-I repbase; DNA; INV; 6037 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-647_AA_; KW BEL-647_AA-LTR; Pao_Bel_Ele227; BEL-647_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6037 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5024-5593] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1874..4570,4574..5908) FT /product="BEL-647_AA-I_1p" FT /translation="MRNCQGAHHPLLHRVEESVQLQKAKSDCTVIFRMMPV FT TLYVGKRQYDTVAFLDEGSSATLVDDVVAKRLKAEGTLEPLIVTWTGNINR FT FENESRSVEMMMSAKGSREKFPLFNTRTVSELQLPKQNVRYAEVVQRYKHL FT AGVPTKDFPSGVPTILIGLDNLHLFAPLESRVGKPNEPIAVRSKMGWTIYG FT SEKRKPTVQTYLNLHSVAPVSNQELHDLMRHQYVLDETACSSFAVPEPAEE FT KRAREILETTTKRIGDRFETGLLWRQDERRFPDSYPMAVRRMKALDRKLER FT SPALKENVCRQIEEYQMKGYAHKATDAELTETPQAAVWYLPLNVVVNPRKP FT GKVRLVWDAAASVNGVSLNSELLKGPDMLVPLPRVICNFRERPVAFGGDIQ FT EMYHQIRIRTEDKQAQRFIFRGNSDDAPQVYVMDVATFGSTCSPSSAQFVK FT NLNAEQFSDQYPEATVAIVKRHYVDDYYDSVDTVEEAVQRANEVKYVHSRG FT GFHIRNWVSNSGEFLENLGERSSESSVHFSLDKSTEYERVLGIVWDTVEDV FT FCFASASKSEYAEVLSGDKRPTKRIVLSIVMAQFDPVGYLAPVTILGKMLI FT QDLWRTGCQWDEVVDEASFGKWKRWTDILADVRAFKLPRSYFGSARSEEVQ FT DVQLHIFADASETAYGCVAYFRAVVRGEVMCALVMSRAKVAPLKQLSIPRL FT ELLAAVLGARMSQTVCENHNFHITKVVFWIDAEVVLSWIRSDQRRYKQFVG FT FRIGEILSLTRLADWRWVPTKLNVADQLTKWGKNPEIDPNSAWVRGQQFLY FT ESEEEWPKKSLPPANTMEELRVHLLLHDVKVPAVLVDATRFSKWTVLVRTM FT ACVIRFVSNCRRKRERLPIETLRATRSQLKVLIPNAGASVRVPLKQEYEKA FT ERCLLRIAQSESFVDELKVLSRNKIRPASQWMAIEKSSPLYKLTPLVDEHG FT LIRMEGRVERAEFLPFDLRFPVILPDDHRITRLIVQHYHERSGHGYRQAVK FT NELRQLYYIPHVDAVVRKVSASCMWCKVHRCRPEAPRMAPLPVQRVTPNLR FT AFSYVGVDYLGPYDVTVGRRTEKRWIALFTCMVTRAVHLEVAHGLTTQSCL FT MAIHRFIGRRGWPIEFFSDNGTNLRGASKEVVEAAQGIRDDCADLLTNART FT KWTFNPPAAPHMGGVWERLVRSVKEALRALDDGRRLTDEILQTAIVEAEDI FT INSRPLTYVSQQSDEAEALTPNHFLRGVSPNQPCVALPPPHPAVALRDAFQ FT RSQQLASVMWERWIKEYVPLLNQRTKWFGEARPLRAGDLVYVVEGANRKCW FT VRGVVEEAIKSGDGRVRQAWVRTSTGGTSERQ" XX SQ Sequence 6037 BP; 1448 A; 1415 C; 2002 G; 1168 T; 4 other; gaaatcgttt gtgccgccga cttgatccga acaaaatcaa aaattttaaa tcggattctt 60 cgagatggct tcctcacctt ccaagaacga gctatgtttg ttctgctcga tgtcggagtc 120 aatgtctgag gacgttgact gggtccaatg tgggaagtgt aaacactggg cccatttttc 180 gtgcgctggt gttgaccaag ggattgtcaa tagtcattgg ctttgcccac agtgtgtccc 240 wagtagtgtt cagcagctga gagttcctga tgcgaccgac aagaagagga gtaagaagtc 300 cggcaagaaa agcgatggag ggtccgatca gggagttggt tccgacgttg atccagttga 360 gaagcagttg gaagaggaac aactagccaa ggagaaggcc ttcgcgaagc agatggcggc 420 gcggaagaag ttactcgctc ggcagaaggc gtggaaggaa gagcagctga agcaagaacg 480 ggagatgcgt gagctggaac tccaggttca acgcgagatg gaggagcagc agttgcagca 540 cgagcaagag atgctagatg ctcaactggc tgcggagagg gatttttgaa gaagcgggat 600 gcgatccgga agcagtttgc tagcagcgtt agtagggtca acgcgttgaa ggaccaagaa 660 ggcgctgttg gtggagcttc ggcagacaat ccgaagcaga aagtgtacga gtggttggac 720 gagcagacgg agattgccaa attgccgtcg gccgccgatt ttcggggagc gtaccccaaa 780 ggcgatccgc tgcgagcgga gtcgaagaaa gttgagaagc cgaagctgat gaagaaagtg 840 cgggaagtgg tcgagaccga gtcggaatca agcgaagaag acgaaagcag cgattcgtcg 900 gaggatcgag tgagaagtag ccgtcatagc aacaaagccg gcgacgggcc ggggcgaagc 960 gttggtcgaa gctcgagatg tagcgaagtt ggcgataggt cggagcgagc cgtcggccga 1020 attacgagag agcagctggc cgctcgaaaa gcggtgtcgc agcaccttcc gaagtttcgc 1080 ggagaagcgg aagtttggcc gctattcatc agcagcttcg aacacaccac agcggcgtgt 1140 ggattcacga atttggagaa tctgaagcgg ctgcaggact gtctgcaagg agacgcgctc 1200 gaggcagtca ggagccgatt agttctgccg attcagttcc ggatgttatt cgggaccttc 1260 gaagcctgtt cgggaagccc gagaagctgt tgaagacgct gctgacgaag gtgaggagcg 1320 ctccagcgcc gagagctgat cggctggaga cgttcataaa ttttggtatc acggtgaagc 1380 aactgtgcga ccatttggaa gccgcacgac tgaacgacca cctcaataac ccgatgctgg 1440 tgcaggagtt agtcgacaag ctgccgccaa gctacaagct ggactgggtc cggtacaagc 1500 ggggcagagt ggacagtccg ctgaggatgt tcacgaactt cgcgaccgac atagtgtccg 1560 acgtctccga agtggcggaa ttcaccacac tgtcgatgaa cgaccgggtg cggcctggga 1620 aggagaacaa tcggaagaag gagttcgtgc atgtgcacga gtccgagccg aagcgaagtg 1680 aagtcaatcg tagcgaggtg gccagccggc cttgctggat ttgcgggcga acggatcatc 1740 tgattcggaa ctgcgaagag ttcaggcgaa tgaacatcgc agagcggcta cgagaagtcg 1800 agaggcagaa actctgtgga gtctgcctca acaagcacaa tggaaatcgc tgtcctccaa 1860 aattcggtgt gtgatgcgga attgtcaggg agctcatcat cctctgctgc accgggtaga 1920 agaatcagtg cagctgcaga aagcgaaatc ggactgcacg gtgattttcc ggatgatgcc 1980 ggtgacgctc tacgtcggca agcgacagta cgataccgtc gcattcctgg acgaaggatc 2040 gtcggcgact ttggtggatg acgtggtcgc aaaacggctg aaggcagaag ggacactcga 2100 gccgttgatc gtcacgtgga ccggaaacat caaccggttt gagaacgagt ctcggagtgt 2160 cgaaatgatg atgtcggcga agggttcgag ggagaagttt ccgctgttca acacgcgaac 2220 ggtgtctgag ctgcagcttc cgaagcagaa tgttcggtac gcagaagtgg tgcagcgata 2280 caagcatctc gcaggggtgc cgacaaagga ctttccgtcc ggtgtgccaa cgattctcat 2340 cgggttggac aaccttcatt tgttcgcgcc gttggagtca cgtgtcggga aaccgaacga 2400 accaatcgct gtacgatcga agatgggctg gacgatctac ggttccgaga agcgaaagcc 2460 cacggtgcag acgtatttga atctgcactc cgtcgcaccg gtgagcaacc aggagctcca 2520 tgacctgatg cggcaccagt acgtgctgga cgaaacagcg tgttcgtcgt tcgccgttcc 2580 agaaccggcg gaggagaagc gggctcgaga aatccttgag acgacgacaa agcggatagg 2640 cgatcgtttt gaaacgggat tgctgtggcg ccaggatgag agaagatttc ccgacagcta 2700 cccaatggcc gttcgacgga tgaaagccct cgaccgtaaa ctggaacgca gtccagcgct 2760 gaaggagaac gtgtgtcggc aaattgaaga gtaccagatg aaggggtacg cacacaaagc 2820 aaccgacgcc gagttaacgg agacgccaca agcagctgtg tggtatctgc cgctgaatgt 2880 tgtggttaat ccacggaaac cgggcaaggt acgcctggtc tgggatgctg cagcgtcggt 2940 gaatggggtc tccctgaatt cggagttgct caagggtccg gatatgctag tccccctccc 3000 acgggtaatt tgcaatttcc gagagcgtcc ggtcgccttt ggtggggaca tacaggaaat 3060 gtaccaccaa atccgcatca gaacggagga taagcaggcg cagcggttca ttttccgggg 3120 caacagcgac gatgctccgc aggtgtacgt tatggacgta gcgactttcg gctccacgtg 3180 ctcgcccagt tcagcgcagt tcgtgaagaa cttgaacgcc gagcagtttt cggatcagta 3240 cccggaggct acggtagcga tcgtcaagcg gcattacgtc gacgactact acgacagcgt 3300 ggacacagtg gaagaagcgg ttcagcgagc gaacgaggtg aaatacgtcc actcccgcgg 3360 aggtttccat atcaggaact gggtgtccaa ttcaggcgag ttcctggaga atcttggaga 3420 gcggagtagt gagtcgtcgg tgcatttcag cctggacaag agtaccgagt acgagcgtgt 3480 gctgggaatc gtatgggata cggttgagga cgttttctgt ttcgccagcg catcgaagtc 3540 ggagtacgct gaagtcctca gcggtgataa acgtccgacg aagcgaatcg tgctcagcat 3600 cgtgatggcg cagtttgatc cggtggggta cctagcgcca gtcaccatcc taggaaagat 3660 gctgattcag gacctctggc ggacagggtg tcaatgggac gaagttgtgg atgaagcatc 3720 gttcggcaag tggaaacggt ggacggacat tctggcagac gttcgagctt tcaagctgcc 3780 acgcagctat ttcggcagtg cgcgatcgga agaggtccag gatgtgcagt tgcatatatt 3840 tgcggatgcg agcgagacgg cgtatggatg cgtggcgtac tttcgagccg tagtacgcgg 3900 tgaggtgatg tgtgcgctcg tgatgagccg agcgaaggtg gcaccgttga agcagctgtc 3960 gattccgcgc ctggaacttc tggcagccgt gctcggcgca cggatgtcac agacggtgtg 4020 cgaaaaccac aatttccaca ttacgaaggt ggtcttttgg atcgacgcag aggtggtgct 4080 gtcgtggatt cggtccgatc agcgccgcta caagcagttc gtcggttttc gtatcggaga 4140 gatcctgagc ctaacgaggc tggcagactg gcgatgggtg ccgacgaagt tgaacgtggc 4200 agaccagctg acgaagtggg gtaagaaccc ggaaatagac ccgaacagcg cgtgggtacg 4260 aggacagcag ttcctgtacg agagtgagga ggaatggccg aagaagagtt tgccaccggc 4320 gaacacaatg gaggagttgc gggttcacct actgctgcat gatgtgaagg ttccagcggt 4380 actagtggac gcgaccagat tctcgaagtg gacggtcctg gttcgaacga tggcgtgtgt 4440 gatccgattc gtttcgaatt gcaggcggaa aagggagagg ctgccgattg agacgctgcg 4500 agcaacgaga agtcagctga aggtgctgat accaaacgcg ggagcttcag ttcgcgtgcc 4560 gctgaagcaa gasgagtatg agaaggcgga gcgctgcttg ctgaggattg cgcagtcgga 4620 gagtttcgtc gacgaactga aggtgttgtc gaggaacaaa attcgaccgg caagccagtg 4680 gatggcgatc gagaagtcca gtccgctgta caagctgact ccgctggttg acgagcatgg 4740 cctgattcga atggaagggc gagtggagcg agcggagttt ctaccgttcg acttgcggtt 4800 tccggtgatt cttccggatg accaccggat cacgaggctg atagtccagc attaccacga 4860 gaggagcggc cacggatacc gccaggcggt gaagaacgag ctgcgacagc tgtactacat 4920 tccccatgta gacgcggtgg taaggaaggt gtcggcatcc tgcatgtggt gcaaggtgca 4980 tcgttgtcgt cctgaagctc cgaggatggc cccgcttcca gtgcagcgag taacgccaaa 5040 tctacgtgca ttcagctacg ttggcgtaga ttacctggga ccgtacgatg tgacagtagg 5100 acgtaggacg gaaaagaggt ggatcgcgct gtttacctgc atggtgacac gcgcggtaca 5160 tttggaggtg gcgcacgggc tgacaacgca gtcgtgtttg atggcgatac accggttcat 5220 cggccgtcgg ggctggccga tcgagttctt ctcggacaac gggacgaatc tgcgaggagc 5280 cagcaaggaa gtagtggagg ctgcgcaggg catcagagat gactgcgcag acctattaac 5340 gaacgcgaga acgaagtgga cgttcaaccc tccggccgca ccgcacatgg gaggcgtctg 5400 ggagcggttg gtccgctcag tgaaggaggc gcttcgggcg cttgacgacg gaagacggtt 5460 gaccgacgag attctgcaga cggccatcgt tgaagcggag gacatcatca actcccgtcc 5520 gctgacgtac gtgtcgcagc agtccgacga agcagaagca ctcactccga accattttct 5580 gaggggtgtt tcaccgaacc agccatgcgt ggcgctacca cctccccatc cagcggtggc 5640 cttgcgggac gcgttccagc gttcgcaaca attggctagc gtgatgtggg aacggtggat 5700 aaaggagtac gtgcckttgt tgaaccagag gacgaaatgg ttcggcgaag caagaccgct 5760 acgtgcagga gatctggtgt acgtagtcga gggtgccaac cggaagtgct gggtccgtgg 5820 agtagtggag gaagccatca agtctggcga cggaagggtc cgtcaagcgt gggtgcgtac 5880 cagcaccggg ggtacaagcg agcgacagtg aagctggcca gaatggaagt acaggatggt 5940 aaccctgagc cggatgtggc ttccgggaca gagttacggg ccggggaatg ttctggcagc 6000 actgccgccc cgatgagacg tgagagtgag cgccmgt 6037 // ID Crack-2_BF repbase; DNA; INV; 4834 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-2_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4834 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4834 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 807-807 (2009). XX DR [2] (Consensus) XX CC Its ORF1-encoded protein is similar to L1/Tx1 ORF1 proteins. XX FH Key Location/Qualifiers FT CDS 392..1165 FT /product="Crack-2_BF_1p" FT /translation="MPNRRNQKESADDDYVTMKEMKEMLEQIKQHYEDKID FT QNKQYYEDLLKRQESIFQNFTKMLMDSANSRVDSLVREVQDLKTSLQFSQN FT EIDSLKVEIKDAKKSSNVDDESAVAKKVDYLENQSRRNNIVIDGIEGDNSK FT ETWAETEGKVRDLLTKNLKLEAKDIEIERAHRNGVMNKDNPRPRQVVVRLL FT RYKDKQLILNQARSHLKNTPIYINEDFSEAVRKRRAELLPELKAARSRGEY FT AVLSYDKLVIRKKTKP*" FT CDS 1236..4364 FT /product="Crack-2_BF_2p" FT /translation="MSSRIFDTQGMQCLQMYEDDNLTFNPFLLNDDNESKS FT KYIDDWDPDVNFFNHAMNNSCQYYTENSFKNLCTELSHTSSTFSMLHLNIR FT SLPRNYDSFTHYLSTLQHDFSVIGLTETWLNDSTSHLYDLPNFASLHRCRS FT NKTGGGVSILIHQSFEFIERNDLCIQSEEDMVESLFIEILPASHITGKGII FT LGCIYRPPNTQIKNFISNLTTSLDMINKEGKTCYIFGDFNIDLLKHESHSL FT TTDFLNTMYSCTFYPIITKPSRITNRSASLIDNIMTNSTTNPHSAGLLVTD FT ISDHLPVFCITDYEISKNQGVTIGQRRCYRNFDLKNVNEFKALLSKENWEF FT VYNNTDANACYCNFVKIFNTHFEKCFPLKTISTKKNLESNNKGWFTKGLRK FT SSNVKNKLYRKYLKNPTPTTLQKYKTYRNKFNGLIRSSKRQYYQDRFTTVS FT NDIRNTWKLINELLNRKKASPTLPSKFMDGEDEITNQQNIVNKFNDFFVNI FT GPSLAKQIDKTDNSPLSYLQNEYPTISLFDAPSTAEILDIILNLKNAAPGK FT DEIHASLLKKVASEIVEPITYVFKTSIETGKVPSDLKVAKVIPLYKSGNSC FT LFTNYRPISILPVFSKILEKVVYKRILKHLDDNNILYTHQYGFRKNYSTYM FT PLIQLIEKITSALENKEFTIGIFLDLSKAFDTVDHNILLDKLNTYGFKGSV FT LKWLKDYLSERTQYVSNNSFVSSQKFVTCGVPQGSILGPLLFLIYVNDLPM FT ISDKFFALIFADDTNLFMSHKNFDTLISTINDELCKLNQWFKSNKLSLNID FT KTNFIIFTGRNKKYSKEMAKLHIDNKPIKQVSNARFLGVVIDEKLTWKFHI FT DIICKKIAKNIGIIRKVANCLSRKIFMTLYYSLIYPYLTYCNIIWASTYTT FT SLKPLHLLQKRFVRIASNASFIANSATFFHNLKILTVYDINKLQSAIFVHK FT VIHNNSFLPVLYKDIFKYNADIHNHSTRQQNSLRPIKTKSNLTQFTIMFRG FT PSIWNTLSLSLRNCQSLSIFKKRMKNNLIEKPTIYS*" XX SQ Sequence 4834 BP; 1653 A; 954 C; 786 G; 1441 T; 0 other; aatcagcaaa agagctaata ttattctgca cttcccctgc tatagatatc tgaacagtat 60 aaaggattgc tgtttgtata cttgcttgtt tgctggcgcc ctcccactct ccgtttcagt 120 tggcgggaaa gtggcctgaa cgagttagcg tctcctaccc tgtcaacaat acgccggctc 180 atcgcatcgc atcgtatcct gggcctgagc ttgagttaaa aactgcaaca agccgaaatc 240 cactcagcag tagtctacaa gaaaatatcg ggtctgctgg cagccgtaga ccgtcaacac 300 catctcgtaa tcactgtggc gacctggtgg cgcacgtatc aacaatcaag atttactcag 360 gtgggaacat tccttttcac attccacaaa catgccgaat agacgaaacc aaaaagagtc 420 agctgatgac gactacgtga ccatgaaaga aatgaaagaa atgctcgaac aaattaaaca 480 acactatgaa gacaaaatcg atcaaaataa acaatactat gaagacttat tgaaaagaca 540 ggaatcgata ttccaaaact tcactaaaat gcttatggac tcagcaaaca gtagggttga 600 ttcgctagtt cgggaagtcc aggatctgaa aacatccctc caattctcac aaaatgagat 660 agattccctc aaagttgaaa tcaaggacgc aaagaagagt tctaatgtgg atgatgaatc 720 agcagttgct aagaaagtcg actacttgga gaatcaaagc cgacgtaaca acatcgtcat 780 tgatgggatc gaaggagaca atagcaaaga gacttgggca gaaactgagg gaaaggtaag 840 ggacctactc acaaagaacc tgaagttaga agcaaaggat atcgagatcg aaagggcaca 900 tcggaacggc gttatgaaca aggataaccc acggccaaga caagttgttg tcagacttct 960 tcggtacaaa gacaaacaac tcatcctaaa ccaggcaaga tctcacctca agaacacacc 1020 aatctacatc aatgaagact tttccgaagc tgtaaggaaa cgacgagcag aacttctgcc 1080 ggagttgaaa gctgctagaa gtcgtggaga gtacgctgtt ctgagctatg acaaactggt 1140 catccggaaa aaaaccaaac cctaacaatt tagtagcttc atgattttgt tgattgtgtg 1200 cttattattg tcaaatatga aggttgtgta cgtctatgtc tagtcgtatt ttcgatacac 1260 aaggaatgca atgtttacaa atgtatgaag atgataattt gacgtttaat ccttttttgt 1320 tgaacgatga taatgaaagt aaatcaaaat acatagatga ctgggacccc gatgttaatt 1380 tttttaatca cgctatgaac aattcatgcc aatattatac cgaaaactca tttaaaaact 1440 tgtgtacgga gctgtctcat acgtcttcaa ctttttcaat gttacatctt aatatcagga 1500 gccttccacg gaattatgat agctttacgc attacttgtc aaccttgcag catgattttt 1560 cggtcatcgg cctaactgaa acgtggctaa atgattccac gtcacacctc tatgacttac 1620 cgaattttgc ttctttacac cgttgcagat ctaacaaaac aggtggagga gtatccatct 1680 taatacatca aagtttcgag ttcattgaac gaaatgacct ttgcatccag tcagaagaag 1740 acatggttga atctctgttt atagaaatct tacctgcctc tcatataacg ggaaaaggaa 1800 ttatattagg ttgcatatac agaccaccaa atactcaaat taaaaatttt attagcaatt 1860 taacaacatc gttagatatg atcaacaaag aaggcaaaac ttgttatatt ttcggtgact 1920 ttaatataga ccttctcaaa catgaaagcc actctttgac tacagatttt ctgaacacaa 1980 tgtactcgtg cacgttctac cctataataa cgaagccttc cagaataacg aatagatcag 2040 cttcattgat agacaacatt atgacaaact ctacgacaaa tcctcattca gcaggtcttc 2100 ttgttacaga catatcagac cacttaccgg tattttgtat aacagattat gaaatcagta 2160 aaaatcaagg agtaacaata ggtcaacgac gttgctacag aaatttcgat ctaaaaaatg 2220 tcaatgaatt taaagccctc ttatctaaag aaaattggga atttgtctat aataacactg 2280 acgcaaacgc ttgttactgt aattttgtaa agattttcaa tactcatttt gaaaaatgct 2340 tccctttaaa aactatctca acaaaaaaga acctggagag taacaacaaa ggctggttta 2400 caaagggact acgcaagtcc tctaatgtga aaaacaagct ctacaggaag tatttaaaga 2460 accccacacc aacaacttta caaaaataca aaacgtatag aaataaattc aatggtttga 2520 ttcgctcgtc taagaggcaa tattatcagg ataggtttac aactgtgtca aacgatatac 2580 gtaatacttg gaaacttata aatgaactac ttaacagaaa aaaggcaagt cctaccctcc 2640 cttctaagtt tatggacgga gaagatgaga taacaaatca gcagaacata gtcaacaaat 2700 ttaatgattt ttttgtgaat attggccctt ctttggcaaa acaaattgac aaaactgaca 2760 attctccctt atcctacctg caaaatgaat atccaaccat cagcttgttc gatgccccca 2820 gcacagcgga aattcttgac ataattctta atttaaaaaa tgcagcacct ggtaaagatg 2880 aaatccatgc ctcacttttg aaaaaggttg catcagaaat tgtagaaccc attacttatg 2940 tctttaaaac ttccattgaa accggtaaag taccctctga cttaaaagta gccaaagtta 3000 ttcctcttta caaatctgga aattcctgtc tgtttacaaa ttatcgaccc atatctatat 3060 tacccgtatt ttcaaaaatc ttagaaaaag tagtgtacaa aagaatctta aagcacctag 3120 atgacaataa tattctatat acacatcaat atggcttccg gaaaaactat tctacttata 3180 tgccccttat acaactcata gaaaaaatta catcagccct ggaaaataaa gaattcacca 3240 ttggaatatt ccttgaccta tccaaagcgt tcgacacagt tgatcacaac attctacttg 3300 acaaactcaa cacatatggt ttcaaaggct ctgtacttaa atggttaaaa gattatctgt 3360 cagagagaac gcagtatgtt tctaataaca gttttgtctc gtcccagaaa tttgttacat 3420 gtggtgtccc gcaaggctcc atactaggac ctcttttatt tttgatttat gtaaacgacc 3480 ttcccatgat atcagacaaa tttttcgcct taatatttgc agatgacacc aatcttttta 3540 tgtcacataa aaattttgat acactcattt caacaattaa cgatgaactt tgcaaattaa 3600 atcagtggtt taagtcaaat aagttatccc taaatataga caaaacaaat ttcatcattt 3660 tcacaggaag aaataagaaa tacagtaaag aaatggcaaa gttacacata gataataaac 3720 ccattaaaca ggtaagcaac gcacggtttt taggagtcgt tatagatgaa aaactcacgt 3780 ggaagtttca catagacatc atttgtaaga aaattgctaa gaatattggt atcataagaa 3840 aagtcgcaaa ctgcctatct cgtaaaatat ttatgactct ttactacagc ttgatatacc 3900 cctacttaac atactgcaat ataatctggg ctagtactta cactacctca ttaaaaccac 3960 tgcacctact acaaaaacgt tttgttagaa ttgcatcaaa cgcctctttt atagcaaatt 4020 cagcaacttt ttttcacaat ctgaagattt tgacagtata cgacataaac aaattgcagt 4080 cagccatttt cgtccataaa gttattcaca ataattcttt cttaccagtc ctatataaag 4140 acatttttaa gtacaacgca gatattcaca atcacagtac aaggcagcaa aacagtctta 4200 gacctatcaa aacaaaatcg aacctaacac aatttactat aatgtttaga ggcccatcca 4260 tatggaacac tctaagtctt tccctacgaa attgtcagtc cctttccatt ttcaagaaac 4320 gtatgaaaaa caaccttata gagaagccaa ctatttactc ctgatttgtt tttttccttg 4380 tttttgatca tgttatgctc aattatcatt tagtattata ccgtgtattt ggtgctcctt 4440 taacctgttt gtttgcagtc tttgcaccca ttgtcttggt tatctgtatt tgctttcatt 4500 ttgttttgct agcatgatta attgtatgat tttgactaat ttctttatga tctgttgatc 4560 atgtttagtt ttgtctaatt tcagtttttg atgttaagat taattgtaac gttgtgtatg 4620 atcactgagt tatattattt tcacttggtt tattattgtt cgatgtataa tcttcatgac 4680 catttaattg tatcatgatt tgtatatttt gacgggggga gagtttgtat aagccacatg 4740 gcttttttct ccccccttgc acgttctgtt gtattctata attttctgta tgtatcattt 4800 tttaaagtgc aaaataaaat caatcaatca atca 4834 // ID BEL-6_AA-I repbase; DNA; INV; 5687 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_AA_; KW BEL-6_AA-LTR; BEL-6_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5687 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 861-861 (2011). XX DR [2] (Consensus) XX CC Positions [4617-5195] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 663..1697 FT /product="BEL-6_AA-I_1p" FT /translation="MQGPAIAAANVHRGQALMPIHHQQFPMVQQPVAVRLP FT TLELPTFSGDYLGWPAFRDAFEALIDRNTQLSDVQKLLYLKSSLKEEAACM FT LDAMDITDANYRVAWDLLIERFENRRVIKQKHLKALFTTKQVSQDSPKELR FT RLLTECQRNINALKQLGEPTDQWSTILVYLISSKLDSTSRRDWETQTQQEE FT SPKYEDIVKFINGRCHTLEALATDKEELRKSSRFKPQLSHASTSSTGCKIC FT ESGQNHPLWNCKKFIAMSPEARFDEVRKWKLCYNCFSSKHVVSKCESRGCK FT SCSEKHNTLLHGQQHHQQEQRPSSHVSSSSESVPQDQRIVNASMASQANHT FT MQ" FT CDS 3759..5549 FT /product="BEL-6_AA-I_2p" FT /translation="MITLQWITAPSNRWKEFVANRVERIQNSTIGCHWRYI FT SGKDNPADIVSRGLAAEELMDSTLWWNGPAWLVNRSLWPEQPELRNEDIPE FT ARKVCAVVITEPIFFEKYSSLTKMKGVVGYVLRFIHNSKPSNKETRRTGTL FT SASELNNALKRIIRCVQQTSFENEYTELQRNNSVKDKSKLIALNPFIDSDG FT LLRVGGRLKNAELPYSSQHPVILPAHNFLAKRIALQLHRENCHIGPSALLN FT EIRSQYWPIQARNLVKKVCSSCTVCARFKARTAKQIMGDLPASRVQPARPF FT IKTGVDFAGPITIRSSLLRKAPKLKAYIAVFVCMSTKAIHLELVGDLTTQS FT FLAALRRFVGRRGMVAEFFSDNATNFVGANSELRELKRMFESNGEHFSKKL FT AEAGIIWHFIPPRSPNFGGLWEAGVKSVKNLLKKTCGAMVFTYEELNTLLV FT QIESILNSRPLAPVSDDIEDLMPLTPAHFLIGDRLTSNVEPNLTSLSMNRL FT TRWQTIQQLKQKFWKRWSNEYLHHLQQRTKWKSSSADLSTGMLVLLKEDNL FT PPEGWALARIIKVHPGADQRIRVVTVKTSTGEYTRSVSKICPLPDPDSDC" XX SQ Sequence 5687 BP; 1607 A; 1285 C; 1412 G; 1373 T; 10 other; tttggtcctt cgaaccggat cgcgcggatc tagtaagtca gaaggcaagg caagtttcgc 60 gtcagtcgcg ttcggtcgta gtgtaagtag tcgcgtcgcg agtcgtttta gtgtgcgcga 120 aagtgtgata gtcgcgggac cgtcgtgaca gtgtccgtcg cagtcagcgg aactgtgggt 180 ctcgaccagt cgactagttc gtcgttcgtc ggtttacttg gtagcaagac agctacagtg 240 cccgatttta gtgcttcggc gcgcgtcagt cgtcggttgt taatccagtc agacacaagc 300 agctgaggat ttcaacggcg tcaccggccg gcggtacatt tgcatttagg aacagtttgg 360 aacgacgtat cgcaacagct ttcggcatga cggagcttaa acgacagcgt ggagtcattc 420 tcggacgcct gacacggatc gaagtgttca tccgagacat cgaatccaag cccggaatca 480 cagcagagtt ggttcaggca cgactggatg tcgtcaatca gtgctgggat gagtacgaca 540 ccgtacaaac tgccatcgac gcggaggaag gagtaaacgt cgaagaggaa gaagaaaagc 600 gtgccacctt tgaggaaagg tgcatgaacg caagagctgc attgckttcc atcatcgaga 660 ggatgcaggg tccagcaata gcggctgcaa acgtgcacag aggccaggcc ttgatgccta 720 ttcaccacca gcaattcccc atggtgcaac aaccagtagc tgtacggttg cctacattgg 780 aattgccaac gttcagcggt gattatctcg gttggccagc gtttcgcgat gcttttgagg 840 cgcttatcga cagaaacacg caactgtctg acgttcagaa gttgttatac ttaaagtcga 900 gcctcaaaga ggaagcagcg tgcatgttgg atgcgatgga catcaccgat gccaattacc 960 gagtggcctg ggatttgctg attgagaggt tcgagaatcg ccgggtaatc aagcagaaac 1020 atctgaaggc attgttcacc acgaagcaag tgtcacaaga ttcgccgaag gagttgcgtc 1080 gcttgttgac tgagtgccag cgaaacatca atgcactgaa gcaactwggt gagccaaccg 1140 atcaatggag cacgattttg gtgtacctga tctccagcaa actcgatagc acatcacgtc 1200 gtgattggga aacccaaacc caacaagagg agtcaccaaa atacgaagac atcgtcaaat 1260 tcatcaacgg tagatgtcac acactagagg cgttggcaac ggacaaagaa gagcttcgca 1320 aatccagtcg tttcaagcca cagctatctc atgcttctac gagttctact ggctgtaaaa 1380 tctgtgagag tggtcaaaac catccactgt ggaattgcaa gaagttcata gccatgagtc 1440 cagaagctag attcgacgag gtgagaaagt ggaagctgtg ctacaactgt ttctcgtcca 1500 agcatgtagt aagtaaatgc gaatccagag gttgtaagtc ctgcagcgaa aagcacaaca 1560 ccttactaca tggacagcaa catcatcagc aggaacaacg gccatctagc cacgtgtcat 1620 cgtcgtcaga atctgtccct caggatcagc gcatcgttaa tgcgagtatg gcgagtcagg 1680 caaatcacac catgcaakca tcgagtgata ttttggctac tgctatggtc tgcatctacg 1740 atcgagctgg gagcaaatta ccgtgtcgcg tgctgttgga ttccggatca caaccgaatt 1800 tcataacgca agcttttgct cggaaacttc ggttgaagga atgcaataca tcgatcaccg 1860 tccacggagt aagtggagta gaagctcaag ttggaaggca ggtgacggct actatcgagt 1920 ctcgtgtgaa cggaaagcag tttaacgttc cactgctggt gatgaacaag atcacgaatg 1980 tgttaccatc tgacaacatt caatgcgata gtttttccac tagaagagaa gagttggcag 2040 atccgacatt cagcagacca ggaagaatcg atgttctact gggagccaag cttttcttca 2100 agcttttact tccggaaaag attgaagttg gtgacttcat tctgtggaac actgagctag 2160 gctgggtagt ttccggaagc tgcaagcagt attcgcggga tcacgtgcaa tcgaatgtgg 2220 ttacttcgat gactgatgct gatctgctgc aacaattgga aaagttctgg aaagtagagg 2280 agctgcaagt tggagtaaat ggtgaggcca aattggatcc agcggaaaaa cactatctcg 2340 aacacgtcag acgcttatca gacggaaggt acgaggtgaa gatcccattg tctcagcctc 2400 cttcawtgct tggtgattct aaggaacaag ccctacgaag attccatgct ttagagcgca 2460 ggctacatgg aaacgaccaa cttcgagaaa tgtatacaga tttcatgaga gagtacgaat 2520 cttctggaca catgagtgtt gttcctgacg gcttggaaga ggtgggtttg ttagtaaatt 2580 acctccctca tcatgcagtc cttaagccaa cgagtactac aactaagcta aggactgttt 2640 tcgacgcatc cagcaaaaca tcatcaggtt tttcactcaa cgatattatg atgactgggc 2700 cgaacatcca agcagatcag ttctcgcttt tgttgggatt cagatgttgg agatatgttg 2760 tcacagctga cattgccaag atgtatcggc agattcgagt agaccagcaa tcatcgaatt 2820 tacagcgaat actgtggaga gaaacaccat cacaaccact gaaggtgtat cagctgaaca 2880 cagtgacata cggtacacgc gctgctccat ttcttgctgt tcgtacgctg caccagctag 2940 ccgcagacga gaaagacacc tatccgaatg cctgtcgttc tcwttgcaaa cgattctacg 3000 tggacgactt gttasatgga tcaaacagca aacaggcttt actgaaaacg agagatgagt 3060 tgctcgcagt gctgcataag ggcggttttg agctgaggaa gtggagtagt aatgatccag 3120 aattgctgaa ggatttaccg gaaggcttgg tggagaaagg cacaatgtgt gagattgacg 3180 acggcgaaat cgtcaagtct ctgggattga gctgggacac caggaatgat cagttgacat 3240 accgtatcca tctaaaacag ttccaggttt actctcgacg taccatcctt tctgcgattg 3300 catctctgta tgaccccgtg ggattgatag caccgatcat cgtakcagct aaaactttga 3360 tgcagcagct ttggctctta aatgtctcct gggatgaccc actcccagaa catatcgtgc 3420 agaagtggca acaattctac gatagtctgc ggcatcttca aaacctgcga atcgaccgat 3480 ggacgtttgg atgtctacag ccaatcgccg tagaaataca tgggttctca gacgcttcca 3540 tggtcgcata tggagcctgc atttatgtca aaaccgtcga cgtttctgga aatgcatacg 3600 ttcgtctgct gtgtgccaaa tcacgcgtag caccattgaa gaaggttaca ctgccacggk 3660 tggaattatg cgcagcagtt ttattgtctg aactcttgac aaaggttcta gattcwgtgg 3720 aattgagcgt cgacagctgt tttttgtgga cggactccat gataacgctt caatggatca 3780 ccgcaccttc aaaccgctgg aaagagtttg tggcaaatag agtcgagagg attcagaact 3840 ccactattgg atgtcactgg agatacatca gtggaaaaga caacccagcc gacatagtat 3900 cacgcggact tgcagcagaa gaacttatgg attcaacgct ctggtggaat ggcccagcat 3960 ggcttgtgaa ccgatcactt tggcctgaac agcccgagct tcgaaatgag gatattccag 4020 aggcgcgtaa ggtttgtgct gttgttataa ccgaaccgat attcttcgag aagtactcgt 4080 ccttgacgaa aatgaagggc gtcgtcgggt atgtattgcg tttcattcat aactcgaaac 4140 catcgaacaa ggagacacga cgaactggaa cgctttctgc ttckgagttg aacaacgctt 4200 tgaaaagaat aattcgttgt gtacaacaga cctcctttga gaatgagtat actgaacttc 4260 aaaggaacaa cagcgtaaag gataaaagca agttaatcgc tttaaatcca ttcatcgatt 4320 ctgacggact tcttcgagtt ggtggaagac tgaagaatgc tgaattgcca tacagttcgc 4380 agcacccagt tatactccct gcacataact ttttggcaaa acgcatagca ttgcagttac 4440 atcgggaaaa ttgccatata ggaccatctg ctttactgaa cgaaatacga tcgcagtact 4500 ggcctataca agctagaaat ctcgtcaaaa aggtatgttc aagctgtact gtgtgtgctc 4560 gctttaaggc caggacagcg aaacaaatca tgggagattt accagccagc cgagtgcagc 4620 cagcaagacc attcatcaaa accggagttg atttcgctgg acccatcacg atccgttcca 4680 gtttgttgcg caaggcaccg aaacttaagg cgtacatagc tgtgtttgta tgtatgtcca 4740 cgaaggccat tcatcttgaa cttgttggtg acttgactac gcaatcgttt ctggcagcac 4800 ttcgtcgatt tgtgggacgc cgtggaatgg tggcagaatt cttcagtgat aatgctacaa 4860 atttcgtcgg agcaaattct gagctgagag agttgaaacg aatgttcgag tcgaacggtg 4920 aacatttttc caagaaacta gccgaagcag gaattatttg gcacttcata ccccctcggt 4980 caccaaattt tgggggtctt tgggaagcag gagtcaaatc ggtcaaaaat ttgctcaaga 5040 agacatgtgg agcaatggtg tttacctatg aggaactcaa cacacttcta gtacaaattg 5100 aatcaatcct aaattcaaga ccattagctc ctgtctctga cgacattgag gacttaatgc 5160 cactaacccc agcgcatttt ttgatcggag accggcttac atccaatgta gaacccaatt 5220 tgacttcatt atcaatgaac cgtttgacac gatggcaaac aattcagcag ctcaagcaaa 5280 aattttggaa gcggtggtca aatgaatacc ttcatcatct tcagcaacga acaaaatgga 5340 aatcttcatc ggcagattta agcactggaa tgcttgtgct cttaaaggag gacaatttac 5400 caccggaagg atgggcttta gcgcgcatca tcaaggtaca cccaggagct gatcaacgca 5460 taagagttgt cacagttaaa acatcaactg gagaatatac acgatcagtt tcgaagatat 5520 gccccttgcc cgatccagac tcagattgtt gaaggcgagc cttcaaggcc ggcggtatgt 5580 ttagaaagaa gaagcagttt gaacggcatt cccctagaaa tcaccatcac cccacataac 5640 acacggatag actgcgcaaa gggtataaag tccccctcac acacggt 5687 // ID R1_DYa repbase; DNA; INV; 5513 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE yakuba. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5513 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 360..1904 FT /product="R1_DYa_1p" FT /translation="VELVHQSNQLVTVSAQFPELPEGGATIVVSPMESDSS FT ASAASAASASSASRKPGRGRRRSHQASKSSAATQAKLVAVASRGVPEPVGA FT LDEAPSSLEDPRMVPAATTAATTAATAAATAAATAAAPAAAPAAAPAAAPA FT AAPAAAPAAAPAAAPAAAPVAAPVATPAVATATVASAAARAGQAAMMAELS FT ATQRMVRSSFRSLGGVDTDELSFAISRYDELVMALMLRCGELETRLAMPPP FT PPPSMPTSASSRFPQMPQAAPIAAPRTTKVRETWSAVVKCDDPALSGKAIA FT EKVRTMVAPSLGVRVHEVRELRRGGGAVIRTPSVGELQKLVSSKKFAEVGL FT NVARNAAEKPKVVVYDVDTAIGPEEFMQELHENNFDGEMTASEFKRSVHLV FT TKAWSVTDGATVNVTLEVSDKAMAKLDVGRVYIKWFSFRCRSQVRTYACHR FT CVGFDHKVSECRQKTNVCRQCGQQGHTAAKCQNPVDCRNCRHRGQPSGHYM FT LSDVCPIYGAVLARVQARH" FT CDS 1901..4963 FT /product="R1_DYa_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TLMFSLIQANCGRGRAATIELGVRLRRSGSMFALVQE FT PYLSGEGMDVLPEGMRIFTDRRGKAAILVDLQDAICMPVETLTTDYGVCLV FT VKGSFGSIFLCSAYCQFDAPLEPYLRYMDAVLLQASRTPAILGLDANAVSP FT MWFSKLSRHAEGQANYSRGELLSEWMLEARAAALNQPTQVYTFDNYRARSD FT IDVTIVNEAASMWATYEWRVDEWELSDHNIITVVTEPSTTSAVESIAPVPS FT WNFSNARWRLFKEEMVSRTAELPENFSESPLDQQVSTLRSIVHNVCDIALG FT RKSIRPPSRRARWWTADLCAARREVRRLRRRLQDARRHDDDAAAELLVVGL FT RRASANYKKLIGRAKMXDWKRFVGENADDPWGRVYKICRGRRKCTEIGCLR FT VNGELITDWGDCARVLLRNFFPVAESDAPIAIVEEVPPALESHEVEACIAR FT LKSKRSPGLDGINGAICKAVWRAIPQHLASLYSRCIQSGYFPNEWKCPRVV FT ALLKGPEKDKCEPSSYRGICLLPVFGKVLEAIMVNRAREVFPEGCRWQFGF FT RQGRCVEDAWRHVKSSVDASPAQYVLGTFVDFKGAFDNVEWSAALRRLADL FT GCREMSLWQSFFSGRRAVIRSSSGAVDVPVTRGCPQGSISGPFIWDTLMDV FT LLQRLEPYCLLSAYADDLLLLVEGNSRVVLEEKGAQLMSIVETWGAEVGVA FT VSTNKTVIMLLKGTLRRAPTVRFAGANLPYVRTCRYLGITVSEGMKFLTHI FT ASLRQRMTGVVGALARVLRADWGFSPRARRTIYDGLMAPCVLFGAPVWYDT FT AGQVAAKRRLASCQRLILLGCLSACRTVSTVALQVLGGAPPLDLAAKQLAV FT NYKLKRGYPLEENDWLYGEDIACLSWEQRKTRLEECMLRSWQNRWDDDSEP FT GRVTHRFIPDVTLVYRDPNFGFSMKASFLLTGHGSFNAFLHGRALSDTTAC FT ACGDPYEDWMHVLCACPLYADLRDLDGLGVQRVGENWMFEGILEDRERTQR FT LAAFAEEAFRRRRGL" XX SQ Sequence 5513 BP; 1167 A; 1442 C; 1703 G; 1200 T; 1 other; cggacgtgtt ttcggtacgc tcgtgttcag attgcgaaaa gctaagtttt ctttgtaaaa 60 caaagttcgg aaagtgaatt aatcggtggt tgagtgctcc gcggaaatag tgacattatt 120 tacgcgaacg caaagcctgt gatttcggaa caattcgcaa taatttattg caaaattttc 180 cactgagaac agcacgtggt tccaggcgag ctttcggaca gctgttaaac agctgggcct 240 gtcggtttgt gtccgcgcta ccacaatcag ctgttggttg ttgtcgctgc tacggtctgt 300 gtcttgggac ggttcggtgc tgccagactg gtactttggt tacgcaaatc tgccaatagg 360 tggagctcgt gcaccaaagc aaccaacttg tgacagtgag cgcgcaattt ccggagttgc 420 cagaaggagg agccacaatc gtggtaagtc ctatggagag cgacagcagc gcgagtgccg 480 ctagtgccgc gagcgccagc agcgcgtcgc ggaagccggg acgaggcagg cgcagaagcc 540 accaggcttc taagagctcg gcagctacgc aggccaagct cgttgctgtg gcctcgcgtg 600 gtgttcccga gcctgtggga gcgctggatg aggcgccttc gtcgctggag gacccccgga 660 tggtccctgc tgccactact gctgccacta ctgctgccac tgctgctgcc actgctgctg 720 ccactgctgc tgcccctgct gctgcccctg ctgctgcccc tgctgctgcc cctgctgctg 780 cccctgctgc tgcccctgct gctgcccctg ctgctgcccc tgctgctgct cctgttgctg 840 cccctgtcgc tacacctgct gttgccactg ccacagttgc ctccgctgcc gcccgtgctg 900 ggcaagcagc tatgatggca gagctgtcgg ccacgcagcg aatggtgaga agcagttttc 960 gcagcctagg aggagttgac actgatgagc tctcgtttgc tatcagccgc tacgatgagc 1020 tggtgatggc attgatgctc cgatgtggag aattggagac gcggttggct atgccgcccc 1080 caccgccgcc gtctatgccg acatccgcca gttcgcgttt tccccagatg cctcaggctg 1140 cacccattgc ggccccgcgg accacgaagg ttcgcgagac gtggtcagca gtggtgaagt 1200 gcgatgaccc agcgctatcg gggaaggcga tagccgaaaa ggtgcgcacg atggtcgcac 1260 cctccctcgg agtcagggtg catgaggtgc gtgagctccg ccgaggtggt ggagcagtta 1320 tccggactcc ttcagttgga gagctgcaga agcttgtgag ctctaaaaaa ttcgccgaag 1380 tagggctgaa tgtggcgcga aacgcggccg agaagccgaa agtcgtcgtc tacgacgtcg 1440 acacagcaat cggccccgag gagttcatgc aggagctcca cgagaataac ttcgacggcg 1500 agatgacggc ctcggagttc aagaggtcgg tgcacctggt caccaaggcg tggtcggtga 1560 ctgacggcgc caccgttaac gtgacgctag aggtcagcga caaggcgatg gcgaaactgg 1620 atgtaggccg tgtctacatc aagtggtttt ctttccgatg ccgatcgcag gtccggacat 1680 atgcctgcca caggtgtgtg ggtttcgacc acaaggtcag cgaatgccgc cagaagacga 1740 atgtatgccg ccagtgcggg caacaaggcc acaccgcggc taagtgccag aacccggtgg 1800 actgccggaa ctgccgtcac agagggcaac cctcggggca ttacatgctc tcggacgtat 1860 gcccgatata cggcgcggtg ctagcaaggg tgcaagctag acattaatgt ttagcctcat 1920 ccaagcgaac tgtggccgag gccgagctgc gaccatcgag ctcggagtcc gactcaggag 1980 atcgggctca atgttcgcac tggtgcagga gccgtatctc agcggcgagg ggatggatgt 2040 gctgcctgaa ggaatgagga tcttcaccga ccggcgaggg aaggcagcca tcctagtgga 2100 tcttcaggat gccatctgta tgccagtgga gaccctcacc acggattatg gcgtatgtct 2160 ggtcgttaaa gggagttttg gctcaatctt cctttgctca gcatactgcc aatttgatgc 2220 acctttggaa ccgtaccttc ggtacatgga tgcggtcctg ctgcaggcca gcagaacccc 2280 cgcaatcctg ggcctcgacg cgaatgcagt gtcccccatg tggtttagca aactctctcg 2340 acatgccgag gggcaagcta actacagtcg gggtgagctg ctgtccgagt ggatgctgga 2400 ggcaagagcc gccgccctaa accagccaac acaggtgtac acgttcgata actacagagc 2460 tcgtagtgat atcgacgtga caatcgtcaa cgaggcagca tctatgtggg ccacatatga 2520 gtggagagtg gatgagtggg aattgagcga tcacaacatc attactgttg tgaccgaacc 2580 aagtaccacg agcgcagttg agagcatagc tcctgtgccg tcctggaact tctccaatgc 2640 tcgttggcga ttgttcaaag aggaaatggt gagtagaaca gccgaacttc cggaaaactt 2700 ctcagagtcg ccgttggacc agcaagtttc gaccctgcgc agtatagtac ataatgtgtg 2760 cgatattgcg ctgggaagaa agtcgattcg accgcccagc agaagagcac gttggtggac 2820 tgccgacctc tgcgctgcga gacgcgaagt ccggagactt cgtcgccgac tccaggatgc 2880 acggcgtcac gatgatgatg ccgcggcaga gctcttggtg gtcgggctga ggcgtgcctc 2940 agccaactac aagaagctca ttgggagggc taagatgrat gactggaaac gcttcgtggg 3000 agaaaacgcc gatgacccat gggggcgcgt ctacaagata tgccgaggcc gcagaaagtg 3060 cacggagatt gggtgcctcc gcgtgaatgg cgagttgatc actgattggg gtgattgcgc 3120 acgagtgctc ctccgcaact ttttcccagt tgcggagtcc gatgcaccga ttgccatcgt 3180 ggaggaagtc ccaccggccc ttgaatcgca cgaggttgaa gcctgtatcg cccggctcaa 3240 gagcaagcgc tcgcccggct tggacggcat caatggcgct atctgcaagg cagtctggcg 3300 cgccatacct cagcacctgg catcgttgta ttcccgatgc atccaatcgg gatactttcc 3360 caacgagtgg aagtgtccac gagtagtggc gctgctcaag ggacccgaga aggacaagtg 3420 tgagccctcc tcatacaggg gaatatgctt gctgccagtt tttggtaagg tgctcgaggc 3480 aatcatggtg aatcgagcga gagaagtttt tccggaaggc tgcagatggc aattcggatt 3540 tcgccaagga cgatgtgtgg aggatgcttg gaggcacgtg aagagcagtg ttgacgccag 3600 cccggcgcaa tatgtgctcg gcacattcgt ggacttcaaa ggagcatttg acaacgtcga 3660 atggagtgct gcactgcgcc gactagccga cttgggatgc cgggaaatga gcttgtggca 3720 gagttttttc tccggccgaa gagcagtgat ccgaagcagt tccggtgctg tggatgtgcc 3780 ggtaactaga ggttgcccgc aggggtcaat tagcggcccg tttatttggg acacgctaat 3840 ggatgtactg cttcagcgcc tcgagccgta ttgcctgctg agtgcatacg cggatgatct 3900 gcttcttctc gtcgagggaa actcccgagt cgtgttagag gaaaaaggag cgcaacttat 3960 gtccatcgta gaaacgtggg gagcggaagt tggcgttgcc gtctcgacca acaagacggt 4020 aataatgctg ctcaaaggta ctttgagacg tgcgcccacg gtaaggtttg ctggagcgaa 4080 cttgccgtat gtgcgtacct gtcggtatct tggcatcacg gtcagtgagg gaatgaaatt 4140 cctcacgcac atagcttcgc tgcgtcagcg gatgaccgga gtcgttggag cattggcgcg 4200 tgtgcttcga gccgactggg gcttcagtcc tcgagccagg cggaccatat atgacggact 4260 catggcaccc tgtgtgctgt ttggtgcccc ggtatggtat gacaccgccg ggcaagtagc 4320 tgccaagagg cgactagcct cctgccagag gctgatcctg cttggatgcc tttcggcttg 4380 ccgaacagtg tccaccgtgg cactgcaggt tcttggtgga gcccccccgc ttgacttggc 4440 tgctaagcaa ttagcggtca attacaagct aaaacgtgga tacccgctgg aggagaacga 4500 ctggctctac ggcgaggaca tcgcttgtct aagctgggag cagagaaaga ctcgcttgga 4560 ggagtgcatg ctgcggagtt ggcagaacag atgggacgac gacagcgaac caggacgggt 4620 gacgcatagg ttcatcccag atgtcactct cgtttatcgg gatccaaatt ttggtttctc 4680 gatgaaggcg tctttcctgc ttactgggca cgggtcgttt aatgcatttt tgcacgggag 4740 agccctcagc gataccacag cttgcgcatg tggcgatcca tatgaggact ggatgcacgt 4800 attgtgcgct tgccccctgt atgcagattt gcgagacctc gatggacttg gagtgcagcg 4860 cgttggcgaa aactggatgt tcgaaggaat cctggaggat cgagagagga cccaacggct 4920 ggcagcgttc gctgaggaag cgttccgcag gaggaggggc ctttagccca acacctttcg 4980 ccgtgtggtt agcgggcgag aatactacca cagtccgctg ttgcttgtcg taggaggcga 5040 ctaatacggc tataggtcgc tccgcccgtg cttgtcggag ccaaaggagt gaggccgacc 5100 gagcctctaa tttcggtacc acgggttgag tagctctcca aggttgctca ttgaggtagg 5160 ccccctagtg ggagtatcgt ggtggctgtg gttggtaccc atatcgcggg tagagccttc 5220 atgctcgacg tttgagttac ggcgatggtt tacgcaaaac tcgggtgctg tgacccttag 5280 atcagtagag attttaggta gatctcgctc ctcagcaagg gggagtgctt gctcggcaag 5340 caagcactcg aatctgcgac cggggtggtc gctatgtaca tagctatagc ttccagaccg 5400 ggacgtttgt ctggcgtatc cagactaatg catctttgga tgtcgtggtt gtaatccctt 5460 caatgtggaa cacgccacgt aaaacaagtt cggagggatc cgaaatcaca cac 5513 // ID Copia-43_AA-LTR repbase; DNA; INV; 177 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-43_AA_; KW Copia-43_AA-I; Copia-43_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-177 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 965-965 (2011). XX DR [2] (Consensus) XX SQ Sequence 177 BP; 44 A; 35 C; 35 G; 63 T; 0 other; tgaactagta tcactgtttc acataatacc gtggattcag tgtgacatat tctttgcagt 60 gtataaccta cgatcaataa agtaaatacg cagtttgtaa tcggtcgggt cgtgttttta 120 ttccgtcgga ttccgagtta ttccggattg tttaatactc tgcgaattgt cctccca 177 // ID hATm-4_HR repbase; DNA; INV; 3176 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hATm-4_HR, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; KW Autonomous DNA transposon; hAT superfamily; hATm group; KW hATm-4_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3176 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1052-1052 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM, hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-4_HR is a young family of hATm autonomous DNA transposons CC identified in the leech genome. The consensus sequence was built CC based on multiple alignment of 10 copies that are ~5% divergent CC from the consensus. TIRs are 17-bp long (two mismatches). XX FH Key Location/Qualifiers FT CDS 552..2801 FT /product="hATm-4_HRp" FT /note="transposase." FT /translation="MAKSTVTRSNNNIWLIGFPKETISGARLPSGQDVMKN FT FVFYHRVQKLNINKAAQQVHEQLLPFWVKSRLPVRQKPHIIQKIKDLYADQ FT VRLMKHRKRNNSTDQMNQKQYINKLSQLFDISHANSEQLIKNDKDRQFLKL FT QQQSRTGSIGSVDKKLMVKKKRVAKRQKRFIRNVDNAAKDAVSKIVYRGRS FT GLLSTAESAADQQLGSATHSTTSSVCHDSNEDVKSDEEFLVQSACSSKTNS FT TTQRRHRRDIISLKVAAVLDRTNTSIRKSSMILASAFNEAGVPISESIFSK FT STMHRHRQRRRQQAAALIKENYVPSKSIVHWDGKLLIDVTGVVNVDRLPVL FT LSSLVDGTTKLLGVSKLASCSGRAVADAVHEHIKSWKCESMVIGMCFDTTA FT SNTGKINGACTLLEKAMGRNLLWMACRHHIFEVLLADVFNVCLGPSTGPEI FT LFFKRFREKLTEMNHTPEARSTPLIIVSDAIKAFIKCQLEVRHSRDDYLEF FT LLLAAQIVGLQVVVAIRKPGALHMARWMARAIYALKVELLFTVNKTIFNLT FT ARELQGIQHLNRFIICVYLQSWFSCRLTADAPVNDILLIQRLHDYDDAVLG FT STGLKMMLRHSWYMSPELATLALFSSLLSDKEKTDLVRTIQQDRGPHLMKI FT LPQSIDDLRASKTFFETANIDASFLDVLVENWSDTPSFQVTAVFVRNLVCI FT NDSAERGVALVQHFNETNTKDAEQKQFLLQVVEQHRKNFAEYNRELLAKI" XX SQ Sequence 3176 BP; 1067 A; 580 C; 544 G; 985 T; 0 other; tagggtgtcc cagaaatgca cggaattttt tttttaaata gttgatatct caccctctga 60 aaagtgtctc ttggttacaa taattaaatt ggtgaagtta cagctcactt gaacaacttt 120 gagaggtccc ccaagagctt ttaaaatttt actatcgagt caattcaaaa tccatatcaa 180 taatcttaaa caatatgcac aaaatgagtt aaataaagct agaactgcaa atttttactt 240 acttttattt attattcttt gctgcagtgc ttatgtcaca tgaccaagtg gccattttgt 300 atttttttat ttttagtaaa cacaatttgc tcatttttta ttgtatataa tttagtgaaa 360 aaatcaagaa atacagcaac aatttgtaag tttgttaatt attgtactaa taatataaca 420 attatgtaat catttttaat tgtattctcc atatatctct aaatatattt aaatatttaa 480 atacaatatc aaactactta tacttaataa cacttatact gtcattggta cttgttttat 540 tttttgaggg tatggctaag tctacagtaa ccagaagcaa taacaacata tggcttattg 600 gctttccaaa agaaactatc tcaggtgctc gcctcccttc aggccaagat gtcatgaaaa 660 attttgtatt ctatcaccgt gtacagaagc taaacattaa taaagctgca caacaagtgc 720 acgagcaact gctcccattt tgggtgaaat caagactgcc tgtacgacaa aaacctcaca 780 taatacagaa aattaaagat ctgtatgctg accaagttcg tttgatgaag catcgtaaga 840 gaaataactc tactgatcaa atgaaccaaa aacaatacat aaataagttg agtcaacttt 900 ttgacatcag ccatgccaac tctgaacagt taataaaaaa tgacaaagac agacaatttt 960 tgaaattaca acaacaatca agaacgggat caattgggtc tgtggacaaa aaacttatgg 1020 ttaaaaaaaa acgagttgct aaacgtcaaa aacgatttat cagaaatgtc gataatgcag 1080 caaaagatgc tgtatcaaaa attgtttacc gtggacggag tgggttatta tcaacagcag 1140 agtcagctgc agaccaacaa ctgggcagtg caacacattc aacaacatca tccgtttgcc 1200 atgattctaa tgaagatgtt aaatctgatg aagaatttct tgtacaatca gcatgctcaa 1260 gtaaaacaaa ctctacaaca caacgcagac atcgcagaga cattatatct ctcaaagttg 1320 cagctgtgct ggataggaca aatactagca ttagaaagtc aagtatgatt cttgcatcgg 1380 cattcaacga agctggtgtt ccgatatcag agtcaatatt ttccaagagt acaatgcata 1440 gacatcgcca gcgccgtcgt caacaagctg cagctcttat aaaagaaaat tatgttccct 1500 caaaatctat tgttcattgg gatgggaaac ttttgatcga tgttactggt gtggtcaacg 1560 tagatcggtt acctgtcctg ctctcatcac ttgtcgatgg cactacaaaa cttcttggag 1620 tttcaaagtt agcatcttgt tcaggacgag cagttgctga tgcagttcat gaacatataa 1680 aatcttggaa gtgtgaatca atggttattg gcatgtgctt tgacacgaca gcttccaata 1740 ctggaaaaat caatggagct tgcactcttc tagaaaaagc catgggacgt aatttattat 1800 ggatggcttg ccgtcatcat atctttgaag tgctgcttgc cgatgtattc aatgtttgtc 1860 ttggaccatc cactggacca gaaattttat ttttcaaaag gtttcgtgaa aaattgacag 1920 aaatgaatca cacaccagaa gccagatcaa cccctttgat aattgtttca gatgcaataa 1980 aagcattcat caaatgtcaa cttgaggtca gacattcacg agatgactat ttagagtttc 2040 tacttcttgc tgcacaaata gttgggctgc aggttgttgt agccattcgc aaacctggtg 2100 cactgcacat ggcacgatgg atggcaagag ccatctacgc attaaaagtt gaactattat 2160 tcaccgtaaa taaaaccatt tttaacttaa cagctcgaga attacaaggt attcaacatt 2220 tgaatcggtt tatcatttgt gtttatcttc agtcatggtt ttcatgcagg ctcactgctg 2280 atgcacctgt gaatgatatt ttgttgatac agcgtttgca tgactacgac gatgcagttc 2340 ttggttcgac tggattaaaa atgatgctgc gtcattcttg gtacatgagt ccagaattgg 2400 caactcttgc tcttttttca tctctcctgt ctgacaaaga aaagactgat cttgtccgca 2460 caatacaaca agatcgaggc ccacatctga tgaaaatact accacaaagt attgacgatt 2520 tgcgtgcgtc taaaacattt tttgaaacag ccaatattga tgccagtttt cttgatgtac 2580 ttgttgaaaa ctggtcagat actccttctt ttcaagttac agcagtgttt gtaaggaatc 2640 tggtttgcat caatgactct gcagaacggg gagtagcgtt ggtacaacat ttcaatgaaa 2700 ccaacaccaa agacgcagag caaaaacaat ttttgcttca agtagttgaa cagcatagga 2760 aaaactttgc agaatataat cgtgaactgc tagctaaaat ttgaattatt tgcatttatt 2820 aatattcaag caaactgaaa ttcaataaaa tcttgttttt caatcatttt tgacctaatt 2880 tttaaaatta aaaaataaaa ggcattgcag atttatatat tttttcacaa ttatatattt 2940 tgtttcattc ctataaacaa atcaaaaaac atttataatc atatcgtata acactgtatc 3000 aaaagttggt agcgaacgat caaacaccaa aatcggaaac tttcaacccc ttgggggacc 3060 tctaaacatt gtccaaatgg gctaaaattt ggcaggtaaa ctattgttac cataaagaca 3120 tatttaaggg ggtgagacat cattatttta aaacttcata tttttggcac acccta 3176 // ID Gypsy-246_AA-LTR repbase; DNA; INV; 180 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-246_AA_; KW Gypsy-246_AA-I; Gypsy-246_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-180 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1094-1094 (2011). XX DR [1] (Consensus) XX SQ Sequence 180 BP; 51 A; 41 C; 26 G; 61 T; 1 other; tgaaatattg aattcgaaat tttkcatttg tgaaccaccc taacgccacc caacaattgg 60 gggctttcct tttgcttttc atctattata taacatttct agtcagtaat atatctcccg 120 ttgaactgag cggccgcaat atcgttgtaa atccgaacgt ctttaaaaga tcctcttaca 180 // ID RTE-13_BF repbase; DNA; INV; 3383 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-13_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-13_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3383 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3383 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1711-1711 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 219..3365 FT /product="RTE-13_BF_1p" FT /translation="LFRANHPSSCQSGGNQHSGKYPITKVTRNYPQTRSRV FT ARQTSHPDPMTGQREPGTSSQTVGPDMSRLSTFKPLVISTYNVRTLYQKGK FT THQLFMGLNDAGVDIVGIQEHRLITQSPTEELWSDDKNYVLIYSSATAQRL FT GGVGLAVSKHIYKCLQSVNSVSERILTVTFHGNPQLTITVIYAPTESASPA FT DKDDFYSTLKDHIEMVKKHDEHLVIGDFNARIGHDSHVSHPVATGPHCYYS FT VTNDNGDRLVNLCQEHKLRPAQTRFPHPRSRLWTWMHPGGSTHQLDHILIN FT GKWVNSLRNCRAYNSVELDSDHRILSVLFVTSLRKSKGKPCRRPKFNWRKL FT REAHTREVFQMELSNRFEALKCDEVSSPITERYACWEKTVAETAEKVVGRC FT KSCGMPSWVSAKTLELRDQRDKAKKRFLLAKTRQSKETWRQLNTSLNESYK FT RDEVAFTEQQIEDLEEADKKGDHLSTWKIINSISGKNRRTNQKVKKRDGTA FT PSSDNELLAEWREYFSELLNNVSVPQTSGLPPPADQDLPICVDPPTLEETV FT RAINSMKNNKAPGLDCAITAEALRGGGDVMARAIHAFCKEVFVTLTPPDQW FT TTNVIVPLPKKGDLSLMTNYRGITLMSIAAKVYNKILFMQIRGHIDPILRS FT NQAGFRRGRSCAQQTHILRRIMEAYQTHQLPLVITFIDFKKAFDSIDRKVM FT LAVLRHYGIPLTIVNAINVLYNNSKSAVMVDGNISGPFQVTTGVLQGDVLA FT PFLFIVLIDFLMRRSTEDIDSGIVTHPRQSRRYPARILNDMDFADDITLLE FT SSIPKAQEQLNRTAAAGTQLGLVISVPKTEYMTINCSQQPPLEVYGQPIKN FT VTNFRYLGSMMASSKNDLTRRRALAWTAFWKLQKIWSSTSLPTSRKIQLFN FT VSCVPVLLYGCESWVLSSDMENNINSFATSCYRIMLNIKRLQRVSNSEIYR FT VTNTQPLICKVRQRQLRFIGHALRMPLEEPLRTYALYVPSHGKRRPGRQKT FT NYLTYIQNLLGDVEGDLNPTAIAALAADRLRWNRIVTDCTAAD" XX SQ Sequence 3383 BP; 984 A; 851 C; 793 G; 755 T; 0 other; attagttatg gggattaaga cagagtgctg acctcctagt tggtagtccc cgggcaacgg 60 ctgctgcaat gaccctgagt tgcgatacca tcgggggccg ggcggagcac cagcaggtct 120 acctgtggtg ttccccccgc tcctggcagg ttcaaccatg gtgggacggt tctcagggga 180 gctgcagctg gctaacagca cctgcactcc gcatatagct gttcagggcc aatcacccgt 240 ccagttgcca gagcggagga aaccaacatt ctggcaaata cccgattacg aaagtcacga 300 ggaattatcc acaaactaga tcgagggtag ccaggcagac gtctcaccca gaccctatga 360 cgggccagcg ggaacctgga acgtcatcac agactgttgg tcctgatatg tctcgtctat 420 ctaccttcaa accacttgta atttcaactt ataatgtgcg tacactctac cagaaaggaa 480 agacgcacca gttattcatg ggcttaaacg atgctggtgt tgacattgtt ggcattcagg 540 agcaccgact tataacacaa tcccctactg aagaactatg gtcagatgac aagaattatg 600 tccttattta tagctcagcc acagcccaaa gacttggagg tgttggactt gcagtctcaa 660 aacacatcta caagtgtctt caaagtgtca actctgtgag tgagagaatc ttgactgtca 720 cgttccatgg taatcctcag cttacaatca cagtcatcta cgccccaacc gagtctgcca 780 gtccagctga caaggatgac ttctacagta cgctcaaaga ccacatagag atggtaaaga 840 agcatgacga acaccttgtt attggagact ttaatgctag aataggacat gacagccatg 900 tatcccatcc ggtagcgaca ggtccccatt gctactatag cgtgacaaac gacaacggtg 960 accgcctagt gaacttgtgc caagaacaca agcttcgtcc agcacagaca agattccccc 1020 accctaggag tcgtctctgg acctggatgc acccaggagg ttctacccat cagctagacc 1080 atattctaat caacggcaaa tgggtgaact cactacggaa ctgtagagcc tacaactcag 1140 tagagttgga ctcggaccac cgtatcctaa gtgtactgtt cgtcaccagt ctccgtaaat 1200 cgaaagggaa accatgtcgg agacctaagt tcaattggag gaagttaagg gaagcccaca 1260 ctagagaggt atttcagatg gagctttcaa accgatttga ggcgctgaag tgtgacgagg 1320 tctcgtcacc aatcacagag aggtatgcct gctgggaaaa gacagtcgca gaaactgcag 1380 agaaggtagt aggaagatgc aagtcttgtg ggatgcctag ctgggtgtca gcaaagactc 1440 tagagctacg ggaccagagg gataaggcaa agaagagatt cctgcttgca aagacccgtc 1500 agtcaaaaga gacatggagg caactaaaca ccagcttaaa cgagtcctat aagcgtgatg 1560 aagtcgcctt tactgaacaa caaatagaag acctcgagga ggcagacaag aaaggtgacc 1620 atctctcaac atggaaaatt atcaactcaa tctctggcaa aaacagaaga accaaccaga 1680 aggtaaagaa gcgagatggt acagccccat ctagcgacaa cgagctcctt gctgaatgga 1740 gggagtattt ctcggagttg ctgaacaatg ttagtgtccc acaaacttct ggcctcccac 1800 caccggcaga ccaggatctg cccatctgtg ttgacccgcc cactttggag gaaacggtca 1860 gagctataaa cagcatgaag aacaacaaag ctccaggcct ggattgtgca atcactgcag 1920 aagcccttag agggggaggt gatgtgatgg ccagagcgat acatgctttc tgtaaagagg 1980 tctttgtaac actcactcct cctgaccaat ggaccacaaa tgttattgta cctttgccta 2040 agaaagggga tctcagcctc atgaccaact atagagggat aaccctaatg tccatagcag 2100 ctaaagtcta taacaagata ctgttcatgc agataagagg ccacatagac ccaatcctaa 2160 gaagcaacca agctggtttc cgtcgaggca gaagctgtgc ccagcaaacg cacatcctac 2220 ggagaatcat ggaagcctac caaactcacc aattgccact tgtcatcaca tttatagatt 2280 tcaagaaggc atttgactcg atcgatagga aggttatgct cgctgttctc cgccactacg 2340 gcatcccgct cacaattgtc aatgccatta atgtacttta taacaactcc aaaagcgctg 2400 ttatggtgga tggtaacatc tctggcccat ttcaagtcac cacaggagta ttgcagggtg 2460 atgtccttgc tcccttcctg ttcattgtgc tcatcgactt tctgatgaga agatcgacag 2520 aagatattga ctctggcata gttactcacc ctcgtcagtc tagaagatat cccgctagga 2580 ttttaaacga catggacttt gctgacgata tcaccttgct agagtcttcc atccccaaag 2640 cacaagaaca acttaacaga actgctgctg cagggacaca actgggcctg gtcatcagcg 2700 tcccgaaaac tgagtacatg accatcaact gcagccagca gccaccactg gaggtgtacg 2760 gccagccaat caaaaatgta accaacttta gatacctggg tagcatgatg gcatccagta 2820 aaaacgatct cactaggcga agggctcttg catggactgc cttctggaaa ctgcaaaaga 2880 tctggagtag tacttcactt cccacctcca gaaaaatcca gctctttaat gttagctgtg 2940 tgccagtgct tttatacgga tgtgaatctt gggtactgtc gtctgacatg gagaataaca 3000 taaactcgtt tgctacatca tgctacagaa tcatgttaaa cattaagcgt ctccagcgtg 3060 ttagcaacag tgagatatac agagtgacaa acacccagcc tctcatctgc aaagtcaggc 3120 aacgccaatt acgcttcatt ggacatgctc ttagaatgcc tttagaagag cctctgcgga 3180 cttatgcctt gtatgtccct tcccacggca aacgacgacc cggtcgtcag aagacgaact 3240 acctgacgta tatacaaaac ctgctggggg acgtggaggg cgacctgaac cccactgcta 3300 tagctgcact ggctgccgac agactgaggt ggaatagaat cgtgaccgac tgcacagcag 3360 ccgactgatg atgatgatga tga 3383 // ID BEL-25_AA-LTR repbase; DNA; INV; 336 BP. XX AC supercont1.26; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-25_AA_; KW BEL-25_AA-I; BEL-25_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-336 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.26; Positions 816914 817249. XX SQ Sequence 336 BP; 108 A; 67 C; 72 G; 89 T; 0 other; tgatagaaaa ggatagaaga aaatatcgat gcatagattt ttattagcat ccctattggc 60 aacaaattct ctcgccggtt tgctcttctc cttatatacc cacagccggc aaacgatcgt 120 attgcagcca gcagtgaaga atgtatatac atatagtagg aaactagcat agcagaagtt 180 agttgtaaat aaaaccgtcg cggagtacac ttcgtttggt gatttctgac cgtagtaaaa 240 tcagttcagt gttcgcgaat aaagaagtgt tcagttcatc aatacgcgag tttctgaagt 300 caagtgtccg aaaaactacc cgcgacggcg cgaaca 336 // ID hATm-32_HM repbase; DNA; INV; 3386 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-32_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3386 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1926-1926 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(652..1890,1894..2613) FT /product="hATm-32_HM_1p" FT /translation="MARIKTKYRQNCMFDVMKLWNEWQSLMKNKGRTTDPG FT SKRANFIERLDSLFNIGAPDAIDEIKKSRFLSERKKQDDIDFYLDQQQERR FT ACMDGHDKIFETKAEIKLARQAREARITEREQCRLDILREEQVEGVAALLE FT YDSETQLNEADDRTEIDPDFETNLQQEATKIDQGEYVTLKLPRKIMQCDQI FT TSAADRLKLSDNQTTMIVSALIKTAGGNLEDFEIYNLRSTRSTTWRTRMFN FT RQKIAESVMETVKQNPPQYGALHWDGKLLKDMLGDTYERLAILISGAPEYK FT EGKLLGVPSLANSQGATQAEATLDLIQVCDLSDRVVALVFDTTASNSGIHK FT GAAKLMEASLDKKLLYLACRHHILELVVGAVWKLLFGDILGPENKLFAKFK FT DVWSNLDKVQAIQVLETENWLLNIKHKVVKELTDMLSCKDSTTFPRDDYRE FT SAENTLIILGETPPRGAHFLKPGGMHQARWMANNIYAGKMYMFSKQMRYDD FT DMVAKLMRMNRFLAHFYTPAWMKTSWGADASMNDLQFIHDMIDFRSVDKDI FT ADVVINKLRNHQWYLSEEVVPFALFSKHPLMTHALKKEMADQLLLTPIPEN FT FRLGKPVFKKIGRETTLTSLIGPESHSLFHILKVSTGWLSKPVEQWNSDQA FT KRX*" XX SQ Sequence 3386 BP; 1163 A; 521 C; 670 G; 1027 T; 5 other; gggttgtcca ggatatagta tgaagtgtca aaacgtcgtg aagtgaagtt tttcccatta 60 tactatcaat actatgtcta aaaataattt ttacagattt ttgataagtg atcctgaggt 120 ccctcaaggc ctctgaacat gttcccccat tttttgcata aattgcattt tttgttttgt 180 ttttttagaa tcgcaataat caaagaatca tttctaaaaa gttttgattg gttttcagtg 240 gcttttagtt gtattattcg agttttgata ctgtttagtg aaagaataaa tttaatttta 300 tcaatgaata tattggaaaa atgttgactc gaagtagtaa aaagcgcgat ccatcaccac 360 ggcgaagtac tgaagacaaa aagtctaaag aacaaaaaga gaatccactt tgtcaaggta 420 cttctagctc acataaagga actccaagca gacctgtgac aagaaagtta actgatttct 480 ggttaattgg tcatccttca acatcaatta atggggccaa actgcctgat tgtaggcaaa 540 ttatgaaata tgttttattt ctaaggaacg atccagaaaa tattaaaaat aaawgtgaag 600 aatgaggata ttgcttatgt tgtggttgat gcagtccttg ttttttggaa tatggcacgc 660 ataaaaacta aatacagaca gaattgcatg tttgatgtta tgaaactttg gaatgaatgg 720 cagagtttga tgaaaaataa aggaagaaca actgaccctg gaagcaagag agcaaatttt 780 attgaacgat tggattctct ttttaacatt ggtgcacctg acgccataga tgaaataaag 840 aaatcaagat ttttgtcaga aagaaaaaaa caagatgata tcgactttta tttggaccaa 900 caacaggaaa gaagagcatg tatggatgga catgataaaa tttttgaaac caaagccgaa 960 ataaaacttg caagacaagc acgtgaagca agaattacag aaagagaaca gtgcaggttg 1020 gatattttaa gagaagaaca ggtcgaagga gttgctgctt tgctagaata tgattctgag 1080 acacaattaa atgaagcaga tgaccgaact gaaattgatc ctgattttga aacaaatcta 1140 caacaagaag caacaaaaat agatcaaggc gaatatgtaa ctcttaagct acctaggaaa 1200 ataatgcaat gtgaccagat aaccagtgct gctgatcgtc tcaagttgtc agacaatcaa 1260 acgacaatga ttgtatcagc attaataaaa actgctggag gtaatcttga agattttgag 1320 atatacaact tgagatctac aagatctaca acttggagaa caagaatgtt caatcgtcaa 1380 aagattgcag aaagtgtcat ggagacagtg aaacagaacc ctccccaata cggtgctttg 1440 cattgggatg gtaagctttt aaaagatatg ctgggagaca catatgaacg acttgcaatt 1500 cttatttctg gagctcccga atacaaggaa gggaaattat taggtgtacc ttctctagct 1560 aactcccaag gagcaactca agctgaagca acgcttgatc tgatacaagt ttgtgatctg 1620 tcagacagag ttgtagctct agtgtttgat acaacagcta gcaatagtgg tatacacaaa 1680 ggagctgcta agttaatgga agcaagtcta gataaaaagt tactctatct agcttgtcgt 1740 catcacattt tagaacttgt tgttggtgct gtatggaaat tattatttgg tgacatttta 1800 ggcccggaaa acaaactgtt tgcaaaattt aaagatgtat ggtcaaactt ggataaagtt 1860 caagcaatac aagtactgga aactgaaaac taatggttgt taaacataaa acataaagtt 1920 gtcaaagaac ttacagacat gctatcttgc aaggattcaa caacttttcc aagagatgat 1980 tatcgtgaat ctgctgaaaa cactctgata atcttgggtg aaactccccc acgtggcgcc 2040 cactttctca aaccaggggg gatgcatcaa gcacgttgga tggccaacaa catatatgct 2100 ggaaagatgt atatgttttc caaacagatg cggtatgatg atgacatggt tgctaaactc 2160 atgcgtatga atagattttt ggctcatttc tacactcctg catggatgaa aacatcatgg 2220 ggagcggatg catctatgaa cgacctccag tttattcatg acatgattga ttttagaagt 2280 gttgacaaag atattgctga cgttgtgatt aacaaactcc gtaatcatca atggtacttg 2340 tcagaagaag ttgtgccgtt tgctttattt agtaaacatc ctttgatgac acatgcacta 2400 aagaaagaaa tggcggatca attgcttttg actccaatac cagaaaactt ccgccttggt 2460 aaaccagtat tcaaaaaaat tggtcgtgaa acaactttga caagtttgat tgggcctgaa 2520 tctcattcgt tgttccacat tttaaaagtg agcactggtt ggctttccaa accagtggag 2580 cagtggaatt cagaccaggc taaaagarat tgagatttgc agtcaagttg acttttattt 2640 ggagcarcaa caaaagarca tgttggataa atttgaaaca ttttgaaaaa aagccaaatt 2700 gactttcmag aagcggaaca gttgaaatcc agtccaggct tttagagttg cagaacaatt 2760 tgtttgtaca gttaaagttg tcaacgatgc agcggagcgt ggtgtcaagt taatctcgga 2820 ttttgctaca atcattacaa ctgatattga gcaaagggca tggttgttgc agggtgtcga 2880 gcagcataga aagttatatt ctagttttga taaaaagaca ctaaatttgt gaaactgtag 2940 tgtagtacta agtactatat gagtgtgttg agtataagag attaaacact tttcaagtaa 3000 attactaatt tttcaagtaa agactgtaaa gtccttcgct ttttgttata tttggtcaca 3060 aaattgacta aaacatttta ttttttttaa gttgaaatgt taaatgtaac attaactttt 3120 tttttttttg tagatctgat tacctttata ttaaacatga ttttgtttaa attatgttgt 3180 tatatttata gaattataac attttataat gtaaaagatt ttaatttttg ctctaaaaat 3240 ggggttcaaa ttcggaggcc ttgagggacc tcaggatcac ttatcaaaaa tctgtaaaaa 3300 ttatttttag acatgttatt gatagtataa tgggaaaaac ttcacttcac gacgttttga 3360 cacttcatac taatcctgga caaccc 3386 // ID Gypsy-43_AA-LTR repbase; DNA; INV; 239 BP. XX AC supercont1.107; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-43_AA_; KW Gypsy-43_AA-I; Gypsy-43_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-239 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.107; Positions 2200354 2200592. XX SQ Sequence 239 BP; 84 A; 34 C; 47 G; 74 T; 0 other; tgtaagctag atatcataca taaagtgaat ttgatcccta agtggtagtt tgctataaac 60 caatacatgt tctgaactca aaatgaactc tataatataa gtgagtgacg gcaaaaataa 120 ttgaccgaag atgtaataaa gattagtcct aattgtacct tcatggacta aagactagtt 180 ctaagaatcg gtgcgttttt cggtagattc gaaaagtctt tgggtacgat aggtttaca 239 // ID Copia17-NVi_LTR repbase; DNA; INV; 218 BP. XX AC AAZX01010587; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia17-NV; KW Copia17-NVi_I; Copia17-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-218 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1151-1151 (2007). XX DR Genome; AAZX01010587; Positions 5479 5696. XX SQ Sequence 218 BP; 50 A; 52 C; 50 G; 66 T; 0 other; tgtgggaata tatatagacc atacgcatta tgcccatgta caggagtggg gagcaccatg 60 ttggcatctc cgtgagaagc ggcgctgcgc gccacttgct tccgagctac tgatcggtgt 120 gcatcgcctg taactcttat cgaataaagc cttttactta cgaccgaatc ttttgtgtat 180 ctttattgtg agcacctacc tagtctgtta ttctaaca 218 // ID Kolobok-1_Aplcal repbase; DNA; INV; 3975 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-1_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3975 BP; 1006 A; 860 C; 949 G; 1160 T; 0 other; gggggagcgc ggcttttaaa acggacgatt ttggggtcaa attatggtat cgaggttttt 60 tacatcttgt gaaacttcaa cctctcccgt acctaaaaat agctatgaga gaatttagaa 120 atagcgtatt tacacctaaa attgtctttc aaaacactag ggtagggaaa aaattgtatg 180 attttgagaa aacatgatct atttctcatc aattgaccaa ccaataatta tagtttatat 240 cggaataagc atacagtgtt gtatatcgat ggaatcgtta ttttcccctc tgtctgaata 300 aatatgcaac tgggtcatga cgtctcatca ccgcttccta ttagctcaga acgagacaaa 360 ggtttgacca agattgacct agattgtggg ctgaatgttt actactgaga tgaccttgaa 420 tgttgattgg ctaatgcgaa tgaccccaag gaatgctcga attctgttgg ttctccacca 480 tagttacaag gtcacttccg caaagcttga cccataattg cacgtggaat tttccgtctt 540 ttcttgaatg aacaaaggat tttctttgta tacaggctca tgtgtgtcga ttgtttgtgc 600 gcatgtgtac ttggtgtgta gtctcaagtc aaagaaacgc gtttgtctct cgaaacgaag 660 ggccactttc agattacgtg ctggatggca gcatcactgt gtcgataacg ctctacaatt 720 gctgtgctca ctcgtctctg atattgttag taaggttgtt acttacttag gccttcttgt 780 atacttgaag cggacattct agatttgttc tagatttgtg aattgcgtta gtattgaaaa 840 tgccgaagaa gaaggtgact ggtcgttcga agcagctgca gaaagctcgt gagatgcgtg 900 gaaaggctgt ggaactggaa ctcgatagga gctgtgaagc tgaggttgag gtacgtagac 960 aagattctct agcctagtga tatcaattta tatgtgtact ttagtataaa tgtggttact 1020 agaatgatag atctagtagt agatctagat atagcgtgta gatcaagatc tgtaccatgt 1080 ggtttgtgtg cgcacgcgtg tgtgtgtact ttagtatttg tggtactaga atgatataga 1140 tctatctagt agtagtaacg tgtgtggatc tagatctatg tggtttgtgt gtgtgtgtgt 1200 gcatgattag actctaattc tttgttgcag aaatgaaaaa tatctattga ttcttcttac 1260 tgaatttgaa atgcatggtt acctaggccc tatattcatt tttatatagg cacaattttt 1320 ttcaactgta tgactttttc atgtaggcct acaggcctaa tgttttaaca ttttcagtca 1380 acagattttg aatattcagt accagaaatt tacttgcatg aagctgattt ttccgtaact 1440 ttttttccgc agtcctcgtc ttctgtccct gtgcccccag agatgcctgc atcgtcttcg 1500 attctggata ttccagagac cccacggcct gtctttccac cggaccctct gctgctgatc 1560 cctctccgtt cccaggagaa gctgagtagg tatgtcgtgc ctgctgctgc tgaagatacg 1620 tctcgagcca tcgtggagtt gtcgcaggtt gaggggctgt tcaagcagct tcagtgtccc 1680 cagtgtggca tcgttgggac tttgtcgcta cgtcagatgg gtcagttcgg cctggctctt 1740 cggctctccc tgttctgctc agaatgttct agcgtcgtcg gtaagcccca gtacacgtct 1800 tccctcgaca ccagcacccc agcaaagagg aaaccgttca aggtaaatca gagtgccacg 1860 gctgcggctc tgatgtctgg cctggggccg taccagttca acacattatg tgctcacctg 1920 gatctcccag gtcttaatcc gaagaccttc aacaaatatg ccactcaggt gtacggaaag 1980 agtgagtctc tggcggacaa agtatttctt caggctgcca acgctgttcg ccaggcttac 2040 caaagcatgg gctgcgtcgc ggaggatggc gtgctggaca tcgccgtgag cttcgacggc 2100 tcctggctga cgcgtggcca caagtccctc atcggtattg gctgcgtcat cgacgtactg 2160 accgggttag tactcgacag ccatgtcctg agccttcatt gccagacttg cgctaccact 2220 ggtcagtgga agaagacgaa cactcccctg gcttacgacg cgtggctgac ggagcacaaa 2280 gctagtggct gcaacataaa ctatggtgga tcgtcgggga tgatggaggt ggaggctgca 2340 gttgtgttgt ggagccgttc gatggaaaag ttcggcctgc ggtacaccac gtttgttggt 2400 gatggagact ccaaagcctt caacaaggtg acagaggtga agccgtacgg tcctggagtc 2460 cagattgtga aggaggagtg cctaaatcat gtggggaagc ggctgggcac tgcgttacgg 2520 aacctcgtca gcgactgcag caagaaaggc gtcacgctag gtggaagagg ccatggcagg 2580 ctgacagcaa acactattcg caagctgtcc atctactatt caagggccat ccgaagccaa 2640 gccactgcag ccgagatgag gacagctatc ctggccagtg tccatcatgg atactcaaca 2700 gacgaccacc cgcagcacat gtactgtccg tctggcaacg actcctggtg cttctacaag 2760 aagtccttgg caaggcacgt ctatccagga ggccacaaga acagggtgca cacgcccctg 2820 aactacgaac tgctgcatga gcacctccgt cccgtctacg aaaggctcac agccgattct 2880 ctacttcttc ggtgtgaacg caaggcgacc cagaatgcaa acgaaagttt tcaccactcg 2940 gtctgggcca aatgcagcaa aactcagttc cgcagcaagc aacgagtcga agtggctgtc 3000 atcactgctg ctgccgaatt caactttggc ccaggatata caaccgagtt gaaggattta 3060 cttggtgtac cttctggggt aaatactgaa cgcctgaact ctgcccgcac caccaagcgt 3120 ttatacaaaa gccaggacgt acagaaagcc gctgcgaaga ggaggaaggt gatgcgagcg 3180 aaggcaatag agcaggctcg tcttgaggcc gagaaagaag atggtgtggc gtatgggcct 3240 gggatgtttt gaagtgtact ctaacggaca ctatcaccag ttttcttcgt tttttggttt 3300 tgaacagtca gtaccccatg tcacaattta ctttcttatt agcatatttt tagctggatt 3360 gtattgaaac tgtgattcac aagaacataa atgcttgttg tgcatgttac aatcatcatt 3420 tttcagattt tccgcccagg aatttttttg gggtctattt tgaaacaaaa tgtcgttttt 3480 ctcaaaactt tacattttga tacctttttc tcatctactt ttttattttc taaccgattc 3540 ctctgatatt ttcagggttt gttgttcaca tcataaccaa actttataca acaggacaat 3600 gtaataggta catttatact tgagttatgg gcgattgtgt aatgaaattc agaggtgtga 3660 aggggtaggt cagttgtcca tctataactt ttttcctatt caagatatcc gagagccctg 3720 ttgtataata ttttgttttg agtatatcta tcaatgtacc aaatttgaat aaattcggtc 3780 agtaattaaa aaagcctttc aagatttatg aaagctgttt attttcaata tggccgccgt 3840 taccatggta acggtggatt tttaaaaaaa cgtaacgttg ctttttagag atgttttcct 3900 gtttaacctc cttgccaaaa ggctttacct gaactttcat agttaccgag ataggttata 3960 aagtacccct ccccc 3975 // ID Crack-20_BF repbase; DNA; INV; 2631 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-20_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-20_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2631 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2631 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 825-825 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..2479 FT /product="Crack-20_BF_2p" FT /translation="EKENADIAIMGDFNSDLLNSTHSMSSVEFLMGLYQLE FT PVIRQPTRITETSESCIDNIFVSNPDKYKSSASVAWGPSDHNLIMTCAKAG FT RVAGAARRCEYRSYKQYNQQPFIDSLKSARWDTVLDCFDVTEAWNAFKDIF FT LTVADEHAPLRNKTVRENNHTAPWLTDRVKNIMGRRDAARRKAIKTRDVKD FT WEIYRSLRNQATSVIRKEKKSHFAQAVTEAKGNQSLMWKIINNFTGKSKPN FT SVVSHLERADNTTTSSPCEIAQDFNEYFTSCATRLVDGMPASQEDPLRNMP FT EPTKTFSFESVDEVTVLNELQRLKTKKATGMDKIPSRLLKDSAPVIVKPLT FT HIFNLSLSTGEVPTEWKEAQISPIHKSGTRANVANYRPVSVLSVTSKVMEK FT LVSYQVSRFLDRNDLLTEHQSGFRKHHSTATAVQKIVEDVKSAFNSSKVTV FT ALFLDLRKAFDTVNHNILLGKLKKLGFDSDATKWFTSYLSDRFQCTQIQGQ FT HSTKALVTCGVPQGSVLGPLLFCIYVNDLPNVIEKSSIHLYADDTILYYSA FT SSVKECEETISSDMXRVAXWLKENKLLLHPDKTKSMLFGLPQKLRHTGRSV FT NITDGVNVYEQVQSFTYLGVTLDPSLLWTSHAGKVMKKILGGLGAMWRAKG FT FVSTEILQTMCQTLLLSHLDYCATAWLPSLIQCNKGQTAKLDRLLNRAARL FT ITGYRLQDHVPVHTLLTAAGLESVRKRLETVSLTTVFKSVRGRAPSYMSTL FT FRWMSPPTLRVNTRAASTRLRDYDPHLLQLPHVRVTSFRGSLQYYGVQLWN FT KLSHETRSMMKFRTFRRQLHLMA*" XX SQ Sequence 2631 BP; 750 A; 628 C; 611 G; 640 T; 2 other; agaaaaagag aatgcggata ttgctatcat gggggacttt aattctgacc tgctgaactc 60 tactcattca atgtcgtcgg tggaattctt aatgggtcta taccagctgg agcctgtcat 120 acgccagcct acccgcatca cagagacttc cgagtcatgc atcgacaaca tatttgtatc 180 gaacccggac aagtacaagt ctagtgcaag tgtggcctgg gggccttctg atcacaacct 240 gatcatgaca tgtgctaaag ctgggagagt ggccggtgct gctcggcggt gtgagtacag 300 gtcatacaag cagtacaacc agcaaccctt catcgacagt ctgaagtcag ctagatggga 360 cacagtgctc gactgctttg atgtcacaga ggcctggaac gcctttaaag acatttttct 420 gactgtcgcc gatgaacatg cacctcttcg caacaaaaca gtacgggaaa acaatcacac 480 tgctccatgg ttgacagaca gagtgaaaaa catcatgggc cgtcgagacg cggccagacg 540 aaaagccatt aaaaccaggg atgttaaaga ctgggagatc taccgatctc tccggaacca 600 ggcaacttct gtcatcagga aggagaagaa aagtcatttt gcccaagctg tgacagaagc 660 taagggtaac caatccctta tgtggaaaat catcaacaat ttcacgggga aatccaaacc 720 taatagcgta gtcagtcact tggagcgtgc tgacaacacc acaacatctt ctccctgcga 780 gatagcacag gacttcaatg aatattttac gtcctgtgcg acaagacttg ttgatgggat 840 gccagcttct caggaggacc ctcttcgcaa catgcctgag ccgacgaaga cgttcagctt 900 tgaatccgtc gatgaagtca ctgtactaaa tgaactccag aggctaaaaa caaagaaagc 960 gacagggatg gacaaaatcc cctccagact tctcaaagac tccgcccctg tgatagtaaa 1020 gccacttacg cacattttta atctgtcact ctccactgga gaagtcccaa ctgagtggaa 1080 ggaggcgcag atctcaccaa tacacaagtc cggaacacgg gctaacgttg ccaactatcg 1140 gccagtgtct gtgctaagtg tcacgtccaa ggtgatggaa aagcttgtgt cctaccaagt 1200 atcgcgcttc ttggatagga acgatcttct cacagaacat caaagcgggt ttaggaagca 1260 tcatagtaca gcgactgcag tccagaagat tgttgaggat gtcaagtccg cgtttaacag 1320 tagcaaggtc acggttgcgc ttttcctgga tttaaggaag gcttttgata ctgttaacca 1380 caatattcta ctgggcaaac tcaagaagct gggttttgat agtgacgcta caaaatggtt 1440 cacatcctac ctctcagacc ggttccagtg cacgcagatt cagggtcaac actctacaaa 1500 agctctggtc acctgcgggg ttccccaggg aagtgtcctg ggccccttgt tattctgcat 1560 atatgtaaat gacctaccta atgtcataga gaaatccagt atccacttgt atgccgatga 1620 taccattctt tactactcgg catcttcagt aaaagagtgc gaagagacaa tctccagtga 1680 catgamgaga gtagcaaast ggctgaaaga aaacaagctg ctactgcacc cggacaaaac 1740 caaatccatg ttatttggtc tcccacaaaa actaagacat actggtcggt ctgtcaatat 1800 aacagatggt gtaaacgtgt atgaacaagt tcaatcgttt acatacttgg gggtcacact 1860 tgatccatct ctcctgtgga cttcacacgc tgggaaggtg atgaagaaaa tccttggtgg 1920 tctcggtgcc atgtggcggg ccaaaggttt tgtctctacg gaaattctac agaccatgtg 1980 ccaaactctg ttgttgtctc accttgacta ctgcgccacc gcgtggctac caagtctcat 2040 tcagtgcaat aaaggacaaa ccgcaaaact tgacagactg ttgaacagag cggcaagact 2100 gattacagga tacagacttc aagatcatgt cccagttcac accttgctaa ctgctgcagg 2160 gctggagtct gtgcgtaaac ggttggaaac ggtctctctt actacagttt tcaagtcagt 2220 ccggggaaga gcaccatcgt atatgtccac attattccga tggatgtcac ccccgaccct 2280 aagggtcaac acacgtgcag cgtctacacg gttacgggac tatgaccccc acttgcttca 2340 gttgccacat gtcagagtga cgtcatttcg cgggagtctg cagtactacg gtgtgcagct 2400 gtggaacaag ctttctcatg aaacacgaag catgatgaag ttcaggacat ttaggcgcca 2460 attacaccta atggcttagt aatgctattt ttacagctat gttgttattc attgttattg 2520 tgtaatgttt ttatgtatgc gcatgtgccc aggattgcct gaaaaacagg tcaacgctga 2580 cctgagatgt atttgcctgg taaaataaat aaactgaaac tgaaactgaa a 2631 // ID Copia9-NVi_I repbase; DNA; INV; 4591 BP. XX AC AAZX01000081; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia9-NVi; KW Copia9-NVi_I; Copia9-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4591 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1116-1116 (2007). XX DR Genome; AAZX01000081; Positions 9244 4654. XX CC Positions [1983-2510] - Integrase core CC 'ATTTA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 121..1716 FT /product="Copia9-NVi_I_1p" FT /translation="MAEIICTAVQKLNSVNYSTWSYKIQLMLMKDKVWDVI FT KDEPPAAADTTYAAWTARDETAQVTIGLLVEDSQLRLIKNAKTAREQWNTL FT KEYHQKSTLSGRIHLQRRFYHTVLQEGGDMVEHIATLSEYVDKLFGMGHSV FT DDSTLVGVLLCSIPESYGTLIMALEGRPDKDLTPDFVKGKLIDEYMRRKDS FT REARSSASDEVAMNTTYSEKKGTQPSKLKCFFCKKKGHAKSENAKTAREQW FT NTLKEYHQKSTLSGRIHLQRRLYHTVLQEGGDMVEHIATLSEYVDKLFGMG FT HSVDDSTLVGVLLCSIPESYGTLIMALEGRPDKDLTPDFVKGKLIDEYMRR FT KDSREARSSASDEVAMNTTYSEKKGTQSSKLKCFFCKKKGHAKSECRKYKT FT WQEKHTEKANKVTSVIIENDEDYGCYLARVEACLGAKAEIENWIVDSGATS FT HMAQTRTFFEELDAGSKGYVRLADDENRVQVEGVGTAAIKCADASEKISTI FT KVREALYVPGLGANLLSVSKLTKDGYKRFSKRTSAR" FT CDS 2061..3851 FT /product="Copia9-NVi_I_2p" FT /translation="MQTLTPGGKRYFMTIIDDYSRYTHLYIMRHKNEAAGL FT IQEYVQMAKTQFHKVPKIIRSDRGREYVNTELQSFLKNQGILIQYTAPYSP FT QQNGVAERKNRSLLDMSRCMLFDAELDIKYWGEAVHTANNLQNRLPSRATT FT KTPYELWYSEKLHLADLQIFGSPAYMHIPKEQRRKLDHKGKKYFFVGYSDE FT SKAYRLLDTETDKIKISRDVEFVDKRGSKEKSHPSQEECEISILHNKQNEN FT IDQPDDNSSIASSSSEDYASIEESSLESSSSDVKESSVQDDNVPEAEAPPV FT TRRSERSNRGVPPDRYMAASNLTRLESEPRSVKEALSRPDAHHWKKAMEEE FT IFSHIKNGTWVIVQRPDGRDIIKCKWVFKIKYNANGEIERYKARLVACGYS FT QIYGEDYNEVFAPVVKQTTFRTLLTVAGHKGLQIKHFDVKTAFLNGELKEN FT IYMKPPDGFITQDGQEMVCRLRKGIYGLKQAAKVWNDCLNDCLTAFGFKQS FT QADLCLYTKDDNGDKTYLVVYVDDFLIAAKDSMKIDEVAEFLSTKFQLKDL FT GNLSFYLGIEIRRDEDGIFYMKQEKYIEKILHRFGCKMQKSPIFLSIKAI" XX SQ Sequence 4591 BP; 1537 A; 1004 C; 1094 G; 956 T; 0 other; ggttatggac ccagccctag tgaataaatt ttttggagta tacaaataca atatcaagtg 60 tgtgacacaa cgtttgactg aaagtgatta aaagtgaagt gagtttttat aaagttcaga 120 atggcagaaa tcatctgtac cgctgttcaa aaactcaata gcgtaaacta ttcaacatgg 180 agctataaaa tacagctcat gttgatgaag gacaaagtgt gggacgtcat caaagacgag 240 ccacctgctg cagctgacac cacatacgcc gcatggaccg ctagagatga aacggctcaa 300 gtaaccatcg gcttgctggt agaggacagc caactaaggc tgataaagaa tgccaaaaca 360 gcacgcgagc agtggaacac gctgaaagaa taccaccaga agtcaaccct ctcaggtaga 420 atacatctac aacgacgttt ctaccataca gttctgcaag agggtggtga tatggtcgag 480 catatagcta cactatcaga gtacgtagat aaactttttg gcatgggtca ttctgtcgac 540 gacagtacgc tcgtaggagt cttgctatgc agtatccccg aatcttacgg cactctcatc 600 atggctttag agggtaggcc ggacaaggac cttacgccag acttcgtcaa aggaaaatta 660 atcgacgaat acatgcggcg caaggactcg cgggaagcaa ggagcagtgc atcggacgaa 720 gtagccatga acactacgta tagtgaaaag aaaggtacgc aacctagcaa gttaaagtgt 780 tttttctgca aaaagaaagg ccacgcgaaa tcagagaatg ccaaaacagc acgcgagcag 840 tggaacacgc tgaaagaata ccaccagaag tcaaccctct caggtagaat acatctacaa 900 cgacgtctct accatacagt tctgcaagag ggtggtgata tggtcgagca tatagctaca 960 ctatcagagt acgtagataa actttttggc atgggtcatt ctgtcgacga cagcacgctc 1020 gtaggagtct tgctatgcag tatccccgaa tcttacggca ctctcatcat ggctttagag 1080 ggtaggccgg acaaggacct tacgccagac ttcgtcaaag gaaaattaat cgacgaatac 1140 atgcggcgca aggactcgcg ggaagcaagg agcagtgcat cggacgaagt agccatgaac 1200 actacgtata gtgaaaagaa aggtacgcaa tctagcaagt taaagtgttt tttctgcaaa 1260 aagaaaggcc acgcgaaatc agaatgcaga aaatataaaa catggcaaga aaaacacacc 1320 gaaaaagcca acaaagtgac tagtgtgata atcgaaaacg acgaagatta tggctgttat 1380 ttagcacgag tagaagcctg tctcggtgcg aaagcagaaa tcgagaactg gatagtcgat 1440 tccggagcga caagccatat ggctcaaacg cggacttttt tcgaggagct ggacgcaggc 1500 agcaaagggt acgtacgcct cgcggatgat gagaaccgcg ttcaggtaga aggcgtaggg 1560 acagcagcca taaagtgcgc agatgccagc gaaaaaatct cgacaataaa agtgcgcgaa 1620 gcgctctacg ttcccggatt aggcgcaaat cttctgtccg tcagtaaact cacgaaggac 1680 ggatataaac gattttcgaa gagaacatct gcaagataat aaaaaatggg acagttctgg 1740 caatagcaaa ggtagcctca gacctttacg agctgagagc caaaaaatcc tttgcagcga 1800 aagcagtagc cataaagaaa agctgtgccg aggattgtca ccacgtgtgg cacagacgct 1860 tcgggcaccg ctttacaacg gcaattaagg agctggagga aaaacaactg gccactggga 1920 ttaaaatatc ggactgtgga aggaacatcg tgtgccagga ctgcattaag ggtaaattgg 1980 cccgtaaacc ttttccaaag gaatcgagga cccaaacaca cgcagccctc gatctcgtac 2040 acacagatgt ttgcggccca atgcagacgc tcactccagg gggcaaaagg tattttatga 2100 ccatcatcga cgactactcc aggtatacac atctgtatat aatgcgccac aagaacgagg 2160 ccgctggatt gattcaggag tacgtgcaga tggcaaaaac gcagtttcac aaagtgccta 2220 aaataatcag atcagatcga ggaagggagt atgtgaatac ggaactccag tcattcctca 2280 agaatcaagg catcctcata cagtatacag caccttattc acctcaacaa aacggcgtcg 2340 ccgaacgcaa gaaccgatcc ttgctcgata tgtcgagatg catgcttttc gatgcggaac 2400 tcgacataaa gtactgggga gaagcagttc acacggcaaa taatttgcaa aaccggctgc 2460 ccagcagagc aacaacaaaa actccatacg agttgtggta ctcggagaag cttcatctag 2520 cggatctgca aatcttcggc agtcccgcct atatgcacat tcccaaggaa cagcgtagaa 2580 agctggatca taaaggtaaa aaatattttt ttgtcggtta ttccgatgaa tcgaaagcct 2640 atcggctctt ggatacagaa accgacaaaa tcaagataag ccgagacgta gaattcgtgg 2700 ataaaagagg cagcaaagaa aaatcacatc catctcaaga ggaatgtgaa atttcaatac 2760 tacacaacaa gcagaacgag aacattgatc aacccgacga taatagttca attgcgtcat 2820 catcaagcga agattatgcc agcatagaag agagcagcct cgaaagcagc tcgtcagatg 2880 tgaaagaaag cagcgttcaa gatgacaacg tacctgaagc agaagcgcct ccggtcacca 2940 ggcgctccga aaggtccaat aggggcgtgc cgccggacag gtatatggct gcatccaatc 3000 tgacgagatt agagtctgag ccaagaagtg taaaagaggc attatcaaga ccagatgcac 3060 accactggaa aaaagctatg gaagaggaaa ttttttcgca cattaaaaat ggcacatggg 3120 ttatcgttca acgaccagat ggaagagaca tcataaaatg taagtgggta tttaaaatta 3180 aatacaatgc caacggagaa atagaaaggt ataaggcccg gctcgtggca tgcggttact 3240 cgcaaatata cggagaggac tacaacgaag tttttgcacc cgtcgtgaag cagacgacct 3300 ttcgcacgtt gttaacagta gcgggtcata aaggtctgca aatcaagcac ttcgatgtga 3360 aaactgcatt ccttaacggc gagcttaagg agaacatcta tatgaagcca cccgacggtt 3420 tcatcactca agacggacaa gaaatggttt gccgcttgag gaagggaatc tacggtctaa 3480 agcaagcggc aaaagtatgg aacgactgct taaatgactg tttaacagca tttggattta 3540 agcaaagcca agccgattta tgcctgtata caaaggacga caacggcgac aagacctacc 3600 tcgtagtgta cgtcgatgat tttcttatcg cagccaaaga ctcaatgaaa atcgacgaag 3660 ttgcggaatt tctgagcaca aaatttcagc tcaaagatct cggtaacctc agcttttatt 3720 tgggaatcga gataagaaga gacgaagacg gcatttttta tatgaagcaa gaaaagtaca 3780 tcgagaagat cctacacaga ttcggctgca agatgcaaaa gtctccaata ttcctctcga 3840 tcaaggccat atgaagacaa gaggcgatca atcgccaatg cccgaaagca agcgctacca 3900 gcagcttatc ggcgcattat tatatctcgc tgttaataca cggccggaca tatctgcaag 3960 tgtgactatt ttaagtcaat acaacaagga gcccggcaca gctgattgga atgaagcgaa 4020 aagggtcgcc cgctacctgc acggaacaaa aggcgccgag ctacgactcg gaaaaagggg 4080 cggcgcacct acactcatcg gctatgcaga cgcagactat gcagagaccc gacttgacag 4140 aaaatctttt agcggctata catttcaata ttgcggatca acaattagtt ggtcttgccg 4200 aaagcaatcc tgtgtgtcac agtcatcgac ggaagccgaa tatattgccc tcgcggaagc 4260 gactcaagaa ggaatatgga tccgtcgctt acttgaagat tttgaagaac gaccacaaga 4320 aaaaaccgtg atctatgagg acaatcaaag ctatcttaag ctcttagatc acaaacgatt 4380 taatcatcgc acgaagcata tcgatacaaa atatcacttc gtcaaagata ttaaggagaa 4440 gcagctgatg gattaccaat attgccccac agcagaaatg atcgccgaca tgctaactaa 4500 gccattagga aaaattaaat taagaggctt cgcagaaatg tcaggcatga taaattttta 4560 gatcaccgat gtgtgcatcg ttgaggaagg g 4591 // ID Gypsy-600_AA-LTR repbase; DNA; INV; 2032 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-600_AA_; KW Ty3_gypsy_Ele178; Gypsy-600_AA-I; Gypsy-600_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2032 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 2032 BP; 525 A; 458 C; 504 G; 544 T; 1 other; tgtaacgagc atgtcgttaa gttattttcg taaatgttta tcgttgttgg aattattaat 60 aaactacaat agaatataaa aatatataaa actttgagcc agcctaatat gaattcaccg 120 cataagcgtt catttagttc atcccaccac gctccgctgg atttatttta aatgagttcc 180 actgcaaagc gcgcaaattc tccaaccgaa agcggatgca aatatgctgg gtttcttgct 240 atgcacgcta tcggcagcgt acatcgacct gttgagaatc ctttgaggat ttattgacct 300 tttcccgatt ccttcctccc atcattttgg ccgagcagcg aggtggttga ttgaggctgt 360 tcgtggaatg agaagcgaaa atgcgcgaat atgactgttg ggagcgtgag agcggttttc 420 gggatcacga acttaccttc gtgatttccg gtgacgaaac caaattatga cgcttgatct 480 gaggagtccc aagtcttaca agatcacttc gaactgatct gtggaagagg caggagtggc 540 ccaaatttgt taatcgcgcc gtggtgttga gtagaaggta gaagcgactt gagaagaatt 600 cgatcagtgg atctttattt aatgtaaatc ctttccctag ttttgcagtt agatttgttt 660 cgtgtgcaaa tagataggcg gtgtgttatc gtgtgtggct ttgtggatta agaaggtaat 720 ttgtgcagtt atgttatatg tggataattt gtgcttacac gtgcaatgtg caatggttgt 780 aatttagatt cgtggccaag gccgtcatca cgcccgacac accacaatcc aaccctacgt 840 ggccattgct gaccccacca tcaggctccg aagtttcccc ggagcacgcc ggaagagggc 900 cgttcgccaa ttcacgcgcc ctaggcaggg catatcggcg agttcatcgt ccaagcgagc 960 acgttttccg ttccgttggt gacgaccagc ggaactcaaa cgcacgtgcc gcgtcatcgc 1020 cagcatcgat gaaccagtag aggaccatcg gttcaccccc atgacgagca gcagcagcag 1080 cgattagaag caaaccgctt tctcccgccg gcgcaaggtg gaagagttca ccgccacccc 1140 ttcccgttca tgcgtcgagc gggcgccgtc accgtcatcg ttgaaaggtc gtgagtactc 1200 actttatatg ttttatccat gcgcatgtga tttttcccgc cgcaacatag ttaccgttgc 1260 taggcgtcaa cacggcatat ttatgctagc cgtgaattaa atttcaattt gcccgtaaac 1320 cgtttgcttt gcaaacgcga gattgcacgt agaaaaccac acctagccaa gcacacacga 1380 taggtagtag acagggaccg agcacatggg aaaggtagaa agagcgtaga gagagcaaga 1440 agcgagagag gaatgaacac tttgtagtag gagaaatgaa ataaaaagct caaatgaaag 1500 ctccgtttta tggcagagga gtagaagtgt ttcaatgaac actgcgggca atgaaagttg 1560 atacccctga aaatgtgatt tkaattgagg tcccgatgaa gtgaagttgg tcgaaggttt 1620 cttcatgtcg ttcgtgttcc gtctgtagtt ttctcctgtt gtttatttta ctggggaggt 1680 tggctccgaa accaaccggg caacttttta ttcggacaca ttttggtccg aagggaactt 1740 gtaggcttgg ccagtgatga taaaaaccgt catcgctgtt ttgtcaccga ccgacgatcg 1800 atttcacgct tctggcagtt atactcagaa gtcgttatta gtagtggaat ttctgcgaac 1860 cgtttcagtc agccacctta cttcccgtcc aaatctggaa aaccgagcct tattccccct 1920 gaaaggtctc aacggtcggg caaccctaat cttggtggat agtctccctc gggagtggcg 1980 cttaagctat cccactttat aaagagggaa gtccaggagg ctgaccgtta ca 2032 // ID Gypsy5-LTR_AP repbase; DNA; INV; 172 BP. XX AC Contig832; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5AP; KW Gypsy5-I_AP; Gypsy5-LTR_AP. XX NM Gypsy5-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-172 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 446-446 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 172 BP; 38 A; 26 C; 37 G; 71 T; 0 other; tgttggctac tctgtttttt gtaaagggct tgaccattca gtcatggtta gtcagtcgca 60 gagtaccggt tgtgagctct gtttgtaaat tgtgtgttgt tcgtttatgc gtttaataaa 120 actgttttaa taagtacaat tcgtttgtct tcgtctaaac cttttagtaa ca 172 // ID Gypsy-4_BM-I repbase; DNA; INV; 3975 BP. XX AC nscaf3031; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_BM_; KW Gypsy-4_BM-LTR; Gypsy-4_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3975 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 983-983 (2010). XX DR Genome; nscaf3031; Positions 1896751 1900725. XX CC Positions [3030-3584] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1464..3974 FT /product="Gypsy-4_BM-I_1p" FT /translation="MIRDGTTRPSDSPWASALHLAPKKNGWRPCGDYRALN FT ARTIPDRYPIRHIEDYAHRLAGSSIFTKIDLVRAYNQIPVAPEDIAKTAIT FT TPFGLIEFPFMSFGLRNAAQTFQRFLDEVLRGLDCCFSYIDDILVFSRSRS FT EHLEHLEKVFKRLQDYGLVINEGKCKFGQSEMDFLGFHISADGTRPLEDKV FT AIIKEFSPPKTVSSLRRFLGMLNFYRRFIPNAAADQAPLHGMLSGSKVKGS FT HPLTWTPELLEAFNTCKASIARSTLLAHPAMNAPLALVTDASNSALGAVLQ FT QSVEGQWQPLAFFSKKLKKAQQLYSAYDRELLAIYEGVKHFRFMVEGRHFV FT IYTDHKPITYTFYQNSQKCSPRQFNHLDFMSQFTTDIRFISRKDNIPADTL FT TRIEAVSSPPDLQQLARSQENDNELQELLSKRSSSSLKMEQIPIPGTNVTL FT YCDVTLSRPRPFITKELRRQVFNSIHTMSHPGTKATTKMVMSRFIWPSARR FT DCRTWTRACEACQKSKIHRHISSPLGDFQLPQSRFSHVHIDLIGPLPSSND FT YKYCLTAVDRFTRWPEVMPLYDITAETVAKAFYDMWISRFGSPERVTTDLG FT RQFTSHLFKALTNLCGIQLCHTTAYHPSANGMVERFHRTLKTAIMAHGERR FT WTNILPVVLLGIRTAWKEDLRCSVAELVYGEPLRIPGEFLHTPKADQLTPS FT DFVSQLRKHMALLRPQAASRHSSTSIFVHDDLKTCKNVFIRKDALRASLDP FT PYTGPYRVLARSDKTLTVELNRGPVKVSIDRVKPAYITTDSTSTTVIAPEP FT AIKNPDGQRPVRTTRSGRIIKFQSFPGLPDSKGGR" XX SQ Sequence 3975 BP; 1100 A; 1020 C; 912 G; 943 T; 0 other; tagtggtgac cccgaagtga tacaatggca caaggaagta aggataacgc cgccgcggcc 60 gccgccgaaa ctttattaaa cgtcgaatcg gatccggtcg taagccgtgt agcggtgcgc 120 ttaccgccat tttggcccga ggaccctgaa gtttggttcg tgcaggcgga ggcacaattc 180 caaatatccg gtatcaagga agatactaca aagttttacc acattatctc gcagctggaa 240 cagaggtaca tccgcgaaat taaagatatc ataaaaaatc cacccgctac cggcaaatac 300 gaaaaattga aacaagaact tattaagcgt ttgtcaattt cacgagagca ccaaattacc 360 cagctgcttt cccatgaaga actgggtgat agaaaactgt cgcaattctt acggcatttg 420 aaaactctcg ccgccaacga agtttcagac gaatttttac gtagtatgtg gtcgagccgc 480 ttgccacctc acatacaagc tataattgtt tcgcatacta tcggcacctt ggaagatgtt 540 gcggagctag cagacaagat atacgaggtc gtgacgccgg cgccgttaca acaagttgcg 600 agtgcagccg ccggttccag tagcttcgat ggcttagtga agcgcctcga cgagatgatt 660 gcgtcacgag ttaagacaga gttgcaacag cagatcgctc agataaactt gaatcgtcgc 720 agccgttctg tatcgcgtga cggctaccgc gcgaggagaa ggagccgaag ccggacgcca 780 ggcgtgtgct ggtaccacaa cactttcgga gacaaggcca ggaagtgcac aaccccctgc 840 aactataagg gaaacttaca aggcagttcg tagaagcggc ggacgactgc cagagtgtta 900 ctagtcgcct ttttgtcact gacaggaaga caaaggtgca gtacttaatc gacaccggct 960 cggatctttg cgtactgcca cgtcggtttc tgcgtcagtc acgggaaccc gcggactaca 1020 agctcaccgc tgctaatgga agcgttatca acacctacgg aacctcatct atacatttgg 1080 acctagggtt gcgacgcgac ttcacctgga attttgtagt tgccaacgtg aacggaccaa 1140 tcatcggagc tgattttatt tcacattatg gtttactcgt agattgcaag aacggacggt 1200 tgcttgacaa cgtgactacc ctttcaacaa ccggcatcgt tcgtgggtgt gaacaacata 1260 gcatcaaggc gatatcaggt actacagaat ttcataaact tttatctaaa taccctgact 1320 taactaagcc ttctggaata ttccgagaga tcaagcactc gactatacac tacatacata 1380 caacacctgg ccctccagta ttttgcagac cgcgacgcct ggctcctgac cgtctcaaga 1440 ttgcgaaaga gcagttcgac gatatgattc gggatggtac aacgcgacct tcagactcac 1500 cttgggcatc ggcgttacac ctggcaccta agaagaatgg atggagaccc tgtggggatt 1560 atcgcgcact caacgcacgc acaatacctg acaggtaccc catcaggcac atagaggact 1620 atgcacacag gctggcgggt agctcaatat ttacaaaaat tgacctggtt cgggcctata 1680 atcaaattcc cgtggctccc gaagacatcg cgaagaccgc tatcacaacc ccattcggat 1740 tgatagaatt cccatttatg tcttttggac ttaggaacgc agctcaaact ttccagaggt 1800 tccttgacga agttttacga ggcctggact gctgtttctc gtatattgac gacattttag 1860 tcttttctag aagcagatcc gaacacctgg aacacctgga aaaagttttc aaacgcctgc 1920 aagactatgg cctcgttatc aacgaaggaa agtgcaagtt cggccaaagc gagatggatt 1980 ttctcggatt ccacatttcc gcagacggta cacgcccatt ggaagacaag gtagcaatta 2040 taaaggaatt ttctcctcca aaaaccgtta gcagtttacg tagattcctt ggaatgttaa 2100 acttctatcg gcgtttcatt ccaaacgctg ctgctgatca agcacctctc catggcatgc 2160 tgtcgggatc aaaggtcaaa ggatcacacc cacttacctg gacgccggaa cttctcgaag 2220 ctttcaacac ctgtaaagca agtatagcta gaagtactct cctcgctcac ccagccatga 2280 acgcgccgct cgctctcgtg acagatgcct caaattctgc cttgggcgcg gtgctacagc 2340 agtcagtgga aggacaatgg caaccactcg ccttcttctc gaagaaactt aaaaaagctc 2400 agcagctgta cagcgcctat gaccgagagt tgctcgctat atacgagggc gttaaacatt 2460 tccgttttat ggttgaaggc agacacttcg tcatttatac ggatcataaa cccatcacat 2520 acacctttta tcagaatagc cagaagtgct cgcccagaca gtttaaccac ctcgatttca 2580 tgtctcaatt caccacagac ataaggttta tatcgcggaa ggataatata cctgcggaca 2640 ctttgacgcg cattgaagcc gtatcatcgc caccggactt acaacagctt gctcgttcac 2700 aggagaatga caacgagctc caggaactcc tctcaaagcg ctcatcttct tcgttaaaga 2760 tggaacagat acctatacct ggaacgaacg ttactttata ctgtgacgtc actctgtcga 2820 ggcctcgtcc ttttataaca aaggagttgc gtcggcaggt cttcaactca attcatacca 2880 tgagtcaccc tggtacgaaa gcgacaacca agatggtaat gagtaggttt atttggcctt 2940 cagctcgacg ggattgtcgt acttggactc gggcctgcga agcctgtcag aaatcgaaga 3000 tccacaggca tatttcatct ccgctgggtg atttccagtt accgcaatca cgtttcagcc 3060 acgtacacat tgatttgatt ggaccgctac catcttcgaa tgactacaag tactgtctta 3120 cagcagtaga caggttcaca agatggccag aggtaatgcc actatacgat atcacggctg 3180 aaacggtggc aaaagctttt tatgatatgt ggatctcacg attcggttct ccggaaagag 3240 taaccacgga tttgggtcgc cagttcacat cccatttgtt caaagcactc acaaacctat 3300 gcgggatcca gctatgccac acaaccgcat atcacccatc tgcgaatgga atggtggaga 3360 gattccatcg taccctaaag accgcgatca tggcacatgg cgagcggaga tggacgaaca 3420 ttttacctgt tgttttgctt ggaataagaa cggcttggaa ggaagatctt cgttgctccg 3480 ttgctgaatt ggtttacggc gagcctctgc ggatacctgg cgagtttcta cacacaccca 3540 aagctgatca gttgacacca tcagacttcg tctcacaact caggaagcac atggcattac 3600 tgagacctca ggcagcatcg cgacactctt caacatctat attcgtacat gatgatctaa 3660 agacgtgcaa gaacgttttc ataagaaaag acgctttacg agcttcgcta gatccgccgt 3720 acactggccc atatcgtgtg cttgccaggt ccgataagac cctcactgtt gaattaaaca 3780 gaggaccagt gaaggtctct attgatcgcg taaagcctgc ctacatcaca acggactcaa 3840 ccagcaccac ggtaatagca ccggagccgg cgattaaaaa ccctgatggt caacgccctg 3900 ttagaacaac aaggtcaggg cgcataatca aattccaaag ttttccaggt cttccagact 3960 ccaagggggg tcgtg 3975 // ID hATm-26_HM repbase; DNA; INV; 3747 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-26_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3747 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1920-1920 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(911..1828,1956..2957) FT /product="hATm-26_HM_1p" FT /translation="MYKSIITYESNTKFAYYRTSFNMAIRLVRTRMMLTCP FT VFGIPKEMSQVVLPTYEDVMKHYLLVKHQLKPTAATKEPTITEISEKVAQV FT VEKIWLKPMITHTRVLKVIRSYHDKYRKLFRNIKRSNYDSKVNAFKEDARH FT KLFDICTCKCVFEKCVCNKTRKVPPAEQDFIQDQRTLKLMCIGNVDKLASN FT KLAMKFKKKSIEQRRRNKHRNALSVSTVGDFESRSTAQSDEDDNYINKRNI FT SDVSSIDIISSSYFQCRQPLAALANVCDRYGISDRAGAAIANAVLEDFSVV FT NQQDTSYVVDPSKIKEFGIKEGLKFYRRSITQEHISLIXEPGSKYLGHLSP FT SGSSAYLIKSSIVNFITIKNIDVKKFVAVGSDGTAVNTGIKGSAIRLMEKE FT YNKSLQWLVCLLHANELPLRHLLIYLDGSTTGPKAFSCPIGKALINCEKLP FT VIEFEKIDVTLPKVELKDLSTDQKYMWEMCEAISKGECSLPLSKKTLEVLN FT HFRWLTTANRLLRLYVASEDSSTNLIHLVTYVVRVYSPVWFSIKMHPSCKD FT GARHLFKLIQQSRYLSQELRDIVDPVLQLNGYFAHSENLLLAMVSDKRQQI FT RDLGLRRILKTRQEKDHTLRTFSLPKLNLDAHDYIELINWK*" XX SQ Sequence 3747 BP; 1250 A; 614 C; 678 G; 1202 T; 3 other; gggtgcagtg aaactttagg tgaaaatata aatagtaaat tattgatatg taatagcaac 60 aaaaagttgc cttttgatat gtgttcaaat aaagtgaata aaatttcctc aaaattttaa 120 tgtcttcagc tgcctcaagg gggtcaaagt tactgaaaaa taggtatttt tcatgaaaac 180 agcctgtttt agccataact ttttaaaata aaaaactgta tttttttctt tattaatacc 240 ctaagcctaa ttactaatag tattgtttca cccagtactg tattaagggg gctagtttgg 300 gggtgggggg ctaggggcta tagccccagg ccccccaaaa gagtcaagag aagttaagca 360 gccgtggcgc agtggtagcg cacttgcctc agaaatgtaa gatccgtggt tcaaacccca 420 cctctaagca agttttgcga catcggttgg gaaggaggcg tgaacttcct attaaatgct 480 cttccgcggt gctctgtgat aagaccgtaa ggacttcttg gggcacctaa aatcaacttt 540 taaaaaaaaa aaaaaggttt ttttttttag tcatcttcac gctctgacac ataaaaaagg 600 gccctgaaaa gtctaaattc gccattggtt tcacctaacc tgttttggtt atcttggtta 660 tttcaccttt ataactgttc aataaatgtt taaaattaaa tgttttaaac tacttgttaa 720 ctttatgtta tattgacgta aaatgtgttt tattaacctt tttttagtta ttatttagaa 780 ataattatct aagtggttca ttatatttct tttaatactt ctcaatatat ttctttgcat 840 atcaagaaat tgttactaat gtttgatatt ttagtttagc ttttaatttt atttatggct 900 tgctagatta atgtataaat caataattac ttatgaatct aatacaaagt ttgcttatta 960 tagaacaagt tttaatatgg ccattaggtt agtgagaacg aggatgatgc tgacttgtcc 1020 tgtctttggt atacctaaag agatgtcgca agttgtttta ccaacatatg aagatgtaat 1080 gaaacattac ttgcttgtaa aacatcaact gaaaccaaca gctgctacga aagaacctac 1140 tataacagaa atctctgaaa aagttgcaca agttgttgaa aaaatatggt taaaaccgat 1200 gattactcat acacgggtat taaaggtgat tcgctcctat catgataaat acagaaaact 1260 atttcgaaac atcaagcgca gtaattatga tagcaaagta aacgctttta aagaagatgc 1320 gagacataaa ttgtttgaca tctgcacctg taagtgtgtg tttgagaagt gtgtatgtaa 1380 caaaactcgc aaggttcctc ctgcagagca agattttatc caagatcaga gaactctcaa 1440 attgatgtgc attggcaatg ttgataaact tgcatcaaat aaactggcca tgaagtttaa 1500 aaaaaaaagt attgagcagc ggcgtagaaa taagcatcgc aatgccttgt ctgtgtctac 1560 ggttggtgat tttgaaagtc gatctacagc tcaaagtgat gaagatgata attacattaa 1620 taaacgtaat atttccgatg tttcttccat tgacattatt agttcttcgt attttcaatg 1680 tcggcaacca cttgctgcat tagcaaacgt ctgtgaccgg tatgggatct cagatagagc 1740 cggagctgca atagcaaatg cagttttgga agactttagt gttgttaatc aacaagatac 1800 ctcttatgtt gttgatccga gtaaaatatg acgggagcgt aagagaaaac gtaatcaact 1860 aaaaacttct aaaaattcaa aaatagttcg aggaatttat tttgatggac gaaaagacaa 1920 aactcttgaa aacatcaagg aagggttaaa gttgaaagga atttggtatc aaggaagggt 1980 taaagttcta tcgacgaagt attactcaag agcacatctc tttaatagam gaaccagggt 2040 ctaaatacct tgggcattta tcaccatctg gatcgtctgc ctatttgata aaaagttcta 2100 tagtcaactt tataactatc aaaaatattg acgttaaaaa gtttgtagct gttggatctg 2160 acggaactgc cgtaaacaca ggaattaaag gtagcgcaat aaggttgatg gaaaaagaat 2220 ataacaaatc gttgcaatgg ttggtttgtc ttttgcatgc aaacgaactt cctcttcgac 2280 atctgctaat ttatttggat ggatcaacta caggaccaaa agcattttct tgcccaattg 2340 gtaaggcact tattaattgt gagaaattac cagttattga atttgaaaag attgatgtaa 2400 ctcttccaaa agttgaactt aaagatttga gcacagatca aaaatatatg tgggaaatgt 2460 gcgaagcaat atccaaaggt gagtgttcac tacccctatc aaaaaaaact ttagaggtac 2520 taaatcactt ccgatggctg acaacagcga atagacttct acggttgtat gtggcgagcg 2580 aggattcatc aacaaatctg attcatttag taacttatgt cgtaagagtt tattctcctg 2640 tttggtttag tattaagatg catccttcat gcaaagatgg tgcgcggcat ctttttaaac 2700 ttattcaaca gtcacgttac ttatcgcagg aactgcgaga tatagtagat cctgtattac 2760 aacttaacgg ctattttgct cattcagaaa atctcttact agcaatggtt agtgacaaac 2820 gccaacaaat tcgtgattta ggattaaggc gcattcttaa aacgcgacag gaaaaagatc 2880 acacactgcg tactttcagt ttaccaaagc ttaacttaga tgcacatgat tacatagaac 2940 tcatcaactg gaaataaaat aaaataaccg agccaccgtt gacagctgat gaggcttcgc 3000 tggctgtatg tggagaaaaa actcgagatg gttttattcg atcgcgttta catgcacgtc 3060 tcataatgcc agtgtttaac acaaagtcag aatatcgtgc ttccgaacca ataaattggt 3120 agagtgtaat gacagatttt ctaaaacttg attgtgtatg ttaaaataat taatgaatag 3180 atatttttgt ttcacccttt aacttgttat agacttgctg acttactaac ttgctaatct 3240 gccacagtga cggagagagg gaaggggaag cgcagaggac catgtcccct arttattttt 3300 aaaaagacct ttttttattt ttacaaagtc ggtgtttaaa gtagtactta ttgacatttt 3360 aaattttttt agtgcattga tcccaatccg cccccctttt ccccacgaaa gtgtcacgcc 3420 stgaaatata tataacctta aaaggtcatc gaaatatgtt catattagat aaaaaagaat 3480 gacatatatg tcattctttt taattattac agttttttct gaaagaattt gacaaaaaag 3540 tataatcttt aatcaaaaaa aataattgac taaaacactc aattgtcata aaaaatacct 3600 ctttttagta actttgtccc ctatgaggca gctgaagaca ttgaaatttt tgaaaatttt 3660 tttttcaatg tttgaacact taccaaaaga caactttttc ttgctattac atataaatat 3720 tccactttta attttttcac tgcactc 3747 // ID Saci-2_I repbase; DNA; INV; 4216 BP. XX AC BK004069; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 29-MAR-2007 (Rel. 12.04, Last updated, Version 4) XX DE Schistosoma mansoni Saci-2 LTR retrotransposon: internal DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Saci-2_INT; KW internal portion; Saci-2_I. XX NM Saci-2_I; Saci-2_INT. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX DR Genbank; BK004069; Positions 260 4475. XX CC Key Location/Qualifiers CC CDS 9..4157 CC /codon_start=9 CC /product="pol polyprotein" CC /protein_id="DAA04499.1" CC /db_xref="GI:44829171" CC /translation="MEVFTKKRDENLNEPPVIPVKPTTGSAPRQPPLLENSSDYTVWK CC FKVAAYLRKVPASEQFDYLVSYLSDEATKRAIVNGFAADNTLAQNWKILDDCFSTPVD CC PQQAAIRFLSRHQTPGENPMDYFNSLQQMAVQAFPCLDATGRDELIKSRFVEGLLSSS CC LKEHFLLNPPTDINSLKRVTFRFMAAEQLKSPLDMHQPTAMAVMQPTKASRPLNPQRI CC SNVTRSSHWNSSRRQFNPTEKKWRPTNSYHGRQSECPYCRRFGKNAKKCGHNRFGNTS CC KPYIFHLSSYANHVRPITIKARVQGIDIEVLLDTGASVSLIKSEFLGRLNRKVQQTRH CC PSTLLTASGDPLKVNSKVWLDLMIDRHSFRHGFFVCPTLTWDMILGVDFMLEHKACIF CC MDSLEAKFGETKIRLHHTNTPFKAVSYVGISDLVRQVNDNRTLKEKDRHATIAVPQQF CC SSVFEESISGNRTSTVQHTICTGDHRPLRQPPRRVPVHYQPQLDTMIKDMLDKNIIVP CC SSSPWASPIVLVKKKDSSLRLCIDYRRLNAITKRDSFPLPRIDAILDALSGACWFSTL CC DLASGYWQVEVRPQDRKKTAFVVPNGLYEFQVMPFGLTNAPATFQRLMQTVLQDTVPH CC KCLIYLDDIIVYGSTPEQHNANLKAVLHRLQQHNLKVKPSKCRLLQKEVVFLGHRITA CC DGVGTDKEKTRVIVNWPQPKSPEDVRSFLGLASYYRRFVRDFASLAAPLHRLTHKGRK CC FLWTTECQQAFDALKTRLSSPPILAVPDTSANGGEFILDTDASSSAIGAVLSQVAPDG CC QERVIAYASRRLDKSETRYSTTRREMLALVKFLQHFRHYLLGKPFRVRTDHRALQWLR CC SFREPEGQVARWQERLQEFDFTCEYRPGSRHTNADALSRIPQTAGTVNAVLSTAVEID CC WPSLQAADPDMQIIYQRQLQGNNKPSMKELKDQSLTTRRICTKWSNLKLYGDTLFLIN CC EARQPLLVVPRIKVESIVEQVHRKLGHAGERRTEYAVRQRFWWLSMHEDVVQICKNCN CC TCCRFKSPRQTSRAPLTPMVTTGPHQRVGIDIMGPLTTTKKGNRYILVMVDYFTKWCE CC AVPLPQQDALTVARAFIDHWVSRYGAPFSLHSDQGPAFESRLIAEICQLLGVRKTRTT CC AYHPEGNGLVERTNRSIKAILQAFVSRSSQELWDDVLPQCLLAYRTSVHGSTGFSPAI CC LLFGHELRLPVEIQTPLLPCEEQEHVPYIRTLRNRLADAYRLVSSNLRKASEHQKDLY CC DRRVHGPVYKVGDRVWLRRPMASSGSCSKFHQPWQGPVEIVLIRSSTTFVLRNLQRPQ CC DDVITVHYNQIKPDRMTVSSDAQFANPPPATTFYEVPSEGGTAYPCPRPGTEDSAYLR CC EGAV". XX SQ Sequence 4216 BP; 1225 A; 1018 C; 930 G; 1043 T; 0 other; gagtggtgat ggaggtgttc actaaaaaac gcgatgaaaa cctgaacgaa ccgccggtga 60 ttccagtcaa gccaactact ggctcagctc cccgtcaacc gccactcctg gaaaactcgt 120 cagactatac agtgtggaag ttcaaagttg cagcatatct tcgtaaagtt ccagcttcgg 180 aacaattcga ttatctggtg tcctacctca gtgacgaagc taccaaaaga gctatagtta 240 acgggtttgc ggcggacaat actctcgctc agaactggaa gattctcgat gattgtttct 300 cgacgccggt ggatccccaa caagcagcca tccgtttttt gtcacgacat caaacacccg 360 gcgagaatcc aatggattat tttaattctt tgcaacagat ggctgtgcaa gcgtttcctt 420 gtcttgatgc cactggaaga gatgaattga tcaagtctcg gttcgtggaa gggttactat 480 ccagttcctt aaaggaacac ttcctactaa accctccaac agatataaat tccttaaaaa 540 gagttacatt ccggttcatg gcagccgagc agctaaaaag cccacttgac atgcaccagc 600 caactgcgat ggcagtaatg cagcccacaa aagcgagtcg cccactgaac cctcaacgca 660 tatctaacgt tacccggtct tcccactgga acagcagtcg aagacagttc aatcccactg 720 agaagaagtg gagaccaacg aatagttatc atggtcgtca atccgaatgc ccgtactgtc 780 gaaggtttgg caaaaacgca aagaaatgtg gtcataatcg ttttggtaac acaagtaagc 840 cgtacatttt ccatttatct tcttatgcta accatgtgcg tcccatcacc attaaggcta 900 gagtacaggg tatcgatatc gaagttttat tagacaccgg agcgtcagta tccctaatta 960 agagcgagtt tttaggaagg cttaatcgta aagtccaaca gacgcgacac ccatccacat 1020 tactcacagc tagcggagac ccattgaaag tgaactctaa ggtatggttg gacctaatga 1080 tagatagaca ttcgttccgt catgggtttt tcgtttgtcc gactttgact tgggatatga 1140 tactcggagt cgacttcatg ttagaacata aagcttgtat atttatggac agcttggaag 1200 ctaaatttgg agagactaaa atacgcttgc accatactaa cacgccattt aaagctgtgt 1260 catatgtggg aatctcagac ctggtgcggc aagtaaacga caaccgaaca ctaaaagaga 1320 aagatcgaca tgcaacgatt gccgtgcctc aacagtttag ttcggtattc gaagagagta 1380 tttccggcaa tcgcacgagt acggtgcaac acaccatttg cacgggggac cacagaccac 1440 ttaggcaacc gcctcgtaga gtgccagtac actatcaacc acaactggat acgatgataa 1500 aggatatgct tgacaagaat attatagtac cgtcatcatc cccctgggca tcacctatcg 1560 tcctggttaa aaagaaagat tcctctctgc gtctctgcat agactatcga cgccttaacg 1620 ctataactaa gcgtgactct tttcctcttc cgagaatcga tgctatatta gacgctttga 1680 gtggcgcatg ttggttttca acgctagacc tagcatcagg gtactggcaa gtggaagtca 1740 gaccacaaga tagaaagaaa accgcatttg tagtgccaaa tggtttatac gaattccaag 1800 taatgccttt cggtctaacg aatgccccag ctaccttcca aagattaatg caaacagttt 1860 tacaagatac agtacctcat aaatgtttga tttatctcga tgacattatt gtgtatggta 1920 gcacacctga gcaacataat gctaatctga aagcagttct ccatcgcctt caacaacaca 1980 acctaaaagt aaaaccgtcc aaatgtcgtc tgctgcagaa agaagtagtt ttcttaggac 2040 atcgtatcac tgcagatgga gtaggcaccg ataaagaaaa gacacgtgtc attgttaatt 2100 ggccacagcc caaatcacct gaagacgttc gtagctttct tggcctagct tcttactaca 2160 gacgctttgt ccgtgacttt gcatcattag cagcaccctt acaccgtttg acgcacaaag 2220 gacgaaaatt tttatggact acagagtgtc aacaggcgtt cgatgctttg aagacacgac 2280 taagctctcc acctatatta gccgttccgg atacctcagc aaacggggga gaatttatac 2340 ttgatacgga tgccagttcc tccgctattg gagcagtcct atcacaagtg gccccagatg 2400 ggcaagaaag agttattgcg tacgctagcc gtcgactgga taagagcgaa acaagatatt 2460 cgacaacgcg tcgagagatg ttagcattag taaaattctt acaacacttc cgccattatt 2520 tactaggtaa acctttccgt gtacgcacag accaccgtgc attacaatgg ctccgctcct 2580 tccgcgaacc agaaggacaa gtggcacgct ggcaagaacg attacaagag ttcgatttca 2640 catgcgagta ccgacctgga agccgacata caaacgcaga cgctctttcc cgcatacctc 2700 aaaccgcagg tactgttaat gcagttctta gcacagccgt agagatcgat tggccttccc 2760 tgcaagcagc agacccagac atgcagatta tatatcaaag acagttacag ggaaacaata 2820 aaccatccat gaaagagtta aaggatcaat cattaacaac tcgtcgtatt tgcactaaat 2880 ggagcaatct aaaattatat ggtgacacgt tattcttaat taacgaagcg agacaacctc 2940 tattggtagt accacggata aaagttgaga gcattgtgga gcaggtgcac cgcaaactag 3000 gtcatgcagg agaacgaagg acggaatacg cggtccgcca gcgcttttgg tggctatcca 3060 tgcatgaaga tgtagtgcaa atttgcaaaa attgtaacac gtgctgccgt ttcaaatcgc 3120 ctcgacagac atctcgtgca ccgttaactc ccatggtgac cacaggacca catcagcggg 3180 tgggcataga tattatggga ccattaacaa caacaaagaa agggaatcgc tacatactgg 3240 ttatggtaga ttactttact aaatggtgcg aagcagtacc ccttccccag caagacgctc 3300 ttacggtcgc ccgagctttc atcgatcatt gggtttcccg ttatggcgct cccttctcac 3360 ttcattctga tcaaggtcca gcctttgaaa gtcgcctcat cgccgagata tgccagcttt 3420 tgggagtcag gaagacacga accactgcgt atcacccaga aggtaacggg cttgtagaaa 3480 gaacaaacag aagtattaaa gccatattgc aagcgtttgt gagcagatcg tcgcaggagt 3540 tatgggatga tgtccttcct cagtgccttt tagcatatcg tacatccgta catggctcca 3600 ccgggttttc acccgctatt ttattatttg gacatgaatt acgtctacca gtggaaatac 3660 agacgcccct gctgccctgc gaagaacaag agcatgtacc gtacatacgt actctccgta 3720 accgcttggc tgatgcatac cgcttggtca gcagcaattt gcgcaaagct agtgaacatc 3780 aaaaggactt gtacgatcgt cgggtgcacg gaccagtgta taaggtcggt gatcgcgttt 3840 ggctacgtcg ccccatggca tcgtcaggga gttgcagtaa atttcaccag ccatggcaag 3900 gccccgttga aatcgtcctc atccgatcct ccactacgtt cgtattacgt aacttgcaac 3960 gaccccagga tgacgtgata actgtacact ataaccaaat aaaaccagat aggatgacag 4020 tttcctcgga tgcccagttc gccaatccac cgcctgccac tacgttttat gaggtaccat 4080 cagaaggggg gacagcatat ccatgtccga gaccaggcac tgaggacagt gcctatttaa 4140 gagagggggc ggtgtaacgg ggttgaaatt tctatgtatg atatactaat gcattctatt 4200 ttctttctat tgtaga 4216 // ID Copia2-NVi_I repbase; DNA; INV; 4116 BP. XX AC AAZX01002725; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2-NVi; KW Copia2-NVi_I; Copia2-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4116 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1109-1109 (2007). XX DR Genome; AAZX01002725; Positions 4506 8621. XX CC Positions [1646-2059] - Integrase core CC 'ATAAA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1676..3349 FT /product="Copia2-NVi_I_1p" FT /translation="MVSFLKHKADVYDRFKAFEHQVNTKFNRPMKVLRSDN FT GTEYCNRSMRKLLEDKGIKHETSAPYTPQQNGKSERANRTIVESARTMMLR FT ANAPRFLWAEAINTAVYLLNRATSDKESNSTPFEAWTGQKPDYSHLKVFGS FT PAYVHIQKMFRKKMDAKAFKATFVGYEGDSRNYRFYNPATKKVIQSREADF FT LEDRVGEASEDAVDSATIWKHKGQPWEMEREEAVERYSPAPPSLIPEPPSS FT PPSPRRIADHAPRPVASPPPEVEPVPAKTTKAANSNDRQLRDRSNIKKPSK FT YQANIIEYNDPQTYNDELQAHAKNGTWELVPKPTGHSIIDSKWVLKYQAVK FT AGKESRCKARLCARGFGQEYGKNYQETFAPVVRYDALRMLLAIANQEDYEI FT VQFDVKTAFLHGLLKDEVYMAIPEGLNIRGEVDNVLCKLKRALYGLKQASR FT CWNITFKNCMLDLNFKPCDSEKSIFVSEKNSELVYIILFFDDGLIMAKQSN FT VLTNIISALKERFEITVCEPRTFVGMQIERDRANRTMFLHHSEYAWKILKR FT FNMLDAKTVCT" XX SQ Sequence 4116 BP; 1361 A; 796 C; 970 G; 989 T; 0 other; ggctatgggc cgaggagcgt accagagtaa tagtgcaagt gtgaaaagtg ataacattga 60 ttgtattaaa tttaagtata ccgcaaaacg aagcgaagac aatcaacatg acaacagcga 120 atgaatcacc gatacaccgc aacaaactcg atggcacaaa ctatcaatta tggaaattcc 180 aaatgcgcgt aacactaatg gctaatggcg tatttgaagt agtcgacgga accaccgaga 240 agccagcaga tttaacgaat accgatggta agaaatggat tgcagacaac gcaaaggcta 300 tgtgcttatt ggcaggctct atgatgccgc ttcatcttga aaattgtatt actagcgata 360 ctgcccacga gatgtggatg aaggtcaagc tcattcacga gcaaaaatcc gaagctaata 420 aagcgagctt gctccagaag ttttatgcat gtcgaatgga ggcaaccgag tcgatggtgc 480 aattttccac gaaagtgatg aatatggctc gaatgttaga agacttagaa gaaaaagtat 540 cggagcttgg aatcgttgct aaaatcctgg gcagctcacc aaagaaatat aataattttg 600 tgacagcatg ggacagtgtc gatgccaagg atcaaaccct aaataagctc caggagcgat 660 taattaaaga agaaaaacgt cttagcgaaa ttgaagaaga aacgagcgcg tttatagtta 720 atacgaagcc taatatttat gaaaaacaaa gattcgatga acgaacgacg agatcacacg 780 ataacagaac ataccaagga agaaagacaa aaacaggatt aatatgttat tattgtaaga 840 agactggtca cttcgcgaga aactgtccca agaagaagag aaactatcat aaaggtaaac 900 aaaactccga atcaaacgcg atcgcccttg tagcgacaat taatgataga ccagcgagaa 960 attgctagaa tattaaagct tgatagcacg gacgtttggc tgaccgacag tggcgcctca 1020 agacatttaa cttatcgaag aaattggttg caagattttt cggtatgcag cggcgaaagc 1080 gtgactctcg gggacaacga agagtgtaaa attgaaggca aaggaacaat ttttattgaa 1140 aagcatgtcg atggcgaatg gatttctgga aaaatcgtag acgtgctcta tgttccacgt 1200 ttgcgaaaga atcttgtttc ggtcggagta tgtacatcga aaggataccg agttagcttc 1260 acaaaaaaag tggagttttt ctttgaaggt atcaaggtcg cacaaggtat taagcagggg 1320 aatgaaatat atcggatgct gtttcgagtg gtcgagcgac ctgaagcaaa tgtatctaca 1380 gtcagcctgc aacggttaca cgagcgctgt agccacgtca atgaaaaaac gctgttaagt 1440 atggtaaaaa cgggcgctgt aaagggtgtt gatgtcaaag cagaatcgat tttctcgtgc 1500 gactcatgca agattgaaaa gtcgcaccta cacccatata agaaggacca ggagcatcgt 1560 aaatgactgc cgggtgaact gtttcattcc gacgtgtgcg gtccgatgag cgtaacatca 1620 ctaggaggtg cgaggtattt tctaactttt tattgaccac gcttctaact acagaatggt 1680 atctttcctc aaacacaagg cggatgtcta tgatcgattt aaagcatttg aacatcaagt 1740 caacacgaag ttcaatcgtc caatgaaagt tctgagaagc gacaatggca ccgagtattg 1800 taacagatcg atgagaaaat tgttggaaga taaaggtatt aagcacgaaa cgtctgcgcc 1860 ctacacgccg cagcaaaatg ggaagtctga acgcgccaat aggaccattg tagaaagcgc 1920 acggacaatg atgctcagag cgaacgcacc gagatttcta tgggctgaag caatcaatac 1980 agctgtgtac ctgttaaata gagcaacctc agacaaggaa tcaaactcta cgcctttcga 2040 ggcttggacc ggacaaaaac ccgactacag ccatctgaaa gtattcggat ctccagccta 2100 cgtacacata caaaaaatgt ttcgtaagaa aatggatgca aaagcattca aggcaacctt 2160 tgttggctac gaaggagatt ccaggaatta taggttctat aatccggcaa ccaagaaagt 2220 aattcagtca agagaggcgg atttccttga agatcgtgtc ggagaagcat cagaagatgc 2280 ggtcgattca gcaactattt ggaagcataa aggtcaacct tgggaaatgg aacgggagga 2340 agctgttgag agatactctc cagcaccacc atctctaatt ccagaaccgc cgtcgtcacc 2400 accgtctcct cgacgaatcg cggaccatgc gccaaggcca gtagcgtcgc caccaccgga 2460 ggtcgagcca gtacccgcta aaacgactaa agcagccaac tcaaacgatc gccagctgag 2520 ggaccggagc aacataaaaa aaccaagcaa gtatcaagct aatataatcg agtataatga 2580 tccacagacc tataatgacg aactacaggc gcatgctaaa aacggcacat gggagctggt 2640 accaaagcct accggtcaca gcataatcga ttcgaaatgg gtcctgaagt atcaggcagt 2700 gaaggccggt aaagagagtc gctgtaaagc gcgcctctgt gctcgaggtt ttggacagga 2760 gtatggcaaa aattaccagg aaacttttgc cccggtagtt agatatgacg ccttgcgaat 2820 gctgcttgct attgctaatc aagaagatta cgagattgtc caattcgacg taaaaacggc 2880 ctttttgcac ggtcttttaa aagacgaagt atatatggcc attccggagg gactaaatat 2940 aagaggcgaa gttgacaatg tgttatgtaa gttaaaaagg gccttgtatg gcctaaaaca 3000 agcttccaga tgttggaata ttacctttaa aaattgtatg ttagacctaa actttaaacc 3060 gtgcgactca gaaaaatcta tatttgtaag cgagaaaaat agcgagcttg tatatattat 3120 attatttttc gacgatggtc tcattatggc caaacaaagt aatgtattga ctaatataat 3180 tagtgcgcta aaggaaagat ttgaaattac tgtgtgcgaa ccgcgtacat ttgttggtat 3240 gcagattgaa cgcgatcggg ctaatcgaac catgtttctt catcattcag agtatgcgtg 3300 gaaaattttg aagcgcttca acatgctgga cgccaaaaca gtatgtacct gagtatgaga 3360 agggaataga tctgaactcg atgaaacaac atgattctga gacagtgaaa ttgccttacc 3420 gagaactaat tggctcttta atgtttctat gtacggtcac gcgttttgac atgatgtacg 3480 gagtaaattt atttagtcgt ttcttagata attatacgat tgcacattgg acagcagcga 3540 aacgcatttt gcgatatctt agaggtacaa tcaatcacgg tatactgttc aaaaacagcg 3600 ggagtaatca cgaattaatt gggttttgtg attcggatta cgcgggcgac acagaaacgc 3660 gacgatctac gtccggatac atttttcgtt actgtggcgg tccgatttcg tggagtgttc 3720 aaagacaaaa acgtgtaaca ctcagcacca ccgaagccca atacgtctcg gctagcaatg 3780 ccacgcgaga ggtcgtatgg ttgagagagc ttttgaggga tgttggattt ccatgcacaa 3840 agcctaccat tttaaacatt gataaccaag gtgcaattca attgataaaa aaccctgttt 3900 ttcataggcg ttctaagcat atagaagtcc agcatcactt cgttagagag aagtatgagt 3960 gtggagcgat cgatggaagt atgtaccaag cgaaaatcag ctcgccgatg tcttcacaaa 4020 agcattagcg cgggagcttt ttgtaaaatt atgtaagaat ataggtttaa gctttgttga 4080 aaatcttaaa gcgattgatg tctcggaaag cgggag 4116 // ID BEL-163_AA-I repbase; DNA; INV; 5706 BP. XX AC AAGE02018617; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-163_AA_; KW BEL-163_AA-LTR; BEL-163_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5706 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018617; Positions 29424 23719. XX CC 'AGGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 285..5681 FT /product="BEL-163_AA-I_1p" FT /translation="MPLEHSPPKGDPPKDAASEQHGLPSRTLPPENSPPFH FT GFPEWEVAMDAATAKEMQGLYRQRNQAKKKVIRIQRTLIDNESFGLAQLNV FT FSKSLSAIYAEYSGFHSKVLALVPDEGIEEQENEYASFEDLHYAVSEELEE FT LLLPAKRNTAPPPGQSITPKVVIQQQPLSVPIPSFDGSYSGWPKFKAIFQD FT LMAHSGDTDAIKLYHLDKALIGEAAGLLDAKILSEGDYKQAWSVLEDRYEN FT KRVLVETHIRGLFSLREMASESHKELRAHLSTVTHHVEGLKFLGQQITGIS FT EHMIVYLVVSALDKSTRKAWEGTQKRGELPKYDPTIAFLKSRCQILENCET FT AFETTKSSAKQKLIHPPSKVHPQKSHAASTTPSQPTKACEICGGSHLNFQC FT SALSSLTPAQRNEKIRAAGVCFNCLRKGHRSKDCPSDKTCRKCQYRHHTLL FT HDDGAPNRTTTSNVSLPAETVADPPVVPAVPSQPSPVDPPVSTTCSSNLVQ FT STKTVLLLTAVVQAYDRKGQPHPCRVLLDSGSQVNFVTEEMANRLGLKKNP FT ANVPIVGINALRSLARDKVTIKIRSRVSSFQANLECLVTPRVTGSIPASKI FT DISNWKFPEGMALADPKFHTPDKVDLLIGGELFFDLLKPGKLNLDEGLPQL FT RDTHLGWIVAGVIDDPLVSNVSLHYSLASVRDIEEEMQQFWQIEEVPEVSK FT LSTEEATCEAHFLSTYQRDETGRFIVKLPFKENSTRLDDCRALALKRFLML FT EKRLSRNPELQAQYVEFLREYEALGHCHETSEADDPPNQQAYYLPHHAVLR FT PSSSSTKCRVVFDASAKSSPSDLSLNDVLQVGPVVQNDLHHIALRFRKFKV FT AFTGDIAKMYRQVLQAPCDRRFLRIFWREHPAMPMRVLELCTVTYGTASAP FT YQATRCLVQLVEEDGEDFPIAARIVKEDTYMDDVLSGADSLEDAIEAQQQL FT KQLLGRGGFPIHKWCSNSQEFLEHIPVEEQEKKTVLEEHGVNEAIKVLGLH FT WDPSADLMLIAYRPNPSTSHPQQPTKRTMYSEIAKFFDPLGLVSPVIVLAK FT LLAQKLWQLKTEWDDPVDDDTAQQWQELQDSLTHLHEIKIPRRVTFDNAVS FT HELHGFSDASNKAYGACIYLRSIFADGSAKLRLLTSKSKLAPLKEISIPRK FT ELCAALLLTRLVQKVLPALDMSFRDIVLWCDSTVVLAWIRKPLQQLQIFVR FT NRIAVILEHTGEYRWEYVRSQQNPADVVSRGQLAEFLKNNSLWWNGPEFLQ FT RIEYIVNPPEDIPDEELPELKALVAAPDVSMDLASFLPMYSSFRKLQRVMG FT YVMRFAANCRKRIPTERERRPLLTVRELRRSTEAILHVVQQAHFADEIKRV FT MANESCKRLGSLRPIFHEGLLRVGGRLDRSHLPFESRHPIILPDKDPVVRL FT LIQQMHVELLHVGQTGLMNALRQRYWLLNPRSTIRLITRRCVRCFRTNPTT FT PNQLMGNLPASRVVPSPPFAVTGVDYAGPFWTKQGTRRPTLVKSYVAVYVC FT MATKAVHLEGVSDLSTDAFLASLRRFIARRGMVHEIHSDNATNFRGADHEL FT KQLHQQFHDQQTVKIIESFCHNHEIEWHFIPPDAPEFGGLWEAAVKSAKTH FT LKRVAGNTSLTFEEMCTLLAEIEAVLNSRPLFTISNDSADPLVITPAHYLI FT GRPLIAPAEPSLENVKASRLTRWQHLQLLREQFWRAWSRDYLNTLQPRKKN FT QREMPNIRVDMIVLLQDRNQPPLNWKLGRITAVYPGEDGLVRAVDVFANGA FT TYRRPINKVSVLPIVDNDPQRDPPTSDC" XX SQ Sequence 5706 BP; 1469 A; 1609 C; 1442 G; 1186 T; 0 other; tctttttggt ccataccgaa ccggatcacg gtggccatcc gaaaagactt tcgccgagaa 60 agttccccca gtgccgttca actgagcggc agaagccatt cccgccgaag agaaaaacga 120 cgcgctacgc taaaaaagtg ctgtgcccaa acaaaacaac tgcttccgct gctacgattg 180 tgattcgagt gagtgaaaaa aaaataaatt ccgcgttcgc atacaaactc aaattgaagt 240 accgcgttcg agtttaaaaa agtagtgaaa aaagtgacgt aaaaatgcca ttggagcatt 300 cgccaccaaa aggcgaccca ccaaaggacg ctgcaagcga gcagcacgga ttgccttcca 360 gaacgctgcc tcccgaaaat tctccacctt tccacggatt tcctgagtgg gaagtcgcaa 420 tggatgccgc cactgccaag gagatgcaag ggctgtatcg tcagcgaaac caagccaaga 480 agaaagtgat tcggattcag cgaactctca tcgacaacga gtccttcgga ttggcccaac 540 tgaacgtctt ctccaaaagc ctgtctgcca tatacgccga gtatagtggc ttccacagca 600 aagtgctagc actggtgcca gacgaaggta tcgaggagca ggagaacgaa tacgctagct 660 tcgaggatct ccactacgcc gtttcggaag aactggagga gttgctgtta ccggcaaaga 720 ggaacacagc gcccccgcct ggccaaagca ttacgccgaa ggtagtcatc caacagcagc 780 cacttagtgt gccgatcccg tcattcgatg gcagctactc cgggtggcca aaattcaaag 840 ccatctttca ggatctgatg gcccattcgg gggatacaga cgccataaaa ctgtatcacc 900 tcgataaggc gctcatcgga gaggcagctg gcttattgga tgcgaagatc ttgagcgaag 960 gtgactacaa gcaagcctgg tccgttttgg aggatcgtta tgagaacaag cgcgttctcg 1020 tagaaaccca catccgtggc ttgttttcac taagagagat ggcatcggaa tcccacaagg 1080 agttgagggc ccacttatcc accgtcaccc accacgtcga gggcctgaag tttctcggac 1140 aacaaatcac tggcatctca gagcacatga ttgtctacct ggtggtttcg gcgctcgata 1200 aatccacccg caaggcctgg gaagggaccc aaaagagagg cgagcttcca aaatacgatc 1260 caaccatcgc ctttctgaag tccaggtgcc aaatcctcga gaattgtgaa acggcatttg 1320 agacaacgaa atccagcgcc aagcagaagc tgatccatcc gccatccaag gttcatccgc 1380 aaaagagcca tgcagcatcg accactccat cccaaccaac gaaagcgtgt gaaatttgtg 1440 gtggttctca tctcaacttc cagtgttcgg ccttgtccag tctcactcct gcccagagga 1500 acgagaagat ccgagctgct ggagtctgct tcaattgcct gcggaaaggg caccgatcca 1560 aggactgtcc ttccgacaag acgtgccgca aatgtcaata ccgacaccac acgttgctcc 1620 atgacgatgg ggccccaaat cgaacgacta catccaacgt ttcgctccca gcagagacgg 1680 tggcagatcc gccagtagtt ccagccgtgc cgagtcaacc gagccccgtg gacccgcctg 1740 tgtccacaac atgttcatcg aacttggtcc aatccaccaa gacggtactg ttgctaaccg 1800 cagtggtgca agcctacgac aggaagggcc agccacaccc gtgccgggtc ctgttggaca 1860 gcggatccca agtcaacttc gtcacggagg agatggccaa ccgccttggc ctcaaaaaga 1920 acccagccaa cgttccaatc gttggcatca acgccctacg gtccctggcc cgtgacaagg 1980 tgacgatcaa gatccgttcc cgtgtatcca gcttccaagc gaacctcgag tgtctggtta 2040 ccccaagagt gaccggttcg attccagcat ccaagatcga catctccaac tggaagttcc 2100 cggaaggaat ggccctcgct gatcccaagt ttcatacgcc tgataaagtg gatttgctaa 2160 tcggtgggga gctatttttc gacttgttga aaccaggtaa gctaaaccta gatgaaggcc 2220 ttccacagct acgagacact cacctggggt ggatcgtggc tggtgtcatc gacgacccgc 2280 ttgtgtcgaa cgtgtcgctc cactattcgc tcgcatccgt gagagacatc gaggaagaaa 2340 tgcagcaatt ctggcaaata gaggaggtgc cggaagtttc caagctatcc accgaagaag 2400 ctacctgtga agcacacttt ctgtccacat atcagcgtga cgaaaccgga agattcatcg 2460 ttaagctgcc cttcaaggaa aactccaccc ggcttgacga ctgccgtgct ctagcactaa 2520 agaggtttct gatgctggag aagcgactgt cccgcaaccc ggaactgcag gcgcagtatg 2580 tggagttcct ccgggagtac gaagctcttg gacactgcca cgagaccagc gaagccgacg 2640 atcctccgaa ccagcaagcg tattatttgc cgcatcacgc agtgctacgg ccgtccagct 2700 cgagcacgaa atgtcgagtt gtgttcgacg caagtgctaa gtcgtcgcca tcagatctct 2760 ccctgaacga tgtactacaa gttggtccgg tggtgcagaa cgacctacac cacatcgcgt 2820 tacgcttccg gaagttcaag gttgccttta ccggagacat cgcaaaaatg taccggcaag 2880 tacttcaagc cccatgcgat cgccgattcc tacgaatctt ttggagggaa catccggcga 2940 tgcccatgcg agttctggaa ctttgtaccg tgacttacgg tacggcgtca gcaccgtacc 3000 aagctaccag gtgtttggtg caactcgtcg aagaagacgg cgaggacttt cccatcgccg 3060 ctcgtatcgt gaaagaggat acgtacatgg acgatgtact ctccggcgca gactcgttgg 3120 aggacgccat cgaggctcaa caacaactta agcaactcct tggacgcgga gggtttccca 3180 tacacaagtg gtgctccaat tcccaagaat tcctggagca tattcctgtg gaagaacaag 3240 aaaagaagac cgtgttagag gaacatggag tgaacgaagc tatcaaggtt cttggtttac 3300 actgggatcc atctgcagac ttgatgctta tagcataccg accgaatcca tcgacgtctc 3360 atccgcaaca gcccacgaaa aggacgatgt attcggagat tgccaagttt ttcgacccct 3420 tgggactggt ttcaccggtc atcgttttgg caaagctcct ggcgcaaaaa ctgtggcagc 3480 tcaaaaccga gtgggacgac ccagtggacg atgatacagc acagcagtgg caggaactcc 3540 aagattcctt gacacatctc catgaaatta aaataccacg acgtgtcaca ttcgataacg 3600 cagtgtccca tgagctgcac ggcttctctg atgcctcaaa caaggcgtac ggagcctgca 3660 tctacttgcg aagcatcttt gccgatggct cagcgaagct acgtctactc accagcaagt 3720 cgaagttggc ccccctcaaa gaaatctcca tccctcgaaa ggaattgtgc gccgccctcc 3780 tgttgacccg gttggtgcaa aaagtgttac cagccctgga tatgtcgttc cgggacattg 3840 tgctgtggtg tgacagtacg gtggtcctag cctggattag gaaacctctc caacaactac 3900 aaatattcgt aaggaatcgg atcgctgtca tcctagaaca caccggtgag taccgctggg 3960 aatatgtccg gtcccagcag aacccagccg acgtcgtatc gcgtggtcaa ctagccgagt 4020 tcctgaagaa taacagcctt tggtggaatg gaccggaatt cctccagaga atcgagtaca 4080 tcgtaaatcc ccctgaagac atcccagatg aggagctgcc ggaactaaaa gcgctggtgg 4140 cagcgccaga cgtgagtatg gaccttgcat catttctccc catgtacagc agcttccgca 4200 aacttcaacg cgttatggga tacgtaatgc gtttcgctgc caattgccgg aaaaggatcc 4260 caaccgaacg tgagcgaaga ccccttctga ccgtccgtga gctgcgtcgc tcaacggagg 4320 ccatcctaca cgttgtacag caggcacatt tcgccgatga aattaagcga gttatggcaa 4380 acgagtcctg caagaggctt ggaagtctac gacccatctt ccacgaagga ctgcttcgag 4440 taggtggtcg gttggatcgc tcacatctac cattcgaaag ccgtcatcct atcatcctac 4500 cggataagga tccggtggta cggcttctga tccagcaaat gcacgtcgag ctcctccacg 4560 ttgggcaaac gggcctgatg aacgctttga ggcaacggta ttggctacta aacccacgat 4620 cgaccatccg gttgatcaca cgtcggtgcg ttagatgctt ccgaaccaat ccgacgaccc 4680 ctaaccaatt gatgggaaac ctgccagcat cgagagtcgt gccgtcaccg cccttcgccg 4740 tcaccggcgt ggactacgcc gggcctttct ggaccaagca aggaactcgt cgtccgactc 4800 tggtaaagtc ctacgtggcc gtgtatgtgt gcatggcaac caaggccgtc cacttggagg 4860 gcgtctccga tttaagcacc gatgccttcc tggcatccct tcgacgtttc attgcccgcc 4920 gtggaatggt ccatgaaatt cattcggaca acgcaacaaa ctttcgtggt gctgatcacg 4980 aattgaaaca attgcaccaa caattccacg accaacaaac cgtgaagatc atcgaatcct 5040 tctgccacaa ccacgagatc gagtggcatt tcatcccgcc ggacgctcct gaattcggtg 5100 gattatggga ggcggccgtg aagtcggcga agacccacct gaaacgtgtc gctgggaata 5160 cgagcttgac gttcgaggaa atgtgtacgc tcctggccga aatcgaggct gtcttgaact 5220 cccggcctct gttcactatt tccaacgact cggcggaccc gctggtcata actccggcac 5280 actacttaat tggacgtccg cttatcgccc cggctgaacc atccctggaa aacgtgaagg 5340 catcccgttt gacacgatgg caacacctcc agcttctgcg tgagcaattc tggcgtgcct 5400 ggagccggga ttatttgaac accctgcagc ctcggaagaa aaaccagcga gagatgccga 5460 atatccgtgt ggatatgatc gtactccttc aagacaggaa tcagccacca ctcaactgga 5520 agttgggacg aatcacagcc gtgtaccctg gagaggacgg cctggtacga gcagtagatg 5580 tcttcgctaa tggagccacc tatcgacgac cgataaacaa ggtgtccgtt ctgcccatcg 5640 tggataacga tccccagcga gacccaccaa cttcggactg ttgagattcc tcaaccgggg 5700 ggagga 5706 // ID Copia-26_CQ-LTR repbase; DNA; INV; 104 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-26_CQ_; KW Copia-26_CQ-I; Copia-26_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-104 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX SQ Sequence 104 BP; 35 A; 18 C; 25 G; 26 T; 0 other; tgatagatcg aagtaatccg gtcatggagg atttggttct ggaagtcgac gtctgtaagc 60 tgagcaaaaa taaaatcatt ctactgcgaa cagtctacaa gaca 104 // ID BEL-8_SI-I repbase; DNA; INV; 5587 BP. XX AC AEAQ01025887; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_SI_; KW BEL-8_SI-LTR; BEL-8_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5587 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01025887; Positions 13042 18628. XX CC Positions [4619-5185] - Integrase core CC 'TTCTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 126..4037 FT /product="BEL-8_SI-I_2p" FT /translation="MSENAGNLRKARGYVKAKLTRLRTQTLQIVEEKGELD FT KEQAESRLEKLEEIYRGFEAIQTQLLEQVNDLSEDDTAEEEVFEQKYFEVK FT SVLKRFMRVPSEGIESSGTIVQLLRQQSELIQRIGSGAHSDVSAITPSAEN FT EAIAAILARQTEILDRVAITANSHNVDNRVKLPTIKLPRFDGKIEEWKFFF FT DNFRSIIHDKPHLSDIEKFQYLTSSISGEAAKIIESIELSGQNYRTVWDLL FT QQRYDDPRSLKKKHIQCLFTMPTVTKESSRALRELIDYVSRHLRVLKVLGS FT PTEAWDELVLHMIETKVDVKTLRAWEEKIKSSENPSLADMLEFLRGKCQTL FT ERIESRTVDKIEKPSKEGEHRNKGFVTTGTKSQSYNKSESIKKTTLSASLG FT SGKCYFCNDENHFIYYCDKFLALSVPDRINEVKRLRLCINCLRNDHFVKNC FT KMGPCRECTQRHNTLCHLAQEGKASIDKPINSQSQDEANKEKSVNSMSVHH FT ASNNSQRRHVLMATAVVEAIRRDGSVVQLRVLLDSASEVNFVTRDIHNKLG FT LKRHRVSEIVSGLNDTENKIYNSCDVHIKSKHSSFETNAQCFIVPKIAKCL FT PSMRIEFEQSKIPNDINLADSEFYKPGPIDMLLGAEFYYDLLETGKIYLGG FT NRLILQNTKFGWVIAGSMQPIALKDQFRKQESLCAMICSLRSEDSLSKDLE FT RFWKLESYDDNKRGDLSFDETECERHFEQNTTRADDGRFIVRLPFRKTNKL FT IGNNKEIALKRLYQLERRFKGNNAFYARYARFMSEYIELGHMSIASEPLDN FT CKNVVYLPHHGVLKESSTSTKLRVVFDASSKNNKGTSLNDALLVGPTLQDN FT LVDIVIRFRFYDIAITADLQKMYRQVSVHSDDRNFQRILWRFSNNDPVKEY FT QLNTVTYGQACASYLAIRCLRLLATEGSERYPLAARALLNDTYVDDIITGA FT NTIEDAQILQKQLVNLLSEGKFEAHKWCSNSNFALENVPVELRESSANLDI FT EANDIIRTLGLEWNPSSDEFRFTAQKTSGASTKREILSAISKLFDPLGLIG FT PVLTSAKILMQSLWKTKLDWDDPLPETVLIKWREFQASLIDVNVLRIPRLV FT INSSGNSRVSICGFCDASENAYGACLYIRSINCHSIEVSVNLLCSKARVAP FT LKRQSIPRLELCSAVLLARLINNVKRTLTVPIEEIRAWSDSMVVLYWIGGD FT SNRWKPFVGNRVSEITDILPAEHWRHVKGSENPADLISRGATPAQLLDNSL FT WWHGPKWLCDPHHSTIDDERCRLTTMILLSPRWSTKKTLKFAI" FT CDS 4004..5551 FT /product="BEL-8_SI-I_1p" FT /translation="MEYKKDAQICNLNLHSISNDITHKIVEDCSTLTKIVR FT SLAYCFRFISNCRKSPDDRTLTKLSMLELTEAHQAVIKYSQNLHFKEDVIQ FT LQTHKQLCRTSQLQQLHAFLDENGILRVGGRLREAPWNFTRKHPILLPAKC FT KITRLIIEREHRALLHAGPQLLLSSIRRQYWPLNARNLIRQICRACVWCVK FT NNPKGLIQSMGSLPADRIQPSRAFSVSGVDFAGPIVTLVNKGRGRKTCKSY FT VALFVCFATKAVHLEAVSELSTAAFLATLRRFVGRRGLPRKICSDNATNFV FT GASRELEELYSFVRTSIDGAVGDTLQEMNIEWSFIPPYSPHLGDIWEAGIK FT SCKFHLKRVMGNTLFTFEELTTALVQIEACLNSRPLSPLSSDPSDLQPLTA FT GHFLIGGPLTSLPEVDLSDIKLNRLDHWESIQRAVQGFWQRWAAEYVANLQ FT SRTKWKKTKENLKVNDLVLLQEDNLPPLKWKIGHVIESHAGKDGLVRVVTV FT RTTNGIVKRAITKLCKLPVD" XX SQ Sequence 5587 BP; 1700 A; 1107 C; 1312 G; 1468 T; 0 other; tatggtcctt cgagccggat tgaggtcgaa tcacagtcca gaaacgattc gagtctgatt 60 cggaaaatac gtgaacgcat ttacggcaat tccacaaggc aagttaacca agcgatagat 120 ccaagatgtc ggaaaacgcc ggcaatctga ggaaggcacg tggatacgta aaggcgaagc 180 tcacaaggtt aagaacgcag acattgcaaa ttgtcgaaga gaaaggcgag cttgacaagg 240 agcaagccga gtcaaggttg gaaaagctag aggagattta tcgaggcttt gaggctatcc 300 aaacgcaatt gcttgagcag gtgaacgatc tcagcgaaga cgacaccgcc gaggaagagg 360 tatttgaaca gaaatacttc gaggtaaaat ctgtattaaa acgatttatg agagtgccat 420 cggaaggcat cgagtcaagc ggtacaatcg ttcaattgtt acgacagcaa tccgagctga 480 tacaacgtat cgggagcggg gcgcacagcg atgtgtcggc cattacacca agtgcagaaa 540 acgaggcaat cgcggccatt cttgcacgcc aaacagaaat tctcgatcgc gtagccatta 600 cagcaaattc gcataatgtc gacaataggg ttaagctccc tacgataaaa cttccgcgat 660 tcgacggtaa aatcgaggag tggaagttct tttttgacaa ttttcgttcc attatacacg 720 ataaaccaca tttgtcggac attgaaaaat ttcaatattt aacgtcgtct atttccggtg 780 aagccgcgaa aatcatcgaa tcgattgagt taagtggtca aaactatagg accgtgtggg 840 acttattgca acagcgatat gacgatccca ggtcactcaa aaagaagcac attcaatgct 900 tatttaccat gcccacggta acgaaggagt cctcgagggc acttcgtgaa ttaatagact 960 atgtttccag gcatttgcga gtactaaagg ttctaggttc ccctacagaa gcttgggacg 1020 agcttgtctt acacatgata gaaacaaagg ttgacgtcaa aactcttcgt gcttgggagg 1080 aaaagataaa atctagcgag aatcctagtc tggcggacat gttggaattt ctcagaggaa 1140 aatgccaaac gctagagcgg attgagtcaa ggacggttga taaaattgaa aagcctagca 1200 aggaaggtga gcatcgaaac aagggatttg taacgacagg tacgaaatcg caatcttata 1260 acaaaagcga gtcaattaag aaaacaacgt tatcagcctc attaggctcc ggtaagtgct 1320 atttttgcaa tgatgagaat cactttatct attactgtga caagtttcta gcattatcgg 1380 tacccgatcg catcaatgaa gtcaaacgat tgagactttg cataaattgt cttaggaatg 1440 atcactttgt taaaaattgt aaaatgggtc cttgtcgaga gtgcacgcaa aggcacaata 1500 cgttatgtca cttagcgcaa gaaggtaaag cgtctataga taaaccaatc aactcgcaat 1560 ctcaagatga agcaaacaag gaaaaatcgg taaatagcat gtcggtgcat catgcttcaa 1620 ataattcaca aagacgtcac gtgcttatgg ccaccgctgt cgttgaagct attcggcgtg 1680 atggttccgt agttcaattg cgtgtacttc tagatagcgc gagcgaggtc aatttcgtga 1740 cgagagatat acacaataag ctagggttga aacgacacag agtatcggaa atcgtttccg 1800 gtttgaacga tacagaaaac aaaatttaca attcgtgcga tgtgcacatt aagtcaaagc 1860 attctagttt cgaaacgaac gctcaatgtt ttatcgtgcc aaagattgct aaatgtttgc 1920 catccatgag gatagagttt gagcagtcga aaattccgaa tgacattaat ttagctgatt 1980 ctgaatttta caaacctggt ccaatagaca tgttgctagg agcggaattt tactatgatt 2040 tgctagaaac gggtaaaatt tatcttggcg gaaatcggtt gattttgcaa aataccaaat 2100 ttggttgggt aatcgctggt tccatgcagc ctatcgcgtt aaaagatcaa ttccgcaagc 2160 aagaatcatt atgtgcgatg atttgttcct taaggtctga agattcattg agcaaggacc 2220 tggaaagatt ttggaaactg gaaagctatg acgataataa gagaggagac ctatcctttg 2280 acgaaacgga gtgtgagcga cattttgagc aaaataccac acgagcggat gacggtagat 2340 tcattgtaag gttaccgttt cgcaagacaa ataaacttat cggcaataac aaggaaatag 2400 cattgaaacg attgtatcaa cttgagcgca ggttcaaggg caataacgct ttttatgctc 2460 gatatgcaag gttcatgtcc gaatatatcg aattagggca tatgtcgatc gcaagcgaac 2520 ctctagataa ttgcaaaaac gtggtttatc tccctcacca tggcgtgcta aaggagagca 2580 gcactagtac taaactacgt gtcgtgttcg acgcatcgtc taaaaataat aagggcacgt 2640 cattgaatga tgctcttttg gtcgggccca cattgcaaga taatttagtt gacattgtta 2700 ttagatttag attttacgat atagcgataa ctgccgatct gcaaaaaatg taccgccaag 2760 tgtcggtaca cagtgacgat cgtaattttc aacgcattct atggaggttc tcaaacaacg 2820 atcccgtaaa agagtatcaa ttaaatacgg ttacttatgg tcaagcgtgt gcgtcctatc 2880 tagctatacg atgtttgagg ctactcgcta ctgaagggtc agagcgttat ccattggcgg 2940 ctcgagcgtt attaaatgat acgtacgtgg acgatataat taccggtgca aacacgatcg 3000 aggacgcaca aattttgcaa aagcagttag tcaatttgtt gtccgagggc aaatttgaag 3060 cccataaatg gtgttcaaat tctaattttg cattggagaa tgttccagta gaattgcgag 3120 agtccagcgc aaacttagat atagaggcaa atgatattat cagaacatta ggtctcgagt 3180 ggaatcctag ctcggacgag tttcgattca cggctcaaaa aacgtcaggt gcgtccacga 3240 agcgtgaaat tttgtcggcg ataagtaagc tattcgatcc tttaggatta atcggtccag 3300 tcctgacaag cgcaaagatt ctgatgcaaa gtttatggaa aactaaatta gattgggatg 3360 atcctctccc agaaacggtt ttaattaaat ggcgagagtt tcaagcgagt ttgattgacg 3420 tgaatgtatt gcgtatacca cgattagtta ttaactcttc gggaaacagc cgagtttcaa 3480 tatgtggatt ttgtgatgca tccgaaaatg catacggtgc atgtctttat ataagatcta 3540 ttaattgtca ttctattgag gtatctgtca atttattatg ttcaaaggcg agagttgctc 3600 ccctgaaaag gcaatcaatc cctaggttgg aattgtgcag cgcagtatta ttggcaagat 3660 taattaacaa tgtaaaacga acgttaactg tgcctattga ggaaatacgt gcttggtcgg 3720 actcgatggt ggtcctatac tggataggcg gagattcgaa tcgatggaag cctttcgtag 3780 gtaatcgagt gtctgaaatc actgacattc tgcctgctga acattggaga cacgtgaagg 3840 gttctgagaa tcccgcggat ttaatatcca gaggagctac acctgcacaa ctactagata 3900 attcgctgtg gtggcacggt cccaaatggt tatgcgatcc acatcactcc actattgatg 3960 atgaacggtg cagattaacg acgatgatct tgctttcgcc aagatggagt acaaaaaaga 4020 cgctcaaatt tgcaatttga atttgcattc aatttcgaac gatatcacac acaagattgt 4080 cgaggattgt tccacattaa caaaaattgt acgatcgtta gcctattgct tcaggttcat 4140 ttcgaattgt cgcaagagtc cggacgacag aactctaacc aaattgtcta tgttggaatt 4200 gacagaagct catcaggcag tgattaagta ttcgcaaaat cttcacttca aggaggatgt 4260 aatacaattg caaactcata agcagttatg ccgaactagc cagttgcagc aactgcacgc 4320 tttcctggat gaaaacggta tccttagggt gggcggtcgc cttcgtgagg caccctggaa 4380 cttcacgaga aagcacccta tactattacc tgccaaatgc aagataactc ggttgattat 4440 cgaaagggaa catcgtgctc tattacatgc agggccacag ctgttgctct cttcaatacg 4500 tagacaatac tggccattga acgccaggaa cttgattcgt caaatttgcc gtgcttgtgt 4560 atggtgcgtc aaaaacaatc caaagggatt aattcaatca atgggatcac tgccagccga 4620 tagaatccaa ccttccagag cattttctgt ctcaggagtg gactttgcag gtccaatcgt 4680 cacgctcgtt aataaaggac gtggcaggaa aacgtgtaaa tcgtacgtag ccttgtttgt 4740 ctgctttgca accaaggctg tccatttgga agccgtcagc gaactatcta cggctgcatt 4800 tctagccact ctacgaagat tcgtgggacg cagagggttg ccgcgcaaga tttgcagtga 4860 taacgctacg aacttcgtgg gagcaagccg tgaacttgaa gagctttatt catttgttcg 4920 tacctctatc gatggtgccg taggtgacac cttgcaagaa atgaatatcg aatggagctt 4980 catccctcca tattctcccc atctgggaga catatgggaa gcgggaataa aatcctgcaa 5040 atttcattta aaacgagtaa tgggtaatac tctatttaca tttgaggagt tgacaacggc 5100 gttggtccaa attgaagcct gcctaaactc caggcccttg tcgcctttgt cgtcggatcc 5160 ttcggatcta caacctttga ccgcaggtca ttttctaata ggcggacctt taacgagcct 5220 accggaggtg gacttatcgg acatcaaact taatagactt gatcactggg agtcaataca 5280 aagagccgtt caaggattct ggcagcggtg ggcggcggaa tatgtagcca acctacaaag 5340 tcgcaccaaa tggaagaaga ctaaggagaa tctcaaggtc aatgatcttg tattgctgca 5400 ggaggataat cttccccctc tgaaatggaa gattggtcat gtgattgaat cgcacgctgg 5460 caaagatggt ctagtgcgag tggtcaccgt tcgtacgact aatggtatcg tcaagcgagc 5520 catcactaag ttgtgcaaat tgcctgtaga ttgataaaga ttaattactt aatcttggtg 5580 ggcggtg 5587 // ID BEL-223_AA-LTR repbase; DNA; INV; 518 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-223_AA_; KW BEL-223_AA-I; BEL-223_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-518 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 904-904 (2011). XX DR [2] (Consensus) XX SQ Sequence 518 BP; 158 A; 108 C; 117 G; 133 T; 2 other; tgttcgcaca tcaaaatagt gcgaagagag aattgtggcg ccatctgttc ttcagtgcgg 60 gcaatctgac agcacggaca aactatggct gctagacatg gctcccttcg gaaatatgcg 120 tttcgctctt ccggcaaccg caaaaagatg gagtaaccct ggcctataaa aggcgattgg 180 tcatcggtcc tccctctctt acaaaacggc atcggcgacc gccatcaccg tcatcaacga 240 agcagtggac aacaacaaac gagtgatact tttaccaccc actgaaggtg aattactgta 300 atttgaattg attgtagtaa taagttaaat agttaatgaa ctagtggaat taaaagtgaa 360 actgtagaat aaagtactgt gaattttgct gtgatgtacc cttaagtttc cgaaaagaag 420 aagaaacaac ttatctgagg cgtgaacatt gtccccgttg tgaggtgtgg tgaattwtaa 480 ktgagcacag attcgctgtt ctacccgagt tatcaaca 518 // ID BEL-107_AA-I repbase; DNA; INV; 5245 BP. XX AC AAGE02022366; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-107_AA_; KW BEL-107_AA-LTR; BEL-107_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022366; Positions 145940 140696. XX CC 'AAGTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 85..1689 FT /product="BEL-107_AA-I_1p" FT /translation="MSSTPAKDPEKRLEIYSTPVGGRQQQKKKMAEESQAI FT VLQRGAVKGKITRIRNVLFRYEQDGTIPDEFLLRTQLKTIDAAYDEFNLFQ FT NKIYAILPSSAEEQEQKYIEFEDLYNDVRTRVCRLLGDVKREVNHQPQEGI FT QEAPAPQPVRVQTTPSLHTPLPTFDGKPENWFKFKAIFMDVMNKHVGESDA FT TKLYHLDKCLVGEASGTIDPQTINDGNFAAAMEHLTVRYEDKRKIVDIHVN FT GVLNLKAMTCESGKQLRELVDECKRHVDALNFYEFEMDGLSDIMVVSILAS FT KLDLETRKLWEGSIDHGDIPKYAEMIKFLTTRSQVLERIEPTSKPRKSNVT FT SAAKIPPLKTLSLAASVDCRCNFCEQAHLNHQCSDYLKLNPKERYEKAKQA FT GVCYNCLRKGHVTARCSSTRSCKACNKRHHSTLHMNPTPAGVVEQTATTVP FT AAQSIDDSTKPTTTNTINASCTVYQQRALLCTALVNILDDSGKAQPCRVLL FT DCRSLELYLRSTSIFLRGRYHLAYTSPILISIVHSAWTC" FT CDS 1996..4746 FT /product="BEL-107_AA-I_2p" FT /translation="MLPFRDNVNQLGDSKQHAQQRFSYLEKKLSKNAELKA FT AYCAFMEEYKQLGHCREIDPSEEEAGGFYLPHHAIQKPSSSTTKLRTVFDA FT SAQSSSGLSLNDTLMVGQVVGPTIQDSLIDIALRFRTYAYVFTADISKMYR FT MILVHADHTKYQRVFWRADPTEPLKTLELLTVTYGTASAPYLATRTLKQLA FT QDEGHAHPRAAEILENDFYIDDVLSGADTLEELMLTRTDLEELLLNGGFEL FT HKWCSNSAEFLGTIPEERREKRMSYEQDGVNDLIKTLGILWNPTSDQLLFR FT ISPTDQTHPVSKRYILSEIAKSFDPLGLQAPVCVTAKLILRQLWRKNVSWD FT EEVSQDIAESWNKFRGGLQALMNLQIERRVIAPKRIALELHAFSDASQDAY FT GTCVYVRSILADNTVEVHLLISKSHLAPRATIPRLELCGFRLMSRLVVKVV FT SALKVKFNRMVLYSDSTITLGWLSKSPSQLNTYVANRVAHIQELTPSSSYD FT YKYVRTNANPADLVSRGLYPEALIENRFWWHGPEFLQNDSHDDEIMEEMTE FT LPEVKVAALVTNNDYELEYNRIFMKFSSFRKLQRVIAYVIRFSNKTRNRAT FT DGDHERYPTVSELRSSLLAIVRMTQQQRLRNDIQEIQKKQGTTDKYTGRLR FT SLNPWIDSSGILRVNGRIKHANVAYDQRCPVILPSNHHVTDILIQATHLEN FT LHIGLNGTLSVLRQRFWILNGRNVIRKQLQKCVRCFRVNPPETKLFMGDLP FT RCRVTQALPFARTGVDFAGPIWVRKGHPRKPVYCKAYVALFVCMVTKCIHI FT ELVSNLTTDAFIAALHRFVARRGIPVHMYSDNATNFAGSSSELHELYQLLH FT QQLTKECIQGFCLPKEISWLHPSSFATRRRIVGSRRQVGEIPNQAHGWRHE FT VNGRRMEHTINAD" XX SQ Sequence 5245 BP; 1491 A; 1245 C; 1280 G; 1229 T; 0 other; ttttttggtc ctacatcgcc ggataatcga aaagtctcgg tcgcaattat cgataaagga 60 cccgtcggtc gaatggacgc gtaaatgtca tcaacccccg cgaaagatcc ggaaaaaaga 120 ttggaaattt acagcacgcc agtgggagga cgccagcagc agaaaaagaa aatggcggaa 180 gaatcccagg cgattgtgct tcagcgcggc gccgttaaag gaaaaatcac tcgaatcaga 240 aatgttttgt tccgatacga gcaagatggc acaataccgg atgaattcct gttgcgaacc 300 cagttgaaga cgattgatgc tgcttacgac gaattcaacc tgttccagaa caaaatttac 360 gcgattttgc ccagtagtgc agaggagcag gagcagaaat acatcgagtt tgaggacctg 420 tacaacgatg ttcgaacgag ggtgtgccga ctgctgggtg acgtgaaacg agaggtgaat 480 catcaaccgc aagagggtat ccaggaggca ccggcaccgc aaccagttcg agtacaaacg 540 accccttcgc tccatacacc gttgcctaca ttcgacggga agccagaaaa ctggttcaaa 600 ttcaaggcca tattcatgga tgttatgaat aagcacgtcg gagagtcaga tgctacgaag 660 ctctatcacc tagacaagtg tttggtaggt gaagcttccg gaacgatcga tccacaaacg 720 ataaacgacg gcaactttgc tgctgcgatg gagcatctga cagtacgcta cgaagataaa 780 aggaagattg ttgacatcca cgtcaacggt gtacttaacc tcaaagcaat gacgtgcgaa 840 agtggcaaac aacttcgtga actagtggat gaatgcaagc gacatgttga cgcactcaat 900 ttctacgaat ttgaaatgga tggactatcg gacatcatgg tcgtcagtat tctggcgtcg 960 aagcttgatc tggaaactag gaagctctgg gaaggcagca tcgatcacgg tgacatccca 1020 aagtatgcag aaatgataaa gttcctgaca acacgaagtc aagttttgga gcgaatcgag 1080 ccaacaagca agccaaggaa gtccaacgtc acttctgcag cgaagatccc tccattgaag 1140 acattgtccc tggcggcatc ggttgactgt cggtgcaact tttgtgagca agctcatttg 1200 aatcatcagt gcagcgatta tctgaaacta aacccaaagg aacgctacga aaaggcaaag 1260 caagctggag tctgttacaa ctgcttgagg aaaggacacg tcactgcacg ctgttcttct 1320 acgagatcgt gcaaggcctg caataaacga caccattcta ctctccacat gaatcccacg 1380 ccggcgggtg tcgttgagca gacagctact acggtgccag cggcacaatc aattgacgac 1440 tccacgaagc ctactacgac aaatacgatc aatgcttcct gcacagtcta ccagcaacgg 1500 gctctgttat gcacagcact tgtgaacatt ctcgacgata gcggtaaggc gcagccatgt 1560 cgcgttctac tagattgcag atcactggaa ttatacctgc gaagcacatc gatatttcta 1620 cgtggccgat accatctggc ctacacctcg ccgatcctga tttccatcgt ccacagcgcg 1680 tggacatgtt gataggagtc gggcactttt tcggtctcct gaaatctggt aagatgaagc 1740 ttgctgagag cctgccgttt ttacaggaga caacgtttgg atgggtcgtc ggcggcatgg 1800 ctgactacaa gttggaagta cgaagtgaat cactgcaaca ttgcggtcaa tgagggaaat 1860 ttgaatgaac tcgttgaacg gttctgggag tccgaggcgg tgtctaccgc ttccagtctg 1920 tcatcggaag aagcagcatg cgaacagttc tacgagcaga ctcacagtcg cgatgcaagc 1980 ggccgctaca ccgtaatgtt acctttccgt gacaatgtga atcaactcgg cgattcgaag 2040 cagcacgctc aacagcgatt ctcctacttg gagaaaaagc tgtcaaaaaa tgctgagcta 2100 aaggcggcct actgtgcatt tatggaggaa tacaagcagc ttgggcactg ccgagaaatc 2160 gatccgtctg aagaagaagc tggtgggttc tacttgcccc atcacgctat acaaaagcca 2220 agttcatcta ccaccaagtt acggaccgtt tttgatgctt cggcacaatc gagttcaggg 2280 ctctccttga atgatacttt gatggttggc caagtggttg gccctactat acaagattcg 2340 ctcatcgata ttgctcttcg ttttcgaaca tatgcttatg tgttcacggc tgacatttct 2400 aaaatgtacc gaatgatatt ggtgcacgct gaccacacca agtatcaacg cgtgttttgg 2460 agggcagatc ccacagaacc actgaagaca ctcgagcttt tgactgtcac ttacgggaca 2520 gcttcagcgc catatctagc tacccgcaca ttaaagcagt tagcccagga tgaaggccat 2580 gctcaccctc gcgcagcaga aatactcgaa aacgattttt acatcgacga cgttctttcg 2640 ggagctgata ccctggagga attgatgctg actcgtactg atttggagga actacttttg 2700 aatggaggtt tcgagctaca caagtggtgc tccaattcag ccgaattcct tggcaccata 2760 ccagaagaac gacgagagaa gcgaatgtca tatgagcagg acggcgttaa cgacctcatc 2820 aagaccctcg gtatattatg gaaccctaca agcgatcaac tactatttcg catttctccc 2880 actgaccaaa cgcatccggt ttcgaaacgc tacattttat cagaaattgc gaagtcgttt 2940 gaccctctag gtctgcaagc accggtttgt gtcactgcta aactcatact gcgtcagctg 3000 tggagaaaga atgtttcgtg ggatgaagag gtaagccagg acatagcaga aagctggaac 3060 aagtttcgtg gtggtttaca agccctgatg aacctacaaa tcgaacgccg agtcatcgca 3120 cccaaacgaa tcgctctaga gttgcacgca ttttcagacg cctctcaaga tgcttatggt 3180 acatgtgtct atgtacgtag tattttggcg gacaatacag tggaggttca tctgctaatt 3240 agcaaatcgc acttggcgcc gagagctaca atcccgagac tggaactgtg tggttttcgt 3300 ttgatgtcaa gactagtggt caaggtcgtc tcagcgctca aggtgaaatt caatcgcatg 3360 gtgttgtact cggattcgac tatcacttta gggtggttat caaagtcacc aagccaactc 3420 aacacgtacg tagccaatcg agtggcgcat atacaagaat taacgccaag ttcatcctat 3480 gattacaagt acgtacggac taacgctaac ccggcagatc tggtttcgcg aggactctac 3540 cctgaagctc tgattgaaaa tcgattttgg tggcacggcc cagagtttct gcagaatgac 3600 tctcacgatg atgaaataat ggaagaaatg actgaactgc ctgaggtcaa ggttgctgca 3660 ttagtgacca ataacgacta tgagctggaa tacaacagaa tattcatgaa attcagttca 3720 tttcgcaaat tgcaacgtgt gattgcttac gtgattcggt tttcgaataa gacaaggaac 3780 cgagccactg acggagacca cgaaagatac ccaactgtga gcgagttacg ttcttcgttg 3840 ctggctatcg ttcgcatgac tcaacaacaa cgactgcgaa acgatattca agaaattcaa 3900 aagaagcagg gtactaccga caagtacact ggaagactac gaagcctcaa tccgtggatc 3960 gacagcagcg gtatacttcg agtaaatgga cgaatcaaac atgcgaatgt ggcatatgat 4020 caacggtgtc cggtaatatt gccgtccaac catcatgtca cagacatatt gatacaagct 4080 acccacctgg aaaacttgca cattggacta aatggaactc tttcggtctt gcgacagcga 4140 ttttggattc tgaatggccg aaatgtgatc cgtaagcaac tacagaaatg cgtgcgatgc 4200 ttcagagtga atcctccgga gacaaagctt ttcatgggag acctacctcg ttgcagagtt 4260 acacaagctc ttcccttcgc acgtacgggt gttgattttg ccgggccaat ctgggtacgc 4320 aaaggccatc cacgcaaacc agtgtactgt aaggcatatg tagcattgtt tgtgtgcatg 4380 gtcactaaat gcattcatat cgagcttgtg tcaaacctga cgacagacgc attcatagcc 4440 gcattgcacc gatttgttgc acgccgcggc attccggtgc atatgtacag tgataatgcg 4500 accaactttg ctggatcaag ctcggagctc cacgagttgt atcaactatt gcatcagcag 4560 ctgacgaagg agtgtattca aggtttttgt cttcccaaag aaataagttg gcttcatccc 4620 tcctcgttcg ccacacgtag gaggattgtg ggaagcaggc gtcaagtcgg cgaaatacct 4680 aatcaagcgc acggctggag acacgaagtt aacggaagaa gaatggaaca cactattaac 4740 gcagattgaa ggcatcctca attcgcgccc tttggtaccc cagacggcag accctgatga 4800 ctacagtgtt attacacctg gacatttgtt gattggtcgt ccaatcacag cgattcccga 4860 accagcatac gaccaactca agcatggaac cctctccaga tggcaacaca ttcagaaaat 4920 gcgcgctgat ttctggaagc gttggtcagc gttttatctg tccgagttgc aacaacgccg 4980 taaatggtcc aaacaacaca ccgagatcaa aattggggat ttggttctgc ttaaagagga 5040 caacatccct ccacttcaat ggcgacttga cagagtggta catacccacc caggacagga 5100 tggagtaaca cgtgtagtaa tggtgaagac atcagcagga gtgttcaaac gttcgaccgc 5160 caacattgct gtactcccgc tggacgatgt ggtagttaag acggaagctg attgaatttc 5220 ctggaaatcc aatggcgggg gagaa 5245 // ID BEL-170_AA-LTR repbase; DNA; INV; 227 BP. XX AC supercont1.309; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-170_AA_; KW BEL-170_AA-I; BEL-170_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-227 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.309; Positions 648676 648902. XX SQ Sequence 227 BP; 76 A; 42 C; 48 G; 61 T; 0 other; tgtaagggaa cgtaaaagct ctctgctatc aacttgtcga atggtcggac aaaaggagta 60 tccctagcag aaaagtagaa aaaaaaaaag ttggcacccc tcgaattccg ccagcttatg 120 agttaagtat ttcgcgatcg cgggcgcacc agacgccatt attttgtatt tttcgcaatc 180 gtatcaaata aatatgtaaa acagataaaa tgtgtttggt tgttaca 227 // ID Gypsy-177_AA-LTR repbase; DNA; INV; 1969 BP. XX AC supercont1.143; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-177_AA_; KW Gypsy-177_AA-I; Gypsy-177_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1969 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.143; Positions 548935 550903. XX SQ Sequence 1969 BP; 594 A; 365 C; 466 G; 544 T; 0 other; tgaaacgagc agatttggtg aaaatccgcc gtttcaaata agtagcgttt gctaccattg 60 cgcctgttag ataggcaact atagatgata aagtgcttga atctattgtt tataagtgtg 120 agttttaaga tgatgatgag agtgggtgaa gagtaatgat taacaggaaa ccccgagaaa 180 tgaaaatcat tcgacaagaa aatttttatt agagaattta aatcaaaata gaaccggaaa 240 ccctcgaata ttatattaat ataatcggtg acaactttct tggtcaatcg aaaccaacaa 300 gttggtcttt cccttttgga ccgccaagta gtcaagtgct aaaaaaatct tatcaagtgc 360 attgagaaaa ttttcttaaa aattatttta tctaaaacaa gtaaaagaag ctataaaatc 420 aaaagaagag tgtaaatttt aacgtgataa aatttattat tattttcaga gagataattt 480 gcaatttaat tgaaatttaa tacaaaccgt gagtcgtatg tttaaaataa ttgtagttgt 540 atctaaattt aatatttcgt tttggacaag aagggtagga aaattaaggc agtgagtcaa 600 agccaagaaa aaagtgagaa cagtgaagca aggaatataa ggtaatttgg tgaattgata 660 gaaaatgtgt gtgttgggta ctaagaatct gctattacag tagccacagc ccggtggcat 720 tctccattgg ggaattggtg tgcggtcggt gctaaaacgt gaaggtgcga gattggtggt 780 gcgaagatta atcgatcgtt gcgacggtga aagtgaaccg atttggtggt tggtacacag 840 cgaggttacc ctgggcatac ctatcgactc gtggtctcgt gccactatag gtacgtctag 900 agtgcgttgg ccatcgaagt aggaagcttg gcccccccac gagaatcaca cccgctaccc 960 gtgattgcac gctctgtggg gtcgtccatc aaagggaagg aattccagcg ccagtgtacc 1020 cgcccgactc acgttctgaa gtccggagac tgttcggacg gtaccaaacc acctttccag 1080 ggaggaaaat taccacctca gaaccgcgga ttgtggataa cctcacccaa gtaagagtag 1140 tgatcaccct tcgccacctc ccgcctgcga tatccgggaa gagaaagaca aaatttctgc 1200 agccgagcca cctcccctcg ttgttggtgt tcatcaatct cgacgtggtt ggtcgatagc 1260 gtcgtcctcg ggaccgatgc caccccctgc tcagggttga tcgtgttagt cagcaagcga 1320 cgctcaactg taagtacccg aatcaaactc catcaacaca caccaatcca ccgtgatttc 1380 aagtagggaa acactatttg tagcagtagg ggaagaacac agttaggggg ttagaactac 1440 tcacgaaaga tacacgagag gtgagtgacg tcacacatag ggagtgagtg actatcggcg 1500 agttcggtac cgttggccag cgtttagctc cgggagaaca gccgatagcc actcagtact 1560 tagctagctc tggtaggaga cagtgcaggg cattgtacgt gatgaaatga aaaaaaaaaa 1620 actttaatga attttaaaac gttaaaattt tgataaaagt tttactgaat tgccttacct 1680 agtatttttt tttttaataa aattttctct tgtctgctta actttagatt tactattttt 1740 ttggttgatc gaaagtttca ccattctgtt agaaatttgg gaaggaattt tgggaaagtt 1800 tttggaattc ggactgattg tttgtttgtt tgggattgga ggtggattcg gtgctaggca 1860 ctgaaaactg ctagctgggc attacgtatc cggagttctt gacagacgac ttcgtcgtct 1920 ggcgcataag tcaaggaagt tttagaccct agccaggatc ctccttaca 1969 // ID I_Ele7 repbase; DNA; INV; 5314 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-3_AAe; KW I_Ele7. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5314 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5314 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (07-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 12 CC sequences with >92% identity, and ~98% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 230..1594 FT /product="I_Ele7_1p" FT /translation="MAASSGWDPGLPPTAVDGRPTVPEWMGGVGIGQLQIL FT SLFAADGETKLCPFIVGKSMHDLAGDIESTTTEASGMKYVLRVRDAGQARK FT LLNMKALFDGTAVKVEPHPILNKRRCVISCREIMNKTEEELQNWMAKDGVI FT GVKRITRMENGKPVNTPTIILTINGTAVPDYIKVGPLRIKTRLYVPDPMIC FT YRCFEYGHTKTRCKNAAKCRNCSQAHNMEEECKEAPFCFHCQGNHGPSNRS FT CPVYATEKEIVRLRFTKGISQQEAIKQVKAGGGSYASVSQVQNRLTIAQGK FT SESELLKTKDATIKQLTETIATLTKRIEELERRSKSKKEKKRSKRIQLTKD FT DESCSEMETDSSAKQTTTEQPKGGTSTQVSAAFIKPSVQKHKRHPTTEIYA FT PITKKPLADQIQLPNTQSNTNLNKSPANPPGILDLQNLMDSTRLDSPRSRP FT NNGQHNKPSK" FT CDS 1569..5249 FT /product="I_Ele7_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANTTNRVNSQFSPKPQSQTCLALQWNLRGIKFRIPE FT LQLLISKFDPICLSLQETMMPESKTPPNFIKGYTLYIRENPDNPSKTGTAI FT AIKSDIPHRKIDVSSSVLAIGIEIDYPCKVTLLSLYIPGHLSNQTIKTDIS FT KLLNQFSSPVLLMGDFNGHSTLWGCSNNNSRGVMLGQLFDEFGLTLLNDGT FT HTRICPSTGNSSALDLSLCSHSLGSQLSWMVHDDCCCSDHLPIVISLNRAP FT PVSTCRPRWKYEQADWVNYQNEVFHAFTVDPPNSADDFVKQLFQIASNHIP FT RSSGAPGRKSVPWWNSDVHSAVKLRRKKLRALQRLPKEDRKNSEIFAEFKA FT ARNAARATVKSAKQDSWDQFISSINPESSSKELWDKIHRLSGKKARQSIKL FT RINNQITEDPSTITNHLADHFAQSSSSSNYSDRFISRKSSIESAPLNLESD FT DSQEYNRNFSFDELQWALHKVHGSSAGPDDVGYPLIKNLPLIGKTILLTIF FT NNIWNQGEIPDSWKEGLVVPIPKPGKDKNSADSFRPITLLNCIGKILEKMV FT NRRLITLLESQGLIDDRQFAFRPDKSTDDYLTELEDIIDSHLERGMHGDIV FT SLDLSKAYDRAWRFPILKSLEEWNIKGRMGRYVQNFLKDRRFRVILGNNRS FT ELRCQENGIPQGSVIAPTLFLISIQSLFKTIPPNVFILVYADDITIITFHS FT MKSLARKRIQSAVNTIAKWADENGFTVSPEKSQLLHISQNRKKMSKLPDIT FT LNNSIIKSQISLKILGIHLDRSLSFIHHLNSVRKQICNRTNLIKVLGSRIP FT CAHRKTINQIINSWLLPKMFYGVGLFSRGGDLVNKRLSPLYNKAFRYASGA FT FCTSPTSAIMAETGQLPFEYHLTTCLVAKATRWLSLGGRYNVPLVKRTTEK FT FQSLTNQVFPTIATLPRSKNRPWNQPLLKIDLSLLSQVKAGDPSSIIQPHF FT HKLIADKYSTFQHLYTDGSVSNGKVGYGITGSPNVSNISSALPSIYSIFSA FT EAYALMKAVHLISPQNLQSTVIFSDSASCLQEMNSYSLKHPWLQEAQGLAL FT TNGATFCWVPGHSGIHGNEEADRLAGEGRGVNPEDVSVPSIDVLRWTKERI FT LESWNTKWHHSRDISLRIIKPSIFPWPDNSNPKHRKIFTRLRIGHTRLTHE FT YRLDKVDPPICTSCNSPLTVRHILIDCLTYNNERLAYNLGGTLDEVLSTTK FT ETDLLGFLTATGLLDQL" XX SQ Sequence 5314 BP; 1572 A; 1361 C; 1050 G; 1329 T; 2 other; cattcatcca tcattcagtt accaaccacc agtgcgcgcg gacgtacttt ttgcgagtgc 60 gagttaattc cgagcgaaca gtagtgtatc tccaaccgat cccggtggag gaatagtgcg 120 aaccactttc ggtggtgaaa acagtgaaga gtgaagtaca gtgaagtcgt gaagaactga 180 tacgacaagt gaaaagtgcg aaaacagtga gaagtgaagt tgtgccatca tggcggcgag 240 ttcagggtgg gaccctgggc tacctcccac ggcagttgac ggaagaccaa cggttccaga 300 atggatgggc ggtgtcggga tcggccagtt gcagatcctg tcactgtttg ctgctgacgg 360 ggaaacgaag ctctgcccat ttatcgtggg caaatccatg cacgatctgg ctggcgatat 420 tgaaagcacg acgacggaag caagcggaat gaaatacgtg ttgcgagttc gagatgctgg 480 acaagcgagg aagctcctca acatgaaggc cctgtttgac ggaacggcgg tgaaggtgga 540 accccacccg atcctcaaca agcgacggtg cgtgatctcc tgccgcgaga tcatgaacaa 600 aacggaagaa gagctgcaga actggatggc caaggatggt gtgattggtg tcaagcgaat 660 cacccggatg gagaatggga agccagtgaa cacgcctacc attatcctca cgatcaacgg 720 aactgcagta cccgactaca tcaaggtcgg tccgctgcga atcaaaaccc gactttacgt 780 accggacccc atgatctgct atcgctgctt cgaatatgga cacaccaaaa cccgctgcaa 840 gaatgccgca aaatgcagga actgctccca agcccacaac atggaggagg aatgcaagga 900 agcacctttc tgcttccact gccaaggaaa ccacggacca tccaaccgct cctgcccggt 960 ttacgccacg gagaaggaga ttgtccggct acgattcacc aaaggaatct cccaacaaga 1020 agccatcaag caggtcaaag ctggcggtgg atcatacgcc tctgtcagcc aagttcaaaa 1080 tcgccttacc attgcccaag gaaaatcgga atcggagctg ctgaagacca aggacgccac 1140 catcaagcaa ctgacggaga ccatcgcaac gctcaccaaa cggattgaag aattggaaag 1200 aaggagcaag agcaagaagg agaagaaacg aagcaagcga atccagctga cgaaggacga 1260 tgaaagctgc tccgagatgg aaaccgactc cagcgcgaag cagacgacaa cggaacaacc 1320 gaaaggtggc acctcgaccc aggtaagtgc tgcctttatt aagccaagcg ttcaaaaaca 1380 taagcgacac ccaacaactg aaatctatgc cccaatcacc aagaaacctc tggctgacca 1440 aattcaactc ccaaacaccc aatcaaacac caacctgaac aaatcccctg caaatccccc 1500 tggtatcctc gatctacaaa atcttatgga ttccactcgc cttgactccc ctcgctcacg 1560 tcccaacaat ggccaacaca acaaaccgag taaatagcca gttttccccc aaaccccaat 1620 ctcaaacctg tttagcttta caatggaatc ttcgtggaat taaatttaga ataccagagt 1680 tacaactact gatttctaaa tttgatccta tttgtttatc tcttcaagaa acaatgatgc 1740 ctgaatcaaa aacaccaccc aattttatta aagggtacac actgtacatc cgtgaaaacc 1800 ctgacaaccc ttctaaaact ggaacagcaa ttgccattaa aagtgatatt cctcaccgta 1860 aaattgatgt ctcctcctca gtactcgcaa taggcatcga aatagattac ccctgtaaag 1920 taacacttct ttcattatac atacctggcc acctctccaa tcaaacaatc aaaaccgata 1980 tttccaaact tcttaatcaa ttttcatctc cagtcctttt gatgggtgat ttcaatggcc 2040 attctactct atggggttgt tccaataaca actcccgtgg agtgatgtta gggcaactct 2100 tcgacgaatt tggattaact cttttgaatg acggtaccca tactcgaatt tgtccttcta 2160 caggaaattc atcagccctt gatctttcat tatgttcaca tagccttggt tcccagttat 2220 cctggatggt ccatgatgat tgttgttgta gcgaccatct tccaattgtc atttctctga 2280 atcgagctcc tcctgtttca acatgtcgtc ctagatggaa gtatgagcag gctgactggg 2340 ttaattacca gaatgaggtc tttcatgctt tcaccgttga tcctcccaat tctgctgatg 2400 atttcgtaaa acaactcttt caaattgctt caaatcacat tccccgatcc agtggtgctc 2460 caggtaggaa atctgtcccc tggtggaatt ctgatgttca ttctgcagtc aagctccgcc 2520 gtaaaaaact tcgtgctctt cagaggttgc caaaggagga tcgtaagaac tccgaaatat 2580 tcgcagaatt caaagctgcc aggaacgctg caagagcaac agtcaagtcg gccaaacagg 2640 atagttggga tcaattcatt tcttccataa acccagaaag ttcatcaaaa gaactgtggg 2700 ataaaatcca tagactaagt ggcaaaaaag ccaggcagtc cattaaactt cgaattaata 2760 accaaattac tgaggaccct tccactatca ctaatcacct tgcagatcat ttcgcccaat 2820 catcctcttc ttccaattat tccgaccgtt ttatttcccg taaaagttcc attgaatccg 2880 ctccccttaa tcttgaatca gacgattccc aggaatacaa ccgtaatttc tcctttgatg 2940 aattacaatg ggctctccac aaagtccatg gctcttctgc tggtcctgat gatgtgggct 3000 accctctcat caagaatctt ccccttattg gtaaaaccat cctgcttaca atcttcaaca 3060 atatctggaa tcaaggtgaa atacctgatt catggaaaga aggtctagtt gttcccatac 3120 ctaagcccgg taaagataaa aatagtgccg atagtttccg tcccataact cttttgaact 3180 gcattgggaa gattcttgag aagatggtca atagaagact tatcactctc ttggaatccc 3240 aaggtttaat cgatgatcga cagttcgctt tccgccccga taaatctact gacgactacc 3300 taactgaact cgaggatatc attgactctc acctggagag gggaatgcat ggtgacattg 3360 tatctcttga tctttcgaag gcctacgatc gtgcgtggcg atttcccatc ctcaaatcac 3420 ttgaggaatg gaacatcaaa ggtcgcatgg gtcgttatgt acaaaacttc ctcaaagaca 3480 ggagattcag ggtcatctta ggcaataacc gatctgagct gagatgtcag gaaaatggta 3540 ttcctcaagg ttcggtgata gctcccactc ttttccttat ttctatacag tccctgttca 3600 aaactattcc tcctaatgtt ttcatccttg tttatgcaga cgacataacc ataatcacwt 3660 tccatagtat gaagtcactc gccaggaaaa gaattcaatc cgcagtaaat accatagcca 3720 aatgggctga tgaaaatggt tttactgttt cccccgaaaa atctcagctc ctccacatta 3780 gtcaaaatag aaaaaaaatg tccaaacttc ctgatataac cctcaacaat tctattatta 3840 aatctcaaat atccctaaaa atccttggaa tacatcttga ccggtcccta agttttattc 3900 accatcttaa ttccgttcgc aaacaaatct gcaatagaac caatctcatt aaagtcctag 3960 gatctcgaat accctgtgcc cataggaaaa ctatcaacca aataatcaat agctggcttt 4020 tacccaaaat gttctatggc gttggactgt tcagcagagg aggggatctc gtcaacaaaa 4080 ggctaagtcc tctatacaac aaagcctttc ggtacgcatc gggcgctttc tgcactagtc 4140 ccacctctgc tataatggcc gagacaggac aattaccttt tgagtatcat ctgacaacat 4200 gcctagtggc caaagctacc aggtggctaa gtcttggcgg tcgatacaat gtccctttag 4260 tcaaacgaac aaccgaaaag tttcaatctc ttactaatca agttttcccc accatagcta 4320 ctcttcctcg atccaaaaat agaccctgga atcaaccact cctcaaaata gatctttctc 4380 ttctcagcca agttaaagct ggtgatccct catcaataat tcaaccccat ttccacaaac 4440 ttattgcaga caaatacagc actttccaac atttatacac cgatggctct gtatcaaatg 4500 gtaaggtcgg gtatggcatc acaggtagcc caaatgtctc taatattagt tctgcgctcc 4560 cttcaattta ctccatmttc agtgccgaag cttatgcttt aatgaaagct gtccatctca 4620 tttcacctca gaatcttcaa agtacagtta tcttcagcga ctctgccagt tgtctccagg 4680 aaatgaattc ttattccttg aaacatcctt ggcttcagga agcccaagga ctcgctctga 4740 ccaatggggc caccttctgt tgggtgccag gccactctgg cattcatggt aacgaggagg 4800 cggaccgatt ggctggagag ggtcgtggtg tgaacccaga ggatgtttcc gttccatcta 4860 tcgacgtgtt gagatggaca aaggagagaa ttcttgaatc ctggaacaca aaatggcacc 4920 acagtagaga catctcactt agaattatca aaccttcaat cttcccttgg cccgacaata 4980 gtaaccccaa acaccggaaa attttcaccc gtcttcgaat tggccatacc cgtcttactc 5040 atgaataccg cctcgataaa gttgaccctc caatctgcac ttcttgtaat tccccactaa 5100 ctgttcgcca tattttaatt gattgcttaa cttacaataa tgaacgcttg gcttataact 5160 tgggtggcac cctggatgag gtgctatcaa caacaaagga aacggatctt cttggattcc 5220 tgactgcaac cggattactg gaccaactgt aaaacccagc gataagggac gaatgacctt 5280 cgggttaaag tccctctcaa acaacaacaa caac 5314 // ID SR2B repbase; DNA; INV; 2399 BP. XX AC AF025681; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Schistosoma mansoni SR2 subfamily A non-LTR retrotransposon. XX KW Non-LTR Retrotransposon; Transposable Element; SR2B. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-2399 RA Drew C.A.; RT "SR2B."; RL Direct Submission to Genbank (19-SEP-1997)Molecular Parasitology RL Unit, Queensland Institute of Medical Research, Post Office, RL Royal Brisbane Hospital, Brisbane, Queensland 4029, Australia. XX DR GenBank; AF025681; Positions 1 2399. XX SQ Sequence 2399 BP; 609 A; 530 C; 582 G; 678 T; 0 other; gtaaactggc tgcaacatct gttgcaagta agtatcgagc ggagctagcc tctaggctag 60 ctaccacccc accgaaaagt atagatgagc attggttgca tcttcacgac gccatgaaaa 120 tggcgagtaa agtcgcttgt ggcttcgcga aacgtcccgc ttacaagcac tgggtttctt 180 ctggctcctt acaactgatc gaagcccgtc ggtctactcc gggtgaccgt gagtttgacc 240 acaaacgaag gctgttacgt aatgaaatcg ggcaaagctt gcgtaaggac cgggaagcct 300 ggtggtcgga gcgtgctaat gagatggaag cagcagctgc atctggtaac tgccggaagc 360 tcttccaact catccgagcc actggcagca agaagtctgg tgtgagtgaa acaatctgtg 420 aggatgacgg gatgccaatc actaacatct gtcgacgtct tggacgatgg gcggaattct 480 tcgaagggca gttcaactgg catgctgctc cggcaacatc aaccagcttg tccttccccc 540 catggccggt gacgactgat ccaccaaacg aggcggaagt ccgcaaggaa ctccaactct 600 tgaaacgtta caaatcaccg ggcccagatg acttacaccc agcacttttt aaagatggtg 660 gtgactttct ggctaaggaa ctgactgtgt tgtttggaaa ggtttgggag caagaaagtg 720 ttccaacatc atggaatgag tcgatagtcg tccctatctt taaaaagggt tcacgttgtt 780 cctgtaacaa ctatcggggg ataagtctac tttcgattgc gtccaaacta ttggcttcta 840 tcattcttcg taggttgttt aaaacccgag aacgattgac tcgcgaggag caggctggtt 900 ttcgttctgg tcgaggatgc attgatcaca tcttcaccct ctgccaaatg ttagaacacc 960 gtcatactta tcgcaggcca acaatcgtag tgtttcttga tatcagggct gccttcgatt 1020 cgttggacag gactgttctc tgggattgtc tattgaagaa gggtgtgcct gagaagttca 1080 ttaacatctt aaaggccctg tatacgaaca cctcaggcag agtgagggca tacaaccacc 1140 tttctccctt gttccattcg agcagtgggg ttaggcaggg ttgcccgatc tcaccattcc 1200 tcttcaactt tgccatcgac gacatcctgg aaacagctct gatggatgta agtaatggca 1260 gtgtggatat gttgcctgga gaacgacttc tcgaccttga gtatgcggat gatattgtct 1320 tactgtgcga taatgcccaa ggcatgcaat ccgcacttaa tcagttggca atcagtgtcc 1380 gcaggtacgg catgtgcttt gcaccctcca agtgaaaagt actcctacaa gactggcagg 1440 attctcatcc tgtactcacc ctggatggtg agcagattga agtagttgag aagttcgtgt 1500 atctaggcag ctatataagt gctggtggtg gcgtgagtga tgagattgat gcacgtataa 1560 tgaaagccag agcggcttat gccaatctgg gccatctctg gcgccttcgt gatgttagtc 1620 tggctgtaaa aggtcggatc tacaacgcgt cggtaagagc agttttgctt tatgcttacg 1680 agacctggcc tcttcaagtt gagcatgtta gatgactctc tgtgtctcat catcattgtc 1740 tttgaagcat tgctgacatt cagtggtcgg gtccagtggc gacaccatgt tggcaatgca 1800 gagattcggc aacgtgtgtt cgggcacagc aacgataact caattggtgt cactaacttg 1860 aaacactgac ttcagtggat tggacatgtc ctacaaatgt cgtcccagag aattccacat 1920 tagacattat ttgccaacgc tagggccggt tgaaaagaac cgagaggtgg tcagtgtatg 1980 acatggtgtc gtgttatgaa agaaagttat acaggactgg tttccgttgg tcattcacaa 2040 ctccctgatt ggggtcctag agacggtaca acacagtggc ttgaaatatt atcagacata 2100 gttcagaata gaagccagtg gcgatcctgc tgcaaccgtc ttttactttc tacataaaga 2160 gtggttctaa ctttcttaac tgaaagtgtc ttctggttgt acattttagt ccgcgttatc 2220 ttttcatcct ttctcttcct actttcatta ttttgtgtgg cgcatatgta tctggtgccc 2280 ttttgtacca atatatatgt gtttaaataa ataaataaat aataaaatca tatataatat 2340 gatttaatca ttctcatatt tttcaataca ctattttata tatgtattat ttatttatc 2399 // ID Transib-12_HM repbase; DNA; INV; 3943 BP. XX AC . XX DT 31-JAN-2008 (Rel. 13.02, Created) DT 31-JAN-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3943 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(2), 37-37 (2008). XX DR [1] (Consensus) XX CC Transib-12_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome less than a few CC million years ago (copies are ~4% divergent from their consensus CC sequence). The consensus sequence was obtained based on multiple CC alignment of 10 copies; it codes for a 654-aa Transib CC transposase. Like other Transib transposons, Transib-12_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1623..3584 FT /product="Transib-12_HMp" FT /note="transposase." FT /translation="MRPTISVLMTKFRKKYKEAAQRQDYFFKKNENWLKTS FT VYFQIDAKRAIVNKSTGRPTGSFETLSDRSKRRRTKEMRTKFSPAELSFAT FT QMSLRSSGHSDAAKIVNDVTLSSPFKATKYKESLQLSLSLPAFYDANEALA FT LLLDLSLTKDLYQHLRNQAKKKGLNTFYPPYSAVLAAKKCCYPPDDTLNFT FT ESKAEVQLQALLDHTVQRILLLNQDNITYKSDSEILNMTLICKWGFDGTSG FT QSIYKMTFEDPNISDASLLFTSLVPLQLVGVNQETKEENLLWSNPKPSSTL FT FCRPLKLQFTKETEETFINEKTHFDKQIEQLQPLKTTIHMKNVFIQYSLSM FT TMLDGKVCNAITNTKSAQRCYLCQATSKQFNNIDEVLQRPVTTDYLQYGLS FT TLHAWIRCFECCIHLGYKKNTKKWQARKEEDKQEKATMKASIQKAFKQQLG FT ILVDMPKQGLGSTNDGNTARRFFENTQTSSLITGVSEELIHRFHIILQTIS FT SGHEIDVEKFRGYVLATARHYIKEYPWYNMPTSVHRLLIHGPDIISSAILP FT IGQLSEDAQESTNKLIKQYRLGFSRKFSRVKTMEDVFRRLLATTDPYISSL FT RKSTPKKLKTLLPEAVALLVPVTEDYEDSDADSDETNFEYNKDFNKETDQD FT EKEPK" XX SQ Sequence 3943 BP; 1423 A; 637 C; 611 G; 1272 T; 0 other; cacagtgggt cagatctcaa aaaagctggt caaaaaggaa aaaaaaaaat tttcgatacg 60 gtacttttgg actatcaggt tggcttcagt gatgttggga ctacttaatg ccgggtatcc 120 agaagatcag ctgttcattt cgtgcgctat attgcagatc gaacgcggca gtcctttgtc 180 tgtcgtactt gcgtacgtta aagctaaaaa atcgtctttt tttcaaaaat aaatattttt 240 gttgtgtgca cgtgaaacaa atattttcaa tatggaattc gtttcacaag gtatgtacta 300 attaaaaaac aaaaatatta ttattaagtc attaattata aaggtttttt ttttaattaa 360 cttaatacag aaaaatattt tataaaattg taaaatttgc tttaaaaata gagataagtt 420 acgaattcca caatattcca ctttcatttg cattgtgttt gtacgtatgc agtatttttt 480 ttgtaattta acattcaaat tcacctacta gtaccaactc cacttttaag ctatgtttta 540 tatgacagtt gcaaaatgcc gttattgaga catcttgaaa aatgaaagta aaattattta 600 attatgagct acatttgtat aattttaatt ataatcgatg tctagaagca ttataaagaa 660 gattaaaagt agaaaaggac gggaacaacg ctttctccta cacagtgaga tgaataactt 720 agaaaaggac agtgatcgct acacacaccg ccacacacac accgccacac agtagttacc 780 attttccaat cttcaagata tgattccatt cccttttttt ttttcttgtt tcatatttta 840 aactgaataa ttggttattt ttttaatgac attattaata ttaaaaaata ttttttatta 900 aaaaagttat taacttcata attaaattaa gaaaaaatta aatataaata caaacattgt 960 acatatgttt tttagcgctt ttaagtattt aataaatgag ttaataaata aatacataaa 1020 taatgcataa attgataact aaaataaaag ttgtatttgt attgtattga aagtatttat 1080 atagtaattt atatttatct atacagtaat tataaatgta tcaatacagt aatttatatg 1140 taatttatac agtaatttaa atctttaagt cattatcttt ttacagtatg gatagaattc 1200 atttaaatta ctggataata cattatacat ttttattaaa attagaatat caaaataaat 1260 aatttagttt aaaatttggt cgccgtttac ggactttcga gtttacggac ttattacaaa 1320 gcaaacgatt tggaaatgga tggaattcat ttaaatgatg gtataaatac aattatatat 1380 tttattaaaa taaaaagtta aaaacttaac taaatattga gtaaatacat tttaaataag 1440 tatacataga ttttttaaat ttaggtttat cagatccacc ggttgccgta tatggacttt 1500 cgaggaaaga ggtctttatt ataacaaaag aatatttttc aaacgatttg caagtgatca 1560 tagaacattt ggaaaagtca ttgactgaaa agattggata tgtaatttca attcgcgaca 1620 acatgagacc tactatttct gttttaatga caaagtttag aaagaaatat aaggaagcag 1680 ctcaacgaca agattacttt ttcaaaaaga acgaaaactg gttaaaaact tcagtatatt 1740 ttcagattga tgcaaaacgt gcaattgtaa ataaatcaac gggtcgaccc accggaagtt 1800 tcgaaacttt atctgatagg tccaaaagaa ggaggactaa agagatgcgt acgaagttca 1860 gtcctgccga actatcattt gctacgcaaa tgagtcttcg atcttcaggg cattcagacg 1920 ctgctaagat tgtgaatgac gttacactgt caagtccatt caaggcaaca aaatacaagg 1980 aaagtttgca actatcatta tcattgccag ccttttatga tgcaaatgaa gctctagcac 2040 ttttattaga cctaagtttg acgaaagatt tgtaccagca tttgagaaat caggcaaaaa 2100 aaaaaggctt aaatacattt tatccaccat actcagctgt attggcagca aaaaaatgtt 2160 gctaccctcc agacgacacc ttaaatttta ccgaatcaaa agctgaggta cagttgcaag 2220 ctttattaga ccacaccgtt caacgtattt tactgctgaa ccaagacaat attacatata 2280 agagtgattc agaaatattg aatatgactt tgatttgtaa gtggggcttt gacggaacat 2340 ccggacaaag catttataaa atgacttttg aagatccgaa tatatcagat gcaagtttac 2400 tttttacttc gcttgttcca ttgcaacttg taggtgtaaa tcaagaaaca aaagaagaaa 2460 atctcttatg gagtaatccg aagccatctt caaccctctt ttgcagacct ctaaaattgc 2520 aatttacaaa ggaaaccgag gaaactttta taaatgagaa aacgcatttt gacaaacaga 2580 tcgaacaact ccaaccatta aaaacaacca ttcatatgaa aaatgttttt attcaataca 2640 gtttaagtat gaccatgtta gatggtaaag tatgcaatgc tatcaccaac acaaaatctg 2700 cgcaacggtg ttatctctgc caggcaactt ccaagcaatt taataacatt gatgaagttt 2760 tgcaaagacc ggtaaccact gattatctac agtacggact gtcaactcta cacgcctgga 2820 ttcgatgttt tgaatgctgt attcatttgg gatacaaaaa aaatacgaaa aaatggcaag 2880 cgcgtaaaga agaagataag caggaaaaag caacaatgaa agctagcatc caaaaagctt 2940 tcaaacagca actaggaatt ttagtcgaca tgccaaagca agggttgggc agcacgaacg 3000 acggaaacac cgctcgacgt ttctttgaaa acactcaaac gtcatccttg ataactggtg 3060 tttccgaaga attaatccac cgttttcata taattttgca aacaatttcg agtggacatg 3120 aaattgacgt tgaaaagttt cgaggttacg tcttagcaac tgcaagacac tatatcaaag 3180 aatatccctg gtacaacatg ccaacatcag tacatagatt gctgatccac ggaccagata 3240 ttatttcttc ggccatttta cccattggtc aactgtccga ggatgctcaa gaatcaacca 3300 ataagttaat aaagcaatat cgtttgggtt tttctcgaaa gttttcaaga gttaagacta 3360 tggaagacgt gttcaggaga ttgttggcaa ccacagaccc ctacatttca tctctgcgaa 3420 aatcaacacc aaaaaaattg aagactttgt tgccagaggc tgttgccctc ttggttcctg 3480 taacagaaga ttatgaagac tctgacgcag attccgatga aacaaacttt gaatataata 3540 aagatttcaa caaagagaca gatcaagatg aaaaagaacc aaaatgatta taactgaaac 3600 aaaatgtttt ttctatattt aacactatct tccatcattt gatccatttt ttagcattta 3660 cggacacatt ggatccacca ttttgttttt taattttttt aaactttttt ttgtcaaatt 3720 tttaatttgt ggcctgaaag aacctttaaa acccatttat gaacaattta aacaattttt 3780 taacatatat atccgccata ttggatccgc cattttgttt ttcttttttt ttttaagcaa 3840 aattttaatc agtaacttca aaaaatattt gccatgaaat ttcaggcaaa tcattcaacg 3900 ttttcttatt ttgaccagct ttttcgatat ctgacccact gtg 3943 // ID GYPSM1_LTR repbase; DNA; INV; 1269 BP. XX AC AAWT01004332; XX DT 30-JUN-2007 (Rel. 12.06, Created) DT 13-JUL-2007 (Rel. 12.06, Last updated, Version 2) XX DE Gypsy-type LTR retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; GYPSM1_I; GYPSM1_LTR. XX NM GYPSM1_LTR. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1269 RA Jurka J.; RT "GYPSM1: Gypsy-type sequence from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(6), 187-187 (2007). XX RN [2] RP 1-1269 RA Jurka J.; RT "Consensus sequence."; RL Direct Submission to Repbase Update (13-JUL-2007). XX DR [2] (Consensus) XX CC The originally reported LTRs were incomplete. Current consensus CC LTRs are 96% identical to each other. XX SQ Sequence 1269 BP; 386 A; 310 C; 283 G; 289 T; 1 other; tggccatatc cgactacctg gaggatatta ggtttcgaga aggatccctg cggtctctcc 60 ggacagcaat acccagccgc agagatatat gaacgcgcag aacctcctca cccaatggga 120 aagggtgcag gagcaagtgg atttccaggt ggttaaggcg aggatgtacc ttgcgaacgt 180 ggtggaagac atcctcgccg ggcggattgc agtcgcctat ccacctcgcc cagtccgaga 240 tacccgacga ccctatacag tggaggctct caaacgaagc ctgaagagat attgtcctga 300 agagtgactt aataatgata gttttatgta gtgattgatt caaaatatta gcaatttgtg 360 tgttattttc tcttataatt agaattattt attatttggt ttaatgggat aartgactat 420 ctaacttaat caagaataaa ttaaaataat ctctttaatt acagcgatgt cgtccctacg 480 aatcttacaa actgaactgg accacgcgag acaggcaaac gcggaattac tcgcccagat 540 tattcaatta acccgcgaaa tgcaacagat gaaagcaaca tggacggatc ccgcaaaaac 600 gaaaacgatc taccaccgac tcaccgcggc ccagaaaggc tgggcagaag agcggcagct 660 caaccagagc ctgcgcacgc agattcgggg actcgaagtc gccctagcgg tatgccggga 720 aggtgaagcg gtaacttatc cgctcatctt cgccccaacc caaatgccgc agacaaccac 780 caagccagca gaacaaccca tcactacaac caacaatcgt cgaccgggac gcaaggaacg 840 tgcccgacgt cgagcaaccc aactccaaaa catcaagtag taaatgttat acatgtgttt 900 gaatcaaata gaatagtttt agccttagtt gagttgttac tttaatataa ttgtatgcca 960 acattgtcat aaatattctc attacgccgc cagttcgttc cacagtcaaa taagaaccta 1020 atcctgaata gttacgaggg acagagcaaa gcgagcatag gtcttctatc gacagctcgt 1080 gaacccgagg gttgtccgag caactacagt actagccaat ggtactcaat ttttgcgggc 1140 aatccaccac ttgaacacat taccaacgtt gcctagcaac gggtgccaag tggcgattag 1200 tatcattttg tatggcgtta ctagggtcac caggaggata agcaagattc gaaaccctgt 1260 aacccacca 1269 // ID Gypsy-234_AA-I repbase; DNA; INV; 6883 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-234_AA_; KW Gypsy-234_AA-LTR; Gypsy-234_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6883 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR [1] (Consensus) XX CC Positions [4096-4695] - Reverse transcriptase CC Positions [5818-6282] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2557..4380 FT /product="Gypsy-234_AA-I_1p" FT /translation="MDSLRPVRPFDSKVDQSQLASEWRRWKRSLEYYLEAS FT GIVGQRQKRNQLLFLGGPDLQDIFDNLPGVNDVPHVAIDPPFYDIAVEKLD FT AHFQPTRRRTYERHVFRQISQKVGERFNDFVMRLRTQANRCEFDQDGGSVL FT DSMIIDQIAEKCLSPSLRKKILEKDRSLNEVVAIGKTIEDVEMQCKELVCG FT APTEVVNNVRMNQNLMFKKIPAFNRPQNWSMLPSKERQFQYSRAQQRSFMP FT ELRTNSSIPSNARPYQPPMGQRSSFIETRICFSCGRRGHLKGSVECPAREQ FT QCAKCKSKGHFAKWCTKRVYEGPKKEPTPKRIKAVFEEENTRGEFQAREDK FT EAEICYVMGQNVFRFLIGGIGVKMAIDSGAAANLIDLATWNELRESGATLT FT FQKQPDRSFKAYGTEEPLKLIGMFGATIEAHENKTEAVFYVAENGKQNLLG FT DETAKILKVLKIGYNVGAVQESSCFPKMKGILVEIPIDPTVKPVQLPYRRV FT PFAIEEKLAKKLEYLIDQDIIERVEEPSPWVSPIVPILKDSGEIRLCVDMR FT RANNAVLRETHPLPIVEELFAGINGAVRFSKIDVKEAYHQVEISERSRPIT FT TFITKQGLFR" FT CDS 4495..6780 FT /product="Gypsy-234_AA-I_2p" FT /translation="MFGISCAPELFQKVMESVVAGLEGVVVYLDDVLVSGR FT TQAEHDKRLHLLLNRMKEYGILLNEDKCVINVSQIEFLGHELTKNGIRPTE FT SRISAIKSFREPRNVSELRSFLGLITYVGRFIPHLADKTEPLRKLLRDGEK FT FIWMDLHKQAFDSIKSAVCEADFLGYFDTKDQCILVADASPTGLGAMLLQK FT DEQGNSRIISFASKALTPIEQRYFQTEREALALVWAVDKFKLYLLGKRFKL FT ITDCKPLHFLFKERAKPCARIERWVMRLQSYNFEVVYEPGSCNLADALSRL FT SVSEATDFDVTGEACIYQLVLQDTPKAMSLEEVERESSRDSTFIAVLESLS FT NGVWDDKSKGYKPFASELCVSGKFLLRGDRLVIPEQLRQRVMEIAHESHPG FT MVVMKRRLRQKVWWPMLDKQVEHFVKKCKACAMVSALDPPEPMNRTEMPDR FT PWKHLAADFVGPLPSGHNLLVIIDYFSRFTEVIIMKQITATLTVKALHETF FT CRFGMPTSLKTDNGPQFISNELEAFCEEFGIKHIRTTPYWPQANGEVERIN FT RSIGKRLKISQETYGSDWKWDIRMFVLMHNSTPHSTTGVAPSTLMFGRVLK FT DKLPGLMTKGCKILEEVHDRDADRKMREADYANQRRMARTSEIKVGDTVIV FT KRVHKENKLSTTFSPEEHVVVDRKGSDVTVKSKESGKELHRNTKHLKKLVT FT ENKDEIPEQNGCPDEEGNSTPKPGNVSIENSESREEISSRPRRETKQPAYL FT EDYQIHNLNSF" XX SQ Sequence 6883 BP; 2327 A; 1124 C; 1551 G; 1870 T; 11 other; tttggcgata cgaggaaaat ttcggccgac gcgacggagt gtatttttgt gaaaagcaaa 60 tttcaagttt ttttcagtgg taagattgaa aggaattggt agtgagatat tttagcacat 120 gtcgtttttt cgattaattg aaatggggaa aattgtgaga agtaagaatt tgataagaaa 180 tcatagaaac gaggagaatt ttatgtggtg tcgaatgaat tgtcctgtgt gcaacaaaaa 240 gaaattagtt ctatggaggt ttcgtataat tattgccacg aagtctcaaa atcaagatgg 300 aattgtgatt gagtagcttt cgaaccgtga atttaaaaaa aaaaaaaaaa tcatgtgamc 360 atcggccaat gcgataaaat aaaatttaaa aaaaaaagtg tggaacggtg atgcaaatgc 420 aagaaaaata caaaatggcc gacttgaatc agaaagcggt ttggctatat gtgtaaagga 480 ttgtgatacg gggacggata tgaaagtcat acaatgaaga agcatcatag tggttgtgtg 540 tgtgtggaga catatcgcgt gagcaaatct agcaaaccgg acaatacata catggaaaat 600 aagtgcctac tgtacatgtg gttgtgcaat cagaaaaaaa aactgtggtt tatctttgga 660 caggagcatg tgacgattga tgatagattt gaaccaagct tggttcattt ggtagtggtg 720 ccgaatgaag cagtggagga tcaaaataaa atgtgtttct gttcctatga ccgttcacct 780 gtgatggtga tttcttaatt atgaaggatc cgtagctgat ktgagccgtc atagtttgag 840 acggaaaacc atttttattc gtgaacgtag tctactgtaa gtacgaatta ttagttttta 900 ctatatcwga aaaaatcgca tacgatgagc agctggggtt cgaccaatac atcgaaaggc 960 taatgcaaac attaaatatt caatgctgta ctgcccatat acgcataatt gtcccatgtg 1020 aataggaaac ccagcaaaca tgggactgac atgcatttat gggtagtgta ttgctcaact 1080 ttttttttgt aaggacgatt tatatttcaa aagtgaataa tagttgaaaa ttcttcgttt 1140 taaattgatg aatatgaatt acatcgattc cttgtttctc atttcattca atatatagaa 1200 ttacattgaa gagaacaaac agaatttcct taatcatgac ccatgctgtc catcctattt 1260 ttttttaaat ttttgatgtt atgtgaaatg agttgagtac tggattcaca taggaaaaaa 1320 caacaaaata aacactatgc atatctaatg tgaaatgagt cgagtactgg attcacataa 1380 aaaaaaaaag aagaaagaat tttagattgc ggttaggtaa tgtgaaatga gttgagcact 1440 gaattcacat tatgtwaaac gaaataaata tcgtacgtgt tatgtgaaat gagtcgagta 1500 ctggattcac ataaaaaaaa agaagtttat tttaacaacg tgaaatgagt cgagcactgg 1560 attcacaatg aaaaaaataa ccactstata tgtatatgta aaaaaaaaag tattgataam 1620 attattgtga ccttttattt tttctatatt ggtgcatgtt gttcattggt aattttaaag 1680 ttttaaaatg tgaaatgagt cgagtactgg attcacatta caaaaaaaag aagaaagaat 1740 tttakattgc ggttaggtaa tgtgaaatga gttgagcact gaattcacat tatgtaaaac 1800 gaaataaaca ttgaacgtgt tatgtgaaat gagtcgagta ctggattcac ataaaaaaaa 1860 atgaagaaga ttcttttaac aacgtgaaat gagtcgagca ctggattcac aacttacaaa 1920 aataaatacg agcattatat gatttggtac aaattggaac tagaagtacg ttggaaatta 1980 aacggaaatc gtagaacata cattacgaat gtgaaatgaa tcgagaactt gatttacgtt 2040 ttgggatcag agattaaata gtgcttattt agaatattca attggctcga acaattgtaa 2100 attaaagcaa gaatcgtgta catgtttcga ctaacaaata actatcggaa aaaaaaatgc 2160 gaaggatttt tgatcggttg aagtgaaatg agtcgagtta ctggattcac aataagtaaa 2220 taaatcatgc aaaaataaaa aaataaaaaa ataaaaagac taaatagttt taaactgtga 2280 aagaagacta gccattatta ttagatattc atccaaggtg aaactgaagt taaaatgaag 2340 taagtagaga attaaaaaaa aaaaawaata ataataataa ttaaatacag tggtatggaa 2400 actaacaata ataatttttt tatcaatttt tatattatca tagctcttgt tagcaatttt 2460 gttaataaat acaagaaaat cgaaaaaacg acatggaaaa caaaacatgt tttggtttgg 2520 agttgggaat gmattttaat tttattttct taggaaatgg attcattgag accagtcaga 2580 ccttttgatt caaaggtgga tcagtcccag ttggcgtcag agtggagacg ttggaagcgt 2640 agtctagaat actatttgga agcgagcggt attgtgggtc aacgacaaaa gcgaaatcag 2700 ttgctattct tgggcgggcc tgatttgcag gatatctttg ataacctccc aggagttaat 2760 gatgttcctc acgtagctat tgatccgcct ttttatgaca ttgctgtcga gaaattggat 2820 gcccattttc aacccactcg tcgacgcacg tatgaacgcc acgtttttcg tcaaatttct 2880 caaaaagttg gtgagcgatt caatgatttc gtaatgagac tacgcactca ggccaacagg 2940 tgtgaatttg atcaagatgg aggttccgtg ctggatagta tgattatcga ccagattgca 3000 gagaaatgcc tttcgccgtc tttgaggaag aagattctgg aaaaagacag gtcgttgaat 3060 gaagtggtcg caattggaaa aacaatcgaa gatgtagaaa tgcagtgtaa ggagttagtt 3120 tgcggagctc ctactgaagt tgtgaacaac gtgcgtatga atcagaatct aatgttcaaa 3180 aagattcccg cgtttaatcg cccccaaaat tggtcaatgt tgccttcgaa ggaacgacaa 3240 tttcagtact caagggcaca acagagatct ttcatgccag aactacggac caattcatca 3300 attccatcaa atgcaaggcc gtatcaacca ccgatgggtc aacgttcctc attcatcgaa 3360 accaggatct gcttcagctg cggacgacgt gggcatttga agggaagtgt ggagtgcccg 3420 gcacgtgaac agcaatgtgc gaagtgtaaa tctaaaggac atttcgcaaa gtggtgcacc 3480 aaaagggttt acgagggtcc taaaaaggaa ccgactccca agcgaatcaa ggcagttttc 3540 gaggaagaaa atactcgagg agagtttcaa gcaagggaag ataaggaagc cgaaatctgt 3600 tacgttatgg gtcagaatgt tttccgattc ctcatcgggg gcattggagt caaaatggct 3660 atagattccg gagctgcggc aaacctcatt gaccttgcca catggaacga attacgmgag 3720 tcaggagcaa ccttaacgtt tcagaagcaa cctgatcgtt ctttcaaagc ttacggaaca 3780 gaggagccac ttaaactgat cggtatgttt ggcgcaacca ttgaggcaca tgagaataaa 3840 acggaagcag ttttctatgt agctgagaac gggaaacaaa acttgctcgg tgacgaaaca 3900 gcaaagatac ttaaggttct gaagattggt tacaacgtgg gggctgtgca agagagttct 3960 tgttttccga aaatgaaagg aattttggta gaaattccaa tcgaccccac cgtcaaaccg 4020 gtacaactac catatagacg agtccctttt gcaattgagg aaaaactcgc aaagaagctg 4080 gaatatctaa tcgatcagga cattattgag cgtgttgaag aaccatctcc ttgggtatcc 4140 cctatcgtac caattctcaa agattcagga gaaattcgct tgtgcgtgga tatgcgacgt 4200 gccaataatg ctgtcttgcg agaaactcac ccacttccga tagtggagga gctattcgcc 4260 ggtataaatg gagcggtaag attctcaaag atagatgtga aagaagcata tcaccaagtc 4320 gaaatatcag agaggtcaag gccaattaca acgtttatca cgaagcaagg actgttcagg 4380 taatggttga agatgatttt gatttttgtt gttgttgggt aaatgaaata aatgaaatgt 4440 tgaattccgg gaacaatttc tttttcttaa tcttgacaga tacaaacggg ctagatgttt 4500 gggataagct gcgctcctga actcttccag aaagttatgg agtctgttgt agctggatta 4560 gaaggggttg tagtttatct ggacgacgta ctagtatcgg gcagaaccca agcagaacat 4620 gataaaaggt tgcatttgct tcttaaccga atgaaagaat atggaattct tttgaacgaa 4680 gacaaatgtg tcataaacgt gagccaaatt gagtttcttg gccatgaatt gacaaaaaac 4740 ggaataagac ccactgagag cagaatttcg gcgatcaaga gtttccgtga accccgaaat 4800 gtatcggaat tgcgcagttt cttgggcctc attacatatg ttggacgttt tattccgcat 4860 cttgctgaca aaacagaacc actacgaaag ctacttcgag atggtgagaa atttatttgg 4920 atggatttgc acaaacaagc attcgattct atcaaatcag ctgtatgtga agcggatttt 4980 ttgggttatt ttgacacaaa agatcaatgc atactagtag cagatgcaag tccgactggt 5040 ttgggggcaa tgctattgca gaaggacgaa caagggaaca gcagaattat atcattcgct 5100 agcaaagcat tgacaccaat agagcagcgt tatttccaaa cagaacgcga agccttggct 5160 ttagtctggg cggtagataa gtttaagttg taccttttgg gaaaacgctt caaacttatt 5220 acagactgta agccattgca tttccttttc aaggaacgtg ccaagccttg cgcccgaatc 5280 gaaagatggg ttatgcgcct acaatcttac aatttcgaag tagtctatga accaggatct 5340 tgcaatttgg cagatgcgtt atcaagactt tcggtttccg aagctacaga ttttgatgtg 5400 acgggagagg cctgcatata tcagcttgtc cttcaggata caccgaaagc aatgtcctta 5460 gaggaagttg aaagggaatc ttccagagat tcgactttta ttgctgttct tgagagtttg 5520 agtaatggcg tatgggacga taaatcaaaa ggatacaaac cttttgcatc agagctgtgt 5580 gtatcgggaa aatttttgct tcgaggtgac cgattggtca ttccggaaca acttcgacaa 5640 agagtaatgg agatagcgca cgagtcacat ccaggaatgg ttgttatgaa acgcagactt 5700 cgccaaaaag tttggtggcc aatgttggac aagcaagttg aacatttcgt aaaaaaatgc 5760 aaagcctgtg caatggtatc cgcacttgat ccaccggaac ctatgaatag aacagaaatg 5820 ccagaccgcc catggaaaca cttggctgcc gattttgttg gaccgttgcc atctggacat 5880 aacttgcttg tcataataga ctacttcagt cgattcacgg aggtgatcat tatgaagcag 5940 ataactgcga cacttaccgt caaagcatta catgaaacgt tctgtcgttt tggtatgcca 6000 acatcgctta agacagataa tggaccgcaa tttatmagta acgaactgga agctttctgt 6060 gaggaatttg gaatcaaaca cataagaacc acaccatatt ggccgcaagc taatggcgag 6120 gtcgaaagaa tcaatcgttc aattggaaag cgtctaaaaa taagccaaga aacatatggg 6180 tctgattgga aatgggatat ccgtatgttc gttctaatgc acaactcgac tccacattcg 6240 actacgggag tagcaccgtc gaccttaatg tttggtcgtg ttttgaagga caagttaccg 6300 ggattgatga ctaaagggtg caaaatactg gaggaagtac atgatcgcga tgctgatagg 6360 aaaatgcggg aggcagatta tgcaaatcaa cggcgaatgg cgagaactag tgagataaag 6420 gtcggtgata ccgttattgt gaaacgagtt cataaagaga ataagctgtc aacaaccttc 6480 agcccagagg aacacgtagt tgttgataga aagggatcag atgtgacagt caaatcgaaa 6540 gagtctggca aggaattaca ccgcaacaca aaacatctca agaaactggt aacagagaat 6600 aaggatgaaa ttccagagca aaatggatgt ccagatgaag agggcaacag cactccaaag 6660 cctggaaacg tatccatcga gaattcggag agtcgtgaag aaatcagttc cagaccacgt 6720 cgcgagacga aacaaccggc gtacttggag gattatcaaa ttcataatct caacagtttt 6780 taaagaagaa gaggaatgta gtgtcgcact aatccatgaa ataactcgaa tgtatataaa 6840 ccgtgaaacg attatacctt gcattgtcaa atacctttga ttg 6883 // ID I_Ele9 repbase; DNA; INV; 6126 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele9. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6126 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6126 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 6 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 73..1413 FT /product="I_Ele9_1p" FT /translation="MTAASSGDPGGSKESWPCIGNPSNVEFGSLTYLLLRS FT KDDKVPLTSNPFIVGQSVELAAGGPIEGASTEAQCTRYTLKVRNQTQVKKL FT LRMSTLLDGVEVIIEPHPILNVSRCVISCFETVHMTEDAMLDGLNSQGVIR FT VQRITRKEHGKPVNTSALILTFGRCDYPSHVKIGLLRVATRPYYPNPLLCY FT GCVRFGHPRVRCPGPKRCVNCSAEHELLEGEVCSFAAHCINCNGPHRPTSR FT QCPVFKKEMEVVRLKVDQNLSFPEARRRIEQDLGSYAAAAAQQTADRKRLD FT ELEKKMEQKEALISQLLVKIQDKDERIEQLLLQINRMKTVENQDSQQNTPT FT QPLKSCQANKPSEPVKPIEAIQKAREQLTTATRMQLRNRSPATTQTAGTET FT RNRKKKHNRSTNGSPGRQSPPPKKTPAEEDPNQEELSDNGIEIEETPPSQS FT FR" FT CDS 1728..5774 FT /product="I_Ele9_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MSVDLPSPTACPLPHAITISRSLLRGSVCSEVGNLRH FT TNVEPSQSYRLSAGETHRHRSNSPQPPHQKQQTRKISSERGSRRGEGEREV FT TSPLPRRGVDLRGVECGTSNECLARSVSSWHPVDSQTRPLSSSASSSSSST FT KSNSQADKCYALQWNIRGLRANINELKQLIHEYEPLVIALQETKANSRSVP FT SHFVGSNYTLILQTGNCQYWQQGVGLAIREGVPFERIQLDCNLQAIAVRIH FT LPATVTVVSIYVPPSTQQCQAEMNKLFEQLQGPALILGDFNAHHMAWGSQT FT ENSLGRFIAEKSLTKQMIILNDGTHTRIDPATGHTSAIDLTICSEALAAKY FT VWRTLTDTYNSDHFPITVSIRGWSHSLTKKRRWLYDQADWATFEQTIANSI FT KVGEDIGVDEFANRLLSAAVASIPRTSGRVGPKSAPWWNSQVEEAVKKRRK FT ALRTLKRKKKTNSDITEALKNFQQTRTAAKKVVQNAKTQSWENFVAKISPN FT STAKDMWQTVNKLRGKQQNNTAIIKRSSGFTDNVEEVAEELAQHYSKISAS FT TSYPPLFQRAKEKAEQNRMNVLPNTEDIYNCDFTLNELLWALDKGRSTSTG FT PDSIGYPMLQRIPLSVKSALLDLFNRIWHSGVFPASWRTGIIVPIPKPNSN FT DSGPAAFRPITLINCMAKVFERIVNRRLMTELESKGRLDKRQHAFRAGYGT FT DTYLAELERSLPENNEHCLVASLDLSKAYDTTWRYGILRTLRKWRIGGRMF FT NILNSFLTDRTFQVQVQGHLSRIMPLENGVPQGSVLSVTLFLVAVQPVFRV FT IPNAVEILLYADDIILIVRRRKEETLYRTLQAAVKAVSKWAKSVGFRISPS FT KSKIFYGSPNVRRAPRNNICIDQVPIPKTNKLKIIGITIDRKLTFQQHCKT FT IKSSCESRLRILKIIGNRLPRGHRTSLLRIGSAIITSRLLYGIGVTSRAEN FT AVIKILSPIYNQMVRYASGAFVTSPVLSVMSEAGTLPFHLLVLQSVTQVAI FT RILEKNEAAGDLPIIQRVFSRLEEVXNISPPNISIRTRLTDRIWHKPKPQV FT VWTVKQQVKAGDPPEMVRPIVHELLSGRFNRYTVVYTDGSKSNGTVGAGVF FT GENISRSVGLPPQCSVFSAEAFAIKLALNLVENSKELLILSDSASCLAAIE FT SGKSQHPWIQGIEDALHNRVVYLCWIPGHAGVAGNEEADRLADEGRHRTPM FT NVALPGKDTKKAMKQAIRQSWESQWSLLRDVKLREVKFTTEKWPDHANPID FT QRVLTRLRIGHTRITHEFLLKRTCPPVCDCCGTTLDVRHLILECRKYDEAR FT RQHGIDPTSLRVALSCDSENEEKLLKFLRDTNIYKYV" XX SQ Sequence 6126 BP; 1931 A; 1485 C; 1349 G; 1356 T; 5 other; ttaacttttt atcaaattcc aacgttcgtc gtgtacaaca ataaagaagt ggcagtgtag 60 ggtggtgtaa acatgacggc cgcaagcagc ggcgaccctg gtgggtccaa ggagtcatgg 120 ccatgtatag gaaatccgtc gaatgtcgaa tttggttcac taacgtatct tctattacga 180 agcaaggatg acaaggttcc actgacgtcg aatcctttca ttgttggaca atcggtagaa 240 ttggctgccg gtggaccaat tgaaggagca tcaacggaag cgcaatgcac aaggtacact 300 ctcaaggtca gaaatcaaac tcaagtcaaa aagctgttga gaatgtctac gcttttggat 360 ggcgtcgaag ttatcatcga accgcacccg atcctaaatg tgagccgctg cgtcatctcg 420 tgcttcgaaa cggtccacat gacagaggat gccatgctgg acggtctgaa tagtcagggc 480 gttattcgag tccaacggat cacgcgtaag gagcatggaa agccggttaa tacttctgcc 540 cttattctta ccttcggtag atgcgactat ccatcccatg taaaaattgg ccttcttcgt 600 gttgctacac gcccgtatta tccaaaccca cttttgtgct acggatgtgt ccgtttcggc 660 catccgcgtg ttcgctgtcc tggaccaaaa cgttgtgtta actgctctgc cgaacatgaa 720 ctgttggaag gcgaagtgtg ttcattcgct gcccactgta ttaactgcaa tggacctcat 780 cgaccaacca gccgtcaatg cccagtgttc aaaaaggaga tggaagtggt tcgactaaaa 840 gttgaccaaa acctatcttt tcccgaagca cgtagacgga tcgagcaaga tctaggaagc 900 tatgctgccg ccgcagctca acagacagcc gatcgcaaaa ggttagacga attggaaaag 960 aaaatggaac agaaggaagc tcttatctcg cagctacttg taaaaatcca agataaagat 1020 gaaagaattg aacagctcct gttgcaaatc aacagaatga aaactgtgga aaaccaggat 1080 tcccaacaga acacaccaac ccagccccta aaatcttgcc aagcaaacaa gccatcggaa 1140 ccagtcaaac caattgaagc catccaaaaa gccagggaac agctcacaac cgcaactcga 1200 atgcagctca ggaacagatc acccgcgaca acacaaaccg ctggaactga aactcgaaat 1260 aggaagaaaa agcacaacag aagtacgaac ggctcacccg gtcgacaaag ccctccccct 1320 aaaaagacac ccgctgaaga agatcctaat caggaagaat taagtgataa cggcattgaa 1380 atcgaagaaa cgcctcctag tcagtctttt cgataagatg cccctctcca cacgcccacc 1440 gatcaacgac accaacctag caacgaaacg aacagcgaaa acaatcgatt tgaagggagg 1500 gaatgtgtag tacccccttc actctcacaa gcgagagcgc agcaggctgg aggatccatc 1560 cgggacgtcc taccccaacc cgttgatgga gaatccccct ctgcggtcga tcccgaagaa 1620 gtatagtgta cacccctaat ccacccagca taccacttcg aattcctata catagccaca 1680 acgtatccaa cttgtcctcc atcaatgtag ctataaacca tccagaaatg tcagtagacc 1740 tcccgtcacc aacagcatgc cctctgcctc acgcaatcac cataagcaga tctctactca 1800 gaggatctgt ctgtagtgaa gtgggcaatc tccgtcacac caatgtagag ccttcgcaat 1860 cctaccggct ttccgctggt gaaacccatc gtcatcgatc aaactctcct caaccacctc 1920 atcaaaaaca gcaaacccgc aaaatcagtt cggagcgcgg cagtcgacga ggggagggag 1980 agcgtgaagt aacctccccc ctccccagga ggggagtgga tctccggggg gtagaatgtg 2040 ggaccagtaa cgagtgtctt gcacgttcag tttcctcctg gcatccggtc gattcgcaaa 2100 cacgcccgtt gtcgtcatca gcatcgtcgt cgtcttcttc gactaaaagc aacagtcaag 2160 ctgacaagtg ctatgcactg cagtggaaca tcagaggctt gcgtgccaac ataaacgagt 2220 tgaaacaatt gatccacgaa tatgaaccgc tcgtgatcgc tctccaagag actaaagcga 2280 acagtcggtc agtcccttcc cactttgtcg gaagcaacta tacccttatc ctacagacgg 2340 gtaactgcca atactggcaa cagggagtag gcctcgctat cagagaggga gttccgtttg 2400 aacgcatcca gttagattgc aaccttcaag ctatcgcggt ccgcatccac cttccggcaa 2460 ccgtaactgt ggtgtctatc tatgtgcctc caagtactca acaatgccaa gcagaaatga 2520 acaaactttt cgagcagttg cagggaccag ctttaatact gggtgatttc aatgctcatc 2580 acatggcttg gggctcccaa acagaaaatt cattaggtcg ttttatagct gagaagtcac 2640 ttacaaaaca aatgataatc cttaatgatg gaacgcatac ccgaatcgac ccagctactg 2700 gtcacacctc agctatagac ttaacgattt gctcggaagc actggccgcc aaatatgttt 2760 ggcgtactct cacagatacg tacaacagtg accatttccc tattactgtt tcaatccgtg 2820 gttggtcaca ttccttgacc aaaaaaaggc gatggcttta tgatcaagct gactgggcaa 2880 cgttcgaaca aacaatcgct aatagcataa aagttggtga ggatatcgga gtagacgagt 2940 tcgctaaccg actactatcg gcagctgtag caagtattcc ccgaacatcc ggcagagtcg 3000 gaccgaaatc agccccatgg tggaattcmc aagtagaaga agccgttaaa aagaggagaa 3060 aggcactgag aacgttaaaa cgcaaaaaga agacaaattc ggatataacg gaggccctta 3120 aaaatttcca acagacaaga acggcagcaa aaaaagtagt ccaaaacgca aaaacacaat 3180 cgtgggaaaa ttttgtggct aaaatwtccc caaattctac ggctaaggac atgtggcaga 3240 cagttaacaa gcttagagga aagcagcaaa ataataccgc cattattaaa agatccagcg 3300 gattcacgga caacgtggaa gaagtggcag aagagctagc acagcactac agcaagatct 3360 ctgcgagtac cagctatccc ccattgtttc aaagggcaaa ggaaaaagct gaacaaaatc 3420 gcatgaatgt cttgccgaac accgaagata tctacaactg cgacttcacc ttgaatgagc 3480 tcttgtgggc actcgacaaa ggtcgtagta catctacagg accggattct ataggatatc 3540 cgatgctcca acgaatcccg ctatctgtaa aatcagcact gttggatctc ttcaaccgaa 3600 tttggcatag cggcgttttc cctgctagtt ggcgaactgg gataattgtg ccgatcccaa 3660 aaccaaactc caacgattct ggaccggctg cgtttcgtcc cataacttta atcaactgta 3720 tggcaaaggt ctttgagcgt attgtgaata ggcggctgat gactgagctt gaatcgaagg 3780 gccgacttga caagcgacag cacgccttca gagctggtta cggtacggat acctaccttg 3840 ctgaacttga aagatcacta cccgaaaaca acgaacattg cctggttgcg tcgcttgatc 3900 tttccaaggc atacgacact acgtggcgat atggcatcct ccgtacactt cggaaatggc 3960 gtattggtgg tcgtatgttc aacattctga acagctttct tactgataga acgttccaag 4020 ttcaagtgca aggacattta tcacgcataa tgccacttga aaacggcgtt ccacaaggct 4080 cagtgctttc agtcactttg tttttagtgg ccgttcaacc ggtgtttcgt gtgattccaa 4140 atgcagtcga aattctcttg tatgctgatg acataatact tatcgttcga aggagaaagg 4200 aagaaacgct ctaccgaaca ctacaggctg cagtgaaggc ggttagcaaa tgggctaaga 4260 gtgtcggttt tagaatttca ccttcaaagt caaaaatatt ctacggaagt cccaatgttc 4320 gcagagcacc tcgtaacaat atttgcattg accaagtacc tattccaaag acaaacaagc 4380 ttaagataat tggaatcacc atcgatcgaa aactcacttt ccagcaacat tgcaaaacga 4440 taaaatcttc ctgcgaatcc agactacgga ttttaaaaat tatcgggaac cgacttcctc 4500 gtggacatcg cacctcccta cttcgtatag gatcggcaat aataacctcg cgtctcttat 4560 atggcatcgg agttaccagc agagcggaga acgcagttat aaaaatcctc tcacccatct 4620 acaaccagat ggttcgatac gcgtctggcg cctttgtcac cagtccggtt ctwtcagtta 4680 tgtctgaagc aggcacactg cccttccatc tccttgtact gcaatccgta acccaagtgg 4740 ctatccgcat attagagaaa aacgaagctg ctggagattt accaataata caaagagtat 4800 tcagtcgtct ggaggaagta atkaatatat cacccccaaa catcagcatt cgaactcgtc 4860 taactgatag aatctggcat aagccwaaac cacaagttgt gtggactgtt aaacaacagg 4920 ttaaagctgg cgaccctccg gaaatggtac gccccatcgt tcatgaactc ctatcaggtc 4980 gattcaacag atatactgtt gtctatacag acggatcaaa aagcaacggt actgtcggag 5040 caggagtgtt tggcgaaaat atttccaggt ctgttggcct tcctccacaa tgcagtgtct 5100 tttcagcaga agcgttcgca ataaaattgg cactgaacct cgtcgaaaac tcaaaagagc 5160 ttcttattct gtcagactca gctagctgtt tagcagccat agaatctggc aaatctcaac 5220 atccgtggat ccaaggaatc gaggatgcac tgcacaaccg tgtggtatat ctgtgctgga 5280 taccagggca tgctggtgtc gctggaaatg aggaagcaga tcgccttgcc gatgaaggta 5340 ggcatagaac tccgatgaat gttgcacttc caggaaaaga caccaaaaag gcaatgaagc 5400 aagcaatcag acaatcatgg gaaagtcaat ggtcactgct tcgagacgta aaactcaggg 5460 aagttaagtt tactacggag aagtggcccg atcatgcaaa ccccattgat cagcgcgtgc 5520 tcacccgatt acgcattggg cacacaagga taacccacga atttctactc aaacgaacat 5580 gtcctccagt ctgcgattgc tgtgggacta cgctagatgt ccggcacctt atactagaat 5640 gcagaaaata cgacgaagct agaaggcaac acggtataga tccaaccagc cttcgagttg 5700 cgcttagctg tgacagtgaa aatgaagaaa aattattgaa gtttttgcgt gacaccaaca 5760 tatacaaata tgtgtaatcg agcaaagaaa tagtgaaaaa aagaaaccaa tagaaaacag 5820 tgagtgaaga acaacaaaac attaattatg tgaagcaagg ggaaccatca agaaagtgaa 5880 gaagaactaa gcaacagaca aaaagagaag aaataaagaa caggtggaaa caaaaagaag 5940 atatcagtga acagaaatac aacaaagcga gtcatgcgaa ggaaggagaa acaacacagt 6000 atcgaagttg gcagaaacga cagatagaca acagaacaag aaagctaagt tatattttat 6060 gatattttct ttagaggtga atgatcacga gatgattaaa acctctctaa taaaataaaa 6120 aaaaaa 6126 // ID Gypsy-214_AA-LTR repbase; DNA; INV; 532 BP. XX AC supercont1.9; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-214_AA_; KW Gypsy-214_AA-I; Gypsy-214_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-532 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.9; Positions 1361236 1360705. XX SQ Sequence 532 BP; 191 A; 81 C; 120 G; 140 T; 0 other; tgtagaatat agaactgtta aataaataca taaaaacgta cttacagctg aatttgatgc 60 tcgtgataaa atcactttga aaaggtttac cctagcaacc agtttgtcca ctcaattcgt 120 gatgggatca aaacaaactt aacatcgtaa aaaaaaagta aataagatta tagatgacgt 180 ttctcacccc aatgaaaata atatgcatga ggttatagtg ctaccgggcc tgaacatttt 240 ggctaccaat tctgaatgac ttttaaaaag ggcttaatac atggaagtga acagaaaatg 300 gaccatatat gagtgtggat ttccggctaa ctaaatttac agaggagaaa gagaaggata 360 gcgattgaag aaagcggaaa agggatagaa acaacgactt ggggtttgga tttgcgggaa 420 cgggatgagg attcattctg gtgactaatt tcatagcgaa cggttcgttg tgatattttc 480 gcgaattttt cgaagtgcgg aatgtcggaa aaagcagacc tcggacacta ca 532 // ID BEL-227_AA-I repbase; DNA; INV; 5887 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-227_AA_; KW BEL-227_AA-LTR; BEL-227_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5887 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 909-909 (2011). XX DR [1] (Consensus) XX CC Positions [4865-5455] - Integrase core CC 'GGTTT' target site duplication CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 542..5857 FT /product="BEL-227_AA-I_1p" FT /translation="MSSKSSMTNLMQPSKINCLFASNTLMRCGVNLRKLRK FT KLRFWRMHRKGFQQEREDFQTLYCELKASLQSKMPPQQPAAPPTPVQPISS FT SVPISSCVRLPEIKLNEFSGNLDDWVSFRDLFVSLIHTSVQLTQVQKMHYL FT RAALSGEAARIISSLEISANNYLVAWNLLKERFENPCLLVKRHMSALLSIP FT SLKKESAQGLADLADEFNRHVQLLDKLEGAENHWNSFLVERLSQCLDPVTL FT REWETLVSEDGEYPKFQELLEFIHRRSRVIQTLKLSHSANTQSEIKHPKSR FT IVSHVTSDNVPKCVSCKQAHFLFQCEQFRTLSPQQRFEVAKKHGLCLNCLK FT GTHQAKNCSSGSCKSCARKHHSLLHLPPFSSVPGSQPATTSSRSAEASNSQ FT SCQMSHSVPNVVESSPSVELAVASPPVSSSFRSKSSRGNPFPSSSVDFVAN FT SPSTSCQSADVTTQPRQSTVILSTAVIKVKNADNQYVCARALLDSGSQPSF FT ISESLCQKLRLKRTKLNSPVSGIGQSVVNARYAVTLSLKSRFETYTAQLDC FT LVLPKLTVALPSHHIDVSRWRIPKNLPLADPQFNISQGVDIIIGAALFFDL FT LEHDQISLASGYPTLQKTVLGYIVCGKLEQQTPDTTNMQSCNICVEDRLDT FT QLQRFWEIENFDTGKAYTPDEQYCEDHFQSTVARDNSGRYVVRLPLKEEKL FT SMLGDSHTTALRRFQQMEKRFAVDEGLRHSYTEFMEEYERLGHMELAPVAS FT RNPQFFLPHHAIQRPESSTTKTRVVFDGSCRGAASLSLNEALYVGPTVQPA FT LFSTVVNFRLPRFVITADAEKMFRQIWVHPDDRKFQQILWRGSSTEPIRTY FT QLKTVTYGLASSPFHAARVLIQLANDEGHRFPLAVPVILKGTYVDDVITGH FT DDHDTMAETCKQLSEMMKSGGFVLRKWASNYKTVLIHIPEELWETSAELEL FT DRSEAIKTLGLLWFPQRDVFKFKVPTLPDLPVVTKRIVVSEMSQLFDPLGL FT LGPVVVNAKMFIQSLWAANLSWDSELAEESATWWRNYRADIKQLQQLVVPR FT RVLWNTQHKYSIHCFCDASQKGYGCCVYIVSPDELGQLHSHLLTSKSRVAP FT LRGQSIPRLELCAALLGSQLVDNLLANTNIDASVTFWTDSSIALHWIKSRS FT NSWKVFVSNRVAEIQRLSKNSTWMHVPTDLNPADHISRGMLASQILTDKLW FT WHGPAFLTSPVEQWPKCAVLMPTKSEVEEEMRPLVSLPLMSEDASIFSEFS FT ELAKLIRFVAYCFRFRNNCKLPKNQRVLSSLSPEEVDFALKSMIRLAQRQE FT FPTEIGLYSRNKSDSTANIASKSPLKNLNIFIDEFGLLRIDGRLKKLNAPF FT DTRHPILLPANHKLSWLIARSVHLQTLHGGPSLLLATIRQRFWPLRGRDLA FT RKVVRQCVTCFRCNPTPASQIMAPLPSVRLRPARAFTYAGMDYCGPFYVRP FT LIGKGTSVKVYVALFVCLVVKAVHLEVVPDLSSVACIAAVKRFVARRGRVL FT ELHCDNATTFVGADRELRQLRKEFQLQFKSPEWGDYCSGNGITFRFIPARS FT PHFGGIWEAGVKSFKYHFRRIMGQKAFSMDQLLTVVAQIESVLNSRPLVPI FT SDSPDDLSALTPGHFLIGEPLVSIPEPDLLQKSPNRITRFQEMQRSVQDLW FT RCWSRDYVSQLHQRSKWRRPSVDVRKGQLVLLKNENYPPLQWPLGRIVETI FT AGSDGRVRVVVVKTASGEYKRAITEIAVLPIDSDDEEEKTSLLPSTSDAVD FT G" XX SQ Sequence 5887 BP; 1432 A; 1569 C; 1387 G; 1497 T; 2 other; ttttggtgcc gaaacccggg attgaggttg caagatwagt gaacctaacc tcaccaccac 60 gtggctcgaa ggttatctcg tgctccattg ttccaacgat caggatctgg ttcattggct 120 gatcgaatcg tcatcgcatc atctcgtcgt catcgtcgtc gtagccgctg actggtcatc 180 accatcagcg tgctggctgt gctcctcatc ggtagcacac catcctcatc atcatcttcg 240 tcgtcgccgt cgtcatctcg ttgctgattg ctccagtagc agcagtgtgc cctatcggca 300 taaccatcat cgcttcgtcg tcatcatcaa gagtctccgt gcagcgtgcg ccggacgaga 360 ggtactcagc gcccaaaggg gagattgtgg taaggtagca gcagtagcgg aaaacacagg 420 tacttgattc attttttcgg tgaacggtgc gataaaatga tgcctactga gaaggcgacc 480 gccaagcaaa caaagttaaa acttctttac cgtcggcgga caaacatttt gggttccgcg 540 aatgtcatca aaaagttcaa tgacgaattt gatgcaacca agcaagatca attgcctatt 600 cgcgtccaac accttgatgc gttgtggcgt gaatttgagg aaactcagga agaaattgag 660 gttctggagg atgcacagga agggttttca gcaagagcgg gaagatttcc aaacccttta 720 ctgcgagttg aaagcgtcgt tgcaaagtaa aatgcccccg caacaacctg cagcaccccc 780 cactcccgta caaccaatct cttcgtcagt accgatctcg tcgtgcgttc gtctgccaga 840 aatcaaactg aatgagttct ctgggaatct cgacgattgg gtttcgtttc gcgatctgtt 900 tgtctcgctc attcatacga gcgtgcagct tacgcaagta cagaaaatgc actatttgcg 960 tgctgcgctt tctggcgaag cagctcgcat catttcttcg ctcgagattt cagccaataa 1020 ctatctcgta gcttggaatc tgttgaagga acgcttcgaa aatccatgct tattagtgaa 1080 gcgacacatg tctgcattgc tttcgattcc ttcactgaag aaagaatctg cgcagggtct 1140 tgcagatctt gccgacgaat tcaatcgtca cgtacagctg ctggataaac tcgaaggggc 1200 tgagaaccac tggaattctt ttctggtcga gcgactgagc caatgtctcg accccgttac 1260 actccgtgag tgggaaacac tggtttccga agatggcgaa tacccgaagt ttcaagaact 1320 tctagagttt atccacagaa gatcgcgtgt tattcaaacc cttaagctct cccattcggc 1380 aaatacccaa tccgagataa aacatcccaa atcacgaatc gtttcccacg ttacttcgga 1440 caatgttccc aaatgtgtca gctgcaagca ggcacatttc ctgttccagt gtgaacaatt 1500 tcgaacgctc tctccccaac aacggttcga agttgcgaag aagcacggcc tgtgtcttaa 1560 ctgtctgaag ggaacacatc aagcgaagaa ttgctccagc ggttcctgta aaagttgtgc 1620 aaggaaacat cattcgttac tacatctgcc accgttttcg tcagttcctg gttctcaacc 1680 agcaacaaca tcaagtagat ccgctgaagc aagcaattcg caatcgtgtc aaatgtcgca 1740 ttccgttccg aacgttgtcg aaagctctcc gtcggtagag ctcgccgttg cctcaccacc 1800 tgtctcgtcg tcgtttcgtt cgaagtcgtc gcgcggtaat cctttcccaa gttcgtcggt 1860 tgatttcgtt gccaattctc catccacttc ctgccaaagt gcagatgtta ctacccaacc 1920 tcgtcagtca acggtcattc tgtcaacagc agtgataaaa gtgaaaaatg cagataacca 1980 gtatgtttgc gccagagccc tattggatag tggatcacag ccaagtttca tctctgaatc 2040 actgtgtcaa aaactccgtt tgaagcgaac aaagctgaat tcaccggtca gtgggatcgg 2100 acaatccgtc gttaatgcgc gctacgctgt gacactatcg ctaaaatctc gtttcgaaac 2160 ctacactgct caattagact gcctagtatt gccgaagtta accgttgcat tgcccagtca 2220 ccacatcgac gtctcgcgct ggagaatccc gaagaatctt cctctcgctg atccacagtt 2280 caacatcagt caaggtgtgg acatcatcat cggagctgcc ctgttcttcg atcttctaga 2340 gcacgatcaa atctcgctcg cctctgggta tccgacgttg cagaaaacgg ttcttggata 2400 catcgtctgt gggaaactag agcagcaaac gcccgatacc acaaacatgc agtcctgcaa 2460 catctgcgtt gaagatcggc tcgataccca gcttcagcga ttttgggaaa tcgagaactt 2520 cgatacaggt aaagcataca ctccagatga acaatactgc gaagatcact tccaaagcac 2580 tgtcgcccgt gacaattcag gtcgctatgt tgttcgtctt cctctcaagg aagaaaaact 2640 gtcgatgctc ggagattcgc acaccactgc gctccgaagg ttccaacaaa tggaaaaaag 2700 gtttgccgtc gatgaaggtc tgcgtcacag ttatacggag ttcatggagg agtatgagag 2760 gttgggtcac atggagttgg ctcccgtcgc gtcgcgtaac ccacagtttt tcctcccaca 2820 ccatgccatc cagcgtccag agagttcgac caccaaaacc cgtgtggtat tcgacggttc 2880 ctgtcgtgga gctgcctcac tatcgctgaa cgaggctcta tatgtcggac ctactgttca 2940 gcctgcgctc ttttcaactg tcgttaattt tcgtctgcct cgcttcgtca taacagccga 3000 cgcagagaaa atgttccggc agatttgggt ccacccagat gaccgtaaat tccaacaaat 3060 cctctggcgt ggaagttcaa ckgaaccgat ccgcacgtat cagttaaaaa ccgtcaccta 3120 cggtcttgca agttcgccgt ttcatgctgc tcgagtctta atccagttag ccaacgatga 3180 aggtcatcgc ttcccgctgg cggtgccagt gatactgaaa ggcacctacg tagacgacgt 3240 tattaccgga cacgacgatc acgacacaat ggctgaaact tgtaagcagc ttagtgagat 3300 gatgaaatcc ggcggtttcg tactccgtaa gtgggcttcg aactacaaaa cagttctcat 3360 ccacattcct gaagaacttt gggaaacctc tgcagaattg gagctcgatc gctcggaagc 3420 tatcaaaacc ttggggttgt tatggttccc ccaacgggac gttttcaagt tcaaggtccc 3480 tactctacca gatcttcccg tcgtgactaa aaggattgtc gtctcagaga tgtcccaact 3540 tttcgaccca ctcggacttc ttggaccagt ggtcgtaaac gccaagatgt tcattcagtc 3600 tctctgggca gctaacttgt cgtgggattc tgagcttgca gaagagtcag caacctggtg 3660 gagaaattac cgtgctgaca tcaaacagct tcaacagctg gtagtcccga ggagggtact 3720 gtggaacact caacacaaat attcgatcca ctgtttctgc gacgcctcac agaaaggcta 3780 tggatgttgt gtctacatcg tatcgccgga tgagctaggc cagctccatt cgcatctact 3840 cacatcgaag tctcgtgtcg cacctctacg gggacagtca attccacgct tggagctttg 3900 cgcagctctc ctcggaagtc aattggtgga taatctcctg gctaatacta atatcgatgc 3960 gtcagtcacg ttttggaccg actcgtctat cgcacttcac tggatcaaat ccagatcgaa 4020 ttcctggaag gtctttgttt caaaccgagt cgcggagatt cagcggctgt cgaagaattc 4080 aacgtggatg cacgtaccga ctgatttgaa ccctgctgac cacatctccc gaggaatgct 4140 tgcaagccag atccttacag acaagctgtg gtggcatggt ccagcgtttc tcacatcccc 4200 cgtcgaacag tggccaaagt gtgcggtttt gatgccgacc aagtcggaag tggaagaaga 4260 gatgcgtccg ttggtatccc tacctttgat gagcgaagat gcatccatct tcagcgaatt 4320 ctctgaacta gccaagctaa taagattcgt cgcttactgc tttcgattcc gaaacaactg 4380 caaacttccc aagaatcaac gagttctttc atcactgtcg cctgaagaag tggattttgc 4440 actgaagtca atgattcgtc ttgcccagcg ccaggaattt ccgacagaaa taggcttgta 4500 tagccgcaat aagtcagaca gtactgctaa tatcgcatcg aaatctccac tcaagaatct 4560 caatattttc atagacgaat tcggacttct tcggattgat ggacggctga agaagctgaa 4620 tgcccctttc gacacacgtc atccgattct gttaccagca aaccataaac ttagctggtt 4680 gattgcaagg tccgtacatc tgcagactct tcacggtgga ccttcgctgt tactcgctac 4740 aattcgtcaa cgcttttggc cacttcgggg tagagacttg gctcgtaagg ttgttcgtca 4800 atgcgttacc tgtttccgat gcaatccaac acccgcaagt cagattatgg caccgttgcc 4860 atcagttcgc ctcagaccgg cccgagcctt cacttacgct gggatggatt actgtgggcc 4920 gttttacgtt cgtccgctaa ttgggaaagg tacgtcggta aaggtttacg tcgcgttgtt 4980 cgtctgtttg gtggtgaagg ccgtacacct cgaagttgtg ccggatctgt cgtccgtcgc 5040 ttgcatcgcc gcagttaaac gcttcgtggc gcgtcgcggt cgtgttctcg agttgcattg 5100 tgacaacgcg actacgttcg tcggtgccga tcgcgagctc cgacaactgc gaaaggagtt 5160 ccagctgcag ttcaagtcac cggaatgggg cgattactgc tccggaaatg gaatcacctt 5220 ccgcttcatt cctgcacgct ctccacactt tggaggcatt tgggaagctg gagtgaaatc 5280 atttaagtat cactttcgtc gtattatggg acagaaggct ttctccatgg accagctcct 5340 aactgtcgta gcccagattg aatctgtcct taactcccgt cccctcgttc ccatttccga 5400 ttctcccgat gatctgtccg ccctaactcc tgggcatttc ctgattggag agcctcttgt 5460 ctccattccc gaaccggatt tgcttcagaa gtcgcctaat cgcatcaccc gtttccagga 5520 gatgcaacgc tcggtccagg acttgtggcg ctgctggtca cgggactacg ttagccagct 5580 acatcaacgc agcaagtgga gacgcccatc cgtagacgtt cggaagggtc aactggtact 5640 actgaagaat gaaaactacc ctccgctaca gtggcctctt ggtcgcatcg tagaaaccat 5700 tgctggttca gacggccgag tccgtgtcgt tgtagtcaag acggcttcgg gagaatacaa 5760 gcgggctatc accgagatcg ccgttttacc gatcgactcc gacgatgaag aggagaaaac 5820 ctcgctccta ccttcaacta gcgacgcggt agatggttga aacggacggt ttcaacggcg 5880 gccggca 5887 // ID BEL-27_AA-LTR repbase; DNA; INV; 111 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-27_AA_; KW BEL-27_AA-I; BEL-27_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-111 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 140140 140250. XX SQ Sequence 111 BP; 20 A; 25 C; 24 G; 42 T; 0 other; tgactgtatt tcaattcggt gacggaaatt cggttttttc gttatagaac ttcgcgtcag 60 ttctcggtcg tcgtgttttg tccacttccg catcattttg gtcgccaaac a 111 // ID Sola1-1_CS repbase; DNA; INV; 3315 BP. XX AC . XX DT 30-JAN-2009 (Rel. 14.02, Created) DT 30-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Ciona savignyi. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-1_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3315 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(891..971,1015..1344,1388..1651,1721..3046) FT /product="Sola1-1_CS_1p" FT /translation="MDNVSIYFSGLRFFFIYLFKECFNGSENNFVKCLLMS FT FQDATVKLKRNKRPAPDTWNHNTCIKGRLMGKAYKNRKGVEKPPREMGPLC FT TSNFCKKSQLRECQTINEDQRNWIFNNFFEMGSWDERRAYVRALVSKVNCT FT AFVNRLTELMFYQVNIKQKKVLDTISRRKSSYVYRMKLEDGTSVQVCLPFF FT ASTLGLAQRTVKSWLTTVKDTTYTQEKFHRPRKGKWYWVILQNKTYLFLYY FT SRLPPKTPCYITTYHYLFYPGKTAYVEKEDLDFLKQWLAGLPSLPSHYCRS FT SLTYKNKKFLYPGTTIAKLHIEYQIAAETVGNRAVGGTKFNEIFHQQNYSV FT FIPRKDQCDVCVSAKLGNLDNEIYLAHVLAKDEARKEKEEDKKAANEEKMV FT WTMDMQAVLLCPRTKASAAYYKTKLQVHNFTLYDLQTKEGFCYLWEEHEGD FT LSSEVFAFLQYTHFERIIQERPGVKEVIIWSDGCGYQNRNLTVSNAYLELA FT RKCNVKITQKYLVAGHTQMEVDSMHSTIERNIAGDIYTPRDYFVIMQSARI FT RPTPYNVKQIFHNDFKKLTGCYMKSIRPGKKCGDPTVHDLRALEYNSNGTV FT RFKLKFSADWEVLPQRIRITQEPFEWVQVFDGPKPLTYRKFTDLQSMKMVF FT PLCTHHYFDSLPHSNPA*" XX SQ Sequence 3315 BP; 1091 A; 574 C; 654 G; 996 T; 0 other; cgtcagcctc ataacaaaag tgccaaagtc attttttaaa ttttttgagc ttttctgcct 60 aaatcgctcc ttgtgtcaac tcgcacacgt gttaaactgc ccaagtcatt ggaaggattt 120 taagcatatt aaaacacact ttttgcatat atgcctatgt cagcagtgtg acataggcaa 180 tttaacttaa ttatatttag aaaaaacgtc ccagaaaaca gagaaccatt agttttattt 240 tagttaagga agtaatagaa atcaaaacaa attgtcgtga atcgttgtaa aaatacaaat 300 aaagcaagcg cagtttctgc gttgtagtct gattagctgc ctcagagaga tttttaaaat 360 ggcgaaaagg agaaatcaac ttgtgactgt ggatcaagtt cttttgttac ttgatacatc 420 agattcgagt ggtaagtgat tttggaccct ttgtagatta caataattac ccatttagta 480 taaaagtagg cttagatcat ttacaaatag aataccataa ttggactaga ctatacggta 540 tgtatatggc atgctgttta cagtaagggc cggtatgact aggcctgcat gttgtaagtt 600 cagagatact ttgtgaaaca ttaggcctac ttgctagcta ctgctgctac tctattactg 660 taggcccata ggctattatt tacataggcc aatttgatat agtcagattc aatattctgg 720 gattattaaa acctaatgct ggtagtcata ttaacttatt cttaacattt acagatagta 780 atgatgatga acttcaggtt gaatccattc cccctgaact ggctgagcct gaccactcag 840 aatttaatgg cattaataat gatggtgatg gtggtactgg tactaatggc atggataatg 900 tgagtatata tttctcgggt ttacgttttt tttttattta tttgttcaaa gaatgtttca 960 atggttcgga atgatttaat attctattct gtcgctgttg atatataggt gtagaataac 1020 tttgtgaagt gtttactcat gtcttttcag gatgcaactg taaagctaaa gagaaataag 1080 cgtccagcac ctgacacatg gaaccataac acctgcataa aagggcgtct gatgggaaaa 1140 gcatataaaa acagaaaagg ggtagaaaaa ccacccagag aaatggggcc actatgtaca 1200 tccaatttct gtaaaaaatc ccaactaaga gagtgccaaa caataaatga agaccagcga 1260 aactggatct tcaacaactt ttttgaaatg ggatcctggg atgaacgccg tgcatatgtt 1320 cgagccttgg tttccaaagt aaattaaata aatacaattg aaacaaataa taattatgcc 1380 tatctgatgc actgcttttg taaatagact aactgaactt atgttttacc aggtcaatat 1440 taaacagaaa aaagttctgg acactatttc aaggcgaaag tcgtcctatg tctataggat 1500 gaagttagag gatggaacat ctgtgcaggt ttgcctgcca ttttttgctt ctacccttgg 1560 tcttgctcaa agaactgtca aatcctggtt gacgacagtc aaggacacta cgtacacaca 1620 agaaaagttt catcgaccac gcaaaggcaa gtagactagt ataaatatct agtgtgttca 1680 agtgtccgta tttgtcaaac ttaaacatgt gattaaataa tggtattggg ttattctgca 1740 aaataaaact tatttgtttt tatactacag taggctaccc cctaaaacac catgttatat 1800 taccacttac cattatttat tttacccagg caaaacggcg tatgtggaga aggaggactt 1860 ggactttctt aagcagtggc tggctggctt accttcacta ccatcccact actgccggag 1920 ttctttaaca tacaagaaca agaaattttt gtatccaggc accacaattg caaaacttca 1980 tattgagtac cagatagctg cagagacagt aggcaataga gcagttggtg gcacaaagtt 2040 caatgaaata tttcatcagc agaattattc agtatttatt ccaagaaagg atcaatgtga 2100 tgtatgtgta agtgcgaaac ttggcaactt agataatgag atatatctag cacacgtttt 2160 ggcaaaagat gaggcccgga aagaaaaaga ggaggacaag aaagctgcca atgaagaaaa 2220 aatggtgtgg actatggaca tgcaggctgt ccttttatgt cctagaacca aggctagcgc 2280 agcttattac aaaaccaagt tacaggttca caactttact ttatatgatt tacaaacaaa 2340 agaaggcttc tgttatttat gggaagagca tgaaggggat cttagcagtg aggtgtttgc 2400 ttttttgcag tatacacact tcgagcgcat tattcaggaa aggccagggg taaaagaagt 2460 tattatttgg agtgacggtt gcggatatca aaaccgcaat ttaacagttt ccaatgcata 2520 tttagagttg gcaagaaagt gtaatgtcaa aattacacaa aaatacctag ttgcgggaca 2580 tacccagatg gaggtagaca gtatgcattc cactattgag aggaatatag caggtgatat 2640 ttatactccc agagattatt ttgttataat gcaaagtgca agaattcggc caacacctta 2700 caacgtcaag caaattttcc acaatgattt taagaagctt actggatgtt acatgaaaag 2760 tattagaccc gggaaaaaat gtggagatcc aacagtacat gatctaaggg cattagaata 2820 caattccaac gggactgtac ggtttaagtt aaaattttca gcagattggg aggtgctgcc 2880 acagagaatc aggatcacac aggagccatt cgagtgggtc caagtgttcg atggtcctaa 2940 acctctaaca tatcgcaaat ttaccgattt acaatcaatg aaaatggtat ttcccttgtg 3000 tacacatcat tattttgata gtctgcctca ttctaatcca gcttaacaag ttacaaaacc 3060 aattaacaat aaaacttatt tttcttactt cgtgttattt tgcctaagtc acccgtattt 3120 atgcctaagt cctatgtagc tatgacgtca taatacaaac gagtaaaaag tattgttttg 3180 cattttgctt ctattctcaa ctctgcacac tataaatcgt ggccgaatcg tcattgataa 3240 aaaactggca aagttttttg tgattatctc aaaactggaa aaaatgactt tggcagtttt 3300 gttatgaggg tgacg 3315 // ID Gypsy-215_AA-I repbase; DNA; INV; 6343 BP. XX AC AAGE02027318; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-215_AA_; KW Gypsy-215_AA-LTR; Gypsy-215_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6343 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027318; Positions 28978 22636. XX CC Positions [3605-4114] - Reverse transcriptase CC Positions [5307-5780] - Integrase core CC 'CGGAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2039..3889 FT /product="Gypsy-215_AA-I_1p" FT /translation="MDLVRPVKPFDAKVDQSQLASEWRRWKRSLEYCLEAQ FT GVITQREKRNQLLHLGGPDLQDIFDSLPEVNSVPHVSPDPPFYDVAIEQLD FT AHFQPSRRRTYERHIFRQISQQQGERFNDFVMKLRVQANRCDFDQEGTSVL FT ESMIIDQIAEKCTSSALRKKILEKDRPLSEVVAIGKTIEEVEMQCKELAQP FT SEREHPTISVNKVQQYRPRQQNSYSMQPDWSFYPSKERGHQFPRTQQYAFR FT RDQRFNAKGNWSGPPNGRFRQQHFSAHQNDNDNRLCFACGRRGHVKDSPKC FT VAKNAECLQCKRIGHFAKWCTKRPYQEANFGSNGGLSTPPAKRIKMVHESD FT KTNKPEENMDEICYVMGQNVFRFKVGGVEIPMAIDSGAAANLIDLATWTQL FT EQSNARVSFQPNVDRTFKAYGTEKALQLTGMFTAKIEAGGNSTEATFYIAK FT DGRQCLLGDDTAKKLKVLKIGYNIGAVREHVKAFPKIKGVVVEIPINSNIP FT PVQQPYRRVPFTLEDKIAQKLGNLLDLDIIERVQQPSAWVSPVVPILKDSG FT EIRLCVDMRRANQAVLRETHPFPIVEELFAGMKGAQRFSKLDIKEAYHQVE FT ISERSREITTFITKQGLFR" FT CDS 3993..6101 FT /product="Gypsy-215_AA-I_2p" FT /translation="MFGISCAPELFQKVMESIVTGLPGVVVYLDDLLVTGN FT SQEEHDERLKRLLDRLASFGILLNNDKCIYNATRLEFLGYELSPEGVRPKE FT SRIAAIQSFREPKNTSELRSFLGLVTYVGRFIPNLAEKTDTIRSLLRSGER FT FVWNEYHQRCFEEIKQAVCNSSCLGYYNPKDLCIVIADASPTGLGAVLLQQ FT DEYKRIRVIAFASKTLTDLERKYCQTEREALGLVWAVDKFQLYLLGTRFKL FT VTDCKPLLYLFKERAKPCARIERWVLRLQCYRYEVEYKPGSENLADDISRL FT AKSTPTEFDVASEACIYQIAMTDIPDAVTVQEVETESSKDDEIQSVFRSLD FT TGIWEGSSRQFKPFNTELCKVGHLLLRGDRLVIPNNLRARTLKVAHESHPG FT IEIMKRRLRLKVWWPHLDKDVEKYVRQCKACTLVSALDPPMPIHCTKMPDR FT AWVDLAADFLGPLPSGHSLLVIIDYFSRFTEIIVMKQTTATLTVRALHETF FT CRFGFPESLKTDNGPQFISSEFQRFCSQFGIDHRKTTPYWPQANGEVERMN FT RTIVKRLKISQETDGSDWQWDLRMFALMQNSTPHSTTGVAPSILMFGRVLK FT DKIPGMIIKGDKIVEEIRDRDTEKKHKGAEYADAKRSAGYRSLEVGDSVVA FT KRIQKDNKLSTNFHPEELIVIGKTGSDVTLKSQDSGRVFHRNISHLKKLHT FT GEV" XX SQ Sequence 6343 BP; 2139 A; 1101 C; 1469 G; 1634 T; 0 other; ttggcgtcga gttgaaacaa agttttattt cgattacgag cgaacccttg taacaagaag 60 taagttagaa aaaaattaaa acggaaaagt atttgcgacg gtcgtttctg cggtgatagt 120 tttgcttttt ccaaaagaag aactgtgagt gattgttgag aagcggaacg catagcaaag 180 tgtctttacg ttgataagaa ggattaattt aaaagcaaac cattaaaaag tgttcaaata 240 tatttttcat gtattaaaaa aaaaagtgtt ggtcgtgcaa gtgccagaaa acaaaaatat 300 aaacgaaaaa aatcacaaga tttaatatat ttccaatatg gcgtcgtttt ctggatgtga 360 agcgaaaaaa aagagtgaga gggaagataa ccatatcggc gttactaaca accaacgcca 420 cgggttcaaa gtgtgtatgt aaaagtggag cgatatcata gcgatagttg tgtcaagctt 480 ttgtgatgtg tatgcgtgaa cgggtggtaa tggagagaaa aaaaatgttg agaaagagat 540 gtttaccttg aaaaaaaaaa aaaatccatg caaagcataa acaaatatca tcggatactg 600 aaggttggcg ttgatgtgta ttgaacggaa aataaaagtt atttgtatag ttctggtaaa 660 agggttggtt agtgttaaaa gtattgcggc gtgtgaagaa tataggtatt caagtggaca 720 gtacatgcga tatggaagga gcggagctgt gaagtagtgt tgatgatcgc gaattgaaag 780 aagaccagat agtgttatgt tgccgaatgc tgaaggtcaa agtatgtgtc agatgaatat 840 ataggtggat aagctgtatg agtgagaaaa ccgtgaatct agtgaatgat aagttggtag 900 tgtgtgcgta tgtcaatgag ttgattgaag aacgatcttc accagacagt ggaagtggat 960 tgatgtcttc atcttggtaa gtcaggaatg cgcaacattt gaaaagtttt ccacagccgg 1020 gaagcaagct gagcaatgag catgatgcgt cgaagtaccg taagtacaca aaatcatgga 1080 caaaaaaaaa aaatgaagag aataaataaa tgtgccatat atcttttttt tatttacgat 1140 aagagataaa ttttttgaac tcattcgcat ccatttattg ggaacagtta atgtgaaaag 1200 acagtactgt tcgacttaat ttattaatta tctcatgatt tgttttgatg atatacaagt 1260 ttatacatgg aagactatag catttttttt atattcttag aaaaggggtg ttgtcttttc 1320 agcagagcac agccttttga ttcgttgatt tacgtttagt gagttataaa tggaggatga 1380 agtcataatc catgaaaaga ataaaatggt tcaataaaaa gaaaagacga tttttgacat 1440 cggtattttg tgttaatcaa gaatgaaacg ggctgagagc cggattcgaa tttaaagtta 1500 tacaaacgtg agaaattgag gtgttaaaca gaacaaagta tgtaaagatt gaaatgagct 1560 gtaagctgga ttctttcaag agagctggtt atatatgtaa aatgagctgt gagccagatt 1620 tgtattgttt atgaaggatt atctgattga aaagggtgaa tgccagatgc gtcgtgtaaa 1680 taccgttaaa aaaatgatga atgaaagtga aacggatgtt tggacatggt actaactgag 1740 ttcaatttgc aaggcataga ataacattag cgaaaaaaaa taaatgtgaa agtgtgaaac 1800 gagctgtaag ctggatgttg taacattatt gtttgaacag gatgcattaa gaagttgatc 1860 taggtgaaat gagctgtaag ctggattcac agtattttaa aaagagtaat ttgggttcca 1920 gaagattaat tgaatagaat gctagaataa tttaaccaat gcacaaaaac aaaacaccac 1980 agaaattgac aagaaaaaaa aataaattaa cttaattcat gataattgat tcataggaat 2040 ggatttagtg cgaccggtta aaccgttcga tgcaaaagtg gaccagtctc agctcgcttc 2100 agaatggaga aggtggaagc gtagcttgga gtactgtttg gaagctcagg gagtgatcac 2160 tcagcgagaa aagcgaaatc agctccttca tttgggagga ccggatcttc aggatatttt 2220 tgacagcctt ccggaggtga atagcgtgcc ccatgtctca cctgaccctc cgttctatga 2280 cgtagccatt gagcaactgg atgcgcattt ccaaccgtct cgtaggcgta cgtatgaacg 2340 ccatatcttt cgccaaattt cacaacagca aggtgaaaga ttcaacgatt ttgtcatgaa 2400 actgcgggta caggccaatc gatgtgattt cgaccaggaa ggaacctcgg tgttggagag 2460 tatgatcatt gatcaaatag cggagaaatg tacttcctcg gctttgcgga aaaaaatctt 2520 agaaaaagat cgcccactca gcgaggttgt ggctattggt aaaaccattg aagaagtcga 2580 aatgcaatgt aaggaactgg cacaacccag cgaacgtgaa catcccacaa tttctgtgaa 2640 taaggtgcaa caatacagac ctcgccagca aaattcatat tcgatgcaac ctgattggtc 2700 cttctaccca tctaaggagc gtgggcacca gtttcctcga actcagcagt atgctttccg 2760 acgtgaccag cgattcaatg ctaagggtaa ctggtctgga ccaccaaacg gccgtttccg 2820 ccaacaacat ttttcagctc atcaaaacga taacgacaac agactttgct tcgcctgtgg 2880 ccgtcgagga cacgtaaaag atagtccaaa gtgtgtggcc aaaaatgctg agtgtctgca 2940 atgcaaacgc attggtcatt ttgccaaatg gtgcactaaa agaccatacc aagaagcgaa 3000 ttttggatcg aatggtggct tgtctacgcc accagcaaaa agaataaaga tggtacatga 3060 aagcgataag accaataagc cagaagaaaa catggacgaa atttgttacg tcatgggaca 3120 gaacgtcttt cggttcaaag tcggtggtgt tgaaattcct atggccatag attccggagc 3180 agcagcgaat ctcatcgatc tggctacctg gacacaattg gaacaatcaa acgcaagagt 3240 tagcttccaa ccgaatgtag atcggacatt caaggcttat ggaacagaaa aagcactcca 3300 attgacagga atgtttacag ccaaaatcga agcgggggga aatagtaccg aagcaacgtt 3360 ttacatcgca aaggacggtc gacaatgtct acttggagac gacacagcga agaagctaaa 3420 ggtattgaaa attggataca acatcggtgc agtcagagaa cacgtgaagg cctttcccaa 3480 aataaaagga gtagtggttg aaattccaat aaacagcaac ataccaccag ttcagcaacc 3540 ttaccggcgt gttccattca ctttggagga caagatcgca caaaagcttg gaaatctact 3600 cgatctagac attatcgaac gggtacaaca gccgtctgct tgggtttctc cggtagtacc 3660 aatcttgaag gattcgggag agattagact ctgcgttgat atgagacgag cgaaccaagc 3720 agtacttcgg gagacacacc cattcccaat cgtagaggag cttttcgcag gaatgaaagg 3780 ggcacagcga ttttctaagc tagatattaa agaggcctac catcaagtgg agatatcgga 3840 gcgatccagg gaaattacta cgtttattac gaagcaggga ttattccggt aaagattctt 3900 tataagatat tttcgatgaa aaaatgaata acgaataaag aaaaacaaaa acaaaaaaaa 3960 atctattttc tctcaccaga tacaaaagat taatgttcgg aatatcatgt gctcctgaac 4020 ttttccaaaa agttatggag tctattgtaa caggtttgcc aggagttgtt gtatatttgg 4080 atgatttgtt agtcacagga aacagtcaag aagaacacga tgagagactg aaacgcttgc 4140 tcgatcgtct agcttctttc gggattttgc taaataacga caagtgcatc tacaacgcaa 4200 ctcgcttaga attcttagga tatgaattat cacccgaagg cgtacgtcca aaagaaagtc 4260 ggattgcagc catacaatct tttcgagaac ctaaaaatac atcagagctc cgcagctttt 4320 tgggactagt gacctatgtg ggtcggttta taccaaatct cgccgaaaaa actgacacga 4380 tcagaagttt actgcgttcg ggagagcgtt tcgtctggaa tgaataccat caacgatgtt 4440 ttgaagaaat caaacaggca gtttgtaatt cgagttgtct aggctattac aatcctaaag 4500 atctctgtat tgtaatcgca gatgcaagtc caactggtct tggagcagtg cttttgcagc 4560 aagatgagta caaacgcata cgggtcattg cgttcgctag taagacattg acggatttgg 4620 agagaaagta ttgccaaaca gaacgagaag ctttaggact cgtttgggct gtggataagt 4680 tccagttata tcttttagga actcggttca aactagtaac cgactgtaag cctctgttat 4740 atttgttcaa agaacgagct aagccatgtg caagaattga gaggtgggtg ttgcgactac 4800 aatgctatcg ttatgaagta gaatataaac ccggatcaga aaacttagct gacgatatct 4860 ctaggctggc caagtcaact cccacggaat ttgacgtagc aagtgaagcg tgcatatacc 4920 aaatagctat gacagatata ccggatgcag ttactgtgca agaagtagaa actgaatctt 4980 caaaggacga cgaaattcaa agtgtttttc ggagtttgga tacagggatt tgggaaggca 5040 gctcacgaca attcaaacca ttcaatacgg aactatgtaa agttggacac ctactactac 5100 gtggcgatag attggttata cctaataatc tcagggcaag aactttgaaa gttgcccacg 5160 agtcacaccc cggaatcgag attatgaaaa ggcggttaag attaaaggtc tggtggccac 5220 atttagataa ggatgtagaa aaatacgtga gacagtgcaa agcctgcaca ttagtttcag 5280 cgctcgatcc acctatgcct attcattgca ctaagatgcc agatcgagct tgggtagact 5340 tagcagcgga ttttctaggc cctctaccat ccggtcacag ccttctggta atcattgatt 5400 atttcagccg attcaccgag attattgtca tgaagcagac aaccgcaact ctaaccgttc 5460 gagcattgca cgaaacattt tgccgtttcg ggtttccaga atcactgaaa acagataatg 5520 gaccgcaatt catcagctca gagttccagc ggttttgtag tcaatttgga atcgatcaca 5580 gaaagaccac tccgtattgg cctcaagcaa acggggaagt tgaacgtatg aaccgcacaa 5640 ttgtgaagcg tttgaaaatc agccaggaga cggatgggtc tgattggcaa tgggatttga 5700 gaatgttcgc gctcatgcaa aattcgaccc cacattcaac caccggagtg gcaccctcga 5760 ttcttatgtt cggacgagtc ctaaaagata aaattccggg aatgataatt aaaggagaca 5820 aaattgtgga agaaatccga gatcgcgaca ccgagaagaa acacaaagga gcagaatatg 5880 cagatgcaaa acgatcagca ggttaccgca gcctagaagt tggagattct gtagtggcca 5940 aacgcatcca gaaggataat aaactgtcaa caaatttcca tccagaggaa ttgatagtga 6000 ttggtaagac aggatcagat gtgactttaa aatcacagga ctccggacgt gtttttcata 6060 gaaacatttc ccacttaaaa aagttacata caggtgaagt ataagtaaat agaggatact 6120 tagaaatata aaaatttgaa ataataatta ttttcaggta atggatacac gaccaatata 6180 gaaccaaatc cgatggatgg cgatgttagc caaggttgtc ctaaggtttc agatgatacg 6240 agcgagagca ttaacctacg gccacgacga gaagtgaaac gaccggcaca tttaaatgac 6300 tataagatta acgaattttc tttgttttaa agtaagagag gga 6343 // ID CR1_Ele20 repbase; DNA; INV; 5919 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 19-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele20. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5919 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5919 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 22 CC sequences with >96% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 254..1573 FT /product="CR1_Ele20_1p" FT /translation="MPVEMDGEEGEHGCCNQPCAQCALPIRDGDASVECFG FT RCKLSIHVHCLPDATNEEIVLLRKIRNAAFVCDACLSLAEFDDRHSEKRLD FT EIAKKLEDLAGVAEVVKNFDQSVKRIVRDELARANKRDAAALGDKDDVSHR FT MVTRSSAKRRKIDNLGNRVEEEVTPKSTFAEVLKKRSEKRVIEQPKKILPV FT QKPNPVVVVKPKSGVQVEDVRAELRKKVDARQLNVQRVTSGKSGEVVIALK FT DEQSVKLLRENVEKNMGGLYDVHVRESIKPTIKLIGMSEEMNEQELKETLI FT DQNLAFENLKHFKLCKSYCNEKLRFNNVSAIIELDAETFRKAMLEERLNCG FT WDRCRVVDGLRVTRCYNCCAFNHKSKDCKATTPKCAVCSGNHLVNECKSNV FT NECANCKKMNTDRKLRLDTNHAAWSVLCPVYQKQLEQRKSFVDYSV" FT CDS 1577..5752 FT /product="CR1_Ele20_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QLMSLHEDFENGERSCVGNIEVVDTPASSGNLIQSIW FT SAEARHDTAPCLNLNALQMAGRLHDHDDPLPASEDPFDEACCPAGCSTVVD FT IRTSLESNGRPKLMGSTSTLSINTSSCSTQTERTSRSMMDAPKPPDTVEIH FT RSSASSDCSGHPSRSVPVFGVGEGAVHLACLGKYYDVCDLSYPESHSTRSK FT TTTKRHHSASSSLPDIQFASSSRSSSSCSTATERTSRSMMDAPKPPDTVEI FT HRSSFSSDCSGHPSRSVPVFGTGEGAVHLACPGKYYDACDPSCPESHSIRS FT KTTTKRHHSASSSTPDIQLSSQYASSIRGSSSCSTVTERTSRSMMDAPEPP FT DTVEIRRSSFPSDCSGHPSRSVPVFGTGEGAVHLACLGKYNDVGSSSYSES FT HPACSDIASNVDQPSLTCHRGRSDSTAASDAIANKLMVYYQNVRGLRTKID FT DFFVTVVDCDYDVIVLTETWLDDRIYSAQLFGNRFMVFRNDRNPFNSTKAR FT GGGVLIAVSNRLNCLLDSAHINPSVEQLWVKVTTPNRAISIGAVYVPPDRK FT TDMLYIENHINSIGAVLSRLDTNDFALVFGDYNQSGLAWNVQSNNIPSIDV FT NRSVISASCSSLLDGFSLHGLVQLNTIFNRNSRLLDFLLVNEVALPDCTIC FT EAPEPLIGLDADHPALETQINMSLPIQYDNFDDPLAFDFNRADFVSMSQAL FT ALIDWQYLETCLCVNEAVDYFTSNLNHIISANVPLRRPSAKPIWGNARLRY FT LKRLRSKALRKYLENKNPLTKQSFVLASNEYRSYNRFLYNGYTNRMQDKLR FT RNPKSFWSFVKTKKKEDGLPSEIYFGDKFAASVVDKSNLLAERFKNVFNDS FT TAAESQIDTALSDTPRDAFNLSMFSVTPDMVESAIRKLKHSNESGPDGIPP FT SILKKCSTELILPLTKLFNASLLQRLFPERWKYSYMFPVFKKGDKKNAENY FT RGITSLCACSKVFEIIVNDVLLSCCKNYILPDQHGFYPRRSISTNLAPFVS FT LCLQNMEAGVQVDVIYTDLKAAFDRVDHSILLAKLGKLGVSSDLVHWFQSY FT LTHRKLCVKLGSANSGYFTNTSGVPQGSNLGPLLFSIFLNDIGFALPTGCR FT ILFADDIKIVVVVRCENDCLVLQHLINVFCEWCDRNFVTISVQKCNVISFH FT RKQKPIIFNYTIDDLPLHRVDTIRDLGITLDSALTFKPHYLDTIAKANRQL FT GFIFKIADEFRDPTCLKALYCSLVRSIIEFGSVIWCPYQRVWIMRLETVQR FT KFVRYALRHLPWRDPTSLPPYEHRCQLLGIDTLESRRSTAQSVFVAKVLVG FT EIDSPEILANLLVYAPERPLRTRSFLHLPARTVNYGLHDPIRYMAARFNEH FT YHLFDFGLATSTFRQRIERVLREEEREDI" XX SQ Sequence 5919 BP; 1635 A; 1349 C; 1321 G; 1614 T; 0 other; ttttttttta ctgtgctgac ggcgtgataa gatcgtttac ttcgtctgat cacaaaagtt 60 atgttttatg tgcgcaaatc tcgcgttatc atatcaaaag tttgtgcttg tgttgtgaga 120 acattgtgct cgtataatta aacaggaatc ttattgatac gtaaagtgga actcctgaaa 180 ttcagtgatt catatcagca tattactgaa gctaagttct gttgcctgtt ttgtttgttt 240 tcgcccgggc gatatgcccg tggagatgga cggtgaagag ggtgagcacg gctgctgtaa 300 ccagccgtgc gcgcagtgcg ctttgcctat tcgagatggt gacgccagtg tcgaatgctt 360 cggtcgatgc aaattgtcaa tccacgtcca ctgtttacct gatgcgacga atgaggagat 420 agtattattg cggaagatta gaaatgctgc gttcgtatgc gatgcgtgtt tgagtcttgc 480 cgaattcgac gataggcaca gtgaaaaacg attggatgaa attgcgaaga aattggaaga 540 cctcgcgggt gtagcggaag tcgtaaaaaa tttcgatcaa tcagtgaaaa gaattgtgcg 600 tgatgaacta gcgcgagcga ataagcgtga tgcagctgct cttggtgata aggacgatgt 660 atctcaccgt atggttacac gttcgtctgc taaacggcga aaaatcgaca atttgggaaa 720 tcgcgttgag gaagaagtaa cgccaaagtc aacttttgcg gaagtgttga aaaagcgttc 780 tgaaaaaaga gtgatagaac aaccaaagaa aattctacca gttcagaaac caaatccagt 840 ggttgttgta aaacctaagt caggtgttca agttgaagat gtacgggctg agctacgaaa 900 gaaggtggat gctagacagc taaatgtcca gcgggtgacg agtggaaaat ctggtgaagt 960 tgtgattgcg ttgaaagatg agcaaagcgt gaagttgttg agggaaaacg ttgagaaaaa 1020 tatgggcggt ttgtacgatg tgcacgttag agaaagcatt aagccgacta tcaaattgat 1080 tggcatgagc gaagaaatga atgagcagga actcaaagaa accctgatcg atcaaaactt 1140 agctttcgag aacctcaagc atttcaaact gtgcaagtca tactgcaacg aaaaactacg 1200 ttttaacaat gtcagtgcaa tcatcgagct cgatgctgaa acgttccgaa aagccatgct 1260 tgaagaaaga ttgaactgtg ggtgggatag gtgtcgagtg gtagacggct tgcgagtgac 1320 gcggtgttac aattgttgtg ctttcaacca caagagtaaa gattgtaagg caacaactcc 1380 gaagtgcgct gtgtgtagtg gaaatcacct ggtgaatgaa tgtaaatcga acgtgaatga 1440 atgtgctaat tgtaaaaaaa tgaacactga tcgcaaactg agacttgaca caaaccacgc 1500 tgcatggagc gttttatgcc cggtgtacca gaaacaactt gaacagcgta agagtttcgt 1560 cgactattcg gtttagcaac taatgtcctt gcacgaagat ttcgaaaacg gtgaacgcag 1620 ctgtgttgga aatatagaag tagttgacac tcctgcttcc tcaggtaatt taattcaaag 1680 tatatggtct gccgaagcta ggcatgatac tgccccctgc ctgaatttaa acgcactcca 1740 gatggctggc cgattgcacg atcatgacga cccacttcct gcatctgagg atccttttga 1800 cgaagcctgc tgtcctgctg gatgttctac agtagttgac atccgtactt cgttagagag 1860 caatggacgt ccaaagttaa tgggtagcac gagtacactc agtatcaaca cttcttcttg 1920 ttctacgcaa acggaacgca cgtcacgaag catgatggac gcccccaagc ctcccgacac 1980 agtcgagatt caccgatcat ctgcctcatc tgattgttcc ggccatccaa gccgttccgt 2040 tcctgtgttc ggggtcggtg aaggggccgt ccaccttgct tgtctaggca agtattacga 2100 cgtatgtgat ctttcgtacc ctgaatctca ttcaactcgc agcaaaacaa caacaaaacg 2160 tcatcattcc gcatcatcaa gtttacctga catccagttt gcatcatctt cacgtagctc 2220 ttcttcttgt tcaacggcaa cggaacgcac gtcacgaagc atgatggacg cccccaagcc 2280 gcccgacaca gtcgagattc accgatcgtc attttcatct gattgttccg gccatccaag 2340 ccgttccgtt cctgtgttcg ggactggtga aggggctgtc caccttgctt gtccaggcaa 2400 gtattacgac gcatgtgatc cttcgtgccc tgaatcccat tcaattcgca gcaagacaac 2460 aacaaaacgt caccattccg catcatcaag tacacctgac atccagttaa gcagccagta 2520 tgcatcgtct atacgtggct cttcttcttg ttcaacggta acggaacgca cgtcacgaag 2580 catgatggac gcccccgagc cgcccgacac agtcgagatc cgccgatcgt catttccatc 2640 tgattgttcc ggccatccaa gccgttccgt tcctgtgttc gggactggtg aaggggctgt 2700 ccaccttgct tgtctaggca agtataatga tgttggttcc tcttcgtact ctgaatccca 2760 tcctgcttgc agtgacatcg catcaaacgt tgaccaacca tcattgacgt gccatcgagg 2820 tagatctgat tccactgcag cttcagatgc catcgccaac aagctgatgg tttattatca 2880 aaatgttcga ggactgcgga cgaaaatcga tgactttttc gtgactgtag tggactgcga 2940 ctacgatgta atagtgctta ccgaaacttg gctcgacgac cgaatctatt cagctcagct 3000 gtttgggaat cgattcatgg ttttccgcaa tgaccgcaat ccgtttaaca gcaccaaagc 3060 aagaggtggt ggagtattga ttgcggtttc aaatcgttta aactgtcttt tggactcagc 3120 tcacatcaat ccttctgtcg aacaactctg ggttaaggtt acaacaccta accgagcgat 3180 tagtattggt gctgtatatg tccctcctga tcgcaaaaca gacatgctgt atattgaaaa 3240 tcacattaac tcaattgggg ctgtactgtc ccggcttgac actaacgact ttgccctcgt 3300 atttggtgat tacaaccagt ctggtttggc ttggaatgtg caaagcaaca acattccatc 3360 gattgacgtc aaccgctctg tcatatcagc ctcgtgcagc tctcttttgg acggcttctc 3420 tctgcatggt cttgtgcagc tcaatacgat tttcaacaga aactcccgat tgctcgactt 3480 tttgctagta aacgaagttg ccctccccga ctgtacaata tgcgaagcgc ccgagccgtt 3540 gattgggctt gatgcagatc atcccgcact agaaacgcaa atcaatatgt ctttacccat 3600 tcagtatgac aacttcgacg atccactggc ctttgacttc aacagagctg actttgtttc 3660 aatgtcacaa gcacttgctt taatcgactg gcaatacctc gaaacttgcc tctgtgtcaa 3720 tgaagctgtt gactatttta ctagtaatct gaaccatata atatcagcaa atgtgcctct 3780 gcgaagacca agtgcgaaac caatttgggg caacgcacgc cttcgatact tgaagcgttt 3840 aaggtccaaa gcactacgga agtaccttga aaacaaaaac ccgttaacca agcaatcttt 3900 tgttcttgca agcaatgaat accgcagcta caatcgtttc ttgtacaacg gctacactaa 3960 tcgcatgcag gataaacttc gcaggaaccc aaaatcgttt tggtcatttg tgaaaactaa 4020 aaagaaagag gatggtctcc ctagcgaaat ttatttcggc gacaaattcg ctgcatcagt 4080 ggttgataaa agcaatcttc tagcagaacg tttcaaaaac gtgttcaacg actcgactgc 4140 cgctgaatca caaatagata cggcgctcag tgatacacca agggatgcgt tcaatctcag 4200 catgttttct gtcactccgg atatggttga atcagcgatt cgtaagctaa agcattcaaa 4260 cgaaagtggc ccagatggta tcccaccaag catcttgaaa aaatgctcta ccgaacttat 4320 tctgcctttg acgaagctgt tcaacgcctc tttgctccaa agattgtttc cagagcggtg 4380 gaagtattct tacatgtttc cggtattcaa gaaaggagat aagaagaatg cggaaaacta 4440 tcgaggcatc acatcactct gcgcctgttc caaggttttt gagataattg tcaacgatgt 4500 cctactctcc tgttgtaaaa actacatttt acctgaccaa catggtttct acccccgaag 4560 atcgatctca actaacctcg ctcccttcgt atcactgtgt ctgcagaata tggaagcagg 4620 agtacaagtg gacgttattt acaccgacct caaagcagcg tttgatcgcg tcgaccactc 4680 catactactt gctaaattag gcaaacttgg cgtatcgtcg gatttggttc actggttcca 4740 gtcataccta acgcaccgga agctttgtgt aaagctaggt tcagcaaatt cgggctactt 4800 tacaaacacc tcgggagttc cccaaggcag caacctaggg ccgctgttgt tctccatctt 4860 tctcaacgac attggattcg ctcttcctac tggatgcaga attctgtttg ctgacgacat 4920 caaaattgtt gtagttgttc gctgcgagaa cgattgtctg gtgctacaac atctaatcaa 4980 cgttttttgc gaatggtgcg ataggaactt cgtcaccata agtgttcaga aatgcaacgt 5040 gatcagtttt caccgcaagc aaaagccaat tatattcaac tataccattg atgatctgcc 5100 actacatcgt gtagatacta ttcgagacct aggtatcacg ctagattcag ctctaacttt 5160 taagccccat tatctggaca ctatcgccaa agctaatagg cagctcggat ttatattcaa 5220 aattgcggat gaattccgtg acccaacctg tctcaaagcg ttgtactgtt cccttgtgcg 5280 ttccatcata gagtttggat ccgtcatctg gtgcccatac caaagggttt ggataatgcg 5340 actggagact gtacaaagaa aatttgtgcg ctatgccctt cggcatctgc cgtggagaga 5400 ccccacaagt ctgccaccat atgaacatcg gtgtcaactg ttgggtattg acacacttga 5460 gagcagaagg tctacagctc aatcagtgtt tgtagctaaa gttttggtag gcgaaattga 5520 ttcaccggaa attcttgcca atctgttagt atatgcacca gaacggccac tccgtacaag 5580 aagtttccta catctccctg ctcggacagt taattacgga ttacatgacc ctattcgata 5640 catggcagca cgcttcaacg agcactatca cctcttcgat ttcggcttgg caacttctac 5700 atttcgacaa cggattgaac gagtgttacg cgaggaagaa cgggaagata tttgatattt 5760 gtttgacgca tttttaaagt gttagaatta agttattcgt tgtatatgtc ttcaatatgc 5820 gtttcaaata gttgattata ttttttttta ctttaattgt tcattaagac acatagtcag 5880 atgaattgac aaatatacaa atacaaatac aaatacaaa 5919 // ID BEL-24_AA-LTR repbase; DNA; INV; 443 BP. XX AC supercont1.15; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-24_AA_; KW BEL-24_AA-I; BEL-24_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-443 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.15; Positions 2074442 2074884. XX SQ Sequence 443 BP; 118 A; 113 C; 99 G; 113 T; 0 other; tgttccgtgt acggaaggaa tcatgtgatg cgtgaacata tttgcaatcc ttttccttcc 60 atttctatcc cgcgcgtgtg gagacttttc ttccacgccc caccttcatg cacgatcacg 120 ccacgcaagg ctctgtgcag aggaaggaag tgtgaaaaag tacagtcttc gagaaagtgt 180 cgacagcaac caacgtgttt agaagaaagt tccaactcac gtgttaattt agttaattaa 240 agagtagtta agtgtaaaac ccagtgtttt agatcatttc ccagtaaccc agtcctagtt 300 ccgaagaagt gtcctcccga aagagccgta aagtgtatcg gctacaagac gaagcgtcca 360 gcgagggatc cttaccctct agcttcggcc gtcctcccgg aaaagtgtta cagtccactt 420 tgctcgttcc cccagacgga aca 443 // ID CR1-9_CQ repbase; DNA; INV; 4184 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4184 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 11-11 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 2..4063 FT /product="CR1-9_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="GESESRGREHGRVTRPPLAASPGRAVESHVRVPRLPD FT TTTSSPSHRLIRPACCSLQSRPGAGVGAGGGVSQTATLGKYXSXSQXPLPD FT ARPTSSSCAISSTSPPVRTPNAVRAFWSSPGRAVESHVRVPHLPDTTTSSP FT SHRLISPACCSLQSRPGAGVGAGGGVSQTATLGKYHSSLQNPLPDARPTSS FT SCATSPPVRTPNAVRAFRSSPGRAVESHVRVPRLPDTTTSSPSHRLISSAC FT CSLQSRPGAGVGAGGGVSQTATLGKYHSSLQNPLPDARPTSSSCATSPPVR FT TPNAVRAFWSSPGRAVESHVRVPRLPDTTTSSPSHRPISPXCCSLQSRPGA FT GVGAGGGVSQTATLGKYPSGLQNPLSDARPTSSSAAFPSTINAKGTSTPCK FT LRIYYQNVRGLRTKIDAFFLAVTECDYDVIVLTETWLDIEILSPQLFGSTY FT RVFRNDRNERNSDKKSGGGVLIAISSRLDCVLDPTPVCDTLEQLWVRLTVG FT ASQRVCIGAFYLPPGLNRSIDLIKDHLESIGNVLSHLNLNDSFIQFGDYNQ FT PGLKWIPSEEGGLEIDPVQSRVPLVSSYLIDGFNLHGLSQLNPCVNVYNNC FT LDLVLVNEVALPNCAVLAAVDPLLPQDIAHPALEMFVDVPAPSATAEPPLN FT SQQSFNFRKANYAAIQEELSRIDWSFLETAPSIDAAVDAFQSILNNLFTEH FT VPLRRPPQKPPWSNPTLRKLNRLKTAALQEFSNHRSPFFKSKFNLASKRYR FT GYNRTLYRRYVIRTQRDLRRNPKNFWSFVNSKRKADGLPPTLYLDDVAAAT FT SEEKCALFAAHFKDAFNSRFASEAEIIEATRNTPINAIECSLTEVANCQVS FT AAINKLKLSYSPGPDGIPSSVIKRCSGALLSPLTCLFNLSLRQSTFPDKWK FT ASFMFPVHKKGDKSSVKNYRGITSLSACSKVFEILVNNVLFDSCKNYISTD FT QHGFYPKRSVVTNLAQFVSHCLQEMDAGAQVDAVYTDLKAAFDRVDHGVLL FT KRLERLGVSASLVRWLKSYLTNRILRVKIGNSESDMFISLSGVPQGSNLGP FT LLFILFANELSMLLSEGCRLFYADDVKIFTVVRSVLNCSALQRQLNHFNEW FT CVRNFLTISIEKCSTISFHRKLKPVVFDYAIAGNTLARVTQVRDLGVTLDS FT ELCFTIHLHDIVSRANRQLGFIFRIADDFRDVACLRSLYCALVRSILEFCS FT VVWCPYQSTWTAKIESVQRKFTRLALRRARNPAGWNTYEERCRVLQLETLE FT RRRHVSQATFVAKVLAGEVDTPWIRAQIQIYTPARPLRQRQYLQLAHRDTN FT YGQHEPVRFMCSRFNVFSHLYDPNISTPQFQQRARLWYSHQL" XX SQ Sequence 4184 BP; 991 A; 1223 C; 950 G; 1012 T; 8 other; cggagaatcc gaatctcggg gtagagagca tggaagagta acgcgaccac ctttagctgc 60 atcgccagga cgcgccgtag aaagccacgt gagagtccct cgcctgcccg acaccaccac 120 gtcgtcaccc agccaccgcc tgataagacc agcctgttgc agccttcaga gccgtcctgg 180 tgctggtgtc ggtgccgggg gaggggtctc tcaaacggca accctcggca agtaccmctc 240 takwtcgcaa watccmttgc ctgatgctcg tccaacttcc agctcgtgcg ccatatcatc 300 cacatcgcct ccagtcagga caccaaacgc tgtacgagct ttctggtcat cgccaggacg 360 cgccgtagaa agccatgtga gagtccctca cctgcccgac accaccacgt cgtcacccag 420 ccaccgcctg ataagcccag cctgttgcag ccttcagagc cgtcctggtg ctggtgtcgg 480 tgccggggga ggggtctctc aaacggcaac cctcggcaag taccattcta gtttgcaaaa 540 tccattgcct gatgcccgcc caacttccag ttcctgcgcc acatcgcctc cagtcaggac 600 accaaacgct gtacgagctt tccggtcatc gccaggacgc gccgtagaaa gccatgtgag 660 agtccctcgc ctgcccgaca ccaccacgtc gtcacccagc caccgcctga taagttcagc 720 ctgttgcagc cttcagagcc gtcctggtgc tggtgtcggt gccgggggag gggtctctca 780 aacggcaacc ctcggcaagt accattctag tttgcaaaat ccattgcctg atgctcgccc 840 aacttccagt tcgtgcgcca catcgcctcc agtcaggaca ccaaacgctg tacgagcttt 900 ctggtcatcg ccaggacgcg ccgtagaaag ccatgtgaga gtccctcgcc tgcccgacac 960 caccacgtcg tcacccagcc accgcccgat aagtccagmc tgttgcagcc ttcagagccg 1020 tcctggtgct ggtgtcggtg ccgggggagg ggtctctcaa acggcaaccc tcggcaagta 1080 cccttctggt ttgcaaaatc ccctgtctga tgctcgccca acttccagct cggcggcctt 1140 cccttctacg atcaacgcta aaggcacttc aaccccctgc aaactccgca tctactatca 1200 gaatgttcgc ggactccgaa caaaaattga tgcctttttc cttgccgtca ctgagtgcga 1260 ttatgacgtc atcgtcctga ckgaaacttg gctggatatt gagatccttt cccctcaact 1320 gtttggatcg acgtatcggg ttttcaggaa tgacagaaac gagcggaaca gcgacaaaaa 1380 gagcggagga ggtgtcctga ttgctatttc gtctcgcctg gactgtgtgc ttgaccccac 1440 tcctgtttgt gacacgctag aacaactatg ggttaggttg accgtaggcg catcacagcg 1500 cgtttgtata ggggcgttct atcttccacc gggcttaaat cgcagcatag atctgattaa 1560 ggatcatcta gaatccatcg gtaacgtcct ctcccacctc aacttaaacg attcgttcat 1620 wcaatttggt gactacaatc aacctgggct gaaatggatt ccctcggaag aaggcggact 1680 tgaaatcgac ccagtacaat ctcgtgtccc gcttgttagc agttacctca tcgacggctt 1740 caacctacat ggacttagtc agctcaaccc ctgcgtcaat gtctacaaca attgcctcga 1800 ccttgtattg gtgaatgaag ttgcgctgcc caactgcgct gttctcgcag ccgtggaccc 1860 gttgcttccg caggatattg cgcatccggc gctcgaaatg ttcgtcgatg ttcccgcgcc 1920 ctctgcaact gccgaaccac cactcaactc tcagcaatcg ttcaacttca ggaaggccaa 1980 ctacgcagcg attcaagaag agctgagcag gattgactgg agtttcttag agactgctcc 2040 gagcatcgat gctgcagtgg atgcttttca atcgattttg aacaaccttt ttaccgaaca 2100 tgtcccgtta cgaaggcctc cccagaaacc cccgtggtca aacccaaccc tacggaaact 2160 aaaccgactc aaaacagctg ctcttcaaga attcagcaat caccgctcgc ctttcttcaa 2220 atccaaattc aaccttgcca gcaaaagata ccgggggtac aaccgcaccc tctacaggcg 2280 atacgtcata aggacgcagc gtgatctccg gagaaatcct aagaactttt ggtcattcgt 2340 gaactcaaaa agaaaagcgg acggtctgcc acccacatta tacctggacg atgttgctgc 2400 tgcgacttca gaggagaaat gtgccctgtt tgctgcccac ttcaaggacg cgttcaattc 2460 caggttcgct tcggaggcgg aaattatcga agccacaagg aacactccca tcaatgctat 2520 cgagtgttcc ctcacggaag ttgcgaattg ccaagtatct gctgctatta acaagctcaa 2580 actctcctat tcgcccggtc ccgacggaat tccctcttcg gttatcaaac gatgttctgg 2640 tgcactcctt tcccccttga cctgcctgtt caacctatct ttgcggcaaa gcaccttccc 2700 ggataagtgg aaagcctctt ttatgttccc ggtgcacaag aaaggtgaca aaagcagcgt 2760 caaaaattac cgtggaatca cctctctctc ggcttgttcg aaggttttcg aaattctggt 2820 caataatgtt ctgtttgact cttgcaaaaa ctacatttcc acggatcagc atggcttcta 2880 ccctaaacgc tctgttgtca cgaatcttgc tcaattcgtg tcacattgtc ttcaagaaat 2940 ggatgctgga gctcaagtcg acgcagtgta cacagacttg aaggcggcgt tcgatcgggt 3000 tgaccacgga gttttgctaa aacgactgga aagactcgga gtctccgcca gtctcgttcg 3060 ttggctgaag tcttacctca ccaaccggat tctgcgagtc aagattggga actctgagtc 3120 agacatgttc ataagtctct ctggcgttcc tcaaggcagt aatttaggtc cgttactgtt 3180 catacttttc gccaacgaac tgtcgatgct gctaagcgaa gggtgccggt tgttttacgc 3240 tgatgatgtc aaaatcttca ccgttgttag aagcgttttg aactgctcag ctctgcaaag 3300 acagttaaac cactttaacg aatggtgcgt gcgtaacttc ctgacgatca gcatagaaaa 3360 gtgtagcaca atctctttcc accgtaaact caaacccgtg gtcttcgact atgccatcgc 3420 cggaaacact cttgctcggg taacccaagt acgtgacctg ggcgttaccc ttgatagcga 3480 attgtgtttc actatccacc tccatgacat cgtgtccaga gccaatcgac agcttggatt 3540 catctttagg atcgctgatg acttccgaga tgtcgcgtgc ttacgttcat tatactgcgc 3600 gctggtaaga tccatcctcg agttctgctc cgtagtgtgg tgtccgtacc agagcacctg 3660 gactgctaaa atcgaatcag ttcagcgaaa attcacccgg cttgcccttc gccgtgcaag 3720 gaatcctgcc ggttggaaca cgtacgaaga acgttgtcgt gtcttgcagc tagagactct 3780 cgagcggagg cggcacgtct cacaagcaac gtttgttgcc aaagtactgg ccggtgaagt 3840 tgacactcct tggattcgcg ctcaaattca aatctacaca cctgctcggc cacttcgtca 3900 acgccaatat ctacaacttg ctcaccgcga caccaactat ggacagcacg aaccagttcg 3960 tttcatgtgt agccgattca acgtattttc ccacctgtac gatcccaaca tctccacccc 4020 gcagtttcaa caacgagccc gactctggta ctctcaccag ctttaatctg tttttgtgtt 4080 tcatgttgtt agcatttgtc tttagttaag ttaagtttta tcattaagac cacaagttgt 4140 cggatgattc aaataaacaa ataaacaaat aaacaaaaaa aaaa 4184 // ID Tx1-10_BF repbase; DNA; INV; 3984 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-10_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-10_BF; KW Tx1-10_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3984 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3984 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 847-847 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is not present. XX FH Key Location/Qualifiers FT CDS 199..3606 FT /product="Tx1-10_BF_2p" FT /note="endonuclease and RT." FT /translation="MKNNTSFQVHQFKSDDSGRWIILDIEVNNFHFCLVNL FT YAPNDDSPSFFKELEEHIATFDISNESLIVVGDFNTVQNALVDRSGVQQRN FT YNPHALDAISELKGKFDLHDIWRFRNPKSVRYTWRRGIQASRLDYFLISFS FT LLNRITKCCIADKFKSDHNLISLSFVTTDFPRGPGYWQFNLSLLNDKTFCL FT KTKTVMTEFFQNNIGTASPQIVWDAAKCFFRGHCIKFSSYKKKQYLIKEKQ FT LTDEINALQCQIDNTPSPSQAQLDEMTCKQNSLETLYNERLNGLLLRSRSR FT WMELGERCTKYFFNFIHRNYARKNMQRLTRVSGEEILDPQEILSEQVKFYS FT NLYSFEEPPVPLTDTYCSTFFPPQYSKCLSHAQQQDCEGLITEEEVKHAIS FT TFQNGKAPGHDGIPIEVYRVFFDTFKVPMLQCFNFAFDNGHLSGTQKQGII FT SLLLKKDSKGVDKDPSIMGNWRPLTLSGCDTKIISKCIALRIKNVISDIVH FT QDQTGFIKDRYIGENIRKLIDIIEHYDDEEKPGLIFIADFRKAFDTLRWDF FT MFKCLEFFRFGPELIKWVKVLYRDTSSSVINSGYISKPFQLSRGVRQGCPL FT SPYIFILAAEVLAIKIRSNQSIKGLRVYGKCSKVSQFADDANFPFEPKLDS FT FYSLLSDIDGFSNISGLELNAEKCKILRLGSLKHSKFRLPTHLPVQWVDGD FT VDILGVHIPADMNLIVETNIEPRLDKLDRILRPWKGKSLSLYGKITVVNSL FT VVPQFTHLLQTLPSPNDSFFKRYEQKIFSFIWEDGPEKVARKVLYNSTENG FT GLNLINLRVFDQTLKASWIPKLYFHPEWFSSMMIHYNPRLSLQLLPFFQLS FT SITFLKLKGFILEALAAWCKFQYTESETPVSVRQQILWNNSSITIEGRPIL FT FNALLNRKIIFINDLLSEDGHLMSYHDFCSIYPDTCNQLRYQQLIASIPSK FT WKTLLSIEASKNVICVPHQKNYKWLSQVKINKSIYNFFLLSMNQTTVAHKI FT RLSWFDYFDTPIPWKSVFINLVKSTIDPSLRYFQFRLLYKYLPTNRMLRIW FT NLIETNLCSFCNSDEKSYVHVFYECIYDVSFWRKVQEWLLQECGLDIHLTS FT FLIIFGDLSTNASSVTKMIILLGKLFIF" XX SQ Sequence 3984 BP; 1299 A; 702 C; 705 G; 1278 T; 0 other; agttttaatt gtaatgggtt gggtaatttc tataaaagaa ggggagtctt cacttggcta 60 aaagagaaac cataccacat atattgctta caggaaaccc atacaacagc attagatgaa 120 gcaaaatggc aaaatgaatg gggtggtaaa atatattttt cacatggttc tactaggcag 180 agaggagtag caattttgat gaaaaataat acctccttcc aagttcatca gtttaaatcc 240 gatgacagcg gaagatggat tatactagac attgaagtaa ataattttca cttttgtctt 300 gttaatttat atgcaccaaa cgacgactcc ccttcatttt ttaaagaatt ggaggaacat 360 atagctactt ttgatatcag taatgagagc ttgatcgtgg ttggggattt taatacagtt 420 cagaacgctc tagtggacag atcaggtgtc cagcaacgta attacaaccc tcatgcgctt 480 gatgcaatat cagaactgaa aggaaaattt gatctgcacg atatttggcg ttttagaaat 540 cctaagtctg tcagatacac ctggcgccgt gggattcaag ctagtaggtt agattatttt 600 cttatttctt tttcgttgct taacagaata actaaatgtt gtatcgctga caaatttaaa 660 tcagatcaca atttgatctc actctcattt gtaactacag actttccccg agggccagga 720 tattggcaat ttaacctatc acttctcaat gacaaaactt tctgcttgaa aacaaaaact 780 gttatgactg agtttttcca aaacaatatc ggcaccgcaa gcccacagat agtttgggat 840 gccgccaagt gcttctttcg agggcactgc ataaaatttt ccagctacaa gaaaaaacag 900 taccttatca aagagaaaca acttactgat gaaataaatg ctcttcagtg tcaaatagat 960 aatactccct ctccctcaca agctcaatta gacgaaatga catgtaaaca aaattcactt 1020 gaaacccttt ataatgaacg tctcaacggt cttttgttaa gatctagatc ccgctggatg 1080 gagctgggag aaagatgcac taagtatttc tttaatttta tccatcgaaa ttatgctaga 1140 aaaaatatgc aaagacttac acgtgtttcg ggtgaggaaa tattggaccc acaagaaatt 1200 ttgtcggaac aagtaaaatt ctactctaac ttatactctt ttgaagagcc cccagtgcct 1260 ctcactgaca catactgtag tacgtttttt cctccacaat atagtaaatg tttgtctcat 1320 gcacagcaac aagattgtga aggtttgata acagaggaag aagttaaaca tgctatttca 1380 acattccaaa atggtaaggc tccaggacac gacggtattc cgatagaggt atacagagtc 1440 ttttttgata cttttaaggt accaatgtta caatgtttta attttgcctt tgataacgga 1500 catctgtctg ggacccaaaa gcaaggaatt atttctcttc ttcttaagaa agatagtaag 1560 ggggtggaca aagatccaag cataatggga aactggcggc ctctcacctt gtcaggatgc 1620 gacacaaaaa ttatatccaa atgtattgct ttaagaataa agaatgtaat atcagatatt 1680 gttcatcaag accaaactgg ttttattaaa gatagataca taggggaaaa tataagaaaa 1740 ctaattgaca ttatagagca ttacgatgac gaagagaaac ctggtctaat attcatagca 1800 gattttagga aagcgttcga tacgttacgg tgggatttta tgttcaaatg tttagaattc 1860 tttaggtttg gcccagaatt aataaagtgg gtaaaggttt tatacagaga cacatcaagt 1920 tccgtaataa acagtggata catatcaaaa cccttccagt taagtcgtgg tgtacgccaa 1980 ggctgtccct tgtcccccta tattttcatt ctcgctgctg aagttctagc aataaagata 2040 cgatccaacc agtccattaa aggtttacgg gtatatggaa aatgttcgaa agtttctcag 2100 tttgcagacg atgccaactt tccctttgag ccaaaacttg attcatttta ttcattatta 2160 tccgatattg acggtttttc aaatatatca ggccttgaat taaatgcaga aaaatgtaaa 2220 attctaagac taggctcttt aaaacattca aagtttcgac tacccaccca cttacctgta 2280 caatgggtag atggagatgt agatatatta ggtgtacata ttcctgctga tatgaatctt 2340 attgttgaaa ccaatataga acctaggtta gataagttag accgaatatt acgtccatgg 2400 aagggtaaat cgttatcgct ttatggcaaa ataacggttg tgaactctct agtagttccc 2460 caattcaccc acctactcca gaccttaccg tcaccaaatg attctttttt taaaagatac 2520 gaacaaaaga tattttcgtt tatttgggag gacggtccag aaaaagtcgc cagaaaggta 2580 ctttataact caacagaaaa tggaggttta aatttgatta acttacgtgt attcgatcaa 2640 actttgaaag cttcgtggat tccgaagttg tattttcacc ctgaatggtt ctcatcgatg 2700 atgatacatt ataacccacg tttgagtctg cagttactac ccttctttca actgtcgtca 2760 atcacttttc ttaagttaaa aggctttatt cttgaagcgt tagctgcttg gtgtaaattt 2820 caatatacag aatccgagac gccagttagt gtacggcaac agatcctttg gaacaactca 2880 agcatcacaa ttgaaggtag acctattctc tttaatgcac tcttgaatag aaaaattatc 2940 tttataaatg atttattgtc agaagatgga cacctgatgt cttaccacga tttttgttcg 3000 atttaccctg atacttgcaa tcaattacga tatcagcagc ttattgcttc cattccatcg 3060 aagtggaaaa cattgctaag tattgaggca tcaaaaaatg taatatgtgt acctcatcaa 3120 aagaattaca agtggttaag ccaagtcaag ataaacaaat ctatctacaa tttcttctta 3180 ttatctatga accaaactac agttgctcac aaaatcagac tttcctggtt tgattatttc 3240 gacacaccca ttccttggaa atcagtcttt ataaaccttg taaaatctac aatagatcct 3300 agtttaagat attttcagtt tcgtctgctc tataaatatt taccaacaaa caggatgcta 3360 cgaatctgga atctaataga aactaattta tgttcttttt gcaacagtga tgagaagagt 3420 tatgtacatg ttttttatga atgtatttat gatgtctcct tctggagaaa agtgcaggaa 3480 tggttattac aggaatgtgg cttagatatt catttaacaa gttttttaat tatttttgga 3540 gacctatcaa caaatgcatc atccgtaaca aaaatgataa tcttattagg aaaactgttt 3600 attttttagt ctagatcttc aaaaaggtta aatttcaact ctttcaaaag gctagtatta 3660 ttttatgaaa agaccgaata tcttattgcg tctcggagag ggaagctgga gaggcaccgg 3720 ggtaagtggc gatctttatg taatgcttaa gtttcatctt ctttttacca tactgtctca 3780 ttatttatcg acttatcaat tttctttacg taaagcttaa gttgcatctt tatttaacat 3840 actatgtcat tatttatcaa tttattaatt ccgttgtaac gcatctgcta ggggtggctt 3900 catctgtgtt ttttgtaatg tctgtatgtt ttgtttatca taatggaaaa tgaataaata 3960 tctaataaaa aaatttaaaa aaga 3984 // ID SINE2-2_AP repbase; DNA; INV; 1998 BP. XX AC . XX DT 18-MAR-2009 (Rel. 14.03, Created) DT 18-MAR-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of SINE retrotransposons - a consensus sequence. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Interspersed repeat; SINE2-2_AP. XX NM SINE2-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1998 RA Bao W. and Jurka J.; RT "SINE retrotransposons from Acyrthosiphon pisum."; RL Repbase Reports 9(3), 662-662 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 1998 BP; 749 A; 243 C; 248 G; 758 T; 0 other; ttagattctg aacgaagtga tgaatgtatt gattttacaa tgatgtgtgt tttttttttt 60 tttttttttt gtgtctgacg acaacttttg gagcagtaaa aatgcttcga ttttcaaaag 120 ttgtaccttt tctgaaagga aagtgaatct agttggtact ttggggggtc aaaagtgaaa 180 atttcccaat acttttcaaa agcggcagga aaaaccttaa aaaaataacg gaaaaacgcg 240 aatttttacg caaaaccaat ttttgacaaa aacgaaaact taattttact gtaactcaaa 300 aaataattat tgtaagcact tgaaatttgc aacagatgtt tatattagcg ttgtctatac 360 agggttaaat tttcaaagtg tttcgacttt ttttgagcta tttatagaca aatgaaattt 420 tcaatttttc taagtatttt tttttaaaat aacgataaaa aatttttggt tggatagaaa 480 actttgaaaa tttaatacaa ggtttcttat aagttgttct cacagtgata aaaaaaaatc 540 caaaatcgtt agtcacaatt tttttttata agtgcttaaa gtttaaattt atacgaaata 600 tgtcaaaatt gcgaaaattt gcaagtaatt ttgtggttga aaaatcgtaa aatttttttc 660 ttttataact aagcttttaa aatttggtac aaggttctcc ataagttttt ctttaaatat 720 ctgtaaaaaa aactcaaccg gactaagaca aaaaattttt aggagtgttt gaaatttaaa 780 tttttacaaa accgcattaa ataacggttt agcctcaaac gatttttgat atttgttatt 840 attcaaaaag tatgagtcgt agacacttga aaattttacc agttgtttaa attgacattt 900 tctttatata gttttatttt caaaatattt cactaatttt taatctattt ataggcaatt 960 gaaataatcg attttttttg attttttttt tataaatgtt gataaaaaaa aattagctgg 1020 gacaaaatac ttgaaaattt aatagaaggg tccacatatg ttgttctaac tcccattcaa 1080 aaattataaa aatacatagg cacaattttt ttttataagc atttaaagtt caaattttga 1140 ctaaatacgt aaaaattacg gcaatgttca aataatctta aacagaaatt cataaaagtt 1200 tttcttttta attctaagat ttggaaattt aatacaaggc tcctaacata tttttacaat 1260 agcagttgca aaataaaagg aatacattgt cacaattttt atttataagc atttaaagtt 1320 caaattttga caacatatta tgtaaaaatc acgaaaattc gtaaattatt ttatgctaga 1380 aattcataaa aaattttcct tttatatcta agatttcaaa atttaataca aggctcataa 1440 tatattttta caatagcaat tgaaaaataa aagaaatata tagtcacgat ttttttttta 1500 taagcattta aagttcaaat tttgactaaa tacgtaaaaa ttacggcaat gttcaaatta 1560 tcttagacag aaattcataa aagtttttct ttttaattct aagatttgga aatttaatac 1620 aaggctccta acatattttt acaatagcag ttgcaaaata aaaggaatac attgtcacaa 1680 tttttattta taagcattta aagttcaaat ttatacgaaa tatgtcaaaa ttgcgaaaat 1740 ttgcaagtaa tttagtggtt gaaaaatcgt aaaaattttt tcttttataa ctaagttttt 1800 aaaatttggt acaaggttct ccataagttt ttctttaaaa ataatatatc ctagactgac 1860 aaatcatctc cgttcagaat cgtttttcgt atacaatgat ataatatcat tggattcaaa 1920 tttaattcca tccattacag taacccactt gtaacctact gtacagcaga gcgacatcca 1980 cgacttaccc gccttttt 1998 // ID Gypsy-9_IS-I repbase; DNA; INV; 4045 BP. XX AC ABJB010083307; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_IS_; KW Gypsy-9_IS-LTR; Gypsy-9_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4045 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010083307; Positions 11921 7877. XX CC Positions [1503-2144] - Reverse transcriptase CC Positions [3147-3386] - Integrase core CC 'CCTAC' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 945..3386 FT /product="Gypsy-9_IS-I_1p" FT /translation="MLGDLYAVTPRREPPEEVIISLEVQGVPLRFVVDLGA FT AFTFISEQTFRAAWPRNPPEFTDDNIALRTWSGEGIECLGTIHVMVQFKKL FT KVNLPLLVIKHPGCNLLGRNSFAALGIGLSGVHQVVLDGSLTAAIERYGSV FT FDGNLNGYNGPAVSLELEDNASPKFLRARTVPFGLKPAVEAELQRLTEQGI FT LEQTQHSNWATPVLLVRKKNGSIRLCGDYRSTVNAVTLKASYPLPTVTEAL FT AKLRGGRIFSTIDLAQAYQQLRVTENTADVLTINTIKGLYRVKRLPFGVSA FT APAIFHKCMETTLAGISGVSVYLDDIIVSGATAAEHAQRLNLVLSRLQEAG FT LRANCEKCLFGVPGVTYLGHTIDAVGVHPTNDKIKAFRELPEPTSKATLQS FT FLGMLAFYDRFLENLATVACDLYRLQAKNVPCHWEERNALAFRKLKELVLR FT KTTLAHYDESKPLLLSCDASPYGVGAVLAQVDHEGREAPIAFASRTLRDAE FT KNCSQLDREGLAVMFGAMHFHQYIAGRSVIIITDHQPLLGILGPQKPVPQV FT LPPRMRRCCVKLSAYDYKLVYRPGKRHQNANVLSRLPLFDSDDEPCSPGDV FT LMIEALTSSPLTAKKIAAMTCADPILSRVYEAVQQGKLHELTEKNFTAYRK FT KGAEMSTQRGCVIWGSRVVIPALARPEAMNLVHAGHKGIVAMKLSARSYVW FT WPGIDRDIEVAVGSCPICQESRKIPSKAPLPTWDHASLPWDTLNLDFAGPL FT EGNMFLVVVDAYTKWLEVRSMTSATSAAVIKELRGMFATFGLPRKIVSDSG FT SVFVSKEITDFY" XX SQ Sequence 4045 BP; 1067 A; 1061 C; 1069 G; 848 T; 0 other; attggcgacg agtgagagaa acgaacctgc ggctttgaga agatggcgac tcacggcacc 60 atggaagcgt tctcgggaaa ggggtggtca tcttgggcgg tgcggctcat ctactacttc 120 gaggccaacg gcatcagggt tggcaccaag aaacgggctg ttctgctaac tctgtgtgtc 180 cctccacatt tgaaacggtc aaggcattgg tggctcccag gtctccaagt gagttgactt 240 tcgaggagat cgtgaatttg ctccaagcac attttgatcc ccggccggca gaactgttca 300 gccgttgcac gttccaacgg cgagatcaac agcccgaaga gtcagtgacc gcttacgtcg 360 ctgctcttaa gaaacttgca gcagactgca attttgggaa cgttcaagtg actccgacgg 420 ctagcacagc tggccctccc gtctcagcgc cagctgctgg ggcagccgac ggcacggtca 480 gcacggtcac aacagcggtt cccagcaacg cgacgatgtt acccatggac ataatgttgc 540 gcgatcgttt tgtttgcggt cttcaaaatg aaaatctcca gcagcggcta tttgcggagt 600 aggaccttat gtttctcaag gcgttcgaca ttgctgcacg agccgagagc gccaaggacc 660 accagcggca aataaaggca gattttcaag aaataaacaa agcggaaaaa acgcgtccgg 720 cacatcgtcg ccaggagaag acgacgccat tatccacgcc ctgctatcgt tgcgaagaac 780 tccacagtcc tagttcttgc aagttccgaa catcgcagtg tcggtactgc aaaaagacgg 840 gacacatcgc gaaagcttgc ttcaagagaa agaaaaacaa ccaaaactcc gtacaggcac 900 tcgaataaca ggaacatcat ggccctttta cggaaccgtc aggcatgctg ggtgacttgt 960 acgccgtaac accgcggcgg gagccgccag aggaggttat aataagcctg gaggttcaag 1020 gggttcctct cagatttgtg gtggacttgg gagcggcgtt tactttcatc agcgaacaaa 1080 cgttccgagc agcgtggcca aggaaccctc cggaatttac agacgataac atcgctctgc 1140 gtacctggtc tggggaaggc atagaatgtc tgggaacaat tcacgtcatg gtccagttca 1200 agaaactgaa agtgaatttg cctcttcttg tgataaagca ccccggatgt aaccttcttg 1260 ggcggaattc gtttgcagcc ctgggaattg gcctcagcgg cgtgcaccag gtagtgttgg 1320 acggcagcct gacagccgcc atcgaaaggt acgggtcagt ttttgacgga aacctgaacg 1380 gctacaacgg gccagcggta agcctagaac tggaagacaa cgcgtcaccc aagtttctac 1440 gagctcgcac tgtgccattc gggctcaaac cagcggtgga agccgagctg caacgtctca 1500 cggaacaagg catccttgaa cagactcagc actcgaattg ggcaacaccc gttctactag 1560 taaggaagaa aaacggcagc atcagacttt gtggtgacta ccgcagcacc gtcaacgctg 1620 tgacgctcaa ggcgtcatat ccactgccga cagtgacaga ggcgctcgca aagctgcggg 1680 gagggagaat tttctccacc atcgacctgg ctcaagcata ccagcagtta cgagtcaccg 1740 agaacacagc tgacgttctt accatcaaca cgataaaggg cctctatcgt gtcaagcgac 1800 ttcctttcgg cgtctcggca gcaccagcca ttttccacaa atgcatggag acaacacttg 1860 cagggatatc cggtgtcagc gtgtatctgg acgacatcat tgtcagcggt gccacagccg 1920 cggaacacgc ccagcgcctg aaccttgtcc tgtcaagact ccaagaagct ggcctgcgag 1980 caaattgtga gaagtgccta tttggcgttc ctggggtcac ttacctgggc cacacgatcg 2040 atgcggttgg tgttcatcca accaacgaca agatcaaggc gttcagggag ttgccagaac 2100 ctacgtcaaa agccacgctg cagtcgttcc tcggaatgct cgcattctac gacaggttct 2160 tagaaaacct agcgactgta gcctgcgatc tgtacagact acaagccaag aacgtccctt 2220 gtcactggga agagcgcaat gccctcgctt tccggaagtt gaaagagttg gtacttcgca 2280 agacaacgtt ggctcactac gatgagtcaa aaccactgct tctttcatgt gatgcgtcac 2340 catatggcgt cggcgcagtt cttgcgcagg tcgaccacga gggcagggaa gctccaatcg 2400 cattcgcgtc aagaactcta agggatgccg agaaaaattg ctcacagctt gatcgagaag 2460 gattagctgt gatgtttggg gcgatgcact ttcaccaata catcgccgga cgaagtgtca 2520 taattatcac cgatcaccaa ccactgcttg gcatcttggg accgcagaag ccggtaccac 2580 aagtcttgcc accccgaatg cggcgatgtt gcgtcaaact atcggcgtac gattacaaac 2640 tcgtgtacag gcctggaaaa cgacaccaga acgcgaatgt tctaagccgc ctgcccctat 2700 ttgacagtga cgacgagcct tgctcaccgg gagacgtcct gatgattgaa gccttgacaa 2760 gttctccgct gacagcaaag aagatagcag ccatgacatg cgcggatccg attctgtccc 2820 gcgtctacga ggctgtgcag caaggcaagc ttcatgagct cacggagaag aactttaccg 2880 cgtacagaaa aaaaggcgcg gagatgtcca ctcaaagagg atgtgtcata tggggatccc 2940 gggtggtcat accagccttg gcacgtcctg aagccatgaa cttagtgcat gctggacaca 3000 agggaattgt ggccatgaag ttatctgcac gcagctatgt gtggtggcct ggcatcgacc 3060 gcgacataga ggtcgctgtt ggcagctgcc ccatctgcca agagtccagg aagataccaa 3120 gcaaggcacc gcttcctacg tgggatcacg catcacttcc ttgggacacg ctgaatttgg 3180 actttgcagg ccctctggaa ggaaacatgt tccttgtcgt tgtagatgcc tacacgaagt 3240 ggctagaagt gcgaagcatg actagtgcca catctgcagc ggtcatcaaa gaactccgag 3300 gtatgttcgc aacttttgga ctgcctcgga agattgtttc cgacagcgga agcgtcttcg 3360 tatccaagga gataacagac ttttattaaa agaatggcat gaagtttgtg accagcgctc 3420 cgtaccatcc agcgacaaat ggccaagcgg agcgcatggt ttacgaactt aagcaaagtt 3480 tgacaagaga gaaaacagga tcgctgtcat tgagaatttc tagattcctg tataaacaac 3540 acaattcaat ttgcaactcg acaggaaaaa caccagcatt cttgatgttc aacagggagc 3600 tcgctactaa catcagtaga ctccttcctc gtcgagaaag tagtcacaag gaacgagata 3660 tggaaaagac accgccgtct agggtgtttc aagagggaca ggcagtctat gtcctaaact 3720 tcaaaggaac ccccaagtgg atccaatgga agttgaccaa aagacttgga attcgctcct 3780 ggctggcaga aacagcaact ggatcatccc gccgccacgt ggatcaaata aggcgtcgat 3840 ttacgacatc gacgccatcg gctgagtggt ctgttcctgc tcctgtttcg gactcttttc 3900 gtttcggacc tgtttcgtca ttccaagata attccgagac gacaccgaaa catacgacag 3960 acgccggaag cggaagaagg gagcaacctc caagaaatag gcaccctcca gaacgctatc 4020 gaagctacat ctgaggggga agagg 4045 // ID Copia-112_AA-LTR repbase; DNA; INV; 264 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-112_AA_; KW Copia-112_AA-I; Copia-112_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-264 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 264 BP; 67 A; 70 C; 58 G; 69 T; 0 other; tgaagagaat aagcagtgtc ttgcccctcg tcaaccacat cctgtgcatt acttttccac 60 tggggcgact gccgcacccc gacggcgacg gcagagcagt gtgtagagaa agagacagtg 120 ttcagttcaa ctttgtacat ccgacgagaa acacgtagag taaaagtttc ctttattcag 180 ttttcctaaa actcgtggtc gttccttttt ccggtaatcc gaaacatacg cggattcccc 240 tgtgcggtaa atcatactct ccca 264 // ID BEL-67_AA-LTR repbase; DNA; INV; 449 BP. XX AC AAGE02020328; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-67_AA_; KW BEL-67_AA-I; BEL-67_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-449 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020328; Positions 18476 18924. XX SQ Sequence 449 BP; 175 A; 93 C; 64 G; 117 T; 0 other; tgtgcgcgac gcagcctatt gcgttcagta ccgccccatc aacatctgct ccaacggcag 60 actagatgtt gctgtcatct ctttgacaat gtctttgaag cctatctcta acaaaacaac 120 acaaaatttc aacacgagaa cgtgaacaat tgaattacta ttgttaaaat cttattttcc 180 ccccaaacac gttaaaatag taagtataat ggattaaaat tgaattcctt taaactaaaa 240 tgtactatct atagaaagac acaaaacaca gaagaataaa cagttacaac agacggaaaa 300 cgaacgaaga aaactaaaaa ctgtaagtta acctaaaata attaaaacaa tgaaattaat 360 aaactttttg ttgtagcttg aagcttactc cgccaaaaac gagttcgtct aaagaggttt 420 tctaaacacc atccgtccgt acagtaaca 449 // ID Copia-1_DPu-LTR repbase; DNA; INV; 256 BP. XX AC scaffold_154; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DPu_; KW Copia-1_DPu-LTR; Copia-1_DPu-I. XX NM Copia-1_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-256 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 666-666 (2010). XX DR Genome; scaffold_154; Positions 222037 222292. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 256 BP; 62 A; 55 C; 38 G; 101 T; 0 other; tgttgagttt ctgcgacatc tagcgtcaca taaaccaatc tcctattcca ctcttcatga 60 agtaatcaag atcgtcattg cgtcgtctta ccgttgttgt attgatagcg tccttcacga 120 cgttttcctt cttgttcttt cttgatctga agttctttct gtagtatcaa tgtgtcctac 180 tctcatctca tcttgaatac atctcaggta ctaaatatta ttaagtttat tccattgttg 240 tgaaatagaa ttaaca 256 // ID PENT_PU repbase; DNA; INV; 805 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE P.univalens terminal heterochromatin/telomeric pentameric repeat DE segment DNA. XX KW PENT_PU; pentameric repeat; subterminal repeat; Telomeric repeat. XX OS Parascaris univalens OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Parascaris. XX RN [1] RP 1-805 RA Teschke C., Solleder C. and Moritz B.K.; RT "The highly variable pentameric repeats of the AT-rich germline RT limited DNA in Parascaris univalens are the telomeric repeats of RT somatic chromosomes."; RL Nucleic Acids Res 19(10), 2677-2684 (1991). XX RN [2] RP 1-805 RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX CC Evidence: experimental. XX SQ Sequence 805 BP; 187 A; 129 C; 162 G; 324 T; 3 other; attgcattgc ttgaattgca ttgaatgcat tgcattgcat tgcattcatt gcattgcatt 60 gaattgcatg cattgcattg aattgcattg aattgcattg cattgcattg cattgcattg 120 attgcattga attgctttga attgcattgc attgcatgca ttgcattgcg ttgaattgca 180 gtgaattgca ttgcattgga ttgcattgca ttgaattgca ttgcattgca ttgaattgca 240 ttgaattgca ttgaattgca ttgcattgca ttgcattgca ttgcattgca ttgttgcatt 300 gcattgaatt gcattgcatt gcattgcatt gcatttcatt gcattgcatt gaattcattg 360 cattgcattg cattgcattg aattgcattg cattgcattg aattgcattg cattgcattg 420 cattgcattg aattgcattg cattgcattg aattgcattg cattgcattg cattgcattg 480 cattgcattg cattgcattg aattgcattg aattgcattg cattgcattg aattgcattg 540 cattgcattg cattgcattg cattgaattg cattgaattg cattgaattg cattgcattt 600 cattgcattg cattgcattg cattgcattg cattgaattg cattgcattg cattgcattg 660 cattgcattg cgttgaattg cattgcattg cattkcattg cattkcattg cattgcrttg 720 catcgcattg cattgcattg aattgcattg gattgcattg cattgcattg cattgaattg 780 cattgcattg cattgaattg cattg 805 // ID Jockey-2_CQ repbase; DNA; INV; 4403 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4403 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 113-113 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 229..1512 FT /product="Jockey-2_CQ_1p" FT /translation="MTPPSSRRHRRNSTGGISSISDQLNEHRPLTQPGGSS FT GGFPTANQFEMLDFDISGDEQEDVNNGGGGEILPAGDGAASGNAAGKVVKN FT VPPAPAKVRCPPIFVHGSSVPALNRLMSTTQLGADDYHLRVNKGYIQIRVS FT TKSHFTSVVSTLKQANVQFYTHGTSDETPVKIVLSGLPVFPVEDVRRELED FT VFLRPTSVRQMGKSKHGDYALYLLQFEKGTVKLQELQQIKALFNVIVRWRH FT YSKKKSDVVQCFRCQQYGHGMRNCHLEAKCVKCGERHQTTACALPARADVV FT VNDDRSQIRCANCNQNHTANYKGCPTRLKYLQDLKAKKKTSPAGRSNLPKV FT SAVPAPAPRPPGGDLSQLLGSIANPGVSYSQAVQGQPESSTLFTVEEFMCL FT ASELFTRLSKCQSKAMQFLALSELIIKFVYNGQP" FT CDS 1502..4174 FT /product="Jockey-2_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MANLSSLKVMNWNCRSIRNKVVEFFHFLETHQIDIAA FT ITETWLQLNNSMYHEHYTIVRADRESTEAARGGGVALLVRNGIDFSKLETS FT TRAIESVGITVKTDPAPVNIFAVYYPGSPRRATLSQLRRDIRQLSSIDGPF FT FLVGDLNARHRMWNCARANKAGQILHQEYESKDIYIHAPDTPTRCPAGRGR FT PSTLDLVISNNQVQMSKPITHVELSSDHLPVTFTIDVDVPVAPAGRLVRCF FT DRANWSTFARSIEREVDLNASWINTLDRSEKIDDAIDKFTAILKAAEEAAV FT PSRRILPQKKDIPDELKLLIRLRNVRRRQFIRRRDPLIGAVVTHLNNVIST FT KSAEHRYKNFGSLLRTLDNGSNKFWKLTRNLRNTVKYSPPLNVNGDLVASP FT SAKAEALASAFAAAHNNEQPGDPETSAAVENALGHIRATTPPVPPAALVKP FT KEVKAIIRTLKNRKSPGQDGIRNTCLKQLPRKGLVVLTKIFNACLALGYFP FT ARWKHANVIAIPKANKDITNPGNYRPISLLSSLSKLLERVILARINRHLET FT EQLIPHEQFGFKPGHSTVHQLARISEMVKRGFLAGKSTGMILLDVEKAYDS FT VWQDAVVYKLFRSNLQPFLVKIVESFLTNRSFTVTVNGERSSVHNIPFGVP FT QGSVLSPTLYNILTSDVPMIDGVSYAFFADDTAYLASDKDPKIVVTHLQAA FT QNNLEEFQRKWRIKLNAGKTQSIFFTRRRAARHLPRRSISVNGQPATWDDE FT VRYLGVVHDKKLKYDKHVNNIIEKVDRSTKALYSLLNRRSKLSIKNKSLVV FT KCIIRPMLTYAAAVWGNCAKSHRKRLQVKQNKLLKMVHNLDPWYPTDDLHE FT LAGVDTIDASIEKATRTFRTSCGMSTNPLIEALLRQQL" XX SQ Sequence 4403 BP; 1078 A; 1305 C; 1168 G; 850 T; 2 other; tcattcgagc tgtcaccgtc gacgcgtaaa catcgtcgtc aaattctagc tctcgcgaaa 60 agttaattta ttggtgctcc cggtgtgttc accagcaccc tggggaaacg aaagcagccc 120 aagccgccgc cgctgcccgg cgaaggcgat tccggccggg acagtgcatc gcagatcctg 180 cagcgggtgg actacaagaa ggcccgcaaa cagcagaagg aaaaggtgat gacgccgcca 240 tcttcgagac gccaccgaag aaattcgacc ggcggtatct catcgatttc ggaccagctg 300 aacgaacatc ggccactgac ccaacccggg gggtcctccg ggggattccc gacggcgaat 360 cagttcgaga tgctggactt cgacatctcc ggcgacgagc aggaggatgt caacaatggc 420 ggcggtggtg agattttgcc cgctggtgat ggtgctgcta gtggtaatgc cgccggtaaa 480 gttgtgaaaa acgtgcctcc cgcgcccgcc aaggtgagat gtccgccaat cttcgtccac 540 ggttccagtg tcccagcgct taaccgactg atgtcaacaa cccagctggg cgccgacgat 600 taccacctgc gggtaaacaa agggtacatc cagatcaggg tgtcgactaa atcccacttt 660 acttctgtgg tgagtacttt gaagcaggca aatgtccagt tctacaccca cgggaccagt 720 gacgaaaccc cggtgaaaat cgtcctttcc ggattaccag tgtttccggt agaagacgtc 780 aggcgggaac tcgaagatgt gtttcttcgt cccacaagtg tgcgccagat ggggaaatcg 840 aagcacggcg attacgcgct gtacctgctg cagtttgaga agggaaccgt gaaacttcag 900 gagctgcaac aaatcaaggc gctgttcaac gtgatcgtcc gttggaggca ctactccaag 960 aagaagtcgg acgtcgtcca gtgtttccgg tgccagcaat acggccacgg gatgagaaac 1020 tgccatctgg aggccaagtg cgtgaagtgt ggtgagcgcc accagacgac cgcatgcgca 1080 ctccccgcgc gtgccgacgt tgtggtcaac gacgaccgct cgcaaatccg ttgtgctaac 1140 tgcaaccaaa accacacagc gaactacaag ggttgcccca cccgcctcaa gtacctgcag 1200 gacttgaagg ccaagaagaa gacgagtccc gccggcagaa gcaaccttcc gaaggtgtct 1260 gcagttcctg cgcctgctcc caggccaccg ggtggtgact tgagccagct gctcggcagc 1320 atcgcgaatc ctggggtctc gtactcgcaa gccgtgcagg gccagcccga gtcgtcaacc 1380 ttgttcaccg tcgaggagtt catgtgccta gccagtgaac tcttcactag gctttcgaag 1440 tgccaatcga aggccatgca attcctcgcc ctcagcgagc tcatcatcaa atttgtttat 1500 aatggccaac cttagctcgc tgaaggtgat gaattggaac tgccgttcca tccgcaacaa 1560 ggttgtggaa tttttccact tccttgaaac tcaccagatt gatatcgctg caatcacaga 1620 gacctggctg cagctcaaca actcgatgta tcatgagcac tacactattg tgcgtgcgga 1680 tcgcgaatcg acggaggccg cccggggggg aggagtggca cttctggtgc gaaacggcat 1740 cgacttctcc aagctggaaa cctcgacccg tgcaatcgag tctgttggga tcacagtcaa 1800 gacggacccg gcaccagtta acatctttgc cgtctactac cccggttctc ctcgcagggc 1860 aacgctcagc cagcttcgac gcgatatccg ccagctcagc agtatcgatg ggccattctt 1920 cctggtgggt gacctgaacg cgcgacatag gatgtggaac tgtgccaggg ccaacaaagc 1980 agggcaaatc ctgcaccaag agtacgagtc caaggacatc tacatacacg ctccggatac 2040 tcccacccgc tgccccgctg gacgagggag gccttccacg ttggacctgg tcatctcaaa 2100 caaccaggtg cagatgtcga agccgataac gcacgttgaa ctgtcgtcgg accatctgcc 2160 ggtcaccttt acgatcgacg ttgatgttcc agtcgccccg gctggaagac tcgtccgatg 2220 cttcgacaga gcgaactgga gcacttttgc tcgctcgatc gagcgggagg tcgaccttaa 2280 cgctagctgg atcaacactc tagatcgctc ggagaaaatc gacgacgcca tcgacaagtt 2340 caccgccatc ctcaaagctg ctgaggaagc cgctgtccct tcgcgtagga tcctacctca 2400 gaagaaggac attcccgacg agctgaagct gctaatccgg ctgaggaacg tgcggcgtag 2460 acagttcatc cgaagacggg acccgctcat cggagcggtg gtgacccacc tgaacaacgt 2520 catcagcacc aaaagcgccg aacaccgcta caaaaacttc ggctccttgc taaggacgct 2580 ggacaacggc agcaacaagt tctggaagct tacccgcaac ctccggaaca cggtcaagta 2640 cagccccccg ctgaacgtca acggagacct cgtagcgagc ccttcagcga aagccgaagc 2700 gctcgcgagt gcgttcgccg ctgcccacaa caacgagcaa ccaggtgatc ccgaaacatc 2760 ggccgctgtg gagaacgcgc ttggccacat ccgagcgaca actcctcccg tgcctccggc 2820 cgcccttgtt aagccgaaag aggtcaaggc gatcatccgc acgctcaaaa accggaagtc 2880 tccagggcaa gacggaatcc ggaatacgtg cctcaagcag ctgccaagaa aagggttggt 2940 cgtcctgacc aaaatcttca acgcttgcct ggccttgggc tactttccgg ctcgctggaa 3000 gcacgcaaac gtgatcgcca tcccaaaggc gaacaaggac ataaccaatc caggtaacta 3060 ccgcccgatc agccttttga gcagcctgag caaactcctg gaacgggtga tcctcgcccg 3120 gataaaccgt cacctggaga cggagcaact catcccacac gagcagttcg gcttcaagcc 3180 ggggcactca acggtgcacc aactcgcccg tatctcggag atggtgaagc gcggcttcct 3240 ggcggggaaa tcgaccggca tgatccttct cgacgtcgaa aaggcctacg actccgtgtg 3300 gcaggacgcc gtggtctaca agctcttccg ctcgaacctc cagccgttcc tggtcaagat 3360 cgtcgagtcg ttcctgacga atcggtcgtt tacggtgacg gtgaatggcg agcgttcctc 3420 cgtccacaac atccccttcg gcgtgcccca aggatcagtg ctgagcccga ctctctacaa 3480 catcctcacc tccgacgtgc cgatgatcga tggagtgtcg tacgcattct ttgcggatga 3540 cactgcatac ttggcatcgg acaaggaccc gaagatcgtc gtcacccacc tacaagcggc 3600 acagaacaac ctcgaggagt ttcagcggaa gtggcgcatc aagctgaacg ctgggaaaac 3660 ccagtccatc ttcttcacca ggaggcgtgc tgctcggcac cttccccgca gatcgatcag 3720 cgtcaacggc cagcctgcga cgtgggacga cgaggtcagg tacctgggtg tggtgcatga 3780 caagaagctg aagtacgaca agcacgtgaa caacatcatc gagaaggtgg atcgttccac 3840 caaggcgctc tactccctgc tkaatcggcg gtcgaagctg agcatcaaga acaagtcgct 3900 cgtggtcaag tgcatcatcc gtccaatgct gacctacgct gctgcggtct ggggcaactg 3960 cgccaaatcg caccgtaagc gactccaggt gaagcagaac aagctgctca agatggtgca 4020 caacttggac ccgtggtacc cgaccgacga tctacacgag ctggccggcg tcgacaccat 4080 cgacgcaagc atcgaaaagg caacgagaac cttcaggact tcctgtggga tgtcaacgaa 4140 tccgctcatc gaagcactcc tccgtcaaca actgtgatat tagttcttaa gccaaacata 4200 gatctagggt tttttctcaa aatttatttt ccctaaaaga gcacgcagct agcaamatct 4260 aaacgtagaa acatgtaccc tccctgtaaa ggcttatgaa ttgatctcag ttgaaaggtt 4320 ctccaaatct ctgaattgca aatgtaacat cctggagcag aactgttaaa ttctaataaa 4380 caccaattga attgaattga att 4403 // ID BEL-59_AA-I repbase; DNA; INV; 5876 BP. XX AC supercont1.11; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-59_AA_; KW BEL-59_AA-LTR; BEL-59_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5876 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.11; Positions 2168728 2174603. XX CC Positions [4917-5465] - Integrase core CC 'CAGAC' target site duplication CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 33..1790 FT /product="BEL-59_AA-I_1p" FT /translation="METRNDSPRNCQSCSRKDSAEGEMVQCESCMLWEHFG FT CAGVDEQVRRSDVKYICKRCNHPGVSNDRLKIPSGDLRSKGAKVGSKISSK FT ASSKKGKKIVDPAGSVTSSVRAAMLAEQLKLIEEDQLLQEQEIRDQEEIQK FT RLLEEEERQLEEKKKLAEEAKQIRERKLQEELDAKRKQQQVRRESLEKRQE FT IIRQLAEASSRSGSVVSSRQKVHNWLNDQDITGRSDGNHVRDAVKPNIIRI FT SEDPRKEPAQPLTLVPPVDVNAQSSFVVPPDFVPLSPPTTNLPTYPTRTLS FT QAQIAARQVLGKDLPVFDGNPEDWPIFISNYDQSSATCGYSDAENLVRLQR FT SLKGNALESVRSRLLLPASVPHVIETLRTLFGRPELLIRSMLSKINRIPSP FT RHDHLESVMHFGLAVQNLVDHLKAANQYNHLTNPVLMQELVEKLPGSMRLD FT WAMFKAKHQAATLATFGEFMSGLVTAASEVSFTLPSFNASQKYTVSTDTRS FT GRQRSGNTAVLQAHSTDCNQITNFATTPTSSKPANNALRAEERVTGWPNVN FT SSSQPASTNVGLWFSKKDSAEPALITTANGLADPGADAA" FT CDS join(2454..3362,3366..5876) FT /product="BEL-59_AA-I_2p" FT /translation="MNDQLRDYFTLEEIGCTERGNLVEPIEDQRAKRLLKD FT TTQRTRSGSNFETGLLWKTDDLNFPDSFPMAVRRLESLERRLNREPALQKC FT VRDQIADYERKGYAHRASLWELTSVDSKRVWYLPLGVVPNARKPGKVRLIW FT DAAAKVGDTSFNSQLLKGPDLLTPLPKVLCQLRQYPVAVTGDIMEMFHQIK FT IRAPDCQSQRFLFREQPTDPPRVYIMDVATFGSTCSPASAQYAKNLNAQEF FT TEEYPRAVTAIIDKHYVDDYLDSFETISEAVSVVNEVKLVHSKGGFTLRRF FT LSNEPAVLQGIEPVETESKNLDLERAGKIESVLGMEWVPSDDVFTYTLGAR FT EDILPFLDSSRVPTKREVARVVMSLFDPLGLISYLLVHGKVIIQELWIKGT FT EWDAQIPQDTNARWQQWTESLQQLEQLRITRYYFPGSLPKDFVSLQIHLFV FT DASDTAYAAVVYFHLQTERGIHVALVGSKTKVAPLKALSIPRLELKAAVLG FT SRLMETIQTQHSYPISQRFCWTDANTVLAWVRATDHRRYHKFVAVRIGEIL FT SSTQQSEWRWVPSKTNAADLATKWNNGPQEMVNSQWFNGPSFLYNPEEQWP FT RQQKNTITDEELRPVHFSLAHFSSSMVFERFNQWSKLLRVAAYVLRWIDNL FT KRRRNGENLELGILTSNELRRAEEALLKTAQCESNAEEVAILNKTSGPPER FT RHPTVNKSSEIYKKLPYIDDLGILRSRGRIGAAPYASMDAKFPIILSKQHV FT ITFLIVDWYHRRLRHANRETIFNEIRQRFEISALRRLLEKVERVCTICRIA FT KATPKSPVMAPLPEMRLTAFIQPFTYTGLDYFGPLLVKMGRSNVKRWVAVF FT TCLTIRAVHIEIVHSLSTGSCIMAVQRFVSRRGHPREFWSDNATCFQGTSN FT ELREHNRSMAKKFTTSRCTWKFIPPAAPHMGGAWERLVRSVKVAIGAVMEA FT PRKPDDETLETILTEAEGMINSRPLTYIPLESADQEALTPNHFLLGNSSGS FT KFLYTEPKGSCVVLRNSWKLARYITDDLWRRWLKEYLPMITRRCKWFEDVK FT ELEVGDLVLVVDGTARHKWVRGRIEEVIPGRDGRVRQALVRTSSGVLRRAA FT VKLAVLDVAGKSKPCSISSGAVDPHEGLRAGV" XX SQ Sequence 5876 BP; 1658 A; 1410 C; 1457 G; 1351 T; 0 other; atttctttaa gaaatttgtg ggtatcggaa gtatggaaac ccgcaacgat tcccctcgta 60 attgccagtc atgttcccgg aaggactctg cagaaggcga gatggtgcag tgcgagtcct 120 gcatgctctg ggagcatttt ggatgcgccg gtgtcgacga acaagttcga aggtcggatg 180 ttaagtacat ttgtaagcga tgtaatcacc caggggtgtc caacgatcgt ctcaagatcc 240 cttccggcga tttgcgctcc aagggagcaa aggtaggttc gaagataagt tccaaggcaa 300 gctcgaaaaa agggaagaag attgttgatc cggccggtag tgtaacctcc agcgtacgtg 360 cggcaatgtt agctgaacag ctgaaactta tcgaggaaga tcaactgttg caggaacaag 420 agataagaga tcaggaagaa atccagaaac gtttgttgga agaagaagaa cgccagttgg 480 aagagaaaaa gaagctcgcc gaagaggcga aacaaattcg cgagcgaaag ttgcaggagg 540 agctagacgc taaacggaaa caacaacaag tgaggagaga atcgctagag aaacgtcagg 600 aaatcatacg acaattggcg gaagcaagca gccgaagtgg atcggtggtg agctcccggc 660 aaaaggtgca taactggctg aacgatcagg acattactgg aagatctgac gggaatcatg 720 ttcgggatgc tgtcaaaccc aacattattc ggatttccga agacccacgg aaggagcctg 780 ctcagccgct tacacttgta ccaccggtcg atgtcaatgc acagtcgtcg tttgtcgtac 840 cacccgactt cgttccgtta tcgcctccca ccacaaactt acccacatat cctacacgaa 900 ctctgtcaca ggcccaaatc gcagcacgac aggttttggg gaaggatctg ccagtgtttg 960 acggaaaccc ggaagactgg ccgattttca taagtaacta tgaccagtct tcggcaacct 1020 gtggctattc ggatgccgaa aacctcgttc gtcttcagcg atcgctgaaa ggaaatgccc 1080 tggaatcagt gaggagccgt ttactcctcc cggctagcgt tccgcacgtt atcgaaactt 1140 tgcgcaccct tttcggacga ccggagctcc tcattcgttc aatgttgagc aagattaatc 1200 gaattccatc tccaaggcac gaccatctcg agagcgtcat gcacttcggt ctagctgtgc 1260 aaaacttggt cgaccatctc aaggcagcga accaatacaa ccatttgaca aacccagtac 1320 tcatgcagga acttgtggaa aaattaccag gttccatgcg actcgattgg gctatgttca 1380 aagccaagca tcaagctgcg actctcgcaa ccttcggtga attcatgtca ggattggtaa 1440 ctgcagccag cgaagtttct ttcactcttc caagcttcaa cgcaagccag aagtacacag 1500 tatcaacgga tactcgaagt ggcaggcaaa ggagcggaaa tacggctgtg ctccaagccc 1560 actcaacaga ttgcaatcaa attaccaatt tcgctactac tcctacaagt tctaagcctg 1620 caaacaatgc actgcgtgcg gaagaacggg tcacagggtg gccgaatgtc aacagttcat 1680 cgcagccagc gtcgacgaac gttggactct ggttcagcaa aaaggactct gccgaacctg 1740 ccttaataac cacggcaaat ggccttgccg atcctggagc ggatgcagcg tagacaattg 1800 ccgtcaaaga catcacacgc ttcttcattc ctcgagccct ccagaaacag taaacgtctc 1860 accaagtcac gttatttgtg gaaacttcca atggccactc tttcgaatca taccagtggt 1920 gttatatggt cgtgacactt cgcagaccgt ctacgccttt atcgacgaag ggtcgtcgta 1980 cacacttgta gaagactccg tggctatgcg actaggtatt accggcgaag aacaaccgtt 2040 aacactaaag tggaccggaa acgtaagcag ggtcgaatcc aaatcgcagc aagtacagct 2100 cgatatatcc ggtaagggta tcgatactcg atacacactc agccacgctc gcacggttgg 2160 tcgtttggtt ttgcccaccc aaagcctaaa gtacaaggaa ttagcacgtc gttttccaca 2220 cttgcgagga ttgccaatag aggattacga gctagttcaa ccaaaaatac ttattggact 2280 ggacaacctc agattggccg gttccactga aactacgaga aggaggttca cttgatccta 2340 tcggagcgaa atgtcgtttg gggtggagca tttatggatg tgttccgaac caaacgtctc 2400 attcagcaat tgtcggtttt cacgtcgctg ccgcctccgt gtcggatcag gaaatgaacg 2460 atcaacttcg agactatttc accctcgagg agatcggttg cactgagcgt ggcaatcttg 2520 tcgaaccaat agaagatcaa cgggccaagc ggctgttgaa agatacaaca caacgtacac 2580 gatcgggatc aaatttcgaa acaggtttac tttggaagac agacgacctg aactttcctg 2640 atagcttccc aatggccgtt cgccgcctag aatcgcttga acgtcggtta aaccgtgaac 2700 cggcattgca gaagtgtgtc cgtgatcaga tcgcagatta tgaacgaaag ggttatgcgc 2760 accgagccag tttgtgggaa ttgacatctg ttgactcgaa acgcgtttgg taccttccac 2820 tgggcgtggt acccaatgcc agaaaacctg gaaaggtgag attgatttgg gacgcagcag 2880 cgaaagtagg tgatacgtct ttcaactctc aacttctgaa agggccggat ctcttgaccc 2940 ctcttccaaa agtcctatgc caattacgac agtatcctgt ggccgtcacc ggggatataa 3000 tggaaatgtt ccaccagatc aaaattcgcg caccagattg ccagtcccaa cgcttcctgt 3060 tccgagagca gccaacggat cctcccaggg tctacataat ggacgtggcc accttcgggt 3120 ccacctgttc accggcgtcg gcacagtacg cgaagaacct taacgcacaa gagttcaccg 3180 aagaatatcc gcgagcagta acagcaatca ttgataagca ctacgtcgac gactatttgg 3240 acagtttcga gacgatttca gaggcggtga gtgttgtgaa tgaggtaaaa cttgtacact 3300 ctaaaggtgg ctttactctt cgtcgctttc tgtccaacga acctgcagtt ttgcaaggca 3360 tatgagaacc agtagaaacg gaatcaaaaa acttggacct cgaacgtgct ggaaaaattg 3420 agtcagtgtt ggggatggaa tgggtcccaa gcgatgacgt gttcacctac acccttggcg 3480 ctcgtgaaga tattctccca tttttggaca gcagtcgggt tcccacaaag cgagaagttg 3540 ccagagtagt catgagtctg tttgacccac tcgggctcat atcctatctt ttggtgcatg 3600 gaaaagttat aatccaagaa ctgtggataa aaggcacgga atgggatgca caaattcctc 3660 aagataccaa cgcccgctgg caacaatgga ctgaatcgct acagcaacta gaacaactgc 3720 gcattacgcg atactatttc cctggctctc ttccaaaaga tttcgtcagt ctacaaattc 3780 acctatttgt agacgcaagc gacacagcct acgcagctgt ggtgtacttc catttacaaa 3840 ctgagcgagg tatacatgta gcacttgtcg gttcaaaaac caaagttgcg cctttgaaag 3900 ccctctctat tccaagactt gagcttaagg cagctgtcct tggctcccgc ctgatggaaa 3960 caatccaaac tcaacactca tatccgataa gtcaacgttt ctgctggacg gacgctaaca 4020 ctgtactagc atgggttcgt gctactgatc atcgtcgata tcacaaattc gtcgccgtta 4080 gaataggaga gattttgtca tccacgcaac agtcagaatg gcggtgggta ccatcgaaaa 4140 ctaatgcagc tgatttggcc actaaatgga acaacggacc acaggaaatg gtgaatagcc 4200 aatggtttaa tggaccttcc ttcctataca acccggaaga acaatggcca agacaacaga 4260 aaaacaccat tactgatgaa gaacttcgac ccgtacactt cagcttagca cacttctctt 4320 caagcatggt gttcgaaaga tttaaccagt ggtccaagtt gctacgcgtg gctgcgtatg 4380 ttcttcgttg gattgacaac ctcaaaaggc gtagaaacgg ggaaaatctg gaacttggca 4440 tccttaccag caatgaacta cgtcgagcag aagaagcatt gctgaaaacg gctcagtgtg 4500 agtctaatgc agaagaagtc gccatcctta ataagacatc aggaccacct gagcgacgtc 4560 atccgacggt gaacaaatcc agtgaaatct acaaaaagtt accgtatatc gacgatttag 4620 gaatactgcg aagccgtggt cgaatcggtg cggcaccata cgcatccatg gacgctaaat 4680 ttcctatcat actctctaag caacatgtta taacgttcct aatcgttgat tggtatcatc 4740 gtcgtctccg tcacgcaaac cgtgaaacta tattcaacga aatacgtcaa cgattcgaaa 4800 tatctgcact acgacggctg ctggagaaag tagaacgggt gtgtacgatt tgtcgcattg 4860 cgaaagctac accaaaatcg ccggtaatgg cgcctcttcc agagatgcgt ctgacagcgt 4920 tcattcagcc gttcacttat accggactgg attactttgg tccgctgctt gtgaagatgg 4980 gcagaagtaa cgtgaagcgt tgggtggctg tgttcacttg ccttaccatc agagcggtgc 5040 acatagagat tgtacactcg ctaagcactg ggtcgtgcat catggcagtt caacgattcg 5100 tgtcacgcag aggtcatcca agagagtttt ggtctgataa tgccacatgc tttcaaggta 5160 caagcaatga actaagagag cacaacagat ccatggcaaa aaagttcacc acctctagat 5220 gtacgtggaa atttattccc cctgccgctc ctcatatggg aggggcgtgg gaaagacttg 5280 tgcgttcggt gaaggtcgct atcggagcag tgatggaagc tccacgtaag cccgatgacg 5340 agacattaga aactattcta acggaagctg aagggatgat taactcgagg ccacttacgt 5400 acattccatt agagtcggct gaccaggagg ccttaacacc taatcatttt ttgttgggaa 5460 actcttctgg gtcaaaattc ttgtataccg agccgaaggg cagctgtgta gttctgagaa 5520 acagctggaa actagcaagg tacattacgg acgacctttg gcgaaggtgg ctcaaagagt 5580 atctaccaat gattactcgc agatgcaagt ggttcgagga tgtgaaggag ctggaagtcg 5640 gggacttagt gttggtggtc gacggaacag ctagacataa gtgggtacga ggacgtattg 5700 aagaagtgat tcctgggcga gacggaagag tgcgacaagc actggtacga acatcgtcag 5760 gagtattacg aagagcagct gtaaaactag ctgtactgga cgtcgcggga aaaagtaaac 5820 cttgttccat ttcatctgga gctgtagacc ctcacgaagg tttacgggcg ggggta 5876 // ID Sola3-1_CR repbase; DNA; INV; 5062 BP. XX AC AAGD02001381.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Caenorhabditis remanei. XX KW Sola; DNA transposon; Transposable Element; Sola3; Sola3-1_CR. XX OS Caenorhabditis remanei OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5062 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(1439..1759,1776..2339,2532..2867,2857..3603, FT 3584..4405) FT /product="Sola3-1_CR_1p" FT /translation="MCRGAGMCDLNFLDPLVMDKWKTIDICEHHVAELLTE FT WNLLPTFRETHIYRVKTQSFGTVEACSMPDSIGTKHEKGRPIGRSHLSVKA FT ADALIKQDHTLVHPGIRKIFAMFPFTALCRSHENYIRDLMSKPRPPPTKKS FT RTSSESDSCSSEDPPYASSEESITQKKEITLAESFSQFAMLAGETKVCTVK FT PWNQLKHLTQEKKARTARNLFLTMLGIMVPDDTEEFKKLVERKTFVGKQWS FT TGSSASFEAVMEQLAVQFFAAECRRSRLLVLSFVTNSVSYLEMVKYIPHLR FT YFKSHFSPTVMIGLPYGVRNVKLSDGTKMEIPNSIRQQSATEVIEMWKNVC FT MVRIHEQMIGTYTNLVIQDNDQPDLLLSVSTMYKILEACVATKRESTTCVD FT YFIAYGMQVNFNWILIGRKLQNLCFKGFEDMHRVVDGWLAEELFSQSLTQL FT KTALFEVAQYYRTDYRLHIKSQSRVADHCASFALSDPSDKRLSSPCSSDPH FT KHSHDLKCDRCQHVNSTLEKLRDYAEEFLLDSREALKTADESTKQNIQAIL FT ERREDDKKVIERSIAYVHEMKKHLLRAAFTSQEREKIISGLKDNEALVTLD FT FAQKFLPKFHRELQSQYYGKKGVSYHISHVVAKIGDRLVQHSFVHVYSGPV FT TQVLTLYLRFSHCTKTSSCVDEPDTVSETIEDDVPEQNVGELYSCPEPGCS FT STFLKYFNLEKHILKAQHKIAPERMTERDFALNLFARRLEDVNESRTFPAI FT ETALKELKETKEDKHVKQGWGLPEKRTRKAFGEDVKKFLIECFDVGFNTGR FT PLNPFSVARKMRTERKHDGSKRFLEEEILDVRQIAGFFYRESEKRQPQETP FT QNRRKRWWKRPSPHRRRRDNEHEKPAALDIEDQHDDSEWISFLKSEVFWTL FT EDEFYNATLKSFDPIFEFEEIEMKS*" XX SQ Sequence 5062 BP; 1656 A; 904 C; 984 G; 1518 T; 0 other; gacgcactct ctgactcgtt cgatcaccca tgttcacccg aaaaaagtcg accatctccg 60 atttgccaat ccttttgatc aaagatagta ccaagtgttc ttagcaaatt gggtactttt 120 tggccaggtc cgtcacttgg gtacctagaa aaatcaaaaa atcgatattt ttcgaaaatt 180 tttcctgaag gagttaggaa tgaaatctca acgggaaatt aatgtagaca ctattttaag 240 cattttttgt gttcagaaca aaattccagc tcaacacctt catagtgaaa attttgaaaa 300 tttataaaat tttcgggttt ttttcatgaa atcgttttta tctcgacagg gacacgagaa 360 gacgggcttt tttattttaa attcggaatc tacataaaat ttggtattgg aaacatgttg 420 tccctactcg tatctctaga cccaaaaaaa gattttttgg aaagaaaatg agtcaggagc 480 gatcaagttc aactgaatgt gacttaagtc gcgctgaatt tctctcaaat gtgttttttt 540 tttgtttcga atttatttat ttgtttattt tatttattat tcaaccattt atctcctgtt 600 cttaaccatt ctcacaataa ttttttccaa ttttcgctca aaatttatga tttcatttca 660 atttcttcga actcaaaaat tggatcaaaa ctcttcagtg ttgcattata aaatttatct 720 tcaaacgtcc aaaacatttc gcttttcaaa aaagatatcc attcgctatc gtcgtgttga 780 tcttcgatgc cgatcctcct tgggttcctc cactatctct tcgttgacgt cttaactact 840 gatggcgcaa cagacgcggc gccaactgtt cctgcggccc ctctttatat cttctggtca 900 tgtcatcctc tctatccttc tccattattg atttatggga tggattcgat tgtcggtcac 960 gaaatagaga tactttttct tctcaattta gaagtagttt tatattcatc actatttgat 1020 aagatggttc gtttttcctt agctatatat tttttttcag gttacggcag ccagtgaacc 1080 tcatgttaat aattttattt ttactgtgcc ttttcccact aacctccacc aaacctatca 1140 atgaggtcag caaaacttga ttcgaattat aattttattg tagttgtttc agaagaacat 1200 aactccagga tcaaatgtaa aagaagatgt gaagtgttac ttttcccgaa tccgtaaaac 1260 tactcacgaa actgtagaaa gatcagaaaa gtcaattgtc agtacaattt gttcgggaaa 1320 actcgagaag tgagttttta aatgtacaaa aatattgaaa tttatattaa ggttggagaa 1380 atactcaact ttgtcagcag gcccagaagt taaagacgaa catccattat ccgtgattat 1440 gtgtcgtgga gctggaatgt gtgatttgaa ctttctcgac cctcttgtta tggataaatg 1500 gaaaactatt gacatttgtg aacatcatgt tgctgaactt ctcactgaat ggaatttgtt 1560 gccaacgttc cgtgagacgc atatttatcg tgtaaaaacg caaagttttg ggacagtaga 1620 agcttgctct atgcctgata gtattggaac aaaacatgaa aaaggaaggc caattggtcg 1680 atcccatctg tcagttaaag cagcagacgc attaattaaa caggaccata ctcttgttca 1740 tccaggaatt cgtaagatat agttgagaga aataatttgc aatgttccca tttacagctc 1800 tctgtcgttc acatgaaaat tatatcagag acctaatgtc aaaacctcgt cctcctccaa 1860 cgaagaagtc caggacatct agcgaatctg attcttgttc atcagaagat ccaccttatg 1920 ccagctcaga agaatcaatc actcaaaaaa aagaaataac gttggctgaa tctttttccc 1980 agtttgcaat gcttgctggc gaaacgaaag tatgtactgt gaagccgtgg aaccaattaa 2040 agcatcttac tcaagaaaaa aaggctcgca ctgccagaaa cttatttctt actatgcttg 2100 gaattatggt cccagatgat acggaagagt tcaaaaaact tgtagaaaga aaaacttttg 2160 ttggtaaaca atggtcaacc ggctctagtg catcctttga agcagttatg gagcagttag 2220 ctgttcagtt ctttgcagcg gaatgtcgaa gaagtagact gcttgtgctc tccttcgtca 2280 caaactcagt atcgtattta gagatggtga aatatattcc tcatttaagg tatttcaaat 2340 aaatccccag aaaatattat caaatttcta gccgatacat gtatgagtct tcaaaaattt 2400 ttggacgtcg gaaaagaagt gaaaatgcag tgaaggagcg acaattggta cgatacgatc 2460 ataaaaaagt tcaagcattt attgatttca ttaccaggta ttatttttgt aacgtttatt 2520 gactagaatg aagtcatttc agcccaacag taatgatagg gctaccatat ggagtcagaa 2580 acgtaaagct ttctgatggg accaaaatgg aaatacctaa ttcaattcgt caacaaagtg 2640 cgactgaagt gatagaaatg tggaaaaatg tttgcatggt aagaatacat gaacaaatga 2700 tagggacata tacaaacctt gtgattcagg acaatgatca gccggatctt cttttgagcg 2760 tttctacaat gtacaaaatt cttgaagcct gtgttgcaac taaacgagag tctacaacat 2820 gcgttgatta ttttatagct tatggcatgc aggtaaattt taattggtag aaagcttcaa 2880 aatttatgtt tcaagggttt cgaagacatg catcgtgttg ttgacggatg gctggcggaa 2940 gagctgtttt cccaaagcct aacgcaattg aaaacggcat tgtttgaagt tgctcagtat 3000 tatagaactg attaccgtct tcacatcaaa agtcagagta gagttgcaga tcactgcgct 3060 tcttttgcat tgagcgatcc cagcgataaa cgattatcgt cgccatgttc ttccgatccg 3120 cacaaacatt cgcacgacct caagtgtgac agatgccagc atgtgaattc aactttggag 3180 aaactgagag attatgctga agaatttctg ttggattcga gagaagcact gaaaacggct 3240 gatgaaagca cgaaacaaaa catacaagct attttggaac gtcgtgaaga tgataaaaaa 3300 gttattgaga ggagtattgc gtatgttcac gaaatgaaaa aacatctact tcgggctgca 3360 tttacgagtc aagaacgtga gaaaataatt tctggattga aggacaacga ggcactggta 3420 actctggact ttgcccagaa gtttctacca aagtttcatc gcgagttgca aagtcagtac 3480 tatggaaaaa aaggagtcag ttatcatatt tcccatgtgg tcgctaaaat aggagatcga 3540 cttgttcaac atagttttgt ccacgtttat tctgggccgg tgactcaggt tctcacattg 3600 tactaaaaca tcttcatgcg tggatgaacc tgatactgtc tctgaaacaa tcgaagatga 3660 cgttccagaa caaaatgttg gcgaattgta ctcttgcccg gaacctggct gctcgtcaac 3720 gtttttgaaa tatttcaact tagaaaaaca tattctgaaa gctcagcata aaattgcacc 3780 ggaaagaatg acggagagag attttgcttt aaatcttttc gctcgacgtt tagaagatgt 3840 aaacgaatct agaacgtttc cagctattga aactgcttta aaagaactga aggagacaaa 3900 agaggacaaa catgtaaaac aaggatgggg cttaccagag aagagaacac gcaaagcatt 3960 tggagaggac gtcaaaaaat ttctaataga atgtttcgat gtaggattca acacgggcag 4020 accgctgaac ccatttagtg ttgcgagaaa aatgcgtacg gagagaaaac atgatgggtc 4080 aaaaaggttt ttggaagaag agattttgga tgttcgacaa attgctggat tcttctacag 4140 agaatcagaa aaaagacaac ctcaggaaac gccgcaaaat cgtcgaaaaa gatggtggaa 4200 aagaccatcc ccgcatagaa gacgaagaga taatgaacat gaaaaaccgg ccgcactcga 4260 catcgaagat caacacgacg atagcgaatg gatatctttt ttgaaaagcg aagtgttttg 4320 gacgcttgaa gatgaatttt ataatgcaac actgaagagt tttgatccaa tttttgagtt 4380 cgaagaaatt gaaatgaaat cataaatttt gagcgaaaat tggaaaaaat tattgtgaga 4440 atggttaaga acaggagata aatggttgaa taataaataa aataaacaaa taaataaatt 4500 cgaaacaaaa aaaaaacaca tttgagagaa attcagcgcg acttaagtca cattcagttg 4560 aacttgatcg atcctgactc attttctttc caaaaaatct ttttttgggt ctagagatac 4620 gagtagggac aacatgtttc caataccaaa ttttatgtag attccgaatt taaaataaaa 4680 aagcccgtct tctcgtgtcc ctgtcgagat aaaaacgatt tcatgaaaaa aaacccgaaa 4740 atttcataaa ttttcaaaat tttcactatg aaggtgttga gctggaattt tgttctgaac 4800 acaaaaaatg cttaaaatag tgtctacatt aatttcccgt tgagatttca ttcctacctc 4860 cttcaggaaa aattttcgaa aaatatcgat tttttgattt ttctaggtac ccaagtgacg 4920 gacctggcca aaaagtaccc caatttgtta agaacacttg gtactatctt tgatcaaaag 4980 gattggcaaa tcggagatgg tcgacttttt tcgggtgaac atggtctgtt gatcgatgct 5040 cgaacgagtc agagagtgcg tc 5062 // ID SMAR10 repbase; DNA; INV; 1297 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR10. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1297 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1068-1068 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 281..1102 FT /product="SMAR10_1p" FT /translation="MYKWIALFKHGRQSVFDDQRTGRPCEISDKIKEKCES FT LLREDRRITTRALADAVNVSKGTIQAILHEFGVRKLASRFVPRFLSSEMCE FT KRLECSQANLLLFQQHGEAFLRNIITQDETPLSLYLPESKRESSEWKFPGE FT SSSKKMRSGTSHRKCLMLSIFWDARGVILVDFAEKTTRLNSAYYSDLVCKC FT RHARRKERGIPMWFLQDNAPIHKSAASMRVLEESGFDLLDHPPYSPDLAPS FT DFYLFRYIKKHLRGKQFENNDELKEAVENFLHD" XX SQ Sequence 1297 BP; 373 A; 248 C; 286 G; 385 T; 5 other; tacgtggtgc gccaataagt ccgtgcaata tgtargaagg aaatattatt ttgtcatgta 60 atcaaatttt tattataggt ttaatttgtg tagtttgttt taaattatca ttacagttta 120 tattgacatt tgcgagattt tgaaatttgg tgcaagtcat gattgatgca gtggaatatc 180 gcagcgttat aaagtttctg gtattacgcc atgttcccaa tgaccaattt tttcccaatt 240 ggaggagaca tatggcgagg actctccatc acgcagagca atgtacaagt ggattgcact 300 gtttaaacat gggagacagt cagtttttga tgaccaaagg actggacgcc catgtgaaat 360 ttctgacaaa attaaagaaa agtgtgaatc tttactgcga gaagaccgtc gaatcacaac 420 gcgggcactg gctgatgctg tgaatgtgag caaaggaacc atacaagcaa ttctgcatga 480 atttggagtg agaaagctgg cgtccagatt tgttccccgc tttctttcat cagagatgtg 540 tgagaagcga ctggagtgct ctcaggcaaa cctccttctt ttccaacagc atggtgaggc 600 gttcctgcgt aatatcatca ckcaagatga aacccctctc agtctgtatc tgccagagtc 660 taaacgtgag tcaagtgagt ggaaatttcc aggagaatca agttccaaga agatgagatc 720 tggaacctcc cacagaaaat gcctgatgct gagcattttt tgggatgccc gtggtgttat 780 ccttgtcgat tttgctgaaa aaaccaccag gctcaactct gcatactact ctgatcttgt 840 gtgtaaatgc cgacatgctc gacgaaaaga acgaggtatt cccatgtggt ttctccagga 900 caacgcacca atccacaaaa gtgcagcatc catgcgagtt ctggaggaaa gtggatttga 960 tctattggac catcctccat acagyccaga tctggctcca agtgactttt accttttccg 1020 ctatattaag aarcacttac gtggcaaaca gtttgaaaat aatgatgaac tgaaggaagc 1080 agtggagaat ttcctgcatg attagtcccc agatttcttc aaaaatgcat tttcagaact 1140 tgtgatccag cgctgggaaa agtgtgtgaa tgtaaatggc tcttatattg aaaaatgatg 1200 tctatgttac atgaattata tgttattgtg aatgaaartt tagtgtttct attcaattgc 1260 tttcctagtt attgcacgta ctttttggcg caccacg 1297 // ID BEL-92_AA-LTR repbase; DNA; INV; 587 BP. XX AC supercont1.183; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-92_AA_; KW BEL-92_AA-I; BEL-92_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-587 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.183; Positions 909734 909148. XX SQ Sequence 587 BP; 216 A; 95 C; 116 G; 160 T; 0 other; tgttgaggca ccgttcactg gcaacacggc ccaggcgcac tatcacagta acatagaaca 60 aatcgaacaa ccttagaaac cgagagatag atagggatta gaaccaaaac aaaaccttca 120 ataacagtga attgtcgaat ctatcattgc catgcagttt ttgctattaa agtgaatata 180 atttgaaaat atactttttt gttaaatcgc gcattatacg aagttgtact tcgacattta 240 aagcctcaaa cgtaagtgta gaactgaaat tattagacat tctttaatat cgatgtagtt 300 atccaggttg tcaccaatag tgcacgtatt tcatcattag tgggaacaaa cgtatctgaa 360 ttattctgac tgcatctaat acagagagga aggaaacctc agtaaaagga ggaaaaggtt 420 cttggatatt aaggaagagg aggattggtt gaaggaagga agctattcat gtaagtgaag 480 taattgttat atgattctga aagataatta aataaacaat cgttacagtt ttgagctgct 540 aaaaccataa aagagctgct tcaagaaaat agtgacccac ccgaaca 587 // ID Gypsy-2_PPP-I repbase; DNA; INV; 5375 BP. XX AC ADBJ01000051; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PPP_; KW Gypsy-2_PPP-LTR; Gypsy-2_PPP-I. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-5375 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2160-2160 (2010). XX DR GenBank; ADBJ01000051; Positions 314523 319897. XX CC Positions [3632-4156] - Integrase core CC 'ACAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 80..5362 FT /product="Gypsy-2_PPP-I_1p" FT /translation="MPEFPAVPLALPTSTNANLPGVAITVENADFEIFHLV FT DNSIEPLINFINSDEKYRAMAAKYANMKNVPVKTAILYTCNKLVMMDTNTA FT FNQIMSTKVGKASIDSLPKFDFESSTADIESHLLLIESTIPLPNARAKVFF FT NTLNEKSRALTRNFHVAANASWNEYKSMMVKLFSREFPPEHYVGILAKLKF FT LPNRDDIVSFNLTFTKVVNKTRNIVNSQTAAAMYANALRNVRNLESIKNLP FT DLESRMLETAELVVKYGINGRRICTLQNIVDDEEKVGSKNHKGGFVEKRPH FT DFKKNLHKNKNNHHGAKPACYVCGKVGHLAKDCYFKNKNKDATVNFINSTP FT AIQAVINDQKTVAVVDTGADISVISKDMAKRLNVSIQKCNPIKINNVSNSI FT LCNEFTNIKFYITTDNNKRLECNEKFYVINTHIDYIILGVNWQASACITLS FT ANPEGVIISVKVNNDIHKLINLNKNSLNKNLIKIKLNELKNERKVQDKNIN FT FLKNKINKINVKNRDQFNELDRLLKKQNLIQSEIKKIIDDVNKDNRVKNKN FT NTDFTPHYAWVNINEIYDDISKDVDTDNDKKMNESVTDNIDIENETKVVET FT LKRLFPNVFIPFDGLPPSRGKQFDMTIKLKTEEIPKSKLYPVPIAHLEELK FT KQINDRLNKGWIKRSRSPFGSPILFVSKPDGGWRLCVDYRELNKITVRDDY FT PLPRINELLNNTRLAYWLSKLDLLDGYHQIRINEGEQYKTAFKTTFGTFEY FT TVTPFGLAGAGANFQRLMDHIFQLEILNMKICVYLDDILIMTNSDSLDDHI FT NDLIEIFKILQKHDFKVKLSKCKFARREIEFLGHVVGRGVIKPLHNKIESI FT LNWKKPENKNEMRSFIGLVGYYRRFISNVSSIEIPLLNMIKDKSEFVWTEE FT ATNAFNQIKKLVEDVKYLVAPNYTIPFHLECDASKYGIGHALYQINKLDNI FT TRDFISFGSRKLTISEINYTVLEKELLSIIHALKVNYYHLIGHEVIINTDH FT KNIKFLREQYAIGINSRINRWLQFIELFNPTLQYIKGETNVIADGLSRYTF FT DINNIKISYDNDLLENIKDSYKLEIKMIENDEIKLQSIYKSDKVTTLNDVK FT YFEGKIIIPMVKPLIDQILHLYHNSSTSAHINAFSMLEQISKKFIWDSMKS FT DIYKFYKRCDVCNKGSDYGDKRRGFLQPLPIMNDRFLALSCDFITNLNEVE FT DSNGRKFNQIWVIVDRFSKYTTLIPTHSTYTSLELARLFKSEFIKIHGVPL FT YISSDRDSKLTSKTWKEFAKLLNIKLNFTTADHQQANGQAEIVIRAIKNTL FT RKLLIEDKENDLNRNWYQHVDTIQFGLNNTKSTSTNYTPFLIAYGRNPSTV FT ADLAIPLNREPSDTVNDNEIESLLNYTSTIIINVRNNLIKIRNKMLSNYNK FT NKKYNDFKIGDKVYVKEKRPSKKWNKLEAVYAGPYEIIDIDKQNLNITIRD FT YISGVSRIRTTSKTLHVKYFKHAPKDDVNEREYIPVEKLDSFLNNDELISD FT VKMSDISNEDEITDETMDQDDVVNHQDEINQDEDDISDLEDLNDLMTDISS FT IDSSLEPKGHFSDLQLSRILNPEKNTYHKVREILNNNKSSPNTFINLLRTL FT EEKESVIDYPHNKVTEESNLLINSIMKFAISPELFEMIQKRSFKIVSKRRL FT TSNTDYLVKISNTVGWVSRQILAKHASKQINDFNKKIDNLNPVANINSLLF FT RENNILNALETVTNPHHRNKLMSKYHKIKSNTAVFFLNTSF" XX SQ Sequence 5375 BP; 2111 A; 838 C; 846 G; 1580 T; 0 other; tggtggagag cctcactacg ttcccggaac acctttccgt ggtattaacc tactcaattc 60 actccaatct gtaagagaca tgcctgaatt cccagctgtg ccgttagctt tgcctacctc 120 taccaacgcc aacttgccag gtgttgctat tacagtggaa aacgcagact tcgagatctt 180 tcacctcgtc gacaattcca tcgaacctct tataaacttt atcaacagcg atgaaaagta 240 tcgggccatg gctgcaaagt acgctaacat gaaaaatgta ccagtaaaaa cagccatact 300 gtacacctgt aacaaactgg tgatgatgga tacaaacacc gctttcaacc agataatgtc 360 tactaaagtc ggtaaagcct cgattgatag tctccccaag ttcgatttcg aatccagtac 420 tgccgacatc gaatcacatc tactactcat tgaaagtacc atacctttgc ctaacgccag 480 agctaaagta tttttcaaca ctctcaacga gaaatccaga gccctaacaa gaaacttcca 540 tgttgccgca aatgcatctt ggaatgaata caaatccatg atggtgaagc tattcagtcg 600 tgagtttcca ccagaacact atgtcggtat tctcgccaaa ctaaagtttc tgcctaaccg 660 cgacgacata gtttctttca atctgacctt cacaaaagtg gtaaataaga ccagaaacat 720 cgtgaactcc cagacggccg ctgcaatgta tgccaatgct ttgcgtaatg taagaaactt 780 ggaatccatt aaaaacctgc cagacttaga atcccgtatg ttggagaccg cagagttagt 840 tgttaaatat ggcataaacg gtcgacgtat ctgtacctta cagaatattg ttgatgatga 900 ggagaaagtt ggttcaaaga accataaagg tggctttgtt gaaaaaagac cacatgattt 960 caaaaagaat ctccataaaa ataaaaataa ccaccacggt gctaagccag catgttacgt 1020 atgtggtaaa gtcggtcatc ttgctaaaga ctgttacttt aaaaacaaga acaaagatgc 1080 tactgttaac tttataaaca gtacaccagc aatacaagct gtaatcaatg accagaaaac 1140 ggtcgcggtt gtagacaccg gagcagacat atcagtcata agtaaagaca tggcaaaacg 1200 tctaaatgta agcatacaaa aatgtaatcc tattaaaatt aataatgtta gtaattctat 1260 attatgtaat gaatttacaa acataaaatt ttatattact acggataata ataaaagatt 1320 agaatgtaat gaaaaattct atgttattaa tacccacatt gattatatca tattaggtgt 1380 gaactggcaa gcttcagcct gtattaccct gtctgctaat ccagaaggtg taataatatc 1440 agtaaaagtt aataatgata tacacaaact aattaatctt aacaaaaata gtttaaacaa 1500 aaatctaata aaaattaaat taaatgaatt aaagaacgaa cgtaaagttc aagataaaaa 1560 tattaatttt ttgaaaaata aaattaataa aataaatgtg aaaaatcgtg atcagtttaa 1620 cgaattagat cgattattaa aaaaacaaaa tttaattcag tctgagatta aaaaaattat 1680 tgatgacgta aataaagata atagggttaa aaataaaaat aatactgatt tcacgcccca 1740 ttatgcttgg gtgaatataa atgaaatcta tgatgatatt tcaaaagatg tagatacaga 1800 taatgataaa aaaatgaatg aatctgttac agataatatc gacatcgaaa acgaaacaaa 1860 agttgtagaa actttgaaac ggttattccc aaacgtgttt ataccgtttg atggtttacc 1920 accttcaaga ggtaaacagt ttgatatgac aattaaatta aaaacggaag agattcccaa 1980 atcaaagtta tatcctgtac caatagcaca cttagaagaa ttaaaaaaac aaattaatga 2040 tagattgaat aaagggtgga tcaagagatc tagatcccca tttggtagtc ctatcttatt 2100 cgtctctaaa ccagacggtg gatggagact ttgtgtcgac tacagagagc taaataaaat 2160 tactgttcgt gatgattatc ctcttccaag gattaatgaa ttattaaata atacaagatt 2220 agcttactgg ctatcaaagt tggatctgtt agacggatac catcaaattc gtatcaatga 2280 aggcgaacaa tacaaaactg cttttaaaac tacatttggt acatttgaat atacagtaac 2340 accgttcggg ttagccggag caggtgctaa ttttcaacga ttaatggatc acatattcca 2400 gttggaaatt ttaaatatga aaatatgtgt atatctagat gatatattaa tcatgactaa 2460 ttcagatagt ttagacgatc atatcaatga tttgatcgaa atttttaaaa ttttacaaaa 2520 acatgacttt aaagtcaaat tatcgaaatg taaatttgct agacgagaaa ttgaattctt 2580 aggtcatgta gttggtcgtg gagtaataaa accacttcac aataaaatag aatcaatttt 2640 aaattggaaa aaacccgaga ataagaatga aatgagatca ttcataggtt tagtgggtta 2700 ttatagaaga tttatcagta atgtatcatc tattgagatt cctttactta atatgataaa 2760 agataaatct gaatttgtgt ggactgaaga agccacaaat gcttttaatc agattaagaa 2820 attggtcgaa gatgttaaat atcttgtcgc accaaattat acaataccat ttcatcttga 2880 gtgtgacgca tctaaatacg gcattggtca tgctctatat caaattaata agttagataa 2940 tataaccaga gatttcatct cttttggttc aagaaaactc acaatcagtg aaataaatta 3000 cactgtgtta gaaaaagaat tgttatcaat cattcatgct ttgaaagtaa attattatca 3060 tttgatagga cacgaagtga ttataaatac tgatcacaaa aatatcaagt tcctacgcga 3120 acaatacgca ataggaatta attctagaat caacagatgg ttacaattca ttgagttgtt 3180 taatccgaca ttacaataca taaaaggtga aacaaatgtt atagccgatg gtttatcaag 3240 atatactttt gatatcaata atattaaaat atcttatgat aatgatctat tagaaaatat 3300 aaaagatagt tataaattgg aaatcaaaat gattgaaaac gatgaaatta aactacaatc 3360 tatttataaa agtgataaag ttacaactct aaatgatgtt aaatattttg aaggtaaaat 3420 tattatacct atggtaaagc cattaattga tcaaatatta catttatatc ataattcatc 3480 tacatctgca catatcaatg cgtttagcat gttagaacaa atttctaaaa aatttatttg 3540 ggattcaatg aaatccgata tttataaatt ttataaaaga tgtgatgtat gtaacaaagg 3600 tagtgattat ggtgataaga gaagaggttt tttacaacca ttaccaataa tgaatgacag 3660 atttttagca ctgtcatgtg actttattac gaatttaaat gaagtggaag atagtaatgg 3720 tagaaaattc aatcaaatct gggtaatcgt agatagattt tcaaagtata ctacattaat 3780 acccacgcat agcacatata cttctttaga acttgctaga ttattcaaaa gcgagtttat 3840 aaagatacac ggagtaccat tatacatctc cagcgataga gacagcaaac taacttccaa 3900 aacttggaaa gaatttgcta aattattaaa tatcaaactt aatttcacta cagctgatca 3960 tcaacaagct aatggacaag ctgagatagt gattagagcg attaaaaata ctttaagaaa 4020 gttattaatc gaagacaaag aaaatgattt aaacagaaat tggtatcaac atgtagatac 4080 aattcagttt ggattaaata atactaaatc tacttctaca aattatacac cattcttaat 4140 tgcttatggt agaaatccat ctactgtcgc cgacttggca attcctttaa atagagaacc 4200 aagtgataca gtgaacgata atgagattga atcattatta aattatacct ctacaatcat 4260 tataaatgtt agaaataact taattaagat tagaaacaag atgttgtcta attataataa 4320 aaataaaaag tataatgatt ttaagatcgg tgataaagtg tatgttaagg aaaagagacc 4380 ttccaagaag tggaacaaat tagaagcagt atacgctggt ccctatgaga tcatcgatat 4440 tgacaaacaa aatttaaata taactataag agattatatc tctggagttt caagaattag 4500 aacaacgtct aaaacgttgc acgtaaaata ctttaaacac gctccaaagg atgacgtgaa 4560 cgagagagaa tacattccag tagaaaaatt agattctttt ctaaataatg atgaattaat 4620 atccgatgta aaaatgtctg atatcagtaa tgaagacgaa atcacagatg aaactatgga 4680 tcaagacgac gtagttaatc atcaagatga aattaatcaa gacgaagatg atatctctga 4740 tcttgaagat ttaaatgatt tgatgactga tatatcaagt attgattcgt cattagaacc 4800 taaaggacat ttttcagatt tacagttatc gagaatctta aatcctgaaa aaaatacata 4860 tcataaagtt agagaaattt taaataataa taaatctagc ccaaatacat ttataaatct 4920 tttaagaact ttagaagaaa aggaatcagt aatagattac cctcacaaca aagtaacaga 4980 ggaatctaac cttttaataa atagtattat gaaatttgca atatcgccag aattattcga 5040 aatgattcag aaaagatcat ttaaaatagt atctaaaaga agattaacta gtaacactga 5100 ttatttagtt aaaataagta atactgttgg atgggtttca agacaaatcc tcgccaaaca 5160 tgcatccaaa caaatcaacg atttcaacaa gaaaatcgac aacctcaacc ctgtagccaa 5220 tattaacagt ttactgttta gagagaacaa tattctaaac gccttagaga cagttaccaa 5280 cccacatcac cgcaataaac ttatgtcaaa gtaccataag atcaaatcca acaccgctgt 5340 attcttcttg aatacttctt tttaagaaag aggag 5375 // ID FAMAR1 repbase; DNA; INV; 1299 BP. XX AC . XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 01-NOV-2005 (Rel. 10.09, Last updated, Version 2) XX DE Forficula auricularia Famar1 mariner transposable element - a DE consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; FAMAR1. XX NM FAMAR1. XX OS Forficula auricularia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Dermaptera; Forficulina; Forficuloidea; OC Forficulidae; Forficula. XX RN [1] RP 1-1299 RA Lampe D.J., Witherspoon D.J., Soto-Adames F.N. RA and Robertson H.M.; RT "Recent horizontal transfer of mellifera subfamily mariner RT transposons into insect lineages representing four different RT orders shows that selection acts only during horizontal RT transfer."; RL Mol Biol Evol 20(4), 554-562 (2003). XX DR [1] (Consensus) XX CC The sequence is 95% identical to MARINER_AM, possibly due to CC horizontal transfer. Other species sharing highly similar CC mariners are Ceratitis capitata (Med fly) and a blister beetle CC Epicauta funebris (pestifera). They are not represented in CC Repbase. XX FH Key Location/Qualifiers FT CDS 190..1215 FT /product="FAMAR1_1p" FT /translation="MENQKEHFRHILLFYFRKGKNASQAHKKLCAVYGDEA FT LKERQCQNWFAKFRSGDFSLKDEKRSGRPVEVDDDLIKAIIDSDRHSTTRE FT IAEKLHVSHTCIENHLKQLGYVQKLDTWVPHELKETHLTQRINICDLLKKR FT NENDPFLKRLITGDEKWVVYNNIKRKRSWSRPGEPAQTTSKAGIHQKKVLL FT SVWWDYKGIVYFELLPPNRTINSVVYIEQLTKLNNAVEEKRAELTNRKGVV FT FHHDNARPHTSLVTRQKLLELGWDVLPHPPYSPDLAPSDYFLFRSLQNSLN FT GKNFNNDDDVKSYLIQFFANKNQKFYERGIMMLPERWQKVIDQNGQYITE" XX SQ Sequence 1299 BP; 436 A; 226 C; 248 G; 389 T; 0 other; atatataagg ttggcaacta agtacttgcg gatttcactc atagatggct tcagttgaat 60 ttttaggttt gctggcgtag tccaaatgta aaacacattt tgttatttga tagttggcaa 120 ttcagctgtc aatcagtaaa aaaagttttt tgatcggttt cttagttttc gtttggcgtt 180 cgttgaaaaa tggaaaatca aaaggaacat tttcgtcata ttttgctttt ttatttccgc 240 aaagggaaaa acgcatcgca agctcacaaa aagttatgtg ctgtttatgg cgacgaagcc 300 ttaaaagaac ggcagtgtca aaattggttt gccaaatttc gttctggtga tttttcactc 360 aaagatgaaa aacgctctgg tcgtccagtt gaagttgatg acgacctaat caaagcaata 420 atcgattcgg atcgtcacag tacaacacgt gagattgcag agaagcttca tgtatcacat 480 acatgcattg aaaatcactt aaaacaactt ggctatgttc aaaaactcga tacatgggtt 540 cctcacgaac tgaaagaaac gcatttaacg caacgcatta acatctgcga tttgctaaag 600 aaacgtaatg aaaatgatcc atttttaaaa cgactgataa ctggcgatga aaaatgggtt 660 gtttacaaca atatcaagcg gaaaagatcg tggagcaggc caggtgaacc agctcaaaca 720 acatcaaaag ctggtattca tcaaaagaag gttttgttat cagtttggtg ggattacaaa 780 ggaattgtct actttgaact cttaccaccc aaccgaacga tcaattctgt tgtctacatt 840 gaacaactaa cgaaattaaa caatgcagtt gaagaaaagc gggccgaatt gacaaatcga 900 aaaggtgttg tattccatca tgacaatgca aggccacaca catctttggt cactcggcaa 960 aaattattgg agcttggttg ggatgttttg ccacatccac catatagtcc tgaccttgca 1020 ccatctgatt actttttatt tcgatcttta caaaactcct tgaatggtaa aaatttcaat 1080 aatgatgatg atgtcaaatc gtacctgatt cagttttttg ctaataaaaa ccagaagttt 1140 tatgaacgtg ggattatgat gctgcctgaa agatggcaaa aggtcattga tcaaaatggg 1200 caatacatta cagaataaag ttatttagtt ccatgaaaaa attgtttgat tttctaaaaa 1260 aatccgcaat tatttagttg ccagcccaat ataatatat 1299 // ID SAT_ME repbase; DNA; INV; 277 BP. XX AC AY078994; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Meloidogyne exigua satellite DNA. XX KW SAT; Satellite; Simple Repeat; SAT_ME. XX OS Meloidogyne exigua OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RA Randig O., Bongiovanni M., Carneiro M.R., Sarah L.J. RA and Castagnone-Sereno P.; RT "A species-specific satellite DNA family in the genome of the RT coffee root-knot nematode Meloidogyne exigua: application to RT molecular diagnostics of the parasite."; RL Mol. Plant Pathol 3(6), 431-437 (2002). XX DR Genbank; AY078994; Positions 1 277. XX SQ Sequence 277 BP; 41 A; 79 C; 48 G; 109 T; 0 other; agatcttcaa agccctcagt agtacccttc ttttttcttg ttttattttt attcctagac 60 ccttagatcg gttcagtgcc tacatttggt cgcctgttct tttcttggcg gtgtacactt 120 cggtgccagt tgattttcag tggtggacac ttcgatgcct gagttcattg tgggggtttc 180 agaggagtga ttcttggacc acttctcttt cttccccact cctttctttc tttcttcacc 240 cctctctctc ccctccctta tacacatcta cgcccat 277 // ID L1-2_Cis repbase; DNA; INV; 6100 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-2_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6100 RA Smit A.F.; RT "L1-2_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000161 2% div. XX SQ Sequence 6100 BP; 2504 A; 1105 C; 957 G; 1523 T; 11 other; tagcattcta tttgaataag gcaagattca ccttgtgtgt aagttgccaa acagcaaata 60 gatctctttg gattatccat tcggattatt tcatcgactt aatctgtgga tataccccct 120 ctcgagccag caagtacctg caaacgaaag gtaaaatagt ttattgcctt gttttactat 180 ttgtaggttt aaagcccccc gtttatttcg aatttttcaa ttaccagcct ccaagtcctt 240 tttttcgact taaggcgaat agcaccccgt gattaatatc tgtatataaa tattcctttt 300 ctttttgtgg tgtaagttag tatctctgtg ccacgtggat cccaacgttg gcaacactgg 360 tactaactta acgagtttga ttaacctttt caccttgtga agctttctat agtgtttccg 420 cgcttttcgt tccaattttt tttggaaaca ctgataatta gttttgcgtg aataatacct 480 gattacgcgc acttctcccg ttttttcttt ctcttcttac tcctgggctt gtgtagactc 540 acagctcact gtcgccaccc cgcgacgtac aatatataat ttttcatacc actataagta 600 agtgacatcc aatacgtggg tttccgacct gttaaaaata aacaaacata ttaaaacaac 660 acaanagtat aaaaacaaac aaaagaacga tcatgatgag agacaacgta cctgaaattg 720 aagcgaacct tgaacgatct ttggtgttcc aactcatcgg aaatgtggaa cacatgagcg 780 tgagttcctt catggaattg tttgcagacg gagccccgct cgcagacatg tccagcatgg 840 tcgatggcgt gatccttgac aactttaagg actgcacctt catcgtcacc ctgaaagaga 900 aaaatgatga tgttgatatt cctcaaagaa aagaagtaat taaatacttc gaaaataacg 960 acatcaattt caccacggaa aagggttcaa ccatgtcgat taaagccgaa ctgccacagg 1020 gnggaagcga agtggtttcc ttacaccctg taccagcaga taccgataga aaaaaactgg 1080 aagcactcat tacaggacaa aagtggggaa agataaaaaa catcaactat ggaacacacc 1140 gaaacttcaa taagattaaa aacggttggg tcaacattac tctcacagag acacaaatnc 1200 aaaacattcc acaactcatc aaaatagcag gacgtaccat tacagttacg cgaccagggg 1260 aagagcatat ggcgctttgc agatactgta aagtaagagg acatatccaa aataaatgtc 1320 cgaagaaagg tttctgtggc gaatgcaaaa ctcatggcca cacgtcaaga aactgcagag 1380 gaaacgtcca agcaccccgc ccaagaacag tctttcattc agactgtaat acagcggaac 1440 taatgaacac accaaaaagg aacaacattg gacaacanca accatggcac acagtagaac 1500 gctacgtcga aaagcgttac caaaatacaa gaaatgctga aattaaatta acaaacagat 1560 ttggactatt ggaaaactgg ccagaatttt atattgaaaa tgaaatgcga caagaagggc 1620 cgaaccaata cgacgaaagc atcttcaatt tggaagattt tccaaacatg acggcggtaa 1680 acagtacccc aaagaaaccc ctcacaccgc tntcacagag aaaagcaaag gggggaaaga 1740 aaaacaaanc accaaaccgt caagaacanc acaaaacaga ccaaccaatt gaaaacccca 1800 tggacaacac acaggaaaac aaacaaggca aagcgacatt aaaacaacaa ccaacaaaag 1860 ttcatcacct tagcgattca tcaagtgaca ctaatgaacc caccgagccc atgaccatgt 1920 ataacgagac aaaacaagcc atacccattc accgaaagga atcaaatgta gagtcaattt 1980 atggaacacc agacaatgca aatgacatac ctacaattcg gagatcagta attgaacgat 2040 cggacatctt cacgtcagag gcaacttctt cacctaaaag aaaacgaact gactcataaa 2100 tcgcgttaaa cgcaaacaaa ccgttaaaaa caattacaat tccatgtaat aaagtatcta 2160 atggagccaa acccagagat ggttaataat ttgaaaattg gatctttaaa cacaaatgga 2220 cttctaaaca aaattaaaaa gataatttat tacatggaat ccaataaaat cgacatacta 2280 ctaatacaag aaacccacgt acttacccag gaaatgatct ccaaattcaa atcgcagagt 2340 aacattgaaa ttttcgccaa tgcgcccgaa catcgtatta catccttccg ccaaggaacc 2400 gcaatcttcg tcaaaaaaca catcctgtca atgtacaaaa ttcatcatga tatattattt 2460 gaaaaccgtg ttcaaaaact aaaccttaaa tataaacata atgaaattaa cttatataac 2520 atttatctaa aagctggtca gacatatatt aatctagtta acagagaaca aatgatctat 2580 gaattgaaag ataaaataga aaacgaaaac gaaacaattg acttactggt tggcgacttt 2640 aatatggtat cgaatgaaat cgacgtaaaa gcaaattatg ataaaaggaa aaacagagac 2700 aggatagcgc taaaacgact acaaggcggg cttaactttc acgacgcatt tcgcctaata 2760 aataaacaaa agattgaatt tacgaggatt acaaaaacga gtgccactag attagataga 2820 atatatgtta ataatcaagc gaaaaataaa gtaattagtt tttcacatat acgcaactat 2880 atttctgacc acaataattg cccagtcata acccttaaaa taaatagcaa tcgtaaatgg 2940 ggattatcct tctacaaaat aaacaattca atattaaaac attatgacct aattgaaaat 3000 ataagcataa tgtggaataa ttggcaacat caaaaaacaa aatatatgaa tacgtcctta 3060 tggtgggaaa acggaaagaa actcattacg aatgaagtac gaaactattc tcaaagtatt 3120 acgcaagcgg aacgaaaacg atatattact aaagtgcaag agttaaatga tttagaaaac 3180 cgagttcaat ctgaaaaaat aatatcaaaa atactacgat taaaagaaaa catcaatcag 3240 tatgaacgta aaattaatga aggcgcaata attaggtcca aaataaaaat tatagaagat 3300 gaagaaaaac caaccaagga attctttaaa tacgaagaaa caaaatgcaa ccgggatagc 3360 atatattcta tatataataa aacgggatca ttaacagaaa ataaagaaca aacccttata 3420 gcaatagaga acttttatcg cgacctatgg acaagtgaag aaataaatgt aaacgatata 3480 gatgaatacg tatcaattat agaaccctta acctttgacc aataccaact aaaggaaatg 3540 tcccgaccaa taaaccataa agaaatctac aactgcctaa ttgaatcgaa cactaatagt 3600 accccaggat gtgacggcct aactataaaa atatatagac ttttatggga caatattaaa 3660 tacgacatgg aagagctata taacaatatt tacttaaaag gaactatgcc agaaaccatg 3720 agaaccgcta taattaaatt aatctacaaa aaaggggaca aaaaagacat cagaaattgg 3780 agacctattt cgctactaaa tacggactat aaaatactga gtaaaataat agcaaaccgt 3840 ctaaatatag ttatccccaa aataataagc cgagaccaaa agtgtgcaat acgcggaaga 3900 tcaattaacg acaatctata taacgtaaaa gcatgtatag acgcagcgaa acaattcaat 3960 aaaaatctaa caatcattgc aatcgatttt gaaaaggctt tcgaccgggt aaattattca 4020 tatntattca aaatattatc aaaacttaac atacccaaat acttaatcaa ttggctgagg 4080 ataatatata ataatattca aagcaaaata gaattaaacg gagcttttac aaaaaatatt 4140 agaataacga gaggaatacg acaaggctgc ccatgtagca tgttgttatt cctaattggt 4200 gtagaaatat taaacagaaa gataaatatt aataaaaata taactggatt taaactaaat 4260 aaaatagagt taaaattaga acaatatgca gacgatctgt ctataataat atcaaatgcg 4320 caatccctaa aggaggtaat aagggaatta aaaacattcg aaaaagcatc tggccaaata 4380 atgaacacaa gtaaaacaca aattattaca aacgatatat caattcaaaa tatattaaac 4440 gaagacttcc caagtgactg catnaaagaa aagataaaaa tacttggcgt ttattttagc 4500 ttaagtattg actgcataca agacaacata gcaaaatccc gacgcgtgat aaatacttta 4560 tattggaaaa atctcaaacg caaattaacg ttaaaaggta gaataatgat tataaactct 4620 ctctttatac cgcaactaat cacaatagga agacatatgc tattncctaa acattttata 4680 aacgaaatca ataactatat atataaattc atatggtatc cttttaagat agaccgcata 4740 gctaggaaaa aactaatagc gcaccctaag gatggaggat tgttcgcccc taacgtaaat 4800 ctaaaactaa aggccgtacg agcaacccgt ttgtatgact taantaaatt agaaaaaatc 4860 gaaacaatag cccaagagtg gactcgatat aacttgggat caacgataaa agtaataaat 4920 aataagcttt atacgaattc ggcaataaat gcaacggaac ctaacgcctt ttttactgat 4980 atacgcaaaa caatatacca actgcgtcga acagattttc aatgggaatc taataaacta 5040 aagcctatat acctagaact tttaaaaaat gtagcgcaaa caacagtaat atgtgaaaat 5100 agtgaaatca taaaatggcc gcaaattaca ctaagtgaaa aacgaaccaa atcccacttt 5160 actaatttag aacgcgatag aaactataaa atagcgcata acgcatatca tttcggcgat 5220 tggtatagag acaaaattgg aatgcaatac cggaacggaa aagtattaat aagaaattgt 5280 aaattttgcg gaaacgaaac agataatata aggcatattc ttacggaatg ccaattaacg 5340 gaaataatga ttaatgacat agaagaaata acaaacgaag cttgtaaaca gaatacgcag 5400 attaccaagt cagtaatact atataaccaa acgataaata atgcaacgcc taacctattt 5460 gtaactaaag taataaatat ctttaagtcc gaaatcatta agaaaaaaca tcaactcgat 5520 tttgcgaata aatatattgg cagtaaccat gattttacga gaaaaatgct ttggattata 5580 aacacaaaaa tcaaaaatat attaaggcgt gaatgtacat tgaaaggcaa acaggatact 5640 tacgaattat acgatctaaa acacaattac gtattttaaa agccgttgtt cttcaactcg 5700 aaagcgctgt ctaaaaaaaa atggtcactt acaaagaaaa cgtacgaatt atactatttt 5760 gttattatac atttataaaa caaggcaatt aactatttta cttttacaat atctatatat 5820 atatattggg aaaaaacaaa caaaaaactc accaaaaaga aagtattttg cagaactcgc 5880 tggctgagaa actcatccct ctctcctgca atggaaattt tgttaaaatg tgtaaatgtg 5940 atcaaatgtt aaatgtaaat aatatgtaaa attgatgtaa aatgttaaat gtaaatatgt 6000 atatatgtaa atatgtaaat gtaatgtttc gtattattat tatgtacggc cgggaaaccg 6060 gtaatcgtgt acttagtcgc ccaataaaaa aaaaaaaaaa 6100 // ID Copia-1-I_HS repbase; DNA; INV; 4722 BP. XX AC AC234858.1; XX DT 16-JUN-2009 (Rel. 14.06, Created) DT 16-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE LTR retrotransposon from Hydractinia symbiolongicarpus: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1-I_HS. XX OS Hydractinia symbiolongicarpus OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydractiniidae; Hydractinia. XX RN [1] RP 1-4722 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydractinia symbiolongicarpus."; RL Repbase Reports 9(6), 1260-1260 (2009). XX DR [1] (Consensus) XX SQ Sequence 4722 BP; 1649 A; 738 C; 1051 G; 1284 T; 0 other; atggtagcag agttaaaagg tacagatcaa ctggcctggt atttatgatc attgtgcgaa 60 tgacatttga gttgaaagaa agcaattgag ggcacacacc cggatgtgtt ccttgaggta 120 agagagataa ctttcccttt tgttggctat ttgttgacac gtgtgtgtgc aaaccaattg 180 gtgaagtggt ttgtgaacaa tggcaacaac aacattgaag gctccaccag cgtttaaaga 240 ggatgatgac tatagcagtt ggaagaacga tttggagatc tggcagatgt ttacagaact 300 accaaagaaa aagcaaggtc cagctgtata cctgtcactg actggtcggg cacgggaggc 360 agtacgcgac ctgacacctg cagaagtggg atcagatgat ggtttggaca agattatatc 420 taagctagat tcaatatatt tgaaagatga aagcacaaga gcatttattg cttttaaaga 480 gttttatgac tataagagga atagtggaga caaatttcct gattttattg tgcagtttga 540 gaagctttac aataagatat ccaaatatga tatggcattg cctgaagctg tcagggcata 600 ttttcttctc acagctgcga atatgtctga agaaaatgaa aaactggctc gtacaacttg 660 tgcagaactc aagtataaga atatgaagga cacaatcatg aagatatttg gagaactgaa 720 cacagcagaa gacgctgcac atttgggagc tcttcctgtg aaacaagaat gcttatatgg 780 aaatcttcat gctgatacca aggacagctt gtcaaggaat acacgttcaa gaatgaatcc 840 tgttgataag gatgggaaac tcatgaggtg ccatgaatgt gattcaacaa gacattttgc 900 taataaatgt cctcatagaa atcaagagca aagaggtttc agatcacgga acaatagacc 960 tacagaaaag aaacaagatt taaatgagat ccatatcacc ctttttgcag cctctccaga 1020 ccaaaggcaa tattgtcttg ttggagaatc ctttggtaga ggtgtcttgg attctggctg 1080 tacaaagaca gtggctgggg agaaatggat ggatgaatat cttgggactc tctctgaatc 1140 agataaaaat gctgtagttt caaatagtac tgattctata tatagatttg gtgatggaaa 1200 ggaagtgaaa gctacaagaa atctaataat accagttatt ctaggtggca acaagtataa 1260 attgtctgtt gacatagtag agattgaaat accactactg attagtcgta gtaccatgaa 1320 gtctctcagt atgaagttag attttaccag tgacttggcc cttattggga aggaaagaat 1380 taacttaata tgtaccacaa caggacacta ttgtcttcca ctaacacatt gtgatcttga 1440 tgctcaaaaa ggaacaaaaa tcattctaca tggagaaaat ttgaaaaata ttagtaagac 1500 agagaaaatg aagaaagcta tgaaactgca ccgccaattc gctcatgctg gcaaagataa 1560 gctactcaaa cttgtaaagg atagtaattt gtatgacaag gaatttttgt catgcatcga 1620 agaatgttgc aatgattgtg aaatatgtca caaatataaa aggccataca gtagaccagt 1680 tgtgggtata cctctggcaa acacttttaa tcaagtaata tgtatggatt taaaggttta 1740 tgaacataat aaacaacaca tcttacatat cattgatgct gcaactcgtt actctgctgg 1800 atgtcttatc aaatcgaaac ataaagatgt cattgtcagt agagtgttta gattttggat 1860 tgcatacttc ggatcaccaa gtacaatctt gactgacaat ggtggtgaat tctcaaatga 1920 ggttttacaa gaaatgaatg aaaagctcaa tattgagaca aaaacaacag caggcgaatc 1980 tcctttcagc aatggtattg tggaaagaca taataaaatt ctggcagaga ccatgttcaa 2040 gacaattgaa gatacgaagt gtgaaccaga cgtagcacta gcatgggccc tcagcgcaaa 2100 gaattcttta cagaatcatg gtggctacag cccaaaccaa ttggtatttg gccataatgt 2160 gaacacaccc actattctga cagatcattt accagcattg gagccaacta cgtcaagtga 2220 aatagtgaga aaaaatctat ctgctctgca taaggcaagg gagaatttca ttaaagcaga 2280 atccagtgag agaattaaaa gagcgctgag acataatatt agaacatatt ctgatgagac 2340 gtatttgaat ggagaaaaag tcttctacaa gcgcaaaaac accaagggtt ggaaaggtcc 2400 aggtgtggtt ttgggacaag atggtcaaat ggtgcttgtc agacatggtg gtaactttta 2460 ccgagtgcat ccctgtcagt tgatgaaaca tagccaaaat tgtaaagtta gtgaattcaa 2520 gtcagatcat gtcaagagga aggaaagcca tttgaataag ttgtgtgata tgaacgatag 2580 cgtcaatgat tgtgataatg atgatgatga tgatgatgat gatggtggtg atgattatga 2640 taatgatgat gatgatgatg atgacacagc tgataagaat gtgatagatg tttatgatga 2700 ttctgaagat gtagagcaat atcatcaaga acaagctaga aatgtagacg aaaatgacca 2760 agaagattct ggcacaagtg aaaatgaagt tgaggaaaac agggatactc ataaaaataa 2820 tgaaactgaa gaagacgaag gtagtggttg tacaagtgac attttcagaa gaccatctac 2880 agaatccaag aaagttgaat cgaaaccaaa agcaaaatca tacatcaaat acaatctaaa 2940 tggaagatgg tatcatgcaa gagttctctc agcacaaccc aaaagaagtg gcacttataa 3000 aaattgggtt aatgtgtcta atgatggtga tagaaagcca aaaagcatca attggaatga 3060 tgtgtccaaa tgggtcgagc tggaaaatcc ggaattccct gtctatatgt gtgatgtcga 3120 tatgttttca caggaagtag ttgatgcgaa ggagagagaa ttgcagaatt taaaagaaaa 3180 tgatgtcttc gaagaagttc cagacactgg tcaaacaact gtatcaagca aatgggtcat 3240 aactgagaag ttcaaaggtg aaaaaagggt tgtgaaagct agattggtgg ctagaggctt 3300 tgaagaaaat tcacaatcgt tgagaacaga ttctccaact tgtggcaaac aaagtcttcg 3360 acttgctatg gctatcatag ttagcaatga ttggcagatc aattcacttg atatcactgc 3420 agcttttctt cagggagatg ctattgaaag agaactgttt ctacaaccac caaaagacac 3480 attgtcattg ggaacagtgt ggaaactaaa gagatgtttg tatgggttaa acgatgcatc 3540 tagagcatgg tacaaacgag tgagaagtga aatgttgaaa cttggtggtg aaatgtgtac 3600 ctacgaccct gctgtgtttt actggtatag agtcaagcaa atcactggaa ttcttgtttg 3660 ccatgttgat gactttgtgt atggtggtag tccagctttc cacaaagaag ttattgctaa 3720 acttttagaa acgttcaaaa tcagtacgca atcaagctca accttcaagt atctaggatt 3780 agatgtggat caatttaaaa gcaaaattaa gattaatcaa gtgagctata tagagtcttt 3840 gataactgtt aatattgaaa atgatgttaa caatgaacga aaattgacta gcaaggaaaa 3900 gaccatgctg agatcgttga gtgggcaaat tgcttgggtg gccggacaaa cccgccctga 3960 tgtggcttat gatagttgcc aaatgtctaa ttatggcaag gaacctacag ttcaaaattt 4020 gaaagatgcc aacaagatta ttcgaaaaat taagagcaag catgttgcaa taaccattgc 4080 caaaattgca aatatgaaaa attgtgaaat tatttgctac acagatgcaa ctcatgccag 4140 tttgaagtgt ggatcatcac aaggtgcatt catcatattt gtgaaacacc tgagcgaagt 4200 tataccaatc tgctggcaat caaaaaaact acaaagggta accaaaagcc caattgctat 4260 ggaaacactt gctctgagtg aaggagctga tgctagtttt tatttggcca atgtcataca 4320 gcaaatctgc aagttgacga aggttccaaa gatcacttgt attatcgaca acaagtcatt 4380 atttgaaacc ttgaaaacga caaatgtcac aaaggatttg agattgcgag ttgatatagc 4440 cagattgagg cagatggtgg aacaagatga gattactgtc aaatgggttg aaggaaaaca 4500 ccaattagcc gacttcttga caaaacatgg agcgtctcca aacaagttgc ttgaggtttt 4560 ggagacatcc aaaatatctc tacaataatt gattgtagtg attaccgttg tgtttttttt 4620 gatgatttta atttgtttgt catgttgact atgacaaaca gttgcagcat gcaatttgat 4680 attctacttg tgttggcttt aaaattgtgt tactggtggc aa 4722 // ID Mariner-16_HM repbase; DNA; INV; 1293 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-1293 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1950-1950 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 179..1228 FT /product="Mariner-16_HM_1p" FT /translation="MENIEHRAVIKFFVKKGLKPMEIHNEMVNVLGDDAPS FT KTTVCKWAAEFQRGRSSLEDDPRSGRPKSATTEETINAVHDMVMDDRRLTK FT REIAEAVGISDERVLHILHDELRLRKLLARWVPHSLTLDQKRTRVKLSQEH FT LARFQKNKTDFVRRFVTMDETWVYHYDPKLRQQTAEWTEPGCSAPKQVKAS FT KSSKKVMASVFWDAKGILLIDYLQTGKTITGEYYKNLLDQLDKKIREKRPG FT LLHKKIIYHQDNAPAHISDVAKAKLVELKYERLEHPPYSPDLAPSDFHLFP FT HLKKFLRGKRFSSHEEVIAAVNGYFEGLPESHFRDGIHELENRWKKCIALK FT GEYTEE*" XX SQ Sequence 1293 BP; 415 A; 236 C; 280 G; 362 T; 0 other; ctaggttggt tgaaaagttc gcggagcggc acctatgttg cagtagtaga aacaaattta 60 taactgtatt aaacgtattc attcttcgta tgaacgcata aaaaatttca agtcgatatg 120 tcatttcagt ctttgtttac aagctattga aattagacgt gtcatagtat tttttaaaat 180 ggaaaatatt gaacatcgtg ctgtaataaa attctttgtg aaaaaaggtt taaagcccat 240 ggaaattcat aatgaaatgg taaatgtgtt gggtgatgat gctccttcaa aaacaacagt 300 atgcaaatgg gctgcggagt ttcaacgtgg gcgttcaagc cttgaagatg accctcgctc 360 tggacgtcca aaaagtgcta ctactgaaga aacgattaat gctgtgcatg acatggttat 420 ggatgataga cgattgacca aacgtgaaat tgctgaggct gtgggcatct cagatgaacg 480 ggtattacac atcttgcacg atgaattacg tctgagaaag ctgttggcaa gatgggtgcc 540 gcattcgcta actctggatc agaaacgcac tcgagtaaaa ctttctcaag aacatttggc 600 ccgttttcaa aagaataaaa cagattttgt gcgtcgattc gtaaccatgg atgagacttg 660 ggtttaccac tatgatccga agctgcgaca acaaactgca gagtggacag aacccggttg 720 ttcagctccg aagcaagtga aggcgtcaaa atcgtcaaag aaggttatgg catcagtttt 780 ttgggatgcg aaaggaattt tattaataga ttaccttcaa accggtaaaa cgataacagg 840 tgaatattat aaaaatcttt tggaccagtt ggataaaaaa attcgtgaga aaagacctgg 900 tttgctgcat aaaaaaatca tttatcacca ggacaacgca cctgctcata tcagcgatgt 960 tgcaaaggcg aaattggtcg aattaaagta cgaacgcttg gaacatccac cctattcacc 1020 agatctggct ccgtccgact tccacctctt tccacatctc aaaaaatttc tacgtggaaa 1080 gcgtttctca tcacatgaag aagtcattgc agccgtaaat gggtattttg aaggccttcc 1140 ggaatcgcac ttcagagatg ggatacatga attggagaat cgttggaaaa agtgtattgc 1200 tcttaaggga gaatatactg aagaataaag tcattttttg aatcataaaa ttgttttttc 1260 atttccgctc cgcgaacttt tcaaccaacc tag 1293 // ID Hebe_Av repbase; DNA; INV; 5223 BP. XX AC . XX DT 02-JUN-2010 (Rel. 15.06, Created) DT 02-JUN-2010 (Rel. 15.06, Last updated, Version 2) XX DE Adineta vaga clone CopyA non-LTR retrotransposon Hebe. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; Hebe_Av. XX OS Adineta vaga OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Adinetida; Adinetidae; OC Adineta. XX RN [1] RP 1-5223 RA Gladyshev E.A. and Arkhipova I.R.; RT "A subtelomeric non-LTR retrotransposon Hebe in the bdelloid RT rotifer Adineta vaga is subject to inactivation by deletions but RT not 5' truncations."; RL Mobile DNA 1(1), 12-12 (2010). XX DR [1] (Consensus) XX CC GU176366 REGION: 661..5883. XX FH Key Location/Qualifiers FT CDS 121..1674 FT /product="Hebe_Av_1p" FT /translation="KNWFFIEILCTSSTQTRKQKQSTSNYQELVLILMPSK FT RGRGGGGQQRRYHSVDNQQRATTTQRKQGPQSSQSIQSPQRTTQDGKAGPY FT ALKSSQPTFSIAPLILEGVNLNKLQLNDILKQHLAEVNIHDIQLGRNGIFT FT LYARDVKSFNNILNDFSSILSSNGQPSATVYVPRSIQRIKDTEKIAFVKRV FT DLELPNDRITEALKNVGLEVTEVIRLTSKDGKTPTRTVKISFSDATNRNIF FT VQTGLQVDCMHFTAEPATQNSKPVQCYICLKYNHVAKYCKTKQQVCSRCGE FT NHSNDKCTVTDDAVKCYNCKGNHIATSKECSHYREQEKKMQNMVNQYATTS FT KQVTQAPSIYSTHDFPPLSQFNQTQKQLLTDNLFDDIINALTSKMETIIEK FT TTNRLFKALHQKIKKIEKSFGLLDRNNEEKYILTDSDSDSNEQSQAVKIIK FT PKPKKQQSEADKSSTTENVTSNKPTMPSTPTTSKDAQNTKEGSNTNKRALS FT SNNSPDASINDKKTLKTSSNDA" FT CDS 1670..4357 FT /product="Hebe_Av_2p" FT /translation="MLSLCHININSITKHKDELLARFSKYDIISVNETNLK FT SERPFTLFGYNIFRNDRIGQAGGGVLLAVKQHIKCQEVLNKTTCKNEAIAV FT EIRTKSFKSILISSIYVPPKAKIDINLFHELYNVNNNCIIMGDLNATLYNM FT GSQQANARGRQLQELFKDGLIDCVDDDSPTFEKNDYEVKLDWILASQPLLS FT FISNVETHPTIGALNGHKPLTFDIPCGAEPKPASSRILFNFKAAKWSKFRC FT KLDQQLMLWKNDHHLDSTADIEEYTSFITTSILEATKEAVPQTKQMIRTYT FT PSEVSISLIKQKHQAYRKWKKTGNNLDKHLYYNSKVLLTNSLRNDRRNNFN FT KLMSSLCHKKMYSDKVWLTVRKFHNKRIKQTYANIMKYNNTTATSDKEKAD FT LFADYFENEVYSHTADTLPLHDQVTRQANKVKSKTITSSNTPKWKQIKEKE FT VKNHIRQLRNSSTGPDNIHNRCLKNHSKLLVQHLTNLFNAILKQGYIPAMW FT KKANIILLLKPKKDKQQPSSYRPISLLSCLGKLLEKIIKQRLMLELERRNI FT LPQHQAGFRPGKSTIYNIVRLERYAAGQLRRPRRRRHSAVILFDIKAAFDS FT VWHDGLIYKLNDLRLPCFIINYLISFLSNRTAAIEIENILSRPFNLKSGTP FT QGSPLSPLLYIMYTADSMNGIPTHTEHGLFADDTALWTSGNTLTNLNTRLQ FT QSIDAFESWCKSWKLKLQPTKTELIHFSIHPRKHYKNPVKVKVEDTIIKPL FT DSTRYLGVIIDKRLKWRAHLEHIESKIAPRIGLLRYLSRTAYEPNNKTMIN FT IFKSIVRTVIIYGHPILLTADQKVWDRLQIMQNKAIRAALGLPIYTSVKYI FT HKISNIPKIKDYAIGLLKHTLQKATTNNDITLKNHLQCILDKI" XX SQ Sequence 5223 BP; 1962 A; 1114 C; 776 G; 1371 T; 0 other; cattcgtcta gttgacttgg tcgcgatcag tcgcattcgt ccaagtgaaa taaacatata 60 tatcttatct ctcgtctttc tactttttct ttcgaccagc taaaaagtcc taggacttaa 120 aaaaactggt ttttcatcga gatcctctgt acttcatcaa cccaaacaag aaaacaaaaa 180 caatctacat caaattatca agaactggtg ctgattttaa tgccttccaa gagaggaagg 240 ggtggcggtg gtcaacagag acgctatcac agcgtggaca accaacaaag agcaaccacc 300 acacaacgaa aacaagggcc acaaagctca caaagcatac aaagcccaca acgaacaaca 360 caagatggca aagcaggacc atatgcttta aaatcaagcc aaccaacatt tagtatcgcc 420 cctcttattc tagagggtgt taacttaaac aaattgcagc taaatgatat attgaagcaa 480 catcttgctg aagtcaatat ccatgacatc caacttggac gaaatggaat attcacatta 540 tatgcaagag atgttaaatc gttcaacaac atattaaacg acttttcatc aatattatcg 600 tcaaacggtc aaccatcagc cacggtatac gttccaagat ctatacaaag aatcaaggac 660 acagaaaaga tcgccttcgt aaaaagagtc gatctagaac taccaaacga tcgaataact 720 gaagcactaa agaacgttgg tcttgaagta acagaagtta ttcgattaac aagcaaagat 780 ggtaagactc caacacgaac agtcaagata tcatttagcg atgcaacaaa tcgaaatatc 840 tttgtgcaaa ctggtttaca agtggattgc atgcacttca ccgctgaacc agccacacaa 900 aattccaaac cggtgcaatg ttacatttgc ttaaaataca accatgtagc caaatactgc 960 aaaaccaagc aacaagtatg tagtcgatgt ggtgaaaatc atagcaacga caaatgtact 1020 gttacagatg acgcagtcaa gtgctacaac tgtaaaggta atcatattgc tacttccaaa 1080 gaatgttcac attatagaga acaagaaaag aagatgcaaa acatggttaa ccaatatgca 1140 acaacaagca aacaagtaac acaagcacca tcaatctaca gcacacatga cttcccacct 1200 ctatcacaat tcaaccaaac acaaaaacaa ctattaacgg acaatttatt cgacgatatc 1260 attaatgctc ttaccagcaa aatggaaacc atcatcgaaa aaactaccaa tcgactattc 1320 aaagcactac atcagaagat caagaagatc gaaaaatcat tcggactcct cgatcgtaat 1380 aatgaagaaa aatatatatt aactgattca gactctgatt ccaacgaaca aagtcaagca 1440 gttaagatca tcaaaccaaa accaaaaaaa caacagagtg aagctgacaa atcgtctaca 1500 acagagaatg tcacttcaaa taaacctact atgccatcaa ctcctactac gtcaaaagac 1560 gcacaaaata caaaagaagg atccaataca aacaagagag ccctttcttc caacaactct 1620 cctgatgcat ctataaacga caagaaaaca cttaaaacaa gcagcaacga tgcttagttt 1680 atgccatatc aacattaact caattaccaa acacaaagat gaactcctag ccagattctc 1740 caaatacgat attatctctg ttaatgaaac taatctaaag agcgaaagac cattcacgct 1800 ttttggttat aacatcttca gaaatgatcg aataggacaa gctgggggtg gagtattact 1860 agcagtgaaa caacatatca agtgtcaaga agtactaaac aaaacaacct gcaagaatga 1920 agcgatagca gtagagattc gaactaaatc attcaaatca atactaatat cctccattta 1980 cgtaccacca aaagcaaaga tcgatatcaa cttattccac gaactttata acgtcaacaa 2040 caactgcatt atcatgggtg atcttaatgc aacattatat aatatgggat cacaacaagc 2100 taatgctaga ggaagacagc tgcaagaatt atttaaagat ggtcttattg attgtgtcga 2160 cgatgatagt ccaactttcg aaaaaaatga ttatgaagtt aaactagatt ggattctagc 2220 aagtcaacca cttctttcat tcatatcaaa cgttgagact catccaacaa tcggtgcatt 2280 aaatggccat aaaccattaa catttgatat tccttgcgga gctgaaccca aaccggcttc 2340 gtcaagaatt ttatttaatt tcaaagcagc aaaatggtca aaatttaggt gcaagttaga 2400 tcaacaactg atgctgtgga aaaatgatca tcatttagat tcaacagcag acatagaaga 2460 atatacatca ttcattacca ctagtatact agaagcaaca aaagaagccg ttccacaaac 2520 aaagcagatg atccgaacgt atacaccaag tgaagtatcg ataagcctga taaaacaaaa 2580 acatcaagca tatcgaaaat ggaagaagac tggaaacaac ttagataaac atctatatta 2640 caattccaaa gtcttgctta caaattcact tagaaacgac agaagaaata acttcaacaa 2700 gttaatgtca tctttatgcc ataagaaaat gtattcggac aaagtttggc tgacggtgcg 2760 caagttccac aacaaaagga tcaagcaaac ctacgccaac atcatgaaat acaacaatac 2820 aactgcaaca tcagacaagg agaaagcaga cttatttgca gattacttcg aaaatgaagt 2880 ctactctcac actgctgata cgttgccact tcatgatcaa gtaacacgcc aagcaaacaa 2940 agtcaaaagt aaaactataa cctcttcaaa cacaccaaag tggaagcaaa tcaaagaaaa 3000 agaagtcaaa aatcatataa gacaacttag aaatagctcc actggtccag acaacattca 3060 taaccgatgt ttgaagaatc attcaaaatt actggtacag catctaacaa atctgtttaa 3120 tgcaattttg aaacaaggtt atattccagc aatgtggaaa aaggctaata ttattcttct 3180 attaaagcca aagaaagaca aacaacaacc gtctagctat cgaccgatta gtctccttag 3240 ttgcttaggc aaactattgg agaaaataat caaacaacgt ttaatgctcg aacttgaacg 3300 acgaaacatc ttaccacaac atcaagccgg atttagacca ggtaaaagca ctatatacaa 3360 catcgtgcga ttagaaagat acgctgcagg acaacttaga cgaccacgtc gacgacgtca 3420 ctcagctgtc attcttttcg acatcaaagc cgcatttgat tcggtatggc acgatggttt 3480 gatatacaag cttaatgatc tacgtcttcc ctgtttcatc atcaactatc tgatctcatt 3540 cttaagcaat agaacggctg caattgaaat cgaaaatata ttatcacgtc cgtttaatct 3600 gaagagcggt acaccacaag gatctccgtt atcaccatta ctttatatta tgtatacggc 3660 agattcgatg aatggaatac caactcatac agaacatggg ttgttcgccg acgacactgc 3720 attatggaca tctggcaaca ccttaacaaa tcttaacacc agattacaac aatccataga 3780 tgcattcgaa agttggtgca agtcatggaa attaaaactt caaccaacta agacagagct 3840 gatacacttc agcatacatc caagaaaaca ctacaagaat ccagtgaaag ttaaagttga 3900 agacaccatc atcaaaccac tggactccac acgttacttg ggtgtaatta tagataaaag 3960 attaaaatgg cgagcacatc tcgaacatat tgaaagcaag attgctcctc ggatcggtct 4020 acttcgatat ctttcgagaa cagcatatga acccaacaac aagacgatga taaatatttt 4080 caaatcaatt gtacgcacag ttatcattta tggacatcct attctcctta ctgctgatca 4140 aaaagtatgg gaccgtttac aaatcatgca gaataaagca attagagctg cactcggact 4200 acccatatat acatctgtca aatatatcca caagatcagc aacattccaa aaatcaaaga 4260 ttatgcaata ggactactta aacatacact tcaaaaggca acaacaaaca acgacattac 4320 actgaagaat catctacaat gcatcttaga caaaatttaa aaaagaaatc accaaccatt 4380 acatataata gtagcatcga gagaatagac tattcttgtg tcagtcgaat cgatgtaaaa 4440 aacgatatat cgcaatcaaa tatactattc agctactttg tttaactcgt tcttatacaa 4500 aataagaagt cgattcaatc tatatgctgt atcgttcttt cgttctattc agttatttat 4560 tgatcgcatt cttatatata atataatatc gagccaatct ttaaaccgat tcatgtatac 4620 agcaaattct tttactcgtt cttatataca atatgatatc gagtcaacgt ctttgctgat 4680 ccttacaata tccagtaaat tccttgctcg ttcttatata caatatgata tcgagtcaac 4740 gtctttgctg atccttacaa tatccagtaa attccttgct cgttcttata tacaatatga 4800 tatcgagtca acgtctttgc tgatccttac aatatccagt aaattctttg ctcgttctta 4860 tatgcaatat gaaatcgagt caatatattt gctgatttaa aacatgtcac atacaatcca 4920 gcttcttttc tcttcctctc caatctatcc atatagtacc actatatctt ctgttttctg 4980 ttttctttct ctttttcttc taccacatat gtaagacttt acacgcgtca ccccatctac 5040 actcactatt ggcatataaa atttttctcg ttcttattac tatgtcacgg acaaaagaca 5100 tacgtcatgt gtcctttttt ttttctctac tccagcttat gtagaagtac aataccttcc 5160 gttgacttta tgttacagtg ttgttctttt ttacaataaa gctttgttac caaaaaaaaa 5220 aaa 5223 // ID Mariner-5_BM repbase; DNA; INV; 1316 BP. XX AC . XX DT 26-APR-2010 (Rel. 15.07, Created) DT 26-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-5_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1316 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 940-940 (2010). XX DR [1] (Consensus) XX CC >93% identical to consensus. XX FH Key Location/Qualifiers FT CDS 154..1191 FT /product="Mariner-5_BM_1p" FT /translation="MEWGVKENRIAVIALHKVGMEPSVIFQTLQKLGISRM FT FVYRTINRYNNTSSVEDQKRSGRPRAVRTTKAVNAVKARIRRNPIRKQKIL FT SREMKIPARTLSRIIKQDLKLGAYRRYTGHALNQSLQLKRVDRSKRLLSRY FT AGERHRNILFTDEKIFTIEEHYNKQNDKVYAHSSKEAAQVVGKVQRGHHPA FT SVMVWWGVSYQGVTKLHFCEKGVKTSAKVYQDTVLDHVVKPLSNTLFKNIP FT WTFQQDSAPGHKARTTQAWLETNVPDFIRAEDWPSSSPDLNPLDFKLWSVL FT EDMACSKRHGDIESLKKSLEQAVAKFPLETVRKSIDSWPNRLKACIKAKGG FT HFE" XX SQ Sequence 1316 BP; 423 A; 256 C; 260 G; 377 T; 0 other; taactagtct ggccataaat actgttacat aaaaactttt tttttttcaa ctttttaatt 60 attttgaatt tggaacacgt atattatttt aaaaacttct ttcgattttg cgccatttca 120 catatttttt cgtgttcatt atcaaattac aatatggaat ggggtgttaa agaaaatagg 180 attgcagtga tcgccttgca caaagtgggt atggagccgt cggtcatatt ccagacactt 240 caaaagcttg gtatcagccg tatgttcgta tatcgtacta tcaacagata caacaacact 300 tcgtctgtcg aagaccagaa aagatcgggg cggccgcgtg ctgttagaac tacaaaggct 360 gttaatgctg ttaaagccag aattcgtcga aaccccatta ggaagcaaaa aatcttatcg 420 cgagaaatga agattcccgc taggactctg tcgcgtatta taaaacaaga cctgaagctc 480 ggtgcttatc gtcgatatac aggacatgcc ctaaatcaat ctttacaatt aaagagagtg 540 gatcgatcaa aacgccttct gtcgcgatac gcaggtgaaa ggcacagaaa tatcctcttt 600 accgatgaaa aaattttcac gattgaagaa cactacaata agcaaaatga taaagtgtac 660 gctcacagtt ctaaagaagc tgctcaagtg gtcggaaagg tacaacgtgg tcatcatcct 720 gcgtcagtga tggtttggtg gggcgtgtct tatcaaggag tcacaaaact acatttttgt 780 gaaaaaggag tgaaaacttc agccaaagtg tatcaagata cagtcttgga tcatgttgtg 840 aaacctctca gcaatacact ttttaaaaat ataccgtgga ctttccagca ggactctgca 900 cctggtcaca aggcacgaac tacccaagcc tggcttgaaa ccaacgttcc ggactttata 960 agagctgaag actggccctc atctagccca gacctcaacc ctttagactt caaattatgg 1020 tcagttttag aggacatggc ctgctctaaa cgacacggcg acattgagtc tcttaaaaaa 1080 tctttggagc aagcagtggc gaaatttccc ttggaaacag tgcgtaaatc catagattcg 1140 tggccaaaca gattaaaggc ctgtataaaa gccaaaggtg gccatttcga atagaatatt 1200 tttttttttt gctgagactt taataaatat gtaggtataa attttaataa tattacatta 1260 cttcattaaa aaaaatgcat tttcatttgt aacagaactt atggctggac taggta 1316 // ID Gypsy-84_CQ-LTR repbase; DNA; INV; 230 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-84_CQ_; KW Gypsy-84_CQ-I; Gypsy-84_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-230 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 548-548 (2011). XX DR [2] (Consensus) XX SQ Sequence 230 BP; 81 A; 32 C; 56 G; 61 T; 0 other; tgtaatgagc gctaacaacc ctggagtgga tatgggagaa ggtgaggaag aggatgggaa 60 gattgaatta gttggtaagt gaagccaatc gctatcagaa gctcgaagaa ttaaatcggc 120 taataaattt gtttgaaatc gcgcaataaa gagtttttcc taaaatgtgt gattcgctaa 180 tgatttgctc cgaatactcg ctaaaacagt gctaaaacaa gtaatttaca 230 // ID Mariner-6_SM repbase; DNA; INV; 2403 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-6_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2403 RA Jurka J., Bao W. and Tempel S.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 150-150 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(561..1289,1293..1721) FT /product="Mariner-6_SM_1p" FT /translation="MRKCNISKNYIRSDNNKIMENNEDRIVLYSNIEENTE FT NNIETVILNPAVNAPRRRNTTVPDDDRRRIVDSYNAGSTANEISRIMQYKR FT TTILAIIKKYQETGLTTANRRGLITEPKLSDEQKETIRVWIDEDCTISLRS FT ICARIHTEMAITITKSSIARYIENFNYSLKRVLLIPVRRNDENTIAVRKAY FT AIRFFGLQAQFADSQIIFIDEVGFKVSTRVSRGRSLVGTPAVAIVPTIRSR FT NIICCAMTRAGIVHYQTQTTPFNTACFMAFINGLIVRLRETNVQSAIFIMD FT NVAFHRTNAVRDLIIAEGFSYDFLPPYSPFLNPIENMFSKWKEHTKRTNPN FT NEEELMTGIENGRRIITNNDCDGYFRNMIRYATRAIAEEVITD" XX SQ Sequence 2403 BP; 898 A; 357 C; 381 G; 767 T; 0 other; tacagggttt cccaccctat tttgacattg tcctacccca atttgacatt gtcccatagt 60 attttgacat tgtcacatcg taatttgaca ttgtcccacc ccaatttgac aaattaaaac 120 aaacgatttt agaatatgaa aaaaactatg ataacgcaag aaaatgtcaa tcattcgcgt 180 gttatgcata gtttgcattt taaacataaa acgtaggttt aatataattt gctaactaat 240 aatgtcttgt aagaacataa aaattacgca tttgattgtc atttaaccaa atttaatgac 300 ctatgtatga cttttaatag ttactcgtgt tacattttgt tcataaaaaa atcattttcg 360 tctaaaaata aaatcataaa taactgttat agatccatgc taatcgtata aagtaaaaaa 420 agtcgcatat gatcgtaaaa tacatggcat ttcgtacata gtgtctatag cattttatga 480 aaatagatat aataatatgt taagaaatgt catttaaatc gaatgttata atcattatct 540 tttgatttat aagaaaccca atgcggaaat gtaatataag caaaaactat ataagaagcg 600 acaataacaa gatcatggag aataatgaag atcggattgt attatattca aatattgaag 660 aaaatacaga aaacaatatc gaaacagtta tcctaaatcc cgcagtcaac gctcctcgac 720 gccgaaatac aacagttcct gatgatgaca gaagaagaat cgtcgattct tataatgcag 780 gaagtactgc caacgaaata agccggataa tgcaatataa gagaaccact atcctggcaa 840 taataaaaaa atatcaagaa acaggcttaa caactgcaaa taggagagga ttaataacag 900 agcctaaact ttctgatgaa cagaaagaaa ctataagggt atggattgac gaagattgca 960 ccatatcatt aaggagtata tgtgccagga tacacacgga aatggcaata acgatcacta 1020 aatcatcaat tgcaaggtat attgaaaatt tcaactactc tttgaaaagg gttttattaa 1080 taccagtgcg ccgaaatgat gaaaatacca ttgctgtgag aaaggcttat gcaataagat 1140 ttttcggatt acaagctcag tttgcagatt cgcagatcat atttatcgac gaggttggat 1200 ttaaggtgag tacaagggtg agtcgaggaa gatcattagt aggcacacca gcagtagcaa 1260 ttgtaccaac aattcgatca agaaacatat aaatatgctg tgcaatgacc cgggctggta 1320 tagtccatta ccaaacacaa acaacaccat ttaacaccgc ctgtttcatg gcatttatca 1380 atggactaat agtaagattg cgggaaacaa atgtgcaatc ggctatattc attatggata 1440 atgtagcttt tcataggaca aatgcagtaa gagatctaat aatagcagaa ggtttttcat 1500 atgatttctt acctccatat tcaccctttc ttaatccgat tgaaaatatg ttcagtaaat 1560 ggaaggaaca tacaaaaaga acaaatccta ataacgaaga agaattgatg acagggatcg 1620 aaaatggtag aagaataatt accaataatg attgcgatgg ctactttcgg aatatgataa 1680 gatatgctac acgtgccatt gctgaggaag tcatcacgga ttgaagaaat aaatatcatg 1740 acttattttt tattttacat tttgtaattt tataattatt tcttaatatt ttttaaaaga 1800 tttagtaatt ggttctttat taatttttca tttttatgat ctgtaaaaaa ctgtcatttt 1860 ttaagagcgt ttttccttta aaatacgtga tatgtcaaga gtattttttt ttgatcacac 1920 acatttatgt ttttattttt tgactacacg accctaaaaa ccttaaaatg ttttcgcgct 1980 aaatgcgcgg tttacattag catcagggag tatatcttga accttaaaat ttatctatgt 2040 tagaagtaat ctgtacagga tattcagtta aaaaaatcaa acttatctat gtgtttttta 2100 tcaataaaaa gtcataccac taaaactgaa ataatatcaa attttaatta tactcaaata 2160 attactttgt atttttttgc cgaatatcta atttcttata aataaaatta atagaattgt 2220 ttgaaatatt ttagaagata acaactatgt ttgttaatga ttgctttgta aaataacatc 2280 ctatcaataa cttttcttac aaaaatatat tgtattttgg gtactaaatg tgccgtgtca 2340 aattgggatg gacaatgtca aatttgtgta ggacaatgtc aaaaaaggat gggaaaccct 2400 gta 2403 // ID KAMIKAZE_BM-LTR repbase; DNA; INV; 374 BP. XX AC . XX DT 25-APR-2010 (Rel. 15.04, Created) DT 25-APR-2010 (Rel. 15.04, Last updated, Version 2) XX DE Bombyx mori Pao-like retrotransposon Kamikaze DNA (long terminal DE repeat). XX KW BEL; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; protease; integrase; RNase H; gag domain; KW pao-like retrotransposon; KAMIKAZE_BM; KAMIKAZE_BM-I; KW KAMIKAZE_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Abe H., Ohbayashi F., Sugasaki T., Kanehara M., Terada T., RA Shimada T., Kawai S., Mita K., Kanamori Y. et al.; RT "Two novel Pao-like retrotransposons (Kamikaze and Yamato) from RT the silkworm species Bombyx mori and B. mandarina: common RT structural features of Pao-like elements."; RL Mol. Genet. Genomics 265(2), 375-385 (2001). XX RN [2] RP 1-374 RA Jurka J.; RT "Consensus of long terminal repeat."; RL Direct Submission to Repbase Update (25-APR-2010). XX DR [1] (Consensus) XX SQ Sequence 374 BP; 81 A; 112 C; 89 G; 92 T; 0 other; tgttcgcgcg acagcgcgac gtttgtaaat gttcggctca actcgggtgc cgtcgcccgt 60 acgcaccccg cacctctcat aacagtgcgg tacatgacgg cacgcggaga tacgctcgtt 120 cgatattgag taaattgtgc gcgcgcgcct ttcgttatat actaagtaaa aggttgagtg 180 tgaaccggtg aaaaggtgta tgtgccaccc atcgcagagg aacaccacta cgaccgtgat 240 agtgccagca tcagcctcca agtaagtccc gttttcctac ctaccctttc gtgtccgctc 300 attctccgca gagttcatat gctccctttc agctcatcag ctcacgttta aggtgacgct 360 cacctgcgtc acca 374 // ID BEL-203_AA-LTR repbase; DNA; INV; 377 BP. XX AC AAGE02027244; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-203_AA_; KW BEL-203_AA-I; BEL-203_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-377 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027244; Positions 116140 116516. XX SQ Sequence 377 BP; 106 A; 70 C; 79 G; 122 T; 0 other; tgttgccaat acccctcgtt acgatggata ggtagtgaaa cgacttacca ttccctcgga 60 cgaacctgta tagaaacgtg gaatgataaa ctgcgaatga catttgtccg ttatctgcta 120 tccgtgagtt tatggttagt cagcactatc agatcgttct tcaaattata tttcgaaaat 180 taaggtatct gctttcgtaa aaagtccgtc agttttcgta gaattatagt tgagcattca 240 agatttctgt ggattgatcg tcgttaaatc cctttgcaaa gattgccagt catattatac 300 gcagaagaac actgaattac cttgtagctg cttgatagcg acttacaaga cggtgttttg 360 tttggcaatt gggaaca 377 // ID CR1-79_HM repbase; DNA; INV; 4883 BP. XX AC . XX DT 07-JAN-2009 (Rel. 14.02, Created) DT 07-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-79_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4883 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 366-366 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 84..770 FT /product="CR1-79_HM_1p" FT /translation="QQKKYILKETDKLLKNQEATFTNIISSNLKIITDRLD FT KLESEINTNKSKTSTLEKNFRDIKETFNFQEENFTNKLAQMSQKCNDEISI FT LKKKTIDLENRSRRNNLRIDGVSESNSETWNDCVSIVKNIFKNQLGIAEDI FT VVERAHRVGQVKEDKPRTIILKLLNFHEKSKILSLAKKLKGSGVFINEDYA FT NETLEKRKKLWEEIKRHRKEGKFAIIKYDKIFVREFRK*" FT CDS 821..3907 FT /product="CR1-79_HM_2p" FT /translation="MAYKTIDFETKQLNFFETDNNILLNDFYDADKQVFKE FT LDFNLQYYDTETFKTLLKTYIKNFTVIHINIRSLSLNIDKLKHFLLESNFL FT FNMICLTESWCDDESAQKNSNFQIPNYKIISSERKTLKRGGGILVYIRNDH FT DIKIRKDLSISNSDSEVLTIEIIIKNTKNVVVSTCYRPPDGDVIQFSNYMQ FT QILIKSNNEQKRLFCIGDINIDSLKYNEHPNTKIFFDNIFQHGIIPIINKP FT TRVTTTSISAIDNILTNSFLDKSIKTGIIKTDISDHFPIFFSFAEHATIND FT NSKIKVLKRKMNEHSMLKFKESLSTINWINVYQECNLGHTNSAYNLFLDIF FT LEYYNKHFPIQEIEVKTKYLKCPWITRGIRKSSKKKQKLYIKYLKNRNEAN FT LNAYKKYKNLFEKIKNKSKKRYYSQQLKNNNGNIKKTWEVMKEIIGKNKIV FT SDILPSRITENDIEYKNKKDISEIFNHFFVNTGLNLASKIKGSETSFQNYF FT SDFESSLPDSNLTFDELETAIKSIKKNKAPGIDDISGNVVLYIFSVIRKPI FT FNIFSSSIKNGIVPDKLKIAKIVPIFKGGDTSAISNYRPISILPIFSKLLE FT RIIYNRLYKYLIEHKILSENQYGFQTQHSTEHAILDLINNITTSFNDRKFV FT LGVFIDLTKAFDTVDHTILLRKMEKYGIKNQTIKWFANYLDNRQQCVTIDH FT YNNSKQLKIKCGVPQGSILAPLLFLIYVNDLPKVSSKLDCIIFADDTNLFY FT SSSSISDLFITANLELIKLNTWFRANKLSLNLNKTKYILFHPNQKKKAIPS FT ILPTLSIDNTIIERTQSSKFLGVLLDEKLSWNSHINTINTKVSKNIGLLYK FT ARNYLSFENLKLLYFSFIHSYLSYANIAWASTHKTKLSSLYRRQKLASRII FT FYKNKLTHANPLLKQMNALNIYQINIYQNILFMFKHKMGIIPSRFMNNFFK FT TNINKFSTRATGNYIVPLKKRKLSQFSIAYRGPYLYNKLIPKNSSISLLKN FT INSLKSYLKKLVLNTSSYMEIF*" XX SQ Sequence 4883 BP; 2004 A; 689 C; 606 G; 1578 T; 6 other; atcaaactat cttcaaaaat tttgcaacac tcattttttt aaaaaaatgg aagttacaat 60 gaaaaatatt gaacgactta taacaacaaa aaaaatatat tctcaaagaa actgacaaac 120 ttctaaaaaa ccaagaagca acatttacaa atattataag ctcaaatctg aaaattatca 180 cggatagact tgataaactc gaaagcgaaa ttaacaccaa caaatcaaaa acttcaacct 240 tagaaaaaaa ctttagagat atcaaagaaa cctttaactt ccaagaggaa aacttcacaa 300 ayaaactagc acaaatgagc caaaagtgta aygatgaaat tagtatayta aaaaagaaga 360 ctatagattt ggaaaacaga tcacgtcgta acaatctaag aattgacgga gtatccgaat 420 ctaactctga aacatggaat gactgtgtta gcatagtaaa aaatatcttt aaaaaccaat 480 tgggaattgc agaggacata gtagttgaga gagcccatcg agtaggacaa gtaaaagaag 540 acaagccgcg aacgataatt ctcaaacttt taaattttca tgaaaaaagc aaaatattat 600 ctttagcaaa aaaacttaag ggcagtggag tattcataaa tgaagattac gcaaatgaaa 660 ccttagaaaa acgaaaaaag ctttgggaag aaataaaaag acaccgaaaa gaaggtaaat 720 ttgcaattat caaatatgat aaaatttttg tacgagagtt tcggaaataa gttaaatgct 780 taacctcaat tttataaaaa taaactctta atcatgcgca atggcttaca aaacaattga 840 ctttgaaacc aaacaattaa atttttttga aactgataac aacattttac ttaacgactt 900 ttatgacgca gataaacagg tttttaaaga attagatttt aatttgcaat actatgacac 960 tgaaactttt aaaacgctat taaaaacata tataaaaaat ttcacggtga tacacatcaa 1020 tattagaagt ttaagtttaa atatagacaa actaaaacac tttcttttag aatccaattt 1080 tttatttaac atgatttgtc tcacagaatc atggtgtgat gatgagtcag ctcaaaaaaa 1140 ttcaaatttt cagattccta attataaaat aatttcatct gagagaaaaa cgctcaaaag 1200 gggaggcgga attttagttt acattcgtaa tgaccacgat atcaaaatta gaaaagacct 1260 ctcgatctct aattcagaca gtgaggtcct aacaattgag ataataatta aaaatacaaa 1320 aaacgttgta gtttctactt gttatagacc acccgacggc gatgttattc agttttcaaa 1380 ttatatgcaa caaattttaa tcaaaagcaa taatgaacaa aaaagattat tctgcatcgg 1440 tgacatcaac atagatagtt taaagtacaa tgaacaccct aatactaaaa tcttttttga 1500 caatatattt caacacggca taatcccaat cattaacaaa ccaactcgtg ttacaacaac 1560 ttccattagt gctatagata atattctaac taactctttt ctagataagt ctattaaaac 1620 aggaataatt aaaactgata tttcggatca ttttcctatt ttcttttctt ttgctgaaca 1680 tgcaaccata aatgataatt ctaaaatcaa agtcttgaaa cgaaaaatga atgaacattc 1740 tatgcttaaa tttaaagaat ctttatcaac tataaattgg ataaatgtat accaagaatg 1800 taaccttgga cacacaaatt ccgcttataa tctatttctc gacatctttc ttgaatatta 1860 taacaaacat tttccaatac aagaaataga agtcaaaaca aaatatttaa aatgtccttg 1920 gattacacgc ggaataagaa aatcttcaaa aaaaaagcaa aaattataca tcaaatattt 1980 aaaaaacaga aacgaagcta atcttaatgc ctataaaaaa tataaaaatc tgtttgaaaa 2040 aataaaaaat aaatctaaaa aaagatatta ttctcaacaa ttaaaaaata ataatggtaa 2100 cataaagaaa acgtgggaag ttatgaaaga aatcattggt aaaaataaaa tagtatctga 2160 cattttaccc tccagaataa cggaaaacga tattgaatac aaaaacaaaa aagatatatc 2220 agaaatattt aatcattttt ttgtaaacac tggtctaaac ctggcatcaa aaataaaagg 2280 ctcagaaaca tcgtttcaaa attattttag cgattttgaa agctcattgc cagatagtaa 2340 tttaactttt gatgaactag aaacagcaat caaatcaatt aaaaagaata aggccccagg 2400 tattgatgac atctctggca atgttgtgct ttatattttt tctgtaataa gaaagcctat 2460 ttttaatatc ttcagctctt caatcaaaaa cggaattgtc ccagacaaat taaaaatagc 2520 aaaaattgta cctatattta aaggtggaga tacatctgca attagcaatt atagaccaat 2580 ctcaattctt cctatatttt ctaaactcct tgaaagaata atttacaaca gactgtataa 2640 atatttaata gaacacaaaa tcttaagtga aaatcaatac gggtttcaaa cgcaacattc 2700 aacagaacat gctatcctag atctcattaa taacattaca acttccttta atgatagaaa 2760 atttgtatta ggagtcttta ttgatcttac aaaggcgttt gacacagtag accacacaat 2820 actactaaga aaaatggaaa aatatggcat aaaaaatcaa actataaaat ggttcgccaa 2880 ctacctagac aacagacaac aatgtgttac catagatcac tataacaact cgaaacaact 2940 aaaaatcaaa tgtggggttc ctcaaggatc tattctagct cctcttctat ttttaattta 3000 tgtcaacgac ctccctaaag tctcatctaa gttagattgc ataatttttg cagacgacac 3060 taatctgttt tattcttcta gctcaatcag tgacctcttt ataaccgcaa acttagaact 3120 tattaaactt aacacatggt tcagagctaa caaattgtca ttaaacttaa acaaaactaa 3180 atacatctta tttcacccca accaaaaaaa aaaggctatc ccatcaattt taccaactct 3240 tagtatagac aacacaatta ttgaacgaac tcaatcatct aaatttcttg gtgtgcttct 3300 tgacgagaaa ttatcttgga actctcacat aaataccatt aatacaaaag tctcaaaaaa 3360 cattggtcta ctctataaag caagaaatta tttatcgttc gaaaatttaa aacttcttta 3420 tttttccttt atacacagct atctatcata tgccaatata gcatgggcaa gtacccataa 3480 aactaaatta agttcactct atagacgaca gaaactcgcc tcaagaatta ttttttataa 3540 aaacaaactt actcacgcaa acccattact aaaacagatg aatgctttaa acatttacca 3600 aattaatata tatcaaaata ttttgtttat gttcaaacat aaaatgggaa ttattccaag 3660 tcgatttatg aataactttt ttaaaactaa tataaacaaa tttagcacta gagcaactgg 3720 aaactatata gtacccctta aaaaaagaaa actgtctcaa ttctcaatag catatcgtgg 3780 tccttacttg tataacaaat taataccaaa aaattcctca atatctctcc tgaaaaatat 3840 aaattcctta aaatcttatc ttaaaaaact tgtcttaaat acaagcagtt atatggaaat 3900 tttctaataa cttatgagta attaaaatat caaaatcatg tacaaaaaaa aaaaaaaaaa 3960 aaagtatgaa tatacatata tacatgtatg tgtatgtgta tgtgtgtatg tatatatata 4020 tgtatgtatg tatgtatgta tgtatgtgtg tatgtaaata tgtatgaata tatatatata 4080 tatatatgta tatatatrta tatatatatg tatatatrta tatatatata tatatatatg 4140 tgtgtgtgta tgtgtatatg tgtatatgta tgtatgtgta tatgtatgta tgtrtatata 4200 tatatatata tgtatgtgta tgtgtgtaat tatgtgtatg tgtatgtatg tatgtgtata 4260 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4320 tgtatgtgtg tgtatgcgta tgtgtgtatg tgtgtgtgta tgtatatatg tatatatata 4380 tatatatgta tatatatatg tatatgtgta tatgtatata tatgtgtgtg tatgtgtgtg 4440 tatgtatgtg tatatatata tgtatatatg ttatgtttat gttatgctcc ttcctaaaaa 4500 cacgactaat tatattgaaa aatgtaataa gtaatgaata aaatatttat gagtatctca 4560 tgaaattgtt taaaaaaaag ggttctcgat gacaagactt tacacagttt tctgcgagtt 4620 tcccttgact acataactaa aggtttaatt tttccttttt taagactttt tttgtctttt 4680 caaaatataa actgaaaaac tattttacat tattttatta ctttgtacat caaatatatc 4740 ttagtattat ttcagttatt ttcaaagcat tttgactgta tttgtaataa aagtatatat 4800 tgttaaaact atttgtaata aaagtatgta ttgttaaagc tatgtagttg taaaaaatat 4860 atggaaaaaa aaaaaaaaaa aaa 4883 // ID Troyka-2-I_BF repbase; DNA; INV; 5117 BP. XX AC . XX DT 29-APR-2008 (Rel. 13.04, Created) DT 29-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE Internal portion of the amphioxus Troyka-2_BF autonomous LTR DE retrotransposon - a conceptual consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Troyka-2-I_BF; KW Troyka-2-LTR_BF; Troyka-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5117 RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., RA Salamov A., Terry A., Shapiro H., Lindquist E. et al.; RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire RT and genomic organization."; RL Science 317(5834), 86-94 (2007). XX RN [2] RP 1-5117 RA Kapitonov V.V. and Jurka J.; RT "Troyka - a distinctive group of gypsy-like LTR retrotransposons RT inducing 3-bp target-site duplications."; RL Repbase Reports 8(4), 514-514 (2008). XX DR [2] (Consensus) XX CC Troyka is a distinctive group of gypsy LTR retrotransposons. CC Troyka retrotransposons differ from canonical gypsy and other LTR CC retrotransposons by the length of target site duplications (3-bp CC in Troyka) and by the unusual 5'-CG and CG-3' termini of LTRs. CC The ~1600-aa polyprotein encoded by Troyka elements is composed CC of the aspartyl protease, reverse transcriptase, and integrase CC domains. The polyproteins encoded by different families of Troyka CC present in the cnidarian and amphioxus genomes are ~50% identical CC to each other. However, there proteins are only 20-30% identical CC to the pro, RT and INT domains encoded by canonical gypsy LTR CC retrotransposons. CC Troyka-2_BF is a very young family of Troyka LTR retrotransposons CC present in the amphioxus genome. Some copies of Troyka-1_BF are CC flanked by 99% identical LTRs. The internal portion is not CC completely reconstructed. XX SQ Sequence 5117 BP; 1322 A; 1539 C; 1058 G; 1128 T; 70 other; tggtagcagc gataggattc gaactaccca ccaagacaaa aaattgccgg gatacgctcc 60 ttaaacgacc gtgatggcat cgaatcgcac tcggaaacag tggcctctga caacatcgga 120 aaccataaca tcattcgaat catggaaaaa caacctggtc tacggcctat cgcttgatcc 180 caactttgct ccctttctcg cccccggcgc cacgtggcaa aagcagagcc gaggggtgga 240 aaatcgcggc ttagtggctg acggtgaaga cgtcccggaa gctcagagga aaacagcagc 300 acagaaggcc gccattttgg acatgctcct tggccagatc gccaatttct gccctatcat 360 cgccagatct cgcatctgca aggcaagtgt gtctctgaac gacatatggc aaactattcg 420 cttacacttt gggtttcagt ccacgggggg atatttcctc gacatcgcta atattaagct 480 ggaaccaaac gagaaaccgg aggatttgta ccaacgatta tgtgccgctg tagaagatag 540 catgctcact gcggcagggc ctatcactca ccacggggag gctatgacgg ctgacgaaga 600 gctcacaccg tctctcgaaa acttcattgt cttaacatgg ctcaaactaa tccatcctga 660 cctcccagcg ctcatcaagc aacggtatgg ccctgagctt cgaaacaaga ctctagcctc 720 cattaagcca gaaatatcct tggccttgtc gtctctcttg gacgaacttc gtagcacgga 780 agacatccgc actctccgtc tcactcagaa ccgccagccg taccggaagt acccatccac 840 ccgcccccaa ggcgccaaga ccaagttcaa gactaactct aacaccaaga gtagcccagc 900 tacgtgtcca ctttgtgaac aggtaggcag accagcattt gaccactatc taagtgtctg 960 tccgtatctc cctgcaaagg ataagaagta catggtgcgt actcgtgcag tagagataga 1020 tgatgtgtat gacgttgatg atgacatcca ctctgatgaa gaggaggaac ccagtatttg 1080 tcaggccaaa accatttctc ctgtaccaga gccatctcct catgtgtcac gtgttgctat 1140 ccgtgcatcc ccacatctag atgcctttca tcatggacaa actgtacgca ttttgttgga 1200 ttctggtgca gagtctagca tgatacgagc ggatgaggcg aagcgactcg gcttgcgcat 1260 caacccaaac acaacacaga ccccaagcca agcagatgga ggtcgtatgt caggtatgct 1320 aggggaatgc acgacaacat tttaccggca gaaactacca cttcgctttg aagcccttgt 1380 tgtcacagat cttgcctctc cgatcattgc aggcagccct ttcctggaag caaacgggtt 1440 cactattgac ttcactcaac gccagatccg cctcccagat ggatccatct ccagttacac 1500 tccatccgtc aaaagacaca acccgcacat ccaccggatc tcccacaacc tgattcgcat 1560 gcaggacaag tcaaccaccc tgtggccctc tgacttcttg gaactccaac tacccgagga 1620 cctgaaagac taccctgatg tcgcccttga accccgtctt gattctcttt ccacccagca 1680 ggcaaaggcg ttccaccagg gtacgtacca caatattgca ggctacataa gagtcccaaa 1740 catctcgtgt gcacctgtcc annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1800 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngctcagacg gtggtggagt atttgaatcc 1860 atcatttctc gtcaataaac caaatggcgg acatcgcctc gtcacagcat tcaccgacgt 1920 ggcaaaatat tgcaagcctc aaccatcgct cctgccaaac gttaacatgg tcatgcgcca 1980 gatagcgtgt tggaaataca tcatagtctc cgatcttacc tcggcatacc atcaaatccc 2040 tctacacccg aactctcgca aatattgcgg cattgtcacc ccgtataagg gcatcagagt 2100 gtactgccgc agtgccatgg gaatgccagg cagtgagaca gcactggagg agctgatgtc 2160 acgcgtctta ggtgatctgt tgatgcgtgg aatcgtcgcc aaaattgccg atgatttgta 2220 ctgcggagga aacacatttc aagaactcct cgccaactgg aaggaagtcc taactgccct 2280 ccaggccagc gatttacgcc tttccccgtc caaaaccatc atctgtccag caactaccac 2340 gattctagga tggcactgga accagggcca gatatcagcc tctcaacaca cattatgcac 2400 gctggctaca tgctctccac ccccatccat cacagcccta cgctccttca ttgggtctta 2460 caaggccctg tctcgcgtca ttccaggcac atcggtactc ttgggaccgt tagacaacct 2520 ggtcgcaggc cgatcatccg gcgatcccat ccaatggtca gatgagatgc ttgaggtatt 2580 ccgcaaggct cagcggggct tggatcagca tcaagcggtt gttctcccta gaccggatga 2640 cgaactctgg ttagcaaccg atgccgctgc ccgcccaact ggcataggcg ccacgctctt 2700 ccttcgtcgc aacaacaaca caagggttgc tggcttcttc agttgtaagc ttaagccaca 2760 ccagcgccaa tggctaccat gcgaactaga ggcccttgct ataacatccg ccctgaaaca 2820 tttcaagccg ttctttatac agacaactaa accagcattt ctggtcacag actccaaacc 2880 gtgtgtccag gcggtggaaa agctacgccg aggcgagttc tcccacagtc cacgagtgtc 2940 caccttcctt gccgctataa gccagttccc gatcagtgtt caacacatca gcggtaaaca 3000 caatctatct accgactacg ccagccgcca cgccctggag tgttctgaca gcaaatgcca 3060 agtctgctcc ttcgtccatg ccctcagtga ggaaccagtc attaacacca ccacgcttag 3120 ttcggcgtct aacctagacc tcccagtgta caccagtcgc caggcttggc acgccatcca 3180 gaactcgtgt tcttcattac gacgggtctt cgctcacctt cagcaaggca cacgcccatc 3240 aaagaaggac accaaagccc gtgatgttaa gtcctacctg tcacgcgtta ccatcgctcc 3300 cgacggtcta ttgattgttc ccaacagaga tatgtttggt gtcacacgtg atcgcatagt 3360 ggtcccccgt caggtggccc acggcctggc tacagcgatt cacctccgga ttggccaccc 3420 cacctgccac caactcaagg ctgtcttctc caggtatttc tttgccttaa acatggacgc 3480 cgttctccac caggtctcgg atgcatgtga ccaatgtgcc tccctcaaac gccaactacc 3540 catccctccg tccttcacca cacaaccacc tccttcaacg gtcctcacct cgttcgcagc 3600 agatgtgatg agacgagccc gccaacaaat ccttgttgtc agagaatgca gtacctcctt 3660 cacaagtact cagctcatca gcggcgaaac atccaccagc ctcagagatg gactagttgc 3720 attgtgcact ccacttcgtc tcctggatgg tcctcctgct gtcatacgtt gtgacccagc 3780 cccaggattc caggccctga tcaacgaccc ctggttgaca gaacatcgtc tccagattga 3840 agtagggcac cacaaaaatg tcaacaaaaa ccctattgca gaacgcgcca ttgaggagct 3900 gagagaggaa ctccgtaaga ttgaccctct tggccaacct atcacacccg cccaactagc 3960 cgtggtcacg gcctccctga acacaaaggt ccgctccaac ggcctctcat cgcgagagta 4020 tctgtaccaa cgagaccaat tcagtggcga gcaactccca ctttccgaca accgtctcct 4080 cgaagaacaa caccgccgtc gttccgaaaa ccacgccccc agcgccaaat ccaaatgccc 4140 caaggcaggg ctcgcgccca ctcccacctt ccatgtggga gacttagtct atgtccattc 4200 agatagagac aaatcccaag ctcgcagtcg gtacctagtc acagcagtgg aaaaggactg 4260 gatctatatc tccaagttta tcggtcgcca gctgcgtgct agatcctaca aggttcgatc 4320 agcagaatgc tatcctgtcc catgccaggt gccaccgcag caacgccaga ctgtcccctc 4380 tgacgatgac aacaacgacg attctgaccc tgaagaccct atcccttcag tcccggcaaa 4440 cgatccggtt ccagcagaag acttggttcc agtcaacggc cccctaccag aacttaacga 4500 cccaccgaac actcctcccc ctctgccaga actccataac ccaccggaca ctcctcctcc 4560 cttgccccaa cgccagtccg accgatgcag acgacagccg cggtacctga aagactttgt 4620 gctgtcatag tcttagtgtg acgaaaagtt attttcaaga cctgatgtat ccctgagtaa 4680 tactagtacg tactcacata gtccaaacag ataaacactt aatgccagat atgaagtatc 4740 accatatata ctcgtgtatt ccatagttag tgtaaacaac agttatcgtg tactccacgg 4800 ttatataagt cctgattctt aacatagttc acaccagatt gtatactcca aagttaccta 4860 ggtcctgatt cttaccatag ctcatagtca taccatggta gtgttcgtaa actccaaagt 4920 tacttagttg agataacacc atatattcat agcttagata ctcatagtca taccatggta 4980 gtgtacgtat aaaccccaaa gttacttagt tgaaataaca ccatttattc atagcttaga 5040 tactgaccac agttgaaatt gttaagcaca tgttcgttct tatcactcac aaaaaaaagg 5100 aaaagaagag taaatca 5117 // ID Copia-3_ACA-LTR repbase; DNA; INV; 204 BP. XX AC AEYA01000017; XX DT 23-MAR-2011 (Rel. 16.03, Created) DT 23-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Acanthamoeba castellanii genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_ACA_; KW Copia-3_ACA-I; Copia-3_ACA-LTR. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-204 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Acanthamoeba castellanii genome."; RL Direct Submission to RU (23-MAR-2011). XX DR Genome; AEYA01000017; Positions 1498 1295. XX SQ Sequence 204 BP; 29 A; 63 C; 13 G; 99 T; 0 other; tgttggatac tcttcatctc tctctctctc tctctttttt tctctatctt tctttatgtt 60 tacctttctt ttctttattc cctcttattt cctctgactt cacacttgta ccgtttcaga 120 tctcctcaat acattacctt tttcctttca gacatcttca tgtctcttta gcttctcagc 180 ttctcgtctt cttcacctcc aaca 204 // ID CR1-43_BF repbase; DNA; INV; 1848 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-43_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-43_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1848 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1848 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1614-1614 (2009). XX DR [2] (Consensus) XX SQ Sequence 1848 BP; 595 A; 456 C; 407 G; 390 T; 0 other; aactcttttg gaaatatgtg aactcaaagt caaagaccaa agaaagcata cctgatctga 60 aagatggttc tggggtagca acaacagata aagaaaaagc agaaacactc aacaggttct 120 tcattagcgt gttcaccaaa gaaagaacgg acacaatgcc taatgcacaa gaacaagaat 180 acctagccga actagtagac attgaaatac aacccgccga catttgcaaa aggcttaagg 240 cattaaatca gaacaagtct atgggaccag atgctataca ccctcgggtc ctaaaggaac 300 tcgctgaggt tttggcaacg cccctgtcgg tgatctacac caagtcctta aaagagggta 360 aactacccag agactggaaa ctgggccaca ttacccccat ctttaagaag gggagcaaga 420 aggaacccaa caactacaga ccagtatgcc tgacctctgt ggcaggaaag gtccttgaag 480 gcatcgtcag agatggaata gtcaagcaca tgttaagcaa ccagctcttt accaagcacc 540 agcacggttt tttacccgga aggtcgacaa cgacacagtt actggcgtgc ctagagaact 600 ggacttgttc acttgacaac gacctcccca tcgacgccat ttatctggat ttccagaaag 660 cgtttgacac ggtccccatc gagcgattgc tggtaaaagt ggaaagttac ggtatcagag 720 gttttttgct tcagtggatt aggtcatttc tgactgacag aaaacagaga gtacacgtaa 780 atcaggcatg ctcggaatgg gccatagtta cgagcggcgt cccacaaggg agcgttctcg 840 gtcccgtcct gttcctgtta tttgtgaacg atatgccaca ctcagtgaaa agcgatctca 900 agctgttcgc agacgatact aaagtcttcc gcacagtaag agaagctatc gactgtgagg 960 cactacaaga tgacctcgat aagttgcaat cctgggccaa gacctggcag atgaagttcc 1020 acccaaataa atgcactgtt ctaagacttg gagcgggtca cccccccttc aactaccaca 1080 tgactgacca ccacggtggg actgtgaacc tggaaaacac cacagaggaa aaggaccttg 1140 gcatcaaagt ggacagaaac ctaagttttg aagctcatat cctgtcaata gcttccaagg 1200 gaaatcagat gacagggctg ctctggagaa ctttccgata cattgataga gaagtcttca 1260 tcaccctttt taaatcccta gtacgccctc tcgtcgagta tggggccccc atttggtctc 1320 ctgggagttg gaaactagta gacaacatag aaaacattca aagacgcgca actaagagag 1380 tccctggact gagggatctc acatacgagc aaaggcttag aacccttaaa ctcccaaccc 1440 tactgtacag gagactcaga ggcgacctca tcaacactta caaatacata cacggattat 1500 acgacaccga gccatgcata ccagaaatca agaaggatgg aagaacgaga gggcatagcc 1560 tacgtctatc aaggccattg tcaaacacca acaaaagaca cgactttttt agcaacagag 1620 tagtgccgtg gtggaatgca cttcccgaag aagtagttac agcacctact gtcaatagtt 1680 tcaaatcgag gttagacaga gccatggaaa accatcccat catgttcaac tacagagcgc 1740 tggacaaacc tcataaacca acaatgactg tatgctgaaa cgaaagagcg gttcaaattg 1800 gagcacgtct cctacccaac cgaaagtact ctactctact ctactcta 1848 // ID Gypsy-64_CQ-LTR repbase; DNA; INV; 570 BP. XX AC AAWU01018564; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-64_CQ_; KW Gypsy-64_CQ-I; Gypsy-64_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-570 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 508-508 (2011). XX DR GenBank; AAWU01018564; Positions 4848 4279. XX SQ Sequence 570 BP; 180 A; 110 C; 153 G; 127 T; 0 other; tgtgggatat atacctagtt aataataata ataaaaaaag tgttcatttc tcaagttgcc 60 gactaaattt gaatatttag ctgaaccatg acgaacgtgt aaacaaaccc ctgtttgggc 120 taatttcaat catcttacta gaatctcgtc agcgtgttca acaaacgcaa agaacgcgcc 180 cctagacaaa cgcagactcg caccggaaac tggttgagaa tacgaacgag cccgaagacg 240 ggatcacgat ccgatactgt gcgagagtaa ttgagcatga gagattgtgt gttgagagag 300 agagattata tgtgagaacg gtgtgatggg tatgctttta aagtggagtg agtgagagag 360 aatcatttcg agcgagagat ggtgagcacg agagagatcg gcaaggcgag tgtgagagta 420 gcacgaatgg aactctcttc actacagccg aaggagaacc gcagcaaccg agtgattcta 480 catcgtattt ctgaagcgaa cagttgagtt gctgttgagc tccgccgtcg cccaagatgg 540 cgagcaaacg tgcagcagca ggacacgaca 570 // ID SMAR7 repbase; DNA; INV; 1283 BP. XX AC . XX DT 29-SEP-2007 (Rel. 12.09, Created) DT 29-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR7. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1283 RA Jurka J.; RT "SMAR7: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 996-996 (2007). XX DR [1] (Consensus) XX CC The youngest elements are ~90% identical to consensus sequence. XX FH Key Location/Qualifiers FT CDS join(176..616,699..1202) FT /product="SMAR7_1p" FT /translation="MENRKQHFRHILLFYYRKGKNAVQARRKLCDVYGEDV FT LTERQCQNWFAKFRSGNFDVEDAPRSGRPVEADEDKIKALIDANRRITTRE FT IAERLNLSNSTVHDHVKRLGLISKLDIWVPHVLTERNLLRRINDCDLLLKR FT QENDPFLKTSSIHFEGRYSPKKGYAICLVGFQGDRFFELLPDNTTINSEVY FT CDQLDKLNDSLKQKRPELINRKGIVFHHDNARPHTSLVTRQKLLQLEWDVL FT PHPPYSPDLAPSDYYLFRSLQSFLDGKTFTSNQDVKYHLDQFFASKDQKFY FT ERGINLLPERWQKVLDQNGEYII" XX SQ Sequence 1283 BP; 413 A; 232 C; 257 G; 380 T; 1 other; tattgggttg ttcggaaagt catttcgttt tttcccaaca gaggaccaat tgtatttttt 60 cacagccaac ttcacactta agccgcataa tcattatata ttttgacagc tgatatagta 120 aggtttgttt gttaaaaatt ttgtttgatt ctttgcaata gtttttggaa aatcgatgga 180 aaatcgaaag cagcattttc ggcatatttt actcttttac tatcgcaaag gtaaaaatgc 240 agttcaagca aggcgaaaat tatgtgatgt gtatggagaa gatgtattga ccgaacgaca 300 atgccaaaat tggtttgcaa aatttcgttc cggcaatttc gatgttgaag acgcaccacg 360 ttctggaagg cccgttgaag ccgatgaaga caaaataaag gcattgatag atgcaaaccg 420 ccgaataaca actcgtgaga ttgctgagag gttaaatttg tcgaattcga ctgttcatga 480 tcacgtgaaa cgtcttggtt taatttcaaa gcttgacata tgggttccac atgttcttac 540 agaacgaaat ttgcttcgtc gcatcaacga ctgtgatttg cttctcaaac gtcaagaaaa 600 tgatccattt ttgaagtgaa tcattactgg cgacgagaaa tgagggttgt trtttacaaa 660 aatgttaagc gcaagagatc atggagtaaa aaagatgaac cagctcaatc cacttcgaag 720 gccgatattc accaaaaaaa ggttatgcta tctgtttggt gggatttcaa ggggatcgtt 780 tttttgagct tctaccggac aataccacga ttaattctga agtgtactgt gatcaactgg 840 acaaattgaa tgattcgctc aaacagaaaa ggccagaatt gatcaataga aaaggtatag 900 tgttccacca cgataatgcg agacctcata caagtttggt aactcgccaa aagcttttac 960 agcttgaatg ggatgtacta ccacacccac catattctcc agatttggca ccttcggact 1020 attacttgtt ccggtctctg caaagttttt tggacgggaa gaccttcact tcaaatcagg 1080 acgtcaaata tcacttggac cagttttttg ccagcaaaga tcagaagttt tatgagcgtg 1140 gaatcaatct cctgcccgaa agatggcaaa aggtattgga ccaaaatgga gaatatataa 1200 tttaataaaa tgtttataca ctctaaaaaa atcgtgtttc attttcacta aaaaaacgaa 1260 atgactttcc gaacaacctg ata 1283 // ID Proto1-5_NG repbase; DNA; INV; 5178 BP. XX AC . XX DT 21-MAY-2009 (Rel. 14.06, Created) DT 21-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Proto1-5_NG is a non-LTR retrotranspsoson from the Naegleria DE gruberi amoeboflagellate genome - a concesptual consensus DE sequence. XX KW Proto1; Non-LTR Retrotransposon; Transposable Element; KW Proto1-5_NG. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-5178 RA Kapitonov V.V. and Jurka J.; RT "Proto1 non-LTR retrotransposons from the Naegleria gruberi RT amoeboflagellate genome."; RL Repbase Reports 9(6), 1148-1148 (2009). XX DR [1] (Consensus) XX CC Proto1-5_NG is a very young familiy of non-LTR retrotransposons CC that belongs to the Proto1 clade of non-LTR retrotransposons. CC This clade includes also the Proto1-1_NG, Proto1-2_NG, CC Proto1-3_NG and Proto1-5_NG families from the the Naegleria CC gruberi amoeboflagellate genome. The Proto1 elements code for two CC ORFs. The ORF2-encoded proteins are composed of the apurinic CC endonuclease, reverse transcriptase and ribonuclease H domains. CC It is likely that the Proto1 clade is a sister clade of the L1 CC clade. Proto1 retrotransposons are characterized by 15-18 bp long CC target site duplications and by a weak target site preference: CC 5'-CATTTTTTTNNNNNNNN-retrotransposon-ATTTTTTTNNNNNNNN-3'. The CC Proto1-5_NG consenus sequence was reconstructed from two CC misassembled parts of the same element based on the CC ATTTTTTTAATCTTGTT target site duplication. XX FH Key Location/Qualifiers FT CDS 14..1393 FT /product="Proto1-5_NG_1p" FT /note="Proto1-specific protein of unknown FT function." FT /translation="MSKHHPPWITKNITKAGPSSARNLKSTKSHTKELISV FT PKNNLNFNFKMSYADVVKQIPRSESKLKSESKQIVIANRTINIKSVNLNSR FT QKKFSRNIKMSDNKQIDYETCISVFVQQPNEAFEDLIVKEVGHLCNRIYVP FT ELKPKLVQLIIDSKENYQEAIRILNNMVTKTNTKFLPSPMNFTPTNHTVKY FT HSEKLDFINHQISNIIDAVWHSKINISQMNLSFDRTTQIYKLRVALINSNG FT KDELLKNGFQEYKSMDQILKECQFEATCNFPTSFQEKDIIEAVKEISKLAD FT VQFRRSNLTITKLKATYLHNREYYAVNYKLQNEEELKAFISKAIRCKGSKD FT NSKKIFKRFKSTAEYLKKKLQSNNTDNHENLTSSSIIIDINKRLTRMEEKI FT ETQVAKTSNHELKLQNFDERMNKFLESLEQLRFLTMNFKNIGNAKEGMNDE FT GYEDYNPHVNHSRYD" FT CDS 1389..5123 FT /product="Proto1-5_NG_2p" FT /note="contains the APE endonuclease, reverse FT transcriptase and ribonuclease H domains." FT /translation="MIDLKSTIRVGCYNVNGLIRTGNSKEKLIQPINAMCR FT SNGIHIMCITETHINNYESQQKCYNECNGHILFNSLMNTRKACKGTAILHF FT LNNKRKRFISNHNIINGKLQWFRFKSRNKIFNIIAGYLSGKKEEEDDDQLA FT MSHIIEIMTKFNDEYFIIAGDLNIDSNNPISKRDKEWMNLLKQLNMYEDKQ FT LSQISYRRKQKDKIIETHPDHIFVSNNLKIIGGKSLPSITHNDHIPFYIDI FT QYDCDPTWRASISRKHIETLCSTINQETYLSFDELERNIKNNFLKYSYRND FT KLNVKFLSSRKRKDIRRIESKIREIMDKKDPKQEQELITLQTELKTVYREA FT AKEIRTNLLDQFNSHNSSAMYRFDRHLKEKTINWNEISFDESTVVNFFQDK FT FTSKVEDINFIPSRSTIKYGPKRIKFKLETIKKALLRMSSNTPGTDNIPLS FT IIRGLKDEHLQLICDKFNQCLDNGNIPLEWKNGWVKLIPKRDIISLKDIRP FT ITILPIMYRLMFNIIAFKLRKWASENINIRQQAFIPKRNTLNHAILVSTLA FT AMNPKSFLLVNLDIEGAYDSVEVSVIELALKHCKFPINLIQFIINSYKEHT FT LQLEINNHLSEPITKSRGIPQGCPLSPLIYDCITQLIIDKSIDVWKLPKDP FT CNLNANEIGILGFADDMYLIADTSDINTDYNERLNELDRWVTDLALKINPS FT KSVATILTENELDITPKINNVKIPIQKNIRVLGHYPWEDQLVQEDIEKRIS FT LFELSLKYLPIRSLEIRNFRKVIQAKVLSLFTHITRSNVISTKYCDRINIA FT IRQAIRRRGNLHSATSTDWFHLPLDQGGLNIPKIDEFIERLHIKTIMQLRN FT QRNTLVRKAFNWGIDNISNYNSIFHQMNQSLKHFDLTFNQNSIEPLRPSIL FT QGQNFEIYTDGSKMDENSTGFGIYFHKGVIEGNEQLSFNIDGRYSHNVAEL FT TAILMASKLLPNNSKATIFTDSKISIDILNPRYDGKFSIFKDEFYRTIFDK FT NLKIKLEKVKGHVDPNNIIVDELAKKGSKQYNISIDIRKLLPNAMYLENNG FT TIIFDLQNYLKGIQRDRRATIARDKSNLIITKGWSSANIKYLNSHLEPYTK FT FCIWRNMTNNHIRQYKPVSCSHCGRIVELEHYIYYCPEMSEKRNYFLNKFY FT EITKYRAILHDRNEFRFSNRWKGFLINHRGLLDENNQDEIRRKYPSIIRNW FT VEIQSLLSLLVGKAHKRYYRDFLKTRKKNKKKTKE" XX SQ Sequence 5178 BP; 2095 A; 740 C; 772 G; 1571 T; 0 other; cgaagtaaaa aaaatgtcta aacatcatcc tccgtggatt acaaagaaca taacaaaagc 60 aggccccagc tctgccagaa atctaaagtc cactaaaagt cacactaagg aattaatttc 120 cgtacctaaa aataatttaa attttaattt taaaatgtct tatgctgatg ttgttaaaca 180 aattccgaga tcggaatcaa aattgaaatc ggaatcaaaa caaatcgtga ttgccaaccg 240 aactattaat atcaagtcgg ttaatttaaa ttcaagacaa aaaaaattca gtagaaatat 300 taagatgtca gataataagc aaatcgacta cgaaacttgt atttcagtat ttgtccaaca 360 accaaatgaa gcatttgaag atttaattgt taaggaggtt ggtcacttat gtaatagaat 420 ttatgttcca gaattgaaac caaaacttgt tcaattaatt atagatagca aagaaaatta 480 tcaagaagca ataagaattc tcaataatat ggttacaaaa actaatacga aattccttcc 540 ttcccctatg aactttactc caactaatca taccgtcaaa tatcattcag aaaaattaga 600 ttttattaat caccaaattt ctaatattat tgatgcagta tggcatagta agattaatat 660 ctcacaaatg aatctcagtt ttgatagaac tactcaaatc tacaagttaa gagttgctct 720 tattaactca aatggaaaag atgagctatt gaaaaatggt tttcaagaat ataagtcaat 780 ggatcaaatt cttaaagaat gtcaatttga agccacctgc aatttcccta catcattcca 840 agaaaaggat atcattgagg cagttaaaga aattagtaaa ttggctgacg ttcaatttcg 900 tcgcagtaac cttactatta ccaagttgaa agcaacctat ctccataata gagaatatta 960 tgcagttaat tacaaacttc aaaacgaaga ggaattgaaa gcatttatat caaaggcgat 1020 tagatgtaag ggatctaaag ataattccaa gaaaatcttt aaaagattta aatctactgc 1080 tgaatacttg aagaagaaat tacaatcaaa taatactgat aaccatgaaa atttaaccag 1140 tagttctatc attattgata ttaataagag acttactaga atggaagaga aaattgaaac 1200 tcaagttgca aagacttcta atcatgaact aaaattacaa aattttgatg aaagaatgaa 1260 caagttctta gaaagtttgg aacaattaag atttttaaca atgaatttca agaatattgg 1320 taatgcaaag gaaggtatga acgacgaagg ctatgaggat tacaatcctc atgtaaatca 1380 ctctcgttat gattgattta aaatctacta tccgtgtggg atgctataat gtaaatggtc 1440 taattaggac aggaaattct aaagaaaaac tcatacaacc aattaatgca atgtgtagat 1500 ctaatggtat tcacataatg tgtataacag aaacacatat caataattat gaatctcaac 1560 agaaatgtta taatgagtgt aatggacata ttctattcaa ttccttaatg aatacaagaa 1620 aagcttgcaa gggaacagca attcttcatt tcttaaataa caagagaaag agatttattt 1680 ctaatcataa tatcatcaat ggtaaattac aatggtttcg tttcaaatct agaaacaaga 1740 tatttaatat cattgcgggc tatttatcag ggaaaaagga agaagaagat gatgatcaat 1800 tagctatgtc tcatattatt gaaattatga ctaaatttaa tgatgaatat tttattatag 1860 ctggtgattt gaatattgat agtaataacc ctatttccaa aagagataaa gaatggatga 1920 atcttctaaa acaactcaat atgtatgaag acaaacagct tagtcaaata tcatatagaa 1980 gaaaacagaa agacaagatc attgaaacac acccagacca tatatttgta tcaaataatt 2040 tgaaaataat tggagggaaa agtctaccat ctataactca caatgatcat ataccttttt 2100 atatagacat tcaatatgat tgcgatccga cttggagggc atctatttca aggaaacata 2160 tagaaacgtt atgtagtaca attaatcaag aaacctattt atcatttgat gaactagaac 2220 ggaatattaa gaataatttt ttaaagtatt catataggaa tgataagctt aatgttaaat 2280 tcctttcgtc tagaaagcga aaagatatta gaagaataga atcaaaaatc agagaaataa 2340 tggacaaaaa agatccaaaa caagaacagg aattaattac actccaaaca gaactgaaaa 2400 ctgtttatag agaggctgca aaagaaataa gaacaaatct attggatcaa tttaatagcc 2460 ataattcttc agcaatgtac agatttgata gacaccttaa agaaaagaca attaattgga 2520 atgaaatctc atttgatgaa tcgacagttg tcaacttctt tcaggataaa tttacatcaa 2580 aagttgaaga tattaatttc attccaagta ggtcaacaat taagtatgga ccaaaaagaa 2640 tcaaatttaa attggaaact atcaaaaaag cccttttaag aatgagttct aatactccag 2700 gaacagataa tattccattg tctataataa gaggtttaaa agatgaacat cttcaactca 2760 tttgtgacaa atttaatcag tgcttggata atggtaatat tcctttggaa tggaaaaatg 2820 gttgggtaaa attaattcct aaacgagaca ttatttctct caaagatatt cgtccgatta 2880 ctatccttcc aattatgtat agattaatgt ttaatattat tgcttttaaa ttaaggaaat 2940 gggcttcaga gaatataaat atacgtcaac aagcatttat accaaagagg aataccctta 3000 atcatgcaat attggtatct acgttggcag caatgaatcc gaaatccttt ctattggtga 3060 atttagatat cgaaggagct tatgactctg ttgaagtttc tgttattgaa ttagctctaa 3120 aacattgcaa atttccaatt aatttaattc aatttattat taattcttac aaagaacaca 3180 ctctacaact tgaaatcaat aaccacttgt ccgaaccaat tacaaaatcc aggggaatcc 3240 cacaaggttg tccattatca cccctgattt atgattgtat aacacaatta attattgaca 3300 aaagtataga tgtttggaaa ttaccaaaag atccttgtaa tcttaatgca aacgaaatag 3360 gtatccttgg ttttgcagat gatatgtatc ttattgctga cacatctgat attaataccg 3420 attataacga aagattaaat gaattagata gatgggtaac tgatcttgcg ttaaaaatca 3480 acccttctaa atcagttgca acaatactga ctgaaaatga attggatata accccaaaaa 3540 ttaataatgt gaaaattcca attcagaaaa atatcagagt attaggtcat tatccttggg 3600 aagaccaact tgtacaagaa gatattgaaa aaagaatttc cctgtttgaa ctttccctaa 3660 aatatcttcc aattagatca ttagaaataa gaaattttag aaaagtaatc caagctaaag 3720 tattgagcct attcacacat attactcgta gtaatgttat ttcaaccaaa tattgtgata 3780 gaattaatat tgcaattaga caagcaataa gaagaagagg taacttacat tcagctacat 3840 ctacagattg gtttcactta ccactagatc aaggaggtct taatatccct aaaattgatg 3900 aatttattga aagactacac attaagacca ttatgcaatt gagaaatcaa agaaataccc 3960 ttgtcagaaa agcgtttaac tggggaattg ataatatttc aaattataat tcaatattcc 4020 atcaaatgaa tcaaagtttg aagcattttg acttaacatt taatcaaaac tcaatagaac 4080 ctttaagacc ttcaattcta caaggtcaaa actttgaaat ttatactgat ggatcgaaaa 4140 tggatgaaaa ttcgacagga tttggtatct actttcataa aggggtaatt gaaggaaatg 4200 aacaattaag ctttaatatc gatggaagat actctcataa tgtagctgaa ttgactgcaa 4260 ttcttatggc ttctaaatta ttacctaata attcaaaagc tactatattt acagacagca 4320 aaatatctat tgatattcta aatccaagat atgatgggaa attttccatt tttaaagatg 4380 aattttatag aacaatattt gataagaatt tgaaaattaa attggaaaag gtaaaaggac 4440 atgttgatcc taataatata atcgttgatg aattagccaa aaagggttcg aaacaatata 4500 atatttccat tgatattaga aagcttcttc ctaatgccat gtatttagaa aataatggaa 4560 ccattatatt tgatctacag aattatttga aaggtattca aagagatagg agagctacca 4620 ttgctagaga taaatccaac ctaattataa ccaagggatg gtcctcagca aatatcaaat 4680 acctaaattc tcatttagaa ccttacacca aattctgtat ctggagaaac atgacaaata 4740 atcatattag acaatacaaa cccgtaagct gctcgcactg tggaagaata gttgaattag 4800 aacattatat ttattactgt cctgaaatga gtgaaaagcg aaactacttc ttgaataagt 4860 tctatgaaat aacaaaatat agagcaatat tacatgatag aaatgaattt aggttttcaa 4920 acagatggaa aggattcttg attaatcata gaggattatt agatgagaac aatcaagatg 4980 aaattagaag aaaatatcca tcaattatta gaaactgggt ggaaattcaa tctttactgt 5040 cgttacttgt tggtaaagca cataagagat attatcgtga ttttcttaaa acaagaaaga 5100 aaaacaagaa gaaaaccaaa gaataatcta tttcgatttc taaaaagatc caatgggtct 5160 ttaataaagt tcaaaata 5178 // ID DNA8-6_AP repbase; DNA; INV; 172 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-6_AP. XX NM DNA8-6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-172 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1748-1748 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 172 BP; 32 A; 56 C; 36 G; 48 T; 0 other; cagtggcgga tccagggggg gggcaaaggg agcatttgcc ccccccccaa caccgtagtt 60 ttcctttgtt ttacactgtg tttagccaat tttgtagcca attttgcaac tttttgtact 120 tttgcccccc cccccatgcc cctcccaaaa ttaagccctg gatccgccac tg 172 // ID Gypsy-4_RP-LTR repbase; DNA; INV; 998 BP. XX AC ACPB02026971; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_RP_; KW Gypsy-4_RP-I; Gypsy-4_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-998 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02026971; Positions 11478 10481. XX SQ Sequence 998 BP; 279 A; 258 C; 252 G; 209 T; 0 other; tgttggaacc ccaacttcac tttgctgcat aattgaacag ctacataatt tttgtaaata 60 gaaaaaaaat aatttttata tgtacatatg attgcttatc aaagaaaaat aaaaaaggaa 120 aagaggcaaa tagcccattt taaaaagtct ttctcaagac tctggaatgg caatgtggtt 180 ctttggacgg tacagaaata gcgacgcagt cgcgacgcca gggcgcagca cggtcgactt 240 cgctataaaa agggcagaat cggccgtgtc tgcatcagtc caagataatt tttcaacggg 300 aaaacatcca caatagctac cttagctgga agattgctga ctgggcctta ctctcaacct 360 ggatctctgt accggccaaa ggacctcact gtccggacca gaacaaaaaa tcgatcactt 420 aacccctgta ttgtgaagaa agaagaaaga aggaagtgca gcacacgaga cgccttggtt 480 taccagctac gaaggcgagc cgaacgccag cgctgccgca gtgcgagaca agagcgagga 540 ccaagcccag cacacgagcc gctgcagttc gccagctacg gaggcaagcc aaatcaccag 600 cgctgccaca gtgtgagata cgagcgagga ccaggccaat tggtaccggg gtttatcggg 660 gtgctggcca cactgccggg gctcccaaag ccggttgact ggtattagcg aacggccctt 720 ttcagggcct ctcacatcac ttttgctgtg gcctcgcgct gggagcaagc catgctcgga 780 gcggcgcagt cacttcaacc aacagtgtga acggcccttt tcagggccac ccaactgtac 840 ggctaccccg ctggggagcg ccataggcga gtcaagcgcc gacacatttt tttgtgaacg 900 gtccttttca gggccatcca gtcacttgta cctgcgagcg gtgagaagaa gagcttgtat 960 actggacttc atccctcggc cgcattagta atacaaca 998 // ID Gypsy-9_TCa-LTR repbase; DNA; INV; 114 BP. XX AC chrUn_2; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_TCa_; KW Gypsy-9_TCa-I; Gypsy-9_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-114 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_2; Positions 135912 136025. XX SQ Sequence 114 BP; 28 A; 39 C; 24 G; 23 T; 0 other; tgtagcaata attagtacgg gtagttcata agcgctccac caatacccat ttggtagcgc 60 cacttgcacc agcggccgca gccccactac cccaactagc cctttgcgca ggca 114 // ID TTAA24_AP repbase; DNA; INV; 657 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA24_AP. XX NM TTAA24_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-657 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2092-2092 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 657 BP; 215 A; 95 C; 123 G; 223 T; 1 other; gggggctgga gcataccatg ttttaaggag tagtgtttag gcatttaata ttacatgtac 60 attttggctg tgtcaatgtg tgagaagtaa atgaacatta gtgtgtgtaa tccgttatcg 120 cacttttcgc gacgaaattt gaattcccgg atttacgatt tattttgtat tgtcgtaata 180 tttatcgtac taagattctt agaaatgttc taacacccat acgcgcatgc acaagtcaat 240 gaacacgaag aaaatacttt tgctccggct gtagggcaga aatttgatgt aaacagtacc 300 aaaagaggaa taaaaattca ccttgtttcg atcgacttat attagttttg aaaagatgag 360 cgtattcgtt agtctatttt ctaggtttgc tcggaaatag atttacgact ttatttttag 420 ttaatataat atgatgcata gtaagtttat atttttatca aaataaactt anacttgact 480 tagttgtaga tagcgacgaa atgaaaaata taagaatcca cttccgagca aacttagaaa 540 agaggaatac aaactaaatt tacttttcaa aattgaaatc gtgctactgg aagtatgttt 600 aattttcttt cgcttattga gctaaatgta tggaaaaaaa tggttgctcc agccccc 657 // ID Gypsy-13-I_HM repbase; DNA; INV; 5027 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-13-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5027 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 398-398 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 166..4977 FT /product="Gypsy-13-I_HM_1p" FT /translation="MSTISLPCQAPGCNVVIEGPSEAIAIALFNSHQLHHP FT PNSIDIHSAFSKHKAPKMDRPHIGQDISEEEWANFISRWTLFKRCTDIRQE FT EMTAQLFQCCEDDLGNQLLKENPMIINENDEAFLKAMKKLAVIPVATCVRR FT AELLQMRQGHNESIRSFYAKLKGKAATCNYTVQCHCHPQTTVDFTDVIIRD FT IVVAGLSDTDIRKDVLGWVSLDTASVTDIIAFIEGKEMARNALGCSSTTAS FT VSTFRKQNKSNTQSSKGTNITAKCPDCSESYSLFTETSRGWNKKPHKLCLD FT CWKKLRSRTSKINNRIIPNDKKSNATKDAEAGGLVLKLSAIEHSNKKIRSQ FT MKTNVKRPCTGINHHIFTHDGWATAQLLVHPTIQLRVTTNKSDYVAFGIPF FT PPIIPKYIQVITDTGAQSCLWSRKGFDSAGFNANNLIPVKHSMSAANKTQF FT RIDGAILLRLEGRQSNGHKVECAVMVYISPDANGFYLSKEAMTQLGIISKD FT FPEIGSALHGAINSGPQTIGTKTSSENNSSHVCNCPKRSLPAGAPTKLPFV FT CCPENNERMKAWLLERYSASTFNTCPHQILPDMEGPPISILIQENAKPVAA FT FTPATIPIHWQDQVEKDLIRDETLGVIEKVPLGEPTTWCHRMVITRKHDGG FT PRRTVDLSPLNKYCYRETHAMKSPFQLARSVPRNSWKTVTDAWNGYHSIPI FT RVEDRNLTTFITPFGRYRYIRAPQGFVSSGDGYNRRFDEILADFIGKQRCV FT DDTIHYDDDLVKHWWRTIEFLELVGKAGIILNPQKFQFAKRIVDFAGFRIS FT ENGVEPLPKYIDAIRSFPTPSSITDIRSWFGLVTQVSHYAQLRETMAPFRV FT LLSPKSKFYWSDELNTLFNASKEKIINAIKDGVQIFDITKKTCLRCDWSKL FT GIGYYLSQKHCECKSDSPDCCIDGWKITLAGSRFLSSAEQRYAPVEGEMLG FT VAWSLEQTKYFTQGCDNLMVVTDHKPLVKLLGDRTLDEIANTRLFRLKQRT FT LPWKFTIQHQPGKTNLAADATSRYPAENTKESVNSATCNQNTDFDNTESLI FT IASIRNDIDKMIAVTWDSVKAETEKDTDFKALMLAIENGFPKNKNNFIPNL FT LPFWEYRNDLMVSDGVIIYNDRIVIPPTLRKEILDVLHSAHQGVSGMTARA FT QATVFWPGITHSIQSKRDNCCSCNRNAPSNARLPPIRPHIPVTPFECIFAD FT YFEFRGWDYLVIGDRLSGWTEAYRIKSGTSEMSAKGLITCLRKLFTTFGVP FT EEISTDGGPQFTSGETKEFLQQWGITHRISSAHHPQSNGRAELAVKSTKRL FT LENNVGPNGDLNNNAFVRAMLTIRNTPDPTCQLSPAQVLFGHPIRDAMPRI FT KSRIPMFNNDQFLKTRRHAWSAKEEALRTRYAKTLESLQEHTRFLAPLHEG FT DSVFIQNQVGNHPKKWDRSGKVVECKENDQYIIKVEGTGRLTLRNRKYLRK FT FLNPLKSYETEVALTTPPKTSSSHYNSADLPTPFTEYVPESSNKVIPSQPE FT LAPSTVLLPPSTLAPAVPPLPLPTTPVPLPLMPASLSTSSPHLAMPPLLNT FT EVVGDQPLKRSSRLRIPKKFYQPETGKSLTLDDFKK*" XX SQ Sequence 5027 BP; 1700 A; 1142 C; 950 G; 1235 T; 0 other; tgttagtttc ttatgcattc tatggaattt taaagaaaca ctggtatttc aaagataaaa 60 acttcttgtc agtggttgaa attgtaattc acttacgtat tcgtttgaag ttgttaatgt 120 acgcacaaaa ggtaggattt tgctaacctt tttctttcct acattatgtc cacaataagc 180 ttaccttgcc aggccccagg atgcaatgtc gtcattgaag gtcccagcga agcaattgca 240 attgcattat tcaacagtca ccagcttcac catcctccaa actcgattga cattcattca 300 gccttctcaa aacacaaggc acctaaaatg gatcgccccc acatcggtca ggatatctca 360 gaagaagagt gggcaaactt tattagtcgc tggacattgt ttaagagatg tacagacatc 420 agacaagaag agatgacagc acagctgttt cagtgctgtg aggacgacct tggaaatcaa 480 ctcctaaaag aaaatcccat gataattaat gaaaacgatg aggcatttct aaaagcaatg 540 aaaaaactgg ccgttatccc agtagccact tgcgtcagaa gagcagaact ccttcaaatg 600 agacagggac acaacgaaag tattcggtca ttttatgcaa aactgaaagg caaggctgct 660 acttgcaact acactgttca atgccactgc cacccacaaa caactgttga ttttaccgat 720 gttatcatac gagacattgt tgttgctggg ctatcagaca ctgacatccg caaagatgta 780 cttggatggg tctcccttga cacagcctca gtcacagata tcattgcttt tatagagggt 840 aaggaaatgg cacgaaatgc actaggctgt agttcaacaa cagcaagtgt atcaactttc 900 aggaaacaaa acaagagcaa tacacagagt agtaaaggga ccaatataac agccaaatgt 960 ccagattgct cagaatctta cagtttattt acagaaacat ccagaggatg gaacaaaaaa 1020 ccccacaagc tgtgtctaga ctgttggaag aagctcagat ctcgcacttc taaaattaac 1080 aacagaatca tcccaaatga taagaaatcg aatgcaacca aagatgctga ggcaggcgga 1140 cttgtgttaa agttaagtgc aattgagcat tcaaataaaa aaattagatc acaaatgaag 1200 accaatgtga aacgaccctg cacaggaata aaccatcaca tatttaccca tgatggatgg 1260 gccacagctc aattgcttgt tcatcctaca attcagctgc gcgtgactac taacaaatca 1320 gactatgtcg catttggtat tccatttcct cccattatac caaaatatat acaagttatt 1380 acagataccg gtgcacagtc ctgcttatgg tctaggaaag ggtttgacag tgcaggcttt 1440 aatgcaaaca atttaatacc tgtgaaacat agcatgtcgg ccgcaaataa aacccaattt 1500 cgtatagatg gtgcaattct tttacgtctc gaaggacgac aatctaatgg ccacaaagtt 1560 gaatgtgcag taatggttta cataagtcca gacgcaaacg gattctacct atctaaagaa 1620 gcaatgaccc aactgggtat catctctaag gactttccag agataggcag tgccctccac 1680 ggagctatta actcaggccc ccaaactata gggaccaaaa catccagtga aaataatagc 1740 tcacacgtgt gcaattgccc caaaagaagt ctcccagcag gtgcacccac caaacttcca 1800 tttgtctgct gtcccgaaaa caacgaacgg atgaaagcgt ggctccttga gagatactct 1860 gcttcaacat tcaacacatg tccacaccag attttacctg atatggaggg acctccaata 1920 agtattttaa ttcaagaaaa tgccaaacca gttgcagcat ttactccagc caccatacca 1980 atccattggc aagaccaggt ggaaaaggat ttaattcgtg atgaaactct tggagttatt 2040 gaaaaggtac ctctaggaga acctacaaca tggtgtcata gaatggttat cacacgtaag 2100 catgacggtg ggcctcgtcg cactgttgat ctatctcctt taaacaagta ctgctacaga 2160 gaaactcatg caatgaaatc gcctttccaa ctagctcgtt cagtcccaag aaattcatgg 2220 aaaactgtaa ccgacgcatg gaatggttat catagcatcc cgatacgggt tgaagacagg 2280 aatcttacca ccttcattac tccttttggg cgctacagat acatccgagc tcctcaaggt 2340 tttgtatcta gtggggatgg ctacaacaga cgtttcgacg aaattttagc agactttatt 2400 ggcaaacaaa gatgtgtaga tgacaccatc cactacgatg acgacctagt caaacattgg 2460 tggcggacaa tcgaatttct tgagcttgtt ggtaaagctg ggatcatttt gaacccacaa 2520 aaattccaat ttgctaaaag aatagtggat tttgcaggtt tcagaatatc tgaaaatgga 2580 gttgaaccac tacctaaata tatcgacgcc attagatcat ttccaacacc atcctcaata 2640 acagacatcc gatcttggtt cggcttagtg acccaagtat cacattacgc tcaacttcga 2700 gaaactatgg ccccattcag agttctgctt agccctaaat caaaatttta ttggtcggac 2760 gaactcaaca ctttattcaa cgcgtcaaag gaaaaaatca tcaacgcaat taaagatgga 2820 gtccaaatat ttgacattac caagaagacc tgtttaagat gcgactggtc aaaattgggg 2880 ataggatatt acctaagcca aaaacattgt gaatgtaaat cggactcccc agattgctgt 2940 atagacggat ggaaaataac cctagcaggg tccagattcc tgtcatctgc tgaacaacga 3000 tatgcaccag ttgagggcga aatgctagga gtggcatgga gcctggagca gactaaatat 3060 ttcacacaag gttgtgataa ccttatggta gtaactgatc ataaaccact tgtcaaatta 3120 ttaggagata ggaccttaga tgaaattgct aacacacgac ttttccgact aaaacagaga 3180 actttgccat ggaaatttac catccaacat caaccaggaa aaacaaacct ggcggcagat 3240 gctacttctc gttaccctgc agagaacaca aaggaatctg ttaattcagc cacttgcaat 3300 caaaataccg attttgacaa cacagaatca ctaataatag cctcgatacg aaacgacata 3360 gataaaatga ttgcagtaac atgggatagt gtcaaagcag agacagaaaa ggataccgat 3420 tttaaagcat taatgttagc aatagaaaac ggtttcccaa aaaataagaa taacttcata 3480 ccaaatcttt taccgttctg ggaataccgc aatgacttga tggtgtccga cggggttata 3540 atatacaacg acagaattgt aatcccacct actctacgga aagagatatt ggacgtacta 3600 cactcagccc accaaggagt atcgggaatg acagccagag ctcaagccac tgttttctgg 3660 ccaggtatca ctcacagcat acaaagtaag cgcgataact gttgcagctg taatcgaaat 3720 gcaccttcta atgcacgact accacccatc cgacctcaca tccctgtgac accatttgaa 3780 tgcatatttg ctgactattt tgaattccgt ggctgggatt atctggtcat tggagataga 3840 ctttctggat ggaccgaagc ctacagaata aaatcaggaa catcagaaat gagtgcaaaa 3900 ggtcttatca cgtgcttacg taaactattt acaacatttg gagtcccaga agaaatttca 3960 accgacggtg gtcctcaatt cacctctgga gaaacaaaag agtttctaca acaatggggc 4020 ataactcatc gcatttcatc agcccaccac ccacaatcta atggaagagc tgagcttgca 4080 gttaagtcaa caaaacgcct cctcgaaaat aatgtgggac caaatggtga tctcaataat 4140 aacgctttcg taagggcaat gcttacaata agaaacacac cagaccctac ttgtcagtta 4200 tcccccgctc aagtattatt tggccatccc attcgcgacg ccatgcccag gataaaatcc 4260 agaataccta tgtttaataa cgaccaattc ttgaaaacgc ggagacatgc atggtctgca 4320 aaagaagaag cattacgaac aagatatgca aagacccttg aatcacttca agaacacaca 4380 cgttttctcg caccactaca tgaaggagac agtgtcttca tacaaaatca agtagggaac 4440 caccctaaaa aatgggatcg tagtggaaaa gttgttgaat gtaaggaaaa tgaccaatac 4500 atcattaagg ttgaaggaac aggtcgacta acattgagaa atagaaaata tttgcgcaaa 4560 ttcctcaacc cactaaagtc atacgaaacc gaagttgcat tgaccacacc accaaaaaca 4620 agctctagtc actataactc tgccgatttg ccaacaccat ttacagagta tgtcccagaa 4680 agtagcaaca aagtcatacc atcccaaccg gaactagcac catctacagt attgctaccg 4740 ccttctacat tagcaccagc agtaccacca ctacctctac caacaactcc agtaccctta 4800 ccgttgatgc cagcatcttt atcaacaagc tcgccgcatc tagcaatgcc acccttatta 4860 aatactgaag tcgtaggaga tcaaccattg aaacgttcat ccaggctacg tatccccaaa 4920 aaattctacc aaccagaaac cgggaaatct ttgacattag atgactttaa gaaatgaacc 4980 cagaaattat gtttatattc attgacactt aaaaacatgg aagggga 5027 // ID ITmD37E_Ele3 repbase; DNA; INV; 1294 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37E DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37E_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1294 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1294 RA Kojima K.K. and Jurka J.; RT "ITmD37E-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~97% identical to consensus. TIRs are 28 bp CC long. This consensus is ~98% identical to the original sequence CC in [1]. This family encodes a DD37E-type transposase and is CC similar to Tx_mos from Toxorhynchites amboinensis. XX FH Key Location/Qualifiers FT CDS 151..1158 FT /product="ITmD37E_Ele3_1p" FT /note="transposase." FT /translation="MESKEQLVRDKILRIHHENKDLSHRSIAKTLGIANST FT VSRVIKRFEERLTTDRKPRSEGKSIPYNTKNHNRVVGAFNRNPNASVRDVA FT KNLHLSRSFVQKAKTKAGLRTFKVQKAPNRDEKQNKSAKTRARKLYLNMLT FT KVECCIMDDETYVKADFKQIPGNLFFTAKDKFSVPEHVRTQKMSKFAKKFL FT VWQAICTCGKRSAPFVTQDTMNGQVYMKECLQKRLLPLLKAHNVPTIFWPD FT LASCHYSKDVLKWYADNKVNFVPKMFNPPNTPELRPIEKYWAIMKQHLLKR FT PKVVKTVEELKKVWVYMQKTVDSQVVQNLMAGVKAKVRAFAYGL" XX SQ Sequence 1294 BP; 382 A; 296 C; 318 G; 298 T; 0 other; agggtgtcca cgatgaaatt gccacactat gaaattgctc taacttttta accgttgggt 60 agaatttaat gaaaatttgg gtggatttag ttcatagtgc attatttaca tcctgcaagt 120 tttaaagtcc tgtgatcaaa actcgcggaa atggagtcga aagaacagct cgtgcgtgat 180 aaaatcttgc gcattcatca cgagaacaag gatctctcgc atcgttccat cgctaaaacg 240 ttgggaatcg cgaattccac ggtgtcgcga gtgattaagc ggttcgagga acgattgacc 300 accgatcgga agcccagaag tgaaggaaaa agtattccgt acaacaccaa aaatcacaac 360 cgcgtagttg gggccttcaa ccgaaacccg aacgcctccg ttcgggatgt ggctaagaac 420 ctgcacctaa gccgaagttt tgtccagaag gccaaaacta aggctgggct tcgaacgttc 480 aaggtacaaa aggcccctaa tcgcgacgag aagcagaaca agtccgccaa aacccgtgcc 540 aggaagttgt acctcaacat gctgacgaaa gttgaatgct gcatcatgga cgacgaaaca 600 tatgtgaagg ccgacttcaa acagatcccc ggcaacctgt ttttcacggc caaggataag 660 ttcagcgttc cggagcatgt ccgcactcag aagatgtcca aatttgcgaa gaaattcctg 720 gtttggcaag ccatctgcac gtgcgggaag cggagtgcac ctttcgtgac ccaggacacg 780 atgaacggac aggtgtacat gaaagagtgc ctccagaagc ggctgcttcc tctcctgaag 840 gcccacaacg tcccaacaat cttctggccg gatttggcct catgccacta ctccaaggac 900 gtgctgaagt ggtatgcgga caataaggtc aatttcgtgc cgaaaatgtt caaccctccc 960 aacactccgg agctccgccc catcgagaag tactgggcga ttatgaagca gcaccttctt 1020 aaacgaccta aggtagtgaa gacagtcgag gaactgaaga aagtatgggt ttacatgcaa 1080 aaaacggttg attcacaggt tgtgcagaat cttatggccg gggttaaggc caaggtgcgg 1140 gcatttgcgt atggactgta aataaaatac gagtaaaatg gtaaaatgaa gtttaatagt 1200 tatttttcaa tccctgaaaa tttgatggca atcggatgaa aactcgaatt ttgcgaatca 1260 attttgtgtg tggcaatttc atcgtggaca ccct 1294 // ID I-72_AAe repbase; DNA; INV; 6990 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-72_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6990 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1343-1343 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 413..2152 FT /product="I-72_AAe_1p" FT /translation="MEPANGTPLRPGDPGGPGGGRAHPNNWKDGEYLGARL FT PSFMDRDGTAGPLQYLKMTAKNGSIPQDPFLLRLSVEKYIGAAIDGAYKEN FT RGVSYVLKVRSNSQFNRLLRMQKLNDGTEISITEHPQLNQRKCVVSNVDST FT DLTDQYIKEQLASQGVKDIRRIRRRQPDGSFINTPTIILTISGTVIPDHID FT FGWSRCRTRNYYPTPMQCYRCWRFGHTNSTNKCPEPFRICGKCCKVHPEDK FT VPKTADPVMGYSATTATDNETPREPATTGLIQTRSECVDPIFCWNCNTNDH FT PASSKKCPIYVKELAIQHIRVDHNLSYPQARREYEAQNGIGNRTDTFAGAV FT NLSKDVEVEGLKTTVKQLLEDSKRKDERIAEMERALGNRSNVNDRLDKIRE FT HGTIEELVRQVAALTATVEKLQKDLVKKDAIIKKLRDGKETETIPTPLVMS FT TAESEKESAFTLSSNEDTSLKESITQKVSLWIEEMKSEMNEEASAETKKEI FT CKEKKRKQKKAKKPKTTENDISDSSMRSTHSHKLGKTANSSITEQPNSSKR FT THDIESISDSTDTSLKAKRNSKIQKADEDVMIE" FT CDS 2233..6762 FT /product="I-72_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MPIINLDDDITTESTEPQHNTPLINEHTAEVSGSRGP FT ASAEATPPPELPDHPRHPSPPPAVGTHKGLTTCTPNTPEDGRDRQGPVSAE FT AKVVPELADNPCHLDRSGRGTHKGSTRSPLDAEGVKKPARPLMAPVFIPYH FT RKSTRSTISKPSVELKINTNNTPKDDEEPKLDTRRYPKRIRSAPDRYCPCL FT PRTKNANRLHRALTEAEATNADVYDRMVQKRDGRSSIPVAGTSHLRPRRLA FT HVPHQRAADCVKDLSGHSRSSNLGVGLYLPLASDKLTIASSGALSCPFSED FT SSKDVTRGHTVFAVQWNMNGYLNNLPDLEMLVNRHHPVVIALQEIHRTNVA FT TMNNTLRRQYRWYTKTGANIYQSAGIGISAEIPADQINIETDLPIVGVRIP FT WPFPVSVVSLYLPNGKLVNLKKSLEEIFRKIPGPLIVMGDVNGHHKAWGSP FT HNNTRGSIVAALANQLDLIILNDGSTTFSRGRSETVVDVSLASAAITHRLL FT WSAEPDLRGSDHTPIFLKLDSNATPETSRRPRWLFETADWLTYRSVLDEKL FT NSSPPESIQDFINHISSTSSETIPRTSPNPGRRALHWWTSETCKTVKLRRK FT TLRVVQKLRKKLSPDHPDLLQALEQYHTARNTCRQIIREAKEKSWTEFLDG FT INEDESASELWRRINCFQGKRRSRGIALMVDDCLSRDPAVIADALADYFHD FT ISSYQHYPDGFRIAHPTPDEAIRRFVVPPDRGQDFNVPFTFNELEFALKRA FT KGKSAGPDDIGYPMLKNLPLNGKLVLLDMLNRLWTTETLPESWKHSFMIAI FT PKNTGPASDAKSYRPIALTSCVVKIMERIVNRRLVGFLEDNQLLDSRQHAF FT RPGFGTGTYLASLGQILDEALKAGEHVELASLDLAKAYNRTWTPGILQKLV FT NMGISGNMLAFVRNFLEGRTFQVLVGNHRSKIVQEETGVPQGSVLAVTLFL FT VAMNGVFLVLPKGVFILVYADDILLLVRGKHPKMIRRKLQAAVSAVAKWAT FT QVGFDMAADKCVRVHVCDSKHRPPGKINIQGLPIPTRNYVKILGVTIDRNL FT SFHAHFTAVKSACKTRISMIQSISGKRTRSERTTRKRVAEAVVNSRLLYGI FT EISGRSFDDLVKTLSPTYNNCVRALSGLLPSTPAVSACVEAGILPFQYKAT FT MALCCRAVGYLERTVGRGQDCFLAEQANRALREVANATLPSVVELHRVGPR FT SWSAKEITVDKTIKNHFRRNANPVAVQGCFLERLSEAYPNAEVYYTDGSKL FT ANRVGVGVFGSDTEISLRLPSQCTVFTAESAAIYLAAKKQTDRMKLIVSDS FT ASAITAICSDTNKHPFIQAAQNLLANTKKQATLMWVPSHCGILGNEHADRL FT AAMGRQEALLTPKVPGDDVKSWVKTMIKSAWALKWDRDRSLFLRRIKADTA FT LWTDIPNWREQKVLSRLRTGHSRLSYNMSGSGSFRKTCDICKVHNTTEHII FT SCCPKYDLLRRQHDITSTSRALQNNSVYERTLINFLREAGLFLEI" XX SQ Sequence 6990 BP; 1989 A; 1819 C; 1601 G; 1580 T; 1 other; gagttcggga cattgttgtt attactagca accgcgtwcg cgctcaacat actgcccaat 60 tttcgaggtt ttagtgattt ttttctccga ataagtcact ctacactaag tgactagtgt 120 atactagtga ctatactaaa agttcgatcg atatcggtta agttttggga aaaaaccagc 180 gaaattagtt agtccgcgtg caaaaaacat tcaccgtcac tgcattgttc gaccggcaca 240 cggtatcatt aattgaaatt ctgcagttgt gtgtggcttg gtgacccgag cgagaagagg 300 tgtcacaaag tgaacaaact gttttttttt ttttttctct ctgtgtttca acaaaagtga 360 tttttccgtg gtggtaaagt ggtaaagtca ccttcctcac cgtttcgccg gcatggagcc 420 ggcgaacggc acccccctcc gcccagggga cccagggggc cctggagggg gtcgtgccca 480 cccaaataat tggaaagatg gtgaatactt gggtgctcgc cttccaagtt ttatggaccg 540 agatggtacg gcaggtcctc tgcagtacct gaagatgact gccaagaacg gatccatacc 600 tcaggaccct ttccttctgc gtctatccgt ggaaaaatac atcggcgcgg caatagatgg 660 agcatataaa gagaaccggg gagtctccta cgttttgaag gtacgtagca actcacagtt 720 caaccgttta cttcgaatgc aaaaattgaa tgacgggaca gaaatctcaa ttaccgagca 780 ccctcagctg aatcagcgaa agtgtgtagt gtctaacgtc gactcaacgg acctaactga 840 tcagtacatc aaagaacagc tcgccagtca aggtgtgaag gacatccgcc ggattcgacg 900 tcgacaaccg gatggctcct tcataaatac accgaccata atccttacca tcagtggtac 960 cgtcatcccg gaccacatag acttcggatg gtcccgttgc cgaacccgca attactatcc 1020 tacgccaatg cagtgttacc gctgttggcg ttttggtcat accaactcca ccaataagtg 1080 ccctgaaccg ttccgcattt gcggaaaatg ctgcaaggtt catccagagg acaaggtacc 1140 aaagacagct gacccagtta tgggttattc cgctaccact gccactgata atgaaactcc 1200 gcgagaacct gctactaccg gcttgattca aacgcgttcg gaatgcgtcg acccaatctt 1260 ttgttggaat tgcaacacca atgatcaccc agcctccagc aaaaagtgcc cgatttatgt 1320 gaaggaactg gctattcagc atatccgtgt ggaccacaac ctctcttacc cccaggcccg 1380 tcgcgagtac gaagcacaga acggaatagg taatcgcacg gacactttcg ctggagcagt 1440 caacctgagc aaggacgtcg aagttgaagg gcttaagacc accgtcaaac aactccttga 1500 ggactctaag agaaaagacg aaaggattgc tgagatggag cgcgctctgg gaaatcgtag 1560 taacgtcaac gaccgccttg ataaaattcg tgaacacggc acaatcgaag agttggttcg 1620 ccaagttgct gcactcactg ctacggttga gaaactccag aaggacctag tgaaaaaaga 1680 tgcaatcatc aagaagcttc gagatggaaa ggaaaccgaa accataccca cccccttggt 1740 gatgtctacc gctgaatccg agaaggagag cgctttcact ctttcttcta acgaagatac 1800 ttccttgaaa gagtctatca ctcagaaagt gtccctatgg atcgaagaaa tgaaaagtga 1860 gatgaacgag gaagcgagcg cagaaacgaa aaaggaaatt tgcaaggaga agaagcggaa 1920 acaaaaaaaa gccaagaaac ccaaaactac cgaaaacgac ataagtgact caagtatgag 1980 gtccacccac tcccacaaat tgggtaaaac cgcaaactca agcatcaccg aacaacccaa 2040 ttcatccaaa aggacccacg atatcgaatc cattagcgac tcgaccgata catccctcaa 2100 ggccaagagg aactccaaaa tccagaaggc tgatgaggat gttatgatcg aataagctct 2160 tctccattat gctcagtcct cccatacgac cgacagcagt ctgtacgaat tattcatcaa 2220 ccccatatct tcatgccaat catcaacctc gatgacgata tcaccacgga gtcgactgag 2280 ccacaacaca acactccact catcaatgaa catacagcgg aggtctcggg tagccggggc 2340 cccgccagtg cggaagctac acccccaccg gaactgccgg accacccccg acacccgagt 2400 ccacctccag ctgtcgggac gcacaaaggt ttaaccacct gtaccccgaa taccccagag 2460 gacggtcgag accggcaagg ccccgttagt gcggaagcca aggtcgtacc ggaactggcg 2520 gacaaccctt gccatctcga tcggtctgga agagggacgc acaagggttc tacccgttcc 2580 cctttggacg ccgagggagt gaagaaacca gcacgaccac tgatggcccc agtatttatc 2640 ccataccacc ggaagagtac ccgatcaact atcagtaaac cttcggtaga gctaaagatc 2700 aacacgaata atacaccgaa ggatgacgag gagcctaaat tagacactag gagatacccc 2760 aaaaggatcc gatcagcacc tgaccggtac tgtccgtgtt taccacgtac caaaaacgcg 2820 aaccgcttac accgcgctct cacggaggca gaagcgacta atgctgatgt ttatgaccgg 2880 atggtacaaa aacgtgacgg ccggagctct ataccagtag ctggtacctc ccacctgcga 2940 ccgcgaagat tggcacacgt cccgcatcaa agagcagctg attgtgtaaa ggacctctcg 3000 ggacattcca gatcatcaaa tttgggagtc gggttatatc tgcctttggc aagtgacaag 3060 ctgacaattg cgtctagtgg agccttatct tgtccatttt cggaagattc cagtaaagat 3120 gttactcgag gacatacagt tttcgccgtg caatggaaca tgaatggtta cctaaacaac 3180 cttcccgacc tggagatgct ggtgaaccgt caccaccctg tagtaattgc attacaggag 3240 atccatcgta caaatgtcgc cacaatgaac aatactctcc gccgccagta tcgatggtat 3300 acaaaaactg gagccaatat ataccaatct gccggcatcg gcataagcgc cgagatcccg 3360 gccgaccaaa tcaacatcga aacggacctc cccattgttg gagtacgtat tccatggcct 3420 ttccctgtat cagtagtttc actctacctc cccaacggaa aacttgttaa cctgaaaaag 3480 agtttggaag aaatattccg gaaaattcct ggtccgttga tcgtcatggg ggatgtgaac 3540 ggccatcaca aggcgtgggg tagcccccac aataacaccc gcggctccat agttgcagcg 3600 cttgccaatc agctagatct tatcattcta aacgatggat caaccacttt ctcacgcggg 3660 cgatcagaaa cggtagtcga tgtctcactg gcctcagcag ctatcacaca tcgcctccta 3720 tggtcagccg aaccggacct gcgtggaagt gaccatacac caatctttct caaactagat 3780 agcaacgcta cgccagaaac aagtcgtcgc cctcgatggc tatttgaaac ggcagactgg 3840 ctgacctacc gatctgtgct cgacgaaaaa ctgaactcct ctcctccgga gtctattcaa 3900 gatttcatca atcatatcag ttctacctct tccgaaacta ttccgaggac tagtcctaac 3960 ccgggtcgcc gcgcacttca ttggtggact agtgagacct gtaaaaccgt taaattacgt 4020 agaaaaacgc tacgagtagt tcaaaaacta aggaaaaaat tatctcccga ccatcctgac 4080 cttctccaag ctctagagca gtaccacact gcccgtaaca cgtgcaggca gatcatacga 4140 gaagctaaag aaaaatcctg gacagagttt ttggatggaa taaatgagga cgagtctgct 4200 tctgaactct ggcgtcgaat caattgcttc caaggcaaga gaagatctcg agggatagcc 4260 ctcatggtcg acgactgctt gtctcgcgac ccagctgtca tcgcagatgc actggcggat 4320 tatttccatg acatttcctc ctatcaacat tatccggacg gttttcgaat agctcaccca 4380 actccggatg aagcaataag acgatttgtt gtcccgccag accgcggcca ggacttcaac 4440 gtgcctttca ccttcaatga actcgagttt gccctcaaga gggcaaaggg aaaatctgct 4500 ggacctgatg acattggcta cccgatgctg aaaaacttgc ctttgaacgg caaacttgtg 4560 ctgcttgata tgctcaatcg actatggacc accgaaacac ttccagaatc ctggaagcat 4620 agcttcatga tagcaattcc caaaaacact ggaccagcat cagacgctaa aagctatcgc 4680 cccatcgccc tcacgagttg cgttgtgaag attatggaaa gaatcgttaa ccgccgcctg 4740 gtaggtttcc tcgaggataa tcagctacta gatagccggc agcatgcgtt ccgtccaggt 4800 ttcggaacgg gaacttattt ggcttccctc ggccagattc tagacgaggc tttgaaggct 4860 ggtgaacatg tcgagctagc atctcttgat ctagccaagg cttacaatcg gacgtggacc 4920 cccggtattc tccagaagct ggtaaatatg ggaatctctg gaaacatgtt agcgtttgtt 4980 cggaactttt tggaaggacg gactttccaa gttcttgtag gaaatcatcg ttctaaaata 5040 gtccaagaag aaacaggtgt accgcagggc tccgtcctgg ctgtcaccct gtttcttgtg 5100 gccatgaatg gagttttctt ggtactaccg aagggcgtct tcatcctggt atacgctgat 5160 gacattctgc tgcttgtgag aggaaagcat ccaaaaatga tacgccgtaa gctccaggcg 5220 gctgtttcag ctgttgctaa atgggctacc caagttggtt tcgacatggc ggcggacaaa 5280 tgtgttcgcg ttcacgtgtg cgactccaaa catcgcccac ctggtaaaat aaacatccag 5340 ggccttccta taccgacaag aaactacgtc aaaatactcg gtgtcacgat agatagaaac 5400 ctatcctttc atgcacattt cacggcggtc aaatccgcct gtaagactag gatcagcatg 5460 atccaaagta tctcgggaaa gcgcacccga agcgaaagga ccacccgcaa acgagtagct 5520 gaagctgttg tcaatagccg tctgctgtac gggatagaaa tatctggaag atctttcgat 5580 gaccttgtta aaaccttatc acccacgtat aataattgtg ttcgtgcgct atccggcctt 5640 ctcccctcaa ctcctgcagt ttccgcatgc gtagaagctg gaatcctccc ctttcagtac 5700 aaggcaacca tggcattgtg ttgtcgcgca gttggctacc tggaaagaac cgtaggtaga 5760 gggcaggact gtttcctcgc ggagcaggca aaccgtgccc ttagagaagt ggccaacgcc 5820 acgcttccct cggtggtgga actccatcgt gtaggaccta gaagctggtc ggccaaggag 5880 atcacggtag acaaaacaat caaaaaccat ttccgtagaa acgccaatcc ggtagccgtt 5940 caaggttgct ttttggaacg attatctgaa gcctacccga atgccgaggt ttattataca 6000 gatggctcca aattggccaa ccgtgtggga gtcggagttt ttggctctga tacagaaata 6060 tctctccgcc taccaagtca gtgcaccgtc ttcaccgcgg agtcagcagc aatctatctg 6120 gctgcaaaga aacagaccga cagaatgaag ctgatcgtta gtgactcagc tagtgcgatt 6180 accgctatat gctctgacac gaataaacat cctttcatcc aggccgctca gaatcttctt 6240 gctaatacga aaaagcaagc gacacttatg tgggtcccca gccattgcgg cattttgggc 6300 aacgagcatg ctgatagatt ggctgctatg ggccgccagg aagctcttct aacgcccaaa 6360 gttcctgggg atgacgtcaa gtcttgggtg aaaacaatga tcaaaagtgc ctgggctctg 6420 aaatgggata gggatcggtc attgttcttg agaagaatca aggctgacac agccctttgg 6480 actgatatac cgaactggcg ggagcagaaa gtcctttctc gactccgaac agggcactcg 6540 cggctctctt acaacatgag tggaagtggt agcttcagaa agacatgcga tatatgcaaa 6600 gtacataata cgacggaaca tatcattagc tgttgtccca aatatgacct acttcgtaga 6660 cagcacgaca taacatcgac cagccgagct ttacaaaaca actctgtcta cgagagaact 6720 cttataaact tccttcggga agccggactt ttcctggaaa tctaaaaccc taagagacgg 6780 tcgaacgatt atgataaccg ctgagaacca aggacacgaa tagacaaaaa tacgctagac 6840 agcctaaatt catcatataa aaactctgta acttacactt gctagctggt tgcaccctta 6900 aggtgccttc agtattttcc ttcttctcga gatgaaccag cctagggctg aaaatctctt 6960 aaataaagac aataataata ataataataa 6990 // ID Gypsy-1_TCa-I repbase; DNA; INV; 5239 BP. XX AC chrUn_220; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_TCa_; KW Gypsy-1_TCa-LTR; Gypsy-1_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-5239 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_220; Positions 539 5777. XX CC Positions [2749-3216] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 371..1357 FT /product="Gypsy-1_TCa-I_2p" FT /translation="MPENQQLFKFEYLQILPEFTGNQALLSEFIATSEQLI FT NKFYNVQDANDFQNTLLIKSIKNKIKGEAAAHIASYQIKNWNDLKTALLAT FT YADKRDLQTLTIELCNLRQGNLKPLEFFTRIQENLNLQISYISTQVVVTAR FT DALIQNSNHLALRVFLKHLNNPLGDYLSTRQPASLGDALRILTNDFNINDK FT TKFTKPQTNHNRPNPMVYKPLPQPHQNFNRQQNFNQNRNQSNVFKPRQTQL FT DKPVPMSISTRNTVPNKPNYNFRQPQQNWYAEELHNNENEVNDQSEIQECP FT DDNNETNNGASTSFFEPNFEQYSEPFLEEAASDMLNY" FT CDS 2011..3597 FT /product="Gypsy-1_TCa-I_1p" FT /translation="MATTPDNSEPETDYSDKPIFEYIKKFNQELINKEDDI FT SSTRVEPDKDLDQNIDENDDNITIHTNEENPVCLIPISENPLNFGKNQIQF FT ISVTNNPSDPVVIKLFETSKQRITVQISRNDSKNNILNFIKNYVIPQVSYA FT CHFASDELYEITGNILREHFNESILFKRHLKILEDVESPEEQIAIIQNYHY FT GKSNHRGINETLQKISSRYYWPNMQKAIQNFINKCEICKAVKYDRKPLKLK FT FNLTPTPTKPFEIVHIDVLKYQSHKFLTIIDAFSKYAQVYLLENMQAIEIM FT KKLLIFFSHHSVPSLIVTDNGSEFDNGVITEFFRLHKIDVHFCSPHHPASN FT GLIERFHSTFLEHLNLLNNRPEFSKDPISTKILYATIAYNNSIHSTTKLTP FT FNVLNLNNEQEIGDINLEYVILNNHLQSHKEKLQIIFKEINKRLTENKKKL FT ITKLNENRENIPTKLPDTVFVKSNFRSKVRNRYKKEEILETNRDRKTIKPK FT INKSKTGRKFNKLHMDNVKRPTKQENSIAGTSS" FT CDS 3609..4868 FT /product="Gypsy-1_TCa-I_3p" FT /translation="MVTVSSDRNITNNPGILPINLGKARLQSSSHHMIHYF FT DISGLLTEYNKLQTYLEIVKNSTANETETQKEIQNYLKIIHYNINLVDEKL FT YPFFQNKRAKRGLVNILGSIIKTITGNMDSEDNERITSVINTIKQNQNNIA FT HQLTNQYSINQEIINKFEKTIKDIEHNERALYSLTSKFQNVTRNQINTLFI FT KDTLNQLSHLFNVILNIAQDIENSLAFCKLQTVHPSIITNEELFKELLKIE FT SIYKNHLPFPVKSENIQNYKRILSTKCVIKEFEIIYFLSFPLYETRELELF FT YLIPLPDQKFQTIIPSAKYVLKGESYLWPLTDICDKIQKEYFCKNELRTYV FT NISCQMEILFKNTFEKCLPIQLNRLQTLEFVPEINQYLGIFPVPVSIKNFL FT LQCFGTTPNPRCLSLRKSTKLPKIRQ" XX SQ Sequence 5239 BP; 2068 A; 1030 C; 688 G; 1453 T; 0 other; taattggcgc agtcagtagg ataggcgaaa cgtcttgtgc ccaaaaaagt gaacctccaa 60 aaatctaagt tgtggacttt caccgaaaaa ttgtgaaccc aacaagtgcg cgaggacttt 120 cacctagtgg ccgctggacc tggacaactc cacggacttc caatcaaaag acggtgcgtt 180 tcaacgcttt gacgcctatc gaaaaaacct gaagacaacg ttggcagatg gagtacagtt 240 ggtgagtcgt agatgtcatt ttgcaacaaa ttcggactcg aaaaattgca tcaatcttat 300 caaattcggg cgaaaaccga aaaattaatt gtttgaaatt tttgagtatt aagaaaaata 360 aaaattaaaa atgccagaaa atcaacaatt atttaaattt gaatatttac agattttacc 420 ggaatttacc ggaaatcaag ctttattaag cgagttcatt gctacaagtg aacagttaat 480 taataagttt tacaatgtac aagatgcaaa tgattttcaa aataccttat taataaaatc 540 gataaaaaat aaaataaaag gtgaagctgc tgctcacata gcttcgtatc aaattaaaaa 600 ctggaacgac ctcaaaactg cgttattagc aacatacgcg gacaaacgtg acttacagac 660 gttaaccatc gaattgtgta atttgcgaca aggcaacctg aaaccactag aatttttcac 720 aagaattcaa gaaaacttga atctgcaaat cagctatatc tcgacccaag tagtcgttac 780 tgccagagat gcattaattc aaaattcgaa tcatttggcc ctaagagttt ttctaaaaca 840 tttaaacaac cccttaggag attatttatc cactagacaa ccggcaagtt tgggcgacgc 900 tttacgaatt ttgaccaatg atttcaacat aaatgacaag accaaattta caaaacccca 960 aactaatcac aacagaccaa acccaatggt gtacaaaccc ttacctcaac cacaccaaaa 1020 ttttaaccgt cagcaaaatt tcaaccaaaa ccgaaatcaa tctaacgtat ttaaaccacg 1080 acaaacccaa ttagacaaac cagttccaat gagtataagt accagaaaca cagttccaaa 1140 taaaccaaat tataattttc gacaacctca gcaaaattgg tacgcggaag aattgcacaa 1200 caacgaaaat gaagtaaatg accaatctga aattcaagaa tgtcctgatg ataataatga 1260 aacaaataac ggtgcatcaa cctcattttt tgaaccaaat tttgaacaat attcagaacc 1320 ttttttagaa gaagcagcct cggacatgtt gaattattaa atattgacga caacccttca 1380 caattgcctt atatcgttat accctttcca aactgtaaat taaaatgctt aatcgacact 1440 ggttccacga aatcgtttat aaatccagaa aaagcaaaac ttcactttgc tagattaata 1500 aaaaatgacc cgtttattat ttctacagcg cacggttgta caaaagaaca atttagcgtt 1560 tccataccct gtcccaaaat ttttaaaagt aaaactaaaa tcgttaagtt ccatatattc 1620 aaatttcaca aatatttcga ttgtttactt ggcattgata atttaaaaca actccaagca 1680 aacattgacc ttaacttagg caaactcttt cttcccaata cccaaatcga tataaacttt 1740 aaaaaattat gtgattacaa taatcatatc gaagtcctgc caaattccga acaagtaata 1800 aaaattaaga ttgcaaacgt taaaaatgga actgcccttt taccaaaaca tacattttca 1860 gacctagaaa ttcacgaaag tctcattaaa gtcgaaaata atttcgcttt aacgtcgtta 1920 ataaacccta ctaacaaaca tacaaaagtc gatatctctc atccatttga agtgattcca 1980 caggaccaat ttgatatacc aaatttaaac atggctacaa ccccagacaa ttctgaacca 2040 gaaacagatt attcggataa accaatattc gaatatataa aaaaatttaa tcaagaactc 2100 attaataaag aagatgacat aagttctacc cgcgtagaac ccgataaaga tttagaccaa 2160 aatatcgatg aaaacgacga taacattaca attcatacta acgaagaaaa ccccgtctgt 2220 ttaataccta tttctgaaaa cccgttaaat tttggcaaaa atcaaattca attcatttcc 2280 gttacaaaca atccttcaga tccagtagta atcaaattat ttgaaacttc aaaacaaaga 2340 ataacagtcc aaatttcacg caatgattcg aaaaataata tcctaaactt tataaaaaat 2400 tatgttatac cccaagtatc atatgcatgt cacttcgcat cggacgaact ttatgaaata 2460 accggaaaca ttttaagaga acatttcaat gaatcaatat tatttaaaag gcaccttaaa 2520 attttagaag acgtcgaatc ccccgaagaa caaattgcta taattcaaaa ctaccattac 2580 ggtaaatcta atcatcgtgg cattaatgaa accctacaga aaatttcatc acgttactat 2640 tggcctaaca tgcaaaaagc tattcaaaat tttatcaaca aatgtgaaat atgtaaagct 2700 gtaaaatacg accgcaaacc attgaagcta aaattcaatt tgactccgac accaaccaaa 2760 ccattcgaga ttgttcatat agacgtatta aaataccaat cccacaaatt tttaacaatc 2820 attgatgctt tcagtaaata tgctcaagta tatctactcg aaaacatgca agcaatagaa 2880 ataatgaaaa aattactgat tttctttagt catcattctg tccccagttt gattgtaact 2940 gataatggat cagaattcga taacggtgtt ataacagaat ttttccgact tcacaaaatt 3000 gatgtccatt tctgtagccc tcatcaccca gcaagtaatg gacttataga gcgtttccat 3060 tcaacatttt tagaacatct aaatttactc aacaatcgtc ctgaattttc taaagaccca 3120 atctcaacaa aaattttata tgccacaatt gcctataata attccatcca cagtaccaca 3180 aaactaaccc ctttcaatgt tttgaatttg aacaatgaac aagaaattgg agatattaat 3240 ttagaatatg taattttaaa caaccattta cagtctcata aagaaaaact gcaaatcatt 3300 tttaaagaaa ttaacaaacg actaacagaa aacaaaaaaa aattaataac caaattaaat 3360 gaaaataggg aaaacatacc aactaaatta cctgacacag tcttcgtaaa aagtaatttt 3420 agatcaaaag ttcgaaaccg atataagaaa gaagaaattt tagagacaaa tcgtgaccgt 3480 aaaacgataa aacctaaaat caataaatca aaaaccggca gaaaatttaa taaactccat 3540 atggataatg ttaaacgacc aactaaacaa gaaaattcca ttgcaggtac atcttcgtaa 3600 cctggctaat ggttaccgtt tcgtcagacc gaaacattac aaataaccct ggcatccttc 3660 ccataaattt aggtaaagca aggcttcaat cttcttccca tcacatgatc cactattttg 3720 atatttccgg tcttttaaca gaatataata aattgcaaac ctacttagaa atcgtgaaaa 3780 actcaacggc taatgaaact gaaactcaaa aagaaattca aaattattta aaaattattc 3840 attataacat taatcttgta gacgaaaaac tttacccatt tttccaaaat aaacgagcaa 3900 aaagaggatt agttaacata ttaggatcaa tcataaaaac tattacagga aatatggact 3960 ccgaagataa tgaaagaata actagtgtca ttaacacaat aaaacaaaat caaaataaca 4020 tcgcacatca actgactaat caatattcaa ttaatcaaga aataattaat aaattcgaaa 4080 aaactataaa ggatatcgaa cataacgaga gagcattgta ctcgttaaca tcaaaattcc 4140 aaaacgtcac aagaaatcaa ataaacactt tatttatcaa agacacctta aaccagctta 4200 gtcatttatt caatgtaatt ttgaatattg ctcaagatat tgaaaattca ttggcatttt 4260 gtaaattaca aactgttcac ccaagcataa taacaaatga agaacttttc aaagaacttt 4320 tgaaaatcga atctatttat aaaaatcatt taccatttcc tgtaaaatct gaaaacattc 4380 aaaattataa aagaattttg tcaaccaaat gtgtcattaa agaattcgaa ataatttatt 4440 ttctttcttt tcctttgtac gaaacacgtg aattagaatt attttactta atacctcttc 4500 cagaccagaa atttcaaaca atcataccat ctgcaaaata tgttttaaaa ggcgagtcct 4560 atttgtggcc gttaacagac atctgtgaca aaatccaaaa ggaatacttc tgtaaaaatg 4620 aattaaggac ctatgttaat atttcttgtc aaatggaaat cctatttaaa aacacgttcg 4680 aaaaatgtct tccaatccag ctcaacaggc tccaaactct cgaattcgtt cccgaaatta 4740 atcagtatct aggcattttt ccggtaccag tttccatcaa aaacttcttg ctccaatgtt 4800 ttggcacaac accaaatcca aggtgtcttt ctcttcgaaa atcaactaaa ctgccaaaaa 4860 tacgtcaatg atcaacttct gatttttaat gacacaactg aaggaaaacc cataatccta 4920 gaagacttca gacttcccgc aagaaataat gtgaagattc ccaagctaac cttgaggacc 4980 cttcaagtta caagtttaac taacaacctg cctcgacttc agatgtaccc ctacgaagat 5040 agcaaaaata tttggcacct aaccggaact gctatattgt attttgtatt tctagcctta 5100 gccatctggt ttatgatgca gaagatagca tttcggcgaa gaaaccaaac ggaagaagcc 5160 cctgagccag tagaactacc ccgcgatgcc aaattttagt tcctctccgg agtttcgggc 5220 taagagggga ggaattata 5239 // ID CR1_Ele27 repbase; DNA; INV; 4880 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele27. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4880 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4880 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 11 CC sequences with >94% identity, and ~97% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 399..1181 FT /product="CR1_Ele27_1p" FT /translation="MDCAKCXMKIAGIESPVVCRGYCGSVFHMTCAGVSRA FT LTDYFXKHRKNLFWMCDKCAXLFENSHLRAITKQADEKSPLXSLTNAITNL FT QTEIQKISSKPASTMSSPNRWPMISEMTRAKKRPRGFDSSECQSGSKQLDQ FT NVVSVPVSEKPAAKFWLYLSRIRLDVTNEAIAAMVKANLELDSDPVVVKLV FT AKGADTSNMSFVSFKVGLDPSLKDXALDPSTWPEGIMFREFEDYSSQKFRR FT APTVSPGLSPLATTATITVN" FT CDS 1487..4804 FT /product="CR1_Ele27_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MVDSPVLIWAPRRALHHDPPFFNQHSPGCTCESFMED FT SNPPETVEPINEYSPYLHHPISSCQHRHPGPVFVERGGVFQTPTAGKYSNC FT SSLHLPEVLLDTHRTHFDAADANDGTPEGTQHSVEIHHSDVQCATQQHSGH FT QRGLATTPVNRQSTPVDRFLSVYYQNVRGIRTKTNTLRLMLSSCDYDVIVF FT TETWLRQDIADSEISSEYNIFRCDRSAATSEFSRGGGVLIAVKTALQCSAL FT ELSDCGQLEQMVVRIKLANRSLIIVAIYLRPNSIPASYTAHTNAIQALCDM FT VSDTDIILALGDYNLPNLQWQFDEDINGYLPTNASTEQDVILTESFSTSGL FT IQLNSISNSNGRILDLAYANTPDYIELIDPPLPLIKMDEHHKPFILRLDTR FT YTSPCFDNVPMDATAFDFKRCDTDALRYKLSSIDWIHQLSCSSVDETVCAF FT YNILHGIFDQCVPHRRLAPAIVNRKPWWTAELRQMRNRLRKTRNRYFGDRT FT DANRIALHQLETDYNDLLSSTYAVYVNNIQSNLKQNPSSFWSFVRAQKSSK FT RTPCNVEYNGQTASSAKEAANLFANFFQSVFRDSSPAARFHHLPSYDVNLP FT DFRFSAGEVLELLHCLDASKGPGVDGLPPSFIKSCAHELVTPVALIFNRSL FT TEGTFPAMWKTARMIAIHKGGSINHVENYRGISILCCLGKVFETGVRRYLY FT NAAKPFISEFQHGFVENRSTTTNLMEYVNTLFPAVESRCQVDSVYVDFSKA FT FDLVPHNLLIEKLRHLGFPNWITNWLNSYLTDRKAFVQVNQQVSTHFPIPS FT GVPQGSVLGPLIFVLFINDLADRVSSGKLFFADDLKMFRIIASALDCTALQ FT NDIDQLTDWCQENGMFVNIKKCKVITFTRCHTPICFSYKIGSASLERVESI FT RDLGVIMDSRLNFNEQVSLAVSKAFSVLGLVRRHSTFITDIYALKALYCSL FT VRSILEYAAPVWAPHQVTQILKIERVQKYFIRFALRQLPWENPSELPPYNS FT RCLLIDLETLSARRTKLQRVFIFDLLSNRIDCSALLNSXNINVPSRTLRSH FT NALVLPVSRTXYGFNNPFYVCIRAFNTVFHLFDFDISKNVFVSRIKNIA" XX SQ Sequence 4880 BP; 1307 A; 1241 C; 996 G; 1298 T; 38 other; aaaatcwtct ggcatcactg attgcttwca atcgtgttta ctttgaagcg gagtwtttgk 60 ttacgttttt atcgttttgt gaagtgtgta aattcgtgat ccgtcgctca acccktttat 120 gwatgtgaca amattgaacc gtggctggaa tattgkacak atactgtgcc ktagtgswaa 180 awaagtgttg tgattcctgg tcgagtgctg acaaaaacac tccccttacc tgccccccat 240 cgcaattttt tcgtgatcag cgacatcttc cggaaaatat tgggactama aattgctwgc 300 aataatccaa tccaagtgct attttcacca ccgtcaccma scttcgcatc gttcctcgcg 360 cagcctactt aaaggcgcat tcgatcaagt tcttaaccat ggactgcgcc aaatgcwcca 420 tgaaaattgc gggaatcgaa tcacccgttg tctgtcgtgg ctactgcggc agcgttttcc 480 acatgacttg tgctggtgtk tcacgtgctc ttacggacta ctttkcgaaa catcggaaga 540 atctcttctg gatgtgtgac aaatgtgcag awcttttcga gaattcacat ctgcgggcca 600 taacgaaaca agcagacgag aaatctccgt taatktcatt gaccaatgcc atcacgaatc 660 ttcaaacgga aatacagaag atttcgtcga aacctgcatc aacgatgtca tctccgaacc 720 gttggccaat gatwagcgaa atgacacgag ccaaaaaacg acctcgagga ttcgactckt 780 cagaatgtca atctggttca aaacagctgg atcagaatgt agtatccgta cccgtaagcg 840 aaaaaccagc tgctaaattt tggctatatt tatcccgtat cagactagat gtgacgaacg 900 aagcgatcgc cgctatggtg aaagcaaatt tggaacttga cagcgatcct gtcgttgtta 960 aattagtagc caaaggggcg gacacaagca acatgtcttt tgtgtccttc aaagttggcc 1020 ttgatccttc attgaaagat awcgcgcttg atccttccac ttggccagag ggaatcatgt 1080 ttcgtgagtt cgaggactac agttcwcaaa aatttcggag agcgccaaca gtaagccctg 1140 gattgtcwcc gttagcwaca accgcgacga taactgtmaa ttgaatgctc atctaccttc 1200 gccgggatgc acctgcgaaa gctgtaagga aggccttatc ccacccgaaa cagtcgcacc 1260 attcaccatt caccatcaac gactatatcc gtacaactta gtcagttcct gccatcaaag 1320 tcatcccggt cctgttttcg cggaaaaggg agaggccttc cggactcccc tcgcaggcaa 1380 gtatctaaat tatgcatctc ccatcgtacc tgaagtattc tcggatttca gtaatgctga 1440 ctgtgaaaga gcctatttcg gtggccaacc tgttcttgga aacctgatgg tcgattcccc 1500 cgttcttatt tgggcccccc gacgtgcact acatcatgat ccaccgttct tcaatcaaca 1560 ttcgccggga tgcacctgcg aaagcttcat ggaagactcc aacccacccg aaacagtcga 1620 gccaatcaac gaatattcgc cttatttgca tcacccgatc agttcctgcc aacatagaca 1680 tcccggtcct gttttcgtgg aaaggggagg ggtcttccaa actccaaccg caggcaagta 1740 ttctaattgc tcatcccttc atctgcctga agtgcttttg gatacccaca gaacacattt 1800 cgatgctgct gatgcaaatg acggaacacc tgaaggcacg cagcattccg tggaaatcca 1860 ccattctgac gtacagtgcg caacacagca acattctggt catcagcggg gattagctac 1920 tactcccgta aacagacaat caacgccagt cgatcgcttt ctgagcgtat actaccagaa 1980 cgttagggga attcgtacta aaaccaatac gcttcgcctt atgctaagca gctgtgacta 2040 cgatgtcatc gtcttcaccg aaacgtggct tcggcaggat atcgccgata gcgaaatctc 2100 ctctgaatat aacatcttca gatgcgatcg tagcgcggca acaagcgaat tttcgagggg 2160 tggcggtgta ctaatcgctg tcaagactgc tcttcaatgt agtgcacttg aactaagtga 2220 ctgcggtcaa ctcgaacaga tggtagttcg catcaagctt gccaaccgat cactcatcat 2280 cgtcgctata taccttcgac caaactccat tccagctagt tacaccgctc acaccaacgc 2340 catccaagct ctttgcgata tggtttcaga caccgacatc atactagctc taggcgatta 2400 caatcttcct aacctkcagt ggcagtttga cgaagacatc aatggctatt taccgacgaa 2460 tgcttctacc gaacaagacg ttattttgac cgaatctttt tccactagtg gcctcattca 2520 gttgaactct ataagcaact ctaatggtag aattctagac ttggcatacg caaacactcc 2580 tgactacatc gaactcatcg acccaccttt gcccctgatt aaaatggacg agcatcacaa 2640 accgtttatt ttaagacttg acacacggta cacttcacca tgttttgaca acgtcccaat 2700 ggatgccacg gctttcgact tcaagcgatg tgatactgat gcgcttcgct ataaactctc 2760 ttcaatcgat tggatccatc aactcagctg ttcttctgtg gacgaaacgg tgtgtgcttt 2820 ctacaatatt ttgcatggca tattcgatca atgcgtgcct catcgtcggc ttgctccagc 2880 aatcgtcaat cgcaaacctt ggtggacagc tgagctccgt caaatgcgta acaggttacg 2940 aaaaactcgg aatcgatact tcggcgatcg aactgatgca aatcgaattg cacttcacca 3000 gctggaaact gactacaacg acttgctttc atcgacctat gcagtatatg taaacaacat 3060 ccaatccaac ttgaagcaga atccatcctc gttttggtca ttcgtccgag cacagaagtc 3120 atcaaaacgc actccatgca acgtcgaata caacggacaa acagcaagct cagccaaaga 3180 ggcggcaaac ctatttgcca actttttcca aagcgtgttt cgagattctt ctccagcagc 3240 cagattccac catcttccat cgtatgacgt caacttgccg gatttccggt tctccgccgg 3300 ggaagtctta gaactgctgc actgtctgga tgcatccaag ggaccagggg tagacggact 3360 accaccatcg tttataaaga gttgtgcaca cgagctggtc actccagtag cgctgatctt 3420 caatcgctct ctgacagagg gtacattccc ggctatgtgg aaaactgcaa gaatgatagc 3480 aatccacaaa ggaggatcca tcaatcatgt cgagaactac agaggcattt cgatactttg 3540 ctgccttggt aaagtgttcg agaccggtgt tcgtagatat ttatacaatg ccgcaaaacc 3600 cttcatcagc gaattccagc atggcttcgt cgaaaaccgg tcaacaacga caaatttgat 3660 ggaatatgtc aacactctgt ttcctgcagt ggaatcacga tgccaagttg actcggtgta 3720 cgtggacttt tctaaggcct ttgacctggt tccgcataac cttcttattg aaaaactgcg 3780 tcatttgggc ttcccgaatt ggatcactaa ctggcttaat tcttatctca cagaccgcaa 3840 agcgtttgtc caagttaacc agcaagtttc gactcatttc cccatccctt ccggagttcc 3900 gcaggggagt gtgcttggtc ctctaatatt cgtcctgttt atcaacgatc tagcggaccg 3960 agtgtcttca ggtaaattat tctttgccga tgacctcaaa atgttccgga ttatcgcttc 4020 cgccttagac tgtacagcct tgcaaaatga tatcgatcaa ctcactgact ggtgccaaga 4080 aaacgggatg ttcgtcaaca taaaaaagtg caaagtaatc actttcactc gatgtcatac 4140 tccgatttgt ttttcataca aaatcggttc agcctcactt gaacgggttg agtccattcg 4200 wgacctgggw gtgataatgg atagcagatt gaatttcaac gaacaagtct cattggcggt 4260 atccaaggcg ttttctgtgc ttggtctcgt tcgccgtcat tcaacgttca tcaccgatat 4320 ctacgctcta aaagccctgt actgttctct tgttagaagt atactggaat atgccgcccc 4380 agtgtgggcc ccgcaccaag taacacagat tttgaaaatc gagcgtgttc aaaagtattt 4440 cattaggttt gctctccgtc aactcccctg ggaaaaccct tcagagctgc ccccatacaa 4500 ctcgcgctgc ctgttaatcg acctggagac gctttccgcc aggcgcacaa agctgcaacg 4560 cgttttcatw tttgatttgc tctcaaatcg cattgactgt tcwgctctgc ttaacagtmt 4620 caatataaat gtcccttctc gcaccctccg aagccataat gcattggttc tcccwgttag 4680 taggacgamk tacgggttca ataatccatt ttatgtttgt attcgtgctt tcaacactgt 4740 atttcatctg tttgactttg atataagcaa aaatgtattt gttagtagga ttaagaatat 4800 agcctagaat agcgtacagt ctgtacggct aaggccgaag acggagaata aataaataaa 4860 taaataaata aataaataaa 4880 // ID Crypton-1_NVi repbase; DNA; INV; 1955 BP. XX AC . XX DT 17-FEB-2009 (Rel. 14.02, Created) DT 26-MAY-2009 (Rel. 14.06, Last updated, Version 3) XX DE Putative Crypton-type transposon. XX KW Crypton; DNA transposon; Transposable Element; Nonautonomous; KW Crypton-1_NVi. XX NM Crypton-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1955 RA Jurka J.; RT "First Cryptons from insects."; RL Repbase Reports 9(2), 480-480 (2009). XX RN [2] RP 1-1955 RA Bao W. and Jurka J.; RT "Crypton in Nasonia vitripennis."; RL Direct Submission to Repbase Update (26-MAY-2009)The full length RL of this element.. XX DR [1] (Consensus) XX CC The 3'-end is ~120 bp palindrome structure. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(156..656,637..789,740..1492) FT /product="Crypton-1_NVi_1p" FT /translation="MIAKLCIISARRLVYNLLVPNRKVNMSDNSSCDTDDI FT SDYGNEIPPEIAAAAKEVTLDLLPDKSKRLYVTTTLSLGVQQKKTTMFVEE FT IFLAYFKELAMTLSPPTLWNRYSMLKTTVKTFDHIDISQYHQLLAYIKKKM FT LNIRKKNPRLSLRKTLVDFVMMLRTKHICSGRSIFSEKSNLFIKNLKLYFI FT FQQFTRIDFVVGCSYFWNSRCVEKRRINVVLIFGILGALRRGELTNITTDD FT IEDDSTRLLIKIPVTKNNVPRSFAVRGEFYQICKNYMNLRPTEIDKHKRFF FT IHYIDGKCTRQPIGLNTIGQMPKQIADWLKLPNPHLYTGHSFRRTSATLLV FT DGGGNLTDLKRHGGWKSSTVAEGYINESMNHKEQIHKKITKSIVLKPTEVV FT VADNHIPQCDNQVASTSGAEITVDQPKKSVTVSLKRTRFDSEGRDDDEPHA FT EDEFYRLTETQQATCGSPLKKRR*" XX SQ Sequence 1955 BP; 676 A; 335 C; 348 G; 596 T; 0 other; tttttcataa cagcgtgcgc ggagctcgac tttagagtca tttgttttgg cgctaaagtc 60 atcgacttag gcacgctctc tcacagcgtg cgcgggaaca tttgaagcgt gcgtggggaa 120 acctatcctg tcgctgtcat ttgttttaca tatgtatgat tgcgaaattg tgtatcatta 180 gtgcaagacg tcttgtttat aacctattgg tacctaaccg aaaagtaaac atgagtgata 240 acagttcttg tgatacagac gatatatccg attatggaaa tgaaattcca cccgaaatcg 300 ctgcagcagc taaagaagta actttagatt tattgccgga caaatcaaaa cgtctatatg 360 ttacaacaac tttaagtctt ggcgttcaac aaaaaaaaac cacaatgttc gttgaagaaa 420 tatttttagc ttatttcaaa gaattggcaa tgacactttc tcctccgaca ctatggaacc 480 gttattcaat gttgaagact accgtaaaaa cttttgatca tattgatata agccaatatc 540 atcaactttt agcttacatc aagaaaaaaa tgctaaatat tcgaaaaaaa aatccaagac 600 tttcactgcg gaaaacgtta gtagattttg taatgatgct ccggacgaag catatttagt 660 gaaaaaagta atttatttat aaaaaattta aaattgtatt tcatttttca acaatttacg 720 agaattgatt tcgttgtagg ttgttcttat ttttggaatt ctcggtgcgt tgagaagagg 780 agaattaact aatataacaa ccgatgacat tgaagatgat tcgacgagat tattaataaa 840 aattccagta acgaagaata atgtccctcg ttcttttgca gttcgtggag aattctatca 900 aatatgtaaa aattacatga atcttagacc aactgaaatc gataaacaca aaagattctt 960 catacattat atagacggaa aatgcacacg acagccgatt ggtctcaata caataggtca 1020 aatgccaaag caaatagctg attggttaaa attaccaaat cctcacttat acactggtca 1080 tagtttccga agaacttcag caactttgct agtcgatgga ggaggtaatc ttactgactt 1140 aaaaagacat ggcggttgga aatcgtctac agtagctgag ggctacatta atgaatctat 1200 gaatcataag gaacaaattc ataaaaaaat aactaaatcg atcgtcctaa aacctactga 1260 agttgtagtg gcagataacc atataccgca gtgtgacaat caagtagcat ctacttctgg 1320 agcagaaatt acagttgatc aacccaaaaa atctgtaaca gtttcgttaa aaagaaccag 1380 atttgattca gaaggtcgag atgatgatga accacacgct gaggatgaat tttatcgatt 1440 aaccgaaaca cagcaagcta catgtggttc accgttgaaa aaacgtagat aaaactaata 1500 attgtaatgt tacaattaat gtacataatc atcataatta actacaatgt acgatacact 1560 atatttgtgt tttagtttaa tattttttga taaaagatgt aataattttc tattaaaaat 1620 aattcatttt taaatttgca aaaatatttt agtttagtat aattacactc gtatatagcg 1680 acaatgtgtg atttaacgaa ctttttgtat gaataagctt tcacaggagc gtggcctatt 1740 tcgcattatg aaataaattt aatttgtata atatactaat ttagtataat gtcgagctcc 1800 gcgacacttg ttatgaaaaa tagtatacgc aactcgtgcg agggaaacac gacttacgca 1860 ctcgtggtag ttcagcgccc tcgcttcgct cgggcgcgaa ctaccgcgcg tgcgtaaatc 1920 gtgttcccct cgcactagtt gcgtaatgta ctatt 1955 // ID Kiri-9_CQ repbase; DNA; INV; 4506 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4506 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 128-128 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 248..982 FT /product="Kiri-9_CQ_1p" FT /translation="MSFKRSREDLTKSDSEDEPVQSLDQLFSRMKSMFEET FT NERIECCKTDLQSEFAVLRDDMQQFKAECTNNVNKLSAELSKTQDNVSLNY FT ERILICGKLNDLLLSGVPYQSSENISNYVRSVSLALGYSDQDRPLIYTKRL FT ARLPIADGAAPPLLLQFTFRAARDEFYRRYLSSRNLSLSHLGFTINKRIYL FT NENLTELARAIKGSALKLKKDGKLHSVFTKDGFVQVKTRAEDEARPVLSME FT QLTN" FT CDS 1546..4377 FT /product="Kiri-9_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MLANNGLAPNTNQSSSIPRIVMNCAMESDKLNICHLN FT VQSLCARGLSKLEEFKQCFTNSKANIICVTESWLTEKIEDELISLDGYKLV FT RHDRNMQTRGGGICVYFKTGINCKILSCSKNESNNITEFMLFEMEIAGEKV FT LLGVFYNPPRVDCATVLSQKLADHSLHYNKIIFIGDFNTDIRKSCTLTDNF FT LNVTDTYGLSCINTLPTHFHYGGSSMIDLLLTNDPDFVLKFNQVSASCFSR FT HDIIFSTLNLNTSNTSNFSLFRNYNKINFQALDHAVASINWNLLYSITDSE FT IALDFFNVNIKQLFENFVPLCKYIPRKNPWFNNDILKAMIERDLAYGLWKA FT DRSTLRFDEFKRLRNRVTNLINIAKSKYVANRLNTAESSKVLWKKLKVINV FT KSKQPRVEQFLNSNNEINYHFSSNFTQDNTPPHTLLPNNNGFKFSTVDIND FT VILALSSMKSDAIGLDQIPLKFIKLILPLIVDKITYIFNLAISTSKFPRAW FT KSAKIIPVKKKPRSLALDNLRPISILPALSKAFEKILKAQIQTYLNNFDYL FT HRFQSGFRSKHSTTTALLKVHDDILKVIDRKGVALLLFIDFSKAFDRISHS FT KLINKLSLKYNFSIEAAKLIQSYLTGRSQSVLSGGTLSIPIDIISGVPQGS FT ILGPILFSLFINDLPTVLKNCMIHIFADDVQLYLASNSXPVTEMARLLNDD FT LCEILRWSTRNLLPINPSKTKVMFISRNRGVSTLPEIIFEQNVIEYVTEAS FT NLGIIMQNNLEWDCHINGQCRKIYNGLRHLRMTSSMLPIQTKLKLFKSLLL FT PHFMYGDVLFLNAAANSINKLRVALNACVRYVYNLSRYDRVSHLQHNLLGC FT SFNNFYELRSCLTLFKIINTSTPRYLVDKLLNSRSTRTRNYIIPQHRSSHY FT NESFFVRGVANWNQLPVSIKNIHSPIEFRKECINWFGMRN" XX SQ Sequence 4506 BP; 1367 A; 895 C; 783 G; 1458 T; 3 other; tttctgaccg tatcgtgttt tggagttgtg amgttctatc cgaagcagct gtggttcctg 60 gagtacgtgc agatcgtcta ctwgttgtgg agttgtgggc gataaagttt tgccgaaacg 120 agccatcatc ttctacccag gccacgccat tttgctacga taaatctcct agtggcactc 180 acggttcatc tctcgttcac ccgctgatac acacactact ttgatcagaa tcaaaacaaa 240 accaacgatg agcttcaagc gatcgaggga agacttgacc aaatcagact cggaggatga 300 gccggttcaa tccctggacc agctcttttc tcgaatgaaa tctatgtttg aggaaaccaa 360 cgagaggatt gaatgctgta agacggatct gcagtctgaa tttgctgttc ttcgcgacga 420 catgcagcaa ttcaaagccg aatgcaccaa caatgtcaac aagctgtctg ctgaactatc 480 caaaacccag gacaacgtca gcctcaacta tgaacggatc ctcatttgcg gtaaactaaa 540 cgatctgctg ctgtctggtg tgccgtacca gtcttcggaa aacatctcaa attacgttag 600 gagtgtttcg cttgccctgg gttacagcga tcaagatcgt ccgttgatat acacaaaaag 660 gctggctcgt cttccaatcg ctgacggcgc tgctcccccg ttgctgctcc agtttacctt 720 cagagctgca cgagatgaat tttaccgacg atatctctca tcgcgcaacc tgtcgctgtc 780 acacctgggc ttcacgatca acaagcggat ctacctaaac gaaaatctga cggaactggc 840 aagagcaatc aaagggagtg ctcttaagct gaagaaggat ggcaaactgc acagtgtctt 900 caccaaggat ggattcgtgc aagtgaaaac ccgtgcggag gatgaagctc ggccagtact 960 ctcaatggag cagctgacca attaaccctt tccttaaagc gctctctctt ccttcctaca 1020 acatccatga ctcccatcct taatcccggt gttttcatcc ttcctgaaag ttaaacctct 1080 cctgaacaag cttttctttt ttccttccaa gttaatccgt gactcctatc cttagttagt 1140 ccatgactcc ttccttcctg caagtcttgc tgtactatct ctggatgctg cgaacgtgct 1200 gttggtgtgc tgggaacgct gctgctgctg ttttggatgc tgttgctgta cgagtgctgt 1260 tgaaccctgc tgctgcttgc ggcggttttt ttgcacttct gacctgctga ggactgccct 1320 taagtataat tcttccgtct atccctttca acactaccct tctctattct accctattca 1380 ttttgcgcta ttcttcatga atcacaccct gttagaagtt gtaataattg gcttaaatgg 1440 tgattataat aatgattgac gatcactctt tattgttttg ctttgttaat ttcaaaatgc 1500 caacctggaa tatgcattta ccgttaattc cgtcatttgt tcgcgatgtt ggctaataat 1560 ggtttagccc cgaatacaaa tcaaagctcg tccattcccc gtattgtgat gaactgtgca 1620 atggaatctg ataagctcaa catatgtcat ttaaatgtcc aaagtttatg tgcgagaggt 1680 ttgagcaagc ttgaagaatt taagcaatgt tttactaata gtaaagcgaa cataatttgt 1740 gttactgaat cttggcttac tgaaaaaatt gaagatgaat taatatccct tgatggatac 1800 aaattagtga gacatgatag aaacatgcaa actcgtggag gtggtatttg tgtgtatttt 1860 aaaacgggta ttaactgcaa aattttgtca tgttccaaaa atgaatctaa taatataact 1920 gaatttatgc tttttgagat ggaaattgca ggtgaaaaag ttttacttgg tgttttctat 1980 aatcccccga gagttgattg tgcaactgtt ctttctcaaa agctagcaga tcattctcta 2040 cattacaata aaataatttt tataggtgat ttcaatactg atatcagaaa atcttgtact 2100 cttacagaca attttctaaa tgttactgat acctatggat taagttgcat aaatactcta 2160 cctactcatt ttcattatgg aggaagttca atgattgatc tccttctaac aaatgatcca 2220 gattttgttc taaaattcaa tcaagtttct gcttcgtgtt tttcgagaca tgacataata 2280 ttctcaactt taaatctgaa cacatcaaat acatcaaact ttagtttatt tagaaattat 2340 aacaaaatta attttcaagc cttagatcat gctgtggctt ctataaattg gaatctattg 2400 tattcaatca ctgattctga gattgctctt gatttcttta atgtaaatat taaacaactt 2460 tttgaaaatt ttgttccttt atgtaaatat ataccaagaa aaaacccatg gttcaacaat 2520 gacatactca aagctatgat tgaaagggat ctggcttatg gcttatggaa agcggataga 2580 agtactttaa ggtttgatga gtttaaaagg cttcggaata gggttactaa cttaatcaat 2640 attgccaaat caaaatacgt cgctaatcga ttaaacactg cagaatccag taaagttctc 2700 tggaagaaac ttaaagttat taatgtcaaa agtaaacaac ctcgtgtaga gcaatttttg 2760 aattcaaaca atgaaataaa ttaccatttt agttctaatt ttacgcaaga taatacgcct 2820 cctcatacct tgctgccaaa taacaatggc tttaaatttt caacggtaga tataaatgat 2880 gttatcctag cgcttagttc tatgaagtcc gatgctatcg gtcttgacca aataccactt 2940 aaatttatta agctaatatt acctttaatt gtagataaaa taacatatat ttttaattta 3000 gccatttcaa cttcaaaatt tccaagagct tggaagtcag caaaaattat accagttaaa 3060 aagaaaccaa gaagtctggc cttggacaat ttgcgcccca ttagtatttt acctgctctt 3120 tcaaaagctt ttgaaaaaat tttgaaagct caaatacaaa cttatctaaa taactttgat 3180 tatttacata gatttcaatc tggatttaga tctaaacaca gtacaactac agcgttactt 3240 aaagtgcatg atgatatact aaaagttata gatcgcaaag gagtagcact tttattgttt 3300 atagattttt ccaaagcgtt tgatcgcatt tctcactcta aattaattaa caaactatca 3360 ttaaaatata acttttctat agaagctgct aaactaattc agtcttattt aacaggaaga 3420 tctcaatctg ttttgtctgg cggcacttta tctattccca ttgatataat ttctggagta 3480 cctcaaggtt caattcttgg tccaatttta ttctcacttt ttataaacga tcttccaacc 3540 gttctcaaga attgcatgat tcatatattt gctgacgacg ttcaactgta tttagcttca 3600 aattctckgc ccgtcactga aatggcaagg cttttaaatg atgatttgtg tgaaattctc 3660 cgctggtcta cccgtaattt gttgccgatc aatccttcta aaaccaaagt catgttcatt 3720 tctcggaaca gaggagtttc taccctacct gagattattt ttgaacaaaa tgttattgaa 3780 tatgtaactg aggcgtctaa tcttggaata ataatgcaaa ataatcttga atgggattgt 3840 catataaatg ggcaatgtag aaaaatctac aacggcctca ggcatctaag aatgacttca 3900 agtatgttgc caatacaaac aaagctcaaa ctgttcaaat ctctcctact tccgcatttt 3960 atgtatggtg atgtactttt tcttaatgct gcagctaatt ctattaataa attacgtgtt 4020 gccttaaacg cgtgtgtacg ttatgtctat aatctatcaa gatatgacag ggtttctcac 4080 ctccagcata atttacttgg ctgttctttt aataactttt atgagctcag gtcatgtctt 4140 actctgttta aaataattaa tacttctacc cctagatatc tagttgataa acttttaaat 4200 tctcgcagta cacgaacaag aaattatata ataccacagc acagatcatc acattacaat 4260 gaaagctttt ttgttagggg cgttgctaat tggaaccagc ttcctgtctc cattaaaaat 4320 atacattcac cgattgagtt ccgaaaagaa tgcataaact ggtttggcat gagaaattaa 4380 caagcgctgt tagtggttaa gttggcaaag ccatataata agaagaagaa ctacacactc 4440 tcgaactcta attgtagtaa ttgaaaagga gcagtcctta ctctacatgt ataaataaat 4500 aatgaa 4506 // ID Mariner-7_HM repbase; DNA; INV; 3127 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3127 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 224-224 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(793..1083,1234..2637) FT /product="Mariner-7_HM_1p" FT /translation="MKPTFLTTIKKRMLWTXNNMKLAVRAVLXGMSQRKAS FT LLYKVPRSTLQTMISGKSKIGAKPGKKPMLGNLEEKLIDYAGNRAQMGIGF FT GKNIYHLCCRKVSKYFTSLEVQCYKISNXKNLLDNKPECMWNMDETGMQLE FT HKPRHVVARKGSKYIQSRTSGNKETITVICCINAAGQVIPPHIIGKGKTVR FT TLYGFDTENAPSGATWSVSEKGWTKRGIAELWFEKTFLPNIGSARPQILIL FT DGHDSHNFVEMIELAIVNQIEIVELPAHTSNWLQPCDRTIFKPLKTAYSEV FT CQTMMNDYPGVVVSHSNFCGLFSKAWKYAMTDANIRSGFRACGIYPFNPDA FT IPNEAYIPNILYSHISDTVPVQNVCDMSILNNLSLLPDLSDGSYTINSQSL FT TKPIEENELIDFFIQGNTVXYSNIITEHSYAKIKPVXDNQFIDFSVQDNTL FT GLNSDSEIILEEVVVTESTLDSNDKKKVFCKTLTDLPFPFTNSSCPSDKDS FT DVLTYPSPQIKQNKKNKKNSNLKFFVLTSEEAYNAKLEDRAAKVASEKRKI FT EKQKDFKRKKKKFLLKRS" XX SQ Sequence 3127 BP; 1141 A; 453 C; 489 G; 1035 T; 9 other; gggtaaatac gccaattact ggccattggc cacttttaaa agcttttctc acaagaatct 60 atggcgcaag ttgtactaaa ttattattta aactcaatga ctttattaaa actttctaaa 120 tttcgcaatt tcaggaaatt tctagaaggt tttaattatg tgaaaaacat catattaaat 180 gttactattt tacgctttgt tttcgtaaat tcatatttta ttgtcatttt ttatgcgcga 240 caagtatttg tggcagtcca ttcaggaaca aaaaaaagat aaatatctta tgtacacttt 300 gaaacaaaat tatttagttg atgcctcaca tataatgttt taactgaact agattaaacc 360 tttcctaatg caccatttac tggccggtta tttaatccat ttactggcca tgttcaataa 420 atggtatcag atataaaatt atttagttaa atatttttat ataagatttt ctcacacatt 480 aattattgtg aaaaaagtta agatactaat tcgatagtta actggtaaac aacatccaca 540 ttttaaatta atttctgatt ttctgagttc attctacttt tcaagtagaa tgagcttaga 600 aaattatagt taattataat tacagttaaa tatgttctga taattttgat ttaataaaaa 660 ycattaataa aagaaaaaag tttttctaaa agtattttat gaatacatgt ttttattttt 720 tagtatataa taatattaaa tatgagaaaa caaatatgaa aaaataattt gaaaatcaaa 780 tcaaatggta tcatgaaacc aactttttta acaacaataa aaaaacgcat gctgtggact 840 ragaataaca tgaaactagc agtacgtgca gttttawcag gtatgtctca acggaaagca 900 agtttattat ataaagttcc acgatcgact ctgcaaacca tgatctctgg aaagtccaaa 960 ataggagcaa aaccagggaa aaaaccaatg ctgggtaatc ttgaagaaaa actaattgat 1020 tatgctggca atcgtgccca aatgggaatt ggttttggaa aaaacattta tcatttatgc 1080 tgctaaactt gctaaaaaay ataacataag ttttaaaaat ggtctaccta gtgaaaagtg 1140 gtggcagcta cttaaaaaga gacatggttg tgtgagtctt cgtcgtccag aagctacagc 1200 ttctatacga cacatgtgta tggatagggt taaagaaagg tttcaaaata ttttacttct 1260 ttagaagttc aatgttacaa aatctcaaat yataagaatt tactggacaa caaaccagaa 1320 tgtatgtgga acatggacga aacagggatg caactagagc ataaaccaag acatgttgtt 1380 gccaggaaag gctcaaaata catccagagt cgaacaagtg gcaacaaaga gacaattact 1440 gttatatgtt gcataaatgc agctggtcaa gtcataccac ctcatataat agggaaagga 1500 aaaactgtac gtacccttta tggttttgat acagaaaatg ctccaagtgg agcaacatgg 1560 agtgtgtctg aaaagggctg gacaaaacga ggtattgcgg agttatggtt tgagaaaaca 1620 tttcttccaa atattggctc agcaagacca cagattttaa tcctagatgg acatgactca 1680 cacaattttg ttgaaatgat tgagctggct attgtaaacc aaattgaaat tgtcgagctt 1740 ccagctcaca ccagcaactg gttgcagcct tgtgacagaa caatttttaa gccactgaaa 1800 actgcttatt ctgaggtctg ccaaacaatg atgaacgact acccaggtgt agtagtgtcg 1860 cattctaact tttgtggttt attttcaaaa gcatggaaat atgcaatgac tgatgcaaat 1920 atacgctcag gatttcgagc ttgtggaatt tatccgttta atccagatgc tattcctaat 1980 gaggcataca tcccaaatat tttatatagt catatttcag atactgtccc agttcaaaat 2040 gtatgtgaca tgtcaatatt aaacaacctt tcattattgc ctgatctatc agatggcagc 2100 tatacaatca actcacaatc tttaaccaaa ccaatagaag aaaatgagtt aattgatttt 2160 ttcattcaag gcaatactgt gmaatattca aatattataa cagaacacag ctatgcaaaa 2220 atcaaaccag tacragacaa tcagtttatt gatttttctg ttcaagacaa tactctgggt 2280 ttaaatagtg attcagaaat aattcttgaa gaagtagtag taactgaatc aacattagat 2340 tcaaatgata aaaaaaaagt tttttgcaaa acattaacag atcttccatt tccatttact 2400 aattcatctt gtccatcaga caaagattca gatgttctta catatccttc tccacaaatt 2460 aaacaaaaca aaaaaaacaa aaaaaactct aatttaaagt tttttgttct gacctcagag 2520 gaggcttata atgcaaagct tgaagatagg gctgcaaagg tagctagtga aaagagaaaa 2580 attgaaaaac aaaaagattt taaaagaaaa aaaaagaaat ttctgctaaa aagaagctag 2640 atcatatatc aaacttraag aaattgactt gaagaaatta aaagatgaat gaaaaaaaaa 2700 atgatatttt taataatatt tttwttatat tttatatatt atatatttta ttcagttatt 2760 tttagttttt tgtggagttt cttaattcac tttaatcagt gtaaaattaa tttagttgaa 2820 caagttatct tgattttatg ctgtgctttt tttgtttggt tttattattt ttctgttcaa 2880 tgtgtttcct aatttctggc caacggccaa taaatggtta agatggccag tatttggtaa 2940 aattgtttac ctttattatc tgcgttatat ataattttgt gctattataa tcacaaaaaa 3000 ttttacttaa aatgacagag tttttaattc tctaaaaatt gatatatgat atgtaacatt 3060 tacttgaata tttttctgta acgggtgttt cttgtaaaat tgtggccagt aattggcgta 3120 tttaccc 3127 // ID CR1-12_HM repbase; DNA; INV; 3672 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3672 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1840-1840 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 461..3256 FT /product="CR1-12_HM_1p" FT /translation="MEGNIESKTFENISFCAYNCQSFKSNSIYISELVKSF FT DIIFLSEHWLLNIELFVLKNIALPTHKVFFHAAEKKAFGRPFGGNAFIIKK FT DVLLSPHIIYEDDNILAIKGKNKCHNLVFIGIYLTSCRNNEESLSKYEQQL FT NLITSIVKNYEDVSECIIAGDFQSYPETLYDSEIRNNKKRNKFSNHLTNFI FT KSNKLSFIDIISGAGPTYTYQHKTLSNSSYIDHIAALKESSLSFINTSVIP FT PSPFNFSDHLPIATNIVVTHTNLSTVINSIMHKYNYIPKYAWNNEKFINIY FT NHNLSVAFNYYTFNEEEIEQELKQTCEKIQNAALLAVKQCFKQQICKYSKS FT WWTTDLKKCKENLSFNYKQWQKSNYNKTQECIEYKRYIFSRKNFRKAVKAA FT HNKKIHEKTINIENLRTTNPQRFWDNIRKIKTKATTRLFTINKSQNIKDIV FT QEFRQHFQTLLNTPLITNNNHNPSQIPPLSKEPNSLIITSADIINCISQLN FT SNKSPDSCGLSAEHLKNSHNNNLLLWLAKFYNSILSNGKVPNDLSTSTIIP FT LVKSYKKSLNNPDNYRGISILPIFTKLLEYLILLICPSITDSHSHQFGFKK FT NSSTLHAEFLLSETVKHYKNNNTPLYMCSLDANKAFDTCNWDLLFEKLYFQ FT KKLPLSVVHTISSLYHSGTANVSYQGVTSNQFSLSQGVRQGSILSPHLYNI FT YTESILNEIVSECKVGSTINGIYTGVIAYADDVILLSPTISGLQELINRFQ FT TYGVANFIKLNTEKTEFLISGKSYIXNNVIKINGVSIYLQNKLKHLGFQWD FT NDTSKNKIATLQKTNLNERITQFQSVAKALIDSGIRYCQPSTIVHLYNTLT FT VPKLIYGLELCKYDTNFLSKIDAVGRAVLKSFFCISKYSKNYLHSFFKIED FT ISTILLRNKVNLFIRLLKNQTCLXTDN*" XX SQ Sequence 3672 BP; 1391 A; 661 C; 501 G; 1115 T; 4 other; aaaggtctat aatgaagcct attttcggaa aaaaagttgc atttaacaga aactctattg 60 caggcgaacg tacggtgcga gaatcttata tctttgtagg aggggtctgt aactcaatgt 120 cggaagaaga tctatctaac cacatcaaaa atgaaatagg tattgagcct attaaggtag 180 agcttaaccg ggaaaataaa tacaatcgtt cattcaaagt tatcataaaa agtactgaaa 240 aagacataat cctaaaccct gatatttggg atactaatat tatcgttaaa ccattcaggc 300 ttcatcgtgt tacctcaaga aacgaaacac cacagcagag tgcagttaat acgctacact 360 ctgtacaacc aactcaacaa atattttcta catctaaaga acaaccatcc tttgcacagc 420 aacattctta ctcaactcga atactaaatt ctaattcaaa atggaaggaa acatagaaag 480 taaaaccttt gaaaatattt ctttttgtgc atataactgc caaagtttta aatcgaactc 540 aatttatatt tccgaactgg tcaaatcatt tgatattatt tttctatcgg aacattggct 600 tttgaatatt gaactttttg tgctaaaaaa catagcattg ccgacacata aagtattttt 660 tcatgcggca gaaaaaaaag catttggaag accatttgga ggaaacgcat tcattattaa 720 aaaagatgtt ttactgtctc ctcacataat ttatgaagat gacaatattc tcgctattaa 780 aggtaaaaat aagtgtcaca atctagtctt tattggtatt tatctcacat catgtcgaaa 840 taacgaggaa tccctttcaa agtacgaaca acaattaaac ctaatcactt caatcgtaaa 900 aaactacgaa gatgtaagtg aatgtattat tgctggagat tttcagtctt atcccgaaac 960 actttacgat tctgagatcc gtaataataa aaaacgcaat aaattctcta atcatctaac 1020 taatttcatt aaatcaaata agctctcatt tatcgatata atcagtggcg ctggtccaac 1080 atacacctat cagcataaaa ctctatctaa ctcatcgtat atcgatcaca tcgcagctct 1140 aaaagagtca tccttatctt ttataaatac aagcgttatt cctccttctc cttttaattt 1200 tagtgatcac ttaccaatcg ctacaaatat cgtcgttaca catactaatc tatcaaccgt 1260 aataaatagc ataatgcata aatataatta tattccaaaa tatgcatgga ataacgaaaa 1320 attcattaac atatataatc ataacctctc cgtagctttc aactactaca ccttcaatga 1380 agaagaaatt gaacaagaat taaaacaaac atgcgaaaaa atccaaaatg ctgctttgtt 1440 agcagttaag cagtgcttta aacaacaaat ctgtaagtat tccaaatcat ggtggacaac 1500 ggaccttaaa aaatgcaaag aaaatctttc atttaactat aaacaatggc aaaaatctaa 1560 ctataataaa acacaagaat gcattgaata caaaagatac atcttttcac gcaaaaactt 1620 ccgtaaagca gttaaagcag ctcataacaa aaagatccay gaaaaaacta taaatattga 1680 aaatcttcga accactaatc ctcagaggtt ctgggataac atccgtaaaa tcaaaactaa 1740 agcaacaact cgattattta caattaacaa atcgcaaaat ataaaagata tcgttcaaga 1800 atttagacaa cattttcaaa ctcttctcaa tacgcctctt attacaaata ataaccataa 1860 tccttctcaa attccacctc taagtaaaga gcctaattca ctaatcataa catcagctga 1920 tattataaat tgcatctcac aattaaatag taataagtcc ccagatagct gtgggttaag 1980 tgctgaacac cttaaaaact ctcataataa taatcttctt ttgtggctcg ctaaatttta 2040 taacagtatt ctatcgaatg gaaaagtacc aaatgatcta tcaacttcaa caattattcc 2100 acttgttaag tcatataaaa aatctcttaa taatccagat aattatcgag gaattagtat 2160 tttacctatc tttactaaac ttctagaata tctaatacta ctaatatgtc ctagtataac 2220 agatagccat tcccatcaat ttggttttaa aaaaaacagc tctacactcc acgcagaatt 2280 tttattgagc gaaaccgtta arcactataa aaacaataac acgcctttat atatgtgcag 2340 tttagatgca aataaagctt ttgatacttg caactgggat ttgctatttg agaagctcta 2400 tttccaaaaa aaacttccgt tgtctgtagt acacacaata tcatcgcttt atcattctgg 2460 tactgcaaac gtatcctatc aaggtgttac ctcaaatcaa ttcagtctgt cacaaggggt 2520 gcgacaaggg tctattctat cgccacattt atataacatt tatactgaaa gcatcctaaa 2580 tgaaattgtc tcagaatgta aagtaggatc gacaataaat ggaatttata ctggtgtaat 2640 tgcatacgca gatgatgtaa ttttgctaag ccccacaatc tctggtctgc aagaactgat 2700 taacagattt caaacatatg gagtagcaaa ctttatcaag ctaaacactg agaaaacaga 2760 atttctgatt tctggaaaaa gttatattyc taacaacgtt attaaaatta atggagtttc 2820 tatatatctt caaaataaat taaaacatct gggtttccaa tgggataacg ataccagcaa 2880 aaacaaaatt gcaacactcc aaaaaacaaa tttaaatgag cgtataactc agtttcaatc 2940 ggttgcaaaa gctctaattg acagtgggat tcgttactgc cagccatcaa caattgttca 3000 tctgtacaat acattaacag tgccgaagct tatatatggt cttgaactat gtaaatatga 3060 tactaatttt ctcagcaaaa tagacgctgt tggaagagca gtcttaaaat catttttttg 3120 tatttcgaag tacagtaaaa attaccttca ttcttttttt aaaatagaag atatttcaac 3180 tattctcctt aggaacaaag ttaatctttt tatcaggctg ctgaaaaatc aaacttgctt 3240 aytcactgat aattaatcaa atacaggaaa cacagaataa aaaatccttt acaaacgaag 3300 cgtttgatgc atgcaacaca cttgatttaa atttcataca gtgcatgtta aatggcaaaa 3360 aaattaaatt gacgcaaccc gtcgtagagc tacaaactaa tgttatgcta atattgaaag 3420 aaacgttcca gttttggaat ttgaaagaaa agagagaggc gtttaaaatc ttgatggagg 3480 agaatgttcc tagaactctg tcccaataat atgtggaaaa caaaccggct gtttataagc 3540 atttattgct aacaactaca agagagcccg tgttgtttta gctgattttt aatttattgt 3600 ttgactattt aaatattatt ttacctgtaa tttattaccg ggtgataaag agaaaatata 3660 atataataat aa 3672 // ID I_Ele43B_AAe repbase; DNA; INV; 6831 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele43B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6831 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1353-1353 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >97% CC identity. CC The consensus is ~83% identical to I_Ele43. XX FH Key Location/Qualifiers FT CDS 558..1841 FT /product="I_Ele43B_AAe_1p" FT /translation="MAAASSGDPGGSAKRRLPEFMDPTNQFGELTFLQLSG FT KNGTPLPINPYITGKSVEACAGGPIESAKTEAQGTKYTLRVRDPAQVAKLL FT KLTKLIDGTEVEVVPHPNLNVSRCVISCFDLIQMEEKDILTEMISQKVIRV FT QRITRNESGKRVNTPALILTFCKTTYPEYIKVGLLRVATRPYFPNPMLCYG FT CFSYGHTRVRCPGPQRCANCAQNFHGEECGEAPSCRNCKGDHRSTNRQCPV FT YKKEVEVIKVKVSENLSFPEARKRVEQQSGSYAQVAAQQSVFEKKLKELEA FT AMLQKDKEIARLQEDNKKKEERIEQMMAFIKQVKQQSNPERVHHVSETAEK FT PRHSREQRVAQSTAGPMTRSRNNSPAVQETKRGRPPKFVYPKPASSPDISP FT PPKKTAPTTHDLTQMEYSGEESEVSETPPNQRLR" FT CDS 1777..6711 FT /product="I_Ele43B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="LRWSIPAKSLRYRRHPLTSVFDNPTLEQRSKYNEDPR FT TNTDNETFRFGSRESVVPLPTQVRQDEGVXRDVLPQPVDDEFTSVADGIPH FT DTTTLQTHRVTIHTVPISSNSIETNNENTDDTTQSLSPVPAIAGVSGSSRV FT VVSAMAGTVGQDVPQVSLLNSQPQTGTAAPAFFSDAHHTTPTAANPNQQQP FT TSTNRLAINNRTEETTQSLSPVPAIAGGSGISRVVVSAMAGTVGQDVPQVS FT FRLSLPQSGTAAPAIFSGTHQTTPTTVNLDRHQPASTNRLANNNHFTDDTT FT QSLSPVPAVAGVSGSNRVVVSATAGTVGQDVPQVSLSTLPPQTVAAAPDYL FT SGHIASPLSPGEGPSRVHRRNITQSASPVPAVVGIGRTSVMVSTAAGTVGQ FT SVPQASDRLVFSPTEATALPLSPTTDPPXRRTSTSSSTTRSATDSRSGSCF FT ALQWNIRGLRANISELKLLISNLEPCVIALQETKVNQTVVPPDFVGRNYTL FT LLXSTNVNYWQHGVGLAIREGIPFERVQIDTSLHIVAARLHAPIQVTVVSV FT YIPPSSAQCQAALSELLEQLEGPVLLLGDFNAHHLAWGSHQSTALGRFIAE FT TTLDKQLVILNDGSPTRIDPTTGNTSAIDVSICTESLACRFTWRTLSDTHN FT SDHFPIIVSIPGWCSPRTTRQKWLYDQADWSMYERITIETIDADVEWDVPS FT FTEKILAAAKKSIPRTSGRSGPKSVPWWCPEAKAAIRRRRKCLRALRRLQQ FT TDSNQPEALAEFQEARAAARRAVKEAKEKSWEXFVAKISPNSTTTELWRTV FT NTLRGNRQQRPVVLKRASGFTDNPNDAAEELAKYYSERSATSSYPPSFQMA FT KAAAELESTDFSHNTGDVYNIDITLAELLWALDKGRGVSTGPDSIGYPLLQ FT RLPVSVKMTLLNLLNKIWRSGEFPTSWRNAVVIPIPKPNCQDSGPAAFRPI FT SLTSCMSKLFERIINRRLITELESSGRLDKRQHAFRAGRGTDTYFAELERS FT LPITDEHCLIASLDLSKAYDTTWRHGILRTLKSWRIRGRMFNTLKSFLSER FT TFQVSVDGHLSREHDLENGVPQGSVLSVTLFLVAMQPIFRVLPKGVEILLY FT ADDILLAVRGPKSANVHRKLQAAVKAVDKWAKSVGFAISATKSHTFYCSPN FT ARREPANEIQIDRTSVPKTNRLRILGITLDRTLSFKPHCQMVKKSCESRLR FT ILQMIGAKLPRGNRSTLLQVGSALVTAKLIYGIGLVSRGGPTTLQTLAPAY FT NKMIRYASGAFVTSPTSSVMAEAGALPFEFLATQSTARTAIRILAKNRNNS FT TLPLIQRTSDRLEEITGSALPAVDQLVRQSDRLWHTRKPSVVWDVKNTVRA FT GDPPEKVRPVVQQLLSTRFQNSTIVYTDGSKLNDMVGSAFYLNGLVGMYSL FT PNQCSVFSAEAYALKMAVSIPNLSTELVVLTDSASCLLALEGGKSKHPWIQ FT QVEHIAKNKPIRFCWIPGHAGISGNVEADRLAGEARGQPANNIAIPGEDAL FT RAMKQGIRQRWNRQWYESRDSKLREIKNDTHRWADYGSAADQRILTRLRIG FT HTRLTHTFLLKKEAPPTCECCGSVLDVRHLILECRKFDRQRQDNGISSTNL FT QEALTNEEINMKKVLKFLHDTGLYTKL" XX SQ Sequence 6831 BP; 1895 A; 1812 C; 1647 G; 1454 T; 23 other; cagatcgttc cgacacactg tgttgaccgg ttgkctccgt gcacgcgttc ggcwtaaacc 60 aggtgttttt ccgagttttt cgcggttttc gcgatcgmaa cgcgtgtkaa tctgtggttg 120 cggtggtgaa gaakaagwgg atagtgaagg tttggtgaaa atcggtgttg tagcggccga 180 aaacggcgaw aaaaagtggc acccacgtgc tttgaaacgg ccaaacaaat cgccgtcagc 240 cggctccccg gccgttctca ttcaccwttc ggtggtgcac agtgtgcttg aattgttttt 300 cgctgaaaaa aaaaaaaagt gtcgaaagtt tgggaagtaa gtggaattga gaaagtgagc 360 gatccwcatc agggattcac tacagtagtg tccagtagtg cctacagtgc ggcacaaaat 420 tttgggaacg cctggccatc gctttctacg accgaggata aaakaaggta ggatctttcc 480 ttttactttt ctttcgtcaa tttggggtag gtaacgggtg accgttgcca gtggaagtgt 540 tttccactta aaacaacatg gcggccgcta gtagcggcga cccagggggg tccgccaaac 600 gtagattacc agaatttatg gatccaacga accaatttgg cgaattgacg ttcctccagc 660 tgtctggaaa aaacggaact ccgcttccga tcaacccgta cattaccggg aaatcggttg 720 aggcatgtgc cggcggccca attgagagcg cgaagaccga agcgcagggt acmaagtaca 780 ctctgmgagt tcgsgaccca gcccaagtwg ccaagttgct kaagctgacg aagctaattg 840 acggaaccga ggtagaggtt gtccctcacc ccaatctaaa tgtaagtagg tgtgtcatct 900 cgtgcttcga cctcattcag atggaggaaa aggacattct gackgagatg atcagccaga 960 aggttattcg ggtacagcgt atcacccgaa atgaaagcgg aaaaagggtc aatacaccgg 1020 cactgatcct tactttctgt aagaccactt acccggagta catcaaagtt ggcttgctgc 1080 gcgttgctac tcgcccctac tttccgaacc cgatgctttg ctacggttgc ttcagttacg 1140 ggcacactcg tgtgcgctgc cctggaccgc aacgttgtgc caactgcgcg cagaacttcc 1200 acggggaaga atgcggtgaa gccccgtcgt gccgtaactg caagggtgac catcggtcga 1260 ccaatcgcca gtgcccwgtg tataaaaaag aggttgaagt gatcaaggtg aaagtgagcg 1320 aaaacctgag tttccccgaa gcgagaaaac gagtggaaca acaatccggt agttacgccc 1380 aagtagccgc ccaacaaagc gtgttcgaga agaagctgaa ggagctggaa gcggccatgc 1440 tccaaaagga taaggaaatc gccaggttac aggaagacaa taaaaagaaa gaggagagga 1500 tcgagcagat gatggctttc atcaagcagg tcaagcagca gtctaaccca gagagagtgc 1560 atcatgtgag cgaaaccgct gagaagcccc gccacagccg agagcaacga gtggcccagt 1620 cgacggctgg cccgatgaca cgctcaagga acaactcccc ggccgttcag gagaccaagc 1680 gcggaagacc tccaaaattc gtttatccca aaccagcctc ctcgccagac atcagcccgc 1740 ccccaaagaa gaccgcaccc accacccatg acctgactca gatggagtat tccggcgaag 1800 agtctgaggt atcggagaca ccccctaacc agcgtcttcg ataaccccac tcttgaacaa 1860 cgttcgaaat acaacgaaga ccctcgcacg aacacggaca acgagacttt tcggttcgga 1920 agtagggaaa gtgtagtacc tcttccgacg caagtacggc aggatgaagg agtcwtccgg 1980 gacgtcttac cccaacccgt tgatgacgag ttcacctccg tagccgatgg aatcccacac 2040 gacaccacga cattgcaaac ccaccgagtt accatccaca ccgtaccaat atcaagcaac 2100 agcatcgaaa ccaacaacga gaacaccgac gatactacac aaagtctttc cccagtgcca 2160 gctattgccg gtgtttctgg aagcagtcgt gtagtagttt cggcaatggc tggcactgtg 2220 gggcaagacg tcccacaggt cagtcttttg aattctcaac cacaaacagg tactgccgcg 2280 cctgcattct tttcagatgc ccaccatacc acaccaaccg cagcaaatcc caatcaacag 2340 caaccaacat caaccaatcg tctcgctatc aacaaccgca ccgaagaaac tacacaaagt 2400 ctttccccag tgccggctat tgccggtggt tcaggaatca gtcgtgtagt tgtttcggca 2460 atggctggca ctgtgggaca agacgtccca caggtcagtt ttcgcttatc cctacctcaa 2520 tcaggtactg ccgcgcctgc aatattttca ggcacccatc aaaccacacc aaccacagtg 2580 aatctcgatc gacatcaacc agcatcaacc aatcgcctcg ccaacaacaa ccatttcacc 2640 gacgatacta cacaaagcct ttccccagtg ccggctgttg ccggtgtttc tggaagcaat 2700 cgtgtagtag tttcggcaac ggctggcact gtgggacaag acgtcccaca ggtcagtctc 2760 tctactttac caccgcaaac agttgctgct gcgcccgatt atctttcagg acacatcgca 2820 tcacccctat ctcccggaga aggcccttcg agagttcacc gcagaaacat tacacaaagc 2880 gcttccccag tgccggctgt tgtcggtatt ggcagaacta gtgtaatggt ttcgacagca 2940 gctggtactg tgggacaaag cgtcccacag gcaagtgaca gactagtttt ttcaccgaca 3000 gaggctaccg cactccctct ctctcccaca acagacccgc cgawccgaag aacatcaacc 3060 tcgtcatcca ccacacgatc agctacggat agtcgctctg gtagctgctt cgctctccag 3120 tggaatatcc gtggtcttag ggctaacatt agcgagctaa agctgctcat ctccaacctc 3180 gaaccatgtg ttatagcctt gcaggagacc aaggtgaatc aaactgtcgt tccgccagac 3240 ttcgttggca gaaactatac actgctacta magtcgacga acgtcaacta ctggcaacac 3300 ggtgtaggcc ttgccatccg agaaggtata ccattcgaac gtgtgcaaat cgatacctct 3360 ctacatatcg ttgctgctcg ccttcacgcg ccgatccagg ttaccgttgt gtcggtatac 3420 attccgccga gttcagctca gtgtcaggca gcattgagtg aacttctcga gcagttagaa 3480 ggtccagttc ttcttctcgg tgactttaat gcccaccatc tcgcatgggg gtctcaccag 3540 tcaaccgcgc ttggccgatt tatagccgaa acaacgttgg acaaacaact agtgatcctg 3600 aatgacggct cccctactcg tatcgatccg actacgggaa atacctcggc aatcgatgta 3660 tctatctgca ctgagagtct ggcatgtagg ttcacttggc gtaccctgtc agatacacac 3720 aatagcgacc acttcccgat aatcgtttcc atacccggat ggtgcagtcc acgaacgact 3780 cggcaaaagt ggctctacga ccaagcagat tggtcgatgt acgaacgtat cacaatcgaa 3840 accatcgacg cagatgttga gtgggatgtt cctagtttca ccgaaaaaat attagcagca 3900 gcgaagaagt ctatcccacg aaccagtggc cgaagcggac cgaagtcagt gccgtggtgg 3960 tgccccgagg caaaggcagc cattcgacgg cgacgaaaat gtcttcgagc actccgacgc 4020 cttcagcaga cggactcaaa ccaacccgag gctttggcag aattccagga agctagagca 4080 gcggcacgaa gagcggtcaa agaagcgaag gagaaatcat gggaagawtt cgtagcgaaa 4140 atttccccaa acagcacgac gaccgaactg tggcgaacgg tcaacacatt gcgtggcaac 4200 cgacaacagc gaccggttgt actgaagcgg gctagcgggt ttacggacaa tccgaatgat 4260 gcagcggaag aactagcgaa gtactacagc gagagatcgg cgacttcaag ctaccctcca 4320 tcgttccaga tggcgaaagc ggcagctgaa ctagagtcca cagatttttc gcataacacc 4380 ggcgatgtat ataacatcga tataacccta gccgaacttc tgtgggctct cgacaaaggg 4440 cgaggtgtct caacaggtcc cgattcaata gggtatccct tgcttcaacg gcttcccgtt 4500 tccgtgaaaa tgacgctatt gaatcttctc aacaaaatct ggcgaagcgg tgaattcccc 4560 accagctggc gaaacgcagt cgtcattccc atccctaagc caaattgtca agactccgga 4620 cctgctgcat tccgaccaat ttcactcaca agttgcatgt ccaagctttt cgagcgaatt 4680 ataaatcgtc gcttgattac cgaactggag tcaagtgggc gacttgacaa gcgtcaacat 4740 gctttccgtg ctggacgtgg caccgacact tactttgcgg agctagagag gtcacttccg 4800 atcactgatg agcactgtct aatagcgtct ttggatttat cgaaggcata cgatacgacc 4860 tggcgccacg gcatactacg cacattgaag tcttggcgta tacgtggcag gatgttcaac 4920 acgctaaaaa gttttctctc tgaacggacc ttccaggtgt ctgtggatgg acacttgtct 4980 cgtgagcacg atctggaaaa cggagtaccg cagggctcgg tactatccgt gacgcttttc 5040 ttagtcgcaa tgcaacccat ctttcgggta ctaccgaaag gtgttgaaat acttttgtac 5100 gctgatgaca tccttctcgc agtacgagga ccgaaatctg ccaacgtcca ccgaaaactg 5160 caggccgccg tcaaggctgt cgacaaatgg gcgaaaagcg tgggcttcgc catatctgcg 5220 acgaagtcac atacttttta ttgcagccca aatgcacgtc gagaaccagc gaacgaaatc 5280 caaatcgatc gcacatcagt acctaaaacc aaccgtttaa gaattctggg tatcactctt 5340 gaccgaacac tatcgttcaa accccattgt caaatggtta aaaaatcttg cgagtcccgt 5400 cttcggatac ttcaaatgat tggggctaaa ctcccaagag ggaaccgatc aactctttta 5460 caagttggtt cagcgctagt cactgcgaag ttgatctacg gaataggact ggtgagccga 5520 ggaggaccaa caacgctgca aaccctcgcc ccggcataca acaaaatgat cagatacgct 5580 tccggagctt ttgtgacgag tccgacaagt tcggtcatgg ccgaagcggg tgctttgcca 5640 tttgaatttt tggcaacgca atccacagcg cggacggcca ttcgaatact ggcgaaaaat 5700 cgtaacaaca gcactctccc gctgattcag cgaacatcag accgtctgga agaaataacg 5760 ggatcagcac ttcccgctgt tgaccaactc gtaaggcaga gcgatcgcct atggcacaca 5820 cgaaagccat cagttgtatg ggatgtwaaa aatactgtcc gtgccggcga tccgccggag 5880 aaagtccgcc ccgtggtaca gcaactgctc tctactcgtt ttcaaaactc gactatcgta 5940 tacaccgacg gctccaaact aaacgacatg gtagggtccg ctttttactt gaatggcctt 6000 gtaggaatgt atagccttcc aaatcaatgt agcgtcttct ctgcggaggc gtacgccctt 6060 aagatggcgg tttcgatccc aaaccttagt acggagctwg tggtattaac ggactctgcc 6120 agctgtcttt tagctttaga aggggggaaa tccaaacatc cctggattca gcaagtcgag 6180 catatagcca agaacaaacc gattcgattt tgctggattc caggacacgc tggtatcagc 6240 gggaatgtgg aagccgatcg actagcgggt gaggctagag gacaaccagc aaacaatatt 6300 gcaataccag gagaggacgc tctgagagcg atgaagcaag gtatacggca acggtggaat 6360 aggcagtggt acgaatccag agactcgaag ttacgagaaa ttaagaacga tacgcacaga 6420 tgggcagatt acggaagcgc agccgatcaa cgaatactaa cacggctccg aatcggccac 6480 acacggctaa cccatacctt cttactaaag aaagaagcac caccaacatg cgaatgctgc 6540 ggatcagtgc tcgatgtacg gcatttgatc cttgaatgta gaaaattcga cagacaacga 6600 caagacaatg gcattagctc aacaaattta caagaagcat taacgaatga agaaattaac 6660 atgaaaaagg ttttaaagtt ccttcatgac acaggattat atacaaaatt atagagttac 6720 tgttaaaact agtgaactga attgtatcag aatgtaacac actttccaaa cgacacgaat 6780 gcaccctttt ggtgtcaagt gtcgaaaata aacaaacaaa caaacaacaa a 6831 // ID Crack-25_BF repbase; DNA; INV; 2333 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-25_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-25_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2333 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2333 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 830-830 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 3..2153 FT /product="Crack-25_BF_2p" FT /translation="KTQAMVVFKRTYRDFCPEDFLKDAEMVPWHLVLEESN FT VNDALHLFTVMFNDIADQHAPMRKRAVKSNPVAWLDDELRELMCLRDEAKR FT ESVASGLQSDFEVYKKLRNTVVNLNRKKKAAFHKTKLEESKGDPKAMWKTL FT NGILGKGSKRSTGVVEQDGIYLVKAKDIAEHFNNYFLNKVNILRQDMHQQS FT NDALLHLIEENIMADKDCSFAFQQVRREEVYKLLIDLPEGTAAGLDNMDNK FT LLRLAAELVSTPLCYIINLSFATSVFPREWKKAKVVPIPKSATAPFCGANS FT RPISLLPATSKIMERIVSIQVSDYFQQNSLMSNHQHAYRKNHSTCTALLHM FT VDDWHHSIDQGSLVGAIFLDFSAAFDLVDHSRLLQKLGCYGFSESTTTWMA FT SYLTGRQQCVHINGTNSSFLELPCGVPQGSCLGPLIFTIYTNDLPLAVSQA FT TADMYADDTSAYTCDPAIETISSNLQTEVNNICNWVRTNQLFLNASKTKCI FT VLGSKPKMSKRPKLALTVDGKSIEQVTEVKLLGTTIDECLTWNSQTKSTSK FT KMARSLGMVKRCAKFMDQKLLNIVIECLVLSHIDYCSPVWSCTTKRNLNLL FT QRVQNKANRLLQIRPKWKSVNTRLLLNTMTTFRRIVLTNEPVCLSSKLSFV FT KKQHNYSTRFASGRTFKLDKPRTNSLKRTFRYRATSMWNRLPIALQSALSS FT RSFKTGLCDLVSCLEK*" XX SQ Sequence 2333 BP; 724 A; 497 C; 500 G; 612 T; 0 other; ctaaaacaca ggcaatggta gtattcaagc gaacttaccg ggatttctgc cctgaagatt 60 ttctgaagga tgcagaaatg gtaccgtggc atcttgtgct ggaggaatca aacgtcaatg 120 atgcacttca cctcttcaca gtcatgttta atgatatagc tgatcaacac gctcctatga 180 gaaaacgagc agtgaagtca aacccagtgg catggctaga cgatgagctc agggaactga 240 tgtgtcttag ggatgaggct aagagagaat cggttgcctc gggtctccag tctgacttcg 300 aggtttacaa gaaactccga aacacagtcg tgaatctaaa caggaaaaag aaggctgcat 360 tccacaaaac aaagctagaa gagagtaaag gtgaccctaa agcaatgtgg aaaaccttaa 420 acgggatcct gggcaaagga tctaaacggt cgactggtgt ggttgaacag gatggcattt 480 accttgttaa agcaaaggat attgctgaac acttcaacaa ctatttcctg aacaaagtaa 540 acatcttgcg gcaagacatg catcaacaaa gtaatgacgc cttgctacac ctgatcgaag 600 aaaacatcat ggctgacaag gactgcagtt ttgcttttca gcaagtaaga agagaggagg 660 tgtacaagct ccttatagac ttgccagagg gaacagcagc tgggctggac aacatggaca 720 ataaactcct gagacttgca gcagagctag tttccactcc tctttgctat atcattaact 780 tgtcttttgc tacctcagtc ttcccaaggg aatggaagaa ggcaaaggta gtgcccattc 840 ccaagtcagc aacagctccc ttctgtggtg caaacagtag gcctattagc cttcttcctg 900 ctactagcaa gataatggaa cgtattgtta gcattcaggt gtcagattat ttccaacaga 960 actcacttat gtctaatcac cagcatgcat acaggaaaaa ccattcaaca tgtacggcac 1020 tgctacacat ggtagatgac tggcatcata gtatagatca gggaagccta gtaggcgcca 1080 tttttcttga tttctcggca gcatttgatc ttgtcgatca cagccgtcta ctccagaaac 1140 ttggctgcta tggctttagc gaatccacaa ctacatggat ggcaagttat ttgacaggaa 1200 ggcaacaatg tgtccacatt aatggcacta actcttcctt cttggaacta ccctgtggtg 1260 ttccccaggg gagttgtttg gggcccctca tattcactat atacacaaat gatttgccac 1320 ttgctgtaag tcaagccaca gcagatatgt atgctgacga cacctcagcc tacacctgtg 1380 accctgcaat tgagacaatc tccagtaatt tacaaacaga agtgaacaat atctgcaact 1440 gggtaagaac gaaccagctt ttcctgaacg catcaaagac aaaatgcata gttctcggaa 1500 gtaaacctaa aatgtcaaaa aggcccaaac ttgccttgac tgtggacggt aagagcatag 1560 aacaagttac agaagtaaaa ctgctaggca caaccattga cgagtgtctc acttggaact 1620 ctcaaacaaa atctacctca aagaaaatgg ccaggtctct ggggatggtt aaaagatgtg 1680 ctaagtttat ggatcagaaa ttgttaaaca tagtaattga atgtctggta ttgtctcata 1740 ttgactactg tagcccagtg tggtcttgca caacaaagag gaacttaaat cttctacaga 1800 gagtacagaa taaggctaat cgtctcctcc agataagacc aaaatggaaa tctgtgaaca 1860 ctagattgtt attgaacact atgactactt tcaggagaat tgtgttgacc aatgaacctg 1920 tctgtctcag cagtaaactg tcttttgtta agaaacagca caattacagt actcgttttg 1980 cttcaggtag aactttcaag ttggacaaac ctagaacaaa ctccttaaaa agaactttca 2040 ggtatagagc aacatcaatg tggaatcggc tacctatagc actacagtcc gccctatctt 2100 caaggtcctt caaaactggt ctctgtgacc tggtctcctg tctggaaaaa tgatcttggc 2160 tgttccatga cttgcgtatt gtgtatatat catgactgtt tttaactgat gtttgtaaat 2220 attgtccact gtatgtatgt tatctatgtc tatgtatgaa gtccaggaag attagtggcg 2280 tgccctaggg cacgtcacta atggattttc gaataaagtc tcaaagtctc aaa 2333 // ID Gypsy-189_AA-I repbase; DNA; INV; 5735 BP. XX AC supercont1.90; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-189_AA_; KW Gypsy-189_AA-LTR; Gypsy-189_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5735 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.90; Positions 168512 174246. XX CC Positions [3875-4351] - Integrase core CC 'AAAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2648..4636 FT /product="Gypsy-189_AA-I_1p" FT /translation="MDEVLGYGELEPSVFVYLDDIVVVSSAFEAHLESLKE FT VARRLRLANLSINLDKSKFCLKELPYLGYIISAEGLRPNPDRVEAIINYER FT PTSLRALRRFLGMSNYYRRFIPRFSEISVPLTNLWKKKPKTIVWNPAAEKA FT FFQLKEKLIAAPVLANPNFLLQFQIQTDASDSAIAAILTQQHESGEKIVAY FT FSQKLSPAQQAYAASEKEGLAVLTAIDKFRPYIEGTRFVVITDASSLTHIM FT NGKWRTSSRLSRWSIELQGYDFEIRHRKGRDNVIPDALSRSMETASLDERD FT AWYSTLYHKVLSEPDECLDYKVEEEKLLKFVPSKTDVLDFRFEWKLCVPEK FT SREKILRKEHDESFHIGYEKLLDKLRTRYFWPKMATSVRKYVERCRICKEC FT KPTTTSQHPIMGNARLSTKPFQILAINFIQSLPRSKSGHTHLLVILDVYSK FT WTMLVPVRKIATDLIIKILEEQWFRRFSVPEVLISDNATSFLSKSFQNFLS FT YYQVKHWANSRHHSQANPAERLNRSINTCIRTYVRSNQRLWDTRISEVEHT FT LNNTTHSSTGLTPYMILFGHEIVTSGEEHRRDSITTDVTEAERQLQKMEID FT QQIQEIVNENLKKRMRKAIELIIFDFASLLLFIRWDKRCISETSHYLRLGS FT PITQNWGQPILFAL" XX SQ Sequence 5735 BP; 1835 A; 1088 C; 1254 G; 1558 T; 0 other; aatggcgccc aatttaaaaa caaaagcttt caagagttct agaattcatt agtttcttaa 60 gagtgtgcat aatagattac ggcaaacaaa aggattgagt gttgtagcgg tgatacggtt 120 gcagtaaatt ggagaatcgt tctaagattc attggatcat cctactagtg cgcgaacaat 180 tacgaaagtt aacatttttg cgggcattca ccaaattacg gtagtattgc tgcgtggtat 240 atattgaatt acttactgtt tgttgggtct tgattcgact gacagttcga agcagaattc 300 tgtttcagat cttaaagagt gatttgattc gacttttgaa catttttcgt caagggtcat 360 caatcggatc gcgatataga aataggatta tcaaatttat attgaatact ttttgtcagg 420 aatcttcaac ggattcagcg tataaaacta ggattttaat tttttttctt ttgcgataat 480 tttcggcaag aattatccgt cagattcgga atatacatct aggattttca aattgcttgt 540 gataactcat ttagtttcgt ttttctgatt agcggaaaag aaatggatat acaatcgtta 600 tacaaaaaga ttgatgtgtc acatctatca atcgatgagg tggaacacga attgctcatt 660 aggaacattt tgtatggcct agacgaacat gagagtatta agcggagaaa gctaaatgat 720 caaatgagaa aagaaagaaa caagaacaga cttgttactt gttcaatcga gtgcagagaa 780 tgattcagaa caatctgaaa ttgagaaacg aacaaaacaa cgtgtcagta aacgactaac 840 gtcggaagaa aacagaagaa aagaagaaaa agtatagaaa aagaaaacat aggaaggata 900 agaaagctag gagaagcatt aggagggttc caatttctga atggaaatta aaatatgacg 960 gcaaggatca ggggcgtcgg ttggctgaat ttctcaagga agtgaaaatg cgctgtaagt 1020 cagaagacat ttccgataga gaactttttc gaggtgcgat ccacttattc agtggacggg 1080 ccaaggactg gtttatggaa gggtttgaga atcgatactt ccgtagctga gcttaaacgc 1140 gaattcttac cgccagatct agattttcag attgaaattc aggctaccaa ccgacgacaa 1200 gctcgaggag aaaagttcgt tgactatcta catgcaaaag cttttccaat cgatgaccaa 1260 accaatctca gatcaacgaa agtttgaaat tgtttggcgc aacatgagat acgactacaa 1320 aaacgcatta acagggtcgc gtatcaagac tacatctagg ctaaaaaaat atggtcgtat 1380 aatcgatgaa aacaattgga acatgtttca aaagtccaat gaaattcgcc taaaagttca 1440 gcaggttaat gaaaactttt tctcagagca aacccaagaa aaacatggag agtgagaaaa 1500 gcaaacagca agaggagcca aatgaggaaa ttaagaggca gagctatgtg gaccctattg 1560 agcggtcatc caagggaact ttaaaagctc aatccgaacg ttatatacgc ccaccaatag 1620 gcacttgcta taattgcagg aaaaatgggc accattatgg tgactactcg gaaaagcgga 1680 agaagttttg tagattatgt ggcttttggg atgttattac ccctgaatgt cctttctgcc 1740 aaaaaaacga gcagagctca gcttgagaag gcaagctgac gtgttgccta caaatccttc 1800 cacaatcgag caagtttaca ccgacttaat aactaacggg tttgagccaa tttcgaaaga 1860 ggattatttc tcagatatgg tggtcgatga gctttttgta cgaatcgccg cagataacag 1920 accattcgca ataatatggg ttctggggag agagatatgt ggactactcg atagtggggc 1980 tcacaggaca gttctcggat ttggctgtag aaagttggta aaagatttaa aattgaaaat 2040 tttaaaattt tgaggggatg aaattggtgg cgaagaacca gaagatgata tggagattct 2100 cacagctgag cagaaatttc agttggaaga ggtcaaaaag ctcttcaaaa cagcgattga 2160 gggagaagtt ctcgaggtaa cttctctcat ttctcataaa atagagttta aggaagagtt 2220 taaaaactct ccaccggtga gaataaaccc atatccaact tctcccgaaa tacagcgcaa 2280 agttaacatt gaacttgata agatgctcag ttagcaggtc attgaaccga gcaagagcga 2340 gtgggccctg agtaccgttc ctgtcctcaa acccactggg gaagtccttc tttgtttaga 2400 cgcaagaagg ttgaacgatc ggactaggag ggatgcttat cctctcccac accaagatcg 2460 tatattaagt aggctagggt cgagtagatt cttaaccacg attgatttaa ctagagcatt 2520 cctacagata ccccttgacc ctagctcacg aaaatatacg gcgttctcag tgttgggtag 2580 aggattgttc cagttcacac gactaccttt tggcttagtc aatagtcctg ctacactagc 2640 tcggttaatg gacgaggtct tagggtacgg tgaactggaa ccgagtgtgt ttgtctacct 2700 cgacgacatc gttgtagtaa gcagcgcatt cgaagcccac ctcgagtcgt taaaggaagt 2760 tgcccgtcgc ttgcgattgg ccaacctgtc aataaatctc gataaatcga aattctgcct 2820 gaaagagtta ccttacctcg ggtatattat ttcagctgaa ggcctacgac cgaacccaga 2880 ccgtgtcgag gctatcataa attatgagcg cccgacatca ctacgagccc tgcgccgttt 2940 cttaggaatg tcaaattatt atcgacggtt tattcctcgc tttagtgaga tctcggtacc 3000 tttaacaaat ctttggaaaa agaaaccaaa aactattgtt tggaacccag ctgcagagaa 3060 ggcttttttt caacttaagg aaaaattgat cgcagctccg gttcttgcaa atccaaattt 3120 tctactgcaa tttcaaatcc aaacggatgc aagcgatagt gccatcgccg caatattaac 3180 gcagcagcat gagagcggag agaaaattgt ggcatatttc tcgcaaaagc tttcgccggc 3240 acaacaagct tatgcggctt ccgaaaaaga aggcttagcc gtactgaccg ccattgacaa 3300 atttaggccg tatattgaag gaacaaggtt tgtggtaata actgatgcat cgtcccttac 3360 ccatattatg aatgggaagt ggcggacatc atcgaggctt agcagatgga gcattgagct 3420 gcaaggatat gactttgaga ttcgacacag aaaggggagg gataatgtga tcccggacgc 3480 tctatctcgt tcaatggaga cggcctctct cgatgagaga gacgcatggt actcaacttt 3540 gtatcataaa gtgctctctg aacctgatga gtgcctcgac tataaagtag aagaggaaaa 3600 gcttttaaaa ttcgttccca gcaaaaccga cgttttagat tttagattcg aatggaaact 3660 ttgtgttccg gagaaatccc gcgagaagat tctccgaaaa gagcacgatg aatcgttcca 3720 tatcggatat gagaaacttc tcgataaact ccgaacgagg tacttttggc ctaaaatggc 3780 tacctcagtc cgaaaatatg tcgaacggtg tcgaatttgc aaagaatgca aacctaccac 3840 tacttcacag catccgatta tgggaaacgc acgtttatct accaaaccct ttcagatttt 3900 ggcgattaat ttcatccaat ctctgcctag gtcgaaatcc ggtcatacac acctattagt 3960 gatcctggat gtctattcta aatggactat gcttgtgccg gtaagaaaaa tagctacgga 4020 tctgattatc aaaattctcg aagagcaatg gtttcgccga ttttcggtgc ctgaagtcct 4080 aataagtgat aatgcgacga gttttctcag caagtccttc cagaattttc ttagttatta 4140 tcaggtcaaa cattgggcga actcaaggca tcacagccag gcgaaccccg ctgagcgcct 4200 aaaccgcagc attaacacat gtatccgaac gtatgttagg tccaatcaac gattatggga 4260 tacacgaatt tcagaagttg aacacactct gaacaatacc actcactcgt ccacaggcct 4320 tacaccctac atgatcttat ttggtcacga aatcgtaacc agtggtgaag agcaccgtcg 4380 cgattcgatc actacggatg ttacagaagc tgaaaggcaa cttcaaaaaa tggaaattga 4440 ccaacaaatt caagaaattg tgaacgaaaa tttaaaaaag cgcatgagaa aagcaataga 4500 gcttataatc ttcgatttcg caagcctgct cctgtttatc aggtgggaca aaaggtgtat 4560 aagcgaaact tcacactatc ttcggctggg gagtcctata acgcaaaact ggggccagcc 4620 catactcttt gcactgtagt ttctcgtcga ggaacaagct cctacgagct tctcgatgaa 4680 aacggtaaaa acgtagggat attttcttcc gctgatctca agcctggagt tcctcaagac 4740 ttctccgctt aaccttcctg ttgtagagta cagagaatac tgaaaaaagg ttggctgttg 4800 cccagcctca atattacgtc gttcacgttc tttatagtag taattgtatc gttgtgttga 4860 ctcgtaagat tcgtagagca aataatgtta gagtgagaaa attggtttga tggtcccata 4920 atgtctcgct gtgttgatag tgtttgtcta atgttaacga gtgatttttg ttgtaaataa 4980 taggttacaa tcgaattaag tagttgcgat agagaagttc aaccgattaa ggaaaagaga 5040 cttaaaatgc atagtttctt attcgcgagg cggtaaagcg ttggtacccg ctcactgccc 5100 taccataatt gattagtttc tactaagatt aactcagaat tcactttgaa cccacccgat 5160 tgattatagt ttttcaattt actcagtcct agataattca gcgagaaata gcattcaaaa 5220 ccggacaatc aagattaact tattttgatt gaaaataatt agtctaattc ttcttcgaaa 5280 cagttaacgg tttcggacat tgggagagtt taagactaca taaacaagtc gcggaagtaa 5340 ttgcgtaaac aagtgccctc aatcatgcgt gagggatcta tttgttggaa gaacgcaatc 5400 tatgttgatt tttatctgaa aacgaaagca caaaatgtgc ggctcaaccc tggaatggtt 5460 cacaatgaat tgaacaatta aataaaaaaa ataatacaaa acctagtgat cggacattgg 5520 gcaggacact atttcctacg ttgtgattaa gtgatgttaa aataaaaaca accgattttc 5580 tgatcgtcca gccatggcgg agactcaata gctactgttt attcatatat atctacattg 5640 tggtataggt acaatataag tagggtaatt acacatattg acgtgatgtt caaacaccta 5700 gttttgatta atttttaggg gggagtagaa ggtag 5735 // ID CR1-70_HM repbase; DNA; INV; 4178 BP. XX AC . XX DT 26-DEC-2008 (Rel. 13.12, Created) DT 26-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-70_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4178 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1897-1897 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(62..778,782..2470,2412..3947) FT /product="CR1-70_HM_1p" FT /translation="MAKSLTIVKEFLECHESTLLKTFGERFDKLEEKFYNE FT IEKLKNENALLKREVAEVTKAMNFQNEYYEQIVKESNDLKGKLNNYNKDNE FT NLKCTEIIQDKLAELEDRSRRNNLRFDGFDEREEETWEDSESKVKEFLKEK FT LGIKNNIAIERAHRTGKKHVSGKRRRTIVVKFLNYKDREIVLNEYKKRKLW FT NEKIYLNEDYSNRTIEKRRFLFTKAKELRAEGKFAKVIYTKLVTRASEVEI FT LLNLIQHTKNDNNKKYKMDDYESLIFNFRNKYIGSDENTDPDINYFNDLKV FT DCFYHYASELKEFLSKNGDENKKIRIVHINIRSMNKNFENLRHFIEETKNY FT FDIICLTETWCSQNELNNNSNFHXPNFETILLERQLNKRGGGILIYVKQNI FT AHNVRNDLCVSDGDKEIVTIEILNNKSEKLKSKSILLSCCYRPPGGASENL FT SLYLKHNVFHKGNKEKKKNFVIGDLNMDCLRYSENKKIKKFYDEIFVSGAI FT PLINRPTRITENSTTLIDNILTTNIYNNDIQIGIIKSDISDHFPIFLTINT FT ISTPNINKSQTIRKRIFNKKNLEAFKYQLSLLNWKHINFNEDINAIYNKFF FT NTFFSVYDANFPVSEITLKPKTLSSPWITKGLKKSSKIKQKLYIKYLKTKS FT HANLRIYKNYKTLFEKIRKNLKKNYYSNLITKYKNDSKQIWKILKEITGKQ FT KTCSNFLPKMIKINNVCVYEPNEIAKEFNKYFTDIGSRLADRIPSTNISFN FT DFLTTSNNLLSSHDLFLDLSIEEFERAFKSLKKKQSNRCGWNKREHYYRLL FT KKNKATGADGINGNIIIDCYESLKNILFKVFQTSIQQGVFPDLLKIAKVIP FT IYKEGEKSNVNNYRPISILPTFSKILEKIIFEKTYNYLVSNNLLYKNQFGF FT KKNSSTEQAIIQFTREVSNSFAKSQYTLGVFVDLSKAFDTVNHDILLKKLE FT FYGITGKMIKWYKSYLSNRKQFVYHGEELLPINLQNKTLIEIKCGVPQGSI FT LGPLLFLVYVNDLNKASNLISIMFADDTNLFLSNKDINKLFLDMNKELNQV FT SNWFKCNKLTLNIEKTKWILFHSSTKKRFLPYDLPKLYIDKVEIKKDSVLK FT FLGIYLDENMTWKAHIDYISTKVAKSIGILYKARNYLNKNNLKQLYYSFIH FT SYINYANIAWGSTGKSKLQRLYRHQKHAMRIIYFLNRFSNTKPLFKDMNVL FT NVYELNLYNILCFTFCCMNNPLFHVFKDLFTFKVKNKYSLRNNNYLKEPFC FT QTKFNQFCIDYRAPYIWNKIVQPNLDSSISFSVFKNKLKNIIFSTDNILKY FT F*" XX SQ Sequence 4178 BP; 1729 A; 584 C; 557 G; 1307 T; 1 other; caaggaaaaa ctatttttct ttttagtata ataagttttt atattttgaa ataattcgaa 60 aatggcaaaa tctttaacaa ttgtaaaaga atttttggaa tgtcatgaaa gcactctact 120 aaaaaccttt ggagagagat ttgataaatt agaagaaaaa ttttacaacg aaattgaaaa 180 attaaaaaac gaaaatgctt tactgaagag agaagtagct gaagttacta aagcaatgaa 240 tttccagaat gaatattatg aacaaattgt taaggaatca aatgacctaa aaggtaaact 300 caataattac aacaaagaca acgagaatct aaaatgcacc gaaattattc aagataaatt 360 agccgaatta gaagatagaa gtagacgaaa caatcttcgt tttgatggct ttgacgaaag 420 agaagaagaa acttgggaag atagcgaatc gaaggttaaa gagttcttaa aggagaaact 480 cggaattaaa aacaacattg caatcgaaag agcacacagg actgggaaga agcacgtatc 540 tggaaaaaga agaagaacta tcgtggtaaa gtttcttaac tataaagaca gagaaatcgt 600 attaaatgag tataaaaaaa gaaagctatg gaacgaaaaa atatatctca acgaagacta 660 tagcaatcgc acaatcgaaa aaagacgatt tctatttaca aaagcaaaag aacttcgagc 720 tgagggtaag tttgctaaag ttatatatac caaacttgta acgcgcgcct cagaagtata 780 agaaattctt ttaaatctta ttcagcacac aaaaaatgat aacaacaaaa aatataaaat 840 ggatgactat gaatcactta ttttcaactt tagaaacaaa tacattggat ctgatgaaaa 900 cacagatcca gatattaatt attttaatga cttaaaagta gattgttttt atcattatgc 960 tagtgagtta aaagaatttc tttctaaaaa tggcgatgaa aacaaaaaaa ttagaattgt 1020 ccacataaat ataagaagta tgaacaaaaa ttttgaaaat cttcgacatt ttatagaaga 1080 aactaaaaac tattttgata ttatttgctt aacggaaact tggtgttcac aaaatgagct 1140 taataataac tctaatttcc atyttcctaa ttttgaaaca attttattag aaagacaatt 1200 aaataaacgc ggtggaggaa ttctgattta cgtaaaacaa aacattgcgc ataacgttag 1260 gaatgatttg tgcgtttctg atggcgataa agaaatcgtt acaattgaaa ttttaaacaa 1320 taaaagcgaa aagttaaaaa gcaaaagcat tctactaagt tgttgttata gaccacctgg 1380 tggtgcgtca gaaaatttaa gcttatactt aaagcacaac gtttttcata aaggtaacaa 1440 agaaaagaaa aaaaactttg taattggcga tttaaatatg gattgcttac gatatagcga 1500 aaacaaaaaa ataaaaaaat tttatgatga aatatttgta tcaggagcaa taccgcttat 1560 caaccggcct actcgaataa cagaaaactc aaccaccctt atagacaata ttttaacaac 1620 aaatatatat aacaatgaca ttcaaatagg cattataaaa tcagatatat cagaccattt 1680 ccccattttt ctaacaatta acacaatttc aactcctaac attaataaaa gtcaaactat 1740 aagaaaacgt atttttaaca aaaaaaattt agaagcattt aaatatcaat tatcattgtt 1800 aaactggaag catataaatt ttaatgaaga cataaatgca atctataata agttctttaa 1860 tacattcttc tcggtgtacg atgcgaattt tcccgtttcc gaaataacct taaaaccaaa 1920 aaccttaagt agtccttgga ttaccaaggg tctcaaaaaa tcttcaaaaa ttaaacaaaa 1980 attgtatata aaatatttaa aaacaaaatc tcatgctaat ttacgcatat ataaaaatta 2040 caaaactcta tttgaaaaaa ttcgcaaaaa cctaaaaaaa aattattact ctaatttaat 2100 cactaaatat aaaaatgact caaaacaaat atggaaaata ttgaaagaaa taacgggaaa 2160 acagaaaacc tgttcaaatt tccttccgaa aatgattaag ataaacaatg tatgtgtata 2220 tgaaccaaat gaaatagcaa aagaatttaa taaatacttt actgatatcg gaagtaggtt 2280 ggctgatagg attccatcta caaatatttc ctttaatgat tttttgacaa cttcaaataa 2340 tttactttct tcccatgact tattcctaga cttatcgatt gaagagtttg aaagagcctt 2400 caaatcttta aaaaaaaaac aaagcaaccg gtgcggatgg aataaacggg aacattatta 2460 tagattgcta tgaaagtttg aaaaatattt tatttaaagt tttccaaaca tctatacaac 2520 aaggggtttt ccctgatcta ttaaaaattg caaaagttat cccgatttat aaagaaggag 2580 aaaaatcaaa cgtaaataat tatcgcccta tttctatcct tcctacgttt tcaaaaatat 2640 tagaaaaaat tatcttcgaa aaaacataca actacctggt ttcaaataat ttactatata 2700 aaaatcagtt cggctttaaa aaaaatagtt caactgaaca agccattatt caatttacac 2760 gtgaagtctc aaattctttt gctaaatcac aatatacact tggtgttttt gtcgatttat 2820 caaaggcttt tgatacagtc aatcatgata tcttacttaa aaaacttgag ttctacggaa 2880 taactggtaa aatgatcaaa tggtataaaa gctacttgtc aaatagaaag caattcgtct 2940 accacggaga agaattgctt cctattaatc ttcaaaataa aacattaatt gaaataaaat 3000 gtggtgttcc acagggttct atccttggcc cacttctctt tttagtatac gtcaacgatc 3060 taaataaggc ctcgaatctt ataagcatca tgtttgccga cgataccaat ttatttttgt 3120 caaacaaaga cattaacaaa ctttttttag atatgaataa agaacttaat caagtttcta 3180 actggtttaa atgcaataaa ttaaccttaa atattgagaa aacaaagtgg atccttttcc 3240 actcctcaac aaaaaaacgc tttttgcctt atgatttacc taaactttat attgacaaag 3300 tagaaataaa aaaagattcc gttttaaaat ttctaggtat ttatcttgac gaaaatatga 3360 catggaaagc tcatattgat tatatatcaa caaaagttgc gaaaagtatt ggtatcctct 3420 ataaggccag aaattatcta aataaaaata atttaaaaca actatactac tcatttattc 3480 atagctatat aaactatgca aatattgcct ggggaagtac aggtaaaagt aagttgcaac 3540 gtctttaccg ccatcagaaa catgcaatgc gaataatata ttttttaaat cgtttttcaa 3600 atacaaaacc tctatttaaa gatatgaatg tattaaatgt ttatgaactt aacttgtaca 3660 atattttatg ttttacgttt tgttgtatga ataacccctt atttcatgtt tttaaagatc 3720 tatttacctt taaagtaaag aataaatata gtttgcgaaa taataactat ttaaaagaac 3780 ccttttgtca aacaaagttt aatcaatttt gtatagatta tcgagcacct tatatctgga 3840 ataaaattgt acagcctaat ttagattcat ctatctcttt ttctgttttt aaaaacaagt 3900 taaagaatat tattttttct acggataata tattgaaata tttttgagat attattttaa 3960 ccatttctta tttcaaatgt tttggattct tattttgtta taaactacct gtattatact 4020 atattatatt tatattgtaa tttttatatg gttctggcga caagatcgat atgatcttct 4080 tccagattcc aagtttatat gtttatatta catggaaatc acgacatgta aataatgtaa 4140 actgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 4178 // ID L2B-5_CQ repbase; DNA; INV; 4116 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4116 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 146-146 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 1..765 FT /product="L2B-5_CQ_1p" FT /translation="SRKRKLEAENQNGSNGDETPKSIPSFAEVLNSKRAKK FT DDNVKTVPAERLKVKPNPVVVIQPRKGVDIGEKTAKKELKQKVNPKNLNVN FT RVIEGKDGAVIVVVKDDESSQKLKESVEKEMGDQFEVNVRDSMKPTVKIIN FT ISEEFNEDELKATLIEDNDVFHDLKHFKLRKFYRNEKRSYAQFTALVELDA FT TTFFKVMELEKIFVGWDRCRVFDGLEVPRCFKCCKYNHKIAECKAEVDTCP FT KCAGIIEGRTVSQL" FT CDS 909..3719 FT /product="L2B-5_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="QPDKPHDVLYLNIAGLPTHFDELQLLIQNQKPKLVIL FT TETHLTASHDLARFAVPCFEMVNCFSKSLHTGGVSMYIDDRLTHEIVSNKT FT VGANWFLSVDISNSSFNGIFGGVYHSPSSSDRDFVECFESWLQEVVVEDKT FT NVIAGDFNIRWGQGPYSLELRNATDVAGLKQKITEYTRSHGGSRSIIDLVF FT SNIDECDASVVEEFKISDHETIGVCVPGALVVSLPDNKKKFVSWKKYSKSR FT LQALLRSDVTLQTQEVSVDAKAKVYSSSLVKAVSELTEERVVRATRSNAWF FT GSSLQMLKQQRDDAYKTYKRTSSEECWQRYKILRNLYVKELRNAKNGSVRQ FT EIDRCSGDSKRLWKCLKSLITPGGMKKSSIVFEGDEGGSCSDEMTARKVNK FT FFVKSVEEIHKSIPAPQLPEVSSLPPQNLFDSFQPITMEKLIEIVASLKKC FT SGTENITKQVLLDSLGIVGGKLLEIVNVSLQQGVFPKEWKKSTVIPIPKVA FT KSTKPEDHRPINMLPLYEKVLETVVKEQLLDFINREEILLTEQSGFRKKHS FT CESALNLLLLKWKQAIENKRTILAVFVDLKRAFETIDRRKLVEVLRRYGIA FT GNVLKWFSSYLEERTQVTLFNGVLSPEVAVDLGVPQGSVLGPLLFILYMND FT IKKALQRSEVNLFADDTVIYVTGSCREECCEILNAELNVFADWLKRKKLKL FT NVSKTKCMLITNQRRDDTDGTVYIDGEVVERVDVIKYLGVMLDNKLNFDEH FT INYTIRKAARKLGIMYRLNKYLSFDNKLMIYNTLIAPHFDYCASILFTATQ FT QQQKRMQLLQNKVMRLILGCERLTPRSFMLDCLRWMSVRQRVEYNTLVFIF FT RLTKGMAPQYLTDTIVYGSDVHQHFTRQAREIRLLNFKMTSTQNSLFYKGY FT RLFNMLPEATRNSDNLRDFKKFCKSFVRQRPLL" XX SQ Sequence 4116 BP; 1331 A; 662 C; 989 G; 1134 T; 0 other; tctcgtaagc ggaaactgga agctgaaaat caaaacggaa gcaatggtga tgagactccc 60 aagtcaattc catcatttgc ggaagttcta aatagtaagc gagccaagaa agatgacaac 120 gttaaaacgg tgccagctga gaggttgaag gttaaaccga atccagtggt cgtcattcag 180 ccgaggaagg gagttgatat tggagaaaaa actgcgaaaa aggaattaaa acagaaagtt 240 aacccaaaga atttgaatgt caaccgtgtc attgaaggaa aggatggagc agttattgtt 300 gttgtgaaag atgacgaatc atcacaaaag ttgaaggaaa gtgttgaaaa ggaaatgggt 360 gaccagtttg aggttaatgt tcgtgacagt atgaaaccca ctgtgaaaat catcaacatc 420 agtgaggagt ttaatgaaga cgaactgaag gctacgctta ttgaagacaa tgacgttttc 480 catgacttga aacattttaa gctccgtaag ttttacagaa atgaaaaaag aagttatgca 540 cagtttacag ccttagtaga gttagatgcg acgacctttt ttaaagtgat ggagttggaa 600 aaaatctttg ttggatggga tcgctgcagg gtttttgacg gacttgaggt tccaagatgt 660 ttcaaatgct gtaagtataa ccataaaatt gcagagtgca aagcagaggt agacacatgc 720 cccaaatgtg cgggaatcat cgagggaagg actgtaagtc aactgtagag aaatgtgcta 780 actgtgaaag agatcgtgtt gaggggaatc tggacgttga aacggatcat gctgtatgga 840 gtaccaactg cccagtttat caaaggttta ttaaacgtct taataagcgg attgattaca 900 ctgcatagca accagacaag ccacatgatg tattgtattt gaatattgcg ggacttccaa 960 cgcatttcga tgagcttcag ttgttaattc aaaatcaaaa accaaagctt gtaattttga 1020 cggagacgca tttgaccgct agtcacgatc ttgcccggtt tgcggtacct tgttttgaaa 1080 tggtgaattg tttttctaaa tctcttcata ccggtggagt ctcgatgtac atcgacgatc 1140 gtttgactca tgaaattgta tcgaataaaa ccgttggggc aaactggttt ttgtcggttg 1200 atataagcaa cagtagtttc aatggaattt ttggcggtgt ttatcactct ccaagcagca 1260 gtgataggga cttcgttgaa tgtttcgaaa gttggctgca agaagtggtt gtggaggaca 1320 aaacgaatgt aattgcagga gatttcaata ttcgttgggg gcaaggtccg tactctctcg 1380 agcttagaaa tgcaacggat gtagctggat taaaacaaaa aataacagaa tatactcgta 1440 gccacggtgg aagtagatcg ataattgatc tggtgttttc taatatagac gaatgcgatg 1500 ccagtgttgt tgaggaattc aaaattagtg accacgaaac aatcggtgtg tgcgtcccgg 1560 gtgcactagt tgtcagtttg ccagataaca aaaagaagtt tgtgtcatgg aagaaatatt 1620 caaaatctcg tttacaagcc ttgttaagaa gtgatgtcac cctgcaaact caagaagtat 1680 cagtggatgc aaaagcgaaa gtttacagct cctcgttggt gaaagcagta tctgaattga 1740 cggaagaacg agttgtgaga gcgactagaa gcaacgcatg gtttggatca agtttacaga 1800 tgctgaaaca gcagagagat gatgcataca agacgtacaa gcggacaagc agtgaggaat 1860 gctggcaacg gtataagatc ctgcggaacc tttacgtcaa agagctacgt aatgcaaaaa 1920 acggctctgt aaggcaagaa atcgaccggt gtagtggaga ctctaaaagg ctgtggaaat 1980 gtttaaagtc gctgataact ccaggaggca tgaagaaatc atcaattgtt tttgaaggtg 2040 atgaaggtgg ttcctgctct gatgaaatga cggcaagaaa ggttaataag ttttttgtaa 2100 aaagtgttga ggaaatccac aagtcaatac cagcgcctca gttaccagaa gtttcaagtt 2160 tgcctcctca aaatctattt gattcattcc agccaatcac gatggagaag ttgattgaga 2220 tagttgcgtc gctaaaaaaa tgctctggta ctgaaaatat caccaaacaa gtgctgttag 2280 actcattggg cattgttggt ggtaaactgt tggagattgt taatgtttcg ctacaacaag 2340 gtgttttccc gaaggaatgg aaaaaatcta ctgtcatacc gattccaaaa gttgcaaaat 2400 cgacaaaacc agaagatcat cgaccgataa acatgctgcc attgtatgaa aaagtgttgg 2460 aaactgtcgt gaaagaacag ttattggatt tcataaaccg tgaagagatc ttgttgaccg 2520 aacagtctgg ttttcgtaaa aagcattcct gtgaatcagc gttgaacctg ttgctgctta 2580 agtggaaaca agctattgag aataaaagga cgattcttgc tgttttcgta gacctcaaga 2640 gagcgtttga gacaatcgat cgccgtaagc tcgttgaagt tttacgacgc tatggaatag 2700 cagggaatgt actcaaatgg tttagcagtt acttggaaga acgaacgcaa gttactttgt 2760 ttaatggtgt actatctccg gaagttgcag ttgacctggg tgttccacag ggcagtgttt 2820 tgggaccgtt gttgtttata ttgtatatga acgatatcaa aaaagcgtta caaagatctg 2880 aagtgaacct gtttgctgac gatactgtaa tttacgtcac tggaagctgt cgcgaagaat 2940 gctgtgaaat cctgaatgct gagctgaatg tgtttgctga ctggttaaaa aggaagaagc 3000 tcaagttgaa cgtgtcgaaa accaagtgta tgctgataac taaccaacgg agggacgaca 3060 ctgacggaac agtttacata gatggcgaag tagtcgaaag ggttgatgtg attaaatatc 3120 tgggtgttat gttagacaac aagttgaact ttgatgagca catcaactat acgattagga 3180 aagccgctag gaaactgggt atcatgtaca ggttgaacaa gtacctttct tttgataaca 3240 agttgatgat ctataacaca cttattgctc cccatttcga ctactgtgcc tctattttgt 3300 ttaccgcaac acagcaacaa cagaaacgta tgcaacttct tcagaacaag gtgatgcgtt 3360 tgatattagg atgtgaaaga ctaacaccaa gaagttttat gcttgactgc ctacgatgga 3420 tgtcagttag acaacgtgta gaatacaata ctttagtttt tatttttaga ctgacaaaag 3480 gaatggctcc acaatacttg acggatacaa tagtttatgg aagtgatgtt catcaacatt 3540 ttactagaca ggcgagagaa attaggttgc tcaacttcaa aatgacgtct acgcagaatt 3600 cactttttta caaaggatat cgtttgttta atatgttacc tgaagcaaca agaaactcgg 3660 acaatctacg tgattttaaa aagttttgta aatcttttgt aaggcaaaga ccactgttgt 3720 aagtgactgc agaacattga ctgagggtgg ctgtgacaga gagtgtatgg tttaattgtt 3780 tgtatgtttg gattattggt ttgtttgttt gctctcttaa tcggtgcacg tctcagtgaa 3840 tgaatgaaaa atacgttgtt gggccgtata taagacatgc gtgtggaatt aatttaccag 3900 tacccaatgt ttaaagcaac ctgacaagtg ttgaggatgg aaaaagagag acaaggatgg 3960 gcatacacgg aagttgagta agatgatagg attgtctcca aaattatcga aagatatctg 4020 ctcataatcc ttccatacta caaaagatgt gtatgggtaa gaggtgggcc atccagagaa 4080 aaaaattaaa aattgtttac tttgaaaaat caaaat 4116 // ID Copia-1_ACA-LTR repbase; DNA; INV; 158 BP. XX AC AEYA01002308; XX DT 23-MAR-2011 (Rel. 16.03, Created) DT 23-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Acanthamoeba castellanii genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_ACA_; KW Copia-1_ACA-I; Copia-1_ACA-LTR. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-158 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Acanthamoeba castellanii genome."; RL Direct Submission to RU (23-MAR-2011). XX DR Genome; AEYA01002308; Positions 229624 229467. XX SQ Sequence 158 BP; 44 A; 32 C; 25 G; 57 T; 0 other; tgttgtgtag tgtcaactac acttgcacat gactctggta ttctgttata tctttattag 60 gtagttacca tatatgtgac atgtacacgt tagataataa atcaaagagg ctcttctgag 120 ctctcttatc tttccatcaa ctgcatgata cctcaaca 158 // ID P-3_Hrobusta repbase; DNA; INV; 3567 BP. XX AC . XX DT 10-MAY-2011 (Rel. 16.05, Created) DT 10-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW P; DNA transposon; Transposable Element; P-3_Hrobusta. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3567 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC 3' incomplete. XX SQ Sequence 3567 BP; 1250 A; 517 C; 598 G; 1202 T; 0 other; catggacgtc tcgatagcta tatatctgga ctgcaaggtg aacgtttcac ggttttcaac 60 tctgaccata ttaggctatt gcatttataa tatattacat aaatattaga tacatattat 120 atagtttata ggttgatatt aaattactgg aattcataga tgcaagtaga ctaactagca 180 gcatgttaaa tttctttgtt aatctttttt tacttccttt aattttcttt attcgttccg 240 atcgtgcatg cctatatatt cttgttgatt gttgctattt tccaatgttt taattttatt 300 ttggtgataa aatcgtgtaa aataggtatt ataacaaact taaatcattt tttaattatt 360 ttaaaaattt ccgtaacatc ttaaatttgg agcaatataa tttttacttt tggtaaaagt 420 gcaatttcga ttattattac ttcacagtcg gttagttagt taggagttat ttagagaaca 480 tttattttag atttaatttt tatatatatt ggtttatttt cttaaattaa atttacacaa 540 aactatttct ggattaaaaa gttgaagatg aatgaattta ttaaacaaca atttatttat 600 tgatctgtac ccgcacttct aatattggtt attaattgat ttatgttttt ttgcataaat 660 aaataacatt ttaattttta aggtattaag aataagataa gtaaagtaaa ttttgcttaa 720 aacttttaat tactaaaatt aatatgtata tttaaggaat attttattag tctatttctc 780 ctttgacgtc cattttatta tcacaaaaaa aatgaggttg agcattttta tgaatgaatg 840 aaaatgatgc tctacctgta aacctgttta agtttaaatg cagtagtgga cttgccgcta 900 tcttttttct tttggatttc tagttttaac atcataacca gggcgacttg agaatataat 960 atattaaatt ttacatttat tcaagattta tattaaatat gattttaaac taacattaca 1020 attacagtaa actactacct aaaatggtga acaagtgtgc tgcctatagt tgcaaaagtg 1080 gatacaaatc aaatgctgcg atagatgctg aaaacaaagt gtcttttcat tcctacccag 1140 tcaatgatcc tgaactttgt gcaaagtgga ataaggcaaa tccacgaagg gattttattc 1200 caagcagaca ttcaaagatg tgctctttgc attttcgacc ttcggatttt gttgataaat 1260 attttgatac aaattcaaca agatataaaa acaaaacaag caatagtgag tccagaatca 1320 agcgagaatt aaagcctgga acaattccaa caatatttaa tgcaccatca tacgcatcaa 1380 caatcgagca tctaccgaga cctacaacaa attcatctgc tgcgtgtcgt cgagaaggtg 1440 aaataaaaag aatgaatgaa cttgaaaatg aattcaacaa aagtgatcag ataataaatt 1500 tgagtttaga agaaaatcta gatcgtttaa aaaaggaaac atctgctcca agtggatacc 1560 attatcaaat cttcgaaaac agtttggtat tgtatgaaat acattttgac aactgttgcc 1620 ctcatatcaa atcatctatc accatagatg aacaaaaaaa tatatccctg tttaacaaag 1680 gtcgagaagt ttcaaaatca tttgttgaag gcatcatcaa caaaaaagtt gacaccatga 1740 atcagctatt gaatgcgatg gctagactta aaaatttgca aactgatcat gcgaaagtaa 1800 cttttgatga agccaaagct gcagctaaac aacaattaaa atgttgcttg gattctgttg 1860 atgacgaagt ttctaaatca caattacaat ttttaattga ccaactaaaa ctgattggtg 1920 ttaacaaatt tggaagaatt tacactccaa aacaaattat atggtgctat ttattatatg 1980 ctgctagtca tgcagcttat gttaagttgt atgaattgaa tattatgtgc ctaccatcag 2040 taaacacatt gaacaaatta acgaagaaag taaaagtaaa tagcggctta aataatgacg 2100 ggtacctgca tttgaggctg aattcgttga atcaattgga gaggacgtgt gtgttgataa 2160 ttgatgaaat ttatgttgca aaacgagttg aatattgtgg tggacaaatg ttaggtttaa 2220 caactgaagg ggaagttgca agtactattc tttgtttcat gatcaaatcc ctgtgtcata 2280 aatatcagga tatgatcgcc atgtatcctg tcaataaact aacttcttca tacatgtttc 2340 aatgctacca agaagttatg aatgttttaa agaagcaata tgtcaccatc attggaattt 2400 cactagataa tgctgctgta aacaggaagt ttttctcaag tcatttatgc tctggagaac 2460 taaaagccca tatctcaaat cctattgaca accagccatt atttctgttt tttgattcag 2520 ttcacaattt gaaaaatatt tacaacaact ttttaaacag aaaaactttt gatttgccat 2580 ctttcaatag aaactttatt aaactatctg atggtactgc acggttttca gatatagaag 2640 atttgtacaa acatgagtta aatatgctag tgaagaaagc tcacaagttg aatgctacaa 2700 tatttaatcc aacaaatatt gagagggttt ctgtcaagtt ggctgcttca atattttgtc 2760 aatcaacatg tgatgctttt gactattata taaatcatga aggcagatat gaatggaaga 2820 cgacaggtga atttgttaag ttggtcaaga aactttggga tataatgaat gttaaaacgt 2880 catccaaagg caaggcgaaa atcaacagtg atatggatcc tgtgaagtct tccaatgatt 2940 ggaaaattca cttcctgtta gaatttaaag attttttgtc agaatgggaa atgatgaaag 3000 caaataaatt aagcagagaa acattcattg ctatgaaaca aacttgttct gcttatgctg 3060 aatgcagcaa gtatttgtta gatagcatgg gttttaatta tgtccagctt ggatatctac 3120 agtcggatcc aattgaaact aggtttgggt ggttgcggca gatgtcagga ggaaactatt 3180 ttgtttctgt aagacaaatt attgaaaatg ataaaaagat caaaatggtt tcattgttac 3240 accactcaaa gttgtcaatc aaagaaattg atgatttcat tgatgatgtg ggcatcgttg 3300 aagatactag tatatttgaa gcagctaatg aaatcgtcca acaacttcaa tccacagcct 3360 catttgcctt cagcgatcca gcagatggtg aagcatgcac aatatattac atgtcaggag 3420 cagtcgctca cagtgtgctt catattacac gatgtgatga ctgcaaggaa atattagttg 3480 atgatgaatc catctgtgaa gatgtccacc atttaaaaga gcaagtattt ttcaatgatg 3540 taaacagggg agcgctaata aaaccat 3567 // ID Gypsy-182_AA-I repbase; DNA; INV; 7079 BP. XX AC supercont1.136; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-182_AA_; KW Gypsy-182_AA-LTR; Gypsy-182_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7079 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.136; Positions 2054368 2061446. XX CC Positions [4988-5470] - Integrase core CC 'CTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 387..2573 FT /product="Gypsy-182_AA-I_2p" FT /translation="MANPNYSYDDLLNTNRTVNSQRTLVTPPGSDFADSLE FT GAVGGNPARTPRIVWADQMMVDDSTNRLGRDRFSLDSRFPAPNNLQAELEH FT AARLQLSAEGFQLPFRGVQSSTRVERRAENQSVTSQGFHMDLSGFSDPAGT FT ASHTGGTNPFWDSEPGIDSNRNDSLNRLNASFLMPRDDAAPLLNSTAQPMV FT STTIPQPVGTVHPESPGHPTAQRSAYFSQPSSANVDRRMFETTQPMIPESR FT NEDFRMSYGESFAASQSASVPNPGNLPTGGPATESETLNNYVHVSEIQNYV FT QTYVQQLLKDQHHRTAFSDPITSNLARQMAGVGLQDSDGPRASDRIRENRS FT ASHDPLKLPAPLQLQCSDRSSFQPGNRTNPITAPTPQVFREVLRNSWPSFV FT TPGYPRSPVRPTVRDTANEPQISTNYPVNDDSNARTSEEGTNFQIRRGRLP FT HQTCNIMEKWPKFAGDANPTPVIDFLRQIELLSRSYQISKEELRTHAHLLF FT KEDVYVWYSAYEPKFCSWDTLLYYLRMRYDNPNRDRFIREDLRNRKQRPNE FT LFSAFLTDIENLSQRMNKKISDEEKFDIVVENMKMSYKRRLALEEISSLEH FT LAQLCYRFDALEGNLYHPRGPTKTQLINEVYTEEEVVDSEDLNEDLEIAAL FT QARKGLNGPQKDQSPVKFRETNDRAKPLCWNCRKSGHLWRECDQKKGIFCH FT MCGHPETTAFRCPQQHDLRAASEAKPKNE" FT CDS 3158..5845 FT /product="Gypsy-182_AA-I_1p" FT /translation="MPTVEIPTKRIENPMEIETEHQLSLQEKNQLFKAIQY FT LPVTQEGRLGRTPLLKHSIELLPGATPRRIPSYRWSPAVETVIDAEMDRLL FT NLDVIEECEGSADFVNPLLPIKKPTGKWRICLDSRRLNSLTKKDDFPIPNM FT TVILQRIKRARYFSVIDLTESYYQVELDDEAKNKTAFRTNKGLYRFKVMPF FT GLTNAPATMARLMVKVLGHDLEPFVYVYLDDIIIVSESFEEHLRLIECVGE FT RLTRAGLTINILKSKFCQTSIKYLGYVLSEEGLSMDVAKIQPVLDYPIPRT FT VKDIRRLLGLAGFYQKFIKNYSEITVPITDLLKKGHRKFEWTKEADNALQK FT LKDALVSAPILANPDFSLPFIIETDSSDLAIGSVLVQIQNGERKCIAYYSK FT KLSSTQRKYSATERECLAVLLSIENFRHFVEGNRFVVQTDAMSLTFLQTMS FT IESKSPRIARWALKLSKYDVELQYKRGSENIPADALSRSLYVLDCQPTDPY FT IEGLKQQVLQNPDQYSDFKIVNGKLFKFLTNSTLPEDPAFRCKEVVPLAER FT KSTIDDIHKVAHLGFLKTLEKVRERYYWPRMASEIKRFCSSCPVCKESKVP FT NMNVRPPCGKPKLCSRPWELISIDFLGPYPKSRKGNVWILVICDFFSKFVL FT AQCMKAATAPGVCAVMENLVFNLFGAPSICITDNAKVFQSDLFKNLLKNYG FT VTQWNLAVYHPSPNPAERVNRVIVTAIRCALNKQADHRNWDESIQKIVAAI FT RTSVHESTGFTPYFINFGRNMISMGSEYEHLREFESETAYASVHRSEETRK FT LYELVRQNLQEAYKRYSRPYNLRSNENHVFKPGDVVYKRDVHLSDKAKHFV FT GKFGTKYTRARVQSVLGTNTYVLENEEGQRIPGTFHGSFLKKA" XX SQ Sequence 7079 BP; 2071 A; 1498 C; 1537 G; 1973 T; 0 other; ttttggcgcc caacgtgggg cccgaaagcg gtgccttggc ttttttgtaa ataagataat 60 tggtttgaat ttagttgagt tttgatgaat tttgtaaata tttgtgaatt gaattaacaa 120 aatcggaatt taatcttaga gtttgcatta gtagaattta gattcggtag gcaagtaaga 180 tagtaaatag gctaatcggt taagaaaatt tcagcatttg ttagaatttt gatcagtttg 240 tatcttgaat tcggttttga acaaaaattt aaaagcgtag tatatctcct aattttattt 300 ttcttttttt ctcagcaata tcatttctat ccggtatttt tcgtattttt tgtattgaat 360 ttatattgaa ttgtgaatta ctaaatatgg cgaaccctaa ctatagttac gatgatctgt 420 tgaacacgaa tcgtaccgtg aattctcaaa gaactctggt tacaccacca ggttcagatt 480 ttgccgactc tctcgaagga gccgtgggtg gtaatcctgc tcgcacgcct cgtatcgtct 540 gggcagatca aatgatggtg gatgattcca ctaataggct tgggagagat cgattttcgc 600 tagacagtcg ctttccggct ccaaataact tgcaggctga gttggaacat gctgcgcgcc 660 tccaactttc agcggaggga tttcagcttc ctttccgtgg ggtgcaatct agcacaagag 720 tggaaagaag ggcggagaat caatccgtga catcgcaggg gttccatatg gacttgtcgg 780 ggttttccga cccggcaggg acagcatcac atacgggtgg gacgaaccca ttctgggaca 840 gtgaacccgg aatcgattcc aacaggaatg actcgctgaa tcggttgaat gcttctttcc 900 tgatgcccag agatgacgcc gctccgctgc taaattctac ggctcagccg atggtatcca 960 ctacaatacc gcagccggtg gggacggtgc atcctgaatc accaggacat cctaccgctc 1020 agagatcagc atacttctct caaccgtcct ctgcaaatgt agatcgacga atgtttgaga 1080 caacgcaacc aatgattcca gaatccagga atgaagattt ccgtatgtcg tatggtgaga 1140 gcttcgcggc gagccagtct gcttcggtgc caaacccagg aaacctccca actggcggtc 1200 ctgcgactga aagtgaaacg ttgaacaatt acgtacacgt ttccgaaatt caaaattatg 1260 ttcaaactta cgttcaacag ttgttgaaag atcagcacca tcgtacagct ttctccgatc 1320 cgattaccag caatttggct cggcaaatgg caggtgtggg tttacaagat tcagatggtc 1380 ctcgggcgtc tgatagaatc agggaaaata ggtctgcttc tcatgatccg ttgaaactgc 1440 cagcgccctt gcagctgcag tgctcggata ggtcaagttt tcaacctgga aatagaacca 1500 atccaattac tgctcctact ccccaagtgt ttagggaggt tctacggaat tcttggccca 1560 gttttgttac ccctggttat cccagatctc cagtaaggcc aactgttcga gatacagcca 1620 acgaacccca aattagcacg aattatccgg tgaacgatga ttcgaacgct cgaacaagtg 1680 aggaggggac taattttcag atcaggcgtg gtagattgcc acatcaaact tgtaatatta 1740 tggagaaatg gcccaagttt gctggtgatg ccaatcctac tcccgttatt gacttcctgc 1800 ggcagatcga acttctgagt cgttcctacc aaatctccaa ggaagagttg agaacccatg 1860 cgcatctcct tttcaaagaa gacgtctatg tctggtattc tgcatacgag ccaaaattct 1920 gctcgtggga taccttattg tattatttgc gtatgaggta tgacaatccc aatagggatc 1980 gttttatacg ggaagatctt cgaaatcgaa aacagcgccc taacgaactg tttagcgcct 2040 tcctcacgga cattgaaaat ctgtcccaaa ggatgaataa aaaaatttcg gatgaggaaa 2100 aatttgatat cgtcgtcgag aacatgaaaa tgtcctacaa acgacgactg gccttggaag 2160 aaatttcatc tctggagcac ttagcccaat tgtgctatcg attcgatgcc ctggaaggaa 2220 atttatatca cccaagaggc ccaacaaaaa ctcaattaat taacgaagtc tacactgagg 2280 aagaagtagt cgactcagag gacctaaacg aagatctcga gatcgcggct cttcaggcac 2340 gaaagggact aaatggtcca caaaaagatc aatccccggt taaattcaga gaaaccaacg 2400 atcgagctaa accactatgc tggaattgcc gcaaatcggg tcacctttgg agggaatgtg 2460 atcaaaagaa aggcatattc tgtcacatgt gtggacaccc agaaacaacg gcctttcgtt 2520 gtccacagca acatgacctc cgagcagctt cggaggccaa accaaaaaac gagtgaagga 2580 ggagattctc gggaaccgag ctccttccaa ctttaaagaa caggttccca agtcacagtt 2640 ttctcgattt aatcagacct ttttaataca gaccaatttc cggagatgcc ctcaccttat 2700 cgtttccatt ttcgatattc aggtcgaagg acttgcagac accggagcta gtgtctcaat 2760 aatcagttcg accaggctcc tcactcgcct tggcttgaag gttcagaaat gcgatttaag 2820 aatatttacc gcagatcgaa ctccgtacac ctgtctgggg tatgtgaata tcccgtaccg 2880 gtaccaagac gttacgaaag ttatcccgac tctagtagtc cctgaaatag ctaaaacgct 2940 aatcttagga atcgatttct taactgcctt caacttccaa ttggtcgtta cgcctgttca 3000 gaaggagtcg aatagtccag cttctaagag agatctatgt ctgaattata tcgagaacta 3060 ttttggtgat gaagatggtc acatttgctt tcaagtgatc cctgaggtaa atcgtggacc 3120 tgttgacgaa cctcaagaag aagatcaaag tcttgaaatg ccgacagtag aaattcccac 3180 aaagcgaatc gaaaacccga tggagattga aaccgaacat caattgtctc ttcaagagaa 3240 aaatcaactt tttaaagcca ttcaatacct tccagtaact caggaaggac gattgggtag 3300 aaccccttta ttgaaacact ccatagaact tcttcctgga gcgacaccac gacggattcc 3360 gtcttacaga tggtctccag cagttgagac agtgattgat gctgaaatgg acagactctt 3420 aaatctagat gtcattgaag aatgtgaagg gtcggcagat ttcgtcaatc ccctattacc 3480 gattaagaag cccaccggga aatggcgtat ttgcttagat tctcggcgat tgaactcctt 3540 gaccaaaaag gacgacttcc caattccgaa tatgaccgta atcttgcagc gaatcaaacg 3600 tgctcggtac ttttccgtga tagatttgac agaatcatac tatcaggtcg agcttgacga 3660 tgaggctaaa aacaagactg cttttcggac caacaagggc ttgtaccgct tcaaggtaat 3720 gcctttcggc cttacgaatg ctccggctac aatggcccgt ctgatggtga aggtcttggg 3780 acatgaccta gagccgttcg tatacgtcta cttagatgat attataatag tatcggaatc 3840 gtttgaggaa catctacgtt taattgagtg tgttggagaa cgtttaacac gagcaggctt 3900 gaccatcaat atacttaaat ctaagttctg ccaaacgagc attaaatatc tgggatatgt 3960 tttatccgaa gaggggttgt cgatggatgt agctaaaatc cagcccgtct tagactatcc 4020 cataccccga actgttaaag atatcaggcg attgctcggt cttgccggtt tttatcaaaa 4080 gtttattaaa aactattcag aaattactgt ccctattacc gatctcctca agaaagggca 4140 tcggaaattc gagtggacca aggaggctga taacgctctg cagaagttga aagacgcctt 4200 agtgtccgca ccgattttag cgaaccccga cttctcacta ccctttataa tagaaaccga 4260 cagttcagat ttggccatcg gatctgtcct agtccaaatc cagaacgggg aacgaaagtg 4320 catagcatat tattcaaaaa aattatcgag tacccaacgg aaatatagcg ccaccgagag 4380 agaatgtttg gcggtacttt tgagtattga gaattttaga cacttcgttg aaggcaatcg 4440 tttcgtggtt caaaccgatg cgatgagcct gacgtttctt caaacgatga gtattgaatc 4500 caaaagtccg cgaatcgcgc gatgggctct aaaactttcg aaatacgacg tagagctcca 4560 atacaaaagg ggttcagaga acattcccgc tgatgctctg agccgcagcc tgtacgtttt 4620 ggattgtcaa ccgactgatc catatattga aggccttaaa caacaagttc tacagaatcc 4680 tgaccagtat tctgatttca agatcgtcaa tggaaaactt ttcaagttct taacgaactc 4740 gaccttgcca gaagacccag cgttccgctg taaagaagtt gtgcctttag cggagaggaa 4800 atctacaatt gatgatatcc acaaggtagc tcacctcggg ttccttaaaa ccctggagaa 4860 agtacgtgaa cgctactact ggcctcgtat ggcgagtgaa attaaacgtt tctgtagtag 4920 ttgtcccgtt tgtaaagaat ctaaggttcc aaatatgaac gtccgtccac catgtggaaa 4980 gcccaaactt tgtagtcgac cgtgggaact aatctccatc gacttcttag ggccctatcc 5040 taaatcacgt aaagggaacg tctggatact tgtgatctgt gactttttct ctaaattcgt 5100 gctggcccaa tgcatgaaag cagcaacggc tccaggagtc tgtgccgtta tggagaacct 5160 tgtgttcaac ttatttgggg caccttcgat ctgtataaca gacaatgcca aagttttcca 5220 aagtgacctg ttcaaaaatc tgctgaagaa ctatggggtg acacaatgga atcttgccgt 5280 gtatcaccca agtccaaatc cagctgaaag ggtcaaccgt gtcattgtaa cggccatacg 5340 atgtgctctg aataaacagg cagaccacag aaactgggat gagtccatac aaaagattgt 5400 tgctgctatt cgtacaagcg tacatgaaag tacgggtttc acaccgtact ttataaattt 5460 tggccgaaac atgatcagta tgggcagcga gtacgaacat ctcagagaat ttgagtcaga 5520 aaccgcctac gcttcggtgc ataggagtga agaaactcga aaattgtatg aacttgttcg 5580 tcagaatctc caagaagcgt acaaacgtta ttctcgacct tacaatttga gatcgaatga 5640 aaaccatgtc ttcaaacccg gcgatgttgt gtataaacgt gatgtacatt tgtcggataa 5700 agcaaaacac ttcgtcggga aattcggtac aaaatacact cgggctcgag ttcaatcggt 5760 gcttgggacc aacacctatg ttctagagaa cgaagaaggt caaagaatcc ctggaacctt 5820 tcatgggtct ttcctgaaga aggcctaaaa atagaacagg gttcagccat gaataatgct 5880 caaaaaagca caagctatga ctgcactacg gtcgcgtagt gcatacacaa ctttcaaaac 5940 aaacacacat ataggtggtg caacgatgca gtccgatccg agaggtccat aggtttcctc 6000 gatcgtcgac acaaactaaa gctatgacgg tgcatccgcc tgatgcacaa aaacaataag 6060 ccactcgtcc aaagaaaaca ctgaatgcag gtgctacggc aaatagtcca acgttgagaa 6120 gtccatttgg ttgcctcacg ttgttgacca atttgtgctt ccgagatcga tcagtccagt 6180 ctgttcaatt agactctgac cttgaccgtg aagtcgaagc ctctctcatg cgcatataat 6240 gggattacct gtgttcttat tggaacagca gaaatcatcc caaaagcgta attcacaatt 6300 cagtttttat cacttttttc actataattt ttggttctcg ttgatttttc actttccttc 6360 cgtagattac aatagtttgt tcccgaattt ccatcgtaga gtccgtattt attattaaat 6420 tacctgaaaa tttagttaaa tatttcattt ttcttacctc agcttgtttt gttttcatag 6480 tttccatcct tctgagtgtt gacagaattt gacagcactt tctcacagta aagtttttga 6540 cattccgtga aacgtcttgt aacgtctgaa tagggtagct atgttgactg ggaggattag 6600 aatagggcgt gaatggtttc gatgtttcgg tatgcgtttc gtgtattagt tcggtaggta 6660 attatgtttt ccaattcgtt aattggagta attcgggtcg gtatttgaaa tttcctgata 6720 tgtcaggagt aaatcggttt aggttaagga gaaattttca gatatttctg aagtaattgg 6780 aatatagaat aacgaatgcg tttgaggaat gattcagttt gtgagtaaat gtaatgagta 6840 atttcgtcgg ggcagatcaa catagtaaat ccgttagaca atacagtgtt tcattagttt 6900 ccgtaaattc cgtttagtta gtttgattag gtattccgat aatgaattta aaaacttatt 6960 tttcaatttt ataaatgttt tgtgttaatt ttttcttaaa aaaaaaaaaa aaaaaattta 7020 acataaaaat tttgaaattt ttaaaatttc aaaattttta tttcttagca tgggcgaaa 7079 // ID Zator-N2_AAe repbase; DNA; INV; 1690 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 23-DEC-2010 (Rel. 16.02, Last updated, Version -1) XX DE A Zator DNA transposon family from Aedes aegypti. XX KW Zator; DNA transposon; Transposable Element; nonautonomous; KW Zator-N2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1690 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 658-658 (2011). XX DR [2] (Consensus) XX CC ~91% identical to consensus. 3-bp TSDs; usually TWA. TIRs are CC ~35 bp long. Both termini are ~80% identical to those of CC Zator-2_AA. XX SQ Sequence 1690 BP; 551 A; 270 C; 263 G; 606 T; 0 other; ggccggaaca aatttgaaat tttacttttg tcaccccccc cccttcaaat ttttcaaaaa 60 gtcagagggg ggaaataaat aaaatgtctt attttcagtg ttggttttca ttgttttttg 120 tgtttttagc tagttttgta attcgtcctt acgatattac atgagataac aggtgttatt 180 ggattataag tcctaaatct gttcgttaat gctaataggt gtgggatttt ccattgaacg 240 ttttttcttt tagaataaag tttacgccac ttttactgta acagtgtagc ctttgaaaat 300 agaatatttt agtgatttcc atgaattatt atttatataa actttcaatg aacaatcaat 360 ttgtcaaatc aattcagcaa ttggctctta aaatttattt gaatacatgt tttcctagta 420 gatcctcctt cttctggatg tataaatatt gaaagcgtga tcatcatcca caagctgcgt 480 atcaaatgtc cttgtccttg tcttcattgt gtttgtgtaa acgttatgat tctatttttt 540 taattaagaa taaaaccata tttatggggg aaatcgctct ataaatataa aaccgtatat 600 agaaaaagtg ccatcaagct gcttttcagg acacttacaa ctagggacaa tggaaattcg 660 cgatttcgcg gaagccgcga aatttggcct tcgaccgaaa ttcaggaaaa cccgcgaaat 720 tcgtttgatc gttttaattc atatgaaaaa cgcagtaatt ttcagtttat tgcctatgaa 780 aatggtttat ttattttgag tcaaggttcc tccaacaatt ttcgtatctt ttagtttatc 840 ttggacttct tctgattgct tgttttgttt taaattttaa attgccactc tggtaataaa 900 cttattatct tgatctgtaa aatacaaatt acaattatta acacaccgct gagtttacac 960 atttgaaatg tattttcact gagaatatgt actttagtgc aaaatttgac caaaagtatg 1020 gggccgcgac aaagccgaga aattttagtt tcatcacctt cgtcacccgt gaaatttaaa 1080 gaattttact gcgaatttcc attgtcccta cttacaacct attatactgc cgtcctatgc 1140 atatttatct cacagtcgct aaggtttgca aagaacatgg cacaagaggg catagaagaa 1200 caatatattt cacagagcca attttcttca attttattca aatctagtga aatattttta 1260 caaacaaaat gttacatata tgttaaatca aattaaaata atctaaattt aatttaaagt 1320 tgcatccaag tgcaacattt ctgcagtgtt cctaaaattg taaatagctt aaacacaatt 1380 tgaaagttta atattttctc tcaccaaaac tcaagaaaaa gacggtaaaa aaccctggag 1440 attgttgtta tgaaaaatcg aattatgaaa aatgttatag attaatgtta tgaaaaataa 1500 aaaaatggca tgattattgt tcaaaacgtt ttgatttgac ggttatatgt ggcaaaaaca 1560 tggttttgca gcatatgagt ataaaaaatg ataaatttac ctgaatttcc attttattat 1620 ttattgtccc cccccctcga cacattttga tggaaggtga caaaagaaga ttttaagatt 1680 tgttccggcc 1690 // ID SAT-4_AAe repbase; DNA; INV; 132 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Satellite-type sequence: consensus. XX KW SAT; Satellite; Simple Repeat; SAT-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-132 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1454-1454 (2011). XX DR [1] (Consensus) XX CC 132-bp unit. XX SQ Sequence 132 BP; 41 A; 26 C; 26 G; 39 T; 0 other; ccataaaata aagtctcagc cgtcgcctct caatttccca attgagggtc aatttgaggt 60 ctatactaaa gatctgagag ggtttcgtca aaattttatc gtgcagtttt tgccaaacga 120 cgaataaaga tg 132 // ID BEL-11_DPu-I repbase; DNA; INV; 5826 BP. XX AC scaffold_26; XX DT 16-DEC-2010 (Rel. 16.02, Created) DT 16-DEC-2010 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_DP_; KW BEL-11_DPu-LTR; BEL-11_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5826 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to Repbase Update (16-DEC-2010). XX DR Genome; scaffold_26; Positions 895553 901378. XX CC Positions [4755-5339] - Integrase core CC 'GTGAA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 237..1679 FT /product="BEL-11_DPu-I_1p" FT /translation="MAGVNGDNEVPEEETTTPPTQKDLRDATTWKKQRSAV FT RQQITKTIRYLGNLVNERGSRGSISSMMKHLENLLATAAKIHTNLSTVEDQ FT AENDRQDEFHLKYVELAGDALERGQQYLNSRLGEPPSVIAGKKAPSISPSE FT QNRQEEERKLAQQRADAAILEADEARRQANEVANRADQARREATAAQQALQ FT ALQDLENDDDDTSAVTNGSQHLSQLGQDWRQNQRRLNSSTSPAAPDEWIDR FT YAAGQLKPNVTASSRSSVKTELEIYSGRSLDWFEWIDLYRALVHDSGKSAG FT EKLVILKRHLKGDCLDLVQGLGGGESAYIEALVRLKQSCGRRDVMRAATLQ FT AIDKVELKNDPAIFKRFAERIRTHLSDLSRIGESNAPDLIEKISMRHQPPD FT RLEWNRGRRGGLETRSLNAFDSWLCERAAEYQNAYSIASEQNTSSVLKPPR FT SHARSHPASSAKAPDNHSAPKVTFRPLCFKCEGDHK" FT CDS 2655..5723 FT /product="BEL-11_DPu-I_2p" FT /translation="MSQQRFGGLLRRFSREPAFEKDYRAAVQKTIDKGYAS FT VLSEEEAESAKYFLAHHGVYKGIKLRVVFDAAAPFQGKCLNDAILSGPALV FT PPLPSVLIQFREGAIAWASDVEAMFSRFRLNLSDANYFCFLWQVSPPKTVV FT CRMDRLPFGATCSPFIAIQTTRRAVADAGVGEKAVEAVQKRMYVDDYLGSA FT KNADEGVAEATTVRKALAGADLNFQDWISNSAEFVAAMQEEKKPIPEISSL FT SADSESTKVLGTVWNTTSDALGFRINRPADEEYTRLSLTSHVAGIFDPLGL FT AAPIIIKAKVRLRDLVVKGLKWSDPVEGGDRAWWESWFQIVQELAHVSIER FT CLFPEEDDIVESQLHTFGDASEEAYAAVVYIRNQYRCGKIIIRIVKASSKL FT APKKSLSVPKLELNAALLSARVAAAVQHCLSHSIGRRYFWTDSSTVRNWIR FT ATASFYQVFVANRVGEIQTLTESDEWRFIPGKLNPADAATRSAIGEEIWPT FT IWQDGPEFLLQPESTWPTDLPWMAATDELKTTKLYSVQSNPDPFDWSAVSL FT DHTNLSTFLKLEGETKNLLQRCQLEAFPEDIARLKRGKPLRSSSHLLVLSP FT KLGEDGILRLGGRIDRAKLPYDARHPPLLPSKHPLTEKIVQVVHGQMHHAG FT TDYLFAKLCQHFWIIRGRELVKKVRRICPTCIKERAVPATQLMADLRAVRL FT DSYSPPFFHTASDYFGPIETSPGRNRVTKRYGVLFTCLATRAVHLELAESL FT SSEDFLLVFRRFIGLFGKPATIHSDNGTNFVGAERELNDLVDQLEKDPKLS FT QFRKEKVIDWYFQPPRAPHFGGAHESLVRSTKRALYRALDLEKKALRYPTD FT EMLRTLFAEIAGFLNSRPLTYASSDPEDFRPLTPNDFLNRPPTFDLLPGEF FT GDALPRERFRYVQKMAQLFWDLWTKLYLPTLVPRKKWQSVQGNLTVGDVVL FT LLDPNQPRGSWKIGHVNQTYPGCDGLVRVVKVKTEDGEYTRAIHRLSLLEK FT AAVADVPPEESPSPLSK" XX SQ Sequence 5826 BP; 1484 A; 1545 C; 1449 G; 1348 T; 0 other; ttttggtcct tcgaaccgga gtcatccaca gagattcttc ttggcatatt cgttgtcaga 60 agttgatcct acgtctacgc tcgttagcca gagcttgtca acgttcctgc cggtcgaaca 120 ctgtcgacca ttctctttca atcgcctgag ttgaatcact gcttcattgg aggagcttct 180 gcccgctgtt tcatctagct tccagttggc cgtctactct cgcctcatcc atcagtatgg 240 ccggtgtaaa cggagacaat gaggttcctg aagaagagac gaccacgcct ccaacccaaa 300 aagaccttcg agatgctact acctggaaga agcaacggag tgccgtacgc cagcaaatca 360 ctaagacaat acggtacctc ggcaacctgg tcaacgaaag aggttccaga ggatccattt 420 ccagcatgat gaaacacctt gagaatctat tggccacagc agccaagatc cacactaatc 480 tatcgacggt tgaagatcag gccgaaaatg atcgccagga cgagttccat cttaagtatg 540 tcgaactggc tggagacgcg ctggagagag gccagcaata tctgaattcg cgattaggcg 600 agcccccgtc tgtgattgca ggcaagaagg cgccatctat ttcaccttcc gaacaaaatc 660 gtcaagaaga agaacgcaag ttggcccaac aaagagcaga tgcagccatt ctagaagcag 720 acgaagcccg tcgtcaagca aacgaagtgg ccaaccgtgc ggatcaggcc cgacgtgaag 780 ccactgcagc ccagcaagcc ttgcaagcct tgcaagactt agagaatgat gacgatgaca 840 ccagtgcagt cacgaacggt tctcaacacc tttctcaact tggtcaagat tggagacaga 900 accaacgccg gcttaattct tcgacttcac cagcagctcc tgacgagtgg atcgaccgtt 960 atgctgccgg ccagctgaag ccaaacgtga cagcgagttc ccgttcttca gtcaagacag 1020 agttagaaat ttattccgga aggtccctcg actggtttga atggatagat ctgtacagag 1080 ctcttgtaca cgattctgga aaatccgctg gagaaaaatt ggtcattctt aaacgtcatc 1140 tgaaaggcga ttgcctcgac ctagtccaag gtctcggagg tggcgagtca gcctacatcg 1200 aagccttagt ccgtctcaaa cagagctgtg gcagacggga tgttatgcga gctgctactc 1260 tacaggccat cgataaagtg gaacttaaaa atgatccggc catcttcaag cgttttgccg 1320 agcggattag gactcatctt tccgatctca gccgtatcgg agaatccaac gccccagatt 1380 tgatcgagaa aattagcatg agacaccaac caccagaccg tcttgagtgg aaccgcggac 1440 gacgaggagg attggaaacc agaagtctca acgcctttga ctcctggctg tgtgagagag 1500 cggcagaata ccaaaacgca tacagcattg catccgagca gaatacttct tccgtgctca 1560 agccacctcg gtctcatgct cgcagccacc cggcttcatc cgccaaggca cccgacaacc 1620 actcagcacc caaggtcacc ttccggccat tatgtttcaa gtgtgaagga gaccacaaat 1680 aggaaaattg tagccggttc aaagctttaa tcgttgcaga tcgagttgct ttctgtgcca 1740 agcacaggtt atgtttcgga tgtcttggag cgaaacattc cgtccggaat tgttttacga 1800 agaagccgtg taagatcgct ggttgcagcc tgcatcatca tgaattggtc cacgaccccg 1860 acagaccagc cactggatcc gtgccggatg caccgaggac caaagagtcc gcaacaagtc 1920 gccatgggaa tgatgcggtt gaaagtttcc agtgctgaaa acggctcggt gtggcgaacg 1980 tcttcatcga cgaaggaagc gactctacgc tcatgcggca aagtctcgcc agtgctaaca 2040 ggatttccgg tgtgcatcag attctgaccg tggaaggcgc cggcggtata gtcaaacgct 2100 accgttctca acgtgttaat ttccagattg acacgattta tggtgagaag ttgaatcttt 2160 tgtgctctac cctgcccact accgtcgcga gcactacccc agttacggat tggggaaatt 2220 tgaagaaaca ttggtcgcat ctggccgacc ttcctgttgg agagacgggc ggcagagtgg 2280 atatcctgat cggcaacgac tactcgcacc tcatcgtagc tttagaatcc agggtcggaa 2340 atgactatga gccgaccgcc atcagaagca gactaggctg gatcatccgt ggtgtcgtca 2400 gcgatggtgc ttccgtcaca gccgtccgga ctcacaccgt caatagttcg acgcagctgg 2460 aagagattgc gtccgaacta cgccgtttct gcgacacgga gaattttgga acggagtcga 2520 agacgaaagg aatgtcagac gacgatcgac aagccatcgc aattttagaa gctggaacaa 2580 agaagctgga cgtaggctac gaggttccca tcacctggaa gacggggtag ccagctttag 2640 tttgcaacaa gcagatgtct caacaacgat tcggcggtct acttcggcgt ttcagcagag 2700 agccagcgtt cgagaaggat tatcgtgctg ccgttcagaa aactatcgac aaaggatatg 2760 cttccgtgtt atctgaagag gaagcagagt ccgccaaata ctttctcgca catcacggtg 2820 tttacaaagg cattaagtta cgtgtcgtct ttgatgctgc agcccccttt cagggaaagt 2880 gtttgaatga cgccatactc agcggtcctg ctctcgtacc ccctcttcca tccgttttaa 2940 ttcaatttcg tgaaggagcc atagcctggg cttcagacgt tgaagcgatg tttagccgtt 3000 tcaggcttaa tctctccgac gcaaattatt tttgcttttt gtggcaagtt tccccaccga 3060 agacagtcgt ttgccggatg gatcgattgc cgtttggcgc tacatgttct ccctttatcg 3120 ccattcaaac tacccgccga gcagttgctg atgccggagt aggagaaaag gcggtcgaag 3180 cagtccagaa gagaatgtat gtagacgatt atctagggtc cgcaaagaat gccgacgaag 3240 gcgttgcaga agcaacaacc gtgagaaaag ctctggccgg agccgacctt aactttcaag 3300 attggatttc caattcagca gaattcgttg cagccatgca ggaagaaaag aagccgattc 3360 cagaaatctc cagtctttcc gcagacagtg aaagcacgaa agtgctcggt acggtgtgga 3420 acaccacctc agacgctcta ggcttccgga taaatcgccc ggcggatgaa gaatacacac 3480 ggctcagttt gactagccac gtcgccggaa tattcgatcc gcttgggtta gcagctccga 3540 tcatcatcaa agccaaggtc cgccttcgcg atttagtcgt caagggacta aaatggtccg 3600 accctgtaga aggaggtgat cgagcgtggt gggagtcatg gttccagatc gtccaagaat 3660 tagctcatgt gtccatcgag cgctgcttat ttcccgaaga ggatgatatc gtcgagtctc 3720 agctgcacac gttcggagat gcctcagagg aggcttacgc tgcagtagtg tacatccgca 3780 accagtaccg ctgcggtaaa attattatca gaattgtgaa ggcaagtagt aagttggccc 3840 caaagaagag tttgtcagtg ccgaaattgg agctaaatgc cgccttattg tccgcccgcg 3900 ttgccgccgc cgtccagcac tgtctctccc attccatcgg tcgcagatat ttttggacgg 3960 attccagtac ggtccgtaat tggataaggg caaccgcctc cttttaccaa gtattcgtcg 4020 ccaaccgcgt aggagagatc cagacgttaa cggagagcga cgaatggcga tttattcccg 4080 gcaaactcaa ccccgctgac gctgcaaccc gttccgccat cggagaagaa atatggccaa 4140 cgatctggca agacggcccg gaattcctgc tacaacctga atcgacctgg cccacagatc 4200 tgccgtggat ggctgcgacg gacgaactca agaccaccaa attgtattcc gtccagtcca 4260 atccggatcc gtttgattgg tcagcagtca gtttggacca tactaacctt tcaactttct 4320 taaaattaga aggagaaacg aagaacctcc ttcaacgatg tcaattagaa gcctttccgg 4380 aagacatcgc cagacttaaa cgaggaaagc cacttcgttc ctcttcccat ttactggtgt 4440 taagcccaaa gctcggcgaa gatggaatct tacgtctagg cggccggatt gatagagcga 4500 agctgccgta cgatgctcgt caccctccgt tactgcccag caagcacccg ctcacggaga 4560 aaatagtcca agtggtgcac ggccaaatgc atcacgccgg aaccgattat ttatttgcca 4620 agctgtgtca acatttttgg atcattcgcg ggcgagagct ggtcaagaag gtgcggcgaa 4680 tttgcccaac gtgcatcaag gagagagcgg ttccagccac tcaactgatg gccgatctac 4740 gagccgtgcg acttgattcc tattctccgc ctttcttcca cactgcaagc gactatttcg 4800 gcccaataga gacaagtcct ggccggaaca gagtgaccaa acggtacggt gtcctcttca 4860 cttgccttgc gacaagagcg gtacatttgg aattagccga atcattatca tctgaagatt 4920 ttctactggt cttccgtcga ttcatcggcc ttttcggaaa acccgccacc atccactccg 4980 ataatggaac gaacttcgtc ggagcggagc gtgagctgaa cgatcttgtt gatcaactcg 5040 aaaaggaccc gaagctgtcc caattccgaa aagagaaggt gattgattgg tatttccagc 5100 ctccacgagc accacacttc ggcggagctc acgaaagctt ggtccgttct acgaaacgtg 5160 ccctgtacag agccttagat cttgagaaga aagctctgcg ctaccccaca gacgagatgc 5220 tgcggactct atttgccgaa atcgctggat tcctcaattc tcgccctctg acctacgcta 5280 gttcggatcc agaggacttt cggcccttaa cgccgaatga ctttttgaat cgcccaccta 5340 ctttcgattt actgccaggc gaattcggtg acgccctgcc ccgagaacgg ttccgttacg 5400 tccagaagat ggcccagttg ttctgggatt tatggactaa attgtatctt ccgacactcg 5460 tccctcgaaa gaagtggcaa tcggttcaag ggaaccttac tgttggcgac gtcgtcctcc 5520 tgctggatcc caaccaaccg cgcggctcct ggaaaatcgg ccacgtcaac caaacttatc 5580 cgggatgtga tggactagta cgcgtcgtca aggtgaaaac agaagacggc gagtacaccc 5640 gagcgattca ccgcctaagt ctcctcgaga aagctgccgt agccgatgtt cctccggaag 5700 agtctccttc ccctttatca aaatagtccc gatgtttcat tatctttctt ttctctcttt 5760 gatttttgct catctgttcc ttctttattc gtcttgtcgc cggatgtaac agattcgggg 5820 ggagaa 5826 // ID TTAA13B_AP repbase; DNA; INV; 595 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 0) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA13B_AP. XX NM TTAA13B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-595 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2078-2078 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 595 BP; 202 A; 99 C; 108 G; 186 T; 0 other; gggggctcca gcgtatcatg ttttaaggag taatatttag gcaattcaca tgacatgaac 60 atttgggcta tgtctatgtg tgagtagtaa atgaacatta gtgtgtgttt ccgagaatcg 120 ctgtaatgcc gcgtcgggca accgccgccg gatgcaactc tcgcacgcgc acgcacatgt 180 caatacttcg cgaaaaatga aatatattct tgtttgcata actataaaaa ctactaaaag 240 cggtagtaaa ttaaacatac ttcctgaagt tcgattgtaa tttcgaaaaa atggttgaat 300 ttataagctt ctagactatg ctctttagga atcacttttt tgatttaaaa tcagatgtgc 360 ccaaaataaa tacaaatgta atgcaattat ttcatgggat ttttgaaaaa aaatcttaat 420 tttaaggtcc tttaaaattg aaaattgaaa atcgacttcc taaaaagcct agatattttg 480 tcaaagagtt catacatgta taaaaaatta aaatcggaca ttccgaagta ttatgtgtaa 540 tttaccaccg cttattggtc taaaacgtat gaaaaatacg tgtgctgcga tcccc 595 // ID Gypsy-30-LTR_NVi repbase; DNA; INV; 1815 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-30-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1815 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 997-997 (2009). XX DR [1] (Consensus) XX SQ Sequence 1815 BP; 619 A; 395 C; 381 G; 419 T; 1 other; tgtgaaatgc tcactttctt tcgcattctt aatatattaa aaacaagtaa gattagttat 60 tataaaggaa atcaatcgat acccttccta ctagaatgca gtgtataata agatatagaa 120 aactcgacaa attcaataat taattgaatt aataagaata aggaacgaca acgtaaggca 180 aagtaaacat gaaaatataa ttaaatccga agtaaaatca ttgaggaaaa tcaataataa 240 tagtaaattc acgtagaaca atcaggataa aataaaaacg cacgcacaca ttgcggctct 300 acggcggagg cataagacca agcgaaaaac accgatatga acgcacatat tacagtccta 360 cagcggaggc acaaagctaa gcgaagcaca ctaatatgat cgcacacatt atgcacacac 420 agatcacgac tccacggcgg gaacacagca gacgcaagta gaaaaaaaca agaagcctac 480 tgtacatcat ggaccgcgca tacccgaaag cgatgcacgt gcaatgaagg gaaaacccac 540 ggagcgcgca gcgctatcgg caagtgacac ataagtcacg cgcgctggaa gctcgaggag 600 ccgccaaaga ggacaccgcg aaagtcggct caaaatgcac agaggaagcc tatgagcctg 660 cagataagga aggtaaatca gaggcgagtt tgacaggaga agagcacgag gggactcgat 720 gtactagagg agaggatttt tactctggtt ttctgattgg ccctccctgc aacgtcacat 780 gcattaaacg ctccaaataa ggcaagaagc tgcatctagg cctcctaatt ggccaaacac 840 ccaccccgcc gaggagacga aaaatcggag atcgcctctg ctacgagagc ataaaagagc 900 ccagcaacag gtcctcacgt cctcagtcta ctctaaaaat catcaccacg cgcaagctct 960 gcccgaattc ggttcggtgt ttcgcgactt agaataattt cggtgtcctg gaacttagaa 1020 tattttcgac cgttcgggac ttagaatatt ttggatagta caaaaactta ggatattttc 1080 tttctatcat ttaaaactta gaagaatttc ggccgtgcaa cacttataaa atcttcactt 1140 tatcaaacta gcatacttac gttgttgttc gtcattcttg acaaaccaaa ataattaaca 1200 aaacgtcaga tagttgcgag cgagcgttat ttacaagttg tttaaaattg aataaaataa 1260 gaaattaaaa gcgaaatcgc gagtgtattt tgaacctgga aactcctgac tccttgcgga 1320 ccaatccacm atcgaattta ctccgcttcg gaagctgagc caaagctgct gagaggacgt 1380 gctggtagcc gagcagtcca gtagcagaga atagaaaaag gtaagtgact ttaattaaat 1440 aagcatgcct caatcgtgca cttctctctt tctctctcat gattagaaat ccaagtagcc 1500 tctcgaatat cacgaattcg cgataacgat atcacgatta gagtcacgtc cattcgacga 1560 attacccgaa aagcagcgga atcaaaattt aggagtttaa attaaatttg tgcgtgtaac 1620 gggaactagt gcaatttcga aagattttcc ggcggctact ggaggccgtt ctattaaata 1680 gatcgcgaag gtagtcaagc accgacggcg ctcagcttcg cgctctgaag ctttgtcctc 1740 gtccgtgctt atgaaagcaa aataacccgt agagttactg tccggtgaca gctttcaata 1800 gcttaggaca taaca 1815 // ID L2-2_NVi repbase; DNA; INV; 3427 BP. XX AC . XX DT 15-FEB-2009 (Rel. 14.02, Created) DT 15-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3427 RA Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(2), 479-479 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1119..3104 FT /product="L2-2_NVi_1p" FT /translation="MNSIETDLEGALCXLNNNIKLAIDELAPLKTVCPRRK FT YAPWSGPELLFDEVLRLTNEVDMRSAQERESFLRQLSDALDENRNIWKVMR FT HLGLLPGRKEEEFFGFTPGELNEHFAGVSVSPLENIDNATDTILLASEEGF FT TFSPVSMSDVVLAISHFSSQARGVDGIPCGVXVKALPIIGEFILNLFNCSF FT AQGIFPSIWKQAQLIALRKTSAPSNVKDFRPIALLCFLSKVLEKIAHAQIT FT EYLNKNHILDPFQAGFRKHHSTQTALLKLTDDVRMAIDKKKVTLMLLFDFS FT KAFDTISPSKLLSKLRQLGFSRAALLWIKSYLQGRSQMVISNKNGTSEWLE FT TNLGVPQGSVLGPLLFSLYVNDLQNILDGNTIKHLFYADDLQIYLHTNKDN FT FLDGVARLAEAARLVSGWAESSGLRLNSGKTKSIFFGSMKNVNDIKSWNLP FT GVPLPDGVIVPFSETVVSLGVVLDSKLTWKPQVDAITKKVNKALYSLRFIR FT GCTTETLRRRLVETLIQPHIDYCTVTILDASNEQRIRLQRLSNSCVRFIFG FT VRRDEHISPYRRRLEWLRTDSRRLYFEAILLYKXIRIGEPSYLASFFNKHK FT PRPSSRGVPPELSIPTVSTETGARAFQVQGARFWNSLPSTLRNLPSLASFK FT GALRRHLLATES*" XX SQ Sequence 3427 BP; 834 A; 822 C; 800 G; 936 T; 35 other; gtctatttaa atgaaatgtt gtcccaggaa aaattctcga ggaggcccat tgcacaggga 60 ctgggcttca aatatgtctg gcatgctagg cgtagggggg gggggcgagc gcgcgcacgt 120 gtttgcatct gctgccgatt tgcaggctat tctcacggcg tgtcaggccg gctcgaaaca 180 acctcgacga caaaacacac gcatgactgt cactgaaaag gacgaaaact ctgacgctac 240 ttcaagtaac agacctgagt cggacctggc aagcaaatcg gctgcaactt tgccagcatc 300 gatgccatca ttgacgactg tatcatctgc atcaayatcc atacagtcgt catcatcgtc 360 atcatcatca tccgcgccct ctatatgact cgcgaggccc cctgacaatg cgccgggtrc 420 tgggagtggc ttgagggcgg gctttctcaa tgtgaactcg ctcagggctc gcattgagat 480 cgtccgtaat tttttgacag agcgcccctc gtttgacatc tttggttttg cggagacatg 540 gctgggtccg attgtggatg atagcctagt tagcatcggg ggattttcca tcattaggca 600 ggatcgaaat gtaaacggag gtggcgtcgc cttgtttgtg cgcaatggat tgaaaattaa 660 aaagcttgct tcatctgata ctatgggtct aggaaaacct ggaatcccag agtacctttt 720 ttgtagcgta caaaggggcg actcacctcc ggtactactc ggagttattt acaggccccc 780 gaaaattccg atgcaaaagg actcagacct ttttgatgta ttgcgggact tatgcgacga 840 atttagtcat aagatcataa tgggtgatct gaactcggat ttactatctg actgacgctg 900 ctactatcaa gcgtctgtcg agttatctct tcagatcatc cggcatggtc ccacacatca 960 tacctcatcg tctcacacgt ggattgacct tatcatgacc gacgaaacga cacgatcttg 1020 gactctaaga atgagtggct gcctagtttc ggcaagcatt gtgtcataga tgtctcgcta 1080 gacatctcga cctcttgtct tgttgtgact ggacagccat gaattccatc gaaaccgatc 1140 tggagggcgc tttatgcrtt cttaataata atataaaatt agcaattgat gaattggcgc 1200 ctttgaaaac ggtttgccca cgtaggaaat atgctccttg gtctggacca gagctscttt 1260 tcgatgaggt ccttcgactc accaacgagg tcgacatgcg ttctgctcaa gagcgtgart 1320 ccttccttcg ccaactatcg gacgcccttg atgaaaatag aaatatctgg aaggtaatgc 1380 gtcatctcgg cctcctgcct gggcgtaagg aggaggagtt ttttggtttt acgccggggg 1440 agctraacga gcacttcgcg ggagtatcgg tgtcacccct cgaaaatata gacaatgcaa 1500 cggacactat tttacttgca agcgaggagg gctttacttt tagccctgtt agtatgagcg 1560 acgtcgtcct tgctatatca catttttcct cgcaagcrag gggtgtcgat ggtattccgt 1620 gcggggtgrt cgtcaaggcc ctcccaatta ttggcgaatt tattcttaat ctatttaact 1680 gctccttygc tcaggggatc tttccgagta tctggaagca ggcacagtta atcgcgctaa 1740 gaaagaccag cgctccttcm aatgttaagg actttcgtcc aattgcaytg ctctgcttcc 1800 tctcaaaggt actcgagaag atcgcgcacg cccagatcac ggagtatctt aataaaaatc 1860 atatactcga ccccttccag gccggtttcc gaaaacacca cagtacacar acagcgttat 1920 taaaactgac cgacgacgta cgcatggcaa ttgacaaaaa gaaggtgacg cttatgctgc 1980 tgtttgactt cagcaaggcc tttgacacca tctcaccttc maaattacta tctaagctga 2040 gacagctggg gttctctagg gcggctctcc tgtggattaa rtcatattta caggggcgat 2100 ctcagatggt aatttcgaat aaaaacggaa cytcagagtg gcttgaaacc aatctgggag 2160 ttccacaggg ttctgtcctg ggacccctgc tgttcagcct ctatgtcaac gacctgcaaa 2220 atatactcga cggcaacaca attaaacatc ttttttacgc ggacgacctc cagatctatc 2280 tacacaccaa caaagataat ttcctggacg gcgtggctcg actggcagaa gcggcacggc 2340 tggtttctgg ctgggcggag agctccggtc tgcgactcaa ctccggcaaa actaagtcta 2400 twtttttcgg gtccatgaag aacgttaatg atattaartc atggaacttg cctggggttc 2460 cgttgccgga cggagtgatc gtcccattca gtgagacggt cgtgagtctt ggtgttgtat 2520 tggatagtaa acttacttgg aaaccgcagg tggatgctat cacgaaaaaa gtcaacaaag 2580 ccctgtatag cctgcggttt atcaggggct gcaccactga gactctccgc agaaggctag 2640 ttgagacact tatacaaccr cacatcgact actgtactgt gactatcctg gacgcgtcta 2700 acgaacaacg aatacggtta cagagactga gcaattcatg tgtgagattc atcttcgggg 2760 ttagraggga tgaacacatc agtccttacc ggaggcgtct agaatggctt cgcaccgatt 2820 ccagaaggct ctactttgag gccattctat tatacaaart aattcggatc ggtgaacctt 2880 cgtatctggc ctcatttttc aataaacaca agccgagacc ctccagtagg ggagttcctc 2940 cggaactgag catccctacc gtgagcactg aaaccggggc tagggccttc caggtccagg 3000 gtgctcggtt ctggaactct ctcccttcta ctctgcgaaa cctaccatct ctcgcctcct 3060 tcaagggggc actrcggcgg cacctgctgg ccaccgaatc gtgacgactc tgagccctct 3120 ggcgctgctt cttwattttt attttttaat ctagtcagac tcggcccttg gccgtaaatt 3180 mtttcagtat tacgacaatg atatagmywt aaawcgtgca atatttttct ttttccwaac 3240 ngaggtgtta ycactctttt gtgctaaatg tatratatta rattgatata ctgtgataac 3300 tayctgacta ctgtgatatg gctttgtgat attattttta tatattttmt tagtttctgn 3360 gtattcttat gtgatctatt gtatttgctt tgaataaaga tctctctctc tctctctctc 3420 tctctct 3427 // ID Copia14-NVi_I repbase; DNA; INV; 4338 BP. XX AC AAZX01010927; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia14-NV; KW Copia14-NVi_LTR; internal portion; Copia14-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4338 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1161-1161 (2007). XX DR Genome; AAZX01010927; Positions 5264 927. XX CC Positions [1677-2177] - Integrase core CC 'AGAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1179..2903,2907..4214) FT /product="Copia14-NV_I_1p" FT /translation="MRDEYVLLGDDTRVKVEGRGKVEILRLVNGQWLEGEI FT NNVLYAPSLKKNLLSAGVCTALNYAVVLAGDQACVYSKNNKLIVQGLKQRN FT NLTTMFIKSKCAEVNRVCIASTKLWHERLGHVSTSTIEKMVKCEAVSGISL FT DKENDFFCDDCPSGKQFKLPFKTHEKSAKVNPVDVVHTDLCGPMQTTSVSG FT ARFFVLFKDESSGFRSVDFLKHKSDTYDALKNHMLHVKNHFGKDVKIVRAD FT NGTEYTDKRVKDFLSSQGIKILFSAPYTPEQNGRAEREMRTIVECARTMLL FT ARGLPRRLWAEAVNTAVYTLNRTIGAMSPTKTPYELWFNKKPDLSHMRIFG FT SDAYVLVPDVSRKKWDAKSVRLTLVGYENDSQNYRLFDSNTGRITVSRNVL FT FNENSWKNAADGATSIPIWDDTLNYDNVQEQDERVIDGGDERVADDDNGRE FT IGENAEPAVKVRGRRHNECGANAEAARDDQPQVERVRYDLRPRENIRSPNR FT LICQVAYDKLYETPENYRDALNCEDAPFWEKAIKEELDAHAKNKTWKIVKK FT PEGKKLVGHKWVFRIKSTKETDQCRYKALCAQGFTQEAGVDYSETFSPVVR FT YESVRILLSIAAEEKLCSLHFDVSTAYLNSKLNEDLYMRVPDGLNVDCKKF FT ALKLEKAIYGLKQAGRCWNERFDTFIKSLDFVQSSADKCVYTGLFKRCKVY FT LALYVDDGLLLCRDQNVLNELVGVLMEEFDITVTNLNFFVGMEISQHDDCI FT FISQEAYINKILKRFEMLDANEVKTPADPSEKLTYPIVKDGLKVPYRELVG FT SLLYLSVISRPDITYAVGVVSRYLDNYDESHWRAAKRILRYLKGTRSLGLL FT FCNNRANSGLVGYSDADYAGDIATRRSTSGYVFLLNGNCVVWSSRRQSTVS FT LSTTEAEFIAASEATKEAIWLRKLLYDIGRQCVGPTILLVDNQSAIKLTRN FT PEFHRRSKHIDARYQFICEKQRDGLIDMLIRTSSVQTCSRRLLVLKNSTQC FT VLRLM" XX SQ Sequence 4338 BP; 1294 A; 917 C; 1080 G; 1047 T; 0 other; ggttatgggc ccagccacgc agcggctcga ggaaggaagt ttctttacgc gaagtgttca 60 gtgagagtgc tgaaaaactt tttggacata cgtttgttca agcaacgtac tcctcgctcg 120 tttcttttgg tttgacaagt tcacgttatc ttggcaaatc aacgacaatg tcggacatca 180 gctcccagga tttgcgaaac gtgacgaagt ttaacggaca aaatttccaa ttgtggaaat 240 ttcaaatgag agtaattttc gtgtctcaag gccttatcaa gattgttgag ggcgccgaag 300 ttaaacctga agacaccgag gcaaacgccg ctgcaagggc ggactggatc aaacgagacg 360 cgaaggcaat gttcatattg tcctcatcga tggattacag tcaactcgac tatttgatta 420 cgtgcacgtc tgccaattcc atgtgggaaa aactttcgag catccacgaa caaagaagcg 480 ctgcaaacaa gctctctttg acgacgaggt tccacgaata caaaatgacg ccaagcgact 540 cagcggccca gcacatggcg aaagtggaga acctcgcgaa tcagctcaag gatgtcgggg 600 agacgatttc tgatgtcatg ataatggcaa aaatcatcgg gaccctgccg tcgaagtaca 660 gcccgttcat ctcagcatgg gacagcgttc cagaggcaga tcaaaccctg acgaagctac 720 gtgagagaat tctacgcgag gaatcccggc tctaaaacgt ggacgacgtg accagcgcgc 780 tcgcgaccac cagcttctct aagaacaagg gcaagaaaga ttcttcatct aacaacggta 840 atcaaacgaa aagtaagaaa gatattatat gttacttttg taaaaagagc ggtcacatcg 900 taaagaattg taacgagcgt aaaaataaga aaaaaaaagg caatgcgcct gacgactgta 960 agaatccccc caaggacggg caagattcca caaactttag cacgttagtc gcctcgtcga 1020 atcgcgaaat ccgtggaaca gaccgaaaac aaattttgtg ctcgagtcgc gacgagagtt 1080 ggttccgaga aacgagcgac gaccagctct ggattctcga tagtggcgcc tcacgccact 1140 actcgtgccg acgcgagtgg ttttcagaat tcacgcctat gcgagacgag tacgttttgc 1200 tcggtgacga tacgcgtgta aaggtcgaag ggcgtgggaa ggttgagatt ctacgattag 1260 taaacggtca gtggctggaa ggcgaaataa ataatgtttt atacgctccg tctcttaaga 1320 aaaacttgtt gtcagccgga gtatgtaccg cgctgaacta tgctgtagtt ctcgccggcg 1380 atcaagcgtg cgtttattct aagaacaaca agctcatcgt acaaggcctg aaacagcgta 1440 acaatctgac aacgatgttc ataaaatcga agtgcgccga ggtaaaccgc gtatgtatag 1500 cttcaaccaa attgtggcac gagcgtctcg gccacgttag cactagcacg atcgaaaaaa 1560 tggtcaaatg cgaagcggtc tcgggcatat cgctcgataa agagaacgat tttttttgcg 1620 atgactgtcc atccggaaaa cagtttaaac taccatttaa aacgcacgag aaaagcgcga 1680 aggtgaatcc cgtcgacgta gtccacaccg acttatgcgg gcctatgcag acaacatcgg 1740 tcagtggtgc tagatttttc gtgttattta aagacgaatc atcggggttt cgaagtgttg 1800 attttctgaa gcataaaagc gacacgtatg acgctttgaa aaatcatatg ctgcacgtca 1860 aaaatcattt cggtaaggac gtaaaaatag ttcgcgcgga caacggtact gaatacacag 1920 acaagcgtgt caaggatttc ctgtcatccc aaggcattaa gattttgttc tcggctcctt 1980 acacgccgga gcaaaacgga cgagccgaac gcgagatgcg taccatcgta gagtgcgctc 2040 ggacgatgtt attagcgcgg ggcttgccgc gtcgcctttg ggccgaagcg gttaataccg 2100 ccgtgtatac actcaatcga acgatcgggg caatgtcacc gacgaaaact ccgtacgagt 2160 tatggtttaa caagaagccg gatctgtcgc acatgcggat ttttgggagc gatgcatatg 2220 tattagtccc agacgtttcg cgaaagaaat gggatgcaaa atccgttcga ttaacccttg 2280 ttggatacga aaacgattcg cagaactaca ggttgttcga ctcgaacaca ggcaggatta 2340 ctgtttcgag aaacgtgctt tttaatgaaa actcgtggaa aaacgcggca gatggagcga 2400 cttcaatccc gatttgggac gacacactaa attatgacaa tgttcaagaa caagatgaac 2460 gcgtaataga cggcggcgat gagcgcgtag cggacgatga caacgggcgc gagatcggcg 2520 agaatgctga gccagctgta aaagtgcggg gcagacgaca caacgagtgc ggagcaaacg 2580 ccgaggcagc tcgcgacgat cagcctcaag tcgaacgagt tcgatatgat ctgcgacctc 2640 gcgaaaacat tcgatcgccg aaccgcctaa tttgtcaagt cgcttacgat aaactctacg 2700 aaacacctga aaattacaga gacgcgttga actgtgaaga tgctccattt tgggaaaaag 2760 ctataaaaga ggagcttgac gcccacgcga aaaataaaac ttggaaaatt gtaaagaaac 2820 cggaagggaa gaagcttgtt ggtcacaagt gggtttttag gattaaatct acgaaagaaa 2880 ctgaccagtg tcgctataag gcatgactgt gcgcgcaggg tttcacgcag gaggccggtg 2940 tggactatag cgaaacattt tctcccgtgg ttcgttacga gtccgtgcga atattgttat 3000 cgatagctgc agaagagaag ctttgttctc tacatttcga tgtaagcact gcgtacttga 3060 acagcaaatt aaatgaagat ttgtacatgc gagtccccga cggactaaat gtagactgca 3120 agaaatttgc gctaaaactt gagaaagcaa tatacggttt aaaacaggcc ggtcgatgct 3180 ggaatgaaag attcgatact ttcattaaaa gcctggattt cgtgcaaagc agtgctgaca 3240 aatgcgtata cacgggacta tttaagcgtt gtaaagtgta tctggcattg tatgtagacg 3300 acgggctttt gttatgtcgc gatcagaatg tactcaacga attagttggc gttttgatgg 3360 aagaattcga catcaccgta acaaacctga atttcttcgt cggcatggag atttcacagc 3420 acgatgattg tatttttatc agccaagagg cttatattaa taagatttta aaacggtttg 3480 aaatgttaga cgctaatgag gttaaaacac ccgctgaccc tagtgagaaa ttgacgtatc 3540 cgatcgtcaa agatggactg aaagtccctt atcgcgagtt agttgggtca ttgttatatc 3600 tatcagtgat ctcacgacct gatatcacgt atgcggtcgg tgtggtcagc aggtatctcg 3660 acaattatga tgaatcgcat tggcgcgcag ctaaacgtat cttgcgatac ttaaaaggga 3720 cgcgttccct cggattatta ttttgtaata atcgcgcgaa tagtggactg gtcggatatt 3780 ccgatgcgga ctatgccgga gatattgcaa ctcgtcgatc aacaagcgga tatgtattcc 3840 tgctgaacgg aaattgtgta gtctggtcgt ccagaagaca aagtactgta agcctcagca 3900 ccaccgaggc agagtttatt gcggccagcg aggcaacgaa agaggctatt tggcttcgta 3960 aactattata cgacatcgga cgccaatgtg ttggaccaac gattttattg gtggataatc 4020 agagcgcgat taaactgacc cgaaacccgg agtttcatcg cagatccaaa cacatagacg 4080 cccgttatca attcatttgc gagaaacagc gtgatggttt gattgatatg ttaatacgaa 4140 cgagcagtgt gcagacgtgt tcacgaaggc tcttagtttt aaaaaattca actcaatgtg 4200 ttctacgatt aatgtaacca gtttgtaata ttcagtttat tatattctta taatttatct 4260 gtggttttca atattgtact caatcgaaag ttctatgtaa gaaaaggaga gacatcaaaa 4320 acactcaaat agtggggg 4338 // ID BEL-240_AA-LTR repbase; DNA; INV; 498 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-240_AA_; KW BEL-240_AA-I; BEL-240_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-498 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 935-935 (2011). XX DR [1] (Consensus) XX SQ Sequence 498 BP; 198 A; 95 C; 80 G; 125 T; 0 other; tgtgcgggag taagcccacc gtgcgcacca ccactgctac gattgaccaa gaggaccttg 60 cataccgttt acttcacgac cgagtagctg tcaacgatcg gtaaaaaacc gaacaaaaac 120 aaattgcaaa cggaacagtt aaaacatgcg tttctaattc taaatttgtt attctaagac 180 acgaattaca ttaaattgtt aaaattgtat tttttggatt attctatgaa tttgtaagta 240 catactacaa cacaacatgc agaaaatgga atgaattaac atttatacaa catatagaca 300 gattaaaacc aacataaacg ttacggacag tgacagacag attaaaagga aggactaaaa 360 acgtaagtca cattaaaacc taaaaattga aattgtaaaa ctaatttaaa taaaatttac 420 agcttaaagc attctcctat aaaaagcgag tgctaaaaag gcgtccgaaa cattcctgct 480 gtcgcagcca ccgtaaca 498 // ID Gypsy-218_AA-I repbase; DNA; INV; 5596 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-218_AA_; KW Gypsy-218_AA-LTR; Gypsy-218_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5596 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1039-1039 (2011). XX DR [2] (Consensus) XX CC Positions [4597-5061] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1320..3113 FT /product="Gypsy-218_AA-I_2p" FT /translation="MQQLNTLVRNVRFYVTFKCFEYRMNIPSRIKPFDVHI FT ETSRLPSEWEVWKLDLESFFIAQRIETQSERRAQLAYLGGPGLQELLRHLP FT GVNQVPHVTLDPPFYDVAIRCLDEYFEPFRRKTFERHVFHQIRQTPGERFA FT DFVMRLRKQISRCGYDANVVDELIADRIAQGCASEDLRIKLLQKDRSLEEI FT VALGTSMVESAEHSKKFYQLSTLKPESHDINKVSERRWNWKPQNTKREGMF FT QQPRFEKRTSERQTDRFICYSCGRRGRMQGSDECPAKRTKCANCGRIGHWA FT KRCYSRGDSHKRRQQSPGRGPPQKRIRAITEEVEKQQSSDYIFFAMGRNVF FT MFKVGGVDIPMTIDSGADANVITSDIWEQMKESGVKVCEMTNEVDRTLIAY FT ASNEPMKMKGMFHADIEAGNRRTFSPFYVVENGQQCLLGDKTAKELQVLKI FT GLNVNSINSEPKKPFPKIRGFTVEIPIDKDVQPVQQPYRRPPIAMEEEIGK FT KLRALLERDIIERVAEPSPWVSPMVPVAKDSGEIRLCIDMRQANKAVLRET FT HPLPIVEELLGSVNGAVRFSKIDIKDAYHQVEISERSRPITTFITKYGLFR FT " FT CDS 3274..5562 FT /product="Gypsy-218_AA-I_1p" FT /translation="MFGVSCAPELFQKVMESVLAGLEGVVIYLDDVMVYGT FT NQDEHDKRLAAALQRLSEYGIMLNDQKCVYNAESLEFLGHELSAIGIRPKE FT SRLSAIQQFREPQNVSELRSFLGLVCYVGRFIPHLAAKTDCLRQLLRAKVP FT FKWTEKEKIAFNSIKTDICKIEHLGFFNPKHRTKLITDASPTGLGAVLLQE FT DMNGQTKIIAFASKALTDLERKYFQTEREALSLVWGVEKFKLYLQGRKFVL FT MTDCKALKFLFSTRSKPCARIERWVLRLQAYTYDVEHIPGNANIADAISRL FT SLSSPEHFDEPVDAFIRTVAEQSAVAAIEFGDIVKATQEDEVIQKVMIALQ FT TKSFDDVPREFKPFTNELCGIEGVLLRGTRLVIPRYLQHRVLEAAHEAHPG FT IAAMKRRLCQKVWWPGIDKNAEDFVKNCKQCILVGSIGVPEPIRRSRMPDN FT PWTDIAVDFMGPLPSGHSIFVMVDYYSRFTEAVVMKQTTAKRTVDALHETF FT CRFGMPETIRSDNGPQFISEELKSFCAQYGIKLLKTTPYWPQANGEVERAN FT RAILKRLKISQLSENTDWIWDLRSFLLMYNSTPHATTGVSPSKLMFGRILR FT DKIPVFGEMGRRADDEEIRDRDWTNKIKGAEYADARRGAKQSVFKEGETVV FT TKRMTKDNKLSSSFEPEECIIVSLDGPEATLRSTKTDRLFKRNVAHLKLFP FT RDAHAETMMTSEEDITVRAGTGEDHCTVGEEKSERANQTNPSTKLERPTRT FT HRKPTYLDEYRT" XX SQ Sequence 5596 BP; 1780 A; 1011 C; 1317 G; 1484 T; 4 other; tctggcgacg aggattgaaa ttcaaaaggt ttcgctgtta aaatttttca aaaaaagtga 60 ggaaatacca gttgtgagac tggagcaaag gctgaaggca gtgagtgaaa gaaaaagggg 120 cacagcatct gccctcgact tccagcagaa ggtaagcgac gaggttatgt acaacatgat 180 ggtagttgtt gaagagcttc gaaaacagtg ttagaatcgt cggggttttt tttttcaatg 240 gtgctaacgg tgtttttatg aacggtggta tgaccgacga agaatgagaa tgaaaagsma 300 ammaaaagaa atagtggtga aattttgtta tgtttgcgtt ttgtatggaa ctttgttttc 360 aattacgacg tcatgggagg gttgtgtacc cagatggata ttctactgct gcaaaataag 420 acacgtgaag gttgagaatg ataattgttt ttgggttgtt gatcttgtga tgagagtgta 480 cctaaattga gcatttggac gcgtttgtga gacggcagta ttgaataaat tgagaagaca 540 acggttgggt accaaaatac aagagctctg gttgtgtacc atacagtgga gttaaaggtt 600 atgtacctgt gtgcggatcc tttccgaatg tagtgaaggt tgtgtaccaa tggaggaaaa 660 cttttttttt tgctgtgtgc gttgaataaa atgagaatga aaatggttga gtaccataat 720 acaagagtcc tggttgtgta ccatataatg gaaaaggctg tatacctgtg tgcggattat 780 ttctgaatgt agcgaaggtt gtgtaccact gggggaaaga gttttggctg tgtgccatgt 840 gaaacagttc ttgttttagc tgtacaagaa aaaaatatgg cgaacaatac aagaaataat 900 agcgaatgaa gtgatttaca tatgaacata ttgctaattg aggaaaagcg actattaaaa 960 ggaataacgt aagtggttgg ggcgcggggg ggggcccgcc gccatgcgag ggtgacagaa 1020 tcatgcattt tagattaacg tttcgagctc ccaaggtaag tggaaatttg actttgtcgt 1080 ataacagata ttattacaga ttacagatac tatgattttc accgatatat gtagcaccat 1140 ttgaaaagaa aaaaaaaaca gttcaatgat tttttttaat tttaatgtaa ctaattctac 1200 gcatgttaca tgctcatgaa ttttattgtt attttttata tgaattaaat tgtacggagt 1260 tccgtaacgg atcatgatat gattctttgg aaacaaataa caacattatt gacggttcaa 1320 tgcaacaact aaatacactt gtaagaaatg taagatttta tgttacattt aaatgttttg 1380 aatacaggat gaacatacct agccgcatta agccattcga tgtgcatatc gaaacaagca 1440 ggttaccatc ggagtgggag gtatggaaac tggatcttga atcgtttttc attgcacaac 1500 gtattgaaac acaatccgaa agacgagcac aactggcata cttgggtgga ccgggacttc 1560 aagaattact gcgtcacctt ccgggagtaa accaggttcc acatgtgaca ttagaccctc 1620 ctttctacga tgttgctatc agatgtctcg atgaatattt tgaaccattt cgtcgcaaga 1680 catttgaacg tcacgtattc catcaaatta gacaaactcc tggtgagcgg tttgcggact 1740 ttgttatgcg actccgtaaa caaatctcaa gatgtggtta tgacgcaaat gtagtagacg 1800 aactcattgc ggacagaatt gcccaaggtt gtgcatcgga agatttgcgt atcaaactcc 1860 tccaaaaaga tagaagcctt gaagaaattg ttgctttggg aaccagcatg gtcgaatcgg 1920 ccgagcactc caagaaattc tatcaattgt ccacattgaa accagaatct cacgatatca 1980 acaaagtttc ggaacgtcgc tggaattgga aacctcagaa taccaagcgg gaaggcatgt 2040 ttcaacaacc tcgttttgag aaacgaacaa gtgaacgtca aaccgaccgt tttatttgct 2100 acagttgcgg tcgaagaggt cgcatgcaag gcagcgatga atgcccagca aagcgcacaa 2160 aatgtgcaaa ttgtggtaga attgggcatt gggccaaacg atgttattct cgtggggatt 2220 ctcataagcg acgtcaacaa tcccctggca gaggaccacc tcaaaaacga attcgtgcga 2280 tcactgagga agtcgaaaag cagcaatcat cggattacat ttttttcgct atgggtcgaa 2340 acgtgttcat gttcaaagtg ggaggagttg acatcccgat gactatagac tcaggagcgg 2400 atgcgaacgt gataacaagc gacatttggg aacagatgaa agaatcagga gttaaggttt 2460 gcgagatgac caatgaagta gatcgaactc tcattgcgta cgcatcgaac gaaccaatga 2520 agatgaaagg tatgttccac gcagacatcg aagcaggaaa tcgacgaaca ttttcaccat 2580 tctacgtcgt ggaaaatggg caacaatgct tgttgggtga taaaacagcc aaagaactac 2640 aagtattgaa aatcggctta aatgtgaatt cgataaattc tgaaccaaag aaaccttttc 2700 caaaaatccg aggattcacg gttgagattc caatcgataa ggacgtccag ccagtccaac 2760 agccctatag gcgccctccg attgctatgg aggaagaaat tgggaagaag cttcgtgcgc 2820 ttttggagcg agacataatt gaacgcgttg cggaaccatc accctgggtt tcgccaatgg 2880 taccggtagc gaaagattcg ggggaaatac gcctatgcat cgacatgcgt caggcgaata 2940 aagcagtgct gcgcgaaact catcctttgc ctattgttga agaactactg gggtcagtaa 3000 atggtgctgt gcggttttcg aagatagaca taaaggatgc ctatcatcag gtggaaattt 3060 ctgagaggtc tcggccgatt actacattta tcaccaaata cgggctattc aggtaaagcg 3120 agaacgttat atccaagttt tttcttttat tcagatgaat ttgaagttgt tttgtataat 3180 gttaaataga tgaactccca ttgaaattat ttgttttatt cttccaattg ttttttttga 3240 ccaccattgc tttcaaccag atacaagagg ctaatgttcg gtgtgagctg cgcaccagag 3300 ctattccaga aagttatgga atcagttctt gccggacttg aaggagtagt tatatacctc 3360 gacgatgtta tggtgtatgg tacaaatcaa gatgagcacg ataaaaggtt ggccgctgca 3420 ctacaacgac tatctgaata tggtattatg ctaaatgatc aaaaatgcgt ctataacgcc 3480 gaatcactag aatttcttgg acatgagctc tctgctattg gcattcgtcc aaaggaaagc 3540 agattgagtg ctattcaaca atttcgggag cctcaaaacg tatcagaact acgaagtttt 3600 ttgggtctag tttgctacgt gggtagattt ataccacatc tagctgcaaa aactgattgt 3660 ttgagacagt tgcttcgcgc aaaggtacca tttaaatgga cggaaaaaga gaaaattgcg 3720 ttcaattcga taaaaacgga tatatgtaaa attgagcact tgggattttt caatcccaag 3780 caccgtacaa agctaataac cgatgcaagt ccaacggggt taggagctgt ccttctacaa 3840 gaagatatga atggacaaac aaaaataatc gcttttgcca gtaaggctct aacggatcta 3900 gagagaaaat actttcaaac cgaaagagaa gctctatccc ttgtgtgggg agtagaaaaa 3960 tttaagttat acttgcaagg cagaaagttt gtactaatga ctgattgtaa ggctctcaaa 4020 ttcctattta gcaccagatc gaaaccctgt gccagaatcg aacgatgggt ccttcgatta 4080 caagcctata cgtacgacgt agaacatatt ccgggaaatg caaatatagc tgatgctatt 4140 tcgagactat cgttatcctc acctgaacat tttgatgaac ctgtagatgc gtttattcgt 4200 acagtagctg agcaatcagc ggtagcagca atagaatttg gtgacattgt aaaagcaacc 4260 caggaggatg aagtaattca aaaagttatg atagcattac aaacaaagtc gtttgacgat 4320 gttccaagag aattcaaacc gtttactaat gagctttgtg gaattgaagg tgttctgctc 4380 agagggacta gattggtaat tcctcgatat cttcaacatc gtgttctcga agcagcacat 4440 gaagcacatc ctggaattgc tgctatgaaa cgcagattgt gccaaaaagt gtggtggcca 4500 ggtatcgaca aaaatgcaga agacttcgtg aagaactgta aacagtgtat attagttggt 4560 tccattggag tgccagagcc gatacgtcgc tctagaatgc cagataatcc atggactgac 4620 attgcggtag actttatggg gccacttcct tccggtcatt cgatatttgt tatggtcgat 4680 tactatagca gattcacaga ggcagtagtc atgaaacaaa caactgctaa gcgaacagtt 4740 gatgcgttac atgaaacatt ttgcaggttt ggaatgccag agactatccg atctgacaat 4800 ggtccacagt tcataagcga agaactaaaa tcattttgcg cccaatacgg aataaaacta 4860 ttgaaaacga cgccatattg gcctcaggcc aatggcgaag tagaaagggc gaaccgagcc 4920 atacttaaac gtctaaaaat cagtcaactt tcggaaaaca cagattggat atgggacttg 4980 cggtcattcc ttctcatgta caattctact ccacatgcga ctacaggagt ttcaccatct 5040 aaactgatgt tcggtcgtat tttgcgggac aaaattccag ttttcgggga aatgggtaga 5100 cgtgccgatg acgaagaaat acgagatcga gattggacaa ataaaatcaa aggagcagag 5160 tatgcggatg cacgacgtgg agcaaagcaa tctgttttca aagaaggcga gacagtggtc 5220 acaaagcgca tgacgaagga caataagctt tctagcagtt tcgaaccaga agaatgtata 5280 atagtaagtt tggacggccc agaagctact ttgcgttcaa cgaaaactga tcgacttttt 5340 aagcggaacg tcgctcactt gaaactattt cctcgagatg cacatgcaga aacaatgatg 5400 acaagcgaag aagacataac ggtaagagca ggaactggcg aagatcactg taccgttggg 5460 gaagaaaaat ctgaacgagc gaatcaaacc aaccctagca caaagctgga gcgtcctacg 5520 cgcacacatc gaaagccgac ttatcttgat gaatatcgaa cataattgat tgtaggtcag 5580 taaagttaag ggggga 5596 // ID DNA-9-1_HM repbase; DNA; INV; 3061 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 2) XX DE Non-autonomous DNA transposon from Hydra magnipapillata- DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-9-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3061 RA Kojima K.K. and Jurka J.; RT "Non-autonomous DNA transposon from Hydra magnipapillata."; RL Repbase Reports 10(6), 842-842 (2010). XX DR [1] (Consensus) XX CC 9-bp TSDs. 66-bp TIRs with one indel. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 3061 BP; 1155 A; 427 C; 404 G; 1075 T; 0 other; tagacgatat ttcgtctgac tttttttccc gtgaagtttg ggaagtcaga cgagaaaact 60 ttggttattt aaaaaaatta aaaatggcga aaaatattta cgcttagcag gtttaaggac 120 gggggtaata aactttttaa aaaaaacaat tatcaaccat tttcaacaaa ttatcaacca 180 ttttaacaaa tttaagatac gtctaatata ttttgttaat ccgtattttt aactttcaaa 240 ctaaaaaaag acacattatt aatgtttttt ctgattgcag atttataaaa ataataagtt 300 catttatttt ttttcttact cataagtagt ggtcaaattc agtcagatca tcttcaaatt 360 taatttatta aattcaaaca ttttgttttg ctttttatat actcataagt tttggtttac 420 ttttatattg caacatggta tccagttttt aataaatgga ggataagttg tttgattttc 480 aatgcaatga tcacaatgtt tcattgctta aaaaaaaaaa cattttacca ttgccaaaaa 540 aaagggcaca accttatgag gttttcttac ctcctaattc attaagcgtg tgcattttat 600 gtctccatga tatcacacaa aagtattcat cctatgatga tggttctatg aaaaataatc 660 tatgggataa taagggaaaa ataacagaag ctggcctaca tatagaaatt tcataaatga 720 aaaactaatt tattcaaaaa actatcattg tatttgtaaa atatgctata caaatgtaaa 780 aagaacatta gcaaaaatac aaaaaaatat aaatgccatc tcaagaggtc gtgaaattct 840 taaagaaaac tcacaaaaaa tattgatatc tacaccaaca gcagaagaaa cgaatagtag 900 aatattttcg aacaaaaaaa tgaagcttga tcaagtttag aggttgttta tatttttgtt 960 ttgtattatt atacttttca tcattaaaat ctgttttgga atgaaattga acgatatggt 1020 ctatcaagat agcttttata ttttacacct ttttgacctt gttctttgtg aaagtaccaa 1080 taaattgatc ttacatcagt catagttgta gaaaagacta aagttctttt agctgaataa 1140 gtattcaaaa gaatatcttg aacaactggg tagaaaaatt tctctggatc tggtatttaa 1200 tatttcttct ttctggtgta gatgatataa taactaaatt atgcatacaa agattatcaa 1260 taatggtagc tcagttggtt aaagcactgt tataagttcg ggatagctca gttggttaaa 1320 gcactgttat aagttcagaa taattattgg ctgcagcagc aggattggag gttcaaatgc 1380 tggctccgcc tactcagcta ttacgatgtt tccttctcga ctagtaaata aataatctag 1440 tcgagtaaat gaaaattatc gtttaccatt tttgtttagt tgcagttgct gtcaatgcca 1500 attctggaat ccttttaggg acaagtgatc acaattctga gtctattatc atgagaccga 1560 aatacatctg attatgtact tgaacttccc ctataaaaat tacagacaaa taaggtataa 1620 atagaatcat gtaaatcaca agattttgct catataatgc tataggtaat gcattttttc 1680 tttcaatgag caaaattttt taattttaaa ataattttat atataaaaag aaggtctgac 1740 attaaagttg taaatttcaa tgtttactaa acaagcaaat ttatttatat aacaaaatca 1800 cttaccactt tttcacacaa tggctttcgt cagctgcaat tgcaataagt ttctgattaa 1860 aagcagagtc attgacaatt gatcaataag ttgttacaat ggcttcaggt gaacaaaata 1920 ttacatttgt ttcattgtaa tttatctaaa aaaaatagta tatactttta aacaataaac 1980 aacactccat aattaaaaaa aaaaatttat tcgaggcatg ttgaaattca atatcagaat 2040 aatataatat ttcctaataa atgcttaacg catttagaat taataatgaa atatcaaaat 2100 atcataaata tcataatgaa atataataat gaagtatcat aatgaaatat aataatgaaa 2160 tatcataata atgaaatatc ataattatct aaatgcattt agcattaaaa atttctcaat 2220 aaatgattaa ttattttgca ttcactaaaa aatattttac atgttttcta cagaaaattg 2280 aaatcaaaat tccaaagttt atgaacaaca tttcagaaaa aaatgtataa tttctcaggg 2340 taattaccca gcaaaaacta gaccttttct agaaaaatgg aaaactttat ttgttaacta 2400 aaacaatgta taataatgat aattattacc tctgtttgat catcttctat acttttttct 2460 ttgttgatta tgcttcataa aagctacttt gatttccttt ttagtaaaaa gtttttacta 2520 agaatgtttt cttgatcttt cattaaactc aaaagcggag aaacaactaa aacttaatta 2580 ttgcacaact taagtacata ttcatatcga atactatcag gtgcttgtcc ttttcttaat 2640 aagtttaaag tgaaggtaaa aaagggtaaa catcggaaca caatagattt tttactgcct 2700 gttggtttgc aaacaaaaca atctctgcta ttgctgagag ctattaaagc ttctttttgg 2760 taatccttca aagatttaat acaaaataag tcacaaacta ttttgaaagc atactctaaa 2820 ttatcctttt tcatttttta tattgcttct gaatagcact gataatttaa ttaactaaat 2880 tttggcgcat ttttaaaata aaaccccccg aaactttgcg atattacatt aagatcaaag 2940 aaaagccgca ggaacatatt tgttccggta gatgataaat tgcattagcg caaaaaccaa 3000 agttttctcg tctgacttcc caaacttcac gggaaaaaaa agtcagacga aatatcgtct 3060 a 3061 // ID I_Ele4G_AAe repbase; DNA; INV; 5682 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele4G_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5682 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1376-1376 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >96% CC identity. CC The consensus is 77% identical to I_Ele4. XX FH Key Location/Qualifiers FT CDS 353..1660 FT /product="I_Ele4G_AAe_1p" FT /translation="METDGESVSVVDSVSKXSSSTVKHIRVKTYPTTFLGP FT FVVFFRKKEKPINVLLISSEIYKLYKSVKEIKKISLDKLRVVFGSREDANA FT LLESKLFLNSYRVYAPCDSCEINGIIYDEALECDDIVNHGSGVFKNQAISP FT VKILECVRLSKLTFSDKGSSYTHSNCIKITFEGSILPDYVIIDNVKFHVRL FT FYPKIMHCDRCLLFGHTSHFCSNKPKCSKCGGVHPPSDCKKNSDGCIYCGK FT KHNLLKECSVYIAHQQQFNLKIKNKNKLSYSDVIKTSDGIASHNIFEPLSE FT NNCNENLKEETNNFVYKPPVKRKRNNKSNNHNHILEPQPSTSYDKNFPPMN FT SSNTTQNIPGFQKSNTXFSGNKFDGSNNIKNNSQDNGNKXNDANILNILED FT IIEFLGLNDFWKNLIRKFLPFLANILEKLNSFGPLISSLFCS" FT CDS 1663..5355 FT /product="I_Ele4G_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MAQXNNQSLNILQWNCRSIIPKVDRLKALLLNNGVDI FT FCLNETWLVDTKSLRIPFFNIIRRDRNISYGGVMIGIRENVQFKFLEFSLD FT SQIENVSISVKHNDIEFSIICLYIPPQARFSLQELKTILNSVPSPFYILGD FT LNAHHFAWGSDMTDGRGSLIMDLIDELNLNILNDGSFTRIAVPPARHSCID FT LSLCSNSLYIKSSWKIIDDPNGSDHLPIKISIHNPVRGQLSXEPYVHDLTK FT NVHWNKFSDLVSSALNKFDYSLSPLQNYDSFSXILIECLYKSQCKQIKLGP FT AKRKHISFWWDNDCTIALKNKSNAFKQFRRSGSREHYIFYCKTEAQFTRVT FT KFKKRDYWRNFVENLDPETSLSELWSVARNLRNYNIPSTSVLEYSEDWIDQ FT FASKICPDFVPPPITFKNQQRYNYFPELCNEFTIEEMDLALSITNNTAPGI FT DNIKFIVLKNLPIDGKLHLLSLYNLFLFQNIFPLQWRSIKVVSXLKPDKNP FT SLVDSRRPISLLSCLRKLMERMILNRLELWAEKNNILSSSQFGFRKGRGTR FT DCVGLLASYIELSFNKKQDVVTTFLDVSGAYDSVLIDLLFSKMXDCKIPII FT LXNFLCNLFSFKIMHFYHNGSPRXIRYSYFGLPQGSCLSPFLYNLFTRDII FT SIIPNGCYFIQFADDKNISITGQNREVIRHFMQLALDNIDTWAHNNGFTFS FT AQKTKFILFSRKHSPVSIELFLNSHRIDQVFDYKYLGIWFDYKLKWNSHIQ FT YIQKICSKRINFLRMITGTWWGAHPXDMITLYKTTIRSVMEYGCFTFGSAV FT QTHFSKLEKIQFRCLRICLKLMNSTHTKSVEVLAGIIPLRNRLHELNCKFL FT INCFSNNHPIIDLLKSXFEINPTNKILHSFLYCSGETIIPNSTPGFYEYSM FT NVHAFRPCIDLSLLEELKQIPCHXHFRFANMLFNRKFIGVEQNQIFFTDGS FT LVGNMAGFGVFNINLAHFYKLESPCSVFTAELTALYFTCNLIKNSAPNIYM FT VCSDSLSCLNALNXSNFHFKTHRTVLSIKNLLHNLYSRGYVIKFVWVPAHC FT NIYGNEQADLLAKLGVFRGITYNRDICDFEYFTKLKKYSLNNWQLSWNTSI FT QGRYCYSILPKVKAVPWFKNLYVGRNFICSLSRLMSNHYICNSYLYRMNII FT DSDICECNESYEDIDHIVLKCSRFNGPREKFLDNIIRLGFDIPVSVRDILG FT NKYLPILKILYKYLYDISYHV" XX SQ Sequence 5682 BP; 1727 A; 849 C; 905 G; 2162 T; 39 other; catcttcggt aagtaggcct tgaccgggtt acaggtcttt ttttctgcgc ttgtcaattt 60 ttttctattt tcaggtgaat ttctaccccg aagttttgcg atctaaggcg tttggatttt 120 gagtgtaatw cactggaagm gaaggtgtgc tgtatcgttg ggaagctttg wtttgcaaga 180 tttctcgtag ctgttggtgw ttccagccgt tgacgktttt cgattgctgt tgttggcgtt 240 gccgmagttm atttttgctc gttaagtatt attttatttt cttttgttgt ttttttattt 300 ggtttttata ctggtgtaat cgawttttta ctatccccgt catttttcca ttatggagac 360 tgacggggag tcagtttctg ttgtggatag tgtatctaaa gamtcttctt ctactgttaa 420 acacattcgc gtgaaaactt atcctactac ctttctgggt ccctttgtgg tattttttcg 480 taaaaaagaa aaaccaatwa atgtcctttt gatttcctct gagatttaca aattgtataa 540 atctgtcaag gaaatcaaaa agatttccct tgataaattg agggttgttt ttggatctcg 600 wgaagacgct aatgcgttgt tagagtccaa attatttttg aattcatatm gagtctacgc 660 tccatgtgac tcatgtgaaa ttaatggcat aatttatgac gaagctttag agtgtgatga 720 tattgtaaat catggttctg gagttttcaa aaatcaggca atttctcctg tgaaaatttt 780 agaatgtgtc cgtttatcta aattaacctt ttcagataaa ggatcatcct atacacattc 840 taattgtatc aagataacat ttgaaggatc tattcttcct gattatgtta ttattgataa 900 tgtgaaattt cacgttagat tattttatcc caaaattatg cattgtgatc gttgtcttct 960 ttttggacac acatcacatt tttgttctaa taaaccaaaa tgttccaaat gtggtggagt 1020 tcatcctcca tctgattgta agaaaaattc tgatggttgt atttattgtg ggaagaaaca 1080 taacttgttg aaggagtgtt cagtttatat tgcacatcaa caacaattta atctgaaaat 1140 taaaaataaa aataaattat cttattcaga tgttattaaa acttccgatg gtattgcctc 1200 tcataatatt ttcgaacctt tatctgaaaa taattgcaat gaaaatttaa aagaggaaac 1260 aaataatttt gtttataaac ctcctgtaaa aagaaaaaga aataataaat ctaacaatca 1320 caaccacatt ttggaaccac aaccttcaac atcttatgac aaaaattttc ctcccatgaa 1380 ttcttctaat acaactcaga atattcctgg tttccagaaa agtaataccw tattttctgg 1440 aaataaattt gatggtagca ataatattaa aaataattct caggataatg ggaataaaam 1500 taatgatgct aacattttga acattttgga agatattatt gagtttttgg gattaaatga 1560 tttttggaag aacctgatta ggaagttttt accatttttg gccaatattc ttgaaaaatt 1620 gaattcattt ggaccactca taagttcctt gttttgttcc taatggctca awtcaataat 1680 caaagcttga atattttaca atggaattgt mgaagtatta ttccaaaagt tgatcgactt 1740 aaagctttac ttttaaacaa tggtgttgat atattttgtt taaatgaaac gtggttagta 1800 gatactaaaa gcttaagaat tccttttttt aatataatcc gaagggatcg taatatttca 1860 tatggagggg ttatgattgg aattcgtgaa aatgttcaat tcaaattttt agaattttca 1920 ttggactctc aaattgaaaa tgtttctatt tctgtcaaac ataatgatat tgaattctcg 1980 attatttgtt tatatattcc tccccaagca agattttctc tacaagaact taaaactatc 2040 ttaaactctg ttccttctcc attttatatt cttggtgact tgaatgctca tcattttgct 2100 tggggcagtg atatgaccga tggtagagga tcattaataa tggatctaat cgacgaatta 2160 aatttaaata ttttgaacga tgggtcattt actagaattg ctgtgcctcc tgctcgccat 2220 tcctgtattg atttatcgct ttgttcaaat agtttgtaca ttaaatcgtc ttggaaaata 2280 attgatgatc ctaatggtag tgatcattta cccattaaaa tttccattca taatcctgtc 2340 cgtgggcaat tgagtmcaga accttatgtt catgatttga cgaaaaatgt acattggaat 2400 aaattttcgg acttggtttc ttctgctttg aataagtttg attattcgct ttcacccctt 2460 caaaattatg atagtttttc amaaatttta attgaatgtt tatataaatc acaatgcaaa 2520 caaatwaaat tggggccagc gaaaaggaaa cacatttctt tttggtggga taatgattgt 2580 acmattgcwt taaaaaataa atcaaatgct tttaaacaat ttaggcgttc aggatcaaga 2640 gagcattata ttttttattg caaaactgaa gctcagttta ctcgagttac taaatttaag 2700 aaaagagatt attggagaaa ctttgttgaa aatcttgatc ctgaaacatc tttatctgaa 2760 ttatggtctg ttgctcgaaa tttgagaaat tacaatattc cttctacatc tgttctggaa 2820 tattcagaag attggatcga tcaatttgct tctaaaattt gtcctgattt tgtccctccw 2880 cccatcacwt ttaaaaatca gcaacgttat aattatttcc ctgaactttg taacgaattt 2940 actatagagg aaatggattt ggcactatct attaccaata acactgctcc aggtattgat 3000 aatattaaat tcattgtgtt gaaaaattta cccattgatg gtaaattaca tttactttcc 3060 ttatataatt tattcttgtt tcagaatata tttcctttgc aatggcgttc tataaaagta 3120 gttagtmttc ttaaacctga taaaaatcct tcattagtag atagtagaag accgatcagc 3180 ttattatcgt gtcttcgtaa gttaatggaa agaatgattc tcaatcgcct tgaattgtgg 3240 gctgagaaaa ataatattct ttcatcttct caatttggat tcaggaaagg tcgtggaact 3300 cgtgattgtg ttggcctttt agcttcatat attgaattat cgttcaataa aaaacaagat 3360 gtagttacta cmttccttga tgtttctggt gcatatgatt ctgttttaat agatttactt 3420 tttagtaaaa tgmttgattg taaaattcca atcatccttw ctaatttctt atgcaatttg 3480 ttttctttca aaataatgca tttttaccat aacggatctc cwagawcgat tcgttatagt 3540 tattttggat tgccacaggg ttcttgttta agcccatttc tatacaattt attcaccaga 3600 gacataatwt ccattattcc gaatggatgt tattttattc aatttgctga tgataagaac 3660 atttctataa ctggccaaaa tagagaagta attcgtcact ttatgcaact tgctttggat 3720 aatattgata catgggcaca taataatggt tttacttttt cagctcaaaa aacmaaattt 3780 atattatttt ctcgcaagca ttctccagtt agtattgaat tgtttctcaa tagtcatcgc 3840 attgatcaag tttttgatta caaatatctt ggtatatggt ttgattataa attgaaatgg 3900 aacagtcata ttcaatatat ccaaaaaatt tgttcaaaam gaatcaattt tcttcgtatg 3960 attactggaa catggtgggg tgctcatccc wctgatatga ttacactcta taaaacaact 4020 attcgwtcag ttatggaata cggttgtttt acatttggaa gtgctgttca aacacatttt 4080 tcgaagcttg aaaaaatcca gtttcgttgt ttgagaattt gtttaaaatt aatgaattct 4140 actcatacaa aatctgttga agttcttgct ggcattattc ctctaaggaa tcgcttacat 4200 gagctcaatt gtaaattttt aataaattgt ttttcaaata atcatcctat tattgatctg 4260 ttaaaatcam tatttgaaat taatccaact aataaaattt tgcattcatt tttatactgt 4320 tccggagaaa ccattattcc aaattctaca cctggttttt atgaatatag catgaatgtt 4380 catgccttcc gtccatgtat tgatttatct ttacttgaag aattaaaaca aattccttgt 4440 catkctcatt tccgwtttgc taatatgtta tttaatcgaa aatttattgg agtggaacaa 4500 aaccaaattt tttttacaga cggatcttta gtgggaaata tggcaggwtt tggagttttc 4560 aacatcaact tggctcattt ttataaattg gaatctccgt gttctgtttt cacagctgaa 4620 ttaactgctt tgtattttac ttgtaattta attaaaaatt ccgctcctaa catatacatg 4680 gtgtgttctg atagcctgag ttgtcttaat gctttgaatt scagtaattt tcatttcaaa 4740 actcatcgta ctgttttatc tattaaaaac ttattacaca atttgtattc tcgaggatat 4800 gttattaaat tcgtttgggt accagctcat tgtaatattt atggtaatga gcaggctgat 4860 ttattggcaa aattgggtgt tttccgtgga ataacttaca atcgtgatat ttgtgatttt 4920 gaatatttta caaaattaaa aaaatattct ttgaataatt ggcaactttc atggaacaca 4980 agtatccaag ggcgttattg ttattctatt ctcccgaagg ttaaggcagt tccttggttt 5040 aaaaacttgt atgttgggcg taattttatc tgttccctct ctagattgat gtccaaccat 5100 tatatttgta atagttattt gtaccgcatg aatatcatag attcagatat ttgtgaatgc 5160 aatgaatctt atgaagacat tgatcatatt gttcttaaat gttctcgctt taatggacca 5220 cgagaaaaat ttttggacaa cataatcagg ttaggttttg atattcctgt atctgttcgc 5280 gatattctgg gaaataaata tctccctata ttaaaaatcc tatacaaata tctatatgac 5340 atttcttatc atgtttgata tttgctgttt tgttttcttt tttcttttca ttttcagcta 5400 ctacaagtcc aggattggca catgggacat gtcaatgatt ggctctgtga atcgtgtgga 5460 tggcccttaa ctatgatgac ctcacttgac agatattggc tccgcaatgg atatgctccg 5520 caagagcctt taattatatt tatttaattt ttttgtaacg tatttttgaa aagataaaga 5580 ggttttatgc ctttttgaga aagatttcga aaggaaatca ctcaaagggg cttttccctc 5640 tttcaaaatt tttaagttaa taaacaataa caataacaat aa 5682 // ID Perere-3 repbase; DNA; INV; 3373 BP. XX AC BN000794; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 21-JUL-2009 (Rel. 14.08, Last updated, Version 3) XX DE Schistosoma mansoni Perere-3 non-LTR retrotransposon (EST). XX KW RTE; Non-LTR Retrotransposon; Transposable Element; GAMERA; SR2; KW PERERE-3. XX NM PERERE-3. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-3373 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000794; Positions 1 3373. XX FH Key Location/Qualifiers FT CDS 218..3193 FT /product="Perere-3_1p" FT /translation="MRPMHLLTTRAKIFIGTWNVRTMWETGRVFQISAEMR FT KYNLEVLGISETHWTQVGQQRLSSGELLLYSGHEEENAPHTQGVALMLSRK FT AQNALIGWESHGPRIIKASFKTKREGITMNIIQCYAPTNDYDEDAKNQFYD FT RLQSIFEKCPTKDLTILMGDFNAKVGKDNTGYEDVMGQHGLGGRNENGDRF FT ANLCAFNKLVIGGTIFPHRNIHKATWISPDHTTQNQIDHVCINKKFRRTME FT DVRTRRGADIASDHHLLVAKMKLKLKKHWTMARTTSQKFNMAFLRDADKLN FT EFNIALSNRFEAFHDLLNGEGTTIESSWKGTKEAIVSTCQEVLGQKKHHHK FT EWITVGTLDKIEARRNKKAAINISRTRAEKAKAQAEYTEANKQVKRSIRTD FT KRKYVEDLAMTAEKAAREGNMRQLYDTTKKLAGNYRQPERPVKNKEGKVIT FT DIEEQRNRWVEHFKELLNRPAPLNPPNIEAAPTDLPIDIGPPTIEEISMAI FT RQIKSGKAAGPDNIPAEALKANVAATAKILHILFSKIWDEEQVPKDWKKGL FT LIKIPKKGDLSKCDNYRGITLLSIPGKVFNRVLLNRMKDSVDAQLRDQQAG FT FRKDRSCTDQIATLRIIVEQSIEWNSSLYINFIDYEKAFDSVDRTTLWKLL FT RHYGVPEKIVNIIRNSYDGLNCQIVHGGQLTDSFEVKTGVRQGCLLSPFLF FT LLVIDWIMKTSTSGGMHGIQWTGRMQLDDLDFADDLALLSQTQQQMQEKTT FT SVAAASAAVGLNINKGKSKTLRYNTICTNPITLDGEALEDVEIFTYLGSII FT DEHGGSDADVRARIGKARAAYLQLKNIWSSKQLSTNTKVRIFNTNVKTVLL FT YGAETWRTTKAIIQKIQVFINSCLRKILRIRWPDTISNKLLWETTNQIPAE FT EEIRKKRWKWIWHTLRKSPNCVTRQALTWNPERQRRRGRPKNTLRREIETD FT MRRMNKNWKELEKKAQDRVGWRKLVGGPCSIESYRRK" XX SQ Sequence 3373 BP; 1200 A; 749 C; 788 G; 636 T; 0 other; caattactcc gcctgtagct cctccagggg ctactgccgg tcccaagccc gggtaaagga 60 ggagggttgg gcatggggtt agcgacccca tcccgtagaa aatcaacccg ctaaaaaaac 120 gctaaccaga aaaaatcatt caaaccattc aaactctgcc ctgggagtag aaggaaaagt 180 gacgtctcat gatgaaagcc gagttccttc ggaagtcatg aggccgatgc accttcttac 240 aaccagagcg aaaattttta taggtacatg gaatgtccgg acaatgtggg agaccggaag 300 agtcttccaa atttctgcgg aaatgaggaa atacaacctg gaagtgcttg gaatcagtga 360 aacacattgg acgcaggttg gacaacaacg actgtcttca ggggaacttc tgttatactc 420 cggtcatgaa gaagaaaatg ccccacatac acaaggagtg gcactgatgc tgtctagaaa 480 agcacaaaat gcacttatag gatgggaatc tcatggaccg aggatcatca aagcctcctt 540 caaaacaaag agagagggca ttacaatgaa catcatccaa tgctatgcgc cgaccaacga 600 ctacgatgaa gacgctaaaa accaattcta cgataggctg cagtcaatct tcgagaagtg 660 cccaaccaag gacctgacca ttctaatggg agatttcaat gccaaggttg gaaaggacaa 720 cactggatac gaagacgtca tgggacaaca tggactggga ggaaggaacg aaaacggtga 780 tagatttgca aacctatgtg ccttcaataa actggtcata ggtggcacca tattcccaca 840 cagaaacata cacaaagcca cttggatttc accggatcac actacacaaa atcaaatcga 900 ccatgtctgc atcaacaaaa agttcaggag gacgatggag gatgtgagaa ccagaagagg 960 agctgatata gcatccgatc accatttgct ggtcgccaag atgaaactaa aactcaagaa 1020 gcactggaca atggcgcgga caacatcaca aaaatttaac atggcctttc ttcgagatgc 1080 tgacaaactc aacgaattta acatagccct cagcaacagg ttcgaggcct ttcatgacct 1140 actcaatgga gagggaacca ccatagagag cagctggaaa ggtaccaagg aggcaatcgt 1200 ttcaacatgt caggaggttc tgggccaaaa gaagcaccat cacaaggaat ggatcactgt 1260 tggtacactg gataaaattg aagcaaggag gaacaagaag gcagcaatca atatcagccg 1320 aacaagagca gaaaaagcca aggcacaagc cgaatacaca gaggcaaaca agcaagtgaa 1380 gaggagcatc aggaccgaca aacgtaaata tgtggaagat ttagcgatga cagcggaaaa 1440 ggctgcaaga gagggaaaca tgagacaact gtatgataca acaaagaaac tagctggaaa 1500 ttaccgtcaa ccagaaagac cagtgaaaaa caaggaaggc aaagtaatca ccgacattga 1560 agaacaacga aacaggtggg tagaacactt caaggaactc ttgaatcgac cagctccact 1620 gaacccaccc aacatcgaag cagcacccac agacctccca atcgatattg gcccaccaac 1680 aattgaagag atcagcatgg ccattagaca aatcaagagt ggcaaagcag cgggaccaga 1740 caacatcccg gcagaggcac tgaaagcaaa tgtagcagca actgccaaga tactccacat 1800 cctcttcagt aagatttggg atgaagaaca agtaccaaaa gactggaaaa aaggacttct 1860 gatcaagata ccaaagaaag gcgatctcag caagtgcgac aactacaggg gcatcactct 1920 tctctcaata ccaggaaaag tcttcaacag agtattgtta aacaggatga aggattccgt 1980 ggacgcccaa cttcgagatc aacaagctgg attccgtaaa gatagatcgt gcacagatca 2040 aatcgcaact ctacgtatca ttgtggaaca atcaatcgaa tggaattcat cactctacat 2100 caacttcatt gactacgaga aggcatttga tagcgtggac aggacgacac tatggaagct 2160 tcttcgacac tacggcgtgc ctgagaagat agtcaacatc atacggaact cctatgatgg 2220 actaaactgc caaatcgtcc acggaggaca actcaccgac tcgttcgagg taaagaccgg 2280 tgttaggcaa ggttgcctac tctcaccctt tctctttctc ctggtgatcg actggattat 2340 gaagacgtca acatctggag gaatgcacgg gatacagtgg acaggcagga tgcagcttga 2400 cgatttagac ttcgcggatg atctggctct tctatcacaa acgcaacaac aaatgcagga 2460 gaaaacgacc agtgtagcag cagcctcagc agcagtaggt ctcaatataa acaaagggaa 2520 aagcaagact ctccgataca atacaatatg caccaatcca attacacttg acggagaagc 2580 tttggaggat gtggaaatct ttacatatct gggcagcatc attgatgaac acggtggatc 2640 agatgcagat gtgagggcgc ggatcggcaa agcaagagca gcatacctac agctgaaaaa 2700 catctggagc tcaaaacaat tgtcaaccaa caccaaggtt agaattttca atacaaatgt 2760 caagacagtt cttctgtatg gggcagagac gtggagaact acgaaagcca ttatccagaa 2820 gatacaagtg tttattaaca gctgtctacg caagatactt cggatccgat ggccagacac 2880 tatcagcaac aagttactgt gggagacaac aaaccagatt ccagcggagg aagaaatcag 2940 gaagaagcgc tggaagtgga tttggcacac cttgaggaaa tcacctaatt gcgtcacaag 3000 acaagccctc acatggaatc ctgaacgtca aaggagaaga ggaagaccaa agaacacatt 3060 acgccgagaa atagagacag acatgagaag aatgaacaag aactggaaag aactagaaaa 3120 gaaggcccag gacagagtgg gttggagaaa gctggtcggc ggcccatgct ccattgagag 3180 ttacaggcgt aagtaaggca acaaagagta agtgcactgg tagaagagga acaagcccta 3240 cgcgatctga acctgcagaa taacgcagta aactattttg taataacttt ccctttttgt 3300 actcgaaccg cgtctttcaa tttcaaatat atctgttgtg ccagcaaaaa aaaaaaaaat 3360 aaataaaaaa aaa 3373 // ID Copia-135_AA-I repbase; DNA; INV; 4044 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-135_AA_; KW Copia-135_AA-LTR; Ty1_copia_Ele191; Copia-135_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4044 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1403-1930] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 62..4042 FT /product="Copia-135_AA-I_1p" FT /translation="MDGGKFSVEKLRSGGYETWRFKVEMLLVRENLWKYVS FT EAAPNPLTDAWKEGDAKARATIALLVDDCQHPLIRDSKTAQSTWDNIENHH FT QKTTMTTKVSLLKKLCRAEYNENGDMEGHLFKMEELFSSLANAGQELDANL FT KVAMVLKSMPDSFDNLTTALETREDEDLTMDLVKGKLLDEAQKRMEKTHPS FT ESILRVGAEKKIICHQCRKPGHMKRDCPMMKNEGSSTSGGRSGGQNGQPKH FT KPKGKAAKTTPFAFTVSKNRKLMKTWIVDSGATAHMCCDRSFFQALKPSSG FT VTITLADGNETAVQGIGSGRLFCYDDSGNQQEVILSDVFYVPDLESNLVSV FT GCLVNKGAEVTFGKSRGCVIQCGGVVAAVAQKVGGLYQLNTGAERSMNVVH FT HTKDCIHSWHRKLGHRDPEAIQRVVREGLAKGVSIKKCDIYQTCECCAEGK FT IARKPFPRKTERQSTKVLDLIHTDICGPMNTVTPGGSRYFLTMIDDHSRYT FT VVYFLKRKSDAADVIEEYVTMVRNRFGRNPVAVRSDQGGEYKAKRLGQFYR FT ANGIVPQYTAGYSPQQNGVAERKNRTLVEMARCMLLDAKLGHRYWAEAINA FT AVYLQNILPSRSVEKTSFELWYGKQPDYSNLHIFGSAAIVHVPAEKRKKLD FT PKGKKLTFVGYADNHKAFRFIDVSTDKVTISRDAQFIEMEEPKTQNSFDHP FT GSLQVVEYEPEVIPFYPGPEEEDDDNVDEDYEEDDNAGEDAEEGSNADEDD FT EINNVDETLVEEEHAATDSSMYETPNEDDGSETDTPVRRSSRSTKGVLPQR FT YRETTGMVRKLTDEPRTYKEAMRGPEAELWKAAMDDEMKSLQENGTWQLTG FT LPPGRKAIGSKWIFKRKLDEDGNVVRYKARLVAQGFTQKFGTDYDEVFAPV FT VRQVTFRVLLTVASQRGMLVKHADVKTAYLNGELKESIFMRQPPGYETEDR FT NAVCSLQRSLYGLKQAGHVWNKKFDQVLKQMRFQQSKNDPCLYVRRNGAHY FT TYLVIYVDDMVIACNDEEEYEEIIKTLNRNFKVTSLGDISHFLGIKIKRNE FT KGFSLNQQTYIQQVLERFGMDQAKPSKYPLNPGHIQQKEEQEKLANNHQYA FT SLIGSLLYVAVNTRPDIAVSVSILGRSVSAPTHADWNEAKRILRYLKLTMD FT HELVLGEDESDLEIYVDADWANDAKSRKSNSGYLFLFGGGPIRWGSRKQTC FT VALSSTEAEFVALAESCQELQWIDRLLTDFSVKTIKPILIHEDNQSCIKQM FT ESGRATNRSKHVDTKYHFVYQLMADGVIRMQYCSTEHMIADMLTKPLTNVK FT LTRFREAAGVYPSRR" XX SQ Sequence 4044 BP; 1157 A; 917 C; 1131 G; 839 T; 0 other; taggttattg gcccagaata ttttactgcg agtagttcag aagttcattt tcaagattac 60 gatggacggt ggaaagttta gcgttgaaaa gctacgaagc ggcggatatg agacttggcg 120 gttcaaagtt gagatgctgt tagtccggga gaatttgtgg aagtacgttt cggaggccgc 180 ccccaaccct ctgacggacg cctggaagga aggcgatgcg aaagcccggg ctacgattgc 240 tcttctggtg gacgactgcc aacatccgtt gattcgcgat agcaagacgg cgcaatctac 300 gtgggacaac atcgagaatc accatcaaaa gacgaccatg acgacgaagg tatcgctttt 360 gaagaagctg tgccgggcgg aatacaatga aaacggcgac atggaaggac acttgttcaa 420 gatggaagaa cttttctcaa gtctggcgaa cgcgggtcag gaactggatg cgaacctgaa 480 agtggcaatg gttctgaaga gcatgccaga ttccttcgac aacctgacaa cggctttgga 540 gacccgtgaa gacgaggacc tcactatgga tttggtaaaa ggcaaattac tggatgaagc 600 gcagaagaga atggagaaga cccatccaag cgaatccatc ctacgtgtcg gtgccgagaa 660 gaagatcatt tgccatcaat gtcgcaagcc aggccacatg aagcgtgact gtccgatgat 720 gaagaacgaa ggttcatcca catccggtgg acgaagcggc ggtcaaaacg gtcaaccgaa 780 gcacaagccc aaggggaaag cagcgaagac cacacctttt gcattcacgg tgtccaagaa 840 ccggaagctg atgaagacgt ggatcgttga ttccggggca accgcccata tgtgttgcga 900 ccggtcattt tttcaagcgt tgaaacctag ttcgggagtg accattactc tagctgatgg 960 taacgagacg gctgtccagg gaattggttc cggccgtttg ttctgctatg acgatagcgg 1020 aaatcagcaa gaagtcatcc tcagtgacgt gttctacgtg cctgatctgg aatccaactt 1080 ggtttccgtc ggatgtctgg tcaacaaggg agctgaggtc acatttggaa aatcccgagg 1140 ctgtgtgatt caatgcggag gcgttgtcgc agctgttgcg cagaaggtag gtggcctata 1200 ccaactcaac acaggtgccg agcgaagcat gaacgttgtg caccacacga aggactgcat 1260 tcattcttgg caccgcaagc tgggacatcg agatccagaa gcgatccagc gagttgtgcg 1320 cgaaggctta gcgaagggcg tgtcgatcaa gaaatgtgac atttaccaga cctgcgagtg 1380 ttgtgctgaa ggcaagattg cacggaagcc attcccgagg aagacggagc gacaatcgac 1440 aaaagttttg gacctgattc atacggacat ctgcggtccg atgaatactg tgacgcctgg 1500 cggatcaaga tatttcctta caatgatcga cgaccacagt cgctacactg tggtgtattt 1560 cttgaagcga aaatcggatg ccgcggacgt gatcgaggag tacgttacca tggttcgcaa 1620 ccggtttgga cgaaatccgg tagcagtccg ctctgaccag ggcggtgagt acaaggcgaa 1680 gcgcttggga caattctatc gagccaacgg aatcgttccg cagtatacgg caggttactc 1740 cccgcaacaa aatggagttg cggaacgcaa aaatcggacg ctagtggaaa tggcccgctg 1800 catgttactg gatgcaaagc ttggccaccg ttactgggct gaagccatca atgcagcagt 1860 gtatctccag aacatccttc cgtcaagatc cgttgagaag acttcttttg aactgtggta 1920 cgggaagcaa ccggactaca gcaatctgca catttttgga agtgcggcta tcgtgcacgt 1980 accggccgag aaacggaaga agctagatcc aaaggggaag aagctaacat tcgtcgggta 2040 tgccgataat cataaagcgt tccggttcat cgatgtgtcc accgacaagg ttaccatcag 2100 tcgtgatgcg cagttcatag agatggagga gccgaagaca cagaattctt tcgatcatcc 2160 gggatcactg caagttgtgg agtatgaacc cgaagtcatt ccgttctatc ctggtccaga 2220 agaagaagat gatgacaacg tggatgaaga ttacgaagaa gatgataacg ccggcgaaga 2280 tgctgaagaa ggtagcaacg ccgacgaaga tgatgaaatt aacaacgtcg atgaaactct 2340 ggttgaggag gagcatgcag ctacggattc cagtatgtac gaaaccccaa acgaagacga 2400 cggcagcgaa accgatacgc ctgtgagacg atcttcccgc agcacaaagg gtgtgttgcc 2460 gcaacggtac agagaaacaa ctggaatggt gagaaaactg actgacgaac caaggacgta 2520 caaggaagcc atgagaggac ctgaagctga actatggaag gccgctatgg acgacgagat 2580 gaagtcactg caagaaaacg gtacgtggca actgactggt ttaccaccgg gccgcaaagc 2640 gataggcagt aagtggatct tcaagcggaa actagatgaa gatggcaacg tggtgagata 2700 caaggcccgt ctagttgcgc aaggcttcac ccaaaagttt gggacagact atgatgaggt 2760 ctttgcgcca gtggttcgtc aagttacgtt tcgagtgctg ctaacggttg ccagccaacg 2820 aggaatgctg gtgaaacatg ctgatgtgaa gaccgcctac ctcaacggag aactgaagga 2880 atcaattttc atgcgacagc cccccggata cgaaactgaa gataggaatg cggtgtgctc 2940 gttacaaaga tccttgtacg gactgaagca agcgggccat gtttggaaca agaagtttga 3000 tcaagtcttg aagcagatga ggttccagca gtcgaagaac gacccgtgcc tgtacgtgcg 3060 gcgcaacgga gcacactaca cgtatctggt aatttatgtg gacgacatgg tgattgcctg 3120 caacgatgaa gaggagtatg aagagatcat caaaacccta aaccggaatt tcaaggtaac 3180 atcgttgggc gatatcagtc actttcttgg aatcaagatc aaacgcaacg agaaaggatt 3240 ttccttgaac cagcagacgt acattcaaca agtcttggag cgattcggca tggatcaagc 3300 gaagccgtcc aagtatcctc ttaatcctgg ccacattcag caaaaggagg agcaagagaa 3360 gctggccaac aaccatcaat atgccagtct cattggaagt ttgctgtacg tggcggtgaa 3420 tactaggccg gatattgcgg taagcgtttc aattctagga agatcggtaa gcgcaccaac 3480 acatgccgat tggaacgaag ctaagcgaat acttcgctac ctcaagctaa ccatggacca 3540 cgaacttgtg ctgggagaag acgaatccga cctggaaata tacgtagatg cggactgggc 3600 gaatgacgcg aagagtcgca aatccaactc tgggtacctt ttcctgttcg gcggtggtcc 3660 aatacgctgg gggtcccgca agcaaacatg cgtcgccctc agtagcacgg aagctgaatt 3720 tgtagcactg gcggagagct gtcaagagct acaatggatc gaccgattgc taacggattt 3780 ttcggtgaaa actatcaagc cgatcctgat tcatgaggac aatcaatcct gcattaagca 3840 aatggagtcc ggacgtgcca ccaaccgttc gaaacacgtg gatacgaaat accatttcgt 3900 ttaccaactg atggcggatg gcgtgatccg gatgcagtat tgctcaaccg agcacatgat 3960 agccgacatg ttaacaaagc ccctgaccaa cgtgaagctg accagattca gagaagcagc 4020 tggagtttat ccgtcgagga ggag 4044 // ID Gypsy-39_OD-I repbase; DNA; INV; 12021 BP. XX AC CABV01002763; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_OD_; KW Gypsy-39_OD-LTR; Gypsy-39_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-12021 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002763; Positions 1636 13656. XX CC 'CGTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 68..2206 FT /product="Gypsy-39_OD-I_2p" FT /translation="MKYLYTLLPVEEELIRQLYEQNGNILGNDAKTPDARY FT NVFKQFERRDGMIKVPLKLHTHETLEIMEIYNMEASRLELDKSFARRNCLV FT LKPKEELRDVTQLVNIMLKMKEKFGDDFLKVKQEEKTLIAEEDINDLIKSA FT ENTLEETLQSKHEAERLRLRVTELEKEMEEVEGSVKRTEESMASSVEEWRK FT KFETAKESLEKCKRNREMDEKDIAEASDKLCMAMDDCERKDSEIGDLKNRV FT MELQEKQRKLARTKVSHGSEIDEESFQDVSPRKMFTSMHENGRGLRENEKP FT SYEDEKNRIKDTLNMMTNKIDEALDSRNTNRTQNLFSSPFRKDPVKVNNLK FT VTMRIPIWESGPAIDKTTSLNDFIDKLKRFKRMNVMSDAETIYSTLEASNR FT VDILRELDEEASENIDKFITYIREAHGGSTLKQRSNLESLLQSPMESAISF FT FKRVIREYYLSRGLEPKDPEKIQEKDRQEDILYYFSRGLRNATTGTHIRMN FT RINTEFKELGRLATHIDQSVEPITSTLVNNIVEQKQNEERNEARVNATQGN FT RGGFDGKCHKCGYHGHMARQCFANQRTRGRYNRQYDRRSGSRDRRSSGDRR FT SHERGRSRENKYRNHYNNSRRGRSGSREGYRNRSRDDNGGKRNYRERRDTP FT HRGGSREHRKHSGSRERDTSRNSSRGRDNSSRRRYSNRSGDRNSRRDRSYG FT RSRDSSRESNKRW" FT CDS 3046..5010 FT /product="Gypsy-39_OD-I_1p" FT /translation="MYKVMTSTTGGQPAKFTRTARKYDKELQKHIDNLEKQ FT EVVEKVDFVDTITSAFVIVKKKCGRLRFCLDLRGLNNITVPVKNYPIPYLS FT DILNNLSGNNYFTSLDMCSAYHQFLIDPDDRNRYTFMGPTKQIYRYRRVPF FT GGRMVTSWLQALMSQVVLKGISQSTAYIDDINLGTITFEEHKKVLRETLQR FT ISELNLVLSARKCSFGDRETKAFGYITNKNGYKADPERISTLVIELPKTKK FT KLLKALAAMSYYRSTIPKFAELANNLFKLTRTESEFDPDDKIILQEWERLL FT KALKEGIMIQTPNLNKKFILRTDASKNAYGNVLSQEGDGSTEKIISVESKQ FT FKDAQIHWSIGLKELYSCAQGVRKNINFLEGNRFLLITDCKSVFYLLVNKK FT EIKLSASNPMTRCFMFLMTFDFEIRWSKGTEKDFLLTDLLSRNLVDKNTNL FT QIAANSKEPLFSIQLLNGKNLDVDTNTQYTRREKGTKTNNINLTPLHNPDY FT KKLIHEIKKSQFEDDKIRKIIKRISNGKVLQGFKTILEPPLEQPILTRQDG FT SIVVPNYLVPKVLATIHKHATKNNDIGKIEQAKLYWTNMARSIAEWHQSCR FT ECTMGKPNSAPQKHTLAQIGDFTLEPFKSVCIDCMHARPFVILANGQIIFQ FT GIRAVW" FT CDS 6074..8794 FT /product="Gypsy-39_OD-I_3p" FT /translation="MRGLTGIIIFLTLNDVLDITNGIDTQADLSFQKPTLS FT ERTSRHAVRILQNVGIGLIRSDNAENVVIKGESWYSVTIDFPLPSFEHFMV FT KNCPKSLQDTIKAKIQQDTVLQVLVGEAKHLVAQFKHELTNREPYLANSRK FT RRNITEGRDQVGVNTEIFMEKEEIQLETQNSKITDSSASITNSWDSNIYAS FT NVENSESSSLSDIYRDSQSFIPNIVSETKVETNIPELKPYFEPNFYGSING FT PGELRRENENIQEDNLKVTTTTSTKTSTTGLITTTISPTLTYPQTNKIRPP FT TTVVTTTTKTTTVSTAIESSQVTPSMFENETTERGMQSTHTENQQRVSRPK FT GWVDDEEIVFNPIGLCGNIDNEEQFRFFEYFKHDCPEPYLCCAQDCPTYNM FT RLAHQEIIRDSKNVTTGMNENEIFNFVRALDSAKTCFSRENHNDRRRKSLS FT FWEFWKRGGGFSPSSINTEVKEMEDHEDRQMHHLAANMTSKGEFRATIEAE FT DTKIKLLASSLCSVSEEIVGTERLRQMRSHLIQLEDKIEQSVAECERNRRP FT FSLNTKLLVSLCLKLNPNDQIHCRNANIIGRVGCEMERLEITQDKVKMHIR FT ISMFSFLENEMSTSIVTLPFADSSGRLFMLSDIPRTVIETDKHLLATKCEN FT NSRRKIHICRLEDALLLGNNAACVNALVSESATLVKMSCKKESYKGENCVV FT QSLPTGGYAIWSEEAVDIKAAPRDKSYSRTLEMTKPKQITFVTRKDVSFNC FT NGVSYMTNTEGKELVTDADSHINLDLSAENLVNSNIEVFGDHLEKLQATQE FT QIRVNATKAEYRTLTNIFGIHEKHESWLRIVVRVVGGMAFFVFTFWIITKT FT IKKLYNWYFGNKIRRYNAMFGTGLGRVAQNGQINEVVTTLPVSNREATTIV FT SGV" XX SQ Sequence 12021 BP; 4649 A; 2051 C; 2507 G; 2814 T; 0 other; tttggtgtca gaatcgtaaa gattccacgc caacaacagg taatcttgac aagctatcca 60 caggaaaatg aaatacttgt acactctcct tcccgtcgag gaagaactaa taaggcaact 120 atatgagcaa aatggaaaca tattaggaaa tgatgcaaag accccggatg caagatacaa 180 tgtattcaaa caatttgaga gacgagatgg aatgataaaa gttcctctta aattacatac 240 tcatgaaacc ctggaaataa tggaaatata caatatggag gcttctaggt tagaactaga 300 taaaagtttt gctagaagaa actgtttggt tctcaaacca aaagaagaat tgagagacgt 360 gacgcaactg gtgaatataa tgttaaaaat gaaggaaaaa ttcggagatg atttcttgaa 420 ggtaaaacaa gaggaaaaga cattaattgc ggaagaagac ataaatgatc ttatcaaatc 480 ggcggaaaat acactagaag aaactttaca aagcaaacac gaagcggaac ggctgaggtt 540 aagagtaact gagctagaga aagaaatgga agaagtcgaa ggtagcgtga aacgtacaga 600 ggaaagtatg gcttcatcgg ttgaagaatg gagaaagaaa tttgaaacag caaaggaaag 660 tctagaaaaa tgtaaaagaa acagggaaat ggatgaaaaa gatatagccg aagcatccga 720 caaactgtgt atggctatgg atgactgcga aaggaaagac tcggaaatag gagatctcaa 780 aaacagagta atggaactac aggaaaaaca gagaaaacta gctaggacga aggtatcaca 840 tggatctgaa atagacgaag agagttttca ggatgtttct ccaaggaaaa tgtttacatc 900 tatgcatgaa aatggaagag gcttaagaga aaatgaaaaa ccaagttacg aggatgaaaa 960 gaataggata aaggacaccc taaatatgat gacaaataaa attgatgaag cgttggactc 1020 taggaacact aatcgcactc aaaatttatt ttcttcacca ttccggaagg atccagtaaa 1080 ggtaaataac ctaaaggtaa ccatgagaat tccaatttgg gaatctggac cggcaattga 1140 caaaactacg tctttgaacg actttattga taaactcaaa aggttcaaac ggatgaatgt 1200 aatgtctgat gcggaaacaa tttacagcac tcttgaagca agcaatagag ttgatattct 1260 aagagagctt gacgaagagg ccagtgaaaa cattgataaa ttcataactt atattcggga 1320 agctcacgga ggtagtactc ttaaacaaag aagtaatttg gaatcgttac tccaatcacc 1380 catggaatca gcaatatcat tctttaaaag agtgatcagg gaatattatc tttcaagagg 1440 tttagaaccg aaagacccgg agaaaattca ggagaaagac agacaagagg atattttgta 1500 ctatttttct agaggattaa gaaacgcaac aactggtaca cacataagga tgaataggat 1560 aaacacggaa ttcaaagaac ttggacgttt ggccactcac atagaccagt cagttgagcc 1620 gataacatca actctggtta ataatatagt agaacaaaaa caaaacgagg aacggaatga 1680 agcaagagta aatgcaacac aagggaatcg aggaggcttt gacggtaaat gccacaaatg 1740 tggatatcat ggccatatgg ctcgccagtg ttttgcgaat caacgaactc gaggtcgata 1800 taacagacag tatgatagaa gaagcggaag tagagacaga cggtcaagtg gagacagaag 1860 atcacatgag cgaggacgct ctagagaaaa taaatataga aatcactata ataatagtag 1920 acgtggaaga agtggcagcc gcgagggata tcgcaacaga agtagagacg ataatggagg 1980 aaaaaggaat tatagggaaa gacgcgatac tccacataga ggaggatcac gagaacaccg 2040 taagcactca ggtagtagag aacgggacac atctcgaaac tcaagtagag gacgtgataa 2100 ctcatcaagg aggaggtact caaacaggag cggagatcga aattcgagac gcgacagatc 2160 atatggtcga tcaagagaca gttcgagaga gtcaaataaa agatggtaaa acatggagct 2220 acctatagta gaaatgggaa taattcacaa gggacaaaaa attaaggtaa acatgttatt 2280 ggacaccggt tcaaatagga atatcttatc aacctcgcta aaactaaacg acacaggtaa 2340 atgtaaagaa tcgacagtca gaactttcga tggaacatca cggaacataa aaacgaataa 2400 tattatcggt gatcttatat cgtcagacgg tagaactatg gaatctgatg ttcagttcgt 2460 agtgacagaa tcgatatatg aaggaatagt tggattcgat atattggtaa aatatacttt 2520 aactttttca ctacgcatgg caattttaac tagcaaaaat gataaaaaca aaatatttca 2580 tcttaaccca acccaggaaa tggcagtgaa ggtaaataac ttccacctga acccaaacga 2640 aatgataact ttggaaataa aaaatgaagc atcggaaaag aattatgaaa gtacatttgc 2700 ttttgtcggt acatcttatg caaatcaaac attacctgca aacgtaaaag gaaatcaagt 2760 agaaattttt aatgacactg atgaaaggat attcgtagag aacaatatta tatatttata 2820 aataatcaga tagctttatc taaaaaggat aaggataagg aacataacag accagcccac 2880 tacagaaagt cattaggctt cgaaaaactt aaagtaaagg attttaaaat aaacaagaat 2940 ttatcggaaa aagaacgaca agaaatcttc gaaatactaa cggaatatga tagttgcttc 3000 gcaagaagtg acactgatat cgaaccaggt ttccggttac catatatgta caaagttatg 3060 acaagcacaa ctggagggca accagctaaa ttcacacgga cagcaaggaa atacgataag 3120 gagttacaaa aacacatcga taaccttgaa aaacaagagg ttgtagaaaa ggtagatttc 3180 gtcgatacaa taacttcagc atttgtaatc gtgaagaaaa agtgcggcag acttcgattt 3240 tgcctagact tgagaggatt aaataatata actgtaccgg taaaaaatta tccgatacca 3300 tatttgagtg acatcttgaa caatcttagc ggtaataact acttcacatc actggatatg 3360 tgtagcgctt accatcaatt tttgatagac ccagatgaca gaaacaggta tacgttcatg 3420 ggtccgacaa aacaaatata tcggtacaga agagtaccct ttggaggacg aatggtaaca 3480 agctggcttc aggctcttat gagtcaggta gtacttaaag gcatttcaca aagcactgct 3540 tatatcgatg atattaatct gggaactata acgttcgagg aacataaaaa agtattaagg 3600 gaaacccttc aaagaatatc ggaattaaat ctggtattat cagcaagaaa atgcagtttt 3660 ggtgataggg agaccaaagc tttcggatat attacaaaca aaaatggtta caaggcggat 3720 ccggaaagaa tctctacttt ggttattgaa ttaccaaaaa cgaagaaaaa actcctaaaa 3780 gctttagcag cgatgtcgta ctatagatca acaataccaa aatttgcaga attagcaaat 3840 aatctgttta aactcacaag aacggaatcc gaatttgatc cggatgataa aattatttta 3900 caagaatggg agagactttt aaaagcgctg aaagaaggaa taatgattca aactccaaac 3960 ttgaataaaa agtttatttt gcgcactgat gcttctaaaa atgcttatgg aaacgtacta 4020 agtcaagaag gtgacggcag tacggagaaa attatatcgg tagaatcaaa acaatttaag 4080 gatgctcaaa tccactggag catagggctt aaggagctct acagctgtgc tcaaggagtt 4140 aggaaaaaca taaatttctt ggaagggaat cgttttcttc ttataacgga ttgtaaaagc 4200 gtattttatc tattggtaaa caaaaaggaa ataaagttgt ctgcgtcgaa tccaatgact 4260 aggtgtttta tgttcttaat gacattcgat ttcgaaatac gatggtctaa gggtaccgaa 4320 aaagattttc tattaacaga tttactatcc agaaacctag tggacaaaaa tacaaatttg 4380 cagatagcag ctaattcaaa ggaaccattg ttcagtatac agctgctaaa tggcaaaaat 4440 ttggacgtcg ataccaacac tcaatacaca agacgagaga agggaacaaa aacaaataac 4500 atcaacctta cacctctaca taatccggac tacaaaaagt taattcatga aataaagaaa 4560 tcccaatttg aggatgataa aataaggaaa ataataaaaa ggatttccaa tggcaaggta 4620 ttacaaggat tcaagacaat tttagaacca cctcttgaac aaccaatcct aacccgtcag 4680 gatggttcta tcgtggttcc aaactatctg gtaccaaagg tattggcaac aattcataaa 4740 catgctacta agaacaatga tatcggtaaa attgaacaag ctaaactgta ttggacaaat 4800 atggcaagat caatcgcaga atggcatcaa tcctgcagag aatgcacaat ggggaaacca 4860 aatagtgcac cacaaaaaca cacattagct caaataggtg attttacttt ggaacctttc 4920 aaatcagtgt gcatcgactg catgcatgct cgtccatttg taatacttgc taatggtcag 4980 atcatttttc agggtatacg agcggtctgg taatcacaga tgaacaagta tctactttaa 5040 taaatagtgt gctatgtcta tcgtttaggt ttggaatgcc aagaacgata agactagaca 5100 atcataggtc cttccaggca aatgaattta aggagactat ggccaaaatg ggtataggtc 5160 taagtttcac atcaccatcg aactcacagg caaacggcaa atgcgaaaga caaataagga 5220 gcattcaaga gcgtctaagg attctaacta ttgaagaagt tctccctccg aaagaagcaa 5280 ttacactatt agaattacaa acagttatag aatatatagt attggaagta aatacaactc 5340 gtaagacgga caaaaagagt cctttggaaa tcttgacagg catcgagcct cacttaggcg 5400 ttcatttatc aaaggggtta ggaataaccc aagacagtag cactcagaaa caacgattgg 5460 caatactacg tagagaggtt caggaaaaac tcgcagaaga aatcgaaaaa caggctaata 5520 gtacgctgga aaatatcgaa gatgtgaaat taaaagtaaa cgatttagta agaataagaa 5580 aacttgcaaa agttggtcaa accaagagag agcaaataaa attttcaagc gagatttaca 5640 aaattattga agtccaggaa caatacggaa ctataaaaat aaaagaagtt aaagaaacaa 5700 acaaagtaga tagaaggaag ccggaaatca gaataattag tatgaggaaa gtaaagaaga 5760 ttttgagcag agatgatgta gaagagaagt atggaaaaag acaagcagaa gaagaaaaag 5820 ctacattaga tgataagacc ggcgtgatac aggaaagtaa gagtatactg aagcgaaata 5880 tttcatatgg gggccctaat aagaaggtaa ggttcaagat tgacgaatcg agaacagaaa 5940 aagtacaaga aaagcataaa aatcggcaaa ggcaagctga taatccgcga aggagtaaca 6000 gattgcgaga gagacaagca attaactatg ctgaatgaga gcttgatcaa ctcgagtcag 6060 tcgtttttgc aaaatgagag gactcacagg aataataata ttcttgactc tgaatgatgt 6120 cttagacatc acaaacggaa ttgacacaca agccgacctg agttttcaga aacctacttt 6180 aagtgagaga acaagtagac atgcggtaag gattctacaa aatgttggaa taggtttgat 6240 aaggagcgac aacgcggaaa atgtggttat taaaggggaa agttggtatt cggtaacaat 6300 tgactttcca ttaccaagtt ttgagcactt tatggtaaaa aactgtccta aatcactgca 6360 agatacgatc aaagcaaaaa ttcaacaaga tactgttctg caagttcttg ttggagaagc 6420 taaacatctc gttgctcaat tcaaacacga attaacaaat agagaacctt acttagctaa 6480 ttctagaaaa agacgaaata taactgaagg tagagatcag gtcggggtaa acacggaaat 6540 atttatggaa aaagaggaaa ttcaattgga aacacaaaat tcaaaaataa cggactccag 6600 tgcaagtatc accaatagct gggactcaaa tatatatgca tcaaatgtgg aaaactcaga 6660 aagctcaagt ctatctgaca tatacaggga cagtcaatcc tttatcccaa acatcgtttc 6720 ggaaacaaaa gtagaaacaa acataccgga attaaaacca tacttcgaac caaactttta 6780 tggatcgatt aacggacctg gtgaactaag aagagaaaac gaaaacatac aggaggataa 6840 tctgaaagtc acaacaacaa catcgacaaa aacatcgaca acaggactaa ttacaacaac 6900 tatttcacca actttaacct acccgcaaac aaataaaata cgacctccaa caacagttgt 6960 tacgacaaca acgaaaacaa caacggtatc aaccgctatc gaatcctcac aggtgactcc 7020 atcaatgttt gaaaatgaga caaccgagag aggaatgcaa agcacacaca cggaaaacca 7080 acaacgagtc tcacgaccaa aaggatgggt cgatgacgaa gagatagtat ttaacccgat 7140 tggtttatgt ggaaacattg ataacgagga acagtttagg ttcttcgaat atttcaaaca 7200 tgattgcccg gaaccctatc tctgttgcgc gcaagattgt ccaacttata atatgagact 7260 agcgcatcag gaaattataa gggattcaaa aaatgttacg acaggaatga atgaaaatga 7320 aatctttaac tttgtgcgtg ctcttgacag tgccaaaact tgtttctcaa gggaaaatca 7380 caatgatagg cgaagaaaat cgctgagctt ctgggaattt tggaaacgag gaggaggatt 7440 tagtccgtca agcattaaca cagaagttaa ggaaatggaa gatcacgagg acagacaaat 7500 gcaccactta gcagcaaata tgacaagtaa aggggaattt agagctacta tagaagcaga 7560 ggacacgaaa attaaattgc ttgcaagttc tctatgcagc gtttcagagg aaattgttgg 7620 aacggaaagg ttgagacaaa tgcggtctca cctaatacaa ttagaagaca aaatagaaca 7680 atcggtagct gaatgtgaaa ggaacagaag accttttagc ctgaacacaa aactactggt 7740 gagtttgtgt ctaaagctaa atccaaatga tcaaattcat tgtcggaatg ctaatataat 7800 cggaagagtc ggatgtgaga tggaacgact ggaaattact caggataaag taaaaatgca 7860 cattcgcatt agcatgttca gttttctcga aaatgaaatg tcgacatcta tagtaactct 7920 gccattcgca gacagttcag gtcgtctatt tatgctttcg gatataccac gaacggttat 7980 tgaaacagat aagcatctac tggcaacaaa atgtgaaaat aattcgcgaa ggaaaataca 8040 catttgtagg cttgaggacg ctttgctatt aggaaataac gccgcatgcg tgaacgcact 8100 cgtgtcagag tccgcaacac tggtaaaaat gtcttgtaaa aaagaaagtt acaaaggtga 8160 aaattgtgtg gtacagagtc tacccaccgg aggctatgcc atatggtcgg aggaggcggt 8220 tgatataaaa gcagccccaa gagacaaatc atattcacgt actctcgaaa tgactaaacc 8280 aaaacaaata actttcgtta caagaaaaga tgtaagtttt aactgcaatg gcgtaagtta 8340 tatgacaaac accgaaggaa aagaactggt aacagatgca gattctcata ttaacctcga 8400 cctaagcgca gaaaacctcg taaattcaaa tattgaagtt tttggggacc atctggaaaa 8460 attacaggct actcaagagc aaattcgagt aaatgcaaca aaagctgagt accgaacact 8520 tacaaacatt ttcggtattc acgaaaagca tgaatcttgg ctacgaattg tggtcagagt 8580 cgttggaggt atggcatttt tcgtttttac cttttggatc ataacaaaga ccattaaaaa 8640 actatacaac tggtattttg gcaataaaat aagacgttat aatgccatgt tcggcacagg 8700 gttaggtcga gtagcccaaa atggtcaaat caatgaagta gtaacaactc tacctgtctc 8760 taaccgagaa gcaaccacaa ttgtgagtgg agtttaaact atgaaacatt taacgatact 8820 ttctttttat tgtacaataa cctggtaacc aaaagaatgc gaaattcaag gaaataaatt 8880 tatttgtaaa aggaaaaaat gggttaaaaa cgtcgaaagc taacattaga ggaaagttcc 8940 aaagaccggt caactgtttt ttcgtgctcc ctaaaggagt tcctcaaaca ccgtcgaata 9000 tattgatcag cctcacagcg tcgcccttca tccgtgaatg gtttcatttc atcattcaaa 9060 tgggaaaact aaaacataaa aaaaaaaaat aaataaccta aagaatttac cgaacctgga 9120 tacaatcttg gtcgtcggaa acggcgtttg cgaagaacag tcgataggtt aaatcggagt 9180 attttcgtag ctgtggccga agttttctga agagcaactc gacaggcgta aatcgccctt 9240 ccaccgacaa acctgacgga ttgcggagaa ggaactccga tgtgtatttg agtatatatg 9300 agtcaggatt gtagagtcta acaggtacag gagtctgctc tttcagctgc tgaagagctt 9360 ttctggacat gacctgtcga ttcgcattcc ttcggaaaaa ttccttagat ataattatta 9420 acatataaat gactcagcga aataacctct tcaacctcta ggtaagtgtc gcagtaggaa 9480 tagttatttt tcgtataacc ggattccaca accaaaaatg aacgggaagt aatattcggc 9540 cgcccacgat ccgtctttga caacacctgg aacacaaaaa taataagaga ataaaaggag 9600 aaaatattac cggtaactca gctcgcatta tcctttggat aaagtaagca tcatggtcct 9660 tggaaccttg aactaactcc ttgtgagtta gggcttgcct ggaaaccatt atctaaaaat 9720 aagataaatt gtaggaacaa aaaaggttac aacacaattt aaataagtca aaagcatgca 9780 ctaaaatccg cgaaaagtca cagatttaac ctacactggt caaccatcgc gacaggtaaa 9840 cacctaacca aaaggcccgc gactaacagg tagggtttag taaataataa caagagttat 9900 atttcacacc ttaatcaaca aaaaaaaaaa ttacctcatc cataatactt tcgttaatac 9960 ctccagactc gctcgtcgta atcatgtcgg aaaataatgg ttcgtccagt tcgtaaccaa 10020 cacctccacc aagagcctca agatccatcc tcttatctta tattttgcct ctatttaact 10080 ttctcaaatt atcgctaaag taagcgatct gagtgaaggt gttaatagat aagccaaagg 10140 tgacaaaaaa agtctagagg agcgcgtctt gagcgcacac acacacaagg ttattgccaa 10200 agatgcgagc agcagaaact ccacccaata ttactaacga gaccgaagac atggaacaag 10260 actggacaag agaaagcgaa tcagaatcat tgtacgacga agcatctcaa aacagggaag 10320 aggaactaga agaggaagta attgaggtaa aagacgatgt agataaggaa gaaagatgca 10380 aacagctaga aaaattactc ttggaggcga aacttgagtt agaaaaaaag gaaaaaaggt 10440 tgaaaactct ggaaggaaaa attaccatgc gtcctaaacc ggtaaaggat tatatcagag 10500 ggttgcgaga aataattcaa gagccaggaa aaaaagaaga aggtaatggc tggcgtattg 10560 tagaagaact agaccaaaat ttatcaggct ttctacttcc agaacctgat ttcaaacttt 10620 tcgcggtagc cggcgtaaag gaagaaggta ccatcccgat ggttactcta gcgtggtcag 10680 cagaagaagt cacaatggaa aatctaaagc tactaaccgt tagaaacttg atgaacgtgc 10740 aaatggacat aagatcccaa ggtgtgaagg aattatattt atctctaatg gaatggagcg 10800 tggcgcgaaa cgatggcgat gtttttacag taagactact tatgaatact ctggatgcgg 10860 taaaaccagg acgaaaaacg aacttacaca tatacactga ggagaaagat tttaacggaa 10920 aacgtctgaa agtggtggta aatggtcttg cgttaggtaa cctcggcaaa cgaaaaagcc 10980 cggtaaagga agaatcacca aaacgagaaa gacagtatta aaaagatggt atttacagct 11040 aaatttgggt aacttctacg aataataggt tttcgtgtac gatagttaat aaatctcttg 11100 tatatatata taattaatat ccgcacggaa actggtaaca ttaaattata actgtaataa 11160 ttatttactg caattttaac cttaaacttt ttcataacct ttgaaatgtc ggcaagagta 11220 acttcaaaga atcatcacct caaacgagcg cgccgcaaaa gaaaaacagc tgacaatcat 11280 cacctcaaac gagcgcgccg caaaagtaaa acagctgacg atcgtcactt aagcgagcgc 11340 gtcaaaaaaa gggaaaacag agcaaaaaac aggaaaaaga gaagatgttt aatctcagaa 11400 aaccaacgga cttctacgcc aatgacgaac aagaacagaa tcaacaggaa agaaggtgcg 11460 aggaagaaac aaacggccag aggacgagaa agacgattaa caaggtacga gccaacatca 11520 gacgaaattg gacaaatctg cactcctggt tcacgacatt ccagatcgac gaatttatcg 11580 ccatgacagc aaaaaatagc gacgatgctg tactgatgtc tctgaaattc gccgaaaaac 11640 cagcttggat taagctgaaa ctgctaagta acgagaattt ggagacatac gacgcagacg 11700 gaacggtatt atatctaaag aaggcatgga tgaaggttgg atggagaaca acggaagaaa 11760 tatcctgcgc aacggtattt ataaacaact tgatcttcag gaagaaagag gaagacgttg 11820 gttacgattg gacgaaagtt gaggcgattt gggaacgatc agaataagca tggatcgctc 11880 aggtcatgtt tttctaaggt gaaggatgaa aaaaggttaa aggttaaaat tttcttactg 11940 tacttttatt tttatgctgc atttcggttt gatttttttt agcctcggcc acggttcttc 12000 cattaaaaag aaggggaata a 12021 // ID hAT-64_HM repbase; DNA; INV; 3881 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-64_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3881 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2052-2052 (2008). XX DR [1] (Consensus) XX CC The sequences derived from CR1-45_HM is masked out. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2110..2853 FT /product="hAT-64_HM_1p" FT /translation="QHQQEGSTMSAHYFFPSSQNKFTKEYGVIQSAAVEEI FT QDGIYFTTIVDGTPNISHTERITFVIRFLLYEREITCWKIKERCVETFETK FT KGFDIAKLITNVILRLCMDIQKCRGQGHDKGSSRSGAYKGDQGLIRLNYSE FT ALYVXCSAHSLILAGVLAVESAIEIKVFFKNVESWSVFFSPSLTRWKILTK FT ATGISLHRLSLTRWRSHTNLIKPLAKRPREMLAAFKHIMKNLDLTSEQLSQ FT TRDLKK*" XX SQ Sequence 3881 BP; 1106 A; 543 C; 655 G; 1073 T; 504 other; cagggccttt gagagggagg ggcaaagggg gcaaattgac ccgggcgcca agcttgaagg 60 ggcgccggac aatacgtttt tttttttttt atagaataaa tagaaaacat tttaatgact 120 gacaatctga aatctgctta gtttttgcat ttgagattga aactttagta tttggaataa 180 aattgcacat gaaaaaaata atttaagttt taaattctat tcttataatt attaataaac 240 taatttttac gaaaaaaata ggatttagct gttgtagaat tagacagaag cgggcgtgtt 300 ccgtaagccc atgaacattt agcaatgcta gctcatgttc gattgtaagg aagattatgt 360 gctgtggtaa taataatggt aactcatcaa attttgcttt gaaggagttg acacagatag 420 cactgaaaac ttcaataggt agtgcgttcc atgggttcgc ttcaacttct ttaactatac 480 ggttgttgaa gaagttgaag cgaacgagac agttgtggtc gatttcacaa taaaagcgct 540 gttttttttt aagtagtatt tgactgttga ggcaaaagag agaatcatag cattgcattt 600 ttcagacaaa taagcggcct tgactctacc aggacccggc cagttatcaa atatttgagt 660 ttttgcatct tggctgcact caatatgggt aatttactgc actcaatatg ttattcgaag 720 gactgtggat cagtgggggc ttaaacaatt tccctcaagc tgagcttaat gaatttcaaa 780 gtgaatcatt agagtcagaa aaaggaactt tatctaacag tggagaatgc aaggttcaac 840 ttaacttggt aataattaat gctaaccctt ctttgtgggc agttgttttg agtcaaaaag 900 atcaagatgc aattatttta aagagatctc cagcacaaaa gcagtattca aaagcggcta 960 attgtccctt cttcgttgaa actgaggagc tagagatcaa cggaggaaac tcataagtcg 1020 tgtgaagcag catcaatgca gcccattgca ctaaaggttc tacaatgatt ggaaagaggc 1080 ctacattcgt cttaccagct ccactggtaa atatttagag ctccataagt gaataaattc 1140 tgagacagct aaatggctga caattcttcg tgaaattttg aatgtgactg ttttcctggc 1200 ttctaaggaa tttaacttta gggxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1260 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1320 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1380 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1440 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1500 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1560 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxtta tttgttggtt gttgataaaa 1620 axxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1680 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1740 xxxxxxxxxx cgtcagaagg ggaaaacggt aatttatata tttttgaaaa ttatggacat 1800 tattgttcgc atgagtcagc tgttcggaag acacagtaat ttgattgatg caagccgtaa 1860 acaccataac aaaaggaaaa aatcattaaa aaaaaatctt ctaatttctt caatagaggt 1920 ttctctgtga caagaagctt ttttaggtag ctcattataa agttttctca atatcaattt 1980 taatttttta tataattatt acaataattt gtaacatccc acttagaaaa tgtatacaat 2040 aattataatt aatttattgt aataattata taaataatta aaattgatat tgagaaaact 2100 ttataatgac aacatcaaca agaaggctcc acaatgagtg cccattattt ctttccgtcg 2160 tctcaaaaca agttcaccaa agaatatggt gttatccaaa gtgcagctgt cgaagaaatt 2220 caggatggta tctacttcac aaccatagtt gatggtaccc ctaatatatc tcatactgag 2280 cggataacct ttgtcattcg ttttctcctc tatgaaagag aaattacttg ctggaaaatt 2340 aaagaacgat gtgttgaaac ttttgagacg aagaaaggat ttgatatagc taagcttatc 2400 actaatgtga ttttaaggct atgtatggat attcaaaaat gtcgtggtca gggacacgat 2460 aaaggatcca gtaggtcagg tgcttacaag ggagatcagg gcttgataag gctaaattat 2520 tcagaagctc tgtacgttmc ctgtagcgct cacagtctga ttttggctgg tgtgctcgca 2580 gttgaatcag ccattgagat caaagtattc ttcaaaaatg ttgaatcttg gagcgtattt 2640 tttagtccaa gtctaacccg atggaagatt ctcactaagg caactggaat ttcactgcat 2700 agactttctc tgacaagatg gagatctcat accaacttga tcaaaccact tgccaagagg 2760 ccgagagaaa tgttggcagc ttttaaacat atcatgaaaa atctggatct gaccagtgag 2820 cagctcagcc aaacaagaga tttaaaaaaa tgattatcat cttttgaatt tgtattgctc 2880 gatacgattc ggtacaaaac tcttactact gttaacgatg tttgtcttaa acggattagg 2940 gctcttggtt gcaagttttc taagaggctg cagttgtagt tgaccgctag gattcgaaat 3000 tgaactcgct cctaaaagaa agcgtgagct aaagcgcttt catgatgagg catcaaacac 3060 agtccatttc catgaaagcc aaacaaaaga atttaaagtg aacattttca atgttggttt 3120 agatgatctt atccagcaga ttgattcaag atttgaaaca agcagggtgg ttggcaatat 3180 tttcattctt caggttgaat aataataatc tttccgttta aaataaagca tgagaactcg 3240 cacggtttca tccgagagat gtaagcaaaa agaagaagtt cgccgtttta ccaatgcccg 3300 caaaacaatt ttcaagcaag caatctaatg cagtttcttg atctaatctt ttagaagaag 3360 cttgaacgcc tttttccata tatttgcatc atgcctcgaa ttttcaacac catttttaag 3420 tcagttgctg agggtgaacg attgttttgt aaactgaaag ttgtaaaaaa atctccgttc 3480 gaccatgtgt caggatcacg ttactggtat ttgattattt tgattaaaga agattcgact 3540 aaaaaggtga gttacaagaa cgtcattaaa atttgggcgg ccaaaaaagc tcaaaaaatc 3600 attttaaagc gattttttac ctttcttaaa agttttattg attgattttt taatagtttt 3660 aaaactaata ttttgattca tatatcagtt ttgattaatg aatcgtttct caatccagag 3720 aattaatagt aacgcaaaaa tacaatgtgc tgtatcagaa ataaataaat tgctgcgttc 3780 acaaaatttt taaaatttta gaattttaat ttcttggttt gaattaaggg cgccgctctt 3840 catttttgac ccgggcgcca ttggctctct cgaaggccct g 3881 // ID OR2 repbase; DNA; INV; 484 BP. XX AC D32092; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE SINE element. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW OR2; Repetitive element. XX OS Octopus vulgaris OC Eukaryota; Metazoa; Mollusca; Cephalopoda; Coleoidea; OC Neocoleoidea; Octopodiformes; Octopoda; Incirrata; Octopodidae; OC Octopus. XX RN [1] RP 1-484 RA Ohshima K. and Okada N.; RT "Generality of the tRNA origin of short interspersed repetitive RT elements (SINEs). Characterization of three different RT tRNA-derived retroposons in the octopus."; RL J. Mol. Biol 243(1), 25-37 (1994). XX DR GenBank; D32092; Positions 51 534. XX SQ Sequence 484 BP; 119 A; 109 C; 125 G; 131 T; 0 other; ataatcgtga gattgtggtt tcgattcctg gaccgggcga cgtgttgtgt tcttgagcaa 60 aacacttcat tttcacattg ctccagttca ctcagctgac aaaaatgatt aacgctgcga 120 tgactggcgt cccgtccagc tggggaacac atacgccatt gaaactcgga gaccgggccc 180 atgaggctgg ctaggcttta aaagggcgca tttatttgta gaggcgcatg gcttagtggt 240 tagggtgtca gcatcatgat cgtaaaattg tggtttcaat tcctggaccg ggcgacacgt 300 tgtgttcttg aacaaaacac ttcatttcac gttgatccag ttcactcagc tggcaaaaat 360 gggtaacgct gcgatgactg acgtcccgtc cagctgggga acacatacgc cattgaaacc 420 gggaaaccgg gcccatgagc ctggctaggc tttaaaaggg cgcatttatt atttatttat 480 ttta 484 // ID Chapaev3-N1_HR repbase; DNA; INV; 1752 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-N1_HR is a family of non-autonomous DNA transposons - DE consensus. XX KW Chapaev; DNA transposon; Transposable Element; Nonautonomous; KW Chapaev3; Chapaev3-N1_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-1752 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 60-60 (2008). XX DR [1] (Consensus) XX CC Chapaev3-N1_HR belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-N1_HR is a young family of leech Chapaev3 non-autonomous CC DNA transposons: genomic copies of Chapaev3-N1_HR elements are CC ~98.7% identical to their consensus sequence, which was derived CC from multiple alignment of 10 Chapaev3-N1_HR elements. CC Chapaev3-N1_HR contains imperfect 12-bp terminal inverted repeats CC (2 mismatches) and non-functional remnants a Chapaev transposase. CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX SQ Sequence 1752 BP; 615 A; 270 C; 298 G; 569 T; 0 other; cactgcccaa cagtgttttt gtatctgaac caaaattagt tgtttcttat gtatttccac 60 ctgctgaatc cgaaaatgac attaaaaatt acatattagc tctagttcaa aagataaaag 120 caaaactcta tcctcaagag tataaatttt tattttttgg aaaaattttt ttatcttttt 180 ttattttgcc tcttggctat ctataatgct ttatttatgg aggcacaaat gaaatagaag 240 aaggaaaaga aagccagcct tccacatcaa atgattctga ttttgtgtca acagatgatg 300 caccacacag atggagttaa gcagaattaa gtgacctcgt tcgagatctt gacttgtcac 360 tggaaatggc agaactttta gggtctagat taaaacagtg gaatcttctc caatctgatg 420 ttaaagtttc ctatttcaga aaacggcaac aaaatcttct tcgttttttt tgagaagaaa 480 aataaccttg ttgtttgcgt tgatatctat ggattaatgt attgtctcaa tttgaactat 540 gatccaagtg aatagagatt attcatagat tcatcaaagc taagcttgaa agctgtatta 600 ctgcacaatg gaaatcattt tctatcagtt cctattggcc atgcagctca catgaaagaa 660 atttacctaa atatgaaaac tcttcttaac ccaatcaatt acaacgaaca caaatggaaa 720 atttgtggtg acctaaaagt attcctaaaa gtattgccac acttttagaa atgcaattag 780 gctataccaa atactgctgc ttcttgtgta tgtgggacat agagacaaaa tttcatatta 840 cacacaaaag gactggcctg caagaaatct caacaaaggt gaaaaaaatt ttgttgctga 900 aagatgtact tcttccatcc atacacatta agctgggttt aatgaaaaat tttgtcaaag 960 gaacaaagat ggacaggctt ttaagtattt aagaaacaaa tttccaaaac taagtgatgc 1020 aaaagtaaaa gagggtattt ttgttggacc atagatacat gaacttgtga aagatcctgc 1080 atttgatgaa gttttgaagg ggcaagaaaa agaaacttgg gaatctctca agggagtgat 1140 ttgtggattt ttaggccaca gaagagatga taactacatc caattgttaa cagtacttct 1200 gcaaaaatgc catcaactaa gacgtaacat gtccttgaag attcacttcc tcaactcaca 1260 tttagacttc tttccacttt gtggagctgt tagtgatgaa catagagaaa ggtcccacca 1320 agatatttct gtaatggaaa aaagatatga gggtcgttgg aatgaagcta tgcttgcaga 1380 ttaatgttgg tttatatgta gggatgctcc agagctcgca tacaggcgga aagcaaaaat 1440 ggcacaatcg cgcgaaaata ttccataata ttctttatat ttattatttt tcattgttta 1500 aaactgttag aatgtaatct ttattgaaaa aactaaatgc actttaaata ttattaaaat 1560 aagtaaatta tatgtcatat tgctatgttg tgaataagca aaaccaaaaa tttttttgtt 1620 gttgtatatc tcagaaacta gagctaataa aaaatttttg tttgcatttt cgtttttctt 1680 accccaaaat tagtataatt tgactaaaaa agttaaggat acaaaaaaaa atattttttt 1740 tgttgcccag tg 1752 // ID Mariner-12_HM repbase; DNA; INV; 3698 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3698 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 229-229 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1513..1791,1795..2961) FT /product="Mariner-12_HM_1p" FT /translation="MKNYKKIKNGVWVGDIATERVFNHDETPQFVNYGVDG FT TATGLVYAGKGETCKKMIRENRECITINPFVSLSGEPTKQLFTTKVVAFIV FT LYFKKNIIFILLGEVSLCQVIFAGKGITSQMAPQKAVSNIKNLIISTTESG FT SQDNCSLLAAYKKLDKHLTEQGIERPVVLLSDGHSSRFDFKVLNFLREKQI FT NLFVSPPDTTGVTQLLDQAPNQQLHRYYNNTRDVLFSSFQTINRERFMNIL FT GTMWNNWASAANIVAAAKRVGISKNGLNVDDMQQDKFEQAALLIDKSNDVS FT LEPSTPKKTRSVTSKDGPLATTPRSQSKVAQTKGRYGSAEYWKSMYKMSQS FT FIEESYEKSLNLAEVPGLLSVNRVKPKEVNQTKTTRVTNVHGSMEAQDVLS FT KVAFLEGEKQKKNHDSEEKKKHKLKEKELFYRCKLQCSCVEECLAKSLKEC FT SQCHSILKSVCSKMACRIDNKKPTMILPASAASSSDS" XX SQ Sequence 3698 BP; 1338 A; 539 C; 639 G; 1182 T; 0 other; caggtagagt tcccttaaaa actacgcaaa ttttgaaaat ggaatacatc cgttacaggt 60 caactatttg aatttttttt tttgtgattt gaaaacacct attatagggt ttttcttccc 120 tggaagtcgt tatccttgtt ttgccatatt tagtgagaat tagcagagtg aattttctga 180 ttttttcaaa gattcccctt aaaaactaca aggaacataa ttgtatttat atggatgaaa 240 aatttctgaa agttataaat ttttttctta ttatatcaga aattagtgta gtatgatata 300 taaggaggct tgctaactta gtccaacaaa aaatagcatt ttagctagta aagtccctta 360 aaaactacac gatttccccc ttaaatacta cattacatta ggtaaaatat gcctccacaa 420 gctcacttag aatagccagg aatggagaga atgtttatac tgatcttctt taactaagaa 480 atggtaaaat ggttataact ggtacctttc tgttcatttt gtgctcataa tctctaatca 540 agtatgtata ttctttattt ttagtgttta aataaaataa tagtatggga aaaccgcaat 600 ttaattttaa gatttctgct ggtagaaaaa gaggtagaac aaagacaaag ccagaaactc 660 gagctgaagc aatagtttac agaaaagaca taagagctaa agaaaatcaa atagcagagg 720 cagtagattg gtgcacagaa aataaaaaac gaggacaggc agcattaaaa acaggaaagt 780 ttctttaatt aaagatcgtg gtacaataga cagaagatta gatggcaaag ttaagaatgt 840 aaaaaaggaa catttgagag tgttacatcc agatgatgag aaggaaatag tttgttttat 900 cagaaacaaa aacagagccc accagggact ctctaagaag tttctgaaat tattttggat 960 gttctgaaaa tcagagatca tttgaacaaa aatggaaaag gtggaagaaa gtttataaaa 1020 ctatcaccaa gtgcaaaaac agccctacaa aatggaaggt aagtaagttt tttctttaaa 1080 ctaatttatc aacattcaaa caggtacaag ggtacagtag atattaaata gatgtaaaat 1140 actgaacctt attatccaaa ctatttagac tcgggaaatc attttggaga agatggcatg 1200 ctgaacatga tgatattgtc attaagaggc aaggccgaat taaaatgaac agagccctaa 1260 attgtactag ggagatggct actgatcatc taggtacaca ttttttatca catattgttt 1320 ttttagttta gcctgaatat aaaaagatct attttcaatt tcattctcaa gtgttctaca 1380 acttggtttt tattcttgaa taagtttata tgttttgtac gttgataatt tatgtaactt 1440 gattatatta ctaattttac aacttttttt attccagatg ccttagctga tgaactgatt 1500 gcttgtagta taatgaagaa ctataaaaaa ataaaaaatg gtgtatgggt tggtgatatt 1560 gcaacagaaa gagttttcaa tcatgatgaa acaccacagt ttgtcaacta cggtgtagat 1620 ggtactgcca ctgggttagt ttatgctggc aaaggagaga catgtaaaaa aatgattcgc 1680 gaaaacagag agtgcataac tataaaccca tttgtttcat tatcaggtga gccaactaaa 1740 caacttttta caactaaagt tgtggctttt atagttttat attttaaaaa gtgaaatata 1800 atatttatac ttttagggga ggtttcattg tgtcaggtaa tatttgctgg caaaggcata 1860 acgagccaga tggcaccaca aaaagctgtc agtaacatca aaaatttgat catctccacg 1920 acagagagtg gttcgcaaga taactgttca ttattggctg cgtataagaa acttgataaa 1980 catctcaccg aacaaggtat agagcgacca gttgtgcttc tatcagatgg tcattcgtcg 2040 cgatttgact ttaaagttct aaattttttg cgtgaaaagc aaatcaattt atttgtgtct 2100 ccgcctgata ctactggagt cacacagcta ctagaccaag caccaaatca acagctccat 2160 cgatattata ataacacccg agatgtactc ttttcctctt tccagacaat aaatcgagaa 2220 agattcatga atatccttgg aacaatgtgg aacaattggg catccgctgc caatatcgta 2280 gcagctgcta agagagttgg tataagtaag aatggtctta atgttgatga tatgcaacag 2340 gataaatttg aacaagcagc tctccttatt gataaaagta atgatgtttc tctcgaacct 2400 tctactccta agaaaacaag aagtgtgacc tcaaaagacg gacccttggc aacaactcca 2460 agatcacaat ccaaagtcgc ccaaacaaaa ggaaggtatg gttcagccga atattggaaa 2520 tcaatgtata aaatgtcaca atcattcatt gaggaatcat atgagaaaag cctcaattta 2580 gcagaagtgc ctggtttact ttcagtaaac cgggtaaaac ccaaggaggt taatcaaaca 2640 aaaacaacaa gagtcactaa tgtgcatgga tctatggagg ctcaagatgt gctttcaaaa 2700 gtggcttttc ttgaaggaga aaaacagaaa aaaaatcatg attcagagga gaaaaagaaa 2760 cataagctta aagaaaaaga actgttttat cgttgcaaac tgcaatgctc ttgtgtagaa 2820 gaatgtcttg caaaatcttt aaaagagtgt tcgcaatgtc attccattct taaatctgta 2880 tgtagtaaaa tggcttgtcg cattgataac aaaaagccga caatgattct ccctgcatct 2940 gcagcatcat cctctgattc ttaaaaaaaa aacactctat tttggttctg attcaaataa 3000 atcttaatga ctcaaagata tttctttaga tgaaatgttt gtttattgca tgtctctttt 3060 tcaaaatatt ttatgctatt catatttttt ttgtcatttt atgttattca aaaaatagtt 3120 attgttgtta ttaaagctgt tattgtgttg aataactttt aacaatgtat attttgaata 3180 ataggaaata tgaaaaataa taagaaaata cttttcttat tatttttcat atttcctatt 3240 attcaaaata tacattgtta ttgtattaaa agctagattt cctattttaa agctagtgac 3300 cccttaaaaa ctacatccat agcactgcaa aaaaaaaaaa aaaataaaca tctttaagag 3360 tttgtaaatt agttaatatt acattggtgg attagaacaa gtgtaacatt atctagaaat 3420 aagataattg caaacttaaa attaaactat taaacagtag gaggtcataa tgcataaatt 3480 gagccatttt taaaactaaa aatttgattt gcattctctt tttgaggatt ttaaaattgt 3540 aaaagaggta aatttttaat taatttgtca tctaggatgt ttagagatac tgattcaatc 3600 attatttaat aatttcattg tcattagtgg aacccagagc tgaaagttta tacgtgtagt 3660 gtttaagggg gtatgtgttt ttaagggaac tctacctg 3698 // ID BEL-16_AA-LTR repbase; DNA; INV; 832 BP. XX AC AAGE02020450; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_AA_; KW BEL-16_AA-I; BEL-16_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-832 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020450; Positions 58449 57618. XX SQ Sequence 832 BP; 204 A; 194 C; 182 G; 252 T; 0 other; tgttcgcgac agcgaaatgg ttacaatcct tttcccaacg tccctaaatt caaccctgta 60 cgcataccct gtacagcgca cgaccaacat cccttttatt tcccttcctc attccttcct 120 ttcaacgcaa aatcaacacg cacacaccaa cacctgaata agccacaagc gtacagtcat 180 cgaaggcata atgcgtgtgt acaacgtcaa cagaacgcaa tttgtgattg gaggaccaat 240 cgcgtggatt tttgttttgt atattttttt tttgctcctg agagtataaa agtaaatgta 300 tcgccacgaa gcggttgaga agttatttgt aatctataca gtgaagacat ccgaataaaa 360 cataaagtga aatgtagtga gtctatcatt tgtgaagctg agccctgagg atcgcagcaa 420 ccgctgctca tcttgttacc tgaccatcgt tgccgtttcg atttcgtcgt gttcccgttc 480 ctgttgagag tcagctgccg attgcattca tcaaattgcc accactcaga aacgtttctg 540 ctctgccgct gctgtgtgta ttattttgac taccgctgct atccaatgcc gtcatttgtc 600 gctaccttcc aaccagctag agttgttaag aaattccgcc atagttcttt tcagaattgg 660 ccaccaggtt ggtaagcgat ttgagagcta ggataggcat tgcgttgtag tcgtaatttt 720 ccaagcagag aatagtctga gttgtccggt ggttttggac ttgaatttgg attgcgatcg 780 gtcgctggct gctctggtgg tttgtgagtt tggcgttggc gtgaagccaa ca 832 // ID PFRP3 repbase; DNA; INV; 51 BP. XX AC M93045; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.falciparum repetitive sequence DNA. XX KW PFRP3; Repetitive sequence. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RP 1-51 RA Saul J.A., Yeganeh F. and Howard J.R.; RT "Cloning and characterization of a novel multicopy, repetitive RT sequence of Plasmodium falciparum, REP51."; RL Immunol. Cell Biol 70, 357-359 (1992). XX DR GenBank; M93045; Positions 1 51. XX SQ Sequence 51 BP; 17 A; 10 C; 13 G; 11 T; 0 other; gatctgtcgg aaggacagta cacagtaccc aggaaatcta cggattgtat a 51 // ID Gypsy-257_AA-LTR repbase; DNA; INV; 235 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-257_AA_; KW Gypsy-257_AA-I; Gypsy-257_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-235 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1116-1116 (2011). XX DR [1] (Consensus) XX SQ Sequence 235 BP; 69 A; 44 C; 54 G; 68 T; 0 other; tgttacgatc tagttggcaa tgccatttcg agctttaggt agtgcggtaa aatggacagc 60 taccataaca ttgatgagag tagagagaat acgatcggaa tcggttcggg tccgaactcg 120 aacacagcat tgtaactgaa caggagaaat acatgagttt agttttactt tgactatcga 180 caagctgagt tgcgtttatc tccccatatc accggctaag tattagctta tttca 235 // ID Mariner-1_PPc repbase; DNA; INV; 1303 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Mariner-type DNA transposon from the Pristionchus pacificus DE genome. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_PPc. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-1303 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 958-958 (2010). XX DR [1] (Consensus) XX CC >99% identical to consensus. XX FH Key Location/Qualifiers FT CDS 153..1184 FT /product="Mariner-1_PPc_1p" FT /translation="MVKSNPLRPAILKLFRAGIPRHDIVLRLAIPKRTVYD FT AIARYLELGSEEDRKGRGRPATVSTPRNRERIRKRIGRNRKQSMRAMAKNL FT HISNTSVRRLIKSQLTLRPYKYLKLHGLNDRQIKLRRDRCRLLLMRCARAE FT HFSTVFSDEKIFTIEGKMNSQNDRILAHDPEEAYKSGGFIGQTSHPLYVMV FT WGGVCATGKTPLVFVTPGVKVNKEFYVKHILQDALLPWARSHFGQSHWTYQ FT QDSAPSHKAKKTQDWLKAHVPDYIPTSEWPPNSPDLNPLDFSVWGVLQAKV FT STTKYKNRDTLKAALLKAWAELDTNYLRELATAYERRLKACVKAGGGHIEI FT R" XX SQ Sequence 1303 BP; 331 A; 342 C; 317 G; 313 T; 0 other; cagggtgcgc caaaattacc cgcactctat tgcatcattc ttatccattg tgcccaatcg 60 ataaatcggg acccctattg catcatgctg atgattgacc cttgacctct tcctatttaa 120 ggtttcaagt tagtaagtca tctctactag ttatggtgaa gtcaaatccc cttcgacccg 180 ctatcttgaa gctgttccgg gccggaatcc ctcgacacga cattgttctc cgactggcca 240 tccccaagag aactgtgtat gatgccattg cgaggtatct ggagcttggc agtgaggagg 300 atcgtaaggg gagaggaaga ccggccactg tctctacccc caggaaccgc gaacgaatcc 360 gcaagcgaat cggtcgcaac cgcaagcaga gcatgagagc catggccaag aatctccata 420 tctcgaacac ttctgttcgc agactgatca agtcccagct gactcttcgt ccctataagt 480 acctcaagct acacggactg aatgatcgtc agatcaagct tcgtagagat cgatgccgtt 540 tactgttgat gcgctgtgct agggccgagc acttcagtac ggtcttttcc gacgaaaaga 600 tctttaccat cgagggaaag atgaatagcc aaaacgatag gatcctcgcc cacgaccctg 660 aggaagctta caagagtggt ggattcatcg gtcaaaccag ccatcccttg tacgtgatgg 720 tttggggcgg agtctgtgct accggcaaga ctcccctcgt tttcgtaacc cctggcgtga 780 aggtgaacaa ggagttctat gtcaagcaca tcctccaaga cgcactactt ccctgggcaa 840 ggtctcactt tggtcagagc cattggacct accagcaaga ttctgcaccc agtcataagg 900 cgaaaaagac tcaggattgg ctgaaggccc atgttccaga ctacatcccc acttcggaat 960 ggcctcccaa ttcccctgat ttgaaccccc tcgatttcag tgtgtggggg gttctgcagg 1020 ccaaggtatc tactactaag tacaagaacc gcgacaccct caaggcagcc cttctcaagg 1080 catgggctga actcgatacg aactacctgc gggaattggc taccgcctac gagagacgtc 1140 tcaaggcctg tgtgaaggcg ggaggaggtc acattgagat tcgttaatat attattgtta 1200 ctattgttgc tgataaacgc tgttatgaaa atggttgctc aattcacaat acagcgggaa 1260 atatgggtga aaacattagt gcgggtaatt ttggcgcacc ctg 1303 // ID NAVIMARN1 repbase; DNA; INV; 733 BP. XX AC . XX DT 06-NOV-2007 (Rel. 12.11, Created) DT 06-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE Putative non-autonomous Mariner element - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW NAVIMARN1. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-733 RA Jurka J.; RT "NAVIMARN1: Non-autonomous DNA transposon from Nasonia parasitic RT wasp."; RL Repbase Reports 7(11), 1173-1173 (2007). XX DR [1] (Consensus) XX CC TSD: TA; ~2000 copies in the genome. XX SQ Sequence 733 BP; 263 A; 98 C; 99 G; 273 T; 0 other; cactgagaga actatatatt aaaatttact acatatgtag gaaaaattac tatattagat 60 agtaacggca ggtcatactc gctccatgac gatttttact atattcatag taaaaattgt 120 tagacttata ttaaaatttg ttacgtgttt actagaaatt actacattat atagtaaaaa 180 ttattttaag aatttctccg tgtgccagaa tttgaggtta taaagttgat ttatgcagta 240 atcagagacc aaaatctata attttttgtt tatctggtat tattaagtga taaaatatca 300 ttcgctgcta ttatttatta attttggtca atttattatt aatattaact catgagtaaa 360 tgtgattaat atatcagatt tgaaatttag tacataacag tactgtttgg gtaagtggtc 420 agcgcgctcg actcccattc cagaggtcca ggttcgattc ccgtcgccgc agaaaaattt 480 ctttctttcg tttatttctt tcacttaaca acattaaata tcaataataa taataataat 540 aatcatattc gaataagcgc gtaaaattgg ccagcaaaaa aaaaattaaa tttttaaaaa 600 tttcatttca gtatagaaat tactatctga gatagtaaaa tttaatacaa agcaagtata 660 accgtccgtt actatataat atagtaattt ttcctgcata tgtagtaaat tttaatatct 720 agttttctca gtg 733 // ID Gypsy-6_PPc-LTR repbase; DNA; INV; 309 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_PPc_; KW Gypsy-6_PPc-I; Gypsy-6_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-309 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1005-1005 (2010). XX DR Genome; chrUn; Positions 71222394 71222702. XX SQ Sequence 309 BP; 55 A; 99 C; 60 G; 95 T; 0 other; tgttgtatcg tagagttccc cctacttcac agacattgtg tcaacgggcc cgggtgtagt 60 ttagtcgccc ccgtctctag tatgtagttt gctcgtgctc tctacttgtt tacaccgaga 120 gcgcgaggtg ttgtctccct ctccctctcc ctctccctcg cctctcttcc cttccccctc 180 gctcactcta gcccccgtct agttactagt aggtgactac tcactctctc gtagttcccc 240 atcccaattt gtggataaac attagtccac ataactgtgt gtgtgttgat acaaggcaag 300 gacacaaca 309 // ID Transib-3_HM repbase; DNA; INV; 3459 BP. XX AC . XX DT 29-JAN-2008 (Rel. 13.01, Created) DT 30-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3459 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 3-3 (2008). XX DR [1] (Consensus) XX CC Transib-3_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome just a few CC million years ago (they are ~2% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of ~50 copies; it codes for a 650-aa Transib CC transposase. Like other Transib transposons, Transib-3_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 862..2811 FT /product="Transib-3_HMp" FT /note="Transib transposase." FT /translation="MSTITRYQIHQFIRQHGLSMSNEKDKLYVYNFIREYC FT SEFDAQFVTFLSKYRKRWSLSKRIDENFMKHNSVWLQKDFIKSFKNQDNKT FT ESIVSGRPVKLFDDSSNKTKSRKVAAMDLLTCSSNELLYASSCNLFKDSRR FT KDAKVVRSLSIKQPFDTEHTQNTLNEALALILDIDMSKSDYQRLRNNALSK FT GCNLYPAYNDVRAAKEQCLPEKTLWTVNDFSAEINFQALVDHTAIRLIQLQ FT KQVIDSMNDLERLTLYSKVGFDGSTSQRIYNQVTVEKENRDLLEEASLFIS FT CFVPLQLFGFINDKKIIIWSNPHPSSTLYCRPIRFSFKKETIEVLKQENQF FT INNSIANLSLTFCENLTILHKIEITMVDGKVATALSTATTSSQCCSVCGCS FT PKKMNDLNAAKNMPLTEIGLCFGLSTLHAWIKSMECLLKISYKLTVKKWAT FT RTISEKKIIQERKQKIQIRFRDEVGLIIDMPTSGGSGSSNNGNTARKFFQN FT AEKSAQILELDVSMIKMLHVILCTLSSNFLIDSSLFREYCYKTAEKYVILY FT PWYHMPQSLHHILIHGWQVVERMALPIGMLSEEAQEAQNKNFKKFRESFSR FT KCSRSKTNEDLLRRLMCSSDPLISSLRKPHHPKKNALPNGVFDLLKEVPIF FT D" XX SQ Sequence 3459 BP; 1284 A; 427 C; 493 G; 1255 T; 0 other; cacagtgggc cagtaggtgg acaaaactaa aaaaatttat gtctgaaaac tgaaattctt 60 tttaaaaagt ttttacttag atacattgag gaccttaaaa ataattacaa taaattaaaa 120 aaatatatat tctcaacaaa aactcatttc aagtaaaatt tttgcaaaaa aaaaatggcg 180 ataaagtgag aaaatgtagt ttttaaaaaa gcaaaagtgt gtaataaatt tttagtatat 240 tggttaactg tttttattaa catatcaaat aattgttaat atataaaatt tataaaaaaa 300 aaaaagttaa aaaaaaattt taaaccagtt ttagatatat tttttggtaa aaattggact 360 ttttttttaa ttttttttga aaagacaaca aggaaaaatt tgtgctctat tttgccttta 420 tctcttatac atattttgaa ggctatatgt atatgttttt ataaaattat taatttagct 480 gctagtatct gctacttttt ttaatactta taatactttt taatacttat actttataaa 540 atcaaagatg tttctttttc aagtttcttt tgtcgcaaaa taacaagtat ttatatatat 600 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 660 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 720 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 780 atatatatat atatatatat atgttaatac aactttttaa tgtttatatt attttagtct 840 aataatatta ttattagtaa tatgtcaacc atcacaagat atcaaattca tcaattcata 900 agacagcatg gtttatctat gtctaatgaa aaagacaaac tttatgttta caattttatt 960 cgtgaatact gttctgaatt tgatgctcag tttgtaacct ttttgtccaa gtatcgtaaa 1020 cgttggtctt taagtaaaag aattgatgaa aactttatga aacataattc agtatggctt 1080 caaaaagact ttattaagtc atttaaaaat caggataata aaactgaatc tatagttagt 1140 ggtcgacctg taaaattatt tgatgattca tcaaataaga caaagagtcg taaagtagca 1200 gctatggact tattaacttg ttcatcaaat gaactgttgt atgctagtag ttgtaatctt 1260 tttaaagata gcagaagaaa ggatgcaaaa gtagttcgaa gtttatctat taagcaacct 1320 tttgatacag aacatactca aaacactcta aatgaagctc tagcactcat attagatatt 1380 gatatgagca aatctgatta ccaacgcttg agaaataatg ctttgagcaa aggttgtaat 1440 ctctatcctg cttacaatga tgttcgtgct gctaaagaac agtgtcttcc tgaaaaaaca 1500 ttatggactg taaatgattt ttcagcagaa ataaactttc aagcattagt cgatcataca 1560 gcaatacgcc taatacagtt gcagaaacaa gttattgatt caatgaatga tcttgaaagg 1620 ttgactttat attctaaagt tgggtttgat ggatcaacta gccaaaggat ttataatcaa 1680 gtaacagttg agaaagaaaa cagagatctt ttagaagaag ctagtttgtt tatttcttgt 1740 tttgttccac ttcagttgtt tggattcatt aatgataaaa aaataataat ttggtctaat 1800 ccacatccat cttcaacttt atattgcaga ccaataagat tttcatttaa aaaagaaacc 1860 attgaagttt tgaaacaaga gaatcagttt attaacaatt caattgcaaa tttgtcttta 1920 acattttgtg aaaacttaac aatacttcat aaaattgaaa taacgatggt ggacggcaaa 1980 gttgctacag cattatctac agcaactaca tcttctcagt gttgttctgt ttgtggatgc 2040 agcccaaaaa aaatgaatga tcttaatgca gctaaaaata tgccattgac agagataggt 2100 ctctgcttcg gtctttcaac tctccatgct tggattaaaa gcatggaatg tcttcttaaa 2160 attagttata aattgacagt gaaaaagtgg gctacaagaa ctatatctga aaagaaaatt 2220 attcaagaac gaaaacagaa aatccagata aggtttcgtg atgaagttgg actgataatt 2280 gatatgccaa catctggtgg ctctgggtct tcaaataatg gaaacactgc tagaaaattt 2340 tttcagaatg cagagaagtc tgctcaaatt cttgaacttg atgtcagtat gataaagatg 2400 ttgcatgtga ttttgtgtac attgtcatct aactttctaa ttgattcttc actatttaga 2460 gaatactgct acaaaacagc tgagaagtat gtaatccttt atccatggta tcatatgcca 2520 cagagtctgc atcacattct tattcatggt tggcaagttg tagagagaat ggctttgcct 2580 attggcatgc tcagcgaaga ggcacaagaa gctcaaaata aaaattttaa gaaatttcga 2640 gaatcatttt ctagaaagtg ttcgcgttcc aaaacaaacg aagatttatt aaggcgacta 2700 atgtgttctt ctgacccttt gataagtagt ttgagaaaac cccatcatcc taagaaaaac 2760 gctcttccaa atggagtttt tgatcttctt aaagaggttc caatttttga ctaagtgtca 2820 atgacatagt tttgattttc gttggtgttt aagatggtgt ttactttata agtatttaat 2880 atttacataa taaaataatg aatataaaag tttgttttct tatatattcg ttataaaatt 2940 ttttgtatct ctaattgcat aaaaacttgc ttatttttat atggttgcta agggtatcta 3000 aactacacag ttaaaattta attaactgtt cctgtaaacc tttatagtta agagtgtttt 3060 aaaagttttt atttcatttt ttaaagaaat tttgaattaa gaaaaagttt caaaaaaaat 3120 taccaactgg tttctaagag cacataacag catagtaaca ccaaaaattg ataatttttt 3180 gtttgtgtta tcattcaaat acataaaata gcaatgtagc attaatatca gtttagttta 3240 taactttgat attttgagaa agtcttttta tttaggtaac tatgcaaaat gcctgttgac 3300 atgaaatata ggtatccatg gcaaccaaat aaacaatcta atggtgccat tagatttgtt 3360 atatgaaata tactcaagct gctaattatg ggaaaaacct gataaatatt ttaaaagtta 3420 ttaattttgt ccagctaata atgattttgg cccactgtg 3459 // ID BEL-641_AA-LTR repbase; DNA; INV; 613 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-641_AA_; KW Pao_Bel_Ele8; BEL-641_AA-I; BEL-641_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-613 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 613 BP; 218 A; 96 C; 121 G; 178 T; 0 other; tgttcaccaa tcgaagaaag cgaactattt tgaatttaat tcaaactctt aaaacttaat 60 actattatca atgaactaaa acttacagct aggaataaat aggttaaaac aaactattat 120 acactccgcg aaacacaaca catgaaacat ttatataagt acacatgcag gtactgataa 180 cactgataaa atgaccacac aacacagaga tagaacacga atgataaaaa ttatatgtgt 240 aggtaataag ccgaaccgaa aagaagcttg tgtggctagg aaagcagttt gcctcattta 300 gtagccaaca gattgctagg caataaagtt caattttgct ctgagattat taaaacataa 360 attatttcgg ttccaataaa ctctgtttta ttgaagttct tagttcaagt ggtagttttg 420 gtggttttgg tggaagaagt gcccgcgaat agtttttgac gaagttttga agagtttgtt 480 attcggagtg gataagcggt gtaaaacaga acagcctcaa attgctgtta gatggactgt 540 tatgcgcttg gtcgaagatt cggtgataag tgaattgaaa ccaaccctcg aaccgtggag 600 ttttctatga aca 613 // ID CR1-19_BF repbase; DNA; INV; 3345 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-19_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-19_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3345 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3345 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1590-1590 (2009). XX DR [2] (Consensus) XX SQ Sequence 3345 BP; 932 A; 979 C; 732 G; 702 T; 0 other; atggcgcccc cctcccaccc tagcgattct taccttatga gtttacgaac gttttctttt 60 gttttcactg cccttctcgc cattagtatg tctagcaatg attcttcaag tgccccttgt 120 gggatttgtg acctaccagt tacttggtct gacagaggag tagaatgtga gacatgtgga 180 acctggttcc acgcatcctg ccagcacatt ggatccgaca cttacctgaa catcaccctt 240 gacacaacat ggcgatgtgt catctgtgcc aatgcaaact acagcagcac tctctttgac 300 ctacacggag ttgaacagga gtatagcaga aaccttagct cctcctccct cccagacagc 360 gaatcaggat tcaagcctct tcactcctct actcccaccc gcaacagtca gcatggcaga 420 cagctcaatc gaccattgag gatgctgaac gtcaacttcc agtcagcttc tggaaaaccc 480 gcagagatag caaatatgtt gcacagcact aaaccagacg ttgtctttgg aacagagacc 540 tggctctacc cagacatcaa gaccacagag ttctttcctg atggcttcaa cgtctacagg 600 aaagaccgca ccggaaagat tggtggcgga gtcctcatcg cagtcaggga tcacctgcag 660 tgcacagagg tccccgagct tgaccctggc ggcgaaattc tgtggctgaa gctacttata 720 cgcaaccagc gtccactcta cctctgcgtg ttctaccgcc cttcctcttc tgacaaagca 780 gcccttgaga agctcgacat ggcacttcgc agagcctcag caatgaagaa cgcccaactg 840 atcattgccg gtgactttaa ctttccatca tggaactggg aagccatgac gctgaaacca 900 aaccccgttt acacgacact tcaccaacag ttcgtcgacc ttctatatga tacaggtcta 960 gaccagatag tacaggaccc tacacgtggg gaaaacacac tagacttagt cctcacaaac 1020 tctccatccc tcatcccccg tgttgaagtc atccccggta tctccgacca cagcatcgtg 1080 tacttcgagt acaagaccaa gccggaagta ctgcagaacg ctaaccgtcc catcctcctc 1140 tatgggcgcg caaactggga agtcatgaag caggacatgg agacactaca gcagagcttc 1200 acagactcgg aacacatgtc aacagaagaa ctgtggcaga agttcaagtt ggcattgaaa 1260 gttagcatgg ccaaacatat cccaactaag aaacctagac gaaaagaatc ccttccctgg 1320 atgacctcag gcatccatca tctcatcaga aagcgtgaca gactctacag gaagatgaag 1380 aaaagtggca gcctggacat gcggaaggaa gtcaagaatc tgaacagaga gatcaggaaa 1440 cagatgcggc gctcctactg gagatacacc gaacacctgt tctctcccac cagctcagaa 1500 gagagctcca gacctagtct gaaaagattc tggacctaca tgaagcacca gcgctccacc 1560 accactggcg tacctgccct aaagtcaaat ggaaagcttg tcattgaccc taaacaaaaa 1620 gccgaactac tgaacaaaca gttctacact gcgttcagcg aaggagccac atacacagca 1680 tcagaattca aggaccggtg cccaatgcct gactctcgta atgactttcc ctccatggat 1740 aacatcacga tcagtaccaa aggcattgag aagctcctga caagactgaa ccccaccaaa 1800 gcagcaggcc cagatggcat cactccacgc gtgctgaaag agctggccac agaactctcc 1860 cccatactca ccaccattta taagtcgtcg ctacacacag gccaggtacc caaggattgg 1920 aaggaagcat tggtgactcc agtgtttaaa aagggagaac actacaaggc atccaactac 1980 cgtcctatct ccctcaccag cgtccctgcc aaaatccttg agcatgtcct tgttagtgcc 2040 atcatgcatc accttgaatc gaacaacatc ttgtccactc aacaacacgg tttccgcaag 2100 caccactcct gcgagactca gctcctcgag tttgttgagg aagcctcgtc agctatggag 2160 agcggtgttg caactgacgt cattattatg gactttcaaa aagcgttcga tcgagtgaat 2220 cacagcctcc tcgtacacaa gttagaccac tatgggatca gagggcgcac caacagctgg 2280 atagcgaact ttctcagtga tcgcaagcag gctgtggtgg tcaatggtgc ccagtcgagc 2340 tacgtggatg tgagatcggg cgtcccacaa ggtacggtgc tgggaccgtg tcttttcctc 2400 acctacatca acgacctgcc ggagaggatc tcctccccct ctcgtctatt tgctgatgac 2460 actgccgtgt accggctgat cacttgcctt gaagactcct caaagctgca agaggacctc 2520 ggtaagctag aagactggga gagcgagtgg gatatggcct tccacccaga caagtgcagt 2580 caactgccct tgacacgtgc cagaaagccg cccaatgccc acaccagtta taagctccat 2640 aaccacaccc ttgaacgagt tccatccgcc aagtaccttg gagtcacatt gcaagcagac 2700 ctatcctggg gcaagcacat cgataacacc tactctaaag ccaatcgaac attaggattc 2760 ctacggcgca acctgagggt ttgttctagc aagacaaagg agctggcgta caaagcacta 2820 gtccgaccag ttgtggagta cgccagctcc gtttgggatc ctcacaccaa cagagacatt 2880 agcaaaattg aaaagatcca acgcagggca gcccgttttg tcctgaacag gcacagaaac 2940 acgtccagtg tttctgatat gctggaacaa cttcagtggc cttcccttca agaccgacgc 3000 cgcaccagca gactcgccat gctgtacaaa atcctgaatg gtctggcaca cgtgcgctgc 3060 aaaacactga agcctctacc tagtagcaac agatgccgaa ggggtcacag cctacagctg 3120 caacatatcc cctgtcgcac caactaccga ctcaactcct tcctccccag gactgttcga 3180 gaatggaaca acttgtccga agaaactgta cagtccccat ccctggcccg tttcattcta 3240 aaagtatcca gtgcctccta gagcattagt ggtgacctct gaccccaaca tgtcaattat 3300 tgccgaagac tcggcagctg gacattacgg gaagaagaag aagaa 3345 // ID Chapaev3-1_HR repbase; DNA; INV; 2434 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-1_HR is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-1_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-2434 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 48-48 (2008). XX DR [1] (Consensus) XX CC Chapaev3-1_HR belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-1_HR is a very young family of leech Chapaev3 CC transposons: genomic copies of Chapaev3-1_HR elements are ~99.4% CC identical to their consensus sequence, which was derived from CC multiple alignment of 20 Chapaev3-1_HR elements. Chapaev3-1_HR CC contains imperfect 12-bp terminal inverted repeats (2 mismatches) CC and encodes a 536-aa transposase (two exons). CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX FH Key Location/Qualifiers FT CDS join(460..1035,1104..2135) FT /product="Chapaev3-1_HRp" FT /note="transposase." FT /translation="MASKRRSCKNKPDVFCYICGEYTLVHNRNQVTCFIKR FT AYHAYFGIKLGDQDKDWAPHMVCKACTVYLRQWTKGKKTCLKFGIPMVWRE FT QRNHDTDCYFCSIDLTGINRKNRSSLEYPDLQSARRPVAHCDDIPVPVFHK FT LQDISDDESSSDEDQETEEEWPVVDDETPQLFSQQELNDLVRDLSLSKASA FT ELHQEYLQFFSEVQDLVYCTDIAQLLHHLGVPQYKPEDWRLFIDSSKRSLK FT CVLLHNGNQFASVPLAHSTKLKEKYEAVKYVLEMIRYDQHKWVICVDLKMV FT NFLLGQQSGFTKYPCFLCMWDSRDTAHHYTKKDWPVREELVPCRASNVINS FT PLVDRDRILFPPLHIKLGLIKQFTKALDKEGGCFNYMCQTFPGVTIVKLKA FT GIFDGPQIRRLIRDPEFEKSMNKVEQEAWNAFVLVVKNFLGNNKARNYAEL FT VNNMVTAFKKLGCNMSIKLHYLFSHMDRFPENLGSMSDEQGERFHQEMKEM FT ETRYQGRWNAVMMADYCWTLKRDIPDAEHSRVSNKRKFQP" XX SQ Sequence 2434 BP; 728 A; 522 C; 526 G; 658 T; 0 other; cactggtcaa cacccaaaaa atttcctaag gtctttcggc tgatttatgg aaattttgat 60 gtcctgaatc caaatatcac attggttttt ctagatcagg tcaactttct aagctatggc 120 catatcttca ttttcacttt tttgtgtaca gtcccaggta ttttcgctgt atgtgatgct 180 ttgttcttaa tcacttcatc actatgagga atttgcaatt tttaattaat tggtattttt 240 atgtttcccc tatgcacaca cacacacaca cacacacaca cacacacgtt caatgcatta 300 aatcacgtaa ctacagtcac gtgatcagac aatcgcagac atgcccggac gaggctgtgt 360 ataactatgg atagttacgt cagcattttc caaatctgag ccatatagca gattgcatta 420 gtataggtat cagttgcaaa cgaatcgcaa gcgaacgaca tggcttccaa gagaagatct 480 tgtaaaaata aacccgacgt attttgctat atctgcggtg aatacacact tgttcataac 540 aggaatcaag tcacatgttt cataaagcgt gcttaccatg cttattttgg tattaaactt 600 ggtgaccagg ataaagattg ggcgccacac atggtatgca aggcttgcac cgtgtatctg 660 cgtcagtgga ccaagggcaa gaagacgtgt ctgaagtttg gaattcccat ggtttggcgg 720 gagcagagaa accatgacac tgactgctac ttctgtagca tcgatctgac tgggataaac 780 agaaagaacc gaagcagcct cgagtatcct gatttgcaat ccgcacgtcg tcctgtagct 840 cattgtgatg atattccagt acctgtgttt cacaaacttc aagacatcag tgatgacgaa 900 tcctccagtg atgaagatca ggaaacagaa gaagaatggc cagttgttga cgatgaaact 960 ccacagcttt tttcccaaca ggagctaaat gatctagttc gcgacctcag cttgtcaaag 1020 gcctctgctg aactgttggc atccagattg aaaaaaaaaa cctcctctct ggctgtgctc 1080 gcatcaccct gtaccgcaac aggcatcaag agtacctcca atttttctcg gaggtgcagg 1140 acttggtgta ctgcacagat attgcacagc ttctgcacca cctgggagtg ccgcagtaca 1200 agcccgaaga ctggagactg ttcattgaca gcagcaagcg gtcactgaaa tgtgttctac 1260 tgcacaacgg caaccagttt gcctctgtgc cccttgctca ctctactaaa ctgaaggaga 1320 agtatgaagc ggtgaagtat gtgctggaga tgattcgtta tgatcagcat aagtgggtta 1380 tttgtgtcga cctgaagatg gtgaattttt tgttggggca acaatccggg ttcaccaaat 1440 acccatgttt tctgtgcatg tgggacagta gggacactgc tcaccattac acgaagaagg 1500 actggcctgt gcgagaggaa ctagtgcctt gcagagcaag taacgttatc aacagccctc 1560 tggtggacag agacaggata ctcttcccac cactgcacat caagcttggc ttgatcaagc 1620 agttcacaaa agctctagac aaggaaggtg gctgcttcaa ttacatgtgc cagacatttc 1680 cgggagtaac catagtgaag ttaaaagctg gtatctttga cggtcctcaa atccgtcggc 1740 tcatcagaga tccagagttt gaaaagtcaa tgaacaaagt ggaacaggaa gcatggaatg 1800 cttttgttct cgttgtgaaa aacttccttg gcaacaacaa ggccagaaac tatgctgaac 1860 ttgtcaacaa tatggtgact gctttcaaaa aactcggatg caacatgagc atcaaactgc 1920 attatctatt ttcacatatg gatcgttttc ctgagaatct tggatcaatg agcgacgagc 1980 agggagagag attccaccag gagatgaagg agatggagac caggtatcag ggacgttgga 2040 atgccgtcat gatggctgat tactgctgga ctctaaagag agacatccct gatgctgagc 2100 attccagggt atccaataaa cggaagttcc agccctaaat tcttaacaat gacgaacatt 2160 aatgaccctt acttaaggta ctgtgtttca gaacgctttt ttctctttga aaccaattcc 2220 gaatgagaaa tgttcataga aataaaccga ttatggcaca aaaattttac tgatttattc 2280 acaaataaaa taaaataaac atttaactta gcagaaaaac tggacctgac ctagagaaac 2340 ggatgtcatt ttcggactca gcggtgccaa attatcataa atcagttgta aaaaattagg 2400 aaatttcctc aaaaattttt tttgttgctc agtg 2434 // ID L2-9_AAe repbase; DNA; INV; 4865 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2 non-LTR retrotransposon from Aedes aegypti. XX KW L2; Non-LTR Retrotransposon; Transposable Element; L2-9_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4865 RA Kojima K.K. and Jurka J.; RT "L2 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1405-1405 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 650..1831 FT /product="L2-9_AAe_1p" FT /translation="MDACKACGSNLGRSERSVLCSGLCGSVFHPGCVGLNA FT TNFKAWTANVGLLWFCEHCRINFNPSIIDREAVIIKTMRDLLTRVDSMDLR FT IGQFGENLKSMYGLLSLTTSMRRDSTVSTRPQRFPLSSMATHDPSDFYNSI FT ARLNLDSTLQSNAANNSIQFPDEIEVELDDSQSNEHMSRGDLPRTYAAAVA FT APIVSNASEHTSASIIVPAARTTTIANSATSVTSVANASVATASVTTASVT FT DTAAATPVPRVAFSIPIDDNLSSTRASAIVPASHDENCRLKVVNRNRLIAN FT RETPDEPLKSFYVTPFTVEQTEEDIIEYLRETVSIDDSTVKCVKLVPRNKN FT ISELSFVSFKVSVSENLASVIGDRFYWPEGVEIREFQPKNGVRLNQPIQIQ FT " FT CDS 1831..4749 FT /product="L2-9_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNVSDTNANVLLNCNSPPLTVLSHPALSHELTYNKEI FT KMHYFDYSQTKALIQKFTRNFLKIGFHNARSLLKNIDNYRMLFENSKMNIF FT AIVESWLKPSVTNKSVELKGYKIVRSDRCNKDKKRGGGVAFYIKSDMKYSV FT ICKSERDNDVDYLFIKLIQSKLVCGVVYKPPDIHVSKLDVFFNKITEICST FT EANILVMGDFNINMLATDSLKTRSLVDHVNALSFELIDSWPSCHKPGCQPS FT LLDLLMGNCTTNISNAYQSSIGGISDHDLICVDYRFKNSKVKPEEYWGRDY FT HKINAESFVSDLQNCSFNRLYYCSNANEKLKCFNELLLSTLDRHAPLKKKT FT FKDPSCPWINKHINQIFSNRSEAYECWKKDKNNLRKWNNFKCLRNFANREI FT IRVKREYFTTQLNADLPTKQLWNNIKRLGLKQTNTKMGGGDVNASTLNNYF FT VSHYVPITFIENCDSTELQSEFTFRGVTNDEVLEEFTSASCDSVGVDLIPL FT KILKMSLPVTLPYITNLINYCITCSQFPEDWKVAKVIPIGKVDNPTTTKDF FT RPISILCALSKIFESILAKQLNDYLRSKNLLSPFQSGYRKSCSTVTALIKV FT ENDIREALDKKMVTVMALLDFSKAFDTISHSLLCDKLLRIFKLDYFSVKLI FT RSYLTRRSQYVEFNNKESEKITVPCGVPQGSILGPLLFSMYINDLPLTLSF FT CKFHLYADDCQLYLSDTVNNLPNTVRKMNSEIQNILKWCKLNGLLLNSKKT FT QTIIFRNKRMAINNAPKIKVDNDTIEYSDTVKNLGLLMDCKMCWNDQVNAV FT CNKVYKALHSLVVVRSCTPQHTRLLLARSLIIPLFDYGDVLFSLVSKKNLQ FT KLNSAFNTVIRYVFNLGKFDHISSYANSLLGISFTNYLKTRMCTQTFKILN FT NPPTYLRNFFAYARSLRAPLLTVPRCFSGYLKDSFRHRAIRAWNELPRKCR FT CERSYLSFKNSVDKFYNK" XX SQ Sequence 4865 BP; 1509 A; 957 C; 898 G; 1501 T; 0 other; aacaccagtt aattagtcgg aacgcgtttt tttctcaact cctaatattt gaagatttac 60 acctgtgtct gaagcgattg tggcaaaccc attcggaatt cgaccaagta ttactcgctt 120 atcgggacgt gactgaagat caaacaattt ctactaaaac cttcgtgcaa aagtgtttta 180 aaattgtgtt ccatttcgaa gttgccattc tagaacaacg aaaagtaccg aaaattgagt 240 actagttgta ttggtgttga ttggatatag ggtacactca caacattgtg cctgcagtca 300 tagctgacaa gtaccattat cgctcagttt gttttgctct gtgcatttgc acgctttttc 360 aagcggtgct gttgtgttgg atagaggagt gtttctgggt gcgtctggtg gtttcaaaaa 420 ccagaacact ttaatcgaga cgcatgatct ggttcaacac cggctttagc gaacagtact 480 aatacagcga aacaatagta gcaagatctg ggaacgttgc ttacttgttt ggtacaagca 540 agtgtttgta ctcctatatc ttctcttcat caatacttct actactcttt cgtaaggaaa 600 ttcaaatcta gtgtggattt tacttagggt gggacatata ttttgagaca tggatgcatg 660 caaggcctgc ggtagtaatc ttgggcgatc ggaaaggagc gttctgtgca gtgggctgtg 720 cggaagcgtt ttccatcctg ggtgcgtcgg tctaaatgcc acgaacttca aagcttggac 780 agcaaacgtc ggcttgctct ggttttgcga gcactgcaga attaatttca atcccagcat 840 tatcgaccgt gaagccgtca ttatcaagac aatgcgagac ctcctaacgc gcgtcgattc 900 gatggatttg agaatagggc agttcggaga aaatttaaaa tcgatgtacg gtctgctttc 960 tctgacaacc tcgatgcgaa gagattcgac tgtttcaaca cgcccacaac gttttccgct 1020 atcgagtatg gcaacacatg atccttccga tttctacaac agtattgctc gcctgaatct 1080 ggactcgact ctgcaatcca atgctgccaa caactccatc caatttcccg atgaaataga 1140 agttgaattg gatgactctc aatcgaacga acatatgtca agaggtgatt tgcccagaac 1200 gtatgcagct gcagtggcag cgcctattgt ttccaacgca tctgaacata caagtgcttc 1260 gatcatcgtt cccgctgcca gaaccaccac catcgccaac tccgcaacct ccgtcacctc 1320 cgtcgccaac gcctccgtcg ccaccgcctc cgtcaccacc gcctccgtca ccgacaccgc 1380 tgctgccacc cctgtcccgc gtgttgcttt tagtattccc atcgatgata atctttctag 1440 tactcgtgca tcggctattg ttcctgcttc acatgatgaa aactgtcgtt taaaagtagt 1500 caacagaaat cgcttaatag ctaatcgcga aacaccagat gaacccttga aatcatttta 1560 tgttacacct ttcaccgttg agcaaacaga agaggacata attgagtacc tccgggaaac 1620 cgtttctatt gacgattcga cagtgaagtg tgtgaagctt gttcctcgta ataagaatat 1680 cagcgagttg tcttttgttt catttaaagt tagtgtgtcg gagaatcttg catccgttat 1740 cggtgatcgt ttttattggc ctgaaggcgt tgaaattcga gaatttcagc caaaaaacgg 1800 agttcgactc aatcaaccta tacagattca atgaacgtaa gtgacactaa tgccaatgta 1860 cttttgaatt gcaattctcc tccattaact gttctctcac accctgctct ttctcacgaa 1920 ctgacttata ataaggaaat caaaatgcac tatttcgatt attctcaaac aaaagcatta 1980 attcaaaagt tcacgcgcaa ttttttgaaa atcggatttc acaatgctcg aagtctttta 2040 aaaaacattg ataactaccg tatgcttttc gaaaattcca aaatgaacat ttttgccatt 2100 gttgaatcat ggcttaaacc ttctgtaaca aataaatcgg tagaattgaa aggatacaaa 2160 atagttagat cggatcgttg caacaaagat aaaaaaagag gtggaggagt tgcattttat 2220 attaaatcag atatgaaata ttctgttatt tgtaaatctg agagagacaa tgatgttgat 2280 tatctgttca taaagttgat tcaatcaaaa ttagtttgtg gcgttgtgta caagcctcca 2340 gatatacatg tttcaaaatt agatgttttt ttcaacaaga ttaccgaaat ctgttcaact 2400 gaggctaata ttttagtaat gggagatttt aacattaata tgcttgctac tgattcattg 2460 aaaactcgca gtctagttga ccacgtgaat gcattgtctt tcgaacttat tgacagttgg 2520 ccatcctgtc ataaacctgg atgtcaaccg tctctgcttg acctgttaat gggaaattgc 2580 acaactaata ttagcaatgc ttaccaatca tcaatcggag gaattagtga tcatgatctg 2640 atttgcgtgg attatagatt taaaaactca aaagtcaaac ctgaagagta ctggggtcga 2700 gactaccata aaataaacgc tgaatctttc gtatctgatc tccaaaattg cagttttaac 2760 aggttgtact attgctctaa tgcaaatgaa aaacttaaat gctttaatga attgcttctg 2820 tccacccttg atcgtcacgc tcctttgaag aaaaagactt tcaaagatcc cagctgtcca 2880 tggatcaata aacatataaa tcaaattttt tcaaaccgtt ccgaagctta tgagtgctgg 2940 aaaaaagata agaacaatct tcgaaagtgg aataatttca aatgtttacg aaactttgct 3000 aatcgtgaaa taattcgtgt caaacgtgaa tatttcacca cacagcttaa tgcagatctg 3060 ccgacaaaac agttgtggaa taatataaaa cgtttaggac taaaacaaac aaatacaaaa 3120 atgggtggtg gtgatgtaaa tgcttcaacg ctcaataact actttgtttc acactatgta 3180 ccaatcacgt ttattgaaaa ttgtgatagt actgaactgc aatcagagtt tacgtttcgt 3240 ggagtaacta acgatgaagt tcttgaggaa tttacctcag cgtcctgtga ttcggtcggt 3300 gttgatctta ttccacttaa aatcctaaaa atgtctttgc cagtaacctt gccgtacata 3360 actaacttga taaattactg tataacctgc tctcagtttc ctgaagattg gaaagttgct 3420 aaagttattc ccataggcaa agttgataat ccaactacca ctaaggattt tcgtccaatc 3480 agcattcttt gcgctctgtc caagatattt gagtcaatac ttgctaaaca attgaacgac 3540 tatttgagaa gtaaaaattt attgtctcca tttcaatcag gatatcgtaa atcttgcagc 3600 actgtaactg cgttgataaa ggttgaaaac gatataaggg aagctctcga caaaaaaatg 3660 gttacagtca tggcactact agatttcagt aaagcgtttg acacaattag tcatagttta 3720 ctttgtgata aacttttacg aattttcaaa cttgattatt tttcggtaaa attgatacgt 3780 tcgtacttga cacgccgctc acaatacgtt gaatttaaca acaaagaatc ggagaagatc 3840 acagttccct gtggtgtacc ccagggatcc attttaggtc cacttctttt ttccatgtat 3900 atcaatgatc tgccactcac tctatctttc tgcaaatttc atttatatgc tgatgattgt 3960 cagctatatt tgtccgatac tgtaaataat ttacctaata ctgtaagaaa aatgaactct 4020 gaaattcaaa atattttgaa gtggtgtaaa ttgaacgggc tattattgaa ctcgaaaaaa 4080 acacagacaa ttatcttcag aaataagcgt atggcaataa ataacgctcc caaaatcaaa 4140 gtggacaatg atacgattga atattccgat actgttaaaa accttggctt actaatggac 4200 tgtaaaatgt gttggaatga ccaagtaaat gcagtatgta acaaagtata caaagctctt 4260 cattctctcg ttgttgtaag aagttgcact ccacaacata cacgtcttct acttgcaaga 4320 tctctaataa ttccactgtt tgattatggg gatgttttat tttctctagt ttctaagaaa 4380 aatttacaaa aactgaattc agcgtttaac acagtcataa gatatgtttt caatcttgga 4440 aaatttgacc acatatctag ttacgccaat tccttactcg gtataagttt caccaattat 4500 ctaaaaactc gaatgtgcac tcaaacattt aaaattctaa ataatcctcc tacgtatctt 4560 aggaactttt ttgcatacgc ccggtccttg agagcacctt tattaacagt gcctagatgt 4620 ttttctggat atcttaaaga ttcgtttcga caccgtgcaa ttagagcgtg gaatgaattg 4680 ccaagaaagt gtaggtgtga gaggagttat ctatcattca aaaatagtgt tgataaattt 4740 tataataagt aacaaagtac attggtataa gtcgtctagt aatagcatat actttgaaaa 4800 ccgttaggag gttatttgtg ctgttatact gtagttggaa ataaataaat aaataaataa 4860 ataaa 4865 // ID Copia-4_DPu-I repbase; DNA; INV; 4187 BP. XX AC scaffold_73; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_DPu_; KW Copia-4_DPu-LTR; Copia-4_DPu-I. XX NM Copia-4_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 671-671 (2010). XX DR Genome; scaffold_73; Positions 226500 230686. XX CC Positions [1511-2035] - Integrase core CC 'GGTCT' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 256..1248 FT /product="Copia-4_DPu-I_1p" FT /translation="MIDDELVPHDAERMDWIRRDILARNYLVATIESQQQR FT SLINCRTAYEMWTRLSAQHLRNAVENQHVLQQKFFEYQYQPEHDIMAHITE FT VETMASMLSDVEAPVSEIQIMTKILCTLPPSYRSFTTAWDSVPAHEKTIAL FT LTSRLLKEETMAKRWTRGQKDTNDAAFFAHNFPTITPNGARGNRTDGGRGG FT RGNRRGYNRRRPESSRHRPYNPNYCSHCQLGPHKAIHCRGRIKEEADAKAR FT DDAAAASDKKKATSPEENSDFSFLSEFTCFTARSIDDWYADSGATQHMSGQ FT RSLFRNFVPVKPNTWFVNGIGGARLQVLGHGEIVFKAKK" FT CDS 1220..4108 FT /product="Copia-4_DPu-I_2p" FT /translation="MAKSSSKPRNDKTVIVGERIGRNLYHLAITAAKPVET FT AYFMTPAAPSLAVWHQRFAHVSCRKIAKMAAQQLVKGLTIPANAAIPSQPC FT PGCIAGKMERSPFPTGRTRAVQIGQLIHSDVCGPMHVETPGGAKFFVMFTD FT DYSGFRTVYFLKFKSEVADCFKDYAEHLHTETRQKIHTLRTDNGGEYTGHS FT FRAWLSEQGIRYETSAPHTPEQNGVSERANRTIVEGARSLLHAKHLPLELW FT GEAISCAVYVLNRVTSKAAPNTPFQNWYGTKPDVSNLRIFGSTAYIHVPKI FT ERRKLDSKSLRCYFVGYSDSQKAYRFWDPISRKIKTSRDVIFDEQILYTPA FT PTSTQPNEDPFNHLILHPSIPEDESINHQLSAPLEVAQVEENVAAPPVDVH FT NGQEIPPVSVSIPDNVPNPRPDRERISPYPLRVREPKRRWEESMQSTSYVP FT EPDEPKNLNYALKSPDAALWKLAADDEYNSLMDNKTWTLTTLPPDREAIQS FT RWVLTVKPGVRGATPRYKARLVAKGYSQRPGIDYDETFAPVAKQTTLRTVL FT SFVAALNLEMCQLDIKTAFLYGELNEEIYLEQPEGYISVESKNLVCRLHKC FT LYGLKQASRVWNQHFDNFLKLFGLTPSESDPCLYLRISGNEFTAVTIWVDD FT GLVCSNSSDVITKIISHLKEHFEMRSSEANHFVGLSITRNREEKTLYISQP FT DYIKKILRRFHMDQCNPVDLPATPGVFFTRDDKNEGSIQVPFREAVGSLLY FT LMLSSRPDISFSVNQVSQFCEKPQNCHWAAVKKILAYLKRTSEHGIRFGPE FT LTSLLGFTDADFAGDTDTRRSTSGYVFLLNGGPIAWSSRRQKCVTLSTTEA FT EYVAACEAAREGVWLKRLLNELMPNWSEPIPLMCDNMSSIDLSKNPRFHQL FT TKHIHVRYHFIRLAQEENEIDVRHIPSKQQLADPFTKPLANPRFTDLRNAI FT SVVSVPIA" XX SQ Sequence 4187 BP; 1244 A; 1059 C; 908 G; 976 T; 0 other; ggttatgggc ccagcacgta cctgacttgc tgaaacaaaa tgaactcaac gctaagagaa 60 gtgagccaca tagccaagtt caatggcacc aatttctccc tatggaaatt cggaacatgg 120 ctgttactag aacaacacaa tctagttggc atagtaaatg gggaggaact ctaccctgag 180 gatgtaagac tattctatca tacacatgtt accaacagat ggtttaattc ttcacgtcaa 240 atcccttttt aggagatgat tgatgacgag ttggtccctc atgacgcaga aaggatggat 300 tggatacgta gagatatcct agccagaaat tacctcgtgg ccacgatcga aagccaacag 360 cagaggtctc tgatcaattg tcgcacggca tacgagatgt ggacacgcct ctcagcacaa 420 catctccgca acgccgtaga gaatcaacac gtgctccaac aaaaattctt tgaatatcag 480 taccagccag aacatgacat catggcacac attactgagg tggaaactat ggcgtcaatg 540 ctaagtgacg tggaggcgcc tgtcagtgag attcagatca tgacaaaaat tctttgtacg 600 ctgccaccaa gctatcggag ctttaccacc gcgtgggata gcgtaccagc ccatgagaaa 660 accattgctc tcctgacatc tcgattacta aaagaagaga ccatggctaa gagatggaca 720 agaggacaaa aagacaccaa cgatgcagct ttcttcgcac acaacttccc caccatcact 780 cccaatggag caagaggcaa ccgaacagat ggaggcagag gtggtcgagg aaaccgaaga 840 ggatacaacc gaagaagacc tgagtcatct cgacaccgtc cctacaatcc aaattattgc 900 tctcactgtc aattgggacc tcataaagcg atccactgcc gtggaagaat aaaggaggaa 960 gctgacgcca aggccaggga tgatgcagca gctgcaagtg acaagaaaaa ggcgacttca 1020 ccagaagaaa acagcgattt cagtttttta tctgaattca cgtgtttcac tgcacgttcc 1080 attgatgact ggtatgctga ttccggggcc actcaacaca tgtccggtca gcgatctctt 1140 ttcagaaatt tcgtaccggt gaaaccaaac acatggttcg tcaatggaat aggaggagcg 1200 cgtcttcaag tcctaggtca tggcgaaatc gtcttcaaag ccaagaaatg acaaaactgt 1260 gattgttgga gagagaattg gacgaaatct ctatcatttg gcaatcacgg cagcgaaacc 1320 agtcgaaaca gcctacttca tgactcctgc agcgccatct cttgccgtct ggcaccaacg 1380 ttttgcccac gtcagttgca ggaagattgc aaagatggca gcacaacaac tcgtcaaggg 1440 cctcactatt ccagcgaatg ctgccatacc ttcacaaccc tgccctggat gcattgccgg 1500 caagatggaa cgatcacctt tcccgactgg acgaacgcga gcggtacaaa tcggacagct 1560 gatccactcg gatgtgtgtg gacccatgca cgtcgagaca cctggtggcg cgaaattctt 1620 cgttatgttc actgacgact atagtggatt tcgaacagtc tacttcctga aattcaaatc 1680 cgaagtagct gattgcttca aagattatgc agaacacctc cacactgaga cgagacagaa 1740 gatccacacg ctacggacag acaacggagg agagtacacg ggccattcat tccgagcatg 1800 gctgtccgaa caaggaatca gatacgaaac atccgcgccg cacacccctg agcagaatgg 1860 agtgtccgaa agggcaaata gaaccattgt tgaaggtgct agaagtctac ttcacgcgaa 1920 acatctccct ctagaactct ggggtgaagc catctcctgt gcagtttacg tcctgaatcg 1980 tgtgacttcc aaagcagcac cgaacactcc tttccagaat tggtacggta cgaaaccaga 2040 tgtctctaat ttgcgtattt ttggttccac agcctacatc cacgtcccaa aaatagagag 2100 aagaaaactt gactccaaaa gtctgagatg ctatttcgtg ggatattctg acagccaaaa 2160 ggcctatcgt ttctgggacc caatatccag aaagatcaag accagcaggg acgtgatttt 2220 tgatgagcag atactctaca cacctgcacc aacatccaca cagcccaatg aagatccctt 2280 caaccacctg atactccatc catcaattcc agaagacgaa agtatcaacc atcaactctc 2340 cgctcctcta gaagttgcac aagtcgaaga aaatgtcgca gcccctcctg ttgatgtgca 2400 taatggacaa gaaatccctc ctgtttctgt atcaattcca gataatgtcc caaatcccag 2460 acctgatcgt gagcgcattt caccttaccc tcttcgtgtt cgtgaaccaa aacgtcgatg 2520 ggaggagtca atgcagtcga cgagctatgt tccagaacca gacgagccga aaaacttaaa 2580 ctatgcgctg aagtcaccag atgcagccct ttggaaactc gcggcggatg acgagtacaa 2640 ctccctcatg gacaacaaga cctggacact gacaacactg ccacctgatc gagaagcaat 2700 ccaatcacgc tgggtcctca cggtaaagcc tggtgtacgt ggtgccactc ctcgctataa 2760 agcaagacta gtagcgaaag gctactccca acgacctgga attgactacg atgagacttt 2820 tgcaccagtc gccaagcaga caacactgag aacggtatta tccttcgtag cagccctcaa 2880 cctcgaaatg tgtcaactag atataaaaac ggcgttttta tacggcgaat tgaatgaaga 2940 gatctacctg gaacaaccgg aaggctacat ctcagtggaa agtaaaaatc tagtgtgtcg 3000 tctgcacaaa tgcttgtacg gattaaaaca ggcatcacgc gtctggaacc agcattttga 3060 taattttcta aaactttttg gacttacacc aagcgaatct gatccatgcc tctatcttcg 3120 aatctcagga aatgaattta cagcagtcac catctgggtg gatgacggtc tcgtttgcag 3180 caacagcagt gacgtcatca ctaaaatcat cagccatctg aaggaacatt ttgagatgcg 3240 gtcatcagag gccaatcact tcgttggcct gtcaatcacg cgcaatcgag aagaaaagac 3300 tctctacatt tcccaacctg actacattaa aaagatcctt cgacgcttcc acatggatca 3360 atgcaatcca gttgatttac ctgccacgcc tggagtattc ttcactagag atgacaagaa 3420 tgaaggatcg attcaagttc ccttccgtga agctgtggga tcactactgt acctcatgct 3480 ctcatcaagg ccggacatat ctttctcagt caatcaagta tcccaattct gtgaaaaacc 3540 ccagaactgt cattgggcgg ccgtaaagaa aatcctggcc tacttgaaga gaacatctga 3600 gcatggcatc cgctttggtc ccgaactaac ttctctttta ggcttcacag acgctgattt 3660 cgcaggcgac acggatacac gacgatccac ttccggatat gttttcctac tcaatggagg 3720 tccaattgcg tggagcagcc gccgccagaa atgcgttact ctatccacaa cggaggcaga 3780 atatgttgct gcatgtgaag ccgcaagaga aggtgtttgg ttgaagcgcc ttttaaacga 3840 actgatgccg aattggagtg aaccaattcc actaatgtgt gataatatgt cttcaattga 3900 tctgtccaag aatcccaggt ttcatcagtt gaccaagcac atccatgtgc gttatcattt 3960 catccgtttg gcacaagaag agaacgagat agatgtgaga cacatcccat ccaaacaaca 4020 acttgctgat ccattcacga agcctttagc caatccccga ttcactgatc tgcgtaatgc 4080 aatcagtgtt gtctcagttc ccattgccta aattgctaaa tcttttgaac caatattcag 4140 tttaactcta ttcttttttt tttaaattcg tcatgtttga ggaggag 4187 // ID BEL-1_Cfl-LTR repbase; DNA; INV; 1184 BP. XX AC AEAB01030939; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_Cfl_; KW BEL-1_Cfl-I; BEL-1_Cfl-LTR. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-1184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01030939; Positions 6414 7597. XX SQ Sequence 1184 BP; 358 A; 361 C; 209 G; 256 T; 0 other; tgttcaaaat gcataaagtc acggaaaccg cataattaaa gccgcatttc ccaccccata 60 atttcgagga ttttggaaag tcaacggcgg tcgagaatct tctcgagacc tctattcttt 120 aacaaaaacc cgaccgttcg agcaatctct gtttgcgccg cgtataaaaa aaacgctcga 180 taacaacatt gtcaactgcc gccggaatcc acgaggcgaa cgggcatgcg cgcccgtgcg 240 tcacgatcag ctgatggcca atcagctgat ccaaaagagt tgcgcgcatg tcaacagtga 300 agaatattaa ttctgacctc atcggcggcc aatcatgtgc ggcgccgtgg ggtcaccgcc 360 acactttaaa attaattggg ccacgtcgca acggagccca aaatccgccc cagtatataa 420 cgggcccagc gcgcgggccc ggccaagtct cgcactttac gccttttcgc tcacctattt 480 ctatttcgac gcaccgctaa aacattacac gaccgctctt ttctcacgga tcaccgcttt 540 cgattttcgg aaaattctcg aacctaaacg cacgacgaca aaattctcga acctaaacgc 600 acgacgacaa aattctcgaa cctaaacgca cgacgacaaa attctcgaac ctaaacgcac 660 gacgacaaaa ttctcgaacc taaacgcacg acgacaaaat tctcgaacct aaacgcacga 720 cgacaaaatt ctcgaaccta aacgcacgac gacaaaattc tcgaacctaa acgcacgacg 780 acaaaattct cgaacctaaa cgcacgacga caaaattctc gaacctaaac gcacgacgac 840 aaaattctcg aacctaaacg cacgacgaca aaattctcga acctaaacgc acgacgacaa 900 aattctcgaa cctaaacgca cgacgaccgc tatctctttg gacttcgcta aaccgctcgc 960 tcgcttcacc gcacgcgata tcgcgcacga taatcgcacc gcttcaccgc accgctcgtt 1020 tcattacgaa ttaccgcgca ctggacactt gtaattcttt caccttttct tttactgtgc 1080 actgactaca ttaaacttct gttattttta ttgaaagttc aaagttactc tttcactaca 1140 cgaatccact atccacgatt cactgccgga cttcctcgcg aaca 1184 // ID BEL-134_AA-LTR repbase; DNA; INV; 650 BP. XX AC AAGE02019323; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-134_AA_; KW BEL-134_AA-I; BEL-134_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019323; Positions 67036 67685. XX SQ Sequence 650 BP; 233 A; 110 C; 125 G; 182 T; 0 other; tgttgggaac ccctggaagg gcaacaatgc caaacgtcaa ccttaacttg aagagatagt 60 caatagaata tgacagctat agaaattgtt tagatgacta gcgagaagct acaaaggaat 120 taaattgtgg attagttaag cgaattgaag attaaagtga attaactatt attcggtggc 180 agtctactca gcctctgctg acattacagt ttacatcaag taagaaacta ttgttctaaa 240 tgattgcttt atataaatgt ataaaactat gtaggttggg taaaccacaa cgtgagagaa 300 ccagttggga gttaaattta gattcgctaa tacggaccaa cgtagaccgt gaagccaaaa 360 atactattgt aaagctaagg aagcgattga acttcgtaag tcaatattta cattaacaca 420 aattattact aaagcttcca tttacaaaca gtcttggcac agttattaag gttagacccg 480 agttgagagt ttataacaga agcgggcaag tacgctacgg aaacaactat tgtaagttaa 540 caatgtaaca ctttttaaaa taccaattat atcacgagaa tacatttcag ctttgatcgg 600 cttatcacca accagtcgat tcaaggacgt ttctttgcct gatccgaaca 650 // ID Gypsy-8_DVir-LTR repbase; DNA; INV; 443 BP. XX AC scaffold_12963; XX DT 10-MAR-2011 (Rel. 16.03, Created) DT 10-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_DVir_; KW Gypsy-8_DVir-I; Gypsy-8_DVir-LTR. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-443 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (09-MAR-2011). XX DR Genome; scaffold_12963; Positions 2603191 2602749. XX SQ Sequence 443 BP; 146 A; 68 C; 98 G; 131 T; 0 other; tgtcgtgtat ccaataaatt attgtgagta ttattttgaa tgctaacgat taagtgttag 60 taagaatcag agaatttgac agaaggtact gctatttaac ataactgtcc tacttgtatt 120 agggttgcta ctgaaagcat ggtgtgagac aaaaaaagat agtctctgcc gagcatggcc 180 cgagtgtgtt gtataacgtc cagtttgcta tagtgttgaa tgttcactaa cagtactaaa 240 agtcagagaa gaatatagaa tgacaagagc atgagtagta tgctcgagga cgatgaaaat 300 tattcattct aaaagtggaa gtcatagaga acagacgtgc tcgctcctaa aaatataatt 360 attttgtgac tataacgaca ctcgggactt gctgtgctgc cagtatgcct aattgcaaag 420 aaagtgattt ctcagttccg aca 443 // ID BEL-2-LTR_NVi repbase; DNA; INV; 484 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-484 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 745-745 (2009). XX DR [1] (Consensus) XX SQ Sequence 484 BP; 119 A; 112 C; 91 G; 162 T; 0 other; tgttcgaata ttagtttaat atgcgaagag agagcgagaa ggttcgagaa gcttgtcacc 60 ttaagttcaa attcgacggt tttcgctagc ccgcacggca ccaatgaggt tgccatggca 120 acgcgcgctc gctcgctcct actctgtctc tctccgcctg tttcgcgagc ggtcttctgc 180 cgctttcgct gatttgtgtt attcagtttt cgtattgccg aagcgctgtg aggactcgcc 240 tcacagtgtc gaccgttaaa caatatcgtt tgttaacgcg tcggtgctct aacactcttc 300 taatactctt caattttcca tagtcaatac atttgtaatc acactctttt cttttcaata 360 aacatcaata gtttgacggt atacatagta taccttttca tctcaaattt aaagagcgtg 420 ttattatttt cacattaatt gttttgatcc acaagtctct atcaaagtcg agagactata 480 aata 484 // ID BEL-217_AA-LTR repbase; DNA; INV; 539 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-217_AA_; KW BEL-217_AA-I; BEL-217_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-539 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 892-892 (2011). XX DR [1] (Consensus) XX SQ Sequence 539 BP; 168 A; 105 C; 120 G; 146 T; 0 other; tgttcgtgta accacgaaaa agggaattaa gggactacgc gcgccatctg cttctcaaaa 60 gaaatattgt caattcatct tatctccccc gttcgtcgac atcgataatg gcccaaaatg 120 agatggaaag agaactacca tggctgcaca ccttcttccg ggacaccttg atataaaagg 180 aacctaggga catgtgtcac gctctctttt tatgctgcaa gaactcggac gtcattgagg 240 ggactgcacg gagaaaagtt tataattgaa ttaagagtgg aagtagtgaa atttgtgaac 300 attgttatta aattgaagta aaatagaagt gaagtaaaga aagttacctg aaaataaagt 360 tcgatttcgt taagttaacg cgtgtgagtg atttacttac ctgcctaagt cgcctgccat 420 ctgggataac ctgaaaagga agaaaaacta gtggtgatac ttacctgagc ctgtagtgat 480 ctggaatttc cctatcgcta aaattttgga tccgctaccg ctcgctgtgc tactcgaca 539 // ID BEL-88_AA-I repbase; DNA; INV; 6781 BP. XX AC supercont1.2; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-88_AA_; KW BEL-88_AA-LTR; BEL-88_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6781 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2; Positions 4365874 4359094. XX CC Positions [5786-6373] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..3639 FT /product="BEL-88_AA-I_1p" FT /translation="MTRKTRSQAKAKAKATTSKQLHDAIVTGICDPRGESV FT ATTSDFAREVESNHQQTNRLVRFMTSDVAADRGADCECPTCNQLVGMERSL FT QCQKCEKYFHLTCTKVSMEPPEQFTCEACILRYSVPKPPSSCGRSSNSSIR FT RARLARELALLEEERKLEEEARKEQLDRERLMNEKAAQEKIERDKQYLARK FT RDLLIQQDEEDDVVSIRSHRSTHSTGKRIEDWIDKQVKTTSSIGIDVVGET FT SKPSEPAHHTPVRSVDAHGQQCTSTPIRDEQIISKKLFAVKSEPSNFIKMD FT LISPTRTTGSITIGDSDPGEAAGGKDQMDGNSISPAIPLVNLNSLQNLLDD FT SRQTVKNTGTLPKLASNLLLPYGQWQRDTALRRTTADLEKRHQSEIDTRRR FT RELDLVEEIKRLQIQKETDTERFRKHESDTKKRLQEFSDRQAAMELRRYEE FT LQQRDAEIGRLKDAERRLTGQLHLYSQAHQSEHVVGMVGATPQLGQQRPTQ FT PGHSNSAAVPLEDFAAATGFVFNPNPVNAIPLIPSKTNSSLRDHGTRQNHD FT PHFPTFPASDRDRPSYSEHISPSPINQPTATSNAPNVGNSRLSSAGYQQVF FT LPSPEQLAARQVLSKELPSFSGDPTEWPLFFSSYSNSTSACGYSDAENLLR FT LQRALQGPAKDAVSSFLLHPSMVPQVMSTLQILFGRPEQIVHIMIEKLRTT FT PSPKPERLETLISFGLVVQNFCAHLKAVGLDWHLSNPSLLQELVDKLPANV FT KFNWALYQQQLPVVDLNVFSDYMTRVISAASSVTLLSAHQKTGKEDRPKLK FT EKAYVNAHTTMDEFSSSSKANQDGVVDARSGKNPCPACDKPGHHAGECSKF FT KALNLDSRWNMVKEFKLCRRCLIAHLRWPCKGVVCGVNGCQKLHHRLLHYD FT SSMDTNQIERSTNATVTIHRKPASSILFRVLPVTLYGKHGKVDTFAFLDDG FT SSVTLIERAIADTLGLNGHAESLCIQWTSGIDKKISNAQMVQMEISAIGSN FT KRFKVGEAYTVNGLGLPKQSLDFEQLADQFKYLKKLPVQSFQSATPGILIG FT IDNAHLLTTLKLREGRMREPIATKTRIGWAVFGRIEGEHQLEHRLMHICTR FT NSLDDLHEYVREFFTLENLGIAISPAHEGVEEQRAKEILEKTTTRTDCGKF FT ETGLLWKNDYIELPDSLPMAEKRLKCLEKRLQKIRSCTKPFAIK" FT CDS 3582..5921 FT /product="BEL-88_AA-I_3p" FT /translation="MFGEASSKNPELYEAVRNQISEFQRKGYIHEASKEEL FT DTFDLRRIWYLPLGVVTNPNKPGKVRLIWDAAAKVEGVSLNDHLLKGPDLL FT TPLLAVMFQFREREVAISADIMEMFLQILIRPVDRSVLLFLWRDSVDQPIK FT VMVTNVAIFGATCSPTQSQFVKNLNALEYEMDFPRASLAITKRHYVDDYLD FT SVDTTEEAITLAKDVTMIHGKADFFIRNWISNKVEVLEQIGETHPAAVKQF FT SASKDCKFERLLGMIWLPEEDVFSFDLSLREDTQRLLVGDVVPNKRQLLSV FT VMSLYDPLGLVSAFVIQGKILIQETWRENIGWDDKISSKIFSRWKQWLTGL FT REMSKVKVSRCYFPGYDRDSYNSLQLHIFVDASEESYAAVAYFRIKDKGEI FT RCSLVASKTKVAPLQSLSIPRIELMAALIGARLRKTIEDHHSVRVARTFLW FT SDSTTVVAWIKSDTRRYRQFVAFRVNEILSLSSVAEWRWIGTRQNVADEAT FT KWGKGPSTNPECRWYRGPAFLYEGDGDWMEEQVDVTSEELRSAFVCSHFVI FT KPTIKYERFSKFDRLKRCLAYVLRYLENLRRAVNGNPLKFGREFTSEELQK FT AEHSLWLLVQSECFPDEVAVLKHNKQPESSQKSLESSSKLANLPPLMDDQG FT ILRVDGRTDAAEYLSVDARRPIILPREHRATELLLDWYHQKYRHGNDETVL FT NEIRQRFYVAKLRTCLRKTKTRCMWCKVYKCVPVPPKMAPLPRVRLTPYVR FT AFTFVGIDYFGPYMVKVGRSAVKRLCLRA" FT CDS 5747..6781 FT /product="BEL-88_AA-I_2p" FT /translation="MYVVQGLQVCSGAAKNGPFASSTVNSICSCLYLCWDR FT LLRTVHGESRKKRCEEVVFTCLTIRAIHIEVARSLSADSCKKAIRRFIARR FT GAPQEIYSDNGTNFIGVSRELQSEIRAISTELGSTFTDAYTQWRFNPPSAP FT HMGGCWERMVKSIKLALGTIPVDDKLDDESLETLFAEAEMMINSRPLTFVS FT LQTTDEEAISPNHFLLLSSTGVQQTIKSPVTDGIRIRDSWTAMQKILDKIW FT KRWIIEYLPMITRRTKWFQQVRPISEGALVIIADEKVRNRWIRGRVVRTYP FT GKDGIVRQADVSTVGGVLRRAVAKLALLDVVADNGGSKEDDVLGESSATCG FT GE" XX SQ Sequence 6781 BP; 2014 A; 1459 C; 1650 G; 1658 T; 0 other; aaactcaaga atttaacaaa catcttcgat atgactagga aaacacgatc ccaagctaaa 60 gccaaggcga aggcgacgac tagtaagcag ctacatgacg ctattgttac cggcatttgc 120 gatccaagag gtgagagcgt agcaacaacg agcgattttg ctcgcgaagt agaaagcaat 180 catcaacaaa caaaccgtct agtcagattc atgacgtcag atgtggctgc cgatcgtggt 240 gctgattgcg agtgtcccac gtgcaatcag ctcgtcggga tggagcgtag tttacaatgc 300 caaaaatgtg aaaaatattt tcatcttact tgcaccaaag tgagtatgga accaccagaa 360 caatttacgt gtgaggcttg cattttacgt tattctgtac caaaaccgcc gtccagttgc 420 ggacgttcca gcaattcaag cataagaaga gcccggttag cccgagagct ggctctgtta 480 gaggaagagc gaaaattgga agaggaggcc cggaaggaac aactagatcg ggaaaggttg 540 atgaatgaaa aggccgctca ggagaaaatc gagcgcgaca agcagtactt ggcgcgaaaa 600 cgtgatttgt tgatacagca ggacgaggag gatgatgtgg tcagcatcag gagtcatcgc 660 tccacccatt caactggaaa acgtattgag gattggatcg acaaacaggt gaagacaacc 720 agtagtattg gaattgacgt cgttggagag acgtcgaagc cctccgagcc agctcatcat 780 acgccagtga ggtctgtgga tgcacatggt cagcaatgta cttctactcc tattcgagat 840 gagcagatca taagtaagaa gttgtttgcc gttaagagcg aaccctcgaa tttcattaaa 900 atggacctaa tatcaccgac tcgaacgacc ggtagcataa caatcgggga cagcgatcct 960 ggagaagcag ccggtggaaa ggatcagatg gatggaaatt cgatttcacc tgcgattccg 1020 ttagtgaacc tcaactcttt acagaatctt ctggatgatt ctagacagac tgtgaagaac 1080 actgggactc taccaaaact ggcaagtaat ctactccttc cgtatggtca atggcaaaga 1140 gatacagctc tccggcgcac gacagccgat ttggagaagc gtcaccagag cgagattgac 1200 acacgccgca gacgggagct agatctggtg gaagaaatca agcgcttgca gattcagaaa 1260 gagactgaca cagaacggtt tcgaaaacac gaaagtgata caaagaagag actgcaggaa 1320 ttctcagacc gtcaggcagc tatggaacta cggcggtatg aggaattgca gcaacgggat 1380 gcggaaatag gtcgcctgaa agatgcagaa agaagactaa cgggtcaact tcatctgtat 1440 agtcaagcac accagagtga gcatgttgtc ggtatggttg gtgctactcc tcagttggga 1500 caacagaggc caacacaacc agggcattcg aattcggcag cagtaccgtt ggaagatttt 1560 gctgcagcca cagggtttgt atttaatcca aatccggtaa atgcgatccc acttattccc 1620 agtaaaacaa actcttcctt gcgagatcat ggaacccgtc agaatcatga tcctcacttt 1680 ccgacatttc cggcatcgga cagagatcgg ccatcgtatt cggaacatat ttcaccctct 1740 ccgataaacc aacctacagc tacctctaat gcgccgaatg tcggaaactc gaggttgtcg 1800 tcagcaggtt atcaacaagt gtttcttcca tctccagaac agttagctgc cagacaggtg 1860 ctctccaagg aattaccctc tttctctggc gatcctaccg agtggccact gttctttagc 1920 agctacagta actcaactag tgcttgcgga tactccgatg cagaaaattt gcttcgactc 1980 cagcgagctt tacaaggacc ggctaaagat gctgtaagca gtttcctgct acatccctct 2040 atggtcccac aggtgatgtc aactttgcaa atcttgtttg gacgtcccga acaaatcgtc 2100 cacatcatga tcgaaaagct tcggactacc ccttcaccga aaccggaacg tcttgagacg 2160 ttgatatcat ttggactagt tgtgcagaac ttttgtgcgc atttgaaggc cgttggatta 2220 gattggcacc tatcaaatcc ttctttgcta caagagcttg tcgacaagtt acccgcgaat 2280 gttaaattca actgggcact atatcaacag cagttgccag tggttgacct taatgttttc 2340 agtgattata tgactcgagt tatctctgcg gctagtagcg tcacacttct cagtgctcat 2400 cagaaaaccg gtaaagaaga cagaccgaaa cttaaagaaa aggcatacgt taacgctcat 2460 acaaccatgg acgaattttc atcttcatct aaagcaaatc aagatggggt ggtagatgca 2520 agaagcggta agaatccgtg ccctgcgtgt gataaaccag ggcatcatgc tggcgagtgt 2580 tcgaagttca aagcacttaa tttggacagt cgctggaaca tggtcaaaga gttcaaactt 2640 tgtcgacggt gtttgattgc acatttacga tggccatgta aaggagtggt ttgtggcgta 2700 aacggttgtc agaagctgca tcatcgtttg cttcactacg attcgagtat ggacacaaat 2760 caaattgagc gaagcaccaa cgcaacggtt accatccatc gaaagcctgc ttcatcaatc 2820 ctctttcggg ttcttccggt tacactttac ggaaaacatg gaaaggttga tacgtttgct 2880 tttctcgatg acgggtcatc tgtcacactt attgagcgcg ccattgctga tacattggga 2940 ctgaatggtc acgctgaatc gctatgcatc cagtggacaa gcggaatcga caaaaagata 3000 tccaatgctc agatggtaca gatggagatc tctgcaatcg gtagcaacaa acggttcaaa 3060 gtgggagaag cctatactgt caacggacta ggactaccca aacaatcgtt agacttcgag 3120 caacttgctg atcagttcaa atacctaaag aagttaccag tacaaagctt tcaatcggca 3180 actccaggaa tattgatagg gatcgacaat gcgcatctac tcacgacgtt aaagctacga 3240 gaaggtcgca tgcgagagcc aatcgccaca aaaacccgca ttggttgggc tgtttttggt 3300 cgaattgaag gggaacatca actggaacat cgtcttatgc atatctgcac tcgaaactct 3360 ttagatgacc ttcacgagta cgtccgagag ttctttacgc tagaaaacct tggtatagca 3420 atttcaccag cccatgaagg cgttgaggag cagcgggcta aggagatatt agagaaaact 3480 acaacccgaa cagactgtgg caaatttgaa acaggacttt tgtggaagaa tgactatatc 3540 gagttgcctg atagccttcc aatggcggaa aagcgtctga aatgtttgga gaagcgtctt 3600 caaaaaatcc ggagctgtac gaagccgttc gcaatcaaat aagtgaattc cagcggaaag 3660 ggtatatcca tgaagcttca aaggaggagc ttgatacctt tgacttgcgg cgtatatggt 3720 acctaccact cggagtcgtg acaaatccca ataaacctgg taaagtccgc ctaatctggg 3780 atgcagctgc caaggtcgaa ggagtatcgc taaatgatca tctgttgaaa ggaccagatc 3840 tcctaacacc actgttagcg gttatgtttc aattccgaga gcgtgaagtg gcgatatcgg 3900 cggatataat ggaaatgttt ctacaaatcc tcattcgtcc tgtggatcgc agcgttctcc 3960 tgtttctgtg gagagattca gttgatcaac cgatcaaagt tatggttacc aacgtagcta 4020 tcttcggcgc aacctgttcg cctactcaat cccagttcgt taaaaatcta aacgcattgg 4080 aatacgagat ggatttcccg agagcatcat tggcaataac taaaaggcat tatgtcgacg 4140 actacctcga cagcgtcgat actactgaag aagcgattac actagccaag gacgtaacaa 4200 tgattcacgg taaagcagat tttttcatac gcaactggat atccaacaag gtggaagtac 4260 tcgaacaaat cggagaaaca catccagctg cagtaaaaca gttttctgcg tccaaagact 4320 gtaaatttga aagacttttg ggaatgatct ggctgcccga agaggatgtc ttttcattcg 4380 atttgtcctt acgagaggat acgcaacgct tgttggttgg tgatgtcgtt ccaaataaac 4440 gccaattgct tagcgttgta atgagtctct acgatcctct tggattagtt tctgcctttg 4500 tcatacaggg caaaatcctt attcaggaaa cctggcgaga aaacattgga tgggacgata 4560 aaatttcttc caagatattc tcccgatgga aacaatggtt aacgggtttg agagaaatga 4620 gcaaagtcaa agtcagccgt tgttatttcc caggatacga tagagacagc tataattctc 4680 tccagcttca tatttttgtt gacgcaagcg aagaatccta tgctgcagtt gcctattttc 4740 ggattaaaga taaaggggaa ataagatgct ctctagttgc atcgaagaca aaagttgcac 4800 cattacaatc cctctcaata cctcgaatag agctgatggc agccctgata ggagctaggc 4860 tgaggaaaac tattgaggat catcattcag taagggtcgc tcgtactttt ttgtggagcg 4920 attctacgac ggtagttgca tggataaaat ccgatacacg acgctatagg cagtttgtag 4980 catttagggt taacgaaatc ctgagtttat catcggtagc tgaatggaga tggatcggca 5040 caagacaaaa cgtagccgat gaagctacaa aatggggtaa aggtccatcc actaatccag 5100 agtgtcgatg gtatcgtgga cctgcattcc tgtatgaagg agatggtgac tggatggagg 5160 aacaagtgga tgttaccagc gaagagctaa gatctgcctt tgtttgttcg cattttgtca 5220 ttaaaccaac gatcaaatat gaacgctttt ctaaatttga ccgactgaaa agatgtttag 5280 catacgttct gagatatctt gaaaacctgc gtagagctgt caatggtaat ccacttaaat 5340 ttggacggga atttaccagt gaggagttgc agaaagcaga gcactcatta tggcttctag 5400 tacaatcaga atgtttccct gacgaagtag ccgtgttgaa gcataacaaa caaccagaat 5460 cgtcgcagaa atcgcttgag agttcgagca aactagcaaa tctaccaccg ctcatggacg 5520 atcaaggaat ccttcgagtt gacggacgga ccgatgcagc agaatatttg tctgtcgatg 5580 caagacgccc aataatactg cctcgagaac atcgtgcaac agaacttctt ctggattggt 5640 accatcaaaa gtatagacat ggtaacgatg aaactgtgtt gaacgagata cgtcaaagat 5700 tctacgtggc gaaactacgg acgtgcttgc gtaagacaaa aacccgatgt atgtggtgca 5760 aggtttacaa gtgtgttccg gtgccgccaa aaatggcccc tttgcctcga gtacggttaa 5820 ctccatatgt tcgtgccttt acctttgttg ggatcgacta cttcggaccg tacatggtga 5880 aagtaggaag aagcgctgtg aagaggttgt gtttacgtgc ttgactatca gagcaattca 5940 tatcgaggta gcgagaagct tgtctgccga ttcctgtaag aaggcgattc gcagattcat 6000 tgccagacgg ggagcgccac aggaaatata ctccgataat ggaaccaact ttatcggcgt 6060 aagtcgagaa ctacagagcg aaattcgtgc tatcagcacc gaacttggta gtacgttcac 6120 agatgcttat acgcagtggc gattcaaccc tccctcagca ccgcacatgg gtgggtgctg 6180 ggaacgaatg gtaaaatcca ttaagttggc tttgggaaca attcctgtgg atgacaagct 6240 cgacgatgag tcactggaaa cacttttcgc tgaagcagag atgatgatca actctcgtcc 6300 acttacattt gtatctttgc aaacaacgga tgaagaagca ataagcccca accattttct 6360 gttactgagc tctactggag tgcaacaaac gattaagagc cccgtaactg atggaatccg 6420 cataagagat agttggaccg caatgcagaa gatcttggac aaaatctgga agagatggat 6480 tatcgaatat cttccgatga ttacaagaag aacgaagtgg ttccaacaag ttcgtccgat 6540 tagcgaaggt gcgttggtga tcattgcaga tgagaaagtg aggaaccgat ggataagagg 6600 tcgagtagtg cgcacctacc ccggaaaaga tggcatagtt cgtcaagcag atgtgtctac 6660 agtaggagga gttttacgaa gagctgtcgc taaactcgca ttgttggatg tagtagctga 6720 taatggtggc tccaaagagg atgacgtact tggtgaatca agtgcgacat gtgggggaga 6780 a 6781 // ID Gypsy-26_IS-I repbase; DNA; INV; 4167 BP. XX AC ABJB011015085; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_IS_; KW Gypsy-26_IS-LTR; Gypsy-26_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4167 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB011015085; Positions 4043 8209. XX CC Positions [3311-3589] - Integrase core CC 'CTAGA' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 112..1809 FT /product="Gypsy-26_IS-I_1p" FT /translation="MPRIGKIEEFDQKSESWSSYIERIEVFFSANDIPESK FT RADVLSIVGREMYGLIRNLAAPAKPSTKTYVQIVQLVQNHIDPTPNETVER FT YKFHHRKQLESETIADYVAALRQLSLQCNFGVSLNDRLRDKLIDGIKDGNI FT LEKLLAEPTLTFKAAVDLATSMEVAMKDAKKIGSETEIQQDVNRLVQGRKH FT KTSLPKKASEEKGKSRTSRTPCHRCDKINQNPANCFARNKTCKKCGRTGHF FT QLVCRDRKRDPQDSRKKDKPNFTNHVTTGNSGDETPFYFIDSSYPKPITLT FT VEICNKPLRMEIDTGSPVTVINLETFQEILPNIQLKPSGITLKPYTGKPIV FT TEGMVQVEMRYNGQVKRHEIQVVKNGGPPILGRTWLQDFQVNWKEVRNVRA FT ETLTLAGVLQKHSSIFEGPGELKDYAVKLCVNDDAQPKFLRPRSVSYALRQ FT KIEEDLNRLEKEHIITPVMRSQWATPIVPVMKANGKVRICGDYKVTLNPAL FT KPDRYPLPKTEYLFVKIAYGEKFTKLDCKQAYFQIPLAEESRDLTVINTHK FT GLYQVNRVPFGITPASAIF" XX SQ Sequence 4167 BP; 1309 A; 1034 C; 1014 G; 810 T; 0 other; atggcgacga ggatgggata tatgcaacaa agacggatat aaacaactga gcgacatcgg 60 aacaaagaag accgcggtaa cgtgagtgac gccccatctc aaacctgaag aatgcctcgc 120 atcgggaaaa ttgaagaatt cgatcagaaa tcggaatcat ggtcttcgta catcgaaagg 180 atcgaagtgt tcttctctgc gaacgacatt ccggaatcaa agcgagccga tgttctctct 240 atcgttggac gagaaatgta cggactgata cggaacttgg cagcacctgc taaaccaagc 300 accaagacgt acgtacagat cgttcaacta gtgcagaatc acatcgaccc gacgcccaat 360 gaaacggtcg aacgctacaa atttcatcac cggaagcagc tggaatccga aaccatcgca 420 gactacgtag ctgccctcag acagctttca ctgcaatgca acttcggtgt ttccctcaac 480 gaccgccttc gagacaagct cattgacggc atcaaggacg gaaacattct cgaaaagctg 540 ctggctgaac cgacgctgac attcaaggca gcggtggatc ttgccacttc gatggaggta 600 gccatgaagg atgcgaagaa aatagggagc gagaccgaaa tccaacaaga tgtcaatcgc 660 ttggtgcaag ggcggaaaca caagacgtca cttccgaaaa aggccagcga agaaaaagga 720 aaatctcgca caagcaggac cccatgtcat cgctgcgata aaatcaatca aaaccctgca 780 aactgctttg caaggaacaa gacttgtaaa aaatgcggaa gaacaggaca cttccaattg 840 gtctgtcgcg acaggaaacg agatccacaa gactcaagaa aaaaggacaa gccaaacttc 900 accaaccatg tcactacagg gaactcggga gacgaaactc cgttttattt catcgactca 960 agctacccga aacccatcac actaactgtg gagatttgca acaaacctct tcggatggag 1020 attgacacag gctcacccgt gactgtgatc aacctggaga cgtttcaaga aattcttcca 1080 aacatccagt tgaaaccttc gggaataacg ctgaagccgt acaccggcaa gcccatcgtt 1140 acagagggaa tggttcaagt ggagatgcgg tacaacgggc aagtcaagcg tcatgaaatc 1200 caagtggtga agaatggggg acctcccata ctgggaagaa catggcttca ggattttcaa 1260 gtgaactgga aggaagtgcg caacgtgcgt gcagaaacct taactctcgc aggtgtgcta 1320 cagaaacaca gttccatctt tgaaggtccg ggagaactga aggactacgc cgtgaaactc 1380 tgcgtgaatg atgatgctca accaaagttt ctcagaccaa ggtcagtttc atacgccctc 1440 cgacaaaaga tcgaagagga tctgaatcgt cttgagaaag agcacataat cactcctgta 1500 atgaggagtc agtgggcgac tccaatcgtt ccagtgatga aggcaaacgg aaaagtgcgc 1560 atatgtgggg attacaaggt tacacttaac ccagccttga agccagatag gtatccacta 1620 ccgaagactg aatacctctt tgtgaagatt gcttatgggg agaagttcac caagcttgat 1680 tgtaaacaag cctacttcca gatcccctta gcggaagaat ccagagacct cactgtgatc 1740 aacacacaca aagggctgta tcaggtgaac agagtgccat ttggaataac tccagcatca 1800 gctatctttt aacgcgtcat agaactcgtt ggaaagctgc cgtacacagg tgcatttcag 1860 gatgacatcg tcgtaactgg caagaacgat caacatcacc tgcggaacct ggacatcgta 1920 ctgcaaaaac tcaaggaaag cggccttcgc ttgggaaaag aaagatgcta cttcatgcag 1980 gactccatcg agtacctagg acatgttgca gacaaagaag gaatccaccc aaatcccaag 2040 aaggtggatg cgatcctaca agcgaagacg cccaaagacc aaaaagaact tagagcattt 2100 ctaggactcc tgaactacta cgggaagttc atcagcaacg cagccgacat cctacatcca 2160 ctacatgaac tgctgcgaaa agataaaagg tgggagtggt cgagatcgtg ccagagagcc 2220 ttcaacgaag cgaagaagca actcaccaag cagagtctgc tagtgcacta tgaccctgcg 2280 cttcctctgg aactacactg tgacgcatcg agccatggca ttggtgcagt acttcaacac 2340 gtactaccca acaagcaacg gaggccgatc gccttcaggt caagaacgct ttctaaaggt 2400 gagcgaaact actctcagat tgacaaagaa gcgttgagtc tagttgaggg cgtaaagaaa 2460 tttcaccagt atttgtttgg acgccacttc acgctgtgga cgaaccacag accctggtat 2520 cgattttcaa tcccagtaag gaaatcccag gcacggtcgc agcaagaatg cagcgatggg 2580 caatcgtctt gtcagggtat gacttcgaca taaagtacgt gtcgactgat aagaacgtag 2640 tcgcagacac gctgtcgaga ctcccaatga ctgacaaggg tactatgaaa tcagaaacta 2700 caagaatgga cgaaatcaac tgtctccatc tagaaaagtt tcaagcacta ccactgacac 2760 ataagcagat cagcagagag acacgaaagg acactgtgtt gtctaaagtt ttaagactca 2820 ccatggatgg atggccggaa caagtgaatt ccgaagacct gaagtcttac ttccatcgga 2880 aaacagaact aaccgtggag ctcgactgct taatgtgggg aacgcgcgtc atcgtaccct 2940 cgaggtaccg aaaacacgtt ttgaacgact tgcatcaagg tcatgcagga atgacaaaga 3000 tgaagggact ggcacgacac tatgtgtggt ggccgctttt ggatcatgac atcgaaacga 3060 tggtctaaag ttgtcggccg tgccaggaat tgagcaacga ggcacccaag gttcccatac 3120 acccgtggga atacccaaaa aagaaatggc aacgcctgca tgcagacttc gcaggaccct 3180 ttcaaggaag gaatttcttg ctgatcattg atgcatattc aaaatggccc gaagtagtgc 3240 ccatgacctc gacgacttcg gcagccacca taagagtgtt tctaacgctc tttgcaagat 3300 atggactgac agaaagattg tgacagacaa tggcccgcaa ttccgatcaa gcgaattccg 3360 ggattttatg acaagaaacg gagtgcagca gtcattctca cccccgaacc attccgcaac 3420 aaacggtgcg gcagataact tcgtgggaac cttcaaaaag tctgtcaacg caagcctgaa 3480 gtcggggctg accatggaag aggcatgcca taacttcttg gtggcatacc ggacaacctc 3540 tcacacgaca actggctcaa gtcctgccaa aatgatcaca ggagacgaat ttcgcacaag 3600 gttcgatctt tttagaccat caatgacgga tatcgtgcga tcaaagcagg cgaagcagca 3660 cgcctcgcgc aactcgaaag aacgacacct acatctgaat gaccaggtct gggcaagaga 3720 ctacaggaat ggcaagaaat ggtcgaaagg agtcgtcgta cgagttctgg gaaatcgaac 3780 atatgagatt cgtacgaacg acagaaatac ctggaaacgg accattgaac agctgcgaca 3840 tgtacccaag taggaggagg gaatatctcc aaatcaagac gatagcccca agggcgcaaa 3900 acatcgaaaa agttcgcccc tccccgcggc atcagccagc gaaacggttc atcgtctccg 3960 atttccagtg acgactttcc aaccacgcag aacgatcctg caacgccgga ggcaggcact 4020 ccaagctcgc cagccagacc acttaatggc acacgtggcc agagcacgga gggcagctct 4080 cctcttctca gaagatcaac gcggacgcgc cagccaccaa cgcggctggg cctgcgaaaa 4140 agtgaagttt ttttaagggg gagatat 4167 // ID Waldo-2_AAe repbase; DNA; INV; 6241 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Waldo non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele2; KW Waldo-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6241 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6241 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (05-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as R1_Ele2. CC [2] Consensus update and chracterization of target sequences. CC This consensus is generated from 30 sequences with >99% identity, CC and ~100% identical to the original sequence in [1]. Both sides CC are (AC)n microsatellites. Renamed as Waldo. XX FH Key Location/Qualifiers FT CDS 815..2338 FT /product="Waldo-2_AAe_1p" FT /translation="MGSQGRLLTTGGDSGLEGGPLSMEVNNINVGETANPF FT AKSGLPRSPPKQCQQQELEQQQLEHQQSEDELHEQRTREQERRDKSQSEQE FT QRERQQQHQLQSAWKLPPRQSKVEAAKTVVDELREFVDKRHNVHKDIKDLV FT ARIQGTLGSAVKDWRKVMQRIEAVETELAATKKALAAAVVQPEKTGIAPRK FT DSKGKERSAGIGTTPSFTPKRHRSSPGDERQAVSKKPKNVNPKPADEVLRG FT GPGDIPWQEVRNKKKRGKQESARKQRPIRKKAKCEAVVVKTSEDTYAEVLR FT AMRTDPQLKEFSADVQKIRRTQTGDMLIELKKDSVNKSSTYKELTERVMGE FT KVQVKAMCPEVTLQLRDLDWITTEEEVRTAIKEQCDLETVQMTVRLRRAPL FT GTQAASIKLPVDAANKALEVGKIRVGCSVCPLKISQRPEGCYRCLEYGHLA FT RNCEGPDRSKLCRWCGDEGHKAQDCNNKQRCLICKDKSRNRHATGGPKCPA FT FKQARNSKPQWR" FT CDS 2326..5343 FT /product="Waldo-2_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="TAVEVTQLNLNHCDTAQQLLWKSVSESKCDVAILAEP FT YRVPAGNGNWIADKAKTAAIWTMGRYPIQEIVFQADEGFVIAKLNDVYVCS FT CYAPPRWTLDQFNEMLDELTDKLADRRPVVIAGDFNAWAVEWGSRLTNPRG FT RSLLEALARLDVDLGNEGTTSTYRRDGRESIIDITFCSPVLTGGLNWRVCD FT GYTHSDHQEVRYRIGGRTQCSQRASTPHEQMWKTKRFNEELFVEALRREEQ FT VLNLRSEELTAAMVRACDIAMPRKVEPKYRRRPVYWWNETLGNLRKRCLQA FT RRRMQRAKTDGEREERRVTLRAAKADLKREISLSKRTCFKELCRDANANPW FT GDAYRVAMAKISGPAVPAETCPEKLQIIVDGLFPQHAPTVWPPTPYGQEGE FT ERAIITNEELIEAAKKMKPKKAPGPDGIPNVALKAAITANPDMFRTSLQIC FT LDEGHFPAQWKRQKLVLLPKPGKPPGEPGSFRPICLLDTLGKLLERIILNR FT LMAFTEGERGLSEWQYGFRKGKSTVDAIKTVIDTADKARTKKRRGNRYCAV FT VTIDVRNAFNSASWEAIAVALHKMKVPDYLCKILGSYFENRVLSYSTSEGQ FT KTRSVNAGVPQGSILGPTLWNAMYDGVLTLRLPTGVKIVGFADDITLVVIG FT DSLERVEVLATEAVDAVENWMYEKKLVIAHQKTELLLISNRKVVQKAEITV FT GEHTIASKRELKHLGVMIDDRLNFNCHVDYACEKAARAVTALSRIMPNNSA FT ICSSKRRLLSSVATSILRYASPAWGTAMKTKRNRVKLSSTYRLMAMRVASA FT YRTISSEAVCVIAGMVPIGYVLEEDKECYEASSTRGARKIARADTMVKWQR FT EWDTAEKGRWTYRLIPNLAKWVSRSHGEVNFHLTQFLSGHGCFKKYLHRFG FT HAESPLCAECPVVEESPEHVIFECPRFAAERSEMTAVCGADVTVDNVVDRM FT CSDEVMWNAVNMAVTKIMSLLQRKWREEQAGEPEVSLQPESPNRPRHPTGR FT " XX SQ Sequence 6241 BP; 1769 A; 1444 C; 1838 G; 1189 T; 1 other; gggttgcaaa atggtgagtc gacaaccagg aaggagcgtc caacatagct ctggtcctca 60 caagccccta cctcacgctt ccacgggtct aacgatgaca aagaccgcca gctaagggtt 120 gcgtacttag ctggtagtgc aacctgggca ctgttgtcct tctgacatca gctagagtga 180 ggaggtgcca ggtggaagct tgggattttt accctttcca agcgaaactc atacatacag 240 ccgtatggaa taccaccttt ggtacttcct tcgggaggtg gccgagcgtc tggtccatct 300 gaccaggggc tcaagcatgg tctacctagg atgtggcggg ggttcatcag tgggctctgg 360 tgaatctcta caaaaaacca catatctgta agtaaccctg aacaagcgac ctggtaccgc 420 tttcaaagtg ccttagccca accattggag tgccaaccgg cacattagga tcgatgccag 480 taagatccta attacggcat actggctgca aggacaagga caaggtcaag gactgtatca 540 acacgacgtt atagagctga caatcgctct attccacttc cttagtaaga atgggtgaca 600 ctattctgaa acggcaggtg ggctaatgga acggttgccc ccgtcccgtt aaaaccgtgg 660 caggcctcct agtacgttca aagctcgcca ccttagctgg tggaagtagg ctcagatgga 720 tattcctgac tagcgtcgat ctggctctga acccggtacc ccatgaagga ccgtgtcagc 780 cctcgcgtgg acctcactgc aagcaaaggt acccatggga tcacaaggac gactacttac 840 gacgggtgga gatagcggct tggagggggg acccttaagc atggaagtta acaacatcaa 900 cgtaggtgaa acagcgaatc cgttcgcaaa aagtggcttg ccgaggtcgc caccgaagca 960 atgccaacag caggagctgg aacagcagca gctggaacac cagcagagcg aagatgagct 1020 gcacgagcag cgaacacgtg aacaggagcg gcgggataag tcgcaaagtg agcaggagca 1080 gcgtgagaga cagcagcaac atcagctgca aagtgcgtgg aagttacccc cgaggcaatc 1140 gaaggtggaa gcggcgaaaa ccgtcgtaga tgagcttcgg gaattcgtcg ataagaggca 1200 caacgtgcac aaagacatta aggacctcgt ggcaaggatt caaggaaccc tcggatccgc 1260 cgtcaaggac tggaggaaag tgatgcagag gattgaagct gttgaaaccg agttagcggc 1320 gaccaaaaag gccctggcag ctgcggtagt gcagccggaa aaaaccggca tagcccctcg 1380 gaaagacagc aaggggaaag aaaggtcagc gggaatagga actacgccaa gtttcacccc 1440 caaaagacat aggtcgtcac caggagacga aagacaggct gtatccaaga agccgaaaaa 1500 cgtaaatcca aaaccggcgg atgaagtgct cagaggggga cctggcgata tcccatggca 1560 agaggtccgg aacaagaaga agagaggcaa acaggagagt gccagaaagc aaaggccaat 1620 caggaagaag gcaaaatgcg aagctgttgt cgtcaaaact agcgaggaca cttatgctga 1680 agtcttgcgg gccatgagga cggaccctca actaaaggag tttagtgccg acgttcagaa 1740 gatcagacgc acccagacgg gagacatgct catcgagcta aaaaaagact cggtgaacaa 1800 aagctcgaca tacaaagagc taaccgagag agtcatgggc gaaaaggtgc aggtgaaagc 1860 catgtgccct gaagtgacac tccaattgcg ggatctggat tggatcacca ccgaagagga 1920 agtgaggacc gccataaagg agcagtgcga tctggaaacg gtgcagatga cagtacggct 1980 aagaagagca ccgcttggta cacaggcggc gtccataaag ctcccagtag acgcagccaa 2040 caaagcgctg gaagtcggga aaattcgagt cggttgttca gtatgtccac tgaagatctc 2100 ccagagaccg gagggatgct acaggtgcct ggaatacggc cacctggcac ggaactgcga 2160 aggaccagac agaagtaagc tctgtagatg gtgtggcgac gaaggtcaca aggcgcaaga 2220 ttgcaacaat aagcaaaggt gcctgatctg caaggacaaa agtcgcaaca gacacgcgac 2280 gggaggccct aagtgtccgg ccttcaagca agcgagaaat tctaaaccgc agtggaggta 2340 actcagctaa acctcaacca ctgtgacaca gcgcaacagc tgctgtggaa gtccgtatcg 2400 gaatcgaagt gcgacgtagc tatccttgcc gagccgtatc gagtaccagc tggaaacggt 2460 aactggatcg ctgataaggc gaagacagcg gcaatctgga cgatggggag gtatcctatc 2520 caggaaatag tgttccaggc ggacgaaggc ttcgttattg caaagcttaa cgatgtgtac 2580 gtgtgtagct gctacgcacc cccccgctgg acattggacc agttcaacga gatgctggac 2640 gagctaaccg ataagctagc cgaccggaga ccagtcgtca tcgctggcga cttcaatgct 2700 tgggcagttg agtggggcag ccgtctcacc aacccaagag ggcgtagtct actagaagcg 2760 ctggcaaggc tggacgtaga cttgggtaac gaaggaacaa ccagcaccta ccgtagagat 2820 ggccgggaat caataatcga catcactttc tgtagtcctg tgttgacggg aggcctgaac 2880 tggagagtct gcgatgggta cactcatagc gatcatcagg aggtccggta tagaattggt 2940 ggaagaacac aatgctcaca gagagcgagt acacctcatg agcagatgtg gaaaacgaaa 3000 cgttttaacg aagagctttt cgtggaagcc ctcagaagag aagagcaggt tctgaacttg 3060 agatcggagg agttaacggc tgcgatggta cgagcatgcg acattgccat gccacggaag 3120 gtcgagccga aatacagacg tcgtccggta tactggtgga atgagacact tggcaacctc 3180 cgcaaaagat gtctccaagc caggagacgg atgcaaagag ccaaaaccga tggtgagagg 3240 gaagagcgaa gagtaacatt gagggcggcg aaagccgatc tgaaaagaga aatttcgctg 3300 agcaagagaa cgtgcttcaa ggagttgtgt cgcgatgcca atgcaaatcc gtggggagat 3360 gcatacaggg tggcgatggc caagatcagt ggtccagccg tacccgctga aacatgccca 3420 gagaagttgc agattatcgt cgacgggttg ttcccacaac acgcgccaac ggtctggcca 3480 ccgacaccgt atggccagga aggtgaagaa agagcaatca ttaccaatga ggagctcata 3540 gaggcggcca agaaaatgaa gccgaagaaa gcgcctggcc ctgatggtat ccccaacgta 3600 gcgttgaaag cwgctatcac ggccaaccca gatatgttcc ggacatcact ccagatctgc 3660 ttagatgagg gacatttccc agcacagtgg aaaaggcaaa aactggttct gctaccgaaa 3720 ccgggtaagc cccccggtga accaggctcg tttcggccga tatgtttgct tgacacactc 3780 ggaaagctgt tggagaggat catcctcaat agactgatgg cgttcactga gggagagcgt 3840 ggactgtcgg agtggcagta tggattccgg aaaggaaagt caacggtaga tgccatcaaa 3900 acagtgattg acacagccga caaagcaagg acaaaaaaac gtagaggtaa ccgttactgt 3960 gctgtcgtga cgatagatgt gagaaacgcc tttaacagtg caagctggga agccattgct 4020 gtggcgttac ataaaatgaa ggttccggat tatctgtgca agatactagg aagctacttt 4080 gaaaaccggg tcttgagcta cagcactagc gaaggacaga agacgaggtc ggtaaatgcg 4140 ggagttccac agggctctat cctgggtcca acgttgtgga atgctatgta cgacggagtg 4200 ctgacgctta ggctgccaac aggcgtcaaa atcgttggat ttgcagacga catcacgctt 4260 gtggtcattg gtgactcact ggaaagggtg gaggttctcg ctacagaagc agtggacgca 4320 gtcgaaaact ggatgtacga gaagaaactg gtgatagccc accaaaaaac ggagctgttg 4380 cttatcagca atcgaaaagt ggtgcagaag gctgagatta cagtgggaga acacaccata 4440 gcctcaaagc gggagctaaa acacctcggc gtaatgatcg acgatcggct gaacttcaac 4500 tgccatgtcg attacgcgtg cgagaaagca gcaagggcag tcacggcgct gtccaggatc 4560 atgccgaata actcggcgat ttgcagcagt aagcggaggc tattgtctag cgtggctacg 4620 tcgatcctga ggtacgctag cccggcgtgg ggaaccgcaa tgaagacgaa gaggaaccga 4680 gtcaagctta gcagcacgta taggctgatg gccatgcggg tagcgagtgc ctaccgaacc 4740 atatcttcgg aggcagtatg cgtgattgcc ggaatggtcc ctatcggcta cgttttggaa 4800 gaggataagg agtgctacga agctagtagt actagaggag cccggaagat tgcccgagcc 4860 gacacgatgg tgaaatggca acgcgagtgg gataccgctg agaagggaag gtggacttat 4920 cgccttattc caaatctggc gaaatgggtg agcagatccc atggagaagt caacttccac 4980 ttgacgcagt tcttgtcagg tcacggctgc ttcaagaaat atctgcacag gttcggccac 5040 gcagagtcgc cgctctgcgc tgagtgccca gtcgtagagg agtcgccgga acacgtcatt 5100 ttcgagtgtc cacgttttgc tgcggaacgt agcgagatga cagcggtctg cggtgctgac 5160 gtaaccgtag acaacgtagt ggatagaatg tgtagcgatg aggtgatgtg gaacgcggtg 5220 aacatggcgg tgacgaaaat aatgtcattg cttcagcgga agtggaggga ggaacaagcc 5280 ggtgaaccgg aagtcagtct ccaaccggaa tcgccgaacc gacctcggca cccaaccggt 5340 aggtaagctt gatccaccgt cggggacaag accgagtaga tcgtggaaaa gccccggaaa 5400 ttgtaggctt cgtggccgtg cggttagcgg cgtcagtcgt ttaggcatat tgtgccatga 5460 agtgtgggtt cgattcccac tccagtcggt ggaaactttt cgtcaaacga aaaattcatc 5520 attgggctac tgggtgtttc gtgttgtccg ttgcctaatg tttgtgattg ttcagtctgt 5580 gcagccttga gctgaagacg gtgtaaattg tcttgccaaa ttggtcgtca gggcacctgt 5640 gaaccggaag tcatccacca accggaatcg caggatcgac cttggcacct atccgggcag 5700 catcggtacc ggaagaactt ccacctccgg ggaagtcttc gtcggggtag ggtagattca 5760 ccgccgggga ctatccgaat agtgcgcgag aacctggagt gctaaatggc atgcgtccag 5820 caccgagaga acttccttcg tcagggagtt tctggcaccc atggtatcgt gagccgaatg 5880 gctgaatatg gcatgtttag atcacaaact aggaatacta acaaattatt aatgggagcc 5940 gaagggctca gcatggcatg gattaggaac taacaaatta atgatggagc cgaagggctt 6000 aatgaagggt gcgcgtagca tgtgtcgcct cttcttcgaa gtaataccgc aaggtggtgc 6060 cggagaagaa ttctttacgg tgacggtgta cctaggagac tgttttttag tgggtaggcg 6120 ccccattgcg aatcccacac agtgccaaac catacactgt ggcatgagca tgaacaaata 6180 caggccagtc ttgagaagat ttttttaact cctagagatg catgaaaaaa aaaaaaaaaa 6240 a 6241 // ID CR1-60_AAe repbase; DNA; INV; 3510 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-60_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3510 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1147-1147 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 21 sequences with >91% CC identity. Closely related to T1 and Q. This consensus is likely CC 5'-truncated. XX FH Key Location/Qualifiers FT CDS 1..3357 FT /product="CR1-60_AAe_1p" FT /translation="QQACTGKYQFRNCSSLCDSVPLTQPTVRTAGRDVTPE FT XNSLHNRSXLQPGRTVESFLEAPSSSNTVVHNPASAHHSRPGPVVSRCSGV FT FQSALTGKYDYQNELPLSHINSLFSAVTDGTLGNMMSTTECNQAEAPRLEQ FT IPTLNLGTRSNCKSMMNQHRADNVVIYYQNVGGMNSLLEDYRLSVSDNCYD FT IIALTETWLDCRTTSHSVFGSNYEIFRCDRNRRNSRKSSGGGVLLAVSRKL FT TAYPIENDSWTCLEQVWISIKLADRTLFLCVVYIPPDRVRDNDLIETHCRS FT VLEVAENASANDEVIVLGDFNMSGISWRSSGNGFLYPDPQRSTLHPAATTL FT IDGYSTATLTQINSVTNENNRILDLCFVSSIEVAPPIYRALAPLVKSTIHH FT TPLVISVENQLICEFTIAPKAVHYEFSKADYHGIANLLSSLDWDTILDAED FT VDAAVETFSNVLSYVIDRHVPKQVFQDKTRIPWMTNELRRLKTSKRAALRQ FT YTKHRTPQLKCRYLRLNKEYKRVSQRSYSLYQRGIQRKLRANPKAFWNFVN FT EQRKESGLPSSMLYNGRLGSSTSEICQLFGAKFAEVFSTETLDADQVNLAV FT NGLPIFSHSLSTFHIDESMIADAASQLKTSFNPGPDGVPPAFLKTNIDHLL FT HPLCRLFRLSLSCGTFPSLWKKAHMFPVHKKGCKRDVNNYRGITSLSAVSK FT LFELVVMGPLLSHCKQYLSDDQHGFIAGRSTTTNLLCLTSYATESMTQRAQ FT TDVIYTDLSAAFDKLNHAIAIAKLERLGINGNLLQWFHSYLTDRHLLVAIG FT DCHSDTFSASSGIPQGSHLGPLIFLLYFNDVNLALKGPRLSYADDLKIFLR FT IRSIDDCRFLQQQLDAFANWCSVNRMVVNPDKCSVITFSRKKQPFVYQYTL FT RGTDIERVEHVKDLGVILDSQLTYKQHISYVVDKASRTLGFIFRCAKNFTD FT VHCLKALYSSLVRSSLEYCSTVWSPSYSNGAERIESIQRRFLRYALRRLPW FT NNPFRLPSYESRCQLIRLEPLSSRRDTAKALLIADALGNRIDCPSILEQIN FT VNVQPRALRNSAMLRLPYRSTNYAMHGAINGLQRTFNRVSSLFDYHLSREV FT LRSRFSEYFRR" XX SQ Sequence 3510 BP; 928 A; 874 C; 727 G; 979 T; 2 other; caacaagctt gcacaggcaa gtaccaattt cgtaactgtt cttcgttgtg tgattcagtt 60 cccctcacgc agcctactgt acgaacggct gggcgtgacg ttactcctga aamcaattct 120 ctacacaacc gatcgamcct tcagccagga cgcaccgtag aaagcttttt ggaagcccct 180 agttcctcta acacagtcgt gcataatcct gccagcgctc atcacagccg tcctggccct 240 gttgttagta ggtgttcggg ggtcttccag tctgctttga caggcaagta cgactatcag 300 aacgaacttc cgctgtctca tatcaactca cttttcagtg ctgtgactga tgggacactc 360 ggtaatatga tgagcacaac ggaatgcaac caagccgaag cgccacgctt agaacaaatt 420 ccaacgttga accttgggac tcggagtaac tgtaagtcta tgatgaacca acatcgtgcc 480 gacaatgtag tcatatacta ccaaaatgtc ggtggtatga atagcttgtt ggaggattac 540 cgcttatctg tttctgacaa ttgctacgac ataattgctc taaccgaaac ctggctcgat 600 tgtcgcacca catcccactc agttttcggt tccaactatg aaatttttcg ctgcgaccgc 660 aatcgaagga acagtcggaa gtcatcagga ggtggcgtcc tgctagccgt aagccgaaaa 720 ttaacagctt atccaatcga aaacgattca tggacctgct tggaacaagt ttggatatcc 780 ataaaattag cagaccgcac gctgtttcta tgtgttgtat acataccacc agatcgcgta 840 cgtgacaatg atctcatcga aactcattgt cgatcagttc tggaggttgc agaaaacgct 900 tctgctaacg acgaggtcat cgtgttgggt gatttcaaca tgtctggcat ttcttggcgg 960 tcatccggga atggttttct atatccagat ccccagcgtt caaccctgca tccggctgca 1020 acaactctca tcgacggtta tagtacagct actcttactc aaatcaacag tgtgaccaac 1080 gaaaacaatc ggattttaga tctttgcttt gtcagttcga tcgaagtggc tccaccgatt 1140 tatcgcgccc tcgccccgtt agtgaagtca accatccacc atacccctct tgttatttcc 1200 gtcgaaaacc aattgatttg cgagtttact atcgctccca aggctgttca ctatgagttc 1260 tcaaaagccg attaccacgg tattgctaat ctgctttcta gcctggattg ggacactatt 1320 cttgacgccg aagatgtgga tgcagcagtg gaaaccttct cgaatgtgct ctcctacgtt 1380 atcgatcgac atgttccgaa acaggttttt caggacaaaa cgcgtatccc ttggatgacc 1440 aacgagcttc ggcgactaaa aacttcgaaa cgagccgcct tacgacaata caccaagcat 1500 cgcactcctc agctaaaatg ccggtacctc agattgaaca aggagtataa gcgcgtgagt 1560 caacgctcct actcacttta tcaacgtgga attcaacgaa aactgagagc taatccgaaa 1620 gctttttgga attttgtgaa tgagcagcgt aaagagtctg gcttgccctc atcaatgtta 1680 tacaacggcc gattgggatc atcaacttca gagatttgtc aactcttcgg tgccaaattc 1740 gccgaagtgt tctccaccga aacgttggat gcggaccaag tcaaccttgc cgtcaatggg 1800 ctgcctatat ttagtcattc gctaagcacc ttccatattg atgaatcgat gatagccgat 1860 gctgcttccc aacttaaaac atcattcaat ccaggacctg acggtgtacc tccagctttt 1920 cttaagacaa atatcgatca cttactgcac cctctttgtc ggctgttccg attgtcatta 1980 tcatgcggaa ctttcccttc gctctggaag aaagcacata tgttcccggt tcataagaaa 2040 ggatgcaaac gagatgtaaa taattatcgt ggaataacat cgctgagtgc agtttcgaag 2100 ttattcgaac tagtagtcat gggaccacta ctttcccact gcaaacagta tttgagcgat 2160 gatcagcacg ggttcatcgc tggtcgctct acaaccacca acttgttgtg tctcacttca 2220 tacgcaactg aaagtatgac ccaaagagcg cagacagatg tgatttatac tgatttgtcc 2280 gcggcctttg acaagctaaa ccatgccata gcgatcgcaa aattagaaag gcttggaatt 2340 aacgggaatc ttttacaatg gtttcatagc tatttgacag accgccatct acttgttgcc 2400 ataggagact gtcactccga cactttctct gcatcatcgg gtatccctca aggaagccac 2460 ctgggccccc tgatatttct cctgtatttc aacgatgtga atttggcgct taaaggacca 2520 cgattgtctt acgcggacga tctcaaaata tttctccgca tacgctccat cgacgattgc 2580 cgttttctgc aacagcaact tgacgccttt gccaactggt gcagtgtgaa tcgtatggta 2640 gtcaaccctg acaagtgctc tgtgataacc ttctcaagga aaaagcaacc gtttgtctat 2700 caatacactt tgcgtggaac ggacatagaa cgagtagagc atgttaagga cttgggagta 2760 attttggatt cccaactcac gtacaaacag cacatttcgt acgtagtaga caaggcatcc 2820 agaacgcttg ggtttatttt tcgctgtgcc aaaaacttca ccgacgttca ttgtctgaag 2880 gcactttact cttccttagt ccgttcctcg ctagaatact gctcaactgt atggagccca 2940 tcctacagca acggcgcaga aagaatagaa tccattcaac ggcgtttcct tcgatatgcc 3000 cttcgaagac taccctggaa taatcctttt cgtctgccca gctatgagag ccgatgtcag 3060 ctgattcgtt tggagccact ttcatccaga cgggatactg ctaaggctct tctgatcgct 3120 gatgcactgg gtaaccggat tgactgccca tctatattgg aacagataaa cgtcaacgtt 3180 caacctcggg ctttgcgtaa cagtgcaatg ctgcgactgc cctatcgtag cacaaactac 3240 gcaatgcatg gagctatcaa tggactccaa cgaaccttca atcgcgtttc gtcactcttt 3300 gactaccact tgtctcgcga agtactccgt agtagattct cagaatattt tagaagatag 3360 ttgtatttaa atgtctgttt ttgctttaat ttgtttgtaa attttgttcg cttttattgt 3420 ttgattagtg acgtttcatg tacttatttt tttttatcat catttgggca actcagcctg 3480 ttgatgaata ccaataaaca ataaaaataa 3510 // ID hAT-31_HM repbase; DNA; INV; 3826 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-31_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3826 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2020-2020 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 447..2810 FT /product="hAT-31_HM_1p" FT /translation="MSKRKTKYNKLWESEFNWIQVCSGDHYSASCKLCQKT FT FSISGSGAGQVRSHAISKLHTSRFKEREGQSEFSKNKDKILELKASKISYT FT VEEEILKAEIIHSLKCVESNYSFASTNGDSKMFEAMFPDSKIAKGYRQSET FT KSKYTIQYGIFPYLKTLLLEDLNDTVFTFKFDESTNQQVNKQYDGYVQYWS FT RKHKCIKIAYCGTVMVDHCTAEKLLEHFLELVEKVKLNIKFMLHIGMDGPN FT VNIKFERLLKSSEKMKNLNKNIISIGTCPLHIVHNAFRAGVNVLNFNIDSF FT AIDVNFFFKHSAARRSDYRQIEDLTEIISHFIQRHSSTRWVTLRRICSQIL FT EQHKNLCEYFIKFLPTTTSFKSTVRDTERYKRITDILQNEISMPYISFIAF FT IANDFEIFLKTFQSIKPRIIHLIYSEMSKLFVALMSKFIKTKLLYVNNNGS FT NSIKSINELLMIDVIDSKNCKPLKLIDIGTKAKSYFSEALEISCGEKKFRQ FT NCLESYQAFTSYLKLKLPWESTILKNAIFLDPLKKGSNESLAAVSNLTLEV FT CKSLEVVLCKVFPSCSSKEEVCDIVRSEWRIYQMELLPECYYKCSEELVSI FT GRNQTSYWEKAFQLAGLPIVSKLLCSFDIDTLINSFETKILNDSGSPKFPN FT ITSLFKVVSSLSHGNSAPENGFSINKHIIQLHGTSLDPETIEALRFVKDTI FT LSFGSILDIPITKSFIQSVKFAHSSYKADLEAKRRLKEKEEKHIKTKETEE FT AKQHVLKNEKEAVLASIQQVCYTKLHENCFGNVICT*" XX SQ Sequence 3826 BP; 1406 A; 511 C; 584 G; 1324 T; 1 other; caggcctccc agagtgtccc aaaatcccaa aatgtcccaa atttttacaa ttttactttt 60 ttgtcataaa atgtcataat ttatggggtt atgtcctaaa atgtcctaaa attttaaaag 120 atttttttca ataaatgttt ttatacaata atatacttat agtaatgttg ttttttataa 180 aaaaaaaaaa aaacattact ataagtatat tattgtaaat tattcatatt gtgtgtacat 240 taaaatgtgc gtatttgaaa ataattatta atctgtcaaa aaacagtgtg tactacaaag 300 tcattattat tcatatctat tgaatgtgta attattatta attgtcaact ttattgttgt 360 ttttatatca gtctataaaa tagaaaaagt tgtaaaaagt ttttagatta ctttgtgaaa 420 ttatagttca actgtttatt tataagatgt ctaagagaaa aacaaaatat aataaattgt 480 gggagtctga gtttaactgg atccaagttt gttctggtga tcactattct gcaagttgca 540 aattgtgcca gaaaaccttt tcaatcagtg gaagtggagc tggacaagta agaagccatg 600 caattagcaa acttcacaca tcaaggttta aagaaagaga aggacaatca gaatttagta 660 aaaataaaga taaaatactt gaattaaaag cttctaagat tagttacacc gttgaggaag 720 agattttaaa agccgaaatc attcatagtt taaaatgtgt tgagtctaac tattcttttg 780 cctcaaccaa tggagatagt aaaatgtttg aagcaatgtt tccagattca aaaattgcta 840 aaggatacag acaaagtgaa actaagtcta aatacaccat tcagtacggc atctttccat 900 atttaaaaac attgttattg gaggatttaa acgatactgt ttttacattt aaatttgacg 960 aatctactaa tcaacaagtt aataaacagt atgatggtta tgtacaatat tggtctagaa 1020 agcacaagtg cattaaaatt gcatactgtg gaactgtcat ggtagaccat tgtactgctg 1080 aaaaattgtt agaacatttc ctcgagcttg tagaaaaagt taagcttaac ataaagttta 1140 tgctccatat tggtatggat ggtcctaatg taaatattaa gtttgagaga cttttaaagt 1200 cgtcagagaa aatgaaaaat ttgaataaaa acattatatc aataggcaca tgcccycttc 1260 atatagttca taatgcattt agagcaggag ttaatgtttt aaatttcaac attgactcat 1320 ttgcaattga tgttaacttt ttctttaaac attctgctgc aagaagatct gactatcgac 1380 aaatagaaga tctaacagaa attatttcac actttattca aagacattct tcaacaagat 1440 gggtgacact cagaagaata tgttctcaga ttttagaaca acacaagaat ctttgtgaat 1500 attttataaa atttcttcca acaactacat cttttaaatc aactgttaga gatactgaaa 1560 gatataagag gattacagat attttgcaaa atgaaatatc gatgccgtat atttctttta 1620 ttgctttcat agcaaatgac tttgaaatat ttttaaaaac atttcaatca attaaaccaa 1680 gaataattca cctgatttac tctgaaatgt caaaactttt tgttgcatta atgtccaagt 1740 ttatcaaaac taagcttctg tatgttaata ataatggcag caattccata aaatctatta 1800 atgagctgct gatgattgat gtgattgact caaaaaattg taaaccacta aagttaattg 1860 atattggcac aaaagccaaa tcttattttt cagaagctct agaaatctca tgtggcgaaa 1920 agaaatttcg gcaaaattgt cttgagtctt atcaagcatt tactagttat ttaaaattaa 1980 aattaccttg ggaatcaact attttaaaaa atgcaatctt tcttgatcct ttaaaaaagg 2040 gtagtaacga gtctctagca gctgtctcaa atctcactct agaagtatgc aaatcactgg 2100 aagtagtctt atgtaaagtt tttccatctt gttcatcaaa ggaagaggtt tgtgatattg 2160 tacgaagcga atggcgaatt taccaaatgg aattgcttcc agagtgctat tataaatgtt 2220 cggaagaatt agtttcaatt gggagaaatc aaacttctta ctgggaaaaa gcctttcagc 2280 ttgcgggtct tcctatagtt tcaaaacttt tatgtagttt tgatatcgat acactaatca 2340 attcctttga gactaaaatt ttgaatgata gtggttcacc taagtttccc aatattacat 2400 ctcttttcaa agttgtttct tcactttcgc atggaaatag tgccccagag aatggatttt 2460 caattaacaa acacattatt cagttacatg gtacttcact tgatccagag acaattgagg 2520 ctttgaggtt tgtaaaagat actatattaa gctttggatc catactcgat attccaataa 2580 caaagtcatt tatacaatca gttaaatttg ctcactcaag ttacaaagct gacctggaag 2640 ctaaacgcag gttgaaagag aaggaagaga agcatataaa aactaaggaa actgaggagg 2700 caaaacaaca tgttttgaaa aatgaaaaag aagctgttct tgcatccatt caacaggtat 2760 gttatactaa acttcatgaa aattgttttg gtaatgtgat ttgtacttaa ttttcatttt 2820 tttaattatt gcatgaagta aaagtttgtt tttttgcaga taaaaaacgg tctatcagtt 2880 gctgacgatt tagtttttga aggaaatagt gatttaaaga agtgtttgct tcaaaaaaac 2940 agcacaagaa atgaattgca gcgagcccag tgtaaaattg aaacagggat gaaaagaaga 3000 caggagcttg ctgatgaaca agacgtttta gaaaaaagga gtaaagaact gtcaatttag 3060 tcaatttaga attatattgt ttatattgca actctaacta ttttgctttt aaactctatt 3120 ttaacttagt tgtattataa ctctaaaaga attattattt tatttagaat tttatttttg 3180 aataaaacag ttttttcaga taattaactt aaaaaaatct attaacattt aatactttgt 3240 ttctattacg tcctttaata tatatcacaa attgtaaaaa tactgtagaa tccttttaag 3300 tcagacttta aagggaaatt gatatatagt ctgacttgta agtatcaaca gatgtatcaa 3360 tataacaaaa ttaacaaaaa tgattactta acaaaaaaat ctgtgaaaat gttttgttat 3420 gtccgagtta tccagggtca gagttaacag gattctattg tatatttaac tttgtttagt 3480 attttagata taagttatta cttgcaggtt ttacatgtaa gagttgtcgt tgtaaagtta 3540 aagatttttt tgtttaaatt atattttgtt tattatctac tttatataaa cgtaaataat 3600 ttccaagggt atcaaagggt aaataatata ttaataataa ttgaatttaa agggttaatt 3660 cccaaactta aatttaaata attaaataca aaaactcatt cctaactttc gattgtccta 3720 aaatgtccta aatttaacac taatttgtcc taaaaagtca taaaaaactt ggaaaaattg 3780 tcccaaaatc ataaaatttt accctatttt aggagtggga ggcctg 3826 // ID I-3B_CQ repbase; DNA; INV; 6358 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE An I non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-3B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6358 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 108-108 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% identity. CC This family is ~94% identical to I-3_CQ, but transposes as a CC different subfamily from I-3_CQ. XX FH Key Location/Qualifiers FT CDS 600..2264 FT /product="I-3B_CQ_1p" FT /translation="MDPTESDHEESDFDSTLEGDEDVEMEDSVPGVPSNTT FT NRDHFLKDEDAGGKGGSSXGSKRTKRKRPKSDPNPVPPQPPIPPLTPDPIP FT PLTPDPIPPLTPEPIPPNPDPIPPKTPVPNPDPNPPPPLNPHSQKPIPRPR FT QYQEGSQTEWVVFFRPKQKPLNFVQITKDLQKHYPGVVECTKLNKSKLRVI FT VNSAVQANQIVTDLRFSIEYRVWIPAHKVEIDGVVTDDGLSLVDLSRAVGR FT FKNPKLPAVEVLECRQLGNVTTEGGQKKFIPSASFRVTFAGSALPDYIELY FT KLRLPVRLYVPRVMSCENCQQLGHTKTYCSNKSKCSKCAGPHKDVDCQKQA FT EKCLLCGGEPHKTRQCPKYKEREDKMKRSLRERSKKSFAEILSQVTHPNRF FT APLADEDSGDEXSDVEVLFQRDEESDSSGPNPKRNKASKSRKAAGGKGKES FT DSLNFEEDFPFGPSGKPAPKPAPKPAPIPLKPLKPVPKLPKIPLKXVPQPN FT PITDAFKGVLXFSTIVEWLCSFVSEPTRLIIKRFEPLARHIGKQLASTMPL FT LSFISFDG" FT CDS 2260..5958 FT /product="I-3B_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MGNTPKTKNGISFLQWNCRSLTPKLNDFQQFAFERDC FT DVFALSETFLIGDETPAFRGYNIITESRAGPNRGGGVLIGVRKCHTFRKVT FT LPKLGGIEAVAVQARIRGLDICIVSVYVPPQVSLTRQKLWGMLELLQAPRL FT VLGDFNLHSTEWGSHKTDPQAYLIHELCDDFQMTLLNRDKQVTRIATPPTP FT TTSGTKASAIDLSLCSNSLAVLCDWNVLQDPRGSDHLPIAILVSHGPQQST FT TVEMPYDLTRNIDWKQYAEAVSIGVELRXGLAPLEEYAFLANLIYDSAXQA FT QTKPVPGPYIRTRPPLPWWDKECTDLSAARSEAYVDFCKCGIRANFQKYRA FT LDRRYSNTLKRKKRMYWREFVEGLPADTSMSTLWRVARAMRRGSVPNESVE FT KSDKWILDFAKKVCPDSVPTQPSFDVVDGSDXNPPFSMVELSIALLTGNNT FT APGPDKIKFSLLKNLPDVAKQRLLNLFNTFVEQNVIPHEWRQVRVVAIQKP FT GKPASDHNSYRPISLLSCLRKLLEKMILDRLEPWMEREHLLSDTQYGFRRG FT KGTSDCLALLATGIDIARGKKQQMASVFLDIKGAFDSVSIXVLGVKLRRSG FT LNPTMCNFLINLLSEKHMHFVQGDLAITRISYMGLAQGSRLSPSMYNFYVS FT DIDDCLTGDCTLIQYADDAVVSIAAKKEVDLLEPLQDTLNNLAHWAVGKGI FT EFSPEKTELVVFTGKHEPPQLNLSLSGKTIEQSDHFMYMGTIFDQKGTWGK FT HINYLKQKCLQRTNFLRSVSGNRWGAHPSDLLRLYKTTILSVLEYGSFCFQ FT XAAKSRLLVLQRIQYRSLRIVLGCMHSTHNMTLEVLAGVLPLRTRYYELSY FT RFLIRCDYRNKLVIDNFETLLNLHVESRRMLLYYDFMSSWEWSPSTTVQRT FT PPLTSSTLIGFDTSMKADTRGIPNHLLRGVVPSIFASKYNHVPPNNRFFTD FT GSKLDGCTGFGVYHESYQLFFKLKDPSSVYVAELAAVYCALRIIETMPPDH FT YFIFTDSFSSVEAIRSLKPTRESAFFLTEIRKTLNNLAAQSFSITVVWVRA FT HCSIPGNEKADLLAKKGAESGDIFERPISLQECYGFPRQRALLDWQNQWDD FT DTKGRWMYSIRPKVQRKAWFKGMDLTRGFIRTMSRLMSNHYSSKAHLYRIN FT MSDTNLCDCGQGYQDIDHLVWACPDHQAHRIKLKDTLRARGRPPEIPMRDA FT LSQLDLDVLYPIYQFLSDSKISI" XX SQ Sequence 6358 BP; 1556 A; 1680 C; 1640 G; 1470 T; 12 other; cattgctccg gcagaagtct cagcgsgtag tcgtgttttt ttcgcggtgt gtttggcgcg 60 tttttcacgg cgattccgag gcgtgtgcga gcggacaagc ccgttccggt ggtcagagag 120 gccacaatct ggtcgtcgtg gacacggcgg cggtgcagcg gcggagaagg aacgagttcc 180 cgtatccgag cagcagcagc aggaggagtg agcagcagcg gctgcggagg cagctgaggc 240 gagagacggt cgttggccgg tggaatcggc cccggagagt cgactgcggt cggcggacca 300 gttttcggcc aggccaggtc ggagacaacc cccgttgttt acgtggctgg cagcaggcag 360 cccacaacaa caacaacaag cgcgcgcaaa cgagcttccc gcgcatcccg ctgcgggcgg 420 cagccagtgc ggccagttcg gcggtgagtg cgaaaggcga gttttttgtt agcgtttagt 480 tttttttttc tcatagtttt ttttttttcg gttagttttt ttttctttct ccttttttac 540 ttttgcgatt agtttcatca ctttcgtttt cgtttagttt atcgcgtttc tccgcaaaca 600 tggatccgac cgagtcggac catgaggaat ccgacttcga ttcaaccctt gagggtgacg 660 aggacgtgga aatggaggac tctgttccgg gagttccttc aaacaccacc aacmgggacc 720 acttcctgaa ggatgaggat gctggcggta aagggggatc ctcggwcggg tctaagagga 780 cgaagcgaaa gcgtcctaaa tcagatccga atccggttcc ccctcaacca ccgattcctc 840 ccttgactcc agacccgatt cctcccttga ctccagaccc gattcctccc ttgactccag 900 aaccgattcc tcccaatcca gatccaattc ctcctaaaac accggtgccg aatccggatc 960 caaatccccc cccgcccctg aatcctcatt ctcaaaaacc gattcctcgt cctcgtcagt 1020 atcaagaggg atcgcaaacg gaatgggtgg tcttcttccg gcccaaacag aagccgttaa 1080 actttgtaca gatcaccaag gatttgcaga agcactatcc tggggtggtc gaatgtacca 1140 agctgaacaa gagcaagctc cgcgtgatcg tgaactccgc ggtgcaagct aatcagatcg 1200 taactgatct gcggttcagc attgagtacc gtgtttggat tccggcccac aaagtcgaga 1260 tcgacggcgt ggtgaccgat gacggtctgt cgcttgtcga cctctcaagg gcggtgggac 1320 gcttcaagaa ccctaaactt ccagctgtgg aggtactcga gtgccggcaa ctgggcaacg 1380 tcaccaccga gggtggccag aagaagttca ttccttctgc ctcgttccgg gtgacttttg 1440 cagggtcagc tctgccggac tacattgagc tgtacaagct tcggcttcca gttcgattgt 1500 acgttccgcg agtgatgagc tgcgaaaact gccagcagtt gggacacacc aagacctact 1560 gcagcaacaa gagcaagtgc tcaaagtgcg ccggccccca caaggacgtg gattgccaaa 1620 agcaggctga aaaatgcctg ctttgtggcg gggaaccgca caaaactcgg cagtgcccaa 1680 agtacaagga gcgcgaggac aagatgaagc gatccttgag ggaacgctcc aagaagtcct 1740 tcgcggaaat ccttagtcag gtgacccacc ccaatcgctt cgctcctttg gcggacgaag 1800 attcggggga cgaggawtcc gatgtggagg tcctcttcca gagagacgag gaaagtgact 1860 cttccgggcc aaacccgaag cgcaacaagg cttccaagtc cagaaaagcc gctggcggca 1920 agggtaagga aagtgactcg ttgaacttcg aggaggattt tcctttcggt ccgtcgggta 1980 agcccgctcc gaagccggca ccgaagccgg caccgattcc tctcaagccg ctgaagcccg 2040 ttcccaagct gccaaagatc cccctaaaam cggttcctca accgaatccg atcaccgatg 2100 ccttcaaagg agtccttmct ttttccacaa tcgtggaatg gctgtgctct tttgtgagtg 2160 aaccgacgcg tttaatcata aagcggtttg agcctctcgc tagacacatc gggaaacagc 2220 ttgctagcac gatgcccctc ttgtcgttca tttcctttga tgggtaatac accaaaaacg 2280 aaaaacggca tctcctttct acaatggaat tgtagaagtt taacccccaa gttaaacgat 2340 tttcaacaat tcgcattcga acgggactgt gatgtgtttg cgctgagcga aacgtttctt 2400 atcggagatg agacgccagc tttccggggg tacaacatca tcacagagag cagggcagga 2460 ccaaatagag gtggaggcgt actgattggg gtcaggaaat gccacacttt tagaaaggtc 2520 acgctcccga agctaggagg aatcgaagca gtcgcagtac aggccaggat cagaggactg 2580 gacatttgca ttgtgtccgt ctatgttccc ccacaggtgt cgttaactcg acaaaagttg 2640 tggggtatgc ttgagctgtt gcaggcaccg cgactggttc tgggtgactt taatcttcac 2700 agcaccgagt ggggaagtca caagacagac cctcaagcat atctgattca tgaactgtgc 2760 gacgacttcc agatgaccct cctaaacagg gacaagcaag ttacgcggat tgcaacacca 2820 ccaacgccaa ctacgtccgg tacgaaggcg agtgccatcg acttgtcgct ctgttccaac 2880 agtctggcag ttctctgtga ctggaacgtg cttcaagatc ctcgtggcag tgatcatttg 2940 ccgatcgcta ttctggttag ccatggccca cagcaatcga caacggtgga gatgccttac 3000 gatctgacgc gaaatatcga ctggaaacag tacgcggagg cagtctccat tggagttgag 3060 ttgcgagmtg ggttggcacc gcttgaagaa tatgcgtttc tggctaattt gatctatgac 3120 agtgcgwtac aagctcagac gaaacccgtc ccgggaccat acatccgaac gcgccctcca 3180 cttccctggt gggacaagga gtgtacggat ctctcggcgg ccaggtctga agcttatgtg 3240 gacttctgca agtgtggaat ccgagcgaac ttccaaaagt acagagctct tgatcgcagg 3300 tacagtaata ctctgaagcg gaagaagcga atgtactggc gggagttcgt tgaaggacta 3360 ccagcagata cgtccatgag cactctgtgg cgcgttgcga gggccatgcg tagaggttcc 3420 gttccgaacg agagtgtgga aaaatcggac aagtggatat tggatttcgc caagaaggtg 3480 tgcccggact cggtgccgac acaaccatca ttcgacgttg ttgatgggag cgatkctaat 3540 cctcccttct cgatggtaga gctctccata gcactattga cgggcaacaa cactgctcct 3600 ggtccggaca agatcaagtt cagcttgctg aagaatcttc cggacgtcgc caagcagcgt 3660 ctgttgaatc tgttcaacac gttcgtggag cagaacgtaa ttccacacga atggcgacag 3720 gtccgggtgg ttgccattca aaagcccggt aagccggcgt ccgaccacaa ctcgtatcgt 3780 ccaatcagct tgctgtcgtg tctacgcaag ttgctggaga agatgatcct cgaccgcctc 3840 gaaccatgga tggaacgaga gcacttgctg tcagacacgc agtatggctt ccggagaggc 3900 aagggaacaa gcgactgtct agccttgctg gccacaggaa tcgacatagc ccgtggtaaa 3960 aaacagcaaa tggcttctgt cttcctagac atcaagggtg cattcgattc agtctccatc 4020 gakgtgttgg gagtaaagct gcggcggagc ggtctcaatc cgacgatgtg caacttcctc 4080 atcaacctgt tgtcagaaaa acacatgcac tttgtgcaag gtgatctggc aatcacccgg 4140 ataagttaca tgggtttggc acaaggctca cgccttagtc catccatgta taacttctac 4200 gtcagcgata tcgatgattg cttgaccgga gactgcaccc ttatacagta cgcagatgac 4260 gcagtggttt caatcgctgc caagaaggaa gtcgatctgc tagaaccctt gcaagatacc 4320 ctgaacaact tggcccattg ggcagttgga aagggtattg aattctctcc ggagaagacg 4380 gaactggtcg tttttacggg gaagcatgaa ccgccgcagc tcaatctttc tctctcggga 4440 aagactatcg agcagtcgga ccacttcatg tacatgggta ccatctttga tcagaaggga 4500 acatggggga agcacattaa ctacctgaag caaaagtgcc tgcaaagaac taattttctg 4560 cgcagcgtct ctggcaaccg gtggggtgct catccttctg acctgcttcg actgtacaag 4620 acaacgatac tctcggtgct ggaatatggc agtttctgct tccaawcagc tgcgaaatca 4680 cgcttgttgg ttctccagcg gatacagtac cgaagtcttc gcattgtctt gggwtgcatg 4740 cactcaactc acaacatgac cctcgaggtc ttggcgggag tgttgcctct gcgaactcgt 4800 tactacgaac tgtcttaccg gttcctgatc cggtgtgatt acaggaataa actggtaatt 4860 gacaactttg aaacgttgct gaaccttcac gttgagtctc ggcgcatgct tctatattat 4920 gactttatgt catcgtggga gtggagcccg agcacaacgg tacagcgcac gcctccgtta 4980 accagcagca ccctgattgg cttcgacacg tccatgaaag ccgatactcg cggtatccca 5040 aaccatcttc tgcggggagt tgtaccgtcg atcttcgcat caaagtacaa ccatgttcct 5100 ccaaacaaca gattcttcac cgatggctcc aagctcgacg gctgtacggg cttcggtgtt 5160 tatcatgaat cttatcagct gttctttaag ctgaaggacc cgagttcggt ttacgtcgca 5220 gagttagccg cggtttactg cgcactgcga atcatcgaga ccatgccacc tgaccactac 5280 ttcatcttca ccgatagctt tagctctgtt gaggctatcc ggtctctgaa gccgaccagg 5340 gagtctgcgt ttttcctcac ggaaatacgc aagactttaa acaacctggc ggctcagtcc 5400 ttcagcatca cggtggtgtg ggtccgcgct cattgctcga ttccgggtaa tgagaaagcg 5460 gacttgctcg ccaagaaggg tgctgagagt ggagacattt ttgaaaggcc aattagccta 5520 caagaatgct acggttttcc gaggcagcgt gcgcttctgg attggcaaaa tcaatgggac 5580 gacgacacca aaggacgttg gatgtattcc atacgaccta aggtgcaaag aaaagcctgg 5640 ttcaagggaa tggacttgac gcgtggattc atcagaacga tgtcccgcct tatgtcgaac 5700 cattactcgt ccaaggctca tctgtaccgt atcaacatga gtgatacgaa cctttgcgac 5760 tgtgggcagg gttatcagga catcgaccat ctcgtatggg cgtgtccaga tcaccaggct 5820 cacagaatta agttgaaaga taccctcagg gcccgaggaa gaccaccaga aatcccgatg 5880 cgagacgcgc tatctcaact agaccttgat gttctctatc ctatctatca gttcctctca 5940 gattccaaaa tatctattta gttttcttcc agttagtttc ccttcagtta gttttcctct 6000 agttagtata actcgccttc gcacaaagcc gtcgtttctg gttgcaccgg aagcaaagca 6060 agatcgcccc agcagctgga caccgagttg cggctgagga caaaacgcaa aggccagcag 6120 agccaaccaa gacccctaca caatccccct tcccatccca cgccttgtct ttaacatcaa 6180 tcccttccct actaaccccg agtaggccgc gggtaatcgg ctcccctccc actaacattt 6240 acacacaaga tcctctgtaa ttattaagtc aaatgtaatt acaaaagccg actcggtcct 6300 aaccaggtcc cagtaccgaa aaggacctaa taaaaataat tttatgaaaa aaaaaaaa 6358 // ID TELSAT_PG repbase; DNA; INV; 430 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Palorus genalis satellite repeat region - consensus sequence. XX KW SAT; Satellite; Simple Repeat; TELSAT_PG; Pericentromeric repeat; KW satellite repeat. XX OS Palorus genalis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Palorus. XX RN [1] RP 1-430 RA Mestrovic N., Plohl M., Mravinac B. and Ugarkovic D.; RT "Evolution of satellite DNAs from the genus Palorus--experimental RT evidence for the "library" hypothesis."; RL Mol. Biol. Evol 15(8), 1062-1068 (1998). XX RN [2] RP 1-430 RA Gentles A. and Jurka J.; RT "Palorus genalis satellite repeat region - consensus sequence."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [2] (Consensus) XX SQ Sequence 430 BP; 167 A; 52 C; 59 G; 152 T; 0 other; aaaaaacgtt aaattcaaat gatttaagac gtttggctaa gaaattagaa taatattcac 60 gttagtttaa taaaaacaaa taaaaaacat cgactttcgc atgattttag cattatttta 120 gtcaatttat cactaaatcg ctctaaaaac gttaaattag catgatttaa gacgtttggc 180 taagaaatta gaataatact cgcgttaatt tactaaaaaa agctaaataa cgtcgaattt 240 tgcttgattt tagcacaaat ttagtcaatt taacgctaaa ttgctttaag aaaattaaat 300 ttgcttgttt taagacgttt ggctaagaaa ttagaataat actggcgtta gtttaataaa 360 agcagctaaa aaacattgac tattgcttga ttttagcatt attttggtca atttatcgat 420 aaatcgcttt 430 // ID PiggyBac-5_HM repbase; DNA; INV; 2449 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE PiggyBac-type family: consensus. XX KW piggyBac; DNA transposon; Transposable Element; PiggyBac-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2449 RA Bao W. and Jurka J.; RT "PiggyBac families from Hydra magnipapillata."; RL Repbase Reports 9(2), 454-454 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 469..2055 FT /product="PiggyBac-5_HM_1p" FT /translation="MNRNKKITVSDVLXYVFDGXSDFNDTDXSDNEFYNEN FT DXANTAIEEDSDNEPLANIASRLSKNLKETTNNPNKNKPXTYKWKSKNFEP FT LDDVTFKEPEMKPTPNVETTPFEYFKLFVSDNMLNSITNESNIYILQSKGS FT EKNISKKDIEKFIGAYLRMGLVKLPSQRSYWETFMTYNGVSSIMGRNKFET FT ILRNIHFVNNLEISKEEKADDRVWKLRTWITELRNNFQKVSPEEFHAVDEI FT MVPFKGKSLLRQYLPKKPHKWGFKLWGRSGISGFLYDFDIYQGKSKNNSDT FT SLGVSADVVIXMTSSLPDKHNFKVFADNYFTSLPLIVELRKRNIYYVGTIR FT DKRMKKCPLLLEKDLKKQGRGAYDFRVEQESNVVAVRWYDNKPVNLVSSYV FT SIEPLHTVRRYDRSLKKKIDVKQPNIVHVYNQYMGGIDKLDMMCALYKPRL FT RTRRWYIYIWFHTIQIAVVNAWFLYRRDLKICRPDTKFMQLQCFIAEVAES FT LIKVTSSRGRPSLDNITQPPKKLCSGSRKSFQ*" XX SQ Sequence 2449 BP; 833 A; 342 C; 431 G; 820 T; 23 other; cccttaatga ccccaggtat cagatctgat acacacattt taaaccacag tattttttag 60 acaaataact atttttatct gaaaacaccc gttgttgcta atatgagaaa ctagttttcg 120 gttccgtata aacacttctt ttcatccggg tcaattttca tataataaac attttaaaat 180 agcacatgtt aaaagtgttt tacgtggtgt tcaccaagtg ttgtttctgt atttatttcg 240 cgttggttat atcaatttta atgatttttt ttttctttar tttttatawt aaaggtaaaa 300 gaaaacttta ytaactgttt catgtttttt tttttatcaa atatgcataa atattgatta 360 ctgaatttgt atgtatcaaa tttgatacaa cgggtttttc cagtataaag ccttgtgara 420 agttctttgt ttatstatat ttttaakttt ttatttatgt ttttctagat gaataggaat 480 aaaaaaataa ctgtatcsga tgttttaarc tatgtatttg atggrgakag ygattttaat 540 gacacagatg awagtgataa cgagttttat aatgaaaatg atayagccaa cactgctatc 600 gaagaagata gtgacaatga acctttagca aacatagcat caagactatc taagaatctt 660 aaagagacaa ctaataaccc caataagaat aaaccaakta cttacaaatg gaaaagtaag 720 aactttgaac cccttgatga tgtaactttc aaggagcctg aaatgaaacc aacaccaaat 780 gtggaaacaa caccctttga atatttcaaa ttgtttgtaa gtgataatat gttgaacagc 840 atcacaaatg aaagtaatat ttatattctg caatcaaaag gttcagagaa aaatatctca 900 aaaaaggata ttgagaagtt tataggtgct tatctcagaa tgggtttggt aaagcttcca 960 agtcaaagat catattggga gacatttatg acctacaacg gagtttcttc tattatgggw 1020 agaaataaat ttgaaactat acttcgaaat attcattttg taaacaactt ggaaatatct 1080 aaggaagaaa aagcagatga ccgtgtatgg aagttacgaa catggataac agagctacgg 1140 aataactttc aaaaagtttc accggaagag ttccatgctg ttgatgagat tatggttcct 1200 tttaaaggaa agtctctctt gcgtcagtac ttgccaaaaa agccccayaa gtggggattc 1260 aagctttggg gtcgaagtgg catttcaggc tttttgtatg actttgatat ttatcaaggg 1320 aagtcaaaaa acaattcgga tacaagttta ggtgtcagtg cagatgttgt aattwatatg 1380 acttcttcac ttcctgataa acacaattty aaagtctttg ctgacaacta cttcacaagc 1440 ttgcccctta ttgttgaatt raggaaacgt aayatctatt atgtaggaac aattagagat 1500 aaacgaatga agaaatgccc attattgctt gaaaaggatt tgaaaaagca aggcagagga 1560 gcctatgatt ttcgagttga gcaagaatca aatgtggttg cagttcgctg gtatgataac 1620 aaaccagtta atttagtttc ttcatatgta agcattgaac cacttcacac tgttcggaga 1680 tatgatcgtt cgttaaaaaa aaagattgat gtaaaacagc ccaacattgt tcatgtatat 1740 aaccaataca tggggggcat agataagttg gatatgatgt gtgcgttata caagccaaga 1800 ttaagaacac gaagatggta catttatatt tggtttcata cgattcaaat tgctgttgtt 1860 aatgcatggt ttttgtaccg acgcgattta aagatttgta gacccgatac taaatttatg 1920 caattgcaat gctttattgc agaagttgca gagagtctga tcaaagtaac atcatctcgt 1980 gggagaccat ctttagataa tattacacaa cctcctaaaa aattgtgctc gggttcaagg 2040 aaatccttcc aatgatgtaa gaaaggatgg ttttgatcat atgccaagtt acaatgaaaa 2100 aagacagagg tgtttgtcct gtaagtcagg actttcatat ataagttgta aaaagtgtaa 2160 tgcatggctt tgttttaaga aagaaagaaa ctgttttgaa gtttttcatt cataatctta 2220 ttatgaatgt gctaatatgt ttagcttata tgtactatgt agtatgttta atataaatat 2280 tttgataaat aattacccgt tgtatcatat ctgatacaac tcaaaaaagt gcctagtcat 2340 catttttttg tctcaaattt ttktcwgcaa tcaacttatg ccataagggt tacttgtatt 2400 traaataaga tttttttgag tcttttaaac tgttttgggt atttaaggg 2449 // ID Gypsy1-I_AP repbase; DNA; INV; 4072 BP. XX AC Contig19874; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1AP; KW Gypsy1-I_AP; Gypsy1-LTR_AP. XX NM Gypsy1-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4072 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 437-437 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [3170-3694] - Integrase core CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1271..4069 FT /product="Gypsy1-I_AP_1p" FT /translation="MADVARPIIGADFLHYFGLLIDVKNHRLIDPSDQNVI FT QTISVDYVSAPTLVVSHTHKWTELLQTFPDVTRESPVPVKFKHNVVHELRT FT TGPPLFARPRRLPPERSQVARREFDYMLSKGICRPSSSAWASPLLLVAKKD FT GACRPCGDYRRLNAVTRADRYPLPHLHDFTAHLAGRTLFTKLDLVKAYHQV FT PIAEHDIPKTAVTTPFGLFEFPVMCFGLRNAAQTFQRVVNEMLRGLDYAFA FT YIDDVLIASRNELEHEQHVRAVLKRLQEFGMSINPAKCVFAVTSLTFLGHV FT INKDGCQPNPDRVDTIHRWPLPATKKGLQRFLGSVNFYRRFIPNAAEMQTT FT LYNLAAQVKKKDGPLQWNDSTRSAFDTCRTALADTANLAHPKPNAELRLST FT DASSTSIGAVIEQRNGNDWQPLGFFSRKLTDAETRYSTYDRELLAAYSSTR FT YFIHLIEGRLITLFTDHKPLTFMFTHKSEKYVDRQVRQISFLSQYIHTVKH FT IQGTNNVVPDALSRLETAALHSGPPDPAQWSKAQADDPELQRILANPDKCA FT LRLQARDTPYGPIYADFSTGTTRTYVPAAYRRSVFDALHGLAHGGQKATSR FT LINSRYCWPDMNRQVGRWVKCCVQCQRVKTGKHTVSPIIPFSTAERRFSHI FT HIDLVGPLPPSNGNKYLLTCIDRYTRWPEAWPLENMSAHAVAVTLVNQWFS FT RFGLPDVVTTDQGRQFEADLFRALSETFGIQHVRTSPYHPQANGMVERLHR FT TLKDALTTQESAHWSTKLPLVLLALRSTVKSDIGLAPAELVYGTSLRLPGE FT LFNPAPPTPSPPELIATLRDSMAQLRPTPGTNHNTRRSIFVPKDLSNVSHV FT FLRIDAVKPPLQPRYEGPHAVLERNEKNYKLQCNNRRVWVSVDRLKPAFVL FT RENPSLTDHSYAASTITKKQVRFSLPGGE" XX SQ Sequence 4072 BP; 1028 A; 1257 C; 961 G; 826 T; 0 other; aattggtgac cccgaccatt cgaacacaca gcacgttcgg aagaccaccg cgaacgacat 60 cgaactatcg aaggttccgc cgtgttgatc tcggtctcgt tgcaccaaca cgcgtgttac 120 atttttcttt ttcaattcac gcaatgaacc agtttgacat tttcgctacc gacggcgagg 180 ttccgaacga aaatcgcgtc gacccgaacg ctgcgtccgc tcaacaacaa ccgttacacg 240 ttcccggcct gataccgcct ccacaaaata tttccgcgat cgaacacgtg cgtcttcccg 300 gattttggcg acactcacca cagcaatggt ttacgcacgc cgaagctgtc ttccatagca 360 accggataaa agcagatctg accaagctga accacgtgct cacggcatta gacgaggacg 420 gaatccgaac ggtgtcggac ctgctaggcc cgaacgcgca gtattcggca gtacgggatc 480 gtctcatttc ggcctacgcc gttccacaag cgactcgctt ccgcaccatt gttcaacacg 540 ggggcatggg tgaccgccgc ccgtcgcaaa tgctccgaga catgcgaagt gtcctacccg 600 aaggaatcgg cgacgccgcg cttaaagagt tctggatgca aaaactacct cccaccattc 660 tcaccgttat ctccggactt gacggacctt tggacacctt agccgagcgt gccgatcgcg 720 tgatggacgc gagcgccgga cgcgaaatct ccgtcgtttc ctcgcccaat acactcgaaa 780 atcgattcca aacgatggaa agtgcaatta ccgcgttaac ctcacaaata gccgccctcg 840 tcacgtcgca gtcgaggcaa gaacagcaat cacggcaaag aatttctacg cgttcacggt 900 ctcgaaatcg ctcacaatcg cgccatcgaa acaaggactt ttgttactac cacaaccgtt 960 tcgggaaaga ggcaaaaaat tgccacgacc cgtgcagctt ccgttcggaa aactagtgga 1020 agcggcggag gccccgctga cttccgccgc ccgcgcaaca cgccgcgtgt tcgtcagcga 1080 tacgcgtacc ggccagcagt ttctcgtcga ttccggtgcc gaaatatccg tgctaccacc 1140 caccgcaggc gtacgcctgt cttccgacat agttctgacg gccgccaacg gtacacggat 1200 aaaaacgtat ggtcctaaaa cgctacgtct gtgtttgggt gtcaaacgaa cgtacgcctg 1260 gacgttcgag atggcagatg tcgcccgccc gatcatcggt gccgattttt tgcattattt 1320 tggattgctc attgacgtca aaaatcatcg cctcatcgac ccgtcagacc agaacgtaat 1380 tcaaactatc tccgtagatt acgtcagcgc accgacactc gtcgtgtcac acacacacaa 1440 gtggaccgag ctattgcaga cttttcccga cgtcactcgc gaatcaccgg tacccgtcaa 1500 attcaaacat aacgtggttc acgagctacg aaccaccgga ccgccattat tcgcacgccc 1560 gcgacgccta ccacccgagc gcagtcaggt cgcacgccgt gagtttgact atatgctcag 1620 caaaggtatt tgtcgcccat catctagtgc ctgggccagt ccacttctgc tggtagcaaa 1680 aaaggacggc gcatgccgac cttgtggcga ctaccgacgt ttgaatgccg taacccgcgc 1740 cgaccggtac ccgttgccac atcttcatga ctttacggca cacttggcag gtcgaacgtt 1800 gtttactaag ctagacctgg tgaaagcata ccaccaagta cccatagccg agcatgacat 1860 acccaagacc gcagttacca cgccctttgg attattcgag tttccggtga tgtgcttcgg 1920 cctgcgaaac gcggcacaga cttttcaacg cgttgtcaat gaaatgctgc gcggtcttga 1980 ctacgccttc gcatatatcg acgatgtcct catcgcgtca cgtaacgaac tcgaacacga 2040 gcaacacgtc cgcgcagtgt tgaagcggct ccaagagttc ggcatgtcga ttaaccccgc 2100 taagtgcgtt tttgccgtca cctcgctaac gttcctgggt catgtaatta acaaggacgg 2160 ttgtcagccc aacccagacc gtgtcgacac tattcaccga tggccacttc cggcgaccaa 2220 aaaaggccta caacggttcc taggttcggt aaatttttat cgtagattta tcccaaacgc 2280 cgcggaaatg cagaccacgc tgtacaactt agcagcacaa gtcaagaaaa aagacggacc 2340 gttacagtgg aacgacagca cccgctctgc attcgatacc tgccgcacag cccttgccga 2400 cacagcaaac ctagctcacc ctaaaccaaa cgcagaacta cgcctcagca cagatgcgtc 2460 cagcacatcg atcggcgccg tgatcgaaca aagaaacggt aacgactggc aaccactagg 2520 tttcttctca aggaagctca ccgacgcaga aacgcggtac agtacgtacg atcgcgagtt 2580 actcgcggca tatagcagca cccgctactt catccacctc atcgaaggcc gcctaatcac 2640 tttattcacg gatcacaaac ccctcacgtt catgttcacc cacaaaagcg agaagtacgt 2700 agaccggcag gtgcgccaaa tatcatttct gtcgcagtac attcacaccg ttaaacacat 2760 tcaaggcacc aacaacgtcg tacccgacgc attgtcgcga cttgaaacag ccgcattgca 2820 cagcggaccg cccgatccag cacaatggtc aaaagctcaa gcggacgatc cggagctaca 2880 acgaattttg gcaaatcctg acaaatgtgc gttacggttg caggcgcgag acacaccgta 2940 tggaccgatt tacgccgatt tttcgaccgg cactacgcgc acgtacgtac cggccgccta 3000 ccgccgcagt gtttttgacg ctttacacgg cctcgcgcat ggtggtcaaa aagcaacgtc 3060 gcggctgatc aactcacgtt attgttggcc ggacatgaat cgtcaagttg gccgatgggt 3120 aaagtgttgt gtacagtgtc aacgagtcaa gacgggcaaa cacacagtat cgccgattat 3180 accgttttcg acggcggaac gccgattcag tcatatacac attgacctcg taggaccact 3240 tcccccatca aacggaaaca agtacctgct cacatgcatc gatcggtaca cacgttggcc 3300 ggaagcttgg ccgcttgaaa atatgtccgc ccacgccgtc gccgtcacct tagtgaacca 3360 gtggttttcg cgtttcggtt tacccgacgt tgtcacgacg gatcaaggac gacaattcga 3420 ggccgatcta ttccgggcgc tgtcagaaac gttcgggata cagcacgtgc gtacctcccc 3480 gtaccacccc caggcaaatg gtatggtcga gcgactgcac agaacgctca aagacgcgct 3540 caccacacaa gagtccgctc actggagcac caagctgccg ttagttttgc tcgccttgcg 3600 cagtacggtc aagtcggaca tcggactagc cccagcggaa ttggtttacg gcacttcact 3660 acggttgccc ggcgagttgt tcaaccccgc accaccaacg ccgagcccgc cggaacttat 3720 cgcaacgttg cgcgacagca tggcacaatt acgacctaca cctggtacga atcacaatac 3780 gagacggtca atattcgtgc caaaggacct gtccaacgtc agccatgtgt tccttcgaat 3840 agacgcggtc aaaccacctc tgcaaccgcg ctatgaggga ccacacgcag ttctagagcg 3900 aaacgaaaaa aactacaagc tgcagtgcaa caaccgccga gtatgggtgt ccgtcgacag 3960 gctcaagcca gcattcgtgc tccgcgagaa cccgtcgctg accgaccact cttacgccgc 4020 cagcacaatc acaaaaaaac aagtccgttt ttcactcccc gggggggagt ag 4072 // ID EnSpm-16_HM repbase; DNA; INV; 6763 BP. XX AC . XX DT 20-JAN-2009 (Rel. 14.02, Created) DT 20-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6763 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 387-387 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1865..4228 FT /product="EnSpm-16_HM_1p" FT /translation="MIIIFRVLIYDNYISESDPSFQRVTYEPTFLEMLNSV FT PIYSRESSGDSPGTSKSSTQSSFILQPQIMNVPATVQSPLSLEIGVKVCCC FT GFMSENNARMDRHLGISSICKIVCKICNASFKSTRGVSRHICKKNEKAVLP FT FPKIKIDSQTTFAIKLCGQLRVPSTTVNTIFSISANMLLNIRKLIKEEHFH FT TFDKYFSQVDILSNKYQREKLLNDLFDILKINEVLFAGGKGYNFLLFDIIN FT FLLQIDEFVQLLSRKDAEANYRTDFSTHKPLEFSLSLYCDDIELANPIGKH FT RKLHKLTIFYVQFHSLPSYMRSTLISVFPLAVVPVKLFKSNKKGAYFQYLE FT DFITSCNLLNNGVTFNSTIGCVKLFLKFVIYLGDTLASNHICGFKTSFAPN FT VFRCCRTCLTTNIQMSSIYREDMCMLRTQANHQQHILDLKSVKSKKSKLYW FT SRFYGINYESALLLITAFDIIENAPHDPMHILLEGLVPHELCLLMRLHICE FT KRIYSLAALNSFIIQFPYSYSEARDKPNICPTTFDISGSQTSAQILILARI FT IPLFIGQYSNSNDPYYVNFLDLLQILQLTLSPITTYRTILDLETLIEKHNK FT SFTLLWPNNLIPKHHFLLHFVGQMHRFGPLKNQFCMTFEAKHNKIKSVRWH FT NFKNIAKSVCQYLRWNTISNFLDDNGQLIPYPFTAKTQHLKKRIVIGSIVY FT QVGDIISTIDDSYQLYKITDLQPNCVKVCIMTIDSYIPHKLAFHVRSTNTF FT KSIVLENLQIPWPLTKFTVDGTTLVVPRSKHCLIVS*" XX SQ Sequence 6763 BP; 2278 A; 953 C; 986 G; 2543 T; 3 other; cactgtaaaa aatgtcacgc caaattctta aatgtcccag caaaatgttt aaatttcacg 60 ccacttttaa atctttgcac agaaatgatc taagtcgcgc gacattaaag tttacacgca 120 attataaatg tcacggagag tataattgtt taaatgtcac gccttactaa taaatttcgc 180 gccacgcgct aaatgtcgcg tgactaataa aacatatgag tgtccgcagc gaaaaaaaga 240 aatatcacat taattaattt ttcgtggttc attctgtgac taaaatgtct aaaataatta 300 aaatttccta tttattaaag tttatgccag tggttttggc tgtagatgat aatttgttca 360 cgcttaaatc taaaggtaaa aatttctttg caacgagttt tactttttat tacattacca 420 ttaatgttta aaatattagc taaaaatgca ctctacataa attgtcataa cttatgagag 480 taaagtaagt tgataattaa tctgagataa gtattttcaa tgatatatta ggaataccta 540 ttctttattt ataattctta ttcttaatgc atcattgagt atttcaagct cgttttacaa 600 tgttatacac tatgataacg attatcttta cacaatgcca ttgtgtacat tgctattatt 660 agcaattgga ctgccaatcg attatttaca attattattg tttaaaaaaa actttattag 720 gctatttatg tcagagtttc tgtgtgacct ggccacacta gctttttttt tttaaataga 780 acttttaata atggtggttc gagaaacaaa caaatacgga attaatagta tataataatc 840 aatgtgtata cacattgatt gttatgtcat atttaaattt tatttatcta ttggtataat 900 tttgatagca tctaagtcct gctttttggt tgtatttttg gctaatcaaa tcgaaagttg 960 tttaagtaat tttttcgtat ttcgatcaaa atttctaatc acgacgagga tttcaacaac 1020 aacagaaaga agttttttgc tgcactaaaa tatttacttt tattgcacta aaagtaaatg 1080 gtttattgca ataaaacctg aaaagttatc ttttgattta tattattcta gaatttgttc 1140 tgataatttt attacaggtt tagtttttaa tttttatttt ggtttttatt tttgtttaag 1200 ataatccaaa ctactaattg tatcacaaat tgtataaact aatcaagcca acacacaata 1260 gtaataatta caatagatac gtatgctaat atctttaggt atacatagtt acagttacta 1320 aaatgttttt acaattttaa gataacaatt ttactttcag tatatatttt caaaatttgt 1380 gacaatagta ttatcaaagt tatttattaa ttttacttat ttaaagtaaa attataaaat 1440 aagcttaaaa tacaatttat aaaataatat ctattagata atctaattgc tatttttata 1500 ctattattgt attctatgcc taaagatatt gctatctaat gttgtttagt taaacatagt 1560 ttgtaataaa attgtcatag ttttacccaa tttatgcata aaatatatga aaataatgtt 1620 ttgttgtagt gtcctcaaag atcaatgtat tgccagatga ctttgatttg atgttttctg 1680 acatggtctt agaagatgag atgatagttg ctcttcctaa tccatgtgaa atcgttgtca 1740 agattattgg taaaaattta ctataaatta gtctaaaatc attatgtccg gttttttctt 1800 attattttaa aagtttgttc taaactaata atatttttct tgatgcttat cttatgttta 1860 atatatgata attatatttc gtgttttaat atatgataat tatatttcag aatcagaccc 1920 atcattccaa agagtaacat atgaaccgac atttcttgaa atgttgaact ctgttccaat 1980 ttatagccga gaaagttcag gggattctcc tggtacaagt aaaagctcga cacaatcatc 2040 gttcattttg cagccacaaa ttatgaatgt cccagctact gtccagtctc cactytctct 2100 agagattgga gttaaggtat gttgctgtgg atttatgtct gaaaataatg cacgtatgga 2160 cagacattta ggtataagtt caatatgtaa aattgtttgt aaaatttgta atgcctcatt 2220 taaatcaaca agaggagtat caaggcacat ttgcaaaaaa aatgaaaaag cagttttacc 2280 ctttccaaag ataaaaatag actcacaaac aacatttgca attaaactct gtggtcagtt 2340 gagagtgcct tcaacaactg tcaatacaat atttagtatt tccgccaata tgttacttaa 2400 tattcggaag ttaataaaag aagagcattt tcatactttt gacaagtatt tttcacaagt 2460 ggatatcctt agtaataaat atcaaagaga aaaattgtta aatgatttgt ttgacatttt 2520 gaaaataaat gaagttttgt ttgctggtgg caaaggttat aattttttac tatttgacat 2580 tattaatttc cttttgcaaa ttgatgagtt tgtacaactt ttaagtagga aagatgctga 2640 agcaaattat cgtacagatt tctctaccca caagccatta gaattttctc ttagcttata 2700 ttgtgacgat atagaattag ctaaccctat tggcaaacac agaaaacttc ataaactaac 2760 tattttctat gttcaatttc attccttgcc atcatatatg cgttctactt taatatcagt 2820 ttttccactt gctgttgtgc ctgtaaagtt atttaagtct aataaaaaag gggcttattt 2880 tcagtactta gaagatttta ttacttcctg taatcttttg aataatggtg ttacttttaa 2940 ttcaacgatt ggttgtgtaa aactattttt aaaatttgta atatatttgg gagatacatt 3000 agcttcaaat cacatttgtg gatttaaaac cagctttgca ccaaatgtat ttagatgctg 3060 tcgtacatgt ctcaccacaa atatacaaat gtcatcaatt tacagagaag atatgtgtat 3120 gttacgtact caagcaaacc accagcaaca tatcttagat ctcaagagtg tgaaaagtaa 3180 aaaatctaaa ctctattggt caagatttta tggcataaat tatgagagtg cacttttact 3240 aatcaccgct tttgatatta ttgaaaacgc accacacgat cccatgcaca ttttgcttga 3300 aggactagtg ccccatgaac tatgtcttct gatgcgattg catatttgtg aaaaacgtat 3360 atactcatta gctgcactta atagttttat tatccagttt ccttattcat actctgaagc 3420 cagagataag cccaatattt gtccaactac ctttgatatt tcagggtctc aaacatctgc 3480 tcaaattttg attcttgctc gtatcattcc actttttatt ggacaataca gcaattccaa 3540 tgatccttat tatgttaatt ttcttgatct tttacaaatt cttcaactta ctttgtcacc 3600 aataactacg tatagaacta ttttggattt agaaactctt attgaaaagc ataacaaatc 3660 atttacttta ttgtggccaa ataatctgat accaaaacat cattttttac ttcattttgt 3720 tggacaaatg catcgatttg gtccactaaa aaatcaattt tgtatgactt ttgaagcaaa 3780 gcacaataaa ataaaaagtg tacgatggca taactttaaa aacattgcca aatcagtttg 3840 tcaatatctt cgatggaaca caatttcaaa cttccttgat gataatggtc aactgatacc 3900 ttatcctttt acggcaaaga ctcaacattt aaaaaaaaga attgtaattg gttcaattgt 3960 ataccaagtt ggtgatatta tttcaacaat tgatgattct tatcaacttt ataagatcac 4020 tgaccttcaa cctaattgcg taaaagtttg tatcatgaca attgacagct atattcctca 4080 caaactggct tttcatgtta gaagtactaa tacttttaaa tcaattgttt tagaaaatct 4140 ccaaattccg tggccattga caaaatttac tgtagatggc acaacattag ttgtgcctag 4200 gtcaaaacat tgtctcatag tttcttaata aatattttga atatgaatat atattgtttt 4260 taattttttg ctgttttttt agagaagttt gtgtttcatt catttatata taagcgtatt 4320 aatgcataca gaatatatac ttgctaaagt gttatctaag caataaaatg tttatatata 4380 tacatattat tatatataat tttttaatac tagctaccta tcttttcaat ggagtgcagc 4440 aaatatttaa atggtttggt gccttgccgt aagcatactt taaaagatgt ttgtagtgaa 4500 ttggcgcatc accttgaaga gtgtcttcca gagggaaacc cttcaataag aaaaaggttc 4560 tataatcgtg taactgtcct tttgaaacag tcatacccaa cctgtacttt tggagaagag 4620 ggtgcagtaa gtttgaactt taatgtttgt aaattttcca taaaacatca gtgaaatatt 4680 acccaaaaat aaattatata cccaaatata aaaataaatt ataaaaacca ataaatagtt 4740 cacaacatca tttttaaaag atatttaatt aagcagaaca taaggtatgg ataattactc 4800 caaataaggg taaaattaaa ttcaaataga tatagattca gtttttttta atcttcagtt 4860 ttatgcatgt ttcgtgtaac acatccatgt gttacacaaa atatgccact tttttaattc 4920 atcagatatt atttcaatga taaattttct tttttagggt ggagttgtaa agcgcgtttc 4980 taatgtaatt cggaagcgac gtgagcgtcg tttaaaaagt ttgcttgagg taaaaaatgc 5040 aaacagaatg ctaaacatct ctggaataaa cgacagctca tttgatgaag atatacaggt 5100 tatacagatg agcccttttt ttaaaaatgt ttttaatcag tatattctaa ataaaaatta 5160 ttattaccta atatatacag gaagaattta ataaggttta ctttagagtt attgcaattt 5220 ctttgtatgt tagtttgctt tggtacaaaa agttcctgat atggatcata ttcgttgttt 5280 ggttaggaat atgaagtgtt ctctgccata ttcagaacac aaagagcacc tgttccatcc 5340 agaaattgta agcaaataat acttgtaata ttgatatata tatatatata tatatatata 5400 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 5460 tatatatata tgtgcggtcc tttatatgtg atattatcat ttaactttta taagtaataa 5520 gttttttgtt atttatttaa taaattttta ttttgcgcat ttttaacaaa aagttattcc 5580 aaatgggtta aaaatttata gatcatggac gaatttgaaa gaatgggcat tgtttccaaa 5640 gaagctgcat tagagaggct attgcaagtt gttgaaaagt tgtatccagg ttcagatatt 5700 ccgtctgccc taatacgatt tcataagaac actaccagaa aaggagtgtt gtgtccaatt 5760 gttattgcta gtgtaagttt aacatatagt tgtttctttt tggatttttt aaattgttat 5820 ttatttttgt aatgttatgt tattaaaata gatttaaatt tttatttttt aacaaatttt 5880 tagtctgaag ataacgtctc ttctcctggg ccscaactca ttttaggaga gaaaataaca 5940 tttgttgata aagggaaagt aattgcgatt ggtcatcggc ataatgattc cctacacgtg 6000 cttcaattaa ttgcttatta ttatattatg gacttgtcgt acccaagatc gtttggggga 6060 gttttggggc tgttgcagca ttacgtcttg caaataccat ttgcttgtac aaataagctt 6120 tggttaaaaa aatgccgagt cttgtaaaag gttttgttgt attttgcaac tgttcaactc 6180 ttcgaataaa aacattaaga caatttaaaa ttaaatattt atatttatgt ttgatgatgg 6240 ttacttttta tacaaataaa atataatttt cattatttct ttggtctttt tatttattta 6300 atttaagtaa agtgtgccat ttattaaatt agtcacgcga tttttagaga tttcgtatgg 6360 cgtgacattt atatttttta tttgtcacgg gacatctaaa acactatcgt gtcgcatgtc 6420 accgagcccg atttaacgcg ccacatttaa attttttaaa tgtggcgtga ataatcgggc 6480 ttggtgacat gcgaaacgat agtgttttag atgtcccgtg acaaataaga aattaaattg 6540 tcacgccata cttmttttta aaaagtcgcg tgacaaattt aattataaat gtcacgccat 6600 atttaattgc taaaagtcgc gtgactgatt taattatata tgtcgcgcca aacttaatat 6660 ctaaaagtcg cgtgacagtt ttaactctaa atgtcgcggg ttaaatcggg cttcgtgaca 6720 tggcaatttt attagacaat ttggcgtgac attttttaca gtg 6763 // ID BEL-1_AA-LTR repbase; DNA; INV; 572 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_AA_; KW BEL-1_AA-I; BEL-1_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-572 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 852-852 (2011). XX DR [2] (Consensus) XX SQ Sequence 572 BP; 144 A; 155 C; 144 G; 129 T; 0 other; tgttcacgcc caaacgtgaa cgacgactga cagccgccag tcagcgagag cgaataccat 60 gttgcgcaat cattcgacgt tagctccacc catctggcat gcaacgacac gacgggacga 120 caatcggacg gaagaagaag atgaaacaac aacgaccggc agagtataaa accagcgcag 180 gtagcgttgc acatatcagt cttttttata atttcaaagt ggtaacatcg aaaataaaat 240 gtgtagataa gtgaagtggc tagttttctt tcggcgtctt ccttgaggag cagaccaccc 300 gacgaggaga gttccctgcg gtgtttacag tccaccgacg atcgagttgt tggcttcgac 360 ggacttgtgg aacccaatcc gcccttttga gggcaagaag agttgtttgt ttcttccccc 420 cacaagctcc cgagaagaac actcgccctt tcccggcttc ctgcttgagc ggcaccctct 480 cctttcctga cgtgtcagga cgtgtttcgt catgtcgcaa gagagcatgg aagtccatcg 540 tgcgtttccc ggagcctagt acggccccga ca 572 // ID Academ-1_Hrobusta repbase; DNA; INV; 2918 BP. XX AC . XX DT 10-MAY-2011 (Rel. 16.05, Created) DT 10-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-1_Hrobusta. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-2918 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2918 BP; 1001 A; 454 C; 472 G; 989 T; 2 other; tagtgcggtc ggttaatgtc ttcacataat ataaatattt cacttccgga ccccaagata 60 tcctttggta tatagattaa ttatgttata taatgttaaa aaaaagcggg acacaagtta 120 tttgaacatt gtaaattgtt tatttcaacc gcaaaaaatc accgatttaa gtaactgagt 180 agtaacattc attatcttcg atttgcaaat tgtaaccgtt gtttccatgg ttacgaagtt 240 attgacattt tatattcgcc gaaaaatttt caatttcgca aacaaacttt ttaaattcgt 300 tgacaaattt aattattact taaaaatatc aaaaattaaa atattataaa taaaaatatt 360 gtgaaaattc gagctgaagt aaatgaaatt aacatgaaaa taaagtaaat gtgtgcttca 420 atctgtttac tttactacag acttacattg tcactgctaa gcattttgtc tttaattttt 480 tagtttttta cgagtaaatt gataacaaac tgaacataaa aattaaaaaa aatctgagtt 540 tttatgtaag tcggtaattt ttttaaaaca atttatttat tactaatatt tatttaataa 600 tgataactat tattattgac ttaactatta taaacaatta gcaacaactg taaaaactaa 660 caacattact taccttgtaa taaatattta ataatgatat tattagttaa ctacacgaaa 720 aaaggttgtc acacctaact acttttatgc taatttcatt tgcatagttg cttttcagct 780 cttattttgt taaaannaaa tatacagagc ttccggtaac aggcaatata tcttctgaac 840 cacctcttac tacaattaat gaaacatcaa taagaacaga aacgaatact attcaacaac 900 tgactccgtg gttgaatgct gttgaaatgg cggtatcaaa caacttcaaa tctggcaata 960 tttctttaag tagaaatcaa ccagaaaaca atggtctgcc ttctacagct gtgtgcaata 1020 atacactatt acctctgttg ccggatcaca ttcaatcacc tgccactgtt cgacatttaa 1080 tcaatattat atcatctatt attgataatg tgaattatgg tcaaccagtt gttattactg 1140 cagatcaacc agtatttgcg attgctaagc aacttcagtg gaaataccct gatctttatg 1200 gtgaggataa aatgattata atgatggggg gacttcacat tgagatggct attgaaaata 1260 tgattggaaa gtggttatct ggaagcggtt ggaccgaatt atttttcaag actgaaatcg 1320 caagttctgg acgtagtgaa acgttgctta agtcttcaca tgttaaaaga actagatacg 1380 ctcatgaagt tagtattgca gcactttaca ttctaagaaa caacgcatat agagataacc 1440 aaaaaacggc agtggagtct ctggaaacat gggtggctcg aagatgcaaa gaatctgctc 1500 agtttttatt ttggcagacg accataattt tggagggcat attacttaac tttgttcgta 1560 gcatacgtac ttctgatttc tgtttatttg tccaaatgct cgaagagctt tgtccttggt 1620 tgtttgcttt agatcttatt cattattcac gatggcttcc tgttttcata aaagggctca 1680 aagaactacc agtgcgacac ccgcaagttt atgaagcatt ccagaaaggt catttcacaa 1740 gtagaaaaac taatgctgat ttctccgcaa tttccgatga tcaattgcat gaacaaaata 1800 acaaactaat aaaaggatgt agtggagcta tcaacaattt gcataataat gatgctcttc 1860 ttaaatggat ggtggccggt ccggaaatct cacgaatgat acatgattat gaccagattc 1920 cagtgcacaa tttgtctaat aaagttccaa atcggtgtca tcatgaatgt tcagcaagct 1980 ttcaatctcg atatattagt catgtttcaa aaatggttaa atatatgcag gaagacggca 2040 atccattttc tgaacaactt ttgcaaacta tcgacaacca aaaaatttta atgtctcaat 2100 ctgcagagaa ctccgttaga caagcggaac aaaaaggaat caagcaatat gaaaattttg 2160 taaaagaacg tttaattttc ggtcaaaaat ctttttatga ttcattgtct aaaaacaatt 2220 tggaactatt tcatggccct aaaaagacat gtaacaaatt gtttgttagc attagcacca 2280 gtgtatgatc cagaaaaatg gggatggaag cgactgactg atggaactta tggattgttg 2340 gggactacat ttcctgacgt atctttgcat tgttctgaac ttgtgaaatg ttcctgcaaa 2400 aaagtttgcc gaaattgtaa gtgtagacgc ggcgagttac gatgtacaaa gctttgcgca 2460 tgtaatggag attgcagtga agagccaatc agtcatgatt cgattaactt agatacatat 2520 tcggatgatg aaaatacaga agttgatact aattttctgt ttaatgatga ttgtttgatg 2580 attaaagagt acttacctta gattcatatg aaatgtacat aatttttttt ggtcttacct 2640 tattatttac cgactattac atttttctta ctatagttct cataactttt tccaattttt 2700 tatgtttata aagaagttaa atgttgtgta aaattattgt tcaattaaaa tttttaataa 2760 aatctaatta attttttaaa tagtgaaaaa aaattttttt ttgaacaaaa tgtttgtgtc 2820 ccgttatttt tcagtagcaa atcatgttat ctatcaatat ccaaaacggc ttgctggggt 2880 ccggaagtga aataattttt cactaaccga cctcacta 2918 // ID Gypsy-17_OD-LTR repbase; DNA; INV; 195 BP. XX AC CABV01000966; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_OD_; KW Gypsy-17_OD-I; Gypsy-17_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000966; Positions 1535 1341. XX SQ Sequence 195 BP; 51 A; 54 C; 38 G; 52 T; 0 other; tgttgtgatc gcaacttgtg cgaatcccaa ataaggagat ccgctccgtc gcacctgctc 60 cagcgtttcg taaggcgaaa gccggcactt tctgttcact ctcttgcaac taacagccga 120 taatatacgc ttttcctgaa tctagcttta cgtctaattt attgagcaga gtcaacgcaa 180 tccgcaagca caaca 195 // ID DNA8-51_AP repbase; DNA; INV; 507 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-51_AP. XX NM DNA8-51_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-507 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1981-1981 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 507 BP; 171 A; 66 C; 79 G; 191 T; 0 other; cagtggtctt caaccggtgg gtcgcgaccc atttttgggt cgcgatacgt cttcttttta 60 aaataaaaaa taagtttaaa ttacttagat gaaataaata aaaatacagt ctatataatc 120 tatattctat aatataatat tatagtaata tagtggatat caaatcacag tatatttttc 180 atatactggt atgattctac gtagcaggtg gaatttatta tgtgaaaaca ctcaaggaca 240 aacatcacat taaatttttt gtgtcgtatt tttattttat tatatttttt cgtgtgaatt 300 ctatacattt tcaattaata ataattaaat attctgcctg gtactaaaat gtaattttta 360 tttattcagt tataataggt agtactacta tacaaagcat taagcaattt tagctttttt 420 tttgtttttc tacaaagctt acgtgggtcg cgaaaaatgt gggcctaaaa aagtgggtcg 480 cgagtccaaa aaggttgaag accactg 507 // ID BEL-73_CQ-I repbase; DNA; INV; 5709 BP. XX AC AAWU01004923; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-73_CQ_; KW BEL-73_CQ-LTR; BEL-73_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5709 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 291-291 (2011). XX DR GenBank; AAWU01004923; Positions 14895 20603. XX CC Positions [4737-5327] - Integrase core CC 'TAGTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 336..5681 FT /product="BEL-73_CQ-I_1p" FT /translation="MSTPFRNSKRTPRTPPLSPSPSPTPTLPKTPSTPQVK FT SVAEQKSEKKKKEEKKKMEAQLKALVKQREAVRGKLLRVRAALRDSEEEPN FT PNVRNVHFLQLQKRALENIYGECNEVQNKIYALPLSEDQDTVQTEKYVEFE FT GVFNDVSLKLSTLIDVVPKAEPAAPAAGAPIQQPAQHYLPPLQVPLPKFDG FT SYEKWYAFKALFTTLMNRYRQEEPALKLYHLRDCLVEKAAGIIDDEMINNN FT DYDAAWALLTLRYEDKRIVVDKLVDKLYGLPKISQEDATELRKVIDTCTKN FT VDALRNQELPVAGLGEQMLINLIVGKMEKKLQVAWEAKQKKNVLSTYAALM FT AFLEEQCRISEKIDTKVKPAKESAKPKTAGRSHTLAVSESKNEQKCTVCKA FT AHELWKCEAFKSKSVSEKYEILKKCGACFNCLMKGHRTRACSSGSCRHCGK FT KHHSSLHEDSPSQVTNSAVPTTPASDVSQTVNTPAAGTSTTLCASAASSEK FT QTVLSTAVVLVDGKCSTPIPCRVLLDSASQMNFVTERFANLLSLKLKPADF FT TVSGLNGNKTRISRRLRATIRSRHGNFAAELDFLVTPRITGELPVKSFDAS FT EWPISSELVLADPAFNKRGRVDMLIGAEYFWNLLLDGRYELGPDRPVLRNT FT KLGWIAGGVIASDATVVARTLCQTSEEEPLIELLKSFYKVEACDEMCLEHF FT QRTHERTEEGRYVVRHPFNERKRELGDSREMALRRFLALERKLDKQPELKE FT QYSQFIREYEQLGHMRVIEEAPNEDPGSTFYLPHHCVLRPTSTTTKLRVVF FT DGSAKTSTGVSINDVLRIGPTIQNDLTAILLRFRGFQFVFTLDIPKMFRQV FT RVHPEDTRYQRIFWRYDRNDFLTVRELLTVTYGLGPSPFQATMALKQAAAD FT HEDEFPRAAEVVDKGTYMDDVLTGADTLAEACELQREMTGLLAKACFGAHK FT WCANHSDLLREVPEELRGNSFEVTDDSSKTIVKTLGVTWNPFEDWFSVSVP FT DFDDLEEVTRRKLLSQLAKIFDPLGFFGPVITYAKLILREVGELHIEWDDP FT VPTDIGEKWRSFRIELTALREVRLPRWISWKGALKLELHGYADASDPAYGA FT CIYVKGFFANEETEMRLICSKSRILPKKRKPKEKAISTPRAELLAALLLAR FT MVVKFLSATELKFESVHLWSDSQIVLAWLKKLPQLLQTFVSNRVSEIQKLT FT QNFHWHYVATHENPADLISRGVTPKKLIKSKKWWQGRPRTTVAAAADEIVI FT PDDELPEMRTGVVLAMTVPVERFPMFDKLSSFTGIVRSMAYLVRLARFVKS FT RKTEVVKGRLTAKEMRTATLVIVRLVQREAFQPEILALMDGADTNHRLNGL FT KAFLDPDDGLLRVGGRIKRAFVPYDSRHQMLLPAKHPITEALVRELHVDNL FT HIGQRGLLAVVRQRFWPLNVKSTIRQVIRKCIVCFKANPLKTTQLMGDLPS FT YRVQPAPVFSNTGVDYAGPFWIKSSSATRKPQITKAYVCLFVCLQTRAIHL FT ELVSDLTTDAFLASLRRFASRRGCPKTMHSDTGPNFVGAKTELHELWQMFQ FT NECATKKITSYCVEKGIEWSFIPPRSPHFGGIWEAGVKQVKHHLKRIVGER FT KLSYEELYTTLTQIEAVLNSRPLVPSSDDPSDYTAITPAHFLIGREMQAVP FT EPDYSHLKENRLSRWQLVQTMLQHFWRRWTAEYLPELQNRSKWLKTKVIKE FT GSLVLLVDQNAPPLQWPLGRIVTAHPREDGVTRVVTVRMANGAEFKRAVTE FT VCLLPLDEDED" XX SQ Sequence 5709 BP; 1370 A; 1577 C; 1733 G; 1029 T; 0 other; ttggtcctta cggaaccgga tacgcgggaa accggtggaa tcgtcgtggt tctgtttttc 60 gcgagaaaag tgccgattga aacatccgcg atcggatgtt agtgacgcgg aagtgcgcgt 120 tgctgccgag gaaaaaagag tagcctcctc cggggaacgg cggatgtgac ccgggaagtg 180 cgatcccgac ggaatttgcc agaaggcgtt ccgatttcgt gaagaaaagg tgaaaaagtt 240 acttttccgg aagatttggt cggctttggt tggcgtgggt tggttttgca aatggcggac 300 gggcagcagg aaaaagtgtg tgagtgaaat ggacgatgtc cacgccgttc cggaacagta 360 agagaacacc gcgaacacca ccactgtctc cgtcgccgtc gccgacacca acactaccaa 420 agactccgtc gacgccgcaa gtgaagtctg tggccgaaca aaagtcggaa aagaagaaga 480 aagaagaaaa gaagaagatg gaagcgcaac tcaaggcgct agtgaaacag agagaggcag 540 tgcgtggaaa gctgcttcgt gtgcgtgcgg ccttgagaga cagcgaggag gaaccgaacc 600 cgaacgttcg aaacgttcac ttcctccagc tgcagaagcg ggctttggag aacatttacg 660 gcgagtgcaa tgaggtgcag aacaaaatct acgcactgcc gctctctgaa gaccaagaca 720 cagtgcaaac cgagaagtac gtggagttcg agggtgtctt caacgacgtg tcgctgaagt 780 tgagcacgct gatcgatgtc gtgccgaagg cggaaccagc tgccccagcg gccggagcgc 840 cgatccagca gccagcccag cactacctcc caccgctgca agtgcccttg cccaaattcg 900 atgggtccta cgagaagtgg tacgcgttca aggcgctgtt caccactctg atgaaccgct 960 accgacaaga agagcctgcc ctgaaactct accacctgcg ggactgcctc gtagagaaag 1020 cagctggcat catcgacgac gagatgatca acaacaacga ttacgacgcc gcctgggcgc 1080 tgctcacgtt gcgctacgag gacaagcgga tcgttgtgga caagctcgtg gacaaactct 1140 acggtctacc gaagatcagc caggaggatg ctacggagct gcggaaggtg atcgacacct 1200 gcacgaagaa cgttgatgcc ttgaggaacc aggaactacc ggtggctggg ctcggcgagc 1260 agatgctgat caacctcatc gtgggcaaga tggagaagaa gctgcaggtg gcctgggagg 1320 ccaagcagaa gaagaacgtg ctctcgacct acgctgcgtt gatggcgttc ctggaagagc 1380 agtgtcgaat ctccgagaag atcgacacca aagtgaaacc agcgaaggag agtgcgaagc 1440 cgaagacggc gggaagaagc cacacgctgg cagtgagtga atcgaaaaac gaacaaaagt 1500 gtacagtgtg caaggccgct cacgaactgt ggaagtgcga agcctttaag agcaaaagtg 1560 tcagtgaaaa gtacgagatc ctgaaaaagt gcggcgcctg cttcaattgc ttgatgaaag 1620 gtcatcgaac ccgagcgtgt tcgtccggct cttgtcgcca ctgtggcaag aagcaccaca 1680 gctcgctcca cgaggacagt cccagccagg tcacgaacag cgcagttcca acgacgccag 1740 catccgatgt gtcccagacg gtcaacacac cagcagccgg aacgtcgaca accctgtgcg 1800 cgagcgctgc gagctcggag aagcaaaccg tcctctcgac tgccgtggtt ctggttgacg 1860 gcaagtgcag cactcccatc ccgtgccgtg tgctactcga ctccgcctcg cagatgaatt 1920 ttgtgaccga acgcttcgcg aaccttctct ccctgaaact gaaacccgct gacttcactg 1980 tcagtggtct gaacggcaac aaaacccgga ttagccgtcg actgcgcgcg accatcaggt 2040 ctcgccacgg aaactttgct gctgaactgg acttcctggt gaccccacga atcactggtg 2100 agcttccggt gaaatccttc gacgcctcgg agtggcccat ctcaagcgaa cttgtgctgg 2160 cagacccggc gttcaacaaa cgaggacgcg tcgatatgct gatcggcgcc gaatactttt 2220 ggaacctgct actagacggc cgatacgaac tcggccccga ccgaccagtg ctgaggaaca 2280 ccaagctggg ttggattgcc ggaggtgtga ttgcgagcga tgcgacggtc gttgcgcgca 2340 cactttgcca gacgtctgaa gaagagccgc tcatcgagct gttgaagagc ttctacaagg 2400 tggaggcttg cgacgagatg tgtctggagc acttccagcg cacacacgaa cgaactgagg 2460 aagggcgcta cgtggtacgt caccctttca acgaacgcaa gcgcgagctg ggagactcgc 2520 gtgagatggc gctgcggcgt tttctggctc tggagcggaa actggacaag cagcccgaac 2580 tgaaggagca gtactcgcag ttcatccgag agtacgagca actcggacac atgcgagtga 2640 tcgaagaagc gccgaacgag gacccgggat cgacgttcta cctgccgcac cactgcgtgc 2700 tgaggcccac gagtacgacc accaagctgc gggtcgtgtt cgatgggtcg gcgaagacgt 2760 ccacgggcgt ctcgatcaac gacgttctga ggatcggccc aacaatccag aacgacctga 2820 cggcgatcct gctccgattc agaggcttcc agttcgtctt cacgctggac atcccgaaga 2880 tgttccgaca agtgcgcgtc catccggaag acacaaggta ccagcgaatc ttttggaggt 2940 acgaccggaa cgattttctc accgttcgag agctgctcac ggtgacgtac ggattgggac 3000 cctcgccgtt ccaagcaacc atggcgctga aacaagctgc agcagatcac gaagacgagt 3060 tcccgagggc cgccgaggtg gtcgacaagg gaacgtacat ggacgacgta ctgacgggtg 3120 ccgatacact tgctgaggcg tgcgaactcc aacgggagat gaccgggctc ctagcgaaag 3180 cctgcttcgg cgcccacaag tggtgcgcga accactccga cctcttgcga gaggtccctg 3240 aagagctgcg tggcaattcg ttcgaggtca ccgacgacag ctcgaagacg atcgtgaaaa 3300 cgctgggcgt tacgtggaac ccgtttgagg actggttctc ggtgtcagtg cccgacttcg 3360 acgacctgga agaagtgact cggaggaagc ttctgagtca gctggccaag atcttcgacc 3420 cgctgggatt cttcgggcca gtcatcactt acgcgaagct gatcctgcga gaagttggcg 3480 agctgcacat cgagtgggat gacccggtcc caacagacat cggcgagaag tggcggagct 3540 tccgcatcga gttgaccgcg ctgagagaag tgcgactgcc gagatggatc tcctggaaag 3600 gcgcgctcaa gctagaactg cacgggtacg cggacgcatc tgatccggcc tacggtgcgt 3660 gcatctacgt gaagggcttc ttcgcgaacg aggagaccga gatgcgattg atctgcagca 3720 agtctcgaat cctgcccaag aaacggaagc ccaaggagaa ggccatctcg accccccgag 3780 ctgaactgct ggcggcactg ttgctcgcta gaatggtcgt gaagttcctg agcgccacgg 3840 aactcaagtt cgaatcggtg cacctctgga gcgactcgca aatcgtcctg gcttggctca 3900 agaaactccc gcagctgctg cagacgttcg tgtcgaatcg ggtaagcgaa atccaaaaac 3960 tgacgcaaaa cttccactgg cattacgttg ctacgcacga aaacccggcc gacttgattt 4020 cgcggggagt cacaccgaag aagctgatca agtccaagaa gtggtggcag ggccgaccac 4080 gcacaaccgt agccgcggct gcggacgaaa tcgtgattcc ggatgacgag ctgccggaaa 4140 tgcgaactgg agtagtgctg gccatgacag ttcctgtcga acgtttcccg atgtttgaca 4200 agttgagcag tttcacgggg atcgtcagaa gcatggccta cttggtgcgc ttggcgaggt 4260 tcgtcaagtc gcgcaaaacg gaggtggtga agggacgact caccgccaag gagatgcgca 4320 cagcaacgct cgtgattgtg cggctggttc aacgcgaagc attccagcca gaaatcctcg 4380 cactgatgga cggtgcggac acaaaccatc gactcaacgg gctgaaggcg ttcctagatc 4440 ccgacgacgg ccttctgcga gttggtggcc ggatcaagcg ggcgttcgtg ccgtacgata 4500 gccgtcacca gatgctgctg ccggctaagc acccgatcac cgaagcactc gttcgagaac 4560 tgcacgtgga caacctgcac atcgggcaac gaggtctgct tgcggttgta cgacaacgat 4620 tctggccgct caacgtgaag agtacgatcc gccaagtgat ccggaagtgc atcgtgtgct 4680 tcaaagcgaa cccgttgaag acgacacagc tgatgggtga cctcccgtcg tatcgtgtcc 4740 agccagctcc ggtcttctcg aacaccggtg tggactacgc cgggccattc tggatcaagt 4800 cgtcgtcggc cacgcgcaag ccccagatca cgaaggctta cgtgtgcttg ttcgtgtgtc 4860 tgcagacccg ggccattcac ttggagctgg tctcggacct gacgacggac gccttcctag 4920 cgagcttgcg acgattcgcc agccgacgcg ggtgcccgaa aacgatgcac tccgacacgg 4980 gacccaactt tgtcggggcc aaaacggagt tgcacgaact ttggcagatg ttccagaacg 5040 agtgcgccac gaagaagatc accagctact gcgtcgagaa gggaatcgag tggtctttta 5100 tcccaccgcg gagtccacac ttcggtggca tctgggaggc cggcgtgaag caggtcaagc 5160 atcacctgaa gcggattgtc ggcgagcgaa agttgtccta cgaggaactc tacaccacgc 5220 tgacccagat cgaagctgtt ttgaactcgc ggcccctagt gccgagctcg gacgacccgt 5280 cggactacac cgcgatcacg ccagcgcact tcttgatcgg acgcgagatg caggccgtgc 5340 cagaaccaga ctactcgcac ctcaaggaga accgactgtc gcgatggcag ctggtgcaga 5400 ccatgctgca gcacttctgg aggcggtgga cagctgaata cctgccggag ctgcaaaacc 5460 gatcgaagtg gctgaagacg aaggtcatca aggagggttc gctcgtgctg ctcgtggacc 5520 agaacgcgcc acctctgcag tggcctcttg gacgaatcgt gaccgcgcac cccagagagg 5580 acggcgtgac tcgcgtcgtt acggtgcgca tggcgaatgg agcggagttc aagcgtgcgg 5640 tgaccgaggt gtgtttgctg ccgttggacg aggacgaaga ttgaaatacg atttcaacgc 5700 gggggagga 5709 // ID Harbinger-N8_BF repbase; DNA; INV; 391 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N8_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N8_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-391 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-391 RA Kapitonov V. and Jurka J.; RT "Harbinger-N8_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 821-821 (2008). XX DR [2] (Consensus) XX CC This transposon contains 34-bp TIRs and is characterized by 3-bp CC TSDs. XX SQ Sequence 391 BP; 121 A; 62 C; 78 G; 130 T; 0 other; ggcctaggaa aaaaaatgtt gtgtttcctg tttcagtcct gaaaaaatta gggtcggtag 60 gtagggatta tctttttttt ttcttgtatt tttttttagg ctggccaaaa ttctagtgat 120 acatatttac aatacagttg aacaaagtaa actgaaatgt atgatgacac ttgtctttca 180 atgtcaaact ttaattttgt gcatgataga tacatttatc cttcattaga tgggacaagc 240 agtgttttac ttcaatagga agcaggccaa ataaaaattc taactggaag ctatgtgact 300 ccactgccca ccccaaaaaa aagtctaggg tcggcaggtt tttttagggt aggtagggaa 360 acaggaaaca caacattttt tttcctaggc c 391 // ID Ginger2-1_AP repbase; DNA; INV; 2885 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.12, Last updated, Version 2) XX DE Ginger2 DNA transposon from Acyrthosiphon pisum. XX KW Ginger2/TDD; DNA transposon; Transposable Element; integrase; KW Ginger; Ginger2; Ginger2-1_AP. XX NM Ginger2-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2885 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(790..1644,2159..2644) FT /product="Ginger2-1_AP_1p" FT /translation="MSLNQLFVPLKPGETKIIYFVKNEDLFDIIHDAHIKT FT GHGGRTRVISELQTKYKNITYKSVTLFLSLCVQCQRKQKVPRKGIVVKPII FT SRELNSRCQVDLVDMQTCKDGEYKFILNYHYHLTKFIQLRPLKSKTAEEVA FT LMLLPIFLTFGAPNILHSDNGREFSNKIIMDLCSRWEGVKIVHGKPRHSQC FT QGSIERANQDFQNILRAMMHDKNTTKWSEALPFVQFAKNTAYHQGIKQTPY FT EAMFGNVAKRGLATSSLPREQIKDIETEEQLEQIIQSIGEYLQVIKKKKKN FT MIQNRAAARMGLELQAERMLRSSRQKFTPAKPGDTVRIRVPDVDRGRMDPQ FT NILAVVVAVDNEFYTLGTKEGVINQLYTRNQFAVCKEQLLSPEEVATDQSV FT SLRKASTSISKTGGQGYMKCNCKAKCVSKKCSYRSSGILCNSKCHNSLSCC FT NK" XX SQ Sequence 2885 BP; 1063 A; 446 C; 507 G; 869 T; 0 other; tgttaaggtg aatgacgaca tttggtgaat gttgacattc accggtgaat gttgacattc 60 actgctgttt ttggtgaatg tcaacatttt gaatgaatat taaaaatttg ccaacagtca 120 ccggtgaatg tcaacattcg caacgattat cggtgagtgt tgacattcac aattgagttt 180 tggtgaatgt tagacatttt tagttaaaat tggagtatag acgttatcgt gtcatacgtc 240 aagatagtaa atgtttatag atatgatatt gttcacacaa tatttaatcg aacggtgttt 300 aatgatctat ataatattat catagatatt taggcattta acgttacggt atctgtctgt 360 ctgcctgtac ccactaccta tctctctctc tctcacacac gctaagattt tgcttaatcg 420 caaaattggc gatacggcag tcatcgtacc taatttagta gtgatgaaaa atacgtttat 480 atcgatcagt gtcgacgtat ttcatttacc ttgcttacac gaacgaaaag gtagactgtt 540 gtgactgatt gaatttgtga tacgttcgag ttacctacta attgttcacg tgtataatgg 600 agaacatatg agaagttttt tacgaaaaat ttaacgctat attaagtaca aaaagagaag 660 acaacaattt ttatttgagt acttaaaata tatgacaaat gtgttgaaga cgttctcaag 720 tttaagggga aagaaagtaa gcaacgcgcc gtgtgcaagt tattgaaaat atatgatgtc 780 gttaaaataa tgtctctcaa ccaacttttt gttccattga aacctggtga aacaaaaata 840 atttacttcg tgaaaaatga agatttattc gatattatcc atgatgccca tatcaaaaca 900 ggccacggag gacgaacccg tgtaattagt gaattacaaa caaaatataa aaatataaca 960 tataaatcag ttacattatt tttgagcttg tgtgttcaat gtcaaaggaa acaaaaagtc 1020 ccgagaaaag ggatcgtcgt aaagcccatt ataagtcgtg aactcaactc tcgatgccaa 1080 gttgatttag tcgatatgca aacgtgtaaa gatggtgagt acaaatttat tctcaattat 1140 cattaccatc tcactaaatt tatacagctt agaccattga agagcaagac agcagaggaa 1200 gtagccctca tgttgctgcc aattttcctc acttttggtg caccaaatat tctgcattca 1260 gataatggga gagagttttc caacaaaatt attatggatt tgtgctcaag atgggagggt 1320 gtaaaaattg ttcacggtaa accccgtcat agtcaatgtc aaggttctat cgaaagagcc 1380 aatcaggatt ttcaaaatat actcagggcc atgatgcatg ataagaatac aacaaaatgg 1440 tcagaggctt tgccatttgt tcaattcgca aaaaacactg catatcatca aggaataaaa 1500 cagacaccgt atgaggcaat gtttggtaac gttgctaaaa ggggtctggc aacaagttcg 1560 ttgccacgag agcagataaa agacattgag accgaagagc agcttgaaca aattatacag 1620 tcaattggtg agtacctaca agtttaattt aaaaaataaa atgatataga taatgaattt 1680 tattattatt attagatgat gataacactg agcaagtgga acaacatgac aatcatatac 1740 aacagcttga acaaatttta cagtcattgg tgagtaccta caagttaaat taaaaaaata 1800 aaatgataat atagataatg aatttaatta ttattattag atgatgataa cactgagcaa 1860 gtaggacaac atgacaatga tatacaacca gaaagtgtat taggtaaact gtatttataa 1920 taaataaata attaagtaca ataataataa ttaaatatat tttatcataa tatctggctg 1980 agctatttaa aatagaaaaa aatatcttat aaacagtaaa atgataaatt atattcaatt 2040 ctattcagaa aaatctgtaa ataaataatt tatcaattta ttttccagct aatgacatcc 2100 aatataataa cactgagcag gtggaacata attttgtgca aagtgaagta gataatgaat 2160 taaaaaaaaa aaaaaaaaca tgattcaaaa tagagcagca gcaagaatgg gacttgaatt 2220 acaagctgaa agaatgctac gttcatcaag acaaaagttt actccagcca aacccggtga 2280 tactgttcgt atccgagtcc ctgatgttga ccgaggacgt atggatccac agaacatatt 2340 ggcagttgtc gtagctgtcg ataatgaatt ttacacacta ggcacaaaag aaggagtaat 2400 aaatcaatta tacacccgta atcaatttgc tgtatgtaaa gaacaactat tgtctcctga 2460 agaagttgct acagaccaat ccgtgtcctt aaggaaagct tccacttcaa tttctaaaac 2520 aggtggacag gggtacatga aatgcaattg taaggcaaag tgtgtatcga agaagtgtag 2580 ttatagatcc tcaggcattt tatgtaactc aaaatgtcat aacagtttat cttgttgcaa 2640 caaataattt tataatataa aataaaataa cttatggtta attaatttgt aaaaattatt 2700 ttatcactag gtattcatct ttttttacaa catgttaaca ttcaccaaaa acagatggtg 2760 catgtcaaca gtcaccagtg aatgtctaca tctaagaaga aattgacatt caccagaaat 2820 ttcggtgaat gtgacattca ccggtgaatg tcaacattca ccaaatgtcg tcattcacct 2880 taaca 2885 // ID PERERE-9 repbase; DNA; INV; 4394 BP. XX AC BN000800; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 02-JUN-2010 (Rel. 15.07, Last updated, Version 2) XX DE Schistosoma mansoni Perere-9 non-LTR retrotransposon (EST). XX KW R2; Non-LTR Retrotransposon; Transposable Element; PERERE-9. XX NM PERERE-9. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4394 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000800; Positions 1 4394. XX FH Key Location/Qualifiers FT CDS 361..3792 FT /product="PERERE-9_1p" FT /translation="MPVSTGAETDITSSLPIPASSIVSPNYTLPDSSSTCL FT ICFAIFPTHNILLSHATAIHHISCPPTPVQDGSQQMSCVLCAAAFSSNRGL FT TQHIRHRHISEYNELIRQRIAVQPTSRIWSPFDDASLLSIANHEAHRFPTK FT NDLCQHISTILTRRTAEAVKRRLLHLQWSRSPTAITTSSNNHTITDIPNTE FT ARYIFPVDLDEHPPLSDATTPNASTHPLPELLVILTPLPSPTRLQNISESQ FT TSHESNKNSMHTPPTYACDPDETLGATPSSTIPSCFHSYQDPLAEQRGKLL FT RASASLLQSSCTRIRSSSLLAFLQNESTLMDEEHVSTFLNSHAEFVFPRTW FT TPSRPKHPSHAPANVSRKKRRKIEYAHIQRLFHHRPKDASNTVLDGRWRNP FT YVANHSMIPDFDCFWTTVFTKTNSPDSREITPIIPMTPSLIDPILPSDVTW FT ALKEMHGTAGGIDRLTSYDLMRFGKNGLAGYLNMLLALAYLPTNLSTARVT FT FVPKSSSPVSPEDFRPISVAPVATRCLHKILAKRWMPLFPQERLQFAFLNR FT DGCFEAVNLLHSVIRHVHTRHTGASFALLDISRAFDTVSHDSIIRAAKRYG FT APELLCRYLNNYYRRSTSCVNRTELHPTCGVKQGDPLSPLLFIMVLDEVLE FT GLDPMTHLTVDGESLNYIAYADDLVVFAPNAELLQRKLDRISILLHEAGWS FT VNPEKSRTLDLISGGHSKITALSQTEFTIAGMRIPPLSAADTFDYLGIKFN FT FKGRCPVAHIDLLNNYLTEISCAPLKPQQRMKILKDNLLPRLLYPLTLGIV FT HLKTLKSMDRNIHTAIRKWLRLPSDTPLAYFHSPVAAGGLGILHLSSSVPF FT HRRKRLETLLSSPNRLLHKLPTSPTLASYSHLSQLPVRIGHETVTSREEAS FT NSWVRRLHSSCDGKGLLLAPLSTESHAWLRYPQSIFPSVYINAVKLRGGLL FT STKVRRSRGGRVTNGLNCRGGCAHHETIHHILQHCALTHDIRCKRHNELCN FT LVAKKLRRQKIHFLQEPCIPLEKTYCKPDFIIIRDSIAYVLDVTVSDDGNT FT HASRLLKISKYGNERTVASIKRFLTSSGYIITSVRQTPVVLTFRGILDRAS FT SQSLRRLCFSSRDLGDLCLSAIQGSIKIYNTYMRGT" XX SQ Sequence 4394 BP; 1168 A; 1294 C; 768 G; 1163 T; 1 other; atctcacgtt ttaatttatt tttgaactac tgcagtctga gtgcttctaa cgacccgaag 60 gctcagaaac tacccacttc ttgaactgct actttttgct gtttatccac aacaacagtt 120 gtgattctat tctccanata ttccttgtgc ttttgtcaac attattctat accaactgta 180 ccacctactt cttcatctca cgttttaatt ctggtcttat tttctcatca ttagtcacgg 240 agagggccta tgaacggtcc gtgacgcgaa attctatccg cgatttcgac ctctcctgct 300 agtggtcccc gaagtacggt tcctctggcc tgtcagttgt gttaaaacta tataataacg 360 atgccggtct caaccggcgc agaaactgac ataacctctt ctttgcctat tcctgcatcc 420 tcaatcgtct cgccaaacta cacactccct gattcctctt caacctgcct tatatgtttc 480 gctatcttcc ccacccacaa catactcctc tcccatgcca ctgcaatcca ccatatttct 540 tgtcctccta ctccagtgca agacggttct cagcagatgt cttgtgttct ttgcgccgcc 600 gctttttcat ctaacagggg actaacacaa cacattcgcc accggcacat ctccgaatat 660 aacgaactaa tcagacaacg aattgcagtg cagccgacgt ctcgcatatg gtcaccattc 720 gatgatgctt ctttactatc catcgctaac catgaagccc atagattccc cacgaagaat 780 gacctatgcc aacatatcag caccatacta acacgcagga cggcagaagc cgtcaaacgc 840 cgactcctcc acctacagtg gtcgagatca ccaacagcga ttactacctc ttcgaataat 900 cacacaatca cagacatccc caataccgag gcccgatata tttttccggt agacctagac 960 gaacatccac cattgtctga tgccacaacc cccaacgcat cgacacatcc actcccagaa 1020 ctccttgtca tcttgacacc gcttccatcc ccgactagac tacaaaacat atccgaatca 1080 cagacctccc atgaatctaa taagaactca atgcatacac cgccaacgta tgcctgcgat 1140 ccggatgaga cactaggggc tactccctca tcaactattc cctcatgctt ccacagttat 1200 caggaccccc tagctgaaca aagaggcaaa ctcctgaggg catccgccag cctactacaa 1260 agcagttgta ctcgcatacg gtcctccagc ctgctcgcct tcctccaaaa cgaatccaca 1320 ttaatggacg aggaacacgt gtccaccttc ctcaatagtc atgcagaatt cgtcttccct 1380 agaacatgga ccccatcccg acccaaacac ccctcccacg ccccagctaa tgtttctagg 1440 aagaaaagga ggaaaataga gtacgcacac atccagagac tcttccacca ccgtcccaaa 1500 gatgcctcca acaccgttct agacggtcgg tggagaaacc cctatgtcgc aaaccattca 1560 atgattccag acttcgactg cttctggaca acagtcttta ctaaaacaaa ttccccagac 1620 agccgggaga ttactccaat catccctatg actccctctc tcattgaccc gatcctcccc 1680 tctgacgtca catgggcgct gaaagaaatg catggcacgg ccggtgggat tgatcgtctg 1740 acatcgtacg atctgatgag attcgggaag aatggtcttg ctggatatct caacatgcta 1800 ctcgctcttg cataccttcc cactaatctt tcaacagcac gggtaacttt cgtccccaag 1860 tcatcaagtc ctgtgtcacc tgaggacttc cgtcccatca gtgtcgctcc agtagccact 1920 aggtgcctgc acaaaattct agctaagaga tggatgccgc tctttccaca ggaacgactt 1980 cagttcgctt tcctaaaccg agatggatgc tttgaagcag ttaatcttct gcactcggtc 2040 atacggcacg tccacacccg ccatacagga gcatccttcg ccctgctcga catatcacgg 2100 gcctttgaca ctgtatcaca tgactccatc atcagagcgg cgaaaagata tggggcacct 2160 gaactgttat gccgctacct caataactat taccgacgtt caaccagctg cgtcaaccgc 2220 actgaattgc atcctacgtg tggggtgaag caaggagacc ccctgtcgcc actcctcttc 2280 atcatggttc tcgacgaagt actggaaggt ctagatccaa tgacccacct aacagttgat 2340 ggagagagct tgaactacat agcttatgct gacgatctcg tagttttcgc tccaaatgcg 2400 gaactccttc aacggaaact cgatcggatc tccatacttc tacacgaggc tggatggtcg 2460 gttaaccctg aaaaaagccg gaccctggac ctaatctctg gtggccattc caaaatcaca 2520 gcgctctctc agacagaatt caccatcgcg gggatgcgta taccaccgct ttctgccgcc 2580 gacaccttcg actacctggg tatcaaattc aacttcaagg gccgatgccc agtggcccat 2640 attgacttat tgaacaacta cctcacggaa atatcgtgcg ctccacttaa gccgcagcag 2700 cgcatgaaga tcttgaaaga taatctactc cctcgactcc tataccccct gactctagga 2760 atagtacacc tgaaaaccct gaagtcaatg gaccgaaata tccacacggc cataaggaaa 2820 tggttgcggc taccctccga caccccgcta gcatattttc actcacccgt cgctgccgga 2880 ggcctaggga tcctccatct gtcctcatcg gttccattcc accgtcgaaa acgtctagaa 2940 accctcctat cttcaccgaa ccgcctactg cacaagttgc caacttcccc aacactagct 3000 tcttattcac accttagtca actgccagtt cgaattgggc acgagaccgt aacgtctaga 3060 gaagaggctt ccaacagctg ggtgagacga ttacattcgt cctgcgacgg gaagggacta 3120 ctcctagcac cactaagcac cgagtcccat gcatggctgc gctaccccca gtctattttt 3180 ccatctgttt acatcaacgc cgttaaatta cgaggtggct tactatccac caaagtcagg 3240 agatctcgcg gaggtagagt gacgaatggc ctgaactgtc gaggcggttg cgcccatcat 3300 gaaacgatcc accacattct gcaacattgc gcgctcaccc acgacatcag atgcaaacgc 3360 cataacgaac tatgcaacct tgtggcaaag aaactgcgta ggcaaaaaat ccatttctta 3420 caggagccct gcattcctct agaaaaaacc tactgcaaac ctgattttat aattatacgt 3480 gactcaattg cttatgttct agacgtcact gtatcggacg acggaaacac ccacgccagc 3540 cgcctgttaa aaatatcaaa atacggcaat gagcgaaccg tcgcgtcgat caagcgtttc 3600 ctcacatcca gtggatatat cattaccagt gttcgacaaa caccagtcgt ccttacattc 3660 agaggtattc tggatagagc aagttcacaa tccctacgac gcctatgttt ttcatcccgt 3720 gacctcggtg acctttgcct gagtgcgatt caaggctcaa ttaaaatata taatacctat 3780 atgagaggaa cctaacggct gaacgaatag cccccttcac tcttagacat tcccccactg 3840 ttgttgctta tcttcatgct cttgtgttaa ttgactgctc tcttctgggt tgacgtctga 3900 ttgtctctct ctctttccat attgcttgct ctgcccgctt acttccaata gttgtcatat 3960 tatgtctttg tttacttgcc atgtctaacg acaattactt tatctacctt agtttgtcct 4020 cttggtttcg attgccttca tatgttcatg gcggaatctg atgtttataa tgactattcc 4080 tattaccacc actacaacta ctattattat tttcattact attaacatta ttataaacat 4140 tattactatt attattatta ctattattac ttctacaatt aatattatgg ctactcctct 4200 cagcacacca ataaaatatc aatcaaacat ctcaattata tccacctatt aaactctctc 4260 tatttcccct gagttataaa cttacaattc agtctaaccg aatatctctc ttttacaaat 4320 cttaagtatg taattttgtg ccaaacccat ttgggtctgt acaatttgat acttaaaaat 4380 aaatgttatt agcc 4394 // ID Gypsy-238_AA-I repbase; DNA; INV; 5626 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-238_AA_; KW Gypsy-238_AA-LTR; Gypsy-238_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5626 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1077-1077 (2011). XX DR [1] (Consensus) XX CC Positions [4568-5041] - Integrase core CC 'GTACC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1715..3094 FT /product="Gypsy-238_AA-I_1p" FT /translation="MLRLRAQVKRCEYDKPNEMILDQIVEKCASSKLKQKM FT LKRDMSLEEVEALGTSLEESEKKLKEFGGSTVRGDIQKVNSWGSFKKQPGR FT VQLGNRWGQTSSNKPDGRERMSQGPDYDSQRYTNRSIPVCFACGRRGHVKG FT AESCAAKGAACMKCGNIGHFARQCLKRPNTRQPSGLPPKRVRMVQGHSEED FT KSDGYIFYAMGSNTFRFTVGGVEIPMTIDSGAAANIVSKEIWEEMKEAGVQ FT VWNMTQEVDRNFTCYASETPMKIVGSFMSTIEGGGRQTAAQFYVAEKGQQC FT LLGDDTAKVLKVLKVGFDIGNIANEPAKEFPKFKGVVVEIPIDQAIQPVQQ FT AYRRAPYALEDKVEEKLRVLLAQGIIEKVRGPSPWVSPMVPVLKESGDVRL FT CIDMRRANQAVLRETHPLPLVDELLGSVNGAVRFSKIDIRDAYHQVEISEK FT SRPITTFVTKNGLYR" FT CDS 3254..5107 FT /product="Gypsy-238_AA-I_2p" FT /translation="MFGICCAPEIFQKTMENVLAGLDGIIVYLDDVVLFGA FT SEKEHNERLQALRNRLDEYGILLNHAKCVYNVEEVVFLGHVLNKNGIRPTE FT HRMIAIKSFREPKNVSELRSFLGLITYVSRFIPHLASKTESLRQLLRGGVA FT FRWGDMHRRAFDEIKEAISDIGYLGFYNKLDITKLIADASPDGLGAVLLQE FT NSEGETRIIAFASKALTELEKKYFQTEREALALVWGVEKFSLYLLGRRFQL FT ITDCKALKFLFNSRSRPCPRIERWVLRLQSYDYEVIHEPGASNLADALSRL FT SIRTPKAFDSSGEAYIHQLLEYSVPDAVTLVDVKEASLEDETLQELRKALE FT TDQWEDKVKGYKQFKTELLVMGDVLMRGDRLIIPEKLQQNVLLCAHEGHPG FT MSAMKRRLRQKVWWPKIDEQVEKCVKDCNACTIVSTTGPPEPMMRSRMPTR FT AWSDVAIDFLGPLPSGHNLLVMIDYFSRFAEVVVMKQITAERTIEALFETF FT SRFGIPEVLRSDNGPQFISGILGTFCREYGITQQKTTPYWPQANGEVERMN FT NTILKRLRISQELESPDWKWDLRNFLLMYNSTPHSITGVAPSALHVWTCTA FT GQTTERTGNYRKINRKHQGS" XX SQ Sequence 5626 BP; 1696 A; 1026 C; 1450 G; 1454 T; 0 other; ttggcgacga gtataaaaat gtaagtagca aagcaactcc gaaagaagtt gattatttaa 60 aaaattccga ttttaaccaa tatggctgac ataatttgca ttcggctgat ggtcggaaag 120 atggcggggt cctggtgctg tgccccgacc gacatccgga tgataaagtg tgcaatcatt 180 cacataaccc atccgcaaca ttgttgatgt gtgcgtaaag caaatttttg tttattgtcg 240 tgaacggtat tttggccgac agtattggac cacgagaagt gattaccggg aagaacagtg 300 ggggctatcg tatgtgccgg tgagtgtttg ctcctagttt ttttttctgc cacggtggtt 360 gtgctgtcac gtttggattt gaaatttctt ataccgttat tgcgtcatat gcgttgtggt 420 gggtgggtgt gaatgtgaaa tgtggaagga gaagaggaga gaagaatgga ataattgaaa 480 ccgacgttga atgctgtgat cctaatgcaa gtaaaacgtc atatgtatag gtgatggcga 540 tgttcacagt gttccgcaac aggtacgcat caggtgagta aacatgaaca aaagtgggcg 600 atttcatgca ctgaaataaa tagctgcgga tatgtttgtg gatttgtttt attttatcgc 660 tgagattata ttgaatttct ggcattgact gggggaaatc aaatgttgaa caaatttgtt 720 tctctatatg atgcacagga ttattgattt ttcaatgttt gggaatgtag attgatgaaa 780 taattgtgaa aagaaagtga tttaatcgga gaataatgga cagaaaatca gtagagggcg 840 ggttcgccaa gttaaggaga gtcattatgg actagcgggt tcgctataca acgatggcgg 900 gttcgctatg aaagacagcg ggttcgctac atgacaaggc taaagagtta gcggaatcgc 960 ttggagttgg cggattcgct atggttaagc gggttcgcta taagttagtg ggttcactat 1020 atgattaagc gggttcgctt gtaatcagaa agttggctat gaaattgcag gtcagtccaa 1080 atttgtgtct gattggttac aaaataaacc ggtggtgtca ttatcaagtt taagcgggtt 1140 cgctgatatt tattagttgg gacttaaggt caacaaggaa tgagatcaag aaatcatttt 1200 aattttagtt tttttttttt gtaaactgag atcaaattta tttttcttca tactggatag 1260 ttatgtgaat gatgcaacaa tttatccaat atactacagt agtatcattc tcagcatatc 1320 tatcatatct agtttttggt ttgttcattt ttggaacagg gctaacagcg atagccggat 1380 caaaccgttc cagtcctcga ttgaatcctc gcaacttccg ttggcctggt cgaaatggaa 1440 aagagatttg gagtcgtatt ttgaatcgga aaagattgaa tctcagtacg ataaaagatc 1500 aaaattattg taccttggtg gttccgattt gagggacata tacgacaatt tgccagagac 1560 ggcgaatgtt ccccatgtgt tgaaagatcc tccgtattat gacgtagcta tagccaagct 1620 cgacgctcat ttcgagccat tccgccgacg aacatatgaa cggcatcagt tccgacaaat 1680 tttccagaat tcgtcggaac gcttttcgga tttcatgttg agactaaggg cacaggtcaa 1740 acgctgcgaa tacgataaac caaatgaaat gattctggac cagatcgtgg agaaatgtgc 1800 ttccagcaag ttgaaacaaa aaatgctgaa gcgtgatatg agtttggagg aagttgaagc 1860 cttgggcaca agtttggagg aaagcgagaa gaaactcaag gagttcggtg gttcaacggt 1920 gagaggagac attcaaaagg tcaatagttg gggatctttt aagaaacaac caggaagagt 1980 gcagctcgga aatcgttggg gtcaaacctc gtcgaacaaa cctgatggtc gtgaaagaat 2040 gagtcaagga ccagattatg attctcagcg gtatactaac cgttcgatac ctgtttgttt 2100 tgcttgcgga agacgaggtc atgtaaaagg agcggaatct tgtgcagcta agggagccgc 2160 ctgcatgaaa tgtgggaata tcgggcattt tgccagacag tgtttgaaac gaccgaatac 2220 caggcaacca tcgggacttc cgccaaaacg tgtaagaatg gttcaaggac acagcgaaga 2280 ggacaaaagt gacggatata ttttttatgc gatgggaagt aataccttcc gattcactgt 2340 tgggggcgtc gaaatcccga tgacaattga ttcaggggcg gccgctaaca ttgttagcaa 2400 ggaaatttgg gaagagatga aagaagcagg agtacaggtg tggaatatga ctcaagaagt 2460 tgaccgaaat ttcacgtgtt atgcatcgga aacaccgatg aaaatcgtcg gcagtttcat 2520 gtccaccatc gaaggcggtg gcagacaaac ggcagcccaa ttctacgttg ctgaaaaggg 2580 tcaacagtgt ttgcttggag atgacacggc gaaagttctc aaagtgctca aagttggttt 2640 cgacattggg aacattgcca acgaaccagc caaagagttc ccaaagttta agggcgtagt 2700 ggtagagatt cccattgatc aagcgatcca accagtgcaa caggcctatc gtcgcgcgcc 2760 gtatgctttg gaggataaag tagaggagaa gctaagagtt ttgttggcac aagggatcat 2820 agaaaaggtg agaggaccgt ccccatgggt ttcaccaatg gtgccggtgt tgaaggagtc 2880 tggagatgta cgcttgtgca tcgacatgag acgagccaac caggcagtgt tgcgcgaaac 2940 acatcctcta ccattggtag atgagctgtt gggatcagtc aacggagcgg tgcggttctc 3000 caaaatagat ataagagatg cgtaccatca agttgaaatt tcggagaaat cgcgaccgat 3060 aaccacgttt gtgacgaaga atgggctata caggtaaatg gagatttata gaacttcaaa 3120 aatcgagatc tgtttttttt tcgctatatt ttttgaaaag cagtcaagtc aaaattggta 3180 ttggaatgaa ataaacgaaa gtaatcgatt gtttatattt ttgcattctt aactaaacag 3240 atataagagg ctcatgtttg gtatctgctg tgcaccggaa atattccaga aaactatgga 3300 aaatgttttg gcaggtttgg atggcatcat cgtgtatctg gatgacgtgg ttttatttgg 3360 agcatcagaa aaagaacata atgagaggct tcaagcacta cgtaaccgat tggatgagta 3420 cggtatactt cttaaccacg ccaaatgtgt gtacaacgta gaagaagtag tttttctcgg 3480 tcatgttttg aacaaaaacg gtataaggcc tacggaacat cgcatgatag ctatcaaaag 3540 cttccgagaa cccaaaaatg tttctgaact gagaagcttt ttgggactga taacttatgt 3600 tagccggttt ataccacact tggcatcaaa aactgaatca ctaaggcaac ttctgcgagg 3660 tggtgtagct tttcgttggg gtgatatgca ccgtagagca ttcgatgaaa ttaaggaagc 3720 catttctgat attggatact tgggttttta taacaagctc gacataacga agttgattgc 3780 agatgccagt cctgatggtt taggtgcagt attgttacag gaaaattctg agggggagac 3840 gagaatcatt gcttttgcaa gcaaggcact gacagaactt gaaaagaaat attttcaaac 3900 tgaacgcgag gcgcttgcac ttgtctgggg cgtggagaaa tttagtctct atcttctcgg 3960 aagacgtttt caactgataa cagactgcaa agcgttgaag tttctgttca actcgcggtc 4020 taggccgtgt cctcgtatcg aacgatgggt tctccgcctc cagtcttacg attacgaagt 4080 cattcacgaa ccaggagcta gtaatctggc ggatgctctg tctagattat cgataaggac 4140 accaaaggca tttgattcat caggcgaagc gtatatccac caacttctgg aatactcggt 4200 tccggatgca gtaacacttg tagatgtcaa ggaggcatca cttgaagatg aaaccttaca 4260 agaactgcgt aaagctttgg aaacggatca atgggaagac aaagtgaaag gatacaaaca 4320 gttcaaaact gagttactcg tgatgggaga tgtcctcatg cgaggagatc ggctaataat 4380 tccagagaag ctacaacaaa atgtcctact ctgcgcacac gaaggacatc cgggtatgag 4440 cgctatgaaa agaaggctcc ggcaaaaggt atggtggcct aaaatcgacg agcaagttga 4500 aaagtgcgtt aaagactgca acgcctgcac gatagtatca acaacaggac ctccagagcc 4560 tatgatgcgg tcgagaatgc caacaagagc gtggtcggat gtggcaatag attttcttgg 4620 gcctctgcca agtggtcata atttattggt gatgatcgat tatttcagtc gatttgctga 4680 ggtagtagtg atgaagcaaa tcacagccga acgcactatt gaagcattat ttgaaacatt 4740 tagtcgcttt ggtattccgg aagtactgcg gtcagataac ggccctcagt ttattagtgg 4800 aatcctcgga acgttttgta gagaatatgg tattacacag cagaaaacta ccccgtactg 4860 gccgcaagcc aacggagaag ttgaaaggat gaacaacaca atcttgaaac gccttcgtat 4920 aagccaggaa ctcgaatcgc ctgattggaa gtgggacttg aggaacttct tgctcatgta 4980 taattcgacg ccccattcaa tcactggcgt tgctccttcg gcgttgcatg tttggacgtg 5040 tactgcggga caaactaccg agcgcacagg aaactacaga aaaattaaca gaaagcatca 5100 gggatcgtga ctggactaaa aaagttacag cagcagaatc ggagaacaag aaacgtcgag 5160 caaaaccgaa tgaactgaag gaaggagaca ctgtggttgc taaaaagatt ctaaaagaaa 5220 acaaactttc cggaaatttt ggccacgaac gattccagat actcaagaga tcaggtacgg 5280 aggttgaact gaagtcaatg gaaacaggcc ggacgtatca caggaatgta gcacatttaa 5340 aacgaatcag cgaagataca gagccttacg gagaaagcac gacaggtgga gaaacatcaa 5400 ctggaaactt gtcatcgtct aatcgggtga taaactctag agaacgcaga gagcacaggg 5460 ttcccagcta tttgcaagac ccgcgcgcgg cggcgccggg cgggccggcg gcgcccgccg 5520 cggcggggcg ccccgcgccg cgcgccgcct gaagttacgt gacggtaacc ttatttctaa 5580 cagtgtatgt taaaacagta gaatctaata ataagtaaag ggggaa 5626 // ID hATm-52_HM repbase; DNA; INV; 3288 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-52_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3288 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1946-1946 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 374..2914 FT /product="hATm-52_HM_1p" FT /translation="MLTRSGKVVETELKSLPTSSSCLSDTTDQPSPSSSSS FT TTRPVTRASTXFWLIGHPSPSITGSKLPDCRQVLKFFLFLRRDEENVRNHV FT TNQEIGYIVIDAVVTFWNMARIKTKTRQNSVLDLMCLWNEWNRLLKNKNRE FT SDAGGKRVNFLAKLDSLFDIGSPDAIQEIMKSRLLTAQKKDDDVAFYLDQK FT KDRKATMDGHDEIFEIKAKMKEVRYNRKIILRDEEKERVEKEGRQDDQDDE FT DMEVNDEEQNINENIAVAEVDIGLGWDSNYEPPMNRRSEFVTLHLPKRIMQ FT CEEITSAADRLKLSDNKTTMIVSAVIKAAGGNLDDFDISRSTSRRTRMINR FT QRIAESVIDNVRLNPPQFGALHWDGKLLKDILDVQHEQLAVLVSGAPEYSE FT GKLLGVPVLVDSKGQTQANATLDLLEAWDLSNNVVALVFDTTASNSGIHNG FT AAKLIEENLGRKLLYLACRHHIFEIFVGACWKKLFGNIFGPENKFFAEFKT FT AWPALDKQLPIETLAVEGTWLQNLKEQTIQHLTHLLANKDATTFPRDDYRE FT CAENTLIIFGQTPPRGVHFLKPGXMQXIQQLGWPATYMLXRCSCFLKMXYD FT GNYIHQARWMACNIYANKMFMFSKQLPYDQRMTTKLFRMNRYLSLFHTPAW FT TKASIGADAPMNDLQFIHDMMDYKEIDKEVADVVLSKITNHRWYLTEEVVP FT FAFFSKHVSSTVKKDMAAKLLEIPVPENFRLGKPVFRQITRQTGLSNLLGP FT ESHSLFSILGINTDWLAKPIEQWTEQPGYKEAEKFVRTVKVVNDTAERGVK FT LISEFATIITSDPEQREWLLQGVENHRQQYPDFGKKNSWTMKQH*" XX SQ Sequence 3288 BP; 1087 A; 580 C; 655 G; 951 T; 15 other; ttaggggtgt ccactatgca aaaaattttc aaaaccaccc aaattgaatt ttttcccatt 60 ctagtgtttg tgtacagtgt aaaaaaaagt tttactaaat tttaatgact attctagagc 120 cccctcaaag ccccaatttc ctgtttaacc cattagtgcc tgagtttttt gttacatttt 180 acgatttata gagcgtccca tttttgggga cgctaggcac gaatgggtta aaacagtagc 240 acatttttgc attgattaca aatttcataa acaagcaaat taatattatg atattgacac 300 aatttaatat gtaatattga ctgttataat tattttgctg tttcctgttt tagttaaata 360 aattaaaact gaaatgttga caagaagtgg taaagttgta gaaactgaac tcaagtcttt 420 gccaaccagt tccagttgtt tgtcagacac aaccgaccag ccttcaccgt cgtcatcatc 480 atcaacaaca aggcctgtta ctcgtgcctc tactaakttt tggttaattg gacacccatc 540 accatcaata actggttcca aactacccga ttgcagacag gttttgaaat tttttctttt 600 tttacgacgc gatgaagaaa atgttcgaaa tcatgtaaca aatcaagaaa ttggctatat 660 agtaatcgac gctgttgtaa ctttytggaa catggcacgc atcaaaacaa agacaagaca 720 aaattcggtg ttagatctta tgtgtctctg gaatgagtgg aaccgcttgt tgaagaacaa 780 gaatcgcgaa agtgatgctg gcggaaaacg tgttaacttc ttggccaagt tggatagtct 840 ttttgatatt ggatcacctg atgctataca ggaaatcatg aaatccaggc tactgactgc 900 ccaaaagaaa gatgatgacg ttgcattcta tttagatcaa aaaaaagaca gaaaggcgac 960 catggacggg catgacgaaa tctttgagat taaagcaaaa atgaaagaag tcagatataa 1020 tcggaaaatc atacttagag atgaagaaaa agaaagagta gaaaaagaag gtagacagga 1080 tgatcaggat gatgaggaca tggaagttaa tgacgaagaa cagaatatta atgaaaatat 1140 tgctgttgct gaagtagata ttggtttagg gtgggactcg aattatgaac ctccaatgaa 1200 tagacgatca gaatttgtta ctcttcacct ccccaaaaga attatgcaat gcgaggaaat 1260 cacttctgca gcagacaggc tgaaattgtc tgataataaa acaactatga ttgtgtctgc 1320 tgtaattaaa gctgctggtg gaaacttgga cgattttgat atctccagat cgacatctcg 1380 tcgaactaga atgataaatc gacagaggat tgctgagagt gttatagata atgtgaggct 1440 caacccaccc cagtttggtg cccttcactg ggacggaaaa ttgctaaaag acatactaga 1500 cgtgcaacat gaacagttag ctgtgcttgt gtcgggtgct ccagaataca gtgagggaaa 1560 attgcttggt gttcccgttc tcgttgattc gaaaggacag actcaagcaa atgcaacatt 1620 ggatctccta gaagcttggg atctaagcaa caatgttgtg gcattggtat ttgataccac 1680 tgccagtaat agtggcatcc acaatggtgc tgctaaattg attgaagaaa acttaggtag 1740 aaaacttctc tatcttgcct gtcgacatca catttttgaa atctttgtcg gagcatgttg 1800 gaagaaatta tttggaaaca tttttggtcc agaraataaa ttctttgctg aatttaaaac 1860 tgcttggcca gcattggata aacagttgcc aatcgaaact ttagctgttg aaggcacctg 1920 gttgcaaaac ctcaaagaac agaccataca gcatttgaca catctacttg ctaacaaaga 1980 tgctacaaca tttccaagag atgactatcg tgaatgtgct gagaacacac tgatcatctt 2040 tgggcaaact ccaccgcggg gtgtacactt tcttaaacca gggwggatgc aagycatcca 2100 gcagcttgga tggccagcaa catatatgct ggraagatgt tcgtgttttc taaaratgrw 2160 atatgacgga aactatatcc accaagctcg gtggatggca tgtaacattt atgcgaacaa 2220 aatgttcatg ttttctaaac aattgcctta tgatcagagg atgacgacca aactttttag 2280 aatgaacaga tatctttctc ttttccacac accagcttgg acaaaagcta gcattggtgc 2340 tgatgcacct atgaatgacc tgcaattcat ccacgacatg atggactaca aagaaataga 2400 caaggaagtt gctgatgtcg ttctttctaa aataaccaat caccgatggt atttgacaga 2460 agaggtagtt ccatttgctt tcttcagcaa acatgttagt tccactgtga aaaaagatat 2520 ggcagccaaa ctcttggaaa ttccagtacc cgaaaatttt cgtcttggta agccagtttt 2580 tcgtcaaatc actcgtcaaa caggtttgtc aaatcttctg ggacctgaat ctcactcatt 2640 gttcagcatt cttggcatca atacagattg gcttgctaaa ccaattgaac agtggacaga 2700 gcaaccagga tacaaagaag ctgaaaaatt tgttcgtacg gtaaaagttg taaatgatac 2760 ygcagaaaga ggagttaaac tgatttcaga atttgccacc atcataacca gtgatccaga 2820 acaaagagaa tggttgctgc agggcgttga aaatcacagg cagcaatatc ctgattttgg 2880 caaaaaaaac tcttggacaa tgaaacaaca ttaataacat tcagaaagac agtcttaatt 2940 ttagttttgg ggttcccgtc tgatactttg actataactg ttaaccaaaa atgcagaact 3000 agcttggtta aactggataa tttaacatat attatgcaag ctgcttttgt ttggcttgat 3060 ttctgtgatt taatttctaa atacttgaaa aatgagattc acaataatta tttctctgtg 3120 ttttaactaa aaaagatact gttttaaaca ggaaaatttg gggctcwrga carggggctc 3180 tagaatagtt attaaaattt gawaaaaata ttttttaaaa ttactaraaa gggaaaaaat 3240 tcgaatttgg gtggttttga aaattttttg catagtggac acccctaa 3288 // ID Kolobok-15_HM repbase; DNA; INV; 2820 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2820 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 424-424 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(396..707,658..1098,1124..2236) FT /product="Kolobok-15_HM_1p" FT /translation="LSDKWVQKNRYAKSRAKKRKFHKLEKDSLNNSNALQQ FT TVLSPHSPXNVSXSASGKKINLSLLDVENFDYIEKECNIIINSNLFSSLLK FT TIGRCPRMLFVNQYXTKQLEDALECYSSINIXHDINKKEGLSHTFLISCQD FT CTWTINFLTSKSIENKNEVIKGKKMXEINLRTVLAFRDMGQGYIAIXTFCR FT LMNMPPGMTKRGYHNINNKLHLAYIETSRQSTKKASEDIRREKLLKVKIMM FT NHQLSMLLLVMVQRRGYSSMNGIVTAISHGKCIDAAVQTKSCYPCKIWSRR FT KGTPAYENWKITHNCKINHKGSSGSMEFVGAVQIFKRSVKELKLKYLTYIG FT DGDTKSYMQVCDANPYPGEIIEKAECIGHVQKRVGARLRMLRDRYKGNTLS FT DGKYISGKGRLTDKWINTLQNYYGIAIRQNIDSLYGMKKAVAAVLFHCAQS FT DNLEQQHQFCPKGSSSWCKYHSDQKNYNSKCIPKAIIELIKPIFSYKDLGS FT DELLKKCLHGQTQNVNELLNGMVWXRAPKRVHVEKTVVEIALASAIISFNN FT GVQGLLPVFEKCNVVPGYFTTKSSLNVDYVRXKQNEIKSTGATKNRRKKLR FT AIKKGFIDQHNEAEXVTYNSGGF*" XX SQ Sequence 2820 BP; 1006 A; 390 C; 469 G; 945 T; 10 other; ggtggtagta ccccaaaaaa caggctaaat attttttttt ttcaaaattt tttttttaac 60 aattttcatt catttatcat gtctactctt ataataattc aaaaaaatta aaaaaaaaat 120 aatcaaaagg cctaaattat ccttaataga tttggtactg gaaaattaat tgccttagca 180 acggycctta gagatacatt tcttgctata tcttgaaaaa taaaagttca atttatatgc 240 tttttttgtg tgcttatgac taaattgttc cttaagaact cttgttagtt ttcgatctaa 300 tatccataaa ttgaaacctc atatttgttt ctgtaatata tagataatta tccttttata 360 aaattttctt tgattgatgg gataattttg attagttgtc tgataaatgg gttcaaaaaa 420 atagatatgc taaatcaaga gcaaagaaaa gaaagtttca taaattagaa aaagattctc 480 taaataattc aaatgcttta caacaaactg ttttgtctcc acattcacca arcaatgtaa 540 gtkgatcggc atctggaaaa aaaataaact tgtctctttt agatgtagaa aactttgatt 600 atatagaaaa agagtgtaat ataattataa actcaaattt attttcaagt ttgttgaaaa 660 caattggaag atgccctaga atgttattcg tcaatcaata ttrtacatga cattaataaa 720 aaagagggtc tttctcacac tttcctaatt tcttgtcaag attgtacatg gacgataaat 780 tttttaactt caaaatcaat agaaaataaa aatgaagtaa tcaaaggtaa aaaaatgara 840 gaaattaacc tacgtactgt gttagctttt cgagatatgg ggcaaggata catagcaatt 900 awtacttttt gtcgtttaat gaatatgcct cctggtatga ctaaacgtgg ttaccataat 960 attaataaca aacttcacct tgcttacatt gaaacctcta gacagagtac aaaaaaagcc 1020 tcagaggaca tacgtaggga aaaacttttg aaggtgaaaa ttatgatgaa tcatcaattg 1080 tcaatgttgc tgttagttta gatggttcat catggttcca tagatggttc agcgcagagg 1140 atactcctca atgaatggta tagtcacagc aatatctcat ggaaagtgta ttgatgctgc 1200 tgtccagaca aaaagttgtt acccctgcaa gatatggtca agaagaaaag gtactccggc 1260 ttacgagaac tggaagatca cacataactg caagatcaat cataagggta gctcaggaag 1320 tatggaattt gtaggcgctg tacaaatatt caagcgatct gtcaaggagc ttaaactcaa 1380 gtacctaaca tacattggag atggagatac caagtcttat atgcaagttt gtgatgcaaa 1440 tccataccca ggagaaatta ttgagaaagc agagtgcata ggacatgttc agaagagagt 1500 tggtgcaaga ctacgaatgt tgcgagatcg ttacaaaggt aatacattat cagatggaaa 1560 atacatcagt gggaaaggaa gattaactga taagtggata aatactcttc aaaactatta 1620 tggaatagca atacgccaaa atattgatag tctatatggg atgaaaaagg ctgttgctgc 1680 agtacttttt cactgcgcac aatctgacaa cttagagcaa cagcaccagt tttgccctaa 1740 aggttcaagt tcatggtgta aatatcattc agaccagaaa aattacaatt caaagtgtat 1800 tccaaaagct ataattgagc tcattaagcc aattttttct tacaaagatc ttggcagtga 1860 tgaattgctc aagaaatgtt tacatggcca aactcaaaat gtcaatgaat tgctaaatgg 1920 gatggtttgg rctcgcgcac ccaaaagagt acatgtggaa aaaacagttg ttgagattgc 1980 tttggcctct gcaataatta gcttcaataa tggagttcag ggtttactac cagtgtttga 2040 aaaatgtaat gttgtgcctg gatattttac aacaaaaagt tctctaaatg tagattatgt 2100 acgcrttaag cagaatgaga taaaaagtac aggtgccaca aaaaacagga ggaaaaagct 2160 aagagctatt aaaaaaggct ttattgacca acataacgaa gcagaagrtg ttacatacaa 2220 cagtggtggt ttttagtagg ttttattgtg taaaaaaact taatgtgtct ttaatatttt 2280 tttaaattgt gtttttctag cgattcactt tttgtgtatc ttttaatttt ttaacaaaga 2340 taacttttta accgctcatt ggaattcttt caaattttca gcaattattc ttgagtgtat 2400 ttgtgatgtg ctgaaccaaa gaaaattaac ttttttttat attaaagtat taaaatttga 2460 rctttagcaa gctgtactta tacttttttc tttttattga tagaatttaa tgattttaaa 2520 gttattaaat gttgtttatt gtataatttt taattcagca cattttttga acatatataa 2580 aactacttat taagtttggt ttgtagaaca tgtatggttc tcaagatatt cttgttaaaa 2640 aaaggtcaat ttttttgcct aaattttgct aatttatgcg cataattatt gagcagaggg 2700 gtaaaaaaaa ttaattttgt tattttttga aatgatttaa tctaacactt cagaaaaaca 2760 taaatgttcc catataatat caaattttaa attttttttc tcttaaaggg tactaccacc 2820 // ID Gypsy-40_AA-I repbase; DNA; INV; 5318 BP. XX AC AAGE02031144; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_AA_; KW Gypsy-40_AA-LTR; Gypsy-40_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5318 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02031144; Positions 70680 65363. XX CC Positions [3912-4379] - Integrase core CC 'CGTAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2298..4898 FT /product="Gypsy-40_AA-I_2p" FT /translation="MDTSFCSSMLVVPKGKNDIRLVIDLRGPNRYIFRTPF FT SMPTLEQILAELDGATWFSTIDIANAFFHIELDEESRHLTNFFTEFGMFRC FT VRLPFGLCNAPDLFQETLQRKILGGCKGCKNYQDDILVFGSSKEEHDQNLA FT AVLACLANHNVKLNDSKCVFGSQIVGFLGFTLTPQGWMIEEEKLSAVKHFR FT TPASCSEVKSFLGLVTFVDKFIIHRATKTEYLRALAVSERFYWTKNEEREF FT CYLRDEALKTIRRLGYYNPSDRIELFVDASPTGLGAVLVQYNNADQPRIIA FT CASKVLTPTEQRYPQTQKEALAVVWGVERFTYYLLTTSFVVRTDAEANQFI FT YDANHRLGRRAVTRAEGWALRLQSYDFSVKRIPGNENVADALSRLIPATQQ FT SESFDEDEEKHFLRALDSGCMELTWTEIEEASESDEELRLVRQALKVKKWP FT SEIRPYEAQRKKLHFLGPLIFKDERMVLPRALRRRALDSAHGGHVGVMAMK FT RVMRQFFWWPKMSAAVSRFVKNCETCALLSKRNPPVPLVSRELPDGPWEVL FT QIDFLSVPNFGSGEFLIVIDTYSRYLHVVEMRSMDAESTNSALSKVFQTWG FT LPIVIQSDNGPPFQSAVFIKFWKEKGIDIRKAIPLSPQSNGAVERQNQGII FT KALSASRLDGRNWRDALQEYVHRHNTLVPHSRLAVTPFELMVGWKHRGTFP FT SLWSGSSELDRSEIRDRDAEAKLASKVYADSVRGAKESDIKIGDVVLLAQQ FT KRSKTDSTFSAERFRVVAREGAKVVVMSKTGVQYARNVQEIKKAPGFVVDP FT ESSEEKQDIHIDNGSSSMLENFPNAADGISPSSVIPTNDSRLLRKRNLIKR FT PSRLNDDFIYRVFH" XX SQ Sequence 5318 BP; 1539 A; 1040 C; 1278 G; 1461 T; 0 other; tggcgtggtc atggtcgcca ggtgagtaaa gagaaaagat tgcatttcaa ctacaaatgg 60 gtgaaagttg accggatttt tttttcccca agaacgaatg atgcaaaaaa aatgaggagc 120 taagagcagg tcttaatgcg caggatgcgg gaagtaatcc agatcaccac aatcttgact 180 tcgatgtgcc aaacaggttc agtagtactc gacacatggc attcgctccc ggtgcgttcc 240 gaagtcttga agaatctcgt tttttatcat cgatgaacca actgtcggta gcgtcgatca 300 atgtaccaga atgcaaggct gccgaaaacg accaaatcca tcgtcagacc ttcgaacagt 360 ggaaagactt gctaatggat tctatgaatt tggctggtat tgaggaagag gcaacaatgt 420 acacgatctt caaggtcaag gctggaacag gtttgcttga aatctacagg aataccaaat 480 cgcaaggaga tgcgccagat gaaaaatcag ctccattttc gaacgccatg catcgtctga 540 agacctattt tggctctggg tctgatgtca tgttgatgag acgtcggctg gcactaatgg 600 ttcaaaaatg tgaggaatct gatctgaatt tcatcataag ggttggttca acagctcgcc 660 tgtgtgaata tgatccagat aaagaattcg aggaagttgt ggctacagtc gccgaacacg 720 ctaggaatcg agacgtacgg actactgctt tgaagatgct tagtcgcaag ggaaacttcg 780 cagacttggt tgacaaagtt cgcgaactgg aagcaataag actcaatgaa gaatatgtta 840 tgcagaaaca tgaaaagcag attgaaaatc caagtcacgc attgattgct ccggtaagat 900 cgccgtacag ctgggtacca aatcgtccta ttcgtgggaa tgctattgct cgtgggtatc 960 cgagtcgccg aggcacctgg cggggtctaa gaggcaggca atcaatccca aggaattagt 1020 atcaaccaca aacattgtct gcaccaggag aaacgtgctg gcgttgcaac ggtttctatc 1080 actcggtaaa tgtgtgtaat ggtcgagata aaatttgtta tagatgcggt gttgtgggta 1140 cattgtaagg gcgtggaccg caagcgcata taagtgtatt gttcagatgg gaatggaagc 1200 agctctggtt aaaattgccg ttgttgagaa gttcgaagaa gtttcagctg gcgaagactc 1260 ggtaagtgtc gctacggaaa attaatcggg cattgaattt tgaaaggaag cattaaatta 1320 ccttttgtta tatgattttc atgacttcaa taaatgaatt aatgtgaaac aatcattatt 1380 ttgaaattta ctatgcagga atcgattccc aacaacagag tggtaattca cagtagtcct 1440 tgtgatcccc ttgtaagcga tcttgagtta atgatgccgc ccaaacaaca accagattct 1500 tcatctcttg tcgcaacact tcaggctgca cttccggaaa aggattcaaa acatgttgaa 1560 atggacgcta taagtgtaaa tactgtaagt attgatatac tgttgatcaa tgtatggttc 1620 taaacttatt aacctcatgt tacgctagct taatcgatca atgccaaaca aaccacctgg 1680 cacgagctcc catgtgtata atgttacaac agatgatgga gttatagttg gagttgtggc 1740 tggattgcgc tgttcattct tgatcgattc tggagcacag gtcaacacat ttactgaaga 1800 tttgtttcaa gaactgattt ctaatcctaa gcatagagat gaagtgttcg aggtaaaata 1860 tcagaccgac agaccactga aggcgtacgc aaccgtcggc gaaatcaagg tagttgctac 1920 tttcctcgca taccttttta tttcggatga taggccactt ctgctggaga aattctatgt 1980 ggtcagcgaa gtacgtgctc tgctaggtag atcaactgca tcacgataca gtattctaat 2040 gcttggtcta aaggttcctg tacaatcaca gcgtctttgg atcgcatata tccttgattg 2100 caggtgaaat tgcaacgata gtcacaaatg aaattttccc caaattcaat ttgccgccgg 2160 ttaagatcca ttacgataga tctaagccgc cttgtcgaaa cattttcctc aatattccat 2220 tggccgtgag accgttggtc gagaaaagaa tccaagagct gataaatgcg aatataattg 2280 agccagtagt cgaggggatg gatacttcgt tttgttcctc catgttggtg gtacctaagg 2340 ggaaaaatga cataaggctg gttattgatc tgagaggccc taaccgctac atattcagga 2400 caccattttc aatgccaact ttggagcaga tcttggcaga actagacggc gccacctggt 2460 tctcgactat tgacatagcc aacgcgtttt tccacatcga attggatgaa gaatccaggc 2520 atttgaccaa cttctttacc gaattcggaa tgtttagatg tgttcgactt ccatttgggc 2580 tctgcaatgc accggattta ttccaagaga ccctgcaaag gaagatattg ggaggatgta 2640 aaggctgcaa aaattatcaa gatgatattc ttgtctttgg atcgagtaaa gaagagcatg 2700 accaaaattt ggccgcagtc ttggcgtgcc tggccaatca caatgttaag cttaatgata 2760 gcaaatgtgt ttttggcagc cagatcgttg gatttctggg tttcaccttg acgccacaag 2820 gttggatgat agaggaagag aaactatcag cggtcaaaca tttccgaacc ccagccagct 2880 gctccgaggt caaaagcttt ctcggattgg ttacgtttgt ggacaaattc attatacatc 2940 gagcaaccaa aacggaatat ttgcgagcgc ttgcggtttc ggaaagattt tattggacca 3000 aaaatgaaga aagagaattt tgttacctta gggatgaagc gttgaagacg atcaggcgtt 3060 tggggtatta caatccgtcc gatcgaattg agttgtttgt cgatgcgtct ccaactggcc 3120 ttggtgctgt gctagtgcaa tataacaatg ctgatcagcc tcgaatcatt gcttgcgcgt 3180 caaaagtact gactcccact gagcaacgct atcctcaaac acaaaaagaa gctcttgctg 3240 tggtttgggg agtggaacga ttcacttatt atctgttaac cacatcgttt gtcgttcgaa 3300 cggatgcgga ggccaaccaa tttatttatg acgccaatca tcgattggga cgaagagcag 3360 tgacgcgggc tgaaggatgg gccttaagat tgcagtccta tgatttttct gtaaagcgca 3420 taccaggcaa cgagaatgta gcagatgcat tatcccgatt gatacccgca acgcagcagt 3480 cagaatcgtt cgatgaagat gaggagaaac actttcttcg tgctctggac tctgggtgta 3540 tggagcttac ttggactgaa attgaagagg catcagaaag tgacgaagaa ctaagattgg 3600 taagacaggc gcttaaggtg aagaaatggc catccgaaat acgaccgtac gaagctcaga 3660 gaaaaaaatt acattttctc ggaccattga ttttcaaaga tgagcgtatg gttctacccc 3720 gggcgctccg tagaagggca cttgattctg ctcacggtgg acacgttggt gtaatggcca 3780 tgaagcgtgt aatgcgtcaa tttttctggt ggccaaagat gtctgctgca gtttctcgat 3840 ttgttaagaa ctgcgaaaca tgtgccctct tgtctaaacg caatcctccg gtaccacttg 3900 tatccaggga acttccagat ggtccatggg aagttttaca gattgacttc ttgtccgttc 3960 ccaactttgg ttcaggtgag tttttgattg tgatcgatac gtactccagg tatcttcatg 4020 ttgtggagat gcgatctatg gatgcagaga gtacgaattc agccttaagt aaggtgtttc 4080 agacctgggg actaccaatc gtgattcaga gcgacaacgg accgccgttc caaagcgccg 4140 tttttatcaa gttctggaaa gaaaaaggga tcgatatacg caaagccata cctcttagtc 4200 ctcagtctaa tggagctgtc gaaagacaaa accagggtat tataaaagcc ctatcagctt 4260 ctagacttga tggtagaaat tggcgtgatg ctctccagga gtacgttcat cgtcacaata 4320 cactggttcc ccattcgcga ctggcggtga cacctttcga actgatggtg ggctggaaac 4380 atcgtggcac ttttcccagc ctgtggagcg gttcatctga gcttgatcgt tcagagattc 4440 gtgatcgcga tgcagaagcg aagcttgcaa gtaaggtgta tgcggattca gtgcgtggtg 4500 cgaaggagtc cgatatcaaa atcggcgacg tggttttgct tgcccaacaa aaacgatcaa 4560 agacggattc aacattttca gcggagcgtt tcagagtagt ggctagagag ggagcaaaag 4620 tggtagttat gagcaaaaca ggtgttcaat acgcgaggaa tgttcaagaa atcaagaaag 4680 cgcctggctt tgttgtggat ccagagtcat ccgaagagaa gcaggatatc catatagata 4740 acggcagttc ttcaatgctt gagaattttc cgaacgcagc agatggtatc agcccgtctt 4800 ctgtcatccc gacaaacgat tcaagattac ttcgcaagcg aaatttgata aaacgtcctt 4860 ccagattgaa cgacgacttc atataccgtg tgttccatta gctttcacgc tactgaatgg 4920 tttttttttc ttgaagcaat ggaaataatt tgatgaaaat gtcatggcaa cctgacaaag 4980 aaaaattgaa gtgttaagta ataaaagaga aaaaaatgtt acctactttg tttttttatt 5040 tctcgtagcg aaaattttgt cgaatgtatt gggaactttg cactctaaca gaaaagtaat 5100 cgcctatctt gatgcgtggg aaattgcttc cacagaaatt atcacttttc ttgagcggaa 5160 cagcataatt agtttgatat ctgaatttag cttaagaata cgcatttagc gtatgcttgg 5220 tgtgtgttcg atgtcgtttt aaaagttatt ttgtttagaa aaataatatc aactaacaag 5280 atgttttttt ttttgaagag agtacagatg gggaaaga 5318 // ID STREPB_FA repbase; DNA; INV; 275 BP. XX AC L22443; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Plasmodium falciparum subtelomeric region repeat. XX KW STREPB_FA; Repeat region; subtelomeric repeat. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RA de Bruin D., Lanzer M. and Ravetch V.J.; RT "The polymorphic subtelomeric regions of Plasmodium falciparum RT chromosomes contain arrays of repetitive sequence elements."; RL Proc. Natl. Acad. Sci. U.S.A 91(2), 619-623 (1994). XX DR Genbank; L22443; Positions 1 275. XX SQ Sequence 275 BP; 71 A; 56 C; 33 G; 115 T; 0 other; gacaagtttt tgttttttct tgaagtttat taagcataca ttctatagat ccttatgacc 60 atcttttttt tgtgagctct catcgccatt aagaccacac gacttctcga actcacctaa 120 agtaccacaa ggttttacag ctttgttaac atcattgaga agggtccctc tctaaaaaat 180 ttttcacatt gaatcatctc tatagtcatt tttatattgt ttcaggtaaa gtttatttta 240 ctccattctt tccttttcgt atttatccgt tattc 275 // ID Gypsy-23_OD-I repbase; DNA; INV; 13294 BP. XX AC CABV01004651; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_OD_; KW Gypsy-23_OD-LTR; Gypsy-23_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-13294 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004651; Positions 277 13570. XX CC Positions [3099-3581] - Reverse transcriptase CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 678..6095 FT /product="Gypsy-23_OD-I_1p" FT /translation="MEPVKNGQVLNLEEYTRKIVESDCAISFMNKFPLQLK FT MLRSFLTGEMVKVKSGCNYVMISQILKDFRIDHSPNARFPEGTKIGSSCFS FT ARRYTKKEEHTPFGDLMAIVFCIIAMSTKTWEHVLEEIARKEGLRPEKVDM FT RDLLNNLNYLFNCLDRKTGASGHMEIQQLEYAANINAIRKGGESLNDDMEF FT IKKEGHKQSLKLQRIANRSVEINKIESDLENNAGSASDTEEKQEEANRFFK FT NNKKTFGRGQSKNKKVSFGNNTIMSAEEFLRVDGLVVDKLSKKVYKPDGRK FT LSYGAFETVNKEYGGRVMSIFPRAMGDRKGIRSLIDRGVKSARVIRKDGRK FT FAAFECDEEGNPTDLYELLSEMEHELNTIATEVKIVDVDTDREGIKSLTEY FT ESKGKNQTMIELNWIGNENDLENEFFSVKYETDMETWTQVAERRALAKKVF FT SVDFFLPTVKSLDRNKLSEIKFRLLIDSGASVSLLPESFVSGLVNATKVKV FT KVAKAEKGIARTAGGGSLEYLAITVSFTMKIGEMNISINNAKVHKGHSRSL FT LMSLSDMALNGISLINSVEIKGHMGPQEMVLMHKGQKMDKIVPKIQMIDAV FT FMYDSIPEEIFACDIKDKYSNFPNEIENTVERTSTIGEKCYDESVVRNMRP FT TSVKDHVGMRHYLRHIEKEHQKLRETNTRSDCTIGPKNDEFDPFEFVEDRG FT KMIAKIEEIIEKCKVLFEGTQGHVIGKEYEIHAEIVGDTSPKSAANSYGKN FT RPEYEQKAIIQQLDKELAEGVLTLKPPEMIPAHFVNYHAVGKKNQDTGKVE FT LACGNVRVVVDCARSGINKDTKHLARPTDSIRQVLQKVAPFTKKGFVATID FT ISSMFYCFSMHRSMWKHFTVMHPHLQLLVYTKLPMGWISSPLLAKEMIALI FT LYEHISKDQMAIYVDDICIFGSTETEFLSNLSSVFNTLAKMNLRLKGKKCT FT FLSRDIELLGRRIVNGVIKQSPHTIGKIAVEDEKSLNTVKSMKRFLGILAY FT LSEAIPYKTELLDDLYKLVGKTEAKKDRKPNEAIKWTPELIEKLNKVKSIV FT NEKTLTELHPLDPSKLCYIVVDSSNLGTGAFMYHLDDQDKPKIVKMYSKKR FT GDAKNIVQWASCMLEVHGILSAVNYFKYEIDSLDKPAAVLTDSSSCAKLFQ FT KCQAGKDLSNDPKLNERILKLMNFNISITYTPAKMKDLDLADFISRSENLL FT TKCNNDCKLCVEMHENGMIKEKEHAKLLEDTLNVTIEQQNMKIDRFEDIFK FT IEKIGEFQRAYKYATEESVLRFSDEEEEIEMMVTRQKTRSLERKRKGPRGS FT HEYENKIIFELIDRKDWKDLVTDHKLLREIQLNDKVLRITIEAKEGGRLPG FT PKDRPSESLRTNNTVIENGVLKKERIMVGVKAVSPIIIPENMVLIFVKKIH FT KSFGCGSAAKLKNLCKSIIWGKNMAKHAENITRQCKDCSYLRNPPKIIKIE FT TEHDNRWPDHIGQVFFADIISRRTTTDRSAQPETSMKFYVISEALSGYIKL FT YHIHNKENNAESGSEILIKAMYDLAVRPHGTLHKTIAMDGAPVNRKIAKNP FT VWEDLNVTIEIMDKTSNNKNYLAPLDSRMQKLSCYLNQIIGKKYSPDIVAH FT KVSDRYNATPGAAGYSPNEIWFGTDQHTGKRLDIDMEKIKGKVKEACKLAR FT DANDRNNSKSKRRLPIILKPYEEGDTYGNDHESPIKTGDLVLIEGEKDKNE FT THPFYEIIPIDGFDGIDWEHRVVHTKRHNVIAKKTKLWGFDSIKIVLDGNN FT KEAVAQFLTKEAQSIPEKFDSRTIVYNMINQNEFN" XX SQ Sequence 13294 BP; 4476 A; 2768 C; 3286 G; 2764 T; 0 other; tatctaacgt ggtgactgag agtttttctc ttgaaaaagg gcttaactcg atcgaactac 60 acgcattgtc ggaaaataac ctcgaggcgg aacattaccg ctcggtccat ggagaatgga 120 ttatataatg tgcgagaggc accaaggttt tatgccaaat ttacgaaatt gacgttcgat 180 gcgtcggacg gagcaattcg tcataggaaa gtggagcagt ggaaaactgc ggcagaagtc 240 cacatatccg atcggattac taaaaagtac aaaaagaacc cgttcgcaca gcacgttaac 300 aggctaagct gggcatgggg cgagtcccca aacggtagtc cgaacatgaa cgggtcgctt 360 agcattccat cttgtctgct agaattctat ggatgggcgg tggaaggtag atacatgaac 420 ttcccagaaa tccataagat cgcgcgatcg atcacaatcc cggaatattg gttttacaca 480 ccgttgaacg atccggacgc gtacgacgcg gatgaagcgt ttatgctgtt caaacagcat 540 atggacacga acatctccgc acgctcaggc agatctatag attccctgac gcggatgaca 600 aaacggtcta cgattacgcg atctcggcca taagggagct cgcgcccatc atttgcatta 660 atgtggaggc caacacgatg gagccggtga aaaacggaca agttctgaat ctggaagagt 720 ataccagaaa gatcgtggag agcgactgtg ccataagttt catgaacaag tttccactgc 780 agttgaaaat gttacgatcg tttcttaccg gagaaatggt gaaagtaaag tcaggttgta 840 attatgttat gatcagtcag attttaaaag actttaggat cgatcatagc ccgaacgcta 900 gatttccaga aggaacgaag atcggtagct catgcttctc cgctagacgg tacacaaaga 960 aggaggagca cacaccattt ggtgatttga tggcgatcgt tttttgcata attgcaatga 1020 gcacaaaaac gtgggaacac gttttagaag agatcgcgcg aaaagagggg ttgaggccag 1080 agaaggtaga catgagagac ctcttgaaca acctcaacta cttgttcaac tgtttagaca 1140 ggaaaacagg ggcctcaggt cacatggaga ttcaacaatt ggaatacgcg gctaatataa 1200 acgcgattag aaagggtgga gaatctctaa atgacgatat ggaatttatt aagaaagaag 1260 gccataaaca gtcccttaaa ttgcaaagaa ttgcaaaccg atcggtagaa attaataaga 1320 tcgaatctga cctcgaaaat aatgcaggtt ccgcaagcga cacggaggaa aagcaggaag 1380 aagctaacag gttctttaaa aataataaaa agacgttcgg aagagggcaa agtaaaaaca 1440 agaaagtatc gttcggaaac aatacgataa tgtcggcaga agaatttcta cgagtcgacg 1500 ggctagtagt agacaaactc tcgaagaaag tctacaaacc ggacggtaga aagttatctt 1560 acggagcgtt cgagacagtc aataaggaat atggcggtag agtcatgtca atcttcccac 1620 gagcaatggg agatcgaaaa ggaattcgat cgctgattga tcgaggtgta aaaagcgcgc 1680 gagttataag aaaagacgga agaaaattcg cggcgtttga atgtgacgaa gaaggtaacc 1740 cgacagatct atacgagctg ttatcagaaa tggaacacga gttgaatacg atcgcaacag 1800 aagtcaagat agtggatgtg gacacagatc gagaaggaat taaaagcttg acagaatatg 1860 aaagcaaagg taagaatcaa acaatgatcg agctcaattg gataggaaat gaaaacgatc 1920 tggaaaatga attctttagt gtgaaatacg aaacggacat ggagacgtgg acgcaagtag 1980 cagaaagacg agcgctggct aagaaagtat tcagtgttga tttctttctg ccaactgtaa 2040 aatcattaga tcgaaataaa ctgtctgaaa ttaaattcag gttattgatc gactcgggag 2100 catcagtatc attactcccc gaatcgtttg tgtcaggtct ggtaaatgca acaaaagtaa 2160 aggtcaaagt cgcaaaagca gaaaaaggca tagcacgaac agcagggggt ggctcactag 2220 aatatttagc tattacggtt tcatttacta tgaaaatagg cgaaatgaac atatcaataa 2280 ataacgcaaa agtgcataaa gggcactcac gatcgttgtt aatgtcgtta agcgacatgg 2340 ccttaaatgg aataagcctc atcaattcgg tagaaataaa aggtcacatg ggaccacagg 2400 aaatggttct catgcacaaa gggcagaaaa tggataaaat tgtaccaaaa atacaaatga 2460 tcgacgcagt gttcatgtat gacagcatcc cagaagagat atttgcatgt gatataaaag 2520 acaaatacag caatttcccg aacgagattg agaacacggt cgaaagaact tcaacgatcg 2580 gagaaaaatg ctacgacgag agcgtagtca gaaatatgcg accgacctca gtaaaagacc 2640 atgtaggcat gagacattac ttaagacata ttgagaaaga acatcaaaag ttacgagaga 2700 cgaatacgcg atcggattgc acaatagggc caaagaatga tgaatttgat ccctttgagt 2760 tcgtggaaga tcgaggcaaa atgatcgcta aaatagaaga aataatagaa aaatgcaaag 2820 tgttgtttga aggcacgcag ggccacgtga tcggtaaaga atatgagata catgccgaga 2880 tcgtgggaga cacttcgcca aaaagcgcgg cgaactcgta cggcaaaaac aggccggaat 2940 atgaacagaa agctattatc caacaactgg ataaggagct agcagagggt gtattaacgc 3000 tcaaaccgcc ggaaatgata ccggctcact ttgtaaatta ccacgcggta ggcaagaaga 3060 accaggatac aggcaaggta gaactggcat gcggaaatgt cagagtggtc gtcgattgcg 3120 cacgatcggg aatcaataag gacactaaac accttgctcg accaacagat tcgatcaggc 3180 aagtgcttca gaaagtggca ccatttacga agaaagggtt cgtagcaacg atcgatatat 3240 cgtcaatgtt ttattgcttt tccatgcata gaagcatgtg gaaacatttc acggtgatgc 3300 atccccacct gcagttattg gtctatacaa aattgccgat gggatggata agttcgccac 3360 ttttagcaaa agaaatgatc gcgctcatat tatatgagca cataagcaag gatcagatgg 3420 cgatctacgt tgacgatatt tgcatatttg gatcgacgga gacagaattt ctgtcgaacc 3480 tgtcaagcgt gttcaacacg ctcgcaaaga tgaacttgcg actcaaaggg aagaaatgca 3540 cattcttatc aagagatatt gagctgctcg gaagacggat cgttaacgga gtaataaaac 3600 agtctccaca tacgatcgga aagattgcag tggaagatga aaaatctttg aacacagtca 3660 aatcaatgaa aagatttctg ggcattctcg catacctgtc ggaagcaatc ccgtacaaaa 3720 cagagttgtt ggacgatctg tacaaattag tagggaaaac ggaggcaaag aaagacagga 3780 agccgaatga agcaataaaa tggacaccag aactgatcga gaaattaaat aaagtaaaat 3840 cgatcgtgaa tgagaaaacg cttacggaac tgcacccact ggacccgtcg aagctgtgct 3900 atatcgtagt cgactcttcc aatttaggga caggagcgtt tatgtatcac ttagacgatc 3960 aggataaacc gaagatcgta aaaatgtact ccaagaaaag gggagacgct aaaaatatcg 4020 tacaatgggc ttcctgtatg ctagaagtcc atggaatatt gagcgccgta aactatttca 4080 aatacgagat cgattcgctg gataaaccag cggcggtact cacagactcg tctagttgcg 4140 ctaaattatt tcaaaaatgt caggcgggca aagatttatc gaacgatccg aagctaaacg 4200 agagaatatt aaagctaatg aattttaaca tttcaattac ctatacgcca gcaaaaatga 4260 aagatctaga cttggctgac ttcatttcac gatcggaaaa cttgcttaca aaatgcaata 4320 acgattgcaa attatgcgtg gaaatgcacg agaatggtat gataaaagaa aaggaacatg 4380 ccaagctttt agaagacacg cttaatgtaa cgatcgaaca acaaaacatg aaaatagatc 4440 gatttgagga tatattcaaa atcgagaaga tcggagagtt ccagcgggcg tacaagtatg 4500 ctacagagga atcggtactt cggttttcag atgaagaaga agagatagaa atgatggtta 4560 cacgacaaaa gactcgaagc ctagaaagga agagaaaggg gccacggggg tctcatgagt 4620 atgagaataa aatcattttt gagctcatcg atcgcaaaga ctggaaagat ctcgtaacag 4680 atcataaatt attgcgagaa attcagttga acgataaagt tctcagaata acgatcgaag 4740 cgaaagaagg agggcgatta cctggaccga aggacagacc ttctgaatcg ttgcgtacaa 4800 ataacacggt gatcgaaaac ggagttctca aaaaggaaag aataatggtc ggcgtaaaag 4860 ctgtaagccc aataataata cctgaaaata tggtcttgat cttcgtcaaa aagatacata 4920 aatcgttcgg ctgtggatcg gcagcaaaac ttaaaaattt atgcaaatcg atcatttggg 4980 gaaagaacat ggcgaagcac gccgaaaaca tcacaagaca gtgtaaggac tgctcttacc 5040 tccgcaatcc cccaaaaatt atcaagatcg aaaccgaaca tgataacaga tggccggatc 5100 atataggcca agtattcttc gcggacatca tatccagaag gaccacaacc gatcgatcag 5160 cacagcctga gacttctatg aagttttatg taattagtga ggccctaagt ggctacataa 5220 aactgtacca tatacacaat aaagagaata atgcggaaag tgggtcagaa attttaataa 5280 aagcaatgta cgatcttgca gtgaggcctc atggcacact acacaaaacg atcgcaatgg 5340 acggagctcc agtaaatagg aaaatagcaa agaatcccgt ctgggaagac cttaacgtaa 5400 cgatcgaaat tatggataaa acgagtaaca ataagaatta cttggctccg ttagatagta 5460 gaatgcaaaa attatcgtgt tacctaaacc aaataattgg taaaaagtat agcccagaca 5520 ttgtggccca caaagtttcc gatcgatata acgcaacacc aggagcagct gggtactcgc 5580 ctaatgaaat ctggtttggt acagaccaac acacagggaa aaggttagat attgatatgg 5640 agaaaataaa gggaaaagta aaagaggcct gcaaattagc tcgggacgct aacgatcgga 5700 acaactcaaa atccaaaaga agattaccaa taatcttaaa gccatacgag gaaggagaca 5760 cgtacggcaa cgaccacgaa tcgccaataa aaacaggcga tctcgtgctt atagaagggg 5820 aaaaagataa aaatgaaact caccccttct atgaaataat accgatcgat ggtttcgacg 5880 gaatagactg ggagcatagg gtcgtacaca cgaaacgaca caatgtgatc gcaaagaaaa 5940 cgaaactatg gggtttcgat tccataaaaa tagtgttaga tggcaacaac aaagaagccg 6000 tagcacaatt cctaacaaaa gaggcacaaa gcatacctga aaaatttgac tcaagaacga 6060 tcgtatataa tatgatcaac caaaacgagt ttaattaata aaaacgatcg acataagtga 6120 catgaaacgt taaagtttat ttgataagaa aatttgaact taacagtcac taatatcacg 6180 gaacaaatat gttagccgag gtgcgctcta gttccaaaaa ggaatccaac agtgctttca 6240 ggcactcgta aagatcggct gggcttggta aatgggtact gatcagtatg ccgcctcgaa 6300 aatgagcgat cagataagtc agatagcggt taaatacagt agtaagactc tcgcgaaaga 6360 accggttgcc gatcgaaggg aagctcgcag agcccctagc gccaagtcga aagtgtccga 6420 taatgggatc tccagcgagc acaaatgatc gtccaatatg tcgcagaaag gaagtgacgt 6480 caggagcggg gtctaaaaat aagttaagta agcggttagc tggtttcagc aaaccagtat 6540 agtcgagccg agaatctcga ggcgatcgta aacgagctcg aagaaaatct ccaggcaagt 6600 taacagaccc aggagcgcta aataactcga tcgtggacca agggtactcc ggaacgaggc 6660 cagaaggcac caaatgcagc gggaaagtgc tagaatcatc acagaagccg cgagaggcca 6720 agatggtgaa aggacagtga ggcaaaacca gcggtgaagt gctggaaggc cgatttggag 6780 tcaaggtaga cgatcgttcc ggcgtagaat tcagagaagc ggcaggccga tctggtaagt 6840 cacgtccaac taagaaacgg tgataattta agttaaaata agttacctat cccgttcata 6900 tcatcgtgga actcggttag atcaccacga gtaggtaaca aaaacgatcg gcgaggcaat 6960 ttcgagcgag attgggatcc gggaacatat gacgcacgac gaggagaacc tcgggcggta 7020 gcgccataag gatgggccca ggaggatgat cggacgccgg tagtacgagg aaaaggggcc 7080 gaccgaggac tggcaaccct aaatacggta ccattagagc cggcagaaga caccgattca 7140 gcggatcggc gcgtagaatc aacaggtcga gacccggaac cagcaacagc ttgagcgcga 7200 gcagttagaa cagactcgtc aatgatcgcc gataacgcag cattgtcgat ctggctgacc 7260 tcgatgcgct caggtaaagc acgataggag ccagtagagt gtgacgatcg ggtgagagaa 7320 tgcggcgaag cttctaaaat ggagtaagca gtaaaacgat cgaagacaaa cctgctataa 7380 aagttgaaga agctggtagt gaatacggcg cttcaggcgc ttcgggcgct tcaggtgtct 7440 ctgcagcaag tcgcgtcata cggcgcgttt tgacacgttg atccatttct cgcgtcgacc 7500 gagtgaatga ggacggcgag acgaaaagtg ccgaaaaatc ggtgccagag gcggtcacgc 7560 cggcgacgat cagcaaacca attacgatcg agcatacgca acggtacgca agcaaagatc 7620 gagaacagac caaaacgaaa tgatcttgaa agtggataga gaaataaccc caacggaatc 7680 agcgcgttat caacgagccg agacacaagc gagggcaatc taccggcttc aagctatgac 7740 gatcgacaga ctggtcaacc taaaagaatc gatcgaacat tgggagacat taaaaagtat 7800 gcttgaagtc cgagaatcta cctttctcac gatcgagagt aacagcgaga tactcgacga 7860 ggcggacgtt gcgttgtccc gaaagttcag gcagctcacg aatgtcgctg tcggagaagt 7920 atttgacatc gaaggcgcgc tcgcagacat cgagcgctcg atcgagcaag tcataagggt 7980 tgcaggatgc gatttcgttc cgcggctcta caacccagta acgatcagaa gtctctatga 8040 cgaaggtaga taattacaga gaatatatat aaataataaa ttaaggtcca aagaaagtgg 8100 actccggacg aacaaagaaa tacacattcg gagtaaaagc cagagccacc ggtagagtcg 8160 ttctcatact caccgtcgct gaagacgaca tatcgatcga tatattccca tagaaacgtt 8220 ctctgtacca cgaaagctct ccagaataaa gagatcgagc tagaaaatat aattctacca 8280 ttcagatgca gaacttaact atcagtagtg aggcgctcga ttacctcgct aggccgtccg 8340 atcgtgatgt tgcagaagtg gtcgctggga tcccaagctg gattcgctgc cgccgatgcc 8400 aatacgtaat gggcgaccag tctccagaaa tccggtacac gcaatgcggt tgcagatcgg 8460 tatgtaaacc atgtcacgat ctatcgccaa agtgcgaact ttgtaaaaac gatcgagact 8520 catctgcgca gccgatcgaa atcgtgagta ccaactgcac gccatgcaag gacaaggtca 8580 actacttggc tgcaagccag aagtattacc tggttactga aaaggtaata agaaataaaa 8640 atgagtcgaa ctatatttag cttaacaaaa tggctgagca aacacgacgt ggagctcaag 8700 aaaaagaaac agccctggct gagattattc gctggaaacc agcccaacag aacgatcgac 8760 agccgatcac gacacagcca acaccagcag catccactcc ggtgatgagg cgagtcgagc 8820 agagcgctgt ctacttcaca gccaacgacg agcgctacaa catgatcaac cagaacatga 8880 caacgctagc caacgaaaga gcacggattc cgcgtcaagc tcacagccta ccgatcggag 8940 ctgcggcaga aaagtacgcg ccacgatcaa tcatcgaaga taactaaagg ttacagaaaa 9000 taaaattcaa ctctaaagat agaaattaag tataacgatc gtactgcaaa gttctgcttc 9060 gctaccgcta aagttctgct ttattgctgc aaaggaacaa ataggataaa acactgttaa 9120 ctttattttg tttcagtcgg cagtaacagt gatagaaggc agatgtgaag gcggttgagg 9180 aatcggcaag tgataagagg cattctaaag ccataaggaa cgatcgattt actagaacaa 9240 agaaattata gaagtaacga aagtaaaact caccatttag gagcaagtac ttcccaaagt 9300 cgatcggcaa gagtagaaat cctcctcatc agagaattca tcatcggtga ggagatcacg 9360 atcgtcaata acaagtcggg tgcacagatc atcgtcgtcc tcatcgacag ctgtatcaag 9420 caaggcgctt ccttcctcga aaattggact gaggccaggt cgttgagaag cggagacacg 9480 aacaggagcg ggaggctcag caggctgatc aacgatctca gtagccggaa gtggagcagc 9540 ccgatcgacg tagcgccaag ctggatcgcc atgagattga acgtcaggtt caaaatcgca 9600 gaaaatattc gcccgacgag cgtaagcact aataaatagt aaaagtgaaa taaagggaaa 9660 acttacgcga tcggcgcgcg aagaagcggg ttcccgtcgg agtcgaagcg acgagcgtcc 9720 agccgagaat acacgtgctc ttggttgtac atctcatcgt cgatcgaaat ggacatgcgc 9780 tgaatttcaa agccgtcatc ggtgctggca ggcagaactc cacggtagtt ctcgatcgtt 9840 cgacgaccaa ggacaagctc ggctagaccg gtagggatca actgcgtgcg agctttaaaa 9900 tatatttaaa gtaataaata tataagtaga attaccgttg tagaagatgg gatcgcgcag 9960 cattccagtt cgagctacat cgcgcacaac gatcgggcgg ttctcgtgat cgcgcatgaa 10020 aagcttcggc gcaaaagtgg acacccaagc attaagtaca tgccagacac cgtgaacgat 10080 caagtcgcga gcagtggcga ggccgtggtg aacaaatcgg atgaacccga caaaatagaa 10140 ctggcgctcg tcctgggtaa aaatttgata aaaataacat aagatcgcac ttaccactcc 10200 gttgggcaaa tgaactttgc catgtccacg agcggtaaca catgactcgc ccatatgaag 10260 atagagggcc caaaatcgag cagcttgacg gttgaccttc gtccatcgat caggattagc 10320 ggacacaacg atcagagctt caaggtcgag ttaaattgac gatcgagaaa atacaaacta 10380 acagttttgt ctgggccaat tctggcccca actctgcctc acagtagtag gaggagcgta 10440 ccactggtcg ccatcatcgc ggggaattcg agactaaaag ttctgcgaat tcgatcggga 10500 aagaaaacaa acctcgacac cgatcgtgtc gaagaagccg cgaacagcgg caacagcttc 10560 gccgttgaaa gcagatcgtc ggggaatgaa ggctacgaac gctgacaagt atcgtcgctc 10620 catgtacgct tcttcgccat gctgagcttg gaagtcgatc ccggtctgta cacgctacaa 10680 cgatcggtaa aataagtaac tactgaaaac atactcgaaa gtatttgcta gtgaacacct 10740 ggtaagccaa tccagtagaa atgcccacat cactcacacc gtgaatcacg ggttgcgcag 10800 gagcaccttc acgatcgtat cgcccatcgg aagcatgaag cttcagccgc tcgaacgaaa 10860 atcgacctag aaaaggtaaa acattactca acgatcgttt aaaattagct tacctgtgta 10920 cttgaatagt tgatacaact gattctgcga gtacgtttgc cagcgagtga aacgctcaaa 10980 cagacgccgt ttcgccatcg tgcgcatatc ggcctcgaac aagtggccag caacacgagc 11040 gtcgttgcaa gaatcaattc gatcgcatgc ttgttcgaac catcgagtga cactgaactg 11100 attgatctcg cactgttcag gagcttgaag gtcgatctca atggcgatgt cggatcgatc 11160 ttcgtaatgt ggttgcccaa agtgagcttt gaactagaaa agaagataag taaaatttct 11220 aaaaatcgag attcgctcac cttgatcacg ccattgagct tgttagcacg atcgagaggg 11280 atgaaattaa tatcgagtgt cttttaacat ttgaattaag ggaaagttga aagtaaagca 11340 aaccttttgg acagaagacc aattcgcgaa acgccagtcg agtaaggtcc gattcgtagc 11400 aagagtaacc gatcggcgta taccaagacc acgacactgg tgctcgatcg tgccgaaacc 11460 atgctggcga tgtcgttcgg cgacttcacg acacatatca aaaagttcgg gcttgaaatc 11520 accaatcagg ccagcatttc tgaaaagata aaattattcc ctgattatga aagaaattaa 11580 ataacgatcg aaacttttcc atagaattca gaaatggaaa ataatcgtga tttaatcgat 11640 ctgcacgaat tacccatgga agaggatgat ctcaacttcc tgttcgacga gaacgcagac 11700 gagaatgtcg acggaaaccc tgaaaacgat cgtcgggaac agccagccga aatcgacttg 11760 ctggaagagc tcgactttgg ggaccaaaac gagcaggatg agcctctcca gctcgtcacg 11820 gtggaacaaa ttcggcgaaa atttgaggtc gagcacgaca cgaaagttcg aaagaccttc 11880 gacgatcttg gaaaaggagt tcgaccgggg gaagtctacg aaaagctgtt ggacatcccg 11940 aaattcgcta aagctctcgc aaggcgcgac gcaaatgtat gggcggacga tctacgcaag 12000 ttgtcgcaaa aacttgaaac gtacgatcgg gcaatctacg tccgcgacga tagggtaaat 12060 cataaattaa aatttgataa atttaaaata atttgcgatc gaaaaattta aaaagcaatc 12120 gatcgattaa aagtttggga acttatttac taacctttcg atcgtagaaa gaactgagca 12180 gcctgtgtgc tccacgcgct gtccttcttc gccgcctctc atcggagctt gcagttcact 12240 gcctgaacgc tccagatgag gtaaaaattt ctctggaaat actaaacgat cggaaaattt 12300 aagatctctg gcctctttat agagcttgag gagcgcgtga gtatcctgcg atcgggagga 12360 tcgggattga gaatgatcgt cagttttgtg gcggatgagg aagtgaaacg ccgctaccta 12420 gaactgtcca atgagagcaa cgtaagagaa gctatggagg gcgagcttga gacgatcgag 12480 cgggtggctc gatccgccgg aatcattcat aaggtgataa gtcacgatcg gtaacacagc 12540 gctgactaat ttaagatgcc gccgctccca gacaacgtcg atgatatcga cgcatggaag 12600 gaatttggtc gaattgtgga cgaaattcga tcggaaagtc agcgaaagac ggacgatttc 12660 aaattgaaga tcggaggaat tgcaagatgg ccgatcgaca cgttggtaca gtacgtagta 12720 aagtactcgg accaatctcg actcgccaat ctcccgagag tcgaccaagg ccagtcggca 12780 atgacgattc cgacgataac gctcagcgga aagatcgaga gttacaaaat cctggcagac 12840 gggtcgatca gaaagctcac aaacgatcgt ctcccatcct ggtgtcaaga ttggaaaatc 12900 tcgacgaacg gtagacagat cggcagcttg cagaagaacg gagtgatgaa aggaaatggg 12960 ttcgccagaa agaacaacga tcgaggagca gatcgaaatc gaggtcgagg cggccagcat 13020 cgagcggcgc accgcggcgg agcgcgcgga ggctacagag gcaaagcaaa ccacccatac 13080 caacgctaaa aggtaattgt gactattcca agtcaaaata gggggacgct agggaatccc 13140 ctgccactta tgtggaaact ggccggaagc catccgtgac tcccgcccaa tgggaagaga 13200 cacaattccc aaatcgagca gcagactatc cgatgagctt cactctcgcg aaagtcgctg 13260 ctcgaggtaa gacacgatcg agagaagtag atcg 13294 // ID R2-1a_Cis repbase; DNA; INV; 6550 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE R2 Non-LTR Retrotransposon from Ciona savignyi. XX KW R2; Non-LTR Retrotransposon; Transposable Element; LINE; KW R2-1a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6550 RA Smit A.F.; RT "R2-1a_Cis - R2 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Ci000746, Ci000412, Ci000131 Inserts specifically in U2-like CC snRNA containing tandem repeat units (Sat131). Single ORF from CC 980 to 5347 encodes a protein 29 identical (48% similar) to CC R2-like transposases in C elegans (NP_507408) and Limulus CC polyphemus (horseshoe crab; gi|7511751). Perhaps to be named a CC separate class with these two members. XX SQ Sequence 6550 BP; 1515 A; 2014 C; 1634 G; 1382 T; 5 other; ccaaaattac ttccagcacc tccacagcag acgaacgaag aaagaagact tacgaaataa 60 gaataagcag tttaaagacg aagcagacga acaccactcc accaacgacg ctctgcagct 120 acaccaccac catctcgccg caacaagaag aattttccgc tgctctccac tctgctccaa 180 caccacctct cctgctctgt ggactgctgc cttgctgctg gaccaacctc tacccgaagg 240 aacccttcga acccagcagc aaggtacgtg tcaccacctc tccacagagc caaggccaga 300 gtggatagag cagcggcctc tacaaccaag ttctccactc cgacgacaaa acacctgcct 360 cgtggccaga gtcctgccga agaagaatct tcgaccccaa caccgtctcg gcagggctcc 420 aagcaacgac atcagccacg agtgcccacc ggagtggacg cggcgaaccc gaggaccgct 480 gtcgccaaga acaataacat caccgcctgc agcagctacc aaacaaaggt tagtctccta 540 cctcacctac aaactcgtct gaatagacgc ccccgcgtgg gagctaacct tggtagctag 600 ctgcagtgcc ctgcgacagc ggcctcgaag agctgcagag cgtcagcctg cttcgacctc 660 gcttgttctc atttcaacct actccgtcct gtggaatcaa aagagcccca ctaacaatta 720 cattcataaa atctagcaag aacgaagaga agcgaccaac ttttaatcca taacttttag 780 atctttttat ttattactgt ttttaagccc taagcatatt gccttttttt agatcttata 840 attataaaaa tagattcaaa gttaacacca ccaggccgct acagagcatt ttatttaatc 900 aatttgttac cgacctcctg ctgcttcttt ttactttctc cagactacta cccggataca 960 acccttggaa acgagaggaa tggccggtca caagatcaca atgtcggagg ggaagctgct 1020 ggaggttgct gtacggtatg gtggtgtgag gaatgtctcg tacgagtgtc cagtcccgga 1080 ttgcacaaaa accttctcac aggcaaataa cttgatacgg catttaaata actttggcaa 1140 tacaaaacac cgagcacaca atttcaccta ttttttcacc tgtgagaagt gcaaaatcca 1200 aattcacagc aacacaaaac ataatatttc aaatcattac aaacagtgtt gtgcaaccgg 1260 cggcggaccc tcgtgcgaga cgggtcaata cttctgccct gcctgcgagc aagcagggct 1320 ggggaacctg gagtcggcac tccgccattt tcaatcctcc caccctgaat ttaatctacc 1380 cccccggtct caattttcaa aatctcatcc caatagttat accctatcct taaaacctaa 1440 agaccacctt atgaaaatac tttacagtgg accactgact ccggggcaat tggtatgccc 1500 cattaagatc tgtctcaggt cctcagctgc acgtcttttc catgacgtgt ccaagctgag 1560 gaagcacatg ttggttgatc acaaccgaac ccttgtgtac gagacaacat gtggcaaatg 1620 cttgcggcct gtcgacacct cgaaaaatat gcggaaaacg acgtcgcact tcgaaaagtg 1680 ttctggcgaa tcctttatct catccccctc acccataccc caaaagactt ataaactcga 1740 cttaccctca accagcaccc ctcccccccg taaatctccc aaattacaac cctacaaacc 1800 aattcgaacc tttaagaacc ccctcaccaa gtccagtcaa tccaaatccg acaacccacc 1860 caaacctacc ccttttttct cgccaagaac acttgagagg tcggcgtcgt ggcccgcatt 1920 gagcgaggtt gtcgacccac tgcccaagct caaggaaaaa cacccctcgc tgccatgcgc 1980 cctggataag tgcccaccct caccccggat caaaccctcg accctagtcc ctccctgcca 2040 tacagcaaac aatagtccca aaccaacttc acccgaatca ccctctaccc taaaaccctt 2100 accccgcccc atccgaccat ctaaaccctt ggaggattgg ttaactgtgc ggagcgtggg 2160 accggatcgg gagatcgtcc tgaacatcgg tccgagaccc cgacccggcc ccgcagcagg 2220 ctccaggaca acatctcccc cgtcaacggc acccgcgaaa agggttgctg caaacccaat 2280 cgcagcaccc ttgagcgggg agccgggtgc gacccttgat tgcggccaga ccgggcgaaa 2340 ggttcagccg ccaaaaaaga gaccgacaga gagtgctggg tccctccctc ctccggccga 2400 gccagccacc gatctcctta cagggagaga ggggctggcg aggctggttg aggaatatca 2460 tctgtcgggg gactttggcg ctttctgtcg ggatctcgag cgatggacgg cgctttcgtc 2520 gaccaaccga aggccaaagc cgaggcgcgg caggtacaac cgcggtgcgg ctgcccgtgc 2580 caccaggaac cggggccgag atgatcgtca agatccacaa gatcgagacg atcaaggagg 2640 ccccggtccg gtgacatgcg ggcggccgca aagatacaag cgggccgccg cgctgaggtc 2700 ggccttcggg agggacatga aggcgaccgt ccgtcgcatc atagacggag aacggggcga 2760 cgctcgctgt gaaatcgacc ccaaaaccat cgagggtcga tttcgcgacg agctgtcgcc 2820 gccggttcgg gaagggccgg agtgctcgtt gcccccttgg atggctgaag cccaggcagg 2880 tgagcatgcg ccatcgaatg atagccaacc cggcgatgca tacgatgggc caatcacggc 2940 gctggaggtg gagatggtcc tcagtaccct gaacgtcggc tctgccccgg gatcggacgg 3000 cctctcctat ggattttgga gagctctaga cccgaaagga ctcgtccttt cggagctctt 3060 cgaagtctgt aggatcgagc gccgggtccc ggggccatgg aagagcagcc gggtcaccct 3120 catctgcaaa gatgcagagg gtgacctcga cgatttgggg aactggcggc ccatctccat 3180 ctgccagacg gtgtataaga tctacgccgc cgtgttggcc cgtcgactgc agagctgggc 3240 gcttgacggt ggcgtcatct cccgaagtca aaagggcttc atgccgttcg agggggtata 3300 cgagcatgta ttcctcctcg actcggtcgt cgctgacgcg cgcgcgaccc ggaggtcgct 3360 ggccgtgtgc tggctngacc tccggaacgc gttcggcagc gtcgaccaca ccaccatagt 3420 cgaggcgttg tctcgcttcg gcgcccccgc gggcctcgtg gagatgatat cggacatcta 3480 tacgggtggg tcctgccgaa ttagaactcg tgcgggattc accccggaca tccctgtcgg 3540 tcgtggagtg cgacaggggt gtcccttgtc cggtatcatc ttcaacctcg tgatggaggt 3600 cctgctgcgg ggcgtcgaag cgaacaacgc ttgtggatac cggctctcct gcgccggcgg 3660 cgcgtccgtc agggtgctcg cgtacgccga cgatgtggct ctggtgggct cttccagggc 3720 cgagatnaag atccagctgg gtgtgtgcga gcggtttgcc gcctgggccg gtttttcttt 3780 taacaacaag aagtgcgccg ccatggtact gaaacatcag agagggggtc ggaggctctt 3840 ggactcggcg cctcttcgcc tttgcggtga agaggtggcg atcctgggcc ccgactcctt 3900 ctacaagtac ctgggggcgc ataccggcta cgggcggcaa acgggtggac agcttgtcga 3960 tcgagtcgaa aggcaggtcg tcaggctgtt cacctcgttc ctcaccccca cccaaaaact 4020 ctcggccctg aagagaatag tcctgcccgc catgagcttc catctccggg tccggccctg 4080 cgccgagggg catctccggc gccttgacaa cacagtgagg cgctgtgtga agacagcgtt 4140 gcgcctgccg aaggggtcgt gtcgggcttt tttccacacg tcccccgacg cagggggcct 4200 ggggatcacg tcggttgtcg ccgagtgcga catactgacg gtgacccagg ccttcaagat 4260 gctgtcttca ccggaccatc tcgtctcgct agttgccaag ggccgtctgg ggatgcacgc 4320 cgcgcgcatg ggccggtccg agacggcgtc cgcttgcgcc atggcggact acctgagcgg 4380 ggactcggta atggggcaca ngtcgtggaa aaccggatac aggatgccgg ccgatctctg 4440 gacagccact cgagctgcca gccggcgcct gtccctccgg ttttccccgc agccccaagg 4500 cgaattcggc ctcgagtcgg gcactttcaa gatcgccccc agggagcggc gctccttgac 4560 ccgaaggctg caccacaggc aaaacctgtg gtggcggaac cagtgggcgg cccttcccaa 4620 ccaagggaag accgtcgccg cccactccgc ctacgcagcc tccaacaact gggtcaaggg 4680 cccgtcctcc ctggcccctc aggccctgtt cttcggcctt aaagcgcggc tgaaccagat 4740 gccaacgcgc tcggtcaagg cctgctactc aagggcgccg aactacgaca agtcctgtcg 4800 taggtgtggc gcggaggtgg agaccctccc gcacgtgctg aatcattgcc ccaagtccat 4860 gaaatcgatc ttggagcggc atgattcggt gcttgcggag gtcctcgccg ccatccctcg 4920 cggcacattc gccagtgtcg acgtcgacag gacgtcccga gaacatttcc ggcgagtggg 4980 cgaagctctg cgacccgaca tagtcgcccg tcgacatgac ggatccgtcg tggtcgcaga 5040 cgtgacgtgc ccattcgagt cctgcgcgtc ggccctcgat acggcggccg cgcgaaaaat 5100 cgaaaaatac gaccagctgt gtgcgaactt gcggcagtta taccgtaagc ccgtcgagtc 5160 gcacgcactg gtcgtaggtt ccttgggcag ctggggcagg accaacaaca ctgctctggc 5220 tgcactcgga atccgaggcg cggttcgctc gaggttggcc aagcaattgg tcaacctcag 5280 cgtcgagggt agccacaaca tctggctacg ctggtccggc ggcatcccaa aggacctggt 5340 cagataagat ccgcggctgt ggcgccgaac gagcacctgc ccattcttct tgtagggact 5400 ttttcaccct cactcccccc aatagttttt tttcgttttt tcgttttttc acccccaccc 5460 cactcgcctc tgggctgcac atcccacacg tagggacctg tttatattat ttgcctttta 5520 tatgtaccac tttttaaata tatttttgta ccccacaaga tgcttttcgc caaaaaaaaa 5580 aaatttttgt atcacatttt tatattttgt aaaacacaga tttttataaa ctttgcacta 5640 tttttatata aacttcgcac ttatttaaaa tgaatcgcat cttttttata tacaccaaca 5700 caaacaggat gtgcagctca ggggaaccaa tcctgcgtcc ctcctagcgg cgggaggggc 5760 gcccacctac ccccacgctc ctcttgagca ccaacaggga ctcccttccg gagcccctgc 5820 accctcaact tttcttattt ttaaaaaaaa aaaatcatat atattgatct tgacgacggg 5880 ggctacattc agcccccaaa aacccaccca ccatccccaa cgagtgccgg ggcattgaag 5940 agctccggca caattagcac ttagcttatt tattattttt tgtcaacatt tttgtttttt 6000 caaatttttt cacccctcac ccccacccta ataggtccct cgggcttggg cccctttttc 6060 gtgctcgaga agcgtcacat cgccccactg accacgacct tccccgacat tggagtcctt 6120 ggcgtctccc aggtcgaaac agtcccaagt gatagcacct aatgctcgac ttgtttcggc 6180 ctgggccgcc gaggattccc agaacgacca ttcttctaaa taatatttat atttcagaat 6240 aaaactatat atatatcgtt ggcgggactt gtcccgcctc gataccgagt gctgcagagc 6300 ggcaaaataa agaagaaacc gacgtcgctc tgcagccaag gaccacccaa actcaagcca 6360 gcaccgtcga caaccaacat cctcaagtcg gcggttgctg gaacaactca taacatcttc 6420 aanataaatt atcaccctgt gcagcaggag gccgtgcttt taaaactact ctgtagtggc 6480 tcatgataat atttcgctcc ttttttgccc cgtgtaaact tagtngatgc gaataaaatc 6540 agttgaatca 6550 // ID BEL-640_AA-I repbase; DNA; INV; 5935 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-640_AA_; KW BEL-640_AA-LTR; Pao_Bel_Ele3; BEL-640_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5935 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4995-5543] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 20..4510 FT /product="BEL-640_AA-I_1p" FT /translation="MPKSLDLQPPQVDEVSCITCNRLDVTGNFVQCDACDA FT WCHFSCAGVDASVSERVWVCAKCRAAPSISGKSGHSSKTGSSLARELERLK FT QQQEIELKRANLLLQMKFLDQQQELLNKAATTDETKDRSSNIDQEANTQRM FT KDWVNHTIGDEEEGAVGGLSQQAASPTDQGNQQLHPERPDATKQVPAASLA FT IATSTDRMPTHTDVAELRAQLETCIRRMEEGFHVNQQTKPQPPPLTLPSTM FT QPGPSTGALPKTYPHQMDQAGKRPAISYAQNKNSTSFFDISPPANNTRNPP FT SHFESLPVGTYSNPYDRVPNMNRSHSTRQELPPVIPPRVTHAEPQLGPNAQ FT QLVARQSLARDLPVFSGDPAEWPIFISSYNYTTAACGYTDGENMIRLQRRL FT RGSALESVRSRLVIPSTVPQVINALQMRYGRPELLINALLQKVRSIPAPRA FT DRLEGLVEYGTAIQALCDHIEAANELSHLANPTLLQELVAKLPSDQKMMWA FT GFRRGVNNVDLRTFCNYMQQVVEDATSVLSFESEERRHIGKEHGRERMKQK FT RFLNSHTSGREGTFEVPAPKPTDKIECVSCGKLGHRIRECLEFKSLSVDNR FT WRKIRSLGVCQNCLFRHGRRSCRMNTRCGTDGCQHKHHPLLHSVQRPSNAI FT STQLAENHTHRRMTSSILFRIIPVTLYGDTGRVNTFAFFDEGSSVTMVEQD FT LMTKLGVTGSPAPLCLRWTANTCRTEKNSQIVSISISGVDKQQKYKLVGAQ FT TVESLNLPKQSFHYDEAVKRFDYLKNLPLRSYQNATPGILIGVDNLHLAVP FT LKIQEGDVRGPVAAKTRLGWCVYGQQREGDREAYSFHVCECTPNETLHDTV FT KTFFAVEEVGSSKPILSIEDQRALQLLQNTTRRVGERYETGLLWRTDYVEL FT PDSYPMAVSRLKCLEHRMEKDSSLKLKIHQKVQEYVDKGYAHVATEDEIAN FT ADPRRVWFLPLGAVTNPKKPNKVRIIWDAAAKVDGSSLNSHLIKGPDQLTA FT LPSVLSRFRQFGIAVSADIQEMFHQILIKPEDKHSQRFLFRFDTEQPPSQY FT VMDVATFGATCSPASAQFVKNLNAQEHSYQYPQAAQRIVDNHYVDDYLDSF FT EDEDEAKQITEQVRLVHRNGGFNLRNWNANSDAVLQYLDEPAVNGDKNINL FT INKEHTERVLGMLWVTKTDELRFATQMSQEVCALIETSTRPTKRQVLRCVM FT TLFDPLGLLATFLIHGKVLIQELWKTGIDWDEKVTDDQFIRWRDWVQMIQF FT IDMIRIPRCYFSSATQQTYENAQLHVFVDASPVAYCCVVYIRVTDQEGYGR FT CSLAAAKAKVAPLKPMSVPRLELQGCVLGVRMLKFVEENHSISFSKRFLWT FT DSTTALSWIKSDPSNFRPFVAHRIVEIQDSTSVDEWNYVPSLYNPADEGTK FT WGEGPYFSHESKWFIGPRFLTYSEEYWPSMPTVSIEDKEERRISVMNHFSV FT EPLIHFERFSKWEKLQRTVAYVLRFVSNIKPK" XX SQ Sequence 5935 BP; 1671 A; 1393 C; 1459 G; 1367 T; 45 other; attcttcaaa gaatttaata tgcccaagtc gctggatcta caacctcctc aggttgatga 60 agtaagctgt ataacctgca ataggctcga tgtaaccggc aacttcgttc agtgtgacgc 120 ctgtgatgcc tggtgtcact tttcatgtgc tggcgtcgat gcatcagtga gcgaacgcgt 180 gtgggtgtgt gctaaatgcc gcgccgctcc cagcatatct ggaaaaagtg ggcactccag 240 caaaaccggt tcatctcttg ctcgagagct cgaacgtttg aagcagcagc aagagatcga 300 gttgaaacga gcgaacctgt tattgcaaat gaaatttctg gatcaacaac aggaactgtt 360 gaataaagcg gcaacgacag acgagacgaa ggatcgatcg agcaacatag atcaggaagc 420 aaatacgcaa cggatgaaag actgggtgaa tcacaccatt ggcgatgaag aagaaggcgc 480 agttggaggt ttatctcaac aagctgcctc tccaacagac caaggtaacc aacagcttca 540 cccggagcgt cctgacgcta ctaagcaagt cccggcagca tctttggcca tcgccacatc 600 aaccgaccgg atgccgactc acacagacgt agcagaactt cgagctcaac tggagacctg 660 cattaggcgt atggaagaag gcttccatgt taaccagcag acaaaaccgc agcccccacc 720 gttgactctt ccatcaacca tgcagccagg tccaagtaca ggagccctcc ctaaaacgta 780 tccacaccaa atggaccaag caggtaagcg tccggccatt tcctacgctc aaaataaaaa 840 cagcactagt tttttcgata tcagtcctcc cgctaacaat accagaaatc ctccttctca 900 ctttgagtct ctgccagttg gcacatactc gaatccatac gatcgtgttc ccaatatgaa 960 ccgatctcac tcgacacgtc aagagcttcc ccccgtcatt ccccctagag tcacccatgc 1020 tgagccacag ctaggaccta acgcccagca acttgtagcg cgacaatcgc tggccagaga 1080 tcttcccgta ttcagcgggg acccagcaga atggccaatt tttatctcca gttacaacta 1140 tacgacggca gcatgcggct ataccgatgg agaaaatatg atacgtttgc agaggcgttt 1200 aagagggagt gcactggaga gtgtaaggag ccgcttagtg ataccatcaa ccgtacctca 1260 agtcatcaat gcgttgcaaa tgcgctacgg tcgcccggag cttcttatta acgcacttct 1320 tcagaaggtg cgatcgattc ctgcgccaag ggccgatcga ctagaaggac tcgtcgaata 1380 tggaacagca atacaagcgt tgtgcgatca catcgaggct gcgaacgaac tcagtcatct 1440 agcaaaccct acgctactac aggagttggt agccaagttg ccgtcggatc aaaagatgat 1500 gtgggccgga ttccgacgtg gtgtcaataa cgtggatttg cgcaccttct gcaattacat 1560 gcaacaagtc gtcgaggacg ctacgagtgt gctttcgttt gaatcggagg aaaggcggca 1620 tatcgggaaa gagcacggac gagagaggat gaagcagaaa cgttttctca attcacacac 1680 ttcgggccgt gaaggaacat ttgaagtacc agctccgaaa ccgaccgata agatcgaatg 1740 tgtcagctgc ggcaaattag gtcatcgtat tcgagaatgt cttgaattca aatcgcttag 1800 cgtggataac cgatggcgaa aaatacgctc attaggagtc tgtcagaact gtctgttcag 1860 gcatggtcga cgctcgtgcc ggatgaatac acgctgtgga accgatggat gccaacataa 1920 acatcatccg ctcctgcatt cggttcaacg gccatcgaat gctatttcta cccaactagc 1980 agaaaatcat acgcaccgtc gaatgacttc aagtattctc ttcagaatta ttcccgtcac 2040 actatatggt gataccggaa gagtgaatac ctttgcattc tttgatgaag ggtcatctgt 2100 tacgatggtt gagcaggatc taatgaccaa gttgggagtt accgggagcc ccgcaccgct 2160 ctgtcttcgc tggacggcca acacgtgccg aactgagaaa aactcccaga ttgtgtctat 2220 ttccatttct ggtgttgata agcaacaaaa gtacaaatta gttggagccc aaacggtaga 2280 atccttaaat ttaccaaagc aaagcttcca ctacgatgag gccgtaaagc gcttcgatta 2340 tttgaaaaat cttccgttac gaagctatca aaatgctaca cctggcattt tgatcggagt 2400 agataactta cacctcgccg tgcctttgaa aatacaggaa ggagatgtgc gcggaccagt 2460 agctgcgaag acaaggcttg ggtggtgtgt ctacggccaa cagagagaag gwgatcggga 2520 agcctatagt tttcatgtat gtgaatgcac acccaacgaa acgcttcatg acacggtgaa 2580 aacttttttt gcggttgaag aagttggatc ttcgaaaccc attctctcga tagaagacca 2640 gcgtgctcta caacttctgc agaatacaac gcgtagagtt ggcgaacggt acgaaacggg 2700 actgctatgg cggacagatt acgttgaact accggacagc tatcccatgg ccgtcagcag 2760 gctgaaatgt ttggaacacc gtatggagaa agactcgagt ctgaagctga aaattcatca 2820 gaaagtgcaa gagtatgttg acaaagggta cgctcacgta gcaaccgaag acgaaattgc 2880 taatgccgat cccagaagag tctggtttct accattggga gctgtaacaa atcctaaaaa 2940 accgaataag gtacgcatca tatgggacgc agcagcaaaa gttgatgggt cgtcactgaa 3000 cagtcatcta ataaaggggc cagatcagtt aactgcgctc ccatctgtgc tctctcgctt 3060 tcgacaattt ggtatagcgg taagcgccga tatacaagag atgtttcatc aaatactgat 3120 aaaaccagaa gacaaacatt ctcaaagatt tctgtttcgc tttgatacag aacagcctcc 3180 gtcgcaatat gttatggacg tggctacctt cggggccact tgctctccgg catcagcaca 3240 gttcgttaag aacttaaatg cccaagagca ctcatatcag taccctcagg cagcccagcg 3300 catagtcgac aaccattatg tcgatgacta tctggatagt ttcgaagatg aggatgaagc 3360 taaacagatc accgaacaag tacggctagt tcaccgcaac ggcgggttca accttcgaaa 3420 ctggaatgcc aacagcgatg cagtgttaca gtatctcgat gagccagctg ttaacggcga 3480 taagaatatc aacctgatta acaaggagca tactgaacga gttctgggaa tgttgtgggt 3540 aacaaaaacg gatgaattac ggtttgcaac tcagatgagt caagaggttt gcgcactgat 3600 agaaacctct acacgtccca cgaaacgtca agtgctccgc tgtgtaatga cacttttcga 3660 ccccctagga ctgctcgcga ccttcttaat ccatgggaag gtactgatcc aagaattgtg 3720 gaaaacagga atcgactggg acgagaaagt tacagacgat cagttcatcc gatggagaga 3780 ctgggtacaa atgatccagt tcatagacat gattcgtata ccaagatgct acttttcatc 3840 ggcaacgcaa cagacctacg aaaatgctca gctacacgtc tttgtggacg cgagtcctgt 3900 tgcctattgc tgcgtcgttt acattcgtgt gacagatcaa gaaggatatg gacgttgttc 3960 actggcggct gcgaaggcaa aagtggcgcc attaaaacct atgtcggtac ccaggctaga 4020 actgcaggga tgtgttcttg gtgtgagaat gttgaagttc gttgaagaaa atcactctat 4080 ctccttttcg aagcggttct tgtggactga tagcacaacg gctctttcat ggataaaatc 4140 ggatccaagt aacttccgac catttgttgc ccatcggatt gtggaaatcc aagactctac 4200 aagtgtagat gagtggaact acgtgccgtc gttatataac ccagcggacg aaggaaccaa 4260 atggggtgaa ggtccttatt ttagccatga aagcaaatgg tttattggac cgcggtttct 4320 cacctactcc gaagaatact ggccaagtat gccaacggta tccatcgaag ataaagaaga 4380 gcggcgcatt tccgtgatga atcatttctc tgtagaaccg ctgatccact ttgaacgatt 4440 ctcaaaatgg gagaaactac aacgaacggt agcatacgtg ctgcgatttg taagtaatat 4500 caaaccaaag taagatggtt ggtgagttac aacaacagga gcttcaggcg tccgaagcat 4560 tcatctttag gcaagtgcaa tggacgtttt ttccagaaga gatggcgatg cttacccaga 4620 agactgcaat gaaggacaat cttcaaacga ttccaagaac gagcaaactg tatcagctgg 4680 tcccgacatt agatgaacga ggatttatgc gccagaatgg gcgcattgga gcagccaaca 4740 tgcctcgtat aacatgcggt atccggtaat tttaccgaaa gatggcatag taactatgct 4800 aatagtggac aaatatcacg kwacctamaa acacgcgaat ccagaamctg tggwaaawga 4860 aatcsgccaa aattcgaaat cctmkacttc gatcgatcgt twgacgttat gggcggcagt 4920 gtcaatattg caagcttckk gaagcacaac cagtgattcc gccgatggct cctcttcctc 4980 cggcacgcct cgcagcgtat tctaggccwt tcagctacgt tggactggat tgttttggcc 5040 cactgttggt caaacaaggt cggacaaamg tkaaaaggtg gatcgcactc ttcacgtgcc 5100 ttactgtgck ckctgtccac cttgaagtkg tgaatagtct gacgacctca tcctgcgttt 5160 cggcggtgck wagatttatt ggacgacgtg gtgcaccawt mgagttctat agcgacaatg 5220 ggacgaattt tcaaggcgct gaacggttgt tgcgagaaca aatcgagcag gaattgtcat 5280 cgaccttcac tmgckcgmga acaaaatggt tcttcattcc tccwkgtgca mcacacatgg 5340 gtggcgcttg ggagagaatg gtgcaatcag tgaasgcagc tatgggcgaa gcatatgggg 5400 acgggaagct tgacgaagaa ggtttatgga ctctggtast tsaggctgag agcgtakwga 5460 attccakgcc attgacgtwt ttgccgctag actckgaaga gtccgaggca tcgactccwa 5520 accakttttt gcttggsagc tctagtggag ttaaggaacc tggtcgtgac atacatgatc 5580 aacctcgcgc tctgcagaas acctttgttg atattcakgg ccaactkaac tctttcwggg 5640 cgcgctkgct gcgcgaatac ctgcccgtca tccgacggca gccgaagtgg ttcggagaag 5700 ctgctaggga gataaaggaa ggtgacttgg ttctgattgc cgaggatgga aaacgatgcc 5760 agtggcccag aggtcgtgtc cagaatgtta ttcgtggacc agatggtaag atacgtcaag 5820 caatcctgcg taccgccaat gcggtattac gacggcctgt aacgaagata gccttgctag 5880 atgttgggga cagtagtgca gtctcaactg acgtgaagat gcacccggcg gagga 5935 // ID BEL-40_AA-LTR repbase; DNA; INV; 449 BP. XX AC AAGE02018633; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-40_AA_; KW BEL-40_AA-I; BEL-40_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-449 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018633; Positions 10340 10788. XX SQ Sequence 449 BP; 108 A; 136 C; 91 G; 114 T; 0 other; tgttccgtct cccgacggaa gcgcaaatca tgtaccaccc tattcccatg catacccatc 60 accaccctat tgccaatcag tgagcgacac gcgttcgtca gcgtggtcga caccgaaacg 120 tatgtcggtt acattttctc accactttct gctgaagcca accgcagtcc caacggctgc 180 atcagttcac tttcactcga cgccgagaac gcatcgtttg cgtagaaaca gtaaatcatt 240 attaaactaa tgtaaagaga agagaataaa gtttgtagtt aagcgtagcc gtccgcgttc 300 aattccttcg catcccaaga aaagttggcc cttgccaatc ctcccgcgga agtgatcctt 360 tcgccgcgaa gtgttttgtt ccgccgtgtt ttccccccaa agtgtattac agtccgctgc 420 ctttcgatcc accctgctgg gctcgaaca 449 // ID Gypsy-14_CQ-I repbase; DNA; INV; 5776 BP. XX AC AAWU01010358; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_CQ_; KW Gypsy-14_CQ-LTR; Gypsy-14_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5776 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 407-407 (2011). XX DR GenBank; AAWU01010358; Positions 16893 11118. XX CC Positions [4733-5209] - Integrase core CC 'GATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 931..2424 FT /product="Gypsy-14_CQ-I_1p" FT /translation="MTTVCFSIINRFVEMYAPGRYVRAPSVSPDREDDHDV FT GKFVRAPNQALFDRSRRPDHGSGEPREVDPYAHYDTPWETAQQVQAELNAR FT VRQLEDIIKELSQTRSSTLGGSPAADAVNQDDPPVPAEGNLTVAVYGAPVT FT PVGKTHGSISQREEPRNDGSSGDGFRGSTAGTLPATPSEAAVRWDNIRAFP FT KDVPATKMWEAWTLFIEDFEMAATLSSVKSSRRRAELLLVSMGSQLKSITR FT AAQLLPDLDSDRCYEEFVDNVGRYLKSLTDPAAEHEEFTNMVQQEGEAAIY FT FHARLAEKVRLCDYPGGRESEDRFVRAQLMRGLRNQELKKAARTYGLDTNN FT IVTSATRAEAFRDERPPATPDTNVFAIDKPQEHRGPSKRKYEDQRRPMKRF FT KQEDSTFNQGRSQFNQGGARFNQGRSMNNQSRAPIRRRSRCSRCWKQIHNE FT GEDCPAAGRTCFACGTVGHFAIACHQREAYSVEDAGRTTGERAAKKKDQVN FT FV" FT CDS 2729..5677 FT /product="Gypsy-14_CQ-I_2p" FT /translation="MRVDRSFKATIEVCGPNKAAIEAEFLVVEEGRRSLLG FT RETASDLNLLKVGCAINSCENPDQFPKMPGVKISFSVDRSVPPVRNAYYNI FT PAAYRDKARQRLAEMERQGIIEKVTSAPQWISGMSAVPKGQDDFRLVVNMR FT APNKAVLREYFRLPLMDEMRVKLHGAEYFTKLDLQSAYYHLELCEDSRDLT FT TFLSESGMFRFCRLMFGVNAAPEIFQREMVRIFKDVKNIIIYIDDLLIFAK FT TIDELRKTTAEVLKILRANNLTLNTAKCKFDQTQVTFLGHELSAKGFNIEQ FT SKISSIQKFRQPSTASELRSFLGLASFVSPYIKNFAGMSGPLWAVVNDNKW FT HWGEEQQTAFEAVKENIIQSTLSLGYFSDCDKTILYTDASPYALGAVLVPE FT NDNSTPRIISFASKALSPTERKYAQNQREALGAIWAVEHFSYFLLGRHFVL FT RTDAQGMSFILDRPREESKRALTRADGWALRLSPYRYQVQFVKGRDNIADP FT SSRLYQGEDDEFDERANSWEVCQLELNRVEFLTEDEIKECTAKDDTLQKVL FT FHGGQIQFSKVSFKTVILFQVIDSLASGKWSNELLRFKAIASDLHYSDGML FT SKNGCAVLPANLRESALEVAHAGHPLEAKFKSILRKRVWWPGMAGDAEKWV FT KSCSTCAVNGRPEKPTPMQRSFAPKAVWDTVAVDFNGPYARLGGILILVII FT DLRSRYAMAYPVKSTGFEHTRTVLDAIFGREGFPRAMKSDNGPPFNGEDYA FT NYCSERGIQTVFSTPFFPQQNGLVECFMKNVNKAMSTALSTGSDYRKELQA FT TVQAYNAADHTVTKVPPEEVFSGRKIKRGLPLLLRGRTEHDDLVFDARDHE FT AKIKSKTREDTRRGAKRSTVKPGDAVLIERQSKAKGETRFDPKRYTVMEER FT NGSLVVCDPDGRQLRRHVTQTKKVQQWREPRPIIQEAQRSDELGCHDQEVK FT EIDQQRPRRTVKIPSHLSDYIRAVEEEF" XX SQ Sequence 5776 BP; 1571 A; 1349 C; 1590 G; 1266 T; 0 other; ttttggcgac cgtgacaggt ggtttaaata gtaatttaaa ccgcaaagat tattcttcgg 60 cgttttcctt ttaaaaacta tcacgcggga aatcacccac gcgcggggta agtggaataa 120 accaaatttt gccaagagat tttttttcaa aatggcggca ttgtggacaa acaagcgcgt 180 ggtgtggatg aattcaaatg ttccagaagc tctattgaac agcgaaagtg taatggattt 240 tttttacctt tctggcaaaa aaaaatcgct caacgcttgg atgtgtgggt gctgagaagg 300 tgagagcttc tcgtgaggtt cgtgaaccac gagggaaaag cgaagtttga gaaccgcttg 360 gatgtgtgtg tgctgggatg gtgaggtcac gcggaccacc aaacatcaac tcgtgaggtt 420 tgagaaccac gaaaaaaaaa agcgaggttc gagaaccgct tgaatgtgtg cccgccgtag 480 aggtgaggtc gcgaggacca ccgaacggct cttcgtgaag tttaaaaaaa atcgggtcaa 540 aagaagcgaa gtttgagaac cgcttgaacg tatgtgtgta caagagggct aagatcgcgc 600 atcgaacagt attttgtgaa gtttgagaac cacgaacaaa gcgaagcttt gacaaccact 660 caaatgtgtg tgccgttgct cgaaccacca aacggctgct tgctcttcaa cagtcaaagc 720 ggagctttgg tagtattggt gagagcttga gatttctcta atagagatgt ttgagtcttg 780 ttgacacagt cgcctaaaaa gtgtttgagg ttgacgcgtg gtaaagtacc agtgttttga 840 gcctactgct tttgttacgt ttaacacaat ttggcagaaa aataggaaga ctgttggttc 900 ggtcttggca acttcaacga cacggctgaa atgacgacgg tttgtttttc tattatcaac 960 agattcgtag agatgtacgc acctggaagg tacgttcgcg caccgtcagt atcacctgac 1020 cgcgaggatg accacgacgt tggaaagttc gtgcgggcgc cgaaccaggc cctcttcgat 1080 cgcagtcgtc gtcccgacca cggttcgggt gaaccccggg aggtcgatcc atatgctcac 1140 tacgacactc cgtgggagac agcccaacag gttcaagcag agttgaatgc cagggtgcgt 1200 cagctggaag atattatcaa ggagctgagt caaactcgca gtagcacgct tggcggtagt 1260 ccggcggcgg atgcggtcaa ccaggatgac ccaccggttc cggcagaggg gaatcttaca 1320 gtcgccgtgt acggtgctcc ggtcacccca gttgggaaaa ctcatggctc gatttctcag 1380 cgagaagaac ctcggaacga tggatcatcg ggtgacggct tccggggcag caccgctggt 1440 acccttcctg caacgccgtc ggaggccgca gttcgttggg ataatatcag agcgtttcct 1500 aaagacgttc ccgctacaaa gatgtgggag gcctggaccc tgtttattga ggattttgag 1560 atggctgcaa cgctctctag tgtaaaaagc tcaagacgac gtgccgagct acttctcgtg 1620 tctatgggat cccagttgaa gagcatcacc cgcgcagcac agctgctccc ggacctggac 1680 agtgacagat gttacgaaga gttcgtggac aacgttggga gatacctcaa atcacttacg 1740 gacccagcag ctgaacatga ggagttcacc aacatggtgc aacaggaggg ggaggcagcc 1800 atctattttc acgcgcggct ggcggaaaag gtacgattgt gcgattaccc aggtggccga 1860 gaaagcgagg accgtttcgt gcgagctcag ctaatgagag gactccggaa ccaagagctg 1920 aaaaaggcag cacgcacata tggcctggac acgaacaaca tcgtgacttc agcgactcgc 1980 gccgaagcct tccgtgatga acgtccaccg gcaacacccg acacgaacgt tttcgcgatc 2040 gacaagcctc aagaacatcg aggtccatcc aagcggaagt acgaagacca acgtaggccg 2100 atgaaacgtt tcaagcagga agactcaacg tttaaccagg gacggtcgca gttcaaccaa 2160 ggtggagccc gtttcaacca gggaagatcc atgaacaatc agtcaagggc gccaatcaga 2220 cgccgtagca gatgtagcag atgctggaaa cagattcaca atgagggaga agactgccca 2280 gcggccggaa ggacatgttt cgcttgcggc acggttgggc attttgcgat tgcctgccac 2340 caacgtgagg cgtactcggt tgaggacgca gggcgcacca cgggagagcg agctgcaaag 2400 aagaaggatc aggtaaactt cgtttaaatt aagattgatt tgaatacaat gtcacttcag 2460 tgcaagacaa acgatttgtt ttttttcttg attgaattct cctcttcagc aagtcaacgc 2520 gctttcactc caggacgtac ttgttgactg ccgcattggc tcgtcccacc ccataaaatt 2580 cctcatcgat tccggttctg acgctaatgt cgtgggcggt gcagactggt cggttctgga 2640 gaagcagttt gaacgagggg agattgaact gaccccggca aaatgcacaa acgacaggaa 2700 cttgcaagct tatgcgtcca acaagcccat gagagtagac cgaagtttca aggcaacaat 2760 cgaagtttgc ggacccaaca aagctgcgat cgaggccgag ttcctggtgg ttgaagaagg 2820 gaggcgttca ctgcttggta gagaaacggc cagcgacctg aatctgctga aggtgggatg 2880 tgcaataaac agctgcgaga atccggatca gttcccaaaa atgccaggag tgaagataag 2940 ctttagtgtt gacagatctg taccgcctgt gcgcaacgcc tactacaaca tcccagctgc 3000 gtatcgcgac aaagcacgac agagacttgc agagatggag cgacaaggaa taatagaaaa 3060 ggtgacttca gccccacagt ggataagcgg tatgtcggcg gtcccgaaag gacaagacga 3120 tttccgcctg gtcgtcaaca tgcgagcacc aaacaaagcg gtcctccggg agtatttccg 3180 cttaccgctg atggacgaaa tgagagtaaa gctgcatggt gctgaatact tcacaaaact 3240 tgacctacaa agcgcctact atcatcttga actgtgcgag gactcacgtg atctgacgac 3300 gttcctatcg gagagtggca tgtttcgttt ctgcagacta atgttcggtg tgaatgctgc 3360 gccagaaatt ttccaacgcg agatggtacg aatcttcaag gacgtcaaga atatcatcat 3420 ttacattgat gatcttctga ttttcgctaa aacgattgac gagttgcgga aaactaccgc 3480 agaggtcttg aagatactac gagcgaacaa tcttacgctc aacactgcca agtgcaagtt 3540 tgaccagacg caagtcacgt ttctgggtca tgagttgagc gcgaaagggt tcaacatcga 3600 gcaatccaag atcagcagca tccagaaatt cagacagcca tcgacagcat ctgaactgcg 3660 gagtttcctg gggctcgcct ctttcgtaag cccgtatata aagaacttcg ccgggatgtc 3720 cggccccctg tgggcggtgg tcaacgacaa caagtggcat tggggcgaag aacagcagac 3780 cgcattcgag gcggtgaagg agaacataat tcaatcgacg ctttcgctgg gatacttctc 3840 tgactgcgac aaaaccatcc tgtacacgga cgcttcacca tacgctcttg gtgctgtact 3900 agtcccagaa aacgataatt cgacaccccg gattatcagt tttgcgtcta aggccctttc 3960 tccaactgaa agaaagtacg cacagaatca gcgagaggcg ttgggagcaa tatgggcggt 4020 ggagcatttc tcttatttct tgctgggaag acatttcgtt ctacggacag atgcacaagg 4080 aatgtctttc atactcgacc ggccacgtga ggaatcgaag agagcgctga caagggccga 4140 tggctgggcg ctacggctca gtccgtaccg ataccaagtt caattcgtta agggacgtga 4200 caatattgcg gacccctcgt caaggctgta ccagggagag gacgacgaat ttgatgagag 4260 ggcgaactct tgggaggtgt gccaactcga gctgaacaga gtggaattcc tgactgagga 4320 cgagatcaag gagtgtactg caaaagatga caccctgcaa aaggtacttt tccacggagg 4380 ccaaattcaa tttagtaagg tatcattcaa aactgtgatt ctttttcagg tgattgactc 4440 gctggcctca gggaagtggt cgaacgagtt gttaagattt aaagccatcg cgagcgatct 4500 tcactacagt gatggaatgt tgagcaaaaa tggttgtgcg gtgcttccag cgaaccttcg 4560 tgaatcagcc ctagaagtgg cgcacgcagg acatcccctt gaagcaaaat tcaagagcat 4620 cctgcgcaaa cgcgtgtggt ggcctgggat ggctggagat gcggagaagt gggtcaagtc 4680 gtgctcgact tgcgcggtta acggacgacc tgagaagcca acgcccatgc agcgctcatt 4740 cgctccaaag gccgtttggg ataccgtggc tgttgacttc aacgggccat acgcaaggct 4800 tggtgggata ttgatcttgg tcatcattga tcttagatcc cgctatgcga tggcttaccc 4860 tgttaagtcg actggattcg aacacacacg gacagtacta gacgcgatct tcggtagaga 4920 aggtttccca cgggccatga aatccgacaa tggcccacca tttaatggag aggattatgc 4980 gaactactgc tctgaacgag gtatccagac cgtattctct acgccgtttt ttccacaaca 5040 gaatggcctt gtagaatgct tcatgaagaa cgtgaacaaa gccatgtcga ctgcactttc 5100 tacgggtagc gattaccgga aggagcttca agcaacggta caagcataca acgccgcgga 5160 ccacacagtc accaaagtac cacccgaaga ggtcttcagt ggtcgcaaaa tcaagcgcgg 5220 tctgccgtta cttctccgcg gtaggacgga acatgatgat ttggtgttcg acgccaggga 5280 tcacgaggcg aagataaaat ccaagacacg ggaagacaca cgccgtggag ctaaacggag 5340 tacagtcaaa ccaggggatg cggtcttgat cgagcgccag tcaaaggcga agggagaaac 5400 tcggtttgat cctaaacgct acacggtaat ggaggagcgt aacggcagtc tggttgtctg 5460 tgatccggat ggccgccagc tcagacgcca tgttacacaa accaaaaagg ttcagcagtg 5520 gcgagaaccc cgaccgatca tccaagaggc ccaacgttcc gacgagctgg ggtgtcatga 5580 tcaggaagta aaggagattg atcaacagcg gccgagaaga acagtgaaga tcccctcaca 5640 cctctcggat tatatccgtg cggttgagga agagttttga tggccaagac ggaaagcacc 5700 aggctgttga gtgaaggaaa ataaattatg actagaactt ttttttgtgt tttttttctt 5760 gagaaaggaa gggaga 5776 // ID Gypsy-231_AA-I repbase; DNA; INV; 5205 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-231_AA_; KW Gypsy-231_AA-LTR; Gypsy-231_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5205 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1065-1065 (2011). XX DR [2] (Consensus) XX CC Positions [2385-2885] - Reverse transcriptase CC Positions [4020-4490] - Integrase core CC 'CTAAC' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(792..3941,3945..4904) FT /product="Gypsy-231_AA-I_1p" FT /translation="MAQSALATNIEPYRKGAGFTEWAERLEYLFLINSVQE FT EHKKAYLATLGGPVVYSELKLLYPNTDLNNIAYNDMISKLKARFDKVAPDI FT IQRLNFNNRIQQKDESVEDFVLAVKLQAEFCTFGDYKSIAIRDRIIAGIRD FT KALQQRLLNEENLTLSLAEKIIATWEVAGANSRGLADRVTDNTGVERVDAI FT ASTSKFNGNPRTTLEKLLATLDLARTANSDHNEGHDKGPVKSRLGYRPNSI FT GRQTNNGERMGPYYPRMRGDWKSNRERQRGQYADFVCDFCGIKGHIKRKCF FT KLKNMKRDAVNFVDQGSGHDKDDSLERLFNRLKADDSGSEDDWDAGDFDTC FT MNVTSTKNNPCLLELLVEGKNLQMEVDCGASVSVIGKKEYWKMFKKPVKQC FT NRKLIVVNGDRLKIEGEASLLVEFKGKVENLKILVLNSDYNFIPLMGRTWL FT DVFFPDWRNFFTNSDSINKLLVGNSDTIVNEIKQKFSKVFEKDFSLPIVGY FT EAELVLKSDVPIFKKAYNVPYRLKDKVIEYLDKLERQNVITPIKISEWASP FT VVIVMKKNNDIRLVIDCKVSINKVIIPNTYPLPTAQDLFAGLAGCKFFCAL FT DLEGAYTQLMLTERSRKFMVINTIKGLYVYNRLPQGASSSASIFQQIMDQV FT LQGIDFVYCYLDDVLIAGKNFEECKERLLIVLERLSQANIKVNWDKCRFFV FT TELEYLGHIISDRGLLPCPDKTLTIQKAKAPKNETELKSYLGLINYYNKFI FT PRLSIKLYHLYNLLKNKVKFIWDVQCEEAFKESKNSLVNAKFLEFYDPKKP FT LVVVSDASSYGLGGVISHIIDGAEKPISFTSFSLNSAQRKYPILHLEALAL FT VSTVKKFHKYLYGQKFQVFTDHKPLVGIFGKEGRNSIYVTRLQRFVLELSI FT YEFDIQYRPSSRMGNADFCSRFPLDQLVPENSDTEGVNNLNFGKELPIDYK FT TIAKWSKQDEFLQQIVSYLQNGWPERLERRYVDIFANHHDLEIIEDCLLYQ FT DRVVIPQIMQNKVLDLLHANHAGIVKMKQLARRYVYWFGINSDIKFVPSCD FT ICSSMTILPKPKLLSKWIPTTKPFSRIHIDFFYFEHHVFLLIVDSYSKWLE FT IEWMKNGTDARKVINKLVVFFARFGLPDMLVSDGGPPFNSNAFISFLERQG FT IKVLKSPPYHPSSNGQAERFVRTVKEVMKKFLMDRTVMDLELENQISLFLI FT TYRNNCLTKEGNCPSEQIFSYAPKTLIDLVHPKKHFKQNLQEILPHEDTNI FT SKRNGDVHTGLNDPLCNLTAGEELWYKNHNPHVKARWIKASFIKWHSQNTL FT QVQIGSVPIMAHRSQVRRCNESDNNMPNAVVLLRRCDGRCAKGSVYHWSRG FT EGRGTSS" XX SQ Sequence 5205 BP; 1687 A; 864 C; 1135 G; 1507 T; 12 other; actgacggcg aggctggtgc ggaaaccact gtagttccag gagatcatcg agaagacgat 60 tagggaatcc agcggattca gcggtaaacc aaccggcaaa acgtacagtc gtttgatcct 120 gcaagaaagt ctggtcatct ggattcggcc attgtgaggt agaagtttct tctatagtaa 180 gtagcgtgcg tctatctatt atctgaagga gaaatttagc aggacgctcc aaaattgtgt 240 tcttaacgaa aaactatgct atcggcacta acctttaaca aagattttgt gtaaaagaaa 300 cagatagtga cgctttatag tttgcattat tcatgcgtgc aatagacttc ggccgtgttg 360 ttctcgaata atacgaggtt tttttgtatt cgatcattta ggtctgaaat tcattttggt 420 attgtgatat tgtgaacatt acgcagcgtt acgtttgagg tgatttggct csgtgcagcg 480 gakcaaagca aggacatccg gagtgtgtat tccgtgggcg gcacaagaca cgtatacgca 540 acacgtcgac aaaggtcaat aaaccacaaa gcmmcmcagg gagcttcatk ggkgastcca 600 atctgttttt ttttcctata accgctttgc gacaaatttt gtttttcctt ttatgatttt 660 ctggctgatt tttggctaaa ttttaccctt ttcaatcatt tacaatagcc tttattgaaa 720 attctttgca cacaatagaa tacctaattc aattgtgttt caaacaatag cccttacaat 780 cctgataaat tatggcgcaa tctgcccttg caaccaacat tgagccttac agaaaaggtg 840 cgggtttcac agaatgggct gaaaggctag aatatctgtt tttgataaac tccgttcagg 900 aggaacacaa aaaggcgtac ttggctactt taggtggccc tgtagtctat tcggaattga 960 aacttttata ccccaatact gacttgaaca acattgcgta caatgacatg atttcaaaat 1020 tgaaagctcg attcgataag gttgcaccgg atattattca gcgcctcaat ttcaataacc 1080 gtattcaaca aaaagacgaa tcggtggaag attttgtcct ggctgtcaaa ttacaggctg 1140 agttttgtac ttttggcgat tataagtcta ttgccattag ggatcgcatt attgctggaa 1200 tacgggataa agctctccag cagcgactcc ttaatgagga aaatttgacg ctttcgttag 1260 cagaaaaaat tattgccacc tgggaagttg ctggagctaa tagtaggggt ttagcagacc 1320 gagtaactga caatactggg gtggaaaggg tagatgcaat tgcatcgact agcaaattca 1380 atggaaatcc acgcactaca ttggagaagc tattagctac actggatttg gcgaggactg 1440 caaacagcga tcacaatgag ggacacgata aaggaccggt gaaatcaagg ttgggatata 1500 gacccaactc tataggcaga cagacgaaca atggtgaacg aatgggacca tactatccca 1560 ggatgcgagg cgactggaag agcaacaggg aaaggcagag agggcaatat gcagattttg 1620 tgtgtgattt ctgcggaatc aaaggccaca tcaagcgcaa gtgtttcaag ctgaaaaata 1680 tgaaaaggga cgccgttaat tttgtggacc aaggatcagg acatgacaag gacgatagtt 1740 tggaacggct gttcaacaga ctgaaggcag acgattccgg aagtgaggat gattgggatg 1800 caggtgattt tgacacatgc atgaatgtga cgtctactaa aaacaatccc tgtttattag 1860 agttattggt ggaaggcaaa aatttgcaaa tggaggtaga ttgcggtgcc tcggtgtcgg 1920 taattggcaa aaaagaatat tggaaaatgt ttaaaaaacc agtaaagcaa tgcaacagaa 1980 agctgatagt tgtgaacgga gatagactca aaattgaggg agaagcaagt cttttggtcg 2040 aatttaaagg gaaagtggag aatctaaaaa tattggtact aaatagtgac tataatttca 2100 tcccattgat ggggagaact tggttggatg tgtttttccc agattggagg aattttttca 2160 cgaactcgga tagcattaac aagttgttag taggaaacag tgatacaata gtaaatgaaa 2220 tcaaacaaaa attttctaaa gtgttcgaga aagatttttc tctgccgatt gttggttatg 2280 aagcagaatt agtgttaaaa agtgacgtgc ctatttttaa aaaggcttac aatgtccctt 2340 atcgtttaaa agataaggta atagaatatt tagataaact tgaaaggcaa aacgtaatta 2400 cacctattaa aataagcgaa tgggcttcac ctgtggtaat cgttatgaag aagaataatg 2460 acatacgtct ggttattgat tgcaaggtat caattaacaa agtgattata ccaaatacat 2520 atcctttacc cacagcacaa gatttgtttg cgggtttagc tggatgcaaa tttttttgtg 2580 ctttggattt agaaggggca tatacgcagt taatgttgac tgaacgatca agaaagttca 2640 tggttattaa cactattaag ggactctatg tatataaccg attaccacag ggagcttcat 2700 caagtgcttc aatctttcaa caaattatgg accaggtact tcaaggcatc gatttcgttt 2760 actgttattt agatgatgtg ctcatagctg gaaaaaattt tgaagaatgc aaggaaagat 2820 tgctaattgt tttggagaga ctttctcaag ctaacataaa agtaaattgg gacaaatgcc 2880 gttttttcgt tacagaactt gaatatttgg gacacattat cagtgacagg ggtttattgc 2940 catgtcccga taaaactttg actatacaga aggcaaaagc tcccaaaaat gaaaccgaat 3000 tgaaatctta cctgggtctc ataaattact ataataaatt tatccctcgt ttatcaataa 3060 agctgtatca tttgtacaat ttattaaaga acaaggtgaa atttatctgg gatgttcagt 3120 gtgaggaggc ttttaaagaa agtaaaaatt cacttgtcaa tgcaaaattt ctggaatttt 3180 atgatcccaa gaaaccttta gtagtagtkt cagacgcttc aagctacggc ttgggaggag 3240 taatttctca tattattgac ggagccgaaa agccaataag ttttacatcg ttttcattaa 3300 actcagcgca gcgcaaatac cctatattgc accttgaagc tttagcttta gtgagtaccg 3360 taaagaaatt ccacaagtat ttgtatggac agaaatttca agtttttaca gaccataaac 3420 cattagtagg aatatttgga aaggaaggaa gaaattccat ttatgtgaca cgcttgcaac 3480 gattcgtact agagttatcc atttatgagt ttgacattca atatagaccc tcatcccgaa 3540 tgggaaacgc agatttctgc tctagattcc cgttggatca attagtccca gaaaattcgg 3600 atacggaggg tgttaacaat ttaaattttg gaaaagaatt gcctattgat tataaaacga 3660 ttgctaaatg gtccaaacaa gatgagtttt tgcaacaaat tgtttcttat ctacagaatg 3720 gctggcctga aagattagag aggcgttatg tcgacatttt tgcaaatcat catgatttgg 3780 aaattattga agactgtttg ttatatcagg acagggtggt gattccacaa attatgcaaa 3840 ataaagtgtt ggatctgctg catgccaatc acgctggtat tgtaaaaatg aaacagcttg 3900 caaggcgtta tgtctattgg tttggtataa atagtgatat agamaagttt gtaccttctt 3960 gtgatatttg tagcagtatg acaattctac caaagccaaa actgttatcc aaatggatcc 4020 ctaccactaa gccatttagc agaattcata ttgatttttt ctattttgag catcatgttt 4080 ttttactaat agttgacagt tactctaaat ggttagaaat tgaatggatg aagaatggta 4140 ctgatgcaag aaaggtaatt aataagttag tcgtattttt tgcgagattt ggtttaccag 4200 atatgttggt atctgatggg ggtccaccgt ttaattctaa tgctttcatt tcctttttgg 4260 agagacaggg cattaaagtt ttgaaaagtc caccttacca tccatctagc aatggccagg 4320 ccgaaagatt cgtgagaacg gtcaaggaag taatgaaaaa gtttttgatg gatagaactg 4380 tgatggattt ggagttggag aatcaaataa gcttattttt aataacgtat agaaataatt 4440 gcctaacaaa agaaggaaat tgcccatcag aacaaatatt ttcgtacgca ccaaaaacgt 4500 tgattgacct agtccaccca aagaaacatt ttaaacaaaa tttgcaagaa atattgccgc 4560 atgaagatac taacatctct aaacggaatg gagatgtcca tacgggatta aatgatccac 4620 tgtgcaatct gacggctgga gaagagttgt ggtacaaaaa ccacaaccct catgtcaagg 4680 caagatggat taaggccagt ttcattaaat ggcattctca aaatacattg caggtgcaaa 4740 ttggaagcgt accaataatg gcccaccgca gccaagtgcg tagatgtaac gagagtgaca 4800 acaatatgcc gaacgctgtg gtcttactgc ggcgttgtga tggaagatgt gcaaagggat 4860 cagtctacca ctggagcaga ggcgagggtc gaggaacctc cagttgacac ctgcaatcca 4920 accaagcgta aaagaaagca cgatgaattg gaccacagtg tttctctaag aaggtcaatg 4980 aggcctaaaa aggcgaagaa agatggtatt tatgtttaca attgattata twggtgaatt 5040 cgatatttcc attgtattat tgwacttcaa ccttgaactc aaaatctttt cgaatatctc 5100 gattaattaa tatttttcat aacaacgaat tcggaatttt gtattatact ttagattaag 5160 ctgtcggtaa aaaccgaaac tcgatcatcc gaaggggtga ggagt 5205 // ID Helitron-7_NVi repbase; DNA; INV; 5904 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Helitron DNA transposon - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-7_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5904 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1392-1392 (2009). XX DR [1] (Consensus) XX CC The both ends may be incomplete. XX FH Key Location/Qualifiers FT CDS join(25..381,314..895,822..1316) FT /product="Helitron-7_NVi_1p" FT /translation="MPLEICGQLYIYDDITSVEKRLKKNEKLTKDHIEILT FT TLIPDNPYAKNFRCLNQLANIQNLPNYRLYFVRKNDKQKHRYNKPLTAECG FT AIIVSKSVIPDDYDLCIYPKAKTNSDKNLFKFQMIMTFAFILKLKQILIKI FT YLSKLSHHVDPMVFPLMFPSGDLGWSIGYNKKPNSESNKIYDIDNLLLNIN FT RSKNNKRNLENLTLLQYYSYRLSYRPKIKHFSPLLYSGRLTQQYFIHALVM FT VESNRMNFFRNNQKQLRIECYQGLFDHVFNSASNISQNFEKKKKDWVIHIT FT CYIYRKSPIYAATLSRCNGNNFILPATYIGSPRYMQQHYQDAMAITRAKSR FT PDLLITMTCNPKWKELKMVLKNFPSGTTPNDIPNITVRLFYAKFNSFLNDT FT LKNKIFGEVLAYVYIIEFQKRGLPHAHIVVTLHPNNKIMTPESINKYISAE FT IPDNSNKDLQKLVIKHMLHGPHTDTSPCLKKK*" FT CDS join(1385..1615,1612..3045,2961..3803) FT /product="Helitron-7_NVi_2p" FT /translation="MVIHYIVGYNTSENHTYSTKKYKQIIPVDNRMVVPYN FT SFLLQKYKCHINVEYCASIQSIKYIFDYIHKGGDRAKFKKKIDDNENNEPD FT TIVFDEITQFIDGRYLSGMEVAWRLQEFPLCGRSHTVTRLAVHTKDQQKII FT FEEHKESEALRNAETTLTAWFKLKKTDDFAKNITYVEIPQYYRFDQKTKSW FT IKRSRNYKYSAIRRLNVVYPKDSERFFLKLILNRMKGATSFEDLRTYDNKL FT YQSYKEAALAMGLVADDSCVYNIFEEACQIMLPFQLRKFFAWFILAENIQG FT NTIWNMFKNFFTEDFKENKENNALSHIQSILETEERCCKDFSLPEPANLIV FT NDDMSSENIMQSATIFKQMVNQLNEDQHSIFTEIINSKTNIYFIDGPGGSG FT KTFLYKTLIHYFISLNKQVLSMAWTGIASILLPKGMTSHRTFKLPLDLDNV FT DTAFLKLESDKKKLRLADLIIWDEASMIPKKALEIIDRTLQDVCYNKLPFG FT GKLIVLGGDFRQILPVMKHGYRNSIVEETIKYSKLWPLFKVLKLKKIFVLM FT IENSHHYYIFQIMASFQSFKIKKNIRSNDREFSSLLLDIGEGKINPFRIPQ FT LWKTNDVCDKIYSNINQNISINSVILAPHNEDIKIINNKVLKLLYGDSWTY FT YSIDYATHKGVDQTDENIHLNYPIEMLNEIREGLPPHKLDLKINSTVMLIR FT NLSIIDGLCNGTRLKIVKLYQYNIEAEIITGENIGNKVFIPRITLNTGENS FT SFPFTLYRKQFPIILAFAITINKSQGQSFDSLGIFLRRPLFSHGQLYVALS FT RCKNPNKIFIQNELEDSSKIDNIVWSEIFKDAI*" XX SQ Sequence 5904 BP; 2305 A; 729 C; 845 G; 2024 T; 1 other; tctcatattt ctccgaattc actgatgcct ttagaaatct gtggtcagtt atatatttat 60 gatgatataa cttctgtaga aaaaagatta aaaaagaatg aaaagttaac taaagatcat 120 atagaaatat tgactactct tattcccgat aatccatacg ctaaaaattt tagatgtttg 180 aatcaattag caaatataca aaatttacca aactacagat tatattttgt tcgtaaaaat 240 gacaaacaga agcatagata taacaaacca cttacagctg aatgcggtgc aatcattgtt 300 tctaaatcag tgattccaga tgattatgac ctttgcattt atcctaaagc taaaacaaat 360 tctgataaaa atttatttaa gtaaattatc tcatcatgta gatcctatgg tttttccttt 420 aatgtttcca tctggtgatt taggatggag tatcgggtac aataaaaaac caaattctga 480 atcaaataaa atttatgata tagataatct tctccttaat ataaatagaa gtaaaaataa 540 taaaagaaat ttagaaaatc tcactctttt acaatattac tcttataggc tatcatatag 600 accaaaaata aaacatttct ctccattatt atatagtggt agacttactc aacaatactt 660 tatacatgca cttgttatgg tagaaagtaa tagaatgaac ttcttcagaa ataatcaaaa 720 acaattaagg attgaatgtt atcaaggttt attcgatcat gtttttaatt ctgcttcgaa 780 tatttcacaa aattttgaaa aaaaaaagaa agattgggta attcatatta cctgctacat 840 atataggaag tccccgatat atgcagcaac attatcaaga tgcaatggca ataactagag 900 ctaagtctag accagattta ttaattacta tgacttgtaa tcctaaatgg aaagaattaa 960 aaatggttct aaaaaatttt cctagtggta ctactccaaa tgatattcca aatattactg 1020 ttagattgtt ctatgcaaaa ttcaattcgt ttttaaatga tactttaaaa aataaaattt 1080 ttggagaagt acttgcatat gtatatataa tagaatttca aaaaagagga ttaccacatg 1140 cgcatatagt tgtcacattg catcctaata acaaaataat gacacctgaa tcaataaata 1200 aatacatttc cgctgagatt ccagacaatt caaataaaga tttacaaaaa ttagtaatca 1260 aacacatgtt acatggacca catactgata catctccttg tttaaaaaaa aaataaaaaa 1320 agtaatacta cttgtattaa aaattttccc aaagaatttt gctaccacac aacttttaaa 1380 gtaaatggtt atccattata tcgtaggtta taatacatct gaaaatcaca cgtattcaac 1440 aaaaaagtat aagcaaatta ttccagttga taaccgtatg gtcgttcctt acaatagctt 1500 tctactgcaa aagtataaat gtcatataaa tgtagaatat tgtgcatcta ttcaaagtat 1560 taaatatata tttgattaca tacataaggg cggagataga gcaaaattta aaaaatagat 1620 gataatgaga ataatgaacc agacactata gtttttgatg aaattactca atttattgat 1680 ggaaggtatt tgagcggaat ggaagtagcc tggcgactac aagaatttcc attatgtggt 1740 agaagccata ctgttactcg tttagcagtt catactaagg atcaacaaaa aataattttt 1800 gaagaacata aagagtcaga agcattgcga aacgcagaaa caactttaac ggcatggttt 1860 aaattaaaaa aaacagatga ttttgcaaaa aatattactt atgttgaaat tccgcaatat 1920 tatagatttg atcaaaaaac aaaatcttgg ataaaacgat cgcgtaatta taaatatagt 1980 gccattcgac gattaaatgt agtatatcca aaagatagtg aaagattttt tttgaagttg 2040 atactaaaca gaatgaaagg tgctacttct tttgaagatt tgcgaactta tgataataaa 2100 ctatatcaat cttataaaga agcagcactt gctatgggtt tagtagctga tgactcttgt 2160 gtttataata ttttcgaaga ggcctgtcaa ataatgttgc cctttcaatt acgtaaattt 2220 ttcgcttggt ttatattagc agaaaacatc caaggaaata caatttggaa tatgttcaaa 2280 aattttttta ctgaagattt taaagaaaat aaagaaaata atgctttatc tcatatccaa 2340 agtatattag aaacagaaga aagatgttgt aaggatttta gtttaccaga acctgctaat 2400 cttattgtta atgatgatat gtcttccgaa aacataatgc aaagtgccac tattttcaaa 2460 caaatggtaa atcaattaaa tgaagatcaa cactccattt ttacagaaat cattaacagt 2520 aaaactaaca tttactttat tgatggtcct ggaggttcag gaaaaacttt cttatacaaa 2580 actttaattc actattttat ttcattgaac aaacaagtct tatcaatggc ttggactggt 2640 attgcttcta tattattacc aaaaggaatg acaagtcata gaacttttaa attaccttta 2700 gatttggata atgtagatac tgcttttcta aaattagaat cggataagaa aaaattaagg 2760 ttggctgatt taattatttg ggatgaagcc tctatgattc cgaaaaaagc tttagaaata 2820 atagatcgaa ccttgcaaga tgtatgctat aataaattac cttttggagg aaaattaata 2880 gtattaggag gtgattttag acaaattctt ccagtaatga aacatggtta tagaaattct 2940 atcgttgagg aaacaattaa atattccaaa ttatggcctc ttttcaaagt tttaaaatta 3000 aaaaaaatat tcgttctaat gatagagaat tctcatcatt attattagat ataggagaag 3060 gtaaaattaa tcctttcaga atacctcaac tttggaaaac taatgatgtt tgtgacaaaa 3120 tttatagcaa tattaatcaa aatatttcta taaattctgt aattttagct cctcataatg 3180 aagatattaa aattataaat aataaagtat taaaattgtt atatggtgat tcatggacat 3240 attacagtat agattacgca acacataagg gtgtagatca aactgacgaa aatattcatc 3300 taaattatcc aattgaaatg ttaaatgaaa ttagagaagg tcttccacct cataaattgg 3360 acttaaaaat aaattcaaca gtaatgttaa ttcgaaattt gagtataata gatggtctat 3420 gcaatgggac tcgtttaaaa attgttaaat tatatcaata taatattgaa gcagaaataa 3480 ttactggaga gaatattgga aataaagtct ttataccaag aatcacctta aatactggag 3540 aaaattcttc atttccattt actctttata gaaaacaatt tccaattata ttagctttcg 3600 caataacaat aaataaatct caaggacaaa gttttgattc cttaggaatt tttcttagac 3660 gaccattatt ttctcatggt caattatatg tagctttatc tagatgtaaa aatcctaata 3720 aaatatttat tcaaaatgaa ttagaagact cttcaaaaat cgataatatt gtgtggagtg 3780 agattttcaa agatgcaatt taaaacaaat ttattgcatt attttttttt taaatcctat 3840 gagttattat tataaatatg ttttaaaaat atttcatcga aattaaaaaa aaaaaaacaa 3900 atttttttta tagattatta ttttttaaga cctccgttac ttttttttat attacgatac 3960 agtaattaat tatctttctt tcaaagctcg ccaacaaatt ttaataattt ggtttaaaca 4020 aaaaatacta atttttaagt tcaaatattt atttacttat tattatattt tttagcagat 4080 tattaacaat ggaatgggat ccagttcaaa attcattagt tcgtgtagag aatgttgaac 4140 acagagatta tttaattgca ttgaataatc aaatagatca acacttaatc gaacaagtaa 4200 attatgaact ttacttgaat gatttagtaa gagatcacga aaataataaa ttaaaagagt 4260 taaattctca aatagaattt tatgaaatat ttgacaatga aatggatgta gcttcaatat 4320 tttccaattt agatgaaatt cctactgata taactttaac tattcaaaag attttaaaat 4380 ttcgtaaaaa aaataattta ttaccaagta aactcaaaga taataatcat gatgtatatg 4440 ttaaatatat tttaaatata agcaacatta tacatcatat tcaattaact acaaaaaata 4500 attaatttgt aacatttttt tattattatt tggaacaaca ctaaattatt tctttcagat 4560 gcaggaaaca atttaaatta tttataaaat tatttactaa cgaacttaat ttacagattg 4620 gaagtatgta aatcaacatc agtcaataat gcaagcatag tcatccgtta tctctagtag 4680 aaaatgttaa ataaataaaa attattatta ttttagtttt tttttgtaca aaaaaaatca 4740 aatgaaatat tttacgttat attttcttaa aattgatcgt aaatattatc taaatatttt 4800 tcatcaaaat tcaatcacaa tatttttttt cataacaaat tattattttt taagacattt 4860 gaaaattttt taattataga aaattagaat cactaaaaat atttctttat aaattatttt 4920 tttgacaact ttcaatatac acaatataca aacagaaacc aatacagaac agatacacaa 4980 ataattatta gataatatca taagtttatt tcgtattatt ataggacatg atataataat 5040 acttagttga cttgtaaaaa tttaatgatt atttgtacac ttttacatga aagaagtttg 5100 aattaaattt aagcttaaaa acttctttct agtaaaagtg tttctgtagg acggaggatg 5160 gtatgaggtg gagtcttagg ggaggaggat tggtatctaa gactcccgtg gttatatggc 5220 tctaggcgat gattccacgg ttttttttag attattgtat acaaatttta attgataaaa 5280 atggatcgga attaatcgtt tcatttttaa caataaccat aattttatat aatttaatca 5340 tgtcatttaa ttatgtaaaa ttattggtga tttttataca ataaaawaca aaaaaaaatc 5400 aacgcaccta aagtacattg tattatgact ctaatacaat atgctcgaca gtacctcttg 5460 ggtggtgtgg aaaactgtca ttctaaactt taaaccgatg tatcgattag aaatttagtg 5520 aggttttgat ctttacattt ttgatgtaaa gatcggacat aacgatcgga cgctttaatc 5580 attttatttt aattctatcc aatcgtaaaa tgttataaat ttgtatctaa ttgttacatt 5640 cttctgtctt acaataatcc tcaaaatgtt ttacttctca aagtgtgaaa ttactccaaa 5700 gagaagaagt ggagggaaag tggaggggaa gtggagggaa agtgcagggg aagtggaggg 5760 aaagtggagg gaagtggagg gaaatggagg gaagtggagg ggttagtgga gggacaacgt 5820 agggatagtg gagggatagt ggaggaaagt ggagggaagt ggagggggtg gaggggattt 5880 tcaattaaaa tcacgcttac aact 5904 // ID BEL-49_CQ-LTR repbase; DNA; INV; 699 BP. XX AC AAWU01014131; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-49_CQ_; KW BEL-49_CQ-I; BEL-49_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-699 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 252-252 (2011). XX DR GenBank; AAWU01014131; Positions 47833 47135. XX SQ Sequence 699 BP; 211 A; 158 C; 152 G; 178 T; 0 other; tggtcagctc tgaactgacc atgaaggtgt tgtacagttg taaatagtcc agattagttt 60 tagcacacaa tttacacaga atgcattcga agacacaaac acacacacac acatttaggg 120 ttagttaatt gaatttactt atgaactaaa cacacaaaaa ttagctgcat gcaaaatacc 180 gaagttgctc tactacacgg tatcatcatc ctcgctgata tctctcagtt gggagaatct 240 gtatctctac cgtgtgaagc agaattggaa aaaacgtagc ctagcagaag atcaattcga 300 gaaccacgac agtacgaaat acatctcgca agcacaaacg tcgcgttctt cttgttcttt 360 acattgccaa atcccaatat acaagtcgtg cagttaccaa atcgagtttg ttcttaatcc 420 gaaatcgttc gacaatttta actcaaatcg tgaagattcg tcgtgatcgg tagtgatctg 480 ttctctagtg ccgctgaaac aagtccagcg gactgagtgt gatcgttaag gtgatcgtga 540 agcaacaacg acactttcac ggtggaaggt gcgaagtgcg agtgcctata gtgtacggta 600 gtgaggtgtt acctgtctcc gaggtcttca ggtgccacca acgaaggtgg gaagcaaaac 660 ctccaacgtc cggcgcaatt tcgagccccg gcccgaaca 699 // ID Sola1-1_AA repbase; DNA; INV; 3285 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3285 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(652..681,695..2656) FT /product="Sola1-1_AA_1p" FT /translation="MSPIPGKVILSISLQFIFNLGNLLNDTISLHSESDPE FT WSYAAVSDEEEDESPSKKRKKNKQSWKNVQRKKAKEKGQAYTSTTGQHKRA FT RSVGDPCSCKRLKCTENFSTEARDLILAEFLTLKSSGQNQFLINNIDSQEK FT VRYTTAGPSRRQYSVKYFLPTTSGRLQVCRGMFLKTLDVSERKIRSLLGKQ FT RTSAAAICSEDGRSGNRSRLETSDDSKEIVRRHIRQFPAYTSHYTREKSKK FT KYLSSDLTIRKMFDLYKVWCVGENVVPVQENVYRKIFTEDFNLSFRKPRTD FT TCNRCDQFNVLIKNAENDDDKHDLERQRDEHQAWATKMYKKKDKDKSVAKL FT FPNVRTMSFDLQKCLPTPYLTSGESYYKRQLYTLNLTTYMTISSEQTKATC FT YLWHEGIAKRGSQEIGSCLYKELKLLPPEVDTLFCYSDCCRGQNVNFFICC FT MFSYFIEECILAGRELSIIHRLLVPGHTHMEADTIHGAIEKAKKTTSIDIQ FT IPKDWAQLIANIRRTNPIAVVEMDQGDFLAFKSLDGTYYKRPKKNVDQIPV FT RFSDIVCFKYTTSSPGAVFYKYRTSSVFEKAVIKEQTAEPIPTLQPISTEP FT LMLPVEKLRDLQDLQKFINPVNRAFYDTFLSNLQPKKKGRKPKVTKVVDHF FT EADLSDFEESSNEN*" XX SQ Sequence 3285 BP; 1075 A; 636 C; 678 G; 896 T; 0 other; ctcgcgttta gtatcataag ggcgtatctg cagttttggc gggaaatgag atttcaccac 60 atttgaattc tataggtttg aaactataac cctaccaaaa attattactt aaaacttatt 120 agaaatatca tattttcaaa aaaaatgatt ggcgtaacta cagagtcacc atgaaaaaaa 180 agatacgaca gacgtaactt ttttggtcat cttctctgaa agttacgcca actgtgattc 240 tgctcgtgtt tacaaaatgt ccgccggaaa atatttgcag ctcgctattt tttcaacagt 300 ttaaacacga tatggcgcag ttaaacgagt tcgacacatt cgaggatgta atttggctta 360 cacctgagga aatgctgatg gaattcactg cagtagtagg tgagtaatgc tgaaaaacaa 420 ttcccaacat tgtcctgtaa atttgtgttt accatcatag aacacagcgc agcggattcc 480 ctcacaaacg cctcacatga ttcagaaaca gaaaaagacg gtgatgaacc agtggactca 540 gctacggttg gcacggaacc agaagacgag caggaagctt tacgagaacc tgtcatagag 600 cccttggaaa taccagaaat tgttaatctt ccctcagtcc gcaatccgga aatgtctccc 660 attccaggta aagtaattct atgacagttg ttgatcaata tcattacagt ttatcttcaa 720 tttaggaaat ctcctgaacg atacaatttc attacattcg gaaagtgatc cggagtggtc 780 atatgcagct gtgtcagatg aggaggaaga tgaatcacca agtaagaaaa ggaagaagaa 840 taagcaaagt tggaaaaatg tccagcgaaa gaaggccaag gaaaaagggc aggcttacac 900 gagtaccact ggacaacaca agcgcgcccg ctcggttgga gatccttgtt cttgcaaaag 960 actcaagtgt acagaaaact tttcgacgga agcacgtgat ctcatcttag ccgaatttct 1020 aacgcttaaa tcatccggac aaaaccaatt tctgatcaac aacatcgaca gccaagaaaa 1080 agtgcggtat acaacagcag gaccatcgcg tcgtcagtat tcggtgaaat attttctacc 1140 aactacttct ggaaggttgc aagtatgcag aggtatgttt ctgaaaacac ttgacgtttc 1200 ggagagaaaa atacggagcc ttttgggaaa gcaacgtacc agtgccgcag caatatgctc 1260 agaggatggt agatctggta accgatcgag attggaaacc tcagatgatt caaaagaaat 1320 agttcgaagg catatccgac aatttcctgc gtacaccagt cactatacga gagagaaaag 1380 caagaaaaaa tatctttcga gtgatttaac catccgaaaa atgtttgatt tgtataaggt 1440 atggtgtgtc ggagaaaatg tggttccggt tcaagagaat gtgtatcgca agatttttac 1500 ggaggatttc aacttgagtt ttcggaaacc acgaacggac acctgtaacc ggtgcgacca 1560 gttcaatgtg ttgattaaaa atgccgaaaa cgacgacgat aaacatgatc ttgaacggca 1620 aagggatgag catcaagcat gggccaccaa gatgtacaaa aagaaggata aggacaaatc 1680 tgtagcaaaa ctgtttccca acgtaagaac gatgtcgttc gacttgcaga aatgtctacc 1740 tacgccgtat ttgactagtg gcgagtcgta ttataagcgt cagttatata cgcttaatct 1800 aacgacgtac atgacgatct ccagtgagca aactaaggct acttgctatc tgtggcatga 1860 gggaatagcc aaacgaggat cacaggagat tgggtcatgt ttatacaaag agcttaaact 1920 cttaccacct gaagtggaca cattattctg ctacagcgac tgttgcagag gacaaaatgt 1980 gaatttcttc atctgctgta tgttttcgta cttcattgaa gaatgcattc ttgctggcag 2040 agaattgtct atcatccatc gtttgctagt accaggtcat acacacatgg aggcagatac 2100 catccacggc gctattgaga aggcaaaaaa gactaccagc attgatatac agattcctaa 2160 ggactgggct cagctgattg ccaatataag gaggactaat cctattgccg ttgtcgaaat 2220 ggatcaggga gattttttgg cgttcaaaag tttggatgga acctattaca agcgtcccaa 2280 gaaaaacgtt gatcagattc cggtgcggtt tagcgacatt gtatgcttca agtacaccac 2340 tagtagtcct ggagctgtat tttacaaata caggacgagc agcgtcttcg aaaaggcagt 2400 tatcaaggaa cagacagccg agccaattcc taccctgcaa ccaatttcaa cagagccgtt 2460 gatgctaccc gtggaaaaat tacgcgacct tcaagacttg caaaaattca tcaaccccgt 2520 caacagagcg ttctatgaca cattcttgag taacttgcaa ccgaagaaaa agggaagaaa 2580 accaaaagta acaaaagttg tcgaccattt cgaagctgat ctatctgatt ttgaagaatc 2640 ctcgaatgaa aattgaggcg attttgatgt tcagttttgt gcgttgagtt tttcatgaaa 2700 taaatcaaac gtaattaata ttgacctatt tttttattac tttcttgtaa aaaacatact 2760 cataaagatg cataatccaa tcgagggggc tcatattata gtctgggcaa tcaaattgat 2820 gatatttccc tgtatcttcg ctgcgatgta tgaaaaccaa tatggcatta tccaagaaag 2880 tttttatttt aattgtaaaa ctgtacaagg agcacttagt tgagaaactg gctcgattcc 2940 agtaggtatg aaatttcaac aaaaagttat tagagactct catagtcgtt tagatgttat 3000 aaaaatggtt aattcatcaa acactaatga ttttgcgttt tgacgtatct ttttccaatg 3060 cgacataccg gcgtaactac aatttatttc attatcaagc atattttaca tgtacttcat 3120 atgccaccgg atactccaat ctcataatag tatttgtata aatttacaga caaatctact 3180 ttaaaacaaa aaaatacgtg atttgtttgt tcgttgataa tttcaatgtc gattttctcg 3240 gtttctttga atctgcagtt acgcccttat gatactgaac gcgag 3285 // ID hAT-N2_AP repbase; DNA; INV; 735 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N2_AP. XX NM hAT-N2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-735 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2101-2101 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 735 BP; 249 A; 84 C; 89 G; 312 T; 1 other; tagagcgcgg atttctatgc aattgcatat ttttttcctt ttaatatttt tggatttttg 60 tcagttatct ccacgaatca tcataatttg aagctaatgg tgaaattttt ttttgcatat 120 ttctgcatat ttaaaggtta ttgcatattt ttgcatattt tggcaaaatt caaaatttat 180 tgcatattga agcgaaaatt taaaaaaatg tataaaataa tattttgtca agtagaaaaa 240 ttaaaactaa atttcatagt cactaaataa taatttatat atatatttat ttataacatt 300 ataagataaa taaataatcg attacgacgt aaacttaatc gctcttgctg tgtcggtcgg 360 ttatcgacat accactgggc actcggttac atcgtttgtc gttgtctgta aatcgtgaat 420 atgccgaaac ttgaaatgca tatgattatt aattttaata ataaataaaa ttttaaaaat 480 aaattaagtt gttcagaaaa tttcctaaat tttttttttt aagcgttaaa gtnattaatt 540 attataagtt ttattgtaca ttttaaataa ataaaaaatt cttgcatatt ttttacatat 600 ttttatgatt ttgactgcat atttttacat attttcatga ttttgactgc atattttttg 660 catattttca tgatttttac tgcatattat atggcatatt tcgatacttt taagtgcata 720 aaaatccgcg ctcta 735 // ID TTAA9_AP repbase; DNA; INV; 479 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 0) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA9_AP. XX NM TTAA9_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-479 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2074-2074 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 479 BP; 142 A; 93 C; 103 G; 140 T; 1 other; cacgttgact gccacgagcc cgccggcgac ccaacacgga tgtgtaattg tggagctatg 60 gtataaataa cagagcatta ctgtaatgca ctcatgtgta tttatagata ggagagtcca 120 atcgnaattt tgaatatagc gttataatcg gaaagacctg aataatacaa atttaaaata 180 aaaatatttg ggtcgccggc ggtcaatatt actcgaaata catctataat taaagatata 240 atagtttttt ttttttttta attattcgtt tttttcccct ggacgccatg gtgtcattat 300 aaaattttta tctccacgcc atcaaaaaat aaagtgtatt gaatacaatg agaggtttga 360 attgccggcg acccaagcgg cccctgaaga ataattcggt ttgggtcgcc ggcgacccaa 420 atggctccac attaacgcat cagtgttggg tcgccggcgg gctcgtggca gtcaacgtg 479 // ID IS4EU-1_BF repbase; DNA; INV; 5615 BP. XX AC . XX DT 29-APR-2007 (Rel. 12.04, Created) DT 01-MAY-2007 (Rel. 12.04, Last updated, Version 1) XX DE A family of autonomous IS4EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; IS4EU; KW Interspersed repeat; DNA-TA-7_BF; IS4EU-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5615 RA Kapitonov V.V. and Jurka J.; RT "IS4EU, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(4), 144-144 (2007). XX DR [1] (Consensus) XX CC DNA transposons from the IS4EU superfamily are characterized by CC the TA target site duplications. These transposons are wide CC spread in metazoans, including fish, frogs, lancelet, sea CC urchins, sea squirts, insects and cnidarians. Autonomous IS4EU CC transposons encode two proteins: the IS4EU-TR transposase, which CC is similar to the IS4-like bacterial transposases, and the CC ISEU-EX DNA exonuclease (lambda-like exonuclease). Based on the CC conservation of both proteins in highly divergent transposons, is CC is clear that they are necessary for transpositions. CC IS4EU-1_BF is a consensus sequence of a young family of CC autonomous IS4EU transposons that were active in the lancelet CC genome in a last few million years. The IS4EU-1_BF transposon is CC characterized by 11-bp imperfect terminal inverted repeats, TA CC target site duplications, ATAT target sites; and it encodes two CC proteins: (i) the 426-aa transposase, IS4EU-1_BF1p, composed of CC the THAP DNA-binding N-terminal domain and catalytic "DDE" CC domain, which is conserved in all IS4EU transposases, and (ii) CC the 512-aa IS4EU-1_BF2p exonuclease, including the PHD zinc CC finger. Questions and comments send to Vladimir Kapitonov. XX FH Key Location/Qualifiers FT CDS join(107..317,913..1979) FT /product="IS4EU-1_BF1p" FT /note="IS4EU-TR transposase." FT /translation="MPTSCCALNCTNRKDKGSRKKFFRIPAEPGRRAAWLR FT ALKRSDFEQGKKNPTAAWTPRGHERVCSDHFISDEEMIADPPPESVETLRD FT PLPESVETLREQLTKVQEEKCKLEQFCESLQSEADNLREERDHLQEQIKKS FT STSASNLTDAKCKLFTGIPTVALFMFVFRLVSDFIAPLKSMCQQDQILMTL FT MKLRLGLKNADLALRFSVSASTVSKVINRCIPILAVRLKFLIHWPSKETVK FT RNLPKKFRKKKYSNCRVIIDCSEIFTERPYNLKTRAKTWSNYKHHHTFKFL FT VGITPYGAVSFLSSSWGGRISDKDITLKSRFYNKVEYGDMIMADRGFTISE FT ELALKGATLVMPPFTTGRTQLPGRIVQDARQISTLRIHVERAIERIKNFQI FT LSQICPLTFAPLLSDSVVICAALTNLLPKLIK" FT CDS join(5231..5060,4898..4739,4520..4427,4129..3710, FT 3185..3050,2783..2230) FT /product="IS4EU-1_BF2p" FT /note="IS4EU-EX lambda-like exonuclease." FT /translation="MSSSNSYISSLEPNDRTRYFEKLMVSVEDAGDSSNPE FT VTGSAVTGDGVRLPDPYSLTGWKDDLSLWPDTDYGCIYTYLIEAPGPFNGE FT AMKAYKSLEAYNLFISGHVRECRTESNRDSSQPMGVFDQKRGLCHGSSLHL FT YGRVLLVDVFRFLGNLELSTVPESEYTNTYHHVDVDNAKRNVYIVEIFKYH FT RDFHNEKIRQIFHCTVKLANLSVPFTINSKIAPDYSTYNVYLFFVIFQCRL FT GEVCSHVAALLFAVEASRSLQEKRTSCTSRKAVWNKYYKDKVPAIQWGMKQ FT EEKARQQYLEATLCICKDVKLEEVGLILYKDFPFIGASPDGMRQCSCHGRT FT VIEIKCPYKYRDIYPAADLCLSDANYCLDKDCKLKTGHRYYTQVQTHMLVT FT GVSTCDFVVWTNAGMVVVNVPRDQSLLNVMLSTCVQFVKSSLIPEILTHKL FT QCGDSGVDVTFQEDDDPNKRYCSCQKPAYGRMVKCDNNECESEWFHYKCVG FT IKRKPRGHWFCPSCQC" XX SQ Sequence 5615 BP; 1812 A; 1182 C; 1030 G; 1584 T; 7 other; cgagcgatcg cgctgacgtc acggttacgt cacgtgacgt catggtaggc aaaaagctgc 60 tggttgaacc ctcctcgccc accagttgcg cgcgggaaac gcaaacatgc cgaccagttg 120 ttgtgcgctg aattgtacta accggaagga taaaggcagt agaaagaagt tttttagaat 180 ccctgcggaa ccaggacgaa gggctgcatg gcttagagcc ttaaaaaggt cagattttga 240 gcagggaaag aaaaacccga ccgcggcgtg gacaccacga ggccacgaga gagtgtgcag 300 cgaccatttc atctcaggtt tgtgatctat atgttttata ttttgttgtt gtcccccttg 360 cataaataca tttttcgtga tgtaatagtg tacttttcgt tacacatctg cggaaagctt 420 aagtaagttt tgtcgctgat tcatgccata tyaargacag tcaatgttaa tgytatatgt 480 atcaaagttt tacaacgtat ayrcgtgcaa aagggtgggg gtgtctgacc tacactaaat 540 ggtaagggca ttacagaggg aataagaaaa gtgtctaatg gatasaatca gtaatcctaa 600 acatataaat tcttgatcaa cagtttaatc watcattctt gctcaacaag taaatctatc 660 acttgtcact tattcaacat gaatcacata ccatacacac aagactactt gctacaaagt 720 ttatccgctt ggttttcgga gtggggatat agatgtgaat caagtagaga agcatgcaac 780 ctgcatgagt aaaatattga aacaacaata catacatatg aatcagagtt tagaacattc 840 agttcagttc agatatatga cactcataat cacatagcac tttcattgaa cttcggattt 900 gctttatttc agatgaagaa atgatcgctg accctcctcc tgaaagtgta gagactctga 960 gagaccccct tcctgaaagt gtagagactc tgagagaaca gctcacaaaa gtacaagagg 1020 agaaatgtaa gttggaacag ttttgcgagt cccttcaaag tgaagctgac aaccttaggg 1080 aagaacgtga tcaccttcaa gaacaaatca agaaatcatc aacatcagcc tccaatctca 1140 cagatgcaaa atgcaagctc ttcactggaa tacctactgt tgcactgttt atgtttgtgt 1200 ttagacttgt tagtgatttc attgctcctc taaagtccat gtgtcagcag gatcaaatac 1260 tcatgactct catgaagttg agactaggcc tcaaaaatgc tgatctagct ttgagattta 1320 gtgtttctgc ctccacagtt tcaaaagtca taaaccgttg tattccaatc ttagctgtcc 1380 gactaaagtt cctgattcat tggccgtcca aggaaaccgt aaagcgcaat cttcccaaaa 1440 aatttcggaa aaaaaaatac tcaaactgta gagtcatcat tgattgctca gagattttca 1500 ctgagaggcc atacaattta aagacccgag caaagacctg gagtaactac aaacaccatc 1560 atacgttcaa atttctggta ggtattacac cttatggagc agtgtcattt ctgtccagtt 1620 cctggggtgg aagaatttct gataaagaca ttacgctaaa aagtcgtttc tataacaaag 1680 tggaatatgg tgatatgatt atggcagatc ggggattcac catttctgag gaacttgccc 1740 tgaaaggagc aacactagtt atgccaccat ttaccacagg aaggacacaa ctcccggggc 1800 gtatcgtcca agatgccaga cagatatcaa ctctaagaat tcatgtagaa cgtgcaattg 1860 aaagaatcaa gaacttccaa attctcagcc aaatctgtcc attaacattt gcccctctac 1920 tcagtgacag tgttgttatc tgtgcggctc ttacaaactt acttccaaag ctaatcaagt 1980 aatttttaat ggcaatgttg tctgcacgag ctggcctttt ctgtatatca aaatactgag 2040 attgaaggta aaatgttaaa gggacacaaa acaaattttt gggaagccca ctacaatata 2100 ataagtatac cacaaagtac aattcttact tgttttacat tagtttttta aaacatgaca 2160 taaaactgta aaatgccttc tattgttata ctagtatttt tagtatttca aactgtccag 2220 taggatttag cattgacaac tgggacagaa ccaatggccc cttggttttc tcttgatgcc 2280 aacacatttg taatggaacc attctgactc acattcattg ttatcacact ttaccatcct 2340 accatatgct ggtttctgac aagaacagta acgtttgttt gggtcatcat cttcctggaa 2400 agtcacatct actccagagt ctccacactg taacttgtgg gtcagtatct cagggataag 2460 actggacttt acaaactgga cacatgtaga caacattaca tttaacagtg actggtccct 2520 tggaacattt actacaacca tgcctgcatt agtccacact acaaaatcac atgttgagac 2580 accggtaacc agcatatgtg tctggacttg ggtgtaatat ctgtgccctg tctttagctt 2640 acaatcctta tctaaacagt agttggcatc tgatagacac agatctgcag caggatagat 2700 gtccctgtac ttgtatgggc acttgatctc aatcacagtt cttccatggc atgaacactg 2760 tctcatccca tcaggagatg cacctatata tacaacagta atgttcaaat tagtggcaaa 2820 gtttctgata tttcaaacat gaacataatg tacttacttt tgatacttgc caaatgctgc 2880 aaatggcaca tattcacttg gtacgaatac agtgcaattg ttaaaacaag tacatgtatt 2940 tcattgttac atcaacaaaa tcataagtca attagacatt gtgtagttgg tactcaatta 3000 catgtactat acttcataag tttgaagtaa gagccaaaat tagctatacc tatgaaggga 3060 aaatccttgt agagaatgag ccccacttcc tccaatttca catccttgca gatgcacaag 3120 gtagcttcca ggtactgttg ccttgccttc tcctcctgtt tcatccccca ttgtatagct 3180 ggaacctgaa agtgaaattc cctatttaga atttctgaca tagaaagcac aaatgaatta 3240 catctctaat atgacactat ctcgcattga ctgaacattt agactgtcac actttgcaga 3300 aatatttaga ctcctggtat tgtttaaatc caaataaaga tcagagctac acttgctctg 3360 ggattattat ggtacatgtt ccaatataca gggtaatgtt gtctctttgt gtgccttaat 3420 tacctacgaa aaatggcttc ctgcaagtgt acactgtact agttttcagt taactcactt 3480 ggctcaaatc cttgcagtca tattgcatga ctaggaacat ctactatatt gctgtaaatc 3540 tatagtgtag tgctatatat tgtgaattgg atacatgtaa atgccacatg attttgtaga 3600 agcaaagaaa tccattcatt aatgtaagtt ataacagtct acttaggcaa aaaaattgta 3660 gacattgaaa aaaattctac tcaattataa tgcccaaata tgaacagacc ttatctttgt 3720 aatatttgtt ccacacggcc tttctgctgg tgcatgacgt ccgtttttcc tgaagagacc 3780 tggaggcctc cactgcgaac agcagtgcag ccacatggct gcagacctct cccaatctac 3840 attgaaatat aacaaaaaat aagtatacat tataggtact atagtcagga gcaattttag 3900 agttaatggt aaaaggaacg gataagttgg ccagcttcac agtacaatga aagatttgcc 3960 ttatcttctc attatggaag tctctgtgat atttgaatat ttctactata tacacatttc 4020 ttttagcatt atctacatct acatgatgat atgtgttagt atactcactc tcaggaactg 4080 ttgataactc caaattacct agaaaacgaa acacatcaac aagtaaaacc tgcaacaaga 4140 acatcaacaa gaagaaatgc atttgcaaca ggtacaaata caactggaaa ccatcaagca 4200 caagaacatg aaccaaggaa aaaaattaag acagttatct tgccacatac aaactgcatg 4260 gtcgctcctc cctggaaata tacggaggaa catttacagg tgagtactag tttgactttc 4320 attatttcta aaactgtttc ttgccatgga cctcatctga cgagaacact gttagataaa 4380 aatgtgactt tttggatatg atggtcaatg cagttatatt acctaccctg ccatacaagt 4440 gcagtgagct gccatgacat aaccctcttt tttggtcaaa cacacccatg ggttgtgagg 4500 agtctctgtt actctctgtc ctgaaacaca aatgtaacag aaaaaaaaga taatacagta 4560 aatataaaaa cgtatctgac agttaaatac taatcacgca tctcattctt catcttcaag 4620 agttctcact acaggtacat gtaagtagaa atattgttcc aatgcactcc aacacccacc 4680 tggcacaacc ttggccttaa ggaagcagac cttcacattt tttccaatag ggtggtacct 4740 gcactccctc acatggccac ttataaagag attgtacgcc tccagggatt tataggcttt 4800 catggcttct ccgttaaacg gacctggagc ctcgattagg taggtgtaga tgcacccata 4860 gtccgtatct ggccagaggc tcaaatcatc cttccaacct gtataacata caaatgataa 4920 taaacataac gttacgtagc cagaatttct agagattagc aacaaacaca tttacacgta 4980 ctcatgtttt tgtaaccttt atctgacaca aaaatctacc ttatcaaaat ctagaaacac 5040 atgtacacat gatcctcacc tgtcagactg tatgggtcag gtaggcgtac accgtcacca 5100 gtcaccgccg aaccagtcac ttccgggttt gaactatcgc cggcgtcttc gacacttacc 5160 attaatttct caaaatacct ggttcggtca ttaggctcca aagagctaat gtagctgttt 5220 gaagaactca tattgattaa acgtattaat taaaccacat taggaacaga aatataacag 5280 cccggaacgg atgacagaaa gcccgtacgg cgggaaaatc aaaacaaccg gaaacacatg 5340 acggaagtca agagggcaag ggtcattttc tcccaaaata ttacacaaga aaaaaaaaaa 5400 aaaagactaa gaagtagtga aaaaaataca aaatctgaaa aaatcccatg gatgcgataa 5460 aaacgtggct gaaaatgcgc aaattcgcgt aaacagtacg atctgcttcc gcgccggtat 5520 gctaattagt aatctgatct cctaacgctt tttgcctacc agtcagcgct cgtgcgccga 5580 ataccccgga tgtaaacaat aacagcgatt gctcg 5615 // ID Copia-99_AA-LTR repbase; DNA; INV; 241 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-99_AA_; KW Ty1_copia_Ele2; Copia-99_AA-I; Copia-99_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-241 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 241 BP; 58 A; 62 C; 41 G; 79 T; 1 other; tgttgaacgt aagtatcttt ttaccacagg ggcgctgacc cactgaactg gaaataaagt 60 gtaaacccac cttttkttat ttccgttgtt acctctcgtt ctcaatcgat cattaaagtg 120 aacacgtttg taaacgcaag tcacgtcgtc gcgagttaat ttttctccga atcacaattt 180 ccccacgcag taatcgattc cctgtgttat tctctgctct ggtggtcaat tccttcccac 240 a 241 // ID BEL-50_AA-LTR repbase; DNA; INV; 288 BP. XX AC supercont1.300; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-50_AA_; KW BEL-50_AA-I; BEL-50_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-288 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.300; Positions 1071807 1072094. XX SQ Sequence 288 BP; 72 A; 65 C; 63 G; 88 T; 0 other; tgtttgggga tttttgctga aatttgtacc actgtgcgac caacgcatct tctaacgtta 60 ttgctgtgtc gggcaacact gtttgcacca tctacgaaga gaagaaaaat aattgaaacc 120 acttgtattc gaacagaaga gtaataaaca cgtgaaagtt tgtccgtatt ttcaatgcgt 180 gttccgccgg tagaaagatt ccgaccgtcc gaatcgtgtt ctcgggttac cgattgttgt 240 ccaagttccg ccgcgacctc tccgctattc gctttcgtta ttgataca 288 // ID RTE-3_CQ repbase; DNA; INV; 4043 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An RTE non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4043 RA Kojima K.K. and Jurka J.; RT "RTE non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 613-613 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with ~98% CC identity. ~80% identical to RTE-2_CPi. XX FH Key Location/Qualifiers FT CDS 933..4016 FT /product="RTE-3_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MVLRRDRGLVQATRTRRKTPVQEAHDASRTNRHGTGH FT HMRSHDWKLGTWNCRSLKFDGSIRILSDILRVRKFSIVALQEVGWIGAEEV FT QAYPRIGCTIYQSRGENKRLGTAFIVLGEMRDRVIGWTPLTDRMCVLRIKG FT RFFNISIINVHSPHSGSEDDDKDAFYEQLNWTYNSCPKHDVKIVIGDFNAQ FT VGQEEEFRPVIGKFSAHVRTNENGLRLIDFATSKNMAVRSTCFQHNLRDKY FT TWRSPQGTESQIDHVVIDGRHFSDIIDVRTYRGANVDSDHYLVMVKMRQRL FT SLAKSVRYRRPPRLDLERLKLPEVASRYAHSLEAALPGEGELLEAPLEDCW FT RSVKAAITNAAESTIGFVERGRRNDWFDEECRAILEEKNAARRAMLQYNLR FT DYEEAYGQKRRQQHQLFRAKVRHQEELEFEDMEQLHRSNETRKFYKKLNGS FT RNGFTPRVEMCRDKNGAILTNEREVIDRWKQHFDEHLNGAEAEAGVQGGRR FT EDFIGTAGEGEEPVPTMREVKDAIKKLKNNKAAGKDGIGAELIKMGPEKLA FT SCLHRLIVRVWESEQLPEEWKEGVICPIYKKGDKLDCENYRAITILNAAYK FT VFSQILFSRLSPIAEGFVGSYQAGFVMGRSTTDQIFTVRQILQKCREYQVP FT THHLFIDFKAAYDSVDREELWKIMDENGFPGKLIRLIKMTMDGARCCVKIS FT GAESDSFTSLGGLRQGDGISCLCFNVVLEGVMRRAGFNMRGTIFSKSNQFI FT CFADDMDIVGRTFKAVADAYTDLKREAEKVGLRVNVAKTKYLLAGGTESLR FT ARIGPSVTIDGDEFEVVEEFVYLGSLVTSDNSCSREIRRRIIAGSRAYFGL FT HKSLRSRKFSLHTKCSIYKSLIRPVVLYGHETWTMLEEDLRALSVFERRVL FT RTIFGGVYENDGWRRRMNHELAQLYNEPSIRKVAKAGRLQWAGHVARMPER FT ADHLSQRNQKINPAKLVFVSEPVGTRRRGVQRARWVDQVESDLESVGAPRN FT WRNAAMDRACWRRIVQQAKLMV" XX SQ Sequence 4043 BP; 1039 A; 993 C; 1216 G; 795 T; 0 other; tgtgtgtgcg ccgtgcacat cggtattgtt tttcgtaccg ttcccgcgaa agtgcaaaat 60 ttacacgaaa atacgcgtta gtggggtgtg aaattcattg catccgtgtt ctgaacccgt 120 ctcctggccc gggaaagtga tagtccgtgt tttacgaccg cgagtgagac aaatcaatca 180 ccagatttga ccgaatatcg cgttccgcca cgggcaattg aaaaaaagta caaagaactg 240 ctaaggcatc gtcgtcgtcg tcgcgatcga tggtcgttgt attgtgtcgc gcggcgtgtg 300 gaagtgtacc gcgaagacca tactgacgcg tgtcgtcgtc gtcctcgtca ctgtgtctgt 360 tgctacgaga gtcgtgtata gaagagaagc cgttggtgca cggtggggtg aattgctagg 420 aaagctacag aaaaaaaaag tgctgcaaag ttttttgggt gaaattaaaa agaaggaaaa 480 atcaagaagc gcaaccctgc tgcatacaca gccccccccc cctcccctct agttccgttt 540 ctgggcgtct gcaccccatg ttaggggcgg cccaaacgga ctaggtggtc gattccttct 600 atccgtgagc ggctgttccc cacgttaggg gcggctcgaa agaagtggcc gacacccccc 660 cccccccccc cccacccttc tttgagcgtc tgttccccag gttaggggcg gctcaaagaa 720 accggtgtcc tgctctatcg tcgaggtaag cgtctgttct ccaggttagg ggcggcttac 780 agcagataga gttcggaccc cccaccccct ccccccatcg agcgtctgtt ccccaggtta 840 ggggcggctc gaaacagcgt ctgtacccca ggttaggggc ggctgagtaa aagtccttgt 900 gtcggcgtgg gactgtaaac agtaccggca cgatggtcct ccggcgagac agggggttgg 960 tgcaggccac acgaacccgc cgtaaaacac cagtgcagga agcacacgat gcgagccgga 1020 ccaatcggca cggaactgga catcatatga ggtcccacga ttggaagctc gggacgtgga 1080 attgtaggtc tctcaaattt gacgggagta tccgcatact ttccgacata ttgagggtcc 1140 gcaagttcag catcgtagcg ctgcaggagg ttggctggat aggcgcggaa gaggtacaag 1200 cgtacccaag gattggctgt acaatctacc agagccgcgg cgaaaacaag aggctgggga 1260 cagcctttat agtgctgggc gaaatgcgcg atcgcgtgat tgggtggacc ccgctcaccg 1320 accgaatgtg cgtgctgagg attaaaggcc gtttcttcaa cattagcatc ataaacgtgc 1380 acagcccgca ctcaggaagc gaagatgacg acaaggacgc attttacgag cagctgaact 1440 ggacgtacaa cagctgccca aaacatgacg tcaaaatcgt catcggagat tttaacgctc 1500 aggttggcca ggaggaggaa ttcagaccgg tgataggaaa gttcagcgcc cacgtacgca 1560 cgaacgaaaa cggcctgcga ctgatcgact tcgccacctc caaaaacatg gccgtacgaa 1620 gtacctgctt ccagcacaac ctccgagaca agtacacctg gagatcaccg caaggaacgg 1680 aatcacaaat cgaccacgtc gtaatcgacg gtagacactt ttccgacatc atcgacgtca 1740 ggacctatcg cggcgccaac gtcgactcgg accactatct ggtgatggtg aaaatgcgcc 1800 aacgactttc cctggcgaaa agcgttcggt accgccgccc tccgcggttg gatctggagc 1860 ggcttaagtt accggaagtc gcatcccggt acgcgcattc gctggaggct gcgttgccag 1920 gggagggtga gctgttggaa gctcccctcg aggactgctg gaggagcgtc aaggcagcca 1980 tcaccaacgc agcggaaagc accatcggat ttgtggaacg aggacgacgg aacgattggt 2040 tcgacgagga gtgtcgagcg attttggagg agaagaatgc agcacggagg gcaatgctgc 2100 agtacaatct ccgtgattac gaggaggcgt atggacagaa gcgaaggcag cagcaccagc 2160 tcttccgagc aaaagtgcgc caccaggaag agttggagtt tgaggacatg gagcagctgc 2220 atcgctcaaa cgaaacgcgc aagttctaca agaagctcaa cggatcccga aacggcttca 2280 cgccgcgagt cgaaatgtgc cgggataaaa atggagctat cttgacgaac gagcgtgagg 2340 tgattgacag gtggaagcag cacttcgatg aacacctgaa tggcgcagaa gcagaggcag 2400 gggtccaagg cggcaggaga gaggacttca tcggtacagc gggagaagga gaggagccag 2460 ttcccacgat gagggaagtt aaggatgcca tcaagaagct gaagaacaac aaagcagcgg 2520 gtaaggatgg tatcggtgct gaactcatca agatgggccc ggagaagctg gcgtcctgtc 2580 tgcaccgact gatagtcagg gtctgggagt cagaacagct accggaggag tggaaagagg 2640 gagtaatatg cccgatctac aagaaggggg acaagttaga ttgtgagaac taccgtgcca 2700 tcacaatcct caacgcggcc tacaaagtgt tctcccagat cctcttcagc cgcctatcgc 2760 caatagcgga aggttttgtt ggaagttatc aagccggatt cgtcatgggg agatcaacaa 2820 ccgaccaaat cttcactgtg cgacaaatcc tccaaaagtg tcgcgagtac caagtcccca 2880 cgcaccacct tttcatcgac ttcaaagccg cgtacgactc agtcgatcgc gaagagctat 2940 ggaaaattat ggacgagaac ggttttcccg ggaagctgat cagactgatc aagatgacga 3000 tggatggggc taggtgttgt gtgaagatat cgggtgcgga atcggactcg tttacttcac 3060 ttggggggct tcggcaaggc gatgggatct cttgtctttg tttcaatgtc gtgctagaag 3120 gtgttatgag acgagcgggc ttcaatatgc ggggcacgat cttcagcaag tccaaccagt 3180 tcatctgctt cgccgacgac atggacattg ttggcagaac gttcaaggcg gttgcggatg 3240 cgtacaccga cttgaagcgg gaagcagaga aggttgggct aagggtgaat gtggcgaaga 3300 caaagtacct gctggcagga ggaaccgagt cccttagggc tcgcattgga ccaagcgtta 3360 caatcgacgg ggacgaattc gaggtagtgg aggagtttgt atacctcgga tcgttggtaa 3420 cgtcggacaa cagctgcagc agggaaattc ggaggcgcat catcgctgga agtcgtgcct 3480 atttcggtct tcacaagagc ctaaggtccc ggaaattctc cctacatacg aagtgttcca 3540 tctacaagtc gctgataaga ccggtcgtcc tctacgggca cgagacgtgg acaatgctcg 3600 aagaggactt acgagcgcta agcgtcttcg aacgtcgagt gctaaggacc atctttggcg 3660 gcgtatatga gaacgacgga tggcggcgga gaatgaacca cgaacttgca caactctaca 3720 acgaaccaag catccggaaa gtcgcgaagg ctggacggtt gcagtgggcg ggtcatgttg 3780 caaggatgcc ggaacgagcc gaccacttga gccaacggaa ccagaagatc aatcctgcga 3840 agttggtgtt tgtgtcggag ccggtaggaa caagacgtag gggggtgcaa cgtgcgaggt 3900 gggtggacca agtggagagc gatttagaaa gtgtgggtgc gccgcgaaat tggagaaatg 3960 cagccatgga ccgagcttgt tggcggagaa tcgtgcagca ggccaagcta atggtgtagc 4020 gccaataaaa gtaaagtaag taa 4043 // ID Gypsy9-LTR_AP repbase; DNA; INV; 385 BP. XX AC Contig13293; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9AP; KW Gypsy9-I_AP; Gypsy9-LTR_AP. XX NM Gypsy9-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-385 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 454-454 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 385 BP; 91 A; 101 C; 63 G; 130 T; 0 other; tgtggcgacc cacaacaatg attattttgt aaccgcggta tcagcgcgtc agccgccgcc 60 agctgcgcgc ccctccctta attttgtata ctgcgcgtta cctaccgcct ccatagcgtg 120 accgactagc gctcactttt ttcggacgta cgtcgtcaac cgcttatacg ctccgtcgac 180 cgtcgttcgt aatcccattt taattgttct cgtaaaccac gtttttatgt tcctcgcgtc 240 caacgcctaa aatgtactat aatatttatt tctattgact gaattatttt tatttaatta 300 aactgaaacg ttatatttta acacattatt ttttatttat ttaaattggt aacccgagta 360 gtcggctaaa cgccgcatct tacca 385 // ID IR2_LM repbase; DNA; INV; 538 BP. XX AC . XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Leishmania major interspersed repeat. XX KW Interspersed repetitive sequence; IR2_LM. XX OS Leishmania major OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania major species complex. XX RN [1] RA Ivens A.C., Lewis S.M., Bagherzadeh A., Zhang L., Chan H.M. RA and and Smith D.F.; RT "A physical map of the Leishmania major Friedlin genome."; RL Genome Res 8(2), 135-145 (1998). XX RN [2] RP 1-538 RA Gentles A., Kohany O. and Jurka J.; RT "L.major interspersed repeat."; RL Direct Submission to Repbase Update (16-MAY-2005). XX DR [2] (Consensus) XX SQ Sequence 538 BP; 88 A; 154 C; 198 G; 90 T; 8 other; gggggacatt gcagcgcgtg gtatctcagg gcccagtgcg ttcacactat tccacggggg 60 agcmgagaag ccccccctat ccccttccct gccagctgca gagccrcttc tgctggtgac 120 agggccatcc acccacgacg taggggaggt cagagcgatg catcgctgtt gatgtcggcg 180 gtcaaggccc tgggatggcg ttgcrtcgga gcsacctgcg acagcgaaca cgtctgtgcc 240 atccgcctga tgggcgaagt gtccgcgtga ctcgagcgca tctcgcacgg tcctcgctgc 300 ctactggtgt ggggagcgtg ggccaccccg agggtcgcac gaggtggcga ccggcacaat 360 gggcatggct gtgaggcgac ctgcgaggag gctgggtgga ggagcttgag gcaggggccg 420 wgctcagatg actgrgtcgg cgcattgctg tagcgcgcgt ctgcggctkc ttcgcaccac 480 gcgggtgggg cctgcgacwg gccggagggg gactgtcatg gcagaagaga ctgagaaa 538 // ID BEL3-I_AP repbase; DNA; INV; 6922 BP. XX AC Contig25132; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL3AP; BEL3-I_AP; KW BEL3-LTR_AP. XX NM BEL3-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-6922 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 433-433 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [5374-5967] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 2353..6183 FT /product="BEL3-I_AP_1p" FT /translation="MSYKCKSSCSVCKRRHHGLLHKDPSTSDPSSKVPKVA FT MFAKHQNQLPSVVLATALLHVQDVAGSPQTVRALIDGGSQISAISANCCQR FT LGLRAAKWTLPVTGLSELPVPSVLGIVDLHIQPRDSSQSSMPVRAWVLSSI FT TSDMPTSKLPSTVREKCGDLSLADPLFDIPAPVEILLGADIYPMVWSHETV FT SLGHGYPTAFNSIFGWAIVGPLQHIKAPSPRALPVQISSSVESLMEKFWNV FT EEPEAAPPVFTQEGQCEEIFLSEMCKNGKGQYMVPLPFRDGQPSSFPGMRQ FT IAVNRLLQLERKLSRDPVLYNNYRRFMVEYESLGHMSEADSPGDYYIPHHA FT VYKAEGENMKLRVVFDASARCRSGSSLNEGLHVGPKLQQDIVDVLTGFRVH FT TVAFTTDICKMYRQIWVLEKYRGYQHILWRSSPQLQIRENTLNTVTYGVNS FT APYLALRVLRHIADNDCEEVPEVSKALKFQTYMDDICVGAPSLECALSLKS FT DLIKTLSRSGLMLKKWSSNEPRLLSGLPLEDLAGDPLTFDRGDGIPVLGMQ FT WRPTADHFVYDITAIKSVLSKRGVLSVIARIFDPLGFLSPVIFHAKCIMQR FT LWSAQTSWDEPLPPAIAKEWQQFLDMLGWLTEIRIPRCIGGSIGIEYSLCG FT FCDASERGYAAVLYLRVTDPSQKVNVYLLGAKTKLTSLKPTTIPRLELCGA FT VLLASWLSRMHRILEAHMKISGVYAWSDSTIVLSWLLNPHVALKVFVSNRV FT HHIKTLLPVCKWAHVRSEENPADCASRGLSPAELVKAQLYWSGPEFLRSPV FT DHWDLHPTTMPGDQLPEVHSVALLITTSIRKGEWFTRLSSYSIMIRVIARL FT RRFIGRCRRQETVSGNLTRSELDQALYVLVKCTQGCLLSSLLRELSSGSAV FT SSKVYAKLSPFIDEFGVIRVGGRLQNAMCSWDRKHPILLPKESHLSALIAR FT HWHLSACHARSNLLMSLVNSRFWIMGVRRVVYKAIKSCITCVKLDGVNPQP FT KMADLPVSRVQACRAFTAVGIDYAGPLMMKETKLRKAREYKVYIALFVCMS FT TRAIHLEVVLDLSTDAFLAALDRFVVRRGVPTSIHSDCGTNFVGAARKLRE FT LINSPSSRDQCSSRLMCHWNFNPPSAPHFGGLWEAAVKSTKSLLTRSLGAQ FT TWTLEEFTTILCRVEAALNSRPLTPVTSDPEDLDCLTPGHFLIGQPLMSVP FT EEELSVKPVNIQRRWKLLQQTFQTFWRRWSSEYLNTLQARIRWSSSQENVK FT VGDMVVIKDNTAPPLL" XX SQ Sequence 6922 BP; 1817 A; 1411 C; 1574 G; 2120 T; 0 other; gagggccgtt gctgtgtagt cggcgacgct ccgtaatata gattttatat tatagacata 60 tgttatcaga ctacttattt tagaaataat attgttgtgt tcacctggga accgtcttga 120 ggaggtgagg ctgccggcgt taactgtcat aagtgcgcac gtggccaaca ttttggtagc 180 agagcgtggt tcattaagga tcttctcttc tggctgttac ttggtgagct tacttctgtt 240 ttattagatt aaggtgtcat tccctcccat tatagtgtta agattatgac tgtcggtggg 300 taacagtatt atttagttgc agcagtccgt tgatccacgg atgtggcatc taataactat 360 taggcacctt tcaatattaa gtggttacat ttgattgaac aatatataag actgatcagt 420 ttaaatacct cagttctctt agtttagtag gttttctctt aacgttaata attaggatgt 480 catcaatacc tattatgtta gtgttattga tttaatattt tttatttttt gtttggatca 540 catattttat tgttgaccta gcaaagttat tatattttgt taaatttaaa caggttaaaa 600 cttccctgtt attactactg atattattac tacttagtac tactgtaggt agctaccata 660 ctattttatt atttattagt agtaatttgt tatgtttgtg tgaacttcac tggatctctg 720 ctctatttgt tgattattat tattttataa tctagtattc tactgcaatt gttggtagaa 780 tttgttagta ttctttaaat ttttgttgat taagtactga tcagtattga caatataata 840 tatttattta agtgggtttg gatatatgtg gttgctgata tatgtgcaat cacattgaat 900 tgttatttcc aaacaccagt tgtattagtg cttaagtgtt aaattaattt gtgctgcttc 960 ttgctcattg ggtcgttgcg ttagttcctt ttaggtgtgt gcagttgtta tctttgccac 1020 catgtcagat gaagagatca agactcgcaa cacaagactg aaacagcgtg ccacagctaa 1080 gcgagacctc gctgcaagtc atattcgtcg tctacatgct tcagcaaagg cagcaggaga 1140 tgatccaaca gcaaggggtc taattatgac ggctggtcag gacctgaata attggtggtc 1200 gcaatattca gtggaaagtg atgcgttatt gaacgttatg gttgagttgg atgaaataga 1260 tcaattttct ccggatgctg atgctgatgt ctacgccatg gtggtagaga tcaaatcaat 1320 catcaacaat tatgaacagg aaaccttgaa gtcatccaaa gacgctagac cagttataca 1380 gatcagtgat actgcaataa tggcgaatcc aagtgaagat catcagtcga ttacacaaac 1440 tagcaacgca gccatacttc caaatacagc ttccgtaatg gaaacacagt cgtcagctcg 1500 tgtgagtagt tctccgtcgt cagatgtctc cttccgtcct caagtatcaa tgcgcctccc 1560 aaaaatccct ctgccaaaat tcgatggtga tctccgttag tggccagatt tccgggaccg 1620 cttcgttaat ctagtaatcc gcaacaagga tattgattca gatagcacca gattttacta 1680 tttacttggg tgccttgaag ctgatgcggg ggaagtacta aaagggattc cagtatctaa 1740 agacaccttt cagttagcct ggaacacatt agtgaaatcc tacgataaac cgagaaaact 1800 ggcctcctca attattgaga gttttttagt tgctcccgta tccacttcag aatcactggt 1860 agatctcaaa cgttttttga gtgtatttga tgaaggacta gcaatattag aatcgttgca 1920 cttaccagac ctgtgttcgt tcctgctgtt tatcatcgct tcaaaatctc ttccaactca 1980 tacgcgtcgc ttgtttgaat ctgataattc agcagagtat ccatcagttg acagtgtgct 2040 ggaatttgtg aagaataggg tgagggtcct ggaaaatgct ggtggtactt caggaccagg 2100 gaaatcattt ggtggaagca acaagaaacc taatccaggt ccagggaagc attacaatcg 2160 tccttccact acccaaacag cgttggtgtc atttgtaaag actcagaaac ctaagtccac 2220 tgattccgaa aagtgtcgtt gctgtggggg agctcatgca ttgaaagatt gtgcaaaatt 2280 tctgggagca tctgtagatg atcgatacca tgatctcaca gactatgttt ggtatgcttt 2340 gaaagtggcc atatgtcata taaatgcaaa tcgtcctgta gtgtgtgcaa acgtcgccat 2400 catgggctat tgcacaagga cccttctact tctgatccgt cgtccaaggt tcctaaagtg 2460 gccatgtttg ccaagcatca aaatcaactt ccatcggtag tactagcgac agcactgctt 2520 catgttcaag atgtggctgg aagtccacag acggttcggg ccttaattga tggtggctca 2580 cagatcagtg ccatatctgc aaattgttgt caacgtctag gtctacgtgc agcgaaatgg 2640 acgttaccag ttactgggtt atcagagtta cctgtaccaa gcgttttggg tatcgtagat 2700 ctacatatcc agcctcggga ttcaagccaa tcttcaatgc ctgtaagagc ttgggtacta 2760 tcttccataa catctgatat gcctacttcc aaactaccgt caacggttcg ggaaaaatgt 2820 ggtgacttgt cattagctga tccgctattt gatatcccgg ctccagttga aattttgttg 2880 ggggcggaca tataccctat ggtctggtct catgaaactg tctcgttagg tcatggatat 2940 ccaactgcgt tcaactccat cttcggttgg gcaattgtag gtccacttca acacataaag 3000 gctccaagtc ctagggctct acctgttcag atatcttcct cagtcgaatc cttaatggaa 3060 aagttttgga acgttgagga accggaagct gctccgcccg tgttcactca ggagggtcag 3120 tgtgaagaaa tctttctgtc agagatgtgc aagaatggta aaggacagta catggtccca 3180 ttgccgtttc gtgatggtca gcccagttct tttcctggaa tgcgacaaat tgcagttaat 3240 cgtctgttac aactggagcg caaactatcc cgagaccctg tgctgtacaa taactacaga 3300 aggttcatgg tggaatatga gtcgttaggg cacatgtctg aggcagattc tccaggagac 3360 tactacatcc cgcaccatgc tgtttacaag gccgaagggg aaaacatgaa attaagggtg 3420 gttttcgacg cctctgctcg ttgtcgatcc ggttcctcac tgaatgaggg cctccatgtt 3480 ggcccaaagt tacaacaaga tatagttgat gtcttaactg gctttagagt ccataccgtt 3540 gcgttcacca cagacatctg caaaatgtac cgacagattt gggtactgga aaaataccgc 3600 ggataccagc atatactctg gagaagttct ccacagctcc agatccggga gaacacgttg 3660 aacacggtta catatggagt aaacagcgct ccttatttag cgcttcgggt ccttcgtcat 3720 attgctgata atgattgtga agaggtccct gaagttagca aggcgctcaa attccagaca 3780 tacatggacg acatctgtgt tggggcacca tcgttggaat gtgctttatc attgaaatct 3840 gatttaatta aaacattgtc cagatcggga ttaatgttga agaaatggtc tagcaacgaa 3900 ccacggttgt tatcaggcct tccgctggaa gacttagcag gcgatcccct cacatttgac 3960 cgtggtgacg gcattcctgt gctaggtatg caatggcgtc caaccgctga tcatttcgtg 4020 tacgacataa ccgcaatcaa gtcagttttg tccaaacgtg gggtcctatc agtaattgcg 4080 aggatcttcg acccgttagg gtttctatcg ccagttattt tccatgccaa gtgtattatg 4140 caacgtttat ggtcggctca aacttcatgg gatgagccac ttccgccagc catcgctaag 4200 gaatggcaac aattcttgga catgctaggt tggctgacag agatccgcat tccacgttgc 4260 attggcggtt ccatcgggat agaatattca ttgtgcggtt tttgcgatgc ctctgaaagg 4320 ggttatgctg cagtattgta ccttcgtgtg actgatccgt ctcagaaggt aaacgtgtac 4380 ttgctcgggg caaaaactaa actgacatcg ttaaaaccga ccacgatacc cagattggag 4440 ttatgtggag ctgttttgtt ggcttcttgg ttatcacgga tgcatcggat actcgaagca 4500 cacatgaaga tttcaggtgt ttatgcttgg tcggattcga ccatagtcct gtcttggttg 4560 ttgaaccccc atgttgcctt gaaggtcttc gtctccaatc gcgtccatca catcaagacc 4620 ttgcttccag tctgcaagtg ggctcatgtc aggtctgaag aaaacccggc tgattgtgca 4680 tcacggggac tgtctccggc agagctagta aaagctcagt tgtactggtc gggtccagag 4740 tttttaagat ctccagttga tcattgggat ctccatccaa ccaccatgcc aggcgaccag 4800 ctaccggaag tccactcagt agcccttttg ataacgacat caataaggaa aggcgaatgg 4860 tttacgcggt tatcatccta cagcatcatg atccgtgtga tagctaggct acgtcgtttc 4920 attggaagat gtaggcggca ggaaacagtt tctggtaatt tgacacgttc agagttagat 4980 caagctttgt acgtgttagt caaatgtacc cagggatgcc tgttgagttc acttcttcgg 5040 gaattgtcaa gtggtagcgc ggtttcttcg aaggtgtatg ctaaacttag cccttttatc 5100 gatgaatttg gtgttattcg agttggtggt cggctgcaga atgctatgtg ctcatgggat 5160 cgcaaacacc caatactgtt acctaaagaa tcgcacctat ctgcgttgat tgcacgtcat 5220 tggcatttga gtgcatgtca cgcccgatca aacctactta tgtcgttagt caatagtcgc 5280 ttctggatca tgggcgtacg ccgagttgtc tataaggcta tcaagtcctg cattacttgt 5340 gttaaactag atggagtgaa tcctcaacca aaaatggctg atctgcctgt gtcaagagtg 5400 caagcttgcc gagcattcac agcagtggga attgactacg cgggaccatt aatgatgaag 5460 gaaacaaagt tacgaaaagc tcgcgaatac aaagtttaca tagcactgtt tgtttgtatg 5520 agtactagag ctattcatct tgaagttgta ctggatttgt caactgatgc ctttttagcg 5580 gcacttgaca gatttgttgt ccggcgaggt gtgcctacat ccatccattc agattgtgga 5640 acaaattttg ttggggctgc gcgcaaattg agggagttga tcaactcgcc ctcgagtagg 5700 gatcaatgtt ccagtcgtct gatgtgtcac tggaacttta acccacctag tgcacctcac 5760 tttggaggac tttgggaggc tgcggtgaag tctaccaaat cgttgctgac aaggtcattg 5820 ggggcgcaga cttggacatt ggaagagttt actacaatcc tatgtcgggt tgaagctgcg 5880 ctgaactcca ggccattaac tcctgttact tcagatccgg aggacttaga ctgtttgact 5940 ccaggccatt ttttaattgg ccaaccattg atgtcagttc ctgaggagga actcagtgtg 6000 aagccagtga atatacaacg gcgttggaag ctccttcaac aaacttttca aaccttttgg 6060 cgaagatggt cgtcagaata cttgaacact ttgcaagcta ggattcgctg gtcgtcaagc 6120 caagaaaatg tgaaagtagg agacatggtg gtcataaaag ataacactgc acccccacta 6180 ctctaacgtt tgggtcgtat cctcgaggtg ttgcctagta aggatggggt ggtaagagtc 6240 gctcgggtgc taacgaaagg tggttccttg gtcagacctg tagtgaaatt agtactgcta 6300 ccgaccgatc agtcataata gtatttgttc tttattccct ttctcttgac accttaatac 6360 taatttgatg ggtggtgtgc ttgctgggaa cagttgatga gctgtatcca ttaaaaaaaa 6420 aaaaaaaaaa aaaaagtagt gtagagaaat aattgttttg ggccaaaccc cctcagactc 6480 attcttaatt ttttttttta tatgatgtta taatttagta caactgttga tccacagttg 6540 tttattgttt ttgttttttc acttgttcct aatcatttat tttagacagc tgttgatcca 6600 cagctgcctg ttttgtttta ataatattat aatgtaatcc tctgtgatat agctataaaa 6660 tttcgtaagc tgtgataaat tgtaatgtga taaacctgtg attctacagt taccataata 6720 tgtataccga tctcttgtat aataattttg ttgttgatgt aacccctctg taccactccg 6780 agtaagcgtg aagagtcaac atgtcaatca atcaattgtg tgcctgatcg ggtccatcat 6840 ctgtatcatc taacgaagag ctgaaacttt taactgctca aactctttgt aatcccagat 6900 ctgggatttc caagggggag ta 6922 // ID Penelope2_Dw repbase; DNA; INV; 2798 BP. XX AC . XX DT 25-JUL-2007 (Rel. 12.07, Created) DT 25-JUL-2007 (Rel. 12.07, Last updated, Version 1) XX DE Penelope2 retrotransposon from D. willistoni - a consensus DE sequence. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW retrotransposon; Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Penelope2_Dw. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2798 RA Arkhipova I.R.; RT "Penelope retrotransposons from Drosophila willistoni."; RL Repbase Reports 7(7), 629-629 (2007). XX DR [1] (Consensus) XX CC Penelope2_Dw from Drosophila willistoni is 68% identical to CC Penelope from D. virilis, and 70% identical to Penelope1_Dw, CC another D. willistoni Penelope element. There are about 60 CC copies of Penelope2_Dw in the sequenced part of the D. willistoni CC genome, most of which differ from the consensus by 1-5% but can CC differ by as much as 10-15%. Most of the copies are CC 5'-truncated, and only one appears to be fully intact CC (AAQB01010750) and is arranged in a characteristic CC partial-tandem, forming "pseudo-LTRs". XX FH Key Location/Qualifiers FT CDS 73..2583 FT /product="Penelope2_Dw_1p" FT /note="reverse transcriptase; GIY-YIG FT endonuclease." FT /translation="MAYAEIKSNYNKYKTVINKYQSTTKKSVKLKSSIKFL FT LKCRKSKLVPKFISNNSKYNRIFTKDTDTYSDINKILERHTHNFHTKILSL FT LIKHKHNILARQNKEKEKATKKLQEQLDENDFKTFMKSESTIAEKLSTTLK FT RHHESKHERLRKQRNSVLLNKNNNNTNDWFVNKTNIEFPTDVKALLAKGPK FT FAIPVEKRKFPLFKYIADGEELVQTLKDKEKQEEARTKFSLLIKDHKTSNK FT EDAADRAILDTVEQTRKFLRQNDDILILTSDKGNKTVAMDKTEYDKKMDSI FT LNDLNTYRTLRKDPTTRLQTKNNELVEKLYKHELISRTERNKLITNTSIPP FT RIYGLPKIHKEGTPLRPICSSIGSPSYGLCKYIVNILKNITADSTYNVKNA FT SEFKTRINDTYICDDERLISFDVVSLFPSIPIELALDTIREKWIKIKAHTN FT IPKLTFMEIVRFCIQENRYFKYKDNIYTQLKGMPMGSPASPIIADIIMEEL FT LDKMMRTLQRPPRVMTKYVDDLFAIIKEDEIQNTLDNLNSFDRNIKFTIEL FT ENDNKLTYLDASIHRRGNELKLKWYKKPTASGRIINFNSKHPKSMIMNTAR FT GCIQRMLNISDPIFHEETKKEIRRILKANDFPNYTIRTLLKSNNKKTNKEV FT STKRFMSVTYVPQLSERLANSDCYDKEQIQIAHKPNNTLRSIFNRTKSKID FT TMDKSNVVYKIPCNGTNEESCDKMYIGTTKSRLKTRLSQHRSDYKLRQHSN FT IQKTALMAHCAASGHSPNFDEVNIIDQEQHYNKRYTLEMLHIINTPTNIRL FT NYKVDTDNCAHLYRHLLGTSQCVAPRHSVRTCKNK" XX SQ Sequence 2798 BP; 1197 A; 517 C; 451 G; 633 T; 0 other; taacaataaa aggtcgcaaa aaatcaacaa ttaacttgaa cggaaggcac gccgaaagca 60 tattccaaca ccatggctta tgcagaaatt aagtccaact ataacaaata taagacggtg 120 attaacaagt accaatcaac aacaaaaaaa tcagtaaaac ttaagtctag cataaaattt 180 ctcttaaaat gtaggaaatc aaaattagtc ccaaaattta tcagcaataa ctcaaaatac 240 aataggattt tcacaaaaga tactgacaca tattcggaca ttaacaaaat attagaaaga 300 catactcata attttcacac aaagatatta agcttattaa taaaacataa acataacata 360 ctagccagac aaaacaaaga gaaggaaaaa gcaacaaaaa agttacaaga acagctggac 420 gaaaatgatt ttaaaacatt catgaagagt gagagcacga tagccgaaaa attatcaaca 480 actttgaaga gacatcacga atccaaacat gagagactac gaaaacaacg gaacagcgtt 540 ctcttgaaca agaacaacaa caacacaaac gattggtttg taaataagac gaatatagaa 600 tttccgaccg acgtaaaagc gttactggcg aaaggcccta agttcgcaat cccagtggag 660 aagagaaaat tccctctctt taaatatata gcagacggag aggagctagt acagactctt 720 aaagacaaag agaaacaaga ggaagcacgc acaaaattct ctttattaat aaaagaccac 780 aaaacatcta acaaagaaga cgcagctgat cgtgcaatac ttgacacagt ggaacaaaca 840 aggaaatttc ttagacagaa tgatgacata ttaattttaa cttcagataa agggaataag 900 acagttgcaa tggataagac tgagtatgac aaaaaaatgg atagtatact taacgattta 960 aatacatata gaactttaag gaaggaccca accactagac tacagaccaa aaacaacgaa 1020 ttggtggaaa aattatacaa acatgaactt atttcacgga ccgaacgaaa taagcttata 1080 acaaacacat caattccacc aagaatatat ggattaccta aaattcataa agaaggtacg 1140 ccattgagac caatctgctc atcaattggt tctccatcat acggactatg caaatacata 1200 gtcaacatat taaaaaatat aacggcagac tcaacataca acgtaaagaa cgcctcagaa 1260 tttaagacaa ggattaatga cacatatatt tgcgacgatg aaagattgat atcatttgac 1320 gtagtctcac tttttccaag tattccaata gaattggcac ttgacactat tagggaaaag 1380 tggataaaaa taaaagcgca cactaacatt ccaaaattaa cattcatgga aatagtacgt 1440 ttttgtatac aagagaacag atattttaaa tacaaggaca acatatacac acaactaaaa 1500 ggaatgccta tgggctcacc agcatcacca ataatagccg atataattat ggaagaactc 1560 ttagacaaaa tgatgagaac actacagcga ccacctagag ttatgacgaa atatgtagat 1620 gacctttttg ccataattaa agaagacgag atccaaaaca cacttgacaa cctaaattca 1680 tttgacagaa acataaagtt tacaatagag ttagaaaatg acaacaaatt gacatatcta 1740 gacgcatcaa tacacagacg aggaaacgag ctaaaattaa aatggtacaa aaaaccaaca 1800 gcatcaggac gaatcatcaa cttcaactcc aaacatccga agtcgatgat aatgaataca 1860 gcaagaggtt gtatacaacg gatgctcaac atatcggacc caatttttca cgaagaaacg 1920 aagaaagaaa tacgacgaat tttaaaggct aatgactttc caaactacac cataaggact 1980 ttactaaaat ccaacaataa gaaaaccaat aaagaggttt cgacaaaacg ctttatgtct 2040 gtcacatatg ttccacaact gtcggaaaga ttggcaaact cggattgcta tgacaaagaa 2100 caaatacaga tagcacataa acccaacaac acgctacgga gtattttcaa caggacaaaa 2160 agcaagattg atacaatgga caagagtaac gtagtatata aaataccgtg caacgggaca 2220 aacgaagaaa gctgtgataa aatgtatata gggacaacca aatcgaggct aaaaactaga 2280 ctatcacaac atagatctga ctataaactt cggcaacact ccaatataca aaaaacagca 2340 cttatggccc actgtgctgc cagtggacat tccccaaatt ttgacgaagt aaacataata 2400 gaccaagaac aacattacaa caaacgttat actttggaga tgctgcacat aataaataca 2460 ccgacaaata taagacttaa ttataaggta gatacagaca actgcgcgca tttatacagg 2520 catttattag ggacaagtca gtgtgtagct ccacgtcaca gtgtgcggac gtgtaaaaat 2580 aagtaggtgt ttcttgttgt tatattatgt tatgttaaag ttaatcacaa ttaaatgttt 2640 ttatagttgc cctgaagacg accaccgatg agtggtcgaa atatatcgga aaagaacaac 2700 acaaaatatt attattttgt tttattcacc caagaaaatt tgacctcgag ccggcaaaac 2760 atacatttaa caataaaagg tcgcaaaaaa tcaacaat 2798 // ID L2B-1_CQ repbase; DNA; INV; 4835 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4835 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 142-142 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 267..1607 FT /product="L2B-1_CQ_1p" FT /translation="METGEGKGGGTNHPCAQCSQPISETDHCVECFGKCRL FT SIHVGCVPRATAESIELLGQLPNAVFVCDACYALHDYGDDGHTEKILSDVV FT AKLDELSCMIDFVKNFDSAVRNVVREEIARDRKTSTASGNKDGTRNRVITR FT SESKKRKAEEAEDSESKKMRAEEVGDTVTEFVTPKASFAKVVQQRIVLSEE FT EQKQKQTQKPDPVVVIKPKEGVEVENARMEVQKKVSSKNLNVQRVYTNKNG FT EVVVALKDEASVQVLQENVKKQLGERYDVRLRDSLKPTVKIIGMPEELEED FT ELRETLIDHNDSFANLKHFKLCRSYRNEKWSYDNNNAVVELDADTYFKVLD FT AGKVNCGWKRCRVFDGLQVLRCFKCNGFNHKGADCKAAVVTCPICSGPHEL FT KDCKAEREKCSNCEKLRSEKNADVDVNHAAWSSECPVYRKQQQRRNKLVDF FT TL" FT CDS 1641..4412 FT /product="L2B-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNIAGMRTHFDELKVIIQQNKPKLVILTETHLTSQHD FT LDEYEIQNYRRSVCLSRSAHTGGVLIYVDDRVDFETISDFVAGDNWFLAID FT VKSPTLNGIYGGVYHSPSSSDTSFIGSLETWLRSVFVEDRTNIFAGDFNIR FT WNEPGYSRELRNVTEALGMKQLVSEPTRVGPSSSTMIDLVFTNMVGGDARV FT VEELKISDHETIGINPGKSSVHLSERSKTRISWRRYSKEKLQEILLSNRSR FT ADPMASVNEAGEKFDAEIVAAVGSLVDVRFDERSDPHDWYGDQLSTLKTLR FT DDAYRKFKSTKLVSDWDRYKRCRNRYVNQIRTSKNSSINNEIRSCGGDSKK FT LWRCLKSLIQPGGQPPTEIVFGGPRGDAETATLLNEFFVQSVEEIHGKIPP FT TAFTPPTTGGTEHKELLSNFEPITMNKLKETVRSLKNCAGVEHVTKRVMMD FT ALDVVAVELLDIVNKSLSRGEFPDAWKRTLVIPIPKIPKSKNPEDHRPINM FT LPLYEKVMEIIVKEQLLAFIDRTGVLLKEQSGFRKNHSCESALNLLLLKWK FT QCIEEGRVVLSVFVDLKRAFETIDRTKLIGVLKKSGIRGTVLKWFSSYLEN FT RTQVTRYNSAVSSETAVKLGVPQGSVLGPLLFILYINDIKQALRRVQVNLF FT ADDTVLFVIGDSFDECFDIMNEELVGFTEWLKWKKLQLNIAKTKCMVVTTR FT KTNDCRRCVQMDGGVVERVETMKYLGVMLDEKLNFNEHIDYTIRKAARKFG FT VLCRINRYLTAETKVQVYNSLIAPHFDYCSSILFLGTQRQLKRMQVLQSKI FT MRLILKCDRLTPRRSMLECLQWMSVKQRIEYNTLVFVYRMKKRMAPQYLTE FT AMVCGRDIHEHDTRGADDLRLQNWKKACTQNSLFYKGFKLYNQLPENAKAT FT SNINEFKRNCKEFVRTRPLE" XX SQ Sequence 4835 BP; 1458 A; 937 C; 1350 G; 1089 T; 1 other; actgagtgta gtcaattgaa ttgttgcatc tttttaatga ccagcaataa agtgtcgttt 60 taattgtgtt accgttcgaa gtgaaggcca caaagtttag tacttgacgg aaaagcagct 120 ctgtgcaagt kggcgttgga gttcaagcgt tggatgctga gtagccaagg attttcgacg 180 ggacaaaatc agtcaaaacc cgaagctaag tatcgctccc catttgttct gtgcttgttt 240 atctcgccgg cacaatagcc gcggtgatgg agacgggcga agggaagggt ggagggacga 300 accatccgtg cgcgcagtgc tctcagccta tttctgagac agaccattgt gttgagtgtt 360 ttggcaaatg cagattgtcg atacacgtcg gttgcgtgcc ccgcgcgacg gcagaaagta 420 ttgagttgct tggtcagttg cctaacgctg tatttgtgtg tgatgcgtgt tatgccttgc 480 acgattacgg ggacgatggg cacacggaaa aaattctgtc tgatgttgtc gcgaaacttg 540 atgagttgtc atgcatgatt gattttgtga aaaacttcga ttcggctgtc agaaacgttg 600 tgcgcgagga aatagcgaga gacaggaaga cttcaacagc gagtggaaat aaggatggaa 660 cccgaaatcg tgtgattacg cggtcggaat ctaagaaaag gaaagcagaa gaggcagagg 720 acagcgagtc aaagaaaatg agagcagaag aagttggaga cacggtgact gagttcgtta 780 ctccaaaggc gagctttgca aaggtggtac agcaaagaat tgtgttatcg gaagaggagc 840 aaaaacagaa acaaacccag aagccggacc cagtcgtggt tatcaagcca aaggaaggag 900 ttgaggttga gaacgcccga atggaagtac agaaaaaggt cagctctaaa aacttgaacg 960 ttcagcgggt gtataccaac aaaaatggtg aagttgtagt agcactgaag gatgaagctt 1020 cagttcaggt acttcaggag aacgtgaaga agcagcttgg agagcggtac gacgtacggc 1080 ttcgggattc cctgaaacca actgtcaaga tcatcggcat gccagaggag ttggaggagg 1140 acgaactacg ggagacgttg atcgatcaca acgattcgtt tgccaacctg aagcacttta 1200 aactgtgcag aagttatcgc aacgaaaagt ggagctacga caacaacaac gctgtagttg 1260 agttggacgc agacacttac ttcaaagtat tggacgcagg gaaggtgaat tgcggctgga 1320 agcgttgccg tgtcttcgat ggactacagg tgctgagatg cttcaagtgc aacggtttca 1380 accacaaagg agctgactgt aaagctgctg ttgttacctg tccgatctgc agtggaccgc 1440 acgaattgaa ggactgcaag gcggagcgtg aaaaatgttc aaactgtgag aaactaagga 1500 gtgaaaagaa tgctgacgtc gacgtgaacc atgccgcgtg gagcagtgag tgtccggtgt 1560 acaggaaaca gcaacaacgg cggaataagc tggttgattt tactctgtag caaccaaaac 1620 ccgtcggaga tgtcttgtac atgaacatag cagggatgag gacacatttt gacgagctaa 1680 aggtgataat tcagcaaaac aagccaaaac tggtgatttt gacagaaaca cacttgactt 1740 cacaacatga tcttgacgag tacgagattc agaactatag gagaagcgtg tgtttgtcca 1800 ggtcagccca cacgggaggg gttctaatat acgtggatga tcgtgttgac ttcgagacaa 1860 tttccgactt tgtagctggt gacaattggt tcctggcaat tgacgtcaaa agcccaactc 1920 tgaatggaat ctacggagga gtatatcact caccgagcag cagtgacact agttttatcg 1980 gaagtctgga aacgtggctc aggagcgtct ttgtagaaga taggacgaac attttcgctg 2040 gggatttcaa cataagatgg aacgagcccg gttactcgcg tgagctgaga aacgttacgg 2100 aggctttggg gatgaagcag ttagtgtctg aacctacccg tgttggtcct agcagcagca 2160 caatgataga tctcgtgttt acaaatatgg tgggtggaga tgcacgtgtt gtagaggagc 2220 tgaagatttc cgatcatgaa acaattggaa tcaaccccgg gaaaagcagc gtgcatcttt 2280 ccgaacgcag caagacccgg attagttgga ggcggtactc taaggagaaa ctgcaggaga 2340 ttctcctttc gaaccgaagt cgagcggatc caatggcgtc agtaaacgaa gccggagaga 2400 aatttgacgc agaaattgtg gcggccgttg gaagtctcgt cgatgtccgc ttcgatgagc 2460 gttcagaccc acatgactgg tacggggatc agctttcaac actcaaaact ttgagagacg 2520 atgcctaccg gaagtttaag agcaccaagt tggtaagtga ctgggatagg tacaagcgtt 2580 gcagaaaccg ttacgtgaac caaatccgga catcaaagaa ctcgtcaatc aacaacgaaa 2640 taaggagctg cggcggcgat tcaaagaagc tgtggaggtg tttaaaatcg cttatccagc 2700 ctggaggaca gccgccaact gagatagtgt ttggaggacc acgaggagat gctgagacgg 2760 cgacgcttct gaacgagttc tttgtacaga gtgtggagga gatacatgga aagattccac 2820 caacggcatt tacgccgccg acgacaggag ggacggaaca caaggagttg ttgagtaact 2880 ttgaaccgat cacaatgaac aaactgaagg aaacggtacg gtcactcaag aattgtgctg 2940 gagttgaaca tgtcacgaaa cgggtcatga tggatgcttt ggacgtggtt gcagttgagt 3000 tgctggatat cgtaaacaag tcgctgtctc gaggtgaatt cccagacgca tggaagagga 3060 ccttggtgat tcccatccca aaaattccta aatcgaagaa tccggaggac cacagaccca 3120 tcaacatgtt gccgctgtac gagaaggtca tggagataat cgtgaaggaa caactgttag 3180 cgttcattga ccgtactggt gtgctgctga aggaacaatc tggttttcgg aaaaatcact 3240 cttgtgaatc ggcgctgaac ctgttgctgt tgaagtggaa gcagtgtatt gaagaaggaa 3300 gagttgttct atcagtcttc gtggatctga agcgggcctt cgaaaccatt gaccggacga 3360 aactaatagg tgttctcaag aaaagtggaa tacggggcac ggtactaaaa tggttcagca 3420 gttacctgga gaaccggacg caagtgacaa ggtacaatag cgcggtgtca tcagaaacag 3480 cagtcaagct cggagtaccg caaggaagcg tgctaggccc actactgttc atactgtaca 3540 taaacgacat taaacaggca ctgaggagag tacaggtgaa cctgttcgcc gatgacacag 3600 ttttgtttgt gattggtgac agcttcgacg agtgctttga catcatgaac gaagagctag 3660 taggattcac ggaatggttg aaatggaaga agctgcagtt gaacatcgcg aagacgaagt 3720 gcatggtagt gacgacacgg aaaaccaacg actgcagacg gtgtgtacag atggatggag 3780 gagttgtgga gcgggttgag acgatgaagt acctcggagt catgttggac gaaaagttga 3840 atttcaacga acatattgac tatactattc ggaaggcagc acggaagttt ggtgttctgt 3900 gtaggatcaa tcgctacttg acggcagaga cgaaagttca ggtgtacaat tcgctcatcg 3960 ctcctcactt tgactattgt tcatcgatct tgtttctggg aacacaacga caactgaaaa 4020 ggatgcaggt actacaaagc aaaataatgc ggctgatact aaagtgtgat cgactgacgc 4080 ctcgacggag catgctcgaa tgtttgcaat ggatgtcggt aaagcaaaga atcgagtaca 4140 atacccttgt gtttgtttat cgtatgaaaa aaaggatggc gccacaatac ttgacggaag 4200 ccatggtatg cggaagagat atccatgagc atgacactag aggagctgac gatctcagat 4260 tgcagaactg gaaaaaggca tgcacgcaga actcgctgtt ttacaaagga tttaaacttt 4320 acaaccagct tccagagaat gcaaaggcga caagcaacat caacgagttt aaaagaaact 4380 gcaaggagtt cgtacggaca cggccgttgg agtagaagtg acccacgatg gtactgtgag 4440 gaagagcacg ttatgacggt cggccatctt cattatcggt acgcattcgc atgggatcac 4500 tttgggccgc atatgataaa cctgatcaaa agtaacgcga atctgggcgc ggtttaaccc 4560 tatgcgctca tatgcgagtg gaatagcaaa tggttccaat accctaaatt tgatgcaatc 4620 tgagtgatga aagactctac ataggatttg taagagcgcc ttgagacgag ataagagaga 4680 tatggatggg catacacgga agtggagtaa gattacagga cactctctaa acatgaacga 4740 agaattatcg aaagatatct gctcgtaaat cttccatact acaaaaactg tgtatgggta 4800 tgaggtgggc catccaagga aaaaaaaaaa aaaaa 4835 // ID CR1-47_AAe repbase; DNA; INV; 3948 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-47_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3948 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1134-1134 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 427..3825 FT /product="CR1-47_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MGTPSDPVTVEPNHHATSNSSVCASCHLRRPGPVYGG FT LEGVSQPAFPGKYAYHNGHSLDDVSNNSSSTENDDPPASLHQNSYSTSQFH FT LPGRNVESLMEAPGPSDPVEQIAASVLHSRPGPVVSCGAGVFQSVTSGEYP FT FNTSHEQPILEEPNSRISERNSDSATKSLGRYDDVLVYYQNAGGMNGDVDK FT YLLATSDECYDVIVLVETWLDSRTLSGQVFGAEYEVFRCDRGPLNSRKSTG FT GGVLVAIKRKYKPSVINSDEWVSIEQIWVSFQLSNRKVFLCAVYVPPDRTR FT DVALIDLHSRSIMSAIDMADATDEIVIFGDFNLPGISWQSSDNGFLHPNPD FT RSTLHIGASRLLDSYSSATLRQINYTTNENGRSLDLCFVSAQNTAPSISVA FT PSPLVKQVYHHPALVLILNDSKVPELKSVDPISYDFRNADHDRIVEFLSAI FT DWDLVLDGRDVNNAALTLSHILGHVIERHVPKICHPAKSKPWMTRELRMLK FT TAKKSALRKYSKHHTVSLQEHYRKLNSAYKVSSRNSFRRYQQNLQRNLKAN FT PKFFWKYVKDQRKEAGYPSTMTLNGEVTSEPRRICQFFADKFSSTFSNEFV FT SEEQIIQAASNVPLLAGDSFGTLNIDDAMISRAASQLKSSTNTGPDGIPSV FT FVKKYIENLLVPLRHVFNLSMSSGDFPSVWKTAVMFPVYKKGERRDVNNYR FT GVTSLGAISKLFELVVMDPLLSHCKQYLSDAQHGFISGRSTASNLLCLTSH FT ITESLAERAQTDVIYTDLSAAFDRVNHAITIAKLERLGICGSVLRWLHSYL FT TGRKLTVTVEDIISDEFLATSGIPQGSHLGPLIFLLYFNDVNYVLKGPRLI FT YADDLKIYRRIRSESDARILQQEIDTFTNWCTLNCMTVNPSKCSTITFARI FT RQPIVFNYNLRGNEIERVDHVKDLGVILDPQLTFKRHVAYTVEKASRTLGF FT IFRIAKDFTNMNCLKSLYCSLVRSTLEYCSVVWNPYYQNGTERIESVQRRF FT IRFALRRLPWRDPFRLPSYESRCQLIHMETLQVRRNIARAMFVTDTLQGRI FT DCSTILRTIDLNVRPRALRNNSLLRLQFRRTNYGQHSALNGALRVFNRAAS FT CFDFNLSRATIRRNLSVFFSRRMDE" XX SQ Sequence 3948 BP; 1065 A; 976 C; 851 G; 1056 T; 0 other; gttcaactat ttgtcttgat tcacgccaag aagatgactt gatatggatc tacttgtccg 60 cgttccatcc gaatacgact gaaaaccaaa tttcttcgtt tgtcagggag tgctgtgaac 120 tgacagccaa cgctcatctc aaagtgatta aactggtgcc aaaaggaaag gatttgaata 180 cgcttaatta cgtgtccttc aaaattggtt aagcgttcaa ttcaaggaaa aagctttctc 240 gtgtgaaacg tggcctgaaa acatacgttt tcggcaattc gaagataatc gagcaaaaaa 300 cttaccgaga gttatcagcc tgtcttcaac gatacaacaa ggaacgactg ttccgcctcc 360 catggactct tcgagcttgg acatctaggg gtcagcatcg caccaggacg cacgcaacga 420 agcacgatgg gaactccttc tgaccccgta acagtcgagc caaatcacca cgccacctcg 480 aattcctccg tttgtgcatc ctgccatctt cgtcgtcctg gtcctgttta cggcggtttg 540 gaaggagtct cccagcctgc atttccaggc aagtatgcgt atcacaacgg tcattcactc 600 gatgacgttt caaacaattc tagctctacc gaaaacgacg atccaccagc gagtttgcac 660 cagaactctt attcaacgtc ccagttccat ctaccgggcc gcaatgtgga aagccttatg 720 gaagctcccg gcccttctga cccagtcgag caaatcgccg cctccgttct tcacagtcgt 780 cccggccctg tggttagttg tggcgcggga gtcttccaat ctgttacctc aggcgagtat 840 ccttttaata caagtcatga acaaccgata cttgaagaac caaattccag aatctctgaa 900 cgaaattccg actcagcgac taaaagcctt ggtcgttacg atgacgtgct ggtctactac 960 caaaatgccg gagggatgaa tggagatgtt gacaagtatc ttctagctac atccgacgag 1020 tgctatgatg ttatcgtctt ggtagaaacg tggctcgatt ctcgaaccct ttcgggacaa 1080 gttttcgggg cagaatatga agtttttcgt tgcgaccgtg gacctttgaa tagtcgcaaa 1140 tcaacgggtg gcggcgttct tgttgcaatc aagcggaaat ataaaccaag cgtcatcaac 1200 agcgatgaat gggttagcat cgaacaaata tgggtgagct tccagctctc aaatcggaag 1260 gtgttcttgt gtgcagttta cgtgcctccc gatcgaactc gcgacgtggc actgatcgat 1320 cttcattccc gatcgattat gtcagcgatc gatatggcag atgcgaccga cgaaatcgtc 1380 atctttggag atttcaactt gcccgggata tcctggcaat cgtctgacaa tggtttccta 1440 cacccgaatc cagaccgttc cacgttgcat atcggagcat caagacttct cgacagctac 1500 agctctgcta cgttgcgaca aattaattac acaacgaatg aaaacggtcg ctctctagat 1560 ttatgtttcg tcagtgcaca aaataccgct ccatcaatct cagttgctcc atcgcctttg 1620 gtcaaacaag tataccatca ccctgctttg gtcctcattc taaacgattc gaaagtacct 1680 gaattgaagt cggttgatcc gatttcctac gactttcgta acgcagacca tgatcgcatt 1740 gtcgaattcc tttctgcgat tgattgggac cttgtactcg atggtcgtga tgtcaataac 1800 gctgccctaa ctctctcgca tatactagga catgtcattg agcgccatgt gcctaaaata 1860 tgtcatccag ctaagagcaa accatggatg acccgagaac ttcgtatgct caagacagca 1920 aagaaatctg ctctaaggaa atattccaaa caccacacag tctcgctgca ggaacactac 1980 agaaaactaa attccgccta caaagtaagc agtcgaaata gtttcagacg ataccagcag 2040 aatcttcagc gcaacttaaa ggcaaacccg aaattcttct ggaagtatgt gaaagatcag 2100 cggaaagaag ccggttaccc ttcaaccatg acgttaaacg gcgaagtaac cagtgagcct 2160 cgacgtattt gtcagttttt tgccgacaag ttctcgagca cttttagtaa cgaatttgtt 2220 tccgaagaac aaattatcca agccgctagc aatgttccac ttctggcagg tgacagcttt 2280 ggtacactca acatcgacga cgccatgatt tctagggccg cttcacagct caaatcatcg 2340 accaataccg gcccagacgg aattccgtct gtttttgtta aaaagtatat cgaaaatctt 2400 ttagtccctc tccgtcacgt atttaatctg tctatgtcaa gcggagactt cccatccgta 2460 tggaaaaccg ccgtcatgtt tcctgtttat aagaagggtg agcggaggga tgtgaacaac 2520 taccgtggag tcacgtctct gggcgcaatt tccaagttgt tcgaactcgt cgttatggac 2580 ccattgctct cgcattgcaa acagtacctt agcgacgccc agcatgggtt catctccgga 2640 agatcgaccg cttcgaatct tctttgtctg acgtcacata tcaccgaaag tctagcggaa 2700 agagcccaaa ccgacgtcat atatactgac ttgtctgcgg cattcgatag agtaaaccac 2760 gcaataacga ttgccaaact ggaaaggctc ggaatctgtg gcagtgtatt aagatggctt 2820 cattcatatc tcaccggccg caagctaacc gtaactgttg aagacattat ctcggatgag 2880 tttttagcca cttccggcat accgcagggt agtcatctag gcccgttgat ttttctgctg 2940 tattttaacg acgtcaatta cgtactcaaa ggccctcgtt taatctacgc agacgacctg 3000 aagatctatc gtaggattcg ctccgagtcc gatgcaagaa tccttcagca ggagatcgac 3060 accttcacga actggtgtac cttgaattgc atgacagtga atccaagcaa gtgctccaca 3120 attacgtttg ctcggattcg acagccaatt gttttcaact ataaccttcg aggcaacgaa 3180 attgaacgtg tagaccatgt taaagatctt ggcgtgatcc ttgaccccca gctaactttc 3240 aagcgacatg tggcttatac agtggaaaag gcctccagaa ccttgggatt tatttttcgt 3300 atcgccaagg acttcaccaa tatgaattgc ctcaagtcgc tgtactgttc gctcgttcgt 3360 tcgacgctag agtactgctc tgtcgtttgg aatccgtatt accaaaatgg tacggaaagg 3420 atcgaatccg ttcagcggcg ttttatacgt tttgctcttc gtcgacttcc atggcgagac 3480 ccgttccggt taccgagtta tgaaagtagg tgccagttga ttcacatgga aactctccag 3540 gtgcgccgta acatcgccag agcaatgttt gtaaccgaca cgctacaagg aaggatcgac 3600 tgttcgacta ttttgaggac gattgatcta aacgttcgtc ccagagcgct acgtaacaac 3660 tcactgctga gattgcaatt tcgacgcaca aactatggcc agcatagcgc cctaaatgga 3720 gcgctacgag tcttcaatag agcagcttct tgtttcgatt ttaatttgtc ccgtgctaca 3780 atccgtcgta acctttctgt attcttttca agaagaatgg acgaataacg atttaatttt 3840 atttttgtgt tgcgcaccat gacttttgtt aaatttaaga tagtttaatc accattggga 3900 cacaaatcgt ctgttggtgt aagataataa taaataaata aataataa 3948 // ID Gypsy-20_OD-I repbase; DNA; INV; 8840 BP. XX AC CABV01004250; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_OD_; KW Gypsy-20_OD-LTR; Gypsy-20_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-8840 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004250; Positions 119 8958. XX CC Positions [3018-3524] - Reverse transcriptase CC Positions [4710-5198] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 37..1863 FT /product="Gypsy-20_OD-I_1p" FT /translation="MSSIESPFNIRRIKLADIPELYLTQVRARYKLKDKEG FT EIEKYNKKRHGEKNKSEVHLLACCIYIGEKNDLDPYFNLYVTFAIPYLKQI FT PLNSDSIQYVTFRDYYEQIPSIASEIPIFDPQTTESNEKNIAYALNFLYKQ FT AIIEKRAIDNYKFSLDPSAIMDPNQTIKNTIYEGKLNTSVTKIVLEDKNSS FT VVQMFTKQGSDSEEEEYEEGFDIKIEKPKTERKSIMNRVREVIGINNKEEH FT EDDSSDSDGEFDKFYDKKGDRIRTSSKIKSSPVNNFVTDEIKQPRVQRFGD FT KDDIDSLPSWAETQIFKLKLQNKNKNIPDEIFIGSKIIAEGLTEKMKKAQL FT QGKSLSLSRFVDIIKDISSKPDELVLRELDMLTYNPQRDTPASFYRKIAQM FT SLQMFPDKTAGSKLVDLNFRKKVSPNDDMFPIISMNLEGEELVKFAENYLS FT QRKLAQLNNINKSDNKEMSEEPKKAGKLNGGSAPAWRGRDGFSRNNHYSYR FT GHNSMMKGSANNYRDNYNSGFQQNSYPQGRNNYQPNQRGRSRGNFVQDRQF FT GRQNQNRNYIDNRQRQNNPRLCHACSSPEHFVRECPLMNRDNVTCYACNQE FT GHFSRDCPQRQ" FT CDS join(1869..5777,5781..6776) FT /product="Gypsy-20_OD-I_2p" FT /translation="MVKHLPCVNISFNKFNKLLISSLLDTGSSVSLIKRET FT QIKNNLKTKKCEKFALRGGDGKIISVVSSKVTVNLQFPNKKIIPTCDLFVT FT PELNYEVIIGYDIIKYLNLDFNNLPQVVTAASVHSFSLSSEPIEVTRIEAT FT RKPFANEIVAKKLTYLFPHSYFQVNIGTKNLNELDLQTEILPDPILTLSNE FT QVNFSDINFQAGIIYGYNKNNKIIILEENSIIGEIGKRHKELKVLNMLLRT FT ENLNPPELEIIENEFNNWKINREEMIKDISYDEVIMAKVAETEKDGILDKN FT EIIEVEKILKRFNKVFARHDSDTGFSRRYVVTFDLNKEMRKSEKRAPYRNS FT SCDQNEIIDFLDGLEESGVIGLSPSSWTSPSIFIRKKNKKLRLVTNFKTWL FT NECLNVPQWPIRSIRDLFYSIAKDISDLKKSSTSEILFTGIDLKSGYFTLA FT LEPSIRKFTAFAGPEKLYEYKKLSQGLSIAPSCFSQFITEIFSLPHNMSDQ FT FRVTTYLDDVLLIHTKESISPALNWLLSVIQKNHLVLDIKKTNIAKPKIDF FT LGYEICEKGLRPLKHKLEGLLKLESPKNQKNIQHFMGACNFFCRSLPRAQF FT LLKALAEESSNVEFKGSEKVDIAVEKLKSIFKQEGMTFHPIPGANEEYLLA FT VDSSIDGYGAVFGIGNYKEEKVENFKPVAFGSGAFEDKLKLESSRNRELIG FT ISRALSSFKDLLSEALPLTVVCDHKSISNLNNNEEAKKYPITRVRKALSEI FT FKYPNLKIKYASAKDEIVQICDGLSRAITISGQIEPTEIVEWMDQNLPQAQ FT VNNTEILKKYKLRHFSEIALTENTLKLEQEKDEFCLEILSKADNKKSWNIG FT PHEYKIKNSLILKYNFKLKEHQIVLPKGVAIEILTVIHMSYGHASAETILQ FT KINSEALFVPNFRTMLREIRAECYLCALKQKIHKNSQVQRRMKLALFPFQK FT CCIDIVVLDKRSRDAPYGLSFLDEFSEFLEVLPILDKTSHSVIKVLTILIF FT KYSLGRGSSFLSDNGPEFASEDFSKFMKTHGIRHSFISPYNASSNSVERSH FT KLLRKHIEAVQSLADESNFSDTERLQLALASLNSKPKANLDFKSPTQILTN FT QQPNMIIRAEEMDYVQQNYSELSDVEKREAREDWSTNLYKFQQETGLSNIL FT KIQESNATMLTGEIKEGDVVIIVNSKILGHGKIGTGPYLVLRNENGSCELE FT HILNKLRSKRNEKNLRKLFLTDNMKLAIKHGYIKLNENGEFAIRDNAGAPP FT LEPKVEEILKTEKEPSRTEKKVHFEREKNYNFRPRRFVSYKEAMKALFFLF FT LISSAIFHNNFTNTVVNFNNDDTLILQIEQKNYQVNIHNHFPVVDSIESSK FT CFRDPLTYKTINLLIANNIINRISNRTDYFLCEDYEHCSKLHEDHDKIHSE FT TLEKFNRLFEITPNLESPKEHYERPKSVENREPQMPDVMLEDGSKQEIKRW FT DENPLLSEPYIFRGKNVGSIIKGIVTSFNPTNTPRYNFNRNLIKSNINYIP FT YESKFLNSSNAVSLENWIKDTKDENLKLDLSNVTCSAQNFLWFLVELDYLV FT EEIGFHMSADAKTREWLDTEDGHSWLNSNLEYNIGFFLEDLKISEKLEYLM FT PVLESSKCTKVAITLAK" XX SQ Sequence 8840 BP; 3455 A; 1431 C; 1564 G; 2390 T; 0 other; atctggtgac tgaagaattt taaaagaacc cgcgccatga gttcgataga gagtccgttt 60 aatatacgga ggataaaatt agcagatatt ccggaactgt acttgacaca ggtaagagcg 120 agatataaac ttaaagacaa ggagggagag attgaaaaat ataataaaaa aagacatgga 180 gaaaagaaca aaagtgaagt tcatttgctc gcgtgttgta tttatatagg agagaaaaac 240 gaccttgacc cttattttaa tctttatgtc acttttgcga ttccttattt aaaacaaatt 300 ccccttaatt ctgattccat tcaatacgta acgtttagag attattatga gcagatccca 360 agcatagcta gtgaaattcc catttttgat ccacaaacga cagagtctaa cgaaaaaaac 420 atcgcttatg ctctaaattt cttatacaag caagcgatca ttgaaaaaag ggcaattgac 480 aactataaat tttcactcga tccaagtgca ataatggacc caaatcagac aattaaaaat 540 acgatttatg aaggaaaatt gaacacgagc gtaacaaaaa tagtattaga ggacaagaat 600 agttccgttg tacaaatgtt tacgaaacag ggaagtgatt ctgaagaaga agagtatgaa 660 gaaggttttg atataaagat agaaaagcca aagacagaaa gaaaatctat tatgaacagg 720 gtgagagagg ttataggaat aaataataag gaggagcatg aggatgattc cagcgattct 780 gatggtgaat ttgataaatt ttatgataaa aagggggaca gaattaggac ttcatcaaaa 840 ataaagagta gtccagtaaa caattttgta acggatgaga taaaacagcc gcgagtccaa 900 cgatttgggg ataaagatga tatagattca cttccatctt gggcggaaac ccaaattttc 960 aagctcaaac ttcagaacaa aaataaaaat atacccgatg aaatatttat tgggtccaaa 1020 atcatcgctg aaggattgac cgaaaaaatg aaaaaagccc aattacaagg caaatcactt 1080 tcacttagtc gatttgttga tataataaaa gatataagct caaagcctga cgaacttgtt 1140 cttcgcgaat tagatatgct cacttataat ccacagcgag ataccccagc gtcattttac 1200 aggaaaatag cgcaaatgtc tctccaaatg tttccggata aaaccgcggg atcaaaactt 1260 gtcgatttaa attttcgaaa aaaagtgtcg ccaaatgatg atatgttccc catcataagc 1320 atgaacctgg aaggtgaaga gcttgtgaaa ttcgctgaaa actacctttc gcagcgtaaa 1380 cttgcacagc tgaacaatat taataaaagt gacaataaag aaatgtcaga ggaacctaaa 1440 aaagccggta aacttaatgg tgggagcgcg ccagcatggc gaggtcgtga cggttttagt 1500 agaaataacc attatagtta tagaggtcac aattcaatga tgaaaggtag tgctaataac 1560 tatcgagata attataattc tggttttcag cagaattcat atccccaagg tcgtaataat 1620 tatcaaccca accaacgtgg acgatcaaga ggtaactttg tgcaagatag acaatttgga 1680 agacaaaacc aaaaccgaaa ttatatagat aataggcaga gacaaaataa tccacgtttg 1740 tgtcatgcat gcagttctcc tgaacatttt gttcgtgaat gtccattgat gaatagagat 1800 aatgtaacct gctacgcatg taaccaagaa ggacattttt cgcgtgattg tccacaaaga 1860 cagtaagtat ggtaaagcat ttgccatgtg taaatatatc atttaacaaa tttaacaaat 1920 tacttatatc atcactcttg gacaccggaa gttccgtgtc cttgataaaa agagaaactc 1980 agataaaaaa taaccttaaa actaaaaaat gtgaaaaatt tgcgttacgg ggtggtgacg 2040 gaaaaataat atccgttgtc tcgtcaaaag ttactgtaaa cttacaattt ccgaataaga 2100 aaataattcc aacatgtgat ctttttgtta cacctgaact caattatgaa gtaataattg 2160 ggtatgatat aattaaatat ctaaacttag acttcaataa tcttccccaa gttgtaacag 2220 ctgcatcggt tcattctttc tctttatcaa gtgaacctat agaagtcact cgaattgaag 2280 ccacaaggaa accgtttgca aacgaaatag tcgccaaaaa attaacctat ctgtttccgc 2340 attcttattt tcaggtcaat atcggaacca agaatttaaa tgagttagat ttgcaaaccg 2400 aaattcttcc agatccgatt ttaactcttt ctaacgaaca ggttaatttt agcgatataa 2460 attttcaagc tggaattatt tatggttaca acaaaaataa taaaataatc attttggaag 2520 aaaattctat tattggtgaa atcgggaaaa gacataaaga gttaaaggta ctaaatatgc 2580 tcctgcgaac cgaaaatttg aacccacccg aattggaaat aatcgaaaat gaatttaata 2640 attggaaaat taatagagaa gaaatgataa aagatatcag ttatgatgaa gtaattatgg 2700 ctaaggtggc cgaaacggaa aaagatggta ttttggataa aaatgaaata atagaagtgg 2760 aaaaaatttt gaagcgattt aataaagtat ttgccaggca tgattcagat acaggtttca 2820 gtcgtagata cgtcgtcacc tttgatctta acaaggaaat gagaaaaagc gagaaacgag 2880 ccccttacag gaactcctca tgtgaccaaa atgaaataat agattttctg gatggtttag 2940 aagaaagcgg agttattggc ttgagtcctt ctagctggac ttcaccatcg attttcatta 3000 ggaaaaagaa caaaaaattg agattggtca caaattttaa aacatggcta aatgagtgct 3060 taaatgtacc ccaatggcca attcgtagca taagagatct attttattca attgcaaaag 3120 atatttcgga tttgaaaaag tccagtacat ctgaaatctt attcactggc attgatctaa 3180 aatcaggata ttttacactc gcacttgaac cgagtatccg aaaatttacg gcctttgctg 3240 gtcccgaaaa actttatgaa tacaaaaagc tatctcaagg cttatctatt gcaccaagct 3300 gttttagtca gtttattact gaaattttta gcttaccaca taatatgtca gaccaatttc 3360 gagtaaccac ttacttggat gatgttcttt taattcatac gaaagagtca atttcgccag 3420 ctcttaactg gcttttgagt gtgatacaga aaaatcatct tgtattagac ataaaaaaga 3480 caaatatagc aaaaccaaaa attgattttc taggttatga aatatgcgaa aagggattaa 3540 ggccactaaa acataagctt gagggactct taaagctgga atctcctaaa aatcagaaaa 3600 acatccaaca ttttatggga gcatgtaact tcttctgtag atctttgccg agagcgcaat 3660 ttttactgaa agcattggca gaagaaagct caaatgtgga attcaaggga agtgaaaaag 3720 tagatattgc ggttgaaaaa ttaaagtcaa tttttaaaca agaagggatg acctttcatc 3780 ctattcccgg tgcaaatgag gaatatttgc tcgcggttga ttcaagcatt gacgggtacg 3840 gagccgtttt tggaattggt aattataaag aagaaaaagt ggaaaatttc aaaccggttg 3900 ctttcggatc aggggccttt gaagacaaat taaaacttga aagttctcga aatcgtgaac 3960 ttattggaat ttcgagagcg ctctcatcat ttaaagactt gctatcggaa gcactaccat 4020 taactgtagt ttgtgaccat aaaagtatta gtaatcttaa taacaatgaa gaagcaaaaa 4080 agtacccaat aacgagagtc agaaaggctc tgagcgaaat tttcaaatac ccgaatctca 4140 aaataaagta cgcatcggca aaagatgaaa tcgttcaaat atgtgacgga ttatcccgcg 4200 cgattaccat ttctggccaa attgagccaa ctgagattgt cgagtggatg gaccaaaatt 4260 tacctcaagc ccaagttaac aacaccgaaa ttcttaaaaa atacaaatta agacattttt 4320 cggaaatagc attaactgaa aatacactta aattggagca agaaaaagac gaattctgtt 4380 tggaaatttt atcaaaagcg gataataaaa aatcgtggaa tattggccca catgaatata 4440 aaatcaaaaa ctcgctgatt ttaaagtata attttaaatt aaaagaacat caaatcgttc 4500 ttccgaaagg agtggcaata gaaattctaa ctgtgattca catgagttac ggccatgctt 4560 cggcagagac aattttgcaa aaaattaatt cagaggcact ttttgtacca aatttcagaa 4620 ctatgcttcg agaaataagg gctgaatgtt atctgtgcgc acttaaacag aaaattcata 4680 agaattcgca agttcaaagg cgcatgaagt tagcactctt tccatttcaa aaatgttgta 4740 tagacattgt tgtccttgat aaaaggtctc gagacgcccc ttatggatta agttttttag 4800 acgagttttc agaatttttg gaagttttac caattttaga taaaacctct cattcggtca 4860 ttaaagtttt aactattctc atttttaaat attcgttggg ccgagggtca tcatttttga 4920 gcgataatgg acctgagttc gctagcgaag atttttctaa attcatgaaa actcatggta 4980 tcagacattc gtttataagc ccatataatg cgtcttccaa ttcagtcgaa agatcacaca 5040 agttactaag aaaacatata gaggctgtgc aaagtttggc tgacgagtca aatttttcag 5100 atacagaacg attacagtta gcgctagcaa gtcttaattc caaaccaaaa gcaaatttag 5160 attttaaatc acccacccag atccttacaa atcagcaacc aaatatgata atccgagccg 5220 aagaaatgga ttatgtacaa caaaattaca gtgaactatc agatgtagag aaaagagagg 5280 ctcgggaaga ttggagtacg aatctttata aatttcaaca agaaacagga ttgtcaaata 5340 tacttaaaat tcaagagtca aacgcaacga tgctaacagg agaaataaaa gaaggagatg 5400 tagttattat agtaaatagt aagattcttg gtcatggaaa aatcggaact ggaccatatc 5460 tagttttaag aaatgaaaat ggttcatgcg aacttgaaca tattttgaac aaactgagaa 5520 gtaaacgaaa tgagaaaaat ctaagaaagc tgtttctgac agacaatatg aaattagcaa 5580 ttaagcatgg atacataaaa ctcaacgaaa atggtgagtt tgcaattaga gacaacgcag 5640 gagcgccccc attagaacct aaggtcgagg aaatactgaa aacagaaaaa gaacccagca 5700 gaacggaaaa gaaagttcat ttcgaaagag agaaaaatta caatttcaga cctagaagat 5760 ttgtaagtta caaggaataa gccatgaagg ccttattctt cctttttctg ataagtagcg 5820 ctatttttca caataatttt actaacacag tagttaattt taataatgac gacacgctta 5880 tactccaaat tgagcagaaa aactatcaag ttaacatcca taatcatttt ccggtagtag 5940 attcaatcga aagttcaaaa tgcttcaggg atccgctaac gtacaaaacc attaatttac 6000 taatcgcaaa caatattata aatagaattt cgaatagaac agactatttt ctttgtgagg 6060 actatgaaca ctgctcaaaa ctgcatgaag atcatgataa aattcactcg gaaacactag 6120 agaaatttaa cagacttttc gaaataaccc caaatctgga aagtccaaaa gaacactacg 6180 agagaccaaa atctgtagaa aatcgagaac cacaaatgcc agacgttatg ctagaagacg 6240 ggtcaaagca agaaattaaa cgctgggatg aaaatccact tcttagcgaa ccatatattt 6300 tcagaggaaa aaatgtaggt tcaatcataa aggggatagt aacttctttt aatcctacaa 6360 atacaccgcg ttacaatttt aacaggaatt tgataaaatc aaatattaat tacattccct 6420 acgaaagtaa gtttttaaat agttcaaatg ctgtgtcact cgaaaattgg ataaaagata 6480 caaaagatga aaatctaaag ttagatttat cgaatgtgac ctgctcagca caaaattttc 6540 tttggtttct tgtagaatta gattatttgg tagaagaaat cggttttcat atgagcgccg 6600 atgcaaaaac aagagagtgg ctagatacgg aagacggtca ttcttggctt aactctaacc 6660 ttgaatataa catcggattt tttttagagg atttgaaaat ttctgaaaaa cttgagtatt 6720 tgatgccagt tctggaaagc tcaaaatgta caaaagttgc aattactctt gcgaaataat 6780 ctatattaaa atttacaact atttcacatt cttgagatca aagaagtaac ttaaatgtta 6840 ataagataag aaaacgtaat tataacgaag ttaatagctg ggaaccaaag atcaaaggag 6900 atcaaagata attttgaagt tcaaaccgct ctcgtaatac gaaaaaggta atcataaaaa 6960 gtaagtcgtt tcggaaaaaa taggagttcc acagcctatt cgtgcgagat aaatccgaga 7020 aaattcgaat atgatagaca acaactacaa agagcgacca acaagaaaat ggcgtccgat 7080 tcatcgatct ggtataaaag taagtcgaaa gcgtatttac gatagataca caaatgactt 7140 tttcgtaaaa ttcgaatttc gtaccgatcc cgttaacggg cgagggagca cgcaaaatat 7200 aagactaaca gatctaaaag atgtagatat tatcattaac aatactaaac aatcgatttt 7260 aacacagaaa ttactaatga aaagaataga ttacgggagc gataacatta actcgatttt 7320 actaacaaaa tttttaaata gaatagaaaa agatctagga aacaaaattc gagaaatcaa 7380 ggatgagtgc cataaaaaga aacaagacct gaaagtctta ctaacgcaca atgggtccat 7440 ctttaatttt tcgcaaatat tcgctcagct aacggaactt aatcaaaatt tactaaaatt 7500 aaacaaattc atacgcctgg aaggcataga ttatgacagt tcatatcaag cttccacgcc 7560 tgcggaatat gcattacagt ttaaaaatac tgctgattat actaacaaaa cacctgaaaa 7620 tgaggtgaaa aacacgtcaa cttcaacgtg tgttgtctct acaaaaaata gagcaacgca 7680 agttgaaaaa tcatttcaaa acacgtcaac ttcaacgtgt gttgtctcta caaaaaatag 7740 agcaacgcaa attgaaaaat caattttcat aagaattatt gattatatcc tcaattattt 7800 ctcacaataa aacgaaaata acgaaaaaaa atatgattaa aatttttgtc aaaactatat 7860 tataatgcaa atcaatgaat tacttttata aaatgaatca aaaacgaata tcgaaaatat 7920 ccaaatactt gaacatttat aaccatttga tatattacta ttttattttt tatctaatca 7980 tgctgaaaaa aaaacttttc actgccagta ccgaattact tattaagcta gaataacctt 8040 ttaacccaaa tcctttccat tctcctgtga tacaatgatc tgtaacttaa aattttacat 8100 ttactgcaat tatttataaa aaaaaaaaaa gaagggagga atccattatc caccccaaaa 8160 agaaagaaaa aatatcaaaa attataaaaa aaaatatatg aaatagaaaa acaagataga 8220 aaaaaaaata aaatataaaa aatcgtttct gtacatttaa attattttac tgtaaaaatc 8280 tcttttatat ttttgtgcac ataattatgt ctaacaacag cgccaccgcc aaaacgataa 8340 aataagaaga aaagaaaagg aataagaaga aaatattcag aagagaacga tgactgacga 8400 aagaaagtac atagtacgaa agttcgggcc actcatgccc cacttcaaga aactaacgat 8460 gccagacgga acgaaaaaga ccttcatcga ctgttggaga atatggatca aatgggacca 8520 gagttttgaa gacggtacac gatggtcagc ggagcgcacg gatgaaattg ataacgaaga 8580 agctattcga cgagcctttg aaaaaaagaa gacttggcct tggccaaaat tgtttgaaga 8640 aagaaaagca ctaaaagaaa gaggattcaa caaatatcat ggaaagaaaa agtatgtttt 8700 atttaagaac taattaaaaa acgaaacgac aaaacgaaga aaaaataaca ctaactgcaa 8760 tcaaacaaca gaggggcgag atagagacgg cgcctgaaaa tagcgccacc gcgcgtacgc 8820 gatcttactt atttacaacc 8840 // ID LOA-1_CQ repbase; DNA; INV; 3176 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3176 RA Kojima K.K. and Jurka J.; RT "LOA non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 148-148 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 2..3052 FT /product="LOA-1_CQ_1p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="SEVISEKILNWKVSDEESLSDHRQIEFEYNAGDIIKT FT NYRNPKKTNWILYKKCIDVYEFNSGGEIRTVDQLENASSHLINGILNSYAA FT SCPFQHNSSNREVSWWNDNLAALRKRSRKLFNKAKASGLWDSYRKSLTEYN FT IEVRKSKRKDWNRSCEKIESTPVVARLQKALSKDHTNGLGNLKKADGSFTQ FT DSSETLEVMMKAHFPDSIVVSSESDQASEGEGPTKDWSENAHLKSCEIFTQ FT SKVEWAISTFKPFKSAGRDGIFPALLQHAKDKISIRLTELFRASISLGYIP FT KAWREVRVVFIPKAGKKDKTNAKSFRPISLSSLLLKTMEKILDDFIKGTYL FT KNYPLSKSQFAYQSGKSTITALHSLVTKIEKSLLNKELALCAFLDVEGAFD FT NASHCSMRSAMESRSFDLCVINWIETMLRNREISADFGGERLVIRPQKGCP FT QGGVLSPLLWSLVVDKLLKKLAELGFEVVGFADDICIIVRGKLNNFITNRM FT QSALNFTSKWCRSEGLGINPAKTVIVPFTKRTNTSLQKLSLCGVTIEFQSE FT VKYLGVTLDKKLTWNPHLENVINKGINAFWVCKNTFGKKWGLRPKMIHWIY FT TCIVRPRIIYAAVVWWQKTKQTAVQKKLSKLQRLACISITGAMNSTPTEAL FT NALLNLLPLNQFIELQAVKSALLMNRSVKIQEKDMTGHLELLKTFNLNPTL FT NSIVDWMPTRANYDVPFNVVATSRYIWEQGGPTLRSGSIVFYTDGSKMHNM FT TGAGVFGPGIKKFISMGLYPTVFQAEIFAIIECIHICLNRNYRYSNICIFS FT DSLAALNALKGYTCQSKLVWEAIMLLKQLSLKNTVNLYWVPGHCGIDGNEA FT ADCLAREGSGTNLIGPEPFFGIPDCSLKMELKRWETSMVMSNWNAAIACRQ FT SKRFIYPCEKSSRVLLKLDKRNLRIIIGLLTGHCPCRYHLVKMNKLQDSRC FT RFCNIENETSEHLLCECSALIQRRILFLGKGFLLPKDIWEASPSRVIDFIK FT RVDPNWDECSF" XX SQ Sequence 3176 BP; 1080 A; 528 C; 632 G; 936 T; 0 other; cagtgaagtt atttctgaaa aaatcttaaa ctggaaagta tctgatgaag aatcgctttc 60 agatcatcga caaattgagt ttgaatataa tgcaggagat attattaaaa caaattatag 120 aaatcctaag aaaacaaact ggattttata caaaaaatgt attgatgttt atgaatttaa 180 ttctggcggt gaaatacgaa ctgttgatca attagaaaac gcctcttctc atctgataaa 240 cggtatttta aactcttatg ccgctagctg tccattccaa cacaattctt caaacagaga 300 ggtgtcgtgg tggaacgaca atctagctgc gttaagaaaa agatcaagaa agttatttaa 360 caaagcaaaa gcttctggat tatgggattc atacaggaaa tctttaactg aatataatat 420 tgaagtcaga aaatctaaac gtaaagactg gaaccgctct tgtgagaaaa ttgagagcac 480 accagttgtt gcaagattac agaaagctct ctctaaggac catacgaatg gtttagggaa 540 tcttaaaaaa gcggatggca gttttactca agacagttct gaaacattag aagttatgat 600 gaaagctcat ttcccagatt caattgtggt atcgagtgaa agcgatcagg cttcggaggg 660 tgaaggaccg acaaaagatt ggtctgaaaa cgcccacttg aaatcctgtg aaatattcac 720 tcaatccaaa gttgaatggg caataagcac atttaaaccg tttaaatctg ctggtagaga 780 tggaattttt ccagccctac tgcaacacgc taaagataaa attagtatac gattaactga 840 actttttaga gccagcatct ctctaggcta cattcccaaa gcttggagag aagtacgggt 900 cgtatttata cctaaggctg gaaaaaaaga taagacaaat gctaaatctt tcagacccat 960 aagtctttct tcattactat taaaaaccat ggaaaaaata ttagatgact ttattaaagg 1020 aacatactta aaaaattatc ctcttagcaa atcgcaattt gcttatcaaa gtggtaaatc 1080 aactattacg gcattacaca gcttagtaac gaaaatagaa aaatcattat taaacaaaga 1140 attagcttta tgcgcatttc ttgatgttga aggagcattc gacaatgctt cccattgttc 1200 aatgcgttca gcaatggagt caaggagttt tgatttatgt gtcataaatt ggattgagac 1260 tatgcttaga aacagggaga tttcagctga ttttggtggt gagaggctag ttataagacc 1320 acagaagggg tgtccacaag gtggagtatt atcacctcta ttgtggtcgc tagtagttga 1380 taaactactg aaaaagttgg ctgaacttgg atttgaagta gttggatttg cagatgatat 1440 ttgtataata gttcgcggaa agttaaataa ttttataaca aatcgtatgc aatcagcgtt 1500 aaattttact tcaaaatggt gcaggagtga aggtcttggg attaaccctg ccaaaaccgt 1560 gattgtacct tttacaaaac gaacaaatac ttctttacaa aaactttctc tctgtggggt 1620 aacgattgag tttcaatctg aagttaagta ccttggtgtc actttagata agaaacttac 1680 atggaaccct cacttagaga atgttataaa taaaggaatt aacgccttct gggtgtgtaa 1740 aaatactttt ggaaaaaaat ggggacttcg tcccaaaatg atacattgga tttacacatg 1800 tatagttcga ccaaggatta tttacgccgc cgtagtctgg tggcaaaaga ctaaacaaac 1860 agctgttcag aaaaaattaa gcaaacttca aagattggca tgtatttcaa tcaccggagc 1920 tatgaatagc actcctacgg aggcgctgaa tgctctgttg aatttacttc ctcttaatca 1980 gttcattgaa ttacaagctg ttaaaagtgc attattgatg aaccggtccg taaagattca 2040 ggaaaaagat atgacgggtc atttagagct cctcaaaacc tttaatctga atcctacgtt 2100 gaactcaata gtagactgga tgccgactag ggccaactac gatgttccct ttaatgtggt 2160 cgcaacaagt cgctacatat gggaacaagg aggcccaact cttcgttcag gctctattgt 2220 cttttacacg gatggctcaa aaatgcataa tatgacagga gctggagttt ttggccctgg 2280 gattaagaaa tttatatcga tgggtctata tccaacagtc ttccaagctg aaattttcgc 2340 aattattgaa tgtattcata tttgtttaaa tcgtaactat agatattcta atatttgtat 2400 cttctcagac agtctggctg ctctaaatgc actcaaaggg tatacctgtc agtcaaaact 2460 agtttgggaa gcaattatgc ttttgaaaca attatcttta aaaaacactg ttaatctata 2520 ttgggttcct ggacattgcg gaatagatgg taatgaagcg gcagactgtc tagccaggga 2580 aggatctgga acaaacctga ttggaccgga acccttcttt ggaataccag actgttcctt 2640 gaaaatggaa ttaaaacgtt gggaaacatc tatggtaatg tctaattgga atgcggctat 2700 agcttgtaga caatcaaaaa ggtttattta tccttgtgaa aaatcatctc gagttttact 2760 aaaactggat aaaagaaatc tcaggatcat aattggttta ttaactggtc attgcccttg 2820 taggtatcat ttagttaaaa tgaacaaatt acaggattca agatgtcgtt tttgcaacat 2880 tgaaaatgaa acatcagagc acctgctgtg tgaatgcagt gctcttattc aaagaagaat 2940 tttattttta ggaaaaggat ttttgctacc caaagatatt tgggaagcga gtcctagtag 3000 ggtaatcgac tttatcaaac gagtcgatcc taactgggac gaatgcagct tttaagtaac 3060 gctcttcacg ctaaatggtg ctgggtgaac ttatctgcta taaaaaaagg gtcacaccac 3120 aatattccta taattggtcg cagtggtatt atggctcgac aaaaaaaaaa aaaaaa 3176 // ID Chapaev-5_HM repbase; DNA; INV; 3078 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3078 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 31-31 (2008). XX DR [1] (Consensus) XX CC Chapaev-5_HM is a very young family of autonomous Chapaev DNA CC transposons that can be still active in the hydra genome (they CC are <0.4% divergent from their consensus sequence). The consensus CC sequence was obtained based on a multiple alignment of 25 copies; CC it codes for a 687-aa Chapaev transposase. Chapaev-5_HM is CC characterized by 4-bp target site duplications and 141-bp CC terminal inverted repeats. Based on the TPase identities, CC Chapaev-5_HM forms a distinctive group together with Chapaev-1_HM CC and Chapaev-2_HM. The N-terminal portion of this group TPase CC contains a Chapa-like zinc finger CC (H-X7-C-X2-C-X35-C-X2-C-x36/38-C-X2-C) but is free of the RING CC finger motif. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 547..2607 FT /product="Chapaev-5_HMp" FT /note="Transposase." FT /translation="MPTFSKDHNQCRQVVCTLCMKKSDREISEYFISEIKR FT LISGNINFDDERVPRGICVTCRFLLRKLASGDEEVSIPQLYDFESILIKPS FT TRQTTKCDCIICQISKTKGKGKHPFEKPSQQEVQKEEKSFEKRCTKCLSVI FT ARGLPHNCKEATRRENLKALALADPLGAEQIASFIVSSKEVSSDGTILISR FT FHGKPLEIRPGMLKYCCWVCFLILVFFLFVSLFTGSNATQGLSSEPLTTQD FT MINIQQNIGLSNNGMRKLGSALNQISPVRIVEANFQQKFAAAGTTLKSFFT FT VTTSQLPESNETRHIVHCTNLETLKENIVSSRGLQNSNFLKLGIDGGGSFF FT KASLSLISEHEGEPHSPVQKKSKLITKDNFKNTSVQKQLIVAISENTPETY FT PNVKHILDLLQINEISISEKLVISCDMKLANIVCGIQSHSSKHPCCWCNID FT SAHLENCGQLRTFGGIRDLYKKFVKSGCDAKRSKEFENVVHLPMFAFPDRE FT LILEAIPPMELHLLLGVVNHLIKYLVQVFPKTKQWLDSIHIQMQPFHGGHF FT NGKDCMKVLRKIEELMQLTIAEKAPDATKACQALSSFHQVVVSCFGYTLLP FT DYEAKICDFKDNYLKLPVSVTPKAHAVFHHVQQFIHQHKVGLGVFSEQATE FT ALHSNFKTHWQRFKRNSTHPEYSSQLLSCVVDYNSKKV" XX SQ Sequence 3078 BP; 1021 A; 533 C; 552 G; 972 T; 0 other; caccgtttga aaattttgac ttttttggtt acgaaacatc ccgaaaaaat cctcttaggg 60 tatgaaacta cacatgtcaa gaatactata tcaacttact ttgaaatcca aaattaaaat 120 ccgagagaga tattttgatt taaatttttg aaataatata aaaacattag cagtttttgt 180 ttttgaaaat agcggtattg gggtgggtcg tagtgatttt tttaataaaa taactttttt 240 agacaattaa tttgtaaaga tttcgtattt aataaatttt tttgagaaat gagctataag 300 gggagatatt tccatttcaa gttttgaatc gattaatttt tgaatcgcga attacgatga 360 aaaatttgat acgtgaattg ccgtaaattt gtactttcct aaaatttgta ctcttcggac 420 atttccaatt ttacggactt tttgtacttt tagtacattt tcatacgttt tatgacctct 480 ttgtacttta tgaattagtt taaatgatat cttcaaacag atcacaagtt catctcagat 540 tttgaaatgc ctactttttc aaaggatcat aatcaatgtc ggcaggtcgt ttgcacactt 600 tgtatgaaaa aaagtgacag agaaatttca gaatatttca tcagtgaaat caaaagattg 660 atttctggca acatcaactt tgatgatgag agggttccac gagggatctg tgtcacctgt 720 aggttcctgc ttagaaagtt agcttctgga gatgaagaag tgagtatacc tcaactttat 780 gacttcgaat ccatattgat caaaccatca actcgccaga caactaaatg tgattgcatc 840 atttgtcaaa tttcaaaaac caaagggaaa ggaaagcacc cctttgaaaa accatctcag 900 caagaagtgc aaaaagaaga aaaatctttt gaaaaacgat gcacaaaatg cctatctgtc 960 attgctcgag gactgcctca taactgcaaa gaagcaacac gtcgtgaaaa cttgaaagct 1020 ctggcattgg cagatccgtt aggggcagag caaattgcat cattcatcgt ttcttccaaa 1080 gaggtatctt ccgatggaac tattctgatt agccggtttc atggaaagcc tcttgaaatc 1140 agaccaggta tgttaaagta ttgttgctgg gtttgctttt taatattggt ttttttctta 1200 tttgtttctt tattcacagg atcaaatgca actcagggtc tctcctcaga gccattgaca 1260 acacaagata tgattaatat tcagcaaaac attggacttt ctaacaatgg aatgagaaag 1320 cttgggtcag ctttaaatca aataagtcct gttcgcatag ttgaggccaa ttttcaacaa 1380 aaatttgctg cagctggaac taccctcaaa agtttcttta cagtaaccac ttcacagtta 1440 ccagaatcaa atgaaacacg ccatattgtt cactgcacaa atttggaaac cctcaaagaa 1500 aatattgttt catccagagg cttgcaaaac tcaaacttct taaagcttgg cattgatggt 1560 ggaggatcat tctttaaggc aagtttgagt cttatcagtg agcatgaagg agaacctcac 1620 agtcctgttc agaagaaatc aaaattgatc accaaagaca acttcaaaaa cacaagtgtt 1680 caaaaacaac tgattgttgc catttcagag aacacaccag agacttaccc aaatgttaag 1740 cacattctgg atcttctcca aatcaatgaa atctccattt cagaaaaact tgttatttct 1800 tgtgacatga agcttgcaaa catagtttgt ggaatccagt cacacagcag taagcatcct 1860 tgctgttggt gcaacataga ctcagcacat ttggaaaact gtggtcagct tagaacattc 1920 gggggcatca gagacttgta caaaaaattt gtgaagtctg gttgtgatgc caaaagatca 1980 aaagagtttg aaaatgtggt tcatttgcca atgtttgcct ttccagacag ggaattgatc 2040 cttgaggcaa tcccacccat ggaactgcat ttgctcctgg gagttgtcaa ccacctcatc 2100 aagtacttgg ttcaagtttt tccaaagacc aaacaatggt tggattcaat ccacattcag 2160 atgcaaccat ttcatggtgg ccacttcaat ggaaaagatt gcatgaaagt gctgaggaaa 2220 attgaagagc ttatgcaatt gacaattgct gaaaaagcac cagatgcaac aaaggcatgc 2280 caagcattga gctcatttca ccaggttgtg gtttcatgct ttggatacac tctgttgcca 2340 gattatgagg caaaaatttg tgatttcaaa gataactact tgaaattacc agtctcagtc 2400 acaccaaagg cacatgctgt ttttcatcac gttcaacaat tcattcatca gcacaaggtt 2460 ggtctaggtg tattcagtga gcaagcaaca gaagctcttc attcaaattt caagacccac 2520 tggcaaagat tcaagcgaaa ctctactcat ccggaatatt ccagtcaact cctcagttgt 2580 gtagttgatt acaacagcaa aaaagtttag ttggaaagaa agaaaataaa agactcctca 2640 ttttcaattt ggagaaactc agtttgtgtg tgtgtaaaga atgtatttat ttgtcaaaag 2700 aaaaattttt gaagtgagaa attttgatta tatcaaaaaa ttgaaagatg gttttctaaa 2760 tattttcaaa taaaatttta aaattgtgaa tatgaattat tgtcaaaaat ggcatatttt 2820 ctatcttaat tttacttttt aaattagtta ggctgtatta ggactgatat gacccctttt 2880 gttaagggca ggttgtaacc aacttctatc catttggtga aaatattgaa tattttcaaa 2940 tcaaaatatc tctctcggat tttaattttg gatttcaaag taagttgata tagtattctt 3000 gacatgtgta gtttcatacc ctaagaggat tttttcggga tgtttcgtaa ccaaaaaagt 3060 caaaattttc aaacggtg 3078 // ID BEL-14_DPu-LTR repbase; DNA; INV; 443 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_DP_; KW BEL-14_DPu-I; BEL-14_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-443 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 443 BP; 70 A; 112 C; 121 G; 140 T; 0 other; tgttggcgtt cctgagcccg ccaagagcgt gcgtttcaat ttggacctgg caacgcagcg 60 cgaggggtaa ggaaatcgga gacgaagggc ggagcttatc accacgctac cagttggaat 120 ccattcgtcg cgcgactcac atctccgctc cgtattttcc cttttgttcg gcttcaccag 180 cgcgtgcagt ttatcctttc tgaatcgtca agtagcgtgg cgtggcgtcc tccttcacgt 240 tcgtacttga tctagtttaa ttgtgtgtct acaattgttt gtgtgtgtgt gtgcatctcg 300 aatcgtggga gcccacgacg gtatctcctt tctggatcgc caccttgatt tgtgtgttga 360 gacgcgtggt ttggcagcca ttgtgttcgt ctcgtcctga tttcccgtct gtgccctgtg 420 ttatttgtgt ctattggcag gca 443 // ID BEL-36_CQ-I repbase; DNA; INV; 3198 BP. XX AC AAWU01012157; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-36_CQ_; KW BEL-36_CQ-LTR; BEL-36_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3198 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 225-225 (2011). XX DR GenBank; AAWU01012157; Positions 5289 8486. XX CC 'GGGGG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1171..2121,2125..3198) FT /product="BEL-36_CQ-I_1p" FT /translation="MASAETPQPLEYWIADLLTTQLRLVELWQCVKDGNTT FT TSHLAVSYKEVDELWRHFFDASVEYQSHDDFVEEEQLTFDRVRQDFSELYF FT GVKAFILDSMNHAGDRNKQALDQFASRVHTQAVDRVTSIGSGGSKLCLQNR FT DEKQHTHTCKGVQKVQGSICRSHQLSQTQTNGGSGSTYTQEVQGSFTHTQM FT VTQNASSIVLTIPPTLRHRSENAQSIGDSLRTPTHIDLPERSIHAGEESHS FT ATRQGSFAEIFSIGRKQVGAPLRTDFLLASLKGGNIVQGTKGFPQSALVQD FT QTDSPSNIFKLTGFIMVLKTLIRPKAFMAYLKPKMLLGRATHSTPTKNSAV FT KNTTPRRNSCNGQLGYPNPRRGYKVNNYSRRNANCTGQFVFAHTTSRESRC FT IGQLLTEGSDKTITTTHTKAHMSQQVANHLEEWYSSVRFTGYFGLQLPGRL FT SDINTTTSCTQGCTGQQAVSITAGSCDTGRSTLVPSRLGCKTNTSAGPING FT AGQQFTNTTSKTRSAGQFEWYVSRDNSRFLFHLADYKRNLFLGRELPGREQ FT ICATIMWILLKQMPSPRTGRGGRPATGSTPRNTTNDHRQQLGTVTRKKRSR FT SPKLVWVCKQKVFDPRMYVLCEVDSKVELCCDVNINLNLIFEDTNTSTVGL FT KIKTIHRTLILGLVFIQKWVISAGAE" XX SQ Sequence 3198 BP; 903 A; 785 C; 776 G; 734 T; 0 other; taatggtcct tcgaaccgga tgaagtcacc ttcatccggc cgaagtgagg agaaccagtg 60 caacaacagc cacggacaac caggagaact aggacggttc taggaaggcg cgcttttacg 120 cttggatcgc cgccatcgcg atttaccatc atctgctgag aggagaagta ctgctacctg 180 gaagctggac aagcatatgg aaattcacca ttacggcgag gaaaccgcca tcgctgcaac 240 tacttctttt tactgaatcg gaaattcagc ctggacgatc atcacttcgc acatctaccg 300 gacaaaccac agcgaaccac ggctgcactg atccagcgcg gaattccttt tcataggtaa 360 atattactga tcagtgtatg tctggccgaa gccaagcagt aacctcttct aacaccaaca 420 tccgcaattt gtctttctac cctactacgg agcaaaacaa ctcgttggca gagttgcaag 480 gatcaacttg gatcggtttc acggtttggt gtcgtgtggt attgtaccat acggcaagga 540 gatcgaatgt tcgtcgaaca cgtcgctggt gttatcattt ggaccaactg gtacaccagc 600 tggaaatcta tcccgtttgg ccgcaacaac acttcgagag ttctcgcacg ctggcacgat 660 caagcacgtg gtgccggtgt aaacaaacca tcggcaagct cacgatcggc tttggaaatt 720 ctattcgtgc ggtaccggcg gtacccactc gaggagacga actactaatc ccttggatgt 780 acggaagagg cggcaaagca aactgttatc ccaggtaaat aacaaattgg gtggtatatg 840 tccgccgaag cttactcgga aatacgcaca ccaataattt tcaatcctct tcggcgtact 900 actactattc tggaatcacc ctgctcggcc gagataccca ctcggcaaat cgaaacaaca 960 gcaacgcaac ctgctacaac ccagagaacg tgtatacaac cgtgtgtggc tcgctattca 1020 gacgactgcg acgtatgctt tggatgcggt acgacaattt gtttgtcaag gtaaacacaa 1080 gtatatgtct aatgaagcaa agaccccgac agaagatcta acacacacct ttcttggttt 1140 tgtctggtca actacacact actcaacatc atggcgagtg ctgaaactcc tcaaccgctg 1200 gagtactgga tcgctgattt gttgaccaca cagctgaggc tcgtcgaact ttggcagtgt 1260 gttaaggatg gcaacactac gacttcccat ttagccgtgt cgtacaaaga agttgatgag 1320 ctgtggagac acttctttga tgcttcggta gaatatcaat ctcacgacga ctttgtggaa 1380 gaagaacaac tcacttttga ccgggttagg caagacttca gcgagctcta ttttggtgta 1440 aaagctttca tattagacag catgaatcat gcaggtgatc gtaacaagca ggcactagat 1500 cagttcgcat caagggtgca cactcaggct gtggacaggg tgactagtat tggttctggt 1560 ggatcaaaac tttgtttaca aaaccgggat gaaaagcaac acactcacac ttgcaaaggg 1620 gttcagaagg tccaagggag tatttgtcga tcgcatcagt taagtcaaac ccaaacaaac 1680 ggcggatcgg gtagcactta cactcaggag gtccaagggt catttacaca cactcaaatg 1740 gttacacaaa atgccagctc gatcgttttg accataccac caacactgcg gcacaggtcg 1800 gaaaatgctc aatccattgg ggattcgctg cgtacaccta cacacatcga tctaccggaa 1860 cgatcgattc atgcggggga ggaatctcat tcagctacac ggcaagggtc gttcgcagag 1920 attttctcaa tcggccgtaa acaggtagga gcaccactga ggacggattt tctattggca 1980 tcactcaagg gagggaacat cgtgcagggt acgaagggtt tcccacaatc ggctcttgtt 2040 caggaccaaa cggacagccc tagtaacatt tttaaactta caggttttat catggttctg 2100 aaaaccctca tacggcctaa ataggctttt atggcctact taaaacctaa aatgttacta 2160 gggagagcaa ctcattccac tccaacgaaa aactcagcgg tcaagaacac gactccacgg 2220 cgaaacagct gcaacgggca gcttggttat ccgaatccac gcagaggcta caaggtcaac 2280 aactattctc ggaggaatgc aaattgcaca gggcagtttg tgttcgcgca caccacctca 2340 agggagagtc gctgtattgg acagctgctc acggaaggtt cggacaaaac aatcactact 2400 actcatacca aggctcacat gtcccagcaa gtagcaaatc atctcgaaga gtggtattct 2460 tcagtacggt tcacgggcta ctttggacta caactaccag ggcgactgtc ggatatcaac 2520 accaccacat cgtgcacaca gggctgcaca ggtcagcaag ctgtatcgat cacagcgggt 2580 agctgtgata caggaaggtc cacacttgta ccctcaaggc taggctgcaa aaccaacact 2640 tctgcaggac cgatcaacgg cgctggacag cagtttacta acaccacatc caagacaaga 2700 tctgctgggc agttcgagtg gtacgtttcg agggacaaca gtcggttttt gtttcacctc 2760 gcagactaca aacggaactt gtttttggga cgggagctcc caggacgtga acagatctgc 2820 gcgacgatca tgtggatttt gctgaagcaa atgccatctc caaggactgg acggggcgga 2880 cggccggcaa caggatctac gccaaggaat acaaccaacg atcatcgaca gcagctgggc 2940 acggttaccc gcaagaagcg gtcgagatcg ccgaaacttg tgtgggtatg taagcagaag 3000 gtgtttgacc cgaggatgta tgtgttgtgt gaagtggaca gcaaggtcga gctgtgctgt 3060 gatgtgaata tcaacttaaa tcttattttt gaagacacca acacatctac agtaggtttg 3120 aaaatcaaga ccattcatag aaccttgatt ttggggctag ttttcattca gaaatgggtt 3180 atttcagcgg gggcagaa 3198 // ID CR1-47_HM repbase; DNA; INV; 4300 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-47_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4300 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1875-1875 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 146..1057 FT /product="CR1-47_HM_1p" FT /translation="MSSKVIDATDISFSXKYDFTGFGQGNTNLTKLDKAVF FT AAFQDMQLAIKLLNEKVIAQENTINELKIQVEKNQNKLESKTFSDALIGKS FT TESKQQRQQICSLISTVNKHTQSRELNVIVIGILENNTVEDSKQVMDFFGA FT VGLENVEIKHIRRLKSSKLKATSNNSNSNIIQVSLSNKIEKELVLNTCRHH FT TIKQYEKVFVHEDRTPAEQADFNAKRSQIVKQNQELKAAGLLDQPFRNVIH FT RRTGKIVCIDVIESTRSKQYVFKSATIALKSQRSNINSSNIEGNSTSATIT FT ISSTTPNNLKT*" FT CDS join(927..1592,1498..4068) FT /product="CR1-47_HM_2p" FT /translation="MFLNQQQSHSKANEVTSIVAILKETVHQLPSQFHLPH FT LTTSKPEIHLKSNNKRIRPIRLKLKSKPNLSQLSSKSSTSSSLISGQLCFW FT ANNPCSLNNLKRGELIARISSKPMHEKPHIIFLAETWYNNTSDTFVKGYEL FT HRADRESRGGGVAIYTRNDIITTEVRNKQLNSKAIEQVWCIIKFDGESLLL FT GCIYRPRDINDIVLTQTIESITGSGQTGHVNLVVYIDLVTLMTLSLLKQLN FT QLQAAAKQAMLTYNCSCMLLYGDFNLSHTWYESIEINGKVATVGHVIDECP FT RDLRFQSCLEDNNLTQLITFPTYRHSNLVEPSSTLDLIISDDPDRFIKVSE FT DDPLGFTKTGQSHCVIAGILAVHLKETAPKLSRPRLIWNRADYDAISTKIA FT SNNWHTMFEGHTANDNFSLLVNIYKKYIKEHIPSTLVPPKIKNAPWVTPEV FT KAAVKEKHTLWLKYIAAGRHTHENLRNAHKVACKNVKKVVKEAILSYEDDL FT VYASKSNPKCLHLHIRSKQRVHEQIRSLQTMNNSTTTKKDEICALLNNYFQ FT TVFAVEQEGSMPYFNLRTQTICQFDEDSIHEREVEKKLKGLSENKAMGYDK FT IHPRVLKKCATSFAIPITIIYKKSISTGDVPDLWKKSNITPIFKKGSKLQP FT SNYRPISLTSIVCKVMESLIHDCIMKFCTKHNLISKAQHGFVPKRGCVTNL FT LEAHDILTQSMHLGFPADVIYTDFAKAFDKVPHRRLLHKLRAYGISETLLK FT WVKNWLTSRQQRVVLDGITSEWVPMTSGVPQGSVLGPLLFVLFINDLPDTI FT VHHTKLYADDGKIIGVIKSLQDANLLQADIDKIVDWSHKWLMPFNINKCRV FT MHVGRTNKSTHSYTMAQLDGTRCTLEETIVERDLGVIVSNDLKVKTQVETA FT VSIANQMFNRLRKAFCSRRLVLWKTLYLAYIRPHLDFAVQVWSPHLKKDIV FT MMEKVQRRVTKTISEIKHLPYEERLQKLKLTSLEDXRKRGDLIQQFKFTQQ FT IDEVNFHFPLTLINSKTSYNLRGHNQRLMPQXVKGCIERENFFTNRVTKQW FT NALSQQAVDSPSVHFFKKFLL*" XX SQ Sequence 4300 BP; 1598 A; 800 C; 704 G; 1189 T; 9 other; aatatatata tatayggaaa aaacacaaaa aactaacagt tctatttgtg ttaataagta 60 aataataaat attgaatatt tcatatcagg gaggtaatcc actgttattc agggtggtag 120 ccctcccacc atcaagcatc atactatgtc ttcaaaagtt attgatgcaa cagatatcag 180 tttttcttyg aagtatgact ttactggatt tggacaagga aacaccaact taactaagct 240 ggacaaagca gtatttgcag catttcaaga tatgcaattg gcaataaaat tattaaatga 300 gaaagttatt gcacaagaaa atacaataaa tgaactgaaa attcaagtag aaaaaaacca 360 gaacaaattg gaatctaaaa ctttcagtga tgctctcatt ggtaagtcta ctgaaagcaa 420 acaacaacgt cagcagatat gttcactaat ttcaactgta aataaacaca cccaatcaag 480 agaattaaat gtcattgtta ttggaatctt agaaaataac acagttgagg atagtaaaca 540 agtgatggat ttttttggag ctgttggatt agaaaatgtt gaaattaagc atattcgaag 600 actaaaatca agcaaactaa aagcaacgtc aaataactct aactctaaca tcatccaagt 660 atctctctca aataaaatag agaaagagtt agttctaaac acctgcagac atcataccat 720 taagcaatat gagaaagttt ttgttcatga agatagaact cctgctgagc aggctgattt 780 taatgccaaa cgatcacaaa tagtcaaaca aaatcaagaa cttaaagctg cagggctgct 840 agatcagcca tttcggaacg tcatccatcg ccgcaccggt aaaattgtct gcatcgacgt 900 tattgaatca accagaagta aacagtatgt ttttaaatca gcaacaatcg cactcaaaag 960 ccaacgaagt aacatcaata gtagcaatat tgaaggaaac agtacatcag ctaccatcac 1020 aatttcatct accacaccta acaacctcaa aacctgagat acacctgaaa agtaataata 1080 aaagaatccg cccaataaga ttgaaattaa agtcaaaacc caacctctca cagttatctt 1140 caaaatcatc aacttcgtct tcattaatat caggacaact atgtttctgg gcaaacaacc 1200 catgttcatt aaacaattta aaaagaggrg aacttatagc tagaatttca tccaaaccaa 1260 tgcatgaaaa acctcatata atttttcttg cggaaacatg gtacaataat acatctgaca 1320 catttgtaaa aggatatgaa ctgcatcgag cagacagaga gagcagagga ggaggtgttg 1380 caatatacac aagaaatgat ataattacga cagaagttag aaataagcaa cttaattcta 1440 aagctattga gcaagtatgg tgtataatca aatttgatgg tgaatcattg ttactaggtt 1500 gtatatatag acctcgtgac attaatgaca ttgtccttac tcaaacaatt gaatcaatta 1560 caggcagcgg ccaaacaggc catgttaact tataactgct cctgtatgtt attatatggt 1620 gattttaatt taagccacac atggtatgag tctattgaaa taaatggtaa agtcgcaacc 1680 gttggacatg tgatagatga atgccccaga gatttgagat tccaatcttg tttagaagat 1740 aataacttaa cacagctaat aacttttcca acctaccgtc attccaatct tgttgaaccg 1800 tcatcaacac ttgacctaat tattagtgat gatccagata ggttcataaa agtctcagaa 1860 gatgacccat tggggtttac aaaaacaggt caatcacact gtgttattgc tggtatattg 1920 gctgttcatt taaaagaaac agcaccaaaa ctctcacgtc ctcgtcttat ctggaatcga 1980 gccgactatg atgcaatatc aacaaaaata gcttcaaaca actggcatac catgtttgaa 2040 ggtcatacag ctaacgataa tttttcctta cttgttaata tatacaaaaa atacattaaa 2100 gaacacatac catcaacttt agttccacct aaaataaaaa atgcaccgtg ggttacacca 2160 gaggtaaaag cagcagtaaa agaaaaacat acgttatggc tcaaatacat tgctgcaggc 2220 cgccatactc atgaaaattt acgaaatgca cataaagttg catgcaaaaa tgttaaaaaa 2280 gttgttaaag aagccatact ttcatatgaa gatgatttag tttatgcttc taagtctaac 2340 ccaaaatgcc tacacttaca tattcgcagc aaacaacgag ttcatgaaca aattcgcagc 2400 ctacaaacaa tgaacaactc aaccactaca aaaaaagacg aaatttgcgc attactaaat 2460 aattatttcc agacagtgtt tgcagtygaa caagaaggtt ctatgccata ttttaatcta 2520 cgcacacaaa ccatatgtca gtttgatgaa gatagcattc atgagcgtga agtagaaaaa 2580 aaactaaaag ggttatctga aaataaggcc atgggttatg acaaaataca tccacgcgtt 2640 cttaaaaagt gtgcaacatc ttttgcaatc ccaatcacaa taatctacaa gaaatctata 2700 tccacgggtg atgttccaga tttgtggaaa aaatctaata ttacaccaat ttttaaaaaa 2760 ggaagtaaac ttcaaccatc aaattacaga ccaatatcac taacatccat agtctgcaaa 2820 gtcatggaaa gtctcataca tgattgtata atgaagtttt gcacaaaaca taatcttata 2880 tccaaagctc aacatggttt tgttcctaaa cgagggtgtg tcacaaatct tctcgaagca 2940 catgacatac ttacacaaag catgcaccta gggtttcctg cagatgtaat ttatacagac 3000 tttgccaaag catttgacaa agttccacac aggcgcctgt tacacaaact gcgagcctat 3060 gggattagtg aaaccctgct taaatgggtt aagaattggt taactagtcg gcaacagaga 3120 gtagtattag acggaataac atctgagtgg gtccccatga ctagcggtgt gccacagggc 3180 tcggtccttg gtccactact attcgtctta tttattaatg atttgcctga caccattgta 3240 caccacacaa aactttatgc cgatgatggt aaaattatcg gagtcatcaa atctctgcaa 3300 gatgccaacc ttcttcaagc tgacatagat aaaatagttg actggtcgca caagtggtta 3360 atgcctttta acatcaataa atgcagagtt atgcatgttg gacgcactaa caagtcaaca 3420 cattcataca caatggcaca gctagatgga actcgttgta cactggaaga aacaatagtt 3480 gaacgagacc ttggggttat tgtttccaat gacttaaagg tcaaaactca agttgaaaca 3540 gcagtctcaa ttgcgaatca aatgttcaat cgtctccgaa aagctttttg tagccgcaga 3600 ctagttttat ggaaaacact gtaccttgca tacattcgtc cacatctaga ttttgcagta 3660 caggtttggt ctcctcacct aaaaaaagat attgtgatga tggagaaagt gcaaagacgt 3720 gttacaaaaa caatatctga gataaaacat ttaccatatg aggaaagatt acaaaagctg 3780 aaactaacat cgcttgagga tygcaggaaa cgaggtgacc taatccagca gtttaaattt 3840 actcaacaga ttgacgaagt taactttcat ttccctctaa ctctaataaa ttcaaaaaca 3900 agttacaatc tccgtggaca caatcagcga ttaatgcctc aacyagtcaa aggttgcatt 3960 gaacgagaaa acttttttac aaatagagta acaaaacaat ggaacgctct ttcacagcaa 4020 gcagttgatt caccatctgt acatttcttt aaaaaatttt tattgtaaat ytcttgatca 4080 gctgctttty gcagcttaac tgggtcaaca taggagagcc tgatgtagct cttcttcatt 4140 agtataaatt acatttacac tacaagatgt acataaaagt gtttttttta ttcttcaata 4200 aacatgatag gacatrggct attttaatgc acacttgtat ttatatggaa ttatttgtac 4260 taattttgaa ataaataaat aaataaataa ataaatatat 4300 // ID CR1-66_AAe repbase; DNA; INV; 3349 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-66_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3349 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1154-1154 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 19..3255 FT /product="CR1-66_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="YSKRFQPSNETPAISHDVQPGCTPVSLVEASYPLVTV FT EPILPATSSHPGPVYEYGEEVFQTLFAGKYSCDSSNFSPVTSVASSDIPTR FT KSDTKSAAPDDFASDNPANRLPNVITDDLDNNQSCHALRFYYQNVRGLRTK FT VDSFFLAVSECDYDVIVLTETWLDDRFQNAQLFGSKFIVYRTDRNAANSSK FT RRGGGVLIAVSTHLNSFIDPTLTSIALEQLWIKISTPSRTISVGVIYLPPD FT RRDNMDYIRDHLNSMNSVLSNLALNDCVLQFGDYNQPNLLWEASDDGFLRV FT NSQSILSTAGSTLVDGFYFHGLTQLNGIPNRNSRFLDLVLGNDEARYSCNV FT TEAIEPLIDLDVDHPALDVILNLPDSITFEDCSDNSQMYDFRRADYAGIHA FT AISRIDWQPIDDANNIDDAVTYFNHVVQDIIAQNVPLCRPKPKPPWSNSHL FT RQLKRNRAKSQRHYCRNRSQFTKQEFNRASNAYRTYNRLLYRRYVVRTQTS FT LRRNPKRFWSFVNSKRKEVGLPRSVHLDGQTATSEQEKCFLFAQHFQKTFH FT DSPATPDQIAAAIEYTPRDVFDFRSFTIDEQTVQTAFRKLKMSHAPGPDGI FT PSCILKNCSHSLATPLAKLFSRSLQEGKFPDEWKVSSMFPVYKKGDKKDVH FT NYRGITSLCSCSKLFEILINDALFNSCKSFISTAQHGFFPKRSVTTNLTEF FT VSLCLRTLDSGGQIDTIYTDLKAAFDRVDHGILLAKLDKLGVSHSLVRWFE FT SYLTNRLLYVKIGSSQSESFTNSSGVPQGSNLGPLLFLLFINELSQLLPPG FT CRLFYADDVKIYMIIDSTADCELLQSRVNIMANWCSSNFLTLSVNKCNVIS FT FHRRQKPIIFDYRICDTVLNRVDQVKDLGITLDQEMNFKPHYNDVIAKANR FT QLGFIFKLSDDFRDPLCLKALYCSLVRSILEFSVVVWCPYQSCWIARLESV FT QRRFLRYALRHLPWRDPLNLPAYEDRCRLLGIETLEQRRRRAQAVFAAKLI FT VGDIDSPELLRQLNIYAPERVIRQRNFLQLVPRNRLYGMNEPLRASADALN FT EAFTAFDFNLPLISFIRRLSLS" XX SQ Sequence 3349 BP; 896 A; 788 C; 682 G; 983 T; 0 other; atcgctgttc gtacctgata ctcgaaacgt tttcagccat cgaatgaaac accagccatt 60 tcgcacgatg ttcaaccggg atgcacgcct gtcagcctcg tggaagcctc ctatcctctc 120 gtcacagtcg agcctatcct gccagcgacc agcagccatc ccggtcctgt gtacgagtat 180 ggagaagagg tcttccaaac tctctttgca ggcaagtaca gttgtgattc gagcaatttc 240 tctccggtaa cgtctgttgc ttctagcgat attcctacac gtaaatcgga tactaagtcc 300 gcggcgcccg acgattttgc ttccgacaac cctgccaacc gccttcctaa cgttattact 360 gatgatttgg acaacaacca atcgtgtcat gctctgcggt tttattacca aaatgttcgt 420 ggacttcgta ccaaggtgga ttctttcttc ttggctgtca gcgaatgtga ctacgacgtc 480 attgttttga ccgagacatg gctggacgac agatttcaaa atgcgcaatt gttcggatcc 540 aaatttattg tctaccgcac tgatcgaaat gcagcgaaca gctcaaaacg ccgtggtgga 600 ggcgtcctaa tagcagtgtc tacgcatttg aacagtttca tcgatcccac tttgacgagc 660 atagcacttg agcagttatg gattaaaatt tcgacgccaa gccgaacgat tagcgttgga 720 gttatttatc ttcctccaga ccggcgtgac aatatggact acatacgcga tcatttgaat 780 tccatgaatt ctgttttatc caatttggct ctcaacgatt gcgtattgca gtttggtgat 840 tacaaccagc cgaatttgtt atgggaagca tcagatgacg gatttcttcg agtgaattct 900 caatctatac tctcaaccgc cgggtctaca cttgttgatg gtttttattt ccacggttta 960 acgcaactta atggaatacc gaacagaaat tcacgatttt tggacctggt gctaggcaac 1020 gatgaagcaa gatactcctg taacgtaacc gaagcgatag aacctctcat cgatcttgac 1080 gttgaccatc cagccttgga cgtaattttg aatcttcctg attcgataac atttgaagat 1140 tgtagtgaca attctcagat gtatgacttt cgccgagctg attacgctgg aattcatgct 1200 gctatttctc gtattgattg gcagcccatc gatgacgcta ataatattga cgatgccgtg 1260 acatatttta atcacgtagt tcaggatatt attgcccaaa acgttcctct gtgccgtcca 1320 aaacccaaac ctccatggtc caacagccat ctacgccagc taaaaagaaa tagagcaaaa 1380 tcacaacgtc actactgccg caatcgttcg cagtttacga agcaggaatt taatcgcgcc 1440 agcaatgcct atcgtactta taaccgtctg ctgtatcgtc gatatgtagt tcgtacgcaa 1500 acatcacttc ggagaaatcc aaaacgattc tggtcgtttg tcaattccaa gcgaaaggaa 1560 gttggtttac ctcgatcagt acacctcgat gggcaaactg caacttcaga acaagagaaa 1620 tgttttctat tcgcgcaaca ttttcaaaaa acgtttcacg actcccccgc cacacctgac 1680 caaatagctg ctgctattga atatactccc agagatgttt tcgacttccg ctcttttact 1740 atcgatgaac aaactgtcca aacagccttc agaaaactga aaatgtcaca tgcgcctgga 1800 ccggatggaa ttccgtcttg cattttaaag aattgctccc attcgctggc cactcctttg 1860 gccaagttat tttcccgatc actgcaggaa ggcaagtttc ctgatgaatg gaaagtttcc 1920 agcatgtttc ctgtctataa aaaaggcgat aagaaagatg tgcataacta cagaggcatc 1980 acatctttat gctcctgctc caaattattc gagattctca taaacgatgc tttgttcaat 2040 tcttgtaaat cttttatttc aactgctcag catggatttt ttccgaagcg atcggttact 2100 acaaacctaa cggaatttgt ttctctctgc cttcgtactc tggattccgg tgggcaaatt 2160 gatacaatct atacggatct aaaagcagcg tttgaccgag tggatcacgg cattctgcta 2220 gctaagcttg ataaacttgg agtttctcat tccttagtgc gttggtttga gtcatatcta 2280 acaaaccgct tgctatacgt gaaaattgga tcatcgcaat ctgagagctt cacaaattca 2340 tctggagtac cccagggcag taatcttgga ccgcttctgt tcctgctgtt catcaacgaa 2400 ctctctcagc tgctacctcc tggatgtcgg ttgttctatg cagatgacgt taaaatctac 2460 atgatcatcg acagcactgc ggactgcgaa ttgctgcaat cacgggtgaa cataatggca 2520 aattggtgct cttcaaactt tctgacactc agcgtcaaca aatgcaatgt catttcattc 2580 caccgcagac agaagccaat cattttcgat taccggattt gtgatacagt tctcaaccga 2640 gtggatcagg ttaaagattt gggcatcact cttgaccagg aaatgaattt caagccacac 2700 tacaatgacg tgatcgcgaa ggcaaaccgg cagcttggat ttattttcaa gctttctgat 2760 gacttccgcg accctttatg tttaaaagca ctgtattgct cgttagtccg ttccattttg 2820 gaattcagcg ttgttgtctg gtgtccgtac caaagttgtt ggatcgcaag attggagtca 2880 gtacaacgca gattccttcg gtacgcactt cgacatcttc cctggcgaga tcctctaaac 2940 cttcctgcct acgaggatcg ctgtcggttg ctgggaatcg aaacgttgga gcagagaaga 3000 cgtagagcgc aagctgtttt tgcggctaag ctgatcgttg gtgatatcga ctctccggag 3060 ttactaaggc aattgaatat atatgcacct gaacgagtaa taagacaacg caactttctt 3120 cagctcgtgc ctcgcaatcg tttgtatggc atgaatgaac ctcttcgtgc ttctgctgat 3180 gcgctcaacg aagcctttac tgcttttgat tttaatttgc ctttgattag ttttatccgt 3240 agattgtcat tgagctagat ttttatgttt tttttttcat taagacaaac agatgtcaga 3300 tgaataacaa cttcaataaa taaataaata aataaataaa taaataaat 3349 // ID Gypsy-7_DWil-I repbase; DNA; INV; 3226 BP. XX AC scaffold_181088; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_DWil_; KW Gypsy-7_DWil-LTR; Gypsy-7_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181088; Positions 35373 38598. XX CC 'TACGT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 798..3226 FT /product="Gypsy-7_DWil-I_1p" FT /translation="MELDEVMALRVADLRNKLAELNLSTVGRKRDLQNRLL FT THYDIKDQDSEDEEVESVVTARGELPINNCSRSWYTLKDVEGSVSYFSGSG FT SPDIDSWVQEIEECAITVQWDKLQTFIYAKQLLAAAAKSFIRSQHGINNWE FT ILKAALRKEFGSKRSAAEIHRILSQRSQKREETLHEYLYALMEIAKPVNLD FT DESLIEYFVKGIPDSQVNKAILFQARNLQELKFQIETYQKVRASHKSLNKF FT EAQSGGKINQGKIIDTSKNRKCYSCGDTSHIRRDCPKNDQKCFKCGKVGHR FT MAQCKLENPIKYERNTNVVENEDVVIQPSSEFTPSGLELKDIVYSNVIFKR FT LVDTGAGLCLMRRNIYFMIGSKPLLGGARHLTGIGDSKVLTFGSFIIPVKV FT DGLVMNIDIEDRDKSPDINLSHLNHLIKGEVEKLIRNYQPKRNANAPVSMK FT IILSDEIPVYQHPRRLALCDQEIVDKQIQEWLAEKIIKTSTSEYASPVVLV FT SKKNGQKRLCCDYRKLIEKKDVPLRVARWAMYLQDFDYVIVHRSGTQMRHV FT DALSRFACFMLVEDTIKHRLKEAQLQDEWTKAVRTLVEKESYKDFYIENEI FT LFKDPNCELIVVPSAMEHEIIRLAHTQGHFSVKKTQDLIEKSYYIPELRDK FT VIRVVHSCVECIVINAKAGKREGYLTSIEKGDKPLCSYHIDHVGPMELTNK FT RYNYLLVIIDGFSKYVWLYPTRSTGVDEVIRCLEVQATNFGNPVRLISDRG FT AAFTSHIFKRYCEDHDIVHLLITTGVPRGNGQVERIHKIVIPMLAKMSLDN FT PASWYKYV" XX SQ Sequence 3226 BP; 1087 A; 538 C; 784 G; 817 T; 0 other; tacaaacata taacactgcg tgagtgaaag agacaacaaa caaagttctt cgaagtgaca 60 aagcaaaaac ttttgtgtcg tcatcattgt gtaagagcaa gcgagatagg ctgtgctaaa 120 ttgttgcata aaatctttgt tgcccaaata cgttacgacg cagacaatac acgcacgagc 180 atgtttggta acggacggcg atcagtgaat ttttctagaa cagtctagaa atattcagca 240 gaggcgaacc gttggcagag gcaaagctgg acgtagcgtg aggcaaaacg tcagcagagg 300 cagaggcaac ccgttggcag aggcgaagcg ttggcagagg caacccgttg gcagaggcaa 360 agctggacat agcgtgaggc aaaacgtcag cagaggcaga ggcaacgcgt tggcagaggc 420 gaaccgttgg cagaggcaac ccgttggcag aggcaaagct ggacgtagcg tgaggcaaaa 480 cgtcagcaga ggcagaggca acccgttggc agaggcaacc cgttggcaga ggcaaagctg 540 ggcgtggtgt gtgataaaac gtcagcagag gcagaggcaa cccgttggca gaggcagaag 600 ctgcacgcgg caagagacaa aacgttagaa gaggacagac aacctgctgg cagaaattaa 660 acgaagactt cgtgggacag gagcggactt gggtaaattt tttggggact atatgtatat 720 agttctaacc ttaattgaat ttactaaaac taaccttagc attaattatt ggtattgaac 780 aaagaaaata ttgtaatatg gagcttgacg aggttatggc attgcgagtc gcagatttgc 840 gaaataagtt ggccgaattg aacctaagca cagtgggaag aaaacgtgat ctacagaata 900 gacttcttac tcattacgat attaaagacc aagatagcga ggatgaagaa gtagaatcag 960 tagttacagc tagaggcgaa ttgcccataa acaattgtag tcgatcttgg tacaccttaa 1020 aagatgttga gggcagtgta tcctattttt ccggctcagg tagtcccgac attgattcgt 1080 gggtacaaga gatagaagag tgtgctatta cagtacaatg ggacaagtta cagaccttta 1140 tatacgccaa acagttgtta gctgcggcag ctaaatcatt tattcggagt caacatggta 1200 taaacaattg ggagatatta aaagcagcac tgcgcaaaga atttggttcg aaacgatcag 1260 cagctgagat tcatagaata ttatcgcaac gttctcagaa aagagaggag actttgcatg 1320 aatatttgta tgctttaatg gaaattgcca aaccagtgaa tttggacgat gagagcttga 1380 tagaatactt tgtaaagggc attccagact cacaagtaaa taaagctatt ctatttcaag 1440 ccaggaatct acaagaactg aagtttcaaa tagaaacgta tcagaaagtt cgtgcttcac 1500 acaagtcgtt aaacaagttt gaggctcagt caggaggaaa gattaatcag ggtaaaataa 1560 ttgatacatc taagaatcgg aaatgctata gttgcggtga tacatcacat atcagacgag 1620 actgtccgaa gaacgaccaa aagtgtttca aatgtggtaa agtaggacat cgaatggcac 1680 agtgcaaact agaaaatcca attaaatacg agagaaatac aaacgtggta gaaaatgaag 1740 atgtagtaat acagccgtca agcgaattta ctccgtcagg gttagaactt aaagacatcg 1800 tatattctaa cgttatattt aaaagactcg tagatactgg cgctggatta tgtcttatgc 1860 gaagaaatat ttattttatg attggatcga aaccgttatt aggcggagct agacatttaa 1920 caggaattgg cgacagtaaa gtgttgactt tcggtagttt tatcatacca gtgaaagtgg 1980 atggtttagt tatgaacata gatattgaag acagggataa gagtccagac ataaatcttt 2040 cgcatttaaa ccatttgata aaaggggaag tagagaaatt aattcggaac taccagccaa 2100 aaagaaatgc aaacgcacca gtaagcatga aaattatatt gagcgacgaa atacctgtgt 2160 atcagcaccc gagacgttta gccctttgtg atcaggaaat tgttgataaa caaattcaag 2220 aatggctagc agagaaaata attaaaacca gtacttctga gtacgcgtcg ccagttgtct 2280 tagtctcgaa aaagaatggt cagaagcgtt tatgctgtga ctacagaaaa ttaatcgaaa 2340 agaaggatgt tcctttgaga gttgctcgtt gggctatgta tcttcaagat tttgattatg 2400 ttatcgtaca tcgatcaggt actcaaatga ggcacgtcga tgcgttgagt cgttttgcat 2460 gttttatgct tgttgaggac actattaaac atcgattaaa agaagctcag ttgcaagatg 2520 agtggaccaa ggcagtaagg acgctagtcg aaaaagaaag ttataaagac ttctatatag 2580 agaacgagat tctctttaaa gaccctaatt gtgagttgat tgtagtgcct tcggcaatgg 2640 aacatgagat tattagactt gcacataccc agggtcactt ttccgtcaag aaaacgcaag 2700 atttaatcga gaaatcatac tatattcccg aattgagaga taaggttatt cgtgtggtac 2760 atagttgtgt ggagtgtatt gttattaacg caaaggctgg caaaagagag ggatatctaa 2820 cttcgataga aaagggtgac aaaccacttt gttcttacca tattgatcac gtcggaccaa 2880 tggaacttac aaataaacgg tataattact tgttggttat tatagatggt ttttctaagt 2940 acgtttggtt gtatcccact aggagcacag gcgtagatga agtgatacga tgtttagaag 3000 ttcaagcaac aaattttggg aacccagtac gattgatatc tgatagaggc gcggctttta 3060 catcgcatat tttcaaaagg tattgcgagg atcacgacat cgtgcattta ttgattacca 3120 ccggagtacc gcgaggaaat gggcaggttg aaaggattca caagattgtt attccaatgc 3180 tagctaaaat gagtttagat aatccagctt catggtataa atacgt 3226 // ID CR1-22_HM repbase; DNA; INV; 4410 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-22_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4410 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1850-1850 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 216..1280 FT /product="CR1-22_HM_1p" FT /translation="MELSNETRKKVLEEMYSVYKKETELMFKEQEKNLISI FT ISANTKMTNDRMAKVEKDTHKNAKIIKSLENKVASIVESLNFHKELFDRKI FT KTLSDSHEKDNVLSKGAIKTYNDNVEVMNNTNIESVRILNTRIDELNKKLK FT SAPLSNKLESFATILKGKSAEEVQHRHRLINIVTAETENRQNRASNVIIVG FT LPKSDDYDNIIAANKFFNACGVDSAKIGYISRIKSSKINNTIHNSNIVKVS FT LINEQSRSEVLQKCSHHNIAEFKGIFCREDRTRSQQEAFNELRSKMKKLND FT ELYAADVLDKPFRNVIHKRTGEICCIDVQKSNELHKYVFASPASTLAPHRK FT RLLTTVQTATT*" FT CDS 1099..4290 FT /product="CR1-22_HM_2p" FT /translation="MLRMYLTNHFGMLSTSVPEKSAALMFKSQMNYINMSL FT PPQQAPLHRTANVFSQPCKLQPHNISPIPQTKAKRKQHLPNNSLQDLQSQA FT TSPRLKLPSINFFRFWANNPCSLNSEKLHELVARLTSIDKSEWPHVIFFSE FT TWFSATSDTTIAGYQLHNTDRDGRGGGVAIYIIDGLANSKVTDVTLTSTSV FT EQIWHIIHFENESILLGCMYRPIDKNDDTLTQILASIKAAQQILPKLKCST FT MLLYGDFNFSHTSYKSCNVGGGVETTAHVTEQRPGDIKFQECLKDCYLTQL FT VTFPTYRNNMLATPCSTLDLIITNNPDRLISINEGNSLGHTPMGKAHCLLV FT GLFAVYFNKNKLTSPSRRRLIWSKANFDAISLHITSKEIPTNLSANDLYSW FT FIDVYNEAVKLHIPSTTTPFKIKQDLWVTPKVLEAVKLKRSTWAKYVAAGK FT DSHAKLRNEHQAACKHVKKVVNAAVLHYEQQLVSSFKEFPKRLYSHVKNKL FT KVHNHISMLETSDDNITEDSKFITSTINNYFHSVFVEEPPTPLPIFEERTK FT STCEIDESLFTAATVQNYLFCLDDTKSMGLDGVHPRVLKNCAKAFAIPLSL FT IFKQSFQTGNVPDLWRKSNVTPIFKKGNKLKACNYRPVSLTSVPCKVMERI FT IKERIMKHCTLHNLISKSQHGFVNHRGCLTNLIESXDILTEAAYRGHIVDI FT IFTDFAKAFDTVPHKRLLHKIQAYGIRGQLFHWISAWLSERKQRVVIGEQT FT SEWKRVTSGVPQGSVLGPLLFVLFVNDYPDGISNYTKLYADDSKIIGIINN FT NQSSSDYETLQTDIDNAVSWSHKWLMHFNIEKCKVMHVGCRKNKSTHTYTM FT ADINGNRHNLSPTCIERDLGVLISDNLKVKAQVEAAASAGNRMLGRLKKTF FT RCRSLQLWKKLYQLYVRPHLEYCVQAWSPHLKSEIVLLERVQKRATKVISS FT IKHFNYEKRLQLLDLTTLEERRIRSDLILQYKIAHQLQEVKFYIPQAQPFL FT LNSRYNLRRNSRCLKPQLIKRNSERQHFFTNRVTNRWNKLAQNVVDAPTVH FT FFKKHV*" XX SQ Sequence 4410 BP; 1594 A; 902 C; 744 G; 1167 T; 3 other; ttttttcgtg agaagacgtg tttttgcaag agctataatt taaagaaaat aatggtagta 60 tttttaattt ttagtatttt ataactaaaa attcaataat ttaataaaga tttaacaaaa 120 aattaatata tatatcggaa agtaaacaca gttaagataa aacctttttg tgttaataag 180 atagtattat agaaaataaa gaaattacgg cagcaatgga actttctaat gaaactagaa 240 aaaaagtatt ggaagaaatg taytcagtat acaaaaaaga aacggagtta atgtttaaag 300 aacaagaaaa aaacttgata agcataataa gtgcaaatac caaaatgact aacgatcgaa 360 tggctaaagt tgaaaaggat actcacaaaa acgctaaaat aatcaagtcg cttgaaaata 420 aagtagcgag tatcgtagaa agtttaaatt tccacaaaga actttttgat agaaaaataa 480 agactttatc tgattctcac gagaaagata atgttttaag caaaggagcg ataaaaactt 540 ataacgataa tgtcgaggta atgaacaata caaatataga atctgtacgt atattaaaca 600 ctcggataga cgagcttaac aaaaagctaa agagtgcacc attatccaac aaattggaat 660 catttgcaac catacttaag ggtaaatcag ctgaggaagt ccaacatcgc catcgactaa 720 tcaacatcgt cacggctgaa actgaaaatc gacaaaaccg ggcatctaat gtcataatcg 780 tagggctacc caaatcagat gattacgaca atataatagc tgcaaacaag tttttcaatg 840 catgtggagt tgacagtgcc aaaataggct acatcagcag aataaaatct agcaagataa 900 acaacacaat ccataactcc aacatagtca aggtctcttt aataaatgaa caatctcgat 960 cagaagtgct tcagaaatgt agtcaccata atattgctga attcaaagga attttctgtc 1020 gtgaagatcg aactcgttca caacaggagg cattcaatga gctacgaagt aaaatgaaaa 1080 aacttaacga cgaactatat gctgcggatg tacttgacaa accatttcgg aatgttatcc 1140 acaagcgtac cggagaaatc tgctgcattg atgttcaaaa gtcaaatgaa ctacataaat 1200 atgtctttgc ctccccagca agcacccttg caccgcaccg caaacgtctt ctcacaaccg 1260 tgcaaactgc aaccacataa tataagccca atcccacaaa ccaaagcaaa acgtaaacaa 1320 catttgccaa ataactcatt acaagacctg cagtcacagg ctacttcgcc acgtttaaag 1380 ttaccttcga taaacttctt ccgcttctgg gcaaataacc catgctcatt aaacagcgaa 1440 aaactccatg aacttgtagc aagacttact tctattgaca aatctgaatg gccacacgta 1500 atattctttt ctgaaacttg gttcagtgct acgtcagaca ctaccattgc tggctatcag 1560 ctgcataaca cagaccgtga tggtagaggg ggcggtgttg caatctacat catagatggc 1620 ctcgctaaca gcaaggttac cgacgtcaca ctcacctcga catccgtaga acaaatctgg 1680 cacataatac attttgaaaa cgaatcaata cttcttggtt gtatgtatcg cccaatagat 1740 aaaaacgatg acacgctcac tcagattctt gcctctatta aagcagcgca acagatattg 1800 ccaaaattaa agtgttcaac aatgcttttg tatggtgatt ttaattttag ccacaccagt 1860 tataaatcat gcaatgttgg tggtggtgta gaaacaaccg ctcatgttac tgaacaacgt 1920 cccggtgata tcaaattcca agagtgtctt aaagactgct atttaactca actagtaaca 1980 ttcccaacat accgcaataa catgcttgca actccatgta gcacgcttga cctcatcatc 2040 actaataatc ctgatcgcct aatctcaata aacgaaggaa attcgcttgg acatacacca 2100 atgggaaaag ctcactgtct tttagtcggt ctttttgcag tatacttcaa caagaacaaa 2160 ctaacatctc ctagtcgccg acggcttatc tggagtaaag caaattttga tgcaatttca 2220 ttacatataa cttctaaaga aataccaacc aatctctctg caaacgatct gtacagttgg 2280 tttattgacg tatacaacga agctgtaaaa ttgcacattc catcaacaac gactccattt 2340 aaaattaagc aagatctttg ggtcacacca aaagtattgg aagcagtaaa attaaaacgt 2400 tcaacttggg caaaatacgt tgctgctggt aaagattctc atgcaaaatt acgtaatgaa 2460 catcaggctg catgcaaaca cgttaaaaaa gtggtcaatg cggctgtcct tcattacgaa 2520 caacaactag taagcagttt caaggaattc ccaaaacgtc tgtactcaca cgtaaaaaac 2580 aagcttaaag tacataatca cattagtatg cttgaaacta gcgatgataa catcaccgaa 2640 gactctaagt ttataacttc aacgataaac aattattttc attcagtttt tgtcgaagaa 2700 cccccaacac ctttaccaat atttgaagag agaacaaaat caacttgcga aatcgacgaa 2760 agtttgttta cagctgctac agtacaaaac tacttattct gccttgacga cactaaatca 2820 atgggacttg acggagtaca tcctcgtgtt ctaaaaaact gcgcaaaagc ttttgcaata 2880 cctctttcac tcattttcaa gcaatccttt caaactggaa atgtcccaga tctttggcgt 2940 aaatcaaatg ttactcctat cttcaaaaaa ggaaacaaac tcaaagcttg taattatagg 3000 ccggtatctc ttacctctgt accgtgcaag gtaatggaga gaattataaa agagaggatt 3060 atgaaacact gtacactaca caatctaatt tcaaaatctc agcatggatt tgtaaatcac 3120 agaggttgtc ttacaaattt aattgaatcc crtgacattc tcacggaggc agcttaccgt 3180 ggacacattg ttgacataat atttacagac tttgccaaag cttttgatac agtaccacac 3240 aaaagactcc ttcacaagat tcaggcctac ggcattagag gccagctatt ccactggatc 3300 tctgcttggc taagtgaaag aaaacaacgt gtagttattg gggaacaaac ttccgagtgg 3360 aaaagagtta caagtggagt ccctcaagga tcagtacttg gaccactgct atttgtgctt 3420 tttgtcaatg actatccaga tggcattagc aattatacca agttatacgc cgatgatagc 3480 aagatcatag gaattataaa taataatcaa tcatcctctg attatgaaac actgcaaacg 3540 gacattgaca acgctgtaag ttggtcacat aaatggctta tgcacttcaa catcgagaag 3600 tgcaaagtaa tgcacgttgg ttgccgcaaa aataaatcaa cacataccta taccatggca 3660 gacatcaacg gtaaccgaca taacttatct ccaacgtgca ttgagcgcga ccttggggta 3720 ctaatatccg acaacctcaa agtgaaagca caagtagaag cagcagcttc ggccggaaat 3780 cgaatgcttg gtcgcctcaa aaagacattt cgatgccgta gcctccagct ttggaagaaa 3840 ttgtaccagc tatacgttcg accgcatcta gagtattgtg tacaagcctg gtctccacat 3900 cttaaatcgg aaattgtcct tctcgaacgt gtgcaaaaac gtgcaacaaa agttatatcg 3960 tcgattaaac attttaatta cgaaaaacga ctacaactac tagatctaac cacgctggaa 4020 gaacgcagaa ttcgcagtga ccttatactg caatacaaaa ttgctcatca gttacaagaa 4080 gttaaatttt acataccaca agcccagcct tttttgttaa attcacgata taatctgaga 4140 agaaacagcc gatgtttaaa accacaactg atcaaaagaa attcagaacg tcagcacttc 4200 tttacaaacc gagttactaa ccgatggaac aaattagctc agaatgtagt tgatgcaccc 4260 actgtgcact tttttaaaaa acacgtttaa aatgctgctt cggctgcagc aatgtckatt 4320 gtcagcatag gaaagcctga tgtgtttccc ttcatcaatt caatcaagaa gttgtattga 4380 ttactgtaat aaataaatta aatttaaatt 4410 // ID Gypsy-9_DWil-LTR repbase; DNA; INV; 158 BP. XX AC scaffold_180702; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_DWil_; KW Gypsy-9_DWil-I; Gypsy-9_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-158 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180702; Positions 1663408 1663565. XX SQ Sequence 158 BP; 58 A; 33 C; 21 G; 46 T; 0 other; tgttatgtat catttaaaat ctaaaataag catttcccaa cattgtaaaa cctagtattc 60 cttagatatg gtaacattgt cataaaacat caataaacgg tcagtctagc ataagctctc 120 aagaagaaca cgcgtacctc tcttctgtgg acacaaca 158 // ID Gypsy6-LTR_Dmoj repbase; DNA; INV; 966 BP. XX AC scaffold_6496; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6_Dmoj; KW Gypsy6-I_Dmoj; Gypsy6-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-966 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1059-1059 (2009). XX DR Genome; scaffold_6496; Positions 26417505 26416540. XX SQ Sequence 966 BP; 286 A; 145 C; 253 G; 282 T; 0 other; tgtgaggcgg cgccccagtt accgcctcac gattatttga aatgaagaat gtttattttt 60 tatgaaagag agaggagatg ttgatttgcg aattggaatt aagagagtga agaaatgttt 120 attttagggt tgaaatgaag aaatgtttat ttttattgga atcgaagaaa cgtttgttgt 180 tattgaattg ttatcggaat ttatgaaata gttagtttat tttgtagtca ttaagattta 240 gagttaagtt agtaaattaa gtaggtttaa ttagggttag ttttgtagtc ggtaagattt 300 agctgtagcg gttaggaatt aagttagtga taattaagta aaaattagta tttgatataa 360 ttttgtcatt tgaattgata tttgaaaatg ccgcccaaag cgctgacgat ttatcggcga 420 tcggcggcgc aaccctgcgt ggagagttgt gagcggagag ttgcgtcgcc gacgccgagc 480 gctttgggcg agcgcaagaa aagagacagt ttttagtcga gttttgggag tcgaatcgag 540 aagagagtcg ggattgagta gcgagaagga gcggttgacc gcgaagagaa ggtgatagag 600 ttccttttaa atttaagcgc gtagtggaat cttgtgaaac gcggctaggt taagtgacat 660 tttgaataaa agaagaaaga aacacggaaa gagatcgacg actttcattg gcgtaaacgg 720 cgactggagc aaaatttgac ggcgacgagg agtcaagaat ttgggtgaga agacgctgga 780 gaagaaaaag gtacacgtga gtggcggtcc ccatattaga tttccccccc ctctcccatt 840 tcccggccaa taacgttaga tattcccccc ccccctggtg tctcactatt tcatacaaat 900 aaatttcttg tgagtacacc ctgggtagac tgagtctacc ccagcgttta aatttactac 960 cttaca 966 // ID BEL-215_AA-I repbase; DNA; INV; 5907 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-215_AA_; KW BEL-215_AA-LTR; BEL-215_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5907 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 887-887 (2011). XX DR [1] (Consensus) XX CC Positions [4956-5357] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 540..5357 FT /product="BEL-215_AA-I_1p" FT /translation="MSDNETPRGVLKGISKKNLLETLTESHSENVPPVPSV FT ADQKIAKQVFEQKKMMESKLEVLTDRRDLIMEKLMRMKETLRGEGVSIHLL FT NLHLETLRRCADEFDKIHAEISALLPKEQRTAVHQEYAVFEDVHNQLYVDL FT QTGIARAQEACRMNPGISNSLSIPGFQQPIVVQAAAPQLHAPFPKFDGTPE FT NWYSFKSLFKSIMMKYPNETPAMKILHLRNSLEGDARSKIDQDVVNNNDYE FT KAWKILEDAYEDKRLILDTHIDAILDCAAISKDNRGKSISQLVEICSKHVD FT ALDGHSYPVEGLGELILLNILYKKLDKETQEQWEMKVPRGEVPDYETFMEF FT LRERGRVLQRTNRSQQQQVQQSTVQARQRVTAGYKATAPSVKSFVQTTREV FT CPCCQAEHAIYKCSKFQNMDYSERKSFATRAGLCYNCLKQNHRVSDCQSDQ FT GCKVQGCGRKHHSFLHPKDEIRRSEEKISKPEVAGEESGPNEQEIPCATTM FT CAQIDTAKQQVLLSTAEVLVVGDGGCTVKCRALLDSGSDSHILTEKIARKL FT KLRMHRVDYPISGLNDIQTSAKYLVSTKIRSRINSYVTSDLDFLVISRITS FT HLPLVETDIKSWKIPQGVQLADPSFHVPGEIDMIIGNEVFFDLVKKGRLKL FT ENSAIMLAETEFGWVAGGSVPVTKAAPRARICQFSHQERILNETMTKFWEI FT EDVCHSSKLSAAEASVEQHFKETHARDDQGRYVVRLPFNDKKFQLGDSFVM FT ARKRYERLMLSLSKNPEKRMQYSEFMAEYLVLGHMKEVNVPEGDGYYIPHH FT AVYKASSSTTKTRVVFDASAKSASGVSLNDAVMVGPTVQSNLVEVIMRFCG FT HQVALTADVPKMYRQVCMHPDDCKYQRILWCNDQNEMKVFELQTVTYGIAS FT SPYHATKALMQLVEDDGDQFPLASVVIKKDSYVDDFLTGGQTARTVINKYQ FT ELSALLDRGGFGVHKFCSNSQEVLAAIPEQLHEQQVSFEESGINNTIKTLG FT LIWNPAEDYFTFRVQPIDGRVSSTKREVLSDIGRLFDPLGFLGPVIAMAKM FT VMQDVWRLRLDWDDLLPDELLCQWQHFREELPVINQKQKRRCVVRDRASWT FT ELHGFSDASKRAYGAVVYTKCIAADGTVEVNLVASKSRVAPLKPMTIPRLE FT LCGVKLLAELVQKVTTSMTIDFDEVNLWCDSQIVLCWLKKSPLTLNQFVAN FT RVAAILELTPGCSWCYVRSEENPADIISRGSMPEDLVRNETWWNGPPLLWQ FT QAYEGQTPDELEEALIPDIKPARSLAIISKISPINLNRVSDFRKLQRSWVY FT VLRFIHLKKKKNVISVTADEMAIAEKMILKLVQKEMFSELLKQLQLKSGMR FT HNLSNLAPFIGEDGLIRVGGRLKYSAIPYDGKHQVLLPEKHHVTEILVRKL FT HEENHHVGHGGLLAIVRERYWPIKVKLVIKKVISKCQVCGRCNPAPGIQFM FT GNIPESRVNPAPVFSKVGIDYAGPFMLKLGGRSPKVYKAYVVVFVCMAVKA FT IHFEVVSNLSTDNFIAALHRFVGRRGLPSDIYSDNGTAFVGANHELVALKQ FT LFEDQQHQHKVEEFCAVKGIFGILFLREALISEVFGRLV" XX SQ Sequence 5907 BP; 1695 A; 1172 C; 1570 G; 1470 T; 0 other; ttttggtcca tccgaaccgg atgttgtgag ggttttccgg aagtgtttgc tgtttcgcgt 60 gtggaagaag agcaccaagt gcaaaagtgc ggtccctgac cgaaaactat aagacgatta 120 tgtccgtggc atagtgcgtg ttcgggaaaa ttgactgcat cgtgtgcagt agcgtggccc 180 gtggccaaag tgcggtttga tttgatccta tcacaaactg catttcgttc ggagttcgtg 240 tcgatccgtg atcgatgatt ggttgggatt gttgtggtcc gtgactagag acaatcagtg 300 agcgtgtacg cgtcgtgatt acgtcgtgaa tacgtcgtac agcccgtggc tgaatttcaa 360 agttgactgt gcccgtggca cagagtgaat tgacattcga acgtaccgtg tgcgtatttt 420 ggctgtggcc aaagaaacag ttccaaaacg tttatcgggt tcgtgacacg cgtggtgaat 480 gtggatgtgg tccgtgacgc tttgcgagct cgattgaaca ggagtgaaca agttcgaaca 540 tgagcgataa tgaaacaccg cgtggagtgt taaaaggcat atcgaaaaag aaccttttag 600 aaacgctaac ggaatcacat agtgaaaacg tgcctcccgt tccgtcggtg gcagatcaaa 660 agattgcgaa gcaagtgttc gaacagaaga agatgatgga gtcaaagttg gaagtgctga 720 cagatcggcg agatctgata atggagaagc tgatgcgaat gaaggagact ttgagagggg 780 aaggtgtaag catacatttg ctaaaccttc atctcgaaac actgcggcga tgtgcggatg 840 aattcgataa gattcatgca gaaatttcgg cactgctacc gaaagaacag cggaccgcag 900 tacaccagga gtacgcggta ttcgaggatg tccataatca actttacgtt gatttgcaaa 960 caggaattgc acgagcgcaa gaagcatgtc gcatgaatcc aggtatttca aatagcttaa 1020 gcattccagg tttccagcag ccgattgtag tgcaagcagc tgccccgcag ctacacgctc 1080 cgttcccgaa attcgacggg acaccggaga attggtatag ctttaagagc ctcttcaaaa 1140 gcatcatgat gaagtaccct aacgagactc cagctatgaa gattttgcac ttacgcaatt 1200 ctctcgaagg cgatgcgagg agcaaaattg atcaagacgt cgtgaacaac aatgactacg 1260 aaaaagcctg gaagattctt gaagatgcct acgaagacaa gcggctgata ctggatactc 1320 acattgatgc tattctggat tgcgctgcaa ttagcaagga caaccgtgga aaatcgattt 1380 cgcaattagt ggaaatttgt tcgaagcatg tagatgcctt ggatggtcac agctatcctg 1440 tcgaaggttt gggggaattg atcctactta acatccttta caagaagctt gacaaggaga 1500 cacaagagca atgggagatg aaggtaccta gaggagaggt gccagattat gagacgttca 1560 tggaattcct tcgtgaacgc ggacgcgttc tgcagcgtac aaatcgctcc cagcagcagc 1620 aggttcagca gtcaacggtt caagcaaggc aacgtgtaac agcaggttac aaagcaacag 1680 caccatctgt gaagtcgttt gtacaaacaa ctagagaagt gtgcccatgc tgtcaggcag 1740 agcacgccat ttacaagtgt tcgaaatttc agaacatgga ctactcggaa cgcaagtcct 1800 ttgccacaag ggcaggcttg tgttataact gtttaaagca gaatcatcgg gtaagcgact 1860 gccagtcaga tcaaggatgc aaggtccaag gatgcggtcg taaacatcat agttttcttc 1920 atcccaaaga tgagattcgt cggagtgaag agaagatttc gaagccagaa gtggcgggcg 1980 aagaaagcgg gccaaatgaa caggaaattc catgcgcgac tacgatgtgt gcacagattg 2040 atacagcgaa gcagcaagtg ttgctgtcaa ccgcagaagt tttggtggtc ggtgatggag 2100 gctgcaccgt caaatgtcga gcattgttgg attcaggatc cgacagtcat atcctcacgg 2160 agaaaatagc aagaaagttg aagctgagga tgcaccgcgt tgattatcct attagcggtc 2220 tcaacgacat ccaaacctcg gcaaagtatc tggtatcgac aaaaattcgc tctcggatca 2280 actcgtacgt gacgagtgac ttggattttt tggttatatc gagaattaca tcacatcttc 2340 cattggtgga aaccgatatc aagtcttgga agattccaca aggcgtacaa cttgctgatc 2400 cgtcattcca cgtgccagga gaaatcgata tgattatcgg caacgaggta ttctttgatt 2460 tggtaaagaa ggggcgcctg aaattggaga acagtgcaat tatgttggct gaaactgagt 2520 ttggttgggt tgcaggagga tcagtgccag tgacaaaggc tgcaccacgt gcacgtatat 2580 gccagttcag tcatcaagaa cgaatcttga acgaaactat gacgaagttt tgggagatcg 2640 aggacgtgtg ccatagttct aaactatcgg cggcagaggc atctgttgag caacacttta 2700 aagagaccca tgcgagggat gatcaaggcc gttatgtagt gcgactccca ttcaacgaca 2760 aaaagtttca acttggtgat tctttcgtta tggcgaggaa gcgctacgaa cggctgatgc 2820 tctcattatc gaagaatcca gaaaagcgca tgcaatattc cgaatttatg gcagagtacc 2880 ttgttcttgg ccatatgaag gaagtgaacg tcccagaagg agatggatac tatataccac 2940 accatgcagt gtataaggca tccagttcta cgacgaagac cagagttgta tttgatgctt 3000 cggcaaaatc ggcttctggt gtatcattaa atgacgctgt tatggttggg cccaccgtgc 3060 agagtaacct ggtagaggtg attatgcggt tttgcggaca tcaagtggct ctgactgctg 3120 acgttccgaa aatgtaccgg caagtttgta tgcatccaga cgactgcaaa tatcaacgga 3180 ttttgtggtg caatgaccag aatgagatga aggtgtttga gcttcaaacc gtaacgtacg 3240 gtatagcgag ttcaccgtac catgcaacga aggcgttgat gcaattagtt gaagacgacg 3300 gcgatcagtt ccctttggca tccgttgtca tcaagaaaga cagttacgtc gatgacttcc 3360 tgactggtgg acaaacagcg agaacagtga tcaacaaata tcaagaactt tcagcgctgc 3420 ttgatcgagg aggattcggc gttcataaat tctgttccaa cagccaggaa gtattggcag 3480 caataccaga acaactacat gaacaacaag ttagctttga ggagtctggc atcaacaaca 3540 ctattaagac cctcggactt atttggaacc ccgctgagga ttacttcaca ttccgtgttc 3600 aaccaattga tggaagggta tcatccacaa aaagagaagt attatcggat atcggtcgtc 3660 ttttcgaccc gttaggattt ttaggaccgg taatagccat ggcaaagatg gtcatgcaag 3720 acgtttggcg tctccggttg gactgggatg atttacttcc ggatgaattg ttgtgccagt 3780 ggcagcattt tcgtgaagaa ctaccagtga tcaaccagaa gcaaaagcgt cgatgtgtgg 3840 ttcgtgatag agcttcttgg acagaactgc atggatttag cgatgcttca aagcgggcct 3900 acggcgcagt tgtttatacc aaatgcatag cagcagatgg tacggtggaa gtgaacctgg 3960 tggctagcaa gtcaagagtg gcaccattga aaccgatgac gattccacgc ctggaattgt 4020 gtggtgtaaa attattggct gagttggtgc agaaggttac aacgtccatg acgatcgact 4080 tcgatgaagt caacttgtgg tgtgattcgc aaattgtttt atgctggttg aagaagtctc 4140 cgttaacgtt gaatcaattt gtggctaatc gagtggcggc tattttggag cttacgccag 4200 gctgcagttg gtgttacgtt cgatcggagg aaaatcctgc agacattatt tcgagaggat 4260 cgatgcctga agatttggtg cgaaacgaga catggtggaa tgggccacca ctactgtggc 4320 aacaagcgta tgaaggacaa acacctgatg aattggaaga agcgctaata ccggatatca 4380 aaccagcaag aagtttggcg atcatcagca agatttcgcc aattaatttg aaccgtgtga 4440 gcgacttcag aaaactgcaa agatcatggg tgtatgtgct gcggttcatc catctgaaga 4500 aaaagaaaaa tgtgattagt gttacggcag atgaaatggc aattgcggaa aaaatgattc 4560 ttaagttggt acaaaaggaa atgttcagtg agcttttgaa gcaattgcag ttaaagtcag 4620 gaatgcgaca taacttgtca aacctggctc cgttcattgg agaagatggc cttatacgtg 4680 ttgggggtcg tctgaaatat tcggcgatac cttacgatgg taaacatcag gtgttacttc 4740 cggaaaagca tcacgtaaca gagattttgg ttcggaaact gcatgaagaa aatcatcacg 4800 ttggacatgg aggactacta gctattgtca gagagcgata ctggccgatt aaagtaaaat 4860 tggtaataaa gaaagtgatt tcaaagtgtc aagtctgtgg aaggtgcaat ccagcacctg 4920 ggattcagtt catgggaaac attccggagt ctcgagtgaa tccagcgcca gtgttttcaa 4980 aggttggaat agattacgct ggtccattca tgttgaagtt gggaggaaga agcccgaagg 5040 tctacaaggc ttatgtggta gtgttcgttt gtatggcggt gaaggcgatc cactttgaag 5100 ttgtatcaaa cctttcaacg gataatttca tagctgcact gcatcgtttt gtgggcagac 5160 gaggattgcc aagtgatata tattcggaca acggaacagc gtttgttgga gccaatcacg 5220 agcttgtggc attgaagcaa cttttcgaag atcagcagca ccagcataag gtcgaagagt 5280 tttgtgcagt gaaaggtatt tttggcattt tattcctccg agaagccctc atttcggagg 5340 tatttgggag gctggtgtaa aatccatgaa gcaccacctc aagcgagttg tcggcgaaac 5400 caaattgacg ttgaagaact gacaacattt ttggcacaaa cggaagcgat attgaattct 5460 cggccattag ttccagtgtc tgatgacccc aacgatttat caattctgac cccgtttcat 5520 tttttgattg ggcggtcagg tttgacagta ccggaaccat cttacaagga cgagaagata 5580 ggacggctaa gtagatggca acacatccaa ttgatgcagc aacatttttg gtcacgttgg 5640 tccaaagaat atttgcatca tttgcaaacc cggcataagt ggaacaacag tgtgcaacaa 5700 atcaatgtag gtgcgttggt tttgttactc gacgaaaatg tgcctaccca tcagtggcga 5760 cgaggacgaa ttgcagcggt tcacccagga gatgacggga tagtccgcgt ggtgactgtc 5820 catactacat ctggcgacta caagagggcg attacgaaga tcgcattcct accagctgtt 5880 gagccagagg actcaacggg gggtgaa 5907 // ID Gypsy6-I_Dpse repbase; DNA; INV; 4951 BP. XX AC Unknown_group_213; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6_Dpse; KW Gypsy6-LTR_Dpse; Gypsy6-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4951 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1057-1057 (2009). XX DR Genome; Unknown_group_213; Positions 20605 15655. XX CC Positions [1826-2305] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 206..2533 FT /product="Gypsy6-I_Dpse_2p" FT /translation="MPSIPMILANLGKAKYFTTLDLKSGYHQIYLAEHDRE FT KTSFSVSSGKYEFCRLPFGLRNASSIFQRAIDDILREHIGKICFVYVDDVI FT IFSKNETEHIQHINTVLKYLIDANMRVGPEKTQFFKESIEFLGFIVTKDGA FT TSDPKKVKAIQDFPEPKSVYSVRSFLGLANYYRVFIKDFAAIARPISDILK FT GENGSVSRHVSKKIPVEFNEAQRDAFHRLRNILASEDVMLMYPDFQKPFDL FT TTDALASGIGAVLSQGNRPITMISRALKQAEQNYATNERELLAVVWALGRL FT QNFLYGSREINIFTDHQPLTFAVSDKNTNSKIKRWKSYIDQHNAKMFYKPG FT KENLVADALSRQNVNALQDGPRSDAAAVHSEISLTYTVETTDKPLNCFRNQ FT IVLEESRFPLKRNLLLFRSKTRHLISYTDKSSILKTFKEVVNSGVVNAIHC FT DLPTLASFQHDLILHFPATQFRHCKNLVQDITDKNEQLEVITAEHNRAHRA FT AQENIKQVLRDYYFPNMATLAKEVVANCRVCTKAKYDRHPKKQELGETPIP FT SYTGEMLHIDIFSTDKKQFLTCVDKFSKFAVVQPILSRTIIDVTSPLLQII FT NLFPATKTIYCDNEAAFNSETITSMLRNAYGICVVNAAPLHSTSNGQVERF FT HSTLTEIARCLKLDKKISDTTELILKATIEYNKSLHSVTREKPIEVVHAVA FT HVRRENVKDRLIKAQQDNIKRCNAARQNRVFEVGEKVFQKNNKRLGNKLCS FT EQKVEADLGTSVLIKGRVVHKDNLK" XX SQ Sequence 4951 BP; 1562 A; 1172 C; 1036 G; 1181 T; 0 other; atgtgccgac tgacttctga cttcataaac aaagaagtca aacaattact ggacgatggc 60 ataatcaggc ccatccagat cgccttacaa cagaccaaca tgggtggttg ataaaaaagg 120 cactgactca tacggtaacc caaaaaaaag gttggtaatc gatttcagga aactgaacga 180 aaaaaccata ccggaccgct atcccatgcc aagcatcccc atgattttag cgaatctagg 240 gaaggcgaaa tacttcacta ccctcgactt aaagtctggg taccaccaaa tctaccttgc 300 ggagcatgac cgcgaaaaga cttcgttttc agtaagcagc ggcaagtacg agttctgtcg 360 gcttccgttt ggcctgcgca atgcaagcag catattccaa agggcaatag acgatatcct 420 gagggaacat ataggaaaga tttgctttgt ctatgtagac gacgtcataa ttttttctaa 480 aaacgaaacg gagcatattc aacacatcaa cactgtgtta aaatatctaa tcgacgccaa 540 catgcgtgta ggccctgaaa agactcaatt cttcaaggaa agtattgaat tcctgggttt 600 cattgttacc aaagacggag caacatccga tcccaaaaag gttaaagcca ttcaggactt 660 ccctgaaccc aaaagtgtct acagcgttag gtcattcttg gggttagcta actactatag 720 ggtctttatt aaggactttg ccgctatagc gcgcccaata tcagacatac ttaaagggga 780 aaatggttca gtgagtaggc acgtatcaaa gaaaatccct gttgaattta atgaagctca 840 acgcgatgca tttcaccgtc tgcgaaacat tttggcatca gaagatgtca tgctaatgta 900 cccagacttc cagaaaccgt tcgatttaac cactgatgct ttggccagtg gaattggcgc 960 ggttctttcc caaggcaata ggccaataac aatgatttct cgcgccttaa aacaggccga 1020 acaaaactat gccaccaacg aaagagagtt gctggccgtt gtgtgggctt taggcagatt 1080 gcagaacttt ctgtacggtt ccagggaaat taacattttc acggaccacc agccgctcac 1140 gttcgctgtg tcggacaaga acacgaattc taagattaaa agatggaaat cttacatcga 1200 ccagcacaat gccaaaatgt tttataaacc gggcaaagag aatctggtgg cggatgccct 1260 ttcaaggcaa aacgttaatg ctttgcaaga tgggccccgt tcggacgctg cagccgttca 1320 cagcgaaatt tccctgacct acacagtcga gacaacagac aaacctttga attgttttag 1380 gaaccagatt gttctagagg aatcgcgttt cccgctaaag cgaaacctgt tgctgtttcg 1440 cagcaaaaca cgccacttga tcagttacac tgacaagagc tcaattctca aaactttcaa 1500 agaagttgtg aattctggag tcgttaatgc aatacattgc gacctcccca cgttggcaag 1560 tttccaacac gacctgattt tgcatttccc tgccacccaa ttcaggcact gtaaaaatct 1620 tgtgcaagat ataacggaca aaaacgagca gttagaagtc atcactgccg agcacaatcg 1680 agcgcacaga gcagctcagg aaaacatcaa acaagtgctt cgtgactatt attttcccaa 1740 tatggccact ttggccaaag aagtggttgc aaactgcagg gtttgcacta aagccaagta 1800 cgataggcac ccgaagaaac aggaactcgg ggaaacgcct ataccaagtt atactggcga 1860 aatgttgcat atcgacattt tttcaacaga caagaaacaa ttcttaacct gcgtagacaa 1920 gttctccaaa tttgcagtcg tccaaccgat attatccaga acaataatcg atgtcacaag 1980 ccctctactg caaattatta accttttccc cgcaactaag acaatttatt gcgataacga 2040 ggcggccttc aattccgaga ccatcacttc gatgctccga aacgcctacg gcatttgcgt 2100 tgtgaatgca gctccgcttc acagcacctc aaacggccag gtggagcgct ttcatagcac 2160 cctgaccgaa attgcgagat gcctcaaact agacaagaaa atcagcgaca ctacagaatt 2220 aattttaaag gctacgatag agtacaacaa aagtttacac tcagtaaccc gagagaaacc 2280 aattgaggtc gttcatgcag tagctcatgt tcgtcgcgag aacgtcaaag acaggctgat 2340 taaggcacag caagacaata tcaaaaggtg caacgcagcc agacaaaacc gtgttttcga 2400 ggtaggagaa aaggtgttcc agaagaacaa caaaaggcta gggaacaagt tatgctcaga 2460 acaaaaggtc gaagcagacc tgggaacgtc tgttctcatt aaggggaggg tggtccataa 2520 ggacaacctg aaataagttt tccccacaat tagctagtaa gccaacttaa taaactccga 2580 aactcaaaag caaataaaca gactgacgga cactatcaat aaaattatca gctcccgcaa 2640 aggcgatttg gttgacactc ctcatttatt cgaaacactg ctggcaaaaa tagaatactg 2700 aacacggaaa ttcaaaattt aatactaact ataacgctgg caaaagcaaa catagtaaat 2760 cccacaattc tcgaccacac cgatctgaaa tcacttattg agcaagacac cccaatagta 2820 agcttaatag aagcttctaa gatcaaggtc cttcagtccg agaacattat ccacatatta 2880 atagcctacc ctaaagttga gtttaggtgc caaaaggtct ccgtctaccc cgtttcaaac 2940 caacaaacca ttctgcgact cgacgaggac acgctggcgg aatgcgaaaa ggacacctct 3000 acagttactg gatgcaccgt gactacgcat aacaccttct gcgaacgagc acggcgcgaa 3060 acctgtgcta gctcccttca tgctggaaac atcgccaact gccataccca acccagccac 3120 ctgaaagcga taacacctgt agatgacggc gtcgtgatta tcaacgaagc gacagcgcgc 3180 gtccgaacag atgatgtagc agaagtgact gtctcaggaa ccttcttgat aactttcgaa 3240 cggtctgcag ccattaatgg cacagaattt gtaaacttgc gcaaatcacc aagcaagcag 3300 cctggcaccg tgaggtcacc gctcctaaac atcatcggac atgacccgac attaagcata 3360 ccactcctgc ctcgtatgaa catcaacaac ttgcaatcaa ttcgagattt caaggaggat 3420 gtaactgcgg cgggctcacc caaattctgg ttcacggttg gtgcggtcct gaacgtcgga 3480 ctaattggct cattcgtcct tttcatggtg ctaaggagaa agcgagccac attgagggag 3540 cagagggctt tggacaactt taatatgacc gaggacggtc atcatcctga gggggggggg 3600 gggggtagtt aacaactaag catataatat gaactctctc tgtcatacat tgttatttct 3660 aagtacgcag caggtaatca taagtatttg gttgccctca gcaaacaatg ctgacgcgct 3720 gccaagaatt ctgcacacag caagtgaaaa cggtcacccg tgctgtttgc ggaagagtgt 3780 tgagtcggca tgccgactct ggggtatggc gcgatgcatt gacttcgtag ggtaagctta 3840 agcaaagaca tttctgtaaa gtcagtctca gttcagatcg cattctaata agacgaacga 3900 ttaaagctcc ggcactgccg taaataaaat tcagtgtgtt atcaattata gttgggaaag 3960 tcccacaaca taattatcat tgatcccgtt tcgctcgccg gatccatcac acaacataat 4020 catcattgat ccagtttcaa tcgccggatc caccgttacc tcatcaaagg aagaggccac 4080 tatcgccagc taattaccca catagcgtcg ctgatcatcg agataacggc ggtagactta 4140 attggcgccc aaccaataca taagaaccca taccatgtaa gttcataagt gatttcagtt 4200 tgaagaaatt gttcacgaaa aaaaaataaa gataaactgt aaagtaaagt aaatattgct 4260 gccttcttct accagcattt cattccttta aatcattgca tacttttatt aaaattgttg 4320 catactttta ccagcgttcc tggtcaataa aactttgcat actttcggcg gcattatatt 4380 ttctgctgca tacccttaac aatttttata ataaatattg cataccccaa gatacaccag 4440 tttctactat tgccaacagt tttacctgcc gatctttctc gatcggccta agtttcgaaa 4500 agtgccaacc ataaaccaat gagttggaat tatagggaga aaaaacctga atttaatagt 4560 tccgatagtg aggagggggg aatcaattgc tcaataaatc ctacccctgc agatatcgtt 4620 gtaactgaaa aaatggagct ggctcaagtt cagactctca tcaggcaagc cttagcggaa 4680 caagaaaggc agtttcaagc acaaataacc gctttagctt cacgggtgca gagtttgcag 4740 gtcgaagcgc cgcaaattgt cacatacgag aaaatatccg tcgatcctga cgccaaatgc 4800 gacattccct tagacattat taaatctgtg ccggatttct ccggtgtaca ggatgagtat 4860 gtagcgtgga ggcaagcagc catctacgca tacgagttat tcaagggctt tccaaagagc 4920 acggcgcatt accaagcggt agcaaagcgg t 4951 // ID Crack-2_CQ repbase; DNA; INV; 3918 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3918 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 33-33 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 76..837 FT /product="Crack-2_CQ_1p" FT /translation="MDSNLIAELPKILDAVKEESKSVKEIVASQQFLSNQF FT DKMVQMMGSLSDEIKHLRAENGLLKLSLNRLADNAKSISNVVEQAEKDIDS FT HHRAQLSINAIVLGIPRTPQEDTKSIVLEICDILGYKDGEKNIVSCCRVAN FT SKAACPPIRISFKHVRAKESLLDHKSSFGKLDVATLQGVRGPKGTTGKVVI FT RDDLSPLSMRIFQELKQLQNSLELRYIWPGRHGAIMVRRTDRSKAIPIQSR FT QDIQKLALSCRKH" FT CDS 882..3779 FT /product="Crack-2_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNLIQHAXNESVDDPSSLRLPPDSTVFKILQINIRGL FT NRMEKLDSLCVYLHNLNASIDVIVIGETWIKQGRSQYYNIPGYHSAFSCRK FT TSAGGLAIFVRNDLNFEARTNVADGGFHHISIKVDKSPSSVFVHGLYRPPS FT DDTSRLFSAIEEILANSDPASPCFIVGDVNVPVNQPHNVNVRRYNQLLNAF FT SATVSNTIITRPISSNLLDHVVCSIADSTRVSNYTMPCELSDHSYVLTVFK FT LKADHQRKVLTKTWVNYALVDAEFQRFLENLNQNSLPVDECLQSITGTYSQ FT LIQANSRTRTVEVKVKNDCCPWYNFDIWKLGNIRDSILKRWKRNRQDQHLS FT ELLARANYNLSEAKKRAKKQYHTKLFNNGNAKLLWGRINELMGGKSRTNSA FT PSLEVNQNIEHDPVTVANLFNDYFVSVGDNLASQLTSDGNINRFNTMKRSN FT ISIFLRPASVLEISNLIHELDPKKASGYDGFPVSALKKHRNLLSAIICNCF FT NQCVSLGTYPPCLKKAVVHPIYKNGNPRDPTNYRPISVLPVINKVFEKLIY FT ARLSNFLVATKRFYAHQFGFRKGSSTEIAVLEMTDEITKKLDQKMSAGTIF FT LDLSKAFDTINHEMLLRKLEVYGIQGLPNALLRSYLSDRLQQVTVSGVRGN FT SRFVRCGVPQGSVLGPLLFLLYVNDLPNLVLKGKPRLFADDTAISYYGSSP FT NQVIEFMREDMELVMNYLDNNLLSLNLGKTKLMHFRYPKRVLPPHPALTVR FT NQTIEEVTSYKYLGVHIDNRLCWDVQTREIVSKCASLCGILRKLSTTVPQH FT VLLKMYLAFIHSRYQYAVAVWGGCSKTHLSDLQVQQNRCLKAIFKLPFLHP FT TRELYLMRQHTVLPIRGIFMHRIAVIMYRTLNNVNLHHNFEFRTAIHQHYT FT RQAHHLRRSDFRTETGRRRFVVAGPTVYNSLPDEIKQARSLSEVKSKLTAY FT IRSNVNNYLLH" XX SQ Sequence 3918 BP; 1214 A; 955 C; 798 G; 949 T; 2 other; ctgtattcgc acactaccta ccggccgttt tgattcaggg acaactagcg ccatctgtgg 60 acgagcacca gtaccatgga ttcaaatttg attgcagagt tgccaaaaat cctggacgca 120 gttaaggagg agtcaaaaag tgtaaaggaa atcgtagcta gccaacaatt cctatccaac 180 caattcgaca aaatggttca aatgatgggt tcactgagtg atgaaattaa gcatttacgt 240 gcagagaacg gccttctaaa gctatcgctt aatagactcg ccgacaatgc aaagtctata 300 tcgaatgtag ttgagcaagc tgaaaaggac atcgacagtc accacagagc ccagctttct 360 atcaatgcaa tagtgttagg cattccgcgg acaccacaag aagacactaa atctatcgtg 420 cttgagatct gcgatatttt gggctacaag gatggagaaa aaaacattgt atcatgctgt 480 cgagttgcaa actcaaaagc tgcttgcccg ccgattcgta tctctttcaa acatgtaagg 540 gccaaggaaa gtctactcga ccataaatca agctttggga aactggatgt tgctactctc 600 caaggtgtgc gcgggccaaa aggaacaaca ggcaaggtag taattcgtga cgacctatca 660 ccactctcaa tgaggatttt ccaagagctc aagcagttac aaaactcact agagctgcgt 720 tacatctggc ctgggcgaca tggcgcaata atggtgaggc gtactgatcg atcaaaggca 780 atcccgatcc aatcacgcca agacatccag aagttggcgt tatcgtgtcg caaacactag 840 caagcattgc ttgtcagcga ccatctatct agaacacttc aatgaaccta atmcagcatg 900 ccasaaatga gagcgtagat gacccctcgt ctctacgcct acccccagat tctaccgtct 960 tcaaaatact ccagatcaac attagaggac tcaaccgaat ggaaaaactc gactccctgt 1020 gcgtttacct gcacaacctg aatgccagca tagacgtcat cgttatagga gaaacgtgga 1080 ttaaacaagg cagatcgcag tactacaaca tccccggcta ccacagcgct ttttcctgcc 1140 gcaaaacctc agctggtggt ttggcgatct tcgtacggaa cgatctcaac tttgaagcta 1200 ggactaacgt cgctgatgga ggatttcacc acatttccat aaaggttgac aaaagcccgt 1260 cctcggtatt cgtccacggt ctgtaccgcc cacctagtga cgacacatcg cgattgtttt 1320 cggcgataga ggaaattctt gctaattcgg atcctgcgtc cccatgcttc atagtcggtg 1380 atgtcaatgt acctgtaaac cagcctcata atgtgaacgt acgaagatac aaccagttgc 1440 tgaacgcctt cagcgcgaca gtgtccaaca cgatcatcac cagaccgatt agtagcaacc 1500 ttcttgatca cgtggtgtgc agtattgctg actcaactcg tgtttctaac tacaccatgc 1560 catgcgaact cagtgatcat agctacgtgt taactgtttt caagttgaaa gctgaccacc 1620 aacgtaaagt cttgactaaa acctgggtga actatgcttt ggtagatgct gagtttcaac 1680 ggtttctgga gaatctgaac cagaactcgc ttccagttga tgagtgtcta caatctataa 1740 ccggtaccta ctcccagctg atccaagcaa actccagaac aagaactgtt gaagttaagg 1800 tgaaaaacga ctgctgccca tggtataact ttgacatctg gaaactcggc aacatcagag 1860 acagcattct gaaaagatgg aaacgcaacc ggcaggacca acacttgagt gaacttctag 1920 cacgtgcaaa ctacaacctg tctgaagcaa aaaaaagagc gaagaaacag taccacacaa 1980 agctgttcaa caacgggaat gcgaagcttc tttggggacg tatcaacgag ctgatgggtg 2040 gtaagtcgag aacaaactct gccccttccc ttgaagtgaa ccagaacatc gaacatgatc 2100 ctgttactgt tgccaactta ttcaacgatt acttcgtgtc ggtgggagat aaccttgcca 2160 gccagctcac atcggatggc aacattaacc ggttcaacac catgaaacgc tcgaacatat 2220 ccatattcct ccgacccgca agcgtacttg aaatctccaa cctgattcac gaactcgacc 2280 cgaagaaagc ttccggttat gatggatttc cggtttccgc gctcaaaaaa catcgaaatc 2340 ttctctctgc catcatttgc aactgtttca accaatgtgt tagcctagga acatacccac 2400 cctgtcttaa aaaagcagta gtacacccaa tttacaaaaa cggaaacccg agggacccta 2460 ccaactatcg acccatctcc gtgttacctg tgatcaacaa agtctttgaa aaactgatct 2520 acgcacgtct gagcaacttc ctggttgcca caaaacggtt ctacgcccac caatttggtt 2580 tccggaaagg atcttcaacg gaaatagctg tactcgagat gacggatgaa atcaccaaga 2640 agctggacca gaaaatgtcc gcaggaacaa tattcctgga tctctccaag gcgtttgaca 2700 ccatcaacca tgagatgcta ttgaggaagt tagaagtgta cggcattcaa ggcttgccaa 2760 acgctctcct acgcagctat ttatcggacc ggcttcaaca agtgacagtc tcaggagtac 2820 gcggaaacag taggtttgta cgttgtgggg tgccccaagg aagtgtccta ggacctcttc 2880 tgtttctcct ttacgtaaac gacctgccaa atcttgtgct gaaagggaaa cctcgtctgt 2940 ttgccgacga taccgccata tcgtactatg gatcgtctcc aaaccaagtg attgaattca 3000 tgcgtgaaga catggaactt gtaatgaatt acctagataa caatctcctc tcgcttaacc 3060 tagggaaaac caagctaatg cattttaggt atcctaaacg agtcctaccg ccacatccag 3120 ccctaaccgt tcggaaccaa actatcgaag aagtgactag ttacaagtac ctaggcgtcc 3180 acatagacaa tcgtttatgt tgggacgtcc aaacaagaga aattgtgtct aaatgtgcgt 3240 ctctttgtgg tattttaagg aaactctcaa caacagtccc tcaacatgtc ctcttaaaaa 3300 tgtatctagc tttcatacac agtcgatacc aatacgctgt agcagtctgg ggaggatgca 3360 gcaaaactca ccttagtgac ttgcaagtcc aacagaatag atgtttaaaa gctattttca 3420 agttaccttt tctgcaccct acgcgtgaac tgtacctgat gcgccaacac actgtactac 3480 caattagagg gatctttatg caccgtattg cagtaattat gtacagaaca ctaaacaatg 3540 taaacttaca ccacaacttt gaattcagaa cagcaatcca tcagcactat acacgacaag 3600 cacaccacct cagaagatcg gattttagaa cggaaacagg aagaagacgc tttgtagtag 3660 caggtccaac cgtttacaat tccctcccag atgaaatcaa gcaagcacga tctctaagcg 3720 aagtcaaaag taagcttact gcttacatta gaagtaatgt taataattat cttttgcatt 3780 gaattaccag taatgtgtta ccagtaacta agattacaac cctttaaagg aacaactctg 3840 ttcattaggg aagtaattgc ggaacgaaat gaatgatgaa atatacgtta ttattattat 3900 taaaaaaaaa aaaaaaaa 3918 // ID Gypsy-26_CQ-I repbase; DNA; INV; 4330 BP. XX AC AAWU01010854; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_CQ_; KW Gypsy-26_CQ-LTR; Gypsy-26_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4330 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 431-431 (2011). XX DR GenBank; AAWU01010854; Positions 6528 2199. XX CC Positions [3374-3844] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 110..2776 FT /product="Gypsy-26_CQ-I_1p" FT /translation="MSDPDLKSLLEKLTEVCLQNQDLQKEQQAQQKALQDL FT VKTVTAKPIIAPPKLNANTNDNKEFLLESLANTMVEFHYDPDSDDGLFDVW FT YSRYADTFSQDACKLDDASKVRLLLRKLDTRAHSRYANFILPKKPKDNDFE FT ESVTKLKEIFSRRTSLFNKRCKCLQFIKPESNDFIAYAGAVNRLCEDFKLS FT SLKENEFKALIFVCGLRSHRDSEIRTRLLSKLETDAATLTVDTLATECQRL FT LNLKQDTALIENAAHSSTSVVNSLSDKYRGKSNQKRQDKPQQKQKSDARKT FT PKSPCWLCGGVLYTEACPFVKHVCKECKQIGHKDGYCAFARPSGKPQRSSV FT KEVSSDQTSARRRRKFVSILINGKPIKLQIDTASDITVISRDNWLALGSPA FT TRPTSHEATSASGGHVKLEAEFECSVTLQDSCRSTVCYVAPNSNLNLLGLD FT LIDAFDLWSVPLSSVCNRVTVADGVLDQVHQLQRKFPEVFLDTLGLCTKTK FT VKLFLKPNQQPVYVPRRPVAQTAYEPLQDELRRLENLGIISFVEHSDWAAP FT IVAVRKANGSIRICGDYSTGLNNALQPNDYPLPVPDDIFASLAGMRHFSII FT DLSNAYLQIEMDEDSKNLLTVNTHRGLFKFNRLAPGVRPATGAFQKIMDSM FT TAGLTGVRVYLDDILVGGATEEEHLHNLHAVLERIRDYGFHLQLSKCRFFM FT DEIKYLGHVVNSSGIRPDPAKLQAISTMPPPKNVPELRSYLGAVNYYAKFV FT SEMHRLRHPMDQLLKANAQWNWTKDCQRAFEEFKRLLSSDLLLTHFDPRHE FT IIVAADASNTGIGACLMHRLPTGAIKVVQHRSRSLTPEEKNYGQIEKEGLA FT LVFGVMKFHSMIYGRHFTLQTDHKSFSQSQLAQGHS" FT CDS 3092..4318 FT /product="Gypsy-26_CQ-I_2p" FT /translation="MIAEPEAKLFFCRRFALSVVEGCLLLGERVVVPEVYR FT KRILKQLHKAHPGIVRMKSLARCYVFWPRIDADIEDLVRACPKCAAAAKLP FT TKAPLQSWPTPNGTWERVHIDYAGPMNGVFFFVVVDAYSKWPEIFATPSSS FT TKATIKLLRQCIARFGRMDLLVSDNGPQFTSAEFRTFLKQNGIQHVTTAPY FT HPASNGQAERFVDTLKRALKKINEGEDLEETLQVFLQAYRSTANPQAPGGK FT SPAELMLSRRPKSTFDLLRAPSMPPKPTPANEKQNADYDRKHGAVERCFAP FT GDLIFAHVFRRNDFTWAPGQIIEKVGAVNYVIKLEGSNRPIKVHTNQIRRR FT RVANATSAEVQPGTLPLDILLDNFEIETQPLPSGDSPVLPGDDRPFPRRSE FT RDRRAPDRYTPGLWA" XX SQ Sequence 4330 BP; 1002 A; 1302 C; 1168 G; 858 T; 0 other; tttttggcga cgaggacgct gcaaatttcc ggtgtcctgt acaacccgtg gaatttgcgg 60 aaaagaaccc ggctctacgt tacccgccga acctaacctc aaccggacaa tgagcgaccc 120 ggatctgaag agcctcctgg agaagctgac cgaggtctgc ctccagaacc aggacctcca 180 gaaggagcag caagcccagc agaaggcgct tcaggacctc gtgaagaccg tcacggccaa 240 gcctattatc gcgccaccga agctcaacgc caacaccaac gacaacaagg agttcctgct 300 cgagtcgttg gcgaacacga tggtcgagtt tcattacgat ccggacagcg atgacggcct 360 gttcgacgtc tggtactcgc ggtacgcgga cactttctcg caggatgctt gcaagctcga 420 tgatgccagc aaggtgagac tgttactgcg gaagctggac actcgggcgc acagcagata 480 cgccaacttc atcctgccga aaaagccgaa ggataacgac ttcgaggagt ccgtgacgaa 540 gctgaaggag attttttccc gccgaacatc cctgttcaac aagcgctgca agtgcctgca 600 gttcatcaag ccggagtcga acgacttcat cgcctacgct ggcgcggtga atcgactctg 660 cgaggacttc aagctgagca gcctgaagga gaacgagttc aaggcgctga tctttgtttg 720 cggactacgc tcgcaccggg actccgaaat tcgcaccaga ctgctgtcca aactcgagac 780 ggatgctgct acacttacgg tcgacaccct cgcaaccgaa tgtcagcggc tcctcaacct 840 gaagcaggac acagcactga tcgagaacgc cgcacactcc tccacttctg ttgtcaactc 900 tctttctgac aagtatcgtg gcaagtcaaa ccagaaacgg caagacaaac cacaacaaaa 960 acagaagtca gacgctagga agacgccaaa atccccttgc tggctgtgtg gcggtgtact 1020 ctacacggag gcctgccctt ttgtaaaaca cgtctgcaaa gagtgtaaac aaatcgggca 1080 caaagacggc tactgcgcgt tcgcgaggcc gtccgggaaa ccgcagcgtt ccagcgtgaa 1140 ggaggtctcg agcgaccaga ccagcgcccg tcgcagaagg aagtttgtca gcatcctgat 1200 caacggtaag ccgatcaagc tccaaatcga cacggcctcg gacatcacgg tgatcagccg 1260 cgataattgg ttggctcttg gatctcccgc aactcgcccg acaagtcacg aagctactag 1320 cgcgtctgga ggccacgtca agctggaggc agagttcgag tgcagcgtga cgctgcagga 1380 ttcctgccgc tcaaccgtct gttacgtcgc acccaacagc aacctgaacc tgctcgggtt 1440 ggacctgatc gacgctttcg acttgtggtc tgtcccgcta tcttcggtgt gcaaccgggt 1500 gactgtcgcg gacggcgttc tggatcaagt tcaccagctg cagcgcaagt ttccggaggt 1560 ctttctggac actctgggcc tctgcacgaa aacgaaggtg aagctgtttc tgaagccgaa 1620 tcaacagccc gtgtacgtgc cgcgccggcc agtcgcgcag accgcctacg aaccgctgca 1680 ggacgagctg aggcgtttgg agaacctggg gatcatctcg ttcgtcgagc actccgactg 1740 ggccgcgccg attgtggctg tacggaaggc gaacggcagc atccggatct gtggagatta 1800 ctccaccggg ctcaacaacg cgctccagcc gaacgattac ccgctgccag tgccggacga 1860 catcttcgca tcccttgctg gaatgcgcca tttcagcata attgatctct ccaacgccta 1920 cttacagatc gaaatggacg aagattcgaa gaacctcctc acggtcaaca cgcaccgggg 1980 gctcttcaag ttcaaccgtt tggcaccggg cgtgcgtcca gccacgggcg ctttccagaa 2040 gatcatggac tccatgaccg ctggcctgac cggagttcgc gtctatttgg acgacattct 2100 ggttggcggg gccactgagg aagagcatct acacaacctg cacgcggtgc tcgagcgcat 2160 ccgcgactac ggtttccacc tccagctctc caagtgtcgt ttcttcatgg acgagatcaa 2220 gtacctgggc cacgtggtca acagttcagg tatccgcccc gatccggcca agctccaagc 2280 gatctccacg atgccgcctc cgaagaacgt tccagagctg cgctcgtacc tcggcgcagt 2340 gaactactac gcaaagttcg tttcggagat gcaccgtttg cgccatccga tggaccagct 2400 gctaaaggcc aacgcacagt ggaactggac caaggattgc caacgcgcgt ttgaagaatt 2460 caagcgcctc ctgtcgtcag atctcctgtt gacgcacttt gatcctcgcc acgagatcat 2520 cgttgctgcc gacgcgtcca acaccggaat cggtgcctgc ctgatgcatc gcctgccaac 2580 aggagccatc aaggtcgtgc agcatcgtag ccgttcgctc acccccgaag agaagaacta 2640 cggacagatc gagaaggaag gtctggcact cgtgttcggt gtcatgaagt ttcactccat 2700 gatctatggc agacacttta ctctgcagac cgatcacaag tccttctctc aatctcaact 2760 cgcacaaggg cattcatagc agcatcggcg aagaggatgc agcgttgggc catcatcctg 2820 taatcgttcg acttgcgtat ggagtacatc gctacggagc agttccctag gcggacgtgc 2880 tctcgcgtct catcagcaag cacagcagag aaaacgacga agacttcgtg atcgccgcaa 2940 tctcgttgaa tgacgaactc gaagttttta tcgactcgtc ggtcagcaac ttcccggtca 3000 cgcacgcctg atccggaagg ctacgaagga gtcgaagtcg ttgcaagcgg tcatcaagta 3060 ccaccacgaa ggctggcccg agtcttccac gatgatcgct gaacccgaag cgaaactttt 3120 cttctgccgc cgttttgctc tcagcgtggt cgaaggatgt cttctcctgg gtgagcgcgt 3180 cgtcgttccg gaagtgtacc gcaagcgcat tcttaagcag ctgcacaaag cgcatcccgg 3240 aattgtgcgc atgaagtcgc ttgcgcgctg ttacgtgttc tggcctcgaa tcgacgcgga 3300 catcgaagac cttgtgcgcg cttgcccaaa gtgcgccgct gcagccaagc ttccaaccaa 3360 ggcaccgctc cagtcctggc cgaccccgaa cggcacttgg gagcgggtcc acatcgatta 3420 cgctggcccg atgaatggag ttttcttctt cgtcgtagtg gatgcgtact caaagtggcc 3480 ggagattttc gcaaccccat cgtcttccac caaggcgact atcaagctgc tgcggcagtg 3540 catcgctcgc tttggtcgta tggaccttct cgtttcggac aacggtccgc agtttacgag 3600 cgctgagttc cgaacgttcc tgaagcagaa cggaattcag cacgtgacga ccgctccata 3660 ccatccggcg tccaacggtc aagctgagcg cttcgtcgac acgctgaagc gagctctcaa 3720 aaagatcaac gagggggagg atttggagga aactttgcaa gtttttcttc aagcgtaccg 3780 gtcgactgcg aacccgcaag cacccggtgg caaatctcct gcggagctga tgctgagtcg 3840 tcggcctaaa tcaacattcg atctactacg agctccctcc atgccgccga agcccactcc 3900 tgcgaacgag aagcagaacg ctgactacga ccggaaacat ggagctgtgg aaaggtgttt 3960 cgcgcccggt gacctgatct tcgcacacgt gttccgtcgc aacgatttca cgtgggcacc 4020 agggcaaata atcgagaagg tcggcgcggt gaactacgtc atcaagctcg aaggatccaa 4080 ccgtccaatc aaggtgcaca ccaaccagat acgacgccgt cgcgtcgcta acgcaacatc 4140 ggcggaagtt caacctggca cacttccgct ggacatcctg ctggacaact tcgagatcga 4200 gacgcagcca ctgccgagcg gagattcacc ggttctacct ggagacgatc gacctttccc 4260 gcgccgttct gaacgagacc gcagagctcc ggatcgctac accccgggac tctgggctta 4320 aagggggaga 4330 // ID Mariner1_DYa repbase; DNA; INV; 1299 BP. XX AC . XX DT 02-JAN-2009 (Rel. 14.02, Created) DT 02-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Mariner/Tc1-type sequence: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-1299 RA Jurka J.; RT "Mariner-type families from fruit fly."; RL Repbase Reports 9(2), 477-477 (2009). XX DR [1] (Consensus) XX CC Individual copies are 99% identical to consensus. XX FH Key Location/Qualifiers FT CDS 183..1232 FT /product="Mariner1_DYa_1p" FT /translation="MEKSEFRVLIKHYFLRKKSITETKERLDKYYGDSAPS FT ISMVKKWFTEFRCGRTSTSDAERSGRPKEVVMPEIVDKIHGMILDDRRMKV FT REVAEAVGISTERVHHILHEYLDMKKLSARWVPRLLTHDHKRNRVTISKEC FT LAMFNRNPNEFLRRFVTVDETWIHHTTPETKEQSRQWVSPGERAPKKAKVG FT LSANKVMATVFWDAQGIIHIDYLEKGKTITGEYYSELLDRFDIDLKQKRPH FT LAKKKVLFHQDNARVHTCVVSMAKFHKLGYELLPHPAYSPDLAPCDYFLFP FT NMKKWLGGKRFGSNEEVITETNDYFEGLEKTYYLEGIKKLEKRWTKCIELK FT GDYVEK*" XX SQ Sequence 1299 BP; 413 A; 255 C; 295 G; 336 T; 0 other; tacgagggtc gtttgataag tccgtgactt tttgtatttg ccgcgcacct gctctaaata 60 caacactgct cctgtcagca gttatcgatt aacagctgac tgtaaaattt tgagaaagct 120 gcgttttttg gtttgcgttt tataggcatt gaaagcagac gatctcgtga attttaacga 180 aaatggaaaa aagtgaattt cgtgtgctaa ttaagcatta ttttttgcgt aaaaaatcca 240 tcaccgaaac caaggaaaga cttgataaat attatgggga ctctgcacca tcaatttcaa 300 tggttaagaa atggtttact gagttccgtt gtggtcgtac cagtacaagt gatgccgaac 360 gttcaggtcg cccaaaagag gtcgtcatgc cagaaatcgt cgacaaaatc catggaatga 420 tattggatga tcggagaatg aaagtgcgtg aggtagctga ggctgtaggc atctcaactg 480 aacgggtaca tcacatttta catgaatatt tggacatgaa aaagctttcc gcgcgatggg 540 tgccgcgatt gctcacacac gaccataagc gcaaccgtgt gaccatttca aaggagtgtt 600 tggcgatgtt caaccgcaat ccaaacgaat ttttgcgccg tttcgttacc gtagacgaaa 660 catggatcca ccacaccaca ccagagacca aagaacaatc aagacagtgg gtttctccgg 720 gtgaacgtgc accaaagaag gccaaggtgg gtctgtcggc caacaaggtc atggccacag 780 ttttttggga tgcacaaggt atcattcaca tcgattacct tgaaaagggt aaaacgatca 840 ccggcgaata ttattcagag cttttggaca gattcgatat tgatttgaag cagaaacgac 900 cgcatttggc gaaaaaaaaa gtgctgttcc atcaggacaa tgcacgggtg cacacgtgtg 960 tagtcagcat ggcaaaattt cataaattgg gctacgaact gctaccccat ccagcatatt 1020 ctccagattt agccccctgt gactattttt tgtttccaaa catgaagaaa tggctcggcg 1080 gtaagagatt cgggtcaaat gaagaggtca tcacagaaac aaacgactat tttgagggcc 1140 ttgagaaaac ctattatttg gaaggaataa aaaaattgga aaaacgctgg actaaatgta 1200 tagagctaaa aggagattat gttgagaaat aaaacgcttc tttgacgaaa aaaatatatt 1260 ttattcaaaa agtcacggac ttatcaaacg accctcgta 1299 // ID BEL-16_CQ-I repbase; DNA; INV; 1788 BP. XX AC AAWU01032461; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_CQ_; KW BEL-16_CQ-LTR; BEL-16_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1788 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 185-185 (2011). XX DR Genome; AAWU01032461; Positions 1321 3108. XX CC 'ATATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 47..1788 FT /product="BEL-16_CQ-I_1p" FT /translation="MPDPIRPCHHVHVVNCFGRGPEARETTHVDAYLLRRH FT QSCVGVPNETEHLAWNLPKCSIAIGHSKPSSHSCTPQTAGQQKRPSATSIS FT KESTSFSVFQHPHSPQPTVQNHARPLLQHKLPGIGPGPRANKCTPSTSTKT FT SLKFRLKLADLKLKKIQQEQDLLKKKLDLERDVFNRKYALLGVSSDAQPSD FT HQVTTVDYSSPAQSGAIVSQTSLAKLNPADVPNPASSQRKWHWTPSTTCST FT ASKPPHTEVAKSKQRANFARATNDGQLEEENASFQLQQHYRSAGQQYLWTA FT TNHTDVYRAANAWPIVRSRIVRYQYQALTISCFPILTQRRPTTQILSRIPT FT TYHEKENKKDRQMFTFGSYYDTQVSLTNRPTQSLVSMECDNTEFNDLRNSV FT VQMTKLLRTNLRFFQTSYFNTLRVPINSAASCIYKQKAAVLRGAEQVRLHF FT QSSWNTLVPSSLTGNSSARGISLHFDPPKGPDLMFVSKVNVERGTYQDRIS FT CLQSVRAHTITTRTSTPQHTADLHIIVSTSYMKIAFRKGYSPAEWTPSPAS FT SLLDVVDHFHLEPSGYSVRWKVNRTDWFGVPGGT" XX SQ Sequence 1788 BP; 522 A; 520 C; 380 G; 366 T; 0 other; ttttaataaa ttcgtttgca cacgagcaaa cgcggccaat ccggtaatgc cggacccaat 60 ccggccatgt caccatgttc acgttgttaa ttgctttggc agagggccag aagccagaga 120 aacgacgcat gttgatgcct atctgctgag acgccaccaa agctgtgttg gagtaccaaa 180 tgagaccgag caccttgcct ggaatttgcc caaatgcagc atcgcgatcg ggcactccaa 240 gccaagctca cattcatgta ctcctcaaac agctggtcag cagaaacgtc cttctgctac 300 gagtatcagc aaagaatcaa ccagtttctc cgtttttcaa caccctcact cacctcagcc 360 cacggtgcaa aaccatgcca ggccactgtt gcagcacaag ttgcccggca tcggaccagg 420 tccacgagca aacaaatgta cgccttccac atcaacaaaa acctccctca agtttcgcct 480 caaacttgcc gatctcaagc ttaaaaaaat acaacaagaa caagacctgc tcaagaagaa 540 actggatctg gaacgtgacg tttttaatcg aaagtatgca ctgcttgggg tttcttccga 600 tgcccaacca tctgaccatc aagtgacaac cgtggattac tcaagcccag ctcaatcggg 660 cgcaatagtg agtcaaactt cactcgctaa actcaatcca gctgatgtgc caaatccagc 720 gtccagtcag cgcaaatggc actggacacc atctacaaca tgctcaacgg ccagcaaacc 780 cccgcacaca gaagtcgcca aatcaaaaca gcgtgccaac ttcgcacgag caaccaacga 840 tggacaactg gaagaggaaa atgccagctt ccagctccaa caacattacc gatccgcggg 900 tcaacaatat ctgtggacgg ccaccaatca caccgacgtc taccgtgccg ccaacgcttg 960 gccgatcgtt cggagtcgta tcgtccgcta ccaataccaa gctctgacga tcagctgttt 1020 tccgatctta acacaacgaa gacccacgac acaaatactg tctcgcatcc cgactactta 1080 ccacgaaaag gaaaacaaga aagatcggca aatgttcacg ttcggatcat actacgacac 1140 acaagtgagc ctaaccaacc ggccaaccca gtcgcttgtg agtatggaat gtgacaacac 1200 cgaattcaac gacctgcgaa actcggtcgt ccagatgact aaactcctca gaacaaacct 1260 gcgattcttc caaacatcgt acttcaacac cttgcgcgta ccgattaatt cagcagcgag 1320 ctgtatctac aagcagaagg ccgcagtact acggggtgct gaacaagttc gactacattt 1380 ccagtccagc tggaacactt tggtaccttc aagcctaacc gggaacagca gtgcacgagg 1440 aatatcactt cactttgacc cgccgaaggg acccgatttg atgtttgtgt cgaaagtaaa 1500 cgttgaacgg ggaacctacc aagatcggat cagttgttta cagagtgttc gcgcacacac 1560 aatcaccacg agaaccagta caccacaaca caccgcagat ctacacatca tcgtgagtac 1620 gagctatatg aagattgcgt tcaggaaagg ttactcacct gctgaatgga ctccttcacc 1680 cgcaagttca ttgcttgacg tcgttgacca ttttcacctg gagccgtctg gatactcggt 1740 gcggtggaag gtaaaccgca cagattggtt cggtgtacct ggggggac 1788 // ID P6_Cis repbase; DNA; INV; 2972 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE P DNA transposon from Ciona savignyi. XX KW P; DNA transposon; Transposable Element; P6_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-2972 RA Smit A.F.; RT "P6_Cis - P DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000337. Lacking long ORFs and probably non-autonomous; has CC short region (1593-1916) with encoded similarity to P CC transposase. 6 bp target site duplications. XX SQ Sequence 2972 BP; 1027 A; 490 C; 545 G; 909 T; 1 other; ctagtgttcc ttaatagaga gagttgcgtc attgaaaatt agaagaactg tttgtgtcca 60 aattgtctac aagcccagtg cgcagtttca gacaaatgtg tataaaatta atgcgtacgg 120 gagtttgtga ggttatttac taatatttgg actgttgtac tttgcttatt taattggaaa 180 ttgttctgtt gcaaatatat attacttaga ctgtgatttt attagcacta ggctattagg 240 cctacagttg caggacttga accaaagagt tttctcagca actgtggtga actcataaga 300 ttatttgagt tttgcaagtc tgtcagatgg tggggcgata agcctataac atatttatct 360 tggacctaat tgtttttgtt acttgctttg ctgctgctac aggatttagg gttgtgtaat 420 atggcctaga gaagtcgcat tttcttgctg gctatatgga atatcttacg gaacttgcaa 480 gaaaacttca aaaatcaaga caagcatatg cttttaactt tttagataat ctctccaaaa 540 gggatgaaat tatgcagttc aaaccccttg agatgataaa gtcgacaaac aataatgaaa 600 ttccaacaga atttcagtag tagacaatga aaagtggtgt ctcgaacata tcatgcaact 660 gattttacct accagttatt aaataaaaaa ctgcatcatg tttgtttatt cttatacatt 720 tatttgctat atacaagtct gacaaaggta aaacggttaa attgaataca gattttacat 780 catagcaata aaatgacagg gtgtagtgct tggggctgca caaaccacac tgaaaataag 840 aaaaaaatgt ttagattccc aaaagaaaaa ggaaagcgag ttttgtggaa agcaaaatta 900 aacagaaaaa attgggaagt tacagacaat tctcgtttat gtgaagttag tattttgtta 960 tacattttak tatagcttaa gtattcaaac tgtaagcatt ttgaatcata ggagagaaat 1020 tgtactccga ataccatgct aagaattcac taggttgttg gtaaaacagg acaggacaca 1080 acagaacagg acaatgggat attacaggac aggacattac agaacaggac atatcaatga 1140 catttagact ttttagtgta ggcctaaaca ttgctaaaaa attaaaatga aacaaagtac 1200 atacacattt tcttacagga ccattttcac tgcagtcagt tcgttatggg gaaaaaaatt 1260 aaagaagaca gcagttccaa cgctgtttgc tcataaaaaa atacagagtc ctcggaaatc 1320 acctaaaaaa agaataaatt tacttgctgc acagcagaaa agagaacaac tgcttgttca 1380 agagcattct tactgccaac tttcatcatc tgcagttata gagacgagtg aagccgaaga 1440 agaaactatt gaaagttggg atcccatttg ttacagttat ttattccaat aaaactttta 1500 ataatttcat catttaaata atttgttgtt acataaaagt ttgctaagtt acttttaaag 1560 ttaattttga ttcagtattc tagatacaat ggcatcttta cctactcacc tctaggctca 1620 cacaggattg ccttgaaaat ctattcagtt ctgttcgctt gaaaaatccc accccatctc 1680 ctcttgaatt taaatgtgca ttgaaaatga taagcgttgc ccaatttctg catacttctt 1740 ccgggggaaa ccatcaaaaa gatgacgctc aatacttggc agaattttta caacagccag 1800 cacaactggc agcagcatct gatgatatac cgttacctga agttgacatg caactatctt 1860 tggaaccatg caactttgac aagagtgatc tcaatagttt gtactatttg gctgggtagt 1920 gtttctagag taaaaaaaca tcaaataaca tgcgacaaat gtatggagga agttcttacc 1980 tctcaactat cgagtgtgca actgttgcat gatgattcta tacagctcac aaaattaaaa 2040 gattacactg gaaactctct cgttgaagtc tcggacaaag ttttcagcat gattaaaata 2100 tcagaatcaa ccatgagact aattggggaa gctaaattaa aagcaatgaa aaatgttaga 2160 aaagttattt taaaacaagt gtcggcagcc accgaatctt tccaatttcc tacttgccac 2220 agtgttaaga taaaaatctt aaagagattc gtccatgttc gcttacaaat tatttgcaaa 2280 caaatgagag aaaaacaaaa aaaagaattt aaagctagcc gaggagaatt gggcagcaaa 2340 accatggcta agaaaaagct cgtgcagtca ttgcagtagg gcgacttatt aaattacata 2400 atattgtgat ttgtgaaatg gatatctcta gttggcctat ttggtttgtt tttttccatg 2460 cttctagtgt ttgataatca atggctcctg actgatatag tctgtaaata aatttcaatt 2520 ttaacagtaa tttctggagt attcaattca atttacagta agggcggtat agtacgctat 2580 acttacaggc taccggtata tagcttaaat aaaatagaaa tagttaagga aaggaaggtt 2640 taggttcaaa gtttgctaaa tgcctggcgc cactaaatct gctattgaca caaacaaaag 2700 tcaatttgcc gcattggtac aacatgtcaa agtgaggtaa aatagcccta aatcccttaa 2760 atgtaacaaa actgaactaa ttatagtcta ctaggtaaat aaacttcctt tcacgaaagt 2820 atctagataa actaagcttt ggtgttcaaa ttgttgcatt aattcaatga tttcgcctat 2880 gcttgacaca ataatttgga cacatttttg cttccatgcg gttttggcgg gaaaacccga 2940 atgtcggaac tctgttcatt aagggacact ag 2972 // ID Gypsy-35_OD-LTR repbase; DNA; INV; 282 BP. XX AC CABV01003631; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_OD_; KW Gypsy-35_OD-I; Gypsy-35_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-282 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003631; Positions 13364 13645. XX SQ Sequence 282 BP; 79 A; 81 C; 52 G; 70 T; 0 other; tgttgggtgc tgacgttgcc aagcgccacg ccaacatggc gcggcgcccc acttcgaggc 60 tatacagtaa cccagcatgc agtacttacc catctataga gcctgcgtcg taagcgctta 120 ggccttttca agcaactgta taaaatctgt aactactttc cacaacggtc actctcattc 180 catcgtacag ttaacctttt ccttcctccg acaacatata ttggaataaa caatttgaca 240 ataccgagag tctttaacat gtgagcaaca aggacaccca ca 282 // ID Sola1-1_CapTel repbase; DNA; INV; 3201 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola1-1_CapTel. XX OS Capitella teleta OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3201 BP; 808 A; 792 C; 708 G; 843 T; 50 other; gtgaggctcc acgctcaaaa cctggatatg tcgcagccga gctaagcgag ctgcggccgt 60 tttttcggcc tgggggtcat ttttcaacat ttttgggtgt tcgcattttt ggacacttca 120 acatcttatt tatcatgtgt caaagtattt ctcctcaatt cagctccgaa atacaataca 180 atttgatttt aataaggcta ccctcaggca attatcaaga acaacgcccc taatttgcgc 240 taattgtcgg taattcatca tctcgcctca gccgccatgt ttaaattacc ctggcatatc 300 tgatatcatt gtgaccgtca ggtcgtgaac tttcaatctg gcgcagcttg gtctagtttg 360 cattcgatta caaccaatga gaagcgagca aaataggctg gtgtttgacc cagaaagagc 420 acgtgactgc tactatgctc cccgattggc cgacgccaag gcaacgaccc ggtgacctga 480 cctagttcgc ttcaacatgg acgccgagat cgctcgattg ctaaactcca gcgaagaaga 540 tcgccaaaaa gggcttgctc ttttggacga gtactacatt aatgacgacg acgacttgga 600 ttcggatacg gactacgatc ttatggaaga tggcactgtt ctggggccaa ctgccgacga 660 ggaagaggtc gctatgccgc cgccgtctgc tcactccgct cagccacact cacagacact 720 cgatcagctg ctggacggcc tggttccaga accgcaatgt gttgacaacc tggatcatct 780 tttccaggct gcgaaagacc acaggtgatt aaattgtctc ctattcattt ggacaatacc 840 tatccatgtt ttaatgcagt tgctcatgca agcatcgaaa tggccaacca tgctacacat 900 acttcacccc agaggagctg gccacctcaa gaatgaccat ggcctccctc ccattccgta 960 agttgattta attgatttgt tgcatagcaa tctccactgt cttgtatcag gtgaacaaaa 1020 tatgctgctc ctgggcatta tctctgccgg cattaacatg tcgcccatga cagcacgttc 1080 taagaaaaaa tcgcagacac agagaagacg tgctcggaca agatactttt accagggaaa 1140 gcagttgtgt gtcacaactt ttggattctt gtacaagtga gtacattgtt cccgatgatg 1200 caacaaatta ctgacttttg atttctcttc cagtgtttgc tatgatcgta ttcagcgcct 1260 ttcggctcat tatcagcaga atggcctctg ccctgttgag agccaaaaga gaggggcgca 1320 tggccacaag aaggcgctct catttgagga tatcagcagg gtcgtcaact tcttgaaaac 1380 gtacgcggag gaccatgctt tggtgctgcc tggtcgggtg ccaggatttt ggcgagacga 1440 tgttgtgctt ttgccatcat cacacactaa acnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 nnnnnnnnnn nnnnnnnnnn nngctcagca agatggagag tcacctgtcc cgagttcgag 1560 aagaaagagc attctacaaa gagctggtgg aggagtctaa gaagacctcc aaagacctgg 1620 ccatctctga gctggagcag agcgctccct gcagccgccc ggtctccatg cactattctt 1680 ttgattatgc acagcaagtt cacctgccta gtgatccctt gcagcctggc ccaatgtact 1740 ttctcacgcc aaggaagttg gggctgtttg gcgtttgctg cgaagggatc cctaagcagg 1800 ttaacttcct tgttgacgaa gcccattgca cctcgaaggg gtcgaacgcg gtgtgttcct 1860 ttctccactt cttcttggag aactatggtc ttggggagac tgatgtccac ctccactgcg 1920 acaattgttc aggacagaat aaaaataatt ttgtccttta ctacctcgcg tggagagtgg 1980 ccatgggtct tcacaagacc ataggtcttc attttatggt ggcaggacat accaagttcg 2040 cccctgattg gtgctttggg cttttgaaac aagcatttag aagatccaaa gttgattcgc 2100 tagagtgcat ggagtcggtg gtgaacggga gtgctgcctg caacatcgcc caacctattg 2160 gatgggaaga tggaagggtc gttgtcccag tctacgactg gaatgcccat ctatctccac 2220 atgggcaaca catgaagggc atcaaagaaa accaccattt caagtaagga tcaggaaatt 2280 ggggaaagat tctgtgattt aacaggagat ttgttttagg ttcgactctg atcatcccgg 2340 cgttgttcgg tattcgatca agcctgatgg tccctggcag cagtggaggc tgttcaaaaa 2400 tctagaatct ctcccctctg ctccacctgt gccatgccaa ccgccaggac ttcccaagaa 2460 gcgcagagaa tatctctacc accagatccg gcctttcgtc tctcccgcgt tccaagacgt 2520 caccacccct catccggaca tcatcgcaga gaactttgct gcttacactg aaagtcatca 2580 tcctccatct cctcctcctc cacctcctcc tgcaactgct gcaccaacca agacttccaa 2640 gcgacgtaaa tcttccgact cggctaccgc aaggaagccc cgtgcagcca aagctcaaaa 2700 gaaaaaaaaa taaagtgctt cagtctaatt caaatgatgt cttcctctcc ctcatatgat 2760 ttgtatatct tgttttttac tgttagtatc attttggggg ggttgtatag tgttgacatc 2820 cattttcaga cttgcgatat agtgtacccg aatgtaccca tgctaatgaa atcctgaagt 2880 ctttgcctca cttgcatgct tactatagtt gggggactga tagctcatac tgccatgagt 2940 ctgtgttgaa aaaattatga ggatagcatg attacaagcc aagatatgga caaatgaatg 3000 atgccaaact ttgaacgcga ttttctcgtt tcagagtttt tcagcatggc aacgctaaaa 3060 tggtcataac tttgtgacca gttaacattt tttaataatt ttttcaccaa aatgctccac 3120 agaacctgcc cgatctacca gccaaataac tcaaaaaagt agattttgac caatatccag 3180 gttttgagcg tggagcctca c 3201 // ID Hoana5 repbase; DNA; INV; 2373 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana5 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoana5. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-2373 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 601..1497 FT /product="Hoana5_1p" FT /translation="MHPDRLKVGKGQNDNATALDVFVGRSTFYSNDSARKR FT KLDEMVMNMIAVDVQPFSCVEDEGLINLLREMDPRYKLPSRTHLRDTVLPN FT QYDRLKIVLISVLDKIDFLALTTDLWTSRANEGYITITCHFVWTGAKLISA FT VLATRQLVTSTNHTAPNIAETISSVLTEWAIASKVVCVVTDNDSTMKKACE FT LLKYKHLPCVAHTINLLVQDALRFPVVESILSKCKSIVAYFKRSTVAYEKF FT KEAQGVDQPVGLLQEMPTRWNSAYEMIKRIIKTNEHITVVLVTSRNAPPPL FT TPDEVDV" XX SQ Sequence 2373 BP; 734 A; 489 C; 504 G; 646 T; 0 other; cagaggtgga gaaacatcga tgttaacatc gatgtcatcg atgttttgaa aacatcgttg 60 acatcgatgt ttggcaagcc aacatcgatg tttatcatct ctaacgagca tcaagcaggg 120 aagacgtgaa aattgtaagt agatttaaat taagttattt ctgataaatt actaatgtta 180 tgttgcaaac tagtgttctg ttggtggtgt tggctgaggt agtggagtgg aggcatggag 240 agatatctga aacgtaagta ttattaacat aaaagctgag gaaactttgc atttatatgt 300 gcatacatat gtatgcacgt atgctctgct tgcacgtgcg cgcatggcat gtgtgtgtgt 360 gcatatgtac atttgtactt ttacatggta tttgtattaa aatagtactt tttaggaaca 420 tcaaaatcgg atgaaaatgt ggaggagcca gcaccaggtc caaaaaagaa gcgggaagct 480 tcggttgcct gggatcactt caaaaaagac gttagaaaga cacacggagt ttgcaattat 540 tgtggcaaag ctataaaaac aaacggtaat actacaaacc ttctcgacca tctaaggcac 600 atgcacccag atcgccttaa agttggaaaa ggccaaaatg acaatgcaac tgctttggat 660 gtgttcgtgg gaaggtcaac gttttacagt aacgattctg ccagaaaacg gaaactcgac 720 gaaatggtga tgaacatgat agctgtagac gttcaaccat tcagctgtgt tgaggatgaa 780 ggtctcatta atcttttaag agaaatggac ccgcgctaca agctgccaag ccggacccac 840 ttgcgtgaca cagttcttcc aaatcaatat gaccgtttaa agatcgtcct aatttctgtt 900 ttggacaaga tcgactttct ggccctaacg actgatctgt ggacatcgag ggcaaacgag 960 gggtacataa ccataacgtg tcattttgtg tggaccggcg caaaactgat ctcggcagta 1020 ttggcaacta ggcagcttgt aacatccaca aaccacacgg ctcccaacat agccgaaact 1080 atatcatctg tgctgacaga gtgggctatt gcaagtaagg tggtttgcgt agtaacggac 1140 aacgactcga cgatgaagaa agcgtgtgaa ctgcttaaat acaaacactt gccctgcgtg 1200 gcacatacaa ttaatttact tgtgcaggat gctctacgtt ttccagttgt tgagagtata 1260 ttaagcaaat gcaagtcgat cgtcgcctat tttaaacgaa gcacagtagc gtatgaaaaa 1320 ttcaaggaag cgcagggagt agaccagccc gttggactgt tgcaagagat gcctactcga 1380 tggaatagtg cttacgaaat gatcaagcgg atcattaaaa caaatgagca catcactgtc 1440 gtgctggtca catctcggaa tgcacctcca ccgcttacgc cagacgaagt agatgtctga 1500 catgacttgt gcgaacttct gtccccgttt gatgatgcca ctctatcagt ttcaaccaac 1560 accaccgtat cagtttcaat tatcatccca gttatatgcg agcttttcca caaaatttct 1620 ggtctggatt gtaaactcaa aacaactgaa ggaactattg ctttgggata tatcaagact 1680 tgcttgcgtg agagattatt gccatatgag ggtagaacca ttccacgtat agccaccatt 1740 cttgaccctc gttttaaaaa acagggtttt ctcagccaaa ccaactcaga agaagcggct 1800 aaagctttgc aggacgagct aaactccacc cttgtgtcca ttcctcgaca gcaaccagca 1860 ccaccaacag aagaacccac acgattttct ttcatgaaat caaaattgga tgcaaaagtt 1920 caatcattcc gggccgatgc tattgtcctg ttgcgacaac atgtggaaaa agagaaccag 1980 cccgaaaagt gcgagccact tgcgtattgg gaggtaagaa tgttttaatt aagacctttt 2040 gttcttaaaa tataacaatt tatcttattt cagatgtcta ccgaagaagc ttttaaaatc 2100 ttgtctaaaa agtatttttg cgttcccgct tcatcgtgtg aatcagaacg tgtattcagc 2160 aaagcaggac aactgatatc cgaccgtcgc accagacttt catcttcggt cgtcgacaaa 2220 cttttgtttt taaacaaaaa caaacatata aataaaattt gactgtgata aaaaaaattt 2280 tttttctttt aaacatcgat gtttcatcga tgttttgccc caaaaacatc gaaaacatcg 2340 atgtccgcga acatcgatgt ttctccacct cta 2373 // ID Gypsy-34_CQ-LTR repbase; DNA; INV; 245 BP. XX AC AAWU01012010; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_CQ_; KW Gypsy-34_CQ-I; Gypsy-34_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 448-448 (2011). XX DR GenBank; AAWU01012010; Positions 8625 8381. XX SQ Sequence 245 BP; 61 A; 61 C; 47 G; 76 T; 0 other; tggcggagac gatcgtcatc gtaaaccctg ctggcccagt caacgatgat cgacccgtgt 60 gccgcgatca acaaaagatc ggcacaacaa ttgtttcgat ttacccacat ctcgtattca 120 cacatacgaa cgatttcttt tggatttgtg ctttgcaccc tttgtaatta gaattagttt 180 ttaatacatt tttggactac tatcagatca cacgtctttt tgttctgcgg atggccaact 240 cgcca 245 // ID ORTE-6_AAe repbase; DNA; INV; 5968 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-LTR retrotransposon family encoding cysteine protease from DE Aedes aegypti. XX KW Non-LTR Retrotransposon; Transposable Element; RTE_Ele6; ORTE; KW ORTE-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5968 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5968 RA Kojima K.K. and Jurka J.; RT "A lineage of non-LTR retrotransposons encoding an OTU cysteine RT protease from the yellow fever mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as RTE_Ele6. CC [2] Consensus update. This consensus is generated from 24 CC sequences with >92% identity, and ~95% identical to the original CC sequence in [1]. This family encodes OTU superfamily cysteine CC protease upstream of apurinic-like endonuclease. It is positioned CC at the sister lineage of the lineage including RTE and RTEX in CC RTclass1. Renamed. XX FH Key Location/Qualifiers FT CDS 247..4797 FT /product="ORTE-6_AAe_1p" FT /note="OTU cysteine protease, endonuclease and FT reverse transcriptase." FT /translation="MIKVGGGEVLNENVNELKQIIFXYIKLNWENFVYELI FT LSXRIXISKEETLMYEQYEWSIKGGKQVFGYEVLWAVAEIFECTVNIFFEA FT GGKIIYGSKYRENHISLDFVINNEGKIENIIDTDEHMRFVYTSNKQGQNVR FT SEVEREIGKLKEXQGSRAYTFEIATRDEILEVKQIGHDDVSLMRALEVQLN FT AEQDGXSEKVREELDELFRGSKIQGEDFNQVENILARLGVVASNLNKEIII FT YREKKXEIVLGVTERKSKMKECKLLLTEREGKIHIDIISKCSEVTRIDWNN FT VVEDLALDTXVRKPREEIYQFEHPQKEGKMEVKIIQGDGNCMLRSLLDQLG FT RRYGRYLLSXHNVYKLRKFMVDYIQQAREQFEGFMFDYKQPEEDFEIGISN FT MRKNKTWLGHEALLALTEILKIAIIVYGKDGAINTIGENYSSNGSLQIFYT FT GVDVQNHYDSIITKNDEIAGNTRHTQRFHNENDKSYDLNDVAVKIDQMAEM FT REQVNPNHGTDNSEKEQASNRRKTENSHLPTSQVTNANNTNNTDVKGHNTR FT KVSNKFIRIASWNVKGCRSKEKREEIDETLTAYNVDIAALQEVNTLGDEVN FT TKNYVWKVADNHVNKSRGLAIVIRNNTNITIQNMKQVRQGILWCKIKAKDN FT VFIIINIHAPNVKQAAFLSKLNEITANEKDRKHLLVVGDFNAQLGINDMNQ FT EDEKWIGKMIGHDKCNDNGETFKIFLHTAKLKNVSSKIGMGTEITWRSGSR FT CSQIDHILSPTMTDFKIKYIRGNWTHLNTDHKLITIGIKFEKIVRNTNERN FT EPTKINLELLKDNKIKEEYQEQLKLNKSQKEIEENTIEECYKELSSRMKRA FT ANKTIRSSTIPSTPKRRKAFDRLKDAIRISKKHPDMINYKYKLKNRRQEFT FT QAVKDHKEKEIKHFFKNLNDFDVGQRIRKTHNYLKRYVKQKQKRIVNISTR FT VWDQMLRESEGPTIEFCDTHDYTPMIQPPTSEEIMDIIRNSSNGKAAGPDL FT IFTEYIKNADKETQDYLVKVIQKAYIENIYPSDWLKSTQIPIPKKVKATEA FT EHFRRISLCSVVYKVYAKWIANQITKYAGQPDLYQAAFTNNRSTDDHIFVT FT RRVMEENWNAGKDLYILALDIRKAFDTIKVHNIKDILLGLNVPTTLVDRVI FT ACMKDEMTRVLWNGQFSEEIKRGKGIKQGCPLSPILFNYIMQDVIRKVASK FT IPELKLMDVNSLIIPLLLAFADDLLIIADSKEQLEKIVKELVKQLEEVGLE FT LNYDKCQLLLRFPNYKGQMPNEIMLNGRPYKVCDSIKYLGVCLTATLDRKA FT TNRQRCVNAYRTGRSVIEFCKSFKPSWELGKLIYKTVLAPAITYGTKVAVL FT TKKSRIGMANYEKLILRSIYNHCRRPQNIRFNARKLLDGKTINRRVRVGRI FT SYYGHILRREKNHPIRLAYRLNFKKKKEGRPSFTWKDSLEKDLNRYSEVDR FT EEWKQLAEDRDKLKKKAEEIYKETFSEISEGESSEEEGRSKKYKHWKRKMK FT R" XX SQ Sequence 5968 BP; 2408 A; 822 C; 1265 G; 1462 T; 11 other; tgattccctt gtgaaaatat tacatagatt tgctgattct ataatttcct tggtggtgta 60 tataaggtaa gtcatattgc ttttgaacat aacattatta taaaaawaaa aaaaaacaat 120 gaatcgacaa gtaccaatac agaagtggta awtcgggaga tatgtgtgtg gccaaaaggt 180 ccaatactaa gaattcaatc gtttgtggaa gaaattgatt tgaccaaaac tgtattggaa 240 caagtgatga tcaaagtggg gggtggagaa gttctgaatg agaacgtaaa cgaattaaag 300 caaatcatat ttgawtatat caagttaaac tgggaaaatt ttgtgtatga gttgattcta 360 tcgaakagga ttaawatctc aaaagaggaa acattaatgt atgaacaata cgagtggtca 420 ataaaaggtg gtaaacaggt ttttggttat gaagtattat gggcagtggc agaaattttc 480 gagtgtaccg taaacatttt tttcgaagca ggtggaaaaa ttatctacgg ttcaaaatac 540 agggagaatc atatttcgtt ggattttgta atcaataatg aagggaaaat tgaaaacatt 600 atcgatacag atgaacatat gagatttgta tacaccagta ataaacaggg acagaatgta 660 agaagcgaag tagagcgaga gatcggaaaa ttaaaagaga wgcagggaag tagagcatat 720 acatttgaaa ttgcaacaag agatgaaata ttggaagtaa agcagattgg gcatgatgat 780 gtgagtctta tgagagcttt agaagtacaa ctcaacgcag agcaggatgg ggwaagtgag 840 aaggtaagag aagagctaga tgagttattt cgaggaagca agatacaggg agaggatttc 900 aaccaagtag aaaatatact tgctagatta ggggtagttg cttctaacct caataaggag 960 atcatcatat atagggagaa gaaaawggaa attgttttgg gagtcactga aaggaaatca 1020 aaaatgaaag aatgcaagct attacttacg gaaagagaag gtaaaataca tatagacatc 1080 attagtaagt gctcggaagt gactagaata gactggaata atgtagttga ggatttagca 1140 ttagatacaw acgtaagaaa accaagagaa gaaatatacc aatttgaaca tccgcagaaa 1200 gaaggaaaaa tggaagtgaa gataatccaa ggggatggaa attgtatgct aagatcgtta 1260 ttagatcaat taggaaggag gtacggcaga tatctattat cggwtcataa tgtttataaa 1320 ctacggaagt tcatggtaga ctacatacag caggctaggg aacagtttga gggatttatg 1380 tttgattaca aacaaccaga ggaagacttt gagataggga tatcaaatat gagaaaaaat 1440 aaaacatggc tgggacacga agcactatta gcattgactg aaatattgaa aatagcaata 1500 atagtttatg gaaaggatgg agcgataaac actatcggag aaaattattc atctaacggt 1560 agcctacaga tcttttacac aggagtagat gtacagaatc attatgatag cataataact 1620 aaaaatgatg aaatagccgg aaataccaga catacacaaa gatttcataa tgaaaatgat 1680 aaaagctacg atttgaatga tgtagcagta aagatcgatc agatggcaga aatgagggag 1740 caggtcaatc ctaatcatgg aacagacaac tcagagaagg aacaagcttc gaaccgaaga 1800 aaaactgaaa acagtcatct accaacaagt caagttacta atgcaaataa tactaataat 1860 acagacgtta aaggacataa tacaaggaaa gtaagcaata agtttattag gattgctagt 1920 tggaatgtca aaggttgcag gagtaaggaa aaacgggaag aaattgatga aacactcaca 1980 gcttacaatg tagacatagc agcgttacag gaagtaaaca cgctaggcga tgaagtcaac 2040 acaaaaaatt atgtatggaa agttgcagac aatcacgtaa acaagtcaag aggattggca 2100 atagtaatca gaaacaatac aaacattaca attcaaaata tgaaacaggt tagacaagga 2160 atactgtggt gcaaaatcaa agcaaaggat aacgttttca ttattattaa tattcatgca 2220 cctaatgtga agcaagcagc gtttttgagt aagctaaatg aaataacagc aaacgaaaaa 2280 gaccgtaaac atcttttagt agtaggtgat ttcaacgctc agttagggat caatgacatg 2340 aaccaagaag acgaaaaatg gataggtaaa atgataggac atgataaatg caatgataac 2400 ggagaaacgt tcaaaatatt tcttcacacg gcaaaactaa aaaatgtgtc atcaaaaatt 2460 ggcatgggca ctgaaatcac atggagaagt ggcagtagat gtagtcagat agatcatatt 2520 ctaagtccaa cgatgaccga tttcaaaata aaatacatca ggggaaattg gacacacttg 2580 aatactgatc acaaattaat aacgatagga ataaaatttg agaagatagt gagaaataca 2640 aatgaaagaa atgagccaac taaaataaac ctagagctac ttaaggataa taaaataaaa 2700 gaggagtatc aagagcaatt aaaactcaac aaaagtcaaa aagaaattga ggaaaataca 2760 attgaggaat gttacaaaga actatcgagc agaatgaaaa gagcagctaa caaaacaata 2820 agatcgtcca cgattccgtc tacccctaaa agaagaaaag catttgatag attaaaagat 2880 gcaataagaa tttctaaaaa acacccagat atgataaact ataaatataa gttaaaaaac 2940 aggagacaag agttcactca agcagtaaag gaccataaag aaaaagaaat taaacatttc 3000 tttaaaaacc taaatgattt tgatgtagga caacgaatca ggaaaacaca taattatttg 3060 aaaagatacg taaaacaaaa acaaaagaga atagttaata taagtacgag ggtatgggat 3120 caaatgctta gagagagtga ggggccaaca atagagttct gtgatacgca tgattacaca 3180 ccaatgatac aacctccaac atccgaagaa attatggata ttatcagaaa ttcaagcaat 3240 ggaaaagcag cagggccaga tcttattttt acggaatata ttaaaaatgc agacaaggaa 3300 acccaggatt accttgtgaa agttatacaa aaagcataca tagaaaacat ttatccgagt 3360 gattggttaa aatcaactca aataccaatt ccaaaaaaag taaaggcaac agaggcagaa 3420 cattttagaa gaatctcttt atgtagcgta gtttataaag tatatgcaaa gtggatagct 3480 aatcaaataa ctaaatatgc agggcaacca gatttatatc aagcagcatt cacaaacaat 3540 cgatctacag atgaccacat atttgtaact agaagggtaa tggaggaaaa ttggaatgcc 3600 ggaaaagatt tatatatatt agcactagat attagaaaag cttttgacac aattaaggta 3660 cacaatatta aagatatact gctaggtttg aatgtgccaa cgacgttagt agatagagta 3720 atagcttgta tgaaagacga aatgacaaga gtgttatgga atggtcaatt ttctgaggag 3780 ataaaaagag gcaagggcat aaagcaagga tgtccattgt ctccaatttt attcaattat 3840 attatgcagg atgttataag aaaagtagcc tcaaaaattc cagaattgaa attgatggat 3900 gtgaatagcc tcataatacc cttgctttta gcctttgccg atgacttgct aattattgca 3960 gatagtaagg aacaattgga aaaaatagtt aaagaattag tgaaacagct agaagaggta 4020 ggactggaat taaactatga caagtgtcaa ctattattaa ggtttccgaa ttacaaagga 4080 caaatgccta acgaaattat gttaaacggt aggccttata aagtgtgtga tagcataaaa 4140 tatttaggag tgtgtctaac ggcaacttta gatcgaaaag caacaaatag gcagagatgc 4200 gtaaacgcat atagaacagg tagatcagtt atagagtttt gtaagagttt taaaccctcg 4260 tgggaattag ggaagttaat ttataaaaca gttttagccc ctgcaataac ttacggaaca 4320 aaagtcgcgg tgttgacgaa aaaaagcaga ataggaatgg caaactatga aaaacttatt 4380 cttaggagta tatataatca ttgtaggaga ccacaaaaca taagatttaa tgcgcgaaaa 4440 ctgttagatg gaaaaactat aaatagaaga gtcagagtag gaagaattag ttactacgga 4500 catatactta gaagagagaa aaatcacccc atcagattag catatagact aaacttcaaa 4560 aagaaaaaag aaggaagacc aagtttcacg tggaaagact ccttagaaaa agatctgaac 4620 agatatagcg aggtagatag agaagaatgg aaacaattgg ctgaagatag agataaactg 4680 aagaaaaaag cagaagaaat atataaagaa actttcagtg aaatttcaga aggcgaatca 4740 tcagaagaag aaggtagaag taaaaagtac aagcattgga agagaaaaat gaagagatga 4800 agaaattagt gatgagatta gatagataat tgaaaggtaa aaatgagtag gaaagaaaca 4860 tacaattcga tagtactgat tcaaactaac gacctgtgta ccacctttaa taataaatag 4920 ctgtaacggg acacgtaatc gaataaaatg tttaggttaa ggtgaagatg agtcgaagcc 4980 ggggttcaga tgttcaaggg cacgggtctg gagagccggg catccgtttg agctggggac 5040 ctgatcgatt ggtcaccacc agctggtgac caatcgatta ggttttcagc ttgaacgggt 5100 gcttggttca ccagatccgt gctcttgagg atttgaggtt tggctttgat tcatcttcac 5160 cttaattaaa gcaaatttat tagtagcagt aattgcagtg gtaaatttca atataatcaa 5220 tgtagccatg gagggtcatg ttcagcttag ttaatgtcag ataaatgtta cccacaaaca 5280 aaagcaatat gacttacaag taaaacgatt agaaaaaaaa aataataata ataaaacggc 5340 caaaataacg tcggttttac gcaaaaatat actattaatt gataawcacg gcaacacgtc 5400 aatgaactac aaaggataaa tagggattaa atgtatagag atacattaat aggaatagct 5460 actgtagaac acaataaagg gtgaatactg ataaaaaaaa tcgtacttaa atacacacaa 5520 ataaaggtaa gatgaattgt aaccgattgt atattacata aacatactac tataaataac 5580 ggggtacggg agggaccaac agggcacccg attgaagcga tctggacagg gagcctggga 5640 ctgcctcctc cctgttgtta acatttcgtg gattgagggc gggctcttcg accccatcta 5700 ttgggagtat tctgtcaatc gagggcttga ttttacagtt gttcgtgaaa actttcttct 5760 ggaaggagat tcttttcccc tgccgcgggt ttcgtgcaag tgtctatgct tgtgggtccg 5820 cacatccatt tttggccctt catacagaag actgactgct cacaaacaaa agtacaatct 5880 tgatcaaatc gcttttcaga ttattccagt gctataggca gatgccttgc tacaatagat 5940 ggtagaacca tatcatcatc atcatcat 5968 // ID Gypsy-612_AA-I repbase; DNA; INV; 5365 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-612_AA_; KW Gypsy-612_AA-LTR; Ty3_gypsy_Ele47; Gypsy-612_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5365 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4302-4769] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2673..5252 FT /product="Gypsy-612_AA-I_1p" FT /translation="MLRNDVIERVEGPAEWISPMVVVPKGKDDVRICINMK FT HPNEAIQREHYPLPVIDTFLNKLRGSKFYSRLDITSAYHHVELHPDSRCVT FT TFMSGIGLMRFKRLMFGINCAPEIFQRIMTEMLDGIEGVIVYIDDIVVSGR FT TKEEHDMRLKQVLEVLERNNAMLNMSKCVFGVEKLEILGFEVSASGISPSE FT EKIIAIKNFRPPSSKEEVRSFLGLLNFVGQFIPNLSTRTEPLRKFLRGEVD FT TFGKEQMSSFNDLRLELCNNVRQLGFFDCNDVTEVYVDASPVGLGAVLVQR FT DSKTKVARIVCFASKGLTTAERVYPQTQREALAVVWAVEKFYLYLFGIRFT FT IFTDHKTLQYIYGGKYRDGKRACSRAEGWALRLQPYDFEIKYIPGSTNISD FT VLSRLVSEPGKPFDEHAEHFLFAIGEGPVAITLEEIRHETVKEETLTAVMK FT ALDTKRWSPELFRYQAFEKELGVIDGILVRNDRIVLPAKLRPRALRIAHRG FT HPGVVAMRRNIREKIWWPCMDRDVADAVQECAGCAAVSKQHPPEPMVRKEM FT PDRAWQEIGIDFFSAKECATFLVIVDYYSRFLKVIEMRTTNAVKTIEALET FT VFREQTFPETIRSDNGPPFASEEFAKYCESKNIRLVRTIPYWPQMNGLVER FT QNQGILRALRIAKVTNCDWRKAVEDYVYMYNTTPHSVTEKAPLELMTGRPV FT KDLLPSLRTDPHWQRDVEVREKDAMRKMQGKHYADGRRQAKHSEIEVGDSV FT MLRNFETGKLEPKFRLEEFTVVKKNGSNTIVANKEGIMYRRSVTHLRKYPS FT SSKKNAAESGEQDNSNESPTVHVPHDHSMKRKRGDDDLEPKALKRSVREKK FT MPTRYLQ" XX SQ Sequence 5365 BP; 1589 A; 1054 C; 1361 G; 1357 T; 4 other; gttttggcgc agccgattgg atcatggtga gtagttttgg aacagaaagt gatttttttt 60 ctcccggaag tgtgtggtga tgtatggcgc aaacaaaaag aattaaggga aattgaagtg 120 atttcggaaa gatatttttc atcgtggata aaactgctta tagtgttggc ttccggaagt 180 tcggcgtggc gtggttaaat tttcgactgc aaaggagtgt ttcaggagtg aggttggaat 240 ttcggctkta atagtgtggt ttggaagtga ggttagaatt ccggctgtta aaagcgtgag 300 gcaatttttt cggcataatg tgattagatt wtcgaatgat tgaaaagctt tccaaatatc 360 attgtgcatt gtgagtagat tttatgatct caaaatttat ggaagtgtgt gaaagtgaac 420 agcgagtccg gagaattccg tagcgctaga gtgaaaatag tgaagcgagt ccggagaaat 480 ccgtagcgct gttatcaaaa gcgagtccag ataaatctgt agcgctgttt ttaaaagcga 540 gcccgggtaa acccgtagcg ctgttatcaa aagcgagtcc agagaaatct gtagcgctgt 600 tcccataagc gagcccggat aaatccgtag cgctgtcctt gaaaagcgag tctagagaaa 660 tgtgaagcat ttttcccaaa gcgagccggg caaatccttg tcgcaggtat tcgaaaacgt 720 gtccagagag attttacggt gtgaggaatt ctcaactcag cttgtgttta gaaagttatt 780 gcgccagcgt cacawtgagc tcctaatttg ctttattgca aagcaggaaa gtggttatag 840 cgagacaatg catattttga ggttaaattg tgaaatcgtc aattttcaat attttctcct 900 acagagtaac aattgaagaa gctaaaagag gcatataata attccttttg tctctgaatt 960 tgtatttcag atgaacgaca atggtacatt tggtacactg ttgggggctt tcaacgataa 1020 agttgatgct caagacttac gtcgtgaatg ggaagagtgg catcgtgcct ttgaactttt 1080 tcttcagatg cgaaacatcg aagcccaaca tgagaagttg gtcaccatgc taaatcgact 1140 ggagacttaa actaaataac tatgctgaag aggactgcag cgaattttct acaatttgcg 1200 tcctgtggca gaggagatag ttccagagcc cgttaaggtg cctttgatgc cgatcgaagt 1260 cccggagtat gacaacgcgg tgaaacggct agagaaactc ttcattggca agcggaacga 1320 acgagtggaa ttggaagttt tccggtcgct caaacaatca agtgaggaat ccttcaacaa 1380 tttcatcttg agactccggg cccaggctgc cagatgtgag ttttcggaac gcgaggagac 1440 agggctgctc cagcaaatca caatgggggc ttgtgacgag aaagtgcgag acaaaggttt 1500 ggagaatgtc atgagtttgg atgaagtgat cacctatgcc acgaaccgcg aaatcctgtc 1560 aaaacaaagg gataagcaga aacctttcag aactgagacg gagttgaaca gcgtgggctc 1620 gtatcgttcc ggaactcaaa acttgagcga gaagcgtaac acaggaagca atcagcgatt 1680 tgagagacaa cggatgggta gacgcgtaag atcaggtgag agtcgtttcc ggagcgaatg 1740 taaccgatgc ggttcgctcc gacacgcagc gtgactcacg agattgtttc gctcgaggtg 1800 cgacttgtaa caattgcgga cagcggggac actatgcaag gaagtgtgac aatcgccgca 1860 tgcaacgtgg aagtcgctta tcgaaacggg aaccgagacg caacgacttc awtgaagcta 1920 actcggtgaa cacatctgaa tcctggggaa aggatgattc tcatcgctca actgccgagg 1980 gcatcgctaa ggtggagtaa ctttaattat tttgttcttt tttgtaatat cttcggcgaa 2040 tcatattgaa ttaaatgagt tgaaataaaa gatatatgat ttacacatgc ttttccatat 2100 taggtggata cgattaccct gaagaatgac tccgtattgt gcaatattga caatgtgccg 2160 gtgaattttg taattgattc cgggtcctct atcaacgccg ttacggaaga cgtgtggaat 2220 gagcttctcg ctaagaaagc caaaatattc aagaaaaagt acaaatgcga ccgaaaattt 2280 tatgcgtatg caaatcgaga tccattgaat gttttggctc tttttgaagc tcgtatttcc 2340 gtcaatccta cgaaacctga aagctatgcg gaattcttcg tgatcgatgg tgcacggaag 2400 tcccttctaa gtaagcgtac gtcagaagag ctgaagcttt tgaaaatcgg actagatgtt 2460 ctgcatctca acgaagacga cggtgtaaac agcaagccat ttccaaagtt tccgggggtc 2520 caagtgaagt tatccattga ccacgcagta cctcccaaga agatcgcata cctgagagtg 2580 ccttctgcta tggaacaaaa ggtaatataa tatcaatcaa aatgccaggt tcaacagtta 2640 tatgtttata ggttcatgat aaaatacagg aaatgctccg taatgatgtc atcgaaaggg 2700 tagaagggcc cgctgaatgg atatcaccaa tggtggtagt ccctaaagga aaagacgacg 2760 taagaatatg catcaatatg aagcatccaa atgaagctat ccaacgggag cattacccgt 2820 tgcctgtcat cgacaccttt ctgaataagc ttagaggatc taagttctat tcacgattgg 2880 acataacgtc cgcataccat catgtcgaac ttcaccccga ctcaagatgc gtcacaacct 2940 tcatgtccgg cataggcttg atgcgtttta aacgccttat gttcggcatt aactgcgcgc 3000 ccgaaatttt tcaacggatc atgacggaaa tgcttgacgg cattgagggc gtcatcgtct 3060 atattgacga tatcgtcgta tcgggaagga ccaaagaaga acacgacatg cgactgaaac 3120 aggttttaga ggtgttagaa aggaataatg ccatgctgaa catgagtaaa tgtgtctttg 3180 gagtcgaaaa actagaaata ctaggctttg aggtgagtgc gtccggtata agtccatcgg 3240 aggagaaaat aatagcaatc aaaaacttcc gaccaccatc atcaaaggaa gaagtccgta 3300 gtttcctggg attgctgaat tttgttggtc agtttatccc aaatctctct acgaggacag 3360 aacccctgag aaagtttctt cggggagaag tagatacatt cggtaaggag caaatgagtt 3420 cgtttaacga cttgcggttg gaactgtgta acaacgttcg gcagttagga ttttttgact 3480 gcaatgacgt taccgaagtg tatgtggacg cttctcctgt cggtctcggg gctgttcttg 3540 tccaacgaga cagcaaaact aaggtagcac ggattgtatg cttcgcctca aagggcttga 3600 caacggccga aagagtgtat ccacagacgc agcgtgaagc gttagctgtt gtatgggccg 3660 tggaaaaatt ttacctgtac ctcttcggca tccggttcac gattttcaca gatcacaaaa 3720 cccttcagta tatatatggc ggaaagtacc gtgacggaaa acgtgcgtgt tcaagggcag 3780 aaggctgggc gctccgctta cagccgtatg actttgaaat taagtacatc ccgggatcga 3840 cgaatatctc ggatgtgctt tctcgtctgg tgagcgagcc cggtaaaccg tttgatgagc 3900 atgccgaaca tttccttttc gccataggag aaggacctgt ggcgatcaca ttagaagaaa 3960 tcagacatga aactgttaag gaagaaacgc tcacagcagt aatgaaggca ctggacacca 4020 agcgttggtc gccggaactc ttcaggtatc aagcatttga gaaagaactt ggagtaatcg 4080 acggtattct ggtgcgaaat gaccgcattg tccttccggc gaaacttcga ccaagagcac 4140 taagaattgc acatcgtgga catcctgggg ttgttgccat gcgaagaaat atcagagaga 4200 aaatttggtg gccgtgcatg gaccgggacg tggcagatgc cgtgcaagaa tgtgctggtt 4260 gtgccgctgt cagcaagcag catcctcccg agccaatggt gcgtaaagaa atgccggacc 4320 gtgcatggca ggaaattggg atcgactttt tctcggccaa agaatgcgca acgtttttag 4380 taatcgtaga ttactacagc cgtttcttga aagtgattga aatgagaacc acaaatgctg 4440 ttaaaactat tgaagcactg gaaacggtgt tccgtgaaca aacttttccc gaaaccattc 4500 gtagtgacaa cggcccaccg tttgcgagcg aggagttcgc aaagtattgt gaaagcaaaa 4560 acatccgatt ggttcgcacc ataccctatt ggccgcaaat gaacggccta gtagaaaggc 4620 aaaatcaggg gatcctccgg gccctacgca ttgcgaaagt taccaattgc gattggcgga 4680 aggcagttga agattacgtg tacatgtaca atactacacc acattcggta accgaaaaag 4740 cgccactgga acttatgacc ggtcgtccag tcaaggatct tttgccctcc ttaagaacgg 4800 atcctcactg gcagcgagat gtggaggtgc gagaaaaaga tgcgatgaga aagatgcaag 4860 gaaaacacta tgcggatggg cgcagacagg ccaaacactc cgaaatcgaa gtgggtgatt 4920 ctgtgatgct gcgtaatttc gaaactggta agcttgaacc aaagtttagg cttgaggaat 4980 tcacagttgt gaagaaaaat ggaagtaaca caatcgttgc taataaagaa gggatcatgt 5040 accgtcgatc tgttacgcat ttgcgaaaat acccatcttc ttctaagaag aatgctgctg 5100 aatctggaga acaagacaat tcaaacgaga gtccgactgt tcatgtgccg catgatcatt 5160 caatgaagcg gaagcgaggc gatgacgatc tggagcctaa ggcactgaaa cgttcagtac 5220 gagagaagaa gatgccgacc agatatttgc agtgattttt ttttcctacc gatgtaatta 5280 aaaaaaaaca gtctaataaa attatcattc ctcgtataac tttgattttt gttggagaag 5340 gagagagatg tagagtctta gcctt 5365 // ID Gypsy10-LTR_Dya repbase; DNA; INV; 778 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10_Dya; KW Gypsy10-I_Dya; Gypsy10-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-778 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1079-1079 (2009). XX DR Genome; chrU; Positions 1149968 1150745. XX SQ Sequence 778 BP; 233 A; 162 C; 189 G; 194 T; 0 other; tgtagtggcg actgaagagc cgctaaataa attttagatc aaccttaaaa gatataaaca 60 ttttctgcct agaagtatgc acttaattat gtgatactct gagcctggct ttacgcaacg 120 ggaataacgt cgatgcattt tagtgaaagc tttacgtagt tcaagctgag ctttaatgca 180 catgcatttc taattagcta gctgcatttc tgccagaaca gagttttgga cacaaatacc 240 agatagctca tggatgcgcc agggaagatc tggaaagctt tagtctgaat taggaaattt 300 ttatgctgtg gcagaggaaa aaaagtgcta agcaactcca tccgtttccg tgctaaagtc 360 tattcctttt gagatccggt agagtgcaaa tcgttgcagt ggccgataaa atatatatat 420 aaatcacata gtgccagtgt ccacgcggat tgtgaaaatc tggtagcgga aaaatttatt 480 cggcgaaagc ttaaccaaac tcaagtgagt aaagccgttt tcatttgcat acagatggat 540 aaccttttgt tgcccaggtc acggaaagag tgtccacgga ggcaaccagg cggcggactg 600 acagccagtg ccctaggccc gtctggaaaa cccatttatt tcgtcgggac tcaaccagcg 660 gatctagccc agcaagccga gtttgcgtcc tgtcccgagg agcaccctgg gaggggcagg 720 atggactact aggagagcag cagcagaagc agtagaaaat gatcggcaaa taacctca 778 // ID Gypsy-1_CQ-I repbase; DNA; INV; 4450 BP. XX AC AAWU01023480; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CQ_; KW Gypsy-1_CQ-LTR; Gypsy-1_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4450 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 381-381 (2011). XX DR Genome; AAWU01023480; Positions 7294 11743. XX CC Positions [3404-3862] - Integrase core CC 'ATGCA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..1448 FT /product="Gypsy-1_CQ-I_2p" FT /translation="MPPKVTPKKDGPQQQPDLTAILAQLTAYLENQSKTQE FT ALLEQLSKPKDSEFLMEALSNTIPEFVYDPQGGLVFDKWYSRHEEAFNKGG FT EKLEEADKVRLLVRKLSSVDHDRYVNYILPENPPDRSFQDTVDTLKEMFGH FT QTSVFFRRYQCLQTTKRNAEDFVTYASTVNRACEDFNIRTITSDQFKCLVF FT VAGLQSERYKDIRTRLLAKMEGETAEHPMTLKKLLLECQHLDNLKHDTAVI FT EGPKPAVKAVQQDRSGFRESGKPERKFDNRNPAKATPRRPCWQCGQMHFVA FT DCSYTKHTCKTCGKVGHKEGYCACCSKPSGSGEGASHTAPGGGKKKHKKKR FT ADGGGHSNGVYAVNKEGVVRRRKFVEVLINNEPIELQLDCGSDYTIISTDS FT LPLLGNPETIPTELRVATASGKPLPLELEFECDVTFRGTTKRSVCYVTPVR FT GSRFLGAT" FT CDS 1451..4438 FT /product="Gypsy-1_CQ-I_1p" FT /translation="MEAFGLYDIPINDFCKQLTTAEDSLACGAALKAKFPA FT VFQEGLRRCTKTKIQLFLIDGAKPVFKPKRPVPFHSQRLVEKELNRLQDMG FT VLEPVDYSDWAAPIVAVRKAQRDADGDPVVCICADYSTGLNAVLEANKYPL FT PTPDDIFAKLAGSKFFSVIDLSDAYLQMEVEEESQKLLTMNTHKGLFKVKR FT LPPGVKPAPGAFQKVVDNMLAGQEGAASFLDDVLVFGRTRAEHDRNLEQTL FT QRIQDFGFRLKIEKCKFYMTEVKYLGHIINQNGIRTDPGKVSAISQMPPPK FT NLTELRSFLGAVNYYAKFIKEMHQLRRPLDLLLKKDAKWAWSDDCQRSFER FT FKELLKSDLMLTHYNPTLDIIVAADASQTGIGATIRHRFPDGSEKIIQHAS FT RSLTPAEQAYGQIDKEALALVYAVTKFHRMLLGRRFVLETDHQPLLRIFGS FT HKGIPTHTSNRLQRLGLVLLCYNFGIDYVSTANFGYADVLSRLIDSAAKPE FT EEYVIAALYLEDDMSAVLEDSVGNTPVTSKMIAAATARDKVLKKVVQYLEG FT EWPASASEIDNLDVKAFYHRKESLALTQGCILFGQRVVVPETYRKRILQRL FT HHGHPGMVRMKSLARSFVYWPHIDKQVEDTVKQCADCAAAAKSPPHSPPEA FT WPTPAGPWQRIHIDYAGPIDGLYFLIIVDAFSRWPEIYPTTTTTARATIQF FT LRRTFARLGLPMVLVSDNAAQFTCDEFESYCKANGITHILTAPYHPQSNGQ FT AERFVDTVKRGMKKINKGEPLQETLDVFLATYRSTPSGTTEQKSPSELLFG FT RQMRTTLDLLRPPPTPVAEMKTTAENDQRREFAPGDLVYAKVHKRNDWYWE FT AGKVIERLGLVNYNVWLDGQRSGLIRSHINQLRPRHESADSPARGRAVNLP FT LDILLGEFGFAQTTTEDVTQAAVVEPKGANQPEITEQPGNNDGPEPAVIQR FT PIRRASTGSIPLSCVPVPKTTNTRSGREVRLPPRFDHYVMS" XX SQ Sequence 4450 BP; 1044 A; 1276 C; 1319 G; 811 T; 0 other; ttgtttggcg acgaggagta gtttcggttc gaagtagttt cgcgtcggtt ttcgggtttt 60 cgcgttcgag cggggttcct aacctcaaaa tgcccccgaa ggtgacgccg aaaaaggacg 120 ggccgcagca gcagccggat ctcacggcga ttctggcgca gctcaccgcc tacttggaga 180 accagtccaa gacacaggag gctctccttg agcagctctc caagccgaag gacagcgagt 240 tcctgatgga agcactttcc aacaccatcc cggaatttgt gtacgacccc caaggcggac 300 tggttttcga caagtggtac tcccggcacg aggaggcctt caacaagggc ggggagaagc 360 tggaggaagc ggacaaagtg cggctgctcg tgcgaaagct gtcctcggtg gatcacgacc 420 gctacgtgaa ctacatccta ccggagaacc caccggacag atcgttccag gacacggtgg 480 acacgcttaa ggagatgttc gggcaccaaa cgtccgtgtt cttccggcgc taccagtgtc 540 tacagacgac caagcggaat gcggaggatt tcgtcacgta cgccagtacc gtcaaccgag 600 cgtgcgagga tttcaacatc cggacgatta cctccgacca gttcaagtgc ctggtgtttg 660 tcgctggtct gcagtcggag cggtacaagg acatccggac ccggttgctg gcgaaaatgg 720 aaggcgagac ggcggagcac ccgatgactc tcaagaagct gttattggag tgccagcatt 780 tggacaacct caaacacgac actgctgtta ttgagggtcc gaagccggcc gtcaaagcgg 840 tccagcagga tcgcagcggc tttcgcgagt cgggcaaacc ggaacgaaag ttcgacaaca 900 ggaatccggc caaggcaacc ccacgacgtc cgtgctggca gtgcgggcag atgcactttg 960 ttgcagactg ctcgtacacg aaacacacct gcaagacgtg cggcaaagtg ggacacaagg 1020 aaggatactg tgcttgctgc tccaaaccct ccggttccgg agaaggtgca tcacacacag 1080 cacccggagg cggcaagaag aagcacaaaa agaagcgggc cgacggcggg ggacactcaa 1140 acggagtgta cgctgtcaac aaggaaggtg tcgtcagacg gcgcaagttc gtggaagttc 1200 tgatcaacaa cgaaccgatc gaactccagc tcgactgcgg atcggactac accatcatct 1260 cgacggattc acttcccctg ctcggcaacc cagaaaccat tccgacggaa ctgcgggtgg 1320 ccacagcctc aggcaaaccc cttccgctgg aattggagtt cgagtgtgac gtcacgttcc 1380 gcgggaccac caagcgttcc gtgtgctacg tcactcctgt gcgcggttca aggttcttgg 1440 gagcgacctg atggaagctt tcgggctgta cgacattccg atcaacgact tctgcaagca 1500 actcacaact gcggaggact cgttggcgtg tggtgccgcg ctgaaggcga aatttccggc 1560 ggtgttccag gaaggactca gacggtgcac gaaaacgaag atacagctgt tcctgatcga 1620 cggggcgaag ccggtcttca agcctaagcg gccagtgccg ttccactcgc agcggctggt 1680 cgagaaggag ctcaaccgat tgcaggacat gggagtgctg gagccggtcg actactcgga 1740 ctgggcggca cccatcgtgg cagttcggaa ggcgcagcgc gacgcagacg gagatccggt 1800 ggtctgcatt tgcgcggatt actcgaccgg gttgaacgct gtgctggagg cgaacaagta 1860 ccctctccct acccccgacg acatcttcgc aaagctggct ggcagcaagt tcttcagcgt 1920 gattgacctg tcggacgctt atctgcagat ggaggtagag gaggagtcgc agaagctgtt 1980 gactatgaac acacacaagg ggctgttcaa ggtcaagcgt ctgccgccgg gagtgaaacc 2040 cgctcctgga gccttccaga aggtcgttga caacatgctc gccggccagg aaggagcggc 2100 gtcgttcctc gacgacgtgc tggtgttcgg tcgaacgcgc gctgagcacg accggaactt 2160 ggagcaaaca ctgcagcgga tccaagattt cggtttccgg ctcaagatcg agaagtgcaa 2220 gttctacatg acggaggtga agtacctggg ccacatcatc aaccaaaacg gcatccgcac 2280 cgacccgggg aaagtgtctg ccatctcgca gatgccaccg ccgaagaacc tcaccgagct 2340 gcgttcgttc ttgggcgcgg tcaactacta cgcgaagttc atcaaggaga tgcaccagct 2400 gcgtaggcca ctcgatctgc tgctgaagaa ggacgcaaag tgggcctggt cggacgactg 2460 ccaacgatcg ttcgagcgct tcaaggagct gctcaagtct gacctgatgc tgacccacta 2520 caacccgacg ctggacatta tcgtggccgc tgacgcttcg caaaccggaa tcggcgccac 2580 aatccgacat cgtttccctg acggttcgga gaagataatc cagcacgcct caagatcgtt 2640 gacaccggcg gagcaagcgt acgggcagat cgacaaggaa gctctggcgc tcgtctacgc 2700 tgtcacgaag tttcaccgga tgctgctggg acgacggttc gttctggaga cagaccacca 2760 gcctctgctc cgcatctttg ggtcgcacaa gggaatcccg acacacacgt ccaaccgact 2820 gcaacgactg ggactcgttc tgctgtgcta caactttggc atcgactacg tctccacggc 2880 gaacttcggg tacgcggatg tactctccag actcatcgat tcggcggcca agccggagga 2940 ggagtacgtt atcgccgctc tgtatctgga ggacgacatg tcggcggtct tggaagactc 3000 cgttggcaac acacctgtca cttccaagat gatcgccgct gctaccgccc gggacaaggt 3060 gctcaagaaa gtggtccagt acttggaagg tgagtggccg gccagcgcaa gcgaaatcga 3120 caacctggac gtgaaggcgt tctaccaccg gaaggagagc ctagcgctga ctcagggctg 3180 cattcttttc ggccaacgag tggtggtgcc ggaaacctac cgaaagcgga tcctgcaacg 3240 cctgcaccac ggtcacccgg gcatggtccg gatgaagagt ctggctcgga gtttcgtcta 3300 ctggccgcac atcgacaagc aggtggagga cacggtgaag cagtgcgcgg actgtgctgc 3360 agctgcaaaa tctcccccac actctccacc agaagcttgg ccaacaccag cgggaccatg 3420 gcaacgcatt cacattgatt acgcgggccc catcgatggt ctctactttc tcatcatcgt 3480 ggacgctttc tcgcgatggc ctgagatcta tccgacgacg accacgaccg ctagagccac 3540 gatccagttt cttcgaagaa cgttcgcacg ccttggccta ccgatggtgc tcgtgtcgga 3600 caacgcggct cagttcactt gcgacgagtt tgagtcctac tgcaaggcga acggaatcac 3660 ccacatcctc accgcgccct accatccgca atcgaacggg caagcggaac gctttgtgga 3720 caccgtcaag cgaggaatga agaaaatcaa caagggagag ccactgcagg agacactcga 3780 cgtgtttctg gcaacgtaca ggtcgacgcc gagcggaaca accgagcaga agtcaccaag 3840 tgaactgctg tttggacgtc aaatgcgcac cacactcgac ctactgcgac caccaccaac 3900 accggttgct gagatgaaga cgactgctga gaacgaccag agacgagaat ttgcacctgg 3960 cgacttggtg tatgcgaagg tgcacaagcg gaacgactgg tactgggagg caggaaaggt 4020 catcgaaagg ctcggcttgg tgaactacaa cgtgtggttg gacggccaac gttcggggct 4080 gatccggtcg catatcaacc agcttcgtcc acggcacgag tcagcggatt cgccggcgcg 4140 cggtcgcgct gttaaccttc cgcttgacat tctcctgggc gagtttgggt tcgctcaaac 4200 aacaaccgag gacgtcacgc aagcagctgt ggtggaacct aaaggcgcga accaaccaga 4260 aatcaccgag caacctggca acaacgatgg accggaacca gctgtgatcc aacggccgat 4320 ccggcgggct tcgacgggaa gtattccgct gagttgcgtt cccgtgccga agacgacgaa 4380 caccagaagt ggacgcgagg ttcggctgcc gccgaggttc gatcactacg tgatgtctta 4440 aagagggaga 4450 // ID Gypsy-85_CQ-LTR repbase; DNA; INV; 226 BP. XX AC AAWU01006083; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-85_CQ_; KW Gypsy-85_CQ-I; Gypsy-85_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 550-550 (2011). XX DR GenBank; AAWU01006083; Positions 26553 26328. XX SQ Sequence 226 BP; 68 A; 36 C; 84 G; 38 T; 0 other; tgtagggttc agcgcgggga gtgtactgct ttctggcagc actgccatga ggagaggaag 60 aatagggaga ggaagaagag gagcgactgg ggggaacggc gaggggagtc aggagagagc 120 agttagcagc gacggacgcg aagttaacaa gcgggaaata aacgttgtga tcaaaggctg 180 attaaaggta ttaattctga agagtagtcc gaacgggtcc ctcaca 226 // ID BEL-46_CQ-LTR repbase; DNA; INV; 369 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-46_CQ_; KW BEL-46_CQ-I; BEL-46_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-369 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 246-246 (2011). XX DR [2] (Consensus) XX SQ Sequence 369 BP; 107 A; 86 C; 99 G; 77 T; 0 other; tgtccgccat cgcgtccgtt tggtaggctc ttcaagcgtg aaatgaaaaa ccacgacaga 60 tggcagcact ggttgctgcg atcggcaaac cccccgccgc cgctgtcaaa cgtcagcggc 120 gaagaagcaa cggcgacctg atcgactgtt tggcgcgaag aaaaagcgcg cgcaaagaac 180 gacagaaaga ggaagtacaa gaattttcat tccgttcgca agttttgagt tgtccagacg 240 tgtaaataaa aatcggtcga atagaagttt ttagagaaaa cgtgtgtttt tattccgtcc 300 cagaacgtgg tatacagtcc cgcgaaagct taagtggaat cgacttgttg aggacaagcg 360 gccgcaaca 369 // ID BEL-2-I_NVi repbase; DNA; INV; 6485 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6485 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 744-744 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 689..1867 FT /product="BEL-2-I_NVi_1p" FT /translation="MADPAGELAKQKRTSGFILNFKRNTAKLAATARTLPH FT MRTRLSLLDAYWSEYXQRDLXLQANXSAFKGEAYFEXDEYLEVEXXXVEXK FT ADXXQXIEELEIAARPPVTAAGAQNSQVVHAPPVPGPSEFAXMPKCHIPKF FT SGKAEEWETFKEQFSSXVXNKANLPDAIKMQYLVDSVEGPAALRVKGLPLT FT GASFELAWQKLMRRYDNPXRRMHTYMELLIDMKPVKRKSSGELIELLDKAE FT AALKTFKDVGCPCEHWDSWLIHMVERKLDNETREGWRISQESVVGFSTFSA FT LTNFLETRASSLDQDADGDAESPTGKQXSQKGNGSNRSSRRESVSANATST FT SFAKGNRNASVKCSLCRGPHQLQACPXFNGMSXSARFEXCKKRDCALIV*" FT CDS join(2128..4950,4914..6071) FT /product="BEL-2-I_NVi_2p" FT /translation="SVCRGLFVTQRVVXIVRADTTRRFTRIDDKEERLRDS FT SNSTGQKGSSSAPGKSSMLEVSASTTVVGRTRLMATAKVLLQSVDGNSMYA FT RCLIDPCAEVSFVTQRVVQCLSAKIKPVSVSVVGVGAGPSAVSKGEVFLQL FT KSRLNSEFCLEFSALVLREVTGFLPREEVVVSNWDHIRGLQLADPDFARSK FT RIDCVLSAEVYAAIIRPGLRVGATDTPVAQETVFGWILTGRASSAPESDRA FT QSAQACHVEVEPSISTLLERFWEMEEISTSKVLSEEDQYCEDYFADTVYRD FT SKGRFVVRLPFSEKLSEVSFAHTRQIAVACLLRSEKRRAQDSRVDSAYRKF FT MEEYLTLGHMELVNSSHRKDNTFYFTHHPVYSAEVGPEGKFRVVFNGSFGK FT ISLNDVLLTGRKLQSDVTIIISMWRFSKIVFTADIVKEFRQILIHPDDVDW FT QRIVWRSDVSKPIEDYRLLTVTYGTRSAPYLSIRTLLQLASEGSLEFPLAA FT QVLRHNMYVDDAFVGADNESEAIEIRNQLIKLLSSAGMELGKWASNCSAIL FT EDIRAEKQKEFAVEWDEAISALGLKWTPSGDSFRFEVKVPVAPKIVSKRSI FT LSEISKLFDPLGWLSPVLVRAKLMLQDLWINGVDWDSPLTGELLEQWQMLR FT SDLPDLAQLRIPRWFGSSSQTSWELHGFSDASQRAFVAAAYMVIPGQKSAL FT IMSKTKVAPIKTESLPRLELCGSVLLVRLLKHLLDGLLLKPVSVHCWTDSK FT VVLDWLKGHPSRWQTFVANRVSEVITTLPGVQWRHVKSEDNPADCATRGFS FT SEQLQQSSLWWEGPEWIRNDWRKVIDRLPVDVSLDVVNAQVAVEEMEAARV FT GLIRYVQSQHYEEELRCLRSKQRLSSRSHLLRLVPFICPQGLLRVGGRLQH FT SFLQYDEKHSIILPASSIIVKRLIEEYIVRLFMEVFSGVHRQTLHGGVQLM FT LSSLNRTYWISRGLRVVQGVYRRCHRCIRYAAQSVQQQMAPLPSYRVTPQR FT VFAYTGLDYAGPFPILFSKGRGAKSTKGYIAIFVCMVVRAIHIEVVSDLST FT AAFLAAFRRFTARRGLCRMVFSDNGTNFKGAATEIDKLFQRASSVSQEVAA FT ALAKDGIVWSFIPPRAPHFGGLWESAVRSFKHHFKRVIGDTPLTFEEMSTI FT AAQIEACLNSRPLCPLSSEPTDSVALTPGHFLVNAPLNVLPEPFDDVNLKS FT RTCRWKLLSVMRNHFWERWRLEVLHHLQQRNKWLTPGRQLVVGDLVLLKDE FT LCPPSKWPLGRITAVHPGKDGLVRVVTIRTANSEFQRPVVKVIFLPSDESA FT QRAVSSLQPESSQD*" XX SQ Sequence 6485 BP; 1524 A; 1321 C; 1751 G; 1834 T; 55 other; ttggtagcag agcgtgcagc attgtggtga agaatcgtga aatcgtgtgt ggaaagaagt 60 gtccttgcgt tgtgcaagca gtgttagagt tacttcggca gccgttcgac tattcaaagt 120 cgcaacggtc agagcgtcca gatacttcct ggagttcgct ctctcagttt gccgcttggg 180 ctaactgctt gttgttgtcg tgtagttttt agatgtttag acggttagaa gttttttttt 240 gtttagaggt ttagattgtt ttagattgtt ttagaagatt agaggctaga ggtttagagg 300 ctagaggtca ccggtttagc attggacttg gagagaagcc acgagatcgc cgtttggggc 360 acatcaagtg gaaaacagaa aagacggcca tcttggagaa cgtttttccc gcataccggc 420 ttcagcgacg gttagctccc gccaacttcg gacgacgcgc gcagtccggg aaatcagcac 480 gggagtcgac ggcgtaggca gtgcagaggt ttggtttttt tctctctctc gctctttcgc 540 tcaagctctg tctcgctttc gctcacttgg cgtcgtcaag ccacgttgag gcagcgacag 600 tacggtgttt gctctagtat ctctctctct ctctctctct ctctctctct ctctcgctct 660 cgttagtttt ttttttaatt ttgcaataat ggcagatccg gctggtgaat tagccaagca 720 aaaacgtacg agcgggttta tactcaactt taagcgcaat actgcaaaat tagcggcaac 780 tgctcgcacg ctgccgcata tgcgcacgcg attgagtttg ttagatgcat attggtcyga 840 gtatwctcar agrgayctgr tyytgcargc gaayargtcg gctttcaaag gcgaggcgta 900 yttygaraam gaygagtatc tcgaggttga gwcgrgttwt gtkgagrsta aagcggatkt 960 gyttcagwct atygargagt tagaaatcgc cgcgaggcca ccggtaaccg ctgcaggcgc 1020 gcaaaatagt caggtcgtac acgcaccacc ygttccgggt ccgtcggart tcgcgartat 1080 gccraagtgy catatcccga aattttccgg raaagccgag gaatgggaga ccttyaagga 1140 gcaattttcg tcgwtggtga raaacaaggc caacttaccg gatgctatca agatgcarta 1200 tttagtagay tcggttgarg gkcctgcagc gctcagagta aagggtcttc cgttgacggg 1260 tgctagtttt gagttggctt ggcaaaartt gatgcgcagg tacgataacc ctamgagacg 1320 catgcataca tayatggaat tgytgatcga tatgaaaccg gttaagcgga aatcatcggg 1380 agagttgatc gagttactgg ataaggcgga agccgcgctt aaaaccttca aggatgtagg 1440 ttgcccgtgc gaacattggg acagttggct tatccatatg gtcgagcgca agttggataa 1500 tgagacgcga gaagggtggc gtatttcgca agagtcggtg gtaggttttt cgacattttc 1560 agctctcaca aattttctcg aractcgtgc gtcctcgctt gatcaggatg cagatggtga 1620 tgcggagtcg cctaccggca aacaakctag tcagaagggt aatggttcga atcgttcgtc 1680 tcggcgtgag tcagtctcag ccaacgcaac tagcacgtcg tttgctaaag gcaatcgcaa 1740 tgcctctgta aaatgtagct tatgtagggg tcctcatcag cttcaagcgt gtccaarrtt 1800 caatggtatg tcsasttcyg ctcgctttga ayattgyaaa aagagagatt gtgccttaat 1860 tgtttgagat cggacatttt ttggctgatt gcccctcgca gagtcgttgt gccaattgta 1920 atggcaaaca tcacacgaag cttcacacag atcgacgtca gggaggagac gctgcggaat 1980 ctacacagtc gacaggtcag aagggttctt ctagcgcgcc tggcaaatca agtatgctgg 2040 aggtctcagc aagtacaaca gtggtgggta tactgggtta cttcaatccg tagatggtaa 2100 ttcgatatac gttagatgct tgattgatcc gtgtgcagag gtctctttgt cacgcaaaga 2160 gttgtgcmaa ttgtaagggc agacaccaca cgaagattca ctcggatcga cgacaaggag 2220 gagaggctgc gggattcttc aaattcaaca ggtcagaagg gctcttctag cgcgcctggc 2280 aagtcaagta tgctggaggt atcagcaagt acaacagtgg tgggtaggac ccggttgatg 2340 gccacagcca aggtcttact tcaatccgta gatggtaatt cgatgtacgc tagatgcttg 2400 attgatccgt gtgcagaggt ctcttttgtc acgcaaagag ttgtgcaatg tttgtccgca 2460 aagattaagc ctgtttctgt gtctgtggtt ggtgtcggtg caggtccttc tgcggtgtca 2520 aagggtgaag tttttcttca attaaagtcc aggttgaact cagaattttg tttggagttt 2580 tcggcgttgg tgttgagaga ggtaacggga tttttgccgc gagaggaggt cgtagtgtcc 2640 aattgggatc acatcagggg cttgcagctt gctgatccag attttgctcg ctcaaagcgt 2700 atagactgcg tgctcagcgc agaggtgtac gctgctatca ttcggccagg gctcagagtt 2760 ggtgcaactg atactcctgt tgctcaggag accgtatttg ggtggatttt gacgggtaga 2820 gcgtcttctg ctcctgagtc tgatagagct cagagcgctc aggcctgtca tgtggaagta 2880 gagccatcaa tttcgacgtt attagagagg ttttgggaaa tggaggagat ttccacctct 2940 aaagttttgt ccgaggagga tcagtattgc gaagattatt tcgcagatac tgtttatcgg 3000 gattccaagg gcagatttgt cgttcgcttg ccattctctg agaagttgtc agaggtttct 3060 tttgcacaca cgaggcagat tgctgtggca tgtttgctcc gctcagagaa gcgtcgagcg 3120 caggattcta gggtagatag tgcctatagg aaatttatgg aggagtatct gactttaggt 3180 catatggagc ttgtaaatag ttcgcatcga aaggacaata cgttctattt tactcatcac 3240 ccagtttact cggctgaggt ggggccggaa ggcaaatttc gtgtggtttt taatggttct 3300 tttggaaaaa tatcgctgaa cgacgttctc ttgactggtc gcaaattaca gtcggatgtc 3360 acgattatta tttcgatgtg gagattttcg aagattgtat ttacggctga tatcgtcaag 3420 gagtttaggc agatcctcat tcacccagat gacgttgatt ggcaaagaat agtctggcgt 3480 tctgacgttt cgaagcctat cgaagattat aggttgttga cggttacgta tggtacgcgc 3540 tccgctccat acctctcaat tcgaactctt cttcagctag cgagtgaggg aagtttagag 3600 tttcccttgg cagcgcaggt gctcagacac aacatgtatg tagatgatgc gtttgtaggc 3660 gctgacaatg agtctgaggc cattgagatt cggaatcagc tgatcaagct tctgtcatca 3720 gcaggtatgg agttgggtaa atgggcgtca aactgttcgg ctattctcga agatatccga 3780 gcagaaaagc agaaagagtt cgctgtcgaa tgggatgagg ctatttctgc attgggccta 3840 aaatggacgc cttcaggcga ctcgtttcgt ttcgaggtta aggttccagt tgctccgaaa 3900 attgtgtcaa agagatcgat tctatcggag atttcgaagt tgtttgatcc acttggttgg 3960 ctatcacctg tattagtcag ggccaaactc atgttacagg atttgtggat taacggtgtt 4020 gactgggatt ctcctctcac gggggaactt ctagagcagt ggcagatgtt gaggtctgat 4080 cttcctgact tggctcaatt gcgaattcct cgttggtttg gctcttcctc gcaaacatca 4140 tgggagttgc atggattttc tgacgcatcg cagagagctt tcgtcgctgc agcgtatatg 4200 gtaataccag ggcagaaatc ggcgttgatt atgtccaaga cgaaagtagc tcctattaag 4260 acggagagtt tgcctcgatt ggagttgtgt ggctcggtgt tgttagttag attgttgaag 4320 caccttttag atggactttt attgaagcct gtttcagtgc actgctggac agactccaag 4380 gtagttttag attggttgaa gggccaccca tcgagatggc agactttcgt ggcaaacaga 4440 gtgagtgagg tgatcactac tctgcctggg gtgcaatggc gtcatgttaa gtccgaagac 4500 aatccagccg attgcgcaac tcgagggttt tcgtcggagc agcttcagca gtcttccctg 4560 tggtgggagg gacctgagtg gatcaggaat gattggagga aggtaattga tagattgccc 4620 gtggatgttt cgctagatgt agtaaatgca caggttgctg ttgaggagat ggaggctgct 4680 cgagtcggtc tgattcgtta tgtgcaaagt cagcactacg aggaggaatt gcggtgtctc 4740 agaagtaaac aacggttgtc atctcgcagt cacttgctgc gacttgtacc atttatatgt 4800 ccacagggtt tgctccgagt tggaggccga ttgcagcatt cgtttctgca gtacgatgag 4860 aagcactcga ttattctgcc agcgtccagt atcatagtca agaggttgat tgaggagtac 4920 atcgtcagac tcttcatgga ggtgttcagc tgatgctcag ttccctcaac aggacgtatt 4980 ggatttcacg tgggttaaga gttgtacaag gagtttatcg gcgctgccat agatgtattc 5040 gctacgctgc tcagagtgta cagcagcaaa tggcaccgct tccatcatat cgagttactc 5100 ctcagcgagt tttcgcatac accggtctgg actatgcggg accgtttcca atattgttct 5160 ccaagggcag aggtgccaag tctacaaagg gatatatagc gatctttgtg tgtatggttg 5220 ttcgggccat tcacattgaa gttgtgtcag atttgtccac tgctgcgttt ctagctgcgt 5280 ttcgcagatt caccgctcgg cgtggacttt gtaggatggt tttcagcgac aacggcacca 5340 acttcaaggg cgcagccaca gaaattgaca agttattcca acgggcttca tcagtatcgc 5400 aggaggtggc agctgcattg gcgaaggacg gtatagtatg gtcttttata cctcctagag 5460 ctcctcattt cggaggacta tgggagtcgg cagtgagaag ttttaagcat cacttcaaac 5520 gtgtcattgg agacacgccg ttaacattcg aggagatgtc tacaattgct gctcagatcg 5580 aagcttgctt aaactctcgg ccgctctgcc cattgagctc tgagcccacc gattctgtgg 5640 cccttacgcc tggtcacttt cttgtgaacg ctcctctcaa tgtattaccc gagccctttg 5700 acgacgtgaa tctaaaatct cgcacttgca ggtggaagct actctcggtt atgagaaacc 5760 atttctggga gcgatggcga ctggaggtat tgcatcattt acagcagaga aacaagtggc 5820 tgactcctgg acgccagttg gtcgtgggcg acttggtgct tctgaaggac gagctctgcc 5880 ctccatccaa gtggccactg ggtcgaatta cagcggtaca tccagggaaa gatggtcttg 5940 ttcgagttgt aaccatccgt actgccaatt ccgagttcca gcgaccagtt gtgaaggtca 6000 tctttctgcc ctcagacgag agtgcgcaga gggcagtctc cagccttcaa cctgagtcca 6060 gtcaagactg aagagtttcc cagtgtcatc aatcgaagag attttacaca caacacctac 6120 actgtaaact cacttagctc gtgaatagat agcatcctag tgaatgttta aatgtagatt 6180 cattaggtca agtttttatt ttgttcttag tattagcttt gctagaaatt ttgcattaga 6240 ttatcaatat atagcgaatt taaccttgta tcaattttat tgaatgttgt taaacttaaa 6300 gcatttaaac ttgttgatcg actgttcgtt raggaattga ttccaaagaa atttagtgag 6360 aacagttcga tcggtagagt catggtaggg attagcttct ttcgaagtag gctttcctat 6420 ctttaaattt aatttattgt tattgccggt ctctttattg ctgcgaggca gcaagagggg 6480 cggta 6485 // ID WUKONG repbase; DNA; INV; 431 BP. XX AC U87548; XX DT 21-AUG-1997 (Rel. 2.07, Created) DT 08-OCT-2010 (Rel. 15.11, Last updated, Version 2) XX DE Wukong, Miniature Inverted-repeat Transposable Element in DE mosquito. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; MITE; Wukong. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-431 RA Tu Z.; RT "Three novel families of miniature inverted-repeat transposable RT elements are associated with genes of the yellow fever mosquito, RT Aedes aegypti."; RL Proc. Natl. Acad. Sci. U.S.A 94(14), 7475-7480 (1997). XX RN [2] RP 1-431 RA Kojima K.K. and Jurka J.; RT "Classified as a non-autonomous Sola1 transposon."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR GenBank; U87548; Positions 1 431. XX CC 4-bp TSDs. TAYA target site duplications. [1] The consensus is CC similar to Sola1 elements; thus classified as a nonautonomous CC Sola1 element. [2]. XX SQ Sequence 431 BP; 144 A; 68 C; 76 G; 143 T; 0 other; ctgcccataa ctgcatattt gtaacattcg acaaaagtag gcactgagta aatggaatac 60 caagtgtaca tttcattgca gtatgggagg aaactaaaat ttcaaaaaat taccaggaac 120 tcaaaagtgc ttatttgagg ctgatatttt gtacaactca taccacatac taggtgaata 180 gtcagaaaat aattctgata gaaatttcat tgctattaca ttacgaggct catttataat 240 ctcaatgtga ctgttatgcg gttataattg ctaatgtgac aaaacaactt ggtatttttt 300 tcgaattttc tagaacaatc tcacagattt cattaagttg atcggaagta tttatcattg 360 gatgactctc catggtatta actccatttt gtcaaaaatg tcgaatgtga ctgttatgca 420 gttatgggca g 431 // ID BEL-8_AA-I repbase; DNA; INV; 6930 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_AA_; KW BEL-8_AA-LTR; BEL-8_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6930 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 865-865 (2011). XX DR [2] (Consensus) XX CC Positions [5971-6540] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 28..4443 FT /product="BEL-8_AA-I_1p" FT /translation="MSSQPNVRQTRSQTRAQQANANPNVGGDDRVSSHSSR FT NSFIPSMMDNDGGELRDCGGCNRPNNSEQYMVQCQKCSIWFHFSCANVSTT FT TVRTISFVCGKCMPGGDSVPQPTPSVISGISSSSSIRRARLERELQLMADE FT KKLMEDLSRERIDMERELRERELQEKLEREKQFIARRHALLSRQDHDEERS FT VRSMRSSQNSIQRTEDWVRQTGSGVVGCGDTVQQIQASTSANSGHPDEASL FT VNAQGVHPSSTPLRTANESPPLGDAPMANTGSEVRSIPQTVESITIEDSED FT FAEGAVGLNLNKCDPSQNPSNLPLVDLQPYEDILKLEEVPPTGAIPKINRF FT GGHCYKRWSAETGELRKQNALLHHQQSEAELRVVHELMVKHRTDMDTRRKR FT EMELVHQIGSLDKQHADELRRVRESEKDLRTQIEHRDLEKATLKAKIETLE FT TQLVEEKERTRNSEADMRVQLIRSERECEALRLQLAEMENELQCLREMEHQ FT LHDQIEAARRSEQEAVRRWREAEKEYWDLHDQVEQVISRNVDRSTDSGSSP FT PLPPPPASWLEPNTLGTELSSASELNPSFPPPPPWVSVNNVLSGDGREFAP FT LSGQTVGGMINMNPPIHCDSNMISSGSNIPPQMSSQFLVPPYASGLGPSPQ FT QLAARQVVTKELPIFSGDPIDWPLFISSYQHSTETCGYSNSENLLRLQRSL FT RGSAKDSVSSFLLHPSTVPQVLSTLQQLYGRPEQIVNNMIAKVRATPAPKP FT DRLETLVSFGLVIQNLCGHLKAVGLERHLANPILLQELVDKLPATVKFSWA FT LYQEQVPVVDLNVFSDYMAKVSSAASGVTQLVNVTQKPAKEERNRQKERSF FT VNTHVSTDQPKATLRENAEKVGTNGGKQKEKDVSKNNSRFCAVCNVDNHQI FT ESCTSFKGLDLDGRWKAVKSHRLCARCLTSHARWPCKGEVCGINDCPKRHH FT RLLHFDPPAETKTTNAVVTVHRQLSSSTLFRILPVTLYGTKGQLDTYAFLD FT DGSSVTLVERSIADALGARGETETLRIEWTGGINKTIAGTEVVMMEISEAG FT GSKRHRLSEVYTVENLGLPQQTTNYAELSTRFAHLSKLPVKSFRSAVPGIL FT IGQSNSHLLATLKLREGKLTDPIATKTRIGWTISGSLQKPTMHRQLHIHAE FT STDADLHEYVRHFFDVESLGVAVVPAVKGVEEQRSYRILEATTRRLSDNKY FT ETGLLWKHDYVEFPDSRPMAEKRFKCLEKRLQQDPVLYDSFRRQIADFKVK FT GYIHQATTDELEEYDLRRTWYLPIGIVINEKKPGKVRVIWDAAAKVDGTSL FT NSMLLKGPDLLTPLLSVMFPYRERQVAVSADIREMFLQISIRKEDRSALLF FT PYRDSPDLPMSTMVSDVAIFGAACSPAHAQYIKNMNADEQEAVFPRGAAAV FT KKRHYVDDYVDSFDTAEEAVQVAMEVIEVHKRAGFHIRNWMSSDSTVLEKL FT SEANRKP" FT CDS 4741..6930 FT /product="BEL-8_AA-I_2p" FT /translation="MEWIAVLKQMDGLRIPRCYFPGYDPESYKNVELHVFV FT DASAQAYAAVAYFRIIDHGQIRIALVSSKTKVAPLREISIPRLELMAALLG FT ARLRKTVEENHSLTIQRTHFWSDSSTVCSWIKSDTRRYRQFVAFRVDEILS FT LSKVDEWRWISTKLNVADEATKWGKGPSCNTDSRWFQGPEFLYSNQTNWST FT VPEDNSDESNSELRETYMCNHQVRLPIVDVERFSRYDRMLRAVAYVHHFVD FT NLRAVKSARSADTVGVTSEDLRKAERTLWLIAQNEAFPNETAVLKRNLELS FT TACQKKIEASSSIWKQSPFADEFGVLRVGSRAANAHALVYDTKFPIILPRN FT HRITDLLLDFYHRKYGHANDETVVNEVRQRFHVPRLRVEVRLARKRCMWCR FT VYKSTPIAPKMGPLPAVRLQPYVRPFTYVGVDLFGPYLVKIGRSVAKRWVC FT LFTCLTIRAIHLEVVTNMSTDACKKAFRRFIARRGAPQEVYSDNGTNFVGA FT NRDLQTEISKIYTELGSTFTNVQTQWRFNPPAAPHMGGCWERMVRAVKSAL FT ECVPVERKLDDESFATVLAEAESMINSRPLTFIPLETADCESLTPNHFLLL FT SSSGVREPEKFPTDIGMALKNSWNLVKHTLDNFWRRWVVEYLPIIIRRTKW FT FQDVKPIEIGDLVYIADENVRNRWIRGRVVHTIPGKDGVTRRAEVQTSSGV FT LKRPATKLAVLDVASSGDDRPDVGATRRGG" XX SQ Sequence 6930 BP; 1939 A; 1583 C; 1872 G; 1534 T; 2 other; attcttaaag aattcaaccg gtacacgatg agttcacagc caaacgtgag acaaacccgg 60 tcacagacaa gggcgcagca ggcgaatgct aaccccaacg ttggtggtga tgatagggtg 120 tcgagtcatt catcgaggaa ttcattcatc ccatctatga tggataacga cggaggcgaa 180 ttacgtgatt gcggcggctg caaccggccg aacaactcgg agcagtatat ggtacagtgc 240 cagaagtgca gtatctggtt ccatttctcc tgtgcaaacg tcagcacgac cacggtacgt 300 accataagct tcgtgtgtgg gaaatgtatg ccgggtgggg attccgtacc gcagccaacg 360 ccgagcgtaa ttagcggcat ttcgagttcg tcgagcatcc gtagggcgag attagagcgg 420 gagctacagc ttatggctga tgagaagaag ctaatggagg atctcagccg ggaacggatt 480 gatatggaac gagagcttcg tgaacgagaa cttcaagaaa agttggagcg tgaaaaacaa 540 ttcatcgctc ggaggcacgc attgctgagt cggcaggatc atgacgagga gagaagtgtg 600 cgtagtatgc gaagcagcca gaactcaata cagcgcacgg aagattgggt aaggcagacg 660 gggtcgggcg ttgtcggttg tggagacacc gttcaacaaa tacaagcctc aacgtcagcg 720 aattctggtc atccggacga ggcaagtctg gtaaacgcgc agggagtaca cccctcttcg 780 acccccctaa gaacagcaaa cgagtcgcct ccactcggtg atgcacctat ggcgaacacg 840 ggatcagagg tgagatcgat cccacaaacc gtagaaagca tcacaattga agattcggag 900 gatttcgcgg agggagccgt cggattgaat ctgaacaagt gtgatccgag tcaaaatccg 960 tctaatttgc ctcttgtgga cctacagccc tatgaagaca tcctgaagct ggaagaggtg 1020 ccgccaaccg gcgcgatacc gaaaatcaat cgttttggtg ggcactgcta caaacgctgg 1080 agtgctgaaa ccggagagct gcgaaagcag aacgccctgc tacatcacca acaatccgaa 1140 gcggaactaa gggtcgtcca cgagttgatg gtgaaacata gaacggatat ggacacgcgg 1200 cgaaagcgtg agatggagct ggttcaccaa atcggcagtt tagataaaca gcacgctgat 1260 gagctgagac gagttcgtga atcggaaaaa gatttgcgca ctcagatcga acaccgagat 1320 ttagagaaag ccacactaaa agctaagatt gagaccctgg aaacgcagct cgtcgaggag 1380 aaagaacgta ctcgcaattc ggaagcagat atgcgggtcc aactcattcg aagcgagcga 1440 gaatgcgaag cgcttcggct tcaactagcc gaaatggaga atgaactcca gtgcttacgc 1500 gagatggaac atcaattgca tgatcaaatt gaagcggctc gtcgaagcga acaggaagcc 1560 gttcggcggt ggagagaggc cgaaaaggaa tattgggatt tgcacgacca ggtggaacag 1620 gtcatcagtc gcaacgtaga tcgttcaacg gacagcggca gttctccccc gttacctccg 1680 cccccagcgt cgtggttgga accaaacacg ctgggcacag aattatcttc cgctagtgag 1740 ttaaacccct cttttcctcc accccctcct tgggtatcgg tcaataatgt tctgtcgggt 1800 gacggccgag aattcgctcc gttatcggga cagactgttg gtgggatgat caatatgaat 1860 cctcctattc attgtgattc gaatatgatt tcttccggtt caaatatccc tccgcaaatg 1920 tcgtctcagt ttctggtacc gccttatgca agcggcctcg gcccttcgcc tcagcagtta 1980 gcggctcgac aggttgtgac caaagagctt ccaattttct ccggggatcc aatcgattgg 2040 ccgctgttta taagtagtta ccaacattca acggaaactt gtgggtacag caactcggag 2100 aaccttttgc ggttgcagcg cagtctgaga ggtagtgcta aggactctgt cagcagcttt 2160 ttactccacc cgtcgacagt acctcaggtc ttgtctactt tgcagcagct atacggccgc 2220 ccggaacaga ttgtgaacaa catgatcgcc aaggttcggg caactcctgc tccgaaaccg 2280 gatcggttgg aaactctggt cagctttgga ctggtaatcc agaatctctg tgggcacctg 2340 aaagcagttg gcctagagcg gcatctggcg aatccaatcc ttcttcaaga gttggttgac 2400 aaactacccg caacagtcaa gtttagctgg gcgctctacc aagaacaagt tccggtagta 2460 gatctaaacg tgttcagcga ctacatggca aaagtatcat cagcagcgag tggcgttact 2520 cagctagtta acgtgacgca aaaaccggca aaggaggagc gaaatcgaca gaaagaaaga 2580 tcgtttgtca acacccatgt ttccacggac cagcccaaag caactttgcg agaaaacgca 2640 gagaaagttg gaacaaatgg tgggaaacaa aaggaaaagg atgtcagcaa gaacaacagc 2700 agattttgcg ccgtttgcaa cgttgacaat caccaaatcg agagctgcac ctcgtttaaa 2760 ggtctcgact tagatggcag atggaaagca gtaaagtcac acaggctttg tgctcgttgt 2820 ctcacatccc acgcgcgttg gccatgcaaa ggggaagttt gtggaattaa tgattgtcct 2880 aagcgacatc accgtctgct ccatttcgat ccaccagcgg aaactaagac gaccaatgcc 2940 gttgtaacag ttcatcgtca attgtcgtcg tccacactct ttcgtatcct accagtcact 3000 ttgtatggaa caaaggggca gctcgacacc tacgcattcc tagatgatgg atcgtcggtg 3060 acactcgtgg aaaggtcaat agcagatgct cttggagcca gaggagagac ggagacgttg 3120 cgtatcgaat ggactggagg catcaacaag acgattgctg gaacagaagt cgtcatgatg 3180 gagatatccg aagcaggtgg cagtaaacga cacagattgt ccgaagtata taccgttgag 3240 aatcttggct tgccacaaca gacaacgaac tatgctgagc tttcaacacg gtttgcacac 3300 ctcagcaaac tgccggtcaa gagtttccgc tccgctgtac cgggaatctt gattggacag 3360 agcaattcgc atttattagc tacgttgaaa ctacgtgagg gcaaattgac tgatccgatt 3420 gcaacgaaga cccgaattgg atggacaata agtggtagtc tgcaaaagcc tacgatgcat 3480 agacaactcc acattcatgc ggagtcaaca gacgctgatc tacatgaata cgttcgccat 3540 tttttcgacg tagagagctt aggtgtagcg gtggtgccag cggtgaaagg cgtcgaggag 3600 caacgatcct atcggatcct agaagcgaca acgcggcgac tgagcgacaa caagtacgag 3660 acgggcctgc tctggaaaca tgactacgtg gaattcccag acagtcgacc gatggccgag 3720 aaacggttca agtgtctcga aaagcgttta caacaagatc cagtgcttta cgacagcttt 3780 cgtaggcaaa tagcggactt caaggtcaag ggctatatcc accaagcaac gacggatgaa 3840 cttgaagaat acgatcttcg ccgtacatgg tacctgccga ttggaatcgt tattaacgag 3900 aagaagcctg gaaaagttcg cgtgatctgg gatgcagcag caaaagtcga cggaacatcg 3960 ttgaactcca tgttgctcaa gggcccggat cttctaaccc cgctattgtc cgtaatgttt 4020 ccataccggg agcggcaagt agcagtgtcc gcagacatta gagaaatgtt cctgcaaata 4080 tcgattcgca aggaggatcg tagtgcgctg ttgttccctt acagagactc cccagatctg 4140 ccgatgagca ccatggtatc cgacgtagcg atattcggag cggcttgctc gcccgctcac 4200 gcgcagtata tcaagaatat gaatgcggac gaacaagaag cagtattccc acgaggagca 4260 gcggccgtta aaaaacggca ctacgtagac gattacgtgg acagttttga tacggcagag 4320 gaagcagttc aagttgcaat ggaagtgata gaggtacata agcgtgcagg attccatatc 4380 cgcaattgga tgtctagcga cagcaccgtg ctcgagaagt taagcgaagc taatcggaag 4440 ccakcgaaaa acatgctatc ggaacagaat actgcattcg agcgggtgtt gggaatggcg 4500 tggatgcaag aagacgatgt atttacgttt tcgatacagt cttgggagaa agtgcgtaac 4560 ctactggagg acaccgcgat tccaactaaa agagaaatgc tgcgtttggt tatgagcatc 4620 tacgatccgt tgggattgat tgcgtcgttc gtaattcaag ggaaggttat aatccaggat 4680 gtgtggcgga caaaaactgm ctgggataat caaattccgc cggagattgc tgaacgatgg 4740 atggagtgga tagcggtact caagcagatg gacggactgc gtatccctcg gtgctatttt 4800 ccgggatacg acccagagag ctacaaaaac gtggagctac atgtgtttgt agatgctagc 4860 gctcaggctt acgcagccgt agcttatttc cgcatcatag accacggtca aatcagaata 4920 gcgcttgttt cgtcgaaaac gaaggtcgca ccgcttcgag aaatttcgat accacggctg 4980 gaattgatgg cagcgttgct aggagcacgc ttgcgaaaga cggttgagga gaatcactcg 5040 ttaacgattc aaagaaccca tttttggagt gactcgtcta cggtgtgttc gtggatcaaa 5100 tcggatacac gacggtatcg tcaatttgtc gccttcagag tagacgaaat attgagcctt 5160 tcaaaggtcg atgaatggcg gtggatctcc acaaagctca acgtggctga tgaagctacc 5220 aagtggggga aaggcccttc atgcaacaca gatagccgct ggttccaagg accagaattc 5280 ctgtacagca atcaaacgaa ttggtcgacg gttccagaag acaacagtga tgaaagcaac 5340 agcgaattgc gagaaacgta catgtgcaac catcaagtca gactgccaat agtggacgtg 5400 gaaaggtttt ctcgatacga tagaatgctg cgtgcggtag cgtacgtcca ccacttcgtt 5460 gataacctac gcgcagtcaa aagtgccagg tctgcagata ctgttggagt gacaagcgag 5520 gatttacgaa aagcagagcg gacgttgtgg ttgatcgcac agaatgaagc attcccgaac 5580 gaaactgcag ttttgaagcg taatctggaa cttagtacgg cgtgccagaa gaaaatagaa 5640 gcatctagca gtatttggaa gcagtcacca ttcgccgatg aattcggagt attgcgagtt 5700 ggcagtagag ctgctaatgc gcacgcattg gtctacgata cgaagttccc aattattctg 5760 cccagaaatc atcgaataac ggatttattg ttggacttct accaccggaa gtacggccat 5820 gccaatgatg aaaccgtagt taatgaagta cgtcaaagat tccacgtgcc tcgtttgcgt 5880 gtagaagttc gcctagctag aaaacgttgc atgtggtgtc gtgtgtacaa gtcaacgcct 5940 attgctccaa aaatgggccc gcttccagca gtgcgattac agccatatgt acgcccgttc 6000 acgtacgttg gcgtagattt atttgggcct tatttagtga agattggacg aagtgtagcg 6060 aaacgatggg tttgtctttt tacctgcctt acaatccggg ccatacatct tgaggtggtc 6120 acaaacatgt ccactgatgc gtgcaagaaa gcttttcgaa gattcatagc aaggcgtgga 6180 gccccccagg aagtttactc cgataatggc acaaattttg taggagctaa tcgagatcta 6240 caaactgaaa tcagcaagat ttatactgaa ttgggcagca cattcacgaa tgtccaaaca 6300 cagtggcggt tcaaccctcc tgccgctcct catatgggag gttgctggga aaggatggtt 6360 cgcgccgtga aatctgcact agagtgtgtt cccgttgaac ggaaattgga cgatgaatcg 6420 tttgctaccg tgttggctga ggcagagagc atgattaact cccgaccact aacgtttatc 6480 ccgctggaga cggctgactg tgaatcactg actcctaatc actttctgct gttgagctca 6540 agtggcgtgc gagagccgga gaagttccct acagacatag gaatggcgct aaagaacagc 6600 tggaatttag tgaagcacac tttggataac ttctggcgac gttgggttgt agaataccta 6660 ccaatcatca tacgccggac taagtggttc caagacgtaa aaccgataga gataggagat 6720 cttgtctaca tagcggatga aaatgtcagg aatcggtgga tacgagggcg agtagtccat 6780 acgattccgg gaaaggatgg agtaacgcgc agagcggaag tacaaacatc aagtggtgtt 6840 ttgaagagac ctgctacgaa gttggctgtg ctagacgtgg caagttctgg tgacgaccgt 6900 ccggatgttg gtgcgacacg gcggggagga 6930 // ID MuDr-2x_AP repbase; DNA; INV; 2999 BP. XX AC . XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 2) XX DE A distinct, diverged MuDr-type family. XX KW MuDR; DNA transposon; Transposable Element; MuDr-2x_AP. XX NM MuDr-2x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2999 RA Jurka J.; RT "Highly diverged MuDR-type families."; RL Repbase Reports 8(4), 415-415 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(1247..1843,1773..2525) FT /product="MuDr-2x_AP_1p" FT /translation="MCLFTATYKYMNIHYENDHNIIESVDNFEFESIDGFK FT NWKHEIELSTQTLYVKNFGSSEKGNQIYSYYKCHRSGFYNSISKGQRHLKT FT QGQGSNKINGYCPASINVIESKVTKRCNVKFNGNHIGYENEIGHLPLNTCN FT RDDIAAKISQNIPFDRISDEIRDNITNNHLERTHLLTKKDLYNIEASYNLN FT NEAVPTSFKLKKIYIILKLHIISIMRQYLHHSNDATSVGAWVEKLKSDDKL FT SLVYYKSQGQIDTNYLELKEDDFLLLIMNDYQKSMLNKFGNDVICIDETRG FT MNSYHFNLTTIMVLVDLREGFPCSFMISNRVDEAVLRIFFAEIMDKTGIIQ FT PNVFMSDMAESFYNAWVVEMKPAKHRLYCTWHIDRAMALRIHIILNNKPHF FT YTHIYESQEHSIFRYTIVVKSNAPSATQICILKRLFYNLYCYFYIFIFLML FT KKTST" XX SQ Sequence 2999 BP; 1096 A; 416 C; 424 G; 1063 T; 0 other; acaagcgtct agcgagtagc gacagggaat ccgaatggct gcaagatggc cgacttcgtt 60 atcatgatcc gttatcacca attttattat caatttaaaa aaaaaaaata tatttctttc 120 ccttacctta tttagttatt tcgaacgaag ctgtataata atttattata aataagttct 180 actaagtcta atacagtaat tcaacaggta aaatgaactc agtacatcca gtacttctta 240 aaatatttgt gtttgagttt ttactaattg ttaaaaaatt tgcctgtttg attaaatgta 300 caatattgtg aaagctttca ctacatgagt ttaaatgttt tattcatccc ttttcttgat 360 gtactggaat attctttaat tgtgtttgat ttctcctatt atgcatttag ataatatttt 420 gattattttt tttatggttt ttgttattta tgttatggag gcctaagaat gtcactctca 480 ggttgatgtg caatcagatt acctacttcc atgttccttg tgaagtttct cagggcggtc 540 acttgtcttt gtcaccattc ctatgtacat gatgatggag ctggccagca aggatccatc 600 attcctattc tcaattatta atatttaatt gtatttgtat tacaaatgca ttgtaattct 660 attattattt tgttttcaat tttaatttca attaaagaaa tttagtgttt tcataatttt 720 aattggggca accctaatat attataaata aataaataaa tatctaatta ttattctaat 780 ttgtcaaatt ataaatgtgt ttaatgtgct atttgaatat gtattgtaaa attgtttcat 840 ggttgttata agtatttaat aaaataaaaa ttcacaaaat aaatgtctat ttattaacaa 900 attataaaat tataaatttt atttctgtaa atacaataaa tatgcatcaa tattttattt 960 tgttaaatta ggtttttaag aacatgtcag acataaatga attcaaatgc atagaatgtg 1020 agaaaaagtt taaattttta aaaaacttaa gagctcgcat aaaaacaaat catcctcttt 1080 taaagttgga tgaaatagca ccaacaaaaa aacgaaaaat agtaatttgt atatatttac 1140 atgtgaccag tgtacaaagt catattgtca tcaaaaaaat ctaattgaac acaaaaaagt 1200 tgctcattca attgctccaa caattgaaat tatagtcgaa tgtgccatgt gtttatttac 1260 tgccacatac aaatatatga atatacatta cgaaaatgat cataatatta tagaaagtgt 1320 agataatttt gaatttgaaa gcattgatgg ttttaaaaat tggaaacatg aaattgaact 1380 atccacacag acattgtatg tcaaaaattt tggttcctct gaaaaaggaa atcaaattta 1440 cagttattac aaatgccaca gaagtggttt ctataattca ataagtaaag gtcaacgtca 1500 tttaaaaaca cagggacagg gttctaacaa aataaatggg tactgtccag ctagcattaa 1560 tgttatagag tcaaaggtta ctaaaagatg taatgttaaa tttaatggca accatatagg 1620 ttatgaaaat gaaataggac acttgccttt aaatacatgt aacagagatg atatagcagc 1680 taaaatatca caaaacattc cctttgatag aatttcggac gaaattcgtg ataatattac 1740 caacaatcat ttagaaagaa ctcatttgct aactaaaaaa gatttatata atattgaagc 1800 ttcatataat ctcaataatg aggcagtacc tacatcattc aaatgatgct acaagtgttg 1860 gagcatgggt tgaaaaatta aaatctgatg acaaattaag ccttgtttat tacaagtcac 1920 aaggtcaaat tgacacaaat tatttagaac taaaagaaga tgatttttta ttgctcataa 1980 tgaacgacta tcaaaaatcc atgttaaaca aatttggaaa tgatgttatt tgtatcgatg 2040 aaacgcgtgg gatgaattcg tatcatttta atttgacaac aatcatggtt cttgttgatt 2100 taagagaagg atttccatgc tcttttatga ttagtaatcg agttgatgaa gcggtcttga 2160 gaatattttt tgctgagata atggacaaaa ccggtattat tcagcccaat gtcttcatgt 2220 cagatatggc tgaaagtttt tataacgctt gggttgttga aatgaaacca gcgaaacata 2280 ggctctattg tacatggcat atagatcgtg ccatggctct acggattcat attatattaa 2340 acaataagcc acacttttat acgcacatat acgagagcca ggaacacagc atattcagat 2400 ataccatagt cgttaagtcc aatgcacctt cagctactca aatatgcata cttaaacgac 2460 tattttacaa tttatattgt tatttttata tatttatatt tttaatgttg aaaaagactt 2520 caacataata tcatagatct ctgatggtca gtatacaaca attataacgt atttgacaac 2580 taattattac ttactaacta ttatgaatta tatgaactta catattacca tttgtatatc 2640 tatttcaata tctcattata tatgaattat gaattataaa ttataaaaca tagaattata 2700 ataactataa aaaactgtac aatgtacttt gtaaatcgta attcttaaat cctaacctaa 2760 gattataatc ggtgctaact actaacctgt taactaataa taattattgt taattaactg 2820 aaatatcaat gggttaccaa ctttaatacc caatacccaa ttattattaa tattattaat 2880 catcatgatt ttcaatagaa aatgatatta ggtatgataa taaaatcggt gataacggat 2940 catgataacg aagtcggcca tcttgcagcc attcggattc cctgtcgcta gacgcttgt 2999 // ID Gypsy-11-I_HM repbase; DNA; INV; 3739 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-11-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3739 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1988-1988 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 29..3688 FT /product="Gypsy-11-I_HM_1p" FT /translation="MDAKIFKKVLCDEPNEEEYIYWKKMLSIYLDKANIDE FT DCKLQVLFVLSGVKAFTIIQDTTSFDDAIKILDNKYQKRSTPIMMRHKLRS FT FKQQEGETVESFMSSLKQFARKCPTEPLTAEQHINLLICDAFVAGIYSPSI FT RQRLLEATEDNLEALYKTALTMELATEDALNLTTTTTANTTLPTFAASRNL FT TKSHCYWCGNDPHPKIKCPARNSTCTHCQRRGHWATVCLSKKQNKTYTKAA FT VVKEYQEENDNAIVSTVIAAINSPDRLINVTINKNITQALIDTGSDKTFIT FT SELLQHWNLPYDTQSTSKVSLADNSTLKVKGIFQGSLLLACEKLKVQFLVV FT DNLVAPVIIGMDVLTQHTSITINFKGNKEPLKFCLATKSMKCTTYALIPGV FT DIDKIKPVATTSRRVPKNNSFITDEINRMLKQDIIQVSRSPWRAQCFVVSS FT GNKNRLVIDYSNTINLHTPLDSYPTPRIDDLILKIAAHKVFSTIDLKSAYH FT QVKIRPEDYKLTAFEANGRLYEFKRLPFGCTNAVPIFQRVMDEFIQNNKLE FT NTYAYLDDIIIGGKDDQEHDKNLSRFLHAAQIANVEINKEKSKFKKTHIEF FT LGHIIENGTLKPDPKRYEALINMPEPKTLKELNRMIGLFAYYAKWIHNCTE FT FTLPLTQARDNFPTKGLTPQAKEAIKKLKIKLAEACLASPNFKVPLTIETD FT ASDTALGGTLLQLGRPVAFFSRTLSNAERKHAIVEKEAAAIVECCRRWKHL FT INSVPYIIIITDQRSISLIFNKQISTKIKNEKLTRWRLELADINYTISYRP FT GNQNIVADALSRCCSIQNTNKTFYNIHSQLCHPGIKKMIHYCKTRNLPYST FT TEIKQLTSQCTTCNELKPRFYKPPAGRLIQATRPWERLSMDFVGPLQSTSA FT NKYILVIVDEYSRYPFAFPCKDITANTVIQHLLSLFSLFGAPSSMHTDRGT FT QFESISLKVFLERNGVIRTRTTPYHPQGNGQCERMNGTILKAVSLALKTLS FT LGKDRWEVALQMALSSIRGLLCTATNETPHQRMMNFQRSSIIGTVLPNFLT FT EQDSTILYRRQIRLKGDPLVDRVQLLETISPHFARVKLQNGKIDTVSTKDL FT APLQDRNPLIEPVQQQSECSNNDEIVISNNNHNSGNRKQVELNDTNEVVSN FT LLHPFGNPIKDSFSYDQSTTTVPNQPSIQKVITPDTQPNIISRTGRTIKRP FT KYLDDYFLS*" XX SQ Sequence 3739 BP; 1374 A; 812 C; 591 G; 962 T; 0 other; attttatcaa atatttttta tttacattat ggatgcaaaa atatttaaaa aagtactatg 60 cgacgaacca aacgaagaag agtatatata ctggaaaaaa atgctatcaa tttatttaga 120 taaagctaac atagacgaag attgcaaact ccaagtttta ttcgtactat ccggtgttaa 180 agctttcaca ataattcagg atactacatc atttgatgat gcaatcaaaa tacttgataa 240 caaatatcaa aaacgatcaa cacctattat gatgcgccac aaacttcgct cttttaagca 300 acaagaagga gaaacagtcg aaagcttcat gagtagcctg aaacaattcg caagaaaatg 360 tccaacagaa cctctaacag cagaacaaca tattaattta ctaatttgcg acgcttttgt 420 cgctggaatt tactcacctt ctattcgaca acgactcctc gaagctacag aagacaatct 480 cgaggcactt tataaaacag cgctaacaat ggaattagcc accgaagatg ctcttaattt 540 gacaacaacc acgacagcta atactacgct accaactttt gccgcaagtc gtaaccttac 600 gaaatctcat tgttactggt gcggtaatga tccacatcca aaaattaaat gcccagcacg 660 aaattccacc tgtacacatt gtcaaagaag gggccactgg gcaactgtct gcctctcaaa 720 gaaacaaaat aagacctaca ctaaagcagc agttgttaag gaatatcagg aagaaaacga 780 taacgcaata gtatctacgg tcatagcagc aatcaactca cctgatagac taattaacgt 840 caccattaac aaaaatataa cacaggcact gatagatact ggctccgata aaacgtttat 900 tacatcagaa cttcttcaac attggaacct tccttacgat actcaatcta cctcgaaagt 960 atcattggca gataattcga cattaaaagt taaaggaatc ttccaaggct cattgttatt 1020 agcttgcgaa aaattgaaag tgcaattcct cgttgtagac aacttggtag caccggtaat 1080 tatcggaatg gacgtgttaa cacaacatac ctcaattacg atcaatttta aaggaaataa 1140 agagccctta aagttctgtc tggcaacaaa gagtatgaag tgtactactt acgcattgat 1200 acctggagtg gatattgaca aaattaagcc agtcgctaca acatcacgac gggttcctaa 1260 aaataactcc ttcataacag acgagataaa ccgtatgtta aagcaagata taatccaagt 1320 gagccgtagt ccttggcgtg cacaatgttt tgtagtatca tcaggaaaca aaaatcgtct 1380 tgtgattgat tattcaaata caatcaatct acacactcca ctcgattcat acccaacacc 1440 tcgaatcgac gatttgatac ttaagatagc tgctcataaa gtcttcagta caattgacct 1500 caaatcggct taccatcagg taaaaatacg acctgaagat tacaagctaa ccgcattcga 1560 agcaaatggc agactatacg agtttaaacg tttacctttt ggttgcacaa atgctgtacc 1620 aatttttcag cgggttatgg atgaatttat tcaaaataat aaactagaaa atacatatgc 1680 ttacctcgat gatataatta ttggtggaaa agatgaccaa gagcacgata aaaatcttag 1740 ccgtttccta cacgcagctc aaatcgccaa cgtagaaata aacaaagaaa agagcaaatt 1800 taaaaaaact cacatcgaat tcttaggcca tatcattgaa aatggaacct taaaacccga 1860 ccctaaaaga tatgaagctc ttataaatat gccagaacca aaaaccttaa aagaattaaa 1920 ccgaatgata ggtttattcg cctactacgc taaatggata cataactgta ccgagtttac 1980 tctaccgctt actcaagcaa gagacaactt tccaacaaag ggcttgaccc cacaggcaaa 2040 agaagcaatt aaaaagctca aaattaaact tgcagaagct tgtctagcct cccccaattt 2100 caaagttcca ctaactattg aaacagatgc ctctgacaca gcactgggag gaacgttact 2160 tcaactaggc agacccgtgg ccttcttctc aagaacacta tcaaacgctg aaagaaaaca 2220 cgcaattgtc gagaaagaag cagctgctat tgtagaatgc tgtagaagat ggaaacacct 2280 aattaattca gtaccataca tcattataat tacagaccaa agatccattt cccttatttt 2340 caataaacag atttcaacaa aaattaaaaa cgaaaaacta actcgatggc gtttagaatt 2400 agcagatatt aattatacaa tttcatacag accaggaaat caaaatatcg tagccgatgc 2460 cctttctcga tgttgctcaa ttcaaaacac caataaaacc ttttacaata ttcactccca 2520 actatgtcat cctggaatta aaaaaatgat acattattgc aaaacccgaa atctacctta 2580 ctctaccact gaaattaaac aactaacaag ccaatgtaca acctgtaacg aactcaaacc 2640 gcgcttctac aaacctccag ctggtcgtct tattcaggct acgagacctt gggaacgctt 2700 atctatggac tttgttggac ctttacaatc cacctcagct aataaataca tcctcgtgat 2760 agtagacgaa tactctcgct acccatttgc attcccttgt aaggatataa cagccaacac 2820 tgtgatacaa caccttctca gtcttttttc attatttggt gcaccatcat ctatgcatac 2880 agatcgagga acacagtttg aaagtatatc acttaaggtt ttcctagaaa gaaacggtgt 2940 catccgaact cgaactactc catatcaccc acaaggcaac ggtcaatgtg aaagaatgaa 3000 tggaacaata cttaaagctg ttagtctggc cctaaaaaca ttgagtctgg gaaaagatag 3060 atgggaggta gcattgcaaa tggctctatc ctctataaga ggacttctgt gtactgctac 3120 gaacgagaca cctcatcaga ggatgatgaa ctttcagcgt tcatcaataa taggtacagt 3180 tctaccaaac ttcttaacag agcaggattc gacaatttta tacagacgtc agatacgatt 3240 aaagggagat ccacttgtcg acagagttca actattagaa acaatatcgc ctcatttcgc 3300 acgcgtgaaa cttcaaaatg gaaaaattga tacagtttca acaaaagatc ttgccccact 3360 acaagataga aatcccttaa tagaaccagt ccaacaacag tccgaatgct cgaacaatga 3420 cgaaattgtc attagtaaca acaaccacaa ttctggaaat agaaaacaag ttgaattaaa 3480 tgataccaac gaagtagttt cgaacctact gcatccgttt gggaatccaa taaaagatag 3540 cttctcttat gatcaatcaa ctacaacagt acctaaccaa ccatctattc aaaaggttat 3600 tactcctgac acacaaccta atataatctc acgaacagga cgtactatta aaaggccaaa 3660 gtatctcgac gactactttc tttcttagag tattatccaa gagtaagccg tactaaaact 3720 ttcttaaaac ggagaagac 3739 // ID SARTPx1 repbase; DNA; INV; 4384 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 08-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE Papilio xuthus R1 clade retrotransposon SARTPx1 - a consensus. XX KW R1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; gag-like domain; R1 clade retrotransposon; KW R1_PX; SARTPx1. XX NM R1_PX. XX OS Papilio xuthus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Papilionoidea; Papilionidae; Papilioninae; Papilio. XX RN [1] RA Kojima K.K. and Fujiwara H.; RT "Evolution of target specificity in R1 clade non-LTR RT retrotransposons."; RL Mol. Biol. Evol 20(3), 351-361 (2003). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Papilio xuthus R1 clade retrotransposon - a consensus."; RL Direct Submission to Repbase Update (31-MAY-2004). XX DR [2] (Consensus) XX SQ Sequence 4384 BP; 1092 A; 1168 C; 1365 G; 752 T; 7 other; ggttaggtta ggttaggtta ggttaggggt gcggtggccg gtgacctcct cgtggtgcac 60 cctggaaact ccccaccttc tgtggggagg ctaacaaccg gccggggggg gaattccgtn 120 nnnctatacc ggatagggcc acccataccg ggcaggcccg cggaacgatg gggagcgcgg 180 cagggcaggt caactcggcc accctgtcgg gcaatcgtcg aggatgtgtg gcagcacaat 240 aacttgcaga ggacttcggt cggtgtatgg cggccttcat ggctcgcggc gcggcagggt 300 cggtcaaccc ggccaccctg tcgggcgtaa ggtcgagggc gtgcagcagc acgttaaact 360 gctgaggact tcggtcggcg gacggtgacc ttcaaggttg cgcatattgg tcggggacct 420 caccctgctt cgaaaaggcg gtttccggtg tccgacaaca ccggaccaac ccagcatgat 480 ggcaagttta aagacgaatt taccccagga gggtaccacc agctccgctg gtggagaatc 540 cctccccgag gccactttgg cctcgggctg cccctcgtac tctggggggg gcaaaaactg 600 ctttgactgc acgacgactt tggccttgaa tatggatact gatgcacgcg atgtggaagt 660 gacggggagt ggaatggaag aggagatgga gctggacacg gagttgatgt tgagcgagga 720 atcgacatca atgtccgaga gcagtcgccg agccagcccc ctgccaagga agaggagagg 780 acgaccgccc actacgggcc actatgtcgg tctggccaaa gctaagttgg ccctcatcga 840 ggctgaacgg gaggaggaac gaaggagagc ggaggcggaa gtagtcgctg cctcgcgcgt 900 ggcccgggct agatcccaag cccaccggct gtcggagacg gccactgaag atgaggagtt 960 ccaggcggcc ggttccctcg gtcaagtgat agaggagaac atagaggtca tcaagaagat 1020 ggccaccacg tctaagaact tgaaaggggg atacgtgcga gccctaaagg acgcagcgga 1080 ggaaatcact aagatgacaa aagctctaca aagaagaact acatcggaag agacaaaggg 1140 attacaggcg gataatgctc gcctgcgagc cgaggtagca cagctgcgca aggaggtggc 1200 agagatgaag aagcatctcc ttgcaccgaa cgagaggtca gaggttgcag ttgaggccag 1260 gcggcagcct gaaccacagc cgcaaagaca acctaacgac agcgaggagc ttgtgcgcac 1320 catattatgc caggtgggca atatgataaa cgcaaggttt gaggcgctgc aagaccggct 1380 cctcccggag aaacgactcc gtccccagct ggcggcggac aacaggcctg tgtccgagag 1440 tcgcggtgca ccggaggtcc taaaagcggc gggtaaagcc aanggcaaaa cctcgggaaa 1500 agttccgaac aaagccccga gcaaaccccc caaggagaag acgacggcgg agccccaggc 1560 ccaaccacag ccaacggcgg cggtggaacc ggcagaccat cagcccaaag agctgccctg 1620 gacccagtgg tcaaaaaagg gcttaagaaa aagaagaagg acagaccgcg ggtgtacaca 1680 aacaccgaaa gagagaaagc gtcaagagca ccatcagacc ggccacccga agactgctgc 1740 ggtggtcgtc tccatcacac cggaggccat cgctaaaggc ctcacatatg acgcagtcat 1800 tgctgaggcc aaggcaaaaa taaagctaca ggatgtggga ataacaacag gggtacgctt 1860 tcgggtgtgt gccacaggag cgcggaggtt cgaagtcctg ggaacgaaca atggcccgca 1920 agcggacgcc cttgcggaga gactgacgca agtttttgac acagacttgg tccgcgtctc 1980 cagaccgacg aaaacgacgg aggttaaaat atccgggttg gacgactcct caaccatcga 2040 ggaggtactg gcggcggtgg cggaaatagg aggctgccca agagagagcc tcaagagcag 2100 cggggtggtg agagacagat tcggagtagg acacgcctgg gtggagtgtg cagtgcccac 2160 tgcgagacgg gtggctgcag caggacggct gaccatatcg tgggtgtcgg ccaatgtaac 2220 gctgctcgaa cccagaccga tgcgttgtta tcgatgccta caaaaaggcc acgtcagagc 2280 acaatgcaat gcagaggaag ataggagcaa actgtgcttc cgctgtgggg tagagggcca 2340 caagtttaag ggctgcatgg ccaagccgca ctgcaccatc tgtgcggcgg cccaaaaacc 2400 tgcggatcac aaattgggag gcagaggatg ttcggctcca gctcccaaaa tcagcagagg 2460 gaagagaaca gcacagccac aaccagcaca acaggcacca gaagaagctg caatggagac 2520 ggaagaagtc aacatcagaa atggccctac acctactaca agccaatgtt aaccattgcg 2580 cacgggctca agacctgctc atacagagta tggctgagtg ggcgacgcag atagctgtgg 2640 tgtcagaacc ctactacgtt cccaataggg atgactgggt tggggacgag gacagcctcg 2700 tagcgctcat cgtgccccgg tccgcgagat ccccatctac tgatggggtt caaaagggcc 2760 gcgggtacgt gggtgctaca ataggaggaa ccttagtcgt cggcgtgtac tgcgccccta 2820 gcatgaacct cgtggagttc gaggacctcc tcgaacgtgt aggaactctt gttggcagaa 2880 atctccccca ttccgtgctg gtaatggggg atttcaatgc caagagttct gcatggcgta 2940 gcacttccac caatgctcgg ggtgcagtac tcgaggagtg ggctttgact tcgggactct 3000 gtctcctgaa taggggatcg agacccactt gcgtccggac gcagggcagt tccatcgtgg 3060 atttaacatt cgcctgccct gcgactgcga ggcgggtgta cggctgggag gtagtggagg 3120 gtgtggagac actatctgac caccgataca tccgcttcgc aatatcgacg acaccaacgg 3180 tcccagcaca ccagggaccg gagtcgcttc aacgggacac cccacggtgg gtggtgaagc 3240 gactcaacac ggatttactc atagaagctg cccaggttga gacatggtcg cctgcggaca 3300 gctcaccgga tttagacaat agggctgctg agttgcgtgc atcgatgact cgcgtctgcg 3360 atgcgtcgat gccgcgacaa ggcccccccc cacggaagag acaagtgtac tggtggtctg 3420 cagacatcgc agcaatgcgt gttgcatgcg ttgctgccag acgccagtac caacgacaac 3480 gccggagaag acagagagat gaaatcgctg aggcaagctt gcatgacatc tataaggccg 3540 ctaaacatgc cctgagccga gcgatatgcg aagctaaaga cagggcacga gaagaactta 3600 ttgagacctt aaacctggat ccgtggggtc gcccatatcg tatggttcga gggaagctcc 3660 ggacgcgggc gtcgcctctg acgcagagtc tccagcccca actggtgcgg ggagtggtgg 3720 ggtccctctt ccccaacgga gtggcacaca cgccgccttt caggattccg accatccgga 3780 atgaaaatcc tgactccgtt gatgaggagg acattccacc aatcacccct ggcgagttcg 3840 gggctgggat tcaccggctt cgcgccaaaa ggacggctcc ggggcctgat ggcataccgg 3900 gccgtgcttg ggtactagcc gcggatgtat acgaggaaag agttgtgggg ctgatgagcg 3960 actgcctcgc tcacggacgc ttcccccctg ctgtgaagaa cggcaacctg gtcctcctta 4020 aaaaagaagg caggccagcc gactcccccg cagcatatag ggccataatt ttacttgacg 4080 aggtggcaaa gttgtttgag cggattatcg ccaaccgctt aatcaaacac atgacgacgg 4140 tcggcccaga cctagatgaa aagcaatttg gttttagggc gggtaggtcg accatccata 4200 caataatgcg agtaaagaag ataacggagg aagccattgc ccagggcaat gtcgtgcttg 4260 cggtgtcctt agacatngcn aacgctttta acaccctgcc ctggagctgt atcatcgagg 4320 ccctccgtta tcaccaagtg ccgaaatacc tgcgccgtat tattaccgac tacctctcgg 4380 ctag 4384 // ID Gypsy-4_AA-I repbase; DNA; INV; 3510 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_AA_; KW Gypsy-4_AA-LTR; Gypsy-4_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3510 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 977-977 (2011). XX DR [2] (Consensus) XX CC 'ATGA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1557..3143 FT /product="Gypsy-4_AA-I_2p" FT /translation="MKHRSTKASTVHQIITTGPPVFCRPRRLPVDKLKEAK FT AEFQFLMDQGICQPSKSSWASPLHLVKKSNDKWRPCGDYRNLNAITVPDRY FT PVPYIQDFSSIMHGKKIFTCIDLQRAYHQIPVAPEDVPKTAITTPFGLFEF FT KFMTFGLRNAGQTLQRHLHAILGDLDFVYPYIDDLCIASDNVEQHEEHLRI FT VFERLRQNGLVINAAKCQIGQESVDFLGHEVTSDGIKPKRSKVQAILDVPR FT PTNAKQLKRFLGTINFYRRFIPRAAENQRMIGGNIRNDTTALEWNETTNDA FT FERCKQDLANATLLAHPSPEAKLALDVDASNTCIGAVLHQITTNGPQPLAF FT FSRKLSESQQKASTYDRELIAMYEAVKHFHDTLQAREFCIYTDHKPLITAF FT QQRPEKATPTQQRRLSYISEYTTDIRHVSGEANKVADMLSRINTITTIEAI FT DYDRMAELQKIDPELKQFLENPPTNTTVQLKRLKSPLASTPIFCDISIETI FT RPFVQHEFRRTIMEKMHGVSHPGVRATTRLIST" XX SQ Sequence 3510 BP; 1028 A; 984 C; 754 G; 744 T; 0 other; actggtgacc ccgacgtcgc ggaagtatca gaatcacgaa ttgttcgata gttatcgacc 60 cacgatcgtc aaaacgccgt ttcaatattg cgaaatctcc gaacaaaagt gccaatttca 120 ttgccgtaaa aattcacaat ggcgatcgaa gaagcttctg gaagcgtttc tggaactgcg 180 aacaatatga agctggaaga tgttgattcc aaggaaccaa cggttatcgc caaaattagt 240 ttcccggact tcgatccgga cgacattgaa acatggttca tgtgtctcga agctgcattc 300 agtgtcaatt cgatacggaa cgacaagtta aaattcaatg cggtaattgt tgccctctgt 360 tcgcgcgcga aatttgtgca cacagtgatc gccaactgta atgcgacgaa tgtcaacgac 420 aagtacgacc gcctcaaagc agcggtactg gcgcatttcc aaccgtcaga aacacagcga 480 ttgaccagcc tactctctgg tatgtcccta ggagatcaaa agccaagtgt cctcctgtcc 540 gagatgcgtc gcttgggcgg agtaggttgc accgacaatg tgctatctaa cctctggctg 600 cgagcccttc caaacaccac tcgttccatt attgctgcta tgccaactgc ttcgttggat 660 gaccaagcga aagtagcgga caaaattttg gaagcacccc gtgagcaaat cgcagccgtt 720 cagaaacccg aaacctcatc aacatcatcc ctcgagcaac gtatcgaagc cctatccaga 780 cgcctggatg aagccctatc aggtaatttc cgtgggcgtg aacgacacag tagccgtgca 840 cgtcaacgct catctggccg gcgagatgga actccagcga gaagcaaaat acctcgtcgt 900 tggatctgtt ggttccatta ccgacatggt gcccaagctc gaaaatgtga gaagaacaga 960 agcgaaaacc caaacatcaa gtgcattttt ttcgatggga atgtcgaggt gtacactcga 1020 ccgagggaga attaaatcaa acaatcagcg tcaacaacca caccacatca agcagcgaac 1080 catcaaccga ccagaacccc agaatcctga accaaacctt agacgtccaa cgaatccaca 1140 tacacgatct caagtcatca aatcgattcc tcatcgacac cggagcggac atttcagtcg 1200 ttccgcattc tccaaaagag cgcctcaagc ctacagcatc gcagcaactc ttcgccgcaa 1260 acggtacccg aatccagaca tacggtacca agcggatcac cgtcgacctt gggctgcgaa 1320 gaccatttgt gtggatattc gtcattgcag atgtgaaatc tcccatcatc ggcgctgact 1380 tcctcaagca ttacgatctt ctcgtggatc tgcggagaaa caagttgatc gacagtacca 1440 cccggctgga aatcgagaac attcacgcag tgaccgaacc agtcattacc acattcgatg 1500 ccaattctcc tttcgctgaa atcctcgccg actatcaaga catcaccgtc ctcaacatga 1560 agcacagatc gaccaaggcg agtaccgtcc accagatcat cactaccggc cctcctgtgt 1620 tctgccgacc acgccggtta cctgtggaca aactaaaaga agcaaaagct gagttccagt 1680 tcctaatgga tcaaggcata tgtcaaccgt cgaaaagtag ctgggccagc ccacttcacc 1740 ttgtcaaaaa gtcgaatgac aagtggcgac cctgtggtga ctaccggaat ctgaacgcca 1800 tcaccgtgcc agaccggtat cccgtgccat acatccaaga cttctccagc atcatgcatg 1860 gtaagaaaat tttcacttgt attgatttgc agcgcgcata tcatcagatt ccagtagccc 1920 cagaagacgt cccaaagacg gccataacaa cccctttcgg actcttcgag ttcaaattta 1980 tgaccttcgg gctgcgaaat gcgggacaaa ctctccaaag gcacctacat gccatattag 2040 gtgaccttga ttttgtttat ccgtacatcg atgatctctg catagcgtca gacaatgtag 2100 agcagcacga agaacacctc cgcatcgtgt tcgaacgatt gagacaaaat ggtctcgtaa 2160 tcaacgccgc caaatgccag attggtcagg aatccgttga ttttctcggc cacgaagtca 2220 cttccgacgg aatcaagccg aagcgcagca aagttcaagc catcctggat gtcccgaggc 2280 caacaaacgc caagcagctg aagaggtttc tcggaacgat taatttctac cgacgtttca 2340 tcccacgagc tgcagaaaac cagcgaatga ttggtggaaa tatccgcaac gataccacag 2400 cactcgaatg gaacgaaacc accaacgatg ccttcgaacg atgtaagcag gatttagcga 2460 acgctactct gcttgctcac ccatctcccg aagcgaagct tgcactcgat gtcgacgcct 2520 ccaacacatg tatcggagcc gtcctacacc aaatcacaac caatggtcct caaccccttg 2580 ctttcttctc cagaaaatta agcgagagtc aacaaaaggc tagcacgtac gatagagagc 2640 tgatcgctat gtacgaagca gtcaagcact ttcacgatac tctacaggct cgtgagtttt 2700 gcatctatac agatcacaag ccccttatca ccgccttcca acaacgacca gagaaggcaa 2760 ccccaacaca acaacgcagg ctcagctaca tcagtgagta taccaccgac attcggcatg 2820 tatcgggcga agcgaataaa gtcgccgaca tgctgagtcg catcaacacc atcactacaa 2880 ttgaagcgat cgactacgac cgaatggctg agttgcagaa gattgatcct gagctgaagc 2940 agtttttgga gaacccacca actaacacca ccgtccaact gaagcgattg aaatctccat 3000 tagcatctac acctatattc tgcgacatct ctatagaaac cattcgtccg ttcgtgcaac 3060 acgaatttag gcgaacaatc atggagaaaa tgcacggtgt atcccatccc ggagttcgcg 3120 ccactactcg tctcatctcg acctgaaatc aacaatgtcg aagctaagcc ccacacctac 3180 cagcaaccat tcaaggaaat ctgttttcgt gcagaaacaa ctaactacat gcagccacgt 3240 cttcgtcaag gtaggagcga ttaaaccacc gctctcacag ccgtacgatg gaccatatcg 3300 tgtggttcgt cggaagaaaa aagtcttcat tgtggacgtg aacggcaagc caactgctat 3360 taccatcgat cgtttgaaag cagcgtacat acaggcagat catcgatgcg ccgaacacga 3420 aactacagca gaccagacta cctaccaaac acgctctgga cgccgcgtga agatccctcg 3480 tcgatattgg taaaactaag gggggagtag 3510 // ID Gypsy-184_AA-I repbase; DNA; INV; 4826 BP. XX AC supercont1.139; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-184_AA_; KW Gypsy-184_AA-LTR; Gypsy-184_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4826 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.139; Positions 363676 358851. XX CC 'TGATA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 557..4720 FT /product="Gypsy-184_AA-I_1p" FT /translation="MALIGTIDPYVPGTSFSNYVELIEYFFSSNNIAEDRK FT KDIFMSCAGLAVFEELKLLYPATDLKTLSYAEITKKLKERLDKIDSEIMLR FT YKFRCRRQSPSESGENYILAVKHLAESCDFGAFRDSAIRDQLVFGVYNREL FT QKRLLNEEDLNLRSAERIIKGFEMANNNTLYFAETGAGIGVNSVKHRLGGR FT QSRIDRMDRSRSRERNFSRNRNLRTNFRKFEGRSRSGQRNSSRGRYANFVC FT HFCRKRGHIQKNCYQYRDNQGESVNSVKAETNPKEVHDYFKRLRVDYDSES FT EEEGDYPCLMISSIRKVSKPCLLEVKIEGRVCLMEVDSGSAVSVISKTEFL FT EKFSGINVRKSHKKLIVINGSNLEIYGKALISVCLNGNSSNLELVVLDGDH FT KFTALIGRDWLDVFYNGWREKFLQMDKSTSFGAVHSVQDNIDNALSVMKSR FT YSEIFNRDFSSPIVGFEAELVLKEDRPIFRKPYDVPYRLKDKVLDHLGLLE FT KDNIITPIKTSLWASPIVVIIKKDGDIRLVIDCKVSINKVLIANTYPLPTA FT QDLFASLAGCKIFCSLDLTTAYTQLQLSENSRKIVVINTIKGLFTYNRLPQ FT GASSSAAIFQQVMETILQDIEYVYVYLDDVLIAGKDFEDCYKRLTLVLDRL FT VKANIKVNLKKCKFFVTQLPYLGHIITDNGLLPNPEKVSTIIAADKPKNTT FT ELKAFLGLVNYYGKFLQNLSSTLSPLYMLLKKEVKFLWDFKCDEAFEGCKE FT KLLKANILTFYDPKKPIVVSTDASSYGLGGVISHVIEGVEKPIWFTSFSLN FT HAQRNYPILHLEALAVVSTVKKFHKFLFGQKFTIFTDHKPLLGIFGKEGRN FT SISVTRIQRYVMELAIYDYNIEYRPANKMANADFCSRFPLKIEVPKSLEKQ FT YIKHLNFSSELPLDYVTIAKESKKDFFLQQIVAFMKNGWPERIDQCFKNIY FT SQHQNLEEIEGCLLFQDRVIIPVALQRKCLQLLHSNHLGIVKMKQQARRSL FT YWFGINSDIESFVKNCEVCIKTSIVPTHKGTSEWISTSRPFSRIHADFFYF FT ERRTFLLIVDSFSKWLEVVWMKYGTDAEKVVKEFIAFFSRFGLPDVIVTDN FT GPPFNSTYFIGFLERQGIQVFKSPPYHPQSNGQAERLVRVTKEVLKKFLLD FT PAMKPMDLQDKINYFLFNYRNTCLSEGGKYPSESVLSFKPKTILDLVNPKN FT NYKQHLTEPLETKCEIIESNEQDPYANLKLGDKVYYKNHNTKDIEKWLNAT FT FIKKISQNILQISIGSTLISAHKGQVKIQSKSGRRRNITLQLPAVEDASSN FT TGRAKKRMFCELDESDEEFRGFSPEPPLPTAREPHPRLLQGSQKSVRVPTP FT QCHPVLRRSTREKKKKIDKDFVYRK" XX SQ Sequence 4826 BP; 1635 A; 761 C; 1029 G; 1401 T; 0 other; gttggcgacg agggaaagca aaagttgaga tattgattcg tgtgcattag ttggcagtgt 60 tgtgtcgtaa ggctggagta aagttcagtg gtagttggtg aagttgctgt tgtttggtgg 120 ctgttggaac ggtgaagggg aaaattgcta tacggagaac atattgtttg accaggtgca 180 aatcgaactc ctgtaagctg tggtgattcg cacaaggagg atacttgtct ggcgaataca 240 gtggtagaaa gcgccgagtg taataatttt gaagcggcca ttacaatttc attaatttgt 300 ggtttctagc cggtcggaaa agcgacgcca gattttactt gctgaaaaaa cggttcgtga 360 gtggaataag aacggcggag ttgaagtgtt gggaatagcg ggagccttga agtaaaacta 420 atctggaggt ggaagaaaca tctgaggaca acatcgaaga ttgaggacgt tagcagtatc 480 aaacaggtac gtggtttact ttctattatt aattcaggtg agcacagttt tatcaattga 540 ttcggtgcag tcaaaaatgg cgttgatcgg cacaatcgat ccgtatgtgc ccggcactag 600 tttttcaaat tatgtggaat taatagaata ttttttttcg tcaaacaaca ttgctgaaga 660 tcgtaaaaaa gatatattta tgagttgtgc aggattggcg gtatttgagg agttgaagct 720 tctgtaccca gctacagacc ttaaaacatt gtcttacgcc gaaatcacta aaaaattaaa 780 ggaaaggtta gacaaaattg actcggagat aatgttgagg tacaagtttc gttgtagaag 840 acagagtcca agcgagagtg gagagaatta tatattagca gtaaagcatt tagcagaatc 900 ctgtgatttt ggcgcatttc gagattccgc cattcgagac caattagtgt tcggagtata 960 caatagggaa ttacaaaagc gtttactaaa cgaagaagat ttaaatttaa ggtcggctga 1020 aaggattatt aaaggttttg aaatggcgaa caataatact ctctattttg ctgagacagg 1080 agcaggaata ggggttaatt ctgtcaagca ccgcctcgga ggtagacagt ccaggataga 1140 cagaatggat cgtagtcgat ctagagaaag aaatttcagc agaaatagaa atttacgaac 1200 taattttaga aagtttgaag gacggtcaag gtcgggccaa cgaaattcga gccgaggtag 1260 atacgccaat tttgtctgtc atttttgtag gaagcggggt catatccaaa agaattgtta 1320 ccaatacaga gataatcagg gcgaatcggt gaactcagtg aaagcagaaa ccaatcctaa 1380 ggaagtccac gactatttca aaaggcttcg tgtagattat gattcggaaa gtgaggaaga 1440 aggtgactat ccatgtttaa tgatttcttc tattagaaaa gtaagcaaac cttgtctact 1500 cgaagtcaaa atagaaggaa gagtatgttt aatggaagta gatagtggtt cggcagtgtc 1560 tgtcattagt aaaacagaat ttttggaaaa atttagtggc attaatgtga gaaagagtca 1620 taaaaaattg attgtgatca acggatcaaa tttggaaatt tatggaaaag ctttaatatc 1680 ggtatgttta aacggtaatt cttctaattt ggaattagtt gttctggatg gagaccataa 1740 gtttacggct ctcattggaa gagattggtt ggatgtattt tacaatggtt ggcgagaaaa 1800 atttttgcaa atggacaaat ctaccagttt tggagcagtg cacagcgtac aggacaatat 1860 agataatgca ttaagcgtta tgaaaagccg ctatagtgaa attttcaaca gagatttttc 1920 atctcctatt gttggctttg aagcggaatt ggtattaaaa gaagaccgcc caatttttcg 1980 aaaaccctac gatgttcctt atcgactgaa ggataaagta ttggatcatt taggtttact 2040 agagaaagac aatattataa cacctattaa aactagcttg tgggcatctc caatagtagt 2100 cattattaaa aaagatggtg atattagatt agtaatagat tgcaaggtat ctataaataa 2160 agttttgatt gcaaatacgt atccattgcc aactgcccaa gatttattcg catcactagc 2220 tggttgcaag attttctgtt ctttggattt aaccacggcg tatactcagt tgcagttatc 2280 agaaaactca agaaaaattg ttgtcattaa cacaattaag ggtctcttca cttacaatag 2340 gctcccacaa ggcgcatctt ctagcgcagc catctttcaa caggtgatgg agactatttt 2400 gcaagatatc gaatacgtgt atgtttatct tgacgatgtc ctcatagcag gaaaggattt 2460 tgaagactgc tacaaacgtt tgacattggt attagacaga ttagttaaag caaatatcaa 2520 agttaattta aaaaaatgta agttttttgt aactcaattg ccatatctag gacacattat 2580 aaccgacaat ggacttttac ctaatccaga aaaagtatcg acaatcatag cagctgacaa 2640 accgaagaac acaacagaat tgaaagcatt tttgggactc gtaaattatt acggaaaatt 2700 tttacaaaat ttatcctcaa cgttaagtcc actgtatatg ttattgaaaa aagaggtaaa 2760 atttctttgg gattttaaat gcgatgaagc atttgagggt tgtaaagaaa aactgctgaa 2820 agcaaatatt ttgacgtttt atgatcccaa aaagccaatt gtggttagta cagatgcgtc 2880 ctcatatgga ctcggaggag tcatttcaca tgtcatcgaa ggggtagaaa aaccaatttg 2940 gtttacgtca ttctcactca atcatgctca aaggaactac cccatacttc atttggaagc 3000 tttggcagtg gtatcaacag ttaaaaaatt ccacaaattt ttgtttggcc aaaaattcac 3060 tatttttacg gatcacaaac cccttttggg aatttttgga aaagaaggac gaaattcaat 3120 ttcagtgact agaattcagc gttacgttat ggaattggct atttatgatt acaatataga 3180 atacagacca gccaacaaaa tggctaacgc tgatttctgc tcgcgttttc ctttgaaaat 3240 agaagtcccc aaaagcttgg aaaagcaata tataaaacat ctaaattttt ctagcgagct 3300 gccattggat tatgtaacga ttgctaaaga atcaaaaaag gatttctttt tgcaacaaat 3360 tgttgccttt atgaaaaatg gatggccaga aagaattgat caatgtttca aaaatatcta 3420 ctcacaacat caaaatttag aggaaatcga aggatgcctc cttttccaag acagagtgat 3480 tattccagtt gcactacaaa gaaaatgttt acaattactt cattcaaacc acttaggaat 3540 agtgaaaatg aagcaacaag ctagaagaag cctctattgg tttgggataa atagtgacat 3600 cgaatcattt gtaaaaaatt gtgaagtgtg tataaaaaca tctattgtcc ctacacataa 3660 gggtacctca gaatggattt ctacttcgag acctttcagc agaattcacg ccgatttctt 3720 ttattttgaa aggcgcactt ttcttttaat agttgatagc ttttcgaaat ggttagaggt 3780 tgtttggatg aagtatggta cggatgcaga aaaagtagta aaagaattta tagcattttt 3840 ttcacgtttc ggacttcctg atgtaatagt aacagataac gggccaccct tcaattcaac 3900 ctattttatt ggttttttag agaggcaagg catacaagtt tttaaaagtc cgccatacca 3960 tcctcaaagt aatggacagg cagaacgttt agttagagtg actaaggagg tgttgaaaaa 4020 atttttgttg gaccctgcta tgaaaccgat ggatcttcaa gacaagataa attattttct 4080 gttcaactac cgaaatactt gtttatctga agggggaaag tatccttcag aaagtgtact 4140 atcttttaaa ccaaaaacaa ttcttgattt agtgaatcct aagaacaatt acaaacaaca 4200 cctgaccgaa ccattagaaa cgaaatgtga aattattgaa agtaatgagc aagaccctta 4260 cgctaatttg aaattgggag ataaggtcta ctataaaaat cacaacacta aagatataga 4320 aaaatggctt aatgcaacat ttattaaaaa aatctctcaa aatattttac agatctccat 4380 cgggtcaacg ctcatctcgg cccacaaggg gcaggtgaaa attcagagta agtctggacg 4440 aaggcgtaat ataacgctcc agctgcctgc tgtggaagac gcttcatcaa ataccggacg 4500 agcaaagaaa aggatgtttt gtgagctaga tgaaagcgat gaagaattca gggggttcag 4560 tcccgaacca cccctgccta cagctcgtga accacacccc aggttactcc aaggcagtca 4620 gaagtcggtt cgtgtaccta ctccacaatg tcatccggtg ctcagaagat cgactagaga 4680 gaaaaagaaa aaaattgata aggattttgt gtatagaaag tgaagcatgt attaatggaa 4740 ttttaaaaca atcactattg aattgaatct agtgaataca tttattactt gaattattaa 4800 tatttctaat taaaagggga aggtat 4826 // ID BEL-231_AA-I repbase; DNA; INV; 5964 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-231_AA_; KW BEL-231_AA-LTR; BEL-231_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5964 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 917-917 (2011). XX DR [1] (Consensus) XX CC Positions [5024-5575] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 20..2224 FT /product="BEL-231_AA-I_1p" FT /translation="MSANNTPGTPKENEEGGCVDCSRPKSYDNLVQCDQCD FT SWWHMRCAGVTASIADRAWTCRMCLPLSVNTSTSSSSARIALRLKQIEEER FT AIQQRAWEAEKRALEQERKAILEKYRLLEQHLDESRDNRSHRSHRSHVSQR FT SKLQRVNQWVDDQKLDVAVGTTATVHENPSLKIGNGQLSRIAGNEPPAVFP FT ITTMTSPNPNVSQQQIERNQQPTSGAIPKADSKTRGLIVDSVRFNEGNKIP FT SVLPVLPPGKLATNSYHNKQVISQPVDHYQDVLAKLGSISLTPNVESPTRI FT FSTEPQVEGQSSRYVSQPFPVGGNDEQYAPLANVNTNANHPQSSVPRAVLA FT PEVPNNVPSPSQLAARQIMPRDLPLFTGDPADWPIFISAFTNTTAACGYSN FT VENLARLQRCLRGAAYEAVRSRLLLPASVPQVLNTLQLLFGRPELLINVLL FT EKVRTTPAPRAEKLETLIEFGMAVQSLCDHLEAAGQQAHLTNPHLLMELVD FT KLPAHVKMEWASFMQRYPLVNLKNFGDFMNSIVILASKVTMYTGGMNRGGS FT SENAKMKFKGSINAHFSESDEKLCYACNGRGHRVRDCDSFKNLTVDERWKI FT AQTHELCRSCLYAHGRRSCRYANQCGINGCTFRHHPLLHSDRSNQPSRGAD FT AESAGIHTHGMPNQALLFRIMPVTLYGPRKNVKTYAFLDDGSHLTLMDEGL FT IDELGVSGKTRPLCLTWTGNVSRMEMNSQQLQLTV" FT CDS 4478..5962 FT /product="BEL-231_AA-I_2p" FT /translation="MAYVHRFVNNLIRMRGGEPTTSGYLTQAELQQAETTL FT IKLAQKEAFQSEMAILIHNRDLSIAEQTRLDTKSTLYNLTPYIDESGILRL FT DGRIGAAMYASNDTKFPAILPKNHQITSLILDSYHRQFQHGNAETVVNEIR FT QRYYVSRVRTLVRKISNGCPECKIRRAAPRIPRMAALPPARLATYARPFTY FT VGLDFFGPLTVKIGRGSAKRWIALFTCLTIRAVHCEVVNSLSTDACIKAIR FT RFVSRRGAPTEIYSDNGTNFQGTERVLMQQIQQGLAATITNTTTKWVFIPP FT SSPHMGGAWERMVRSVKQAMNIAYHSGRKLDDESLLTFVAEAESIVNSRPL FT TYLPLDSAESEALTPNHFLLGSSTGILQPVAEPTDSAAALRSTLNMINQQL FT DCFWRRWIREMLPTLTKRTKWFREEKPVEVGDLVLVVNDGRRNDWIRGRVS FT EVISGSDGRIRQAVVRTARGLLRRSVAKLAVLNIENGGKTGTGGQLYGGE" XX SQ Sequence 5964 BP; 1674 A; 1361 C; 1518 G; 1407 T; 4 other; caactttaaa acgatcgcga tgagtgcgaa taatacacca ggaactccga aggagaacga 60 agaggggggc tgcgttgatt gctcgcgccc caaatcttat gacaacttag tccaatgcga 120 ccagtgcgac agctggtggc atatgcgttg tgcgggagtg accgcatcga tagccgatcg 180 agcatggacg tgtcgaatgt gtttaccgtt gagcgtaaat acttcaacca gcagcagctc 240 ggcccgaatt gccctgcgat taaagcagat agaggaagaa cgcgcgatac agcagcgcgc 300 gtgggaagct gagaaaagag ctctggaaca agagagaaaa gcgattctgg agaagtaccg 360 gttgttggag caacacttag atgagagtag agacaacaga agccaccgca gccatcgtag 420 ccatgtgagc caacgctcca aattacagcg tgtgaatcaa tgggtggacg accaaaagtt 480 ggacgttgcc gtcggtacaa ccgcaaccgt gcacgaaaac ccatctctga agattggaaa 540 tggtcagctt agtcggatcg cggggaatga accaccggcg gtttttccca ttactactat 600 gacgtccccg aatcccaacg ttagccagca gcagatcgag agaaatcaac aaccaacgtc 660 tggtgcgatt ccgaaagctg attcgaaaac tcgaggtctc atcgttgata gtgtaaggtt 720 taatgaaggt aacaaaattc cctcagtact ccccgtactg ccaccaggta agttggctac 780 caactcttac cataataagc aagttatttc tcagccagtt gatcattatc aagatgtgct 840 tgcaaagcta ggctcaataa gtttgacgcc taatgtagaa agtccaacta gaatctttag 900 cactgagccc caagtagagg gccagagctc acgttacgtt tcccaacctt tcccagtagg 960 gggaaacgat gaacaatatg ctcctttagc caatgtcaat acaaatgcta accatccgca 1020 gtcaagtgtc cctcgcgctg tcttggcacc cgaagtgcct aacaatgtcc catctccctc 1080 gcagttagcc gcacgtcaaa tcatgcctcg cgatctaccc ctcttcaccg gtgaccccgc 1140 ggactggccg atctttatta gtgctttcac caacacaact gccgcgtgtg gttattcaaa 1200 cgtggaaaat ctagcgagat tgcaaaggtg tctaagaggt gcagcctatg aagccgtccg 1260 cagtagacta ctactacccg catctgtgcc ccaggtgctg aacacgttgc agctcttatt 1320 tggaagaccg gagctgctga taaatgttct gttagaaaag gtacgtacaa caccggcgcc 1380 tcgagcagaa aaactggaaa cactcatcga gtttggaatg gccgttcaga gtttgtgtga 1440 tcaccttgaa gcagcaggcc aacaggctca tttaacaaac cctcatctac taatggaatt 1500 ggtggataag ttgcccgctc atgtgaaaat ggaatgggct agtttcatgc agagatatcc 1560 gttagtgaat ctcaaaaact ttggagattt tatgaacagc attgtgatct tagctagtaa 1620 ggtgaccatg tataccggcg gaatgaatag aggtggctcg tcggaaaacg cgaaaatgaa 1680 gttcaaaggt tcaatcaacg cacacttctc agaaagtgac gaaaaactgt gttatgcctg 1740 taatggccga gggcatcgag tacgtgattg cgattctttc aaaaacctca ctgtcgatga 1800 acgttggaaa atcgcacaga cccatgaact ctgtagaagc tgtttatacg cacacggtag 1860 aagaagctgt cgatatgcta atcaatgtgg aattaacgga tgtacgttcc gtcaccatcc 1920 gctactccac tctgaccgaa gcaatcaacc ctcaagagga gcagatgctg agtctgctgg 1980 aatccacacg cacggaatgc cgaatcaagc ccttttgttt cgtattatgc ctgtaacact 2040 atacgggcct cgaaagaacg tgaaaactta cgccttcctt gatgatggat cacatctaac 2100 gctgatggac gagggactaa ttgatgagct gggagttagt ggcaaaacga ggcccctctg 2160 cttgacctgg acgggcaacg tctctcgtat ggagatgaac tcgcaacagc tacaattaac 2220 ggtgkcaggt gttgacgacc gacggcgcct gaagctggat gatattcgca ccgtcaagaa 2280 gctcgccttg ccgggacaaa ctctacgaat caaggacttg gcaggcaagt tccatcacct 2340 tcaaggtctg ccggttgctg aatatgaaaa cgccgtcccc cgcttgctga ttggagtcaa 2400 taatttgcat ctcaccgttc cattgagtgt gaaggagggg aaaatcaacg agccaatagc 2460 ggtgaaaact cgccttggct ggagcgtgta tggaggaaga agcagcctac ccgcgaattc 2520 cttgaacctt cacgtctgtg gatgtactag tgaccgtaac ctgcatgatt tagtaaagga 2580 ctacttttcg ttggaggatg ttggaaccaa accggtgaat ggactactat cagcagatga 2640 tcgcagagca caacatatat tgcaacagtc caccatgcga gtcgggaata tatacgagac 2700 tgctttgctg tggagatttg acgacgtcga gttgccagac agctacgcta tggccctgcg 2760 cagactcgaa tgcctagagc gtcgaatgga caggaatcct cgcctaaagg agaatctggt 2820 ccgacaaata gatgaatatc aagaaaaggg atatgcgcat cgggccagtg aggaggaatt 2880 atcagctgcc aatatgaaat gaatttggta tctaccatta ggagctgtgg tgawtccgaa 2940 aaaaccggaa aaaattcgtc ttatatggga tgcagctgct acggttgatg gcgtatccct 3000 caattcgcta cttctcaagg gcccggatca attgacgtcc ttatctgctg tcttggtacg 3060 ttttcgacag tacgccatag ctgtctcagc tgatataaag gaaatgttcc accaaatacg 3120 aattcgggaa gcagatcgtc attcccagcg ttttctgtgg cgcactgatc catcgaacga 3180 gccggatatt tttctaatgg acgtagcaac gtttgggtca acgtgttcgc cagcgtcggc 3240 ccaatatgtc aagaaccaaa atgcagctga attctccgac gtgtacccaa aagctgttgc 3300 agaaataacc gacaaccact acgtggatga ttatttagca agttttgaaa ccgtagaaga 3360 agcaactgaa gtttctcaac aagttaagga aatacacgaa aggggtggtt ttatgttgcg 3420 gcactggcaa tctaatagct ctgcagtagt ccatagcttg ggcgaaagca gcaaggctat 3480 caacaaacac ttgtttctgg acaaaagcac tcaatacgaa cgagttcttg gtatgctgtg 3540 gcttacgaac gaagaccagc tggggttttc tacgcaactt aaagaagatg ttcagcagat 3600 aatagaaaac gaatgttacc caacgaagcg acagctcttg cgttgtctaa tgagtttttt 3660 tgaccctctc ggcatattaa gcttcatcct agttcatggc aagattttgc tgcaagatgt 3720 ttggcgtgtt gggacgcaat gggatsaaga gattcaggta gagcaccagg agttctggcg 3780 aagatgggct gctctgctta agcaaataca tacagtcaaa attcaacgat gttatttccc 3840 ggatgctacc gttcaatact accggcaact acaattgcac gtcttcgtgg atgcawgcga 3900 aagtgcatac gctgccgtcg cttatttccg tatagttgac ccaagtggag tgatacgctg 3960 tgctttggtg gcgggaaaaa ctaaggttgc cccgcttaaa ccactgtcaa taccacgctt 4020 agaacttcaa gcagccgtgc ttggatctag actactacga tttgttcttg agtctcacaa 4080 cgtaatcgtg gtaaaacgct acctttggtc cgactcgtcg acagttttgg cctggctaaa 4140 ggcagatcct cgcaagtaca agcagtacgt agcctgcagg gttggagaaa tcttgtcggt 4200 aacggaggtg aatgagtggc agtgggttcc ctcgaagcag aacccggcgg acctagcgac 4260 gaaatgggga gatggtccga ctgctgatat tgaaggaata tggtttacgg gtcctgcgtt 4320 tctacaacaa ccagagccgg gttggccaaa acataggttg ttacaacagt ccaccgtaga 4380 agagctgcga gtttgtaacg ttcattgtga aactattgat gtgccactgg taatcgattg 4440 gaatcggttt tccaagtggg aacgactaca cagggctatg gcctatgtac acagatttgt 4500 gaataatttg ataagaatgc gcggtggtga accaacaact agtggctacc ttactcaagc 4560 agaactgcaa caggctgaaa caactttgat caagttggct cagaaagaag cttttcaatc 4620 cgaaatggcg atccttatcc acaatcgtga tttgtcaata gccgagcaga ccaggttgga 4680 cacaaaaagt actttgtaca acttgacgcc ctacatcgat gagtctggaa tccttcgact 4740 ggatgggagg attggcgctg caatgtatgc gagtaatgat acaaagtttc cggcaatact 4800 accaaaaaat caccaaatta cgagtttgat tttggattcg tatcatcgac agtttcagca 4860 tggcaatgct gagacagtcg tcaacgaaat aagacaacga tattatgtat ctcgggttcg 4920 gacattggtt cggaagatta gcaatggatg cccagaatgc aaaatacgga gagcagctcc 4980 tcgtataccc cgcatggctg cactccctcc agctcgtctg gcgacgtatg ctagaccgtt 5040 cacctatgta gggttggact tctttggccc tttgacggtg aagataggac gaggaagcgc 5100 taaaagatgg atagcgcttt tcacctgcct tacgattcga gcagtacatt gtgaggtggt 5160 aaacagtctc tctacagatg cctgtatcaa ggctattcgt cggtttgtga gtcgacgtgg 5220 agcgcccact gaaatttatt cggataacgg gaccaacttt cagggaacgg agcgagtatt 5280 gatgcagcaa atccaacagg gcctagcggc aacaattacc aacaccacga ctaaatgggt 5340 tttcattcct ccatcgtccc cacacatggg aggcgcctgg gaacgcatgg tacgctcggt 5400 gaaacaggcg atgaatattg cgtatcattc tggtcgaaag ttggatgacg aatccctgtt 5460 aacgtttgtt gccgaagccg agagcatcgt caatagccgc ccccttacct acctacccct 5520 ggactcggcg gaaagcgaag ctttaactcc caaccacttc ctgttgggaa gctcaacggg 5580 aattctacag ccagtcgcgg agcctacgga tagtgcagct gccctgagaa gtacgcttaa 5640 catgatcaac caacaactgg attgtttctg gagacgatgg attagagaaa tgttaccaac 5700 attaacgaag agaacaaagt ggttccgcga agaaaagcct gtagaagttg gagatttggt 5760 gctggttgtg aacgatggaa gaaggaacga ctggataagg ggacgagtat cggaagtcat 5820 ctctgggagt gacgggcgaa tccgtcaagc cgttgtaaga accgcgaggg gattactgcg 5880 tcgatcggta gcaaagttgg ctgtattgaa cattgaaaat ggtggtaaaa ctgggaccgg 5940 tggccagttg tacggggggg agga 5964 // ID DNA2-4_TCa repbase; DNA; INV; 1439 BP. XX AC . XX DT 22-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-4_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1439 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 665-665 (2009). XX DR [1] (Consensus) XX CC TSD is TA, often in multiple copies. Unclassified (possible CC non-autonomous Tc1/Mariner). CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1439 BP; 486 A; 246 C; 246 G; 460 T; 1 other; ctcagtgtca taaaaattgc aacaccaaga aggaatagga attttttgat gaaacttggc 60 acaaatgtta gactattagc aggtattaaa tgattaaaat ttgaacgttt ttaatcaagt 120 ggtaaaggag tttttaacac ttaaactctt ttaccagcaa gtgttgttct tccaattttt 180 tgcgtttctg acaatttttg aaggtgtttt tttaaattcg tttagtttta agtgttatga 240 aaacgaaaat atgcctagag tacgacaaaa aagaaattac caacaattaa acaaacttta 300 aagaagcaca atttttggga aacgaatagg tgttttttta tttagagaaa tygctatttg 360 tttgaagagt aatccaagta ccattatgcg atgttatcag tcattgatta gagaaggttc 420 aggacaaagc agaagaggaa gcagacttaa aaaaaacaaa tgaagtccaa aatggtcatc 480 ttcggattat ggcaataatt caagtacaag aaaaatttct gaccaatggt tgagttcgaa 540 ggtcgtcgta cttcacttct gactgtttac catcgaatta ggtcatttgg actgcttttt 600 gttggatttc ctaggtatgg gaggaatgag caaaaacaca gcttaaacct gacacaccaa 660 aacttttatt tggaaagaca acgtgtttca accacaaagt ggtcatcctc aggtgagaat 720 atacactgta tacattgacc acattatggt cgaaacgcgt tgtgtttcga aataaaggtt 780 tgatgggtca ggcttagata tgcgtttttg cccattcctc ccataccaag taaatccaac 840 aggaaacagt ccaaatgacc aatttcgatg gtaaacagtc cgaagtgata aaaaacgacc 900 tttaaaccca cccattgggc aaaaatttgt cttgtaatta aattatagcc atactccgaa 960 gacgaccatt ttggacttca tttgtttttt ttaaggtatg ctttctcttt cgcgctgtcc 1020 tgggccttct ccagtctatg actgataaca ttgcttaatg atacttcgat ttctatccaa 1080 acgactagca atttctctaa ttgaaaaacc cctatttgta gtccaaaaat catgcctctt 1140 tcaaaactgc ttaaattgtt agtaattttg gttttgtcgt ctccaggcct aattgcgctt 1200 ttagaacact taaacctaaa cgaattaaaa aaacacataa aaagatagtc agaaaagcaa 1260 aaaattggaa gaataacact tgctggtaaa agaacttaaa tgttaaaaac tcccttacca 1320 cttgattaaa atcgttcaaa ttttaatcat ttattactcg ctaatagtcg accatttgtg 1380 ccaagtttcg tcaaaaaatt ctaattcctt cttggtgttg caatttttat gacattgag 1439 // ID BEL-643_AA-LTR repbase; DNA; INV; 615 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-643_AA_; KW Pao_Bel_Ele220; BEL-643_AA-I; BEL-643_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-615 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 615 BP; 267 A; 72 C; 122 G; 154 T; 0 other; tgtcaacacc gctgggcagt tataacgggg tgccggaaag ccaaaaacgc aaagtgacag 60 agctagacag aagtttagat aaggggaaga ataaaaaaac ataaacaaac ttttcagagt 120 gcaggaaaac gtgaattata aaagttgtcc agtttattct tgaaaaatag ttgaaatctg 180 cggatgtaca gtaaagtttg ataaagttga ttggcctgta agtaaataaa attatcctga 240 aatgaaatta aaatgaaaat aatataaata atgtttgtga tgatagattg atggttaaac 300 agaagcgatt tgtgaagcaa atgaagtttc ggacagattt gagaaattaa gagtaaaagt 360 tgtaatcggg aaatcgtaag taattacaaa acaaacaata aaataacact aacgaagaaa 420 atttgttaca aataggaaac aatctgcaca ttaggattga aggtagagag cacgagggtt 480 aaaaaaaagg aaactacttt gtgagtatta aaatcaacaa acatttatac aaataataac 540 aaaaataaaa tattatagtt tgagctgacc aaaaaactgg agtctgcttc aaattcagtt 600 cctgaagtcc gaaca 615 // ID Copia-4_AA-I repbase; DNA; INV; 4139 BP. XX AC supercont1.10; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_AA_; KW Copia-4_AA-LTR; Copia-4_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4139 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.10; Positions 4195183 4191045. XX CC Positions [1772-2188] - Integrase core CC 'TGGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 208..1821 FT /product="Copia-4_AA-I_1p" FT /translation="MEDDRAPRVYLFAGKNYSTWSFRMQAYLEELGLLHCI FT EKPLEEEDFYPENPADTAAVEQAKEEKRKKRKQEDAKCKSVLIHKIADSQL FT EYVRGKTSPKAIWSVLQQTFEQKGVSGVFFLLKQLTGMKYCDSRSMEEHIL FT SFEKIVRELESAGIKFDEPVVVFFLLQSMPKSYEQLITVLETLPVQQCSME FT FVKSRLLSEDVKRQFSGSAKSLDPGTAFVGKGRKFPFKCHACGKPGHRRVD FT CPENKYSERASDDRKFDRKQMKKKNKSGAHSAETEHDDIAFIVSVGDALHS FT SGSEFRWVLDSGASEHMVNDRSCLVNVRKLEVPTVINVAKSGVSLVSRVVG FT DVKMSATVNGKKLSCTVYDVLFVPGLYTNLFSVKRVAERGMEVIFGKDSAR FT IVRNGKVVCTANRNGRLYELNVNVVKESTAMIGETESQLSLWHRRFGHIGK FT SGLLKMVRNDMVEGIEGSVSAATNAKVCEPCMMGKQAKLPFEECVKARSSR FT PLELVHTDVCGPFKPESWNGKKIFVTFIDDFTHFTDSIRAES" FT CDS 1862..3874 FT /product="Copia-4_AA-I_2p" FT /translation="MATSHFERKIARLRSDNGSEYMNEELITYCNESGIIM FT EPSVPYTPQQNGVAERMNRTIIERARAMLDESGINRMMWSEAVQAAVHVIN FT RSPTNALTVTKTPYEMWYGRKPDVSRLRVFGSKVFCHAPKEKRSKLDVKSR FT VCYLIGYGCNGYRVWEPNQRKVIVCRDVVIEELPVKRSKHPEVYTNDHLVS FT DRLQEPETSDLKSAEIVTGRKEVSVDEQDSSEEDGEDYEDAEDGNTPLRRS FT QRQRKLPERLADYEVCVAFALNAENYVEELPETVEELRKREDWPEWDAAIK FT DELKSLEKNRTWDLVDLPAGKRAISSKWVFKIKYTANGDVDRYKARLVAKG FT CSQRQGFDYQETYAPVVRISTVRTLLAVAVQKNLHVHQMDVRTAFLNGSLS FT ETVYMRLPTGFERGKKVCKLNKSLYGLKQAPRSWNERFNKFMVHLGFKRSE FT HDSCLYVRNKNGIVSYLILYVDDIILASSSLDELVKIKRSLTQEFEMDDMK FT ELNHFLGLKIERNMEEGTLIINQSQYIRALLKRFSMQDCKPVSTPLEVNLK FT LTTNESEPVTKHPYRELVGCLTYLMLSSRPDISSAVNFLSRFQSGATDTHW FT THLKRVLRYLQGTKDDSLVYRRNRNSEPLAGFADADWGSDVNDRKSTSGSL FT FQKQSTLRSVRRHAMLYGSETF" XX SQ Sequence 4139 BP; 1187 A; 806 C; 1159 G; 987 T; 0 other; ggttatgggc ccagaacacg tgttgttttt cacattagtt ttccagttct aaattcgcgg 60 aatttgtgaa ttcgatgagc ggaaagtagc gtgaaagttg aaagtagttt ccgtcgggag 120 tgaacacgaa aatttcgtcg gaaagttatc cgaaaagccg ccggtagtgt ttgccagcat 180 cgtcgtcatt ttcttcgttt tgacaaaatg gaggacgatc gcgcgccgcg ggtgtatttg 240 tttgctggga aaaattactc aacgtggagc ttccgaatgc aggcatacct ggaagagttg 300 ggtctgctgc attgtataga gaagccgttg gaggaagaag atttttaccc ggaaaaccca 360 gcggatacag cggcagtgga acaggcgaaa gaagaaaaac ggaagaagcg aaagcaagaa 420 gatgctaagt gcaaatcagt tttaattcat aaaatagccg actcgcaact cgagtatgtg 480 cgagggaaaa cgtcgccaaa ggcgatttgg tcggtgctgc aacaaacgtt cgaacaaaaa 540 ggtgtttcgg gagtgttttt cttgttgaag cagttaaccg gcatgaagta ctgtgactct 600 cgttcaatgg aagaacacat cctctcgttt gagaaaattg tgcgcgaact ggaatcggct 660 ggaattaagt ttgacgaacc ggtagtggtg tttttcttgt tgcaatcaat gccgaagtca 720 tacgaacagt tgattaccgt actggaaacg cttccggtgc agcagtgctc gatggaattc 780 gtaaaatcga ggctgctcag tgaagatgta aagcgccagt ttagtggaag tgcaaaatcg 840 ttggatccgg gaacggcatt tgtcggaaaa ggaagaaagt ttcctttcaa gtgccatgcc 900 tgcgggaagc cggggcatag aagagtggac tgccctgaga ataagtattc ggaacgtgca 960 agtgacgatc ggaaatttga tcggaagcaa atgaagaaga agaacaagtc gggagcccac 1020 agcgccgaaa ctgaacacga tgacattgct ttcattgtga gcgtggggga tgcgttgcac 1080 agtagtggta gtgagtttcg gtgggtgctc gacagcggtg cttctgaaca catggtgaat 1140 gacagaagct gtcttgttaa tgtgcgaaaa ctggaagtgc cgacggtgat caatgttgcg 1200 aagtcaggtg tgtcactggt tagccgcgtg gttggtgacg tcaagatgag tgcaacagtg 1260 aatgggaaga agttgagctg cactgtgtac gatgttttgt tcgtacctgg tttgtatacg 1320 aacctgtttt ccgtaaaacg agttgcggag cgtggtatgg aggttatttt tggaaaggac 1380 agtgcgcgga tcgtgcgaaa tggaaaagtc gtgtgtaccg caaatcgaaa tggtcgatta 1440 tacgaactaa atgtgaatgt ggtgaaggag tctactgcca tgattggtga aacggaaagc 1500 cagttgtcat tatggcatag aagattcgga catatcggaa agtctggttt gttgaaaatg 1560 gtccgaaacg atatggtcga aggaattgag ggaagcgtga gtgctgcaac caatgcaaaa 1620 gtgtgtgaac catgcatgat gggcaagcaa gcgaagctcc ccttcgaaga gtgtgtgaaa 1680 gctcggtcgt cgcgtccgct ggaattagtg cacacggacg tgtgtggacc gttcaagccg 1740 gaatcatgga acggtaagaa gatttttgtg acttttattg acgatttcac acactttacg 1800 gacagtatac gtgctgaaag ctaaaagtga tgtagtggat gcgttcaaaa agtatgctgc 1860 gatggctacg tctcactttg aacggaagat cgcgagacta cgtagtgata atgggagtga 1920 atacatgaat gaagaactga taacgtactg caatgagtcc ggaatcatta tggaaccgag 1980 tgtgccgtat acaccgcaac aaaacggcgt cgcggagcgc atgaacagaa cgattattga 2040 acgcgcgcgt gcgatgctag atgaatcagg tattaatcga atgatgtggt cagaagcagt 2100 acaagcagcg gttcatgtga taaatcgaag tccgacgaac gctttaacgg ttacgaaaac 2160 accgtacgaa atgtggtacg ggcgcaagcc agacgtttcg aggcttcgcg ttttcggaag 2220 caaagtgttt tgtcatgccc cgaaagaaaa gcgatctaaa ttggacgtta agagtcgtgt 2280 atgctatctg atcggttacg gttgtaacgg ctaccgcgta tgggaaccga accagcgaaa 2340 agtgatcgtg tgtcgtgatg tagtgattga agagcttccg gtgaaacgaa gcaagcaccc 2400 ggaagtgtac acaaacgatc atctggtgtc ggatcggcta caagaaccag aaacaagtga 2460 tttgaaaagt gcggaaatag tgacgggacg aaaagaagtt tcggtagatg aacaagattc 2520 ctccgaggag gatggtgaag attacgaaga tgcggaagat ggaaatacgc cattgagaag 2580 aagtcaacgg cagcggaagc ttccggaacg tcttgcggat tatgaagttt gtgtggcatt 2640 cgcgttaaat gctgaaaatt atgtggagga gttgccggaa accgttgaag agctgcgtaa 2700 gcgagaggat tggcccgagt gggatgcggc gatcaaggat gagctgaaat cgttggagaa 2760 aaatcgcacc tgggacctgg tggatcttcc cgctggtaaa cgcgctattt caagtaagtg 2820 ggtcttcaaa atcaaataca cggcaaatgg agatgttgat agatacaagg cacgcctcgt 2880 ggcaaaggga tgctcccagc gccaaggctt cgactatcaa gaaacgtacg cacccgtagt 2940 tcggattagc actgttcgca cgttgctagc agtagcggtc cagaagaacc tccacgtcca 3000 ccagatggat gttcgtaccg ctttcctgaa cggaagtctt tcggaaaccg tttatatgcg 3060 cctgcccaca ggatttgaga gggggaaaaa ggtttgtaag ctaaacaaat ccctttatgg 3120 cctgaaacag gcgccccgca gttggaatga aagattcaac aaattcatgg tgcatcttgg 3180 cttcaagcgg tcggaacacg acagttgttt gtacgtacgg aataaaaacg gaattgtctc 3240 gtatttgatt ctttacgtgg acgatatcat cttggcatca agctcattgg acgagctggt 3300 gaagataaaa cgttccttga cccaggaatt cgaaatggat gatatgaagg aactaaacca 3360 ttttcttggt ttgaaaatag aaagaaacat ggaagaaggg actctaatta tcaatcagtc 3420 ccaatacatc cgtgcgttac ttaagcgttt ttccatgcaa gactgcaagc ccgtgtccac 3480 cccattagaa gtgaacttga agctgacaac gaacgaatca gaaccggtaa cgaagcatcc 3540 gtatcgagaa ctagtagggt gcctcacata cctcatgtta tcgtcacggc ctgatattag 3600 ttcagctgta aacttcctta gccgtttcca aagcggagca actgacaccc actggacaca 3660 tttgaaacga gtattgcgct acctacaggg cacaaaggat gacagccttg tttatcgacg 3720 caacaggaac tcagagcccc ttgctggatt tgccgatgcc gattggggaa gcgacgtcaa 3780 cgatcgaaag tcgacatctg gaagtctgtt ccagaagcag agtacgttgc gctcagtcag 3840 gcggcatgcg atgctgtatg gctcagaaac gttttgaagg atctcggagt cgattgctgc 3900 tctcccacaa cattgtttga ggataatcaa tcctgtatcc atattgccag cgaaccttgt 3960 gaccagaaaa ggttaaaaca tttggatata cgctaccatt ttatccgtga atgcattcaa 4020 gccggtgaaa tacgagtgga gtacctaccg acgcagaagc aggttgcgga tatgtttacc 4080 aagagtcttc cgactaggag cttccagata caccgtctca cgcttggtct gagaggggg 4139 // ID Gypsy-23_DPu-I repbase; DNA; INV; 6088 BP. XX AC scaffold_35; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_DP_; KW Gypsy-23_DPu-LTR; Gypsy-23_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6088 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_35; Positions 692240 686153. XX CC Positions [4702-5181] - Integrase core CC 'CCGC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 785..2056 FT /product="Gypsy-23_DPu-I_3p" FT /translation="MPPKRTDSSVPFQIATRSSTRSAATPLTPARVVVPTT FT QVPSPSLRGRASRLTYNLPVSATAASRVPEMALPQDLIDALRNLATAMAEN FT RTAAQTQSTALLNALDQQRLQSVDLVQQLADNAAAAPAAAVARVTAAAVDS FT IPCFEGKLMDFPQDFIDFVDRVAVAEGWTDAQRIQVAARRLLKTALDWHIH FT IGHAHATWNDWSLAFTANFSPRLNVSEWLHLVEDRRQKPGESGIEYALDKH FT KVLRVAPIPINEEKMVAFFIDGLASWQHVAAMTANRPANVPQFIQRIRELE FT TLGVASRVVPPPAHGPMAPPWAPPVVPTVAPPAAPTRTPPVAPPSAPDLNA FT TLATFGNQLVNQLTAQLNKKTIGSRGTGGGGGGGDRGRRSGGDHGGGGWVD FT PSKRKCYSCDAIGHIARHCPTKSGKGPTGS" FT CDS 2071..3825 FT /product="Gypsy-23_DPu-I_1p" FT /translation="MLTGYAHLPCRPQLPLVTVFIENIGEVTGLVDSGASI FT SAIRFSVVGNVLDPKREKSFLNLTGVDDKKVMVDSFCSLKVKWENKVVELN FT DVAVVKNCPFALILGVDWVVKSKLNLIVEDGKIVLKSQDSNQQKVKKVRFA FT GIEEQNICSEEDDENDFFVSDELIDSLEAENKTKSCPGVIGTEVKVVESAV FT KISKKFTGNVIVRPNMCAHPGMEWIIPSCVVKVSAGKLKIPVLNMKMSSLV FT LRRKDFIAYVDTDFDSNMVVVGQEEQPENPVCSFVENAVESEEKLKILMDA FT RVGENLSEEERSAVFELLSKYLRCFPSADGELGFTNKAEHFIDTGDAQPIS FT CVPYRVSAMERRIIIEKVADMLKQGIIRPSFSPWAAPVVLVKKKSGDFRFC FT IDFRRLNAVTKRDVYPLPRLDDVFDRLAGAKYFSSLDLMSGYWQAPVASAD FT TCKTAFVTPDGLYEFVRLPFGLNNAPSTFQRLMNRVLARLKWQMCLVYLDD FT VLVFGRTFDDHQKRLECVLMALVEAGLTLNVSKCIFAINRIFHLGHTIDEY FT GIRPDSEKISALVNFKILLKKNSVWSWTEAQESAKCVD" FT CDS 4318..5757 FT /product="Gypsy-23_DPu-I_2p" FT /translation="MNNTELAFQQQLDGQLRPIITCLNSKVPGKIAEQFKI FT HGKILYRINPTQGRKFLLCVPSILRRKIIEFSHDDSSSSHMGIDKTIARVS FT ERYWWPKFRSSVGKYVMSCNYCQFHKCIPGLPAGQLQPIPPPDRPFHTVGM FT DHLGPFKATSEGKKHIIMAIDYLTKYVEAAAVADTSTALVADFVRDQINFR FT HGGTTRIISDQGTVFSSHLMEEKVNEWKTQHVFATAKHPQTSGLVERVNRT FT MTLALAAYVNTDHDDWDRHLPAAIFAINTARQSTTEISPFQLVYGRLPFTA FT LENEFPWPEERPESFDVFLSRVKELREVARIKIVKKQEKVKRLVDLRRRVV FT KDLCPGELVLVRRKLKKKGKTKKLLPKYVGPYQVVKKVCPTTYLVEDLPAR FT RKKKKFRRFNAHVVQIRKFHSREDPEWEDWPDEPDDRMPDEQDDSFVQQAA FT EEDDAVTHHPLVQQSIRRMKWWRHFLQRKRELEGQ" XX SQ Sequence 6088 BP; 1611 A; 1260 C; 1467 G; 1750 T; 0 other; tttggtgtca gaagtcgggc acgggccatt gaatagttct catttttttt ggtagaattt 60 tttttgtgag actgcccgtg cgatattcac gtagtttttt cttttttctt ttttcttttt 120 tttttgtaaa gcccacttaa gttcttttca ttgataagct tagttgatta ttttattctt 180 tgagtgtgat tgattttaga gttgattgtt aattgttttt ttcttggttt tgtctgccat 240 tttcgacgtg aatttgacga tcggcagccg taataatttt tttttgtgtg tcacgtcacc 300 aacccccctt ccctccaatt tttttcacat tttttttctt ttgtcgaaaa cgcccccacc 360 cccccccctt acttttcgcg cgtaaaacca cgtggcccgt cgaggcgttg gaagaaatca 420 tttatttttt tttgtgtgtg tgtgtgtgtg gctttggaaa tcttcgtgat agccgccatt 480 ttcttttttg tcgtttgatt atgctcttgc tccagcagtg agtgaattat tgtttctttt 540 ttttttgttg ttttgctctg gtattgagca gaatcattgg tgtttcattt tgctctggta 600 ttgagcatta ttattaattt ttttttctct tctctctctc gtcgatcagt tgtcgtgttt 660 cgtaacgacg accgccaccc gtatccgtat ttcgttcatt tatttatttt tttttagtat 720 atttaatcaa catttggtag tgtgttgact gttgagtcat agcgatacgt cgtgtgtaaa 780 ttatatgcca cctaagcgta cagattcttc agtgccattt caaatagcca cgcgttcaag 840 taccaggtct gcagccacgc cattgacacc tgcacgagta gtagtgccga ccactcaagt 900 accgtcaccg tccctgcgtg gtcgtgcgag taggctcacg tataatttgc cagtgtcagc 960 aactgcagct tcaagggtac cggaaatggc gctaccacag gatctcattg atgctctgag 1020 gaatctcgcg acggcaatgg ccgaaaatcg gacggctgcc caaactcagt cgacagctct 1080 tttgaacgcg ttagaccagc agcgtctaca atcagtggat ttggtgcaac aactcgcaga 1140 caacgccgcg gctgccccag cagcagcagt agcccgtgta acagccgcag ctgttgactc 1200 cattccatgc tttgaaggta aactgatgga ttttccgcaa gatttcattg attttgttga 1260 tagagtagcc gtggccgagg gctggacaga tgcgcagcga attcaagtag cagcaaggcg 1320 tctgctcaaa acagcgttag attggcacat ccacataggc cacgcccacg caacttggaa 1380 tgattggtcg ctggcattca cggcgaattt ttcacctcga ttgaacgtga gtgaatggct 1440 acatctagtg gaagataggc gacagaaacc cggcgagtct ggtatagagt acgctttgga 1500 caagcataag gtattgcgtg ttgcccctat cccaattaat gaagagaaga tggttgcatt 1560 ttttatagat ggcctcgcaa gctggcaaca tgtggccgcg atgacagcaa accgtccagc 1620 caacgtcccg cagtttattc agaggataag agaactagaa actcttggcg ttgcgtcccg 1680 cgtcgtgcct ccaccagcac atggaccaat ggcgccacct tgggctccac cagtggttcc 1740 aacagtagct ccgccagcgg ctccgacaag aaccccacca gtggcgccac ctagcgctcc 1800 tgatttgaac gccacactag caacatttgg caaccagctg gttaaccaac tgactgcaca 1860 gctgaacaag aagacgattg ggagtcgtgg caccggtggc ggaggaggcg gaggtgatcg 1920 tggtagaaga agcggaggtg atcacggagg aggaggatgg gttgacccaa gcaagcggaa 1980 atgctacagc tgtgacgcga ttggacacat cgcccgtcac tgcccaacaa agtcgggaaa 2040 aggaccaacc ggaagttagg agcaaggcct atgctcaccg gttatgcaca ccttccttgt 2100 cgtccccagc ttccactggt tacggtattc atagaaaata ttggtgaagt gactggcctg 2160 gtagattcag gcgctagtat ctctgctata agatttagtg tagtagggaa tgttttggat 2220 cctaagcgtg aaaaatcttt tttgaatcta actggggtgg atgataaaaa agttatggtt 2280 gattcttttt gttctttaaa agtaaaatgg gaaaataaag tggttgagtt aaatgatgta 2340 gccgtagtta aaaattgtcc atttgcattg attcttgggg tagattgggt agtgaaaagt 2400 aaattaaatt tgattgtaga agatggtaaa attgttttaa aatctcagga ttcaaaccaa 2460 caaaaagtta agaaagttcg ttttgctggc atcgaagagc aaaatatttg tagtgaagag 2520 gacgatgaga acgatttttt tgtgtctgac gagctgattg attctttaga ggcagagaat 2580 aaaacaaaga gttgtcctgg tgtgataggc acagaagtaa aagttgtgga atcagctgtt 2640 aaaatttcta agaagttcac tggtaatgtt atcgtcagac caaatatgtg tgctcatcct 2700 ggaatggaat ggattattcc atcctgtgtt gtaaaagtgt cagcaggaaa acttaaaatc 2760 cccgtattaa atatgaaaat gtcatctctt gtgttacgcc gtaaagattt tatagcgtat 2820 gtagatacag atttcgatag caacatggtc gtcgtcggac aagaagagca gccagaaaat 2880 cccgtctgct ccttcgtcga aaatgctgtt gaatccgagg agaagctgaa gatcctgatg 2940 gacgcccgcg taggcgaaaa tttgtcggaa gaagagagga gtgccgtttt cgagctcttg 3000 agcaaatatc ttcggtgttt cccctcagca gatggcgaac ttggatttac aaacaaggcg 3060 gagcatttca tcgacactgg cgacgctcaa ccaatcagct gcgtcccgta tcgcgtgtca 3120 gctatggaac ggaggatcat aattgaaaaa gtcgccgaca tgcttaaaca aggtatcatt 3180 cgtccatcat ttagtccgtg ggcagcaccg gtagtgctgg ttaaaaagaa atcgggtgac 3240 tttaggttct gcatcgactt tagacgttta aacgcggtaa caaaaagaga cgtttaccct 3300 ttaccccgat tggatgacgt ttttgatcgt cttgctggtg cgaaatattt ttccagttta 3360 gacttaatga gtggctattg gcaggcaccc gttgcctccg ctgatacgtg taaaacagcg 3420 tttgtcactc cagatggatt gtatgagttt gttcgtttgc cgtttggact gaataatgca 3480 ccgtccactt ttcaacgttt gatgaatcga gtgttagctc gcctcaaatg gcaaatgtgt 3540 ctcgtttatt tagacgatgt gttagttttt ggaagaacgt tcgacgatca tcagaaaaga 3600 cttgagtgtg ttctaatggc tttggtggaa gccggattaa ctttaaacgt gtctaaatgt 3660 atttttgcaa tcaacagaat ttttcattta ggtcacacca tcgatgagta cggaatccga 3720 ccagactctg aaaaaattag tgctctagtt aactttaaaa ttctcctgaa aaagaattcc 3780 gtatggagct ggacggaagc ccaagagtct gcaaaatgcg ttgattagcc gcttggtgtc 3840 gtcccctgta ctggcgcact ttgaccaaaa catcgataaa gttgtacaaa cagacgccag 3900 cctggtgggc cttggggccg ttctaatgca agatgccggt gatggaccac gtccagtcgc 3960 gttcatcagc cgaaaactta ccgacgctga agcaagtatc acgctaatga gctagagtgt 4020 ttggcaattg catgggcatt aaaaaaatta cgttcgtatg tgtatggtag acgattttct 4080 gtctgtacag atagctcagc ggttcgatgg ctatggtcta agaaagaggt tactggcaag 4140 ttcgccagat ggattttggc tctgcaagag tacgattttg aaattcgcca cataaaagga 4200 gttaataatt tggtggctga tgccctatcg cgaaaccctg atgattcctg tattggaacc 4260 agtggctccg cgatcggaca tgtagtttgt gtacttgaca gtagatggcc tgtgggcatg 4320 aataatacag aattggcatt ccaacagcag ctggatggcc aattacgtcc cattatcacc 4380 tgtcttaatt caaaagtacc gggtaaaatt gcagaacagt ttaaaattca cgggaaaatt 4440 ttgtatagga taaatcccac ccaagggcgt aaatttttgc tttgtgttcc gtcaatttta 4500 aggagaaaga taatagagtt ttcgcatgat gactcctcct ctagtcatat gggaatagac 4560 aaaacaattg caagagtgtc tgaacgttat tggtggccga agtttcggtc aagtgtcggt 4620 aaatatgtta tgtcttgtaa ttattgccaa tttcataaat gtatccccgg attacccgct 4680 ggtcaactcc agcccatacc accgccagat cgaccatttc acaccgtcgg tatggatcat 4740 ctcgggccat ttaaggcaac gtcagaaggc aagaaacaca ttattatggc tatcgactac 4800 ctaacgaaat atgtagaagc agccgcagtg gccgacacgt cgacagcatt ggttgcggac 4860 tttgtgagag accaaatcaa ctttcgccat ggtggaacaa cgcgaataat cagtgatcaa 4920 ggcactgtct tctcctccca cctgatggaa gagaaagtca acgaatggaa gacccagcac 4980 gtttttgcaa ccgcaaagca tccacaaaca tctggactcg ttgagcgagt caaccgaacg 5040 atgacccttg cgctagctgc ctatgtcaat actgaccatg atgactggga tcgccatttg 5100 ccagcagcaa tcttcgccat taatacggca aggcaaagta caaccgagat atcgccgttc 5160 cagttggtgt atggccgctt gcccttcact gccctagaga acgagtttcc gtggccagag 5220 gaacgaccag aatcattcga cgtctttctg tctcgagtca aggagctgag agaggtggcc 5280 cgaataaaga tagtgaaaaa acaagagaaa gtgaagcgcc tggtggatct tcgacgcaga 5340 gtagtgaagg atctctgccc aggagagttg gtgcttgttc gtaggaagtt gaagaagaaa 5400 ggcaaaacta agaagttatt accaaagtac gttggtcctt atcaagtagt taaaaaggtg 5460 tgcccgacta catatttggt tgaagacctg ccggcccggc gaaaaaagaa gaagtttcgt 5520 cgttttaacg cgcatgtcgt ccaaatacgt aaatttcact cgagagagga tcctgaatgg 5580 gaagattggc cagacgaacc ggatgatcgg atgccggacg agcaagatga cagctttgtc 5640 cagcaagcag ctgaagaaga tgatgcggtg acacatcacc cgctagttca acaatcgatc 5700 cgccggatga agtggtggcg ccacttcctc caacgaaaac gagagctgga aggacaatag 5760 ttggttgaaa aattttgtaa aataacaatg aagcatttca tttttgtttt cttggctttg 5820 ccattattgt ttttattctg ttttgtttgt tagcctaaca gctcttgccc gtatatcccc 5880 cccatcagtc tccataagtt gagttctttt atatatattt ttttttgggt gaggtagaag 5940 caggggaatg gaaaggttag acttaaggtc cccaattgac tatttatttg tctgtttcga 6000 tttgacaagt tccgtgtgtt ttttgtaaat agtttttttt gttatcctat ttgaatgttt 6060 gtgtcaaatc gagtcaggaa gggccgaa 6088 // ID Copia-2_Cfl-I repbase; DNA; INV; 4260 BP. XX AC AEAB01006442; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_Cfl_; KW Copia-2_Cfl-LTR; Copia-2_Cfl-I. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-4260 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01006442; Positions 4700 441. XX CC Positions [1514-2041] - Integrase core CC 'ATATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 128..2179 FT /product="Copia-2_Cfl-I_2p" FT /translation="MAQVARFESLNKENYDTWKMFMEALLVKNDLWQYVSG FT TSVKPEVIAGNAASENMARTWEQNDAKARSDIVLSISSTELKQIKGCVTAR FT EVWLKLKDTYQSKGPARKAALLRQLTTLKMSGNSDVRAHLNQFFDIVDKIN FT EIGVEIDADLLSTLLLLSLSNEFENFRCAIEARDMLPTLETLRIKITEEAD FT ARKGIADSHLSNAMYAKKQYKKQQKRSKDVNENSSDSTFKYKCHKCKIVGH FT KASDCKAQKKDSQTAQNATDTTMLTPEAFLAGTTETSKWCLDSEATSHFCY FT ETCKFGVNMNAKREKLNLANKNSTQITGEGTARLNTNIHGELRSIHLENTQ FT LVPDLRMNLLSVAKITDHGYDVLFNKHRELVVDQDGNVKLIANKTNGLYIV FT EESVNEIAVAEEISTGSTTQLKADWHRRFGHLNVKDLQAAIRKGRVEGIEP FT GNFINSNCDICPEGKMARNPFPIKSERSTKILDLVHTDICGPMKVTSIGGA FT KYILQFIDDSSRWGQVYFLKSKSDVFQALQNFVITMENQTGRKIKIIQSDN FT GKKFVNATIDEFLKQRGIIRRLTVPYCPQQNGVAERRNRTLVEMARCLLLQ FT SGLPPSFWAEAVNTANYIRNRCPSKLLDGRTPFEVLSEKIPDVSHFREFGQ FT RVFVLCNKPGIGKLDSRGVTGIFVGYSDSSKGYRI" XX SQ Sequence 4260 BP; 1425 A; 936 C; 975 G; 924 T; 0 other; ataggttatg ggcccagatc ttaccgccgt aaagcagtca tagttcgtgt cgttcttttt 60 tacgcgacgt cattacgagt gaaataaata atcgtgaaaa acgtcgcaat acgatcggaa 120 agcagaaatg gctcaagtgg ctagattcga gtccctgaac aaggaaaact acgatacgtg 180 gaaaatgttc atggaagcat tgcttgtgaa aaacgatttg tggcaatacg taagcggaac 240 gagcgtaaaa cccgaagtga tagccggtaa cgcggcctca gaaaacatgg cacgaacgtg 300 ggaacaaaac gacgcgaagg cacgatcgga cattgtattg tcaattagct caaccgagct 360 gaagcaaatc aaaggatgtg tcacagcacg tgaagtgtgg ctgaagctga aagatacgta 420 tcagtcgaag ggaccggccc gaaaagcagc tctccttaga cagttaacta cgctaaaaat 480 gtcaggtaac agcgatgttc gcgctcatct taatcagttt ttcgacattg tagataaaat 540 taacgaaata ggagtcgaaa tcgacgcaga tctactctcg acgctactgc tgctgagttt 600 gtcaaatgaa ttcgaaaact ttcgctgcgc gatcgaggca cgagacatgt tgccaaccct 660 cgagacactc cgtataaaga tcacggagga agctgatgca agaaaaggca ttgccgatag 720 ccatttatca aacgcaatgt acgcgaagaa gcaatacaag aaacagcaga agaggtccaa 780 ggacgtaaat gaaaactcta gcgacagcac atttaagtat aagtgtcata aatgcaagat 840 agtcgggcat aaagcgtccg attgtaaagc tcaaaagaaa gattcacaaa ccgcacagaa 900 cgcaactgac acaacaatgc tcacacccga agcctttttg gccggaacga cggagacgag 960 taaatggtgc ttggacagcg aagcaacatc gcatttttgt tacgagacat gtaaatttgg 1020 cgtgaacatg aacgctaaac gtgagaaact gaatctcgca aacaaaaact cgacccagat 1080 cacaggagaa ggaacagctc gactgaatac taatattcat ggagaattga gatcaattca 1140 tctcgaaaat acgcagctag ttccagatct acggatgaat cttctctcag tcgctaaaat 1200 caccgaccat ggatacgacg tcctgttcaa caaacataga gagctcgtag tggatcaaga 1260 cggtaacgta aagttgatcg caaataagac aaatggtctg tacattgtcg aagaatcagt 1320 caacgagatt gccgtcgcgg aagaaattag caccggaagc acaactcagt taaaagcaga 1380 ctggcaccgc agattcggtc acttaaatgt aaaggatctt caagcagcca tacgcaaggg 1440 tcgagtagag ggaatagaac caggaaactt catcaacagt aactgcgata tttgtccgga 1500 aggaaagatg gcaaggaatc cttttccgat aaaatccgag agatctacaa aaatactaga 1560 tcttgtgcac actgacatat gcggtccaat gaaggtcact tccattggtg gagctaagta 1620 tattttgcaa tttattgacg atagctctag atggggacag gtctactttt taaaatccaa 1680 atctgatgtt tttcaggcgc tgcagaactt cgttattacg atggaaaatc aaactggtag 1740 aaagataaaa atcatccaat ccgacaatgg aaaaaaattt gtgaacgcca ctattgacga 1800 gttcctgaaa cagcgcggaa tcatcagaag attaaccgta ccatactgcc ctcaacagaa 1860 cggtgtggcc gagaggagaa atcgtactct ggtagaaatg gctaggtgtc tcttgctgca 1920 gtccggccta cctccatcct tttgggcaga agctgtaaat accgcaaact atatacgcaa 1980 tcgatgccca tcgaaattgt tagacggaag aactccattt gaagtgctgt cagagaaaat 2040 tccagacgtc agtcatttca gggaattcgg acaacgggta ttcgtcctgt gtaacaaacc 2100 tggcatcgga aaactggact cacgcggagt cactggaatc ttcgtcgggt actcagactc 2160 ttcaaaaggc tacagaatat gaatcccaga taaaatgaaa atcattgtgt caagagacgt 2220 aaagttcttg cataccgaat ctcccaagga aaacacatac gaagatttct accccggatc 2280 agttgaaaac agcgaagaag tcgatataaa taaaagtcac gatcacgaca tgatcgacgt 2340 aatattggag ccatccgaga actgtcaaaa cgaaaccgtc gaagagaata tagccgaggt 2400 acctgcagaa cgcatcgaac cagtcaatgg agatgcagta gaagaacccc aagagaatca 2460 cgatgatcta cctcacgacg cacaggaagt gatacgtcga gcaccaggca gacctaaaat 2520 tgtgaggact ggagaaaggg gaagaccttg taaagaattc cattatcatc atgccgaaat 2580 ttcagaaaca caatcagcac atctgtcaga aattccaatg agagaagcga tgtctggttt 2640 agaagccgac gaatggcgac tagcgatggt caaagaaata agatcgatca tcaaaaacga 2700 tacttggaaa ttagttgatc gagtcgacga tcgactaacc atcgaaccat catcggtacc 2760 cggatggtac tcaaaaacaa gctcagtccc gacggatcca ttcagcaacg aaaggctcga 2820 ttagtagccc aaggatttgc acagaaacca ggcgtacact tcaacgaaac attcgctccg 2880 gtagcacgga tgggatcgat tagactgatg atcagctcag ctgcacgctt ttgcatgaac 2940 attcatcagt tagatgtcac cacggctttc ctcaacggat atctggaaga agacattcta 3000 gtgaacccac cgaaagaact tatcgaactt ctacaaatcc tcgcagactc ggacaaggac 3060 gatgtcatcc gaaagaaggc aaaccaaatg ctccatgaat taaacactgg caataaagtc 3120 tgtaagttaa aaaaagcgtt atacggacta cgacaagccg gacgaagctg gtatcaaagg 3180 ctcgacaaag tcttaaagga gtgcggagca aacgctacaa atgccgaccc atgtctgtat 3240 cacttgggac aaggagagaa catcgtactg atagctgtat acgttgatga tatactcatt 3300 gcatcacgta acaagaaaga aattgatcgg atctcttatt ttctttcgca acaattcaag 3360 ataaagaacc ttggagaagt caatcggtgt cttggaattg aattcaaccg cgaaggagaa 3420 aagattactc taactcaaaa aggctatatc tgcgaactgc tcaggcgctt cggaatgact 3480 gactgcaacc cagtcagcac tccgttcgac tcgaatgtaa agctaaaaaa gggagcagaa 3540 cctacaccag acgatcaagt tttgccctac cgtgaactcg tagggagtct cacgtatctt 3600 gcatcttcta ctaggcagac atagccttct cggcaagcta tctcgggcaa ttcaacaatt 3660 gcttcgacga aacacactgg aaggccgcta aaagagtttt aagatatctg aagggaacca 3720 tgggagcagg actcgtctac gggccagact caaatccact catcggttat acagactcgg 3780 attggggaaa ctatcacgtg gaccgacgtt cacattccgg attcctgttc gtactgagcg 3840 gatgccctat cacttgggat gcgaagaacc aaaaactgtg gcactgtcat cgtgtgaagc 3900 tgagtacatg gcactcaccg aatgtaccaa agaagccata ttcttgcagc gtttcctgaa 3960 ggaattaggg ttcagcgatc taagtaacgt cacaatcttc ggagataatc tgggcgcgat 4020 caaactagcg gaaaatcccg ttttttacca gcgaagcaag catatcgacg tcaagtacca 4080 ctatgttcga gacgcacttc gcaacgaaaa tctaaacatc aagcatgtct caaccataga 4140 tatggtggca gacatactta cgaagggatt accgaaaaga aaacatttgg agtgcctcaa 4200 gaaagctgga atgcatctat cacactgcga gtagatcttc gagcgcgaat cgagggggag 4260 // ID BEL-228_AA-LTR repbase; DNA; INV; 197 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-228_AA_; KW BEL-228_AA-I; BEL-228_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-197 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 912-912 (2011). XX DR [1] (Consensus) XX SQ Sequence 197 BP; 76 A; 37 C; 35 G; 45 T; 4 other; tgaatgtcta ataatgattt gaatatttag twtcccaaac cggtgccaag twtgcgaccc 60 gaagagacwa cgactcagac tgactagmta ggagactaaa cgtaagtgaa ctaagactat 120 tacaaacact ggcatgcatt gacctaagag acacataaga cataaattat actaaaaatt 180 acgacgctat gtaaaca 197 // ID L2-8_NVi repbase; DNA; INV; 5587 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-8_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5587 RA Bao W. and Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(4), 758-758 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(837..1151,1121..2587,2591..5530) FT /product="L2-8_NVi_1p" FT /translation="MPRASCEKCRNVTRAGAIIRHCKICSATFHNVCMTWH FT VTDKGNYICQACKIKKKDNTIKTQRKSAPRLELEKDKDKGDYNIKNKKTDV FT NITDVNNTLAKMLIILMYTGENVNNSNVNKTDISITNDTNNNVSLREEFSA FT STPAVSTSKASDGKRAQINSPVDMGALRYCTLPSSMLTLFSNQFELIERSV FT DDLKTLLVRRLDAISQRSDPAPAALEMIDRLVERVSSLEEKIEKQVSKLED FT LQNDRLLLSQENKALRESLDDAMRVMHRVETRINADDGGSCTSRNLSRNLS FT LSLSNSSNKISNDNDGDSSSADICIAEASRVFVSLRDCESVDASGGAGAST FT SADTDGFRVSKGRRHTRARKRRGEGREHSAGVRSSPTDGMSSGESTAEIII FT SSISDCSEQERGDVAHAVLATVLPSVARDDIVSARLLRPNTRVATVGGRPS FT PWVVRLSSHLIVKNIMRAKHKITGLNTRHINVSHLSQETSNSLVQSKIFIN FT ELLSKESFLQFKSLKSVASGLGFKYVWHRGGRFLAKMRDGEMSHHFQTAXD FT LQAIAASYSCTTSNDNLVNNVTAVHNGNAITGTAPCTGSLDKDSKKNLGGV FT KVRGAECDGLRAGFINATSLNRHMEMFRQFLATARPFYHLFGVAESRFGPS FT VDDVFAQVKGYSVLRQDRNTQGGGVALYIHNSYKATILCSSPTTTDGRKPG FT IPEYLMCXIQKGRMPPIFVAVIYRPPGISFTTNSDLVDKLKLYAEDYNHRI FT IMGDLNANMLSTSXDANFVKDLACELNLKLVEHGATHHVGESHTWIDVIYT FT DDNEVVLNANNMMATFPSRHNIIDFTIRTPKSESPTLTPFFYRDFKSIRCE FT ELLSLLEACDWSTISCPGMTTDSKLEALSGNIMTVIEKLAPLKKFKPKNKK FT SSPWVNAELQDLYDKRDAVERRFKRTRDSRFWSEFQSLAAQAEQRTIEARE FT AFIQARISEALNNNKNIWIELRNLGLLSTTKEELHGFSPGELNAHFAGVSV FT SESESEVNLDEIVATASEAGFTFREITFSDVVLAVAHFSSQAKGEDGIPQD FT VIAKSLPVLGHHLASIFNSSLSSGVFPGAWKKACLVPLKKTAIPSAASDFR FT PIALLCFLAKVLEKIVHDQISEYLESEKLLDTRQTGFRRYHSTQTALLSLS FT EDIRAGINSKKQLLTILLMFDFSKAFDTISPSKLLRKLIGMGFSRSVVLWI FT KSYITGRNHQVVTKTVGNSDWLTTNLGVPQGSVLGPLLFSLYINDLRDILA FT SFKGPKGELSDSVEHLLYADDLQTYTQVARDDLSGGVDRLSAVARAVAVWA FT SENALRLNVGKTKAIIFGSDSNINKVQSLKLPGIEIEDGVFVPFADTVTNL FT GVVMDSKMTWKPQVDAISRKVNRVLYGLRSFRSCTTEELRKQLAGALATPH FT LDYCSLVYLDVSGQQSTRLQRLQNSCVRYICGARRDEHISPYRRKIGWVPV FT KARRELFAAVMLYKAXVMGQPPYLAALFDRNQSRTSARDPQELVVPSARTD FT AGLYSFKAQGARLWNSLPRDVRDLPSLKSFKAVMRRRIYVREIECTVDQNT FT DSATVS*" XX SQ Sequence 5587 BP; 1472 A; 1249 C; 1341 G; 1515 T; 10 other; cgtgtcgtct gctattggta agtcaagaag cagtagtgag ttgctctgct atctttgtac 60 ttttcctaat attttcttca ctcctcggag tgttttctta cagcggaaag gaccccccaa 120 acggctctta tgctcgcgct aaaaaaatca caatctctgt gctttttatg agcctggagg 180 gaattttccg ctaactgtcg acgacataac ctacaatttc tgccgcctct gtcaccgagt 240 ctgggacaac tgcagcgatc tactctcgtc gctctgaggg gatttctggc cagcggcgtt 300 ctcagcgcag caacatcttg cttctatgcg cgtcttgcgt attttcgctg catcttctga 360 ggtcagatct gcacacacac acacacacac acacatatat atatatatat atatatatat 420 atatatatat atatatatat atatatatac atatatatct cgctcaataa gtgcgaaatc 480 atcgccgcga tcgaggatcg atattctgcc gcaagagtgc aacactatcg caactcatat 540 tactcgagca acagttggca gcactgactc ggcccacgcg catgtgcaat ctggtagctc 600 tcaacgcgcc agatttgaat ttgaattttt gtagagcggg aaattcaaat gttcgttcgt 660 aaccagtacc tagttgtctc tagtcaccat tcatttatgt aattatttgt ttatttattt 720 attgcttgac gaaactctgt tgcctatttt gttttactct ctgacactta tttatttatt 780 tatttattga cctatttatc tatttattta tttattattg taaatattta ttaaaaatgc 840 ctcgcgcctc gtgtgaaaag tgtaggaatg tcacacgcgc aggcgctata attcgtcact 900 gcaaaatctg ttctgcaaca tttcacaatg tttgcatgac ttggcatgtc acagacaaag 960 gcaactatat atgtcaagct tgtaaaatta aaaagaaaga caacacgata aaaacgcaac 1020 gaaaatctgc gcctagacta gaattagaaa aggataaaga taaaggagat tataacatta 1080 aaaataaaaa gactgatgta aatattactg atgtaaataa tacactggcg aaaatgttaa 1140 taattctaat gtgaacaaga ctgacatctc gattacgaat gacacaaaca acaatgtttc 1200 cctgagggaa gaattttctg cctcgacgcc ggccgtctct actagtaaag ccagtgatgg 1260 gaagcgcgcc cagatcaact ctccggtgga tatgggtgcg cttcgctatt gcacactgcc 1320 atcatccatg cttactctct ttagtaacca gtttgagctc attgagagat cagttgatga 1380 tctcaagact ctgcttgtga ggaggcttga tgccatcagc cagagatcag atcctgcccc 1440 agcagcactg gagatgattg acaggctggt ggagagggtt tcatcactcg aggagaaaat 1500 tgaaaagcag gtgtctaagc tggaagacct gcaaaatgat cgccttctgc tctctcaaga 1560 aaataaagcc cttcgggaga gtcttgatga cgctatgcga gtgatgcata gagtagagac 1620 tcgcataaat gctgatgatg gcggttcttg tactagtcgt aaccttagtc gtaatcttag 1680 tcttagtctt agtaatagta gtaacaaaat tagtaatgat aatgatggag acagtagcag 1740 cgcggatata tgtatcgcgg aggcgtctcg agtcttcgtc tcgctcaggg attgtgaatc 1800 tgttgacgcg agtggtggcg cgggcgcctc cacgtcggct gatacagatg gctttagggt 1860 cagtaaaggt cgacgtcaca cccgggctcg taagcgtcga ggggagggtc gggaacactc 1920 tgctggcgtg cgttcctctc cgacggacgg gatgagcagt ggagagtcta ctgcagagat 1980 aatcatctct agtatttctg attgttcgga acaagaacgt ggggacgtgg cccacgcagt 2040 cctcgctact gttctgccgt ccgtcgctag ggatgacatt gtgtctgcga ggttactccg 2100 tccaaatacg cgagtggcaa cggtcggcgg gcgcccatcg ccatgggttg tccgtctatc 2160 gagccacttg atcgtaaaaa atattatgcg agcaaaacac aagataaccg gcttaaacac 2220 gcgccatatc aacgtctcgc atttatccca ggaaacgagt aatagcctgg ttcaaagtaa 2280 aatcttcata aatgagttgt taagtaaaga atcgtttttg cagtttaaaa gtcttaaaag 2340 tgtagctagc ggtcttggtt tcaagtacgt ctggcataga ggcggtcgtt tcttggcaaa 2400 aatgcgggac ggggagatgt cacatcattt tcaaacggca rctgacttgc aagccatcgc 2460 tgcatcatat tcttgtacga cgagtaatga caatcttgta aataatgtga cagcggtaca 2520 taacggtaat gcgatcactg ggactgcgcc gtgcacagga tcgctagaca aagacagtaa 2580 gaaaaactga ctaggcggag tgaaggtgag aggagcagag tgtgatgggt tgagggcggg 2640 atttatcaac gctacctcgc tgaacaggca catggagatg tttcgtcaat tcctggcgac 2700 tgctcgtccc ttctaccatc tctttggcgt ggctgagtcg cgtttcgggc cgagcgtcga 2760 cgatgtattt gctcaggtaa agggctactc tgtccttcgg caggacagga atacccaggg 2820 aggaggtgtg gcactataca tccacaatag ttacaaggcg accatactgt gctcatctcc 2880 aacaacgacg gatggtagaa agcccgggat tccagagtac cttatgtgta raattcaaaa 2940 gggccgcatg ccccccattt ttgtcgccgt tatmtacaga ccaccgggta tttctttcac 3000 gaccaactct gacctggtkg ataaattaaa actatatgcg gaagactata atcatcgtat 3060 aatcatgggg gatctaaacg caaatatgtt gtcgacgtct cakgatgcga attttgtcaa 3120 ggatttggcc tgtgagttga atctcaaatt agttgaacat ggcgcgacgc accacgtcgg 3180 ggagtctcat acgtggattg acgtaatcta tactgacgac aatgaagtag tgctgaatgc 3240 gaacaacatg atggcgacct ttccgagtag gcataacatc attgatttta cgattcggac 3300 tccaaagtcg gaatctccta ctctaactcc attcttttat agagatttta aatccatcag 3360 gtgcgaggag cttctttccc ttctcgaagc ctgcgattgg tcgaccatca gttgcccggg 3420 catgaccaca gacagtaaat tagaagcgct tagtggtaac atcatgactg tcattgaaaa 3480 acttgcccct ttaaagaaat ttaaacccaa aaataaaaaa tcttcaccgt gggtcaacgc 3540 wgaacttcaa gacctttatg ataagcgtga tgcggtcgaa cgaagattca aaaggacccg 3600 tgactcgagg ttctggtcgg aattccagtc tcttgcggct caagcagaac agcgcacaat 3660 cgaggcgcga gaggctttta ttcaggccag aatttcggag gccytgaaca acaacaaaaa 3720 catctggatt gagcttcgta atcttggact attgtccacg actaaggagg aactgcacgg 3780 tttctcccca ggcgagctca atgcccactt tgccggggtc tccgtatcag aatcagagag 3840 cgaggtgaac ttggatgaga ttgtggcgac ggccagtgaa gctggtttca ctttccgtga 3900 aatcactttt tcggacgtgg tcctggccgt tgcgcatttc tcatcacagg caaaagggga 3960 ggatggcatt cctcaggatg tcattgctaa atcccttccg gtcctgggcc accaccttgc 4020 atctattttt aactcctctc tgtctagtgg cgtcttccct ggagcctgga aaaaggcctg 4080 tctagtgccc ctcaagaaaa cggcwattcc atcagccgct tcagactttc gtccgatcgc 4140 tctgttatgt tttcttgcca aggttctaga gaagattgtt cacgaccaga tatccgagta 4200 cttagagtcg gagaagcttc ttgatacacg tcagacgggt tttcggcgtt atcacagcac 4260 gcaaacagca ctactgagtc tgtcagagga cattagggcg ggtattaata gcaaaaaaca 4320 actgcttact atcctcctga tgtttgattt cagtaaggcg ttcgacacga tctcaccctc 4380 taaacttctt cgaaagctaa taggaatggg tttctctagg tctgtagttc tgtggatcaa 4440 gtcatatatt acagggcgta accatcaagt ggttacgaaa acagttggca attctgattg 4500 gctcactacc aatcttggcg ttccacaggg ctcggtcctg ggtcctcttc tctttagtct 4560 ttacatcaac gatctcagag acattttggc ttcttttaar ggccctaagg gcgaattatc 4620 ggatagtgtc gagcatttgc tatatgctga tgaccttcaa acctatacgc aggtggcgag 4680 ggacgacttg agcgggggtg tggaccgtct gtcggctgtg gcgcgtgcag tggcggtatg 4740 ggcgtccgag aacgcgcttc gcctcaatgt cgggaagact aaagctatta tttttgggtc 4800 ggacagtaat attaacaagg tgcaaagtct aaagctgcct ggcattgaga tcgaggatgg 4860 cgtcttcgta ccctttgccg acactgtaac caaccttggt gttgtaatgg attcgaagat 4920 gacatggaaa ccgcaggtgg atgcgattag ccgaaaggtt aatagagtcc tttatggact 4980 tagatccttt agatcctgca ccaccgagga actgcgtaag cagctggcgg gcgctcttgc 5040 caccccgcac ttggattact gctctcttgt ctaccttgac gtatcaggtc aacaaagcac 5100 acgacttcaa cgattgcaga actcatgtgt gagatacata tgtggtgcta gaagggacga 5160 gcacatctcc ccctatagga ggaagatagg ctgggttcca gtgaaggcaa gaagggagtt 5220 atttgcggcg gtgatgcttt ataaagcart tgtaatggga cagccgccgt atcttgctgc 5280 cctttttgat aggaaccagt ctagaacctc agctagggat ccgcaagaac tagtggtgcc 5340 tagcgcgcgt accgacgcgg ggctatattc ctttaaagcc cagggtgcgc gcctctggaa 5400 ttctctcccg cgcgacgtga gagatctccc gtcgcttaaa tcttttaagg ctgtgatgcg 5460 gaggcgcata tacgtacgcg agattgagtg tactgtcgat cagaacactg attctgcgac 5520 tgtttcgtga ctccgttaga tattgtaccg tccaaatatt tattctattt atttatttat 5580 attatta 5587 // ID DNA8-10_CQ repbase; DNA; INV; 726 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-10_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-726 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 87-87 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% identity. CC 8-bp TSD. 15-bp TIRs. XX SQ Sequence 726 BP; 232 A; 148 C; 136 G; 210 T; 0 other; ctgggtgcag aatagttgtc cgcaaagcgg cctcaattta gaactgtcaa agcggaacca 60 atttactgtt cgcaaaatga taccaagaag tggcaagtat tgtgatcgat ctataattta 120 ttttgtcagg aataaaaaaa agtcgatgcg tcgagctatt gaggtattcg aagtcaaaac 180 atcttaagaa gagggttgaa atcgagattg gcataattcg tcgctaacat ttcaaaatcg 240 ctatgtctca cgcaaaacct atgtttatga cgctttttga aatgttagcg acgaattatc 300 aataactatg aatggcacct gccaaataat attggataat cttactccga tattatcact 360 gcacatcaaa gatacataat ctcggaactt ccattcggtt gctcttctga ttcacgaaat 420 tccacccact tctgacgaca atttttcgtc acgtggaaaa caaaaaaggc ctcttttggc 480 cgcccatcgt ttagtctacg aacgaaaaaa gtgcgaagca cttaattttt catcgaaaac 540 atcaacatca acagatccaa aatgtttcaa aacaattcac ttcagcagca aaaaatatgt 600 cacgcttgag cttgaacata gccttctact ttgtttatgt ttacagctgt agtgcagttg 660 tattagtggg gagcagtggg gagctttcga aaatcacgtt acgcattctg tcttttcagc 720 acccag 726 // ID Zator-4_HM repbase; DNA; INV; 4137 BP. XX AC . XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Hydra magnipapillata. XX KW Zator; DNA transposon; Transposable Element; Zator-4_HM. XX NM Zator-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4137 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(2047..2472,2435..3820) FT /product="Zator-4_HM_1p" FT /translation="YFVFRLIRESARQRKRRQKLKESIETVCQNIPEASSA FT LKQFSRNHTGRPRLEVDQPELLSTIIKIVQNLSAADERRRTECLRSVSTLD FT DLQEELTKIGFTLSRSGLYLRLLPRRGNTSEGKKHVSTVPVKLLRPENSMR FT KKKSCYVQKTQCEKKNDDRMFAKSFIDDMFEVCKLFGPKAVLFISNDDKAR FT VPLGIAAASLQAPLLMHMEYKVKLMDHDFVVSSQHKLIPSVYGVCEVNNTG FT NVSYSGDTFIRIRSAKHDTSNAFTHAFDVRELFKTELVKRRPIMLMETDGA FT QDEAPRFPKTLATAVDLFRLLNLDALLHGVNAAGLSAFNPVERRMAPLSRD FT LAGIVLPHDFFGNHLDSSGKTIDYELEVENFQKAADVLSQVWEKTVIDGYP FT VHCQAVPVGKAYEPPIPDPVWVDKHCQQSRYSLQIVKCQEESCCTPFETNW FT LKNFPQRFIPFPAIYEYCENGYKAMEPSRYFKNVSKQSTFAPLTHRLLLHD FT IPIEGLKYKIIPFDLYCPSLKEKLSNGICDKCNKYWPSEAAMKRHKKAHTQ FT KKNTVEMESEDSVSDSESINKSEDISEEIDYKSVEAEFMPVFDNIFDLFKS FT PFEEI*" XX SQ Sequence 4137 BP; 1455 A; 695 C; 761 G; 1226 T; 0 other; ggggccatcc ataaagtacg tacgcatgta tggggggagg gggggttatc ctgaaacgta 60 cgaatgcgta cagggggggg gagggtgtta acgacagcga gtacgtacgc ggtttaacta 120 tctctcctaa acttctttat aaaaacattg aatttcacgt gttttaatgt ttttttataa 180 aaacattgaa tttcacgtta aaaaatcata atataatata tgttattttt ttttctgact 240 tcaattacaa ttgtttttta tttttcttta taagtaacga ggagagaaaa aatcgagcta 300 taatatctcc aaataatgtt ttccattttt tgcaacataa ttcgactgta gcaagatgaa 360 aagtaacaaa actgtaatca aagaaaactt cttctgaatt ctttttttaa ttacgaacct 420 ggtttttccc atttctctta aaaggttaat attttgtttt taaaaaacaa taaaataaaa 480 aaaggtaata ttattatttt ttttatcatt aaagttaata ttaacataca ttaagattgc 540 taaggacagg cgaagcgaaa tgaacataca tgtgacaatg catatgacaa tcattgtgat 600 aattgtaaaa aatagttact ttaaatgctg atggaaacca acacttttta tgatgcaatg 660 taacggcaat tatcttgagt aaactaaaac aaaaaaaact tttatgattc tattcgaaaa 720 ctcaatacgc ctcattggaa aaaaaactta ttatttatta tgaccaagat gccgcgcggg 780 aacgagaaaa aataccttca attccttgag ttatataaaa aagcatattc aacacttgaa 840 aaacaaaagc agtatcagca agcacaagag ttgtggaaaa tagtaaaaaa caatgaatcc 900 ttgtatgaaa aaaaaataaa tgagttaaaa gagaaagtaa caaagtcgaa agtatcactt 960 atgtcattct ggggaaatac catttcacca cctagcaaaa agaaaaaaga tatccatcca 1020 gcgccgtctt cgtcaaatga aacacgaagt ttcctaaatg caactacgac agagcctgca 1080 aggattgaaa ttagtaagct aatttctgta atattttaaa aactttttct ctcttgtgat 1140 tttctataca ttaggccggg tgaataaaaa agagcaattg ctttttctca gtaactgttc 1200 agctttgcgt gaataaaaat ttgaccaaga ttgctcaaat ttttattcac gtaaagctga 1260 acaatttact ccaacaacgc tcaactttac gtaaatagaa atttgaggat aattgctcaa 1320 ttttttattc acgtaaaact gaacaattac tgagtaaaag caattgctct tttttattca 1380 cccggcctac tgagtttaac cgtattgtaa atcaaactac atcgtacgac cttagagcag 1440 tcacagaaat ttataatgtt gaccactaga cctatttttc tatatccaaa ataaattgat 1500 tagtagtaat tgtcaagaat ttataatagc ttattttttg gcatttttta atttagattt 1560 aacagacaac aaaaaagaaa gagaaaatcc ggcacaagtt aggagtcgaa agaaaatctc 1620 agatcttgaa tcacaacgtg catccttgat agtcgtacgt gattctggtt tatctaccgt 1680 aacaaaggaa caaataaaca ctgtgaaaga aaccataaga aaagaaaaaa caaaattgga 1740 caggtaagat aatttagtta tgtatttctt tatttgatat ttcattttag aatgcttaac 1800 tgtcaggttt ctagtaaata tcaagagata tatgttaaat aatacagagt ttatttatgc 1860 gtatagaaaa gcgtcttggc tataaaaacc aaacattttc aaagacagaa actttcataa 1920 tcttattgta aataatactt tcttatttga aataagaatt tgcgtacttt ttcttttgca 1980 acattagcca aaagttgcga aaccttaagt atagaagttt ttataacgga atcaaagctt 2040 tactgatatt tcgttttcag attgattcgt gaatcagctc gtcaaagaaa acgaagacaa 2100 aaacttaaag agagcatcga aactgtgtgt cagaacatcc ctgaagcatc aagtgctttg 2160 aaacagttta gtcgaaacca tactggacga ccgcgtctcg aagttgacca accagagctt 2220 ttatcaacaa ttataaaaat tgtgcaaaat ttatcagcag ctgacgaacg tcgaagaaca 2280 gaatgtctcc gtagcgtctc cactttagat gaccttcagg aagagttgac aaagataggc 2340 tttaccttga gtaggagtgg tttgtacctt cgtcttttac cacgacgtgg aaacacttct 2400 gaaggaaaaa aacatgtcag cacagttcca gtaaagctgt tacgtccaga aaactcaatg 2460 cgaaaaaaaa aatgacgata gaatgtttgc caaatcattc attgacgaca tgttcgaagt 2520 ttgcaaattg tttggaccaa aagctgtgct tttcatttcc aacgatgata aagccagggt 2580 cccattgggt attgctgcag caagccttca agcgccgttg ctgatgcata tggaatacaa 2640 ggtcaaactc atggatcacg attttgtcgt tagctcacag cacaaattga tcccatcagt 2700 gtatggcgta tgcgaagtta acaatactgg aaatgtatca tacagtgggg acactttcat 2760 acgcataaga agtgcgaaac acgatacatc aaatgctttt acacatgctt ttgatgttag 2820 agagcttttt aaaaccgagt tggttaaacg tagaccaatt atgttaatgg agactgatgg 2880 agctcaggat gaagcaccac gtttccctaa aaccttagca actgctgttg acctctttcg 2940 cttgcttaat ttagatgccc ttcttcatgg tgtgaatgca gctggccttt cagcctttaa 3000 ccccgtagaa cgaagaatgg cacctctttc tcgtgatttg gctggaatag tccttcctca 3060 tgattttttt ggcaatcacc ttgattcttc tggaaaaaca atcgactatg aattggaggt 3120 agagaacttt caaaaagcgg ctgacgtttt atcccaagtt tgggagaaaa ccgtcattga 3180 cgggtatcct gttcattgcc aggctgtacc agttgggaaa gcatatgaac caccaattcc 3240 agatcctgtt tgggtagaca aacattgtca gcaatcgcga tacagcctac aaatcgtaaa 3300 atgccaagag gagtcatgtt gcacaccgtt tgaaacaaac tggctgaaaa acttccccca 3360 acgtttcatt ccatttccgg caatctacga atactgcgag aatggttaca aagcaatgga 3420 accgtctcga tattttaaaa acgtatcaaa acaatcaacg tttgcaccac tgacacatcg 3480 acttttgcta catgatatac ctatagaagg attaaaatac aaaattatac catttgatct 3540 gtattgccca tcgttaaagg aaaagctctc aaacggaatt tgcgacaagt gcaataaata 3600 ctggccaagc gaggctgcaa tgaaacggca caagaaagca catacacaaa aaaaaaacac 3660 ggttgaaatg gagagtgagg atagtgtaag tgacagtgaa tctattaata aaagtgaaga 3720 tatcagtgag gagattgatt ataaaagtgt ggaagcagag ttcatgcctg tatttgataa 3780 catatttgat ttatttaagt caccatttga ggaaatatga aatataaata ctcgcaacaa 3840 gaattctttt ttagaatatt tttctttctt taacaaaaac tggaaagtta ttaacaaata 3900 actttccggt ttttaattaa gcacagagaa attaacaaaa tgtttttttg tgtttaatgt 3960 tttgaaaaat taatgcgttg gggaaggaag gggaggagag cctgaacttg taataaaaaa 4020 tgcgtacgta cgcagaaggg gggggggagg tcgttgaaag catacaagtg cgtacaaggg 4080 gggagggggg gtcctaaatc agtggtttta ctgcgtacgt actttatgga tggcccc 4137 // ID Gypsy-19_DWil-I repbase; DNA; INV; 4250 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_DWil_; KW Gypsy-19_DWil-LTR; Gypsy-19_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4250 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 59313 55064. XX CC Positions [1793-2335] - Reverse transcriptase CC Positions [3350-3826] - Integrase core CC 'TGTA' target site duplication CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 1205..4249 FT /product="Gypsy-19_DWil-I_1p" FT /translation="MSVNGVLLDLLVDTGSDVSIFSESTAKQLPNIEVHRN FT ESTLTGLGKKETKPTGAFVADVVVDDLHTTQKFIIVPDEAVECNALIGFDF FT VEKFIVTYAEGEYTFSKGDAVGDSNKERIAGSYNVFEVASLYEAPTIYQKA FT VRELIESIDEQPKQTQAECPVKLQIIPDATTMKSFHQSPSRLSACEAEAVK FT NQVDEWLTQGIVRKSTSSVASRVVVVQKKDGTPRICIDYRKLNSMTLSDRF FT PVPLIEEVLEKLQAAAFFSALDLENGFFHVPVEESSKYLTAFVTKEGLFEF FT NKTPFGFKNSPAAFIRFVNYVFQNLINDNIMQLYIDDIIVYAHSADDCLEK FT TQKVLQLAAQYKLKIKWEKCSFLQSRINFLGHIVCGGKIWPGKEKTDAVNR FT FGTPKNVRAVQSFLGLTGFFRIFIKDYSQIARPLTNLLRKDVKFHIGAAEL FT QSLPKLKELLTSKPLLHLYSRDAPPSCTQTHQSMVLEQCYSKVLTINSTRC FT IFGVETTESEAQRHSYILEAKAMYLALRKFRHYLIGIHFKLVTDCSAFKQT FT TRKEDVPREVTSYVLYLQDFTCDIIHKPGKSMQHVDHLSRYPQSIYNISTE FT VSARIKKAQSIDNHIKAVVTILNQQSYENYKIKGGVLYKAIDGNDLIVVPK FT QMENEIIREAHEIGHFATRKTMHAICQQYWIPHLERKVSNFIDNCIRCIIH FT SKKLGKQEGQMHLIDKGDTPLHTLHIDHLGPMDATSKSYKYILAMVDGFAK FT FVWLFTTKSTGHDEVIKKLTDWSHIFGFPKRIISDRGSAFLSNAFAEFVAE FT NGVEHILTTTGVARGNGQIERMNRSILAVISKLSAEDSSKWYKYVPQVQKA FT LNSQVHASTKFSPFKLMFGTKMCAQAGDRVLCLINEEFINNFNAERESLRD FT EAKKSILEAQMTYKRNFDKGRKQDHGYLMGDLVAIKRTQFVAGRKLASCFL FT GPYEVTKIKRNGRYDVRKAAQVEGPNQTSTSSDNMKLWRYVVDNDDPESSG FT TDDDDQEGR" XX SQ Sequence 4250 BP; 1414 A; 821 C; 977 G; 1038 T; 0 other; tttgggggct cgtcccattc tgagttggat taggctatta aacattacat ttaagaaccc 60 gatcgatgtt atactaatac atacgtatat gtacgtgcgt gaaaaggaaa aaagaaaaaa 120 aaaaaagacc aaaaagtgga caaagttgaa ccagcaatat acatacaaag tgtttaatga 180 cgccacgcac accatctcaa aaatcgaatt cgcagaagac gccgaaaagg gcgcgaaacg 240 cagtggctgt ggcgccaata tcagcagaca aagcatcatc agtgccgcca aaaattcggc 300 atcagaagcc gaaagaggaa agagagagac agacagaagc agtgcttgaa acagtgatcg 360 gaacagtgag caaagatatc acagaagcag cgatcgaaac agcgcttgac ctaataaaca 420 aaattgaaac agtggaaacg gaacaagcga tggctaacgc cgtggaaatg caaagtcaat 480 tgcaacagat gatgcaattg ctgacgctta aaatacaagt cgacgaacag cgagaaagcc 540 aagtccatat ggcatcgaat tgccttgaaa acattgttgg agaatttaat ggaactggcc 600 caaggaaatg gtttgaagtt ttcgagtaca atgcagatgc atttgaactt aacgaaaagc 660 aaatgtacgc acaagcgaga ggcaagatga aaggtgctgc caaactgttt ttggaatcag 720 taagtgttaa caattatgcc aatttgaaaa gttgtctaat cgaagaattt gagtgtgaat 780 taaatagcgc tgaagtccac caactgttga gagaacggaa aaagcataaa caggagtcga 840 ttcaggaata tattctcact atgagaaaga tcgcctctga tgggaatgta gaagagtcag 900 ctgttttgcg atacatcgta gacaatctgt ttcttaagaa cgaattcaaa gttggtctat 960 atgcatgtaa aacctttaag gagctaaagg aagcctatga aatcgtgatc gtgaaatcgt 1020 ttgttttaaa tgcggatcag ctgaacacaa aataaaagat tgcttgggcg aaacaaaatg 1080 ttttaagtgc aatgagaacg gacattttgc taggagttgc cctcttaatg gtggaaacga 1140 taaaaaggat gctcgtggag attttaaagt ccgcaatata gttaatcgtc gacgagttaa 1200 aaagatgtcc gtaaatggtg ttttgcttga cttgcttgta gacacaggat cggatgtgtc 1260 aatcttctcg gaaagcactg cgaaacagtt gccaaacatc gaagtgcata ggaacgaatc 1320 tacgttgact ggtctgggca aaaaagaaac aaaaccgaca ggcgcctttg ttgccgatgt 1380 tgttgttgac gatctgcata caactcagaa atttatcatt gtgcctgatg aggcagttga 1440 gtgcaatgcc ttaattggat tcgacttcgt tgaaaagttc atcgtcacat acgccgaggg 1500 ggaatacacg ttttcgaagg gtgatgccgt tggagactca aataaggagc gcatcgcagg 1560 ttcatacaac gtctttgagg tcgccagtct atatgaagca ccaacaatct atcaaaaagc 1620 agttcgcgaa ttaatagaaa gcatagatga gcagccgaaa caaacccagg cggaatgccc 1680 agtcaaactt caaattattc cggatgctac aacaatgaag tcatttcacc agtcgccatc 1740 aagattgtct gcctgcgagg cagaagcagt caaaaatcaa gtagacgaat ggctgaccca 1800 agggatagta cgaaaatcaa catcgagtgt ggctagtcgg gtggttgttg tccaaaagaa 1860 agatggtaca ccaaggattt gcattgatta tcgaaaactc aacagtatga ccttgtctga 1920 ccgctttcca gtcccattga tagaagaagt tctggaaaag cttcaagcgg ccgcattctt 1980 ttcagctctg gatttggaaa atggtttttt ccacgtgccg gtggaggagt ccagtaagta 2040 cctaacagca tttgtaacaa aagagggtct tttcgaattc aataagacgc cgtttggatt 2100 taaaaattca cctgccgcat ttataagatt tgtcaattat gtctttcaaa atttaatcaa 2160 cgataatata atgcagttgt acatagacga tataatcgtg tatgcccatt ccgctgatga 2220 ctgcttagag aagacacaga aggttctgca attggctgca cagtataagt tgaaaatcaa 2280 atgggaaaaa tgcagtttct tacagtcaag aattaacttt cttggacaca tcgtttgcgg 2340 tggaaagatt tggccaggca aagaaaaaac ggatgccgtc aatcgatttg gaacgcccaa 2400 aaatgttaga gcagttcagt cttttcttgg acttaccgga tttttccgaa tattcattaa 2460 agactactct caaatcgcaa gaccattaac aaatttactt cgaaaggatg taaaatttca 2520 tataggtgca gctgaactac aatctttgcc gaagttaaag gagttgctaa cgagcaaacc 2580 actgttgcat ctgtactcga gagatgctcc accgagttgc acacagacgc atcaaagcat 2640 ggttttggag caatgctact ccaaagtttt aacgataaac tctacccggt gtatttttgg 2700 agtagaaacg accgaatccg aagctcaacg tcacagctac atattagaag ccaaagctat 2760 gtacctggct cttcgaaagt tccggcacta cctgattggg attcacttta aattagtgac 2820 ggactgttca gcgttcaagc agacgacaag gaaggaagac gtaccaagag aggtgacatc 2880 gtatgtgcta tatttgcaag acttcacctg cgatattatt cacaaaccag gaaaaagtat 2940 gcagcacgtg gatcatctga gccgctatcc acaatcaatt tataatatat caacggaggt 3000 atcggcacgc atcaaaaagg cgcaaagtat tgataaccac attaaggctg tggtaactat 3060 tctcaatcaa cagtcgtacg agaattacaa gatcaaaggc ggagtactgt acaaagcaat 3120 agatggaaat gatttaatcg tggtgccaaa acagatggag aatgaaataa ttcgagaagc 3180 gcacgaaatt ggccattttg ctacccgcaa gacaatgcat gcgatttgtc agcagtattg 3240 gatccctcat ttagagcgta aggtgtctaa ttttattgat aattgtatcc gttgtataat 3300 tcacagcaaa aaacttggca agcaagaagg tcagatgcat ttgattgaca aaggtgacac 3360 cccgttgcat actctgcata tcgaccattt ggggccgatg gatgccacgt ctaagtcgta 3420 caaatatatc cttgccatgg tagacggatt tgcaaaattt gtgtggctct ttaccacgaa 3480 gtcaacaggc cacgatgaag tcataaaaaa gctgacagat tggtctcaca tatttggttt 3540 cccaaaacgt attataagtg accgaggttc tgcctttttg tcgaacgcat ttgcagaatt 3600 tgtcgccgaa aatggagtcg aacacatcct tacaaccaca ggagttgctc gaggcaacgg 3660 gcaaatagag cgcatgaatc gatccatact ggccgtcata tcgaagctgt ctgcagaaga 3720 ttcaagcaaa tggtacaaat atgtcccaca ggttcaaaag gctcttaact cacaggtcca 3780 cgcatctaca aaattttcgc cattcaagct gatgtttgga acgaaaatgt gtgcacaagc 3840 aggagatcga gtattatgtc taatcaacga agaattcatc aataacttta atgctgagcg 3900 agagagtttg cgtgacgaag ccaagaagag tatcctagaa gcacagatga cttacaaacg 3960 caactttgac aaaggacgca agcaggatca tgggtacttg atgggtgatc tcgtagcaat 4020 caagcgaacc caatttgtcg caggccggaa gttggccagt tgttttcttg gcccttatga 4080 ggtaactaag ataaagagga acggccgcta tgatgtacgc aaagcagccc aagtggaagg 4140 accaaatcag acgagcacaa gctccgacaa tatgaagctg tggaggtatg ttgttgacaa 4200 tgacgatccg gaatcatctg ggacagatga tgatgatcag gagggccgaa 4250 // ID Copia-11_AA-LTR repbase; DNA; INV; 273 BP. XX AC supercont1.46; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_AA_; KW Copia-11_AA-I; Copia-11_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-273 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.46; Positions 2090616 2090344. XX SQ Sequence 273 BP; 62 A; 58 C; 55 G; 98 T; 0 other; tgttggcaga aaacaacggt tacccctatt atgaacagtt tgccgaaccc ctttgccaat 60 tagttaagcc ggcatcactg gcgagtttga catttatgtt ttatgtaaga ataacaagaa 120 ggattatgtt tatttatttg attagtttac aacaatatac gtctgtgttc ggcacgttcg 180 agttttccct cccgtgatat ttttccgaat ttcccggtag tttttccgag ttcgtacgtg 240 gttccgctgc gctcctgttc gctttgttca aca 273 // ID hATm-22_HM repbase; DNA; INV; 3338 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 09-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-22_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3338 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1916-1916 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 590..2911 FT /product="hATm-22_HM_1p" FT /translation="MXQKIETVRMKKPLTRNDNICPVFGIISRMSINVLPT FT YADIMKHFISVRKELSDGHAKEATINETTRKISVDLETIXHRASIPVISRQ FT QIIAKIRDYHDKYRSLMKPFKQRSKDKESQYLNKVENFKIDSNRIFDIASC FT KCKDFSTCACEKNRKVPPNEIIFLADQRTERKMAIGNVDIKTTKQQEKKLK FT RKLADELRFYKCESITNAETLKSHPDCSLEDDCNDVTLGDLDEAITERDSN FT EDVDFMTDSPKSPSSTNNTSSQEILNSQMRTPLPTLARECDRWGVSDRSAA FT ALASAVLVDYGIINAHDSSDVIDPSKIRRERQKKRKILQAGAAKTLEGLYF FT DGRKDKTQSQIIKGNRYHPALETEEHVVLIKEPGSEYLEHVKTSQGSSQSI FT KISIISFLNANAIDTSSLVAIGCDGTNVNSGNIGGVIRLLELRFNKSLHWF FT VCLLHMNELPLRHLLINLDGSTYGPNSFSGPIGKLLKNMDLPIVSFQPIEG FT NVLSSIDADDLSTDQRYLHEMCQAISSGQCSFELGNRKPGPMCHSRWITTA FT SRILRLYVSTEKPDKNLIILVTYIQKVYAPIWFEVKSKHSCTDGSRHLFQM FT VHYTRYLPIELKKIVDPVIQRNAYFAHPENLLISMATDERPHIRQLALRRV FT LAARSISDKTDAKPTVRLFKIPPLNFEATDYTDIINWSSNKSAEPPMLRSI FT ESTQLKNSILTSEMLQFKSFPCHTQSVERSVKLVTEASMAVCEPQRDGFIR FT ARIESRSAMKVFNTKSDFSLSK*" XX SQ Sequence 3338 BP; 1148 A; 580 C; 615 G; 993 T; 2 other; gggtgcagtg aaattttttt ttttttaaat tcattttgcc tgggtgtgca aaagttgtct 60 attcatgtca gaaatactct gtaaaaattg caatgctcta ggaaattatc ttgaggtgcc 120 ccaaagactt tgaaattcac caaaatcccc aaaaagtgac ctacggaaaa aaatttagaa 180 tttttttttt tttttgatat tcattttgcc tgggtgtgca aaagttgtct attcatatca 240 gaaatactct gtaaaaattg caatgctcta tgaaattatc ttgaggtgcc ccaaggtcat 300 tgaaattccc caaaatcccc aaaatgtgac ctaccgacta cggagtacgg agcacggact 360 accgagatag actaagactt ttttttaaat ttgtgtgtgt ctgggtgata taataatatt 420 attatttagt gctatataat tagctgctat aatgctgcca tctggcgagt gaatgtgcat 480 gtattattat gtaaaataca aacggcgtca aacaaaaaac acagaatcta gcagataatt 540 attaaaatta tcatttattt ttgctgtgtt gtctttgtat ggttagatca tggsacaaaa 600 gattgaaact gtacgaatga agaaaccact cactcgcaat gataacatat gcccagtatt 660 tggcatcatt tcacgaatga gcataaatgt tttgccaacg tatgcagata tcatgaaaca 720 ttttatatca gtaagaaaag aattgtctga tggccatgcc aaagaagcta caattaatga 780 aacaaccagg aagatatctg ttgatcttga gacaatatgr cacagagcat caattccagt 840 aatatcccgt cagcaaataa tagctaaaat aagagattac catgacaaat atcgctcact 900 tatgaaaccg tttaaacaga ggtccaagga caaggagtca cagtacctga acaaagttga 960 aaacttcaaa attgactcaa atcgaatttt tgatattgct tcatgtaaat gtaaagactt 1020 ttcgacatgt gcttgtgaaa agaatcgtaa agttccacca aatgaaataa tatttcttgc 1080 agatcaaaga acagaaagga aaatggctat tggcaacgtt gatattaaaa caacaaagca 1140 gcaggaaaag aaactgaaaa gaaaacttgc agatgaattg agattttata aatgtgaatc 1200 aataacaaat gctgaaactt tgaaatcgca tccagattgt agcttggagg atgattgtaa 1260 tgatgtcacg ttaggtgatt tagatgaagc tatcaccgaa agagatagta atgaagatgt 1320 tgacttcatg acagactcgc ctaagtcacc atcatcaact aataatacaa gctcacaaga 1380 aattttgaac tctcaaatgc gaactccact accaacatta gctcgtgaat gtgatcgatg 1440 gggagtatcg gatcgctcag ctgcagcatt agcatcggct gttttggttg attatggcat 1500 tattaatgca catgactcat ctgatgttat tgacccaagt aaaattcgta gagaaagaca 1560 gaaaaagaga aaaattttac aagctggtgc tgccaaaaca ttggaagggt tatatttcga 1620 tggtcgaaag gacaaaactc aaagccagat cattaaagga aacagatatc atccagcttt 1680 ggagaccgag gagcacgttg tgttaataaa ggagcctggt tctgaatacc tggaacatgt 1740 taaaacatca cagggatctt cacaaagtat aaagattagc attatatcat ttttgaatgc 1800 aaatgctatt gataccagtt cattggtggc aatcggatgt gatggcacga atgtcaactc 1860 aggtaatatt ggtggtgtta ttcgacttct cgagcttcgt ttcaacaaat ctctacactg 1920 gtttgtatgt ttattacata tgaatgagtt accattgaga catctcctga ttaacctgga 1980 tggatctaca tatgggccaa attcatttag tggtccaatc ggaaaattat tgaagaacat 2040 ggacttgcca attgtcagtt ttcaaccaat cgaaggaaat gtgttgtcca gcattgatgc 2100 agatgatctc agtacagatc aaaggtactt gcatgaaatg tgtcaagcaa tatccagtgg 2160 ccaatgtagt tttgaattgg gtaatcgaaa accgggaccc atgtgtcatt ctcgatggat 2220 aacaactgct agtcgtatcc tccggttata tgtatctaca gaaaagccag acaaaaactt 2280 gatcattctt gtaacttaca tacaaaaggt atatgcacct atttggtttg aagtgaagtc 2340 taagcattcc tgtacggatg gcagtcgcca tctctttcaa atggtgcatt ataccagata 2400 cttgcctatc gagctcaaga aaattgttga tcctgtaatc caaagaaatg catactttgc 2460 tcacccagaa aacttattaa tatcaatggc aaccgatgaa aggccccaca ttcgtcaatt 2520 agcactaaga cgtgtattgg ctgcaagatc gatatccgac aagaccgatg ctaaacctac 2580 agttagattg ttcaagatac caccgctaaa ctttgaagca acagattata cagacataat 2640 caactggtca tcaaataaaa gtgctgaacc tccaatgctc agatcaattg aatcgactca 2700 gcttaaaaac agcattctga cctcagaaat gctacaattt aaaagctttc catgccacac 2760 tcagtctgta gaaagaagcg tgaagctagt cactgaagcc agtatggcag tctgtgaacc 2820 tcagcgtgat ggttttattc gtgcaagaat agagtcgcga tcagctatga aagtctttaa 2880 cacaaaatct gatttcagcc tttcaaaatg acgttcttga aatgattaac tgacagaaaa 2940 taaacttgaa ccagacttac ttaattttgt ttgaaataat tgttattctt aatttttgtg 3000 gcacctcaaa gttctacaat ttacataaat atattttcat taggttctga tatgatgtta 3060 aaaccatcat acaggtttat tataatagaa atcgaaaaac aagatcagtt tatttttagc 3120 cattgatttt tttgtgaaaa tatgaattta ataataattg taatttctaa ctttttgggg 3180 attttggggg atttcaaagt ccttgtggca cctcaagata atttaccaga gcattgcaat 3240 ttttacagag tatttctgat atgaatagac aacttttgca cacccaggca aaatgaattt 3300 caatcaaaaa aaaaattttt aatgttatca ctgcaccc 3338 // ID Gypsy-16_SI-LTR repbase; DNA; INV; 310 BP. XX AC AEAQ01025185; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_SI_; KW Gypsy-16_SI-I; Gypsy-16_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-310 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01025185; Positions 937 628. XX SQ Sequence 310 BP; 78 A; 72 C; 75 G; 85 T; 0 other; tgttatgtat gtgtgtacat cagtttaata cgcgcgctca agatctacac ttctagtagc 60 gcgcacgcct aaagctccgc gaataggctc gcgacacgcg tgcgacgatc gcgaatcgag 120 caggattagg gagatgtcgc gaggagcgtg cttgacaggg agaagacgga agtccacatt 180 gattcgagat ccttcgtaca cacacacact cgctgtaacg gttgtcggac attttcctaa 240 atatagtgct gtttaattat tctgcgcctt ctgagttttt ctttatggcg atttccttaa 300 atacacaaca 310 // ID NVBRP4 repbase; DNA; INV; 183 BP. XX AC X64096; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE N.vitripennis repetitive DNA from B chromosome. XX KW SAT; Satellite; Simple Repeat; NVBRP4; Repetitive DNA; KW satellite DNA. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-183 RA Eickbaum C.D.; RT "NVBRP4."; RL Direct Submission to Genbank (27-DEC-1991)D.C. Eickbaum, RL University of Rochester, Dept. of Biology, Hutchison Hall 334, RL Rochester, NY 14627, USA. XX RN [2] RP 1-183 RA Eickbush G.D., Eickbush H.T. and Werren H.J.; RT "Molecular characterization of repetitive DNA sequences from a B RT chromosome."; RL Chromosoma 101, 575-583 (1992). XX DR GenBank; X64096; Positions 1 183. XX SQ Sequence 183 BP; 70 A; 25 C; 36 G; 52 T; 0 other; gtcggggtaa tataatcaaa agtctcgact tatgattgga tattaaaagt atatcataca 60 tcgatatgac aaataatttg catgtctaac gatattacgt cttaaatgaa gaagcagtaa 120 attttataaa acaagatggg tgacgcttat aacacgagca gcagttgaga aagcaatgcc 180 gtt 183 // ID R2_LP repbase; DNA; INV; 4495 BP. XX AC AF015814; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Limulus polyphemus retrotransposon R2, complete sequence. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_LP. XX OS Limulus polyphemus OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Merostomata; OC Xiphosura; Limulidae; Limulus. XX RN [1] RP 1-4495 RA Burke D.W., Malik S.H., Lathe C.W. and Eickbush H.T.; RT "Are retrotransposons long-term hitchhikers?."; RL Nature 392(6672), 141-142 (1998). XX RN [2] RP 1-4495 RA Burke D.W., Malik S.H., Jones P.J. and Eickbush H.T.; RT "The domain structure and retrotransposition mechanism of R2 RT elements are conserved throughout arthropods."; RL Mol. Biol. Evol 16(4), 502-511 (1999). XX RN [3] RP 1-4495 RA Burke D.W. and Eickbush H.T.; RT "R2_LP."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX RN [4] RP 1-4495 RA Burke D.W. and Eickbush H.T.; RT "R2_LP."; RL Direct Submission to Genbank (09-SEP-1998)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015814; Positions 1 4495. XX SQ Sequence 4495 BP; 1207 A; 1078 C; 1146 G; 1064 T; 0 other; tgggaggaga cccaaactat cctaggatgg ggcggaaccg accatatgag ccatattaac 60 attgcccaca ctatcctctg gaggtacctc ctcgtggtac ggctggatat aggtaaatcc 120 tgtaaccaaa tcctccaacc cgtgaaggag aacactaaaa cccatatagt ggcctcgcca 180 accactatat gtccaacggc aggagaagct atctcccgga tgggaaggaa aaccctaaac 240 cgtgatggga acttaccggc cccatcagct attgggtacc cggtagggac ttgcaaccct 300 accctgtatt tgcattttat agggaaccgg tcggccctat atcagagtag accgtttatt 360 aaatatgggt gaaaatatta acagtaaaag ctatggtttg gcgtccgtgt ggtgccaggg 420 cggcggccaa acccgagcta cttggcacca actggggatg gtagcttccg agcgattccc 480 tggcgacgtg ggaccgatcg acgatggagt ccaaacatcc ggaatagagg aattgagaaa 540 tacctattcc accaccggct cacataccca aggtgaaccc ggtgcaacta gagtacaacc 600 tatctgtggc ggtaggtgcc gaaccactca ggtgacgggc ttgtttattg atgtctccct 660 acgagacacg aattgtgaca aatccactcc ggtggacaat tacccgatct atgaacctgt 720 taccgatatt agacaagaaa ataaagaact gacaacgcct agagcttcag gcagcatgtc 780 tgtaagtatc cagtcatcga gcgtgactga gggcgaaatt gataataact ctgaaactga 840 ggaattgacg gatatatgtt tggctacgct agagcttcag gcagcacgtc tgtaagcatc 900 cagtcatcga gcatgactga gggcgaaact aacgaaaggg ccacgcctag agcttcagac 960 agctcgtctg taagcatcca gtcatcgtgc gtgactgagg gtgaatgtct acctcctaca 1020 gacaactgca acccgtctgt agagaaccag ttaccgtgcg taactgaggg taggtttgaa 1080 cgggtaggct cactggtgac ggtgcgtctg cccttcagaa aggtggcatg tgacttgtgt 1140 tctaaagagt tcttgacata ttcgaagttt gcagtccacc aggcaaactt ccacaattca 1200 gaaactcagg catgctgcac atattgcggt aaaagtgatg gcaatcatca ctctatagcc 1260 tgtcacgttc cgaaatgtcc ctggcggcga actgttacgt ttgctgcgaa cttaagcaat 1320 ttcttgtgtg atctttgcaa tgatagtttt aagaccaaat cagggctttc gcaacataag 1380 cgtcataagc atccttgttc aaggaatgct gaacgcatcc tttctcttgg agtcaggacg 1440 ccgtcggccc gccctcgcca ggtagtgtgg tccgaagaag aaacacgaac cctccgggaa 1500 gtggaagtag tgtattcggg ccaaaagaac attaatgtcc tctgtgcggg gcatctacct 1560 ggtaagactt ccaaacaggt ctcggacaag cgccgagact tgcacaggat acggtcttct 1620 aacgtacatg gtacacccac cactcagagt cgtggagatc ctgttgaaca ggtcgaggag 1680 tacgaggagt tggactggga aggaatgcat ccttttcccg accctgactc taagttttgc 1740 tcgtaccttg atcagctgag agatcagaag ggactcactg aaccggtatg gcaggagatc 1800 gaaatcgtgg cacaagaatg ggtagaaaac cttgcccatg ttcaatcgtc ttggaatcat 1860 gagagaacaa ccaagcaggt gccagaaaac aatacacctg cacgaagacc atttaaaagg 1920 cgtctccatc gtgtggaacg ttataagcgg tttcagagaa tgtacgacct ccagcgaaag 1980 cgcctggctg aggaaatact agacggccgg gaagccgtca catgtaacct caaaaaggag 2040 gagatcaaag accactatga tcaggtctac ggtgtgtcaa atgatagagt ttctctagat 2100 gactgcccca ggccaccagg ggccaataac accgacctcc tgaaaccgtt tacgccaacc 2160 gaagtgatgg actcacttca gggtatgaag aacggggcgc ctggccctga taagattacc 2220 ctaccgttcc tccaaaaacg tcttaaaaat ggcatccatg tttccttggc aaatgtgttt 2280 aacctttggc aattctcggg tcgcatcccc gaatgcatga agtcaaatag gtcagtcctc 2340 atcccgaaag ggaagagcaa tctgcgggat gtcagaaact ggcggccaat cacaatctcc 2400 tcgattgtgt tgcggctata caccaggatc ttggcacgcc gtctcgagcg ggcggtgcag 2460 attaatcccc gacagcgagg cttcgtccct caggctgggt gtagggataa tatattcctg 2520 cttcagtctg ctatgaggag ggctaagcga aagggaactc tggctctggg gcttcttgac 2580 ttgtcgaagg catttgacac agttggtcac aaacatcttc tgaccagcct agaaaggttc 2640 gctgtccacc cgcatttcgt ccgaattgtg gaggacatgt acagtggttg ttcgacgtcc 2700 tttcgagtag gcagccagtc tactcgcccc atcgttctga tgagaggcgt caaacaaggg 2760 gaccccatgt ctcctatatt gttcaacatc gctctggacc ctcttcttcg tcaactggaa 2820 gaggaaagcc gaggctttat gtttagggag gggcaggccc ctgtctcatc tctagcatat 2880 gccgatgata tggcactact ggctaaagat cacgccagtc ttcagtcgat gttgggcact 2940 gtggataaat tttgttcagg gaacggactt ggccttaaca tcgccaaaag tgccggactt 3000 ctgattaggg gagcgaataa gaccttcact gtcaatgact gcccttcctg gctagtaaat 3060 ggtgaaacgc tcccgatgat cggtcccgaa caaacttacc gttatcttgg ggcaagcatc 3120 tgtccgtgga ctgggataaa cagcgggcct gttaaaccca ccctggagaa atggatagcc 3180 aatatcacag agtctcccct caagccacat cagagggtcg acatactctg taagtacgct 3240 ttaccccggc tgttttacca acttgagctg ggcactctga atttcaaaga actgaaggaa 3300 ctagacagca tggtcaaaca agctgtcaaa cgttggtgcc atctacctgc ctgtacggct 3360 gacggcctgc tatactcccg tcatcgtgat gggggtttag ctgtagtaaa attagagtct 3420 cttgtccctt gtctaaagat caagacaaat ctcagactag tgcattcgac cgaccccgtc 3480 atatcatctt tggcggaatc cgatggttta gtgggtgcca tcgagggtat tgctcaaaag 3540 gctgggcttc cgatccctac gcctgaccag cgatctggaa catatcattc taattggaga 3600 gatatggaaa ggagaagctg ggaaaggttg gccctgcacg ggcaaggtgt ggagctcttc 3660 aaaggctcaa gatctgccaa ccactggttg cctaggccag ttggtatgaa gccacaccac 3720 tgggtgaagt gtctggcaat gagagctaat gtatacccta caaaaagagg cctcagtaga 3780 gggaatctat ctaagaacaa agattccgcc aagtgtcggg gatgcacatc aatgagggag 3840 accctatgtc atctaagtgg tcaatgcccg aaattgaagt cgatgagaat aaggcgccac 3900 aataagatct gtgagcactt gatcgccgag gccagcttta aaggctggaa ggttctgcaa 3960 gagcctacct tggttacaga caatggtgaa cgtcggcgac ctgatctgat cttccatcgt 4020 gatgataaag cggtggttgt tgacgtgacg gttcgctacg aaatttcgaa agacacgttg 4080 agagaagctt atgcttctaa agttcgaagg tatggatgtt tgaccgaaca aattaaagac 4140 cttacagggg ctacctccgt tgtttttcat ggatttccaa tgggtgcccg cggtgcctgg 4200 tttcctgaaa gctcggacgt gatggccgac ctgaacattc ggtcaaaata ttttgaagag 4260 ttcttgtgta gacgcaccat cctatataca ctggacttat tatggaaatc gaataacgaa 4320 caatatttag aaaggcttgc accataaatt ttgtctcttt ccccaatgat gtctactagc 4380 acgctgccga agctagatag attgaggaat ctgcgtaatc tgtaatgatt acgcctcatg 4440 ggcatctatc ggtagcgtcg accctgacgt taaattgggt aataagaaat atcga 4495 // ID Gypsy14-SM_I repbase; DNA; INV; 4394 BP. XX AC . XX DT 14-OCT-2007 (Rel. 12.1, Created) DT 02-NOV-2007 (Rel. 12.1, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; internal portion; Gypsy14-SM_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-4394 RA Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(10), 1065-1065 (2007). XX DR [1] (Consensus) XX CC LTRs are atypical: they do not have TG...CA termini. XX FH Key Location/Qualifiers FT CDS 58..1272 FT /product="Gypsy3-SM_I_1p" FT /translation="MECLPKSCQDMLTLNIRRYDGRGSKEAEEWIRDIEEW FT LLTNGQRLTSTFDVLLVAEAKTLWGNVRESETTDVKAKTWFLNTFTIKKSM FT TDKIRELADQDDERFAAFEIRVRKCLTEVLNSGQKEEDIVQDLITNKARDA FT RLREILLTKPETKIEETRALAKIFESNKNGSNKTEHVKAIGQRTYANVTER FT GQMRPQQNTQHIRSTPAQRRLSEEPFNYNEPRRDEYPRRESGRLAEDYHGR FT REQFQREQSFRNDRISNGFERKLPTVSMKNIAKRLYNESRGLPHSPVPKLT FT VGDCFCCGERGHKRYECPLGNKCLICGKEGHGFRDCFSLRKEVPRRYQRIA FT CIEEEDSKSRKQDYTENNQDFRITHSEIEERYNVPEKYTTMKEDEKNISDS FT MGFISSVESRQ" FT CDS 3142..3666 FT /product="Gypsy3-SM_I_3p" FT /translation="MKRPKKENQRSDIYIYSIQRTEAMEKVLEQQQKDIKF FT QELRDMLENGNVQTNKNCFPFNVKNIRIDGGLLTVEKNNNVLILVPETFAG FT LLTTTLHQELCHIGVKKLFHYIEKIFYWPKMQETIQLCLRCCDICSKRKID FT QTRTKEIFIPRYSSEFLEQIVMDIAYMERNTIKNIC" FT CDS 1323..3296 FT /product="Gypsy3-SM_I_2p" FT /translation="MYQTKDGRSWSKIILNSTEVDMLWDSGASVTVMSKNL FT WEKIGKPDLQRSAILLCGVFSTGGEESMGCMTIPAIWNQVTKIITVVVVKE FT IKPEFIGGIDTMEEFGVKLMKINNIETTFINKIHTDEERINKALMALTGEK FT NKDIEEMIRKYGGIFMASKFDLGFTNLVRHEMRTSGGPIMQHPRRQPMHME FT KRIEDLIKELIKAKVIRRCKSSWNAPLVIVGKKDGSIRMCVDYRGLNAITE FT KESFPMPNTKFLLDCLADARIFSSIDLGQAYYQVELAPDIQEMTAFSTREG FT QFCFNRLPFGLSTAPATFQRLMHTVLEGMLFKGVVVYLDDILIYGKSKKDH FT DELLEEIFKRIQMAGLKINPEKCEFNKTELCFLGHTVNGEGIQTNRKKVEE FT IEKAEVPKCSKQLRSFLGLTNYYRRFIKDYARIAEPLYAATSGCDKDIIWT FT KECNESFQVLKDRLCEAPILDYPREGRTFILDTDASFGAIGSVLSQVKEDG FT EESVIAYGSRHMTAHEKGYCVTRKELLAVHEYVMHFREYLYGKKFIIRTDH FT KALVFMNTTKTPISPQFQTWLANLAEYDFELKYRKGEEHSNADGLSRLKGM FT ICTQCQTRHEEAKEGKSKIRYIYILYPKNRGYGKSIGAAAKGYQISRATRY FT VRKWKCTDE" FT CDS 3663..4382 FT /product="Gypsy3-SM_I_4p" FT /translation="MLVMVDRFSKLISLTAITKQDEGTIFKAILNNWIYRF FT GKPKSILTDRGRNFEGKFLKERLQKLGINQEFSSPYQHQSNGLAERAIRTV FT RDLITATLAGGCEEKNWHELLPRIEFSINSSRQSATGFSPFEIIYGRSVNL FT HSNLKQADISRGELMENAKHNSDKAADNMRAMDTTKRGARVFEVGEEVLVR FT KEPHNRHKDDMQYEGPFKIVRFLSPHRVELHSQDGTKERRIEWLKKWRKS" XX SQ Sequence 4394 BP; 1690 A; 674 C; 979 G; 1050 T; 1 other; ggcgaccacg ccgaaaataa gaacatcaac gcagtttcaa aaacmacgaa atcattaatg 60 gaatgtctgc caaaatcatg tcaagacatg cttacactaa atattcgacg ttatgatgga 120 cgaggatcaa aagaagcaga agaatggatc agagatatag aagaatggct gttgacaaat 180 ggacagcgat tgacaagcac attcgatgtg ttattagtag cagaagcaaa gacattgtgg 240 ggaaacgtca gagaatcaga aacaacagat gtcaaagcca aaacgtggtt tcttaatacg 300 tttacaataa agaaatcaat gactgacaaa atcagagaat tggcagatca agatgatgaa 360 agatttgcag catttgaaat tcgagtaaga aaatgtttaa ctgaagttct gaattcaggg 420 caaaaagaag aagatattgt tcaggatctc attacaaata aagctcgcga tgcaagactg 480 agagaaatat tactgacaaa gccagaaaca aaaattgaag aaacgagagc attggcaaag 540 atttttgaaa gtaacaaaaa tggaagcaat aaaacagaac atgtcaaagc tatcggtcaa 600 agaacatatg caaacgtgac tgagagaggc cagatgagac cccagcagaa tactcaacat 660 atcagatcca caccagcaca aagaagatta tcagaagaac cattcaatta caatgaacca 720 agaagagacg aatatcctag acgcgaatca ggacgactcg cagaagacta ccacgggcgt 780 cgagaacaat tccaaagaga acaatctttt agaaacgata gaatttcaaa tggatttgaa 840 agaaagctgc caacagtttc tatgaagaac attgcaaaaa gattatataa cgaaagcaga 900 ggacttccac attcaccggt tccaaaatta acagtaggtg attgtttttg ttgtggtgaa 960 aggggacata aaagatatga gtgcccactt ggaaataaat gtctaatatg tggcaaggag 1020 ggtcatgggt tccgagactg tttttcatta cggaaagagg ttccccgacg atatcaaaga 1080 atagcatgca tagaagaaga agattcaaaa tctcgaaaac aagattatac tgagaacaat 1140 caagatttca gaataactca cagcgaaata gaagaaagat acaatgttcc tgagaaatat 1200 actacgatga aagaagacga aaaaaacatc agcgactcca tgggtttcat ttcgtcggtg 1260 gagtcaagac aataaattgt ggattaagaa atgagcgccg aagagaagag gcagaagaag 1320 agatgtatca gacgaaagat ggaagaagtt ggagtaaaat aattttgaat agtacagaag 1380 tagatatgtt atgggattcg ggagcgtcag taacagtcat gagcaagaat ttgtgggaga 1440 aaataggcaa accagacctt caacgtagtg ctatactatt atgcggagta ttttcaactg 1500 gaggagaaga atcaatggga tgtatgacta taccagccat atggaatcag gtgaccaaaa 1560 ttattacagt agttgttgta aaagagatta agccagaatt catcggtggt attgatacaa 1620 tggaagagtt tggagtaaag ttgatgaaga tcaataatat agaaactaca tttataaata 1680 aaattcacac tgatgaagaa agaataaaca aagctctgat ggcgttgaca ggtgagaaga 1740 acaaagacat tgaagaaatg atcaggaaat acggtggaat tttcatggca tcaaaatttg 1800 atctgggatt tactaattta gtacgacacg agatgcggac gtcaggaggt ccaataatgc 1860 aacatccacg gcgtcaacca atgcacatgg agaaaagaat tgaagacctg ataaaagaat 1920 taataaaagc aaaagttatt cggcgttgta agagttcttg gaatgctccg ttagtgatag 1980 taggaaagaa agatggatcc atcagaatgt gtgtagacta tcgtggattg aatgctataa 2040 cagaaaagga atctttcccg atgcctaata ccaaattttt actggactgt ctcgcggatg 2100 caagaatatt ctcgtcaatt gatttaggtc aagcgtatta ccaagtagaa ttggctccag 2160 atattcaaga aatgactgca tttagtacaa gagaaggaca attctgtttc aataggctac 2220 cgtttggatt atcaacagct cctgcgactt ttcagcgact aatgcacaca gtattggaag 2280 gaatgctatt caaaggagta gttgtttact tagatgatat tttgatatat gggaaaagca 2340 aaaaagatca tgatgagctg ctagaagaaa tatttaaaag aattcaaatg gcagggttga 2400 agataaatcc agaaaaatgc gaattcaata aaacagaact gtgcttcctt gggcatacag 2460 ttaatggtga gggtattcaa acaaacagaa agaaagttga agaaattgag aaagcagaag 2520 taccaaaatg ttcaaaacag ttaagatcat ttctcggatt aacgaactac tacagaagat 2580 tcatcaaaga ctatgccaga atcgcagaac cgttatatgc agcaacatct ggatgtgaca 2640 aagatataat ttggactaaa gaatgtaatg aaagtttcca agtactgaag gacaggttgt 2700 gcgaggcacc tattctcgat tatccaagag aaggtagaac atttatctta gacacagacg 2760 ctagctttgg agctattggg tcggttttga gccaagttaa agaagatgga gaggaatctg 2820 ttatagctta tggttcgaga catatgacag cgcatgaaaa aggctactgt gtaacgagga 2880 aagaacttct cgctgtacat gaatatgtta tgcattttag agaatatctt tacgggaaaa 2940 agttcataat tcgtaccgat cacaaagcat tggtgttcat gaacactaca aaaacgccaa 3000 tcagtccaca atttcaaaca tggctagcta atctggctga atatgatttt gaattgaaat 3060 acaggaaagg ggaagaacac tcgaatgcag atgggttatc aagattgaaa ggtatgattt 3120 gcacccagtg tcaaacaaga catgaagagg ccaaagaagg aaaatcaaag atcagatata 3180 tatatatact ctatccaaag aacagaggct atggaaaaag tattggagca gcagcaaaag 3240 gatatcaaat ttcaagagct acgagatatg ttagaaaatg gaaatgtaca gacgaataaa 3300 aattgttttc cattcaacgt aaaaaatata cgaattgatg gaggattatt aacagtagaa 3360 aagaacaaca atgtcttaat tttagtccca gaaacctttg caggcttatt aacaacaacc 3420 ttacaccaag aattatgcca tattggagtt aagaaattat tccattatat agaaaagata 3480 ttttactggc cgaagatgca ggagacaatt cagctttgtc tacgatgttg tgatatctgt 3540 tctaagagga agattgatca gacaagaacc aaggaaatat ttatccccag atatagctct 3600 gaattcctag aacaaattgt tatggatata gcctatatgg agaggaatac tataaaaaat 3660 atatgttagt aatggtcgat cgttttagta aactgatttc tttgaccgca ataacgaagc 3720 aagacgaagg aacaattttc aaggcgatac ttaacaattg gatctatcgt ttcggaaaac 3780 caaagagcat attgacagac agaggccgga attttgaagg aaaattttta aaggaaagac 3840 tacagaaact tggtatcaat caggaattca gctcacctta ccagcatcaa tcaaacggat 3900 tagctgaaag agccataaga acggtaagag atcttattac agcgacattg gcaggcgggt 3960 gtgaagagaa gaactggcat gaacttttac ctagaataga gttcagcata aatagcagtc 4020 ggcagagcgc gacgggcttt tcaccatttg agataattta tgggcgaagt gttaatttac 4080 actcaaattt gaagcaagca gatatatcta gaggagaatt gatggaaaat gcgaaacata 4140 actcagataa agctgcagat aatatgagag ctatggacac gacaaagaga ggcgctcgtg 4200 tttttgaggt gggggaggag gtattagtgc gcaaggaacc gcataacaga cacaaagatg 4260 atatgcagta cgaaggccca ttcaaaatcg tcaggttttt atctccgcat cgagtcgaac 4320 ttcattccca agacggaacg aaagaaagaa gaatcgagtg gctgaagaag tggagaaaat 4380 cttaagaagg ggag 4394 // ID Gypsy-610_AA-LTR repbase; DNA; INV; 193 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-610_AA_; KW Ty3_gypsy_Ele68; Gypsy-610_AA-I; Gypsy-610_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-193 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 193 BP; 65 A; 40 C; 35 G; 53 T; 0 other; tgttcaatcc aagcaatcta cattgatcgc agcaaccact gtctgcgacc aaagaggaaa 60 aattttcatt cgaagtgtaa atgtcaagta gaataaacgt atttgtattt agctctaaca 120 agctccaatt attttccgga agcatctgga aaagtacttc cgctgatctg gaagtacctg 180 gaagaacact tca 193 // ID DNA8-16_AP repbase; DNA; INV; 177 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-16_AP. XX NM DNA8-16_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-177 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1758-1758 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 177 BP; 54 A; 40 C; 36 G; 46 T; 1 other; cagtggcgca aatagggggg ggggcttggg gggcttagcc ccctcaaaat cactcaaagc 60 cccctcaaaa nataatagta taccattctt aaaagtatgt acctataaat tataatttaa 120 aaaaaaatgt gttggggctt gagccccccc taaacatttt tcctatttgc gccactg 177 // ID Gypsy_6B repbase; DNA; INV; 6556 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.03, Created) DT 05-JAN-2009 (Rel. 14.03, Last updated, Version 1) XX DE Gypsy_6B-1 - Retrotransposon from Drosophila yakuba. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy_6B-1; KW Gypsy_6B. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-6556 RA Bartolome C., Bello X. and Maside X.; RT "Widespread evidence for horizontal transfer of transposable RT elements across Drosophila genomes."; RL Genome Biol 10(2), R22-R22 (2009). XX RN [2] RP 1-6556 RA Bartolome C., Bello X. and Maside X.; RT "Gypsy_6B-1 - Retrotransposon from Drosophila yakuba."; RL Direct Submission to Repbase Update (05-JAN-2009). XX DR [2] (Consensus) XX CC Conceptual translation suggests that ORF1 and ORF2 overlap 54 CC nucleotides. There is a one-nucletide frameshift (+1) between CC both CC ORF. CC Positions [1843-2119] - Retroviral aspartyl protease CC Positions [2429-2975] - Reverse transcriptase (RNA-dependent DNA CC polymerase) CC Positions [3301-3637] - RNase H CC Positions [4214-4660] - Integrase core domain. XX FH Key Location/Qualifiers FT CDS 483..1862 FT /product="GAG" FT /note="Conceptual translation, with homology with FT GAG." FT /translation="MQPSFSQYRNSTPYSTDSESDCENQPSPLRRTTANSP FT AMAIDPEQLRAVIQAVVANTLADEAAKNELAQNEMRQKIQELASQLAATQV FT APTSAVAPAIRAYEPIDITSGVPCNMTLDAVKYLPEFSGSQESYVSWRQAA FT VAAYRIFKDYNGTSRHYEAVTIIRNKIRGAANNVLSSFGTVLNFDAIINRL FT DFTYSDKRPMHVIEQDMSTLRQGSNMTLLEYYDEVEKKLTLLTNKAHMSHE FT PSAANILCEKFREDALRIFISGLKRNLTDVLFAAKPKDMPSALALAQEVES FT NHERYVFAASFARSQEDKDRKPTAKAQGRQLDKYDRHDPQPSNTKNPYFNR FT QHKAQVHADPQGTKNYKDDGPPEPMEVDPSVSKLMQPTQANAYRNRRPATS FT ERTGAVRKQQKVNFITQNAEEKTGAYAAAASKAESKIDDDAITGYDSDYLN FT FLGENPCYPSSDEE*" FT CDS 5094..6554 FT /product="ENV" FT /note="Conceptual translation, with homology with FT ENV." FT /translation="KIHFHRCRIFILLSLALVSAHVTNYSQTKYIPIIDGE FT ILVWEELAYVTHSANLSEYMRVIEETSSMNEMFPQSHMRKLLEVDTLHLRD FT TLDSLKVHHRIARSLDFLGSMLKVVAGTPDAGDLERIRFTEWQLTQSNNRQ FT IKINTRIQSRINQLTTTVNQILQTHKNTQIDTGHLYETLLARNRILMMELQ FT NLMLAVTLAKNNIVSPNILDHADLKSVWLKEPTDTPIGDLMSASSVKILQS FT SNMMHFIIKFPKIKLSCKKVTIFPVACKGVMLRITDNMVAKCGEVVHTIKN FT CIPTPGATFCQLSTESSCARELHAGVLAHCESQPSDLHPLTRVDEGIIIIN FT DRPARVTVDNETEVYLRGTHLITFNDRVVINDTTFVNHDKAQTRAPGVANS FT PSLNITANRDILSLPYLHQLSERNLEFIKEFGREIDTDHSHRIIFVAAAIC FT CALVCIGIACLRYIGARRSAIQLNGMIAELGPTEDGRNLEGG" FT CDS 1808..4915 FT /product="POL" FT /note="Conceptual translation, similar to POL." FT /translation="LPKFFRGKSLLPVIRRRVAGVPMKFLLDTGASKNFIR FT PRKELKGVRPVDSPFEIHSIHGTTIVTKKCFVSIFNLKATFFILPDFTIFD FT GIIGADLLTQAGASLCLASAQLKWGQEVEKISFHKCTDVNFTNVECADAPP FT LVKKAFLGMIRSRKNAFADPNEALPYNTSVVATIRTTSEEPIYAKLYPYPM FT GAADFVNNEIQDLLKNGIIQKSVSPYNNPIWVVDKKGTDEAGNQNRRLVID FT FRKLNERTIPDKYPMPDISMILGNLGKAKFFTTLDLKSGYHQIILAENDRE FT KTSFSVSGGKYEFKRLPFGLRNAASIFQRAIDDILREQIGKTCYVYVDDVI FT IFSENENAHVKHVDWVLKTISDANMRVSVKKSSFFKKSVNFLGFIVTSDGT FT TTDPEKVRAIKEFPEPKTVFEVRSFLGLASYYRCFIKDFAAIARPISDILK FT GENGSVSRHRSRHIQVQFNEAQKNSFEKLRNILASEDVMLRYPDYKKPFHL FT TTDASAYGIGAVLSQENRPITMISRTLKDREMNYATNERELLAIVWALAKL FT RHYLYAVKDITIFTDHQPLSFSVSDSNPNAKIKRWKGRIEETGAKVVYKPG FT KENLVADALSRQQINAIEEQDAESCGATIHSEISLTHTIETTDKPLNCFQN FT QLILEEARFPLKRSFVLFKNKKRHTINFTDKESLLNDLADAIVPKGVNALH FT CDLHTLATVQDDLVRRFPTTKFWHCKNRVTDIFGVEEKREIILAEHNRAHR FT SAQENVKQVLTEYYFPKMAKLANEIVQNCKTCAKAKYDRHPRKQEIGESPI FT PSHVGEMLHIDIFSTDKKYFLTCIDKFSKFAVVQHVPSRTIEDLKPALLQV FT MNFFPKAKVIYCDNEPSLKSHTITAMLDNHFGVSITNAPPLHSVSNGQVER FT FHSTLLELARCLKIDKGMSDTVEIILLATTKYNKTIHSVIDKRPVDAIQEC FT TDDTQKRIADKIKSAQDALRSRENASRQNRVFEVGEKVLVKTNRRLGNKLT FT PLCEEKAIQADLGTTVLIEGRVVHKDNLK*" XX SQ Sequence 6556 BP; 2100 A; 1551 C; 1402 G; 1503 T; 0 other; actggcgccc aaccagtggt aaaaaccacc acccaccacc accatcacca acaccgccca 60 cacagctact aaatttgcgg caaagcaaaa gacagcggcg cgggaaaagt gaagggaatt 120 agtgacaaaa taacacacac accaaaaaca aaaaattatg tgagtggaaa ccgatgctaa 180 acgcatttta agtctgtgaa tttctaaaaa aaaaaatata taataataaa agaattatta 240 atccggcaga tacgctaaaa catggtctcg catcaatgta aatgaaaaca gaaatcatat 300 aattgttttt tataacagcg catacgttcg cttttcattt attgttgttg ttttgtgttt 360 ccattcgatt caagcgcgac attcgacgct gcagcctttg tctttagttg ttgtttgttt 420 ggctttggtc aactgctgcc cacacggaca gctttttcat acctaagcga attggactgc 480 tcatgcagcc cagtttctca cagtatagga atagtacacc atactccacg gactcagagt 540 ccgattgtga aaatcaacct tcccctctaa gacgtacaac cgcgaattcc cctgcaatgg 600 cgatagaccc agaacaactc agagctgtaa tacaggcagt ggtggccaat actttggcgg 660 acgaggctgc caaaaacgaa ctcgcacaaa atgaaatgcg tcagaaaatt caggagctgg 720 ccagccagtt ggcagcaaca caggtcgcgc ccacctcagc agtggccccc gcaattagag 780 cttacgaacc catagacata actagtggtg taccatgtaa catgacttta gacgccgtta 840 aatatttgcc tgaattctcg gggtcacagg aatcatatgt ttcgtggcga caagcggcgg 900 ttgctgcgta tcgcatattt aaagattaca acggcacctc tcgccattac gaggcggtga 960 caattattag gaacaaaatt aggggcgcgg ccaataacgt cttatcctcg tttggcactg 1020 tactgaattt cgatgccatt ataaatcggc tcgatttcac ttacagcgac aaacgcccaa 1080 tgcatgtcat tgagcaggac atgagcactt taagacaggg aagcaacatg accctgttgg 1140 agtattatga tgaggtcgag aaaaaactca ccttactcac aaataaagcc catatgtctc 1200 acgagccatc ggcggcaaat atattgtgcg aaaaattccg agaggacgcc ttgcgtattt 1260 tcatctcggg acttaaacgc aacctcactg acgtgctttt cgcggcgaag ccgaaagaca 1320 tgccgtcagc cttggcctta gcgcaagaag tggaatctaa ccacgagagg tacgtttttg 1380 cagctagttt tgcaagaagt caagaagaca aagatcgcaa acctactgca aaagcgcagg 1440 gtcgtcaact tgacaagtac gaccgccatg acccacagcc tagtaatacc aaaaacccgt 1500 attttaatag gcagcataag gcacaggttc acgccgaccc gcaaggtacc aaaaattaca 1560 aagacgacgg cccaccagag cctatggagg tcgatccctc agtttctaaa ttgatgcagc 1620 cgacccaggc aaatgcctac cggaacagaa gaccagccac ctccgagcgc acaggcgccg 1680 tgaggaaaca acaaaaagtt aacttcatta cgcagaatgc ggaagaaaag acaggagcgt 1740 atgccgccgc cgcttccaaa gcggaatcga aaatagacga cgatgccatt accgggtatg 1800 attctgacta cctaaatttt ttaggggaaa atccctgtta cccgtcatca gacgaagagt 1860 agcaggggta ccaatgaaat tccttttaga caccggcgcg tcaaaaaatt tcatccggcc 1920 tcgcaaagag ctaaagggtg tccgcccggt ggactctcct ttcgaaattc attcaattca 1980 cggcaccacc atcgttacaa aaaaatgctt cgtctcgatt tttaatttga aagcgacttt 2040 tttcattcta ccagatttta caatattcga cggtataatc ggggccgatc tgttaacaca 2100 ggccggtgca tcactttgcc tcgcttccgc ccaactcaaa tggggtcagg aagttgagaa 2160 gatttcattc cataaatgca ctgacgtcaa cttcactaac gtggagtgcg cagatgcacc 2220 acctttggtg aagaaggcat ttctagggat gataaggagc cgaaaaaacg ccttcgcaga 2280 tcccaacgag gccctgccat ataacacatc ggtagtggcc acgattagga ctactagcga 2340 agagcccatt tacgctaaat tatatccata ccccatgggg gcagcagatt tcgtcaacaa 2400 cgaaatccaa gacctgctta aaaatggtat aattcaaaag tcggtgtctc cttataacaa 2460 cccaatatgg gtggttgata aaaaaggaac cgacgaggcc ggcaatcaaa atagacgatt 2520 agtcatagac tttcgtaagc taaacgaaag aactatccca gacaaatacc ctatgccaga 2580 tatctccatg atactgggca acttgggcaa ggccaaattc ttcacgacac tggacctcaa 2640 gtccgggtat caccagatca tcttagcgga aaatgaccgc gaaaaaactt ccttctccgt 2700 aagcggaggg aagtacgaat tcaagaggct tccctttggc ttgagaaatg ctgccagcat 2760 cttccagaga gccattgatg acattctcag agagcaaatc ggcaagactt gctatgtcta 2820 cgtcgacgat gtaataatct tctcagaaaa cgagaatgct catgtcaagc acgtggattg 2880 ggttttaaaa accataagcg atgcaaatat gagagtctca gtgaaaaagt caagtttttt 2940 taaaaaaagc gtaaactttc ttggctttat agtcaccagt gatggcacta ccaccgaccc 3000 agaaaaagtc agggccataa aggagttccc tgaaccaaaa acagtatttg aggttagatc 3060 atttctaggt ctcgcgagct attacagatg cttcattaag gactttgcag ccatagcaag 3120 gcctatttcg gacatcttaa agggcgaaaa cggaagtgta agcagacaca gatcgcgaca 3180 catccaagta caattcaatg aggcgcaaaa aaattctttc gaaaaactgc gcaacatttt 3240 agcatccgaa gatgttatgc tccgataccc ggattacaag aagccattcc atctaacgac 3300 ggatgcttca gcctacggta ttggagcagt gctttcacag gagaaccgtc ctattacaat 3360 gatctcgagg acattaaagg acagggaaat gaactacgcc acaaatgaaa gggaattatt 3420 agccatcgtt tgggctttgg ccaaactgag acactattta tatgcggtga aagatataac 3480 tatcttcacc gaccatcaac cattgtcatt ctcagtatca gactctaacc ctaacgcgaa 3540 aattaaaagg tggaagggtc gcatcgagga aacgggtgcg aaggtagtct ataaaccggg 3600 aaaagaaaat ttggttgctg atgccctgtc taggcagcaa attaatgcca tagaagaaca 3660 ggacgcagaa tcatgtggtg cgaccattca cagtgagatt tccctcactc acaccataga 3720 aactacggat aagcccctaa attgcttcca gaaccaacta attctggaag aggcccgctt 3780 tccgctaaaa cgctctttcg ttctctttaa aaataagaaa cgacatacaa tcaacttcac 3840 tgacaaggaa tcattactca atgaccttgc ggacgcaata gtccccaagg gcgtaaacgc 3900 cctccattgc gatttgcaca cgctagcaac ggtgcaggac gacttagtcc ggagattccc 3960 gactacaaaa ttctggcact gcaaaaaccg tgttacagac atttttgggg tcgaggaaaa 4020 aagggaaatc atattggcag aacacaatag ggcccaccga tcggcccaag aaaatgtgaa 4080 gcaggttctc acagaatact acttcccaaa aatggccaaa ctggccaatg aaattgtcca 4140 aaactgcaaa acatgcgcta aagctaaata tgataggcat cctaggaagc aagagattgg 4200 cgagtctccg attccctctc atgtaggaga aatgctacac atagacattt tctcaacaga 4260 caaaaaatat tttctcactt gcatcgacaa gttttcgaaa ttcgccgtcg tacagcatgt 4320 accgtcaaga acaattgagg acttgaaacc ggccttgtta caggtcatga atttcttccc 4380 aaaggccaaa gtgatttact gcgataacga gccatcgtta aaatcgcaca cgatcacggc 4440 catgctagac aaccattttg gcgtcagcat cacgaatgcg ccgccccttc acagtgtctc 4500 aaacgggcaa gtggaacgtt ttcacagcac cctactagag ctcgccaggt gcctaaaaat 4560 tgacaagggc atgagcgata ccgtggagat aatcttgtta gccacgacca aatataacaa 4620 aacaatccat tcggtcatcg ataagaggcc agtcgacgca atacaagaat gcacggacga 4680 cacacaaaaa cggattgctg acaaaattaa gagtgcccaa gacgcgctaa ggtctagaga 4740 gaatgcttct cggcaaaata gagtcttcga agtgggcgaa aaagttttgg tcaaaactaa 4800 tagaagactt ggcaataaac tcactccctt gtgtgaggaa aaagccatac aagcagacct 4860 ggggaccacg gtcctcattg aagggagggt ggtccacaaa gacaatttaa aatgactctc 4920 tcccttaatt tttaattttt tattatcact acctattatc agccgtttgg cgaacttcca 4980 tttaatagac attaagtacg ttgccaaagg cgcgatgttt tagttttaat tggtattaat 5040 atagtattaa ttggtgatgg gataaacctt agcgaacaag aaaaagcgac taaaaaatac 5100 atttccacag gtgcagaatt ttcatcctcc tctcactggc attagtgtca gcgcacgtca 5160 ctaactattc gcaaacaaag tatatcccca tcatagatgg agagatcctg gtatgggagg 5220 agctcgcata tgtgacccac tcggcaaatc tctcggaata catgcgcgta atagaagaaa 5280 caagcagcat gaacgagatg tttccgcagt ctcatatgag gaaattgcta gaagtggata 5340 ccttgcacct tcgagacacg ctggattcgt taaaagtcca ccacagaata gcaaggagtt 5400 tagattttct aggctcaatg ctaaaggtag tagcggggac gccggatgcc ggcgatctag 5460 aaagaattag gttcacagag tggcagttga cacaatcaaa caataggcaa atcaaaatta 5520 atactaggat acagagtcga atcaaccaat taacaacaac agtaaatcaa atcctgcaaa 5580 cacataaaaa tacccaaatt gacaccggcc atttatacga gacactgctg gctaggaata 5640 gaatactaat gatggagtta cagaatttaa tgttggctgt aacgctggca aaaaacaata 5700 ttgtcagtcc aaatatccta gatcatgcag acttaaaatc agtttggctg aaagaaccca 5760 ccgatacccc cataggggat cttatgtccg catcgtctgt aaaaatactg caatcctcta 5820 acatgatgca ctttattatc aaatttccca aaataaaatt atcttgtaag aaggtcacta 5880 ttttcccagt cgcttgcaaa ggagttatgc tgcggataac cgacaacatg gtagcaaagt 5940 gtggtgaagt agtccacaca atcaaaaatt gcatcccaac accgggggct accttttgcc 6000 aactatcaac ggaaagctca tgcgccaggg aactacacgc aggcgtccta gcgcattgcg 6060 agtcgcaacc aagcgaccta cacccgctca cccgcgtgga tgagggtatc atcatcatca 6120 atgaccggcc agccagggtc acggtagaca acgaaacaga agtctaccta cgcggcacac 6180 atctcatcac cttcaacgat cgcgtcgtga taaatgacac cacctttgta aatcatgaca 6240 aggcccaaac aagagctcca ggggtagcga attccccatc attgaatatc accgccaaca 6300 gagatattct gagcctcccc taccttcacc agctaagtga acgtaacttg gagttcatca 6360 aagagttcgg gagagaaatt gatactgatc attctcatcg cataatattt gtcgcagcag 6420 cgatttgttg tgcattggtt tgtatcggca tcgcctgtct acggtacatc ggagcgcgga 6480 gatctgcaat ccagttaaac gggatgatcg ccgaattagg accaaccgag gacggccgta 6540 atcttgaagg gggagt 6556 // ID Gypsy-40_CQ-LTR repbase; DNA; INV; 248 BP. XX AC AAWU01014791; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_CQ_; KW Gypsy-40_CQ-I; Gypsy-40_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-248 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 460-460 (2011). XX DR GenBank; AAWU01014791; Positions 8468 8715. XX SQ Sequence 248 BP; 68 A; 52 C; 54 G; 74 T; 0 other; tgtagtgttc ctcgatcagc agctataaca gtgctaccaa cgtgcctact agtgttgcca 60 gatggtacga agtattgtaa agagaattat agcactaagg ttacttgctg gacatcgcag 120 ttggatcggc ctcttctcct gccgaccgtc ggccagtcag gtcgtcttac ggataataaa 180 ttacatatgt aatatatgta tgttagtctt agtattagac aacgcgtgtt tcaattaccc 240 aagtaaca 248 // ID BEL3_Cis_LTR repbase; DNA; INV; 431 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL3_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-431 RA Smit A.F.; RT "BEL3_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000162; 4-5% div. XX SQ Sequence 431 BP; 106 A; 65 C; 84 G; 174 T; 2 other; tgttaatgcg agagtttttg ttgatttaga ttattatttc ctgttttctg tgacgtcatt 60 attttctctc atttgccttt ttatcctttg ttctctttgt ttttgtacac gtgtaaggcg 120 tggctttagt ttatttcttt agtattttcg gtgacctttt tagttgtacg tgagagtccg 180 tggtgaaacg accaaaagcg ttgaatagnc aaagcgattg tgttaaatag aaagaaagat 240 taatctgtag tgaacggatt gacctgccta atcccccaaa ctcaggtatt taatttatat 300 tcttctgtgt ttattattta ttgtttcgta ttatttatat tgtgttaatt cgtttcagtt 360 accgcatttc aataaacttc gctggagaaa agcgcgtaac tgctaagggg caatagggcn 420 aattcagaac a 431 // ID LanceleTn-3b repbase; DNA; INV; 205 BP. XX AC . XX DT 15-MAY-2006 (Rel. 11.05, Created) DT 15-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Non-autonomous DNA transposon - consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; LanceleTn-3b. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-205 RA Osborne P.W., Luke G.N., Holland P.W.H. and Ferrier D.E.K.; RT "Identification and characterization of five novel miniature RT inverted-repeat transposable elements (MITEs) in amphioxus RT (Branchiostoma floridae)."; RL Int. J. Biol. Sci 2(2), 54-60 (2006). XX DR [1] (Consensus) XX SQ Sequence 205 BP; 62 A; 36 C; 36 G; 59 T; 12 other; tactctccaa gcagaggtta ggctccggct gttttttaac gtttttttag tcgtttttat 60 cgggctttct attttktatt ntatcttgyw ktgtcaaaan cttaggcygg nnnacntcaa 120 raaaaatgac aaaatagaaa gcccgataaa aacgactaaa aaacgttaaa aaaacagccg 180 gagcctaacc tctgcttgga gagta 205 // ID SR1 repbase; DNA; INV; 2337 BP. XX AC U66331; XX DT 13-MAR-1998 (Rel. 3.02, Created) DT 13-MAR-1998 (Rel. 3.02, Last updated, Version 1) XX DE Schistosoma mansoni SR1 non-LTR retrotransposon, partial DE reconstruction of a retrotransposon, reverse transcriptase, DE partial cds. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW CR1 superfamily; LINE; SR1; reverse transcriptase. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-2337 RA Drew C.A. and Brindley J.P.; RT "A retrotransposon of the non-long terminal repeat class from the RT human blood fluke Schistosoma mansoni. Similarities to the RT chicken-repeat-1-like elements of vertebrates."; RL Mol. Biol. Evol 14(6), 602-610 (1997). XX RN [2] RP 1-2337 RA Drew C.A. and Brindley J.P.; RT "SR1."; RL Direct Submission to Genbank (08-AUG-1996)Molecular Parasitology RL Unit, Queensland Institute of Medical Research, Post Office, RL Royal Brisbane Hospital, Brisbane 4029, Australia. XX DR GenBank; U66331; Positions 1 2337. XX CC The genomes of representative species of fishes, amphibians, and CC reptiles contain non-long-terminal-repeat (non-LTR) CC retrotransposons CC showing strong sequence identity to the chicken CR1 non-LTR CC retrotransposon from birds. These non-avian retroelements have CC been CC termed CR1-like elements. SR1 is CR1-like non-LTR retrotransposon CC from the human blood fluke Schistosoma mansoni. CC SR1 elements possess atypical 3' termini consisting of the tandem CC repeat (AACCATTTG)2 which are similar in structure to the CC imperfect CC tandem repeat of the 3' termini of CR1. There are at least 200 CC copies CC of SR1 interspersed through the genome of S. mansoni [1]. CC Although other non-LTR retrotransposons have been described in CC invertebrates, this is the first CR1-like element reported from CC a non-vertebrate taxon. XX SQ Sequence 2337 BP; 673 A; 577 C; 454 G; 630 T; 3 other; gaattcctag ttgacgacct ctcaatcctg gctcccctgg ggaaaagtga tcacgccgtc 60 ttatcattca gctttgtcag caaaacggag ctacgatatc ctactagcaa caagcgctgg 120 aacttcaaac ggttgaatgt gttagcttta caggactatc tacaacaggt ggattgggat 180 gttcaccctc aacttgaagt ggatgctcat tgggattttt tactgcacac gatcttatgt 240 gctactgagc attcagttcc taaaatggtc ccaaaaagct acaagcaacc tccaatcatc 300 aagaaccgca ctcgtcgttt gctaagccgc aaaaggcact gttgggctga atataaacga 360 acbgataaca acggcgcgta caggcaatac aaacatataa ggaacatatg bacaaaggca 420 ataagagaag acaggcttca gttccagacc aagcttatcg ataaattcgt ctccaatccg 480 aaaagcttat tcagttacgc agcttctctt cgacaaggca aaactggagt ttcccaactg 540 cttggtccta atggcccgac caataacgac agtgatgccg ctaacctttt ggctgaacaa 600 tactcccaga cattccagct gacccacatc aaccatactg acgaaagctt cacctgcacc 660 tgtacaggac tttccgaagt ggatctgagt gctgacctgg tgctccgtaa actgcagcac 720 ctaagaaaag acacttctcc tggtccggat atggttcatt ccgctgtttt gagggaagca 780 gcttcaatcc tggcgacacc acttagcgtg atgtttgcac actcgctaag cagaggcaaa 840 ctaccggaaa tttggaagct ggcccacatc acaccaattt tcaaaggagg tcgacgcagt 900 gaaccctcaa gctaccgacc agtggccctt ctctccatac cttccaaaat tatggaatcc 960 ctaatatacg acggtatatt agaatactta tcatcctcaa agttcttctc acctcaacag 1020 catggtttca gaaaaggtca ttcttgtatg accaacctgc tgactgcggt ggatagatgg 1080 acaaccatcc ttgatcgcaa ggggaaggtt gacgtcatct acctggactt ctcaaaagct 1140 tttgataggg tcaaccatat atgtcttatc aagaagctta gacgattggg tataaaaccc 1200 cctttgattg attggctctc ttcatattta gaaaaccgac actttaaggt cagggttaac 1260 ttcactctct ctcaggctat ggaatgtcct agtggggtcc cccagggctc aatactagga 1320 cctcttctct tcttgattta cattaacgat cttcctcaac aagtttcatc tgacttattg 1380 ctttttgctg atgatgtgaa actttggaga gagatacgta atcataatga tatactagtt 1440 cttcaggagg atctgacccg acttcaaagt tgggcagacg acaacggact taccttcaac 1500 acttcaaagt gcaaagtagt ccatctgaga catgttgcag accatagtta taacttaggt 1560 aactcccctc tagaagtttc ccaagtcgaa aaagatttag gagtgttggt accctatgac 1620 ctgaaatcgt atgcgaactg tgacaaaaac gcctttcaag caaaccttgc actggtaaca 1680 ttgaagcgca tttttggcca gtttgacggt agaaccttcc acataatctt caacagtttt 1740 attcgtcccc atttagagta cggaaacata gtatttcctc cctccctcca aaaggataag 1800 gacactctgg aacgtataca acgtcgagcc acgaaatcag ttcggggact caaattcaaa 1860 ccttacgaag agcgcctcca atcacttaac ctttacccgt tagagtacag gcgtcttaga 1920 ggtgaccttc ttatgactta cagtatcctt aatacttctg gtcatcccct taaacatctt 1980 cttaagctta gtcataacac taacctmaga ggtaacaccc agaaattgga gaccctatat 2040 agcagaacag actgcagaca caacttctac tccgttagag ttgtcaagtg ctggaattcg 2100 ctgccgactg agctagtcca agcgacctcc caggagtcct ttaagaggaa acttgactta 2160 ttcttaagga ctaaggataa catattatta tgatttacca aattcttttt ttccctctat 2220 tatcgttcat atacctaggt ttttgcctgg aggtattggt gatccactgc tactagacac 2280 ggaagcccgt taagcgaaag cttcttctat tccatcctca accatttgaa ccatttg 2337 // ID BEL-27_CQ-I repbase; DNA; INV; 7395 BP. XX AC AAWU01040532; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-27_CQ_; KW BEL-27_CQ-LTR; BEL-27_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7395 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 207-207 (2011). XX DR Genome; AAWU01040532; Positions 20930 28324. XX CC Positions [5160-5744] - Integrase core CC 'ATAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 696..5918 FT /product="BEL-27_CQ-I_1p" FT /translation="MDELKDLLKQERQLILTINGVGDFVEAYKKAEHENQI FT SIRLDTLEEAMRKFFKVRRKIEAMIDDEDEDEVVGESKEARKKRLADLVVQ FT REMEYNKALRDVEERYFVVKARLVALRPVKVEPTPGADLNETCFDRSISRI FT KLPDIKLPNFSGELKDWIPFRDTYKSLIHSNVQLPDIDKFTYLRSALQGEA FT QLEILSVDFSAEGYDVAWKALEKKYDNHKLIVKAYLDAIFDIEPLRKESFG FT GLSHLISEFETNLQMLKKLGEGTEAWSTILVHMLCARLDHATLRLWESHHN FT SKAVPKYDVLIEFLRDQCTVLQSIKANRPSEGDGRQNRSRISTAHTSSQSQ FT RRCLFCGETFHMPFNCSKLRNMSVSQRVEEVNLRRLCRNCLNAGHYADGCS FT RGSCSRCGARHHTLLHYDTPAPAGARRGRSSVQNTQNRPPAAGQQQHTRQQ FT GQTQNSSTQPTTSSYPVHQSTSRNTHPPPPTDSRNSTTLNAASLPAQNAPT FT LSRQVLMSTAVVRVEDQFGNYSLARTLLDSCSEFCYMTSTFSKKLKFRTTP FT DVLRVQGIGNGSATSLKAVRAKIQPRLDTISSFSEEMRFHLLQKISSDLPA FT TPVDVSQLMLPSDIILADPYFGEPGPIDMIIGAEFFLDLLSAGRRKIVEDG FT PTLQETVLGWIISGKVPASSPSIPRTATYVSSTVDLKELMERFWELESCHV FT NSTHSVEESTCEELFNKTTIRDAEGRFVVTLPKKQRVIEKLGESRNMAMKR FT FIGMEKRFTTNPALKFMYTEFVHEYLLMKHMREVKEDSGEGPVYYLPHHAV FT LKPDSTTTKLRVVFDGSCGTSSGVSLNDALMVGPVVQSDLLSTVLRFRLHR FT VALIADVEKMYRQIRVTLSDQRLQRIYWRDNEDEPVKTYELSTVTYGTSSA FT PYLATRCLKKLGEDCAESHPVASRVIQEDFYVDDMLSGADSIEEASTLMKE FT VRQVTDSAGFTLRKWNSNCPELLKRLPKHLKDERSTLEIDPAKTTVKTLGL FT RWEVSTDMFCFVLPQWKSELSPITKRTVHSDSAMLFDPDGFLAPVVVQAKI FT ICQQLWRIKSDWDVPLDESLQQLWRDYRMSLMAVATIKMPRWIGFSTDCVE FT IQIHGFCDASERAYGAGLYLRCTALDGSVTCRLFLAKSKVAPMENLKRKKK FT KVNIPRLELSSGLLLSHMYEKVQSILPAAQLFCWTDSMITLGWLASPPSRW FT KPFVGNRVSEIQHITRNAIWGHVPGEENPADIISRGMSPALLQYRTDFYEA FT PRWVVQDRENWPRTQRVSLADFDPEILEERVTPAFPTQVRPPHWLFGLCGS FT YMELVRLVTWLQRSKFNLSPKNRAVRRVGFLKSEELEEAVLFLVRLSQEEC FT FPGEMHCLKADGVVHPTSKIARFNPQLVDGVMRVGGRLSNAKISTNRKHPL FT ILDHHHPFTRLVVMYFHERLFHAGQQLLIASVRSKFWPVNVHSLARQVIHE FT CISCFKSKPKVIEQIMADLPAERVNPAPPFLHVGVDYCGPFLVSYPNRRAK FT PVKCYVAVFVCLAVKAVHLELVFDLTSQAFIAALRRFVARRGKPLQIKCDN FT ATTFVGAKNDLMELHRLFYKQQFQDVVTKTALEDGIEFSFIPPRSPNFGGL FT WESQVKSFKTHLKKTFGLQVLKMDEMLTALAQIEAVLNSRPLTPISNDPQD FT FEALTPGHFLIQRPLTAIAEPDLEGVPQNRLAMWQNAQRFTQQLWKKWSTQ FT YLSNLRSGRRSETTSPLVRWC" XX SQ Sequence 7395 BP; 1743 A; 1963 C; 2075 G; 1614 T; 0 other; ttggtccttc gagccggatc gaagtggccg gcccgaagtg gaagaacctt tttgtgtgcg 60 cgcgtgaaaa gtggccagtt cgctgaacag tggccatcgc gattcggact gtgcgagtcc 120 gcggcagttg gcgccatcgc gcagttattt gcggctttgt gctgcgtttt ttatgctgtt 180 ctgcgcgtgg tgctgactgg aagaagcagc tggagcgatc gtggtctgct gggagctgta 240 catcggcggc tggcggacgg aagctactgc tgctgtgatc gagagtgatt gtgtgtgtgc 300 tggacgctgc ccatcgaccg gaggttcctg gaggagctgc agcgtaccag gaagtgtttg 360 ggtttgcatg tggtgcatga ggtgtgtgtg cgttggtgag gtgtgtgcca ttttgggttt 420 agagtgaatg agtatttttc gagagaagtt ttaataaaga tatttggatt ttttacgcat 480 ttgcgtttct ttccggagtt gtggtctggg ttttaattgc gcaagccttg atttttgttc 540 tggtctctgt cggtcggttc actgctggtt ctccaccttg gctgtctctc cccactgtca 600 atcgggtcgc gtagaacgtt gagttgatcg tttcggtgtt cgcagttgca gttttctgga 660 tttttggcgg tcaacggtga gtgaaggcgt gaagaatgga cgagttgaag gatttgctga 720 agcaggagcg gcagctgatt ctgaccatca atggtgtagg agatttcgtg gaggcgtaca 780 agaaggcgga gcatgagaac cagatctcga ttcggctgga caccctggag gaggcaatga 840 ggaagttctt caaggtgcgc cgcaagatcg aagccatgat cgacgacgag gatgaggacg 900 aagttgttgg agagtcgaag gaggctcgga agaagcggtt ggcagatctg gtggttcagc 960 gtgagatgga gtacaacaag gctcttcgtg acgtcgagga gaggtacttc gtggtgaagg 1020 caaggctggt cgcgctgcgt ccagtcaagg ttgagcccac tcctggtgct gacctgaacg 1080 aaacctgctt cgatcgatcg atttcgcgca ttaagttgcc ggacatcaag ttgcccaact 1140 tcagcggcga actgaaggac tggatcccat ttcgcgacac ctacaagagc ctcatccact 1200 ccaacgtgca gctgccggac atcgacaagt tcacatactt gaggtccgct ctgcaaggtg 1260 aggcacagct ggagatcctg tcggttgatt tttctgcaga aggctacgac gttgcttgga 1320 aggcactgga gaagaagtac gacaaccaca agctcatcgt caaggcgtac ctggacgcga 1380 tcttcgacat cgagccactg cgaaaggaga gtttcggcgg tctgtctcac ctgatcagtg 1440 agttcgagac aaacctgcag atgttgaaga agcttggcga aggaacggaa gcctggtcaa 1500 ctatcctggt ccatatgctc tgcgcgcgtt tggatcatgc cacgctgcgt ctgtgggaat 1560 cgcaccacaa ctcgaaagct gtgccgaagt acgacgttct gatcgagttt ctgcgtgacc 1620 agtgtacggt gctgcaatcg atcaaggcca accggcccag cgaaggagac ggacggcaga 1680 accgatcgag aatctcaacc gctcacacgt cctcgcagtc acaacggcgc tgcctgttct 1740 gtggagagac gtttcacatg ccgttcaatt gtagcaagct gagaaacatg tctgtgtctc 1800 aacgtgtgga ggaagtaaat cttcgtcggt tgtgcaggaa ttgtttgaat gctggtcatt 1860 acgccgatgg atgttctcgt ggttcgtgtt cccgctgcgg cgctagacat cacacgttgc 1920 tgcattatga cacgccggca cctgctggcg ctcgtcgagg aagatcctcc gttcagaata 1980 cgcaaaatcg acccccagca gctggacaac aacaacacac gagacagcaa ggccagacac 2040 agaacagttc cacccaacca accacaagtt cctaccccgt acaccaatcc acttctcgaa 2100 acactcaccc accaccaccc acagactctc gcaattccac caccctcaat gcggcatccc 2160 tccccgcaca aaatgcaccc acactgtccc gccaagtcct catgtctacg gctgtagttc 2220 gtgttgaaga ccagttcgga aactattcgc tcgctcggac acttctggat tcgtgttccg 2280 agttctgcta catgactagc accttttcca agaagctgaa gttccggaca acacccgacg 2340 tactgagagt acagggcatc ggaaacggct cggcaacgtc gctgaaggcc gtacgtgcga 2400 agatccagcc gcggttggat acgatctcgt cgttctcgga ggaaatgcgg ttccacttgc 2460 tgcagaagat ttccagtgat ctacccgcca caccggtcga cgtcagccag ttgatgctcc 2520 ccagtgacat catcctcgct gatccgtact tcggagaacc tggtcccatt gatatgatca 2580 tcggtgctga gtttttcctc gatctgctgt ctgctggtcg gcgtaagatc gtcgaggacg 2640 gtccgacgct gcaagagact gtgcttgggt ggataatctc tggaaaggtt cccgcatcgt 2700 cgcccagtat cccacgcacg gcaacctacg tcagctcaac agttgatctg aaggagttga 2760 tggagaggtt ctgggaactg gagtcttgtc acgtcaacag cacccactct gtggaagagt 2820 ccacgtgcga agagctgttc aacaagacga cgattcgaga cgcagaagga agattcgtgg 2880 tgaccttgcc gaagaagcaa cgagtcatcg agaagctggg cgagtccaga aacatggcga 2940 tgaagcggtt catcggcatg gagaagcggt tcacgacgaa tcccgcactg aagttcatgt 3000 acacggagtt tgtgcatgag tacctgctca tgaagcacat gcgagaagtg aaggaggaca 3060 gcggagaagg ccccgtctac tacctgccgc accatgcggt tctgaaaccc gacagcacga 3120 ccactaaact gcgcgtggtc ttcgacggat cctgcggcac ctccagtggt gtgtccctca 3180 acgacgcgtt gatggtagga ccggtcgtgc aaagcgatct cctctcgacc gtgctacggt 3240 tccgcttgca tcgagtcgcg ttgatcgctg atgtggagaa gatgtacagg caaattcgcg 3300 tgacgctgtc cgaccaacgc ttgcaacgca tctactggag agacaacgaa gacgagcccg 3360 tcaagacgta cgagctttcg accgtcacct acggaacgtc cagcgctccg tatctcgcta 3420 ccaggtgctt gaagaagctt ggagaagatt gcgcggaaag ccatccagtg gcgtctcgcg 3480 tcatccaaga agatttttac gttgatgaca tgctgtctgg cgcagacagc atcgaagaag 3540 caagtacgct gatgaaggaa gtccggcagg tcaccgattc agctgggttc acgctgagga 3600 agtggaactc gaactgccca gagctgctca agcggctccc gaagcacctg aaggacgaac 3660 gcagcacgct cgagatcgat cctgcgaaaa caacggtgaa gacgctggga ttacgatggg 3720 aggtgtcaac cgacatgttc tgcttcgttc tcccacaatg gaagtcggaa ctgtccccca 3780 tcacgaagcg cacggtccac tcggactctg cgatgctgtt tgatcccgat ggtttcttgg 3840 ctccggtggt cgttcaagca aaaattatct gccaacagct gtggaggata aagagcgact 3900 gggatgtacc gctcgacgaa tcgctgcagc agctgtggag agactatcgt atgagtttga 3960 tggcggtcgc aacgattaaa atgccgcgct ggattgggtt cagcaccgat tgcgttgaga 4020 tccagatcca cggtttctgt gacgcctcag agcgagctta cggtgccggc ctctacctcc 4080 gatgcactgc gcttgacggt tccgtcacgt gtcgactgtt cttggccaag tccaaggtag 4140 ctccgatgga gaacctgaag cgcaagaaga aaaaggtcaa cattccacgc ttggagttgt 4200 cgtctggttt gttgttgtcc cacatgtacg agaaggtgca gtcgattcta ccagctgcac 4260 agttgttctg ctggacggac tcgatgatca cgcttggatg gcttgcgtca cccccgtcac 4320 gttggaagcc gtttgttgga aatcgggtct cggagattca gcacattacg aggaacgcaa 4380 tctggggaca cgtgcctgga gaagagaatc ccgcagatat catctcgcga ggaatgtcgc 4440 cggcgctact acagtaccga actgacttct acgaagcgcc gcgctgggtc gttcaagatc 4500 gggagaattg gccacgaaca caacgtgtgt ccttggcgga ctttgaccca gaaatactgg 4560 aagaacgtgt caccccagca tttcccactc aagtgcgacc tccgcactgg ttgtttggac 4620 tgtgtggctc gtacatggag ttggtacgct tggtgacttg gctgcaaagg tccaagttca 4680 acctttcacc gaagaatcga gctgttagga gagttggatt cctgaaatcc gaagaattag 4740 aagaagctgt gctgtttttg gttcgactgt cgcaagaaga atgtttccct ggagaaatgc 4800 attgcctgaa ggccgatggt gtggtccacc caacgtcgaa gattgcacgc ttcaacccac 4860 aactggtgga tggcgtcatg cgagtcggtg gtcgtctctc gaacgcgaag atctcgacca 4920 accggaagca ccccctcatt ctggaccatc accatccgtt cactcgactg gtagtgatgt 4980 acttccacga aagactgttt cacgcagggc aacagctgct catcgctagc gtccgctcga 5040 agttctggcc tgtcaacgtc cacagcctag caagacaggt catccacgaa tgcatcagtt 5100 gcttcaaaag taagccgaag gtgatcgagc agattatggc ggatctgcct gctgaacgag 5160 tgaatccagc tccgccgttc ctccacgtgg gcgtggacta ctgtggccca ttcctcgtca 5220 gctacccgaa tcgcagggcg aagcctgtga agtgctacgt ggccgtattt gtgtgtctcg 5280 ctgtgaaggc tgtccatctg gaactagtct tcgacctgac gtcccaggca ttcatcgcag 5340 cattgagaag gttcgtggca cgccgcggca aaccgctgca gatcaagtgc gacaacgcca 5400 ccacgttcgt cggagcgaag aacgacctga tggagctgca ccggctgttc tacaagcagc 5460 agttccagga tgtcgtcacg aagaccgcgc tggaagatgg gatcgagttt agtttcattc 5520 caccacgctc accgaacttt ggcggcctgt gggagtcgca ggtcaagtcg ttcaaaactc 5580 acctgaagaa gacattcgga ctgcaagtac tcaagatgga cgagatgcta actgccctag 5640 cgcagatcga agctgtgctc aactcgagac cgttgacgcc gatcagtaac gacccgcagg 5700 actttgaagc gctaacgccc gggcatttcc tgatccagcg tcccctcacg gccatcgcag 5760 agccagattt ggaaggagtg ccacaaaacc gcttggcaat gtggcaaaac gcgcagcgtt 5820 tcacgcagca actttggaag aagtggagta cgcagtacct gtccaacctc cgaagtggac 5880 gaaggagcga aacaacgtcg ccgttggtac gatggtgctg atcaaggacg agaacgcgcc 5940 tccccagaag tggaaacttg gtcgagtact gcacgtgttt cgcggaacgg acggcaacgt 6000 gcgcgttgtg accgttcgca cggcaaccgg acgcttcgat cgagcgttct cgaaggtctg 6060 cgcgcttccc attcgggaca accaagaatt aaatcaatcg acgtctgatt aagcactcgc 6120 tgtgcgcccc ggcgctgctg cgtcatccat tctcgaagtt tatgttggtg tctgtaggat 6180 gtatgtgtgc atgttaagtt gtcgtagaag gaagttcact ggacgccgtc cagtctctct 6240 actcggtaga cacctgtcta ctcgagctca gcgagcaagg cgctcacctg gttttgggat 6300 tatatgttgt catgtcgtta atcacaccct atcatatctt cccacgctac cacgtctcac 6360 tggaggaccc atggatcctg ccagtcggtc tcggctgctc gtgcggcatt gagcttggcc 6420 agctcacgcg ccgatcgacc ttcccgacga ccacgaagta cgaagtcgaa gacatcccaa 6480 gccagcgaag gacaacgccc ccaggaattc accgatgatc tgcggatcga gcgccagaat 6540 tcacctggtg gggtaagttg ttccagttca cgcacgctac atacacctac tacatccaag 6600 ttctgtccga ggcacaccac caactacatc gacgtccgaa agtccacccg cacaaaacga 6660 atccgcaaca acagcacagc ggcgatcagc acaccggcga acgacacacc accagcgtaa 6720 cacctttatg gcctgcgggc ccagtatcat cgttacctgg ctgggtaagt tgttccggtt 6780 ttatagaaat ctacgcaagc atactacaac aaatctctcc ccgaggaaac ctgcaacgaa 6840 atcaacatca acaagatggc atttcaacga agaagaagaa ggcatcaacc gatgcacctg 6900 caccccaacc gtgtgcactc tgtggtcgac agtttgtccc acccgaagca gagcgtgttc 6960 cgctggagca tgcgacccac ccacaccgaa gattcgcagc gcaaggctgc accagaaggt 7020 cagcacacgg ccagcgagct ggcgaaggct gacgacgcga agcagaggct gcaacggcag 7080 caaaaacacc accacgaccg ctactcggcg gcgaccccca cgagaaggct gcagcggcag 7140 caccgaagca gcaggtgacc tgatgcaagg agcagagttg cagtagaagc agaattgcag 7200 cacaatccag ctgcgtcgtc cacgatacac ctcaccccac gatggcagca gttttgttta 7260 caacagatcg aatgaattat tgaagtgttg tgaagttagc cctaagagtg ttagatatgt 7320 aattagtatt agtgaaagac gaaggtagag aggtttttga aatggttgaa accacaggtt 7380 tcaaggcggg cggca 7395 // ID Saci-3_I repbase; DNA; INV; 4861 BP. XX AC BK004070; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version 4) XX DE Schistosoma mansoni Saci-3 LTR retrotransposon: internal DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Saci-3_INT; Saci-3_I. XX NM Saci-3_I; Saci-3_INT. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX DR Genbank; BK004070; Positions 108 4968. XX CC Key Location/Qualifiers CC CDS 176..1042 CC /codon_start=1 CC /product="gag protein" CC /protein_id="DAA04501.1" CC /db_xref="GI:44829174" CC /translation="MVDNDKNNINMIDSEIQVISFRPVPFIPHDPEVWFAALESQFEI CC RRITNQRQKYAYALESLPGDHLTAVREVVLNPNVPNVYDRLKDAILRHFLPSREERLR CC TLLARHPMGDAKPSHHLTRLQSLAGNMTADSEIVKELWFEALPVSIQPTLTALLEDTP CC LNKVALIADKILARVNTKGDHLVASFSRSNVDLDARRPPAHGDRDRCIHTRLSFRDRA CC NVPAPYVPRPRSQSRKAVTTRPKRASSKPRQKAVSEASPGWCWFHRAFGAGARHCRAP CC CSYKAGNFSAGE" CC CDS 793..4293 CC /codon_start=1 CC /product="pol polyprotein" CC /protein_id="DAA04500.1" CC /db_xref="GI:44829173" CC /translation="MYPYSSQFPRPRKCASPLRPPTQVTISKSRNDSPQAGIFQATPE CC GGFGSESRLVLVSPCFRSRCPSLSSSLLIQGGKLLSRRVNAAVLAGTSPQVGRLFYVH CC DYRTNARYLVDTGAQVSVVPIGNSKSQATMLRLRAANGSVIPTYGTRQLTVNLSNRRQ CC YLWTFIIADVPTAILGIDFLQHYELLVDSRRLQLIDTSSNSNFMGSKAHTNAYRITGV CC FHSRDDLFHVLFQKFPKLTKPLEETPSVTNRVVHHIVTRGPPVTARPRRLAPDKLAFA CC KREFDNLLATGIIRPSHSPWASPLHMVRKKDGVSWRPCGDYRALNTATRFDSYPIPHI CC HDITASLKGTTIFSKIDLVRAYHQIPVALEDIEKTAITTPFGLFEFLRMPFGLRNAAQ CC TFQRFIDSIVRDLDFVHVYIDDLLIASSNVDEHYQHMTLLFQRLSDNGIIVNPDKCEL CC GKKEMKFLGHVIKHEGILPCEDKVRTIMEYTVPSTLKELKAFLGLVNFYRRFISHAAE CC QLRPLTDLLRGNPRKLEWNDAARTAFSDIKTALAQATLLVHPDPSATLSIAVDASDFA CC IGAVMQQNISGSWQPLEFFSRRLTPTETRYSAFGRELLAAYCAIKHFRHAVEGRKFIL CC FTDHKPLTYALHTKSDRYSPRECRHLDYISQFTTDLRHVKGESNCVADALSRIQLNAV CC TLPVLDLPAMAAAQANDTSCTEAQQSTSLQCREVPLATSSGTILCDTSTGLPRPIVPS CC AYRRLVFDALHGLSHPGIAATLRLIAARYVWPSMNKDVRMWVKQCLQCQRSKVHRHVA CC APIGTFATPDARFDHVHIDIVGPLPPSHGYDHILTCIDRFTRWPEAIPITSITAETVA CC HRFVERWIAMYGCPSTVTTDRGQQFESALFSSLTRLLGTERIRTTAYHPASNGLVERF CC HRQLKSALRAHENNNWYETLPLVLLGIRTSLKADIQCSAAELVYGTTLRLPGEFFTPR CC SSTKFGESDYVQRLSAFMRTLTPVSTRIQHRQVALPRELSTCSHVFIRVDSVRKPLQQ CC PYEGPFHVISRHEKTFKVDRRGRIETVSIDRLKPAHVDDSAIPDKPRPNVRPIRASSG CC ISTSTSDPTLDAPKTSFSRPSQQHVSSAPSTDETSVSRPDLQTTPPLTADEIAGSRCS CC NETTVSRSGRRVRLPVRFLD" CC CDS 4205..4903 CC /codon_start=1 CC /product="ORF3" CC /protein_id="DAA04502.1" CC /db_xref="GI:44829175" CC /translation="MRLQAHDVRTRLPSHVPVAEYAYPYAFSTNLITNYGYSIGIYPY CC TTRKLHFSLFLSCFFVYPEPTPLTIAVLVSSTLVNLGESWPDHPRYSQRTQSISLTPN CC TYDIHLVPNMVILVASGSLAQSGFVLRLQRFWSGPLRTQPSDSGYKPLWCSMLIQAIN CC TARSFWYRSTSKTFGGNSRINDHSLFCQRFRPTIARFAISPTNETATYRKCKSFLAGG CC SCSDRPVSIRTRLE". XX SQ Sequence 4861 BP; 1226 A; 1377 C; 1015 G; 1243 T; 0 other; ctggtgagcc gtatttggaa cacgcaatac agctggtaac ccaatccatc gagcacaagt 60 caaacttggc caattctcca aaccagatca atccggtgag tctatttatc tatccacatc 120 tgttgcatat attcgtattg atatttttca atatataata ttttggcaac ttaatatggt 180 agataacgat aagaataaca taaatatgat agactccgag atacaggtta tcagttttcg 240 accggttcct ttcatccctc atgatccaga agtctggttc gcggcactgg aatctcaatt 300 tgagatccgc cgcataacca atcaaaggca gaagtacgcc tacgctttgg aatcattgcc 360 cggggatcac ctaaccgctg tccgtgaggt tgtcctcaat cccaacgttc caaacgttta 420 tgaccgtctt aaggatgcca tccttcgaca tttcctccca tcaagagagg aacgattgag 480 gacactttta gcacgtcacc ctatgggtga tgctaagccg agccaccacc tcacgcgcct 540 gcaatccctt gcaggaaaca tgacagctga ctctgagata gtcaaagagt tgtggttcga 600 agccttacca gttagtatcc agccaaccct tacggctctg ctcgaggaca ccccgctcaa 660 caaggtggcc ctcatagcag ataagattct ggcacgagtt aacaccaaag gcgaccattt 720 agttgctagt ttttcccgtt ctaacgttga tctcgatgct agacgacctc cggctcatgg 780 ggatagggac agatgtatcc atactcgtct cagtttccga gaccgcgcaa atgtgccagc 840 cccctacgtc ccccgaccca ggtcacaatc tcgaaaagcc gtaacgactc gccccaagcg 900 ggcatcttcc aagccacgcc agaaggcggt ttcggaagcg agtccaggtt ggtgctggtt 960 tcaccgtgct ttcggagccg gtgcccgtca ttgtcgagct ccctgctcat acaaggcggg 1020 aaacttctca gccggcgagt gaatgcggcc gtactcgccg gcacttcacc tcaggttggc 1080 cgtttatttt acgtgcacga ttatcgcacc aacgctaggt accttgtgga tacgggtgcc 1140 caagtttctg tcgtacctat cggtaacagt aagtctcaag ccactatgct tcgactacgc 1200 gctgcgaatg gctcagtcat tcccacctat ggtacacgac aacttacggt caacctgagc 1260 aaccgacgac agtatctgtg gacgttcatc attgccgatg ttcccacagc tatactcggt 1320 atcgatttcc tacagcacta tgaattgcta gtcgattcac gtaggctgca gctaattgat 1380 acttcgtcga acagcaactt tatgggctct aaagcccaca caaacgcgta ccgaatcaca 1440 ggtgtatttc attcgcgtga cgatttattc cacgttttat ttcagaaatt ccccaaatta 1500 actaaacctc tcgaggagac tccatcggtg accaatcgtg tggtacacca catagtcacc 1560 cgcggaccac cagtcacggc aagacctcgc cgactggcac cggacaaatt agctttcgct 1620 aaacgtgagt ttgacaattt actagctact ggtattattc gtccttctca cagtccttgg 1680 gcctcacctc tccacatggt tcgaaaaaag gatggggtta gttggagacc atgcggagac 1740 taccgagcgt taaacacagc tacacgtttc gatagctatc ccatccccca catacatgac 1800 atcacggcat cactcaaggg cacgacaatt ttttccaaga tcgatctggt acgagcatat 1860 caccagatcc cagtcgctct tgaagatata gagaaaactg ctatcacgac tcctttcggt 1920 ttatttgagt tcctacgaat gccatttgga ttacggaatg ctgctcaaac tttccaaagg 1980 tttatcgata gcattgtacg agacctagat tttgttcacg tctatattga tgacctgcta 2040 atcgcatcat caaacgtaga tgaacattat caacacatga cgctactatt ccagcgcctt 2100 tcggataatg gaataatagt caaccccgat aagtgtgaac tcgggaagaa ggaaatgaaa 2160 ttcttaggtc atgttattaa gcatgagggt attctacctt gtgaggataa agtacgtact 2220 ataatggagt acactgtacc gtccacactc aaggaactga aggcatttct cggtttggtc 2280 aacttctacc gacgcttcat ttcgcacgcg gcagaacagt tacgaccgtt aaccgattta 2340 cttcgcggta atccacgcaa actggaatgg aacgacgccg cacgtactgc attttcagat 2400 atcaaaacgg ccctagctca agccacactt ctcgtgcatc ctgacccatc ggccacgctt 2460 agtatagcgg ttgatgcatc ggatttcgcc ataggagccg ttatgcaaca gaatatctcc 2520 ggtagttggc agcccctcga atttttctca cgacgcctca ctcccacgga gacgagatat 2580 agcgcttttg gtcgcgaact gctagcagcc tactgcgcca tcaaacattt ccggcacgct 2640 gtagaaggtc gtaagttcat cttattcacc gaccataagc ccttaacgta tgctctgcat 2700 accaagtctg accgctactc accacgagag tgcagacatt tggactatat ctcccagttc 2760 acgacagacc ttcgtcacgt caaaggcgag tcgaactgtg ttgctgatgc tttatcacgt 2820 atccaactga atgcagttac tttgccagtg ctcgatctac ctgctatggc cgccgcccaa 2880 gcaaacgata cttcatgcac ggaagcacaa cagtctacgt cccttcaatg ccgggaagta 2940 cccctagcta ctagctctgg tactatccta tgcgatactt ccacgggcct ccctcgaccc 3000 atcgtaccct ccgcttatcg ccgtctcgtc tttgatgccc ttcatggtct atctcatcct 3060 ggtatcgcag ccaccctacg cctcatagct gcacgatacg tctggccgtc aatgaataaa 3120 gatgtccgta tgtgggtaaa acaatgttta caatgtcaac gatcaaaagt gcacaggcac 3180 gtagctgccc ctattggcac tttcgctacg cctgatgctc gcttcgatca cgttcacata 3240 gacattgtag gaccattacc accatcgcac gggtatgatc acatactcac atgcattgat 3300 cgtttcacaa gatggcccga agctattccc atcacgtcta ttacggcgga gacagttgct 3360 caccgcttcg tagaacgatg gatagctatg tacggttgtc cctcgactgt cacgactgac 3420 cgaggacaac aatttgagtc cgcattattc tcctcactaa cacggctgct tggtacggaa 3480 cgcatacgca ctaccgccta ccatccagca tcaaacggtt tagttgaacg gtttcatcgc 3540 caacttaaaa gtgctcttcg agcacacgaa aacaacaatt ggtacgaaac ccttccgctc 3600 gtcctcctgg gaatcagaac gagtctaaag gcagatattc aatgttccgc cgctgaactt 3660 gtttacggca cgacattgcg tctgcctggg gaatttttca caccacggag cagcactaag 3720 ttcggcgaat cagactacgt ccaacgactg tctgcattca tgcgaacact gactccggtg 3780 tcaactcgta tacaacatcg acaggtcgct ctccctcgag agttatctac ctgttcacat 3840 gttttcatac gagtagattc ggtacgcaaa cctctacaac agccttacga aggacctttt 3900 cacgtgattt cccgtcacga aaagaccttc aaggttgatc gacgtggccg catcgaaaca 3960 gtcagcattg atcgtctcaa gccagcacac gtcgatgaca gtgctatacc tgataagccg 4020 agacccaatg ttagacccat cagagcttct agcgggattt ctacatctac ttcggatccc 4080 acgctagatg cacctaagac ctcattctca cgtcccagtc aacagcacgt gtcatctgcc 4140 ccgtctacgg acgagacttc cgtctcacgt ccagacctgc agaccacacc gcccttgact 4200 gcggatgaga ttgcaggctc acgatgttcg aacgagacta ccgtctcacg ttccggtcgc 4260 cgagtacgct tacccgtacg ctttctcgac taacctcata accaactacg gatacagtat 4320 aggaatctac ccctatacta cacgaaagct acacttttct ctttttcttt catgtttttt 4380 tgtttacccc gagcctacgc ctctgaccat cgctgttctg gtatcgtcga ctttagtcaa 4440 tcttggcgaa agctggcctg atcacccacg ctatagtcaa cgcactcagt cgatctctct 4500 gactccaaat acgtatgata tacatttggt cccgaacatg gttatcctgg tcgcatctgg 4560 ttccttggct caatctggtt tcgtcctacg tctgcagcgt ttctggtccg gacctctacg 4620 aacgcaacct tcagattctg gatacaaacc actctggtgt agcatgctaa tccaagcaat 4680 caatacggca cgttccttct ggtaccgttc tacgtcaaag acttttggtg gcaactcgag 4740 gatcaacgat cactccctgt tctgccaacg ttttcgtcct acaattgcac gtttcgcgat 4800 cagtcccacg aatgaaaccg ctacgtatcg aaagtgtaaa tcctttctag cggggggctc 4860 c 4861 // ID Gypsy-105_AA-I repbase; DNA; INV; 5691 BP. XX AC supercont1.2; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-105_AA_; KW Gypsy-105_AA-LTR; Gypsy-105_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5691 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2; Positions 5302042 5307732. XX CC 'AAAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 499..1992 FT /product="Gypsy-105_AA-I_1p" FT /translation="MADSGKYVRSTVDGRDETNENEHERDESIESGHEQDG FT NASEIDTEVSHGGDNSELSTNGSTTDMDSENGSSVSHCSSEYSTGSVDSTN FT DSHDQDTPSKIKEPSTDREITLDKQANQDKRMEEIEKVLVDLSRAITQLQP FT GIQQLASQPANPEADQSWNFSQQPAQPSSSYTTVRPEQIKSFPNGVRPNKM FT WAEWFDFIENFELALSLHHGNDPVYKVKLLYLSLGQELQTIVKAANLRPSL FT TDAHCYTAFVKNIENHLRSMTDIAAEHQAFLKMQQGKDVSTVAFHARLLKG FT AKLCDYAGATDRFVRAQLLNGLRNKELVKAARVYSYDTNFIVQSSTRDETY FT QEETADPIMDTNILEIGQGSGNYRKRTIQRQQSTYPHAKQHRMNSDVMWNS FT QRRSDQPRRGDNINQHARNDQLDPQAQGRRTRCPRCNNFFHRNLQCPALSR FT TCDVCGKRGHFAKACRSGPRTGRPKAVRLISEQPDAESSGDENMDYKKQVK FT QG" FT CDS 2308..4086 FT /product="Gypsy-105_AA-I_2p" FT /translation="MPVECTFKAVIVVPETNKPSVVATFYVVPDGSRSLLG FT RSTAHDLKLLQIGKNVNSLESTHDSTFPKMPGVQVKFSVNPTIPPTKNAYY FT NVPAAFREAARERIHDMEKRGIIEKVTKAPHWISGMSAVAKGKSDFRLVVN FT MRAPNRAINREYYRLPLLDEMRVKLHGAKYFSKLDLSNAFHHLELDEESRD FT LTTFLAENGMYRFTRLMFGVNCAPEIFQREMARLFTSMDNVIVYIDDILIF FT AETLDELRQTVDKVHQILRENNLTLNTTKCEYDKSSIKFLGHLLDGDGFHV FT DEEKIRSLRNFREPSTISELRSFLGLASYISAYVRNFAGLSRPLWDTVTKK FT TWTWGKEQKEAFERIKDHIVHCTTALGFFSDTDKTILYTDASPVALGAVLV FT QESRNHNPRIISFASKALTVTEQKYAQNQREALSAVWGVEHFSYFLLGRHF FT TLRTDAQGVAFILNRSREESKRALTRADGWALRLSPYSYEVEYVVGKDNIA FT DPPSRLYIGEDEPFNDDVSPWEIGRLEANSIEILTEEEIVQATSLDAALQH FT VVTALETGQWSNVDKTYYNIREELSMRDGIIIKTGCAIIPASLQVQ" FT CDS 4188..5270 FT /product="Gypsy-105_AA-I_3p" FT /translation="MKSIMRQRVWWPGLASMVQKWVESCMTCLTNGKPERP FT PPMQRIFAPKVVWEAIAIDFNGPYVLLGGISILVIIDYRSRYLFAKPVKST FT SFECTKKVLEEIFEIEGYPKSIRSDNGPPFNGSEYKAYCEQRGIKTVFSTP FT LYPQQNGLVESSMKIINKAMSAAISNKNNYIVELREAVHAYNAASHSITRF FT PPEEIMLGRKVRRGLPLINPERTPMNDALLDERDRKAKLQGKQREDLRRGA FT RKSRVLPGDSVIIQRQTRSKGQSRFSPTRYTVADENNGSLLLHSADGQSIK FT RHVTQTKRVFQEQIVPFPGDAADVSAPNTLSEPSDAANAPEPAKHCSRRGD FT RPKKIPAYLDQYVRSIQQ" XX SQ Sequence 5691 BP; 1817 A; 1321 C; 1239 G; 1314 T; 0 other; ttggcgatcc ttgccagttg aaaaaaaccg aactgagctt tgcatacagc atctgattgg 60 aaaaaaaaac gcgcctgcag aaataaattt gggaaaatcg ccccaaacac gcaagaaata 120 atgcctatca tgctttaatc gtttcaaaac gacgccgcat tgaaaacgct aaataaaatt 180 caatatccgc aaaatgacgc cattttgtaa gctgcaaaaa aaaatgctct aatcgtttca 240 aaacgatgcc acgtggtagc tgcaaaaaac gcataatgct ggcgatatcc atccctaagc 300 gcttgacaat atctttcaat gatttgcaaa acaaaataaa aaaaccgctt ctggagataa 360 aaatatcgca tgtcaataaa aaaaaaaaaa aaaagatcta ctaattttcg acaactgcta 420 gataattaat gccatacgtc taaactttgc aggtgaaatt cgaagattcc aaacaacatt 480 cccaaacaac tgcagaaaat ggcggacagc ggcaaatacg ttcgatcaac ggtagatgga 540 cgcgacgaaa cgaacgaaaa tgaacacgag cgggacgaat caatcgaaag cgggcacgag 600 caagacggaa atgctagcga gatcgacacg gaagtaagcc atggtgggga caacagcgaa 660 ctttcaacaa acggcagcac taccgacatg gacagcgaaa acggaagcag cgttagccac 720 tgcagcagcg agtatagcac gggaagcgtt gactctacga acgattctca cgatcaggat 780 actccatcga agataaagga gccgagcacg gatcgcgaaa ttaccctcga taaacaggca 840 aaccaggaca aacggatgga ggagatcgag aaggtattag tggacctttc aagagcaatt 900 actcaattgc agcccggaat tcaacagctt gcttcacagc cggcaaaccc agaggctgat 960 caaagctgga atttcagtca gcaacctgcg caaccatcga gctcatacac gaccgtacgc 1020 cctgaacaaa ttaaatcttt tccgaacggc gttcgcccga acaaaatgtg ggctgaatgg 1080 tttgatttca tcgagaattt cgaacttgcc ctgtcattgc atcacggtaa cgatccagtt 1140 tacaaggtta aactactgta cctgtcactc ggtcaggagc tacagacaat tgtcaaggca 1200 gctaacctcc gcccaagcct aaccgacgcg cactgctata ctgcattcgt gaagaacatc 1260 gaaaaccact tacgatcaat gacggacata gcagctgaac accaagcctt cttaaaaatg 1320 cagcaaggaa aggacgtgtc aactgtagct ttccatgccc ggctattaaa aggcgcaaag 1380 ctctgtgatt atgctggtgc gacagaccga ttcgtgcgag ctcagctatt gaacggcctg 1440 aggaacaaag aattggtcaa agcggctcga gtctatagct acgatactaa tttcatcgta 1500 cagtcatcta ctcgagatga aacataccag gaagaaacag ctgaccccat catggatacc 1560 aacattctgg aaataggtca aggctccggg aattatcgaa aaagaaccat tcaacggcag 1620 cagtctacct atccacacgc caagcaacac agaatgaatt cagatgtgat gtggaattca 1680 caacgccgtt ctgaccaacc gagacgagga gataacatca accaacatgc aaggaatgac 1740 caactcgacc cccaagcgca aggacgtcgc acacgctgcc caaggtgcaa caatttcttt 1800 caccgcaacc tacagtgtcc tgcactatcc cgcacgtgtg atgtttgcgg caaacgaggc 1860 cattttgcca aagcttgtcg atcaggacca aggacaggac ggccaaaagc ggtacgacta 1920 atctcagagc aacccgatgc tgaatcgtcc ggtgatgaga acatggatta caaaaagcag 1980 gtaaagcaag gttaatcatg catacctaaa tataattgaa gtgaataaac gtttagcgta 2040 aacaattctt ttattacaat cccaacaaaa ccgtccaaat ttctctttta gaacctgtac 2100 gcccttactc tagaggacgt actagtcggc tgcagtatcg gatccgcaaa acctataggt 2160 tttcttatcg actctggagc cgatgttaat gttatcgggg gaaacgattg gattaacttg 2220 aaacgagaat accacgcggg gtttgcaaaa ttacagatca tttccacacc ccagaaaagt 2280 ctgcatgctt acgcttcagc taaaccaatg cccgttgagt gcaccttcaa agcggtaatc 2340 gtggtacccg agacaaacaa accatcagtg gttgctacat tctatgttgt ccccgatggt 2400 tcaagatcat tgttagggcg atccacggca cacgatttga agcttcttca aattgggaaa 2460 aatgtcaaca gtctcgaatc aacccatgat agcacattcc ccaagatgcc gggtgtacag 2520 gttaaattca gcgtgaaccc gacaatccct ccaaccaaga atgcttatta caatgttcct 2580 gcagcattca gagaggccgc ccgtgagaga atacatgaca tggagaagcg cggtataatc 2640 gaaaaagtga cgaaggcacc ccactggatt agcggcatgt ccgccgtagc taagggcaag 2700 tcagactttc gactcgtagt taacatgagg gcgccgaata gagctatcaa tagagaatac 2760 tatcggctgc cactactcga cgagatgcgt gtcaaacttc acggtgcgaa atacttttct 2820 aaattggatc ttagcaatgc gttccaccac cttgagctgg atgaggagtc acgtgactta 2880 acaacctttc tcgctgaaaa cggtatgtac cgtttcacaa ggttgatgtt tggagttaat 2940 tgtgcccctg aaatcttcca acgtgagatg gcccgtttat tcaccagtat ggataatgtc 3000 atagtctata tagacgacat cctcattttc gcggaaacac ttgacgagtt acgccaaact 3060 gttgacaaag tacaccaaat acttcgagaa aataatttaa ccctcaatac cacaaaatgc 3120 gaatacgata agagcagcat taaatttctc ggccatctgt tagacggtga tggcttccat 3180 gttgatgaag agaaaattcg aagtctgcgt aattttcggg aacctagcac tatctcggaa 3240 ctaagaagtt ttctcgggct cgcttcttac attagcgcat atgtgaggaa ctttgcaggg 3300 ctatcacgac cgttatggga cactgttact aaaaaaacct ggacctgggg caaggaacag 3360 aaagaagcgt tcgagcgaat taaagaccat atagtccatt gcacaaccgc cttaggattt 3420 ttctcggaca cagataaaac tatcctgtac actgacgctt cccccgtcgc attaggtgcc 3480 gtcctcgttc aagagagcag aaaccacaat ccaagaataa taagctttgc atcaaaagcc 3540 ctcaccgtga ctgagcaaaa atacgctcaa aaccaacgcg aggccctcag cgctgtttgg 3600 ggagtagaac acttctctta ctttctctta gggagacatt tcactctccg cactgatgct 3660 caaggagtag cgtttatcct taatcgctct cgagaagaat caaagcgagc ccttactaga 3720 gcagatgggt gggcactccg attgagccct tacagctatg aggttgaata tgttgtcgga 3780 aaggataaca ttgccgatcc accatcacgg ctttacatcg gtgaagatga gcctttcaac 3840 gatgacgtaa gcccatggga aatcggaaga ttagaagcta actcaattga aatacttacc 3900 gaggaagaga ttgtccaagc cacatcactg gatgcagcgc ttcaacatgt ggtgacggca 3960 ttggaaactg gacagtggtc gaacgtagac aagacctact ataatattcg cgaagagtta 4020 tccatgcgag atggtataat aataaaaacc ggatgcgcta tcatcccagc gtccttacag 4080 gtacagtgac aaattttcct atgacttaaa ctaattaata ataatggtaa tccttacagg 4140 aaaaagcatt aagggttgca cacgaaggac atccgtccat tgccaaaatg aaaagtataa 4200 tgaggcaacg agtttggtgg ccaggactag cgtctatggt gcaaaaatgg gttgaatcat 4260 gcatgacctg cctaacaaac ggcaagccag agcggccacc tcctatgcaa aggattttcg 4320 ctccgaaagt tgtatgggag gcgatagcaa tagatttcaa cggtccttat gtcctactcg 4380 gaggtatatc cattcttgtt atcatcgatt acaggtcgag atatttgttt gcgaaacctg 4440 tgaagtccac aagcttcgaa tgcacaaaga aagttctcga agaaattttt gaaatcgaag 4500 gatatccgaa gagcatccga tcagacaatg gacctccgtt caacggatcg gagtataagg 4560 cttactgcga acaacgagga ataaagacag ttttctctac gccactatat cctcagcaaa 4620 atggattggt ggaaagcagt atgaaaatca taaataaggc aatgtctgca gctatttcca 4680 ataaaaacaa ctacattgtc gaattgcgcg aggccgtgca tgcttataat gctgcaagtc 4740 acagcataac tcgatttcca cctgaagaaa ttatgctggg caggaaagtt cgccgtggat 4800 tgcctttaat caaccccgag agaacgccca tgaatgatgc actgctggat gaaagagatc 4860 gcaaagcgaa attgcaagga aaacaacgtg aagatcttcg tcgtggtgca aggaagtcgc 4920 gcgttttgcc cggagatagc gtaatcatac aacgtcaaac acgtagcaaa ggacaatcaa 4980 ggttctcccc cacgagatac acagttgcag acgagaataa tggcagcttg cttctacact 5040 ccgcggatgg ccaatccatc aaacgacatg tcacgcaaac gaagagggtt ttccaggagc 5100 aaatcgttcc attccccggt gacgccgctg atgttagtgc tccaaataca ctaagcgaac 5160 ctagcgatgc agcgaatgca ccagagccag caaaacactg cagccgcaga ggcgatcgtc 5220 cgaagaaaat tccagcctat ttagaccagt acgtccggtc tattcaacaa taatccacgg 5280 aaacactttc catttcgtta cttggagatg taaaacacct gaaatgatat tattaatcaa 5340 taaaattgat tgaatttacc ttttagaaga aattctgttg tgcttcttat cgagctgttt 5400 gtttttgttt cactttccgt tcgtcctcat tttaagtaaa atgacatttg tttcatggtt 5460 tcattacacc accccacaat acaccgcttc gtttacgccc actccatccc acacggggaa 5520 aacaaagtac actcgaatct cgttgtacca tatatatggg caggctatgt tatagttaag 5580 ctactgctta tcaggttaga acactgattg tagcataatt gagtgtaggc gatgtaatgg 5640 aattactttt gccttggttg agaacatcac aaattttgag aaggggggag a 5691 // ID hATx-7_HM repbase; DNA; INV; 2821 BP. XX AC . XX DT 16-SEP-2009 (Rel. 14.09, Created) DT 16-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2821 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1925-1925 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 322..2541 FT /product="hATx-7_HM_1p" FT /translation="MASCSTRSTSRHSVFGFPKELKKNMLPTCEDVFCTYC FT FHQKQQCQSVNDIVRTVATDLIEIYNTASIPTIAFDSVMKKVKRLIDKGND FT LRKYPDAKRTSETYQKILSSFSNLFDICPCKCVSAGIANRADCKCPVDRKI FT PLIEWNFWVDQNTTRKMVIGNIDPVVTGKLQKSEERKRKAAMFLEKTKLKT FT VECSSTFQDFDNVDMEKSSESQDTGGSSIDDSDITDEETSDSDGSRCAQNR FT NHYPELCKAIDRSKCSNRDACLIVNSALKDLGFLFPENAIHPSKLRRQRSA FT YRKKSVVSHAEENQEIVCLGFDGKIDVTLTRVGSTRRKLKEEHYALVSFPN FT RNYVEHVVPASSRSDDISKEILSVVISTKSETTLRALVCDGTNVNVGKSNG FT IIRKIEQYLQRPLQWLVCMLHTNELPLRKLIEVIDGKSTGPRTSKGQLAGV FT MEFDPQHKPIIDFSPVSGCVAEVDETVMSDLSTDQVYLLKICLLIQRGYNA FT SNNYIDYLQTAQPGAVSHARWLTKANRLLRLYVCEQCPSHNLRRIVSFILN FT FYAPSWFHIKSHPTCQDGAKNFFFMLSLYQKLDKPDQEIVAPVLQNNSYFC FT HPENILVAAVGDDDGNIRKFAFEKILHARSEHSDDRIRCFDKSLIKINFTC FT KSYLDMIDWNKAIFASPPLLDDITSETIVSHSKVILPKFPCHSQDVERNIK FT DISAVCGKVYGHDSRHGVIIQMKKSRIDLPSLETKADFLP" XX SQ Sequence 2821 BP; 904 A; 487 C; 577 G; 853 T; 0 other; ttagggtggg gcaaattttg gttgcctcat gcagacaata gtaaattgat gctcccccat 60 caactgatca agaaaattgc tgtttttaat ttttagtacc atgtctaggg gtcgcaactt 120 tgaacgaaat attgtaaaat atttaccata gtacaatatt atgttttgta ttatttgtga 180 ttctgaatgg ctaattaatg acttgtatgt attgaaacct gtaggtagta ataattaatg 240 ttagtttaga tatcatttgt tttgtagatg ttgaaaattt ggatctaatt tgtgttgcaa 300 cttggcagta gcaagttaga tatggctagc tgtagcacaa gaagtacaag tcgacactca 360 gtgttcggtt ttccgaagga actgaaaaaa aatatgctac cgacatgtga agacgttttc 420 tgtacatatt gttttcacca gaaacagcag tgccaatctg ttaatgacat tgtgagaact 480 gtcgcgacag acttgattga gatctataac acagctagca tacctacgat tgcgttcgat 540 agcgtaatga aaaaggtgaa acggctaatt gacaaaggga atgatctacg aaaatatcct 600 gatgcaaagc gaacttctga gacatatcaa aagatattgt caagtttcag caatttgttt 660 gacatttgtc catgtaagtg cgttagtgct ggaattgcaa atagagcaga ttgtaaatgt 720 ccagtcgaca gaaagattcc tcttattgag tggaatttct gggttgacca aaacactacc 780 agaaaaatgg tgattggaaa tatcgatcca gtggtgacag gtaaactgca gaagtcagaa 840 gaaagaaaaa ggaaagcggc tatgttccta gagaaaacga aattgaaaac cgttgaatgc 900 agttctactt ttcaagattt tgataatgtt gatatggaaa agagttcaga aagtcaagac 960 actggtggta gctccataga cgattcagat ataacagacg aggagaccag tgatagtgat 1020 ggcagcagat gtgcacaaaa cagaaatcat tatccagaac tgtgtaaagc tattgatcgc 1080 tctaaatgta gcaatcgaga tgcctgttta attgtgaatt ctgccttgaa agacctcgga 1140 tttcttttcc ctgaaaatgc tatacatccc agcaaactac gccgtcaaag atctgcatat 1200 cggaagaagt cagttgtgag tcatgcagag gaaaatcaag aaattgtttg tcttggtttt 1260 gatggtaaga ttgatgttac tcttacacgt gtaggatcga ctcgacgtaa gttaaaagaa 1320 gaacattatg ctttggtttc attcccaaac cgaaactatg tcgaacatgt tgttccagca 1380 tcaagcagat ctgacgacat ctcaaaagaa attttatcgg ttgttataag tactaaatca 1440 gaaacaactt taagagcact cgtttgcgat ggaaccaatg ttaatgtagg aaaaagtaat 1500 ggcataatcc gaaaaattga acaatatcta caaagaccac ttcaatggtt agtttgtatg 1560 ctgcatacaa atgaattgcc tttgagaaaa ctgatagaag ttattgatgg gaaatctact 1620 ggtccaagaa cttccaaagg acaattagca ggcgtgatgg aatttgatcc tcaacataaa 1680 cctataattg atttctctcc tgtatcaggt tgcgttgctg aggttgatga gaccgtaatg 1740 agtgacctga gtacagatca ggtgtacctg cttaaaatat gtctgttaat tcaacgtggt 1800 tacaacgcca gtaacaatta cattgactac ttgcaaactg ctcaacctgg tgcagtcagt 1860 catgccagat ggctaactaa agccaaccgt ttgttgaggt tgtatgtttg cgaacaatgt 1920 ccttcccata atcttcgacg tattgtttca ttcatactga acttctatgc accgtcatgg 1980 tttcatatca aatctcatcc gacatgccaa gatggtgcca agaatttctt cttcatgttg 2040 tcgctttacc agaaacttga caagcctgat caggagattg ttgcgccagt attgcaaaac 2100 aacagctact tttgccatcc ggaaaacatc ttggtggcag ctgttggtga tgatgatgga 2160 aatatcagga aattcgcttt tgaaaaaatt ctgcatgctc gtagtgaaca ttccgatgat 2220 agaattcgtt gttttgacaa aagcttgatc aaaattaact tcacgtgcaa gtcatactta 2280 gacatgatcg actggaacaa ggccatcttt gcttcacctc cacttctgga tgacatcacg 2340 tcagaaacaa ttgttagtca ttcaaaagtt attctaccta agtttccatg ccattcgcaa 2400 gatgttgaga gaaatatcaa agatatatcg gcggtctgcg gtaaagttta cggacacgat 2460 tcacgccatg gtgttatcat tcaaatgaag aaatcgagaa tagacttgcc tagtttagaa 2520 actaaagctg atttcctacc ataagtaaga cacttatact taacatttgt atgcttgata 2580 tttgtaatgc atgccatgtt ggttacttgc tttattttct tacgaagatg tggaattaga 2640 gacgaaaacc tatatgcgag caaggctcga aagtgcgata atttgcagac taagttgcta 2700 gaaaacgatg atgtatatta taaatcagtt ttttgtgtaa taattacttt catataaagc 2760 actgattttc atttgtctca gaccctaatt ggaaaatttg ttatgttttg ccccacccta 2820 a 2821 // ID CR1-29_HM repbase; DNA; INV; 4438 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-29_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4438 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1857-1857 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 970..4050 FT /product="CR1-29_HM_1p" FT /translation="MDSVLLDNFKLTFNFLQTNKYLIDERSDPDLNYFSEA FT GALQNNCCYFYAHEIKDFLERDHLNALHINIRSLKKNFESFCLCMKDTLNV FT FNIICVTETWCDSDEVNYDCNIHLPGFKIISLARKANKRGGGVLIYIKESF FT QFFTRLDLSISDDDKEILTIEILTQKNKNILLSCCYRPPSGKIECFNTYLN FT TNIMKKADHENKLIYIIGDYNLNCFDYHVKQSTQKFYNELFKNGAFPLIDR FT PTRITEKTASIIDNIVTNDVFNESLKKGIIKSDISDHFPIFFSINIKPKVL FT PNEKTTFLKRIYNEANLLSFHEQLSLLHWKDINTSSDVNFAYNQFFKSFYE FT VYDVNFPKREINLKTKSVKSPWITNTLRKSSKVKQKLYIKYLKEKTIESKT FT IYKKYAREFEKIRKNLKKKYYSDLLMRYECDSKRTWQILREITGKTKIKSC FT ALKNGIKINGDISYDPREIAIELNKFFVTIGPNLARNIPNIKKTSDISSIA FT LANLDFLELSFEEFETAFSALKPNKASGFDDINGNVIKNSYKAIKDVLFKI FT FSLSIRQGIFPDQFKIAKITPIFKGGDLTNINNYRPISVLPVFSKVLERII FT YNKIYTHLTNNNLLFNHQYGFKKNNSTEHAILQLTRSITESFEKSEYTLGI FT FIDLSKAFDTIDHKILFKKLKNYGITGNVLKWIKSYLSNRKQFVTIDASSP FT ISLLEITCGVPQGSILGPLLFLIYINDLYKVSRLKTIMFADDTNLFLSHTN FT ITTLFQLMNVELNKISYWFKLNKLSLNIQKTNWTLFHPPSKKKLLPYEMPS FT LFIDNIEIIRVNVTKFLGVYIDDNLSWKNHIENLYNKTSKSLGILYKARNF FT LNKKILTQLYFSFIHSHINYANIAWACTNKSKLEPLYRHQKHAARLIHFSE FT RLTHAKPLLKKMNALNVYQLNVYNTLCFMFKCKTNSSPVSFHDLYMLKDKN FT KYFLRNNNFIKQPFFQTNLSKFCITSRGPFLWNKIVLKNFSQDFIQQCNYN FT SFKRKLKELIFTIDDLSIYF*" XX SQ Sequence 4438 BP; 1753 A; 660 C; 570 G; 1455 T; 0 other; acaaaatata ttatgaatgc aacgatttct tcgtgaacag acgtgttttt acagcaaaag 60 aattttcttt aaaaaataaa aataataata ataaaagtat attaatttta gtttttagta 120 tttaattatc tttataaccg gaattattct tcaaatttat caaactatct tcaaaaattt 180 tgcaacacaa aattaaaaat ggcgaaattt actgtttcac aaatcaagga gctgttagag 240 ttgcatgaaa gtacactgtt gaaaatattt aatgataagt ttgaaaaaat agaaaacaag 300 ctcattagta tgcaagtaga aaatacaact ttaaaaaatg aaatcagcga attaaagaag 360 tctgctgagt tcataaatga aaaatacgaa aaaattttaa acgaagtgaa tgattctaaa 420 aagaaagcat caacatcgaa tttaaacaaa actagcatag aacttgaaca cagcgataat 480 ataattaaag acaaactagc agaattggag gaccgcagtc ggcgaaacaa cttaagattc 540 aatgggatcc aagaaagtga aaatgaaact tgggaagaaa gcgaaaaaaa aatacacgaa 600 ctattaaatt caagacttgg tattaataac aatataataa ttgaaagagc acatagaact 660 ggaaaacttg attatggtgg aaaaatgaaa aacagaacta tagttgtcaa attcttaaac 720 tataaagata aacaaacgat aatggaaaat tactcgaagt taaaactctg gacagaacgc 780 ctatacatta atgaagatta ctgcgaaaga acaacggaat taagaaagaa attgtttatt 840 gaagcgaaag aacttagaac aaaaggtaaa tatgcaaaag tggtttataa taaacttgtt 900 acgcgcgatg cttaaaaata ggaattcctt tgttttcgct aggcactaaa agtatactaa 960 actattaaaa tggattcagt tttactagat aattttaaat taacctttaa ttttcttcaa 1020 acaaataaat atcttattga tgagagatct gatccagatc taaattattt tagcgaagca 1080 ggtgctttac aaaataattg ttgttatttt tacgcacatg aaataaagga cttcctagag 1140 cgtgaccatt tgaatgcctt gcatataaat ataagaagct taaaaaaaaa ttttgaaagc 1200 ttttgcttat gtatgaaaga tactctaaac gtttttaata taatttgtgt aaccgaaacg 1260 tggtgcgatt ctgacgaagt gaattatgac tgtaatatcc atttacctgg ttttaaaata 1320 atatcattag cgcgtaaagc aaataaacga ggcggcggtg tgcttattta cataaaggaa 1380 agttttcaat tttttacaag gctcgacctg agtatttctg atgacgataa agagatttta 1440 acaattgaaa ttttaaccca aaaaaataaa aacatacttt taagttgttg ttatcgccca 1500 ccttctggaa aaattgaatg cttcaacaca tatttaaata ctaatattat gaaaaaagct 1560 gaccacgaaa ataaattaat ctatataatt ggtgattata atttaaattg ttttgattat 1620 cacgtcaagc aaagcacaca aaagttttat aacgaattat tcaaaaatgg agcgtttcct 1680 ttaatagata gaccaaccag aattaccgaa aaaacggctt caataatcga taatattgta 1740 actaacgatg tttttaatga atctttaaaa aagggtatta ttaaaagtga catttccgat 1800 catttcccca tttttttctc tataaatata aaaccaaaag tactacctaa tgaaaaaaca 1860 acctttttaa aacgtatcta taacgaagct aatttattat cgtttcatga acaactgtca 1920 ttgcttcatt ggaaagatat aaacacttca tctgacgtaa actttgctta caaccaattt 1980 tttaaatctt tctatgaggt atatgatgta aattttccga aacgcgaaat aaacttaaaa 2040 accaaaagtg taaaatcacc atggattaca aacaccctaa gaaaatcatc aaaagtcaaa 2100 caaaagttat acattaaata ccttaaagaa aaaacgatag aaagtaaaac catctacaaa 2160 aaatatgcga gagaatttga aaaaattaga aagaatctta aaaaaaaata ctattccgat 2220 ttacttatgc gatacgaatg tgattcaaaa cgcacatggc aaattttaag agaaattacc 2280 ggtaaaacaa aaattaaatc atgcgctctg aaaaatggca ttaaaattaa tggtgatatc 2340 tcttatgatc cacgtgaaat agcaatcgaa ctaaacaaat tttttgtcac gatcggacct 2400 aatttagcaa gaaatatacc aaatataaag aaaacaagtg atatctcatc tattgcatta 2460 gctaatcttg attttttgga attatcgttt gaagaatttg aaacagcttt tagtgcgcta 2520 aaaccaaata aagcaagtgg cttcgatgat ataaatggca atgtaataaa aaattcatac 2580 aaggctataa aagacgtgct ttttaaaatt ttttcacttt caatcagaca aggaattttt 2640 cctgatcaat ttaaaattgc aaaaatcaca ccgatattca aaggaggaga cttaacaaat 2700 atcaataact atcgccctat ctctgttctt ccagtctttt ctaaagtctt agaaagaatt 2760 atttataaca aaatttacac tcatttaact aacaacaatt tattatttaa ccaccagtat 2820 ggattcaaaa aaaacaactc tacagagcac gcgattctcc agctaacacg tagtatcaca 2880 gaatctttcg agaaatccga atatacttta ggcatcttca tcgacctatc gaaagccttt 2940 gatacaatag atcataaaat tctttttaaa aaactcaaaa attacggaat aactggaaat 3000 gttttaaagt ggataaaaag ttatttaagt aatcgcaaac aatttgtaac cattgacgct 3060 tcttctccaa taagtttgtt agaaattact tgtggtgttc ctcaaggatc tattctcgga 3120 ccacttcttt ttcttatata tatcaacgat ctctataaag tatccagatt aaaaacaatc 3180 atgtttgcag acgatacaaa cttattctta tctcatacta acatcactac tctgtttcaa 3240 cttatgaatg tagaattaaa caagatttct tactggttta aattaaataa attatcgctt 3300 aatattcaaa aaacaaactg gactctcttt cacccgccct ctaagaaaaa actactgcct 3360 tatgaaatgc catccctttt tattgataac atcgaaataa ttagagtaaa cgttacaaaa 3420 ttcttaggtg tttatattga tgataatctt tcgtggaaaa atcatattga aaatttatat 3480 aacaaaactt caaaaagttt aggaatttta tataaagcaa gaaacttttt aaataaaaaa 3540 attttaactc aattatattt ctcttttatt catagccata taaattatgc aaacattgca 3600 tgggcttgca ccaataaaag caaacttgaa cctctttatc gtcatcagaa gcatgcagca 3660 cgtttgatac attttagtga acgtcttact catgcaaagc ctctattaaa aaaaatgaac 3720 gccttaaatg tatatcaact caatgtttat aatactcttt gttttatgtt caaatgtaaa 3780 actaattctt caccagtttc tttccatgat ttgtatatgt taaaagataa aaataaatac 3840 tttttacgca ataacaattt tattaagcag ccattttttc aaactaactt gagcaaattc 3900 tgcattactt ctcgcggacc attcttatgg aacaaaattg tcttaaaaaa cttttctcaa 3960 gattttatcc aacaatgtaa ctacaactct tttaaacgaa agttaaaaga attaattttt 4020 acaattgacg atctatcaat ttacttttaa taatttttaa tagttttccc attttacctc 4080 cacttatcaa agaactatta ttattttata atttatatat ttaccaagaa tatgtatcaa 4140 tatatacgac aatttatatg tttattaact tatatagtat ttatataact gatctttata 4200 tttataacgg gttttctgaa tatttttaat ttgtttgtat cacgttcatt ttagcggttc 4260 tcgacgacaa gaccttacgg tcttctacga gtccccgcgt tcttttattt cttatttacg 4320 atttttcttt tttgttattt tcttgtaact tctatatttt attttttaat ttgtatattt 4380 ataacggaat gttgtttaaa atgtaataaa gaacaaaaaa aaatatatat atatatat 4438 // ID Crack-7_BF repbase; DNA; INV; 3430 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-7_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-7_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3430 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3430 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 812-812 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..2992 FT /product="Crack-7_BF_2p" FT /translation="LANSDDYWYCNPCLLPCFSDSFFDSSCNDCSGDSVTA FT NEEEISESNSMIPASKGLIMVHLNICSLYSKLDQLYVFMSTNDVDIMTVSE FT THLDDSIHDSELCIDGYHLYRQDRNRSGGGVAIYVSDKYSHTERTDLKQPG FT LEALFCQVQLPSTKPIVIGTIYRPPTSSVEFYTLLRDSLETWSMSTPKSEL FT FLLGDMNIDISQPCSSAVKHMDNLSQEFQLQQVIDKPTRVHQHSSSIIDHV FT YCSDMHRVSDYGVVHSTISDHYAVFCTRKARRTKSVSKYVTSRKFTNFQED FT SFLTDLRALNWDSVLQATRVEEAWSLFKSLFITVSDVHAPYISKRTKTAQP FT KWLTPDIKSLMQVRDDTKARARRTGESKDWEDYKAKRNYVNKRVRLAKASY FT CQQKLEENLTDSKKLWATIKEVLPKKTQAVTKSLRWEGKHILDLPNIANCL FT NKFFVTVGNKLAEKFKHQATVPKCPARYKELKTTFNFKQISEVGVYEKLRG FT LHSNKATGLDRIHARLLKVAAPSICKPLTHIFNLSLSSGHIPTDWKMARVT FT PLHKGGDTEDPNNYRPISVLPVIMKAFEREVHTQFVEYLHQHNILSTQQSG FT FRTGHSTTTTLLDAKDYVLHNMDRGNLVGAVFLDLKKAFDTVHHGLLLTKL FT SWIGIQGVEHLWFSNYLSGRQQIVSLNGCRSEYLPVTLGVPQGSILGPLLF FT NLFINDIPDVVTTCKICLYADDTAIFYPSNNVKHIESILNYELSGLATWFQ FT TNRLTLNVTKTKWMLFGTANRLKQAQSLTVEIDQEIIERVHTFKYLGLYLD FT SHLSFNEHIDKVCSKVSQRIGLLRRLRHCLTFSIADMLYKTMILPIFDYCD FT TVWGTCGATKQRQLQILQNRAARVVLQRRQRDISTVNLHQTLNWKYLADRR FT FEHTCIMVFKCLTGLAPTYLSATFQHNSRIHTYNTRQTTKLHPPSYTTTCG FT QKTFAYTGTSQYNKLADNIRNITTLKGFKTALKNICISDLK*" XX SQ Sequence 3430 BP; 1068 A; 748 C; 680 G; 932 T; 2 other; actggctaat tccgacgatt attggtactg caatccttgc ctacttccct gttttagtga 60 ctcctttttt gattcttcat gtaatgattg tagtggtgat tctgtaactg cgaatgaaga 120 agaaatctca gaaagtaatt ctatgatacc agcatcaaag ggcttaatca tggtacatct 180 taatatttgc agcttgtata gtaagctaga ccaactgtat gtatttatgt ctacaaatga 240 tgttgatatt atgacagtca gcgaaaccca ccttgatgac tctatccatg acagtgaatt 300 atgcatagat gggtatcact tatacagaca ggacaggaat cggtctgggg gaggggtcgc 360 aatttatgta tctgataaat acagtcacac tgagagaaca gacctgaaac agccaggact 420 tgaggcactt ttctgccaag tccaactccc tagcactaaa ccaatagtaa ttggtactat 480 atacagacca ccaactagct ctgtggagtt ttacaccctg ctaagagact cacttgaaac 540 ctggagtatg tccactccca agtcagaatt gtttctactg ggagacatga acatagacat 600 tagtcagccc tgtagctcag cagttaaaca catggataac ctgagtcagg aatttcaatt 660 acaacaggtt atagacaaac caactagagt acaccaacac tctagtagta taatagatca 720 tgtgtactgc agtgatatgc atagggtaag tgactatgga gtggtacact ccactatctc 780 tgatcactat gctgtgttct gtaccagaaa ggcgagacgc accaagtccg tatctaagta 840 tgtaacctcc cgtaaattca ctaactttca ggaagacagt ttcctgactg acctgagggc 900 actaaactgg gactctgtac tccaggctac cagggttgag gaggcttggt ctcttttcaa 960 gtctctgttt atcacagtta gtgatgtgca tgcaccatat atttccaaac gcacaaaaac 1020 tgcccagcca aaatggctca cacccgacat aaaaagccta atgcaagtca gggatgacac 1080 gaaggccagg gcacgcagaa caggtgaaag taaagattgg gaggattaca aagctaaacg 1140 taactatgta aacaagaggg taaggctagc caaagccagt tactgccaac aaaagttgga 1200 agaaaacctg acagactcaa aaaaactgtg ggctacaatt aaggaggtyt tacctaaaaa 1260 gacccaagca gttaccaaat cactgcgctg ggaagggaaa catatcytgg accttccgaa 1320 tattgcaaac tgcctcaaca agttctttgt aactgttgga aacaaactgg cagagaagtt 1380 caagcaccaa gctacggtcc ctaagtgccc cgcaagatac aaagaattaa aaactacctt 1440 caacttcaag caaataagtg aagtgggagt gtatgaaaaa ctaaggggac tgcacagtaa 1500 taaagcgact ggtctggaca ggatacatgc aaggctcctg aaggtagccg cacctagtat 1560 atgtaaacca cttacacaca ttttcaattt gtccctttcc tctggccaca ttcccacaga 1620 ctggaagatg gcaagggtca ctccattaca caaagggggg gatacagaag acccaaacaa 1680 ttacagaccc atatctgtcc tacctgtcat catgaaggcc tttgaaagag aggtgcacac 1740 acagttcgtt gaatacctgc atcaacacaa tatcctctcc acccaacagt ctggttttag 1800 gacaggccac tctacaacaa caacactact ggatgccaag gactacgtgc ttcataacat 1860 ggacaggggt aacctagtgg gggcggtatt tttggatctt aagaaagcat ttgacactgt 1920 acatcatgga ctattattaa ccaaactgtc atggatcggg attcagggtg tggagcatct 1980 ttggttcagc aactacctgt cggggagaca acaaattgtc tctctgaatg gttgtaggtc 2040 agaatactta cctgttacac tgggagtgcc ccagggttcc atactgggcc cgttgctctt 2100 caatttattt attaatgata tacctgatgt agttaccaca tgtaaaatat gtttgtatgc 2160 tgatgacaca gcaatattct atcccagcaa caatgtaaaa catattgagt caattctgaa 2220 ctatgaactc tctggtctgg ccacttggtt ccaaacaaat cgtttaacat taaatgtaac 2280 caagactaaa tggatgttat ttggcactgc taacaggtta aagcaagcac agtccttgac 2340 agtcgaaatt gaccaagaaa tcatagaaag agttcatacg ttcaaatatc ttggtctgta 2400 ccttgactcc cacctgtcct ttaatgaaca tatagacaaa gtttgtagta aggtgtcaca 2460 gaggattgga ctacttagac gcctcagaca ctgcctgact ttcagtattg ctgacatgtt 2520 gtacaagact atgatattgc ctattttcga ctattgtgac actgtctggg gaacctgtgg 2580 ggccaccaaa caacggcaac tacaaatact acaaaacaga gcggcaaggg tggtgctaca 2640 acgtaggcaa cgggacatca gcactgtgaa cctccaccaa actctcaact ggaaatacct 2700 ggctgatagg agatttgagc acacgtgcat tatggttttc aaatgtctga caggcttggc 2760 accaacatac ctatccgcca catttcagca caactctcgt atccatactt acaacactag 2820 gcagaccact aaactacacc caccaagcta caccaccaca tgtggacaaa aaacatttgc 2880 atacactggc actagccagt acaacaaact agcagataac ataagaaaca tcacaacact 2940 aaaaggcttt aaaacagctc tgaagaacat atgtatatct gacctcaagt aatattggaa 3000 cttagtatgg aaaagcacct ttgtctctga tgtgccttac tgttctttaa atttttttgt 3060 ttgttcaatc tttacatatg atttactatc catactttgt aaatttcagt ttcattaatt 3120 ttcgttttac acatgatttg atatatgatt tgacctctga cctccacccc atctcttgac 3180 atattgattt tgaccttgga acgttttcga catgagtatg gacactgatt atgttttgac 3240 gcaccggttt tatgttttct gtactgtatg ttgtccctgg atgttcaaag tttgactgat 3300 tgttaccacg ttttttgtcc tgtatttatt attttacatg tatgtttgat gggccccctt 3360 ggaaatcaac tactgtatgc actgttgtag gggctaccca tctgttttca agatgaaata 3420 aataaataaa 3430 // ID Gypsy-15_CQ-LTR repbase; DNA; INV; 142 BP. XX AC AAWU01024898; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_CQ_; KW Gypsy-15_CQ-I; Gypsy-15_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-142 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 410-410 (2011). XX DR Genome; AAWU01024898; Positions 15064 15205. XX SQ Sequence 142 BP; 28 A; 37 C; 42 G; 35 T; 0 other; tgtgatagga acgacccaca gttgcatgtg tgtgcggtgg cagcatcatg atcctcatgt 60 gagagagcga gcgtctgagc ttgcttgctg tgtgctgtac cgaacacttc agtctccgtg 120 tgaccgcgcg ctagctacca ca 142 // ID Urukhai_Cis repbase; DNA; INV; 8737 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Helitron DNA transposon from Ciona savignyi. XX KW Helitron; DNA transposon; Transposable Element; RC; Urukhai_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-8737 RA Smit A.F.; RT "Urukhai_Cis - Helitron DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000274, Ci000124, Ci001069, Ci000261, Ci000262 This is an CC incomplete sequence of a large element marooning the C. savigny CC genome. It is followed by a satellite-like sequence (UrukSat_Cis CC = Ci000268). There are multiple 5' end extensions and other CC indications of subfamilies. This sequence contains 4 long ORFs. CC ORF1 (bp 417-1847) encodes a protein with an 18 AA unit 6-fold CC repeated. Neither this nor the ORF4 (bp 7664 to 8371) product CC match known proteins (ORF4 could be fortuitous). ORF2 (bp CC 2155-3177) encodes a rolling-circle replication (RCR) initiation CC protein), matching that of helitrons (up to 21% identity and 35% CC similarity). The product of ORF3 (bp 3939-7520) contains a CC helicase domain (superfamily I) that is 45% similar to the CC C-terminus of RRM3/PIF1 in S. pombe, and at the C-terminus an CC endonuclease 47% similar to those found at the N-terminus of the CC pol proteins of L2-type LINEs found in Drosophila, rice and C. CC savigny itself (L2_Cis1). Unlike in helitrons, there may be no CC need for splicing. Perhaps Uruk was a helitron that picked up CC the endonuclease from a LINE-like element. XX SQ Sequence 8737 BP; 2929 A; 1601 C; 1668 G; 2525 T; 14 other; agcaaaatat tggctgaagt ccctttcaca cttttaattc attaagtttt tggtacattc 60 tataacgaca cacgaccctt cttaattcgt atgattgttt ttaaatatac tgtggcttct 120 ttaacaattt aaacaacctg taaatatttg tgcctaatga atgatgtttt gtttttattt 180 attctgttct aggacaatnt gatgacacct cgacttgtcc tgcatagact gactgaggct 240 gaaattttag aggcttctgt ggtaagcaat tggcaaattt aagctgtatg ccaattttta 300 ttactatttt actgttgata gtaaatagaa actttattgc atttagctcg attttaacaa 360 acttttaaaa agggaaacca aatttgactg atagtgtatg aataaattaa tactaaaaat 420 tgcaggtaaa atcaacacaa cccgtttcaa aggttgttgg aatcggtaat cagcggtttc 480 aaaagtggta tgctatgaat aaaaancgtc acaattttaa ccgaagaaag aaatacaaga 540 aagatgaaca atttaaaatt gccgctaaac aacgggtttt aaaaaattac caccaagtag 600 aagatgtaaa gaaaagggtg aagaaaagag ttttggacag ctaccacgga gatgacaaaa 660 ttaaagaaaa aatgaaacga cgagttttgg acagttacca cgaagatgac aaaattaaag 720 aaaaagtgaa acgacgagtt ttggacagtt atcacggaga tgacaaaatt aaagaaaaaa 780 tgaaacgacg agttttggac agntatcacg gagatgacaa aattaaagaa aaaatgaaac 840 gacgagtttt ggacagntat cacggagatg acaaaattaa agaaaaaatg aaacgacgag 900 ttttggacag ttatcacgga gatgacaaaa ttaaagaaaa agtgaaacga aggtcttcta 960 gtcatatgaa gagaaagtat gagagtgacc ccaaatttaa aactgcctta aaagagactt 1020 caaagaggac atcaaaaata cgttacgaag acgctgggtt gagggaaata aagaaagcag 1080 ccatggggcg catcataaag aagaaatatt atcaagatgc tgaattccgg aaaaatgtgt 1140 ctgctcaatc cgccaaacgc attnaaaagt tataccacac tgatgaggag tttaagtcaa 1200 catttgtaaa gaaagttaga actcgtcaat caataaagac aaaagaaaag aaggatatat 1260 caaaatgcat tgaggtattt cgagctcatt gcaaactgct tccaaaatat gcatgctgtg 1320 tctgctatcg ggaattcttt tcaaaacaag tgaagcagtt ttgcaaaagc gattatcctc 1380 caaatgttga agcntatgta ttggatttta atattgatgg ccatcaatgg atatgcttca 1440 cctgcatgaa gtacatgaag caaggaaaga tgccccctca agcttggaaa aatgggttag 1500 acttagaaga aattgcaccc tccctaaaag aattaaatac tctagagcga cacctgatat 1560 ctccaactct cccttttatg aagatagttt ctcttccacg tggggcacaa aaagggatcc 1620 atgggtcggt ggtctgcgtt aatgctgatg ttggcaaaac cacttccatc cttccgcgat 1680 cagcagcatc tcacactttg atgagagtaa agttaaaacg taaattggag tacaaaggcc 1740 atcatctata ccaagtaatt acccccaaca aagtcaattc ggcattagaa tttttaaaag 1800 aaaacaatcc tctttttaaa ggtatgttgg gtgcactaca tttttgagta ctcacatcgt 1860 aacattaata cattaatcca ttttagatat tgatattcat catattaatg aagctgaaaa 1920 cgaccatcta cattacactg taaatgaaaa tcaaggtggg attttcgtgt tttacactag 1980 ctgctgtata aagtagaaaa tgtaagttta ccaacatgtt atcatgttta ttagacttcg 2040 accaagagct gctggatgaa atttttaaag acgatgagga taatagcggt ggtgagtttg 2100 gtttttacta tttcgaataa agggtattgc aaattcacaa taacgtttta ataaaaatat 2160 attatagcga aagataagca aaaagacgaa aatgcacaga caacaccagg taaaccagat 2220 attgaggtaa gtcaaacaac aacacccctt gtttcgtgcc ttcaaccggc ggaccctgcc 2280 caacatctct tggacacaga agataagatt ctttgtttgg cacccggtga aggtaataaa 2340 ccaacgaagg caatggaatc agaagcaagt tgttttcctt cattatttcc caatgcagaa 2400 aacactttcg taggnaaacg acctcaaaaa gtaagtttta atcgatattt aaactctagg 2460 cttttaagtt atgataacag gtttgcagcg tgccctgaat atgttttttg gggtcaattt 2520 atcaacgaag taacaacagt tgcttcgtcc ctnactatcg ccatgcggaa gaataccagc 2580 agtacatcgg acggaacaaa gattaccaag agaatgttgg taagtgacga atcagttgaa 2640 aaactgctaa gaaaagacga agcatacaaa tctctccatt cgattcgtgg aagtccacca 2700 tactgggaaa agactttgag ggatctgttc gccatgcttc ggcagctagg taagccaact 2760 tggttttgta gtttttccgc agcagataga agatggcctg aaattgtgga ggcaatttgc 2820 gctcaacaag gtgtaccagt accggagctt aactgggaca cctactgcaa gttaattgca 2880 agcaatccgg tcactgctgc aagaatgttt gaccagaggg tgcatcattt tctaaatgat 2940 attttaagat cagagtcaaa acctattgga cacatcgtgg actttttttt tcgaactgaa 3000 ttccaaatgc gaggtaagaa taattctaaa ataaaattta taaatgtcgc catgcgtgtt 3060 ttaatttctg ttaattttgc aggttggccc catattcacg cgttgttttg ggttaaagac 3120 gccccgctct ataaaatcag tcccgatgca gaagtgtttg actttattga taggtaacaa 3180 gttttaatct gaatacctta ttttaaactt tttttagtaa cttcttaata ttgtgcagta 3240 gttatgttta atattgtaaa tgataaatct tctactttga ttcgaatgaa aactgatttg 3300 tagttttttt tatatattat taaatagcgg catattaaac agcggttgaa aattgtagat 3360 gtgaaattat agatcattac ataattttaa aattaaaact aattcggata agaacatcca 3420 tttacaaatt ttaattaaaa gggcatgtta ttaacatcac acggtgtata ggtacatcag 3480 ttgtaaagtt cctgataaaa acgtggatcc ggaacttcac gagaaggtaa gtaaattgca 3540 aacccacagt caaaaacact caagttcctg tagaaaaggt ggtaaaattt gcaggtatgc 3600 tatgtgatgt gttttacata ttgtatacat ttaaatatgc atattcaaag tacttgaaat 3660 gaaaaatacc tttacgtcgt tttgcagtta tttataaaag acgtatgtgg tactaaatat 3720 gacttaaacg ctcataattg tttagcagtt ttcagctttg cctttcacaa tgccatatat 3780 taaaaaaaat aatacaaaaa agaaatttac ccgttgaaag attacgtcct acaaatatat 3840 ttttcggttg tttttaaata tatagaaatg tttatgagtt gagcattatg ttataataat 3900 ggtaacttgt ataggctaac acgggaaact aaacttaaat acttttatat aacaacagat 3960 tcaactttcc acgtccgatt agttccgaaa catttatttc ggaaccactg gataaagaaa 4020 aaaagacgaa ggaaaaacgc gacacagcaa tggccgtgtt acaagcggtc atggacataa 4080 tcaacgacaa ggatwgtgac gtggataaag cggttgacat tcttcagaag gcaaacatct 4140 cgtttaaaga ttacgtagaa gctcatgaca tgctttctac cagaaggtca gtaatattaa 4200 gtcgtgaccg tagcgaatgc tgggtgaatg cctataaccc ggacctccta cgatcttggg 4260 atgcaaacat ggatattcag tatatcctgg atgcatatag ctgcataatg tacatagtat 4320 cgtacattac aaaaggagaa cgggaattcg gnaacttaat taaacaggca cttaaagagg 4380 cacatgaggg caacgttgat gccctcacag aattgcgcct cctgggaaat aaatatttaa 4440 cccaccgaga ggtgagcatt atggaagcgg tttaccgttc cattgggtta aaaatgaagg 4500 agagctcgcg aagcgttgtc ttcgttccta ctgatcctga ttgtgtcagg atgagcctcc 4560 ctctaagcag gctcgcgaac cacgcagagg acgacaccca gatttggatg accaacatcg 4620 tggacagata ttgtaatcga ccttttagtt gtcatttcag gtttatgagc ctggctagct 4680 ttgcatcctg gtacaaatat gtaccaaaaa gccagcgctc gaagacagaa gttgacgatg 4740 aagctgaaga ggatcgcaca aataacagcc gccatattcc nctccataat aacatggggg 4800 caattcaaag acgccttaat caggcaatta ttcgatttcc gaaatttcgt caggataaat 4860 atcctgaaaa atattatttt aacttactcc aactttatct accccaccag ggactaagcc 4920 taccctccgg ctacaccacg tacgaagaat actttgacac tggtgtggtg atcatcaatg 4980 gggaggaatc acctattaat aaggtggttg ccattgaaat ggcaaaattt aacaggttgg 5040 caaacgccga ggacatctgg gatgagcttc cacacgatgc tgagcagctg caggatgctt 5100 ggtcccagat tgctccttcc acagagcaaa cccgacaaga agaggaagat gaagggttgg 5160 aggagggagg tatagaacct ctcgaaccag tggagcccat ttttgaaact ggttgggtag 5220 atggtaagct gcgagataaa acacagcatg gtatggaata ccagctggag ccgtctaagt 5280 cacggctctc ttcagacgat gcttataaaa tgaagcaaaa tctgaacacg cagcagcagg 5340 ttgtgtttaa ttacattcga aaatggtgct ttaatgtaaa acagggtaat agcactgccc 5400 catttcacct gtttgttacc ggtggtgctg gtacggggaa aagtttgttg attaaatgca 5460 tcaaacatga ggcctcccat atattttcgg acctaaccaa ttcaccggat gatataacca 5520 ttcttttaac agcgtttaca ggtacagctg catttaacat cggaggatca actatccatt 5580 cggcattggg tataataaat cccaaaaagt tctatgttcc actttctgag gaaaaactta 5640 caacattgcg ctgtaccctg tcttcactga aagtaatggt tatagatgag gtttcgatgg 5700 ttgatacaac tttaatgttg tatatcagtc aacgtcttaa ccaaattcta aggccatcca 5760 acccaaacgc aatttttggt aatatttcca ttttagccgt cggtgatttt catcaaatcc 5820 ctcctgtttg cggaacatca cttctgaaac caacagaaac ctcaattttg gacatttggt 5880 cacaatttga aatttatgaa cttaatcaaa ttatgcgtca aaaggaggac caatcgtttg 5940 ctcagttgct caacngagcc cgagtggcaa ccaaaacaaa acctttatct ttttcagaca 6000 aaggagaact gcagaaacga ttggtcgatt ttcatcccaa ttatccaaaa gattgcctac 6060 atatatttgg taccaattcc aaagttgaca agcacaataa aaaaatgcta attggtttaa 6120 agaaaaaaat tgtgacttta accgcggttg acattattaa aactaacaag ggccaacatt 6180 ttcgtagtaa aaccccggta acttatagta aaaaagataa tccaaattta ccatccattg 6240 tagagttatc tgttggagca agagtgatgg tgatttcaaa catcgacgtg gcagatgggc 6300 ttgcaaatgg tgtggttggt actgtttgta acattattga cggtatcaat caatatgact 6360 tgccagatca cattgcaata atttttgata actcattaat aggcgcaaat agaaaacgaa 6420 gccaaccaaa aggaaattac ccaccggaag ctgttttgat tcaacctcaa agcagccata 6480 aaataaaacg aggcacaaga acatacgtca ggcaccaata tccactccgg ctttcctggg 6540 catgcacggt gcacaaaatt caaggcggta ccgttcaaaa tattattgta aacatggaca 6600 atttttttac tcggggaatg ggttatgttg cacttagtcg cgtgacctca attaacggac 6660 tttatttaca cagtttcaaa ccggaaaata tctacacggc agacgaaacg agcacggcac 6720 tcgaaggaat gagaagggcc aatattttca atttattgga taatttaccg gaagatcatc 6780 tgcacatagt tcaccacaat acacaaagtc tacaaaagca ccacctctct ttaaaaaaat 6840 tggcttgggt aaaacaggca caagtagttt gcctttctga aacatggata acggatccca 6900 aaatttacga gatgccgggg ttcaatatcc attgctcccc tcgtctacaa agtagaggtg 6960 gtggcttaat aatatacata aattcccatt tcagatctga acttttgttt acggaccaaa 7020 cagatgtcga gttgctgtgc gttcgtttgg tgtcgccaaa cattgtaatc tgcctagtgt 7080 actgcccacc gagcattaaa cagccaacaa aaaacatctt agaattaccg ggtcgcctga 7140 catttttatc ccaaaaattt gaaacgaaca aaattattat aactggagat tttaatgaaa 7200 atttatttga agaaaaaaaa caaccaattt atatggtatt aaaaaacgca ggatttgacc 7260 aacacgtgaa aaccccaaca acagatggtg gaacgcttct tgatcatatg tattccatta 7320 acataaataa ccttcactgc gccacactga caacctttta ttcataccat aacccggtat 7380 gcttgtcaat atgcttatca gataaatgct ctacctcctt tgcagataaa aaagagaacc 7440 ccataagtca caacttagca aataatcagg cttctcttat tggtggtgag tacattttat 7500 ttctacatga tcttgcttaa tttataaatc atattgtggg tatatatatg tgtgtgtgta 7560 aaattaatac cgttaatttt ttgcataagg cctacttata agtctttggt tttaaacttt 7620 aatacttctt cttttttagt ggcgtataat atatttgtgc tatagatctt taatcttaaa 7680 atgtttttag ctcaaattac ggctcctgtg acaattcttc cggatgtngc ttcctccaaa 7740 gttccgtcat ccgatgtgat ggagaatgta tcgaatatca gcaggaagca ctcccggagg 7800 aatactacct tgtcacgtgt gtctgacatg agaccacaga ggctgcgagc taggatcatt 7860 gatttgtggg gtggcaacaa aaatgtgttc cgtcttaaaa caactatgga ggaattaggg 7920 tttcgtgtaa atccccatat tcaccctaat gttcagccgg ccgtatcatg tggctatatt 7980 gctgcacgtg cagcattatg gtgcaatgat aatcgtaatg gaaattggtt tgaaggagac 8040 ttagttccca ttatttggcc accagtcata tatgctatta ttcagtctta taaccaaatt 8100 cttggtactt attcagattg tagtcggttc ctggcggatg atgaagtttt gcgcatcatc 8160 gatgaggtgg atgggnaagg aattggaaca ccatggctgg atgctccaca acccatcaac 8220 ctgttcacag atactttcgt tagaagggca actgcagcaa gattgaatcc atcgggtaac 8280 ctttgtattg cggttgttaa tacaaccgca cagcatagat tagaaaacga agtaatcggt 8340 gatcattggt ttgtatgtgt ttatgagtac taagcaaatt ctgataaaga gcattttctt 8400 gaataagtct ccttctgatc ttttattatt gcgatctatt ataatatgta aataacattc 8460 cacaagtttc aacctattat ttgtaaatat cattccaaat gttccaacct attatatgta 8520 tatatcattc cacaagttcc aacctattat atgtaaatat cattccacaa gttccaacct 8580 attatttgta aatatcattc cacaagttcc aacctattat atgtaaatat cattccacaa 8640 gttccaacct attatatgta aatatcattc caaatgttcc aacctgttat atgtaaatat 8700 cattctacat attccaaaat attctaatat gttaaca 8737 // ID RTE-1_BM repbase; DNA; INV; 2359 BP. XX AC . XX DT 26-APR-2010 (Rel. 15.07, Created) DT 26-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-2359 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1053-1053 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. XX FH Key Location/Qualifiers FT CDS join(128..592,623..2248) FT /product="RTE-1_BM_1p" FT /translation="TPHSKCSNRVQTNIGNAQTPTLDDRWYLALIEERRRK FT KAQGIDTNKLNVLSANIQASCRRDHNSHLRNICEEVEKHATKHESRDLYHK FT IRFITKSLPSKTWAIVNDKNELITELDQISETWKSYCQSLFQDPQSRQFTS FT TDPNDENIRYIKMSRLLQLKILTETWKSYCQSLFQDPQSRQFTSTDPNDED FT LEPSILLSEVRAAIKHLKNGKATGRDAIPIETIKALGERGDDMFHKICNRV FT WQTGVWPSEWAHTVFTPLHKKGSTKKCNNYRLIALTSHSSKIMLRILNERL FT KTYLSKEIAPEQAGFVRGKGTREQIFIVRQIIEKAREFNRPTYICFVDFSK FT AFDSVKWPVLWKTLLDLGTPKHLVHLLRRLYENGTASVRADDVLSGNFHPS FT AGVRQGCIVSPLLFNAYTEIIMRITLENWTDGVAIGGYRIANLRYADDTTL FT FATDAQCLGELLSRMERVSLEFGLRINRSKTKVMIVDRAMNNSPDVTQIAG FT CDVVQSYIYLGALISNNGGCVDEIRRRMAVTRSAMDGLRKVWRNRNITKTT FT KIRLVRTLIFSIFLYAAETWTVRDVERKKIDALEMWCWRRMLGVSWTDFRT FT NVSILQELGIKQRLFGIVQSRMVNFFGHVSRRDGRSIERLVVQGSVDGTRP FT RGRPPMRWTDQIKTAVGGPLNECSRMASSRERWRDIVRRIKSAPSNTT" XX SQ Sequence 2359 BP; 742 A; 482 C; 590 G; 545 T; 0 other; gagcgcaaag gaggatcgaa gtaacagata gggcaaagtt tactgcggcg ttcgaacgga 60 aatggacaga gtggactgac gttaatcaca atgaggctac accggaaatg ttatggagca 120 aggctaaaca cctcattcaa aatgcagtaa tagagtccaa accaatatcg gaaatgcgca 180 aacgccaaca ctggatgacc gatggtacct cgccctaatt gaagagagac gtcgaaagaa 240 agctcaaggc atagacacga acaaactgaa tgtattgtct gccaatattc aggccagctg 300 cagacgagat cataactctc atcttcgtaa catttgtgaa gaggtcgaaa aacatgctac 360 gaagcatgaa tctagagatc tctaccacaa aatacggttc attaccaaat cactgccatc 420 taagacctgg gctattgtaa atgacaaaaa cgaactgata acagagttgg atcaaatctc 480 tgaaacatgg aagagctact gtcaatcgtt attccaggat ccacagtcac ggcaatttac 540 aagtacagac ccaaacgatg agaatataag atacataaaa atgagccgct tatgattgcg 600 atacaataat tattagaaat aattgcaatt gaagatttta actgaaacat ggaagagtta 660 ctgtcaatcg ttattccagg atccacagtc acggcaattt acaagtacag acccaaacga 720 tgaggacttg gagccgagca ttctcctgtc tgaggttagg gctgctataa aacacctcaa 780 aaacgggaaa gcgactggta gagacgcaat tcctattgaa acgataaaag ccttaggaga 840 acgcggcgac gacatgtttc ataagatatg caacagggta tggcagacgg gagtttggcc 900 gtcggaatgg gcacataccg tgtttactcc cctgcataaa aagggctcta cgaagaaatg 960 taacaactat cgtctaattg ctcttacatc acattctagc aaaattatgt tacgtattct 1020 gaacgagcga ttaaaaacct atctgtccaa agagattgct ccggaacagg ccggtttcgt 1080 gagggggaaa ggcactcggg aacaaatttt catcgtacgt cagataattg agaaagcaag 1140 agaattcaac aggccgacat acatttgttt tgttgacttc tccaaagcgt ttgactcggt 1200 gaaatggcct gttttgtgga agaccttact ggatctgggg acaccgaaac atcttgtgca 1260 cctactgaga cgcttatatg agaatggtac ggcctcggtg cgcgcggatg acgttctctc 1320 tggtaacttc catccaagtg ctggggttcg ccaaggttgc attgtatcgc cgctcttatt 1380 caatgcgtac acagagatca taatgcgcat cactttggag aactggacag atggtgtagc 1440 aattgggggg tataggattg caaacttgcg atacgcggat gataccactt tgtttgcaac 1500 tgatgcgcaa tgcttggggg agttactgtc gaggatggaa cgtgtgagcc ttgagtttgg 1560 attaaggatt aaccgcagta agaccaaggt gatgattgtc gatcgcgcta tgaataactc 1620 gccggacgtg actcaaatag cgggttgcga cgtggtccag tcctacatat accttggggc 1680 gttaatctca aataatggcg gatgcgtgga cgaaatcaga aggcgcatgg ctgtcacgag 1740 gtcggcgatg gacgggctga ggaaagtatg gagaaataga aacataacca aaaccacaaa 1800 gatcaggctc gttagaacac taatattctc tatatttcta tacgctgctg agacgtggac 1860 cgtgcgagac gtggaaagga agaaaataga cgctctggaa atgtggtgct ggagaagaat 1920 gcttggagtt tcgtggaccg actttcgtac aaatgtctcg atacttcagg aactcggcat 1980 caagcagcgt ctatttggta tagtacagtc tcgaatggtg aatttcttcg gacacgtttc 2040 gcgacgagat ggccggtcca tagaacgcct cgttgtacag ggaagcgttg atggtacaag 2100 accgcgcggg aggccaccaa tgcggtggac cgaccaaatt aaaactgcag tgggaggtcc 2160 cctaaatgag tgcagtagaa tggcctcgag cagggagaga tggcgcgaca tcgtgagacg 2220 catcaagtct gccccttcca acactacatg acgatcacga ccactctgtc aagagtgaca 2280 cgactgagaa gaagaagatt tagaatatag tgaaaactaa ttgctttgat aataaattaa 2340 taaagacaaa caatatata 2359 // ID hAT-1_TV repbase; DNA; INV; 4010 BP. XX AC . XX DT 08-OCT-2008 (Rel. 13.1, Created) DT 08-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE hAT transposons from Trichomonas vaginalis. XX KW hAT; DNA transposon; Transposable Element; hAT-1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-4010 RA Bao W. and Jurka J.; RT "hAT transposons from Trichomonas vaginalis."; RL Repbase Reports 8(10), 1196-1196 (2008). XX DR [1] (Consensus) XX CC hAT-1_TV is the consensus built from several highly similar CC members. The transposase encoded by hAT-1_TV is related to other CC hAT transposase. TSD is 8-bp long. TIR length is very short, only CC 4 or 5 bp. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 750..2885 FT /product="hAT-1_TV_1p" FT /translation="MTAWDPHVYDEWIPFKKTKYPPTPETLETKCGKKCFT FT TIGYYENEPNQSDKGRLFTYCAECSETLVQGQFVRCIKKSRKNDFKDHKCK FT FLLPMPNYVPAEHQVENTQQAVGSGNDPSLEEIIARLIASLDISLYATMKP FT AFKSFLESFAKWLKNKVNREQGINFTIPDIEFTRRGVREALITLGKTDSSR FT GLKFCEEFRIFSISLDCGTHGSRHGLFSCVCNPGKTDRHQFFDTTISTNWT FT YEDYHTWFDSLKVNKNHVAGVVGDGLPAQVKGLCHWRPEGALFPGLQTVYI FT RCLNHVLNNAIVHARKQCPLLNELMNRVHQIIIIFKNHDMRARYKIRVPSI FT PETRWLYIYDTLFFIFDNLETINTALRSGDLPLSLSRATREQLQWTRDGIP FT PLFKDALMIFKPLKQLQLWLESNKSMGAYAFLMIQQCQGMLDEMAGRLTPD FT GEEILRSITTIFKNDMKEYGRIDLLRFAFGFTPLARKIIRDKKGFGGKTSY FT PPIMQLKDVQVPTIPSLRIINREITFEKLGEILEEERTERVDEVAEHDEEI FT QQAEEEQMLNDDHAAETNADGSRNSLVWDVVDEEIKDLYTFMITILRQRAI FT AESTVDPTINKDDRAKELSDAYDFFIEQDDSHLINGHHLAQNDGKFWYMNG FT IKQLKAIRPIGKRMMSLSCSEADVERVISELRKTRNSMNESCKEEYVACRF FT FIKLNDDQFDI*" XX SQ Sequence 4010 BP; 1414 A; 652 C; 734 G; 1210 T; 0 other; tatggcattc caaacgggaa ccgcggttgg ttaccatgga tttccgtcac cggtcacatt 60 ttaaagattt tttaagcctt ataaaaatat gctttgtaaa tcattttttg taaatcggca 120 aaaatttggt aattttcttc aactaataaa tctgcgaata gcatttataa tacaagaaca 180 atcttttaaa atgcgtttga ttcatttttg tataatttca tataggcaca tatattcttc 240 cgggactttc cgtgatttta ccgtaaaatc attagtttac accaattttt ccgtattttg 300 aatagtttac tgcacttttt ccgtaaaatt gagttacggg aatccgcaaa atttccggac 360 ctcatttttc atagaatata ataataatat ttgataaaaa gatatattat ttatattttt 420 gatttttgaa ttacggaaaa agtgaattga taacttgatt tttttatatg gttaattttt 480 caaatgatat tatgaagatc taaaatatta aaagtaaaaa attgaattac ggaaaaagtg 540 aggtcaaaag tgaaataaat ttgcagaaca attttcattg aaataagctt tatttttcat 600 tattaataac aatgcaaaaa aatatttatc acgggttcaa aatttaaaat taaatatttt 660 aataatttac atcttattct ttattttata tttaacccgt gaaaatatca tttaaaaatt 720 gaatatattc aaaaagcttt atttttatta tgactgcatg ggatcctcat gtatatgatg 780 agtggattcc atttaagaag acgaaatatc caccaactcc tgaaacattg gaaacaaagt 840 gtggaaaaaa atgtttcact acgatcggct attatgagaa tgagcctaat caatcagaca 900 aaggaaggct ctttacatac tgtgccgaat gtagtgaaac attagttcaa gggcagttcg 960 tgcgatgcat taaaaaaagc aggaaaaatg attttaaaga tcataagtgc aaatttttac 1020 tcccaatgcc caactacgtt cctgctgagc accaagttga gaatactcag caagcagttg 1080 gctctggtaa tgatccttcg ttagaagaga tcattgccag gcttattgcg agcctagata 1140 taagccttta tgccacaatg aagccagcat ttaaaagctt cctagagagc ttcgccaaat 1200 ggcttaaaaa taaagtcaat cgcgaacagg gcatcaactt tacaatccca gacattgagt 1260 tcacacggcg cggtgtcaga gaggcattga ttactttagg caaaacagac tcctctcggg 1320 gcctaaagtt ctgcgaggaa ttccgcatct tctcaatttc cctggactgc ggaacacacg 1380 gcagccgtca tggtctattc tcttgtgtgt gcaatccagg caaaacagat cgccaccaat 1440 tcttcgacac aacaatttcg actaattgga catacgagga ctaccatacg tggtttgatt 1500 ctcttaaggt taataaaaac cacgtagctg gagtagtcgg agatggtttg ccagcccagg 1560 ttaaaggact ctgccactgg aggccagaag gagccctttt ccctggctta caaaccgtgt 1620 acattcgttg ccttaaccac gtcttgaaca atgctattgt ccacgcacgt aagcaatgtc 1680 cattgttgaa tgaactgatg aatagagttc accagattat cattatcttt aagaatcacg 1740 atatgagagc gagatacaag atcagggttc catcaatccc ggaaactcgc tggctctaca 1800 tatatgatac tctgtttttc atctttgata atctggaaac aattaatact gctttaagaa 1860 gtggagatct tccactatcc ctttccagag ctacacgtga acagttacaa tggacacgtg 1920 atggaattcc tccactcttt aaagatgctt taatgatatt caagccatta aagcaacttc 1980 aattatggct ggaatccaac aagtccatgg gtgcttatgc attcctgatg atacaacagt 2040 gccaagggat gttggatgaa atggcaggaa gattaacgcc agatggtgaa gagattctcc 2100 gatcaatcac tacaatattc aaaaatgata tgaaagaata tggtagaatt gatttgcttc 2160 ggtttgcttt cggctttaca ccgttagcaa ggaagatcat cagagacaag aagggttttg 2220 gaggaaagac aagctatcca cctataatgc agttgaaaga tgtgcaagtc ccaacgattc 2280 cttccctgag gatcatcaat cgcgagatca catttgagaa actcggcgaa atccttgaag 2340 aagaacgaac agaacgtgtt gacgaggtag ccgagcatga tgaagagatc cagcaagcag 2400 aagaggaaca aatgcttaac gacgatcatg cagccgaaac aaatgccgat ggttcgagga 2460 attccttggt ctgggatgtt gtagatgaag agataaaaga tctatacaca ttcatgataa 2520 caatcctcag acaaagagca attgccgaat caacagttga tccaacgatc aataaagacg 2580 atagagccaa ggaattgagt gatgcttacg atttcttcat agagcaagat gattcccatc 2640 ttataaatgg tcatcatctt gctcaaaatg atggcaagtt ttggtacatg aatggcataa 2700 agcaactaaa agcaataaga ccaattggaa aaaggatgat gtcactttcg tgcagtgaag 2760 cagatgtaga acgtgtaata tctgaattgc gaaagacaag aaattcaatg aacgaatcat 2820 gcaaagaaga atatgtagca tgccgcttct tcatcaagtt gaatgatgat caatttgata 2880 tttgattttg ttttttaata aaatttatta aataatttgt tttaaaaagt attattttta 2940 ttcgtaaaga tattagacag aaggtggcca gcctaatgtc tatcaatctt taatattagt 3000 agtaatttcc cgcatttctg aatgatgatt gattagcttt gttgttattg atgtaaaact 3060 ggataagtat taaataatta tattaatctt ggccagggga ctgctgatgg ccagtcagca 3120 gcggccacta tgaatcatta taattattta atacctaaat tgaggtaaaa agcaataaga 3180 cttgccgagc ctcatgcccc tacctcatat aaatagaatc acactttagt gaatcaacat 3240 ctataagaat gaagtgtttt gaagctgaaa taatattgaa tagaaattaa ttttttctga 3300 aattttgaat gaaattatgg gaaataaaaa aattactaaa tattgtattc atacaaatat 3360 attattttaa aggaaaaata gatgctgata gaaacgatga aaatttttga aaactttaca 3420 attaatttat gttcgttaac cacagagctt gcaatccaaa gctcacgtta gacaacaaga 3480 gtcatgaagt ctcatactaa gggatatcca acatatattt taataatcta atgtttgtga 3540 caaatattca acataaaatt aagttttcgt ttcgttacaa ggcattcata ctttggagat 3600 atcatattta atgtacagaa aagttaatat taaacaatac atgctaaaga attaataaat 3660 aagataataa aataaattta tacgtaatta ctttcattta atatgcttta tgctgtataa 3720 tattgattaa gaggcattaa aaggttccat agtctgcttt tataggcaag atttttgata 3780 aaaattttaa attttgaaaa atgatgagaa aaggaaaaaa aaggtaaatg agagcaagag 3840 gagggaaaaa aagaaaaaga aaagaaaggg aagggaaaaa taaaagaaag aaggacatat 3900 aaaaatttag atgcgcattc ttaatgccga ggcggcttgc cgccgaggcg cgattttttt 3960 aggcctctac ggtaacccgg caatgttttt ttaaatcttg tacagcccta 4010 // ID BEL-221_AA-I repbase; DNA; INV; 6467 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-221_AA_; KW BEL-221_AA-LTR; BEL-221_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6467 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 899-899 (2011). XX DR [2] (Consensus) XX CC Positions [5521-6078] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(79..3420,3424..6177) FT /product="BEL-221_AA-I_1p" FT /translation="MMPTTTRSQRICEKCKNPNDVTKMVACDRCHRWYHRT FT CAGVAERYIGKWSCERCVFTVTISDHSVSGKSGSTSRSSRLQLQLLRLEEE FT KRAQEKLIQEQQEQDRIRLEAKAALEKKYLDEKYALLIAEAEDEEADSRRS FT RRSRASRNSQVQEWVDGVDEAAGGNPVPEHIEDIFPPISVGTEMVVPRRGI FT VSRYTGAVPKITSNPSPNVEAVGRGNRPGPVINDNVEAEGIGSVAGINNPI FT TASTPIRIGGTVQSSQPVRSMYRQSVYTQPVYTQSLFPPSTQMQSLSILPK FT VVPRPPLPTFLFPEPISKPKTRTRVSFPPVSETSLPHTTGRPPVDLTVESP FT PSNSAQAKAPRSMVSTSSQLPPVLHSSQEPLAHSIEQQSSVQPASTLPSFN FT PSFQQAHTQSHVSRSVVCDQVDDQSQPPSALRSSASYTPSPPQHHVEPEIL FT LPQSAIHQPASSHSSPLQQNEPYATQLQQIQNQQAMWGQFQQQLSARQVVP FT KELPVFSGNPEEWPLFVSSFRNSTAMCGYSQSENLMRLQRCLKGKALEAVR FT SNLLLPTSVPKVMETLEILFGSPERLVQSLLSKVRSVPTPKAERLETLVNF FT GLVVQNLVGHLQAANQLAHLTNPTLLQELVEKLPPHLRLDWALYKKSAGPV FT DLGTFCEYMSAITSAASDVAHFTDFEGPRAGGNEKQRKEKAFVNAHVSSEP FT RKFEQQQVKKVESKERPCYVCHSVKHRIKDCNKFKSLSFGDRLKTVETHQL FT CAVCLVPHGKWSCKSTRICGVGDCDKKHHPSLHPGQPIGVKPTSESSGARP FT KSEAVVNVHCRIRSATVFRIIPVVLYGEKTRLSTFAFLDEGSSSTLIDREV FT ANLLNLGGDLKPLCLTWTGNVSRHEADSRLVSLRISGEESSKSFSLTSVST FT VSKLELPVQTLQYEELSRRYPYLAGLPVRSYENAVPTILIGLDNIRLSLPL FT KVREGRLGPVAAKTKLGWTIFGSIGDSKADSSPRILHICKNEDDNRLHDLV FT KGYFAVENLGVSVTCGPEADEDRRAKEILQQTTVKRADGHYETGLLWRYDV FT VELPSSYGMAERRLICLERKLRNHPELRASLDKQISEYLNKGYAHKATPQE FT LEESDPRTWYLPLGVVTNPRKPGKVRIVWDAAAKVNGVSLNGTLLKGPDLL FT TSLPTVLCRFRQRQIAIAGDIREMYHQLKIRKEDQQVQRFLYRSDPSMKPE FT VFVMNVATFGSTCSPCSANFVKNTNAMECKAEFPEAAAAVVENHYVDDYLD FT SRDTEEAMAKLASEVREVQQSAGFELRNWRSNSKQVLQILGEVAEDTRKDF FT SVDKESQVERVLGMAWLPDDDVFVYAVKLPDQFGEVTKRSILRFVMSVFDP FT LGLISNLLIHGKIIIQDLWRAKVGWDEAIPSDVLEDWKRWIKKLSELSQTR FT IPRCYFPGYNPDSLQTLELHIFVDASESVYACVAYFRIIDNGQPRCALIAS FT KAKVAPIKPLSIPRLELQAAVIGSRLAKSISEYHTIAPSRRFLWSDSTNVL FT SWINSDARRYRQYVAVRIGEILEETQPEEWRWISTKINVADEATKWGKGPD FT CRPNSRWFAAPDFLLKPESEWPRQKAPVVNTEEELRVVHVHHAVMLQPCID FT YNRFSKWERMLRTTAYVYQFMDRYLDLGKERSMQSGGVLKQEDLVRAERAL FT WRLAQAEEYAEELVVLRQANQSEDRLLNLENSSPLKQLSPFLDEFGVIRMA FT GRTEASPLAKYDSKFPIISPKNHRVTELLIDWHHRRFGHHNGETVVNEIRQ FT RFHISTLRTMVRKLSKKCQWCNVYKAQPAVPKMAPLPEARVTPFVRPFSLV FT GVDYFGPYMIKIGRSQVKRWVALFTCMVIRAVHVEVAASLSTESCKLAIRR FT FIARRGAPTQIYTDHGTNFVGASRELANQLATMNRELAETFTNTNTRWFFI FT PPSSPHMGGAWERMVRSVKTAMEVINEFRAPSEEVFHTIICEAESIVNSRP FT LTYVPLETSDQEALTPNHFILLSSNGVKQPEKAPTTEGEALRNGWNLCRYV FT LDQFWS" XX SQ Sequence 6467 BP; 1709 A; 1562 C; 1770 G; 1423 T; 3 other; ctctttaaaa atctcaactc atagaaagtg tgttgctgaa acggattaaa agattacgga 60 ggtgcagagt acgaatccat gatgcccacg acaacgagat ctcagcgcat ttgcgaaaag 120 tgcaaaaacc cgaacgacgt gacgaaaatg gtagcttgcg atcggtgtca tcgttggtac 180 catcgtacgt gtgcgggggt ggctgagagg tacatcggta agtggtcatg cgagcggtgc 240 gtattcacgg tgaccataag tgatcattcg gtatccggta aaagcggcag cacttcacga 300 tcttcacggt tgcagcttca actattacgg ctggaggagg agaaaagggc tcaagaaaag 360 ctgattcaag agcagcaaga acaggatcgt attcgactgg aagcaaaagc tgcgctggag 420 aagaaatatc tcgatgagaa atatgctctt ctaatcgccg aggctgagga cgaagaagcg 480 gatagtcgta gaagtcgtcg aagccgagct agccgaaata gccaggttca ggagtgggtg 540 gacggagtgg atgaagcagc gggaggaaat ccagtgcccg aacacattga ggatattttt 600 ccaccaattt cggttggcac tgagatggtt gtgcctagaa gaggaatagt ttctcgatac 660 accggagctg tgcccaaaat cacctcgaac ccatcgccaa acgttgaagc ggttggcaga 720 ggaaatcgtc ctgggcctgt tatcaacgat aacgtagaag cagaaggtat cgggtcggtg 780 gcaggtatca ataatccgat aacagcatcg actccaataa gaatcggtgg taccgtacaa 840 agttcgcagc cagtacgatc catgtataga cagtcggtgt acacccagcc ggtttataca 900 cagtcgttgt ttcccccatc cacgcagatg caatccctgt ccatactgcc aaaggtagtt 960 ccacgaccgc ccctaccgac gttcttattc ccagaaccga tctcgaagcc aaaaactcga 1020 acacgtgttt cgttcccacc cgtaagcgaa acatcattac cacacacaac tggtagaccg 1080 cctgtggatt tgactgtcga atcgcctccg tcgaattcag cacaagcaaa agctcctcgc 1140 agcatggtgt cgacgtcgtc gcagcttccc ccagttttgc actcgtcgca agaaccccta 1200 gcccactcaa tcgagcagca atcgagcgtg caaccagcat caacgttgcc atcgttcaat 1260 ccttcgtttc aacaggctca cacacagtcg cacgtatcga gatcggtggt gtgtgatcaa 1320 gtcgacgatc aatctcagcc tccatctgcg ttgcgctcct cagcttcgta caccccctca 1380 cctccacaac atcatgtcga gccggagata ttattgcctc agtcagcaat acatcaacct 1440 gcatcgtcac attcatcgcc actgcaacag aacgagccat atgctaccca gctgcagcag 1500 atacaaaatc aacaggcgat gtggggacaa ttccagcagc agttgtccgc cagacaggtt 1560 gttccaaagg aacttccagt gttttctgga aatccagagg aatggcccct ttttgtgagc 1620 agcttccgta actccactgc aatgtgtggt tactctcagt cagagaacct gatgaggctg 1680 cagagatgcc ttaagggcaa agcgttggag gccgttcgga gtaatttgct gctgccaaca 1740 tcggtcccga aagttatgga gacgctggag attcttttcg ggagccctga gcgattggtc 1800 cagtcgttgc ttagcaaagt acgtagtgta cccactccga aggctgaacg gcttgaaact 1860 ctagtgaatt tcggcctcgt tgtccagaat cttgttggtc atttgcaggc cgcaaatcag 1920 ctagcccacc tcactaaccc cacgcttcta caagagctgg tggaaaagct gccgccgcat 1980 ctccgattgg attgggcatt atacaagaag agcgccggtc ctgtggactt gggaacgttt 2040 tgcgagtata tgagtgccat cacgtcggcc gcaagtgatg tggcgcactt caccgacttc 2100 gaaggacctc gggccggcgg aaacgagaag caacgaaagg agaaggcgtt cgtcaacgct 2160 catgtgtctt cggagccccg gaaatttgag caacagcagg tgaagaaggt ggaaagcaag 2220 gaaagaccct gctacgtttg ccacagtgtg aagcatcgga ttaaggactg caacaagttc 2280 aagtccttgt cattcggaga tcggttaaag accgttgaaa cgcaccagct atgtgcggtc 2340 tgtttagtac cccatgggaa atggtcctgt aaatctacac gcatctgtgg agtcggagat 2400 tgcgataaaa aacaccatcc atcacttcac ccgggtcaac cgataggggt gaaaccaacg 2460 tctgaaagtt caggagcgcg gccgaaatct gaagcggtcg tcaacgtgca ttgtcggatt 2520 cgaagcgcca cggttttccg catcatcccg gtggtgttgt atggtgagaa gacccgatta 2580 tccacgttcg cctttcttga tgaaggttcg tcatcgaccc tgatcgatcg agaggtggca 2640 aacctgctga atcttggagg agacttgaaa ccactgtgtt tgacctggac ggggaatgtc 2700 tctcggcacg aagctgactc acggcttgtc agtctgagga tttccggcga agaaagcagc 2760 aaaagtttct cgttgacaag cgtcagcacc gtaagtaaac tggagcttcc tgtacagacg 2820 ctgcagtatg aagaactgtc tcgtcggtac ccgtatttgg ccggactacc agtaagaagc 2880 tacgagaatg ccgttccgac aatcctgatt gggctggaca acataagact gtcattaccc 2940 ctcaaggtgc gcgaaggacg acttggaccg gtggcggcaa aaacgaagct aggatggacg 3000 atcttcggga gcatcggaga ctcgaaagcg gattcgtcac ctcgaatatt acacatatgc 3060 aagaacgaag acgataatag gttacatgac ctagtaaaag gctactttgc ggttgaaaac 3120 cttggagtat ccgtgacatg cgggccagaa gcagatgagg atcgtcgagc gaaggagatc 3180 ctgcagcaaa cgactgtcaa gcgtgctgac ggtcactatg aaaccggatt gctatggcgg 3240 tacgatgtcg tggagttgcc atccagttat ggcatggccg agcgccggct gatctgcctc 3300 gagcggaaac ttcgaaatca tccagaattg agggcgagtt tggacaagca gatttccgaa 3360 tacttgaaca aaggctacgc ccacaaagcg acgccgcagg agttggaaga aagtgatcct 3420 cawcgtacat ggtacctgcc tctcggagtg gtaactaacc cacggaagcc cggtaaggtt 3480 cgcatcgttt gggacgcggc cgccaaagtc aacggagtgt ctctgaacgg cacactttta 3540 aaggggccgg accttctcac gtcgttacca acagtattgt gccgtttccg tcagcgacag 3600 atagcgattg caggtgacat ccgagagatg tatcatcagc tgaagatcag aaaggaagac 3660 caacaggttc agcggtttct ctacagaagc gatccatcga tgaagccgga ggttttcgtc 3720 atgaatgtgg caacctttgg gtcgacatgt tcgccctgtt cggcaaattt cgtgaaaaac 3780 actaacgcga tggagtgcaa ggcagaattt ccggaagcgg cggcagcagt ggtggaaaac 3840 cactacgtgg atgactacct cgatagccgc gatacggagg aagccatggc aaaactggcg 3900 tcggaggtac gagaggttca acaatccgcg ggattcgaac tacggaactg gcgttcaaat 3960 tcgaagcagg tgttgcagat tctgggagaa gtggcagagg atacaagaaa ggactttagc 4020 gtggacaagg aaagccaagt agaacgtgtt ctcggtatgg cgtggctacc ggacgacgat 4080 gttttcgtat atgcggtcaa actgccggac cagttcggtg aagtaacgaa gcggagcatc 4140 ctgcggttcg tcatgagcgt cttcgatccg cttggattga tttcgaacct gctcatccac 4200 ggcaagatca tcatccaaga cctttggaga gctaaagttg gatgggatga ggcaattcca 4260 tcagacgtgc ttgaagactg gaagcgatgg attaagaagc tctccgagtt gagtcaaacg 4320 cggataccac gctgctattt tcccggatac aatccagaca gtttgcaaac ccttgaactt 4380 catattttcg tggacgcgag cgaatctgtc tatgcttgtg tcgcgtactt ccggattatc 4440 gacaatgggc agcctcgatg tgccttgata gcttcaaagg ccaaagtggc accgataaag 4500 ccactatcga ttcccagatt agaattacaa gcggcggtga ttggaagtcg actagcgaag 4560 tctatttccg agtatcacac tatagcaccg agccgcagat tcctctggag tgactccaca 4620 aatgtgcttt cctggatcaa ctcggatgct cggaggtatc ggcaatacgt agcggtacgc 4680 atcggagaga tcctggaaga aacgcagcct gaggaatggc gttggatatc caccaaaatc 4740 aatgtagcgg atgaagccac aaagtgggga aagggccctg attgtcgtcc gaatagccgt 4800 tggtttgctg cgccagactt cctacttaag ccggaaagtg aatggccacg gcaaaaagct 4860 cccgtagtaa acaccgaaga ggaactgcga gtagttcacg tgcatcatgc ggtaatgtta 4920 caaccgtgca tcgattacaa tcggttctcc aaatgggagc gtatgctacg aaccacggca 4980 tatgtatacc agtttatgga tcgttatctg gatctaggaa aggaaagatc gatgcagtct 5040 ggaggagtgt tgaagcagga agatttggtt cgagctgaga gggcattgtg gcgacttgca 5100 caagccgaag aatacgccga agagcttgtt gtattgcggc aagcaaatca gagtgaagat 5160 cggttgctga acctggagaa cagcagcccg cttaagcagt tgtcgccatt cctggacgag 5220 tttggggtga tacggatggc aggccgaaca gaggcgtccc cacttgccaa gtacgattcg 5280 aaattcccaa taatctcgcc gaagaatcat cgtgtcaccg agctcctgat agattggcac 5340 cacaggcgtt ttggtcatca caacggcgag acagtagtta acgagattcg tcagcggttt 5400 cacatctcaa cactccggac tatggtgcgc aagttgtcga agaagtgtca atggtgcaac 5460 gtttacaagg cacaaccggc ggtgcctaag atggcgcctc ttccagaagc gcgagtaact 5520 ccgttcgtcc gtcctttttc gctggttggc gtcgactact tcggaccgta catgataaaa 5580 atcggccgca gtcaggtgaa gcgctgggtg gcgctcttta cgtgtatggt gataagagcc 5640 gtccacgtgg aggtagccgc atcactctct acggagtcct gtaagttggc aatacggaga 5700 ttcattgcgc gtcgaggcgc accgacacaa atctacaccg atcacggaac aaactttgtt 5760 ggggctagcc gagagttggc caatcagttg gctacaatga accgagagct tgcggaaaca 5820 ttcaccaaca ccaacacccg ttggttcttc atcccaccgt cctccccgca tatgggtggt 5880 gcctgggaga gaatggtaag atcagtgaaa acagctatgg aggtgatcaa cgaattccgt 5940 gcgccgtctg aagaggtttt tcacacgatt atctgtgaag cggaatcaat tgtcaactcc 6000 agaccgttga cgtacgtacc gctggagacg tcggaccagg aggctttgac tccgaatcat 6060 tttattctcc ttagttcgaa tggggtcaaa caaccggaga aagcacctac aacagagggc 6120 gaggcacttc gtaatggatg gaacttgtgc cgctatgtkc tggatcagtt ctggagtakg 6180 tggatccgag aatatctacc agtcgtgacc cggcgtacta aatggcacga cgaagtaaag 6240 cctgttaagg aaggtgatgt tgtgttcatc gtaggcgagg ccattcggaa tcgatggcct 6300 agaggtaaag tcttgaaggt catccctggg aaagacggtc gtgtccggca ggtagatgtt 6360 caaacagcta caggaattct tcgccggccc gtggcaaagc tggccgtgat caacgtactt 6420 ccggaaggta atcctgctgg accggagcag cattacgtgg aggggga 6467 // ID Gypsy-7_TCa-I repbase; DNA; INV; 4991 BP. XX AC chrUn_33; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_TCa_; KW Gypsy-7_TCa-LTR; Gypsy-7_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4991 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_33; Positions 55619 60609. XX CC Positions [1889-2395] - Reverse transcriptase CC Positions [3455-3931] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1499..4327 FT /product="Gypsy-7_TCa-I_1p" FT /translation="MKFDCLLGRDFLTNDKLSISLGKVVTVKNKRSEVDIR FT VQDESLELLHIDIGCNNKSEINLNINEQLSCDVHQQVKDIVNEFYVQPDRP FT DEPYTQLELKLHVKPNHAPFYSKARRLSYAEKEAVKQITNVLLSKKVIKPS FT SSQYSSPIVLVRKKDGNYRMAVDYRELNKLTTRDIYPIPHIEEQIDNLKGK FT AYFTRLDLKDAFHNIKLDPESTKYTSFVTFMDQYEYTKMPFGLANGSSFFM FT RFINTAFRDLLRQQKVQIYLDDILIPTKTMYENIAILKEVLKVLVKNKLEL FT RFDKCEFLLDKVKYLWYNIDKNGISPSDDNVKAIKEFPTPKNFRELHSFIG FT LLSYFRKFIRDFATVSKPLSELLKAKEFIWTDRENDCFESLKQKLISSPVL FT AIYSHKLETELHCDASSHGFGAVLMQKQLEDGKFHPVFFFSKRTTPTEASY FT HSFELEALAIIYSLERFRIYLQGIKFKIVTDCNSLKQTLERKEVNPRILRW FT SLILRNYDYTLEHRTSDQMRHADALSREFSISLISENTFERNLELLQNLDS FT KIVEIRKGLEKQQNKLYEMNNGLVYRKMGEKNLFYVLTSMEENIIRTCHEK FT VGHQGINKTIDYITRIYWFPEMRNKVQNHILNCFKCITYSTVANRVEGKLH FT CIDKGNVPFYTLHIDHYGPLEKSGNRHKYIFEIVDGFTKFVKFYATVSTNT FT DEVIKHLKNYFAYYSKPKRIISDNDTAYTSNKFRVFMQENGIQHILTATRT FT PQANGQIERVNRPLTAILSKFVSNHTSTWDKKLVDVEYAINNSVNRVTGET FT PARLLFGIEQVRNDDDLRELIKEIQEQDRNLEEMRTKAVVGINAVNKYNQQ FT YYDNKHKEPFKYKEGDYVMLKNVDTTAGVNKKLLPKFRGPYVISKVLDRDR FT YVVKDPEGFQLTQLPYEGIASPANMKLWRLNNVDDNRLC" XX SQ Sequence 4991 BP; 1777 A; 749 C; 1013 G; 1452 T; 0 other; cagacatcag aagtgggata agaacaagtg aacaagtgga gaagcttagt gatgtacccg 60 gattgtacga aaaaatcaag aagtcgtccg ttgccgcaat tcgcgctttt cacaaacttt 120 tgtacgatgg agtagatagt ggccgacgga ataggcagca cgtgcgtgat tttgcgggtt 180 ttgtattcgc cgatgacgcc gcgttagatc aaaaggtaga atgggcaacc acaaatttaa 240 cgttgggaga tttgacgtcc gtgtgcgcta ttttgaatat cgactacgag ggctcagcta 300 gagaggtggc ggaccgcata tgcacgaaac tgagagatct aaactcgttg acatcgattg 360 aagacacgga agaagaggaa gaagaagaag ctgacggaga ggacacgaat gatgacacac 420 cgcgaaggac accatcacta gagaatgaaa gatgggccat acaacaaaca ccaaaattcg 480 taatgaattt taaggatatc gaagattcga ttcgtccttt cagtggggaa gaatcatatc 540 aagtggaaaa ctgggtacag gactacgaag agatggcaga agttatgcaa ttttcagaat 600 tgcaaaaatt ggtatatgcg aaaaagtgcc taattgggtt ggcaaaacag tgtattcagt 660 gtgaacgagg actgaacagc tggtccaagc taaaagaaat ccttcaagac gaatttggca 720 agaagattag cagtgcagac attcatagac ttttatctga aaagaaaaaa caaagtaacg 780 agtcggtatt agagtactat tttaacatga aggagattgc agccagaagc caaattgaag 840 aagatgccgt aattcaatat attgtcgacg gaattccaga caaaatgtgt aataaaacgt 900 ttttatatga agcaaggaac tttaggcagc tgaagactaa attagaaacg tacgagaaaa 960 taaagagtcg accatcttca tcatcgatgg acaagcagcg tatcaataag aatgcagtaa 1020 taaacgcgag tgtaacaaac aaatcaaaga gtgcgacaaa tgtgaccgct aataacattg 1080 tgaggtgtta caattgtggt gaaaaggtca tattgcagca aagtgtagca aaccgaagag 1140 agaagttggt tcgtgttttg gctgcggatc gaaaagtcat caaaaaaagg actgctcagt 1200 taaacaagat acgacgagtc acctggtgga acaacaaatt ataccggctt ttatagtaaa 1260 ggtgcgttca aacaattttg ttggtgagtt aacggcaata attgattcag gtagtccaat 1320 ttcactgttg acttcccatt cgctaaaaac aaatgttaaa attttacctg ataattctaa 1380 tataacatat tgtggtctga acggtactaa attgaatata ttagggcaaa ttaacgaaag 1440 agttttagtt aatgattgtg agactgaaat tgaattcaaa gtagtacctg ccgacactat 1500 gaaatttgat tgcttactag gtcgagattt cttaacgaat gataaattaa gtataagttt 1560 aggtaaggtt gtaactgtga aaaacaaacg aagtgaagtg gacatcagag ttcaagatga 1620 atctcttgaa ttattgcaca ttgatattgg ttgtaacaat aaatcagaaa taaatttaaa 1680 tattaacgaa caattgagct gtgatgtcca tcagcaggta aaagatattg taaatgaatt 1740 ttatgtacaa ccagatagac ccgatgaacc ttatacgcaa ttagagttga agttacatgt 1800 aaagcccaat catgcgccat tttattcgaa agccagacga ttatcttacg cggaaaaaga 1860 agcggtaaaa caaattacaa acgtattatt gtccaaaaag gttatcaaac caagtagttc 1920 acaatatagt agtcccattg ttttagtgcg aaaaaaagac ggaaattatc gtatggccgt 1980 tgattaccgc gagttaaata agttgacaac gcgagatatt tatcccattc cgcacataga 2040 ggagcaaatt gataatttaa agggtaaggc ttattttacg cgtttggact taaaagatgc 2100 ctttcataat attaaattag atcccgaatc aacgaaatat acgtcatttg ttactttcat 2160 ggatcaatat gaatatacca aaatgccgtt tggtctagcg aatggtagtt cattttttat 2220 gcgttttata aataccgcgt tcagagattt attacgtcaa cagaaagttc aaatttactt 2280 agacgatata ttaattccca ccaaaaccat gtacgaaaat attgctattt taaaagaagt 2340 tttaaaagta ctcgtaaaaa acaaattaga gctaagattt gacaaatgcg aatttttatt 2400 ggataaagta aaatatcttt ggtacaatat agacaagaac ggtataagtc caagtgacga 2460 caatgtaaaa gcaataaagg aatttccgac ccccaaaaat tttagagagc tccatagctt 2520 tattggttta ttgtcctatt ttagaaagtt tattagggat tttgcgaccg tgtcaaagcc 2580 gttgtcggaa ctattaaagg caaaagaatt tatatggacg gaccgtgaga acgattgctt 2640 tgaatcatta aagcaaaaat taatatcgtc acccgtgtta gccatatact cgcataaatt 2700 agaaacggaa ctacattgcg atgctagttc acatggtttt ggtgcagttt tgatgcaaaa 2760 acaattggaa gacggaaaat ttcatccggt cttttttttt agcaaaagaa caacacctac 2820 ggaagcgtct taccatagtt tcgaattaga agcgttagcg attatttatt cattagaaag 2880 attccggatt tacctgcaag gtataaaatt taaaattgtc acggattgta acagcttaaa 2940 acagacatta gaaagaaaag aagtgaatcc aagaattttg cgatggtcat tgattttacg 3000 aaattacgat tatacacttg agcaccgtac ttctgatcaa atgagacatg ctgacgcgtt 3060 aagtcgtgaa tttagtattt ctttaatttc agaaaataca ttcgagcgca atttagagtt 3120 actacaaaat ttagactcaa aaatcgtgga aatcaggaaa ggtttagaaa aacaacaaaa 3180 caaattgtat gaaatgaata acgggttagt gtatcggaaa atgggcgaaa aaaatttatt 3240 ttatgtactg acatcaatgg aagaaaatat aattagaacc tgccacgaaa aagtcggaca 3300 tcagggcata aacaaaacga ttgactatat cacacgtatt tattggtttc cggaaatgag 3360 aaacaaagtc caaaatcaca tcttgaactg ttttaaatgc atcacatatt ccactgtcgc 3420 aaatcgagtc gaaggtaaat tacattgtat tgataaaggt aacgtaccat tttatacact 3480 tcacattgac cattacgggc cattagaaaa atcgggcaat cgtcataaat acatttttga 3540 gatcgtagat ggttttacca aatttgtcaa attttacgcg actgtctcaa caaataccga 3600 tgaagttatc aaacatttaa aaaattattt tgcttattat agtaaaccca agcgaattat 3660 ttcagataat gacacagcgt acacgtcaaa taaatttcgc gtatttatgc aagaaaatgg 3720 tatacaacac atcttaactg caacgagaac gccgcaggca aacggtcaaa ttgagcgggt 3780 taatagacca ttaactgcta ttttatcaaa atttgtctca aatcatacgt ccacgtggga 3840 caaaaaatta gttgacgtcg agtatgctat aaacaattca gtaaatcggg taaccggcga 3900 aacacctgcg agattattgt tcggtattga acaagtacga aatgatgatg atttgcgtga 3960 gctaataaaa gaaattcaag aacaagatcg aaatttggaa gaaatgcgta caaaagcggt 4020 ggtaggaatc aatgcggtca ataaatataa tcaacaatat tatgacaata aacataaaga 4080 accgtttaaa tataaagaag gcgattatgt tatgttaaaa aatgtagata cgacagcggg 4140 tgtaaacaaa aagttgctac caaaatttag aggcccctat gtaatttcaa aagttttaga 4200 tcgagaccga tacgtcgtga aagatcccga agggtttcaa ctaacgcagt tgccatatga 4260 aggcattgca tcaccagcga acatgaaatt atggcgccta aataatgttg atgataacag 4320 attatgttaa tttatttttg cagttcacta aagtaaattt tatttaagtt gagtgcatta 4380 attaggtaga atttattgta taaattgtgt atgaatgttt aaaataatag gtgaatccta 4440 agttagtgaa atatgtatga gatttgtttc cataagttat ttgcaatatt aggtgaaaat 4500 tagtgttgtg attagatgag atttgttctc atattatttg attgtttgag atttgttctc 4560 atgtgggtag ggatttgttc ctaatgaata gtttgagatt tgtcctcagt ggtagaaatt 4620 tgttctcata ttgaagttgt ttatttgaaa ttttatttca cagttagtaa atgatttgtt 4680 ctcatgttta aaaaatttcg agatttgttc tcacgtgtga ctaattattg tattttgaga 4740 tttgttctca gtttgttaga gatttgttct aggatttaat tagatgtcta tgttaggtta 4800 cttttaggat ttgttccaaa aatttcgaga tttgttctag gatttaatta gtttagtaat 4860 tatttagagc ttgagggcaa gctggaaatg tcaggatgga cgagctgtag tgtatttaaa 4920 aaactacctg ctctgtttgg gtccttgaga gaccagacct ttccctcagg ttctccaacc 4980 ctcgacagaa g 4991 // ID Copia-22_DPu-I repbase; DNA; INV; 4272 BP. XX AC scaffold_66; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_DPu_; KW Copia-22_DPu-LTR; Copia-22_DPu-I. XX NM Copia-22_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 707-707 (2010). XX DR Genome; scaffold_66; Positions 46568 42297. XX CC Positions [1754-2254] - Integrase core CC 'AGATG' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 335..4201 FT /product="Copia-22_DPu-I_1p" FT /translation="MTTISPKEQQAFVNCKTANSIWVKLAAQYLQNASAST FT HVLQARFFHYQFLKGHSMMSHITAIEGLAQQLEDLGSPMSQSQIMTKIVST FT LPTAFRNFMTVWDNLPETEKTMPQLITKLMNEQHRNVCTSPSSSKIAPVNT FT EIAAFAAKSESNRPTNSWERRALESGRPFQSREKRTVVNPDRKRPREECDF FT CGKFNHSELTCTRRINAEAGHPDEKCTYCRNYDHSAAVCSKRIRDERNESK FT TAPRTVKFARTDKDGDKDGVAFIAFGASQTPMNEDLWYADSGAIHHMCNSK FT SRMQNYTHTSTSRSVTGIGGVQLNVHGQGDVHVITKINGTPYNAVLQNVLH FT VPGLGTNLLSIPSTTERGIDVHFTGQSVSFTKNGTIIMAGSRVGKDLYELD FT ISTADSSQNDAIVCSAVANRQPISIWHQRLSHTCYKTIIKMVSNGLVDGIE FT LKDNSTPSSICPGCAFGKMHRLPFESGRRRATKVGEIIHSDVCGPMEPPSP FT GGARYYVSFKDDLSGYRVIYFLRLKSEVFDRFKLFVCKIESETNQSVRTLR FT SDGGGEFISKEFNEWLTEKSIRHEISAAHTPQHNGVSERDHRTIGEAERSS FT MHMNNIPLELWAESYNCANYTLNRTLSSSASVTPFELWFGRKPNLSHLRIF FT GCEAYMHVPDCDRHKLEPKSIKCLFVGYCETTKAYRLWDPVDRRLKISRDV FT IFNESPAHDVSTHHDSLKSSHPANECLNPSDQITATGPRHSSRTPQPKKLW FT AEFASTDKPDLTTPIIEPSSFKLAMASPDSDKWKIAMDEEYQSLMDNHTWS FT LAPLPHGRSAIGCRWTYKLKHGSDGTIQRYKARFVAKGYSQRPGLDFTETY FT APVVKLDSLRAILSIAANRDLDMIQLDVKTAFLYGEVAEELYISQPEGFIV FT TGQESLVCRLHKGLYGLKQSSRLWNLTFDSFLTSFGFISSSADPCVYFREN FT ASEFTIFALWVDDCLLCSTSSSVNTAILSYLTSHFSMTSGSADLFIGLQIS FT RDRAQRKLLLSQPQYLQRIIERFHMTNSNPSSTPADPNARLDSSMSPSTSD FT EIKAMNTTPYSEAIGCLTYAAVCTRPDIAFAVGQAARFCQNPGKAHWSAVK FT RILSYLAGTTTHGLLFSGKGRTTLVGYTDSDYAGDKDTRRSTSGFIFLHLG FT GAISWGSTRQSCTALSTTEAEYIAASNATKEAIWVQRLLLQIGHLQPGPVR FT LLCDNQSAISLVHNPAHHQRTKHIDVRYHFIREKQSDGVIDVVYIPTDHQL FT ADIFTKPTATHRFNFLRDRIGMASSVAI" XX SQ Sequence 4272 BP; 1243 A; 1136 C; 873 G; 1020 T; 0 other; ggttatgggc ccagagttaa ataagttttt ctgacatggc tgactctaaa tccaaactaa 60 gccacataga gcaatttaac ggcactaact tccagctgtg gaagtataat tgctggctga 120 ttcttgaaca aaacgatttg ctagacatag tagaggtaat ttcactgaga catattatca 180 cttgtcaatc agatcgttaa cccactttgt tctattgtaa ccttaacagg gaaagtcaaa 240 ggaacctgaa cctcatgcag tatctggctc gatcactaac tccaaagaga tcaaaaagtg 300 gaaaaagcaa gatacagatg ccagggtgat tctcatgaca actatatctc ccaaagagca 360 acaagcattt gtgaattgca aaaccgccaa ctcaatctgg gtcaaactag ctgcccaata 420 tctacagaat gcttcagcaa gcacccacgt actacaagca agattttttc attaccaatt 480 tctaaaagga catagcatga tgtctcatat caccgctatt gaaggacttg ctcaacaact 540 tgaagattta ggtagcccca tgtcccagtc acagatcatg actaagattg tctcgactct 600 cccaactgct ttcagaaatt ttatgacggt gtgggataat ctccctgaaa cggaaaagac 660 tatgcctcag ctcatcacga aattgatgaa cgaacagcat cgaaacgttt gtacatcccc 720 atcaagcagc aagatcgcac cggtcaacac tgaaatagca gctttcgctg ccaaatcaga 780 atcaaataga cccaccaaca gctgggaaag acgagctttg gaatcaggaa gacctttcca 840 gagtagagaa aagcgaactg ttgtcaaccc agacagaaag agaccacgag aagaatgtga 900 tttttgtggc aaattcaacc actccgagtt aacatgcacc agacgaatca acgctgaggc 960 cggacatccg gatgaaaagt gcacctattg tagaaactat gaccactcgg cagctgtatg 1020 ttcaaaacgc attcgggatg aaaggaacga atctaaaaca gcgcctcgaa cagtgaaatt 1080 tgctaggacc gacaaagatg gagacaaaga tggagtagcc ttcattgctt ttggggcaag 1140 tcaaacaccc atgaatgaag atctttggta tgcagattcc ggggcgattc accatatgtg 1200 taacagcaag tctcgcatgc agaactacac tcacacatca acctcgcgga gtgtgacagg 1260 aatcggtggc gtccaactga acgtccatgg acaaggggat gtccacgtca ttacgaagat 1320 caatgggact ccttacaatg ctgtacttca aaacgtccta cacgtaccag gactcggcac 1380 caatctactc tccataccat ccaccactga gcgtggaatt gatgtccatt ttacagggca 1440 atctgtttca ttcacgaaaa acggaacaat tatcatggca ggaagtcggg taggaaaaga 1500 cctgtatgag ttagatatct caacagctga ttcatcccaa aacgatgcaa ttgtatgcag 1560 cgccgtcgca aacagacagc ctatctcgat atggcaccaa cgtctatctc atacctgtta 1620 caagacgata atcaagatgg tgtccaacgg cctggttgac ggaatagaac taaaggataa 1680 ttctacacca tctagcatct gcccaggctg cgcgttcggt aaaatgcatc gcctaccatt 1740 cgaatctggc cgtcgcagag ctacaaaagt tggagagatc atccactccg acgtctgtgg 1800 accaatggag ccaccatcac ctggcggagc gagatactac gtctcattta aagacgacct 1860 cagtgggtat cgtgtaatat atttcttgcg gctaaaatct gaagtattcg atcgattcaa 1920 acttttcgta tgtaaaatcg aaagtgaaac taatcaatca gttcgcactc tgcgctccga 1980 tgggggtggt gaattcatca gtaaagaatt caacgaatgg cttaccgaaa aatcaatccg 2040 ccacgaaatc agtgctgcac acactcctca acacaatggc gtttcagaac gagatcatcg 2100 tactatcgga gaagcggagc ggagctcgat gcacatgaac aacattccac tcgaattgtg 2160 ggcggaatcg tacaactgcg ccaactacac cctcaaccgc accttatcga gcagtgcatc 2220 agtcactcca tttgaattat ggtttggacg caaaccaaat ctcagccact tgcgcatatt 2280 tgggtgcgag gcctacatgc acgtgccaga ttgtgatcgc cacaagttag agccgaagag 2340 catcaaatgt ctattcgtcg gatattgtga aactaccaaa gcatatcgcc tatgggaccc 2400 tgttgatcgc agactgaaaa taagccgtga cgtcatcttc aacgagtcgc cagcccatga 2460 tgtatcaact catcatgact ctctcaaatc ttctcatccc gctaacgaat gtctcaatcc 2520 cagcgaccaa atcacagcga ctggaccacg ccactcaagc agaacgccac aaccaaagaa 2580 actttgggca gaatttgcat ctacagataa acccgatctg acaactccca tcatcgagcc 2640 ttcttcattc aaactggcca tggccagccc tgattccgac aaatggaaaa tagctatgga 2700 cgaggaatac cagtccctca tggataatca cacctggtct ttggcacccc tccctcatgg 2760 gcgatcagca attggatgtc gatggacgta taagctgaag catggttctg atgggaccat 2820 ccaacgttac aaggccagat tcgtagcgaa gggatatagt cagcgaccag gcttagactt 2880 cacagagaca tacgcccctg tagttaaact cgattctctc cgagcaattt tatccatcgc 2940 tgcaaatcgt gacctagaca tgatacaact tgacgtaaag acggcctttc tttacggaga 3000 agtcgcggaa gaactgtaca tcagtcaacc agagggattc atcgtaactg gacaggagtc 3060 actcgtctgc cgcctccaca aaggactcta cggcctcaag cagtcgtccc gactgtggaa 3120 cttaactttt gattctttcc ttactagctt tggcttcatc agcagctcag cggatccatg 3180 cgtatatttt cgagaaaacg catccgagtt cactatcttt gccctctggg tcgacgattg 3240 cctcctatgc agtaccagtt catcggtcaa cacggctata ctatcctatc taacatccca 3300 cttctctatg acttctgggt cagcagatct attcattggc cttcaaatct ctcgagaccg 3360 tgctcaaaga aaattgctcc tatctcaacc tcaatacctg cagcgcatca tcgagcgttt 3420 ccatatgacc aacagtaatc cttccagcac tccagcagat cccaatgctc gactagactc 3480 ctccatgtct ccttctacct ctgacgaaat caaggccatg aacaccactc catacagcga 3540 ggccattgga tgtctcacat atgctgccgt ctgcacccgc ccagacattg cctttgcggt 3600 tggccaggca gcccgtttct gccaaaatcc aggcaaagcc cactggtctg ccgttaaacg 3660 catcttatca tatcttgcag gtactacaac tcatggtctt ctcttctccg gaaagggtcg 3720 caccaccctc gtcggataca cagactctga ctacgccggt gacaaagata ctcgtcgctc 3780 cacatctggt ttcatctttc ttcacctcgg cggtgccata tcgtggggca gtaccagaca 3840 atcatgcacc gctctttcca ccacagaggc ggaatacatt gcagctagta acgccaccaa 3900 agaagccata tgggtccaac gtctcctctt acaaatcggt catcttcagc caggccccgt 3960 ccgtcttttg tgtgacaatc agagcgcgat cagtttggta cacaatccag ctcatcacca 4020 acgcacgaaa cacatagacg tgaggtatca cttcatcaga gagaaacaat cagatggcgt 4080 catcgatgta gtctacatcc caaccgacca tcaactcgcc gacatattca cgaagccaac 4140 tgctacccat cgcttcaact ttctccgaga ccgcatcggc atggcttcct ctgtcgccat 4200 ttaatccata cgatattttt cttctttctg tttatcaatc cttatttctt gattactcgg 4260 tttgaggggg ag 4272 // ID Mariner-2_DMac repbase; DNA; INV; 459 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Mariner-2_DMac. XX OS Drosophila maculifrons OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; guarani group; OC guaramunu subgroup. XX RN [1] RP 1-459 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with less than eight percent divergent. CC Dmacul2cons. XX SQ Sequence 459 BP; 117 A; 111 C; 114 G; 117 T; 0 other; tttgggtccc gcatgagttg acgcatacaa atctcttgga ccgcatcaac gcttgcgatt 60 ctctgctaaa acggaacgaa cttgacccat ttttgaagcg gatggtgact ggtgatgaaa 120 agtggatcac gtacgacaac gctaagcgaa aaatatcgtc gtcgagaagc ggcgagccgg 180 ctaaaccatc gccaggaaag ttttgctgtg tgtttggtcg gattggaagg gaattatcca 240 ctatgagctg ctcagttatg gccagacact taatttggtc ctctactttt tgctaccgtt 300 tgaagcaggc aattgaccag aattggccaa tttgaatggt gttgtgttcc attaggacaa 360 cgctcgtcct tacacatctt tgatgacccg ccaaaagcta cgggagctcg gatgggatgt 420 cctatcgcac ccaccgtact caccagacct agccccaag 459 // ID BEL-15_AA-LTR repbase; DNA; INV; 244 BP. XX AC supercont1.135; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_AA_; KW BEL-15_AA-I; BEL-15_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-244 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.135; Positions 1111684 1111927. XX SQ Sequence 244 BP; 55 A; 65 C; 47 G; 77 T; 0 other; tgttcatgat catcaatgcg tcgacccctc gacgcgccgc cactctacca cttttctttc 60 ttgttaacat ttttatacga acagaattat taaagtacag tacatggaac aagtcggtcg 120 tgttttattc tttctccgtc ggaatcgttc gaaaaagtat tcgcgagagt attacggtcg 180 tgccggaagt cagttcccgg tccaagtttc tccctccctt tcgctacggt ccattccggt 240 aaca 244 // ID Gypsy-190_AA-I repbase; DNA; INV; 5019 BP. XX AC supercont1.100; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-190_AA_; KW Gypsy-190_AA-LTR; Gypsy-190_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5019 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.100; Positions 452536 447518. XX CC Positions [2092-2598] - Reverse transcriptase CC Positions [3798-4139] - Integrase core CC 'GCAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 110..1444 FT /product="Gypsy-190_AA-I_2p" FT /translation="MESNNRSIDGMINDMSFSDFECGARQNRQTYVPMSKW FT GIRYDNENGGLSVSDFMETVSRKAALFRFSDYEMYENFGELLGGNPLIWYR FT AFKDRFVTWYALKHDFVKQFTKPDNDHLIERQLQSRLQLPGESFGIYYACM FT ELLFKKLSTRKSEASKLEYLIRNLDDFYLNRIQEGQIRTTDDLVKFCEGFE FT RSREILRRRRLSQYPLAEPSLHGRVQTPRLKVNEVRFDNMNDRFEVFPKSA FT EHSEPSSSSFAQNFPSVSNLQPVYHPNCQSHESYPLRNSCANRSYCAAVHC FT PGTIEHPNNAHAISQQRDFAYCSQHSEPMNNRAFQVQGHCQKETIEQSGNC FT LQSFHPPRDKGYGFGSDEPHTQSESNPICVGNTDRSHHSQLINECAVIREF FT SGNCFNCKMTGHHWRNCQQPKRFFCHRCGLEGKSMHSCPNCSGNREPGSRS FT " FT CDS 1702..3822 FT /product="Gypsy-190_AA-I_1p" FT /translation="MHCAYVPYTFKGKTQVVLTVLSEAVSNPLILGIDFWS FT SFGISPTTVNEIEMFEPLTVPTVEPKNYSCTPEQQKILNEVVSEFQPYVEG FT SLTRTKVIEHCIDTGKNEPIKQKFHPVSPYVLKEIDRELNRMLNLKVIQPS FT SSPWSNPIVTVKKPDGSVRLCLDSRKLNAVTKKDAYPLPHIAGILSRFEGT FT RYISKIDLKDAFWQIPLERNSCEKTAFTVPGRGLFEFVVMPFGLHNAPQTQ FT CRLMDKVLGVDLQPYAFVYLDDVIIVTKTFSHHISILREIAKRLKTANLSI FT NLKKSQFCAPSVKYLGYVIDSDGIRTDPDKIKAIVDYPPPSSVKEVRRLVG FT LVNWYRRFIKDFSSIIAPVTDLLKQPKKKFVWTKQANDAFLKLKSALCSSP FT LLTCPDYDLPFFVQCDASDVGIGAVLWQKHSDGEKVIAYMSQKLTNSERKY FT SSSERECLAVLVAVERFRVYIDGVHFTVITDHESLKWLMNLKDPSGRLGRW FT ALKLQGYDHSIIHRKGVHNVVPDALSRAINSVMAVTIDDKDRAWYNELFKL FT VQEDPSEARDYQIRDGKLYRYQKANRGLIYNWKLVVNPDDRAEVLRQNHDD FT VAHLGVTKTINRICENYFWRDMRKAITEYVRSCDVCKASKPVNVNARAPMG FT NRKIAEYPFQVISMDFIGVLPRSKTGNTVLFVVSDWYSKFVFLHPMSKADT FT IRMCGFL" XX SQ Sequence 5019 BP; 1532 A; 948 C; 1110 G; 1429 T; 0 other; aagtggcgcc caacgtgggg ctcgatgctt ggaggttttg ttagatcttt tgaattgaaa 60 tatttttcta agtgtttagg aatttagtca ttttatttaa ttgtgtgcaa tggagtctaa 120 taatagatca atagatggta tgattaacga catgagtttt tccgattttg aatgtggtgc 180 tcggcagaat cggcaaactt acgtgcctat gtcgaaatgg ggtattaggt acgataatga 240 gaacggtggt ttaagcgttt cagatttcat ggaaacggta tcacgtaaag cagcattatt 300 tcgcttttcg gactatgaga tgtatgaaaa tttcggagaa ttgctaggcg gaaatccact 360 aatctggtat cgtgcgttca aggatcggtt cgtgacgtgg tacgctttaa aacatgattt 420 cgtgaaacag ttcaccaaac ccgataacga tcatttaatc gagagacaat tgcagtcccg 480 gctacaactg cctggggaga gttttggaat ttactacgct tgtatggaat tgctgttcaa 540 aaagttgagc acaagaaaat ctgaagccag taaactagaa tatctaatta gaaatctcga 600 tgacttctat ctgaatcgca tccaagaggg tcaaattaga acaactgacg atttagtaaa 660 attttgtgag ggattcgagc gtagccgcga aattctgaga cgtagaagac taagccagta 720 tccgttggca gaaccgtcat tgcatgggcg cgttcaaacc cccagattaa aagtaaacga 780 agttaggttt gataatatga acgatcgttt cgaagttttt ccaaaaagtg cagagcattc 840 tgaaccgagt agcagttctt ttgcacaaaa ttttccctcc gtgtcgaact tacaaccagt 900 ttatcacccc aattgccaat cacacgagtc atacccattg cgtaatagct gtgcaaatag 960 gtcgtattgt gcggctgtcc attgtcctgg tacaatagag catcctaaca atgcccatgc 1020 tatctcacag cagcgggatt ttgcctattg ttcacagcat tcagagccga tgaacaatcg 1080 agcttttcaa gttcaaggtc attgtcaaaa ggaaactata gaacagtcag gcaattgttt 1140 gcaatctttt catccaccaa gagataaagg ttacggcttc ggctcagatg aaccacatac 1200 acaaagtgag tcgaatccga tctgtgtagg taacaccgat agatcgcatc actcacaact 1260 gataaatgag tgcgcggtga tcagagaatt ttcaggcaat tgttttaact gtaaaatgac 1320 tggccatcat tggcgaaatt gccagcagcc gaagaggttc ttttgtcatc gatgcggctt 1380 agaagggaaa tccatgcact cgtgcccgaa ctgttcggga aaccgcgaac ccggatcaag 1440 atcgtagagg gaaaacttga ttccggacca ttcctcagtc tctctaatat agcaacatgt 1500 gaaattgtat atcgaccaaa tgataatcgt cctttcgtga aaaaggacat ttttggaaag 1560 ccattgatag gtttattgga ttccggtgct tcaatttcga tcctcggaaa aggttgttac 1620 gcgattgcag agtagctaga acttagtctt tttaaaatta attcctcgat tgctacagca 1680 gacggatcgg ttcatgaaat catgcactgt gcttatgttc cttacacgtt taaaggtaaa 1740 actcaggtgg tgctaacggt tttatctgaa gctgtttcga atccgctgat tctcggaata 1800 gatttttggt cgtcttttgg aatcagccct accactgtta acgaaatcga aatgtttgaa 1860 cctttaactg tgcccacagt agaaccaaag aattattctt gcaccccaga gcaacagaaa 1920 atccttaatg aagtagttag cgagtttcag ccctatgtgg aaggtagtct tacacgaaca 1980 aaagtaatcg aacattgtat cgacacggga aaaaatgaac cgattaaaca gaagtttcat 2040 ccggtctctc cgtacgtatt gaaagagata gatagagagt tgaacagaat gctgaatctc 2100 aaagttattc aaccatcttc gagtccgtgg tcaaatccga tagtgactgt taaaaagccc 2160 gacggtagcg ttcggttgtg tttagattcc cggaaactga acgcggtcac aaaaaaggac 2220 gcgtatccgc tgccacacat tgcagggatt cttagtcggt ttgaaggcac tcggtacata 2280 tcgaaaattg atttaaaaga cgcgttttgg caaatacccc tcgaacgtaa ttcgtgcgag 2340 aaaacagcat tcacggtgcc agggcgaggt ctgtttgaat tcgtagttat gccatttggc 2400 cttcataatg ctccgcaaac gcaatgccga ttaatggaca aagtcctcgg ggtagatctt 2460 caaccgtatg cgttcgtgta ccttgatgac gtgataatcg tcacgaaaac attttcgcat 2520 cacattagca ttcttcgcga aattgccaaa cgtttaaaaa ccgcaaattt gtctatcaat 2580 ttaaagaaat ctcagttttg cgcgccatcc gtcaaatatc tcggttacgt gatcgatagt 2640 gatgggatta gaacggatcc ggacaaaata aaagctatcg ttgactaccc gcctccctcc 2700 tccgtgaaag aagttcgtcg tcttgtcggt cttgtaaatt ggtatcgtcg gttcattaaa 2760 gatttttcgt cgataatcgc gccagtcaca gacctactca aacaacctaa aaagaaattt 2820 gtgtggacga aacaagcaaa cgatgcattt ttgaaactca agagcgcact gtgttcctcc 2880 cctctcctaa catgccctga ttacgatctt ccatttttcg ttcaatgcga cgcgtcggac 2940 gtcggtatag gcgccgttct ctggcagaag cactcagatg gcgaaaaagt gattgcctat 3000 atgagtcaga agctcaccaa ttcagagcgt aaatacagta gctcagagcg tgagtgccta 3060 gccgtattgg ttgccgtgga gaggtttagg gtgtatatag acggcgtaca tttcactgtt 3120 atcactgatc atgagtcgtt gaagtggttg atgaatctga aagatcccag cggccgactt 3180 ggtaggtggg cgctcaagct tcagggatac gatcattcaa ttattcacag gaaaggagtg 3240 cataatgtag tgccagatgc actttcgcgt gcgattaaca gtgtcatggc agttacgatc 3300 gacgataagg acagggcttg gtataacgaa ctgttcaagt tagtccagga agacccctcg 3360 gaagcgagag attatcagat acgtgacggt aaattgtatc ggtatcagaa agcaaatcga 3420 ggattgatat acaattggaa gttagtagta aatccagatg atagagcgga ggttttgcgg 3480 cagaatcacg acgatgttgc tcatttaggt gttacaaaaa cgataaatcg aatttgcgaa 3540 aactatttct ggagagatat gcggaaagca atcactgaat atgttcgttc gtgtgacgtg 3600 tgcaaggcaa gtaaaccagt caatgtgaat gcgcgcgcgc ctatgggaaa tcgaaaaatt 3660 gcggagtatc cgtttcaagt aatttcaatg gattttatag gagttttgcc gagatctaaa 3720 acagggaaca cagttctttt cgtggtgtcg gactggtatt caaaattcgt atttttgcat 3780 cccatgagca aagctgatac gattcggatg tgtgggtttt tataaaagga gtatttttaa 3840 aattcggagt accacaaacg gttatatctg ataacggtag tcagtttatt agccacgctt 3900 ttaaaaaatt cctagaaaat tacggaataa accactttaa aaatgcagtc tatcatcctc 3960 agaataatcc tgccgaacgt gtaaataagg tgatcatgtc ctcgattaga gcttatttag 4020 gagaaaatgc ctctcacaag gagtgggaca aagaaattgc aaaaattgag catgcgatca 4080 atacaagcgt acatgagtcc acaaaactgt ctccttattt tatagttttt ggacgaaacc 4140 acattcgttt cggtaaaaat tacgaccggt atccgaatgc tagcgaagat aataatgttg 4200 acgagttaca agaacggtta aaaacgttgg ataagatcaa agaaatcgta actagagaac 4260 tcgccgccgc gcatgaacgt caaacccact attacaatac tcgaagcaaa aacctaacaa 4320 ctttcaaagc tggtgatact gtgtggaaga aaaattttgc attgtcgaaa gcgtcggatt 4380 ttttttcgtc gaaattagct ccgttatata ttcaatgtcg tgttgttcgt cgaacaggta 4440 ataataccta tgatttagaa gatacggacg gtaaattttt aggaaattac tctatccaag 4500 atatcagacc gttgtgatta gtatatgaat taatttatag cttatcaagt ctttgcaaat 4560 tttgcggagc acacattttc ttttttttct cagtcttcgt ttgaatgtac ggagcattac 4620 catttcttta aaagtcttcg gcataaagtc acggagcagc atatatatct ttgttacggt 4680 aaacagagta gtctgatcac gaaaaagtct cttatatatg tattggagca tttgaattac 4740 atcaaagtct tttccaaagg gaataagcac cacaaaatag aatgatttaa gtttctagat 4800 ttaaaagggt attttgagat gaagaagatt tcaggacccg gaagcataag gcaacaagcc 4860 atttgtttca aatctcttag tggagcattt agccggtagc gaataaacta agaagtatat 4920 atatatattt gaaaaaaaaa aataaaattt tgccgttgtt tgaagttacc ttttttatga 4980 gcaaaatttc attttttttt tcgcggagaa aaggggtag 5019 // ID Copia-16_AA-LTR repbase; DNA; INV; 136 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_AA_; KW Copia-16_AA-I; Copia-16_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-136 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 944-944 (2011). XX DR [2] (Consensus) XX SQ Sequence 136 BP; 44 A; 26 C; 21 G; 45 T; 0 other; tgttggaaga taaaataaac atagtaggcg ttcagaccga acgtcaatca tagtatgcct 60 agatttaaat ttaaataaaa ttattctctt ctgtttctac cactgccaag agtagttcgt 120 tttactgcct actcca 136 // ID Mariner-31_HM repbase; DNA; INV; 3718 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-31_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3718 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1965-1965 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1063..2310 FT /product="Mariner-31_HM_1p" FT /translation="MVRKYIRTTSRGVNGNWTMQNMCHAIEAYNSNACSIN FT KAAVNFGIPEATLRRYIKKSSDQFPINNGRFRPIFSIEMEKLLAEYLIELS FT KRFFGMTSVQLRKFAFEFAEFNNVPHNFNKELKIAGIDWLSGFLKRHKNIS FT LRTPEMTSLGRIQGFNSPQVAIFFDLLRDLLTTHKFLPSRIFNADESGVPT FT VPTKIPKVFSIKGIKRVGKVVSAERGKTVTIVCSMNAVGIYIPPAFIFPRK FT NMRNDFMDNSPPDSVGFAHKSGWMTQEIFLDYLKHFAYHTKPTIADPVLLI FT VDNHNSHISFSAIEFCRKHSIVMLTLPPHSTHMITFMMQPLDVTFFSPFKT FT YYSQACDNWMVNHPGRPITEAQVPALVRHAYDRSATTGIARKGFQETGIWP FT FNPQIFSLTDFAPSLTSDDTX*" XX SQ Sequence 3718 BP; 1295 A; 490 C; 542 G; 1370 T; 21 other; ggggaacttg gggtaagatr acgaatgggg caagatgaaa acttaccgtt atcttcaaac 60 cttggctata tataaaagct gactgtagct aagatataag tagattrtat atgctaaatt 120 tatttatgct ttctttttaa atattattta tagaaaaaaa gataagtatt aaaaaagttt 180 ttccggtgtt cggaaaaaaa aaaaaattca cttgtatttg atttattgca taattacatc 240 gccatctaaa gttcagcgag agtttttcat ttagtttatt taatttagaa cccatagaag 300 tattattgag ctatgatacc ttgtttatgt atgtttctga atcattttaa agattgtgta 360 tcttgggggc aagatgactt atttgtcrtt ttgtaaaagt caatawttta ctataarctg 420 ttgaacgttg catagccgct actaaactta cagagagttc aataaaataa cttagaatra 480 taattaaagt ttcttgaaaa taaattgagt tgttgttata ttgtgaataa aatattattt 540 aagaatgtct agtttagtta tgcttatatg attwaaaaag atatactata tctatttgta 600 tatatttttt attatatata tatwtwtata tatatatata ttatatatat atatatatat 660 atatatatat atatatatat atatatatat atatatatwt atatatatat atatatatat 720 atatatatat atatatatat ttatatatat atatatatat atatattata tatatatata 780 trtatatata tatatatata tatatatata tatatatata tatatatata tatataatta 840 tatatatata tatatattta tatatatata tatatatata tatatatata tatatatata 900 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 960 tatatatata tatatatata tatatgtata tacatatttg tatgttattt ttatacattg 1020 ataaggctat taaatactat tgtgttgttt ctagttatta aaatggttcg caaatatatc 1080 cgaactacta gcagaggtgt aaatggtaat tggacaatgc agaatatgtg tcatgctatt 1140 gaagcttaca attccaatgc ttgtagcata aacaaagctg ctgtaaattt tggaattcct 1200 gaagcaacac tgaggcgtta tataaaaaag tcttctgatc agtttccgat maacaatgga 1260 aggtttcgtc ctatattttc aattgaaatg gaaaaattac ttgcagagta tttaattgag 1320 ttaagcaaaa gattttttgg catgacctca gtgcagctca gaaaatttgc attcgaattt 1380 gccgaattta ataatgttcc acataatttt aacaaagaat tgaaaatagc tggtattgac 1440 tggttaagtg gatttttaaa acgccacaaa aatatttctt tgcgaactcc cgagatgaca 1500 tctttgggta gaattcaagg ctttaactcc ccacaggttg caatattttt tgatttactt 1560 cgagaccttt tgacaacaca caagttttta ccttcaagaa tcttcaatgc tgatgaaagt 1620 ggggtaccca ctgtaccaac aaaaatccct aaagtatttt cgatcaaagg cattaaaaga 1680 gtaggcaaag ttgtttctgc tgaaagagga aaaactgtaa ccatagtttg tagcatgaat 1740 gctgttggaa tttatatacc tcctgctttt atttttccgc gaaaaaatat gcgaaatgat 1800 ttcatggaca actctccacc tgactcagtt ggatttgccc ataagtctgg ttggatgaca 1860 caggaaattt ttttagatta tttaaaacac tttgcctatc atacaaaacc aaccattgct 1920 gatcctgttt tgttgattgt ggataatcat aactcgcaca taagtttttc agccattgaa 1980 ttttgcagaa aacattcaat tgtaatgcta acattgccac cacattccac ccatatgatt 2040 acctttatga tgcagcctct ggatgttacc tttttttctc cattcaaaac atactatagt 2100 caagcatgtg ataactggat ggtaaatcat ccaggtagac ctatcactga agcacaggtt 2160 cctgcacttg taagacatgc atatgacaga tcagcaacaa ctggtattgc aagaaaagga 2220 tttcaagaaa ctggaatctg gcccttcaat ccacagatct tttctttgac tgactttgca 2280 ccatcgttga caagtgatga cacakcrtga ttgcacagca ccaacttatc agaaaatatc 2340 caataatcaa ccaggaatra gtatgtaaat tatatattat ttttgacagt tgaatttttt 2400 gttttttaaa accaaaagtt tgtttttatt tacaaacttt aacccaacaa tttgtatctt 2460 tatcatttaa ataagctaat aacttcagct tatctatagt atattagaat tatgttaaat 2520 aatagaatat taaataacgt waatttaatt aaccttatta attttataat tttatcatat 2580 attgtttttt agttgaarat caacagtcaa cattaaagcc tgtaattaat gcagtcacaa 2640 cagaattaac tgtattgttt aataataatg actctttatt gttgccaaat gctaatagtg 2700 aaccaaaatg cattcagtcg aacctagata taaatgttga tagaacccaa atagatttga 2760 accaagtgcc caaaactaaa actgtatcac ctgagaatat tcggccttat cctattgctt 2820 ctagagttac aaataatgct agycgtaaga gaaaatgtaa gcgagccgag gtggcaacaa 2880 gttcgccttt taaagctgct gttcaattag ctacactcaa gaaacaagaa aaattgrtaa 2940 agcaagattt gaagaaagtt tttaaagaaa agaaggatca gcttaaaata acagctgaaa 3000 ataaagatag aaagaagaag aatgacaaag gcaaaaaaaa gagtgataaa aatatctaac 3060 aaagtttgtg aaactcaaga aatttcatgt tggtattgtg aaattatcta tggtgatcct 3120 aacgatccat tgcttgatga tgactgggat atgtgtaaat cctgtaacca ctggtgtcac 3180 tggtgtcatt taacttgtgg ccagtatgtt gaacgcaaat ttatgtgtat tgtatgttac 3240 tcttgtgtag cttagtcatt atttttgtct ttacattatt ttattattcg ttaaagtatt 3300 ttgctttatt ttgttgggtt tgatttcgct attaactaat tagtagggtg ctatttttgc 3360 atagctatat ataaatatat aattaaattg ttttgaaaat taaatcaccc tttgtattaa 3420 taattttatt aactaaaatt tatattgctc tttttttgag rcggtaacag tatcatggag 3480 ttaacttttt tttgtagtcg ccatcttgcc ccgcatacgg ggcaagatga ctattcgagg 3540 tgacttatgt taatgagttt ttctccagct atttttaatt tgtataaggt ttttttgata 3600 caattgtggc gaataataat ttgttatcac ttcttgataa agttttactt attggaacag 3660 ttattgtcta agatataagt rtttttcaaa acgttcgtca tcttgcccca tgttcccc 3718 // ID BEL-1_HAS-LTR repbase; DNA; INV; 213 BP. XX AC AEAC01014393; XX DT 27-JAN-2011 (Rel. 16.02, Created) DT 27-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE LTR retrotransposon from the Harpegnathos saltator genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_HS_; KW BEL-1_HAS-LTR; BEL-1_HAS-I. XX OS Harpegnathos saltator OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. XX RN [1] RP 1-213 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Harpegnathos saltator genome."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AEAC01014393; Positions 7270 7482. XX SQ Sequence 213 BP; 45 A; 65 C; 54 G; 49 T; 0 other; tgttgcggat tcgccgagaa tcagcaacac tgcgcttgat gtttacacga tcatgatcgc 60 tgtgtcgccg ctgacgtcag catacacgtg cgaggcggct ctccgacatt ctaatgtctc 120 gcgcgaccct cgctgcgctc ggcgtgactg ttcagcctcc cgacgcgtga aaacacttat 180 tccaaacaaa acttgtgtcg cgaggctgaa cca 213 // ID DNA-3_CQ repbase; DNA; INV; 2527 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2527 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 44-44 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >94% CC identity. ~10-bp TIRs. XX SQ Sequence 2527 BP; 839 A; 446 C; 430 G; 806 T; 6 other; cccaagtagc atcgcgaaac atgtttcaaa tggctaagtc caaactatgt tatatcaggg 60 gcgtaaaaaa gttacatttg ttacaaccat ttggattttg tgggcgacac ttattgacaa 120 ctaatttgaa acaaaatata atgagtgtct ttgaccttct ccttgccttc tgggaaactg 180 aaaaagctga tgagttgtaa catattttgt tatcattaag ttatttttgt aacaacttga 240 caactttccg attgttttta tttagtaaga actatgttgt gaatatgtag cggtaacatt 300 cattagccaa acaaatttac gtaatgcgtg attgttttga ttcatttagt ttatgtccga 360 cgagtaaaaa ctagtgcgga ctttgaagaa aaggaaaggt ccataatcgc tttcggattc 420 cgaaatctgg cgtaagaacc agcgaaatgt agcgtattaa aaaggcgaaa ctggtaaaca 480 taaatacaga ccccattgaa cagcatacag aaaccagaac ctacaatgac tagcggatgg 540 attcatccca acccctgatt cggcctgacg tctgccatag attccgacaa tgatgcattg 600 caaaattagt ctgacgatta cgagcctgca gatttccctc aaacctaacc cagagaagat 660 ctccagaatt ggggaatcga ccaccggaac ataaaaaagt gtttttatac gtttttgtgc 720 ccaacaatgc gctttcaaaa gacctacaat ccaaagtgtc tagcggattt actggcgaaa 780 gctgcgtgat agtcatttcc ggtcggcaat tctttctttt ttcagtctgc acggcacaaa 840 tcatttttaa tctaacgaaa acatgaactg ctcaacacaa ctaacccaaa acgtttggca 900 tgttgacact tcccgctaca gacggacaaa atgtgaacac ttgtttgcaa cataacaaaa 960 catttgttga gatgctccaa aaacattatc tcaacaaaaa tgcaacaaac cgaaacttgg 1020 caaaaatgtc aaacagcagg gtcgtatcaa gggtcgtgtt aagacgtacg ataatgattt 1080 tacatgcaat ccgaatgttt tttactttga aatactgtag aaaatacttt taatatgaag 1140 tcttcataat tttcggaatt caaagataca cgggctaata atgatcaaaa atatgactct 1200 gaagcagaat gaatcggcat acgacgttct tacaacgaca ttcagtttgt tctagaggct 1260 tcatcttgca attttatgct tgtcgcgtaa aatttcttgg tttgcctgaa aaatgtgtga 1320 aaacattggt tcggattgaa acccgatctt gcctcattct catgtcaaac gtttctcaac 1380 gcagcaagct aaaacttgtt aaaagctctc tgagtgcagg gcgatcaagg gtcgtgttaa 1440 catatatctc aattggattt cgaaataata tccgaaggtc ttcggggctt tgaagacacg 1500 taaagcgtat tctacctggc acgacagctc ggtttctagc ggaatgccaa gcagttatcc 1560 gaaatagagc aaatgtatgt ctctctgaag gtttaattgg tttaaaattg gtwaaatata 1620 tttaattatt tctaatctaa ttgtaaatac tttcaattca aaatttcttt gtgcatagaa 1680 aatataatcg ataaattaaa tgtgtcagtt tagatagaca attattttgc tatatatttt 1740 ttttaattgg tgcctcgatt tgaccctgtt ttcgatggag gccatgctgt ttctttcatt 1800 cttgtaattt tttgatgatt acaaattaaa ttgttccaga attatgcaat taaactgatt 1860 aaatttggtt tgaaatgaat cacattattc aawgcttgag gttttcttct gactatctga 1920 cgtatttata gataaaaaat cattagcatt tattttgcag cmatttgaaa tatttttcaa 1980 ggcaaaatat ccttaaaaaa taaggcccgt aaaaaaatta aaagtataat taaatttcgt 2040 tcgaagaaag ttaattcgca gttkaatcag atgtttaatg accattataa ctaccataat 2100 aagatactgw tccgaagaag gaatgaaaac cattttcgaa cttgtaattt tttttcatca 2160 tcagcaaaaa cataattgag acatcaatat tcctaaatat gttttgcatt tgttgagggg 2220 aagaaatatc tgacagaata acttaacttc aagtaaacac tgaaatcaac gtgttaacat 2280 aaaattgctg caacttaaac ataacatacc tggtacaact ttacaaaatt tcttattttg 2340 acaacttaat cgaaacattt tgacaacatt ttttcaacaa wtttcttaac ttgaaagatg 2400 tttcaaatca gcttccaaat tgcgcttttc cagtcacgcc aaaggttaca cattagtcgt 2460 atctactcat tttgtttgtt tcaaacttgt ctggattaag catcggtgac aaataatgtt 2520 acttggg 2527 // ID Gypsy-37_AA-I repbase; DNA; INV; 4983 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_AA_; KW Gypsy-37_AA-LTR; Gypsy-37_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4983 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 2719232 2724214. XX CC Positions [2309-2809] - Reverse transcriptase CC Positions [4001-4468] - Integrase core CC 'GTTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 409..1539 FT /product="Gypsy-37_AA-I_1p" FT /translation="MKAELERTRLENEELNRMLEDKEQDQNEFCEARSSTM FT RESQTAANETLLSTMNNMSLSTLNIPECVPTAGEAELNKRDYDHWKNVLDA FT SMHLIQATDESTKIYLFRIKAGSLLLELLDGTTTQQGMPDEKQLPYSNAIA FT RLDAHFGSRAYMLSQRSKLANMVQRSGELNVQYVKRVAAAAKLCSYKSDEE FT FEAISRTVTRGSTDSRVRTLAYRVLIDGGSLNELIDQVRIREVELENENDY FT RRLHQQQSATVAAVSYHPGDHVQRQRGSFDASRGYNGGRGRRNFSRGPMRN FT QRRGRSCWRCLSMYHAAEDCFHTDKVCRNCNRRGHIARACSIQVKQEPRKR FT NWTGEEPEPPTKIAMIQKTEDVEEDSKVNDTSNV" FT CDS join(1634..3496,3500..4960) FT /product="Gypsy-37_AA-I_2p" FT /translation="MQQKIMFENNGELLQVTQNIQRDQGTSAIDRVVSNVP FT DEKAAHVTAYVAGIKVPFFIDSGAQVNTITSESFNAILQNETSKKSLHELQ FT YGSDKVLRAYATQGKIDVIATFSAELFVSDERPTTVEKFYVVRESRALLGF FT NTAVRYSLLAVGIDVPVVEMDSEWRCEFSIRHLHAVSSREFPKFNVPPVKL FT SYDKSMPPSRNVYTHIPAAFKDLTKQKLNELLESGIIEKVTKDMDRSFCSS FT LLVIPKAKSDIRLVVDLRGPNKCIIRTPFRMPTFESILLQLHGAQYFSTID FT LKNAFFHIELEENSRHLTNFFAGEALYRCCRLPFGLTNAPDIFQEVMQTVI FT LEDCEGTVNYLDDVMIFGRTKEEHDRNLQNVLKRLQEHNVMINQEKCSFGK FT EIVTFLGFRVSNEGWQIEDEKISAIKDARKPESTAEVKSFLGLVTFIDRFI FT PQRADKTLHLRQLSNAHDFYWNQDLEDEFEYLRSSSWKHIKTLGYYSRDDE FT TELYVDASPHGLGAVLVQYDAKSKPRIISCASKALTVAEKKYPQTQKEALA FT MVWAVERFSVYLLNISFTIRTDAESNEFIFGGSHRIGKRAVSRAEAWALRL FT QPYNFKVRTFAFTVKIIVKLYFMVVRVPGEMNIADALSRLVKESQSLESFD FT DSNEKHLLYFLDTGTVEFTWEDIEIEAEKDAELNRVREAIRDNRWDKSLKR FT YEAEAKNLRVLGGLVFMNDRVILPYALRDGALNSAHQGHMGAASMKKILRN FT YFWWPCMSKEVERFVEGCETCFRLSRKNPPIPLTCRELPEGPWEILQIDFF FT SFKDCGSGEFLVVVDTYSRYLHVIEMKNTDADSTNAALCKVFEVWGYPIAI FT HSDNGPPFQGDKFVKTWENRGVKIRKSIPLSAQSNGAVERQNKGLKDALTA FT AKLDNVNWKKALEQYLHMHNKVRPLSRLGVTPFELLVGWRFRGTFPYLWEN FT LPSEQIDRTDIREKDAVTKLDSKQYADLIRGAKESEIAVGDRVLLAQNTRQ FT KGEPMFSEDRFTVLTREGAKVVVQSDRGVQYSRNVQDIKKVPIMLRETGDE FT ESTDGGEHPAEAELPDVPIVPSESEISVEKPKRLRKKPSRFNDMILFYVYD FT " XX SQ Sequence 4983 BP; 1557 A; 965 C; 1228 G; 1233 T; 0 other; atggcgcagc cggtagcgaa gtcgacccgg aggaaaatgt gataaggtga gtaaaagtag 60 aagcagttag tgaatagtag gaatctgatc ccgaagatta ttgaaataaa tggacctcga 120 ttggaaaggc gagtgtctat atttgacccc gattggaaag gcgggtgtta tgaattcaga 180 ccccgatagg aaaggcgggt gtcataaatc tggactccga ttggaaagac gggtgtcatg 240 aattaattag accttggaac atgatcgaaa atggagatca agtagaaaaa aattattctc 300 gtgcgaagtg ttgatttata gcatgaatct aaggagagtg ttgtttcatt ctacaggatt 360 accagacgga aaattacgaa aaagcacttg gctgacgaat tatccagaat gaaagcagaa 420 ctggagagaa cacggctgga gaacgaggaa ctgaatcgaa tgttggaaga taaagaacaa 480 gatcagaacg aattctgtga agcgagatcg agtacgatgc gtgaaagcca gactgccgca 540 aatgagacct tgctcagtac gatgaacaat atgtccttga gcaccctcaa cattcccgag 600 tgtgtaccaa cagcgggcga agcagagttg aacaagcgag attacgacca ttggaaaaac 660 gttttggatg catccatgca tctaatccaa gctactgacg aatcgacgaa aatctatctg 720 ttccgcatca aagctggatc gttgttgctt gaactactcg atggtaccac aactcagcaa 780 ggaatgcctg acgaaaagca gttgccctac tcaaacgcaa ttgcacggtt ggatgctcac 840 tttggttcca gagcctacat gctatcacag cggagcaaat tggcaaatat ggtacaacgg 900 agcggagagc tgaacgtcca atacgtgaaa agggtggcag cagcggcaaa actgtgttct 960 tacaaatctg atgaagaatt cgaagctatt tcgcgaactg tgacaagagg atctaccgat 1020 agtcgggtga gaacactcgc ctacagagtc ctaatcgatg gaggttcttt gaacgagttg 1080 attgaccaag tacgcatccg agaagttgaa ctggagaacg aaaatgacta ccgtagacta 1140 caccagcagc agtccgctac tgtggcagcc gtttcgtatc atccgggcga ccatgttcag 1200 cgacaacgtg gttcgtttga tgcatccaga ggatacaacg gtggacgtgg tagaagaaat 1260 ttttcccgtg gtcccatgcg aaaccaacgg cgtggcagat cgtgctggag gtgtttgagt 1320 atgtaccatg cagccgaaga ttgtttccat accgataaag tatgtcgtaa ttgcaatcgc 1380 cgaggacaca ttgctcgagc ttgctcgatt caagtcaaac aggaaccgcg taaacgaaac 1440 tggacgggag aagaaccaga gccaccgacg aagattgcga tgatccagaa aacggaagat 1500 gttgaagaag actcgaaggt aaacgataca agtaatgttt aagattttga agttgagttt 1560 tattttatgt ttgcttttga attcactgat taaaactgat ttataaatga ctgaatttct 1620 gaaataaacc caaatgcaac agaaaataat gtttgaaaat aatggtgaac tattacaggt 1680 gacacaaaat atacaacgag atcaaggaac gtcggccatt gacagagtcg tatctaatgt 1740 cccagatgaa aaagcggctc atgtcacggc atatgtggct gggataaagg ttccattttt 1800 tattgactct ggtgcacagg tcaatacaat cacatcggaa tcgttcaatg ctattcttca 1860 gaatgaaaca tccaagaaga gcttgcacga gctacaatac ggatctgata aagtcctgcg 1920 cgcatatgca acccagggaa agattgacgt gatagcgact ttttcggcag agctttttgt 1980 ttcggacgaa aggccaacga cagttgaaaa attttacgtt gtacgtgaat cgagggcact 2040 tcttggattc aacactgccg tccgctatag tttgctggca gtgggtatag acgttccagt 2100 agtcgaaatg gattctgaat ggcgatgtga attttctata cgacacctgc acgcagtttc 2160 ctcccgggaa ttcccgaagt tcaacgttcc tccagttaag ctgagctatg ataaatcgat 2220 gcctccttcg agaaacgtct atacgcacat tccggcagcc ttcaaagatt tgacgaagca 2280 aaagttgaat gagctgctag aaagtggtat tattgaaaag gtgacgaaag atatggatag 2340 aagcttttgt tcttccttgc tagtgatacc gaaagcaaaa tcagatattc gcctcgttgt 2400 cgatttacgc ggccccaaca aatgtatcat cagaacacct ttcagaatgc caacattcga 2460 gtcaatattg ttgcagttgc acggtgcgca atatttttcc acgattgatt taaagaacgc 2520 cttcttccac attgagctcg aggagaattc aaggcatctc acgaactttt ttgctggaga 2580 agcactttac agatgttgtc gacttccgtt tgggcttaca aacgccccgg atatattcca 2640 ggaggttatg caaaccgtaa ttctggagga ttgcgaaggg actgtgaact atctggatga 2700 tgtaatgata ttcggacgga caaaagaaga gcacgatcgt aatcttcaaa acgtcttgaa 2760 aaggcttcaa gaacataacg taatgattaa ccaagagaaa tgttctttcg gaaaggagat 2820 agtgacgttc ttgggattcc gagtatctaa cgaaggttgg caaattgagg acgagaaaat 2880 cagtgcaatt aaggatgcaa ggaagcctga atcaaccgcg gaggtgaaaa gttttttggg 2940 ccttgttacg tttatcgacc ggttcattcc gcagagggct gataaaactc ttcatttgcg 3000 acagctgtcc aatgcacacg atttttattg gaaccaggac ctggaagatg aatttgaata 3060 cctgaggagc agttcttgga aacatatcaa gacactaggc tactatagtc gagacgacga 3120 gaccgaacta tatgtggatg catcacctca tggactagga gccgttttgg tccaatatga 3180 tgcaaaatct aagcctcgta tcatttcctg tgcttctaag gcgttgacgg tagcagaaaa 3240 aaaatatccg caaacacaaa aggaagcttt ggccatggtc tgggccgttg aacggttttc 3300 tgtatacctt ttgaacatca gctttacgat tcggaccgat gcagaatcaa acgaattcat 3360 tttcggagga tcacatagaa tcggcaagag agctgtatca agagctgagg catgggcttt 3420 aagactacag ccgtacaact ttaaggtacg tacatttgca tttacagtaa agattattgt 3480 taagttgtat ttcatgtagg ttgttcgtgt accaggagaa atgaacatag ctgacgcatt 3540 atctcgcctt gtcaaagaat cacaatcact cgaatcgttt gatgattcca atgaaaagca 3600 tcttttgtat ttcctggata ctgggacggt ggaattcact tgggaagata tcgagattga 3660 agctgaaaaa gacgcagagc taaatcgtgt tcgagaagca atcagagaca atcgatggga 3720 caaatctctc aagcgatatg aagcggaagc taagaatttg agagttcttg gaggtctggt 3780 tttcatgaac gaccgagtga ttttaccgta cgctcttcgt gatggagcgc taaattccgc 3840 ccaccaagga cacatgggtg cggcatcgat gaaaaagata ctcaggaact atttctggtg 3900 gccatgcatg tccaaagagg tggaaagatt tgtagaaggt tgtgaaacct gtttccggct 3960 atctaggaag aatccaccca tcccattgac ttgtcgtgaa ctccccgaag gtccgtggga 4020 gattctccaa attgattttt tctcgttcaa ggactgtgga tcaggagaat ttcttgtcgt 4080 agtagatacg tattcgcgat atctacacgt cattgaaatg aaaaacacag atgctgacag 4140 cactaacgca gcgttgtgca aagtttttga agtttgggga tatcctattg caattcacag 4200 cgacaatggc ccgccgtttc aaggcgacaa gtttgtcaaa acgtgggaaa accgaggcgt 4260 gaaaataaga aaatcgatcc cattgagtgc tcagtcaaat ggagctgttg agcgtcaaaa 4320 taaaggatta aaagacgctt tgaccgcggc taagcttgat aacgtgaact ggaaaaaggc 4380 cctagaacag tatttgcata tgcataacaa ggtccggccg ttgtcgcgat tgggcgtcac 4440 accatttgag ctacttgtag gttggagatt cagaggaaca tttccctatc tatgggaaaa 4500 cctaccctcg gagcaaatcg atcggacaga tatacgcgaa aaagatgcgg taacaaaact 4560 tgacagtaaa caatacgcgg atctgattag aggagccaaa gaatcggaaa ttgcggtagg 4620 agatagggtt ctgctggctc aaaatacaag acagaaagga gaaccgatgt tctcagaaga 4680 ccgtttcacc gttctaacaa gagaaggtgc aaaagttgtg gtacaaagcg atagaggtgt 4740 acaatactca agaaatgtac aagatattaa gaaagtaccc atcatgctgc gcgagactgg 4800 ggatgaagag tcgacagatg gaggagaaca tcctgctgaa gctgaattgc ctgatgttcc 4860 aatagtaccg tcggaatcgg aaatatcagt cgaaaaacca aagcggttgc ggaagaaacc 4920 ctctcgtttc aatgacatga ttttgtttta tgtatacgat tagagagtag agaagtagag 4980 cat 4983 // ID Gypsy-107_AA-LTR repbase; DNA; INV; 188 BP. XX AC AAGE02029780; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-107_AA_; KW Gypsy-107_AA-I; Gypsy-107_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029780; Positions 28307 28120. XX SQ Sequence 188 BP; 56 A; 37 C; 38 G; 57 T; 0 other; tgttgcagta tgtagcgacc cctcgtattc tctcccattc cattcatgcg atttacgcac 60 ccatacttgg agagtcagta cagagtagag tgacagacag cttggctgta gagcagagtt 120 tgctccgtga attcgataaa taaaagtttg ttaaatagtt aaaaccgtgt ttatttaatc 180 ataaaaca 188 // ID Gypsy-90_CQ-LTR repbase; DNA; INV; 2045 BP. XX AC AAWU01007292; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-90_CQ_; KW Gypsy-90_CQ-I; Gypsy-90_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2045 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 560-560 (2011). XX DR GenBank; AAWU01007292; Positions 91451 89407. XX SQ Sequence 2045 BP; 537 A; 493 C; 513 G; 502 T; 0 other; tgtaacagtt ggcctgtttg gcaaaataat ttatttattt gtttcaatgt ttaatgttat 60 ttatttacat tttgcaataa aattcacgag tcaaactccg cttgcacttt tctgtaaaac 120 ctccgttgta ttcaccacgg taactcaacc tgttttggac cgattgcgtt aagacccgga 180 gagagactct tgagagagat gagccgatct tgaactacct cagtgagaga acaccacagc 240 tagacccgag aacacagttg ctaaaaaggg atggggagtt cgctacttct gtcgcacgca 300 gttacccaga gagcatctca aggaatgacg taattccggt caaaccgctt gagcgtccgt 360 tgcgaccttc cctattctga ggactccagg gctcgttcac tttgtaccca gctatcaacg 420 agagtggaag tgcgcgtttt gtaccgcgag tttttggctt tttgtacaaa atgtaattta 480 gtatagtgac gtgtttctta tttggtcttc taattaaatt tccgtttttc gtagtttccc 540 gttagttagt cgtgtgtgcg ctagttgcga agaatttgtt aaatttgtgt gcgatcagga 600 gctaaggtaa agtgctcata taaaacccgt gcatgtgctc atggaatttc tccgaacagt 660 tcgcccgtta gcaacagcgc taaccatccc gcagcagcac cagcccagca gcagcgcgaa 720 cacacgagca gcagaagcag cagaggaaaa agggagacga agttccccgc gcaccgtact 780 tctcgttgtc gccgcacgtc accaggcgaa gcacgaggac gaagagcgtg agcgcacgct 840 aggcggcgat ccgtcaccac atcgccccgg tgaagaaggg agggtccctg aaccaccaca 900 cgaggacagc ggagaacgtt agtcccggcc ggagtcaacc ctaaggccgg agcttcctgt 960 ccggttgcga acacctgaca gccgcggaac cgggaaaagt ccaaaaccgg cccgagccac 1020 cccgtaaggc cgagtcccgc attccggtca gctgttcatc agctgacggc cacgtgtcgt 1080 ccctgagctg aacaaccagc ccaagtagca gagaacaaca gcagcaacaa gatcatcatt 1140 cggtgagtcc gttctatttg cacttgtggg aagaagcacg tccctagctg acgtagcgaa 1200 gtttttgccg gcctagtgcc gaggcggcag acgcgaatcc gcccttgtcg gacgcgtcca 1260 aagtttgagg agcgaacttt aatcccacgc gtttctacgc cacacgtgga tgacccccaa 1320 aatttaaacg cgcgcgcgcg caattttgaa gaagcagtag ccgatttgat cagctgattg 1380 ctggaagatg ggaagagcac atgagcacgg ttagggaaac agacagaagt aaagataaaa 1440 cgcacacacg caaagtaggt agaaggtaga aaaacacgaa gatggccact aacactattg 1500 tacaaatata agcatgctaa accaaaacga acaaacgaga actcaccttt ctttctggag 1560 ttatgaataa aaaccaccct agtttttttt cggataaata aaacgttcgt ttaactgtta 1620 tttaaaacct acttgtggag tttctttagt tgattgaatc tggagttatg tcacttttcg 1680 tcgtcttttt tcgttctgtt ttctgaatga cagagcttgg cgtcgaatgc actgccacga 1740 tcgaagcgct gcctggtggc aggtggcggc actgccaatg atgagctttt aggcatggga 1800 agcaacacac tggagtaagt tgtctgggtg atttcttgtg gttggcgtga cggattctcg 1860 gtttcgttta gtggcagttc tcggtgatct agctggttcg atcctgaatc ctccgatgag 1920 gatcagtaca aaatgggctc gtcttttaaa agagtctcga ccgggcaaac aaacatggcg 1980 tgcgatctcc ctttgggagt ggcgcataaa tctcacgctc accagatcga ctagcaacgg 2040 ttaca 2045 // ID Copia-19_CQ-I repbase; DNA; INV; 4032 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_CQ_; KW Copia-19_CQ-LTR; Copia-19_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4032 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 353-353 (2011). XX DR [2] (Consensus) XX CC Positions [1362-1889] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 129..4007 FT /product="Copia-19_CQ-I_1p" FT /translation="MDANRFSIQKLNNDNYSAWRFKVELLLVREDLWRCVD FT PGTKPAAESDETWKALDAKTRATVGLLIEDNQHGLIRGAATSKAVWTALQE FT HHMKTTLTSKVSLLKRICDKRFADGEDMAEHLFAMEELFDRLKNAGQPLAD FT TLQVAMILRSLPRSFDTLTTALESRSDADLTLELVKRKLLDEVAKRQGSAG FT SDSALKVGAYGRSSKKKTLVCHRCKMEGHKQYECPQRGRHVDGDTKARRTK FT PKKEDSVSFAFVAGGGAVADVTKPWVIDSGATSHTVADRCFFSELKNSEVH FT VALYTPGLAMNLLSVPAIVKKNAKVLFDAGGCRILRGETTVAVGVLKQGLY FT HLKQPEEVAYAASGHHNKDCKHVWHRRFGHRDLAAIARMEAENLASGLRVF FT DCGVDEPCGSCLKGKSSRKPFPAESTTSTKATLDLVHTDVCGPVEVPSVGG FT YRYFMTMIDDYSRFCVLYLLRNKSEVSDRIAEYVALVKTLFGRKPKTVRSD FT QGGEYTGEKLRNFYKQEGIQAQFTAGYSPQQNGVAERKNRYLTEMVRCLLF FT DANLPQCYWAEALSTAVYLQNLLPTKAVKSSPFEMWHGTKPDVAKLKVFGC FT KAWVHVPKEKRKKLTGTARELTFVGYSLEHKAYRFLDRSTQKVIVSRDVRF FT VESASAEVGHQNGDAKPVTVTSREETVVFDSVLNKPVPKPDEPALEQERAP FT EAEPECPMDGDNPAELDAEEEDASFRSALSDEEEFNGFSDVSPSSSTEDTT FT NKTGSTTPLRRSDRLTKGIPPERYQNATNVTKHTVREPRSYLEAVQGPEKA FT AWLAAMKEEIKSLQENQTWELVELPPGRKLVGCKWTFKKKEDESGRVVRHK FT ARLVAQGFSQRYGTDYDEVFAPVAKQTTLRTLLTIAARDDMLVRHLDIKCA FT YLYADLDESIFMKQPPGFQSDDNLVCRLKRSLYGLKQSARVWNAKIDGIFK FT QMGFQPGVADTCLYVKKTDKQMSFIAIYVDDMVIFCHSEKEFSRIRTTLEG FT QFKLSTLGELRQFLGIHIEKIDGHYTLNQRSYIEKLLGRFGCEEAKPSKIP FT LDPGYVKQKEETNLPTNTSYRSLVGSLLYVAVNSRPDICIGTSLLCRKVSN FT PSDRDWTEAKRTLKYLKGTKDLRLHLGNGDAGLECFVDADWAGNEGDRKSN FT SGLIIKFGGGVVSWSTRKQTCVALSSTEAEFVALAEGCQELLWAKKLLNDL FT AEDNQESVVVWEDNQSCIKMVESDRVERRSKHIDTKYAFTKDLHQRGVIDL FT RYLPTDEMVADVMTKPLDRTKLELHRNTLGVK" XX SQ Sequence 4032 BP; 1042 A; 1049 C; 1165 G; 776 T; 0 other; ggttatgggc ctgagttcga cgtagaaatc ggaggaacaa aagaaatttt cgacgccatt 60 gcaaaagttg accgcgtgtc gcgaaattac ttgttttatc gcgaaaatcg aatagttttc 120 tcgtgaaaat ggacgctaat cgctttagca ttcagaagct gaataatgat aattattctg 180 catggagatt caaagtggaa cttctccttg ttcgcgagga cttgtggcgt tgtgtcgatc 240 ctggcacgaa gccagctgcg gaaagtgatg agacgtggaa ggcgctggac gctaagacgc 300 gggccaccgt cggcctactg atcgaggata accaacacgg gttgatccgt ggagctgcga 360 cctcgaaggc cgtctggaca gcgttgcaag agcatcacat gaaaacgacg ctaacctcaa 420 aagtttcgct cttgaagcga atctgcgata aacggttcgc ggatggcgag gacatggcgg 480 aacatctgtt cgctatggag gaactcttcg atcggttgaa aaatgccgga caaccgcttg 540 cggacactct tcaagtcgcg atgattttaa gaagtctgcc gcgaagtttc gacaccctga 600 ctacggctct cgagagccgt tctgacgctg atcttactct ggaactcgtc aagcgaaagc 660 tactggacga ggtcgcgaaa cggcagggat ccgccggaag tgattctgca ctgaaagtcg 720 gtgcttacgg acgcagcagc aagaagaaga cgctggtctg ccaccgatgc aaaatggagg 780 gccacaagca atacgagtgc ccgcagcgtg gtcgccacgt ggacggcgac acaaaggcga 840 gaagaacaaa gccgaagaag gaggacagcg tctctttcgc ttttgtcgcc ggcggcggtg 900 ctgtagctga cgtcactaag ccgtgggtca ttgactccgg ggctaccagc cacacagttg 960 cggatcggtg tttttttagt gaactgaaaa acagtgaagt gcatgtcgcg ctgtacaccc 1020 ctggtcttgc catgaactta ttatcagtac cggcaatagt gaaaaagaac gcaaaagtgt 1080 tgttcgacgc tggtggttgc cggatcctgc gaggcgagac aacggtagcg gtcggcgtgc 1140 tgaagcaagg attgtaccac ctgaaacaac cggaggaagt cgcgtatgcc gccagtggac 1200 accacaacaa ggactgcaag cacgtttggc acagaaggtt cggtcaccgg gacctggccg 1260 ccatcgcacg gatggaggcg gaaaacctag caagtgggtt gagagtgttc gactgcggag 1320 ttgatgagcc ctgtggaagc tgccttaaag gaaagagctc acgcaaaccg tttcccgcgg 1380 agtcgacaac cagcaccaaa gcgacgttgg acctggtgca taccgacgtg tgcggcccgg 1440 ttgaagtacc ttccgttggc gggtatcgtt atttcatgac catgattgac gattactccc 1500 gcttctgtgt gctctacctg ctgaggaaca agtcggaggt ttcggacaga atcgccgagt 1560 acgtcgccct cgtgaaaacc ctcttcggcc gaaaacccaa aaccgtgcgc tcagaccaag 1620 gaggagagta caccggcgaa aagctgcgca acttctacaa gcaggaggga atccaggcgc 1680 agttcacggc gggctacagt ccgcaacaaa acggagtggc ggaacgcaaa aaccgctacc 1740 tgaccgaaat ggttcgctgc ctcttgttcg acgccaatct tccgcagtgc tactgggcgg 1800 aagctttgag cacggctgtg tatctccaaa atctccttcc aacaaaggcc gtgaagtcgt 1860 ccccgtttga gatgtggcac ggaaccaagc cggacgtggc gaaactgaag gttttcgggt 1920 gcaaagcgtg ggtccatgta ccgaaagaaa agcggaagaa gctcactgga acagcccggg 1980 aactcacctt tgtcggctac tccctggaac ataaagccta ccgattcctg gaccgaagca 2040 cacagaaggt gattgtcagt cgggacgtgc gtttcgtcga aagtgcgtca gctgaggtcg 2100 gtcatcagaa cggtgatgcg aagccggtca cggttaccag ccgagaggag acagtggttt 2160 tcgattcggt actcaacaaa cctgtaccga aacccgatga accagcgctg gaacaagaaa 2220 gagcaccgga agcagaaccc gagtgcccta tggatggcga caacccagct gaacttgacg 2280 cggaagaaga ggacgcatcc ttccgctccg cactgtccga tgaagaggag ttcaacggtt 2340 tcagtgacgt ttcgccaagt tcgtccaccg aagataccac caacaaaact ggcagcacaa 2400 caccacttcg aagatccgac cggctcacga agggcattcc acctgagcgg taccaaaacg 2460 caaccaacgt gaccaagcac actgtcaggg agccccgatc ttacctggaa gcagtccaag 2520 gtccagagaa agccgcctgg ctggcagcga tgaaggagga gatcaaatcg ctgcaagaaa 2580 accagacctg ggagctcgtg gagctgccgc ccggaaggaa actcgtcggc tgtaagtgga 2640 cgttcaagaa gaaggaggat gaaagtggcc gcgtcgtgcg gcacaaagcc aggttggtcg 2700 cacagggttt ctcccagcgg tacggtactg actacgacga ggtgtttgca cctgtcgcca 2760 aacagacaac gctacgaacg ctgctgacga tcgccgctcg ggacgacatg ttggtaagac 2820 atctggacat taagtgtgcc tacctctacg cggatcttga cgaatccatc tttatgaaac 2880 agcctccggg attccagtcg gacgacaacc tggtgtgccg actaaagcgc agtttgtacg 2940 gcttgaaaca gtcggcgcgg gtctggaatg caaaaatcga cggcatcttc aagcagatgg 3000 gattccagcc cggcgtggcg gacacgtgcc tctatgtgaa gaagacggac aagcagatga 3060 gtttcatcgc gatatacgtg gacgacatgg tcatcttctg ccattccgag aaggagttca 3120 gccgaatccg tacaacgctg gaagggcagt tcaagctgtc tactttggga gagttgcgcc 3180 agttcctcgg aatccacatc gagaagatcg acggccacta cacgctgaac cagcgtagct 3240 acatcgagaa gctgctcgga cggttcgggt gcgaggaagc taagccgtcc aagatcccgc 3300 ttgaccccgg ctacgtaaaa caaaaggagg agacaaattt gccgacaaac acgtcgtacc 3360 gttcgttggt gggtagtttg ttgtacgtcg cggtcaactc ccggccggac atctgcatcg 3420 gaacttcgct gttgtgcagg aaagtgtcta atcccagcga tcgcgactgg acagaagcca 3480 aacgcaccct gaaatacctg aaaggaacca aggacctccg tctgcacctt ggtaacggag 3540 atgcgggact cgagtgcttc gtcgacgcgg actgggccgg taacgaaggc gaccggaagt 3600 caaactctgg tctgatcatc aaattcggcg gtggagtggt gagctggtcg acacgcaaac 3660 aaacctgcgt cgccctatcg tccaccgagg ccgagttcgt cgcccttgcc gaaggttgcc 3720 aggagctgct gtgggcaaag aaactcctta acgatctcgc cgaggacaac caggaatccg 3780 ttgtggtgtg ggaggataac caatcctgca tcaagatggt agagtcggat cgcgtcgaac 3840 gtcggtccaa acacatcgac acaaagtacg cgttcaccaa ggatcttcac cagcgcggag 3900 tcattgacct ccgttacttg ccaacagacg aaatggtcgc ggacgtcatg accaaaccac 3960 tggaccggac caagctggag ctgcaccgta acacactcgg tgtcaagtaa acttctttgc 4020 gttgaggagg ag 4032 // ID hATm-11_HM repbase; DNA; INV; 3695 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3695 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 215-215 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(534..827,906..1367,1580..2236,2332..3408) FT /product="hATm-11_HM_1p" FT /translation="MADDDEDGFPRKSARTRNGRSTVREKNEESSLVTTRF FT RSPPPMRYTSPRTSRDCQPGTSSTSFNISLAPQKTNRCKRTIYILQYPLHM FT FAPNRYYFKIIFFRLPTLGEVCRRIYWLRASKNNAKLSCAQKTMSRSMLCI FT GGECKKSNRGEFTGVCILRELLSLWDKGGFDSRFRVTEQVVKKKLEKSYER FT YKKLCSLRTLMEINKIKEEQYKTKTESFIKEANCIFPVYKTDILKLIRDDE FT NRDKASKDRRYNYLQVVRVPLILTVNLSLAKKKKKISEDRVYMELSMNDVL FT DKWVPFLTRYKVSVRAETSLLSSLFKIGGVDLNTVPVSKSSLHRNKKIVIE FT NEARIVREENLDKVRGLKVIIHFDTKLVKHYRTDQKMSETVERLALSISSP FT QVNEPLDFLLGVLEIKSSKGQDQAIAIQNILEYYELTDQIIGCSADTTASN FT TGKYKGAIKFLVDHVLKRPVLWLLCRRHHISERHISHAMDYIQGSTKSRAI FT YKDFQKVWPEIRERANNVDQLLFFNYGREEYMVGTDFHRRVLDTKNFCITA FT LKQDCFQRGDYKVLCELIVIYLGGHVPGFQFKQPGAHHQARFMADCLYLLT FT MQLTSKINLMLNKSEKDMLELSTDFIVTFYGAYFLKSPMAAQAPSHDLDAF FT KLAYEMMSNKLYISKYGPLGKLLHASLVRHSWYLTPQLVILSIANRHLGLK FT ERAGLAQKLLGFNCPQPDDFEREQPQPPTDVVPMSVLSDFITKESWLLFTY FT LGINTGVINTWIKDNFENDSYKHFQHQIENLEVVNDRAERHIKLVQDFVAE FT SHEEGLLQDTLQVVQKNRKEVGKDMKKK" XX SQ Sequence 3695 BP; 1301 A; 571 C; 716 G; 1106 T; 1 other; aaaaaatttc acatgcaccc tatctcaatc attcttattt ggtcccagga cccaccaaaa 60 aaattttagg tctccaagtc aaggggatcc agtacaaggt atttttatat gctaggttgc 120 atggaattat tcgccccctt cctctcggtg cccctaattt atggattgaa tagtagaaaa 180 ttgtctattt tacatttatt tttgttctgg aaaaaaaatc tttcatttaa gaggtaatac 240 ttgtacatat tggtcattta aagacaaata aagtcgaaaa tcacataaaa attacatttt 300 aatgtgattt ttatccgatt ttagacatat aatgagattt taaacaaaaa ttgaaatatt 360 atgtcttgat gagtaaagtc agatagtctt gattcaaata tgttgttttg caataatttg 420 tcattgtttt agtggcatta attctctatt caaataattg tttttatgta aaatattata 480 agacccttac attttatact tttcagatta taacaaattt cttaaaatat ataatggccg 540 atgatgatga agatgggttt cctcgtaaat ctgcaaggac tagaaatggt cggagtactg 600 taagagagaa aaatgaggaa agctccttgg ttactacacg tttcagaagt cctcctccaa 660 tgagatatac ctctcctagg actagcaggg attgtcaacc aggaacctca agtacatcct 720 ttaacataag tttagctcca cagaaaacta ataggtgtaa aagaacaatc tacatacttc 780 agtatccact acatatgttt gcaccaaaca ggtattattt taaaatttaa ttaaaataat 840 gaattggaaa agcttttttt aggtaaaaaa cttttgaaac aaatatgtaa agactttttt 900 tttgaatttt ttttagacta cccacactgg gagaagtttg tcgtagaata tattggctca 960 gagcaagtaa gaataatgca aaactatcct gtgcacaaaa gacaatgtca agatctatgc 1020 tttgcattgg aggggagtgt aagaaaagta acaggggaga atttactgga gtatgcattt 1080 tgagagaact gttaagtctt tgggacaagg gtggatttga ttcaagattt cgggttacag 1140 aacaagtagt gaagaagaag cttgagaaat cttatgaacg ctacaagaaa ctgtgtagtc 1200 ttagaaccct tatggaaata aacaagatca aagaagagca atataaaaca aaaacagaga 1260 gttttataaa agaagcaaac tgtatttttc ctgtttataa aacagacatc ttaaaactta 1320 taagagatga cgaaaataga gacaaagcaa gcaaagacag aagatattga gttcctgaag 1380 agagcattca aaggtgaaat ggttaagttt gacactggga aagataaatc ttatcaagaa 1440 aaaatcatag gcggggaaaa acgaatgtta gatatgatag aaagtatgcg tagagaagaa 1500 caacataaaa aagaaaaaca agaaatgtca aagaaaaaag ctatagagga catagaaagg 1560 atgaaagatt ttgagataga actatcttca agtggtacga gtgccactga tactgacagt 1620 gaatttgagc cttgccaaaa aaaaaaagaa gatcagtgaa gatagagttt acatggagct 1680 ttcaatgaat gatgttcttg acaagtgggt tcctttcttg actaggtaca aggtcagtgt 1740 tcgagctgaa acttcacttc tgtcctcact tttcaagatt ggtggtgttg atctaaatac 1800 agttcctgta tcaaagagta gtctacatag aaacaaaaag attgtgattg aaaatgaagc 1860 tagaatagtg agagaagaaa atctggacaa agttcggggt ttaaaggtca tcattcattt 1920 tgatacaaaa ctggtcaaac actatcgaac tgaccaaaaa atgtctgaaa ctgtagaaag 1980 gcttgctctc agtatatcat ccccccaggt caatgaaccc ttagattttt tgctaggagt 2040 tctagaaatt aagtcatcta aagggcagga tcaagctata gctattcaga acatattaga 2100 gtattatgaa ctcactgacc agatcattgg atgtagtgcc gatacaacag cctctaatac 2160 tggaaaatac aagggggcaa ttaagttctt ggtggatcat gtgcttaaaa gacctgtgct 2220 atggctgctc tgtaggtaat taaattattt tgataacata ttttaaaagt ttgttgcacg 2280 aaactgttgc aaaattatta cattttatat cttttgaaaa ttgatcttta aaggcaccat 2340 atatctgaaa ggcatatttc ccatgcaatg gattatattc aaggttcaac aaaatcacgt 2400 gcaatatata aagattttca aaaagtctgg ccagaaataa gagaaagggc aaataatgtt 2460 gatcagttgc tttttttcaa ttatgggaga gaggagtata tggtaggtac tgattttcat 2520 agaagggtat tagatactaa gaacttctgc ataacagccc taaaacaaga ttgctttcaa 2580 aggggagatt ataaagttct ttgtgagcta atagtgattt accttggtgg ccatgtacct 2640 ggtttccagt tcaaacaacc aggtgcccac catcaagcaa ggttcatggc tgattgtcta 2700 tacttgctta caatgcagct gacttcaaag attaatttaa tgctgaataa aagtgagaaa 2760 gatatgcttg aactgtcaac tgactttatt gtcacatttt atggagcata ctttttaaag 2820 tctcctatgg cggcccaggc tccatcacac gatttggatg cttttaagct agcttatgaa 2880 atgatgagca ataagctata tatatctaag tatggccccc tgggaaagct gctccatgca 2940 agcctggtcc gtcatagttg gtacctgacc ccacagctag ttattctttc aatagcaaat 3000 agacatttag gactgaagga aagagctgga ctagctcaga agctgcttgg tttcaattgt 3060 ccgcaacctg atgattttga aagggaacaa cctcaacctc caactgatgt ggttcccatg 3120 tcagttttgt ccgactttat cactaaggag tcctggctgc tcttcacata cctaggtata 3180 aatacgggag taatcaacac atggataaag gataactttg aaaatgattc ctacaaacat 3240 tttcaacatc agatcgagaa tctagaagta gtaaatgaca gggctgagcg ccacataaaa 3300 cttgtgcaag attttgtggc tgaaagtcat gaagaaggtc ttctacagga tacattgcaa 3360 gttgtgcaaa aaaaccgaaa agaggttgga aaagacatga aaaaaaagtg attttatgta 3420 atttttatgt gatttttgac tttatttgtc tttaaatgac caatatgtac aagtattacc 3480 tcttaaatra aagatttttt ttccagaaca aaaataaatg taaaatagac aattttctac 3540 tattcaatcc ttaaattagg ggcaccgaga ggaagggggc caaagtaaac cactattacc 3600 ttgtactgga tccccttgac ttggagacct aaaatttttt tgggggtcct gggaccaaat 3660 aagaatgatt gagatgggtg catttgaaat ttttt 3695 // ID CR1-52_HM repbase; DNA; INV; 4463 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-52_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4463 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1880-1880 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 60..800 FT /product="CR1-52_HM_1p" FT /translation="MEVMMKNIEKLISSKLDEQKKSILXETAILLKNQEKI FT FTDILSANLKIITDRLDKLENEFNKNKSKLVIXEKDISDLKESINFQEETF FT LNKISQTNSLLNSEISNLKNKFIYLEKRSRQNNLRIDGLTELPAENWNDCA FT NELKNIFRNKLEISEDIIIERAYRIGKVKEDKSPRTMIIKLLDFKNKNKIL FT TSAKKLKGTGIFINEDYSNETMEIRKKLWEEVKRLRKEGKYAIIKYDKIFV FT REFRK*" FT CDS 881..3967 FT /product="CR1-52_HM_2p" FT /translation="MAXETIEFETYYSDYFKTANNILHNDFYHADKKIFNE FT KCCNTEYFEIETFKTQFKQSNEDFTVIHINIRSINSNIDKLKNFLLECNYS FT FSMICLTETWCTDNFSQENSNFQITNYNMISFERKILKRGGGILVYIRNDL FT EKKIRNDLSFSDADNEALTIEIINKNKKNILVTTCYRPPNGNIIKFSNYLN FT QIFIKNNNEKKYIFCIGDLNIDCLKYNENTNIKIFFDNLFQLGIFSLINKP FT TRVTSTSITAIDNILTNSFLDISLKTGIFKIDISDHFPIFFSLTQNSASLN FT NSKIKTYKRKINKSSIHEFKETLSKTNWSNVYLQCDRGLTNSAYNLFIDTF FT LEHYNKHFPIIEKEIKVKYLXCPWITSGIKKSSKKKQKLYNKYLKNRNEKN FT LTVYKQYKNLFEKIKKKSKKMYFSKQLQKHNGSIKKTWEIMNEVIGKTKTK FT SEILPSRIEEDGLEYTDKKTISDKFNKFFVNIGPXLASKINNSNNSFNNYF FT TDHTSSFTESDLKSEEFEEATKSIRKNKAPGIDDISSNVVYDILYILKDPL FT LKVFKSSIRTGVVPNKLKIAKILPIFKTGDTALLTNYRPISILPVFSKLLE FT RIMYNRLYKYLTANKILSECQYGFQTNHSTEHATLDLMDNIRSSFDKGKFV FT LGVFIDLTKAFDTVNHKILLEKMRKYGIKNQTIKWFTNYLNNRQQCVIIDG FT ITNTKLLKIKCGVPQGSILAPLLFLIYINDLPKASTNLDCIMFADDTNLFY FT SSTSINDLFDKVNIELQKLKTWFSANMLSLNAQKTKYILFHSKKQKNKLPS FT ILPSLSIEHKKIERTQTIKFLGVIIDENVSWTAHINAINTKISKSIGILFK FT ARPFLSQKNLKLLYFSFIHSYLTYANIAWGSTHKTKLMSLYRRQKHASRII FT HHKNKLTHAKPLLKEMKALNIYQINIYANILFMLKYKLGLVPNRFTENFFQ FT SNINKFNTRATGNFIVPRKRQKNSQFSISYRGPYLYNKLIPNYTSISLADN FT ITSLKSKLKNHVLNTDNYLEMF*" XX SQ Sequence 4463 BP; 1881 A; 698 C; 543 G; 1322 T; 19 other; ttggatttgc ggcattgaac gagcaagacg tgtttatcga gcgattartt atttaaaaaa 60 tggaggttat gatgaaaaat atagaaaaac twataagcag caaacttgat gagcaaaaaa 120 aaagcatttt aawwgaaact gctatacttt taaaaaatca ggaaaagata tttacmgata 180 ttttaagtgc gaacttaaaa ataataacmg atcgrcttga caaacttgaa aacgarttta 240 ataaaaacaa atcgaaactt gtaatwrttg aaaaagacat aagtgattta aaggaaagta 300 ttaactttca ggaagaaacc tttttaaata aaatatcaca aacaaacagt ctgttaaaca 360 gcgaaattag caacttaaaa aacaaattta tttacttgga aaaacgttca cgacagaata 420 acctaaggat tgatggatta actgagctgc ccgctgaaaa ttggaatgat tgtgcaaatg 480 agcttaaaaa tatatttaga aacaagctgg aaatctcaga ggacataata attgaaagag 540 cttatcgtat cggaaaagta aaagaagata aatctccacg aaccatgatt atcaaactat 600 tagactttaa aaataaaaat aaaattctaa catcagcaaa aaaactcaag ggaacaggaa 660 tttttattaa cgaagattac tcgaatgaaa caatggaaat aaggaaaaag ctttgggagg 720 aagtcaaacg gctacgcaaa gaaggtaaat acgctattat taaatatgat aaaatatttg 780 tcagagaatt tcgtaaatag gagagtgttt astaaaaaaa aaaaaaaaaa aaaaaaaaaa 840 aaaccrcaat ctttgtaatt aaattwtgta aaccgacaaa atggcaawcg aaacaataga 900 gtttgaaacc tattactccg attattttaa aaccgcaaat aatattttac ataacgactt 960 ttaccacgcg gataaaaaaa tatttaacga aaaatgttgt aatacagaat atttcgaaat 1020 tgaaactttt aaaacccaat tcaagcaaag taatgaagat tttacagtta tacacataaa 1080 cataagaagt attaactcaa atatcgacaa attgaaaaat tttcttttag aatgtaatta 1140 ttctttcagc atgatttgtc tcaccgaaac gtggtgtact gacaattttt ctcaagaaaa 1200 ctcaaatttt caaattacta actacaacat gatttctttt gaaagaaaaa tactcaaaag 1260 aggaggagga attttagttt atattcggaa tgatcttgaa aaaaaaatta gaaacgacct 1320 ctcattctcc gatgccgaca atgaggccct cacaatagaa ataatcaata aaaataaaaa 1380 aaacattcta gtaactactt gttacaggcc acctaatggc aatattatca aattttctaa 1440 ctatttgaat cagatcttta taaaaaataa caatgaaaaa aaatatatat tttgcatagg 1500 agacttgaac atagattgtt taaaatacaa cgaaaacacc aacattaaaa tcttctttga 1560 caatttattt cagctcggta tattctcatt aattaacaaa cccacccgag tcacctcaac 1620 ctcaattaca gccatagata atatactgac caactcattt ttagacatat ccttaaaaac 1680 aggaatattt aaaattgata tatcagacca ctttccaata ttcttctcac ttacacaaaa 1740 ctcagcgtca ttaaataatt caaaaatcaa aacatataaa agaaaaatta acaaatcttc 1800 tattcacgaa tttaaagaaa ccttatcaaa aacaaattgg tcaaatgttt atctacaatg 1860 cgaccgagga cttacaaatt ctgcttataa tttatttatt gatacttttc ttgagcacta 1920 caacaaacac tttccaatta ttgaaaaaga aataaaagtt aaatatttga ratgtccatg 1980 gataacaagc ggaataaaaa aatcgtctaa gaaaaaacaa aaactataca ataaatattt 2040 aaaaaataga aacgaaaaaa atcttactgt ttataaacaa tacaaaaacc tttttgaaaa 2100 aattaaaaag aaatctaaaa aaatgtattt ttctaaacaa ttacaaaagc acaatggtag 2160 tattaaaaaa acatgggaaa ttatgaacga agttattgga aaaaccaaaa ctaagtccga 2220 aatattaccc tccagaattg aggaagatgg attggagtat acagataaaa aaactatytc 2280 cgataaattt aataaatttt ttgtaaatat cggtcctaam ctggcatcca aaataaacaa 2340 ctcgaacaat tcatttaata actattttac tgaccatact agttcattta ctgagagtga 2400 cctaaaatct gaggaatttg aagaagcaac aaaatcaatt agaaaaaaca aagctccggg 2460 catcgacgac atttctagca atgtagtata tgatatttta tacattttaa aagatcctct 2520 tttaaaggtt ttcaagtcat caattagaac tggagttgtt ccaaacaaat taaaaattgc 2580 gaagatttta cctatattta aaactggaga cacggccttg ctaaccaatt atagacctat 2640 ctcaattctt ccagtgtttt ccaaactcct ggaaagaata atgtacaacc gactttataa 2700 atatttaaca gccaataaaa ttttaagtga atgtcaatac ggctttcaaa caaatcattc 2760 tacggagcat gcaaccttag acctaatgga caacattaga tcatcttttg ataaaggaaa 2820 atttgtattg ggagttttta ttgatcttac aaaagctttc gatactgtca accataaaat 2880 tctgctcgaa aaaatgagaa agtacggtat aaaaaaccaa actattaaat ggttcactaa 2940 ttatttaaat aacagacaac aatgtgttat aatagacggt atcactaata caaagttatt 3000 aaaaataaaa tgtggtgtcc cccagggatc tattcttgca cctttattat ttcttatcta 3060 cattaacgat ctccctaaag cctctacaaa tctcgattgt ataatgtttg cagatgatac 3120 caatctgttt tactcatcta cctctatcaa tgaccttttt gataaagtca acattgagct 3180 tcaaaaactt aaaacatggt tcagtgctaa catgttatca ttaaacgccc aaaaaactaa 3240 atatatttta tttcactcaa agaaacagaa aaataaacta ccatcaattc taccctcact 3300 aagtatcgaa cacaaaaaaa ttgaaagaac ccaaacaatt aaatttcttg gtgtgattat 3360 tgacgaaaat gtttcatgga cagcccatat aaatgccatc aacactaaaa tatctaaaag 3420 tattggtata ctttttaaag ctagaccatt tttatcgcaa aaaaacctaa aacttcttta 3480 cttttccttt atacatagtt atctcacata tgccaacatt gcttggggaa gcacacacaa 3540 gacaaagtta atgtcccttt atcgacgaca aaaacatgcc tctagaataa tacatcataa 3600 gaacaaactg acccacgcta aacccctatt aaaagaaatg aaagcgctaa acatttacca 3660 aattaatatc tatgctaata tattgttcat gctaaaatac aaattaggat tagttccaaa 3720 ccgattcaca gaaaatttct ttcaatctaa tatcaacaaa tttaacacaa gagcaactgg 3780 taactttatt gtaccccgga aaagacaaaa aaattctcaa ttttctattt cttatcgcgg 3840 cccttattta tataacaaat taattccgaa ttatacctca atttctcttg ctgacaatat 3900 aacttcttta aagtcaaaac taaaaaatca tgttctcaat acagataatt acttagaaat 3960 gttttagttc taatgataaa ttaatattta cgctcacatt caaataatca gcaatcaaty 4020 aaaaatggaa aatgtatttt tttttttaac aattaaaact agtaaaaaag ataatattat 4080 tattataaac cacaacacta acaccataca tagtataatt aaaaagaaaa gaaaaaaaag 4140 tctacatcaa taacatttaa aatatttgtt ttatgagaat cataatgttt atatgtgtac 4200 tttacgcaat accccaaggt tctcgatgat aagactaaac tcagtcttct acgagttccc 4260 tacaacagtc aaaacccttc ttatcaagtc atagcatatt taattattgc ttcctatgtt 4320 tatcttaacg gtattttgta aaatattaat ttttttgtat tggaagtata ttttgtttat 4380 gwtatcgcag tgtaaaaaaa aaaaaatttg taattaatat gtagttgtga aaatttaaag 4440 aaaaaaaaaa aaaaaaaaaa aaa 4463 // ID Gypsy20-LTR_Dpse repbase; DNA; INV; 168 BP. XX AC Unknown_singleton_29; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy20_Dpse; KW Gypsy20-I_Dpse; Gypsy20-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-168 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1118-1118 (2009). XX DR Genome; Unknown_singleton_29; Positions 42307 42474. XX SQ Sequence 168 BP; 42 A; 21 C; 57 G; 48 T; 0 other; tggagctgtg ttggtgtgaa gacgacagaa aattctggag ctgtgttggt gtgacgacga 60 cagaaaattc tgtagctgtg ttggtgtgac gacgacagaa aattctggag ctgtgttggt 120 gcgacgacaa cgaaaacatt ttggaattgg ttgttgttgg tgagtgca 168 // ID DNA4-9_AP repbase; DNA; INV; 177 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-9_AP. XX NM DNA4-9_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-177 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1958-1958 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 177 BP; 42 A; 53 C; 48 G; 34 T; 0 other; tactctgttc aaaatgagat cattactcgg cggcgaccag ttttcgtcgg gcggcggtca 60 gacatatggt cagacgccgt ccccactacg gcgacgcgtc gggcgctgcc acggaacaaa 120 gccacgccca tgcgcacaac taggccgccg agtaatgatc tcattttgaa cagagta 177 // ID Sola1-6N1_AAe repbase; DNA; INV; 1282 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-6_AAe; Sola1-6N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1282 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1289-1289 (2011). XX DR [2] (Consensus) XX CC 4-bp TSDs. This is an internally deleted version of CC Sola1-6_AAe. CC The identity is 91-95% to corresponding sequences of CC Sola1-6_AAe. XX SQ Sequence 1282 BP; 397 A; 246 C; 276 G; 363 T; 0 other; ctgcccataa atgcatacca gtcacactag caaaacagcg cacccaagaa aaacgcgata 60 gaagtcgtga acacctgcat gctttcttta ctccaattat attgaaaatg cgatgttttc 120 gcacccaaat ttgaactcaa atggtgcaat gaactattga ttacagtatt acatacattt 180 tgtttattaa ttttctcaaa tatttagtta atatacggaa aaatgtacgc gtgactgata 240 tgcgtttatt tgctcttgaa tgtacgagaa taaacgcata acagtatcag tgcttactat 300 gatgcatcgg tcttcacatg tgtattattg ttgttttgac agtttagtgt ccgcattcaa 360 agtcgcgaaa atgtctgctc tcgattattt gaaaagttat ggtacttctg atagttccga 420 ggatgagtgc aggaatttgg ccgatggcgt cccgttggac aatgacgcgg gctctgaaga 480 caagttcgct ggcttcgacg ataaggacgg caatgctggc ggcgacggca tatttcagtt 540 tccaccgaag acgtagcatt gggtaagcta tttaaaatcc ttggaggaca ctatatatgt 600 gacggggact gatgaaatgg actcagacgg aagtcaaaca gccgttgatg aatttgccaa 660 aaacgcaaga gaacaacaaa atgaaagcga cacacaacgc caagaagcgt cgattatcgc 720 accaagtaaa gcttgtcact tgtgggtgta taatgaacta cgggcttaaa ataccgaaga 780 aaacccgtat tcgtcttaac acgttgctct aggcattgga ttcacgagag caaaccagct 840 atatacgcca gtgcgtggaa cgaggggaac gagtggatgt tcaaagacgg cgtcggtaca 900 agtacgagga agacgatcca aggaaatcca atagctttgt atttcatctg gctgaatcaa 960 acgattcttc cgtttaagtg tgtcgtagat tttttctgac cactctgggt tacggcaaca 1020 attgtggata agtatatttt cctttacttg acttacttat caacacaatt tgtcgcaagt 1080 ccttatcaca caagtcgcga gactgacatg cgtttattag taacgtggga cagaccaagt 1140 tcgatattgt ttttaacatt tctagaacaa acgtattatt ttcttataat acaatgaaag 1200 tacatgcaat ttaatgacgt ttcccgtaat taactaaaaa atgacaaaaa gtcacgtggg 1260 actgttatgc atttatgggc ag 1282 // ID BEL-174_AA-I repbase; DNA; INV; 5874 BP. XX AC AAGE02031694; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-174_AA_; KW BEL-174_AA-LTR; BEL-174_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5874 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02031694; Positions 3577 9450. XX CC Positions [4891-5463] - Integrase core CC 'AGGAA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 553..5874 FT /product="BEL-174_AA-I_1p" FT /translation="MKQASLEKRQELIRQHAQMSDRGSSVSEYSAGTESSK FT VEKWLTDRQEDIKKDLQKQETTELPSRQAPLPRAPSRPSLIVGSIPQHHHQ FT PLSVEHRLDEQIAAPFQNMSLHNFPSPQAVQPSLPPAYMQIAARQVTGKDL FT PVFNGNPEDWPMFIRIFEETTAACGFSNVENQVRLQKCLRGSALETVRSRL FT LMPEGVPHVIKTLEMRFGRPELIIRSMLDRIQRLPAPKPDRLETLIDFGLA FT VQNLVVHLQAAKQDNHLNNPTLLQELVAKLPAQLKLDWARYKILNAAPTLA FT TFGNFMEQLIQAASEVSFDIPITVTNTKPEKLRSKEKAVAYSYSHDKQSSN FT SSRNVESTKKPARVCSMCSNPSHRLWGCGEFRKRSFEDRIKFVRQNNLCRT FT CLNFHQKWPCKTFKGCAVEGCQEKHHSLLHPPASRDPVHLSTSHSSLRTDK FT RQLLPFFRILPVTLSKDSVELSTFAFIDEGSSSTLLDRSLADVLGLSGPME FT PLTLQWTGNVTRKERNSECVQVHIAGRDKSQKYQLIDARTVDNLLLPKQSL FT PYRLLSEKYPYLRGLPIEDYEMVEPKILIGVDNLSLGIPLKIREGGPKEPT FT AAKCRLGWTIYGGGGNSPLSSVSVHFHASAIQDPDRELNQQLSDYFTLDKA FT GADLPFPIPESEAEKRARRLLEETTRRVGQRFETGLLWKADDVKFPDSYPM FT AVRRLEALERKLNKQPVLKSRVKQLLDEYVAKGYYHQASPDELVAAEERNV FT WYLPLSIVINPRKPEKIRLVWDAAAQVHGVSFNSALLKGPDLLTPLVAVLS FT HFREHRFAVSGDIKEMFLQMLIRAEDRQAQRFLWRTNPDEAPHIYIIDVAT FT FGSTCSPSSAQFVKNLNAQDYVQQFPRATIAIVKFHYVDDYLDSFATEEEA FT IEVVQDVKYIHSMGGFEIRNFLSNSERVLAAIQEEPRRNEKDLNITRAEKV FT ESVLGMKWSPNADMFSYALSPKEEISKIFESAYAPTKREMLKVVMSLFDPL FT GFITFFLIHGRILIQDAWAAGIGWDTPICGPLYHKWRLWISGFTLLNDLRI FT PRTYFRHAEKGPVQLHVFVDASTEAYACVAYFSAMGPAGIEVALIAARGKV FT APIKVLSVPRLELNAAVLGTRLAESVIDSHTYQIERKLFWSDSKTVLAWIN FT SDHRRYHQFVAVRIGEILSSTRMSEWRYVPSSQNPADFATKWGNGPNFDME FT TQWFRGPAFLYHPEDSWPNTRCFTTATEMSNAQHVHWRPAEPVIDVTRFSK FT WERLQRTGAYVLRFIENLKRRQNGEALLLGILSQSELQRAEKALWKQAQNE FT VFAEERHQLQRTQGSAEARHNLLPKTSSIYKLWPYLDEDGILRMRGRIGAA FT WYAPYEAKYPVILPDTHLISSLLADWFHRLYHHAHSETVVNEMRQRFEIRK FT LRALVKRVVKNCFVCHLAKAIPRPPPMAPLPKQRLTPFIRPFTYVGVDYFG FT PLLVKVGRSQSKRWVALFTCLTVRAVHLEVVQSLSSESCIMAVRRFIARRG FT APAEIFSDNGTCFVGANKQLQQEMQVRNDALATTFTNTNTRWLFNPPNAPH FT MGGAWERLVRSVKQAIGTIIDAPRLPDDETLATILSDAESMINSRPLTYVP FT LDSADEESLTPNHFLLGSSSGVKQPPSEPKNFRTNLRSSWTLAQHITDTMW FT RRWIKEYLPVIRRRGKWLEEVRDIREGDLVLIVGGAVRNQYVRARVEKVFF FT GRDGRVRQALVRTATGVYKRPVVRLALLDVGSFGEADEESSEAADHHQASR FT EGV" XX SQ Sequence 5874 BP; 1648 A; 1375 C; 1467 G; 1384 T; 0 other; aaatctttta gaatcttgtc ctcgtcaaca cgatggataa aggaggtacc tctaaacctg 60 gatgtcgatc gtgtaaccgg ccagacaccg ctgatgctat ggtcgcttgc gatgtatgtg 120 gtgattggca tcacttcgga tgcgtaggag ttgatgcgag cgtggagaat catcattgga 180 tatgccgagg atgtgaagta aaaggccacc cgtccggctc aaacgccgcc ggtgataaag 240 atggtgtgcc acaagcgcat acgatgaaaa ctcgatcgaa agtcgctagc agtaaagcat 300 cgtcacggta caaaaggacc gtggacatcg tcgtccgttc gaaaactggg agtgccacat 360 ctaaccgatc agcgatcatc gaagaacagt tgaagttatt ggaggagcag gaacgtttga 420 aggaagaaga attgcgacaa aaggaggagc tagagcggct tgagtttgaa gaaaagcatc 480 gacaaattga agaattgcgt cgtcgtcagg tagaagagtc gaagctacgt gaagacagag 540 ttatggttgc gaatgaagca agcatctctc gaaaagcggc aggagctgat caggcaacat 600 gcccagatga gcgatcgagg atcgtcggtc tccgaatact cggctggaac ggagagcagc 660 aaggtagaaa agtggttaac ggatcgccag gaagacataa agaaagatct ccaaaaacaa 720 gagacaacgg aacttccttc gcgacaagcc cctctaccac gggctccatc acgaccgtcg 780 ttgattgtcg gaagtattcc tcagcatcac catcaaccac tttccgtgga acatcgtcta 840 gatgagcaga ttgcggctcc atttcaaaat atgtcgttgc acaattttcc gtcgccacaa 900 gcagtccaac cttcattacc ccccgcatac atgcagattg cagcacgtca agttactggc 960 aaggatcttc cggtgttcaa tgggaacccg gaagactggc cgatgtttat tcgaattttt 1020 gaagaaacaa cggcagcatg tggattttcc aacgtggaaa accaggttcg gctccaaaag 1080 tgcctacgag gtagcgccct ggaaacagtt cgtagtaggc tattgatgcc cgaaggagtc 1140 ccgcatgtta tcaagaccct cgaaatgcga tttggccgac cggagctgat tatacgatca 1200 atgttagatc ggatacaacg acttccagct ccaaagccgg accggcttga aacattaata 1260 gactttggcc tggccgttca gaatttagta gttcatctcc aggcggcaaa gcaagataat 1320 caccttaaca acccgaccct acttcaggag cttgtggcga agctgccagc gcagctcaaa 1380 ctcgactggg ccaggtacaa aattcttaac gctgcaccga ctcttgctac ctttggaaac 1440 ttcatggaac agttgataca agcagcaagc gaagtttcct ttgacattcc gataaccgtg 1500 acgaatacaa aaccggaaaa attgaggagt aaagaaaaag ctgtagcgta ttcgtactct 1560 catgacaaac aatcgtcaaa ttcttcgagg aacgtggagt caacgaaaaa accggccagg 1620 gtctgttcca tgtgtagcaa tccaagccac cgattgtggg gttgcgggga attcaggaaa 1680 aggagttttg aagatcgtat aaaattcgtg cgacagaaca atctttgccg aacctgcttg 1740 aattttcatc agaaatggcc ctgtaagacc ttcaagggct gcgccgttga aggatgccag 1800 gagaaacacc attctctctt acatccacca gcttcccgtg acccggtaca cctgtctaca 1860 agccattcat cactgcgtac tgataaacgc caactacttc cgtttttccg aatacttccg 1920 gtcactctct ccaaagatag tgtagagttg tccacatttg cgtttattga tgaaggatcc 1980 tcgtccactc tcttggatcg ctcattagct gatgttcttg gactcagcgg cccaatggag 2040 ccgcttacat tacaatggac tggcaacgtc acgcgaaaag agaggaactc cgagtgtgta 2100 caagtgcaca tcgccggcag ggataaatcg caaaagtacc aactaatcga tgccagaaca 2160 gtggacaact tgttgcttcc gaaacaatcg ctgccataca ggcttctcag tgagaaatac 2220 ccgtatctcc gtggtttacc aattgaggat tacgagatgg tggaaccaaa aatcctcata 2280 ggtgtagaca atctcagtct cggaattccg ctaaaaatac gggaaggtgg gcccaaggag 2340 ccaaccgcag cgaagtgcag attaggttgg accatctacg gtggcggtgg aaattcacca 2400 ctgtcatctg tatcagttca cttccatgcc tcggctattc aagatccaga tcgagaattg 2460 aaccaacaac tcagcgacta cttcaccctt gacaaggcag gagctgatct accgtttcct 2520 attccagaat ctgaagcgga gaaaagggca cggagactgt tagaagagac cactcgtcgc 2580 gttggacaac gtttcgagac tggactgtta tggaaagctg atgacgtgaa gtttccggac 2640 agttatccca tggcagttcg gaggctagaa gcattggaaa ggaaacttaa taagcaacca 2700 gttctcaaat cgcgcgttaa gcaactactg gacgaatatg ttgcgaaagg ctactatcat 2760 caagcttctc cagatgaatt ggtagccgcg gaggagagaa atgtttggta tcttccatta 2820 agcatcgtca tcaacccgag aaagccagag aaaattagac tagtctggga cgctgcagca 2880 caggttcatg gagtgtcgtt caattcggcg ctgttaaagg gaccggattt acttacacca 2940 ctagtcgctg tcctgagcca ttttcgagag cataggttcg cggtgagcgg tgacatcaag 3000 gagatgtttc tccagatgct gattcgtgcc gaggatcggc aagctcagcg gtttttgtgg 3060 agaacaaacc ccgacgaagc tccccacatt tacataatcg atgtagcaac gtttggatcc 3120 acttgttccc caagttctgc tcagttcgta aaaaacttaa acgcgcaaga ttacgtgcag 3180 cagttcccga gggctacaat agcaatcgtc aagttccatt atgtggacga ttatcttgat 3240 agcttcgcaa ctgaggagga agccattgaa gtcgtacaag acgttaaata cattcactct 3300 atgggaggat tcgagattcg gaactttctg tcgaattcag aaagagtgct agcggccata 3360 caagaagagc ctcgaagaaa cgaaaaagac ctcaacataa cccgggcgga gaaggttgaa 3420 tctgtattgg gaatgaagtg gagtcccaac gctgacatgt tttcttacgc tctgtccccg 3480 aaggaagaaa tttcaaaaat tttcgagtcg gcatacgcac caacgaaacg tgagatgctc 3540 aaagtagtta tgagcctatt cgacccacta ggattcataa cattcttcct tattcatggt 3600 cgaattttaa ttcaagatgc gtgggcggcc ggtatcggat gggatactcc aatatgcggc 3660 ccattgtatc ataaatggcg gctttggatc agtggtttca ctctgctaaa tgatctgcgc 3720 attccacgca cttattttcg gcatgctgaa aaaggtccag tacagcttca tgtgttcgtc 3780 gatgctagca ccgaagcata tgcttgcgtg gcgtacttca gtgccatggg ccctgcagga 3840 atcgaagtgg cgttgatcgc cgcaaggggc aaggtagcgc caataaaggt tctctccgtc 3900 cctcggcttg aacttaacgc tgccgttttg ggaactcgtc tagcagaatc agtgattgat 3960 tctcatacgt accaaatcga acgtaagctc ttttggagcg actcgaaaac agtcctcgcc 4020 tggatcaact ctgatcaccg aagatatcat cagtttgttg ctgtaaggat cggcgaaata 4080 ctatcgtcaa cccgaatgag cgaatggaga tatgtaccat ctagccaaaa cccagcggat 4140 tttgctacga aatggggcaa tggcccaaac ttcgacatgg aaactcaatg gtttcgtgga 4200 cctgcctttt tgtatcaccc tgaagattcg tggccgaaca cccgttgttt cacgaccgcc 4260 acagagatga gcaatgcaca gcatgtacac tggcggccag ctgagcctgt aattgacgtc 4320 actcgtttta gtaagtggga gagacttcaa cgaacaggag cctatgtact tcgtttcatc 4380 gagaatctca agcgccgaca aaatggggaa gcattattac ttggtatttt gtcgcaatcg 4440 gaactccagc gtgcagagaa ggctttatgg aagcaggccc agaacgaagt ttttgccgaa 4500 gaacgacatc agttgcagag aacacaggga agcgctgagg cgcgacataa cttgctacct 4560 aaaacgagtt ctatctacaa attgtggccg tatttagacg aggatggtat tctgcgtatg 4620 cgtggaagaa ttggcgccgc ctggtatgcg ccatatgaag ctaagtaccc cgttatatta 4680 ccagatacac acctaatctc ttctctcctt gcagactggt tccaccgcct ataccaccac 4740 gcccactcag agacggtagt gaacgaaatg cgtcagcgtt tcgaaattcg taaactgaga 4800 gcgctagtta agagagtagt taaaaactgt ttcgtctgtc acttagccaa agcaattcct 4860 cgtccacctc ccatggcccc tctaccgaaa caacgattga cgccgtttat tagaccattc 4920 acctacgttg gggtagatta tttcgggcct ctactcgtga aggtgggtag atcgcagtca 4980 aaacgttggg tggccttatt cacctgctta actgtgagag ccgtacacct tgaggtggtc 5040 caaagtctgt cgagcgagtc gtgtataatg gcagttaggc gattcatcgc tcgccgcggt 5100 gctcctgctg aaattttcag tgacaatggc acctgctttg taggtgccaa taaacagcta 5160 caacaagaaa tgcaagtaag aaatgatgcc ctcgcgacta cgtttaccaa tacaaacaca 5220 cgttggttat tcaatcctcc aaatgctccc cacatgggcg gggcgtggga gcgtttagta 5280 cgctccgtga aacaagcaat tggaacgatc atcgatgcac cgaggttacc agacgatgaa 5340 acattagcca ccattctttc cgatgccgag tctatgatca actctaggcc attaacttat 5400 gtgccccttg atagtgcaga tgaggagtca cttaccccaa atcacttttt gttgggcagc 5460 tcatcagggg tcaaacagcc accctcagag ccaaaaaact ttcggactaa ccttcggagt 5520 agctggaccc tcgctcaaca cataaccgat acgatgtgga ggaggtggat aaaagagtac 5580 ttgccagtta ttagacgccg tggaaaatgg ctggaagaag tacgagatat tcgggaagga 5640 gatttggtac taatagtggg tggtgcggtg cgaaaccagt atgtgagagc aagagtggaa 5700 aaggtattct tcggacgaga tggacgagtt cgtcaggcac tagtacgaac agctacggga 5760 gtttacaaaa gaccggtagt gagacttgcg ttgcttgatg ttggatcctt tggcgaagct 5820 gatgaggagt cttcagaagc tgcagatcat caccaggctt cacgggaggg ggtt 5874 // ID BEL-5_CQ-I repbase; DNA; INV; 5709 BP. XX AC AAWU01007782; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_CQ_; KW BEL-5_CQ-LTR; BEL-5_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5709 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 163-163 (2011). XX DR GenBank; AAWU01007782; Positions 10465 16173. XX CC Positions [4719-5300] - Integrase core CC 'CTGAC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 401..2194 FT /product="BEL-5_CQ-I_2p" FT /translation="MPPKILRTPAKKEEVDDGEDLETLVYLRDEKLLKLVR FT LKEKLAGLGLHERTATQINVNRRVLDASNAEFNSLHERIVRLDAKKRKEHG FT DKLVEFETLFIEIDTMLEGWKQTLEILAGPSTAPVPAAQRPVVIQQSLPRL FT VPTFDGKYENWKRFKTMFRDVVDRSNETPRMKLYRLEEALIGEAKGTLDEK FT TIEDGNYDGAWELLEERYEDNRRMVDIHIGGLLKVQKLHKESHAELRALVE FT TVVGHVENLKYLRQEFTGVSTQFVVYLLAHALDSDTKKLWEATVKKGELPN FT YEETIKFLKNRVSVLERCDETSEPAAKQRDRNNPRPTNKPSYQKANAATTS FT SNPEIRCEMCGDSHLTFKCGTLAGLTVAQRSEKVRSKNLCLNCLRKGHSWK FT KCTSRYSCGKCRQRHHTLLHDDSEQKFVQKPVNPQNVAPVANQPPVSQATG FT QEPPQSTSATCNHTQTVKTVMLLTALVHLLDDKDRPVPCRLLLDNGSQVNF FT ITEQLAKRLNTRRVAANVPICGIGAVKTYARESMTVQLRSRYNSFTVEVEC FT LIVPKVTGMIPSARSARPTGRFLRTSSWPTRTSTHRTGSTCCSASRCSSSC FT " FT CDS 1849..4410 FT /product="BEL-5_CQ-I_1p" FT /translation="MPTAARQRIAGQFYYGTIGKTVEHTSSSCKRSHLRNR FT CSEDIRPRVDDGPAPIPVQQLHGGSGVSHRPQGNRDDSVGPISTSDWPIPA FT NIQLADPNFHTPDRIDMLLGVSMFFKLLKSGQLELAKNLPELRETYLGWVV FT AGDVGDTVPDAQFSHTATLDDVSEAIERFWRVEDIDSATQVSTEQDECECE FT AFFRATHKRDPTGRYEVRLPFRPVVGNLDNNRSLALRRFLSLEKRLNRDPD FT LKRQYGEFISEYESLGHCKSVEEADDSPGQGRYYMPHHAILRPSSSTTKLR FT VVFDASAKLSPSSVSLNEALQIGGTVQNDLFSILLAFRKHPVAFTADLSKM FT YRQIRVAPADTPFQRIFWRNEPADFIRVLELTTVTYGTASAPFLATRCLVQ FT LCEDEGEKFPLAAKIVRDACYVDDILSGADSPDEAIECLKQLQGLLSRGGF FT PIHKWSSNESAVMDQIPEGEREKLIDLDGLTGGVVKALGLYRSPGDDEFRF FT TANQADADADATKRRVLSEIGKFFDVLGLLSPVIIKAKILMQRVWLAGLSW FT DVLLEGGLASTWEQFQIALPFVRDIRIPRYVIGPGNIALELHGYSDASEVA FT YGAVIYVRSLFPSGKSPQMRLLCSKSKVAPTKQLDIHRLELLGCRLLSKLV FT VKVLAALKLPFRNVVLWSHSQVVLAWLKKPLDQLQSFVRNRVAEIRNETSN FT FIWLYVRTKDNPADLVSRGMFPAELMVCEKWWKGPAYMQSEEYRVEPVEDL FT PDCEIPELRKVKMVTMLTCNSVEFPIFETCSSYRKLQRVMAYVLRFTANCR FT KKNPAERVRQRYLTIPELRAAQDVIVLVIQHDALSKEIQQVAENDTAGRFK FT V" FT CDS 4692..5708 FT /product="BEL-5_CQ-I_4p" FT /translation="MGRLPSCRVTQALPFEEVGVDYAGPIFVKVGTRKPQL FT VKAYFAVFVCMSTKAVHLELVSDLTTEAFLAALQRFVSRRGVPRTIHSDNG FT TNFKGAKAELHELFLLFNQRAFNDQIATYCQPKEITWSFIPPGAPNFGGLW FT EAAVKSTKYHLKRILKNAQLTFEQYATVLAEVEAVLNSRPLFATSADPADP FT QVLTPGHFLIARPLTAIPEPVYEGTPTNRLSKWQHLQLLREQFWRTWKRDY FT LTSLQPRGKDNKEKPNVRPGMVVLLEEKDAPPLQWKMGIIQQTYPGPDGFV FT RTADVKVGGTVVRRPITKLSVLPIEDNEQTADHGAIPAGTSPQPGGR" XX SQ Sequence 5709 BP; 1349 A; 1660 C; 1659 G; 1041 T; 0 other; cttcagtccg ctttttcctg ggccacgacc gggcccgaac attggtcctt cgagccggat 60 ggacttccgg cggtggatgt aaaaaagaag gctcccgccg cgcgtgtgag aggcaaaccg 120 gcctcaaagc cgtggacagt gaccgcgagt gagaagggtg aggcaaaccg gccaaaaacc 180 ccgcaaaaca gtgcgctgga ggaagagaca aaccggtctc ggaatctggg cacaaccacg 240 tggaagaccc gtgaggacga ctggcctaga actttccgtg gaagaagaag gcaaaccggc 300 ctgaaaaaac attccggaag aaaaagtgct tgaaggtgca aaaaatccgc atccggaaga 360 gacaaagtga aagtgaggtt aaataaccta aaagtgtacg atgccgccaa aaatactgcg 420 cacccctgcg aaaaaggaag aggtggacga tggcgaagat ctggagaccc tggtctacct 480 gcgggacgag aaactcctga aactcgtacg cctaaaggag aagctggctg ggctcgggtt 540 gcacgaacgg accgcgaccc agatcaacgt gaaccggaga gtgctcgatg cctccaacgc 600 cgagttcaat tctctccacg agcgcatcgt ccgcctggac gccaagaagc ggaaggagca 660 cggcgataag ctggtggaat tcgaaacgtt gttcatcgag attgacacaa tgctagaagg 720 atggaagcaa accctggaaa tcctggctgg tccatcaacc gcccctgtac ctgctgccca 780 acgacctgtt gtcatccagc agtcccttcc gcggctcgta cccacgttcg acggaaagta 840 cgagaactgg aagcggttca agacgatgtt ccgtgacgtc gtagaccggt cgaacgaaac 900 tcccaggatg aagctgtatc gcctggagga agccctcatt ggtgaggcaa aagggacgct 960 ggacgagaag acaatcgagg acggcaacta cgacggcgcg tgggagttgc tggaggagcg 1020 atatgaagac aaccggagaa tggtggacat ccatatcggt ggactgctga aggtccagaa 1080 gctacacaag gagagccacg cggaactacg agcgctggtg gagaccgtgg tcggccacgt 1140 agaaaacctc aaatacctga ggcaggagtt cactggtgtg tcgacacagt tcgttgtgta 1200 cctgctggcc catgccctgg acagcgatac taaaaaactg tgggaggcca cggtgaagaa 1260 gggcgagctg ccgaactacg aagaaaccat caagttcctg aagaaccgcg tctccgtcct 1320 ggagagatgc gacgagacca gtgaacctgc agcgaagcag cgagaccgaa acaacccgag 1380 gccaaccaac aaaccgtcct accagaaggc caacgctgcg accacgtcgt cgaacccgga 1440 gatccgctgc gagatgtgtg gggactccca cctcaccttc aagtgcggca ccctcgctgg 1500 cctcacggtg gcccaacgca gcgaaaaggt aagaagcaag aacctgtgtc tcaactgcct 1560 ccgcaagggc cacagctgga agaagtgtac gtccaggtat tcctgtggaa agtgccgcca 1620 acgtcaccac acactgctgc acgacgactc ggaacaaaaa ttcgtgcaga aacccgtcaa 1680 cccgcagaac gttgcaccag tagccaacca gccgccagtg tcccaagcca ctggccaaga 1740 accaccacag tctacttcgg caacctgcaa ccacacccag acggtgaaga ccgtcatgct 1800 tctgacagcg ctcgtccacc tgctggacga caaggaccga cctgtgccat gccgactgct 1860 gctcgacaac ggatcgcagg tcaattttat tacggaacaa ttggcaaaac ggttgaacac 1920 acgtcgagta gctgcaaacg ttcccatctg cggaatcggt gcagtgaaga catacgcccg 1980 agagtcgatg acggtccagc tccgatcccg gtacaacagc ttcacggtgg aagtggagtg 2040 tctcatcgtc cccaaggtaa ccgggatgat tccgtcggcc cgatcagcac gtccgactgg 2100 ccgattcctg cgaacatcca gctggccgac ccgaacttcc acacaccgga ccggatcgac 2160 atgctgctcg gcgtctcgat gttcttcaag ctgttgaaat cgggacaact agagttggcg 2220 aagaaccttc cagagctccg cgagacctac ctgggctggg tcgtcgcggg tgatgtcggc 2280 gacaccgttc ctgacgcgca gttcagtcac actgcaacgc tcgatgacgt gagcgaagcc 2340 atcgagagat tctggcgggt cgaggacatc gacagcgcca cacaagtcag cacagaacag 2400 gacgagtgcg agtgcgaggc attcttccgt gcgacacaca agcgggaccc taccggccgg 2460 tacgaggttc gcttgccatt ccgccccgtc gtcggcaacc tcgacaacaa ccgcagcctg 2520 gctcttcggc gattcttgtc gttggagaag cggctcaacc gggacccgga cctgaaacgg 2580 caatacggtg agtttatttc cgagtacgaa tccctggggc attgcaagtc ggttgaagag 2640 gccgacgact cacctggcca aggtcgttac tacatgccgc accatgccat ccttcgccct 2700 tcgagttcca caaccaagct gcgggtagtt ttcgacgctt cggcaaaact ctccccttcg 2760 agtgtctccc tcaacgaagc cttgcagatc gggggaaccg tccaaaacga ccttttctcg 2820 attctgctcg cattccgcaa gcacccggtc gccttcaccg cagatttgtc gaagatgtac 2880 cgccagatcc gggtggcccc tgcggacact cccttccaac gaatcttctg gcggaacgaa 2940 ccggccgact ttatccgcgt gctcgagctc acaaccgtga cgtacgggac ggcatctgca 3000 cctttcctcg cgacccgctg cttggttcaa ctctgcgaag atgaaggaga gaaatttccg 3060 ctggctgcca aaatcgttcg cgacgcttgc tatgtcgatg acattctttc tggggcagac 3120 tcacctgacg aagcgatcga gtgtttgaag caactccaag gcctgctcag ccgtggtggg 3180 tttccgatcc acaaatggag ttcgaacgag tcggcggtca tggatcagat tccggagggt 3240 gagcgggaga aactcatcga tctggacgga ctgaccggtg gcgtggtcaa ggctctgggg 3300 ctctacagga gtccagggga tgacgagttc cggttcactg ccaaccaagc cgacgctgac 3360 gccgatgcca ccaagcggcg tgtgctgtcg gagatcggca agttcttcga cgtgctgggc 3420 ttgctttcac cggtcatcat caaggccaag atcctgatgc agcgagtttg gctcgctggc 3480 ctgtcgtggg atgtgctgct ggaaggaggg ttggcaagta cttgggaaca atttcaaatc 3540 gcacttccct tcgtgcgaga catccggatt ccccgttacg taatcggccc tggaaacatt 3600 gcccttgaac tccacggata cagcgacgcc tccgaggtcg catacggcgc agtgatttac 3660 gtgcgcagcc tgttcccgag cggaaagtcc ccccagatgc ggctactctg cagcaagtcc 3720 aaggtagcac caacgaagca gctcgacatc caccgactcg agctcctcgg gtgcagattg 3780 ctgtccaagc tggtggtcaa ggtcctcgca gcgttgaaac tgccgttccg gaacgtcgtt 3840 ctgtggagcc acagccaagt ggttttggcg tggctcaaga aacccttaga ccagcttcaa 3900 tcgttcgtcc gcaaccgtgt cgccgagata cgaaacgaaa ccagcaactt catctggctt 3960 tacgtgcgaa ccaaggacaa ccctgccgac ttggtgtcac gtggtatgtt tcccgccgaa 4020 ctgatggtgt gcgagaagtg gtggaaagga ccggcgtaca tgcagtccga ggagtaccgg 4080 gtggaacctg tggaagattt gccggactgt gagatccccg agttgcggaa ggtgaagatg 4140 gtgacgatgc ttacctgcaa ttccgtcgag tttccgatct tcgaaacgtg cagctcatac 4200 cgtaagctgc aaagggtgat ggcgtacgtg ctgcgcttca cggccaactg cagaaagaag 4260 aatccggcgg aacgagttcg ccagcgctac ctgaccatcc cggagttgcg ggcggcgcag 4320 gacgtgattg tgttggtgat ccagcacgat gcactgtcca aggaaatcca gcaagttgcc 4380 gagaacgata ccgctggacg attcaaggtt tgaccccgtt cctggacaag ggcctactac 4440 gagtgggtgg cagattgcag cagtcggagt tgccgttcga gacccaacac caacttctcc 4500 tcccgaagca tcgcgttacc aacctgatcg ttcgagcgta tcacgaagag cacctgcacg 4560 cggggccatc ggccttgctt gcgacgctga ggaggcggtt ctggctgatc gacggacggt 4620 cgacagtccg gagcgttaca aggagctgcg tgacgtgctt ccacgcgaag cctcgcggct 4680 cgagccaact gatgggacgg ttgccgtcct gccgtgtgac gcaagccctc ccgtttgaag 4740 aggtcggcgt agattacgcc gggcccattt tcgtcaaggt aggaacccga aagccgcaac 4800 tcgtgaaggc ctacttcgcg gtgttcgtgt gcatgtcgac caaggccgtc cacctggagc 4860 tagtctcgga cctcacgacc gaggcattct tggccgccct ccagcgcttc gtgagccgac 4920 gcggcgtgcc tcgcacgatc cattccgaca acgggacgaa tttcaagggg gccaaggcgg 4980 agttgcacga gctgttcctg ctgttcaacc agcgtgcgtt caacgaccag atagcgacct 5040 actgccaacc gaaggagatc acctggtcgt ttatccctcc gggggcgccg aatttcggtg 5100 gcctctggga ggcggctgta aaaagcacga aataccatct caagcgcatc ctcaagaacg 5160 cacaactcac cttcgagcag tacgccactg ttctcgcgga ggtcgaggcg gtgctaaatt 5220 cgcggccctt gttcgctacg tcggcggacc ccgcggatcc gcaagttctg acgccgggac 5280 acttcctgat tgcgcgaccc cttacggcca tcccggaacc agtctacgag gggacaccca 5340 ccaaccggct gtcgaagtgg cagcacctgc aactcctgcg cgaacagttt tggcgaacct 5400 ggaagcgcga ctacctgacg agcctccagc cgagaggcaa ggacaacaag gagaagccga 5460 acgtgcgtcc gggaatggtg gttttgctgg aagaaaagga cgcgccgcca ctacagtgga 5520 agatgggaat tatccagcaa acttacccgg gaccagatgg attcgtacgc acagcggacg 5580 tgaaggttgg tggaacagtc gtccggcgtc cgatcaccaa gctatccgtc ctcccgattg 5640 aggacaacga gcagactgca gaccacggag cgattcctgc tgggacttct ccccagcccg 5700 gggggagga 5709 // ID Crack-27_AAe repbase; DNA; INV; 5183 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-27_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5183 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1243-1243 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 819..1865 FT /product="Crack-27_AAe_1p" FT /translation="MLNDQTDDAVYCVECRKQENDLSQLITCLYCYSNAHF FT TCRNIIGSAVRKMKKNMYFCTSKCSDIYKKITDMQNNRTPMIDRLSSELQK FT SVVTVVSAQLKEVKAEVNSFVKAIEHSQQFISAKFDEFLNDFLKLKTENDH FT LKSEVEGLRHELSALKISVNKLEFNDDRANKDSLSKNAILLGVPVQENEQV FT SSIVAKVADCIGLDLPGDAIESASRLHGQGSTSNKHVPIRIVFKHKSMKES FT FFSKKKIFGKLSSAVIDQSMTVNGMPTNIVIRDELTPLSLELLREIRMLQK FT KMSLKFVWTGREGAILVKKDENTNTISIKNRNDLENFAKRSVFHLSSNSTL FT STNSLC" FT CDS 1931..4828 FT /product="Crack-27_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MATINNNNFHENIDDLNECYPTDLSSSKHLRILQWNI FT RGLNDFEKFDCMLVMLDQLHFPVDIIVIGETWLKQDNVSLYNIPGYKSIFS FT CRHSSNGGLVVYVKNSISFNVQCNVHLDGFHHIHVEVTLNSRIYDIHTFYR FT PPSFDVNHFLNIFENILDKSNHIRPCFLLGDVNVPMNAYSNNVVMKYRALL FT ESYGFVCSNTFPTRPISSNILDHAICRIDDLPRIKNDTISTEQSDHCIVIT FT SFKLFEEREKLTLTKKVVDHQKLNRDFTAFLQNYGAVQDGESCLLDVVSKY FT KSLNEKYSKILTKNVSAKSQFCPWMNFNLWHLIKMKNNYIKRLKRNPDDNH FT LKALLKHVSNKVSTSKKNCKRIYYDNLLNNTPHSKLWKRINCIFGKSSKAD FT TIVLLDNGAKIYDNQLTCEIFNNFFSNIGQHLANSIQIDATIDPLSYLRRV FT PNSIFLTPATVNEVTLLINELKRKKCGGPDNITADIVKNNSVCLSRILSEV FT FNKILETGCYPKCLKIARVVPVFKSGDASDINNYRPISTLSTFNKIIEKLL FT INRLVPFLNQHNVFYKYQYGFRQGSGTETAILELLDDVIKNIDNKKVVGSL FT FLDLRKAFDTLNHTILLKKLEAYGIRGLANKVLQSYLSDRMQFVSIGDSVS FT SLRPITVGVPQGSNIGPLLFLLYINDLCRLPLKGVPRLFADDTALFYPRFD FT VHMVVKDINDDLKILSKYLASNLLSLNISKTKYMIFHSPRKRIQXHNVVML FT NSCCIEEVKSFKYLGIILDATLNWSEHIEYIEKKTSSLCGVLRKVSYFVPR FT SILQKFYFAHIHSAFNYLIVAWGRACKSRLKKLQTIQNRCLKTILKKPFLF FT PSIQLYSDCSHNILPINGLCDLQTVIFLHDMLHRVNFHHNVQLPVVPHAYP FT TRRPNNLLQSRANTTFGQKRISIIGPTKYNQLPDEIKQISNRRIFKIKLKQ FT HYKLHLFDILH" XX SQ Sequence 5183 BP; 1643 A; 885 C; 901 G; 1753 T; 1 other; actgttctgg tagttggtgc aattactatt agctcaaatt agattcagta atttatagtg 60 attttctcct tttaaagtag tatttcattt tttaaagtgt gaatgaatat cggatctcgt 120 tacataaagt cttgtaatga atgtttgctg tggttataaa aagtaatatc aatttacatt 180 ttgtcaagtg atcatttacg gttcttgtct ttttccaaca atacatcgtt tgtttgagtg 240 tgtgtactgt acggaataaa ctgtggccca atcactcaat tatcgtcata tctgtgttga 300 agggtcttgt ttggacgaca gcttgtgagt gccagttgat aactattgtg tgcggagtat 360 atcccttagt agcacacaac tgtggtttct ctggtctctg gcaagggcca tcgtcgctac 420 atacaactgc cacaaagtag tctgtaggta tgcaagtggt cgctgtgtta ctctgcagaa 480 tttagcgaaa cgggatagaa aggtaccaat accttataaa ctcctctaat agccagtagt 540 tctgtgggat ctcttcctct cttgatgagt agttgagtgg gatcacggtt tactgcatga 600 ttgtttgtgc tgaattttgt ttcactgatg tttgtaagaa caagacttga ccatttgatt 660 tttgtatgtg tgttgtatct tgtgttacct ttttatacca tatgctctgt gatacacatt 720 cattttaatt tgttttgatt ctcgaggttt gtagtagtat ttagtttata tttacaacct 780 ttgtttttgt tttcgagtct caggttctta tctcaaaaat gttgaatgat cagactgatg 840 atgctgttta ttgtgttgag tgtagaaagc aggaaaatga tttgagtcaa ctaattacat 900 gtttgtactg ctattctaat gctcatttca cgtgtcgaaa tataattggc tctgctgttc 960 gtaaaatgaa aaaaaatatg tacttctgta catcaaaatg ttctgatatt tacaaaaaaa 1020 taaccgatat gcaaaacaat cgtacaccaa tgatcgatag actcagttca gagctccaaa 1080 aaagtgtggt cactgttgta tcagctcaac tgaaagaagt aaaagccgag gttaatagtt 1140 ttgtgaaagc cattgagcat tctcagcaat ttatctctgc caaatttgat gaattcctga 1200 atgattttct taaactaaag actgaaaatg atcacttgaa atcagaagtt gaaggtctaa 1260 gacatgagct gtcagctttg aaaatatccg taaataaatt ggaatttaat gatgataggg 1320 ctaataaaga ttcgttgtcc aaaaatgcaa tattgttagg tgtcccagtc caagaaaatg 1380 aacaggtttc cagtattgtt gccaaggttg ctgattgtat tggtttggac ctccctggtg 1440 acgcaattga atccgcttcc cgtttacatg gtcaaggctc cactagtaat aaacatgtgc 1500 caatacggat cgttttcaaa cacaaatcta tgaaagaatc gttcttttct aagaagaaga 1560 tattcggaaa gctttcctca gctgtcattg atcagtcgat gactgtcaat gggatgccta 1620 cgaacattgt tattcgagat gaattaacac ctctttcatt agaactactg cgggaaatac 1680 gaatgcttca gaagaaaatg agtctaaaat ttgtttggac aggtagggaa ggagccatac 1740 tagttaaaaa agatgaaaat accaatacca ttagtatcaa aaatcgcaac gacttggaaa 1800 actttgcaaa gagatcggta ttccacctaa gttctaattc gaccttgtca acaaatagtc 1860 tatgttagtg aaagctttgt ctcttgtgtt tttgtttttg ttattttact tgtcataatc 1920 taatagcaaa atggccacta taaataataa taattttcat gaaaacattg atgatctgaa 1980 tgaatgctac cctaccgatc tgtcgagttc aaaacaccta cgaattcttc agtggaacat 2040 tcgagggtta aacgattttg aaaaatttga ttgtatgctt gtaatgttgg atcaattaca 2100 ctttccagtc gatataattg ttattggaga aacatggctc aaacaagaca atgtatcgtt 2160 atataacatt ccggggtata agtcaatatt ttcttgtagg catagctcca atggtggttt 2220 ggttgtgtac gtcaaaaaca gtatttcttt caatgtccaa tgtaatgtgc atctcgatgg 2280 ttttcatcat atccatgttg aagtaacact caatagtcgt atttacgata ttcacacgtt 2340 ttatcgtcct ccgtccttcg atgtaaatca ttttttaaac atatttgaaa acattctcga 2400 taaatcgaac catataagac cttgtttcct gcttggtgat gttaatgtac ctatgaatgc 2460 ttatagtaac aacgtggtta tgaaatacag ggcgctcttg gaatcgtacg gattcgtttg 2520 ttccaacacc tttcctacta ggccaataag ctcgaacata cttgatcatg caatttgtag 2580 aattgatgat ctccctcgta taaaaaatga cacaatttca actgaacaaa gcgatcattg 2640 catagtaata acatcattca aactattcga agaaagagaa aaactgacac taactaaaaa 2700 ggttgttgat catcagaagc ttaacagaga ctttacagca tttttgcaaa attatggagc 2760 agttcaagat ggtgaatcat gtttgcttga tgtcgtttca aaatacaaat cactcaatga 2820 aaaatactca aaaatattaa ccaaaaatgt cagtgctaag agtcaatttt gtccttggat 2880 gaacttcaat ctttggcatc taattaaaat gaaaaataac tacatcaaac gattaaaaag 2940 aaatccagat gacaatcatc tcaaagcttt acttaaacat gtctcaaata aagttagcac 3000 ttcaaaaaaa aattgcaaaa ggatttacta tgataatctt ttgaataata cacctcactc 3060 aaaattgtgg aaacgcataa attgcatttt cgggaaaagc tctaaagctg ataccattgt 3120 tcttcttgat aatggtgcaa aaatctacga taatcaactt acctgtgaaa tttttaataa 3180 ttttttctcg aacataggcc agcaccttgc taatagtatt caaatagatg ctactattga 3240 tcctttgtct tatttgcgga gagttccaaa ttcaatattt ctaaccccag ctactgtcaa 3300 tgaagttact ttattgataa atgaattgaa acgtaagaaa tgtggtggtc ctgataatat 3360 tactgctgat attgtcaaaa ataattctgt ttgcttatct agaattctat ctgaggtatt 3420 taataaaatt ttagaaactg gctgttatcc aaagtgttta aagattgcaa gggtagtccc 3480 tgttttcaaa tcaggtgatg cgtctgatat aaataattat agacccatct caacgttatc 3540 tacatttaat aaaattatag aaaagttgtt gataaacaga ttagtacctt ttctcaatca 3600 gcacaatgtg ttttataaat accaatatgg gttccggcaa ggaagcggca cagaaacggc 3660 aattttggag cttctagatg atgttatcaa aaatattgac aataagaaag ttgtgggatc 3720 attgttttta gatttacgga aagccttcga tacacttaat cacacaattc ttttaaaaaa 3780 gcttgaagca tacggtatca gagggttggc aaataaagtc ttgcaaagtt atttatcaga 3840 tagaatgcaa tttgtctcta ttggtgactc cgttagctcc ctcagaccaa ttacggtagg 3900 tgtgcctcaa gggagcaaca tcggcccatt gttgttcctc ttgtacatca atgatctttg 3960 tagactccca ttgaaaggtg ttccacgcct atttgctgat gatacagctc tcttctaccc 4020 acgttttgat gtgcatatgg ttgtaaaaga tatcaatgat gacttgaaaa ttttatcaaa 4080 atacttagct tcaaatcttc tgtccctaaa tatatcaaaa actaaatata tgatttttca 4140 ttcacccagg aagcggattc aagmtcacaa cgtagttatg ctcaactctt gctgcattga 4200 agaggtcaaa tcattcaagt atttgggaat aattttagat gccactttga attggtctga 4260 gcatatagaa tatatcgaaa agaaaacatc ttcactgtgt ggtgttttac ggaaagtaag 4320 ctattttgta ccccgcagta ttttacaaaa attctacttt gcgcatatcc attcagcctt 4380 caactattta atagttgctt ggggccgtgc ttgtaaatcg cgtctgaaaa agcttcaaac 4440 catacaaaat aggtgcctta aaactatact taaaaaacca ttcttatttc cttcaattca 4500 gctttattct gattgctctc acaatatctt accaatcaat ggtctttgtg atttacaaac 4560 tgtaatattt ttacacgaca tgcttcatag ggttaatttt catcacaatg ttcaattgcc 4620 agtagttcct cacgcttacc caactaggcg cccaaataat ttgcttcaaa gtcgagctaa 4680 cacaacattt ggccaaaaaa gaatatcgat catcggtcca accaagtaca atcaactacc 4740 ggatgaaatt aagcagattt ccaatcgtcg tattttcaaa ataaaattga aacaacatta 4800 taaactgcat ctttttgata tccttcatta atcttacata tatatttatt ttgttatttt 4860 attttgcacc tactaccagt aacacgttat atttaaattc atttgtataa tttagttttt 4920 agatggctcc cttaaaagga acattttgtt ccactgggat gtcatgccct tcacattcga 4980 atgttattat taattttatt caatatattt ttcctagtta ttttttattt gttgcatctc 5040 agctccgtat taagtttgtg tttttgtatt gctgttcaga tagagctgag atagttgcgt 5100 ccactaccag gaagctctct ctccatgtga gctttttggt gtgggggata gtggcgggta 5160 aaaaaaaaaa aaaaaaaaaa aaa 5183 // ID Copia-45_AA-LTR repbase; DNA; INV; 201 BP. XX AC AAGE02023477; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-45_AA_; KW Copia-45_AA-I; Copia-45_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-201 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023477; Positions 190539 190739. XX SQ Sequence 201 BP; 44 A; 52 C; 36 G; 69 T; 0 other; tgtatgaccc aaccgattcc ccttcattct atttaccttg acataacagc ccttctggca 60 accctgtaac tgttattgat gatgaataaa tatcactact agttgaactt ccaaagcaga 120 aaggtgtttc tcttgttcgc tgctgagaaa gccctcgggt gtgttttttc ctctgagtgt 180 tcttcctcgt gccgctcttc a 201 // ID Mariner-34_HM repbase; DNA; INV; 2130 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-34_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2130 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 392-392 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 82..1602 FT /product="Mariner-34_HM_1p" FT /translation="MARNYKRKLNLKVLYGTSSKEDIKKAIEDHGKGMSIR FT KAADKHKVKKSTLHDHIRKPTLKKVGGQTIISKADEVLIAQMLEAVADWGF FT PIGRLEVRMMAFDFLKNRNGSNSFNLQIPGNDWLNSFMKRNNISGRNASNI FT KRSRSKLGVEMINNFFDEFERTYESLGGIQPSNIFNFDETNLSDDPGKKFV FT LVRRGRRRVENVQEHSKTSISLMWCGSASGVLQPPMVVYRASNVYKGWVTG FT GPQGTIYSCTKSGWFDMETFAKWFEKCFLPNVQNLNGPKLLLGDNLASHFN FT PDVIKLAKEHNVYFAMLPPNATHLMQPLDVSVFGPMKRCWKHVLYEWRKQS FT RKTGCFPKEHFPALLKSLSLKLQDTVCQNLVSGFITCGLYPINRIKVLKRI FT PEYKEPGTSTASLLNDSVTTLLMYHRGACETKKPRGNKLNIASGAVLNVVE FT DIETDLCFICGKDDNDENDEISEEEEEILIDWTGCERCGRWYHSKCLENAM FT STDCIVCPKL*" XX SQ Sequence 2130 BP; 719 A; 319 C; 401 G; 691 T; 0 other; ccgtcaaccg gggtgacttc ggacagcggg gtgactccgg acagagcatt acttattatt 60 ctacttcagt ttcttttagt gatggcgcga aactacaaac gtaaactaaa cttaaaagtg 120 ttgtatggaa caagctctaa agaagatata aaaaaagcta tagaagatca cggtaaaggc 180 atgagtattc gaaaagctgc tgataagcat aaagtcaaaa aatcaacatt gcatgatcac 240 attagaaaac ccaccttaaa gaaggttgga ggccagacaa ttataagcaa agcagatgag 300 gtactcattg ctcaaatgct agaagctgtt gccgattggg gctttcctat tggaagatta 360 gaagtcagaa tgatggcttt tgatttcctc aaaaatcgca atgggtcaaa cagtttcaat 420 ttgcagatac ctggaaatga ctggttaaat tcttttatga aacgaaacaa tatttctggg 480 agaaatgctt caaatattaa acgatcgaga tcaaagttag gagtagaaat gataaataat 540 ttttttgatg aatttgaaag aacttatgaa tctcttggtg gtattcaacc ttcaaacata 600 tttaacttcg atgagacaaa cctttcagat gatcctggaa aaaagtttgt tttagttcga 660 agaggaagac gacgagtgga aaacgttcaa gaacatagta aaacttctat atctcttatg 720 tggtgtggct ctgcttcagg agtactacaa ccacctatgg tagtttatag agcatcgaac 780 gtttataaag gttgggtaac aggagggcca cagggaacaa tatacagttg cacaaaatca 840 ggttggtttg atatggagac atttgccaag tggtttgaaa aatgcttctt gccgaacgtt 900 cagaatttaa atggacccaa gcttctttta ggagataact tggcctctca ttttaatcca 960 gacgttatta aacttgctaa ggagcataat gtttactttg ctatgttgcc accaaatgcc 1020 acacatttaa tgcagccatt agatgtatct gtctttggtc caatgaagcg ttgttggaaa 1080 catgtattat atgagtggag aaaacaaagc agaaaaactg gatgctttcc caaagaacat 1140 tttccagctc tattaaaaag tttatcactt aagttgcaag acacagtgtg tcagaatttg 1200 gtttctggct ttattacatg cggactttat ccaataaacc gtataaaggt tttaaaaaga 1260 attcctgaat acaaggaacc tggtacatca acagcatccc ttttaaacga ttcggttaca 1320 actttgctaa tgtatcatcg tggtgcatgt gaaacaaaaa aacccagggg caataagttg 1380 aacattgcat ctggggcagt tttaaatgtc gttgaagata ttgaaactga tctttgtttc 1440 atctgtggaa aagatgataa tgacgaaaac gatgaaatca gtgaagaaga agaagaaatt 1500 ttaattgatt ggactggttg tgaacgttgt ggtcgttggt atcattcaaa gtgtttggaa 1560 aatgccatgt cgactgattg cattgtatgc ccaaaactat aaattttatt aaattttttt 1620 aaaagataat gatttgtata tgatttactt ttacagttgt caataattta aacaaaataa 1680 gaaagattag taattgattc ttcataatta atcattgtag ttgtatcatt attatatgta 1740 tatttgaaat agtttatcaa tagttgttgt tttttgtttt gttatttgaa ataaaaaatt 1800 aattgcttaa ttaaaaatta aaaaaattta attaagcagc ttttttacat agttatacat 1860 tcattttaaa actcttgatg ttgaaatttc aacttttaaa atctcaagct tcaagtgtcc 1920 ggagtcaccc catttcccga ttttgatttt ttgattttta aaccgttttt ttatgctaat 1980 aacatattgg aaagctgaaa gccaaaattt cttcacgttc tttataatta cagtggtacg 2040 ttccacaaaa atatttccat tttgtattga ttttaatgaa acagtttaat attttcttcg 2100 aaaattgtcc ggagtcaccc cggttgacgg 2130 // ID BEL-52_AA-I repbase; DNA; INV; 6333 BP. XX AC supercont1.337; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-52_AA_; KW BEL-52_AA-LTR; BEL-52_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6333 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.337; Positions 584245 590577. XX CC Positions [5390-5947] - Integrase core CC 'GGATG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 41..6313 FT /product="BEL-52_AA-I_1p" FT /translation="MSSLEMPTHNHCKACNRPDHAEDMVACDTCGTWFHYT FT CAGVQASICNRPWQCGYCSSRGDGTSSSAISVSSGSSSARLRLRQLEESKA FT LDDRLLLQQAERERAFLAEKHQLEAEIQAEKHLRGSTSSFRSRNSGKSRVS FT YRSEVRTWINQSTDTPAKVVNQQLGASTSTPIASSENVVLSVPDKRKVLPT FT PSYDQDVVIQSPLMHHTVVQTTSMLPPKSNLQSLVAPIVSVTTSGPPMPVT FT IATTSHPSAQSSSKQLRNVPHQEQQKQHPSCHNKCQLPPTIQSGPKEYTLP FT GANPLESHVSAHVHSGTHIQAEEIDAQQPRMEQQNYQHFQPCGQQTGYYLT FT STMNQNPSADPVPTCHAQWSRNANVTFVDSQQQFPTTNRIPVTQPIRTSQP FT MRFYQPQATPFQHSSAGATRNSTSHIQSDPGYSTGHTLPGQASSFYRAGEG FT ILSAQQLAARQVVSRELPKFSGDPLEWPMFINAFESTTTMCGIQPDENLSR FT LQKSLIGAAREKVQSILTLPAAIPAIIETLRDECGRPDQLVHCLLVKIRLA FT PPPNVNKLDTLINFGREVRNLVTFIEAANLQDHLSNPMLLTELVGKLPPSL FT RLDWGLHMQRIPLVTLKAFSDYVTSIKAAACKVSLPTDCNHDESRRSKKEK FT GGFINAHNVEEKKSIQTSSPKKEYQAKSEPYVVKPCPACNRSDHKLRSCDK FT FAGFSMDQKQRLVDQMKLCQRCLGSHGKWPCRSKQSCDFDGCKEPHHRLLH FT TSVKPMSTVQAGPSGIISAHCSRQSGVLFKVVPVILSSNGKSVSTFAFLDD FT GSNVTLMEEEVAEELGLSGNISSLCIQWTSSVTRKEPTSRQVELRISSVHG FT GPDYALSEVQTVPRLDLPQQSLDYEQLSKQFPHLKGLPVQSFSNAVPRILI FT GLDNATLKLTLDKRERRNKEPIAAKTRLGWTIFGGGRREPQGTDHVMFHMC FT NCTADETLHQLVNNYFAVETIGVNPCLPLESAEDQRARRIMEQTTKRTDSG FT RFECGLLWKSDNFEFPCSYKMAERRLVCLEKKFAKDPALKMKFVDQIKEYL FT ERGYAHIATGDELQHSDQRRVRYLPLGIVQNPRKPGKVRIVWDAAARVGDV FT SLNSMLLVGPDLLTPLLKVICGFRQRQFVAVGDVRQMFHQLLVKQTDRQAQ FT RFLFRHDPEEQPTVYIMDVVIFEASCSPCLAQHVKNTNAKEYAEAFPEAAS FT AIINCTYVDDFLDSRDTVDETVRIVEEVRWIFDKAGFEIRNWQSNSEEVLR FT RVGVDVDDTAKCFSVEKSTITERVLGITWDPKSDMFVFETQFRKDLYPLLS FT GCIVPTKRQVLRVVMSHFDPLGLVATYTIHGKILIQDVWRSGVKWDEPITT FT EDFEKWQRWVRLIPNLRQVKIPRCYFPNYSLGSYDTLELHVFVDASLMAYC FT ATAYFRIMDNGSPRCALVAAKTKVTPLKPQSIPRSELCAGVIGVRLLRNIQ FT ENHSIPVQKRYLWTDSTTVLAWLRADPRKFRQFVAFRIAEIQSETSVDEWH FT YVPTNLNVADKGTKWGNGPCFDPESPWFSGPEFLHQLESEWPRQSTKFSEP FT QEEIKAVQHHTIVRDPTIDFAKFSSWEDLVKNLSYIQHFVHRCRTSNRKAT FT GTRIAVLEQKDYVAAEASVWRVVQKAEYSDEISILCKNTELTGDERKSLKK FT SSSIGKLSPFLDEKGVLRMRSRINEDSVYYSSEFQNPVIVPRGKHVTNLLI FT HKYHQRYAHANVDTVVNELRQRYYIPKIRFEVKKVVKQCMWCKVYRAKPVA FT PKMGILPHPRVTPYVRPFTFTGLDYFGPLIVKRRRSNEKRWVALFTCLTVR FT AVHVEVVHTLSAESCKMAIRRFISRRGAPQQIFSDNGTNFRGAAREIAEEM FT KAINRELASMFANTEIEWVFNPPSAPHMGGVWERKVRSIKEAFKALHHKQN FT LNDEELLTFLAEAEIIVNSHPLTFVPLEDSADEAITPYNFLLMSSSGANTS FT SRIPVNEEVSLRINWKLMQQLLNQFWKRWIQGCLPTIARRTKWFNDVRPLQ FT VDDPVIIVDESVRNGWLRGRIVKVYTASDGQVRKVDVQTVSGVFQRPAIKV FT ALLDIQKSSKAALG" XX SQ Sequence 6333 BP; 1794 A; 1500 C; 1543 G; 1496 T; 0 other; tcctcgaaaa ttattgcatc caatacgatt ccagtcggac atgagcagtt tagaaatgcc 60 gactcacaac cactgcaagg catgcaatcg gccggaccat gccgaagaca tggttgcatg 120 cgatacttgt ggaacttggt tccactacac ctgtgcggga gtgcaagcct ctatttgcaa 180 taggccatgg caatgcggct actgttccag ccgaggcgat ggtaccagca gctcagcaat 240 atcggtcagt tcaggaagct ccagcgcccg tttgcggttg agacaattag aggaatcgaa 300 agcgcttgat gatcgtcttc tactacagca agctgaacga gaacgagcct ttctcgccga 360 gaagcaccag ttggaagctg aaatccaagc tgaaaagcat ctcagaggca gtacgtcgag 420 ctttagaagc cggaacagtg gcaaaagccg ggtcagctac agaagtgaag tacgaacctg 480 gatcaatcag tcgacagata ctccagcgaa agtggtaaat cagcaattag gcgcgtcgac 540 atcgactccc attgcctcaa gcgagaacgt tgttctgtcg gtaccagaca agcgaaaggt 600 cttaccaaca ccgtcttacg atcaggatgt cgtcattcaa tcgcctctca tgcaccacac 660 cgtcgttcaa accacctcta tgctgccacc aaaaagtaat ttgcaatcat tagtggcgcc 720 catcgtgtca gtaaccacca gcggtccacc gatgccggta acaattgcta ctacaagcca 780 tccatctgcg caatcttcct cgaagcagct tcgtaacgtt cctcatcaag aacagcagaa 840 acaacatcca tcgtgtcaca ataaatgtca acttccaccg acaatacagt caggtccaaa 900 ggagtacacg ttgccaggtg cgaatccatt agaaagtcac gtttctgccc atgttcatag 960 tggaactcat atccaagcgg aggagataga tgcacagcaa cctagaatgg agcagcaaaa 1020 ctatcaacat ttccagccgt gtggccaaca aactggctac tatttaacat caacgatgaa 1080 ccaaaaccct tcggcagatc cagttcctac gtgccacgca cagtggtcgc gaaacgcaaa 1140 cgtgaccttc gtggacagtc agcagcaatt tcctactact aatcgcattc ccgtcacgca 1200 accgatcagg acttctcagc cgatgcgttt ttatcaaccg caagctacac cattccagca 1260 ctcgagcgcc ggtgcaacta gaaacagcac atcgcatatt caatctgatc ctgggtattc 1320 cacgggtcat acactaccag gtcaagcttc gagcttctac cgtgcaggtg aaggaatatt 1380 gagtgcacag caattggcag cacggcaagt ggtatcacgg gagctgccta aattctcagg 1440 tgatccccta gagtggccaa tgtttattaa cgcattcgag tccactacta ctatgtgtgg 1500 aattcagcca gacgagaatc tctctaggct gcaaaagagt ttgataggtg ccgccaggga 1560 gaaagtacaa agtatattga ctttgccggc ggcgatccca gcgataatcg aaacacttcg 1620 cgacgaatgt ggtcgaccgg atcagttagt tcattgcctg ctggtgaaaa ttcgactcgc 1680 gcctcctccc aacgtcaaca agctcgacac gctgataaat tttggcagag aagtcagaaa 1740 cctggtgacc ttcatcgaag ctgctaatct gcaagaccac ctgtcgaatc caatgttgtt 1800 aacagagctg gtagggaaat tacccccaag tcttcggttg gattggggtc ttcatatgca 1860 aagaattccc ctggttacgc tcaaggcatt tagtgattac gtcacctcta ttaaagctgc 1920 cgcttgtaaa gtctcgttgc ccactgattg caatcacgat gaaagtcgac gaagcaagaa 1980 agagaagggt ggcttcatca atgcacacaa cgtcgaggag aagaaaagca tccagacatc 2040 gagtccaaaa aaggagtatc aagctaaatc cgaaccgtac gtcgtcaagc cgtgcccagc 2100 gtgcaataga agcgaccaca agctacgtag ctgcgataaa tttgctggtt tcagtatgga 2160 ccaaaaacaa cgtctcgtgg atcaaatgaa actatgtcaa cgttgcctgg gaagccacgg 2220 aaagtggcct tgcagatcga aacagtcctg cgatttcgac ggttgtaagg aaccccatca 2280 tagactgctt catacatcgg tcaagcccat gagcactgtt caagcaggtc catcgggaat 2340 tatatcagct cattgttctc gccaatccgg cgttctattc aaggtagtcc ccgtgattct 2400 ttccagcaat ggcaaatctg tctctacctt tgcattcttg gacgacggat cgaatgtcac 2460 gttgatggaa gaggaagtcg ctgaggaatt aggtttgagc ggaaatatca gttcgctgtg 2520 tattcagtgg actagcagtg ttaccaggaa ggaaccaacg tcgaggcaag tggagttacg 2580 aatttccagc gtccacggtg gcccagatta tgccctctcc gaagtgcaaa cagtacctcg 2640 ccttgattta ccgcaacaaa gtctggatta cgaacaactc tccaagcagt ttccgcactt 2700 aaaagggctt cctgttcaaa gcttctcgaa tgcagttcct cgaatcctta taggactaga 2760 taatgccaca ctcaagttga cgctagacaa acgcgagcgg cgcaacaaag aacccatagc 2820 cgccaaaacg cgacttgggt ggacaatttt tggaggtgga cgtagagaac cacaaggtac 2880 tgatcacgtg atgttccaca tgtgtaactg tacagcagac gagacacttc accagctggt 2940 caacaattac ttcgctgtag aaacgattgg ggtgaatccg tgtctacctt tagagtcagc 3000 cgaagatcaa cgagctaggc ggattatgga gcagactacc aagcgaactg attccggaag 3060 gttcgaatgc ggacttttat ggaaaagtga caactttgag tttccttgta gctacaaaat 3120 ggctgaacgt cgtttggtct gcttggaaaa gaagtttgct aaggatccag cattaaaaat 3180 gaagtttgta gaccagataa aagagtacct agagcgaggc tatgcacata ttgcaactgg 3240 agacgagcta caacattctg atcaacgtcg tgtgcggtac cttcctttgg gaatcgtcca 3300 gaacccccgc aaacctggga aagttcgcat cgtgtgggat gcagcggcac gtgttggaga 3360 cgtatcgctc aactcgatgt tgctagtcgg gccagatctc ttgactccgt tgttgaaggt 3420 catttgcgga ttccgacaac ggcagtttgt tgcagtaggc gatgttcggc aaatgtttca 3480 tcagttgctg gtcaagcaaa ctgatcgaca ggctcaaaga ttcctttttc gacacgatcc 3540 cgaggaacag cccacagtgt acatcatgga cgttgttatc tttgaagcgt catgttcccc 3600 gtgtttagcg caacacgtta aaaataccaa tgctaaggag tatgcggagg cctttccaga 3660 agcagcgtcg gcgattatca actgcaccta cgtagatgat tttctggata gccgagatac 3720 cgtagatgaa actgtgcgta tagtggaaga agtacgctgg atcttcgata aggccggctt 3780 cgagattagg aactggcagt ccaactccga agaagttctt cgacgggtcg gggttgacgt 3840 cgatgatact gcaaagtgct tttctgtgga aaaatccaca attaccgagc gtgtcttggg 3900 gataacatgg gatcccaaga gcgatatgtt tgtgtttgag acccaattcc gaaaagatct 3960 ttacccgttg ctatctggct gtatcgttcc gactaaaaga caagttctcc gggtagttat 4020 gagtcatttt gacccgcttg gactcgtagc tacttataca atacacggaa aaattctgat 4080 tcaggacgtg tggaggtctg gcgttaaatg ggatgaacct ataacaacgg aggattttga 4140 gaagtggcag cgatgggtga gattaattcc gaacctgcga caggttaaaa ttccgagatg 4200 ctatttcccg aattacagtc taggcagtta cgacactctt gagctccatg tcttcgttga 4260 tgccagctta atggcatact gcgctacagc gtacttcagg atcatggaca atggatcacc 4320 gcgctgtgcc ttggtagcag caaagactaa agttacacct ctaaaacctc agtcgattcc 4380 acgcagcgaa ttgtgtgccg gagtaatcgg agtgagactg cttaggaaca ttcaggagaa 4440 ccactcaata ccagttcaaa aacgctactt gtggactgac tcaacaaccg ttcttgcatg 4500 gttgagagca gatcctcgaa agtttcggca gtttgtagct tttcgaattg cggaaattca 4560 atcggaaact agcgttgatg aatggcacta cgtcccaaca aatctgaacg ttgctgataa 4620 gggcaccaaa tggggaaatg gcccatgttt cgacccggaa agcccatggt tttcaggacc 4680 agaatttcta caccaattgg aaagcgagtg gccgcgtcag tcgaccaagt tttcggagcc 4740 tcaagaagaa atcaaggctg tccaacatca cacgatagtt cgcgatccga ccatcgactt 4800 tgcgaagttc tctagctggg aagatttagt gaagaacctt agctacattc aacactttgt 4860 tcatcgttgc cgaacttcaa accgaaaagc gactggaacc aggatcgcag tattggaaca 4920 gaaggattac gtagcagcag aggcaagtgt ttggcgagtg gtacaaaagg cggaatattc 4980 cgacgaaatc tcaattcttt gtaagaacac tgagcttact ggtgatgaac gcaagtccct 5040 taaaaagagc agttccatcg gaaagttgtc accatttcta gatgaaaaag gtgtactgcg 5100 aatgcgtagt cggattaacg aagattcggt gtactactcg agcgaattcc aaaatccggt 5160 gatagttccc agaggaaaac acgtcaccaa tctgttgatc cacaaatacc atcaacgata 5220 tgcgcacgct aatgtcgata ccgtcgtcaa cgagttacgc cagcggtatt acatacccaa 5280 gattcgtttt gaggtaaaga aggttgtgaa acaatgtatg tggtgcaaag tttatcgggc 5340 gaaaccagtt gcgccaaaga tgggtattct accgcatcca agggtgacac cttacgttcg 5400 cccatttaca ttcacgggcc tagattactt cggaccattg atcgtgaagc gacgtcgcag 5460 caacgaaaaa cgatgggtgg cgctatttac atgtttgacc gtacgagctg tacatgttga 5520 agttgtgcac acgttgtctg cagaatcttg taaaatggca atacggcgtt tcatatcccg 5580 tagaggtgca ccacaacaaa tattcagcga caacggtacc aatttccggg gagcagctcg 5640 cgagattgcc gaagagatga aagctatcaa tcgggagtta gccagcatgt tcgccaatac 5700 agaaattgaa tgggtattca atcccccctc cgctccccac atgggtggcg tgtgggagcg 5760 caaggtgcga tccataaaag aagcgttcaa ggcattgcac cacaaacaga acttgaacga 5820 tgaagaactg ctgacatttt tagcggaagc tgaaataatc gtcaactcgc atcctctgac 5880 atttgtaccg ttggaggatt cagcagacga agctattact ccttataact ttctgttgat 5940 gagttcaagt ggtgccaata cttcatcgag aatacccgta aatgaggaag tttccctaag 6000 aataaattgg aaactaatgc aacaactttt gaaccagttt tggaagcggt ggattcaagg 6060 ctgcctccca actatagcac gccggaccaa gtggttcaac gatgtccgtc cgctgcaagt 6120 tgacgatccg gtaatcatcg tagatgagtc ggttcgtaat ggttggttac gaggtcgtat 6180 tgtgaaggtc tatacggcga gtgatggtca ggtaagaaag gttgacgtcc aaaccgtatc 6240 aggcgtgttc cagagaccag cgatcaaagt agccctactg gacatccaga agagtagtaa 6300 ggccgcttta gggtaaggca ttacgggtcg ggg 6333 // ID CR1-33_BF repbase; DNA; INV; 2486 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-33_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-33_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2486 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2486 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1604-1604 (2009). XX DR [2] (Consensus) XX SQ Sequence 2486 BP; 677 A; 679 C; 532 G; 598 T; 0 other; gtcggggaca ggatcaagac cccagaggac tctaaactac tgtggacagt gctaaacaaa 60 gtaactggga agggaaggac gggtatccct gcactcctaa gccagggaga gacactcgac 120 aaggacattg acaaggctga acttctcaac aacatatttg tacaaatcac aaaggagagt 180 aaccacccgg actacactaa gagacttccg atgttcacaa acaatgaact cagctccata 240 cagctgtcgg tagaggaggt ctacaacgtc ctatctgatc tccgcatcaa caaagctccc 300 ggtccagact gtatacccaa ccgccttctc aaggaagcag cgccagtaat cagttcatcc 360 ctctgcgagt tattcaactt ctcactcgcg accggacgcc ttcctgcaga gtggaaacag 420 tctaacatct caccagtcca taagaagggg gacaagacag acccgtacaa ctacaggccc 480 attgccctgc tgcctacggt cgtcaaagtc ctggagagac tggtgcacaa cagactttac 540 acgtacctca tggacaacgg cttactcaac cctaaacaat ctggcttcaa gaagggtgac 600 gggactgtac tacaattact acgtttactt gatgactggg caaagtccat agatgaccca 660 aatgttgctt gtaccgccgc ggtattcctg gacgtgcgac gcgctttcga cagtgtatgg 720 cacgacggcc ttatctacaa actgtcgcga tacggagtcc aaggcccgct ctccaactgg 780 ttctccgact acttatcggg tcggaaacaa cgtgtgatta tcaacggcgt gggatcttca 840 tgggggtcta caacagctgg cgtcccacag ggaagcattc ttggaccgtt gctgttcttg 900 atctacctga acgacattca agagcttccc tgcaaatcca gcataaactg tttcgctgat 960 gacacctcac tgtataactc tggacgcacg gccttcgagg tcgccaacac aaccaatgcc 1020 gacttgcgcc tcgtctccaa ctggttccac gactgggggc tccagttaca ccccgacaag 1080 tgtaaggtca tgtgcattaa ggcactccaa agcaaagtca aactaccccc catttactta 1140 gcaggtgagc tcgttgaaga ggtgacctgt tacacccacc tagggcttac gttacacttc 1200 tcactacgct ggaaggaaca cgcagaagtg gtctccagca agtcaaagaa agtcctagga 1260 ctgcttagca aactccaaag taagcttcct cgtgaagcca tggagactgc ctacaacacc 1320 ctcgtgcgca cgaagctaga atatgcttcc atccttttaa gcaacatcgg taccactgcc 1380 agcaagtctc ttgaacaagt tcagtatcat gcgggacgtc ttgtctcagg tgcgatggtg 1440 cgcacaccat atgccaaatt actggacgaa ctagagtggg ataccctcgc agccaggaga 1500 gatcacaaca gactactgat tatgtacaaa ctgacatcag gttctgttcc acctcatctc 1560 cagccgttaa tccccaccac taggaacagt caaagacaac taaatgtacg ccttcggaac 1620 gacacacact tgcatgttcc ccactccaga acaaacacat acaagaacag ctttgtcccg 1680 tacaccactc gtttgtggaa cagtctaccg aaggaagtga aggaagccac ctcctttaac 1740 ctgttcaaac ggaagtgcag aaaccacatg ctgtcagccc gtcatcacca gaagtaccgt 1800 aggtttggtg aacgccgcag caacatcctt gctaccagac tccgtcttgg ctggtgtcag 1860 ctgaattcca cgctggccaa gttcaccatc accactcgga gatgcgcctg tggtgcaaca 1920 tcggagacgg ttgcacactt cctccttcat tgctcactgt acacagcggc acgacagtca 1980 ctcacaacag ctgtgcatcg tctcgtggaa cgtcccctct ccactaccct cctactcaac 2040 gggtctccgg gtcacgacga caatacaaac agaagtcttt cgaccgcttt tcatacttac 2100 atcttgtcaa caagacggtt ttagctactt ccagtcgttc cccctgtact ttgctattta 2160 gatatgcgat gtcgcccgtg gaagttccaa cattgtttgt acatagcctt tcctccagat 2220 gacctctgat ctccgaactt gtgaacaaac tatttttgtg atatgaagca tttatgactc 2280 gaattgtata ttgcaaataa gattgtagac tgattttaga ttaacatgtt agatatgttg 2340 ttagtttttg attatgccat tgttgtgcag cgtcatgatt tgtattgtat atgtatgtat 2400 gtggtcacga catcagcata tgctgcctat gtgtccacat tgtattgccc ttgttgcaat 2460 ttgaataaat aaataaataa ataaat 2486 // ID EnSpm-N1_NVi repbase; DNA; INV; 1687 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE EnSpm-type family - consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1687 RA Bao W. and Jurka J.; RT "EnSpm-type families from Nasonia vitripennis."; RL Repbase Reports 9(5), 941-941 (2009). XX DR [1] (Consensus) XX CC TSD is 2-bp long. TIR is ~80 bp (imperfect). XX SQ Sequence 1687 BP; 600 A; 261 C; 267 G; 557 T; 2 other; cccgaatgga aataataacc gtacatttgc aatacatttc agtacgtaac tgttacaatg 60 ctgtacattt gcgtacgttc gcgtacgttt tgagtggaat tcggacattt tttacacaat 120 tgtacagaaa cgtacgcgaa tgtacgtgga tatatgtcgt tgtacagtat cgtgccgaat 180 cctacatttt gagtacattt acaaaaaagg atttagatca gcatgtacgg caaaatatgc 240 aaacgtacgc gaacgtacgc gaatgtacgt gaacgtacgc gaccgtacgt agatgtaggt 300 aaatgtatat tattgaaaaa ttatacggta aacagtacat gtattataat attcacttat 360 tattacattt aaacgataca gttgaaaact ggctaatagt acatttgatt ttgaaaatga 420 aaaacagatt gttactaaaa aaaaattgta ataatcgtat tgcgcagcaa attgtttata 480 gactaatatg gtatatcagc aattaccatt actttatcaa aaacattata tttcagacaa 540 tacaagaaat ataagtaaca atatatgtaa taataaccga ctaaataagt atgaaaaaat 600 tggaaattgt aatttattag gacatggccc tactgaaaaa aagcagatga caatcaactg 660 attattagct aataatcagc tgataatctg ctggtaatcg tacgtgccaa aacgtcagct 720 gattatcagg tgatatcagc ttaatatttt tatttaagct gatagttaca aatatgcgtt 780 tatatgagat aaaatgctaa tgataatcag ctgattatca ggtgataatc atcaacaaat 840 ttttaacctt tagtatattt tttcgaagtt gttttaaatt aataaaattt ataaaattca 900 cataatgagg gccaatacgg tttttccatc cagatgataa caccaaaaca ctatggctgt 960 gagctacaaa atttttattt tttaataatt tgatttaaac aaaaaatttt ttaaatatta 1020 taagttccct taacaaagca atgcttacta aatatgtaaa ctaacttaag atataattat 1080 taaataaatg aatatttttt ataaaacatt taatatgaat aaaatatctc ccgcgagtca 1140 ttaaaaattc ccttgtgtac ttctatataa aaaaatcaga aaattaagaa atattacttt 1200 tctgtacaat tgtgtacaaa catgtataat tgtgtacgtt cgcgtacatt tatgataatt 1260 tgcatacatt cgcgttcrtt cacgtacgtt cgcgtacgtt cacgtatacg ttcgcgtacg 1320 tttacgtacg ttcgcgtacg cttacgtacg cttatgcaca agcgagtatc ttttagaacc 1380 gtaaaatttt acgtataaaa attgtatttt atctctgata atgtataaag ctgtacatat 1440 tcgtataaat gagtatattc cagcaatttt tcgtacggtc acgtacgttg gcgtacgttk 1500 gcgtacgttt gcgcacaatt gcgtacgttt gcgtacgatt acgtataatg ctgtattaaa 1560 tgtccgaatt ccacccaaaa cgtacgcgaa tgtacgtcaa cgtacacaag tctacagtaa 1620 tgtaccgaaa tgtatcacaa tcgtacggtg atttgcgtta taatactgtt gaaaatttcc 1680 atacggg 1687 // ID Kiri-38_AAe repbase; DNA; INV; 4524 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-38_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4524 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 733-733 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 290..1060 FT /product="Kiri-38_AAe_1p" FT /translation="MGPDKHPSVQARPNNNKEYNETNPDTLMKVIHRMMST FT LTESVEAMIESYVTNLDNRIVCIQRRIDALKVDYNASVDKLIEAVNDVRAA FT HFSEMHRLDRLETQQDLVITGIPYHQDEDLQHIYCNIARSIGIAKPKESMV FT TLRRLSKHPVRNSASPPILCHFAFRGLRDEFFNKYLHRRSLVLHQIGFQAS FT DRIYVNERLSPHTRRILKAAIKLRNEGHVHKVSTKKGSVFVIFRKSNNSVL FT VECIDQLLLHKSNLSK" FT CDS 1484..4348 FT /product="Kiri-38_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFFALSGFLPLIMLNDVSSVNASTNALIPRAVLRSAL FT NSKYLNISHINVQSLLSRQLSKFNELKLNFADCKLDIICMTETWLDDSIDN FT RIIAIDDYNVFRNDRNRHGGGICVYVRKNLICRILKSSIIDSSVRTTEFMC FT LEILCGEERFMLAVYYNPPGVDCTNILSQHFDEFTVKYESTFFIGDFNVDL FT LKRNSRQKQFNDILSSMSYVCINSEPTFFHQTGCSLLDLCLTDSPNKVLKH FT DQLSMPGVSHHDMIFLSVKIVSPISSTPIYYRDYVHFDASALQNAFNRINW FT NEYFSYSDPDMLLEFLNDKLLYLHDTYIPLKTCKVKKSPWFTADIERAIIS FT RNIAYNNWLRDKTTVNCSLYKRIRNRVTMMIRNAKEDNDKRVINTNLPSKQ FT LWNNVRKLGVANDRAHESFCDASSNEINCYFSSNFTSDSRPMHNFQVNSHG FT FVFHEVEDFEIVNAVFDIKSNAVGLDNISIKFIKIILPLAVSVFRHLFNKI FT IATSIFPVKWKQIKVIPIKKKSNCSDVSNLRPISLLCSLSKVFEKLLKYQM FT CDFINRMDFLNQNQSGFRKKHSTNTALLKVHDDVAQAVDKKGVAILLLIDF FT AKAFDRVSHRKLLNKLSSNFIFSNTAVTLIKSYLTGRTQTVFHNMEFSSYV FT EIKSGVPQGSILGPLLFSLFINDLPSVLEYCSVHLFADDVQIYLCSDKTID FT VDDMSRKINSDLQKLLEWAKRNLLMINPTKTKALLINRSRSTIRTPDLFLN FT SEKVAFVDQASNLGLIFTSNLSWDAQVNQQCRKVYYALKQLNLTTRHLDIQ FT TKIKLFKALILPHFIYCDFVYSNASMAAMNKLRLALNACVRYVYCLSRFSR FT VSHFHKILLGCSFWRFLEYRICLTFYKIINSETPNYLFSKITRTRLPRTMN FT FNIPQHTTIYYGQSFFVRSIVHWNSLPISLKSCTSLVGFKRELLSRFANLN FT " XX SQ Sequence 4524 BP; 1395 A; 848 C; 796 G; 1481 T; 4 other; tcamagttcc tgaagggatg agtgcaatcg agtgatagtt tttgttggtg cagtttcaag 60 tggcggctac tcattactat gtgcctgaaa attccaacaa tatagcgatt caagctgctg 120 atgatcgcct tcttcgtggt gtagtgaaaa ccctctgcta tatatcgaat taktcgaatg 180 gttttaaatt gattttctat cccttkgcac atccaaataa tcatcccttc agaagtgttc 240 aaccatttgc cgtttcacta aactcacgct gtcatctttt gtttgcacca tgggcccaga 300 caaacatcca tcggttcaag ctcgcccaaa caataacaag gaatacaacg aaacgaatcc 360 ggataccttg atgaaagtga tccatcgtat gatgtccact ttaacggaga gtgtggaagc 420 aatgattgaa tcatacgtga caaatctcga taaccgtatt gtatgtattc aacgacgtat 480 cgatgctctg aaggttgact ataacgctag tgtggacaaa ctaatcgagg cggtgaatga 540 tgttcgtgct gcacattttt ccgaaatgca tcgacttgac cggttggaga cgcagcaaga 600 tctcgtcatc acaggcattc cataccacca agatgaggat cttcagcata tatattgcaa 660 tattgctcgc agcattggta tcgcgaaacc gaaagaatcg atggttacac tacggcggct 720 ctccaaacat cctgtacgca atagtgcttc gccaccaatt ctctgccatt ttgccttccg 780 tgggttgaga gacgagttct tcaataagta cttacatcgc agatctttgg tgctacatca 840 aattggcttc caagcgagtg atcgaatcta cgtgaatgaa agactgtccc cacatactag 900 gcgaatattg aaggctgcca tcaagcttag gaacgaagga cacgtacata aggtttctac 960 gaagaaagga tcggtttttg taattttccg gaagtcaaac aactccgtcc tagtggaatg 1020 cattgatcaa ttgctgctgc ataagtcaaa cctatccaaa tagttctctc tttccttcat 1080 ttttgatcct tgactctatt ccccatattg tcctctgcct ccttccgttc ctgaaagtta 1140 tttgtcttga acatcttaag tacctttccc agttttcaca ctattatcct ttgattccta 1200 tcccattaca tcgtccatca ctccttccta aaagcaaagg ggggaggggg ctgctgttgt 1260 tgctgtttga ttactgctgc tgttgctgtt gactgtcttc tgctggttct gatggtgctc 1320 ggtgtttgct gctacacaat tgttgttgga attgatctct tttacawcgt gccatttgtc 1380 tgcatcaaag taatttaatt tcttttgata gtaatgttat ttgtaatagt taggttaggg 1440 ttcctgatat atcatacttt ttcatgttgt gcgggtttgg aagatgtttt tcgctttatc 1500 cggctttctt ccgttgataa tgctaaatga cgtaagtagt gtcaatgctt ccacaaatgc 1560 tcttatacct agggctgttc taaggtctgc tttaaattca aagtatttga atatcagcca 1620 tattaatgtg cagagtctgt tatcacgaca actttcaaag tttaatgaac ttaaattaaa 1680 ttttgctgac tgtaagttag acataatctg tatgacagaa acatggctag atgattctat 1740 tgataatcga attattgcca ttgacgatta taatgtattc agaaatgatc gtaatagaca 1800 tggtggcggt atttgcgttt atgttcgcaa gaacttaata tgtcggattt tgaaatcatc 1860 gatcattgat agcagtgtca gaacaactga attcatgtgt ttagaaatat tatgtggcga 1920 agaacgcttt atgctcgcag tgtattacaa tcctcctggc gttgattgca caaacatatt 1980 atcacagcat tttgatgaat tcactgttaa atatgaatcc acttttttca ttggtgactt 2040 caacgttgat ttgctaaaga gaaatagtag acaaaaacaa tttaatgata ttttatccag 2100 catgtcttat gtgtgtataa acagtgaacc tacattcttt catcaaacag gatgctcttt 2160 gcttgatctt tgtttaacgg attcaccgaa taaagttctc aaacatgatc aattatcaat 2220 gcctggtgta tcacatcacg atatgatatt cttatccgta aaaatagtgt ctcctatttc 2280 aagtacacca atttactatc gtgactatgt tcattttgat gcttctgctc tacaaaatgc 2340 ctttaacaga ataaactgga atgaatattt ctcatacagt gatcctgaca tgcttctaga 2400 gtttctaaat gataaattac tgtatttaca cgatacatac ataccattga aaacatgcaa 2460 agttaagaaa agcccgtggt ttacagctga tattgaacgt gctattattt caagaaacat 2520 tgcctataat aactggttaa gggacaagac tacagtaaac tgttcgcttt ataaacgtat 2580 ccgaaataga gtaactatga tgatcagaaa cgccaaagaa gataacgata aacgtgtcat 2640 aaacactaat ctaccaagta aacaattatg gaataacgta agaaaactgg gagtggcaaa 2700 tgacagagca cacgaaagtt tttgtgatgc atcttcgaat gagattaatt gttatttctc 2760 ttctaatttc acctcagata gtcgacctat gcataacttc caagtaaatt cacatggttt 2820 cgtttttcat gaagtagagg atttcgagat tgttaatgca gtatttgaca ttaaatcgaa 2880 tgccgtgggg ctagataata tttcgatcaa atttattaag ataattttac cactagcagt 2940 ttcagttttt agacatcttt ttaacaaaat tattgctact tcaatatttc ctgtaaaatg 3000 gaaacagata aaggttatac caattaagaa aaaaagcaat tgctccgatg tttctaatct 3060 cagaccgatc agtttactgt gctccctttc caaagttttc gaaaaactgt taaaatatca 3120 aatgtgtgat ttcataaata ggatggattt tctcaatcaa aatcaatcgg gttttcggaa 3180 aaagcacagc acaaatacag ctttactaaa agtgcacgac gatgttgccc aagcagtcga 3240 caaaaaagga gttgcaatat tattattaat tgattttgcg aaagcctttg acagagtgtc 3300 acatcgaaaa ctgctcaaca aattatcatc taacttcata ttttccaata ctgctgtcac 3360 actgatcaaa tcttatctta ccggacgcac gcagactgtc ttccacaata tggaattttc 3420 ttcatatgtt gaaattaaat ctggagttcc tcaaggttcg attcttgggc cattactatt 3480 ttctctattt attaatgatt taccttcagt tttagagtat tgttcagttc atcttttcgc 3540 agacgatgta caaatttatc tttgcagtga taaaaccatt gatgttgatg atatgtcaag 3600 aaaaattaat tccgaccttc aaaaactact ggaatgggct aaacgaaatc tcttgatgat 3660 aaatccaact aaaacaaaag cattgctaat taatagatcg cgttctacaa taagaacccc 3720 agacttattt ctaaacagtg aaaaggttgc atttgtagat caagcttcaa atttaggatt 3780 gattttcact tcaaacctgt cgtgggatgc tcaagttaat cagcagtgta gaaaagtata 3840 ttacgcatta aaacagttaa acttaacaac caggcacctg gacatccaaa cgaagattaa 3900 gctcttcaaa gcgcttatac tgccacattt catctattgt gactttgttt acagtaatgc 3960 atctatggct gcaatgaaca agttgcgttt ggctcttaat gcatgtgtgc gctacgttta 4020 ttgtttgtca cgattttcaa gagtatcgca cttccacaaa atattgctcg gatgttcttt 4080 ctggcgattt ttggaatata gaatatgtct gactttttac aaaataatta attctgaaac 4140 accaaactat ttattttcta aaattactcg tacacgctta ccaaggacca tgaacttcaa 4200 tatacctcaa cacactacta tatattatgg gcaatctttc tttgtgcgaa gcattgtaca 4260 ctggaattcg ttaccaatta gtctcaaatc gtgtacatca ttagttggat ttaagcggga 4320 acttctttcc agattcgcca atttgaatta gatattatat ggaaccttca ggaactagat 4380 attgtaatgt taagttaagt tactgaattg aaaattttct tcttcttctt tgaaatcatt 4440 ttgtggcaac aaaaaagact aggtcttacg tcgcaagaat tgtgaaataa taaataaata 4500 aataaataaa ataaataaaa taaa 4524 // ID Gypsy-126_AA-LTR repbase; DNA; INV; 802 BP. XX AC AAGE02025415; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-126_AA_; KW Gypsy-126_AA-I; Gypsy-126_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-802 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025415; Positions 1 802. XX SQ Sequence 802 BP; 265 A; 163 C; 185 G; 189 T; 0 other; tgtgcaaatg gcgaggttga gcggcaaaac cgctcactgt tgaaagtact caagataagc 60 caacagcgaa gtacgaattt agaagaagct cttcaagagt atctttatat gtactcagta 120 acccccacat agcgtcactg gagttccacc cgcaaccctg atgttcggac gaagatttag 180 ggacctattt ccccatgtgc aagatgaagt aacgttcgat gatgaaatgc gagacaagga 240 tgcaactgtc aaataccgag caaaagagta ccgtgataaa agagtcggag ctaaagaaac 300 atcagtcaat gtgggaagcg aagttctcat gaagaacatg cagcctacaa ataaactatc 360 gccaactttt ctgccagtac cagcaacggt tgtggagaaa actggtagta tggcaactgt 420 gcagacaaac accggacagc agttcaagcg aaacacatct catcttaagg tttatcatcc 480 aactacacca aagacggctg actcattgga agtgacaaac cgtaaggaac cgattgaagg 540 caccgagcct atgatcgaac atcaagaccg acctcgtaga agtgtatcgg tgccgaagcg 600 ttttgatgac tactgtttga aatagttaaa tttcttttgt tactgttttt aacattttca 660 tattgaaaaa gagagatatg ttgtggtgta aaactgaatc ctggcagcac tggatgcggg 720 actaccaaca tgtaggggaa aggatctgga gtaataaaaa cgtactggta aagttcacac 780 gtgtagtgtt tgattcccaa ca 802 // ID Gypsy-3_CQ-I repbase; DNA; INV; 4550 BP. XX AC AAWU01007771; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_CQ_; KW Gypsy-3_CQ-LTR; Gypsy-3_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4550 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 385-385 (2011). XX DR GenBank; AAWU01007771; Positions 8323 12872. XX CC Positions [3454-3972] - Integrase core CC 'CAAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 208..4515 FT /product="Gypsy-3_CQ-I_1p" FT /translation="MDADQFAQFMGKFQEMIGSLKDKQASAPAAAAARGPA FT ANSLAGNSAAASIPLPPPLELEGDMEENYNFFEGNWKTYASAIGMDGWPDT FT QNKQKASILLSVVGKDALKKYFNFELTAAERADPALVLTAIKAKVVRERNK FT FVDWFDFFSLSQEPTESIDNYLCRLKSLAKVCKFNALEDEMVKYKLATSIR FT WMKLRSKLITTKNLTEANAMDLCRAEEIAERHPVTAGQPSAEVNVVKKSKM FT KCKFCGAKHDFTKGACPALGKKCNRCGGKNHFEKVCKADRRKKLKKKKFRV FT KKVQEETSSESESTEDSGDEEEESTESESVSIGKIVDKSGSGGNVSAELEL FT LLDQKWRLVQCELDTGANTSLVGRNWLEEMTESSNVNLQPSSYRLQGFGGS FT SIPVVGQVKIPCRRKGRKYNLVLQVVDVPHGPLLSANVCRLLGFVKFCNTV FT NFVAPKTDQELLNIYRVKAQDIVHQHEGVFQGIGKFAGAVSSEVRPDVPPC FT IQPPRRIPIAMRGKLKDELKNLEREGLIVKETQHTDWVSNIVLVKRKEQKS FT ESIRICLDPIPLNKALKRPHLQFTTIDEILPELGKAKVFSTADARKGYWHV FT VLDDESSRLTTFWTPFGRYRWIRLPFGIAPAPDIFQMKLQGAIQGLRGVEC FT IADDLLIYGTGDTLLEALEDHNRCLESLLVRLEECNVKLNLEKLKLVEKSV FT KFYGHVLTDKGIHPDESKIAAIKNFPQPTDRKQLQRFIGMVNYLSRFIPNL FT SANFTVLRRLISEKEPWIWSEREEEEFKRVKQLVADTRTLQYYNVNEPIVV FT ECDASSFGLGAAIFQSRGVIGYASRTLTATEKNYAQIEKELLAILFACVRF FT DQLIVGNPKTTVKTDHKPLVTVFKKPLLSAPRRLQHMLLNLQRYRPSIEFV FT TGKENVVADAISRAPFDERQADDRFDKRDIYKVFREVEEVKLSSFLKVKDE FT QLNEIMEESAADASMQLIVKYTLEGWPTSVDKVLDSAKMFFKYRNELSTQD FT GIVYRNDRIVVPHSLRRKLTEKVHVSHNGIEATLKLARANLFWPGMSAQIK FT EAVAQCGICAKFCPSQQHPPMQSHPIPVYPFQLVSMDVFFADYQGKSCRFL FT VTVDHFSDFFEVDVLKDLTPKSTIAVCKSNFSRHGKPQRVVTDNGTNFVNR FT EWQQFAREWSFEHTTSAPHHQQANGKAEAAVKIAKRLMKKADEAGSDFWYA FT LLQWRNVPNKVGTSPAARLFSRNTRCGVPTSANNLKPKLVSGVPEAIEERR FT QKAKQHYDRKARKLPQLETGSPVLVQLNSETNKRWTPGTVSSKLNDRSYVV FT EANGTQYRRDLVNLKPRKEPQTPEAIPVQATVPNVEMPPIERSNREPSFRS FT APTEALPDQELSNSVPDVGVPEATTKPKAGRTPRAAKTPKPSRSPSVPAGG FT EKFSRTKRDVKLPARFKDFYLE" XX SQ Sequence 4550 BP; 1256 A; 1084 C; 1264 G; 946 T; 0 other; tggtgtcaga agcactcgtg gtgttccggc gtcattgaaa atcgcggaag tgtagtgaga 60 aaaactcatc ccaagcgacg aaaagatcat ttgcaacagt aattttgaac agttttgcag 120 tgaaacgtgt tttatcctgt tggcagccat atttttttcg ctctcgcgtt gggttgagtg 180 ttttccggaa gtgaaaagtg cagaaaaatg gatgcagatc agttcgccca gtttatgggc 240 aagttccaag agatgatcgg ttctctgaag gataagcagg cgagcgcgcc ggcagcagca 300 gcggcacgtg ggcctgcggc gaattcgctg gccggaaatt cggcggcggc gtccattcct 360 cttccgcccc cgctggagtt ggagggcgat atggaggaga attacaactt cttcgaagga 420 aactggaaga cctatgcaag tgcgattggc atggacggtt ggcccgatac gcagaacaag 480 caaaaagcca gcattttgct ttcggtggtg ggaaaagacg ctctgaaaaa atattttaac 540 tttgagctga ccgcggccga gcgagccgat ccagcattag tgttgacggc aatcaaagca 600 aaagtggttc gtgaacggaa caagtttgtg gactggttcg attttttctc gctcagtcaa 660 gaaccaacgg agagcattga caactacctg tgtcgactca aatcgctcgc gaaagtgtgc 720 aagttcaatg cgttggagga cgaaatggtg aaatacaagt tggccacctc gatccgttgg 780 atgaagctca ggtcgaagtt gatcaccacc aagaacttga cggaagcaaa tgctatggac 840 ttgtgtcggg ccgaagagat cgctgagcgg catccggtta ctgcgggtca gccgagcgcg 900 gaagtgaatg tagtgaagaa gagcaaaatg aagtgcaagt tctgcggtgc caagcatgac 960 ttcaccaagg gtgcgtgccc tgcgcttggc aagaagtgca accggtgcgg tggaaagaac 1020 cactttgaga aagtgtgcaa agctgaccga agaaagaagc tgaagaagaa gaagttccga 1080 gtgaagaaag tgcaagaaga aaccagctcg gagagtgaat cgacggaaga cagtggtgac 1140 gaagaagagg agtctactga gagtgaaagt gtgtccatcg gaaaaattgt cgacaagtcc 1200 ggcagcggcg gaaatgtctc ggcggaactg gagttgctcc tggaccaaaa gtggcggctg 1260 gtacagtgtg agctagacac cggcgccaat accagccttg tgggacgaaa ttggttggaa 1320 gagatgaccg aaagtagcaa cgtcaaccta caaccgtcat cgtaccggct gcaaggattc 1380 ggcggaagca gcattccggt ggttggccag gtgaaaattc cgtgtcgcag aaaaggccgt 1440 aagtacaacc ttgttttaca agtcgttgat gtcccacacg gcccgctcct ctcggcaaac 1500 gtgtgccgtc tcctgggttt cgtcaaattc tgcaacacgg tcaactttgt tgcgcctaaa 1560 accgaccaag aactcctcaa catctaccgt gtgaaagccc aggacattgt ccaccagcat 1620 gagggcgtct tccaaggtat tggcaagttt gctggtgctg tgtcgtcgga agtccgccca 1680 gatgtaccac cgtgcatcca acctccgcga cggattccga tagcgatgag aggtaagctg 1740 aaggacgagt tgaaaaactt ggagcgggaa ggcctgatcg tgaaggagac tcagcacacc 1800 gactgggtca gcaacatcgt gctggtgaag aggaaggagc agaaatccga atccatccgc 1860 atctgcctcg acccgattcc gttgaacaaa gcgctgaaac gaccccatct tcaatttacc 1920 acaattgatg aaatcctgcc agagctgggg aaggcgaaag ttttttcaac ggcagatgct 1980 cggaaagggt actggcacgt ggtattggac gacgaaagca gccggttgac gacattctgg 2040 acaccgtttg gtcgctaccg gtggatccgc ttacccttcg gaatcgctcc agccccagac 2100 atcttccaga tgaagctcca aggtgcgatt caaggactgc gtggcgtcga atgtatcgcc 2160 gatgatttgc tgatctacgg aactggtgac acgttgctgg aagcgctgga ggaccacaat 2220 cgctgcttgg agagtttgtt ggttcgtctc gaggaatgca acgtgaaact caacctggag 2280 aagctgaagc tggtcgagaa gtcggtgaaa ttctatggac acgtgcttac cgacaaagga 2340 atacatccag acgagagcaa aatcgcggca atcaagaact tcccacaacc cacggaccga 2400 aagcaactgc aaagattcat cggaatggtg aactacctca gtcgtttcat acccaatcta 2460 agcgccaatt ttaccgttct acgacgactg atttcggaaa aggagccttg gatctggtcg 2520 gagcgtgagg aagaggagtt caagcgtgtc aagcagttgg ttgcggacac gcgtaccctg 2580 cagtactaca acgtgaatga gccgatcgta gtcgagtgtg acgctagctc ttttggtttg 2640 ggcgcagcaa tcttccagag tcgaggagtc atcggatacg catctcgtac actcacggca 2700 accgaaaaga actatgcgca aatcgagaag gagctgctcg ctattctttt cgcgtgtgtc 2760 cggttcgatc agcttattgt tgggaatccg aaaacgacgg tcaaaaccga ccacaagccg 2820 ctagtcaccg tgttcaagaa gccgttactc tcagcacccc gccgtcttca acacatgttg 2880 ctgaatctac agcgctaccg tccgtctatc gagtttgtga ccggcaaaga gaacgtggtg 2940 gcagatgcga tttcccgagc gccgtttgat gaacgccaag ctgacgacag attcgacaag 3000 cgagacatct acaaggtctt ccgggaggtc gaggaggtga agctctccag tttcctgaaa 3060 gtcaaagatg agcagttgaa cgagattatg gaagaatcgg cggccgacgc atcaatgcaa 3120 cttattgtga agtacaccct ggagggatgg ccgacatccg tggacaaggt gctggacagc 3180 gctaagatgt tcttcaagta caggaatgaa ctcagcactc aggatggtat cgtgtaccgc 3240 aacgacagga ttgttgtccc tcactcgctt cgacggaagt tgacggaaaa ggtccatgtc 3300 agccacaacg ggatagaagc aaccctcaag ctggcacgag caaacctatt ctggcctgga 3360 atgagtgctc aaatcaagga agcggtcgca cagtgcggca tttgtgccaa attctgtccg 3420 tctcagcagc atccaccgat gcagagtcat ccgatacctg tgtacccgtt ccagctcgtg 3480 tccatggacg tgttcttcgc ggactaccaa gggaagagtt gcagatttct cgttaccgtc 3540 gaccacttct cagacttctt tgaggtggat gtgctgaagg atttgacacc aaagtcaacg 3600 attgctgtgt gcaaatcaaa tttctcacga cacggaaaac cgcagcgagt cgtaacggat 3660 aacggcacaa attttgtcaa ccgtgaatgg cagcagttcg cacgtgagtg gagcttcgag 3720 cacacgacat ccgcaccgca tcaccaacaa gctaacggga aggcggaggc agcggtcaaa 3780 attgcgaagc ggttgatgaa gaaggcggat gaagctggaa gtgacttttg gtatgcctta 3840 ctacaatgga ggaatgtacc gaacaaggtg ggcaccagcc ccgctgcacg cctgttttct 3900 cgtaacacac ggtgcggcgt tccaacctct gccaacaatc tgaagccaaa gttggtatcg 3960 ggagtaccgg aagccattga agaaaggagg cagaaggcga agcagcatta cgaccggaaa 4020 gcaagaaagt tgcctcagtt ggaaacagga tctccggtgc tcgtgcagct gaactcggaa 4080 acaaacaaac gttggacacc gggtactgta agcagcaagc tgaacgatcg gtcgtacgtc 4140 gtggaggcaa acggtacgca atatcgacga gatttggtga acttgaaacc acgtaaagaa 4200 cctcaaacgc cagaagccat tcctgttcaa gcaactgttc ccaacgtcga aatgccacca 4260 atcgaaagat ccaacagaga gccttcgttc agatcagcgc ctaccgaagc tttgccggat 4320 caagaactga gcaacagtgt tccagatgta ggagtaccag aagcgaccac gaagccgaaa 4380 gcaggtagaa ctccaagagc agcgaaaacc ccgaaaccat cgagatcacc cagcgttcca 4440 gcaggaggag agaagttcag ccgaaccaag agagatgtaa aactaccagc ccgtttcaaa 4500 gatttctatc ttgagtagct ttattttttt tttatataaa acagagagga 4550 // ID Gypsy9-NVi_LTR repbase; DNA; INV; 210 BP. XX AC AAZX01007771; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9-NV; KW Gypsy9-NVi_I; Gypsy9-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-210 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1140-1140 (2007). XX DR Genome; AAZX01007771; Positions 13679 13888. XX SQ Sequence 210 BP; 53 A; 57 C; 53 G; 47 T; 0 other; tgttatgtat gtgcgaacta gtactaagta gcgacacatg atggtgctac gtaccagcgc 60 ccaccgaagg gactgccgcg aggcgcgagg cccgcgtgcg aggagctctg acctgatcga 120 gcgctcatct gtgtaacaac tcaacgtgtc gatcaataaa gagaatatcc agaaggcatt 180 cttcttcctt accttcctcc ggatactaca 210 // ID Gypsy-148_AA-LTR repbase; DNA; INV; 1951 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-148_AA_; KW Gypsy-148_AA-I; Gypsy-148_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1951 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1022-1022 (2011). XX DR [2] (Consensus) XX SQ Sequence 1951 BP; 610 A; 370 C; 454 G; 513 T; 4 other; tgaaacggat ttacaaccgt ctcattcaga accctatcca aattgaattt gcgcctgcaa 60 aataggcaac cctaatgtaa aagtgctcga atcgatagtt gttttaaaag tattaaaatt 120 ttggttaaag tccagtcgga ggatagtgtg tgttggttaa gtagtaccag gaaagtaaat 180 ttttaatatg aaataggtaa tcaagaaatt gaaccggaaa cctaaatgaa aagaaaaaaa 240 gaaattataa ggggtcctgg aacggagatc aaatctactc gatctttccc gttcagacca 300 ccggactgag aagatcgtta taaagtgaat atttcaaaag aagctaattm aaagtgtttg 360 ctaagtttgc tcaaagtatg aagttctaaa tctaattaaa caaactttaa tgtgaaatgt 420 tttcctaaag ctattttcag agtgaagttt tgctaaaatt ccattattaa aaaaaatcgt 480 gagtctagtt gtggaagaaa ttctatcagg gtgagcacat gttaattgag ttttaattaa 540 gtgatatcta attacaattt aaacaaaaaa ataggaaaat caatcgtgag tcattaatag 600 ccagtaaaaa tacggagaag aaaagcctag ctgaattgtg aggtaaattc aagaagtggt 660 attggtggta tttttggtgc taattaagtt tggatttcac agtcaccagc agtgaccaca 720 ccgccattgg gtggttagtg gaggtgtata gtgtggagga gcgaatcgtc gctagtgcat 780 gcgagtccat caccgccagc acagaaacca atagcaccta tcttcaacac gtgtgctaag 840 gatcagacga ggggcgatcc cctggggata tttttcgcct cgtggctccg ggccactaaa 900 gatataccca gagagcgtgg ccaacggtgt aaggagtccg gcctcgcgag caccacccgt 960 caccacgtgg ttgcggagat ttgcgagacg tctgagtgag gaaagtctgc caccgccgga 1020 gtgaatcgcc cttgactcac gtcccgggaa aaacccatgg taatcaggcc gtctgatccg 1080 tgttccatcc accccatcat cgggatccat ctcgtcgtgg tgttgtttac gtcgaccgtc 1140 gcgggatcga gcttcgcccg agctcaattc gtggacgacg cccatacgac gaccccgaac 1200 ctcgtgctgc cgtgagattg aagcagtacg atcctccaca gtttcatcgg ggcctaccgt 1260 cccgactctc gaaagtgatc gtacaagaac cgacggccgg cggccgttat ccgccaggta 1320 acccacgagt ctacggagag ctgtaagtac caaaaccata ccacgacagt agtgaaccga 1380 tttgatacca ccactaataa cagtaacggc cggtgggatt gtaacagtag tgaaaacgag 1440 agtaattcgt ttcgactcac agagaaccga taaatagatt tgcaagtgac gtcactgata 1500 ggattgtttt taataaattg tcgcaagctc aaggaaatgt tgcagagatg agagatagct 1560 aggaaagaag tgttgcatgc aaccaaatac aaaattcaaa ctgcaaagta gcttttgaaa 1620 ctaaattatc ttttttttct aataataaat agattttcga gaacaattta attcttctga 1680 tcggcttcaa waattttaac ttaattttat gaataaactt tgggaatgtt taaaagaact 1740 tttagtgaat agwtttgaat tttgcgttgg gaaggaaaac caattttggg aaggaagctt 1800 gggagatttg ggaggtaaaa aaggaggtgg attcttcaaa gggctagata ttgagttagc 1860 tgggcaatca aacagtgaat tctggcagac gacttacgtc gtctggcgtt waagctagaa 1920 ttttggaggt ccttgcagga tcctccttac a 1951 // ID Gypsy-21-LTR_NVi repbase; DNA; INV; 931 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-21-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-931 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 780-780 (2009). XX DR [1] (Consensus) XX SQ Sequence 931 BP; 209 A; 287 C; 268 G; 167 T; 0 other; tgtgacgagc gaacgcgaat tcgtatttcg cgcgctcgcc gcgtcgcgcg accatttcat 60 gcgtgagggc gacgttgcca gttcggcggt actcgggaag cgcgcgcgga atcagcgcgc 120 gagacagata ggcaaaagcg ctagtacagt caagacgtgt gagactttta ccggtacatc 180 acccgcgatt atccgtgagt acgagacgag agagagcagc gacgacccga gaccatccgc 240 tggaggagca cacccgagcc accatcacct cgagcacccg gggtacccca gcacgtcact 300 gcttcagcca ggagggacgc ctcccggcca tccaggactt ttccccgtcg gcagccgacg 360 gatacgcggc gaccgagcaa ccaacgcgcg cgtcgaaact gtaagttcgt aacattagct 420 attgcgcgat ttatagatca ccgaatcggc tgcagacgcc gaggccgagg ctcgctgcca 480 ccgagtcgtc ggctcgtctc cagtacgagc cgcgagtgcg aactcgtcct ctcgcgacac 540 gcgttacatt gtatcaacat cttgagtgaa tatagacatc ttacaattgg tacacgaatg 600 catgactatt tcttccgtat acctgttgcg tccttccgac gaacccggta gagtaaagcc 660 ggttacatcg cggcgcggct acagagcgcg cgcctggccg gagactctat acgcgaagaa 720 agcgcggcga gcttcgcacg ccgcgcagtt gtacaccgag gcaacccggc tgcatagtac 780 ggagtcggca ttgcaccgac ggctaacgtc cgtcgccgac ttcagcatca tcgagggata 840 gccgtgcgga gataaaagag tgagtaatcc ccctcgttga cgcgacagca tcttcgcggc 900 tgtcgcgcta aagcccagtt gccccgtgac a 931 // ID TVSAT1 repbase; DNA; INV; 174 BP. XX AC J03989; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.vivax satellite repeat DNA. XX KW SAT; Satellite; Simple Repeat; Satellite repetitive element; KW TVSAT1. XX OS Trypanosoma vivax OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Duttonella. XX RN [1] RP 1-174 RA Dickin K.S. and Gibson C.W.; RT "Hybridization with a repetitive DNA probe reveals the presence RT of small chromosomes in Trypanosoma vivax."; RL Mol. Biochem. Parasitol 33, 135-142 (1989). XX DR GenBank; J03989; Positions 1 174. XX SQ Sequence 174 BP; 19 A; 72 C; 40 G; 43 T; 0 other; ccttcttcag gttggtgttc tggtggcctg ttgcccccca cccgctccca gaccatatgg 60 tcctgagtgc tccatgtgcc acgttggcac gctccactgt ctagcgtgac gcgatggccc 120 gtgcactgtc ccgcacccct tccccactcc ctcttgcacc tctcgctccg gcca 174 // ID Gypsy-20_RP-LTR repbase; DNA; INV; 147 BP. XX AC ACPB02043182; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_RP_; KW Gypsy-20_RP-I; Gypsy-20_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-147 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02043182; Positions 1846 1700. XX SQ Sequence 147 BP; 45 A; 14 C; 43 G; 45 T; 0 other; tgtagtgttg ggattagata gttggcagct cagcttggag gtaacagtga ggaggttggg 60 atagagagct aggagatata catgttgcgc aataaggtta aataaagtaa ttgtacttac 120 aagtgttttt atttgcatga cagtaca 147 // ID Copia-32_AA-LTR repbase; DNA; INV; 207 BP. XX AC supercont1.117; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-32_AA_; KW Copia-32_AA-I; Copia-32_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.117; Positions 2443332 2443538. XX SQ Sequence 207 BP; 61 A; 44 C; 40 G; 62 T; 0 other; tgaaaacgaa gtaggcggcg actttttaaa tcgatcgcct ctccacctcg tcgagtgaaa 60 aaaaaaaaat taattgaatt cagtctgtac caaccacgtg acgcgaacgg tcacttttaa 120 attcgtttaa ataaattatt ctcaagtcag ttgtactccg tgcctttttc gtgttccgga 180 taaccgagat cctttaggtt atgggca 207 // ID MuDR2x_SM repbase; DNA; INV; 1966 BP. XX AC . XX DT 26-OCT-2007 (Rel. 12.1, Created) DT 26-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; KW Autonomous DNA transposon; MuDR2x_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1966 RA Jurka J.; RT "MuDR-type element from Schmidtea mediterranea."; RL Repbase Reports 7(10), 1090-1090 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 283..1641 FT /product="MuDR2x_SM_1p" FT /translation="MEFSTTNRGNSVLIYSGYEYLKFRTNNGVVTWRCRQN FT HQEKCRSFLKTTIDSSTIVTPPTRHSHDSCPQKAMANVAKAQMKEAIKEVG FT ATARNVLGNVLVNVNSDILGHLPKKSSIVRSLNQQKQSDRIANPTTHAFTI FT PEKYSAMILHDTGFDDPDRILVIGNRELLLELNKETIYGDGTFDKCPSMFY FT QLYTWHALVGTSYPPCIYFLLQKKNRETYIRMFEIMKQLLTNLSPRKVLVD FT FEKACMTAAAVAFPDAEVKGCYFHLCQSLVRKVNCVGLKNLYESNVDVKLM FT LKSLCALAFVPANEVRSAFDILARTFPDEDPYNAVLSYFFQTYIEGAIGRD FT PMFPVRIWNQWEAAAEKSPKTTNCCEGFHNALNATFHCSHPSIWLLFDGLQ FT RDIACHRLTLANYQTGRPEKKKRKYESLHQAVATAVQEYDSHADDESKLKY FT LRRMANLQ" XX SQ Sequence 1966 BP; 612 A; 389 C; 385 G; 580 T; 0 other; gaatattccg caaatgaatt ttccgcaccg gaataatgcg cacaagaata ttccacacag 60 gaataatccg cacagaaatt tccgcacaga aatttccgca catgaagatt ccgcacagtc 120 atttttccgc acaggaaaat tcggcaaatc taaatgaggt caagcttggg tcagacttag 180 gtcaaccgca ctttcttatc agttcagtta gttgtttgcg accaaactgt caacaacact 240 tttaagctcc aaattttcct tcttattttc accctatttg caatggaatt ttcaacaaca 300 aacagaggaa attcagttct catttattcc ggctatgagt atttgaaatt tcgcacaaac 360 aacggggtcg tgacttggag atgccggcag aatcatcaag aaaagtgccg ttctttctta 420 aaaaccacca ttgattcctc cacaattgtg actcctccga ctcggcattc tcatgactcg 480 tgtccccaga aagccatggc gaacgttgca aaagcacaaa tgaaagaggc cattaaagaa 540 gttggtgcca ctgccagaaa tgttctgggt aatgttttag tcaacgtgaa cagtgacatt 600 ctgggtcatc taccaaaaaa gtcatccatt gtgagaagcc taaatcagca gaaacagtcc 660 gatcgcattg ccaatcctac tactcatgcc ttcactattc ctgagaaata ttctgcgatg 720 attttgcacg acactggctt tgatgatcct gaccgaattt tggtaattgg caatagggaa 780 ctattacttg agctaaacaa agaaacaata tatggtgatg gaacatttga taaatgcccc 840 agcatgtttt accagctata tacctggcac gccctggtag gcacatcgta tcctccgtgt 900 atttattttc tcttacaaaa gaagaacaga gaaacataca tccgaatgtt cgaaataatg 960 aagcaacttt taacaaattt atcgcctaga aaggtcttgg ttgatttcga gaaggcttgt 1020 atgacagcgg ctgcagttgc tttccccgat gccgaagtaa agggatgcta ctttcatctc 1080 tgccaaagtc tcgttaggaa agtaaattgc gtagggttga agaatctgta tgagagcaat 1140 gttgatgtga agttaatgtt gaaatctctt tgtgccttgg cattcgttcc tgcaaacgaa 1200 gttaggtctg ccttcgatat tcttgccaga accttccccg acgaagatcc atacaatgcg 1260 gttctatcat atttttttca aacttatatt gaaggtgcca ttggtagaga tccaatgttc 1320 cctgtaagaa tttggaatca atgggaggcg gcagctgaga aatcaccaaa aacaacaaac 1380 tgttgtgagg gctttcacaa cgctctcaat gccacttttc actgtagtca tccaagtatt 1440 tggctattgt ttgatggact acaaagggac atcgcttgtc atcgattgac tttggcaaat 1500 taccagacag gaagaccaga aaaaaagaag agaaaatatg aatcattgca tcaggcagtt 1560 gcaacggctg tgcaagaata tgactcacat gcagatgatg aaagtaaact aaagtattta 1620 cgcagaatgg ctaatctgca atagaaatta ataaatgtgt ttacattttc ctgtagatat 1680 atacatgtac gtacttgtaa ttttgatgat agatatgtat tcatgtaatt acgcaactgg 1740 ttagtttcaa ataatttttt gtcttgtctc aacgaaatca caataaagct aatagtttaa 1800 ataatgaagt attaaaaaat attatcctgt gaaaagtaac catatgtatt tgcggaattt 1860 tcctgtgcgg aatattcttg tgcggaaatt tctttgcgga ttattcctgt gtggaatatt 1920 cttatgcgta ttattccagt gcgaaaaatt cctgtgcgga atattc 1966 // ID LMRP1 repbase; DNA; INV; 214 BP. XX AC L42505; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 2.02, Last updated, Version 3) XX DE Leishmania major DNA repeat. XX KW LMRP1; Repetitive element. XX NM LMRP1. XX OS Leishmania major OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania major species complex. XX RN [1] RP 1-214 RA Piarroux R., Fontes M., Perasso R., Gambarelli F., Joblet C., RA Dumon H. and Quilici M.; RT "Phylogenetic relationships between Old World Leishmania strains RT revealed by analysis of a repetitive DNA sequence."; RL Mol Biochem Parasitol 73(1-2), 249-252 (1995). XX DR GenBank; L42505; Positions 1 214. XX SQ Sequence 214 BP; 52 A; 67 C; 67 G; 28 T; 0 other; gcaagaatca agaggccgtg ccagagatgg gcgaaggggg acggtgggag cgtgaaagag 60 acgacgggca cgtggcgacg cccgcggaaa gaaagaaagc agaagacgcg tattccattg 120 tgctgatgtg tgccagcctc tctgccacag atcacgagct cagctccact ccaccctaac 180 gccccctcgc cgcacggccc tgtcacaggc tccc 214 // ID Gypsy-73_AA-I repbase; DNA; INV; 3958 BP. XX AC supercont1.150; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-73_AA_; KW Gypsy-73_AA-LTR; Gypsy-73_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3958 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.150; Positions 1616449 1620406. XX CC Positions [3069-3581] - Integrase core CC 'GCAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1803..3941 FT /product="Gypsy-73_AA-I_1p" FT /translation="MQNGLCNASQTFQRYMHRLFGDLDFVIIFIDDICVAS FT SSEEEHRQHVQIVFERLKDNGLVINVDKCRFAKKQVKFLGYLLSSDGILPL FT PDRVEAICRYELPTTVKQLRRFLALLNGYKRFVARASDLQAELLQMIPGNK FT KNDGRRLQWTKTGKESFELCKKSLSEAALLCYPDPTKPLGLMIDASNTAAG FT AVLQQLTAAGWRPLGFYSQKFSQSQQSYSTFGRELTAMKMAVKYFRHLVEG FT RQFTIYTDHHPLTNALSSKTSRLPHEERYLQFISQFTTDIRHISGKDNVVA FT DALSRVESIAQLNIDVSGFSVDQTNDMELQRLLRSSSLKLEKRNCVGCSTP FT IYCDVSIQGKIRPYVPVQQRQRVLHTLHGLSHSGIRATRKLVADCFVWSAM FT NKDVHTFVKHCIECQKSKVTRHTRSPLSNFDLPKKRFQHVHVDIVGPLPTS FT NGFRYLLTMVDRFTRWPEAIPLTDILAETVARVFVSSWVSRFGVPQQITTD FT QGRQFESVLFNELNRIMGVQHLRTTAYHPQSNGLVERFHRTLKSALMCNDP FT KHWADKLPLVLLGIRSALKSEFNCSVAELVYGQQLVVPGEFFDAQIKDVNH FT CDYSQQLHQIFDQLSAKKVNHHGKPKVFLQPALTDTKFVFVRTDAVKKSLQ FT RPYEGPYLIIKRHEKFFDLLISGKQQRISIDRIKPAFVLVEDHDQDTDSKT FT KVTPSGHRIRFLV" XX SQ Sequence 3958 BP; 1091 A; 916 C; 909 G; 1042 T; 0 other; ttggtgaccc cgacggttcc gacagttatc gtttcgaagt tttatcgcga aaccggtcgt 60 aaaatcgttt ccgcagtcga aaaattgttt ccactcattc caccgaaagc agccatattt 120 gttgaacatg acggaggaac gtaacctcgc tgatgcagcg gctgctgctg ttactactgc 180 ttcggtgtcc gtgaagcttc cagatttctg gaagagcgac ccgtctatgt ggtttgccca 240 ggcggaagcc cagttcgcgc ttgctggtgt tgtgaaggat gagacgaaat actactacat 300 catcagcaag atagaccagt ctgtcatctg ccatgttgca gatttaatac aaaacccgcc 360 ggctaacgac aagtataaac aggagaagga tcggctgata tctcgcttcg aaatttccgc 420 acagggcaag ctggaatgtc tgttgaatgc ctgtgacctc ggggatatgc ggccaacaca 480 cctactagca cgtatgcaag agctagcagc aggactcaac atcagcgatg acgtcatgaa 540 ggttttgttt ctacagcgga tgccggataa aatcaagcca attttgtcga tcagtgacgg 600 cacgattgct aagctggcgg agatggccga taagatggtc gagataaccc cccacgtcgt 660 ggcagcagta tctacatccg ttgtgccagg aatagaaatg agcagtttgc aagagcaaat 720 tgcttgcctg acagctgaaa ttcgtcgaat gaaaacagca gctccaagaa gtcgttccgt 780 ctcacgatca cgacgcagtt ccagttcagg tccaaccatc tgctggtacc atcggaaatt 840 cggatcgaat gctcaccagt gtcgtgagcc ttgctcgtac aacgcttcaa aaaactaagc 900 gaacatccat ctgaaacggc ggaggtggat gttttctctg gaagtcgccg tttacatatt 960 tatgacagaa ctagtggctt tcgttttttg attgatacgg gatcggactt ttccattatt 1020 ccggcaacgg ctaaggatcg tcgtaaaccg cctacataat ttcgactgca tgcagctaat 1080 ggcactacaa ttaaaaccta cgagtctcgt cttgttgcca ctgatttggg cttacgtcgg 1140 cggttctgct ggaatttcct cgttgcagat gtgaaaacag cgataatcgg tgcagatttt 1200 ttgtcattct tcggtgtgct ggttgatctt cagcatcgac aattgaccga cgctaaaaca 1260 aaactgcatt ctttcggcgg attgacggcc accgatatct atggcgtaac aaccattggt 1320 atgagtcatc catataagga tttgctgctt caatatcgtg aaataaccct accatcaact 1380 atgcgaacag ctgttccaca acgagaggtc aaacaccata tcgtaacgaa aggcccgcca 1440 gttgcgttca aaccacgaag attggctcct gacaagttag atgcagcaaa aaaagagttt 1500 cagttcagtt cgaattggga atttgtcgcc cgtctagtag ctgctgggct agtccgctgc 1560 actgtgtgcc aaaaaagaat ggacaatgga gattcgttgg tgattaccgt gcattaaaca 1620 aggtaaccac tccagatcga tatcctgtcc cacacattca tgatttattg aatgcttttc 1680 agggaaaaag gatcttcact acaatcgacc tggagcgagc gtatcaccag ataccagtcc 1740 acgaagacga cattgaaaaa acggcggtaa ttactccatt cggtttgttc gagtttttga 1800 cgatgcaaaa tggactctgc aacgccagcc aaacctttca gcgctacatg catcggttgt 1860 tcggagattt ggatttcgtg atcatcttca ttgacgacat ctgcgttgcg tcatcgtcag 1920 aagaggagca ccggcagcat gttcaaatcg tgttcgaacg gctgaaggac aatggcctgg 1980 taatcaacgt agacaagtgt agattcgcca agaaacaggt gaagtttttg ggatacttgc 2040 tgagtagcga tggaattctt cccttgccgg atcgtgtaga agcaatttgc cgttacgagc 2100 taccgaccac agtgaagcag ctacgtcgtt tcctcgcatt attaaatggt tataaacggt 2160 tcgttgcacg agcaagcgac cttcaagctg aattgcttca aatgattcct ggaaataaga 2220 aaaacgatgg cagacgattg caatggacca aaacgggcaa agaatccttc gagctgtgca 2280 aaaaatctct atcggaagca gccctactgt gttaccctga tccaactaag ccgcttggac 2340 tgatgatcga cgcatcgaac actgcagcgg gtgccgtact ccagcagctg acagcagcag 2400 gttggcgtcc tcttggattc tattcgcaaa agttttcgca atctcaacag tcatattcaa 2460 cattcggaag ggagttaacg gccatgaaaa tggctgtaaa gtacttccgt catctagtag 2520 aaggacgcca attcaccatt tacaccgatc atcacccact gaccaacgct ttatcttcaa 2580 aaacatctcg tttaccacat gaggaacgct atttgcagtt tatttctcag tttacgactg 2640 acattcgcca tattagtggc aaagacaacg tggtggccga cgctttgtct cgggtggaat 2700 cgattgcgca actcaatatt gatgtcagcg gtttttctgt ggatcaaaca aatgacatgg 2760 agcttcaacg tttactgcga tcatcatccc ttaaactgga aaagcgaaat tgtgtgggtt 2820 gttcaactcc gatttactgc gatgtttcaa ttcaaggtaa gattcgtcct tacgttcctg 2880 tgcaacaacg tcaacgtgtc ctgcatactc tgcacggatt atcgcactca ggtatccgcg 2940 cgacccgtaa gttagtagcc gattgttttg tatggagtgc tatgaataag gacgttcaca 3000 cattcgtcaa gcattgcatt gaatgtcaga aatcgaaagt gacacgacat actagatcac 3060 cgttgagtaa ttttgatttg cccaaaaagc gtttccaaca cgtacatgtg gatattgttg 3120 gtccgttacc tacctctaat ggatttcggt acctactaac tatggtcgat cgatttacac 3180 gatggccaga agcaattcct ttaacggata tcctagctga aactgttgct cgtgttttcg 3240 tttcttcctg ggtatctcgt ttcggtgtac cacaacaaat aacaacagat caagggaggc 3300 agtttgaatc tgtactcttt aatgaactaa atcggatcat gggtgtacaa catcttcgaa 3360 cgaccgccta tcacccacag tccaacgggt tggttgagcg cttccatcgt acactaaaaa 3420 gtgcgctgat gtgtaacgat ccgaaacatt gggcagataa acttccacta gtattgctag 3480 gaatcagatc ggcattgaaa tctgagttca actgttctgt tgcggagctt gtatacggac 3540 aacagttggt cgttcctgga gaatttttcg acgcgcaaat aaaggatgtc aatcactgcg 3600 attattctca acagttacat cagatcttcg atcagttgag tgctaaaaaa gtcaaccacc 3660 atggtaagcc aaaagtgttt ctgcaacctg cattaactga cactaagttc gttttcgtac 3720 gaaccgacgc tgttaagaaa tcactgcaac gaccgtatga aggaccctac ctgatcatca 3780 agcgacatga gaaatttttc gacttgttga tttccggaaa gcagcaacga atttccatcg 3840 atcgcataaa acctgcgttt gtgctagtcg aggatcatga tcaagataca gatagcaaga 3900 caaaagtcac tccatcagga caccgtatcc gatttctggt gtgactggag gggagtac 3958 // ID Nimb-1_AAe repbase; DNA; INV; 6101 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Nimb non-LTR retrotransposon from Aedes aegypti. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; Nimb-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6101 RA Kojima K.K. and Jurka J.; RT "Nimb clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1428-1428 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 719..2092 FT /product="Nimb-1_AAe_1p" FT /translation="MANMATAKNHPPDPPPEGIPGPSYVNHNDRTVPEWMD FT RLGMHGQRIILSLRPAGNTPLPNPWIIGKSIENASGRSKLESAETEDKGTK FT YILKTRNAEQAKKLMQMTELIDGTKVEIILHPFLNSCRCVVSCREVIDMTE FT SELLNELTPQGISGVRRISRMDGNQKINTPTLILTVCGTVAPKRIFFGPLS FT APTRLFYPRPMICFNCCEFGHTRVKCQKMPACTNCSGNAHVEGQQCNQVPY FT CKNCQGDHAPVSRKCPWYAKEEKIIKIKVEAGVSFGEARKEYEKLHGKESY FT AAVSGAQARIEKIQKDNERDNEIRTLKDEIVKLREANNKISNDEKDREIAS FT LRKEIEGLRKVIEGMSQNRVVLGKQRFNLDISETEGLASETEPQNTEPEAM FT EDEDIYELRGSKEQNGRKRGKNVESESDNSKSSDKFEEKQGNQSKSNNTKR FT GRPPKHRKNDQRK" FT CDS 1905..5993 FT /product="Nimb-1_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MKISTSFVVAKSKTEESEEKMLNRKVTIVNPAINSRK FT SKAINRNRTTPNEEDHQNTGKMINENKTCISNHINTDPCRNNDYEHGNLME FT SRSNLETAVGNIEGEIETQPGPKHQHNSCEPSIISWNIQGLKTGRPEIKVI FT NNERQPLAYCLQETMCSTEDQARIRGFVTFHRKRTGGRRASGGVLIAVKVG FT LDCQEIDVESEIEAVAVKVGHPLNTSILNVYIPPGKPIEESEIVNLINQVQ FT APLLVVGDVNANHPLWGSPSSNARGRVLEAAFNECNLAILNNGEATHIHHA FT NGNASCIDIACSTSDLAGSLTFEVIADSHGSDHLPLIITFPATERPIKSRK FT RWKIDEADWQTYQNMVHFKNLPNVDDQIGEIARAILDAAESSIPKTKGIKS FT GKTPVPWWNREVADAIKNRRKALRNLKSKGKKHSNKDELLKKFQEARKTAK FT ETVSRAQHNSWQNFVNGFSVHTPTKEMWSNFQRVQGKRTNSRIFSIKADGN FT TVTEDGEIADALAQAFQTVSSNNSYPKDFQTYKTTVENQPLQYPENNDAEY FT NKPFSVYELEQALEGLRGSTPGEDQIHYKMISMLPYDCKTTLLKAYNQLWG FT ESKYPDSWRQSTVVPIYKGKGDRSNANNYRPIFLNSCLGKVMERLINNRLI FT HILESRELLSKYQYAFRKGKTTVDHLTEMEMIIREAFVKKEYAYAVFLDVT FT KAYDTTWRRLVLNRLSEWKIGGKLLKFLETMLHRRNFKVLANGHLSSERPM FT QTGLCQGSVLSVTLFLIAIDTITSIMPSGIRILLYADDIVILATGINSKAV FT QDQLQHALDLIQEWQGKTGFNISPGKSVLVVFRKHRKRKPKTKITLSLNNQ FT IIPQKKYHKCLGVIFDETLQFDEHVEEVKAACKQRIQILRAVAGRSWGADR FT TTLVKLYRATTVEKILYAAPIVSACNSNTLKKLETVHNAGLRTICGAFRTS FT PILSLHVETGIPSINTLLKQRTAIHAVKNTDVCSTETINSSVGSSCCTTSS FT EESSGEMWGTNTVEFPETAQLRGQRVIDEMQLDLPPVSRYTTPKIPPWERI FT KIPLDNGILNAAREGIPQVAMRNLFTSIKSTKYRIYNVIYTDGSKRENRCG FT YSVVCDEHIIRRRIYGNSSIYAAECAALKEAFQWIVNNEHVGAYLICTDSL FT SAVSTLEKHKIKNKWHDEMIQLYRSIRETGNDIKIMWIPSHIGIIGNEKAD FT CEAKRALGDPYETVTSIDYKEIKTVIKNQIILQWQALWSATRDNKLREVKP FT SVKPYSSAFIGNRKDDVVLARIRIGHTKITHQYLMNKEPPPNCQFCNALLT FT VRHLVTDCPVLDEERQKLNISTSLQNVLEDNLDKAKNVIQYLKNIDMYKQI FT " XX SQ Sequence 6101 BP; 2160 A; 1234 C; 1316 G; 1391 T; 0 other; cagtttcgag ttacatctgt gatcgaaaag cagaagccga ttcgtatcta atttttgaag 60 cctttttacg acgtttttag aatacgagaa atcgatctat agtgttcggt cgttataaaa 120 acatatttgc gtgttttgaa cagtggagta taattaaaat tgcagctaat tggtgcattt 180 taaacggatt tagagttgcc gaatataacc aacgccattg aaaacagttt tgcttgacaa 240 acaaaaccta cagtcgataa agctaccccg gtacaaagcc gactagcaat agcaagcttg 300 tgtactattt tgcacacaag atttgctacg ggagttgctc actgctaggc gcaatcgaaa 360 gctctaccat ctcagcacac tacccgcttg tagtggagca ataactgttt cgagcagcgg 420 tagaggacaa attttgctag ctggaccttt ggacgtttag ctagcaaagg tctaggaaaa 480 acgtcccttt cacggttgga ccttcaaacg cttgatcgtg ttagtatagg ataaagcgta 540 gccgccatcc acttcggtgg agtcgcaaaa cttgttgcct ggaccttaac ccttgggcaa 600 tcaagtaaca gcaggtaaaa gggtagtcgt tacttgccta ggcaagagcg tgaagtattt 660 cttcgaccgg ctcatcaaca ctcacagaca tctgacgcac gcaacgtgat tcaggctgat 720 ggccaacatg gccacagcaa aaaatcatcc cccggacccc cctcctgaag gtatacctgg 780 ccctagctac gttaaccaca atgatcgaac agttcctgaa tggatggatc gattgggaat 840 gcacgggcaa agaataattc tatcgctacg cccggccgga aacacacccc taccaaaccc 900 atggataata ggaaaatcta tcgagaatgc tagtggacgc agcaaactag aatctgcaga 960 gaccgaagat aaagggacca agtacatcct caaaacaaga aatgcagaac aagctaaaaa 1020 actgatgcaa atgacagaat tgatcgacgg aacaaaagtg gagatcattt tgcacccttt 1080 tttgaactcg tgtcgatgtg tagtaagctg tcgtgaagtc atagatatga ctgaaagtga 1140 attgctgaac gaacttactc cacaagggat aagcggtgtc agacgaattt cacgcatgga 1200 cggaaatcaa aaaatcaata ctccaacttt gattttgact gtttgcggaa ctgttgcacc 1260 gaagcgcatt ttctttggac cattgagtgc tccaacaagg cttttctatc cgcgtcctat 1320 gatatgcttc aattgttgtg agtttggtca cactagagtt aagtgccaaa aaatgcctgc 1380 atgcaccaac tgctctggaa acgcacacgt agaaggacag caatgcaacc aagtacctta 1440 ctgcaagaac tgtcaaggag atcacgcgcc agttagccga aagtgcccgt ggtacgcgaa 1500 ggaagagaaa atcattaaaa tcaaggtcga agctggggta tctttcggtg aggctcgtaa 1560 agagtacgag aaactgcacg gaaaggaatc atatgctgct gttagtggcg ctcaagctag 1620 aatagaaaag attcaaaagg ataatgaaag agacaacgaa atcagaacac taaaggacga 1680 aattgtcaag ctacgggaag ccaataataa aatttccaat gacgagaagg acagagagat 1740 cgcaagtctc cgaaaggaaa ttgaaggact gagaaaagta attgaaggga tgagccaaaa 1800 cagagtcgta ttggggaaac agcgttttaa tctggatatt tcggaaacag aaggactagc 1860 ttctgaaacc gaaccccaga acacggaacc ggaagcaatg gaagatgaag atatctacga 1920 gcttcgtggt agcaaagagc aaaacggaag aaagcgagga aaaaatgttg aatcggaaag 1980 tgacaatagt aaatccagcg ataaattcga ggaaaagcaa ggcaatcaat cgaaatcgaa 2040 caacaccaaa cgaggaagac caccaaaaca ccggaaaaat gatcaacgaa aataaaacat 2100 gcatctccaa tcacatcaat acagaccctt gcagaaacaa tgattatgaa cacggaaacc 2160 taatggaatc aagatcgaat ttggaaacag cagtaggaaa tattgaagga gaaattgaaa 2220 cacaaccagg tcccaaacac caacataatt cctgtgaacc ctcaataatc tcttggaata 2280 ttcaaggact aaaaacagga cgtcctgaaa ttaaagtgat caacaacgaa aggcaacctt 2340 tagcatattg cctccaagaa acaatgtgct caacagaaga tcaagctcga atcaggggat 2400 ttgtaacatt tcatcgaaag cgtactggag gtagaagagc atcaggaggc gtattgattg 2460 cggtcaaggt tggattagat tgtcaggaga ttgatgtgga gagcgaaatt gaagcagtag 2520 cggttaaagt ggggcacccg ttaaatacgt ccatccttaa cgtgtatatt ccgccgggaa 2580 aacctattga agaatcggaa atcgttaact tgattaatca agtccaagcc cctttattag 2640 tggttggaga tgtcaacgca aaccaccctc tgtggggttc accttcaagt aatgcccgag 2700 gacgagtgtt agaagcagct ttcaacgaat gtaatttagc tatcttgaat aacggagaag 2760 caactcatat tcaccatgca aatgggaacg cgtcttgcat cgacatagca tgcagtacga 2820 gcgatcttgc tggatcgcta acattcgaag taattgctga ctcgcacggt agcgaccatt 2880 tgccacttat tattactttc ccagcgacag aacggcctat caagagtagg aaaagatgga 2940 aaatagatga agcagactgg caaacttatc aaaatatggt acattttaaa aaccttccta 3000 atgttgacga tcaaatcgga gaaatcgcca gagctatttt ggatgcagca gaaagttcca 3060 taccaaaaac gaaaggaatc aaatctggaa aaactccggt tccgtggtgg aaccgtgaag 3120 tagcagatgc cattaagaac cgcagaaaag ctctgcgaaa cttaaaatca aaagggaaaa 3180 aacattcaaa caaggacgaa ctgttgaaaa agttccaaga agcgcgtaaa actgcaaaag 3240 aaacggtatc aagagctcaa cataattcgt ggcaaaattt cgtcaatgga ttttcagtac 3300 acacacccac aaaagaaatg tggtcaaatt tccaacgtgt tcaaggaaaa cgaacaaact 3360 ccagaatatt ctccataaaa gctgatggaa atacggtcac tgaagacggg gaaatagcag 3420 acgccttggc acaggccttc caaactgtat ccagcaataa cagctaccca aaagatttcc 3480 aaacctacaa aacgacagtg gagaatcaac cgttgcagta tcctgagaac aatgacgctg 3540 agtataacaa accattttca gtgtacgagt tagagcaagc tctcgaagga ctaagaggat 3600 caacaccagg agaagatcag atacactaca aaatgatttc catgcttcct tacgactgta 3660 aaactaccct tctgaaggca tacaatcagc tttgggggga atctaagtac cccgatagct 3720 ggcgtcaatc aaccgttgtt ccaatataca aggggaaagg tgatagatcg aacgcgaaca 3780 actatagacc aatattcctt aacagttgct tggggaaagt aatggaaagg ttaataaaca 3840 accggctcat ccacatccta gaatctaggg aactgctaag taagtatcag tatgcattta 3900 gaaaaggcaa aacgacagtt gaccacttga ctgaaatgga aatgattatt agagaagctt 3960 tcgtcaaaaa agagtatgct tatgcggtat tcttagatgt aacgaaggca tatgatacaa 4020 cttggcgtag actagtgttg aacaggttga gtgaatggaa gatcggggga aaacttctca 4080 aattcttgga gactatgctt cacagaagaa attttaaagt cttagctaat ggtcacttgt 4140 catcagaaag acctatgcag acgggcctct gccaaggatc agtgctcagc gtaacattat 4200 tcttgatagc aattgacact attacaagta ttatgcctag cggaattaga attttgttgt 4260 atgcagatga catagtaata ctagcaaccg gaatcaattc taaagcagta caggatcagc 4320 tacagcacgc actcgacctc atccaggaat ggcaggggaa aactggattc aatatatccc 4380 ccggaaaaag cgttttggtt gttttcagaa aacatcgaaa aaggaagcca aagaccaaaa 4440 ttaccttaag tctcaacaac caaatcatcc cacagaagaa gtatcacaaa tgccttggtg 4500 taatattcga tgaaaccttg caattcgatg aacacgtgga agaagtcaaa gcagcatgta 4560 agcaacggat tcaaattctt cgagcagtcg ctggtagatc gtggggagca gacaggacta 4620 cactcgtcaa actctatcga gcaacgaccg tagaaaaaat attatacgca gcaccgatcg 4680 tctcagcatg caatagcaat acacttaaaa aactagaaac ggtacataat gctggtttaa 4740 ggacaatttg tggagctttc agaactagcc ctatcttaag tctacatgtt gaaacaggaa 4800 tccccagtat caatactctg ttgaaacaac gcacagctat acatgcagtc aaaaatacgg 4860 acgtgtgctc gaccgaaacg attaattcat cagtaggatc atcatgttgt acaactagca 4920 gcgaggaaag ttcaggtgag atgtggggaa caaatacagt agaatttcct gaaacagcgc 4980 agttgagagg gcagcgagta attgatgaaa tgcagcttga tctacctcca gtgagtaggt 5040 acactacacc taagattcca ccatgggaaa ggataaaaat accacttgac aacggtatac 5100 tgaatgctgc tcgagaaggt ataccacaag tagcgatgag aaacctgttc actagtatca 5160 agtcaacaaa ataccgtatt tataatgtca tctacacaga tggctcaaaa cgtgaaaacc 5220 gctgcggata tagcgtggta tgtgatgaac acatcataag aaggcgaatt tatgggaaca 5280 gcagcatata tgcagccgaa tgtgcagctc taaaagaagc gttccaatgg atagtcaaca 5340 acgaacacgt tggagcatac ttaatatgta ctgattcgct cagcgcagtt tcaactttag 5400 aaaaacacaa gattaaaaat aagtggcatg atgaaatgat ccaactttat cgtagtatca 5460 gggaaacagg aaacgacatt aaaattatgt ggattcccag ccacattgga ataataggca 5520 acgagaaagc agattgcgaa gcaaaacgtg cactgggtga cccttacgaa actgtaacat 5580 cgattgacta caaagaaatc aaaaccgtca ttaaaaacca gatcattctt cagtggcagg 5640 ccctctggtc tgcgacgagg gacaataagt tacgcgaagt gaaaccctct gttaaaccct 5700 actcatcagc attcatcgga aatcgaaagg atgatgtagt tctggctcgc attcgtatcg 5760 gacacaccaa gattactcat caatacttaa tgaataagga gcctccacct aactgtcagt 5820 tctgtaatgc gttattaaca gtgagacatt tagtcacaga ttgcccagta ttagacgaag 5880 aaaggcaaaa attaaacata tccaccagcc tccaaaatgt tttggaagat aatttagaca 5940 aagcgaaaaa tgttattcaa tatcttaaaa acatcgacat gtataaacaa atttgaaatc 6000 cctaacgata tgtccaagat gtttgtaatt ttatactaca gtggatccga atgacaatag 6060 ttaaaagatc cactttaatc aataaataaa taaaaaaaaa a 6101 // ID DNA8-78_AP repbase; DNA; INV; 624 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-78_AP. XX NM DNA8-78_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-624 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2014-2014 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 624 BP; 280 A; 72 C; 62 G; 210 T; 0 other; agggctgtga atttaatgca ctaaaaaacc ttaaaaaatg tcaaaaaaat gcaataaaat 60 atgcacttaa aatgcaaaaa atgcaatata aaatgccaaa aattatagta taaaattcac 120 ttaaaatggt ttttaattta atttttatat ttatattgat attataggat attatacatt 180 agtacattac attttacatg cataaaaatg tgtttattta ataaaaaaaa aaaataataa 240 aaatattgat tgaatttgaa ttttgtataa tgctattcac ctgaaacatt tttaaaaatt 300 ttttttatgt atttattttg ttaaaataat aaaacaaact taaatatatt tatttgcggg 360 taccttattc atctttagtc tttactattg cattggacaa aaaaaaaaaa acaaaaaaat 420 gtttttcaaa catgtcaatt gacggaattt gattaaaaat accggaaaat gaactaaaaa 480 aggaaaatac gacctaaaaa tgcaaaaaaa tgcaaaataa aagtttcata aatggattcg 540 tagttacata attcaaatta tgtacttaca aagctacgtt tcacaatcac caaaaaatgc 600 actttcctac aaattcacag ccct 624 // ID Crack-1_HM repbase; DNA; INV; 5270 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5270 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1932-1932 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 86..808 FT /product="Crack-1_HM_1p" FT /translation="MKNIEKIISSKLEEHKKSILKETERLLKDQEKSFTSI FT ISANLKIITDRLDMLEKDINNNKCKISNIEKDLCDIKDSLNFQENDISEKI FT SQIKKYYDKEINSLYKKSIDLENRSRRNNLRIDGLQETPGESWEDCEKAVK FT DIFKTLLKIPSEVVVERAHRIGQFKENKPRTLVLKLLNYQDKNKILKAVKQ FT LKGTGLYINEDFAQETIYHRRKLWEEVKKLRSEGKYAILKYDKIFTRDFKK FT " FT CDS 1979..4792 FT /product="Crack-1_HM_2p" FT /translation="MICLTETWCSDESIQNNTNFQIPNYKLISSERKTCKK FT GGGIATYVRNDQAIKVRKDHSISNADSEVFTIEIINSKFKNIIVSTCYRPP FT EGNIKTFSKYLEEIFLKINKEQKKLFCIGDLNIDCLKYNEFPITKLFFDNM FT FQHCILPIINKPTRVTLNSISAIDNILTNSFLDTSLKAGIVKIDITDHFPI FT YFTIKNDTRVNNNLKKITYKRKINKFSIQNFKDTLSAVDWVKVYQECNLGN FT TNSAYNTFTDIFLKHYNNHFPIKEKEIKVKYLSCPWITSGIKKSSKTKQKL FT YVKYLKNRNEINLSTYKQYKNLFEKIKKHSKKIYYSKLLKKTNGDMKKTWN FT IMKEIIGKKNKKTNSLPDRIIINETEHDDKNSIAEHFNNFFANIGPNMASE FT ILTTNDSFENYLTDLNSELIFNGLSYEELENAKNSLKINKAPGIDEICSNV FT VINIFPIIKKPLFEIFKSSIITGTVPEKLKIAKIVPIYKTGEPYLLNNYRP FT ISILPVFSKLLERIIYNKLYIYLTNNNILNKKQFGFQKQHATEHAIIDLVN FT SINYSFDNNEFVLGIFIDLSKAFDTVDHEILLKKMEKYGVKNVALLWFKNF FT LLNRQQCLNTDENICSKMLKIKCGVPQGSILAPLLFLIYINDLPKASNKLD FT VIMFADDTNLFYSSPSITELFETTNIELEKLNNWLKSNKLSLNTEKTNYIL FT FHPNQKRKKLPNILPLLRIKNKNIERTTTTKFLGLLIDQNISWKAHIDYLN FT TKITKNIGVLYKARPMLSQENLKHLYFSFIHTYYTYGNIAWASTHKSNLKS FT LYQHQKHAIRIVYKKDKLTHAEPLFKLLNALNVYRINIYQNLLFMLKFKLG FT LVPPHFLNEFFKSNKNRYDTREVGNFNVPFRKTKLSRFTISXRGPYLYNKL FT ISKNAIITKLDNINCLKILLKKFVLNISNYLNFH" XX SQ Sequence 5270 BP; 2211 A; 831 C; 658 G; 1569 T; 1 other; ggcgtgactt gcgtgaagtg aacagacgtg tttttgtagc gtagattaag catttataaa 60 aaaaaaaaaa aaaaaaaaca tcacaatgaa aaacattgaa aaaataatct caagtaaact 120 tgaggagcac aaaaaaagta tcctaaaaga aacggaaaga ttattaaaag accaggaaaa 180 atcatttacc tcaattatca gtgcaaatct caaaattatt acagatagat tggacatgct 240 agagaaagac attaataaca acaaatgtaa aatatcaaat atagaaaaag acttatgcga 300 cattaaagac agcctaaatt ttcaagaaaa tgatatctcg gaaaaaatat cacaaataaa 360 aaaatactat gacaaagaaa tcaattcatt atataaaaaa tctatagatc tagaaaatag 420 atcaagaagg aacaacttaa gaattgatgg gctacaagaa acaccaggcg aaagttggga 480 agattgcgaa aaagctgtta aagacatttt caaaacacta ttgaaaatac ctagtgaagt 540 ggtggtcgaa cgagctcatc gaatcggaca atttaaagaa aacaaaccaa gaacattagt 600 cttaaaactc ttgaactacc aagacaaaaa caaaattctc aaagcagtaa aacaacttaa 660 aggaactgga ctatacataa atgaagattt cgctcaagaa acaatatatc atcgaagaaa 720 gttatgggaa gaagtaaaaa aactaagaag cgaaggtaaa tacgccattt taaaatatga 780 taaaatattt acacgagatt ttaaaaaata gcgctacact tttttttaac taagtaaacg 840 cgttttcaat tttaaatttt aaaacaatga acaacaaaac aatagatttt gaatcaatgc 900 gctttaatgt tttcgaaact gctaataata tactcattga cgctgatacg catatttttc 960 aaatgaataa ttttgattct cgatatgtta aaactagtga tgtgccgggt acccggttcg 1020 gacccgggga cccggcagtt cggctgtttt accgggtacc cggtatcggc cattttgttg 1080 cagtacccgg caccgggtat cttcaaattg atttttcctt aaatttaaac tttaaccatt 1140 aaagttcttg atataagtct aaaaatgtta tgtagtcgat aaccacaaag ttctattaac 1200 ttaaaacaat ctcacttaaa atatttttta atatattttt gcaaaaaaat ttatacggat 1260 acaaaaactg taacgctaaa tgaaataatt cgagaatacg aataaaaaga atcggagaaa 1320 aagttttaga atagcttctt tttattctgt tagaacacaa gataattttt agcaatttag 1380 ttttgaatga aacgcttata atgataccac ttttcattcg aatcaagcaa aaatcataaa 1440 agtagagtta taagttaaat attatacaca ttttttatat tttaatttcg ataatattta 1500 taacaaaata ttatttagac ccgctccacc aggaaagaaa aataacttta aactaaatat 1560 agttattaaa aaactgttgc acaacttttg aaaaaagtta ggccctttac cctgcattgc 1620 ggggttttgc taaatggtac ttccgccaca taacttttgg ttagttagcg taaaaagcgt 1680 attttgtgtg aatagttcag cgctagagat ttcacaagat tttattccaa cttttaaaaa 1740 atggactcgg tgccgagaac ccggactcgg acccggtttt ttcagccggg tacccggaat 1800 cggagactcg gcataaactg gttccggcac atctcgatat gttaaaactg aaacttttaa 1860 aactgaactt gaaactatta aaaataattt tacagtaatt cacataaata taagaagcat 1920 aaataaaaat tttgacaaac ttaagcactt cctaattgat tgtaattact tatttagtat 1980 gatttgtcta acagaaactt ggtgttctga cgaatcaatc caaaataaca ctaactttca 2040 aattcccaat tataaattaa tatcttctga aagaaaaaca tgcaaaaaag gaggagggat 2100 tgcaacttat gttcggaatg atcaagcaat aaaagtaaga aaagaccact ccatttctaa 2160 tgccgatagt gaggtcttta caatagaaat cataaactcc aagtttaaaa atattattgt 2220 ttctacctgt tatagaccac ctgaaggtaa tattaaaact ttttctaaat atctagaaga 2280 aatttttctc aaaatcaata aagaacaaaa aaagttattc tgtattgggg acctcaacat 2340 agattgttta aaatacaatg agtttccgat tactaaatta ttttttgata acatgtttca 2400 acattgtatt ttaccaatta ttaataaacc gacacgggta accttaaatt ctatttctgc 2460 catagataac atactaacaa actcatttct agatacctcc ttaaaagcag gaatagtaaa 2520 aattgatata acagatcatt ttccaattta ctttacaata aaaaatgata caagagtaaa 2580 taataatttg aaaaaaataa cttataaaag aaaaataaat aaattttcaa ttcaaaattt 2640 taaagacaca ctatcggcag tagattgggt taaagtgtat caagaatgta accttgggaa 2700 cacaaactct gcatataata cttttacaga tatttttcta aagcactaca ataatcattt 2760 tccaatcaaa gaaaaagaaa tcaaagtaaa atatttgagc tgtccatgga tcaccagcgg 2820 tattaaaaaa tcttcaaaaa caaaacaaaa actatacgtt aaatatttga aaaacagaaa 2880 tgagattaac ctatctactt ataaacaata caaaaatctt ttcgaaaaaa tcaaaaaaca 2940 ttctaaaaaa atttactact ctaaattact taaaaaaaca aatggagaca tgaaaaaaac 3000 atggaacatt atgaaagaaa taatcgggaa aaaaaacaaa aaaacaaata gtttaccaga 3060 tagaatcatt ataaatgaaa ccgaacacga tgataagaac tcaattgccg aacacttcaa 3120 caatttcttt gcaaacatag gcccaaacat ggcgtctgaa attttaacta caaatgactc 3180 ctttgaaaac taccttacag atctaaatag cgaactaatt ttcaatggac taagctatga 3240 agaacttgaa aatgcaaaaa actctttaaa aatcaacaaa gcaccaggaa ttgatgaaat 3300 ttgtagcaat gtagtaataa atatctttcc gataataaaa aaacccctct tcgaaatatt 3360 taaatcttca attataacag gaaccgttcc agagaaatta aaaattgcaa aaattgtacc 3420 tatatataaa actggagaac catatttact aaacaactat agaccaatct caatccttcc 3480 tgtattctca aaacttctcg aacgaataat ttataataaa ctgtacatat atttaacaaa 3540 caataacatc ttaaataaaa aacagtttgg atttcaaaaa caacatgcaa ccgaacatgc 3600 aatcatagat cttgtaaata gcatcaacta ctcatttgat aataatgaat ttgttctagg 3660 aatctttata gatctatcaa aagcattcga tactgttgat catgaaatcc tgctcaaaaa 3720 aatggaaaaa tatggcgtaa aaaatgttgc attactctgg tttaaaaact ttttgctaaa 3780 cagacaacaa tgtttaaata ctgatgaaaa tatctgttca aaaatgctaa agatcaaatg 3840 tggcgttccc caaggttcca ttctagctcc cttactgttt ttaatataca ttaatgatct 3900 tccgaaagcc tcaaacaaac ttgatgtcat aatgtttgca gatgacacta atttatttta 3960 ttcatccccg tcaattacag aactttttga aacaacgaat attgaactcg aaaaacttaa 4020 taattggcta aaatctaata aactatcttt aaacacagaa aaaactaatt acatcttatt 4080 ccatccaaat caaaaaagaa aaaaactacc aaatatattg ccattgctaa gaataaaaaa 4140 caaaaatatc gaaagaacca caacaactaa attcctaggt ttacttattg atcaaaatat 4200 ttcatggaaa gcccatattg actatttaaa tactaaaata accaaaaaca ttggcgtgct 4260 atacaaagct aggccaatgt tatcccaaga aaatctaaaa catctttatt tttcatttat 4320 acatacctat tatacatatg gtaatatagc atgggccagt acccataaat ctaatttaaa 4380 atcactttat caacatcaaa aacatgccat tagaattgtc tataaaaaag ataaactcac 4440 tcatgctgaa cctttgttta aactcctaaa tgcactaaac gtctatagaa ttaacattta 4500 ccaaaactta ctcttcatgc taaaatttaa actcgggctt gtcccaccac atttcttaaa 4560 tgaatttttt aaaagcaata aaaatagata tgacactaga gaagtaggaa actttaatgt 4620 accgttcaga aaaacaaaac tgtcgcgttt cacaatttcg tntcgcggtc cctatctcta 4680 taataaactg atatccaaaa atgccattat tacaaaatta gataacataa attgtttgaa 4740 aatcttacta aaaaagttcg ttttaaatat cagcaattat ttgaactttc actaaactaa 4800 aactgaaagt aaaaccaaaa ttataattga aaacaaaatc caaaaaactt caataatata 4860 tattaatata aaaaatcaat ataagtcaat aataattata tttatgctta ataatatatt 4920 tatgtgtatt atgtgtacta caaacagtat ttaccaactt ttttatttct acattttatt 4980 atatgtgaac ctaatataaa atgaagtatt ttgtgaaact tgatacattg taaaacgaaa 5040 atacaaaaaa tattacaagt acaagcggtt tcttgatgac aagaccatct ggtcttctgc 5100 aagtttcccg cgttctttta atattaatga taccctacaa tcatattttt tgtattctat 5160 aaaattgtga tatattttta gtattatata ttttaaaaac attataactt tgtaccaatc 5220 atgtagaatc acaacattat aaagataaaa gaacaaaaaa aaaaaaaaaa 5270 // ID BEL-102_AA-LTR repbase; DNA; INV; 482 BP. XX AC AAGE02018809; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-102_AA_; KW BEL-102_AA-I; BEL-102_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-482 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018809; Positions 6800 7281. XX SQ Sequence 482 BP; 197 A; 79 C; 75 G; 131 T; 0 other; tgttggcaac acggaatcag ccttttcaac agggatcaac actggtgaag tgacagacaa 60 ctacaatgac acaaggagga tgaaatgaaa cgtcaaaaaa ggtacacaaa ctgaagacat 120 caaacattgc acgtgaagaa gctaattgaa gtaaaaatta taaagttaat agattattat 180 ttgaattcga ttacatttaa caaattatta cacgaaattg taagtaaaat aaatgaattt 240 gacacaacac atatgtaaca cttaaaatat gatttatagc ttaatagtaa tctacacaca 300 attgttactg tacgacggag aacatttgca cacgtgaaag acttatattg taagtaaaca 360 aactattaaa tctaatttaa tcgctaatga aatgctaata tattttagct tgaagcttac 420 tcaactaccc acataaccga gtttgctata ggaaagtccg aatcgtatcc gctccagtaa 480 ca 482 // ID Homo11 repbase; DNA; INV; 2511 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo11 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo11. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2511 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 917..1405 FT /product="Homo11_1p" FT /translation="MLIFAKLIGTGFRNYVQAILNIGAQYGTRDSHDLILS FT RKKLTGNVMVAEYNRIKDLLTTDLPNKHMAFTTDMWTDQFTQRSFLCLSAH FT YIDESFCLKTSILGVKEFLEEKKTGGNILEQVKYILLEYNLEDKRLSCACH FT NLNLVMDDVLEKNLVEEIKTLLEK" XX SQ Sequence 2511 BP; 859 A; 391 C; 470 G; 791 T; 0 other; tagaggtggg cacgggccgg gtttttatcg ggtacccgtg ggcacccgcg gttacccgtg 60 ggtatgggct gaaaaatcta atttcgggct gggtgtgggc ggttgcgggt tagactgctt 120 aaatccgatt tacccgatta aatttttact gttttattaa aatttaaaat gtataaattg 180 tgtaatttgc aggtgctctg tacttgcctg caaaaacagg gttgctttcg acatcgataa 240 ttttgtgtgc gcttgcaaat taagcaatta ttatgcattt aaatttgaat aaaactgtcc 300 gctgacggaa atacaaatct taattttatt ccagcatttt gacaacaaaa tcaataagat 360 tggttcgctg tttagaaaat gtagcaacgt aatattccgt taaatattta aatatttgct 420 atacctgatt tcgtgtacaa atacgattaa tttttgtatc gatgatctct ataaaattgt 480 cattacgatt aaaagcataa taagcattgt acaagaaaat gaacggtaat aataatgagt 540 gtgctaaaac aaaatgtgaa gtgcaaagag caattagtgc caatcacaat atgtacaaat 600 tgatagaaaa ttcttctgga aagagtgatg tttggaaaac tttcaagatt gtggaattcg 660 atggaaagcc tcttgatttt gtgtgctgct gcaaatgcaa gcttgttttg gcttttagcg 720 tcaaaaatgg caccacaaca cttgctcggc acaaatgcgc tttcatgcct ccacctaatc 780 aaccattgtt aaactttaca tcgaaggctg ttccgaagaa tgttctaaaa aacaattagc 840 ttttgtagcc aaagacttgt tgccattaaa cgtaactgag ggtaggtaaa agataatttt 900 ttttttatat tcatcaatgt taatatttgc taaacttata ggaactggtt ttcgtaatta 960 tgttcaagct atattgaaca taggcgccca gtacggaacg cgcgactccc atgatctgat 1020 tctgtcccga aaaaagttaa caggaaatgt tatggttgca gaatacaatc gaataaagga 1080 tctgctcact actgacttgc caaacaaaca catggcattc actaccgaca tgtggaccga 1140 ccagtttacc cagagaagct ttctttgctt aagtgcacat tatattgatg aatccttttg 1200 tttaaaaacg tcaatattag gagtaaaaga atttctagaa gaaaaaaaaa ctggcggtaa 1260 cattttagag caagtgaagt atatattgct tgagtataat ttagaagata agagactttc 1320 ttgcgcctgt cataacttga acttagtcat ggacgacgtt ttagaaaaga atctcgttga 1380 agaaattaag accttattgg aaaaataaat aaatcttgtt aaatatttta agcattcaga 1440 gcttaacaaa aaactgagca aatcactaaa acaagatatt aaaacgcgtt ggaattcagt 1500 tttcattatg cttgaaagca tttgggaagt tcaggaagag attaaaatgt tacttttaac 1560 aaaaaatcat ctacaacgta tagcagatat tgatttcact ttaatggaat gtttaatttc 1620 gtttttaaaa ccattcaaag attgttctga caaattgtca tctgaatcag agccaaccat 1680 tcatatatat gctttatggt acgagaaact aaaaaaaatt gtaaaattga aatgtatgac 1740 tccaatgtta tggaacatgt aaaagaaaag accttgaact ctttagagaa gcgttttcag 1800 gcaacttccg ttcatttggt tggatcattt ttaaatcctc cattcaaaga attaagtttt 1860 ttgccagctg aaaaaaaaat tatgtcttag aaaccgttaa atgcatgatg gcagaaattt 1920 atgatatgca aaatgattca gttgaggacg cgaaaattga aaactctccg ccaaatttga 1980 atcatgaatt tcaagagttt ttggacaaaa atcaaccaaa aaaaagaaaa cttgacgaac 2040 ctgacaaaaa tgcaattgac ttagaaattg ataaatatta tgccacacgc tataattcaa 2100 acttgtcaat cttaaatttt tggcaggagg caacgaacct taaattgttg caagtggtgg 2160 ccaaaaatat tttatgtatt ccggtaagct ctgcaacaag cgagcgagta ttctcctctt 2220 ctggaaaaat tttaaatgaa aggcgaacaa ggctatcaag cgcaaatctt gatatgctcc 2280 ttttcttata taaaaacata taatatttct gtaaataata ttgtaaattt taagaataaa 2340 atggttgtta tttttgagaa aaaaaaattg taatgtttga gaattaaatt tgttgtaatt 2400 gtttcgggcg ggcgggttgg gcgcgggtat gacaattttt tttgctttcg ggtacggata 2460 aaactttgaa cttctgaccg tgggcggaaa ttcggcccgt gcccacctct a 2511 // ID Gypsy-18_RP-I repbase; DNA; INV; 6674 BP. XX AC ACPB02042122; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_RP_; KW Gypsy-18_RP-LTR; Gypsy-18_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-6674 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02042122; Positions 19115 12442. XX CC Positions [4247-4708] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 291..1346 FT /product="Gypsy-18_RP-I_2p" FT /translation="MTSKPITQDILKCIPEYDGTSYDLYHFINTCEELYDT FT YCKPNINEREINHWLLVKTALIKIKGPAKIVIYNNNCQTINEIIRALRSNF FT ADNRSVPDLFSELNRMRAKPKEHPIEYLNRLDEKRTIILTRYKLDGLSGII FT LTELTRQLDAHLVRIFLYGIHPTLGAHLQSLQCQTLDDTRAKIINDCGIVL FT HQLRLTNTPNNESTSYTDSNTHPKNKRPNTTHNNRNNPHHHFNNTFRNFPH FT NNYHQQNYPQHYPQNSHYYPQNSHQNRKPQSNHPMWTPQGSSPFPSKPFNP FT LSQNTVSMRTIKPQELNMTEKRRRNEVNELKTQVEQLTQTVTKLTDHFLVI FT GPGQHPPNT" FT CDS 1583..5146 FT /product="Gypsy-18_RP-I_1p" FT /translation="MIKFYIFPFSPRYNFLLGYETLKLIDASLDFKNNLLR FT YDSSIKELLFSIREENEERTDLKKGKKIIINKNKREEKKLNEENLNNKTEK FT IKINLKKGYNIVEIPVKNRCECGLIDNYEFKSDKIEIGKGLVKTENGKAKC FT LIYAHEVVTICPEPMELDEIVSAHRAPPTTNCPGELETIKTEIPALIRTSH FT MNEEEKNEIIQLVCKYPEIIKRENDKLTSTKLLKHKIVTKDENPVYTRNYR FT HPEAFRADIKDEIQKLLENKIIQSSNSPYNSPIWIVPKKPDASGKRKIRMV FT IDYRKLNEKTIDDKYPLPNIEDLFGRIGRATYFSAIDLASGFHQIEMDPDS FT IPKTAFSTDTGHYEFLRMPFGLKNAPPTFQRAMNILFADSPNILVYMDDII FT IFSDNLTEHLKHLQKVFLKLKEHNLKIQLDKTEFFKKELLYLGHIISNKGI FT SPNPNKIETIKNFPLPKTKKQIKQFLGLTGYYRKMIKNYAKIAKPLTNALR FT NDGEINTDDKEFQDSCTTLKEMLQNHPILQLPNFNKEFYLTTDASNTAIGA FT VLSQNVEGKDLPIAYASRTLSKSEERLSTIEKELLAIVWSCKHFRPYLYGR FT KFSIFTDHKPLQWLHNMKEPTSKLLRWKCALTDYEFDIKYIPGKSNKVADA FT LSRMPQEVTTMDTPPIDSNPDDHSEVNPAEVMDEFLRDYPPQNDTSSLATI FT HSQESSTDQVTLMDRDKILNVEPNQIIIDRGPPNVKVIKIFNKTRIKITVS FT NININEQLDEFILQYLKPKTLYGVYCQTSDILQSEYDSIFNHLQTLIINKY FT PSTKIRRYHKFLLDVEDDPEQREVISNYHAGKTYHRGIAESYEHIRRKYYW FT PGMHKDITDFINQCTTCLKIKYDRRPIKLQYQVTPTPNKPFEKLAADVFTF FT NSQKFLTIIDLFSKKLTVYPITTHNALEIQEKFQIYFSLFPLPASIQMDNG FT REFQNQGVKNLLSLYGIESYYTTPAHSQSQGTIERVHSTLIELLNAIQLQN FT KTNKLERNITLAVIAYNNSLITNLKLSPMEISFGTNNLNPQDIQNTLLDEK FT TKQYHQDLETIHKAIKENIEKEKLQRTQRSNKTRETNINIPKTVYIKSTKH FT KKHTPKNLPGKHNPETKLVKLTNSKSCKEFKIHPDRIKRPRRITRKTSITG FT NHDRPDQPTTSHNTNTSQTTTSLPNNRLD" FT CDS 5064..6650 FT /product="Gypsy-18_RP-I_3p" FT /translation="MIAQINRLLATILILVKQQQAYQITDLTKSKGIIGLK FT LQPSYVINNTYTYIHNINIEEIDQEIKQIEQNLKSINPRNRNPHYINIINS FT FLLYAKDKLAHIYLNKNPRQKRGLINGLGTAISWITGNMDADDKEKYDKII FT KQIANNEHALQHNVQSQLSTNQLIINKFNADLDIIRNNNDKAKSYLQIISN FT ETTTLQLAEQYNFYLINLQLLINKINDIDDSVEFCRNGIVHSSVITHSELS FT HIITETGVHFIDNDPEILWQVGTIKCALHRNFISYFIELPLKSESYETIFL FT LSHPFKLNQETATIYAYPSMILRNYDNLYSSDNCILIKDKYYCKNIKRIQD FT NNCIENILMNKENNCKTIILANPAPFVKYINVINQYLFYNYDNIMLNINNV FT TNITIYSKYCYLLHLDENERIYGIPQPYTYWESKLLDVENSNINHPILQNV FT SFEKLHHLNVNIKPLEILNEFPYDNTHFITLYSIIGLIIIISLCYLLLKNW FT LYRKFRKPGQPAEVGIKLELPPAPTLNPSPRTA" XX SQ Sequence 6674 BP; 2669 A; 1412 C; 950 G; 1643 T; 0 other; taactggcgc ccgaacaggg acctgcaaag tcaatagaat aacattggct tcagtctcta 60 ctgtttattg cagttagcta ctgtgatcat ttatcccatt agcgaacttc atcaacgtgc 120 gacctattgt ctcactcaag aaagaagaca ctctggcgaa ctcccagcat aaacaactaa 180 agcagactac caacgactca tccgacgaag aacagtctac atcaagaacc catactatac 240 actggaaacc aggtacttta aatacaataa ctacaataaa gcacaacacg atgacttcca 300 aacccatcac tcaagatatc ttaaaatgta ttcccgagta cgatggcacg agttatgacc 360 tgtaccattt cataaacacg tgcgaagaac tttatgatac gtactgtaaa cccaatatca 420 atgagcgtga aatcaaccac tggcttttag taaagactgc tttaattaag atcaaaggcc 480 ctgcgaaaat tgtaatttat aataataatt gtcaaacaat aaatgaaatt ataagggctc 540 ttagaagtaa ttttgcagat aatcgatcgg tgcctgatct tttttcagaa ttaaaccgaa 600 tgagggcgaa acctaaagaa cacccgattg aatatttaaa caggcttgat gaaaaacgaa 660 cgataatatt aaccagatat aaattagacg gactttccgg tattatatta accgaactta 720 cgagacagct agatgcccac ctagttagga tattcctcta cggaatacac ccaaccctgg 780 gtgcacacct acaatcccta cagtgtcaaa ccctagatga tacacgagca aagattataa 840 acgattgcgg aatcgtttta caccaactaa gattgacaaa cactccaaat aacgaatcaa 900 cttcctacac ggatagtaac acacacccta aaaacaaaag acctaacaca acacataata 960 acagaaataa tcctcaccac cactttaaca atacatttag aaattttccc cataacaatt 1020 accatcaaca aaattaccct caacattatc ctcaaaattc tcattattat ccccaaaatt 1080 cccaccaaaa tcggaagccc caaagtaatc atcccatgtg gacacctcaa gggtctagcc 1140 cattcccaag caaacctttc aacccacttt cccagaacac ggtatctatg aggactatta 1200 aaccacaaga actaaacatg acagagaaaa gacgccgcaa tgaagtaaat gagttaaaga 1260 cgcaagtgga acaactaaca cagactgtaa cgaaactaac ggatcatttt ttagtaattg 1320 gcccaggcca acacccccca aacacttaga gttcgaatta aaaaacctgt ccaaatcatt 1380 accctacttc ttagacacgg acaacaacaa atggctagtt gacacgggct cgcaaaaaaa 1440 ttatgtcacc cccacaacgg taccaactaa cgctaagaca tacaaagaac gatttctggt 1500 aaaaactccg acaggcgaac aaacaggatc cgagtacatc ttattaaatt taaaaaccat 1560 cttccccgac caacgtgact taatgataaa attttatata ttcccttttt ctccccgtta 1620 caatttctta cttggctatg aaacactcaa actgatagac gcatccctag actttaagaa 1680 taacctacta agatacgata gtagtataaa agaattatta tttagcatca gggaagaaaa 1740 cgaagaaaga acagacctaa agaaaggtaa gaaaataatt ataaataaaa ataaacgtga 1800 agaaaaaaaa ttaaacgaag aaaatttaaa caataaaaca gaaaaaatta aaataaattt 1860 aaagaagggc tacaatatcg tagaaatacc cgtgaaaaat cgctgcgaat gcggactaat 1920 agataattac gaatttaaaa gcgataaaat agaaataggt aaagggttgg ttaaaacaga 1980 aaacggtaag gctaaatgtt taatctatgc acatgaagtc gtgaccattt gtcccgagcc 2040 aatggagcta gatgaaatcg tttctgccca ccgagctcct cctacgacca attgtcccgg 2100 agagctagaa acaataaaaa ccgagatacc cgccttaatt agaacctctc atatgaatga 2160 ggaggaaaaa aatgaaatca tacagctagt atgtaagtac ccggaaataa taaagaggga 2220 aaacgataaa ttaactagca ctaaattatt aaaacataaa atcgtaacaa aagatgaaaa 2280 cccagtctat acaagaaact acagacaccc cgaagctttc agagccgaca taaaagatga 2340 aatacaaaag ttgctagaga ataaaataat acagagtagt aattcaccat ataactcccc 2400 aatctggatt gtcccaaaaa aacctgatgc ttcagggaaa cgtaaaatta gaatggtcat 2460 tgactataga aaactgaatg aaaagacaat agacgataag taccccctcc caaacataga 2520 agacctattt ggtagaatag gcagagccac ttatttttca gctattgatc tggcctcagg 2580 attccaccaa atagagatgg atcccgattc aatcccaaag acagcgttca gcacggatac 2640 aggacattat gagttcctca gaatgccgtt tggactaaag aacgctcccc caacatttca 2700 gcgggcaatg aacatattat ttgcagactc cccaaatatc cttgtctata tggatgacat 2760 aatcatcttt tccgataatt taacagaaca tttaaaacat ttacagaaag tattcttaaa 2820 actaaaagaa cataacttaa aaattcagtt agacaagacc gaattcttta aaaaggaatt 2880 attatacctc ggacatataa tttctaacaa aggaatatca cctaacccaa acaagataga 2940 aactataaaa aacttcccac tacccaagac caagaaacaa ataaaacaat ttttaggatt 3000 aactggttac tataggaaga tgatcaagaa ttacgctaaa atagcaaaac ccctgacgaa 3060 cgctttaaga aatgacggag aaattaatac agatgataag gaattccaag actcatgtac 3120 tactctaaag gaaatgctcc aaaaccaccc tatactccaa ttgccaaact ttaataaaga 3180 gttctattta acgaccgacg cctctaatac agcgataggc gcagtcctct cccaaaatgt 3240 cgaaggcaag gacctcccga tagcctatgc ctcgcgcaca ttaagtaaat ctgaagaaag 3300 gctgagcacc atagagaagg agctgttagc catcgtttgg tcatgcaagc acttcagacc 3360 ctacttatat ggtagaaaat tcagtatatt caccgaccac aaacccctcc agtggttaca 3420 caacatgaaa gaacccacat ccaagctact aagatggaaa tgtgctttaa cggattatga 3480 atttgacata aaatatattc cgggaaaaag caataaagta gcagacgcgt tatcgcgcat 3540 gccacaagaa gtcacaacta tggacactcc ccccatagac agtaatcctg acgaccacag 3600 tgaagtcaat ccagcggaag tcatggatga attcctgagg gattatcccc ctcaaaatga 3660 cactagtagc ctagctacca tacattccca agaaagctct acggaccaag taactcttat 3720 ggaccgagat aagatattaa acgtcgaacc caatcaaata attattgaca ggggaccacc 3780 caatgtaaag gtaataaaaa tcttcaacaa aactcgtatt aaaataactg ttagtaacat 3840 aaacataaat gaacaattag atgaatttat attacaatat ctaaaaccta aaaccttgta 3900 cggcgtttac tgccaaactt cagacatact acaatccgag tacgactcta tattcaacca 3960 cctacaaacg cttataatta acaaatatcc atcaactaaa attcgtcgtt accacaaatt 4020 cttactggac gtagaagacg atccggagca acgagaagtt attagcaatt accatgcggg 4080 aaaaacctac catcgcggga tagcggaatc ctacgagcat atccggagga aatactactg 4140 gcctggtatg cacaaagata taacagactt tataaatcaa tgcacaacat gtttaaaaat 4200 taaatacgac agacggccta tcaaacttca ataccaagtg acccctacac ctaataaacc 4260 cttcgagaaa ttagctgctg atgtattcac ctttaattct caaaaattcc tcaccataat 4320 tgaccttttc agtaaaaaat taacagtcta tcctataaca acacataacg ctctagaaat 4380 ccaagagaaa tttcaaatct acttttcctt atttccttta cccgcctcca ttcaaatgga 4440 caacggcaga gagttccaaa accaaggagt aaaaaacctt ctctcattat acggtatcga 4500 gtcctattac acaactccag cacactccca atcccagggg acgattgagc gagttcattc 4560 taccctcata gaattattaa acgctataca actacaaaac aaaacaaata aactagaaag 4620 aaacattacg cttgccgtaa tagcatacaa taacagccta atcactaatc taaaattatc 4680 cccaatggaa atatcctttg gtacaaataa cctcaacccg caagacattc aaaataccct 4740 actcgacgaa aagactaaac aatatcacca agacctagaa actatacata aagcaattaa 4800 agaaaatata gaaaaagaaa aattacaaag aacccagaga tcaaataaga cgcgcgaaac 4860 aaatataaat atacccaaaa ctgtatatat aaaatcgact aaacacaaga aacacactcc 4920 taaaaatcta ccgggaaaac ataatcctga aactaaatta gtgaaactta caaattctaa 4980 atcttgtaaa gaattcaaaa tacatccgga tagaataaaa agacctagaa gaataacacg 5040 taaaacttct attacaggaa accatgatcg cccagatcaa ccgactacta gccacaatac 5100 taatactagt caaacaacaa caagcttacc aaataaccga cttgactaaa tctaaaggca 5160 ttattgggtt gaaactacaa ccatcatacg taattaataa tacttataca tatatacaca 5220 acattaacat cgaggaaatc gaccaagaaa tcaagcagat agaacaaaat ctaaagtcca 5280 taaaccctag aaaccgtaat cctcattaca tcaatataat caattcattt cttctatacg 5340 ctaaagataa gttagcacac atatatttaa ataaaaaccc cagacagaaa cgcggcttaa 5400 taaatggctt aggaaccgca atttcctgga taactggaaa catggacgct gatgacaagg 5460 aaaaatacga caaaataatt aaacaaatcg caaataatga gcatgcttta cagcataatg 5520 tgcaaagtca attatctaca aatcaactta ttataaacaa attcaacgca gatttagaca 5580 taataagaaa caataacgac aaagctaaaa gttatctgca aattatttca aacgagacga 5640 caactctgca attagcagag caatataatt tttatttaat caatttacaa ttgctaataa 5700 ataaaattaa tgatatagat gatagtgtag aattttgcag aaatggcata gtacattcta 5760 gcgtaatcac tcatagcgaa ctttcacaca tcataacgga aacaggtgtt cactttatag 5820 ataatgatcc ggagatctta tggcaagttg gcacaataaa atgcgccttg cataggaact 5880 tcatatctta tttcatagaa ctacctttaa agtctgaatc atatgagaca atattccttt 5940 tatctcatcc ctttaaactt aatcaagaaa ctgctactat ctatgcctat ccttcaatga 6000 ttcttaggaa ttatgacaat ttatattcta gcgataactg tatattaatt aaagacaaat 6060 attattgcaa aaatatcaaa agaatacaag ataataattg tatagaaaat atattaatga 6120 ataaggaaaa taattgtaaa actattatat tagctaatcc agcaccattt gtaaaatata 6180 ttaatgttat taaccaatac ttattttata attatgataa tataatgtta aatattaaca 6240 atgtaaccaa cattactatc tattctaaat attgctatct cttacactta gatgaaaatg 6300 aacgtatata tggcattccc caaccatata cttattggga aagcaaatta ttagatgtag 6360 aaaactcaaa tataaatcat cctatcctcc aaaatgtctc cttcgaaaaa ttacaccacc 6420 taaatgtaaa tataaaacct ctagaaatac ttaacgaatt cccttacgat aatacccatt 6480 ttataactct ttattctatt ataggattaa taataatcat ttcattatgt tacttgttat 6540 taaaaaattg gctttataga aaatttagga aaccgggtca acccgcagaa gttggtatta 6600 agctagaact tccaccagct ccaaccctca acccttcgcc gaggacggct taaatcttaa 6660 ggagggagga gtta 6674 // ID CR1-78_AAe repbase; DNA; INV; 4219 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-78_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4219 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1166-1166 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 282..1088 FT /product="CR1-78_AAe_1p" FT /translation="MNYTTIDSSNNLPVTSQNIRDAGILAPGHTAQSTSNL FT LPGTTPVSTAHKSQNSREELCDKVNDHSTKSSSIANHSERPPTLAIAVPSL FT ASSSIDNDNWYYITRFKPHETEENVIRYIAHHAHCNSNQILCRKLARLNDE FT SRPLSFLSFKINVPKSIEQILVSDGFWPLGVSIAPFLDRRSNSYKPTGTRP FT HYRSKPSTHALKKQLAHTMPPPLQTTPLTSTKLRNLPPVYQLKNQNRFLPS FT PNVAPQVSQLVPAKHQRTAAPQFRTSLV" FT CDS 770..3964 FT /product="CR1-78_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RLLAIRSVHCSFFRPSIEQLQTYRHSTSLPLETFDTC FT FEETTSTHHATTVANNPSNEYETKKPAARLSIKEPESIPTESKCSSPSLST FT SSSKTPKNCGSPVQDILGLDGIQCYYQNLGGMNSILSDYRLACSDSAYDVY FT AFTETWLGESTLSQQIFGNEYTVFRADRTERTSCKSSGGGVLLAVRSNINC FT NVLQPPNSPAVEQLWVEMKIAGHALFICVTYIPPDRINDPSITFSDTLLWI FT TSRMALXDKILVLGDFNFRSVTWQLSTTGYFYPSPALSSLNSVQKQLLDDF FT STAGLVQINPVFNSNHRLLDLCYVSQDITLDVSVSEAPAPLVKHVRHHPPL FT HIFIKKYVPCKFVPAREKTRYKFWETDFEKMNCFLSTIDWSTIFHDLDTNT FT AVEVFTFVVTYAIDQFTPKDASYAPLNPPWSNLALKKLKSRKRSALKRYSK FT YRSNSAKRIYGTCNSKYKRLNKTLFHAYLNKIQRSLKKNPKKFWSYIDDQR FT KESGLPSTMNLGERQVSTIPDICDIFRQQFSAVFVNENLSPQQVAEAASNV FT PFYPTIGPHPEITVTTIETACKQLKSSSNPGPDGIPAVILKKCCTSLAFPL FT AVIFNRSVNSGCFPTPWKKSFVFPIYKKGKKTDVINYRGIAALCATSKLFE FT LIVLDFLKHNCLKLVSETQHGFIEKRSTSTNLVAYTSFIIKSMEARKQVDS FT IYTDFSAAFDKINHQIMIAKLNRLGICGSLLAWLQSYLLDRTMSVKIGDTT FT SSAFRVTSGVPQGSHIGPFLFLLYLNDVNSLLKCFKLSYADDFKLFYEVVC FT HQDAVFLQNDLDIFTYWCTLNRMSLNPSKCSAISFGRKRSLICFDYSIAGE FT TLNRVTSIKDLGVILDSNLNFKDHISYITAKASKSLGFIFRAAKRFTDVHC FT LKTLYCSLVRSTLEYAVVVWAPYYNNSVQRIERIQHKFIRFALRHLPWNDP FT FNLPRYEDRCSLIDLPLLENRRNISKACFIADVLQSRIDCPCVLNSLNIDI FT HRRTLRSYQFLRLPTSRTNYGHNEPIANMSRLFNKCYHVFDFNLSRSAVKR FT SFSEIFRT" XX SQ Sequence 4219 BP; 1246 A; 1016 C; 713 G; 1242 T; 2 other; ctgctgccag aaatgcacct ctacccaccg ccaackccac ctcctccacc atcgctacca 60 gaacatctcc agtcactggt actccatccg tcactgccaa tattatttcg catcaatcaa 120 ccgcaagcct tccgcctaat gcttcaaatt acaaaattca tcgcactgcc gaccaagcga 180 ttcaaactta cacaggtgcc attcccaaac ctcaacgact caaacccacc acatgctccg 240 ttgcttgcca gacgtctaca gctgctgaga atatttcatc aatgaactac acaacaatcg 300 actcgtccaa caatctcccg gtaacatcac agaacatcag ggatgccgga atacttgctc 360 ctggacacac cgctcaatcc acttcaaatt tgctgccagg tactacgcca gtttccactg 420 cacataaatc tcaaaatagt agggaagaat tgtgtgataa ggtaaatgat cattctacga 480 aaagtagttc tattgcaaac cactcagaac gtccaccaac acttgcaata gccgttccct 540 ctttagcttc gtcaagcata gataatgata attggtatta catcacccgc tttaagccac 600 acgaaactga agaaaatgtc ataagatata ttgctcatca tgctcattgt aactccaacc 660 aaatactttg tcgaaaactc gcccgattga acgacgaatc aagaccactt tcgtttttgt 720 ctttcaaaat caatgtgcct aagagtatcg aacaaattct tgtctctgac ggcttttggc 780 cattaggagt gtccattgct ccttttttag accgtcgatc gaacagttac aaacctacag 840 gcactcgacc tcactaccgc tcgaaacctt cgacacatgc tttgaagaaa caactagcac 900 acaccatgcc accaccgttg caaacaaccc ctctaacgag tacgaaacta agaaacctgc 960 cgcccgttta tcaattaaag aaccagaatc gattcctacc gagtccaaat gtagctcccc 1020 aagtctctca actagttcca gcaaaacacc aaagaactgc ggctccccag ttcaggacat 1080 ccttggttta gatggcatac aatgttacta tcagaatctc ggtggtatga actctattct 1140 ttctgactac agactcgctt gttctgattc agcctatgat gtgtatgcat tcactgaaac 1200 ctggctcggt gaaagcacac tttctcaaca aatttttggc aacgaataca ctgttttccg 1260 tgcagaccga accgaacgca caagctgcaa aagctctggt ggtggagttc ttctagccgt 1320 tcgttccaat atcaattgca atgttcttca gccacctaat agcccagcgg tcgaacagct 1380 ttgggtcgaa atgaaaattg ctggacatgc cttgttcatt tgtgttacct acattcctcc 1440 tgatcgcatc aacgatccat cgataacgtt ttctgatacg ttgttatgga ttacatcacg 1500 catggcacta amggataaaa tattggtatt gggtgacttc aactttcgat ctgttacatg 1560 gcaactcagt actacgggtt acttttatcc cagccctgct ctttcttcac ttaattcggt 1620 tcaaaaacag ctacttgatg atttcagcac tgcgggactt gtacaaatca atccagtttt 1680 taactcaaac catcggttgc tcgacctttg ttacgtaagt caagatatca cgctagacgt 1740 atccgtctct gaagcacctg ctccattagt caaacatgtt agacatcacc ctcctctgca 1800 tattttcatt aagaaatacg ttccttgtaa gtttgttcct gccagagaga aaactcgtta 1860 caaattctgg gaaaccgact ttgaaaaaat gaattgtttc ttgagtacaa tcgattggag 1920 caccatattt cacgatttag ataccaatac agcagtagaa gtatttacct ttgttgttac 1980 atacgccatc gaccaattca ctccaaagga tgcttcttat gcgccattaa acccaccttg 2040 gtcaaacctt gccctcaaaa aactcaaatc tcgcaaaaga tcggctctca aaaggtactc 2100 taaatatcgt agcaactctg ctaaacggat ttacggcact tgtaatagca agtataaacg 2160 cttgaacaaa actctcttcc atgcttactt aaataagatt cagcgtagtc ttaagaaaaa 2220 tccaaaaaag ttttggtcct acatcgatga ccaacggaaa gaaagtggtc ttccatcaac 2280 catgaaccta ggtgagagac aagtgtctac tatacccgac atttgtgata ttttccgtca 2340 acaattctct gctgtcttcg tgaacgagaa cctgagtcca cagcaagtcg ccgaagccgc 2400 ctcaaatgtt cctttttatc caactatcgg tccccatcct gaaattacgg tcacaaccat 2460 tgaaactgct tgtaaacaac tgaagtcttc ctcaaacccg ggtcctgatg gaataccggc 2520 cgttatccta aagaaatgtt gcacttcgct tgcctttcca ttggccgtta tcttcaaccg 2580 ctctgttaat tccggatgct tccctactcc ttggaagaaa tcatttgttt ttccgattta 2640 taaaaaaggt aagaagactg atgtcatcaa ctaccgcgga atcgctgctt tgtgtgctac 2700 gtcgaaactg tttgagttaa ttgttctaga ctttctcaaa cataattgct taaaactcgt 2760 ctcggagacc caacacggtt ttattgaaaa acgttcgaca tcgactaacc ttgtagccta 2820 cacgtccttt attataaaat caatggaagc acgcaaacag gttgattcca tatacaccga 2880 cttctcggcc gcatttgata aaatcaacca ccagatcatg atagctaaac tgaatcgtct 2940 gggaatatgc ggaagtctcc tagcctggct tcaatcttat ttactcgatc gaactatgtc 3000 cgttaagata ggcgatacta cgtcttctgc attccgtgtc acttccggtg taccgcaggg 3060 tagccacatc ggacccttct tattcctgct ctatctaaac gacgtcaatt cactactcaa 3120 atgtttcaaa ttgtcctatg cagatgattt caagctgttt tatgaagtag tgtgtcatca 3180 agatgccgtt ttcttacaaa acgatctgga tatcttcact tattggtgca ccttgaatcg 3240 catgtctctg aacccttcta aatgctcggc catctctttt ggaagaaagc gctctctaat 3300 ctgctttgat tactctattg ctggggagac tttaaataga gtaacttcga tcaaagatct 3360 gggtgttatt cttgatagta acctcaactt taaggatcac ataagctata taacagctaa 3420 agcatcaaag agtctagggt tcatctttcg tgcggccaaa cgttttacag atgttcactg 3480 tctaaaaacg ttatactgct cgttagtgcg ttccaccctc gaatacgcag tcgtagtatg 3540 ggctccttat tataacaaca gcgttcaaag aattgaaagg attcaacaca aattcattcg 3600 ttttgcactc cgtcatttac cctggaatga cccattcaat cttcctagat atgaagatcg 3660 gtgttcattg atcgatcttc ctttattgga aaacaggcgt aacatctcca aagcatgttt 3720 tattgctgat gtgttacagt cccgtattga ttgtccttgt gtcttgaatt cattgaacat 3780 tgatattcac cgtcgtactc tacgttcgta tcagtttctt cgtctaccaa cctcacgtac 3840 taattatgga cacaatgaac ctattgccaa tatgtctcgt ttgttcaata aatgttacca 3900 tgttttcgat tttaatttgt ctcgttcagc ggtaaagaga agttttagtg agatttttcg 3960 aacatagtag gtttatttaa ggtagagtta gcttaattta agaatttatg ttagttttaa 4020 ttcatcattt ggattgtatt attctgttga tgagaaaaga tgagaaggtt ttgcgcccat 4080 ttgagagaga gctaaaatga tagctctact caaacgggct tttcccttct cctaaataaa 4140 taaataaata aataaataaa taaataaata aataaataaa taaataaata aataaataaa 4200 taaataaata aataaataa 4219 // ID BR2_CT repbase; DNA; INV; 240 BP. XX AC K02310; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE C. tentans beta-repeat in Balbiani ring 2. XX KW Satellite; Simple Repeat; BR2_CT; CTBR2; Repetitive sequence. XX OS Chironomus tentans OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RP 1-240 RA Hoeoeg C. and Wieslander L.; RT "Different evolutionary behavior of structurally related, RT repetitive sequences occurring in the same Balbiani ring gene in RT Chironomus tentans."; RL Proc. Natl. Acad. Sci. U.S.A 81(16), 5165-5169 (1984). XX DR GenBank; K02310; Positions 1 240. XX SQ Sequence 240 BP; 100 A; 40 C; 59 G; 41 T; 0 other; aaatgcggca gtaaaatgag aagagtttta gccgaaaagt gtgctgctag aaagggtaga 60 ttcagtgcaa gtaaatgcag atgtttctca agaccaagtt ggtcaggaat taaaccagaa 120 aaacgtagca aatcaggatc aagaccagag aaacgtagca aatcaggatc tagaccagag 180 aaacgtagca aatcaggatc aagaccagag aaacgtagca aatcaggatc tagaccagaa 240 // ID BEL1-I_Dmoj repbase; DNA; INV; 4976 BP. XX AC scaffold_6489; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL7_Dmoj; KW BEL1-LTR_Dmoj; BEL1-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-4976 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1004-1004 (2009). XX DR Genome; scaffold_6489; Positions 51320 46345. XX CC Positions [4100-4630] - Integrase core CC 'CATAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 548..4630 FT /product="BEL1-I_Dmoj_1p" FT /translation="MSLESLKTQRANTRRNISRIRNVVDPQLREEKSAPVQ FT AELKCRLEILEAYFKQAMVLQTQIEKECSTDTGRTELEELYVEIKVCITEQ FT LTEDIHESTFAAVSGPTFLSSAKLPRLALPTFDGNYADYKNFSALFNQMVD FT QQNSLSTIEKFNQLISCLKGPALETVKAFQITPENYRKALERLNQRYDNPT FT LVFLDNISSLFMLKSVSKSNSHEIRSLIDNSTALYNSLKSLGNEAQIAQAM FT LIAIVMDKMDMETKRKWNESLDYSKLPTWDLCVQVVERHCQYLESCGKSES FT SQARTATASKPRSQTTTRHGLSFNCSTHSCPLCLSSEHRVSRCSRLQEMSV FT AERYEAAKRHGLCLNCLGKGHQASKCPSTQRCRCCSRLHHSLLHREAAVSR FT STTMLPSGSNSNDVVTHTHTNRTDQVILATAIVEVMDASGNYRIGRALLDS FT CSQVNFISEQFAQALRLSNGSCESAALVSHTHTAAYIAFLREYEDLNHMSL FT VHSPRLDEPHNYIPHHCVFKPNSTSTKLRVVFDASNRTSSQQSLNDLLMVG FT PTIQTDLYTLLLRFRTYKYALTADVVKMFRQVLVDYRDRKFQYILWRDSTD FT KPLRTYSLNTVTYGTASAPYLAIRSMKYLADLHMQTHKLGSQAINSSFYVD FT DFLCGHDTAEGLQQLKAEVIEVLGKGKFELAKWHSNHPDFVDDSTIKDLNL FT EDGSVTSTLGLAWNQRLDGLLFAFRPKCSVDAITKRTILSVASSLFDPLGL FT VAPIIVTAKIILQELWIVKLQWDESVPQNLYLTWKSFAASLTSLESLKIPR FT FCMQPNTKELQLHGFCDASIRAYGCCIYARTVAADGEVQVHLITSKCRGAP FT TRKLSLPKLELAASHLLAQLYVKIKGIFTSKQTYLWSDSSIVLHWIQQHSS FT TLSTFVGNRVSDIQEATPDCQWRHVPTKLNPADLVSRGCTPIELNESIWFM FT GPAYLKQSSNTWPRDKHHEPDPEIVALEMRKSAFKATTDINYLLDTLRNIS FT SHRRCLRIVAWMLRFIHALRKPVELSTPSLSPQELQRAFNCIIWNLQQQHF FT ADEIHALNKNRQISSHIKFLNPFLQITDGYQLLKVGGRLELANAPEVQKHP FT IILPSKDDFVRHYVQFLHVQHYHAGSKALVALIRLQFWIVNARDLARSIVR FT RCVHCVRYKPKLQQQLMGNLPVERLTPSRPFAHCGVDFCGPVNVYLRIQGK FT APRKAYIAIFVCFATKALHIEVVSDLSTDAFLPALRRMIARRGLPVDIFCD FT NATNFTGASNQLKELRTYFFKQENQQSIISFCTNEFINFRFIPPRAPHFGG FT LWEAAVKSAKGLMTRSLMNARCTFEELATITAEVEAILNSTCEHCLHTI" XX SQ Sequence 4976 BP; 1430 A; 1180 C; 1067 G; 1299 T; 0 other; ttttggcgcc caacgtgggg cattgctgtt caagttccgt gcaacgcatt tagagttata 60 atttggctta tgtacagaca tattctgcat acatacttct gtgcataagt gggatatatc 120 agcatacaag ctattagttg tgcatgcact cgcttgcctt gctattcgca ccggtcactc 180 actttggttg acttcgctgt cgctctacga cccccgccgt cgcatcaccg agtcatcttc 240 ttcgtttcgt cattctctcg cagcaaagca gcttctgctt tgcaatcttc gtcacttcgc 300 tacccctacg atccccccgc gatcgcagca gcaagttatc ggcagtgcgc ttgtttatca 360 tattcgtcag tttggataaa tcatattcgt caaaattata aatacataag tgcagttaat 420 cacaaacgca cttatataca aacgcatcaa ctgcactgaa agtcgacgct ggaattactc 480 ttctttttgc ttaatattgt atgcgctcag ttgagtgaaa taaaatccta aagttgttga 540 tcaaatcatg tctctagaga gccttaaaac acagcgcgcg aatactcgcc ggaatattag 600 ccgaattcgt aatgttgtcg atccccaact aagagaagag aaatctgcac cagttcaagc 660 tgaattaaag tgtcgcttgg aaattttgga agcttacttt aagcaagcaa tggtattgca 720 gactcaaatt gaaaaggaat gttcgaccga tacaggacgc accgaattgg aggaactgta 780 tgtagaaata aaggtttgta ttacagagca actcacggag gatatacacg aatcaacatt 840 cgctgcggta tctgggccca cattcttatc gtcggccaag ctgcccagac ttgcgttacc 900 tacatttgat ggcaattatg cggactacaa gaattttagt gcgctattca accaaatggt 960 ggatcaacag aacagcttat caaccatcga gaagtttaat caactgatca gctgcttaaa 1020 gggaccagca ttagaaacgg taaaggcatt tcagatcaca cctgaaaact accgaaaggc 1080 acttgagcgc ttgaatcaac gctacgacaa tccaacatta gtgttcttgg acaatatctc 1140 gtcgttattt atgctcaaga gtgtatcaaa gtcaaacagc cacgaaattc gaagtttgat 1200 tgacaactct acagcattgt acaactcgct aaaatcatta ggcaacgagg cacaaatcgc 1260 acaggcaatg ttaatagcca ttgttatgga taagatggac atggaaacta agcgcaaatg 1320 gaacgaatca ttggattatt ccaagttgcc tacgtgggat ttatgtgtac aggtagttga 1380 gaggcattgc caatatttgg aatcatgcgg taagtccgaa tcgagccagg cccgtacagc 1440 gactgctagc aaaccacgaa gccagacaac cacgcgacat ggtttgtctt ttaattgttc 1500 aactcactca tgtcctcttt gtcttagctc agagcatcga gtctcacgtt gcagtcgact 1560 tcaagagatg tcggttgcag agcgctatga agcagcgaaa cgacatggac tgtgcctaaa 1620 ctgtcttggc aagggccacc aggcatccaa gtgtccttct actcaacgat gtcgatgttg 1680 ctcgagactt catcacagct tactacatcg agaggcagct gtgagtagat cgacaactat 1740 gttgcccagc ggctcgaatt caaatgatgt ggtgacacat acacacacca atcgaacaga 1800 ccaagtcatt ctagcaacgg ccatcgttga ggtcatggat gcatctggaa actacaggat 1860 cggtcgtgct ctattggatt cttgctcaca ggtgaacttc atctcagaac agttcgcaca 1920 agcactacgc ttgtcaaacg ggtcatgcga atctgcagcg ttggtgagtc acacacatac 1980 agcagcatac atagcatttc tgcgcgaata tgaagattta aatcatatga gtttggtaca 2040 ctcaccaagg cttgacgagc cacataatta cataccacat cactgcgtct tcaagccaaa 2100 tagcacttca acgaagctca gagttgtctt cgatgcctca aatcgaactt catcgcaaca 2160 atcactcaat gatttgctaa tggttggacc gactattcag actgatctat atacgcttct 2220 gctacgcttt cgaacgtaca aatatgcact gactgcagat gttgtgaaaa tgttcaggca 2280 agtattggtg gactatcgtg accgcaagtt ccaatacatt ctttggagag attcaacaga 2340 caaaccacta cggacatatt cactaaacac agtcacatat ggtacggcct cagcaccata 2400 tctcgccata cgcagtatga aatatttggc tgatctacat atgcaaactc acaaattggg 2460 atcacaagct ataaattcat cattctacgt ggatgacttt ctttgtgggc atgacacagc 2520 agaaggcctg caacaactca aggcagaggt catcgaagta cttggcaagg gcaaatttga 2580 attagcaaaa tggcattcga atcatccgga ctttgtcgac gacagcacga tcaaagattt 2640 gaatttagaa gatggctcag ttactagcac attgggctta gcatggaacc aacgtctgga 2700 tgggttgcta tttgcctttc gacccaaatg ctccgttgat gcaatcacca agaggacaat 2760 tttatctgtg gcatcttcac tttttgatcc attgggattg gtagcgccaa tcattgtgac 2820 agcaaagatc attctacagg aactgtggat agtcaaatta caatgggatg agtctgtgcc 2880 gcaaaacctt tatttaactt ggaaatcgtt tgctgcatca cttacttcac tggagtcgct 2940 gaaaatacct cgattctgta tgcagccgaa caccaaggag ttgcagttgc atgggttctg 3000 tgacgcttcc atacgtgcat atggatgctg catttacgca cgcacagttg cagcggacgg 3060 cgaggttcaa gttcacctga tcacgtcaaa gtgcagagga gcaccaactc gtaagttatc 3120 gctaccaaaa ctggaactcg ctgcttctca cctactggcc caactctacg tgaaaatcaa 3180 ggggattttc acatcgaaac aaacctatct gtggagcgac tcgtcgatag ttcttcattg 3240 gatacagcaa cattcgtcga cactatcaac ctttgttgga aaccgcgttt ctgacataca 3300 agaagctaca ccggactgtc aatggagaca tgtcccaact aaattgaatc ccgccgatct 3360 ggtctcgagg gggtgtacac ccatcgaatt aaacgaatca atctggttta tggggccggc 3420 gtatctgaag cagagctcaa acacttggcc aagagacaaa catcatgaac cagatcctga 3480 gattgtagca cttgagatgc gcaagtcagc tttcaaggct acgacggata tcaactatct 3540 actcgacaca ctcagaaaca taagttcgca tcgtcgttgt ttgcgaattg ttgcttggat 3600 gctgcgcttt atacacgcgc ttcggaaacc cgttgagctg tctacgccct cgctatcacc 3660 tcaagagcta cagcgtgcat ttaattgcat tatttggaat ctgcagcagc aacacttcgc 3720 agatgaaatt cacgctttga acaaaaatag gcaaatatca agtcatataa aatttctaaa 3780 tccattttta cagattacag acggatatca actattgaag gttggcggcc gtctcgaatt 3840 ggcaaatgca ccggaggttc agaaacatcc aatcatactt cccagcaaag atgattttgt 3900 tcgtcactac gtgcagtttc ttcatgtgca acactatcat gctggctcaa aggcacttgt 3960 cgcactaatt cgtctgcaat tttggatcgt aaatgctcga gacctggctc gctctattgt 4020 cagaaggtgc gtgcactgtg tgcgctacaa gccaaagttg caacagcaac tcatgggaaa 4080 cctgcctgta gagcgattga cgccatcgag acctttcgct cactgtggag ttgacttttg 4140 tggaccagtc aacgtctatc tccgcattca gggcaaggct cctcgcaagg cctacattgc 4200 catatttgta tgcttcgcca ccaaggctct gcatatcgaa gtggtatctg acctttcaac 4260 ggatgctttc ttacccgcgt tgaggcgcat gattgcacga cgaggattgc ctgtggatat 4320 attttgcgac aatgccacga atttcactgg agcaagcaat caattaaagg aactgcgaac 4380 ctacttcttc aagcaagaaa atcaacagtc aattattagt ttttgcacaa atgaattcat 4440 aaactttcgt tttatcccac ccagagcacc acactttggc ggcctctggg aagccgcagt 4500 taaaagcgcc aaggggctta tgacgcgcag ccttatgaac gcccgctgca cgtttgagga 4560 attggccaca atcacagcag aagtagaagc catcctcaac tcaacctgcg agcattgcct 4620 acacacaata tagaagatga acagttgaaa tctttggatc gctggcgttt gataactggt 4680 atcaaacaat atttttggcg acgctggttg acagactact tcaatgagct gaacgtgcgc 4740 cacaagtgga ccaaaccatc acccagtatc tctatcgggg acatggttct catacatgaa 4800 gacaacgtac cttctcagaa gtggataatg ggtcgcatta cagctacaat tcctggacga 4860 gatcaacgag tacgagtggt agatgtccgc actaccaaag gcataatcca gaccagttca 4920 caaaatagcg attcttcctg tctcttgaaa gactgcgtca ttcaatggga ccggga 4976 // ID BEL-126_AA-LTR repbase; DNA; INV; 839 BP. XX AC AAGE02017920; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-126_AA_; KW BEL-126_AA-I; BEL-126_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-839 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017920; Positions 6849 7687. XX SQ Sequence 839 BP; 321 A; 133 C; 133 G; 252 T; 0 other; tgttagtccc gctacgtcta tagttcgtcg agctccacat tccccaacat cgaccacgcc 60 tgatgagctc tagaaggatt gtttcgtctg cgagaaacgt catgcatggt agatgagaac 120 aagagaaatc taaagcgaga acaaattgaa gtgactgtga tgacatgcaa atcgtcctta 180 aagctattgt agcgtaaaat ttctaaattg agaacaagtg aattatttac aactatcttg 240 attaccttaa atgatattaa agactaagtg aaattattaa gctacttata agtacatgca 300 agtaacttaa acaactaaag gctaaagcaa tatccctatc aatgttaatt ttctttacag 360 tatcgataat gaattattag tatagtaaca tgcataaaat tgaattcgca tccattaatc 420 aaaaacaatt ataaataagg gtgagatgaa caaattaatg aattaatgac aagtactgaa 480 aatgattcaa tttgatactt ctaccttaac agaaaacaaa aactacatat gctgaactat 540 tgaattggat atctaaattt aacataataa gccagataaa aggtgagcag ataaattatt 600 tcgtgcatat tagatatatg aagtcaactc ttgttacagg acgaattata tgtctataga 660 cagaaaatgt catatttgga cggttgagcg gtaacgaatt actacaagta agagtaacat 720 tatttaaagg atcatcacac actaaaatta tactgtttca taggaatttt gtaatttatc 780 atccgtatta cccatcaata aattgacaga attgacgatt cgtttattac cgcccaaca 839 // ID BEL-6_SI-I repbase; DNA; INV; 5681 BP. XX AC AEAQ01022971; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_SI_; KW BEL-6_SI-LTR; BEL-6_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5681 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022971; Positions 404 6084. XX CC Positions [4670-5248] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 275..5614 FT /product="BEL-6_SI-I_1p" FT /translation="MSMTNSIDRQYELFGVIARAVENLHKLGPAKTTRGAV FT QSRLAILKSNWDKFQAQHDSLYNQRSPETLQLAYFADDLHGQCEEQFADAQ FT GTMLNILESFESPTPQCSHAPRASGTPQSTSSGSTWNPSSASHSRRLPRID FT LPNFNGDYSQWHHFKDLFASMINANAELSAVEKLHYLKMSLSDEPATLLKS FT IEISSNGFTRAWDTLIARYDNKRILIEAQLSALFSIRKAKSECSSEVKRLL FT CELKEAIGALATLGCPVQHWDLILIFMTVRQLDVESVKEWEKSLGASSEAP FT SFADFEKFLLGRILTLEAFERTTTSRKPQQANSSRSFGNARIHTAAVSEQK FT CALCSSSHYISSCPKYLGKTLDQRREVVISKNLCFNCLGPHQLKACRSAKR FT CRFCRKPHHSTLHLPAGDSTIASNASSGSSPSTSASTSSPPATHTASIHQH FT ESAAPQDNLEHTFGSVTSNHLSQSRIIHRTPALLATALVDVVSSSGDLYEV FT RALLDQGSEVSFISESVAQLLKLSRRAAAIPIIGIGAQRSSVSNGAVSLKV FT ISQVNKAISLETDALVLPRLTAYLPPARVEFTQWPHIQGLKLADPSFATPG FT KIDLILGANVYAQILEDGIRRGKIGEPIAQKTSLGWVLSGPLSNDTREINA FT SDETSIIGLQCSLDHELLELLQRFWKQEDLAPSSQISMSPEESQCEEHFKT FT THTRDVNGRFVVRLPFKKSVSEFGDSRSIALKMLHRMEKRFEANPSLKIQY FT TDFLREYRVLDHMRSVAASERAPAREFFLPHHGVTRETSTTTKLRVVFNGS FT QKTNLGISLNECLHIGPKLQTDLADILLRWRRHRYVFATDIEKMYRQIRVH FT ENDWPLQKILWRNSPDERPQEYALCTVTYGLASAPYLALRCLQQLALESAP FT THALAAEVLRRDTYVDDVLSGDESIPRAKRKISQLNDVLTAGGFTLKKWIA FT NDLNLLDEIPTCDRESSPTVSVSDNTMHHTLGVRWERQSDNFIFSAPSLAT FT SNNRVTKRSVLSLIARMFDPLGWIAPIIVTAKIFMQDLWAIRLDWDDELPE FT DLKSRWTNYVTLLNGVSHISIPRWFGISSSNLAVEVHGFADASQSALAAVV FT YLRVLNDNENAQVKMICAKTKVAPLKRLTIPRLELSAAALLVRQVKKIREV FT LDLNQAPIHLWTDSTVALTWIKGHPSRWKDFVRNRVSFIQELSNSRWHHVA FT GKENPADLASRGVNPQRLQLEELWWTGPHWLRTRSISWTASTPIHESTDDL FT EERAPRCTTAFKDREQNLWMLDQYSSLRTLLRITAWVQRALKRFRKDVTSP FT SSHEPLTVEEIESSLKWWVKCTQQAYFSAEIKSLKSQNPLPISNDLRRLTP FT FLDTDGLLRLRGRLLRALLDPAEKHPLILPRECRLTTLVIHHHHRKTLHGG FT PQLTLSSIRQKFWIIGGRIPIRAFIHKCVICARHRATTGQQAVGQLPASRV FT IPGRPFLHIGVDYAGPVTLKTVRGRGSKAYKGYFIIFVCFSTSAIHLEVAT FT DYSTEGFLAAFKRFTGRRGICSTITSDCGTNLIGADAELKRLFTASSREWT FT HLANVLTSDGVTWKFNPPSAPHFGGKWEAGVKSVKFHLRRVIGDATLTYEE FT LSTLLVQIEAILNSRPLIALSDDPSDLTALTPGHFIIGSALSTVPEPSLQE FT VSRNRLSRWQLLQTMKESFWQRWSSEYLQQLQTPSKRHRPQDAFQKGSLVL FT IKDERFPPSKWPLARITHVHPGVDGLIRVVTVRTATSTFKRPIVKLCLLPV FT ENKESVQEP" XX SQ Sequence 5681 BP; 1403 A; 1639 C; 1277 G; 1362 T; 0 other; ttttttggtc cttcgagccg gatcgcggat cgcggttttc gtgtcgcgct ctctctttac 60 cggaatcctc ggaattaacg tagttctcga agagtgattt gtcgtgcctc tcgatctctc 120 ggtctctcaa ggcgactcac atctagttat accgcgtggc ctaaaagtgc cgaatattca 180 tcaacgcgga caaggtgact tctcgctgcc gatcaaggtg aaactgtcaa cgagtgcagt 240 gaatcaagag ctcttataat cgaaaccctg caagatgtct atgacgaaca gcatcgatcg 300 ccagtatgag ctgttcggcg ttatcgcgcg tgctgtcgag aacctccaca aactcggccc 360 ggcaaaaacc acccgtggtg ctgtgcaaag tcgcctcgcg atcctgaaat cgaattggga 420 caaattccaa gcccaacatg acagtttata caatcagcga tcgccagaaa ccttacagct 480 agcctacttt gccgacgatc ttcacgggca gtgtgaagaa caatttgctg acgcccaggg 540 caccatgctc aatatcctcg agagttttga gtccccaacc ccgcagtgct cacatgcccc 600 ccgcgcgagt ggaaccccgc agagcacatc atctggaagc acttggaacc cgtcttccgc 660 atcgcattct cgacgcttgc cccgcatcga tttaccgaat ttcaacggtg attactcgca 720 atggcatcat ttcaaggacc tcttcgcatc gatgatcaac gcgaatgccg agctatcagc 780 cgtcgagaaa ctccattatc ttaagatgag cctgtctgat gagccggcta cgctcttaaa 840 aagtatcgaa atctcaagca atggattcac tcgggcatgg gacacactca tcgctcgcta 900 cgataataag cgtatcctca tagaagccca gctctccgcg ctattctcaa ttcgcaaggc 960 caagtccgaa tgctcgtccg aggtaaagcg actcctctgc gagctgaagg aagcaattgg 1020 agccctcgct acgttaggtt gtccagtgca acactgggat cttatcctca ttttcatgac 1080 agttcgtcaa ttagatgttg aatctgttaa agaatgggag aaatcgctcg gagcttcatc 1140 tgaagcccca tcgtttgccg acttcgagaa attccttctc ggtcgcattc tcactctcga 1200 agccttcgag cgaacgacga cctctcgaaa gccgcaacaa gcgaattcgt cacgatcttt 1260 tggtaacgcg cgaatacata ccgctgcagt gagtgaacaa aaatgcgcgc tgtgctcatc 1320 aagccattat atctcatcat gccccaaata cctcgggaaa acgctcgacc agcgaaggga 1380 agtggtcata tcgaagaatc tgtgcttcaa ttgtctcgga cctcatcaac tgaaggcatg 1440 ccgctccgcg aagcgatgcc gcttctgccg taagccacat cactccacgt tacacttgcc 1500 agctggtgac tcaactatcg cctcaaatgc ctcgtctggc tcgtcgccgt caacttcagc 1560 gtcgacctca tctccaccag ctactcatac cgcaagtatt catcagcacg aatccgccgc 1620 tcctcaagat aatcttgagc acacgttcgg ctccgtgaca tcaaatcacc tttctcaatc 1680 gcggattatt catcgcactc cagccttgtt ggctactgct ctcgtagatg tcgtttcatc 1740 cagcggagat ctctacgaag tgcgagctct tctcgatcaa ggctcggaag tctcgtttat 1800 ctcagaatcg gtcgcgcagc tgctcaaact ctcgcgccgt gctgcggcga tcccaattat 1860 cggaatcggt gctcaacgct caagtgtctc aaacggagca gtttctctca aagtaatctc 1920 gcaagtaaac aaagctatct ctctcgaaac cgacgctctc gtattgccgc gattgactgc 1980 ttacttgccc cccgcgcgag tggaattcac tcagtggccg catattcagg gactcaaact 2040 cgcggatcca agctttgcca cgcccggaaa gattgatctc attctcggtg ctaatgtgta 2100 cgctcaaatt ctcgaagacg gcattcgtcg aggaaagatc ggtgaaccca tcgctcaaaa 2160 gacatctctt ggatgggtgc tatctggacc tctctctaac gacactcggg agatcaacgc 2220 ctcagacgaa acttccatta ttggtcttca gtgctctctc gatcatgagc tcctcgagct 2280 ccttcaacgc ttctggaaac aagaagacct cgcgccatca tcgcagatat ctatgtcacc 2340 agaagaatct cagtgcgaag agcatttcaa gactactcat actcgcgacg tcaacggtcg 2400 attcgttgta cgcctgccgt tcaagaaaag tgttagtgag ttcggtgact cgcgctctat 2460 cgcactcaag atgctgcatc gaatggaaaa gcgattcgaa gcgaatccct ctctcaaaat 2520 tcaatatacc gacttcctgc gtgagtatcg tgttctcgat cacatgcgct ccgtcgccgc 2580 gtcagagcgc gctccggctc gcgagttttt tctcccacat cacggagtta cgcgtgaaac 2640 cagcacgacc actaaacttc gcgtcgtatt caacggatcg cagaaaacga atctcggaat 2700 ttcgctaaat gagtgtctcc atatcggacc gaaattgcag acggatctcg cggatatctt 2760 actccggtgg cgccggcatc gatacgtatt cgcgaccgat atcgagaaga tgtatcggca 2820 aattcgcgtg cacgagaacg attggccgct tcagaaaatt ctctggcgaa actctcccga 2880 tgaaaggcct caagagtatg cattgtgtac agtcacatac ggcctcgcga gtgctccgta 2940 tctcgcgcta cgttgcctgc agcaactcgc tctagaatcc gcgccaactc acgcgctcgc 3000 tgccgaagtt cttcgtcgcg acacatatgt cgacgatgtt ctctccggcg acgaaagcat 3060 ccctcgagcg aaaaggaaaa tctcgcagct gaacgacgtt ctcacggcgg gcggatttac 3120 tctcaaaaag tggatcgcta acgacttaaa tctcctcgac gaaattccaa cttgtgatcg 3180 cgaatcatcg cctacggtat cagttagcga caatacaatg catcatactc tcggcgtgcg 3240 gtgggaacgt caatcggaca attttatatt ctcggccccg tcgctcgcaa cctcgaataa 3300 ccgcgttaca aagcgctcgg tattgtcgct catcgcgcgc atgtttgacc ctctcggatg 3360 gatcgcgccg ataatcgtta ccgctaagat tttcatgcaa gatctgtggg ccatccgcct 3420 cgactgggac gacgaactcc cagaagatct gaaatctcgc tggactaact acgtcacgct 3480 actcaatggc gtctctcata tttcaatccc tcgttggttc gggatcagtt catcgaatct 3540 cgccgtggaa gtgcacggct tcgcggatgc gtcgcagagt gctctcgccg ccgttgttta 3600 tctccgcgtt ctcaatgaca atgagaacgc tcaagtgaaa atgatttgtg cgaagacaaa 3660 ggtggcccct ctcaaacgtt tgactatacc gcgacttgaa ctctcggcgg ctgcgctact 3720 cgtgcgtcaa gtgaagaaaa ttcgcgaagt gctcgacttg aatcaagctc caattcatct 3780 ctggaccgat tcaacggtcg cactcacgtg gatcaaaggt catccatcgc gttggaaaga 3840 cttcgtccgg aatcgcgtct cgtttataca agagctctca aactccaggt ggcatcacgt 3900 cgccgggaaa gaaaacccag cagacctcgc atcacgcgga gtaaatcctc aacgtctcca 3960 actagaagaa ttgtggtgga ccggccctca ttggttacgt acgcgctcta tatcgtggac 4020 ggcttccacg cctattcacg aatctacaga tgatctcgaa gaacgagccc cgcggtgcac 4080 aacggcattc aaagatcgcg agcaaaatct ctggatgctc gatcaatact cgtcgctaag 4140 gactctgctg agaataactg cttgggttca gcgtgctctt aaacgctttc ggaaggatgt 4200 aacatcaccg tcgtctcacg agcctctcac agtggaagag atagaatcct cactcaaatg 4260 gtgggtcaaa tgcacgcagc aggcctactt ctcagctgaa attaaaagct tgaagagcca 4320 gaatccacta ccaatttcaa acgatctacg ccgtctaact ccgtttctcg acaccgacgg 4380 gctcctacga ctacgcggac gtctactacg cgctttactc gatcccgcgg aaaaacatcc 4440 cctcatcttg cctcgagaat gtcgattgac gacactcgtg attcatcatc accatcgaaa 4500 aactctccat ggcgggccgc aactcactct atcatcgatt cgccaaaaat tttggattat 4560 cggaggtcgc attcccatca gagcattcat ccacaagtgt gtcatctgtg ctcgtcatcg 4620 ggctaccact ggtcaacagg cggtcggaca acttccagca tcccgtgtaa ttccgggccg 4680 cccattctta cacattggag tcgactatgc cggaccagtg actctcaaaa ccgttcgagg 4740 acgaggctcc aaagcttaca aaggatactt catcattttc gtctgcttta gcacttccgc 4800 aatacattta gaagtcgcta ctgattactc tacggaagga tttctcgcgg cattcaaacg 4860 attcacggga cgacgaggca tctgttccac catcacgagc gactgcggca ccaatctgat 4920 cggtgccgac gcagagctca agcgactgtt caccgcctcc tcccgagaat ggactcacct 4980 agccaatgtc ctcacaagtg acggagtaac atggaaattt aatccacctt ccgcccccca 5040 ctttggtgga aaatgggaag ccggcgtgaa atctgttaaa tttcacttgc ggcgagttat 5100 cggtgacgct actctcactt atgaggaatt atcaactcta ctagtacaaa tcgaagccat 5160 cttaaattcc cgcccactaa tcgctctctc ggatgatccg tcagatctca ccgctctaac 5220 gccaggtcat tttatcatcg gatcggctct ctccaccgtg ccggagccct cgctccaaga 5280 agtatcgaga aatcggttgt cgcgttggca acttctacaa acgatgaagg aatcattctg 5340 gcaaagatgg tcatcggaat acttgcagca attacagact ccctcgaaac gtcatcggcc 5400 tcaagatgcc ttccaaaagg gctctctcgt tctaatcaag gatgaacgtt tccctccatc 5460 gaagtggccc ctcgcgcgca ttactcatgt gcaccctgga gtcgacggtc ttattcgcgt 5520 cgtcaccgta aggacagcta cctcgacgtt taaacgaccc atagtgaagc tgtgtttatt 5580 gccagttgaa aacaaagagt ccgtacaaga accctaattt caccgttcgc actctcgttt 5640 caattttttt cgtctactac gggtagacaa ggcgggcgga a 5681 // ID Sola2-3_AAe repbase; DNA; INV; 4780 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola2-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4780 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1301-1301 (2011). XX DR [2] (Consensus) XX CC ~96% identical to consensus. 4 bp TSDs. TIRs are ~1050 bp long. XX FH Key Location/Qualifiers FT CDS join(1368..1991,1995..2789,2656..2952,2956..3516) FT /product="Sola2-3_AAe_1p" FT /note="transposase." FT /translation="MSKNREIVVPKKSFCCNPLAKANHKSWRKTTKLTKVT FT QKVIDSVESIESAIVPNLRDKICVSCRVEVTRQQNTVPMDPMDVVENEIED FT CMEHSANISAEEFQSMMDILLRLGIQNFRVRELRLKNFRCQMLKQAVDALK FT HLLKVDCPVYYEDTDVPVIKDMIDNCKTASKPIKFQILTLLTPSWSRQKMM FT DETGCSKYANYNKRLDLILNSAGKWFVEKSQKVRATKGILKLPDPGTSILD FT SEKEAQEKAKQYFLECSTLMPGVRDKISVKTSSGRECRQKRLLLGSLNEIF FT ANFKELSPDSKIGFSSFAKLRPPECVLTGSSGTHTVCVCMKHENIELSLRA FT VKALGMMKDSCLNELIANNLICKSPSPQCYFLECEKCPDLSEFSDKLLEEI FT VNFPLEIIKFKNWIQIENHYQLVDNTTSTPEEFVDNFSSLLNGFLPHFFIT FT KVHRFKKLIASIYYTSNIRRHKKTIFSSIISAVCSTGFCRIFLSRRYIGLK FT NSLPQYIILLISAGTKRLSFRAKVKSETKRGYRTRRLRRKLQNYCPERDTI FT APFREGNGNNSSICAVLQCQRKASSNIIIISPVLEHNYVLVHCCFRKLFDF FT IKTNFAQITTTHVFTDGAGSQYKNKYNFTNITFMKKDFQMNVEWNFHASSH FT GKCPCDGVAGTIKRAAYKYSLAPNRDNLISDASTFYDWAVEHQGETMSFSY FT VEQIEYEASLAKLKLRMENVQAIKGTHMYHYFLPQDEYKLISKRYSKSTES FT IVQNLKCK" XX SQ Sequence 4780 BP; 1562 A; 885 C; 897 G; 1436 T; 0 other; gagcaattct cgctgaaacc aggccgccat cggcacgcat cgttagaatt ccaattttat 60 gtcactgatc gctagttttc gataaaactt aagggtggtc ctttcggttt tctcaaattt 120 gtggacccct cgttcgccag ctagctaaac agtttgcgaa aaagcacatt ttttgataaa 180 tctaaggtat ttcaccacgt gatatgtctg atatttctct tgaaacacat agtaaactta 240 atggcatcgt atagaggaag gatgtagctt tcatttgatg ggaaaaaatt tttggccgcc 300 attttgaatt tggccgccat cttggatttc gtcagaaaaa ccgttttttc accattagcg 360 caccgctcgt tttgaattct gaggtcacca tcagaaagct gagaaaaaac tgagtaagat 420 aggctacaga aactaggtgt gcaatggtat ttaccctatg aaatgaacga ttttttaaat 480 catgttctac gattttgacg tatatggcga gtgcaatcaa tgcaattatc attgttcgta 540 caacaaagta acatgttttc aattgctggt acttttctcg agttgagtga agaataggac 600 attaatttgg agtgaaaaaa aaaactggcg gccatcttgt atttcgacgc catcttggtt 660 ttgagcttaa aaattaatat tttaccgtgt tggctatatt ttggctatat tttgaggctt 720 gcgttgaaaa accaatcagg tttttcatcc tatactttcc agcattggca ttacatcccc 780 attgtgacat agcctgcttc tcagcttttt aattctgaca tttttttata tgtgtttcgc 840 gctgctgtca ataaatcagt gaaacgtgca tggcggtcgc aatgaaagca aacatcgctt 900 atactgtaga gctacaatcg tttgtactgt agagctacaa tcgctcatta ttctatagaa 960 ttgaatgaag aacagtactt ttattgagaa aaaatggcac ccattttaat tttggatgtc 1020 atcttacttt tagaaaatcc cgatttttgc cctgttcgca ccaacgattt atatttcgat 1080 gcacccgttg aaaaaatatc atattgctgt aatcatttat tattcaaacc cagattcagg 1140 ttgaaatcct ttatcgaagt caagcaaatt tccagccgca gaaaaatcca cttgcggact 1200 ttaccataac catcgcgctc ccagtgagca acaggacata tggtttaaat cacaggctta 1260 tcctttcaat catcagtttc attgcaaaga agaaggcaag tatttcttta aaactgcgga 1320 agcaagttca ttgatttgta catccgttga attgaaattc catcaacatg tcgaagaatc 1380 gcgagattgt ggtacctaaa aagtcattct gctgtaatcc gcttgcgaaa gctaaccata 1440 aaagctggag aaaaactact aaactaacga aagttacaca aaaagtaatt gattctgtcg 1500 aatcaattga atctgcgatt gttcctaacc ttcgagacaa aatatgtgta tcttgccgtg 1560 tagaagtgac gcgtcagcag aatactgttc ctatggatcc gatggacgtt gttgaaaacg 1620 aaattgagga ttgtatggaa cacagtgcaa acattagtgc ggaagagttt caaagcatga 1680 tggacattct attgcgactt ggcattcaaa atttccgtgt cagagaatta agattaaaaa 1740 acttccggtg tcaaatgctt aagcaagcgg tggatgcact taaacattta ttgaaagttg 1800 attgtccggt ttattatgaa gacactgacg tacctgttat caaagacatg atcgataatt 1860 gtaaaacagc atcaaaaccc atcaaatttc aaattttaac tttgctcacg ccttcttgga 1920 gtagacagaa aatgatggat gaaactggat gcagtaagta tgctaattat aataagcgtc 1980 ttgatcttat ttgactaaat tctgcaggta aatggtttgt tgaaaaatcc caaaaagttc 2040 gtgccacaaa aggtattttg aaacttccag atccaggtac atcaatctta gattcagaaa 2100 aggaggctca agaaaaggca aaacaatact ttttggaatg cagtacgtta atgcccggtg 2160 ttcgagataa aatcagtgtc aaaacgtcca gcggtcggga atgcaggcaa aaacgtttgc 2220 ttcttgggtc actcaacgaa atatttgcaa acttcaaaga gctttcgcca gattcaaaaa 2280 ttggattctc ttcgttcgcc aaattgcgac caccggaatg tgtacttacc gggagctccg 2340 gtacgcatac tgtttgtgta tgcatgaagc atgagaatat tgaattatcc ttgcgtgctg 2400 taaaagcctt agggatgatg aaagatagtt gcctgaacga gttaattgct aataatttga 2460 tatgtaaaag cccttctcca caatgctatt ttttggaatg cgaaaaatgt cctgatttga 2520 gcgagttcag tgataaattg ttggaagaaa tcgtaaattt tcctcttgaa atcatcaagt 2580 ttaaaaattg gattcaaatt gaaaaccact atcaattagt tgataacacc acaagcacac 2640 ccgaagagtt cgtagataat ttcagcagtt tgctcaacgg gtttttgccg cattttttta 2700 tcacgaaggt acataggttt aaaaaactca ttgcctcaat atattatact tctaatatcc 2760 gcaggcacaa aaagactatc ttttcgagct aaagtcaaat ctgaaaccaa acgaggctat 2820 cgtacacgcc gacttcgcag aaaactacaa aattactgtc cagaacgaga tacaatcgca 2880 ccatttcgcg agggaaatgg taacaattca tccatttgtg ctgtactaca atgtcaacgg 2940 aaagcttcat cataaaacat tatcattatt tctcccgttc tcgagcataa ctacgttttg 3000 gtgcactgct gttttagaaa attatttgat ttcattaaga caaactttgc acagataaca 3060 acgacccacg tgtttacaga tggagctggc agccaatata aaaataagta taatttcaca 3120 aatatcacat ttatgaagaa ggattttcaa atgaatgtgg aatggaattt ccacgccagt 3180 tctcacggaa agtgtccatg tgatggagtg gcgggaacga tcaaacgtgc tgcgtataaa 3240 tacagcttgg cgccaaatag agacaatcta atatccgatg cttcgacatt ctatgactgg 3300 gctgttgagc atcaaggcga aacgatgtct ttcagctacg ttgaacaaat cgaatacgag 3360 gcatccttgg ccaaactcaa gcttcgaatg gagaatgttc aagcaataaa aggtacacat 3420 atgtaccact attttctacc acaagacgaa tacaaactaa tatctaaaag atattcgaag 3480 agtaccgaga gtatagtgca aaacctcaaa tgtaaataat aaggatgtgt tttgtataat 3540 ttaaggtaac attgttatga ttatttttat cgatataatg aactcaaatg tatatgaaga 3600 attaaaaata ttgtaaaatt cgtaaaaatt tgctagaaaa taaaaactta taaattgatt 3660 gaagtacacc tatttactat tcctaatcat aatacatact tgatagctaa ttcaactcat 3720 atttttcatt ttaagttccc aacaatgcat cacaatatat cttgttggtg cgaacaaggt 3780 gagaccggtg atggatttct aatagtgaaa tatcattaaa attatatatc ttttattgat 3840 agcattgttc tccattcaac tgcaataatg agcgattgta ctgtagagct gtaagcgatg 3900 tttgctttca ttgcgaccgc catgcacgtt tcactgattt attgacagca gcgcgaaaca 3960 catataaaaa atgtcagaat taaaaagctg agaagcaggc tatgtcacaa tggggatgta 4020 atgccaatgc tggaaagtat aggatgaaaa acctgattgg gttttcaacg caagcctcaa 4080 aatatagcca acacggtaaa atattaattt ttaagctcaa aaccaagatg gcgtcgaaat 4140 acaagatggc cgccagtttt ttttttcact ccaaattaat gtcctattct tcactcaact 4200 cgagaaaaga accagcaatt gaaaacatgt tactttgttg tacgaacaat gataattgca 4260 ttgattgcac tcgccatata cgtcaaaatc gtagaacatg atttaaaaaa tcgttcattt 4320 catagggtaa ataccattgc acacctagtt tctgtagcct atcttactca gttttttctc 4380 agctttctga tggtgacctc agaattcaaa acgagcggtg cgctaatggt gaaaaaacgg 4440 tttttctgac gaaatccaag atggcggcca aattcaaaat ggcggccaaa aattttttcc 4500 catcaaatga aagctacatc cttcctctat acgatgtcat taagtttact atgtgtttca 4560 agagaaatat cagacatatc acgtggtgaa ataccttaga tttatcaaaa aatgtgcttt 4620 ttcgcaaact gtttagctag ctggcgaacg aggggtccac aaatttgaga aaaccgaaag 4680 gaccaccctt aagttttatc gaaaactagc gatcagtgac ataaaattgg aattctaacg 4740 atgcgtgccg atggcggcct ggtttcagcg agaattgctc 4780 // ID Jockey-N6B_CQ repbase; DNA; INV; 1421 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N6_CQ; Jockey-N6B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1421 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 589-589 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. This family encodes a protein similar to Jockey ORF1p CC but does not encode ORF2p. Thus it is a non-autonomous non-LTR CC retrotransposon derived from Jockey, like HeT-A. The consensus CC is ~76% identical to that of Jockey-N6_CQ. XX FH Key Location/Qualifiers FT CDS 80..1315 FT /product="Jockey-N6B_CQ_1p" FT /translation="MEVEESSEEEDFRPVRGNRKRKKSSEVTGIGIEGEKV FT EKTLLNNNKFSPLADNKNNNNASPAGGGDAPAVPPPGQSAGKQKQPPLVVK FT NTTDFYKLVAIVESCKLGFKPIYKLTRFGTKVTCFSVKDFDKLQELLKKKK FT VEFYTHDRRSERNHRVVLRGFPDELKPDEVKYLLKRDLKLDALEVHIIKRK FT EKSTVDETPYIVVFPKGYTNLKKLSAMKVVASTIIRWEAYRNKRPNVTQCR FT NCLQLGHGTRNCHLKGRCNNCGGPHKTDECKVQEAEPKRCANCSGAHEATD FT RSCPKRADFIRRRQQASKPKPPARKAEKQSPAVPAFTPAEFPPLPGAVPDG FT KSKDRPRPAGSSQGGPRDGGATEKEAGEVLYSSAELWGIFSEYIGRFKTCK FT TRLDQVTLVSYMISKYGI" XX SQ Sequence 1421 BP; 396 A; 356 C; 401 G; 268 T; 0 other; cactcagtcg ccagccagcg acaagttaag acgtgttttg ctcgtgttgt catctcgcgt 60 gcttggaagt ttttgacaga tggaggtgga ggaatcctcc gaggaggaag attttcgtcc 120 cgtccgcggg aatcggaagc ggaagaagtc cagcgaggtc actggaatcg gcatcgaagg 180 agaaaaagtt gagaagaccc tgctcaacaa caacaagttc agcccgctag cggataacaa 240 aaacaacaac aatgccagcc cagcaggagg aggggacgcc ccggcggttc caccacccgg 300 ccagtcggca ggtaaacaaa agcaaccacc tttggtggtg aagaacacca cggattttta 360 caagctcgtg gcgatagtgg aaagttgtaa attaggtttc aaaccgattt acaaactaac 420 ccgatttgga accaaggtga cctgcttctc tgtcaaagat tttgacaagt tgcaagagct 480 actcaagaag aagaaagtgg aattctacac ccacgatcgg cggagcgaac gaaatcatcg 540 ggtggttctt cgtggttttc ctgatgaact gaaaccggac gaggtcaagt accttctgaa 600 gagggatctc aagctggacg cgctggaggt gcatatcatc aagcggaagg aaaagtccac 660 cgtggacgaa actccgtaca ttgtggtctt ccccaaggga tacaccaatc tgaagaagct 720 ctctgcaatg aaagttgtgg caagcactat catccggtgg gaggcctacc ggaacaagcg 780 gccgaacgtg acccagtgca ggaactgctt gcagctgggt catggaacca ggaactgtca 840 cctgaaaggc aggtgcaaca actgtggggg tccccacaag acggacgagt gcaaagtcca 900 agaagccgag ccgaagcggt gtgccaactg ctctggagcc cacgaagcca cggaccgcag 960 ctgccctaag cgtgcggact tcatccggag gcgccagcag gcgtcgaaac cgaaaccgcc 1020 ggcaaggaag gcggagaagc agagtccagc agttccggcg tttacgccgg cggagttccc 1080 tccgctgccg ggcgcagttc cggacggaaa atccaaggat cgccctcgtc ccgcaggaag 1140 cagccaaggt ggcccccgag acggtggagc cacggagaag gaagccgggg aggtactcta 1200 cagttcggct gagctgtggg ggattttctc cgagtacatc ggcaggttca agacctgcaa 1260 gacccgcttg gaccaagtaa ccctcgtcag ttacatgatc tccaagtatg gaatttaagg 1320 agtttttttt ttgttattat atattgttat actgatcctc ggtcccaacc tggtcacagc 1380 acctaaaagg acctaataaa aataagttaa gaaaaaaaaa a 1421 // ID BEL-93_CQ-I repbase; DNA; INV; 3249 BP. XX AC AAWU01007335; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-93_CQ_; KW BEL-93_CQ-LTR; BEL-93_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3249 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 313-313 (2011). XX DR GenBank; AAWU01007335; Positions 20150 23398. XX CC 'CACGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..3249 FT /product="BEL-93_CQ-I_1p" FT /translation="MVSRMKETKCMFCKKADKHEHGWEECSKCGSWAHTKC FT SGVDEKSEEEWICPKCVDSEFLLKVPGNKTKKSGAISDSGTVPGASSVAVP FT TEEELAEEQRAEREAFVKQIELRKKRLAEKLAWSEQRMKEEAEMQALELKT FT QREIEQKQLAHDQEMLQRQLSVEKEFLKKRSALKKQIDASKAKVEALKAEK FT STEQKVTDWLKGGENSGTGGKPKPKPKPAVTPESSDHEDSGEDSDSVKSSK FT SGVKNILKKGGGSFGNTTRITREQLAARKVVSHALPKFKGEPEVWPLFISC FT FEHTTAACGFSNIENLTRLVEALHGEALENVRSSLVFPSAVPGIIQDLRNM FT FGRPEKLLKALLEKVRQAPAPHADRLGTFINFGMVVKQLCDHLEAAGLHDH FT LNNPMLVQELIEKLPPSYQMDWVRFKRGKKENPLRRFANFMKKIVDDASEV FT ADFSPQNMSDHNKPERGKPQGRAFVHVHETEPKQVEAPRIERPSRPCWFCK FT RADHFIRSCEDFKRMNVAERLKEVERLQLCGRCLNNHGSRPCNFRGRCTVP FT NCVGDHHPLLHRAEETVQLQKAENSSRSVIFRMMPVTLQAGKRQYDTVAFL FT DEGSSATLVDDAVAKQLKAEGVPEPLIVNWTGNINRIEDGSRKVELMLSAK FT GSTEKFPLSTRTVGELLLPQQDVNYPEVVGKYTHLSGVPLEDLPSGVPTIL FT IGLDNLHLFAPLESRIGQPCEPIAVRSKIGWTVYGTEKRKPRRSVHLNVHL FT VVPVSSSQAPEKRKVPDGRKPSTVPVSEPKETQAREILEESTTTLGREPEK FT NPALKEKNEPTNRQRNEDCTTKDEQPRAAASEHRSDHGAAPNTNGHETSKA FT PTKMVREIVNNCVAQLTWVRTTWSVNPPLSSHLGDAGERLMRSVSEALMAL FT DDGRRLMDEIPKTATDGAENARPLAYLLQQSGEAEAIFPNSFQRYASPNQP FT GGALPSPHPTVALRDTYQRPKQLAGVTWKRRNDENLPRRTRFGKTISRETM FT EGNERKCWIRGPVDESIVGKDERVSKAWVRTESGWGKRAVAKLAVSEMRVG FT NPKPEADSQTGLRAGE" XX SQ Sequence 3249 BP; 835 A; 788 C; 1069 G; 557 T; 0 other; aaatcaaaat tttttaaacc ggattcggaa atggtttccc gtatgaagga aacgaagtgc 60 atgttctgca aaaaggcaga caagcatgag catggttggg aggagtgttc taagtgcggt 120 tcgtgggctc atacaaagtg ctcgggcgtg gatgaaaagt cggaagaaga gtggatctgt 180 ccgaagtgtg tcgactccga atttctgctg aaagttccgg ggaataagac gaagaagtca 240 ggcgccatca gcgattcggg aaccgtccca ggcgctagtt ctgttgcagt tccaactgaa 300 gaggagttgg ctgaggaaca gcgtgccgag cgggaggctt tcgtgaagca aattgagctg 360 cgcaagaaac gcctggcgga aaagctggcc tggagcgagc agaggatgaa ggaggaggcg 420 gagatgcagg cgcttgagtt gaaaactcag cgcgagatcg agcagaagca gctggcgcac 480 gatcaggaga tgctgcagcg tcagctgtct gtagagaagg agtttttaaa gaaacggtca 540 gctctgaaga agcagatcga tgctagcaag gccaaggtcg aagcgctgaa agccgagaag 600 tccactgagc agaaagtgac cgactggttg aagggcggcg aaaactctgg aactggaggg 660 aaaccgaagc caaagccgaa accggcagtc actccggaga gcagcgatca tgaggactct 720 ggagaagata gcgattcggt gaagtcctca aaaagtggcg tcaagaacat actgaagaag 780 ggtggcggat cgtttggcaa cactactcgc atcacgcgag agcagttggc cgcgcgaaag 840 gtggtgtcgc atgccttgcc gaagttcaag ggcgaaccgg aagtctggcc gctgttcatc 900 agttgcttcg agcacaccac ggcagcgtgt ggcttctcca acatcgaaaa tctcacacga 960 ctcgtggagg cacttcatgg cgaggcgctg gagaacgtgc ggagcagcct agttttccca 1020 agtgcggtgc cgggcatcat ccaagatctc cgaaacatgt tcgggagacc agagaagcta 1080 ctaaaagcgt tgctggagaa ggtgaggcag gcgccggcac cgcatgccga tcgtttgggt 1140 accttcatca acttcgggat ggtggtgaag cagttgtgcg accaccttga agctgcaggt 1200 ctgcacgacc acctgaacaa tcccatgctg gtgcaggagc tgatcgagaa gttgccgcct 1260 agctaccaga tggattgggt caggttcaag cggggcaaaa aggagaatcc actgcgaagg 1320 tttgcgaatt tcatgaaaaa gatcgtggac gacgcttcgg aggtggcgga tttctcgcca 1380 cagaatatga gcgaccacaa caagccagaa agaggaaagc cgcagggcag agcgttcgtt 1440 catgtccatg aaacggagcc gaagcaggtc gaagcgccgc gcatcgagag acccagcagg 1500 ccctgctggt tctgcaagcg ggcggatcac ttcatccgat cctgcgagga cttcaagcgg 1560 atgaacgtcg cggaacggct gaaggaagtg gagagattgc aattgtgtgg ccggtgtctg 1620 aacaatcacg gatctagacc gtgtaacttc aggggccggt gtacggtgcc gaactgcgtc 1680 ggcgaccatc atcccctcct acaccgcgcg gaagagacgg tgcagctgca gaaagcggag 1740 aactcgagtc gttcggtgat cttccgcatg atgccagtaa cactgcaggc cggaaagaga 1800 cagtacgaca ctgtagcgtt cctggacgaa ggatcgtcgg cgacgctggt ggatgacgcc 1860 gtggccaagc agttgaaggc agagggagtt ccagagccgt tgattgtcaa ttggaccgga 1920 aacatcaacc ggattgagga cggatcgcgt aaggtggagt tgatgctgtc agccaaggga 1980 tcgacagaga agtttccact gtcgacacga accgttggag agctgctttt gccgcagcag 2040 gacgtgaact acccagaagt ggtaggaaag tacacgcatc tgtccggcgt gccgctggaa 2100 gatcttccgt caggtgtacc gacgatcctg atcggattgg acaacctgca tctgttcgcg 2160 ccattggagt cacgaatcgg ccagccttgt gagccgattg ccgtgcgatc gaagatcgga 2220 tggacggttt acggaacgga gaagcgaaaa ccgcgcagaa gtgtccattt gaatgtgcac 2280 ttggtggttc cagtgagcag cagtcaagcg cccgaaaagc ggaaggtgcc ggacggtaga 2340 aagccgtcta cagttccggt tagcgagccc aaggagacgc aagcacgaga gattctggaa 2400 gaatcgacga ccactctggg tcgagagccg gagaagaacc cggcactgaa ggagaaaaat 2460 gagcccacga atcgccagcg gaacgaggac tgcacaacca aagatgagca gccgagagcg 2520 gccgcgagcg aacatcgttc ggaccacggg gcagctccga ataccaacgg ccacgagacg 2580 agcaaggctc caacgaagat ggtccgggaa atcgtcaaca actgcgtggc tcagctgacg 2640 tgggtgcgga cgacgtggtc ggtcaatcct cccctgtcat ctcacttggg agatgctggg 2700 gagcgactga tgcgatcggt cagtgaagcg ctcatggcgc tggacgatgg acggcggctc 2760 atggacgaga tcccgaagac tgccaccgac ggagcggaga acgcacgtcc gcttgcgtac 2820 ttgttgcagc agtcgggcga ggctgaagcg atttttccca actcttttca gcgatatgct 2880 tcaccaaacc aaccgggcgg ggctcttccg tcgccgcatc cgacggtggc tctacgggac 2940 acgtaccagc gtcccaaaca gctggccggc gtaacatgga aacgacggaa tgacgagaac 3000 ctgccgaggc ggacgaggtt cgggaagacg atatcgcgag agaccatgga gggcaacgaa 3060 cggaagtgct ggattcgggg ccccgtggac gaatcgatcg tgggcaagga cgagcgggtg 3120 agcaaggcgt gggtgcgcac cgaaagcggc tggggcaaac gagcggtggc gaagctggca 3180 gtgtcggaga tgcgagtagg taaccctaaa ccggaagcgg actcccagac cgggttacgg 3240 gccggggaa 3249 // ID Gypsy-79_AA-LTR repbase; DNA; INV; 157 BP. XX AC supercont1.242; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-79_AA_; KW Gypsy-79_AA-I; Gypsy-79_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-157 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.242; Positions 942032 941876. XX SQ Sequence 157 BP; 53 A; 26 C; 30 G; 48 T; 0 other; tgtaggagat ttaagtaggc aatgaggtat tatataattg aagcaggcta tgattagcag 60 aatagggaca ttcgatgttt tgttattcat taatcctcga gacgccaaat atacaagaca 120 aatcactcaa agctctccac tgtgtttcaa tacttca 157 // ID Gecko repbase; DNA; INV; 198 BP. XX AC . XX DT 19-MAR-2010 (Rel. 15.03, Created) DT 19-MAR-2010 (Rel. 15.03, Last updated, Version 1) XX DE Consensus sequence of Gecko SINE element. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW mosquito; Gecko. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-198 RA Tu Z., Li S. and Mao C.; RT "The changing tails of a novel short interspersed element in RT Aedes aegypti: genomic evidence for slippage retrotransposition RT and the relationship between 3' tandem repeats and the poly(dA) RT tail."; RL Genetics 168(4), 2037-2047 (2004). XX RN [2] RP 1-198 RA Luchetti A.; RT "Submission to Repbase."; RL Direct Submission to Repbase Update (19-MAR-2010). XX DR [1] (Consensus) XX CC 98% identical to consensus. XX SQ Sequence 198 BP; 64 A; 44 C; 50 G; 40 T; 0 other; ggggacggac ctggtgtagt ggttagaaca ctcgcctctc acgccgagga cctgggatcg 60 aatcccatcc ccgacatagt cacttatgac gtaaaaagtt atagtgacga cttccttcgg 120 aagggaagta aagccgttgg tcccgagatg aactagccca gggctaaaaa tctcgttaat 180 aaagatagaa aaaaaaaa 198 // ID BEL-76_AA-LTR repbase; DNA; INV; 195 BP. XX AC supercont1.33; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-76_AA_; KW BEL-76_AA-I; BEL-76_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.33; Positions 3608679 3608485. XX SQ Sequence 195 BP; 56 A; 45 C; 39 G; 55 T; 0 other; tgtttgggcg cactgcgcaa ataatttgta accgtttcac tatctcaata tcattggccc 60 atcaggcaac actgtagagg agaaaaatca gaataaatcg aaactttgtg aaccgaagac 120 gcgcgtgttt gattctcccg tccagaacca tccaatttct gccgaattgt gttttttctg 180 agctatcgac gaaca 195 // ID SMAR22 repbase; DNA; INV; 1715 BP. XX AC . XX DT 05-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR22. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1715 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1080-1080 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 255..1577 FT /product="SMAR22_1p" FT /translation="MEPPTKRRKYEASFKLKVVDVAKGSNNCAAARQFDVT FT EKMVREWRKKEDDIRQMPKNKCAMRRGITRWSKLEDDVAEWVLQQRQDGYI FT VTRNKIRSHALKWAKANKEESKDFKATVGWCSRFMNRRNLVIREKTKIAQK FT LPKDLEHKITNFHHFIIQLRKRHKFPLSHIGNMDETPMNFDMLGNKTVDFK FT GVRTVNVKSTGHEKTRFTVVLGCMADGTKLKPMVIFKRKTKPKITFPPGVL FT VHFHEKGWMDENGVKLWIENIWNRRPGGMRKERSLLVWDMFRSHITANSKT FT RLARNNTDIAVIPGGLTSVLQPLDVSLNKPFKDNARAQWNEWMMNGEKSFT FT KNGAMRGASLDVLCEFVIKAWDNVKVESVIKSFKKCGISNAMDGSEDDLLY FT ESEDEAEVDLPEPEWNPYDDGICDESRDVFEKLFETDGDDGDEFDGF" XX SQ Sequence 1715 BP; 605 A; 257 C; 395 G; 458 T; 0 other; ccgtatattt cggcgtataa ggcgaccgtc agataagacg aggtacaaaa ttagagcaaa 60 tttttatgat ttatcatata tcggttgtat aagacgactg ctaaataacg aaaccatgtt 120 tgtttagcct aggctagaga ttgcaacacc cggtatctta tgaaatattg gttatgtaaa 180 gatgatttat tagtatagat tcagtataga ttggtataga ttgtggtcat tttattatta 240 aacccatatc aggaatggaa ccgccaacaa aaagaagaaa atatgaggca agtttcaagt 300 tgaaagtcgt agacgtagcc aagggttcga acaactgcgc tgctgcaaga caatttgatg 360 ttactgaaaa gatggtacga gaatggagaa agaaggagga tgatataaga caaatgccca 420 aaaacaaatg tgcaatgcgg cgaggaatca cacgttggtc gaaattggaa gacgacgttg 480 cagaatgggt actccagcag agacaagatg gctacattgt gaccagaaat aaaataagat 540 ctcatgcttt gaaatgggcc aaggcaaaca aagaagaaag taaagacttc aaagctacag 600 tgggatggtg cagtaggttt atgaacagaa gaaatttagt aataagagaa aaaacaaaaa 660 tcgctcagaa actaccaaaa gatctcgagc acaaaataac aaacttccac catttcataa 720 ttcagctaag gaaaaggcat aaatttccat tatcccatat cggaaatatg gatgaaactc 780 caatgaattt tgacatgctt ggtaataaaa cagtggattt taaaggtgta agaacagtaa 840 acgtgaaaag tacaggacac gagaagacaa gatttacagt ggtattaggg tgcatggcag 900 atggtactaa attgaaacca atggttatct ttaaacgtaa aacaaaacct aaaataactt 960 ttcctcctgg tgttttggtt cacttccacg agaaagggtg gatggacgag aatggtgtaa 1020 agttatggat tgaaaatatt tggaacagac ggccaggtgg tatgcgaaag gaacgcagtt 1080 tgttggtctg ggatatgttt cggagccata taacggctaa ttctaaaact cgtttagcgc 1140 gcaacaacac cgatattgcg gttatcccag gtggtttaac ttctgtactt cagccgcttg 1200 acgtaagctt aaataagcca ttcaaggata atgctagagc acaatggaat gagtggatga 1260 tgaatggtga aaagtcattt acaaaaaacg gagctatgcg tggtgcctca cttgatgttc 1320 tttgcgagtt tgttataaaa gcatgggata atgttaaggt agaaagtgtc ataaaatcct 1380 ttaaaaaatg tggaatatct aatgccatgg atggtagtga ggatgattta ttgtatgaaa 1440 gcgaagatga agccgaagtc gacttgccag aaccagaatg gaatccctac gatgatggca 1500 tctgtgatga atctcgtgat gtattcgaaa aacttttcga aacagacggt gatgatggtg 1560 atgaatttga tgggttttag atatgcataa aagtagtata aatacaacta cctgtactgt 1620 aatactttct tttatatcgg ttgtataaga cgacccacaa ttttcaaggg tgattttttg 1680 actttaaggg tcgtcttata cgccgaaata tacgg 1715 // ID Gypsy-6_AA-I repbase; DNA; INV; 8963 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_AA_; KW Gypsy-6_AA-LTR; Gypsy-6_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-8963 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 981-981 (2011). XX DR [2] (Consensus) XX CC Positions [5454-5933] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1138..3063 FT /product="Gypsy-6_AA-I_1p" FT /translation="MDLLKVNMNSKFPYAGHLTNDEIDYELKLRNYTEDVE FT KELNFKQRLLRRLFNEDEQENRDYRTSLTIDQEFEFVAGRVNMIRGCLAKS FT FDIKYLSRLKHYHMRVQRCIADDVEAKKMKELLLGNIATLLSSQNALQSKM FT QANPKDLLGETDSDQHYSEKSEGAKSLLCQLPFLLPAMEQSELTQGKESGH FT TDQDMRNLGAYRKSENKMTQEKIQDKNENSTSRTNNSPQSEIATPEQNRSD FT EVIGLQAKMDQLELQVSRLTSLLMASIPNLQQQPERGYSNSCELPANRYNQ FT TNQDNQRTRRFEGLSNINLEPHNMGSPRNNVRQNGNRIRFENNSSEDETPV FT DYPHPRERAIHREDRVQYDRKMEKWNLFFTGDSRSTSLEDFIFKVKVLASM FT NSIPNAKLLSNIHLLLRGEASNWFFTYYQPSWTWDIFETRIRFRFGNPNQD FT QGNRQRIYDRKQQKGETFIAFVTEIERMNKLLTKPLSNTRKFEIIWENMRQ FT HYRSKLACFSVSNLDQLIQANYRIDASDPTLHPVGPKLSVHNIEGEEVEES FT EDEEVNAIDRRQNRGQNQGLNRNGQNPRRENQEATRVPMCWNCQQQGHFWR FT NCREMKTTFCYVCGNPGKISSTCDKHPRRSSQNQAVESPSSSGN" FT CDS 3486..6308 FT /product="Gypsy-6_AA-I_2p" FT /translation="MMQGISGLEEVAEIGQVEETLGTELFHFFIHPIETIP FT AIVPSLPDKTLDIPELDLPEASKTTPETLEIEHDLSAEDRKELEDIIRQFP FT CTTENCLGRTTLLQHEIILREDAKPRRQPIYKCSPAIQAEMDREIERYKKL FT DAIEECTSEWANPLVPVRKSNGKLRVCLDSRRINALTKKDSYPMRDMRSIF FT HRLGSAKFFSVIDLKDAYFQIPLKEESRDYTAFRTATGLYRFKVCPFGLTN FT APFSMCRLMDKVIGFDLEPYVFVYLDDIVVATETLSEHLRLLKLVAERLRK FT ANLTISLDKSRFCRKQVSYLGYLLTEKGVAIDSARIEPILNYTRPKSVKDI FT RRLLGLAGFYQKFIKNYSKIAAPISDLLKKNPKKFLWTESAEEAFQNLKSA FT LISAPILTNPDFSCPFIIESDASDNAVGAALVQNINGETKVVAYFSKKLSS FT TQRKYASVEKECLGVLMAIEHFRHFVEGTKFKVVTDARSLLWLFTIGIESG FT NSKLLRWALKLQSYDIELEYRKGKLNVTADCLSRSLEAITPIDPDYEDLIK FT QIIKDPQSYPDFRVIDNRVYKLVKNQGKIEDTRFLWKQYLPISERERVIKD FT IHDKAHLGFDKTLTAVRERFFWPRMSTQIKTFCRNCLTCQTSKATNINTTA FT PIMAQRKTADYPWQFLTMDYVGPLPVSGKGRSTCLLVITDIFSKFILVQPF FT RQATADSLVPFVENMVFQLFGVPEVILTDNGTQFLAKSFQDLLEHYNVTHW FT RTPNYHPQVNDAERVNRVLTTAIRASIKKNHKDWANNVQTIACAIRNSVHE FT ATHYSPYFVMFGRNMVSDGREYRYLRDNVEANDSTNIEREKLYAEIRENLK FT KAFEKHSRYYNLRANANCPTFSLGEKLLKKNTELSDKGKGYCAKLAPKYIP FT AVVKRKVGEHCYELEDEKGKRLGVYNCRYLKKLTSSS" XX SQ Sequence 8963 BP; 2847 A; 1697 C; 2003 G; 2400 T; 16 other; ccccgcgcga attgcgtaat cgttcgccca tgcggtctcc acagtcaacc agccaagcca 60 acgaacaata ggatcagccc aagccatcca acaatagcga ccmtcatmgc cagtcccgtc 120 gtctgcgaag cagccaagcc acagcagcaa tagaatgcgg taaccgtgag taaaamgmaa 180 tcatgtgcac atgtgacgtc atttwgggga gatttgattc gtkagktcga ccaagwtaag 240 ttwgagagca cgcacgctta ggtaggatag gcaccttagg aactmggagg aataggaaat 300 aaamccggaa atgtwaaatt atccactgtg ttgttcccta acsgatgaag cccwgcccac 360 cgattacgta atatgtttca gtctgtggtt cacacgagcg gtctcctttt ggcgctttcc 420 gtmtttcatt gagaktttgt tgaatgatgt gctttcgtat cggtgccgcc atctgtggtg 480 cggtggtaga atcttccagt gatgatggac ttcatagtgg aaagcttctc ttggcacgtg 540 tcgctagggt aaagtgagta ttgttcgctg ctggattatc gaagtctagt aaggcggacg 600 ctaggaaagt ggagtgggca ggattgttct tcaaaccttt taaaggttgt ataatggacc 660 aacttaggtg ggtctcagac cgggcataaa attaacacga gcgggaggtc ctctaacgag 720 gtggcgctta agcttctcgc tttgcgaaga ataatcgcgt cagtccatgg tggcgaattg 780 taagctaggg atatttccgg ccgaccacga actacgaagt tcgtgcggct acatttggcg 840 cccaactgca agtttcgaac ctaggaatgc ttgaataaat tataggatat ttcaatttat 900 tacggaaaat aattaatgtg gattcgttaa agtcgtttct tcgggatatt taggaaatct 960 agttattgtg gaatttttga attgattttc ggattttttc ttcttgctta ggataattta 1020 tttaggaaat actacatttc ggatattcaa aatattgaaa ttgtttattt attaattgta 1080 agaacttaga taagcctaca ttcgttgaat ttgtactttg aattattggc ttacataatg 1140 gatttgttaa aagtaaacat gaattcgaaa ttcccgtatg cgggtcattt aactaacgat 1200 gaaattgatt atgaactaaa attacgaaat tatacagaag acgtggagaa ggaattgaac 1260 ttcaagcaga ggctactcag gagattattc aatgaagatg aacaagagaa tcgcgattac 1320 cgaacatcct tgacaattga ccaagagttt gagtttgttg cgggacgtgt taacatgatc 1380 aggggttgtt tggctaaatc attcgacatt aaatatctgt cgcgattgaa acactaccac 1440 atgcgggtgc aacgttgtat agcagatgac gtagaagcga aaaaaatgaa agagctctta 1500 ttgggaaata tagcgactct tctttcgagc caaaatgcac ttcaatcaaa gatgcaggcg 1560 aatccaaagg acttattggg ggaaacggat tcagatcaac attattctga gaaatcagag 1620 ggagctaaaa gtttattgtg ccagcttccg tttttgttac ccgctatgga acaatcggag 1680 ctaacacaag ggaaagaatc tggacatact gatcaggata tgagaaattt aggagcctat 1740 aggaaaagcg agaacaaaat gacacaagag aaaattcagg acaaaaatga gaatagtaca 1800 agccgaacaa ataatagtcc acaaagcgaa attgccacac ctgaacaaaa cagatcagat 1860 gaagtgattg gtttgcaggc taaaatggac caattagagt tacaagttag tagattgaca 1920 tcactgctca tggcgtccat ccctaacttg caacaacaac cagaaagagg gtatagtaat 1980 tcttgcgaac ttccagcaaa tagatacaat caaaccaatc aggacaatca aagaacaaga 2040 agattcgaag ggctttcaaa cattaatttg gagcctcaca acatgggaag tccacgaaac 2100 aatgtaagac aaaacggaaa cagaataaga ttcgagaaca acagtagcga ggacgaaacg 2160 ccagtagatt atccccatcc gagagaacgt gcaatacata gagaagacag ggtacagtat 2220 gataggaaaa tggagaagtg gaacctgttc tttacgggtg actccagatc gacttcctta 2280 gaagatttta ttttcaaggt caaggtgcta gcaagtatga atagcatacc aaatgcaaaa 2340 ctgctaagca atatacactt gttattgcga ggggaggcat cgaattggtt tttcacctac 2400 tatcaaccct catggacctg ggatatcttt gaaaccagaa taaggttcag attcggtaac 2460 ccaaaccaag atcaaggcaa tcgccaaagg atctatgatc gaaagcaaca aaagggtgaa 2520 acctttattg ccttcgtgac cgaaattgag agaatgaaca agttgctcac caaaccgctt 2580 tcgaatacaa gaaaattcga aataatttgg gaaaacatga gacaacatta ccgctcaaaa 2640 cttgcttgtt tttcggtatc gaatttagac caactcattc aagcaaatta ccgaattgat 2700 gctagcgacc cgacgttaca tccagtgggt ccaaagctct cagttcacaa catcgaaggt 2760 gaggaagtag aagaatcaga agacgaagaa gttaacgcaa tagacagaag acagaataga 2820 ggtcaaaatc aagggctaaa cagaaatggg caaaatccta ggagggaaaa ccaagaggca 2880 acaagagtac caatgtgttg gaattgccaa cagcaagggc atttttggcg aaattgtagg 2940 gagatgaaaa cgacattttg ttatgtttgc ggaaacccag ggaaaatttc ctcgacctgt 3000 gacaaacacc ctagacgatc ttctcaaaat caggctgttg aatccccttc tagttcggga 3060 aactagattt gggatgcggt agagggaacg ctggcatccc aaagaaacaa gtcgttccaa 3120 attcaaatgc cgttccctat gttgaccctc tagaaagcct tttagagata aaaatacaga 3180 caagtcgttg ccctcacgtc caagttcgaa tatttaatga agaaattgaa gctttactgg 3240 attcgggcgc aggaataagt gtaaccaact cgaagcgact cattgaacat catgggttga 3300 aaattttacc gtcgccgata cggatttgta cggcagacaa aaccaagtac tcttgtgttg 3360 gatacactaa cttgccaata acgtttaaag gtattactcg gacaatttca gtggttattg 3420 taccagaaat ttcaaggact ctaattctgg gcataaattt ctggaaagcg tttgacatca 3480 aaccaatgat gcaaggcatt agtggtctag aagaggtggc ggaaattggt caggttgaag 3540 aaactcttgg aactgagttg tttcacttct tcatccatcc gattgagaca attcccgcaa 3600 tcgtgccttc tttaccagat aaaacacttg acattccaga actggattta ccagaggctt 3660 cgaaaactac tcccgaaact ctagaaattg aacatgacct aagcgcagaa gatcggaaag 3720 agctcgagga tattattcgg caatttcctt gcacaaccga aaattgttta ggacgtacaa 3780 ctttgctgca gcatgaaata attcttagag aagatgcaaa acccagacgg caaccgatat 3840 acaaatgttc accagctata caagcggaaa tggatagaga aattgagcgg tataagaagt 3900 tggatgcaat cgaagaatgc acaagcgaat gggccaatcc tctagttccg gttcggaaat 3960 cgaacggaaa acttagagta tgcttggact cacgcagaat taatgctcta actaaaaaag 4020 actcttatcc aatgcgggat atgaggagta ttttccatcg cttaggaagc gcaaaattct 4080 tttctgtaat agatttaaaa gatgcgtatt tccagattcc tcttaaggag gaatcgcgag 4140 attatacggc ttttagaaca gctacaggac tctatcgatt caaggtttgc ccgtttggat 4200 taaccaatgc tccatttagt atgtgcagac tgatggataa ggtgattggt tttgacttgg 4260 aaccctatgt attcgtctat ttagacgaca tagtggttgc cacagaaaca ctatcagagc 4320 atctaagact tttgaagctg gtagcggaac gcctacgaaa agcaaatttg actatctcgt 4380 tagataaatc aaggttttgc cgaaaacaag taagctattt gggttactta ctgactgaga 4440 aaggagtagc aattgacagt gccagaatcg aaccgatttt aaactatacg agacccaaaa 4500 gtgtgaaaga tattcgtcgt ctgttagggc tagcaggatt ttatcaaaaa tttatcaaaa 4560 attacagtaa aatcgctgcc ccaatatcgg acctgttgaa gaaaaacccg aagaaattcc 4620 tatggacaga atcagctgag gaggcttttc aaaaccttaa atcagctctg atatcagcac 4680 caatactgac gaatcctgac ttttcttgtc cattcataat tgaatccgat gcttcggaca 4740 atgcggttgg tgcagcatta gtccaaaaca tcaacggaga aacaaaagtt gtggcctatt 4800 ttagtaagaa gttgagtagc actcaacgaa aatatgcaag tgtagaaaaa gagtgtcttg 4860 gcgtattgat ggctattgag cactttcgtc attttgtaga agggacaaag tttaaagtgg 4920 tcaccgatgc tcgaagtcta ctttggcttt ttactattgg cattgaatca ggaaactcga 4980 agttacttcg atgggctcta aagcttcagt cgtatgacat agagttagag tacagaaaag 5040 gaaaattgaa cgtaacagcg gattgcttat cacggtccct ggaagcaata actccaattg 5100 accccgatta tgaggattta ataaaacaaa ttatcaaaga tccccaaagc tatcccgact 5160 ttcgagtcat agataatcga gtttataaat tggtgaaaaa tcaaggcaag attgaagaca 5220 ctcgattcct atggaaacaa tatctcccaa tatctgagcg agaaagggtg atcaaagata 5280 tccacgataa agctcatttg gggtttgaca agactctaac cgcggtgaga gaaaggttct 5340 tttggccacg catgagtact caaatcaaga cattctgccg taattgctta acttgtcaga 5400 cgagcaaagc cactaatatc aatacgacag cccctataat ggcccagaga aagacggctg 5460 attatccctg gcagttccta acaatggact atgtaggccc tctgcctgtt tcagggaaag 5520 gaagaagtac gtgtctgctg gttatcacag acattttcag taaattcata cttgttcagc 5580 cttttagaca ggcgacggcc gattctttag tccccttcgt tgagaacatg gtgttccagt 5640 tgtttggagt ccctgaagtc attttaacgg acaatgggac ccagttcctg gccaaatcat 5700 ttcaagacct actagagcat tacaatgtaa cacattggcg aactcctaat taccatccgc 5760 aggtaaacga tgcggagaga gtaaaccgtg tcttgacaac ggctatacga gcgtccatca 5820 agaagaatca taaagattgg gccaacaacg tacaaacaat tgcctgcgca atcagaaact 5880 cagtgcatga agccacccac tactcaccct actttgtgat gtttgggaga aatatggtct 5940 cagatgggag agaatatcgg tacctgaggg acaatgtaga agcaaacgac tccaccaaca 6000 tagagaggga aaaattgtac gcggaaataa gagaaaacct caagaaagcc ttcgaaaaac 6060 attcgaggta ttacaatttg agagcgaatg ccaactgtcc tacattctca ttgggggaga 6120 aactattgaa gaagaacaca gaactatcag ataagggaaa aggctattgc gcgaaactgg 6180 ctcctaaata catccctgct gttgtaaaaa ggaaagttgg ggaacactgt tatgagttgg 6240 aagacgagaa gggaaagcgt cttggagtgt acaattgtcg gtacctgaag aagctcacct 6300 cgtcttcata atagaaatag gcgatcaagc tatgcaacct tttaaggcaa cgatgcgtct 6360 ttgaatgcac aaaaatacat gtttggtatc tcatcttgct cctcgcgaac gagttgagat 6420 gttggtggtt cagctatggt acctttttgg aaaggaaact aagctccaag ttgtgagcaa 6480 taaaatcaac ctggttgcat agtagacatt ctcctcgtag atcgagtcga gccatttggt 6540 aaacagctat gtatactcta agagtggacc acgcagttaa cacctgcacc aaaatacaac 6600 cagccattct tgaagtgctc ctcaatgagt cgagccgtcc cacaataaca atcaatctac 6660 tggtgacgaa cttcaagaat gagcatggat aaatggacac aattgaagtg tgagtgggag 6720 gaataacggg aagcagtgtt ttccagaaag ctatgtacta ctcgggacaa ataaacaagg 6780 aaaccctgac gaggtaacgt tttatgaacg gaaatgaccc aatttgacga aggaaaggtg 6840 aaggaatgag ccctgcctaa cctgatttag tccgaatatt aatcgttagt aatcctatcc 6900 ttattctatg ttagttctta agattattgt tagtcttaga atctaaaatg ctcacatgtt 6960 ggtttagttt gtaaatattc catgtttttc gtttgtgaac ttcagcggcg acgttcctgt 7020 tggacaagat ccgttgagcc agtgtttgtg aaccactgcg ttgttgtaaa tagtagtccg 7080 tagtttcatt caagtcccat ttccattcat tgttggcgag agctgttgtt aatgatcctt 7140 gttgtagtgt tctggtatca cagcgattgt cctggggtcc cattgataac aatacgtcca 7200 atatttcata tacttgtctt gtccatagta gtttgtcctt gttatccaat agtagttcgt 7260 tagccaatcc atcaacatcc agtagtagcg tccaatagcc aattccaatt agaaagtttt 7320 gctccgagtc catagtctcc attgaataat tctccagtag tagttgcgca atcccacgtc 7380 gcagcaactg caataaagac acaaagcatt caattctaaa tttggggaaa gtattgataa 7440 atactccccc cgtaatccat ttccagttaa tggttcagcc agaaaaactc ggtaaaattc 7500 aaagcaaacg tttccatttc actaaaagca cactttcatg ttcaattttc cactttgttt 7560 tgatcagttt gacagttcgc ttcgagtcgg tatcgcgcgg tgcgtttagt acttgggtac 7620 gtatgaagtt agtgtgaagt gaagtgtgaa ggatttgatt cgaggggtgt ttagaggttt 7680 cggctctttt caaatttcgc ttatcagatt tgaatctggt gtgaaatgtg aataagacaa 7740 tgtgattacg ttctatgaga gagatggaca atttgcctga ggagaagctt cggggtgggc 7800 catttctcca tagagcagtt atgtttgcgg tgaattcagc aaagggtcgg tgctcaacgt 7860 tatgaatgag gtagacctgg aggctgaatg agcctgtttc aggttattca tgaaaatttc 7920 aattgttttc attctgttcc aggtgaacag ttttgaagac tgattatggg tgattgtttg 7980 aggattattg tagggagtac aaggcaagcc gtttcgctgc cacttgaacg aacttgtcac 8040 tgctatacag tgagttagtt gtagaataaa tgttctgatt ggtattatat agtaatgtcc 8100 attagtttga atatccgtta gttaattgta aatgtgattt gcagctttat tgatgaaagt 8160 aacagataca ttcgctggag accggagtca attgacgttg atggtcagta gtaggcacgg 8220 tccgtgatag atcgtgtgct atagatagag gaggagttga ccttcgcggt cgacgatgat 8280 accatatgac cccgaatgga aagttgattt cagacgttaa gttgcttgtc tagaataaaa 8340 tattgtatat aatattgaat taacaagtat tttcttatga aaatttagtt gaatcaattc 8400 aactaaattt tcatacccta agcagggagt gatgtaacgg tatcccgtta cagctatgta 8460 aataaaacca tatataaata acttgtacat acttgtttgt cgttaatcgt tacaaaacat 8520 cgcatgcaaa atcatataac agatttagaa ggtaacagct ggtaaatctc acgcaattta 8580 agttccacag cgcctgcgca taggcaatcg cttgcaatat tgaaatcgcc attcgctggg 8640 aagtggtatc gctatccgtt gaccctctcg cgagatttaa ttgattgctt gcgatcgggc 8700 cgaaaaccga aggtcaagtt caaagaagac cttgaaattg gggagaagaa catgaaggcc 8760 atacttagca attactggtg ggagcgagag gtcgaaaacc ggaaagggaa tgaaaaagct 8820 aacccgaacg atccgttaaa agacgatggc aattccaagg tttcctatct gagggtttca 8880 agattcctca tcactttgcc tggatagtcg aaccaataag gaagtgtttg cccgtgtacc 8940 gcgaaattgt aatttgattc cat 8963 // ID Crack-29_BF repbase; DNA; INV; 2278 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-29_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-29_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2278 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2278 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 834-834 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..1972 FT /product="Crack-29_BF_2p" FT /translation="TPAPWLTNDLRDLIALRDSMYKKAKRSGSPTDMTDYK FT AMKNRVNYLCRQAKRDHAAQSVIQGSEGGQAGVWKAIRTLLPNKTHNTVTY FT LEHEGLSISDSKGMAETFNDFFHKVIQDLCDVLSRTVSPFSPTQFLNETNS FT SFSFTPITVEAVTEHLTKLNVKKATGLDSIDNRLLKTGAQVLAPSLTNLLN FT KSLTSGEFPEKWKTAKVMPIHKKGDRTMLSNYRPVSILSSISKILERVVHN FT QLYGYLNQNRLLSQCQSGFRKLHSCETALHSVTEDWIDSIDKKQQTGVIFC FT DLSRAFDTLNHDILLDKLKAYGVDEMACSWFKSYLTGRLQRTCVNSILSNT FT SCVTCGVPQGSILGPLLFIVYINDLPNCVQSCRVAMYADDTIVYFSHRSIQ FT TIQETLQNDCTRLMHWFVANKLSLNPLKCKSMLVGSHKSKATDQQLQLILD FT DTELEQVGTFKYLGVVIDKALLWNDHFLHVNKKLSQIIGIMKCLRPYLNRQ FT ALLTIYTSLFLPHIQYCSTVWDQGGKGAIDKLQKLQYRAGRVILGCDHYTS FT RETVLNLLGWTPVAQHHRRAKAVFMFKALNGMFPPHISHLFTTSADTHSHH FT TRHSTQGGLQLPRTRLQYKKKSLSFSGAALWNSLPAHLRTASSISTFKTIY FT TTEYHMI*" XX SQ Sequence 2278 BP; 673 A; 548 C; 458 G; 599 T; 0 other; tacacctgca ccttggctca ctaacgacct tcgagatctc attgctctta gggacagtat 60 gtacaagaaa gcaaaacgat caggtagtcc aaccgacatg actgattaca aagctatgaa 120 aaacagggta aattacttat gtcggcaggc taaaagggat catgcagccc aatcagtaat 180 tcaggggtca gaaggggggc aggctggggt ttggaaggca atacgtacac ttctacccaa 240 caagacacat aacactgtca cttacctgga gcacgaggga ctgagcattt ctgatagcaa 300 aggaatggca gagactttta acgacttctt ccataaagtc atacaggacc tgtgtgatgt 360 actctcgagg actgtctcgc ccttctcacc cacacagttt ctgaacgaga ctaacagctc 420 gttcagtttt acacctatca cagtagaggc cgttactgaa catctgacaa agctgaatgt 480 aaaaaaggcc accggcctgg acagcattga taacagactc ctaaaaacag gagcacaagt 540 ccttgccccc tcacttacga acctgttaaa taagtcactg acctccgggg aatttccgga 600 aaagtggaag acggccaaag tcatgccaat ccacaaaaaa ggtgacagaa ccatgttaag 660 taattaccgc ccagtgtcca tcttgtctag tatttctaaa atacttgaac gagtggtcca 720 taaccagctg tacggttatc taaatcagaa ccgacttctc tcacagtgtc agtcaggctt 780 tagaaagctc cattcctgtg aaactgcctt gcattctgta actgaggact ggattgactc 840 aattgataaa aaacaacaaa caggtgtgat cttctgcgac ctgtccagag cctttgacac 900 tctaaaccac gacattcttt tggataaact caaagcctat ggggtcgacg aaatggcttg 960 tagttggttc aaatcatacc tgaccggtag attgcagcgg acctgcgtta actcaatctt 1020 atccaataca tcctgtgtga cctgtggagt accccagggg tccattcttg gacccctcct 1080 atttattgtt tatatcaatg acctgcctaa ttgtgtacag tcatgtagag tagctatgta 1140 tgcagatgac acgattgtct acttctccca tcgcagcata cagacaattc aggagaccct 1200 acagaatgac tgcacccgac tcatgcactg gttcgtcgct aacaaactat ccctaaaccc 1260 tttgaaatgc aaatcaatgt tggtgggatc acacaagtcc aaagctactg accagcaact 1320 gcagctcatt ctagatgaca cagagctgga gcaagtgggc acctttaagt acttaggagt 1380 tgtcatagac aaggccctac tgtggaatga ccactttctc catgtcaata agaaactgtc 1440 gcagataatt ggcatcatga agtgtttaag gccataccta aacagacaag cgctgttgac 1500 aatctacact agtctgttcc tcccccacat tcaatattgc agtactgtat gggaccaggg 1560 aggtaaaggg gccatcgaca aactccaaaa gttgcaatat agggcgggca gggtaatatt 1620 agggtgcgac cactacactt ctcgggaaac tgtcttgaac ttactcggct ggacccctgt 1680 agcacagcac cacagaagag ctaaggctgt tttcatgttc aaggctctca acggtatgtt 1740 tcctccacac atatcacatt tgttcactac atctgctgac acacattcac accacacccg 1800 gcacagtaca cagggggggc ttcaactacc aagaacacgg ttacaataca agaagaagtc 1860 tctctcgttt tccggtgcag ctttgtggaa ctcgctacct gcacacttaa gaacagcatc 1920 atccatctct actttcaaaa ccatctacac aacagaatac cacatgatct gatatgtaat 1980 ctgtcacgtc tttgtttttc tgttcacatt ggtatgcaaa ttctcattac ttatttgatt 2040 tcatttatga tttcgagttg tattaggatt aatgtgcaca attgatttgc cttattttgt 2100 cttagttaaa ttgcgttatc atttgccgat tcttgctgag tcaccatgta tttgatttta 2160 aactgttatc atttattgtt atgtatgtac ggggtttccc cccagggatc tttgaaaaac 2220 gccggtcagg cgacatgtac ccctggataa ataaaaataa acaaacaaac aaacaaac 2278 // ID Transib-20_HM repbase; DNA; INV; 4865 BP. XX AC . XX DT 27-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-20_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4865 RA Jurka J.; RT "Transib transposons from the hydra genome."; RL Repbase Reports 9(2), 460-460 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1137..1883,1838..3469) FT /product="Transib-20_HM_1p" FT /translation="MFNNITRLEVFEKIKDTISSQLTSRQIVDQYISMLKN FT ELNVSINLNDDFSRFEKALLEFVCKCRSKLSKKYFRNKSKFLSGESFWLNN FT KILSLINQQPSEQSSQTVDLNVSCLNEQSFGRPCKKYEDMSYRSKRRHASI FT LLVNHDHDKILQAAEMGLKKKKRYSLLYVLKECLSNVDSANEIKCFIQDKN FT KQIVPNTPLEGLALMTAMNLSKTQYQVMRNNSIKCNANIYPTYNNIRDEKK FT SVILMVFIQHSRRKKKCYPDGIYIAENGMRGEVPLKELLYHTVHRMLQITT FT VKNRISQLDSLESCLGMTLYCKWGFDGASSQKQYKQKFYVQINNNDECNVL FT MSDQNLFSTCLVPLKLSYGVMTIWENQVPSSTRFCRPLRIQYIAENPLVLR FT EEKKYFDKQIDLLEESAMIIDVEIMEPEAEKKLVIEVSFNLKLTMIDGKAV FT NAICDNKSAQICNICKCTPRSMNNLDKALIRSTDLQSLEFGLSSLHAWIRC FT FECILHISYRLDIKEWQIRGPENIKLAAVKKKFVQREFAKIGLIVDMPLQG FT GSGTSNDGNTARRAFRQXAEFSRITGVKKDLIYRLYIILSVINLNKKINIA FT KYETYCLETANIFVREYSWFYMPASVHKLLIHSSLTITRFMLPIGMYSEEA FT IEARNKDNKNIRLQHTRKFDRKATMTDQFNYLMLSSDPILSSLNYARSKCK FT HKKMNLDGDAVDLLFQETEKENTNDANDDIYADEEYNELDEDDDFDDRNEE FT LKPININEIRNLNEHNDIIESSENENTDLPFYYYADFENTD*" XX SQ Sequence 4865 BP; 1833 A; 649 C; 701 G; 1672 T; 10 other; gcacagtggg ccagaatttt ttttccgtgg ccaaaagtcc caaaaaagtc aaatcttcta 60 aacttttttt ttattttatt attgaaacat aagtgcttta ctctactcaa aaatgtatcc 120 gtttaattac tagagatgaa ggcaagcttg ctagaaccat tcataatggc tttaaaattg 180 tcgaattttt tttttcaaaa atacttatcc tcaaaattta cataactttt ttcaattaaa 240 aaagctttgg acttttttta ttttaagtaa acatattaac ttagtgctaa taaaaagtaa 300 atgttttagt tatatatttg tatttaattt attcataata tgtcttagaa ttgttggtat 360 tttaaagggg tttttctaaa tttcctcgtc ttcctttttc taaatgccac tcgaagattg 420 tttgattttc tgatttttgc atttactaaa cggcagttat tgttcaaaaa aaaacccagc 480 tgcgaccatg tattctttat gatatatttt ttagttcaaa atcgacgact ttttttgctt 540 ttataaaaag accaatttaa agttaagata tttgtttaag gaatatagtt tttttttttt 600 tttttaattt tattaaacac atcgaaccgt ttttgtttaa aataaacttt tttttattat 660 tttaatgttt tctaaaaagg aaaaaaaaag aaaatcattt cactataaat acgtaaaaat 720 caaatattta agtattcgat tccaaccaaa tgccgctctc ttgaaattgg aattaatgat 780 atttaggaag tataacggtt gtttaatact attaaataaa atcagctgcg actagctata 840 acttagactt aaaatcgatt ttgaaatttc aagattttag gctaatttaa ttcttacttt 900 acaatgcatt gtgacgtgaa aaaaattatt tgtctggaaa tttgaagttt atatgatact 960 atatgattat atgaagacta attagttgat aattgcagcg atttgattgc aattatgttc 1020 cgaaagtaat aaaattattt gaataaattt acactmaatt atagatattt aaattttctc 1080 ttgaagtagc attattaaaa ctcttatctc ataaatagtc aagttatctt ataaaaatgt 1140 ttaataacat aactcgactt gaagtatttg aaaaaattaa agacactatt tcatcacaac 1200 ttacgtcaag acaaatygta gatcagtata tcagtatgct caaaaacgaa ttaaatgtat 1260 ccatcaattt aaatgatgat tttagtcgtt ttgaaaaagc tcttctcgaa tttgtatgca 1320 aatgccgatc aaaacttagc aaaaaatatt ttcgaaataa gagcaaattt ttaagtggcg 1380 agtcattttg gcttaacaat aaaattttgt ctttaattaa tcagcaacct agcgaacaat 1440 catcacaaac cgtcgattta aatgtttctt gtttaaatga acagtcattt gggagaccat 1500 gtaaaaaata cgaagatatg tcctaccgaa gtaaacgtcg tcacgcatcc atattgttag 1560 taaatcatga tcatgataaa attttacaag cagcagaaat ggggttaaaa aagaaaaagc 1620 gatatagttt gctgtacgta ttaaaagaat gtttatcaaa tgtagattcg gcaaatgaaa 1680 taaaatgttt tattcaggac aaaaacaagc aaatcgtacc taatacgcca ctagagggat 1740 tagcattaat gacagctatg aatttatcga agacgcaata tcaagttatg cgaaataact 1800 caattaaatg taatgcgaat atatatccta catataacaa cattcgcgac gaaaaaaaaa 1860 gtgttatcct gatggtattt atatagctga aaatggaatg agaggcgaag ttccactgaa 1920 agaattgtta taccatacag ttcatcgaat gttgcaaata actactgtta aaaacagaat 1980 atcgcaactt gattcccttg aatcctgttt aggaatgact ttgtattgca aatggggctt 2040 cgatggtgca agtagccaaa aacagtataa acaaaaattt tacgtccaaa taaacaacaa 2100 tgatgaatgt aatgttttaa tgtctgatca aaacttgttt tctacttgcc tagtgccact 2160 taagttatcg tatggtgtta tgacaatttg ggaaaatcaa gtgccttctt caacgcgatt 2220 ctgccgccca ctaagaattc agtatattgc tgaaaaccct ttagttttaa gagaagaaaa 2280 aaaatacttt gacaagcaaa tcgatttgct tgaggagagt gcaatgatta ttgatgtaga 2340 aataatggaa ccagaagctg aaaagaagct tgtaattgaa gtaagcttca atcttaagct 2400 tacaatgatt gatggaaaag cagtaaatgc aatttgtgac aataaatcgg ctcaaatttg 2460 taatatttgc aaatgtactc ctagatcaat gaacaactta gataaagctc tgatccgttc 2520 aacggatttg caaagcttag agtttgggtt atcgtcatta catgcatgga ttcgatgttt 2580 cgagtgtatt ctacatataa gttatagact tgatattaaa gaatggcaaa tacgtggacc 2640 tgaaaatata aaattagcag ctgtaaaaaa aaagtttgtt cagcgtgagt ttgcaaaaat 2700 tggccttatt gttgacatgc ctcttcaagg cggatctggg acatcaaatg atggcaatac 2760 tgctcgtcgt gcttttcgtc agcamgcaga gttttcgcga ataacaggag tcaagaaaga 2820 tttaatctac aggctataca ttattctatc tgtcattaat ttaaataaaa aaattaacat 2880 agccaaatat gaaacatatt gtttggaaac tgcaaatatt tttgttcgag agtactcatg 2940 gttttacatg ccagcctcag tacacaaact actcattcat agtagtctga caataactag 3000 atttatgcta ccaattggaa tgtactctga agaagcaatc gaagctcgga ataaagacaa 3060 taaaaatatt cgtttacagc acactcgtaa atttgatcga aaagctacaa tgacggacca 3120 attcaattat ttaatgcttt cgagtgaccc aattctttca tctctaaact atgcaagatc 3180 gaaatgtaaa cataaaaaaa tgaatctaga tggagacgca gttgatcttc tttttcaaga 3240 aactgaaaag gaaaatacaa atgatgctaa tgatgatatt tatgcggatg aagaatataa 3300 tgagttggat gaggatgatg attttgatga tagaaatgag gaactaaaac caataaacat 3360 aaatgaaata agaaatttaa atgagcacaa tgacattatt gaatcatctg aaaatgaaaa 3420 cactgatttg ccattttatt actatgctga ctttgaaaat actgactaag attttattta 3480 tattattttt accatttgat ttaagcaaaa ttattatgtg cactaatatg taaaatagga 3540 ataaagtttt aaccttgagc taataaaaat aaattctaat ttttaaatga ttattttata 3600 ttaacaatat aaaatatagt tagtgtaatt cactagaagt ttatatgaag gatgttaatg 3660 actaatagtt taattatttt tacaatattt taaaacattt acaactttta caacatttaa 3720 aataaaatat ttaaaacatt tttwaaccct tacagtatrt gcaagtgtct ttgyaaagaa 3780 cttcaatttt cttacagttt ctacgttaat tttttatgct taaggttttt taaagttttt 3840 aacattacaa gtttttaaty aaattgctgc aattatyaac taaktagtct tcatataatc 3900 atataaactt caaatttcca gacaaataat ttttttcacg tcacaatgca ttgtaaagta 3960 agaattaaat tagcctaaaa tcttgaaatt tcaaaatcga ttttaagtct aagttatagc 4020 tagtcgcagc tgattttatt taatagtatt aaacaaccgt tatacttcct aaatatcatt 4080 aattccaatt tcaagagagc ggcatttggt tggaatcgaa tacttaaata tttgattttt 4140 acgtatttat agtgaaatga ttttcttttt ttttcctttt tagaaaacat taaaataata 4200 aaaaaaaagt ttattttaaa caaaaacggt tcgatgtgtt taataaaatt aaaaaaaaaa 4260 aaaaaaacta tattccttaa acaaatatct taactttaaa ttggtctttt tatwaaagca 4320 aaaaaagtcg tcgattttga actaaaaaat atatcataaa gaatacatgg tcgcagctgg 4380 tttttttttt gaacaataac tgccgtttag taaatgcaaa aatcagaaaa tcaaacaatc 4440 ttcgagtggc atttagaaaa aggaagatga ggaaatttag aaaaacccct ttaaaatacc 4500 aacaattcta agacatatta tgaataaatt aaatacaaat atataactaa aacatttact 4560 ttttattagc actaagttaa tatgtttact taaaataaaa aaagtccaaa gcttttttaa 4620 ttaaaaaaag ttatgtaaat tttgaggata agtatttttg aaaaaaaaat tttccgaaaa 4680 ttcgacaatt ttaaagccat tatgaatggt tctagcaagc ttgccttcat ctctagtaat 4740 taaacggata catttttgag tagagtaaag cacttatgtt tcaataataa aataaaaaaa 4800 aagtttagaa gatttgactt ttttgggact tttggccacg gaaaaaaaat tctggcccac 4860 tgtgc 4865 // ID Talua repbase; DNA; INV; 259 BP. XX AC . XX DT 27-NOV-2009 (Rel. 14.12, Created) DT 27-NOV-2009 (Rel. 14.12, Last updated, Version 1) XX DE Talua short interspersed element consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; Talua. XX OS Reticulitermes lucifugus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Dictyoptera; Isoptera; Rhinotermitidae; OC Reticulitermes; Reticulitermes. XX RN [1] RP 1-259 RA Luchetti A. and Mantovani B.; RT "Talua SINE biology in the genome of the Reticulitermes RT subterranean termites (Isoptera, Rhinotermitidae)."; RL J Mol Evol 69(6), 589-600 (2009). XX RN [2] RP 1-259 RA Luchetti A.; RT "Identification of a short interspersed repeat in the RT Reticulitermes lucifugus (Isoptera Rhinotermitidae) genome."; RL DNA Seq 16(4), 304-307 (2005). XX DR [2] (Consensus) XX SQ Sequence 259 BP; 42 A; 60 C; 94 G; 63 T; 0 other; gccgatccca gtggccgcgc ggtctaaggc gtgggtctgc ggccgctcgc ttactgggat 60 tgtgggttcg aatcccgccg ggggcatgga tgtctgtctc ttgtgagtgt tgtgtgttgt 120 caggtagagg tctctgcgac gggctgatca ctcgtccaga ggagtcctac cgagtgtggt 180 gtgtctgagt gtgatcgtga agcctcgata atgaggaggc cctaggcccc ctaggggctg 240 ttgagccatg ggaaaaaaa 259 // ID R1D_NLo repbase; DNA; INV; 7071 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia DE longicornis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1D_NLo. XX OS Nasonia longicornis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7071 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 791..1177 FT /product="R1D_NLo_1p" FT /note="broken." FT /translation="LSRAVASSARGGWELQARKRERNSAAGGRRRDSRPLR FT SSLLSLLLPPRVACSLHLDSCRGVGGIFTLGTFRDRKQCVWNQKAMRVESR FT SNACQLILSLKFTCVRKTSRPLGSASHVRNFSKFRKAMRVD" FT CDS 2625..6158 FT /product="R1D_NLo_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="KLKITMPGPQKKNVAEVQPEERSPTTPGNPLVAESGR FT LLTGAREDLEHPSVRTWPVRSGGGLRRLGEGEVGLRAVSVRLVRDERIEKM FT ARKTVEVEEPGSKNEIRFLQINAGGGQLVNAEICELIASKKIDIVLAQEPY FT SKVNKGSRYFTGLRRASRAICLKSAGTKTAPKAFVAVPNPDFHAFFVSALS FT TQHCVVAEVHTPSVTFFAVSMYFQFCDDIEVHLGQLEKVLENLSGQKVVIG FT IDANAESSLWSPRGTNEKGEKLERLIAAFGLHVVNDRTQPPTFEERGVSSY FT IDVTLVSGSMIAEVQSWKVKRDWTSSDHNAIVFKITTVAQTDRVDSSRFNI FT RRADWGLLDSTIKELSVSHLDHIILDSAEEVERMADALQKVLYEACETAIP FT RRRRIRKNNPWWTRELTDKKSELYSARRRMQQQWSLPGHSLRKAEYRALLR FT DYCRSVKGAKVGSWQEVVTVRGNEEPWGVVYKQLRGKLQNERTLSSVRCGD FT TESMSMLETANRLLEVHVPDDTPFNETPEQAQIRESINSPPETEDAAPFEQ FT WEIALILASLKNNKAPGFDLLEVRVLKAAIKAIPHHFLRLFNACLEHGVFP FT RAWKQASLIFLPKGGKDSNDSKSYRPISLLPVTGKLYERLVKRRLSDTALG FT PDMISDRQFGFRAGMSTEDAIIELRRLTAASPKKQVAALLFDVKGAFDCIW FT RPAILQSLKEKNCPKNVYKLLVSYFEDRQAQVVWGTNQVSKQATRGCPQGS FT VLGPSGWNLGFDPLLRSLEQGVVTEGGGKLPINFVAYADDLAVLVEGDSRA FT EIEKVGKAVVKHIVEKCSAIKLEVSESKTVGIFVKKPKVVGSKAVKINRKD FT CRKGGARNPKIELGGKSISFEQSVRYLGVHFDANLGISAHCKYLREKLVPL FT FSDLRKLAQCQWGLGHRALETIYKGVFVPTVCYASAAWYKEGAHTDRILED FT LHRQILIAITRCYRSTSYEAACVLAGTLPICIQLRVSMAKYHLRKGEDAEI FT GGVVIRHDPEGLKENYNRVLEVANEMWQARWEASEQGNATRELFFPDVVDR FT VKSDWIRPDHFTSQVLTGHGYFNEKLHQLSLAKAAACLCCGEPDNNLHFLL FT ECPAFAEFRDELITPISGGLEAPEATLMLVSSPEGFAALKEYSRVAFECKR FT QLENALTESDEGLSSESEE" XX SQ Sequence 7071 BP; 1834 A; 1776 C; 1869 G; 1592 T; 0 other; ggttgtttga agtactcaag gcaacagttc ttgttgcagg gtatggatcc ttctggaagt 60 gccaatgata cagcgtctac gtcgcgctag atatcaacat ctctcccaca agttgggggg 120 ccctttccaa agtgaagatt gggccgcgag atggccaata gtatcacggc taagttcgaa 180 cgcgcaaaga tggggcagtg gaatcacctt atgagaacag gacaagattc ccctgtatgt 240 gtcagagctg caggctcggt attgtatagc ataaaggatt acttggtggt attagcaaat 300 acgaaattcc gggagcgctg agggttctcc cggggaagcg ctcccctaaa gaagtgtgga 360 aggaaaaatt tttcctgcag aagcactctc gatgatcggt agatctattg ttatgccgta 420 tcgagcttgt aagcctgggt caagggcccg actcaggttg cgagcccggt ggcccaattg 480 tacacgaacc cgcccgttcg aacacctcaa ccgttgatat acgtgagcct cctttgggta 540 agtgctacga taatcaacgg ttttcgaaag cgaatctgcc tctggcagcc caaactcccg 600 tgcgtctctt gaggcgcacg tatccccttc cttccttcct tccttgcgct cccaaacgcc 660 cgcgtaactc cagtgagtaa cgtgtggttt cttctaggcc gggtaggaat ttattagatg 720 ataggtttta gataggatag aattagctct ggcactctgt gaccacttac ccccgtgggt 780 aacgccgtaa ttatctcgcg cggtcgcctc gtcggctcgt ggcggctggg aactgcaagc 840 gcgtaagcgc gagcggaact cggctgctgg gggtcgacgg cgcgatagta ggcctctccg 900 atctagtctt ctttccctcc ttcttccacc tcgcgtggct tgctctctgc atcttgactc 960 ttgtcggggt gtagggggca ttttcacgtt ggggactttt cgagatcgaa agcaatgcgt 1020 gtggaatcaa aaagcaatgc gtgtagaatc aagaagcaat gcgtgtcagt tgatacttag 1080 tcttaaattc acttgcgtcc gtaagacgag ccgaccatta gggtcggcaa gtcacgttag 1140 gaacttttcc aaattccgaa aagcaatgcg tgtcgattga caaaatagag tttagcttca 1200 ctagcatcca taagacgagc cggccatttg ggccggcaag tcacgttaag gactttttca 1260 aattcttcca tttgagaaag caatgcgtat caactgacaa caaaatcata gccggccatt 1320 agggccggca agtcacgtta gggacttttt catattttcc acgcgcccca agagtagaat 1380 caccccgcga acataaactc aacttcctct caacttcctc tctctctttt ctatactttc 1440 acgcacacat acacactcac aacaaacaca aaacacgatg caggccgcgc atctgggcca 1500 gtttcgctga ttcagcgtta gggtgagacc aggtaacttt gcaggcatcg ttcacataca 1560 cacacacaaa caatataacc aactaaccac acgatacagg ccgcgcactt gggccagttt 1620 cgctgtcaca gcgttagggt gagaccaagt aactttgccg gcatcaacca cgaacacaca 1680 tactaacact ttcgacgcag gccgcgcact tgggccagtt tcgctgtcac agcgttaggg 1740 cgagaccaag taactttgcc ggcataaaat aactcctttt cttaacttag actcttctcc 1800 tctaaactta gctaaatcag gcacgcgagc gagcgctctt ctgctatctg cgcactctaa 1860 tattaatccg aaaagggatt cgcgaccccc tagctcaagg cggcctgaga aagccaggtt 1920 cgacgccagt ttccccgtga caggcaatag aactcttagc aactaatgtg tcttccgtaa 1980 gtccgcgtgc gtgaccccat aggtaccgct atgttgcact gaaaatcatc tcgcgtaata 2040 agagctcgac gtgctcatgt gttgcgttgt ctttcctaag ttatctagct tgggccgaag 2100 gagggctagc cttctcggtc ttacagtgca actagccagg cttacatagc aatctcaggt 2160 actctttaag tcgaaaactc tagtcgcgtc cggcctgtca cgggaagaaa tagaatacac 2220 gaattaaagc cacgcgccgc gtatcgtgca gggatgggta aacagtctgc gcggaccatg 2280 taggttcacg acttcacagt cgctaggagg cggagccgta agccccgtgt aaaaccagag 2340 gcacctcctg ggccgcgtga tagtttggga gcccgggccg tcagttagcc agacggttag 2400 gctgctccgt aacagcacgt gtaaaagctt cgtcgccgga gcacttggtt ggctacctat 2460 ggggacgtgt aacggcacgg agtgctgccg cggctgctga aagttccgct ctctcttttg 2520 ttcgcgcttt ttggcgcatg gtaaccgcgc aggcgtccac ccagtgggaa acaaattcgt 2580 cctaaattaa ggcttctaac aaatatgctt attcctacag gtaaaaactc aaaatcacaa 2640 tgccaggtcc acaaaagaaa aacgttgctg aggtgcagcc agaggagagg tccccaacca 2700 ctccagggaa tcccctcgtc gccgagtcgg gccgtcttct gaccggtgcg agggaagacc 2760 ttgagcatcc ctcagtcagg acctggccgg ttaggagcgg tggcgggcta cgacggcttg 2820 gggaaggtga agtcggcctc cgagccgtgt ctgttagatt agtccgcgac gagcgcatcg 2880 aaaagatggc acggaagaca gtagaggtcg aggagccggg ctccaaaaac gagattaggt 2940 tcttgcaaat aaatgcggga ggggggcagt tagtaaacgc cgaaatttgc gaattaatag 3000 cgtcaaaaaa gatagacata gttctggctc aggagccata ctcgaaagtt aacaaagggt 3060 cgcgttattt cacaggcctt agacgtgcaa gtcgggcaat atgtctaaag agcgcaggca 3120 caaagacggc tcctaaagcc ttcgtagccg tgccaaaccc cgacttccac gccttcttcg 3180 tctcagcgtt aagcacccaa cactgcgttg ttgccgaggt gcatacgcct agcgtcacgt 3240 ttttcgccgt ttcaatgtac tttcaattct gcgacgatat tgaagtacac ctcgggcaac 3300 tagagaaagt attagagaac ctcagcggcc aaaaagtagt aataggcatt gacgcaaacg 3360 cggaatcctc gctttggtcc cctcgtggga caaacgagaa aggagaaaag ctcgagcgac 3420 taatcgcggc tttcggtctc cacgtagtaa acgacagaac ccaacctccg accttcgaag 3480 agaggggagt ttcgtcctac atcgacgtga ctctcgtctc ggggtccatg atcgcggagg 3540 tacagtcctg gaaagtgaaa cgggactgga cctccagtga ccataacgcg atagtcttta 3600 aaatcactac cgtagcccaa acggaccgag tggactccag tcgattcaac atcagacgag 3660 ctgactgggg cctgctcgac tctacgataa aggagttgtc tgtttcccac cttgaccaca 3720 ttatcttgga tagcgcagag gaggtcgagc gaatggccga tgctctccag aaagtcctgt 3780 acgaagcgtg cgaaaccgcc ataccgcgta ggcgccgtat ccggaaaaat aacccctggt 3840 ggactcgaga acttaccgac aaaaagtccg agctctacag cgctaggcgt agaatgcagc 3900 aacagtggag cctccctgga cacagtttgc gaaaagcaga atatcgggct ctcttgcgcg 3960 attactgccg atcggtgaaa ggggccaagg tcggcagctg gcaggaagtc gtcacagtgc 4020 gcggaaatga ggagccatgg ggggtagttt acaagcagct cagaggcaag ctgcaaaacg 4080 aaagaaccct cagttccgtt cggtgtgggg atacggagtc aatgtcgatg ttggagacgg 4140 ccaaccgtct gcttgaggtg cacgtcccag acgatacacc tttcaatgaa acccccgagc 4200 aggcacagat tagagaatca ataaactcac cgcccgagac cgaagatgcc gcacctttcg 4260 agcaatggga gatagctctc atcctcgcat ccctcaaaaa caataaagct cccggcttcg 4320 accttcttga agtcagagtc ttaaaggctg ccatcaaagc cattcctcat cacttcctgc 4380 ggctcttcaa cgcctgccta gagcacggcg tcttcccccg agcctggaaa caggcttccc 4440 tcattttcct cccaaaagga ggcaaagata gtaacgactc gaaatcgtac cgacccatca 4500 gtctccttcc ggttacaggt aaactctacg agcggttagt aaaaaggaga ctatccgata 4560 cagcgctagg accagacatg atctccgaca ggcaattcgg cttcagggct ggcatgtcta 4620 ccgaagacgc gatcatcgag ctgcgcagac ttacagccgc ttcccctaag aagcaggttg 4680 ctgcgcttct tttcgatgtt aaaggcgcct ttgactgcat ttggcgcccg gccatcctcc 4740 aaagcctcaa agagaaaaac tgtcccaaaa atgtatataa acttctcgtt agctactttg 4800 aagataggca ggctcaggta gtttggggga caaatcaagt ctccaagcag gcaactaggg 4860 gctgtccgca gggttcggtt ttaggaccct cgggctggaa ccttggattc gatccgctgc 4920 tccgcagcct cgagcaaggt gtagtgacag agggaggcgg aaaactccca ataaacttcg 4980 tcgcgtatgc ggacgacttg gccgtactag tcgaagggga ctctagggcg gaaatagaaa 5040 aagtaggaaa ggcggtagta aagcatatcg tcgaaaaatg ctcggctata aaattggagg 5100 tttcggaatc taagacggta gggatcttcg ttaaaaaacc taaggtagta ggttcaaaag 5160 cggtaaaaat aaaccggaaa gactgccgta agggaggagc gcgaaatccg aaaatagagt 5220 tgggcgggaa atcgattagt tttgaacagt cagtacgtta tcttggcgtg catttcgatg 5280 caaacttggg cattagcgcc cactgcaaat atcttaggga aaagttagta ccgctcttta 5340 gcgatttgcg taaactggca caatgccagt ggggtctggg acacagggcg ttggagacga 5400 tatacaaggg tgtattcgtc ccaacggttt gttacgcgtc cgcagcgtgg tacaaggagg 5460 gagcgcatac cgataggatt ctcgaagatc tgcacaggca gatcctcata gctatcacac 5520 gatgttaccg atcgacatcg tacgaggccg cgtgcgtact agcgggaaca ctcccgatct 5580 gtatccaact tagggttagc atggcgaagt atcacctgag aaaaggtgaa gacgcagaga 5640 taggcggcgt agtaattaga cacgacccag agggtctaaa ggaaaattac aatagggttc 5700 ttgaggtcgc gaatgagatg tggcaggcgc gttgggaggc atcggaacag ggcaatgcta 5760 ctcgcgaact tttcttccca gacgtagttg acagggttaa aagcgactgg atccgtccag 5820 atcacttcac ctcgcaggtt ctcacgggtc acgggtactt taatgagaaa ctccaccagc 5880 tctctttggc aaaggcagcg gcttgcctct gttgcggcga acccgacaac aacttacatt 5940 ttcttttaga atgccctgcc ttcgctgaat tccgcgatga gctgataact ccgatttcgg 6000 gtgggcttga ggcgccggaa gccacgctaa tgttagtttc ttccccagaa gggttcgcgg 6060 ctttgaaaga atacagtaga gtagcgttcg aatgtaagag gcaattggaa aatgccctta 6120 cggaatccga cgaaggatta agcagtgaga gtgaggaata gagtgactga ggtggtgggt 6180 gaaaggaaga gctgggtgaa agaatgtcta agctaggcaa acaaacgcga agtcaaaagc 6240 atgcttggct tggccctcgc gaaagtcgcc ttagggcttg actcgcgaaa taaactcgtt 6300 cgaaacttgg ccatcgtccg cgtctcacgc gtaaggcccc gtggaaggtc gctatatgct 6360 tgacacgcga gtgcatgctc gatcgtaaaa atttggccgt cgaccgaatt gaccatcttg 6420 ggtctaccaa agaaaaaagt aaattcaaga ataaacttgg tgtgcttgtg cgatactgca 6480 actaaaaagc acgcctctcg aacgaggagt gtagttaggg cctttagaac cttcgcctaa 6540 aacatcgtgg aggagatgtt gaaggtgtcc tgtccccaag cattgttcgc tggcgacaag 6600 ggctctggct gaaggattag aacagagccg ctcgttttgc agcggctcaa agcaggattc 6660 ttgtcctgtc cccgagcgca cctttcgaag aaggtgcgcc cggcataatt atttatttac 6720 taacaacatg tatttctcct aacaggtaca aacaaaatta gtggagatcc aggcgggatc 6780 tcgcgaatgc gccttcccgt ggttccccgt ggacggtccg gtggatggta gagcttgctc 6840 accatcccgc tatgactgac taaagcattc gtcccagttg actgattgtc cccgcacggc 6900 catcctcgga agaccgggcg ggtacaatct gttgatcgcc aatgggcact tgaatttttc 6960 caggaatgtt ctcctttcgg gtggttcgat ggatggtaga tggaaaacaa ggtcgcgtat 7020 gcttatggcg aggaagcgag tccaaataac atcagggcta accgaaatta a 7071 // ID Gypsy-3_BM-LTR repbase; DNA; INV; 178 BP. XX AC nscaf2937; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_BM_; KW Gypsy-3_BM-I; Gypsy-3_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-178 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 982-982 (2010). XX DR Genome; nscaf2937; Positions 529386 529563. XX SQ Sequence 178 BP; 64 A; 28 C; 37 G; 49 T; 0 other; tgtgggaata tttaggtgta accattggcc acaccgacct gacagatgac agatggtgca 60 tgaagtataa gtgacagatt gattaagacg aaataagaaa agacgtatat ttttgtgaac 120 tgatgattaa ataaaattta acaaattcgc taatttcatt gtgcaccgcg atcccaca 178 // ID CR1-11_BF repbase; DNA; INV; 3978 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-11_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-11_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3978 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3978 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1582-1582 (2009). XX DR [2] (Consensus) XX SQ Sequence 3978 BP; 1005 A; 1145 C; 882 G; 946 T; 0 other; ggaggtcaga ggtcatcccg ggcgggcggg aggccgattt tagagctcca tcactttcca 60 ctctttttag ttctttttag ttttgtttcc acactccatc cacacatctt tacatctaca 120 aacagccgta acatcagcct gaactttctg gaccccggac gctcttgggc ggcgcatttt 180 tagttctttt tcttagattt acatagcgcc cccctccccc gtctcccgga gcgtgctcga 240 cgccaccttt gccgaccggt gagaccgttc cacttttata tacattctat agtactgttc 300 tctatattgt atatcccata tttacgatgt aatttgcccc ccaccagcgt ggcgttcctc 360 ctgttgagta ggagagaacc cctgcaggtc aagcattatg acatgtttcg ttacattctc 420 aaccatcttg ggcagcattc tactaacagc ccactgctta gtgttaattg taaactctca 480 ccatgatccc tctcccttca ctcccaggca tgaaggcata ccacacaccc ttaaaccatc 540 ctttaacagt ccattcccgt taactcccct gctggacaac acatgtgcca atcaccccct 600 gaagttaaca ttagtctcta tctttattaa ctctaatggt aaactagcac cccactgcag 660 acctaaattc actctgttca gcctcaccct gataatcctt ctggctggag atgtggagtt 720 aaatccggga cctagggcac cgaagtaccc ctgcggcgta tgccatcgag ccgtcaggtg 780 ggaaaaggtg gacggccggc atgccatctg ttgcgactca tgtgatgtct ggtaccatac 840 tgactgcatg ggtatgtcaa caccggtcta taataccatc aacaacccta acgtgtcctg 900 gatttgctgc tcctgcggcc tacctaactt cagctccagc ctcttcaaca ctacagtatt 960 cgaaacttct aattgtttct cttcttttaa caccagcacc tgttctagtc cagcaagatc 1020 tatcggctcg cctatcgcta cgtcttcgcc tatcggatat ggtattactc caccgcctcg 1080 cgaccgcaca cataccaaac cagtcaggac ctgtgtgatc aactttcaat ctatccgcaa 1140 taaggtacct gaactgcatg cattttgcga agctgtgcaa ccagacattg ttgttggcac 1200 tgagacgtgg ctggactcgt cagtgggcag cagcgaaatc tttccggaca cgtacaatgt 1260 gttcaggaga gacagagctg gccggggagg gggtgtgctt gttgctgtaa agaacaacat 1320 catagcaacc caccacccag acccggactg tccctgcgag ttgacctggg tccgggtcca 1380 tctggctaac agtaagtcca tctacatagg ggcctactat cgccccccct cagctggtca 1440 tgatgatttt gtagcactag aaagatcagt gctgcaaaag cgggctaata acaacaatgc 1500 ccacatctgg ctagcaggtg acttcaacct gcctgatgcg acctggaacc ctgaagctaa 1560 taccaccgct tccaatccat ccactctcac tagcaattac atcagccttg ccaatgactg 1620 tggtctggaa caaatggtct gtgaaccaac tcgtactgtg gggcaaactt cgaacaccct 1680 ggatctgttt ctcaccacaa attccacctt agtggagaga gtcaaaatct taccaggtct 1740 cagcgatcat gacatcccgc tgattgatgt tcaagtaaag ccccaaacat caaactcaaa 1800 agatcggctc atatacctgt ggaggaaagc taacatcgat gcactccagc aggatatggc 1860 aacatacagt cgagagtttg cagaccaggc tcagcacaac actgcctcag aaaactggga 1920 gctgtttaag gcagccgtta gtggtgctgc caataagcat gtccccagga aaaaagtgag 1980 gccacaatcc aacaaacctt ggataactcc ccagattcgc aaagctatgc gtaagcggtc 2040 tgaacttttt tccaaggcta ggagatccaa ctccaatgaa gcctgggcaa agttcaagag 2100 gtgtaggaag ggcgttaagc agagagttag gaaggcccat agggattatg tctccaatta 2160 cctggagtct aacatcgagg ataatcccaa ggccttttgg agctatgtca aatccatcag 2220 acaagactct actggtgcgt ccgccctaag acatcagggg gtgctaacat ctgaccccaa 2280 agaaaaggct gatgcactgg gggcccagtt tgagtcagtg tttaccaggg aggataagac 2340 caccgttcct acccttggcg aacccaaagc tccaactatc ccctccctta acatcacagt 2400 ggagggggta gccaaacagc tctcctgtct taaccccagc aaggccacag gaccagacgg 2460 attaccacct cgcctcctga agaccgtagc cgaacaaatt gcgccaatat tacaggtcat 2520 attctcccag tcaatatcca cgggagatgt tcctgaagat tggagaacag ctaacatcgc 2580 tccaatcttc aaaaaggggg acagaacctt gccatcaaac tataggccgg tgtctctgac 2640 atcggtctgt ggtaaagtgc tcgaacacat cgtgcacagc cacatgatga aacatctaga 2700 cgctcatggc attctgtccc ctgcgcaaca cggcttcagg aagggtctgt cctgcgagag 2760 ccagctggtg cttaccctcc aagacctggc caagaacatg gaccagaaca aacaggtcga 2820 tgccgcggtg cttgacttta gcaaagcatt cgatactgtg ccacacgaac gtctgctgag 2880 caagctcgag cactatggta tttctggcct ccttcagtct tggcttaggg ccttccttac 2940 ggagagaacc cagagggttg tctttgacgg tggaacatct aaagctgtca aggtcacctc 3000 tggagttccg cagggtactg tgttaggccc tctgctattc ctgctctaca ttaatgacct 3060 acccgattcc gtagactcac atgttcgact gtttgctgac gattgcttga tttatcgaac 3120 tatcagcaag ccttctgacg cacaagggct gcagtctgac ctcgatgcgc taacagaatg 3180 gcaaaatcgt tggctcatgt cgttcaaccc ctctaaatgt cacatcctcc atatcacccg 3240 taaaaagcac cccatcataa ctcagtactc cctctgtgga gaagccctta ccggtgttaa 3300 gtcccacccc tatcttggag tacaactgtc cgacgatttg cggtgggaca cccatatcaa 3360 ccacgccact agcaaagctg gtaaggtcct aggagtcatt cggcgcaact tgacccattg 3420 tccatctagg gtcaaggcca cttgttacaa agcgctggta cggcctcact tggaatatag 3480 tgccatcgtc tgggacccgt acaccaacaa agggatacag gcggtggagg ccgtccagcg 3540 cagggcagcc cgagtgaccc taaatgacta ccggcagact agcagcgtga cacaaatgct 3600 ttcagacctg cagtggcgcc ctctctccga aaggaggagg aacgcccgcc tcaccttttt 3660 ctataaagta gtcaacaata atattaacat agacgccagc aatattctga agcctgccca 3720 agggcgcacc cggggtagtc acgacttcaa atatcaacac atatttgcac gcaccgatat 3780 ctacaaacat tcttttttcc cacgcacaat tcccgagtgg aatgccctgc ctggcacggt 3840 tgtaagcgct cccaacgttg agcacttccg tgccaggcag gcggcctgcc cgccctaacc 3900 cgggagcctc agctcccccc cctacttttg cccctcgcgg ggtcatttgg gggtatccta 3960 tgcagatgca gatgcaga 3978 // ID ULTR-1-I_NVi repbase; DNA; INV; 343 BP. XX AC . XX DT 10-APR-2009 (Rel. 14.04, Created) DT 10-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE internal portion of a nonautonomous LTR retrotransposon, DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW ULTR-1-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-343 RA Bao W. and Jurka J.; RT "Nonautonomous LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 789-789 (2009). XX DR [1] (Consensus) XX SQ Sequence 343 BP; 91 A; 80 C; 118 G; 54 T; 0 other; tggcgcccaa ctgaattacc cacaggcgtg gataaggaag atttccacgg ttctccagga 60 gttccacgaa ggaggaggag acgaggaaga aacccccacc tacgaaactg actctcccag 120 gcgggaagag ggatcgagca gatcagctgc agcggcggca gcccagggca ctgcggatta 180 cgacatgcgg tcaagaagac gagtgcgaaa ggtgcgggga tgtacgaggt tggagacatc 240 gtcgaagagc tagtcggcga agaagagttc ctgcatttac cggcgatcct gcggtggatt 300 ggtagagtcg aaatcgggac gataatccga ctctagcggg ggg 343 // ID CR1-6_NVi repbase; DNA; INV; 3601 BP. XX AC . XX DT 08-MAY-2009 (Rel. 14.05, Created) DT 08-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-6_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3601 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(5), 933-933 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(2..241,226..570,543..2873,2866..3441) FT /product="CR1-6_NVi_1p" FT /translation="TYPREGCTGPRDGLSKQEELTAGYSGFSNAWTLTRII FT MSHRAWTVELFLLNCTIGPIDPSTSLCTTFLSKATTSRCHYWSVPLLVLML FT MKSLDAVFPGPQETALFLVRFGSSDDASRVLKNRKLLPSSVEATADHTAYQ FT RDLYRKLKLKASLHNTSHPDSLKKVVYVKGTPTLVDLRRTSTAPLSEKPSK FT NLQASETFKKLAGILKPVNKLSLYYQNAYSMYGKLHLFKAKLFQLSILPDI FT VVITETWLQDCVLSSELGMQQFDVFRCDRDLSSHGVQRGGGVLIGVKKFLQ FT AELVHVDNVIEQLIIRISLPSHKLIVAASYLRPDSSVQVFSSHVSNLEDAN FT IKFPDHRMVLLGDFNLHHITWSSDTSAYSPMAYLSPALRSAADIISGYSTS FT MDLVQHFKHHPKKGYSLDLLFAQKDFISPMDFNEDILPCDDHHVPGMFSCN FT ISVASSASCNLKFCKRKFYSADFEAIRARLSEEDWATVLYSFSDDPDSGIA FT RFYEILNGVIDQHVPITKQSLSSYPKWYDSELINAICKKKIAHSTWKSSNS FT KRDEIEFKRLRAVCVRLSRSKYRAYISSVELGLKRNMRTFWSFIKNLRNDS FT SIPSKMYHGPTRAASDVEVSNLFSGYFSSVYRTDSLREFESNDEPLVSLSN FT IVLSIETLRSIVANLDDNINGGPDGIPPYFIKRCWSLLERPIVHIFESMLR FT SGYFPTFWKYSFVVPIYKNGDRHSVANYRPISTLSCIPKILDAYLASELSN FT SLLSKVAQQQHGFISGRSTLTNLLVFNDFLSESLCNHEQTDAVYTDMSKAF FT DSVNHSRLISKLFNFGIRGKLLDLLRSYLTNRSQAVRVNGAVSSPVDVPSG FT VLQGSHLGPLLFCLYVNDLVPKLSAAQVLLYADDIKLFMRVRSADDSVKLQ FT RDLDTLVAWSVENGLSLNANKCSVISFSRSKSLLLFNYCINGSRLQRVDVV FT KDLGVIFDSSLTFITANQQTDVVVSRSLRMLGFIKRSTVDFSDPSAIIYLY FT KTLVLPHFLYCPQIWRPHSQYLINKLESVRHSFLRYIAFKMGKPLHRFCHD FT YSEIAEGCDIXTVDSTFYLHDCVMTFKILRGLTNCERLSDLFVSRTVAYPL FT RNLRDLREGTFSSNLGFYSSINRMKRQWNSIPNNVQSKESVGQFKSSIRSL FT SLAY*" XX SQ Sequence 3601 BP; 908 A; 802 C; 764 G; 1126 T; 1 other; tacgtaccct cgagaaggat gtacaggacc tcgagacgga ttgagtaagc aggaggaact 60 gacggctggc tacagtggct tttccaatgc atggacgtta acccgtataa taatgtctca 120 cagagcttgg actgtggagc tttttctgct gaactgcacg ataggaccta tagatccttc 180 aacgtcctta tgtacaacgt tcttgtcgaa agcaacaact tctaggtgcc attactggtc 240 ttaatgttga tgaaatctct ggacgccgtt ttcccaggac ctcaagaaac agctcttttc 300 ctggtacggt ttggctcgag tgatgatgct tctcgtgtcc tgaagaacag aaagctgctt 360 cctagctcgg tcgaggccac tgcagaccac acagcttatc agagggatct atacagaaaa 420 ttgaagctaa aggcttcatt acacaatacc tcccatcctg acagtctaaa aaaggttgtc 480 tatgtcaagg gcacgccaac cctggtggat cttaggagga cctctacggc tccgctcagt 540 gagaaacctt caaaaaactt gcaggcatct taaagccagt caacaaactg tcgctttact 600 accaaaacgc ctattctatg tacggtaagc ttcatctctt taaggctaaa ctgtttcaac 660 tctctatctt gccggatatt gttgttataa ctgagacttg gctccaggat tgtgtcttgt 720 cttctgagct gggaatgcag cagttcgacg tgttcaggtg tgatcgtgac ctgtcatcgc 780 atggagtcca gaggggtgga ggagtcctga ttggagtcaa gaagtttcta caggctgaac 840 ttgttcatgt tgataatgtg attgagcaac tcattattag aatctctctt ccctctcata 900 aacttattgt tgcggcctct tatttacgcc ctgactcttc tgtacaggtg ttctccagcc 960 atgtatctaa ccttgaagat gcgaacatca aatttcctga ccatcgcatg gtactgctgg 1020 gggatttcaa tttgcatcac atcacctgga gctcagacac ctcggcttac tcaccgatgg 1080 cctacctcag cccagcattg aggtctgcag cagatatcat cagtggctac tccacttcca 1140 tggatctggt ccagcacttc aagcatcatc caaagaaggg atactctttg gacttgcttt 1200 ttgcccagaa ggatttcatc tcaccgatgg acttcaatga agacattctg ccctgtgatg 1260 atcaccatgt tcctggcatg ttctcgtgta atattagtgt agcgtcgtct gcgtcctgta 1320 atcttaagtt ttgtaagcgt aaattttact ctgcggattt tgaggcaata agggctcgtc 1380 tttcggagga ggactgggcg actgttttgt attctttctc agatgatcct gactctggta 1440 tagccagatt ttatgaaatt cttaatggtg taattgacca gcacgtacct atcacaaagc 1500 agtctctcag ctcgtatccg aaatggtacg actcggaatt gatcaacgcg atttgtaaaa 1560 agaaaatcgc tcattccaca tggaaatcgt cgaattccaa gcgtgacgaa atcgaattta 1620 aaagattgcg tgcagtgtgt gtgaggttat ctagatctaa gtatcgtgcg tatatctctt 1680 cggttgaatt aggcttgaag cgtaatatgc ggacattttg gtccttcatt aaaaacttaa 1740 gaaatgattc aagtattccg tctaaaatgt accacggacc cactagggcc gccagtgatg 1800 ttgaagtgtc taatcttttt tccgggtatt ttagctccgt ttatagaacg gattctctgc 1860 gtgagtttga gtctaatgac gagcctctgg ttagtttatc taacatcgta ctgtccattg 1920 aaaccttacg tagcatagtg gccaatttgg acgacaacat caatggcggc cctgatggta 1980 ttccaccgta cttcataaag agatgttggt cattattgga gcgtcctatt gttcatattt 2040 ttgagtccat gcttcgctct ggctatttcc ctaccttttg gaaatactct tttgtcgtac 2100 ctatttataa aaatggagat aggcatagcg tagcaaacta tcgaccaata tccacgttaa 2160 gctgcattcc caagattctt gatgcctacc tggcatcaga actgtctaac tctctcctat 2220 ctaaagtcgc tcaacagcag cacggcttta tcagtgggcg ctccacgctt actaaccttc 2280 tcgtatttaa cgatttcctg tcagaatctc tttgtaacca tgagcaaact gatgctgttt 2340 ataccgatat gtcaaaagcg tttgactccg tcaatcacag ccgtcttatt tcgaaactct 2400 tcaactttgg tatacgcggg aaattgttgg atcttctgag atcctattta acaaaccgtt 2460 cgcaagcggt aagagtgaac ggggcagtgt cgtctcccgt ggacgtacca tccggagttc 2520 tgcaaggttc ccacctggga cctttattgt tttgtttata cgttaacgac ctcgttccta 2580 aactcagtgc tgcgcaggtt ttgttgtatg cggacgatat taaattgttt atgcgtgttc 2640 gttcggcgga tgattccgtt aagcttcaga gggatttgga tacacttgtg gcctggtcag 2700 ttgagaacgg actatctttg aatgccaata aatgctcggt gatctcattt tccaggagca 2760 agtcgttgct tctctttaat tattgcatta atggatctag gctgcagaga gttgatgtgg 2820 tcaaggatct tggcgtgatc tttgactcat cattgacgtt tataacagca aactgacgtg 2880 gtcgtgtcca ggtccctccg aatgctgggc ttcatcaaga gatccaccgt tgatttctcg 2940 gacccctctg cgatcatcta tttatataag actttggtac tccctcactt tctatactgt 3000 ccacagattt ggcgacctca ctctcaatat ctaattaata aacttgaatc ggtacgtcat 3060 tctttcctta gatatattgc ttttaaaatg ggtaaaccct tacatagatt ttgtcatgac 3120 tactcagaaa tcgcagaggg gtgcgacatc cyaacggtgg actcaacgtt ttatctccat 3180 gactgcgtta tgaccttcaa gattcttcgt ggtctaacta actgtgagag actgagtgat 3240 ctctttgtta gtcgaacggt ggcgtatccg ttaaggaact tgcgtgatct tcgagaaggc 3300 accttctcct ctaatctcgg gttttacagc tctattaacc gaatgaaacg ccaatggaat 3360 tctatcccga ataacgtcca aagtaaggag tctgttggtc aatttaagtc ctctatccgc 3420 tcgttatcgc tggcctacta gccagatata tgtatctccg tattattatt atttttgtac 3480 attcgaatct aatgaactat ttgtcacttg ttttatattt ctcaaaactt gtatttatta 3540 tagtattttt gtaaagggcg cttcgcccgt taaattatta aataaataaa taaataaata 3600 t 3601 // ID Gypsy-1_AC-I repbase; DNA; INV; 4405 BP. XX AC AASC02000433; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_AC_; KW Gypsy-1_AC-LTR; Gypsy-1_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4405 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02000433; Positions 32217 36621. XX CC Positions [3106-3579] - Integrase core CC 'CTCGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2605..4032 FT /product="Gypsy-1_AC-I_1p" FT /translation="MTVADSLSRSPVVSTEPEEFGEEIEAHVATIQMSWDA FT TDSKLDEIRNKSQQDPAISAAIQYTRSGWPKFIQNVKSEARDLYAVSSELS FT EVHGLLVRGNRIVIPKAMRGEMLSRIHDGHLGVSKCRERAKSCIWWPGLSS FT EIREVVSRCDHCQRKRPAQPSEPLTPTALPERPFQMVGADLCNYKGHNYLV FT LVDYFSRYLEVAYMPDTTSETVRLKNVFARFGIPELLVTDNGPQFVSDRFH FT KFAKGWGFKHTTSSPHFPQSNGGSERAVEEAKKALSQDDPFLALLIYRSTP FT VSPTGASPAELALGRRLRTTLPTLPSNLNQQVYDRAKIAARDAESKRKYKQ FT AFDRRHGAQNLPELNPGQLVLQKLENDKEWRGPAVVKERCAPRSYIIQSGD FT RAYRRNRKHLKPYVDPPSMPSSPVATPEPTPMSVPLPPPPAVQPVEPDVTP FT PSPQPAAVPASPAPITTRSGRVVRRPARYPD" FT CDS join(113..1084,1088..2605) FT /product="Gypsy-1_AC-I_2p" FT /translation="MLRYRTATKLSAETGAIQVSSLIYAMGRQAEQIFNTF FT TFPAPVEGNDPRDNFDTVLENFDAHFVPKRNLIHERAKFHARAQNSGETIE FT EYVRALYELSEHADFKDRDETIRDRLVLGVLDSELSQHLQLEATLTLKSAI FT ETARHFELVKGQVSSQRQLGAAVTVDSVRYGQRGYSSPAHPRGRGGRRTGT FT HSQHRGSHASPSTFSNCGNCGRGHAQNNCPARGKTCRKCGKRNHFATVCRS FT SSSSSSSSSSSSSSSSSSRNVNEITESETPSFFLGAVDRDSDSNVVTDVNF FT GEVSEPPWRVTLNLGGRATSFKIDTGADVSIMKAVYNRLSPRPKLKTTSAV FT LRSPGGVIRSVGEFIATTLHNNRNFAFRVFVLDDDTDCLPSRDAAVRLGLI FT SRDRIDSANLVFGDVGPKPIRCEPVKIILKEDAQPYSVNVARRIPIPLMDE FT VKAELDRMEAAGVIEKISAPTDWCAPMVPVRKRSGSVRICTDLKKLNLSVK FT RERFMLPTIDDILYKLSGSNKFSKLDATSSFWQLALDDDSAKLTTFITPFG FT RYFYRRLCFGITSAPEIFQRTMQECFMDDILLHTDVQKKHDKLKKKVFQRL FT KERGVRLNKSKCKFDKDEIDFLGHIISGKGVRPDPSKVSAITEMPEPENIT FT DLRRVHGMVNYLGRFVPNLSTVMKPLTDLLNHDAEWAWTPAQSSAFSDVKK FT LLSSAPTLAFFDMRKPVTVSADASSYGLGGVLLQEDRGDLQPVAYCSRTLS FT RAEKNYAQIEKELLAATWACEKFDRYLIGLPSFTLLTDHKPIVPLINSKEL FT NDAPVRCQRMLMWMMRYSGKAVFTKGKT" XX SQ Sequence 4405 BP; 1182 A; 1153 C; 1094 G; 976 T; 0 other; tggtgtcaga agtcagtgga taacgtgtca caagtgtcag aatgcctcat ttcaagcctc 60 ctcagccgtt tgatttttcg aaacctcacg agtggccgag tggcgacacc gtatgctacg 120 ataccgtaca gccaccaaac tgagcgcaga gaccggagca attcaagtaa gttcattaat 180 ttacgctatg ggacgccaag ctgaacagat atttaacacg ttcacgttcc ctgctcccgt 240 cgaaggtaac gatcctaggg acaactttga cactgtctta gagaatttcg acgctcactt 300 tgtccccaag cggaatctta tccacgaaag agcaaagttc catgcacgag cacaaaacag 360 tggggagacg atagaagaat atgttagagc cctgtacgaa ctttccgaac atgcagactt 420 taaggaccga gatgaaacca taagggatag actcgtccta ggagtactgg acagtgaact 480 gtcacagcac ttacaactcg aggcaactct tactttgaaa tcagccatcg aaacagcacg 540 ccatttcgag ctcgtaaagg gtcaggtcag tagccaacgt cagctaggtg cagcagtcac 600 agtagattca gtgcgatacg gccagagggg atattcgtcg cctgctcacc ccaggggtag 660 aggcggccga agaacaggta cacactcaca acacagaggc agccatgcat caccgtcaac 720 tttctctaac tgcggtaact gtggaagagg tcatgcccaa aacaattgcc cagcgcgtgg 780 gaaaacttgc aggaaatgcg gcaagcgcaa tcacttcgca acagtctgtc gcagcagcag 840 cagcagcagc agcagcagca gcagcagcag cagcagcagc agcagcagcc gcaatgtcaa 900 cgaaatcacc gaatcggaaa ctccatcgtt ctttttaggt gctgttgaca gagactcgga 960 cagtaatgtc gttaccgatg taaacttcgg cgaagttagc gagccgccat ggcgcgttac 1020 actgaacctt ggagggagag ctacatcctt caagatagac acaggggccg atgtttcgat 1080 aatgtagaag gccgtctaca accgtctcag tcctcgtcca aagctgaaaa caacgtctgc 1140 agttcttcgc agtcctggtg gagtcataag gagcgtggga gaatttatcg caactacgct 1200 gcataacaat cgcaactttg cgttccgtgt tttcgtcctt gacgacgaca ccgactgtct 1260 tcccagtcgc gacgccgcag ttcgtctggg actgatttct cgagacagaa tagactctgc 1320 caacctcgtg ttcggcgatg tgggacctaa acctatacgc tgcgagccag tgaaaatcat 1380 cttgaaagaa gatgcacagc catactccgt gaacgtcgca cgtcggattc ccattccact 1440 catggacgaa gtcaaagctg aactagaccg tatggaggcc gcaggtgtca ttgagaagat 1500 cagcgctcca actgactggt gtgcacccat ggtcccggtc agaaaaaggt ctggtagtgt 1560 tagaatctgc acagatctga aaaagctcaa cctatcggtg aagcgtgaac gattcatgct 1620 gcccaccata gacgacatat tgtacaaact cagcggatca aacaagttca gcaagcttga 1680 tgcaacttca tccttttggc agttggcact ggacgacgac tccgcgaagc tgaccacttt 1740 cattaccccg ttcggtcggt acttctacag acgtctctgt tttggaatca cctctgcgcc 1800 tgagatattt cagcgtacga tgcaagagtg ctttatggac gacatcctcc tgcacacgga 1860 tgttcagaag aaacatgata agctgaagaa gaaagtcttc cagcggctaa aggagcgtgg 1920 ggtcaggttg aacaaatcca agtgcaagtt tgacaaggat gaaatcgatt ttcttggcca 1980 catcatcagc gggaagggtg taaggccaga tccatccaag gtcagcgcca tcactgagat 2040 gccagagcca gagaacatta cagacttacg acgagtgcac ggcatggtga actatttggg 2100 caggtttgtg ccaaatctgt caacagtcat gaagccgctc accgatcttc tgaatcacga 2160 tgctgagtgg gcctggactc cagcgcagtc atctgccttc tctgatgtca agaagttact 2220 gtcgtcagct ccaacactcg ccttctttga tatgaggaag cctgtcactg taagcgccga 2280 tgcaagcagc tacggactgg gaggagttct cctccaggaa gaccggggtg atcttcagcc 2340 agtggcctac tgttccagaa ctttatctag ggcggagaag aactacgctc agattgaaaa 2400 ggaactgttg gcggccacct gggcgtgcga aaagttcgat cgctacctca tcggtcttcc 2460 gtcattcaca ctgctcaccg accacaagcc cattgttccc ctgatcaact ctaaagaact 2520 gaatgatgca cctgtccgct gccaacgaat gctgatgtgg atgatgcgct acagcggaaa 2580 ggcagtattc accaaaggaa aaacatgact gtcgcagact ccttgtcccg aagtccagta 2640 gtgagcaccg agcctgaaga atttggtgaa gaaatcgagg ctcatgtcgc tacaatccag 2700 atgtcatggg atgccacgga tagcaaactt gacgaaatca gaaacaagtc tcaacaggat 2760 ccagcgatca gtgctgcgat acagtacact agaagcggtt ggccgaaatt catacaaaac 2820 gtcaagtccg aagctcgcga cctctacgca gtttccagtg aactgagcga agtccatgga 2880 ctactcgtca gaggtaaccg catcgtaatc cccaaggcaa tgcgaggaga gatgctcagc 2940 aggatccacg acggtcacct tggagtgtcg aagtgtcgag aacgggccaa atcttgcatt 3000 tggtggcctg gtctcagctc cgagatacgg gaagtcgtct caagatgcga tcattgccag 3060 cggaaacgtc cagcacaacc aagcgaacct ctaaccccga cagctctacc tgagcggccg 3120 ttccagatgg ttggtgctga cctctgcaac tacaagggac ataactacct ggtactcgtg 3180 gactactttt cacggtactt ggaggtggcc tacatgccag acaccacctc ggagactgtc 3240 agactgaaaa acgtcttcgc tcggtttggc attccagaac tactggtgac tgacaacggt 3300 cctcagtttg tgtcagaccg tttccacaag ttcgccaagg gctggggatt taaacacacg 3360 accagcagtc ctcattttcc ccagagcaac ggcgggtcgg agagagcagt cgaggaagcg 3420 aagaaagccc tgagtcagga cgacccattt ctcgcactcc tgatctacag gtctacaccg 3480 gtgtcaccga cgggggcgag cccagcggaa cttgctcttg ggcgacgcct gaggacgacc 3540 ctgccaactc taccgtcaaa cctgaaccag caggtctacg acagggcgaa gattgcagcc 3600 agagacgctg agtcgaagcg taagtacaaa caagcttttg acagacgtca cggagcccag 3660 aacctcccgg agctgaatcc tggtcagctc gtcctgcaga agctggagaa cgacaaggag 3720 tggcgtggcc ctgcggttgt caaagaacgg tgtgcaccga ggtcttacat catacagtct 3780 ggggataggg cttaccgccg gaatcgcaaa cacctgaagc cctacgtcga ccctccttcc 3840 atgccttctt cacctgttgc caccccagag cctactccaa tgtccgttcc cttacctccg 3900 ccaccagctg tccagccggt ggagcctgat gtcacaccac catcacctca gccagcagcc 3960 gttccagcca gtccagctcc aataactaca cgaagtggac gagtggtcag gagacccgct 4020 cgttacccgg actgatgttc ttgcgaccca gtcgtatcct ttcgatgaca aattttactt 4080 tgtgtttatt ctttgctgat tacccaacag ttgtgtagac ttacctgttc gaagtaattt 4140 attttctcaa gttcgttatc gggacgctat gctgttctat gaatgtgaca aagacaattt 4200 gaaatgtgga tattatgaaa taccaaagaa ctttattgaa atgtgttaca aagttctcaa 4260 gtgaaagccc ggaatcttca aagacaattg aactatgtgt gtgaattttg ttaaagttag 4320 aacattcccc tttgagagac tcgaagccga taatctttaa tccattttgg ggtttttcct 4380 ttttcttcaa tattaaaagg ggaga 4405 // ID I_Ele39 repbase; DNA; INV; 6583 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele39. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6583 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6583 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >97% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 382..1722 FT /product="I_Ele39_1p" FT /translation="MASSATAPRGGIFEQHSRSNVPQWMLGSDDIGQVMVF FT VLRRKLDKSNVSENERNQNAALPNPFVVGASIRQVIGVKEASSIETTREGR FT GSRYLLRTSSRSIAEKLTKITKLSDGSDIEIIAHPTLNTVQGVVFEPDTIN FT VDEEKIQKELESQGVHTVRRIKKRINGKLINTPLLVLSISGTILPEYVYFG FT LLRIPLRQYYPSPMMCFHCGHYGHSRKFCQQTGICLRCSTSHDILDGEQCE FT NDPKCLHCKGGHSVSSRDCPKFKEEEKIIRLKIDRRISHAEAKRICSLESR FT TEGFKNVVQDQIQQELAMKDQLIASLQQQVATLVKEIESLKKILKPKPKDL FT PSATQDEQHATLQSAMDSPVVTPASSQSSLLKPNRLSRKDKESTVPPGKPP FT GSRCSSRPSYDVQTRSRSGKRHYDISPTDTDKHKGKRISVPASTSNKPIDI FT DE" FT CDS 1725..6497 FT /product="I_Ele39_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MPRKTQRNTLTTDFSDPATIMDKSITFPNYSHELRLH FT PTPATGKQSLKRFEDXESVVLSSLPEEEARLQSKVIQDVFLQPVEHQAFSP FT AVMYNKPKDLPHSAKINSTAYDQFRHISVIPSTSKTQIFGYYESPSARPPT FT STLTGDVPVQPDEPPAAVDVDHQTLPASITTIVSYDQSDHIQSFNNTAQPK FT PNGMNHFEIDRKTFPDCSHELRLHPMPATHKLSLKRFEDDESVAFSSLHEE FT EARLQSKVIQDVSIQPVEHQALSSTMRINTLKVLPHSAKINSTAYDHFRNI FT SVKPSTSKTQIFGCYESPSARPPTSTLTGDVPVHPDEPPAAVDVDHQTLPA FT SINSIVPYDLSTHIQSFNNTAQPRVNGLEYPNNSSLDDESLSETRSACSHE FT PEKRMLFFQWNIRGLWCNHTELCHLLLDNPPIVVSLQEMMTKNVNNALRKK FT YTWNSVNRHYSQGAGAAGLGILSEVPHNFIKVNCPIPVCVARLLNPYNITV FT VSIYVPPNSPDSHMLETLNSIKGNTDPPYIIGGDFNAAHEAWGSIKSSSRG FT LKLLSWIVDNNMVSLNNGEPTFISSSHGSTSSIDLTIVSNSIASRLSWTIA FT KDTHGSDHFPIRVYSQERHPTEQHRRRWLYKNANWPTFENSLLDVLQPNKD FT ISIDELNNAIIKSAEASIPRTSGKSIGKTEIWWTDEVQKVVKARRKALRKM FT KRTAPSDPSYESIRKSFQASRAAARKSIVQAKQESWTNFCSSFNPRTPSDV FT LWNNFNRLNGKRKKGVKGLVIDGAFVQDPPIIAERFANYFQQASTTPAISA FT SASNGETMLISSTESNSELDKEFSLDELLRAIDSAKGYSTGKDNIGYPMIR FT HLPISSKYAMLKCFNRIWAEGKFPQQWKEGIVVPVPKPGTTQNTIENYRPI FT TLLSCIGKIYERMINHRLITFLEKHNILHPNQHAFRSGRGTSSYFSDLKEI FT LENAKQSKNHLEFALLDIRKAYDQTWRPHILQQVNRLSVGNCMKSCISNFL FT KDRLFRVCYGSSLSTQKPQDNGVPQGSVLAVTLFLLAMNSVFEAVPKNVRI FT LVYADDIVLVASSKQISTVRHHLANAVSAVNTWAKQVNFSLSATKSCLLHV FT CQRXSHRGSRSLPAIVVDGEAIPEVNSARFLGVWINRRGQFSTHAAKTKQA FT LRNKVNFIRALSPKASRSTLWKISNAVCISKLTYGVELFGDGITALLRPIY FT NELLRISSGALRTSPTLSLATEAGELPFDLRLAELLVRKYCKIIEKTNKNY FT PYFHETVDRILNQLIEHNTPCITKLRRVGRRPWNFPRIRVDWSIKREFKKG FT QNPGIARMLTSNRMDAQYAKHTKIFTDGSKTSNEVGVGVIGPNIYVERSLL FT PQCSIFTAEAAGLLFAARAAPRTPTVILSDSASCLSAIEKGKSTHPFIQEF FT EYVAADKNLVACWIPGHSGITGNEQADNAAERGRSSRLLYRSIPAADAIQW FT IKHQLHAKFQEEWANNSVTFLQRCKPTVEKWIDRENRLEQKTLTRLRIGHT FT KMTKKHLFDNTFSPICDVCNTNLTVEHILISCRKFDLVRDELGLSNNLQII FT LSNTKENESKLLKFIKKCNLSDKI" XX SQ Sequence 6583 BP; 2106 A; 1596 C; 1294 G; 1585 T; 2 other; attcgtattc gggatacatt cgtataagag cacgcgcgga ttgctcccgt ttatattttt 60 gctaattctg ctgttataag ctacaaactc atcgttatac tttatttaag tgaaaccaac 120 accaaatcac atgccggaga agtgaagttt atgtgatttt tggcctcaac atagtgaaat 180 aacattaaaa acattgtgtt aatatcacgc gactagagtc gccattgtcg aagtacctac 240 accgtctcac attcgctata gacaacaacg caataaaaat ccgagtgata ggttgtatcc 300 acattaggat ctatttcgtt caagtgcagt gtcgttaaac tgaaagctat taagtgttag 360 tcgccagcta gctacggcct tatggccagt agcgcaaccg cccctcgggg tggcatattc 420 gaacagcaca gtaggagcaa tgtgcctcaa tggatgctcg gatcagacga tattggtcaa 480 gttatggtgt ttgttctgcg ccgtaagcta gataaatcga acgttagtga aaatgaacga 540 aatcaaaacg cagctctgcc aaatcctttc gtcgttggcg catctatccg acaagtaatt 600 ggtgtgaagg aagctagttc aattgagacg acccgagaag gaaggggttc tcggtattta 660 ctccggacca gctctagaag catcgccgaa aagctcacga aaattaccaa attatcagat 720 ggttccgata tcgaaattat tgcccacccc accctcaaca ctgtgcaggg agtagttttc 780 gaacccgaca ctatcaacgt tgacgaagaa aaaatacaaa aagaactgga atcgcaagga 840 gtccacacag tacgccggat taagaaacga attaacggga agctcataaa tactccgcta 900 ctagtcctat cgatcagtgg tactatactg ccggaatatg tctacttcgg tctactccga 960 attccactcc gacagtacta tccatctcct atgatgtgtt ttcactgcgg tcactatggc 1020 cactcacgga agttctgtca acaaacagga atctgtctac ggtgctcaac atctcacgac 1080 attctagatg gagaacagtg cgaaaatgat ccaaagtgtt tacactgcaa gggtggacat 1140 tcggtgtcat ctcgcgattg cccaaaattt aaggaagagg aaaaaataat cagattgaaa 1200 atcgatcgtc gcatctcaca cgcagaagct aagcgtattt gcagccttga atcaaggaca 1260 gaaggtttca aaaatgttgt tcaggatcaa attcaacagg agttggcgat gaaagaccaa 1320 cttatcgcat cattgcaaca acaagtcgcc actctcgtca aggaaattga gtcgttgaag 1380 aaaattctga aaccaaaacc caaagatctg ccctccgcta ctcaagatga acaacatgca 1440 actctacagt ctgcaatgga ttccccggta gtaacgcctg cctcctcgca atctagtttg 1500 ttgaagccaa accgcttatc ccgcaaagat aaagaatcca ctgttccacc tggaaaacct 1560 cctggtagcc gctgcagtag tagaccttcc tacgacgtcc aaacccggag cagaagtgga 1620 aaaaggcatt acgacatttc tccaactgat accgacaaac ataagggtaa acgaatctca 1680 gtaccggcga gtacaagcaa caaacctatc gacatcgacg aataatgcct cgaaagacac 1740 aaagaaatac tctgactacc gatttttctg accccgcaac gataatggac aagagcataa 1800 ccttcccgaa ttactctcac gaactacgac ttcatcccac gccagcaaca ggcaagcaat 1860 cattgaagcg gtttgaagat gasgaaagtg tagtactctc ttcacttccg gaagaggaag 1920 cgagactgca aagcaaggtg atccaggacg tctttctcca acctgttgaa caccaagctt 1980 tcagtccggc cgtgatgtac aacaagccaa aggatctacc acactctgcg aaaatcaaca 2040 gtacggcata cgaccagttc cggcacatca gcgttatacc ttccaccagc aagacccaaa 2100 tattcggata ttatgaatcg ccaagtgcga ggcctcccac gtcgaccctg accggagatg 2160 ttccggttca acccgatgag cctccggcgg cagtcgacgt ggaccaccaa accctaccgg 2220 caagtatcac tacaatagtt tcttatgacc agtcagacca catccaatct ttcaacaata 2280 cagctcaacc aaaacctaac ggaatgaacc atttcgagat cgaccgcaaa actttcccgg 2340 actgctctca cgaactacga cttcatccca tgccagcaac acacaagcta tcattgaagc 2400 ggttcgaaga tgatgaaagt gtagcattct cttcacttca tgaagaggaa gcgagactgc 2460 aaagcaaggt gatccaagac gtttctatcc agcctgttga acaccaagct ctcagttcga 2520 ccatgaggat taacactcta aaagttctac cacactctgc gaaaatcaac agtactgcat 2580 acgaccattt ccgaaacatc agcgttaaac catccaccag caagacccaa atattcggat 2640 gttatgaatc gccaagtgcg aggcctccca cgtcgaccct gaccggagat gtcccggttc 2700 atcccgatga gcctccggcg gcagtcgacg tggaccacca aaccctaccg gcaagtatca 2760 attcaatagt gccttatgac ctgtcaaccc acatccaatc tttcaacaat acagctcaac 2820 caagagtcaa cggactggaa tatcccaata attcatccct cgacgacgaa agcctaagtg 2880 aaacgcgcag tgcatgcagc cacgagccag agaaaaggat gctgttcttc cagtggaaca 2940 tccgtggttt gtggtgtaac cacaccgaat tgtgccatct tttgctagac aaccctccta 3000 tagtcgtcag cctacaagag atgatgacta aaaacgtaaa caacgccctt cggaaaaaat 3060 acacctggaa ttccgtcaat agacattact cccaaggcgc tggtgcagct ggtctcggaa 3120 tactttccga ggtaccccac aactttataa aagttaattg tccaattcct gtatgcgtag 3180 ctcggctgtt aaacccatac aacatcacag tcgtctccat ctacgtccca ccaaacagcc 3240 ctgattccca catgttagaa acactgaaca gcataaaagg caacacggat ccaccctata 3300 tcatcggtgg agactttaat gctgcccatg aagcttgggg tagtatcaaa tcgtcaagta 3360 gaggactaaa actcttaagt tggatcgtcg ataacaacat ggtctctctc aacaatggtg 3420 aacccacctt cattagctcc tctcacggaa gtacatccag tatcgactta acaattgttt 3480 ctaatagcat cgcaagtaga ctctcctgga caattgccaa agatacgcac ggaagcgacc 3540 attttcccat tcgagtatat tcacaagaaa ggcaccccac agaacaacac cgtagaaggt 3600 ggttgtacaa aaatgcaaac tggcctacgt ttgaaaacag cttactcgac gtactgcagc 3660 cgaataaaga catttccatc gacgaactca acaacgctat cataaaatca gctgaagctt 3720 ccattcctag aacgtcagga aaatctattg gaaaaactga aatttggtgg accgacgaag 3780 tacagaaagt ggtcaaggct cgccgtaagg cactacgaaa aatgaaacga actgcaccat 3840 cggatcccag ttatgaaagc atacggaaat cgttccaggc cagtcgcgct gccgccagaa 3900 aatcaatagt ccaagcaaaa caggaatcct ggactaattt ctgcagtagt ttcaatccac 3960 gcaccccatc cgatgttttg tggaacaatt tcaatagact taatggtaaa aggaaaaaag 4020 gggttaaagg gctcgtcatt gatggggcgt tcgtacaaga tcctcccata attgctgaac 4080 gttttgccaa ctatttccaa caagcatcta caactcccgc aatctcagct agcgcaagca 4140 acggagaaac tatgctgata tcttccactg aatcgaattc agagttagat aaagagttct 4200 cactagacga actcttacga gctatagact ccgctaaagg atattccacc ggtaaagata 4260 acatcggata tccgatgata cgtcatttac caatttccag caaatatgct atgctgaaat 4320 gtttcaaccg catttgggct gagggaaagt ttccacaaca atggaaggaa gggatagtag 4380 ttcctgttcc aaaacctggt accacccaaa ataccatcga aaattatcga cctatcaccc 4440 tccttagttg cataggaaaa atctatgaac gtatgataaa tcatcggctg atcactttcc 4500 tggaaaaaca caatattctc catccaaacc aacatgcttt tcgatctggt cgtggaacct 4560 catcatattt ttccgatcta aaagaaatcc ttgaaaatgc caaacaatcg aaaaatcacc 4620 tcgaattcgc tttattagat atcaggaagg cctacgacca aacttggcgt ccacatatct 4680 tgcaacaggt gaaccgtttg agtgttggaa actgcatgaa atcttgtatt tccaattttc 4740 tgaaagatcg actttttcgt gtttgctatg ggagttcact atcaacccaa aaaccccaag 4800 acaatggcgt tcctcagggt tcggttcttg ctgtaacgct gtttttgcta gcaatgaatt 4860 ccgtatttga agcagtaccc aagaacgttc gaattctagt ttatgctgat gatatagttt 4920 tggtcgcatc ttccaaacaa atctcaacag tacgtcacca cttggcaaat gcagttagcg 4980 cagttaacac ttgggctaag caagtgaatt tttcactctc ggctaccaaa tcctgcctct 5040 tacatgtgtg tcagagakca agtcataggg gatcaaggtc gctgccagca atagttgtag 5100 atggtgaagc tattccggag gttaattctg ctcgcttcct tggagtctgg atcaatcgta 5160 gaggacaatt ttccacgcat gccgctaaaa ctaaacaagc attacgtaat aaagtaaact 5220 tcatcagagc tttatcgcct aaggctagca gatcgacact ttggaaaatc tccaatgcag 5280 tgtgcatatc aaaactcacc tatggtgtag aattgtttgg agatggaatc acggcgctgt 5340 tacgaccaat ctacaacgaa ctattaagaa tctcttcagg tgctcttcga acctcaccaa 5400 cccttagtct ggccaccgag gcaggtgaac tcccattcga cctacgtctt gcagaattgt 5460 tggtacgcaa atattgtaaa atcatcgaaa aaaccaacaa aaactatccg tacttccatg 5520 aaaccgtaga tcgcatattg aatcaactaa tagaacataa tactccgtgt attactaaac 5580 tacgacgagt tggacgccgt ccatggaatt ttccccgaat tcgagtcgac tggtctatca 5640 aaagagagtt taagaaagga cagaatcctg gaattgctag gatgttaaca tcgaaccgca 5700 tggacgcaca atacgctaaa catacaaaaa tcttcacaga cggctcgaaa acatctaacg 5760 aggttggtgt aggagtaata ggaccaaata tttatgttga aagaagtctg cttcctcaat 5820 gtagcatatt cacagctgaa gctgcaggac ttctttttgc agcacgagct gcccccagaa 5880 cacctaccgt tatcttgtca gattctgcca gttgcttgtc cgccatcgaa aaagggaaat 5940 ctacccatcc tttcatccag gaattcgaat atgtagcagc tgataaaaat ctggtggcat 6000 gctggatacc tggccattct ggcatcacag gaaacgaaca ggctgataat gctgctgaga 6060 ggggtcgctc aagtagactc ctatatcggt ccatacctgc tgccgatgcc atccaatgga 6120 ttaaacatca acttcatgct aaatttcaag aagagtgggc taacaattca gttacattct 6180 tgcaacgctg caaaccaaca gtcgagaaat ggatcgacag agagaaccga ttagaacaaa 6240 aaactctaac aagacttcgt atagggcaca cgaaaatgac caaaaagcat ttgtttgaca 6300 acaccttttc tccaatttgc gatgtctgta atactaatct tacggttgag cacatactga 6360 tcagttgtag gaaatttgat ttagttaggg atgagctcgg tctaagtaac aacttgcaaa 6420 taattctaag caatacaaaa gaaaacgaat caaaattgtt aaaatttatc aaaaaatgta 6480 atttatctga taagatttaa tccttaataa acagaggtga atgaaccgcg aggtttaaaa 6540 cctctataat taactcaaaa aaaaaaaaaa aaaaaaaaaa aaa 6583 // ID Gypsy-3_PPc-I repbase; DNA; INV; 4647 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PPc_; KW Gypsy-3_PPc-LTR; Gypsy-3_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-4647 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 998-998 (2010). XX DR Genome; chrUn; Positions 65608543 65613189. XX CC Positions [3418-3876] - Integrase core CC 'TATGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 236..1861 FT /product="Gypsy-3_PPc-I_2p" FT /translation="MEEATERMAEVLVAMQQMMAAQQAELKALRDQQAQSA FT TSSGDDSTKSRGPSVDSLEKQIRLFNYNADEGWTYEAWWTRHEGLFNSVKV FT DDKEKNLMLLRHVDDSVDRQFRDHIRPKKLEEMSFSEVQVVMTKLFGDKKT FT IFEKRLEMFNLKMSKVHIDDLREFATRVNRVVEEADVTQLTPDKIKTMIFL FT AGVDLPRHTGAMFHIINGMKKEENPNLEKILEIADTFKEAQHDSQTVTAQN FT RSQVNAIERKFKRSNERQNRKQNQSGGKCYRCGRNHEAKTCAHASTVCFNC FT EKVGHLAVVCRSPKTEKGKKKAKINMIFGEITTNESKFDVRMKMDGCEVTM FT GVDTGSDLTFISEKTWKRIGSPSMSEADAYAVCANGSSMDLEGKCMVTLGM FT NGITVHGSVYVTEKQTNLLGKDLIPFFFSLVPNKKQGANLNAVKADTGYAE FT MVKRDYPEICREGLGLCTKMKASLSLKSDSKPVFCKRREIPLALLTKVDDE FT IDRLLKLEAIEPVDYTDWAAPILVVPKANGKPRVCVDGSKRPSGGP" FT CDS 1716..3338 FT /product="Gypsy-3_PPc-I_3p" FT /translation="MMKSIDCSSWRRSSRLTTPIGQHQFWLFPRLMESHEC FT VSTGLNDRLEAHNHPLPLVSEIMTKLEGCTVFTQIDLSDAYLQIPVDDSSK FT KLLGISTHRGIFRYKRLPFGVSAAPGVFQKCMDTMLAGYKNASAYLDDIVI FT GGVTRGHHDENLKDVLDRLQEYGFRIRPEKCSFGKEKIRYLGFVMDKNGRR FT PDPEKVRAVREMPEPQDESSLRSFLGMANYYSEYIQDMYKLRVPLDKLLKK FT EVNWMWSAECAQAFKEIKSILSSDLNHVHFDPSKEVVLATDASEKGIGAVL FT AHRINGKLRPIAHASRTLKDAETRYSQIEKEGLGIIFGVLKFHHYLYGRRC FT VLQTDHKPLLAIFGSKTGVKIHTAKRLYHWSTLLLAYSFDMEYVNTESFGY FT ADALSRLISASRSDVEEDEDILGLKNVEKAVCKAVRNCASKMPVTVKDLQD FT ATDQDSVLQKVKEYHMSRWPDLKKLKLDRKDHGLLPFFHRKTDLCIVKGCL FT FLADKIIVPQSLQKKVLEMLHISRNSQNEGTGKTNLLLVWNGYSD" FT CDS 3286..4497 FT /product="Gypsy-3_PPc-I_1p" FT /translation="MKALARQTCYWYGMDTQIEQMVKECDQCAAACKQPVK FT IPLEPWPKSTEPWERIHVDYAGPVDGQYFLVIVDSYSKWPEVIMTSSMTAG FT VTVRILDEVISRNGIPRVLVSDNGTQFASEAFNKFLIERGIKHLYSPPYHP FT QSNGQAERFVDSLKRSLLKQKGERSIAEALQVFLFTYRKTPNAQCNGFSPA FT EVFIGRRLRSELQVCVPKTGGLNSNCHSDRMIDSAKEQFDRKNGVRPRKFK FT IGDVVLYRMHVVPNSYKWTKGVITAKIGKVMYEVQLEHRVIRSHANQLILR FT ESSRDDDLEVMEDSEDIFETMNLELIKSTIKLPEENPGDFGMNYLSDQSTV FT PNSPMGSIKEPDPEPVKEPGEESEAQAIAAEPTPVTVPTRKSTRTRKAPSR FT LDIDPSKKRY" XX SQ Sequence 4647 BP; 1370 A; 972 C; 1201 G; 1104 T; 0 other; tttggcgttc aggagtaaac cgaacttgtc ttcttgagtt cgtgtgatta gcaagtggca 60 aactgtgcct ggcggatcgc aacgagtggt attagtgaca aactgtgtct gaccactgag 120 aattaggcgc aaaccgtgcg catcgcaaac tgtgcgtgac tactgccaac cgggcgtgtt 180 gattgttgtt attatcgtcc ggaggttctc tccgaattta gtaattgcag tgaaaatgga 240 agaagcaact gagagaatgg cagaagtgct agtggctatg caacagatga tggcagcaca 300 acaggcagag ttgaaagcac tgagagacca gcaggcacag agtgcaacca gcagtggcga 360 tgactcgacc aagagtcgcg gaccctcagt ggactccctg gagaagcaaa tcagattgtt 420 caattacaat gctgatgagg gatggaccta tgaggcgtgg tggacccgcc acgagggcct 480 gttcaattct gtgaaagtag atgataagga gaagaacctc atgctattga gacatgtgga 540 tgattcggtg gacagacagt ttcgtgacca tattagacca aagaaactgg aagagatgtc 600 attctctgag gttcaagtgg tcatgacgaa attattcgga gacaagaaga ccatcttcga 660 aaagagactt gagatgttca acctcaagat gtcaaaagtg catattgacg atcttagaga 720 atttgcaacg agagtcaacc gagtggtcga ggaggctgat gtgacacagc tcactcccga 780 taagatcaag acgatgatat ttctggctgg tgttgacctt cccagacata ccggagcaat 840 gttccacatt atcaatggca tgaagaagga agagaatcct aatctggaaa agatactgga 900 gatagctgat accttcaaag aagctcagca tgactcgcaa acggtgactg ctcagaatcg 960 atcgcaagtg aatgcaatcg aaaggaagtt caagaggtct aacgagagac agaacaggaa 1020 gcagaatcag tccggtggca agtgctatcg ctgtggaagg aaccacgaag caaagacttg 1080 tgcacatgcg agcacagtat gctttaactg cgagaaagtc gggcatctgg cagtagtgtg 1140 cagatcgcct aagacagaga agggaaagaa gaaagcaaaa atcaacatga ttttcggtga 1200 aatcactacg aatgagagca agttcgatgt gagaatgaaa atggatggat gtgaagtgac 1260 catgggagtc gatacggggt ctgatttgac attcatatct gaaaagacgt ggaaaagaat 1320 cggatctccg tccatgagtg aggctgacgc ttacgcagtc tgtgctaatg gttcgtcgat 1380 ggacctagaa ggcaagtgta tggtcactct cggaatgaat ggtattaccg ttcatgggtc 1440 tgtctatgtc accgaaaagc aaacgaatct gcttggaaag gatctgatcc cattcttctt 1500 ttccctcgtt cccaataaga agcagggagc caatctcaat gcagtgaagg cagacactgg 1560 atatgcagag atggtcaaga gagactatcc agagatctgt cgagagggac tcggtctctg 1620 taccaagatg aaagcttcgc tcagtctcaa gagtgactcg aagccggtat tctgcaagag 1680 gagagaaatt cctctcgctc tattgactaa ggtcgatgat gaaatcgatc gactgctcaa 1740 gctggaggcg atcgagccgg ttgactacac cgattgggca gcaccaattc tggttgttcc 1800 caaggctaat ggaaagccac gagtgtgtgt cgacgggtct aaacgaccgt ctggaggccc 1860 ataatcatcc attgccacta gtgtctgaaa taatgaccaa gctggaaggt tgtacagtat 1920 tcactcaaat cgatctatcg gatgcgtacc tccagatccc cgttgatgat tcgagcaaga 1980 agttactcgg aatcagcacg catcgtggaa tttttcgtta caagagacta ccgtttggag 2040 tgagcgcagc tcctggagtg tttcaaaagt gtatggacac gatgctcgca gggtacaaga 2100 atgcatctgc gtatcttgac gacatcgtaa tcggtggtgt caccaggggt catcacgatg 2160 agaacctgaa ggatgtactt gaccgactgc aggagtacgg gttcagaatc cgtcccgaga 2220 agtgtagctt cgggaaggag aagattcgat atcttggatt cgtgatggac aagaatggtc 2280 gcagaccaga tcctgagaag gttcgtgcag tcagagagat gccggaaccc caggatgaat 2340 cgtctctgag atcgtttctc ggaatggcta actactattc cgagtacatt caagacatgt 2400 acaagctgag agtgcctctt gacaaactac tgaagaagga ggtgaattgg atgtggagtg 2460 cggaatgtgc tcaagcattc aaggagatta agtcaatctt gtcttctgat ttgaatcatg 2520 ttcatttcga tccatccaaa gaggttgtac tcgccactga tgcaagcgag aagggaatcg 2580 gtgcagtgct cgcacacaga atcaatggaa aattgagacc gattgcccat gcatcgagaa 2640 cgctgaaaga tgcggagact cgatactctc agatcgagaa agaaggactc ggaatcattt 2700 tcggtgtcct caaatttcac cattacctgt atggcagacg atgcgtgctt cagacagatc 2760 acaaaccgtt actcgcgata ttcggatcga aaactggtgt gaaaatccac actgcaaaaa 2820 gactgtatca ctggagtact ctgctgctcg catattcatt tgacatggaa tacgtcaaca 2880 ctgagtcatt cggttatgct gatgcactct ctcgattgat ttccgcatca agaagcgatg 2940 tggaggagga tgaagatatt ctcggtctaa aaaatgtcga gaaggccgtg tgcaaagcag 3000 tcaggaactg tgcatccaag atgccggtta ctgtgaaaga tctacaggat gctaccgatc 3060 aagattcagt gcttcagaag gtgaaggagt atcatatgtc gagatggcca gatctaaaga 3120 agttaaagct tgatcgaaaa gatcatggct tacttccgtt cttccataga aagactgatt 3180 tgtgcatcgt caaaggatgt cttttcctcg ctgataagat aatcgttcct caatctctcc 3240 agaagaaggt tcttgagatg ctccatatat ccaggaatag tcagaatgaa ggcactggca 3300 agacaaacct gctactggta tggaatggat actcagattg agcagatggt aaaagaatgt 3360 gatcagtgtg ctgctgcatg caagcaacca gtgaagattc ctctcgaacc atggcccaaa 3420 tcaacagaac catgggaaag aatccatgtg gactatgcgg gtccagtgga tggtcaatac 3480 tttctcgtca ttgtcgattc atactcgaaa tggcctgaag tgataatgac ttcctcgatg 3540 actgcaggtg tgacagtgag aattttggat gaagtcatct ctaggaatgg aattccgaga 3600 gtgttagtat cagataatgg cactcaattt gcgtctgagg ctttcaacaa attcctcatt 3660 gagagaggta tcaagcatct ctactcaccc ccgtatcatc cacagtctaa tggacaagcg 3720 gagcgctttg tcgactcgct taagagaagt ctgctcaagc agaagggcga gcgatcaatc 3780 gcagaagcac ttcaagtgtt cttattcacg tatcgaaaga ctccgaatgc acaatgcaat 3840 ggtttctctc cagcggaagt tttcattgga agaagacttc gatcggaact gcaagtgtgc 3900 gttcccaaga cgggaggact caatagtaac tgtcatagtg acagaatgat cgattctgct 3960 aaggagcagt tcgatagaaa gaatggagtc cgtcccagga agttcaagat tggagatgtc 4020 gtgttgtaca gaatgcacgt ggttcccaac agttacaagt ggactaaggg tgtaatcact 4080 gccaagattg gcaaagtgat gtatgaagtg cagttagagc atcgagtaat tcgttctcat 4140 gcaaatcagt taatactgag agagtcttca agagacgatg atttagaagt gatggaagat 4200 agtgaagata tcttcgagac tatgaatctt gagttgataa aatcgaccat caagctaccg 4260 gaagaaaatc ccggagactt cggcatgaac tatctgtccg atcaatccac cgtgccgaac 4320 tcgccgatgg gatctatcaa ggaacccgat cccgaacccg tcaaggaacc aggagaagaa 4380 tctgaggccc aggccattgc cgccgagcca acgcccgtca ccgtgcccac ccggaagtct 4440 actcgaacca ggaaagcacc gagtcgactt gacatcgacc catccaagaa gagatactga 4500 cgcgcaaagc cagtaaactg tcgcgcaaag ccagtccctt tgaccactat tctgtaaata 4560 tgtgccctaa tctcttcttc atctgcattt gttttcctcg ttgttttatt tatcagatac 4620 tcccaagttc gtatcttaga agggagg 4647 // ID DNA8-22_AP repbase; DNA; INV; 213 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-22_AP. XX NM DNA8-22_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-213 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1764-1764 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 213 BP; 66 A; 38 C; 38 G; 71 T; 0 other; catggcttaa atatacaagg ttattgcaat agtaccccca ctatatcatt ttatcacgtt 60 aaggaattaa aatggcagcg taaggggacg acgtatactg gctatactat gatattatat 120 gctatgatat ggaccgtacc cttacgctgc cattttattt ttgtttgata acaatacggg 180 ctatgtgcaa taaccttgta tatttaaacc atg 213 // ID hATm-1_AA repbase; DNA; INV; 5665 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hATm-1_AA, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; KW Autonomous DNA transposon; hAT superfamily; hATm group; KW hATm-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5665 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1046-1046 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM, hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-1_AA is a young family of hATm autonomous DNA transposons CC identified in the mosquito genome. The consensus sequence was CC built based on multiple alignment of 20 copies that are ~2% CC divergent from the consensus. TIRs are 770-bp long. XX FH Key Location/Qualifiers FT CDS join(1584..2229,2316..2401,2562..2767,2831..3630, FT 3696..4054) FT /product="hATm-1_AAp" FT /note="transposase." FT /translation="MEDIRLVGKYLPLFPESANLPTYSEVFRKFMFHLRAE FT KQKPRASAQLTTQALKTSWNKKHLNHLIDDVTVEQKILRYYETWKSLQKHI FT LRMSAKETEKRNEFSHSLRNLFDIAKKYKPKTKEEKKASQYLQEQREQSTL FT PTTTETGPTETISCSDFEEEANLQATEETNPPTSEQKEASLQQKKTLTGSD FT FESSSQTQSESFTSIGTIGEDLQIPGPSKSSSTPKPQLKNVITVDVAAALD FT RCKVNLTVHWDSKLLPSILSSKAIPKTERLAIVVSGDGIRKLLAVPPIQNS FT TGIEQAVAVYNAVVDWGLEDRVMFTSFDTTASNTGRKKGACQHLNNMFKRQ FT LIGLPCRHHIHELLVGKAFKVLQFETAQSPNVEIFRRFQMEWHRLEKSSYH FT SGLNDAFVKSSITDNERVILLEFFERQLIEHHHREDYREVIQLTIIFLGGK FT INENFKIRAPGAFHNARWMAKIIYSLKMFLYRYTFALPKATKDGLARFNVF FT IVKVYLRNWFLSSCAAMSANNDLNLLKSLKIYEAVDKEIAEAVHKGFLNHL FT QYLDSNLIGLCLFDEKLDVADKKRIMEALHTSDTQKIVPAKSIIESNDVST FT LELHSFVQGCQDTKKLFHALEIPESLLETDITQWNCKKEYEDAKRRIQSLE FT VVNDNAERGIALIKTYNSKLSTDENQKQFILQLVEEHRKMYKSPNKSDVI" XX SQ Sequence 5665 BP; 1939 A; 966 C; 1012 G; 1748 T; 0 other; tagggtgcca atgaaaatcg atttttcgaa tttcaaaaaa aggtagtgct caaaagtttt 60 gtctcctcga aaaaagtccc catgcaaaat ttgagctcaa tcggacttca ttaagtggac 120 ccccaaagcg gtcaaagttt ggcttttttg acccatgaaa aatctccaaa gggggggggg 180 gtacatgaaa tttccgaaat cgaaattttt tttttgatgc cagatgtctt agaaatgcat 240 gaaacgtcga gatctggtgt tatttggaaa attttttttt ttgaaaaaat cgaccttttg 300 ggacttagta aatttttgag ttgggggagt gaattgaatt tgaaatgaac gatttgaatt 360 caattgctga gaaattcaag gcaatagtat tgaaacatat cctatatcat tgttggccac 420 tgaaagcata ttatatgtga tatttcatta gggtggttca tttactttcc attagggtgg 480 tcctttttgt taaaaaataa aaaaaataaa aaatagaatt ttaattgaat attgaagaca 540 atagtattgg aacatctctc acattatcgt aagccatttt ctacatttaa tatgtggtat 600 tttattaggg tggttcactt attttccata agggtgggcc tttctgtcga aaaatcataa 660 tttgaatggg atattaaaaa caatagtatt ttatcacctc tttcatcatc gtaggccatt 720 tccttcatta gattcgtgat atttcattgg gaaggctcac ttatattcca ttacggtggt 780 cctttaaaaa aattgagaat ttgaatggaa tagtaaatac aagagtaatt aaacatctcc 840 tatgtcatcg taggccactg ttaacatata atatgtgata tttcattagg gctattgtcc 900 tcagttttgc ataagggtga tcatttaatt tgtaaattaa aaaaaaatat tctcaacaga 960 tctaacaaat caactttgtt tgtatagttt tgaatcatga ttctttattg gcatgcaact 1020 cttcgttaag atattttcat taaggttttc ttttgaatag ttaaaataaa acgttccaat 1080 aacaataatc agtgaattca gtgaagcaca actctaactg ccataatcaa aatacagata 1140 aaaaaggaat gaaataattt ttgacgtatt caaattatga caatagcttg atgggcgacg 1200 atgaagggca acaaaagtgt ttagtttata ttttcaaaag agattctcag ccttgggctc 1260 cttcattccc gaaacaaaag tacgttgcct tctgcttgct atagtacacg cgattcggtt 1320 gtgacaacgt aggcaattcc agttttaaca atcagttctc gtttattctt caaagatgta 1380 gctaaaagct cctgcttttt ctacctacat agaatataag ttatacctaa agtgaagaaa 1440 atcaaatcgg tagatttttg tctataccta agcgtgttca aaatagaagc tccaaataaa 1500 ttgtatttca gtggtgaata agaaaagcaa ttcgtgtgtc aaaaaaatta tatcaaggcg 1560 tgtgcaagcg ttttttgtta aaaatggagg atattcggct agttggaaaa tatctacctt 1620 tgttccccga atctgcaaat ttgccgacat acagtgaagt attccggaaa ttcatgtttc 1680 atttgcgagc agagaagcaa aagcctagag ctagcgccca gttaacaaca caagctttga 1740 agacatcatg gaacaaaaag catttgaacc atttgatcga cgacgtgaca gtggaacaaa 1800 aaatcttgcg atactatgaa acttggaaaa gcttgcagaa gcacatacta aggatgtcgg 1860 cgaaagaaac tgaaaagaga aatgaattct ctcattcatt gcggaacctt ttcgacatcg 1920 cgaagaagta taaaccaaaa actaaagagg agaaaaaagc atctcagtat ttgcaagaac 1980 aacgagaaca atccactctt ccaacaacaa ctgagactgg tccaacggaa actatctcat 2040 gtagtgattt tgaggaggaa gctaatctac aggcaacaga agagactaat ccgccaacta 2100 gtgagcagaa agaagcaagt ctacagcaaa agaaaacatt gactggtagt gattttgagt 2160 cctcatctca aacccagagt gagtccttta cttcgatcgg tactatcggc gaagatttac 2220 agataccagg taagtgtgaa ttgaatacat tgcaaatatg ttttaaaatg tttttcatgg 2280 attctaatat ttttgatttt tatattttct tctaggtcct tcaaaatcat cttcaactcc 2340 taaaccacaa ttaaaaaacg ttattactgt cgatgtggct gctgctttgg accgctgcaa 2400 agtgagtgat cgaaatgcgg tatttctaat ttcatctata gcgaaatctt tggggcacga 2460 cgtttcaaca ttattgttga acaaggagtc cattagatat tcaagaaaaa aaaaatcgtg 2520 agcaaaagca ccatgagctg aaaacaacat ttaagccaca ggttaactta acagttcact 2580 gggattcaaa gctgcttcct agtatattgt cgtctaaagc catccccaaa actgaacgat 2640 tggctatagt tgtatctggt gatggcatcc gtaagttgtt agctgttccc cccatccaaa 2700 attcaacagg aatagaacaa gcagtagcag tttataatgc agttgtcgat tggggtcttg 2760 aagacaggta agttttttta gcttcaattt tattattgat aatttctaaa atataacgtt 2820 cgttttttag ggttatgttt acatcatttg ataccactgc atcaaacact ggtcgaaaga 2880 aaggagcgtg tcaacatcta aacaatatgt ttaaaagaca gcttattgga cttccctgta 2940 gacatcacat acacgagctg ttagttggca aggcattcaa ggtactgcaa tttgaaacag 3000 cacaatctcc aaatgttgag atctttaggc ggtttcagat ggaatggcac cgcttagaaa 3060 agagttcata ccattctggt ttgaatgatg cgttcgttaa atcatcaata acagacaacg 3120 agagggtgat tctattagaa ttttttgagc gacaactaat agaacatcat caccgcgaag 3180 attatcgtga agttattcag cttacgatta tattcttggg tggaaaaatt aatgaaaact 3240 ttaaaataag agctcctggt gctttccata atgcaagatg gatggctaaa attatttatt 3300 ctttaaaaat gttcctttac cgatacactt ttgcgttacc taaggcaact aaagatggat 3360 tagcacgatt taatgtattc attgtgaaag tatatctaag aaactggttt ctttcttcgt 3420 gtgctgcaat gtctgcaaac aatgacctca atttgttaaa atcgcttaaa atttacgagg 3480 ctgttgacaa agaaattgcc gaagcagtac ataaagggtt cttgaatcat ctgcagtatt 3540 tggatagcaa tctcatcgga ttgtgtttgt ttgacgaaaa gttggatgta gcagataaaa 3600 aaagaattat ggaagcctta catacatcag gtaaatactt tgtttaattt gttgtaccga 3660 gattattatt tttctaacaa tattttgatt ttcagatacc caaaaaattg tacctgctaa 3720 atccataatc gaatcaaacg acgtttcaac actcgagcta cactcgtttg tacaaggatg 3780 tcaggatact aaaaaattgt tccatgctct agagattccg gaatccttgt tggaaactga 3840 tattacgcag tggaattgca aaaaagaata tgaagatgcc aaaagacgta ttcaaagttt 3900 ggaagtagta aatgacaatg cggaacgagg tattgcatta attaaaacat ataattcgaa 3960 attgtcaacc gatgaaaatc agaaacagtt cattttgcaa ttagtcgaag aacatcgaaa 4020 aatgtataaa agtccaaaca aatcagatgt aatttaggaa atatatgcga tgtaatgtac 4080 caaaaacaac caacatgata tttttttaat atcacaattc ctttcctctc cctttcgttg 4140 cttaccttat tgctactaca accattgcaa aatctaacag ttccccatct taagggcccc 4200 ggggttacag ttcagagcct tctcaccaat cactctgatt ttttttatct gagcttgcac 4260 agatggtaca aaactgaatt ggaaaggtca ctgaggctca atgcttgatc tctgagatga 4320 gtgcatctga aatcacgaag tagcaagcgg attttgtatg agacgaggac tccgaactgg 4380 ttcagcttag tgaagtgttg cattccgaat gaaaatcacg caaaactttt caaaaggacc 4440 tttctaacaa tgagaggctt tctagaaaac tcttttgttg ttaactaagt tacctttaca 4500 tttttcgtcg aacattacgc aaatattatg tagtagtcca caccataatc tttcggtctg 4560 tagatattaa ggttgaaatt tttacaatga caccataatt aaggaaagtg gacgagactt 4620 ttgcgagaaa tcatggtttc aaactatacg aaaaatgatg caccctaatg gaaaatgaga 4680 gaaactccct aatgaaatat gttaacagtg tcctacgatg atgtcgcata tcaaatggaa 4740 ctagaggcgc atgaattcag aggaatttaa agcctctcta aacaaaaatg aagacaatgg 4800 cctacgatta taaaagaggt tttaaaatac tattgtgcta aaaaaaagga tcaaattcaa 4860 aatttcttga ttttttttat ttaattttcg gcaaacacca atggaaaata agtgagccac 4920 cctaataaaa tatcacgaat ctaatgaagg aaatgcccta cgatgatgaa agaggtgata 4980 aaatactatt gtttttaata tcccattcaa attatgattt ttcgacagaa aggcccaccc 5040 taatggaaaa taagtgaacc accctaataa aataccacat attaaatgta gaaaatggct 5100 tacgataatg tgagagatgt tccaatacta ttgtcttcaa tattcaatta aaattctttt 5160 ttttttttaa tttttcaaca aaaaggacca ccctaatgga aagtaaatga accaccctaa 5220 tgaaatatca catataatat gctttcagtg gccaacaatg atataggata cgtttcagta 5280 ctattgcctt gaatttctca gcaattgaat tcaaatcgtt catttcaaat tcaattcact 5340 cccccaactc aaaaatttac taagtcccaa aaggtcgatt ttttcaaaaa aaaaattttc 5400 cagataacac cagatctcga cgtttcatgc atttctaaga catctggcat caaaaaaaaa 5460 aaattcgatt tcggaaattt catgtacccc cccccttggg agatttttca tgggtcaaaa 5520 aagccaaact ttgaccgctt tgggggtcca cttaatgaag tccgattgag ctcaaatttt 5580 gcatggggac ttttttcgag gagacaaaac ttttgagcac tacctttttt tgaaattcga 5640 aaaatcgatt ttcattggca cccta 5665 // ID Gypsy10-LTR_Dpse repbase; DNA; INV; 1848 BP. XX AC Unknown_singleton_21; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10_Dpse; KW Gypsy10-I_Dpse; Gypsy10-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1848 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1078-1078 (2009). XX DR Genome; Unknown_singleton_21; Positions 4480 2633. XX SQ Sequence 1848 BP; 764 A; 318 C; 334 G; 432 T; 0 other; tgaaacgatc agcgaagcca aagaaggaag acccccagaa gacccagtca aacaaattca 60 gtcaaagaaa tttaagaagg aagacccgtt cgaagatctg ccaaactaag aaggacgacc 120 cctaccggat atttcgtgag caagagttgt attaataaat atataagttc aattaggtta 180 tataagatta taaaaactaa catataagaa aaattaaaag tggatcaact aactttagat 240 tttgaaaaac taaaaatgat taactctcaa gcaaggaccg atctcacagc agatctggtc 300 aaacaattga tagcacttca aatttcacaa gtacaagagc aaattgtaaa tttaaaacaa 360 caaatagaaa ctaaatccga tcaggtatca gattatcagg cagaaacaat agacgagaca 420 ataagagatg acaccacatt aaaggtaata gaatccctac caattttcaa tggtggatta 480 aaccaatatg taggatggag ggaagctgca gaaactgcag tgaagttata taagaaagga 540 agtaagcaat attatattgc attaacaatt ttgagaaaca aaataacagg accagcacat 600 gacgcactga ccaaccacgg gacagttcta aattttgatg ccatactttc acgacttgac 660 tttgtttaca gtgacaagag accaatttac ataatagaac aggagttaag cgttctccga 720 caagggaacc tttcaattat agatttctat aacgaggtta acaagaaaat gaccttgtta 780 ataaataaga caataatgac tcacggaaag gacagtgaaa tcaccaaaga atcaaataaa 840 aaaattagag acaatgcctt acgcattttt gtcactggcc taaacggcgg cattgctgaa 900 atcctatttt cattgaatcc cccagattta ccgaatgcct tagcaaaggt acaggaattg 960 caatctaata acattcgggc ccaatttgcc taccaattca gcggtcccag aaattcaggg 1020 aataataatc tcaatacctt aagattcaac cagcgacaac ctaggacaaa cagttcagca 1080 ttcgaccaaa aaagggattt gcagaggaat aacatgtctt ggggacaacc aatatttctt 1140 gggtaataaa caacaaaatc aggccccaga accaatagag gtagacgaat ctatacaaat 1200 ccggaaccga cagaacagca atcaattcag gggaaataat aacaatagaa attatcggga 1260 gaatactaac ggttattata ggagaaataa taacaattat agccaagccc agcattttag 1320 ggcaaacgta gtaccgaata atcagtctaa cgacaatcaa gcgcagaaac gaaagccaaa 1380 cgaaatgtcg gaacaaccgg ctaataaagc aatgaggata aataacattg aggaggatca 1440 ttttttagat cagcccccga agtaggtctg ccctacctaa aaagagtaga caaaaaaata 1500 aacagagaat taaaagtttt gataaatacg ggagcaactt cctgttatat aaaaaaagga 1560 attatgacaa taaaaaagaa ttaccaattt ataagaaagt tgctacagtg aatgggtttt 1620 caataattaa atatttccat aatataacta ttttcaacac aaagcaagtg ttttatgaaa 1680 tagatggaat ggaggcagat ttactaatag gatttaacct attaaagaaa atcggagcgg 1740 ttattgatac aggaaagggg actttaagtt ataaagacaa tgaggagaaa ctaatatatg 1800 acgaggtcct ttctttaaac aaattaacct ttatagaagg aaaaacca 1848 // ID BEL-610_AA-LTR repbase; DNA; INV; 518 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-610_AA_; KW Pao_Bel_Ele203; BEL-610_AA-I; BEL-610_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-518 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 518 BP; 185 A; 90 C; 90 G; 153 T; 0 other; tgttggcgaa accactgggc tgtttcctaa cccgttgaag tgagttcctg agtacgccac 60 tcgctttaat tgtccgtcat gcgttgacag ctttggggaa acgtcattgg gaagatatgt 120 tcatgatcat tcaaaagcaa acatttcgaa caatagaaag tgaagataga aggtcgattt 180 attgaagcta aagttaatcc taaagtaaaa ttcattgaat tcgtattata attgaattta 240 tgtacaattt gtgcaatagc caggtagata tatatcctaa attataaatg cattactcta 300 aacctataaa ctactatcac agattgaatt tagtacggag aaaacagtga ggacaaatat 360 aacctaaaat tagtaaagaa caccaaatgt aagtaaacct tataaaacta tgaatttgtc 420 taaaactaac atgcaaaata aaattctagc ttgaagctgt ttctaacgga ctcgctatca 480 ggattgttca ttcccgtccg aactgaaagt tctcccca 518 // ID BEL-123_AA-LTR repbase; DNA; INV; 732 BP. XX AC AAGE02026663; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-123_AA_; KW BEL-123_AA-I; BEL-123_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-732 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026663; Positions 852 121. XX SQ Sequence 732 BP; 221 A; 151 C; 168 G; 192 T; 0 other; tgttcaatgc aacggtagca acaatcagtg agtctaccct aacaacagtc acgagcgtcg 60 aggtatataa tcgtacaccg cagtaacgaa cgcaagagta tcgaccgtca tcagcatcat 120 cgccgtcatc gtattcgtgt ttcgaacacc cgtcggcagt catcgagcgt cggatgtgat 180 cgagctagct agcgttatat gtatacgaag acgaaaccgg ttaaaagtaa atacttatgt 240 actttgtctc tcatacttga cacaagcaat gctttgtgaa tccccatgta taaatttgtg 300 cgcacaaaac caaatacatg atcagtttag tatagttcgt cgtggaatca actgcgtttt 360 attttccgtc cgagtcgaaa gaagtttctt gtcggtcgct gagtgaattc tagtttcgag 420 aacagtgaag tgcccaattg gcctaatttt cctagttttg gagaaattgt gtatcgaact 480 ttggacattg tgacgaactt tagtgcgatt agaaactccc gtgagcacga tcgagtgctc 540 tggagatttc tggaagaaat tagtgtcgga acaaaggaac aagaacaaaa ttagtgtcga 600 aacaaaagaa caaggactgt gacgaatcag tacgaaagtt taataaagtg aatcgcctac 660 ttggccagtg tgaatcagtg caaataggcc ccaccacttt cgaggagccg gacagtgacg 720 acgtcttgta ca 732 // ID BEL-187_AA-I repbase; DNA; INV; 6234 BP. XX AC supercont1.91; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-187_AA_; KW BEL-187_AA-LTR; BEL-187_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.91; Positions 1040178 1033945. XX CC 'GTCAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 44..6232 FT /product="BEL-187_AA-I_1p" FT /translation="MPETDLFDCKLCRRSNNVDDMVFCAACQEWYHYQCVG FT VTSAVANESWMCARCAALPSSTRIGELYGSDPLFPATIVTVSSVKATESSS FT IVASTSSVVSLAASAGLPLAATAASPITSSTAASGPVPPAIAGPSGWLPTT FT TTAISATNYGQSLLTDQARASLQWVQEHRAFLERQMEESHKKELERKKAML FT GQLANNAMQGVITEATSNNLNEQSAAVGTSNVGGWLGRMIGEMENLSLTPA FT STETRMTAPSVSVLNNSSTPTTVGSADRSVFDPSLASLLLGVPSIMSRAST FT NSFPSGQYTTMPSVPSYCYPQPVRDPSQQFGAFPISTIPTMSLSNANLTNV FT VTACPTVPIASIPQGPVYGGLSVNASIPRMEPFPHSSQSSLGGHPWVNVPQ FT APQLPQTSPFSVVPTQQQLLARQVMPRDLPLFRGDPEDWPLFFSAYTNSTA FT ACGYTNVENLARLQRALQGKAIEAVKSRLLLPACVPQVMSTLYMLYGRPEL FT IIQTLLEKVRDTPAPKSDRLETLITFGMAVQNLCDHIEATGQVAHMCNPVL FT LRELVDKIPAQQRLNWALYKQQFTGVDLRTFAGFMSMLVSAASDVTVTSDS FT KQSRSGRGERGKDKNFLNAHAVSEKDSKPFKQEEELHKTDVKEVLCLTCNG FT RNHKVKDCATFKRWDPDTRWKAIQDHHLCRICLGKHGRRPCKLQVICGLDG FT CQQRHHQLLHSSSQKLLPPAKEGQKAVKQSDNPGERLNAHHATKKSTLFRI FT LPVKLFWKGKSVETFAFLDDGSEMTLVEESIAERLGIDDGEPLPLCLTWTS FT NVTRQEPDSQRVCLEISGEGKSEKFSLKDARTVSSLNLPNQTLKYGELARR FT YTYLRGLPVTSYESATPGILIGSNNASLTATLSLREGQLGDPLASKTRLGW FT SIYGYTAEGEKAMNFTLHVCECRRECQAGEDLHDLVKQYFTVESVGVSADK FT GPESQEDKRARQILEETTKRIANRFEIGLLWRHDLVEFPNSFPMAMRRLEC FT FERRMKRDPELRTSVQQQISEYVESNYIHEVTRDEMESTDPRRVWYLPLGA FT VRNPKKPGKTRLVWDAAAKVGGVSFNSMLLKGPDLLTPLASVLCKFRERPV FT AVSGDLKQMFHQFRIRQQDAHSQRFLYREHPSQPVKIFVMDVGTFGATCSP FT CQAQYIKNRNAKEHEVEFPKAAEAIVEKHYVDDYLDSFDTEAEAIKVALEV FT KEVHSRGGFEIRNWHSNSDALLRRVGEPRGIQPKAISIDLESEAERVLGLL FT WLPEEDSIAFATDLQLEGIIPTKRNILRCVMSLFDSQGILSHVTIQGRMII FT QDTWRKQTQWDDEVVEAVRMRWLRWTELFRKVGRMRLHRAYFPGYTATEIG FT AVELHVFTDASEEAYACTAYFRVVIMGNVYVTLVMAKAKVAPLKSLSVPRL FT ELMGALLGARLAKAVKEYHTLPICRRVMWTDSKTTLAWIQSQHHRYRQFVA FT FRVGEILSKTEAAEWRYVPTQHNPADDATKWGKGPSTDVMSRWFRGPDFLY FT KPEIEWPEQRSTQSQETDEELRPCMVHQQKLFEDIYKICNFSKWERLLRTV FT AYVHRYIDNCKLKRKKEKLTMGHLQQKELQKAENSLWRTVQSSAYPDEIAA FT LKQGNGISKKDIDKTSPVYKLSPFLDEHGVMRVDSRIGAVTYVTYDFKFPI FT ILPRNHQLTKLVIDWYHRRYLHANHATVQNEVHQRFHVSNLRSAIRQVTKE FT CQACKIAKTVPTIPRMAPLPESRLAAKERPFSYVGLDYFGPIQVRIGRSCV FT KRWVALFTCLTVRAVHLEVAHSLSTESCKMAIRRFVARRGSPVEIRSDNGT FT NFQGASRELRDQIGVIGQKLAETFTNTNTRWVFNPPSAPHFGGSWERLVRS FT VKVALGSLCGNRNPDDETLLTVLAEAESIVNSRPLTTIPLESVSQEALTPN FT HFILLSSSGVVQPSTKLAEPTKVTRTNWNMARQLVDQFWRRWISEYLPTIA FT LRSKWFGEAAKLKVNDLVLIVDEGQRNGWMRGRVVAVIPGADGRIRQAMVQ FT TTKGLFRSPMTKLAVLQVQGDSNTEAVCRPEVRYGSG" XX SQ Sequence 6234 BP; 1694 A; 1493 C; 1647 G; 1400 T; 0 other; ataagcttta agaactgctt cgatcaccaa cgagaaagga aggatgccgg aaacagattt 60 gttcgactgc aaactatgtc ggaggtcgaa caacgtagat gatatggtct tctgtgcggc 120 gtgccaggag tggtaccact accagtgcgt gggagtaacc tcagcggtgg ccaacgaaag 180 ttggatgtgt gcacgatgcg cggctttacc gtcgtccaca cggattggcg agctttacgg 240 aagtgatccg ctattcccgg ctactatcgt tactgtatcg tcagtgaagg ccaccgaatc 300 atcgtcgatc gtagcttcaa cctcttcagt ggtatcgctt gcagctagcg ctggattacc 360 gctggcagca actgcagcat ctccaattac gtcttctact gcagcttctg gtccagttcc 420 gcctgcaatc gctggaccat ccggctggct acctacaacg acgacggcta taagcgctac 480 taactatggc caatcactgc tgacggatca ggcacgagcg agtctacaat gggtacaaga 540 gcatcgagca ttccttgagc ggcaaatgga ggagagtcat aagaaggagt tagagaggaa 600 gaaggccatg ctagggcaac tcgcgaacaa tgccatgcaa ggcgttataa cggaagcgac 660 atcgaataac cttaatgagc aatccgctgc ggttgggacc agtaacgttg gaggctggct 720 ggggagaatg atcggcgaga tggaaaatct gtcgctcaca ccagcgagca cggagacaag 780 gatgacagca ccgtcggtat ccgtactcaa caacagctcc acaccaacaa ctgtgggttc 840 tgctgaccga agcgtattcg atccatcctt agcgtcgctg cttcttggag tgcctagtat 900 tatgtccaga gcttccacaa actcctttcc ttcaggtcag tacactacaa tgccgtctgt 960 tccttcatat tgttacccgc agccggtccg agatccctct caacaattcg gtgcgttccc 1020 aattagcacg attcctacta tgagtctgag caacgctaat ttgacgaacg tggtaactgc 1080 ctgtccaacc gttcccattg catcgattcc gcaggggccg gtttatggtg ggttgtcagt 1140 gaatgcaagt atacccagaa tggaaccatt tccccattcc tcacaaagtt ccttgggtgg 1200 gcatccttgg gttaacgtac ctcaagctcc tcaattgcct caaacgtccc cattttccgt 1260 tgtcccaacg cagcaacaat tgttagcgcg acaagtcatg ccgagggatc ttcctctgtt 1320 tcggggcgat ccggaagatt ggccgctatt ctttagtgca tacactaatt cgacggctgc 1380 atgtggctac accaatgtgg agaatttggc ccgacttcag cgtgcgcttc aaggtaaggc 1440 catcgaggca gtaaagagcc gtttgctgct tccggcttgc gtcccacaag tcatgtctac 1500 tttgtacatg ctgtacggaa gaccggaact gatcattcaa acattactgg aaaaggttcg 1560 tgatactccc gctccaaaat cagataggtt ggagacattg attactttcg gaatggcggt 1620 acagaattta tgtgaccaca tcgaggcaac ggggcaagtt gcgcacatgt gcaaccccgt 1680 cttactgcga gagttggtcg ataagattcc ggcacaacaa cgtttaaact gggcattgta 1740 taagcaacag ttcacgggag tagatttacg cacattcgct ggctttatgt ctatgttagt 1800 gtcagctgct tcggacgtca ccgttacatc ggattccaag caatcgcgat cgggccgtgg 1860 agagcgtgga aaagataaga atttcctcaa tgctcatgcc gtaagtgaaa aggattcgaa 1920 accgtttaag caagaagagg agttacacaa aacagacgtc aaggaagtgt tgtgtttgac 1980 ctgtaacggg cgtaaccata aggtgaagga ctgtgcaacg ttcaaaagat gggacccgga 2040 tacccgctgg aaggcgattc aagatcatca tttgtgtcgc atttgcctag gcaaacacgg 2100 acggcgtcct tgtaagctac aagttatttg cggactagat gggtgccagc agcgacacca 2160 tcagttactg cactccagca gtcagaaact tctaccccca gctaaagaag gccaaaaagc 2220 tgttaagcag agcgataacc ctggagagag gcttaatgcg caccatgcta caaaaaaatc 2280 tacactattt cgaatccttc ccgtaaagct attttggaag gggaaatctg tagaaacctt 2340 tgcgtttttg gatgacggat cggagatgac tctagtggaa gagtcgattg cagaacggtt 2400 aggcatcgat gatggtgaac cactccccct ttgcttaact tggactagca atgtgactcg 2460 ccaggaacca gattctcaac gtgtttgtct ggaaatttcc ggagaaggca aaagcgaaaa 2520 gttttccctg aaggacgcaa gaacagtttc tagcctaaac cttccgaacc aaacgctgaa 2580 gtacggtgaa ctagcccgac gatatacgta ccttcgtggt cttccagtaa ccagctatga 2640 atcagccaca cctggcattc tgattggctc aaacaatgct agcctgaccg caacgctaag 2700 cctgcgtgag ggtcaactag gtgacccttt ggccagcaag actcgactcg gttggtcgat 2760 atacgggtat acggctgaag gagaaaaggc aatgaacttc acgctacatg tttgcgaatg 2820 tcgacgagaa tgtcaagctg gtgaagatct gcacgatctc gtcaaacagt acttcacggt 2880 cgagagcgtc ggagtttcag cggacaaggg gccggaatcg caggaggaca aacgcgcccg 2940 ccagattcta gaagaaacca cgaagcgcat cgcaaacaga ttcgagatag gactgctctg 3000 gcgccatgac cttgtggaat ttcccaacag ttttcccatg gcaatgcgtc gtttggaatg 3060 cttcgagcga aggatgaaaa gggatccgga gctgcgaaca agcgtacagc aacagattag 3120 cgaatatgtt gaaagcaact atatccacga ggtaacgcgg gatgaaatgg agtctacaga 3180 ccctcgaagg gtatggtatt tgccgctagg cgccgtgaga aatccgaaaa agccgggcaa 3240 gactaggcta gtttgggatg ctgctgccaa ggtagggggt gtgtctttta actccatgct 3300 gctcaaagga cccgacttgc tgacaccttt ggcttcagtg ctatgcaaat ttcgggaaag 3360 gccggtagca gttagcggag atctgaagca aatgtttcac cagttcagaa tccgccaaca 3420 agatgctcac agtcagcggt tcctctaccg ggagcaccca tcacaaccgg ttaaaatatt 3480 cgtgatggac gtcggtacct ttggggccac atgctctccc tgccaagccc aatacattaa 3540 aaatcgtaac gcgaaggaac acgaagtaga atttcccaaa gcagcggaag ctatcgtgga 3600 gaaacactac gtcgatgatt acctcgacag tttcgacacg gaagcggagg caatcaaggt 3660 tgctcttgaa gtgaaggaag tgcactccag aggaggattt gagatccgga attggcattc 3720 caactcagac gcccttctgc gacgggtcgg ggaacccagg ggaattcaac cgaaggccat 3780 cagcatcgat ttggaaagcg aagcggaaag agttctaggc ctcctatggt tgccggagga 3840 agattctatt gcatttgcta ctgacctgca actagaaggt attataccga caaagcgaaa 3900 catcctacgt tgtgtcatga gcctgttcga ctcacaaggt atcttgtcac acgtaacgat 3960 acaaggacgt atgatcattc aggacacctg gcgcaaacag acgcaatggg acgacgaagt 4020 cgttgaagca gtccgtatgc gctggcttcg atggacggag ctgtttagga aggtcggtcg 4080 aatgagactc cacagagcct actttccagg atacactgcg actgagattg gtgcagtaga 4140 gctgcacgtt ttcacggatg cgagtgagga agcatacgca tgtacggcat actttcgggt 4200 agttataatg ggaaatgtgt atgttacgct ggtgatggcc aaggcaaaag tagcaccttt 4260 gaagtcttta tcagtacctc gcctggagct gatgggagca ctgctgggag caaggttagc 4320 caaagcagtg aaagaatacc acacacttcc aatatgccgg cgagttatgt ggacagactc 4380 gaagacaacg ctggcgtgga ttcaatcaca acatcatcgc taccgacagt ttgtggcttt 4440 ccgagtggga gagatcttga gcaagacaga agccgccgaa tggagatacg ttcccacgca 4500 acataatcca gccgatgatg caaccaagtg gggaaagggt cctagcacgg atgtgatgtc 4560 ccgatggttc cgcggacccg acttcctcta taagccagaa atagagtggc cggagcaaag 4620 atcgacgcag tcgcaggaga cggatgaaga attgaggccc tgcatggtac accaacagaa 4680 actatttgag gatatctaca agatctgtaa tttttcgaag tgggaacgct tacttagaac 4740 agtcgcatac gttcaccgtt acattgacaa ctgcaagctt aagaggaaga aagagaagct 4800 gaccatgggg catctgcagc agaaagagtt gcaaaaggca gagaatagcc tgtggcgcac 4860 agtacagtca tcagcgtacc cagatgagat cgctgcccta aagcaaggaa acgggatctc 4920 gaagaaggat atcgataaga ctagtccagt atataaactc tcgccgttct tagatgagca 4980 cggggtcatg cgagtagaca gtcgtatcgg agcagtcaca tacgttacgt acgacttcaa 5040 gtttccgatc atcctgccac gaaaccatca acttaccaag cttgtcatcg actggtatca 5100 tcgtcgatat ctacacgcta atcacgcaac ggtacaaaat gaagttcatc aacgcttcca 5160 cgtctccaat cttcgctccg cgatacgtca agtcacaaag gaatgccagg cttgtaagat 5220 cgccaaaaca gtaccaacca taccaaggat ggctcctctt ccagagtcac gtctagcagc 5280 caaggaacgg ccgttctcct acgtcgggct ggattatttc ggacctatac aggttcgtat 5340 agggcgtagt tgtgtaaaac gatgggtcgc gctgtttacg tgcctcactg ttagagccgt 5400 gcacttggag gtggctcatt ccctgtctac ggagtcttgc aagatggcaa ttcgtcggtt 5460 tgtggcgcgt cgtggatcac ctgtcgaaat tcgatcggat aatggcacca attttcaggg 5520 ggccagtcgt gaactgcgag atcagattgg ggtgattggt cagaaactgg cggaaacctt 5580 tacaaatacg aatacacgat gggttttcaa cccaccctct gcgccgcact ttggaggatc 5640 atgggagcga cttgtaagat ccgtcaaagt cgcactcggt tcactgtgcg gtaatcgaaa 5700 cccggacgat gagactttgt taacggtcct ggcggaggcc gagtctatag ttaactctag 5760 accgttaacc accatccctc tagaaagcgt tagccaggag gcgctaacgc cgaatcattt 5820 tatcctgcta agctcaagcg gagttgtgca accttcaaca aaactggcag aaccgacaaa 5880 agtaacgcgg accaactgga acatggctag acagctagtg gaccaatttt ggcggcgatg 5940 gatcagtgaa tatctgccga ctatagctct aagaagcaaa tggtttggcg aagctgcgaa 6000 actgaaagta aacgatcttg tgctaatcgt ggatgaaggt caacgtaacg ggtggatgag 6060 aggtcgggta gtagctgtga ttccgggagc tgatggccga attcgccaag ccatggtcca 6120 gactaccaaa ggattgttcc ggagcccgat gactaagttg gcagtactcc aggtgcaagg 6180 cgatagtaac acagaagcgg tatgcagacc ggaagtgcgt tacgggtcgg ggga 6234 // ID DNA3-9_AP repbase; DNA; INV; 203 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-9_AP. XX NM DNA3-9_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-203 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1950-1950 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 203 BP; 43 A; 43 C; 51 G; 66 T; 0 other; aaggcgggat cccacgactc gtgtatttga cgaatatttg gtattcgtgg atactcgccg 60 agtaatggga atggagtact tgtacgaata ttcggtattc gttattcgtt attcgccacc 120 ttcccgtcga ttgtcggtaa cttgccgatt attcgtgtat tcgtggatat tcgtcaaata 180 cacgagtcgt gggatcccgc ctt 203 // ID BEL-3_DPu-I repbase; DNA; INV; 7135 BP. XX AC scaffold_172; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_DPu_; KW BEL-3_DPu-LTR; BEL-3_DPu-I. XX NM BEL-3_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-7135 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 653-653 (2010). XX DR Genome; scaffold_172; Positions 77443 84577. XX CC Positions [6042-6608] - Integrase core CC 'CGCAT' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS join(1059..4799,4803..7004) FT /product="BEL-3_DPu-I_1p" FT /translation="MAPPRNPTPGGSRSAVKGHLTRILKTIKDAESEAMTA FT KLDVELAGEETKLDEKMEHFKKLSVECQKEMTTDNGANQQDLDAEYDSVNQ FT MEDEVRAGKNIIILKRQEWKDFKEEERVKEAERRRMQERKDDDDARDQKMQ FT DFMQRILAAQASTTSTSATTSTTSTTTPAQTTRLPQRQIKPFKGDILEWTP FT FWESFNAAIHSSSIAAVQKFDYLKQYLKGEAFLCVENLELNDTNYQKAIDE FT LKRMDGKPEVLIEAHLHKLNTLQPVKDMSDISALRSLQLRLQSHINALQTL FT GVDKSTYAGLLGSNLIHLLPYKLQVKWTESASNKVTDIDGLIAFIIQQVEA FT AERLNRLREQVAKPSHPTQSKQPQATPATASQLSVGARPDPAPQNGKSFKQ FT KTWNNTAPPKTGPKRYPDCSRPCIFCGEIHYPSHCTMGLKEKKAIIIAMKR FT CFKCFSDTHETVNCTSTNECRRCKGSHHTALCDKQQTRFSSQPASGTTVTG FT SSAITTTACANGFSDLKVKTATVIVARPNGEETRAILFVDDGSHRSWVTKE FT ISRRLNLKIVAVENIATRVFKKKEANPAEITNVVEMSVRGTWYGAPMVKIT FT ALESDYLADTGPYGTADFARKLWMLDIKMADDRFERKEKEETIGILVGVDQ FT MFQILTNEPAIQSPCGLRAYNTKLGRMIAGPSKEKPSIQGQKVIRQLIQVQ FT NHPMPRVTSFTTTCFSSTKIHSREPEVLPGNEETEICSLQRSKSEIGAVNY FT NEEHVNFSAAPTTKKDKHKASKIELKQQTNFDLSLFWRLENFANLDDGGAV FT ESENRFDSFSDEITRLEDGRYCTPIPWSTDRWRLERYFHLAAGRLESMVGK FT LRKIPGDLEDYSKEIELLKNQFFIEEANFNYDGLHTYLPHHPVYRMDKNTT FT KIRPVFDGAAKSKFGPSLNDVVETGPNLNPDLLSVLMRFRMYRIAWIADIE FT KAFLNVALQPEDSKAVRFLWPKEPAIPGSPLIAYKWKRVPFGLSSSPFLLR FT VTVNKHLLSVQSRFPETVEQLREQLYVDDYLGGADDEPAAIKRVEETDEIF FT KEAKLNMRSWATNEETTRQLLEEKGLCNKVVGILSPTLDGQQKVLGIRWDT FT ESDTFKFDPSTVVTAVEDLSEIVTKRKILSISARVFDPIGFLSPTVLLLKI FT IYQKLWEKELGWDDEAPLDIQQSWKLVMSGLKEFENLQIPRWIGYSKHKIT FT SAELHVFGDASESAYGAVAYARLQKENEEPYVILLVSTRVAPLPKKKVTLP FT RLELLSSLLAIRLGEKVRTSLHIELRTRLDSGDPNRWKPFVRNQVESIRKA FT SNPDWWRHCPGIQNPADLASRGAPALVHSNLWWNGPTWLSRKENEWPDSPE FT TEDKIQEEIEVEANGKIVSVVAAIIDPVQPIEWHLDKISKWQKLLRRTGWI FT VRFINRIRKREWIPQEEIIQEVIPVNGKEMTVDQMTVSELHEAELLIYRQL FT QRDRYPKAFNSLQAKLPIHPKEKIASLNPIWDERDKLIRISGRIALALRDR FT EIEPTILLPANHVIVSLFIIDKHQKLDHAGVKTTLSELKERFWIVKGRQMT FT RKACFRCVKCRKLTSPPFSELSAPLPLNRLRQAQSFHITGVDFAGPLYYKP FT PQVRKRRKKTTEVQPIIADPAVVEEDSEEEPLAVEEAPNPNPDPAVPADED FT IELQELLFDDLSPQVVTSSKHLKSYVCLFTCAVTRAVHLELTKDMTVRSFL FT LAFRRFSARRGPVSVMYSDNAQTFHCVARHLKVLRSDPSIHDLLAMRGTLW FT IFSASLAPWWGGFWERMVRSVKDLLRRSNGRSCLEYDELEVSLIEIESVIN FT ARPLNYVGEGIDDPLPITPNQFLNNRRSNCATPEPAINLMAPNSTSATLVQ FT LDKNRRDYVSDICQRFVKDYLLQLDNFHSKGKATRKIRFGEVVVIHDEHTK FT RLMWKTGVVKELIPSRDGLVRSVTLKTANGNLINRAIQCLHPLELREDQDE FT DVDVVD" XX SQ Sequence 7135 BP; 2229 A; 1634 C; 1558 G; 1714 T; 0 other; tggtccttcg agctaacctt tgaatcagtt ttggaaaatt tgtgttcatt cctctctttg 60 gtaaatcgtg cattcaattt cctttattca gagctatatt taaattatta ttccttcatt 120 catttaaatc acattcatcc atttattgga ttcatttggt gaggtcagtg tgaaaaaatt 180 ttgctagctt gtaaattaac aattagggtt atttgaattt tcattttttc tcacgcaaag 240 tgagtgccct gattgtcaat ttttacatca ttttctaagt ctagtaaaat caatcattct 300 atccaatcct tatcgtgtca gacggccttg ataaagccgc caaaattttt tctcacacta 360 cttttctctg gaacctcgtg gcagccgtgg ggcattgggt tccctttacc cccgaattgc 420 atttagagag agggaatttt ccttcagtct agagaaataa tttaaaatct cttccttcag 480 ctgaaacgcg ccattttttt cttaacttcg ttctttggaa ccgcgtggaa gccccagggc 540 acaagagtcc cctttttgcc aattgcattg atagggcggg acttccttca cctctcattc 600 catttcttat cttcatattt tgcattctaa aaaaaaaaaa aaaaaagccg tttgttaatt 660 ttcaagctta acacgtggca caaaaaaaaa ttttccttta aagagcatgg atttccggtc 720 ttcactaacc ttgtagttat agtttaactg gagtctgtta aaccagactc cttttaactt 780 tcctggccga ttagagtagt agcttgacct gattacccaa cagcggctta tttatttaac 840 gtgggagctt gatctaccac attttgcgag ttgtgagcag ccaggaaagc ccgccataca 900 gcacgctttc agtgaagagg agactcagcc atttaggaag aagaatctcc aatacacacc 960 ggatttaaca acgattcaag attcaaagtt tcaacttcat taagtcattt cgctacgaat 1020 tataatttct cttctctcat tcaaggagta gcatcagcat ggcacctccc agaaacccaa 1080 ctccaggagg gtcaagaagc gctgttaaag gccatctgac ccgaattttg aagacaatta 1140 aagatgccga aagcgaagcg atgacagcca aactagatgt ggaactcgca ggagaagaga 1200 caaaactcga cgagaagatg gagcatttca agaagctctc agtcgaatgc caaaaagaga 1260 tgacgaccga taatggagcc aatcaacaag atcttgacgc cgagtacgac tccgtcaatc 1320 aaatggaaga cgaagtcaga gctggaaaaa atatcatcat tttgaaaagg caagagtgga 1380 aagatttcaa agaagaagaa agagtaaaag aggcagaaag acggaggatg caggagcgga 1440 aagatgacga cgacgctcga gaccagaaaa tgcaagattt catgcagcga atactggccg 1500 ctcaagcatc caccacgtct acttcagcca ctacatctac cacttcaaca acaactccag 1560 cccagacgac aagacttccg caacgtcaaa tcaaaccatt taaaggagac atcctggaat 1620 ggactccatt ttgggaaagc ttcaacgccg cgattcattc ttcgtcgatt gcagcagttc 1680 aaaaatttga ctaccttaag cagtatctta aaggtgaagc ttttctgtgt gtcgaaaatc 1740 ttgagctgaa cgacacaaat taccagaaag cgatcgacga actcaaacgg atggacggga 1800 aaccggaagt gttgattgaa gcacacctcc acaaactcaa cacgctgcag ccagtcaaag 1860 atatgagtga catatctgct ttgagaagtt tacagctcag attgcagtca cacatcaacg 1920 cgcttcagac tttgggagtc gacaaaagca cttatgccgg actccttgga tcaaatttaa 1980 tccatctact tccgtacaag cttcaagtca aatggacaga gtctgcaagt aacaaagtaa 2040 ccgacattga tggtttaatt gcatttatta tccagcaggt ggaagcagct gaacgactga 2100 atagattgag agagcaagta gccaaacctt ctcatccaac acaatccaag cagcctcaag 2160 caactcctgc aacagcatct cagctatcag ttggagctag accagatcca gctccccaga 2220 acggcaaatc ctttaaacag aagacgtgga acaacacagc accacctaaa actggaccga 2280 agcggtatcc ggattgcagc aggccgtgca tcttctgcgg ggaaattcac tatccctccc 2340 attgtacaat gggtttaaag gagaaaaaag ccatcatcat agccatgaag agatgcttca 2400 aatgcttcag tgacacccat gaaacggtca actgcacgtc aaccaacgag tgcagaagat 2460 gcaagggcag ccatcacacc gccctgtgcg acaaacaaca gacaagattt tccagccagc 2520 cagcttcagg cacaacagtg accggaagca gcgccatcac taccaccgct tgtgccaacg 2580 gattctccga cttgaaagta aaaacggcga cggtgatagt ggctaggccg aacggagagg 2640 aaacccgcgc catcttattt gtggacgacg ggagtcatcg ctcctgggtg acaaaagaaa 2700 tttctcgacg tctaaatcta aaaattgttg cagtcgaaaa tattgccact cgagtcttca 2760 agaaaaagga agccaatccg gcggaaataa caaacgtcgt ggaaatgtcg gtgcggggca 2820 cctggtacgg agccccaatg gtaaaaatca ctgcattaga gtcagattat ttagccgata 2880 caggcccgta cggaacggca gattttgcca ggaaattatg gatgctggat ataaaaatgg 2940 cggacgatcg ttttgagcga aaggaaaaag aagagacaat cggaattctt gttggagtcg 3000 atcagatgtt tcagattctc accaacgaac ctgccattca aagcccgtgt ggtttacgag 3060 cctacaacac gaagctgggc cgaatgatag ccggtccctc caaagaaaag ccatccattc 3120 aaggacaaaa ggtcattcgg cagttgattc aagttcaaaa tcacccgatg ccacgggtaa 3180 cttccttcac gacaacttgt ttcagttcaa cgaaaattca ttctagagag cccgaagtcc 3240 tacctggaaa cgaagagact gaaatttgca gtctccaacg ttcaaaatct gaaattggag 3300 cagtcaacta caacgaagag catgttaact tctcagcagc tccaacaaca aaaaaggaca 3360 aacacaaagc ttcaaaaatt gaattgaaac aacaaactaa ttttgatcta tctttatttt 3420 ggagattaga aaattttgcc aatctagacg atggaggagc agtggagtca gaaaatcgat 3480 ttgactcctt cagtgatgaa atcactcgcc ttgaagacgg gagatactgc acgccgattc 3540 catggtcaac agataggtgg agactagaga gatatttcca tttggcagct ggcagattgg 3600 agagcatggt gggaaaatta agaaaaattc caggagatct ggaagattac tccaaagaga 3660 tcgagctttt gaagaaccaa ttttttattg aagaggccaa ttttaactac gatggactcc 3720 acacgtacct tccacatcac ccagtctacc ggatggataa aaacactacc aaaattcgtc 3780 cagtgtttga tggagcggca aaatccaaat ttggtccgag ccttaacgac gttgtagaaa 3840 ctggacccaa tttaaacccg gatctcttgt cagttctgat gagattccgt atgtacagaa 3900 ttgcctggat tgcagacatc gaaaaagcgt ttttaaacgt cgcgctgcag ccggaagatt 3960 ccaaagccgt cagatttctg tggccaaagg agccagctat tcctggctcc cctttaattg 4020 catacaaatg gaaaagagtg ccatttggcc ttagctcaag tccatttttg ttaagagtta 4080 cagtcaacaa acatcttctt tcagttcaat ctcgttttcc agaaaccgtg gagcagttaa 4140 gagagcagct ttacgtcgac gactatcttg gaggagccga cgacgagcca gccgcgatta 4200 aacgagtgga agaaacggat gaaattttca aggaagccaa actgaacatg cggagctggg 4260 ccaccaacga agaaacaacg cgccagctcc tggaggagaa aggactttgc aacaaagttg 4320 tcggcattct ctctccaacc ttggatggac agcaaaaagt cctaggaatc cgctgggata 4380 cggaatctga caccttcaag tttgatccat caaccgtcgt caccgcagtc gaagatttaa 4440 gcgaaatcgt caccaaacgg aagattctca gcatttcggc cagagtcttc gacccaatcg 4500 ggtttttatc acctactgtt ttactcttaa aaattattta tcaaaaactt tgggagaaag 4560 aattaggttg ggatgacgaa gcgcccctag acatccaaca atcatggaaa ttggtgatga 4620 gtggtctcaa ggaattcgaa aatcttcaaa ttccacgatg gatcggctat tccaaacaca 4680 agatcacatc ggcagaactt cacgtctttg gagacgcctc ggagtcagct tacggagccg 4740 ttgcctacgc ccggctccaa aaggaaaatg aagaacctta cgtgattctg ctagtcagct 4800 aaacaagagt cgctccactg ccgaaaaaga aagtaacctt gccaagactt gaattattaa 4860 gttctctttt agccattcgc ttaggtgaaa aagtaagaac ttctcttcac atcgaactgc 4920 gaactcggct ggattcggga gatccaaatc gatggaaacc attcgttaga aaccaggttg 4980 aatccatccg caaagcttca aatccggatt ggtggcgtca ttgtccaggg atccaaaatc 5040 cggctgatct cgcctcgcgg ggagcgccag cgctggtaca ttcgaatctt tggtggaacg 5100 gcccaacatg gctctcaaga aaggagaacg aatggccaga ttctccagag acagaagaca 5160 aaatccaaga agagattgaa gtagaagcga acggcaaaat cgtcagtgta gtagcagcca 5220 tcatcgatcc agttcaacca attgaatggc atctggataa aatatctaag tggcagaaac 5280 ttctacgacg gacgggctgg atcgtaagat ttatcaacag gatccggaag agagaatgga 5340 ttccacaaga agaaattatc caagaagtaa ttccagttaa cggaaaagaa atgacagtcg 5400 accagatgac agtaagcgag ttgcacgagg cggagctact tatttatagg cagctccaaa 5460 gggataggta tcccaaagca tttaattctc ttcaagcaaa actaccaatt catcccaaag 5520 aaaagatcgc gtcgctcaat ccaatctggg atgaaaggga caaactcatc cgaattagcg 5580 ggagaatagc actcgcccta agagatcgag aaattgaacc aaccatcttg ctgccagcta 5640 atcatgttat cgtttctctt ttcattatag acaaacatca gaaactcgat catgctggag 5700 tcaaaacaac attgtcggaa ctcaaggagc gtttttggat agtcaagggt cgtcaaatga 5760 ctcgaaaggc ttgtttcagg tgtgtgaaat gtcggaagtt aacctcgccg ccattcagcg 5820 aattatccgc tccacttcca ctcaaccgac tccgacaagc gcaatcattc cacataactg 5880 gagtcgattt cgccgggcca ctgtactaca agccaccgca agtgcggaaa aggagaaaaa 5940 agacgaccga agttcaaccc atcattgctg atccagccgt agtcgaagaa gattctgaag 6000 aggagccact cgcagtagaa gaggctccaa atccaaatcc agatccggca gttccagcgg 6060 atgaggatat tgaactacaa gaattgttat tcgacgattt atcgccgcaa gtcgtaacca 6120 gttcaaaaca tttgaaaagt tatgtgtgtt tattcacttg tgccgttact cgagcagtgc 6180 atttagagct cactaaagac atgacagtgc gttcttttct actcgccttt cgtcgattct 6240 cagcacgaag agggccagtt tccgtcatgt actctgacaa cgcccaaacc ttccattgtg 6300 tagctcgcca tttaaaagtt cttcgttcag atccatctat tcacgatctt cttgcaatga 6360 gaggaactct ttggatcttt tctgcaagcc ttgcaccctg gtggggagga ttctgggaga 6420 gaatggtgag gagcgttaaa gatctacttc ggcgctccaa tggaagatct tgcctggaat 6480 atgacgaact tgaagtaagt ctaattgaga ttgagagcgt aatcaatgcg cgaccactta 6540 attacgtagg agaaggaatt gacgatccgc tcccaatcac tcctaaccaa ttcttaaaca 6600 atcggcgatc aaattgtgct acgccggagc cggcaattaa tttaatggct cccaactcta 6660 ctagtgcaac actcgtacaa ctcgacaaga ataggagaga ttatgtgagc gacatctgtc 6720 agcgattcgt caaggactac ctgctacaac ttgacaactt ccattcaaaa ggaaaagcga 6780 caagaaaaat ccgtttcggt gaagtagtcg taatacacga cgaacacacc aaacgtctga 6840 tgtggaaaac tggggtcgta aaagaactca ttcccagccg agacgggctt gtccgttcag 6900 tcacattgaa gacagcaaac ggtaatttaa ttaatcgagc tattcaatgt cttcaccctt 6960 tagagctacg tgaagaccaa gacgaagatg ttgatgtggt ggactgagat ccggagccgg 7020 aagaagatcc agtccctgct gatccagctc caattattcc agcagctatc gaagccgtcg 7080 gtgatccagt cgacggagaa gtcgagccgc atcgcatggg ctctggtggg gagta 7135 // ID Gypsy16-I_Dya repbase; DNA; INV; 4823 BP. XX AC chr2h; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dya; KW Gypsy16-LTR_Dya; Gypsy16-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4823 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1101-1101 (2009). XX DR Genome; chr2h; Positions 613583 618405. XX CC Positions [2697-3173] - Integrase core CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS 1419..3323 FT /product="Gypsy16-I_Dya_1p" FT /translation="MKCQLDKCEFMKNKVEFLGFVVSDRGIETNPSKVAAI FT ANYPCPKTLRDLRSFLGLSGYYRRFIQNYAKLAKPLNSLLRGEDGHVSKRM FT SSKKIISLDKEALESFKKLKSNLVSKEIILRYPDFKEFHLTTDASNFAIGA FT VLSQDNHPISFLSRTLSKAEENYAANEKEMLAIIWALKAVKNYLYGKAKVK FT IFTDHQPLTHSLSSWNGSARIKRWKAYLEEYDYKIFYKPGKENVVADALSR FT IPLEQINSVASTQHSAESSRHDLIPSVEAPINVFKNQIFFYKSNKYDYKFT FT IPFPTFHRHEIYRLEYTTDDILTDLKKHLKPSVINGIHTTEDIMGKIQTTY FT RSHFKGIKTRFTQFRVEDLTDETEQEERILQIHRRAHRNALENQKQLAETY FT YFPRMKQKIIHLVKQCKLCKKAKYDRHPPNPEIQETPIPEYPGHILHIDIY FT STERHLVLTAIDKFSKLAQARIINSKAIEDIRTPLRDIIFYFGVPKHIVMD FT NEKAFNSQSIKFMLEQLQIEVYTAPPYKSSVNGQVERFHSTLSEIMRCLKK FT EGTERNFKELLERAVYEYIYSFHSVTKKRPLEVFFGRRVTTDPDQYETARQ FT DNIERLKNKQEIDRTNHNKKKNEILNYQPGEEDIGKK" XX SQ Sequence 4823 BP; 1856 A; 979 C; 802 G; 1186 T; 0 other; aagtaattaa ctggcgcccg aacagggacc ggtgcgcgta cgatacaaca aaacgttttg 60 taagttcgcg gctcgcaata acatttgcgg caaccggcct aatcgcgagt gacaagaacg 120 cgcaactcac ggaaaattaa catctttcgt tcgttcaaat tccctctggc atatcgaact 180 taacatgttc atttgcccat cggtggactc caaggactga aggcagggcc acagaccacc 240 cctcaaccct gtacgcatcg ctggatccca agtagtcttc gttagataaa caggatgaaa 300 tccggatcgt cggagggagg agccacatca aaaagaatac ccatagtact gtaagcttaa 360 ccaaccaaat ttaaaaaaaa aaaactcaaa actaacaaac aaattgaaca aagaaacata 420 taaatttaag tgatcaatga aaaaataaat agaataaatc acctttcaat gctgtctcag 480 taggcggcag cacagtaatt acgcatcata cggtagcaaa cctttttaat attcaggatg 540 ttgaagtaaa attttttctg ttgcccaatt aaaatctttc gatgccattt tgggcaatga 600 tagtttaaaa gaattagggg cagtaataaa aatcagcaaa aatattatga tattaaagaa 660 cggtctctcg atccccataa aagaaaaaac attcgaagca gtcgacacaa ttattccgag 720 aacatcccac ttgtccgaca aacaaaaaac caagctaagc gaactactca agtcgtttcc 780 agacttgttc gcagacccaa accaaaaact gacctacaca actaacgtaa aggcaacttt 840 ccgaacctcg tccgataatc ctatttactc aaaattctat cagtatccaa tgacactgaa 900 agatgaagta aacaaacaga gaaaagaact tttagaagat ggtataattc gaccttctag 960 gtctccgtac aattcaccag tgtggattgt acccaaaaag gcagacgctt caggcgaaaa 1020 aaaatatcgc atggttattg attatcgcaa actcaataaa ataaccattg cagataaata 1080 ccctattcct gagataaatg aagttcttac ccaattagga gtataaagta ttctccgttt 1140 tagatctgaa aagtggtttc catcaaatct cgttaaaaaa tagtgacata gaaaagaccg 1200 ccttctccgt aaacaacgga aaatatgagt ttacacgact cccatttggt cttaagaatg 1260 tgccctcaat tttccaacgt gcattagatg acatacttcg tgaacatatt ggaaaaatat 1320 gttttattta catagacgat atcatcatat ttagcaaaga tgatgatacg catattgaga 1380 acttggctaa aatttttcaa actttacaga acgccaacat gaaatgtcag ctagacaaat 1440 gtgagttcat gaaaaacaaa gtcgaattcc ttggatttgt tgtctccgat aggggaatag 1500 aaacaaaccc gagtaaagtg gcagcgatag caaattatcc ctgccctaaa actttgaggg 1560 atctaagatc cttcctaggc ctttccggct attatagaag atttatacaa aattatgcca 1620 aattagcaaa acccctaaat tcacttttaa gaggggaaga tggacacgtg tccaaaagaa 1680 tgtcatccaa aaaaataatt tcattagaca aagaagctct agaatctttc aagaagctta 1740 aaagtaactt agtgtcaaaa gaaataattc tccgttaccc agatttcaag gagttccatt 1800 taactactga tgcttcaaat ttcgcgatcg gcgcagttct atcgcaagat aaccacccta 1860 tatcatttct ttctagaaca ctttccaaag ccgaagagaa ctacgcggcg aacgaaaaag 1920 aaatgttagc cataatttgg gcactcaagg ccgtcaaaaa ttacctatac ggtaaagcaa 1980 aggttaagat ctttacagat catcaacctt taacacattc tctgagtagc tggaatggaa 2040 gcgccagaat caagagatgg aaagcttatt tggaggaata tgattataaa atattttata 2100 aaccaggaaa agaaaatgtt gttgcagacg cgctctcaag aataccttta gagcagatta 2160 actctgttgc ctctacacaa catagcgccg aaagttctag acatgaccta atccctagcg 2220 tcgaagcccc aataaatgtt ttcaaaaatc aaatattctt ttataaatca aataaatatg 2280 actacaagtt caccatccca tttcctactt tccatagaca tgaaatatac agactcgaat 2340 acaccacgga cgatatacta acagacctaa aaaaacacct aaagccttcc gtaattaacg 2400 gcattcacac gactgaagac attatgggaa aaatacaaac aacctacaga agtcacttta 2460 aaggaatcaa aactcgattc acccaattca gggtagaaga cttgacagac gaaactgaac 2520 aggaagaaag aattcttcaa atacatagga gagcacatcg aaacgctcta gaaaatcaaa 2580 agcaactagc agaaacctat tattttccaa gaatgaaaca aaaaattata catttagtta 2640 agcaatgtaa actttgcaaa aaagctaaat atgacagaca ccccccaaat ccggaaattc 2700 aagaaactcc tattccagaa tatcccggtc acatattgca cattgacatt tattctacag 2760 aacgacactt agtactcact gcaatagata agttttctaa attagcgcaa gcacgaatta 2820 taaattcaaa agcaatagaa gatatcagaa cgcctttgcg tgacattata ttttatttcg 2880 gagtcccaaa acatatcgtt atggacaacg aaaaagcttt caattcccaa tcaattaagt 2940 tcatgttgga gcagctacag atagaagtct atactgcccc accatacaaa agttccgtta 3000 acggtcaggt agaacgattc cattcaacac tctcagagat aatgagatgc ctcaagaaag 3060 agggcactga gagaaatttt aaagaacttt tagaaagggc agtttacgag tacatttaca 3120 gttttcattc ggttacaaag aaacgaccat tagaagtatt cttcggtagg agagtaacaa 3180 cagacccgga ccaatacgag acagccagac aagacaatat tgaaagactt aaaaataaac 3240 aggaaattga ccgaacaaac cacaataaga aaaaaaacga aatactaaac tatcagccag 3300 gagaagaaga tataggaaag aaatagttaa ggaaaacaag cacactacag taacaacaga 3360 atcgggaaaa acggtacata aaagccatat caaaatctaa aattttttca gaatattact 3420 atggattacg aacgatgtca gtggaacgat gaaccttaca aactatacga acgcacagat 3480 actgacaatg aacaagggga tcggaaaact acaaacctca acaactaaac taatacacct 3540 cattaacctc gaccaaatcc aaaccgcttt agatatttta catgaacata cggaaaatag 3600 ccttaaacgc agtccattat acgctaccct tcaccatgag ataaacacaa ctactcacat 3660 attccgaaca atcacagcct ttcccaaaac tcgaaaaact aggtccttaa attggctagg 3720 gtcaggatgg aagtacataa cgggtagccc agatcatgat gacctggtta tgatagagca 3780 gaacttagat aaacttattg acaataatat taacaaacag ttagaagttc ttccttctaa 3840 ataaactaac ttcagtatct aatgctctca gcaattctat taaaaaaagc agttcagtaa 3900 gtatcgagac ggctattagt ttgcacaacc agattcgcct tttaaaagaa gaaattatta 3960 acattaaata tgccattcaa tgggcacgat taaacgtagt taacacactt gtattaaatg 4020 aagatgagct aattgaaatc gagaaaattt ttaaaagaaa taatatgcca accctatcaa 4080 tggaagaaat aatggagttt tccgatgtgt caattctaca taataaaaca acacttctgt 4140 atattgtaaa agttccaaac ttagaagaaa tacaatacca agaccttatt ataaaaccca 4200 tagtaaaaaa caattcaatc atacacttaa actttcaaga aatctttgtt aacaaaaaca 4260 ttacctatgg cattccccaa acttgtaaaa ccgtagaata tataaaaata tgcgataaga 4320 aaaaaattgt taatataagt aattcaaaat gtataccaaa gctagttcag ggacagaaag 4380 cgctgtgtag ttttagcaat gccgaccatg tacccaaaat agaagaagtc gacgatggaa 4440 tcattctact aaacgactac gatggaaacg caacatggaa caacacagaa cttcacctag 4500 agggcaccta tctggttcaa ctatcaaatg actcgataat tatcgacaac cagcagttca 4560 gcaacttgga acctacagtc tcgacccccg gagcacctct tgtgcagttt acggccgaag 4620 aaaaagaaag acttaaggtt ctatctcttg aagcattgga agcattgcac ataaacaaca 4680 ctggacagct gatgaacata agaacacatt caacagttaa cagcattact ttgttaacgc 4740 tctttgggat tgtgctggtt ctcatcctgg gtttgcacat ctatagccaa cggaagagga 4800 gggggaagag ttaacaacaa aaa 4823 // ID BEL-53_AA-LTR repbase; DNA; INV; 357 BP. XX AC supercont1.141; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-53_AA_; KW BEL-53_AA-I; BEL-53_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-357 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.141; Positions 378454 378810. XX SQ Sequence 357 BP; 110 A; 70 C; 71 G; 106 T; 0 other; tgttcgatgg aaacccatct tcaagcttcg gtgatttgaa gtaaatgaaa atttgctgca 60 aaaccagcag cacggttgct gcgatgaatt ttggttttac cagtggcggt gcgtaccatg 120 atggacgaac gtgacggaag cgtaccagtg gcaccaacaa gaagaaaaaa aaacttttgc 180 tctattcttt gcatcattcc attcgctttt tcattctttc gagagcagtg aacgcaaaca 240 catgtaaagt caatttttat tcattaaatc gttttgagtt gaaaaacaat gtgttagtta 300 ttccacaaac aatattacag tccattattt cgccgatcaa gtgaattggc cgcaaca 357 // ID Gypsy-18_SI-LTR repbase; DNA; INV; 280 BP. XX AC AEAQ01023712; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_SI_; KW Gypsy-18_SI-I; Gypsy-18_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-280 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023712; Positions 16 295. XX SQ Sequence 280 BP; 84 A; 53 C; 52 G; 91 T; 0 other; tggtctccgg atgttagtca taaaccttct aacattcggt aatcataaat gttgcatatt 60 gtcattgtta gagcacgttt taacccgctc aacgtacaat agttaagagg tgaaatggtt 120 tgcctggaga agacatgaag gcagatctct ctcgactacg gacatgaacg cgtctccgcc 180 gcgcaatcaa tttggatatc gataagttta gttttccctt gttatatttg ataatataaa 240 gtaaagacta tatttataca ttaactgact atttcatcca 280 // ID Gypsy-623_AA-I repbase; DNA; INV; 6325 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-623_AA_; KW Gypsy-623_AA-LTR; Ty3_gypsy_Ele125; Gypsy-623_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6325 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5208-5696] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1084..2439 FT /product="Gypsy-623_AA-I_1p" FT /translation="MNITENRFIIDEGMKSLNNILRNLNKAPNRKYRQFTL FT KQKLNHAKLNYGRITDALAKIETVIEESELLFLTKGARQVYSDIHILISTK FT LEFAAIHEISIFSLGYAILFINKLKSKISPKMAKVDIKTGATLIHVYDGNP FT TNLDTFLDSVALFADLVDNDNAAASQDVKNAAKATALRFIKTRLTEAARQA FT IPENATLDQLIDALKANCSPKTTADNVLAKLKSTRQSTSAEAFCTEVEKLT FT QELKTIYIRNRIPADVATQMATKSGVDALIAGTKNSETKTILRAGTFTKIN FT DAIQKLHENDATQKVQGSHSEDRAKSSNGIQAQMFMANRNQYRGRGRGSNL FT NYGRGNYNSYNRYGGNWRNNNFNHPQHYQPRFQNTRGQGNYRGQWNHRGRP FT SIYLMQPNPQMMNQSVQQQPLAQQMQQNTQQSIMPNQNLQNNQTQMPNQHF FT LGTPFGR" FT CDS 2850..6041 FT /product="Gypsy-623_AA-I_2p" FT /translation="MLFLPSRHEVTRYIPGMQLQEDMVVYSQEIQPGIFCG FT NTIISAKEPIFKFINTTNDVAYIAYATFKPKMQPLKNFHVCKPKKANKTQS FT ERRAKILSEVNMHEIPTYARNDFEKLITEYEDTFCLPDEILTFNNFYKQNI FT NLNDNVPVYIPNYKTIYSQGEEMERQVQKMLQEDIIEPSVSSYNSPILLVP FT KKSENNDKKWRLVVDFRQLNKRVLPDKFPLPRIDSILDQLGRARYFSTLDL FT MSGFHQIPLEENSRKFTAFSTNSGHYQFKRLPFGLNISPNSFQRMMTIAMA FT GLTPERAFVYIDDIVVIGCSLNHHLQNLRSVFERLRKYELKLNISKCKFFR FT TEVTYLGHKLTDKGIIPDESKYETIKNYPIPKNADDVRRFVAFCNYYRKFV FT ENFSQIAYPLNQLLKKNTKFNWTSECQHAFNLLRHNLMSPRILQYPDFSKE FT FTLTTDASDIGCGAVLSQATEAGDQPIAFASKTFTQAEKNKPTILKELLAI FT HWAINYFKPYLYGKRFTVRTDHRPLVYLFGMKNPTSKLTRIRIELEEFDFD FT VVYIKGKENVAADALSRIVTTSDMLKAVDVLVVNTRSMTKKNIISNNKKNI FT NENTEIDHLSCYEIENPYEARKLMKLSTEIENNAIKFKILKKDNKTLHAQV FT HEDVRNGSHALVHALPKIEKITKGFKTRKIAMSADDYIFKMVPINLFKLMA FT NKTLKDLQIIIFKEPIFVNNIDDIRNILTNHHNTPIGGHVGQHRLYLRLRG FT KYKWHNMKNSIVQFVKACELCKRNKILKHTIQPMVITTTPSKAFEVISIDT FT VGPLPRTPNNNRYCITIQCDLTKYIEIIPIQNKEANTIARGIVQEFILTYG FT HFTEMRSDQGTEYNNEVIEQISKILEIKQTFSTPYHPQSIGSLERNHRCLN FT EYLRSFTNEHQTDWDDWIKFYEFAYNTTPHTEHGFTPFELIFGKKANLPQD FT IIQTSNEPIYNFDLYKNELTYKLRKSQAIAKEKLIIQKHQRKKQFDNNINP FT LPISINDTVYLKNENRRKLDSFYIGPYQVISISEPNCEIKHVNSGKSLTVH FT KNRLIRA" XX SQ Sequence 6325 BP; 2433 A; 1127 C; 1090 G; 1674 T; 1 other; ggaaatggcg accgttccga tcgaacctga tgaataaact tagtgataag tgcatatagt 60 gaaacccaat agttgaagat gggtaaaaca agttcgaagt cagacgtagt gcataatgcg 120 gacccwcaag tacgaataat caacaaccaa gaatatcatg ctgaaatgtt acagcagcac 180 gaaacactaa tttatttgat cctgggcata gtggctactc agttactatt gactctgtac 240 ttcatgctga agaaacgcga gcgcaatcga gcgttcaaat tggccaaaag tgtgaacaat 300 ttggcagaag cgtgagtgat agaaaattta aaaaacagaa aagccagtgg aaaaaacaca 360 acaaatctga tcagtgtacc tgtgccattc atgcaaggag aactgcaatg aaaaagtgtg 420 aactaaaacc tacaaaatgg caactcttaa tttgctaaaa tcagtgcttg agcaattgga 480 tattgccagg gaaaggtgca tggaagttga atttgcagtg aatgctaacg ttaagaaact 540 attgcaaatc ttatcagatc aaggactgtt acaattagtg atctgtaata acgaacaaat 600 catagtgaaa gtgaacaaag aggcatcaag ttttagtgca ttgtcacaac taatcacagt 660 aagaagaaca aacttgttac aaatagctgc aaagctggtg ccgactgtga tgggtaattt 720 actcattttt acaaggatgg gagttatgtc ccaccacgaa gctttacacc aacgtatggg 780 aggtcaaata attggagtgg ctttctgacc accagtgatc gtgttagata gaagctgtga 840 ccataattgc gaccgcccac gtttgtttta ttctacacac cgggcgttaa gtgcaaatat 900 tcacgagaca acacgaatta cccgaagcag tgaaattgag tgccagtgcg tgaggattac 960 caacgagacg aaccctgatg aagtagtttt gtgtgccgct ccatgccgac tgatacgacg 1020 actgatgcag ttaaataaca tgtcgctgcc agatgtacca atcaagtaac ataatacaaa 1080 ggtatgaaca taactgaaaa tagatttatt attgatgagg gcatgaaaag tttgaataat 1140 atcctgagga atttaaataa agcgcccaat agaaaatata ggcaatttac attaaagcaa 1200 aaattgaacc atgcaaaatt aaattacggg cgaattacag atgccttagc taaaattgaa 1260 acggtgatag aggagtcaga attattattt ttaactaagg gagctcgaca agtttacagt 1320 gacatacata ttttaatatc aaccaagctt gaattcgctg caatacatga aatcagcata 1380 ttctcattag gttatgcaat tctttttatc aacaaattaa aaagtaaaat tagtccgaag 1440 atggcaaaag tagatattaa aactggtgcc acacttattc atgtgtatga tggtaaccca 1500 acaaatttag atacattcct ggattcggtt gcacttttcg cagacctcgt tgataacgat 1560 aacgcagccg catcacagga tgttaaaaat gctgctaaag ccaccgcact tcgatttata 1620 aaaacccgac taacagaagc tgctaggcaa gcaattccag aaaatgcgac tctcgatcaa 1680 ttgatcgacg cattaaaggc aaactgttcg ccgaaaacca cagctgataa cgttttagca 1740 aagctaaaaa gtacaaggca atcaacttcc gcagaggctt tctgtacaga agtagaaaag 1800 ctcacccaag agttgaagac tatctacatt aggaatagaa ttccagcaga tgtggcaacc 1860 caaatggcca ccaaaagtgg agttgacgca ctaattgccg gtacgaaaaa ttcggaaaca 1920 aaaacaatat tacgcgcagg aacgtttact aaaataaacg atgcgattca aaaattacat 1980 gagaatgatg caacacaaaa ggttcaggga tctcattcgg aagatagagc taaatcttca 2040 aacggtattc aagcacaaat gtttatggct aacaggaacc aataccgagg ccgtggacga 2100 ggtagtaatc ttaactacgg acgaggaaac tataattctt acaacagata tggaggcaat 2160 tggcgaaata ataatttcaa tcatccacaa cattaccaac cgcgtttcca aaatacgaga 2220 gggcaaggga actatcgggg tcaatggaac catagaggac gaccatcgat ttaccttatg 2280 caacctaatc cacaaatgat gaaccaatcg gttcagcaac aacctctagc tcaacaaatg 2340 caacaaaata ctcaacaatc aattatgcca aatcaaaatt tgcaaaataa tcaaacacaa 2400 atgccaaacc aacatttttt aggcacaccg tttggacggt aggcactata aatgcagccg 2460 tctctaatta cgtaaattta tctttagatt tatcagacac tcgatgtacc ttcataatag 2520 atactggtgc tgatatatct attataaagg caaataaggt gaaatctact caaatttact 2580 atccaaacga gaaatgtatt atttcaggaa ttggtcataa cggaatttcg tcactaggta 2640 gcacttttgc taaaatatct attgaaaacg tatcagtaga acagaaattc catatagtag 2700 aaaatgattt tccaattcca acagatggaa tcattggaag agatttttta actaaatatc 2760 attgcaaaat tgactatgaa ccatggttgt tatcctttac gatagaccaa gtacagattt 2820 caataccaat agaagataat tttcagaaaa tgttgtttct accatctcgt catgaagtta 2880 ctcgatatat accaggaatg cagttacaag aagatatggt tgtatattca caagaaattc 2940 aacctggaat attttgtggc aatacaataa tatcagccaa agaacccata ttcaaattta 3000 ttaatactac aaatgacgta gcttatatcg catatgcaac tttcaaacca aaaatgcaac 3060 cgcttaaaaa ttttcatgtc tgtaaaccca agaaggctaa caaaactcaa tcagaacgac 3120 gagcaaaaat cttgtcagaa gtcaatatgc atgaaattcc cacttacgcg agaaatgatt 3180 ttgaaaaatt aataactgaa tatgaagaca ctttttgtct gcccgatgaa attttaacat 3240 tcaacaattt ttacaaacag aacattaact taaacgacaa tgttcctgtg tacattccaa 3300 attataaaac aatttattcg caaggagaag aaatggaaag gcaagttcaa aagatgctac 3360 aagaggacat tattgaacct tctgtgtcat cttataactc gccaatacta ctagtaccga 3420 agaaatcaga aaacaacgat aaaaaatggc gtttggtagt agactttcgt caactaaaca 3480 aaagagttct accagataaa ttccctttac caagaattga tagcatttta gatcagcttg 3540 gtagagctag atatttcagc actttggatt tgatgtctgg tttccatcaa atccccttag 3600 aagaaaattc aagaaagttt acagcatttt cgacaaattc tggtcattat caatttaaac 3660 gactaccatt tggattaaac attagcccga atagttttca aagaatgatg accatcgcta 3720 tggctgggtt aaccccagag cgtgcatttg tctatataga tgatattgtg gtaattggat 3780 gttctttgaa tcaccatttg caaaatttaa gatcagtttt cgaacgattg agaaaatacg 3840 aattaaaatt aaacatttcg aaatgtaaat ttttcagaac ggaggttaca tatttgggcc 3900 ataagttaac ggataaagga ataatacctg atgaatcaaa atatgaaaca atcaagaatt 3960 accctattcc taaaaatgcc gacgatgtgc gaagattcgt tgctttctgc aactactatc 4020 gcaaattcgt cgaaaatttt tcacaaatcg catatccttt gaatcaactt cttaaaaaga 4080 atactaaatt taattggact tcggaatgcc aacatgcatt taatctttta cgtcataatt 4140 taatgtctcc taggatatta cagtatcccg atttctctaa agaatttaca ttaacgacag 4200 acgcatccga tataggttgc ggagctgtct tgtcacaggc tacagaagca ggagaccaac 4260 caatagcttt cgctagtaaa acctttacac aagcagagaa aaataaaccg accatactta 4320 aagaattgtt agccatacat tgggcgataa attattttaa accgtacctg tatggtaaac 4380 gattcaccgt cagaaccgat catagaccac ttgtctacct ttttgggatg aaaaatccta 4440 cctcaaagct gacaagaatc agaattgaat tagaagaatt tgattttgat gttgtatata 4500 ttaagggtaa ggaaaatgta gcagctgacg cattatcacg tatagttact acatcagata 4560 tgttaaaagc agtagatgta ctagttgtaa atactagatc tatgacaaag aaaaatatca 4620 ttagcaataa taagaaaaat ataaatgaaa acacagagat tgatcacctc tcgtgctatg 4680 aaattgaaaa tccctatgaa gcaagaaaat taatgaaact gtcgacagaa atagaaaaca 4740 acgcaatcaa atttaaaata ttgaagaaag ataacaaaac acttcacgca caagtgcacg 4800 aagatgtcag aaatggaagt cacgcattag tgcatgctct tccaaaaatt gaaaaaatta 4860 caaagggatt caaaacgagg aaaatagcaa tgtctgcaga tgactacata tttaaaatgg 4920 taccaatcaa tttattcaaa ctaatggcaa ataaaacatt aaaggattta caaataatta 4980 tttttaaaga accaatcttc gtaaataaca tagatgatat taggaatatt ctaactaatc 5040 atcataatac ccctattgga ggtcacgtag gacagcatcg tctgtacctc agactccgtg 5100 gaaaatacaa atggcataat atgaaaaatt ctattgttca attcgtcaag gcctgcgaat 5160 tgtgcaaaag aaataaaata ttgaagcaca ctatacaacc aatggtgata acaacaacac 5220 cgtccaaagc attcgaagta atttccatcg atacggtagg tccactacct agaacaccaa 5280 ataacaatcg ttactgtatt actatacagt gcgacttaac aaaatatatc gaaattattc 5340 caattcaaaa taaggaggca aatacaatag caagaggtat agttcaagaa ttcatattaa 5400 catatggaca ctttactgaa atgcgttctg atcaaggtac cgagtataat aacgaagtaa 5460 tagaacagat cagcaaaatt cttgaaatca agcaaacatt ctcaacccca tatcaccctc 5520 aatcaattgg atcattagaa aggaatcaca ggtgtttgaa cgaatacctg cgatcgttta 5580 ctaatgaaca tcaaactgat tgggatgact ggattaaatt ttacgaattc gcgtacaata 5640 caacacccca caccgaacat ggtttcacac cgtttgagtt aattttcgga aaaaaggcaa 5700 accttccaca ggatattata caaaccagta atgaaccaat atacaatttt gatttatata 5760 agaacgaatt aacatataaa ttacgtaaat cacaagcaat agcaaaggag aaattaatta 5820 tacaaaaaca ccaaagaaaa aaacaatttg acaacaatat taatccacta ccgatttcaa 5880 tcaatgatac agtatatctg aaaaacgaaa atagaagaaa attagattcg ttttatattg 5940 gaccttatca agtaatatca attagcgagc caaactgtga aataaaacat gtcaattctg 6000 gaaaatcttt aacagtacac aaaaacagac taataagagc ataattaaga agagcttaat 6060 tttaagtaag agttccaccg agcctaacct ctaacaaaaa atagtaaaaa taaaaaaata 6120 agtaaataga tgaggtaggt gaatgttccg agcttcaaag ctacggagtg aaaaattcgg 6180 ttagaactat agtagaatta agtaccaaca ttgtaaaaaa aaaaaatata tatatatata 6240 tataatcatt tgtaagagtc aactggagag acatttcttt tagaataatt tcattacatt 6300 acattattct cccaaagggg gatgg 6325 // ID CR1-56_BF repbase; DNA; INV; 581 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-56_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-56_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-581 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-581 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1627-1627 (2009). XX DR [2] (Consensus) XX SQ Sequence 581 BP; 196 A; 148 C; 109 G; 128 T; 0 other; ctgatactga acgccccata caggacacca acaactgaag ttctctccca gctaggctgg 60 aaagacatta aagcaacaca ccagtacagc aatgccgtgc tagtctacaa ggccctaaac 120 agcaaactac caccctacat gcgtcagatg tttgtgtact gcagggacca gagtacacgg 180 acaaccagac aaagcacaag tagtcaactg gttgtcccaa aaccgaatcg agagaccttc 240 agaagatcaa tagcctaccg tggaccctat gtctggaaca acctaccacc agacacacgt 300 acggctccaa atctgaccag ctttaagaga ctcttgaagt aaaagtaatt gaccaaaacg 360 aacggacacg gactatgaac actgagccat ctgcaccact atatgttttg taccatgtat 420 gtgtagtgat ataggttgat gaatctaata gtactcaatg taatctgtat ctgtattata 480 ttttatatga ccctgacctc cctcatgacc tcaatgaaaa gcggcctgct ggccgatttg 540 agcttctcat gaataaacaa aggttcaaac aaacaaacaa a 581 // ID BEL-595_AA-LTR repbase; DNA; INV; 345 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-595_AA_; KW Pao_Bel_Ele34; BEL-595_AA-I; BEL-595_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-345 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 345 BP; 111 A; 73 C; 76 G; 85 T; 0 other; tgacagttct caattcaatc gtggcccaaa atccaggtat gccattgcag ctgacagcaa 60 tgctggcagt ttaatttatg gctagtgata actcgttcaa acgaaagatt tcacacaacg 120 tagggtagaa aagggaggta gatgaaataa aattagtcag ttatatccaa accattatcg 180 agtcaacttc aaagttcaat aaaagtaaag tgagtcgtaa agtgcttaga gattgagtga 240 tcttgtggtg cccagttcag aagacactta ccaagtccgg tagtgtccag aaacgtccgt 300 gtgtgagccc aattccccgc cggaaaccac gatcctgccc taaca 345 // ID Gypsy-22_CQ-I repbase; DNA; INV; 6852 BP. XX AC AAWU01028702; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_CQ_; KW Gypsy-22_CQ-LTR; Gypsy-22_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6852 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 423-423 (2011). XX DR Genome; AAWU01028702; Positions 15044 8193. XX CC Positions [2765-3226] - Reverse transcriptase CC Positions [4514-4969] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 645..1835 FT /product="Gypsy-22_CQ-I_3p" FT /translation="MPKNKREKSALKRAREFFPQNFSDTEESDSSVESNHD FT LAHIPIKDTPEIKNLTIEFNMEQIGQQLAEMAANLNNLLAEQQRQQQLINA FT LGAQQAGQAAQAPVAIPAAPPQHANFFAIPDPIKHGISKFDGNKKQLNAWL FT SSVETTLQQLQPYYTDDQNAMFLRTIINGKIEGRAKDKLCANGNPTTFEEV FT KSILIHHLGDHQDLTFYKCQLWQHVKTPNTTLYSHYNSIKETIQIIKNLSQ FT QDPDYRTAWNVMNKSIEQDALAAFLAGLPKDMFGHAIAAKPVNIEEAYAFV FT CKFKSSENLANNINNRSFKKNTEQNKIITENKPQKFKINPHQQNNSQGQQN FT KNDEPMEIGSTNSRLTLNRRQFNNNENLDENTSESEEESVDLNFCWATGSQ FT TET" FT CDS 2915..5251 FT /product="Gypsy-22_CQ-I_1p" FT /translation="MDPESRSKTAFSTALGHFEFNRMPFGLKNAPATFQRG FT MNNILAEFIGTICFVYLDDIIIIGKNLKDHMENLSKVLERLEKYNLKIQLD FT KCEFMKRETEFLGHVITQDGIKPDPGKIEKILDWKLPETKKEIKQFLGLSG FT YYRRFIKDYSKITKPMTKYLHKSEKSIDLKDVSYKSAFEELKKIIASDQVL FT AYPDFDKPFILTTDASNFALGAVLSQMQEKVEKPIAFASRTLTKCETRYST FT IEKEALAIMWAVQKFRPYIYGNECTLYTDHKPLQYIKTCNKNQKILTWRDE FT LENYKLNFLYKPGKANVVADALSRKIETKDDEDNFEVNANDILSISEHSSE FT DLDLSTQSDDDTIHSAEESAEDYVHFVSRPVNYYRNQIVFKIAHFTSVIHE FT SPFTHYHRHTIIQPTFSKNDVTDFFKKYHNGKQTAIIAPENIIQIIQDVYK FT ENFNNKGHFVLTQLIVEDVCNHDRQNVIISKEHERAHRGINEVEAQIKRAY FT FFPKMRNLIKMHINSCHICNTHKYERKPYNIKISPRPVTEKPFTRVHMDIF FT IINNNSFLSLVDSFSKHLQMYFIKHKNLVQVQKALAKYFTSFGFPNEIVTD FT HETTFRSIQLKNYLAQLGVELKYASSSESNGQVERTHLSIAEIFNTNKHKF FT AGMQTKSIIKLSVALYNDTIHSATKFTPNEIIFNQNNIVNPGELIEKAQEM FT FLEARVNLEKAKTRQTKNNDQKEDPPQLEDDQEVFVIPNVRSKTAPRAVMT FT KIKEVKRKTFKNLRNVKRNKSKIKRLKK" FT CDS 5339..6853 FT /product="Gypsy-22_CQ-I_2p" FT /translation="MILFIIVYLFTTSLCQELEIKHLENKPILVIKHKNCK FT IQSGNIKIIHTVNLTDLETTINLLTNIAYTNIDGKNQLTQIVKYKVKQLYA FT NFYQLKPSNHRRQKRWDTVGTTWKWIAGNPDAEDLRIINKTLRQLIEENND FT QYYVNEQLGQRIQHLTNNIAQNLADNRIIKNEIDILTIIVNIDTVNTLLTN FT IQDAILLSKALVTSSKILSPKEIHTIKQLIEQQGVLVEMPDEAFNLVTPKF FT TVSDETLLYILQLPQLEKEESKVIRILPLTIKNAAINNHPEFLVKTRRELY FT TTTNPDDYVQRRSFIKKFNDACIAPLVLGTHSRCNTTSDTETRTKLLTNNL FT MLITNAKNQQLDSNCGPDNRTVEGNLLISFSNCSIIFNERKISSSEMFTQP FT DVLEGALHNLIIESTQIKDHDIESVHNNTIVNRHLLQQVHLAQYSNKMWNW FT GLLSGISTSTITLTAIIIFIIIKSNCILRGAAKKIAHHRNLRKSSNTRVAD FT DTPIPPGGV" XX SQ Sequence 6852 BP; 2595 A; 1489 C; 1134 G; 1634 T; 0 other; attaagtgta ataataaaag tgatttttta aaacgtaacc cgcgtcgcgg ttctcgtaat 60 tggcgcagcc ggtaggatac ggatcgaagg aaaatagtgc gataacagtg gtagcagcag 120 gactaaaaac tgcagcataa gtggcttcgt cgaaggacat cggcagcaag caggcggagt 180 tcccaaaaac aacgctgctg tggagttgaa ggcccccccg tgcttttcca gacatccgtg 240 catcaacgcc cccctgactg taagtaaacc gcattttttt taccctaatt aagtgcgtcc 300 ccccgtagtg aaagtgcgag ggtaggagtt aacgcaagtc aactcaaaat cgtgattggt 360 tcttgaggtt ttcgaacagc aaaaaccctt cacgtcctcc tacagcacta ccagataggc 420 tgtaagggta gagatccgtg gcactaccag ataggccaca gatttaaaaa tcgttttcaa 480 aagaaaacgt ctgctagtaa cactaccaga taggttacta gtacagaaat tagcacacta 540 ccagataggt gctagtttac cgccaaacta aataataaat tactcaagtg tgtttctgtt 600 atctgacacc ataaacgtgt cggagaagtc atttgaaaag ctttatgccc aaaaacaaac 660 gcgaaaaatc cgccctcaaa agagcgcgtg aattttttcc gcaaaacttt tcggacactg 720 aagaatcgga ttcttcagtt gagtcgaatc acgatctagc gcacatccca attaaggata 780 cgccagaaat aaagaatctg accatcgaat ttaacatgga acagataggt caacaacttg 840 ccgaaatggc tgctaacctg aataaccttt tagcagagca gcaaaggcag caacagctca 900 ttaatgctct tggagctcaa caagctggcc aagcagctca ggcacccgtc gcaatacctg 960 cagcgccgcc gcaacacgct aacttttttg ctatccccga tccaattaaa cacggaatca 1020 gcaaattcga tggcaacaag aaacaattaa acgcatggct ctcgagtgtt gaaacgacac 1080 ttcaacagct ccaaccatat tataccgatg accaaaacgc aatgtttttg cggacaataa 1140 taaatgggaa aattgagggt agagccaaag ataaactttg cgccaatgga aacccaacca 1200 cattcgaaga agttaaatca attttaatac accaccttgg agatcatcag gatcttacgt 1260 tttacaaatg ccagttgtgg cagcatgtta aaactccaaa cacaacacta tatagtcact 1320 ataatagcat caaggaaaca atccaaatta ttaaaaactt atcacaacag gaccccgatt 1380 atagaacggc ctggaacgta atgaataaat cgatagagca agatgcttta gctgcgttcc 1440 ttgctggact cccgaaggac atgtttgggc atgctattgc agccaagcca gtcaatatcg 1500 aagaagccta cgcatttgtt tgcaagttca aaagctccga gaatctcgca aacaatataa 1560 acaatagatc attcaagaaa aatactgaac aaaacaaaat cataacagaa aataaaccac 1620 aaaaatttaa gataaacccg catcaacaaa acaatagtca aggtcaacaa aataaaaacg 1680 acgagccaat ggaaatcggg tcgacgaata gcaggctcac tttgaatcga cgacaattta 1740 acaataatga aaatttagat gaaaacactt ctgaatccga agaagaaagt gttgatctaa 1800 atttttgttg ggcgacagga agtcaaacag aaacctaaat tttttgcctt acatcaaagg 1860 taaaaacaat ttaaagttac taatcgacac aggtgctaat aagaacgtta ttagacctgg 1920 aatcctagaa aaaagacaag aaaccaaaga gacaacaatc aaaaatctct ctggtaacca 1980 caaaattagg ctcaagggaa aagagaacct tataggcttt gaccttaagc cacaaacgta 2040 ttttgaactt catttccaca attttttcga tggaatcatt ggatcagaat ttcttgctaa 2100 aaacaatgcc gaaattgact acggtaaaaa ctccattaaa ttaggaaaaa ctctaatcaa 2160 ctttggaaaa tacttcccag ctaaacaaat tttcaatcac actatcacag ttaaaacaac 2220 agaagatggc gactggtttg tcccaaaaca ccaaaaattt ttgaaacata ctgtcattga 2280 acctggattg tacaaatcca taaataattt tactacggtg cgacttttat ccttaactaa 2340 taaaccacca atcattaata acagtttaaa actcaaagta aataactttg aaacaataac 2400 accaatacca attaaatcag atgacgtgat ctgttccaat cagctctcag aactaatcag 2460 aacaaatcac ctatccaact tagaaaaaga aacacttttt gaaacacttc ttgcacatca 2520 aggagtttta ttaaaaactg acgaaaaatt aacagctaca cctattataa agcaccagat 2580 aaaaactacc gatgaacagc ccgtctatac aaaatcttat cgatatccac aagctttcaa 2640 aaaagatgtt gaagtgcaaa taaaagaatt actagagaat gatattatat gtcactctac 2700 cagcccttat tcttctccta tatgggttgt accgaagaaa acagacgcat ctggaaaaaa 2760 gaaggtacgt gtagtcatag actaccgtaa attaaatgaa aaaacaatta atgacaaatt 2820 tcccattcca caaatagagg aaattcttga cagtttggga aaatccgtct atttcacaac 2880 tttagatctt aaatcaggat ttcatcaaat tgagatggat ccagaatcta gatcgaaaac 2940 agcattttca accgcactag gacattttga atttaatcga atgccgttcg gcttgaaaaa 3000 tgctccagcc acattccaaa gaggaatgaa caatattttg gctgaattta ttggaactat 3060 ttgttttgtt tatttggacg atataattat tatcggaaaa aatctgaaag atcacatgga 3120 aaacctgagt aaggttttag aaagattaga aaagtacaac ttaaaaattc agctagacaa 3180 gtgtgaattc atgaagagag aaactgagtt cctcggccat gtaatcacac aagacggcat 3240 aaaaccagat ccaggtaaaa ttgaaaaaat tctggactgg aaattacctg aaacaaaaaa 3300 agaaattaaa caatttttgg gcctttctgg ctattaccga cggtttataa aagattactc 3360 aaaaatcaca aaaccgatga ctaaatactt acataaatct gaaaaatcga tcgatcttaa 3420 agacgtaagt tataaatcag cgtttgaaga actgaaaaaa attattgcct cagaccaagt 3480 cttggcatac ccagattttg ataagccttt catcttaact actgacgcaa gcaacttcgc 3540 tttgggagca gttctttccc agatgcaaga aaaggtagaa aaaccaattg catttgcaag 3600 cagaacactt accaaatgtg aaacaagata cagtacaatc gagaaggaag cactggccat 3660 aatgtgggct gttcaaaaat tcagacccta catctatggt aatgaatgta cattgtacac 3720 tgaccataaa cctcttcaat acatcaaaac atgtaacaaa aatcaaaaaa ttttaacttg 3780 gcgagatgaa ttggagaatt acaaattaaa ctttctctat aaaccaggaa aggctaacgt 3840 cgtcgccgac gcgcttagca gaaaaataga gacaaaggat gatgaagata actttgaagt 3900 aaacgctaat gacattcttt caatttcaga gcactcgtca gaggatttag atttatcaac 3960 acaatcagat gatgacacaa tccactccgc ggaggagtct gctgaagact atgttcactt 4020 cgtgagtcga cctgttaatt attacaggaa ccaaattgtt tttaaaattg cacacttcac 4080 ttcagtcatc cacgaatctc ccttcacaca ctatcataga cacaccatca ttcaaccaac 4140 tttttcgaag aatgacgtga cagatttctt taagaagtat cacaatggaa agcaaacagc 4200 aatcattgcc cctgaaaata taatacaaat aattcaagac gtatacaaag aaaattttaa 4260 taacaaagga catttcgttc ttacccaact tatcgtggaa gatgtttgta accacgacag 4320 acaaaatgta atcatttcta aagaacacga aagagcccat cgtggaatta acgaggtaga 4380 agcccaaatt aaaagagcat acttctttcc gaagatgaga aatttgatta aaatgcacat 4440 taattcatgc catatttgta acacccataa atatgaaagg aaaccttaca acataaaaat 4500 ttccccaaga ccagtgacag aaaaaccatt cacaagagtt cacatggaca ttttcattat 4560 caataacaat agttttcttt cattagtgga tagtttttcg aaacatctac aaatgtattt 4620 catcaaacac aaaaatttag tacaagtgca aaaggcacta gccaaatatt tcacatcatt 4680 tggattccca aacgaaattg tcactgatca cgaaactact tttcgatcta tacaactaaa 4740 aaattatttg gcacagttag gcgtcgaatt gaaatatgca tcttcttcgg aatcgaatgg 4800 ccaagtagaa aggacccatc tttccatagc agaaatcttc aacaccaaca aacacaaatt 4860 tgcaggtatg caaactaaat cgattatcaa attatcagta gctctgtata acgatacaat 4920 tcactctgca acaaaattca cccctaacga aatcattttt aaccaaaaca acattgttaa 4980 tcctggcgaa cttatcgaaa aagcgcaaga aatgttttta gaagcaagag taaatttgga 5040 aaaagctaaa actagacaaa ccaagaacaa cgatcaaaaa gaagatccac ctcaattgga 5100 ggatgatcaa gaagtttttg ttataccaaa cgttagatca aaaactgccc caagagcagt 5160 catgacaaaa attaaggagg taaaaaggaa aacgttcaaa aacctcagaa atgttaaaag 5220 aaataaaagc aaaattaaac gcttgaaaaa atagccctct acgcagtccc taaaaatgca 5280 cttaccccat aaataaatta actaacctgc actaacacta ctaatattac tttttcagat 5340 gatacttttc ataattgttt accttttcac aacaagtctt tgtcaagaac tagagatcaa 5400 acatttggaa aacaaaccga ttctggtgat taagcacaaa aattgtaaaa tccaatctgg 5460 aaacataaaa atcattcata cagtcaattt aacagactta gaaacaacca ttaatttact 5520 aacaaatatc gcatacacta acatagatgg taaaaatcaa ctaacacaga tagtaaaata 5580 taaggttaaa caactctacg ccaacttcta tcaactgaag ccatcgaacc atcgaagaca 5640 aaaacgatgg gacacagttg gtacgacctg gaaatggatt gcgggaaacc ccgacgcaga 5700 ggacctccgc atcatcaaca agacacttcg tcaactgatc gaagaaaaca acgatcagta 5760 ctacgtcaac gaacaacttg gccaacgcat ccagcacctg acaaacaaca tcgcgcaaaa 5820 ccttgctgac aacaggatca tcaagaacga gatcgacatc ctaaccatca tcgttaacat 5880 cgacacagtg aacaccctat tgacgaacat ccaagacgcc atcttgctgt caaaagccct 5940 agttaccagc agcaagatac tgtctccaaa ggaaatccac acgatcaagc aactcatcga 6000 acaacaagga gtgctcgtcg aaatgccaga cgaagcgttt aacctcgtga cgcccaaatt 6060 caccgtgagc gacgagacgt tactgtacat cttacagctg ccacaactag aaaaggaaga 6120 atccaaagtc atacggattc tacctttaac catcaaaaat gctgctatca acaatcaccc 6180 agagttccta gtgaagaccc gacgagaact atataccaca accaacccgg atgattacgt 6240 tcaacgtcgt tctttcatca aaaagttcaa cgacgcctgc atcgcgccac ttgtacttgg 6300 cacacacagc cgctgcaaca caacatcaga cacagaaaca cgcacgaagt tgttaaccaa 6360 caacttgatg ctgatcacca acgcaaaaaa ccaacaactc gactccaact gtggaccgga 6420 taaccgaacc gtggaaggca atttgctgat ttcattttcg aattgctcaa tcatcttcaa 6480 cgagcgaaaa atttcatcaa gcgaaatgtt cacccaaccc gacgtcctgg aaggagcact 6540 acacaacctc atcatagaat caacacagat caaagaccac gacatagaat ctgttcacaa 6600 caacacaatc gtgaacagac acctgcttca acaagttcat ctggctcagt acagcaacaa 6660 gatgtggaac tggggccttc tcagcggaat ttctacatca acaattacgc tgacggcaat 6720 catcattttc atcatcatca aatcgaactg cattttgaga ggagcagcca aaaagatcgc 6780 ccaccatcgg aacctgagaa aatcatcaaa cacacgtgtc gcggacgaca ctccaatacc 6840 ccccggagga gt 6852 // ID DNAX-3_TCa repbase; DNA; INV; 2328 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-3_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-2328 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 672-672 (2009). XX DR [1] (Consensus) XX CC 2bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 2328 BP; 721 A; 421 C; 429 G; 755 T; 2 other; gtccgggaca tttctatttg gacataaaaa atttgaattt tctgacattt atacgtgtat 60 tcagcgtcta tgtgaacaaa aaatgttctc tctctcattc tatggtcctt gagctttaat 120 aacagggaca ttttttaaat tggtagcatt agaaaatgac agtaaacagc tgactgatgt 180 aaacaataac aaaggaaatg ttataatttt aatctatata accagtggct gctttaaaag 240 tgtgtactaa cgggataata acatatttac gcaagtattt ttgaaggtag aaacattact 300 tgtcatataa caattactta gattatatga tgagacggga atggcacaac acagaattgc 360 agtcagtgat tgtgctggag aaagctgctg tgacgtactg acagtactga ctatcgctgg 420 ttgtacgatt tgtgcaattt gtgatatatt ttgtgtgaaa aactggaaaa caaagtgcat 480 aactgcataa cctcacttat cgcagttttg cacattttga tgatgggaat tgcggttcaa 540 gtaattatga gttatagatg gagcccactg tttacttcaa agtaagttca cgacatttac 600 agatactttc tttatgtttt tgtcttttct ggtgtagagg gttaaccttt ctattttcat 660 ataaaattca aaccaaacga aatcgaaaag cttaatattg tcatgttttt tattaattat 720 taaaaaattt atttaaaaaa tcacagatag tgtttataat ttgcagctga agccttttgg 780 gtccactttt ttcgttgtta tttcctggta gccactcttg tttcaagcag aatgggtgta 840 aagttaaaaa taatcccata aaattgttaa tataactcct ttgcactggc atggggtatt 900 tgaaaactta cgtttaacaa atacgtcaca aagtctcagc catcgtcgaa aaacataata 960 agggctttat aacaagaagc gctttccgat tttactgtgg acgtgtggaa gaatagtgtg 1020 aaatattgcg aaaaaccaac ttttactctt gtttacattt ttgtacgtct ctatctgaga 1080 gtagtcgtct tgtataaaaa atatagccta ctaataaagg aaattaaagt gtttttattt 1140 ctgggttgcc cccaattcca ctttgccctt gcctgataat caagacactg ataatgaaat 1200 tggtgtaagt gcatcgaata caaacgtgtt gaatagatga ataaaagaga gacaactgct 1260 tttttcaaaa aggccatttc aaatttattc aaaaagcaca cccccattga aaagggtcca 1320 ttcacgatga agtttggctt cctgcatgac ataggtctcg grcaccaatc tcactgtaaa 1380 tggtctaaat tggtcttgca tctctaagac atggactcac tgagagcgac ttctgcagta 1440 ggcttgagac ctgcctgcct gtctcgcaac aatctctgca ttatgattcc aaatcattct 1500 ataaaacgct gtcaattaca agccatgata tgaggcatga ggccgtctca gtgtatgtct 1560 tgtggtgtac attgaaactg tttcttaaag aacccattca caatgagatc tgcttcgcta 1620 catgcggcgc gacaggactc aaagactaat cccaaaagac acaaaaaact gacacagttt 1680 atcattttaa gccattgaca gcgttttcga atgttggtgc aaagacttgt cgcgaggcag 1740 gtttcagcct tccgccaacg tcagtctcag tgaaaccctc tcaaagtgat ctcagcgaaa 1800 accattcaca atgagattgr tctatgaggc ctatgtcgtg ccgcatgcag cgagatatgt 1860 ctcattgtta atgaaaatag ggcctttttt gttgtttaaa gtcgtttttt aaaacttttg 1920 tggcaaattt tactgatgtg gcagagaggg ggcatgcaat tggaacatga cgatgactac 1980 cctaaaaggc acttcgaggc ttttatggcc tttttaatgc aaatttcaac tttaaacctg 2040 acagtcatac caaggtcaaa cataacctca acagtaatcc agcaacggta cttcaaagca 2100 tttgaaagtt tctccatttg tttttaagtg ttctagttgg tacattttgc tgtactttac 2160 tacaataaag aaatatttaa atttttatta ttgtttacat ttgtcatctg tcaaacacat 2220 caaacaacca agttccctaa atttgagctc ttatctagtt tttgtgctct atatagtctc 2280 gcgtcaacgt atgcgctatg acgtatctca tatagaaatg tcccggac 2328 // ID DNA8-48_AP repbase; DNA; INV; 817 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-48_AP. XX NM DNA8-48_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-817 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1978-1978 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 817 BP; 293 A; 113 C; 118 G; 291 T; 2 other; gtgttggcca acgatatcga taattcgata cgatatcgat atcgtgggta ttttcttaat 60 aatttcgata tcgatatcta aaaataatta tcgaaaatat cgcgatatcg gtgtgattaa 120 tcggggctcg gaattttgcg gttttacgga tgattaaaca ctaaactgag taaacagtat 180 aaaataaaat aatattttat cttattttat tatattttat ctttatattt tattatatct 240 gtaaaagtaa atactaaacg tagattatga tacgaggcga tattacgtnt tcttacgatc 300 aagatccttt aaattggtgg aaagaaaacg aatctaaata ctcagctgta tcattactag 360 ccaaaaaata cttaagtata gtggccacta gtgttccctg tgaacggtta tttagcgaag 420 ctggaacaat catttccaag aaaagaaata gattatctcc tgagcgactc aatcaactgt 480 tatttttgaa ttcatatttc aaatcgaacc caagttctga aacattgatg gaaaaactgc 540 tcgaagaaga aattttttga ttaaatacat tggatacttt tcagtttgta caaaataatn 600 tataattgtt aattaattta cactcataat atattaaaac aatatgtatt ctatactttt 660 aatatacttt caatacactc ataatataat atgtattcta tacttttaat gttttaatat 720 gcttcgatat cgatatcggt attttttcga tatcgcaata cgatatcggg gaaaaaaata 780 cgatatcgat attgaaaatt atatcgttgg ccaacac 817 // ID I_Ele2 repbase; DNA; INV; 6104 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6104 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6104 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 11 CC sequences with >98% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 140..1429 FT /product="I_Ele2_1p" FT /translation="MTGKPPALMAGVPDDPIGSQRCGRTPGWMQSKDEMGQ FT TMVLVLRCKVPENVTADPRLPEPFTIGTSVELAIGAKEARSVKTTREGRGS FT RYLLRTNSKTIINKLQKLTELIDGTPIEIVPHPTLNTVQGIVYEPDSINTD FT EKMIEKHLIPQEVHTVRRIKKRVNGKLQNTPLLILSFHGTELPEYVYFGLL FT RIPVRTYYPSPLLCFNCGSYGHARKSCQQPGICLRCSQPLHVTEGEQCANT FT PNCFHCNEEHPIMSRECSKYKEEDKIIHLKIDRGISFGEARRLYSEENRRE FT TIARMIQNQLKQEVAKKDQLIATLQKQVADLAKEXAILKSAPQEPQPTFTK FT PSSLPPNSSVAPTRVSGTSRKTEHQSRRDKPFVSPSAEDNGTDDGIRTRSR FT SGKRVFEISPTDSRGNRGKRVSNHPCTSNNTTNTET" FT CDS 1396..6006 FT /product="I_Ele2_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MYKQQHHQHGNVMTGETYKTISPNSDPDLATKTDTNT FT EERNLYQDARMDPSYKPTEHDSGHRSKNTLDYNNQHCERFEGGECVVPPSL FT PEEEASLAREAIRDVYLQPVDRQTSSSAAIKRKSSALPYNIVSNSAAYDQY FT NNLNCKPSSSSTSFDSPCARLTTSTLTRDVPDFSDEPLAAVDVXKPTVSAS FT IFTSRYLPVNHSILPNSPTNLGTNTATTQRRSNTTENKNNESFLRSHLHEE FT PRRAKGKREVTYPLPVLEVDARGGRNDAYHERLAFLSPSQNPVEPNYTHTF FT TTDAAPNILHRHPRSPRSNSKSSDATISSNSXEQAVSPSFAMQWNICGLRS FT HYSELQMLIAKHQPVVVSLQETNIDHQRAPAKLLGDKYEFLFSSCSTHGRQ FT GAGVAVRRGSPFQRIQIQTRIQAVAVQLFAPTEITVVSVYLPPKDKNAINL FT LEDLLEELPKPVMLLGDLNAHHVAWGSTTSTAITEAKRRGEAILELVVQND FT MVVLNDGSHTRIDPVTGNSQALDISICSTSVAAKFAWKIQTDTSDSDHLPI FT LIDLLSASSELHCRSKWIYGKANWELFQEITSETLRPGNFLTVEEFTNRII FT SAAEASIPKSSTKRGTKSVAWWSEDVKIAVKQRRKRLRALRRLGPEDPKRQ FT EALKQFHDARASSRKIVKEAKQKCWEDFVESINPTSSASEVWNKLNRLQGK FT RTSNTLALNLPKGFTNNGRDVAEALADEYETKSANSSYPEKFRRKHASNSN FT LYNKSQRPHLYKRYNTDLTIEELVWALDRKGGSSTGADNIGYPMLQRLPIS FT SKTAMLELFNRIWNSGTFPERWKLATVVPIPKPDADRSRADGYRPIALLSC FT LGKVFERMINRRLTTELETNKKLDPRQHAFRPGKGVESHLAHLESLLSFEN FT DEHVEIVALDISKAYDTTWKPGILRTLKEWKISGRMLNMLCSFLVDRSFQV FT SANGHISSSRIAENGVPQGSILSVTLFLIAMQPIFTKIPPNAEILLYADDV FT ILVVKGSNRQLLRQIMNKTVRSVTEWANSVGFSIAPTKSKLLHCCRLRHRK FT RGRAIKINGASIAHVRKMRILGIMLDSNLNFKQHLASVKSSCSKRINILRI FT LGCRLKRSSRFTLLKAGAALITSKLFFGLGLTSRNIDDMERILGPTYNEVV FT RQSSGAFVSSPINSVMAEAGCLPFRLALIQRLAQLAVRLLEKNPSAVNYPV FT VLRAKDFLRQTTGYSMPNVCSTLRNYDREWYAHSPYTENHVRNKIKAGTNS FT NIVIPTFQELISNQYQYHKKTFTDGSKDADFTGVGVVMEDKEESYALPEAC FT SVFSAEAHALMTAATIASEHHNTIIFTDSASCLDALQSGHSKHPWIQSIER FT VSRSRSITFCWIPGHCGIAGNERADSLAKQARSKQKLDIALPAQDLIKNIR FT RKIWSVWELEWRQSVSQLRQVKCSPIKYPDRKCASEQRTLTRLRIGHTRLT FT HSYLFNKSSPPMCNSCGTRTTVQHILTDCRLYAHQRSNCGITGSLCEILSY FT NPQRETAILQFLKDSNIYNEI" XX SQ Sequence 6104 BP; 1912 A; 1609 C; 1289 G; 1290 T; 4 other; gcgctgatag actttattgt cccgcaagcc ccgagtgcga gttgacgcca acaacgaatt 60 caattcggtc gtacgaaatt ttgtcgcggg aaacgaataa agaagtggat ccgaattagt 120 gctaccctcg tccggtcaaa tgaccggtaa gccccccgcc ctgatggcgg gggtaccaga 180 cgatccaatt ggctcccaac gctgcggaag aacaccggga tggatgcaaa gtaaggatga 240 gatgggccaa accatggttc tagttttacg ttgcaaagtg ccggaaaacg taactgctga 300 tcctcgcctc cctgaaccat ttaccatcgg cacatcagtc gaactagcaa taggagcaaa 360 ggaagccaga agtgtaaaaa ccacccgtga aggacgtggc tcacgatatc tccttcgcac 420 taactcgaaa accatcatca ataagctcca aaagttgacg gaactgatcg acggcacgcc 480 aatcgaaatt gtcccacacc cgacattgaa caccgtccag ggtattgtgt atgaaccgga 540 ctccataaac accgacgaga aaatgatcga gaaacacctt atcccccagg aggtacacac 600 ggtgcgtcgt atcaaaaaac gagtaaacgg caagctacaa aacacccccc tgctaatcct 660 gtcatttcac ggtacagaac tcccagagta tgtgtacttt ggactcttac gaattccagt 720 aagaacctac tacccatcgc cactactttg tttcaactgt ggatcgtacg gccatgcacg 780 aaaatcctgc caacaaccag gtatctgctt acggtgctcg caacctctcc atgtaacaga 840 aggcgagcag tgcgcaaaca caccaaactg cttccactgc aatgaagaac accccatcat 900 gtctcgggaa tgctctaaat acaaggaaga ggataagatc attcacctaa aaatcgatcg 960 cggaatttcg tttggcgaag caagacgtct ctacagtgaa gaaaatagaa gagaaaccat 1020 cgctcgtatg atccagaacc agctcaaaca ggaagttgct aagaaggacc aactgattgc 1080 aactcttcaa aaacaggttg ctgatctagc caaagagwta gcaattctca agtcagcccc 1140 tcaagaacca caaccaacgt tcacaaaacc atcatcatta ccaccaaatt catctgttgc 1200 acccacccgg gtatctggaa caagccgtaa gacagagcac cagtcgagaa gagataaacc 1260 atttgtatca ccgtctgctg aagacaatgg aaccgatgat ggcattcgaa ccaggagccg 1320 aagtggtaaa cgtgtattcg aaatttctcc gacagactcc cgcggcaacc ggggcaaacg 1380 tgtatcgaac catccatgta caagcaacaa caccaccaac acggaaacgt gatgaccgga 1440 gaaacctaca aaactatctc cccaaactcc gaccctgacc tggcaactaa aacggacacg 1500 aacacggaag aacggaactt atatcaagac gcaagaatgg accccagcta caaaccaacg 1560 gaacatgatt ctggacatcg atcgaaaaat acgctagatt acaacaatca acactgcgag 1620 cggtttgaag gtggggaatg tgtagtaccc ccttcacttc cagaagagga agcgagtctg 1680 gcaagagagg cgatccggga cgtctacctc caacccgtcg accgccaaac ttccagctcg 1740 gctgctatca agcgaaaaag ttcagcttta ccttacaaca ttgtttctaa ctctgctgct 1800 tacgaccagt acaacaacct gaactgtaaa ccatcgtcca gttctacttc ttttgattcg 1860 ccatgtgcga ggcttaccac gtcgaccctg accagagatg tcccggattt ttccgatgag 1920 cctctggcgg cagtcgacgt ggmaaagccg actgtatcgg caagtatctt cacttcacgc 1980 tatttaccag tgaaccattc aatccttcca aactctccca ccaatctagg gacaaacaca 2040 gccaccacac agcggagaag caataccacc gaaaacaaga acaacgagtc ctttctccgt 2100 agccatcttc acgaggaacc tcgacgagcg aagggaaagc gtgaagtwac ctaccccctt 2160 ccagttctgg aagtggatgc tcgggggggc aggaatgacg cctaccacga gcgccttgca 2220 tttctctccc cctcccaaaa tccagtcgag ccgaactaca cccacacttt cacaactgat 2280 gccgcaccaa acatcctcca ccgacaccca agaagcccac gatctaactc aaaatcctcc 2340 gacgctacaa tatcatcaaa cagcgawgag caagccgttt caccttcttt tgccatgcaa 2400 tggaacatat gtggtctcag gtcacactac agcgagctac agatgctgat cgcgaaacat 2460 caaccagtag tagtctctct ccaagaaacg aatatagacc atcaaagagc accggcgaaa 2520 ctcctgggcg acaaatacga attcctattt agctcgtgtt cgacacacgg aagacaaggc 2580 gcaggtgtgg ccgtcaggag gggctcaccc ttccaacgaa tacaaataca aacccgtatc 2640 caagctgttg cggttcaact cttcgctcca acggaaatta ccgttgtctc agtgtatcta 2700 cctccaaagg acaagaacgc tatcaaccta ctagaagatc ttctagaaga gcttccaaaa 2760 ccagtaatgc tcttgggaga cctgaacgca catcatgttg cctggggaag cacaacaagc 2820 acagctatca ctgaagcaaa aagaagaggt gaagcaattc tggaattggt ggtccagaat 2880 gacatggttg tcctaaacga cggctctcac acccgaatag atcctgtcac cgggaactct 2940 caagccctcg atatctcgat atgttctaca tcggtggcag ctaagttcgc ctggaaaata 3000 cagacggata cctcagacag cgatcacctg ccaattttaa ttgatctgtt gagcgcttca 3060 agcgaattgc actgtcgatc gaaatggatt tacggcaaag ctaactggga actattccaa 3120 gaaatcacca gcgaaactct acgtccagga aattttttga cagtcgaaga gttcacgaat 3180 agaatcatct cagcagctga agcttctatt ccaaaaagct caactaaacg cgggacaaaa 3240 tcggttgcat ggtggagcga agacgtcaag atagccgtga aacaaagacg caagcgactg 3300 cgtgcccttc ggcggcttgg acccgaagat cctaagaggc aagaggcttt gaaacaattt 3360 cacgacgccc gtgcttcgtc ccgtaagatt gttaaagaag caaaacaaaa atgctgggaa 3420 gacttcgtag aaagtatcaa ccccacgagt tcagctagcg aggtatggaa taagctaaac 3480 agactccaag gtaaacggac ttcaaacaca cttgcactaa acctcccgaa aggtttcaca 3540 aacaacggaa gagatgtagc agaagctctg gctgacgagt acgaaacaaa atcagctaat 3600 tccagctacc ctgaaaaatt cagaagaaaa cacgcatcaa actccaactt atacaacaag 3660 tcccagcgac cacacctcta caaacgctac aacacagacc ttactataga agaattagta 3720 tgggctcttg atcgcaaagg aggttcatca actggtgcag ataacattgg ctatccaatg 3780 ctgcagcgtc taccgatatc gtccaaaacg gccatgttgg agctgttcaa ccgaatatgg 3840 aacagtggaa cattcccaga acggtggaaa ttagcaacag ttgtgcctat tcccaaacca 3900 gacgcggatc gcagcagggc agacggctac agaccgatag ctctgcttag ttgcttagga 3960 aaagtcttcg aaaggatgat aaaccgtcgt ctcactaccg agctcgagac aaacaagaaa 4020 ttggaccctc gtcaacatgc tttccgccca ggtaaaggtg ttgagtccca tctagctcac 4080 ctcgagtccc ttctaagctt cgaaaacgat gagcatgtcg agatcgtagc cctcgatata 4140 tctaaagctt acgacacaac atggaagccg ggaattctgc gcactctgaa ggaatggaaa 4200 atttccggcc gaatgttaaa tatgctttgc agcttccttg tcgacagatc attccaggtc 4260 tccgccaacg gacacatatc aagctcaagg atagctgaaa acggagtgcc acagggatcc 4320 attttgtcgg tcaccctctt cctgatagcc atgcagccca tcttcaccaa aatacctcca 4380 aacgctgaaa ttctgctata cgctgacgat gtcatccttg tagtgaaggg gtcaaatcga 4440 caattgttac gccaaataat gaataaaact gttcgttctg ttactgaatg ggccaatagt 4500 gttggcttct cgatcgcgcc aaccaaatcc aaacttcttc actgctgccg actgcgtcat 4560 cggaagcgag gccgagctat taaaatcaac ggagcatcta tagcgcatgt tcgaaaaatg 4620 agaatactcg gaattatgct agattcaaac ctcaacttta agcaacacct agcatccgtc 4680 aagagcagct gcagcaaaag gattaacatt cttagaattc taggatgccg actgaaaaga 4740 agcagcagat tcaccctact gaaagcagga gcagctctga tcacctcaaa actgtttttc 4800 ggcctaggtc ttacaagccg caatattgat gacatggagc gaatacttgg accaacatac 4860 aacgaggtag tacgccagtc ttcgggagcc tttgtatcca gccccatcaa ttctgttatg 4920 gcggaagctg gatgtctccc atttcgcctg gcattaatac aacgtctagc gcaactagcg 4980 gtgcggcttc tggagaaaaa tccctcagca gtcaactatc ctgtagtgct aagagctaag 5040 gattttctac ggcaaacaac aggctacagc atgccaaatg tatgtagcac gctgagaaat 5100 tatgaccgag aatggtatgc acactccccg tacaccgaga accatgttag aaataagatc 5160 aaagcgggta caaacagcaa tattgttata cccacattcc aagagttgat atctaaccag 5220 taccaatacc acaaaaaaac cttcactgac ggctccaaag acgctgactt caccggagta 5280 ggtgtagtca tggaggacaa ggaagaaagt tatgcgctac cggaagcctg cagcgtgttc 5340 tcagccgaag ctcacgccct catgaccgct gcgactatag caagtgagca ccataacacg 5400 attatcttca ccgactctgc tagttgtctg gacgctctac agagcggtca ttctaaacac 5460 ccgtggattc aatccataga aagagtttcc cgaagtcgaa gcatcacgtt ctgttggatc 5520 ccaggtcact gtggtatcgc agggaatgag cgcgctgaca gcttagccaa gcaagctcga 5580 agcaaacaaa aacttgacat tgcattacca gcccaagacc tcattaagaa catcaggcgt 5640 aagatttggt ctgtatggga gctcgagtgg cgtcaaagtg tatcacaact aaggcaagta 5700 aaatgttcac ccataaaata cccagaccgc aaatgcgcct ctgaacaacg cactttgaca 5760 cgacttcgta taggacatac tcgtctcacc cattcgtatc tgttcaacaa gtcgtctccg 5820 ccaatgtgta acagctgtgg gacgcgaact actgttcaac acatcctgac tgactgcaga 5880 ttgtacgctc atcaacgatc taactgtgga atcaccggct cactatgcga aattttatcg 5940 tataacccac aacgagaaac agcaattcta cagtttttaa aagacagcaa catatacaat 6000 gaaatttaat atcctaatac ttgtatagta aacaaataag ttaacgctga cacgaatgcc 6060 actaaagtgg taaagtgtcc ttaataaata ataataataa taat 6104 // ID Gypsy-22-LTR_NVi repbase; DNA; INV; 768 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-22-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-768 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 782-782 (2009). XX DR [1] (Consensus) XX SQ Sequence 768 BP; 189 A; 190 C; 190 G; 199 T; 0 other; tgagacggca ggccccgccc ggggataaga gagcagcaat agataacttg cctggcgtag 60 gcaagttacc gactgagtgg cgctagtagc caaatgtaaa tagtctacct acgagctggt 120 agatggcgtg cgcgcatgta acacaggaga gagggagcga ctcgggtata taaggagccc 180 cggagccagc agcgagcatt aatgatctgg ctctcaactg agcaacgtgc gaatggtagt 240 actcgctgct tgctccggtc tccaccatac ccagtagtcc atgagcccca ggctctcgaa 300 gagtatgtgt gcagtccact ccctctcctt gtaggctgag atccagtatc gcttataaat 360 aaatattaaa tgattaaatg aataacattt aaaacatata aactatttca acccttgttt 420 tccacccggt gtaggatggt agtaccctgt tcttaggtgg cgcatctggt tctagactca 480 tggtgagatg gtagtacctc acacatcagt ctaatagctg cgtggtttac ctaagtgtgc 540 gggatatgcg tagtggatat accctactca tctccttcaa cacacgagca gctctcctct 600 ggtggctcgg ccattacaga ggatagggag gtcttttgtc cactcaatag atattaccgg 660 gtggatcaaa gactgttatc gttcttcgaa tgtgcttctc tcctgcttta ctaacctcgg 720 gtcacgaatc cttcgcgttc gcgtactaac ccttcgcgga ccgtttca 768 // ID hAT-56_HM repbase; DNA; INV; 3676 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-56_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3676 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2044-2044 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(628..1971,1817..3172) FT /product="hAT-56_HM_1p" FT /translation="MKRRFDSGASKRRRRKELAEEACTNSKKIYSFLKKSQ FT NTVQIEQANQTQSVSDNGNIENIVVPIAEDPILEQQPAGNLNIINQERVET FT EISLQENAGENKIGDMETEITDPIHQGNPFNFIDIGIINKNNSCQVNSFLK FT ITCFEIPNNIVKDSNHHAFPYRFLNKTLANGETCKRDWLCWSVEKQSLYCA FT PCFLFNKNAANVSFFSSSAGWGISRGWRRLKDRIPSHESSIYHKENYVVWK FT SASRAALCETSVDNLLLSELKTETENWKKLFQRILDVIIFLSERGLALFGS FT NQRIGDRANGNFLGIIELLSKYDPLLAEHVKHVRESQQCKQRMQAHYLSMR FT IQNEFIDICGSHVQTAILHEIVKAKYFSIIVDATPDCSYKEQTTLVIRYVK FT ILDNSNFSVEERFILFENFSKKNWKRNRCSNTRNIENFEAGFRSLHWSSLR FT QRCIIQTFQLKKDLFYLKIFQKKTGREIAARTLEILKTLKLDFEACIGQAY FT DNGANMAGKYNGVQAVLIQQNPNCMFSSCGNHSLNLVGVDCAESCKEAVTY FT FGTIQQMYNLFSSSPQRWEILKQHLPVSLHGMSKTRWSARIDGVKPVAQHL FT NSVRSALNELGVLHLTAQAKMELNAIQKYISKFDCILMSSIWMKLLTMIHQ FT TNLIIEARHATLDIEKDNIENLCNDIQRLREQFDKILNESKFVARNIGVSC FT EFLTNRHFPSQDDAELYYKINVYFVIIDSIKSGLTRRFQSLQEICKLFGFL FT WQFKNLNDEDLVLAAQHFQHKYDKEISQELENEVLFFKRIYGANFKLDCTP FT KKLLEEILGLGLSGVFPNITIALRIFISLPASTASGERTFNVLKQIKNYHR FT STMGQERLNGLAMLNINCDIARKLDFSNIIAAFSEQKARKAFVKIN*" XX SQ Sequence 3676 BP; 1235 A; 589 C; 634 G; 1218 T; 0 other; caggcccttt cagaggggga gccacatggg ccatatggcc tgggcctccg attttgttgg 60 ggcctcacat tttcaaaaca catttgttaa acacgaaaat aaatatcaaa gtaattattt 120 taatctaatt attctaaaca atttaacttt aggttaactt aaaataataa attaatttct 180 ttttcataat tacacatttc caaacctaaa ttcgacatga taaagattac gtcatgtaat 240 gatgcaatta accctccgtc gggcgcgctt gatcaaattg atacaattac ttttaactcg 300 tacgagtttt taagttatga cttagtgttc tgctttttag tagcttcatt ttttcttaaa 360 ttttttagtt gtaattcatc aaacaagatt aaatcaattt gttttgaact gtttcttatc 420 attttgtata aaattgatat attgcgcttt tgtacattgc gttttgtacg ttacatttcg 480 taactttgag aatagtaata tttttttgtt ctggtaaata ttttttaatt ttaaaacata 540 attcgtttta taaatttgta taagttcgat tgatgtataa atttattata ttttcaaatt 600 tcaatattat ttccagatac cattaaaatg aagagaagat ttgatagtgg tgcaagtaaa 660 aggcggcgga gaaaagaatt ggccgaagaa gcatgcacca actcaaagaa aatatattcg 720 tttttaaaaa aatcacaaaa cacagttcaa attgaacaag caaaccaaac tcaaagtgtg 780 agtgacaatg gtaacattga aaatatagtt gttccaattg cagaagaccc aattttggaa 840 caacaaccag caggaaattt aaatattata aaccaggaac gggttgaaac cgaaatctca 900 ttacaggaaa atgcaggtga aaataaaatt ggagatatgg aaacagaaat aactgatccg 960 attcatcaag gtaatccgtt caattttatt gacattggga ttatcaacaa gaacaattct 1020 tgtcaagtga actcttttct taagattact tgttttgaaa ttccaaataa tatcgttaaa 1080 gactctaatc atcatgcatt tccttatcgt ttcttaaaca aaactcttgc aaatggagaa 1140 acatgcaaaa gagactggtt gtgctggagt gttgaaaaac aatctttgta ttgtgcacca 1200 tgttttcttt ttaacaaaaa tgctgcaaat gtgtcatttt tctctagttc tgccggatgg 1260 ggcatcagta gaggttggag aagattgaaa gatcgtattc cgtcgcacga aagttcaatt 1320 taccataaag aaaactatgt tgtatggaaa tctgctagta gagcagcatt atgtgaaact 1380 tcagtggata atttactttt atcagaactc aagactgaaa ctgaaaattg gaaaaaatta 1440 ttccaacgta ttctagatgt tattattttt ctttctgaac gtggattagc actttttggt 1500 tctaaccaac gaataggtga tcgagcgaat ggaaactttt taggtattat tgagcttctt 1560 agcaaatatg acccactttt agctgagcat gttaaacatg ttcgagaatc acaacagtgt 1620 aagcagcgaa tgcaagctca ttatctttca atgcgaattc agaacgaatt tattgatatt 1680 tgtggttcac acgttcaaac agcaatcctc cacgaaattg tgaaagcaaa atatttttca 1740 ataatagttg atgctactcc agattgctcg tacaaggagc aaactactct ggttattcgt 1800 tatgttaaaa ttttagataa ttcaaacttt tcagttgaag aaagatttat tttatttgaa 1860 aatttttcaa aaaaaaactg gaagagaaat cgctgctcga acactagaaa tattgaaaac 1920 tttgaagctg gatttcgaag cctgcattgg tcaagcttac gacaacggtg ctaatatggc 1980 tggaaagtat aacggggtgc aagcggtttt gatacaacaa aatccaaact gtatgttttc 2040 tagctgcgga aatcactcat taaatttagt tggtgtcgat tgcgctgaat catgcaaaga 2100 agcagtaacg tattttggaa caattcagca aatgtacaat ttattcagta gcagtcctca 2160 aaggtgggaa attttaaaac aacatcttcc tgtttcgttg catggaatgt ccaaaacaag 2220 atggtcagca cggattgatg gtgttaaacc agttgcacaa catttgaact cagtaagaag 2280 tgctttaaat gaacttgggg tgctacattt aacagcccag gctaaaatgg aactcaatgc 2340 tattcaaaag tatatttcca aatttgattg cattttgatg tcgtctatct ggatgaagct 2400 actcacaatg attcatcaga caaatctaat catcgaggca cgccatgcca ctcttgatat 2460 tgaaaaggat aacattgaaa atctatgtaa tgacattcaa aggttgcgtg aacaatttga 2520 taagatttta aatgagtcaa aatttgttgc aagaaatatt ggtgtttcat gtgagttttt 2580 aactaatcgt cattttccaa gtcaagatga tgctgagttg tattataaaa tcaatgtata 2640 ttttgtcatc attgactcaa ttaaatctgg cctcacgcga cgatttcagt ctttacaaga 2700 aatttgtaaa ctttttggat ttctctggca gtttaaaaat ttgaatgacg aagatcttgt 2760 attggctgct caacattttc agcacaaata tgacaaagaa atatcacaag aacttgaaaa 2820 tgaggtttta ttttttaaaa gaatctatgg tgctaatttt aaactagact gcacaccaaa 2880 aaaattgctg gaagaaatac ttggacttgg tctttcaggt gtctttccaa acattacaat 2940 tgcattacga atttttatca gcttgcctgc atcgacggct tcgggcgaac gcacatttaa 3000 tgttttaaag cagataaaaa attatcatcg ttcaactatg gggcaggaac ggttgaatgg 3060 acttgctatg ttgaatataa attgcgatat tgcacgaaaa ctagattttt caaatataat 3120 tgctgcattt tcagaacaaa aagcaagaaa agcgtttgta aaaattaatt aaaactaatt 3180 cattctgttt tacttttagt gaatttcttt agatttcatt ttcaaaatgt gacatattta 3240 actattcatt tatagtgaca aataattaca tatcgaaaaa aaccaaataa taaaaaccaa 3300 ttcataatta tttatgcatg aatcattcat ttgtgaaatc tgaattgtaa cattcacaat 3360 aaagtaacag tgtaacattc cactcatacg tcgtaccttc ttaattccca ggcctcccca 3420 aaaatgcgtt tttagtagac aagcgcgtct cttggctagt tggcttgata aaacgtacaa 3480 aaggcgccac atctgtgtgt gacggtctgt ctaatttatt gcatttagtc ttaaagttag 3540 acatttatcg ggtcctgatc tatctgaaga cttaaatggc tttgcattat cttgcgtcgc 3600 gttttgcgca aatttttctt ttggggcctc cgatttttaa ctggccgggg cctctccaaa 3660 cctctgaaag gccctg 3676 // ID piggyBac-16_SM repbase; DNA; INV; 2312 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-16_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2312 RA Jurka J.; RT "Families of autonomous piggyBac elements from planaria."; RL Repbase Reports 9(8), 1826-1826 (2009). XX DR [1] (Consensus) XX CC ~97% identical to consensus. Low-copy. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 398..2200 FT /product="piggyBac-16_SM_1p" FT /translation="MGKNQKNCLLLKKLIKIKKDQSDNESDAIDIDQCDNE FT IELEEEVNKIESSSSEDEDLCYIAPNKKRARVISDSETDSEDEVFRYTAPS FT KKRARVISDSETENDLDAIDTSSDKEISAEKAADGTLWETLNEGRNVGRSP FT IYTIFKDVSGPTAYAKRHIMLGSASSAFNLIIDEGMMAYIKSCTELEARQV FT LNNKEWTVTKSQLWAFVAILFARGAYEAKNLKCSYLWSAKWGPAFFAQTMS FT RDKFIDILRFIRFDKKNERSERLKTDKFALISKIWEKFIENSQASYKPGAN FT ITIDEQLFPTKARCRFTQYMPNKPDKFGIKFWLASDVRSKYLVNGFPYLGK FT DETRGSNIPLSEFVVTKLAEPYLGCGRNITTDNFFTSISLAKKLLAKKTTL FT VGTIRANRKELPKIAKAKKDKMTQFSTKLYRSENCTLTIYKSKPNKKVSLL FT STKHKNITIEKNKKLVPETVTYYNSTKYGVDVLDQMARKYSVKASSRRWPL FT QVFYNILDLAAINAWILYKETTQVNISRKDFIFQLAEELRSKYREEVENTS FT IPMTEIHATDARKNCQVQQSCKRNRCTNHCAKCNRNVCGKCVSKTEFICKK FT CFS" XX SQ Sequence 2312 BP; 817 A; 373 C; 429 G; 693 T; 0 other; acctattttc acggttgtga gccaatttga ctcatagaat ggtttaacag ttaatcgcat 60 aacatgaaat ttattggttc tatcaggcct gggcaacagt caagtcgcac gtgatttgtt 120 tcgcttctca tttggcattc ctcacataca atatttgtgt gtatgtggga gccattgtcg 180 cttgttttcg ctactccgct acatcattac ccacttctgt catcatttcg atttatacat 240 cgcgtgtgag cagtgcagtg aagtagtata taatttataa ttttataaaa ttcatttttt 300 gccagaaaaa tttgatccac aaaggtaatt ttgtaattta tttgtaaaag gtattacaat 360 tgatatttat tttatctggt tatctagggt ttctattatg ggaaaaaatc agaaaaattg 420 tcttttgttg aaaaaactaa tcaaaataaa gaaagaccaa tcagataatg aatcggatgc 480 aattgatata gaccaatgtg ataacgaaat tgaattagag gaggaagtca ataaaattga 540 atcttcaagc agtgaagatg aagatctttg ttatatagct cctaacaaaa aaagagccag 600 ggttatttct gattctgaaa ctgacagcga agatgaagtt tttcgttaca cagctcctag 660 taaaaaaaga gctagagtta tttctgattc tgaaaccgaa aatgatttag acgctatcga 720 tacatcttct gataaagaaa tttcagccga aaaagctgct gatgggactt tatgggaaac 780 attgaacgaa ggcagaaacg taggtagatc accgatttat accatattca aagacgtttc 840 tggtccaact gcctatgcta aaaggcatat tatgcttggt tcggcaagta gcgcattcaa 900 tttgataatt gacgaaggta tgatggcgta tataaaatca tgtaccgaac tcgaagctcg 960 tcaagttttg aataataagg aatggacggt tacaaaatca caattgtggg cctttgtagc 1020 aattttattt gctagaggag catatgaggc aaaaaatttg aagtgttcat atttgtggtc 1080 tgcgaagtgg ggtccggctt ttttcgccca gacaatgtct agagataaat ttatagacat 1140 acttcggttc atacgttttg ataagaaaaa cgaacgcagt gaaagactga agacagacaa 1200 atttgccttg atttcaaaaa tatgggaaaa atttatagag aatagtcagg cttcttacaa 1260 acctggcgca aacataacga tcgatgagca attattccct actaaagcta gatgccgatt 1320 tacgcagtac atgccgaata agccggataa attcggcatc aaattttggc tggcatccga 1380 tgttagaagc aaatatcttg taaacggttt tccgtacttg ggcaaggatg aaacccgagg 1440 atcaaacatt cccttgagcg aatttgtagt aactaagctt gcagagccat atttaggctg 1500 tggacgaaat attaccacag ataatttttt tactagcatt tcacttgcaa aaaagttact 1560 tgcaaagaaa actactttgg ttggcactat acgtgcaaac cgaaaagaat tacctaaaat 1620 tgcgaaagcc aagaaagata aaatgacaca gttttctaca aaactgtata gatctgagaa 1680 ttgtaccctg accatttata aaagcaagcc taataaaaaa gtttcactac taagtacaaa 1740 acataaaaat ataacaattg aaaaaaataa aaaacttgtt ccagaaacag taacctatta 1800 taatagcacc aagtatggcg ttgatgtact tgaccaaatg gcccgaaaat atagcgttaa 1860 agcaagttca cgcagatggc ctctacaagt tttctacaac attttagatt tagcagctat 1920 caatgcttgg attctgtata aagagacgac tcaagtaaat atttcacgaa aagattttat 1980 tttccaattg gctgaagagc taagaagtaa atatagagaa gaagtcgaga acacttccat 2040 tccaatgaca gaaattcatg ctactgacgc acggaagaat tgtcaagttc aacaatcatg 2100 caaaagaaac agatgtacaa accattgtgc taaatgtaat agaaatgtct gcggaaaatg 2160 tgtttcaaaa accgagttta tatgtaaaaa atgtttttca tgaatttttt gataataaac 2220 atttcattta aattgatttg gaactatttt ctttgcaaaa aacttcaatt ttcttgtatg 2280 agtcaaattg gctcacttca gtaaaaatag gt 2312 // ID DNA8-65_AP repbase; DNA; INV; 168 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-65_AP. XX NM DNA8-65_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-168 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2000-2000 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 168 BP; 41 A; 36 C; 45 G; 46 T; 0 other; cataggcgca atttcaactt ttgacttggg ggggctgaat atattaaagg caagcagacc 60 attgcaatat agcatattaa aattgtatgt cctaggtttt ttgggggggg ctacggctaa 120 gtttgggggg gcttagcacc cccaagcccc ccccaatttg cgcctatg 168 // ID INV2j_DB repbase; DNA; INV; 889 BP. XX AC AF368887; XX DT 13-SEP-2004 (Rel. 9.08, Created) DT 27-FEB-2008 (Rel. 13.03, Last updated, Version 2) XX DE Drosophila buzzatii chromosomal inversion 2j. XX KW P; DNA transposon; Transposable Element; Nonautonomous; KW KEPLER transposon; GALILEO transposon; chromosomal inversion 2j; KW INV2j_DB. XX NM INV2j_DB. XX OS Drosophila buzzatii OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-889 RA Caceres M., Puig M. and Ruiz A.; RT "Molecular characterization of two natural hotspots in the RT Drosophila buzzatii genome induced by transposon insertions."; RL Genome Res 11(8), 1353-1364 (2001). XX RN [2] RP 1-889 RA Marzo M., Puig M. and Ruiz A.; RT "The Foldback-like element Galileo belongs to the P superfamily RT of DNA transposons and is widespread within the Drosophila RT genus."; RL Proc Natl Acad Sci U S A 105(8), 2957-2962 (2008). XX DR Genbank; AF368887; Positions 1 889. XX CC Chromosomal inversion region containing GALILEO and KEPLER CC transposon sequences. XX SQ Sequence 889 BP; 292 A; 155 C; 182 G; 260 T; 0 other; aaaaaaaaat acctactaat tgtagggccc tcaaaatgat tccaaacttc agtgattttt 60 aattaatgac attcgcggta agaaaaaatg gaaacgataa cttttcgaac gcaaattgag 120 aagaacgcaa aatgagccat acgagcaagt tagcaaaaat gatctagtca acctaactga 180 gcaagtgagc caaaatgagc aagtgagcca tatcactaac catacaacac atagactgga 240 caactagaac aaactttttt cacgaagcag attatgattt cccgtcctag tcatcttcgt 300 atttgctcgg ctctttaatt tactgttcgg gcagcgactc gcggacaaat ttcaattttt 360 gtgagctgct ggcttgtgcc tgtgtagaca cacatacgta catacatata tatatatgag 420 aggcatagag agagtgagtg agtaaagaaa gaatagcaca cagcggactt catatgagaa 480 taaaatcatt tcgttcgagc gctggcagcg aactgaatat caactgctgt attttcgctg 540 tccgaacagt aaattaaaga tccgagcaaa tacgaagatg actaggacga gaaatcataa 600 tctgcttcgt gcaaaaagtt tgttctagtt gtccagtcta tgtgttgtat ggttagtgca 660 tatatacata tgtgtacgtc tgtgatgtga taataattat cgtcgagtat cacttgtata 720 ggcgtttgaa tgaatactct gccttgggca tttgcatatt tcttgcatgg gtctcaaata 780 ttgggcgtgt tcatacctca acttaagatt ttgggttcaa attgcaggag aggtcaataa 840 atttgtatta cataatttga cactatggct aaaatgggac atgttattt 889 // ID Copia-16_CQ-LTR repbase; DNA; INV; 169 BP. XX AC AAWU01015501; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_CQ_; KW Copia-16_CQ-I; Copia-16_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 348-348 (2011). XX DR GenBank; AAWU01015501; Positions 12038 11870. XX SQ Sequence 169 BP; 51 A; 44 C; 29 G; 45 T; 0 other; tgccaactaa accaaccctg taccccggaa gtagtaccaa cctgacggcg cgagtgtagc 60 tgtcaaacat cattctcaag taaaacgttc tcgaataaac acgcggaaat tttgtagtcg 120 aacttaaatt cgttttaatt tcgtttttcc gctgccattc cgaaaacca 169 // ID Dneoca1 repbase; DNA; INV; 415 BP. XX AC GU229944; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mellifera subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dneoca1. XX OS Drosophila neocardini OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; cardini group; OC cardini subgroup. XX RN [1] RP 1-415 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229944; Positions 1 415. XX CC Clone Dneoca1. XX SQ Sequence 415 BP; 99 A; 112 C; 108 G; 96 T; 0 other; tgggtgccgc atgagttgac gcaaataaac atttttgact gtatggatgc atgcgaatcg 60 cttctgaatc gcaacaaaat cgacccgttc ttgcggtgaa gctgcccaga cggtggccaa 120 gcctagattg acggccagga aggttcttct gtgtgtttgg tgggattgcc agggaatcat 180 ccactataag ctgctcccct atggccaaac gctcaattcg gacctgtact gccaacaact 240 ggaccgcttg aatgcagcac tcatgcagaa gagggcatct ttgatcaaca gaggccgaac 300 tgtcttccat caggacaacg tcaggccaca cacatctttc gtgacgcacc agaagctctg 360 ggagctcgga tgggcggttc atttgcatgc accgtattgt ccggacctcg cccca 415 // ID MARINA repbase; DNA; INV; 1267 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Mariner-like transposable element from Endopterygota - a DE consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINA; KW mariner-like element; putative transposase. XX OS Endopterygota OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera. XX RN [1] RA Yoshiyama M., Tu Z., Kainoh Y., Honda H., Shono T. and Kimura K.; RT "Possible horizontal transfer of a transposable element from host RT to parasitoid."; RL Mol. Biol. Evol 18(10), 1952-1958 (2001). XX RN [2] RA Yoshiyama M.; RT "Mariner-like elements from moths."; RL Unpublished. XX RN [3] RA Gentles A., Kohany O. and Jurka J.; RT "Mariner-like transposable element from Endopterygota - a RT consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [3] (Consensus) XX CC Average similarity to consensus sequence 88%. XX SQ Sequence 1267 BP; 407 A; 231 C; 298 G; 331 T; 0 other; gataagtccc cggtctgaca catagatggc gtcgctagta ttaaatgcat attattttta 60 tatagtacca accttcaaat gattcgtgtc aaaatttgac gtctgtaagt caattagttt 120 gtgagataga gcgtcttttg tgaagcaact tttgttattg tgaaaaaaat ggaaaaaaag 180 gaatttcgtg ttttgataaa atactgtttt ctgaagggaa aaaatacagt ggaagcaaaa 240 acttggcttg ataatgagtt tccggactct gccccaggga aatcaacaat aattgattgg 300 tatgcaaaat tcaagcgtgg tgaaatgagc acggaggacg gtgaacgcag tggacgcccg 360 aaagaggtgg ttaccgacga aaacatcaaa aaaatccaca aaatgatttt gaatgaccgt 420 aaaatgaagt tgatcgagat agcagaggcc ttaaagatat caaaggaacg tgttggtcat 480 atcattcatc aatatttgga tatgcggaag ctctgtgcaa aatgggtgcc gcgcgagctc 540 acatttgacc aaaaacaaca acgtgttgat gattctgagc ggtgtttgca gctgttaaat 600 cgtaataaac ccgagttttt gcgtcgatat gtgacaatgg atgaaacatg gctccatcac 660 tacactcctg agtccaatcg acagtcggct gagtggacag cgaccggtga accggctccg 720 aagcgtggaa agactcaaaa gtccgctggc aaagtaatgg cctctgtttt ttgggatgcg 780 catggaataa tttttatcga ttatcttgag aagggaaaaa ccatcaacag tgactattat 840 atggcgttat tggagcgttt gaaggtcgaa atcgcggcaa aacggcccca tatgaagaag 900 aaaaaagtgt tgttccacca agacaacgca ccgtgccaca agtcattgag aacgatggca 960 aaaattcatg aattgggctt cgaattgctt ccccacccac cgtattctcc agatctggcc 1020 cccagcgact ttttcttgtt ctcagacctc aaaaggatgc tcgcagggaa aaaatttggc 1080 tgcaatgaag aggtgatcgc cgaaactgag gcctattttg aggcaaaacc gaaggagtac 1140 taccaaaatg gtatcaaaaa attggaaggt cgttataatc gttgtatcgc tcttgaaggg 1200 aactatgttg aataataaaa acgaattttg acaaaaaaat gtgtttttct ttgttagacc 1260 ggggact 1267 // ID BEL-37_CQ-LTR repbase; DNA; INV; 234 BP. XX AC AAWU01044402; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-37_CQ_; KW BEL-37_CQ-I; BEL-37_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 228-228 (2011). XX DR Genome; AAWU01044402; Positions 4858 4625. XX SQ Sequence 234 BP; 62 A; 52 C; 51 G; 69 T; 0 other; tgtttgggca cactgcccaa ggaaaagtag atgttgtgtt ttgtttgcac ctttatttct 60 tggcccgtta aaacggcgga caaattttga tggcccttct ggcaaccctg taagcagaga 120 aaatacagaa taaaccgaac ctttgttgag acaagtcgcg cggtgttttt ctcccgtcag 180 aacccctcca ttttctcgcg gattttaaaa agtgttaaac tgagctatcg aaca 234 // ID CR1_Ele33 repbase; DNA; INV; 5332 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele33. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5332 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5332 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >97% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 392..1225 FT /product="CR1_Ele33_1p" FT /translation="MAKNCGTCSKAINGIDVVVCRGYCGGFFHLNECSGVT FT RAMQSYFTSNRKNLFWMCDNCAELFENSHFRVISNQADEKSPLNSLASAIT FT ELRTEIRQLHAKPVAYPSPAESPRWPSFDQRRGTKRPRIIETNVRAQDSCR FT VGSKKAEENVVSVAICKSEADNRFWLYLSKIRPDVTTEAVCEMTKANLNME FT NDPIVVKLVPKGKQIESLSFVSFKIGLDPLLKRKALDPETWPEGLLFREFV FT DYSASKFRSTLNMNTRMTPLLQPQTPVSSVTPVMDLS" FT CDS 1264..5238 FT /product="CR1_Ele33_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAPYSPNTVPFFLPTLISRPVPVFGCEEGVFQTVSI FT GKYTCDVNKTLPVKSSNSSNTSPNYICSRSRHQKHVTIMNNQPSSSIELST FT HTESPGCTPASLKGAPNPLVTVEPLLPAFSSHPGPVYEFGGGVFRPAIAGK FT YTCNKNFNLPVNPVDSSNAMSIASPPSSSSSSTILSCSSHAEPPGCTPASL FT KGAPNPLVTVEPLLPAISSHPGPVYEFGEGVFRPAIAGKYTCNKNFNLPVN FT SVDSSNATSIASLPXSSSSSTILTCSSHAESPGCTPASLMEAPNPLVTVEP FT LLPAFYSHPGPVYENGEGVFQPAQAGKYSQAPTYALPDATEHSSTLGRQED FT IMLPPHAVQSPNADAIEIIENSNVMVGNGADRLLSIYYQNVRGLRTKIVQL FT RLLLSSCDYDVLIFTETWLRADIESSEISTDYTFFRCDRNERTSHHSRGGG FT VLIAVKNRLNCDSIEMTNYERLEQIVVCIRFQKHSLYIAAIYLPPNSIPEL FT YAKHADAMQFIVEQAEATDIVMSIGDFNLPNLCWQQDDDINGLIPTNVTAE FT HEQSLIEAMFAVGFRQINSLVNSNSRLLDLAFINLPERVDLTAPSSPLLPM FT DIHHPPFILLLCENDELLNLADDENYEVKFNFEVCDFELLSDIFGSTDWNA FT LLGNTNLEAMLSAFYQKLNSVFGEHVPKTRRNTRSIFNKPWWTPELRNQRN FT NLRKARKRFFQSRDDCDRSHLQEIEIRYKTLLLSTYESYISTVQSSLKQNP FT SRFWEFVKNLQSSNRIPPSVTYNGIEAHNTSEAADLFAEFFESVFSKVSPV FT QRSNIFAHLQEHDFSFPDLQFSPDEVLEALRDLDIKKGPGTDGIPPLVIRN FT CAAALCIPITCIFNCSLRERTFPAAWKMAYIVPVYKSGSRNHVSNYRGISI FT LCCLSKILEKLIHKTLSNTLTSIISESQHGFMKNRSTTTNLMSYVSTLFRE FT VEARQQVDSIYVDFAKAFDTVPHIVVIEKFKRLGFPRWLTEWLYSYLTNRY FT AFVMVNSTRSRTFNITSGVPQGSVLGPLIFNVFINDLYSLLSSCNLSFADD FT LKFYRTISSPQDCIALQEDIDIMLIWCDNNGMRVNSGKCKIISFTRCNTSY FT QHRYFIGAEELVRVNSICDLGVTIDFKVRFNEHIGLTVAKAFSVLGMIRRH FT TNSFTDVYALKTLYCSLVRSILEYASPVWCPSQVTQILSIERVQKKFLRFA FT LSRLPWNDARNLPSYHERCQLIRLESLSARRTNQQRLFIFDLIKGGIDCPA FT LLELLSFNAPSRRFRNAVLLYIPNHRTNYGYNSSLSSCLRAFNDVGDKFDF FT DVTKNIFLNRIRN" XX SQ Sequence 5332 BP; 1492 A; 1276 C; 1105 G; 1457 T; 2 other; tctggcatca ctgcttatgt atgtatgcaa gccaatttcg gtctcgtaat ttatattcgc 60 ttttaacgtt aacttggaag tgtaaaatcg tagttttgat ttaatcgtcc gtctgtgtat 120 tctctgaaga aaattagtgg acttttatgt gctagtgtga cgtgcattag tgattaattc 180 ctatttgacg agtgatttgt tttgctcgtt ctggttctgc cattcatcaa acgcttactg 240 cccatcgcgc cgaattattt ttcacctgaa gtgacatcta ccagcattca tcgcaaactg 300 gtgctaagcg ttgcttatcc gcttatttgt gctttcgggt accgctactg ttcaaaagtg 360 ttccaggaaa catacccata ggcgcatcga gatggctaaa aattgtggaa catgctcaaa 420 agctatcaac ggcattgacg tggtagtgtg ccgtggttac tgtggaggat tttttcattt 480 gaatgaatgc tctggggtaa caagagcgat gcaatcgtac ttcacatcaa acagaaaaaa 540 ccttttctgg atgtgcgata actgcgccga attgtttgag aactcccatt tcagagtaat 600 ctcaaatcaa gctgatgaga aatctcctct caactcgctt gcatcggcta ttactgagct 660 gcgtacggaa attagacaac tgcatgcaaa gccagtagcc tatccatctc cagcagaatc 720 accgcgctgg ccttcttttg atcaaagaag aggtaccaag cgtccccgta taatcgaaac 780 gaatgtgcgc gcccaggata gctgtcgtgt tggtagtaaa aaggcagaag agaacgttgt 840 atcagtcgcg atatgtaaat ctgaagcaga taatagattt tggctatatc tgtctaaaat 900 tcgcccagac gtcacaacgg aagctgtatg tgaaatgacg aaagctaatc tcaatatgga 960 aaacgatccg atagtggtta aattagtacc gaaaggaaaa caaatcgaat cactttcttt 1020 cgtctccttt aaaattggcc tggacccgtt actcaaacga aaggcacttg accctgaaac 1080 ttggcctgaa ggactgctgt tccgagaatt cgtcgactat agtgcctcaa aatttcgatc 1140 tacgctgaat atgaacacga gaatgactcc gttgctacaa cctcaaacgc cggtatcttc 1200 agtgacccct gttatggacc tgagctaacc ttcatcaacc caggacgcaa aagtattaga 1260 actatggaag ccccgtattc ccccaacaca gtcccgtttt tcctgcctac gctcatcagt 1320 cgtcctgttc ctgtgtttgg gtgcgaagaa ggggtcttcc aaaccgtctc aattggcaag 1380 tacacatgcg atgtgaacaa aactcttccg gtaaagtcat ccaattctag caacacgtca 1440 ccaaactaca tctgttcgag atcacgccat caaaaacatg ttacaatcat gaataatcaa 1500 ccatcatcgt ccatcgagct ttcaacacac accgagtcac cgggatgcac gcctgccagt 1560 ctcaagggag cccctaatcc tctcgtcaca gtcgagccac tcctgccagc gttcagcagc 1620 catcccggtc ctgtgtacga gtttggaggg ggggtcttcc gacccgcaat tgcaggcaag 1680 tatacgtgca ataagaattt caacctcccg gtaaatcccg tcgattctag caacgcaatg 1740 tccatcgcaa gcccaccatc atcatcatcc tcatcaacaa tcctatcctg ttcatcacac 1800 gccgagccac cgggatgcac gcctgccagt ctcaagggag cccctaatcc tctcgtcaca 1860 gtcgagccac tcctgccagc gatcagcagc catcccggtc ctgtgtacga gtttggagag 1920 ggggtcttcc gacccgcaat tgcaggcaag tatacgtgca ataagaattt caacctcccg 1980 gtaaattccg tcgattctag caacgcaack tccatcgcaa gcctaccawc gtcatcatct 2040 tcatcaacaa tcctaacctg ttcatcacac gccgagtcac cgggatgcac gcctgcaagt 2100 ctaatggaag cccctaatcc cctcgtcaca gtcgagccac tcctgccagc gttctacagc 2160 catcccggtc ctgtgtacga gaatggagag ggggtcttcc aacccgctca agcaggcaag 2220 tactctcaag ccccgacata tgctctacct gatgcgactg aacattccag cactttaggc 2280 cgtcaggaag acattatgct acctccacac gccgttcaat cacctaacgc agatgctatt 2340 gaaatcattg aaaattcaaa cgttatggta gggaacggtg ctgatcgcct tctctccatc 2400 tactatcaga atgttagagg cttacgaact aaaatagttc aactgcgctt gctgctcagt 2460 agctgtgact atgatgtttt aatattcaca gagacatggt tgcgtgcgga tatcgaaagc 2520 agtgaaattt caactgacta cacttttttc cgatgtgatc gcaatgaacg tactagtcat 2580 cattcacgcg gtggtggagt gctcattgca gtcaagaaca gattgaactg cgattcaatc 2640 gaaatgacaa actacgagag actcgaacaa attgttgttt gtatcaggtt tcaaaaacat 2700 tctctctata ttgctgccat ctatcttcca ccaaactcga tccccgagct gtacgccaaa 2760 catgctgatg caatgcaatt tatcgttgaa caagcagaag cgactgacat tgttatgtcg 2820 attggtgatt tcaatttacc caacctatgc tggcagcagg atgacgacat aaatggcctc 2880 attccaacga atgttactgc tgaacatgaa caaagtttaa ttgaagctat gttcgctgtt 2940 ggctttcgac aaatcaatag tctcgttaat tcaaatagta gacttcttga tttagctttc 3000 atcaatcttc ctgaacgcgt agacttgact gctccttctt caccgttgtt gccgatggac 3060 atccatcatc caccgtttat ccttttgctc tgtgagaacg acgaacttct gaacctagcg 3120 gacgacgaaa attacgaagt caagttcaat ttcgaagtat gtgattttga gctcttgagt 3180 gacatttttg gaagtacaga ttggaatgct ttgcttggaa atacgaatct tgaagcgatg 3240 ttatcagcct tttaccaaaa actgaacagt gtattcggtg aacacgtgcc aaagacaaga 3300 cgtaacacga gatcgatttt taataaaccc tggtggactc ccgagctacg caaccaacgt 3360 aacaacctca gaaaagcacg gaaacgcttc tttcaatcaa gggacgattg cgacagaagt 3420 catcttcaag aaattgagat acggtataaa acgctacttt tatccaccta cgagagctac 3480 atttctacag tacagtcaag tctcaagcaa aacccgtctc gattttggga attcgtgaaa 3540 aatcttcaat ctagtaatcg cattcctccg agcgtaacat ataacggaat tgaagctcac 3600 aacacttcgg aagccgctga tctatttgca gagttctttg aaagtgtatt cagcaaagta 3660 tcgcctgtgc agcgttcgaa tatttttgca cacttgcaag agcatgattt ttcatttcct 3720 gatttacaat tttcaccgga tgaagtccta gaagctctac gtgatctgga tataaaaaaa 3780 gggcccggaa ccgatggtat tcctcccctg gtaataagga attgcgctgc tgcattgtgt 3840 attccaataa catgcatttt caattgttcc cttcgcgaaa ggacatttcc ggctgcatgg 3900 aaaatggcct acattgttcc ggtttacaaa tccgggagcc gtaatcatgt ttctaattat 3960 cggggtatat ccatactatg ctgtctttcg aaaattctgg agaagttgat tcacaaaact 4020 ctgagcaaca cactgacatc gatcatctcc gaaagtcaac atggattcat gaaaaatcgc 4080 tcaacgacaa cgaacctaat gagctatgta tcgacgttgt ttcgggaagt ggaagcaagg 4140 caacaggttg actctattta cgtagacttt gccaaagcat tcgacacggt gccgcatatt 4200 gtggtgatcg agaagttcaa gcgcttgggc tttccacgat ggctcaccga gtggctctac 4260 tcatatctaa cgaaccgata cgcctttgtg atggtcaact caacgcgttc tcgtacgttc 4320 aacattacgt ccggagttcc acaaggaagt gttttggggc cacttatatt caacgtcttc 4380 ataaacgact tgtattcgtt gctctcttca tgcaatctgt cgttcgcgga tgatctgaaa 4440 ttctaccgca ccatatcatc tcctcaggac tgcatcgccc tccaggagga catcgacatc 4500 atgctcatct ggtgcgacaa caacggcatg cgcgtgaata gcgggaaatg taaaataatt 4560 tcgtttactc gctgtaacac ctcttaccag catcggtact tcatcggggc ggaggagcta 4620 gtccgcgtta attcaatttg cgaccttggt gttacgatag atttcaaggt gagatttaac 4680 gagcacatcg ggctgactgt cgccaaagca ttctccgttc taggaatgat tcgaaggcac 4740 accaactctt tcacagatgt ttacgcctta aaaacgctgt actgttcttt ggtgcggagc 4800 atcttagaat atgcctctcc tgtctggtgt ccatctcagg ttacgcagat actttcaata 4860 gaacgagtgc agaaaaaatt cctgaggttt gcgctgagta ggctaccgtg gaacgatgca 4920 aggaatttgc cgagctacca tgaacgatgc cagctgataa gattggaatc tttgtcagcc 4980 agacgtacca atcagcaacg tttgttcata ttcgacctga tcaaaggggg aatagattgc 5040 cctgcactac tggagctgct gagtttcaat gctccttcta gaagatttcg gaacgctgtt 5100 cttctgtata taccaaatca tcggactaac tatggttata acagctcctt gagttcttgc 5160 ttgcgagcat tcaacgatgt tggagacaaa tttgattttg acgtgacgaa gaacatattt 5220 ttaaatagaa taagaaattg attttacttt ttttatatac taaattcagt ctgtacgata 5280 tatcgaagac ggtgtaaata aataaataaa taaataaata aataaataaa aa 5332 // ID Copia-14_SI-I repbase; DNA; INV; 4262 BP. XX AC AEAQ01018334; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_SI_; KW Copia-14_SI-LTR; Copia-14_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4262 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01018334; Positions 366 4627. XX CC Positions [1534-2061] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 115..4260 FT /product="Copia-14_SI-I_1p" FT /translation="MNVSSLTRIETLNKENYDTWKMQMEALLVKNDAWVYA FT NGECQKPVLAGDNSNESAVKTWVKNDNKAKSDIILAISPSELKLVKGCETS FT REVWLKLENTYQSKGPARKATLLKQLTLQRMEDGGDVREHINKFFDAVDKL FT NEMEVVINPDLLAIMLLYSLPPSFENFRCAIESRDELPNPDILRVKIMEES FT DARKSESRNTVQNAMIAKKGAYKQWNSNAKKNASSESKEKFKFQCHRCRKF FT GHKSAECRSKSEKNSQAAKNTEDISLRASEISRVGDTAMTGSQIREEAWCL FT DSGASSHLCKELQDFTETHKVARGELSLANNSSTEIMARGTALFATEVYGK FT TNNVYLKDTMYVPDLRSNLLSVGKITDNNYDVIFKKDVALIVDYEGNTKLI FT AERKGGLYFVREDQRQECKAISEPDKRSSGTLENWHRRLGHLNIRDLKNAQ FT RNGSLIGLNFNQFDDNFNCDVCVKGKMTRTPFPKRSNRRTESLDIIHSDVC FT GPMRVESMGKAKYYVTFIDDNSRWCEVRFLKSKSEVFEKFKEFQRVVENQK FT GRKIKHLQSDNGGEYRSREFDDHLKECGIVRRLTIAHNPEQNGIAERKNRT FT LLDMARCLLIQSGLPPSFWAEAISTANYIRNRCPTESLGGNTPFEVWTGEK FT PDVSNFKEFGCPVFCLDRNTAKGKFDDRCKRGVFLGYSEQSKGYRVWIPDE FT RKIEVTRDVAFQEAPNVPPEEYEDFSPENNIEDTETEPKDSLRREVVIENL FT SLEDTTDDTLLDEGENSEVEELNDDVISDDDLRGENSPPIACRAPGRPRII FT RTGSRGRPRKLFQSRYADASGVEAESVLLSEIPIEQAVVGPDSDEWYDAMA FT SELKSIIKNDTWTIVERPKDAEVIKSRLVLRNKYKPNGTIERRKARLVARG FT FAQRPGVHFNQTFAPVARLSSIRLLVGIAAHHGMKIHQLDVTTAYLNGKLE FT EELYMEPPRFIIDALKRLIEKETRSDITVKAMTMSEELSTGEKVCLLKKSL FT YGLRQAGRNWHETLSGALRKIGATPTNADPCVYRLGRGEDIVLIAAYVDDM FT LIASKSAKKIQEVKTSLALEFDVKDLGEIKHCLGIEFTQCGGTISMHQSGY FT IKDVLSRFGMSDCKPVSTPMDSNVKLVKPEENSNTQVADAPYRELVGALMY FT LAVSTRPDIAFAVSSLSQFNDSYDQTHWTAAKRVLRYLKGTMNLGIVFKPT FT TDPLRCFVDSDWASCPTDRRSYTGYITVLSNGPVSWEARKQRTVALSSTEA FT EYMGLTEAAKEAIHLRGFLTELEFKDLTNVVIYNDNIGAQRLAENAVFHAR FT SKHIDVRHHFIRDALKSDSLKVKYVPTEDMMADLLTKGLPGPKHKRCVELL FT GLGPPQVSIEASTHVSRGS" XX SQ Sequence 4262 BP; 1320 A; 944 C; 1080 G; 918 T; 0 other; caggttatgg gcccagacac ccgaaaagct cacaatcgaa acgggaattg actgaggagc 60 gaaatcaacg tgtagaggtt agttttctcg caactcgtac aatcgtgatt aacgatgaac 120 gtttccagct taacaagaat cgagactctg aataaggaga actatgatac ctggaagatg 180 caaatggaag cattactggt gaaaaacgac gcctgggtat acgctaacgg cgaatgtcaa 240 aaacccgtgc tcgcaggcga taactctaac gaaagcgcag tgaaaacgtg ggtaaagaac 300 gataacaaag ccaaatcgga cattatttta gcaattagcc catcggaact caaactggta 360 aagggatgtg aaacgtctcg cgaggtatgg ttgaagctgg aaaatacgta tcaatctaaa 420 gggccggcca ggaaggcgac actgctcaag cagctcactc tacaacggat ggaggacggc 480 ggcgacgtgc gtgaacacat caacaagttc tttgatgcag ttgacaagct gaacgagatg 540 gaggtcgtaa taaatcctga tttacttgca attatgctgc tgtatagttt gcctccaagc 600 ttcgaaaact ttcggtgcgc tatcgagtcc cgagacgaat taccaaatcc ggacatactt 660 cgcgtgaaaa tcatggagga aagtgacgcc cgaaaaagtg aatcacggaa caccgtacag 720 aacgcgatga ttgccaaaaa gggggcttac aaacagtgga actcgaacgc gaagaaaaat 780 gcatcgtcgg aatctaaaga aaaattcaaa tttcaatgtc accggtgcag gaaattcgga 840 cacaagtcgg cggaatgtcg aagcaagagc gagaagaatt cacaagccgc taagaatacc 900 gaagacataa gcttgcgtgc ttctgaaatt tcaagggtag gagatacagc catgactgga 960 agtcagatcc gagaggaagc atggtgcctg gatagcggcg cctcttcaca cctttgcaag 1020 gagctgcagg actttacgga gactcataaa gttgcgcgtg gagaattgag cttggccaat 1080 aactcgtcca ctgagataat ggccaggggc acggcgttat tcgcaacaga agtttacggt 1140 aagacaaata acgtttattt aaaagacaca atgtatgttc cagacctacg ctcaaactta 1200 ttatccgtcg gaaaaattac ggacaacaat tacgacgtaa tattcaaaaa ggacgtagcc 1260 ctgatagtcg actacgaagg aaacacgaaa ctgatagcgg aaagaaaagg aggtctatac 1320 tttgtgcgcg aagatcaacg acaggaatgc aaagccattt cggagcctga taagagatcc 1380 tcgggaacgt tagaaaattg gcatcgacgc cttggtcact taaacatacg agatctcaag 1440 aacgctcaac gtaacggatc tctgattgga ctgaatttta atcaattcga tgacaacttt 1500 aattgcgacg tgtgcgtaaa agggaagatg acaagaaccc cctttccaaa gaggtcgaac 1560 cgacggacgg aatcgctcga catcatacac tccgacgtgt gcgggccaat gagagtggaa 1620 tcaatgggta aagctaaata ctacgtcacg ttcatcgacg acaattccag atggtgcgag 1680 gtacgatttt taaaatcgaa aagtgaagtt tttgagaagt tcaaggaatt ccaaagagtc 1740 gtggaaaatc aaaaaggaag aaaaataaag cacctccagt ccgataacgg aggagaatat 1800 cgaagcaggg aattcgacga tcatctaaaa gaatgcggca ttgtcagaag gctcaccata 1860 gcacacaatc cggaacaaaa cggaattgcg gaaaggaaga accgcacact tctcgacatg 1920 gcccgatgtc ttctgattca gtccggtctt ccaccgtctt tctgggcgga agcgatttcc 1980 accgccaact atattcgaaa tagatgccca acggagagtc tgggcggaaa tacgccattc 2040 gaagtctgga caggggagaa accggatgta agcaacttca aagaattcgg atgccccgtt 2100 ttctgcctcg acagaaacac agcaaaggga aagttcgacg atcgttgcaa gagaggagtt 2160 tttctgggat actccgagca gtctaaaggt taccgcgtgt ggatccccga cgaaagaaaa 2220 attgaagtca ctcgggatgt agctttccaa gaagcaccca acgttccacc tgaagaatac 2280 gaagatttta gtccggagaa taacatcgaa gatacggaaa cagagccgaa ggattcattg 2340 cgccgtgaag tcgttatcga aaacctatct ttggaggata ctactgacga tacattactc 2400 gacgaagggg aaaacagcga agtcgaggaa cttaacgacg acgtgatatc ggatgatgat 2460 ctccgtgggg aaaactcacc gccgattgct tgcagagcac ccggacgtcc tagaattata 2520 cgcactggat caaggggaag gcctcgtaag ttatttcaat cgagatacgc cgacgcctct 2580 ggcgtggagg cagaatctgt tttgctttca gaaatcccta ttgagcaggc tgtcgtcggt 2640 cctgactctg acgagtggta cgacgcaatg gcctcggaat taaaatcgat catcaagaac 2700 gacacttgga caatcgtcga acgccctaag gacgctgaag tcattaagag tcgattagta 2760 ttgaggaata aatacaaacc gaacggaacg atcgagagaa gaaaagctcg cctcgtggcc 2820 cgaggctttg ctcaacgtcc gggagttcat ttcaatcaaa ctttcgctcc tgttgcacgc 2880 ttgagttcaa tccgacttct tgtggggatt gcagctcatc acggaatgaa gattcaccaa 2940 ctcgacgtaa ctaccgctta tctaaatggt aaactcgagg aagagctgta tatggaaccc 3000 ccgagattca tcattgacgc cctaaagcgt ctgatcgaga aggagactcg aagcgacatc 3060 accgtcaagg caatgactat gtcggaggag ctgagtaccg gcgaaaaagt gtgcctctta 3120 aagaaatcgc tttacggtct tcgtcaagcc ggaaggaatt ggcatgaaac gttgagcgga 3180 gcgctgagga aaatcggggc aactccgact aatgcggacc cttgtgtata cagactcggg 3240 cgaggggaag acatagtact tatcgccgct tacgtcgacg atatgttaat cgcttcgaaa 3300 agtgcgaaga agattcagga agtcaaaacg agtttagcac ttgaattcga cgtaaaggat 3360 ctcggcgaga tcaagcactg tttaggaatt gagtttacac aatgcggagg cacgatatct 3420 atgcatcaat ctggttatat caaagacgtc ttaagtcgtt tcggaatgtc cgattgcaag 3480 cccgtgagca ctccaatgga ctccaacgtc aaactagtga aaccggagga aaactcaaat 3540 actcaagttg cggacgctcc ctatagagag ctggttggag ctttaatgta ccttgccgtt 3600 tctacgagac cggacattgc gttcgcggtc agctcgttga gtcagttcaa cgatagttac 3660 gatcaaactc actggaccgc tgccaagaga gtgctgcgtt atttgaaggg cactatgaat 3720 ctcgggatcg tgttcaaacc tacaaccgac cccctgagat gcttcgtcga ctctgactgg 3780 gccagttgcc ctactgacag gcgttcgtat acggggtata tcaccgtttt aagcaatgga 3840 cccgtatcct gggaagccag gaaacagaga accgtcgctc tatcttcgac cgaagcggag 3900 tacatgggac tgacggaagc cgcaaaggaa gccattcacc ttcgcggctt tcttactgag 3960 ttggaattta aggatcttac aaatgtcgtc atctacaacg acaatatagg ggcgcagaga 4020 cttgccgaga acgcggtgtt ccacgcaagg agcaaacata tagacgtgag acatcacttc 4080 atcagggacg ctttgaaatc cgattcatta aaggtcaaat atgtacctac cgaggacatg 4140 atggcggact tactaactaa gggtttgcca ggtccaaaac acaagaggtg tgttgagcta 4200 ttgggactcg gaccccctca agtcagcatt gaagcttcta cccacgtatc gaggggaagt 4260 at 4262 // ID Gypsy-48_AA-LTR repbase; DNA; INV; 199 BP. XX AC supercont1.113; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-48_AA_; KW Gypsy-48_AA-I; Gypsy-48_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.113; Positions 2187287 2187089. XX SQ Sequence 199 BP; 71 A; 38 C; 28 G; 62 T; 0 other; tgttgtaccc tttgtgcatt tcccaccgtg caactgatta ttcctactcg ggtattcatt 60 aatgtcataa caaaaataaa cacaacctga accagtgtga ttgattctgg aagctaactt 120 ttaacttgat taataaagta ctttaaaaga gtaaagttta atacaaataa ctattgcgaa 180 taatccgaac cagatcaca 199 // ID Gypsy-86_AA-LTR repbase; DNA; INV; 273 BP. XX AC supercont1.246; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-86_AA_; KW Gypsy-86_AA-I; Gypsy-86_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-273 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.246; Positions 1418589 1418317. XX SQ Sequence 273 BP; 77 A; 51 C; 55 G; 90 T; 0 other; tgtaggaata ttatcttgaa cacccctgca aactctccca gttgcacatc atctctgagt 60 catcagcata tcgattggaa cgaacagctt gggaagacag tgtttaatag tgagcgcaat 120 cgatcggtcg tattacggtt ttagtggaga atatacgtat tgaatttatt cgccggggtt 180 taagttacgt ttttcgatat ccaaagtttt catctagacg cgtatccctt aattttcagt 240 taaaagtgca aatagttagt catatttgct aca 273 // ID hAT-41_HM repbase; DNA; INV; 3524 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-41_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3524 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2029-2029 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 731..3307 FT /product="hAT-41_HM_1p" FT /translation="MSKRQRDVGRKYLSGNMKRTLAKVKKTEAEKEKGALD FT KYLSKSVDLFETEKSDKQVESDDSETNISKEITSEASSSFLSGCNAQQNTS FT ELCINISVNTHKDNEDISESREENSNPALCDILKIKDDPATWPNKINQNVR FT DYLVNEGPPKITVENFPQNEKGLHFSKFHCKRKLKNGEVIERPWLIYSESF FT DKVYCYYCKLFDTNSDSALATSGFDSWSNIHTRLEDHEKSKKHLQCTLNCY FT ELQQRLSTGLTVDTLNEKLIRQETKRWNQVFERLVASVQFLAERNLAFRGS FT EEQIGNPHNGNFLGVIELLGKFDPVMQDHIQKIINKDIHDHYLGKTIQDEI FT IDTIGQAVLQEIIARIKSAKYFAVILDCTPDISHQEQMSMVLRYVADGTHS FT DVPAGIYEHFIKFIIVESSTGENLFNTLVKEIEMLGLDVENIRGQGYDNGA FT NMKGHNSGVQARLLERNSRAFYTPCACHNYNLVLGDMAKTCSDAMTFFGTL FT QRIYALFASSTKRWTVFKKHVKGLSVKPLSETRWECRMESVKAVRFQAAEV FT CDALEELAKSTNDAQGKSEAESLVSQMTNYRFLVALCFWHSLLFQVNFTSK FT ELQSNTIDIAAGLTSFEKLFDWLKTYRETGFVAALIDAKELANELEVEPIF FT RQKRCYKKKKTFLYESSDEPILDPKTEFRVNFFNQVVDKALQSLQPRFMQL FT KEHHKLFGFLYNFQNMSKEDLRKHSADLEIALTDITKDIEGFMLSEEMEAI FT KPILPVQQQKPKELFKYLACNDRSTAFPNLFIALKILMTIPVTVASGERSF FT SKLKLIKTYLRSVINQERLNNLALISIESPISREINYEKILKDFASKKARK FT IHFDL*" XX SQ Sequence 3524 BP; 1239 A; 522 C; 631 G; 1132 T; 0 other; cagggccggt cttaggccga tttgaccgat tgctccaaat agggccccgc gcttggtagg 60 ggccccgcaa tttatagcat atcattataa atggcagtaa ttttttttta taaatttcta 120 tcaatcttaa tataattcgt acagttaacg ttgaacgttt aatgcgcatg gtaacgtcta 180 ttgttatttt tagtttatct tagttattgc ggtatatggt ataaaatata ttagtttcca 240 atctactttc gaaagttcca gaagtactta tttccgaaaa ttcacgaggc aagcaatgaa 300 catggcggca gcttttatat tgttaaagat taagctgtat ttacttttga aatgtttgta 360 atatattttt tgtttgttta tgctataatt atttttcatt ttgtttacta ttagttttat 420 ttacttttgt ttaacgcaaa cttttctcgc ttttcagtaa atattttata tttaagacaa 480 tataaatttt attggataat tttaagagta ctttttttca agtctaataa ataatcttaa 540 tattaaatgt ttgtttattt atattaagat atttgcagta atcaattgct tacatttgtt 600 aattttactt ttaattttgt aagggggatt ttttatttat tttagtttaa ttaaactatt 660 ttattttatt gtgcaattta gtagtaaaaa aagagataag tttttagttt tgaagaacac 720 aagccaaaat atgagtaaaa gacaaaggga cgtcggtaga aaatatctaa gtggtaacat 780 gaagagaacc ttagctaagg taaagaaaac tgaagcagaa aaagaaaaag gtgcgttaga 840 taaatatttg agtaaatcag tagatctgtt tgaaactgaa aaatcagaca aacaagtcga 900 gtcagacgac agtgaaacaa atatttctaa agaaattaca tcagaagcta gtagtagttt 960 tctttctgga tgtaatgctc aacaaaatac atctgaatta tgtataaata tatctgttaa 1020 cacacacaag gataatgagg atatttcaga atcaagagaa gaaaattcaa acccagcact 1080 atgtgatatt ttaaagataa aagatgaccc tgctacatgg ccaaataaaa taaatcaaaa 1140 tgtcagagat tatttagtga acgaaggacc ccctaagatc actgtagaaa attttccgca 1200 aaatgaaaaa gggttacatt tttcgaaatt tcactgtaaa agaaaattaa aaaatgggga 1260 agttatagaa cgaccatggc taatatattc tgaatcattt gataaagttt actgttacta 1320 ttgcaaactt tttgacacaa attctgattc tgcacttgct acttctggat ttgacagctg 1380 gtcaaatatt catacaagat tagaggatca tgaaaaatca aaaaaacatt tgcaatgtac 1440 acttaattgc tatgaactgc agcaaaggtt atctactgga ttaactgttg atacattaaa 1500 tgaaaaatta ataagacagg aaacaaaacg ctggaatcaa gtctttgagc gacttgttgc 1560 ttctgtacag tttttggcag aaaggaactt ggcatttcgt gggtctgaag aacaaattgg 1620 aaatccacat aatggcaatt ttctaggcgt cattgaattg ctaggaaaat ttgatccagt 1680 tatgcaagat catatacaga aaattattaa caaagacatt catgatcatt atttaggcaa 1740 gaccatacaa gatgaaatta ttgatactat aggccaagct gttctacagg aaattattgc 1800 tagaattaag tcggcaaagt attttgcagt gattcttgac tgcactcctg atatcagcca 1860 tcaggaacag atgtcaatgg tgctaagata tgtcgctgat ggcacacact cagatgtccc 1920 agcaggaatt tatgaacatt ttataaagtt tattatagtt gagagcagca ctggtgaaaa 1980 tttatttaac actcttgtga aagaaattga aatgctagga ctagacgttg aaaacatcag 2040 aggacaaggg tatgacaatg gagctaatat gaaaggacac aattctggag tgcaagcacg 2100 acttttggaa agaaattcac gagctttcta tactccttgt gcgtgccata attataatct 2160 tgttctagga gatatggcta aaacgtgttc agatgccatg acattttttg gaaccctgca 2220 gcgtatttac gccctttttg cttcatctac aaaaagatgg actgttttta aaaagcatgt 2280 gaaaggcctg tctgttaaac cattatcaga aacaagatgg gaatgtagaa tggaaagtgt 2340 taaagctgta cgatttcaag ccgcagaagt gtgtgatgct ttagaagaac tagcaaaaag 2400 cacaaatgat gctcaaggga aaagtgaggc tgaatcatta gtaagccaaa tgacaaatta 2460 taggtttctg gttgcactat gtttttggca ctctttattg tttcaagtaa attttactag 2520 caaggaactt caaagcaata ccatagacat tgccgcgggg ctgacatctt ttgaaaaatt 2580 gtttgattgg ttgaagacat accgagagac aggctttgta gctgcattga tcgatgccaa 2640 agagcttgca aatgagttag aagttgaacc catatttcga cagaagcgct gttacaagaa 2700 aaagaaaacg tttctttatg aatcttctga cgaaccaatt ttagacccca agacagaatt 2760 ccgggtaaat tttttcaatc aagttgtaga caaagcattg caatctcttc aaccacggtt 2820 tatgcagctg aaagaacacc ataaactttt tggatttcta tataattttc agaatatgtc 2880 taaagaagat ctcagaaaac attcagccga cctagaaatt gctttgacag acattacaaa 2940 agatatagaa gggttcatgc tttccgaaga aatggaggca attaagccaa tactgcccgt 3000 tcaacaacaa aaacccaaag aactctttaa atatttggct tgtaatgata ggtcgactgc 3060 atttccaaat ttgttcatag ccctaaaaat actgatgaca attccagtta cagtcgcttc 3120 cggtgaaaga agtttttcaa aactaaaatt aattaaaacc taccttaggt cagtaattaa 3180 ccaagaacga ctaaacaatt tggcactaat atcaattgaa tctcctataa gtagagaaat 3240 taattatgaa aaaattctaa aagatttcgc aagcaaaaaa gcaagaaaga tccactttga 3300 cttatagatt atttgtttat tatgacaata acaagaactg ttttgtttaa attcattgta 3360 atatttttat tattgtttat tgattcgttt gtggagttat gtttttttgt ataataaatg 3420 tgtttcaata attttattaa tagtagttat tgtaaaaaaa aaaatcaggt gggccccgca 3480 agcattcgtt aaattgggcc ccgcaattcg taaggccggc cctg 3524 // ID Chapaev-1_ACa repbase; DNA; INV; 4112 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 30-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon - a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-1_ACa. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4112 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 775-775 (2007). XX DR [1] (Consensus) XX CC Chapaev-1_ACa is a very young family of DNA transposons. The CC genome contains several copies of Chapaev-1_ACa that are less CC then 1% divergent from the consensus sequence. Chapaev-1_ACa CC belongs to the Chapaev superfamily. Hallmarks of the Chapaev CC transposons are 4-bp target-site duplications, terminal inverted CC repeats with the conserved '5-CAC and GTG-3' termini, and the CC Chapaev transposase. The Chapaev transposase is characterized by CC the conserved D-x(60-80)-D-x(220-290)-E catalytic triad. Chapaev CC transposons populate genomes of different animals, including sea CC urchin Strongylocentrotus purpuratus, amphioxus Branchiostoma CC floridae, starlet sea anemone Nematostella vectensis, sea hare CC mollusc Aplysia californica, mosquitoes Aedes aegypti and Culex CC pipiens, and nematode Caenorhabditis elegans. The N-terminal CC portion of Chapaev transposase in Chapaev-1_ACa, Chapaev-2_ACa, CC Chapaev-3_ACa, Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, CC Chapaev-2_NV, Chapaev-3_NV, and Chapaev-1_SP is similar to the CC N-terminal portion of RAG1 (100-370 aa in the human RAG1). It CC includes a novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS 727..3540 FT /product="Chapaev-1_ACap" FT /note="Transposase." FT /translation="MEHTHEIHSSSLNELCRICGCKNVTKVQQRQKRKART FT CDSLQSDILLIYEIDIREDTDDKYSKFICHKCHMRIYDTKKNTTPSTLKKA FT RDLVRNSQHIWCTFENQTTIKDCSVCRHRVNLSDGCLKPAKEPAATVAGDS FT PSTSPSLHSQHSPVTPSQTTSTYDDPTTRPSSSNTPHSSHINTVFPSQTQS FT AEQNEITCISIPLATSTPLRPSKPELKDIGTSPMCLQTDKVHQDSTTSPLF FT KTLDESDTLAHSLSQPLDAPLSKREDDVSTHLFKRKLFTTKNPQNVVTFKT FT GGQPIAVKRLIIPRKDSSEAASPTKRRRSRLLEGTRANVAGPSMSAEDIQF FT ASELKKMPGKRRQGVFNKAGAQSRIRLTPQQTLIMKETAGFSGRQGRYYGK FT ALKQVGVHLANEHSVRKLSKEVVSDSVEVEHRMFLDNQGKEQGVPYGRIKN FT LSSFVDSLLDEYVENDLLVWPDTIPQEEVWIKIGGDHGKNSFKMTLQTVNT FT YKPNAKQNTIVIATAAVKDTHENIVRFLAGGLGDDIASLSAHTWRGKRLKI FT FVNGDYDFMCKMYGLSGPQGTHPCLWCLIPKAKHYVFPETYQQRDLHMLHT FT DHAAFMAQHGGDKKGAAQHHNCLHAPLLTTELDHVTPPYLHILLGIVLKHH FT KLLEIEADKLDRTIASMTPKTLTKLGLTLMKYGQNYKTAQEIQEKIRFTRT FT CAAMSDTQEEKGTFRTETRRLRHTLSELDRVELSPRSGPVASSLDTILTKH FT RITPQAYHSRSFIGNHCHKYLNPKVYRHLTQTIVEQTQMYTYDPIIVDKAH FT TIGLIFDSLNKAYSQIHDDISHSIPIPKTSIPAIQTAIDTYMKLYRRHFPK FT KTIPKQHILEQHCIPFITQHGFGLGLLGEQGTESCHQSISKIEKRAQGIVD FT NTEKLRYVLNAHLLQTAPSLRIEAAGTEKATTAP" XX SQ Sequence 4112 BP; 1333 A; 919 C; 816 G; 1044 T; 0 other; cacagcgccg aaaactggcc gaaatcgtgc ctcttagttt cccaggccag cgacatctgg 60 aagtcattaa atcaccgtcg gaaccgctcc gtgccgggtt acgcgggggt cactggccgc 120 tccgttgcgc atcagctaca ggcgggcagg tcatggcgta atcgacatgt ttgacactaa 180 ttagcggcag tatcctcact ctaaccacga aatattccat acaaaaagaa aagaccaaga 240 ttctctctct cactctacga aaacccgatc ttaatgtggc gatatttgcc gatgttacag 300 tgtgtgcaaa atattctgaa tttcatcgaa tttaatctgt gtgtgctgca ctttattcac 360 ttctaatttc aaaagtgctc gatcaatctc aacaaatttt ggtacatgca ctgatgacat 420 atttgtgctt tacgtgtgta attttcatca tcgtttgaag aatagttatt gcaatattaa 480 gtcgtttaca tctgattctc agccagtgat tgttatttgt cccatgcaac tgtgtattct 540 ggattctata ctgttttaag ttcaacagtt ctccgttaat ttcatagatt gttgtttcta 600 tgaaaggcag acatatttgt gcttattgtg tataaatttc atttttgttg attgagccat 660 gttacgatat taggtagaat ccatttgccc gtgattcatg tatagcagtt ttactgaaat 720 ctccaaatgg aacacacaca cgagatacat tcctcctccc tgaatgaact ttgtcgcatt 780 tgcgggtgta agaacgtgac taaagtacaa cagaggcaaa aaagaaaggc taggacatgt 840 gactcactgc aaagtgatat tttattaatt tatgaaattg acattagaga agacacagat 900 gacaaatatt cgaagtttat ttgccacaaa tgccacatga ggatatatga tactaagaag 960 aataccactc catccaccct taaaaaagcg cgagatctgg tcagaaacag tcaacacata 1020 tggtgtacat tcgagaatca aactacgata aaagattgca gtgtttgccg tcatcgtgtc 1080 aatttgtcag atggctgcct gaaaccagca aaggagcccg ccgcaacagt tgctggtgac 1140 tctcccagca ccagccctag cctccactca caacacagcc ctgtcacacc cagtcaaaca 1200 acctccacat atgatgaccc aacaaccaga ccctctagtt caaacacacc acattcttcc 1260 cacataaaca ctgtttttcc ctcacaaaca caaagtgcag aacagaatga aattacatgt 1320 atttcaatcc ctcttgcaac atcaacacca ctaagacctt caaaaccaga gttgaaagac 1380 attggcactt ctccaatgtg cctacaaaca gataaggttc accaagacag cacaacatcc 1440 ccacttttca aaactttaga tgagtcagac acgttagctc actcactcag ccaacctcta 1500 gatgcacctc tcagtaagag ggaggatgat gtcagcacac atttgtttaa aagaaaactt 1560 ttcactacta aaaatccaca aaatgtagtc acattcaaaa cagggggaca gccaattgct 1620 gtaaaaagat taattattcc aagaaaagat tcttctgaag cagctagtcc cacaaagaga 1680 agacggtcgc gattgctaga aggtacgaga gcaaatgttg ctggaccatc gatgtcagct 1740 gaagacattc agtttgcctc tgaattgaag aaaatgccag gaaaaaggag gcaaggggtt 1800 ttcaacaaag ctggggctca gagtaggatt agattaaccc ctcaacaaac tttgattatg 1860 aaagagacag cgggatttag cggtaggcag ggtaggtatt atgggaaagc tttaaagcaa 1920 gtaggggtac acttggctaa tgaacattcc gttaggaaat tatcaaaaga agttgtgtca 1980 gactctgtag aagttgagca cagaatgttt cttgacaacc agggaaaaga acagggtgtc 2040 ccgtacggcc gcattaagaa tttatcctca tttgttgata gcctgttaga tgagtatgtt 2100 gagaatgacc tccttgtgtg gcctgacaca ataccacaag aggaagtgtg gatcaaaatt 2160 gggggggacc atggcaaaaa ctcgttcaaa atgacacttc agacagtcaa tacatacaaa 2220 cccaatgcta aacaaaacac aatcgtcatt gctacagccg cagtaaaaga cacacatgaa 2280 aacatagtta ggtttttagc tgggggttta ggagatgaca tcgcatcatt atcggcacat 2340 acatggaggg ggaaaaggct gaaaatcttt gtcaatggag attatgattt catgtgcaaa 2400 atgtatggtc tgtcgggtcc acaaggaacg cacccatgcc tgtggtgctt gatacctaaa 2460 gccaaacact atgtttttcc tgaaacttat cagcaaagag atctgcacat gttacacaca 2520 gaccatgctg cgttcatggc acaacatggt ggggacaaaa agggagcagc tcaacatcac 2580 aactgcttgc atgcaccttt actcacaaca gaactagacc atgttacacc tccataccta 2640 cacatattgt taggtattgt tctaaaacac cacaaattgt tagagataga agcagacaaa 2700 cttgatagga ctattgcatc catgacaccc aaaactctca caaagcttgg acttacactc 2760 atgaaatatg gacaaaatta caagacagca caagaaatac aagaaaagat ccgctttact 2820 aggacatgcg ctgccatgag tgacacacaa gaagagaaag gaacgtttag gacagagaca 2880 aggcggctgc ggcacaccct gtctgaacta gatcgcgttg agctgtcccc ccgatctgga 2940 ccagtggctt caagccttga cactattctc actaaacacc gcattacgcc acaagcttac 3000 cacagtcgat cattcattgg caatcattgc cacaaatacc tcaacccaaa ggtgtacaga 3060 catctcactc agacaattgt agagcagaca cagatgtaca cgtatgatcc aatcattgtt 3120 gacaaagcac acacaatagg tttgattttc gactcactca acaaagcata cagtcaaatc 3180 catgatgaca tctcacacag catacctatt ccgaaaacct ctattccagc aattcagaca 3240 gcaatagaca cgtacatgaa gctgtacaga agacattttc caaaaaagac aatccccaaa 3300 cagcacattt tagaacaaca ctgcattcca ttcataacac aacacggctt tggcttgggc 3360 ctgctgggag aacaaggaac tgaaagctgt caccagtcta tttcaaaaat tgaaaagcgg 3420 gctcaaggca tagtagacaa tactgaaaag ctacgatatg ttttgaacgc acatttgctc 3480 cagacagcgc cttcactgcg catcgaggct gcaggcaccg agaaagcaac cacggcacca 3540 tagataaacg tatatatcta ctgtacaaaa tgtgtaaata atctagatct gagtcattaa 3600 tgaaagatac tgtgatgtga atttaatcac aagcccgacg aatttgactt ttgaacgaaa 3660 aagcaaaaat atggattatt gatcttccaa cgatgatgtc ctttggtgag tagataccca 3720 cattatttgt ctttattctg agccagtttt agtgaattta ttcgagccct ttggttgtta 3780 cataactttt aattcggtgt atattacatt aattaccaca tataagacgg ctgtatttga 3840 tcgcatgcag ttacaaaact atacaagata tctttgttgc gttttcacaa actgtacatt 3900 gaaagaagaa gctcaaagat ggaacactgt gtatatattg ataaccctgc tttcattcgt 3960 gagcgttgaa acaaaactgt gtaaaattgc cgtggtatgt aactccgcgc gtggagtgag 4020 cgcaggtaga ctgaccggca gctgaccccg gtcacgctga cctagcggcg tacggcctgg 4080 cgaaactaaa cgaccgctga ttcggcgctg tg 4112 // ID Gypsy-210_AA-I repbase; DNA; INV; 4431 BP. XX AC supercont1.2264; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-210_AA_; KW Gypsy-210_AA-LTR; Gypsy-210_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4431 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2264; Positions 7810 3380. XX CC Positions [3537-4004] - Integrase core CC 'CTAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 353..2002 FT /product="Gypsy-210_AA-I_1p" FT /translation="MTPRSPPFRCESIEKNKLSREWETWKWSLECYFAAYD FT ISDQKIMRAKLLHLGGVELQRIFRSLPEHDKSPLVALEPKVYDLAIELLDT FT YFQTGRQDVIERRKLRKIKQEANEKFSHYVIRLRQQSLNCGFEKHPAKVAE FT ILKEIYLIDVIVENCRSDELRKAILKRDRSIREIEEIASTIEDTDQQMKEL FT KENNTSREVAVYEVNRSGQARATWRRRTTVERKPLQGYPSGRFKRQLDTKP FT IPFKTSKYSCFACGQQGHLANSRDCPARGRMCRRCRELGHFETVCKKQKQG FT SSAVKTQRNIHNIEESQEAQEDVEQNNDCDSEDPKKVFYAFYGGNETNVLE FT GVIGGVAVMMLIDSGADANLIRYETWKMMKEKEIRVVTSTKGSTRVLKGYG FT SDKPLDVVGTFKAEVTIGRSTTMAEFFVVKGGQKDILGDATAKKLGVLKVG FT IDVNQINAETKPFSKINGVKAHIRMMEDARPVFQPLRRIPIPMEDAVNRKL FT ENLLIRDIIEVKQGPSTWVSPLVIVGKASGEPRICLDLRRVNEAVVREHFP FT MPM" FT CDS 3396..4406 FT /product="Gypsy-210_AA-I_2p" FT /translation="MRMMKSHLRTNVWWSRMDQDVEKFVKQCKGCTLVSAP FT NPPEPMIRRELPDQPWRDIAADFLGPLPEGQYLLVVVDYYSRFIEVCEMSV FT ITAAEIIKELVTIFSRFGLPSTLRVDNGPQFSSKCEEFSDFCESNGISLVN FT TIPFWPAMNGEVERQNRSLLKRLRIAQELGKDWRSEMRKYLLTYHATNHST FT TGKSPAELMFGRKIRSKLPQVELTNFNDEEVRDRDAVSKEKGKVYGDAKRR FT AKESEIGTGDLVVCKRMKKSNKLDADFSAEEFEVVRKVGGDVTVKSLESGK FT EYRRNVAHLKRIGGKNKSKTVSENEATREKRTRVEPAKFKDFILH" XX SQ Sequence 4431 BP; 1472 A; 752 C; 1069 G; 1138 T; 0 other; actggcgacg aggataacgg taagagcatt gctcattgta tttatttatc gaatgggaaa 60 ccgcaatagt gcaattgaat tatggtgttg cgccacgtga tacattatta gctgagaaga 120 aaaatgttct aaaaggtgtg ttaatggatc tatcggatga tgtactgcaa aggtatggtt 180 aattttaagg aaaatatttg aattatatta ctctgaatcg tggatttcaa tagagtaagg 240 gtggcgtttc aatatcatta ttcgtgaaat ttcggctcaa atgaccaacc aaaccaccta 300 tcgatgaaaa aaaatagttg tcatagcaac atgataaaag cctaggcatg ctatgacacc 360 acgttctccc ccattccgct gcgaatcaat tgagaagaac aagctttctc gtgaatggga 420 aacatggaag tggtctttgg agtgctattt tgcagcttat gacatcagcg atcagaaaat 480 aatgcgagct aaattgctgc atttgggcgg agtagagttg caaaggattt ttcgaagttt 540 gcccgaacat gataaatcac cactagtagc tttggaacca aaggtttatg atttagccat 600 tgaacttctg gacacttatt tccaaaccgg cagacaagat gtaattgagc gccgcaagct 660 tcgaaagata aagcaggagg caaatgaaaa gttctcccat tatgtcatac gcttaagaca 720 acaatcactg aattgtggct tcgagaaaca tccagcaaaa gtcgctgaaa ttctgaagga 780 aatttatttg attgacgtca tagtggagaa ctgtcgctct gatgagcttc gcaaagctat 840 tctcaagcga gatcgatcga tcagagaaat tgaagaaata gcgtcgacaa ttgaagatac 900 ggatcagcaa atgaaagaat tgaaagaaaa caatacgtct cgggaagtag cagtttatga 960 agtcaacaga tcaggacaag ctagagctac ttggcgaagg cgcactacag tggaacgtaa 1020 accattacag ggatatccgt ctggaagatt caagcgtcaa ttggatacga agccaatccc 1080 gtttaagacg tcgaaatatt cctgttttgc ttgtggacaa caaggtcatt tagcaaattc 1140 acgtgattgt ccggctcgtg gccgtatgtg tcgtcgttgc cgtgagctag gtcactttga 1200 gacagtctgc aagaaacaaa agcaaggatc aagcgcagtg aagacacaga ggaacatcca 1260 taacatcgag gaatcacaag aagcacaaga agacgttgaa cagaataatg attgcgattc 1320 tgaggatcca aagaaagtct tttatgcgtt ttatggtgga aatgaaacga atgtattgga 1380 aggtgttatt ggaggggttg cggtaatgat gttgatagac tcaggagccg atgcgaactt 1440 gattagatac gaaacatgga aaatgatgaa ggaaaaggaa atcagagtag taacatcgac 1500 gaaaggatcg actcgtgttt taaaaggata tggaagcgat aaaccattag acgttgtcgg 1560 tacatttaaa gctgaagtta ctattgggag aagcactacg atggctgagt ttttcgttgt 1620 aaaaggtggc cagaaagaca ttttgggaga cgcaactgcg aaaaagcttg gggtgttaaa 1680 agtcggcatt gatgtgaacc agatcaacgc agaaacaaag ccgttcagta aaattaatgg 1740 agtcaaagca cacatacgta tgatggaaga cgctcgccct gtgttccagc cactacgtcg 1800 gattccaatc ccaatggaag atgcagtaaa cagaaagctg gagaatttgt tgatacgtga 1860 cattatcgaa gtgaaacaag gaccgtctac atgggtttct cctctggtaa tcgtaggaaa 1920 agcgtctggg gaacctagaa tttgtttgga tttgagaaga gtgaacgaag cagtagttcg 1980 agaacatttc ccgatgccga tgtagacgag tacttggcca gattgggtgg aggtaaaatg 2040 tggagtaagc tggatatccg tgaagcattc catcaagttg agctggccga agattctcga 2100 gacgtgacaa ctttcatcac taacaaaagg attgttcagg ttcaagagac taccgttcgg 2160 acttgtaaca gctcccgaag ttttccaaag aataatggaa gaaattctag ccggttgcga 2220 aggtacatac tggtacctag acgatgtgat ggtagaagga gaaacaaagg aaattcatga 2280 taagaacttg aagaaggtac tgtagaaaga gatagatatt aaacaataaa tgatgtgtat 2340 aactagtcat gtaatttttg taggtacttg atagattcaa tgaacgcggc gttgaactta 2400 actgggagaa atgcgagttt ggggtaacaa agttagaatt tcttgggtat catatttctg 2460 ataaaggaat cagtccatct agaacaaagg tgaacgctgt tctgtcattc cgccctcctt 2520 cctgtgaatc ggaaattcgc agttttctag gattggcaaa ctatttgaat aagttcatcc 2580 ctaatctcgc tactgtagct gaaccattga gagagctaac gaagaaatca gtgaaattta 2640 tctggagcga taaacatgat cgtgcttttc agttaatcaa aaataagcta gcagcagcaa 2700 ctaaactcgg gtttttcgat gttttggata gaaccacagt gtttgcagat gctagtccta 2760 cgggacttgg tgctgttctg attcaaactg atagctctaa cgaatcacga gtgatatgct 2820 acgcatcaaa atctttaact gataccgaac gccgttattg tcaaaccgaa aaggaagctt 2880 tggcattagt gtggagtgtt gaaagattcc acattctatg gaagaccttt tgagctgttg 2940 acagactgca aagctttgga atatttattt acacctagat ccaaaccttg tgctcggatt 3000 gaacgatggg cgttacgtct tcaatcgttt gaataccagg ttgtacatat tccagggaat 3060 tacaatattg cagatagtct ttcacgactt gcaacgttga aatcgcatgc cttcgatgca 3120 gatgaggaat taattatacg ggaaatagct gtttcggcta gctcatccgt tgcactaaaa 3180 tggaatgaaa tcagagctgc aagcgaaatg gatgatgaaa tattagagat cacacaaatg 3240 ctcaagaatg aaaatgtgga aaacatgcca atagcataca aaattatttc gaacgatgtg 3300 tcattgataa tgttctatta cgcgtggatc gtatcattat acccactagt ttgagaacaa 3360 gagtagtgca aatagcacat gaagggcacc cggggatgcg aatgatgaaa agtcatctga 3420 gaacaaacgt atggtggtct cggatggatc aggacgtaga gaaattcgta aagcagtgca 3480 aaggatgcac cttagtatcg gcccctaacc ctccagaacc tatgattcgt cgagaactgc 3540 cagatcagcc atggagagat atagccgctg atttccttgg tcccttaccc gagggacagt 3600 acttgttagt cgtggttgat tactacagta gattcataga ggtttgcgag atgagtgtaa 3660 tcacagctgc tgagatcata aaagaacttg tcaccatttt cagccggttt ggacttcctt 3720 caactttgcg agttgacaat ggcccacagt tcagttcaaa atgtgaagag tttagcgatt 3780 tctgtgaaag caatggaata tctctagtga ataccattcc attttggcct gccatgaatg 3840 gagaagtaga acgccaaaat agatcgttgc tgaaaagatt acgaatcgct caggaattgg 3900 gtaaagactg gagatcagaa atgcgaaagt atttactcac gtatcatgca acaaatcaca 3960 gtaccacagg aaagtcacca gccgaattga tgtttggcag aaaaattaga agcaaactgc 4020 ctcaagtaga actcactaac ttcaatgacg aagaagtacg agatagagat gcagtatcaa 4080 aggaaaaagg aaaagtatat ggggatgcaa aaagaagggc gaaggaaagt gagattggca 4140 caggagatct tgtcgtatgt aaaagaatga agaaaagtaa caaattagat gcagattttt 4200 cagcagagga attcgaagtt gttagaaaag taggagggga tgtgacagtt aagtctttgg 4260 aatcagggaa ggagtataga cgtaacgtag ctcatttgaa aagaattggg ggtaagaata 4320 agtctaaaac tgtatcagaa aatgaagcca ctcgtgaaaa acgtacgaga gtggaacctg 4380 caaaattcaa agattttata ctacattaat tgtgtaaaag ctaagggggg t 4431 // ID Mariner-19_SM repbase; DNA; INV; 1292 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-19_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1292 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1868-1868 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 121..1122 FT /product="Mariner-19_SM_1p" FT /translation="MQTAEANLHRKITDNERSLIIRKYEDQKSIQLISEEL FT DINVKTVTSIIRLYKNTGRVNALTQRRPRGSIITEEAKEFIRSEIESDVSV FT TLGALKLKLQASLNIICSTTTIDSAIRDLNYSFKRVELVPERRNAAANIEE FT RFNYAGRYLGYDEDKVIFLDEFGVSCSTRQKYGRSLVGTTPRKVVRAIRSR FT NYSVCAAISKRKIIHHVIKDSAFNREAFLSFLRSLILNLVEANITGATIIM FT DNCSIHKGEEVRNLIVENGFELVFLPPYSPQLNPIEEVFSKWKSLIKSANA FT NTREELTAAIMSTIGRISESDCVGFFNHVRDFGVMALRREEF" XX SQ Sequence 1292 BP; 458 A; 191 C; 232 G; 410 T; 1 other; cagggttttc atattaattt aatcctttgc aaatatattt taatcccttt caagaatatt 60 ttaatcaatt caaaatttat tttaatcatt tccaaattta ttttaatccc aaaattacat 120 atgcagacag cagaagccaa tttacatagg aaaattacag acaacgaaag gtcacttatt 180 atccgtaaat atgaagatca gaaaagcatt caacttattt ctgaagagct cgacataaac 240 gttaaaacgg tcacttctat aataagactt tataaaaaca ctgggagagt aaatgcgttg 300 acacaaagac gaccacgggg aagtataata acagaagagg caaaggagtt tatcaggtcc 360 gaaattgaaa gtgatgtttc cgtaacttta ggagcgttaa aattgaaact tcaggcaagc 420 ttaaatatca tatgctcaac aaccacgatc gacagcgcaa taagagattt gaattactct 480 tttaaacgcg tcgaacttgt accagagaga agaaatgctg ctgcaaatat agaggagcga 540 tttaattatg ccggtagata tcttggatat gatgaagata aagtaatttt ccttgatgaa 600 tttggtgtga gctgttcaac ccggcaaaaa tatggccgaa gtctcgtggg aacaactcca 660 cggaaagttg tacgggctat aagatcgcgt aattattcag tttgcgcggc cataagtaaa 720 agaaaaatta tccatcacgt aataaaagat tcagcattta atagagaagc atttttatca 780 tttttaagat ctttaatttt aaatttagtt gaagcaaata ttacaggagc aacaattata 840 atggataact gtagtataca taaaggagaa gaagttagaa acttaatagt tgaaaatggc 900 tttgagctcg ttttcttgcc tccctacagt ccccagttga atcccattga agaagtattt 960 tctaaatgga agtcactcat aaaatctgct aatgccaata cgagggaaga attaaccgct 1020 gctatcatgt cgacgattgg tagaatttcc gagagtgact gtgttggatt ttttaatcat 1080 gtgagggatt ttggcgtcat ggcgttgcga agagaagaat tttaattaaa tttattgttt 1140 tgtctttttt atatcctwtt tttgcatgtt tccttatttc cattttcatt cgtttttata 1200 tcttaaatta aaataaagtt ggaaatgatt aaaataaatt tggaaatgat taaaatatat 1260 ttgcaaagga ttaaattaat atgaaaaccc tg 1292 // ID Copia-29_DPu-I repbase; DNA; INV; 5157 BP. XX AC scaffold_212; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-29_DP_; KW Copia-29_DPu-LTR; Copia-29_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5157 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_212; Positions 138566 133410. XX CC Positions [1731-2261] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 60..3728 FT /product="Copia-29_DPu-I_1p" FT /translation="MAAYYSKDVSHIKKFDGTDFSFWKFQVELVLEQHQLL FT SVVKGREICPLPDVQENELILPDVQENELIANQDAITLWKTKDVAARSCLI FT STIEDSCKRSLLNCRTAAEMWTRLTVQYQQNVAESKHILCNQYYQYAFEPG FT NSVMAHISAIEGMAIQLTDLGVEIGATQLMTKILLTLPPSFQSFQSAWDIL FT PDQEKTLATLTSKLVLAETMNKHFGGHTEKDQAFFNKRTGSAPPRSHPPRS FT HGLSAKQLFCSACGPGLHSTATCRKRKWNQSTDVPPSGSSEKQEKCTYCTW FT ENHRVEDCSIRLKHERELAATKTRSKKSKDRSGYASKKEDASTKEKEDLNG FT DRHPNNDTMAFPAYSSASSRRSGSTLFVADSGATDHMSDQLHIFNVFTAVT FT PGDWKIKGIGTNSALQVHGYGTVNIRSKVDGIWYDGILEKVLYVPNLGITL FT FSIGTAADLGCVVTFDKNQIRLCQNNTPVAVGTRSSQDDLYYLNIEVVPTT FT ASAFLSVSSVPLSTWHLRLGHISTTIIKLMESTNCVIGLQLSKSEELPVQQ FT CEGCAYGKSQRLSFPTSGHIKATEIGHLVHSDLCGPMSVASPSGAVYFLIF FT KDDYTGFRVLYFLTHKSQVFHYFQLYSSQLHTETNQHVKILRTDNGGEFTS FT SEFSNFLSKEGIKHETSAPHTPEQNGVAERENRIVMESARSLLHAEAISLE FT LWAEAVACAVYCLNRALNTNNKICTPFEGWYKQKPTISHLRIFGTKAYVHI FT PGVERKKLDAKSKLCTFVGYSDTQKAYRFWCQETRKIVISRDAIFCEKDET FT VFPVNPLVPEKISNELPSSTSFLPKRKNNVFPPSDINKDVPSPGDRRNPAR FT VRHQAVPRSSLHALSAETSTTEPLSYDDAISSPDYLLWKQAMEEEMAALNL FT NHTWTLTTLPRGRSPVDCRWIYKLKHRVDGSIERYKARLVAKGFSQRPGID FT YDQTFSPVVKYDSLRTILSVTAAEDLELYQLDVTTAFLHGVLKEEVYLRQP FT EGHVIPGQETQVYRLHKSLHGLKQASRNWNEKFDEFLTKFGLVPSQADSCV FT YFLRKDAEITIVSIWVDDGLVASSSKKIVLEIIEHLQLQFEIVSRPADLFV FT GLLINRNRDKKYLHLSQPTYISKILSKFCMQDSHPKATPADPFNKLTKESS FT TADSDSSKLFPFQEAVGSLLYCMITTRPDISYAVGQVAQFTTNPGKSHCEA FT LKRILSYLKGTST" XX SQ Sequence 5157 BP; 1541 A; 1012 C; 1016 G; 1588 T; 0 other; acaggttatg ggcccagaga attatttgtt ttttgtttaa tttacaaatt ttttaagtga 60 tggcggcata ttattcaaaa gatgttagtc acattaaaaa gtttgatgga acggattttt 120 ctttttggaa gtttcaagtg gagttggttt tagaacagca tcaacttttg agtgttgtta 180 agggtcgaga aatctgtccg ttacctgatg tccaagaaaa cgaactaatt ttacctgatg 240 tccaagaaaa cgaactaatt gctaatcaag acgcaatcac cctttggaag acgaaagatg 300 ttgcagcccg ttcgtgttta atcagcacaa ttgaagattc ctgcaaaaga agtttgttaa 360 actgtcgcac agcggctgaa atgtggacaa ggttaaccgt tcagtatcag caaaatgtag 420 ctgagagtaa gcatattctt tgcaatcaat actatcaata tgcctttgaa cccggtaatt 480 cagtaatggc acacatctct gccattgaag gcatggcaat tcaactaaca gacttaggag 540 tggaaattgg tgctacgcaa ttaatgacta agattcttct tactcttcca cctagttttc 600 aaagttttca gtctgcctgg gatattttac cagatcaaga aaaaaccttg gccactctga 660 catcgaaact tgttttagcc gaaacaatga acaagcattt tggtggccac actgaaaaag 720 atcaagcatt ttttaacaaa cgaacaggtt cagcacctcc aagatctcac cctccaagat 780 ctcacggttt gtctgcaaaa caattatttt gttctgcttg tggacctgga cttcattcta 840 ctgccacttg tcgcaagcgt aaatggaatc aatctactga cgtcccacct tcaggtagca 900 gtgaaaaaca ggaaaaatgt acttattgta cctgggaaaa tcatcgcgtt gaggattgtt 960 caattcgatt gaagcatgaa cgagagttag ctgccacaaa aacaagatcg aagaaatcaa 1020 aggatcgaag cggttatgct tctaaaaagg aagatgcttc tacaaaagaa aaagaagatc 1080 tgaatggtga tcggcatccc aacaacgaca ctatggcgtt tccggcgtat tcatctgcct 1140 cttctagacg gtctggatca actttatttg ttgctgattc tggagctact gatcatatga 1200 gtgatcagtt gcatatcttc aacgttttta cagccgttac acctggagat tggaaaataa 1260 agggaattgg aaccaattca gcactgcaag ttcacggcta cggcacagtt aacattcgta 1320 gtaaagtcga tggcatctgg tatgatggta ttttggaaaa agtcctgtat gtgcctaact 1380 tgggaatcac tcttttttca atcgggacag cagctgatct tggctgtgtt gtaacttttg 1440 acaaaaacca gatccgactt tgtcagaata atactccagt agctgttgga acgcgctcca 1500 gtcaagatga tttgtattat ttaaatattg aggttgtacc aaccacggca agtgcttttt 1560 tgtcggtctc atctgttcct ctctcaactt ggcatctgcg tttggggcat atcagcacaa 1620 caattatcaa gttaatggag tcaacaaatt gcgttattgg tttacaattg tcaaaatcag 1680 aagaacttcc agtacagcaa tgtgaaggat gcgcgtacgg caagagtcaa cggctgtctt 1740 ttcctacttc cggacacatc aaagctaccg aaattggtca tctcgttcac tcggatctgt 1800 gcggtccaat gagtgtagca tcaccaagtg gggctgttta ttttttgatt ttcaaagatg 1860 actacactgg ttttagagtt ttatattttt taactcacaa atcacaggtg tttcattact 1920 ttcaattgta ctcatctcaa cttcatactg aaactaatca acatgtaaag attttgagaa 1980 cggataatgg aggcgaattt actagcagtg aattttcaaa ttttctgagc aaagaaggga 2040 tcaaacatga aaccagtgcg ccgcataccc cggagcagaa tggagtagcc gaacgtgaaa 2100 accggatcgt catggagtcg gcgaggagtt tattgcatgc agaagctatt tctcttgaac 2160 tttgggctga agccgtggca tgcgcggttt attgtcttaa tcgtgcttta aataccaaca 2220 acaaaatctg tacacctttt gaaggatggt acaaacagaa accaactatt tctcatctga 2280 gaatctttgg tacaaaggca tatgttcata tccctggggt ggaacgtaaa aaacttgacg 2340 cgaagagtaa actctgcacc tttgttggtt atagtgatac tcagaaagct tatcgttttt 2400 ggtgtcagga gacaagaaaa atagtaatca gtagagacgc tattttttgt gaaaaagacg 2460 aaaccgtgtt tcctgtcaat ccgcttgttc cagaaaaaat aagcaacgaa ctcccttcgt 2520 ccacttcatt tctaccgaag agaaagaaca acgtttttcc tccatctgac atcaacaaag 2580 atgttccgtc tccgggggat cgccgaaatc cagctagagt gagacatcaa gcagttccaa 2640 gaagttcgct tcatgcttta agtgcagaaa cttcaactac agagccacta agctacgacg 2700 atgctatctc gtcaccggat tatttgctct ggaaacaagc tatggaggag gagatggcag 2760 cactcaatct caatcatacc tggacattga ctactttgcc tagaggtcgt tcacctgtag 2820 attgccgctg gatatacaag cttaaacacc gagttgatgg ttctatcgaa agatacaaag 2880 ctcgcttggt tgctaaaggt ttttcccaac gtccgggaat tgattacgac caaacttttt 2940 ctccagtagt caagtatgat tctttgcgta ccattttatc agttacagct gcagaagatc 3000 tggaattata tcaactagat gtgaccacgg cttttcttca tggtgtttta aaagaagaag 3060 tctaccttag acaaccagaa ggtcatgtca tccctggaca agaaactcaa gtttatcgtc 3120 ttcacaaaag cctgcacggt ttgaaacagg cgtcgcgaaa ttggaatgag aaatttgatg 3180 aatttctgac caaatttgga cttgtcccca gtcaagccga ttcgtgcgtc tattttcttc 3240 gaaaggatgc agagattact atcgtaagca tctgggtaga cgatggtttg gtggcaagca 3300 gcagtaagaa aattgtttta gaaatcatcg agcatctgca gctccagttt gaaattgtat 3360 caagaccggc tgatttattt gttggcctct tgattaacag aaaccgtgac aagaagtatc 3420 tgcatctctc gcagccaact tacatcagca aaatcttgtc caaattttgt atgcaggata 3480 gtcaccctaa agctacacca gccgatccat ttaataaact gaccaaagag tcttcaactg 3540 cagattcaga ttccagcaag ttattccctt ttcaagaagc ggtcggcagt ctcctatatt 3600 gcatgatcac tacaagacct gatatcagct atgcagtggg ccaagtggct cagttcacta 3660 ctaatcccgg aaagtctcat tgcgaggctt taaaacgtat actttcctat ctgaagggaa 3720 catccactta gcgatcccac ccgaattgca tttgattcga atcgattcga attgattcga 3780 atcgattcga attgattcga atcgattcaa cttgaattgc gttttttgag atttttcgaa 3840 tattcagttc aactcgaatt ggaaaaatca aaattcatat tcatttgaat tgtgatattt 3900 ttgggtaaat attcatgaaa atttacctga atattcattt cccgattcat gaatattcat 3960 tgtgaatatt cacgaatatt tgggtcgatt cgtgtcaaat ttgacaaaat ttttgcaaag 4020 ttggaatatt tttagtttca atttcgaccg ctagatgtgc cacctcttcg tttcgataaa 4080 cgaaggggat ggatattgtt tgggtacttt caaatggaca actattctat tattattgaa 4140 ttatcactaa atccagtcga tttaagtgat attctactgc aattatttaa aacagttaaa 4200 attatggatg ttttagtgta atatttcttg cctaataaac aaaaacaatt ttttttttat 4260 ttttcttgat ttcattcgaa tcgattcgaa ttccattcga attgattcga attgagtttt 4320 tgtcaattca aatcaattca aattcaactc gaattgtatt ttgaaaaaaa ttattcaact 4380 cagattgcgt gaaatacctc gaattgattc gagtgggatc gctacatcca cttatggttt 4440 acgttttcag aagactcagt caaatggaat tttgtttgca tttacggacg ccgattatgc 4500 tggtgacttg gattcccgaa gatctacaac tggctatgtg ctgacactga acgccggacc 4560 ggtggcttgg ggtagcatca aacaaaaagt gtagctctct ctactacaga agcagaatac 4620 atcgctgcct gccacaccgc aaaggagata gtttggttgc gcaatctact tcatgagatt 4680 ggttttgagc aagttgattc taccactttg ttctgtgaca atcaaagtgc tgtccgcctt 4740 gtgttcaatc ctgaatttca caagcgaact aagcacatcg aagttcagca ccactttatc 4800 cgtgaaaagc agaatgatgg ttctcttcat atgcagtacc tacctacaga agaacagctc 4860 gcagatttat ttactaaacc tcttcctggc ccacgttttg aaaaattaca ccggcaaatt 4920 gggattgaag atttattagt ttaaatatgg tctgactttt taaatctatt cttcaaactc 4980 ttacactgtt taagcgattt gtttcatttt ctgagtgatg attcatcatc aatttctcac 5040 atttaaatct tatgtttttt cttctaaatc tagaagaagt atttaacttc taaaaattca 5100 tactgagttg tctcatatgt ctcataaaaa ctgtattcct tgtttaagag agggtgt 5157 // ID CR1-77_AAe repbase; DNA; INV; 3533 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-77_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3533 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1165-1165 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 3..431 FT /product="CR1-77_AAe_1p" FT /translation="NIVAYVSKKTRCSPAQVRCSKLIRQDRSQPITFISFK FT LSVPKAFESTIVANSFWPEGVSITPFLERRPNAYRRKKLFSPQPITSLPRH FT QSHQVYGLQMKPSPNNAPYVQQPPTPILRQSHVLHNNRPQHLSLPVRNRFQ FT SSLV" FT CDS 158..3316 FT /product="CR1-77_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="FLARRSLDYSFFRTASQCVQTQETVFTSTNHEFTTSS FT VTSSLRSSNETISKQCTLRPTTTNSHPSSITCSPQQPASTSKPASSKSVPI FT IPCLATVLTCYYQNVGGINTSVHSYMLACADASYDLIAFTETWLDERTLST FT QIFGNCYVVYRVDRNEQNSRKTRGGGVLLALRSSLNCRQLYPPNSSVEQVW FT ASINLSTYMLYVCVIYIPPDKVNDPVVIDAHLESLTWISSQMELSDKIVVI FT GDFNLAGITWNTGSSHLFPDPSKSTIALSSINLLDGYSTANLCQINNVPNT FT NGRLLDLCFISEDFLNNDEIHLTAAPISLVKSCTHHPALHISISLPQTYTY FT IDTTESIYYDFGKTNFAAMTEFLAGFNWDSVLVGLDTHTAAKAFCDIMSYA FT IDQFTPKRQKQTKTYPPWSNPCLKRLKSAKRSCLKKYSKNRTNLLRSRYIT FT SNRKYKVLNEALFSAYQHRVQLNLKNNPKKFWNYIDEQRKESGLPSCMKLG FT DTEVSTVPDICHAFQKHFRSVFSDEMLDDLEVSKAASNAPNLPTVGNHPTV FT SSIMLETAIKSLKSSSNPGPDGIPSLILKKCFSVLQFPLLHIFNLSLRCGS FT FPEIWKSSYLFPVFKKGCRRVISNYRGIAALCATSKLFELVVMDFITHNCS FT SYISETQHGFMPHRSTTTNLVSYTSFITRCIEKGLQVDAIYIDLSAAFDKI FT NHKIAVAKLERLGFTGSFLSWLQSYLIGRSMSVKIGECLSSPFQVTSGVPQ FT GSHLGPVIFLLYLNDVNLSLKCFKLSYADDFKLYYVINDHNDADFLQCQLD FT VFVDWCKTNRMVMNATKCSIISFTRKHRTIMSDYHVSDHTLERVSSIKDLG FT VIVDSNLSFRDHVSYVVGKASKSLGFIFRAGKHFSDVYCLKTLYCSLVRSN FT LEYAAIVWSPQYQNSILRIERIQRKFIRFTLRNLPWTDPSNLPEYKDRCQL FT INLNLLSTRRNLAKLLFTCDVIQSRVDCPELLEKFNFDIHRRALRSHTFFR FT LPLCRTNYGRNEVVNSMCRLFNQYCNVFDYNLSRAVLKTRFAVVLGV" XX SQ Sequence 3533 BP; 1013 A; 792 C; 631 G; 1097 T; 0 other; ataatattgt ggcctatgtt agtaagaaaa cgcgttgtag tcctgctcaa gtccgttgct 60 caaagcttat acgacaggac agaagtcaac ctatcacatt tatatctttt aagttaagtg 120 ttcccaaagc tttcgaaagc acgattgtcg caaatagttt ttggccagaa ggagtctcga 180 ttactccttt tttagaacgg cgtcccaatg cgtacagacg caagaaactg ttttcacctc 240 aaccaatcac gagtttacca cgtcatcagt cacatcaagt ttacggtctt caaatgaaac 300 catctccaaa caatgcaccc tacgtccaac aaccaccaac tcccatcctt cgtcaatcac 360 atgttctcca caacaaccgg cctcaacatc taagcctgcc agttcgaaat cggttccaat 420 catcccttgt ctagcaaccg ttttaacatg ctattaccaa aacgttggtg gcattaatac 480 atccgtacac tcttatatgc tggcatgcgc cgatgcctcc tatgatctca tcgcctttac 540 tgagacttgg ttagatgaga gaactttgtc cacgcaaatt tttggaaact gctatgttgt 600 ctacagagtc gatcgaaatg agcaaaatag ccgtaaaact cgcggcggtg gtgttttatt 660 agctctacgg tcatcgctaa actgtcgtca gctttacccc cccaacagct cagttgaaca 720 agtctgggcc tcaatcaacc tatcgactta catgctctac gtttgcgtca tatacattcc 780 acctgataaa gtgaacgatc ctgtcgtcat tgacgctcat cttgaatctc ttacctggat 840 ttcttctcaa atggaactca gcgacaaaat tgtagtcatt ggcgatttca atttggccgg 900 cataacatgg aataccggct catcgcatct ttttcctgat ccatcgaaat ctacaattgc 960 actttcatcg atcaacctgc ttgacggata cagtacagca aatttatgcc aaatcaacaa 1020 tgtaccaaac acgaatgggc gccttttaga cctctgtttc atttcagagg actttttaaa 1080 caacgatgaa atccacttga cagctgctcc catttctcta gttaaatctt gtacccacca 1140 ccctgctctg cacataagca tttctctgcc acaaacatat acatatattg atacaacaga 1200 aagtatctac tacgattttg gaaaaacaaa ttttgctgct atgactgaat tccttgctgg 1260 attcaattgg gattcagttt tggttggtct cgatactcac actgctgcta aggcattttg 1320 tgacattatg tcttatgcta tcgatcagtt cacccctaaa cgacagaagc aaacaaaaac 1380 ctatcctcca tggtccaatc catgtctcaa gaggttaaag tcagccaaac ggtcttgtct 1440 taaaaagtac tctaaaaatc gaacaaattt gcttagatca cgttacatca cttccaatag 1500 gaaatataaa gtgttgaatg aggcactttt ctccgcttac cagcatcgtg tacaactaaa 1560 tcttaaaaac aacccaaaga agttctggaa ttatatcgac gagcaacgaa aggagagcgg 1620 cctaccatca tgcatgaagc ttggagatac agaagtgtct actgttccgg atatctgtca 1680 cgcatttcag aagcatttcc gcagcgtttt ttcggatgaa atgttggatg atctagaagt 1740 ctcaaaagct gccagcaatg cgcctaacct cccaactgtt ggcaatcatc ctaccgttag 1800 tagcataatg ctggagacgg ctataaaatc actcaaatct tcctcaaatc ctggcccaga 1860 cggcatccca tctcttattc tgaagaagtg cttttcagtt cttcagttcc ctttgcttca 1920 catcttcaat ctgtctttac gttgtggatc gtttccagaa atctggaagt cgtcgtattt 1980 atttcctgtc ttcaaaaagg gatgcagacg tgtcatctca aactatcgtg ggattgccgc 2040 cttgtgtgcc acttccaagc tattcgaatt ggttgtcatg gacttcatta ctcacaactg 2100 ttcaagttac atttcggaaa ctcaacacgg attcatgccg caccgatcaa ccacaacgaa 2160 cctggtatcg tatacatcct ttataacccg ctgcattgaa aaaggacttc aagttgatgc 2220 aatttacatc gatctctcag cggccttcga caaaattaac cacaaaattg cagtagccaa 2280 actagagaga ttgggcttca ctggttcctt cctttcctgg ttacagtcat atttgatcgg 2340 tcgttctatg tctgttaaaa tcggagaatg cttatcatca ccatttcaag taacatcagg 2400 ggttcctcaa ggaagccatt taggacctgt aatttttctt ctttatctta acgatgtaaa 2460 cctgtctctc aaatgtttca agttatctta tgctgatgat tttaaattat actacgtcat 2520 caacgaccac aacgacgctg attttttgca atgtcagctg gatgtttttg ttgattggtg 2580 taagacgaac aggatggtca tgaatgcgac taaatgttca atcatttcat tcactcgtaa 2640 acatagaact ataatgtcgg attaccatgt ttctgaccac acattggaac gcgtaagttc 2700 aatcaaagac cttggtgtca tcgtggacag caacctgtca ttccgtgatc atgtctcata 2760 tgttgttggt aaagcttcca agagtctagg tttcattttt cgtgccggta aacatttttc 2820 tgatgtttat tgccttaaaa ctttatattg ttctttggta cgatctaatc tcgaatatgc 2880 agctattgtt tggtccccac aatatcagaa cagtatttta cgtattgaac gaatccaaag 2940 aaagtttatt cgatttacgt tacgaaacct accgtggaca gatccctcaa atttaccaga 3000 gtacaaagac aggtgtcagt taattaacct taatttactc tccactcgtc gaaaccttgc 3060 caaacttttg ttcacctgtg atgtgataca gtctagagtt gattgtcctg aactgttaga 3120 aaagtttaat tttgatattc atcgtcgcgc actccgttct cacacatttt ttcggctacc 3180 tttgtgtaga accaattacg gtagaaatga agttgtaaat agtatgtgtc gtttgtttaa 3240 tcagtactgt aatgtctttg attataatct atctcgtgca gttttgaaga ctcgctttgc 3300 tgtagtgtta ggtgtctaga ataagattcg atgtttaaaa agtgtatgta aaattaagta 3360 ttagattaag ctacgtcttt gtatcatttg gaattgtgtt ctgttgatat gaaaagacgg 3420 gtaggttttg tgcctatttg agaaagaggt tcaaattgtt ggtctctact caaacgggtt 3480 tttccctact cctaaaaaat aaatgaaaat gaaaataaat gaaaaatgaa aca 3533 // ID Gypsy-81_AA-LTR repbase; DNA; INV; 204 BP. XX AC supercont1.19; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-81_AA_; KW Gypsy-81_AA-I; Gypsy-81_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-204 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.19; Positions 3747392 3747595. XX SQ Sequence 204 BP; 58 A; 41 C; 34 G; 71 T; 0 other; tgttgcatta tgtagttatt gaacgccaca tgagaatgta atttgtcaac gacccttttg 60 tacgtcattc attacgcgcg cattcctctc cattgactag ttttccattc attctgtatc 120 cgagtgcgta tcagtaagaa cgttgaataa agctagtttg taaaccgttt cccgaaatat 180 tttatttgat gaccaaataa aaca 204 // ID BEL-80_CQ-I repbase; DNA; INV; 5359 BP. XX AC AAWU01021955; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-80_CQ_; KW BEL-80_CQ-LTR; BEL-80_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5359 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 301-301 (2011). XX DR GenBank; AAWU01021955; Positions 44816 39458. XX CC Positions [4381-4986] - Integrase core CC 'GTTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 40..5328 FT /product="BEL-80_CQ-I_1p" FT /translation="MDKLERVRNAQLAQVKRELAAAEKLGERKASIGEATD FT RLELLKELAGKFRQTQESIEQEQENPEVIASVHDYREEFNESYYAARNLLE FT KYIVDNNPDETGSTDSGHTVIDGNGELREAMRLLLNTQRTMMERSGNAGGN FT LEPAGNNVPNVRLPAIDVPKFSGERKHWSSFKDIYTTTIHNRNDLRPSLKM FT QYLVSYVDGFAQQLVGRYSISDAHYEEAWTALTDYYDKKKFTVFALVREFV FT EQPAVEEAVSGELRKLATTSDEVVRQLNALGAEFNTRDPWLIHILLEKLDD FT ETRSLWAQRIVEVDNPSLDDFLKFLGNRCDALETCSAFSRKVSPNVPKKET FT TKKPPSEKKVQSLYSAAVEEKCAKCSKEHPLYQCDEFRKMDLQCKRDLVAQ FT EKLCYNCLRSSHIAKSCSSKSVCRNADCKQRHHTLLCPKSTVRQVVNESAT FT EENSTIDPVVTMVAQVPADVPRKAFILPTAIIRIRGADGRLIQARALVDSG FT SEASLISEACVSKLGLPRANGKVAVTGMGQQAAGTTRGVVKLEIANRFDDT FT SVLQTMAYVMGKLTSTLPTQLCQVHPSLLDRDVQDFLADPAYQRPGPIDLI FT LGCDVFLALLRPGQVKDDGGVPVAQNTIFGWIVSGNQAIYTHRIQANVSIV FT NLHAELDINRTLRMFWEQEEIPKPAQLTPSEEAAAEFFKSTLSREKDGRFI FT VRLPFDESKPALGESLGPAIKRLRSMERHFRSDSEFHKLYSDFLTEYQALG FT HMEEVPANEVEVEAGKCCYLPHHAVVKDSTTTKLRVVFDASCATSTGVSLN FT DRLLAGPNVNQELFSVFLRFRTYKVAFTADAEKMYRQVWVHPADRDYQRIV FT FRETEDQPIKHYRLCTVTYGTKCAPYLAIESMKQAASEFKLKYPEAAKKIE FT LDTYVDDFLSGAQTVQRAKELKSQVVEILESAGFHLRKWTTNCPELLQHAA FT ETDQTPVEVKLDERANAVKALGILWHPKEDEFSFKVNLSPNSVNTKRQLLS FT DSSKLFDPFGWLAPVAIKIKILYQHLWLCELSWDDGLPATVEPAWKEIKET FT LHLLEQIRIRRFAPNKDGKIELHGFSDASEAAYSAVVYAREPDETGQAEMN FT LLAAKTKVAPIRQVCVPRLELNGGTLLANLMLAIVAALSHLEVELYAYTDS FT SIVLHWLSAHPRKWKTYVANRTSAILEVLPRDRWSHVRSENNPADCASRGL FT TPAELVAHPMWPHGPKEMNSKDVSWKNVPLEPIDDEDLLETRQLKVLHSTA FT MVIRTDYSIESRLLARRSSYTLIVRTLAYVNRFLLALKSEDANLEPGLSPN FT EIYDAKAQLARFAQHGAYEKEIQLLLKGEELPAKDKLSALHPFLDGQGTMR FT VGGRLQNSSHPYDVKHPIILPGSHKVTELLLRELHLRNLHAGPTLLTATVN FT QQYWVIGLQAAVRQAVQGCARCVRLKGKTASQLMGSLPVTRVMGTRAFAHV FT GVDYAGPVKVHASCVRGVKTTKGYIVVFVCMATKAVHLELASDLSTNTFIG FT ALKRFVSRRSHPNEMWSDCGTNFVGADTWLKEIRGALEKHNVAANRFLTNL FT GIKWVFNPPSAPHRGGLWEAAVKSAKKHLVAVLGSDAATFEELSTVLTQVE FT ACLNSRPLCPLSADPDSYEALTPGHFLVGQPLNLIPEPGVQHLPMNRLDKW FT QLVHRHTTDIWSRWRDEYLAHLQPRTKWRTTETNVKEDQLVLVKNDNAPPT FT QWELARIVKLHPDASGVVRTVTLRRGQAEYLRPVQKICVLPTD" XX SQ Sequence 5359 BP; 1351 A; 1388 C; 1555 G; 1065 T; 0 other; tatggtcctt caagtccgga tagtcgggaa ccggtgaaca tggacaaact ggaacgggtc 60 cgaaatgcgc agcttgcgca ggtgaaaagg gagcttgcag cggccgaaaa gctcggcgaa 120 cggaaagcat ccattggaga agcgacggat cggctggagc tgctcaaaga actcgccgga 180 aagttccggc agacacagga gagcatcgaa caggagcaag agaatccgga ggtcattgcc 240 tcagtccacg actatcggga ggaatttaac gaatcgtact acgccgcacg gaatcttttg 300 gaaaagtaca tcgttgacaa caatccggat gaaacaggtt ctacggatag tggccacacg 360 gtgatcgatg gaaacggcga actgcgcgaa gcgatgcgac tactgctgaa cactcagcgt 420 accatgatgg agcgttcggg gaatgcaggt ggaaatctgg agccggcggg gaacaacgta 480 ccgaatgtta gacttccggc gatcgacgtg ccaaagttca gcggggaacg gaagcactgg 540 agctctttca aagatatcta caccacgacc attcacaacc ggaacgattt gagaccgtcg 600 ctgaagatgc agtacttggt ctcgtatgtg gacgggttcg cgcagcagct agtaggccgg 660 tactcgatct ccgatgccca ctacgaggag gcatggacag cgctcacgga ctactacgac 720 aagaaaaagt tcaccgtatt tgccttggtt cgagagttcg tggaacaacc ggcggtggaa 780 gaagcagttt ctggcgaact tcgaaagctc gccacgactt ccgatgaggt cgtgcgtcaa 840 ctcaacgcac ttggagctga gttcaacaca cgagatccgt ggcttatcca cattctgctg 900 gaaaagctcg acgacgaaac gcgttcactg tgggcgcaac gtattgtcga agtggacaac 960 ccctcgttgg acgacttcct gaaatttttg ggcaaccggt gcgatgcgct ggaaacctgc 1020 tcagcgtttt cgaggaaggt gtcgccgaac gttcccaaga aggagaccac gaagaaaccg 1080 ccgtcggaga agaaagtgca gtcgttatac tcggcggcgg tggaggaaaa gtgcgcgaaa 1140 tgttccaaag aacatccgtt gtaccagtgt gacgaattta ggaagatgga cctgcagtgc 1200 aaacgggatc tggtggcaca ggaaaagttg tgttacaact gtttgcgatc gtcacacatt 1260 gccaagtcct gcagttcgaa atcagtgtgc cggaatgcgg actgcaaaca gcgtcaccac 1320 acgttgttgt gcccgaaaag tacggtgaga caagtggtga acgagagcgc aacggaagaa 1380 aactccacga tcgatcctgt cgtcacgatg gtggcgcaag tgccagcgga tgtgccacgg 1440 aaagcgttca ttctgcccac agccatcatc cgcattcggg gagctgacgg ccgtctgatt 1500 caagcacggg cactcgtcga ttccggttca gaagcatcgt tgatctcgga ggcctgcgtt 1560 agcaaactcg gactgccacg tgccaacgga aaggtggcgg tcaccggtat ggggcaacaa 1620 gctgctggaa caacgcgagg agtggtgaag ctcgaaattg ccaaccggtt cgacgacaca 1680 agcgtgctac agacgatggc ctacgtcatg gggaaactga cgtccacact cccaacgcag 1740 ctctgccagg tgcatccaag cctcttggac agagatgttc aagatttcct cgctgatccg 1800 gcgtaccagc gtccggggcc gattgatttg atcctgggat gtgatgtctt tcttgctcta 1860 ttgcggccgg ggcaagtcaa ggacgacgga ggggttcctg ttgcgcagaa cacaatcttc 1920 gggtggattg tgtccgggaa tcaagccatc tacacgcatc gtatccaagc caacgtctcg 1980 atagtcaatc ttcacgcaga gttagacatc aatcgcacct tgcggatgtt ttgggagcag 2040 gaagagattc cgaaaccggc gcaacttact ccatctgaag aagctgccgc cgagtttttc 2100 aagtccacgc tttcacggga gaaagatgga cgtttcatcg tacgattgcc gtttgacgag 2160 tcgaaaccgg cgcttgggga atcgctgggt ccagccatca aacgattgcg atcaatggaa 2220 aggcatttcc ggagcgattc ggagttccac aagctttact ccgatttcct caccgagtac 2280 caggccctgg ggcacatgga agaggtgcca gcgaacgagg tggaagtaga agctggaaag 2340 tgttgctatc tacctcacca cgccgtcgtc aaggacagca cgacaacaaa attgcgagtg 2400 gttttcgacg cctcctgtgc aacgtcgacc ggagtgtccc tcaatgacag gctgcttgca 2460 ggacccaacg tgaaccagga gctgttctct gtgtttctgc gctttcgaac ctacaaggtg 2520 gcgtttacgg cagatgctga aaagatgtac cgacaggtgt gggtgcatcc agcggaccgg 2580 gactatcagc ggatcgtgtt ccgcgaaacg gaggaccaac ccatcaagca ctaccggctg 2640 tgcaccgtta cttacggcac caagtgcgct ccgtacttag caatcgaatc gatgaagcaa 2700 gcagccagcg agttcaaact caagtatccg gaggcggcca agaagatcga gttggacacg 2760 tacgtcgacg attttctctc cggagcacaa acggtgcagc gggcgaagga gttaaagagc 2820 caagtcgtgg agattctgga gtccgctggt ttccacctgc ggaaatggac cacgaactgc 2880 ccggagctac ttcaacacgc ggcggagaca gaccagacgc cggttgaagt gaaactggac 2940 gaacgagcga acgcggtcaa ggcacttgga attttgtggc atcccaagga ggacgaattc 3000 tcgttcaagg tcaacctcag cccgaacagt gtcaacacca aacgacaact actttcggat 3060 tcttccaagt tgtttgaccc gtttggatgg ctggcacccg tggccatcaa aatcaagatt 3120 ttgtaccagc acttgtggct ctgcgaactc tcttgggacg atggtctgcc tgctactgtt 3180 gaaccagcgt ggaaggagat caaggagacg ctacacctcc tggaacaaat acgcatcagg 3240 cgattcgcac ccaacaagga cggcaagatc gagttgcacg ggttttcgga tgcatcggaa 3300 gcggcgtact cggcggtggt gtatgcgagg gaacctgacg aaactggaca agcggagatg 3360 aacctactgg cagctaagac gaaggtggcg ccgatacgcc aagtgtgcgt acccagactg 3420 gagttaaacg gtgggactct tctggcaaat ctgatgctgg cgatagttgc ggcgctttcg 3480 catctcgagg tggaactcta cgcatacacg gacagcagca ttgttcttca ctggctatct 3540 gcgcacccac gcaagtggaa gacgtatgtg gccaatcgta cgtcggccat tctcgaagtg 3600 ctgccacgtg atcgctggag ccacgtgcga agcgagaaca acccggcaga ctgcgcctct 3660 cgtgggctca ccccagctga gctggtagcg cacccaatgt ggccccatgg acccaaggag 3720 atgaactcga aggatgtcag ctggaagaac gtaccactcg aaccgataga cgacgaggac 3780 ctgttggaaa cccgccagct caaggtgttg cacagtactg caatggtcat ccgaacggac 3840 tacagcatcg agtcacggct gttggcacgg cggtcgagct acacgctcat tgtacgtaca 3900 ctggcgtacg tcaatcgttt tctgttggca ctcaagtctg aagatgcaaa cttggaacca 3960 ggtctgtccc caaacgaaat ttacgatgcg aaagctcagt tggctcgatt tgcacaacac 4020 ggcgcgtacg agaaggaaat tcagctgcta ctgaaaggcg aagaactccc ggctaaggac 4080 aagctgtctg ctcttcatcc gtttcttgat ggccaaggca caatgcgagt aggagggcgg 4140 cttcagaact cctcgcatcc gtacgatgtc aagcacccca tcatcttgcc gggcagccac 4200 aaggtgacgg agctgttgct gcgagaacta catttgcgaa acctgcacgc cggacctaca 4260 cttctcacag ctacggtcaa ccaacagtac tgggtgattg gactccaagc agctgttcgt 4320 caagcggttc aaggttgcgc ccggtgcgtg cggctcaagg gaaagacggc ttcgcaactc 4380 atgggcagcc tgcctgtgac tcgagtgatg gggactagag catttgcgca cgtcggcgtg 4440 gattacgcag gtccagtgaa ggttcacgca tcctgcgtgc gaggcgtaaa gaccacaaag 4500 ggatacatcg tagttttcgt gtgcatggcc acaaaggccg tccatttgga actcgccagt 4560 gatttatcta ccaacacctt cattggcgca ctgaagcggt tcgtatcgag gcgttcgcac 4620 ccgaacgaga tgtggtccga ctgcggcaca aacttcgtgg gagcagacac gtggctgaaa 4680 gagatccgag gtgcgctgga gaagcacaac gtagctgcta accggttcct gactaatttg 4740 ggcatcaaat gggtgttcaa tccaccatca gcaccgcacc gtggtgggct ctgggaggcc 4800 gcggtaaaga gtgcaaagaa gcatctggtc gcggtgctag gatccgacgc agctacattt 4860 gaggaacttt caacggtgct gacgcaagta gaggcatgcc tcaattcacg tccgttgtgc 4920 ccactttcag cggacccgga tagctacgaa gcattgacac ctggtcactt cttagttgga 4980 caaccgctaa acctcatccc ggaaccaggt gtgcagcacc tcccaatgaa tcgcttggac 5040 aagtggcaat tggtacacag acatacgacg gatatttgga gccgttggcg ggatgaatat 5100 ctcgctcacc tgcagccaag aacgaaatgg aggacgaccg agaccaacgt aaaggaggat 5160 cagctggtgc tggtcaaaaa cgacaacgca ccgccgactc agtgggagtt ggccaggatt 5220 gtgaagctgc acccggacgc atctggagtt gtccggacgg taacgctgcg acgaggtcaa 5280 gcggagtact tgcgccccgt tcaaaagatc tgtgtactac caactgattg aggcacaggt 5340 gcctcaaggt gggggagta 5359 // ID Copia-112_AA-I repbase; DNA; INV; 4130 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-112_AA_; KW Copia-112_AA-LTR; Copia-112_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4130 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1520-2017] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 116..3373 FT /product="Copia-112_AA-I_1p" FT /translation="MADFKGLAFEKLNNRNWSTWKFRIEMLLTREEVWHVI FT AEPKPEPVDDRWKKADQKARATIGLCVEESQYSLVKSVDSAKGYWDKLCAY FT HEKSTITTQVALLNKLWSMNLSEGGDVECHIRELEEVYDRLAAAGHVLTES FT FKIVLLFRSLPESYQGLATLLQSQLDAGTTMEAVKAKVLEEFERRNERSGR FT VSGSASSEATAMKSATKKSREKSGAVKTCYHCGKPGHLRRDCRAWKNAKQG FT DDEKKKPEAKQSAKKANDDESYVCFSATNSAKNNEWYVDSGASCHMTNDEQ FT FFTKLVKKSGPSVVLADGKVVKTAGCGYGTLRGVSGSGNVIDVKLTDVLLV FT PSLTSGLISVDKLTSKGFTAVFEARGCEIRDKTGEVVVVGDRYGGLYRLKL FT GEASRKVEEKAHNPLCQHQWHRRFGHRHPGVINRISDEKLGNGMKVVDCGI FT RLTCEPCIEGKLARNPIPKAAERKSTLPLDLVHTDLCGPMKTTTPGGKRFI FT MTMIDDFSRYTVVYLLAKKSEAPGKIKEYVRFVQNLFGRKPKIVRSDGGGE FT YCNQELRTFFAEEGIKAQYSTAYTPQQNGVAERKNRSLQEMARTMLLDASL FT PTRYWGEAVVAAAFLQNRLPSRAVDTTPFEKWHGRKPELGYLRVYGCHAFV FT HVPDVKRGKFDGKARKLRFIGYSEEHKGYRFVDTSTDKITISRDARFLELD FT DGSSHGLAKDPSQEPVNTEVVWPLKENSDQQAEEEDVPAELPEDQDEDSMF FT FDLEDQDAEADPLAVKQEEEDSEEEDREEAARSNQGAGVRQSGRRTRGVLP FT NHLSDYVVGVAAHSELEPVTFEEAMSFPERDAWKSAMDSEMASHKKNGTWE FT LVPLPKGKKLVGSRWVFKLKRNECNEIVRYKARIVAQGFTQTPGVDFGDVF FT APVTRHATLRTLLALAGKKNLVVKHLDVRTAYLYGNISEEIYMKQPPGYVA FT RGKEELVCRLRRSIYGLRQSARCWNERLTEVLKKMGFEASDADACLFVGTV FT NGKKVYLIVYVDDFLVACESEQVITSVFEQLRKSFDIRCLGDVKRFLGLEV FT QKKDDVYSVSLTNYIDQLVARVGLSEAKV" XX SQ Sequence 4130 BP; 1070 A; 867 C; 1293 G; 893 T; 7 other; caggttatga gcccggtttg cgtttcggaa agtgcgggag gaaatttttg agaaattttc 60 gtttcgtttg gcggcgtcgc gagtggctaa tcggcggtgg tagtgcattt gcaaaatggc 120 ggatttcaag ggactcgctt tcgagaagtt gaacaatcga aattggagca cgtggaaatt 180 ccgcattgaa atgttgctca cccgtgagga ggtttggcat gtgattgccg aaccgaagcc 240 agagccagtg gacgaccggt ggaaaaaggc tgatcaaaag gcgagggcga ctattgggct 300 ttgcgttgaa gaaagccaat atagcctagt gaaaagtgtc gattcggcga aagggtactg 360 ggacaagtta tgtgcgtacc atgaaaaatc gacaatcacg acacaagtgg cattgctgaa 420 caaattgtgg agcatgaatc tttcggaagg tggtgacgtt gagtgtcaca ttcgcgagtt 480 agaagaagtg tatgaccggc ttgcggccgc ggggcacgtt ctgacggagt ccttcaagat 540 tgtgttgttg ttcagaagtt tgcctgaatc gtatcagggg ctggcgactt tgctgcagag 600 ccagctggat gcgggtacga cgatggaggc ggttaaagcg aaagtgcttg aagaatttga 660 gcgccggaat gagcgttctg ggcgagtgag tggatcggcg agcagtgaag ctaccgccat 720 gaaaagtgcg acaaagaaga gtcgagagaa gagtggtgcg gtgaaaacgt gctaccactg 780 tgggaagccc ggtcatctcc ggcgtgattg tcgggcatgg aagaatgcaa agcagggtga 840 cgatgaaaag aagaagcccg aagctaagca gagtgcgaag aaagccaacg atgacgaaag 900 ttatgtgtgc ttttcggcta ccaacagtgc gaaaaacaat gaatggtacg tcgatagtgg 960 tgccagctgt catatgacaa atgacgaaca gtttttcact aagctggtta agaaaagtgg 1020 cccgagtgtt gtgttggcag acgggaaagt tgtgaaaacg gcgggttgtg gttatggcac 1080 acttcgtggt gtgagcggaa gtggtaacgt cattgatgtg aagctaaccg acgttttgct 1140 tgtgccgtcg ctaacgagcg gcttgatatc cgttgataag ctaacctcca aagggttcac 1200 ggctgttttc gaggccagag gttgtgaaat tcgtgacaaa actggtgagg tcgttgttgt 1260 cggtgaccga tatggtggct tgtatcggtt gaagcttggt gaagcgtcga ggaaggtgga 1320 agaaaaggcg cataatccgc tgtgccagca ccagtggcac cgacgttttg gccacaggca 1380 tccgggagtg ataaacagga ttagtgatga gaagctaggt aacgggatga aggtcgtaga 1440 ctgcggcatt cggcttacct gcgagccgtg catcgagggc aaactggctc ggaatccgat 1500 cccgaaagcg gcggaacgca aatcaacgct gccgttggat ttggtccaca cagatctctg 1560 tgggccaatg aaaacgacaa cccccggcgg taagagattc atcatgacga tgattgacga 1620 ctttagtcgt tataccgtgg tctacctact cgccaagaaa tcggaagcac caggtaagat 1680 caaagaatat gttcgttttg tccaaaattt gttcggccgc aaacccaaga tcgtccgatc 1740 tgatggcggc ggcgaatact gtaaccagga attgcgcaca ttctttgcgg aagagggaat 1800 caaggcccaa tactctacgg cgtatactcc gcaacaaaat ggcgttgcgg agaggaaaaa 1860 taggtcgctk caggagatgg ctcgaaccat gttactggat gctagtcttc ccacgcgtta 1920 ttggggagag gcagtggtgg cagcggcgtt cttgcagaac aggcttccct cgcgtgctgt 1980 ggatactact cccttcgaaa aatggcacgg acgcaaacca gagttaggat acctccgcgt 2040 ctatgggtgc catgcgttcg tacacgtccc agatgttaaa cgcggtaagt ttgacgggaa 2100 ggcgcgaaaa cttcgcttca ttggatattc cgaagaacat aaggggtaca ggtttgtcga 2160 cacctctacc gacaagatca ccataagccg ggatgcgcga ttcctggagt tggatgatgg 2220 atcttctcac ggcctggcga aggatccgag tcaggagccc gtgaacactg aagtggtatg 2280 gccgctcaag gaaaactcgg accagcaagc tgaagaagag gatgttcctg cggaactgcc 2340 tgaagatcaa gacgaggata gtatgttttt cgacctggaa gatcaggatg ccgaagctga 2400 ccccctggct gtgaagcagg aggaggagga ctcggaggaa gaagaccgtg aagaagcagc 2460 gcgatcgaat caaggagcgg gcgttcggca atcaggccgc cggacgcgtg gtgtgctacc 2520 caaccacctg tcagattacg ttgttggtgt agcagcacat tcggagttgg aaccggtaac 2580 tttcgaggag gcgatgtctt tcccggaacg agatgcttgg aaatcggcta tggacagtga 2640 gatggcgtcc cacaaaaaga atggtacgtg ggagctagtt cctctcccga aaggcaagaa 2700 actagttgga agccgatggg tattcaaact caagcggaat gaatgcaacg agatcgtgcg 2760 ctataaggca cggattgtgg cgcaggggtt tacccagact cctggcgtgg actttggcga 2820 cgtgtttgct ccggtgactc gccacgccac tctgcgaact cttttggcat tggctggaaa 2880 gaagaatctc gtagtgaagc acctagacgt ccgaaccgct tatctgtacg gcaacataag 2940 cgaggaaatc tacatgaagc agcccccggg atatgtcgct cgaggcaagg aggagcttgt 3000 atgccggctg cgtaggagca tatacgggct gcgacaatcg gcccgctgct ggaacgagag 3060 gctgacggaa gtgctgaaga agatggggtt cgaggcaagc gatgctgatg cgtgtctgtt 3120 cgtgggcacc gtgaacggga agaaagtgta cttgatcgtt tatgtagacg attttctcgt 3180 ggcatgcgag agcgaacaag tgatcaccag cgttttcgaa caactgagga agagttttga 3240 cattcgatgc ctcggcgatg tgaaacgttt cctgggtctc gaagtacaga agaaagacga 3300 tgtgtatagt gtgagcctga caaactatat agaccagttg gtagcgagag tgggactcag 3360 tgaagcaaag gttkcaaaaa cgccgatgga caagggttta cttggtgatg aggtgaacga 3420 taaaccgtta gaagacgkta cgaaatacag aagcgcggtg ggtgcactta tgtatattgc 3480 cgtgtgcgct cgccccgaca tcatgamcag tgctgcaata cttggccgga agttcagtgc 3540 acctwcggag agtgactgga cggcagccaa acgcgtaatc cgttacctca aaggaacacg 3600 cgattggagg ctgattctcg gcggtgataa cgaagatctw gtggcatttt cggacagcga 3660 ttgggcagga gataccggga cacggaagtc aactaccggc ttcgtgctat tttattcagg 3720 aggagccgtt tcgtgggcaa gtcgccggca agactgtgtc acgctgtcga ctttagaagc 3780 agagtacgtt gccctaacgg agacgtgcca agaggtggtc tggatgcgac gactgcttcg 3840 tgatcttgga gaggagcaga ctacagcgac ggttgttcac gaggacaatc agggatgcct 3900 cagctttgct cagtccgagc gttctagcaa gcggtcgaag cacatagaaa caaagcgcca 3960 cttcgtgaag aacctatgtg aacgaggcga ggttgcgttg gwgtattgcc cgacggagga 4020 catgaaggcg gacgttctaa cgaaaccgtt gggaagagtc aagcatcatc acttcgcgag 4080 cgagcttgga ttggttggcg gtgtgccgag cgtcaaacat tgaggaggag 4130 // ID DIRS1 repbase; DNA; INV; 7053 BP. XX AC K00624; M11339; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 04-JUN-2010 (Rel. 15.07, Last updated, Version 2) XX DE Slime mold (D.discoideum) transposon DIRS-1, complete, clone DE SB41. XX KW DIRS; LTR Retrotransposon; Transposable Element; transposon; KW reverse transcriptase; retrotransposon; insertion sequence; KW heat shock protein; DDIDIRS1A; DIRS1. XX NM DIRS1. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 4520-7053 RA Zuker C., Cappello J., Chisholm L.R. and Lodish F.H.; RT "A repetitive Dictyostelium gene family that is induced during RT differentiation and by heat shock."; RL Cell 34(3), 997-1005 (1983). XX RN [2] RP 14-7053 RA Cappello J., Cohen M.S. and Lodish F.H.; RT "Dictyostelium transposable element DIRS-1 preferentially inserts RT into DIRS-1 sequences."; RL Mol. Cell. Biol 4(10), 2207-2213 (1984). XX RN [3] RP 1-371 RA Zuker C., Cappello J., Lodish F.H., George P. and Chung S.; RT "Dictyostelium transposable element DIRS-1 has 350-base-pair RT inverted terminal repeats that contain a heat shock promoter."; RL Proc. Natl. Acad. Sci. U.S.A 81(9), 2660-2664 (1984). XX RN [4] RP 56-4869 RA Cappello J., Handelsman K. and Lodish F.H.; RT "Sequence of Dictyostelium DRS-1: An apparent retrotransposon RT with inverted terminal repeats and an internal circle junction RT sequence."; RL Cell 43, 105-115 (1985). XX DR GenBank; M11339; Positions 1 7053. XX FH Key Location/Qualifiers FT CDS 492..1496 FT /product="DIRS1_2p" FT /translation="MSTTVNNNDASSSNTSASNSAESFDLRMKSMEDQINN FT LSLAFTRFMKEPMFSSNTNSRSQPSHDDSNTENEQSEDESSNNVDVPTDYQ FT LSDTLLGQYKHMVNNQGLLVEEECILKRDEISELNKVFNFPSNFQVNVAPF FT GTPEGITVSSNVKNNDTDLLSVEKRINDSLKPLLLMSSMLSSDSSNVDVEL FT ISYLTQSAIVLAVNTQASLSRVRRNNIAKEIYGSEVLLPIKIKDTPKMFDE FT TETERVRKLAKSIRKNNEAKQSLLKLNYHSKSNVKKLVNSSGNNTTGNSSN FT SKSSSGSNGRSNNFNGSPSNVASGSNNTKSANGTNNRFQKNKK" FT CDS 1390..4410 FT /product="DIRS1_1p" FT /note="tyrosine recombinase at C-terminus." FT /translation="MADLTTSMDHQVMLHQVATIPSLQTVPTTVFRRTRSK FT LTSRWTFVPPQTSLERIGSSKLLSRGRKWIKSPSASKLQADAKPDSDFNSR FT GSEIRLHHKGSTRLVIRRCHRTSTSKPLFKARFLLQRVYGSKTWNESTSSS FT SRSKKIKHLHQQPIIQDGRNQESTINGQTRLLHGKTRYQESLSPRFSRSAI FT QRLIPLRVERFALPLENNAVRVIDSSSYLYNVVKTCTSNVERYQRIRHRIL FT GRSINRRFNKRRMFIQPQKDNGLTCQTRFQVKSRKECSRTNSINYFSRITN FT RFGINEASCSQRKEEKCNQGNKKLFKTRLLLPKKTCWFKRKVNRTERCSHP FT IQTLHSSNKQLSLSVSDSSQWRLGSIVPHSSRCQVRDFTLVNSSKPMEWKR FT NQSVSKLRLCSYNRCLGIRCRCHSQERKQGNQNLVIPVVNNSIKHVVKSSR FT NARSANGLSSAMSETEQLQAEDSNRQHYHSLLHQSPRWSNTRSLSSIRTTL FT ETMPQEESELDWRAYSRILQCKSRPPQPSFRDESQIIDQSNQELQLATEEG FT SVQSHTTSIRSNTDGSVRISPQPSNDQLLNNQNEYTPPRLESMEAMSGLPT FT THSFAFYPGEDELIQFEEGFYNTDLPNLEISNLVSDDSSSSSSSSSSHVSS FT STGNIPRSIDQTISRVDTNPDSTTLETGDYSTFQSHVMSFARTTNTKTAEL FT LMKSWEPSTLKVYSSSYTRFRNFCTLNSLNPANITLVVFMDYLTHLFKHKP FT PLAFSTINGHRSMLNQLLLLRNQTDIVNDPFITRIMTGIHKLRPSSAKYKE FT IWDANQVFKHLSTIKVIPKYTYTALLNKTLVLCKMFGLARSSDLVKWSFKG FT LIITPDSIKGPVINAKEQRSGVVSILELTSLDDTNSQVCPVRHLATYLRAS FT KGRRKPHSGDSVFIKNEVNRSKLMILTQIVLSTLSKSGIDIVKFKSHSTRS FT AMASLLVSNNVPFHVVKKMGRWKSNDTVDTFYDKRIIGEKSGGFLNTVVQI FT S" FT CDS 1622..3445 FT /product="DIRS1_3p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MLNPIPISIPEGPKSDCITKEVQDLLLDDAIEQVLPN FT HYSKRVFYSNVFTVPKPGTNLHRPVLDLKRLNTYINNQSFKMEGIKNLPSM FT VKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAP FT RIFTMLLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLVK FT LGFKLNLEKSVLEPTQSITFLGLQIDSVSMKLLVPKEKKKSVIKEIRNFLK FT LDCCSPRKLAGLKGKLIALKDAVIPFRLYTRRTNNFHSQCLTLANGDWDQS FT FPIPQDVKSEISHWLIVLNQWNGKEISLFPSYDYVLTTDASESGAGATLKK FT GNKVIKTWSFQWSTTQSNMSSNRREMLALLMAYQALCRKLNSCKLKIQTDN FT TTTLSYINRQGGQIQDLSVLFEQLWKQCLKKKVNLIGEHIPGFFNVKADHL FT SRLSEMNHKSSTRVIKSYNWQLKKEVFNRIQLQFGQIQMDLFASHLNHQTT FT NYSTIRMNTLHLDWSQWKQCLAFPPPILLPSILEKMNSSSSKKVSIILIFP FT IWRSATWYPMIQAQVPRHHRHMFPQVLGTFQEVLTKQSVESIPIQIQQRWK FT LGIIQLSNLM" FT CDS 5195..5815 FT /product="DIRS1_5p" FT /note="ribonuclease H." FT /translation="SKLGHSNGQQLNQTCRQIVEKSPLLMAYQALCRKLNN FT CKLKIQTDNTTTLSYINRQGGQIQDLSVLFEQLWKQCLKKKVNLIGEHIPG FT FLNVKADHLSRLSEMNHKSSTRVIKSYNWQLKKASVQSHPTSIRSNTDGSV FT RISPQTSNEQLLNNQNECTPPRLESMEAMSGLSHQSFAFYPGEDELIQFEE FT GFYNTDLPNLEIRQLGI" FT CDS 5601..6257 FT /product="DIRS1_4p" FT /note="tyrosine recombinase N-terminal region." FT /translation="MDLFASHLKHQTNNYSTIRMNALHLDWSQWKQCLAFH FT TNLLPSILEKMNSSSSKKVSIILIFPIWRSGNLVSDDSSTSSSVIIVTFFL FT KVLEHSKEGIDQTISRVDTNPDSTTLEAGDYSTFQSHVMSFARTTNKKTAE FT LLMKSWEPSTPKVYSSSYTRFRNFCTLNSLNPANITLVVFMDYLTHLFKHK FT PPLAFSTTNSHRSMLNQLLLLRNQTDIVN" XX SQ Sequence 7053 BP; 2392 A; 1447 C; 1148 G; 2066 T; 0 other; aagcttcttg ttcccaaaga aaagaagaaa agtgtaatca aggaaataag aactttttat 60 attatcatat atatatatat attatgaata acatttattt atttgaattt cccaaatatt 120 taagataatt ttttagaatg ttctagacat tcgaagaata aaaaattttc gaaagaaaag 180 taaaaattcg aaccggcaca atgacgcgat aattgcgcaa ggtcgaaaaa ctgaaaaatt 240 ccgaaccgag actatgcaca aatttgtgaa gggtcgaaaa ctcttatttt tttgagtttt 300 gcgaaatttt aagaaaataa aaacgtataa tagtggcact aaaaactaaa accattttta 360 tttgggaatt cctcctatat atatttaaga tacacacaca cacaaatttt atacaaataa 420 ttatttctct ttattaattt ttaatattat tatttttctc atattataat tttagatatc 480 aacaataaaa tatgtctacc actgttaata ataatgatgc ctctagtagc aatacctctg 540 cctctaatag cgctgaatcc tttgatttaa gaatgaaatc aatggaggat caaatcaaca 600 acctttcctt agcctttaca agattcatga aagaacctat gttctcatca aataccaatt 660 cacgtagcca accttctcat gatgactcta acaccgagaa tgaacaaagt gaagatgaat 720 caagtaacaa tgtcgatgtt ccaaccgatt atcaattatc cgatacctta cttggtcagt 780 acaaacatat ggtaaacaat caaggtttac ttgtcgaaga agaatgtatc ctcaagagag 840 atgagatatc cgaattgaat aaagtattca actttccatc aaacttccaa gtgaatgtcg 900 ctccattcgg tacacctgaa ggtattactg tatcatccaa cgtcaagaac aatgatactg 960 acttgttgag tgttgaaaag cgaatcaacg atagcttgaa acctttgctt cttatgtcaa 1020 gtatgttatc atctgatagc tccaatgtcg atgtagaact tattagttac ttaactcaga 1080 gtgcaatcgt cttagccgtt aacactcaag catcgcttag tcgtgtccgt cgtaacaaca 1140 tcgctaaaga aatctatggt tctgaagtac tcttaccaat taagatcaag gatacaccaa 1200 agatgtttga cgaaactgaa actgaacgtg taagaaagct agccaagtca atcagaaaga 1260 acaacgaagc taaacaatca ttgttaaaat tgaattatca ttccaagtcc aatgtcaaga 1320 aattagttaa ctcaagtggt aataacacta caggaaacag tagtaatagc aaatccagta 1380 gtgggagtaa tggccgatct aacaacttca atggatcacc aagtaatgtt gcatcaggta 1440 gcaacaatac caagtctgca aacggtacca acaaccgttt tcagaagaac aagaagtaaa 1500 cttaccagta ggtggacgtt tgttccacca caaacaagtt tggaaagaat tgggtcttcc 1560 aaacttttgt caagaggtcg taaatggatt aaaagtccat ctgcttccaa acttcaagcc 1620 gatgctaaac ccgattccga tttcaattcc agagggtccg aaatcagatt gcatcacaaa 1680 ggaagtacaa gacttgttat tagacgatgc catcgaacaa gtacttccaa accattattc 1740 aaagcgcgtt ttttactcca acgtgtttac ggttccaaaa cctggaacga atctacatcg 1800 tccagttctc gatctaaaaa gattaaacac ttacatcaac aaccaatcat tcaagatgga 1860 aggaatcaag aatctaccat caatggtcaa acaaggttat tacatggtaa aactcgatat 1920 caagaaagcc tatctccacg ttttagtaga tccgcaatac agagacttat tccgcttcgt 1980 gtggaaaggt tcgcactacc gttggaaaac aatgccgttc gggttatcga cagctcctcg 2040 tatctttaca atgttgttaa gacctgtact tcgaatgttg agagatatca acgtatccgt 2100 catcgcatac ttggacgatc tattaatcgt cggttcaaca aaagaagaat gtttatccaa 2160 cctcaaaaag acaatggact tacttgtcaa actaggtttc aagttaaatc tagaaaagag 2220 tgttctcgaa ccaactcaat caattacttt tctcggatta caaatcgatt cggtatcaat 2280 gaagcttctt gttcccaaag aaaagaagaa aagtgtaatc aaggaaataa gaaacttttt 2340 aaaactagat tgttgctccc caagaaaact tgctggttta aaaggaaagt taatcgcact 2400 gaaagatgca gtcatcccat tcagacttta cactcgtcga acaaacaact ttcactctca 2460 gtgtctgact ctagccaatg gagattggga tcaatcgttc cccattcctc aagatgtcaa 2520 gtcagagatt tcacattggt taatagttct aaaccaatgg aatggaaaag aaatcagtct 2580 gtttccaagt tacgactatg ttcttacaac cgatgcctcg gaatcaggtg caggtgccac 2640 tctcaagaaa ggaaacaagg taatcaaaac ttggtcattc cagtggtcaa caactcaatc 2700 aaacatgtcg tcaaatcgtc gagaaatgct cgctctgcta atggcctatc aagcgctatg 2760 tcggaaactg aacagctgca agctgaagat tcaaaccgac aacactacca ctctctctta 2820 catcaatcgc caaggtggtc aaatacaaga tctctcagtt ctattcgaac aactttggaa 2880 acaatgcctc aagaagaaag tgaacttgat tggagagcat attccaggat tcttcaatgt 2940 aaaagccgac cacctcagcc gtctttcaga gatgaatcac aaatcatcga ccagagtaat 3000 caagagttac aactggcaac tgaagaagga agtgttcaat cgcatacaac ttcaattcgg 3060 tcaaatacag atggatctgt tcgcatctca cctcaaccat caaacgacca actactcaac 3120 aatcagaatg aatacactcc acctcgattg gagtcaatgg aagcaatgtc tggccttccc 3180 accacccatt cttttgcctt ctatcctgga gaagatgaac tcatccagtt cgaagaaggt 3240 ttctataata ctgatcttcc caatctggag atcagcaact tggtatccga tgattcaagc 3300 tcaagttcct cgtcatcatc gtcacatgtt tcctcaagta ctgggaacat tccaagaagt 3360 attgaccaaa caatcagtag agtcgatacc aatccagatt caacaacgtt ggaaactggg 3420 gattattcaa ctttccaatc tcatgtaatg tcattcgctc gtacaacaaa tacaaagaca 3480 gctgagctgt taatgaagtc atgggaacct tcaactctca aagtatatag ctccagttat 3540 acaagattcc gcaatttctg tactttgaac tctttgaatc cagcaaacat taccttagtt 3600 gttttcatgg attatcttac acatctgttc aagcacaaac ctccgttagc cttctcaaca 3660 attaacggtc atcgctctat gttgaatcag ttgttactcc ttaggaatca aactgatatt 3720 gttaatgatc cattcatcac aagaattatg actggtattc acaagttgcg tccttcatct 3780 gcaaagtata aagagatatg ggatgcaaac caagtattca agcacttatc tactatcaaa 3840 gtcatcccta agtacacata cactgcgcta ttaaacaaga cacttgtact ctgtaaaatg 3900 tttggtttag caagatcatc agacttggtg aagtggtcgt tcaaaggtct cattattact 3960 cctgactcaa tcaaaggtcc agttatcaat gctaaagaac aaagaagtgg tgttgtttca 4020 atattagaat taacatcgtt agatgataca aactctcaag tatgccctgt tcgccacctt 4080 gcaacatacc ttagagcctc taaaggaaga agaaagcccc attcgggtga ctctgtcttt 4140 attaagaatg aggtgaaccg ctccaagtta atgatattaa ctcaaattgt actatcaacg 4200 ctctcaaagt caggcattga tattgtcaag ttcaaatctc actctacccg ttccgctatg 4260 gcttctctgc tggtgtccaa taacgttccg ttccacgttg tcaagaagat gggtcgttgg 4320 aaatcaaacg atactgtaga taccttctac gataaaagaa tcattggtga aaaatctggt 4380 ggtttcttaa atactgtcgt ccaaatttca taatatatat atatatatga taatataaat 4440 taaaaattta atttattaaa ttatatttta tattaaatat atatatatat attagtccca 4500 tcccacccgc ccttagtcgg aattcataaa tcaaattgtt ttagttttta gtgccactat 4560 ttatacgttt ttattttctt aaaaatttcg caaaactcaa aaaaataaga gttttcgacc 4620 cttcacaaat ttgtgcatag tgtcgtcggt tcggaatttt tcagtttttc gaccttgcgc 4680 aattatcgcc gttcattgtg ccggttcgaa atttttactt ttctttcgaa aattttttat 4740 tcttcgaatg ttctagaaca ttctaaaaaa ttatcttaaa tatttgggaa attcaaataa 4800 ataaatgtta ttcataatat atatatatat atttaatata aaatataatt taataaatta 4860 aatttttaat ttaaaactag attgttgctc cccaaggaag cttgctggtt taaaaggaaa 4920 gctaatcgca ctgaaagatg cagtcatccc attcagactt tacactcgtc gaacaaacaa 4980 gtttcactct cagtgtctga ctctagccaa aggagattgg gatcaatcat tccccattcc 5040 ccaagaggtc aaatcagaga tttcgcattg gttaacagct ctaaaccaat ggaatggaaa 5100 agaaatcagt ctgtttccaa gttacgacta tgttcttaca acgatgcctc ggaatcaggt 5160 gcaggtgcca ctctcaagaa aggaaacaag gtaatcaaaa cttggtcatt ccaatggtca 5220 acaactcaat caaacatgtc gtcaaatcgt cgagaaatct ccgctgctaa tggcctatca 5280 agcgctatgt cggaaactga acaactgcaa gctgaagatt caaaccgaca acactaccac 5340 tctctcttac atcaatcgcc agggtggtca aatacaagat ctctcagttc tattcgaaca 5400 actttggaaa caatgcctca agaagaaagt gaacttgatt ggagagcata ttccaggatt 5460 cttaaatgta aaagccgacc acctcagccg tctttcagag atgaatcaca aatcatcgac 5520 cagagtaatc aagagttaca actggcaact gaagaaagca agtgttcaat cgcatccaac 5580 ttcaattcgg tcaaatacag atggatctgt tcgcatctca cctcaaacat caaacgaaca 5640 actactcaac aatcagaatg aatgcactcc acctcgattg gagtcaatgg aagcaatgtc 5700 tggcctttca caccaatctt ttgccttcta tcctggagaa gatgaactca tccagttcga 5760 agaaggtttc tataatactg atcttcccaa tctggagatc aggcaacttg gtatctgatg 5820 attcaagcac aagttcctcg gttatcatcg tcacattttt cctcaaggta ctggaacatt 5880 ccaaggaagg tattgaccaa acaatcagta gagtcgatac caatccagat tcaacaacgt 5940 tggaagctgg ggattattca actttccaat ctcatgtaat gtcattcgct cgtacaacaa 6000 ataaaaaaac agctgagcta ttaatgaagt cttgggaacc ttcaactccc aaagtatata 6060 gctccagtta tacaagattc cgcaatttct gtactttgaa ctctttgaat ccagcaaaca 6120 ttaccttagt tgttttcatg gattatctta cacatctgtt caagcacaaa cctccgttag 6180 ccttctcaac aactaacagt catcgctcta tgttgaatca gttgttactc ctcaggaatc 6240 aaactgatat tgttaattga tccattcatc acaagaatta tgactggtat tcacaagttg 6300 cgtccttcat cagcaaagta taaagagata tgggatgcaa atcaagtatt caagcactta 6360 tctactatca aagtcatccc caagtacaca tacactgcgc tattaaacaa gacacttgta 6420 ctctgtaaaa tgtttgttta gcaagatcat cagacttggt gaagtggtcg ttcaaaggta 6480 tcattattac tcctgactca atcaaaggtc cagttattaa tgctaaagaa caaagaagtg 6540 gtgttgtttc aatcttagaa ttaacatctt tagatgatac aaactctcaa gtatgccctg 6600 ttcgccacct tggcaacata ccttagagcc tcaaaaggaa gaagaaagcc attcgggtga 6660 ctctgtcttt attaagaatg agggtgaacc gctccaagtt aatgatatta attcaattgt 6720 actatcaaca ctctcaaagt caggtattga tattgtcaag ttcaaatctc actctacccg 6780 ttcgctatgg cttctctgct gttgtccaat aaagtttcgt tccacgttgt caaaaagatg 6840 ggtcgttgga aatcaaatga tactgtagat accttctacg ataaaataat cattggtgaa 6900 aatctggtgg tttcttaaat actgtcgtcc aaatttcata atatatatat atatatgata 6960 atataaatta aaaatttaat ttattaaatt atattttata ttaaatatat atatatatat 7020 tagtcccatc ccacccgccc ttagtcggaa ttc 7053 // ID P-11_HM repbase; DNA; INV; 3202 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3202 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 357-357 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 139..2721 FT /product="P-11_HM_1p" FT /translation="MVNKCAVFNCRTGYKSKKNESSSLSASKAVFSFPKDT FT VLEEKWMKFVNRKDWKPSKHSVICLDHFEKKYLKYGKRVTLKYELNPVPTI FT YSDEISIPSSIIPKIPTFRKPPTDRHSTIPDEIHTFQEMDKIKNIDMLHES FT NSPNGFSFQRYESFVLYYRLFFVDTTPFIESIRIDNNLHVKLHYKSSPLPL FT PKWFRTINNCKLTSLSILNNLVSYMHNLVEENSYGLVNELSNLIFYKSKGR FT PPYSSQVLRFALMQRYSSFQAYSLLLKQLPLPSMSLLKKLVSGGIDSIKSL FT KLLLNEGKISNDCVLLFDEMYLQQSCEYHSGRLIGQDTEGNLYKGIIVFMI FT VGLKKSIPYVIKAIPEISLTGDFIKDEIEESLNTLRSAGFNIRALIADNHP FT TNVSAYSKLLSVYGSEKPEENFCITFNGHIIYLMYDSVHLLKNLRNNLIST FT KKFIFPSFQFTDYIFNIKVSAGDISWGLLHRVHEKDETLSANLRKASKLGE FT KTLYPGNNKQDVNLALNIFHETTSAAIQSYFPNEFSASEFLKLVNLWWVIS FT NSKMKYSNHKFGNASVMGDGKPQFLRAFAHWIDQWQNSQYSSTERYSLTAQ FT TSKALIVTLQSTSSLIEDLLFEGYNYVLTSCFQSDPLERHFSKYRQMSGGR FT FLVGLREVITSEKILQMRSLLKENFDFWKMDLKTFKSVVNINSISELFIND FT IQECDLDDNSKEVAIFIAGYICRKLLKKVKCELCSLLLKSDLIDLNTEYIE FT LLSRGGLITPSKLLANHVIICFAMLDAVKKSLLTIEILDIRKVSEYILYEY FT NLNHSFICEDHLKWGLSLVNQTVINIFYNNEQKILSNGKRKDAVLSFKRRQ FT REKKKKEM" XX SQ Sequence 3202 BP; 1178 A; 434 C; 467 G; 1123 T; 0 other; cggatatgtt gcataaatag gccgaaaatt agaaggccag tttttagcat ttcggttatt 60 tttcatactg ccagttaata ttgtttaaat tttattttgg gagatattga ttaagttgga 120 aatttagttt tggatataat ggtgaacaaa tgtgctgttt ttaattgtcg aactggctat 180 aaatctaaaa aaaatgaatc tagttctctt agtgcaagta aagctgtttt tagttttcct 240 aaagatactg ttttagaaga aaaatggatg aaatttgtta acagaaaaga ttggaagcca 300 agcaaacatt cagttatatg cttggaccat tttgaaaaaa agtatttaaa atatggaaaa 360 cgagttacct taaaatatga gctaaaccct gttccaacca tttattctga tgaaatttct 420 attccatctt ctataatacc taaaattcct acttttcgta aacctccaac tgaccgacat 480 tctaccatcc cagatgaaat tcatacattt caagaaatgg ataaaataaa aaacattgat 540 atgcttcatg aatctaactc tccaaatgga tttagttttc aaagatatga aagttttgtt 600 ttatattatc gtttattttt tgtagacaca actccattta ttgaatcaat acgcatcgat 660 aacaatttgc acgtaaagtt acattacaaa agttctccac ttccactccc taaatggttt 720 agaacaatta acaattgtaa gcttacaagt ttatcgattt taaataattt ggtgtcttat 780 atgcacaacc ttgttgaaga aaattcgtat ggtttagtta atgaactatc aaatttaatt 840 ttttacaaat ccaaaggtcg tcctccatat tcttcacaag tattacgttt tgcacttatg 900 caaagatatt catcatttca ggcttattcc cttcttttga aacaactccc attgccatca 960 atgtcactac taaagaagtt agtttctggt ggaatcgatt caataaaatc attaaaactg 1020 ttattaaatg aaggaaaaat aagcaatgat tgtgttctcc tttttgatga aatgtattta 1080 cagcaatctt gtgaatacca cagtggtaga ttaattggtc aagatactga gggtaatctt 1140 tacaaaggaa taattgtgtt catgattgtt ggtcttaaaa aatcaatacc atatgttatt 1200 aaagctatcc cagaaattag tcttactgga gattttatta aagatgaaat agaagaaagt 1260 ttaaatacat taagatctgc tggttttaac attcgtgcat tgattgctga taaccatcca 1320 actaatgtgt cagcttattc aaaattactc agcgtttatg gttcagagaa gccagaagaa 1380 aatttttgta tcacttttaa tggtcatata atttatttga tgtatgatag tgtacatttg 1440 ctcaaaaact taagaaataa tttgattagt accaaaaagt ttattttccc atcatttcag 1500 ttcactgatt acatttttaa tataaaagtc tctgcaggtg atatatcttg gggattgctt 1560 catagagtgc atgaaaaaga tgaaactcta tcagcaaact taagaaaggc atcaaaacta 1620 ggagaaaaga cattatatcc aggtaacaat aaacaagacg tcaacctagc attaaatata 1680 ttccatgaaa caacatctgc agcaattcaa agttattttc ctaatgaatt ttcagcatca 1740 gaatttttaa aacttgttaa tttatggtgg gtaatatcta attctaaaat gaaatactct 1800 aatcataagt ttggaaatgc atctgttatg ggggacggta agccacaatt tttacgtgca 1860 tttgctcact ggattgatca gtggcaaaat tcgcaataca gttccaccga aagatattct 1920 cttacagctc agacttctaa agcccttatt gtaacactgc aaagtacttc tagtttaatt 1980 gaagatcttc tatttgaagg atataattat gttctaacct cttgtttcca aagtgatcca 2040 ttggagcgac atttttcaaa gtatcgtcaa atgagtggtg gcagatttct tgttggatta 2100 cgtgaagtaa taacttctga aaaaattttg caaatgagat cattattaaa agaaaatttt 2160 gacttttgga aaatggattt aaaaactttc aagtcggttg taaatataaa ttcgatttct 2220 gaacttttta ttaatgatat tcaggaatgt gatttagatg ataacagtaa agaagttgct 2280 atttttattg ctggatatat atgcagaaaa ttactcaaaa aagttaaatg tgaattatgt 2340 tctttattgc taaaaagtga tctcattgat ttaaatacag aatatattga actactatcc 2400 agaggaggtc tcataactcc ctcaaaattg cttgcaaatc atgttattat ttgctttgct 2460 atgcttgatg cagtaaaaaa aagtttactt acaatagaaa ttcttgacat cagaaaagtt 2520 agcgaatata ttttatatga atataacttg aaccacagct ttatttgtga ggatcattta 2580 aagtgggggc tttctcttgt taatcagaca gtaataaata ttttttataa caatgaacaa 2640 aagattttaa gtaatggaaa gcgcaaagat gctgtattat cttttaaaag aagacaaagg 2700 gaaaagaaaa aaaaagaaat gtagtttatg tatatatata tatatatata tatatatata 2760 tataaataga tatattttgt atatgaaatc ttgatgaaac ttgctattag tattttttga 2820 tttttgcctg aaatctctac ggtaaaaaaa tgtgtataaa aatagtcata ttcctaattt 2880 tatatttaaa atattaagtt gttcaaaaga aaaaattatt tgaagtgaac tacgaaaatc 2940 aaataaacaa attaaaaaaa aaactactag atgttaataa taaaatcaac ctaaacgttt 3000 ttaaataagg aaaaactaat acgaaaagat attttactat ataaatctat ttaatttaaa 3060 taattataat atatttatta acttaaatta aaaaacctag caatttaatt atatttaaac 3120 ctatttaaac ttaaatcccg aaaacttgta atattgaaaa actggccttc taattttcgg 3180 cctatttatg caacatatcc gg 3202 // ID MuDR12x_AP repbase; DNA; INV; 1397 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR12x_AP. XX NM MuDR12x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1397 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1361-1361 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(164..208,302..931) FT /product="MuDR12x_AP_1p" FT /translation="MYVYHLINYVMLRLAXGKRLTVLDNFKFKFKYESKIS FT GNKTWECANKTCKAKLVFNKEKTLIQEKSVLDHSHYIFGLPFLEPDEVEDS FT FVFDLIADMPSNNQIVQFCDYLTETYMQDIFPPSLWASKSEVITNNVCESF FT HSKFNAYFYHHHPSLYKFIDALQDIQVDTYIKIRSAEEGVIQKRNKATTQK FT YKFINERRQMKEDGRISRYEFLKKVCFKNKRQNVN" XX SQ Sequence 1397 BP; 519 A; 185 C; 197 G; 493 T; 3 other; cctttttcta aaagttgata tgtaaactgc ggcaaaaaaa aatttacatt atacaatctg 60 cgacaaaacc gataagatac attatacaaa ctgcggcaat ttattttata taccttctat 120 atacctatac tatattatag tagtacctaa atcacgtacc aatatgtacg tatatcattt 180 aattaattat gtgatgttgc gtcttgcgta gaaattggta aggtaaatcc aaatgtattt 240 ttaagtttta acgaagcata tcggtaatat cggttccttt acttttgata atactagtta 300 gnatggcaaa cgattaacag tgttagacaa ttttaaattc aaatttaaat atgaatcaaa 360 aataagtggt aataaaacgt gggaatgcgc taataagaca tgcaaggcca aattggtctt 420 caacaaggaa aaaactctaa tacaagaaaa atcagttttg gatcatagtc attatatttt 480 tggtcttccg ttccttgaac ctgatgaagt cgaagattcg tttgtttttg atttaattgc 540 agatatgcca tctaataatc aaatagtaca attttgcgat tatttaacag aaacttatat 600 gcaagacata ttccctcctt cnttgtgggc atcgaaatca gaagttataa caaataatgt 660 atgtgaatct ttccattcaa aatttaacgc atatttttat catcatcatc catcactata 720 taaatttata gatgctctac aggatattca agttgacaca tatataaaaa ttagaagcgc 780 agaagaaggg gtaatacaaa aaagaaataa agcgacaaca caaaaataca aatttattaa 840 cgaacggcgg caaatgaaag aagatgggcg aatttcaaga tatgaatttt tgaaaaaagt 900 atgttttaag aataaacgac aaaatgttaa ttaggtacct acttattata ttattttatt 960 ttaaaaattt ttttttgatt ttacgatcac tatacgatct tttaaaataa tttttaataa 1020 taataatatt tttcaatttt aaaattggtt ttttctttat ttttacgcct aatactctaa 1080 gagtctctta gagacttcta ttgagtataa tattataggt aatattcatg ttaaaaatac 1140 atttgtattt acctaaccaa tttctacgca agacgcaaca tcacataatt aattaaatga 1200 tatacgtaca tattggtaca cataggtagg tatacgtgat ttaggtacta taatatagtt 1260 anatgttata tcgtatatat agaaggtata taaaataaat tgccgcagtt tgtataatgt 1320 atcttatcgg ttttgtcgca gattgtataa tgtaaatttt ttttgccgca gtttacatat 1380 caacttttag aaaaagg 1397 // ID Crack-19_AAe repbase; DNA; INV; 4320 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-19_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4320 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1235-1235 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 183..1037 FT /product="Crack-19_AAe_1p" FT /translation="MDELLATMKGLQVSMDKLNQKCDENGVNQKKTDDAVK FT KLSASICALTKSQSDIKDSVEEVKSSQTFIAAQYDDIKSLHEDMKREFAGC FT STKVSEHDGQILALNNEVIEMKKRLRMAEQNGLKNEIIINGIPKEVSLAEA FT TIVTKVAAAVGVQLFSTDIERTHRSRNGMILVDFSNLKVRNDILRARKGKS FT IYLDEIDFGNAVLPGSSRSSPTNKRXHTKVFINENLTRETRSIFREAKSLR FT GSHGYKYVWCNNGNIYCKKDDSADVYIIDSTEDLKRLRSSPRKA" FT CDS 1138..4026 FT /product="Crack-19_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNTNDDDFISNVRLGDNLDIAYTHVNETFTNTKINLL FT YMNINSIRNKLADLLVYISNFGSIVHIICITEIRLNPDETYICNIPNYDVV FT CCPRSNRSGGGACMFIHQSMQFDVTKNEEFLEGSCIVVSLREPKLSIAVIY FT RPPHASLNDSIHYLDSLLESAGKLVCVGDFNINLLSTNSSNYVSMVESNGF FT SFLNKIDHNSATHGNVPTGTIIDHSFTNLYSQKFCMCIDNISFTDHKALLI FT SFGSNXRLSNNVKTTNIKKYNYGDISHDLAQFVVSADSLDNLNQLIEDCMS FT KNSYVIHSHRKYRPKAPWIDEIVLREIRHRDQLYQRMVQWPQNSTIRNNFK FT KQKNXVTRLIKQRQKLYYQNKIALNRCNIRKVWSILNEIVFRKPSSNTTIS FT KILINDRILTDKSEICEAFNYYFKNIPNELFSNLTQTFNNTPRNFTLDRSI FT QNSIVMLPASNNEISNCIRNLSNSSAVGVDGIPSKILKQNEEIIAPKLTCL FT INQALDEGVFPNCLKSARIVPIYKSGHKTYVGNYRPIAILPTLSKPFERII FT HRRLCNFFDQHNIISAHQYGFRPNSNTTSATVNLVNEIQLNLDQRKLCSAI FT FIDVSKAFDCVPHTILLQKLQNYGIRNKAFALLESYLKNRVQCVRIGNNTS FT TYLTVDSGVPQGSILGPLLFNVFINDIFDLPLRGKLQLYADDAALVYNSSS FT VESLYQDMQHDLNLIHLWFFHNGLTINASKTKYIVFSSTDRLNVLPDLWLG FT EEKLERVFSLTYLGLIIQQNMKWNLHIEQTHKKLNKFLGILRQSSYILPIK FT EKKNLFYAHVHSQICYLSVIWQNAPNYVISKISTTLNKFMRTIFWEDYLDP FT SIRTADLYKNNKIMNFNQIKYFESALFVYKVKNNIIKHTLDFQSLSELHRF FT NTRNMNNIRMITPRTNYIRYGCIYPAILNFNDLPLNLKNIPSIYTFKKALK FT MYVLEXIV" XX SQ Sequence 4320 BP; 1513 A; 795 C; 770 G; 1238 T; 4 other; agtgctcagt ctgcaacact gaaatcaaag tagacgataa acatgtgaaa tgtgtctatt 60 gcagcaatgc gttgcacgtg aaatgtgcgg gtctatctga atcgcgctat aacaaaattc 120 agaaaaacca ttgctattac tgctctggtc aatgtgagac ctcacatgtg aacagcaaaa 180 agatggacga actacttgca accatgaaag gcctacaagt ttccatggac aaacttaatc 240 aaaagtgtga cgaaaacggt gtgaaccaga aaaagacgga tgatgctgtg aaaaagttga 300 gtgcgtccat atgtgcactc acaaaatccc agagtgatat caaggattcg gttgaagagg 360 tgaaatcgtc ccaaacattc atcgctgcgc agtatgacga catcaagtcg ttacacgagg 420 acatgaagag ggaatttgcg ggatgctcga cgaaagtgag tgagcatgac ggccagattc 480 tagcgctgaa caacgaagtc atcgagatga agaagcggct acgtatggct gaacaaaatg 540 ggttaaaaaa cgaaatcata ataaacggta tcccaaagga ggtatctctt gccgaggcga 600 ctatcgtcac aaaagtagct gcagcagttg gggtacagtt gttttccaca gacatcgaaa 660 gaactcaccg cagcagaaat ggaatgattt tggtcgactt cagcaatttg aaagtgcgga 720 atgacatact acgtgcacgc aagggtaaat cgatatacct ggacgaaatc gactttggca 780 atgcagtgct acccggcagc tctaggtcct cgcctaccaa caagcgtcmt catactaaag 840 ttttcataaa tgagaacttg accagagaaa cccgttcaat ttttcgagaa gccaaatctc 900 tccgtgggtc ccacgggtat aagtatgtgt ggtgcaataa cgggaacatc tactgcaaga 960 aagatgactc tgcggatgtc tatatcattg attccactga agacctcaag agattgcgct 1020 catctcccag aaaagcttga cctctcatac tctctgttgt tgcttgctgt tggtgttgtt 1080 ggaggctgtt gaggatggtg ttggtgcgtt gctgtccata caaatatatc cttcacaatg 1140 aacacaaacg atgatgactt cattagcaat gtacgattag gtgataattt ggatatagct 1200 tacacacatg taaatgaaac ttttaccaat actaaaatca acttattgta tatgaacata 1260 aactcaatac ggaacaaact agcagattta ttggtttaca tttctaattt tggctcaatc 1320 gttcatataa tatgcataac tgaaataaga ctgaacccgg acgaaactta catctgcaac 1380 atacccaatt atgacgttgt ttgttgtcca agatctaata gatctggtgg aggtgcctgt 1440 atgttcattc atcagtcaat gcagttcgat gtgacgaaaa atgaggaatt tttggagggt 1500 agctgtattg tggtatcact aagagagcct aaactgagca ttgcagttat atatcggcct 1560 ccacatgcta gtttaaacga cagtatacac tacttggatt ctcttctgga atctgcagga 1620 aagttggtat gcgtaggtga cttcaatatc aaccttctta gtacaaattc tagcaactat 1680 gtatcgatgg tagagtcaaa cgggttttcg tttttaaaca aaattgacca caacagcgca 1740 acgcatggta atgtgcctac aggaacaata atagatcatt cattcaccaa tttatattcg 1800 cagaagttct gtatgtgtat tgacaacatt agcttcacag atcataaagc tctcttaatt 1860 agtttcggct caaacamtcg tttatccaat aatgtaaaga ctaccaatat caagaagtac 1920 aactatggag acatttctca cgatctagcg caatttgtag tatctgcaga tagtttggat 1980 aatttaaacc aactgataga agattgcatg tcaaaaaatt catatgtaat ccattcccat 2040 agaaaatatc gacccaaagc accatggata gatgaaatag ttttaagaga aatacgacac 2100 agggatcaat tataccaaag aatggttcaa tggccacaaa actccactat ccgtaataac 2160 tttaaaaaac agaaaaactw tgtgactcga ctcattaagc aaagacagaa gctttactat 2220 caaaataaaa tagcactgaa tcgttgcaac ataagaaagg tgtggagcat tttgaatgaa 2280 attgtatttc gtaaaccatc tagtaacaca actatctcaa aaatacttat aaacgatagg 2340 atattgacag ataaatcaga aatatgtgaa gcgtttaact attatttcaa aaacatacct 2400 aatgaactgt tttcaaattt gacccaaaca ttcaacaata cgccccgaaa tttcacttta 2460 gatagatcta ttcaaaattc tattgtaatg ctgcccgctt cgaataatga gataagtaat 2520 tgcattcgta acctcagtaa ctcaagtgca gtaggagtgg atggtattcc atcaaaaatt 2580 ctaaaacaaa atgaggaaat tattgctcct aaattaacct gcttaatcaa ccaagccctc 2640 gacgaaggtg tttttccgaa ttgtcttaaa tcagcaagaa tagtaccaat atacaaaagc 2700 ggccataaaa cgtatgttgg aaattataga ccaatagcta tcctgccaac tttgtccaaa 2760 ccgtttgaaa gaatcattca tcggagactt tgtaattttt ttgatcaaca caacataatc 2820 agtgcacacc aatacggatt tcgcccaaat tcaaatacta ctagcgcaac tgtaaattta 2880 gttaatgaaa ttcaattgaa cctagaccag cgcaaattat gctctgcaat atttatagac 2940 gtgtcaaaag cattcgactg tgttcctcac actatattac tacaaaaact tcaaaactat 3000 ggtataagaa ataaggcttt tgcgctgctt gaatcgtatt taaaaaatag agtccaatgt 3060 gtaagaatag gtaacaacac tagtacatat ttaactgttg attctggtgt accacagggc 3120 tcgatattag gtccactgct tttcaatgtt tttatcaatg acatttttga tttgcctttg 3180 aggggaaaac tccaactcta tgccgatgat gctgccttag tttacaactc aagtagtgta 3240 gaatctctat accaagatat gcagcatgac ttaaacttaa ttcatctctg gttttttcat 3300 aatggattga cgattaatgc atctaaaact aagtatattg tattctccag taccgatcgc 3360 ttaaacgttt tacctgactt gtggttaggt gaagagaaac tagaaagagt tttttctcta 3420 acttacttag gactcataat tcagcaaaat atgaaatgga acttacacat tgaacagact 3480 cataagaagc taaacaaatt tctcggtata ctccgacaga gtagttatat tttacccatt 3540 aaagaaaaaa agaatctgtt ttacgctcat gtccactcac agatttgtta cttaagtgta 3600 atatggcaaa acgcacccaa ttacgttatt agtaaaattt ctacaacatt aaacaaattc 3660 atgagaacta tcttttggga agactattta gacccgagta ttagaactgc agatctatac 3720 aaaaacaaca aaataatgaa cttcaatcaa attaaatatt ttgagtcggc tttgtttgtg 3780 tataaagtta aaaataatat tattaaacat actttagatt ttcaatcttt gagtgagcta 3840 caccgattca atactagaaa catgaacaat attagaatga taacaccaag gactaactat 3900 atcagatacg gttgtattta cccggctata ttgaacttca atgatttacc tctgaatttg 3960 aaaaacattc cttcgattta tacattcaaa aaagcactga agatgtacgt attggaamac 4020 attgtttaaa acttatatat tattagaaaa atacttagaa tactctcact ggcacgaaat 4080 actatcgata taagacctcc caatcagtcc ttagactgtc tgaggtctac cataacttta 4140 tacgttattc gttaggggaa gaaaacaaga ttatcgaaac attgtaaatt actattgtaa 4200 aaattttaaa ggggttttta cgcctgagat tgaaccatat aaatgatgaa aaatgtttcc 4260 atcttggctt ttccccttgt tgaaaaaaaa aaataaaata aataaaataa aaaaataaaa 4320 // ID Gypsy-5-I_HM repbase; DNA; INV; 3711 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 24-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-5-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3711 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1976-1976 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 34..3669 FT /product="Gypsy-5-I_HM_1p" FT /translation="MSKILKPTRLDLDHNAPTASKEWKHWKRTFENFIEDC FT GEDAPDKFRSIINFVSSNVYDYIEECDSYESVIETLENLYIKTPNEIFARH FT QLLIRHQTSSESLEDFLQELRKLSKNCNYKDVSAEQYREEQIRDAFISGIN FT SNYIRQRLLENSTLNLQSAFNQARSLDIAQRNSDSYMQKTSSFSSNLNIXA FT AIKLPEEPSAALNAMTINPQTKCIYCGNQSHNQRNQTSARKNCPAQNVTCF FT KCGKKGHFSKVCLGRPMNITHSVSAAIHKPSLCTVSSGCPLSLSHAAVIVR FT INDEPLTALIDSCSSDNFISKKIIETLKIKTLPSKKKILMALTTMESGLVG FT CCSVNLKVNGLFYNNVQFGVLENLCSDIILGYDFQKLHKNLIFPLGGEKED FT LIVTKESNLCALQMAKIKIPSLFTSISDKTQPIATKSRRYNNDDRNFIESE FT ILQLLSDGIIEPCFSPWRAQVVVVKDLTKSNKKRLCIDYSQTVNLYTELDA FT YPLPRIDDMINGLAKYKVFSTFDLKSAYHQIPIKNSDKKYTAFEANGKLFQ FT FCRIPFGVTNGVSMFQRSMDKFVEEEKLHDTFPYLDNITIGGYDQAHHDKN FT CLRFQDASQRWGLTLNESKSVISASSINILGYCVSHNNIKPDSERLRPLDL FT LPPPSNILSLRRTLGMFSYYSKWIPSFANKVRPLINVKSFPINSEALEAFQ FT FLKRELHAATLSSIDESLPFVVECDASDVAVSATLNQNGRPVAFMSRTLGK FT SEINYPSVEKEAVAIIEAVRKWRHLLLRNQFKLITDQRSVAFMFDNRKKTK FT IKNSKIQCWRMELAEFSYTISYRPGSDNIVPDALTRAFCATTQSQTNLSGI FT HSCLCHPGVTRMLHFVKSKNLPFSTEEVKRVCSNCQICSELKPKFYRSSNV FT CELIKSIRPMDRLSIDFKGPLPSNSRNKYLFTVIDEYSRYPFAFPCPDISS FT TTVVKCLDQIFSFCGFPTYIHSDRGSSFISNELKSYLTQKGIATSNSTPYH FT PISNGQVERYNGLIWKNICLALKTHNLDVRNWEIVLSDALHSLRSLLSTAT FT NCTPHERLFSFPRRSSSGTSLPSWLTPGPIFLRRFVRTNKNDNLVDKVELV FT DVNPTYANIRYPDGRESTVSLQDLAPYPAPKEKFNDINENSINESSENVTA FT KVLDVNKDPSPEIASEIRVRVDEPVTEPFADVLRRSARACKAPVKYDDYIL FT K*" XX SQ Sequence 3711 BP; 1230 A; 679 C; 636 G; 1165 T; 1 other; atttatttaa caaacatata aaattattgg gaaatgtcaa agatcttaaa accgacgagg 60 cttgatcttg atcataatgc accaactgct tcaaaagaat ggaagcattg gaaacgaaca 120 tttgaaaatt tcattgagga ttgtggagag gatgctccag acaaatttcg ctccatcatt 180 aattttgttt catcaaatgt ttatgattac attgaagagt gtgattccta tgaaagtgta 240 attgaaaccc tagaaaactt atatataaaa acccctaatg aaatttttgc aaggcatcaa 300 ttacttataa gacatcaaac atcgagtgaa tccttagaag actttcttca ggaactacgt 360 aaactcagta aaaactgtaa ttacaaggat gtttcagctg aacaatatcg agaggaacaa 420 ataagagatg cttttatttc tggaataaat tcgaattaca ttcgtcaacg acttcttgaa 480 aattcaacat tgaatttgca atcagcattt aaccaagctc gttctcttga cattgcacaa 540 agaaactcag attcttatat gcagaaaaca tcctcctttt cctccaattt aaatattgyt 600 gccgctataa aattaccaga agaaccctca gcagcactca atgctatgac tattaatcct 660 caaacaaaat gcatttattg tggtaatcaa tctcacaacc aacgtaatca aacctctgct 720 aggaaaaatt gtccagctca aaatgtaacc tgttttaaat gtggaaaaaa aggtcatttt 780 tccaaagttt gtcttggcag gccgatgaat ataacccaca gtgtttcagc tgcaatacat 840 aaaccaagtt tgtgtactgt ttcaagtggt tgtccgttga gcctctctca tgcagctgtt 900 attgtcagaa ttaatgatga accgttaact gcactaatcg attcctgtag ctcagataat 960 tttataagca agaaaattat cgagacttta aaaattaaaa cgttacccag taaaaagaaa 1020 attttgatgg ccctcacaac aatggagtct ggattagtgg gctgttgttc ggttaatctt 1080 aaagtaaatg gtttatttta taataatgtt caatttggtg tcttagaaaa tctttgtagt 1140 gatatcatac ttggttacga cttccaaaaa ctacacaaaa atttaatatt cccactcggt 1200 ggagaaaaag aagacttaat tgtcacaaag gaaagtaacc tatgtgcact gcaaatggca 1260 aaaataaaaa taccatcctt gtttacaagc atttcagaca aaactcagcc aatagctaca 1320 aaatcaagaa gatataataa tgatgatcga aactttattg aaagtgaaat attacagtta 1380 ttatcggatg gtataattga gccatgtttc tcaccatgga gagcccaggt agtggtcgta 1440 aaggatctta ctaaatctaa caaaaaaaga ctttgtattg attactctca aacagttaac 1500 ctttacactg agttagatgc atatcctctt cccagaatag atgatatgat aaatggactt 1560 gcaaagtaca aagtattctc tacatttgac ttgaaaagtg cttaccacca aatccctatt 1620 aaaaattcag acaaaaagta tacagcattt gaggcaaatg ggaaactatt tcaattttgc 1680 aggattccat ttggagttac taatggtgtg tctatgttcc aaaggtctat ggacaagttt 1740 gttgaagaag aaaagttaca tgacactttt ccatatcttg acaatataac cattggtggt 1800 tatgatcagg cacatcatga taaaaattgt ctgcggtttc aagacgcttc acaacgttgg 1860 ggtctaactt taaatgaatc aaaatcggtt atttcagctt cttctataaa cattcttggt 1920 tattgcgtta gtcataacaa cataaaacca gattctgagc gattacgtcc tttagactta 1980 ctacctcctc cttctaacat tttatcactt cggcgaactt tgggtatgtt ttcttactat 2040 tcaaagtgga taccttcgtt tgctaataaa gttcggcctt taattaatgt taaatccttt 2100 cctatcaata gtgaggcact tgaagcattt caatttctta aaagagaact tcatgctgcc 2160 actttgagtt ctattgatga gagcttacct tttgttgttg aatgtgatgc atcagatgta 2220 gccgtatcag ctaccttgaa tcaaaatggt cgtcccgttg cctttatgtc tcgaacatta 2280 ggcaaaagtg aaatcaatta cccatcagta gaaaaagaag ctgtggctat tattgaagcc 2340 gttcgaaaat ggcgacattt actgttacgg aatcaattta aattaattac tgatcagcgt 2400 tctgtagctt ttatgtttga taaccgaaag aaaacaaaga ttaaaaatag taagattcag 2460 tgctggagga tggagttagc agagtttagt tatactatat cgtatcgtcc tggaagcgat 2520 aatatcgtcc ctgatgcact cacacgtgct ttttgtgcta caacccaatc tcaaacaaac 2580 ctaagtggta tacatagctg tctatgtcac ccaggggtta ctcgtatgct tcactttgtt 2640 aaatctaaaa atttaccttt ttctacagaa gaggtcaaga gagtttgttc taactgccag 2700 atttgttcag aattaaaacc aaagttttat cgatcctcaa atgtctgtga gttaatcaag 2760 tcaatacgtc caatggatcg cctaagtata gattttaaag gaccattgcc atctaattct 2820 agaaataaat atctgtttac agtcattgat gaatattctc ggtatccatt tgctttccca 2880 tgtcctgata ttagttcaac aactgttgtg aagtgtttag atcaaatatt ctccttttgt 2940 ggttttccaa cgtacattca ttccgacaga ggctcatctt tcatatctaa tgaattaaaa 3000 tcttacctta ctcagaaagg aattgcaaca agtaattcta ccccatatca tcctatcagt 3060 aatggacaag ttgagagata taatggcttg atctggaaaa atatttgttt ggcactaaaa 3120 acccacaacc tagatgttag aaattgggaa attgttctat cagacgctct ccactcactc 3180 cgttcgctac tttcaactgc aactaattgt acaccccatg agaggttatt cagttttcct 3240 aggcgctcgt catccggaac ttcgttaccc tcatggttaa ctcctggccc aatttttcta 3300 agacgttttg tgcgtacaaa caaaaacgat aatctcgtgg ataaggttga attagtggat 3360 gtaaatccaa catatgctaa tattcgttac cctgatggac gagagtcaac tgtttcactt 3420 caagatcttg caccgtatcc tgccccaaaa gaaaagttta atgatatcaa tgagaatagt 3480 atcaatgagt ctagtgagaa tgtaactgcg aaagtacttg atgtaaataa agacccctca 3540 cctgaaattg catcggaaat cagagttagg gtagatgagc cagtaactga gccatttgct 3600 gatgtattac gacgttctgc cagagcttgt aaagctcctg taaaatatga tgactatatt 3660 ttaaaatgac aaaactagtt tatgggcctt tttctacata gatgggaaga t 3711 // ID Copia-1_TCa-LTR repbase; DNA; INV; 185 BP. XX AC ChLG6; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_TCa_; KW Copia-1_TCa-I; Copia-1_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG6; Positions 1875620 1875804. XX SQ Sequence 185 BP; 63 A; 29 C; 31 G; 62 T; 0 other; tgttaagata caattcagca cgctatacaa ccaattatac ggttagagtt agtgactttt 60 tgttaagttg gtaataatgt aagtaagaaa atggcgtgta caagaagcaa gtagttttaa 120 gttgaataaa tccgtttttg cttaaatcac tttatatatc ttccatcgtt ccacgaaccc 180 taaca 185 // ID hAT-19_SM repbase; DNA; INV; 1697 BP. XX AC . XX DT 10-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-19_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1697 RA Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 68-68 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 539..1486 FT /product="hAT-19_SM_1p" FT /translation="IVPLIRAKILSFXIYVFSLYAIVHSQLEDNDSEYEDL FT LLFCHVRWLSRGSFLERFLNLLPEIITFLDTMGEKHEQLEDPVWVKKLAFL FT TDFMGHYNSLNLQLQGKGKNIIELVSSVNAFKAKLKLFASQLKRQNFKHFP FT YLEKHIKLAGECNIEIFCFELENFNQEFEKRFANLNNLQPIFEFISFPFGE FT IDNENISSNIAHTFQLNSSNLELEILTLQSDIILKSHANENNFWNLLPEEK FT YPLLRSVAMRIFAFFGSTYLCEAAFSQMKNIKSQFRSSLTDDHLMASIRLC FT ISDYKPNFTRLVDEMECHTSTSKN" XX SQ Sequence 1697 BP; 594 A; 261 C; 256 G; 583 T; 3 other; tattctakat caggggtggg caaacggtcg atcgcgaaca gttcagcagt cgatcgcggg 60 catttaaaat aatttactag gttaatttca aattttttaa tttaaaatat gtactaaaaa 120 tttagcagac gatgaaacat taatttaaaa ttaaacatgt atctatagta caactgacgt 180 gactaattcc ctgaatacac cttgtggtga actgctcaac ggccttatcg aaatctatat 240 aaatattata ttaaaagaca cgattttgga aatattgaat cgtccgatca aatatcatga 300 tgttctagta atcgtactat cgatctgtat tacctgctta ttcaatttct acatcagtca 360 agacgatctt tcagtttcaa atcaacaaca gaaatttcat cggtcgtttc gccatgcaaa 420 accgaatatc aatttcaaac ttacaaggaa gatcactctc cttatgattt cgcagatcta 480 gcattggctc aaatttttac gtctttcggg aacgttatca gtgctaaggt gtatttagat 540 cgtgccacta atcagagcaa agatactttc ttttyggata tatgtttttt ctttgtatgc 600 tattgtacat agtcaacttg aagacaatga ctcggaatat gaagatttac ttcttttttg 660 tcatgttagg tggctcagcc gaggaagctt tttagaaaga tttctaaatc ttttgcctga 720 aattattact tttttggaca caatgggaga aaaacatgaa caattggaag atccagtatg 780 ggtaaaaaaa ttagcgtttc ttacggattt tatgggccat tacaactcgc ttaatttaca 840 attacaagga aaaggaaaaa atattattga gttggtcagt tcggtaaatg cttttaaagc 900 aaaattaaaa ctttttgctt cacaattaaa aaggcaaaat tttaaacatt ttccatattt 960 agaaaaacat attaaacttg ctggtgaatg taatattgaa atattttgtt ttgaactgga 1020 aaattttaat caagaatttg agaaacgatt tgcaaatctc aataatcttc aaccgatttt 1080 tgaatttatt tcatttcctt ttggagaaat cgataacgag aatatttcat caaatatagc 1140 tcataccttt caattgaatt catcgaatct agagttagaa attttaacat tacaatcaga 1200 tataattctg aaatcacatg caaatgaaaa taatttttgg aatttattac cagaagaaaa 1260 atatccatta ttgagatcgg tggcaatgag gatatttgca ttttttggtt ccacttatct 1320 atgtgaagca gcgttttctc aaatgaaaaa catcaaatcg caatttagga gttctctgac 1380 tgatgaccac ttaatggcat ctattcgatt gtgcatcagt gattacaaac ctaattttac 1440 aagactcgtt gatgaaatgg aatgtcacac atctacatca aaaaattaat acaaaagttt 1500 tatcaataaa gtatgtctat attccacaaa atagtaaaca caatgtatct atttaatata 1560 ttattattaa taaagacgtt aattaaattt ttatttgttt ttaatttaac ccatttttgg 1620 aagtcgatcg cgcataaata agaaaaatta aagtcgatca taccagtaat cagtttgccc 1680 acccctgwta tatatta 1697 // ID R1A_SS repbase; DNA; INV; 1535 BP. XX AC AF015820; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Scolopendra sp. retrotransposon R1 reverse transcriptase gene, DE partial sequence. XX KW R1A_SS. XX OS Scolopendra sp. WDB-1997 OC Eukaryota; Metazoa; Arthropoda; Myriapoda; Chilopoda; OC Pleurostigmophora; Scolopendromorpha; Scolopendridae; OC Scolopendra. XX RN [1] RP 1-1535 RA Burke D.W., Malik S.H. and Eickbush H.T.; RT "R1 and R2 Provide an Estimate of the Age and Stability of RT Retrotransposons."; RL Unpublished. XX RN [2] RP 1-1535 RA Burke D.W. and Eickbush H.T.; RT "R1A_SS."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015820; Positions 1 1535. XX SQ Sequence 1535 BP; 277 A; 364 C; 454 G; 440 T; 0 other; gccatggctg atgattcact cgttctggtg gctgctccct ccgttaaacg tattgaggaa 60 ttgtggatcg tctgtcggga agacctggaa aactgggcca ctgagaacaa cttggaattc 120 aacagggaga agactggact tcttttccac tcttggagac ctgttcttcg accgccagtc 180 ctacggtttg gtggtgttga tcttgttccc agtaccggtg tcaggtactt aggcatcatt 240 gtggatgtga aatttgactg gctggctcat gttcgctttt tagccggtaa agtcaagcgg 300 atggccaatc ggatttcagt ggttgtgggt gcgaattggg ggattaagcc tctcgttttg 360 cggactttgt atctgcgtgc cattttgcct atgcttctgt atggtgcacg gatttggggc 420 ctgagagcgg gaagagtttt catgcagaag agggtaaatg ccttgccccg cttcttgctg 480 ttgaatatcg ctaggactta ccgcacggtg tcgacggagg cgttgtatgt cctatgtggg 540 ttacctcttc tggctattgt cgctgaagag accgctctgc actatgctag actctctggt 600 ccggactatt gtcgtgatac tcctgttgct gaatggactc accctgccga ccgtgttgcc 660 ctaccggtgg gaacgtatga gaccatggat gccaccaggc ttcagctctt cactgacgct 720 cgcgacgtga gggcagaacc gcatcgggct tcgtcattta tgacaacggg gttcaaattg 780 atcactctag tgttcgcctg ggtgacggta attctgtctt tcagtctgag gttctagcta 840 tttggtttgc cgttatggcc attcggcggt tggcggtgga tgaggcttct cttttttctg 900 actctttgtc ggcccttaca gccatcagtg gcagcaaccc ggtggatgct ctggtcaggc 960 ggacgagggt ggccgttgac gaactgtgca ccacctgttc tcttcacctg ggttgggtca 1020 aggcccatgt cgggatcgcc gggaatgagg aggctgaccg tttcgctaaa ctcgctctgg 1080 acaaggatat tgagcatcgg aagttgctac caaggtcgca ctctagcctg attaagcaga 1140 ggactatctt ctttcagtgg caggccctgt gggaggggca caaggacggg cggccggtct 1200 tcgccctctt cccgaaggtc agctcgaggg gtgccttatt tggccggaaa atgatgcagg 1260 tcataaccga gcatggtaat tttgccaagt atctacaccg atttgctttg atggagtcac 1320 ctgcttgtcg ctgtgggggt ggtgaggagt ccgtggccca tatcctgcgt gagtgcgatc 1380 tgccttttcg tgctgctgca agatctcgtt tccttcgcga gatggttccg cgtgggctca 1440 cgtgggcgga tatggatggt tcctggtgcc gtgacgaatt tgctactcac tttgagcgtt 1500 ttgttaatga cgtcatcagt gatcctttca cctaa 1535 // ID Mariner-27_SM repbase; DNA; INV; 1905 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-27_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1905 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1876-1876 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 189..1706 FT /product="Mariner-27_SM_1p" FT /translation="MSKRKPKLSLDEKCKILDALEDNAKPKDLCKRYRVNK FT STISKIRTNKENLRNYTKEMIQSAKKIKRISAVKVPEVEKALYLWFLNERL FT NHKIVTDEILQIKALELHRNLNCDVLFMASHGWIQKFKRRHSIRLLKICGE FT KLSANHSAMPAFIEKFKRKLEELQILPEQIYNADETGLVFKNLNSKTLVSN FT GEKNAPGRKNSKERITVMGCVNATGRHKLPLMIIGRAKQPRCFKNIVLPNI FT HYRSSRNAWQTSELFKEWFHNVFVPEVQNHLAEKNLSPQAILLLDNASAHS FT LSTELISNDGNISVMFFPPNTTAILQPLDQGIIKCLKTSYRKKLLLSLVSS FT SSDTLEERLKKITLKDVVFMIVESWYAIKNDTIINAFKQLFGSDEELHFNL FT PLNHFLEEDNIPLIELYRRVLPDNNSDYDQIINWANGSNENIRSLITEDDI FT VVNPVFDAEDNNNVCSTQDIDEALRNVNNLLNWAESNLAIRDILNLRKIRE FT DVILKKITEN" XX SQ Sequence 1905 BP; 709 A; 290 C; 327 G; 579 T; 0 other; ccagtaaaga ttcaaaaaaa gaaaccccat aaaaagaaac ttttcacaaa aagcaactct 60 cggaaaaaga aatataatta tattcaattt caaaatttta ttcagtaaaa gcaaccttcg 120 caaactgcgg tttttttaaa attaatttgt caaatatatt atttaaaatt tagtatatcc 180 cagtcactat gagtaaaaga aaaccaaaat tatcgctgga tgaaaaatgt aaaattttgg 240 atgcattaga agataatgcg aaacccaaag atctttgcaa acgttatagg gtaaacaaaa 300 gtacaatttc gaagatacgt accaataagg aaaacttaag aaattacaca aaggagatga 360 tacagtctgc aaaaaaaatc aagagaattt ccgccgtcaa agttcctgaa gttgaaaagg 420 ctctttatct gtggtttttg aatgagcgac ttaatcataa gattgtcacc gatgaaatcc 480 tgcaaattaa agctctagaa ctacatagaa atttgaattg tgatgtttta tttatggcca 540 gtcatggttg gattcaaaaa tttaaaagaa ggcactcaat tcgacttctt aaaatttgtg 600 gtgaaaaatt atcagcaaat cattccgcta tgcctgcatt tattgaaaag ttcaaaagga 660 aactagagga gctgcaaatt ctgcccgaac agatttataa tgctgacgaa actgggctag 720 tatttaaaaa ccttaattca aaaacgttag tttcaaatgg agaaaagaac gcacctggta 780 gaaaaaatag taaggagagg attaccgtaa tggggtgtgt aaatgcgact ggccggcata 840 agttgcccct gatgattata ggtcgagcga aacagccacg atgttttaaa aatattgttt 900 tacctaacat acactataga tcatcaagaa atgcttggca aactagtgaa ctttttaagg 960 aatggttcca taacgttttt gtgcctgaag ttcagaatca tttggccgag aaaaacctat 1020 ctccgcaggc cattttgtta ttagacaatg cgagtgctca ttctttgagt accgaattaa 1080 tttcaaacga tggcaacatt tcagtcatgt tctttccgcc gaacactact gcgatactgc 1140 aaccactcga tcagggtatt ataaagtgtt tgaaaacttc ataccgcaaa aaattgttac 1200 tcagtttagt gtcatccagt agcgacacct tggaagaaag gttaaaaaag attaccctga 1260 aagatgtagt ttttatgata gtagagtcat ggtatgcaat taagaatgat acaattataa 1320 acgcttttaa acagttattt ggttcagatg aagaattaca ttttaatttg cctcttaatc 1380 atttcctgga agaagacaat ataccattaa tagaactata ccgccgcgta cttccagaca 1440 ataactcgga ttatgatcaa ataataaact gggctaatgg aagcaacgaa aatatcaggt 1500 cattaataac tgaagatgat atagttgtta atccggtttt cgatgctgaa gacaacaata 1560 atgtatgttc cactcaggac atagatgaag ctcttcgaaa tgttaataat ttgctaaatt 1620 gggccgagag taatttagct attcgggata ttttaaacct tcgaaagatt agagaagatg 1680 taatcctaaa gaagattaca gaaaattgat ttgttttgtt gatattaaat tgttctttaa 1740 aatatgttat aaacctataa ttataaactt ttgttcagca atttctgggt ttagtaaaaa 1800 aataagtaat tgcatactat attcataaaa agaaacttta ttcagaaaaa gaaatttttt 1860 atcaaaatat gtttcataaa agtttctttt tctgaatctt tactg 1905 // ID P-3_AP repbase; DNA; INV; 5564 BP. XX AC Contig17451; XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 4) XX DE P-like DNA transposon. XX KW P; DNA transposon; Transposable Element; P-3_AP. XX NM P-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-5564 RA Jurka J.; RT "P-like DNA transposons from pea aphid."; RL Repbase Reports 9(8), 1799-1799 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(2752..3399,3372..3878,3865..4293) FT /product="P-3_AP_1p" FT /translation="MSQDELYCTLIFDEMKIKNYLESSKFLDVVEGFEDLG FT PKGRSNKLAGQAMVFMIRGLYSSWKMPICYFLPATAMKNNILSDLIVEIVH FT RLLNCGFFIKAVICDQGANNVSALKLLKVTKDKPFFEVDGRKMYSILDTPH FT LFKNFRNHFIKNNFKFQNEEVSFQDVRNVYNIDKNSTTSRSLLKITENHIN FT PGPFQMMSCKLAMQLFSNTMAATIKTQYNGCYDKNMTFVYTGELKSKTALH FT KANMIKFLNDLLDVLNSKSLYNSNRFKCAISDTRPQQLQFLEKARSTFETL FT EKMDVKNKKSKTTRPLCFDGMCWTINAIIMLYNEQKEIGFNFILTGRLNSD FT VIENTFSIFRQRGGYNRNPTTRIFRTRFRRNAKMSLMKPSSRNHLVGSSNC FT EPDDDTHLMTNKDNETRLLDMDSSFSSLSSVSMLGINNSLEEQEGLVNLEN FT CSNTYFAGYLAMKCLLKFSCSNCEKIMIKSGTILNQLEYLIFCRNYDSKTS FT KLHLKVPTTTITEFVISSQKILPKILEKNLIK" XX SQ Sequence 5564 BP; 2128 A; 742 C; 769 G; 1925 T; 0 other; gcataatatg tgataatata ttaatattaa tatattatta ttttctatat tatatttcat 60 gcaattaatt tgtcatcggt cctagagatc aaaatgttgt ggtttgaaaa ttgttttaat 120 taaatgtatt gctttaataa gttatcatcg tatttttata attttaattt taatttgaag 180 gaaataaagt ttattataga tgttttatat tatatcatat agttttaagc tgaccgttat 240 atattttatt aagataggtt tctcattaaa ttacaacgta aataatttac ataatattta 300 tttcacttat tttagtaaaa ttccagttga agacaattaa tcattgcaca ttcatttaaa 360 ttgatataat atattgcaac cttacactta tttagacatt ctacgtttta ataaacttaa 420 acaatcaaaa cataggttag ttataactta taataaataa taataataca aaattcatta 480 ttattaatac aaatatataa aaattaatag ctaatcttaa aaaaaatcat ttaattctta 540 tagtattcct attacaatat ttttataact atttttttta taaatacatt ttgtacattt 600 aataacaatc ccaatattac agatgttcaa taccataata tctagtgata aattgtaaac 660 gttatactga gataaaacga atatcggtcg gcgtcgccgc accgggtcgc gtatcttcag 720 tttacggtct gctttcgtgt tgtaagctca cggtcgagca aatagccgtc cagttaaaaa 780 ttgtgtaaaa ttaaattatt taaataaaaa aaaatcaaca aatttaaaat aaggcattcc 840 agtgtacgaa ttattaaaaa atagggaaac caaaatacca atattcacac tagatgccgc 900 aacaacagaa taaaaaaaaa cttaagtgat gaatatgagt ttatatatta tggacaaagt 960 atatacaaat aataaaagaa gtctaagcca tgaatgttat tatttacaat aatcataata 1020 taattttatt taatataata ttttaaatat tattaagaat ttttaatttt ttttttgatt 1080 tttcacttct aacatgtttt aaattttttg aaaaccaatt aaaatttcta aaaagtatac 1140 aaattatcaa atgattaatt aaaaattcta gatgtggaat acaatcttct tcaatactaa 1200 gtgaatgcat taaatttgtt ttaatttgtt cgccaacaga ttgagcgaat tttattttat 1260 gaggtttttt ttccaatatt ttagcaagta tttttttgag atgaaataac aaattctgtg 1320 atggtagttg tatagtccgc gacacgaaaa gtggccggtg agtaccgtga cccgtagcgg 1380 tttccctacc tgttagccat tggttctcaa cctataccgc atacagcgcg ggggtcgcag 1440 tgccgggcgc ccgcgcgtaa tgtgagccac tgaggcgaaa cggccgctag gttcgcgatg 1500 atcatcggcc acttttcatg tcgcggacta taggtacttt taaatgtaat tttgatgttt 1560 ttgagtcgta atttctacaa aaaattaaat attccaattg gtttggaata gtaccagtat 1620 atctatatat atatatgtat tagaccgaca ggtcactagc gtatgtcatg tacacttaat 1680 aaaaaaataa tcctggcaag gtcgctgtaa acgtctcaca cggtggctta cccagattcg 1740 gcgtggagct gaaaaacttt ggtgttaagg tcgccaaccc acagccgaga ttcggaaatc 1800 ggagcgtgtg ggatttttaa ataataataa tagaaattgg gaaaggcaat aataaaatat 1860 aattgatggg tgaattaaag gtaaatttat tttaaccgtt atggtaaaga ataatagtta 1920 attttatgac caagaaaagc ctacaccggc ataaaaactt ggccgcaact cacatctaat 1980 gaactatcac gtaggtatat ctgtgataat cattttttga tgaagattat acaaataatg 2040 aaaaattaag attgactaga aatgcagttc ctataaagta taaagacttt gaacatttac 2100 atgtatctac acccactaaa gtttaccaaa aaataggtgt tctaagtcca attgcaatag 2160 aaactatttc aaaaaatgga tctcctagtg tatatgttcc aacaccaaat aaaagtcatc 2220 tgactcagat aatattagat caacatcttc aactccagtt ttgacttcaa aaaccagaca 2280 atttattgaa gaatcttacg aaccaactgg cttaaaacct aaaatattat tctgccacaa 2340 caatgatagt aaaatatcaa aactaaagct tgctcttcaa cttaaaaaaa acaaatccgt 2400 aacaaaaatg cctctttatt aaagttaaaa aaaatttgaa aatacttaga gaaagtaaaa 2460 aaaataaaag tactttatta gattctttat actatccatc ttcagattct aaaacacttg 2520 taaaaatgca ggtattaaga cctaaatttt caaggaaaag attcacaaaa aatgaaaaaa 2580 attttgcttt aggtctattc tacaaatcac catctgctta caaattctta aaaaataaca 2640 agcaattaac tctaactggt ttatcaacca tccttcgctg gattggaagt tcaaaattca 2700 aaccaggatt taatgcagga atatttaaac agttaaaaaa aaagtgaatc gatgagtcaa 2760 gatgaacttt actgtacttt aatctttgat gaaatgaaaa ttaagaatta tcttgaatct 2820 tccaaatttt tggatgttgt tgagggattt gaagatttag gacctaaggg tcgatcaaat 2880 aaattagctg gccaagcaat ggtatttatg attcgtggat tatactcttc ttggaaaatg 2940 ccaatatgtt attttttacc agcaacagca atgaaaaata atatactaag tgatttgata 3000 gtagaaatag tacatcggct attaaattgt ggatttttca tcaaagctgt aatatgtgat 3060 caaggtgcca acaacgtatc agcacttaaa cttttaaaag ttacaaagga taaaccattt 3120 tttgaggttg atggaagaaa aatgtattca atacttgaca ctcctcatct gtttaaaaat 3180 tttagaaacc attttataaa aaataatttt aaatttcaaa atgaagaagt atctttccag 3240 gatgttagaa atgtctataa cattgataaa aacagtacta caagtaggtc attattgaaa 3300 ataacagaaa accacataaa ccctggacct tttcaaatga tgtcttgcaa attagcaatg 3360 caactgttta gcaatacaat ggctgctacg ataaaaacat gacgtttgtg tatacggggg 3420 agctgaaatc caaaactgct cttcacaaag caaacatgat aaagtttttg aatgaccttc 3480 ttgatgtttt aaatagtaaa agtttataca actcaaatcg atttaagtgt gcaatatctg 3540 atacaagacc acagcaactt caatttcttg aaaaagcaag atcaacattt gaaactttag 3600 aaaaaatgga tgttaaaaat aaaaagagta aaactacccg acctctatgt tttgatggta 3660 tgtgctggac aataaatgca attataatgc tgtataacga acaaaaagaa attggcttta 3720 attttatttt aactggacga ttgaattcag atgttataga aaacacattc tccatttttc 3780 gacaaagagg aggatataat cgaaacccta ctactagaat ttttcgtaca aggtttagac 3840 ggaatgccaa aatgagttta atgaaaccat ctagtaggta gttcaaattg tgaacctgat 3900 gatgataccc atttaatgac caataaagac aatgagacaa gattgcttga tatggacagt 3960 agcttttctt ctttatcatc ggtttcaatg ttgggtatca ataattcact ggaagaacaa 4020 gaaggtttag taaatttaga aaattgttca aacacttatt tcgctggata tttagccatg 4080 aagtgtttat tgaagttttc ctgttcaaat tgtgaaaaaa taatgataaa atctggtact 4140 attctaaacc aattggaata tttaattttt tgtagaaatt atgactcaaa aacatcaaaa 4200 ttacatttaa aagtacctac aactaccatc acagaatttg ttatttcatc tcaaaaaata 4260 cttcctaaaa tattggaaaa aaacctcata aaataaaaat cgctcaatct gttggcgaac 4320 aaattaaaac aaatttaatg cattcactta gtattgaaga agattgtatt ccacatctag 4380 aatttttaat taatcatttg ataatttgta aactttttag aaattttagt tggtgttcaa 4440 aaaatttaaa acatgttaga agtaaaaaat caaaaaaaaa aaattaaaaa ttaaaaataa 4500 tatttaaaat attatattaa ataaaattat attatgatta ttgtaagtaa gaacattgat 4560 ggtttagact tcttttatta tttgtatata ctttgtccat aatatataaa ctcatattca 4620 tcacttaagt ttttttttat tctgttgttg cggcatctag tgtgaatatt ggtattttgg 4680 tttccctatt ttttaataat tcgtacactg gaatgcctta ttttaaattt gttgatttta 4740 tttttattta aataatttaa ttttacacaa tttttaactg gacggctatt tgctcgaccg 4800 tgagcttaca acacgaaagc agaccgtaaa ctgaagataa cgcgacccgg tgcggcgacg 4860 ccgaccgata ttcgttttat ctcagtataa cgtttacaat ttatcactag atattatggt 4920 attgaatatc tgtattctgt aatattggga ttgttattaa atgtacaaaa tgtatttata 4980 aaaaaaatag ttataaaaat attgtaatac tataagaatt aaatgatttt tttttaagat 5040 tagctattaa tttttatata tttgtattaa taataatgaa ttttgtatta ttattattta 5100 ttataagtta taactaacct atgttttgat tgtttaagtt tattaaaacg tagaatgtct 5160 aaataattaa tgtgtaaggt tacaatatat tatatcaatt taaatgaatg tgcaatgatt 5220 aattgtcttc agctggaatt ttactaaaat aagtgaaata aatattatgt aaattattta 5280 cgttgtaatt taatgagaaa cttatcttaa taaaatatat aacggtcagc ttaaaactat 5340 atgatataat ataaaacatc tataataaac tttatttcct tcaaattaaa attaaaatta 5400 taaaaatacg atgataactt attaaagcaa tacatttaat taaaacaatt ttcaaaccac 5460 aacattttga tctctaggac cgatgacaaa ttaattgcat gaaatataat atagaaaata 5520 ataatattat attaatatta atataatatt atcacatatt atgc 5564 // ID PPSAT3 repbase; DNA; INV; 89 BP. XX AC K02941; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.pollicaris satellite, consensus sequence. XX KW SAT; Satellite; Simple Repeat; PPSAT3; Repetitive sequence. XX OS Pagurus pollicaris OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Anomura; OC Paguroidea; Paguridae; Pagurus. XX RN [1] RP 1-89 RA Fowler F.R. and Skinner M.D.; RT "Cryptic satellites rich in inverted repeats comprise 30222034f RT the genome of a hermit crab."; RL J. Biol. Chem 260, 1296-1303 (1985). XX DR GenBank; K02941; Positions 1 89. XX SQ Sequence 89 BP; 22 A; 20 C; 27 G; 20 T; 0 other; caggtccgga cctgtgagat ttcgccaaaa atttggggtt tcgacacggg aaatttttcg 60 ggccagaaaa gtcgacaggt ccggacctg 89 // ID CR1-37_HM repbase; DNA; INV; 4130 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-37_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4130 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1865-1865 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(347..1096,1014..2351,2324..4066) FT /product="CR1-37_HM_1p" FT /translation="IWIFLNKTIGRKLGAGSHSQSTKDIHNWCSQLSHAFL FT ELFERLQKLETTKIENDSNIQNLKTEIIKAANSSKSICDWSLIVKQGKQKK FT KPTEQLVATNVAINELEERKRRAKNVIIYGVAESANENIEMKSKDNDESKV FT KEIFKIIDQENVIPKFIRRLRSKTDKPGPILVELTEDFLRNKILGSAKKLR FT SNELHKNCYLSPDLTEAQRLQDFNLRTERNKMNQARTDNDPFWYGIRGNQI FT VKFKKKSNQIKQEQTMIPFGMASVAIKSSNLKRNQINKFPQDLVSDGIEVE FT KLSCLYLNATSLDNKLNEFKVVIEQYKPNVVAVTETWFNNNSVVNVEGFHL FT YRKDRLDGRRGGGVCLYIDNLIDSYELNDAGLNLCKLEQVWAVVYFGKDKY FT LLGCIYRPNDFIDMSEFHVVFSRAREFIDKNKFKDFLIMGDFNFPSISWAN FT GCVDVISNENGIEHKFSDILNENFFYQHINIPTFQLSDDQLINTLDLVFTT FT QSASVSLIDSKFVLGNIKRGHLIICFDFILADKAVSDYKSKQRFMFAKSKF FT DKISDLISNVDWIRCFSNKNVQEMFDELIYYTEVACNHFTPTKMTINSLKI FT RPAWVNASLKFLIRSKQNLRYKNCSCKWKDPVLKFEYKMICKKVEHEIKKV FT RQNFEYDLVKRAAKNPKLLYSYITNVQQLHKASVQLHKQSTSSCTATTINK FT LLYSYINNQRAVRDSIKALKGTDGETTQNPKYVADILNKNFQAAFVREDDG FT PLPYFQIRTNETFIMTTEDFKYEDVFLRLKNLKENKSSGVDNLHSAILKNC FT ASAFAIPLTYIFKESFETSKLPEQFRSANITPLYKKGEKTLAVNYRPISLT FT SIACKIMEGIMRCKLEIFLNEYKLIVKQQHGFVKSKSCTTNLLDTLDFIST FT SLENGKPVDVIFLIFVRTXAFDTVPHKRLLLKLAAYGISGLTIKWIEAFLK FT NRKQRVVLGENVSSWAEIISGVPQGSVIGPLLFVLFINDFPETLKNISMLY FT ADDAKIMNEMISYKSTVSLQADLDRAFKWTQEWLVKFNISKCMVMHYGINN FT RKTPLYINKQKLNVTESERDLGVIFSNNLKWKNQVISCVGKANQMLGMIRK FT CFVRLDIRLLRSLYVSFIRPLLEFAVPVWSPNQKGEIDLLERVQHRATRLI FT PSLKKINYENRLKALDLTTLTNRRKRGDMIQLFKIFNGFNKLETKRKFNFQ FT QTQTRGHRFKYVKEITKQAHRENFFFNRSANLWNSLPNDLVNSETVNSFKA FT GLDCWMSSSQANQLS*" XX SQ Sequence 4130 BP; 1563 A; 631 C; 707 G; 1226 T; 3 other; aatacagaag aaaaaaaaga aaaaaaaaat atttaaatag tttcttttta ttgtgttatt 60 ttagtaatta aagtttaatg agtttgatat tcatctaaaa gttacatttt ctcttttaaa 120 tttttactaa atttttacga aaaaaacaca aaaaacaaac wgttctatct tgtgttaata 180 agtatttgtg ttaataaggt acaatattca attcactacg agaggtaatc cactgttatt 240 tagggtggta gccctctcag cttacagttt cagacaggag gtaatccact gtttagggtg 300 gtagccctct tatcaagaga gtaagaataa tcaattataa taatgaattt ggattttctt 360 aaacaagaca attggccgaa aactgggtgc tggttcacat tcgcaatcaa ctaaagatat 420 tcataattgg tgtagccagc tttcacatgc ttttttggaa ctctttgaac gattgcaaaa 480 actagaaaca actaagattg aaaatgacag caacattcaa aatttaaaaa cagaaataat 540 caaagcagct aattcaagca agtctatctg tgattggtct cttattgtta aacagggtaa 600 acaaaaaaag aagccaacag aacaactagt agcaacaaat gtagccatta atgaattaga 660 ggagagaaag agaagagcaa aaaatgttat tatttatggt gtagctgaat ctgctaatga 720 aaacattgaa atgaaatcaa aagacaatga tgagtcaaaa gtaaaagaaa tttttaaaat 780 tatagatcaa gagaatgtta taccaaaatt tattagacgg ttgagatcta aaactgataa 840 accaggccca attctagttg aattaactga ggattttctt cgcaataaaa ttttaggatc 900 agctaagaaa ttgagaagta atgagctaca taaaaattgt tatcttagtc cagacttaac 960 ggaagcacaa aggttacaag atttcaatct tagaactgaa agaaataaaa tgaatcaagc 1020 aagaacagac aatgatccct tttggtatgg catccgtggc aatcaaatcg tcaaatttaa 1080 aaagaaatca aatcaataag tttcctcaag atttggtttc agatggtatt gaagttgaaa 1140 agctgtcatg tttatattta aatgctacct ctctcgataa caaattaaat gaatttaaag 1200 ttgtgatcga acaatacaaa ccaaatgtag ttgctgtgac tgagacttgg ttcaataaca 1260 attcagtggt aaatgttgag ggttttcatt tgtatagaaa agacagatta gatggcagac 1320 gaggcggtgg agtgtgttta tatatagaca atttaataga ctcatatgaa ctaaatgatg 1380 caggtctaaa cctttgcaaa cttgaacaag tgtgggcagt agtttacttt ggtaaagaca 1440 agtatctgct tggctgcatc tacaggccaa atgattttat agacatgtca gaatttcatg 1500 tggtattcag tcgtgcacgt gagtttattg ataaaaataa gttcaaagat tttctaatca 1560 tgggtgactt caattttcca tctattagct gggcaaatgg ctgtgtagat gtaatttcaa 1620 atgagaatgg aattgaacac aaattttcag atattttaaa tgaaaatttc ttttaccagc 1680 atatcaacat accaactttc cagctgtcag acgatcaact tataaatacc ttagatttag 1740 ttttcacaac tcagtcagca agtgttagtt taatagactc aaagttcgtt ctgggaaata 1800 taaagagagg tcatttaata atatgttttg attttatctt agctgataaa gcagtcagtg 1860 actataaaag caagcaaaga tttatgtttg ccaaatctaa gtttgacaaa atatctgatc 1920 taatttcaaa tgttgactgg ataagatgct ttagcaataa aaacgttcag gaaatgtttg 1980 atgaacttat ttattacaca gaagtggcat gtaatcattt cactcctact aaaatgacaa 2040 tcaactctct aaaaatcaga ccagcctggg taaatgccag tttaaaattt cttatcagaa 2100 gcaagcaaaa tttaagatac aaaaattgct cgtgtaaatg gaaagaccct gttctaaaat 2160 ttgaatacaa aatgatttgc aaaaaagtag agcatgagat caaaaaagtt agacaaaact 2220 ttgaatatga tctggtaaaa agagctgcca agaatccaaa gctcctgtac agctacataa 2280 ccaatgtaca acaactacat aaagcttctg tacagctaca taaacaatca acaagctcct 2340 gtacagctac ataaacaatc aacgagcagt cagagattca atcaaagcat taaaaggaac 2400 ggatggtgaa accactcaaa atccaaaata tgttgcggat attttaaaca aaaactttca 2460 agcagcattt gtgagagaag acgatggtcc gctgccttac tttcagataa gaacaaatga 2520 aacgtttatt atgacgacag aagattttaa atatgaagat gtctttttaa gattaaaaaa 2580 cttgaaagaa aacaaatcaa gtggtgttga taacttacat tcagctattt taaaaaactg 2640 tgcatcagca tttgctattc cacttacata tatttttaaa gaatcgtttg aaactagcaa 2700 attgccagag caattcagat cagcaaatat tactcctctg tacaaaaaag gtgaaaaaac 2760 cttagcagta aactaccgac cgatatcact cacctcaatt gcttgtaaaa taatggaagg 2820 tataatgaga tgtaaacttg aaatttttct gaatgagtat aaactcatag tcaagcagca 2880 gcacggtttt gttaaaagta aatcatgcac aacaaatcta cttgatacac tagattttat 2940 ttccaccagt ttagaaaatg gtaaacctgt tgatgttatt tttttgattt ttgttaggac 3000 twkggcattt gacacagtgc cccataaaag gctgttacta aaactagcag cttatggaat 3060 atctggtcta acaattaaat ggattgaagc ttttctgaaa aatagaaaac aaagagttgt 3120 gctaggcgaa aacgtatctt cttgggcaga aattatcagc ggtgtgccac aggggtccgt 3180 aattggacca cttttgttcg tgctatttat taacgatttt cctgaaacct taaaaaacat 3240 atctatgctg tacgctgatg acgctaaaat aatgaatgaa atgatttcct acaaatcaac 3300 agtgtctttg caagcagatc ttgatagagc ttttaagtgg actcaagagt ggcttgtaaa 3360 gttcaatatt tctaaatgta tggtcatgca ctatgggatt aataacagaa aaacaccact 3420 ttatatcaat aaacaaaaat taaacgtaac agaatctgaa agagatctag gagtaatttt 3480 ctcaaacaac ctaaaatgga aaaatcaagt gatatcatgc gtaggcaaag ctaatcaaat 3540 gttgggtatg attagaaaat gctttgttcg tttagatatt agactactga gatcacttta 3600 tgtatctttt atcaggccac tattagaatt tgcagtaccc gtttggtcac caaatcaaaa 3660 gggtgaaatt gacttactgg aaagagttca acatcgtgca actcgactaa taccatcact 3720 caagaaaatc aactatgaga atcgtctaaa agctcttgac ttaactacgt taacaaatag 3780 aagaaagaga ggagacatga ttcaactttt taagatcttt aacggtttca acaaattaga 3840 aacaaaaaga aagtttaatt ttcaacaaac tcaaacaaga ggtcaccggt ttaaatatgt 3900 aaaagaaatc actaagcaag ctcatcgaga aaactttttc tttaacagat cagccaattt 3960 atggaatagc ttgccgaacg atttggttaa ttcagaaaca gttaacagtt ttaaagccgg 4020 tcttgattgc tggatgagca gcagtcaggc aaatcagctg tcatagtgtg ctaataacac 4080 actcactggt tcataccagt tacagcattt aataataata ataataataa 4130 // ID C6_TC repbase; DNA; INV; 1433 BP. XX AC U16295; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE C6 interspersed DNA element. XX KW Transposable Element; C6_TC; Interspersed repeat. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-1433 RA Araya E.J., Cano M., Novak E., Guevara P., Ramirez J. RA and Franco da Silveira J.; RT "Characterization of a reiterated family of interspersed DNA RT elements in the genome of Trypanosoma cruzi."; RL Unpublished. XX RN [2] RP 1-1433 RA Araya E.J.; RT "C6_TC."; RL Direct Submission to Genbank (25-OCT-1994)Jorge E. Araya, Escola RL Paulista de Medicina, Microbiology Immunology and Parasitology, RL R. Botucatu. XX DR GenBank; U16295; Positions 1 1433. XX SQ Sequence 1433 BP; 341 A; 358 C; 392 G; 342 T; 0 other; gaattccaac cgctgatgcg catttgttac gaaaggcagg aattatcgag gacgcttcgt 60 ccagatacgg gcggatggat taataccttt ctcagttgtg gaggagaaaa ccaccggttt 120 acgacgacga cgatggatcg cgtggccacg cgacaagaac agagacgccc ttacgaggcg 180 aatgttcctc ttttacatat tccattatta ccgcctgtga tggcgaggcg cttctcttga 240 ttaagcatcc cttttttaag tctctttacg cggagactcg gatctctttc gatgccacgt 300 ggagtacggc tgcgctggtg gagtcgacac ggctcccaat ggatacaagg ccagccagaa 360 attctccagt tattacttcc gacaattgcg ggggtgacaa cggtggttcg cccctctggg 420 ccgcatcacc aatggtcgat cgacgtatga tcatatatac gcataacacg ggtcaaaaag 480 cgatgtgata tcgtgggagg cccaagtgct ttgcaatgcg gacagttgtc acgcatccat 540 gggggaagaa cgcgaatcgg gcgctacaca gtacaccttc ttggggtgca gtttgatcac 600 acacaccggg cggtatccct gagtgacaag tttgtccgct ctgtacgcgc catgccggcg 660 ttaattcttt gaccatcgcg gaaatggaga ttatgcgtca cgctttttgt acgcggctgc 720 cattttgggc acgcgtttat gtgactacta cttttttatt aaggcagtgc gacgacgatt 780 gtccgcactt aaccggggat tgtgcaggag acatccccgg cgaacctacg cctacagcgg 840 ttggtttggg cgagagattg cgacacacca tcgacgataa tcgtaagcga aacagtcaag 900 cccacggaga aggcatcggc tgccatcatc acacggcacg catcgctcca tgatgggagc 960 cgtttttatt ccagacttcc ggcgacgtta aaattgccgg agaaaaatgg gagaggaagc 1020 cttttcttat catgcaggcc gaggcacgtg cggtacgctt agccttatcg gccttttccg 1080 ccattttgcc atccaccatg ggcgtttggg tggacaatac ttcgctgcaa ggagcggcga 1140 ataaaggcag ctcaaaatca cacgcgttga cgtgggagct gcaacggata tacgagtttt 1200 tggactctcg cggaatacag gcaacatttg cctacgtgcg gtctgcagaa aaccccgcag 1260 acgatatcac gcggtcgtgt ttttacactt cagacttggc gaaggggtgg aacttccgaa 1320 ggggagcggc ggggtcttgt ggttgtagga ccccaaagtc tgccacttcg taagtaataa 1380 tattttcaaa tcctaactga ggacaaggac catgctaatg gtccacggaa ttc 1433 // ID Gypsy-74_AA-I repbase; DNA; INV; 5007 BP. XX AC supercont1.281; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-74_AA_; KW Gypsy-74_AA-LTR; Gypsy-74_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5007 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.281; Positions 681308 676302. XX CC Positions [4144-4485] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 371..4021 FT /product="Gypsy-74_AA-I_1p" FT /translation="MDQPRMPLLVPRVPVPNIVGSTPVRRTPQSSGPVQQQ FT AYSTQGGLDQRILAPVAPAPSAPNEQGFGDNSEGKVPELHQHPLSEDLQQS FT SVGQTLSNLVGALDALNFKLSSMERTQQAQQYQLQQMQQNLVPSQAEPPPA FT AASRSNGFEHWSDLSIFHQPRVWPGNVAPLTMPNANNGNQWPTVPPAMLHR FT NNAPPCVGQHAGQHTGQNLCANPTFNSVPHSTRAENVGFTASRTFDVPKPL FT FKGDLEQSHPVEFLQDVDRYVDSVRLEQNCKLPFALSCLEGEARTWARGFG FT YLLVNYDQFRFHFLQQYWGQRAQRLVREEIMYGNYTIKTPSRMAEYFLALI FT TKARHLDTAPSELELVLHLTQHFPRSVGARLSNCSDIQSAYTLLQTEDHHH FT NTYNRNVSRARAEVNQSSTMASANAGNRPWRGQQNNNNDGQWNRNVRATVA FT ELEDEGGAEVHSVANIFVGSDELLAEYEVSGQPKHEKSPLIEARIDSLVKL FT VLLDSGSELSCIDKALYHDLKSRGIPLGEFPVQNTTIQGAYGKKKMTISNQ FT VFVSVYIQNETIDVCLAVVEELCSPLIFGMDTLHYLKAQMDFQRRDVCLTL FT NGREVVLPFCADDSSRPQRSAQLCRIEVGEVCEPPAKDSTNSSPLDGDFKR FT SDEEQAKLDLLLEEFADIFSDVPGLTTEYEHEIRVTDDVGFNQKQYPIPYR FT YVNKVREQIDKMEMWGIISRESTAYINPLVVTMKKSGDVRVCLDARRLNAV FT MEKDNEKPSDVQQIVQKFHDTRIFSLIDLTSSYWQIPIKPSHRKFTGFLFE FT TRSYVFNVLPFGLCTAVASFSRAMDVILGPEILEFIEKYLDDLLVKSSLFN FT EHLEHLRKLFTRLRVAGLTISASKSEFCKIKLKFLGHIVGQDGVSIDPDKV FT SAIQQFPEPSDQKSMKSFLGLASYVSRFTPKYASVAKPLYALLRADAWWRW FT SDEERQAFSAVKQLILRHTTLRYPLLDKEFCVQTDSSYQGLGAMLFQWDDQ FT NDRMVISYASRLLQAAERNYTATELEALAVVWSLNKWRQYLLGTQFTVYTD FT HKALIFLKQCQLLNGRLTRWILFLQQFNFDIQHCRGKDNEIADALSRYPEG FT STPTSLRSSERQFVLARLTAEEKGFRQLMRQLPVQQQTDEALLPLYLISTD FT GVHRSGTREFTVVDDVLFVKNNPKAGWKNVIPRKMIGVVLRFYHEQSGHYG FT LQKRSKQ" XX SQ Sequence 5007 BP; 1463 A; 1134 C; 1178 G; 1232 T; 0 other; tagcgtgggg gctcaaccgg gattgatatt atctcccgat gccgtatatg agatgagatg 60 atatgacaaa gaaaattaac cttaattcga aaagacaata gtggcgacat actacacgaa 120 acatgagggt gcgcttctaa tagttcgcaa aagagatgta ttacctctga agaaaattag 180 ttatatcgag atgccgatgg ctgtatattt tacacggatc agttccagat tattatcaaa 240 ttatcaaggc atgcgaaagc cttgaatagg tgagtagcgt tcctgtttct atacataagt 300 ggtttggacc tacctggtct tattattccc caggaccctc atctcacaca ccgacggatc 360 atctaccgtc atggaccaac caaggatgcc gttactggtg cctcgagttc cagtccctaa 420 tatcgtggga tctactccgg tccgaagaac acctcaatca tcaggtccgg tgcaacagca 480 agcatattca acccagggag gactcgacca gagaatcctt gccccagttg ccccagcacc 540 atcagcaccg aatgaacagg gatttggtga taactcggag ggcaaggttc cagaattaca 600 tcaacacccc ctctctgaag atcttcagca gagcagcgtt gggcagacac tgtctaatct 660 agtgggggcc ctagatgcgc tcaactttaa gctgagttca atggagcgaa cacagcaagc 720 tcagcagtac cagttacaac agatgcaaca aaatttggta ccatcacagg cagagccgcc 780 accagcagcc gcttctcgtt cgaatggatt cgaacattgg agtgatttga gcattttcca 840 tcaacccaga gtatggcccg gtaatgtcgc gccattgacg atgccaaacg ccaataacgg 900 aaatcaatgg ccaacagtac ctccagcgat gcttcacaga aacaatgcac caccgtgcgt 960 cggtcagcat gccggtcagc ataccggtca aaatttgtgt gcgaatccga cattcaactc 1020 cgtacctcac agtaccagag ctgaaaacgt tggtttcacg gcctctagaa cctttgatgt 1080 cccaaaacca ctcttcaagg gtgatttaga acaaagtcat cccgtcgaat tcctgcaaga 1140 tgtcgatcgt tatgtagatt cggtcaggtt ggagcagaat tgcaaactac cattcgccct 1200 ttcctgcctg gaaggagaag ctagaacatg ggccaggggt tttggttacc tgctagtaaa 1260 ttacgaccag ttccgcttcc atttcctcca acagtattgg ggacagcgtg cacagaggct 1320 agttcgagaa gaaattatgt atggaaacta tacaataaaa actccatcca gaatggccga 1380 atattttttg gcgctgatca ccaaagcacg tcacctggat accgctccat cagagttaga 1440 gctagttttg catctaacgc aacactttcc tcgtagtgtc ggggcacgac tgagtaattg 1500 ctctgacata caatcggctt ataccctact acaaactgaa gatcatcacc acaacactta 1560 caatcgaaac gtgtctagag caagagccga agtaaatcag tccagtacga tggccagtgc 1620 gaatgccggc aaccggcctt ggagaggtca gcagaataac aacaatgatg gacaatggaa 1680 tcgcaacgta agagctacgg tagccgaact agaagacgag ggaggggcgg aagttcattc 1740 cgttgcgaat atctttgttg gttcggatga acttttagcc gaatacgaag tgagtggtca 1800 gcctaagcac gagaaatcac ctctgatcga agcccgtatc gacagtttag tgaagctagt 1860 cttgctagac agtggtagcg aacttagctg catcgacaaa gccctgtatc acgatttgaa 1920 gtccagagga attccactgg gggagtttcc tgttcagaat accaccatac agggcgccta 1980 tgggaaaaag aaaatgacaa tttcaaatca agtattcgtt tctgtttata ttcagaacga 2040 aaccatcgat gtttgcctag cagttgtaga ggaactttgc agcccattga tatttggcat 2100 ggacaccttg cattatttga aagcccagat ggattttcaa cgccgcgacg tctgtttgac 2160 tcttaatggc agggaagttg tgctgccgtt ctgtgcggat gattcaagtc gacctcagcg 2220 atctgcacaa ttatgccgta ttgaagttgg agaagtttgt gaaccgccag cgaaggattc 2280 aaccaacagc tctccactgg acggagattt caaacgaagt gatgaggagc aagctaaact 2340 tgatttgctg ttggaggaat ttgctgatat attctccgat gtgcccggtc ttacaacgga 2400 atacgaacat gaaatccgcg taacggatga cgttgggttc aaccaaaagc aatatccaat 2460 cccctaccgc tacgtcaaca aagtccgtga acaaatcgat aaaatggaaa tgtggggaat 2520 catctctcga gaatcaacag cctatattaa tccattggtg gtcactatga aaaagtcggg 2580 agatgttcga gtgtgtttgg acgcccgccg tctgaacgcc gttatggaga aggacaacga 2640 gaaaccctca gatgtgcagc agatagtgca aaaatttcac gacactcgta tcttctcctt 2700 aatagatctt accagctcgt actggcaaat accaatcaaa ccaagccacc ggaagttcac 2760 tgggtttctg tttgaaacaa ggtcgtacgt attcaatgtc cttccatttg gattatgtac 2820 agcagtggcc agtttttccc gtgcaatgga tgtcatattg ggaccagaga ttctagagtt 2880 tatcgaaaaa tatttggatg accttcttgt gaaatctagt ttattcaacg agcacctaga 2940 acatctccga aaattattta ctcgacttcg ggttgctggg ttgactatca gcgcatcgaa 3000 aagtgaattc tgcaaaataa aactcaaatt tctggggcat attgttggac aagatggtgt 3060 tagcatcgac ccggataaag tgtcagccat tcaacagttc cccgaaccca gtgatcagaa 3120 atccatgaaa tcctttctgg gactagcaag ttatgtgtca agatttacgc ctaaatatgc 3180 gtctgttgca aaaccactct atgctcttct ccgagctgat gcatggtgga gatggtccga 3240 tgaagagcga caggcattta gcgcagtgaa gcagctaatt ttacgccaca ctactcttcg 3300 ttatcccctc ctggataagg agttttgcgt ccaaacggat agttcgtacc aaggactagg 3360 ggccatgtta ttccaatggg acgaccagaa tgaccgaatg gttatatcgt atgctagtcg 3420 gctattgcaa gcagcggaac gaaactacac tgctaccgaa ctggaggcgt tggcggtggt 3480 gtggtcgctg aacaaatggc gacaatactt gttggggacc cagtttacag tttacaccga 3540 ccacaaagcc cttatcttcc tgaaacaatg tcaactgctc aacggccggc taacaagatg 3600 gatcctcttc ctccagcagt ttaattttga cattcagcat tgccggggaa aagataacga 3660 aatagccgat gcgttatcac gctatcctga aggaagcacg cctacgtctt tgcgttcatc 3720 agaaagacag ttcgttcttg ccagacttac tgcagaggaa aaagggttcc gtcaactgat 3780 gagacaacta ccagtacaac agcagacgga tgaagccttg ttacctctat acttgatctc 3840 tacggatgga gtccatcgtt caggaacaag agagtttacg gtggttgatg atgtgctttt 3900 tgtgaaaaat aatccaaagg ccggttggaa aaatgttata ccaaggaaaa tgataggagt 3960 agttttgcgc ttctatcatg agcaatccgg tcactatggg ctacaaaaac ggtccaagca 4020 atgaataagg tagtgtactg gacaggcatg agagcagatg ctaaggctta cgtacgtggt 4080 tgctttatat gccaacgcac aaaacccatg aactctcggc tacacggtac ccgacgtagc 4140 ataattccag atgggccaaa taaactgttg tcggtggact tatttggtcc acttccacca 4200 ggaccggccg gcgtacgtta tgtatttata atgattgatg tattcacaaa gtacgttact 4260 ttagacgcag tgaaaaagcc aacagcctac gtactttggg gaaagctgga gagaagaatg 4320 caagagcttg gaaaaccgtt ggcgatactt tgcgatcaag gtactcagtt caccgcaaaa 4380 tattgggtaa ggatgttgaa aagtagcaac attcacttag tgtatacatc ggtgcggcac 4440 cctcaagcca acccggtaga gcgaatcatg agagagttat ctagataatg ccgagcatat 4500 tgccgagtca atcatcggtt gtgggcaaag aatctacacc agttcagttg ttggatcaat 4560 tgtgcctatc atgaatccac tggctcaacc ccttatgaac tacaatttgg aaaatcggca 4620 aaggatgctc tgcaaaatct gtttagtttc cctccttcga aacggtcacc cgtcaattac 4680 gaccagatcc gaatcatcct gcagaagaaa gcggacaatc gaaatgagcg ggcgaagaag 4740 caaacaaaac gatttgtcgc tggagatctg gtattactga aggcaaatcc aatgtcatcg 4800 gaagcagacg caataatcaa aaagtttctg gatgtttacg agggaccata tgagataaag 4860 gatgtggtgc atgacgatgt gtttatgcta caccacaagg aatctggaaa agcgaggggc 4920 atgtttcaca tcaaccttct gaagccgttt gtacagtcgt ggcagccgca aacttaatta 4980 aacactgcgt ttgctggagg ggggagc 5007 // ID L1-45_AAe repbase; DNA; INV; 4783 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-45_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4783 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1398-1398 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 167..1390 FT /product="L1-45_AAe_1p" FT /translation="MTAVRARENTFKVDLSNFPKRPSFEEIHGFIHETIGL FT TVDQVLRLQMNHAQNCAHVKCRDLKTAQDVVDRHNNRHEFEVNKTKIKVRL FT AMDDGGVEVKIHDLSENVRNEDISSYLTQFGDVVSIKEQFWGENFAFKGVS FT SGVRVAKVILRRHIKSFVTICGEETLISYRNQPQWCKHCTNPSHPGMTCVE FT NKKLLGQKIDLNNRLKAAQNKSSYASVLHQPASVASLMPEFVGTNLNQLNE FT AARSSRAQTVADEAAASSSASVSTPSKYQERMDENEGMVNVVNTVESASTA FT AAAVVADATAAPDVNAVAAAAVVAVPVVADGSSVDQSTQQDXQTRPDANVT FT APCHVSAFKIPTNPIPSNPTSMMISESESNESSTESGPFTKVKRARGRPKK FT QKLDVTLPMYVDSA" FT CDS 1530..4712 FT /product="L1-45_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNHPIRSYNIGSININAVSNQNKIASLRSFVRLVDLD FT IILLQEVESANMCIPGYNVITNIDENKRGTAIALKAHIPYSNVQRSLNSRI FT LTVKVSDSVTICNVYAHSGTQNFSAREKLFREQLPFYLQNSAEHMIVGGDF FT NCVIAAKDATGTSNFSRSLKQLVDNLHLSDTWDCLHGSSIDYSFVRPNSAS FT RLDRIYVSRSLVPHLRTSELFVTSFSDHKAYKIRCCLPNLGKPHGRGYWSI FT RAHVLNEENLDEFRQKWNRWLRERRNYNSWMSWWIECAKPKIRSFFRWKTN FT ESFREFHATNELLYRRLRDAYSELYHNPNGMTKVNEIKAKMLNVQSRFSKA FT YERLNDRFVCGEKVSVFQIAERYRKKTVISSIHHHEHCLTDVSEIETHVYE FT FFKDLYSAEELNNGENFPTNRVIPPDSNTNTVMMNEITTAEIFFAIKESAS FT RKSPGNDGIPKEFFLKAFDIIHPQLNLIMNEALRGNIPEKFVEGVIVLCKK FT KSNDNTVKGYRPISLLNYDYKLLSRILKQRLENVMIENNLLTPSQKCSNSE FT HNIFEAIHSIKDRIAELNCRRVSGKLISFDLDHAFDRVSKDYLLVVMRNFR FT LNPNFVELLGKIMSASNSRLLINGNLTPKFPIQRSVRQGDPLSMHLFVLYL FT HPLLVKLRTICNHPLDLVVAYADDISIVITDMRKLNLIKQAFLDFGRCSGS FT LLNFDKSFAVNIGTQHNNDAVWPRVRESIKILGVTFFNSQKQTIDFNWGEV FT IRNTSRLMWLFKPRNLTIQQKVVVLNAFVTSKLWFLASVFSIPNQAIARIT FT SHIGNFIWERYSLRIPIEQLTLPKAMGGLNLHLPMHKCKALLITRFIKDQQ FT HTPFARSFNHCLQNPPNVAAIPALYPCLKSIAKELPYVPGQLRENPSTSAL FT HNYYREKLRRPKIVEEYARANWKRIWRNIRTKGLTSLEQSTYYLLVNGKIP FT HAELLYRQNRAQSPMCHQCPNSVEDLEHKFTTCLRTNALWNYLQPRLEATL FT GRRIGFNDLRLPELGNVASDRKYRALKLFIGYVNFVLDANNDLSVQALEFV FT LNCL" XX SQ Sequence 4783 BP; 1410 A; 1091 C; 992 G; 1288 T; 2 other; cagttaggtt caagctatcg ctgcaaccag acgtgttttt caatcttact ccgtatcggt 60 tttttttcgt ctcgcttctt gtctcccact gcggtgcgga gaacgttttc tagcgtcaga 120 ttttctggcg acaataagta ttgtgctatc caacgcgtta gcaaccatga ccgccgtgcg 180 tgcccgcgag aataccttca aggtagactt gtcaaacttt ccgaagcgtc cttctttcga 240 agaaatccat ggatttatcc atgagacgat tggtctcacc gtggaccaag tgctgcgcct 300 gcagatgaac cacgcacaaa attgtgcgca tgttaagtgc cgtgacctga aaactgctca 360 ggatgtggtt gaccgtcaca acaaccgaca cgagttcgaa gtcaacaaaa ccaaaatcaa 420 agttcgcttg gcaatggatg atggaggcgt ggaggtaaaa atccacgact tgtccgaaaa 480 tgttcggaat gaggacattt cgtcatacct gacgcaattt ggagacgttg tttcaatcaa 540 ggaacaattt tggggagaaa atttcgcctt caaaggcgtg tcatccggag ttcgagtggc 600 aaaagtaatc ctgcgacgtc acatcaaatc gtttgtgacc atttgtggcg aggagaccct 660 gatttcttac cggaatcagc ctcaatggtg caagcattgc acgaacccat cacatccagg 720 gatgacatgt gtcgagaaca aaaagttact cggacaaaag atcgacctca acaacaggct 780 gaaagcagcg caaaacaagt cgagctacgc cagcgtgcta caccagccgg caagtgtagc 840 ctcgcttatg cctgagtttg ttggtaccaa tctaaaccaa ctgaacgaag ccgctcgctc 900 atctcgagcg caaacagtag ctgatgaagc tgctgcgtcc tccagcgcaa gcgtctccac 960 accctccaaa taccaagaac gaatggacga gaacgaagga atggtaaacg tggtaaacac 1020 cgtcgaatcc gcttctactg ctgctgctgc tgttgttgct gatgctaccg ctgcccccga 1080 tgtcaatgct gttgctgctg ctgctgtcgt tgctgtcccc gttgttgctg atggttcatc 1140 ggtggatcaa tccacccaac aggatgmgca gactcgccct gatgccaacg ttactgcgcc 1200 ctgtcatgtg agtgcgttta agattcccac aaatcccatc ccatccaatc ccacttccat 1260 gatgatttcc gaaagcgaaa gtaacgaatc gtccaccgaa agtggaccat tcacaaaggt 1320 caaacgtgca agggggcgtc cgaaaaagca aaaattggac gtcactttac ctatgtatgt 1380 cgattctgcc taaaaatact aacgatgttc tgaaagttat ccgatggacc taaactaaat 1440 actaatcaga tgcaccttaa tcccatccaa attattaacc tcgagtcagc cgcgagtatg 1500 ttcggctccc cgtcttgtaa cacgatatta tgaaccaccc tatccgaagc tacaatattg 1560 gttcgatcaa tataaacgct gtctccaacc aaaacaaaat tgcttcgctc cgctcattcg 1620 ttcgtttggt ggatttagac attattcttc tacaagaggt tgaatctgca aacatgtgta 1680 tccctggtta taacgttatc accaatatag acgagaataa aagaggtacc gcaatcgctc 1740 taaaagccca tataccctac tccaatgtcc agagaagtct gaatagccgt atcttaacag 1800 ttaaggtttc tgattccgtt acaatctgta atgtttatgc gcattccggc acccagaact 1860 tttctgctcg tgaaaaacta ttcagagaac aactaccctt ttatctgcaa aattctgctg 1920 aacatatgat agtaggcggt gattttaact gcgtgatcgc tgccaaggat gccacaggta 1980 caagcaattt tagcaggtcc cttaagcaat tagtagataa tctgcatctt tctgatacgt 2040 gggactgtct ccatggaagc tctattgatt acagttttgt tcgcccaaac tctgcctctc 2100 gtttagatcg aatatacgtt tcaaggtcgc ttgttccgca tctccgcacc tccgaattgt 2160 ttgttacgtc cttttcagat cataaagcat acaagattcg ttgctgtctt ccaaatctgg 2220 gtaaaccaca cgggagagga tattggtcca tccgtgcgca tgttctcaat gaggaaaatt 2280 tggatgagtt tcgacaaaag tggaatcgtt ggttgcgcga acgccgaaac tataacagtt 2340 ggatgtcgtg gtggattgaa tgtgcaaagc caaaaattcg tagctttttc agatggaaaa 2400 ctaatgaatc gttccgcgag tttcatgcga caaatgaact gttgtatcgt cgactacgag 2460 atgcgtatag tgaactgtat cataatccga atggaatgac aaaagttaac gaaatcaaag 2520 caaaaatgtt aaacgttcaa agccggtttt ccaaagcata tgaacgtttg aatgatcgat 2580 ttgtgtgcgg ggaaaaagtt tctgtgtttc aaatagctga acgttatcgg aagaaaactg 2640 tgatcagttc catccatcat catgaacatt gtctaaccga tgtttctgaa atagaaactc 2700 atgtgtacga gttctttaaa gatctgtatt ctgcagaaga actcaataat ggtgaaaatt 2760 ttccaaccaa tcgagtgatc cctcctgatt caaatacaaa tactgtcatg atgaacgaaa 2820 tcacaaccgc ggaaattttt ttcgcgatta aagaaagtgc ttctcgaaaa tcccccggaa 2880 acgatgggat acccaaagag ttctttttga aggctttcga tatcattcat ccccaactca 2940 acctaatcat gaacgaagct ttaagaggaa atatccctga gaaatttgtt gagggagtta 3000 tcgttttgtg taaaaagaaa tcaaatgaca atacggtaaa aggatatagg ccaataagtc 3060 tcttgaacta tgactacaag ctactatcaa gaatacttaa acagcgtctg gaaaatgtta 3120 tgatcgagaa taatcttctw acgcctagtc aaaaatgttc aaattctgaa cataatattt 3180 ttgaagcaat tcattcgatc aaagatcgaa ttgcagagct taattgccgc cgtgtttccg 3240 gaaaattgat ttcgtttgac ctcgatcatg cttttgatcg cgtcagtaaa gattatcttt 3300 tggttgtgat gcggaacttt cgtttgaatc ctaatttcgt agaactctta gggaagatta 3360 tgtccgcttc aaactctcgc ttgctcatca atggaaatct aactcccaag ttccccatcc 3420 aacgctccgt ccggcaagga gaccctttga gtatgcatct ttttgtcctt tacctccatc 3480 ccctcctggt taagttacgc accatatgca atcatccgct agatttggta gtagcgtatg 3540 ctgacgacat atcgatcgtg ataactgaca tgcgcaaatt gaacctcatt aagcaagctt 3600 ttctggactt cggccggtgt tcagggtcgc tcctgaattt cgacaaatct tttgctgtta 3660 acatcgggac ccaacataac aatgacgctg tatggccacg agtgcgtgaa tctatcaaga 3720 tactaggagt aactttcttc aactcgcaaa agcaaacaat tgatttcaac tggggcgaag 3780 tcataaggaa tacgtctcgg ttgatgtggt tgttcaagcc aagaaatctc accatacagc 3840 agaaggttgt ggtgcttaac gctttcgtca catcgaagct atggtttcta gcatctgtgt 3900 tcagcattcc gaatcaagct atagcacgca ttacttcgca tataggaaac ttcatctggg 3960 aacgatattc tctcaggatt ccaattgaac agctcaccct gcctaaagct atgggtggtc 4020 ttaacttgca tctcccaatg cataagtgca aggctttgct gatcacgaga tttatcaaag 4080 atcagcaaca cacaccgttc gctaggtcct tcaatcactg ccttcaaaat ccaccaaatg 4140 tagcagcaat ccccgctctc tatccctgct tgaagagcat cgccaaggaa ctcccttacg 4200 tgcctggaca acttagagaa aatccttcga catctgctct gcacaactat taccgtgaga 4260 aattaaggag gccaaaaata gtggaggaat acgcaagagc aaactggaag agaatttgga 4320 gaaatatacg aacaaaagga ctcacatctc tggaacaatc tacgtactac ctactagtga 4380 atgggaagat tccacatgct gaacttctat accgacaaaa ccgtgcacaa agtcctatgt 4440 gccatcagtg tccaaacagc gtagaagatc tagaacataa attcactact tgcctacgaa 4500 cgaatgcatt gtggaattat cttcaaccac ggttggaagc aaccctaggt aggaggatag 4560 gattcaacga tctccggtta ccagagctgg gaaacgtagc cagcgacagg aaataccggg 4620 cgctgaagct atttatcggt tacgtaaatt ttgttttaga tgccaacaac gatctctctg 4680 tccaagcgtt agaatttgtt ttaaattgtt tgtaaatgta aaatgtgtac gcgtctagtt 4740 aaatgaagtt ccaataaatg tgtttaaaaa aaaaaaaaaa aaa 4783 // ID I-77_AAe repbase; DNA; INV; 7908 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-77_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7908 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1348-1348 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 512..2137 FT /product="I-77_AAe_1p" FT /translation="MMSGAYPHLSHPPGDPGPSNKRDGRHTGPTLPGWADP FT LGDHGQVIILRIQGAERDLPNRPTLIRKSVEAYLGVKITDAYPERKGISYM FT VKVRNQRHAEKLKGMKQLSDGFPIQIIEHPVLNQSKCVISCRESVNYETEE FT ILDELKEQGVTAVRRITRREGNESVPTPTLILTIDGTVTPQNIDFGWIRCK FT TRPYYPAPMLCYGCFDYGHTRARCQQKTPTCSNCCGEHTHTPEAPCQEAVF FT CKHCKKSDHPVSSRKCPTYAKEVEIGKLRVDMGIGYPAARRIYESEHRTQT FT AASIVAAGNDQRFAELNAKFDRLLIETGKKDAKLSALIAENQEKDDQINAL FT ISSIKERDTMIAERDIRIAALEAAFTSVNERLPTATAVASFKGSSNDQIAL FT PQRTKKGKKGTKENCSSADATTSAMDRLELTRKHGTIEDLVAENQLLKKKD FT VIQSQVIESLRKGSEKRTATTMHNADDSHSSTPTNGNPSHKKSLGNRNSSE FT TITKRTKTDQTTMEISDDESPKTNSQTSELYIPSDLFSSDEMNDEV" FT CDS 2197..7587 FT /product="I-77_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MATKKQEHRPKDQRVKQEGSRKENYTPKRHTTNFSRY FT NSRPVGDGCSPLPITGHSGLSASFHAAQTALQLAEASPLVSTARVFSGSAL FT TTSIYNVTLAPHRNNLRSPDSMTLKRSPQDNEGSSESNDNLTIPLRNIHGI FT ETLDRVPGSRSPYSVKVITPPEPGNIPRYSVTSHIGNKKTYLKTKAVANCN FT RSDPRGSAHRIPEDALTNARKYDLWRAAAGDFRLAVAGPSGVSSLLTAGLT FT PLPVAGASPSASTGGALSESAQTTSCFIDIAASYYAKTNPTNKKGSRDHLI FT LPLPDHEGPRNPDKASGSRGPPDAEVKAPPESGNNLRHSWANSGSEDDEEI FT TGSSDNITIPLVNNFRNKLQDNVSASRGPPSVEVIAPPEVENIPRRRPAAN FT PLKEKPIKPVSSSRPFYGLNPSDEPHPRRNPIRNCPSKYRSLTKIQDSSGT FT TPRFSPPATNTDKYDDLYITHYGPATIISADTPALPPGHPGHFNPGGKSHP FT CANDMTQPLHAHSTDLGLPENPNLGDESLVNPTQNTTNQSTLPSAGSSSPP FT LIGEALSYSTSXSHSLSTSPNPSQNIASNRTHPYMARGGRIALAAQWNVNG FT FFHNLQDVEMLVRDQQPVVLALQEIHRATPXVMNNTLGKKYQWYTKNYANI FT YQSVAIGVAAELSASEIKIDTDLPIVAIRLPWPFSVSVASIYLPNGKQPNL FT ESRLKDALKQIPEPMMILGDLNGHHRAWGSRRNCARGSIIVNVANQFNLTI FT LNDGSPTFSRGRVDTAIDVSLVSTQITNRFLWSVESDLRGSDHAPIMLTLE FT NTSAPETTRRPRWLYEQANWPDFQTSLETELEHHPPMSVSDLSALINEVAS FT ITIPKTSPNPGRRALHWWTDEAKKAVKARRKALRAFQRAKKRLPEDHPDWT FT LAQESYQVARNTCRQVIREAKDKSWTNFLDGITEEQSSSELWRRINCFNGK FT RRAKGMALKVDGVTTKDPGVIADVLADAFHELSSIQKYPEPFLKRHPQPET FT AVHRFQVPPDKGQPFNRPFSLKELEYALKKAKGKSAGPDEIGYPMLKNLPP FT SGKAALLATINNEWTAGTLPDSWKHSYVVPIPKSSGPANDVGSYRPIALTS FT CVAKLMERMANRRLIEHLVEKKKLDHRQHAFRPGYGTGTYLASLGQILDDA FT QKNGEHAEMAALDLAKAYNRAWTPGVLDKLARWGITGNMLLFVKNFLDGRS FT FQVVVGNHKSKVVREETGVPQGSVLAVTLFLVAMSGVFLALPKGIFILLYA FT DDILLIVTGKHPKSIRRKLQAAVSAVTKWAQDAGFDIAAEKCARLHVCQTR FT HKPPQKPITVNGRPMPTRKTVKVLGVNIDRYLTFRTHFDRVKEACKNRVRL FT IRSISGRRTTSDRATRLRVADAIINSRLLYGIELTCRAFDTMVSCLSPTYN FT STVRTLSGLLPSTPAAAACAEAGILPFRYKAAMTICKRAVSYLERTEGDGQ FT VCFLEEQANYALRAVAGSTLPPVAGLHRVGPRSWKAREITFDSTIKSQFRR FT GATPAAVKTCFRQILEERYSDTEVRYTDGSKLAGRVGIGIHGTSLDLAYRL FT PEQCSVFSAEAAAIYLAVAKDSQAPVLIVSDSASALSAIGSCTNRHPYIQA FT TQTKLDNTSGPITFMWVPGHCGIIGNERADALASLGRQSRFRTNEVPGEDI FT KVWVKTTIWNAWATEWTRDRSSFLRQIKPDVLPWTDVRDWREQRVLTRLRT FT GHTRATHNMGDGRNFRKFCETCNTRNTVEHIISNCPIYEYPRRQYDITSTS FT RALQNDPACERILLNFLKEAGIFNEL" XX SQ Sequence 7908 BP; 2253 A; 2226 C; 1867 G; 1554 T; 8 other; gtcattcgac agttcggttt taccgcgacc agtcgcattt tcctactcga ggtagagtat 60 tttctccgtc gctagtgatt ttttwctkag ttttacccag tgagtttggc taaaagtgca 120 gttgggatcg cggaccacta cggaactgtg ggcatccatc gtgaaatcgg ccaggtgagc 180 tgtttttaca gttaagaaag tgggcaaack tttattgaac aagtgctggc ggccattgtg 240 tccaccatca cgtgtttcga gtcgttgttg gagcacgttc acaacttgat taacgaacgt 300 tggttcgact acccaccgct tggcggtggc gaagatatcg acatagtagg gtcctttgga 360 atttgaattg acggttggts attcttaaaa gctctgtgtg gcgcggtaga agaattatta 420 atttagggta aacaatataa atccattgcc cgggagtgtt attattgcgg aaagtttcaa 480 agacgcaaac aastactggt agtctcccga catgatgtcg ggggcctacc ctcatctctc 540 tcatcccccg ggggaccctg gtccctccaa taaaagggat ggaagacaca caggaccaac 600 cctgccagga tgggctgatc cattgggcga ccacgggcaa gtgatcatcc tacggattca 660 aggggccgaa cgagatctgc cgaatagacc gaccctaatc aggaagtcgg tcgaagcata 720 tctgggcgtc aaaattactg atgcttaccc ggaaaggaaa ggaatctcct acatggttaa 780 ggtacgaaac cagcgacacg cagagaagct gaagggaatg aagcagctga gcgacggatt 840 cccaatccaa atcatcgagc acccagtgct taatcagagc aaatgcgtga tcagttgccg 900 cgaatcggtt aactatgaaa cggaggagat tctcgacgaa ctaaaggagc agggagttac 960 cgcggtccgt cgtatcacac ggcgagaggg caacgaatcc gtaccaacac cgacattgat 1020 cttgacgatc gacggaacag tgacaccgca aaatatcgac ttcggttgga ttcgctgcaa 1080 aacaaggccg tactatccag cacctatgct atgctatggg tgcttcgact acggccatac 1140 gcgtgcccga tgccagcaaa aaactccaac gtgcagcaac tgctgtggtg agcacacaca 1200 tacacccgag gccccatgcc aggaagcagt attctgcaaa cactgcaaaa agagcgacca 1260 cccggtgtca agcagaaaat gccccacsta cgctaaggaa gtggaaattg gaaaacttcg 1320 ggtcgacatg ggtatcggat acccagcagc ccgtagaatc tacgaaagtg aacaccgcac 1380 acaaactgcc gcctccatcg tcgctgccgg aaatgaccaa cgctttgctg aactgaatgc 1440 aaagttcgac cggttactca tcgaaactgg aaaaaaagac gcgaaactaa gtgccttaat 1500 cgctgaaaat caagaaaagg acgatcaaat taacgcactt atcagttcga ttaaggaacg 1560 ggacacaatg atcgcagaaa gggacataag aatagcagcc ctggaggctg cgtttacctc 1620 ggttaatgaa agactgccga ccgccaccgc agtagcttcc ttcaagggat cctcaaatga 1680 ccaaatcgcc ttaccccaac gtaccaaaaa gggaaaaaaa ggcacaaaag aaaactgctc 1740 ctccgccgat gctacgacat ctgccatgga ccggctggaa ctaacccgaa agcatggcac 1800 gattgaagac ctagtagccg agaaccagct attgaagaaa aaggacgtga ttcagagcca 1860 ggtaatcgaa tctcttcgaa aggggtcaga aaagaggacg gccaccacca tgcacaacgc 1920 tgacgatagc cattcgtcaa cgccaaccaa cggtaatcct tcccataaaa agtcactcgg 1980 taatagaaat agttcagaaa ctatcaccaa acggacaaaa acagaccaaa ccacgatgga 2040 aatctcagac gacgaatctc cgaaaacaaa ttcccaaaca tcagaactat acatcccttc 2100 cgacttgttc tcgtcggacg agatgaacga tgaagtctga agttatcact tcttaagccg 2160 cccaccaaac tctgcacact gccataacat ccactaatgg ctaccaaaaa acaagaacat 2220 cgaccaaagg atcaacgggt aaaacaagaa ggctcaagaa aggaaaacta tacacccaaa 2280 aggcatacca ccaacttctc acggtacaac tccaggccgg ttggcgatgg atgctctccg 2340 ttgccaataa ctggccattc aggattatca gcttcattcc acgcggctca aacggccctg 2400 cagttagcgg aggcctcacc gttagtttcc accgctagag ttttctcggg atccgcccta 2460 acgaccagta tctacaatgt tactcttgca cctcaccgta acaacctccg gtcccctgat 2520 agcatgacgt taaaacgctc accacaggac aatgagggaa gctcagagtc gaatgataac 2580 ctcacaatcc ccctaaggaa catccacgga atcgagactc ttgatagggt cccgggcagt 2640 cggagcccgt atagtgttaa ggtcattact ccaccggaac ctgggaacat tccccggtat 2700 tcggtgacaa gccacatcgg aaataagaaa acctatctaa agacaaaagc cgttgctaat 2760 tgcaacagat ctgacccacg aggatcagca caccgcatcc cggaggatgc actaacaaat 2820 gcacggaagt acgacctttg gagggctgca gctggtgact ttcggttggc agtagccggt 2880 ccttcagggg tatcgtccct tttaacagcg ggtctgacac ccttgccggt agcgggggcc 2940 tcaccgtcag cttctaccgg tggggctctg tcggagtccg cacaaacaac tagctgcttc 3000 atcgacattg ccgcatccta ctatgcaaaa acaaatccaa cgaataagaa agggtcgcgt 3060 gatcacctca tactccccct accggaccac gagggaccaa gaaacccgga taaggcctcg 3120 ggtagccggg gcccccctga tgcggaagtc aaggccccac cggaatcagg gaacaacctc 3180 cggcactcgt gggcaaactc cggtagcgag gacgatgagg aaattacagg gtcgagtgat 3240 aacatcacaa tccccctggt taacaacttt agaaacaaac tccaggataa tgtctcggct 3300 agtcggggcc ctcctagtgt ggaagtcatt gccccaccgg aagtagagaa catcccccgg 3360 cgtcgtccgg cggcaaaccc cttaaaggaa aaaccgatca agccagttag cagctctagg 3420 cccttctatg gccttaaccc gagtgatgaa cctcacccca gaagaaaccc tatcagaaat 3480 tgtccaagca agtatcgatc ccttacaaaa atccaagact cttcgggcac aacaccacgc 3540 ttctccccac cagcaacaaa cactgataag tacgatgacc tgtacataac ccattacggt 3600 ccagcgacaa taatctccgc ggacacccct gccctacctc ctgggcaccc cgggcacttc 3660 aacccgggcg gtaagtccca cccctgtgct aacgacatga cacaacccct acatgcacac 3720 agtacggacc tcggtctccc cgaaaaccca aacctgggtg acgaatccct tgttaatcca 3780 acccaaaata ccaccaacca gagtactctg ccctcggcgg gtagctcatc accacctttg 3840 attggtgaag ctctgtctta ttcaacctcc gwctcccact cgctgagcac gagcccaaac 3900 ccctcccaga acatcgcctc caatagaacc cacccgtata tggcccgcgg tggccgaatc 3960 gcccttgccg cgcagtggaa tgtgaatggc ttctttcaca accttcaaga cgtagaaatg 4020 ttagtgcggg atcagcagcc tgtggtcctg gctctacaag agatccacag agcaacacct 4080 gmagtgatga acaatacact aggcaaaaaa taccaatggt acaccaagaa ttacgccaac 4140 atctaccaat cggttgccat cggagtggcc gccgaactct ccgctagcga gatcaagata 4200 gacactgatc taccgattgt tgcgatccgc ctcccgtggc ccttctccgt ctcggtggct 4260 tccatttacc taccgaacgg caagcagccc aatttagaat cccgactcaa ggatgctctt 4320 aagcaaattc cagagccaat gatgatccta ggcgatctca atggccacca tcgagcttgg 4380 ggcagccgcc gcaactgtgc gcgtggctcc atcattgtta atgttgcaaa ccagttcaat 4440 ttaacgatcc ttaacgacgg ttccccaacc ttctcccgtg gacgagtcga caccgcgatt 4500 gacgtctccc tagtatcaac tcaaatcacc aaccgcttcc tctggtcagt ggagagcgat 4560 ctccgaggga gcgatcacgc accgatcatg ttaaccttgg aaaacacttc ggcacccgaa 4620 acaacgcggc gaccacgctg gttgtatgag caagcaaatt ggccggattt ccaaacatcg 4680 ctcgaaaccg agttagaaca ccaccctccc atgtccgtct cggacctatc cgccctcatt 4740 aacgaagttg cgtctattac catccctaaa acaagcccca acccgggacg tcgcgcgctc 4800 cactggtgga ccgatgaagc aaaaaaagca gtcaaggccc gcaggaaagc gctccgggcc 4860 ttccaacgcg caaaaaagag acttcccgaa gaccatccgg actggaccct tgcccaagaa 4920 agctaccagg ttgcccgtaa cacctgccgt caggtgattc gagaggccaa agacaaatcc 4980 tggacaaact tcctggacgg cattaccgaa gagcagtcgt catctgagct ctggcggcgc 5040 attaactgct tcaatggcaa aagacgggcc aaaggtatgg ccctaaaagt tgacggtgtt 5100 acaacaaaag acccgggtgt catagccgac gttctggcgg atgcttttca tgagctctca 5160 tcaattcaaa aatatcccga acctttcctc aaacgccacc ctcaaccaga aacagcggtt 5220 caccgtttcc aagttcctcc agacaaaggc caacctttca accgtccttt ctcgctcaaa 5280 gagcttgagt acgccctcaa aaaagcgaag ggaaaatcag ctggacccga cgaaatcggg 5340 tacccgatgc tgaaaaatct tcctccaagc ggtaaagccg cattactggc aaccatcaac 5400 aatgagtgga ccgccggtac gcttcctgat agctggaagc acagctatgt cgtgcccata 5460 ccaaaaagct ccggtcccgc caacgatgtt ggaagctacc ggccaatcgc tctgacgagc 5520 tgcgtggcca aactcatgga aaggatggca aaccgcagac tgatagagca cctcgtggag 5580 aagaaaaagc tggaccatcg gcagcatgcc ttccgaccag ggtacggcac tggcacttat 5640 ctggcctctc tcggccaaat cttagacgat gctcagaaga atggcgagca cgcggaaatg 5700 gcagcacttg atctggccaa ggcctataac cgagcgtgga cacccggagt gcttgacaaa 5760 ctagcccgct ggggaataac aggtaacatg cttttgtttg tcaagaactt ccttgacggg 5820 cgatcattcc aagttgttgt gggaaaccac aaatccaagg tagtgagaga agaaactggc 5880 gtaccacagg gatcagtgct agcggttacc ctgttcctag tggcaatgag cggcgtattc 5940 ttggcgctcc caaaggggat cttcatctta ctgtatgccg acgacatact attaatcgtt 6000 acagggaaac accccaagag catccgacgc aagctgcagg ctgcggtatc ggctgtgacc 6060 aagtgggcac aggacgccgg tttcgacatc gccgcggaaa agtgtgcaag gctgcacgtg 6120 tgccaaacca ggcacaagcc accacaaaaa ccaatcacgg tcaatggcag accaatgcca 6180 acaagaaaaa ccgtcaaagt cctcggtgtg aacatcgatc ggtacttgac attccgcact 6240 cacttcgacc gtgtcaaaga ggcctgcaaa aaccgggtga gactgatccg tagcatctca 6300 ggtagacgta ccacaagcga cagagcaacc cgactgcgcg tagctgacgc cataatcaac 6360 agtcgactcc tttacggaat tgagctaacc tgccgagcat tcgacacaat ggtgtcctgc 6420 ctgtctccga catataacag tactgtgaga acactatcgg gcctacttcc ttcaactccg 6480 gccgccgccg cttgcgcaga agccggtata cttcctttcc gctacaaggc agcgatgaca 6540 atttgcaaac gtgcagtcag ctatctggag cgcacggagg gcgacgggca ggtttgcttt 6600 ctcgaagagc aggccaacta tgccctgaga gctgtggccg gttccacgct tcccccggtg 6660 gccgggctcc accgtgtagg accgagaagc tggaaggcca gagagatcac cttcgactcc 6720 acgatcaagt cccaattcag aagaggcgca acccctgcag cggttaaaac atgcttccgg 6780 caaatattgg aggaacgata cagtgacacc gaagtacgct acacggatgg ttccaagcta 6840 gcgggccgag tgggcattgg tatccacggc accagcttag atctagcata ccgtctgcca 6900 gaacagtgtt ctgtgttttc cgccgaagca gcagcaatct accttgcggt cgctaaggat 6960 agccaggctc cggtccttat tgtgagtgac tctgctagcg ccctgtcagc gattggatca 7020 tgtaccaaca ggcaccccta cattcaggcc acgcagacta aactggacaa cacaagcgga 7080 ccaataacgt tcatgtgggt acccggtcac tgcggcataa tcggcaacga acgagccgac 7140 gcactggcta gcctcggccg gcaaagccgt tttcgaacaa atgaagtccc gggtgaagac 7200 atcaaggtct gggtaaaaac cactatatgg aatgcatggg cgactgagtg gacgcgtgac 7260 agatcctcct tcctccgaca aatcaagccg gacgtcttac cgtggacgga cgtgcgtgac 7320 tggagagagc aaagggtcct aacgcgcctg agaacaggcc acactcgagc aacccacaac 7380 atgggggacg gcaggaattt ccggaagttt tgcgagactt gtaacaccag gaacaccgtc 7440 gagcacatca tcagcaactg ccctatctac gaatacccca gaaggcagta cgacataaca 7500 tccaccagcc gagcgcttca aaatgacccc gcctgcgaga gaatcctact taacttcctc 7560 aaggaagcag gtatcttcaa cgagctttga cgaactttga cgagttcgaa gaatacagtc 7620 aaacacccca ctggacaact aacacagcaa cggacacgga cacgaaaatg gcactgaaac 7680 ggtaacacaa aaacaggaaa cgaatgattt acgatgaaga cctaggatgg aacaaaaaac 7740 ggactgagat ttggattacg tttaaaattg tacagcaaca ttggtgagag atacccctcg 7800 ggtacactca ttttttcttt ttttttcttt tcttttattt ttcaccgaga tgagccagcc 7860 tcgggctgca aatctcgtta ataaagataa taataataat aataataa 7908 // ID Gypsy-34_DWil-LTR repbase; DNA; INV; 1800 BP. XX AC scaffold_181096; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_DWil_; KW Gypsy-34_DWil-I; Gypsy-34_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-1800 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181096; Positions 692545 694344. XX SQ Sequence 1800 BP; 540 A; 326 C; 356 G; 578 T; 0 other; tgtagtgccc agtttctgcc gttagtttta tttttaaatt ttaaggcgtt agtttgtatt 60 tcagtagtgg tgggtaattg tgtcgataaa agtaatcagc tgagcttaga agttcgatag 120 ttttactagt gcaaccctag cgaaatagca tcatgcgcga tcgcatgcat tggtcacact 180 cttttctgac gaacttccgc cctggctaga cgcataaacc acgctgttcc tccagaattg 240 atttaaaaat ttaaatctga aattttcttg gccatcaagc caaattatta aataaatatt 300 aatctggagc atataggaca acaattattg tggagttttg tgaatcttat tatcgtaagt 360 taaatttttg tggatatttg aatctttgga agtttctaat aagttctttt ggaaacttag 420 ccctaatcaa attgaatgct gaaagcctaa gcaaggaatt ttatatttga attccgagaa 480 tttgccaggg ataaattttg agacaccgtg ggctcaaaaa ctgcacaccg tgaagggcac 540 agagacattt cgacaaagtc gaccagcttg gagtagccgc ccgaatcgga tacgggcaga 600 gaagtttctt cgtgcggatt gagagtccct tagaccgaat ttgttgcata ggaatgccag 660 acgcttgatt taacctaaca ttttattcct aagcatcatc cagcggacga cgaaggcctt 720 cgacggaccg gacggatggg taagtcctaa gttcttgaaa aaatcatgtt cgatctggta 780 attccctgtt tgtatgcaaa aagaatacta gttcagtgtc aaataatcag caaataagaa 840 aaaattctct cgcgtacagt aaagctaaag aaaagatcaa agaaaaaaga aaattgccaa 900 tcagtgttga gaaattaaat cgatattaag aatcgccctt gactgtgtga gcgagcctgc 960 agttatttat aattaatctc ataatttaag ttggagactt ccaaattcca attcacattt 1020 tcatttttga atatctataa gtttatatat atccacctta caattaagat ttcacaaaac 1080 aaaacatatt atccctaatt tggcttttgg atgcctgaaa tgatccaatt atttaatttg 1140 attaccgtag attataagaa tcatcatttt agtttagtca actcggatta tgtttccgac 1200 ttattgaagt attttggtat tccgtaaccc tcgtgaataa atatcatttg taatgtttgg 1260 gtaaaagtta tgtgatcttc cttgtagcga aatgtgagcg gtgagctgaa ttattttggg 1320 attttccaga atcttgcgaa aggaaaatca ttggttggga gggtttagac gcgaagctat 1380 aataacatcc tcatagcttc cctattcgtg cgatactaat ataccctttt tctctatcct 1440 tctagttcat ctcttttggt taaataattg tgtaatattc gtgctctagg gactttgttt 1500 tattttatgt actacgggtg tattgagtaa agctggtcca aagaaccccc ccaaagactg 1560 tttgctagcc gacagccaga agtcccaccc ggaccaaact tctcgataca ctaagcccgt 1620 agaagtgttg gctttggcct aatttgtggt actcttgccc atcataacat ctgatgatta 1680 gggactgcga gagagaccac caatataggc tgaacctgac ctcttcttgg atcaaagaag 1740 attctttgac taagagggga tgtacatgat ctggattcat tccgggttgt gtacattaca 1800 // ID BEL-6_DWil-LTR repbase; DNA; INV; 409 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_DWil_; KW BEL-6_DWil-I; BEL-6_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-409 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 129445 129853. XX SQ Sequence 409 BP; 160 A; 74 C; 58 G; 117 T; 0 other; tgtttaagat taaactacgg tttcatttga aatggcttca caaaatcaga tgcaacaaga 60 aatatacaaa acgaaaacaa aaatataaca caaaaataag tcataaagat tcatatgaca 120 attattgctt aaaaaccaaa ccaaacatgc atccacactc tatggaaaac acttaagatt 180 acaacagcgg cgcgttatca agtggtcaac gctgagcatg aaatataaac aattttcaac 240 ccccgaaaaa cgtgggtcat tagaatatgt atttttaaga acaattgtcg cctcctagtg 300 tttaagagtt attgtacaca caatactttt gtaaataaaa tcagttcata ttttgaattc 360 aacgaatgaa gattgggtta ttttccttct ctccgggaat ccctattca 409 // ID Copia-33_AA-LTR repbase; DNA; INV; 170 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-33_AA_; KW Copia-33_AA-I; Copia-33_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-170 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 958-958 (2011). XX DR [2] (Consensus) XX SQ Sequence 170 BP; 53 A; 39 C; 20 G; 58 T; 0 other; tggtgtaatc gttatcaaag ccaccattgg ttttaatgat ggaccatagc aacctgttta 60 ttacacctaa catgaaaatt tgtaatcatt cctaatctga ctctcctcaa gatcactaca 120 ataaacgtta ccgttcctta atttcaaacc actgcgtttt ctatattaca 170 // ID DNAX-3C_AP repbase; DNA; INV; 144 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-3C_AP. XX NM DNAX-3C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-144 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2054-2054 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 144 BP; 26 A; 41 C; 44 G; 33 T; 0 other; tactctgttc agcgagagag cacgctaatt ctgcgcatcc ctccgcgtca ccaggctatc 60 gaaaccggtt ccccaatggc agggtgttgc gtgggggaac cagttttcgt cggtgcgcgg 120 cgtggtctct cgctgaacag agta 144 // ID BEL-138_AA-LTR repbase; DNA; INV; 264 BP. XX AC supercont1.251; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-138_AA_; KW BEL-138_AA-I; BEL-138_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-264 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.251; Positions 110448 110185. XX SQ Sequence 264 BP; 81 A; 49 C; 50 G; 84 T; 0 other; tgcaacagta agtagtagct attctctgtg ttattctttg gctacataga ctttgtacat 60 tcaactttat agatttgcca agttcgcaaa atcacgatat tttggattgt tggggcagcg 120 taacaactag ggtgaccaat gtaagatttg tatggtatga tttaaatgaa gtaaactaat 180 gaaatacata gcttacagct ctgaattaca acccaaaccg tggtgtttcg tgctcgctaa 240 gatctccaaa agtttccccc aaca 264 // ID Copia-6_AA-LTR repbase; DNA; INV; 182 BP. XX AC supercont1.224; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_AA_; KW Copia-6_AA-I; Copia-6_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-182 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.224; Positions 320231 320412. XX SQ Sequence 182 BP; 51 A; 40 C; 31 G; 60 T; 0 other; tgaggtataa ttcaggtaat accgtttaac cgtgtatagt aaatctctac acagtctccc 60 tctgagatct gatacacagc catcttgttt ttcactctgt aaaagttcct cgagtaataa 120 agacacgttt gttataacta gcgagttacg taagtcttta tccgaattgt ccggtcatcc 180 ca 182 // ID Gypsy-95_CQ-I repbase; DNA; INV; 4199 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-95_CQ_; KW Gypsy-95_CQ-LTR; Gypsy-95_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 569-569 (2011). XX DR [2] (Consensus) XX CC Positions [3204-3662] - Integrase core CC 'AACAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 465..2186 FT /product="Gypsy-95_CQ-I_1p" FT /translation="MDKWDIPPFKFKALPWNEVRSEWIKYKRHFMYVVSAT FT GERDRTRIKNIFLAKAGPDLQEVFSTIPGADVEGSRTVDPFEVAIEKLDNY FT FAPKQHETFERNVFWTLRTSVGESLEKFMIRCTDQASKCNFGSTVEESRAI FT SVKVILFAPSDLKEKLLQVKDLNIDEAAKIISSHESIKQQALAIGQGLSDD FT SDATSNVNKIQNRPQRRPQPNCTRCGYSGHTSLDENCPARDRQCAKCKRQG FT HFAVMCLTAPKRKFKIEAGPSKRWKPERIREVRENDDKPPENFIFNIGDQD FT ELIWLRVGGVLLHVLVDSGCKKNIVDERSWKYLKENGVRASNQQTNCEEIF FT LPYGSQAKPLTTVGKFDATVTIDDEGDTIEEPATFYVIKEGQQCLLGRTTA FT TRLGVLRIGLPSTHGINAVGPKEKHPFPKISGVQVQIPIDESVTPICQHPR FT RPPIALQSRIEDKIKALLAGDIIEPVEGGCQWVSPLVTVLKDNGDLRLCVD FT MRRANTAILRERHIMPTIEDFLPRFTTAKYFSRLDIKEAFHQVRNNKKIYK FT QRRTVYAQRVTPTHINIRWSLKRRAGT" XX SQ Sequence 4199 BP; 1195 A; 1029 C; 1126 G; 843 T; 6 other; tatggcgacg aggtaaagta tacgctacta gtctttgaga ggggagagaa attcatgaga 60 ttttcagcaa tttttatttt tggggaggga ttaacgaagc ttaccaaaag tgaggttaag 120 aaaacaaggg agaaagtttg gctaacgttg cgggtttttt tcttttcaat tacagtcgag 180 gccgccggag tggcccgagc ttgagctaaa agctcagtcg aaaagcaatc acaaaggtgg 240 ttggcagtag tcccagctgc aaaggcagcc gagtttcgaa gccaccaagt tggcaagccc 300 agtggcgtag ttcgcaaagg cgaccagcgg agaggtacaa ccccgaaagc aagcacattc 360 cgcccgacgg aagtggagag tcgtcaaaca gaaaacggta cttattaggc aaattgattg 420 atttacgaaa attctgataa gaattaattt cttcgcttta gaggatggac aagtgggaca 480 taccaccgtt caaattcaag gccctgccgt ggaatgaggt ccggtccgag tggatcaaat 540 acaaacgcca cttcatgtat gtagtttctg ctacggggga gagagacaga acgcggatca 600 agaacatctt cctggcaaaa gccgggcccg acttacaaga ggtcttcagc acgatcccgg 660 gagcagatgt cgagggttca aggacggtcg acccgttcga ggttgcaatt gagaaactgg 720 acaactactt cgcaccaaaa cagcatgaaa cattcgagag gaacgttttc tggacgttac 780 gaacatccgt gggcgagtcg ctcgaaaaat tcatgatacg atgcacagac caagccagca 840 aatgcaattt cggaagcacg gtggaggaaa gtcgcgccat cagtgtcaaa gtgatactgt 900 ttgcccccag cgacctcaag gagaaactcc tccaggtgaa agatctgaac attgatgaag 960 cagccaagat catcagctct cacgaatcca tcaaacaaca agcgcttgca ataggccagg 1020 gcctatcgga cgattcagac gcgacgtcga acgtcaacaa gatccagaat cggccgcaaa 1080 gacggccgca accgaattgc acccgctgtg gctactcggg tcacacaagc ttggatgaga 1140 actgtccggc tcgggacagg cagtgtgcga agtgcaaacg gcaagggcac tttgctgtaa 1200 tgtgtctcac cgctccaaag cgaaagttca agattgaggc cggcccgagc aagcgttgga 1260 aaccagaacg aatccgggaa gtcagggaaa acgacgacaa acctccagag aacttcatct 1320 tcaacatcgg agatcaagat gagttgatct ggctgagagt gggtggagta ctgctccacg 1380 tgttggtgga ctcgggatgc aaaaagaaca tcgtcgacga gagatcttgg aagtacttaa 1440 aggagaacgg tgtccgcgcc tcgaaccagc agacgaactg cgaggaaatt ttcctgccat 1500 acggctctca agccaagccg ctaactactg tgggcaaatt tgatgcgacc gttacgattg 1560 atgacgaagg tgacacaatc gaagaacccg cgacgttcta cgtcatcaaa gaaggtcaac 1620 agtgcctact tggtcgcacc actgccacga ggttgggagt actacgcata ggcctgccga 1680 gcacgcatgg aataaatgca gttggaccga aagagaagca cccatttccg aaaatcagcg 1740 gggtgcaggt ccaaattccg attgacgaat ccgtaacacc catctgccag cacccgcgtc 1800 ggccgcccat tgcgttgcaa tctcggattg aagacaagat caaggcactc ctggcaggtg 1860 acatcatcga accagttgaa ggggggtgtc agtgggtctc gccactagta actgtcctca 1920 aggacaatgg agatctccgg ttgtgcgtcg acatgcgcag ggcgaacacg gcaatccttc 1980 gggaacgaca catcatgccg actattgaag atttcctccc aagatttacg actgcgaagt 2040 atttcagccg tctcgacatt aaggaagcgt tccaccaggt aagaaataac aaaaaaatat 2100 ataagcagag acggacagtt tatgcacaaa gagtaacacc aacacacatt aatattaggt 2160 ggagcttaaa gaggagagca ggtacataac cacgttcatc acccacgtgg gtcttttccg 2220 gtacaagcgs ctcatgtacg gaatcgtgat cgcttctgag gtcttccagc gcatcatgga 2280 gcaaatcctg agcccctaca gcaaaaacgt agtgaactat atcgacgaca tcctcatctt 2340 tggtkcgacg gaaaaggaac acgacgacgt cctgcgggcg gtgctgaaca cgctgcacga 2400 tcgaggaatc ctactsaacc aaggmaagtg tttgttcaaa acttccaagc ttcagtttct 2460 gggccacgcg atctcttcgg aaggcattga accgtgcgga agcaaagtgg aagccctgca 2520 gaatttccgg gccccgtcga cgccggagga agtacggagc tttttgggat tggtgacata 2580 catcggccgc ttcctcccgg atcttgcaac ggttacggct ccacttcgtc agctgaccca 2640 ttccggtgtt aaattcgwct ggggcaagga gcaacaggaa gccttcctgc tgctgaaggc 2700 tatgatctca aacgtgaaat tgctatactt tttcgacaac tctttgagga caagagtgat 2760 cgcggacgcg tcgccggtcg ctctgggcgc ggttttgatc cagttcggcg acgaaacaga 2820 cgacttccct cgaccaatcg cgtatgcaag caagagcctg acagaaaccg agcgcactgt 2880 caaaccgaaa aagaagcgct cgcgctggtt tggagtgtgg agaggttcac cgtttatctg 2940 atcggacgaa gtttcgaatt agaaaccgat cacaaacctc tggaggcgat tttccagcct 3000 acctccagac catgtgccag aatcgagcgc tggctgcttc gactccaatc tttcmggttc 3060 cacgtcaagt atcggaaggg agcgggaaat attgccgatc cgctgtcgcg tctagttcag 3120 cactcgtcat ctgaggattt tgacacggtc aaccagttca tgatacttgc agtatgccag 3180 tcagtcgcaa ttgacatcca tgaactcgat caggctacca agtcggactc gatactagag 3240 gcagtcaaac agtgtattcg caccggaaac tggtcgtcat cgactattac agtcgataca 3300 aggaagtaga gctgatggcg aagataactg caaaggaaac ggtgcttaga ctcgacaaga 3360 tcttcacacg gctggggtac ccacaaacga taacgctaga caacgccaag cagttcgtcg 3420 gtgtagaaat ccaagagtac tgcaagacgc acggcatcta cctgaatcat tcggctccgt 3480 attggccaca ggagaacggt ctagtggaga agcagaatcg atcgttgctg aagaggttga 3540 agatcagcca cgccctgaat agagattgga agcaggatct acgggagtat ctggtcatgt 3600 attacactac tccgcactcg accaccggaa agacaccaac cgaaatgatg tatggccgga 3660 cgatccgctc gaagattccg gcgatcagtg atatcgatgg tgtcccgttg aacacagaag 3720 aggccgatcg agatcgcatc ctaaaacaaa aagggaaaga gaacgaggac gcccgccgta 3780 atgcacggag atcatctatc agcaccgggg acaccgtcct aatgcagaat cttctacctg 3840 ggaacaagct gactacgaca ttcagtccga cggagtacac agtggtgtcg cgcgatggac 3900 ctagggcgac gatccacgac tcgagcagcg gcaaatcttt tgaacgaaac gtcgcacacc 3960 taaaacggat cggaaagcct gttgctgaag atgagttcac atgcgaggcc ggggctgtgg 4020 agaactccgt tgacgtgcga gcgggggata tcggccacag caatccgatc gttggcgacg 4080 aactccaagt atttgtggac agcgaggaac cggaaccgga acaaccgagg aagtcaatac 4140 gaccgcttaa aaagccagcc agatgggctg actatgtttc ttcgtgaaaa aaggggaga 4199 // ID BEL-190_AA-I repbase; DNA; INV; 6830 BP. XX AC supercont1.75; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-190_AA_; KW BEL-190_AA-LTR; BEL-190_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6830 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.75; Positions 200286 193457. XX CC Positions [5815-6372] - Integrase core CC 'GAACC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 25..6828 FT /product="BEL-190_AA-I_1p" FT /translation="MQKSFVRQTRSKTRATEGAATSRSQPTRDDGARLNIP FT EIVATVFEPSLVDIEHGDDNDCAECRRPNSAELYMVQCGCCKHWYHFSCAK FT VDTVTARSREFVCAKCKALDPPPPASTRTGRSSGASSKRMQIARDLQRLEE FT ERHLRERLEDERMQNEKLLVEKAMREKLEREKEYLDRKHELLRQQDEEALS FT MRSSRSSRSQASSRHKVEAWFDQQQRLDSLILPLNQKDPEQVSANPPADPV FT GSSTPVDMSKITDVELQQAPAGIMEAESVPRTTDSISIGESPGTEDDFRQV FT QPQHNIERQPNVPSLPLVNLQPFIRLLEEANPTSTFVPKPINADPRIVKRV FT MGPSLPYAVWQRETSEMRKQHAREQQCPEELEEPRNRHHREQELVDLLKRS FT EEQREQDRLRMQEIEAMLTRQHAIEKQHQKENDVRRKREMELINQLKLIEQ FT QYMQEKVKHDEERQSFLAKEQHLTEQLDFLRLQSRQPSTAEQHGKLTDPTC FT QQGSTVSTNPPAAPVSSSQMEHFETSHLYETNPNRPTPCPSIASQYRRVGS FT PQTPYVTPKLGPRQPDFIQAEPAQFDLFETAYPQPVLQAPTPHQMAARQVI FT SRELPIFSGDPIEWPLFISSYNHSTEACGYSSSENLLRLQRALKGAAKEAV FT SSFLLHPSTVPLIISTLQTLYGRPEQIVHNLVAKVRSTPAPKAERLETLIQ FT FGLAVQNLCGHLKAVGMANHLSNPILLQELVDKLPANIKFSWALHQENLPM FT VDLDSFSEYMRKVTNATSGVTNFCVTGKPTKDEKVKIKDKAFVNAHATYER FT KDSKPATNPVSTDQREQIQDPKGKRSEGSKYCPVCSVDNHTTASCPVFKKL FT SIDNRWNFVKENRLCRRCLVSHTRWPCEGEVCGINSCQKRHHRLLHYEPAK FT KQCQQAVNSTDATVTIHRQPFSSTLFKILPVTLYGKNGSVNTYAFLDDGSS FT ATLLEKTIANELGMSGSAHSLCIHWTSGINKKIATMELETFSISEPGSEKR FT FNLSEVYTVENLGLPEQTLDFEQLTKEFEHLRHLPVKSFQRAVPGLLIGLS FT NSHLLTTVKIREGKEQEPIAAKTRIGWAVCGRVRGGEKQFHHRQMHICTDS FT FDHDLHDYVQEFFSVESLGVAIAPNLEGSEEQRARQILEETTVRTEGGKFE FT TGLLWKHDLIEFPDSRPMAERRLRCLERRLEKNPQLYDSVRQQITDYESKG FT YIHRVTNEEMSKFDPRRTWYLPLGVVLNPNKPGKIRVIWDAAAKVGGVSLN FT TMLLKGPDFLTPQLSVTFKFREREVAFSGDIQEMFLQVGIRKEDRSALLFV FT YRNSPREPMVTMASDVAIFGATCSPAQSQFVKNRNATENEEEYPRAAAAIK FT NKHYVDDYLDSVDTEEEAVELALDVAKVHQKAGFHIRNWVSNRSSVLEAIG FT EVNPVTVKNLSMNSQSGFERVLGMSWLPCEDVFCFTINVQDNLDAEIAPTK FT RRMLGFVMKIYDPLGLVGSLVVQGKILLQDVWRAKVDWDEQIPEVLFARWK FT QWMQILKEVNEVRIPRCYFPGYSPACYDTLELHIFVDGSAQAYSSAAYFRV FT RDRGQIRCALVASKTKVAPLQLVSVPRLELQAAVIGARLRKSIEDGHSIKV FT TRTYFWSDSSTVISWIKSDTRRYRQYAAFRVNEILSLSKAEEWRWLGTKSN FT VADEATKWGKGPNCKSGSRWVRGPAFLHGEENCWPKDKFEIIDETQEELRP FT AYVCSHFVAETIIDISRFSKYERLLRSMAYVHRFSNKLLQRMKRMPFEDTG FT SISGEDLRNAERFLWRYAQSDVYPDEVALLKRNSHLTPNKRQSLCRSSSLA FT NLPPVMDEHGLIRVDGRIAAADYVEYDAKFPVILPKGHPITSLLLDYYHRK FT FRHANNETIVNEIRQKFYVPKLRTQVRSVAKRCQWCRIYKVVPEVPKMSQL FT PRFRITPFVRPFTFVGIDYFGPYLVKVGRCAVKRWVAIFTCLTVRAIHMEV FT VHSLSTDSCKKAVRRFIARRGAPQEVYTDNGTNFIGASKELNDELREIYNT FT LGSSFTDTNTQWRFNPPAAPHMGGCWERMVRSVKTAIGTLPESRKLDDESF FT ITLLAEAEHMVNSRPLTFLPLDSEEKESLTPNHFLMLSSNGVRQSIKMPVD FT EKSALKGSWELIQNKLDYFWTRWTVEYLPIISRRSKWFKEVAPIEVGALVM FT IADGKIRNQWIRGRVTRTYPGKDGAVRLADVETPTGVLKRRAVCKLAVLDV FT CDRQEPAMSNGDIGNGHHGADRRGYGAVEPVFTRHEGE" XX SQ Sequence 6830 BP; 1974 A; 1534 C; 1778 G; 1544 T; 0 other; atctcaaaga tttgagccat caccatgcag aaatctttcg ttcgtcagac gcgatcgaaa 60 actcgagcca ccgaaggagc ggctaccagt cgttctcaac ctacccgtga cgacggtgca 120 cggctcaaca ttccagagat agtcgcaaca gtcttcgaac cttcgttggt tgacattgag 180 cacggcgatg ataatgattg tgccgagtgt agacgaccca atagcgcgga gttgtacatg 240 gtacaatgcg gatgttgcaa gcattggtac catttctcgt gtgcaaaagt ggatacagtt 300 acggcacggt ctagagaatt cgtttgtgcg aaatgcaaag cgttggatcc accaccaccc 360 gctagcacgc gaaccgggcg atcgagtggt gcaagttcga agagaatgca gattgctcgg 420 gatcttcagc ggttagagga agaaagacat ttgcgggaac gactggaaga tgaaaggatg 480 cagaacgaga agttgctggt cgaaaaagcg atgagagaaa aattggaacg cgaaaaggag 540 tatttggatc ggaagcatga attgcttcgc cagcaagacg aggaggcatt gagtatgcgg 600 agtagccgta gcagtcgaag tcaagcaagc agtcgacaca aggtagaggc ttggtttgac 660 caacaacaaa ggctcgattc gctcatcctt ccgttgaatc aaaaggatcc tgaacaggta 720 tctgcgaatc cacctgcaga ccctgttggt tcatcaacac ctgtggacat gagcaaaatt 780 actgacgtcg aactccaaca ggctccagct ggaatcatgg aagctgaatc cgttcctcgt 840 accactgata gcatttccat tggagaaagc ccggggacgg aagatgattt ccgtcaagtt 900 cagccacagc ataatataga acgacagcca aacgttccca gtctaccgtt ggtgaactta 960 cagcctttca ttcgtcttct ggaggaagcg aatcccacaa gcacgttcgt cccaaaacca 1020 ataaatgcgg atccaagaat cgtgaagaga gttatggggc catcattacc ctacgcagta 1080 tggcaacgcg aaacaagcga gatgcggaag cagcatgctc gtgaacagca atgcccagag 1140 gaattagaag aaccccgtaa tcgacaccat cgcgagcagg agttggttga tctgttgaag 1200 cgttccgagg aacagcggga gcaggacagg ttgcggatgc aggaaattga ggctatgcta 1260 acgaggcaac atgcaatcga aaagcaacat caaaaggaga acgacgttag gagaaagcga 1320 gaaatggagt taataaatca actgaagctc attgaacaac agtacatgca ggaaaaggtg 1380 aagcatgacg aagaaaggca atcgtttctg gccaaggaac agcatctgac tgagcagctg 1440 gattttttgc gtctacagag cagacaacca tcgactgcgg aacagcacgg aaaactgacg 1500 gatccaactt gccaacaagg ttcaaccgtg agtactaacc ctccagcggc tccggtaagt 1560 agttctcaaa tggagcattt cgaaacaagt catttgtacg agacaaatcc gaatcgacct 1620 acaccttgtc caagtattgc aagtcagtat aggagagtag gatcccccca gactccatat 1680 gtaactccaa agttagggcc aaggcagcca gacttcattc aagccgaacc ggcgcaattc 1740 gatctcttcg aaaccgccta tcctcaaccg gtgttacaag cccctacacc tcatcagatg 1800 gcagcaaggc aagtcatatc gagagagttg ccgattttct ctggtgaccc catcgaatgg 1860 ccacttttca ttagcagcta caatcattct accgaggcat gtggctactc ctcttcagaa 1920 aacctgctac gtttgcaacg agctctgaaa ggtgcagcca aggaagctgt tagtagtttt 1980 ctcctgcatc cgtctacagt gccactgatc atttcgaccc ttcagacgct gtacgggaga 2040 cctgaacaga tagtgcacaa tcttgtagct aaagtgcgta gtacgcccgc tccgaaagca 2100 gaaagactgg agacgcttat ccagttcggt ttggctgtgc aaaatttgtg tggtcattta 2160 aaagcggtag gaatggcaaa ccatctttcc aaccctatcc tgcttcaaga gctggtggac 2220 aagctgccgg ccaatattaa gtttagctgg gcacttcatc aagagaatct acccatggtt 2280 gatttggatt cattcagtga atacatgagg aaggttacga acgctaccag tggagttacg 2340 aacttctgcg taacaggaaa acctacgaaa gacgaaaagg tcaagataaa ggataaggct 2400 ttcgtgaatg ctcacgcgac gtatgaacgg aaggattcaa agccagctac caatccagta 2460 tccactgatc aacgagagca gatccaagat cctaaaggga aacgaagcga aggaagtaag 2520 tactgtccag tatgtagcgt tgacaaccat actactgcga gctgcccagt cttcaagaag 2580 ctttcaatag ataatcggtg gaattttgtg aaggaaaaca ggctgtgtcg ccgttgtctg 2640 gtttcccata ctcgttggcc atgcgaaggt gaagtctgcg gaatcaatag ttgccaaaaa 2700 cgacaccatc gactgctaca ctatgaacca gccaaaaagc agtgccagca agcagtgaac 2760 tcaaccgacg caactgtgac tatccatcgt cagccctttt cttcaacatt gttcaaaatc 2820 ctaccggtga cactgtatgg gaagaatggc tcagtaaata cgtacgcttt cttggatgat 2880 ggttcgtctg caaccctcct ggagaaaacc atagcaaacg aattgggtat gagtgggagt 2940 gcccattcct tatgcattca ttggaccagc ggaataaata aaaaaattgc gacaatggaa 3000 ttggaaactt tcagtatttc ggagcctggg agtgaaaaac gcttcaacct ttccgaagtg 3060 tacactgttg agaatctggg tttgcctgaa cagacgttgg atttcgaaca gctaacgaaa 3120 gagttcgaac atttgcgtca tcttcccgtc aaaagcttcc agagggccgt tcctggattg 3180 ctgattggat tgagcaattc gcatctgctg accaccgtaa agattcgaga aggcaaggag 3240 caagaaccaa tcgctgcaaa gacccgtata ggatgggccg tctgtggtcg tgtacgaggg 3300 ggagaaaagc aattccacca ccgtcagatg cacatatgta cagattcgtt cgaccacgac 3360 ctgcacgact acgtccagga gtttttctct gtggagagcc ttggagtagc catagcccca 3420 aacctggaag gtagtgagga acaacgagca cgccaaattc tcgaagaaac aacagtacgg 3480 acagagggcg gtaaatttga aactgggcta ctttggaagc atgatttaat tgagtttccg 3540 gatagtagac caatggctga gcgtcgatta agatgcctgg agagacgtct ggagaagaat 3600 ccacagctat acgatagcgt gcggcaacaa ataacggatt acgagtcaaa agggtatatc 3660 caccgagtaa ccaatgaaga aatgtcgaag tttgacccgc gccgcacttg gtaccttccg 3720 ttaggcgtgg tactcaaccc gaacaagccc gggaaaatac gagtcatttg ggacgcggca 3780 gcaaaagttg gcggagtatc tttgaacacg atgcttctaa aaggaccaga cttcttaaca 3840 ccgcagctgt ctgtaacgtt taaattccgt gagcgggaag tggcgttttc cggcgacatt 3900 caagagatgt tccttcaagt cggaatccgt aaagaagatc gcagcgcgct gttatttgtc 3960 tatcgcaatt ctccaaggga gccgatggtg acgatggcgt ccgacgttgc tatcttcgga 4020 gcaacttgtt ctccagctca atcacaattt gtgaagaacc ggaacgctac agagaacgag 4080 gaagaatacc ctagagcagc ggcagcgatc aaaaacaagc attacgtcga cgactatctt 4140 gatagtgtcg acactgaaga ggaagccgtc gagctagcat tggatgtggc taaagtccac 4200 caaaaagctg gtttccatat ccgcaactgg gtttcaaatc gatcatcagt cctcgaagca 4260 atcggtgaag tgaacccagt aacagtgaaa aatctttcga tgaacagtca gagcggtttc 4320 gagcgggttc ttgggatgtc gtggctaccg tgcgaagatg tgttctgttt cacaatcaac 4380 gtacaagaca acctggatgc tgagattgcg ccgaccaaga ggagaatgct tggcttcgta 4440 atgaaaatct acgatccatt ggggctagtt ggctcgctag ttgtacaagg aaagatacta 4500 ctccaagacg tctggagagc taaagtggat tgggacgagc aaattcccga agttctattc 4560 gcacgatgga aacaatggat gcagattctg aaggaggtga acgaagtccg catacctcga 4620 tgctactttc cgggatacag tccagcctgt tacgatacct tggaactcca catattcgta 4680 gatggaagcg cacaagcata ttcctctgct gcttactttc gtgttcgaga tcgtggccag 4740 atacgatgtg cgttagttgc ctctaaaaca aaggtagctc cgcttcaact agtttctgtt 4800 ccacgattag aattgcaagc ggccgtaatt ggtgcacgtt tgcggaagtc catcgaggat 4860 ggtcactcaa taaaagtcac acgcacgtac ttctggagtg attctagcac tgtcatttcg 4920 tggatcaaat ccgacacccg tcgatatcgg cagtacgcag cgtttcgagt caatgagatt 4980 ttaagcttat ccaaagctga agaatggcga tggttgggaa ctaagagcaa tgttgcagat 5040 gaagccacca agtggggaaa aggaccaaac tgcaagtctg gaagtcgctg ggtacgtggt 5100 cctgcctttc tgcatggtga agagaattgt tggccgaagg ataagttcga gataatcgac 5160 gaaacgcaag aagagctgag accagcgtac gtgtgcagtc actttgtggc agagacaatc 5220 atcgacatat cacgattctc caaatacgaa cggctgctac ggagtatggc gtatgtgcac 5280 cgtttcagta ataagctttt gcaacgaatg aaaagaatgc cgttcgaaga tacagggagt 5340 ataagtggtg aggatttgcg gaatgcggag agatttctct ggcgatacgc tcagtcagat 5400 gtttatccag atgaggtcgc cttattgaag cgcaacagcc atttgacacc aaataaacga 5460 caatcgctgt gtcgaagcag ttcgttagcg aatcttccac ctgtaatgga tgaacacgga 5520 ctgatccgtg tggatggtcg aatcgctgct gcagactacg tagaatacga tgccaagttc 5580 cctgtgatac tgcctaaggg acacccgatc acaagcttgt tgttggacta ttaccaccga 5640 aagtttcgcc acgctaacaa tgaaacgatc gtgaacgaaa ttcgtcagaa gttttacgtg 5700 ccgaaactaa gaactcaagt tcgatcagta gcaaaacgtt gccagtggtg ccgtatttac 5760 aaagtcgttc cggaagtgcc gaagatgagt caactacctc ggttccgtat tacgcccttt 5820 gtgagacctt ttacgtttgt aggcatagac tacttcgggc cttatctagt aaaagtcggt 5880 cgttgtgcgg tcaagcgttg ggtagcgata ttcacgtgtc taaccgttcg ggctatacac 5940 atggaggtag tgcattctct gtcaacggat tcgtgcaaga aggcagttcg acggtttata 6000 gctcgccgag gagcgccaca agaagtgtac acggataacg gaacaaactt cattggtgcg 6060 agtaaagagc taaacgacga attacgagaa atctacaaca ccttgggtag cagttttacg 6120 gatacgaata ctcaatggcg gttcaaccca ccggctgccc cacatatggg cggatgttgg 6180 gagcggatgg tccgatccgt gaagactgct attggaactt tgccggagtc gcggaagttg 6240 gacgacgaat cgtttatcac cctactggca gaggcggagc acatggtgaa ttcgcgcccg 6300 ttaacctttt tgccactcga tagcgaagag aaggagtcgt taactccaaa ccattttttg 6360 atgcttagct ctaatggagt tcgacaatcg atcaagatgc cggtagatga aaaaagcgct 6420 ctgaaaggca gttgggagct aattcagaac aagttggatt acttctggac tcgctggacc 6480 gtcgagtatt tgccaataat atctcggcga tcaaaatggt tcaaggaggt tgcaccaatc 6540 gaggtaggag cgttggtgat gatagcagac gggaaaatcc ggaaccaatg gatacgtggt 6600 cgtgttaccc gtacatatcc cggcaaggat ggagcagtac gactagcaga cgtagagacg 6660 cctacagggg tgttgaagag aagagcggtg tgcaagctgg cagttttgga tgtgtgcgat 6720 cgtcaggaac ctgcaatgag caatggtgat ataggcaacg ggcatcatgg agcagatcgt 6780 cgtggatatg gtgccgtcga accagtgttc acacgacacg agggggagga 6830 // ID Ingi-1_Rpro repbase; DNA; INV; 3885 BP. XX AC ACPB01056162.1; XX DT 02-FEB-2010 (Rel. 15.02, Created) DT 02-FEB-2010 (Rel. 15.02, Last updated, Version 2) XX DE A family of Ingi non-LTR retrotransposons from Rhodnius prolixus. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; Ingi-1_Rpro. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-3885 RA Kojima K. and Jurka J.; RT "Ingi non-LTR retrotransposons from insects."; RL Repbase Reports 10(2), 149-149 (2010). XX DR EMBL/GenBank/DDBJ; ACPB01056162.1; Positions 1 3885. XX CC This sequence corresponds to gb|ACPB01056162.1|:623-4507. The AP CC endonuclease region is broken. XX FH Key Location/Qualifiers FT CDS 113..3841 FT /product="Ingi-1_Rpro_1p" FT /note="includes broken AP endonuclease, reverse FT transcriptase, and RNase H domains and a CCHC FT zinc-finger motif." FT /translation="MRPRNARSTNGLSQGPDNTLLKVFFWNAGGLTPDKFS FT ELKAFIAKEDIDIFGIVEAGASTENPQYFQVPGYQTYVLKRARQVASGIIF FT AAKTSLTVDFIVKKEMVENDRLEAVQADVWAGKLHRRFYVLYNPPSNRPDL FT SWLEAQWGPRDILYGDINAPSVRWGYMQTSPVGRIFEDFIDANPVVAIRNL FT ECSPTFISYSGNTSSPDVILAHAALDGEIEHSLEHAPGDFGHKILKLVITR FT DSSCPKEPRFTRWNLRKAKWDLFSKLSDQEIKEDLIQGSVDAAFNNICTAV FT LKCATQAIPRGCVKRYSPFWNEDLQRLKEERDIARTAYEQSRLQEDKIRLK FT EAGALLRKEIINAKRNSYNQFLENLDFRKDGVQAHRFLSKISKNMEPAKRQ FT PFVASSGKTLTSNEDIAKLFCSYYAKISSWQPEQKNRKLNLKLPPTCLNEK FT ENSLFHTPFCWAELEIALDTMPIKKCAGPDNIPPEFIKHLGGSARSTLLRF FT MNKVWEQGIPSVWRKADILPIPKKGKPTNRPENCRPIALTSMFSKLYERLL FT LSRLQEFLEERQLISDKQAGFRKHRNTVEQVAYLSQEIKNGFHRKQSTVAV FT FVDLKAAFDKVWRERLIQKIRDIGVAGNMYKSIFSFLAQRYIRVKYNGSFS FT PFKQTRKGVPQGAVLSPTLFSLMINDVVAAVEAIPGVRVLLYADDLVIWAT FT SSHVDAIESYLNKAMETLYQWTTNNKLEVNTSKTEYQVFTLSTNKKQVHIK FT FNNQILPQTDSATYLGIKMDGRLGWKKHIDKLVQQGEGRLGLLKRLAGIKW FT GANQDLMTLTYKTYIRPTIEYGNEVLLTASDNSLKALDLLQNKALRLITGA FT AKSTPIAAMEIQTGIEPLQIRREISAINLFERVKRLKIKHWDLQQSSAQRL FT KTQKPLAVKVSEVIATRGLDLGRKQPFERSCVQYGHLAEGKLNLLQPLLKD FT KTSEQEMRALALETIHSRYPARDWLHIYTDGSARAATHNAGAGVYCREFTL FT AEPVGRMCTNFDGEVKAISMALAKLIEDQKDNPQIVLFIDSQAAILAISSR FT HYTDHSEVGIARSHIAFLIERGTRVVLQWVPSHCGLTGNEQADELARRRAE FT MQQPDTPLTFSAVKNVVKSSYNEEIKRTQQHQAEGKRWRTLLENPVSVTAA FT RKEAVACFRMTTGHDYLQKHLHRIGVVDSPMCPLCLQEEKMDSQHLPVCHA FT LIDVVANDSTNPHTHENKAIMTKLYWAARSRMD" XX SQ Sequence 3885 BP; 1212 A; 848 C; 920 G; 905 T; 0 other; ggggggtaac attgctcgcg tccgtacaga tagtacacgt gccccctggt tagcgctaaa 60 tgagcaatag tgtcgccata cactcagccc aataactcca atttgcgaca ccatgagacc 120 acgaaatgct aggtctacaa acggcctttc tcaaggcccc gacaacactc tgctgaaggt 180 atttttctgg aacgcaggtg gcttaacgcc ggacaaattc tctgaattga aggcttttat 240 tgctaaagag gacattgaca tatttggaat agtggaggcc ggtgcttcaa cagaaaatcc 300 ccaatacttt caagtgccgg gttatcagac ctatgttctt aaaagggcca gacaagtagc 360 gtcgggaata atttttgctg caaagacgtc acttactgta gatttcattg tcaaaaaaga 420 aatggtagaa aatgataggc tggaagcggt gcaggcggat gtatgggccg gtaagcttca 480 caggcgtttt tatgtattgt ataacccgcc ctcgaacagg cccgatctct cctggcttga 540 agctcagtgg ggtcctcggg acattctata tggggacata aacgctccct ctgtaagatg 600 gggctatatg caaacctctc ctgtcgggag aattttcgag gatttcatcg atgcaaatcc 660 tgtagtggca ataagaaatc tggagtgcag ccccacgttt atttcctaca gtggcaatac 720 tagcagccct gatgttatct tagctcatgc agccttagat ggcgaaattg agcactctct 780 ggaacatgca cctggagact ttggacacaa gatactgaag ctagttatta ctagagatag 840 cagctgtcct aaagaacccc ggtttacccg atggaatttg agaaaagcta aatgggatct 900 cttttccaaa ctgagcgacc aggaaataaa ggaggattta atacagggca gcgtcgatgc 960 tgcttttaat aatatctgta ctgcagtttt gaaatgtgca actcaagcca tccccagagg 1020 atgtgtcaag aggtactctc ccttctggaa tgaagatttg caaagattga aggaagaaag 1080 ggatatagcc agaactgcct atgaacaaag tcgccttcaa gaggacaaga tccgtctaaa 1140 ggaagcaggt gccctcctta ggaaagagat aatcaatgct aagagaaact cctataatca 1200 gttcctagaa aatttggact tcagaaaaga tggagtacag gcacatcgct tcctgtccaa 1260 aataagtaaa aatatggagc ctgctaaacg acagcctttt gttgcctctt ctggcaaaac 1320 ccttacatca aatgaggata ttgcaaagct cttctgttca tactacgcta aaattagtag 1380 ctggcaaccg gaacagaaaa accgaaaatt aaatcttaaa cttcctccta cttgccttaa 1440 cgagaaggaa aattcgcttt ttcacacccc attttgctgg gctgagttag aaatagccct 1500 cgatacgatg ccaattaaga aatgtgccgg acctgacaat atacctcctg aatttatcaa 1560 gcaccttggc ggatcagcaa ggagcacttt actaagattt atgaataagg tttgggaaca 1620 aggtatccca agtgtctggc gaaaggctga catactaccc atccccaaga agggtaagcc 1680 gacaaacaga cctgaaaact gcaggcctat tgcccttacc agtatgtttt caaaactgta 1740 cgaacgatta ctgctatcga ggttgcagga attcttggaa gaacgacagt tgatttctga 1800 taagcaagcc ggcttcagaa agcacagaaa cactgtggaa caggtagcat acctgtctca 1860 agagataaaa aatggtttcc accgaaaaca gagcacagta gcggtattcg tggatctaaa 1920 ggcagccttt gataaagtgt ggagggaaag actaattcaa aaaatcagag acattggtgt 1980 ggcaggaaac atgtacaaaa gcattttctc ctttctagca cagagataca tccgggtaaa 2040 atataacggt tctttctccc cattcaagca gacgagaaag ggagtccctc agggcgctgt 2100 tcttagccct accctattct cgctaatgat taacgatgtt gttgcagctg tagaggccat 2160 tcccggggtc agggtgttat tatacgcaga tgaccttgtg atatgggcta caagcagcca 2220 tgtagatgcc attgagtcgt atctaaacaa ggcgatggag acgctgtatc agtggactac 2280 aaacaacaaa ctagaggtta acacgtctaa aacagaatat caggtcttca cactttcaac 2340 aaataagaaa caggttcaca taaagtttaa taatcaaata ctcccacaaa ctgactctgc 2400 cacgtacctc ggtattaaga tggatggcag gctaggctgg aaaaaacata ttgacaagtt 2460 ggtccagcag ggagaaggaa ggctggggct gctaaagaga ttggcgggaa taaaatgggg 2520 cgcgaaccag gatttaatga ctctgaccta taagacttat atcaggccta ctatagaata 2580 tgggaatgaa gttcttctga ctgcgtcaga taactcttta aaagcgttgg atctactaca 2640 aaataaggcg ttaaggttga tcacgggagc cgcaaagtca acacccattg cggctatgga 2700 gattcagact ggtatagaac cccttcaaat cagaagagaa atttcagcaa taaacctctt 2760 tgaaagagta aaaaggctga agataaagca ctgggatttg caacagtcgt ccgcccaacg 2820 cttgaaaaca cagaaaccac tcgctgttaa ggttagcgaa gtcatagcca caagaggcct 2880 ggatctaggc agaaaacaac cttttgagcg cagctgcgtg cagtacggac acttggcgga 2940 gggaaaacta aatcttttgc aaccgcttct taaggataaa acttccgagc aggaaatgag 3000 agcgttggcc ctggagacaa tacattcccg atatcccgct cgggactggc tacatatcta 3060 cacagatggt tccgcccgag cggcgaccca taacgcaggg gccggagtgt actgtcgaga 3120 gtttacgcta gctgagccag tgggtaggat gtgcactaat tttgatgggg aggtcaaagc 3180 tatctccatg gctctggcta aattaataga agaccaaaaa gataaccccc aaatagtgct 3240 ctttattgat tcccaggctg caatactggc aatctctagt agacattaca ctgaccactc 3300 ggaggtgggg atagcaagaa gccacatagc ttttctgatc gagcggggaa ctcgtgtcgt 3360 gctccaatgg gtgccgagtc attgcggcct aacggggaac gagcaagcag atgagctagc 3420 ccgcagaaga gccgaaatgc agcaacccga cacaccgctc accttctccg ctgttaaaaa 3480 tgtagtaaag tcttcttata atgaagaaat aaaaaggacc caacagcatc aagcagaagg 3540 aaagcggtgg cggacgctgc ttgagaatcc tgtttcggta acagctgcaa gaaaggaggc 3600 tgtggcatgc tttcggatga ctacagggca tgactacctc caaaaacatt tgcacaggat 3660 tggggtggtg gatagcccta tgtgtccttt gtgtctgcaa gaggaaaaaa tggacagtca 3720 gcacttaccg gtctgccacg ccttgataga tgtcgtcgcg aacgacagca caaatccaca 3780 tacccatgaa aataaagcca ttatgaccaa attatattgg gctgccagaa gccgaatgga 3840 ctgagcacca acggcttggc gttaataaaa aaaaaaaaaa aaaaa 3885 // ID Kolobok1-N2_NVi repbase; DNA; INV; 2705 BP. XX AC . XX DT 16-FEB-2009 (Rel. 14.02, Created) DT 16-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW Kolobok1-N2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2705 RA Jurka J.; RT "Kolobok-type DNA transposons."; RL Repbase Reports 9(2), 482-482 (2009). XX DR [1] (Consensus) XX CC Putative non-autonomous. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(208..444,470..1138,1504..1881) FT /product="Kolobok1-N2_NVi_1p" FT /translation="IATHLILILIFKRRIGSLNAKRKRKYHFIQSAQSVNP FT AIDGGTNVYTVQIFLLCESMFVTYLYTIHVNKIYVYAKVVVAITTMDAIHS FT KWRKPKPKKNRKSATSAANLIKARANVNKTDQNKNATKTSVPNRNSLSEKP FT CTRSVSNNTVNAMQAKLQESATTRKLAFTDMLKSSSDVVDEEENMFININV FT FKVIADKMTCEFCKTNSVNFELGQRNGFAVNLLFICKVCGEVCVDTFSSSR FT TDPNKNNSSFVINEIVVQSFTKVGLGIAALQDVCASVISEVLSKSRKIVHQ FT VHKSLIQIYYRIISSVHPRTYSKCQRGYTFSAVEKVLQKYSSSKTRIEIAA FT AHAVSEYNLGRQRSAEILASITNSKMTSHAEKITTQKDKKRKLSSDKKHEP FT ESKAARISKKYGSKRQESLKVQSEGVTYGAGCF*" XX SQ Sequence 2705 BP; 994 A; 379 C; 461 G; 871 T; 0 other; ggtgggattc tggtgttttt cgaccaaaaa atcgaaaaaa tttttatttc tttaaagtgt 60 tctaaagaat gcgtagaatc tagtatcaag ttccttttat ggggataaat tttatataca 120 tgaaaagcaa gctatagaag tttaaaaaaa ataatacgtt tttcgatttt ttaataaaat 180 tctaaatgtg tgcattctag tgcataaatt gctacacatt taattttaat attgattttt 240 aaacgacgaa taggtagttt gaatgcaaaa cgaaaacgta agtatcattt tatccaatcg 300 gcccagtctg tgaaccctgc gatagatggc ggcacaaacg tttatacggt acaaattttt 360 ctgctgtgcg agtctatgtt tgttacatac ttatatacca tacatgtaaa caaaatatat 420 gtatatgcta aagttgtagt tgcataattt ataatttatt aatttataaa ttacaacaat 480 ggatgctata cattcaaagt ggcgtaaacc gaagccaaaa aaaaatcgta aaagtgctac 540 gagtgcagct aacctaataa aagctcgagc aaatgtaaac aagactgatc aaaataaaaa 600 tgcaacaaaa actagtgttc caaatcgtaa ttctctcagt gaaaaacctt gcactagaag 660 tgtttctaac aatacagtaa atgctatgca agctaaactg caagagtcag caacaacaag 720 aaaattggcg tttactgata tgttaaaaag ttcaagtgat gttgttgatg aggaagaaaa 780 catgtttatt aacataaatg tttttaaagt tatagctgac aagatgacat gtgaattttg 840 taaaacaaat tcagtgaatt ttgaattagg acaacgaaat ggttttgctg tgaatttatt 900 gtttatttgc aaagtttgtg gtgaagtatg tgtggataca ttttcaagct cacgtacaga 960 tcctaataag aacaactctt ctttcgtgat caatgagatc gtagtgcagt cttttacaaa 1020 agtaggatta ggaatcgctg ctcttcagga tgtttgtgct tctgtaatat cagaagttct 1080 ttcaaagagc agaaaaattg tgcatcaagt tcataaaagt ttaatccaga tttattattg 1140 aaaaagtgca ttctttcttc gtacgatgga tcttggataa gcgtgggttc acctctctta 1200 tggatttggt gcttattgat gtggtcttgt tgtcgactat gaagtgctta gtaaatattg 1260 ccacatgtgc atgattactg aaaagttagg atctgactct ccagagtttg atattggtac 1320 aaatcaactt caggcaatta aaaattatga tgctcttccg gctccatgga gttagccata 1380 gctgaaattc tatggcgaag attgtagtga ttgtaaatga gatatatgac aatgctatac 1440 aatggtcttt acagctttaa gagaaatagt aaaaagtgct atatatgaaa gattgtcaaa 1500 tgaagaatta ttagctcggt gcacccaagg acttactcaa aatgccaacg aggctataca 1560 ttcagtgctg tggagaaagt gctccaaaaa tattcttcat caaaaactag aatagaaatt 1620 gctgcagctc atgcagtttc tgagtacaac ctgggtcgtc aaagaagtgc tgagatactt 1680 gcatcaatta caaacagtaa gatgacaagt cacgcagaaa aaattacgac tcagaaggat 1740 aaaaaaagga aactctcaag tgacaagaaa catgaaccag agtctaaagc tgcaagaatt 1800 tcaaaaaaat atggcagtaa acgacaagag tcattaaaag tacaatcaga aggtgttact 1860 tatggtgccg gatgtttcta agtgagttct aaaattttat ataaatttta tgataatagt 1920 ataaatatta atcaagagaa ttaataattt tatatagttt tacaggatta caggatttga 1980 gattattatt atctgatgac tgagatcaga ctgatgctta gtgaggttat gtaaatataa 2040 aataagaaga tattaatgtt aacattaaaa accaaagtac ggatgattat gaaatcattt 2100 tgtaaagatt tatgtacagt gaaatagatt atttataaaa tattatgaat aacagtactt 2160 ttttacaaaa acctcatttt ctcagcttta agttttgaaa tattcaaaat gcatcctgcg 2220 caatgcattt ccatttcatg attccagcaa actacggtga attctttgtt attttaacag 2280 caaatcatag tgattacatt tttattatag cccatatgat attaatggca aatccaggta 2340 catcaaaatt tattgtaatt taactcactt tattattaat tttatgcatt caaatgacat 2400 atttatttat aaaaataaaa aaatttattt gaaaatctaa tcactataat tctttcacat 2460 cggcttaaag attacactca aattttctgg aatcatgaga tgcaaatggg ttgcgcagga 2520 taccttagca atatagctaa attataataa aaaattctgc ttcatctgca ttatccaact 2580 aaatattaga taaattgggc tgaaaattta cataaaatca tattttttta tatacataat 2640 gaggtcaaaa tcttagctga ctagatctaa agggggaaaa aaagtgaaaa caccagaacc 2700 ccacc 2705 // ID BEL-143_AA-LTR repbase; DNA; INV; 209 BP. XX AC supercont1.294; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-143_AA_; KW BEL-143_AA-I; BEL-143_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.294; Positions 1027738 1027946. XX SQ Sequence 209 BP; 69 A; 39 C; 50 G; 51 T; 0 other; tgttattcga cacgaattgc gcacttttca tcactacctt gatacgccgc tgctatcggt 60 gagttgtact aactgatctc acaatcgacc agccggtgag tatgtagatg gtaatgagaa 120 taaacaatcc caaagcagac cagaattaag agaggaagaa gggataaatg agagagtaga 180 agtctgtgaa cgatactttg gttaagcca 209 // ID Gypsy-32_DWil-LTR repbase; DNA; INV; 368 BP. XX AC scaffold_181155; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_DWil_; KW Gypsy-32_DWil-I; Gypsy-32_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181155; Positions 1213584 1213951. XX SQ Sequence 368 BP; 113 A; 77 C; 88 G; 90 T; 0 other; tgtgagatgg gataaggtag ttacagcgtg gtgccaactc ccttatgaag attggcaacg 60 cagcagcagt cgtgaggtat cgtgagcagt caatgttcag cagtcaacag tcaatgttca 120 gcagtcagcg gtcgaggttc agcagtcaac agccaacgtt cagcagtcaa tagtcggtgt 180 tcagcagtca gaggtatcag cagtcagaag tatcagcagt cagcagtaag gagccgataa 240 atcagcagta cacagaaccc gagatttaag ttgcatattg taatcattat tcgcccgtcc 300 accatttata atgctaataa actataatat aatacatttg gggactcaat tgaggactcc 360 atcttaca 368 // ID Gypsy1-LTR_DV repbase; DNA; INV; 227 BP. XX AC scaffold_12963; XX DT 15-OCT-2009 (Rel. 14.12, Created) DT 15-OCT-2009 (Rel. 14.12, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_DV; KW Gypsy1-I_DV; Gypsy1-LTR_DV. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-227 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(12), 3095-3095 (2009). XX DR Genome; scaffold_12963; Positions 18180033 18179807. XX SQ Sequence 227 BP; 93 A; 34 C; 47 G; 53 T; 0 other; tgtaatagga taagagcatc attggtaacc ctagccaggg tattatagtg tcactatcga 60 tgttaattat aacagtacga taagtaaata tcgataaaga aaagcggctg cagagaagaa 120 gtggaaggca gtctctctac agtactagca gtaagaagac gtgttgttaa aaataaaaag 180 gcaacccgaa cataagttaa atcaacgaag aaaatcatac tattaca 227 // ID hAT-14_SM repbase; DNA; INV; 2695 BP. XX AC . XX DT 23-JAN-2008 (Rel. 13.01, Created) DT 23-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-14_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2695 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(1), 15-15 (2008). XX DR [1] (Consensus) XX CC It is present in >1000 copies in the genome. The youngest copies CC are 99% identical to consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 507..2447 FT /product="hAT-14_SM_1p" FT /translation="MDRWLKEGTLKKKNSSISAPTAEKTAEDISEEQQPTT FT STKSDHSQLKRPRNDSILCIKKRKYDISYLSYGFTSIGNEDTPKAQCVICN FT KILSNASLAPTKLKRHLATNHPDHTNKPLDFFKRKLEELQNTKLSMVKIVK FT TNNQKANEASYKVSYRIALAGKPHTIAETLIMPCVKDIVLCMFDDEKSVEK FT IDMVPLSNNTVARRIQDLADDIEKELISRLHICDAYSLQLDESTDVAGLAV FT LLVFVRYNFGKTIEEDLLLCNFLETNTTGEEIFNSINNFMQKHDINWQKCI FT DVCSDGAKSMVGKTAGVISRIKNIVPNCSSSHCVLHKHALVCKKISIDMKT FT VLDEAVQIVNFIKARPLQSRLFKIICEDMGSQHTALLLHSEVRWLSRGKVL FT SRVFELRRELSMFLSEQKHERFAVLTNSCWLMRLAYITDIFSKLNEVSLSL FT QGKSITVFTVKYKISALSRKIEFWINCVENNNVSCFSTLSDFIIENDCLLN FT EDIRIDIVKHLRELYDNLQKYFPASNNDLMWVENPFIIKEKPDCLSISEYE FT NLIEINSDEQLKYNFNTVHLTTFWGNLISEHKVLAERAIRVLLPFSTTYLC FT ETGFSYYTSTKTKYRNRLNCSADMRIQLSAITPNISRICQEKKQQHCPH" XX SQ Sequence 2695 BP; 960 A; 419 C; 431 G; 885 T; 0 other; cagcgtttcc caaagtgggt tccgcggaac cctagggttc cgcgaactga tagcaggggt 60 tccgcgaact ttttgtaaat atatttccaa aaaatcttta taaataattt tcgattccat 120 acgaatccaa tttaaaattc tttttcgatt ccatttgata ttgcctcaaa aaacatatat 180 ccattctatt ttatatgtcc caattccatt tcattcagca tcatcagtac aaagtattgt 240 tataaaacct agatctggca acatttacat tgtgcaaaca atatttttat tcccgtttta 300 gatttttaag tcaataaaaa aaagggaaac cacagttttt aattatattg ttttaaaata 360 cgcattaaaa ttacgaattt aaaatttcat ttaataatat ttgataaaaa ggtaagtaat 420 tttttcaaat tttgaaaata atcgtatgag tattttttga ttatcgataa attttgtgaa 480 tatagttatc ccaaattttt gtgattatgg atcgttggtt aaaagaaggg actttgaaaa 540 agaaaaacag ttcaataagt gcacctactg ctgaaaagac tgccgaagat atttctgaag 600 aacagcagcc taccacttct acaaaaagtg atcattctca attaaaaaga ccaagaaatg 660 attcaatctt gtgtataaaa aaacgaaaat atgacatcag ttatttgtcg tatggattta 720 caagtatagg aaatgaagac acacccaaag cgcagtgtgt catttgcaat aaaatattgt 780 cgaacgcttc tcttgcacca actaaactta aaagacatct ggcaacgaat catcccgacc 840 atacaaacaa accattagat ttttttaaac gaaaacttga ggaattgcaa aataccaagt 900 tatccatggt gaaaattgta aaaacaaaca accaaaaagc aaatgaagct tcgtacaaag 960 taagctaccg aatagcgctc gctggtaaac cacacaccat agcagaaaca cttattatgc 1020 cctgtgtgaa agatattgta ttgtgcatgt ttgatgatga aaaatctgtt gaaaagattg 1080 atatggttcc attatcaaac aatacagttg cccgccgtat tcaagatctt gcagatgata 1140 ttgagaaaga gcttatttct cgcctgcata tatgtgacgc ttattcacta caactggatg 1200 aatcaactga cgtagcaggg ctcgccgttt tgcttgtttt tgtacgatat aattttggta 1260 agacaataga agaagattta cttttatgca atttcttaga aacaaacact actggcgaag 1320 aaattttcaa cagtattaac aattttatgc aaaaacatga tattaattgg caaaaatgta 1380 ttgatgtttg tagtgatgga gcaaaatcaa tggttggaaa aactgctggc gttatatcaa 1440 gaattaaaaa tattgtacca aattgtagca gcagtcattg tgtccttcat aagcacgctc 1500 ttgtatgtaa gaaaatttct attgatatga aaacagtttt ggatgaagct gttcaaattg 1560 taaattttat taaagcgcga ccactgcagt cgagactttt taaaattatt tgtgaagaca 1620 tgggaagtca acacaccgct cttcttttac attcagaagt aaggtggttg tcgcgaggca 1680 aagtattatc ccgagttttt gaattacgcc gagagctttc tatgttttta tcagaacaaa 1740 aacatgaacg gtttgctgta ttgacaaact cttgctggct tatgcgactt gcatatatta 1800 ctgatatttt ttcaaagcta aatgaagtga gcctttctct acaaggaaaa agtataacag 1860 tatttactgt taaatataaa atttcggcat tgtcacgaaa aatagagttt tggataaatt 1920 gcgttgaaaa caacaatgtg agttgttttt caacactttc tgattttata attgaaaatg 1980 attgtcttct taatgaagat atacgtattg atattgtaaa acatttacgt gaattatatg 2040 ataaccttca aaaatatttt cctgcttcca ataatgatct catgtgggtt gaaaatcctt 2100 ttattattaa agaaaaacct gattgtttat caatctcgga atatgagaac ttgattgaaa 2160 taaactccga cgaacaatta aaatacaatt tcaacactgt ccatctgaca acattttggg 2220 gaaatctaat ttcggaacat aaagttttgg ccgaacgtgc aatacgggtg cttttaccat 2280 tttctactac atatttgtgc gaaactggat tttcttatta tacttcgacg aaaacgaaat 2340 atcgcaatag attaaattgt tcggcagaca tgcgcataca actatcagca ataacaccaa 2400 atataagcag gatttgccag gaaaagaagc aacaacactg tccacactaa taacttactc 2460 attgataagt ttattttatt tttgatatgt gattgatcaa caatcaaatg aatttattaa 2520 tttaattttt tgtattattg tttttttgtt ttaataaata ctataatttt ctataataat 2580 atatagctca tattaatatt taaaattaaa attcgaatat attagtatat aagggttccg 2640 tcaaaaattt gaaatcaaaa agggtgccgc ggtgaaaaaa ctttgggaaa cgctg 2695 // ID MSAT-5_AAe repbase; DNA; INV; 276 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Minisatellite-type sequence: consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-276 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1451-1451 (2011). XX DR [1] (Consensus) XX CC 22-bp unit. XX SQ Sequence 276 BP; 72 A; 60 C; 84 G; 60 T; 0 other; caggatttcg gaggaaatcc tcgcaggatt tcggaggaaa tcctcgcagg atttcggagg 60 aaatcctcgc aggatttcgg aggaaatcct cgcaggattt cggaggaaat cctcgcagga 120 tttcggagga aatcctcgca ggatttcgga ggaaatcctc gcaggatttc ggaggaaatc 180 ctcgcaggat ttcggaggaa atcctcgcag gatttcggag gaaatcctcg caggatttcg 240 gaggaaatcc tcgcaggatt tcggaggaaa tcctcg 276 // ID Gypsy-22_AA-LTR repbase; DNA; INV; 250 BP. XX AC AAGE02020536; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_AA_; KW Gypsy-22_AA-I; Gypsy-22_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-250 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020536; Positions 5539 5788. XX SQ Sequence 250 BP; 69 A; 43 C; 71 G; 67 T; 0 other; tgtagagtcc gaagataaat ataaaaacct tgtttctaaa gacaccttga ttaactgcac 60 gaacaacgat gatgaccgga ttgagaacgg tgttcggttg ccggagaggt ggaggggaga 120 aagtgtgtgg cgattgatgt gaacggagtg tttcttcgtg gagtgtgttt ccgacgtgtt 180 gtgaagtgtt tctgccattg agatatcgtg aggttaatta cctcgaaaac cccctggaca 240 aaaccctaca 250 // ID BEL-17_DWil-LTR repbase; DNA; INV; 288 BP. XX AC scaffold_181134; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-17_DWil_; KW BEL-17_DWil-I; BEL-17_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-288 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181134; Positions 1049930 1050217. XX SQ Sequence 288 BP; 97 A; 40 C; 54 G; 97 T; 0 other; tgtttctgaa cgaaatgcat tattattgtt agcatataag aaatttgact tgtattccat 60 attgttagtg cagtaagata attatcgaat tgcagtgtca tattaccatg aaattgttat 120 cgagttcgct ggtatcgata tgtaatcgat atcgggtaga aagcttagtt tagaaaaacc 180 attgaattga ccggttgtaa ctttatacaa attgcaaata tattgaacaa taaatcaaat 240 aggcggtatc gccttaaaca gtctaactag gcagttggtt tccgaaca 288 // ID Gypsy-125_AA-LTR repbase; DNA; INV; 505 BP. XX AC AAGE02022342; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-125_AA_; KW Gypsy-125_AA-I; Gypsy-125_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-505 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022342; Positions 46542 46038. XX SQ Sequence 505 BP; 173 A; 79 C; 120 G; 133 T; 0 other; tgtagtaaat aactcagcag tctagggaga aataaacaaa ctattacata cttacagtta 60 actgctgctg tcgcactagc tgagaaacat tctgacccgg tgaaagagta aataatgaaa 120 atcgtccact gaactatatt cttgtttgac agctggtggc gatgaaacat ttcgagcgaa 180 agcttgggcg aaatcgaaaa ggaaatcttt cattgataca ccgtaaaaag aaataagtgg 240 tttctagggt tgagctttgg aaatgttcga aaatagaacg gacgagtctg aacgcattcg 300 aaaaacgcta attaagccgg aaaggacgcg aagctttcgc aagttggatt tcgagaggat 360 cgagatcagc attcagtgat cgaaatttgt cgaggacagt cgcgtttatt tttggagttc 420 gattgaaaaa aagttgtgac cacgtttctc gaaaagtgaa gttttgaata ttggaaaatg 480 ttgaactatc aatgggaata cgaca 505 // ID Polinton1_SM repbase; DNA; INV; 12786 BP. XX AC AAWT01021134; XX DT 14-DEC-2007 (Rel. 12.12, Created) DT 14-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE Polinton-type family. XX KW Polinton; DNA transposon; Transposable Element; Polinton1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-12786 RA Jurka J.; RT "Polinton1_SM: Polinton-type element from the planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(12), 1213-1213 (2007). XX DR EMBL/GenBank/DDBJ; AAWT01021134; Positions 18657 31442. XX FH Key Location/Qualifiers FT CDS 655..1191 FT /product="Polinton1_SM_1p" FT /translation="MIKEILIILLVFNMVNSIVVTVQGKNSTRCTCPNTAV FT LMDILYMIDRPRYLKTPVKRLSERMSNLEFRFEHTYETLKRAKNELYDDII FT YLNNTIGIIKNELRKSILKLNENRKLNEKMMNQVQYSIKFYSNKTTDQFNR FT IMKLQQPVNSGGNKLKEVNSVNKSSLPFPLYIFTYFLSIA" FT CDS 3475..4050 FT /product="Polinton1_SM_2p" FT /translation="MELRNDAVYQIVGPSRSGKTMFLCKLLKSNLFQTKFN FT KIYWHRGADEEHGLTQDNFCKLKNMKIVKGFDKNWSSRLRKGDVIIIDDLY FT HKANNEKDFNNLFTKISRHVGVTVIFITQNLFHQGGAHRTRNLNVQYLVIF FT KNPRDATVIDFLARQAYPNNRNFLISAFQDATKSPHGYIYFLISHNNAMMI FT YG" FT CDS 4633..5775 FT /product="Polinton1_SM_3p" FT /translation="MSNLNLNPLVVMEKIWTNPEDETGFGGVAKLKKRVPK FT SKKETQKWLSDQLAYSLNKPMRKRFPIRAYKTFGINDLWQMDLMEMIPYSK FT INKGYKYILTCIDVFSRFARGVPTKTKSAEEISKAINNLFKNGHPDNLQTD FT LGIIIIIIIIIIIIIFIIIIIIIIIIIIIIIIILGKEFYNSKVKQILDRLK FT INHYSVHSKYKAAHVERFNRTLRDRLKKYFVHQGNKIWINVLPKILVSYNN FT SSHRGLNGLRPVDISSKTRVQSKKVTITKPKYKVGDHVRISKISASPFIKN FT FNSNWSDEVFQITKINTKQSPEMYVIKDSDNNVIQGKFYEQELQVISKPTV FT FKIQKILKTKMVGKNKQYYVKWHGYQKPSWISSKDLVK" FT CDS 5878..6714 FT /product="Polinton1_SM_4p" FT /translation="MEIGKLLCKTLHLCTINNFERYCHIPNHILDENSGMF FT FKDGLNVSCDFSPFAIKFESIEDAKRMGSDKLNIVVPGTEYLIRFKTFDEK FT SVTIENIKISVSYINEVEDTFTFTKDLYIETLEELHKYISFNCLQIFKTIE FT FLKDEICKFSLNNNICEVTFDKNLANTLELDKQTYRASAFGVRKPKIIKLH FT RQMFIYTNIIDPIIVGDSLVPLLKSVWVDKHDQDEVVNINIDNTMYLPVST FT SCINNIEINIRDDSGKFIEFSIDTKTHLTLHFHKLDGQ" FT CDS join(8898..9689,9706..10215) FT /product="Polinton1_SM_6p" FT /translation="MGSVKIQFTILIQFEKDGEIKDNFFSNKAELFSDNFL FT LDGITKLNNKIERFTYLGSGWRVVKIKEIDFILTKYFPINRLAGHSYIPTP FT RSLSGKKAIVNVQNNNHHCFIYSILGLLKRSVVRTHRNRITSYENYLDELN FT FDPIDDCPMRLCNIPKFERKNKDLKLAINVFTYNEKPSKTLDDINYIPHHP FT HLDIIHRSKVTDGTPIYLLLLEDGDNYHYTAVFDLDKLMNCHATTSYNVQI FT SVHWCPHCLNGFRLKPAFDTHVAKFSNYAKSIIPPFVIYADFESILPLDIR FT HHQIHIPIAAGLLLINNYTNTKQYYNFIGDDCVLAFLKKIDEIALSIVLPY FT YEVHCQKTICISFNEKREFKDCHTCYLCKTFIRDKVKDHCHFSGKYLGAAC FT RKCNVSRRIQKQLPVVFHNLRVYDLHHILKYGLNNFSSWE" FT CDS 11356..11976 FT /product="Polinton1_SM_7p" FT /translation="MPQNRLSEIDRPSYIGKSVLDLSKLRMYELQYKELAK FT YRNEFNCEINIVAGDTDSFFLEIKNCRLDTLLPAMIRDSLLDTSNYDPAHP FT LYSRNLECVIGKFKDESKGLWYEEWVFLRPKCYSLLGTKDTMRAKGVKLLG FT TEIEYQSYLDCFNNDTIFTVPQTRIGTRNHQLFTFKNNKIALTNKDDKRNL FT VGRNSSLAFGHYLRIDA" XX SQ Sequence 12786 BP; 4692 A; 1742 C; 1976 G; 4376 T; 0 other; tttttattat tattattcta ctacttactt ttataagtag tatatatttt gtaaatattt 60 tattattatt aatattatta ttattattat tattattatt attgttatta ttatttttta 120 tacatattat ttattattat taatattatt actatatagt aaaaaccaag acatatacga 180 caaattttat tataaagaga taattctact atagtaatat tgtcgaagag ccttattcaa 240 agcccattcc acatactttc gatatcgggc agcgcagcaa tgacatccat caccacaggg 300 atattcaggt tccatatctt ttgacaaatt gttttgttcg atatcccata ggaactcgtc 360 gattacagta aacatccata ttgatgtatg attgaatatc aagtctggaa ttattgatgc 420 tggcttctcc atgatttcat gtattttggg ataatagaat tccagacgtt ctttaataac 480 tttaaaataa tcatctaaaa aacaagtaat aataataata gtaataataa attaccataa 540 tgtttaattt cattcatagc ttgataaaaa ttgaaataac aatttagacc aatgagaaca 600 cacacacaaa tctttagcta taaacaataa atattaattt atattgtttt gaaaatgatt 660 aaagaaattt taataatatt attagttttt aatatggtca actcgattgt tgtaaccgtt 720 caaggtaaaa attcaacaag atgtacatgt cctaatacag cagtattgat ggatattttg 780 tatatgatag ataggcctag atatttgaaa actccagtca aaagattatc tgaaagaatg 840 agtaatttag aattccgatt tgaacataca tatgagactt taaaaagagc taaaaacgaa 900 ttatatgatg atattatata tctaaataat accattggta ttatcaaaaa tgaattaaga 960 aaatcaatat taaaattgaa tgaaaatcgt aaacttaatg aaaaaatgat gaatcaagtt 1020 caatatagta taaaatttta ttctaataaa acaactgatc agttcaatcg tataatgaaa 1080 ttacaacaac cagttaacag tggtggcaac aaattgaaag aagttaacag tgtcaacaaa 1140 tcatcattgc catttccatt atatattttt acatattttc ttagtattgc atgattctga 1200 ttgttcgaat atagtataaa taagcttaca cttccaaata tttttataaa atgccagaaa 1260 caaaattttt tataactttg ttgaatgagc ttgtttctaa taatagagaa gaagatgata 1320 tcattaaaaa cataaaattc atggtaaacc gtaccgagaa aatcgttgaa gacatcccat 1380 ttgattatga aaaaattctc aatttgccaa cattggagag aaagaaggca ttccaatcaa 1440 tatttagtta catgttcgat gaaactaata aatggaggcg aataatagta ggattaatgt 1500 atatgaaaca catgatatta aaatctggat ccgactatga aaattatttc caatggtcta 1560 tagaggtaaa ttactttata ttgttattaa tttatcatta tttaggtgat agaaacaaaa 1620 attgatccgt gggtagaaac aaacggtgga tggtcagcga ttattcaagt tgccccagaa 1680 gccaaacgct ctgatattta tatcgaaaat tgcaaatttt tattaaaact attaactgga 1740 gcatcggttt tatatttatg taataaagct tatcatagat tttaataaat atttctttgt 1800 tattattatt attaataaaa aaaaactaca tttttaaaga tttattgatt attacatgta 1860 cattcttcat caccacaata ttctatcata atattttcta taacatggtc aatacaattc 1920 ttaaattcgg tagggtgatc tttaataatt tcggtagtat cttcagtatc ttcagtatca 1980 ttaatatttt cttcatcatt aataatttca gtattaattt ctccatcatt aacaatttca 2040 gtatcgtcac tctgacattc agaatcacaa catgaatctt ttttactcaa tgcttcaact 2100 agagtcaaca tatctctaat ttttacatcg gttatttctt cgagtttatc aaacaaaaag 2160 tttattaaat attctaaacg aaattaatta attattttat attttatatt ttaccttttt 2220 cagacatcat ttttaaaact attttaaaac taattataat tttttttaaa gaataacaaa 2280 gaaaacatta atacatttat taaacactaa acactataca ttaaaataat tgtgaaagtc 2340 tagcagtaac aattttccag attggagaaa gcacatcatt ttgatcagct ttatatcctg 2400 ggattgctat tgttaaatac cagttttcct gttctttgaa acctatttct gtaaatgcat 2460 cataatatct ggtatattga ctggtgaaga aagcattttc attgcatttt aaattggaat 2520 ttgttatatt ccactcaaca tatcgctggt accggaaatc gcagcaactg caacccagaa 2580 tacagaggta ttccgtttcc aattccttta ccatgcggtt gattttgtca tcccaaagag 2640 cttgcttaat cagctcggaa atcataattc tggtttcttc atatattgca gcttcgatca 2700 ttgctagctc cttgatcttc aaagtttcct tcatttgagg ataataataa ataattctgt 2760 caatcaccat aggggcatat gtgtctaaat aaataataat aaaagaaaat aaataaataa 2820 taaatttacc atattgactg gtatttttct tctgcatatt tttcttcggt gtagtttgca 2880 ttttaaaaat gaaatgaaat gaataaacaa gttgttttaa tcaatcatta gccaatccga 2940 tatcgaccaa tcccgtgatt tcattttgta tgtcacaaac aatatttttg tactctagat 3000 ttcgcaaccc taatatgact attttgccac tggcaaacac gttaacacac attaggttgt 3060 attttaaaag cctcagtgca ggaaataatt ctgggtcgaa catacaaaga cattttgttg 3120 acatgttgta tagattgata gattgtccca tatccatggt aaccgtaatt gactgtattt 3180 taatttactt tattctatat tgtaatttat tggcatctaa aagtttcttg caccccatga 3240 ttctgcattt gccggatttg aaaaatatta tagggtattt tccactccta tctacaattt 3300 gctgtggttt accatttgga aacgtcattt ttgatacatc gaaagtttct ttaaaattta 3360 tatttgataa cataattaaa caataatgaa atattaacct atatatatga ttttttgaaa 3420 gagttcatta aatcatataa tataaacaaa aacctataat tatactaatt aataatggaa 3480 ttaagaaatg atgctgtata tcaaatcgtc ggcccaagtc gtagtggtaa aactatgttt 3540 ctgtgtaaat tactgaaatc taatttattt caaacaaaat tcaacaagat ttattggcat 3600 aggggtgcag atgaagaaca tggattaact caagataatt tctgtaaatt gaaaaacatg 3660 aaaatagtaa agggttttga taaaaactgg tcaagtcgat taaggaaagg tgatgtcatc 3720 attattgatg atttgtatca taaagctaat aatgaaaagg attttaataa tttattcaca 3780 aaaatcagta gacatgttgg tgttactgtg atctttatta ctcaaaatct atttcaccag 3840 ggtggtgcac atcgaacaag gaatttaaat gttcaatatt tagttatttt caagaatcct 3900 agagatgcaa cagttattga ttttcttgct cgacaagcat accctaataa tcgtaatttt 3960 ctcatcagtg catttcaaga tgctacaaaa tcaccacatg gatatatata tttcttgatt 4020 tctcacaaca atgcaatgat gatttacggg taaaaacaga tattttcaat aaagaaggtg 4080 ccatggtata taaacaaagt tgaaaacatt aacatgtaaa aatgtttatg aaatccagac 4140 ttaaaaatat ccgaaaactt agacaaagta ttaaaaagaa aaacaaagtt ccaccaaaaa 4200 ccagttatac accatctaaa aaactaataa ccgagaatta ccctgcaatg aatctaattt 4260 cgaaaaacat tattaaaaag agtaagaaaa aagttaaaat tgtaaaaaat cataatagtt 4320 tctcatattt taatgcattg ttaaaggctt cgagtatgaa aagaatgtct attttacaat 4380 catttccaac ttttgttgtc gatgatctac tcaaaattct agtaaaagta gtcagaggta 4440 aaattaaaat tagtaaatct aaaaaactag tattgaataa acatcgtaag cctttgttat 4500 cgctagtaaa taataaaaat cgtaagcaaa tgagaaaaat catatataaa caacaaggcg 4560 gttttatcgg agcaatgtta ccattagcat tatcattatt atcacaataa acgatggggc 4620 aatcataacc taatgtcaaa tttaaattta aatcctcttg ttgtaatgga gaagatttgg 4680 acgaacccag aagacgaaac aggtttcggt ggagtggcaa agttaaagaa aagagtccca 4740 aagtcgaaaa aggaaacgca aaagtggttg tcagatcagc ttgcgtacag tttaaacaaa 4800 ccgatgcgga aaagatttcc aattagagct tataagacat tcggtattaa tgatttatgg 4860 caaatggatt tgatggagat gataccgtat tcgaagatta acaagggtta taaatatatt 4920 ttaacatgta tcgatgtttt cagtcgtttt gcccgcggtg taccaacaaa gacaaaatct 4980 gcagaggaaa tatctaaagc aattaataat ttatttaaaa atgggcatcc ggacaatttg 5040 caaaccgatt taggtattat tattattatt attattatta ttattattat tatttttatt 5100 attattatta ttattattat tattattatt attattatta ttattatttt aggcaaggag 5160 ttttacaata gtaaagttaa gcaaatcctg gatcgtctca aaataaatca ttattctgta 5220 cattccaaat acaaagccgc acatgttgaa agatttaacc ggacattgag ggatagattg 5280 aagaaatatt tcgtgcatca aggtaataaa atatggataa atgtattacc gaaaatactt 5340 gttagttata ataattcatc acatcgaggt ctgaatggtt tgagacccgt tgatatcagc 5400 tcaaaaacac gagtacagtc taaaaaagtt actattacaa aaccaaaata taaagttggt 5460 gatcacgtta gaattagtaa aatatctgct tcacctttca tcaaaaattt caatagtaat 5520 tggagtgatg aagtatttca aattactaaa attaatacta aacagagtcc tgaaatgtat 5580 gttattaaag atagtgataa caatgtaata caagggaaat tctatgaaca ggaattacaa 5640 gttatttcca aacctacagt gttcaaaatt cagaaaatat tgaaaactaa aatggtgggt 5700 aagaataaac aatactatgt caaatggcat ggatatcaaa aaccctcgtg gatatcatca 5760 aaagacctag taaaatgagt ttctatacaa ttttaccgag caatagttgc ccactaatcc 5820 atccggaaaa tcaagcgagt aaatttatgg ttgatcttca aaatcctatt tatttacatg 5880 gaaattggga agttgctttg caagacttta catttgtgta caatcaataa ttttgaacga 5940 tattgtcata ttccgaatca tattttagat gaaaatagtg gcatgttttt caaagatgga 6000 ttgaatgtta gttgtgattt tagcccattt gcaataaagt ttgaaagtat tgaggatgcc 6060 aagcgaatgg gtagtgataa gcttaacata gtcgttcctg gtactgaata cctaatacgt 6120 ttcaaaacat ttgatgagaa aagtgtaacg atcgaaaaca tcaaaatatc tgtatcatat 6180 attaatgagg tggaagatac atttacattt actaaagatt tatatattga aacacttgaa 6240 gaattacata aatatatttc attcaattgt ttacaaatat ttaaaactat tgaatttctt 6300 aaagatgaaa tatgcaaatt cagtctgaat aataacattt gtgaagttac atttgataaa 6360 aatttggcta atacattgga attagataag caaacatata gggcaagtgc ttttggtgta 6420 cgcaaaccaa aaataataaa acttcatagg caaatgttca tatatactaa catcatcgat 6480 cctataattg ttggtgattc tttggttcct cttttgaaat ctgtgtgggt tgataaacat 6540 gaccaggatg aagttgtcaa tataaatatc gataatacca tgtacttacc ggtatcaaca 6600 tcatgtataa acaatataga aataaatatt cgtgatgata gtggtaaatt tattgaattt 6660 agtatagata caaaaactca tttgacatta cattttcata aattagatgg ccaataacga 6720 tattattagt atgtatagtg gtagccaaag tggtggcgaa ttaccatatt ttattggtaa 6780 acagtatgga tcaggatggt tgagaactat tggtagattt gcacttccta ttcttaaacg 6840 cattggtagt tttggtatga aaaccgctaa agatgtgata atgaatgagc agaaaatatt 6900 accatcgctg aaatcaaatg cattatcaga gttagggaaa gttctaccgg gtatgtttca 6960 aaaacaggaa tcagcaccgg caccggctca aaaacgtaaa agacatcata aacatataaa 7020 caaacgaatg aaaggacatg gaaccatatt tcaaaaatga tttgtggtaa atctgaattg 7080 tgcttatttg atagaccatc tccccaagcg gtaattgaaa tgggggcctt cgaagacgta 7140 ttcccgatga attctattat ggatagtaga actgatattg aatttcatat taatggttcc 7200 caaactgaat atctagattt gaacgataca ttgttaactg ttcagatcaa agttgtagat 7260 aaagatagaa aaccgttggg tgaacctagt gatgtcatac caaacaattt cttgtttcat 7320 acacttttta aagatgcagt actcggattt aataatatca agattgaggg tggtaacagt 7380 acatatattc acaacgctct aatagaaact attataaatt acaatagcga tactaaaaat 7440 acatgtttaa ttcctattgg ttatggtagt gatgacgata gaaaaaagtg gattaaagga 7500 tcgaaattat ttaccatggg ctcatcatta caactcgatt ttatggatca accgaaatat 7560 ttactccaag gtgttaatgt acacattaaa ttgaaaagat ctgattcggc attatcatta 7620 acaagtgcta gcacagcacc tattcttcaa ttagttgatg caaaattatt ggtgagaaga 7680 gttcgtgttg agccctctgt attagccggt catcaactgg ggcttaattc taagcatgca 7740 atatatccat tgaaaacaaa ggaaattgtt caatttgcta ttgctaaagg ttcagcttct 7800 ttctataagg aacaaatatt tggagataga agaatgccca actttatatt ggtaacattt 7860 caaagtgaat ctcaatataa tggatcatac ttaacatcga gctcaatttt caaaattttt 7920 ggagtaaaat cgctaacttt atcaaaaaat agtgattata gagaaacata tactcaagat 7980 ttcgataatg ataattattg tgctacatat atgcagagta tagtgagaaa tatgggttat 8040 ttggataaaa accttaattg tggtataact cttgatgatt ttaaaaacaa atatctattc 8100 tttacatttg ttttggcacc cgattttgat ttaaatcaga gtcaattacc tcaaaatgga 8160 aatttacgat tagatataaa gtttgctaaa gctacaactg aaccggtaca tgttgtaata 8220 tatggagtat ttgaaaatga agttcaaata actgcaaaca gaacagtact agtttgaaca 8280 gagttcaaac attgttggtt tgaacgaaca ttctttgttt gaaaatgttg gtataaatgt 8340 atagataatt tatgataatt tgtcatggag taaggttttt aaatgtatta ttaatataaa 8400 tgtattatta atataagggg cactctaccc acttaaatca atttacattg aataaaaagg 8460 cacttttacg taaaatcaat aaacgtaaaa ataataataa caataataat aataataata 8520 ataataatac gtcgagacat aataataata ataataataa taataataat aataataata 8580 ataatacgtc gagacataac aacaacaata atacgtcgag acataataaa agacctgctg 8640 catatattca gtgtgttccc tctaaactta aaagggttga aatcatcatt agcgatgatg 8700 aagaaatcat cattagcgat gatgaagata ataataatca aataggaggt tcgattaata 8760 ttgataatga acctgaaata aggttagatt taaatgggag tcataaaagt attagatttg 8820 atgtagaaca tgaaaatcta atacaaatct aaataaaaaa taaaatagtt aatagattaa 8880 aatatgaaat aaaccaaatg gggtcggtta aaatacaatt caccattctt atccaattcg 8940 aaaaagatgg tgaaattaaa gataactttt ttagtaataa ggcagaactt ttctctgata 9000 attttctgct tgatggtatt actaaattaa ataataaaat agaacgattc acatatttag 9060 ggagcggatg gcgggttgtt aaaatcaaag aaatagattt tatattaaca aagtattttc 9120 ccataaatag actagctggc catagttata tcccaacacc gagatcttta tctggcaaaa 9180 aagctatagt taatgttcaa aacaataacc atcattgttt tatatatagt atattaggcc 9240 ttttaaaacg tagtgtagtt cgcacacata gaaacagaat tacatcttat gagaattatt 9300 tggacgaatt gaattttgac cccattgacg attgtcctat gagactctgt aatattccca 9360 agttcgaaag aaagaacaag gacttaaagc tggctataaa tgtttttact tataacgaaa 9420 aaccgtctaa aactctagat gatataaatt atatccccca tcatccgcat ttagatataa 9480 ttcaccggtc caaggttact gacggtacac ctatttactt acttcttttg gaggacggtg 9540 ataattatca ctatactgct gtttttgatt tggataaatt aatgaactgc catgcgacta 9600 catcttacaa tgtccaaatc tctgtacatt ggtgtcctca ttgtcttaat ggctttagat 9660 taaaacccgc atttgacaca catgttgcct aaaaataaat attaaaaatt ttcaaattat 9720 gcaaaatcta ttatcccgcc attcgtaatt tatgcggatt ttgagtcaat tttaccacta 9780 gatataaggc atcatcaaat acatataccg atagcagcag gtctactatt aattaataac 9840 tatacaaata ctaaacaata ttataatttt attggcgatg attgcgtttt ggcgttttta 9900 aaaaagattg atgaaatcgc tctttcgatt gtcttgccat attatgaggt tcattgtcaa 9960 aaaactatat gtatatcttt caatgagaaa cgggagttca aggattgtca cacttgttat 10020 ttgtgtaaaa cgtttattag ggataaagtt aaggatcatt gtcatttctc aggtaagtat 10080 ttaggggctg cttgtaggaa atgtaatgtt tctcgacgaa tccagaaaca attacctgtg 10140 gtttttcaca atttgagggt atatgatctc catcatattt taaagtatgg tctaaataat 10200 ttttcttctt gggaataata ttattccgac cacttctaaa aaatttatct ccctcattgc 10260 gtacattaat aaattgcctg ttcgttttat aaacagcatg caatttgtta attcatcttt 10320 agcaaaagcg gttaagacac ttactgattt gcctttaact gattctgttt ttgatggtgc 10380 aattgttaga cccaaggcta ttttccctta tgactttgca aaatctcgtg aggttctcga 10440 atcgacgact gaattgcctc ctatttgggg ttctgtatct gctgatgaat acgctacaga 10500 ccaacaaatt tggacacaga aaaattgcga gacaatgtta gatgttaacc tacttcaaat 10560 tagacgtatt tcttcttgcg gattattttc agcaatttcg cgcaaaagca attgcttata 10620 atactctaga acctcttaat ttttatggaa ttcctggtat gtcatgagcg tcagctctca 10680 tgacattaca ggaacccata gaacttctgc aggacatgaa aatgtttaat ttctacgagg 10740 gaggaatacg gggtgggtta acatttgtta acaaacatta tgtcgcaagt tctgaggata 10800 cagaactctt gtatatagat attaataatt tgtatggttg ggcacttagc caatatcttc 10860 cgtatgctga ttttgtttgg gtgtatgatg gtttggatga ggttttgaat gaatgtgcga 10920 atgttgtgga tattgaaatc cttccatatg gttatactat ggaagttgat atcgagattc 10980 cagattattt acatgatttt ttgaataatt ttccgattgg gccggaaaag atgtgtccac 11040 cgaattctaa agttgagaag ttgatgctca cgcattggcc taagaagaac catgttcttc 11100 attagcgact tctcaagctt tatctttctc tcggtgtgaa ggttgttaaa gttcatcgca 11160 caattaaatt caagcaagca cctatatttc atgcatacat tgaaaaaaac agaaaattaa 11220 gagctcaaag tactagcgag ctatataggg acttgtttaa gctttataat aatagtttgt 11280 gtggaaagtc tgtcgaaaat cttaaaaaac ggatgaattt gaggctatgc aattctgatg 11340 aaaagatgat tgtatatgcc tcaaaaccga ctttcagaaa tagacaggcc tagttacatc 11400 ggaaaatcag ttctagacct gtctaaactt agaatgtacg aactacaata taaggaacta 11460 gcaaaatatc gcaatgaatt taattgcgaa ataaatatag ttgccggcga cactgattcg 11520 tttttccttg agattaagaa ctgtaggcta gacactcttt tgccggcaat gataagagac 11580 agccttttag atacctcaaa ttacgacccg gcacatccct tgtactcaag aaaccttgaa 11640 tgtgtaattg gaaaatttaa agatgagagt aagggtctat ggtatgagga atgggttttc 11700 ttaagaccca agtgctatag tcttttaggt acaaaagata caatgagggc taagggggtg 11760 aagttgctgg ggacagagat tgagtaccag tcatatttag actgttttaa taatgacact 11820 atatttacgg tgcctcaaac taggataggg acgcggaacc atcaattatt tacatttaaa 11880 aataataaaa ttgcgcttac caataaagac gataagcgta atttggtggg tagaaattcc 11940 agtctcgcat ttgggcacta tttaagaatc gatgcttaga gtatgatcaa gtatcgcaca 12000 gcccaagcta tcaaagcaca cacagcccaa gctatgacag ctcctaagag aacagctgtt 12060 tcagtcgaga cattgatgtt gtagttttgt cggattaatt tatttgaata taatgtattt 12120 tgtgatttat tcgttatttt agtaaataat cataatttta tatgcttagg gaaatgaaga 12180 tagtacaatt tgtgttttta tttagagttt tttattaata tagtattaat aattttattg 12240 gtatagtatt aataacaatt ttatttagag ttttttatta atatagtatt aataataatc 12300 ataattttat ttatatatat tttttcttaa tatagcatta ttaataataa ttttatttag 12360 agtttttatt aatatagtat taataataat cataatttta tttttatgct cgaggaaaaa 12420 tgaagatgga tcttgataga aaaaatttat gttttattga taataataat gtattgtttt 12480 atgtgtattt attgttaata ataatgtaat gttttatttg tttttattgt taataataat 12540 atattttatt aattttattg ttatagtatt gataataata atgtattgct ttatatgttt 12600 ttattgttaa taataataat tttatttaga gttttttatt agtatagtat caataataat 12660 aataataata ataataataa taataataat aataataata ataataataa tattaataat 12720 aataaaataa atttacaaaa tatatactac ttataaaagt aagtagtaga ataataataa 12780 taaaaa 12786 // ID Gypsy-623_AA-LTR repbase; DNA; INV; 588 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-623_AA_; KW Ty3_gypsy_Ele125; Gypsy-623_AA-I; Gypsy-623_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-588 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 588 BP; 188 A; 139 C; 109 G; 152 T; 0 other; tgtagcatac cctaatacac aaatgtcata tatgcaaagt caaactgtaa taaaatttac 60 tcaccaataa caaagcaacg accgtgcgca aaaacgcata gttatgcaat tgtaaacaac 120 acatctggaa gtcattaacc accaaggcgc accgttctat cgactctctc tctacctcta 180 cttgtgtggc gcgagggccg atagcccgca gcgcggagga ttcaaattac aaagccatgg 240 gcacttgaag caaataaccc aaaagaatcg ataatgcaag gataggcaat tttggctatt 300 tgtcactcca gtcgtctcct ttacatatct ataccactct ttcagacaca tgcattgtat 360 tgtatttgac agggtataaa aaccctagaa ctttcagaca ataaaggatt cttttgattt 420 agctcgaccc aattggatgt tatccaatct tggctatcga ccactacgtt gaagactcca 480 atgcgaccca cttggtgaat accaggatgg ctatcgcacc gtaattaaag taagtccgtc 540 gcgtttgcac cgtagtcgtg ttcgttaagt gaaaactgga aaaaacca 588 // ID SMAR25B repbase; DNA; INV; 2662 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR25B. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2662 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1890-1890 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 524..2305 FT /product="SMAR25B_1p" FT /translation="MSHNSSVIKTKRKAISLDTKIKILDQLATGQGATAVG FT KHFGIHEATVRTIKKNETAIRKSVCSGTKISAKSSSYVRDVVKEKMEKALV FT VWIEDKSQKRIPVDGITIKQTALRIYKRIKELEPGTSSQSNKKFEFSASTG FT WMTGFLKRHALHNVKIMGETASADELAAKEFPQKLRKIIEDGGYTPDQVWN FT ADESGLFWKKMPSRTYVAKSQKTAGGFKVAKDRVTLLFCSNASGERMLKPL FT LVNRALRPRSMKGVDFNKLPIHWMANKKAWVTTAIFTEWFQKYFVPEVRRY FT MNAKCLEFKVLLILDNAPGHPVLEHPNVQFCFLPPNTTSLIQPLDQGIIAT FT FKTYYIKRSFQYVLDKLDQNEEATVIDEWKQFSIMDCINQVGLALAELKPS FT TLNSCWKNVWPECVKSKDPVIHNNAEYTDIITLAHTIGGDGFDDLSFADIE FT ELLVDKSLSENEIIDLTLETRHDKEHSDNDEEDPPVLNATLIKEGLDLARK FT LGNHFEQHDPDEERAKKFQRELKSLMASYRELYNGLTRNQTQSLITDFVLK FT ISELSTQETSDNNTKIISQDDEQNNSSDSSDIAVLRKRIRVSENDNE" XX SQ Sequence 2662 BP; 933 A; 402 C; 449 G; 875 T; 3 other; cacctaattc tgtttttaca cgaattgttt ttacacgaat ttgatttaac acgaatataa 60 aaagattttt ttctcttttt acacgaattg ctttgtttta acacgatttt caatattcta 120 tgaatttttt ttgatgttct attaattttt tttattttca tttttcacaa attatgtata 180 catacatttt aacttcgcat tatattcttc agaattcagt tatattatgt cgattatatg 240 cgtatgtttt ctctttgggt gggattcccc gaaatataca taaagtaaaa acgtaatgta 300 ctttgtacgc aatttagttc ggttttgaga attgaaaaga aaagacgttt agttgttgcg 360 aatgtttaac aagttataag taaaataagg tgctcgtata aatttattaa ttctgctcgt 420 atagatttat taacccaata acaggtaagt tttaaataac tttcaaatct ttgtattatt 480 aattataatt atagattaaa tttattttgg tagccttaac ataatgtcac ataattcatc 540 tgtaataaaa acgaagagga aagcgatcag tttagatact aaaattaaaa ttttagatca 600 acttgcaacg ggacaaggcg caacggctgt aggaaagcat ttcggtattc acgaagctac 660 cgtaagaacg attaagaaaa atgaaactgc gattagaaaa tccgtatgtt ctggaacaaa 720 aataagtgct aaatcatcat cgtacgtaag agatgttgtc aaagagaaaa tggaaaaagc 780 tttggtagta tggattgaag ataaatcaca aaaaagaata ccagtagacg gaattactat 840 caagcaaaca gcattaagaa tctataaacg tattaaagaa cttgagccag gcacttcatc 900 tcagtcaaac aaaaaatttg aattttctgc aagtacaggt tggatgacag gttttcttaa 960 aagacacgct ctccacaatg taaaaattat gggagaaact gcatctgcag atgaattggc 1020 tgctaaagaa tttcctcaaa aacttagaaa aattattgaa gatggaggat acaccccaga 1080 tcaagtttgg aatgcagatg aaagcggcct tttttggaaa aaaatgccta gcagaactta 1140 tgttgcgaaa tcgcagaaaa ctgccggtgg ttttaaagta gcaaaggacc gtgttacgtt 1200 gttgttttgt tccaatgctt caggagaacg tatgttaaaa ccactgctag taaatcgtgc 1260 cttaagacca cgttcaatga aaggtgtaga tttcaataaa ttgccaattc actggatggc 1320 aaacaaaaag gcctgggtga cgactgcaat ctttacagaa tggtttcaga agtacttcgt 1380 cccagaagtt agacgataca tgaatgcaaa atgtctagaa tttaaagttc ttttaattct 1440 agataatgca cctggccatc cggtcttgga gcacccaaac gtgcaatttt gttttctacc 1500 gcctaatact acatccttaa tacaaccgct agaccaaggg ataattgcta catttaaaac 1560 gtactacata aaacgttcat tccaatacgt gctagataaa ctagatcaga atgaggaagc 1620 aacagttatc gatgaatgga aacaattttc tattatggac tgcattaatc aagtcggatt 1680 agcgctagct gaattaaagc catcaacctt gaactcgtgt tggaaaaatg tttggccaga 1740 atgcgtcaaa agcaaagatc ctgtcatcca taataacgct gaatatactg acatcataac 1800 actggcacat acaattggtg gagatggatt cgatgatcta tcatttgcag atatagagga 1860 attgttagtt gataaaagct tgagtgaaaa tgaaattata gacctcaccc ttgagactcg 1920 tcatgacaag gaacatagcg ataatgacga agaggatcct cctgttttaa atgcaactct 1980 aattaaagaa ggtcttgatc tcgccaggaa attaggtaat cattttgaac aacatgatcc 2040 tgatgaggaa cgagctaaaa aatttcaacg tgaactgaaa tcattaatgg catcttacag 2100 agaactttat aatggtttaa cgcgaaacca aacacaatct ttaataactg atttcgttct 2160 aaaaatttct gaattatcaa cacaggagac tagtgataat aatacaaaaa taatttccca 2220 agacgatgaa caaaataatt cgagtgatag cagtgatatt gcagtcttac gcaaacgtat 2280 acgtgtgtca gaaaatgaca atgaataaaa atttgttatg tttatattat ttttactttt 2340 atgtttatat tatttanttt tttgatatta atgaatttct gntattattt tcttttttcc 2400 aaatcataat ttatttttta taattaatat atttaatgaa ttaaaacgtg taagcacgaa 2460 aaataaattc ttttgtgaac tctaacaagg atttttttat atttattatt atacttttgt 2520 ttgtttattg tacttttgat tgctgtttta ggtatggaac caatctacta tttttncatt 2580 cggccaatat ctttttttac acgaattttt tttacacgaa tttcttagga acgtatctat 2640 cgtgtaaaaa cagaattagg tg 2662 // ID Gypsy-136_AA-I repbase; DNA; INV; 8134 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-136_AA_; KW Gypsy-136_AA-LTR; Gypsy-136_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-8134 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1005-1005 (2011). XX DR [2] (Consensus) XX CC Positions [5073-5552] - Integrase core CC 'CCAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 356..2674 FT /product="Gypsy-136_AA-I_1p" FT /translation="MAVRLPFADHLINDEVDYELTIRGKIEETKNDVEAKY FT RLLRYLFKEDEKEGRVYESPFTIDQEFDLICSRISELRSKLANGLDDRSFS FT RLKHYWGRVYRIMTKDGESERMRKELLKDIRTELSKFEGRKELAQRPDPFN FT FEGEDQSKKNGAKQKEQVDQNGNREGAEDRASEREKALEAQVKELQRKLEI FT LLNESKGAQGGDKNSDMENESLKGVNDGSRGYAGSASHTENQNEREKVDRD FT SRASLSGQSSKHNELARNYAQGNETYDNYRQNQERGSYHRQESRNYRAVER FT NDTVRERFQGDDEWDSYDRRRDGLGFDRFGNEHLERRNERFPLNRTSQMER FT EVVQRPYEVRGPWNEAVTRRAEVAQDRDAYRGQQASNAFRDEQIGHPFERR FT RSNGPWRHQQDVRRSYDQRWSGENAERHWSRDRDEQFRRSVGREPGIQLEF FT SSDEFSEDERRFTRRQREPRGNFRRISEAEIRDADRRMEKWHLSFSGDARS FT RSLEDFLLKVRRLARMDRIADDVLMQRIHTILRGEAYDWYLCYADEFLDWE FT QFEERIRYMYGNPNKDQGNRQKIYERKQNRNETFLSFKMEIERLNKLLSTP FT LDPQRIFEVIWDNMRPHYRSKLACRTVDSLRKLEYYAYRIDANDPIFRNSR FT EGPSKTNAVHNIEVERKEDESYSSDSEAENVNVMGSRFERDKKHRDQRSAG FT TNGRSQGGTARDTPNTVQLPLCWNCRKNGHLWRNCPEEKRLFCYLCGAQGK FT TATTCENHSGDARVRAMDNSGN" FT CDS 3483..5939 FT /product="Gypsy-136_AA-I_2p" FT /translation="MDAELERYKRMDAIEECASEWASALVPVRKANGKLRV FT CLDSRRINALTKKDSYPMRNMGEIFHRLEKAKYYSVVDLKDAYFQIPLKEE FT CRDFTAFRTPKGLYRFKVCPFGLTNAPFTMCRLMDKVIGFDLEPYVFVYLD FT DIVIATKTFSEHVRLLRIVGERLAKANLTISLDKSRFCRKKVAYLGYLLTD FT KGVSIDNARISPILDYARPKNVKDVRRLLGLAGFYQRFISNYSRIVAPMSD FT LLKKSKQKFVWTEAAEAAFGDLKAALIAAPILGNPDFSQPFCIESDASDLA FT VGAALTQQQDGQPRVIAYFSKKLSSTQRKYSSVERECLGVLLAIQHFRHFV FT EGTRFRVVTDARSLLWLFTIGVESGNAKLLRWALKIQSYDIELEYRKGKNN FT VLPDCLSRSVETVLTVNVDEGYQELASKIESCPTQYSNFRVVDGEILRYCK FT SEDGIEDGRFRWKRYPPKADRVEIIRRIHEEAHLGSEKTLAAVRQRFWWPK FT MAVDVKRQCQACMKCQTSKASNQNTTPPMMGQKKVVEHPWQFLAMDYVGPL FT PSSGKGRSTCLLVVTDLFSKFVMVQPFREATAETLAHFVENSIFLLFGVPE FT VVLTDNGTQFASKTFGELLEKYHVSHWRTPNYHPQVNDSERVNRVITTAIR FT ATIKNHHKEWANNLQRIANAIRNAVHSATKYSPYFLVFGRNQVSDGREYRQ FT MRDTGSADQAGLVPEEKEKLLEEVRENLKAAYQKHSSYYNLRSNANCPTYS FT VGEAVLKKNTILSDKGKGVSAKLAPKYVPAVVKRVVGSHCYDLEGVDGKRL FT GIFHCKFLKKLAKPGSP" XX SQ Sequence 8134 BP; 2363 A; 1494 C; 2214 G; 2057 T; 6 other; aattggcgcc caactaaggt cttcggaatt atttgaaaat ttatgtaatc tttgatgtag 60 agttaaattt tgtgaaatat ttatgcggta tttatggata gttttaggtt cacgtgttga 120 tatttttgga gtatgatttc gtctggattt tgaattttgt ggagattctc tgagtgtttt 180 gaggattttg taattgagat atattctttg gaggatattg agttctgtta aagattgaat 240 tgggatagtg taatataata attaatttga aagatttttg tgatagttta tgcgttgaat 300 tctatttatt ttctttattt cttattatat ttgaattgaa tttacggttt tcggaatggc 360 tgttaggctt ccatttgcgg atcatttgat caatgacgag gtagattacg agcttacgat 420 tagaggaaaa attgaggaga cgaaaaatga cgttgaggcg aaatatcggt tgctgaggta 480 cttattcaaa gaagatgaga aagaaggaag ggtatatgag tctccattca cgatcgacca 540 agaatttgat ttgatctgct ctagaatttc ggagttgagg agtaagttag cgaatgggtt 600 ggatgatcga tcgttttccc gtttgaaaca ctactggggt agggtctaca ggataatgac 660 caaggatggt gagtcggaac gaatgcggaa agaactgctg aaggatatac gaaccgagtt 720 gagtaaattt gagggacgga aagagctagc gcaacggcca gatccgttta actttgaagg 780 tgaggatcaa tccaagaaaa acggcgcaaa gcagaaagag caggttgatc aaaacgggaa 840 tcgggaaggg gcagaagata gagctagcga gagagagaag gcgttggagg cacaagttaa 900 agaattgcaa cggaaattgg aaattttgtt gaacgaatca aaaggggctc aaggtggtga 960 caagaatagc gatatggaga atgaaagttt gaagggtgta aatgatggta gccgaggata 1020 cgcaggttcg gcgagccata ccgagaatca aaatgagaga gagaaggttg acagggattc 1080 tagagcttct ctaagcggtc aaagctcaaa acataatgaa ttagctagga attatgcaca 1140 agggaatgag acgtacgata actacaggca aaaccaagaa cgcggcagtt accaccgaca 1200 ggagagcagg aattaccggg ccgtagaaag gaacgataca gtgagggagc gatttcaggg 1260 agatgatgaa tgggacagct atgacagacg acgcgatgga ctgggcttcg acaggttcgg 1320 taatgaacat ttggagcgaa ggaacgaaag atttcctttg aacagaacga gtcagatgga 1380 gcgagaggta gtacagcgac cgtatgaagt gcgaggaccg tggaacgaag cggtgacacg 1440 tagagcggag gtggctcagg acagggatgc ctaccgaggt caacaagcta gcaatgcctt 1500 ccgagatgag cagatcggac atccattcga gaggcgtcgc tcaaacggac cttggagaca 1560 tcagcaagac gtgcggagga gctacgatca gcgttggtcc ggtgagaatg cagagcggca 1620 ttggagtcgg gatcgagacg aacaatttag acgaagtgtt ggtagagaac ctggaattca 1680 gttggagttc agttccgatg aattctcaga agacgagagg aggtttacca ggagacaacg 1740 cgagccgaga ggtaacttca gacgaattag tgaggctgag attcgggatg cggatcgtag 1800 aatggagaaa tggcacttga gctttagtgg ggatgcacga agcagatcgc ttgaagattt 1860 tctgctaaag gtgcggcgtc tagcaagaat ggataggatc gccgacgacg ttctcatgca 1920 gagaattcat accattctac gtggtgaagc atatgattgg tatctgtgct acgcggacga 1980 gtttctcgat tgggaacaat ttgaggaaag aatccggtat atgtacggca atccgaataa 2040 ggaccaaggg aatcgccaga aaatatatga gcggaaacag aaccgaaatg aaactttcct 2100 tagtttcaaa atggaaattg agcggctgaa caagcttctc agtactccat tagatccaca 2160 gcgtattttc gaagttatct gggataacat gcgccctcat taccgttcga agctggcatg 2220 tagaacggtc gatagcttgc gaaaattgga gtactacgct tatagaatcg acgcgaacga 2280 cccaattttc agaaattctc gggaaggacc aagtaagacg aatgcggtcc ataatattga 2340 agtagagcga aaagaagatg aatcttacag ctcggattcg gaagcggaga atgtaaatgt 2400 gatgggaagc aggtttgaga gagataaaaa gcacagagat cagcggtcag caggaacgaa 2460 tggtagaagt caaggaggaa ctgctcggga caccccaaat acggtacagt tacctctatg 2520 ttggaactgt aggaaaaacg gtcatttgtg gagaaattgt ccagaagaga agagactatt 2580 ctgctatctg tgtggcgcac aagggaagac ggcgactacg tgtgaaaacc actcgggaga 2640 tgcgagagta cgggctatgg ataactcggg aaactaagcc aggagtgcca gctagggaac 2700 gacagcattc cggtttcgag agttcccagg gagatggagg cgattccata cgtagatcca 2760 taccagaacg tatgtgaggt aaaaattcac acggacacgt gtccgcatgt ggcggtacaa 2820 tcttcaacaa gacctacgac gcacttttag actcaggagc gagtgtagcg taacgagctt 2880 ggcaagcata gcggaagaaa acgggctgac cgtgcaccag agtccggtta aaatagtgac 2940 agcagataag actgtgcaca agagccaagg gtatataaac ttaccgatgg aatttcgggg 3000 tattacgaag attataccga ccttgatagt accgcaggtt gccaggagtc tgatcttggg 3060 gtacaatttt tggaaaacct tcggtataca gccaatgatt caagggacag acggttttga 3120 gcaagtagcg accgttagac accagtttag caagtggtga aggtagtcgt ccgattgagg 3180 ttccaattct tccgattgag acgttaccga cgatcaaagc atccgatccg gatgagtcgt 3240 tggatattcc ggcattggaa ttacccgagc cgtcgaaagc aactccggaa accattgaaa 3300 cggaacatga attgacaaaa gaacaaagga aggaactgac agcagcgatc aaggtattcc 3360 catgtacaac cgagaaccgt ttgggaagaa cgtcggtgat tcagcatgag attgtattga 3420 cagaggaggc caaaccgagg cgtcaaccgc tataccggtg ttcaccggca atccaggcgg 3480 aaatggatgc ggaactcgag cggtataagc gaatggatgc cattgaagaa tgtgcgagtg 3540 aatgggctag tgcactagtt cctgttcgta aagcgaacgg gaagctacgt gtttgcctag 3600 actccaggag gattaacgca ctcacgaaaa aggactccta tccaatgagg aatatgggcg 3660 agatattcca tcgtttggag aaggcgaagt attactccgt agtggatctg aaagatgcct 3720 attttcagat ccccctcaag gaagagtgcc gagacttcac agcgttcagg acacccaaag 3780 gcttgtaccg tttcaaggtt tgcccctttg gattgacgaa cgcaccgttt acgatgtgcc 3840 ggctcatgga caaagtcata gggtttgacc tcgaacccta tgtatttgtg tacttggacg 3900 acatcgtgat cgctaccaag actttcagcg agcatgtacg tctgctgcgc atcgtagggg 3960 agaggttggc gaaagccaat ctgaccattt cgttggacaa gagcaggttc tgtaggaaga 4020 aagttgcgta ccttgggtat ttgctaacgg acaaaggagt ctcgatagac aatgcacgaa 4080 tttctccgat tctggactat gcacggccga agaatgtcaa ggacgttcga cgactgttgg 4140 gtttggcggg cttctatcaa cggtttatca gcaattacag ccggatagtt gccccaatgt 4200 ctgatctgtt gaagaagtcg aaacagaagt ttgtctggac cgaggcagca gaagcggcgt 4260 tcggagactt gaaagcggct ctgatagcag ctccaatctt gggaaaccca gacttctcac 4320 aacctttctg catagagtca gatgcatctg acctcgctgt gggagctgct ttgacccagc 4380 agcaagatgg tcagccacga gtcatcgctt actttagtaa gaaactcagt agcacacaac 4440 ggaagtattc cagtgtagaa agggagtgcc taggagtttt actggcgata caacacttcc 4500 gtcacttcgt ggagggcact cggtttcgtg tagtaaccga tgcacgtagc cttttatggt 4560 tgtttacgat cggcgtagaa tctggcaacg caaaactgct gaggtgggcg ttgaaaatcc 4620 agtcctatga cattgagctg gaatatagga aaggcaagaa taacgttttg ccagattgcc 4680 tgtctaggtc ggttgaaaca gtwctgacgg tgaatgtgga cgaggggtac caggaactgg 4740 cgtcgaagat cgaaagctgt ccgacacagt actctaattt tcgtgtggtc gatggagaaa 4800 ttttgagata ctgtaaatcg gaagacggaa tcgaagatgg acgtttccga tggaagagat 4860 atccgccgaa ggctgatcgt gtggagatta tccgaagaat ccacgaagaa gcgcatttgg 4920 gttcggagaa aacgctagcg gcggttcgac agaggttttg gtggccgaaa atggcggtgg 4980 acgttaagcg gcaatgccag gcttgcatga agtgccaaac aagcaaagcg tcgaaccaaa 5040 atacgactcc accaatgatg ggccagaaga aggtcgtaga acatccgtgg cagttcttag 5100 cgatggacta tgtgggtccg ttaccatcgt ctggaaaggg gagaagcacc tgcttactag 5160 tagtgactga cctgttcagt aagtttgtca tggtgcaacc gttcagggaa gcgacagcgg 5220 aaactttggc tcacttcgtt gagaattcca tcttcctgct ctttggcgtg cctgaagttg 5280 tgcttacgga caacggaacg cagttcgcgt cgaaaacgtt cggggagttg ctggagaagt 5340 accatgtctc tcattggaga actcccaact atcaccctca ggtgaacgat tccgagaggg 5400 taaatcgtgt tatcacgact gctatccgtg ctaccatcaa gaatcaccat aaagagtggg 5460 caaataacct gcagcgtatc gcgaatgcga tccgaaatgc ggtccatagc gctaccaagt 5520 actcgcctta ttttctggtg ttcgggagga accaggtgtc cgatggtaga gaatatcggc 5580 agatgcgtga cacggggtca gcggaccagg caggtctggt accggaagag aaggagaagc 5640 tactggaaga agtacgggag aatttgaagg cagcgtatca gaagcattcg tcatactaca 5700 acctcaggtc gaatgcaaat tgtccgacat attcggtagg tgaagcggtt ctgaagaaga 5760 acacgatcct ttcggacaaa gggaaagggg tgtcggctaa gctggcaccg aagtacgttc 5820 cagcagtcgt caagagggta gtgggtagcc attgctatga tttggagggc gtggatggaa 5880 aacggcttgg aattttccat tgcaagtttc tgaagaaact tgccaagcca ggctctccgt 5940 agtttgaccg atttggacct tcagctatgt acctgtctta gacagtaaca aggaccagtg 6000 ctaggcctgg tctctaaaat atttgaaaag ggacctgttt tccagctatg atgctcttgg 6060 gttgcaccca agagttacaa atactccgtt gaagctgtgc tccgggtaat gaaaacactt 6120 gttcgccttt gcgaagctta gcgctgatct tcggatcagc tgccgcccct acaaaacctt 6180 tacttactag gggcatcctt ggactgtttt atcggtccaa gaagaagctt cgcattgtgc 6240 ttgataaggg agttctctag ttctttcaaa gtagaaccgt tggaaaggag tgtttgtggc 6300 tatacgtgat cccctcgacc agaagtcaga gagagccctg gaagttttcc cgggcgatga 6360 tctcgtaact tagaagctca gagccgtagt gcgttgacga gcagtaccaa agggaagtaa 6420 aaatcgtgag tctgatattg gggagtatca gatgctgatc gaaatcaaga ctcagaagtc 6480 atccataagg aaggacgatg ttgagcaacc cgagtcaatc gagcgatagc tacaacatct 6540 tcagaaacgg gctctgtctt gaaggaattg gatgtactgc cgtaatgtgg gatggacaag 6600 gtcatatgaa ggttgtttga aaatttggcg ttgaatgaga gataagggat gtttcgaaat 6660 ttgtaaataa tgttagtgtt gtagttcaca aaggtatttc ttaaggctag gttagtactt 6720 ataattaata tctttaataa aataatataa aatacaaaaa acagcttaaa agtagggttt 6780 gggtccatgc tccgtcaatg tcttcsttca catcttcagt cagtcgtcca agtacctaaa 6840 acagcaaaac aaagaattag tacccgttca tgcacaaatc agcacttgga aagatgcctt 6900 cctggtaktc gatccaccgg aagttgccgt gtcggtcctt aagcggtctt caaaagaggt 6960 cgtccwgaca gggtccggtt cagtgcwgca gaagatcgtc aacgtcttcg tattcctccg 7020 tcagtccgtc tcgatgtcca tcgtagctct aaaactgcaa aaacaagaga acttcaacat 7080 aataaacagt ataacagcta ccttttccgt tagtttttag gaaatttagc caatttccgt 7140 acgtaaatca aggattaatt accagaaact ttcacaaatt tttgttgcac tgcactttca 7200 atgtttactt ttttactttg acgtttatca gaaagggtgg gtggttcagt gtcgtgagtg 7260 tccggagaga atttctctga gtgtcgtaat agagttggtc ttgtttgttt tcattgatta 7320 atgtttcgta agagctgaga gagagcatcg gcgtttgaaa cattagtgag atgatagttt 7380 gtgagaattt atgttataca ctcatacatt gatggaagcc ggagaacact ctggtgtgtc 7440 aatgtatgta gtgcgaataa tgtgtgatgt atgttacgtt ttgtcccaca cagagacgtt 7500 gagaccaggt agcaggcttg gtggttctct gtgagacaga acagttgcgg tgagttcaga 7560 gcagaaacat gttaccaacg atgaggattg gtgtagttga agcctgaatg aaccagtcat 7620 ggagatgtga gttgcggtca tataagagag gcagttaacc aacagagagg attggtgttt 7680 gtctcgcttt atgtgtacct gttaagtgtc gatttwcgtt cattgagaat tcgttcaatg 7740 atgtgttgtc atgatttggc ttctgctgag actgaaaggt ttctgtagca atgccgtttt 7800 ctgaattttt gaggtttgag catttgttgg tatctgtatg cacacccatt ctgagaacgt 7860 caaagcgctc atatgtatct tcgttggagt ggacctatat tttgtatata tttgtaaata 7920 gtaatgatag ttgatatgaa ttagtttttt ttttgttgta tttattatga attattttgg 7980 tgaataggat aggaacccgg atgaattagg aaaaattctc aaaataaggg aataatgaaa 8040 ttgaaatact ttgtacattt ggtgtacagt ttaccactac gaaaatttgg tttaaatcct 8100 taaaccaaat tttcgtaaat cagccctggt gcaa 8134 // ID BEL-18_CQ-LTR repbase; DNA; INV; 730 BP. XX AC AAWU01035939; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-18_CQ_; KW BEL-18_CQ-I; BEL-18_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-730 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 190-190 (2011). XX DR Genome; AAWU01035939; Positions 10164 10893. XX SQ Sequence 730 BP; 238 A; 165 C; 158 G; 169 T; 0 other; tgtggacgga agcaacagca tcgacggtac aacattgcgt gagggtccct cgtcaaacag 60 caaacacatc tacaccacca gggtagctga gcgaagtcta cgtcgatcgg gccgtctagc 120 tagtgccaac cagaagaaga ggaagaatta agcgaaacga gtagggattg tacgcagtgc 180 gatttgcaga acgccacctt ttttgtttaa aacgacttaa acataacaat tagtagtttt 240 aaaacattgc gctgaacgat actccgtgta cgaccgaagc gagtggttct agaccgtggt 300 cgcgagaacc acacagctac ttcggtaatt aacgactccc cgcaacttta ggttagaatt 360 cagatgattt gctaatttcc ctatttaatt tacatgtaaa ggttccgagc agaacctggc 420 aagtgtccaa acggccaagg aggccaaaaa gatccccaga acacgcgtaa gcaattaggt 480 atgatcaaat gtgaattagc tccaaacata cctcctaaca atctctgtac gaacgattac 540 cgtaggctgg gcaaaactgg tcgtcaccga acatcgttgt cgtgtttaat gatcaatcgg 600 attgagggaa aactaattgt aagtctacag aacatctact gatagcaatg aacttaataa 660 aacgaattat ttcagcttta agctgcgcta cacaaaacag ctgctgcaaa gagtttcacc 720 caaatcaaca 730 // ID Sola3-1_CB repbase; DNA; INV; 6050 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Caenorhabditis brenneri. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-1_CB. XX OS Caenorhabditis brenneri OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-6050 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(1674..1847,1831..2430,2550..2960,2964..3842, FT 3862..5184) FT /product="Sola3-1_CB_1p" FT /translation="MLQKHIYRRLNKNACSMPDSLNGVHAGGRTNIFHHHL FT TVQEADLLMKEKGILVHPGIRFIPEYVSLSVYSLSLPDFFSAVCRIHENYL FT NDLKTSSEQNTGRRKSVVNYEEEVIEEAESCSQDDPEYFCEESLNQSPAMS FT SLQKFAEAAGIDLSCYSRKPYSELQERSRQKKAVRVRKLVDVMIEVIAPND FT KEEFSTRVFKKQLAGIASNDNENLNKILKQMSDHYLAAFDRRSKLTILSLV FT ADVLPYSSVVQYIPNLRKNTIGMQLRVSLSSLQGKETRIYELFIPIQPHNY FT DWTPLWFKKSQDVGRLESGNSKFYPSTKFYRNLQHVSQYAGGLKSFSFYFL FT NFLLKETGQMELMLSKSTIFRILKVCEATERSATTCVDYFIANGMEVNFAN FT NTKIHFLLKAFDELHSIVDNWIEQDAIDLDILKELKIELHETAQYLRADYR FT LHVKKASRVADHCATFALSDPNDAKMSLSCSSGEHAHKHDLKCPRCESGDA FT TLTRIKNYAEDLLTETKNLPTNADSEAQRKVDELVAKYEEQVQLISKYIDQ FT VFEMKKHLLRAAVTNQEREDIINDLEDNHALITIDFAQKYLPRWHREKQSD FT YFGKKGLSWHVSHVAARIGDNYTQHSFIHIYDSEVQQVSYLLKCNMTSFIF FT SEQRVSDPDVVSHRYRIEESRNYDNIYPVGQRRSVDLTLINLGAYHCVSTI FT ASLHWLMNEFGITVKSYSFSEAQNGKSSSDRDAARVKQKASRYVSMGNDIT FT TSADFFKAIKSGKQLNGVSIYHGSVAKDLDGKAKWDGISDLNYFTVEDNGI FT RARKYSGIGEGKFVPTDELVSMNGTYSFEEAGFVASGIGSIESERQAVRDG FT TQTKFWYYSPKRCTVSNCEEETEETTTTELDETVDTSSDEKKLFLCSHPGC FT SASFLTVSYLEKHLLRGVHKITPEKLTMNDYSLKMYSRGLEEVNRKRKHSG FT PIDVIAEAIDDFKEETETVRLSTGWALAKKATRKAFVADVKKFLVDCFEEG FT LNKKRLNPSTIAKRMAAATNADGTKRFTVEQRLDVKQIAGFLSRESRKRRA FT TSLRLRRDENNVVHVVTEENVEDLEERHWDADWIDYIEDEMTWTEWDEFLA FT AIAESAEPLFDYNNVDANPI*" XX SQ Sequence 6050 BP; 1906 A; 1128 C; 1270 G; 1746 T; 0 other; gaggcaatct ctgagagcct cgagaactcg atggcattag aaattctaga attggctgta 60 acttggtaaa gtaggtatat aaggtcccct gaacacgata gtccactttt tttttggccc 120 cgcccctttt ccagctggcc acgccccctt tcagctcaaa agtcaacttt tttccccgaa 180 aagctattga tgtctactct ttgggaatcg atagaagaca cttttagaag cattttcaca 240 gttcagacca actttctagc tctcctcctt catgttgaaa aattcagttt tttgtgaaaa 300 ttttcgattt tttgattcgc tccggaaatt tttctacata gtcacgcgag ataatgttat 360 aatcgtattc tacgcagttt tttacccctc tgggcaaaat atcagggcat gtttcgcggt 420 gtattaagac acttctaggc cgattttcat caaaaatcgc acatatcaga aaactaaact 480 tcacatattt tgtttcaaag aaccgattcg gactttattg atctggaaat gcataaaaat 540 gatattgagt gatgttttca taacatttga gaccgtctgg gatcaagttc cagactctac 600 tagctaaaaa tatgatattc tgaagtttca catggatact ctaaacatcg tattttagtc 660 tgaaattttt caactaaaaa agttgggata agttcagatt catgttctac gtagtttttt 720 acccctaaaa gaaacatatt aggacatact acgcagtgta tatgggcatt tttagatgtt 780 tttctacttt ttgcgcacgc acagaaagtg gtttgtctag cgcctccctt tgtaaccaac 840 ggactttagt attgcccatt tgcattccca tctgtttggt tgttattcta tcaacagaag 900 caaaacagat ggcgtcttca tcacatggtt tagtgggggg gggggggggg tgtaatgcag 960 cagcagctgc agagaataat tgatgagatg cacaatgtgt aatccctaga gaatgcgaaa 1020 gaacagcgtt attcattatt ctccgttttc tttttctctt cgttttcctt gtggttttgt 1080 tcagtatgtt tatatacatt gagagaaaca attcagaatc aatactccac agtttgtaat 1140 cagatcatcg agggtgtgca taatgaacgt tgccttgtac ttggtgcttt tattttacgc 1200 agtatactct ttgcctgtca attacagcac aaaagtaagt ttttcttcaa ttcagttttc 1260 caaaatagga tttgcagagg ccgaaagaag acgagagcgg tatacctaca aaaaaaactt 1320 gtcaaagatc aacgtgttat ttttctgttt ttcgaaccaa caagtttgag aaggagcctc 1380 gtcccgaaaa gtctatagtc agcacagaat gttcaactga tatcgataag taagtttttt 1440 ttgaagctcg caactctcag ttttggtcct acagactcgg aagctactat gatcatgctc 1500 taaatccgac tactggcgaa aataatgcag tctctctcat cctctgccga ggagcgggta 1560 tatgtgatca agatttcagc aagcctgaag tgaaagctcg atggaaaaat gtgctgattt 1620 gtaaaaacca cgttaacgaa ctattagata aatggttaga ttctaatagt aagatgttgc 1680 aaaaacatat ttatcgccga ttgaacaaga acgcttgttc aatgccagat agcctgaatg 1740 gtgttcatgc tggaggaaga actaatattt ttcatcatca tctaactgtt caagaagcgg 1800 atcttttgat gaaagaaaaa ggaattttag ttcatcccgg aatacgttag tctatctgta 1860 tatagtttat cacttcctga tttcttttca gctgtgtgtc ggattcatga aaactatctg 1920 aacgatttga agactagctc tgaacaaaat actgggagaa ggaaatctgt ggtgaactat 1980 gaagaagaag taattgaaga agccgaatca tgtagccaag atgatcccga atacttttgc 2040 gaagagagtc taaatcaaag cccagccatg agttcgctgc aaaaatttgc cgaagctgct 2100 ggaatagatt tgtcatgcta ttctaggaaa ccatattcgg aacttcaaga aaggagccgt 2160 cagaaaaaag ctgtgagagt aagaaagttg gtcgatgtta tgattgaggt gattgctcca 2220 aatgacaaag aagaattcag tactagagtt ttcaaaaagc aacttgctgg tatagcatca 2280 aatgataacg aaaatttgaa taaaatcctg aagcaaatgt ctgatcatta tctcgctgct 2340 ttcgatcggc ggagcaaact gacaatactt tcccttgtcg ctgatgttct cccctattct 2400 tcagttgtcc aatacattcc taacctgagg tgagcatcat tcttgtatca aaacaattgt 2460 tctctattct tttagtcgat acatgtacgg agaagcaagg aaatttgctc gtcgcaatat 2520 cgattctgaa gagccggtga aaatattaga aaaatacaat cgggatgcag ttgagagttt 2580 cattgagttc attacaaggt aaagaaaccc gaatctatga actatttatt cctattcagc 2640 cccacaatta tgattggact cccctatggt ttaagaaaag tcaagatgtc ggacggctcg 2700 aaagtggaaa ttccaaattc tatccgtcaa caaagttcta cagaaattta caacatgtat 2760 cacagtatgc tggaggtttg aaatcgttta gtttttactt tcttaatttc ttacttaagg 2820 aaaccggtca aatggaattg atgttgtcga agtccacgat tttccgtata ttgaaagtat 2880 gtgaagcaac agaacgatcg gcaacgacgt gcgtcgacta cttcatcgcc aacggcatgg 2940 aggtaaattt tgctaacaac taaacaaaaa tccatttctt attgaaggcg ttcgacgaac 3000 ttcattctat tgtagacaac tggatagaac aagatgctat tgatctggat attttgaagg 3060 aattgaagat tgaactacat gaaaccgcac aatacctccg agcggattac cgccttcacg 3120 ttaagaaagc tagtagggtt gcagatcact gtgccacttt tgccctaagc gatcctaacg 3180 atgctaaaat gtcattgtct tgctcttcgg gtgaacacgc tcacaaacac gatctcaaat 3240 gtccgagatg tgagagtgga gatgcgacgc tgactagaat caagaactat gcggaagatc 3300 tactcaccga aacaaagaat ttaccgacaa atgcggattc agaagctcaa agaaaagttg 3360 atgaacttgt tgcgaagtac gaggaacaag ttcagttgat cagtaaatat attgatcaag 3420 ttttcgagat gaagaagcac cttcttcgtg ccgccgtcac aaatcaggag agagaagata 3480 tcatcaacga tctcgaagac aaccacgcgc taataacgat tgatttcgcg cagaaatatt 3540 taccaaggtg gcaccgcgaa aaacaatccg attatttcgg aaagaagggg ctcagttggc 3600 atgtatctca tgtggctgct cgaattggag acaactatac tcaacacagc ttcattcaca 3660 tatatgacag tgaagtgcag caagtaagtt atttgctcaa atgtaatatg acttcattca 3720 tattttcaga acagcgagtt agtgatcctg acgttgtctc acatcgctac agaattgaag 3780 aaagtaggaa ttacgacaat atctatccgg tcggacaacg ccggtcagtt gacctaacct 3840 tatagagcaa taagtaattg aataaattta ggggcctatc actgtgtttc aacgattgct 3900 tcactccact ggctgatgaa tgaattcgga atcactgtaa aatcgtacag tttttctgag 3960 gcccaaaacg ggaagtcatc gagtgaccgt gatgctgctc gtgtgaaaca gaaagcgtcc 4020 agatatgttt cgatggggaa tgatatcacg acttcagccg attttttcaa agcaataaaa 4080 agtggaaaac agctcaatgg agtttccata taccatggat ctgttgcgaa agatcttgat 4140 ggtaaagcga aatgggatgg aatttccgat ttgaactatt ttaccgtgga ggataatgga 4200 attcgtgctc gtaagtattc tgggatcgga gaaggcaagt tcgtaccaac tgatgagtta 4260 gtgtcgatga acggtacata ttcgttcgag gaggctggat ttgttgcttc tggtattggt 4320 tctatcgagt ctgaacgaca agctgtacga gacggcaccc agactaaatt ttggtactac 4380 tccccgaaaa ggtgcacggt ttccaactgt gaagaggaaa ccgaagaaac gacgacgacc 4440 gaactagacg aaacagtcga tacttcaagt gatgaaaaga agctttttct ttgttctcat 4500 ccaggatgtt cagcttcgtt tttgacagtt tcatatctgg agaaacatct tttacgtgga 4560 gtccacaaga ttacgcccga aaagctgaca atgaatgatt attccttgaa aatgtactcc 4620 cgtggattag aagaagtaaa ccggaaaaga aaacattctg gtccgattga tgtaattgcg 4680 gaagccatcg atgactttaa ggaagaaacc gagactgttc gtctttcaac tggttgggct 4740 ctagcaaaga aagcaacacg aaaagccttc gtagcagatg tgaaaaaatt tttggttgat 4800 tgctttgaag aaggtctcaa caaaaaaaga cttaatccaa gtacaattgc taaacgcatg 4860 gctgctgcaa caaacgctga cggaactaaa cgattcactg tagaacaacg gttagacgtc 4920 aagcagatag caggattttt gtcacgagaa tcacgaaaaa gaagagccac ttccctacgg 4980 ttgagaagag atgaaaacaa tgttgttcat gttgtcaccg aagaaaacgt tgaagatttg 5040 gaggaacgtc actgggatgc tgattggatc gattatatcg aagacgaaat gacttggact 5100 gaatgggacg agtttctggc ggcaattgct gagagtgctg aacctctttt cgattataac 5160 aatgttgatg caaaccctat ctaacttttt tcaaaactaa aagcttcata attccgtctg 5220 attcatcttt cagtgcgcaa aagtagaaaa acatctaaaa atgcccatat acactgcgta 5280 gtatgtccta atatgtttct tttaggggta aaaaactacg tagaacatga atctgaactt 5340 atcccaactt ttttagttga aaaatttcag actaagatac gatgtttaga gtatccatgt 5400 gaaaattcag aatatcatat ttttagctag tagagtctgg aacttgatcc cagacggtct 5460 caaatgttat gaaaacatta ctcaatatca tttttatgca ttccagatca ataaagtccg 5520 aatcggtcct ttgaaataaa atctgtgagg tttggttttc agatatgtgc tgtttttgat 5580 gaaaaagggc ctaaaagtgc cctcatccac cgcgaaacat gtcctgatat tttgcccgga 5640 ggggtaaaaa actgcgtaga acacgataat aacattatct cgcgttactc cgttgaaaaa 5700 ttttcggaga agaagaaaaa agtgaaaaat tttacaaaaa tgtgaaattt tcagcatgaa 5760 ggagaagagc tagaaagttg atttgaactg tgaaaatgct tctaaaagtg tcttctattg 5820 aatcccgaag agtagacatc gatagctttt cggagaaaaa agttgacttt tgagctgaaa 5880 gggggcgtgg ccagctggaa aaggggcgga gccaaaaacg aaaagcgcca ttatgttcag 5940 ggggtcttat atacctagtg tacaatgtct cactagactc cgatcacttt tgtaaaatcc 6000 taatgccatc ggtataacgg gcagtgctaa attctctcag agaatgcctc 6050 // ID Gypsy-25_DWil-LTR repbase; DNA; INV; 436 BP. XX AC scaffold_181136; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_DWil_; KW Gypsy-25_DWil-I; Gypsy-25_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-436 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181136; Positions 1776609 1777044. XX SQ Sequence 436 BP; 139 A; 83 C; 119 G; 95 T; 0 other; tgtgtgggat gtgcatttgc gaagtgtatc aatcccactg tatacgagag acgaagacga 60 gacaacgatc aagagagaag acgtgagcgt gtgagaaggc acgatcaaag agggatgaca 120 aacgatagag agatgacgac gaggagagtt ggtttttgcg cgtcaaacaa gacggacgtg 180 ttttgaggtc ttgcgtcata gtcggatgca aggtcaagtc gagtaccata gacgggtgct 240 aaaagtcgtg tgtcaaagtc ggacacttaa agtcgagcgc caaagtcggg cgcgaaaagt 300 cgaagcgtca tagacggacg ccaaagttga gccgagccaa acgtcgattg tcatttcaca 360 tagtgtaatc tctataaagt cgtattttca tgtaaacccc ctagactata attatatcgt 420 taaataaaac cccaca 436 // ID RTE-12_BF repbase; DNA; INV; 2915 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-12_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-12_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2915 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2915 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1710-1710 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..2895 FT /product="RTE-12_BF_1p" FT /translation="IATWNVLSMKQEGSALLVANELARLDITVAGLTEVRW FT PGSGSCRANDYTYLWSGREDGLHRQGVALALAPSAHKALSFWKPINNRLLL FT ARLRHRHGMISIIVAYAPTDVAPAEEKDDFYSKLSALVANISRHDIIWVLG FT DFNATTGTNTTGYESSLGPYGTGSRNDNGSRLLEFCACHRLRTERSFFPHK FT QIHCMSWISNDGHTTKEVDHILTNSRWHTSSNCRVYRSAQFGNADHRLLSM FT TVALYLQKDPKSAQQRKFDIARLQSPDVSADFQLELHNRFEALLDISDEDH FT GDPAKIWDGFKHNLTEAATKTVGFTRSRKKRAFLSMETLKVVELRRQARLN FT GDISEYRRLNGSRNKLLQRDKQNWLENTAKEAEAAARAGNQSVLYKTLRTL FT SGKTTPPTANVKALDGSDLETPEQQLERWREHFSTLLNRPPPPPCNELDAM FT ASMAVEDDSISVDPPSLDEVEKAIGKLKPGRAAGADGIPPELLLKGGPAVV FT AWLHALIVSIWRTGNIPTDWRLGVILPFWKKGPKDNCGNYRGITLLSVPGK FT VLAHILLARLRPLLLQKQRQEQSGFTPGRSTVDRILTLRILAELRREYREP FT LYATYVDLKQAFDSVDRAALWKILKILGVPSKLLGLLSSLYSDTVSCVRVN FT GQTSDVFEINSGVRQGCVLAPTIFNTAIDYVMDRTVTQSSCGANYGRVRIT FT DLDYADDVALLAELLETLSIALQHMDAATKPLGLMISWQKTKVQSLCDYQP FT PPENQVINGQEVEAVNKFSYLGSTITSDCRCIADIQVRIGRAAAAMANLAN FT IWSNKQLSLQTKLNLYNSLVLSILLYGSEAWTLTASWEQHLDAFDTKCLRR FT ILGLHWYDFVPNATVRQMTKQPPISQLIRSARLRIFGHLARSSPLSEPARL FT ILEPTPRWRRPRGRPRMKWLDQLTSDLAAVNMDLPTAWQAAQDRSFWRRCR FT GATLLGASGL" XX SQ Sequence 2915 BP; 757 A; 802 C; 725 G; 631 T; 0 other; atagcaacat ggaacgttct ctcaatgaag caagagggat cagccctcct agtggccaac 60 gaacttgccc gtcttgacat cacagttgcg ggactaactg aggtgaggtg gcctggatca 120 ggctcttgtc gagccaatga ctacacttac ctctggtctg gcagggaaga tggcctgcat 180 cgccaaggtg tggcacttgc cctggcccct tcagcccaca aggccctcag cttctggaaa 240 ccaataaaca accgtcttct tctagcccgt ttaaggcacc gtcatggcat gatctccatc 300 atcgtggctt atgcacccac cgatgtggct ccagcggagg aaaaggatga tttctattcc 360 aaactctcag cgctggtggc caacatttct aggcatgaca tcatctgggt attaggtgac 420 tttaatgcta ctacaggcac caacaccact ggttatgaaa gctcccttgg gccgtatgga 480 acgggttcac gcaatgacaa tggcagccgg cttctggaat tctgtgcctg tcatcgcctg 540 cgtacagaac gctccttctt cccccacaag caaatccact gtatgtcctg gatcagcaat 600 gatggccaca ctaccaaaga ggtggaccac attctgacta actcgcgctg gcatacctcc 660 tccaactgtc gtgtctaccg tagtgcgcag ttcgggaacg cggatcacag gctactctcc 720 atgacagttg ccctctacct tcaaaaggac ccaaagtctg ctcaacaacg taaatttgac 780 atcgcaagac tacagtctcc tgatgtctct gctgacttcc aacttgaact gcacaaccgc 840 tttgaagcac tgctggacat cagcgatgag gaccacgggg atcctgctaa aatatgggat 900 ggcttcaaac acaacctgac ggaagctgcc acaaagacag tgggatttac aagatccaga 960 aagaagagag ctttcctatc tatggagaca ctcaaggtgg tagagctaag gaggcaagct 1020 agactcaatg gtgacatcag tgaatataga cgcctcaatg gcagccgcaa caaactctta 1080 cagagggaca aacaaaactg gttggaaaat actgccaagg aggccgaagc agcagcacga 1140 gctggcaacc agtctgtcct gtacaaaact ctcaggactc tttctgggaa gaccacccca 1200 cccacagcca atgtaaaggc cctggatggg tcggacctgg agacaccaga acaacagcta 1260 gagaggtgga gggaacactt ttcaacccta ctgaaccgtc ccccaccacc tccatgtaat 1320 gagctggatg ccatggcttc aatggcagtg gaggatgact ccatctccgt agaccctccg 1380 tccctagatg aagttgagaa agccattgga aagctcaagc caggccgtgc tgccggtgca 1440 gatggtatcc caccagagtt gctgctgaag ggaggcccag ctgtagttgc ctggctgcac 1500 gcactaatcg tcagcatttg gcgaaccggg aacatcccta cagactggcg ccttggtgtt 1560 atcctcccat tctggaagaa ggggcccaag gataactgcg gcaactatag aggaatcact 1620 ctgctcagtg taccaggaaa ggtccttgcc catatcctcc tggcaagact acgcccccta 1680 ctccttcaga aacaacgcca ggaacagagc ggtttcaccc caggtcggtc tacggtggac 1740 agaattctta ctctcaggat cctggcagag ttaaggaggg agtatagaga accactctat 1800 gcaacgtacg tcgaccttaa acaggccttc gactctgtgg acagagctgc actgtggaaa 1860 atcctgaaaa tccttggcgt tccgtccaaa ctcctgggtc ttctctcttc cttgtactca 1920 gatacggtct cgtgtgtgag ggtgaatggc caaacgtctg atgtttttga aattaacagt 1980 ggggtaagac aagggtgcgt gcttgcgccc accatcttta acacagcaat cgactacgtc 2040 atggatagga ctgtgaccca gagttcctgc ggtgccaatt atggcagggt taggatcaca 2100 gaccttgact atgccgatga tgtagctctc ctggcagagc tgctggaaac tctcagtata 2160 gcactccagc acatggatgc agcaacaaag ccacttggtc ttatgattag ctggcaaaag 2220 accaaggtac aaagcctgtg tgattaccaa cctcccccag aaaaccaggt tatcaacggc 2280 caggaagtgg aagctgtcaa caaattcagc tacctgggaa gcaccatcac ctcggactgc 2340 aggtgcattg cagacataca ggtacgcatc ggtcgggcgg ctgcggctat ggcaaacctg 2400 gcaaacatct ggtcaaataa acaactctcc ctccaaacga aactcaatct ctataatagc 2460 ttagttcttt ccatcttgct ctatggatct gaggcatgga cactaactgc gtcctgggag 2520 caacatcttg acgctttcga cacaaagtgc ctacgtcgca tccttgggct gcactggtat 2580 gactttgtcc caaatgccac agtgcgtcag atgacgaagc aaccccccat atctcaactg 2640 atcaggtctg ccagactacg gatttttggc catcttgcta ggtcgtcacc tctgtccgaa 2700 ccagccaggc tcatccttga accgacccca aggtggagaa ggccaagagg gaggccccgc 2760 atgaaatggc tggaccagct aacatcagac ctggcagctg tgaacatgga cctccccact 2820 gcctggcagg ctgcccagga caggtctttc tggagaagat gtcgaggcgc cacgctctta 2880 ggagcaagcg ggttgtgagt gtgagtgtga gtgag 2915 // ID Crack-24_AAe repbase; DNA; INV; 4632 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-24_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4632 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1240-1240 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 403..1449 FT /product="Crack-24_AAe_1p" FT /translation="MMYEDESDIICYVCEKSEVNSLLTIECANCSKCAHFR FT CKKLFGKAVAKARKKPYLCSVECAKIHACTDEKSPALISEIRLLGQAISES FT QKEAVMVRKALELTRMQLDTLVKTSKGIEDSQQFLSNQFDDLRAKFIEFRE FT EIDVMKEENNKAREEFRDLQQKYHALLSSVDSMESKLRSVSQAAVANKVVI FT LGLPLTDDEDLKAVVCDVGTAVGIELSPSVIENVQRIFSKRNQNSSVPILV FT SFSNHATKEKLFECKRRYGALLASTVSRKFEGSADRVIIRDEMTIEARNLF FT WEAKNMQAALNMKYIWPGRDGKVLLRRCDGGKVHEVGSKLQLLKLSEQLSQ FT DIATVE" FT CDS 1521..4403 FT /product="Crack-24_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDNFYYDNLYCLNKNSQYINRPFSFKILQLNIRGMNR FT LDKFDAVKEFLSLYSGQIDIVVIGETWVKAERKSLFQIEGFKSIFSCRENT FT VGGGLAVFVKSKFEFEQVTCMHDEGLHHIHVRLKIQESDFNVHAVYRPPCF FT DLARFYSRMESVCSIHRKSASSVVVGDMNIPVNLVHRGVVSEYQDLLSCYN FT FAVTNTYPTRPASNNILDHVICSENLQRNVTNETVCTDISDHCFILSTLHL FT KRPIRTINLEKTVVNYTKLNNAFQAAMQNMPQSNANVKLLYVLNTFQSLRE FT KFSTVVKVQAKIKGSCPWMTFELWKLICIKDKVLKSCRKRPNDTGLQELLA FT HVSRKVQYAKLWAKKNYYSNLFLASSPKNTWENINELIGRRKDSDNVVKLM FT VDGQITSCGATIADKFNQFFSTIGPQLASTMNSSNEINKFNTLRLNNSTVF FT LRPATEREVLLKINALDSKKCSGPDGISATFIKVHHEFFATLLTEVFNECI FT SSGTFPDSLKIASVTPIHKDGSKEDVNNYRPISVLSMLSKLLEQLFVSRLL FT SFLEDQHVLYRHQFGFRAGSSTWTATCELVDSIYSALDNRKISGVLFLDLK FT KAFDTIDHRILLQKLEYYGVRGVALNFIRSYLTNRKQYATVNKSKSRLCPL FT TVGVPQGSTLGPLLFLLYMNDLPMLKLNGKPRMFADDTSLSYEVDNPEQLI FT RQMSEDMTQLQAYFTENLLSLNLSKSKYMIFRSTRQRIPDHGELSIGSQVI FT EEVDEMKYLGLTFDPTLTWRGHVEKLRRHISSFCGVLWKIAKFVPMKQLNT FT IYHAFIQSKLCYLVSIWGTAPKTVIKPLQVTQNRCLRIVYDKPRLFSTKIL FT YEQADQSTLPILALCELQSLVCMQNLIKNPTAHHNFNLQRPDHLHNTRHQA FT HLAVTRRNTENGKKAFDYRGKISYNSLPDTVKSEHSIFKFKKVVRRHIMSL FT LERFII" XX SQ Sequence 4632 BP; 1466 A; 897 C; 997 G; 1271 T; 1 other; ctggcaacac tgaaaatcat cgacctaagt gactaaattg gagtgcaaat tattacagta 60 aattgtgatc gaaatcagat tagtaaaatc gtactattgc gatttgcttg aaggcgaagc 120 tgttgtgctg gggtccacga tatagtttgg ctgaaaaaca taagtaccga cgttattctg 180 gtaatgatag atttgcagta caactcacga tctgcattat acattatcac tcaatacgtt 240 ctgcgatcta ttgaagtagt ggttggtgct gacggtcact ctggagtggt attacacact 300 gatacctact gtcctatcat tgtttgcgat tggctcaaca catttgcgat ttgttttgat 360 cggtcgtgtt aagtggtaag taccactgat ctgttcggtt gaatgatgta cgaagatgaa 420 agcgatatca tttgttatgt ctgcgaaaaa tcagaggtca attcgttact aacaattgag 480 tgtgccaatt gtagcaaatg tgctcacttc aggtgtaaaa agctgtttgg gaaagccgtc 540 gctaaagcta ggaagaaacc gtacttatgt tcagtcgaat gtgcaaaaat ccatgcctgc 600 actgatgaaa aatctccggc attaatttcc gaaatacgtt tgcttggaca ggctattagc 660 gagtcgcaga aagaagcggt tatggtcagg aaggctctcg agctgacacg gatgcagtta 720 gatactctcg tcaaaacaag caaaggcatt gaggattcgc aacagttcct atcaaatcag 780 tttgatgatc tgagggcaaa atttatcgag tttcgtgaag agatcgacgt gatgaaggaa 840 gagaataata aagccagaga ggagttcaga gatctgcaac aaaaatatca tgcgctgctt 900 tcgtccgtag actcgatgga atctaaactg cgtagcgtga gtcaggctgc tgttgccaac 960 aaagttgtca ttctcggttt gccgctcacc gatgatgagg acctaaaagc agtcgtctgt 1020 gatgttggca ccgcagtagg gatcgagttg tctccatcag tcatcgaaaa cgtgcaacga 1080 attttcagta aacgaaatca aaattcatcg gtacccatat tggtgtcctt ctctaatcat 1140 gcaactaagg aaaaactgtt tgagtgcaaa cgcagatatg gagcattact tgcatccact 1200 gtatcgagga agttcgaggg atcagctgac cgcgtaatta ttcgtgatga gatgacaatt 1260 gaagcccgaa atctcttctg ggaagcgaaa aatatgcaag cagctctcaa tatgaaatac 1320 atctggcccg gtcgtgatgg caaagtgtta ctgaggcgtt gtgatggcgg caaagttcat 1380 gaagttggta gcaaactgca gttgctcaaa ttgtccgaac agctgtcaca ggatattgca 1440 acggtagaat agccctctgg tctgtcctga tcacagaaat ttacgtcgtt atactgcaat 1500 aaggttatct ctgaacatca atggataatt tttattacga taatctgtat tgtttgaata 1560 aaaactctca atacataaac cggccattta gttttaagat tcttcagcta aatataagag 1620 gaatgaacag gttagataaa ttcgatgctg taaaagagtt tctttcactc tactctggac 1680 agatagacat agtagtgatt ggtgaaacat gggtcaaagc cgaacgcaaa tcgttatttc 1740 aaatagaggg ttttaaaagc attttctcat gtcgcgagaa tacggtgggt ggcggactcg 1800 cagtcttcgt taaatccaaa ttcgagtttg agcaagtaac gtgtatgcac gatgaaggtt 1860 tgcaccatat tcatgtacgt ttaaagatac aggagtccga tttcaatgtg catgctgttt 1920 acagaccacc atgttttgat ctagcacgtt tttatagtag aatggagtca gtatgctcga 1980 tacaccgaaa atccgcatcg agtgtggttg ttggggatat gaacattcca gtaaaccttg 2040 tacatcgagg agtagtaagc gagtatcaag atcttctcag ctgctataac tttgcagtta 2100 ccaatacgta tcccaccagg ccggctagta acaatattct cgaccacgtc atctgctcag 2160 aaaatctgca aagaaacgtt accaatgaaa ccgtttgcac ggatatcagc gatcactgtt 2220 tcattttgtc tacgttgcat cttaagaggc ctatcagaac gattaatttg gagaaaactg 2280 ttgttaatta caccaaattg aacaatgcat ttcaagctgc catgcaaaat atgcctcaga 2340 gtaatgcgaa tgttaaactt ttgtacgtgc taaacacatt tcaatcgctc agagaaaaat 2400 tttccacggt ggtgaaagta caagcaaaaa tcaaagggag ttgtccatgg atgacttttg 2460 agttgtggaa attgatatgt ataaaagaca aagtcctgaa gagctgtcgg aaacgtccca 2520 acgatacagg cttgcaggaa ttactcgctc acgtctccag aaaggttcaa tatgcgaagt 2580 tatgggctaa gaaaaactat tacagcaatt tgtttctcgc atcgtctcca aaaaacacct 2640 gggaaaatat taacgaactc ataggtcgtc gaaaggatag tgacaacgtt gtcaaattga 2700 tggtagacgg acagataaca agttgtggag caaccattgc cgacaagttc aaccagtttt 2760 tcagcaccat cggaccccag ctggcttcga caatgaattc aagcaatgaa atcaataaat 2820 tcaatacgct acggctcaac aacagtactg tattcctacg tccagcmaca gaacgtgaag 2880 ttttgctaaa aataaacgcc cttgacagca aaaagtgcag tggaccggat ggaatttcag 2940 caactttcat taaagttcac catgaattct ttgcaaccct attgaccgaa gtgttcaatg 3000 aatgcattag ctctggaaca ttcccagata gtttgaaaat tgccagtgtg acaccaattc 3060 acaaggatgg gagtaaagaa gacgtgaata attacagacc tatttcggtg ctgtcaatgc 3120 ttagtaaatt gctagaacag ctttttgtaa gcagactttt gagcttctta gaagaccagc 3180 atgttttgta taggcaccaa ttcggattcc gtgcaggatc aagtacctgg actgctacct 3240 gtgagcttgt agacagtatt tattcagccc tagacaatag gaaaatctct ggagtactat 3300 ttttagacct gaagaaggcg ttcgatacca ttgaccacag gattttgttg cagaagctgg 3360 agtattatgg tgtaagggga gttgctctaa atttcatccg aagctattta accaatagaa 3420 agcagtatgc gacagtgaac aagagtaaaa gcaggctttg ccccctgacc gtaggagtcc 3480 cccaagggag cactctggga ccgcttctct ttctactgta catgaacgat ttgccaatgt 3540 taaaactcaa tggtaaacca cggatgtttg ctgacgacac atctttgtct tatgaagttg 3600 acaatcccga gcaactgata cgacaaatgt ctgaagatat gactcaacta caagcatact 3660 tcactgaaaa tttgttgtct ttaaatcttt cgaaatcaaa atatatgatt tttcgctcaa 3720 cgagacagag gattcccgat cacggtgaat tgtcaatagg ctctcaagta attgaggagg 3780 ttgatgagat gaaataccta ggcctgacgt ttgacccaac tctgacatgg cgtggacacg 3840 ttgagaagtt aagacgtcat attagttcct tttgcggagt cttatggaag attgcaaagt 3900 ttgtaccaat gaaacagcta aacacaattt atcatgcttt tatccaatcg aaattgtgtt 3960 acttggtatc aatttggggt acagctccaa agactgtcat caaacctctg caggtcaccc 4020 aaaaccgatg cctcaggatc gtgtatgaca aacctcgatt gttttcaaca aaaatactgt 4080 atgaacaagc tgatcaatca acactcccaa tcctagcact atgtgaacta caatccttag 4140 tatgtatgca aaacctcata aagaatccaa cagcacacca taactttaat ctacaacgac 4200 cagaccattt gcataacact cgtcatcagg ctcatctggc tgtaacccgc aggaataccg 4260 aaaacggtaa aaaggctttc gactatagag gaaaaatttc gtataatagt ttaccagata 4320 ctgtgaaatc cgaacatagt attttcaaat tcaaaaaggt agttagacgt cacatcatga 4380 gtttgttaga acggttcata atatgatctg atcgatgatt tgagctagtt agaagttaga 4440 agagaccctt aaaaggaatt agtttccact gggtttcaac gtcaatgtag agaatacccc 4500 acaaatgtat agcatatatt caatgtatta tgattcaata aaaaaaaata aatcagttgc 4560 gaccattacc agggagatca acctcaagag ctctctggtg tgggggagag tggagggcgc 4620 aaaaaaaaaa aa 4632 // ID Chapaev-6_HM repbase; DNA; INV; 5402 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5402 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 32-32 (2008). XX DR [1] (Consensus) XX CC Chapaev-6_HM is a very young family of autonomous Chapaev DNA CC transposons that can be still active in the hydra genome (they CC are ~0.1% divergent from their consensus sequence). The consensus CC sequence was obtained based on a multiple alignment of 4 copies; CC it codes for a 997-aa Chapaev transposase (ten exons). CC Chapaev-6_HM is characterized by 4-bp target site duplications, CC 12-bp terminal and 21-bp subterminal inverted repeats (separated CC by 16- and 6-bp regions from the 5' and 3' TIRs, respectively). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(466..1407,1534..1826,1921..2133,2260..2499, FT 2689..2765,2873..3069,3235..3564,3726..3884, FT 4124..4316,4537..4883) FT /product="Chapaev-6_HMp" FT /note="Transposase." FT /translation="MTESHETKLKTLCRICGESVDNDSVLLTKHIARIEFC FT FFIRIDKDHSNIHPQKMCYRCFSILRNIEKGSNTELKLRSWPENCNLTECV FT CFKSNKGRKKKKKISGRPSSISYKENIWTRHKINNIQSKLSPQTKLNLKLQ FT DINCILNPHVHLCVCTICNDIMHKPIIIKNCLHSFCASCLLPLIIGKQIAK FT TKCPKCSFSIPSDGLISSTNVIEMIENLQEVCKQGCGRLFKITQLSERKKH FT QKTCQKTQISSSTTLSLPTSSSSTSQINLSDIFALDSTSIIPRNIEDAALH FT VIKQKMSQTTSNVIEFPTGGPRPLCFTSTPRAYKESSDVCLRTIRRRQLQL FT KHNMSKTCGESISSTIKQTATLLKSFNENEKEHILSKSNISKAKISAEEMI FT SLKANMSSTYVNMKILSRWLKNNNVICASNAKQRNVAKNWSCDDLIVKNAP FT FMVEKKDSKGSYEIKELPCAYIENLQGHITNVLDRLDSNNLLLYNKIKDNE FT IHIKIGGDYGGDSFKMFYQVANVEKPNAKTNTTIFNIFEAKDYTTNLKISL FT ARFTSDIDLLQKMIWKEKQIRVFVFGDYEFLCAIYGITGANGRHSCLFCNI FT TSQGMSNPINDNIQMRSLKTLDLSLAKFNNHGADPKFAKLCDNVIDQRLFN FT VPLDQIGVPALHISLGIYLKFFNMLEDSCHTIDIKIAGLMAVNNQMLDCEE FT FNAYIEHQRQINQLQISVQELEDKIRVITEALEMQILFNAENEEKIKLIFE FT PHLIHFGKKKKEQISELKILLEADHIKKSFGPLVKELDKVLNSLGVQRQAY FT HGKSFVGNHVNKMLKEKSILELCNSIPNLVVELGFNDTNIHKETIEICKNF FT NVLFSKFGICHKLINSCNQFNEDNKQDLENQIKDFMKYFRENWPNASITPK FT LHMLEYHALPFIRKWGVGLGTYGEQGGESIHAEINRMKSTYCHMKGVGRLR FT SIMNEHFIKNNPNVKKYQKKAQPRKRKIEDTKNNSIKYCKMI" XX SQ Sequence 5402 BP; 2180 A; 746 C; 720 G; 1756 T; 0 other; cacggtggtt taagtttaag catttgcgga catttattat gcgcatgcgt tactttaaaa 60 acaacaacaa cattgtaatt ttaatttttt tttgttatgg ttttttatcg taacctaata 120 aaaaaggtca tatcattgct ttgaagatgt ttataccaga tataaattaa taaactaata 180 aaatatatgg ccaaggaaaa catatttaat ttgaaatact taaatttgtt aatttttatg 240 taataaaaaa cgcgggaaag tcggaaatta ttttatggtc atcctaatta ctaaaacata 300 ataattaagc atttattaga atttttatgc cttaaagctg tatatatttt aaaagaagaa 360 ggtttcaact ttataattaa aaaaaatcag agtccggtta cttttttaag tttaaaaatt 420 ttgtatatca aatcttatat tttagcaata aaataacata caaaaatgac tgagtctcat 480 gaaacaaaac taaaaacatt atgcagaata tgtggagaaa gtgtagataa tgatagtgtt 540 cttttaacaa aacacattgc tagaatagaa ttttgttttt ttatccgaat cgataaagac 600 cactcaaata tacatcctca aaaaatgtgt tatagatgct tttcaatatt aagaaatatt 660 gaaaaaggtt caaacactga actaaaacta agatcatggc ctgaaaattg caatctaact 720 gaatgtgttt gctttaaatc caataaagga agaaaaaaaa agaaaaagat aagtggaaga 780 ccttcgagta ttagttataa agaaaatatt tggacaagac acaaaataaa taacatacaa 840 tctaaacttt cacctcaaac aaaacttaat cttaaactgc aagatattaa ctgcatctta 900 aaccctcatg tgcatctttg tgtttgtact atttgcaatg acattatgca taaaccaatt 960 attataaaaa actgtcttca ttcattctgc gcatcatgtc tattaccact tatcattggc 1020 aaacaaatag caaaaacaaa gtgccctaaa tgttcctttt caattccaag cgatggttta 1080 atttcatcaa cgaatgtcat tgaaatgata gaaaatttac aagaagtatg caagcaaggt 1140 tgtggtagat tgtttaaaat cacacaactg tcagaacgaa agaaacatca aaaaacttgt 1200 cagaaaactc aaatatcaag ttcaacaaca ttatcattac caacatcatc atcatcaact 1260 tcacaaataa acctttcaga tatatttgca ttggattcaa caagtattat accaagaaac 1320 attgaagatg ctgctctgca tgtaataaaa caaaaaatgt ctcaaacaac atccaatgta 1380 attgaatttc cgacaggtgg accaagggta agactatttt tacaaaagta gagactgtaa 1440 aattactata acaaaacaaa aatgctttta attttataat agaaaactat agtatgattt 1500 taaatatttt aaataaatta cacaacattt tagcccttat gcttcacatc aacaccaaga 1560 gcctacaaag aaagttcaga tgtatgttta cgcacaattc gtagaagaca attacagcta 1620 aaacataata tgagcaaaac atgtggagaa agcatcagtt ctacaatcaa gcaaactgct 1680 actcttttga aatcttttaa tgaaaatgaa aaagaacata ttctgagcaa atcaaatatt 1740 tcaaaagcca aaatatctgc tgaagaaatg atcagtttaa aagcaaacat gagcagtact 1800 tatgtcaaca tgaaaatatt atcaaggtac acaagattgc aattattata tagaaatctt 1860 aaaaaaaaac aaattcaata cttttaatag ttagtgatga taaaaaaaac attattttag 1920 gtggttgaaa aacaacaatg taatatgtgc ttctaatgca aaacaaagaa atgtggctaa 1980 aaattggtca tgtgatgatc ttattgttaa aaacgctcca tttatggttg aaaaaaaaga 2040 ttcaaaagga tcttatgaaa tcaaggaatt gccatgtgca tatatcgaaa atctacaagg 2100 acatataaca aatgtgctag atagactaga taggtacaaa caaatctttt ttaaaaaaac 2160 tgtaacatat tacaattaat taaattactt aaaaatagta caataaattt aatgcaatac 2220 tataaaaaca aaaattaatt tttcttgtta taattttagc aacaaccttt tattatacaa 2280 caagattaaa gacaacgaaa tccatataaa aataggaggt gattatggag gtgattcgtt 2340 taaaatgttt tatcaagtgg cgaatgtgga gaagcctaat gccaaaacta atacaacaat 2400 atttaatata tttgaggcaa aagattatac cacaaatttg aaaatttcat tggcaagatt 2460 tacatcagat attgatttgt tgcaaaaaat gatttggaag taagaacata ttacaaacaa 2520 caaaataatt caattattca aacacactac aattcataaa catccatcca atatttgaca 2580 taaacaaaat catcaactat tcatttctta ccacaattgt caggaatatg atggattaca 2640 gtttcttttt aaagaaatgt ataataattt aattctgttc aattttagag aaaaacaaat 2700 tcgcgtattt gtatttggtg attatgaatt cttgtgtgca atttatggaa taactggtgc 2760 aaatggtgag tacatattta atacagattt ctttaaaata actcatctac gccagatata 2820 tggaacatga tttattttgc ctttttaaca cgaacttgtt cattaacttt aggtagacac 2880 tcatgcctgt tttgcaacat cacaagtcaa ggaatgtcaa atccaattaa cgataacatt 2940 caaatgcgat cacttaagac actggattta tctctagcaa aatttaacaa ccatggtgca 3000 gaccctaagt ttgctaaatt atgtgacaat gttattgacc aacggttgtt taatgtgcca 3060 cttgatcagg tgctttgttt ttattgatta ctcttatttt atttattagg cattatttgc 3120 aaattcgcat aaataattgt ttattcaata taatacaata caagtttttt ttgcggtaaa 3180 tgtaaatggt ggtgtatgca attgtaaata atataaactt tcttgtacta ttagatagga 3240 gttcctgctt tacatatctc acttggtata tacttaaagt ttttcaacat gttagaagac 3300 tcttgtcaca caattgacat aaaaattgca ggtctaatgg cagtaaacaa ccaaatgctt 3360 gactgtgagg agtttaatgc atatatagag caccagcgcc aaataaatca acttcaaata 3420 agtgttcaag aacttgaaga taaaatacgt gttataacag aagcacttga aatgcaaata 3480 ctttttaacg ctgaaaatga agaaaaaatt aaattaatat ttgaacctca cttaattcac 3540 tttggaaaga aaaagaaaga acaggtaact actttaaatt ttattattca ctaaaaaata 3600 tttaatttag attggtaaac aaacccagaa taatacagtg gataatctta tacattggat 3660 aatctaagtt ttgctttatt gattacattt ttgtgaattt ttcataagaa catatttaat 3720 catagatctc agaattgaaa atactgcttg aagcagacca tataaagaaa tcatttgggc 3780 cacttgtaaa agaactagat aaggtactta attcattagg agtacaaaga caagcatacc 3840 acggaaaaag ttttgttggt aatcatgtta acaaaatgtt aaaggtaaaa tatatctaat 3900 atatatttcc taaattgata aaatctactc tttactaaaa acaaagtgta cctaaattta 3960 aatgtacaat acaatataaa tatatgtgtg tatgtgtgtg tgtgtgtgtg tgtaggtgga 4020 atagatggct tatactaaca ttactagaca taactaaata cgtaattaat aacagttgct 4080 acatataata catatttcac aactatttta tatactaaat taggagaaaa gtattcttga 4140 actttgcaac tccataccaa accttgtagt cgaacttggg ttcaatgaca ctaatataca 4200 caaagaaact attgaaattt gtaaaaactt caatgtactc ttctcaaaat ttggaatttg 4260 tcacaagctt attaactcct gcaatcagtt caatgaagat aacaaacagg accttggtat 4320 gtcatatgtt attttttggc agaacaaaat aaatagagaa aatcatttta attatttgat 4380 ctttatcttt gtttttttaa gtcattgtgt tatatatata tatatatata tatatatata 4440 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4500 tatatatata tatatatata tatatatagg ttacagaaaa tcaaatcaaa gatttcatga 4560 agtattttcg tgaaaactgg ccaaacgctt caatcacacc aaagctccat atgctggagt 4620 accatgcatt acctttcatt agaaaatggg gagtaggact aggtacttat ggagagcaag 4680 gtggagaaag tattcatgct gaaatcaacc gtatgaagag tacctactgc catatgaaag 4740 gagtaggtag attaagaagc ataatgaatg agcatttcat aaaaaacaac ccaaatgtaa 4800 aaaaatacca aaaaaaagct caaccaagaa aaagaaaaat tgaagatact aaaaataatt 4860 cgataaaata ctgcaaaatg atttaaagat aataaatggc agtaataaag cctccataat 4920 atttctgtca gcatttttaa cctttcaaaa aaaacgaaaa cttaccaatc tttaaaagca 4980 ttatattgta acaaaaaaaa gtaaccggtt tttgattttt gctttaaatg attcacaaac 5040 tattatattt tttaaaaaaa atcgaggtta ttactaaaaa atttaaaact ttagaatata 5100 gactataatg taagcacagg ttaaacttcg cctaaaaaat agctaagttt taatttagtc 5160 ccacttctat tctctttgta aaattattta aatgatagct ttgatgacac ataccatacc 5220 taaaatattg ttttaatatc atttcttata ggcaatttag aaatatctac aaataaatct 5280 agtttaaagt ttagacaact ctattagtag ttttctgttt acttgcgaac attattgttt 5340 accaacattt gaataaaata tagcgcatgc gcataataaa tgtctattac taaaccaccg 5400 tg 5402 // ID TransibN5_DP repbase; DNA; INV; 1468 BP. XX AC . XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 13-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE TransibN5_DP is a nonautonomous DNA transposon - a consensus DE sequence. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; TransibN5_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1468 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC TransibN5_DP belongs to the TRANSIB family of DNA transposons. CC This element is characterized by 5-bp target site duplications. CC TransibN5_DP has imperfect 37-bp terminal inverted repeats (3 CC mismatches). The consensus sequences was built from 2 copies CC 95% identical to each other. XX SQ Sequence 1468 BP; 496 A; 226 C; 262 G; 484 T; 0 other; cactgtgggc cagcgacccc atttttcgac gttaattcga aaaaatgcat gtcaatgaaa 60 cttaatttta attttacatt taattagtat atacattaat taaaaaaaat caaaaatctt 120 ttttgtttta atgcatcttt ggcctttttg caatgtttta aagtcagctg tttgataacc 180 tgcaatcgat agtaaaaaac gctggaaact gtttgtttac tttgagagct taccaaattg 240 aaactaagtt caaaaactga aatcaaatta atggaaaaga aaggccaggg tatgtagagt 300 ttttatatta aacacatttg tgctcatgtt tttgaatatg tttttgtttt tattaatttg 360 tgtcgtccaa aggtagagaa tacttaacca atgtgcccgt ctcttatggt caataactaa 420 ttttgtgcgc aattgtgcga gaaagtaatt atgtaatatt gtagtgcaaa ctgtgtataa 480 aataatccta aaactcgttt ttgcgccatt ttgcgccggt gtgcgtattt taatttattt 540 gtaggtgagg tggagttcac ttatgaaaag ctgctcagta tttacattga gaataatcga 600 agcgccgcag cactttcgag ttggattttc gaagaactga aggcatacaa aataagcacg 660 gagaaaaagt atttggcaaa atgcgggaga agcatttaag aaaaagcata actcatcgct 720 tttgtccaag atggttatca aagttgaaag aacgtcgtca tcgaattcca tgacttcaac 780 atcaagatca gttggacgcc caccaattga atatgtaaat gctggacccc gttgaaagac 840 acatttctta ctccgagaaa ccccaatacg gtaaagaaaa atctttttac cgtaaaaaat 900 gagtcggtaa cactaacggc ggatgaggct ttagcatatc tgcttgaaaa ttcattaact 960 aaaagccaat ataacaactt tcgcaatctc agcaatagca aatcgtgtga tcttcttcca 1020 gcttacaaca aagtaagaga aaatattttt tataacaata atgcttccct agaaactgat 1080 acttattctg gtatgtgtga aagtgacgaa tcggattacg ataatgaaaa ttcattttca 1140 atagaaatgg atgtgggaga agattaatta taaattcgtt gtactttgaa aaacttcaac 1200 attcatcccc ctctaagtat taagtttaag tgtaaaaatt ttacttttta agaattttaa 1260 taacatattc tattataatg gactacgtgt taaaaatcaa tattagcgaa tcttttcttt 1320 tatgggcgtt tggtgtgggc gtggtcggat cggtctaaaa ttaagatata tatatataca 1380 cgtcgataat attttgtgta caaaatttga agtctctagc tttattattt taatttacgt 1440 caaaaaatgg gatcgctggc ccacagtg 1468 // ID DNA2-5_AP repbase; DNA; INV; 218 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-5_AP. XX NM DNA2-5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-218 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1940-1940 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 2 bp TSD. Putative Mariner element. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 218 BP; 44 A; 66 C; 65 G; 43 T; 0 other; cggcccacga gagagtatca ccgtcgcccg acgaaaactc gtccgattcc gcgcattccc 60 cactactcgg cattgccgat agctgttcgg caacggtcgg ctgcgctcaa cgctatacgg 120 tgtatagcgg tcgtggaaat agtgggggaa atggggcctc ggaacggtct ccgccgtcag 180 tctgcatgcg caaaacggta atactctctc gtgggccg 218 // ID Polinton2_SM repbase; DNA; INV; 11578 BP. XX AC AAWT01092375; XX DT 03-JAN-2008 (Rel. 13.01, Created) DT 08-FEB-2008 (Rel. 13.01, Last updated, Version 2) XX DE Polinton-type family. XX KW Polinton; DNA transposon; Transposable Element; Polinton2_SM. XX NM Polinton2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-11578 RA Jurka J.; RT "Polinton2_SM: Polinton-type element from the planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(1), 23-23 (2008). XX DR EMBL/GenBank/DDBJ; AAWT01092375; Positions 35903 47480. XX CC Regions masked by "n" contain unrelated repetitive elements. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 266..691 FT /product="Polinton2_SM_1p" FT /translation="MIKEILVIILVFSVFKMANSIVVTVQSKNSTKFVCPD FT TAGLMDIFYMIDRPRYLKTPVKRLSERMSNLEIRFELTYETLKRAWNAIYD FT ELLYLNQTIGIIKNELRKSIFKLNENRKLKEKMINQVQYSIKFILIKQLIS FT SIV" FT CDS 4523..4780 FT /product="Polinton2_SM_2p" FT /translation="MKRMSIFQSFPTFVVDDLLKILLKVVRGKIKISKSKK FT LVLNKHRKSLLSLVNNKNRKQMRKIIYKQQDNFIGAMLPLAISLLSGIQ" FT CDS join(8886..10226,10175..10807) FT /product="Polinton2_SM_3p" FT /translation="MITPSFVIYADFEAVLPKDIKHHQIHMPISAGLLLIN FT NNTNTTQYFSFIGLECVLEFLKKVEEIAVTIVLPYYENHGKKPMNNLTSNE FT EQQFIACRTCYFCKNLIRNKDKDHDHFTGQYLGAACCNCNINRKISKQLPI FT VFHNLRGYDLHHILKYGLNEFTSWKLNIIPTTTEKFISLIVNIKKLAVRFI FT DSMQFVNCSLAKAVKTLTDLPLTFYAFDGLIVKAKGIFPYDFAISLEVLKS FT TYEPPKKWDSVSDDEYATAQLIWTQQNCTTMLDYMLTYLKLEVFLLAYYFQ FT QFRAKAIAYNCLEPLNFYGIPGMSRASALVALKEPIEPLQDMGMFNFYEGG FT IRGWLTFVNKHYVASSDDTELLYIDINNLYGWALSQNLPYGDFVWVHENLN FT DVLSECVNANIEFLPYGYSMEVDINIPERVYHYLNDFSLAPEKMCPPNSKL FT RSLFACSGKNVSAQFKVEKLMLTHYSKKNHVLHWRLLKLYIELGAKIECIH FT RAIKSKQAPIFKDYIQRNTDLRSNSKCELHRDLYKLFNNSLYGKSVENLIK FT QMNLRLCNSAEKMIVFASKPTYKKFIKIVDDLIAAHLNKDSICLDRPSYIG FT QLVLDLSKLLMYQLQYNELAKYRSEFYCDINIVAGDTDSFSREDKHCMDIG FT KLQYLYLRK" XX SQ Sequence 11578 BP; 4124 A; 1557 C; 1629 G; 3899 T; 369 other; cttatccttc aatatttcat atatatctgg ataatagaat tccaggcgtt ctttgataac 60 tttaaaataa tcatctgtaa aacaagttaa aataataata ataaattacc acaaagttca 120 atttcattca ttcattgaac tgcgctcggt acaataatat ccattataat gataaaaatt 180 gaaataacaa ttacttttat actaatttaa acacacacaa acttttatct ataaacaata 240 aatattaatt tatatagatt tgaaaatgat taaagaaatt ttagtaataa tattagtttt 300 tagtgttttt aaaatggcta actcaattgt tgtaaccgtt caaagtaaaa attctacaaa 360 attcgtatgt cctgatacag ccgggttgat ggatattttt tatatgatag ataggcctag 420 atatttgaaa actccagtca aaagattatc tgaaagaatg agtaatttag aaattcgatt 480 cgaactcaca tatgagactt taaaaagagc ttggaacgca atatatgatg aacttctata 540 tttaaatcag accataggta ttatcaaaaa tgaattaaga aaatcaatat tcaaattgaa 600 tgaaaatcgc aaacttaaag aaaaaatgat aaatcaagtt caatatagta taaagtttat 660 tctaataaaa caactgatca gttcaatcgt ataatgaaag ttcaacagcc aatacaacct 720 gttaagagtg gcaacaaatc atcaataatc acatttccat tatacatttt tacctatttt 780 cttaaataaa gacattaaaa cttcccaata tccatgcatg attctgattg gtcgaatata 840 gtataaataa gcttacactt ccaaatattt ttataaaatg ccagaaaact aaatttttta 900 taactttgtt gaatgagctg attgttaata atggagatga tgatgatatc attaaaaaca 960 taaaattcat ggtcaaccgt accgagaaaa tagtagaaga tatcccgttt gagcatatta 1020 tccatttgcc aacattggag agaaagaaaa attcattcca attgatattt agttatatgt 1080 ttaatgaaaa taataaatgg atgcgaataa tagtaggatt aatgtatatg aaacatatga 1140 tattaaaatc cggatccgac tatgaaaatt atttccaatg gtctatagag atatattact 1200 ttatattatt attttatttg ttatcattat ttaggtaata gaaacaaaaa ttgatccatg 1260 ggtagaagga aacggtgggt gggtaacaat tattcaagtt gacgcagaag ccaagcgttc 1320 agatatttat atttcaaatt gcaaattttt attaaaacta ttaactggag catcggtcat 1380 atatatttgt aataaagttt accatgaatt ttaataaatg tattattctt tgttatttac 1440 tgaattatta ataaataact acatttttaa agatttattg attattacat gtatatgctt 1500 catcaccaca atattctatc attatatttt ctatgacatg gtcaacacaa gtcttaaatt 1560 cgttaggctc aactaaattg gggtttcttc attgttaata atttcagtat cttcattatt 1620 aataatttca gtattttcat tattaataat ttcttcatca tcactcttac attctgaatc 1680 acaacatgaa tcttctttac ctaatgcttc aactagagtc aacatatctc taattttcac 1740 accggttatt tcttcaagtt tataaaacaa aaagtttatt aaatattcta aacgaaatta 1800 attaattatt ttatatttta cctttttcag acatcatttt taaacaattt taaaaataat 1860 tataattttg tttatagaac atgttgtaaa gaaaataaaa tctgtttaaa tacatttatt 1920 aacattaaac actaaaatat acattaaaat aattgtgaat gtctagcagt aacaattttc 1980 cagatcgtag aaagcacatc actttgatca gctttatatc cttggattgt tattgttaaa 2040 taccaatttt tctgttcttt gaaacctatt tctgtaaatg catcataata cctagtatac 2100 tgactggtga agaatgcatt ttcattgcat tttaaattcg aattgaactt attaaaaacc 2160 gtatcaatat ccttgatatc tatcattgag agtgtccagc aagagattga aattgagttc 2220 tattgagttt gactggattt accaattgca aattgatagg ctcctttaaa aaatccattt 2280 tataactgtg gaacatctga actagttgga cattgatagt agagctcact attttcttag 2340 tcatgtttga tttatgatat ttataaagtg gtgacgagct tttatacact tgctagatac 2400 tgattaacca atcagaatcg ttgttgcatg tatgtatttc tatacaagtc tattacttat 2460 ttctattatt actaactatt atttttttta ttattattat tatgtgttgt attattatta 2520 ttttattatt attatgtgtt gtattattat tattagtatt attatttcgt ttatcatata 2580 aaacaatgac taaataaaat tattactatt aaaacttgtt attactaact attaatacta 2640 aaaacaaatc aaaaaacaaa tcaaaaacat gtaacacaaa acaattattg tcttaaaagt 2700 tgtagtttac aaaagttcct tacggattgg acaaggcacc attcaaagta ccgctgataa 2760 cggaaattgc agcaactaca atccagaata tatgggtatt cagttgccat ttccgttttc 2820 agcctgttga tttcattatt ccattgagcc tgattgatca gctctgaaat cagtattttt 2880 gtctcattaa atatgaactc tggaattatt gttgctgcct tatcctttat tatcgcattg 2940 agttgatgat aataatactg gagtcttcca attactaggg tttcatattc ttctaaataa 3000 aatactaata ataaaagaaa ataaataaat aataaattta ccatatcgaa tgttatttgt 3060 catcggctta gcatttatat tttcgatttt aaaaaatgaa ataaaatgaa taagcaagtt 3120 gttttaatca aacattagcc aatcagatgt cgaccaatcc cgtgatttca ttgtgtatgt 3180 ctgtaacaat atttttgtac tttagacttt gcaaccttaa tatgactatt ttcccactgg 3240 caaacacgtt aacacacatt gggttgtatt ttaaaagcct cagtacagga aataattctg 3300 gctcgaacat acaaagacat tttgttgata tgttgtatag attgatagaa tgttccatat 3360 ccatggtaac cgtaattgac tgtattttaa tatcctttat tctatattgt aatttattga 3420 catctaaagg tttcttacat cccatgattc tgcatttgcc ggatttgaaa aatattatag 3480 ggtatttttc actcctatca aaaatttgat ggggtttacc attcggaaac gtcatttttg 3540 acacgtcgaa agttccttta aaatttatat ttgacaacat aattaaacaa taatgaaata 3600 ttaatctata tatatatgat ttttagaaag agttcattaa attatataat ataaacaaaa 3660 acctataatt atacaaatta ataatggaat taagaaatga tgcggtataa tcaaattgtc 3720 ggcccaagtg gtagtggtaa aactatgtct gtgtgtaaat tactgaaatc taatttattt 3780 caaacaaaat tcaacaagat ttattggcat aggggtgcag atgaagaaca tggattaact 3840 caagataatt tctgtaaatt gaaaaacatg aaaatagtaa agggtttcga taaaaactgg 3900 tcaagtcgat taaggaaagg tgatgtcatc attatcgatg atttgtatca ggaagctaac 3960 aaagaaaagg attttaatga tttattcaca aaaatccgta gacatgttag tgttactgtg 4020 atctttatga ctcaaaatat atttcatcag ggtggcagac atcaaacaag gaatttaaat 4080 gttcaatatt tagttatttt caagaatcct agagatgcaa aagttattga tttttcctaa 4140 taattgcaat tttcatacca gtgcatttca agatgctaca aaatcaccac gtggatatat 4200 atttcttgat ttcacacaac aatgcaatga tgatttacgg gtaaaaacag atattttcat 4260 ggtttataaa ctaatttgaa aacattaaca tgtaaaaatg tttatgaaat ccatgcttaa 4320 aaatgtctga aaacgtagac gaagtattaa aaagaaaata aagactgaaa aagttccacc 4380 aaagcctagt tataaacaat ctaaaaaact aataaccgag aattttcccg caataaatct 4440 aatttcgaaa aaccctatta aaaggagtaa taaagagtca caaagtttcc catattttaa 4500 tgcattgtta aaggcttcga gtatgaaaag aatgtctatt tttcaatcat ttccaacttt 4560 tgttgtcgac gatttactca aaattctatt aaaagtagtc agaggtaaaa ttaaaatcag 4620 taaatctaaa aaactagtat tgaataaaca tcgtaagtct ttgttatcgc ttgtaaacaa 4680 taaaaatcgt aagcaaatga gaaaaatcat atataaacaa caagacaatt ttatcggagc 4740 aatgttacca ttagcaatat cattattaag tggaatacaa taaacgatgg gacaatcatt 4800 acttaatgtt aaattaaaat ttaaatcatc ttgttgtaat ggagaaattt tatacaaacc 4860 cagaagacga agcagatttt ggtggagtgg caaagttaaa gaaaagagtc ccaaattcga 4920 aaaaggaaac gcaaaagtgg ttgtcagatc agcttgcgta cagtttaaac aaaccgatcc 4980 gaaaaagatt tccaacaaaa gcttataaga tattcggtat taatgattta tggcaaatcg 5040 ttttgcccac ttcagacaaa atctgcagag gaaatatcta aagcaattaa tattttattt 5100 aaaaatgagc atcctgacaa tttgcaaacc gatttaggta tttttattat tacaagttaa 5160 tattattatt attattataa gcaaggagtt ttacaatggt aaagttaagc aaatcctgga 5220 tcgtctcaaa ataaatcatt actctgtaca ttctcaatac aaagctgctc atgttgaaag 5280 atttaaccgg acattgaggg atagattgaa gaaatatttc gtgcatcaag gtaataaaat 5340 attgataaat gtattaccga aattacttgt tagttataaa aattcatcac atcgaggtct 5400 gaatggtttg agacctataa atatcagctt aaaaacaaca gtgagttcta aaaaagttgc 5460 taaaattaca aaaccaaaat ataaagttgg tgattacgtt agaattagta aaatatctgc 5520 ttcacctttc atcgaaaatt tcgatagtaa tttcagtgat gaagacagtg acaacaatat 5580 aatacagggg aaattctatg aacaggaatt acaagttatt tccaaaccta caatgttcag 5640 aattcagaaa atattgaaaa ctaaaaaggt gtgtaaggat aaacaatact atgtcaaatg 5700 acacgaatat caaaaaccct catgaaaatc atcaaaatac ctagtaaatt gagtttctat 5760 acaattttac cgagcaatag ttgtccccta atccatccgg aaaatcaaac gaataagttt 5820 atggttgatc ttcaaaatcc tatttattta catggaaatt gggaagttgc tttgcaagac 5880 tttacatttg tgtacaatac attcccattt tatagtcatt gtaaaatcaa ttacaagaag 5940 cgggttccca aaacatgaag tggtattttc tcaatcaata atattgatcg ataatgtcat 6000 attctgaatg ggtagtgaca agcttaacat tgttgttcat agtactgaat acctattacg 6060 tttcaaaact tttagtgatg tcagccttgt gaatgaacac atcaaaataa ctgtattgta 6120 tattaatgag gtgaaagata catttacgtt aataataaat tttttgttaa tttattatcg 6180 attgctacaa agctcatgag gatgcaaaga caccgaagct gcagaagaac cccgatcgac 6240 aaattcaacg attgtaattt ttttttcaat tgattcagat gggttgatgc tacnnnnnnn 6300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6420 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6540 nnnnnnnnnn nnnnnnnaaa cgagtcacga ctctaaagat ttcaataaag ttggaatgga 6600 tgtaaaactg ggtggtttcg atttcatgtc attaaaaatc taatttcaat caaatcatct 6660 ccagaaaact gtaccccagt gactagattt agatcactgg ttgttaagta tttatcaatt 6720 aaccttttta cataaacatg acctgataat ctattaagaa ctggaacttc cggaatttca 6780 ttaaaaatcg aacatattaa cgacatctta aaggagattt aaacagtttt aagagccact 6840 aattatttat ttgaaatata ttacaaattc aaattacaaa tctattttca atgcttcaca 6900 ttgttgataa ggtcattgga tacaaagatt aatgtgcaaa gtgctgtgac tattaatgtt 6960 ttgaatgttt ttaaatagtt tacgtatttt aaatcatgtt tatttattgt tttttttaaa 7020 aaaacttttt tttaatatta ttaaaataat tgaaatttta aattacatta attttttaaa 7080 aagttataat ataaatgaaa ttttaaatta attaaaagtt atttgaatat attttataaa 7140 tatttattta ttacaaaatt aaattaataa aaaataattg gtttaaatta aatgttaatt 7200 cataatattt ttagttggtt ttgagtacct gaagnnnnnn nnnnnnnnnn nnnnnnnnnn 7260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7320 nnnnnnnnnn nnnnnnnnnt tatatattat tatttgattc atatatgatt taaataataa 7380 taaaaaatgg ataataattt acatattcct caggtcgatt atataaaaac tattaccaat 7440 ctccaattgg gcataactct aaatgtcata agactatgga acgttactat ggtcatagat 7500 tctatggaac ctactttttt atttgccata aatgcgatac cgcccatgat attgaatatt 7560 tagagtttta taaggaacac tttacccatt taaatcaatt taaaattaat aaaaagacat 7620 tatttcaaat gattaataat aataataata aaacaaataa taataataat aataacacaa 7680 ataataataa tacaaataat aatacaaatg ttactaaaaa cataaataat aacacaagta 7740 ataaaacaaa taataaaaaa tcaaaaagac ctgctgataa aatcgtgccg tgtatccctg 7800 ctaaaattta aaaaattgaa actattgtta ttagggatga cgaatctgaa aataatcaaa 7860 tagtgggttc ggtaaataat gaaataaatg taaacgatga acctgaaata gaggttgact 7920 taggggttcc cgtaaaaata ttcgctttga tttagaaaat caaaacctta atcaaatcgc 7980 cctaacagat aaacttataa ataaattaaa aaatgaaata aagaaaatgg gatatgtaaa 8040 agtccagttt acagctctca ttcaattcgc aaaagatggg gaaataaaag ataatttctt 8100 ttctaatagt gcgagtgtct tccccgatag ttttcttgct gatgaggtga gtcgattaaa 8160 tgagaaaatc gaaaaattca catatttggg aagttggtgg agcgtagtta agattcgtga 8220 aatcgatttt attttaacca cctattatcc gataagtcga ttagctggac atggttttat 8280 ccctacacct gaatctctag taggtaaaaa ggcgattgta aatgttaaaa acacagatca 8340 gtagtgtttt atatatagtg ttttagcgat tttaaagcgt gatatagttt taatgcatag 8400 ggagcgtgtt tcttcttaca ctacttttat ggatgaatta aattacgatc caattactaa 8460 ttgtccaatg agactctgta acatacccaa atttgagaga aacaatagcg agttaaattt 8520 ggaaatcaat gtttttgaat ataacaccac cccttctaaa caattgaatg atattgatta 8580 cgtacctcac catcctcacc tcgtctaaat tcatagaaca aaggttatcg gcggtaacca 8640 aattaatctt atattattac aggacggtga taattatcat tataccgccg tttttaattt 8700 gaataaactc acgaattgac attctagtag tgaatgtaac accagaattc gcataaaatg 8760 gtgtccccat tgtttacacg gtttttgtta gataaaacct ttaaggcaca tgttgatttg 8820 tgcgctaaaa atcaaatagg aacaaccctg ataataaaaa cttgatattt aaaaactact 8880 ctaaaatgat taccccgtca ttcgtaattt atgcggattt tgaggctgtt ttgcctaaag 8940 atataaaaca tcaccaaata cacatgccga tatctgcagg cttattatta atcaataata 9000 atacaaacac aactcaatat ttcagtttta taggcttgga atgtgttttg gaatttttaa 9060 agaaagtaga ggaaattgct gtcacaattg tcctaccgta ttatgaaaat catggtaaaa 9120 aacctatgaa taatcttact tctaatgagg aacagcagtt catcgcatgt agaacttgct 9180 atttttgtaa aaatttaatt agaaataagg ataaggacca tgatcatttt acaggtcagt 9240 atcttggtgc tgcctgttgt aactgtaata ttaataggaa aataagtaaa cagttaccca 9300 ttgtctttca taatttgaga gggtatgacc ttcatcacat tttaaagtat ggtttgaatg 9360 aatttacatc ttggaaatta aatatcattc ctaccaccac tgagaaattc atctcgttaa 9420 ttgtaaatat aaagaaattg gctgttagat ttatagatag tatgcaattc gtaaattgct 9480 ctttagctaa ggctgtgaaa actctaactg atctacctct aactttttat gcttttgatg 9540 gtcttattgt taaggcaaag ggtatctttc cttatgattt tgcaatatca ctagaggtcc 9600 ttaaatcgac ttacgaacca cctaaaaaat gggattctgt atctgatgat gaatacgcta 9660 ccgcccaact aatttggaca cagcaaaatt gcactacaat gttagactat atgttaactt 9720 acctaaaatt agaagtattc ctccttgcat actattttca gcaatttcgc gcaaaggcta 9780 ttgcttacaa ttgtttagaa cctttgaatt tttatggaat tcctggtatg tcacgggcgt 9840 ccgctctcgt ggcattaaag gaacccatag aacctctgca ggatatggga atgtttaatt 9900 tctatgaggg tggaatacgg ggttggttaa catttgttaa caaacattat gttgcaagtt 9960 ctgacgatac agaactcttg tatattgata ttaataattt atatggttgg gcacttagcc 10020 aaaaccttcc gtatggtgac tttgtttggg tgcatgagaa tctgaatgat gtactgagtg 10080 aatgtgtgaa tgcgaatatt gaattccttc cgtatggtta ttcaatggaa gttgacatca 10140 atatccctga gcgtgtgtat cactatttga atgacttttc gcttgctccg gaaaaaatgt 10200 gtccgcccaa ttcaaagttg agaagttaat gctcacacat tacagtaaga agaaccacgt 10260 tcttcactgg agacttctca agctgtatat agaattgggt gcgaaaattg agtgtattca 10320 tagagcaatt aaatctaaac aggcacctat ttttaaagat tatattcaga gaaatacaga 10380 tttacgctca aactcaaagt gcgagcttca tagagattta tacaaattat ttaataatag 10440 cttatatggg aagtccgttg agaatttgat aaaacagatg aatttgaggc tatgcaattc 10500 agctgaaaag atgattgtat ttgcctcaaa accaacttac aaaaagttca ttaaaatagt 10560 ggatgatttg attgctgctc atctgaacaa agattctata tgtttagaca ggcctagcta 10620 cattggccaa ttagtgctag acctgtctaa gctgctgatg taccaattac aatataatga 10680 attggcaaaa tatagatccg aattttactg tgatattaat atcgttgccg gcgacacgga 10740 ttcgttttcc cgtgaggata agcactgcat ggatattgga aagttgcaat atctttattt 10800 aagaaaatag gtaaaaatgt ataatggaaa tatgaatatt gacgatttat tgacactgta 10860 aacaggtagt attggctgtt gatctttcat tatacgattg aactgatcag ttgttttatt 10920 agaataaact tttatactat attgaacttg atttatcatt ttttcattaa ttttgcgatt 10980 ttcattcaat tcgaatattg attttcttaa ttcatttttg ataataccta tggtctgatt 11040 taaatataga agttcatcat ataatgcgtt ccgagctctt tttaaagtat catatgtgag 11100 ttcaaatcgg atttctaaat tactcattct ttcagataat cttttgactg gagttttcaa 11160 atatctaggc ctatctatca tttacaaaat atccatcaac acggctgtat caggacatac 11220 gcattttgtt gaatttttac tttgaacggt tacaacaatt gagttagcca ttttaaaaac 11280 actaaaaact aatattatta ttaaaatttc tttaatcatt ttcaaatcta tataaattaa 11340 tatttattgt ttatagataa aagtttgtgt gtgtttaaat tagtataaaa gtaattgtta 11400 tttcaatttt tatcattata atggatgtta ttttaccgag tgcagttcaa tcaatgaaca 11460 tcatggtaat ttattattat ttttattttt tacttgtttt gtagatgatt attttaaagt 11520 tatcaaagaa cgtctgaaat tctattatcc agatatacat gaaatattga aggataag 11578 // ID Sola1-1_DPu repbase; DNA; INV; 3995 BP. XX AC ACJG01006694; XX DT 27-FEB-2011 (Rel. 16.02, Created) DT 27-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE Sola1-type DNA transposon from Daphnia. XX KW Sola; DNA transposon; Transposable Element; Sola1-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Direct Submission to Repbase Update (09-FEB-2011). XX RN [2] RP 1-3995 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX DR EMBL/GenBank/DDBJ; ACJG01006694; Positions 1354 5348. XX FH Key Location/Qualifiers FT CDS 90..2294 FT /product="Sola1-1_DPu_1p" FT /translation="MFQNVNDELEITESELEKNKSVFTDITKINLCLTNDG FT LRLRRQIEERDTQQLSAVNPNKIEPEFLCVSGPVIENGISTTSGDDLNQPR FT FLRSMAKVMDEINLILLENERLFNELSQLQSQLTDSNSKLQMFQNVNDELA FT ITESELEKKRSVITGIRKINLCLTNDDSWLRRHSKDRDSWANQPQHLPEDS FT YVMPQNQQGLSNLLEKRDYQAREKPEDPYVLPPIPEKLVFFEKKDNQPQHL FT PENPHALTPNPKKSDLSEKKDEEVGMRYLSDEKNSNELYPWVLLEVLRSNQ FT RYQIKPCIQLPFLQSFNPENPDLVEGLQTDSQFHLKPGDLHVMHSCRNAEK FT IDWFEKNESQHIPEDPHALPQNQEKSNLSEKKYNLPRHKTGETNALPLNSP FT KIDLKGKKDNQPRHQPEVPSFVLLPINQENSDLFEKKDYQPQHIPENQHGW FT PPNPQKSDLLEKKFEHPRHKQEDSHALPENQENSDLLEKKDNLRQHKTEVT FT NALPLNPEKCDLMEKKDNQLQLEPEVPSNVLPPNPAEKSHSFEKIDNEVIK FT DRKFCRKEHQEESQRQQTEPEKPKEKQEKNVHETLKHIRQNIKKSLLSVKG FT LLQQPKQVASENNPIIDKCIIYGKTFPLCKGENEFKIAVVNYNFWESIGKP FT PIVKANWTLTSKTGKNLPLLGEITVKIVVNDKKRMIPIAVINNPLETNALK FT NIYIGNSDLPIRGMKPLKRMELLFRKLRKQEPKKG" XX SQ Sequence 3995 BP; 1346 A; 778 C; 740 G; 1131 T; 0 other; taaacctgat tttattggaa aatgaacgct tggcaaatga gttaagtctg cttcaatccc 60 aactgacaga ctccaactcc aagctgcaaa tgtttcagaa cgtaaacgac gagttagaaa 120 tcaccgaaag cgaactcgaa aaaaataaaa gtgtctttac tgacatcact aagattaatc 180 tctgcttgac aaatgatggt ttgcggttaa ggcgacaaat tgaagagagg gatacacagc 240 agctttcagc tgttaacccg aataagattg aaccggaatt tctttgcgtt agcggacctg 300 taattgaaaa tggaatttcg actaccagcg gagatgactt gaaccaacca agatttcttc 360 gttcgatggc gaaagtgatg gatgagataa acctgatttt attggaaaat gaacgcttgt 420 ttaatgagtt aagtcagctt caatcccaac tgacagactc caactccaag ctgcaaatgt 480 ttcagaacgt aaacgacgag ttagcaatca ccgaaagcga actcgaaaaa aagagaagtg 540 tcattactgg catcagaaag attaatctct gcttgacaaa tgatgattcg tggttaaggc 600 gacatagtaa agacagggat agttgggcga atcagcctca acatttacca gaagactcat 660 atgtgatgcc ccaaaatcag caaggattat cgaatttgct ggaaaagaga gactatcaag 720 cccgagagaa accagaagac ccatatgttt tgcccccaat tccagaaaaa ttagtttttt 780 tcgagaagaa agacaatcag ccccaacatt taccagaaaa cccacatgca ttgaccccga 840 atcccaaaaa atctgatttg tcggagaaga aagacgaaga agtaggcatg agatatctat 900 cggatgagaa aaattcaaat gaattatatc cgtgggtact tctagaagtt cttagatcga 960 accaaagata tcaaatcaaa ccatgtattc aactgccttt tcttcaatcg ttcaatccag 1020 aaaaccctga tttggtggaa ggcctacaga cagatagtca gttccatctt aagccaggag 1080 acctacatgt aatgcactct tgccggaatg ccgaaaaaat tgattggttt gagaagaatg 1140 agtcccaaca tataccagaa gacccacatg cactacctca aaatcaagaa aaatctaatt 1200 tatcggagaa aaaatacaat ctaccccgac ataaaacggg agaaacaaat gcattgcctc 1260 taaattcccc caaaattgat ttgaagggta agaaagacaa ccaaccacga catcaaccag 1320 aagtcccatc atttgtcttg ctcccaataa atcaagaaaa ttctgatttg ttcgaaaaga 1380 aagactatca gcctcaacat ataccagaaa accaacatgg atggccccca aatccccaaa 1440 aatctgattt gttagagaag aaatttgaac atccccgaca taaacaagaa gattctcatg 1500 cattgcctga aaatcaagaa aactctgatt tattggagaa gaaagacaat ctacgccaac 1560 ataaaacaga agtcacaaat gcattgcctc taaatcccga aaaatgtgat ttgatggaga 1620 agaaagataa tcaactccaa cttgaaccag aagtcccatc gaatgtcttg cctccaaatc 1680 cagcagaaaa atctcattcg ttcgaaaaga tagataatga agtcatcaaa gatagaaagt 1740 tttgtagaaa agaacaccaa gaggaatcgc aacggcaaca gacggaaccg gaaaaaccga 1800 aagaaaaaca agaaaaaaat gtgcatgaga cacttaagca tattagacaa aatattaaga 1860 aatcattact atccgtaaaa ggattacttc aacagccaaa gcaggtagca tcggaaaata 1920 atccaattat agacaaatgc ataatatatg ggaaaacgtt tccgttgtgt aaaggggaaa 1980 atgaatttaa aattgcagtc gtcaattaca atttttggga gagtatagga aaacctccaa 2040 tagtaaaagc caattggact ttaacaagta aaactggaaa aaatttaccc ttactagggg 2100 aaataacagt gaaaattgta gtcaacgaca agaagagaat gattccaatc gctgttatca 2160 ataacccttt ggaaacaaat gcgttaaaaa atatctatat tggaaacagt gatcttccca 2220 taagaggcat gaagcctctg aaaagaatgg aattactttt taggaagttg aggaaacagg 2280 aaccgaaaaa aggttaagat ttggcactga tagtttcgaa caaaaaaact taacttcgac 2340 actatatagg aacttttggt tgaattttct tcgggttgac agctgaaaac tatgtgaaat 2400 cctgtcttca atttttgcct taacgacgaa tcatcatttg tcaagcgaag atttatctct 2460 gtgatttcag ttatgacact tttctttctt tcgcgagttc gcttttggtg atttctaact 2520 cgtcgtgtag gttctgaaac atttgcaact tggagctgga gtctgtcagt tgggattgaa 2580 gcggacttaa ctcgttcgcc aagcgttcat tttccaataa aatcaggttt atctcattcg 2640 tcactttctt catcgaacca agaaatcttg gcgctctctt tcatgtcatc tacgctggta 2700 atcgaaattc catcgttaat tacaggtccg cttacgcaaa tatctcccgg ttcaatcttc 2760 ttcgggttac agctgaaagc tgtgtatatc cctctcttta atttgtcgcc ttaaccacga 2820 atcatcattt gtcaattaga gatttatcta agtgatatca gtaatgagac ttttcttttt 2880 ttcgagttcg ctttcggtga ttgctaactt gacatttacg ttctgaaaca tttgcaactt 2940 ggagtttgag tctgtcagtt gggattgaag tggacttaac tcgtttgcca agcattcatt 3000 ttccaataaa atcaggtttg tattatccgt aactttcgcc atcgaacgaa gaaatcttgg 3060 ttggttcaag tcatctccgc tggtagtcaa aattccatcg ttaaagatag aaaagcaagc 3120 taatcagctc gtcgaaggga tagtcgtaag ccagttgaaa actgtcttaa atgaaatatt 3180 cagggcctat tgttaagtcg gacttattca caattcacac ttttcttcac atactttttg 3240 gtcttttata aaacgggatt tttctgtcac agttcaatat ccttgcaagt aatttacttt 3300 tagtttgctc aactcgtatg ttcctattaa aaaaaaagtt caataattga atttaaaaat 3360 aaccatcttt tagtcaaata ataactgcaa aacataaatg caaatctggt tgttgcgtca 3420 tggttgggac atcaaagatt gcgtcatcac ggcgatgcta tggacatctt cgagttcgct 3480 ttctgtgatt gctaactcgt catttacgtt ctgaaacatt tgcagcttgc agttggagtc 3540 tgttagttgg gatcgaagct gacttaaagc atttgtcaag cgttcacttt ccaataaaat 3600 caggtttatc tcatccgttg ctttcgccat cgaacgaaga aatcttggtt ggttcaagtc 3660 atctccgctg gtagtcgaaa ttccattttc aattacaatt ccgctaacgc aaagaaattc 3720 cggttcaatc ttcttcgggt taacagctga aagctgctgt gtatccctct cttcaatttt 3780 tcgccttaac cgcaaatcat catttgtcaa gcagagatta atcttagtga tgtcagttat 3840 gatactttta ttattttcga gttcgctttc ggtggtttcc aactcgttta cgttctgaaa 3900 cgttatcatc agcttgaagt tggagtctgt cagttgggct taaagcagac ttaactcatt 3960 tgccaagcat ttattttcca aaaaaatcag gttta 3995 // ID Mariner-1_SM repbase; DNA; INV; 1882 BP. XX AC . XX DT 07-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA-transposon from Schmidtea mediterranea. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1882 RA Jurka J., Bao W. and Tempel S.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 145-145 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 383..1516 FT /product="Mariner-1_SM_1p" FT /translation="MERNKRELSNEQRRLIYEDLLSESDNGILKRGAYNSI FT AAKYHVSIQTVAKIWKRGKEISPSNVASKKRGRVGPKKKNFELNGKIRNVP FT LNKRKTIRSLAGALKMSKTSLHRRIKAGEVRVCTATLKPLLTDANKEERIM FT FVFRHLQFFEDGSIRFPDMFDMVHIDEKWFYLTEEQSRYYLETDEVEPHRQ FT VKSKRFITKVMFLAAVARPRFDAHKNEWFDGKIGIWPFIKMEPAKRNSKNR FT SRGTFVSTPLVVDKKEYRKMVIEKVLPAIKSKWPGNRIERSKTIFIQQDNA FT RPHISPDDSEFVQHATSDEFDIRLICQPPNSPDMNILDLGYFRAIQSAYYS FT TNPSNIDDIIKFVGQSFNDMPRDSLNKVFYLSKIV" XX SQ Sequence 1882 BP; 669 A; 319 C; 324 G; 570 T; 0 other; tactccctct gtcccataag ttttcgtttc gtgtccgcgg attatcgcga atatacggct 60 ctacctcaat atttttttct tatgagttga attaatcata cgactttcta aaaaacataa 120 atataaccgt atcttcttgt tttattacat ctgtcccata ttactggata gttttcacaa 180 aaagcttaaa atactctata atttgtcaag aattaaagaa agtaacgata attttaaact 240 taattgttca catttaacaa tgttttatga ttactggaca tcccctatca ttttgacatt 300 tcatttttgt ggtttttttt gtaaacacca tttcagtttt tcgtcataaa agtttaattt 360 tttcaaattt ttatccataa atatggaacg aaataaacgt gaactttcca atgaacaaag 420 gaggctaata tatgaggatt tgctaagtga aagtgacaac ggaattctta agaggggtgc 480 atacaattcc attgcagcta aataccatgt gagtattcaa actgtagcca aaatctggaa 540 acgaggcaaa gaaatcagtc catcgaatgt ggcttcaaaa aaacgcggtc gagttggtcc 600 aaaaaagaag aattttgaat taaatggcaa aattcgaaat gtgccactga acaaacgaaa 660 aacaatcaga tcacttgcag gtgccttgaa aatgtcaaaa acgtctctgc atcgccggat 720 aaaagctgga gaagtgcgag tctgcacggc aacattaaag cctctgttga cggatgcaaa 780 taaagaggag cggattatgt ttgtattccg tcatttgcaa ttttttgaag atggaagtat 840 tcgattcccg gatatgtttg atatggttca cattgatgaa aagtggtttt atctaactga 900 ggaacaaagc cgttattacc ttgaaacaga cgaggttgaa ccgcatcgcc aagtaaaatc 960 aaaaagattc ataacaaaag tgatgtttct tgcagctgtt gctagaccta gatttgatgc 1020 acataaaaat gaatggtttg atggcaaaat tggaatttgg ccattcataa aaatggaacc 1080 tgcaaaaagg aatagcaaaa atcgtagtag aggcactttc gtttccacac cccttgttgt 1140 tgataaaaag gagtaccgca aaatggtgat tgaaaaagta cttccagcaa ttaaatctaa 1200 gtggcctgga aatcgcatag aacgaagtaa aacgattttt attcaacaag acaacgctag 1260 acctcatatt tcaccagatg attcagaatt tgtccaacac gcaacaagtg atgaatttga 1320 tattcgattg atctgtcaac cacctaatag ccccgatatg aatatcttgg atttgggata 1380 ttttcgagcc atccaaagtg cctattattc aactaatccg tctaatatag atgacattat 1440 caaatttgtt ggacaatctt ttaatgacat gcctcgtgat tcgttgaata aagtttttta 1500 tctctccaaa attgtatgat tgaaagcatc aaagtcaatg gaggaaacaa ttacaagcaa 1560 ccacatatgg ccaaggatcg tcaaatcaac aatgacacgc tacctgtatc tttagaagta 1620 aaagcagaaa tattagcaaa tgctctttcc gttctaaata aataaaaagt ttcattctct 1680 ttttaaaaat atagcaatca agtgaccaca acacaaatac atgtttttat gcacttatgt 1740 agcagaacta ccatttttaa gaaataaatt taataaaaat atatcaactt tttgacaaaa 1800 ttagttttta tagaaacata tccagtaata tgggacaaga aaaacggaga aaactatcca 1860 gtattatggg acagagggag ta 1882 // ID Gypsy-237_AA-LTR repbase; DNA; INV; 242 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-237_AA_; KW Gypsy-237_AA-I; Gypsy-237_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-242 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1076-1076 (2011). XX DR [1] (Consensus) XX SQ Sequence 242 BP; 78 A; 44 C; 48 G; 70 T; 2 other; tgactatgta tattattgaa attaaatgaa attgcttata actagagggg gaagaattgt 60 aggmttttmg gtcttatgga ccaaaccccc ccgttaatga gttaccacaa tgaactttaa 120 tactcgtgtg agtgtcgctc tgcgactata aaagagcact gaataaacca gctgatcaat 180 gtttatactt ggatcgtaac tagtaaacaa gtgttgtcac tccgtgaaag aagccaccct 240 ca 242 // ID hAT-43_SM repbase; DNA; INV; 2392 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-43_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2392 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1846-1846 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 788..2215 FT /product="hAT-43_SM_1p" FT /translation="MVKAFGNEDLAKKIQSVSLSRQTVTRRIEEIGVHITN FT KLKNKVNDCSYFSVALDESTDISDTSQMLVFIKIVNEDYSCSEELLHLRSL FT HGTTKGEDIYKEVKCVVAQYGGFDKCTAIVTDGARAMTGKNIGLAGLIQKE FT GVNCQMFHCIVHQESLCGNSMQLKDIMDKVVKITNLIRGGNRSLTHRKFKT FT FLEELKAEYGDLLLYSNIRWLSAGKCLKRFFTLRKEIHLFMTEFGIDENLT FT KYLVDENFLCSLAFLTDITDYLNVLNKSLQGKDQNICHLYQHVTGFRNKLK FT LFKSNLQENNLKHFESCREFHEEMKGLNIVVNFEKFVPKLNILIEDFNTRF FT KQFDDFKDSINLFCDPLKIDIDFVDPKYQLELCDLQADPFIISSGTSGIEL FT FKLLEKEKYPKLKDFGLQIISMFGSTYICERAFSDMKYIKSKYRNSMNDST FT LEHLIRLATTNIDTNIKELVAQQIRPQCSH*" XX SQ Sequence 2392 BP; 874 A; 358 C; 407 G; 753 T; 0 other; cagcgctgcc caactggcgg ccgcgaacaa aatcaatttt tgacataatt tttttaataa 60 atatctgcaa ttcatctaaa tttttatata gcaaattcgg tgtcaaaata ccgtcttcac 120 attatcgaag atactaagtg ttctaagaaa acagaaataa catttttaca aattataatt 180 aagaaatacg accagttgcc cgttcaagtg tctcttaaat taacgttagt gatagtgcgg 240 attcaatggt gttacgacca aatagtaaga cactgtgtgt tgtcaaacct ccatcattac 300 ccgctcgtca agtctatctt gctattttat aaagtgtctg ttgttcgatc acataataat 360 taacaaaatt tagcatttat tgatagatta aattattcaa taaatatgac ttctaataac 420 aaaagaaaac acaattctga aaatcgagag tttcaagatc gctgggagtt cgattattta 480 tttattatga ataaaggaaa accacagtgt ttggaatgtt tacaaaccgt agccgtgtgt 540 aaagaataca acataaagcg tcattacgtt tcaatgcatg aaaaaaatat tcttcgtgca 600 caggagatgc gagaaaagca caagtaaaca tattgaaaaa acagcggaat tatcaaacta 660 atttatttac caatgtaaca aaatcccaag aagcatcagt agcagcatcg tttgaagttt 720 gcaacgaaat agcaaaagca aaaaaacatt ttcggatgga gaattaataa aaacctgtgc 780 tataaaaatg gttaaagctt ttggtaacga agatcttgca aaaaaaattc aatccgtttc 840 actttcacgt cagactgtaa ctcgacgtat agaagagatc ggtgtacata taaccaataa 900 attaaaaaat aaagtaaacg attgcagcta tttttctgta gctttggatg aaagcacaga 960 tatttctgat acaagtcaaa tgcttgtttt tataaaaatt gtaaatgaag attatagctg 1020 ttcagaagag ctcttacatt tgcgatcatt acacggaaca acaaaaggtg aagatattta 1080 caaggaagta aaatgtgtag tagcccagta tggcggattt gataagtgca ctgctattgt 1140 aactgatggc gccagagcta tgacaggcaa aaatatcggg cttgcgggac tcatacaaaa 1200 agaaggagtc aattgtcaaa tgtttcactg tattgtgcac caggagtcat tgtgtggaaa 1260 ttcgatgcaa ctaaaagaca taatggacaa ggttgtcaaa attacaaatt taattcgtgg 1320 aggaaatcga tctcttacgc atagaaagtt taaaactttt ttggaagagc tgaaagcaga 1380 atacggcgat ttacttttat atagtaatat tcgctggtta agtgctggaa agtgccttaa 1440 gagatttttt acacttcgaa aagaaataca tctatttatg acggaatttg gtattgatga 1500 aaatttgaca aaatatttgg ttgatgaaaa ttttctatgt tcacttgctt ttctaacaga 1560 cattacagat tatttaaatg ttttgaataa gtcattgcaa ggaaaagatc aaaatatatg 1620 tcatctttat caacatgtta caggatttag aaacaagctt aagttattta agtcaaattt 1680 acaagaaaac aatctgaagc atttcgaatc ttgcagagaa tttcacgaag aaatgaaggg 1740 attgaacatt gtagtgaact ttgaaaaatt tgtgcccaaa ctaaatatcc taattgaaga 1800 ttttaataca agatttaaac aattcgatga cttcaaagac agtataaatt tattctgcga 1860 tccattgaaa attgatattg atttcgtaga tcccaaatat caattggaat tatgcgacct 1920 tcaggctgac ccctttatta tatccagtgg aacgtctgga atagaattgt ttaaactttt 1980 ggagaaagaa aaatatccaa aattgaagga ttttgggctt caaattattt ctatgtttgg 2040 gagtacatac atttgcgagc gtgcattctc cgacatgaaa tatataaagt caaaatatag 2100 aaattccatg aatgattcca cccttgaaca tttaattcga cttgcaacga ctaacattga 2160 tacgaatata aaagaacttg ttgcccaaca aataagacca caatgctctc attaaaatta 2220 aatttaaata ttttttatta ttaataagtc acgtattatt ttttactgta ttttctttgt 2280 ataaataaaa ttattctttt atattgattt tttaattgtt tatctttttg gcccgcaatc 2340 tttgtaacag aaattatatg gcccgcggac caagagaagt tgggcagcgc tg 2392 // ID Crack-4_CQ repbase; DNA; INV; 3380 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3380 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 35-35 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 21..2858 FT /product="Crack-4_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNKSVLNHKSAVKIFQWNIRGMNSLEKFDGVRELIDR FT YGDPIDVLVLGETWVKEGNQEMYNLAGYKSIFSCRARSNGGLVVYVRSNIN FT AKIVDIQSHNGFHLIHLLLSLGKMKLNLIAVYRPPGFPSGDFLDKLEAKLG FT VLQSDQEVILLGDTNVPCNVEANSTVQEYVRLLSSFNLQVTNNITTRPASQ FT NLLDHVVCSTGIPGRITNETVETELSDHSMILTSLEMKVEKVAKTLTTDIV FT NHELLDELFARSITQFPGEVSAAERLLFAINQYKEALESATRTVTVKAKVK FT GNCPWMTLELWQMIKIKEQVLKSSKQHPSNAHLRELVKHTSQLLSKRKAQS FT KRNYYYNLITTSTPTNSWKLINEVLGRSKKKDSEILVRDQSGVLKDNIQAA FT NXLNKHFCEVGENLASSIPSDRNVNHFRTLTRHDLTIYMRPATSQEVIVLI FT GQLNPKKSPGPDRIPASFIKKHHLLFAELLKDAFNEMIETGVYPDFLKEAR FT VVPIFKSGDPTDVNNYRPISTLSMLNKLLEQMVAARLSSFLDTHKHLYNQQ FT YGFRKGSSTLTATCELLKDIYDSLDERKFCGALFLDLQKAFDTINHELLLQ FT KLEFYGIRGKANSLLRSYLTNRMQHVSVNGTRSESRRISTGVPQGSNLGPL FT LFLVFINDLSRLQLTGKIRLFADDTLLLYSSRDCRTIRNNMLEDLTILNQY FT FASNLLSLNVSKTKFVMFHTCQRRVPELSPIRFGNQVVERVSHFKYLGLVL FT DETLSWEAHIQHLKRCIAPICGVIRKLASFIPSVWLLKLYFALVQSRLQYL FT ALNWGTAAAFRLRELQTLQNRCLKSILAKPYLYPTRQLYIDAPETVLPVRG FT LHMQQTLVHMWNMLKDDATHHNLEFEIINSMTRQNGDIRIPRSNTELCKNR FT VTYVGSKLFNELPNDLKELNSKVVFKKNLRRHIKNNLLNALS" XX SQ Sequence 3380 BP; 1056 A; 806 C; 702 G; 812 T; 4 other; acaattttga agaatgtatt atgaataaaa gcgttttaaa tcacaagtcg gctgtgaaga 60 tctttcaatg gaacattcgg ggaatgaaca gtctggagaa gtttgatggc gttcgggaac 120 tcattgatcg ttatggagat ccaattgatg ttcttgtact aggggagaca tgggtcaagg 180 agggcaacca ggagatgtac aacttagcgg gctacaaaag catattttct tgtcgagcaa 240 ggtcaaacgg tggattagtt gtatatgtcc gcagtaatat caacgcgaaa atcgtcgaca 300 ttcaatcgca taacggtttt cacttgattc acctgctgct gtctctggga aagatgaaat 360 taaacttgat tgcagtatat cgcccacctg gatttccttc tggcgacttt ctggacaaat 420 tggaagcaaa actaggagtc ttgcaatctg accaagaggt tatccttctg ggagatacca 480 acgttccgtg caacgtggag gcaaacagta ctgtgcagga gtacgttcgg ttgttatctt 540 cgttcaatct gcaagtcaca aacaacatta ccactagacc agcaagccaa aaccttcttg 600 atcacgtggt ttgttctacg ggcatcccag gaaggataac aaatgaaacg gttgagacag 660 aattgagtga tcattcaatg atcctgacat ccctggaaat gaaagtggaa aaggtggcta 720 aaacattgac cacagacata gtaaatcacg agttgctgga tgaattgttt gctcgttcga 780 ttacgcaatt tccgggggag gtcagtgctg cggagcgtct cctgtttgca atcaaccaat 840 acaaggaagc cttagagtcc gctacaagga ctgtgactgt aaaagcgaag gtgaagggaa 900 actgtccatg gatgactttg gagctgtggc agatgatcaa aatcaaggag caagtcttga 960 aaagcagtaa acaacaccct agcaatgcac acctccgaga acttgttaaa cacacttcgc 1020 aactgctgag caagagaaaa gcccagagca aaaggaatta ctattacaac ctaattacta 1080 catctacgcc gacgaactcg tggaagctga tcaacgaagt ccttggaaga tcaaagaaga 1140 aggattcgga aatacttgta agggaccaat ctggagtatt gaaagacaac atacaagctg 1200 caaacwcgtt gaacaagcac ttctgtgaag tcggagaaaa cctggcctca tcaatcccaa 1260 gtgacagaaa cgtgaaccac ttccgcaccc tcacccgcca tgacctgacc atttatatga 1320 gacctgcaac atcccaagaa gtcatcgttc tgattggcca attgaacccg aaaaagtcac 1380 ctggaccgga caggatacca gcttcattca tcaaaaagca tcatctgttg ttcgctgaac 1440 tcctcaaaga tgccttcaac gagatgattg agacaggagt ttacccggac ttcctgaaag 1500 aagctcgagt ggttcccatt tttaaatctg gtgacccaac ggacgtgaac aactacagac 1560 ctatttcaac tctatccatg cttaacaaac tgttggagca aatggttgca gcgcgcttaa 1620 gtagctttct ggatacccac aaacatctat acaaccagca atatggattt agaaaaggct 1680 ccagtacact aacagccacc tgcgagctac tcaaggacat ctacgacagc ttggacgaga 1740 gaaaattttg tggtgcactg tttctggatc tgcaaaaggc gtttgacacg attaaccacg 1800 agctgttatt acaaaaacta gaattctatg gaattcgcgg aaaagcaaac agtcttctta 1860 ggagttacct gacgaaccga atgcagcacg tgagtgtcaa tggcacaaga agcgaatctc 1920 gacgtatatc gaccggagtt ccgcagggga gcaatctcgg cccattgttg tttttggtgt 1980 ttatcaacga cctgtctcgc ttacagttga ctggaaaaat acgtctcttc gctgatgaca 2040 cgctcctact ttatagtagt cgagactgtc gcaccataag gaacaacatg ctggaagacc 2100 tgaccatact taaccagtac tttgcgtcga acctgctgtc gctgaacgta agcaaaacaa 2160 aatttgttat gttccataca tgtcaacgaa gagttccaga gctgagtccc ataaggtttg 2220 ggaaccaagt ggtcgagcga gtctcccact tcaagtacct tggacttgtc ttggacgaaa 2280 ccttgtcatg ggaagcgcac atccagcact taaaacgttg tatagcgcca atctgcggag 2340 tcatcaggaa actagcttct tttataccat ctgtttggtt gttgaaattg tacttcgccc 2400 ttgtacaatc tagattgcag tatctcgcac tgaactgggg tacagctgcc gctttcagac 2460 tacgtgaact ccagacactg caaaaccgtt gcctgaagtc aattcttgca aaaccctacc 2520 tctacccaac acgccaactc tatattgacg ctccggaaac tgtccttccg gttcggggac 2580 tacacatgca gcaaacacta gttcacatgt ggaacatgtt gaaggatgat gccacccacc 2640 acaatttgga gttcgaaata attaactcta tgacaagaca aaacggggat attcggattc 2700 cccggtcmaa cacagaactc tgcaagaacc gggttaccta cgttggaagc aaattgttca 2760 acgaactccc caatgacctg aaggaactaa actccaaagt agtattcaaa aagaacttac 2820 gccgacatat aaaaaacaat ctactgaacg ctttgtcctg atcctccact aaccaatatg 2880 aaaaaatctg ctcttgtcgt ttccgccgcc accgaccgcc cgccgcccgc caccaaccgc 2940 ccaccgccaa ccgcccaccg ccaaccgccg atcgccaccg cccactgcca accacccatc 3000 gcccaccgcc caccaccagc cgttacccta cactgcccat cacacacact ttcgtctgaa 3060 caccaccaaa ttgaaattcc acacactatt tattaacatt tgaacgccaa gattagcact 3120 ctcttaaaag agcaaacttt gctcactgag atgtgcaaat aatgtaaatg ttaaatgtat 3180 twtttttgaa aagatagtag gttttatgcc ctagtgcttt tgtcctacma aaaaaaaatg 3240 tactgcagaa aaaaaatata actgcaaata aagaaaagaa ctatcgagcc taacctcaat 3300 accttgaggt aggtgaaaaa ttccagtata gtaagctgtg aaattgcgta attcgatctt 3360 ttcaaaaaaa aaaaaaaaaa 3380 // ID PIGGYB_SM repbase; DNA; INV; 5560 BP. XX AC . XX DT 28-JUN-2007 (Rel. 12.06, Created) DT 17-FEB-2008 (Rel. 13.03, Last updated, Version 3) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; PIGGYB_SM. XX NM PIGGYB_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5560 RA Jurka J.; RT "PIGGYB_SM: PiggyBac-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 7(6), 363-363 (2007). XX DR [1] (Consensus) XX CC It is an ancient element and it probably carries other inserts, CC such as LINE fragment between the ORFs. XX FH Key Location/Qualifiers FT CDS 522..1394 FT /product="PIGGYB_SM_1p" FT /translation="MRPEEMKRILKRLENGEISEDDSTDNEDDIDYYSSQR FT DRLMELEDEEDVGNFPTDPNLDADPPAANVDTDMQHNLETVERSPTPTTSS FT QNNLQTAPFNSRNLVWRVKNMDIVQNAFQFAGNTEYSQEIMNLDTPFQFFS FT YFFGEEILRFLVDESNKYAFQKNPNFTEPVTVLELRKFIGILIFTSVYHYP FT SVRSYWSNITKFEPIAQTMNRNRFDKIRQIFHMNDDSKHLPIDHPQHDRLH FT KVRPVIDYLNKKFITVPFEHRLSLDEQMCSTKIKHFMKQYLTGNSFGRHV" FT CDS 4681..5391 FT /product="PIGGYB_SM_2p" FT /translation="MKSNVPRGTYHENVASHEGQEFSATSWKDNKQVLLLS FT TYVGAEPADTITRYEKKLKANVQVSCPRVIKEYNAHMGGVDLMDSFIGRYR FT IRIKSRKWTMRLFYHLFDMTVINAWVLYKKVNTVKGKLQKNIMKFADFRTE FT LADTLCRYQSHSKNKRGRPSSNSRQDTTVPSKRPRTGVQVLPSTDVRCDCI FT GHEKSFMDSRNKCKFNNCRKLTSWFCKKCKVSLCDNKNNQCFALFHA" XX SQ Sequence 5560 BP; 1801 A; 902 C; 885 G; 1970 T; 2 other; cccttaataa cctagagttt taattttgat accaatcttt taaacctaat gttcacatta 60 ctattagttt tttttatagg ttggctgttc gtattttact gttgtataat attctttcca 120 acgaaattaa aaaaaaacaa atgtcattat tgttttattt tatctatgac atattacttg 180 cgcggcaaag tgaacaagta aacacatttt actaaaagtt ttgttgaata caaatttgat 240 atattttctt tttctgaaga atataaatac ttttgaggta agtgaatata ttattttcgc 300 tatgatataa ttactttttg cgagtgcatg ttttacttga agaaactagt gctatttttg 360 tgagtatcaa ttttgatact tcagtcattc cgccctacat aaaacctacc tactaattct 420 ttagctataa aaactttttg aacttccgtt tcattatttt tctacatagc ttggtgttgg 480 gattactttt atttttattt tattatagaa atttatcaaa aatgagacca gaagagatga 540 agagaatatt gaagcggtta gaaaatggtg aaatttcgga agacgattcc accgacaatg 600 aagatgacat agactattat tcgagccaaa gagatcgcct gatggagcta gaagatgaag 660 aggatgtagg taattttccc actgacccaa acttggatgc agatccgcct gcagcaaacg 720 tagataccga catgcaacat aatttagaga ctgttgaacg aagcccaact cctacaacca 780 gcagccagaa caacctgcaa actgcacctt tcaactctag aaacttagtg tggagagtaa 840 agaatatgga catagtccaa aatgctttcc agtttgctgg aaacactgag tattcacaag 900 aaattatgaa cttagacact ccttttcagt ttttttctta ttttttcggt gaagaaattt 960 taagatttct agtggacgag tcaaataaat atgccttcca aaagaatccc aactttacag 1020 aacctgtaac tgtcttagaa ttacgtaaat ttattggaat tcttattttt actagtgtat 1080 atcattatcc cagtgtcagg tcatactggt caaacattac aaagtttgaa cccatagcac 1140 aaacaatgaa tcgcaacaga tttgataaaa ttcgacaaat ttttcacatg aatgacgact 1200 ctaaacattt gcccattgat catccccagc acgaccgact gcataaagta agaccagtga 1260 tagactattt gaacaagaaa tttattactg taccatttga gcatcgtcta tcactagacg 1320 agcaaatgtg ttccactaaa attaaacatt ttatgaaaca atacttaact ggcaattcgt 1380 tcgggcgaca tgtctaaaac atcatcgtta ttagactcaa ttaattcacg cacattagat 1440 cattagcaaa taaatggtgc ttatgtaatt gttgaggggt tatcgaaatt aaaacttatc 1500 gaattctagt agctttatat ttgattgtaa ttctttcgaa actggtggac ggttcatcca 1560 gaagctttag gtacaataaa tctcgtgatt aattcctccc gagctcttgt gtatccattt 1620 aaggttataa caatcaagta tcactatgat actaagaaaa atggttcgcg tacatctcgg 1680 tggacagggg ttcactcaca tctgcgaatt gatgtttgat aagttaaact tgactattgt 1740 tttcctttac ttgatgggac gtgggtgaga gggttatatt atttacggac ctggcctttt 1800 agtatgtacc cctttatagt gcgatacata ctagtatact taataatatg tgagataaat 1860 ctactaagta acttagtttg atctaaatag gtatgttaat tctaatatat aatatattcg 1920 atattatgtg tagcatttca atatattgac gtaagagaac ttatactttc aaccaatttt 1980 ctttaacaat tttatatgca ggtgattcga acttttaagg tcgaggccgg atgaacggat 2040 ataacaattg tgcgtagacg tattgtttgg cgatctcacg tatgtgattt ccggcggatc 2100 ttaccgtagc gacttaagcg attgacagga ggtgatttag cacgtttaca tccggtaatt 2160 aagtccattg tatttcattt ttcaggatta tcgatcgacg gattcgtcta agcctagtta 2220 aatcaattcg ggtaagttat tacgttgatt aatgtggaag taccttactg acatttttaa 2280 ggaaaaactt cttactacaa ttgaggcttt caagcagatt caatcccata cctttggaag 2340 atttttatta tgtggaattt tatttgttca acggaaaaga tcggtagtaa gccttcttgt 2400 gatttaacaa tttatgtaat gttagataaa aaaatttgct ctcctcttaa aactttgatt 2460 tttgtattat attaatttga ttcaaacgat aactcattta tttcgtctgt aatcatttac 2520 tttagtccct gaattcgttt cattaaaaga aaactttatt tctaatctgt aaattattct 2580 agttctgatt aagagacggt tttagtaatt tattttgatt tctattactc aattttctta 2640 cctttttctt gtcaactaat ttaatcaatc attattttaa ttccatgaaa ttactttaat 2700 attaacccaa tactattaaa tctatgaatc cagtttttac tttatcctta ttcagggatt 2760 tacttgatct ccactaaaga aacgatctta tttttattca aattattttt aattttttta 2820 atcactgaat gacttggttt aatcgacatt gcattatggt taatcaagga atttatatta 2880 tttaatcaac aaattaatat aagaaaactt tatttttaat ccataaatta cgttaacttt 2940 aatcgtttta attaaactaa tttaatcaat aaagtatgta cgtttacgta aaattccttt 3000 aattttaatc caaatgaaac aaattcagct tcatcaacga gttatttgtc tttactcaat 3060 aatcaataat ctaatttaat tttaatgaat aatttatcta atgattttta aatttaaatt 3120 agtatgctaa tgattagttt aattcacatt attgatttaa attagctata tcaacgattt 3180 tattttaatt tattcccgta cccttcccct ttaattggcc ttaggcgcat tttgtgttta 3240 tattatattt aatgagtgcc cactggacta ctcggtgtga tgattttgag ccgtgtgtag 3300 tataaatttt aggtcgagtt tatggaaatt gagagacgag gaagcagcca aaagactact 3360 tccttatccc tcttttcctc tatacggaat aatattgacg ttccgacccc tgcgttccgc 3420 acgccggagt cttcacgggc acgaatcatt ggcgattttc cacgcactta cacaaatccg 3480 aatactatgt atgattttac taacttgtga tatgtcttgg ctgtgggccg gaaaaatttc 3540 gacaaaaatt aagtttattt tatttaaata tataaaaaca atgtattaat attaataata 3600 tttattaatt acaaacaaaa aataaacatr attaccctac attgccaatc cataaaaatc 3660 gcataaaatt tcatataata atataaaaca aaaatacctc tctccccacg ccgacaccca 3720 agcagagctc tatcgaggtt gcaaggatgc gacgtggaga gagagttcca aaatraacat 3780 ttctcaccta aaaattaaga ttcacaaggt aaaaaatttt ttaaacagaa ttaaacgatt 3840 tcaaaaatgg ttttaaaaac tggttaaatt aaatgtgaaa atgaatcggt atatatacct 3900 attgtacgac cctaaccttg cgtgttaaat ttgatttagt aaaacaaaat ttgaccttgt 3960 aattttctag tatattctat tctttaattt tattattttg aaatttaatt tgacaatttt 4020 ttctttagaa tttttattga attatttcta attcaaatta tttgatttta atttcatttt 4080 catgaacttt ttattaattt gtcaaatttc ggtcaaattc gttgggtttt cgtgttatgt 4140 ttatgtttat ttcgtgcgga tcttgttgca ttttgttatt attttaattt atttttgtgt 4200 ttataatttt atatttaatt gttttttttc tcatattttg atagggttta attggtgttt 4260 taataggttt gagtttaaat ctgcatgtta tatttgcatg tgtgttaccg attcatttaa 4320 acatttatca aggccgtttg tgttctggcc caacacttac ccaacaagcc ccataagtgg 4380 ggattcaaat tatacgtttt atgctctctg tctgggtatg cctacagttt tgagatttac 4440 tcaggagcta aggacataga ccgtttacct ggcgagccag acctcggagc tgtatcgaat 4500 acagttattc gattgttacg accagtacca aggcacgtca atcatataac ttattttgac 4560 aatttttata cgaatattcc tctgctacat tacttgacca acgaaggtat ttattgtctt 4620 ggaacagtac agagaaacag gcttggaaag tcgtgcaagt tgccggagaa acgggaaatt 4680 atgaagtcta acgtacccag aggaacatat cacgaaaatg tggcctctca cgaaggccaa 4740 gaattttcgg ccacaagttg gaaagacaac aaacaagtac tattactttc cacgtatgtt 4800 ggcgctgaac cagcggatac tataacccgc tacgaaaaaa aattaaaggc caatgttcaa 4860 gtatcatgtc ctcgggtcat aaaagaatat aatgcacaca tgggcggcgt tgatttgatg 4920 gatagcttca ttggtcgata tcgcatccgc ataaagtcga gaaagtggac aatgaggctc 4980 ttctaccact tgtttgacat gactgtcatc aatgcttggg tactatacaa aaaggtaaac 5040 acagtgaaag gaaaacttca gaagaacatt atgaagtttg cagattttag aactgagctt 5100 gccgatactt tatgtagata tcaaagtcac tcgaaaaata aaaggggaag acccagtagc 5160 aacagtcgac aagacactac agtaccttct aaaagacccc ggacaggtgt acaagtatta 5220 ccctctactg acgtgcgttg cgattgtatt ggtcatgaaa aaagttttat ggattctaga 5280 aataagtgta aatttaataa ttgtaggaaa cttacctcct ggttctgtaa aaagtgtaaa 5340 gtgtcgttat gcgataataa aaacaatcaa tgtttcgcat tatttcatgc ataggtatat 5400 ctgatttctt tatgacctta atgcctgaag tatcaaaatt gatacctgtt tttttctaaa 5460 taaatatgta aaataaatat tgtgattttt ttagctattt ttcccttcat ctagaccctc 5520 aagcaaacac aataatatta aaaaaaatca ctactaaggg 5560 // ID Gypsy3-LTR_Dya repbase; DNA; INV; 264 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_Dya; KW Gypsy3-I_Dya; Gypsy3-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-264 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1043-1043 (2009). XX DR Genome; chrU; Positions 5678669 5678406. XX SQ Sequence 264 BP; 90 A; 59 C; 51 G; 64 T; 0 other; tgtagcaagc aggcacacta ccgtaacata cacacagtga ttgtacatac agtcctagat 60 atacatactc ttagttttaa gtacaatttt acgctaatgt taagatcgtg cttacggact 120 actttgtgcg tcccgttcgc acgatcagca actgacacac caatacatac atgtaatagg 180 ctaagaagga acggaacgaa gaaataaaga acacttcgtt tgtgacaaga agagagaacg 240 gccgcatttg aatcgcctcc taca 264 // ID Crack-6_AAe repbase; DNA; INV; 4588 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4588 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1222-1222 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >97% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 690..1499 FT /product="Crack-6_AAe_1p" FT /translation="MPVNQSSVASNYKMSCGNVTTSNVILATLKKGLDSFT FT DQISASQQFMSSKFDTILDDFAELCNEVRLLKRDNVELRKTVAQLERQLES FT HSCTVHRQGKLLDDFSRESVSCNAIVTGIPRVPHEETAKLIEKTFSVVSPH FT IDMKQVKHCERLSVTKSHHETPPIRVVFNNVDAKRKFVKAKIEYGRLRVGS FT ITRCHGRADQVVSVRNELSPLKIELLNELKHSQHAIGFSYVWASTAGDILV FT RFNKSSKPIVIRTRADFHKLTSNQQSKQL" FT CDS 1505..4411 FT /product="Crack-6_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MYDMTMDNILKESNIFVEPPIQRSIFARESCFNMLQI FT NIRGMNNIEKLDSLAIFIDNLGTPVDILIICETCIKQDRSSFYNMNGFSSI FT HSCRTQSAGGLAVYVRNGIGYDIISNSTIDGYHHINLQIPMDNSVCSIHGI FT YRPPDFDFVRFRNLLEEILASSNLSKPIFIFGDMNVAINRDRRDKEEYMDL FT LQSYGMIVTNTNITRPESDNILDHSIARIDDITRITNYTISCELSDHNYVL FT TSLKTFVASNRMRTLTKTIIKYEVVDRKFNEFLTVYDWANANPNERLVKIA FT EKYIELRNANTTTKSMHVKLKKEGCPWFNYDMKVLCHARDTALKRWKRNRT FT NDHLAEVLRLANKKLVAAKRKAKKHYYGQFFNAGNPKELWNRINGLLGTNC FT QLREIMLHIDGREVSDAALVSNLFNEYFTSIGRLLAENLSSSNNINAHYTI FT REVPNTFFLTPATSQEVSNIISNMDSSKATGYDGLPITALKHHCTVLSTII FT ADAFNDSAITGIYPECLKKAIVYPVFKTGDRKLLNNYRPISVLPAVNKVFE FT QLLTQRLLEFLDSNSFFYNKQYGFRKGSSTQIAVLELVDELSDTIDRHKNI FT AGCLFLDLSKAFDTIDHGFLLKKLEACGVRGLPNNLLRSYLQNRKQAVIVN FT GCRSSELSIEIGVPQGSNLGPLLFLIYVNDLCNLRLNGVPRLFADDTVLSY FT TGRTPQEIVDKMREDLLIVSEYLENNLLSLNVAKTKFMIIRGPRTILSEHT FT GIMFKNEEIEEVECFKHLGIILDCNLSWKNHIKHLVQTCAPICGMLRKLSY FT FLPRHVLLKIYYAFINSRYQYASCVWASVGVNQLKALQVQQNRCLKAIYRL FT PFLHPTINLYDSFNHSIVPIGALHAYQTSIVIHRIICIQDIHHNFELQNVS FT HQYETRNLDRIVASNFASEIGRRRFSSIGPRIYNNLPSSVKNCEFITTFKK FT NLKKYVLDHLGRYL" XX SQ Sequence 4588 BP; 1428 A; 978 C; 851 G; 1327 T; 4 other; gttgttgtga cgctaagtga gacagtcaaa ttattgcgct cctcatcata attaaatagc 60 tgattattta caccacgaat aagtgctaaa tatttgaaaa aatcgtgttt gtgtacctct 120 gcaccgcatc tcgcagttwt ctaatccaat ttcgtaaaac tggatgagat atacgaacaa 180 atagtatgat tctctgctac acctcaacga tcattgcatt cgccattctg ctgctgttgc 240 tgctcatccc gaaataaacg tttcwtcacc cgccattcgc catccaccac ccgccaccgc 300 cacccgctat ttgccaccca cccttcgaca gccacaaaac gccaccactg tttgccccgc 360 cgccacctcc gctgccaaaa atattgctca tgctgcttca atataacgaa cgtttgttat 420 cgcaagaccc gtccatgtaa cgttggatca ataaagctga cttgctgcgc tgtgtttcat 480 ttctaccacc cctcacctcc tctgacgcta cgtccgctgc catcaacgat cgctcgattc 540 catctaccaa catcttcaca gcagccaagg tgagcaagat acatacaata gattgttgtc 600 aaattgttgc tcacactaga ggccacgtcg ctcaaaaaaa aaattgtaca gcgccatctg 660 tttgtgaaaa cgtgcaccaa aactgctaga tgccagttaa tcaatccagt gttgcctcaa 720 actataagat gagctgtggt aatgttacta catctaatgt gatattagcc acattgaaaa 780 aggggcttga ttcatttact gaccagatat ctgcaagcca acaattcatg tcctctaaat 840 tcgatactat tctcgatgac ttcgcggaat tgtgcaatga agttcgcttg ctgaaacgag 900 ataatgtcga gcttaggaaa actgttgcgc aacttgaaag acaactcgaa tctcacagtt 960 gcacggtaca tcgacaggga aaattgttgg acgacttcag tcgagagtca gtttcatgta 1020 acgccatagt tactggaatt cccagagttc cacacgaaga aactgcaaaa ttgattgaga 1080 aaaccttctc tgttgtttct cctcacatcg acatgaaaca agtcaaacac tgtgagagac 1140 tgtctgttac taagagtcat cacgagactc ctccaattcg agtggttttc aacaacgttg 1200 atgctaaacg aaaatttgtt aaagccaaaa ttgaatacgg cagactacgt gttgggtcca 1260 ttacacgttg ccacggcaga gctgatcaag ttgtctctgt aaggaatgaa ttatctccac 1320 tgaaaattga acttttgaat gagctaaaac attcccaaca tgccataggc ttctcctatg 1380 tgtgggctag tactgctggc gacatcctag taaggttcaa caagtcttca aagcccatcg 1440 tcatcaggac tcgagcagac tttcataaac tcacttcaaa tcagcaatca aagcagttgt 1500 gaaaatgtac gacatgacca tggataacat tttaaaggaa agtaacattt ttgtcgagcc 1560 accaattcag cgatcaattt ttgcacgtga atcctgcttt aatatgctgc agatcaatat 1620 tcgcggcatg aataatattg aaaaacttga ttcattagcc atatttatcg ataacttggg 1680 tactcctgtg gatatcctga taatttgcga aacgtgcatc aaacaagaca ggtcgagttt 1740 ctacaatatg aatggattct cctctataca ctcttgtcgt actcaatctg ctggaggctt 1800 ggctgtttat gtaaggaacg gcattggcta cgatatcata agcaactcaa ctatagatgg 1860 ttatcaccat atcaacctac agataccaat ggacaattcc gtttgttcta ttcacggtat 1920 atatcgtcct ccagattttg attttgttcg tttccgwaac ctcctggaag aaattttggc 1980 ttcgtcgaac ctttccaaac caatatttat ctttggtgac atgaatgtgg caatcaaccg 2040 cgacagacga gacaaggaag aatacatgga ccttctacaa tcatacggca tgatagttac 2100 aaacactaac attacacgac ctgagagtga caatatcttg gatcattcta ttgcacgaat 2160 tgatgacatc actagaataa ctaattacac cattagctgt gaacttagcg accacaacta 2220 tgttctaaca tctttgaaaa catttgttgc cagtaaccgt atgcgaacat tgacgaagac 2280 tatcattaag tacgaagttg tcgatagaaa attcaatgag ttcctgacag tctacgattg 2340 ggccaatgcg aatcctaatg aaaggctggt aaaaattgca gaaaagtata ttgagcttcg 2400 gaacgcgaat acaactacca aatctatgca cgttaaactg aaaaaagagg gatgtccatg 2460 gtttaattat gacatgaaag tcctttgtca tgcaagggat actgcgctga agagatggaa 2520 gagaaaccgt acgaatgatc atctcgccga agttctgaga ttagccaata aaaaacttgt 2580 ggcagcaaag cggaaagcca aaaaacatta ttatggtcaa ttttttaatg ccggcaatcc 2640 caaagaactt tggaatcgca tcaacggtct gctgggtacc aactgccaac tacgtgaaat 2700 catgctgcat attgacggta gggaggtttc tgatgctgca cttgtatcta atctcttcaa 2760 tgaatacttc acctcaattg gtagactact agctgagaat ttgtcatcat caaacaacat 2820 taacgctcac tatactatca gagaggttcc aaatacattc ttcttgactc ctgcaacatc 2880 gcaagaagtt tccaatatca tctctaacat ggatagttca aaagcgacag gatacgatgg 2940 attaccaatc accgctctaa aacaccattg cacggtttta tcgacaatca tagccgatgc 3000 cttcaatgat tcagctatta ctggaatata ccccgaatgc cttaagaaag cgattgtcta 3060 tccagtattc aaaactggag accggaaact acttaataat tatcggccaa tatccgtgtt 3120 accggcagtt aacaaggttt tcgaacaact gctcacccaa cgtcttctcg aatttttgga 3180 ttctaactct ttcttctata acaagcaata cggttttcgt aaaggctcct caactcaaat 3240 agccgtgctt gaactagtag atgagctgtc agatactatt gatcgtcata agaacattgc 3300 tggttgtctg ttcttggatt tgtcaaaggc ttttgatacg atcgaccacg gatttcttct 3360 gaaaaaactt gaggcgtgtg gagttcgagg tcttcctaac aacttgctaa ggagttacct 3420 gcagaatagg aaacaagctg tcattgttaa tggatgtcgg agtagtgaac ttagtattga 3480 aatcggggta ccccagggaa gcaacctcgg accgctgttg tttcttattt atgttaacga 3540 tctgtgtaac ttacggctta acggtgttcc tagactattt gctgatgata cagttctctc 3600 ttatactggg cggactcctc aagaaatagt tgataagatg agagaggatc ttcttattgt 3660 aagtgaatac ctagaaaaca atctgctatc actaaatgtt gcgaaaacta aatttatgat 3720 aataagagga ccacgcacta ttctctcgga acatacaggt atcatgttta agaacgagga 3780 gattgaagaa gttgaatgtt tcaaacattt aggtatcatt ctcgattgta acctgtcatg 3840 gaaaaatcac atcaaacatc tagtccaaac ttgtgctccg atttgtggca tgcttaggaa 3900 actatcctac tttcttccta gacacgtttt attaaaaata tattatgctt ttataaatag 3960 tcgttaccaa tatgcatcat gtgtatgggc ctctgtaggt gtcaatcagt taaaagcatt 4020 gcaggtgcaa caaaacaggt gcctaaaagc catatatcgc cttccatttc tacatcctac 4080 gataaatcta tatgattctt tcaaccacag tatagttcca atcggtgctt tacatgcata 4140 tcaaacctca attgtaatcc atagaattat ttgtatccaa gatatacatc acaattttga 4200 gttacagaat gtatcacatc agtacgaaac ccggaatttg gaccgtatag ttgcctcgaa 4260 ttttgcgtct gagataggta gaagaagatt ttcgtccatt ggaccwcgca tttataacaa 4320 tttgcctagt agcgtaaaaa actgtgaatt tattacaaca tttaagaaga atttgaaaaa 4380 gtatgtgctt gatcatctag gtagatatct ttaagtagac gaattatatc aattagcatg 4440 taagaattat tataacaaca acaacaacac tctcggtact tttgagactt tttagaggag 4500 ctttgagttc attaaagtct taaaagtgca gtgccgctat gtattgcaat agttatgctt 4560 aataaatttg aatttgaatt tgaaaaaa 4588 // ID Gypsy-10_OD-LTR repbase; DNA; INV; 281 BP. XX AC CABV01000282; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_OD_; KW Gypsy-10_OD-I; Gypsy-10_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-281 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000282; Positions 15683 15403. XX SQ Sequence 281 BP; 87 A; 59 C; 59 G; 76 T; 0 other; tgacgattag aactgcattt tagcagtttc taatcagtca gaaacgtgga cggcactccg 60 tgctgagtac cgccggactt tacgttaaga gccgagtact cgtatttcga gtcgagctct 120 cgtaatttag aagtcggcgc cacgttccct ctccctataa aagcagcagc ttttagggat 180 ttactcaccc cgcaaaagtt ctaattgaaa taaagaacta gtattaaatt aagaaaacta 240 cgtgtttgaa ttaataagag aatcggtaac ttgagaagcc a 281 // ID SINE-3_CQ repbase; DNA; INV; 418 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-LTR retrotransposon from Culex quinquefasciatus - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-418 RA Jurka J.; RT "Non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 595-595 (2011). XX DR [2] (Consensus) XX CC >98% identity to consensus. Putative SINE. Present in over CC 10,000 copies in the genome. XX SQ Sequence 418 BP; 97 A; 92 C; 122 G; 107 T; 0 other; gccgggaccg tggtgtaggg gtaagcgtga ttgcctctca cccagtcggc ctgggttcga 60 tcccagacgg tcccggtggc atttttcgag acgagatttg tctgatcacg ccttccgtcg 120 gaagggaagt aaatgttggt cccggactaa cctaaaaaag gttaggtcgt tagctcagtc 180 caggtgtagg agtcgtctcc ctgggtcctg cctcggtgga gtcgctggta ggcagttgga 240 ctaacaatcc aaaggtcgtc agttcgaatc ccggggtgga tggaagctaa ggtgtaaaaa 300 gaggtttgca attgcctcaa caatcaagcc ttcgaacacc tagtttcgag taggaatctc 360 gcaatcgaga acgccaaggc aatgctgtag agcgaataat ttgatttttt gatttttt 418 // ID BEL-72_AA-I repbase; DNA; INV; 6326 BP. XX AC supercont1.274; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-72_AA_; KW BEL-72_AA-LTR; BEL-72_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6326 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.274; Positions 188787 182462. XX CC 'GTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 25..6326 FT /product="BEL-72_AA-I_1p" FT /translation="MSGKDSTTSVPISPQGCKGCDRPIHVDDMVACDLCNN FT WWHFNCAGVHASVSERSWICINCVECRKSLKSFKSTSSNARLRLRQLAEAK FT DLEDRIFMEKAERDRAYLTQKHQIETEIESDVTETSSSRGSDTSHSRRKSL FT HDWIVDQNRSLEGAVGGLELTESKSLFERAATRPRTGTIPKTVSGTHPATP FT IFTSTAELPIVNRKAAVQAAQSIELDHYTEGKQLPHDSSQVQMQESLLNMQ FT HPRSDPAVRSLQRRVSIQSRVGQSVALAPPSKTFGLLEPATIENQPMIMEL FT HTTAPLRGSGSNRSVCMPCQPEYVDVSAQPRLLEQSQHIFEQAVMHNQRST FT PQPAPRYKHTINKETFSYRPDAHPLSAHTRAPQAEWNGHMSYQRPLAPSRH FT SINAQITDSTQLYADHDGQPNQYHHATGQSGSPQRDREEYLLSHQPSVCQQ FT SRIVQQPLPLQSVPAISAQQLAARQVVPRELPKFSGDPLEWPMFLQAFETT FT TQLCGIQPDENLARLQKCLVGSARDKVQSILTLPAAVPNIIATLRDECGRP FT DQLVHRLLIKVRGAPHPNINKLDTLITFGREVSNLVTYIEAANLQAHLSNP FT MLLSELVAKLPPNLRLDWGLHSRSTGDESLKAFSDFVATIRSAACHVSMSS FT EFSQSEDISSRRKDKSSFINAHSSEESSTSKPKNAEVRPCPLCNSVGHRIR FT DCDIFKESTVGERRKLVDQHRLCQRCLVGHGKWPCRTKQPCSMNGCTEMHH FT RLLHSEQAIDTVSTSDPRALISTHHCVLRSTLFKIIPATLYGAGKTITTYV FT FLDDGSSHTLMDQDLADELGLDGENMPLCLQWTGNVTRKEPTSRQVNISIS FT NIHGGSKYEMFDVRTVKKLDLPRQTLEYEQLSSQFPYLQGLPIASYREAAP FT RLLIGANNANLILTLARRERRIGEPVATKTRLGWALFGGIRCKSESANLLV FT HDCGCCQDDSLHELVKEYFAVDNLGITTPVMESAEDQRAQKILHESTKRTS FT SGKFETGLLWRRNDIEMPNSYPMAERRLRCLERRLEREPELKRKVDDLIGE FT YLQLGYAHIATEDELKNSNPKRTWYLPLGVVRNPRKPDKVRVVWDAAAKVN FT GISLNSLLLPGPDLLTPLLTVLCQFRQRQYAVSADIRQMFHQLLIKPEDRQ FT VQRFLYRSDSRSTPTVYVLDVATFGATCSPCSAQYAKNLNAFQHMDTFPDA FT AEAIVKNTYVDDYLDSRDTIDEAVQLSLDVRAINQKAGFELRNWQSNSKEI FT LRRIGEKCHDSVKNFAAEKSTPTERVLGMTWMTDNDVFVFSTQFREDLLPL FT LSCDTVPTKRQVLRVVMSYFDPLGIISNYTIHGRILLQDIWRSKANWDDPI FT TNEDFVSWQRWVLLAPQLTKVKIPRCYFPAYDPNSFETLQLHVFVDAGDQA FT YGAVAYFRIVDRGIPRCALVAAKAKVAPLKPLSTPRSELNASVIGIRLMKM FT IEKNHSFTINKRFLWTDSTTVLSWLHADPRKYRQYVALRVSEILSESEISE FT WYWIATSKNVADKTTKWGHGPDFDSESEWYSGPSFLLRPETEWTIKQPHIM FT DPPEELKAINVHRETGYESIIDFAEFSQLDSLLRKLAYLHHFRRCCVSKPR FT SPSATGKVMLNQEDYDAAEKSLWKIVQAQEFHNEIYTLKKNASLPLNERVR FT LGKSSPLTKLSPFIDEDGVLRLQSRIDPNAIYYAFNFRNPIVLPKQGHVTE FT LLVLKYHQRFGHANVETVVNELRQRYYIPSLRAVVKSLVKRCMWCRVYRAV FT PSPPKMAALPEPRVQPYIRPFSFVGLDYFGPMLVKRGRTNVKRWVALFTCL FT TVRAVHLEVVHSLSTESCKMAIRRFIARRGAPREIFSDNGTNFRGAARELA FT EEIKNINAEVAVTFTNVDTKWTFTPPSAPHMGGVWERKVRSIKEAFKSLCH FT RDKLDDEEFSTFLMEAEMIVNSHPLTFVPLDEPNQEVLTPNCFLLMDSNGA FT NNAPKIALDNSGALRRNWKLLNHLLNQFWKTWIKAYLPTIARRTKWFNEVR FT PLQTGDLVIIVDEGVRNGWLRGRVIKVYPGGDGQVRKVDVQTDSGILKRAA FT TKVALIDVQDNSKTNSAIGITGGT" XX SQ Sequence 6326 BP; 1852 A; 1467 C; 1433 G; 1574 T; 0 other; ttctcgaaaa ttgtggcgca caggatgagc ggaaaggact ccacaacatc cgtaccaata 60 tctccccaag ggtgcaaagg atgcgaccgc ccgattcatg tagacgacat ggttgcctgc 120 gacctatgca ataattggtg gcactttaat tgtgctggag ttcatgcatc cgtaagtgag 180 aggtcgtgga tttgcataaa ttgcgtcgaa tgccgtaaat cacttaagtc gttcaaatct 240 accagttcta atgctcgtct tcggctccga caacttgcag aggcaaaaga tctcgaagat 300 cgtatcttca tggaaaaggc ggaaagagat cgcgcttacc ttacccagaa gcatcaaatc 360 gaaaccgaaa ttgagtcaga tgtgacggaa acatccagca gcagagggtc tgatacgtcg 420 catagccgtc gcaaatcgct gcacgattgg atagtggacc aaaatcgttc tctagaaggg 480 gccgtaggtg gcttggaatt gaccgaaagc aaatctttgt ttgaaagagc agctacacgt 540 cctaggacag ggaccattcc taaaacagtt tcgggtacac atccagcaac ccctattttc 600 acatccactg cagaattgcc catcgttaat cgtaaagcag cagtacaggc agctcagagc 660 attgaactcg accattacac ggaaggaaag caactaccac acgattcatc ccaggtccaa 720 atgcaagaga gccttttgaa catgcaacat ccacgatctg accctgccgt ccgaagtctg 780 caacgtcgtg tttcaataca atcacgcgtt ggccaatctg tggctttggc tccaccgtct 840 aaaacattcg gcttactgga accagccacc atagaaaacc agccgatgat catggaacta 900 catacaaccg caccacttcg gggatcaggt tcaaatagat ctgtatgtat gccttgtcag 960 cctgaatacg ttgatgtatc agctcaacct cgactactgg aacaatcgca acacattttc 1020 gaacaagctg tgatgcataa tcagcgtagc actccacaac cagcaccgag atataaacat 1080 actatcaata aagaaacatt ctcttatcga ccagatgcac atcctctctc cgcacataca 1140 cgtgcaccac aggctgagtg gaatggtcat atgagttacc agcgaccgtt agcaccatca 1200 cgacattcaa tcaatgctca aatcaccgac agtactcaac tgtacgctga ccacgatggt 1260 cagccaaacc aataccacca tgctactggt caatcaggtt caccacaacg cgaccgagaa 1320 gagtatttac tcagccatca accatctgtt tgtcaacaaa gtcgaatagt gcaacaacct 1380 ctgccgttgc agtcggtgcc tgcgattagc gcacagcaac tcgctgctag gcaagtggtg 1440 cctagggagc taccgaaatt ttctggtgat cccttggagt ggcctatgtt tctgcaggca 1500 ttcgaaacga caacgcaact gtgcggcata caaccagatg aaaatctggc taggcttcaa 1560 aaatgtctcg ttggaagcgc tcgggataaa gtccagagta ttctcacgtt gcctgctgct 1620 gtccccaata ttattgcaac attacgggac gaatgtggcc ggcctgacca actggtacat 1680 cgtctgctca ttaaagtgcg aggtgcaccg catccaaaca tcaataagtt agatacgctt 1740 atcacgtttg gacgggaggt cagtaacctt gtcacttaca tcgaagccgc aaatcttcag 1800 gcccacctct cgaacccgat gttgctttcg gagttggtag ccaaacttcc gccgaatctt 1860 agactggatt ggggattgca ttcccgttca acaggagacg agtcactaaa agcattcagt 1920 gatttcgttg ctacgattag atctgcagca tgtcatgtat ccatgtcttc ggagttttcg 1980 caatcagaag atatttcatc gagaaggaag gataagtcta gcttcatcaa tgcacatagc 2040 tcagaggagt cttcaacttc gaaacccaag aacgctgagg tgagaccttg tccattatgt 2100 aactccgttg gacacagaat acgcgattgt gacatcttca aagaatcgac ggtaggggaa 2160 agacgcaaac tggtcgatca acacaggttg tgtcagcgat gcctagtggg ccatggcaaa 2220 tggccatgtc ggactaaaca accgtgcagc atgaatggat gtactgagat gcaccacaga 2280 cttctgcatt cagagcaagc tatcgataca gttagtacat cggatccacg agctctaatc 2340 tcaactcatc attgcgtact gcggtcaaca ttattcaaga taattcctgc aacattgtat 2400 ggggccggta aaaccatcac aacttatgtc ttcttagacg acgggtccag ccatacgttg 2460 atggaccagg atttagctga cgaattgggg ctcgatggag aaaatatgcc actttgtctt 2520 caatggacgg gaaatgtgac taggaaagaa ccaacatctc gacaagttaa tataagtatt 2580 tcaaacatcc acggaggctc taaatatgag atgttcgatg ttagaactgt taaaaagcta 2640 gacctccctc ggcagacttt ggagtacgag cagttgtctt cccagttccc atacttgcaa 2700 ggcctcccaa tcgcaagcta tcgtgaggcc gctccacgac ttctaattgg tgcgaacaat 2760 gcaaacttga ttttgacgtt agctcgtcga gagagaagaa tcggagaacc agtggcaaca 2820 aagacgcgtc ttggttgggc cttattcgga ggtataagat gcaaatctga atctgccaac 2880 cttttggtgc acgattgtgg atgctgtcaa gatgattcat tacatgagct tgttaaggaa 2940 tattttgctg tagataatct cggaattacg actcctgtaa tggaatctgc agaagatcag 3000 cgagcacaaa aaattctgca tgaatctacg aaacgcacat catctggtaa atttgaaacc 3060 ggattactct ggagaaggaa tgacatcgaa atgccgaaca gctatcctat ggcagaaaga 3120 agattgagat gcctcgagag aaggcttgaa agagaaccag aacttaaaag gaaagttgat 3180 gatttgattg gagaatatct acaactcgga tacgcacaca tagctactga ggatgagcta 3240 aaaaattcga acccaaagcg cacctggtat ttaccactag gagtggtacg caaccctcga 3300 aagcctgata aggttcgcgt tgtgtgggac gcagcggcca aggtaaatgg aatttctcta 3360 aattcacttc ttttacctgg acctgacctg ctgacacctt tgctgactgt tttgtgccag 3420 tttcgccaaa gacaatatgc tgtttcagcc gatatccgtc aaatgttcca tcaacttctc 3480 ataaagccag aagatcgtca agttcaacgt ttcttatatc gttctgattc tcgttccact 3540 ccaacggtgt atgtattaga tgtggccaca tttggcgcaa cctgctcgcc atgttcggcg 3600 caatacgcca aaaatttgaa cgcgtttcaa cacatggaca catttccaga cgcagcagaa 3660 gccattgtca aaaacacgta cgttgacgac tatctcgatt cacgggacac tatcgatgaa 3720 gctgtccaat tgtcacttga tgtacgagcg atcaaccaga aagctggatt tgagcttaga 3780 aattggcaat ccaactccaa agaaatctta cgacgtattg gagagaagtg tcatgattcg 3840 gtgaagaatt ttgctgctga aaagtcgact ccaacggaac gtgttcttgg aatgacctgg 3900 atgacagata atgacgtatt tgtgttttca actcaatttc gagaagatct actccctctt 3960 ctatcatgtg ataccgtacc aaccaaacgt caagttttga gagtggtaat gagttatttt 4020 gatccgcttg gaatcatttc caactatacc atacatgggc gaatacttct tcaggacatt 4080 tggcgttcca aagccaactg ggacgaccct atcaccaatg aagactttgt gagttggcag 4140 agatgggtat tgcttgcgcc acagcttacc aaggttaaaa ttccccgatg ctacttccct 4200 gcatatgatc caaacagctt tgagacgtta caactccatg ttttcgtaga cgctggagat 4260 caggcttacg gtgccgttgc ttattttcgt attgtagata gggggatacc aagatgcgcc 4320 ctggtagctg caaaagctaa agttgcgcca ctcaagcctc tctctacccc acgaagtgag 4380 ctaaatgcta gcgtcatcgg aattcgactc atgaagatga ttgagaagaa ccattcattc 4440 acaatcaaca aacgctttct gtggacggat tcaacaactg ttttatcctg gcttcacgca 4500 gatcctcgta aatatcgcca atacgtagca ctgagagttt cagaaatact atcggagagc 4560 gaaattagtg aatggtattg gattgctaca tctaaaaatg ttgctgataa gacaaccaag 4620 tggggtcacg ggcctgattt cgattctgaa agtgaatggt atagtggacc aagctttcta 4680 cttcggccgg aaaccgaatg gactatcaaa cagccccata ttatggatcc accagaagaa 4740 ctcaaagcca tcaacgttca tcgagaaact ggctatgagt cgataattga cttcgccgaa 4800 ttttcccaat tagattcgtt gctgagaaaa cttgcatatc ttcaccattt tcgacgatgt 4860 tgtgtcagca aaccaaggtc cccatcggct acaggcaagg tgatgttaaa tcaagaggat 4920 tatgatgcag ctgaaaagag tttatggaaa atcgttcaag cgcaagagtt ccacaacgaa 4980 atttacacgc tgaagaaaaa cgcttctttg ccgttgaacg aacgagttcg tctaggaaaa 5040 tcgagtccgt tgacgaaatt atcacctttt atcgatgagg acggagttct tagattgcag 5100 agccgcattg acccaaatgc aatctactat gctttcaact ttcgtaatcc tatcgttttg 5160 cccaaacaag gccacgttac agaactccta gtattgaagt atcatcaacg gttcggtcac 5220 gctaatgtag aaaccgttgt taatgagctg cgacaaagat attacatacc aagtcttcgg 5280 gctgtcgtaa agagcttagt aaagcgatgc atgtggtgcc gagtgtatcg tgcagttcca 5340 agcccaccga aaatggcagc gctgcctgaa ccacgagtgc aaccttacat acgaccgttt 5400 tctttcgttg gactggacta ctttggccca atgctcgtaa aaagaggtcg cacaaatgtt 5460 aaacgttggg tagcattgtt tacttgcctc accgtgagag cagtgcattt agaggtggtg 5520 cactccttgt ccacggaatc ctgcaaaatg gccattcgtc gcttcatagc tcgcagagga 5580 gcgccacgag agattttcag cgacaacggc actaactttc gaggggccgc tcgagaattg 5640 gcagaagaga tcaaaaatat caatgctgaa gttgcagtca ctttcacaaa tgtggatacg 5700 aaatggacct ttactccccc gtccgcaccc cacatgggag gagtgtggga gcggaaggtc 5760 cgttctataa aagaagcgtt taaatcattg tgccatcgtg ataaacttga tgacgaggag 5820 ttttcaacat ttctaatgga agcggaaatg attgtcaatt cccacccact gacatttgtt 5880 cctctagatg aacccaacca agaagtactg acaccgaatt gtttcttgct gatggattcc 5940 aatggggcaa ataatgctcc caaaatagct cttgacaact ctggggcact gagaaggaac 6000 tggaaacttt taaatcatct actaaatcaa ttttggaaga cctggataaa agcatatcta 6060 ccgaccatcg cacgaaggac caaatggttt aatgaggtac gtcccctaca aaccggagac 6120 ttagttatca tcgtcgatga aggtgtccgc aacgggtggc tacgtggacg tgtgatcaaa 6180 gtgtatccag gaggagatgg acaggtcagg aaggtggatg ttcaaacaga ttcagggatt 6240 cttaaaagag ctgcaactaa ggtggcctta attgacgttc aagataatag taagaccaat 6300 tctgcaatag gtattacggg ggggac 6326 // ID CR1_Ele36 repbase; DNA; INV; 3856 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele36. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3856 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3856 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 20 sequences with >97% identity, and ~98% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 335..3769 FT /product="CR1_Ele36_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAPNPSNSVEPRRQHQXSRPGPVVETEERGXHRAPF FT GKYRSVSFTSQTETTSALSNDPSHSNNIETQSHPGPVFGDMEGVYQSITSG FT YSSDENYARNSTSQPWRIVESILEAPGSSASVVPTPALAHHSRLXPGGNCS FT PGVFQPALAGKYIHDLAHTSPDSASNFSSMAGSTNTSPLTSRVSSQQDIIV FT YYQNAGGINSDVDEFCVAVSDNVYAIIVITETWLDSRTLSRQVFGNDYEVF FT RCDRNTNNSRKAKGGGVLVAVRSTFQARELENEAWRSLEQVWTSIKLGDRT FT LFLCALYIPPDRVRDRALVETHCQSVLSVIEMANATDELIVIGDFNLPGIS FT WKTSHSGFLFPDVDHSVIHANASFLLDNYSSATLSQINHVTNQNQRSLDLC FT FVSAQDTAPFLCEAPSPLVKTALHHPPLQLSIKTTLLHDFNISPAAVSYDF FT RKADHRNIANLLSSFNWDDILGNSDIDTAAQTFSNVLNYVIDRHVPKKSHH FT PPSRSPWQTNELRRLKSRKRAALRKYTKHRTTSLRRSYVQINHEYKRVAKR FT CFLQYQQGIQRKLKACPKQFWKFVNEQRREAGLPSSMMLNGNVASTPNDIC FT QHFARKFASVFTEDRLTEDDIQLAANSVPLLSENMDAIDIDRNVIARACSK FT LKKSFNPGPDGIPSAFLKEHLDSLLTPLLLVFQLSVTVGVFPSCWKLAYMF FT PVHKKGNKRDCNNYRGITSLSAIAKLFELIIMEPLFSHCKPYISTDQHGFL FT AGRSTATNLLCLTSYITNSMSERAQTDVIYTDLTAAFDKLNHRIAIAKLDR FT LGINGSLLQWLQSYLTGRHLFVDLGDCQSPTFAVSSGIPQGSHLGPLLFLI FT YFNDVNLVIKGPRLSYADDLKLFLRIRSTDDCRFLQLQLNAFADWCHLNQM FT DVNPLKCSVISFSRKKETIVHKYALFGQDIERVNQIKDLGVILDIQLSFKQ FT HISFVVDKASKILGLIFRISKHFSDIYCLKSLYCSLSRSVLEYCSVVWTPH FT YNNGVERVESVQRRFLRFALRRLPWSDPYRLPSYESRCQLIRLEPLSVRRD FT TARALFIADILQGHIDCPALLEQVNINVQPRALRNNVTLRLPLQRTNYSMF FT GAIYGLQRLFNRVAALFDFNLSRQLLRRRFSSFFDRS" XX SQ Sequence 3856 BP; 998 A; 953 C; 809 G; 1093 T; 3 other; gttaaatttg tatctctagt ttgggtttat ccacaaattt ctgcattaca acgaaaaaat 60 tgtagtaata gcctttttat acgattagat gaaaaaactt tatttggacc aatttcacaa 120 gaaaatcgaa ttttcaattt gactgttatt tccaacggga tgatatcagg tatgcaagat 180 aattattaga ttaccattat caagcatttt gagacgtatt taaggttttg tccgaccttt 240 gtatgaagga ccgccactgt gagccgatcc aaactacatc gctgcatatc ctcctctgga 300 atgcgcttca ccaggacgca cgaaccgacg ctcaatggaa gcccctaacc cgtccaactc 360 agtcgagccc cgccgccagc atcaackcag ccgtcctggt cctgttgttg agactgagga 420 aaggggctkc caccgcgctc cttttggcaa gtatcgtagt gttagcttca catcacaaac 480 tgaaaccact tctgctctca gcaacgatcc ctcgcattcg aataatatcg aaacgcagag 540 tcatcccggc cctgtgtttg gggatatgga gggggtatac cagtctataa cctcaggcta 600 ctccagtgac gaaaattatg cccgaaattc tacatcacaa ccgtggcgca tcgtggaaag 660 cattttggaa gcccccggct cttctgcctc agtcgtgcct actccagccc tcgctcatca 720 cagtcgcctc gktcctggtg gcaattgtag tccgggggtc ttccaacctg ctttggcagg 780 caagtacatt cacgatttgg cccatacctc gcctgattcc gcttcaaatt tcagcagcat 840 ggccggatcc accaacactt cgcctcttac atcccgggtt agcagccaac aagacattat 900 cgtatactat cagaacgccg gtggtattaa cagcgatgtg gacgagtttt gcgttgctgt 960 ttcggataac gtttacgcca tcattgtaat cacggagacg tggttggact cccgcactct 1020 atctcgtcag gtatttggta atgattatga ggtatttcgt tgcgatcgca ataccaataa 1080 cagccgcaaa gcaaaaggtg gaggcgtgtt agttgcggta agatctacat tccaagctag 1140 agagctcgag aatgaagctt ggaggagctt agagcaggtg tggacgtcga taaagctagg 1200 tgatcgtacg ttatttctgt gcgccttgta cattccgcct gatcgtgtac gcgatcgagc 1260 ccttgtcgaa actcattgtc agtcagttct ttctgttatt gaaatggcta atgcaaccga 1320 tgaactcatc gtgatcggcg acttcaacct acctggcata tcatggaaaa cgtcacatag 1380 cggctttctt tttccggatg tggatcactc ggttatacat gccaacgcct cgtttctcct 1440 ggataattac agttcagcta ctctctctca aatcaaccac gtcaccaatc aaaatcaacg 1500 cagtttagac ctctgctttg tgagtgctca ggatacagcc ccgtttttat gtgaagctcc 1560 ttctcctcta gtaaaaacag ctcttcatca tcctcctctg cagctttcca tcaaaactac 1620 gctgttacat gactttaata tatcgcccgc cgcagtgtca tacgattttc gcaaagctga 1680 ccaccgtaac atcgccaact tgctctccag tttcaattgg gacgacattc tcggcaattc 1740 tgatatcgat acagctgcac agactttttc taacgttttg aattacgtca ttgaccgaca 1800 cgtccctaaa aagtctcatc atcctccatc tcgttcccct tggcaaacga atgaactgcg 1860 tcgtctgaaa tcgagaaaaa gagcagccct taggaagtac acgaagcatc gaacaacatc 1920 tcttcgtcgt agctatgtcc aaatcaatca cgaatataaa cgagtggcga agcgatgttt 1980 tctgcagtat cagcaaggaa tacagagaaa acttaaagca tgtcctaagc agttttggaa 2040 attcgtgaac gaacaacgtc gtgaagctgg cctgccgtcc tccatgatgc tcaatggcaa 2100 cgtagcttcc accccgaatg acatttgtca acattttgca cgaaaatttg ctagcgtctt 2160 taccgaagat agattgactg aggacgatat tcagctcgcc gccaatagtg ttcctcttct 2220 cagtgaaaac atggacgcta tcgatattga cagaaacgtg attgctagag catgctcaaa 2280 gctaaaaaaa tccttcaatc ctgggcccga tggtattcct tcagcgttcc ttaaagagca 2340 tttagacagt ttgctgactc cactgctcct tgtttttcag ctgtcagtga ccgtcggcgt 2400 cttcccgtca tgttggaagc tagcgtatat gtttcctgtt cacaaaaaag ggaataagcg 2460 ggactgcaac aattatcgcg ggattacctc attaagtgct atcgccaagc tgttcgagct 2520 gattatcatg gagcccctgt tttcgcactg taagccgtac attagtaccg atcagcatgg 2580 tttcttggct ggccgatcca cggctactaa cctcttatgc ctaacatcgt atatcacgaa 2640 cagcatgtct gaacgtgcgc aaacagacgt tatttatacg gatttgacag ctgctttcga 2700 caaattaaat caccgcatag ccattgcaaa gctagatagg ctgggaatca acggcagcct 2760 tttacaatgg cttcaatcgt acctgaccgg tcgtcatctg ttcgtcgatt tgggagactg 2820 ccagtcacct acttttgctg tctcatctgg aatccctcag ggaagtcacc tggggcctct 2880 gttattcctg atttacttca acgatgtcaa cctggtgatc aaaggaccac ggctttctta 2940 cgcagatgac ctgaagctct ttcttcgaat ccgctcaact gatgactgca gatttcttca 3000 acttcagctg aacgcttttg ctgactggtg ccatttaaac cagatggatg ttaacccttt 3060 aaaatgctcg gtgatatcgt tctcacgcaa aaaagagaca attgtccata agtatgcctt 3120 atttggtcaa gacatagagc gtgttaacca aattaaggac ctgggagtta tcttggatat 3180 ccaactctcc tttaagcagc acatctcgtt cgtcgttgac aaagcttcca aaattctagg 3240 acttatcttt agaatttcta aacatttctc ggatatctac tgtcttaaat cgctctactg 3300 ctccctgtct cgttccgtct tagagtactg ctcggtagtt tggacgcctc actacaacaa 3360 tggagttgag agagttgaat ccgttcaaag gagatttttg agatttgcac tacgaagact 3420 tccgtggtcg gatccctatc ggctgccgag ctatgaaagc cgttgccaac ttattcgcct 3480 ggaacccctg tccgtccgca gagatacagc tagagctttg tttattgccg atattctgca 3540 aggacacata gattgccctg ccctgttgga gcaagtcaac ataaatgtgc agccccgagc 3600 ccttcggaat aacgtgacgc tgagattacc tcttcaacga actaattata gcatgtttgg 3660 cgctatttac ggtctgcaga ggcttttcaa tagagttgcg gctctctttg atttcaactt 3720 gtctcgccaa ctacttcgac gaagattcag ttcgtttttt gatagatcgt aagaatgaca 3780 atttttaata tgtttagttt taagatgaca tcattggggc tttgaagcct gttggtggta 3840 caagtaaata aatgaa 3856 // ID TTAA2D_AP repbase; DNA; INV; 431 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA2D_AP. XX NM TTAA2D_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-431 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2071-2071 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 431 BP; 150 A; 65 C; 69 G; 146 T; 1 other; gaggatgtca gcgcactatt tgttttctct ctctctgacc cacgcgcaac atagcaaatt 60 tacgtttagc agaaccaacc ttgtgttatt acttttaata ttagagtgaa ttcacctatt 120 atcaaacttt aaagttaaaa acattatctg tgtctctacg ttggctattt tatgatattt 180 taatttttat gtaagttatg agcgtgtaaa ttattacaat ttaaaaatgc ttataaattg 240 tataaaaatt aaaatattgt aaaaaaagcc aacgtagaga cacagaagta atgttcttaa 300 cttaaagttt gataataggt gaattcactc taatattaaa agtaacaaca carggttggt 360 tctgctaaac ataaatttgc tatgttgcgc gtgggtcaga gagagaaaac aaatagtgcg 420 ctgacatcct c 431 // ID BEL4a_Cis_I repbase; DNA; INV; 2487 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of BEL LTR Retrotransposon from Ciona savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; internal portion; KW BEL4a_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-2487 RA Smit A.F.; RT "BEL4a_Cis_I - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000072, Ci000434. Full ORF from bp 44 to 2443 encodes CC peptidase, but lacks RVT and int. XX SQ Sequence 2487 BP; 836 A; 494 C; 552 G; 605 T; 0 other; gtaacttaac aacgacgtcg gcgttgattg gcatctacag tgactggatc cagataatcc 60 acagagtgaa cgaataatgg atgacacaaa tacggacaaa attaatcggg cggtaattga 120 tgagacgaat gttacagtgc gtcggcatgt taatttaaat agaaagttaa ttgcccaaga 180 agcaaagttc aattattcaa tcgaaaaggt gcaggaatcc gtgatcgtgt ctgatgaaca 240 cgaaccaata caacgggcag tagccgattt atttgaacaa aaaaggaatt tccgaatctt 300 aatgactgac cttgagcatg agggtgatcc gtccatgttt ggtaagtgga ccgaaaaatc 360 agaacgatta cagattttat tcaatgagac cagagaccta gctcaaaacg caattgacaa 420 acggcgacaa accaaattgt gtgacaaggc gactaggtca tcccctgcaa gctacgctct 480 accaataaaa ttagaagttt ttaccggtaa tccccttcat tttccatcct ggaataccgc 540 gttcgacgcg ctcgtagatt ctaacactaa tacgtgtgct gcacagaagc tgaatttgct 600 gcagcaatac ctttcggggg accccagatc ggtggtggaa ggctacgtgc tgttgcagtc 660 ggaggaggcg tacacccaag caaggaaaat attgaaagaa cggtatggaa acgatagttt 720 ggttagtaag tccttcacgg ataagctaaa tgcttggccc aatataaatg taaatgatgc 780 taagggtcta aggagattct ccgactttct caatcaaatt gtggccgcca aaagccaaat 840 caccgaatta ggggttcttg acttctccaa tgaaagtaca aaaattgttg cacggttacc 900 gccgtttatt attaacaaat ggcgtgacat tgtactctca tacaaattat ctgaacatgg 960 aacttatcca ccattcgaaa aattagcaga atttgtgtct ttgcaatcgg agcgggagaa 1020 tatcccggaa ttgcaaggta ttggcttgac ctctgatcgc agagggaact tacaaaacaa 1080 accgaccact tcattctaca acgcgtcaag tgaaccatgg aagtatcaac aacaggttaa 1140 tagatcgtcg ccaaatttga actgcacgta ttgtaaaagt tataaccatg gactaaacga 1200 ttgcaatata tttatgaaat tgaatttatg tgatagaaag gattttctgc gaataaataa 1260 tttgtgttat gggtgtggtt ctagctatac tcacattgcg agagagtgca cgcatcgagc 1320 tacatgcaaa atatgcaaac gttctcactt gacctcatta cacgtttatt ggaccccaca 1380 attctattgc aatcgcctga aatctactcc aactgacata acctgcagac aagaaagttc 1440 gatgatagtt ccggtatggg taagggatat aagtcaacca caatattcaa tcctttgtta 1500 cagcattttg gatggtcaat cgaacacaac atttatatcg gaggagatat gtcgacgatt 1560 gaataagaac ggaactccaa ctcacctgga tttaactacc atggctggac gagggcagat 1620 tgcacgcagt cgtcgaatat caaatattga ggtactgagc tatgacagaa aagtgaaatt 1680 taaaatagat caatcgtaca cctgtccaga aatttcacag gatcgttctc aaataccaat 1740 acccagtcac gctaaaagat ggtcgcacct aaatcggatt gaaaaattta tattgcccct 1800 gcaagcaaac gtaccaatcg gaatgctaat tggaactaat gttcctggag ccatcagacc 1860 acgtgaaatt atagcagggg gagaaaatga gccatatggt caaaaatcgg ctctaggctg 1920 gggaattgtt ggaagttggg ttccaaaaag gggcacagag aatttgcgtc agtgctatgg 1980 gacggaaact aactccaaat tgggcgtacc gaaaaacaaa cattcaaaaa cgaaagagtt 2040 aggttcatct ggactgtctg aatttgtcga accaccgctc agcgacaaag ggagcataac 2100 ggatgctgaa tgtccaaaac cattaacgtc gagccacaac ctaacattaa agaagacgtc 2160 ttcatgccct attagagctg ttaaaaagaa cgatgtgtac gtgaagcgac cttgtaaaaa 2220 ggtagagtac cggacaaatc aattctgggc tcgcagcaga aaaaagttct tgctcacttt 2280 gaaacaaaga aagaaatggc aagatgggca cacgaacatg aaagtgaggg atgaggtacg 2340 ggcgacaccg gggattcccg aacggggagc caaggtggaa ggtcacgcac accgcataat 2400 aaaatcgcaa ataagatggt tacatatgca aagaaacttt tagctaatat cttgtttaaa 2460 aaaaatcggt acatttttgg agagcca 2487 // ID Gypsy-98_AA-I repbase; DNA; INV; 6983 BP. XX AC supercont1.272; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-98_AA_; KW Gypsy-98_AA-LTR; Gypsy-98_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6983 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.272; Positions 39110 32128. XX CC Positions [3840-4265] - Reverse transcriptase CC Positions [5323-5799] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 499..3024 FT /product="Gypsy-98_AA-I_1p" FT /translation="MTLDREYLVGLCVTYLTEEELIFELEIRNLLKDRDRS FT LSVNRRKLKNALREELDGKVRIFTYTKDSAVEMKVCKERFVRVMQNLGDDS FT LDKLMSKTGLLHLYHRLKLFRKTYGVGSLANEANWLFSNVIQTYIDYFDEN FT ALIVPATMPSQAPSVPVDLQGAASLEAQAEASAHDLIILGQSVASLTISTG FT AIPKNTVSHSTPSFPNTTSMSTVELDPIVSNQIAPASVSVDRTPVSISAPV FT VTCSFSGGVQLTWATPSYQVASRVQRARQVRFDTGRDFPIKAYGHLSEVIS FT SAGEQCSVQSSISIVSGSVQSSVSTWQGENNVIPGQARATSGPPVSETMRR FT LPVYSEAYEIPVSSQWSDPYDFPRSTPSISRYNPYQVPQFVQDCPFTMSTT FT QTFAPPQGQSFNPFSGSNFPTQTQSMYRPTDVRPPRSEPPLADLLGLDLRR FT TPVQTSNPNPTNNRTSDFRSSIPNNASFQSGRPYTPAPTPPLNPYQDFDFN FT PVANRAKTIPVVKWPMKYAGDDRGTGLNSFLWEVNDWTKSEQISEPELLRS FT FGNLLTGKAKMWFTSNKHRFTTYSELIDSLKATFRHPDLDHYLLMEIHQRR FT QQKNELFLEFFLDVEKKFKSLSTPVSEVEVVQAVRRNMRAEYKRALIGREF FT HDLFSLQMAGQDVDATNTYLFSKPQVQTNAIEATGELPYNQNRKGGQSSQP FT FQKGSNQGKNQPWNQTNKSNPGSSANNPPGKPPVRKDQRFQQDGKSQSNPR FT SEKNGSSDSESPDSPPSKDVSKGVETQIRTHIPLNDKFICFNCRSQQHLTD FT QCSQPYKVHCQVCGFQGFPTHRCPFCSKNGQRRKDNGPSK" FT CDS 3444..4766 FT /product="Gypsy-98_AA-I_3p" FT /translation="MDFFSAFNLHISQGTDILFLVDYEENSDSNPPDFVEL FT SSEQEAALEIVKQKFKPAMSDQLEVTSLVQHFIQLTDEYKDSPPIRVSPFP FT YSPAIHKALNEEIDRLLRLGIIEESSSEWALNAVPIKKPNGTVRLCLDARK FT LNARTKRDSYPLAHVSRILGRLGKTRYLSTIDLKDAFLQIALSEESKPLTA FT FCIQGRGMFQYTRLPFGLTNSPATLSRLMDKILGAGELEPWVFVYLDDIIV FT ASDNFEDHIRLLEEVTRRLSDANLSINLEKSKFCRSEVPFLGYLLSCDGLR FT PDPSKVQGILDFEAPRTIRQVRRFLGMVNYYRRFIPDFSTLSAPISDLLAG FT KPKNVRWSKEAEEAFRTIKERLITAPILSNPDFEKEFIVQTDASDRAVAGV FT LTQIQDGSEKVISFVSQKLNAAQQNYYRKRGTRCTHFGRKIQGLHRR" FT CDS 4684..5814 FT /product="Gypsy-98_AA-I_2p" FT /translation="MRLSKITTEKEALAVLISVEKFRGYIEGSHFTLITDA FT SALQYIRNNKWRPSSRLSRWSLELQHLDMTIVHRRGVDNIVPDALSRSICA FT IRASKPGISSYEDLIVKVEEEPDKYSDFRYEDGQLWKYVPVDEEPYDVRFE FT WKMIPPPGNRQRIIEQEHEGCFHLGVEKTLNRLQLRYYWPHMSSEIRKCIQ FT KCAICKESKPATVPTVPVMGKQKLADHPWQIIALDYIGPLPKSKSGNIHLL FT VIQDLFSKWCQLHPVRRIEAGNLCKILREGWFLRNSIPEIVLTDNASTFLS FT KDFKALLEQHGIKHWTTFRHHSQGNPVERLNRSINAAVRTYCRKDQRGWDS FT KIADIEHVFNNTMHSATGFTPFFVTRKHEIARRI" XX SQ Sequence 6983 BP; 1973 A; 1588 C; 1579 G; 1843 T; 0 other; tgaacctcta atttgttata tttggcgccc taacgaaaaa gtaactgttg gatccattcg 60 agagctaatt tttcgttgga gaagggggga gatttcacgc atgttggatc gaaaaactgc 120 attaaaaaat tacctctctc aaaattataa ttgcactgtt tcgttctata tagcatcgtt 180 cgatcatttc tattccatgc gaacgatagg gtgatcgctc agataggtta gatacagcac 240 gtgctaggta ggattttgtg tggtaagata gcacggtgcc gattgaacaa acaacacaca 300 cgatcggttt cgctttgtta aaaaattggc aaagcctaaa aatcagaaat tttcgcttct 360 atctgcatgc ctatattttt tgtgttacta ttgtttttgg ttgaacatat gcggtatagc 420 gctgctgagc cgtaaatcga tccgtatttg agtattgtct attcgttttt ttgcaccctg 480 tttcagtttt atttcgcaat gacactggac agagaatacc tcgttgggtt gtgcgtcaca 540 tacctaaccg aggaagaatt gatttttgag ctcgagattc gaaacctgtt gaaagatagg 600 gatcgaagcc tgagcgtcaa ccgcagaaaa ttgaaaaatg ctctgcgaga ggaattggat 660 ggcaaagtgc gtatttttac gtacaccaaa gattctgctg tggagatgaa agtatgtaag 720 gagaggtttg ttagggtgat gcaaaattta ggagatgact cgttagacaa attgatgtca 780 aagacgggac tgttacatct ctaccatcgc ttgaagcttt ttcgaaaaac ttatggcgtt 840 ggcagtctag caaatgaagc caattggtta ttttcgaatg ttattcaaac ctacattgac 900 tatttcgatg aaaatgcgtt aattgtacca gctacaatgc cgtcgcaggc accatcagta 960 cctgttgatt tgcaaggggc tgcgagcctc gaggcccaag ctgaagcttc agctcatgat 1020 ttgataattt tgggtcaatc ggtggcttca ttgacgatct caacgggagc aattccgaag 1080 aatactgttt cacattctac tccgtccttt cctaacacaa cctcaatgtc tacagtggaa 1140 ctagatccga ttgtttcgaa tcaaattgca ccagcgtctg tatcagtaga tagaacccca 1200 gtgtcaatta gcgcaccggt cgtcacgtgt tcattttcgg gtggagttca gctcacttgg 1260 gcaacaccaa gctatcaagt ggcatcgcga gtgcaaagag caagacaggt tcgttttgat 1320 acagggcgcg attttccgat aaaggcatat ggtcatttga gtgaagtgat ttctagcgcg 1380 ggtgaacagt gtagtgtgca aagttcaata agtattgtga gtggctcagt gcaatcaagc 1440 gtcagtacct ggcaggggga gaataatgta atacctggtc aggcgagggc tacatcggga 1500 cctccggtat ctgaaaccat gcgacgacta ccagtgtact ccgaagcgta tgaaattccc 1560 gtgtcatctc aatggtcaga tccttatgac tttcctcgtt cgacaccttc catttctcga 1620 tacaatcctt atcaagttcc acaatttgta caggattgtc cttttaccat gagcacaact 1680 cagacttttg ctccaccaca gggtcaatca ttcaatcctt tttcaggatc gaatttccca 1740 acacaaaccc aatctatgta tcgtccaaca gatgtccgtc ctccaagatc tgaaccacct 1800 ttagccgatc tgcttggttt agatttgcgt aggacacctg ttcaaacttc gaacccgaac 1860 cctaccaaca accgaactag cgacttccgc agttcaatcc caaataacgc ttcttttcag 1920 agcggtagac catatactcc cgctccaaca cctcctctga atccttatca agacttcgac 1980 ttcaacccgg tagccaatcg tgccaaaact ataccggtag taaaatggcc gatgaagtac 2040 gctggtgatg accgaggtac cggactaaat agcttccttt gggaggtaaa tgattggacc 2100 aaatcagagc agatttcaga gccagagtta ctgagatcct tcggtaacct tttaactggc 2160 aaggcgaaga tgtggttcac gagcaacaaa catcgattca caacatattc ggagttgatc 2220 gatagtttga aagcaacatt ccggcatcca gatctggatc attacctgct gatggaaatt 2280 caccaaaggc gacagcagaa aaacgaattg tttttggagt ttttcttaga tgtggagaaa 2340 aagtttaaga gcttatcgac tccagtttcg gaagttgaag ttgtacaagc ggtgcgccga 2400 aacatgcgcg ctgaatacaa acgagctcta atcgggagag aattccatga tttgttttct 2460 ttacaaatgg ccggacagga cgtcgatgcg acaaatacgt accttttctc gaaacctcag 2520 gtacaaacca atgctattga agcaaccgga gagcttccat ataatcaaaa ccgtaaggga 2580 ggacagtctt ctcaaccatt ccaaaaaggt tccaaccaag gaaaaaacca accttggaac 2640 caaactaata aatccaatcc aggatccagc gcgaacaatc caccaggtaa acctccggtc 2700 cggaaagacc agcgtttcca acaggatgga aaatcgcagt ccaatccaag atcggagaag 2760 aacggatcca gcgattcgga gtcaccggac agcccaccct ctaaagacgt cagtaaggga 2820 gtggaaaccc agattcgcac ccacattcca ttgaacgaca agttcatctg tttcaattgc 2880 cggagtcagc aacacctgac cgaccagtgc tcacaaccgt acaaagttca ttgtcaagtg 2940 tgcgggtttc aagggtttcc tacccaccgc tgcccatttt gttcaaaaaa cggacaacgt 3000 cggaaggaca acggtccttc caaatagtca attccacgaa ttcagatcaa ttggcaaacc 3060 tctccattga tgagtcgctt ggcgaactcg gatatgaaga gttagcaatt gatcggggcg 3120 taccggcgtc ggacgtccca ccgtcgacgg ttctttcagt ccttcacgac aatagaccgt 3180 atattaaacc caaaattttt ggaatttctg ttcgcactct tctggattgc ggtagtcaga 3240 aaactttggt ttctgctaaa gttgcttctc tttggacagg tcctaaaacc aagattttcc 3300 caagcaatct tacactaacc agtgcctctg gtgatgcact gaatgtggta ggatgcattt 3360 accttccttt cgatttcagg gatcaaatta aagttttaga gactaccatt gttgaagacc 3420 ttcccgttga ttgcattgcc gggatggatt tcttctctgc tttcaacctc cacatctcac 3480 aaggcacaga tattctgttc ctcgtagatt atgaggaaaa ctcagacagt aacccacctg 3540 acttcgttga gctcagctcc gagcaggaag cggcactgga aattgttaag cagaaattca 3600 agccggctat gtcagaccag ctcgaggtta cctcactagt ccaacatttt atacaactga 3660 cggatgaata taaagattct ccacctatac gggtatctcc attcccgtat tcacccgcta 3720 tacacaaggc tttgaacgag gagatagacc gtctacttcg cctcggaatt attgaagagt 3780 cgtcttcgga atgggctctc aatgcagttc cgattaaaaa acctaatggt actgtacgtt 3840 tatgtctgga cgcgcgaaag cttaatgctc ggaccaaacg agacagttat ccgttagcgc 3900 acgtcagtcg tattttagga cgattgggca aaactcgata tctgagtaca atcgacttga 3960 aggatgcatt tcttcaaatt gctttgagcg aagaatcgaa gccgttaaca gcgttctgta 4020 tacaaggtcg cggaatgttt caatacacgc gcctcccgtt cggcctcacc aatagccctg 4080 caactttgtc cagactgatg gataagatcc tcggagctgg agagctggag ccgtgggtgt 4140 ttgtctatct agacgacatc attgtggcca gcgacaactt cgaggaccat atccgtcttt 4200 tagaagaagt aacacgaagg ttaagcgatg ctaatctatc gattaacctc gaaaagtcca 4260 agttttgccg atccgaagtt ccgtttctgg gatatctttt gtcctgtgat ggtctccgtc 4320 ctgacccgtc aaaagttcag ggcatcctgg attttgaagc tccccgaacc atccgtcaag 4380 tccgacgctt tctcggaatg gttaattatt accggcgttt tatacccgac ttcagcaccc 4440 tctcggcgcc gatctcggat ctgttggccg gaaagcccaa gaacgtgcgt tggtctaaag 4500 aagcggagga agccttccga acgatcaaag agcggttgat caccgctccc attctatcaa 4560 atcccgattt tgagaaggag ttcatagtgc aaacggatgc gagtgatcgc gccgtagcgg 4620 gtgtactcac tcaaattcag gacgggtcgg agaaagtcat cagctttgtt tcccagaaac 4680 ttaatgcggc tcagcaaaat tactaccgaa aaagaggcac tcgctgtact catttcggta 4740 gaaaaattca ggggctacat cgaaggtagt cactttaccc taataacaga tgcttcggcg 4800 ctccaataca tccgcaacaa caagtggcgt ccgtcgtcac gacttagtcg ctggagcttg 4860 gagttgcagc atctggatat gaccattgtc caccgccgag gtgtcgacaa tatagtgccg 4920 gacgcattgt cccgcagcat ttgcgctatt cgagcttcca agccaggaat atcatcgtac 4980 gaggacctga tcgttaaggt ggaggaagag ccggataaat actcagattt ccgttacgag 5040 gacggacagt tgtggaagta tgtaccggtc gatgaggaac cttacgatgt acggtttgag 5100 tggaagatga tcccgccgcc tggaaaccga cagcgtataa ttgagcaaga acatgaggga 5160 tgctttcacc taggagtaga gaagacgctg aaccgtctac aacttcgtta ctattggcca 5220 catatgtcgt ccgagatccg gaaatgcatt caaaaatgcg ccatttgtaa ggaaagcaaa 5280 ccagcgactg tcccaacggt gccagtaatg ggcaagcaga agctagcgga ccatccctgg 5340 cagataattg ccttggatta cataggcccg ctgcccaaga gcaaatcagg aaacatccat 5400 ctactggtga tccaggatct attcagcaag tggtgtcagc ttcacccagt gcgaaggatc 5460 gaagccggaa acctctgtaa gattctccgg gaaggatggt ttctgagaaa ctccataccc 5520 gaaatcgtgt tgactgataa tgcgtcaaca ttcctttcga aggactttaa ggcattattg 5580 gaacaacacg gaataaagca ttggaccaca tttcgccacc atagccaagg aaatccggta 5640 gaaagactca accggtccat caacgccgca gtgcgaacat actgccggaa agaccagaga 5700 ggctgggact caaaaatcgc tgacatagaa catgttttta ataacacaat gcactcagca 5760 actggattca ctccgttctt cgtaacgagg aagcacgaga tcgcacggag aatctgaaag 5820 aatctgaaga aagcgtacgc aaccggcgct cagcgctaca acttgcgcaa acgcgtgagg 5880 ccggacgact ttcaagtggg tcagcaagtg taccgacgca atttcaaaca atcaaatgct 5940 ggggaatatt acaatgccaa actggctccc atgtatttac cgtgcaaagt gattgtgaaa 6000 cacggatcta gttcttacga attggaggat cttgaaggga agaatctggg tacatggcca 6060 gcagagcata tcaaaccgta aattatccct tccgttacgc ttgcaaccaa tagctatttc 6120 ctttctaaac tgtttcctgt aattttcaaa ttttgatgct ttagccagag agtaagtgag 6180 ctcactctcc gagtcactca gcaactgctg ctgctctctc aagtgtactg agaggttttg 6240 catgctgtga gtggttcaag ccttctatgt gacgtctatt gtacctctca gtgggagtag 6300 ggtgaattag agacaagcat aggagatctt tttcccgtct taccagacgg atattaaaat 6360 tttaccggcc ggccgataca aaaataaatt ttattccaat tctaacacta actgatcgtg 6420 aggaaaacct aatgtgtctt ttgaatgaaa agatggccaa aatagggaat ttctgaacga 6480 tcaaaaccga ggggagtatg gcttatgaat ggtttaaggt ttctattgtg aacgggacag 6540 taataaaaga aaagagcagt tcgatccact tgcggaaagt tttccaccat tgccagtaac 6600 tggaggttgc tgtaacttat ctatttaggt gggacgaaga taaatagtca accagcacca 6660 ctcttgccta aggcagagag atacgaatgt tcgaggagtt cgataaaccc aacattggac 6720 tcaccgtgat ggcgttgcta gatcacgggg gtcccccaca ttgcgcgtac ccgaatcgag 6780 cattctctaa atccagacat tagaccgaat ttgatcaact taaatttgat tccgtttgta 6840 gtttgcacat aagtttattt catatttctt tattttgtta tctggttgtt cattttggtg 6900 ctttaatata ttattttgtt gtttgatttt tgtgttggtc cagaaaaaaa aaaatctttt 6960 tttttattgt gccttggggg aaa 6983 // ID DNA4-2B_AP repbase; DNA; INV; 783 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-2B_AP. XX NM DNA4-2B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-783 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1738-1738 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4 bp TSD (TATA). CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 783 BP; 302 A; 86 C; 84 G; 301 T; 10 other; agggtgattc atcaaccgcc ccccattxtt tctttaataa tgaatttatt caaattctga 60 tttttggaat ttttaaatac ttatgcttaa agaccatatt ttcaaatact tagatttttt 120 tgtaccattt aagaatattg tctttagcga tacaaaactt ctxtttttca aatgagaacc 180 tacctttttt actgtaaatt atttagtaxa ttattttttt aatgtttatx tacgtatcta 240 attcgaaatt cgaacgagta gtttttxagt tatttaaatg taaatactat agataattta 300 aattacgtaa taatacattt ggaagataat atgagaattt atatxttaaa taatatggta 360 attcgtattt aataxttttg gaaaaatgtt ctgaataact ataataaata ataattaata 420 tattaaatat txaatttaat ttatattttt ataaaatcat tattataagg actatactta 480 attatcctta tagttattax atttaaataa ctcaaaaact actcgttcga atttcgattt 540 agatacgtca aaattttcga aaaaatxatc cgtttaaata atattataca gtaaaaaaag 600 gttgatttga aaaacagaag tttgtcacaa tcaccacagg acactccttt aaatggaata 660 aaaaaatcta aatatttgaa aatacggttt taagtatcta cttaaaaatt ccaaaaatca 720 gaatttgaat aaatgcatta ttaaaggaaa aaaaggtggt gagcattctt ggtgaatcac 780 cct 783 // ID Copia-10_AA-I repbase; DNA; INV; 2599 BP. XX AC supercont1.241; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_AA_; KW Copia-10_AA-LTR; Copia-10_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2599 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.241; Positions 1460717 1463315. XX CC Positions [1506-2033] - Integrase core CC 'TATTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 147..2597 FT /product="Copia-10_AA-I_1p" FT /translation="MAEQNKFAFARLSNHNWQIWKFRMEMLLTREELWYVV FT GDVKPEPVTDQWTKDDRKARATIGLCIEDNQFGLVKNADSAKAFWDELKAY FT HEKNTVTSRVSLLKKLCSVNLAEDGDLECHLVVLEDLFDRLSNAGQPLEES FT LRIAMILRSLPDSYGTLVTALESRADADITMQLVKSKLLDEFERRKERSGD FT SFDMKAMKSAMQSVGGVSERVCYFCKKPGHVRRNCRLYLAKQKSGEDEKKP FT EGKSIAKKVKNETCSAGVCFMVADDGQRECWFIDSGASCHMTSDKKFFTTF FT EDKAGPNVILADGKVAATAGCGEGVVLCVGGDGNTIEVKLRDVLYVPSLTS FT GLVSVDKLTAKGFVVKFRKDGCSICDVSGKVVVAGEQTGSLYKLKLAEVAR FT KVEGKMHNHNCQHQWHKRFGHRDPAVLSKIVDGKLGVGVKVTDCGIRQTCE FT PCLEGKLSRNPIPKVAERKSKRVLELVHTDLCGPMRTTTPGGKRFLMTMID FT DFSRYTVLYLLEKKSEAAGKIKQYVRYVENLFGQKPQVIRSDGGGEYANEE FT LRRFFAEEGIKAQYTTAYTPQQNGVAERKNISLQEMATTMLLDAGLDKRYW FT GEACVSAAYLQNRLPSRSVDTTPYEKWCGRKPELGHLKAYGCPAYVHVPGV FT KRGKFESKARKLTFIGYSEEHKGYRFVDTGTDMVTISRDAKFLEMQDPKCS FT SREKVNEGSEVEWFSGAQPTEVDNFDGSELDKSEEEDYFDLDNTAGEQDPL FT QVKEEIQDSSADENPEDMRRSRRSNRGVPPSHLSEYVVGIARADGRSARSQ FT EEIFRGMQRGSALRR" XX SQ Sequence 2599 BP; 724 A; 479 C; 802 G; 594 T; 0 other; ggtaatggtc ccagaggaaa tccgaaatat tatttccgga tagtttagga agaaagtgtg 60 agaaactgtt ttcctgtttc gggtgttcgc gccgcgtgtg attgaatttt tgtttcctgc 120 tgtggtgatt cagttcggaa aacaaaatgg cggagcagaa caaattcgcg tttgccaggt 180 tgagcaacca caactggcag atctggaagt tccggatgga gatgcttctg accagggagg 240 agctctggta tgtggtgggc gatgtgaagc cggagccggt taccgatcag tggacgaaag 300 acgaccggaa agctcgtgcc accatcggtt tatgtatcga ggacaatcag ttcggtttag 360 tgaaaaacgc ggacagtgcg aaagctttct gggatgagtt gaaagcatac cacgaaaaga 420 acacggttac gtcgcgtgtg tctttattga aaaagctttg cagtgtgaac ctcgcagaag 480 atggtgatct cgagtgccat ttggtagttt tggaagattt attcgaccgt ttgtcgaatg 540 cggggcaacc gctggaagag tcgctgcgga tcgcgatgat cctgcgcagc ttgcccgact 600 cctacggaac attggtgacg gccctagaga gtcgtgccga tgccgatatc accatgcaac 660 tggtaaaatc gaagctactg gatgaattcg agaggcggaa ggaacgatcc ggtgattcgt 720 ttgatatgaa agcgatgaaa agtgcaatgc aaagtgtggg cggtgtttca gaacgagtat 780 gctatttctg taaaaagcct ggccatgtgc gtcggaattg ccggctttat ctggcgaagc 840 agaaaagcgg tgaagacgag aagaagccag aaggaaaatc gattgcgaag aaagtgaaaa 900 acgaaacctg cagtgccggt gtgtgcttca tggtggcgga tgatggacag cgtgaatgtt 960 ggttcatcga cagtggagcg agctgccaca tgacaagtga caaaaagttt ttcacgacgt 1020 tcgaagacaa ggctggtcca aatgtgattt tggccgacgg caaagttgct gccacagctg 1080 gatgcggtga aggtgtagtg ttgtgtgtag gtggagatgg aaatacaatt gaagtaaagt 1140 tgcgcgatgt gttgtatgtt ccctccctca ccagtggact tgtgtctgtg gacaagttaa 1200 ccgcgaaagg atttgtggtg aaattccgaa aggatggctg ttccatttgc gacgtcagcg 1260 gtaaggtggt tgttgccgga gagcagactg gatccctgta caagctgaag ctggcggaag 1320 tcgcaaggaa agtggaaggt aagatgcaca atcataattg tcaacatcag tggcataagc 1380 gtttcggaca cagagacccg gcagtattgt cgaagattgt cgatggcaaa ttgggcgtag 1440 gcgtgaaggt cacggattgc ggcatcaggc aaacttgcga gccatgtttg gagggtaagt 1500 tgtcacggaa tccgattccg aaagtagcgg aaagaaaatc aaagcgtgtt ttggagttgg 1560 tccacacaga tctctgtggt ccaatgcgaa cgacgacacc tggtggaaaa cgatttctta 1620 tgacgatgat cgacgatttc agtcgataca cggtccttta tctactggag aagaaatcgg 1680 aggctgctgg taagataaag caatatgtga ggtacgtgga gaacttgttt gggcagaaac 1740 cacaagtgat cagatctgac ggcggtggtg aatacgccaa cgaagagttg cgacgattct 1800 ttgctgaaga aggtattaag gcacagtata cgactgcgta tacgcctcag caaaacggag 1860 tggccgaaag gaagaacata tcgctacagg agatggctac aacgatgctg ttggatgctg 1920 gtttggacaa gcgttactgg ggagaagctt gcgtttctgc tgcatacttg cagaacaggc 1980 ttccgtcccg gtcagtagat acgacaccgt atgagaaatg gtgtggtcgc aagcctgagc 2040 tggggcatct gaaggcatac ggatgtccgg catatgtgca tgttccagga gtgaaacgtg 2100 gcaagttcga gagcaaagcg agaaagttga cgttcatcgg ttattcggag gagcataaag 2160 gctaccggtt cgtagatacc ggaacggata tggtgacgat aagccgagat gcgaaattct 2220 tggaaatgca ggatccgaag tgttcatcac gggagaaagt gaatgaaggt agtgaagttg 2280 agtggttctc gggggcacaa cctaccgaag tggacaattt cgatggatcc gagttggaca 2340 aaagtgaaga agaagactat tttgatttgg acaatacggc aggagaacaa gatccgttgc 2400 aagtgaaaga agaaattcaa gattctagtg cggacgaaaa tcctgaagac atgcgtcgat 2460 ccaggaggag taatcgaggt gtcccgccta gtcatctgtc ggaatacgtt gtaggcatag 2520 caagagctga cggcaggagc gcaagaagtc aggaggagat attccgagga atgcagcgag 2580 gatcagcatt gaggaggag 2599 // ID CR1-26_CQ repbase; DNA; INV; 4189 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-26_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4189 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 30-30 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 395..4114 FT /product="CR1-26_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MGRISILGTPNPTDLGPPAVEVHPRRPGSQVRSDVGV FT SQPATTGKYSPLRTLQSLQRTPIFSLDNTRITSSEIPLTSSGAAVPSPSLS FT PVQSTRLPCESTSPGRAIESPMGASCPPDTATSSPIYHFGTFSARRCHQSR FT PGAGVGAEAEVSQMAHLGTYMSCSEHLPSPNNHPHSSAGFVSPERAHQHLA FT MEFQAAAAHLAVSTTPGRVYLSTMGDPGPPATVEPPCHRRLFSRPGPVCSE FT GNGVSQTASSGKYTSNESAVLSGSDLSLDSSTFPDAAAAVVQTEPVERRFL FT TVYYQNVRGLRTKTNETFLALCRCDYDVIVFTETWLTEAIKDAELTKNYKL FT FRCDRNPESSSLLRGGGVLIAAKAKLECKAVELLNCESLEQTVIRLTLPLQ FT TLFVCCVYIRPVSDPATYATHAAAVQQVIDLSKESDSVLVVGDYNLPNLIW FT YHDDDTDSLLPVNASSEQEVTLVESMLPLGLTQVNDLPNAFGKLLDLVFVS FT DNVSVELFEPPCPLLKVDQHHNPLLMQLEYRTALNDNTVEPDAVFDFKRCD FT YTTLNERILAVDWLTFLDAPCVNSVTEMFYEKLFEIFNEVVPRKSVRICHR FT YKQPWWNDDLRLLRNRLRKATSRFLSTKTAWDKVVARNIEEEYLNLQQESF FT QNYINDLQSTVKSNPSKFWTYVNAKKQTEQTPLDVSYRDVNSSSPEESASF FT FADFFKSVYNTNSTVYPDSHLEGAPTSNISIPRPTLADDDILKVLSSVDPS FT KGPGPDGLPPVFVKSCANSLVVPVSVIFNKSLEHAIFPDIWKLASISPIHK FT SGNVSKVENYRPISILSCLAKVLEKVVYDRLFPAVRRIISANQHGFMRNRS FT TTTNLLSFVSPTINVLESGGQVDAVYIDFEKAFDKVPHTLTIKKLKKLGLP FT DWIVAWLHSYLTSRKAFVNLRVTRSDIFDIPSGVPQGSHLGPLIFILFVNE FT LNTLTGSSTLMYADDLKLYRTVKSHVDCLALQADVDTLLHWCDRNGMTAHA FT KKCKVLSFSRSRNPIISDYSMNGQQLDRVETFKDLGVVVDSKLRFNQHIAL FT TTAKAFAMLGFLKRSTKHFDDPYSLKSLFCSLVRSVLEYAVVVWAPFHETQ FT VTRIERVQRAFVRYAFRKLRWNDPLRLPPYEQRCALFNLPTLARRRVFIQR FT LLVFDVLRDNIDCSSIRELVHFYVPARQGLRNNPPLLSVPRHRTVYGHHNP FT LDVCLRKFNDVHSCFDFIVSKNVFKSRISS" XX SQ Sequence 4189 BP; 1042 A; 1160 C; 967 G; 1018 T; 2 other; taaccagcat aacggtagaa ccaaggcccc aacggttctg gttgtacctt accggactga 60 atccaaaaat caacaccaac gacgtcaagt tgatcgtgga acgttgcctt ggaacaagct 120 tggatcttga tcctaagctc ctggtcccga aggataccga ctgctcgcgt tacagattcg 180 tttcgtatcg aatcggcttg gatcccaaac tccgagataa ggcccttgat ggaaacaatt 240 ggccaagaaa cctcggtgtc agagagttcg agtttttcga cacggattca aaaaaccgat 300 caacgcctcg tgtctctcca gcagaatagt ggtggatcga gacctcggct acgcatcacc 360 tctacaaagt tcccgctaca tcttccagtc gaccatgggg cgcataagca ttttgggaac 420 ccccaatcct accgacctcg gcccacccgc tgttgaagtt catccccgtc gcccgggctc 480 acaggtccgt agtgatgtgg gggtctccca gcctgctacc acaggcaagt actcgcctct 540 tcgaacactc caatcgcttc aaagaactcc aattttcagt ctcgacaaca ctcgcattac 600 ttcctcggaa atcccgctta ccagctcagg cgctgctgtg ccttcaccca gtttgtcgcc 660 ggttcagtca actcgtctgc cgtgtgagag cacatcacca ggacgcgcca tagaaagccc 720 tatgggagcc tcttgcccgc ccgacactgc aacctcttcg ccaatctacc actttggaac 780 attttctgct cgccgctgcc atcagagtcg tcctggtgca ggtgtcggcg ccgaggcaga 840 ggtctcccaa atggcacacc ttggtacgta catgtcttgc tctgaacatc tgccctcgcc 900 aaacaatcat ccgcattcca gcgctggctt cgtttcacct gaacgagcac atcaacatct 960 cgcaatggag tttcaagcgg cagctgctca tcttgcagtt tcaaccacac cggggcgcgt 1020 gtacctaagc actatgggag atcctggacc tcctgccaca gtcgagccac cgtgtcatcg 1080 tcgtctcttc agccgccctg gtcctgtgtg cagtgaagga aatggggtct cccagaccgc 1140 cagctcaggc aagtacacgt ctaacgaaag cgcagtgtta tcgggctcag atctctcctt 1200 ggattccagc acctttcctg atgctgctgc tgctgtagtt caaaccgaac cggtggagcg 1260 ccgcttcctg acagtatact accagaacgt gagaggtctc agaacaaaaa ccaacgaaac 1320 atttctggca ctatgtcgct gcgactacga cgttatcgtt ttcaccgaaa cctggctcac 1380 ggaggcgatc aaagatgccg agttaacaaa aaactacaag ctgtttcgat gcgaccgcaa 1440 cccagaaagc agttctctac ttcgaggagg aggggtcctt atcgcagcta aagctaagct 1500 cgagtgtaaa gctgtcgagc tactcaattg cgaaagcttg gaacagaccg taattcgcct 1560 gaccctgcca ctgcaaacgt tgttcgtgtg ctgtgtctac atcagacccg ttagcgatcc 1620 cgccacctac gccacccacg ccgccgccgt gcagcaagtc atcgaccttt ccaaggaatc 1680 ggactctgtc ttagttgttg gcgattacaa cctcccgaat ctaatctggt accatgacga 1740 cgacacggac agcctccttc ctgtaaacgc ttcttcagag caggaagtca cgctagttga 1800 gtcgatgttg ccgctgggtc taacacaagt aaacgacttg cccaacgcgt ttggtaagct 1860 gctggatctg gtatttgtca gtgacaacgt atcggtcgag ttgttcgagc ctccatgccc 1920 tcttttgaaa gttgatcagc accacaaccc gcttcttatg cagctggagt acagaactgc 1980 gttgaacgac aacactgttg aacccgatgc tgtctttgac ttcaaaaggt gtgattacac 2040 gactttgaac gagagaatcc ttgctgtcga ttggctgacc tttctggatg ctccctgtgt 2100 caactctgtg actgaaatgt tctacgaaaa actctttgaa atcttcaacg aggttgtacc 2160 aaggaagtca gtacgcatct gccaccgcta caagcagccc tggtggaacg acgatttgcg 2220 cttgctccga aatcgtctgc ggaaagccac tagcaggttt ctatcaacta aaacagcgtg 2280 ggacaaagtc gtcgcgcgta acatcgaaga agagtacctg aatcttcaac aagagagctt 2340 ccagaactac atcaatgatc tgcaaagtac cgtgaagagc aacccgtcaa aattctggac 2400 ctatgtgaac gcaaaaaagc agactgaaca aactccgctg gacgtgtcct accgtgatgt 2460 caacagttcg tcgcccgaag agtcagccag tttttttgct gatttcttca aatctgtata 2520 taacacaaac tctacggtgt acccggacag ccacctggag ggggctccta cctctaacat 2580 ctcaatccct cgcccaaccc ttgcagatga tgacatcctc aaggttcttt cgagcgtcga 2640 tcctagcaaa ggtcctggtc ccgatggtct cccaccagta ttcgtgaaaa gctgtgcaaa 2700 ctcgcttgtt gtgccggtct cggtgatctt caacaaatcg ctggaacatg caatcttccc 2760 cgacatctgg aaacttgcgt cgattagtcc aatccataaa tccggtaacg taagcaaggt 2820 cgaaaactac cggcctattt ctatcctaag ctgtttagca aaagtgctag aaaaggtcgt 2880 ttacgacagg ctgttcccag cagtgcgacg gattatctcg gcaaaccaac acgggttcat 2940 gcggaaccgg tcaactacca cgaaccttct ttccttcgtc tcacctacca tcaacgtgtt 3000 ggagagtgga ggccaggtgg acgcagtgta tatcgacttc gagaaggcgt tcgacaaggt 3060 gccgcacacg ctgaccatca agaagctgaa aaaactgggc ctcccggact ggatcgtcgc 3120 ttggcttcac tcctatttga cgagcaggaa ggctttcgtt aatctccggg taactcgttc 3180 cgatatcttc gatattcctt ctggcgtccc gcaaggtagc cacctggggc cgctcatttt 3240 catactgttt gtgaatgagc tcaacacttt gacgggttca agtacgctta tgtacgccga 3300 cgatcttaaa ctctaccgaa ctgtmaaatc tcacgtcgac tgcctcgctc tacaagcaga 3360 tgtggacacc ttgctgcact ggtgcgaccg gaacgggatg acwgcgcacg caaagaagtg 3420 taaggttctg tcattcagcc gttcgaggaa ccctataatc tcggactaca gcatgaatgg 3480 acaacaactc gatcgtgtgg aaaccttcaa ggacctggga gtagttgttg acagtaagtt 3540 gaggttcaac cagcacattg ccctaacgac cgccaaagct tttgccatgt tgggcttcct 3600 gaaaaggagt actaagcact tcgacgatcc ttactcgcta aaatcgctgt tttgctcgtt 3660 agtaagaagt gttttggagt acgcagtcgt tgtatgggcc ccgttccacg aaactcaagt 3720 caccagaatc gaaagagtac agcgggcttt cgtacggtac gccttccgga aacttcgctg 3780 gaacgatccc ctgcggttgc caccgtatga gcaacgttgt gcgctgttca acctgcctac 3840 gctcgcccgc cgccgtgttt ttatccagcg tctactggtc ttcgacgttc ttagggacaa 3900 cattgactgt agcagcatcc gtgaactggt ccatttctac gttccagcac gtcaaggact 3960 caggaacaac ccaccactgc ttagtgtccc gcggcaccga acggtttatg gacatcacaa 4020 cccgttagat gtttgtttga gaaagtttaa tgacgttcat tcatgttttg attttattgt 4080 gtctaaaaat gtgtttaaat ctagaattag ttcatagttt ttttaagaaa gtagcctgtg 4140 catcatattg atggaggtgt cgacaaataa ataaataaat aaaaaaaaa 4189 // ID CR1-11_CQ repbase; DNA; INV; 2423 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-11_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2423 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 13-13 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 3..2402 FT /product="CR1-11_CQ_1p" FT /note="reverse transcriptase." FT /translation="HSHDGFLYPDPRLQAKSVNITDLLDVYSTATLEQINS FT FPNENNNTLDLCFVSRQDVAPPIYVAPAPLVKTVPHHQPLILSLDVLRYPA FT RTVTFEPIRYDYRKADINGIAEILTSIDWTNTLDPDDVNVAVQTFSNILSY FT AIDRHVPKKAILPPDRIPWQTNELKKLKSKKRAALKKFSRRRTLPLRDHYL FT RLNQRYKTLNNHCYNRYRYRIQQKLKSNPKAFWKHFNDQRKESGLPSSMFL FT DNETASNPDTICQLFAEKFSRMFINETLSPHEVAAATENVPPCGNVLDRLV FT LDDQRIMNAAAKLKSTSSVGPDGIPSSILKKCIQCLLSPLRLLFQHSLDKG FT VFPDLWKTAFMFPVYKKGNKRDIDNYRGISALCAVSKLFELAVLDLVFFHC FT KQQICAEQHGFMPGRSTTSNLLTFTTYLTKGLASRSQTDVIYTDLSAAFDR FT LNHDIAIAKLKKLGFSGSLLGWFQSYLSRRTISVKIGECVSLLFLAYSGVA FT QGSHLGPLIFLLYFNDSNFVLDGPRLAYADDLKIFRYVNSTDDAELLQQQL FT DQFADWCATNRMTLNPQKCSVISFTRKLRPITHNYTLLGTTVPRVECINDL FT GVLLDAKLTYKNHVAYIVAKASRLLGFIFRATKKFTDVYCLKSLYCGLVRS FT TLEYSSVVWNPWYQNSSERIESVQRRFIRFALRLLPWSDPHRLPSYESRCQ FT LIHLDSLAVRRNAARVTAVADLLTSRIDCPSLLGELNLQARSRVLRGGAFF FT RIPLEAANYSAHGSIIGLQRTFNRVASIFDFNVSRDVLKKQFLEFFRCN" XX SQ Sequence 2423 BP; 623 A; 698 C; 519 G; 579 T; 4 other; tgcactctca cgatggattc ctgtatcctg atccacgcct ccaggckaag tccgttaaca 60 tcaccgatct gttggacgtc tacagtaccg ccacgctgga gcagataaat agcttcccaa 120 acgagaacaa caatacgctt gacctctgct tcgtcagccg gcaggacgtt gcgcctccaa 180 tatatgtcgc cccagckcct ctagttaaga ccgttcccca tcaccagccc ctgatcttgt 240 ccctkgacgt actccgctat cccgcacgca ctgtgacatt cgaaccgatc cggtacgatt 300 accgcaaggc cgacattaac ggaatcgctg agatattaac gagcattgac tggactaata 360 cgctagaccc agacgatgtt aatgtggcgg tgcagacctt ttctaacatt ctctcgtacg 420 caattgaccg tcacgtaccc aaaaaagcaa tcctgccccc cgatcgcatc ccgtggcaga 480 ccaacgagtt gaagaagctg aaatccaaaa aacgagccgc tctcaaaaag ttttcccgaa 540 ggcgcacact accactcaga gaccactacc ttcgactgaa ccaacggtac aagaccctga 600 acaatcactg ttacaaccgc tatcgatacc ggatccaaca aaaactgaaa tctaatccga 660 aggccttctg gaaacacttc aatgaccaga ggaaagaaag tggtttgcct tcctctatgt 720 tcctcgacaa cgagaccgct tcaaatcccg ataccatctg tcaactgttt gcagagaaat 780 tttccagaat gttcatcaat gaaacccttt caccgcacga ggtcgcagcc gccactgaaa 840 atgtgccacc ctgcggcaac gttctggacc gactcgtact cgacgaccaa cgaatcatga 900 acgcagctgc caaattgaaa tcaacctcat cagtaggacc tgacggaatc ccctccagta 960 tcctgaaaaa gtgcatccaa tgcctgctgt ccccactacg attactgttc cagcactctc 1020 tcgacaaagg ggtttttccg gacctgtgga aaactgcttt catgtttcca gtctataaaa 1080 agggaaacaa acgtgacatc gacaactacc gggggatctc ggcgctttgc gctgtctcca 1140 aactgttcga gctcgctgtg ctcgacctgg tgttttttca ctgcaagcaa cagatctgcg 1200 ctgagcaaca tggcttcatg ccgggacgtt cgactacatc gaacttgctg accttcacca 1260 cctacctgac caaggggcta gctagtcgaa gccaaaccga cgtaatctat actgatctct 1320 ccgcagcctt cgacaggctc aaccatgaca tcgcgatcgc taagctcaaa aaactagggt 1380 ttagtggcag tttgctcggt tggtttcagt cctatctttc cagacgaaca atcagtgtga 1440 agatcggtga atgcgtctcg ctgctattcc tcgcctactc cggtgtagcg caaggaagtc 1500 acttggggcc tcttatattc ttgctctact ttaacgacag caattttgta ctggatggac 1560 caagactcgc ttatgccgac gacttgaaga tcttccggta cgtgaactca acggatgacg 1620 ctgaactact tcaacagcag ctggatcaat tcgccgactg gtgtgccaca aaccgcatga 1680 ccttaaatcc tcagaagtgt tccgttatat cgttcaccag gaagctcagg ccgatcacgc 1740 acaactacac gctcctcgga acaaccgtcc cgcgggtcga atgcataaac gatctcggag 1800 tgctgctaga cgcgaagctc acgtacaaga atcacgtcgc atacatcgtt gcaaaagcct 1860 caagactgct tggtttcata tttcgtgcga ccaaaaagtt taccgacgtc tactgcctga 1920 agtcattgta ctgcggcctt gttcggtcaa ccctggaata cagttccgtg gtgtggaatc 1980 cctggtacca gaatagctcg gagaggatcg agagcgtgca gcgacgattc attcgtttcg 2040 cactccgctt gcttccatgg agtgaccccc accgtctccc aagctacgaa agcagatgtc 2100 agctaatcca tctggactct ttggctgttc gtcgcaacgc cgcacgtgta accgctgttg 2160 ccgacctgct cacatcaagg attgattgcc cctccctact cggtgagctg aatctgcaag 2220 ctagatctcg tgtgctgcgc ggaggtgcct tcttccgaat cccactwgaa gctgccaact 2280 acagtgcaca cggctccatc atcggacttc aacggacctt caatcgagta gcctcaattt 2340 tcgattttaa tgtgtcacgt gatgttttga aaaagcaatt tttagagttt tttagatgta 2400 attaaagtat tattgtgtaa cat 2423 // ID RTE_Ele2C_AAe repbase; DNA; INV; 3397 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An RTE non-LTR retrotransposon from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW RTE_Ele2C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3397 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1443-1443 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. The consensus is ~85% identical to RTE_Ele2 and CC RTE_Ele2B_AAe, and ~77% to RTE_Ele5. XX FH Key Location/Qualifiers FT CDS 334..3372 FT /product="RTE_Ele2C_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="GSIPNSLSTTKNGERRRNERYFRQPTWQRNKDYDWKL FT GTWNVRTLNEPGRVSLLARELRKVGVCVAAIQEVRWLRSGEREFRAVDPVA FT NTAFKYNIYYSGGDKAEHGVGFIVIGKQMKRVIRWRPINERICVLRIRGKF FT FNYSLINVYAPTNDKPDDVKDAFYECLDKTYGECPKHDVKIVIGDANAQVG FT KEDFFRPIIGKESLHSVTNDNGLRLVNFAATRGMAISSTYFARKDIRKHTW FT QHPNGELCNQIDHVLVDGRHFSDVIDVRTFRGPNIDSDHYLVVSKIRARLS FT TVSSSRNRQSLRFNIQRLSADGVAADYHQKLDERISEIDESVNLSDLWESV FT HGAVSTVAREVVGTAQRRPRNGWFDEECQRMTNEKNMARSRMLVSGTRQSR FT ERYKEARAAEKRIHRRKKKEHEEAVIAQAQEAMEENNMRRFYESVNGVRRK FT TAPSPAMCNDREGNLLTDKTMVAARWKEYFESLLNGNNGSGSGSRIQIDDD FT GQAVEPPTLDEVKKAINGLKNNKAAGKDELPAELLKHGSEQLHELLHRIIS FT RIWEEEQMPTSWLEGLISPLYKKGHRLECANYRGITLLNSAYKIMSGVLFN FT RLRPYEESFVGEYQAGFREGRSTTDQMFTLRQILDKFREYNLQTHHLFVDF FT KAAYDSVKRNELWQIMSEHGFPAKLIRLIRATLDGSKSSVRVVDEISSSFV FT TLDGLKQGDALSNLLFNIALEGAIRRAGVQRSGTIITRSHMLLGFADDIDI FT IGIDRRAVEEVFVPFKRETARIGLTINTTKTKYMIAGGQRGSGRVSGSEMV FT LGGDKFEVVEEFVYLGTLVTCDNDVTREVKRRIAAASRAFYGLRNQLKSRS FT LQTKTKLALYKTLILPVVLYGHESWTLKEVDRRAFGVFERKVLRTILGGKL FT ENGIWRRRMNHELYQVYKEVDIVKRIKHGRLRWAGHVARMPEERQAKIIFN FT REPGRGRRLRGRPRTRWLFAVEEDLRALNVQGDWKRLAQDRAQWRRLIHSA FT QIHRSEL" XX SQ Sequence 3397 BP; 963 A; 755 C; 948 G; 731 T; 0 other; gggctgcaaa atggtgagtc gacaatcagg aaggagcgtc caacacagct ctggtcctca 60 caagttccta cctcgcgctt ccacgggtca aatgatgaca aagaccgcca gctaagggtt 120 gcgtacttag ctggtagtgc agcctgggca ctgttgtcct tctgacatca gctagagtga 180 ggaggtgcgt tctgagcgtc tgtacaccag gaggtgcggc tcaaacagcg tctgttctgg 240 tatccagcgg ctgagtacga aacgctgtat cacgtcagct acacctaaga tggcagtccc 300 atcaacgtga tgtaggtagc gcgaccccgg taaggtagca taccgaattc attatccacc 360 acgaaaaatg gagaaagaag aagaaacgaa cgttattttc ggcaaccgac ctggcaacga 420 aataaggact atgattggaa actcggtacc tggaacgtca ggaccttaaa tgaacctgga 480 cgagtgagcc ttttggctcg tgaattgcga aaagttggcg tgtgcgtggc tgctattcag 540 gaagtgcgtt ggctaagatc tggagaacgt gaattcagag cggtagatcc cgtcgctaac 600 accgctttca aatacaacat ctattacagc ggtggcgata aagctgaaca cggagtcggt 660 ttcatagtga tcgggaagca gatgaagcgc gttattaggt ggaggccgat caacgagcgg 720 atctgcgtat tgaggattcg gggcaagttc ttcaactaca gcctgatcaa cgtatatgca 780 ccgactaacg acaaacccga tgacgtgaag gacgcgtttt acgaatgtct agacaagacc 840 tatggagaat gcccaaaaca cgacgtgaaa attgtcatcg gcgacgctaa tgcgcaggtc 900 ggaaaagagg acttcttccg ccctatcatt ggtaaagaga gccttcactc tgttaccaac 960 gacaacggcc tacgtttagt gaactttgct gccaccaggg ggatggccat cagtagcact 1020 tactttgcac gcaaggatat ccgcaagcac acctggcaac acccaaatgg cgaactttgc 1080 aaccaaatcg accatgttct ggtagacggc cgacattttt ccgacgtcat cgatgttaga 1140 actttcaggg gtcctaatat cgactcagac cactatctcg ttgtaagcaa aattcgagcg 1200 cgattatcaa ccgtctcgag ttctagaaat cgacaatcgt tgcgtttcaa tatccaacgc 1260 ctgtcagcag atggtgtagc agcggactac catcaaaagc tcgacgagcg gattagcgaa 1320 atcgacgaaa gcgtcaacct cagcgatctg tgggagtcag tccacggagc agtgagcaca 1380 gtagcgcgag aagtggtagg tactgctcaa cgaagaccaa ggaacggttg gttcgacgag 1440 gagtgccaga gaatgacgaa cgagaagaac atggctagaa gccggatgct ggtgtctggt 1500 acccgtcaga gcagagagcg gtacaaggaa gcaagggcag ccgaaaaacg gatccatcgc 1560 agaaagaaaa aggagcatga agaggcagta attgctcagg cgcaagaagc tatggaagag 1620 aacaacatgc gacggttcta cgagtccgta aatggtgtgc ggagaaaaac agcgccgtct 1680 cccgccatgt gcaacgaccg cgaaggaaac ttgctgacgg ataagacaat ggtggccgcc 1740 aggtggaaag agtactttga gtcattgttg aatggaaata atggaagtgg atctggtagc 1800 agaattcaaa tcgatgacga tggacaggct gtggaacctc caacgctaga tgaggtaaaa 1860 aaagctatca atgggctgaa gaacaacaag gctgctggga aggacgagct cccggccgaa 1920 cttctcaaac acggaagcga gcagctgcac gaactccttc accgtatcat atcgaggata 1980 tgggaggaag aacaaatgcc tactagttgg ttggaaggtc tcatttcccc tttgtacaag 2040 aaagggcatc gactggagtg cgccaattac cgagggataa cactccttaa ttcggcgtac 2100 aaaatcatgt ctggagttct gttcaacaga ttgagaccgt atgaggagtc ctttgtcggc 2160 gaataccaag ctggttttcg agagggccga tcaacgacgg atcaaatgtt taccctgcgt 2220 caaatcctag ataaattccg ggagtacaac ttgcagactc atcatctgtt tgtagatttc 2280 aaagcagcgt acgattcagt gaagagaaac gagttgtggc aaattatgtc cgaacatggc 2340 tttccggcga agctgattag actgattcgt gcaacgcttg atggatcaaa atcaagtgtg 2400 cgggtggtgg acgagatttc atcatcgttc gtaaccttag atggattgaa acagggtgac 2460 gctctttcta acttgctgtt taacatagcg ctcgaaggtg ctatcaggag agccggtgtg 2520 cagagaagcg gtaccattat cacacgttct catatgctcc ttggcttcgc ggacgatatc 2580 gacatcatcg ggattgaccg tcgggcagtg gaagaggtgt tcgtgccttt caagagggag 2640 acagcgagaa tcgggctcac gatcaacacc acaaaaacga agtacatgat agctggtggt 2700 caacgtgggt ccggacgtgt tagtggtagc gaaatggtgc taggtggtga taagtttgaa 2760 gtggtggaag aatttgtgta tcttggaaca ctagtgacat gcgataatga tgttacccgc 2820 gaggtgaaaa gacgtattgc agctgcgagt cgggctttct acgggctccg taaccagctg 2880 aagtcccgta gcctgcaaac gaaaacaaaa ctcgcgttat acaagacact gatccttccg 2940 gttgtccttt atggccatga atcatggaca ttgaaggaag tcgaccggag agctttcggg 3000 gtgtttgaac gtaaagtgct gcgaacaata ctcggcggta aactagaaaa cggcatctgg 3060 cggcgtcgca tgaatcacga gttgtaccaa gtgtataaag aggtggatat tgtcaagcgc 3120 ataaaacacg gcaggctgcg ttgggctggt cacgttgccc gtatgccgga agaacgacaa 3180 gcaaagataa tattcaacag agaacccgga cgaggccgcc gacttcgtgg taggccgcgc 3240 acacgatggc tttttgcggt tgaggaggac ttaagggcac ttaacgttca gggcgactgg 3300 aagcgattgg cccaggaccg agcccagtgg agaagactca tccattcggc gcagattcat 3360 cgtagcgaat tgtagcccat caagtatcaa gtaagta 3397 // ID BEL-22_CQ-I repbase; DNA; INV; 2508 BP. XX AC AAWU01010298; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-22_CQ_; KW BEL-22_CQ-LTR; BEL-22_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2508 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 197-197 (2011). XX DR GenBank; AAWU01010298; Positions 1368 3875. XX CC 'GGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..2377 FT /product="BEL-22_CQ-I_1p" FT /translation="MDPERLKSLTNKRSVILAKVKWELSVANAIKTRNPSL FT GEVAERRDKLTDLARCFEGIQTEIEEATPNLEEVITVFNHRILFEEAYFQI FT KDLYTEYLDQHVEEPEDRSENRQDNDLRDAIRALLESQQQMLLAQCQKPTP FT QSLALAAGSQVSNSVPQNVVKLPQIDIPKFTGERKHWRSFKDLFECTIHNR FT TDLRDSVKMQYLFSYLDGEAKGKVDSFSINEDNYREAWDALVTFYDKKKYT FT VFALVREFVDQQQVTSSNGLKKLVATSDDVVRQLKALGREYESRDPWLIHL FT LLEKLDRETRSLWAQRIINIENPTFAAFLEFLQQRCDALETCLAFSKKPAS FT DSTKKGRHVLHTTAVASCAKCSESHPTYHCEQFKAMDVDGRRELVLSQKLC FT FNCLKPSHTTSKCPSKSTCHKPDCNQRHHTMLCYQQQLQGYEPEETDHDLC FT PQLPEEAEEVWSFSANAAQTKANKHGPSTAALLPTVVVNLQGKDGKLHQVR FT CLVDSGSQASLITEACVKRIGMKRTKVSLEVSGVNGEIVGNTAGAVTLVMS FT SRFNGEAKLTTQAYVLGRLTATLPNQRFSVADLPFLEGLELADPDFNSPSE FT MDVILGTDVFLSILRAGQVHNQQGIPVAQRSIFGWMVAGKITNPCRISTHI FT AVLNERTPTITTTHNNLDGSITKRPKKNGIDVTPYNVALLNLAVSSTDPQA FT CIVLFATRLRLPGLLSVKVWARGEGTEYLYLRSDRRTRSVPVQTVQSPPCT FT HSWTATQARSSSQGNWSWSRITAHHHQPGNWPGS" XX SQ Sequence 2508 BP; 635 A; 667 C; 728 G; 478 T; 0 other; tttggtcctt cgcggcacgg atatggatcc agagcggttg aagtccctca ccaacaagcg 60 aagcgtcatc ctcgcgaagg tgaagtggga gttgtccgtc gcgaatgcca tcaaaacccg 120 caatccgtcc ctcggcgaag tggcggaacg gcgggacaag ttgaccgacc tcgctcggtg 180 tttcgaaggc attcagaccg agatcgagga ggccacgccg aacctggagg aagtgatcac 240 ggtgttcaac caccgaatcc tgttcgagga ggcgtacttc cagatcaagg acctttacac 300 ggagtacttg gaccaacatg tggaagagcc agaagacaga tcggaaaacc ggcaggacaa 360 cgatttgcgg gacgccatca gggcgctcct ggaatcgcag cagcagatgc ttctcgcgca 420 gtgccagaaa ccaacaccgc aatcgttggc gctagcggcg gggagccagg tatcgaactc 480 ggtaccccaa aacgtcgtca agcttccaca gatcgacatc ccaaagttca ccggggagcg 540 caaacattgg cgctcattca aggacctctt cgagtgcaca atccacaacc ggaccgacct 600 gcgggattcg gtcaagatgc agtacttgtt ttcatacctg gatggggagg cgaaggggaa 660 ggttgactcg ttttcgatta acgaggacaa ctaccgcgag gcgtgggatg cactcgtgac 720 gttttacgac aaaaagaaat acacggtgtt tgccctcgtt cgggagttcg ttgaccagca 780 acaagtcacg agctcgaacg ggttgaagaa actcgtagca acatccgatg acgtcgttcg 840 ccagctgaag gcactgggaa gggagtacga gtcacgggac ccgtggctca tccatttgct 900 gcttgagaag ctggaccgag agacgcgctc actgtgggcg caaaggatca tcaacatcga 960 gaacccaact tttgccgcgt ttttggagtt tttgcagcag cgatgcgatg ctttggagac 1020 atgtttggcc tttagcaaga agccggcctc cgattcgacg aagaagggga ggcacgtttt 1080 gcacacaacc gctgttgcga gctgtgcgaa gtgcagcgag agtcatccga cgtaccactg 1140 cgagcagttc aaggccatgg acgtggacgg aaggcgggaa ctagttcttt ctcaaaagct 1200 gtgcttcaac tgccttaagc cgtcccacac aacgagcaag tgcccttcga agtcaacgtg 1260 ccacaaaccc gattgcaatc aacggcacca cacgatgctg tgctatcagc agcagcttca 1320 aggctatgaa ccggaagaga cggaccacga cttatgtcca caacttccgg aggaggcgga 1380 agaggtctgg tcgttctcag cgaacgcagc ccaaacgaag gcgaacaagc acggcccgtc 1440 gacggcagca ctgttgccta cggtggtcgt gaacctgcag ggaaaggacg gcaagttgca 1500 ccaggtgcga tgcctcgtcg acagtggatc acaagcatcc ttgatcacgg aggcatgcgt 1560 caagcgcatt ggaatgaagc gcacgaaggt ctctctggaa gtatctggag taaacgggga 1620 aattgtcggc aacacggccg gcgcagttac gctggtgatg tcttcgcgtt tcaacggaga 1680 agccaagctc accacgcaag cctacgtgtt gggaaggctg acggcaaccc tgccgaacca 1740 gcgcttcagc gtagcggatc tgcccttcct ggaggggctg gaactagcgg atccggactt 1800 caacagtcca agcgagatgg atgtgattct cggaacagat gtcttcttgt ccatcctgcg 1860 agcaggacag gtccacaatc aacaaggaat ccccgtagca cagcgttcta tcttcggttg 1920 gatggtggcg ggtaagatca cgaatccgtg tcggataagt actcacatcg cggttctcaa 1980 cgagaggact cccacaatca cgacgacaca taacaacctg gacggatcaa ttacgaagcg 2040 accgaagaaa aacggaatcg acgtaacacc ttataatgtg gccctcttaa accttgcggt 2100 ttcgtcaacg gatccacagg cgtgtatcgt tctttttgcg acaagactcc gcctcccggg 2160 tctcctaagt gtgaaggtat gggcacgggg agagggcacc gaatacctat atttacgaag 2220 cgaccgaaga acaagaagcg ttccggtgca gacggtgcaa tcgccaccat gtacccattc 2280 ttggacggcg acgcaagcaa ggtcatcaag ccagggaaac tggtcctggt caagaataac 2340 agcacaccac catcagcctg ggaactggcc cgggtcgtag cagttcatcc ggatcgagca 2400 gggctggttc gggaagtgac gctgcgtcga ggaatgttcg agtacttgtg ctcggcacag 2460 aagatctgtc cgcttccgaa ttgagacgct gtctcaaggc ggggagta 2508 // ID SMAR32 repbase; DNA; INV; 1286 BP. XX AC . XX DT 22-JAN-2008 (Rel. 13.01, Created) DT 22-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR32. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1286 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(1), 20-20 (2008). XX DR [1] (Consensus) XX CC >10% divergent from consensus. Several hundred copies. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 183..968 FT /product="SMAR32_1p" FT /translation="MYKLENHDLRLLYLYEWKSKNNSVEAERNINNAFGSD FT TVNERTIRRWFKKFDDGEIDLENMPRGRPNIKLSEEDLREAVAQNSTKSVR FT ELAVDLSVSKTTVHDNLKTIGMVKKLQKWVPHELTELQKLNRLQICISLQA FT RHRKSSFLNNIITCDEKWIQYDNRKRSGEWLDKHQCPNQFPKPELHQKKVM FT LSVWWSKYGVIHYCFLKMGTTINSERYCVELEKMHEKLTISHPAIVNRKGP FT IILHDNARPHVSKVTIKNYTT" XX SQ Sequence 1286 BP; 461 A; 201 C; 219 G; 405 T; 0 other; tattaggttg tccggaatga aatgtccgaa taatatataa taattttaaa tcatcataac 60 ttttttttat ataaaacgta tttctttatt ttaaagttca ttagaaagcg cttatttcaa 120 attcaaatat taagtaagat ttataataat ttaattacta taaatttcat tttcaaatag 180 aaatgtacaa gttagaaaat catgatttga gacttcttta tttatatgag tggaaatcga 240 agaacaattc agttgaggcg gagaggaata ttaataatgc ttttgggagc gatacagtta 300 atgaaagaac tattaggcgt tggtttaaaa aatttgatga cggagaaatt gatctcgaaa 360 atatgcctcg aggtagaccc aatatcaagc tgtcggagga agatttacgt gaagctgttg 420 ctcaaaattc aacaaaatct gttcgagaac ttgctgtaga tctgtctgtc tcaaaaacaa 480 ccgttcatga caatttaaag acaattggta tggtcaaaaa gcttcaaaaa tgggtcccac 540 atgaattgac tgaattgcag aaattaaaca gattacaaat ttgcatatct cttcaagcac 600 gacatcgaaa gtcatctttt cttaataata ttattacctg cgatgaaaaa tggattcaat 660 atgataatcg taagcggtca ggagaatggt tggataaaca tcaatgccca aatcagtttc 720 caaaaccgga acttcaccaa aagaaagtca tgctctcagt ctggtggtct aagtatggtg 780 taatacatta ttgttttctg aaaatgggta caacaatcaa ttctgaacga tactgtgtag 840 aactagaaaa aatgcatgaa aaactcacca ttagtcaccc tgcaattgtt aatagaaaag 900 gtccaattat cctgcatgat aacgctaggc ctcacgtatc gaaagtcact ataaaaaatt 960 acacgactta ggatttgaag ttttaccgca tccaccttac tcacccgatc tatcaccaac 1020 tgatttccac ttatttagac atttaagtct ttatatcaag ggcaggaagt tcaaagattt 1080 ggaggagata aaatcgacag ttatcacttt cttggacttc agacatgaga cttttttaaa 1140 ggtatagaaa agctcccgca acgatggcaa gagtgtatta atgttgatgg atcatatttt 1200 gattaaataa taaatgttta aatatatttg tacgtctttt gaattataat ccatttagcg 1260 gacatttcat tccggacaac ctaata 1286 // ID Gypsy-33-LTR_NVi repbase; DNA; INV; 1084 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-33-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1084 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 1003-1003 (2009). XX DR [1] (Consensus) XX SQ Sequence 1084 BP; 362 A; 224 C; 195 G; 303 T; 0 other; tgtaacaccc cgtacaagct ccgcaaaaaa atttataaat attaaatact aaattatatg 60 aaacaacatt ttagattata caagagcatc ttatgctccc ccactcccaa tacgaagcta 120 cgcgccacag ctgacagctg agctcaaagc agcagtacaa cagcagcgcc acgagccagc 180 cagcggcaac agcagcaagc agcaagaaag cggggagagc gcgcaacacg cgcctatatc 240 tatatagacc gagcggcgag cgcagcggcg caattataat gagagtagga gtagagagcg 300 agcggcggcg cggcgtgcgc attatattat aagtgtgtaa atgatttttg tatataggtg 360 tgaatgagtg tgtgcaagta tgaatgtgat agtttttccg gcggctcgat tcccgctctc 420 tccgtggcct ataaaaggac ccgcacaacc cgtattagct cactagctga agcgcaactt 480 aagcgccagc gcaacttaag cgcctagtct cttctcaaga gcaactcacg ccttgagaag 540 ataaacgact tagaattatt tcgaatattt ctcgaactta taaaatttct gtatctcaaa 600 tacttataaa attgtcgagt caggcacgat atcgtatctt aaagcgtatc agtcctacac 660 ttgtcataaa ttcaaagact tgttcaaata cttataaaaa tttcgaacta aagcagataa 720 cgcaaaattc aagcgtatct acttaaaatc ttatttgtaa aaatcaaagc ggatttattc 780 atatattttg tgacacagaa tatatgcaac aattaatgca atattcttat ccttaaatat 840 aatattgaat atatttattt tgttattgaa aagaaataac atacttttat tattgaaatg 900 aatttccact acctggtatc ccggacctgt cttatatgat aataatacac tactatatat 960 aattactttc ctatttatga tctgttgtca aaattcggac gtagatcagt aaatgattac 1020 tgctcttctc taaaatttaa aatctcatac gcgctacgga ccgagaagag tcgtgaccgt 1080 ttca 1084 // ID Copia-6_CQ-I repbase; DNA; INV; 1839 BP. XX AC AAWU01041312; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_CQ_; KW Copia-6_CQ-LTR; Copia-6_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1839 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 327-327 (2011). XX DR Genome; AAWU01041312; Positions 6230 4392. XX CC 'GACCG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 92..1839 FT /product="Copia-6_CQ-I_1p" FT /translation="METNGVVAGVPLFVGSNYRSWKCRMLAVLDEHELEEC FT IQQEAAEVEELKVKEEDSTAQKEVKLKALEKRKKKEKRCRSFLLSRLDDDH FT VEQVQDKSTPREIWLALAAMYERQSLAKRLHLKQELILLRQGGDTLREHFT FT KFDRIIRELKSTGATIDELDVICHLLLTVNPRFEMVVTSIQTVPERDLSLE FT FVKCRLLDEETRKNSSESVGVKTGEAAFSGTSALKSQQQKKKKAFKCFVCG FT KEGHKAADCPEKKSDETKASARYGSSDDEVCFVGMEKLPELREVSWIVDSG FT SSEHIAKDRELFEELVPLKNPVTIAVAKKGKSIVAEHRGVIRLTSAVDGKT FT IPIALKDVLYTPEASANMLSVRKLDERGLKVVFSSGTVSIKQEERTVAIGK FT QSGRLYLLVLHERGPGRVPEDPEGCCEGGSDEAEKTKPQRGRSKREPKQDK FT KAYTGFALSAADFAPVALSEIEEQPDVSRKKTAGSARKPDGGAAETFLEVC FT VPAATKIKEKLRTEKMRARHRRASGAGHGQGAGETCCEGRPKEDHRGQEGN FT RPGAGGCRGCADSAGADHCDPVASRGTWRLPGLSGG" XX SQ Sequence 1839 BP; 444 A; 392 C; 672 G; 331 T; 0 other; ggttatgagc ccaggttgcg cgagtgccga aaagttgttt ttcgcggcgt tttagtgagt 60 gaagtttgga agttggtgga acggattgag gatggagacc aacggcgttg ttgcgggggt 120 gccactcttc gttggctcga actaccggtc gtggaaatgc cggatgctgg ccgtgctgga 180 cgagcacgag ctggaagaat gcatccagca ggaggcggcg gaagtggagg agctgaaggt 240 gaaggaggag gactcaacgg cgcagaagga ggtaaagttg aaggcgttgg agaagaggaa 300 gaagaaggag aaacggtgcc ggtcgtttct gctttcgcgg ctggatgacg accacgtcga 360 gcaggtccag gacaagtcga cgccgaggga gatttggctg gccctggctg cgatgtacga 420 gcggcaaagt ttggcgaaac gtttgcacct gaagcaggag ctgattctgc tgcgccaggg 480 gggcgacact ctacgggagc acttcacgaa gtttgaccgg atcattcgtg agttgaagtc 540 tactggagcg accatcgatg aactcgacgt catctgtcat ctgctgttga cggtgaatcc 600 gaggttcgag atggtggtga cgagcattca gacggttccg gagcgagatt tatcgctgga 660 attcgtcaag tgccgtttgc tcgacgagga gacgaggaag aactccagcg aatcggttgg 720 tgtgaagacg ggggaagcgg ctttctcggg aacgagtgcg ctgaagtcgc aacagcagaa 780 aaagaagaag gcgttcaagt gttttgtttg cggaaaagag ggccataaag ctgcagactg 840 tccggagaag aagtccgacg agacgaaagc gagcgcacgc tacgggtcga gtgacgacga 900 ggtgtgcttc gtcggcatgg agaagcttcc ggagctgcgg gaggtgagct ggatcgtgga 960 ctctggttca tcggaacaca tcgccaagga tcgtgagttg ttcgaggagc tcgtgccgct 1020 gaagaacccg gtgacaatcg ccgtggcgaa gaaggggaag tcgatcgttg ctgagcatcg 1080 tggcgtgatc cgtctgacct cagccgtgga cggaaagacg atcccgatcg cgctgaagga 1140 cgttctgtac accccggaag ccagcgcgaa catgttgtcg gttcggaaac tggacgagcg 1200 cggactgaag gtggtgttca gttcgggaac agtgagcatc aagcaggagg agagaactgt 1260 ggcgatcgga aagcagtcgg ggaggctgta cctgctggtt ctacacgaac gtggaccagg 1320 tcgcgtgccc gaagatccgg agggatgctg cgagggcgga agtgacgaag cggagaaaac 1380 caaaccacaa cgaggtcgaa gcaagcgaga accgaagcag gacaagaagg cgtacacggg 1440 cttcgccttg agtgcagctg acttcgcccc tgttgcgttg tcggaaatcg aggagcagcc 1500 tgacgtatcg aggaagaaga ctgctggttc tgcgcggaag ccggatggcg gagcggcaga 1560 aacgtttttg gaagtctgcg tccctgcagc aacgaagatc aaggagaagt tgagaacaga 1620 gaagatgcga gcgaggcatc gacgagcgag tggagcagga cacgggcaag gagccggtga 1680 gacctgctgc gaaggaagac cgaaggaaga tcatcgaggt caagaaggta atcggcctgg 1740 agcaggcgga tgtcgtgggt gtgctgattc agcgggggct gatcattgtg atccggtggc 1800 atcgcgcggg acatggcgac tcccaggatt gagcggggg 1839 // ID Harbinger-N17A_BF repbase; DNA; INV; 423 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N17A_BF autonomous DNA transposon - DE consensus. XX KW DNA transposon; Transposable Element; Harbinger-N17_BF; KW Harbinger-N17A_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-423 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-423 RA Kapitonov V. and Jurka J.; RT "Harbinger-N17_BF - a family of autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 809-809 (2008). XX DR [2] (Consensus) XX CC It is a subfamily of Harbinger-N17_BF. XX SQ Sequence 423 BP; 107 A; 93 C; 99 G; 124 T; 0 other; ggccacactg acttaatttt atggatgaca tccgcgcgcg cattgatttt cgcctgttct 60 caaaaaaaaa ctgtggccac atgtcaccca atagcatccc tggtcaatca gaattttatg 120 cttgccagag gatctgctct ccagggtaag ggggcacatg gtcagggcat ggggccacag 180 cgcccttgtt cccctcttta aacaagaatg gcaataaaaa cattattgaa atggcgacga 240 ttgtctctta cagtcttact ggaggtgaac ataagacaaa ccaagcctgg gaaagaatgg 300 actggatgtt tagtcctgtg gtttagcagc attttttttg ctggattttt cttttttttg 360 ccctcgcgca ttagttttgg ggtctccaga ggatgtcatc cataaaatta agtcagtgtg 420 gcc 423 // ID BEL-28_CQ-LTR repbase; DNA; INV; 216 BP. XX AC AAWU01010753; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-28_CQ_; KW BEL-28_CQ-I; BEL-28_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 210-210 (2011). XX DR GenBank; AAWU01010753; Positions 28233 28448. XX SQ Sequence 216 BP; 49 A; 58 C; 48 G; 61 T; 0 other; tgttcgggtg ctttagagca actctcccat ctggcgattt ggcaaccctg ctcatctgac 60 agttcgcgcg caggagacaa acctcgtgca ttcttattca tctatcaaac gcaacggacg 120 caaataaagg agtagtttag tttagtttgt aaacgtggtc ttttcatttc ctccgggatc 180 tctcgcgttc cgtcggccca gtccaaggtc caaaca 216 // ID MuDr-1_BF repbase; DNA; INV; 10738 BP. XX AC ABEP01000344; XX DT 30-MAR-2009 (Rel. 14.03, Created) DT 30-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE MuDr-type DNA transposons from Branchiostoma floridae. XX KW MuDR; DNA transposon; Transposable Element; Ulp1; MuDr-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-10738 RA Bao W. and Jurka J.; RT "MuDr-type DNA transposons from Branchiostoma floridae."; RL Repbase Reports 9(3), 683-683 (2009). XX DR [1] (Consensus) XX CC TSD is 9-bp long. TIR is 55-bp long. Transposases contain Ulp1 CC domain. XX FH Key Location/Qualifiers FT CDS 7033..8910 FT /product="MuDr-1_BF_1p" FT /note="The predicted protein is not completed, it FT only represent the 3'-terminal portion." FT /translation="SPFPCRLFCSFFYHSRYIQLNVAFSSCYRKYHTDLPT FT YLKDRPRGVVLHIMARLQEAQSYEQDDIVSIGPGIFQVKSSSKRGQQHTVN FT FGATSTNMPSCTCRDWLKHKLPCKHFCSVFNLQQEWGWEKLSAVYRDNPLF FT SLDSTLLSSSSSTCTSSEMDEMDDSSSPSLTSTADPPPPSPTPSPAVYGEL FT PVNKPKRTKQLQRECASLLKEMTNITYNLQDELYLTSMKEQLSCMLEEMMG FT HSAHDGNFRVRSPRKGKRKMTDSHALPTLPPKHPSSNRVGRRADMMKSTFQ FT VNYSLNAKEKAPELEVSVVECDDLQAMVGVETEAVIGVETEAVIGVETEAV FT IGVETETVIGVETEAVIGVETEAVLDCDEAQNEWLVINNTKLTQKDREILQ FT EDQWLNDKHMNSAQHLISSEYPLIDGLRDTVILTANQQGPVPASSDCVQIH FT NINDHWVVSTSIGGNITVYDSLQPSMKPELRSQLADLYRQFAIGEDSIIPV FT NVICAQRQQGGNDCGLYAVANAVALVEEIPPTQIVFQQGQMRHHFEECLEN FT KAIKMFPHDTKVGQANTVSVSYQLTTYCCHEHRPGSPMIMCDKCAQWYHYS FT CVNLRDNEVYALVTQQEDYVCPTCHSD*" XX SQ Sequence 10738 BP; 3274 A; 2288 C; 2102 G; 3074 T; 0 other; ggtcgcggat agatcctatc cgggatttca tgttcaccgg aagtgacgca atggcggggt 60 taagggttaa taataccctg ggaacgacgt cacgtcacgt gattaaagac ttgtcaaccc 120 gccggtactc cggtaccgtt ctatagaatg gctggcaatg gagacccggg gccgccggca 180 actagttgcc atgtcagaaa tcccgaagtt tccttcaata cattggagga gtttgaaaga 240 actcgtttga aagttcaaga agagacgggt tccacatata tcaagcttca gaaaaccagc 300 gccaatttcg atgaaaagag tgagttgtca tactttgttt tcgcgcgaat tgagtggctc 360 actaagcaga ggtttggaga gtattacgta atgttcaaac ttaaaaacaa tagcatttgt 420 gaggatgtac ctttttaata tgaattaaca gatatatact gttctgtcct gacattccaa 480 gttacagcca ttattttgct cattgtttta ggaaggactc cctattacac tgtcacatgg 540 tccgtcatct ggacctaatt aagtaacttg agaagataga accggcctat tacttaagta 600 ctccttcaga cttaacctgt agctgatggt tctgattact tatataaaag aactgctgca 660 gaaagttatt tttcttattc acacgacctt tccattaata gtattttgca aaataacccc 720 ctgccacttt attgacatac tgctatgctg ggtcattgct tctccaactt taaatgcaaa 780 cacaaattct agaaaataat gaatgatccc attttagaaa ataatggtct tgcaccacag 840 tatttacagg acattcttcc accaagtttt ttttgtactc tatcccaaac aacctcacaa 900 atcaaaaaaa aaaactgtaa caactttgat attgtccata cttttattca ctgtacaggt 960 agattctctt aactgataca aaccaacttc actactacta ctttctgttg tccattttgt 1020 tcttacacat gtgaataaaa aaatcaaatg aaaacatccc ttgctttatg gatcttttgt 1080 ctgatctata ctgtaaagta cagaacaatc tgatcatgga aaaactactt ggttctgtca 1140 tatttttacc atgctttaca caacacttgt cttcatccat tctataatac atgtacatca 1200 tcatcatcag tcactgtgtt ctgcatctaa agtccttgaa aagccctata gacatgtaca 1260 tgatataaaa catactagta gtctgtttcg ggcaataatg tcaatgtcat cttaggtgag 1320 acaaaaagct ggagcacaga tgcatccaca tacagaacat acacctatcc atcatcttaa 1380 ttaccccaaa taaccactag taaaacatgt aagggtgcta gtaagagtac attccaaccc 1440 ttactatgtc attgcagatt catgctaagg ttgtgataac tatttaatta tgttattgtc 1500 ttctgttcct tgtcaattgc ataaaatacc attgtaaggt gcctcacctt gtctcacctt 1560 gtctctattg aaagaaaata cacctgtcat acaagtaagg tgcctcacct ggtctcacct 1620 tgtctctagt caaagaaaat acacctgtca tacaagtaag gtgcctcacc tggtctcacc 1680 ttgtctctag tcaaagaaaa cacacctgtc atacctgtaa ggtgcctcac cttgtctcac 1740 cttgtctcta gtcaaagaaa atatacctgt catacaagta aggtgcctca cctggtctca 1800 ccttgtctct agtcaaagaa aacacacctg tcttacctgt aaggtgcctc accttgtatc 1860 accttgtctc tagtcaaaga aaatacacct gtcatacaag taaggtgcct cacctggtct 1920 caccttgtct ctagtcaaag aaaatacacc tgtcttacct gtaaggtgcc tcaccttgtc 1980 tcaccttgtc tctagtcaaa gaaaatacac ctgtcttaca tgtaaggtgc ctcacctggt 2040 ctcaccttgt ctctagtcaa agaaaacaca cctgtcttac ctgtaaggtg cctcaccttg 2100 tctcaccttg tctctagtca aagaaaatac acctgtcata caagtaaggt gcctcacctg 2160 gtctcacctt gtctctagtc aaagaaaata cacctgtcat acaagtaagg tgcctcacct 2220 ggtctcacct tgtctctagt caaagaaaac acacctgtct tacctgtaag gtgcctcacc 2280 tcaccttgtc tctagtcaaa gaaaatacac ctgtcttacc tgtaaggtgc ctcaccttgt 2340 ctctagtcaa agaaaataca cctgtcttac ctgtaaggtg cctcaccttg tctcaccttg 2400 tctctagtca aagaaaatac acctgtcata caagtaaggt gcctcacctg gtctcacctt 2460 gtctctagtc aaagaaaaca cacctgtctt acctgtaagg tgccctcacc ttgtctctag 2520 tcaaagaaaa tacacctgtc ttacctgtaa ggtgcctcac cttgtctcac cttgcctcta 2580 gtcaaagaaa acacacctgt cttacccgca aggtgtctca ccttgtctaa cctggctaaa 2640 caaagacaat taacgctaat accaggacgc aagagtccag tctccgttta cagagggttc 2700 gtagcaagta cagtctacta ctgctcttct cgacaaccat gacaagaagt ttagaccact 2760 tcatgttgct ttgtgactgt aagatcacgt ccagaatatg tgcaggttca actacctcca 2820 gttgtgtatg tccaattgta atcttaagtg gcggtgattg tcttcagttc actcgacact 2880 aatcctcagt gccggactct atgttgccta gacgagatga agctgctgac ccatggcatg 2940 acgatggacg aacaccaaga tccatcagtt cactgattga aatttgtaac acatttcatg 3000 catattcaga cagaatccaa ttgatgttgg catctgcaag ggaaaacagt tcacattaac 3060 ataaagaatt aaaactgtta ttttgcatta gttatgacat ctattacatg tttatttttt 3120 caatagcctg tatgtacgta aagaacatag gttcctatca aactgtttcc atgtgcagat 3180 acccgctttg taacggcaaa tatatttaga aacgttgatt ctatttcaaa tatgacttaa 3240 cgtggaagat gcaagttgta gttttgtcta gtttttaagg taatggccaa atttaactta 3300 acttaacttg tacctacagg tgtcatcttc cacaactgct actgtcttca aactgtcatt 3360 tctgtgaaaa acataatggc aaatttcatc tgattggcta gcataaacaa tgcacagctt 3420 acaatgtata tacttgttat gacattccaa tacagatgtg caaccctcag ctgagaagag 3480 catcaactgg aagctgaagt atgtggaatt tgatggcgtt ccattccaac ttgtgagcaa 3540 ggcggtgtat ggctgtcatc agggggaaga cagggacaag cataagaaag acaagaggaa 3600 ggctgagaaa cagcaacaag cagtatgtta ttctaaaaat gaacttgaca tgaagttgaa 3660 atatattata atgttagcac ttcaaagtac taatttgcat agataatgag taaaatatat 3720 atttagtggg aaatccatat ctcataatgt tccaatacta ccatacataa aaatgtaggt 3780 aacatttcaa caaagaaaaa ggcatgaaca gaattagtta cattttctgc agtccaactc 3840 aaaggggaaa gtttgagatg aagtgtaatg ctgttttgtt taagtgtgat gtacagtaac 3900 atgtacttct atgtctctct gtacctttaa tagctagaag accatggttt taggagaaga 3960 catcgccagg tgcaaaatac caaaaagatg gactgtcctg tcaagttcac agttcattat 4020 ctcatcagat ttcctgattt caaggtatat aaaacttcaa tactataact cctgcctttt 4080 ttcataaggg agacattcat attaatctgt gcaagcagca tgaatgaata atgatgatga 4140 tgttataacc caaagtccca atatttattg agatgaagaa taacagcata acttgagaag 4200 tctcaattaa agaaagtctc cattaaaacc tgtgtgtctc atgcaaattc ctgcatgtat 4260 catgcaaaac tctgcaggag atcatgtaaa atccggcagg atactttgta tgacttgatg 4320 aataacacag agaccccagt gtaaatgaag gtaattctat acttgcgcca ttggataagg 4380 caaacatttt tggacataaa tcatgaatca gtgtcactgg ttgcacaagg aaacacttta 4440 aaatactcta gccttctttg ctaagatgaa gtctattgta aaattgtatg acacaattgg 4500 cctaaagaca agttaacatt gcagctgtta atgcctgtgt aatttttcag attgaagata 4560 acaaacaatc ggcaaaagat aaagtttcca gagacctgaa gaaggcacta gcagaagatc 4620 cgtcagcggt taaggccgtt tgggaatatc gaacacgctt cccaccatgg acgaccatag 4680 aaaccaccca actgttggaa tggtatgtat acaaaatctt gaattacatg gccattagaa 4740 gatgattgaa tatcaaggat aactatccac ttcccctcta catggtatag ggagtgagat 4800 gtaataagac aaagtatcaa aactgtcaaa atccattttt ctataggttg ctgaattgag 4860 tcagcccgtg gatgacagaa tcagacaaca ggtggtgaag gtcaccgtgg gtggtgccaa 4920 aagcgtgact gaggtaaagc accacctgaa cacattcatt accaatgaat tgttcagagg 4980 cgagactcct cctcctccaa cacaacggaa gtactacccg actgacagag atctgtggaa 5040 cattatggcg acagtaaaag attcaacaag gaactcatca caggaccaag ccaacgtgca 5100 ggtaatgagc aaaataatac agtccatgag taacatctgc tctcattatc gcaatataat 5160 agaccatgtt cctacagaca tcctaattgt aagaaaaggc aaaacttttc ctttaaattt 5220 tgtcagtctc ccacatgcag tgatagtcct tccatgattc tttttccaga ttctgtccac 5280 aagatgggcg ctgcaagagg actgcaaggt caagtttcgt ccgagtgaag tccatgagga 5340 tggtacgaag actaacgtcc tcttctgtta tcaaacagca tggcagcaga gattattact 5400 tttgtatggc caacaaatgt gcctgctcga tgcgacatac cgcacatgtc gttacgacct 5460 tccgcttttc tttctgtgtg taagaacaaa tgtatgctac accgttgttg gagtctttgt 5520 cccacacaca gagaggacag tggacatcag agaagccctg caggtgttca aggagtggaa 5580 tccagagtgg aacccctccc actttatggt agacttctgt gaagcagaaa ttggtgccct 5640 cgaagaagag ttccaaggtg aaactcaact caaacttttc aatgcatgta tacattggtt 5700 tataactgtc ttgtcacatc atcaactgct tgctgacaat atggggaccc cggtatactc 5760 accaggattg tacacagatt ggtagcaaat tagggtaatt ctggcaaagc aaatgtttgc 5820 ggacatacat atgtacatgt acagtgtctt gtcctgttca tagatacatg tatatatgtt 5880 accaatcgta tatgtgtctc ttgtatttca tgcagatgcc aaggtactgc tgtgtgactt 5940 ccatcgtgag aaagcatggg tggagtgggt acgcaagaaa gaccatggtg taagtcatgt 6000 gcaagccaca gtacttgacc ttctgaggga cattgcagct gctgccacaa ctgaagagta 6060 tgagagatgc ctgtccctgc ttcgtgaatc agaagtgtgg aaagagaatg agaggctgcg 6120 ggcatggttc tccaataagt ggcttgatga taaatgtact aaggtgcgta tcaagaaatc 6180 tactgtattt catcatagga cactcaaaca tggtgcatat ttctttcttc ttttatctta 6240 ttgacaagca gagaaaaata ttgattatgc aaataaagac ctaattagta tccttgctgt 6300 attaaaaggt taactaaagt tgaacactat tagatataat ttatgcaagt tggcctcatg 6360 tacatgctac aataaaaact aaagaagaga ttttcatgat gagatttttt gtgtaggttt 6420 ttttgtcttt gctcctatga taaacatgaa ttataaaggt ctgtttccta tggacttttg 6480 tacgtacctg tgccttattg tgtaaatctt ttttacaaga tgtaactctt ttcccctttt 6540 taatcacata tagaggtggg tgcaagcttt caaggatgaa gatctgaagg tggcgatcta 6600 caccaacaat ggggtggaga ggcaaaacga gacgttgaaa tactcccacc ttgatggccg 6660 taaaaggaga agcctcaccg agatgctcac agtagtcgta acagacttct tgccgacagc 6720 atatcgaaag tgagctgaag tccaactctt aagattatct tgcacggaca tgtacatgtg 6780 taatttgtaa aaatgtaaca tttctttctt ctttcttaat atataattgg tatgtcatac 6840 ttaagtcagt tggtcaaaac aatgtaagtt ttttaatcaa tattctgtat cttaattcat 6900 agttactgaa atgttagtat tctgatgtta ttcattgaaa aacatttaga taccccttca 6960 taataaattt atggtggacc tgcatagcca attatgtgtt tattttactt ttgtcttcgt 7020 cttcaatgtt agagtccatt tccttgccgt ttattctgtt ctttctttta tcattcaagg 7080 tacattcaac tgaacgtggc attcagctca tgctatcgga aatatcacac cgacctaccc 7140 acttacctta aggacaggcc aagaggagtt gtattacata tcatggcaag gttgcaagaa 7200 gcccagtcct atgagcaaga cgacattgtc tccattggcc cgggaatctt tcaagtcaag 7260 agcagcagca agcgcggcca acaacacaca gtcaactttg gagcaacatc taccaacatg 7320 ccatcctgca catgcagaga ttggttaaag cacaagttgc cgtgtaaaca tttctgctca 7380 gtcttcaacc tacaacaaga atggggctgg gagaagctgt cagcagtata tagggacaat 7440 cccctcttct ccttagacag tacactacta tcttcaagtt caagtacctg tacttcaagt 7500 gagatggatg agatggatga cagcagcagc ccatcattaa catccacagc agatccacca 7560 ccaccttctc ccacaccatc accagcagta tacggtgagc tccccgtaaa caagccaaag 7620 agaacaaaac aactgcagag ggaatgtgct agcctgctga aagaaatgac aaacataacg 7680 tacaatcttc aggatgaact ctacctcacg tccatgaagg agcaactatc atgcatgttg 7740 gaggaaatga tgggacactc agcacatgac ggaaacttcc gagtcagaag cccccgaaaa 7800 gggaaaagaa agatgactga tagccatgca ctaccaactt tgccaccaaa acacccttct 7860 tcaaataggg taggtcgtcg tgctgacatg atgaaatcaa cttttcaggt gaactacagt 7920 ctgaacgcca aggagaaagc tccagagctt gaagtatctg tggtagagtg tgatgacttg 7980 caggctatgg taggcgtcga gacggaggct gtcataggcg tcgagacgga ggctgtcata 8040 ggcgtcgaga cggaggctgt cataggcgtc gagacagaga ctgtcatagg cgtcgagacg 8100 gaggctgtca taggcgtcga gacggaggct gtgcttgact gtgatgaggc gcaaaatgaa 8160 tggttggtga tcaacaatac aaagctgacg cagaaggaca gagaaatcct tcaagaagac 8220 cagtggctga acgacaaaca catgaactct gcacagcacc tgataagcag tgaatatccc 8280 cttatcgatg gtttacggga cactgtgatt ctaacagcaa atcagcaagg ccctgtcccg 8340 gcatccagtg actgtgtgca aatccacaac atcaacgatc actgggttgt gtccacatcc 8400 attgggggga acatcaccgt ctatgattca ctacagccat ccatgaagcc tgagcttcgc 8460 agccagttgg ctgatctgta caggcagttt gctattgggg aagacagcat cattcctgtc 8520 aatgtcatct gtgcccagag gcaacaaggt gggaatgact gtggactgta tgcagttgcc 8580 aatgctgtag cactggtgga ggaaattcct cctacacaaa ttgtgtttca gcaagggcag 8640 atgaggcatc atttcgagga atgtctggaa aataaagcaa tcaagatgtt cccacatgac 8700 accaaagttg gccaggcaaa cactgtgtca gtcagctacc agttaacaac atattgctgt 8760 catgagcaca gaccaggctc ccccatgatc atgtgtgaca agtgtgccca gtggtaccac 8820 tactcttgtg tcaatctaag agacaatgaa gtgtatgcct tggtcacaca acaggaagac 8880 tatgtctgtc ctacctgtca cagtgactag aactgctact gtacaacagc tacttttgtt 8940 tcattttgta aggggatatt agtttgccca ctttgcaaga gtccaagttg cccttaaata 9000 tatgcttgat tgttacataa agatgtaact aactgtaaat atgtatataa ccctgttcat 9060 tttattaaca tttcaaagtt aaaaagacag tcaagctatc tttttgggca tcacatgaac 9120 gagggcactg ctacttccta tgctttatat gcatatgaca atagcatgga acatttatta 9180 cattaccctc tttttttttt gtatcacatg tctccagtgt gatacagggc ttgactaggg 9240 ttgaactccc tactcaccga gggggcccaa attacgttac cctccatatt ttttgctaca 9300 tatttttttc tctgtgtgat tgtgggcttg actagggttg cactccctag tcgttgaggg 9360 ggcccaactt catgttccat gttactgcct tatgttgtaa gcacagttag tagttgatgt 9420 tagtaatcat ttcggcattc tgggagctca aatcattagt gttatttttg gggggtgggg 9480 gatgacaccc ttggcagatg acctcctcgt tggtgggtgt cggtaatgtt gtcaaataag 9540 gcaacagcta acgatctgcc atacccttta agaaaacaaa tatatcaaag ggtgtgttcc 9600 accctttcaa aagagtttca taaattaaca taaattaaca aaaatggtca atcataaaga 9660 gttgactgac aaacagacaa taatagtaat accattaatt tgtaatttca gttgtactga 9720 tcatttattg accgttgttg accctcctct tactctccaa gcagagattg agttgcgact 9780 caacctctgc ttggagggta agaggagcgc cagtgtaagt tttctgacta ctaacgttac 9840 atctacacga ctcaacctct gcttggagag tagcttgaac cccactatca cataacgtat 9900 atgtataaag tataaagaat acaacgtcct ggaaaagctc tgtaaattcg tttacacaag 9960 ttttaaatga gagaagagac tatgttgatt ttagaagatg taaaatgtaa cacctacatt 10020 gtaatttgta tacttcagta atatctaact caatgcattt agcccaggtt ggtaggaata 10080 tgcaataaag atcatgattg tcattgtcat tgtcataact tatcccatac tacaactcct 10140 gtataagttt aagttagata agtgttaaat tatagaactg tagttatgta gatgccatac 10200 tttgtacccc gtacaattgt tgtgcaataa agctattatt attattataa cggtactagt 10260 aaaacaaaga ccaccagtaa cactaccatg tgttgaactt gaactgcaga atacaacacc 10320 caccaaggag ccagacaagt aatccttggt acacatcaat tgggatgata tgaagacatt 10380 tcttacgtac aaatcttagt aacatgtaac gttactgttg ttttcttcga ctgtcgaccg 10440 gatttcagtc agtgattgct gacattttgg agaaaaatgc gccaagaaat tatattacat 10500 ttcgccttac ctgtcatcag gggatccggg agcttgttct atacgcagag aagctactgt 10560 aacttaaatg aaacgtccat gcatctgaca aaagtaaaat ttccgtgtgt tctccctcca 10620 ccgtgctctc actcatgaat cagaaaggcc atgtttcagc caagtctgtg gtcatgcgca 10680 ctgcggccct tacatcactt ccggtcaaca ccgaaatccc ggataggatt tatccgcc 10738 // ID Outcast-20_AAe repbase; DNA; INV; 5659 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Outcast non-LTR retrotransposon from Aedes aegypti. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; KW Outcast-20_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5659 RA Kojima K.K. and Jurka J.; RT "Outcast clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1434-1434 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 267..1823 FT /product="Outcast-20_AAe_1p" FT /translation="MMAEGISDKMEDNDWTHVLARKSKARHKDRDDENDSD FT YDRKTKRNKHNETKPKKDKYHATRVLTDDDDDNLNDENRIQNGDSNTNNEN FT QDNNAHKSQSNTGTKRKTDQFTTKRQNKDNTDGXYNDGWIKNQNYKTFIIH FT KEIKDKEQNNNVKYPHPMEVAKMLKNIGVTKYNTMKSVGRGKFQISFEKPR FT DAEQLLNSKLLTDSFGFSIFVPTRFKESIGVVGDVPPSITDDEIIENSSCE FT NNLKIYKVERIKKRVGENKFQPTYSIKIFFKGETLPKSIEIYGTHRFVEPY FT VFPLKICFKCWRFGHREKFCKSKETRCCNCGQFHNEQNCDTPEPKCVNCSG FT NHKASNKECPERLRQDLIRQDMAVNKSSYFEASDKYPKQTKSNLQTRLDSL FT RDFPRLEDNSQNSTTNVRQKTKRPINQTPYFTTPQNEFNRQDTKHEFLTNP FT YKTSEIEKITQMIKEDLIRQFNLNNMFEKIKAIQKTIIQSTNKTDTIEQDL FT LLINISEELNKIVNPEVTTTELK" FT CDS 1845..5480 FT /product="Outcast-20_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MQDNFRKLTIVQNNVTSLRPIDTRESIQNFLLRHNTD FT IVMLSEIWLKPEETYNFTGYKLLTETRSAGYGGVGFLVKNEIEFKHFKLPN FT LQPIESIAIVTQNTLPKILFISIYIPPLPVNNNDIREPLKKLFETIDTFHG FT SVILAGDFNAHNRLWNPFHENCSRGELIEHLLDNQELVLLNDGTSTLIKSP FT NTTPSAIDLTFASPEIACKINWKVLDEDFFSNHRVIEFDIDNTARKYDYNK FT VYFNKKQAINKLNMLQPHAFHTTDDITEIIKEKFNECTYTINNNKNRSPKK FT WWTEKIKELLKIKNDKLKEFLRNQTDENYTDFKRSRAVLKREIRREKRNCW FT KELIDSIDENMNAKTLWNTVKMVSGGRPAKNNLNLLNNKPLAQQFINQNFP FT PITDIINNPPISDNLIKIDHLEIIKIIKSKKDHSTPGIDNLSFYILKHLNL FT NLIIRFTELMNEVLNTGQIPNDWRTIRIIPLLKPNKDPDNVQSYRPLAMLN FT VLIKLINNVIKNRLNNYIHEHEIIPKNSYGFKKHTSAINCVNTLIVKVNEA FT KREGMVLAATFLDLTKAFDNVDINKLLKIMEQLKIPSEIINWVFSYLKERK FT MILELNDGTKIIQISNKGLPQGCPLSPVLFNIYTSLIHNIAKEGEILIQFA FT DDFTAIVIGFSTATVAEKMNNFLTRLSLQFKNLGMQINPNKSATIIFKNKY FT DPTISIKLDNSRIPVVENHKILGVQVDHKLSFKSHINNSITKAKKKINLLK FT MISRKRSGAHPNQMIKIYKAIVRPHLEYGLTIIGSVPKTTFKKLETVQHLA FT IRTCLRQLNSTPNHVVLYESGEIPLKCRAELLTLKEIAKTFFYNNSLIENS FT LQNIMSLDYLPKHTSYLERTASLNNYLFFQLIPRSRQNYNTTIHNKITVSN FT EIKDLKKKNLNSITQKQMVLQLIEEKYKGVYQIYTDGSILNSLVGCGYYDA FT QEKLSYSCKLKTGYTILSAEIVAIIKAIDYANRKNINELVIFTDSKNTCTL FT LAKPFQTENSLIIKLLNDINQSDIQRIYVQWIPSHIGLIGNDRADHAAKMG FT TTRNVEETIGYTLEDMFNLFKNEVNKQWQDQYEQISIDKGKFHYEHSKIID FT NRPWFKGLNLSTIETIQIGRIRTGHVVTKNKLANWNLVANSRCDHCGDNED FT LTHILHYCPKYESVRKNISILNNKTPIVNILINNNPGEYTQIVKYLKKINK FT YV" XX SQ Sequence 5659 BP; 2353 A; 985 C; 913 G; 1404 T; 4 other; cattctctcc aagctagtcg accagttcag acgatccttg acgagtgttg tgaattttaa 60 ttttaatttt ccgcgaacgt ttcagaaaag tgtaaaataa actgactaac ctctagtgtt 120 actgtatata ttagtgataa gattacatct aagttgttgt aagtattctc cattaagttg 180 attttatagg gtgtttatta tttggtgtgc ttttcgggga tattttgtac aaaagaggga 240 agggaaggga aaccgattat gggaacatga tggcggaggg aattagtgac aagatggagg 300 acaatgattg gacacacgtg ctggcacgga agtcgaaggc taggcacaaa gacagagacg 360 acgagaatga cagtgactac gacaggaaaa cgaaacggaa caaacacaat gaaacgaaac 420 cgaaaaaaga caaataccat gcgactcggg ttttgactga cgacgacgac gacaatctaa 480 atgacgaaaa cagaatacaa aatggagact cgaacacaaa caacgaaaat caagacaata 540 acgcacacaa atcccaatcg aacaccggta ctaaacgcaa aactgaccaa ttcacaacaa 600 aaagacaaaa caaggacaac accgatggaa wttacaatga cggctggatt aaaaatcaaa 660 actacaagac atttattatc cacaaagaaa taaaggacaa ggaacagaac aacaatgtaa 720 agtacccgca cccgatggaa gtggcaaaaa tgctgaaaaa cataggcgtg acgaaataca 780 acacaatgaa aagtgtaggt agaggcaaat tccagattag cttcgaaaaa ccaagagatg 840 cagagcaatt gttgaactcc aaacttctga cagacagttt tggtttctca attttcgtac 900 ctactaggtt caaggagagc ataggggtgg taggggatgt acctccatcc ataacagatg 960 atgaaataat tgaaaacagc agttgtgaaa acaatctaaa aatttacaaa gtggaaagga 1020 ttaagaagag agtaggagaa aataagtttc agcccacata ttcgatcaaa atttttttca 1080 aaggcgagac tctaccaaag tccattgaga tctacggtac tcatcgattt gtcgaaccat 1140 acgtttttcc attgaaaatt tgttttaagt gttggcgatt tggtcatcga gaaaaatttt 1200 gtaaatcgaa ggaaaccaga tgttgtaatt gtggacaatt ccataacgaa cagaattgcg 1260 atacacccga acctaaatgt gtaaattgct caggaaacca caaggcttcg aataaggaat 1320 gcccggaaag actgcgacag gacttaatca gacaagacat ggcagtaaat aaatcatcat 1380 attttgaagc atctgacaaa tacccaaaac agaccaaaag taacctacaa actagactgg 1440 attcattacg cgattttcca cgactggaag acaacagtca aaacagtacg acgaacgtga 1500 gacaaaaaac caaacgaccg attaaccaaa caccatattt cacaacaccc caaaatgaat 1560 ttaacagaca agacacaaaa cacgaatttc taacaaatcc atacaaaacc tctgaaatag 1620 aaaaaatcac acaaatgatc aaagaagacc taattcgcca gttcaacctt aacaatatgt 1680 tcgaaaaaat caaagctata caaaaaacca taatccaaag cacaaataaa acagatacaa 1740 tagaacaaga tttgttactt attaatatca gcgaagaatt gaataaaata gttaacccgg 1800 aagttactac tacagaatta aaataaaatc gaaaaactct tgaaatgcag gacaacttca 1860 ggaaactgac catagtacaa aacaatgtca caagcttgcg accaatagat acaagagagt 1920 ccattcaaaa cttcctctta aggcacaaca ctgatatagt aatgctaagc gaaatttggc 1980 taaaaccgga agaaacttac aactttacag gatataaatt gttgacagaa accagatccg 2040 ctggttatgg aggagtaggg tttctggtca agaatgaaat agaattcaaa cattttaaac 2100 tacctaactt acagccaata gaatcaatag ctatagtcac acaaaacact ctacctaaaa 2160 tattgttcat atccatttat attccacctc taccggttaa taataacgat atcagagaac 2220 ctttgaagaa attatttgaa acaatagaca cttttcacgg atcagtaata ttagcaggag 2280 atttcaacgc ccacaatcga ctctggaatc cttttcatga gaattgttca cgaggagaac 2340 tgatagagca cttactagac aatcaggaac tagtactatt gaacgatggt acaagtactc 2400 tgataaaatc acctaacact actccgtctg ccatagatct aacattcgca tcaccagaaa 2460 tagcttgtaa aataaattgg aaggtacttg atgaagattt ctttagtaac catagagtaa 2520 ttgaatttga tatagataac acagcacgta aatacgatta caataaagtt tatttcaata 2580 aaaaacaagc cataaataaa ctgaatatgc tacaaccaca tgctttccat acgacagatg 2640 atataacgga aataattaaa gagaaattca acgaatgtac gtacaccatc aacaataata 2700 aaaacaggag cccaaagaaa tggtggacgg agaaaatcaa agaattattg aaaataaaaa 2760 acgataaatt aaaggaattt cttagaaacc aaacagatga aaattacaca gactttaagc 2820 gaagtagagc cgttttgaaa agagaaatac gaagagaaaa aagaaattgt tggaaggaac 2880 taatagattc aatcgatgaa aatatgaatg ctaaaacact ttggaacaca gttaaaatgg 2940 tcagtggagg tcgtccagcg aaaaataatt tgaacctttt gaataataaa ccattagctc 3000 agcagttcat caatcagaat tttccaccaa tcacagacat tataaataac ccccccatat 3060 cagacaattt aataaaaatt gatcacttag aaattattaa gattatcaag tcgaaaaaag 3120 atcattcaac tcctgggata gataatttat cattttatat attaaaacac ctaaatttaa 3180 atctgatcat ccgattcaca gaactaatga acgaagttct taacacagga caaataccca 3240 acgactggag aacaattaga ataataccat tgctcaaacc aaacaaagat ccagataatg 3300 tacaatctta caggccatta gctatgttaa atgttttgat caaacttatc aacaatgtaa 3360 ttaaaaatag actgaataat tatatacatg aacatgaaat aattcctaaa aattcatatg 3420 gttttaaaaa acacacatcc gcaataaatt gtgtaaacac tctcatagta aaggtaaatg 3480 aagcaaaaag agaaggtatg gtactagcag ccactttttt agatttaacc aaagctttcg 3540 ataatgttga catcaacaaa cttttgaaaa taatggagca actaaaaatt ccgtcagaaa 3600 ttataaattg ggttttttca tacttaaaag aaagaaaaat gatcttagaa ctaaatgatg 3660 gcactaaaat wattcaaatt tcaaataaag gcctaccaca aggatgtcca ttatctccag 3720 ttctttttaa catttataca agtttgatcc acaatatagc taaagaaggt gaaatcctca 3780 tacagtttgc cgacgatttt acagcaattg taataggttt tagtacagca acggtagcag 3840 aaaaaatgaa caactttctt acaaggttgt cattacagtt taaaaattta ggtatgcaaa 3900 tcaacccaaa caaatcagca acaataatat ttaaaaataa gtatgatcct accatttcga 3960 taaagttaga caattcacgt attccagtag tagaaaacca taaaatttta ggggttcaag 4020 tagatcacaa gttatcgttt aaatcacata ttaacaatag tataactaaa gcaaagaaaa 4080 aaataaattt actcaaaatg attagtagaa aacgtagtgg agctcatcca aaccaaatga 4140 taaaaattta caaagcaata gtaaggcccc atctagaata tggtttaaca attataggct 4200 cagtaccaaa aacaacattt aaaaaactag aaacagtgca gcatctagct atcagaacct 4260 gtcttaggca attaaattcc acacccaatc atgttgtatt atatgaatcc ggagaaatcc 4320 cactcaaatg tagggctgaa ttattaactt tgaaagaaat tgcaaaaacc tttttttaca 4380 ataatagcct gatagaaaat agtttgcaga acattatgag tttagattat cttccaaaac 4440 acacttcata cttagaaaga acagcttcat tgaataatta cttatttttt caactcatcc 4500 caagatcaag acaaaattac aacactacta tccacaataa aattacagta tcaaatgaaa 4560 taaaagatct caaaaagaaa aatttaaaca gtattacaca aaaacaaatg gtattacaat 4620 taatagaaga aaagtataag ggagtttatc aaatatacac agacggttct attttaaatt 4680 ctttagtagg gtgcgggtac tatgatgctc aagagaaatt atcatatagt tgcaaattaa 4740 aaacaggata cacaattttg agtgcagaaa tagtggccat tatcaaagca atagactatg 4800 ccaatagaaa aaatataaat gaattagtaa tatttacaga ctctaaaaac acatgcacac 4860 ttcttgcaaa accattccaa acagaaaaca gcttaataat aaaattgttg aacgatatca 4920 atcaatctga tatacaaaga atttatgtac aatggatccc aagccatata ggattgatag 4980 gaaatgatag agcggaccac gcagccaaaa tgggcacaac aaggaatgta gaagaaacta 5040 taggctatac actcgaagac atgtttaact tatttaagaa tgaagttaat aaacaatggc 5100 aggaccaata cgaacaaata tccatagaca aaggaaaatt ccattacgag cactcgaaaa 5160 ttatcgacaa tagaccatgg tttaaaggac tgaacttatc aactatagag acgatacaaa 5220 ttggacgaat aagaactggt catgtagtga ctaaaaacaa actagcgaat tggaatcttg 5280 tcgctaattc ccgatgcgat cactgtggtg ataatgaaga cctaacacac attctacatt 5340 attgtcccaa atacgaatca gtccgtaaaa acatatctat acttaataat aaaacaccta 5400 tagtaaatat attaatcaat aataatcccg gggagtacac acagatagtg aaatacctga 5460 aaaagatcaa caagtatgtt tgagaaaaat taaataaagg atgtaattat gtcaagagta 5520 tsaaacagta aaaataccag aggggcttca gtttcgsgcg acctattcga ttaaagtttc 5580 aacaccacta ctatcttcac aacccttggc tttatggatc aaaaggtctg agccatcaaa 5640 ccgagagaaa aaaaaaaaa 5659 // ID Sola1-3_Lgigantea repbase; DNA; INV; 3913 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola1-3_Lgigantea. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3913 BP; 1420 A; 623 C; 597 G; 1273 T; 0 other; cgtcacctca ggtctataag gttctatctc cactttttga aaaacatgag ttattggtac 60 cgttaaaatg agcttggaaa gttctatctc ctgataaaat tgtaattatg tgtaaggctg 120 ttcatacaga aatacaggct atattagtat ggtttatatc cattattggt attaaactta 180 tttccatagt gttctatgtc cagaaaccaa tcagtacgta ccattcttat cacgtgacgt 240 caaatcaact aacagttctc tctcttttcc ttgccttgac aaaattccac atggtgtgat 300 tttaaaaaga ttttggctaa atttcaatgg atgtaaagta tgtttggtga ttttataact 360 tcaaaaacca ttataagcta gtaaataaaa ttgattttta aagcattata attaatttga 420 tgactataac tacaaggcaa aatttttttg gatatagaac agtatagctc taaccagata 480 ttgactttta caatacatgt tatttctatt tatctatatc agaaaaatag atgtttaata 540 gtaggttttc tttcaatttt cagattgaat caactagaaa atctcccgga cttgccccta 600 atacacatat tttcacattt aagacttcct gaattaattt tagttgtaaa cagagtttgc 660 agaaggttta atcatataat ttcgacaact tcaattcttt ggcaagaatt tatatttgac 720 tttacacttg aactcttaga agaggattta gaatatattt tttctaaagc ttataaattc 780 aaaacttttg attgtggccg ggctaaatac ttatgtgaat tggccgacgt agattattat 840 tttactaaag gactttgtca atccgcagaa ctaacgtggt taaacctttc aggatccgtt 900 ttatcaacaa cttctttcct taaatatctg cctaatcttg aatttttgga tctgtctgaa 960 acacctaatt tacacaattc agagtttcat gttttaaaat gttgttcgaa acttgttcaa 1020 ctgtacatat cgttttgtga tgttaaaata gatacaataa ttgaaagttg caaacatcta 1080 aaatctatta agacactcga tatttctgga attccaatta gcctggaaca atgtgttcaa 1140 ctcttaaatc atacttatga tacacttaca tgctgcatat taaatatccc tagatatata 1200 actgtaagag aattttttgt gttaattgaa ctaaagtaca agaactgtcg attttttgtc 1260 ctcagaattg acaagtatgg agacaaaatt tggagttaag aatgccctcg aagttttgga 1320 tttccttgat attgacaatg attgtagtga atatgattcc tcgtctgatc attttgaatc 1380 tgtagaacca gccattaaca tacccctttc ttaccaaaat aataatatca atgaaatctt 1440 accagcagct caaaatatat ctaatgatga atcttttgaa ataactttgg acgaaatata 1500 cgtaattccc gacactgaac tcagtacagc aaatgttata atagatcacg aaagtaatat 1560 ctcaaccata atagatagtg ataaaaatgc taaaattccc gaaacagcaa atgctgaatc 1620 tagaccttct cgaaaacgga agaaaaacca agcagaatgg gcaaaaaatg tacgaaaaag 1680 gagacgacaa tcaggaaaag aatatacgga ctcaaacgga aacctacagc gaaaaagatc 1740 actgaaatat gacacaaatc atacatgtag attcaaatgt tgtgaacatt tctcatatga 1800 tggttgtcta caaatctaca cggagttttg gactcttaac gactcccaga aaaagatttt 1860 ctatagccaa actacaagta aagaacgtaa ggcaagaact agattgccta atactgaatt 1920 tactagtaga aagcagttta catataaata tcatttgaaa aaaagtgatg aaaatatacg 1980 tgtgtgcaaa cagttttact tagatactct ggatataagc cagacacgaa tccaaacttt 2040 tcatgaaaaa gatgaaagag gaaatatcag atacagtgat aagcgtggtg cacatagttg 2100 taaaaatatt atcaatactg aaccccaaaa ggaaataatt cgtaaccata ttaaatcatt 2160 tcctactatt ccctcacatt attgtagggc aaattccaag agacagtatt tagagcccgg 2220 gctcactgta caaaaaatgt acaatctata tgttgataac tgcaatagaa atgaaataaa 2280 acccttaaaa ttatatgttt atcgagatat ttttaatttg gaatttaaca ttggattcca 2340 tgtcccgaaa aaagacaagt gtgacacgtg tgaaaaatac aaaatattaa cctcccaacc 2400 aaacgttaat gaaattaatg ctcaaacatt acgtgatcat ttgaacttaa agcaagaaac 2460 aaaaatagaa agggacacgg atagaaacag aaaagataat tctgtagttg tgtgctttga 2520 tatgcaaaat gtcatgacgt gtccgcgagc ttcagtttca aatttctttt ataaaagaaa 2580 gctttcggtt tttaatttaa cagcacattg ttcttataac aaagtggctt ataatgcaat 2640 atggacagat aatatgatgg gtaggggggc taatgaaatt gcaagtgctt tgtgtgttat 2700 attacaaaaa gtgtgcgaag atattccaaa tattaatcat ctgattttgt ggtctgacgc 2760 gtgtgttcct caaaacaaaa attccataat ggtaacagca ctgacaaaat ttgtcgagag 2820 acattcaggt ctcatcattg aacataaatt tggaacacct ggacattctt ctatacaaga 2880 agtagacaat gttcacagtc acattgaaaa agcactagga gtagccgaaa tttatagccc 2940 aatatccctt gtcagagtct tacttaaagt tcgaccaaaa cacatgaaag ttatacaaat 3000 gaaaaaatct caattctttg atttccaaaa ggtttcacaa cgatttaaat ttaatattcc 3060 attttgtaaa ctaaattatt tgagaattga ttcttcccga catatgcagt gtgattacaa 3120 attttgtttt tctgaagaat tgaaaaccca tactatttct agaaaaaaga gcacccggtc 3180 agatggaaaa tgtaaaacct ttccgaatcc ctcggtattg gaaaaaaaca tgacatactc 3240 gaaagaaaaa atcaacgatt tgaaaagtat gatgccattt atgccacttt ctgactgtga 3300 ttactacaga cttactcttg gcatagacgc taataattaa tatgaattat attacacgat 3360 aaagccttta tttaaatgtc aaaagcatca aaaaggtaaa acagtcaaat tccctccgat 3420 tatattgatt tttattgatt attacatatt aaaattactg gaatatcgtc ggtcgacata 3480 ttatcaaaat ttggtttcca tatatacaat ggtcagcttt tattacaaca tataagaaaa 3540 tgtcttcaaa tattattcat tttattttaa ttccattttt tttattaatt aatcgatcta 3600 tgtttaaagc taggaaatgt gggatataga acgggtgcca atatgagatt gtcttctgct 3660 actaattcca tatcgttcta tatcccaatt cctccattgt ccgtttttgt aaaacaaaaa 3720 tatgagaaat ttgtaaactt tttgtcaaaa tgagttttat taaagtaaaa ataaatgaat 3780 gatttttcac aacatgttaa ttttgatata ttccttcact taatatggca tttatatggt 3840 tgtcaaagtt tcaaatttga ttttctcaaa aacaacaaga acggagatag aacattatag 3900 acctgaggtg acg 3913 // ID BM2A repbase; DNA; INV; 318 BP. XX AC X70929; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Bm2a repetitive element. XX KW L1; Non-LTR Retrotransposon; Transposable Element; BM2A; KW Repetitive sequence. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-318 RA Ichimura S. and Mita K.; RT "Direct submission."; RL Unpublished. XX RN [2] RP 1-318 RA Ichimura S.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (02-FEB-1993). S. RL Ichimura, National Institute of Radiological, Science, Anagawa. XX DR GenBank; X70929; Positions 274 591. XX SQ Sequence 318 BP; 79 A; 91 C; 73 G; 70 T; 5 other; cacacatcat tacggatcct cccgacccat taacggtgct tttaagtacc acaagcacca 60 gtcaccgtcc tcgttgaacc cgtcgcttgc gacgaagggc ttgacgagcg aatgaaccca 120 cagacacagc ccactgagtt tctcgccgga tcttctcagt ggntcgcgtt tccganccgg 180 tagtatattc tgcgaagcgc tgcttttgct agggctagtg taagcaacac tccggtttga 240 gccccgtgag ctcacctaca tgttagggcg antgnanaat cagcctctca agaccatcag 300 cataggtagg aaaaaaaa 318 // ID DNA8-69_AP repbase; DNA; INV; 439 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-69_AP. XX NM DNA8-69_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-439 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2004-2004 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 439 BP; 138 A; 69 C; 73 G; 159 T; 0 other; cagtgtcggc ctggcccagg cggctaggag gcgatcgcct ccgaggcccc gacccgggta 60 tttttcaaga aataaatgaa attatttacc aaaatgtacc tactatatat agcccagtgg 120 tatgcgtttg gatccactaa ataattttat aataataata taatatacat tttattattt 180 taatgatact aaaatcataa tatcttagtc cgtggtaata atacaattat atgataattt 240 tggaatctat gattatgttg aaatacgtac tgtgatgttt gatcgatatt atattactat 300 atatataata tatatcgatc catattatgt gataaatatc tataaaacgt ttttaaatta 360 tttttttgag atttggtatt tttgagaggc cctttgaata atttgcctcc tgggcccttt 420 agatcccagg ccgacactg 439 // ID PIF_Harbinger-1_TV repbase; DNA; INV; 4521 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4521 BP; 1547 A; 727 C; 714 G; 1533 T; 0 other; ggggagaagc ttcctaaact ttttaatccc ataaatatgg tcattatctg gaataaagag 60 ttaagcgtca tatgtcactt tttgtactaa ggaaacatcg actttatgcg ttatgacttt 120 tcgatgagga aacaagtgtt cacgtattat aaacttcctt attccaataa accgacccta 180 aacataatta attttaagca ttataatggt ctcttttcta tgataaccaa atcaatttgg 240 ttatcataga aaagcaccaa aaactagaat aggaatcggc aatatggaat tatgaaacaa 300 ctaggataaa atggatatgg gacattcatg aatgcattca cttttttcaa gactttttaa 360 tgaatttcaa attcctcaga ttattttcaa tagattttcc gtaccatatc atttttatta 420 gacaaagcta tcttttcgat gaaatttttg caaaaatact ctgttttgaa aaagatttat 480 gtaaattttt tgataaaagt gtcttgaatg attatcattg ataataccat tccaattgaa 540 tgctgttttc ggctgctttt ataaaaattt gatcatattg tactcataat ttttttgtat 600 tttggtcaaa aactaagcta acatcagctg tatatttgca taagccatta ataagctttg 660 gtataggtta tactgagttt agacgatcat acaatttaga cttttttcat ttaatttagt 720 ctaacaaaga gtgattatct tactatttga ttcataaata tataggaaat aatagtttta 780 taccttaata taataaaatt ttataaaatt ttcccttctc ctttggcaat attagtcatt 840 aatagatata tttaacatga tggttctaaa tatatgaata gataaaattt tcctttcaat 900 tttacgaacc aaatttatta ccatatccta aattttttct atgtttatag acctttggaa 960 ctttgtatga agtctaaata tacatgtaga gtattcaaac aaaacaaata tcaaaatcta 1020 aacctttgaa ttaccttcaa tgcttaaccg aattactaga tcttatatat atgtgtgtac 1080 taataccttt tgatgcataa aaaactccaa aagagtttca tttttaaaat agtcgatatc 1140 aattcaaaaa attgaaacta taccgtcaac tgtcaatttt tcgatagttc gtttttttaa 1200 atgtaattgc tttttttata tgcattaatc atttgatagt agtgggatca aattacatat 1260 cttaaaaatg tattccgagt gaaaaaaatc aatagaattg accacagata tctgtttaat 1320 ccaaaaatat tattagaatc atatttaata aaaatgctat gcttttagaa ataaacgaaa 1380 gaataaacgc actgtttcaa accgtcaaaa ttgaatttgt gtcaaatatc ttgaatttac 1440 aattttagct atttagataa ttagagaagc atttgatcaa tttttgtgtt aaacaactat 1500 ttttcgaatt tctgagatat gaaaattaat tcactaactt cttaaaaaca tgaattgaat 1560 gaaaactctt agaagaaatg ttcatatcac tttcattaat ggagtacaag gatgattccg 1620 agttcttgga tgatatccag aagtccaaaa cccgatccaa ggaaatcgtg agaatgcaaa 1680 tgcacttagc ttatcttgat gaatcagagc ccgcaagtga agatgaaaat caagcaccag 1740 ctaatgccga agaatttaaa gttgtcagaa agtttagctt acaaaaattc cttgaggcac 1800 acccaatctt aaatgtcaaa gacctttgta acttttcttt ggatcaactc ttgtatttag 1860 ttgatgttat tcgcgagcat gagttttcat ccagacgcgg tcggaaaaac ggttgtgatc 1920 cattagatgc gttattccta acattaactt actattgcac ttatcttcca ttgcaaaaaa 1980 tgtcaggaat tgttagtttg aaacagagtt acctcagcaa aatcattaac aaaacaacaa 2040 acaggatgtt cccaattttc attgctgaat tcattccaaa acaatattct ccctgtgaca 2100 aggagttcga taactttcca acatgtgttg ctgcagttga tagctctact attcctttta 2160 ccgttcctct tgatcttgct gagcgtaaaa gttcatggga tgctaaaaac cattgcaatg 2220 gtatgaaggt tcaagttctt gttaatcctt taggacaggt aattcacatg aatacaagtt 2280 ttttagcttc agttcacgat aagaaagtat ttgatttatc aggagcttca gattttttac 2340 acgtccaacg tggagttgaa tctgttgcat taggacttct tgcagataga ggttacatcg 2400 gtattcagaa ataccatcca acagccgtca ttatgcaaag aggtgatgac gaagaagtta 2460 caaaacgcaa taatgcgatt gcacatgatc gccaaattgt cgaacgccgc tttgcacgcg 2520 ataaaagtaa ttggagtgtc ttggcaatag gatatcgtgg tgaaaaaggc aatcttccac 2580 ttattgttca aggattgttt gctttagata actatcacat cgattccaca cctctcagtg 2640 agaaggataa ctgcgatttt ccttgcactc caaaagatga aaaagaagct gcagcaacac 2700 cagaaccaaa tggttcagga agcccaatgc caaagccaag atgtgttcca aaagatcttt 2760 atctccaaac tccagaaact ccaatgaaga ttccatctcc actagctgga tatgaaaaga 2820 ttgataagat agttgttaaa agaacacaaa atgcaatcta caacatcgaa acaaaaatgt 2880 caattccagg attacataat gagggagcta cctgccatgt caatgccgtt ttgcaagttt 2940 tgttctacgt aaatactttc agaaaagatg taatggagca tttgggagat gtcgaatata 3000 cgccagtgaa agcacttgcc gaaatttatg atcagatgtt gcactaccgc cacaaggaat 3060 acacaatttc atgctcacaa ttcatcacaa cactcggcaa gcaatacttc cagcagcaag 3120 atgccgatga cacatttcaa aagattgtac atgatgttac ggaatcagca aacaaaccag 3180 atattgccgt tacattccaa tatcaaactg agactcctga tgaaagcttc tattcggttt 3240 cgcttcacat tcctcttggt gcagcaggta ttgatcaagg aattaaacag atgtgcacag 3300 atgattctca tatcaatgaa atctcaacat tcctctgggt cgacgttcag aggaatttgt 3360 ctgatccaaa tgtcttcttt gacaaatttg agatcaagaa gaactacaaa actcaactca 3420 atgatggaac ttatgtattt aagatacatt ccattattgc ttatcatcaa ggtcatttcg 3480 ttgccttcgt caacaaagga agagtttggt tcatgctcga cgatgaaatt acttacattg 3540 ttcctgatgg tttgatcaaa ggtcttcaag gtggagatgt tggcagcgat ttgtggtcat 3600 atcttggtgg tgtcaaatgg cttgcaaaat gtgtcatcta tcgcaaacac aagttctatg 3660 gagctggtga tgatgatgat tgaaaacagt ttaatttttt ttaatcttat tataagggat 3720 aattctctaa atatatgatt ttttcttaag ttagtgaata tttttcttgt gtgcttttac 3780 ttatttaagt atttagtatt attacttaaa aattttgaac atttatttta taatattgag 3840 atttaaaata aactattgct cgatcgtatc tttatttgct cttttaataa aaaatctaac 3900 ttactattga ttagatattg atttaaaaaa aataatataa tagcataata aaatttcttt 3960 atttccccgt acaaagatat agccgtaaca agacattaac gtttggaaag taaatttgaa 4020 aaaaagtatt atattataaa agaagttaca aatttttaga atatttttgt aaaataagag 4080 caaaaagtag aatgcaaact cttcgggtac taaaacctgc cattaaattg agataaataa 4140 tagattttgt aaaattttgt tgtaattctt gtatcagtct tgccttgagt atgagtttga 4200 gtctaccctt gtttccatat tgcctattcc tgttttaacg cggctattcc tattccagtt 4260 tttggtgctt ttctatgata accaaattga tttggttatc atagaaaaga gaccattata 4320 atgcttaaaa ttaattatgt ttagggtcgg tttattggaa taaggaagtt tataatacgt 4380 gaacacttgt ttcctcatcg aaaagtcata acgcataaag tcgatgtttc cttagtacaa 4440 aaagtgacat atgacgctta actctttatt ccagataatg accatattta tgggattaaa 4500 aagtttagga agcttctccc c 4521 // ID Helitron-1_BM repbase; DNA; INV; 5072 BP. XX AC . XX DT 01-MAY-2010 (Rel. 15.07, Created) DT 01-MAY-2010 (Rel. 15.07, Last updated, Version 2) XX DE DNA transposon - consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; Helitron-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5072 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 935-935 (2010). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 69..1016 FT /product="Helitron-1_BM_1p" FT /translation="MTTICRYCNALKFKRETAGLCCASGKVKLDPLLTPPQ FT PLKPLFDGTDPDSSHFLQHILEYDNCFRMTSFGANIIREGGFMPTCKVKDT FT XHITNHSHQHNELQHITTTYTPHTTPSHEVTPYSSTNDCLQLQXQGQIYXX FT HGSMVPTRNEPHQFLQIYFISSMVDQLNVRCNIQGTQQLKRRIIEQLQAFF FT HANNAXVNMFKTALERMPSDTHKFVIRADCTPTGEHVRRFNAPTVNDVAAI FT IVGDPTKSRDIVVQRRSNIMHRVNETHRLYDALQYPIIYWQQCSWYKEYNS FT FTSDTKNKLNFMDCLKVLLCELKV" FT CDS 2491..4851 FT /product="Helitron-1_BM_2p" FT /translation="MELIPNYGTPDLFITFTCNPKWTEIERELEPGQKPQD FT RHDIIARVFQQKLKVMMDVLTKYRVFGDTRCYMYSVEWQKRGLPHAHILIW FT LLNKLHSNEVDDIVSAEIPDPVTDPHLHDIVTTQMVHGPCGALNPLSPCMA FT DGKCTKRYPRPLVAETVTGNDGYPVYRRRSKEDNGRTIKVKVQNQEIEIGN FT EFIVPYCPLLSXIFETHANVESCHSAKSIKYLCKYVTKGSDMAVFGIASEN FT ANDEISNFQMGRYVTTNEALWRLLSFQIHERYPTVVHLAVHLENGQRVYFT FT EANAAQRAERPPSTTLTSFFAMCEADPFAATLMYVEMPKYYTWNQSTKKFQ FT RRKQGTPVPDWPQVFSTDALGRMYTVHPRNDECFYLRLLLVNVRGPKSFAH FT LKTVNGHQCQTYREACQLLGLLENDSHWDLTLADSVVSSNAYQIRTLFAII FT ITTCFPSQPIQLWNKFKDDICEDILHRLRIQTNNPDIQITDEIYNEGLILI FT EDQCLTIANKLLIEVGMIAPNRSMHDAFDQELNRELQYNVDTLQEFVRNNV FT PLLNQQQKQVYETLMQAVDNNTGGLFFLDAPGGTGKTFVVSLILATIRSRC FT DIALALASSGIAATLLDGGRTAHSALKLPLNLNTIDTPTCNISRSSAMGKL FT LMQCKLIVWDECTMAHKKSLEALNFTLKDFRRNNNIFGGSMILLAGDFRQT FT LPVIPRGTPADELNACLKASPLWNNVKTLSLTTNMRVQLQNDQSAARFSKQ FT LLAVGNGKVPVDATSGLITLTNDFLSPICRLSISSY" XX SQ Sequence 5072 BP; 1664 A; 989 C; 986 G; 1426 T; 7 other; atcgcttggc attccaatat gatcccactg cgaactacag tgatgatgaa aatttagata 60 ttggaccaat gacgactata tgccgatatt gcaatgcgtt aaagttcaaa agagaaacgg 120 ctggattgtg ctgcgcaagt ggaaaagtca aactagatcc attacttaca ccaccacagc 180 cactgaaacc attgttcgat ggaactgatc ccgattccag ccattttctt caacacatcc 240 ttgaatacga taactgcttt cgcatgactt cctttggagc taatatcatt cgagaaggcg 300 gcttcatgcc gacttgcaag gtaaaagata caanacacat aaccaatcac tcacaccaac 360 acaatgaatt gcaacacata acaactacat atacaccaca cactacacca tcacacgaag 420 ttacaccgta ttcttcaaca aatgattgtt tgcaattaca gatncaagga caaatatatn 480 atttncatgg ttcaatggtg ccaacacgaa atgaaccgca tcaatttctg caaatatatt 540 tcatttcgtc gatggtggat cagctgaatg tgcggtgcaa tatacaggga acncaacagt 600 taaagagacg aattattgaa cagttgcaag cattttttca cgctaataat gctgnggtta 660 atatgttcaa aacagcattg gaacgaatgc catcggatac gcacaaattt gtcataagag 720 cggattgtac cccaacaggg gaacatgtgc gaagattcaa tgcacccacc gttaatgatg 780 ttgctgctat tattgttggc gatccaacta aatcgcgaga cattgtcgtt cagcgaagaa 840 gcaatatcat gcatcgtgta aacgagacac atcgtttgta cgatgcgtta caatatccaa 900 tcatttattg gcagcagtgc tcttggtaca aagaatacaa cagttttact tcagatacaa 960 aaaataaact gaatttcatg gattgcttaa aagtgctgtt atgtgagctt aaagtatgac 1020 aattaacaaa aattattgaa actagtggtc ccgcagtagt cgaaattcga ctacaattaa 1080 ttgaaattat aagtttgaac attattaagg ttctattgtc aaagaatatt acttctatag 1140 tcacaaattt cgtctacacc tatagacaaa taaatataaa gacgaacaat actatcctat 1200 tctcaatttg actacgactt aaagcaataa gaaaagtctg acaataaaca aagggttata 1260 ttatgcaggt gtgcgtgggt gtgcgtcaaa tacttggtag tgtgtgtaat tttttcttga 1320 tttaatttat tatctactat ataaaaataa gtcgggtttt ccttcctgac gctataactc 1380 cagaacgcac gaaccgattt ccacggtttt gcattcgttg gaaaggtctc gggctccgtg 1440 aggtttatag caaagaaaat tcaggaaaaa attcaacaga aaagcgtgaa aatctttttt 1500 tttttcttta acgcgccagc ggcaacacgt ttatttgaca actgtctatt gtgtactact 1560 aatcaattga attgtcctta aagcataaaa aatcacttaa tgacacaaac cacattcgtt 1620 tagtaattag atttaaaata agaatttaat cgatcagaaa tttagcttta tgattatttt 1680 tcctgggttc ttagctttga gtttattatc agtataatct attttatatg tttagtttga 1740 tttatgttaa aaagaatata aagtaacaaa ttaaaatgtt gtaaaacgct acttttttca 1800 aaacgtttct aaacgttttg taagttatca aaatgacatc catcaaagtg acgttttcta 1860 aagtcggggc ggaataagca tatactttaa gtgtataatc tatggaataa gcccccagtt 1920 aatttcactg tgaaattgtg aaaaaaagta aaaaaaaagt tattcgtttc ctaacatttc 1980 aattggaaag catcaaatca atcacgtgta ttgtgcaaat ttttattcaa tttcaacagc 2040 aggcatttgt ctttaaggtc gtaagttgaa ctgtgtaaat tatgtggatc gtgcatttga 2100 ggttaaattc accactctgt ccatagtcca aaatgcctag aagacgacgg gcgaacatcg 2160 gccgccgcac aagacatgca agccagcaac aagtgtattc acagaactta agcgaagaaa 2220 gacaaaatat aataagagaa aatgcccgat tgagacaacg cgtgagcaca cgaagatcat 2280 tggcatcata caatcgcttg gcattccaat atgatcccac tgcgaactac agtgatgatg 2340 aaaatttaga tattggacca atgacgacta tatgccgata ttgcaatgcg ttaaagttca 2400 aaagagaaac ggctggattg tgctgcgcaa gtggaaaagt caaactagat ccattactta 2460 caccaccaca gccactgaaa ccattgttcg atggaactga tcccgaatta tggaactccg 2520 gatttattta ttacgttcac atgcaatccg aagtggacgg aaattgaacg tgagttagaa 2580 ccgggccaaa aaccgcaaga tcgccatgac ataatcgcca gagtatttca gcaaaaactc 2640 aaggttatga tggatgtgct tactaagtat cgagtttttg gtgacacacg ttgttatatg 2700 tactcggtgg aatggcagaa gcgtggacta ccgcatgctc atatcctaat ttggttgctg 2760 aacaaattac attcaaatga agtggatgac atcgtatcag ctgaaattcc tgatccagtc 2820 actgatcccc atctacacga cattgtgacg acacagatgg tgcatggacc gtgcggtgca 2880 ttaaatccat tatcgccttg catggctgat ggaaagtgca caaaacgata tccgcgaccg 2940 ttagttgctg aaacagtcac agggaacgat ggatatccag tttatcgtcg gcgttcaaaa 3000 gaagataatg gtcgaactat caaagttaaa gttcaaaatc aagagattga gatcggaaat 3060 gaattcattg taccatattg cccgctgcta tcangaattt tcgaaacaca tgcaaacgtt 3120 gagagttgtc attcggccaa atcaatcaaa tatttgtgca agtacgtcac aaaaggcagc 3180 gacatggctg tgtttggtat tgcgtcggaa aatgcgaatg acgaaatcag caacttccaa 3240 atgggcagat acgtcactac taatgaagca ctgtggcgat tattgtcatt tcaaattcat 3300 gaaagatatc ccacagttgt acatttagca gtgcatttgg aaaatggcca aagagtttac 3360 ttcactgagg ctaatgcggc acaacgagct gagagaccac catcgacaac attgactagc 3420 ttctttgcaa tgtgtgaagc agatccattc gcagcgacgc tgatgtacgt tgaaatgccc 3480 aagtattaca cttggaatca atcaacaaag aaattccaac gtcgcaaaca aggaacccca 3540 gttccagatt ggccacaggt gttttccact gatgcactag gtcgcatgta taccgttcat 3600 cctagaaatg acgaatgttt ttatttgcga ctgctgttgg taaatgtacg tggaccaaaa 3660 tcatttgcgc atttgaaaac tgtgaatggc caccaatgcc aaacatatcg agaagcatgt 3720 caactattgg gtttgctgga gaacgattct cattgggatt taacacttgc ggattcagtt 3780 gtttcatcaa atgcgtacca aatacgaacg ctgttcgcaa ttatcatcac cacatgtttt 3840 ccttcacaac caattcagtt atggaacaaa ttcaaagacg acatatgtga agatatcttg 3900 catcgcttgc gcattcaaac gaataatcct gacatccaaa taaccgatga aatctacaat 3960 gaaggattga ttctgattga ggatcaatgc ttgactattg caaacaagct actgattgaa 4020 gtaggaatga ttgcgccaaa tcgatcgatg cacgatgcat tcgaccaaga attaaatcga 4080 gagctgcaat acaatgttga tacattgcag gaattcgttc gaaataatgt gccgttgctg 4140 aatcaacagc aaaaacaagt atacgaaaca ttaatgcaag cggtggacaa taatactggt 4200 ggtctattct tcctggacgc acctggagga acagggaaaa catttgtcgt ttcattgatt 4260 ttggccacta ttcgatcaag atgtgacata gctttggcgt tagcatcatc tggaattgcg 4320 gcgactcttc tagatggcgg tcgtactgca cattctgcgc ttaagttgcc actcaattta 4380 aacacaattg atactccaac atgcaatatt tcccgatcca gtgcaatggg aaaattgttg 4440 atgcaatgca agctcatcgt ttgggatgag tgcacaatgg cacataagaa atcacttgaa 4500 gcacttaact tcacactgaa ggattttcgg cgaaataaca acatctttgg cggctcgatg 4560 atattgttgg caggcgattt caggcagacg ttgccagtaa ttccccgtgg aacgcctgca 4620 gatgaattga atgcttgcct gaaggcatca cctttatgga ataacgtaaa aacattatcg 4680 ctaaccacta atatgagagt tcaacttcaa aatgatcaaa gtgctgcacg attttccaaa 4740 caattgttag ctgttggcaa tggaaaagtt ccagttgacg cgacatctgg attaattact 4800 cttaccaacg actttttgtc gccgatttgt agactctcaa ttagctctta ttgaaaatgt 4860 tttcccaaac attagtgaga attatgagaa ttatgcttgg ttaagtcaac gagcaattct 4920 tgccgcaaag aataatgatg tacacgcact gaatttcacc attcaatcaa aaattgctgg 4980 cgatttggtg acatacaaat ccgttgattc cataacaaat cccgatgatg tagtaaatta 5040 tccaacggag tttttgaact ctctggagtt gc 5072 // ID Sola2-6_HM repbase; DNA; INV; 4173 BP. XX AC . XX DT 11-FEB-2009 (Rel. 14.02, Created) DT 11-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola2 DNA transposons from Hydra magnipapillata, consensus. XX KW Sola; DNA transposon; Transposable Element; Sola2; Sola2-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4173 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC TIR is ~550-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1161..3479 FT /product="Sola2-6_HM_1p" FT /translation="MFLGGCKEINMQLAGKLKNIGINAIPGEKICPSCMIK FT LISFKDDEQREPTCSQIYEDEDNELNINELLKKNLNSSLTTMNISPLKMHA FT VNSHSRVALGKKKLLHIQKTVKLGIARTLNIDPEQLSKEDSFNKLSKEVKG FT KANDMDILIRKIKEKMLISNRNQKIQLLTLVPISWSHKDIEKEFNVTNYMV FT QISRKLLQKNGILSFPEKKKGKEIPQETVDKIIEFYCNDENSRLMSGKKDF FT VSIARRSHMQKRLILSNLKELYAKFKTCYPDIKSCFSNFCSHRPKWCITVG FT ASGTHTVCVCTYHQNVKLMISAVKLSKDYHELIDMLVCSRANKTCMIHRCP FT LCPEDSTLVHYLENELYSQDDIDEDDDYCIDYKQWKTTDRSELLSLTETTS FT SFINTLKNKLQKLTVHSYIAKSQAASLRKLKNELSSTEVIVLCDFSENYEF FT VVQDEIQSFHWNKIQATLHPTVIYYKENNVLKCDSICFISDDLLHDVDMVY FT HVMKLTIEHIKSNISHQIESVHYFSDGCAGQYKNCKHFVNICHHEQDFAMK FT CTWSFFATSHGKSPCDGIGGTVKRLTSIASLHRTTTNHILTASAMFNFCQN FT EIIGINFYYISKESNTIIRSEMEKRYTTAKTLPGTRSYHNFVPISNTKIGT FT KIVSEQLEYSFVFNFQTDEIIKEFLSNISIGVFVCLIYDNNPWIGLVEEID FT VENKDFQVSFMHPCYPSRSYCWPSRDDVCWVPSTNLLLMINIPTTVTGRQY FT KISEIDHQSIEDSWANFKPEN*" XX SQ Sequence 4173 BP; 1523 A; 576 C; 640 G; 1434 T; 0 other; gggtcattcc agctcaattc aatttatgct cggcaccatg ccatctcaga ttttgctgat 60 ttttggaata caagcttgat ttaataaaag aaaacttcac tccaaatttt agtgtctgac 120 tccttacggt ttcagagata ttgatacttg gtcccaacct cttttttgac tttttttttc 180 agcgccattt attttttagg ctcattgcac gacacaatta gctttaactt atgaaatact 240 tgtttaatag tggtcaaaat ttctacctaa ctatttttta gggtgaggaa cttaaaaatg 300 atggctaaaa cacacaaaaa ctcaagcttc tatataaaaa ttcaatttca aaatggtgta 360 acactttttg taagtttttt gatgccctgt acaaaaaaat gtttatctca agatgcctaa 420 aagataaaat tctgaaattt tttcctaatg ttctatatgt atcaaactcc ataatacaaa 480 aaatctggaa gggttttctt ttttaaatta gatttatttg catttaaagt tttatttaaa 540 aaattaagaa aaactgttat ttaataatga aaaaaacaat ttgcataaat tttttgcatt 600 ttttaaaaag atttttagaa tcttgctgtg taaggtacag ttctaatata taaacttgtt 660 tgcatttcac ataaaagatc tttattttat gcaatcatta tgtttaatag ctaatggatt 720 ttctattttc aaatctttaa acaataatta ttttaaaatt gtttcttaca gatctaaaaa 780 ataatataac aatatgaatg ataaagaaag atgttcaatt ggaataataa ataaagaaat 840 ttgtaacaaa caaacatatg taaaatctca tgatatcaag aaagtttcaa gtcttaatga 900 tgtggaaaaa gacttgataa ttagtcgttc tggcattcaa tttgatgaaa atgcaacggt 960 ttgcctacac catgaatatg tttatttaaa acgttactct acaatccaca taacttgctg 1020 tgatccattt aattcacaca atggaaaaag aagaaaaggt attattactt aaccatgaaa 1080 cttttttaaa actttattgt aattatttat aaattatata tatttttgca aaatactcac 1140 tgaagcagta attatcactc atgtttttag gaggatgtaa agaaattaat atgcaacttg 1200 ctgggaaatt aaaaaatatt ggtattaatg ctattcctgg agaaaagata tgtccctcgt 1260 gtatgattaa attgatatca tttaaggatg atgagcaaag agaaccaact tgtagccaga 1320 tttatgaaga tgaagataat gaacttaata taaatgaatt gttaaaaaaa aatcttaatt 1380 cttctttgac gactatgaac ataagtcctc tcaaaatgca tgctgtaaat agtcactcaa 1440 gagttgctct tggcaaaaaa aaattgttac atattcaaaa aacagtaaag ttaggtattg 1500 caaggacttt aaatatagat cctgaacagt taagtaaaga agattccttc aataaattat 1560 ctaaggaagt taaaggtaaa gcaaatgaca tggatatact tattagaaaa atcaaagaaa 1620 aaatgttaat ttccaatcgt aatcagaaaa ttcaactact aacactggtt ccaatttctt 1680 ggtctcacaa agatatagaa aaagaattta atgttactaa ttatatggtg cagatatcac 1740 gcaaattgct gcagaaaaat ggtattttat cttttcctga aaaaaaaaaa ggaaaggaaa 1800 tacctcaaga aacagtagat aaaattattg aattttattg taatgatgaa aatagcaggc 1860 taatgtctgg taaaaaagat tttgttagta tagctcgaag atctcatatg caaaagcgtt 1920 tgattctttc aaatctaaaa gagctatatg caaagtttaa aacctgttat ccagatataa 1980 agagctgttt ttctaacttt tgctcacatc gtccaaaatg gtgcattaca gttggagcat 2040 ctggtacgca tacagtatgt gtgtgcacct accatcaaaa tgttaagtta atgattagtg 2100 cagttaaatt atcaaaggat tatcacgagc ttattgatat gcttgtttgt agtagggcaa 2160 acaagacttg catgatacat cgttgccctt tgtgcccaga agattctaca ttggtacatt 2220 atcttgaaaa tgagttgtac tctcaagacg atattgatga agatgatgat tattgtatag 2280 attacaaaca atggaaaaca acagaccgat ctgaactatt aagtctaaca gaaacgacaa 2340 gtagctttat taatactttg aaaaataagt tacaaaaact tactgttcat tcttacattg 2400 caaaaagtca agctgcttct ttaagaaagt taaaaaatga gttaagtagt actgaggtca 2460 ttgtcctttg tgatttctca gaaaattacg agtttgttgt acaagacgaa atacaaagtt 2520 ttcattggaa caaaattcaa gccacattgc atccaacagt aatttattat aaagaaaata 2580 atgtcctcaa atgtgattca atatgtttca tctcagatga tttactgcat gatgttgata 2640 tggtttatca tgtgatgaag ctaacaattg aacatattaa aagcaatatt tctcatcaaa 2700 tagaaagtgt ccattacttt tcagatgggt gtgctggcca gtacaagaac tgcaaacact 2760 tcgtaaatat atgccaccat gaacaagatt ttgctatgaa atgtacctgg tctttctttg 2820 caacaagtca cggtaaatca ccctgtgatg gtattggtgg gactgtaaaa agactgactt 2880 caattgctag ccttcatcga acaacaacaa atcatattct tacagcgtct gcaatgttta 2940 atttttgcca aaatgaaatc ataggaatta atttttatta catatccaag gagtcaaaca 3000 cgataattcg aagtgaaatg gagaaaagat atacaactgc aaaaacgtta ccaggcactc 3060 gtagctatca caatttcgtt ccaataagta acacaaaaat tggaactaaa attgtttctg 3120 aacaattgga gtattcattt gtttttaatt ttcaaactga tgagattatt aaagaattct 3180 tgtctaatat aagcattggt gtgtttgtat gtttgatata tgataacaat ccatggatcg 3240 gactagttga ggaaattgat gttgaaaata aagactttca agttagtttc atgcatccat 3300 gttacccttc aaggtcatat tgttggcctt ctagagatga tgtttgctgg gttccatcaa 3360 caaacttatt gctaatgatt aatattccaa caactgttac aggaagacaa tacaaaatca 3420 gtgaaattga tcaccagagc attgaagata gctgggcaaa ctttaaacca gagaactaat 3480 ttgttatctg tcaaagattt ttaattctta ttttttttaa atgtcaaaga ttgttaattc 3540 ttattacctt ttagttttaa tttttcttat tgtaatttaa ttatttccat ggttttagtg 3600 tgattttaat atttttttga tttattattg taagacttta aatgcaaata aatttgattt 3660 aaaaaagaaa acccttccag attttttgta ttatggagtt tgttacatat agaacattag 3720 gaaaaaattt cagaatcata tcatttaggc atcttgagat aaactttttt ttgtacgggg 3780 catcaaaaat cttacaaaaa gtgttacacc attttgaaat tgaatattta tatagaagct 3840 taaatttttt aacgttttag tcatgatttt tgagttcctc accctaaaaa gtagttaggt 3900 agaaattttc atcactattg aacaagtatt tcataagtta aagctaattg tgtggtgcaa 3960 taagcttaaa aaataaatgg cgctgaaaaa aaaagtcaaa aaagagattg ggactgagta 4020 tcaatatctc tgaaaccgta agaagtctga tactataatt tggagtgaag ttttctttca 4080 ttaatttaag cttttactcc aaaaatcagc aaaatccaag atggtatggt gccgaaaatt 4140 ttttttttta gttgatttga cacggaatga ccc 4173 // ID BEL-1_TCa-I repbase; DNA; INV; 5551 BP. XX AC singleUn_1341; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_TCa_; KW BEL-1_TCa-LTR; BEL-1_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-5551 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; singleUn_1341; Positions 10197 4647. XX CC Positions [4485-5069] - Integrase core CC 'GTAGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 3060..5423 FT /product="BEL-1_TCa-I_3p" FT /translation="MWNSKCDSIQYGCSNFGHELTKPTKRSILSTISQIFD FT PLGLLGPIIIKGKMFIQELFKQKLSWDEPVSGELKSKWFALFQGFMAVDEL FT KIPRHVLLNNSVSVDIHGFSDASEKAYGACIYCRSTDQLGNVSVKLLCSKS FT RVAPLKVLTIPRLELCGALLLARLINKAEESLQIIINKRYCWCDSKIVLSW FT IHSDPSRWRIFVSNRVTQIQNLTNGHTWLYIETHNNPADVLSKGIEPKELQ FT NYCLWWDGPSFLHDSSELWNNKSIACQTLTSEEQKCKVVVTVSVNDFDVFN FT KFCSTFSCFKKLIKSFAWWTRFKNRLQKRPVKSSKTLDLEEYVAAETTLIR FT LLQEQCFDKDLYDLKSGTLSSSSKIKSLHPFLDTNGVIRVGGRLGHSTYSF FT SKKHPILLPNKHKLTDLIATDCHLKLLHVGPQGLLSALRETYWPIAGRNLA FT RKVYRNCVTCFKANPQPLQHIMGELPSCRVKQTFPFYCVGIDYAGPFHLKD FT RTTRNPKIVKAYVCLFVCMAVKAVHIEVVSDLTSEAFLACLKRFISRRGKP FT KDIFSDNGLNFVGAANELRELYVFLKSESTQNKILEFLTSEKISWHFIPPR FT APHMGGLWEAGIKSMKFHLTRIVGNASLTFEELNTIVTQVEAILNSRPLVP FT LSSNPDDLSVLSPGHFIIGRAMVALPDYDYQEIPENKLSRFQRLQRIVQHF FT WSRWTREYICELQNRSKWRNNSLNALKKGSLVIVRDDQTSPQQWRLGRVLD FT LHPGHDGIVRVVTIKFATQIAKRPVAKLCVLPIGESV" FT CDS join(211..1503,1507..2973) FT /product="BEL-1_TCa-I_1p" FT /translation="MPPKRNAQLEIFNLQSKAGSFESQLNHFENYIRSIDD FT SNVNHQLIIQLELRISNIQSLYDKYETIYNELKNYDAATSITSLNDFGDSF FT YNILAAAKSIISTFESNKQNLPQYLPQDANNSCGAFNGAYEEQVKLANLDL FT PTFDGDYMQWAGFKDSFTTLIHNSKRIISKTNKFHYLKLSLKGAALKVIEN FT MQANDSNYDIAWDLLHQRFDNKQLIVKSHLDAIFELPTVNKDSCQSLRELH FT DNLNKNLRALKNLGQPVEYWDTIIIHIILNKLDSNSKRAFAEFKRQCEFPT FT LDDLNNFIKDRCIVLEQLSFKTKAEQPAQINKPKRTPPYQNHYAYASTTNN FT DNTNSGKSCPLCKADHFLHSCDEFYKMTVIERFNFVSDTKLCKNCFNSGHR FT TSQCKSARVCKHCRKKHHSSIHFEKTEPQNNTVAAVSKSSSQILLATIVVN FT IKDVKGITHKVRCLMDGGSQSNFITSKLCKKLGLVTTPINYTVLGINSYPS FT NINKQVKICISSSMTKFECNLHCLVIQKITEKLPIISFDKNILNLPPDVVL FT ADPNFNVSSEIDLLMGASLAFQILRGEPISLGNDNLPIIHNTQFGWVVAGN FT LITEAISQNTSFSFCNTNMSDNVSNEDLQRNLSQFWEIEEDVKLLSSLSKE FT NQLCETFFSETVSREPSGKFIVKMPLKDNHVFLGDSEQMALRRFYSLEQKL FT AKNPQLKTQYSQFINEYKEMNHMSLVSKQFSDLEPGYYVPHHAVYKAESLT FT TKLRVVFDASAKTDSGLSLNDVQMAGPNLQNDLSSILLRFRKHAVVLTADV FT MKMYRMIHIHPEQRKFLRIFWRDNSNEQLQLYELNTVTYGTASAPYLAIKC FT LNVLADENKETFPKTCEIIKGDFYVDDVLTGADAVEEVLQIQNELSTLFWA FT KQVSYLENTYLTTTQF" XX SQ Sequence 5551 BP; 1804 A; 993 C; 1019 G; 1735 T; 0 other; ttttggtcct tcgtacccag gattcgttct catatcacaa cgcaacttcc caaggttagc 60 cattgcgtcg cccgaccaga cacaacaagc acaggtcagc acattccatt ttccattatg 120 atttccttta ttctaagatt tccaagtatt gttttaaacc ttttatacca ttcattttaa 180 ttgcttgaat acatgttgta aacttacata atgcctccga aaagaaatgc tcaattggaa 240 attttcaatt tgcaaagtaa ggccggatct tttgaaagcc aactaaatca tttcgaaaat 300 tatattcgtt caatcgatga ttcaaatgtg aatcatcagt taattataca acttgagctg 360 cgtatttcta acattcagtc attatacgat aaatatgaaa cgatatataa cgaattaaaa 420 aactatgacg cagcaacctc tatcacgtct ttaaatgatt ttggagacag tttttataac 480 attttagcag cagctaagag cattatctcg acatttgagt caaataaaca aaatttacct 540 caatatttac ctcaagatgc aaacaactct tgtggggctt ttaatggcgc ttacgaagag 600 caagtcaaat tagctaattt agatttgccc acctttgatg gcgattacat gcaatgggca 660 ggattcaagg actcatttac gaccttgatc cacaacagta aacgaataat ttcaaaaacg 720 aataaatttc attacttaaa gttaagccta aaaggtgctg ccttgaaggt catcgaaaat 780 atgcaagcaa acgactcaaa ttatgatatc gcatgggact tgttgcatca acgctttgat 840 aataagcaat taatcgttaa aagtcatttg gacgcgattt ttgaattacc gacagttaat 900 aaagactcat gccagtctct acgggaactg catgataatt taaacaaaaa tcttagagct 960 ctcaaaaatt tgggccaacc cgtagaatac tgggatacta taattataca cataattctc 1020 aataaactgg attcaaattc taaacgcgct tttgcagaat ttaaaagaca gtgcgaattt 1080 ccgacattgg acgatttaaa taactttatc aaagatcgtt gtattgtttt agaacaactg 1140 agttttaaga ctaaggcaga acaacctgct caaattaaca aaccaaaacg aactccacca 1200 tatcaaaacc attatgcata tgctagtacg acaaataacg acaataccaa ttctggcaaa 1260 tcatgtcctt tatgtaaggc tgatcatttt ctgcattcat gcgacgaatt ttacaaaatg 1320 actgtaatcg agcgtttcaa ctttgtaagt gacacaaaat tgtgtaaaaa ttgttttaac 1380 tcaggtcatc gcacatctca atgcaaatct gcgagagtat gtaagcactg ccgcaaaaaa 1440 catcattcgt caattcattt tgagaaaaca gaaccacaaa ataacactgt ggccgcggtt 1500 tcttaaaaat cgtcttctca gattcttttg gcaactattg tagtaaacat aaaagacgtc 1560 aaaggaatca ctcataaagt tcgttgcctg atggacggag gctcacaaag taattttatt 1620 acatcaaaat tatgcaaaaa attaggactt gtaactacac ctattaatta cactgtgttg 1680 ggcataaact cttatccatc gaacattaat aagcaagtca aaatctgtat ttcatcctca 1740 atgactaaat ttgaatgcaa cttacattgc ttggttattc aaaaaattac ggaaaaatta 1800 ccaatcatat catttgataa aaatatttta aatctacccc ctgatgtagt tttggcagat 1860 ccaaatttca atgtttctag cgaaatcgat cttttaatgg gtgcgtcact agcttttcag 1920 atcttacgtg gtgaacccat atcattgggt aatgataatt tgcctattat tcataatact 1980 caatttggtt gggtggtagc aggaaattta atcacagaag ctatatcaca aaatacgtct 2040 ttctcttttt gtaacacgaa tatgtctgat aatgtctcaa acgaagacct tcaaagaaat 2100 ttgagccaat tttgggaaat cgaagaagat gtaaaactac tttcttctct ttctaaagaa 2160 aatcagttat gcgaaacatt cttttctgaa acagtatcac gcgaaccttc cgggaaattt 2220 attgttaaaa tgcctctgaa agataaccat gttttcttgg gtgattcaga acaaatggca 2280 cttcgacgat tttacagtct ggagcaaaag ttagccaaaa atcctcaatt gaaaacacaa 2340 tattcgcagt ttatcaatga gtataaagaa atgaatcata tgtctttggt ttcaaagcaa 2400 ttttcggact tagaaccagg ttattacgtg ccacatcatg ctgtttataa agctgaaagt 2460 cttacaacta aattgcgtgt tgttttcgac gcatcagcta aaactgattc gggcctatcg 2520 ttaaatgatg tgcaaatggc cggtcctaat ttacagaatg atctttcttc tattttattg 2580 agatttagaa aacacgctgt tgtcctaacg gcagacgtga tgaaaatgta tcgaatgatt 2640 catatccatc ccgaacaaag aaagtttttg cgtatatttt ggcgagacaa ttcaaacgaa 2700 caacttcagt tatacgaact taacacggta acgtatggta ctgcgagcgc accgtattta 2760 gctattaaat gtctaaatgt gctagcagat gaaaataaag aaacgtttcc aaaaacgtgc 2820 gagattatta aaggtgactt ttatgttgat gacgtcctga cgggagcaga cgcggttgaa 2880 gaagttttac aaatccaaaa tgaactatca actctatttt gggccaagca ggtttcgtac 2940 ttagaaaata cctatctaac aacgacacag ttttgaataa tattgaatca cacacagcgg 3000 acgttaaata caacatacta catgtaggcg ccgaagagaa aacaaaaact ttgggtataa 3060 tgtggaattc gaaatgtgac tcaattcagt acggttgcag taattttgga catgaattaa 3120 caaagcctac caaaagaagc attctttcaa caatatctca aatatttgac ccattaggtt 3180 tgctgggacc tattattatt aagggaaaaa tgtttattca agagctattt aaacaaaaac 3240 tctcatggga tgagcctgtt tcgggtgaac tcaaatcaaa gtggttcgct ttgtttcaag 3300 gttttatggc tgttgacgaa ttaaaaatac ctcggcatgt tcttttaaat aatagtgttt 3360 cagttgacat tcatggattt tctgatgctt cagaaaaggc ttacggtgca tgtatttatt 3420 gtcgatcaac tgaccaactg ggaaacgtgt cggttaaact attgtgttca aaaagtcgcg 3480 tggcaccatt aaaagtacta actatacctc gtttagaatt gtgtggggca ttattactgg 3540 ctcgacttat taataaggca gaagaatcac tacaaataat tatcaacaaa cggtattgtt 3600 ggtgcgattc aaaaatagtt ctctcttgga tacattcgga tccgagtcgc tggagaattt 3660 ttgtgtctaa cagagtaaca caaattcaaa atcttaccaa tggacatact tggctataca 3720 ttgagacaca caataatccc gcagatgttc tgtccaaggg gatcgagccc aaagaattac 3780 aaaattattg tttgtggtgg gacggtccat cttttcttca tgactcatct gagctttgga 3840 acaataaatc aattgcctgt cagactctaa cttcggaaga acagaaatgc aaggtcgttg 3900 tcacagtttc ggttaacgat tttgatgtat ttaataaatt ttgttcaaca ttttcatgtt 3960 tcaaaaaact aattaaatca tttgcgtggt ggacaagatt taaaaatcgt ctgcagaaaa 4020 ggcctgttaa gtcttcaaaa acattagact tggaagaata tgttgctgca gaaactactt 4080 tgatacgtct gttgcaagag caatgttttg ataaggattt atatgattta aaatctggaa 4140 ctttgtcgtc tagcagtaaa attaaaagct tgcatccatt tttagacacc aatggtgtaa 4200 taagggttgg tggcagattg ggccattcaa catattcatt ttctaaaaaa catcccattt 4260 tacttccgaa taaacataaa ttaacggatt taattgccac tgattgtcat ctgaaacttc 4320 tacatgttgg gcctcaaggt ttattaagtg ccttgcggga aacttattgg ccaattgctg 4380 gtagaaattt ggctagaaag gtttatcgga actgtgtcac ttgttttaaa gctaatcctc 4440 agcctttaca acatataatg ggtgaattgc ctagttgtag agtcaaacaa acgtttccgt 4500 tttattgtgt aggaatagat tacgctggtc cgtttcactt gaaagatagg acaactagaa 4560 atccaaaaat tgttaaagca tatgtttgcc tatttgtctg tatggctgtc aaagcagtac 4620 atatcgaagt tgttagtgat ttaactagcg aggcctttct tgcatgttta aaacgcttca 4680 tctcgcggag aggtaaacct aaagatattt tcagtgacaa tgggttaaat ttcgtgggtg 4740 ccgccaatga gttgcgggag ttgtatgtat ttttaaagtc agaatcgact caaaacaaaa 4800 ttttggaatt tttgacatca gaaaaaatta gttggcattt catacctcct agagcccctc 4860 atatgggagg tctttgggag gcaggaatta aatctatgaa gtttcactta acgagaattg 4920 ttgggaatgc cagtcttact ttcgaagaat tgaacactat agtcactcaa gtagaggcaa 4980 tacttaattc acgacccctt gttcctttat cctcaaatcc tgatgaccta tcagtgctgt 5040 caccgggtca ttttatcatt ggacgagcaa tggtggcatt accagactat gactaccaag 5100 aaataccgga aaataaatta tcacgatttc aacgacttca gcgaatcgta caacacttct 5160 ggtcccgttg gacccgtgaa tatatatgtg agctgcaaaa ccgaagcaag tggaggaaca 5220 attcactcaa tgcactgaag aaaggctcct tggttatagt gcgggacgac cagacctctc 5280 cgcagcaatg gagactaggc agggtgttgg atctacatcc aggacatgat ggtattgtgc 5340 gggtggtgac gattaaattt gcaacgcaaa ttgcaaaaag gcctgttgct aagttgtgtg 5400 tcttaccgat tggtgagagt gtttgattga cgatttctcc gttttatatt ttattttttt 5460 ttgtatttta tttttttttt gtttttattt tatgtaattt tgttatttgt gttatgttgt 5520 attgaaagca tgcttgcaat ggggggcggc a 5551 // ID hATx-3_HM repbase; DNA; INV; 3080 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hATx-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3080 RA Jurka J.; RT "A distinct, diverse family of hAT transposons from Hydra RT magnipapillata."; RL Repbase Reports 8(12), 1822-1822 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 3080 BP; 1174 A; 456 C; 476 G; 972 T; 2 other; gggcgggtca aaaccatact ttttttaaaa ttttgacaag cagcactatg tggagttttt 60 gaattggtat aataaacttt gtagactaaa tttttttttt tacacacacc cctaggggtt 120 gcaccacaca ccctaatttt tccacaaaaa aaaaattttg taaaaaaaat tttttttttt 180 tttgtgtata cattctttta acaaaagaac tttaagcaaa aaaaaaattg ttgctcaatt 240 tcagaattat taacttcatg ccttaaataa agtttcaaaa tatcatgtca gcataagaac 300 ttttgctatg ctactatgta attttttagt ttattctatt tgttgcattt tagtctctat 360 acatagtata tttagtgttt gtgtatatat ttgtctactt tttatttttt agtaattaaa 420 aataaaatat aactaataat aagcaacaca ttgtatcatg gcaacatttc atactcgcag 480 taaaactgac aacttaacat ttggacaagg ttctaaattt ccaactgcaa tgattcctac 540 aaaaagagat gttgtgaaat attgctatta tgttcgtaga aatgaagatg ctaaaggcaa 600 aggatcaagt tctgctcata aatgttacag attagttacc actgaaattg aaaatatttg 660 gcgtaaattc agttttccaa caatcgaaac agagggagta ttctcaaaag tatgtagatt 720 aatggaaaaa gcatccaatt taaataaaac ttcaagactt agaagaaatg ctaattttta 780 tgacaaattg aaagcttttg acaacatgtt tgatatatgt tcgtgcaatt gttatgattt 840 aggagtagaa agagagaaat gcagatgtac ttttaaagtg ccaataaaag agtgggatgc 900 gtttgttggg caaaagaaac gcacaaatca acttggacta cttgatcgtg cctacacatc 960 tactttaaag aaaactgaaa ctagaaaaaa aaaaatttgt aacaaggcaa aaacagttaa 1020 atacacataa aaatgttgag aaagacattg atagacaaat tactgatgac aatttctatg 1080 aatctgagga taatatcata ttagatgaat ctaataatga atatgacaag ctatatgaat 1140 ctgatgtctt aaaaaaatgt aatcaaaata ggtttcatta caatgaacta gcaatcatag 1200 cagatcgtta tcgtgtcagt tgcagagcaa ctgctgctat tgtcaatgct gctctaaaag 1260 acatgggtat actaaatgaa tcaaatatgc ttgataggaa aaaagtagaa agagaaagac 1320 tttgtgttgg acaaaaaaat gtaaatgaaa gaaaatgcaa aaatttgaaa ttacaatgta 1380 ttggttttga tggaagaaaa gacaatacaa taacaactgc agggataatt aaagaagaac 1440 atattacaat tgttagagaa ccaagcagca gctatataga tcatctaaca cctgataatg 1500 gaacttctcg ctgcattgct aatgaaattc tccatttaat atttgaaact ggcagcagcg 1560 gttctttaaa tgctctctta tgtgatgcaa ctgtagtaaa tactggaaaa tttggagggg 1620 tgataaaatt aattgaaaca gaactcgaaa gaccaatgca gtggctcgtt tgtcaactac 1680 atttaaatga acttcccttt aaacatgtat ttgagttgat tgatggaaaa acttctggcc 1740 ctggatcatt taaaggagaa attggtaaaa aaataacaga ggacttaact aatttagcag 1800 tagttccttt cgaaaaaatc aatggttctt ttcatttaat tccagataat atctttagtg 1860 aattaagttc agatcaaaaa tatctttaca gtatctgttt atctattcaa tcaggagatg 1920 tttcaaatga aatatctaaa aattctcccg gaaacattca tcatgcycga tggctcacaa 1980 gagcaaatcg tattctacgc ttatacattt ctacagaaaa gccaacaaaa gaacttattg 2040 acttggttac aataattatg caatgctatg caccaggatg gttccaaatc aaatcaaatt 2100 atttggcaat caatggcgct caaaattttt ggtatcttgc tcaattaatt aaaaatgcag 2160 ttattgatgt aaaatacaag gaggtaatgg agcaagtttt gaagcaaaac tcatattttt 2220 ctaacccaga gaatatttta ttggcgatga taagtgatga aagaaaaaca agttagaact 2280 acagccattc aaaggatact taaatccagr aatactagcg aatccaatag acattttaaa 2340 cttcctacaa cattaaatct tggagccaaa gattactgtg aattaataaa ttgggacgtc 2400 gaggaagttc attcaccacc tttactgaat gattattcaa atgaagatat tttgaaagcc 2460 aatgacagtc cattaagtat cccaaagtat ccttgtcata gtcagaatat ggaaagaata 2520 gttggagtag tcacaaaggc atcagaaaat agatatggct acgaaaaacg tcacaaattt 2580 gtaataaacc tgttggagtc cagggaaaaa atgcctaaat ttgattggaa agcgcaatgg 2640 aaataagaca tttataaaaa tatacatttt tttaatagcc tataaaaact tttcattcta 2700 tttttttttt tatttctaaa agttttgttt ctttgcttaa aaaaggtaat atatgaacag 2760 aaagcatttt aaagcttata tcccttattt aacaccttgt atcccttatt ctaataaaat 2820 acctaaacag tattttagta gaacattttt gtagtatacc taaatcatta caattatata 2880 aaataaaata aaacaattat aaaataatta tatataaaag ttgcttatac cccctttttc 2940 tctaaaacgc caaaaaatcc gattttgtgt gcgaccccta cggaggctag cacagacctc 3000 aaaattggta tacatcctat acacatatat aggaaccccc cacaatatgc tgcttcttaa 3060 aaaaaatttt tgacccgccc 3080 // ID CR1-88_AAe repbase; DNA; INV; 4842 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-88_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4842 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1176-1176 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 384..1193 FT /product="CR1-88_AAe_1p" FT /translation="MACNKCQKVTNDSDHIVCRGYCGNSFHMICVKLDYSL FT RGILKDHDKNLLWMCDDCAELFSSDYFRKMSSRCTMENVPDETSIKSLKDD FT IAGLKEIVSTLSSKVDAQPTTPLLSVPWPGTNSKIRHNTVPNTPKRVRDDG FT FSREKPSNSRGTKPASEMIKTVAPPEELFWVYLSAFDPNTSENDMVEFVKN FT CMELTADVEPKAVKLVPKDKDLSTLSFVTFKIGVNKSLKDVALSKDTWPEN FT VYFREFENHSKNQRRVIRVSTGKSPHSGQ" FT CDS 1238..4693 FT /product="CR1-88_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEASNPPITVEPLLPATNSRPGPVFGIGEEVFQNSYA FT GKYLSNMNIACSEIPAASSQSPSHNLHQSSSFAPPGRKPALSFMEASNPPI FT TVEPLLPATYSRPGPVFGIGEEVFQNSQTGKHISIPNNTCPEVFATSSRSG FT QQRVPLPLQQEQPIGFKDFPSNDFRPSAFSPSTEPHGSEQNRIISTSNRHQ FT DNLLLYYQNLGGINSTIEKYRLAISDQCYDIMVFIETWLNDDTLSSQVFGP FT EYEVFRCDRSIKNSRKATGGGVLIAVNSNLKPKAIDNTSWECLEQVWTTIN FT LGDRKLFVCAVYIPPDRTRDIEYIEAHCRSVHAILETASAVDEIVVLGDFN FT LPSISWISSDRGFLYPDTNHSQFHPGAVTLLDSYSTATLQQINSVTNENDR FT YLDLCFVSDLDSSTSVVMAPCPLVKMVAHHPPLLVTIKHSLVHDSINSSAT FT FSYDFHKADHRSIAELFTTLDWENVLDLNNIESAAQTFSAILAYAIDRHVP FT KKSQEHKTGPPWQTTSLRRLKTFKKAALRRFTKHRTVPLRIAYVRLNHKYK FT YASQRCFSRYQHNIQRNLKSNPKRFWKFVNEQRKVSGLPSAMRFNSIEGTT FT PREICDMFSEKFASVFSREDLTADQVELAVSNVPRSSHAIGSFDIDETMIT FT KASSQMKSSFNPGPDGFPSVLLKKHIDVLVTPLLFIFRASISSGVFPSCWK FT IAHMYPVYKKGDKRDVNNYRGITSLCAVSKLFELVIMEPLKAHCQQQLSED FT QHGFMPGRSTASNLICLTSYIMDSMVRRNQTDVIYTDLTAAFDMVNHDIAI FT AKLDRFGINGRFLQWFRSYLTGRELKVVIGDCQTASFMSTSGIPQGSHLGP FT LIFLLFFNDVHRQIKVPRLSYADDLKIFLEIHSIEDCDLLQQQLSSFADWC FT TINRMIVNPTKCSVITFSRKRSPIHFSYQLMGAEIERVDHVKDLGVVLDTT FT LTFNQHVSYVVGKASRVLGCIFRIAKNFTDVYCLKSLYCSLSRSVLEYCSE FT VWSPNYSNGVERIESVQRRFLRFALRRLPWRDPFRLPSYENRCRLIDLEPL FT HVRRNTARALFIADSLQGRLDCPTILEQININVAPRTLRNNLMLRLPLRRT FT NYSMHGAINGLQRTFNRIASEFDFDTPRRTLRRRFSSFFCKRRW" XX SQ Sequence 4842 BP; 1288 A; 1126 C; 1033 G; 1395 T; 0 other; acgttcagtc gtgtttatca ctaaacggtt attttatgtt tgtttgtata cattttcaac 60 agttttaaaa tcacgattcc ctgagctatc gagagtgatg tgtttttgct ttctcgtggt 120 tatgttgtgt tgctgttcgg ttcagtgaaa actgtgattt tttgtgaatt tgtggatgtt 180 taaactggat ctgttgtgaa gaaaacataa acattctttg ctctatcctt gcgggcataa 240 attgaatgaa cggttgtatg ctacattcga tccaattcac tacaacacat tgaagcgctg 300 cacgttgtct accgaaaggc gatagcttgt tcgttggcga tacagtctgt actccacttt 360 cacgtcttcc attgaccaca taaatggctt gtaacaagtg tcagaaagta accaatgatt 420 ctgatcatat cgtttgtcgt ggatattgtg gtaattcgtt ccacatgatc tgcgttaagc 480 tggattattc gcttcgtggc atactgaaag atcacgataa aaatctgctt tggatgtgtg 540 acgattgtgc agaattgttc tccagtgact acttccggaa aatgtcttct cgttgcacga 600 tggaaaatgt acctgatgaa acatctatta aatcgctcaa agatgacatc gcaggattga 660 aagaaattgt tagcactctt tcttccaaag tcgatgcaca accgactact cctctattat 720 ccgtgccatg gcctggtacg aatagtaaga taaggcataa tacggttccg aatacgccga 780 agcgtgtgcg cgacgatggg ttttctagag aaaaaccatc taactctcgg ggtaccaaac 840 cagcgtccga aatgatcaaa accgttgcac cacctgagga attattctgg gtttacctgt 900 cagcgtttga tcccaatacg tccgaaaacg acatggttga attcgtcaag aattgtatgg 960 agctcacagc ggatgtggaa ccaaaggctg tgaagttggt ccccaaggac aaagaccttt 1020 caaccttgag ctttgtcaca ttcaagatcg gtgtaaataa atcgcttaag gatgtagctt 1080 tatccaaaga tacttggcca gaaaacgtct attttcgaga gtttgaaaat cattcaaaaa 1140 accaacggag agtaataaga gtttccactg ggaagagtcc acacagcggc cagtaaatca 1200 atttgggatt ccgggacgca agcctgccct cagctttatg gaagcctcta acccgcccat 1260 cacagtcgag cccctcctgc cagcgaccaa cagccgtccc ggtcctgtgt ttgggattgg 1320 ggaggaggtc ttccaaaatt cgtatgcagg caagtacctt tcaaatatga acattgcgtg 1380 ctctgaaata cctgctgctt ccagtcaatc gccatcacac aatctgcatc agtcttcttc 1440 tttcgcacca ccgggacgca agcctgccct cagctttatg gaagcctcta acccgcccat 1500 cacagtcgag cccctcctgc cagcgaccta cagccgtccc ggtcctgtgt ttgggattgg 1560 ggaggaggtc ttccaaaatt ctcaaacagg caagcatata agcattccga acaatacatg 1620 ccctgaagta tttgctactt ccagccgatc cggacagcaa cgagttccgt taccgctaca 1680 acaggaacaa cctattggtt tcaaagactt tccatcgaac gactttcggc catctgcttt 1740 cagtccatct acagagcccc acggaagcga gcagaatcgc attatatcga cctctaatcg 1800 tcatcaggac aatctgctgc tgtactacca aaacctggga ggtattaatt caacaatcga 1860 gaaatatcgt ttggcaatat ctgatcagtg ctacgatatt atggttttca tcgaaacttg 1920 gctaaatgat gacactctct ctagccaggt gtttggtcct gaatacgagg tgtttcgatg 1980 tgaccgtagc ataaagaaca gtcgaaaagc tactggtgga ggtgtattaa ttgcagtcaa 2040 ctcaaattta aagcctaagg ctatagataa tacatcttgg gaatgtcttg agcaagtttg 2100 gacgacaatt aacctcggtg atcgtaagct gttcgtgtgt gctgtgtaca tcccacctga 2160 tcgcacacgt gacatcgagt atatcgaagc ccactgtaga tctgtacacg ccattctcga 2220 gacggctagt gccgttgatg agattgttgt gctcggagat ttcaatctac ctagtatctc 2280 gtggatttca tcagacaggg gtttcctcta cccggacact aatcactccc aattccaccc 2340 tggagcggtc actcttcttg atagctacag cactgccact ttacaacaga tcaactctgt 2400 caccaatgag aacgaccgtt atttggacct atgtttcgtt agtgatttgg attcgtctac 2460 atcagtagtg atggctcctt gccctctcgt taaaatggtt gcgcaccatc ccccattact 2520 cgtaacgatt aagcactcct tggtgcacga ttccattaat tcgtccgcca ccttttccta 2580 cgatttccat aaagcggatc atcgcagtat cgctgaattg ttcacgactc tcgactggga 2640 aaatgttcta gatcttaata atatcgagtc tgcggctcag actttctccg ctattttggc 2700 gtacgcgatt gacaggcacg taccaaaaaa gagccaggag cacaaaaccg gacctccttg 2760 gcaaaccact tccctgcgac ggttaaaaac gtttaaaaaa gctgccttgc ggcggtttac 2820 taaacaccgc acagttccac taagaattgc ttatgttagg ctcaatcata agtataaata 2880 tgccagtcag cggtgcttct ccagatatca gcacaatatc cagcgaaatt tgaagtcaaa 2940 cccaaagcgc ttctggaagt tcgtcaatga gcaacgaaaa gtatcgggac taccatctgc 3000 tatgaggttc aacagcatcg aaggcacgac tccccgcgaa atttgcgaca tgttttcgga 3060 aaagtttgca agtgtgttct caagagaaga tttaactgcg gatcaagttg agctcgccgt 3120 tagcaacgtt ccccggtcat ctcatgcgat tggaagtttt gatatcgatg aaactatgat 3180 taccaaggcc tcatctcaga tgaaatcatc tttcaacccc ggaccggatg gatttccttc 3240 agttcttcta aaaaagcaca ttgacgttct agttactccg cttcttttca tctttcgggc 3300 atcgattagc agcggtgttt tcccatcttg ctggaagatt gctcacatgt acccagtgta 3360 caaaaagggc gataaacgtg acgtcaataa ttaccgtgga atcacatccc tatgtgccgt 3420 ctcgaagctg tttgaattgg ttattatgga acccctgaag gctcattgtc agcagcaact 3480 gagcgaggat caacacggat ttatgccagg gcgatcaact gcatccaacc ttatttgcct 3540 tacttcctat ataatggata gtatggttcg ccgtaatcaa acagatgtga tatacacgga 3600 cctaactgca gcctttgata tggtgaacca tgacatagca atcgccaaac tagacaggtt 3660 tggaatcaat ggcaggtttt tgcaatggtt tcgatcatat ctgacaggtc gagaattgaa 3720 ggtcgttatt ggtgattgtc agactgcttc gttcatgtct acgtcaggaa tccctcaagg 3780 aagccacttg gggcctttaa tttttctgct tttcttcaat gatgtacacc gtcagataaa 3840 ggtcccccgg ttgtcttatg cagacgatct taagattttt cttgaaatcc actctatcga 3900 ggattgcgac cttctccaac aacaactctc tagttttgca gattggtgta ctatcaatcg 3960 catgattgtt aatcccacga agtgctctgt gatcactttt tccagaaaga gaagtccaat 4020 ccatttcagt tatcagctca tgggtgctga aatcgaacgt gtcgaccacg tgaaggattt 4080 gggagtggtc ttagacacga ccttaacttt taatcaacat gtgtcttacg tggttggtaa 4140 ggcgtcacgt gttctaggat gcatttttag gatagctaaa aactttactg atgtgtattg 4200 tcttaaatca ctgtactgtt ccttgtcacg ttctgttcta gaatactgct cggaggtttg 4260 gagtccaaac tactccaatg gagtcgagcg tatcgagtct gtgcaacggc gattcttacg 4320 cttcgccctc cgtaggctac cgtggagaga ccctttccgc ctccccagct acgagaaccg 4380 ttgccgattg atcgatcttg aacccttgca cgttagacgc aatactgcca gagctttgtt 4440 tatcgctgac tctcttcaag gtcgacttga ttgccccact attttggagc aaatcaacat 4500 aaatgttgca ccccgtacac tgcgcaacaa cctcatgctc cgcttaccgt tgagacgaac 4560 caattatagt atgcacggag ctattaatgg tctgcaaaga acatttaatc ggatagcctc 4620 agaatttgac tttgacactc cccgacgcac acttcgccga cgtttttcaa gttttttttg 4680 caaacgccgc tggtgatagt tgaatgtttt gtgtagtatt gacttgtgtt cactttaagt 4740 ttaatttgac cttttgttag tattaagact agaattaagt tgacatcatt ggggctgttg 4800 attgcctgtt gatgtattaa agaataaaga ataaagaata aa 4842 // ID Crack-32_AAe repbase; DNA; INV; 4194 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-32_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4194 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1248-1248 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 53..460 FT /product="Crack-32_AAe_1p" FT /translation="MKNEEVVEAKPLAVSSTNSSPPIVVIFNSEEAENVFF FT DGKRKHGTXMVSEIAGSFSGVTSRVTIRDAMTTYARELLKEAREKQEMLGM FT KFIWPGRSGKILMKRHEGSNVEQILTKQQLHEFIKSGNSGSGNPKHHS" FT CDS 527..3406 FT /product="Crack-32_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MANYFYNDFSELNKNQLNVMNCRFKLKILQLNIRGMN FT NLYKLDRIKEVLATIRGNIDVLVLGETWIKEDRKRIYSINGYKSLFSCREG FT SQGGGLVIYVRESIAYREISNEHCNGFHHLQIHLDVAGSPFVLHAVYRPPS FT YNTNDFFSKLESMLASSRSYVCAIIGDINIPINISTCSIVQEYLSLLNCYN FT FTPTNTYPTRLASSNILDHVICSEALQSCVVNETVYCEISDHCYVLSTFSL FT LKPLSEKVLKTTIVDHMRLNNAFVASVHQMPQGSAEDKLKYVIQSYNDCRD FT RFSKIVTVKARIKGFCPWMTLNLWKWIRLKENYIKRSRRYPTDNEAKRMLD FT HVSRRVQKEKEKAKRAYYXTMFNGSSQKEMWKNLNRTLGNXNKSNDDVILQ FT IDGQDIPTGEGVANHFNKFFTSIGPQLASSIRSDMDVNKYNTLQRLPESLF FT LRPTTEQEIILKIQKLDANKCAGPDGIPAAFVKSHHQIFSNLLRDVFNDCV FT STGHFPDFLKVAKVTPIHKSGSRADVNNYRPISVLSVLSKILEKLLVDRLV FT DFLHVHHVLYNNQFGFRSGSSTLTAANELVDDIYEAMDTRRIMGVLFLDLK FT KAFDTINHELLLKKLEFYGVRGTCNALIRSYLSGRTQYVSVNGSRSTLSSV FT QVGVPQGSNLGPLLFLIYINDLANLKLYGKPRLFADDTSLSYKAGDPNDII FT QQMKSDMELLHGFFNENLLSLNLSKTKYMIFHTSRLRVASHPDLLVNSTKI FT EKVSSFKYLGLVFDSNLKWNDHIRKLHMEISSICGIMWRLSTILPHKQLLT FT LYHAFVQSKLTYIVSIWGAASQTNLRKLQTLQNRCLKIVYRKPRLYPSVDL FT YKNSALSILPIAALRMQQNVTQLHNLLFNPIVHHNQDLSRAPHRYNTRNQS FT DLLLQRSNTDAGKKRFAYCGKKQFNDLPDYLKDEQNVQRFRKNLKLYMKSN FT IGQYI" XX SQ Sequence 4194 BP; 1310 A; 896 C; 840 G; 1141 T; 7 other; tgaaaatgtt gtggatattg tgttcagtat aggaagagct tgtgattgtc cgatgaaaaa 60 cgaagaggtc gttgaggcta aacctctggc cgtctcatct acwaattcga gtccgccaat 120 agtcgtcata tttaattcgg aggaagccga gaacgtgttc tttgatggta agcggaagca 180 cggtacgktg atggtatcgg agatcgcagg aagtttttct ggagtgacaa gcagagtaac 240 catccgagat gcaatgacca catatgcaag agaactactg aaggaagcca gggagaagca 300 agaaatgtta gggatgaagt ttatctggcc aggaagaagc gggaaaatct taatgaagcg 360 tcatgagggt tctaatgttg agcaaatcct gactaagcaa caattgcatg agttcatcaa 420 gtcaggaaat tctggatcwg gaaatcccaa acatcattca taagttgaag cataaaaact 480 gccttaaatt aagtcgacaa ctgataaacc ggaactcctt attgtaatgg ctaattattt 540 ttataatgat ttctctgaat tgaataaaaa tcaactgaac gtaatgaatt gtcgwtttaa 600 attaaaaatc ctgcagttaa atatacgagg tatgaataac ttgtataagt tagaccgtat 660 taaggaagta ttggcaacga ttcgtggaaa tattgatgtt ttggtgttag gagaaacatg 720 gatcaaggag gatcgtaaaa ggatttattc tataaatggt tacaaaagtc ttttctcgtg 780 tcgtgaaggc tctcaaggag gtggtttggt tatttatgtt agagaatcaa ttgcatatag 840 agaaatatcc aatgaacatt gcaacgggtt ccatcatctt caaattcacc tggacgtcgc 900 tggatcgccg tttgtgctgc atgctgttta ccgtccacca tcctataaca caaatgattt 960 cttctctaaa ttagagtcta tgcttgcatc atcgaggtcc tacgtctgcg cgattatcgg 1020 ggacataaat attccaatca acatatcaac gtgcagcatt gtgcaagaat acctaagtct 1080 gttgaattgt tacaacttca cacctacaaa tacataccca accaggctgg caagtagcaa 1140 catattggac catgtaatat gttcagaagc tctgcagtcc tgtgtggtta atgaaactgt 1200 atactgtgaa ataagtgacc actgctatgt cttgtcaaca ttctcgctgc ttaagccact 1260 gtcggaaaaa gttttgaaaa caactattgt ggatcatatg cggctgaaca acgcctttgt 1320 tgcttctgtg caccagatgc cgcaaggctc tgcagaagac aaattgaaat atgtcataca 1380 gtcatataac gactgccgcg acagattctc aaaaattgta acagtgaagg caaggatcaa 1440 aggtttctgt ccatggatga ctttaaacct ctggaaatgg atacgattaa aagaaaatta 1500 tattaaaagg tcccgtaggt accctaccga taatgaagct aaacggatgc tcgatcatgt 1560 gtcaagacgc gtgcagaagg aaaaagaaaa ggccaaacgt gcgtattatk taactatgtt 1620 caacggtagc agccaaaaag aaatgtggaa aaatctcaac agaacacttg ggaatwmcaa 1680 taagtcgaac gatgatgtta ttctccaaat tgatggacaa gacattccaa ctggagaagg 1740 cgtagcaaac catttcaaca aattcttcac ttctatcgga ccccagcttg catcatctat 1800 caggagtgat atggacgtga acaaatataa cactcttcaa aggctgcctg aatcactgtt 1860 tctacgtccc actactgaac aagaaatcat tttaaaaatt caaaaattgg atgctaataa 1920 atgtgctgga ccggatggga tacctgcggc atttgttaaa tctcatcatc aaatcttctc 1980 aaatttgctt cgagacgtat tcaacgactg tgtgtcaaca ggacacttcc cagacttcct 2040 gaaagttgca aaggttacac ctatacacaa atccggtagt cgagcagatg ttaacaacta 2100 tcgaccgatt tcggttctct ctgtgctcag caaaattttg gaaaagttgc ttgtggatag 2160 attagtggac tttcttcacg tccatcatgt actgtacaac aaccagtttg gctttcggtc 2220 aggttctagc actctcactg cggcaaacga gttagttgat gacatttacg aggctatgga 2280 tacacgaagg attatgggtg tacttttcct agacttgaag aaggcgttcg acacaatcaa 2340 tcacgagttg ctgcttaaaa aacttgagtt ctatggcgta cggggaacat gtaatgcgct 2400 gattagaagt tacttatctg gtcgaaccca atatgtttcg gtaaacggat ctagaagcac 2460 cctgtcatca gtccaagtag gtgtaccgca gggtagcaat ttaggccctc ttttgttctt 2520 aatttatata aacgacctcg caaacctcaa gttatacgga aaaccgcgcc ttttcgctga 2580 tgatacttcg ttatcatata aagctggaga tcccaacgac ataattcagc aaatgaaaag 2640 tgacatggaa ttgttacatg ggtttttcaa tgagaatttg ctatctctaa accttagcaa 2700 aacaaaatac atgatatttc atacatctcg actccgagtt gcatctcatc ctgatcttct 2760 tgtcaactca acgaagattg aaaaagtctc aagtttcaag tatctgggcc tggttttcga 2820 ttcaaatctc aaatggaatg accatatacg taaactacat atggaaataa gttctatatg 2880 tggtataatg tggagattat caactatttt accacacaag caacttctca cgttatacca 2940 tgcattcgtg caatccaagc tcacgtacat cgtatctatt tggggagctg ccagtcaaac 3000 aaatttacgg aaactacaaa ctctacagaa ccgctgtctg aagattgtct accgcaaacc 3060 gcggctttac ccatcagttg acctttacaa aaactctgca ttgtcaattc taccaatcgc 3120 cgcactccgt atgcagcaaa atgtgacgca attgcacaat ctgctgttca atcctatcgt 3180 gcatcacaat caagatctat ctagagctcc gcatagatac aacacaagaa atcaatcgga 3240 cttgctgctg caacgctcaa atacagatgc agggaaaaaa cgattcgcat attgtggcaa 3300 gaaacaattc aatgatttgc cggattattt gaaggatgaa caaaatgtac agcggtttag 3360 aaaaaatctg aaactttaca tgaaaagtaa cattggtcaa tatatatgat ggaacgtatg 3420 caaaacaatc tctatggcaa ccacttctgc ctctgaatcc tgctcaagtt tccagccgaa 3480 tctctaaact gcatcgccgc tctgtatccg tgcgcaaact cagtccccgt ccaacagaca 3540 acttctgcat tcactcacct actggtgaca tgggacgcat ctatcaggca tctcgagagc 3600 ccctgctcac tccaagctac agcaacccat ctccacatct gcatcgaaaa catctatcag 3660 gcaccttgag agctcctgct cgctccaacc tacaccaact catttccgca tctgcatcgg 3720 aagtgcattc atcgcagtca tccgtcgttc ccggttgcat ggtttggaga gagagtaggg 3780 gtctctcagc ctgcaactgc aggcaagtaa cgtgaagttt cctcatcttt ctgcctctga 3840 atcctgctcc agttttcagc cgaatcttga actgcatcgc cgctccgtac tcatgcgcaa 3900 actcagtcac cgtacaacag acaacttctg cagtcactca cctcctggtc acatgggacg 3960 catctatcac ctcgtcaatt tcgtcaattt cagcagatca ttactttagt gaaaagtatt 4020 tcaatgtagt ggcaatggca cttccttcaa agagactaag tctcactgga agtgctaatg 4080 aattgtaaat tacttagaaa agaagaggag gttttatgcc tttcggagaa gaggttcaaa 4140 aggaacttca ctccgagagg cttttccctg ctccaaataa aacaaataaa aaaa 4194 // ID BEL-96_AA-LTR repbase; DNA; INV; 441 BP. XX AC supercont1.22; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-96_AA_; KW BEL-96_AA-I; BEL-96_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-441 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.22; Positions 2752618 2753058. XX SQ Sequence 441 BP; 115 A; 122 C; 79 G; 125 T; 0 other; tgttagatat taaattgaac taccactgta tctgaaatct taccccgact gaaacatttc 60 agcctcgcac ctctttctat tattcactta tgccattgaa atacagtttt agtttagaac 120 agagtactga tcacatgtgc aaagtctccg tggtttatcc gaaaatcgtc atttttgcca 180 gcaattctcc catcccgaag attaccacgg ccgaatggaa ttctcgttgt cggtttatgt 240 gatagtgcca aaatacgtcg atctgctaga ttgaagcgat attctcccaa ccgtttgcct 300 ttccgcgacc cattgtcgct gaggttgatg tgaacctgta tcgagcaaac ggccgcatca 360 caccccgtaa gaacccgttc ccgtcctcgc tgtggccccc aaaacatcaa agtcaattaa 420 ccacccgtcc agtgcccatc a 441 // ID BEL-71_AA-I repbase; DNA; INV; 6581 BP. XX AC supercont1.155; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-71_AA_; KW BEL-71_AA-LTR; BEL-71_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6581 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.155; Positions 497027 490447. XX CC Positions [5629-6192] - Integrase core CC 'TTACC' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 210..4340 FT /product="BEL-71_AA-I_2p" FT /translation="MTMSSRTLRSKRSDNPNQVVGAKVSAKGGVQVVVTED FT PDKSYPVRTCQVCRGMDTDEMVQCDDCQKWHHFQCVGVTQQIENFPWSCAR FT CEAAKGVQEPSSTAGALQRSGSSSKRLPEANSNPVLVSHQQVHAQMQTSTA FT ELYSAAEDSIQPEVTGQVPFVSSLRWVVNTSREPSRNASRVSSVSSSRSSQ FT ALAKLKLQKLEELRIIERREAEQQRITAAEEARKEKVFLEQKYQLLEQAMS FT ESWSSKDDKASRTEQWVASASANCSLLKPPNAVNQFALTERLGKLQVDPLT FT NARIGSSRSSQPEPNPPDPFPVPESMLAYSVDPPAPDRMQHMALGTTSHKP FT LCDQSRLTSSMGVPLRVSITQIPRPCEYREDLRSNQPADPPRPMSLPAVSL FT QTRDSATRYPGSHPVHQPSQNVSTPDYQQFTPHMNRAPHSSTMRTSRAYDQ FT EEDMDPCPISRKQLAVRQAISRDLPTFSGCPEEWPLLLSTFNSTTTMCGFT FT NEENIVRLQRSLKGRAYEAVKSRLLHPSNVNGVMSTLKMLFGQPEIIVDSM FT MSKINSLPPLKEDKLETLVDFAVSVENFCATVDACGLEEYLYNISFLHQLV FT NKLPPSIKLNWAQYRQILPIANLPSFSSWLYSLAEAASAVIIPNVAFESKP FT NRSDPRAAKKINSFVNAHSEDTTSGYHTAAQFNAKIEANCCPVCKESCKSI FT ATCKRFLEFSRDSRWATVRDLGLCRRCLRRHKGGCQSKLCGRNGCELKHHE FT LLHNDQKEAPPSNRSNSQNTSSSTQHQVSITSPPPNPSDHGCHTHRLTSSQ FT VLFRYLPIVLHGKQRSIQTFAFLDDGSELTLVDEELVNELELEGEPMPMFL FT HWTGGAKRREEGSRSVKLQVSAKHNESKIYAMNGVRTVAELLLPSQTLNFQ FT ELSTRYSHLKGLPIDSYQDVRPRILIGMKDQHLTLVQKSREGTLHQPIAVK FT TRLGWTVCGGGDQENSANLVHSVFHVCACDSPTDDDLHRRMKEYFTLDSLG FT IAYPVKKTLRNAEEERALTLLEARTVFKGDRYETGLLWRHDDLRLPDSRPM FT ALRRLQCLKKRMNKDPKLAEVLNSKISEFVTKGYARKLSDKELGQIYPRVW FT YLPIFPVTNINKPGKIRMVWDAAATAYGVSLNSVLLKGPDQLCELFSILIQ FT FREGRIALTGDVREMFLQVLMRPVDQQCQRFPWYEEDGTLSVYILQVMSFG FT ACCSPCSTQYVKNLNAERFKNDYPTAVEVIQKRHYVDDMLVSVDTEEEAIQ FT LAQQVKRVHSEGGFEIRNWISNSKHVIQALQECQTEEKNLDVSSEIATEKV FT LGMWWCTNTDTFTYKVGWNRYGKALLEGQHCPTRRQMLRVLMSMFDPLGLI FT SQFLVYLKILLQEV" FT CDS 5008..6291 FT /product="BEL-71_AA-I_1p" FT /translation="MTHYASPDAVIRVEEYSNWKRIVKIVALLYRFASNCK FT RQLQRKPKMVGPISNLEFKTAESYLFRQAQQETFAEEIAHLRKPQDYLEPP FT VPAIPKSSSLYHKSPWMDECGVLRMRGRISACEYATEDAKHPIILPRDHHT FT TKLIVAHYNQKYHHQNHETVVNEIRQKFSIPHLRSIYAKMRKNCQRCKNDR FT AVPRVPIMADLPAARLDAFTRPFTHVGVDFFGPYEVVIGRRAEKRWGMLAT FT CLTIRAIHIEVVHSLSIDSCIMALQNFIARRGKPRTFFSDRGTNFIGARRV FT LQETENAISQEELMKEFMDADTNWKLLPTASPHMGGSWERLIGSVKKKNLM FT AILPARKLTDEVLRNLLTEIENIVNSRPLTHVPIDDDSAPALTPNHFLLGS FT SSGLKPLSNADGSGVVLRQSWLLSQIQANQFWKR" XX SQ Sequence 6581 BP; 1894 A; 1587 C; 1533 G; 1567 T; 0 other; atttattatt tccgtttact gcaaaaaatc cggtcatgtc ttttataacc tgtaaataaa 60 aaataaaata atattttaat cagtccacag tccgggaagt ctcccctttt gcgcttacgg 120 cagccagtat ttatagctca aaaatactgt aggtacgatc aagtaaaagg taccagagtc 180 ataaggaata tcacttgaga taggtcgaca tgacaatgtc aagtcgtact ctacggtcaa 240 aacggtcaga taacccaaac caagtggtcg gtgcgaaagt tagtgcaaag ggtggtgtgc 300 aagtggtggt cacggaagat ccagataaat cctatcccgt gcggacatgc caggtgtgcc 360 gaggaatgga cacggatgag atggtgcaat gcgatgactg ccagaaatgg caccacttcc 420 agtgtgttgg agtaacgcag cagatcgaga attttccctg gagttgtgcc agatgcgaag 480 ccgcaaaggg tgtccaggaa ccaagttcga ctgcaggtgc tctccaaaga agcggtagtt 540 catcaaagcg gctgcctgaa gcgaattcga atccggtgtt agtttcacat cagcaggtcc 600 atgctcagat gcaaacgagt acagctgaac tttattctgc agcagaagac tctattcagc 660 cggaagtcac cggtcaagtt ccgttcgtgt cttctttgcg atgggtagtg aatactagca 720 gagaaccatc tagaaatgct tcgagagttt cttcggtatc gtcaagtcgt tcttcacaag 780 ctcttgccaa gctcaaacta caaaaactgg aggagttgag gatcattgag cgacgtgaag 840 cagaacagca acgaataacc gcagcggagg aagctagaaa ggaaaaagta tttctggagc 900 aaaagtacca attattggag caagccatgt ccgaaagctg gtcctctaag gacgataaag 960 cgagccgaac cgaacagtgg gtagcaagcg caagcgcaaa ttgtagccta ttaaaaccgc 1020 caaacgcagt caatcaattc gctttgactg agcgattagg taagctgcaa gttgatccac 1080 taactaatgc acgaatcggg agtagccgtt catctcaacc agaaccaaat ccaccagatc 1140 catttcctgt accggagtcc atgttagctt actccgtcga tcccccagcg ccggatagaa 1200 tgcagcatat ggcattaggc acgactagtc acaaaccact gtgcgatcag tctcggttaa 1260 cttcgtccat gggagtcccg ctccgggttt caatcacgca aattcctcgg ccctgtgaat 1320 atagagaaga tttacgctcc aatcagccag ctgatccacc acgaccaatg tcactgccag 1380 cagtgtcact ccaaacgcgt gattcagcaa ctcgatatcc tggatcacat ccggttcatc 1440 aaccatccca gaatgtatcg actccagact atcaacagtt cacgccacat atgaatcgtg 1500 ctcctcattc ttctacaatg cgaactagcc gtgcctacga ccaggaagag gatatggacc 1560 catgtcctat ttctcgaaaa caactagcgg tcagacaagc aatatcgagg gatctgccca 1620 cattctctgg atgccccgag gaatggccat tgttactttc tacctttaat agtacgacaa 1680 caatgtgtgg tttcaccaac gaagagaata tagtccgtct gcagcgaagc ttgaagggtc 1740 gtgcgtatga ggcagtcaag agcagattgt tacatccatc aaatgtcaac ggagtcatgt 1800 cgacgctcaa gatgttgttt gggcaacccg agataatcgt agattcgatg atgtctaaaa 1860 taaattctct tccgccactt aaggaggata aattggaaac ccttgttgat tttgcggtca 1920 gcgttgaaaa tttctgcgca acagtggacg cgtgtggttt agaggaatac ctttacaata 1980 tatcctttct ccatcaactc gtcaacaagc ttccaccgtc gattaaattg aactgggcac 2040 agtataggca aatccttccg atagccaacc tgccttcctt cagcagctgg ctgtactcac 2100 ttgcagaagc agctagtgcc gtcatcatcc cgaatgttgc cttcgaatct aagcccaatc 2160 gtagtgaccc acgggcggca aaaaagatca attcattcgt taatgctcac tccgaggata 2220 ccacgtcagg atatcatacc gcagctcaat tcaacgcaaa gatagaagcc aattgctgtc 2280 cggtatgcaa ggagagttgc aaatcgatag ctacgtgtaa acgatttttg gaattttcac 2340 gtgattctcg ctgggcaacc gtgcgggatc tgggactctg ccgcaggtgt ctacgtcgac 2400 acaagggagg atgtcagtca aaactgtgcg gaaggaacgg ctgcgaactg aaacaccacg 2460 agctactgca caatgatcag aaagaagcac caccgtcaaa ccgttcaaac tcccaaaaca 2520 cttcttcgtc gactcagcat caagtctcca ttacatctcc accaccaaat ccgtccgatc 2580 atgggtgtca cactcatcgt cttacatcaa gtcaagtttt attccgctac ttgcctattg 2640 tattgcatgg aaagcaacga tcgattcaaa cgttcgcctt tttggacgac ggatcggagc 2700 tcacattggt cgacgaagaa ctggtcaacg aattagagct tgaaggagag cctatgccaa 2760 tgttcctgca ttggaccgga ggagctaaac gccgagaaga aggatcaaga agtgttaagc 2820 tgcaagtttc tgcgaaacac aacgagtcga agatttacgc catgaacggt gtgcgaaccg 2880 tagcagaact gctgcttcct tcacaaacgc ttaactttca agagttgtca acacggtatt 2940 cacaccttaa aggcttaccg attgattcct accaggatgt tcgaccgaga attctaatag 3000 gaatgaagga tcagcatctt accttagtcc aaaaaagccg cgaaggaact ttgcaccaac 3060 cgatcgccgt taaaacccgt cttggatgga cagtttgtgg gggaggagat caggaaaatt 3120 ccgctaatct cgttcactct gtatttcatg tctgcgcatg tgactcaccg acagatgacg 3180 accttcacag aaggatgaag gagtacttta cattagacag cttgggaatt gcatatccgg 3240 taaaaaaaac tcttcgcaat gcagaagagg aacgagcttt gactctcctt gaagcccgta 3300 cggtgttcaa aggagatcgg tacgaaacag gacttctttg gcgccacgac gatctacgtc 3360 ttccagacag tcgtccaatg gcacttcggc gtttgcagtg cctcaagaag cgaatgaaca 3420 aagatccaaa gctagctgaa gttctcaatt caaaaatatc tgagtttgtc acgaagggat 3480 acgccaggaa gctcagtgac aaggaactag ggcaaatata tccccgggta tggtacctgc 3540 caatatttcc cgttaccaac atcaacaaac ctggtaaaat tcggatggtg tgggacgcag 3600 ctgcaaccgc ctacggcgta tctctgaact cagtcttact taaaggacct gaccaactat 3660 gtgaattatt ctccatacta atccaattcc gcgaaggacg catcgcctta acaggagacg 3720 ttcgagagat gtttttgcaa gtactcatgc gtccggtaga ccagcaatgt caacgttttc 3780 cgtggtatga agaggatgga acactctctg tttatattct gcaggtaatg tctttcggag 3840 catgctgttc cccctgcagc acccagtacg tgaagaatct gaacgctgaa cggtttaaaa 3900 acgactatcc gacagccgtt gaagtgattc agaaacgcca ctatgtcgat gacatgcttg 3960 taagcgtaga cacagaggaa gaggcgatac agcttgccca acaagtgaag cgtgtacatt 4020 ccgaaggcgg attcgaaatc cgcaattgga ttagtaactc caaacatgtc atacaggctt 4080 tgcaggagtg tcagacagaa gagaaaaatc tagatgtatc atcggagata gcaacggaga 4140 aagtactcgg aatgtggtgg tgcaccaata cggacacatt cacctataag gtgggctgga 4200 atcgttatgg taaagcgcta ctagaaggtc aacattgtcc aactagaagg cagatgctgc 4260 gcgtcctgat gtcaatgttt gatccacttg gattgatttc tcagtttctg gtgtacttaa 4320 aaatcctgct acaagaagtt tagcgttctg gtgttgactg ggatgataaa atggacggcg 4380 ttctgttcga aaagtggcaa acgtggctta gagtactccc acaagtcgaa cacttacaaa 4440 ttcctcgttg ttacttctat caacatactg ctgcctgcgg gaccgtccaa ctccacacgt 4500 ttgtcgacgc tagtgaaaat ggaatggccg ccacttgcta cttacgtttc atccgtgaca 4560 atgtcgtaga atgtagttta gtggccgcta aaactagggt agcaccgctg aaattcctct 4620 ccattccgag gctggatctc caagcagcag tgatagggac gagactagcc cgatcagtat 4680 ctgaagccct cttaatccag atttcacgcc gaatgttttg gtcggattca caagacgtac 4740 tatgttggat caattctgac catcgccgat tttcgcaatt cgtggctttt cgagttagcg 4800 agatcctaga cacaacagag atgtacgaat ggatgtacgt ccctaccgac ctgaacgttg 4860 ccgatgaagg aactaaatgg aaaggattac ctgatctcac tcctcaatcg agatggtata 4920 tagggccaca attcctttat cactcggaag aagattggcc tcagtcgtcg aagagtagca 4980 acactgcaga ccaagagctt gagcgttatg acacactacg cctcgccaga tgctgtgata 5040 cgtgttgaag aatactccaa ttggaagcgg attgtcaaga ttgttgcttt actctaccgc 5100 tttgcaagta actgcaaacg gcagctacaa aggaagccta aaatggtcgg tcccatttcc 5160 aacctggagt tcaaaactgc agaatcgtat ctctttcgcc aagctcaaca agaaaccttt 5220 gcagaggaaa tcgcacacct cagaaaacct caagattatc ttgagccacc tgttcctgca 5280 attcctaagt caagttcact ttatcataaa tcaccctgga tggatgagtg cggagtgtta 5340 agaatgcgtg gtagaatcag tgcctgtgaa tacgcgaccg aagacgcaaa acacccaata 5400 atacttcccc gtgaccatca taccaccaag ttaatagttg cacattacaa ccaaaaatac 5460 caccatcaaa accatgagac cgttgttaac gaaatccgtc agaaattcag cattccgcat 5520 ttgcgctcga tctatgccaa gatgagaaag aactgtcagc gctgcaagaa tgaccgagca 5580 gttccacgtg tgcctattat ggccgatcta cctgcagcac gcctagatgc tttcacacgc 5640 ccattcacac atgtaggtgt cgacttcttt gggccgtacg aggtagtcat cggtcgtcga 5700 gcagagaaac gatggggaat gctcgccact tgtctcacaa tccgagcgat tcacatcgaa 5760 gttgtacact ccctaagtat agactcgtgc atcatggctc tgcagaattt cattgcaagg 5820 agaggtaaac cgcgtacatt cttcagtgac cgaggcacca acttcatagg tgcaaggcgg 5880 gttctccaag aaacagagaa tgccatcagt caagaagagc tcatgaagga atttatggac 5940 gccgatacta actggaaact tctgcctaca gcttcaccgc atatgggcgg tagctgggag 6000 cgtctaatcg gcagcgtcaa aaaaaaaaat ttgatggcga ttctgccagc taggaaactt 6060 acagatgaag ttctgcgcaa cctattaacg gaaatcgaaa acatcgttaa ctccagacct 6120 ctaacacatg ttccgattga cgatgattca gcgcccgcct tgaccccgaa ccacttcctg 6180 cttgggtcat ccagtggctt gaaacctttg agcaatgcgg atggcagtgg tgttgttcta 6240 cgtcaaagct ggctcctatc tcaaatccag gccaaccaat tttggaagcg ataggttacc 6300 gattaccttc cagagataac ccgccgcacg aaatggttca tgcataccaa gcctatcgaa 6360 atcaacgatg ttgtggtgat tgtcgatgcc aaatctccac gaaattgctg gtcgaagggt 6420 cgaataatca acattcaggt cggcagagac ggtcaaccta ggtcagcaac cgtaaggact 6480 gcggtcggga tctacgagcg accagtggcc aagttggccg ttttagacgt acggcgcgaa 6540 gctgagtaag ctggccaggt ggccagcgta cctgggggga g 6581 // ID Gypsy-11_DWil-I repbase; DNA; INV; 4597 BP. XX AC scaffold_181036; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_DWil_; KW Gypsy-11_DWil-LTR; Gypsy-11_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4597 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181036; Positions 40685 36089. XX CC Positions [3674-4147] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 543..2531 FT /product="Gypsy-11_DWil-I_2p" FT /translation="MLVKLSTLQVAELDIILQTNNIHVNGNKQAKIDEITT FT ILGTDQINTDEYDFNQQNSTMQRQMDELKQMVADLSQAVSTINVRTQNLMT FT ERQERANEPARSEENHQHDVVEQQSMGQRISPRDYKSNTSIKDVIGMLPEF FT DPIKGAVNSQQFLDKVEQLQIVYEWRDATILFAVQQKLRGVAKDWLDSQRL FT YQTWSQFKDALLKDFPSVVNISDVYRQMMRRKRKHNETLIEYFYSMMAIGR FT KGNIDDKSINSYIINGLNQQESTKALLAMNLCTCAELFRSLENMNSSSVWQ FT SYRTAEYASSDTAKTMEVNKDNNAKGPKCFNCNNIGHIAAKCPVESKKPRC FT SFCSKIGHEGKDCRLKRSTVSKVDSIKDKKRPPILKKILIEGHEYEAFVDT FT GSDNTFMQKSQVPIDAVLQVMTNSFRGFGGGIVESKECLLTEIVWDNKRID FT VCIYVVQDKELNYAVLLGRDILCAEDETKVETKSTNSPTEVKCEFDIGAEV FT DDPQRKQVSDLLNAYTECFAEDFSNIGRCKSTKMEIKVTTTNPIVGRRYQV FT PFAKRDALRTVVDELLKYNIIQRSTSPHAASSILVPKPNGEHRLCVDFKTL FT NAVTVKQHYPMPVVEEQLAKLAGNHFFTTLDMTSGYYQIEMSNESNESNVW FT ILSNRNEQRLNPRD" FT CDS 3221..4597 FT /product="Gypsy-11_DWil-I_1p" FT /translation="MSRQPVLKEQSSIMETDGIFKVIPIDDDWVYTMQKQD FT PKLKQIFEIFQMKDKSADWKQIKNDYVISKSRLLRKTDKGDKLVDPQAVRW FT RITKSNHDDVGHYSAQKTMERIGRYFWFPRMRKFIKSYIDSCPECCINKIK FT GGKPDGEMHLQDVIPMPFRTINIDHIGPFPKSKSGNIYILVIVYAFTKFTI FT LVATRNTKTIPVIKALSHMFGIFGQPLRIISDRGTSFTSKEFKDFVQNYGI FT QHIKTAVRTPRANGQAERMNKTLLNALKTSTYKNKDWDQQLFNIQWSMNTN FT VNSTTKFSPNDLVFNFNPKDVSYNRIIQAMTEDQFEETDVQLNRHQAAENI FT KAEQEKWKKRFDLKHAKPQKFEVDDLVVIDYVPMATGDTHKLDPAFKGPYV FT VTKVLGNDRYIVEDLPDRSVSQRRYSNVISSDHMKPWCVLIPGLDIEEEGY FT DYIGINEEESGEAG" XX SQ Sequence 4597 BP; 1730 A; 823 C; 979 G; 1065 T; 0 other; aatcagtagt gggatacgaa gacacacggc tttcggaaaa gacggaataa tttatttcat 60 aagtgatgaa ataaatttaa gaatccaacg agacaaggga aaaagacaac tgcaaaagtg 120 caaaacggtc acgatttccc gaaagagtaa aagagatatc tataggtcaa cgaaacagtg 180 gaagaaagac aaaacggcaa agaagcaaaa cggtcacgaa acttccaaag ctgtaaaaag 240 acaagggtac atacaagaaa aaaaaacgcg ggaaaagacg caacggcatt aatcgcaaga 300 cggtcacgaa ttcccaaagt aacggaaaaa ggtttgcacg gatttcatgg ggaggcaaac 360 atcgacgtag aaatctccac gagaaaggca atcacagaaa agacggaaaa agtccatacc 420 taactatcga atgtatcgat agcacttatc gataaaccaa tcgagggaaa ggtccagatc 480 taaaaagcac ggaaaaaaaa aaagaaaaga gacaattgac tagaaaaaaa aaagttaaga 540 aaatgttggt gaaattatct acactacaag tggcggagtt ggatattatt ttacagacaa 600 ataatattca tgtaaatggt aacaaacaag ccaagattga tgaaattacg acaattctag 660 gaacggacca aataaacacc gacgaatacg atttcaacca gcaaaatagt acgatgcaga 720 gacaaatgga tgaactgaaa caaatggtag cagatttatc gcaagctgtg tcaacgatca 780 acgtacgaac acaaaatctg atgacggaac gacaggagcg tgcaaatgaa cccgcaaggt 840 cggaggaaaa ccatcagcac gacgtagtcg agcaacaatc tatgggacaa cgaataagtc 900 cacgggacta caagagtaat acatcgataa aagatgttat aggaatgctt cccgaatttg 960 acccaataaa aggagcagtg aattcgcaac aatttttgga caaagtagaa cagttacaga 1020 tcgtgtatga gtggagagat gccaccatat tgtttgctgt acagcagaaa ctaagaggtg 1080 tagctaaaga ctggttggat tcacaaagac tatatcagac ttggagtcaa tttaaggatg 1140 cactactaaa ggattttcca agtgtagtta atatatcaga tgtttacagg caaatgatgc 1200 gccgcaagag aaagcataac gaaacgttaa ttgaatactt ctactcaatg atggcgatag 1260 gtcgaaaagg taatatagac gataaatcca tcaactcata tattatcaac ggattaaatc 1320 agcaagagtc aacaaaagct ttactagcca tgaatttatg tacatgtgca gaattgtttc 1380 ggtcactgga gaatatgaat tcttcatccg tatggcagtc ataccgtacg gcggaatacg 1440 catcgtcaga taccgcaaag accatggagg ttaacaagga taataatgct aagggtccaa 1500 aatgctttaa ttgtaataat attggacata tcgcagccaa gtgtccggtt gaatcgaaaa 1560 agccaagatg ttcgttttgc tctaagatag gacatgaagg aaaagattgc agacttaagc 1620 ggtcaacggt gtcgaaagtt gatagcataa aggataagaa gcgtccccca atcttgaaga 1680 aaatacttat tgaaggacat gaatatgagg cgtttgtaga cacaggcagt gacaatacat 1740 tcatgcaaaa gtctcaggta ccgatcgacg cagttctaca ggttatgacc aattctttca 1800 gaggttttgg aggaggaatt gttgagtcaa aggaatgcct gttaacggag atcgtttggg 1860 ataacaagcg aatagatgta tgcatctacg tagttcaaga taaagaactt aattatgcag 1920 tcctcttagg aagggatatt ttgtgtgcag aggatgagac aaaggtcgaa acgaagtcaa 1980 ctaattctcc gacggaggta aaatgcgagt ttgatatcgg cgctgaggtt gacgatcctc 2040 agcgcaaaca ggtgagtgat ctattgaacg cttatacaga atgctttgcc gaagattttt 2100 caaatattgg cagatgcaaa tcaacaaaga tggagatcaa agtaacaacc acgaatccaa 2160 ttgtgggacg gcgatatcaa gtaccatttg caaaaagaga tgcattacga acagtagtag 2220 atgaacttct gaagtacaac ataatccaac gaagcacatc accacatgca gcatcctcga 2280 ttttggtgcc caaacccaat ggagaacatc gcttatgtgt tgatttcaaa accctcaacg 2340 cagtcacagt aaagcaacac tacccgatgc cagtcgtaga agagcaattg gcaaagttag 2400 caggaaatca tttctttaca acgctggata tgacgtctgg atattatcaa atagaaatga 2460 gcaacgagag caacgagagc aacgtctgga tattatcaaa tagaaatgag caacggttaa 2520 atccgaggga ttaactctac gtccatctaa gtgcaagttc atgaaaaaag aagtcaaatt 2580 tctaggtcat attgttactg gtaaaggaat tcaaccagga aaaaaaaaga cacaatgtat 2640 agctgaatat ccgcagccaa cgaatgaaat agaaatacgg agatttttgg gaattactgg 2700 attcttccgt aaatttgttc caaattatag cataatagca caaccattga gtatgctttt 2760 aaagaaaaaa caaaacttta tatggacacc tgatcaaaat gaagctttca atcgtttaaa 2820 agaagctatt acaacggaac cagtcttaac gctatatgac ccaacgaaat atcatgaagt 2880 gcatacggat gcgagcacca caggaatttc agcaatacta tttcaacagg aagacgagga 2940 tatcaagccg gtattttact tcagtagact ttgtacggat tctgaaagtc ggtattgaag 3000 ccatgcactg gaagttttag ctatagccga atcattggaa agattccggg tataccttct 3060 tggatctcaa ttttcagtaa taacggattg caacgctgta gcaacattaa agaactcaac 3120 cgctcttcag ccgcctattg cgcgatggtg gctcagactc caagaaatcg acttcatatg 3180 taagcaccga ccgggcttag atttaccaca tgtagatgga atgagcagac aaccagtact 3240 taaggaacaa agcagcatta tggaaacaga tggcattttt aaagtcatcc caatagatga 3300 tgactgggtg tataccatgc aaaagcaaga tccaaagcta aagcaaattt ttgaaatatt 3360 tcaaatgaaa gataaaagtg ctgattggaa gcaaataaaa aacgattatg tcatttctaa 3420 aagcaggtta ctaagaaaaa cagataaagg cgataaatta gtagatcctc aagcagttcg 3480 ttggagaatt actaaaagca accacgatga cgtaggtcat tacagtgcgc agaagacaat 3540 ggaacgaatt ggtcgttatt tttggttccc gaggatgagg aaatttatta agtcatatat 3600 agactcatgt cccgaatgct gtatcaataa aataaaagga ggaaaaccag atggagaaat 3660 gcacttgcag gacgtaatac cgatgccatt ccggacaata aatatcgatc atattggacc 3720 atttccgaaa agcaagagcg gaaatattta cattttggtg atagtatacg cattcaccaa 3780 gttcaccata ttagtcgcaa cacgcaacac aaagaccatc ccagttatta aagcgctttc 3840 gcacatgttt ggtatttttg gacaaccatt acgtatcata tcagatagag gtacttcgtt 3900 tacatcaaag gaattcaaag attttgtgca aaactatggc atacaacata ttaaaacagc 3960 agtgcgcaca ccacgcgcca acggtcaagc tgagagaatg aataaaacac tactaaacgc 4020 gctgaaaaca agtacatata aaaacaagga ttgggatcaa cagttgttca acatacagtg 4080 gagtatgaat acaaacgtga attcaacgac aaaattttct ccgaacgatt tggtatttaa 4140 ttttaatccc aaagatgtat cgtataaccg aattatacaa gcaatgacgg aagaccagtt 4200 tgaagaaacg gatgtacaac taaatcgtca tcaagcggca gaaaatataa aagctgaaca 4260 ggaaaaatgg aaaaaaagat tcgacttgaa gcatgccaaa cctcaaaaat ttgaagtaga 4320 tgatttagta gtcattgact atgtaccgat ggctacagga gacactcata aattagaccc 4380 agcgtttaaa ggaccgtacg tcgtaacgaa agttttagga aacgatcgct acattgtaga 4440 agacctaccc gatcgttcag tatctcaacg acgttacagt aatgtaatat cgagcgatca 4500 catgaaacca tggtgtgttt tgattccggg tctagacatc gaagaagaag gctacgacta 4560 cataggaatc aacgaagagg agtcaggaga ggccgga 4597 // ID Gypsy-179_AA-I repbase; DNA; INV; 6319 BP. XX AC supercont1.143; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-179_AA_; KW Gypsy-179_AA-LTR; Gypsy-179_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6319 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.143; Positions 1046392 1040074. XX CC Positions [3474-4049] - Reverse transcriptase CC Positions [5236-5697] - Integrase core CC 'GGCCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 5062..6219 FT /product="Gypsy-179_AA-I_1p" FT /translation="MLQLAHEGHPGQSLMKRRLRERCWWPGIDQSAVDTCE FT SCEGCRLVQLPDPPEPMMRRQLPDRPWVDIAIDFLGPMPSAEYILVVIDYY FT SRYMELEVMSKITAQETIRRLRRIFRIWGPPRTITLDNAKQFVSCEFEEFC FT KANGIHLNHTSPYWPQANGEVERQNRSLLKRMKIANALYDDWKAELNSYLE FT LYNNTPHTVTGKAPSELLQNRKLRTKLPCIDDLQTMPPSSDFRDQDHERKI FT MGKRWEDVKRRAKPSTISIGDTVLMKNLCPTNKLSTNFHKEKFLVVNRQGS FT NVIVQSMETGKTYDRNTSHLKTVSDPVPITVNDPDTNLDDRSLPTTCVDPN FT IDEESHDGAVVSHSVPEPCESLLPSVQLRRSARTPKPKEIYSP" FT CDS join(1848..2789,2793..4631) FT /product="Gypsy-179_AA-I_2p" FT /translation="MDDPNQTERDVSNGSNGPIIQQHLSPFSPASYNLPHF FT KFKHLPQSEVRNAWGSWIRWFETVMAASNITDGPSRKMQLLAMGGLELQSA FT FYGIPGCDEQDDPLQDPYMSMKDKLSQYFSPKHHDSFERFLFWSMAPSDAE FT SIEKFALRVQQRAEKCSFGMTATESRQIAVIDKIIQYASGDLRQKLLEKEH FT ITLDEAIKIINAYQSVRYQSSKMNPKPIGSLANRPYEQIDRVNRLYESSAK FT KYGANPRCLRCGYTRHREAVHCPALNKTCLRCKKIGHFQTVCKTKSSFNSV FT GIFILITSKENFKVVIPSPHIYGTFDRKRKATIVNSQGPSHQSKYPRSYSR FT PVYQIDDQQNKEPKREEFPVYNVGESDELITCCIGGVDVTMLIDSRSKHNL FT IDDTTWEIMKLQDVNIRNERFDHDKRFLAYGRVPLNLITVFDADLEINDGG FT KLLKTSASFYVIEKGQQPLLGKITAQHIGVLQIGLPSSKDALISRVDTMKE FT PFPKMKGITLSLPIDRSVSPVIQPLRRCPIPLLQQVEAKLKELLQLDIIEK FT VTQPTSWVSPLVPIMKDNGELRLCVDMRRANQAIQRLNHPLPVFDDMLPKF FT RDAKLFTTLNIRQAFHQVELSEDSRDVTTFITNWGLYRYKRLLFGVNCAPE FT LFQNLMESILAECKNTVVFIDDILIFGSTEEEHDLAVKHTLSVLNRYGILL FT NIHKCKFKQTEISFLGHRLSPEGVAPADEKVQAILHFRAPQTKEELRSFLG FT LVTYVSRFIPNLATVNYPLRKLLKLETPFDWKQEHQESFDQLKRIIGSIQH FT LGYYDPKDRTLLITDASGVGLGAVLLQFKNNQPRVIGYASKSLSETEKKYP FT PIEKEALGIVWGVERFRMYLLGIHFELETDHRPLETLFTANSRPTARIERW FT MLRIQAFKFKVRVCLDLKHLCIV" XX SQ Sequence 6319 BP; 2036 A; 1141 C; 1440 G; 1702 T; 0 other; agatggcgaa aaagatggga tctggtgcag tgttgataat ttaaatttat ttacagaagc 60 gatttgcgag tgatttgagt gatttttatt ccaagtttca attaaccaag gtaaaaaaaa 120 gagagaagaa ataagcgtga tcatgtcaga atctacatag gatacattgt ttggtttggc 180 ttggactgaa attctcatta atcgtgagtt aaagtgaata aaattaaaat aaaacgaaaa 240 gcagttctga agaaaagaaa aaaaaataca aggctgtggc tgatagcgtc gggaaaaagc 300 tgattcattg aatttgttcg aaattgaaat gacgccaagt ccgccatgtt ggaaaaatgc 360 tgaaaaaagt gaaccgagcc tccgcgtgtt aatgcatgta aaaaaaaaaa atcagactag 420 cactgatagc atcctggtgt gtgttgagtc gctaaaggaa taatggctgg gtagatgttg 480 gtggtgaata acctttatcg tgaaataatg ctacatcttg gactgtattg gcatcggtag 540 tgtttggtgt gtgtggcagt attatttacg gcatgattgt ttgcaagttt aatctacgtg 600 tgcgggaaag gtgaggtcag aagagtgaga aaaatggaag tgagattttg gaatgtttgg 660 tgtgaagatg agttgtttgc ttctgtacag ctaaggaatg tgttaaattg tggtaagtca 720 aaagtaatac catatataat gcgtgcaaat ctgcaaagaa atcagtcagc atgaccaagg 780 taggtaccct ttatccgaaa aatggctgag acatatcaat taatagcaac atgggagaac 840 agaaaaagtg aggttatgtt ttgtcgtgtg atttgggttg taaaaacata ttgagatggt 900 atatgtaata tgatgttgaa gctaccaaca caaaactgat tgtgtggaac agcagtaaat 960 tcacttacgc tggttagcac cagctataaa aagaaaagtt ggcaaataaa ttcaagtgca 1020 ttggttagca ccagctttta aaagatatgt tggaagtaac actaacggag gaaaattggt 1080 ttgaatagat caaagctttt ggaatgttgc gaggatctat tgagggaata catcgaaaat 1140 aactatgaaa ttcttgattg actttgcaca cggttttgaa tgttgaacta ttgtcattgt 1200 gattattgaa ggtcgctctt agatgatgcg aatgtttgtt taaatgaaga aaaaaaatgt 1260 aaataatggt cgtgattgac acgaatgaat gatgtgcgtt attgaatgat acgtatgttg 1320 caaaatcaat ttaatttaga gaatgtcgct cttggatgat gcggatgatg tgcgctcttg 1380 gatgatgcgg gagttacatg gaataaaaag tgagcacgtc actcttgaat gatgtgaatg 1440 aatgatgtgc gctataagat gatacaaaag ttaaataaaa ctaatgttta ttaaaatctg 1500 gctcttggat gatgcgaata tatgttgtgt gctcttggat gatgcgaaac tgatatgaaa 1560 taaccaataa agaatgtttt gagaacattg ctcttggatg atgcgaaaga tacacaaaat 1620 aaaaataaaa agttataaaa agttgtacat tggtcttaga tgacacaaaa aaaagtaatg 1680 atgtgcgctc ttcgatgttg cgtatgatag taaatactaa attctcgact attgtatgaa 1740 aatgtaacca aatggtacta ttagattctg acatgaggct tcgagtgaca caaatctgag 1800 caagtgttgt aaacattatt gaaaaattgt tcacagcttt cgtcaccatg gatgatccga 1860 atcaaactga gagggatgtc tcgaatggtt caaatgggcc gataatccag caacatttat 1920 ccccgttcag tccggcatcg tacaatcttc cacacttcaa atttaaacat ttaccgcaat 1980 cagaagtacg caatgcttgg ggatcatgga ttaggtggtt cgaaaccgtt atggcagctt 2040 cgaacatcac ggatggccca agccgtaaaa tgcaattgct tgctatgggc ggcttagaat 2100 tgcagagtgc tttctatggg attcctgggt gtgacgaaca agatgatcca ttgcaagatc 2160 cgtacatgtc aatgaaagat aaattgagcc aatatttctc tccaaaacac cacgacagct 2220 ttgaaaggtt tcttttttgg tcaatggcac ccagtgacgc cgaatccatt gaaaaatttg 2280 ctctcagagt ccagcagagg gctgaaaagt gctcgtttgg aatgaccgca actgagagta 2340 gacaaattgc ggtaatcgat aaaatcatac aatatgcctc cggagacctg cgtcaaaaat 2400 tactggaaaa agaacacata actctcgatg aagcaatcaa aataatcaac gcgtaccaat 2460 ctgttagata ccaatcgtcg aaaatgaacc caaaaccgat cggaagtcta gccaatcgcc 2520 cctatgagca aatcgatagg gtcaatcgac tttacgaaag ttcagccaaa aagtacgggg 2580 caaatccaag gtgcctgcgt tgtggataca ctcgacatag agaagctgtc cattgtccgg 2640 cattgaataa aacgtgcctg cggtgcaaga aaattggaca ttttcaaacg gtttgcaaaa 2700 ctaaaagctc atttaacagt gtaggtattt ttatattgat aacaagcaag gaaaatttta 2760 aagttgttat accttcacct catatttatt agggaacatt cgatcgaaag cgaaaagcta 2820 ctattgtgaa tagtcaaggc ccctctcatc aatctaagta tcctcgatcc tactctcgac 2880 cagtgtacca aatcgatgat cagcaaaata aggaaccaaa acgtgaagag tttccggtat 2940 acaatgttgg agaatccgat gagttgataa cgtgttgtat tgggggagtt gatgttacca 3000 tgctcattga ttcgagatct aagcataatt tgatcgacga cacaacatgg gaaataatga 3060 agctgcaaga cgtcaacatc cgaaatgaaa gatttgatca cgataaacgg tttttggctt 3120 atgggcgtgt tccattgaac ctaatcactg tttttgatgc tgatctggaa atcaacgatg 3180 gcgggaagct tttgaaaacg agtgcctctt tttacgtgat cgaaaaaggc cagcaaccct 3240 tgctgggtaa gataaccgct caacatattg gagtactaca aattggtttg cctagttcga 3300 aagatgcgtt gatctccaga gtcgacacga tgaaagaacc attccctaag atgaaaggaa 3360 taacgctaag tttgccaatc gatagatcag tttcgcctgt tatccaacct ctccgtcgct 3420 gccctatacc actgctgcag caagttgaag ctaagctgaa agagttgtta cagttggata 3480 taatagagaa ggttacccaa ccgacgtcat gggtatcacc gttggttcca atcatgaagg 3540 acaacggtga gctaaggctc tgtgtagata tgcgaagagc caatcaggct atccaaaggc 3600 tcaaccatcc cttgccggtc tttgacgaca tgttgccgaa atttcgagat gcgaaacttt 3660 tcacgacgtt aaacattaga caagcttttc accaagtgga attgtcggag gatagtcgtg 3720 atgtaaccac gtttattaca aattggggtt tgtatcgtta caaacgactt ctctttgggg 3780 taaactgtgc cccggaactg ttccaaaatc tgatggaaag cattcttgca gagtgtaaaa 3840 acacagtagt atttatcgat gacatcctga tttttgggtc cacggaggaa gaacacgatt 3900 tggcagtgaa acatacattg agcgttttga accggtatgg tattctattg aatatccaca 3960 agtgcaagtt caagcaaacg gagatttctt tcttgggtca tagactttca ccagagggag 4020 tggcaccggc ggatgaaaaa gttcaggcta tactacactt cagagctcca caaacaaaag 4080 aagagctgag gagctttctc ggcttagtga catacgtgtc gcgtttcatt ccgaatttgg 4140 caactgtgaa ctatccttta cggaagttac tcaagctgga gaccccgttt gattggaagc 4200 aagagcatca ggagtcattc gatcaactta agcgaataat aggatctatc cagcacttag 4260 ggtattacga ccctaaagat cgaacattac taatcaccga tgcatcagga gtagggctag 4320 gcgctgtatt attgcagttt aagaacaatc agcctagggt tattggatat gcctcaaaaa 4380 gcctctcaga aaccgagaaa aagtacccgc cgatcgaaaa ggaagctctt ggaattgtgt 4440 ggggagtcga gcgcttcaga atgtatcttc ttggaataca ttttgagctg gagacagacc 4500 atagaccgtt ggaaactctt tttacggcga attcaagacc aacggcaaga attgagcgtt 4560 ggatgctgag aattcaagca ttcaagttca aagtaagggt ttgcttagat ttaaaacatt 4620 tgtgtatagt atagaatctt attctgataa ttgttcacag gttgtttatc gtaaaggttc 4680 tgctaactta gcagatacat tttctagact tggagctcac gtccctgata ctcattggac 4740 ggaagaatgt gacgttttca ttcggcgtgt cttagcggaa tcgttgtcca ccttggtaaa 4800 acctttgaat caagatgatt tcgatccaga agctgaaatc tttattcgaa ctatccaaga 4860 ggctgctgca gtagatatcg aagaagttat tcaggctaca gcatcggatc aggagatgca 4920 aaagcttaga gcatgtatcg taaacgattc atggaataat gaagatttaa aacagtataa 4980 tccgtttcgt aacgaattta catatgtgaa ttcactcatc ttgcgtggat ccaagctata 5040 ccgaaaagcc tgcggtctag aatgctgcag ctagcacatg aggggcatcc tggacagtcc 5100 ctgatgaagc gtcgtttgcg agaaagatgc tggtggcctg gtatagatca atctgcagta 5160 gatacatgtg aatcatgcga gggttgtcgc ttggttcagc ttcccgatcc accagagccg 5220 atgatgcgac gtcaattgcc cgaccgtccg tgggttgata tagcgataga tttcctggga 5280 ccaatgccat ccgctgaata tattctggta gtcatcgatt attacagtcg ttacatggaa 5340 ctagaggtga tgagtaaaat aacagcacag gagactatca gacgcttaag acgtattttt 5400 cggatctggg ggccacctag aacaatcacg ctagacaatg caaaacaatt tgtatcttgc 5460 gaatttgaag agttttgcaa agcaaatggc attcatctca accatacgtc accgtactgg 5520 ccccaggcca atggggaagt tgaacgccag aacaggtcgc tgttgaaaag aatgaaaatt 5580 gcgaatgcat tgtacgatga ttggaaagcg gaactcaaca gttatctaga actgtacaac 5640 aacactccgc atacagttac tggtaaagcc cctagtgagc ttctgcagaa tcggaaatta 5700 cgcaccaaac ttccttgtat cgacgatctt cagaccatgc ctcctagtag tgactttaga 5760 gatcaggatc atgagagaaa aattatggga aaacggtggg aggatgtgaa gcgcagagca 5820 aaaccgagca ctatctcgat tggcgacaca gttctcatga aaaacctttg tcctacgaac 5880 aagctgtcga ccaatttcca taaggagaaa ttcttggttg tgaacagaca aggatccaac 5940 gtaatcgttc aatctatgga aactggtaaa acatacgatc gaaacacatc tcacttgaaa 6000 acggtctctg atccagtgcc aataacggtc aatgacccgg acacgaattt ggatgatcgg 6060 tcattgccca ccacatgcgt tgatccaaat atcgatgaag aatcacatga cggtgctgta 6120 gtctcacatt cggtaccgga accatgtgaa tcgttgctgc cttcagtaca actgcgacgg 6180 tcggcacgaa cacccaagcc gaaggaaata tacagcccat gaagctaagt atccacatgt 6240 attaacgttt tactatttaa tgtaataaaa caaaactatc attaacgaaa taaaataaat 6300 tatttgaaag aaaagggga 6319 // ID BEL-195_AA-I repbase; DNA; INV; 5970 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-195_AA_; KW BEL-195_AA-LTR; BEL-195_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5970 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 881-881 (2011). XX DR [2] (Consensus) XX CC Positions [5007-5567] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 72..5906 FT /product="BEL-195_AA-I_1p" FT /translation="MVQCSRCLKWYHFSCVGVDQSIAELVWYCRTCLGEDV FT SARESLGQPGITRVQPPRAAKKGKNSTGKSQDKNVQKRIDAPIGSSEQSEV FT AKQPAGQLSISKGQVSGEAGTSKTQRLTNTERLLSINPSRTRSHSGNSRSE FT SHTNSESSKKSRSSSQKTMSKGSKRDLALLRIEEMKERSRLEALEEENQLQ FT ILKLKQKQIERKRRELEELQQLSKLVEEESDCNEDPEKQDKVEIDYIDKVQ FT GWMSSSNPLHPGRQRRLSESSMEEISLMIPEADSERAARRRHLQLRSSSML FT AGPTKLQIASRQVFPKNLPSFSGHPQDWPLFVSAYEQANASCGFTNSENLI FT RLQQALHGKALETVRNLLLLPENVPTIMDKLRRRFGNPEVLSTMLAQRVQK FT LEGPDSENLESLIEFGSAIEEFTQHLYVSKLNDHLKNPILMQSLVQKLPPC FT YAMQWVEYKRRSRIVDLKTFGSFMEDLVDKALEVTFERMDLAATKRREKPK FT AKVFTHLVEGDNEASGIQQNNKVQTTVVQASPRRDISCSICDEPGHFGRDC FT REFRCASVERRWMIVKHLGLCALCVYNHGKWPCRSKIRCEIDGCKGLHNPL FT LHPPISSETAKMEAHCNSHSSMNKTVLFRIIPVTLYSERNKFDTLALVDEG FT SSTSLIDGEIAEFLQLSGPREPFQMRWTNGVSRTETQSRNVEMKISGRNCK FT AFDLCIARTVKKLDLPAQKLDSSRLIEEFHHFGDIQIPSYDLDSPKLLIGI FT DNMHLIAPLASRVGKRGEPIAVECKLGWTIYGPRPNMIADVHFLGHHRCVC FT EECSKADQELNQMLRDNFKLEAVGESPIRLESKEDCVARNILERTIKRIGN FT RFEIGLLWKNGIPRFPESYSMACKRLTKLEARLERNPGIRESLQKQIEEYV FT MKGYAHKITDQELQETPPERCWYLPLNYVVNSKKPGKVRTIWDAAARTNGV FT SFNDSLLKGPDLVAPLSGVINGFRERQVAFGGDIRETFHQILVRKEDRQSQ FT RFLFRFDMNFEPEIFVMDVVIFGASCSPCLAQYVKNMNAREHVQEYPEAAD FT AIIRKHYVDDYFDSVDTENEAIVRAKAVRKVHANAGFEIRNWVSNSSKVLK FT ELGEVEPQEVKLLDMSASDTERVLGIMWRPDIDSFVFSTEFRSELQPYIKE FT GAWPTKRIALRCVMSMFDPKQFLAPLLIHGRILMQNLWRSGIGWDEKLGEE FT HYDQWLRWTRLLPLIDDIQIPRCYLGEMSSTVYDTVQLHVFADAGEDAYGC FT VAYFRFTDGKNVHCAFVEAKAKVAPLQYLSIPRKELEAAVLGARVMKAICE FT SHSVKIKKYFLWTDSNAVISWVKSDARRYKPFVAHRIGEILSVTNPENWRW FT VPTKDNAADDVTKWGNATQVCSDNRWFRGPDFLYSAEEDWPEQKHKSIEVE FT EELRASVLFHDIVLPDVILSRIEHISKWKVLVRMVATMYRFITNCRRKIKK FT QSIEALPYSKKGATTVSAEQVPLRQEEYLLAENYLWRVAQGAEFADEVKTL FT FKNQDSPKGQPRMIEKSSPLYRLSPFLDECNVVRMEGRTVAADYAAFDVRF FT PVILPKQHLIVEKLVEFYHQKCGHGSREMVVNEVLQRFYIPGLRSLVERVS FT RHCLWCKVQKAKPVVPRMAPLPASRMAVSEHPFSYVGVDYFGPIEVVVGRR FT REKRWVALFTCMTVRAIHLEEVYSLTTQSCEMAIRRFVKRRGSPIEIFSDN FT GTNFVGASKDLEAQIRTINVECADTFTGAKTKWTFNPPSAPHMGGVWERMV FT RSVKAAMSVFANGERLTDEILQTTLAEVEHLINSRPLTYVSTNVQEDKEAL FT TPNHFLTSCPLIECMPSRNAMQLGDRLRNSYCQAQYLAEELWSRWQREYLP FT LMNRRSKWLEERKPVAVGDLVYVADSEKRRTWERGVIEEVFTGDDGRIRSA FT MVRTKTGLKRRAVAKLAILEI" XX SQ Sequence 5970 BP; 1825 A; 1141 C; 1546 G; 1458 T; 0 other; ataatcctca aaaaaatcta ccactggatg cgatctaccc ctgctgcgaa gaagacgacc 60 agtatgacgc gatggtacag tgtagtcggt gtctgaaatg gtaccacttt tcatgtgtcg 120 gagtggacca aagcatagcg gagcttgttt ggtattgccg tacatgcctc ggagaggatg 180 tatccgcacg tgaatcatta ggacagccag gtatcacacg cgtacaacca ccgcgagcag 240 ccaagaaggg taaaaattct accggtaagt cgcaggacaa gaatgttcaa aagaggatag 300 atgctccaat cggatcgagc gaacagagcg aagtagctaa gcagcctgca ggccagctaa 360 gtattagtaa aggtcaggta tcaggtgagg cagggacatc taagacacaa agattaacga 420 atactgaacg gttactgtca ataaatccgt cgcgtactcg ttcacatagt ggaaattcgc 480 gttccgagag tcacaccaat agtgagagtt cgaagaaaag tcgcagttca agtcagaaaa 540 ctatgtcgaa gggttcgaag cgtgatctgg cattgttgcg aattgaagag atgaaggaga 600 gaagtcgttt ggaggccctg gaagaagaaa accaactgca aatattgaag ttgaagcaga 660 agcagattga gcggaagcgt cgggaactag aagagttgca gcaattatcg aaactggtag 720 aagaagaatc tgattgcaac gaagaccccg aaaaacaaga caaagtggaa atcgattaca 780 tcgataaggt gcaaggttgg atgagctcta gtaacccatt gcatccgggt agacagcgac 840 ggctttcgga aagtagtatg gaggaaattt cattgatgat tccagaggcc gatagtgaaa 900 gggcagcgag acggcgtcac ctgcagctgc gaagttccag tatgctggca gggccaacga 960 agttacagat agcttctcgc caagtattcc ccaaaaatct accgagtttt tcaggccacc 1020 cgcaagactg gccattgttt gttagtgcct acgaacaggc aaatgcgtcg tgtggattca 1080 cgaattcaga aaatctcatc aggcttcaac aagccctcca tggaaaggcc ttagaaactg 1140 taagaaatct gttgcttcta ccagaaaatg tgccaacgat catggacaag ctacgccggc 1200 gttttggaaa tcccgaggtt ctctcaacaa tgctggcgca acgagttcaa aagcttgaag 1260 gaccagactc ggagaatttg gaatccttga ttgaatttgg tagtgcaatc gaagaattca 1320 ctcagcatct atacgtgtcg aaactaaacg accacttgaa gaatccgatt ctaatgcaga 1380 gcttagtaca aaaattaccc ccgtgttatg ctatgcagtg ggttgagtac aaacgacggt 1440 cacgaatagt agatttgaaa acctttggga gtttcatgga ggatttggtg gataaggcgt 1500 tggaagtaac cttcgaaaga atggatttag cagcaacgaa aaggcgtgaa aaaccgaaag 1560 cgaaagtttt cacccatctg gtggaaggag ataacgaagc gagtggcatc caacaaaaca 1620 ataaagttca gacaacagta gtacaagctt ccccgagacg agatattagt tgttcgattt 1680 gcgatgaacc aggacatttt ggtcgagatt gtcgagaatt tcgttgtgca agtgtggaac 1740 gaagatggat gatagtaaaa catttgggtc tgtgtgcgct ttgcgtttat aatcatggca 1800 aatggccctg tagatcgaag attcgatgtg agattgatgg atgcaaaggt ttgcacaatc 1860 cgttgttaca ccctccgatc agctcagaga ctgccaaaat ggaggctcat tgtaacagtc 1920 atagttcaat gaacaaaacg gttttgtttc gcataattcc ggttacactt tatagtgaaa 1980 gaaataaatt tgacacgctt gcgctggtag atgaaggatc gtcaacttca ctcatcgatg 2040 gagaaattgc agagttcttg cagttaagtg gaccaagaga accgtttcaa atgcgctgga 2100 cgaatggagt tagccgtacg gaaacgcaat caagaaatgt cgagatgaag atctcgggga 2160 gaaattgtaa agctttcgac ttgtgcattg cgaggactgt taagaaatta gatctgcctg 2220 ctcaaaagtt ggattcaagt cggcttattg aagaattcca tcatttcgga gatattcaaa 2280 tcccaagcta cgacttagat tctccgaaac tcctgatcgg tatcgacaac atgcatctta 2340 ttgctcctct ggcgtcacgt gttgggaaaa gaggagaacc tatcgctgtt gaatgcaagc 2400 taggatggac gatttacgga cctcggccaa acatgatagc agatgtacat ttccttgggc 2460 accacagatg cgtttgcgag gagtgcagta aggcagatca agaattaaac cagatgttgc 2520 gggataattt taagcttgaa gcagttggag agtcacctat acggctcgaa tcaaaggagg 2580 actgcgtagc tcgaaatatt cttgaacgaa ccattaagcg gattggcaac cgtttcgaga 2640 ttggtttact ctggaaaaac ggtataccga gatttccgga aagttattca atggcgtgta 2700 aaagattaac gaagctggaa gctcgactag aacgaaatcc aggcatccgt gaaagtcttc 2760 agaaacaaat tgaagaatac gtaatgaaag gctatgctca caagattacg gatcaagaac 2820 ttcaagaaac accacctgag cgatgttggt acctaccctt aaactatgtt gttaattcga 2880 agaaacctgg aaaggtcaga acgatatggg atgctgctgc gaggacaaac ggcgtatcgt 2940 tcaacgattc attactgaag ggacccgatt tggttgcgcc actgtctgga gtaatcaatg 3000 gatttcgaga acgacaggtg gcctttggag gtgacatacg agagacgttt caccaaattc 3060 tggtgcgaaa agaagatcga cagtctcagc gttttttgtt tcgatttgat atgaactttg 3120 aaccagaaat tttcgtgatg gacgtggtca tcttcggggc gagctgttcc ccttgcttgg 3180 cacaatatgt caaaaatatg aatgccagag aacatgtgca agagtatcca gaagctgctg 3240 atgccataat acggaaacac tatgtcgatg actattttga tagtgttgat acagaaaacg 3300 aagcgatagt acgagcgaag gctgttagaa aggtacacgc gaacgctggg ttcgaaatac 3360 gcaattgggt gagcaattca tcgaaagtat tgaaagaact tggtgaagtg gagccgcagg 3420 aagtgaagtt gcttgatatg agtgcttccg atacagagag agtccttggc ataatgtggc 3480 gaccagatat cgattcgttc gtgttttcca ccgaattcag gagcgaactg caaccataca 3540 tcaaggaagg agcatggccg acgaaacgaa tcgctcttcg ctgcgttatg agcatgttcg 3600 acccgaaaca atttttggcg cctttgctta tccatggtcg tattttgatg cagaatttat 3660 ggcgaagtgg catcggctgg gatgagaaat taggtgaaga acattatgat caatggttac 3720 gatggacaag gttgcttcca ttgattgatg atatacagat tccccgttgt tatcttggag 3780 agatgagttc tacggtgtat gatacagtgc agcttcatgt cttcgcggat gctggggaag 3840 acgcttatgg atgcgtagcg tatttccggt ttacggatgg taaaaatgtg cactgtgcgt 3900 ttgtggaagc aaaagccaaa gtggctcctt tacagtacct gtccattccg agaaaagagc 3960 tagaggctgc tgtattagga gcaagagtaa tgaaagctat ttgcgaaagc cattcagtga 4020 aaataaagaa atattttcta tggacggatt caaacgcagt tatatcgtgg gtgaaatcag 4080 acgcaaggcg atataaaccc tttgtagctc acagaattgg cgaaatatta agcgtaacca 4140 atccggagaa ttggcgctgg gttcctacta aagacaacgc tgctgatgat gtaacaaaat 4200 gggggaatgc gacacaagtt tgttccgata atcgatggtt tcggggtcct gactttctgt 4260 acagcgcaga agaagactgg ccggaacaaa agcataaatc cattgaagtt gaagaagagc 4320 tacgagctag tgttttgttc catgacatcg tacttccaga tgtaattcta tccaggatag 4380 agcatatttc aaagtggaaa gtacttgtac gaatggtggc aactatgtat cgtttcataa 4440 cgaactgtcg acgtaagatc aagaaacaat caattgaagc tttaccatat tctaagaaag 4500 gagctacaac cgtctcggct gaacaagttc ctttgcggca agaggaatat ttattagcag 4560 aaaattattt atggcgcgta gcgcagggtg cagaatttgc cgatgaagtg aagacgttgt 4620 tcaaaaacca agattctccc aagggtcaac cgagaatgat cgaaaagagc agtccgttgt 4680 atcggctgtc cccattttta gatgaatgta atgttgtgcg aatggaagga cggacggtag 4740 cggccgatta tgctgcattt gatgtacggt ttccagttat tctgccaaaa cagcatttga 4800 ttgttgagaa gttagtggaa ttctaccatc aaaaatgtgg tcacggcagt cgagaaatgg 4860 tagtgaatga ggtacttcag cggttctaca ttccaggact gaggtccttg gttgaaagag 4920 tttcacgaca ttgcttatgg tgcaaggtac agaaagcaaa accagttgta cctagaatgg 4980 cgccgcttcc cgcatctaga atggctgtaa gcgaacatcc tttctcatac gtcggtgtgg 5040 attactttgg ccctatcgaa gtggtagtcg gacgaagacg agaaaaacgg tgggtcgcat 5100 tattcacttg catgacagtt cgcgccatcc accttgagga agtctacagt ttaacaacgc 5160 agtcatgtga aatggctatt cgtagatttg tgaaacgtcg aggaagcccc attgaaattt 5220 tttccgataa tgggactaac tttgtcggcg caagcaaaga cttggaagca cagatacgaa 5280 ccatcaatgt cgaatgcgct gatacgttta cgggagcgaa aactaaatgg acattcaatc 5340 ctccctcagc tccacacatg ggaggcgtat gggaacgaat ggttcgatca gtcaaggcgg 5400 caatgtctgt atttgcaaac ggagaaaggt taacagatga aattctgcag actactctgg 5460 cagaagtaga gcacctgatc aactcacgcc cgttgaccta tgtttccaca aacgtacaag 5520 aagataaaga agcacttaca cctaatcact ttctgacaag ctgccctttg atagagtgta 5580 tgccgtcgag aaacgcaatg caattggggg atagactacg aaacagctat tgtcaagccc 5640 aatatttagc ggaagagttg tggagtcgtt ggcagcgaga atatctacca ctgatgaatc 5700 ggaggtcaaa gtggctcgaa gaaaggaaac cggttgctgt tggagattta gtgtatgttg 5760 cggattcaga aaagcgaagg acctgggagc gtggagtgat cgaggaagtc ttcacaggag 5820 atgatggacg aatacgttcg gcgatggtgc gtaccaaaac cggattgaag aggcgagcag 5880 ttgcgaaact agcaatattg gaaatatgag tatagtatag taagcactgt gctgattaca 5940 gtcaacaaca gggcttacgg gctggggaga 5970 // ID Gypsy-27_DPu-I repbase; DNA; INV; 4483 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_DP_; KW Gypsy-27_DPu-LTR; Gypsy-27_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4483 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [3310-3768] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 157..4479 FT /product="Gypsy-27_DPu-I_1p" FT /translation="MDSLPQFDQFKVHCDPHTAGLRWEKWIIRFERLMEAI FT NVVSQTADSADQKTATDKRRLALLLHYAGTEVEDVFDTLPGVQDDKNTYSK FT AKPLFQAYFQPKKNVELEVFNFRRSRQEPGENMDAFATRLRQLATRCDFND FT SDLEIKLQIIQGCKSSNFRKLCLRDQLSLKKMLENARAAEAADRYAESVEQ FT SSKQATNSVNKSRNNFKFKSHKETGDNGKKTTSKCRKCGGQWPHTGSPCPA FT KGKCRNCGYDWPHPESKPCPAKGKKCESCGMLNHFSKCCKNKKAKPGSSSK FT QEHTRQVNDRESSSDESTWTISPTRSDDRPLTHIKLCGGKCKMMIDTGSTC FT NLLSFKEYNELPNKPTLHPTGNSVNGYGGNPLKIIGKFKTLIESKHAYADA FT VIYVSDNEKADNLLGYSTAKELKLVKVSNSVSSSNDYSFVKQKFGPLFSGL FT GKMKNYQVKLHIDQSIQSVTQKHRRIGFHLRKGVDKELTNLENEDIIEPVT FT DGPTEWISPCHAVPKPKQPGKLRVCVDMRAPNKAILRTRHIIPTIEDLVVD FT LNGAKYFSKLDMNQGYHQLELAPESRYITTFSTHRGLFRYKRLNFGMNCAS FT EIFDDVIRQTVSGIPGVVNRSDDIFVTGKTKEEHDSSLEKVLKRLMDKNLT FT LNFEKCEFGKEEIDFFGLHFSGKGVSPTQAKVEAIKKAEPPSTPDEVSSFL FT GMVTYCSRFIPNAAAISEPLRRLTHTKQEWVWKEEQEKAFKNLKKSLCKAV FT TLSYFDVGKPTSIVVDASPRGLGAILTQVSKDGSNSIIAYASRLLTDVESR FT YSQTEREALAITWAILHFHLYVTGKEFTVITDHKPLEAIFNKPLIQPPARI FT ERWLLKLQLYDFTVIYQPGKSNPADYMSRHPIPSTAVSTREQSIAENYVNS FT ICMKSVPKPVPLGSITSATKEDKAIQLIFKILSRKRCKKEEVEKLNSQEKD FT WFISLKQIQLQLTRIETEEGEMLLKDTRIVIPHSLTEKIIDIAHQGHQGIV FT KTKALLREKVWFPGIDKAVERKVTNCIPCQACSKKNCPEPLQMSSLPAEPW FT TEVSIDFADMPMGEHLLVVYDDYSRYPVVTKVSSTSAKVVTKKLEDIFAQF FT GIPEVVKSDNGPPFQSYDFKQFAQDLNFKHRRVTPLWPQANGGVERFMKTL FT KKALRCAVAEKRNWREEIPRFLLNYRNSPHASTGKAPATLLFGRSLRTRLP FT QIPKTKKEEEIRNKDLKSKSRMKENADKKAKHRELKEGDFVLLRRDKLANK FT LQAPFDPSPYRITEVKKSMITAERNNKKVTRNVTFFKKIERKDEESEESEE FT SEDEYYHNALENVNFPEQIQVPVHDDPQQNEPVEEPENANDAPPAGVQLPV FT EGTQAYERDTVERAISVARARLLEALVRNPPPNELVRPTRVQSEPTSPEES FT TRPKRNRQPPKRLDDFVVNMFHSKKGK" XX SQ Sequence 4483 BP; 1630 A; 950 C; 911 G; 988 T; 4 other; ktccctgtag tcattcatag tagtgcagtc attcataaca tgtaccgtgc acmstgtagk 60 cactgagacc ttacacgaga atacacacac acttacagag agaatattac actggcgacg 120 aggtaaagta agttttaaaa tcccttacat tttataatgg attcgctacc gcagtttgat 180 cagtttaaag ttcattgtga tccgcatacc gctggtttga gatgggaaaa gtggataatt 240 cgcttcgaaa ggcttatgga ggcaatcaat gtagtctcac aaactgcaga ctcagccgac 300 cagaaaacag ctaccgataa gcgtaggctg gccttactgc ttcattacgc aggaaccgaa 360 gttgaagacg tatttgacac cctgccaggt gtgcaagacg acaaaaacac ttactctaaa 420 gccaaacctc tgttccaagc ctattttcaa ccgaagaaaa atgttgaact tgaagtattt 480 aatttccgaa gatccagaca agaacctggc gaaaacatgg acgcttttgc aacaagactc 540 cgtcaactcg caacaaggtg tgattttaat gattctgact tagaaattaa attacagatt 600 attcaaggct gcaagtcaag caacttcaga aaattgtgct taagagatca actgtccctg 660 aaaaaaatgc tagaaaatgc acgcgcagca gaagccgctg atcgctatgc agaaagcgtg 720 gaacagtcat cgaaacaagc cactaactcg gtaaataagt caagaaataa tttcaaattt 780 aagtcgcata aagaaaccgg agacaacgga aagaagacga cgtcgaaatg tcgcaaatgt 840 ggaggacaat ggcctcacac aggaagtccg tgcccggcaa aaggcaaatg tcgaaattgt 900 ggatatgact ggccgcatcc cgaatcaaaa ccgtgcccgg caaaaggcaa gaaatgtgaa 960 tcttgtggaa tgttgaatca tttcagcaag tgctgcaaga acaagaaagc caagcccgga 1020 tcatcttcaa agcaagagca caccagacaa gtcaacgacc gagagagctc gagtgacgag 1080 tcgacttgga caattagtcc aaccagatcg gatgacaggc cactcacgca cattaagctg 1140 tgtggcggaa aatgcaagat gatgatcgac acaggatcaa cttgcaacct gctgagtttc 1200 aaagaataca atgaactccc aaacaagccg acacttcatc ctacaggaaa ctccgttaat 1260 ggatacggcg gaaatccttt gaaaattata ggcaaattca agacactgat tgaatcaaag 1320 cacgcatatg ccgatgctgt tatttacgtt agcgacaatg aaaaggctga caaccttctt 1380 ggatactcca cagcaaaaga gctcaagttg gtaaaagtat caaattcagt atcttcttca 1440 aatgactatt cattcgtaaa acaaaagttt ggaccactgt tttcgggact tggaaaaatg 1500 aagaattacc aggtaaaatt acacatagat cagtcgatcc agtcagtaac tcaaaaacac 1560 agaagaattg gttttcacct gcgcaaaggt gttgacaaag agctgacaaa cctggaaaat 1620 gaagacataa ttgaacctgt cacagatggc ccaactgaat ggatttctcc atgtcatgct 1680 gtaccaaagc caaagcaacc aggaaaacta agagtatgtg tcgacatgcg agcaccaaac 1740 aaagcaattt tgcgaaccag acacatcatt ccaactatag aagatttagt tgtggattta 1800 aatggagcaa aatacttttc aaaactagac atgaatcaag gatatcatca gctagaacta 1860 gctccagaat caaggtacat tacaactttt tcaacacacc gtggattatt tcgttacaaa 1920 agattgaact ttggaatgaa ttgcgcgtct gaaatttttg atgacgtcat tcgccaaaca 1980 gtaagtggaa tcccaggtgt ggtaaacaga agtgatgaca tttttgttac aggaaaaaca 2040 aaagaagaac atgattcaag cctggaaaag gtactgaaga gactcatgga caaaaacctc 2100 acactcaatt ttgagaaatg tgaatttgga aaagaagaaa tcgacttctt tggattacac 2160 tttagcggaa aaggagtttc accaactcaa gctaaagtag aagctatcaa gaaagcggaa 2220 ccaccgtcaa cgcctgacga agtatcaagc tttttaggaa tggtaacata ctgtagccgt 2280 ttcatcccta atgccgctgc aatcagcgag ccgctaagac gccttactca caccaagcaa 2340 gaatgggtgt ggaaggaaga acaagaaaaa gcattcaaga atttaaagaa aagtctctgc 2400 aaagccgtaa cactctctta tttcgacgta ggaaaaccaa cgtccatcgt ggtcgatgca 2460 agcccacgag gcctaggcgc aatcttgaca caagtttcaa aagatggatc aaactccatt 2520 atcgcctacg ctagcagact gttgacagac gtagagtcac gctattcgca gactgagaga 2580 gaagcactcg ccattacatg ggcgatcctc cattttcatc tctacgtaac cggtaaggaa 2640 ttcaccgtga taacagatca taagccacta gaagcaattt tcaacaagcc gctcattcag 2700 cctcctgcaa gaattgagcg ctggctttta aagcttcaat tgtatgactt cactgtcatt 2760 tatcagcccg gaaaatccaa cccagctgat tacatgtcac gacaccctat accgtcaaca 2820 gcagtctcaa caagagaaca atcaattgct gaaaattacg tcaattcaat ctgtatgaaa 2880 tcagttccaa aaccagttcc tcttggcagc atcacgagcg ccacaaaaga agacaaagca 2940 atacaactga ttttcaagat ccttagtcga aaacgttgca aaaaggaaga agtagaaaaa 3000 ctgaacagcc aagaaaaaga ttggtttatc tcactcaaac aaatccagct gcaactgaca 3060 agaattgaaa ctgaggaagg agaaatgttg ttaaaagaca cccggatagt tattcctcat 3120 tctttaacag aaaaaataat tgacatcgcg catcaaggac accaaggtat cgttaaaact 3180 aaagcgttac taagagaaaa agtttggttt ccaggaatcg acaaagcagt tgagagaaaa 3240 gtaacaaatt gcattccatg ccaagcatgc agcaagaaaa attgccccga accattgcaa 3300 atgtcaagtt taccagctga accttggact gaggtcagta tagattttgc agacatgccg 3360 atgggagaac atttgttagt tgtttacgac gactactcaa gatatcctgt cgtaacgaaa 3420 gtttcatcaa cgtcagcaaa agtggtaaca aagaaactag aagatatctt tgcacagttt 3480 ggaatacctg aagtggttaa aagcgacaac ggtccgccat ttcagagtta tgattttaaa 3540 caatttgctc aagatttaaa tttcaaacac agaagagtca ctcccctatg gccacaagcc 3600 aatggcggag tagaaagatt catgaagacg ctcaaaaaag cactgcgctg tgcagtcgct 3660 gaaaagcgca attggagaga agaaattcca agattcttac tcaactatag aaattcaccg 3720 cacgcttcaa caggaaaagc accagcaact ttgttattcg gacggtcatt acgcacacgt 3780 ctaccgcaaa taccaaaaac aaagaaagaa gaagaaataa gaaacaaaga cctaaaatca 3840 aaaagtcgca tgaaagaaaa tgcagacaaa aaggcaaagc atagagaact aaaggaagga 3900 gattttgtat tattacgaag agacaaactg gcaaacaaac tacaagcgcc atttgatccg 3960 tctccgtatc gtattactga agtgaagaaa agcatgatca ctgctgaaag aaataacaag 4020 aaagttactc gcaatgttac attcttcaag aaaattgaaa gaaaagatga agaatcagaa 4080 gaatcagaag aatcagaaga tgagtattat cataatgcac ttgaaaacgt taactttcca 4140 gagcagattc aggttccagt tcacgacgac ccacaacaaa atgaaccagt agaagaacct 4200 gagaacgcca acgacgcacc gccggctgga gtccaattac cagtcgaagg aacgcaagcg 4260 tatgagcgag ataccgtcga acgagcgata agcgtagcca gagctagact actggaagca 4320 cttgtacgca acccgcctcc aaatgaatta gttcgaccga cgagagttca atccgaaccg 4380 acatccccgg aagaatcaac gagaccgaaa agaaacagac aacctcccaa acgtttagat 4440 gattttgtag ttaatatgtt tcattcaaaa aaaggaaagt gaa 4483 // ID R1A_DAn repbase; DNA; INV; 5710 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE ananassae. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1A_DAn. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-5710 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. D. ananassae contains two subfamilies of R1. XX FH Key Location/Qualifiers FT CDS 582..2087 FT /product="R1A_DAn_1p" FT /translation="AAPLAPVEGDMSAMSAMSALSESELSASSRASSRASS FT AIGAARGRRRGRKSKATSLSAPTQAKLLAATGADRPEPVGVLEELALSSLE FT TPVMPATSAVDASAATFAAPAVVAGVVADAAEAAAAVAAGAPAAVPERSAV FT SMRSLDCIIKNGADGVMEEAASIKETFVSLVVESLMNPQQATALMRCISRY FT DGIVQALMVRNAVLEESSKLRAAMPLPPPPVQMPSVPQAVVSRPPVGYASA FT YPSLPAPSAAPVPAPRKPRDTWSAVVMSKDPKLSGKEVAEKVRREIAPSLG FT VRLHEVRGLARGGAIIRTPSSGEIKRVVASKKFGEIGLEVKPNAVQRPKLV FT VQDVATQIAPEEFMAELYANNLREHIPEAEFRKAVHLDTKPWTVADGATVS FT VTLECEPKVLDLLEGGRIYIKWFSYRCRALVRTYACHRCVGFNHKVAECKQ FT KDNVCRQCGQSGHSARSCTNPVDCRNCRFYGHPSGHSMLSPSCPVYAAVLA FT RVNSRH" FT CDS 2084..5155 FT /product="R1A_DAn_2p" FT /note="reverse transcriptase." FT /translation="TLMFRFIQANCGRGRAATLELAVRLRESGCLFALLQE FT PYVGSGTSDVLPEGMRIYTDRRQKAAILVDHQDVICMPMEPLTTDYGVCVS FT VKGSFGSIFLCSAYCQFDTELEPYLRYMDAVLLQASRTPAILGLDANAASP FT MWFSKLPRNPEGHANYNRGELLSEWMLENGAVALNQPSPVFTFDTYRARSD FT IDVTIANDAASMWATFEWRVDEWELSDHNIITVVVTPTTTRAVESLAPVPS FT WNISNARWQLFEQEMVSRAADIPEDFSQSPLDQQVSTLRCMVHDACDLAIG FT RRTPRSPRRRKPGWWTANLNTAKREVRRLRRRLQNARRRGDDDAAELIVVA FT LRQASDDYKKLILRAKEDSWRRFVGENAHDPWGRVYKICRGRRKCTEIGCL FT RVDGELVTDWGDCARVLLRNFFPVAESDAPIAGAEEVPPALEEVEVDACVA FT RLKSRRSPGMDGINGTICKAVWRAIPQYLTALYSRCIQSGYFPREWKCPRV FT VALLKGPDKDRCEPSSYRGICLLPVFGKVLEAIMVDRVREVLPEGCRWQFG FT FRQGRCVDDAWRHVKSSVAASPARYVLGIFVDFKGAFDNVEWSAALRRLAD FT LGCRELSLWQSFFSDRRAVIRSSSGTVSVPVTRGCPQGSISGPFIWDLLMD FT VLLRRLQPYCAFSAYADDLLLLVEGNSRAVLEDKGAQLMSIVEAWGTEIGV FT TISTSKTVIMLLKGALARTPLVRSAGANLPYVRSCRYLGITVSERMNFFMH FT IASLRQRMTGVVQALARVLRVDWGFSPRARRTIYAGLMVPCALFGASVWYV FT VTTRQVVARRRLLACQRLILLGCLPVCRTVSTLALQVLAGAPPLDLAAMKI FT AVKYKLKRGYPLEGDDWLYGEDLSGLSWTQRVSRVDECLLSDWQSRWDDGD FT SPGRVTHKFIPDAGFVYRNPDFGFSMRAGFLLTGHGSFNAFLHRRALSDTA FT ACSCGDPNEDWEHILCACPLYADLRDLDGLGLQHVGGTWTFGRILESGDRT FT RRLTEFAEEVFRRRRGLQ" XX SQ Sequence 5710 BP; 1166 A; 1466 C; 1772 G; 1306 T; 0 other; ttcaagtaag cgcgggtcaa cggcgggagt aactatgact ctcttaaggt agccaaatgc 60 ctcgtcatct aattagtgac gcgcatgaat ggcttaacga cggacgtgtt ttggagtgct 120 ccgtaattaa agtcattttt ctccggtaat tcggtgtcgc agacgtttga atttagtgtt 180 aaaattcgcg tattaattac gcgaatcgaa cagtgctatt ttctcgtcgg gaaattaagt 240 ttttctattg aaaaacagtg ttttcgtttt tgcacgtgtt ggcacgtggt ggaaattttc 300 cacagataaa gctcggtgag tcttctggtg tgagcttcag tgcaaaggtg tgtgtgagag 360 tactcaagag agagcttgag agagagcttt gagggtgctc tcgcccgtgc tttttgctga 420 gcttacgcga gctttgggag tgtgctttct tgtgcaagct ttggtgcgtg cttttgcgcg 480 cgctttagta aacacttgga gcatactttt gggctgcctt tttatttgca tatttttgtt 540 gcatattttt ggggccccca ttttcagcac agctccacta ggcggcgcca cttgcccccg 600 tagagggcga catgagcgcc atgagcgcta tgagtgcgtt gtcggagagc gagctctccg 660 caagcagccg ggcaagcagt cgcgcaagta gtgcgattgg tgctgcacgg ggacgccgtc 720 gaggacgcaa gtccaaggcg acgagcctct cggcgccaac gcaggccaag ttgttggccg 780 ctactggagc ggacagaccg gagccagttg gggtgttgga ggagttggcg ctctcctcgt 840 tggagacgcc ggtgatgcca gctacctcag ctgttgacgc ctccgccgct acctttgctg 900 cccccgctgt tgttgctgga gttgttgctg atgctgctga ggcagccgcc gctgtcgccg 960 ctggtgcccc cgctgccgtc ccggagcgat ctgccgtctc gatgcgcagt ctcgactgca 1020 tcatcaagaa tggagctgac ggcgtaatgg aagaagcggc cagcatcaag gagacgtttg 1080 tttccctggt ggtggagtcg cttatgaacc cccagcaggc tactgcgctg atgagatgca 1140 tcagccggta tgacgggatc gtccaggcgc tcatggtgcg caacgcagtg ttggaggaat 1200 ccagcaaact gcgagcggcg atgccgctgc caccgccgcc ggttcagatg ccatcggtgc 1260 cgcaagctgt ggtctcacgg ccacccgttg ggtatgcatc cgcatatccc agcctgccgg 1320 caccttccgc agctcccgtg cccgcgccgc gaaaaccacg cgatacgtgg tcggccgtgg 1380 tgatgagcaa ggacccgaaa ctttcgggta aagaggttgc tgaaaaggtg cgccgggaga 1440 ttgctccttc gctgggagtt cgcctgcacg aggtgagagg gcttgcgcgt ggcggtgcca 1500 ttatccgcac cccctcgtca ggtgagatca agagggtcgt agccagcaag aagttcggcg 1560 agattggcct cgaggtcaag ccgaacgctg tgcaacggcc caaactggta gtccaggatg 1620 ttgctaccca gatcgctccg gaggagttta tggcggagtt gtatgccaac aacctccgag 1680 aacacatccc ggaggcggag ttccgcaagg ctgtccatct ggacaccaag ccttggacgg 1740 ttgccgacgg agctacggtg agtgtcacgt tggagtgcga acccaaggtg ctagatctgc 1800 tggaaggtgg gcggatctac attaagtggt tctcctaccg atgcagggcc cttgtgcgga 1860 cgtacgcgtg ccacaggtgt gttggcttca accacaaggt ggcagagtgc aagcagaagg 1920 acaatgtctg caggcaatgc ggacagtccg gccacagtgc gcgcagttgc accaatccgg 1980 tggactgccg gaattgccga ttctacgggc atccctcggg gcacagcatg ctgtccccga 2040 gctgtccggt gtatgcggcg gtgctagcga gggtgaattc tagacattaa tgtttaggtt 2100 catccaagca aactgtggtc gaggtcgggc tgcgactctt gagctcgcag tccgcctgag 2160 ggagtccgga tgtctgtttg cactgctgca ggagccatac gttggtagcg ggacgagtga 2220 tgtgctgcct gaaggaatga gaatatacac cgatcggaga caaaaggcag ccatcctcgt 2280 ggatcaccag gatgttatct gcatgccgat ggagccactt accaccgact acggcgtatg 2340 cgtgagtgta aaagggagtt ttggctcaat cttcctttgc tccgcatatt gccagttcga 2400 caccgagcta gagccgtacc tcaggtacat ggatgcggtc ctgctgcagg ccagcagaac 2460 ccccgcaatc ctgggactcg acgcgaatgc agcatccccc atgtggttta gcaaactccc 2520 tcggaacccc gagggacatg ctaactacaa ccggggtgag ctgctgtctg agtggatgct 2580 ggagaatgga gccgttgctc tcaaccagcc cagcccggtg ttcacgttcg atacctaccg 2640 tgcgcgtagc gatatcgacg tgacaattgc caacgacgca gcatcgatgt gggccacatt 2700 cgagtggaga gtggacgagt gggagttgag tgaccataac attatcactg ttgtggttac 2760 tccaacgact acgcgcgcag ttgagagcct agctcctgtg ccgtcctgga acatctccaa 2820 tgctcgctgg cagttgttcg aacaggagat ggtgagtcgg gcagccgata ttccggaaga 2880 cttctcacag tcgccgttgg atcagcaagt ttcgaccctg cgctgcatgg tccacgatgc 2940 gtgtgacctt gcaatcggaa ggaggacgcc gagatcgcct aggagaagga agccaggttg 3000 gtggaccgcc aacctgaaca cggcgaaacg cgaagtccgg agacttcgcc gccggcttca 3060 aaacgctcgc cgtcgaggag acgacgatgc ggctgagctg attgtggtcg cgctgaggca 3120 agcctcagac gactacaaga agctcatcct aagggcgaag gaggacagct ggagacgctt 3180 cgtgggagaa aacgcacacg atccctgggg gcgcgtctac aagatctgcc gaggacgcag 3240 aaagtgcacg gagattgggt gcctccgcgt tgatggcgag ctggtcaccg attggggtga 3300 ctgtgcgcga gtgctcctcc gtaacttttt cccagttgcg gagtccgatg caccgattgc 3360 cggagcggag gaggtcccac cggccctcga ggaagtagag gttgatgctt gtgtcgccag 3420 gttgaagagc cgtcgctctc ccggcatgga cggcatcaat ggcactatct gcaaagcagt 3480 gtggcgcgcc atacctcagt atctgacggc gctttattcc cgttgcatcc agtcgggtta 3540 cttcccccga gagtggaagt gcccacgtgt agtggcgcta ctcaaggggc ccgacaagga 3600 taggtgcgaa ccctcgtcgt accgtgggat atgcctgttg ccagtatttg gcaaggtgct 3660 tgaggccatc atggttgatc gtgtgagaga agttcttccg gaaggatgca gatggcagtt 3720 tggatttcgc caaggacgat gtgtggacga cgcttggagg cacgtcaaga gcagtgtcgc 3780 tgccagcccg gcacgatacg tgctcggcat cttcgtcgac ttcaagggtg cgttcgacaa 3840 tgtcgaatgg agtgctgcac tgcgccggct cgccgacttg ggatgccgtg agttgagctt 3900 gtggcagagc ttcttctcag acagaagagc agtgatccga agcagttctg gaactgtgag 3960 tgttccagta actagaggct gccctcaggg atccattagt ggcccattca tttgggatct 4020 gctgatggac gtcctgctgc ggcgcctcca gccatactgc gcttttagcg cgtatgcgga 4080 tgacctgctc cttcttgtcg aaggcaactc ccgagccgtg ctggaggata aaggagcgca 4140 actgatgtcc atcgttgaag cgtggggtac ggaaatcggt gttaccattt ccaccagcaa 4200 gacggtgatt atgctgctga agggagccct cgcgcgcacc ccgttggtga gatctgccgg 4260 agcaaatctc ccatatgttc gcagctgccg gtaccttggc atcacggtca gcgagcgcat 4320 gaatttcttc atgcacatcg catcgctgcg tcagcgaatg accggagtcg tacaagcatt 4380 ggcgcgtgtg ctgcgagtcg actggggatt cagtcctcgg gccaggcgga ccatttatgc 4440 tggactcatg gtgccttgtg cactatttgg tgcctcggtg tggtatgtcg tgacgacgag 4500 gcaagttgta gctaggaggc gacttcttgc ctgccagagg ctgatccttt taggatgcct 4560 cccggtatgc cgtacagtgt caaccctggc actgcaggtg cttgctggag ccccgcctct 4620 tgacttggct gctatgaaaa tagccgtcaa gtacaagttg aagcgtggat acccgttgga 4680 gggggacgac tggctctatg gcgaggacct ttcaggtctg agctggacgc aacgagtctc 4740 gcgggtagac gaatgccttc tgtcggactg gcagagcaga tgggatgatg gtgattcgcc 4800 tggtcgggtg actcacaagt tcatcccaga tgcagggttc gtctatcgga atccggattt 4860 cgggttctcc atgcgcgcgg gattcttgtt gacagggcac gggtcgttca atgcatttct 4920 gcatagaaga gcccttagcg atactgctgc atgctcatgt ggcgatccta atgaggactg 4980 ggaacacatt ttatgtgctt gcccccttta tgcagacttg cgagacctcg atggactcgg 5040 tttgcagcac gttggcggaa cctggacgtt tggaaggatt ttggagagcg gcgacaggac 5100 cagaagactc acggagtttg ctgaggaggt gtttcggagg aggaggggct tgcagtagcc 5160 catttttctc gccgtgtggt agcggcgtag aatactgcca cagctcccca tagcttgtcg 5220 taggaggcga ctaatatggc aaggttcccc atccgagctt gtcggagcta aaggggtggg 5280 tcaccgagcc cacaaacttc ggtaccacgg gttggatagt gtccaagcac taccatttga 5340 ggtaggcccc cttgtgggag tatcgtggtg gctgtggttg atacccaaat cgcgggtaga 5400 gtcctcggac tcgacgtgga gttgcgttat acaactcggg cgctgtgacc catagatcag 5460 tagaggtgat agatacactt cgctcctcac caaggggaag tattctgtcc gactcgcaga 5520 tacttaaatt ggtaccgggc tagttgctat gtatttgcta tagcttctat tccggggcgt 5580 tggcaggcgt accatccgtg cccatgcact atatgcctac tcgtaggtat attcgagtgc 5640 cgtggttgta atcccttcag tgtggaacac gccacgttaa acaagatcgg agagatccga 5700 gacatacacc 5710 // ID Gypsy-2-I_LG repbase; DNA; INV; 7191 BP. XX AC . XX DT 07-APR-2009 (Rel. 14.04, Created) DT 07-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon -. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; 4-bp TSD; Gypsy-2-I_LG. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-7191 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Lottia gigantea."; RL Repbase Reports 9(4), 930-930 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 100..942 FT /product="Gypsy-2-I_LG_1p" FT /translation="MGEEFFQTLSNKLSTLTETISAQGVTNIITPFDGEPT FT KFRDWVKSIEKYAVLAKIDTAQMKLIAFQTSKGIVSNFIQRYIESNPDSDY FT LQLKQELSIRFSEVIDPLHALALLRKVKQNSNENVQCFGERLLLLATEAYT FT GPSTVEVERQLIGFFVDGLKENYLKLRIMRENPTTLQEAVTSAMTEQNLRK FT RFHLRTGNNASTHASSSSSPEPMEVDHFRQKRCYNCKKLGHVSKNCRARQV FT NAVDNNYNKQKVKCWNCGAFGHISRNCKKPNNANNRQGN*" FT CDS 918..4838 FT /product="Gypsy-2-I_LG_2p" FT /translation="CKQSTGKLGNSSCNSADKKSNKNGNHSESNNQSFQIN FT LAGNPSSCVIKLEKQTFRSLVDTGAEVSLMHSRVYKNLKFKPKLKKEKITL FT FSVNNKQIDMEGSAIFDFRLGGKKVSHKFYVVSSMNRNIILGRDFLVEKKI FT RLYFDLKSLRYEGTYIPLEEDIHISSLLRLSAELIVKPQTYNIGMARLKGD FT NQTTGTLLVKSYDKGFIGGQPGLMVVNSIVPVNNDNEVNARVPIAIVNESN FT KTIRLKRECIVGIAQTIENGQICSIESTKTREHNPNNDDLFKDLDCDHRHK FT DRLSKFLTLHRDIFPSSDTELGQTDTVEMTIDVGSNDPIKLRPYRTPLNKR FT DTVGKAIDDMLDAGIIRRSCSPYSFPVVIVDKKDGSKRFCVDYRALNKIIK FT VNSYPLPLIDDVLALLNSAKMFSTLDLRAGFWQIKVNEADKEKTAFVCHKG FT LYEFNVMPFGISNSPGIFQQLMSIVLDGLDKFAVAYLDDILIFSKSPEEHL FT KHLEIVFNRLREHNLRLKLKKCTFMKEETKYLGFKITTEGLMADPDKVLAI FT RGMSAPTSVKEVRSFIGAMSYYRRIIPNMSKIAEPLIDLTKKYAKFNWTDK FT CQKAFDYLKDSLTTVPMLGYPNADLPYVLYTDASDTCIGACLTQVVDGIVS FT HRLGKTQCKWAVCEKEAYAISYSLQKLDYYLHNAQFVIRTDHKPLKYLLES FT PMQNKKLQLWALNISGYNCKIEYIPGETNTVADMLSRIPSDSTDIRQEDNS FT DDEPEISNNTFEINVINSNKFNPGDYDDCDDNVSTKPTNTVEINDGLTNID FT MVVEQTKDEDIELVKQRLTQNQNQSSTNNKYIIMEDLLYFISDPDGEPQTR FT LYIPKQLKKEVIKQYHDLNSHPGIDKTYATIKVKNFWPKMYKELYEYVDTC FT VTCQTRSTMKNKPDTIETGIPPYPMAKLSLDLSGPYPTTLSGNKYIISFVC FT WYSGWVESFPVKDKTAENVAHLIIEEIFPRFGCPLAIVTDNGTENENRIVK FT ETLDKLNIKHILTSYYSPKSNARVERNHRTLHDMLAKRVSSNHSTWDIFLN FT QCLGAIRFNVSKTTKQSPFSLLYSRDPVLPLDNILKPRTKYYGEDHHLLAL FT ENQHKSFIQVHRYVKKIKSKQSEYLNKNTKNVKYKIGDPVFYKNNRRSNKL FT DNRWTPYYRIVEQKGPVSFIIRNQLTGQTTKTHADKLRLANIDQWDIPTND FT SKVRKSKYVVPPSDTSDSDQSEHVDFSDSDSPDDDKPLARLVDKYRRERDN FT SSSEEDDIPLFELTQRLQQKARENEMNETDDEESSSCDMSDADIN*" FT CDS join(5233..6609,6613..6837) FT /product="Gypsy-2-I_LG_3p" FT /note="putative env protein." FT /translation="MKTVLTMLVVCIVILPTTDSILTYKNTLFRPVNRIAT FT SRSRWLLTLVHDIRPYLDFLRQLVKRHWTCFMDSRQYNSQLLYPRQFLCRI FT WTHKIIIILNIPLLDNDDHYIIYKVHSLPVPMMTNHVLSKSKGTSRFVAQY FT KLETTTLAVSVDSTKYALITDNESKQCTNPHMNFCQIKSPIFPINLSKLCV FT MALFSKNEIKASSRCDVSVKPNEILPTSVYFTNGYWAVVVRERTRFSIVCR FT DTHLEHSSIVVKPPVQIIHLKPTCKAVNDHVTLPSHNTNKSTYFDNTSFKH FT LLTNFNESITDIWKPITDHFGNYTLTKLPPKLKGTETIPIEKLVSELDDLH FT PIENLKKQTWPLWVWVTLCVVSAVGILILYIVVKLYMKPLMLKFKSPWRRS FT DRKGENSGKVPTDQSTEAIPMVTMNQPSAPDDERLYPDIPRATTLHSLLNL FT AQQQRAIQKAETNILHRIQRPTTHPILYSQTSLIQYLSHLSAQSSSSYVNC FT NVPFVSRQLNPIDMVDISREQTQYGIHDIIHIQSQIC*" XX SQ Sequence 7191 BP; 2578 A; 1387 C; 1301 G; 1925 T; 0 other; cttgccgaat tggtgcctta agaccaggat tctgaacggg aaaacaaaga cagaaaggac 60 acaccactgt aagtaacctt tatttctatt accgtcaaaa tgggagaaga attttttcaa 120 actctgtcaa acaagttgag tacattgaca gaaacaatct cggcacaggg agtaaccaac 180 ataatcactc cgtttgacgg agagccaacc aaatttcgag attgggttaa gtctattgag 240 aaatatgccg ttctcgccaa aatcgataca gcacaaatga agttgatagc atttcagact 300 tcaaaaggaa ttgtgtctaa cttcattcag cgatacattg aatcaaatcc ggattcggat 360 tacctccaat tgaaacaaga gttatcaatt cgatttagcg aggtcattga ccctctacat 420 gctttggcgc ttttacgaaa agtgaaacaa aattcgaatg aaaatgtaca atgctttggc 480 gaacgattgc ttcttttggc cacagaggca tatacgggac catcaacggt cgaggtggaa 540 cgacaattga tagggttctt tgtcgacggt ttgaaggaaa attaccttaa actaagaatc 600 atgagagaaa acccgaccac cctacaagaa gcggttacat ccgccatgac cgaacaaaat 660 ttgcgaaaac gatttcatct tagaacagga aataatgcct caacgcatgc ttcatcgtct 720 tctagcccag aaccaatgga agtcgaccac ttccgacaaa aacgatgtta taattgtaaa 780 aaattaggcc acgtaagcaa aaattgtcgg gcacgccaag taaacgcagt tgataataat 840 tataataaac agaaagttaa atgttggaac tgcggagctt tcggccatat aagccgaaat 900 tgcaaaaaac ccaataatgc aaacaatcga cagggaaact agggaactct tcttgtaatt 960 cagccgacaa gaagagtaat aaaaacggaa atcattcaga atcaaataat caatcttttc 1020 aaataaattt ggcaggcaat ccaagctcat gtgtcatcaa attagaaaaa caaactttta 1080 gatccttggt tgacacaggc gcggaagttt cattaatgca cagtagagtg tataaaaatt 1140 tgaaatttaa acctaaatta aagaaagaga aaataacttt attttcagtc aacaataaac 1200 aaattgatat ggaaggcagc gcaatttttg attttagatt ggggggtaaa aaagtatcac 1260 ataaatttta tgtagtttca tcaatgaata ggaatataat tcttggtaga gattttcttg 1320 tagaaaaaaa aataaggctg tattttgact taaaaagttt aagatatgag ggtacctata 1380 ttcccctcga ggaggatatt catattagtt cactgctgag gttatcggcc gaactaatag 1440 tgaagcctca aacttacaat ataggtatgg cgagactaaa aggcgataac cagacgacag 1500 gtacgctcct tgtaaagagc tacgataaag ggtttattgg aggacaacca gggttaatgg 1560 ttgtaaattc tattgtacct gttaacaatg acaatgaggt caatgcacga gtacctattg 1620 ccattgttaa cgaatcaaac aagaccataa gactgaaaag agagtgtatt gtaggaatag 1680 cccagacaat agaaaatgga cagatttgtt caatagagtc aacaaagacc agagaacata 1740 atccaaacaa tgacgatttg tttaaggact tagactgtga ccacagacac aaagacagac 1800 tatcaaaatt cttgacctta catagagata ttttcccctc gagtgatacc gaactaggtc 1860 agacagacac ggttgaaatg acaattgacg taggttcgaa tgaccctatt aaacttaggc 1920 cctatagaac tcctttgaat aaaagagata cggtaggaaa agcaatagat gatatgttgg 1980 atgctggaat tatccgacga tcttgctcac cgtacagttt tcccgtcgta atagtagata 2040 aaaaagatgg atcgaaacgg ttctgtgtag attatagggc tttgaacaaa atcataaaag 2100 tcaattctta cccacttcct ttaattgacg atgtattagc cttacttaac tctgcaaaaa 2160 tgttcagcac acttgattta cgagctggat tctggcaaat taaggttaat gaagcagata 2220 aagaaaaaac agcctttgtg tgccacaaag gactctatga atttaatgtc atgccattcg 2280 gaatatctaa cagtcctgga atttttcaac agcttatgtc catagtactt gatggactag 2340 acaaatttgc agtcgcctat cttgacgaca ttctgatctt ttccaaatca ccagaagaac 2400 atttaaaaca tttagaaata gttttcaata gattaagaga gcataatctc cgactaaagc 2460 tcaaaaagtg cacattcatg aaggaagaaa ctaaatatct tggcttcaaa atcactacgg 2520 aaggtctaat ggcagaccca gataaggtac tggctatacg tggaatgtcg gccccgacat 2580 cagtcaaaga agtgaggtca ttcataggag caatgtcata ttatcgacgt attataccaa 2640 acatgtccaa aattgcagaa ccattaatag atttgactaa aaagtacgct aagttcaatt 2700 ggactgacaa atgtcaaaag gcctttgact atctaaaaga cagtttgaca acagtgccta 2760 tgttaggata tcccaacgca gacctgccat atgtactcta cactgacgca agtgacactt 2820 gtatcggcgc atgcctaaca caagttgtag acggcattgt aagccaccga ttaggaaaga 2880 cccaatgtaa gtgggcagta tgtgagaaag aagcttacgc aatttcatat agtctccaaa 2940 aattagacta ttatctccac aatgcacagt tcgtgattag gactgatcat aaaccactta 3000 agtatcttct cgaatcacca atgcagaaca aaaagttaca actctgggca cttaatatat 3060 ctggttataa ctgcaaaatt gagtacatac ctggtgagac taatacagta gcggatatgt 3120 tatcacgcat accaagtgac tcgacagaca taagacaaga ggacaattca gacgatgaac 3180 cggagataag taacaatact ttcgaaatta atgttataaa cagtaacaaa tttaatccag 3240 gtgactatga tgattgtgat gacaatgtct ctacaaagcc gacaaacacg gtcgaaatta 3300 acgacggttt aacaaatatt gacatggtag tagagcagac aaaagacgaa gacattgaac 3360 tggtcaaaca aaggctaact caaaatcaaa atcagtcaag tactaacaac aagtatatta 3420 taatggaaga cctattgtat ttcatatcgg accccgacgg tgaaccgcaa acaagattat 3480 atatacccaa acaattaaag aaagaagtaa ttaaacaata ccatgatttg aactctcacc 3540 caggtattga caaaacttat gctacaataa aagtgaaaaa tttttggcct aagatgtata 3600 aagagctcta tgagtatgtt gacacatgtg ttacctgtca gacgaggtca actatgaaaa 3660 acaaaccaga tacaattgaa accggaattc ctccatatcc catggcaaaa ttaagtttag 3720 atctatcagg accatatccc actacattat caggcaacaa atacataatt agctttgtat 3780 gttggtattc aggatgggta gaatcattcc cagttaaaga caaaacagca gaaaatgttg 3840 ctcacctaat aatagaggaa atatttccaa gatttggctg ccctttggca atcgtcacag 3900 acaacggtac agaaaacgaa aatagaattg taaaagaaac cttagacaag cttaacatta 3960 agcatatatt aacttcctat tattcaccaa aatcaaatgc tagagtagaa agaaaccacc 4020 gtactcttca tgacatgtta gcaaaacgag taagctcaaa ccattcaact tgggatatct 4080 ttcttaacca atgtttaggt gcgattagat ttaatgtaag caaaacgact aaacaatcac 4140 cattctcttt gctttattct cgtgaccccg tattaccatt agataatatc ctaaaaccca 4200 gaactaaata ctatggtgaa gaccaccact tgttagcact tgaaaaccag cataaatcat 4260 ttatccaggt tcaccgatat gttaagaaaa ttaaaagtaa acagtctgag tatttaaata 4320 aaaacactaa gaacgtcaag tataaaatag gtgatccggt attctataag aacaatcggc 4380 ggtcaaacaa actagataat aggtggacac cctactatag aatagttgaa caaaagggac 4440 ccgtctcttt tataataaga aaccagttga ccggacagac cacaaagaca catgccgaca 4500 aattacgctt ggcaaacatt gaccaatggg acattcctac taacgactca aaggtacgaa 4560 aaagtaaata tgtagtaccg ccctctgaca caagtgattc agaccagtcg gaacatgtag 4620 acttttctga ctctgattca cctgatgacg ataaaccact agcacgacta gttgacaaat 4680 ataggcgcga aagagacaat tcgagctctg aagaggatga catacctctc ttcgaactta 4740 cgcaaagatt gcaacaaaaa gcaagggaaa atgaaatgaa tgaaacagat gatgaagaaa 4800 gctcatcatg cgacatgtca gatgccgata taaattagca tgcaaaagag atagtttagg 4860 atgatgggaa gggccctcct tgatttcttg aagaaactat cctggtatgc actgacataa 4920 tcattagatc aatgatgcag ggataatgca ttagatcttg tcaatgatga aaatgtttac 4980 tatacgtggt ttgcgaagac gtagacaatt atttatccct tgaaacaaca gatccttctt 5040 gcacggataa tgcaacagat cttttgtgaa atatttatac ctgctgtccg cgaataacag 5100 ataatgtagg ctattttata acttaatgtc tattacttgt aaggcgagac ctgtaatgga 5160 cgttgtttgt gtcctttttc ataaggatta atatggactt aatcatctga cccgatattt 5220 ttcaggtcga agatgaagac agttcttaca atgctcgttg tatgcattgt gatcctccca 5280 acgactgaca gtattctgac atataaaaac accttgttca gacctgtaaa tagaattgct 5340 accagtcgat caagatggtt gttgacctta gttcatgaca taagacctta tttggatttc 5400 ttaagacaac ttgtcaaacg acattggacg tgctttatgg acagtagaca gtataatagc 5460 cagctattat acccaagaca attcctctgc agaatatgga ctcacaaaat cattataatt 5520 ttgaatattc cgttactcga caatgatgac cattatataa tttacaaagt acattccctc 5580 ccagttccaa tgatgactaa tcatgtttta tcaaaatcaa agggcacatc ccgatttgtg 5640 gcacaatata aattagaaac aactacgctt gctgttagtg ttgatagtac gaaatatgct 5700 ttaataacag ataatgagtc caaacaatgt acaaatcctc acatgaactt ctgtcaaatt 5760 aagagtccaa tctttccaat caacttgagc aaactatgtg ttatggcatt attttctaaa 5820 aatgagataa aggcttcaag taggtgcgat gtttcggtaa aacctaatga aattttaccc 5880 actagtgtgt atttcactaa tggttattgg gctgttgttg tgcgtgaacg aactcgattt 5940 tcaattgttt gtcgggatac tcacctcgaa cattcctcta ttgttgtcaa accacctgtt 6000 caaattattc acctaaagcc gacatgtaag gcggttaacg accatgtaac cttaccctct 6060 cataacacga ataagagtac ttattttgat aacacttcct tcaaacattt attgacaaat 6120 tttaatgaat caattaccga tatttggaaa ccaattaccg atcactttgg caattataca 6180 cttactaaat taccacccaa attaaaaggg actgaaacta tccccatcga aaaattagta 6240 tcggaattag atgacttaca cccaatcgaa aaccttaaaa aacaaacatg gccattgtgg 6300 gtgtgggtta ctctgtgtgt cgttagtgct gttggtatac tcatccttta tatagttgtc 6360 aaactatata tgaaacctct tatgttaaaa ttcaaatccc catggagacg ctctgataga 6420 aaaggtgaaa actctggtaa agtccctact gatcaaagta cggaggcaat accaatggta 6480 acgatgaacc aaccaagcgc cccagatgat gaacgcttat atcctgatat acctagagcc 6540 actacattgc attctctcct caaccttgct caacaacaga gagccatcca gaaagcagag 6600 acaaacattt aacttcatcg tatccaaaga cctacaactc acccaatact ttattctcaa 6660 acgtcactaa tccagtattt gagtcatctt tcagcacaat cttccagcag ctatgttaac 6720 tgcaacgtac catttgtttc aagacagcta aatcctattg acatggtaga tatatccaga 6780 gaacagacac agtatggtat ccacgacata atacatatcc aatcacaaat atgttaatca 6840 cagaatatgg tagatacccg attaccacgc agaagcgaaa aactagctac agtagacttc 6900 ttgtatccag ccaacgaatc caacatgttg aacaggaacg atgtacacta gctaccaaca 6960 acgaactgaa accaatgata tggtagaccg tacgtgacag acaacagata atcataatgt 7020 gaatcagtga ccaaagccac tcgagtgcta attcaaatgg gagttgaact gtgaaagaag 7080 ttgacagggt caactgttgt atataggaca attagcttat taactctaga cgaactttat 7140 tgtgtatttc attattctta cccttgtttc acacttgaaa caggggcgtt g 7191 // ID Jockey-11_AAe repbase; DNA; INV; 3782 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-11_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3782 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1377-1377 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with 89-93% CC identity. XX FH Key Location/Qualifiers FT CDS 35..913 FT /product="Jockey-11_AAe_1p" FT /translation="MVHTKPEYDRVCANLKDAGVEFFTHDIPGDKPFKAVI FT RGMDNTDPKRIQAELRNRYKLDATAVYRMARRDELTKMYPDCLYLVHFRKG FT SVSLNALQAVRTIGSIIVXWEPYRGGRRDVTQCQRCLNFGHGTRNCHLRPR FT CSICAESHDSAACPRNGAAVDAVAFKCANCGEAHQGSDRRCPKREAFKQLR FT KAASTSNQPGRRKEKAPAFRPEDFPPLPAGGNFPQVPPTWPRRRPPPAAAN FT QPSGEARMNPSIRRGAGHAGLRGAHHQAILQDPVRAAACCSLHYQQIWGLS FT R" FT CDS 927..3269 FT /product="Jockey-11_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MERLLCKGKTIELADFVGKHSIDVGLITETHLKSGDS FT FWLPDHNIIRLDRTNSRGGGVAIIVKKGIKFSILPHIRVSVLEALSVEVET FT STGSIRCTAVYCPRQCTDSNGLGTSFKNDLTALTRTNSRFVIGGDLNARHE FT AWRNYRRNQNGKLLFDHEQHGLYTVQFPDEPTFISPAGNPSTLDFFLANVQ FT LSKPVALNDLSSDHQPVVAEINVNVASAPSYLRKDYHHVNWVAYARMVDRR FT VDENQPLDTEEDIDRALEGLHQAITVADETCVRRVPVRGKFVAIDSHTQML FT IRLRNIRRRQFQRTGDLSKKRETAYLNRQIASRMASIRNESFGRVIQNLDD FT RSRPFWKVAKVLKTHPKPVPPLKVGDSLLITPLEKANAIGDHIASSHLLGS FT QIHSPMEDQVAQCVHDIDTSPCAVPPEDKITSEQISSALKQTKNMKAPGFD FT GIFNLILKHLNIKVYELLSAIFNRCLELHYFPSSWKVAKIIPIRKPGKDPT FT LPSSYRPISLLSALSKLFEKLILNRLVKFVDERNILLPEQFGFRKGHSTTH FT QLVRVMNTIRSNKAVSKSTAMALLDVEKAFDNVWHDGLVYKLCYFNFPPHL FT IKIIRSYLQQRSFRVSLNGYLSNAFPIPAGVPQGSLLGPLLYSIYTSDIPP FT LGXGCVFFLFADDTAIAVKGRMPTEITNKLQRCLDAFVEYASTWKIKINAS FT KTQAVMFLHRQSYRLKPPPNCTVTMDGTRVEWSSEVXYLGLLFDDKLLFRS FT HVDRTLTKCSALNQKPVSADCPQISPLEGE" XX SQ Sequence 3782 BP; 977 A; 1086 C; 913 G; 798 T; 8 other; ggaattcaaa ctgaccagga tcggcaccaa ggtgatggtg cacacgaagc cggagtacga 60 tcgggtgtgt gccaacctga aggacgcagg ggtggaattt ttcacccacg acatcccggg 120 cgacaaaccc ttcaaggccg tcatcagggg catggacaac accgacccga agaggatcca 180 agcggagctt cggaaccgmt acaaactgga cgcaactgcc gtctacagaa tggctcggcg 240 tgacgagctt acgaaaatgt acccggattg cctctatctg gtgcattttc gtaagggctc 300 ggtttcgctg aacgctctcc aggcggtgcg caccatcggg agcatcatcg tcmgctggga 360 accctatcgc ggaggccgcc gcgatgttac ccagtgccag cggtgcctca actttgggca 420 cggcaccaga aattgccacc ttcgacctcg ctgcagcatc tgcgccgaaa gccacgactc 480 tgctgcctgc cccaggaatg gggccgcggt ggatgcggtg gcattcaagt gtgccaactg 540 tggcgaagca caccagggct ccgatcgtcg gtgcccgaaa cgagaagcat tcaagcaact 600 ccgaaaggcg gcttcaacat caaaccagcc gggacgaagg aaggaaaagg ctccggcgtt 660 caggcccgaa gatttcccgc cgcttccagc tggcggcaat ttcccgcagg tgccccccac 720 ctggccgcgt cggcgtcccc cccctgccgc tgcgaatcaa ccgtccggtg aggcacggat 780 gaatccctct atccgacgag gagctggtca tgctggtctc cgaggtgctc atcatcaagc 840 gatcctgcag gacccggtca gagcagctgc gtgctgtagt ctccattatc agcaaatatg 900 gggactaagc cgctgaccat cgccacatgg aacgcttgct ctgtaagggc aaaaccatcg 960 agcttgctga tttcgtcggc aagcacagca tcgacgtggg gctcatcacc gaaacccacc 1020 tcaaatccgg cgacagtttc tggctgccgg accacaacat catccgcctt gatcgcacca 1080 actccagggg gggcggcgtt gcgatcatcg tcaagaaagg catcaagttc tccattctac 1140 cccacatccg cgtgtctgtc ctggaggcac tcagcgtgga ggtggaaaca tcgacgggga 1200 gcatccggtg cacggcggtg tactgtcctc gccagtgtac cgatagcaac ggcctgggaa 1260 ccagcttcaa gaacgacctg actgccctca ccaggacaaa ctctcgcttt gtcatcggtg 1320 gcgacctgaa tgcccgccac gaagcttggc ggaactatcg gcgaaaccag aacggaaagc 1380 tgctcttcga ccacgaacaa catggattgt acacggtgca attcccggac gaacccacct 1440 tcatctcgcc ggcaggaaat ccctcaaccc tggacttttt cctggccaac gtccaacttt 1500 ccaagcctgt ggcgctcaac gacctcagct ctgaccacca accggtggtg gccgaaatca 1560 acgtgaacgt ggcctccgct ccatcctacc tgcgtaagga ctaccaccac gtcaactggg 1620 tggcatacgc gcgaatggtg gaccgccgcg ttgacgagaa ccagccattg gacaccgagg 1680 aggacatcga ccgggccctg gaggggcttc atcaggccat aaccgtggcc gacgagacgt 1740 gtgtgaggcg agttccagtg aggggtaagt ttgttgccat tgattcccac acccaaatgc 1800 ttatccgcct gcgcaatata cgtagacgcc agtttcagag gaccggagat ctgagtaaga 1860 aaagagagac ggcctatctg aacaggcaga ttgcttccag aatggcttca atccgcaacg 1920 aaagtttcgg acgtgttatc cagaaccttg atgaccgatc tagaccattc tggaaggtag 1980 ccaaagtcct caaaacccac ccaaaacctg ttccacctct caaggtcggt gattcgcttc 2040 tgatcactcc gctcgagaaa gccaatgcaa tcggcgatca cattgcttca tcccatctgc 2100 ttggatcgca aatccacagt ccaatggaag accaggttgc gcaatgcgtc cacgatatag 2160 atacctcccc ttgtgctgtc ccaccagaag acaagatcac ctctgaacaa atctcctcag 2220 cgctgaagca aaccaaaaac atgaaggccc ccgggtttga tgggatcttc aacctcatcc 2280 tcaaacattt gaacatcaaa gtttacgaac ttttgagcgc catcttcaac agatgcttgg 2340 aattgcacta cttcccaagc tcctggaagg ttgccaagat cataccgatc cgcaaacccg 2400 ggaaggaccc cactctacca tcgagctata ggcctatcag cctgctctca gcgctgagca 2460 aactctttga aaagctcatc ctcaaccgcc tcgtcaagtt tgtcgatgaa cggaacattc 2520 tcctgccgga acagttcggg ttcaggaagg ggcattccac aactcaccaa ctcgtgaggg 2580 tgatgaacac catcagaagt aacaaggctg tatccaagtc cactgccatg gccttgttag 2640 acgtcgagaa ggcgtttgac aatgtatggc atgacgggct tgtatataag ctttgttact 2700 tcaactttcc acctcacctg atcaaaatca tccgaagcta ccttcagcag aggtcgttca 2760 gggtatcgtt gaatggctat ttgtctaacg ccttcccaat accagcagga gtgccgcaag 2820 gcagcctgct tggtccgctg ctttacagca tctacacatc cgacattcct ccccttggcg 2880 akggctgtgt gttctttctg ttcgcagatg acaccgcaat agccgtcaaa gggaggatgc 2940 ctaccgaaat caccaacaaa cttcaacgat gccttgacgc ctttgttgag tacgcttcca 3000 cctggaagat caagatcaat gcctccaaaa cccaggctgt gatgttcctc catcggcagt 3060 cctacagact gaaaccccca ccaaactgca ctgtgaccat ggatggcaca agggtcgagt 3120 ggtcgtcaga ggtastgtac ctcggactgc tgttcgacga caagctactg ttccgctcgc 3180 atgtcgatag aaccctgacc aaatgctccg cccttaacca gaagcctgta tccgctgatt 3240 gcccgcagat ctcgcctctc gagggcgaat aaactggcgg tctacaagcw ggtgatatcc 3300 cctgcactac tstacgcagc tccggtgtgg ggwcaatgcg cacaaaccca catagctagg 3360 atccaagtcg cacaaaatcg tgcgctaagg atgatccttg atagcccttt tggcacaagg 3420 atcatcgacc tacacgagga ggcaaattgc cccsaattag cgagaaaatt aggaatacga 3480 cagagtcttt taagcagaaa tgcataatat ctgaacatgc tctaataagc gccttgtata 3540 tagtagatta ggttagtttt aaggtgtaaa tagtagtttt aagtttgtat atattttcca 3600 aattccacca atcccacatt tgtggtgaac ttaatatttc attttatttc taaaaaacca 3660 ttattatatt tcaaatcgac caccaaagat gaaaaggtat actcctgtat cacttaaaac 3720 taaatatgta atccacaaaa acatgcaata aagattattg aaagtcaaaa aaaaaaaaaa 3780 aa 3782 // ID GLT2_SM repbase; DNA; INV; 3878 BP. XX AC . XX DT 15-AUG-2009 (Rel. 14.08, Created) DT 15-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Repetitive DNA from Schmidtea mediterranea: consensus. XX KW LTR Retrotransposon; Transposable Element; GLT2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3878 RA Jurka J.; RT "Repetitive DNA from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1911-1911 (2009). XX DR [1] (Consensus) XX CC 4bp TSD. 93% identical to consensus. Several thousand copies. CC Preliminary classification: Gypsy LTR. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 3878 BP; 1386 A; 570 C; 599 G; 1306 T; 17 other; tgttgggcaa aaactgataa aaaccctttt tcttttactt ttgtttagcc agcacgataa 60 cgcctgtgtg gttttgagca gaaaatttaa aattacttaa aaacggaaat ttaaggaaaa 120 ataaaataaa taaaaattta ataaaaattc tgagatataa aaatgtaaaa atttaaattt 180 aaatcaaaaa gaataaaaat tttattaaat tttggaatat tcgagaattc cttgttttac 240 tgcatcaaaa cacacacaca aacaatcacg atatataagg cgaagctggc tgaacaaaaa 300 acactcacta atcaactcta aatcaactct aactctctat tttactccta ctaaaactct 360 tactcttata tttttactag tatatcattg tctcattgtt tggaaaaagt aattttagtt 420 tatttttaat atgattttat tcatgatgat ctttgttatt tttttttatt ttttgttaat 480 tttactatgg aaaagctcgc aagaaacgtc ctttcgtgat cgaggaattc gaccgtgttg 540 ttgaacctcc atcatcaaag gatttgttct cgagacaaaa tgatttaacc ttcgacgaca 600 ataatcgatc tacgacttac ataatagaat tncatgcata caagtcaata aaatgaaatt 660 attttttcgn taccccggat cgaaccggtc gttagtgggg gaagccgcgg tgccggtagt 720 aaccaggcct gcccagtatt ctccccttcc taaatcctat cgcccgaccc tcgaaagagt 780 tgagtcgagg tcctgggaaa ggctaaatat aaagttataa catagaaaag gaaagatata 840 ccgactagtt tatagaagag tcagaaagaa tacattttat tttttgtaaa caacaatgtt 900 attttagcat aaatgtatga cacagctcac taaaatccgc gactaaccac attaggaagt 960 caaatcggcc atagctcgca ttttcgtatc aaatctacat gccaaatcga cgtttgaatg 1020 ttacccaaag actttacgtt atcgtagtac ttaaatttcg acttttcgct cagttcataa 1080 aattaaatcc acgactccat gaccttaaaa caactattga agtgactatt atactcgtaa 1140 atttattatt aatactaaac atttcattat taattttatc attttatttg gatattatta 1200 tcgttaaaat tattattttt ctggtgttat tgtcgttatt catattgcga attttcgtta 1260 attgttttga attattattc cgagttttca ttatttttaa cattattaat attgacccta 1320 aacgattatt aatattatta acattattat aagcattatg agcaccatac tattattacc 1380 gaaaattcca tatagtacca ttaatattac tttattcata attctgatgt tattaatatn 1440 aattttatta ctattatcct tgtttttcac actaatatta ttgttaatat cantattatt 1500 attattaaaa tattnttatt attaattatt attaaaatat tattattatt attattaaaa 1560 tattattatt attattaaaa tattattatt attattaata ttattagtat agtattgtta 1620 tnactatata gtatcattaa tatcgttaca caatatcatt gaggataaac cagtatccct 1680 agaacgaatc gantattatt attattatta ttattattat tattattatt attattatta 1740 tcattantat tattaattta atattattat tattatcatt antattatta ttattattat 1800 taatattatt aatacattat cattattacc agagctatat agtagcatta taaggatagc 1860 ttattaatta cattagatat acctctgaca ttatcattat tattaccatt attattaacc 1920 ttagtaatat tattaagttc atattaatca ttttattatt ttcattatga atattattat 1980 tattaattat atcatgctta ttattattat tactaaaata ttattattat cattaagtta 2040 ctattattac tattattaat attaatatgt taatattatg attataatta aaatgaaaca 2100 cttacaaaag ttttatttaa aaaagaaaaa gcaaatatca ataaaaattc ccataatcgg 2160 gataaanctc ccaaaatttg cgaggtacaa taagccgcga atctctacaa tttaaagtta 2220 gaaaagaagt aaaaagaaat aagtatacct tacagaatat gtcgacgaaa tagaaacagc 2280 cgcggaggtc gtagaggtcc gtagaattcc tgaaatacct aaaattatta tttaaaagga 2340 aatacaaaat tgtgaccttg gaaacttncc gattgttgac tcgttgctat ggcaacgggt 2400 gaaaagcnaa gagtgaanta ctnaaatgtg gaaaaacaaa cggagantaa aatctaaatt 2460 gttagagaaa taaaatttta attattattt tacaataaaa tttttgtnaa tcattttaga 2520 gtaattaaat ttagaatttc taaatttgaa ttcaataaaa ttttaatttg gtaaattaat 2580 ttttaaaata ggaaaataca aacttcataa tgttattata atgttgataa tgtagtgaat 2640 tacgtaacta cccttggtta tcacaggtat tgtcaatgtt agcgttgcta atgcctgtta 2700 taatgaatcg atcataaaac actagtagat tcctttaaga tatctaagta gaaaaggaca 2760 tacccatcct ttccccatcc agatgacaag tgtttgtttt atttaaaatg attaccatat 2820 attgatatat taatacttat tgtacataga gaataataga catgaagtta agaatgaatt 2880 ttgtttagtg gactctcaca agaaaactct ggaacaattc cgaagggata tagccggtta 2940 ccaccaaacc gcagtaacac aaacgcagag ggcagaggcc atggtgagga agtgtcgata 3000 catggagcag acatggctcc gccccgagaa ggcaatcaag ctgaaagaag agatcgacac 3060 actgaggaat caattagctc agaaggacga agccctgcgg tcggccgaac agaaggcccg 3120 ggagctaggg aaacaggtag aagagtggaa gttgagagac aacatccgga caggcctcct 3180 gcagagtggn gttcacctat ccgaagaaca gcaggcgata gtgctaagaa atgccacagg 3240 taaagacact ggcattggac aaatgataac aggtccatca cctacaccgg aagggttgcc 3300 gaagaggaac cgcagaaaga accggcccaa caaaagagaa agggagcgac aggccagagt 3360 aaatcgtgca cgcctgttgg ggcacatctt ggacgccaat gggttttgat ggcgaaattt 3420 tgcgtttaac tgaatttaac tgaattaaat ttgtagatgt taattattta ttacctatgt 3480 aatccatata tgttaattat ttatcgatta taatataaat tgataacttg ttattaaatt 3540 tgtttgttat tgacgggtag atatgactag agtggttgaa tatggggaaa cagagtgatc 3600 taatgattag ataaattaga aattagataa gatgagaagg ggacaattaa gtaagagcat 3660 gaagtcttct atgacagctc atggacccga ggggggttag ggcattagca gttcttgaga 3720 aggagctcac cttttgctaa ccttccaccc actagttcac agtaccaacg tcatttatga 3780 cgggtgacta gcggttcaaa ttcttatgca ggcgctaggt ggggatgata acccttaggc 3840 agttcaaaac tattttaagg acccatcagt gcccagca 3878 // ID Copia-12_DPu-I repbase; DNA; INV; 4336 BP. XX AC scaffold_242; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_DPu_; KW Copia-12_DPu-LTR; Copia-12_DPu-I. XX NM Copia-12_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4336 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 687-687 (2010). XX DR Genome; scaffold_242; Positions 41532 37197. XX CC 'CACTT' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 41..1339 FT /product="Copia-12_DPu-I_3p" FT /translation="MASSTEEKSMKAVQFNGSNFAVWKFSIFLKLKLRGLA FT PIVEGLRPKLLQAEPEDIARWVRGDIHAMRYIFETCNEEQQQNLLTCDTSH FT DMWLSITSQHQQGTAERRQSLNQSFLNYKFKPKHSVRAHVESIKLLAKNYS FT DAGGIIDDAQICNKVLTSLPPSYDHFRTSWECMTETDKTLIALMTKLCSQE FT ERLNLRTGGQKSPDDKAFFGKTPAHQSTFTSLQNRGRGNSRRFQGGRNRSR FT GSGPPIEGRNVRFADSNGSSRKRGRCNNCGRPGHWADDCWDDPYDEPYNPS FT WKPKEPSSAINLSWRKGKPAKEDTAKVASHHIPSTTEFLLDSGATRNMCHQ FT RHLFRSFKEILPGTRWINGIGQDRVEVLGIGDVSITPIINGDTRPFTLRDV FT LYSPRIGVNLVSVQRLPTTSHTSIFTSQGHSSLAEVFLS" FT CDS join(1339..2361,2365..4215) FT /product="Copia-12_DPu-I_1p" FT /translation="MTATRVEEALYQLNFALSTDVAMLVRSSFKISLRDWH FT ERFAHQHSELIVKMADSNAVTGLNLPSSNRTPLEKCHDCIAAKMTRKVFPH FT STSQQSRIGALVVSDVCGPMQVASIGGARYYVCCRDVFSNYRSIYFVKHKS FT EASDCSRAFIASVHAQTGNLVAIFRTDGGTEYLKLDPWLQRKGIIHQTTCR FT STPQQNSIERDNRTIVSLSRPCLLSNKSLPLSLWAEAASCVVYTLNRSLST FT TCLVKTPFEYWTGKKPDVSNLRKFGSEFYVLVPAEMHQKLDAVGKLCFFVG FT NSPTQKEDRYWDPSTGKVNTSRDVSPIEHRYEPRIPAINVQNGIDVFSTEK FT YGEQQPEDDEPPDEQQPMELGNHALLPDEAEDNRPNAEEEVQPIVPPAVIE FT PTVEPPVLLPRRSQRQGRKKEIVSMLAALDQDDEPNHYRDAIVAHDAPDWK FT IATRKEYDSLIENDTWELITPPPGHPIIESRWTFKVKPATRTREKIFKARF FT VAKGFSQVPGVDYNADEVYAPVIKHDSLRMLLALTTRLDLFLHTLDVKTAF FT LYGDLKEYLLVKQPEGFVQPGTEDLVCHLLRPLHGLVQSPRNWNDKFNFFL FT EKFGLTRSTADPCLYYNRGENSDDYTILGIWVDDGILATKTKEKAAQIIKY FT LESHFKMKSGDADIFVGLEITRDQEKRELYVHQNSYIQTVLKRFRMTGCNS FT STIPADPNSRLTRADCPKNTGRPPMDSTYYRCAVGALLYIGRMTRPDILFS FT VNAASRFCEDPGKPALSAVKRILSYLSGTSDYGIRYDGTQDNTLTAYSDSD FT YAGCPDTSRSTSGMLFMLNNGPISWTSHLQKSVAQSTCEAEYYAAGHASRT FT IVWLRELLDQVGFRQPEPTPLLCDNNSAISMVLNPVFHEKAKHIRTKHHYI FT RTQQENGIVKMIKVPSEDQLADGLTKPQPTAYFQLNRTRIGVRPAPQQVSN FT NLN" XX SQ Sequence 4336 BP; 1213 A; 1256 C; 966 G; 901 T; 0 other; ggttatgggc ccagttacct aggaaccaga ttattacaag atggcaagtt caacggaaga 60 gaagagcatg aaagctgttc agttcaacgg cagcaacttt gctgtttgga agttcagcat 120 cttcctcaaa ctgaaactca gaggtctagc cccaatcgtg gaaggcttga gacctaaact 180 acttcaggct gaaccagaag atatcgccag atgggtgaga ggcgacattc atgccatgag 240 atacatattt gagacctgca acgaagaaca gcagcagaat ctactcacct gtgacacctc 300 ccatgacatg tggctgtcaa tcacgagtca acatcagcag ggcacagcag agaggagaca 360 atctctaaat cagtctttcc taaactacaa attcaaaccc aaacactccg ttcgagctca 420 tgttgagtcc atcaaactgc tagccaagaa ctatagcgat gccggaggca tcatcgacga 480 cgcccagatc tgcaacaagg tgctaacgtc attacctccg agctatgatc acttccgaac 540 atcatgggag tgcatgaccg agaccgacaa gaccctaatt gctctcatga ctaagctttg 600 cagccaagag gagagactaa atctcagaac cggcggacag aaaagccccg acgacaaggc 660 cttctttggc aaaactcccg ctcatcaatc gacgttcact tctttgcaga accgcggccg 720 cggaaactct cgacgctttc aagggggaag aaacagatca agaggtagcg gcccaccaat 780 cgaaggccgt aatgtcagat tcgcagattc aaacggctcg tcacggaaga gaggacgatg 840 caacaattgc ggtcgcccgg gtcactgggc agacgactgc tgggacgatc cgtacgacga 900 accatacaac cccagctgga aacccaagga gccatcgtca gccatcaatc tctcatggcg 960 aaaggggaaa ccggcgaaag aagacacggc caaagtcgca agtcatcata tcccctccac 1020 cactgaattt ctcctcgact ccggagccac acgtaacatg tgccaccaaa gacacctgtt 1080 tcgctcgttc aaagaaattc tcccaggaac caggtggatc aacggaatcg gtcaggatcg 1140 agtggaggta ctcggcatcg gcgacgtatc catcactccc atcatcaatg gcgacacgag 1200 acctttcact ctccgcgacg tcctatactc tccgcggata ggcgtaaacc tcgtctcagt 1260 ccagcgctta ccaacgacga gtcacacgtc catttttacg agtcaggggc attcatcact 1320 cgccgaggta ttcttatcat gacggcgaca cgagttgaag aagcactcta tcaactgaat 1380 ttcgcactct caaccgacgt cgccatgctg gtccgctctt cattcaaaat ttcccttcgc 1440 gattggcacg aacgcttcgc ccaccaacat tccgagctga tcgtgaagat ggccgacagc 1500 aacgcagtca ctggacttaa tcttccttca tccaacagga cgccgctcga aaaatgtcac 1560 gactgtatcg ccgccaagat gaccaggaaa gtgtttcccc acagcacttc ccaacaatcg 1620 cgtatcggag ccctcgtcgt tagcgatgta tgcggcccaa tgcaagtagc cagcatcggc 1680 ggagccaggt actacgtctg ctgtcgcgac gtgttcagca attatcggag catttacttc 1740 gttaaacaca aatccgaggc atcggactgt tccagagcat tcatcgcctc tgttcacgct 1800 cagacaggca accttgttgc catctttcgg accgacggag gaacagagta cctcaaactc 1860 gatccatggc tccaacgaaa aggaattata catcagacaa cttgtcgatc caccccacag 1920 caaaactcaa tagagcgcga caatcgcact atcgtttcac tcagccgacc ttgtctatta 1980 tccaacaaga gtcttcctct cagcctctgg gcggaggcgg cgagttgcgt agtctacaca 2040 ctaaatcgat cactctctac cacctgtctc gtcaagactc cttttgagta ctggaccggc 2100 aagaagccag acgtgtcgaa tctacggaaa ttcggatccg agttctacgt cctcgtacca 2160 gcagagatgc accagaaact cgacgccgtt gggaaattgt gtttcttcgt tggcaactcg 2220 ccaacacaaa aggaagaccg ttactgggat ccctccaccg gcaaagtcaa cacaagtcga 2280 gatgtgtccc caattgagca tcgctacgag cctcgaattc ccgccatcaa tgttcaaaac 2340 gggatcgacg tattctccac ttaagaaaaa tacggtgaac agcaaccgga agacgacgag 2400 ccacccgacg aacagcagcc tatggaactc ggaaaccatg ctctgctacc agacgaagcc 2460 gaggacaacc gaccgaacgc cgaagaagaa gtccagccga tcgttccccc agcagtgata 2520 gaaccaaccg tcgagccacc ggtccttcta ccccgtcggt cccaacgaca gggaaggaag 2580 aaggagatcg tgagcatgtt ggccgctctc gaccaagacg acgaacccaa ccactaccga 2640 gatgctatcg tcgctcacga cgcacccgac tggaaaatag ccacccgcaa ggaatacgac 2700 tccctaattg agaatgacac ctgggagctt atcactcctc ctcccggtca cccaatcatc 2760 gaatcacgat ggacatttaa agtcaaaccg gccactagaa cgagggagaa aattttcaaa 2820 gctcgtttcg tggccaaggg cttttcacaa gtcccaggag tagactataa cgcggacgaa 2880 gtgtacgccc ccgtcattaa gcacgactca ctcagaatgc tgcttgctct aaccacgagg 2940 ctcgatcttt ttctccacac acttgacgtc aagactgcct ttttgtacgg cgacctgaaa 3000 gaatatctac tagtgaagca accagaggga ttcgtccagc ctggaacaga agatctcgtg 3060 tgccacttgt taagaccact ccacggcttg gtgcagtcgc ctcgaaattg gaacgacaag 3120 ttcaattttt tcctcgaaaa attcggcttg actcgatcga cggccgaccc ttgtctttac 3180 tacaacaggg gagagaattc cgacgactac accatcctag gcatctgggt ggacgacgga 3240 attttagcca ccaagacaaa ggaaaaggcg gctcaaatca tcaagtacct agagtcacat 3300 ttcaaaatga aatccggaga cgctgacatc tttgtcggac tagaaatcac gagagatcaa 3360 gaaaagagag aactctacgt ccaccagaat agctacattc aaacagttct caagagattc 3420 agaatgactg gctgtaattc ttcgacgatt cccgccgatc ctaattcacg cctcaccaga 3480 gccgactgtc cgaagaacac tggtcggcct ccaatggatt cgacatatta ccgatgcgca 3540 gtcggtgctc tcctatacat cggtcggatg actcggcccg acatcctctt ttcagtgaac 3600 gcggcctccc gattttgtga agacccagga aagcccgcct tgtcagcagt taagcgcatc 3660 ctttcctact tgtctggcac ctcagactat ggcatccgtt acgacgggac acaggacaat 3720 actctcactg cttacagcga ctcggactac gcaggctgtc ccgacaccag tcggtccacc 3780 agtggcatgc ttttcatgct caacaacggc cccatctcat ggaccagtca cctgcaaaag 3840 tcagtggctc agtcaacctg tgaggcggag tactatgctg ccggccacgc atctcgcact 3900 attgtctggc tcagagaact actcgatcag gtagggttca gacaaccgga acctactccc 3960 cttctctgtg acaacaatag cgccatctca atggtgctga acccggtctt tcacgagaaa 4020 gccaagcata tccgaaccaa acatcattac atacgcactc aacaggaaaa cggtatcgtc 4080 aaaatgatca aagttccgtc cgaagaccaa ctggcagacg gcctaaccaa gcctcaaccg 4140 acggcctact ttcaactgaa cagaactcgc ataggcgtcc gaccggcccc gcaacaagtg 4200 tcgaacaatt taaattaaaa caaaatttca aagttccttc gccctacact cttctctgct 4260 cctctttatt attatttgcc ttcctcttgt tgttccaggg gaagtacggc taaagtgtat 4320 tttaattaag gggaag 4336 // ID Gypsy-15_RP-LTR repbase; DNA; INV; 133 BP. XX AC ACPB02047260; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_RP_; KW Gypsy-15_RP-I; Gypsy-15_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02047260; Positions 145 13. XX SQ Sequence 133 BP; 53 A; 10 C; 21 G; 49 T; 0 other; tgtagggatt gaaatataaa aatatattgt cgtatttagt tttaaaggaa acatatgatt 60 aaaaatttcg aggaccatat gtttccagat tgttgtatta ttctaaaata tgagtaataa 120 aaagtttcta aca 133 // ID Gypsy-99_AA-I repbase; DNA; INV; 4529 BP. XX AC supercont1.75; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-99_AA_; KW Gypsy-99_AA-LTR; Gypsy-99_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4529 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.75; Positions 898779 894251. XX CC 'GTCCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 243..4505 FT /product="Gypsy-99_AA-I_1p" FT /translation="MNAEQFELFMQQQNSVLREIVSSFQNVQVQQRQLPEG FT RQQPQLQMAANSAASSILLPPPLELSGDMEENFDFFSTNWKNYASAMGMDE FT WPQEQDRQKTSILFSIIGSAALKKFFNFELTEEQRRSPDAALEAIKAKVVR FT ERNKIVDWFEFFSLMQQSMELIDDYVTRLKCLAKLCKFGALEEDLIVYKIV FT TSNKWPKLRSKMLTMQNLTEVKVVDLCRAEEIAEKHASAMGACSVEVNMIR FT KKNLKCKYCGDRHQFAKGICPALGKKCNVCGGKNHFAKVCKSDRMQKGKSR FT RKVKKVQEDCSDLDDTDSVSSDNEESECSGDEAVIGKVYDFWRNVMADVEL FT FVDGKWKVVQCELDTGANTSLVGHNWLKEVGGGNQLELLPSKYRLQSFGGG FT AIPVLGEVRLPCKAKNRKYTLALLVVDVPHKPLLSAKVCKKLGFIKFCNSV FT SITSPLPEEQLLNVYRIRAQQIIDQHSSIFEGYGKFPGVVSLEVDPDVAPS FT IQQPRRVPIAMRDKLKKELKKLEQDGLIIKETQHTDWVSNIILVRRNGKDS FT DSIRICLDPIPLNRALKRPNLQFTTIDEILPELGKAKVFSTVDAKKGFWHV FT VLDRSSSLLTTFWTPFGRYRWIRLPFGIATAPEIFQMKLQEVIQDLEGVEC FT IADDILIFGTGDNLQQALVNHNVCLEKLLIRLEENNVKLNKAKLRLCMTSV FT KFYGHVLTTRGLQPDESKVETIKSYPRPQDRKELQRFIGMVNYLSRFIPNL FT SSNFAVLRRLISDKEPWVWTDKEEEEFVRVKSLVADTRSLQYYDVNQPLII FT ECDASSFGLGTAVFQSQGVIGYASRTLTSTERNYAQIEKELLAILFSCVRF FT DQLIVGNPKTIVRTDHKPLVNVFKKPLLSAPKRLQHMLLSLQRYNLEIQYV FT KGKENIVADAISRAPLDGELPEDQFKKRNIYQVFRQLEEVNPSKFLRITDE FT RLTEVIQETARDSSMQQIIRYILEGWPTSVDKVPDGVKIFYKHRNELSHQD FT GIVFRNDRVVIPYRMRRTMTEKVHVSHNGVEGTLKLACANLFWPGMTTQIK FT DAVSQCAVCAKYAASQPAPPMQSVAIPVHPFQLVSMDVFFADYQGEKRKFL FT VTVDHFSDFFEVDLLKSLTPLSRHGKPQLVITDNGTNFVNKEWRQFAIEWE FT FQHSTSAPYHQQANGKSEAAVKIAKRLMKKSEESGVDFWYALLHWRNIPNK FT IGSSPAARLFSRSTRCGLPTSLENLMPKPVQNVPSSIEEARRKTKYQHDKA FT ARNLPILETGSPVYVQLHPESTKHWTPGSVTSKMNDCSYLVNVDGALYRRN FT LVNLKPRKEPDTPVQSTMPSANESSVPLEAPSRDTVEVATFTPEETVSNQS FT SSELNELLVRSDTLAIPMHQPTKLPSIQECESSGRLVRNERPKREKRLPCK FT LKDYDLN" XX SQ Sequence 4529 BP; 1328 A; 902 C; 1143 G; 1156 T; 0 other; tggtgtcaga agcaaaaggc ttcaaaaagt gtttccggcg tcattaaatc gcggaagtga 60 agaaagcgag gaaaaacaca agctcgggga gtgaaaaaat catttgcctt gattttgttt 120 ccgtcgtgcg ctcgtccgaa tgtgagccgg tcggtaccag tgcggcagcc atattttttt 180 cgcgtcttgc gaagtgttgt gtgttttccc ggaattaaat ccaaaacgat ctcggtttcg 240 aaatgaatgc cgagcagttc gagcttttca tgcagcagca aaatagtgtg ctgcgagaaa 300 ttgtgagttc gttccaaaat gttcaagtgc aacaacgaca gctgccagaa gggcggcagc 360 agcctcaatt gcaaatggct gccaactcgg cggcttcgtc catcctgctt cctccacctc 420 tcgagcttag tggagacatg gaggagaatt ttgatttttt ttctacgaac tggaaaaatt 480 acgcgagtgc tatgggtatg gatgagtggc cccaggagca agacaggcaa aaaaccagta 540 ttttgttttc gataatcggt agtgcggcgc ttaaaaagtt ctttaatttc gagttgacgg 600 aagaacaacg gcgttcgcca gacgcggctt tggaagcaat aaaagcaaaa gtagtgcgcg 660 aaagaaacaa aatcgtagac tggttcgaat ttttttcgct catgcaacaa tctatggaat 720 tgatcgatga ctatgtcaca cgtttgaagt gtttggctaa gttgtgtaaa ttcggtgcac 780 ttgaagaaga tttgattgtc tacaagattg ttacctctaa caaatggcca aaattacgtt 840 ctaagatgtt aacgatgcaa aacttgacgg aagtgaaagt tgtggatcta tgtcgagcag 900 aagaaatcgc cgaaaaacat gccagtgcta tgggtgcttg tagtgttgaa gtgaacatga 960 tcagaaaaaa gaatttaaaa tgcaagtatt gtggcgatcg gcatcaattc gccaagggga 1020 tttgccctgc actggggaag aagtgcaatg tgtgtggtgg caaaaaccat ttcgcgaaag 1080 tgtgtaaatc agaccgaatg caaaaaggca aaagccggcg aaaggtgaag aaagtgcaag 1140 aagactgtag tgaccttgac gacacggaca gtgttagcag cgataacgag gaatccgaat 1200 gcagcggtga tgaagcagta attggaaaag tgtacgattt ctggcggaat gtgatggcgg 1260 atgtggagct gtttgttgat ggcaaatgga aagtagtgca gtgtgagctg gatacaggag 1320 ccaacactag tctggttggc cataactggc taaaagaagt aggtggcggt aatcagctgg 1380 aactattgcc ttcaaaatac cggctgcaga gcttcggtgg cggggccata ccggtccttg 1440 gtgaagtaag gttgccttgt aaggcaaaga accgtaagta cacactcgca ttgctagtcg 1500 ttgatgttcc ccataagccg ctcttgtccg caaaagtttg caaaaagctg ggtttcatta 1560 aattttgcaa ctcggtttcg atcacttcac cactcccgga ggaacaactg ttgaatgtct 1620 accggattag agctcagcag atcattgacc agcacagcag catattcgag ggctatggta 1680 aatttccggg cgttgtatcg ctggaagtgg atccggatgt tgcaccgtct atacagcagc 1740 cacggcgtgt accaatagct atgcgtgaca aattgaagaa ggagttaaaa aagcttgagc 1800 aggatggtct tattatcaag gaaacgcagc atacggactg ggtcagcaat attattcttg 1860 ttaggcggaa tggtaaagat tcggattcga ttcggatctg tttggatccg attccattaa 1920 acagagcatt aaagcgacca aacttgcagt ttacgacgat cgatgagata ttgccggaac 1980 ttgggaaggc aaaggtgttc tctacagtag atgccaagaa gggtttctgg cacgtggttc 2040 tggatcgttc cagcagtctt ctaaccacat tttggacacc tttcggaagg taccgttgga 2100 tccgcttacc tttcggcatt gcaactgctc cagaaatttt tcaaatgaaa cttcaggagg 2160 tgatccagga tcttgaaggc gtggaatgca ttgcggatga cattttaatt ttcggtactg 2220 gcgataatct gcagcaagct ctggtgaacc acaacgtgtg cctagagaag cttcttatcc 2280 gtttggaaga gaacaatgtg aagctaaaca aggctaagtt gaggctgtgt atgacatcag 2340 tcaagtttta tggacacgtg ctgactactc ggggactaca gccggatgag agcaaggtgg 2400 aaactataaa aagctatccc agacctcaag atcgcaagga acttcagcgt ttcataggga 2460 tggttaatta tctcagccgt ttcattccaa acttgagctc caattttgcg gtactgagac 2520 gacttatttc ggacaaggaa ccttgggttt ggacggataa ggaagaagag gagtttgtcc 2580 gtgttaagtc tttggtagcg gatactaggt ctttgcagta ttatgatgtc aatcagccct 2640 taataatcga gtgtgatgcg agttcctttg ggttgggaac agctgtcttt cagtctcagg 2700 gagtaatcgg ctatgcgtcc cgcactctaa cgtcgactga aaggaattac gcccagatag 2760 aaaaggaact tcttgccatt ctattttcgt gtgttcgatt tgaccagctt attgtgggta 2820 acccgaagac gatcgtaaga acggaccata aaccgttggt aaacgtgttt aagaaaccac 2880 ttttgtctgc tccaaaacgg ttgcagcaca tgctgctgag cctacagcgc tacaacttgg 2940 agatacaata tgtaaaagga aaagaaaaca ttgtagctga tgctatttct cgggctccac 3000 tggatggaga acttcctgaa gatcagttca agaagcggaa tatttatcaa gtttttcggc 3060 agctggaaga agtaaaccca agcaagttcc tcagaatcac cgatgaacgt ttaaccgagg 3120 tcatacagga aacagcaaga gattcgtcaa tgcaacagat cattcgatac atcctggagg 3180 gttggcctac gtcggtagac aaggttccag atggcgtgaa gattttctac aagcatcgga 3240 atgagctcag ccaccaggat gggatagttt tccggaacga tcgtgttgtt attccttatc 3300 gaatgcgacg tacaatgact gagaaggtac atgtaagtca caacggagtc gaaggaacgt 3360 taaagctagc ttgtgccaat ttgttttggc caggtatgac cacacagatc aaggacgctg 3420 tttctcaatg tgcagtatgt gcaaaatatg cagcctccca accagctcct cctatgcaaa 3480 gtgttgccat accagtacat ccatttcaac tcgtatctat ggacgtgttt tttgcggatt 3540 accagggtga gaaaaggaaa tttctagtaa ccgtagatca tttttcggat ttttttgaag 3600 ttgatctgtt gaagagtttg actcccctct cgcgtcacgg aaagccacaa ctggtgatta 3660 cagacaatgg gactaatttt gtaaacaagg aatggaggca gtttgcgatt gaatgggagt 3720 ttcagcattc gacatcagct ccctatcatc aacaagctaa tgggaaatcc gaggcggctg 3780 ttaaaattgc gaagagattg atgaagaaat ccgaagaatc aggcgttgat ttttggtatg 3840 ccctgttaca ttggcgcaac atacccaata aaataggatc gagtccagca gcgcgcctat 3900 tctcacgcag tacacgctgt ggattaccaa cgtcgttgga aaacctaatg ccgaaacccg 3960 ttcaaaatgt tcccagcagt attgaagaag ctcggcgaaa gacaaaatat caacatgaca 4020 aagcagcaag gaacctaccg atactggaaa caggttcacc ggtttacgta cagttgcatc 4080 ctgaatcaac gaagcattgg actcctggat cggtcaccag taagatgaat gactgctcat 4140 atttagtgaa tgtagatggt gctctctatc gtcgaaattt ggtcaacctg aagccacgta 4200 aagaacctga tacgcctgta caatctacta tgccttctgc aaatgaatca agcgttccat 4260 tggaagcacc ttcaagagac accgtggaag tagcgacatt tacacccgaa gaaaccgtat 4320 cgaatcagtc ttcatctgag ttgaacgagt tgttggtaag atcggatact ttggcgattc 4380 cgatgcatca accgacaaaa ttgccatcca tacaagaatg tgaatcgtca ggacgactgg 4440 taaggaatga aaggccaaaa agagaaaaac gtttgccgtg taaattgaaa gactatgatc 4500 tcaattagct tttcacagaa aggggaaga 4529 // ID Gypsy-76_CQ-LTR repbase; DNA; INV; 146 BP. XX AC AAWU01003228; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-76_CQ_; KW Gypsy-76_CQ-I; Gypsy-76_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-146 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 532-532 (2011). XX DR Genome; AAWU01003228; Positions 11383 11528. XX SQ Sequence 146 BP; 42 A; 47 C; 25 G; 32 T; 0 other; tgtggcgact catgcgcccc gaattcccct atactgtggg aagcagaata accgataaat 60 aagaacctac tacctactta ctctcttcgt gcactcagca cacgatctgt tgatcggtct 120 caacaccgat cacagcacac accaca 146 // ID I-3_AC repbase; DNA; INV; 6009 BP. XX AC . XX DT 27-JUL-2009 (Rel. 14.07, Created) DT 27-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE A family of Nimb non-LTR retrotransposons from a sea slug - DE consensus sequence. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; I group; KW I-3_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-6009 RA Kapitonov V.V. and Jurka J.; RT "Nimb - a novel clade of animal non-LTR retrotransposons."; RL Repbase Reports 9(7), 1538-1538 (2009). XX DR [1] (Consensus) XX CC Nimb is novel clade of I-like non-LTR retrotransposons. It CC includes families of retrotransposons present in fish, molluscs, CC sea squirts, sea urchins and insects: I-1_DR, I-3_DR, I-5_DR, CC nimbus, I-3_AC, I-4_AC, I-1_CI, I-1_SP, I-1_AA, I-1_BM. I-1_CI is CC a family of tunicate Nimb non-LTR retrotransposon. The consensus CC sequence was derived from multiple alignment of several copies CC ~96% identical to each other. The 3' terminus is composed of the CC (TAC)n microsatellite. XX FH Key Location/Qualifiers FT CDS 574..2112 FT /product="I-3_AC_1p" FT /note="ORF1." FT /translation="MDSKKQKDPSPAPNVKKSRLAEAGAAAEPSKNSKKGK FT RTFTDASFLPPPKELLRQPYVAISSTKDAQRRITDLNLFKTGDALRKVLGY FT IPTCVQKLSCGALLVKCATEIEVTKLLDTTTFGGIQCKSEHYQKFNRSKGV FT VRSHELKGCSMEEIVEHCEGVVEARRITLRRGETTIETNTIVLTFESCRPP FT ATVRASYLVLDVRPYVPNPLRCFKCQKFGHSQSRCRHAAVCPRCGKTGHPE FT KECKASPCCPNCHGQHTAFSKECPTWLQERAIQEYKARNGCSFQEARKVVC FT PPITTPVIGRSYSMATQMVTRSASNPTTDLPKPAVNRQVSVPKNTRNKKPA FT TIPVAPPFEVPLSNSYGVLGEDGEDSSSAPSSPPHPSSIISSSSSSSSSSS FT SSSSSSSSTSQPIPLMECEVGLLPSPQPPPPPSQPPRSAPPPPTPPADKPL FT PMEEDISDPPPRPDPGKGGVCHSSKSPTRPPSNPGRQPRKHSASPARRSQS FT VSSKPGPLSSKSKSKNS" FT CDS 2115..5783 FT /product="I-3_AC_2p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="MAVLQWNAGGIRPSIAELQHLQATLCPRVACIQETKF FT APASTIDLRGFTAYHHIHTENQIASGGTTIYTDNHTLHRQVHLKTTLQAIA FT VRVTLHRPITICSVYIPPNYPLKITQLNDLKSQLPKPFILLGDFNAHSPLW FT GDKPIDQKGKIIENFLFQNNICLLNDKSPTYRNRATYGTSSLDLTFCTPDL FT VPDLEWSVLDDRTDHFPILLKNNSLQNDPIPERFNFKKADWDGFANQCSKI FT LNPNFDHNYNKFHEKLLEICHNHIPKTKTKPRRNKIWFSWECEDAVKKKKA FT AYRRAVNNTTIENKILYLKARAEARRCLRESKRQSFQKYISKINSQTPLNQ FT VFQIAKKFLGRRTDTIRHVQRPDGTTAETQVDIANTIATSLAKNSSQANYN FT TTFQNIKTNREKQHLNFHSNNTETYNADFTIDELTSCISDLGDTAPGPDDI FT HNKIIRHLPSETLTLLLHIYNDMWRSHSFPESWRLATVIPIPKPNKDHTHP FT SNYRPIALTSCLCKLIEKMIHRRLMWFLETTGSLSGLQCGFRKTRSTLDHL FT VRLETFIRETFAERGHMIAIFFDLEKAFDTTWKYGILRDLHALGIRGHLPA FT FIGNFLKNRSFQVRVGSSLSDPHQQEEGVPQGSILSPLLFEIKINSIVKTL FT QNSIDSSLYVDDFLICCRSKNPHLRSMERQLQTQLQKLETWANNNGFKFSP FT SKTVAVHFCRTGRRVLQPDLTLYGERIPVQDQARFLGVIFDKKLTFIPHIK FT DLRVRCQKALNALKILSNPEWGGDSKHLLHLYISLVRSRLDYGCQVYGSAK FT PHVLKILDPIQNQGLRLALGAFRTSPVESLQAEANIPPLSLRRTQLSLQYA FT VKQSSTPENPAHNSIFQTNPETINRFQNNEKLIKPFGLRILEDLKNLHFSK FT QDIQQILFPDLPPWHLTKAKVDLQLTTHNKESTNTKTFQAEYKKLIDKYPG FT AELIFTDGSKKDNEVGAAALSPNNTHQRRLHPEASIFTAEATAIDMALDSV FT ENSPNKSFLILSDSLSCITSLTNYTLLHPKIVRLQQKIHTLSETGKNIMFA FT WIPSHVGIRGNEQVDDLAKKASSLLLGSTTNTLPQTDFRNKVKKYTSSLWE FT SHWENQKNNKLYEIKPRLEQRNPSDLSRRDEVIFTRLRIGHTALTHKYLLQ FT GEEKPFCVGCDTDFTIRHILTECLDFGEIRRKYYKCKKLQDIFSVVVPSRI FT LNFIKAIGLYGEL" XX SQ Sequence 6009 BP; 1733 A; 1757 C; 1195 G; 1324 T; 0 other; cacccccagt gggtcggggg gatataaata gacccactgc ttcgtctgca gaaggtgcaa 60 ccccttcaaa ggcggagtgt cgcacacgga ccccaaccct ctgggatccg tgtgcgacac 120 ttactttggt gtatcctcct cttctccttc cttctttctt tcttcttcta tctttcctcc 180 caccgctgac ctcaagtgca gaaaaaagag aggtcaccca tcccacagag tgggcgcctc 240 gttgaccgtg aggtgcgacc cgtgcggagg gaaaccgggt acctggcggt tgaggtgaga 300 tcagtccacg gactgtccct caccaacaca cctctttggc ccaggttttc ctcttagacg 360 ggagggtggc acggcccatg ccactcaatc ggttgagagc tgaaacccca tcaagtcctg 420 ggtcaatctt gacgggaggg tctcgtactt tctcgttgag agtatagctc cggggtagcc 480 ctggcacagc tctggatatc cgagctatgt gtccccttgt tgggctccgt ggcgggcggg 540 gagccccgga gtcgaatcaa atagaaaact ttcatggatt ccaaaaaaca aaaagaccct 600 tccccagctc ccaacgttaa aaagtctcgg ctcgcggaag cgggtgcggc cgccgagccg 660 agcaaaaact caaaaaaagg taaaaggacc tttaccgatg cctctttcct ccctcctccc 720 aaggagcttc tgaggcaacc gtacgtggcc atctccagta ccaaagacgc acagagaaga 780 attaccgatc tcaacctctt taagacagga gatgcactac ggaaagttct gggatacatt 840 cccacatgtg tacagaagct ttcctgtggt gcactccttg tcaagtgcgc aaccgagatc 900 gaagtgacca aactcctgga taccacgacc ttcggcggta ttcaatgtaa gtcagagcac 960 taccaaaaat ttaaccgctc gaagggcgtg gtccgcagcc atgagctgaa aggctgctcc 1020 atggaagaga ttgtggagca ctgtgaggga gtggtcgaag cccgccggat tactcttcgg 1080 cggggggaga ctactatcga gacgaacacg attgtcctaa cattcgaatc gtgtcgccct 1140 ccagccaccg ttcgggcatc ttacttggta ctcgacgtac gaccctatgt gccgaaccct 1200 ctgcgctgtt tcaagtgtca gaagttcggc catagccagt cacgatgccg tcacgcagcc 1260 gtgtgtcctc gttgtggcaa gaccgggcat cctgagaagg aatgcaaggc tagtccttgt 1320 tgccccaatt gccacggcca gcacacagct ttcagcaagg agtgcccgac ttggctccaa 1380 gaacgcgcca tccaagaata caaggcgcga aatggctgct ccttccagga ggcccgtaag 1440 gttgtctgcc cgccaatcac aaccccagtc attggccggt catattcaat ggcaacccag 1500 atggtcacga ggtcggcaag taatcctact accgacctcc caaagcccgc ggtcaacagg 1560 caagtctctg tgccaaagaa caccagaaat aaaaagcccg ctactatccc tgtggcaccc 1620 ccctttgagg tgccactctc taacagctat ggggttctag gggaggacgg ggaggactct 1680 tcctccgccc cctcttctcc ccctcacccc tcttctatca tttcttcttc ttcttcttct 1740 tcttcttctt cttcttcttc ttcttcttct tcttcttcca cctcacagcc catccctctg 1800 atggagtgtg aggtgggcct tctcccctcc ccccagcccc ctcccccccc ttcccagccc 1860 cctcggtctg ctccacctcc ccctacccct ccagcagaca aacccctgcc tatggaggaa 1920 gacatctccg acccccctcc ccggcctgac cccgggaagg gcggagtctg tcactcctcc 1980 aaatccccca ctagaccgcc gtccaatcca ggacggcaac ctagaaaaca ctctgcctcc 2040 cccgcaagga ggagtcagag tgtttcttcg aagccggggc ctttgagctc caaatcaaaa 2100 agtaaaaatt cataatggcg gtcctgcaat ggaatgccgg cggaatccga ccttccattg 2160 cagaattgca acatttacag gccacactgt gcccacgggt ggcctgtatt caagaaacaa 2220 aattcgcccc ggcttcaacc atagacttaa gaggatttac tgcgtaccat cacattcaca 2280 cggagaacca gattgcttcc ggtggtacca ccatctacac cgacaaccac acactccaca 2340 gacaagtaca tcttaagacg accctccagg ctattgccgt acgggtcacc ctgcacagac 2400 caatcacgat ctgctcagtt tacatcccac caaattaccc cctcaaaata acccaactaa 2460 atgacttaaa gtctcagctc ccaaaaccct ttattttgct aggtgatttc aatgcccaca 2520 gccccctttg gggagacaaa cctatagacc agaaaggaaa aataatagaa aactttttat 2580 ttcaaaataa catctgtctc ttaaatgata aatccccaac ttacagaaac agagccactt 2640 atggcacttc atcccttgac ctcacctttt gtacccctga cctagtccca gatcttgagt 2700 ggtctgtgct ggatgaccgt acggaccatt tcccaattct cctcaaaaac aatagcctcc 2760 aaaacgaccc gatcccagaa cgcttcaact tcaaaaaagc cgactgggac gggtttgcca 2820 atcaatgttc caagattctt aatcccaatt tcgaccacaa ctacaataaa ttccacgaaa 2880 agctcttaga aatatgccac aatcacattc caaaaacaaa aacaaaaccc cggaggaaca 2940 agatctggtt ctcttgggaa tgtgaggacg cagtaaagaa aaagaaggca gcgtatcgaa 3000 gagccgtcaa caacaccacc atagaaaaca agatacttta tttaaaggct cgagctgagg 3060 cacgtagatg cctcagggag agcaaaagac aatcattcca aaaatacatc tccaaaatca 3120 acagccagac accactcaac caagttttcc aaattgctaa aaaattctta ggtagacgca 3180 cagacaccat tcggcacgtc cagaggccgg atggaacaac agctgaaaca caagtagaca 3240 tcgcaaacac aatagctaca tctctcgcca aaaactcatc ccaagcaaac tacaacacca 3300 cttttcaaaa catcaaaact aacagagaaa aacaacacct aaacttccac tcaaacaaca 3360 ctgaaaccta caacgcagac ttcacaatag acgaactaac atcgtgcatc tcggacctgg 3420 gcgatacagc cccgggaccg gatgacatcc acaacaagat cattagacac ttaccctcag 3480 aaacactcac actcctacta cacatataca acgacatgtg gcggtcgcac tcctttccgg 3540 agtcttggcg tctcgccact gtcattccaa tacccaaacc aaacaaagac cacacacacc 3600 catccaatta cagacctata gccctaacga gctgtctctg caaactgatt gaaaaaatga 3660 ttcacaggcg actcatgtgg tttctggaaa cgacgggctc gctgagcggt ctccagtgcg 3720 gattcaggaa aactcgatcc actctggatc acctggtccg cctggagacc tttatacgcg 3780 agacgttcgc tgaaagaggc cacatgatcg ccatcttttt tgacctagaa aaagccttcg 3840 acacaacctg gaagtacggc attcttcgag acctgcacgc actcggcata cgcggtcacc 3900 tccctgcctt cattggcaac ttcctgaaga accgttcctt ccaggttcgt gtcggatcct 3960 ccctttcaga cccccaccaa caggaggaag gagtccccca ggggagcatc ctttctcctc 4020 tcctattcga aattaaaatt aactccatag tgaaaaccct ccagaacagc atagacagct 4080 cgttatacgt tgacgacttc ttgatatgct gccgatcgaa aaatcctcat ttgagatcta 4140 tggaaaggca gctgcaaacg cagctgcaaa aacttgaaac gtgggcaaac aacaacggtt 4200 tcaagttctc cccgtcaaag acggttgcag tacacttctg ccgaacaggc agacgtgtgc 4260 tgcagccgga tttgacacta tacggggaaa ggattcccgt ccaggaccaa gctcgtttcc 4320 tcggcgtcat tttcgacaaa aaacttacat tcataccaca catcaaagac cttcgtgtga 4380 ggtgccagaa agcgttaaac gctttgaaaa ttctatcaaa tccggagtgg ggcggggact 4440 caaaacacct tctccacttg tacatctccc ttgtccggtc caggcttgac tacggctgtc 4500 aagtctacgg atcggcaaaa ccacacgtct taaaaatact agatccaata cagaaccaag 4560 gactccgcct cgctctgggg gctttccgca cttctccggt ggagagcctc caggccgagg 4620 ccaacatacc accactctca ctcagacgca cacaactctc actacaatac gcagttaaac 4680 agagctcaac accggaaaat ccagcgcaca acagcatttt tcaaacaaat ccagaaacca 4740 taaacagatt ccaaaacaac gaaaaactca tcaaaccttt cggcctcaga atccttgagg 4800 acctcaaaaa cttacacttt tccaaacaag acatccaaca aattcttttc cccgacctcc 4860 ctccgtggca cctcaccaag gcaaaagtag atttacagct gaccacacac aacaaagaaa 4920 gcacaaacac caaaactttt caagcagaat acaaaaaact aatagataaa taccccggtg 4980 cggagctcat cttcacggat ggctccaaaa aagacaatga agtgggcgcc gcagccctat 5040 ctcccaacaa tacacaccag aggagactcc atcctgaagc ctccattttc actgcggagg 5100 ccactgccat tgacatggca ctggactctg tagaaaattc tcctaacaaa tctttcctta 5160 tcctttctga ctctctgtct tgcatcacct cactcacaaa ctacacactc ttacacccaa 5220 aaatagttag gctacaacag aaaatccata cactatccga aacaggaaaa aacatcatgt 5280 tcgcatggat acccagccat gtgggcatcc gaggaaacga gcaggtagac gacctggcga 5340 agaaagcttc atcccttctg ctgggatcaa cgacaaacac tctcccacag actgacttta 5400 gaaataaagt caaaaagtac acctccagtc tgtgggagag ccactgggaa aaccagaaaa 5460 ataataaatt atatgaaatt aaacccagac tagaacagcg aaatccttcg gatctctcca 5520 gaagagacga agtcattttt actcgattgc gcatcggaca cacggccctg acccacaaat 5580 acctcctaca gggagaggag aagccgttct gtgtcggatg cgacacagat tttaccatta 5640 gacacatttt aacagaatgt ctagactttg gagagatccg aaggaaatat tataaatgca 5700 agaaattgca ggatattttc tccgtcgttg ttcccagcag aattttaaat tttataaagg 5760 ccattggcct ttatggcgag ttgtaaagtg aatctgatat tatttaaaat ttttattgtt 5820 acttaaaagt tgttttagat cacaaatgta tttacaccga atttgaatat tttaactttt 5880 tagcatttta aagtaaatta gttaccagtc gtgaagtgaa ctttcgcaga ggtgaacaac 5940 ttgtgtttcg cgcgctcata tgaccttagc agttgcgagc gccgtaaaac cttatactac 6000 tactactac 6009 // ID Copia-20_SI-I repbase; DNA; INV; 4110 BP. XX AC AEAQ01023503; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_SI_; KW Copia-20_SI-LTR; Copia-20_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4110 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023503; Positions 6628 2519. XX CC Positions [1571-1981] - Integrase core CC 'GTAAC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 407..1981 FT /product="Copia-20_SI-I_1p" FT /translation="MKRIKAFFEPKTLNALLELLREFFSYSWKSNDTVGTF FT VAGLKVIARKIEALDSEDFGNKFNEKLVMAKILGCLPKDFDSFVTSWSILS FT EEMSLEFFLEKLANAERNIEGCTDDVSVEAFKSQCKSANGQVMKTNEKKFK FT GKCHKCGKIGHLKRDCRSKSDKSEKQEATKSQKEMKEFKDVQEKELSASSA FT HRLKDEGCIIADSDGSVHHTGNVEWFSLLRKIDVPLTLNIADGKTLKATHV FT GDIRIEKSIDGKKWEKRVWKTVYYCEDMGSESLFSTTFMEKTRGYGFYHGN FT GTMQLMDGRKTILGERRINSQYVPFIRVVPPSASVKITRSIELWHQRLGHV FT SDNMIPAMVKNNLVDGLDVILKKRDDCDSCHFGKQKVSSHPTREKRECLPG FT QRFHSDVCHVGIMSWNKCKYFLTLKDEASGYRRVFFMKSKEEVSSILKIFF FT LAAEKETGRKAISLRTDNGTEYINEKVKEVLRELNITHELSSPNVKQCNGM FT AERENRTLCDTARSLLFNADLSKTDRHLL" XX SQ Sequence 4110 BP; 1354 A; 676 C; 971 G; 1109 T; 0 other; gattatgaga cttaatctat gctgattaaa taaagattag agacgtgtgg tattagtgag 60 gttagaaacg cgtggtacca gtgaggttaa gagaattaca ggtaacggaa cgattgctac 120 ggaaactaca ctgaaagaca ttcgagttac taagctcacg aaagcaacgt atagtcgttg 180 gaaaatcgag atccgtgatg ctctggagag ttatcaaatt tgggaagtca caaccggaag 240 gacgactaag ccaagcgagg tgagatccga ggacggcgtc gttactaatg tgaaggaaat 300 tgatgactgg agagttaaag acagcaaggc acggtctgtt attcgatcaa cgctcgacga 360 cacaactttt tatcaggtat gcgactgtga gacgtcagca gacattatga aaaggatcaa 420 agcgtttttc gagccgaaga cgttaaacgc tttactggaa ttgctacgtg agttctttag 480 ttattcgtgg aagtctaatg acacagtcgg tacgttcgta gcaggattaa aagttattgc 540 acgtaagatt gaagctttag attccgaaga ttttggaaat aaatttaatg aaaaacttgt 600 aatggcaaag attctaggat gcctgccgaa ggatttcgat agctttgtta caagctggtc 660 cattttgtcg gaggagatgt ccttggagtt tttcttggag aaacttgcta acgctgaaag 720 gaacattgag ggatgcactg atgatgtatc ggttgaagcg tttaagtcac agtgtaagtc 780 agcaaatggt caggtaatga aaaccaatga aaagaaattt aagggcaagt gtcataaatg 840 tggcaaaatt ggacacttaa aacgagattg taggtcgaaa tcagataagt cagagaaaca 900 ggaagcaaca aagtcacaaa aagagatgaa agaatttaag gacgtacaag agaaagaatt 960 gtctgcatca tcggctcacc gtttgaagga tgaaggatgt attatcgcag attctgacgg 1020 aagtgttcat catactggta atgttgaatg gttttcacta ctcagaaaga tagatgtacc 1080 acttacatta aacattgccg atggtaaaac gttaaaagct actcacgtgg gtgatattcg 1140 gattgaaaaa tcaatcgatg gtaaaaagtg ggaaaaacga gtatggaaaa cggtatacta 1200 ctgtgaagat atgggtagtg aatcactctt ttcaactaca tttatggaga aaactagagg 1260 ttatggattc tatcacggaa atggaactat gcagttaatg gatggacgga agacgatact 1320 cggtgaaaga aggattaaca gtcagtatgt accttttatt cgagttgtac cgccatcagc 1380 ttctgtaaaa attactcgat caattgaatt atggcatcaa cgccttggtc atgttagtga 1440 caatatgata ccagcaatgg tgaaaaataa tcttgttgat ggccttgatg taattttaaa 1500 gaaaagagat gattgcgatt cctgccattt tggaaaacaa aaagtaagtt cacatcccac 1560 tcgagaaaaa cgtgaatgct tgcctggtca acgttttcat tcagatgtat gtcatgttgg 1620 aattatgtca tggaataaat gcaagtactt cttgacactt aaggacgagg catctggtta 1680 tcgcagagtt ttcttcatga aatcaaaaga agaagtatca agtattctaa agatattttt 1740 cctagcagca gagaaggaaa ctggaagaaa agctatttcg ttgagaactg ataacggaac 1800 cgaatacata aatgaaaagg tgaaggaagt cctacgggag ctgaacatta cacacgaact 1860 gtcgtcacca aatgtaaaac agtgcaatgg tatggcagaa cgagagaatc gtacactatg 1920 tgatactgct cgatcgctgt tatttaacgc tgacctttca aaaacggatc gtcatctgtt 1980 atgaaccgaa gctgttggct gcataccttc gtaatcgggt accaaaccga ggagttacaa 2040 ctacaattcc atatagtgaa tggtatggaa agaagcctga ggtaactcac ttgagagttt 2100 ttgaggcgaa agcttttgtt cgtatttttg actcgacaag acataagatg gatcctaaag 2160 caaagaagat gatatttgtt ggatatgatc gtcatactga caagatatat cgggtctttg 2220 atttcgaaaa gaagattgtg gaaagagtcg ccgatgtaac aatagaggat gtaacgaata 2280 cgaatgagca agttcttttt ccgctaatgt tcgaagagca agaggaggtt tctaccgaat 2340 tactcaagca agaagaaact cttgaagatt tatctagaga ggatgactct accgacgagt 2400 tctactcaga cgaaggagaa gtaattcaag tttcatctga acctcaaaag aaaagaggac 2460 gaccagtagg attacggtct tatcagaaac cagtgcttcc atcagatcga gtgctgcgag 2520 ataggacgga taagtcagtt cgtattgctg caatgaatgt atctctggac ccaatctcct 2580 acgaagatgc aatttcaaga gatgattctg actactggaa gcaagcaatg gatgatgaaa 2640 tggcgttcat acgtaaaaat aacttgggaa cttgaattgt taccaaatga acaatctaca 2700 gtttcatgtc gatgggttta taaatcgaag ttgcggtccg atggaacaat caaacgttat 2760 aaggcgagat tagttgctcg aggctttagc cagacatata ttattttgag attttctcgc 2820 cggtagtact ttatgaatcg gtgagagcaa ttttagctat cgtagctaaa tacaacatgg 2880 aacttgtaca gtttgatgtg aagacggctt tcttgaatag tccactggaa gaagatatct 2940 atatgcaaca gccggagggg tatgaagtgg atggatcaag tcgtgtctgt cacttaaaga 3000 aggactgtat ggactcaagc aagcccctcg taattggaat aatatattca acgattttgt 3060 tatgtctcat ggttttaagc gatcagaagc tgatccgtgt gtcttcgtga agggagctta 3120 tacagatgat tggatgatac tgtctctata tgttgatgac ggcttgatag aatgtaaaag 3180 gaagagaaca cagtgtatct ttgtttcatt gttgatctca gaattcgaag ttacgtgtca 3240 tgagccgacg tgttatgttg gaatggaaat agctcgaaat cgagagacag ggacactatg 3300 tatcaagcaa caaggataca tttcacgtat gttgcatcgt tttggaatgg aggactgtaa 3360 atctgtaaag tctccaatga attcgtctat tgagtcaact gaactgaagg aaacaaaaga 3420 tgaggaaaaa cgttttccct acagagaagc aactggctgc ttaaattata ttgctacagt 3480 gtcgagacaa gatatctcgt atgcagtgag caaacttgcc cgatattcca atgatccaca 3540 gcagcttcac tggaaggcag tgaaacgtgt tatgaagtat ctcaaaggta cgatcgatgt 3600 ctcattgtat ttccacaaag aattatcaga tgaattgatt ggatattgcg attccgacta 3660 cgctggtgaa ctggaagaaa gaagatccac gtccggatat gtttttctta ttcatggtgg 3720 accgatcgct tgatcatcga gtctacaacg tattacagca ctctcctcat cggaagcgga 3780 atacatgtcg atctcggaag cattaaaaga acttctttgt ttgagaacac ttgttaaatc 3840 tcttggatta gagcaaacaa agtcaacaga gttgaaagtg gacaatcaag cagctatagc 3900 aatgtcaaga aatccggagt tccataaaag aaccaaacat attgcggttc gatttcatcg 3960 cgtcagacag gaacaggaag ctgggaaggt tcatgtcaca tatgtatctt caagcaatca 4020 ggtagcagat ttacttacaa aacctctacc ttggcctacg atttcaagat gtcttggaca 4080 aatgggaatg acgtcgggaa caagaggagg 4110 // ID I_Ele8 repbase; DNA; INV; 6227 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele8. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6227 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6227 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 17 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 504..1835 FT /product="I_Ele8_1p" FT /translation="MASTSGGGPIGPLNRNFPGLTETFGAVTTLLLMGKNG FT SALPEDPFIVGESVEEWAGPVERSNIEGYGTKYVMRTRNQAQVEKLLQLNT FT LKDGTEVSVILHPKFNTSRCVISTYSLISMEEKEILNKLASQGVTDVRRIM FT KAKKEKTPAIILTFSRAEYPQSVKVGLLQVPTRPYYPNPLLCFKCYSYGHS FT RNNCPNPQRCFNCSSQHEEMDTCDQPAFCINCEKNHRPFNRQCEVYRKEVD FT IIRTKIDFNLSYPDARKRVEAGNGSYAKVTAQPRLDKTRFDAMAEQIKKQQ FT DMIEKLEQQLQNQRSLEEKVTEMLERSKAKETKIEELLKYVQQRDEKIRKL FT EAQNNNLKRLLDGMQAKQRSESLTSEPSLEMESIKRKSKRHTQPTDQQQVS FT KGSAGMSPPPKRTSSRTRSPIMTRQTSNQEDTAKSNYIAESSFKSTPPNK" FT CDS 1838..6130 FT /product="I_Ele8_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MAQNQNQLVTQSTNSIHPKINNHRFEACERVVPPSLR FT KEAQGEEETVRDVTPQPVDGKSSSAAGWHAPHSQHQSQTDQQNTNHPSTCS FT LHTKNVLSHNSLNPTSEVSMSALSRAMHVSTSEKSNMPKEAPSDGTLNCLP FT RAALTDAYQYDLFYNHMQRPGSSETPTITRCRTFPPTNAYNPSAEEQQKSG FT SENLIINDREYGESYSNRRRSPTISTNDTFSGSATTRLALQWNINGLFNNL FT GDLQLLIHDNAPQAIALQEIHCRNARNLDRLLEGKYRWYVKTGSTLFQNVA FT LAIHQSVAHTFVPLNTALIAVAAKVHLPFRHTIVSIYLQNSGISNLEEQLA FT SLFDQLEKPLLVLGDFNGHHYAWGSPKTDGRGVAILNTAEQHDLIVMNDGS FT PTFLRTNCRSCIDVSLASSNILASLNWYIHSDPMGSDHFPIEIQHNAPPPE FT TTRRPKWKLNEANWEGYEQTITSLIDPNREYQLEELSNIIIDAATKQIPKT FT SSRPGRKASHWWCDEVKTAVKARRKALRAVKRIPRDHPNNDAIHAEYRRLR FT NECRKTIREAKQRSWEKFLDGFDCEQSTSEMWRRVNALSGRRKLRGIAICQ FT DDIISRDPTFVANNIGKYFAKISSLAEYDDSFLKFQEDKNCTLSKIIVPND FT EENNDFNQPFTLEELLFSLDASNGKSAGPDGISYPLLKRLPLRGKIALLEM FT FNAIWKAGSFPPDWRHSLVVPIPKKQATSTNPQDFRPIALTSCISKVLERM FT VNRRLTTVLQEKQLLDFRQHGFLKGRGTGSYLASFGQVLHDALSNGLHVDI FT AALDLSKAYNRVWRPSVLRQLINWGIVGNMGKFIKGFLDRRTFQVIIGNTL FT SEEFIEESGVPQGSVLAVSLFLVAMNSVFNDLPDGIFIFVYADDIILVVVG FT RSPKLIRRKLQSAVRKVSKWAISCGFKMAAEKCVISHVCNFKHHPWTNPVI FT VDGCEVAFKKEAKILGVVVDRKCNFASHFSFVKKDSESRIRLIKAISGRHS FT TNNRRSLMNIGRSIIISKLLYGLEITIRSATLMIQMFSPTYNKMIRLTSGL FT LPSSPTLSTMVEAGILPFEYTLTAAASNRAISFIEKTYGAQRDIFILDETR FT KLLQQYTNIDFPEIATLHRVGNRRWDRPNPKVDWSIKREIRAGESPEKIQA FT IFNELIENKYATHCKIYTDGSRSNGKVGIGISSPIGNFARRLPDQFTVFSA FT EAAAVYFAIKKCTSIPNASIIFSDSASTLAALENPQQKHPLIQAIENSILP FT NTTLCWVPGHSGVRGNEEADKLASIGRTSTKWNLGIPRADMKLVIQNALQD FT AWYRRWESNNGEFLRKIKNTVSEWSDRKNRKEQKILSRLRIGHTRVTHAHY FT ISNSTKTTCETCSVPLTVEHILLNCQKYADIRNNLNLQDSIRTVLSNDVNE FT EEKLLKFLKETKLXNEI" XX SQ Sequence 6227 BP; 2008 A; 1475 C; 1312 G; 1430 T; 2 other; tcagtttcgt gtaatcggga tacggtagta taacaacatc gcgaaacgtc tagtcttcaa 60 atgctatttt ctctttgaaa taagcgtttt aaaccccgtt ttttcaaccg acgtaaaagt 120 taacgtgaga actcatcttc acgaagtgcg gtacagacaa attgttatgc gtaggcgtta 180 aaagtgtacg atccaatgtt aagttaacac cgggcaacag cggcatcatc gagaacaata 240 agcacataca gcagtgcgta acgagtagtg ttgtgtgggt caagcaaata acccgagtga 300 tagtttgtga caatacgtgg cgaaaaagtg catgctgaga cgaattcgcg tgaaaaccgt 360 tttsttgggt ttcttattcc tcctcaatcc tcatcctcgc agcaggggtg ggtagcgggt 420 gaccgctacc aattttttct gcgttaattg atttgtattt cttcgatctg aaaaaagtgc 480 gtgtgattgg ctgtggttgc taaatggcct ccaccagtgg aggcggcccc atcgggcctt 540 taaatcgaaa cttccctggg ctaacggaaa cgtttggagc tgtaacaacg ctgcttctga 600 tgggcaagaa tggatcagca cttccagagg accccttcat agttggagaa agtgtcgagg 660 aatgggctgg ccccgttgag cgcagtaata ttgaaggata tggcacgaag tacgttatgc 720 ggactaggaa tcaagcccaa gtcgaaaaac tgttgcaatt gaacactctc aaggatggaa 780 cggaagtaag cgtaatacta caccccaagt tcaacacaag tcggtgcgtt atttccactt 840 actcgctaat cagcatggag gaaaaggaaa ttctcaacaa gcttgccagt caaggtgtca 900 ccgatgtcag aaggatcatg aaggcaaaaa aggaaaaaac gccggcgatt attctcactt 960 tcagccgtgc ggaataccca cagtcggtca aagttggatt actacaggtc cctacccgac 1020 cctactatcc taatccatta ctctgcttca aatgctactc ctacggccac tcaagaaaca 1080 actgccctaa tccccaaagg tgtttcaact gttcgtcaca acatgaggag atggatacat 1140 gcgatcaacc tgccttctgc atcaactgtg agaaaaacca tcgacccttt aaccgtcaat 1200 gcgaagtata ccggaaggaa gtggatatta tccgaaccaa aattgatttc aacctctcat 1260 acccggatgc acgaaaacga gtcgaagcag gaaacggaag ctatgccaaa gtgacggcgc 1320 agcctcgttt ggacaaaacc cgctttgatg ctatggctga acaaattaag aagcagcaag 1380 acatgatcga aaagttggaa cagcagctac agaatcagcg aagtttggaa gaaaaggtga 1440 cagaaatgct tgaacgtagc aaagcaaaag agacaaaaat cgaggaactc ctgaaatatg 1500 tacagcagcg tgatgagaaa atcaggaaat tggaagcgca aaacaacaac ttaaagcgtc 1560 tcctggatgg aatgcaagca aaacagcgga gtgaatcgct aaccagcgag cccagcttag 1620 aaatggaaag cataaagcga aaatcgaaac gccacactca gccaacagac caacaacaag 1680 tatccaaggg atcggccggc atgtcaccac cgccgaaaag gacctcctct agaaccagaa 1740 gccctatcat gaccaggcaa acgagcaatc aagaggatac cgctaaatcg aactacatcg 1800 cagaatccag cttcaagtca acaccaccta acaaataatg gcccagaacc agaatcaatt 1860 agttacgcaa tcaaccaaca gcatccatcc caaaatcaac aaccaccggt ttgaagcgtg 1920 cgaacgtgta gtaccacctt cacttcgtaa agaagcgcag ggggaagagg agactgtccg 1980 ggacgttaca ccccaacccg tcgatggtaa atcttcctct gcggccggtt ggcacgcacc 2040 gcactctcaa catcaatctc aaacagatca gcagaataca aatcatccct ccacttgttc 2100 cttacatacc aaaaatgtcc tttcacataa ttcccttaac ccaacctctg aggtttcaat 2160 gagcgcactt tccagagcca tgcacgtctc aacgtcagaa aaatcaaata tgcccaaaga 2220 agccccatct gacggtacct tgaactgtct gccacgagcc gctctgaccg acgcctatca 2280 gtatgatctt ttctacaacc atatgcaacg accaggatct tctgagacgc ccaccataac 2340 acgatgtcga accttcccac caacaaacgc ctataaccct agtgctgaag aacaacagaa 2400 atcaggttcc gaaaatctca tcataaatga tcgcgaatac ggagaatcat actcgaatcg 2460 tcgaaggtct ccaacaatct ccacgaatga caccttttct ggtagcgcaa caactagact 2520 cgcactccag tggaatataa acggcctgtt caacaaccta ggcgatcttc aactgttaat 2580 acatgacaac gcaccacagg caatagcact tcaagaaatt cattgccgga atgctcggaa 2640 cttagatcgc ttactggaag gaaagtatcg gtggtatgtc aaaacaggat ccacactttt 2700 ccaaaatgtt gccttagcta tacaccagtc cgtggcgcac actttcgttc cactcaatac 2760 ggctctcata gccgtagctg caaaagttca cctgcctttt cgtcacacaa tcgtgtccat 2820 ctaccttcaa aattccggaa tttcaaatct agaggaacaa ctagcgagtc ttttcgatca 2880 actggagaaa cccttgctcg ttcttgggga ttttaacggg catcactacg catggggctc 2940 accgaagact gatggaaggg gtgtcgccat actgaacacg gctgagcaac atgatcttat 3000 cgtgatgaat gatggttcgc ccacattcct ccgcaccaac tgcagatcgt gcatagatgt 3060 atctctcgca agcagtaata tccttgcctc tctaaattgg tacattcact cggatccaat 3120 gggaagtgat cacttcccaa ttgaaattca gcacaatgcc cccccaccgg agacaacccg 3180 tcgtccaaaa tggaaactca acgaagcaaa ctgggaagga tatgaacaaa ccatcacatc 3240 tctaatagat cccaaccgcg aatatcagct tgaagaacta tccaacatca taattgatgc 3300 tgcaacgaag caaatcccga agacgagcag tagacccgga agaaaagctt cacactggtg 3360 gtgcgatgaa gtgaaaactg cagtgaaagc ccgtaggaaa gctttgaggg cggtgaaacg 3420 tattcccagg gaccacccca ataatgatgc aattcatgcc gagtacagac gacttcgaaa 3480 cgagtgtaga aagacaatca gagaagcaaa acagaggtct tgggagaaat tccttgatgg 3540 tttcgattgt gagcaatcaa cgagtgaaat gtggaggcga gtcaatgcac tatctggtcg 3600 aaggaaattg agagggattg ctatttgcca agatgatatc atatctcgtg acccaacgtt 3660 cgttgctaat aacattggaa aatacttcgc taagatttca tcacttgctg agtatgacga 3720 cagtttccta aaattccaag aagataagaa ctgcacgcta tcgaaaatta tcgttccgaa 3780 cgatgaggaa aataacgact tcaaccagcc tttcaccctg gaagaacttc tcttctcttt 3840 ggatgccagc aacggaaaat cggctgggcc agacgggata agttatccgt tactaaaacg 3900 tctgccatta cgaggtaaga ttgcgttatt agaaatgttt aatgctatct ggaaagccgg 3960 aagttttccc ccggactggc gtcacagtct cgttgtcccg atcccaaaaa aacaggcaac 4020 gtcaactaac ccccaagact ttagaccgat tgcactaact agctgcatct caaaagtttt 4080 ggaacgcatg gttaatcgcc gactaacaac agttctccaa gaaaaacaac tcctagactt 4140 tcgacagcac ggattcctca aaggacgtgg cacgggatcg tacttggcat cgttcggaca 4200 ggtattacat gacgctctct ccaacggact acatgttgat atagctgcat tagacctctc 4260 taaggcctac aatagggtct ggcgaccctc agtattgcgc cagctcatca actggggcat 4320 cgtaggaaac atggggaaat tcattaaagg cttcctggat agacgaactt ttcaagtgat 4380 catcgggaac actttgtcag aagaattcat cgaggaatca ggtgtaccgc aaggatccgt 4440 actagccgtc tctctctttt tggtggctat gaactccgtc ttcaacgatc tccccgatgg 4500 tatatttatc ttcgtgtacg ccgatgacat aattcttgtc gttgttggta gaagtcccaa 4560 actcatacgg cgtaagctcc aatcagcggt gcgtaaagtc tcgaagtggg caatttcctg 4620 tggcttcaag atggctgcgg aaaaatgtgt aatttcacat gtttgtaact ttaaacatca 4680 cccttggaca aacccagtaa ttgtcgatgg atgtgaggtt gcgttcaaaa aagaagcaaa 4740 aatcctagga gttgtcgtcg acagaaaatg caactttgct tcgcattttt ccttcgtgaa 4800 aaaagacagc gaaagcagaa tacgcctaat taaagctata agcggaaggc actcaactaa 4860 caaccggagg tcgctaatga atattggtcg gagcataata atcagcaagt tactgtacgg 4920 cctagaaatt acaatccgtt cagcaacgct tatgatccaa atgtttagcc ctacttacaa 4980 caaaatgata cgtctcactt ctggtctact tccaagctct ccgactttgt ccacaatggt 5040 tgaagctgga atactccctt ttgagtacac cctaactgcg gcagcaagca acagagcgat 5100 aagctttatc gaaaaaacat acggcgcaca aagagacatt tttatcctcg atgagacaag 5160 aaagttgttg caacaatata ctaatattga ctttcccgaa atagccacac tccaccgagt 5220 cggaaatcga cgatgggatc gtcctaatcc caaagttgat tggtcaatca aacgagaaat 5280 acgggcaggc gaatcaccgg aaaaaattca ggcgattttc aacgaactca ttgagaataa 5340 atacgctaca cactgcaaaa tctacacaga tggttcacgc tcaaatggaa aggttggcat 5400 tggtatatct tcgccaatag gaaatttcgc acgcagatta cccgaccaat ttaccgtctt 5460 ttcggcagaa gctgcggcag tttactttgc aatcaagaag tgtacaagca tccccaacgc 5520 atcgatcatc ttttctgatt ccgctagtac attggctgca ttagaaaatc cgcaacaaaa 5580 gcatccccta atccaggcta tcgaaaactc tattctaccc aacacgactc tttgctgggt 5640 cccaggacac agtggcgtta gaggaaacga ggaagcagac aagctcgcta gcatcgggcg 5700 tacatcaacc aaatggaatc tcggaatacc tagggcagat atgaaattag taattcagaa 5760 tgcactacag gatgcatggt atcgaagatg ggaatccaac aatggtgaat ttctgagaaa 5820 aatcaaaaat actgtgtcag aatggtcaga cagaaaaaat cgaaaagaac agaagatctt 5880 atctcgttta agaatagggc acacaagagt tacacatgca cactatatct ccaattcaac 5940 taaaacaact tgtgaaacct gctcagtccc tctgactgtt gaacacattt tactgaactg 6000 tcaaaaatat gctgatatac gtaacaattt gaacctacaa gatagtataa gaaccgtgtt 6060 aagcaatgat gtaaatgaag aggagaaatt gttgaagttt ttgaaagaaa ccaaattgtw 6120 caacgagata taaaaattag ccttaaacgt ttgaagaggc gaaccagctg cgaaagctga 6180 aaacctctat aataaagata aaaaaaaaaa aaaaaaaaaa aaaaaaa 6227 // ID Gypsy-22_IS-I repbase; DNA; INV; 4062 BP. XX AC ABJB010933243; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_IS_; KW Gypsy-22_IS-LTR; Gypsy-22_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4062 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010933243; Positions 2874 6935. XX CC Positions [3097-3567] - Integrase core CC 'GAACG' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 595..1998 FT /product="Gypsy-22_IS-I_1p" FT /translation="MSSEPAALLQVQARLHSSAQDNVKRDAPRECGRCGST FT KPEDNSCGWAKSRCYRCGRHGHLAKKCRNAAARKSQVTKAAHATTLAVAEA FT TSDVEGRDETHIWTLVSERKNFLEPPIRRTLTWGGVELQMEVDTGSPVCVI FT SRQIFEKYRKVWPCLKPPRVKLSCYTGRIPVLGELQLRACYKGVDVDCSLT FT VLDCSGPSLCGRDLLAKLKDAGMSILQWAGHDSAAPDPKCSSSVNNIFSNY FT QDVFSRDLGVIKGPPASLQLKDGVSPKFCKARPIPYALRDKVSLELDRLVS FT LGVPSPVKHSEWATPIVPVLKKDGNVRICGDFKATLNPACAVEQYPLPVIE FT DIFARLNGGESFSTLDLRDAYNQVPLDEAARKLCVINTHRGLFCYNRLPVG FT IASAPAIVQRKMDLILAGLPGVQAYLDAVLVSEKKDDNGERLKSVLGRFRE FT HGVKLRYDECTFRQPAVTYSVIA" FT CDS 2305..4047 FT /product="Gypsy-22_IS-I_2p" FT /translation="MAWEQSCFTPRGNVYKPIGFRSRTLTQAERNYSQLER FT ETIALVFGVTKFRDYLLGREFTLVTDKQTLLGLLRADGLTPAMAAARIQRW FT ALYLGGYRYKLHYVPGKQLLNSGALSRLPMLSTEPEDDGEPPDYALFLETL FT DDGVVTTHELKDLAAADSTLARVKQYILRGWPKSTKGLEPSVLPFYDRKLE FT LSVAHELVYWGNRAIIPVKAQSRMLQLLHETHQGSSAMKFVARSLFWWPGL FT DRDIERLSANCHNCIINLPMPTAAPPVPWPKTLEKWSWLHIDFAGPLAGKM FT ILVVVDSHSKWIEAVPIKHATTASTISCLRNIFSHFGVPRTIVSDNGTQFT FT SQEFATFVKRNHITHLRTAPSHPQSNGAAERAVRTVKDGLRKMKEGTLEDK FT LSRLLFNYRRTLQRNGKSPSEMLLGYQIRSRLDSCFPQTVAEPPQGRDDWA FT VQPDSTVYVRNYGAGEKWTPGRVKTTTGARMATVETPAAVIRRHTDQVRPR FT QDASPQSLPDSHASNATEASGRASPSQAVEVAVGPTAQLSPDRDQQLDYPT FT TPPSSTTKTDAGGAQPAIPRRSTRLRKPIDRFHY" XX SQ Sequence 4062 BP; 1066 A; 1074 C; 1066 G; 856 T; 0 other; gtggcgacga gtccggacct cgcccaagcg gacaaaaaag ccccggagca gctatacagc 60 tcagcagcgt cccgaggaca gctatggccc gcctgcaact accggaattc gacgaagaca 120 tagataagtg gaaaccttat ctcattaagg tcgaagccta ctttgaggct aacaccgtca 180 cagactctgc taaaagaaga gcgctgctag tggcggcact aagcacgaag actgtacaag 240 tactagcacg aagagtagcc ccccgcacgc ctaattcttt gactcatgaa taagttgtac 300 aagccctgaa tgagtactac gacccgaagc ggcaggaaat aacggaaagt tacaagttct 360 tcaaccgttg tcaaatggaa ggcgagtctg tccatgcgtt tctggttgaa atacgccgca 420 tagcagacaa tcgcaacttt ggcagcatgt tggaccgcat gctcagggac agaatcgttt 480 gcggggtgcg ctcaagtacg ttacaaaagc agctgcttgc aaaaagagaa ctaacactag 540 aggaagccga agctttagcg gtttcggcaa aacaccgaaa acgactctaa aaaaatgtct 600 tcggagccag cggcactact gcaagtgcaa gcgcgtcttc attcttcagc acaagataac 660 gtaaagcgcg atgccccgcg agagtgcggc aggtgcggga gcacaaagcc tgaagacaac 720 agctgtggct gggccaagtc tcgttgttac cgttgtggac ggcatggtca cttggcaaag 780 aaatgtcgga acgctgctgc ccgcaaaagc caagtgacta aggcggctca cgcaacgacc 840 ttggctgttg ctgaagctac gtcagacgtg gaaggcaggg atgaaacaca catctggact 900 ttggtttcag aaagaaagaa ctttctcgaa ccgccgattc gccgcacgtt aacgtggggg 960 ggagtagaac tacaaatgga agttgacacc ggatctcctg tttgtgtcat atcacggcaa 1020 atttttgaga aatatcgcaa ggtgtggccc tgcttgaagc cgccacgtgt taagctgtca 1080 tgctacacag gtcgtattcc agtgttgggt gagcttcaac tccgtgcgtg ctataaaggc 1140 gtggacgtcg actgctccct gactgtactg gactgttcgg gaccaagcct atgcggccgg 1200 gacctccttg ctaagctgaa ggatgccggg atgtctattc tgcagtgggc cggacacgac 1260 tctgcagcgc cagacccaaa atgcagctca tcagtcaaca acatctttag taactaccag 1320 gacgtcttct ccagggacct aggcgtaatc aagggacctc cagccagcct gcaactgaag 1380 gacggtgttt cccccaagtt ctgtaaggcg agacccatcc cttatgcgct tcgtgataag 1440 gtgtcgttag agcttgaccg gctagtatct ctcggtgtgc catcacctgt aaaacactcg 1500 gaatgggcaa cccccatagt cccagtactt aagaaagacg gcaatgtacg aatttgtggt 1560 gatttcaaag ccacattgaa ccccgcctgc gcggttgaac aatatccgct gcctgttatt 1620 gaggatattt ttgcacgctt aaatggtggc gaaagtttca gcaccttgga cttacgagac 1680 gcgtataatc aggtgccatt ggatgaagcc gcgcgaaagc tttgtgtgat caacactcac 1740 aggggcttgt tttgttataa taggcttccg gttggcattg cctctgcccc tgcaattgtt 1800 caaagaaaaa tggacctgat acttgcgggc ctgccgggtg tgcaagccta tcttgacgct 1860 gtgcttgtat ccgaaaaaaa ggacgataac ggggagcgac tgaaaagcgt gctggggcga 1920 tttcgcgaac acggtgttaa actgaggtat gacgagtgca cttttcgtca accggccgtc 1980 acttactcgg tcatcgcata gacaagcaag gccttcaccc gactgagaaa aacgtggacg 2040 caatcacaca ggcacccagc ccgcgtaaca tcagtgagct ccgttcgttc ctgggaatgt 2100 taacgtctta tgctaagttt ctgccaaaca tgtcaactct cctcgctccg ctatatcgac 2160 tattggagaa aaattcacga tggcagtgga agcaaccaca gaaaatcgtt ttcaacaggg 2220 ccaagcaatg gcttaaggaa gcgaaggttc tggggcattt cgacccggct aaggagctaa 2280 agctagaatg cgacgcatcg ccgtatggcg tgggagcagt cctgtttcac acctcggggc 2340 aacgtttaca agcccatcgg gttccgctct aggaccttaa cacaagcaga acggaactat 2400 tctcagcttg aacgagagac catcgcactc gtgtttggtg tgacaaagtt tcgagattat 2460 ctgcttggtc gagagtttac cttggttacc gacaaacaaa cactcctggg cctcctgaga 2520 gctgacggac tgactccggc aatggccgcc gcccggattc aacgctgggc gctttacctg 2580 ggtggctacc gttacaagct gcactacgtg ccaggaaagc agttgctgaa ctcgggcgcc 2640 ctcagcagat tgccgatgct gtcaacggag ccggaagatg acggtgagcc cccagactat 2700 gctctcttct tggagacctt ggatgacggc gtggtgacaa cgcacgagct aaaagatctt 2760 gcggccgctg attccaccct agctcgtgtc aagcagtaca tattgcgcgg ctggcctaaa 2820 agcacaaagg gactggagcc ttctgtgctg ccattttatg atcgcaaact ggagttgtcg 2880 gtagcacacg aactagtata ctggggtaat cgagcgatca ttccggtaaa agcccagtcg 2940 agaatgctgc agctactgca tgaaacgcac caaggttcct cggctatgaa gtttgttgcc 3000 cgatcgttgt tttggtggcc aggactagac cgcgatatcg aaagactatc agctaactgc 3060 cacaactgca taataaactt gcccatgccg acagcagcac ccccggtgcc ttggccaaag 3120 accctagaga agtggtcttg gctacacata gattttgcag gaccgctagc tgggaagatg 3180 atactcgtag tggtggacag ccattccaaa tggatcgaag ctgtacccat caaacatgct 3240 accacagcga gtacgatcag ctgtctacgc aatattttca gccatttcgg cgtgccacgt 3300 acaatagttt ctgacaacgg gacgcagttc actagccaag aatttgccac attcgtaaaa 3360 agaaaccaca tcactcacct ccgcacagcc ccttctcacc ctcagtcgaa cggagcagcc 3420 gaaagggcag tccgaacagt gaaagatggc cttcgtaaaa tgaaagaggg gacattggaa 3480 gacaaactgt cgcggttgct attcaattat agaaggactc ttcagcgcaa tgggaaatcg 3540 ccttccgaga tgctactggg ttaccagata cgttctcgtt tagactcatg ttttcctcaa 3600 actgtcgcag agccaccaca gggacgtgat gactgggcgg tacaaccgga cagcaccgtc 3660 tatgtgcgca attacggcgc cggtgagaaa tggacacccg ggcgcgtcaa gaccacaaca 3720 ggagcccgga tggcgaccgt ggaaacccct gcagcggtta ttcggcggca caccgaccaa 3780 gtacgccccc gccaagacgc aagcccgcag agtctgccag atagccacgc atcaaacgct 3840 acagaggctt caggtcgagc atcgccttcc caagccgtcg aagtggctgt ggggcctacc 3900 gctcagctat ctcccgacag ggaccagcag ctagattacc cgactacgcc gccatcctct 3960 accacgaaga cggacgcagg tggagctcaa ccagcgatcc cccggcggtc gacaagactt 4020 cgtaaaccaa tagataggtt tcactattaa gggaagagaa at 4062 // ID CR1-4_CQ repbase; DNA; INV; 3697 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3697 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 4-4 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 290..3658 FT /product="CR1-4_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MRASRLPDATTSSPSYRTXSPASRSHQSRPGVGVGVG FT GGVSQTAPPGKYYFRTQPSPPDACPTYSSRATEPATRNVYASPXSARAIQS FT SPGRTVESLMRASRLPDATTSSPSYRTRSPASRSHQSRPGAGVGVGGGVSQ FT SAPPGKYHPSFPHCTSDGPSPSSKLRIYYQNVRGLRTKVDAFFLAITECDY FT DVIVLTETWLAAEILSPQLFGSNYRVFRNDRNERNSVKRSGGGVLIAVASR FT LDCVSDPTPVCDTLEQLWVRLSVGASQRVCIGVFYFPPGMNRNIDLIKSHL FT DSIGSVISSLDLNDSFIQFGDYNQPGLKWIPSEEGGLEIDPVLSSMPLASS FT YLIDGFNLHGLTQLNSCINVYNNCLDLVLVNDVVLPKCAVFEAVEPLVPLD FT ITHPALEMVVDVPAPTVTTEPPSAIQQPFNFRKANYAAIKEELSYVDWSFL FT ETSSSIDAAVDSFQTVLNNIFAVHVPRRRPSQKPPWSNATLRRLKQLKAAA FT LQDFCNHRSPFFQTRFKLASKRYRGCNRTLYRRYVIRTQHDLRKNPKNFWS FT FVNSKRKTDGLPPTLYLNDVAATTPEEKCALFAAHFKDAFNSRPASEAEIM FT AATRNTPANAFECSFTEVGNGQVTAAIDKLKLSYSAGPDGIPSSVIKRCSD FT VLLLPLSILFNLSLCQSTFPAKWKRSFMFPVHKKGAKSSIKNYRGITSLSA FT CSKVFEILINSVLFESCKHYISTDQHGFIPKRSVATNLAQFVSHCLQEMDA FT GAQVDAVYTDLKAAFDRVDHGVLLKRLEVLGVSANLIRWLKSYLTDRVMRV FT RIGNSESDIFTSLSGVPQGSNLGPLLFILFVNELALLLAGGCRLFYADDVK FT IFTVVRSVLDCSTLQRQLDRFNEWCLRNFLTISIDKCNAISFHRKLKPVIF FT DYAIAGSTLARVTQVRDLGVTLDCKLSFTSHRHDIVSRANRQLGFIFRIAE FT DFRDVACLRSLYCALVRSILEFCSVVWCPYQSTWTAKIESVQRKFTRLALR FT RAHNPAGWSPYEERCRILRLDTLEKRRHISQASFVAKVLAGEIDSPWILAQ FT IQVYAPERPLRQRPYLQLARRNTNYGQHEPIRFMCSRFNVFSQLYRPNIST FT SMFQQRARLWYSSQF" XX SQ Sequence 3697 BP; 858 A; 1075 C; 842 G; 917 T; 5 other; gctccggtga atttttttgg gagccawcaa acgccgccac gcagactcag caaaacctgc 60 caccgccccc tctccaacct tctccaacaa aatcgtcgat gccggaaacc accccaaagt 120 caacgccgaa gccaacgcca aagtcaacgc cgaaatcttc actggaaacg gagcctactt 180 tgccgtcggg ctcacccatg gaccacgaga actaacgccc acgcttacaa aatcgccttc 240 ggttcattat ctaccagctt cwtcgccagg acgcmccgta gaaagcctta tgagagcctc 300 tcgcctgccc gacgccacca catcctcgcc aagctaccgc acgagwagcc cagctagtcg 360 cagccaccag agccgtcctg gtgttggtgt cggtgtcggg ggaggggtct ctcaaacggc 420 gccccctggc aagtactatt ttcgaacgca accttctcca cctgatgcct gcccaactta 480 cagctcgcgt gccacagagc ccgcgacccg gaatgtctac gcttcaccgm tgtcagctcg 540 cgcgatccag tcttcgccag gacgcaccgt agaaagcctt atgagagcct ctcgcctgcc 600 cgacgccacc acatcctcgc caagctaccg cacgagaagc ccagctagtc gcagccacca 660 gagccgtcct ggtgctggtg tcggtgtcgg gggaggggtc tctcaatcgg caccccctgg 720 caagtaccat ccaagctttc cccactgcac atctgatggc ccctcgcctt ccagcaaact 780 gcgtatctac taccagaatg ttcgaggtct ccgaacgaaa gtcgatgcct ttttcctcgc 840 catcactgag tgcgattatg acgtcattgt cctgactgaa acttggctgg ccgccgagat 900 tctgtcccct cagctgttcg gttcgaatta ccgtgtattc aggaacgaca gaaatgagcg 960 gaacagtgtt aagcggagcg gaggaggtgt tctgatagct gttgcatctc gtctggactg 1020 tgtctcagac cctactcctg tctgcgacac tttggagcaa ctctgggtta ggttatccgt 1080 aggcgcttca caacgcgttt gcataggcgt gttctatttt cccccgggaa tgaatcgcaa 1140 cattgaccta attaagtccc acttagactc aattggcagt gtcatttcca gccttgatct 1200 gaacgattcg ttcattcaat tcggcgacta caatcaacct ggcctaaaat ggattccctc 1260 ggaagaaggt ggacttgaga tcgacccggt gctgtccagt atgccacttg ccagcagtta 1320 cctaatcgat ggttttaacc tgcacgggct tactcagctc aactcctgta tcaacgtcta 1380 caacaactgt ctcgaccttg tcctggtgaa cgacgttgtg cttcccaaat gcgctgtctt 1440 tgaagctgtt gaaccgctgg taccactcga catcacccat cctgcactgg aaatggtcgt 1500 cgatgttccc gcacctacag ttacaaccga accaccatcc gcaatccagc aacccttcaa 1560 cttccggaaa gccaactatg cagcgatcaa agaggagctg agctatgtcg actggagttt 1620 tttggagact tcttcgagca tcgatgccgc cgttgattct tttcaaactg ttttgaacaa 1680 catttttgct gtacacgttc cccggcgaag gccttctcag aagccacctt ggtctaacgc 1740 cactttacga agactcaagc agctcaaagc cgcggctctc caagatttct gcaatcatcg 1800 ttcacccttc ttccagacca gattcaagct cgctagcaaa cgttacagag ggtgcaaccg 1860 tactctgtac aggcgatatg ttattaggac gcagcacgat ctccggaaaa atccgaaaaa 1920 tttctggtcg tttgtaaact cgaaaagaaa aacggacggc ctgccaccta cgttgtacct 1980 gaacgatgtt gctgctacta ctcccgagga aaaatgcgcg ctttttgcag cccactttaa 2040 ggacgcgttc aactctagac ctgcttcgga ggcggaaatc atggcggcaa cacggaatac 2100 tccagccaac gcattcgaat gttcctttac ggaggttggg aatggccaag ttactgccgc 2160 tattgacaaa ctcaaactgt cctactcggc cggccccgac ggcattcctt cctctgtcat 2220 caaacgttgt tcagatgttc tcctattgcc tctttctatc ctcttcaacc tttctttatg 2280 tcaaagtacc ttcccggcta agtggaaacg ctcttttatg tttccagtcc acaaaaaggg 2340 tgccaaaagt agcatcaaga actatcgtgg aatcacctcc ctctcggcct gctccaaggt 2400 ttttgaaatc ctcataaaca gtgtcctgtt tgagagctgt aaacactaca tctccacaga 2460 ccagcatgga tttattccga aacgttcggt tgccacgaat ctggcccaat tcgtgtctca 2520 ttgccttcag gaaatggatg ccggagcgca agtcgacgct gtgtatacag acctcaaggc 2580 tgctttcgat cgagtcgacc acggagttct tctgaaacga ctggaagtac ttggagtttc 2640 tgctaatctc attcgctggt tgaagtccta cctcactgac cgggttatgc gtgtcaggat 2700 tgggaactct gaatcggaca tattcaccag tctgtctggc gttccccaag gcagtaactt 2760 agggccccta ctgtttatcc tgttcgtcaa tgaattggca ctgctgctag ccggaggatg 2820 tcggctgttc tacgcagatg acgtgaaaat attcacagtt gtaaggagcg ttttggactg 2880 ttcaactctg cagagacagc tggatcgttt caacgagtgg tgcttgcgta acttcctgac 2940 gatcagcatt gacaaatgca acgccatctc tttccatcga aaactgaagc ctgtgatctt 3000 tgactacgcc atcgcgggaa gcacactggc gcgggtgacc caagtacgag atcttggagt 3060 tactctggac tgcaaactca gtttcaccag ccatcgccat gacatcgtgt ccagagccaa 3120 ccgtcagctg ggattcatct tcaggatcgc tgaggacttc cgggatgttg catgtctgcg 3180 ctccctgtac tgcgcgctgg taagatccat cctcgagttc tgctctgttg tctggtgtcc 3240 ataccaaagc acttggacag ccaagatcga atcagtccaa agaaaattta ctcgacttgc 3300 ccttcgccgt gcgcacaatc ctgctggttg gagtccgtac gaggaacgtt gccgaatcct 3360 gcggttggac acacttgaaa aaaggcggca catctcacaa gcatcgttcg ttgccaaagt 3420 gctggctggc gaaatcgaca gtccctggat cctagcgcag atccaagttt acgctcccga 3480 aaggccgctc cgccaacgtc catacctgca gctcgcacgt cgcaacacca actatggaca 3540 acatgagccg attcgattta tgtgtagtcg ttttaatgtt ttttctcaac tctatcgtcc 3600 taacatttcc acttccatgt ttcaacaacg tgctcgactc tggtattcta gtcagtttta 3660 atgtatatgt gtcggtgtcg tcgctagttt ttataag 3697 // ID Gypsy-2_DFa-I repbase; DNA; INV; 3766 BP. XX AC ADHC01000031; XX DT 21-APR-2011 (Rel. 16.04, Created) DT 21-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Dictyostelium fasciculatum genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DFa_; KW Gypsy-2_DFa-LTR; Gypsy-2_DFa-I. XX OS Dictyostelium fasciculatum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-3766 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Dictyostelium fasciculatum RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; ADHC01000031; Positions 1290778 1287013. XX CC Positions [1813-2331] - Integrase core CC 'ATCAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 504..1829 FT /product="Gypsy-2_DFa-I_2p" FT /translation="MIPSGRSRHLQMLSPSLLVVFTKIMTIEVLYLAYKNA FT FYNVKTIYKELQERTMDFVQMRSVAISLSSTVAPNGHRQINVPEFNYGDPL FT GWLSGLYYRSDNNNNNNNSNNNKRKHNHLVNQDLSKIKCFNCQRFGHKADV FT CRDNRNNQHVDKKSRDTKQCNICRSSEHWTNQCPRNKSSSMTCVSDTLPYL FT YISINFDSNLIKAKVDTGSEINFIDFRLVPKEMVMTSDLFVTGVNGHTVPS FT LGQASVPVIFTSNRGQITRKINFELTDRAGLCLLGIDTIMSCKASFGEGNS FT KLEMNDATGNTHIINLSRDVKSPSLTVDESKSYKNNIKKVYQTLLSKNDPI FT NGIQKIVKELDNVTTNIADGLGNISNDMDWQETVKSELNRNQRRMLNRCMA FT PRIKNSISIVDNEAEVVNEDMKDVSNVALVFISHNDSVETKECNITTT" FT CDS 1753..3753 FT /product="Gypsy-2_DFa-I_1p" FT /translation="MYQMWHLCLSVIMTQWKQRNVTLQPPNYCFQRVHIDV FT ITGLPNAKYHGKTVNALLVVVDALSKMNRLIPTTTTLKSHGVISLFNREIF FT KLHGLPKQIISDSDSKFTSKLFKKLEAMYNIKINHSNPTHHQSNGQVERMI FT RTIENIIRKDLLQSSVENHKVSWAECIDNIEFAINNTVHSTTGYSPFTIYL FT GSNPLSPLNLLDTFTPNFNDNTDIKITENHKLKEIIRDIVIKRINEGNAIS FT KARYDHNKVENNIKSGDLVYVRRDRDTHINKTEPIYSGPLLVNGVDTRGNV FT EVKDYVAGADGSTSKDLLVNPRFIKTLKPLLATGNPVHTEGAENAQGNPVH FT TEGAENAQGYPVHTEGAEDAQSYPVHTEGTENVQGYPVHTEGTENVQDYPV FT HTEGAQGARDNLDGRVGDDRTLYQDLNKIQSTLQGQGVTGGDQGIVSQAPG FT DIYDDVPNTADWTKQRVVLNDTLKKVEESIGKGKNKLQVKDNNRVIGKEDI FT VITNYNISKINYDDIKINFIEILHQYRQQLSSPTTTKYFNMKEMNSIRIIQ FT SLQQNNNDSPTKDIVNSIGVITSRAWLKPLFIVKSKRTFKYQGNGRSRTEY FT LININDCEVWMPNDLVNQQHVRAVSKTQQVSLAIVSTGKVFSPGCSDQVLQ FT PWFSYHLNKRGTWDSVTI" XX SQ Sequence 3766 BP; 1352 A; 701 C; 711 G; 1002 T; 0 other; tggtggacta cgaaaaacaa ccgaaattac tatatataga aactcatttt aaattttttt 60 taattttaaa tttttgaact ttttaaattt tcaaacctca ttaatcgtac aatgagtaac 120 gacaacattc aacaacgtag acgactcaca ctcgctcctg ctcctaatct agaggaattg 180 cagccaaagg aatcagtgcc aaccggccca gagaataaag tcaagcatta cctattggaa 240 aacaatgatt atcgaatgac cgatgatgct tttgtaaaga gtgtagccag ttttcttagt 300 gtatcatcca actcagtcga agagatcatt cactctcttg gaaaggattt cgacaaagca 360 ttgtggtgta atggtactgg taaaatcctg ttcaacagca tcgagaccat cagcaaatgt 420 gatgatgtag cccactggct cgatcaaatc gatatttcaa tacctggtcc ttaccaagtg 480 tatgtcgctc acacaattcc gttatgatac caagtggacg gtctcgtcat ttacaaatgc 540 tttcaccatc attgctggtc gtgtttacga aaataatgac aatcgaagtt ttgtacttgg 600 cgtacaaaaa tgcattttac aatgtcaaga ccatctataa ggaattacaa gaaagaacca 660 tggattttgt tcaaatgaga tccgtcgcta tctcgcttag ttctaccgtc gctcctaacg 720 gtcatcgtca aataaatgta ccagagttca actatggtga tccattaggt tggttatctg 780 gactgtacta cagatctgac aacaacaaca acaacaacaa cagcaacaac aacaaaagaa 840 aacacaatca tttagtcaat caagatctat caaaaatcaa atgtttcaac tgtcagcgat 900 ttggtcacaa agctgatgtc tgtcgtgaca acagaaacaa tcaacacgtc gacaagaaat 960 caagagatac caagcaatgc aatatttgtc gtagtagtga acattggaca aaccagtgcc 1020 ctcgaaataa atcgtcatcc atgacctgtg tctctgacac tttaccttac ctctacattt 1080 ctatcaactt tgattctaat cttattaaag caaaagttga tactggcagt gagattaact 1140 ttatagattt tagattagtt ccaaaggaga tggtcatgac tagtgacttg tttgttactg 1200 gggtaaatgg tcatacagtt ccatcgttgg gacaagcaag cgttccagtc attttcacaa 1260 gtaaccgtgg tcaaataact cgtaagatca actttgaact tacagacaga gctggtcttt 1320 gcctacttgg aattgacaca atcatgtcat gcaaagctag ttttggcgaa ggtaattcta 1380 agctagaaat gaatgacgca actggaaaca cacatatcat caatctatct cgggacgtca 1440 agagtccaag tttaacagtc gatgaaagta aatcgtacaa aaacaacatc aagaaagtgt 1500 atcagacact attgtcaaag aatgacccta tcaatggtat ccaaaagatt gtgaaagaat 1560 tggataatgt taccacaaac atagcagatg gattaggaaa tattagtaac gatatggatt 1620 ggcaggaaac agtcaagtca gaattaaatc gtaaccaaag aagaatgctc aacagatgta 1680 tggctccaag aataaagaat tcaatctcta ttgtggataa cgaagcagag gtggtcaatg 1740 aagatatgaa agatgtatca aatgtggcac ttgtgtttat cagtcataat gactcagtgg 1800 aaacaaagga atgtaacatt acaaccacct aattattgct ttcaacgtgt tcatatagac 1860 gttattactg gtttacctaa tgctaaatat catggtaaaa cagtaaatgc tttgttagta 1920 gttgtcgatg cgctatctaa aatgaatcgt ttaataccta ccacaactac tcttaaatca 1980 catggtgtaa tatctctatt caatcgagag atattcaaac tgcatggttt accaaaacag 2040 atcatctctg attctgactc gaagttcacc tcgaaattat ttaaaaaatt agaggcaatg 2100 tacaacatca aaattaacca tagcaaccca acccaccatc agagtaatgg acaagtagag 2160 cgtatgataa gaactataga aaatatcatt agaaaagatt tattacaaag tagtgtagag 2220 aatcacaaag tatcatgggc agagtgcatt gataatatcg aatttgctat caacaataca 2280 gtacattcaa caactggtta ttctcctttt actatttatc tcggatcaaa tcctttatct 2340 cctcttaatc tactagacac ttttacacca aacttcaatg ataacacaga tatcaaaata 2400 acagagaatc ataaattgaa agagataata agagacatag ttataaagag aattaacgaa 2460 ggcaatgcaa tcagcaaagc aagatatgat cacaacaagg tagagaataa tattaaatct 2520 ggtgatttgg tttatgtgcg tcgtgacaga gatacacaca taaataagac agaacctatc 2580 tattctggtc ctttgttagt aaatggagtg gacactcgtg gaaatgtcga agttaaagat 2640 tatgtggcgg gagcagatgg ttcgacgtca aaggacttat tggttaatcc tagatttatc 2700 aaaacgttga aaccattatt ggctacagga aatcccgttc atacggaagg tgctgagaat 2760 gcacaaggaa atcccgttca tacggaaggt gctgagaatg cacaaggtta tcccgttcat 2820 acggaaggtg ctgaggatgc acaaagttat cccgttcata cggaaggtac ggagaacgta 2880 caaggttatc ccgttcatac ggaaggtacg gagaacgtac aagattatcc cgttcatacg 2940 gaaggagctc aaggtgcacg tgacaatttg gatggacgag tcggtgacga tcgaactctg 3000 tatcaagatt taaataagat tcaatcgaca ttgcaaggac aaggtgtaac cggtggagac 3060 caaggaatag tgagccaggc gccaggcgac atctatgacg acgtcccaaa tacggctgac 3120 tggaccaagc aaagagttgt gttgaatgat actcttaaaa aggtagaaga gagtataggt 3180 aaaggtaaga ataaactaca ggttaaagat aataatagag ttataggaaa agaagatatc 3240 gtcatcacca attacaatat ttcaaagata aattatgacg atatcaaaat caactttata 3300 gaaatattac atcaatacag acaacaacta tcctccccaa ctacaactaa atatttcaat 3360 atgaaagaaa tgaactcaat tagaatcata caatcattac aacaaaacaa caatgattcc 3420 ccaacaaaag atatagtcaa tagtattgga gtaatcacaa gcagagcatg gttgaaacct 3480 ctgtttatcg tcaagtcaaa gagaacattc aagtatcaag gaaatggacg tagtagaacg 3540 gaatatctca ttaatatcaa cgattgtgag gtatggatgc caaacgatct agttaatcaa 3600 caacacgtcc gggccgtatc aaagactcaa caagtatccc tagccattgt atccaccggt 3660 aaggttttta gcccaggttg cagtgaccaa gtgttgcaac catggttttc ctaccatctc 3720 aacaagcgtg gtacgtggga ctcggtcaca atttaagaag ggagaa 3766 // ID Loner_Ele4 repbase; DNA; INV; 5831 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Loner non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; Loner; KW Loner_Ele4. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5831 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5831 RA Kojima K.K. and Jurka J.; RT "Loner non-LTR retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (07-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 9 CC sequences with >97% identity, and ~100% identical to the original CC sequence in [1]. The consensus is ~87% identical to Loner-6F_AAe CC and 77-79% identical to Loner-6_AAe, 6B_AAe, 6C_AAe, 6D_AAe and CC 6E_AAe. XX FH Key Location/Qualifiers FT CDS 535..1830 FT /product="Loner_Ele4_1p" FT /translation="METDGDGTSGENSELSSTKRFRIKTYPSSFLGPFVVY FT FRKKEKPINVLLISSEIYKLYKSVKEIKKISLDKLRVIFGSRDDANSLLES FT KLFCNSYRVYAPCDSCEINGIIYDEDLDCNDIVNFGSGKFKNKAIPPVKIL FT DCMRLSKLTFSDKKSSYMHSNCIKITFEGSVLPDYVDIDNVIFQVRLYYPK FT IMHCDRCLLFGHTSQFCSNKLKCSKCGEGHSSVDCNKSSDICIYCGKKHNF FT LKECSVYIAHQTQSNHKIKNRSKLSYSEVTKTISDLPSTNMFAPLSQNVDN FT EVSNDDHEHFVYKPPNKRKRIIKSIIKNDDFTNPQPSTSFDTNFPPLSSPT FT SRVIPGFQKNMTYSEENSNEKNNNSNSTFQKTENSGEDNSILHILEELIEF FT LGLNEFWKKIIKIILPFLASIFEKLNSIGPLICSLFSS" FT CDS 1833..5525 FT /product="Loner_Ele4_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASSKNYNLNILQWNCRSIIPKIDRLKVLVVNCDVDI FT FCLNETWLEETKSFRIPSFNIVRKDRNTAYGGVLIGIRENIEFKYLDFTID FT SEIEYVAVSIKHNGLEFSIICIYIPPQTKFSLLELKTILNNIPSPFYILGD FT LNAHNLAWGSHKTDGRGSLIMDVIDDLNLNILNDGSFTRIVVPPGRHSCID FT LSICSNCLSMNSSWETVEDANGSDHLPILISIYNPNFNQNNLEHCVPDLTK FT NVDWVKFSDLVSLALLSFDNSLSTIESYDNFSKLLVECLKKSQTKKIMQST FT SKKKHSSFWWDEDCTIALKNKSNAFKLFRRSGSREHYILYCKTEAQFTRVT FT KFKKRNYWRNFIENLDSDTSLSKLWTVARNMRNYNVSNIPVLEYSEDWIDQ FT FASKICPDFTPLDIKYKNSSSYNYFPNLCTEFSIEEMDLALSITKNTAPGI FT DNIKFIVLKNLPIDGKLHLLSIYNNFLFQNIFPMEWRFVKVVSILKPGKNP FT SLVESRRPISLLSCLRKLMERMILNRLELWAEKNNIFSSSQFGFRKGRGTR FT DCVALLTSHIELSFNKKQDVVSTFLDVSGAYDSVLIDLLYNKMIDCKLPLI FT IANFLCNLFSFKIMHFFHNGATKMIRYSYFGLPQGSCLSPFLYNLFTRDIV FT SIIPNGCYFIQFADDKVISISGKNREVIRHFMQSCLDNIDTWAHNNGFTFS FT VQKTKFIIFSRKHSPIDIDLYLNNHHIDQVFEYKYLGIWFDPKLTWNNHIQ FT YIQTVCSKRINFLRTITGTWWGAHPSDLITLYKTTIRSVMEYGCFSFGSAV FT QSHFSKLEKIQFRCLRICLKLMNSTHTKSVEVLAGITPLRIRFDELNCKFL FT FQCFSNNHPIINILKSLYEINPSSKILSSFIYCSAHNVMLDTSPGFHEYSM FT NVHSFRPHIDFTLHEELKNIHPNARASFANLLFQRKFIGVEPKQIYFTDGS FT LINNVAGFGVFNFHIAHFFKLETPCSIYIAELTALHFACCLIKNCAPNIYM FT VCSDSFSSLHALNTINFNFKTSNIILSIKEILNDLFCRGFIIKFVWVPAHC FT NIYGNEQADSLAKLGVSRGIIYNRVIYPSEYFSKLKQHSMDNWQNTWNASD FT KGRYCYSICPKVKCLPWFDQLSLGRNFICFYSRLMSNHYICNSHLYRINIK FT DSNLCECGESYEDIDHIVFNCTRFSLPRKAIFEKITKLGHNIPNSIRDILA FT CKNLNILKILYGYFNEISYFS" XX SQ Sequence 5831 BP; 1881 A; 825 C; 909 G; 2216 T; 0 other; cagaagaggt taatgtggca tctttggtaa gtaggctttg accgggttgc cggtattgtt 60 tctgcgcttg tcaatttttt tccatttttc aggtggattt catttccaag cgattggttt 120 cgaaggtgga cattttacga tctggaacat cgaggatttg catgagacga actgtactga 180 ctaacatcaa gtttggtgct tgtggaaaat aaaagctgtt tggctagacg gtttggccgt 240 gtgctggtgg tgtgaattcg aatttgaagg aagaaactcg aacaagctga tatcgtcaag 300 gatttgatga aatagaaacg aacttaaaaa atcaagcttc aactaatcaa gtttattttt 360 gtgctggtga aaatacttgg ttggcatcga cgatctcaag tcgttgtatc agattgaagt 420 tccttaagta taattttatt tattttatta ttctaattat tattactcgt atacattatc 480 aatattatta ttatatttgt ttatttttta taattacccc gtcaattttc cattatggag 540 actgacgggg atgggacatc cggtgaaaat agtgaattat cttctaccaa acgttttcgt 600 attaaaactt atccctccag ttttttaggt ccttttgttg tttattttcg caaaaaggaa 660 aaacctataa atgttctttt aatatcttcg gaaatttata aattgtataa atctgtcaag 720 gaaattaaaa aaatttctct tgacaaattg agggttattt tcggatctcg tgatgacgca 780 aattcattac tagaatctaa actattttgt aattcatatc gagtttacgc accttgtgac 840 tcgtgtgaaa ttaacgggat catttatgat gaagatttgg attgtaatga tattgttaat 900 ttcggttctg gcaaatttaa aaataaagct attccacctg tgaaaatttt agattgcatg 960 cgattatcga aattaacatt ttctgataaa aaatcatcat acatgcattc caattgcata 1020 aaaattacat ttgaaggatc tgttctacca gattatgtag atatagacaa tgttattttt 1080 caagttagac tttattatcc gaagattatg cattgcgatc gttgccttct tttcggtcat 1140 acatcgcaat tttgttcgaa caaattaaaa tgttctaaat gtggtgaagg tcattcatct 1200 gttgattgta acaaatcttc agatatttgt atttattgtg gtaaaaaaca taatttctta 1260 aaagaatgtt ctgtttatat tgcacaccaa acacaatcga atcataaaat taaaaataga 1320 agcaaattat catactctga agtaactaaa acaatttctg atttaccctc aacgaatatg 1380 tttgctcctc tttcacagaa tgttgataat gaggtatcaa atgatgatca cgaacatttt 1440 gtatataaac ctcccaataa aagaaaaaga attataaaat caattataaa aaatgatgat 1500 tttacaaatc ctcaaccatc aacatcattt gataccaatt ttcctcctct tagttctcct 1560 acttctcgag ttattcctgg ttttcagaaa aatatgactt attctgaaga aaactctaat 1620 gaaaaaaata acaattctaa ttctactttt caaaaaactg aaaattcagg tgaagataat 1680 tctattttac atattttgga agaattgatt gagtttctag gtttaaatga gttttggaaa 1740 aaaataatta agataatttt accttttttg gcctccattt ttgaaaaatt gaattcaatt 1800 ggacccctca tttgctcttt attttcttcg taatggcttc atccaaaaat tataatttaa 1860 acattttgca gtggaactgt cgaagcataa ttcccaaaat tgatagatta aaagtgttag 1920 ttgttaattg tgatgtagat atattttgtt taaatgaaac gtggttagag gaaactaaat 1980 cgtttcgaat tccatctttc aacattgtcc ggaaagatcg caacactgca tatggtggtg 2040 tgcttattgg gattcgagaa aatattgagt ttaaatattt ggattttaca atagattcag 2100 aaattgaata tgttgctgtt tctattaaac ataatggttt ggaattttca attatttgca 2160 tatatattcc tccccaaact aaattttcat tattggagct taaaacaatt ttaaataata 2220 ttccatcccc tttctatata ctaggcgatt tgaatgctca taacttggct tggggtagcc 2280 ataaaactga tggtaggggt tcacttataa tggatgtaat tgatgactta aatttaaata 2340 ttcttaatga tggttctttc accaggattg ttgtacctcc tggtcgccat tcttgtattg 2400 atttatctat ttgttccaat tgtttatcca tgaattcttc ttgggaaact gttgaagatg 2460 caaatggaag tgaccatctt cctatattga ttagtattta taatcctaat ttcaatcaaa 2520 ataatctaga acattgtgtt cctgatttga caaaaaatgt agattgggtt aaattttctg 2580 atttagtttc ccttgcatta ctcagttttg ataattcact ttccacaatt gaaagttatg 2640 ataatttttc aaaattatta gttgaatgtt taaaaaaatc acaaaccaaa aaaataatgc 2700 aatctacttc taagaaaaaa cactcttcat tttggtggga tgaggattgc accattgctt 2760 tgaaaaataa atctaatgcc tttaaattat tcaggcgttc aggatctagg gagcattata 2820 ttttgtattg caaaacagaa gctcaattta ccagagttac aaaatttaag aaaaggaact 2880 attggagaaa tttcattgaa aatcttgatt ctgatacatc tttatcaaaa ttgtggactg 2940 ttgctcgtaa catgagaaat tataatgttt ctaatatacc agttttggaa tactcggaag 3000 attggattga tcaatttgct tctaaaatat gtcctgattt tacaccatta gatataaaat 3060 ataaaaattc ttcatcatat aattatttcc ctaatctttg tactgaattt tctattgaag 3120 aaatggatct ggcattatca attacgaaaa acactgcacc tggtattgat aatataaaat 3180 ttatagtgtt gaaaaattta ccaattgatg ggaagttaca tttactttca atatataata 3240 attttttatt tcaaaatatt tttccaatgg aatggcgttt tgtaaaagtt gtaagtattt 3300 taaaacctgg taaaaatcct tctctagttg aaagtagaag accaattagt ttattatcct 3360 gtcttcgtaa attaatggaa aggatgattt taaatcggct tgaattatgg gctgaaaaaa 3420 acaacatttt ttcttcgtct caatttggtt ttagaaaagg tcgaggtaca cgtgattgtg 3480 tagctctttt aacttcacat attgagctat catttaacaa gaaacaagat gtagtttcca 3540 catttcttga tgtttctggt gcatatgatt ctgttctgat agatttgctc tataataaaa 3600 tgatcgattg taaacttccc ttaattattg caaatttttt gtgcaattta ttttctttta 3660 aaattatgca tttttttcat aacggtgcaa caaaaatgat ccgttacagt tattttggtc 3720 ttccacaggg gtcttgttta agtccatttt tatataattt attcactaga gatattgttt 3780 ctattattcc taatggatgt tatttcattc aatttgctga tgataaggtt atttccatca 3840 gtggtaagaa tagagaagtt attcgtcatt ttatgcaaag ttgtttagac aacattgata 3900 catgggcgca taataatggt tttacttttt cagtgcaaaa aactaaattt attatttttt 3960 ctcgaaagca ctctccaatt gatattgact tgtatttaaa taatcaccat attgatcaag 4020 tttttgaata taaatattta ggaatatggt ttgatcctaa attaacatgg aataatcata 4080 ttcaatatat tcaaaccgtt tgttccaaga ggattaattt tcttcgaacg atcactggta 4140 cttggtgggg agctcatccc agtgatttaa ttacacttta taaaacaact attcgttctg 4200 taatggagta tggttgtttt tcttttggta gtgctgttca atcgcatttt tctaaactgg 4260 aaaaaattca atttcgttgt ttaagaattt gtttgaaatt aatgaattca acccatacta 4320 aatctgttga agttttagct ggtattactc ctctccgaat tcgtttcgat gaattaaatt 4380 gtaagttttt atttcaatgt ttttcaaaca atcatccaat aataaatata ttgaaatctt 4440 tgtatgaaat taatccttcc agtaaaatat tgagctcatt tatttattgt tctgctcata 4500 atgtcatgtt agatacatct cctggttttc atgaatacag catgaatgtt cactcttttc 4560 gacctcatat tgattttacc ttacatgaag aattgaaaaa tattcatcca aatgctcgtg 4620 cttcttttgc taatttatta tttcagcgta aatttattgg agttgaaccc aaacaaattt 4680 attttacaga tggatctttg ataaataatg tagcaggttt tggagtattt aactttcaca 4740 tagcacattt tttcaaatta gaaactcctt gttccattta catagctgag ttaacagctc 4800 ttcattttgc ttgctgcttg atcaaaaatt gtgccccaaa catttatatg gtgtgctctg 4860 atagttttag cagtcttcat gctttgaata ctatcaattt taatttcaaa acaagtaata 4920 ttattttatc tattaaagaa atattaaatg atttgttttg cagaggattt atcataaaat 4980 ttgtttgggt tccagcacat tgtaatattt acggtaatga acaagctgat tctttagcaa 5040 aattaggtgt ttctcgtgga ataatttata atcgtgtaat ttatccatcc gaatactttt 5100 ctaaattgaa acaacattct atggacaatt ggcaaaatac ttggaacgca agtgataaag 5160 ggcgttattg ttattctatt tgtccaaagg taaagtgttt gccttggttt gatcaattat 5220 cacttggacg taattttatt tgtttctatt ctagacttat gtctaatcat tatatatgca 5280 acagtcattt atatcgcata aatatcaaag attcaaatct ttgtgaatgc ggtgagtcat 5340 atgaagatat agatcatatt gtctttaatt gcactcgttt tagtttgcca agaaaggcta 5400 ttttcgaaaa aattacaaaa ttgggtcaca atataccaaa ttctatccga gatattctgg 5460 cgtgtaaaaa tttgaatatt ttaaaaattt tatatgggta ttttaatgag atttcttatt 5520 ttagttgata cgttttttat atattttcag atagtggacc ttccgtataa tttgtttttc 5580 ttcggagatt ttacctgttt cagttccagt cacttggata caaaaggacc ctcatgatga 5640 ttacgacggc tccgatatgg atcaattccg gatgagcctt tagttcttaa gttttctttt 5700 tgtaatgatt tcagaaaaga taaagaggtt ttgtgccttt ttgagaaaga ttccaaagtg 5760 aaatcactca aaggggtttt tccctctttc aaaattgaag ttaaaaataa ataaataaat 5820 aaataaataa a 5831 // ID Crack-35_AAe repbase; DNA; INV; 5781 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-35_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5781 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1251-1251 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 301..1416 FT /product="Crack-35_AAe_1p" FT /translation="MLEEDEKIVCFVCGESEENPEKTIECVQCAKSVHFRC FT KNLRGNAILKAKKRPFYCSVECADMFARSAREQNGYDQIIDEIKLLAQSVR FT ESKQESTHLRLAFEQSAYKIDTLVKTTKAIEESQDFLSKQFDVLQKDFDGF FT KKQLSGLEMENERMRAELGSWRTEQATVTSRLDQLEMEVDKVNRGMISRNA FT VILGVPIVDDENPKALVIKLGSLVGYALNESNVVNARRLLDKNRLNRSAPI FT LVSFCSATTKEKLFEMKRAYGPLELSKLSDSFRGSTHRVVIRDELTSFGRT FT LYQQAKELQSSMGFKYVWPGRNGKILIKRQDGGKIEEIGCKKQIEDLKKTS FT AKRSLNSSSNLSLTSLSPVQEPASKRIQM" FT CDS 1461..4358 FT /product="Crack-35_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MMNNNMNNYFYYTIAELNKNVDSNFDNNKLKLLQINI FT RGMNCMDKFDRVRDILSLYTGNIDIVVIGETWLKENCTNLYHIDGFRSFFS FT CRPDSLGGGLAVYVKQTIKVSVPKIEHDNGMHHIHVIVHANALPFHVHAVY FT RPPSYDFSRFLTTMDSIITAQGIDTSCVIVGDVNVPNNHQDNPLVREYSNL FT LRCYNFEVTNTYTTRPASNNILDHVVCSEAVKNKVVNETILTDISDHYPIL FT STFKLGRTVEKRTLEKIIVDKQKLNDAFENAVSGLAYESAEDRLYKVMDLY FT KTLREKFSKKVVVQAKIKGHCPWMNFDLWKLLRIKENVLASCKRYPNESRP FT KELLAHISKMVQRTKDNCKKTYYGNLFRCNNQKQSWKNLNEVLGKSRSGKM FT EEIKLMVNNQEISGHSVADAFNQYFCTIGPKLASTINSQRDINKYDTLTPL FT HASIFLRPTTEQEVVIEIKNLDANKSSGPDGIPVKFIKDHHHTFSALIRDV FT FNQAINTGIFPDRLKVARVTPIYKSGSKNDINNYRPISVLPVLSKIFEKLL FT VTRLLNFLHTHNIMYSHQYGFRTGSSTLTATSELVDEIYDAIDTQKLVGVL FT YLDLKKAFDTIDHEMLLRKLEYYGLRGIANNLIRSYLTGRSQFTHVNGSSS FT CKRALTVGVPQGSNLGPLLFLLYVNDLPKLNLHGKPRLFADDTSLSYEASD FT PNNIVDQMEQDLVKLQDYFNENLLSLNLSKTKYMMLHSPRRNISPHRDLIV FT QAIPIDKVQCFKHLGLMIDSKLTWSDHINSLQKTISSTCGMLWKLSKFLPR FT NALLTMYHAFVQSKLQYLVSLWGAAAKSRLKPLQTIQNRCLKAVFNLPRLY FT STANLYENSPASILPVSALRELQCLXQIHNQMYNQEMHHNQDIERASHRYS FT LRNAAFLLISRTNTEMAKKSVSYFSKKCFNSLPEVVKLEQNTNKFKQVVKS FT MIKARVHRYIL" XX SQ Sequence 5781 BP; 1776 A; 1237 C; 1220 G; 1531 T; 17 other; agcagagatt ttgttacaag tagtcctgaa aggcagctgg ggttccaacg tgcatgcgag 60 gtaatttcgt atcgtttatc aagaaaaatc attagaaaat tgataccgct ttcgacatct 120 acgattgcta ctattgtcag aaagtggcgg tgtaatagac acaaagcgtt tgtcagtaat 180 taggcccatg cgtattgcgt attgtgcgcg tagcgattga tctatagctg gtgctaccat 240 cattatcgat tgttggtaaa cagtggtcga gtaataaata tcttcttcat tgctgaggaa 300 atgttggaag aggatgaaaa aattgtttgc ttcgtctgtg gtgaatcaga agaaaatccg 360 gaaaaaacca tcgagtgtgt ccaatgcgca aaatcagttc attttcgctg caaaaatcta 420 cgggggaacg ccattctaaa agccaagaag cgaccttttt actgctccgt tgaatgtgct 480 gacatgtttg cgcgtagtgc ccgagagcaa aacggttatg atcaaattat cgacgagatc 540 aaattgctcg cacaatccgt ccgagaatcg aagcaagaat ccacgcacct acgactagcc 600 ttcgaacaat ctgcctacaa aattgatact ttggtgaaga ccactaaagc aatcgaggag 660 tcgcaagatt tcctttcaaa gcagttcgat gtgctacaga aggattttga tggcttcaag 720 aagcagttga gtggattaga gatggaaaat gaaaggatgc gtgcagaact tggaagctgg 780 agaaccgaac aagccacagt aacgtcaagg ctggaccagt tggaaatgga ggtagacaaa 840 gtaaaccgcg gcatgatttc gcgcaacgcg gtcattttgg gtgttccgat tgtcgacgat 900 gaaaatccta aggctttagt gataaaacta gggagcttag ttgggtatgc gctcaatgaa 960 agcaatgttg tcaatgctcg tcgcctgctt gataagaacc gcctcaaccg aagtgcacct 1020 attctagttt ccttttgctc tgcgacaacc aaggagaaac tgtttgagat gaaacgagcg 1080 tatggaccgc ttgaattgtc gaagttgagt gattcatttc gtgggtcaac ccaccgagtg 1140 gtaataagag acgaactaac atcatttgga agaacgctat accaacaagc aaaagaattg 1200 cagtcgtcga tggggttcaa gtacgtgtgg cccggcagga acggaaagat tctcatcaaa 1260 cgccaggacg gtgggaaaat tgaggaaatt ggttgcaaga agcagatcga agatttgaag 1320 aaaacgtcag ctaaacggtc cctaaactct tcatccaatt tgagcttaac gtccctgtct 1380 ccagtgcaag aaccagcatc aaagcgtata cagatgtaaa ctgtttagta agaaaattgt 1440 atgtgtacta ttatatcgca atgatgaaca acaatatgaa taactatttc tattatacaa 1500 ttgctgaatt aaataaaaat gtcgattcga attttgataa caataaattg aaacttttgc 1560 aaataaatat aagaggaatg aattgtatgg ataaatttga tagggttagg gatattcttt 1620 cattatacac tggaaatata gacatcgttg taattgggga aacgtggctg aaggaaaatt 1680 gcacaaatct ctaccacatt gatggcttcc gcagtttctt ttcgtgtcgt ccggattcat 1740 tgggtggtgg attagcagtg tatgttaaac aaacgatcaa agtaagtgtg ccgaaaattg 1800 aacacgacaa cggcatgcac catattcacg tgatagttca cgcgaatgca ttaccgttcc 1860 atgtacatgc agtttataga cctccatctt atgacttctc gcgttttttg acaacgatgg 1920 attcaataat cactgcacaa ggaatcgaca cttcctgtgt tatcgtaggt gatgtaaacg 1980 ttccaaataa tcatcaagac aacccattgg tccgcgagta cagtaatctt ctaagatgtt 2040 acaattttga agtgaccaac acatacacga ccagaccggc aagtaataat atcttggatc 2100 atgttgtctg ctccgaagca gttaagaata aagtggttaa cgaaaccatt ctcactgaca 2160 taagcgatca ctatccgatt ctttcgacct tcaaactggg tagaacagtt gagaagcgaa 2220 ctttggaaaa aataatagtt gacaaacaaa aactgaatga tgccttcgaa aatgcagtta 2280 gcggcctggc ttacgaatca gctgaagatc gtctgtataa agtaatggat ttatataaaa 2340 ctttaagaga aaagttctcg aagaaagtgg tcgtacaagc taagatcaag ggccattgcc 2400 cttggatgaa cttcgatctt tggaaactcc tgcgtatcaa ggaaaacgtt ttagctagct 2460 gtaagcgcta tccgaatgaa agtagaccaa aagaactcct tgcacacatt tccaaaatgg 2520 twcagcgaac taaagataat tgtaaaaaaa cctactatgg taatctgttc cgttgcaaca 2580 atcagaagca aagttggaaa aacttgaatg aagttttggg aaaatctcgt tctggtaaaa 2640 tggaagaaat aaagctgatg gtgaataatc aagaaatcag tggacactca gtggcggatg 2700 cattcaatca gtacttttgc acaattggac ccaagcttgc ctcaacaata aatagccaac 2760 gagacataaa caaatacgat acgctaacgc ctcttcatgc ttcaattttc ctgcggccta 2820 caaccgagca agaagttgtg atcgaaataa aaaacctaga tgccaataaa agcagtggac 2880 cagatggaat acctgtaaaa ttcatcaaag atcatcatca tactttctca gccctgattc 2940 gagatgtttt caatcaagcc attaacacgg gaatcttccc agataggctg aaggtagcac 3000 gtgtgacgcc gatctataaa tcaggaagca aaaatgatat taacaactac aggccaatat 3060 cagttttacc tgttttgagc aaaatatttg aaaaactact agtaactagg ctgctcaact 3120 ttcttcatac gcacaatatc atgtatagtc atcaatacgg atttaggaca ggttccagta 3180 cccttacagc cactagcgaa ttggttgacg agatttacga tgctatagat actcaaaaac 3240 tcgtaggagt gctttacctc gacttgaaaa aggcgtttga cactatcgat catgaaatgt 3300 tgcttcgaaa actggaatac tatggattaa gaggaattgc aaataatttg attcggagct 3360 atcttacggg aagaagtcag ttcactcacg taaatggatc aagcagctgt aaaagggcat 3420 tgacggttgg cgttccacaa ggtagcaacc tcggaccttt attgtttctc ctatatgtga 3480 acgatttacc aaaacttaac ttgcatggaa agcctcggtt atttgcggac gacacctcgt 3540 tgtcatacga agcatcggac ccaaataata tagtggatca aatggagcaa gatctcgtaa 3600 agctccaaga ttactttaac gaaaatttgc tgtctttgaa cttgtctaag accaaatata 3660 tgatgctaca ttcacctcgt cgcaatatct caccacacag agatttgata gttcaagcta 3720 tccctataga taaagttcaa tgttttaagc atcttggttt gatgattgac tcaaagctca 3780 catggagtga tcacataaat tcactccaga aaaccatcag ttcaacgtgt ggcatgctat 3840 ggaaactttc caagttttta ccaaggaacg cattgttgac gatgtatcac gcattcgttc 3900 aatcaaaact gcaatatctt gtttcgcttt ggggagctgc agctaagtcc cggttgaaac 3960 cactacaaac cattcaaaat cgttgtctga aagctgtgtt taacttacca cggctatact 4020 ccacggcaaa tttgtacgaa aacagtcctg cttcaatatt gcctgtgtct gctttgcgag 4080 aactgcagtg tttgsttcaa attcacaatc aaatgtacaa ccaggaaatg caccataacc 4140 aggatataga acgagcttca catagatact cactacgaaa tgcagcattc ctactaatat 4200 cgcgtaccaa cacagagatg gctaagaaat cggtttcata cttcagtaaa aaatgtttta 4260 attctttacc agaagtggtg aagcttgaac aaaacacgaa taaattcaag caagttgtga 4320 aatccatgat aaaagctaga gtacaccggt acatattatg attcgtaatg tacactttgc 4380 ctggataaac acgaaatcga caattcctgc tcccccgctg ctacctcatt ggtgtaccgt 4440 gaacaaacat cgttcgtggc atcttcattc gctatcggcc gggaagaaat ttcagttctt 4500 ccaggtaaga gctggtatat gtctggccga agtcagtgat gctgcgcgcc tagtaatgtt 4560 gaaaaaccgt tccagttctt gcttctctgc tgtttctcta caacatcttc aatgccgcac 4620 cgtgagcaac cgttcgtggc atctcatccg ctatcggccg ggaggaaacc ccagttctat 4680 caggcaagag ctggtatatg tctggccgaa gtcagtgatg ctgcgcgcct agtaaagttg 4740 awamwccagt tctatcagtt cttgcttatc cgctgcttcc ctacaaccgc ttcaatgccg 4800 caccgtgagc aaccgwtcgt ggmatctcat ccgctatcgg ccgagaggaa gccccagttc 4860 tgccaggtaa gagctggtat atgtctggcc gaagtcagtg atgctgcgcg cctagtaawg 4920 ttgawaaacc gttccagttc ttgcttctct gctgtttctc tacaacatct tcaatgccgc 4980 accgtgagca accgttcatg gcatctcatc mgctatcggc caggaggaaa ccccagttct 5040 atcagccgca ccgtgagcaa ccgttcgtgg catctcatcc gctatcggcc gagaggagac 5100 tccagttcta tcagtttttg cttatccgct gcttccctac aactgcttca atgccgcgct 5160 gtgagcaacc gttcgtggca tctcatccgc tatcggccga gaggaagccc cagttctatc 5220 aggtaagggc tggtatatgt ckggccgaag tcagtgatgc tgcgcgccta gtaaagttga 5280 taaaccgttc cagttcktgc ttctctataa tcgcttcaat gccgcaccgt gagcaaccgt 5340 tcgtgtcatc tcatccgcta tcggcctgga ggaaacccca gttctatcag ttcttgtttc 5400 tctgctgctt ctctacaacc tcttcaatkc cgcaccgtga gcaaccgwtc gtggaatctc 5460 atccgctatc ggccgagagg aagccccagt tctgccagtc actgccacca accactactg 5520 ccctccatca gcccaccatc gtaacacatc gccatacgtt aaattcctaa ttaagataag 5580 aagcttktaa cataagttga aatactacac ttccttaaaa gagcaactat gctcactgga 5640 atgtgtattg caatgtataa aacataataa tgaaaagatg aggaggtttt atgcctgttg 5700 gaggaagatt cttgaaagaa gatcacctcc aatgggcttt tccctgctcc ataagagaag 5760 aaataaawaa awaaaaaaaa a 5781 // ID TBRP1 repbase; DNA; INV; 794 BP. XX AC L08172; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.borreli repeat sequence. XX KW Repetitive sequence; TBRP1. XX OS Trypanoplasma borreli OC Eukaryota; Euglenozoa; Kinetoplastida; Bodonidae; Trypanoplasma. XX RN [1] RP 1-794 RA Maslov A.D., Elgort G.M., Peckova H., Simpson L. RA and Campbell A.D.; RT "Organization of mini-exon and 5S rRNA genes in the kinetoplastid RT Trypanoplasma borreli."; RL Unpublished (1993). XX DR GenBank; L08172; Positions 1 794. XX SQ Sequence 794 BP; 247 A; 193 C; 126 G; 228 T; 0 other; aacttacgct ataaaagtca cagtttctgt actatattgg tattagaagc tttccggatt 60 catttctgga caattttttt ataagatctt cggatctttt tttttgtcca tttttttatt 120 ttttccgaaa aagggagtac gacactttgg gggttcccaa gccatcactg acctctgttc 180 ttattaatta ttatgtacgt gataagtttg gcgttatatc tatcaagaaa cacatttatt 240 ttaccttttt attgattctt tcactttttc acggcgttta cacggaaaac cgtgattggg 300 gggaaagggc ggtaaaacga caaaaatccc tttaaaaacc cgttaataaa gcatataata 360 agcttatttt gcaaaaaagg tggtttggaa ggtcaaaaag tatcccaact tttcaaaaaa 420 acaacaacca cccaatgaca accacaataa tcccactata aactacacgc aaatgggaaa 480 accctacttt ttttaaatac gtaattttta ctataatgtc ccctactccc tggcccctag 540 gggcaaagca caccaaaccc ctttccacca atagcggcca agcatgctat aggtgcatca 600 gggggcattt gcgggtgacc ttatcacccc cccagccctt ttacaccacc acatcgaccc 660 aaaaattcaa caattcctca ccccaaaaac cccccttaga acccaagggg actttggagg 720 cttcaaaaag tcccccgagg ctctggagcc cgcaaacata tagtcgctcg aaaaaaaaac 780 actacagttc tttc 794 // ID Tx1-13_CQ repbase; DNA; INV; 3520 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-13_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3520 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 645-645 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 179..3433 FT /product="Tx1-13_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNYTCIIASINLNAIQTRLKLSLFHDFVKNSNADIIF FT XQEVAFNNFSFIFTHHAIVNVCRNQGTAILIRKGXDYDDIICDPNSRIVSV FT LVNGINYINVYGHAGFXLKKERETLFAENIAIHLNKAAARAHVLGGDFNCX FT LDPTDTKGNTKKNCNSLKVLTSKMALKDVXKALKNQAVQYTFXRGDSASRI FT DRFYASQDXLKSVLNFETWPVIFSDHRAILMKISIQKSDLGTSFGRGYWKI FT NTAFINDDECENEFCFTYEALQQRHSYENNFMNWWTTHFKSKVKNFFKMKS FT IEFNRNIAMRKSELYKTLIELSSRQNDNEDVSREMGLTKSLIFKIESEKLK FT YYESKLSSNSILTNERTNIYQIIARHKFNQSSRIEKLVDGCNKIVGTENIK FT KFLFNHFSENFKKHDFLEPSCRSLNYIKKRLNQDDSNSIIGPITEEELKKV FT VWSCTKKKTPGPDGICYEFYVKHFDLIKNDMIKLFNDLLRGNVVIPSSFCE FT GVITLIPKSGCAENISGFRPISLLNCDYKIFTKILANRIQNVLGNILEEGQ FT TACVNSKSCVDSLDMLRKIIVKASGSKRFKTAIVSFDMERAFDNVSHDLLW FT SILEKFNFPDQFIACLRNLYRNATASVLFNGHLTNCFKIGRSIRQGCPLSM FT LLFVIYLEPLIRELYSSVTGILLDHSFIKVVAYADDVTMVVKTDYEFDNVL FT KIIEEXGKCSGIRLNSKKSSYVRINNCKLGPQILREETQIKVLGMLIVNNW FT KDMIDKNYSKIISACRGKLNSICIRNLNFVQKSWILNTFILSKCWYLGQIL FT PPDKRHIDELKKMVLRYFWGSQIFKVEAGQLYLPFERGGLALVEAEVKFKA FT LFIRNILYKNNVVNRKHYLIDNISTKNLTRNTYDWVNIAIEYEAYDFLNTS FT ALIYSSLLSKQSCEPKIQSKAPSPDWLSTWENFSENYMDSDAKSIIYEILN FT DIYPNKEKLCSYNVRGLNNAECEICXELDTNLHRITNCTNSKEIWEWVVEK FT VNVNMKANLPDPMEILSAYIGTKNRIFQAVYWIVSEAIVYNMRHYKNPSLF FT VFKSILRNKRWDQRQFFVRRFGKYAFVF" XX SQ Sequence 3520 BP; 1247 A; 549 C; 669 G; 1041 T; 14 other; acggacaaga caccaaaagg agaaagtggt aagccgcagc cacgtttaac gattcctaac 60 ctgctcaagg ttctggatgg tcgcagtcga tcaaggtcgg cgcaaaagga gaaacctaac 120 cccaaaaact aacaamaact aaccccaaaa agaaagaaca actaaccctt aacccawcat 180 gaactacacg tgcattatcg ccagtatcaa tctaaacgca attcagacca ggcttaaact 240 ctcacttttt cacgactttg taaagaacag taacgcagat attatttttk gtcaggaagt 300 agctttcaac aatttttcat tcattttcac ccaccatgcg atagttaacg tatgcagaaa 360 tcaaggcacg gctatwttga ttcgtaaagg tttkgattac gatgatatca tctgtgaccc 420 aaacagtaga attgtttccg tactggttaa tgggattaat tatattaacg tttatgggca 480 cgctggattc mwgctaaaaa aagaacggga aactcttttc gctgaaaata ttgccattca 540 ccttaataag gcagctgctc gagcgcacgt gctgggtggg gattttaatt gcwtattgga 600 tccgactgac accaaaggaa ataccaaaaa aaattgtaat agcttgaaag tactgacgtc 660 aaaaatggcc ctcaaggatg tagamaaagc tttaaaaaac caagcagtgc agtacacctt 720 twtgagaggt gattcagcat ctcgtattga cagattttat gcttcacaag atmttttaaa 780 atctgttttg aattttgaga cctggcctgt gatcttctca gatcataggg cgattttgat 840 gaaaatttcc attcaaaagt cagacttggg gacatcattt ggaagaggtt attggaagat 900 aaacacagca tttattaatg acgatgaatg tgaaaatgaa ttttgtttta cgtacgaagc 960 tctacaacaa cggcactcgt atgaaaataa ttttatgaac tggtggacga cccattttaa 1020 atccaaggtt aaaaacttct tcaagatgaa aagtattgaa ttcaatcgga acatagccat 1080 gcgcaaatct gaattataca aaacgttgat tgaattatcc tctagacaaa atgataatga 1140 agatgttagc agggaaatgg gcttaackaa atcattaatt tttaaaatag aatctgaaaa 1200 attaaaatat tatgaatcaa aactttcctc aaacagtatc ctcaccaacg aaaggacgaa 1260 catttaccag attattgcca ggcataaatt caaccaaagt agtagaattg agaagcttgt 1320 agatgggtgt aataaaattg tcggcacaga gaatattaaa aagtttttat tcaaccattt 1380 ttctgaaaat ttcaaaaagc acgacttttt agaacccagt tgcagatcat tgaattatat 1440 taaaaaaaga ctaaatcagg acgactcgaa ctcgataatt ggacctatta ccgaagaaga 1500 attaaagaag gtagtatgga gttgtacaaa gaagaaaacc ccgggccctg acggtatttg 1560 ctatgaattt tacgttaagc attttgattt aataaaaaat gatatgatta agttatttaa 1620 cgacttgctg cgtggtaatg ttgtaattcc ttcttctttt tgcgaaggag taataacatt 1680 gataccaaaa tctggatgtg cggaaaacat ttctggtttt cgtcccataa gcctgttgaa 1740 ttgtgattac aagattttta caaaaatact tgcaaatcgt attcagaatg ttttgggaaa 1800 tatccttgag gaagggcaaa ctgcatgcgt taatagcaaa tcatgcgtgg acagtcttga 1860 catgctgaga aaaattattg tcaaagcttc cggttcaaaa agatttaaaa ctgcaatcgt 1920 tagtttcgat atggaacgcg cattcgacaa tgttagtcat gatttattgt ggagcatatt 1980 agaaaaattt aactttccgg accagtttat cgcatgtcta agaaatttat accgaaatgc 2040 taccgccagt gttctattta acgggcattt gacaaattgt tttaaaattg gcagatccat 2100 ccgacaaggc tgcccactct cgatgctact tttcgtaatt tacttggaac cattgataag 2160 agagctctat tcttctgtta caggcatatt actagatcat agttttatta aagtggtagc 2220 gtatgcggac gacgtaacaa tggttgttaa aacagattat gagtttgata acgtgctgaa 2280 aattattgaa gagtwtggga aatgttctgg aattcgattg aatagtaaaa aatcttctta 2340 cgtacggatt aacaactgta aattaggacc acagattctt agagaggaaa cacaaataaa 2400 ggtacttggg atgttaattg taaataattg gaaagatatg attgataaaa attacagcaa 2460 aattatatca gcatgcagag ggaaattgaa ttcaatctgc attagaaatt taaattttgt 2520 tcaaaaatct tggattctaa atacgtttat tctttcgaag tgctggtatc ttggtcaaat 2580 tttaccgcca gacaaacgcc atattgatga attgaagaaa atggtgttaa gatatttctg 2640 gggttctcaa atttttaaag ttgaagcagg acaattatat ttacccttcg agagaggtgg 2700 attagcgtta gttgaggcag aggtaaaatt caaagcatta ttcattagaa atatcttata 2760 caaaaataac gttgttaata gaaaacacta tcttatagac aatatttcta caaagaattt 2820 aactcggaat acatatgatt gggtgaacat tgctattgaa tacgaagcct acgatttttt 2880 gaacacatcg gcactaattt atagctcgtt gttgtctaaa caaagctgtg aaccgaaaat 2940 ccagagcaag gcgccaagtc ctgactggtt gagcacgtgg gagaatttta gtgaaaatta 3000 tatggactct gatgcgaaat ctataatcta tgaaatttta aacgatatat atccaaacaa 3060 agaaaaatta tgcagctata atgtgagggg attaaataac gccgaatgcg agatttgcma 3120 tgaacttgat acaaacttac atagaataac aaattgtaca aattcaaagg aaatttggga 3180 atgggtagtt gaaaaagtga atgtcaatat gaaggcaaat cttccggatc caatggaaat 3240 cctttctgca tacataggaa caaaaaatag aatttttcag gccgtttatt ggattgtgtc 3300 agaagctata gtttataata tgaggcatta caagaatcca tcgttatttg tatttaaaag 3360 cattctcagg aataaaaggt gggatcagcg acagtttttt gttaggaggt ttggaaaata 3420 tgcatttgtt ttctaacagc gttgtgtatt taaagattta gatgtaccta catatgtaac 3480 ataaaacaaa agtgaaataa aattgtattt aaaaaaaaaa 3520 // ID Gypsy-17_DPu-LTR repbase; DNA; INV; 176 BP. XX AC scaffold_1168; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_DPu_; KW Gypsy-17_DPu-LTR; Gypsy-17_DPu-I. XX NM Gypsy-17_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-176 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 750-750 (2010). XX DR Genome; scaffold_1168; Positions 911 736. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 176 BP; 51 A; 41 C; 34 G; 50 T; 0 other; tgtaatagtc tcgaattcgt ctaagtctgt ctctatccct atttgtaaac tcttaagcgt 60 ctgtctagtt atgaattccc atgtgtatac aacccaagcg acgccaagct tggtgagaaa 120 gagaagctga aatatacgac agttatcact cagactagct ctcgagcgga ctacca 176 // ID CR1_Ele23 repbase; DNA; INV; 6333 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele23. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6333 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6333 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 13 sequences with >98% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 1354..1884 FT /product="CR1_Ele23_1p" FT /translation="MSCNKRQKVIDDADRVNCRGYCGNSYHMICVMLDYSL FT RDILKGHEKNLFWMCDGCAEMFSSDHFRNIASRCTNGHVPDESSFKTLKED FT IAGLKEIVRNLSSXIDSKPVTPVMTTSWPVPNRMSGLNPVPNTHKKKRENS FT QLKEKPSNIRGTKAASELVKTISPPEELFWLYLYIEYI" FT CDS 2086..6159 FT /product="CR1_Ele23_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KSSKKPTSSHQCDIREKSTPCPVIHQPPGRTXASSLL FT EASDPPITVEFFLPATCSRPGPVYGVFRISNAGKFTVLPNHPCPEVLSASR FT PYPSNVLLRPSFAWIPGRTPDLSFMEASNPPNTVEPFPPAIHSRPGPVFGI FT GGEVFQNTSAGKYFTSTHNTCSETFITSRTRGLSVSGLGFTPILGSMEASH FT PPIKIQSSLPTFNRGPGLVLGSKGSSLASFLVVSDPHQPERTACSIEEVPE FT PLDPVAPFVSSHHSRSGPVVEYGDGIFRPPIAGKYQTLWNNSLPDMPLRFS FT VPHGSIDSLNNIRERINCTSHPLLGPMFDASLQPERTACSTKEVPKPLGSV FT ALAGSGHHSRSGPAAECGDGGFRPFTSGKSPSDGNISSTDARSCFSKHXTP FT NTLRIYYQNVRGLRTKIDDFFLAVSENEYDIIILTETWLNDVIYSAQLFGD FT HYRVYRNDRSSVNSCKSRGGGVLIAISSNFNSFREPASVHDSIEQLWVRVD FT MHQVIVSIGVLYLPPDRKNNLNDIRNHVDSIGSVISRLGPNDLALQFGDYN FT QSTISWTSSETGPPSIDLDRSCLSVASSALFDGFNLHGLTQFNGIKNTRGR FT VLDFVLLNDAALPTCTVRRALEPLVPIDDSHPPIEVDVCCLAPVSFEPEFD FT ATCLDFRKADYTVLNAALLSVDWSFLNSADNVDDAVEYFTNSVNRIVASHV FT PVIRPAPKPIWSNNRLRSLKRRRSAALRRYCNHRSAYLKQQFNTISREYRE FT YNKLSYLRYTTRTQQNLRSNPKKFWAFVRSKRKEDGLPTSMFYGSNTASSA FT AEKCELFAAQFSAAFNDSAATPSQVEDAVHDTPIDVLNFTTFRIDAENVTK FT AIRKLKSSYSAGPDGIPPALLKRCLPALHEPLAVIYNLSFQREQFPSRWKH FT SFLFPIYKKGDKRDVSNYRGITSLSACSKVFEIIVNDVLFNCCSHYISIDQ FT HGFFPKRSVTSNLAQFVSTCLQAMDDGKQVDAVYLDLKAAFDRVDHRILLK FT KLERIGISQDCIEWFRSYLTGRSLCVKIGSCHSTHFRNVSGVPQGSNLGPL FT LFSIFVNDVSLILPPGVKLFYADDAKLYVVVNCIEDCIQLQTLLCKFERWC FT VHNCLTLSIEKCQVITFSRKRKPITFQYSLSDKFLERVQRVRDLGVILDEK FT LTFRFHYDDIISRANRQLGFIMKVTTDFQDPLCLKALYCALVRSILEFADV FT VWCPFQTTWISRIESVQRKFVRFALRTLPWRNDQHLTSYHDRCQLLGIDTL FT ENRRRIAQAMYVVKLLNGTIDSSALLARINFNAPERSLRRHSFLRLEGRNS FT RYGQHDPIRFMSNTFNSVAHLFDFNMPVSAVQQRFSSHFRTNQADQ" XX SQ Sequence 6333 BP; 1649 A; 1526 C; 1305 G; 1843 T; 10 other; taagagtgta ttaggagttg gttactctgc cttcactatt agaggtcatt catactcttt 60 ctagtagcaa atcattatat aaagatctct gccttctcta tcagagacct tacatgatct 120 atttttggag cttcgccttc tctatcaaag tccaactcat aagaaactct gccttctcta 180 tcagagcctt acataattat tttggtacat tgccttctct atcgaagtcc aatataaaag 240 gaattctgcc ttctctatca gagccttaca tgwcattttt gaaacttcgt cttctctccc 300 ttacattaga tactccacct ttgttagaaa gccttggata actgcagaac agcatgagaa 360 attgaattgc ttgccatttc atttcgtttc atttcagttt cctaattcaa ttttctgtaa 420 cattgttcaa acatcatctt gaaaatatta aaaacctcac aaaaattgct tgcttcataa 480 cttgaataca tttctcaaac ttcaaacatc tcctaccaat tgatattcgc ttgtgggggg 540 agactagtct gcgccattgt tgtgaccgga ggatkgttct tcatatttct agtaattttc 600 acgtctttgc ttcattacgt tcgttgaaag actgaataat tttattcatt gtcgtgacat 660 ttctctatct caaagcctcc tatgttcttt attttctgga ttatttggat acgctattcc 720 tagctctatc cttctcaaca cacagcactg ttcagagtcc cacgtcctag tgccagacsa 780 tgatcagccg ctcctaacat ggagaacaga cgctgctttg agccgcttct aacatggaga 840 acagacgctc tgataagcta caacctcaga agagaggagc cccccttccc tgtcagcata 900 cgaccaaggt cccaccaggg ttggttaccc gatcttccct acggttactc gtaccccagg 960 cggcaccacg gctcgtttgt ttatcaacag tcgtgtacgg taccgagtga atattttatt 1020 tacgttttgt ggacgcacat agtgatttta aatcgttatt tgtcgtttat atgttggaca 1080 accgatcgtt ttgtcgttgc gtgtaaagtg cagtgctaat atcgaattga agtagtgatt 1140 cstggtttat tttgtctctg gagatatttg gtgataagtg gtgactgtgt ccattttcct 1200 tgtggtaaaa agttgaacct agggttgcat acaacacgcg atctgtaaaa gcttttctat 1260 cgcatacgct gtgaagtata cccaaaggcg ctttgattct ggtcgccagt acagtattct 1320 ctcctgttgt acgggtttca tcgaaccgag atcatgtcat gcaacaagcg ccagaaagtc 1380 atcgatgatg cggatcgagt taattgccga ggatattgtg gaaattcgta tcatatgatt 1440 tgcgttatgc tggattattc tctccgtgac attctgaagg gccatgagaa aaaccttttc 1500 tggatgtgcg atggttgtgc tgagatgttc tccagtgacc actttcgaaa cattgcgtct 1560 cgttgtacta atggccacgt tccggatgaa agttcsttca aaacccttaa ggaagacata 1620 gcagggttaa aagaaatagt taggaacctg tcgtccawaa tagactcgaa gccggttact 1680 cctgtgatga ccacatcctg gccggttccg aatcgaatgt ccggattaaa tcctgttccg 1740 aatacacata aaaaaaaacg cgaaaatagc caactaaagg aaaaaccttc taatattcgt 1800 ggcactaaag cagcgtccga actggtcaaa acaatctcgc cgccagagga gttattttgg 1860 ctgtatttgt acatcgagta catctgataa ggacatcgtt accttcgtga agaactgtat 1920 ggatctcacc gacatggagc cataagttgt aaggcttgtt ccgaaagata aagaccctac 1980 aactttgagt ttsgtgacat tcaaagtcgg ggttaacaaa gcactcaaag acttggcgct 2040 ttccagcgaa atctggcccg aaaatgttta tttccgggaa tttgaaagtc atccaaaaaa 2100 ccaacgtcga gtcatcagtg tgacatcagg gaaaagtcca caccatgtcc agtaattcat 2160 cagccaccgg gacgcacast tgcctccagc cttttggaag cctctgatcc gcccatcaca 2220 gtcgagtttt tcctgccagc gacctgcagc cgtcccggtc ctgtgtatgg ggtcttccga 2280 atctcaaatg caggcaagtt tacagtctta ccgaatcatc catgccctga agtgttatca 2340 gcttctagac catatccttc caatgtctta ctccgacctt cctttgcttg gatcccggga 2400 cgcacgcctg accttagctt tatggaagcc tctaacccac ccaacacagt cgagcccttc 2460 ccgccagcga tccacagccg tcccggtcct gtgtttggga tcgggggaga ggtcttccaa 2520 aacacatcgg caggcaagta cttcacatca acgcacaata cctgctctga aacgtttata 2580 acttccagaa ctcgtggcct gagtgtatct ggcctaggtt tcacgcctat tctcggatcc 2640 atggaagcct ctcacccgcc gatcaaaatc cagtcttccc taccaacgtt caatagaggt 2700 cccggtcttg tgttggggag caagggatca tctctagcaa gttttctcgt cgtgtcagat 2760 cctcatcaac cagaacgcac tgcctgcagc attgaggaag tccctgagcc tctcgaccca 2820 gtcgcgcctt ttgtatccag ccatcacagt cgttctggtc ctgtkgttga gtatggtgac 2880 gggatcttcc gcccacctat agcaggcaag taccaaactc tatggaacaa ttcgttgcct 2940 gatatgccat tacgttttag cgtgccacat ggatcaatcg attcactcaa caatatccga 3000 gaacgtatca actgtacgag tcatcctctt ttgggaccaa tgtttgacgc atcccttcaa 3060 ccagaacgca ctgcctgtag cacgaaggaa gtccccaaac ccctcggatc agtcgcgctc 3120 gcaggttccg gccatcacag tcgttctggt cctgcagccg agtgtggtga cgggggcttc 3180 cgccctttta cttcaggcaa gtccccttct gatggtaaca tttcaagtac tgacgcacgc 3240 tcgtgtttta gcaagcatcm gactccgaac acactgcgca tatattacca aaatgttcgt 3300 gggctccgga caaaaattga cgacttcttc ttggccgtca gcgaaaacga gtacgatatc 3360 atcatcttga ctgaaacctg gctcaacgat gtaatatact ctgcacaact gttcggagat 3420 cattatcgtg tgtatagaaa cgaccgcagc tccgtcaata gttgcaaatc tagaggagga 3480 ggagtactta ttgccatctc ttcaaacttc aacagttttc gtgaacctgc cagcgtacat 3540 gattccattg agcaattatg ggtgcgagtt gatatgcatc aagtcatagt gagcattggt 3600 gtcctttacc tgccgcctga ccgcaaaaac aatctcaatg atattcgcaa tcatgttgac 3660 tccatcggat cggtcatctc tcgccttggt ccaaacgatc tcgcccttca gttcggggat 3720 tacaatcagt caacgatctc gtggacttcg tcggaaactg ggcctccatc gatagatctg 3780 gatcggtcgt gcttatcagt tgccagtagt gctctgtttg atggtttcaa tctccatggt 3840 ttaacacagt tcaacggaat taagaatact cgaggccggg tgcttgattt tgtactttta 3900 aacgacgcag ctttgcccac gtgtactgtg agaagagctt tagaacctct tgtacctatt 3960 gatgatagcc acccgccaat agaagttgat gtttgctgtt tggctccagt ctctttcgaa 4020 cctgaatttg atgctacttg cctcgatttc cgcaaggctg actatactgt gttaaacgcc 4080 gcgttgctct ctgttgactg gagtttcctc aactcagctg ataatgtcga cgatgctgta 4140 gaatatttca ccaactctgt caatcgtatc gttgcatcgc atgtccctgt aatcagaccc 4200 gctccgaaac caatatggtc taacaacaga ttgcgctcac tgaagcggcg tcgttctgca 4260 gcgcttcgta ggtactgcaa tcaccgctct gcgtatctca agcaacagtt caacactatt 4320 agtagagaat atagggaata taacaagctt tcgtacctac gttacaccac acgcactcaa 4380 caaaatttgc gctcaaatcc gaagaagttt tgggcttttg ttagatcaaa gaggaaagaa 4440 gatggactac ctacatcgat gttctacggg tctaacactg ctagttcagc tgcagaaaag 4500 tgcgaacttt tcgctgccca attcagcgcg gctttcaacg actccgcagc aacaccctct 4560 caagttgagg acgctgtaca tgacactccg atcgatgtac tgaacttcac cacctttcgg 4620 atcgatgctg aaaatgtaac caaggcgata agaaagctga aatcctcgta ctccgctggg 4680 ccagatggaa ttcccccggc gttgcttaag cggtgtcttc ctgctttaca tgaaccgctg 4740 gcagtaattt acaatctgtc ctttcaacgc gagcagtttc cgtcacgttg gaaacattca 4800 ttcctgtttc caatttacaa aaaaggggat aaacgagatg tgagcaatta ccgcgggata 4860 acttcacttt ctgcttgttc gaaagttttt gaaattattg taaatgacgt gctgtttaat 4920 tgttgcagtc attacatttc aattgatcag catggttttt ttcctaaacg atctgttaca 4980 tctaatctcg ctcagttcgt ctcaacgtgt ctgcaagcaa tggacgacgg aaagcaggta 5040 gacgctgtgt atcttgatct gaaggctgct ttcgatcggg ttgatcaccg aatactacta 5100 aaaaaattgg aaaggattgg aatatcccaa gattgcatag aatggttccg atcatacctt 5160 actggcagat cactgtgtgt taaaatagga tcctgtcact ccacacactt tcgcaatgta 5220 tcaggagtac cgcaaggcag taatctcggt ccacttctat tttcaatctt cgtcaacgat 5280 gtgtcgctaa tactaccacc tggagtaaag ctgttttatg cggacgatgc caaactatac 5340 gttgttgtca actgtattga agactgtatc caattgcaga cactcttatg taagtttgag 5400 cggtggtgtg tccataactg cctcacgtta agcatcgaaa aatgccaggt cattacgttt 5460 agcaggaagc gcaaaccaat aacgttccaa tactcactct ccgacaaatt cctggagcgg 5520 gtacaacgtg tacgagatct cggcgtcatt ttagatgaaa aactcacatt ccgttttcat 5580 tatgatgata tcatatcacg ggcaaaccga cagctcggat ttattatgaa agttaccact 5640 gattttcaag atccgctttg cttgaaagcg ctgtactgcg cacttgtgcg gtccatactt 5700 gagtttgccg acgtcgtctg gtgtccgttt cagactacct ggatatcgcg cattgaatca 5760 gtgcagcgaa agtttgtacg ctttgcacta aggactcttc catggcgtaa tgatcaacac 5820 cttacctctt accacgatcg ctgtcagcta ctgggaattg acacgttaga gaatagacgt 5880 cgaattgcac aggccatgta tgtagtgaag cttctaaatg gaaccataga ttcttcagca 5940 ctcctggcca gaataaattt caatgcacct gaaagatccc ttcgaagaca cagtttcctc 6000 cgattggaag gccgcaacag tcgttacgga cagcacgatc cgattcgctt catgtcgaac 6060 acgttcaata gcgttgctca cctgtttgac ttcaacatgc ctgtctctgc agtacagcaa 6120 cgattttcgt cgcatttccg cactaatcaa gccgaccagt aatttgtttt gtatgtgtac 6180 taagttagtt ttgtgtttta cactgtagtt gtttattgtt ttagttcatt cgtgtttact 6240 ccaatgcaga ttttttttat atacttcatt aagacataag tcagatggag attgttatta 6300 ttaataataa ataataataa ataaataaat aaa 6333 // ID CLAI_CT repbase; DNA; INV; 112 BP. XX AC M24188; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 20-SEP-2005 (Rel. 2.02, Last updated, Version 3) XX DE C.thummi ClaI repeat. XX KW Nonautonomous; CLAI_CT; ClaI repetitive sequence. XX NM CLAI_CT. XX OS Chironomus thummi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RP 1-112 RA Schmidt R.E., Godwin A.E., Keyl G.H. and Israelewski N.; RT "Cloning and analysis of ribosomal DNA of Chironomus thummi piger RT and Chironomus thummi thummi."; RL Chromosoma 87(4), 389-407 (1982). XX DR GenBank; M24188; Positions 1 112. XX SQ Sequence 112 BP; 50 A; 11 C; 13 G; 38 T; 0 other; tatatttttt tgaaaagagc ctactatgat cgaaaacaaa aaatgcttta aatagcattt 60 taaaacatat ctggtattat aaatgataaa gaatgcaaat ttcgtataaa ta 112 // ID Gypsy-2-LTR_DP repbase; DNA; INV; 323 BP. XX AC . XX DT 17-MAR-2009 (Rel. 14.03, Created) DT 17-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2-LTR_DP. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-323 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Daphnia pulex."; RL Repbase Reports 9(3), 658-658 (2009). XX DR [1] (Consensus) XX SQ Sequence 323 BP; 69 A; 72 C; 77 G; 105 T; 0 other; tgtcacgttt caataacgtg gcaattgatt ggtttcccag aacagacatt tggctgttcc 60 agggtcgacg tttcccattt gattctgatc ctgggaagtt gacgattcct cgacgcttca 120 caataatagt tcttggaaac gagacgacgt tgacggagga tacaaaaggg gagcatgttt 180 ctctagttag ttagttgttg ttcactgttc ttgtctgtgt gtcggtggtg gagtcgcgac 240 ctccagactg ttttcttttt cttttatgaa acagcttcaa gtcgcacttt ccaaccgaac 300 gtgttagccc ctctcccgtg aca 323 // ID Copia-96_AA-LTR repbase; DNA; INV; 172 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-96_AA_; KW Ty1_copia_Ele181; Copia-96_AA-I; Copia-96_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-172 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 172 BP; 50 A; 38 C; 39 G; 45 T; 0 other; tgatggcagc cctgcgccat tcagcatcgc gcgccaatcg agtgtaaaag gagctgactg 60 aatgaagctc aaaagtgaat ggaaagaagg aaatgaataa agctcattca agtgttttgc 120 ctcgatctgt aaagacgtgt tttactgttc tctctgctaa tttacccctc ca 172 // ID Gypsy7-I_Dmoj repbase; DNA; INV; 5762 BP. XX AC scaffold_6500; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7_Dmoj; KW Gypsy7-LTR_Dmoj; Gypsy7-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-5762 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1062-1062 (2009). XX DR Genome; scaffold_6500; Positions 24725693 24731454. XX CC Positions [2543-3004] - Reverse transcriptase CC Positions [4191-4661] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1532..4249 FT /product="Gypsy7-I_Dmoj_1p" FT /translation="MSIDEETRNQEEINYHDYMQKEYYNQLASYSQYYPQY FT YQEQSYPQTSDYTYTQPATTEQETEQPNEIEIEKAENFSAASLGNQQYIII FT TYKGKNIKCLIDTGSTVNMMTYNVFNLPIHDTDCLIQTSNGPLTITKKVNV FT LSNNLFKSNNEFLLHQFSPDYDLLIGRKLLSDAQAVIDYKSQTVNLYNKKY FT KLINDIPITNQNHFVAQPILETYQHLNNPIRDNSKNQESINSSPYRLDHLN FT EEEKFNLISLLKQYQHIQFKEGDQLSFSNQEKHVINTTHNSPLYTKPYSYP FT QAYEQEVEHQIQDMLDQGIIRPSNSPYCSPLWIVPKKQDASGKQKFRIVID FT YRKLNDVTINDRHPIPNMDEILGKLGKCNYFTTIDLAKGFHQLEMHTDSIK FT KTAFSTKNGHYEYLRMPFGLKNAPATFQRCMNNVLRPLINKHCLVYLDDII FT VYSTSLKEHLHSLQLVFEKLSEANLKLQLDKCEFLKQETSFLGHVITKDGI FT KPNPDKIKAILKYPIPTKTKEIKGFLGLTGYYRKFIPNFADIAKPMTTVLR FT KDAKIDTKNPEYIRAFEKLKQLISNDPILKIPDFKNKFILTTDASNVALGA FT VLSQDGHPISFISRTLNDHETNYSAIEKELLAIVWATKTFRHYLLGRHFEI FT NSDHQPLCWLYKMKEPNSKLTRWRVKLSEYDFDIKYIKGKENHIADALSRV FT KIEQTYFGEETQHSADEDNSNLICITEKPINTFKRQIIFSKGIPNITKQNY FT FKKLIIHITYDNMTKDKAEKYLIDYFCTKTSAMFIDSDADFEIIQNAHKEI FT INTSHTKILRSLIQLKNINSYAEFKELILKTHEKLLHPGIQKTNKLFSETY FT YFPNSQLLTQNIVNECNICNLAKTEHRNTNMPMKITPRPEHCRDKICNRHL FT FL" XX SQ Sequence 5762 BP; 2381 A; 1133 C; 748 G; 1500 T; 0 other; taaataaata ctgcataaat acgtgtcaat agacctacaa ttgttagtct taagttgata 60 ttcaataaag caagttgtca taatccaacc ttgacttcgt cttttaaatc atattagcgt 120 tgctgttagt atatgtcgca gcatttattc gactcgtaac agttttatct tagtggcgca 180 gccggtagga tagtaataaa ttgtggatcc tgatgcaata actaagttat taattcttgt 240 gtcccgttta ctgctatccg ctaaacaaca aacgcgaagg caacccacgc gatggtcaat 300 atcgcgtcaa gttttaaaat aaaacctcga cagtgacgtg tataccgaaa ttcgcatatt 360 gccaaagccg cggattgtgc aaaaaacttg aaaataattc gagaagacgc tatagaattc 420 gaagagaata taattccaga aacatttggt tttctcgcaa accttcgaaa gtgctatcag 480 tgcatacaaa ccccaacata cgtactaaat aatcaataaa aataaaataa accgacaaca 540 aataataaat aaaatataat tataataaat taaataaata aacaaaatgg aaggcgacgc 600 aacaacaccc ctttctgata ccaacatgac ccaagtgatt aatcaactta aaaatgttga 660 aaagttcacc ggaaataaaa acgctttata cacgttcatc aaccggatcg attatatact 720 ggccctgtac accaccaacg atgcgaggca gagatcaata ctctttggac ttgttgaaag 780 ctgcatcagc agtgaagtca tgttatctct tggcatgacc aacctaacta catggacgga 840 tctccgaaaa cagctaatcc taaacttcaa gacgcagacc aagaaccata ttctactgga 900 ggaattcagg aacactccgt tcaaaggcaa cgtacgagct tttctggaag aagcagaaca 960 ccgtaggcag acattgatga gtaagctaga attagaaaat aatcaagatg aaaaaatact 1020 atatgatcga ttaattaaaa caagtattga cgacctaatt caaaaacttc ctaccaatat 1080 ttgtattcga atcataaatt gcgaaattcc agatttaaga tcactaatta acatattaca 1140 agaaacaaat ctttttgatg aaccctgtca aaatttaaat caaaacaaag taaagcaaga 1200 taaaaatttt aagccaactt atccaaatcc aaatattcaa aagtttccca catttcttca 1260 accaaaccat ccttatgtcc catatcaacc ttattttcaa caaccatatc ttccacctaa 1320 tcattattca cctcgaccga tgcaaccaac gcctaaccca gttagaccat ttaaccctac 1380 acttagacca attttcagac aaaatcaatt tgacaataat cgtttttgac aacatacaca 1440 aacaactcca caacaaactc ctttcaatag ttttggcgta gccaaccaac caacaaagcg 1500 tgttagacaa gcagaaagtg aacaaagtaa aatgagtata gatgaagaaa ctagaaatca 1560 agaagagata aattatcacg actatatgca gaaagaatat tataatcaat tagcttctta 1620 ttcccaatac tatcctcaat attaccaaga acaaagttat ccacaaacct cagactatac 1680 atacacacaa ccagccacaa ccgaacagga aacagaacaa ccaaatgaaa tagaaatcga 1740 gaaagccgaa aatttttcag ctgccagcct cggaaaccaa caatacataa ttatcaccta 1800 taaaggaaaa aatataaaat gtttaataga tacaggatct actgttaata tgatgacata 1860 caatgttttt aatttaccaa ttcacgatac cgattgtctt attcaaacca gcaatggacc 1920 tttgactatc acaaaaaagg tgaatgttct ttcaaacaac ttgttcaaaa gtaataatga 1980 atttctttta catcaatttt caccagatta cgatctacta attggtagaa aactgctttc 2040 agacgcacaa gccgttatag attataaatc ccaaactgtt aatttataca ataaaaagta 2100 taaattaatc aatgatattc cgatcacaaa ccaaaatcat ttcgtagctc aaccaattct 2160 tgaaacttat caacacctca acaacccaat cagagataat tctaaaaatc aagaatctat 2220 caactcaagt ccttaccgac tagaccattt aaatgaagaa gaaaaattta atttaataag 2280 tttattgaaa caatatcaac acattcagtt caaagaaggc gatcaactta gtttcagtaa 2340 tcaagaaaaa catgttatta acactactca caattcacct ttgtatacaa aaccatacag 2400 ttatccacaa gcatacgaac aagaagttga acaccaaatt caagacatgt tagatcaagg 2460 tataatacgc cccagtaatt ccccttattg tagtccttta tggatcgtac cgaaaaaaca 2520 agacgcttca ggtaaacaaa aatttagaat cgtaattgac tatagaaaat taaacgacgt 2580 aactataaat gacagacacc caataccaaa catggatgag attttaggaa aactaggcaa 2640 atgcaattat ttcacaacaa tcgaccttgc aaaaggtttc catcaacttg aaatgcatac 2700 agattccatc aaaaaaacag ccttttccac aaaaaatggc cattatgaat atcttcgtat 2760 gccatttgga ttaaagaatg cacctgcaac ctttcaacgt tgcatgaata atgttttgcg 2820 accgttaata aataaacact gtttagtata tcttgacgat ataattgtat attcaacatc 2880 tttaaaagaa cacctacatt cactgcaatt agttttcgag aagctttctg aagcaaatct 2940 caaattacaa cttgacaaat gtgagtttct aaaacaagaa acttccttcc taggacacgt 3000 cataactaag gatggaataa aacccaatcc tgataaaatc aaggcaattc taaaatatcc 3060 tattcccaca aaaactaaag aaataaaagg ttttctcgga ctaaccggtt attatagaaa 3120 gtttattcca aattttgcag acatagccaa accaatgaca acagtacttc gaaaagacgc 3180 aaaaatagac actaaaaatc cagaatacat tagagctttt gaaaagctaa aacaacttat 3240 ttccaatgac ccaattttaa aaattccaga tttcaaaaat aaattcattt taacaactga 3300 cgcaagtaac gtagcccttg gagctgtact ttctcaagac ggacatccta tcagttttat 3360 tagcagaacc ctcaatgatc acgaaaccaa ttatagcgct atagaaaaag aacttttagc 3420 tattgtatgg gcaacaaaaa cctttcgaca ctaccttcta ggtcgacatt ttgaaatcaa 3480 tagtgaccat caaccattat gttggttgta taaaatgaaa gagccaaatt caaaattaac 3540 ccgatggaga gtaaaattat cagaatacga ttttgacata aaatacataa aaggaaaaga 3600 aaaccacata gctgacgctc tatcgcgagt aaaaatagaa cagacttatt tcggagaaga 3660 aacacaacat agcgctgacg aagataacag taatttaatt tgcattacag agaaaccaat 3720 aaacacattt aaaagacaaa ttatattttc caaaggcatt ccaaatatta caaaacagaa 3780 ttacttcaaa aaacttataa ttcacatcac atacgacaac atgactaaag acaaagccga 3840 aaaatattta attgactatt tttgtacaaa aactagtgca atgtttatag acagtgatgc 3900 agatttcgaa atcattcaaa acgcacataa agaaattatc aacactagcc acactaaaat 3960 tctacgaagc ctcattcaat taaaaaatat aaactcatat gccgaattta aagaacttat 4020 tttgaaaacc catgaaaaac tcttacaccc cggaattcaa aaaacaaata aactcttcag 4080 tgaaacttac tatttcccca acagccaatt actgacacaa aacatagtaa acgaatgtaa 4140 catttgcaat ttagcaaaaa ccgaacacag aaacacaaac atgccaatga aaattactcc 4200 cagacccgaa cattgtcgcg ataaaatttg taatagacat ttattcctct gaaggcaaac 4260 attatcttag ttgcatagac atttactcaa aattcgctac tcttgaaaat attaaagcaa 4320 aagactggat agaatgcaag aatgcattaa tgcgtatatt caatcagtta ggaaagccaa 4380 aattattaaa agctgaccga gacggcgcat ttaccagctt agcactcaaa aaatggcttg 4440 aaaatgaagg agtagaatta caattaaata ctacaaaaac aggaatagca gacatagaac 4500 gactacacaa aacaataaac gaaaaaatta gaataatcaa cactttaaat gacgaagaaa 4560 ataaactcag caaaatagaa accatacttt acatttacaa tcacaagact aagcatgaca 4620 caactggaca aactcctgca cacatatttt tatatgcggg acaaccaaag ttaaacactc 4680 aagaaaataa agaaaaagta ataaataaaa tcaacagtga ccgacaagaa tacgaagtcg 4740 ataccagata tcgaaaagga ccattacaaa aaggaaaact agaaaacccc tttaaagcca 4800 ataaaaatgt tgaaaaaata gatccagatc attaccaaat tactaacaga aatagaatta 4860 cacgttacta taaaactcag tttaaaaaat taaagaaaat taaccaaatt caaattccac 4920 aggtttctac ttcataatca tactatcatt tcccatatcc caattgactt tcaatccatt 4980 aatacaatca aaattatccc atatcctgat cataatggct atcaactcga ttacattagc 5040 acaacttctt attatgagaa agataataaa atctttaatt aagccaaata ccattgtaac 5100 ttggaacata acttttacaa ttgtaaaaca aaattgccaa aaaaaaaaaa aaaaaaaaaa 5160 aaaaaaaaaa aaaataatgt gagtcctaat attcagggta acaaaattta tagatatcaa 5220 tcaatgtaat ctgcaaataa acgacataat tgtaagctgc ataactattt tcaactagaa 5280 gttgacctta caccattgta tcccccaata gaaattatta aagtaaaacc tataaatcat 5340 gatgacattg taaaaattgt atcacaaaat aatactttaa catatacaat cattgtaatc 5400 acaataacca tattcataat tgtaatccta ttttttaaat atgtatcttg taaccccata 5460 aaattatttc gaacaaaatt taaaaaaaca aattcacgag cccaacaaac aaacaacgcc 5520 atcagtcgaa acagaagagc aaaaacagga aaacacgatc atcgagatga cacaatcaac 5580 tttaccaaaa ctttacccgt ctgtgtcacc ttgactcgca ggctcgcctc ttaaggagag 5640 ggaagtgaca taatcatact gattacgtct acaatattca gaagctgtat ggtcgcgtca 5700 ttaagctgat tcgtctttgc aaggcgcgca gcttcgttga cctaggtcta gtattcggat 5760 ga 5762 // ID R1-2_DGr repbase; DNA; INV; 5678 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE grimshawi. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-2_DGr. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-5678 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. D. grimshawi contains two subfamilies of R1. XX FH Key Location/Qualifiers FT CDS 217..1761 FT /product="R1-2_DGr_1p" FT /translation="RCFTVALIFAFALHLPSLCEKGTFFRVRVNLLSVLTS FT VCVNKSIYRLSRVVMPPRKKRPAELVAEGALSVSSDDDTSLSSDASSDVVI FT RRRGVLTKQRLRKATSETAVNSAGKNVDGRRSESNLMVTLKVPREKGEACN FT SGTPGVSAGVPAVEEPIAARSASNALSLKRKGTPIEKMKAISKELVECALA FT ECKTGTLVGSVLGYATQYEELLFTLIAENERLKGRLEAVRFNATNEHAGHV FT TAGQTRVSPAAVSPRGPMSTMLSAPEMPRPVETWSLVVRSKTAGKTAKDVV FT EKVVKEVGPTLGVRVHEVKPLRDGGAIIRTPSAAERKKIAGNAKFSEVGLE FT VSVKERLGPKVVVQGVHAVISPDEFMAELYELNLKDKMSKEAFTKGVRIAS FT KPWSQEGSAAVNVVLEGAGLAMQTLLDAGRCYVKWFSFRVRNFDPVVGCYR FT CLGFDHKVAECRLKQDVCRRCGQMGHRVAQCSNALNCRNCAFKGRPSEHLM FT MSPACPVYSAIVARASARH" FT CDS 1706..4831 FT /product="R1-2_DGr_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="CLRLALCTVQLWRGRVLDINMSSRLLQLNCQKSYAVM FT CDLGDMIVRRGSVVALLQEPYVANGCVRGLPAGMRVFPDSRANSAVVVNDV FT SIECTVVNSTDWGVCVSLSGNFGRLYVASVYCKFGDPLEPYIAYMDEVLLL FT AGSVPFILGIDANAPSPLWFSKISRSARYLNRSRGEVLAEWAVSQDVRVVN FT EPSEWYTFAGPMGQSDIDVTLANVAATSVFGFQWSVLGGHGVSDHNPIEIV FT ITHTSTTRESDGGNRWRTCGANWPLHGIFVSEAATQVPLSTFSAMNVDEQV FT VCVNRWVTCANDRLFERHRKVNLKRVKWWSHELSVKRRSVRSLRKRFQRAR FT SANAENAGQLRLAYSQCMNEYKQMLVRVKEDEWRSFLERNKDDPWGRAYKV FT VRGRRREADVSGLRVGDVQLTAWSDCMNVLLNEFFPRADHQNLPPSVVGDV FT DPLLDSELEVAFSMLKSRKSPGMDGFTGEMCKSVWKSIPDYMNVMYGKCMN FT EGYFPNEWKCARVIVLLKSPDRVRSNPRSFRGISLLPVLGKVLERVMVERL FT QERVSSQMSDRQFGFRKGRCVEDAWRFVSDSVESSNSRHILGIFVDFKGAF FT DHLSWPSVLERLSECGCRELAIWESYFSGRRACAVGRHESVSLNVVRGCPQ FT GSICGPFIWNLMMDTLLWQLERVCKCCAYADDLLILVEGQSRADVEASAAT FT YLRVVYEWGLRVGVSLAMDKTVTMLLKGRLSRSRPPLVRLNGVSLRHVSEV FT KYLGIVFGERMCFTPHIAYVKGRLLSLVGQVRRILRSDWGLSRSAARTVYD FT GLFVACATYGSSVWCKAVLTVVGRKNVLACQRVMLLGCLPVCRTVSTEAMQ FT VLLGVAPLDLEIRRRSLSYRIKRRLPLLQNEWLADRDVESLGLSECKKLLN FT ECVLSDWQVRWDTSENGRVTHRFIREVTFAVDRPDFRLHLSFGFLLTGHGS FT LNAFLHSRRLCDSPECPCGWVGETWEHVLCECPLYADLRDLSVLGITRGIS FT GYDVSQVLSTCEGVRRMSEFARAAFARRRLIRGEVG" XX SQ Sequence 5678 BP; 1246 A; 1129 C; 1803 G; 1500 T; 0 other; cagtacgagt tcagacttgg gaacgaacgg acgtgtcttt gctgtcgcgg ttaaacagag 60 ataacggttc ttaaataacg ggatcttgtt tacatttcgc caatttactg ttgcgcttgt 120 tttttgcatt tgctttcgca tttgccttcg ctgtgcggaa agtgtgaata acggtgcttt 180 gagtagcggt actttgtgta gcggtgctct gagtaacggt gctttacagt tgcgcttatt 240 tttgcatttg ctttgcatct gccttcgctg tgcgaaaagg gtacattttt tcgtgtgcgt 300 gtgaatcttc tgtcagtgct tacgagtgtg tgtgtgaaca aatctattta tagattgtca 360 cgtgttgtga tgccgccacg caagaagagg ccggcggagc ttgtagcgga gggggcgctg 420 agtgtgtcgt cagatgacga cacatcgctg tcctctgatg catcaagcga tgtcgtgata 480 agaaggaggg gggtgctgac taaacagcgg ttgaggaagg caacttccga gacggctgtg 540 aattcagcag gtaagaatgt ggacgggcgg cgctcggaga gtaaccttat ggttaccctt 600 aaagtgccta gagagaaagg tgaggcgtgc aattcgggca cgcctggtgt gagtgccggt 660 gttcccgctg tggaggagcc cattgctgcg cgttcggcga gcaatgcgct atcattgaag 720 cgaaagggaa cgccgataga aaaaatgaag gcgataagca aggagctggt agagtgtgca 780 ctcgctgagt gtaagacggg gacactggtc ggtagcgtgt taggctacgc gacgcagtat 840 gaggagctgc tgtttacgct gatagcggaa aatgagcgcc tcaaggggcg tttggaggct 900 gttcgcttta atgcgaccaa tgaacatgct gggcatgtga cagctggcca aacgcgagtt 960 tcgccggcag ccgtgagccc aaggggtccc atgtcgacaa tgttgtctgc cccggagatg 1020 cctcggccag tcgagacctg gtccttggtc gtgcgcagca aaactgctgg taagaccgcc 1080 aaggacgtgg tcgaaaaggt agtgaaggag gtagggccta ccttgggtgt gcgcgtgcat 1140 gaggtgaaac ctctgcggga tggaggagcg attattcgca caccatctgc tgccgagcgg 1200 aagaagattg cgggtaacgc aaaattcagc gaagttggtt tggaagtgag tgtgaaagag 1260 agactggggc cgaaagtggt cgtgcagggc gttcacgctg taatctcgcc tgacgagttc 1320 atggctgagc tatacgagct gaacctcaag gacaagatgt ccaaagaggc ctttactaaa 1380 ggtgtccgca ttgcgagtaa gccttggtcg caagagggta gcgcagcagt gaatgttgtt 1440 ctcgagggtg ctggcctggc catgcaaacc ctcttggatg ccggacgctg ctacgtgaaa 1500 tggttctcat ttcgtgtgag gaactttgac ccggtagtgg gatgctatcg ttgccttggc 1560 ttcgaccata aggtagcgga atgccggctt aagcaggacg tttgtcgtcg ctgcggccaa 1620 atgggtcacc gcgtagctca gtgcagcaat gcattgaact gtcgcaattg cgcattcaag 1680 ggtaggccat ccgagcatct tatgatgtct ccggcttgcc ctgtgtacag tgcaattgtg 1740 gcgcgggcga gtgctagaca ttaatatgtc tagcagactt cttcagttga actgtcagaa 1800 gtcatatgca gtcatgtgtg atttgggtga tatgattgtg agaagaggca gcgtagttgc 1860 cctgttgcag gaaccctatg tggcaaatgg ttgcgtacgg ggactgcccg cgggtatgcg 1920 agtatttcct gacagcaggg ccaactctgc tgtcgttgtg aatgatgtca gtatcgaatg 1980 cactgttgtg aattcgactg actggggtgt gtgtgtgagc cttagtggca attttggtag 2040 attgtacgta gcaagcgtat actgcaagtt cggggatccc ctcgaaccgt atatcgcgta 2100 tatggatgag gtgctactac tggctggtag cgttccgttc atccttggta ttgacgcgaa 2160 tgcaccgtcc cccttgtggt tcagtaagat atctagatct gctaggtatc tgaaccgctc 2220 taggggtgag gtgctggccg agtgggctgt gtcccaggat gtccgggtcg ttaacgaacc 2280 cagcgagtgg tacacgtttg cgggcccgat gggccagagt gacattgatg tcactcttgc 2340 gaatgtggca gcaacgagtg tgtttggttt tcaatggagt gtactgggtg gacacggtgt 2400 gagtgaccac aatccgattg agattgtcat cactcacact tccaccacgc gtgaaagtga 2460 tgggggtaac cgctggcgca cttgtggtgc gaattggccc cttcatggga tttttgtgag 2520 tgaagcggca acgcaagttc cgcttagcac ttttagtgca atgaatgttg acgagcaggt 2580 cgtatgtgtg aataggtggg tgacttgtgc gaatgatcgc ttgtttgaga ggcaccgaaa 2640 ggtcaacctc aaacgagtga agtggtggtc gcatgagttg agcgttaagc gtcggtcagt 2700 gcggtcccta aggaagcgat tccaaagggc cagatctgcc aatgccgaga acgctggcca 2760 acttaggtta gcttacagcc agtgtatgaa tgagtacaag caaatgcttg taagagtgaa 2820 agaggatgaa tggcgttcct ttttggagcg caataaggac gacccctggg gtcgtgctta 2880 taaagttgtg agaggtaggc gtagggaagc ggatgtaagt ggcctccgtg tcggtgacgt 2940 tcagttaacg gcatggagtg actgcatgaa tgtcttattg aatgagttct tccctagagc 3000 ggatcatcag aacttgccac cgagtgtggt tggtgatgtt gacccgctcc tggatagtga 3060 attggaggtg gctttctcga tgttaaagtc gaggaaatca cctggtatgg atggttttac 3120 tggggaaatg tgtaagagtg tctggaagtc aattccagac tatatgaatg taatgtatgg 3180 aaagtgtatg aatgagggat atttccccaa tgagtggaag tgtgcaaggg tgattgtgct 3240 tttgaagtcg cccgataggg tcaggagcaa tcctcgttcc tttcggggca tcagtctcct 3300 accagtgctg ggtaaagtgc tggaaagagt catggtagaa aggctccaag agagagtgag 3360 tagccaaatg tcagatcggc aatttggttt taggaagggc agatgtgtgg aagatgcgtg 3420 gagatttgta agtgactccg ttgagtccag caactccagg catattctag gcatctttgt 3480 tgattttaaa ggggcgtttg accacctgag ttggccgagt gtgttggaga ggttgagcga 3540 atgtggctgc cgggaattgg ctatttggga gagctatttc tctggcagac gtgcgtgtgc 3600 tgtaggtcgg catgaaagtg ttagcctgaa tgtggttcgt ggctgcccac agggatccat 3660 ctgtggtcca tttatatgga acctcatgat ggataccttg ctatggcagc tcgagcgtgt 3720 atgcaagtgc tgtgcgtatg cggacgacct gctcattctg gttgaggggc aatcgcgagc 3780 ggatgtagag gcaagtgcgg caacgtactt gcgagttgtg tatgagtggg gtctcagagt 3840 tggcgtcagt ctggcaatgg acaagaccgt gacaatgctg cttaagggca gattgtcgcg 3900 tagtcgacct ccattggtta ggctaaatgg cgtcagcctg aggcatgtgt cggaggtgaa 3960 atacctcggc attgtgtttg gcgagaggat gtgcttcact cctcatatcg catatgtgaa 4020 agggcgattg cttagtttgg ttggacaagt gcgtcggatt ttgagaagtg actggggcct 4080 cagcagatct gctgctcgca ccgtatatga tggtctattt gttgcctgtg caacttatgg 4140 atcgtcggta tggtgtaagg cagtcttgac tgttgtaggc agaaagaatg tgctggcttg 4200 ccagcgtgtg atgttgttag ggtgtctgcc tgtatgccgc actgtctcca cggaggcaat 4260 gcaggtattg ttaggagtag cccctctgga cttggagatc aggcgtcgaa gcctgagcta 4320 taggatcaag aggcggttgc cgttgctgca gaatgaatgg ttagcggata gggatgtgga 4380 gagtttaggg cttagtgagt gcaagaaatt gctaaatgag tgtgttctgt ctgactggca 4440 ggtcagatgg gatactagcg agaatgggag ggtcactcat aggtttattc gggaggtcac 4500 atttgctgtt gaccgtccag acttcaggct tcacctgagt tttggatttt tgttgacggg 4560 ccacgggtcg ctgaatgcat tcttgcattc aagacgactc tgtgatagcc cggaatgccc 4620 ttgtggctgg gtgggagaga catgggaaca tgtcctctgt gagtgccctt tgtatgcaga 4680 tctgcgagat ctaagtgtgc ttgggataac gcggggtatt agtgggtatg acgtaagtca 4740 agtgctctcc acttgtgaag gggtaaggag aatgagtgag tttgcacggg ctgcatttgc 4800 cagacgacgt ctcatacgtg gagaagttgg gtgaatgttg aatgtctgtt gggggtatga 4860 atgtgggggt gtgtgaatgg attgctgagt tgcgttcggg ggtcaccagt cccggcttta 4920 tggagcaaaa ctggaagtat ccttgtggta cgagttctga ccggaggact gatccggtac 4980 cacgggcgtt ggggtgttca ggggcggtct cgaccctcgg ctctcagcgc ttttgttggg 5040 gatatcagct ggccgcctag cggtctgaat tcgttcagtt atgaatgccg ctggttgcgg 5100 aagaggaagg aataggcctc gctccccaca gtgagaaaac catgtccata aagcatggcg 5160 tatactgttc catctgagat ttggtgttta catcaattcg cattctattc cgtacagtga 5220 tatctttcct tctttttggg tcaccaaccc gtaactgttt tggagttaaa attgggagta 5280 ccctacgggt acgaggcctg accggaggct tgctttccgg taccacgggt aatcaggagc 5340 ccgcggaaca tagtttccgt cctggttttt ggttgcggcc cttcggggag tttcgtggtg 5400 gctgtggttt gacacccaaa tgcgggtaga gctattgact cggcgtgttg ttgcgctata 5460 caacagggtg ccgtgaccca tagagcggaa gtcgttttag ataggcggcc ctccaaacca 5520 aggtggaagt tcacgaccaa acagtagtga cttcaaattg gtacctgcgg aatattaatt 5580 ccaatggggc ggtaattgac gcttgaatta attccgtgct tggcaccgtg agattaagcc 5640 atcccggcag gtgctcacgt taaaccaatt gactttaa 5678 // ID hAT-21_HM repbase; DNA; INV; 3338 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-21_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3338 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2010-2010 (2008). XX DR [1] (Consensus) XX CC Average identity to consensus >98%. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 498..3152 FT /product="hAT-21_HM_1p" FT /translation="MEQKRKKLSGSQYKKKRLKNEEQERKLSNVMKKFLVQ FT KCGLDEPSTSAATDLANNNTVPPQSLPEEVVEESQILTPSDLHEQPGSTST FT STATALDLANKNTVPPQSIPEKIVEEIQKITSSEVHQEPGRISSGSQQVLK FT EIPSDPALWLTFCMSSQTRQLLVERGPHQIKEFEFPINKGKRRFLPSYYSK FT VLSNGEVVERSWLIYSIASDAVFCFCCILFDNSSDISDWPKKGYSDWKNLI FT RALTMHEKSVNHRNAFRAWKELDIRLKQKKTIDAEYQRIMDMELQHWRGVI FT KRIMSIIKLLASQCLAFRGSTEHLFQPNNGNFLKLVELLSEFDPVMEEHIR FT RVQRESDKWSVTYLSNNIQDELINLMGNSILRKIVEITKIAKYFSIIADCT FT PDVSHYEQLSLTIRVVTFNSIQNKYEIAEFFIGFFEASDSTGEGLAKLILT FT HLEKLGFELKWLRGQGYDNGANMKGVRKGVQNRILEKYPRAFYVPCACHSL FT NLVVNDAASSTTETTFFFSIVQELYTFFSGSTKRWEVLKKYVSQLTLKPLS FT ATRWASRIDALKPLRFQLCEIYDALILIIEDVNRDAETKVKARGLAKNIKN FT YKFICGVILWHDILFEINSVSKLLQSVTINISDCVRMLSETIKKVKSYRQS FT GYIQMKIAAKEIAENLECSTEFPDDTEVRPRRKKRQFDYEKAVDEPLTEEK FT KFKINFFNYILDITLNSLNERFTLLETHSKKFQFLYDILKLKDIDDKTLEN FT YCSSLEFILSVENETDINANDLREELRDVSRMLPYSTKPLDVLNYLCQNSL FT ISLYPNTVVALRILLTLPVSVASGERSFSKLKLIKNYLRSSIGQTKLKNLA FT LISIESAMASTLDYTSVINEFAKVKVRRVKL*" XX SQ Sequence 3338 BP; 1187 A; 531 C; 597 G; 1023 T; 0 other; cagggccgtg cggtgctatg atacaaagag gcggccgcct caggcggcac cattaagggg 60 cggcaaagta gcgaaatcaa aatgcaattt cagggttgca ataataatta ggtactaaat 120 cataaactgg actcatttaa attttaatta ctaccaaact cgtgaatgtt tttatcactt 180 taaaagtttt ttcattgtcc acatttttct ataaaaataa aaaaaaaatt tctccctcat 240 ttagcattcg cttttgctat tcctttcaat atgaataaaa acgttttgaa acgcatatta 300 gcgtggacta ttttcttatt atgtttcaaa atactgtttt caaacgtttt taaacgtttg 360 tgtttgcctg tctttgcaag ttaaatttta actttgtaat gtcatcatgt ttttaactgt 420 taaaaacatg atgacattac aaagatttta caaaattttt acaaaaattt attacagcag 480 tgacagcaat cgtcagcatg gaacaaaaaa gaaaaaaatt atctggaagt caatacaaaa 540 agaaacgctt aaagaatgag gaacaagaaa gaaagctcag caatgttatg aagaaattct 600 tagtacaaaa gtgcggttta gatgagccat ccacatcagc agcaactgat ttggcaaata 660 acaacacagt gccgccccaa tcacttccag aagaagttgt tgaagagagc caaatactaa 720 caccatcaga tcttcatgaa cagccgggca gcacatccac gtcaactgcg actgccttgg 780 atttggcaaa taaaaacaca gtgccgcccc aatcaattcc agaaaaaatt gttgaagaga 840 tccaaaaaat aacatcatca gaagttcatc aagaaccggg ccgcattagc agtggcagtc 900 agcaggtatt aaaagaaatt ccttcagatc ctgcattatg gcttactttc tgcatgtcat 960 ctcaaactcg tcaactactg gttgagaggg gtccccatca aattaaagaa ttcgagtttc 1020 ctattaataa aggaaagcgt agatttttgc cctcgtatta ttcgaaagtg ttgagcaatg 1080 gtgaagtcgt tgagagaagt tggcttattt actctattgc aagtgatgcc gttttttgct 1140 tttgttgtat attatttgat aattcgtctg atattagtga ctggcccaaa aaaggctatt 1200 ctgactggaa aaatcttata agagccctca caatgcacga aaagtctgta aatcacagaa 1260 atgcttttag agcatggaaa gaattggaca ttcgattgaa gcagaaaaaa actattgacg 1320 ctgaatatca acgaattatg gatatggaac ttcagcattg gagaggagtt ataaaaagaa 1380 ttatgtcaat aattaaacta ttggcctcac aatgtttagc ttttcgtgga tcaactgaac 1440 atttgtttca acctaataac gggaacttcc taaagttggt agaattactt tctgagtttg 1500 atccagttat ggaagaacat atcaggagag ttcaacgaga gtctgataaa tggtcagtga 1560 cttatctaag caataatata caagatgaat tgataaattt aatgggaaat tcaattttaa 1620 gaaaaattgt cgaaataaca aaaatcgcta aatatttttc tataatcgct gattgtaccc 1680 ccgacgtcag tcattatgaa cagctctcct taacaataag agttgttact tttaactcca 1740 ttcaaaataa atatgaaatt gcagaattct ttattggttt ttttgaagct agtgattcaa 1800 ccggagaagg tttggcaaaa cttattctga cacatttgga aaaattaggt tttgagttaa 1860 agtggcttag gggacaaggc tatgacaacg gagccaatat gaaaggagta agaaagggtg 1920 ttcaaaatag aatacttgaa aaatatccaa gggcttttta tgtgccctgc gcatgccatt 1980 ctttaaacct tgttgtgaat gatgcagcgt cttctactac agaaacaaca tttttttttt 2040 ctattgtaca agaactttac actttttttt ctggttctac aaagcgttgg gaagtgctaa 2100 aaaaatatgt ttctcaatta actctgaaac cattaagtgc tactaggtgg gcaagcagaa 2160 tagatgcttt aaaaccttta cgatttcaac tttgcgaaat ctatgatgca ttgattctga 2220 taatagaaga cgtcaataga gatgcagaaa ctaaagttaa agcgagaggg ttagcaaaaa 2280 atattaaaaa ttacaagttt atatgtggag taattctttg gcatgatatt ttgtttgaaa 2340 ttaattcagt ttcgaagctt ttgcaatctg ttactataaa tatatcagat tgtgtaagga 2400 tgctatcaga aaccataaaa aaagtgaaaa gttatagaca atctggatat attcagatga 2460 aaattgctgc aaaggagata gcagagaatc tcgaatgcag tacagaattt cctgatgaca 2520 ctgaagttag acccagaagg aaaaagcgtc aatttgacta tgaaaaagct gtcgatgaac 2580 ccttaacaga ggagaaaaaa tttaaaataa actttttcaa ctatattctg gacattaccc 2640 ttaattctct gaatgaacga ttcacgcttc tggaaaccca tagcaaaaaa tttcagttct 2700 tatatgacat tttaaaactc aaggatatag atgacaaaac gctagaaaat tattgttcca 2760 gtcttgagtt tatactttcg gttgaaaatg aaacagatat aaatgcaaat gaccttagag 2820 aagaattgcg tgatgtatcc agaatgctac catattctac gaaacctttg gatgttttaa 2880 attatttatg ccaaaatagc ttaattagtt tgtatccaaa tactgttgta gctctgagaa 2940 ttttattaac tctccctgta tctgtagcta gtggggaaag aagtttctcc aaattaaaat 3000 taataaaaaa ctacttaaga agttcaatag gacaaacaaa acttaaaaat ctggcattaa 3060 tttctattga atctgcaatg gctagcactc tagactacac atcagtaatt aacgaatttg 3120 ctaaagttaa agttagaaga gtaaagctgt aatactgtac attgttgtag tttatagttg 3180 aaatcattat tttaaaattt caaaataaaa agtagaattc gtacattttg tgcagttttt 3240 ttttaaatca caatcgaagg ttattttaaa gaaaaacatt taaggggcgg catttcgaag 3300 atttgcctca taatgagaaa atctaccgca cgggcctg 3338 // ID BEL-104_AA-LTR repbase; DNA; INV; 478 BP. XX AC supercont1.298; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-104_AA_; KW BEL-104_AA-I; BEL-104_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-478 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.298; Positions 1129155 1128678. XX SQ Sequence 478 BP; 134 A; 97 C; 112 G; 135 T; 0 other; tgttggagat ggtcattaat tgtatatagt tttttatcaa tgaatttatt atttccaaat 60 gacaccaaca atttcctcat ctcaacttga ccacttagtg agagtgagcg gtgaccgaac 120 acactaggta gccgatggtc actcagcaac gctgtaggca ggagaatata cagcaaattg 180 taaatagaga tcagtagcaa aatacacgtt gaactgcaca gtacctacgg tcggtcgttc 240 ccaattggtc ttatcgatga caaaatcgac taagtgttga gctaaaatcc caactttaac 300 tttcggcgcg tgtgtgattg gttgcgttct atagagaacg gcaaagtccc cccgtaaaag 360 ggacttgtga aaattttgtg gttggtggca cttgtactcc gtgttagtgt ttctcggtgc 420 tagtccagga ctagcgaagc aagtccttcg agactcggca aggcgagtat cgtcatca 478 // ID BEL-21_AA-I repbase; DNA; INV; 6304 BP. XX AC supercont1.128; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-21_AA_; KW BEL-21_AA-LTR; BEL-21_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6304 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.128; Positions 465268 471571. XX CC Positions [5330-5893] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 896..6304 FT /product="BEL-21_AA-I_1p" FT /translation="MVGKHTPSRTVIPREVHSLTIREPFQQYVTSSRPLYH FT PQSFSGIRPPYSCPPMFQRIALPSTERGNPADQTYQGQPPSWSTPVQYGSM FT PNPGLNSHPGISPAAEPALRYRESRPEHVASVNASNHSSLFNPSEVRIVPD FT AIPEIERPTTLSSQQIAARQVVGRDLPSFNGHPADWPMFVSSFEQSTASCG FT YSNAENLVRLQRSLTGHALEAVRSRLLLPANVPHVIETLRTLYGRPELLIR FT SLHEKIRRTPGPKHDRPETILEFGLAVQNFVDHLLAAQQEEHLSNPMLLQE FT LVEKLPGSMRMDWATFKCQRPRATVATFGEFMSNLVKAASEVSFQLPELFQ FT GLNDGKQQRVKERARIQTHLALVPPSIKPTNISARKISKPCPICDHEGHRV FT AECSTFKQFNVDERWKAVHDKGLCRTCLNNHGKWPCKSWQGCRNEGCQLKH FT HSLLHTPSVPPTTSHSVNVSMSQLSSDDHQTLFRILPVVLYGKDKCVTVFA FT FIDEGSQITMLEDKVAKELGVAGPRRPLTLQWTGNVKRSELGSQEVDMQIA FT GKNSDVRYDLLQARTVSCLLLPTQSMNYRALCARYPHLKGLPVEDQDRVQP FT KLLIGLDNLRLGIPLKLREGGIYDPIAAKCRLGWGIYGCTSKIPVTRVTVN FT FHTTAEPSPDDLLNVQLRDYFMLDNCGAVSHTKKLESEEDKRATRLLEETT FT RRTTVGFETGLLWKSDMREFPDTYPMALRRMKALENKLQSNPSMMQRVREQ FT ISEFERKGYIRKVSATEQSALDHRKTWFLPLGVVVNPKKPEKIRLIWDAAA FT KVGGVSFNSYLLKGPDLLTSLPRVLSGFRLFPVAISGDIREMFLQIRLQAS FT DRNAQMFMFRNSSRDPVQIYAVDVTMFGSTCSPSSAQFVKNLNAEQYSSEY FT PRAATAIKEHHYVDDYLDSFRTVEEAVQTLNDVKLVHARAGFEIRNFLSNK FT SEVLKRTGEIEPDSSKEFALVRAETTESVLGMKWIPAEDVFTYTFAMRSDL FT RPVLDDKHVPTKREVLKVVMSLFDPLGFVAFFLVHGKVLMQDVWAAGIDWD FT EKISEELYFRWRQWTYYFPQLDNLRIPRCYFQSPFPANLDRLELHVFVDAS FT DSAYACVAYYRLETANGVQVALISAKTKVAPLKVLSIPRLELKAAILGVRL FT LENIHGYHTYPISRRILWSDSSTVLAWIRAEHRKYNKFVAVRIGEILSATE FT IQEWRWVPSGMNTADSATKWKNIPDLSFDSPWFRGPVFLRQSEEHWPQQKN FT ILTTTEELRSTNVHFSTRPLIDASRFSKWAKLLHTMAFVQRFVQNLRRKRK FT GLTLQLGAFQQDDLKNAEEMLWKVAQAEAFSEEIAALVDSTGYPDARHRRV FT PKSSPIYKNNPFLDERGVLRMRGRIGAATFVPDEAKYPTILPRQHSITFLL FT TDWYHRRFRHANRETVTNEMRQRFEIAKLRALIQKVMKNCMMCRVMKAAPN FT PPVMAPLPAIRLQPLVRPFTYVGLDYFGPILVKVGRRQAKRWVALFTCLTI FT RAVHLEVVHSLSTESCIMAVRRFIARRGAPAEFYTDNGTCFQGANKQMERE FT IMEARSNALAATFTSTGTQWRFIPPAAPHMGGAWERLVRSVKAALASVAAA FT SRTPDDEVLETVLLEAEMLINTRPLTYIPLESADQEALTPNHFILGSSNGD FT KIAPFAPVDNPSVLRSSWRLAQSISQDFWARWLKEYLPVITRQAKWFEEKP FT DIAIGDLVMVVNGSARNQWVRGLVEAVIPGRDGRVRQAFIRTASGVLRRPA FT VKLAVLDVRGNGEPQPGVLKNRKLNQVSREGG" XX SQ Sequence 6304 BP; 1748 A; 1470 C; 1591 G; 1495 T; 0 other; ttcttttaga attttgtacg tggataatcg aaatgatagg acatcacgat caatcgatgt 60 atcagtgcaa gacgtgccac ggggcggatt cggtggacgc tcacatgttc acctgtgacc 120 agtgtcgcca atgggagcac tttcagtgcg cagggctcaa cgaagcggtt caaaatcgcc 180 cattcatctg taagctgtgc agaatggcaa ccactggcac ccagcgagtg cttcggtcaa 240 gaactaaagg tattctcctg actaacccat ccgtggcaga cgccacttct aataagagcg 300 ggaaggtatc gtccgtacac agttactctt cgcggtcgtc agtcgtaaga gcgcggttgg 360 agctagccga ggaggaagct agaatgagac agaaggagct tgaggaggag gaagaactca 420 agaaacttga gcttgaagaa gagaagaagc agttggaaga gaggaagaga ttattagaag 480 aagaagccca tctgcggaaa cgagcattgg aagcaaatcg ggagcgtctg gccaagcagc 540 aatcgatccg ccgagaatca ttagagaaaa ggaacgaaat tctcttgcaa atatccgaac 600 gcggaagcgt gttggagtca acaactagtt ccgttgagaa ggtttccaag tggttaacag 660 ctcatcaacc ggaagggacg tccgaaggga atgatacggg ggtacaaaga ggccctaatc 720 ctttggccca tgtcggagta cagagagctc cgttgaatac cgctgggaaa cgttaccaac 780 agagatagtt tcaccaaaaa cgtacgatag caccccgcaa cccacgcgtc cagaaacaga 840 tgatgtgcgg ccttcagaag cgcttcgatc gggaaatcat gttccgacag cacccatggt 900 tggaaagcac accccttcga gaacagtcat tcctcgggaa gttcactcac tgactattag 960 ggagcctttc caacagtacg tcacatcatc acgccccttg tatcatccac aatcgttctc 1020 tggtattaga ccaccgtatt cctgtccacc gatgtttcaa cgcattgctt tgccatctac 1080 cgagcgaggt aatcctgccg accaaacgta tcaagggcaa ccaccatcgt ggagtacacc 1140 agtccaatat ggttcaatgc cgaatcctgg attaaactca catcctggca ttagtccggc 1200 agcagagcca gctctccgat accgagagtc tcgtccagaa catgtcgctt cagtgaatgc 1260 atccaaccat agttcactct tcaatccttc cgaagtgcgt atagttcctg atgcaatacc 1320 ggagattgag aggcctacca ctcttagctc acaacaaata gctgctcggc aagtcgtcgg 1380 cagagatctg ccttcgttca atggacaccc tgctgactgg ccaatgtttg tatccagctt 1440 tgaacaatca actgcttcct gtggatattc aaacgctgaa aatctggtgc gccttcaacg 1500 aagcctcaca ggtcatgctc ttgaagcagt gcgaagcagg cttttgctcc ctgccaatgt 1560 accacacgtc attgaaacgc tacgaaccct ctatggtcgc ccagagctat taattcgttc 1620 gcttcatgaa aagataagga gaacccccgg tcccaaacac gataggccgg aaacgattct 1680 cgaattcggc ttggcagttc aaaatttcgt tgatcacctg ctagcggcac aacaagagga 1740 acatctttca aacccgatgt tattgcaaga actggtagag aagttgccag ggtccatgag 1800 gatggactgg gctacgttta aatgccaacg accaagggcc accgtcgcaa catttggaga 1860 gttcatgtca aacttagtga aggcggccag tgaggtcagt ttccagctcc cagagttatt 1920 tcagggctta aacgacggaa agcaacagcg tgtaaaggaa agagccagaa tacagactca 1980 tttagccttg gtacctccat ctattaaacc aaccaacatt agtgcacgta aaatttcgaa 2040 gccgtgcccg atttgtgatc acgagggaca cagagttgca gaatgttcga cattcaagca 2100 gttcaatgtg gatgagcggt ggaaggcggt gcacgacaaa ggcttgtgcc gaacctgctt 2160 gaataatcac ggaaagtggc cttgcaagtc gtggcaagga tgtaggaacg aaggatgcca 2220 actgaaacat cattcacttc ttcatacacc ttcagttcca cccactacgt cgcattctgt 2280 gaacgtatcg atgagtcagc tatcttctga tgatcatcaa acattgtttc gaattttgcc 2340 ggtcgttctt tatggaaaag ataaatgtgt aactgtcttc gctttcatcg acgaaggatc 2400 gcaaatcacg atgttggaag acaaagtagc aaaggagctt ggcgttgctg gtccaagaag 2460 gccactcacc cttcaatgga cgggtaatgt gaagcgtagt gaattaggat ctcaggaggt 2520 ggacatgcaa atagcaggga agaatagtga cgtacgctat gatctacttc aggcgcgtac 2580 ggtaagctgc ttgctgttgc caacccagag catgaattat cgcgcactgt gtgcgcgtta 2640 cccacatctc aaaggcctgc cagtggaaga tcaggatcgt gttcaaccaa aactgttgat 2700 aggactagat aatctgcgcc taggaatacc actgaagttg cgtgaaggag gaatatacga 2760 tccaatcgct gcaaaatgtc gtttggggtg ggggatttac ggatgcactt ctaaaatccc 2820 tgtaacgaga gttacagtca atttccacac caccgcagag ccgagtcctg atgaccttct 2880 taatgtacag ttacgagatt atttcatgct ggataactgc ggagcagttt ctcataccaa 2940 gaagctagaa tcagaagaag ataagagggc aacacggttg cttgaggaga cgacacggcg 3000 aacaactgtt gggttcgaaa caggccttct atggaaatct gatatgcgag agtttccaga 3060 cacgtaccca atggctctgc gtcgaatgaa agcgttagag aacaagcttc aaagtaatcc 3120 atccatgatg cagcgagtac gggagcaaat cagtgagttc gagagaaagg gctacattcg 3180 caaagttagt gcaactgagc aatcggcatt agatcacagg aagacttggt ttcttccctt 3240 aggagtcgtc gtaaatccaa agaaacccga aaagataagg cttatttggg atgcagccgc 3300 gaaagttgga ggagtatctt tcaattcata tttactgaag ggcccggatt tgcttacatc 3360 tttgcctcgg gttttaagtg ggtttcgcct ttttccggtg gctatttccg gagatataag 3420 ggaaatgttt ttgcaaataa gactgcaagc gagtgatcgg aacgcgcaaa tgtttatgtt 3480 ccgcaacagt tctcgagatc cagtgcaaat ctacgccgtt gatgtaacaa tgttcggatc 3540 gacctgctct ccatcttcgg ctcagtttgt caaaaacttg aatgccgaac aatattcctc 3600 agaatacccg cgggcagcaa cagctattaa agaacaccat tacgtggatg attatttgga 3660 cagcttcagg acggtggaag aggcagtcca aaccttgaat gatgtcaagc ttgttcacgc 3720 cagggctgga ttcgaaatcc gaaactttct ttcgaacaag agcgaagttc ttaagcgaac 3780 tggggagatc gaaccagatt cgtccaaaga attcgccctc gtacgagcag agaccacgga 3840 gtcagtgttg ggaatgaaat ggatccccgc agaggacgtt ttcacttaca cgtttgccat 3900 gcgaagtgat ctgaggccag ttcttgacga taagcatgtc ccaacaaaac gagaagttct 3960 aaaggtagtc atgagtcttt tcgatccgct gggcttcgtg gcatttttct tggtgcatgg 4020 gaaggtgttg atgcaagatg tttgggctgc tggcatagac tgggacgaga aaatcagcga 4080 agaactctac ttccgttggc gtcaatggac atactatttt ccgcagttgg ataatctgcg 4140 tattccacgg tgctattttc agtctccatt tcccgcgaac ttagatcgac tcgagctcca 4200 tgtatttgtt gacgccagcg actctgcata cgcgtgcgtc gcttactatc gacttgaaac 4260 agcaaatggg gtgcaagtgg cgttgatcag cgctaaaacc aaggtggctc cgttgaaagt 4320 gttgtcaatc ccacgtcttg aactaaaagc tgctattcta ggagttcgtt tattggaaaa 4380 catccatggc tatcacacat atcctatcag tcgtaggatt ctctggagtg actccagcac 4440 tgtgctggct tggatacgag ccgagcaccg aaaatataat aaatttgtcg ccgtacgaat 4500 tggtgagatt ctttcggcta ctgaaattca agagtggaga tgggttccat ccggtatgaa 4560 tacagctgac tctgccacca agtggaaaaa cattccagat ctttcgtttg acagtccgtg 4620 gtttcgcgga cccgtttttc tacgtcagtc agaagaacat tggccgcaac aaaagaatat 4680 tcttactaca actgaagaac ttcgatccac caacgttcac ttttcgacgc gtccattgat 4740 cgatgcatcg cgtttcagta aatgggcaaa actattacat actatggcgt ttgtacaacg 4800 atttgtccaa aatcttcggc gtaagcgtaa aggcttgaca ttgcaattgg gtgctttcca 4860 gcaggacgat ctaaaaaatg ccgaagagat gctatggaaa gtggcacaag cggaagcgtt 4920 ttccgaagaa atcgccgctc tcgtagactc cacgggatat cccgatgcgc gtcacaggcg 4980 cgttccaaaa tccagtccaa tttataaaaa taatcccttt cttgatgaaa ggggagtact 5040 aaggatgcgt ggtaggatcg gagctgccac atttgtgccg gacgaagcta agtacccaac 5100 aattctacca cgacaacatt ccattacatt tcttcttaca gactggtacc atcgccgatt 5160 ccgacacgcc aaccgagaaa ccgtcacgaa tgaaatgcgc caacggtttg aaatcgccaa 5220 attacgagcc ttgatacaaa aggtaatgaa gaactgtatg atgtgtagag tgatgaaagc 5280 cgcccccaat cctcctgtga tggccccgct tcctgcaata cgactgcaac cattagttcg 5340 tccatttacg tacgtcgggc tggattattt cggacctatt ttggttaaag tcggccgaag 5400 acaggcaaaa agatgggtag ctctctttac ctgccttacg atccgtgctg tgcacttgga 5460 ggtcgtacac agcctgtcca ctgaatcatg tataatggca gtacgccgtt tcattgcgcg 5520 ccgtggtgcg cctgcggagt tctataccga taatgggacc tgcttccagg gagcaaataa 5580 gcaaatggag agagagatta tggaagctcg aagcaatgca ctagctgcta catttacgag 5640 cacaggaaca caatggcgtt tcatccctcc agccgctcct cacatggggg gtgcttggga 5700 gcgccttgtc cgatcagtca aggcagcatt ggcatcagtg gcggcagcat ctcgcacgcc 5760 agacgacgag gttctagaaa ccgtcctgct cgaagcggag atgctgatca acactcggcc 5820 ccttacgtac atccctctag aatcggctga tcaggaagcg ctgaccccaa atcattttat 5880 tttggggtca tctaatggtg ataaaatcgc gccgtttgca ccggtagata atccgtcggt 5940 gctaaggagc agttggaggt tagcacagtc aatatcgcag gatttctggg caagatggtt 6000 gaaggagtac ctaccagtaa taactcgtca ggcgaaatgg tttgaagaaa aaccggatat 6060 agcgataggc gatctggtga tggttgtgaa cggatcggcg aggaatcagt gggtgagagg 6120 acttgtggaa gctgtcatac ccggacggga tggaagagtt cgccaagcgt tcatccggac 6180 agcatcagga gttttgcggc gaccagcagt aaagctggca gtactcgacg tgcgagggaa 6240 tggtgaaccc caaccaggag ttctcaagaa ccggaagtta aaccaggttt cacgggaggg 6300 ggga 6304 // ID Mariner-2_PPc repbase; DNA; INV; 1260 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Mariner-type DNA transposon from the Pristionchus pacificus DE genome. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-2_PPc. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-1260 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 959-959 (2010). XX DR [1] (Consensus) XX CC >99% identical to consensus. XX FH Key Location/Qualifiers FT CDS 166..1179 FT /product="Mariner-2_PPc_1p" FT /translation="MNTARDYRSVLLHYFLLGRTATESHRKLVQTHGENAP FT SLHTCFNWFTRFKRGDYNLEHQPHPGRPSSRVRGRVLRELKANPKSSVRDI FT EKTIHIPKTTVARILHDAGKTPKLPQVIPHDLTTAQLKKRVDVCQGLLHRR FT SNFNWVSHIVAMDEKWITYDNPERKLQWVDVDEKPQQAPKAELHGKKELLC FT FFFSVLGPIYWEILPPGITIKADLFTTQLEEVAVRVPPKLLSEGKILMLMD FT NARPHHAKITQKKMDELEMEWLPHPPYSPDLSPCDYHCFRSLSNFCRGKKF FT KNRDALVKEFEAWINSKPQAFWKRGIETLPDRWRQVVTTKGAYIDY" XX SQ Sequence 1260 BP; 332 A; 342 C; 279 G; 307 T; 0 other; tatcagggtg tcccacatat ttttggaatt tgcttgccaa taattactgc atgatcgaca 60 atcattctga tgcaatacaa tattacgtca tcatgttgtg caattgttta cacttagccg 120 cttctcgatt gcatcagtct ttgactagtt tcatcagtta tagtcatgaa cactgctcga 180 gactaccgct ccgttttgct tcattatttt ctcctgggtc gaacggcgac ggaatcccac 240 cgtaagctgg tccagactca tggcgaaaat gccccctccc ttcacacctg cttcaattgg 300 ttcacccgat tcaagcgagg ggattacaat ctggaacacc aaccccatcc tggacgccct 360 tcttctcgtg ttcggggtcg tgttctcagg gaactgaagg ccaatcccaa atctagtgtg 420 cgcgatattg agaagaccat ccatattcct aagaccacag ttgctcgaat cctgcatgat 480 gctgggaaga cacctaagct cccccaagtg attccgcatg acctcacgac agctcagctg 540 aaaaagcggg ttgacgtttg tcagggatta ctccaccgta gatccaattt caattgggtc 600 agccacatcg tcgccatgga tgagaagtgg atcacctacg ataatcctga gaggaaactt 660 cagtgggtgg acgtcgatga gaaaccccag caggctccaa aggcggaact tcacggaaag 720 aaggaactcc tctgtttctt cttctccgtc ctcggtccca tctactggga gatcctccct 780 cccggcatca ccatcaaggc tgacctcttc actacccaac tggaagaggt ggcggtcagg 840 gtccctccca aactcctgag cgagggcaaa atcctcatgc tcatggacaa cgcgcgccct 900 catcatgcga aaatcaccca gaagaaaatg gacgagctgg agatggaatg gctccctcac 960 ccaccttact cccctgatct gtccccttgc gactaccact gtttccgctc cctctctaac 1020 ttctgtcgag gaaagaagtt caagaaccga gacgcacttg tgaaggagtt cgaggcatgg 1080 atcaactcta agccccaggc cttttggaag agagggatcg agaccctgcc cgatcgatgg 1140 aggcaggtcg tgactactaa gggagcatac attgattact gatcatttgt tgttgttgaa 1200 ggaataaaca aaaaaataaa aataaataaa aatgtcacaa atatgtggga caccccaata 1260 // ID BEL-8_DPu-LTR repbase; DNA; INV; 421 BP. XX AC scaffold_140; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_DPu_; KW BEL-8_DPu-LTR; BEL-8_DPu-I. XX NM BEL-8_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-421 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 664-664 (2010). XX DR Genome; scaffold_140; Positions 162741 163161. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 421 BP; 96 A; 120 C; 71 G; 134 T; 0 other; tgtgcaagac gtctaattcg tcccgtttgg caacgcatca ccgacaaaga agcttcccct 60 ccctatacgt ttcattgtct actttgcata accttttcta ttttttctat cacttcttct 120 cagtaaactg tgtcgctgag acgtgtgtct cttaaaatta aacttaaaac acttttccgt 180 ctagtcgccc ggcatccggc tattcacctt ctctcgactt gtccctttcg ttttgatacg 240 agccggctgg cacccgaccc tccatctgtc aagtcccttc ccagactaat atcgttaaat 300 cgtggcaaag tcaggaatac aaactcactt cgtgttccat tcaaacttcc gtgtttagtc 360 ccctctttgt cttagcttaa agtaaatatt cgatgtagcg acggcacgga ggcctgacac 420 a 421 // ID DNAX-4B_AP repbase; DNA; INV; 173 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-4B_AP. XX NM DNAX-4B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-173 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2056-2056 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD duplication unclear (it could be TA or TATA) CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 173 BP; 37 A; 55 C; 40 G; 41 T; 0 other; ctctgttcaa gttaagaaac gtagtcggcg gcgaccagtt ttcgtcgggc cgcagtcagg 60 caacacccgt ttgtcacccc cacccagtca attatctgta tacctgccgt gtagtattgc 120 cacgcccctg cgcacaagtc ggccgccgac tacgtttctt aacttgaaca gag 173 // ID Mariner-36_SM repbase; DNA; INV; 1577 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-36_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1577 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1885-1885 (2009). XX DR [1] (Consensus) XX CC It contains two significant ORFs. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 201..659 FT /product="Mariner-36_SM_1p" FT /translation="MNMKNTNSKRKRKDLSIEEKQKLLKLFDEDKGCSQRF FT LSSKYNLSLGCINNLMKSRDSILNSNLNTKQKRKIHLRSGYQIDAILYEWF FT QIQRTRNISVSGDILQQKALELANHLGNHDFKASNGWLNSFLSRHMISSKI FT KAFSDRQVKETSLF" XX SQ Sequence 1577 BP; 618 A; 199 C; 246 G; 514 T; 0 other; cagtcaaccc tgttcaatat gaacccagtt taatatgaac atttaagtca aaaattatta 60 gaaatcatat ggtttgcatg gtaaattatg ttttataata cgaacactgt ttattttgaa 120 caatgaacaa aaaatagtgt aaaataaatc aataatttat acaatttgtt gtttaaatat 180 ttcattttta actgtctttt atgaacatga aaaacacaaa ttcaaaacga aagagaaagg 240 atttaagtat cgaagagaaa cagaaattgt taaaattatt cgatgaagat aaaggttgtt 300 ctcaacgctt tctttcatca aaatacaatc tttcattagg ttgtataaat aatcttatga 360 aatccagaga ctccattttg aattcaaact tgaacaccaa acaaaaacga aaaatacatt 420 tacgtagtgg atatcaaatt gatgccatct tgtatgaatg gttccagatc cagagaacaa 480 gaaatatcag tgtcagtggt gatatattac agcaaaaagc attagaatta gcaaatcact 540 taggaaatca tgattttaaa gcatcaaatg gttggcttaa ttcttttctt tccaggcata 600 tgatatcatc gaaaattaaa gcctttagtg ataggcaggt caaagaaacc tcgttgtttt 660 aaaaatttcg atatagaaaa attacctatt tattatcgtt ggaatacaaa atcttggatg 720 acgttgacaa tttttcaaga ttggttaaaa gatttaaaca atataatgaa agcagaggat 780 cggaaaattt tattattgtt agataacgca cctattcacc caaaagactt tgaattgagt 840 aatatcgaat tattcttttt cccaccaaat acgtcatctc ttattcaacc tcttgaccaa 900 ggtattataa aaagttttaa agactattat aagaaatttt tgagcctgtc aataaatttt 960 aatttagata ataacgttac acaagagata tggtgcaaaa agattgattt atatcaagca 1020 gtagcttgga tttcgaaagc atggaatagt gtaaaatctg cgactatttc taattgtttt 1080 accaagtctt ttgaaaatgc gtatgttaaa gaagtgaatg ctgaatgcga tgattttcca 1140 tgttttgaaa ttaatccaaa atttgacgaa catctcttgt ttgatctagt agaaaaatat 1200 gaaaatgaaa gttctgaaaa tgaagaaaca aatgataatt ctgttgaaga agtaaaagaa 1260 gatacagtaa taacaggatt cgaagcctat gagtatgcaa gaaaacttga aaaatacttc 1320 caagctacta ttcctgataa gatgaacaaa atatgggatt tgattgatga aattcaagga 1380 gaaaaagcat ctagacaatt gaaaatcaca gactatatta tgcgtcaaag aaataaaaat 1440 tgtgataaat aggtgtaata aatacatttt ttgtgaaatc ttttatattg ttatgttaat 1500 aactgttcat attatttttt aatatgaaca tttgaaaaaa tcggaacgct agtgttcata 1560 ttgaacaggg ttgactg 1577 // ID hAT-12_HM repbase; DNA; INV; 4159 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4159 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2001-2001 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 891..3482 FT /product="hAT-12_HM_1p" FT /translation="MNYHHKSGSQKRKEKLSRLKNVNEKQTLLQSYGFSVS FT SKEKDLENEYHNVEANFNNNDNVVEQAADLINVTADFPDPDNIENKDDVEP FT GNLLVPINNISENYKSNITDNKITKTKLLDFDVGSLKNERPNASEIEEAVR FT RGPEKIPSQFPKDVNGHSFPVCILHTSMKNGEYVPRDWLVWSKVKQSIFCF FT PCRLFSKLPTASRSRLTTISGYCFQRKWKKLHDKIPEHQNSSNHKYCYIKW FT RLLEKSIDSNSTVDIMLLQTIKNQASQWKQVLRRILDVTLFLAERGLGFRG FT TSDLVGVAANGNFLGILELLSHYDSVLKDHLNKVMKSQKLKRRQQANYLSP FT EIQNEFIECCAKKVLDVILSEREAAKYYSILVDATPDSAHMEQTVFILRYV FT YLNEENSLYEVQERFLEFVDCNQKTGKAIAELICSVIKKHNIPMIDCRGQG FT YDNGSNMSGQYKGAQAEIIKENQLAIYSPCACHSLNLCGVHAAECCAEVIN FT FFGVVQKTYNLFSSSPQRWQILKENIGCSLHSMSDTRWSARIESVKPFAEH FT IPGLKSAIQDLKKLNLTAETRSDIKRIEKYLGSFECIILASTWFKVLTSIN FT YRNTVLQARDATLDVEVLNLKSLIDDLLLLRNNWDSILNECKLVAENLGMV FT SNDIFPEKKRSRKARFSDEEQNNKLGNTSAEFCFKRDVFYVLLDCVIGNMN FT RRFEAAKDLEETFGVVWKYMTLDKDLLREKASSICVKYSIDVSLDLIDELE FT HLKAIHASNLGIIQLSPFQLLNKLHELKLDSLFPNILVVLRIFCTLPVTVA FT QAERSFSTLARVKNVLRSTMCQDRLSNLGRLAIEAPLARKLDFDAVIELFA FT SKKSRKAYFG*" XX SQ Sequence 4159 BP; 1447 A; 643 C; 746 G; 1323 T; 0 other; catggccgta gagagggggg gtcagggggt atattgtcct ggggcccggc tctgttcaag 60 ggcccggact tgtgaattat aaagtattat atttgccttt ttaaatatac tgcatttgga 120 ttggttgttc taattacagt ttttaacatt agtatcaccg taaacgttaa atttaaatgc 180 aaaacataat aaatatttta taaattttat aagctaagtt attcggaaat aaatcaaaag 240 gaaaatatct caagaaaaaa attatgtagt aaaaaatata ctttgaacac attatttgga 300 agcatttata gtttgtataa ttcgagtact ttagatcatt tgcagaacat taaatcactt 360 gcggtatttt aaaataaaag acaaagaatg ccagaaaatt gtctatttga ttctacagtt 420 tagtttattt ttatgttaaa gtttaatcag ttgtatatgg agttttgtta taaggtataa 480 aacattcaat tataaaaatt gctttattta ataaattata tgacgcttaa gatcagtgac 540 gtagcgaata tttttgcgac cggggcaaaa atgaaataaa tgcgcttttt tacaataaaa 600 ttaagaaatt attacttttt atgttaaaat tggtttcatt ttttttttgc tcctcaattt 660 tttgggcctc ttaaagcctt cgcgacaggg gccatttccg ctttggtctc ccctcactac 720 gctaatgttt aggagtgttt gttgcacata ttgctttata atacatacca tcgatatatc 780 aatgacatat ctttaatatg gtatattata tcatatagag aatgttttat ttaccggaat 840 gataattaat gccaatttat ataaaatctt atttagtagt caaataaaaa atgaactacc 900 atcacaaatc tggatctcaa aaaaggaaag aaaaattgtc gagattgaaa aatgtaaatg 960 aaaaacaaac tctacttcaa agttatggat tttcagtgtc ttcgaaagag aaagatttag 1020 aaaatgaata tcataatgta gaagctaatt ttaacaataa cgacaacgtt gttgaacaag 1080 ctgcagattt aattaatgtg actgccgact ttccagatcc tgataacata gaaaataaag 1140 atgacgtgga gcccggcaat cttttagttc ctatcaacaa tatctctgag aattataaat 1200 caaatataac tgataataaa attactaaaa caaaactgct tgactttgat gttggtagtc 1260 taaaaaatga aagaccaaat gcttctgaaa ttgaggaagc agttcgtagg ggtccagaaa 1320 agatcccatc tcaatttcca aaagatgtga acggacattc atttcctgtt tgtatactac 1380 atacaagtat gaaaaatgga gaatatgttc cccgtgattg gcttgtttgg agcaaagtta 1440 aacaatcgat attttgtttt ccttgtcgcc tttttagcaa actaccaaca gcaagtcgct 1500 cacgtttaac aactatatct gggtattgtt ttcaaagaaa atggaaaaaa ttacatgaca 1560 aaattccaga gcatcaaaac agcagcaacc ataaatattg ttatattaaa tggcggctat 1620 tggaaaaaag tattgactct aattccactg tcgatattat gctgctgcag accattaaga 1680 atcaagcatc acagtggaaa caggttcttc gccgaatttt agatgtcaca ttgtttctag 1740 ccgaacgagg tttgggattt agaggaacaa gtgatttagt tggagttgca gcaaatggca 1800 attttttggg cattttagag ctcttgagcc attatgattc tgtcttgaaa gaccatctga 1860 acaaagtgat gaagtcgcag aaattgaaga gaagacaaca agcaaattac ctttcaccgg 1920 aaatacaaaa tgagtttata gagtgttgtg ccaaaaaagt attagacgtt attctaagtg 1980 aacgagaagc agcaaaatat tactcaatac ttgttgatgc aacccctgat tcagcacata 2040 tggaacaaac tgtatttata ttgcgttatg tttatttaaa tgaagaaaat agtctctacg 2100 aagttcaaga gcgatttcta gagtttgttg actgcaatca gaaaacgggg aaagctattg 2160 ctgaattaat ttgcagtgtt attaaaaaac acaatatacc aatgatagac tgtcgtggtc 2220 aaggttatga caacggtagc aacatgagcg gtcaatataa aggagcacaa gcagagataa 2280 ttaaagagaa ccaattggca atttattctc catgtgcttg tcatagctta aatctgtgtg 2340 gtgtccatgc cgcagagtgc tgtgcagagg ttataaactt cttcggtgtt gtgcaaaaga 2400 cttataactt gttcagctct agcccgcaaa gatggcaaat tttaaaagaa aacattggat 2460 gctctttgca tagcatgtca gatacacgtt ggtcagccag gattgaaagt gtaaaacctt 2520 ttgcagaaca tattcctggt ttaaagagtg ctatccaaga cttaaagaag ctaaatctta 2580 ctgccgagac tcgatcagat atcaaacgta ttgagaaata cttgggttca tttgagtgca 2640 ttatactcgc aagcacttgg tttaaagttc taacaagcat caattacaga aacacagttc 2700 tgcaagccag agatgcaact ttagatgtgg aggttttaaa tttaaaaagc ttaattgatg 2760 acctcttatt attgagaaac aactgggatt caattttaaa tgaatgtaaa cttgttgccg 2820 aaaatctggg tatggtttcc aatgacatat tcccggagaa aaaacgcagc cgaaaagcta 2880 gattttcaga tgaggaacag aataataaat taggcaacac cagtgcagaa ttttgtttca 2940 agcgcgatgt tttttatgtt cttttagact gtgtcattgg taatatgaat cggcgttttg 3000 aggcagccaa agatcttgag gaaacatttg gtgttgtatg gaaatacatg acattagata 3060 aagatctgct gcgtgagaaa gcttcatcta tttgcgttaa atacagcatt gatgtttctc 3120 tcgatttaat agacgagtta gaacatttaa aggcaattca cgcatcaaat cttggcataa 3180 tacaattatc gccgtttcaa ttattaaaca aactccatga gttgaagctt gattctcttt 3240 tcccaaacat tttagttgtt cttagaattt tttgtacact tccagttacc gttgcacaag 3300 cagaacgctc tttcagcacg cttgcaagag ttaaaaatgt tttacgttct acgatgtgtc 3360 aagatcgact ttctaatttg ggaagactag cgattgaggc gccacttgca aggaaactag 3420 actttgatgc agttatcgaa ctttttgcaa gcaaaaaatc tcgcaaagct tattttggtt 3480 aattgcagta caattaaata caattattta ttttgaacac ttagaattca aaatattctt 3540 ttttttataa agaaaaccaa agaatatact tgtttgtctg taaaactctc ttagagaaat 3600 tagagtcttt attaatactt tttattttaa tttacattct cttctttttt atattatatc 3660 aagttaatta cttatagtaa aaatattaaa gaaaaggcta cttaattaaa aagcttttta 3720 ttacaaaatt ttttgctaaa taaacttagt tactccgcgc agcggccttg cttgccaaag 3780 ttcgtgtttc tgagttaaag agttgagaga gggttgtacc acaattaaca acaaaaaata 3840 aaaaataaaa acacaaaaaa atgagtagcc tcctcgacta tagtgacccc tcggccttgg 3900 gaaggtgaat aatgtaaaaa aaaaaaaaat ctaacaaatg caacttgtta aaaaaatgaa 3960 aggattcgtt gaccaaccta actaattaaa gacacatcta gtcctagact ttaagattga 4020 ttttgattta agttagacta aagacacgtc atagacttta agtttgatgt tttcaatatg 4080 gatatgaata ttaatgtaaa ataaaggggg cccgccaaaa aatttacccc ggggcccggc 4140 ttggctctct acggccctg 4159 // ID MuDR3x_SM repbase; DNA; INV; 2211 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; KW Autonomous DNA transposon; MuDR3x_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2211 RA Jurka J.; RT "MuDR-type elements from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1901-1901 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 456..1883 FT /product="MuDR3x_SM_1p" FT /translation="MATHMDSINFGVSIKGRRTVMFKNFEYIKARENKNGE FT VYWRCKMYKSYKCNAHLKTFEDRIVSTQDPEHTHQGNSATSLARLAVAEMK FT QKMGETSATPAAVVGVVSRHLENDVLQALPKRSTLARCLQRHRNAALKQND FT NGATLPPLPSDTTFIVPRRFSEMLLYDSGPEGDRVLIFGSSDLLDGLARAK FT LWIADGTFKVVPSIFFQLYTIHFELVQGNNPVGIYCLVTNKDRTTYDIIMH FT QLRILIPSANPERILVDFETAAMSAFQAAFPNAAISGCYFHLCQSVLRKAN FT EVGMKQAYESDNIVRVAVRCIPALAFVPVADVNDAFELLSDEISELHERMP FT ELLSYFEHTYIRGRRRAGRVQNYGPSLFSIQRWNHHEAAAEGIARTTNAVE FT GWHYGLQSLFQCHHPTLWTFLDGLSKDLQKQKASFLQGISGVNQLPRKKYR FT ELKERVGRAVNLYLAANVLTFLKSMAHLSHE" XX SQ Sequence 2211 BP; 673 A; 449 C; 454 G; 635 T; 0 other; gacttatgaa cacgcgacat ttgaacacgc gtgacatttg aacacacgtg agactatgat 60 cacagagaca tatgaacata caacctcatt atcgcaacga cagattaaca cagtcattat 120 gaacacgcga tattagaaaa cgaaatatag atattcagca tgaaaattat agacgaagta 180 tagagtgtac ataaattaag acaaaacaca acagaagaac acgaagtaaa aaatgtgctt 240 tgatgagtac ttgaaacgtc cttgtacagt tgaacacgcg gcaaactaac agtagactaa 300 acgtcctttg atgttccctg atgcataatt aagcactaac taggtgagac aaagagcaaa 360 tatataaacc gaactttcag cataaaaatt aaaaaaataa aacttgtctc attctattta 420 aatattagaa ttagttttaa ttatattaat cacccatggc aacacacatg gattcgataa 480 attttggagt ttctataaaa ggtcgacgaa cagtaatgtt caaaaatttt gagtatatta 540 aagcgcgtga gaataagaat ggagaagttt attggcgctg taaaatgtat aagagttata 600 aatgtaacgc tcatctgaaa acattcgagg atagaatcgt tagcacccaa gatccagaac 660 atacgcatca aggcaacagc gcaacctcac tcgctcgtct tgctgtcgct gagatgaaac 720 aaaagatggg cgaaacgtcc gccacgcctg cagctgttgt aggagttgtt tctcgtcacc 780 tcgaaaatga tgttctgcag gcattaccga aacgatcaac acttgctcgc tgtcttcaac 840 gccatagaaa cgctgctctc aaacagaacg ataacggagc tactttgcct cctctacctt 900 ctgatacaac tttcatcgtt ccacggcgct tttctgaaat gcttctgtac gattcaggac 960 cggaagggga ccgagtgtta atctttggaa gctctgatct gcttgacggg cttgcacgag 1020 caaaattgtg gatagcagat ggaacgttca aggttgttcc ctctatcttt ttccaactgt 1080 acaccatcca ttttgagctc gttcaaggca acaatcctgt tggaatttac tgcttagtga 1140 cgaacaagga tcggacgaca tacgatatca tcatgcacca actgcgaatt ctcattccgt 1200 cggccaatcc tgaacgtatt ctggtcgact tcgaaacggc tgcgatgtct gcttttcagg 1260 ctgcatttcc aaatgcagcc atctcaggat gctactttca tctctgtcag agcgttttga 1320 gaaaagcgaa tgaagttggg atgaagcagg cctatgaatc agacaacatc gttcgcgtag 1380 cagtcagatg tatacctgca ttggcttttg tcccggtggc tgacgtcaat gatgcttttg 1440 agcttctatc agacgaaata agcgaattac atgagcggat gcctgagctt ctttcatact 1500 tcgaacacac ttacatccgt ggacgaagac gagctggaag agttcaaaat tacggcccga 1560 gtttgttctc aatacaacgt tggaatcatc atgaagctgc agccgagggc atagccagaa 1620 ccacaaatgc agtagaggga tggcattacg gtctccagtc cttgttccaa tgccatcatc 1680 ctactctatg gactttcctg gatggcctct cgaaagattt acagaagcag aaagcgagtt 1740 ttcttcaggg aatatccggc gtcaatcaat tgccaaggaa gaaatatcgt gaattgaaag 1800 agcgagtagg acgtgcagtt aatctgtatt tagcagcgaa tgttctaact tttttgaagt 1860 cgatggcaca cctctctcat gaatagtttt tcttgcggat cgtttactaa atctactatt 1920 tatggaactg cagtcgataa ctatatttgt acgctatctg caattctata tacgtgtgac 1980 aatattttta aacggtaata aatttgttgt tatctgtaat tttaccttcc aggaacatat 2040 attaaattta atttatattt tgtgttctta tggtgcgtgt tcataatgac tgtgttaatc 2100 tgtcgttgcg ataatgaggt tgtatgttca tatgtctagt gttcatatgt ctctgtgatc 2160 atctcacgtg tgttcaaatg tccgtgttca aatgtcgcgt gttcataagt c 2211 // ID Jockey_Ele8 repbase; DNA; INV; 4364 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Jockey clade non-LTR retrotransposon family from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey_Ele8. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4364 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4364 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 22 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. The consensus is ~81% identical to Jockey_Ele7. XX FH Key Location/Qualifiers FT CDS 205..1455 FT /product="Jockey_Ele8_1p" FT /translation="MGRNRKQKADSASIPAPLADSGQTSAPKRARNEDANP FT AAYSRLLANNQFASLPVDQAPPGAKVPPLFTASKDLSALRSELAANNIRPL FT FKLCHTGTKIMCASGADYDKAGKLLKAKGVEFYTHDAPGSKPLKVLVRGLP FT EFTPEAIVDEMKAAGLKPTNVFPIRRAQGGRHRDQLYLAHLEKGSTTMAGL FT TRVRALFNIVVEWERYRPKKRDVTQCGNCLAFGHGTRNCHMKPRCGKCAGA FT HATTTCQPMEEGIEPKCANCGANHEGSSRNCPKRAEFLAIRQQASAKKLGR FT QRQRQPPPPLTEEHFPTPRYQVPNLPPLQPTHRQASRQPAPSVQHRLAAAA FT AAPPVQNAPPPGWGNPGRSAPGTPPSDDGSLYTPEQMLEYTRDLFQRLRAC FT RSKSEQINAANSVVFAFLAKYGP" FT CDS 1253..4117 FT /product="Jockey_Ele8_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRPLLDGGILDAVPPALLPPTTAPCTLRSKCWSTPGT FT CSNGCAPAVPSRSRSTPPTRWYSPFSPNMAREATTKILNWNACSLRSKNRE FT LSAFLDQEGIDIAVITETHLKPEVNIFIPDFRLVRLDRCGSEGGGVAVALR FT RNVNCTLLPSFQLQVIEAVGVRVETSIGPITIIAAYCPKQTNINDGTSAAL FT KQDIVKLTRRQGQFILAGDLNARHETWGNPRRNRNGLILQQDLEEGHYTIL FT SPDSPTRLSRSGAHATIDIFLTNMADNISQPVVHQDLSSDHYPVVAEVGSL FT VNRHRVTRRNYHRVDWGQFQRCVDANIQYEAPLASAEDIDRHLQSVEEAIS FT VAREQHVPASGQVSNTLFIDRVTKDLIRLRNTKRRQYQRSGLPALKSEVNR FT ISKIIKARMVDLRNDDFSNKIRSLPDCARPFWKMTKLLKSKPRPIPPLIPL FT DNTDSKDRLITPAEKAAEIGRHFVSSHNLGLDIPSPHEAAVSEHAANLHRS FT PNDFSEELEITADELLAYLKTSKNMKAPGFDNILNLELKQLSLQFYQHLAL FT IFNQCLRLSYFPSSWKSAKVIPIKKPGKDPSSPKSYRPISLLSGLSKLFEK FT AINRRLLSAADQNNILLEEQFGFRRGRSTVHQLTRVTNILRRNKSLAKSSA FT MALLDVEKAFDNVWHDGLVYKLHRYNLPTYLVKIIKNYLFNRTFRVSLNGV FT NSDPHNIPAGVPQGSILGPLLYNLFTSDMPQLPEGGSLSLFADDTSVVYSG FT RFTRALTSRLQRGLNVLSDYLNSWKICINAAKTQVILFPYSKSPRLVPAED FT CKITLGGSTVEWSDAADYLGLTLDSKLLFRQQVDKTVTKSNILLKALYPLI FT NRKSTLSLKNKLAVYKQVILPVIEYGVPIWESCAKTHHLRLQRTQNKFLRM FT ILNSPPRTRTTEVHRLAEIKTLEERFGDSIGRFRARCHQSGQQVIRDLVPP FT " XX SQ Sequence 4364 BP; 1096 A; 1347 C; 1073 G; 847 T; 1 other; catcattccg aatcgaactg ccgttgtgaa cggttgcgtg tctttttgag ttgcgagcac 60 aagttcgaat tctcccccgt tcgtgtgcga agtgctcttt tcgcgatttc ccgctcccgc 120 gactgctctc cggcgctaac ctacctcctt ggggtggcta gccaaaaggc atctctgcaa 180 gcagagcaac cgcggcccat agcgatgggt cgcaacagga agcaaaaggc ggactccgcc 240 tcgatccccg ccccgctggc cgacagtggg cagaccagcg cgccgaaacg cgcaagaaat 300 gaggatgcca acccggcggc gtacagcagg ttgctggcaa acaaccagtt tgcctccctg 360 cctgtggacc aagccccccc gggcgcgaag gttccccccc tcttcacggc gtcgaaggac 420 ctctcggcgc tgcgatccga gctagcggcc aacaacatcc ggccgctgtt caagctgtgc 480 cataccggca ctaagatcat gtgcgcctcc ggcgccgatt acgacaaggc cggcaagctg 540 ctgaaggcaa aaggggtgga attctacacc cacgatgccc ccggcagcaa gccgttgaaa 600 gttctcgtcc gagggctgcc ggagttcacc ccggaggcaa tcgtggacga aatgaaggcg 660 gctggactca agccgacgaa tgtgttcccc atccggagag cacaaggagg acgacatcgg 720 gaccagctct acctggccca cttggagaag gggtccacca ccatggcggg actgacgagg 780 gtgagagcgc tcttcaacat cgtggttgaa tgggagcgct accgcccgaa gaaacgtgac 840 gtgacgcagt gtggcaactg cctcgcgttc ggccacggga cgaggaactg ccacatgaag 900 cctcgctgtg gcaaatgtgc cggcgcgcac gccacgacga catgccagcc aatggaggag 960 ggcatcgagc cgaagtgcgc caactgtggc gccaaccacg agggcagcag ccgcaactgc 1020 cccaaacgtg cggagtttct ggcaatccgc cagcaagcgt ccgccaagaa gctgggacgg 1080 caacgccagc gccaaccacc cccaccgctg actgaggagc acttcccgac gccccgctac 1140 caagtgccca atctgccacc gcttcaacca acccaccggc aggcatcccg ccagccggcc 1200 ccttccgttc agcatcgcct cgcggccgcc gccgccgccc caccggtgca gaatgcgccc 1260 cctcctggat gggggaatcc tggacgcagt gcccccggca ctcctccctc cgacgacggc 1320 tccctgtaca ctccggagca aatgttggag tacaccaggg acttgttcca acggctgcgc 1380 gcctgccgtt ccaagtcgga gcagatcaac gccgccaact cggtggtatt cgcctttctc 1440 gccaaatatg gcccgtgagg ccaccaccaa gatcctcaac tggaacgctt gctccctccg 1500 gagcaaaaac cgagagttgt ccgccttcct ggaccaggaa ggcatcgaca tagccgtgat 1560 cacggagacg cacctcaagc cggaagttaa catcttcatc cccgacttcc ggctcgtgcg 1620 gcttgaccgg tgcggatccg aaggtggagg cgttgcggtt gctctgcgga ggaacgtaaa 1680 ctgcaccctg ctgccgagct tccaactaca agtcatcgag gccgtaggtg ttcgggtgga 1740 aacctccatc ggcccaatca cgatcatcgc ggcgtactgc ccgaagcaaa ccaacatcaa 1800 cgacggaaca tcggcggccc taaagcaaga catcgtcaaa ctgacacggc ggcaggggca 1860 gttcatcctg gctggcgatc tgaacgcaag gcacgagacc tggggaaacc cccggcgaaa 1920 taggaacggc ctcatcctac aacaggacct ggaggaaggc cactacacca tcctgagccc 1980 ggattcaccc acccgcctga gccggtccgg ggcccatgcg accattgata tcttcctgac 2040 caacatggcc gacaacatct cccaaccggt cgttcaccag gacctgagct cggaccacta 2100 cccggtggtg gcagaagttg gttccctggt caaccggcat cgggtcaccc ggcgcaacta 2160 ccaccgtgtc gactggggcc aattccagcg gtgtgtcgac gctaacatcc agtacgaggc 2220 tcccctggcg tccgctgaag acatcgaccg gcacctgcag agcgtcgaag aggccatctc 2280 cgtggcccga gaacagcatg taccagcatc tggtcaggtg agcaacaccc tttttatcga 2340 tcgtgttacc aaagatctca ttcgtttaag gaacaccaaa cgcaggcagt accaacgctc 2400 tggtctgcct gcgttgaaaa gcgaagtcaa tcgcatatcc aaaataatca aggccagaat 2460 ggtggacctc aggaacgatg atttttccaa caagatccgc tctctcccag attgtgctag 2520 gccattctgg aagatgacca aacttttgaa atccaaaccc agacctattc caccgttgat 2580 cccattagac aacaccgact ctaaggatcg cttgataacc cctgcggaga aggctgctga 2640 gataggtcgg catttcgtca gctcccacaa tctagggcta gacattccta gcccacacga 2700 agctgctgtt tccgaacacg cagctaacct acaccgatct cccaacgact tctcggagga 2760 gttggagatc actgctgacg agttgctggc ctatctcaaa acatccaaaa acatgaaggc 2820 cccaggtttc gacaacatcc tgaacttgga gctcaagcaa ttgagtctcc agttctacca 2880 acatctggca ctgattttca atcagtgcct ccgacttagc tacttcccct cgtcgtggaa 2940 gtcagcaaaa gtcatcccca ttaagaaacc tgggaaggat ccttcctccc ccaaaagcta 3000 tcgacccatc agccttctct cagggttatc aaagcttttt gaaaaagcga tcaacagacg 3060 gctgctttcg gcagccgatc aaaacaacat cttgctcgag gaacagtttg gctttcgacg 3120 cggtcgttca accgtgcacc aactgactcg agtaaccaac atcctcaggc ggaacaagtc 3180 ccttgccaaa tcctccgcca tggcgttgct cgatgttgaa aaagcattcg acaacgtctg 3240 gcacgacggc ctggtgtaca agctgcaccg atacaatctt cccacctatt tggtgaaaat 3300 catcaaaaac tatctgttta acaggacgtt cagggtttcc ctcaatggag tcaactcaga 3360 ccctcacaac atccccgcag gtgttccaca gggcagtatt ttaggtcccc tactatacaa 3420 cctgttcacc tcggacatgc ctcagctccc tgaaggcggc tctctgtcac tgttcgctga 3480 cgacacatca gtcgtctaca gcggcagatt cacgagagca ctaacatctc gactccagag 3540 aggcctgaac gtcttgtcag attatttgaa cagctggaag atctgtatca acgcagcgaa 3600 gacccaggtc atcctcttcc cctattccaa atcccccaga cttgttccgg ctgaggattg 3660 taaaatcacc ctcggtggat caacggtgga atggtctgat gcagccgact accttgggct 3720 aacgttagac agcaagcttc tcttcagaca gcaggttgac aaaacggtca ctaaaagcaa 3780 catcttgctc aaagcattgt accccctgat caaccggaag tcaactctgt ctctgaagaa 3840 caagcttgct gtttacaaac aggtcattct tccagtaatt gaatatggcg ttccaatctg 3900 ggagagttgc gctaaaaccc accatctgag gctccagagg acccaaaata agttcctccg 3960 gatgatcctc aacagtcctc cgagaacgag gacaaccgag gtccaccgtc tggccgagat 4020 caaaacwtta gaggagcgct ttggtgactc gatcggaaga ttcagggctc gttgccacca 4080 atctggccag caggtcatcc gggacctcgt tcccccctag gttatcaaat ttttcttgta 4140 gtgtaaatag ttagttaggt tatcaaattc tctttttata aaaactacca gagcccataa 4200 ggccaagaaa aactagggta aaaccacaaa attaaaccag ccaaacaaaa accattacaa 4260 acaaagttga aggacccctt cggggtcaaa ctcttaacat gtaaaaatat tttgtaccaa 4320 aaatccaaat gaaaaataaa catgaattta atgaaatgaa atga 4364 // ID Copia-18_SI-I repbase; DNA; INV; 4166 BP. XX AC AEAQ01022549; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_SI_; KW Copia-18_SI-LTR; Copia-18_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4166 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022549; Positions 4723 558. XX CC Positions [1566-2075] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 501..2462 FT /product="Copia-18_SI-I_2p" FT /translation="MEVRQYGTFVAGLKVIIRRIEALESEDFGRNFNEKLL FT MAKILGCLPKEFDNFVTSWSLLSEDTSLESFLEKLTNAERSITERSDDTSH FT EAFKIRPKSTNSQSKKTMNKRFPGKCHKCGKIGHMKKDCWSKVEKQESSEE FT KKETKSQQNEVGLSASSVFKVNDDNKIIADSGASIHLTRNIEWFSSLRKLD FT TPLILSVANGKTLQATHIGNITIEKSIDGKKWIKRIWENVYYAEDMSSESL FT FSTTFMEMTKGYSFYHGNGLMQLKDGRETILGGKRIGNQYVPYIRVKPPSF FT HAKAAQSIGLWHQRLGHVSDKTIRAMCKNNLTDGLEVIFTKRDDCDSCHFG FT KQTINQHPTKEKRDCLPGERFHSDVCHIGIMSWNKCKYFLTMKDEASGYRR FT VFFLKTKDEVSNILKQFFIDAERETGRKAISLRTDNGTKYVNENVKEVLKL FT RNIIHELSPPNVKQCNGMAERENRTLCDTARSLLFNTDLSRTDRHLLWTEA FT SGTAAYLRNRIPNRGIMNTTPYYEWYGKKPNISHLRIFGAKAFVRVPDTMR FT RKMDPKSRKTVFVGYDRLTDKIYRVFDPTRKVVERVSDVIIQDSSDENNQV FT LFPLSSDKQVEDFEEPAIESSSSGNGEKEEKDDSEETIDNSYKEEVSSSDD FT YEEVR" FT CDS 2449..4149 FT /product="Copia-18_SI-I_1p" FT /translation="MKKLDDINEDLLKMRKRGRPVGSKNFQKPVPSTDREL FT RSGSNKLACIAAMKVSMDPTSYEDAISREDAHLWKKAMDEEMMSLTKNQVW FT KLEELPKDRPTVSCRWVFKSKLKADGTIERYKARLVARGFNQTKGVDYFET FT FSPVVRYESVRTVMAIAAKYDMEMTQFDIKTAFLNGPLKERIYMQQPEGYD FT DGTHRVCLLQKGIYGLKQASRNWNIFFNDFVMEYGLIQSDADPCIFTKDAN FT TDNWMILCLYVDDGLIVCKSNKLLKDFISSLRRKFEITCHEPSCYVGMDIK FT RDREMKTIYVNQQGYISRTLCRFGMQDCKPVTSPMDSSVKLMEQKNEEDII FT GKRFPYREAVGSLNYIALISRPDISYAVNTLARFSNNPSEVHWRAIKHVMK FT YLKGTIGFSLCYKGKSEDKLVGYCDSDYAGDLVSRRSTSGYVFMLHDAPIS FT WSSNLQRVTALSSTEAEYMSISEALKELLWLRSLLESFGLKQAGKTELRVD FT NQAAIAMSKNLEFHKRTKHIEIRFHRIRQEQEAGNVDVTYVPSNKQVADLL FT SKGLTWAKISEYLEILKMTS" XX SQ Sequence 4166 BP; 1494 A; 647 C; 927 G; 1098 T; 0 other; gattatgaga cttagtgcac gtcgagtcga taggtggtta tttcgaggtt aggagagata 60 acagacagaa agttgtgtat tgctaagaag atatcggttc tctagcgata agaatcacag 120 aaagaaacga tggctgatga aatttttagt cgagacattc gagtgacaaa actttcgaag 180 acaacgtaca gccgttggaa gattgagatt cgagatgctc tcgaaagtca tcggatatgg 240 gaaatcgcca atggtaatgt aaatcaacca caagaagtaa gtgaagcagg cgttgttaaa 300 aataaaaaag aaatagagga ttggaaagcc aaagacagta aagcgcgttc gattattcgg 360 tcaacgttag atgatacgac cttcgaccag gtatgtgatt gcgaatcttc agaaaatatt 420 ttgaagagaa ttaaagccgt ctatgagcca aaaacattaa atgttttgtt agaattgtta 480 cgtgaattct tcgtctactc atggaagtca gacagtacgg tacattcgta gccgggctta 540 aagtaatcat tcgtcgaatc gaagcgttag agtcagaaga ctttggtaga aattttaatg 600 aaaaattgtt aatggcaaaa attcttggat gtctgccaaa agaatttgat aattttgtca 660 cgagctggtc acttttgtca gaagatacat ctttggaatc ttttctagag aaacttacaa 720 atgcagagag aagcattact gaacgttcag atgatacgtc acatgaagca ttcaaaatcc 780 gaccaaaatc aacgaacagt caatcgaaga agacaatgaa caagaggttc ccaggtaaat 840 gccataaatg tggtaagatc ggtcacatga agaaggactg ctggtcaaaa gtggaaaaac 900 aggaatcgtc tgaagagaaa aaggagacaa agagtcaaca gaatgaagtt ggtctttcag 960 cgtcatcagt cttcaaggta aacgatgata ataaaattat tgcagactca ggagcaagta 1020 tacacttaac aagaaacata gaatggtttt cgtcgctgcg taaattggat acaccattaa 1080 tattaagtgt cgcaaatggt aaaactctac aagcaactca tataggtaat atcactattg 1140 aaaagtcgat tgatggtaag aaatggatta aacgaatatg ggaaaacgta tactatgcag 1200 aagatatgag cagtgaatcg ttattttcga caacatttat ggaaatgaca aaaggatata 1260 gtttttatca cggaaatggg ctcatgcaat taaaagatgg acgagaaaca attctaggag 1320 gaaaaaggat tggaaatcaa tacgttccct acattcgagt caaaccacca tcttttcatg 1380 caaaagcagc acagtcaata ggattatggc atcaacgact aggacacgta agtgataaaa 1440 cgatacgagc aatgtgcaaa aataatttga cagatggtct tgaggtaatt ttcacaaagc 1500 gagatgattg cgactcatgt cattttggta agcaaacaat taatcaacat cctactaaag 1560 aaaaacgaga ttgtttgcct ggtgagcgtt ttcattcgga tgtctgccat atagggataa 1620 tgtcgtggaa taaatgcaaa tactttctaa caatgaagga tgaagcatct ggctatcgga 1680 gagttttctt tttgaaaacg aaagatgaag tatcaaacat actgaagcag tttttcattg 1740 atgcagaaag agaaacagga agaaaagcaa tttctttaag aacagacaat ggaacgaaat 1800 atgtaaatga aaatgtaaaa gaagttctaa agttaagaaa cattattcac gaattatcac 1860 cacctaatgt taaacaatgt aatggaatgg cagaacgaga aaataggacc ttgtgtgata 1920 ctgctcgatc tttgctgttt aatacggatt tgtcaagaac ggatcgtcac ctcctttgga 1980 cggaagctag tggtacggca gcttatctta gaaatcgaat tccaaatcga ggaatcatga 2040 atacaacacc atattatgaa tggtatggta agaaaccaaa catctctcat cttcgaattt 2100 ttggtgcaaa agcttttgta cgtgttcctg acacaatgag acgaaaaatg gatccaaagt 2160 caagaaaaac tgtgtttgtt ggatatgatc gattaactga taaaatctat cgagtttttg 2220 atccaacaag gaaagttgtg gaaagagttt ctgatgttat aattcaagat tcatcagatg 2280 aaaataatca agtattattt cccttatcgt ccgacaagca agtagaagac tttgaagaac 2340 cggctattga atcctcaagt tcaggaaatg gcgaaaaaga agagaaggat gattctgaag 2400 aaacgataga taattcatat aaggaggaag taagttcaag tgacgattat gaagaagtta 2460 gatgacataa atgaagattt gctgaaaatg agaaaaagag gacggccagt tggctcaaag 2520 aattttcaga agccagttcc ctcaaccgat agagaactaa gaagcggatc aaataaatta 2580 gcatgcatag ctgcaatgaa agtgtcgatg gatccaacat catatgaaga tgcaatttct 2640 agagaagatg cacacctttg gaagaaagca atggatgaag aaatgatgtc attaaccaaa 2700 aatcaagtgt ggaagctaga agaattgcca aaggatcgac ctacagtctc gtgcagatgg 2760 gtttttaaat ccaagttaaa agcagatgga acaattgaac gatataaagc gagacttgtg 2820 gctcgaggtt tcaatcaaac taaaggagta gactattttg aaacattctc acctgtagtt 2880 agatatgaat cagtgagaac tgttatggca atagctgcta aatacgatat ggaaatgacg 2940 cagtttgata taaagacggc ttttcttaat ggtccactta aagaaaggat atatatgcaa 3000 caaccagaag ggtacgatga tggaactcat cgtgtttgtc tcctgcagaa gggtatctac 3060 ggactcaaac aagcatctag gaattggaat atctttttca acgattttgt aatggaatat 3120 ggcctgatac agtccgatgc ggatccatgt attttcacta aggacgctaa tacagataac 3180 tggatgattt tatgtttata cgttgatgat ggactcatag tttgcaaaag caataaattg 3240 ctgaaggatt ttatttcgtc gcttagaaga aaatttgaaa ttacttgtca cgagccatca 3300 tgctatgttg gtatggatat caaacgtgat cgagaaatga agactattta tgtcaatcag 3360 caaggataca tatctcgaac tttatgtcgg tttgggatgc aagattgtaa accagtaacg 3420 tcaccgatgg atagttctgt caaattaatg gaacagaaga atgaagagga tatcattgga 3480 aagagattcc cttatcgaga agcagtgggc agcctaaact atattgcgtt gatatcgaga 3540 ccagacatat cttacgctgt gaataccctt gcaagattct cgaataatcc gtcggaagta 3600 cactggagag caattaaaca tgtgatgaaa tatctcaagg gaacaattgg tttttctttg 3660 tgctacaaag gaaagtcaga agataaatta gttggatact gtgattccga ctatgcagga 3720 gatctagtgt caagaaggtc aacatctgga tatgtgttta tgctacacga cgcaccaata 3780 tcctggtcat caaatctcca acgagttaca gcactttcgt caacagaggc ggagtacatg 3840 tccatttcgg aagctctaaa ggaactccta tggttacggt cactgttgga atcttttggt 3900 ctgaagcaag caggaaagac tgaattgagg gtagataatc aagcggcaat agctatgtcg 3960 aagaaccttg aattccataa aaggactaaa catattgaaa tacgttttca tcggattaga 4020 caagagcagg aagcagggaa cgtggatgtt acctacgtac catcaaataa gcaggtagca 4080 gacttgttat cgaagggctt aacatgggcg aagatttcag agtatctgga aattttgaag 4140 atgacgtcat gaacaggggg gtgtgt 4166 // ID Gypsy-14_OD-LTR repbase; DNA; INV; 225 BP. XX AC CABV01004575; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_OD_; KW Gypsy-14_OD-I; Gypsy-14_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004575; Positions 1004 1228. XX SQ Sequence 225 BP; 56 A; 69 C; 32 G; 68 T; 0 other; tgttgtgttc ttgcttacaa tcccactatt acgcctgtca taatgttacc cgtcatcacc 60 caccagtaca cttgactggt tacgacacac cccactaaat tctgctgcac tatataagcc 120 cgctctgcct tgccactgta tcactcgcta gaccaataaa gccactattc ctcgcagtct 180 tattcttcct aaggttttct aggatctata agcaggaact ctaca 225 // ID Gypsy-7_OD-I repbase; DNA; INV; 12853 BP. XX AC CABV01000577; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_OD_; KW Gypsy-7_OD-LTR; Gypsy-7_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-12853 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000577; Positions 4459 17311. XX CC Positions [2873-3346] - Reverse transcriptase CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 92..5872 FT /product="Gypsy-7_OD-I_1p" FT /translation="MTNGNLESLSGFQHYDDDKVRRLTFTEGDSAARASKV FT DGWKMALTMSLQDRLKRHNPGNVMHGMVTKIPLLWGKSFDASGPLGIPSTL FT AIFKDWQDEGRYDGYEYLEQMKVVGPDMQKSTMFDGTNYTANKQKERFIDT FT FFPHLKVEDFPYLFNGACQQIPTCPVTEMRAAVRTVGPVILNSVELNSKSS FT VEQGDCINLDSYCQQMKELHGQLPMSEMSRTLSILKSFSLGHLRDRRAANN FT FDNVTRCLEELLIAHPKGGLYTEGTCLGNSTSLRVRRYVNPAREKWGPAID FT SVAMFFTLAPLSKKDFDDILQKVAKKQGVDVSTVNTRDLSNQKRLFFEEYD FT RITSAKGHTQFEHLKIATDCVNGGDIRGELARFEKSWIEKSSKKKGSVNLV FT ETNESEVDNEVNAVGQRQVRRTENRRPLTLRDAQESGKYWVNAKTKKVCYV FT NETKLPALTEEQMRSWKSIPNKAFVALTPRIGGNKHNRRNLRQRNVVEKAY FT EKGRNVRIGRKQYRAFEIDEGDQCCPVFEEIAEQIALVDVDSGSETEALPE FT EAHFLENEDDIIPGKNETIMQVSKIFSLKDRTKRKTKEKNLAYAKFKVLNS FT MNVIKEFKVLFDSGASCNVCPEEFVSIIMHNWSGNVRTIDCEASEAKVANG FT NALQFENYNADFEIDFTEKFKLKFTNTKVHKAGGNTIILGRPGMAENGVEM FT KVKSSPCEAELQFTVRGSTLESLWKETGSSPLNDLNEVRLDGEVRQVEAVL FT ERTPTFYNHLYEGSEFKNERPTNAFDPNGTRAWYRKIERIHQENRNKNTVE FT DVIVDPDSEVLKNDNKTRDKLDAILNKYRSVFEATVGQVTAKEFEVHATID FT PSRSDRSPKSAPLGYGKNLPESIQKGIEDTLDKEAAEGVLRFLPKGMTALN FT SVSFFGVGKRDAETAKIEMSPTNVRIVVDCSKGLNDNTQHCARQMDSIQRI FT LQLAAPHTKKGFIAQIDISSMFHCFTLAENLWPHFVVMHPKMGEMCYCRLP FT MGWIKSPGVARDLITRIMYQHLEYTQIYMDDMFIYGPTKDSLLERLENVLA FT TLKFRNLRLKGKKCLIFSRDVVLLGRRVKDGKILMNKHILQKALESTPETI FT TTIKALKRYLGVINYLSIGLPRRTETLWELNKEASGSKKLSEKVNWTPELT FT AAYVKVAKAINEQMLELFPVEKQLATYLVVDSSNLGSGAYLYQLNNEKTQI FT VRLWSKKRGDNGLKTQWSSCQLELSGILNAVLNFVWEIDYVTEPVTIITDS FT LSVQKLFTRARQGKSLSQDKRINDMIVKLLAFDIEIVYDSGQSTQIHLADF FT ISRSEYLLTPCDKTCAICDLTERNVIPSKELGEKMNLVTEIEEAIVNFEVF FT EAIYDIQNEEAFKTQVGKTVKNYHITPKLRKEVVEMAVLRSANKQLKRKIA FT KNDPKVSRITHILEELEAVGSTEFLTDYRLIRAMQLSDKIVGKALNSKSSN FT KMPGPKDRRAETLRGTTFLSQEGPGILRKKKIMVGTRMVEPIVIPDGYAEA FT VVEIFHKGNCGTITRLTNILKSEVWFNRLAALVQERVKSCLNCTYMRNQPK FT IVTAQREYDDKDPEYIGETLFADVITRNAHRAHDETTMKFWVVSDCISSYT FT RLYPVGNKENNAARATEILTESIADFNRGISGQIPIKIIMDGSSVNQSLKK FT REIWKELNVEIVITEKTGGSKNYLAPLDSRIAKLSPVLQTEIWNKLNPKVT FT ALKVESKINSTKGGHGYSAFEIWNNRSQFSNVPLSINIKEIRDYIKESRKL FT SRHARERNEIKGRVRAPYVLKPFEKNDKYGGDLVSPIKLGDMVLIEGNFDK FT NKQHPWFQVVEDDKFPTGIDWEHGIIFTQRMGITRKSRYKWSFKAIKAIID FT GRTNNADAMQTHKAAMRRFREQIPELNMIEHLKPRSNVTDKTLWLYGRCIP FT IYE" XX SQ Sequence 12853 BP; 4073 A; 2598 C; 3405 G; 2777 T; 0 other; ttatacgtgg tgactgaagt ctttaattag actgaaggat agattccacg ccgacttata 60 cgaacctcgc ggctaggacg tttcttgagc catgacgaac ggtaatttgg agtccctaag 120 cgggttccaa cattacgatg acgataaagt cagacgactg accttcacgg aaggagacag 180 tgcagctcgt gcaagcaagg ttgatggttg gaaaatggct ctgacaatga gtctccaaga 240 tcgtttgaaa cgccacaatc caggaaatgt gatgcacggc atggtcacta aaatcccgtt 300 gctctgggga aagtcattcg atgctagcgg tccgctcggc attccgagca ctctggccat 360 tttcaaagat tggcaagacg agggacgtta tgacggatat gagtatctgg agcagatgaa 420 ggtcgtcggg cctgacatgc agaaaagtac aatgtttgac ggtacaaact atacggctaa 480 taaacagaag gaacggttca ttgatacgtt tttcccgcac ttgaaggtgg aagacttccc 540 ctatttgttc aacggagcat gtcagcaaat accaacatgc ccagtaaccg agatgagggc 600 tgcagtccga actgtgggtc ccgtgatctt gaattcggtc gaactgaact cgaaatcgag 660 tgtagaacag ggcgactgta tcaacctgga ttcgtattgc cagcagatga aggagctgca 720 tggtcagctg ccgatgtcgg agatgtcgcg aacattatcg attctgaaaa gcttttcgtt 780 aggtcatctt cgggacaggc gagcagctaa taatttcgat aatgtgacca gatgtctgga 840 agagttgttg atcgctcatc ctaagggagg tctgtacacg gaaggtacat gtttgggaaa 900 ctcaacctcg ttgagagtaa gacgctacgt aaatcccgct cgagaaaaat ggggtccagc 960 aatcgatagt gtggctatgt ttttcacgtt agctccgctc agtaagaagg atttcgatga 1020 tattttgcag aaggtagcaa agaagcaagg agtggacgtc tcgaccgtaa atacaagaga 1080 cctgtcgaat cagaaacggt tgttttttga ggaatacgac aggataacgt cagcgaaagg 1140 tcatacgcag tttgagcacc tgaaaatcgc gacggactgt gtaaacggcg gagacatcag 1200 aggagagctt gcaaggttcg aaaaaagctg gatcgaaaag tcatctaaga aaaaaggtag 1260 cgtaaaccta gttgagacca acgaaagtga agtcgataac gaagtaaacg cggtggggca 1320 acgtcaagtt cgcaggacgg agaacagacg gccgcttaca ctgagagacg cccaagaaag 1380 cggaaagtac tgggtcaacg ccaaaactaa aaaggtttgt tacgtaaatg aaacaaagtt 1440 accggcgttg acggaggagc agatgcgttc ttggaaatcc attccgaata aggcgttcgt 1500 tgcgttgacg ccaagaattg gcggaaacaa gcacaaccga cgaaatctac gccaaaggaa 1560 tgtggtggag aaggcgtacg aaaagggccg caatgtgcgg attggacgga agcaataccg 1620 cgcgtttgag atcgacgaag gagatcaatg ctgtcctgtg tttgaagaaa tcgcagaaca 1680 gatcgcattg gtagatgtgg actcaggatc ggagacggaa gcgctgccag aagaggctca 1740 cttcctcgaa aacgaagacg acattatccc aggtaagaac gaaacaatta tgcaagtttc 1800 taaaattttc agcttaaaag atcgaacgaa aaggaaaacg aaggagaaaa acttggcata 1860 cgccaaattc aaagtgctaa acagtatgaa cgtaataaaa gaatttaaag tactcttcga 1920 ctcgggagcg tcatgcaacg tatgtcccga ggaattcgtg tcgattatta tgcataactg 1980 gagcggaaat gttcgcacta tagattgtga ggcaagtgaa gcaaaagtag caaacggcaa 2040 cgcgttgcag ttcgagaatt ataacgcaga ttttgaaata gattttaccg agaaatttaa 2100 gctcaaattt acaaatacaa aggtgcacaa ggcaggcgga aatactatca ttctaggcag 2160 gcctggcatg gcagaaaatg gcgtagaaat gaaggtaaaa tcgtccccat gcgaagcaga 2220 attgcagttt acagttagag gtagtacgtt agaaagttta tggaaagaaa cgggatcgag 2280 tccgctgaac gacttgaacg aagtgagact ggatggtgaa gtgagacagg tcgaggcggt 2340 tctagagaga acacccacgt tttataacca tttgtacgaa ggaagtgagt tcaagaacga 2400 gcgccctact aacgcattcg atccgaatgg aacaagggca tggtacagaa agatagaacg 2460 aatccaccaa gagaacagaa acaagaacac ggtagaagac gtgatcgtag atcctgacag 2520 tgaggtccta aaaaatgata ataaaacaag ggacaagtta gatgcgattt tgaataaata 2580 cagatcggtt tttgaagcca ctgttggtca agtaacagca aaggagttcg aggtacacgc 2640 gacaatagac ccgtcaaggt ctgatagaag tccgaaaagc gcgccgttag gttatggtaa 2700 gaatctaccg gaatccatcc aaaagggaat tgaagatacg ctggacaagg aggccgcgga 2760 aggggtcttg aggttcctac caaaaggaat gacagcgtta aattcggtaa gcttctttgg 2820 agtcggaaaa agagacgcag aaacggcaaa gatcgaaatg tcaccgacaa atgtacgcat 2880 agtggtggat tgtagcaaag gcctaaacga caatacgcaa cactgtgctc gccagatgga 2940 ctcgattcag agaattttac agttggcagc gccgcatact aagaaaggtt ttatagcgca 3000 gatagatatt tcgagtatgt ttcattgttt tacgttagca gaaaatcttt ggcctcattt 3060 tgtggtaatg catccaaaaa tgggggaaat gtgttattgt agattgccga tgggctggat 3120 caaatcgcca ggagtagcga gagacttaat tacgaggatt atgtatcagc atttggaata 3180 tacacaaata tatatggatg acatgttcat ttatggaccg acaaaggatt cgttactaga 3240 acgattagag aacgttctcg ctacgttaaa atttcgaaac cttcggctga aaggcaaaaa 3300 gtgtttaatt ttctcgcgag acgtagtgtt gttgggccgc cgagtcaagg acggaaaaat 3360 actaatgaac aagcatatat tgcagaaagc actggagtcg acaccggaaa caataacgac 3420 cattaaagcg ctgaaacgat atttaggcgt aataaactat ttgtctatcg ggttgccccg 3480 aaggacagaa acattatggg aactaaacaa ggaagcgtca gggtcgaaaa agctttcaga 3540 aaaggtgaat tggacgcctg aattaacggc ggcatatgtt aaagtagcca aggctataaa 3600 cgaacagatg ttggaattat tcccggtaga aaaacagctc gctacatatt tggtagtgga 3660 ttcgagcaat ctgggatcag gtgcgtatct ctatcaactt aacaacgaaa agacgcagat 3720 agtacggttg tggtcgaaga agagaggtga taacgggttg aaaacgcagt ggagctcgtg 3780 tcagctagag ttgagtggca tcctgaatgc agtattgaac ttcgtttggg aaattgatta 3840 cgttacagaa ccagttacta taataacgga ttcgttgtca gtacagaaat tgtttacgcg 3900 agcacgtcag ggaaagtccc taagtcaaga caagcggatt aacgacatga tcgtaaagtt 3960 gttggctttc gacattgaaa tagtatacga ctcaggacag tctacgcaaa ttcatttagc 4020 cgattttatt agcagatcgg aatacttact gacgccttgt gataaaacgt gtgcgatttg 4080 tgatttaaca gaacgtaatg taatcccgtc aaaagagtta ggagagaaga tgaacctcgt 4140 aacggaaata gaggaagcca tagtgaattt tgaagttttc gaggccattt acgatataca 4200 gaacgaagag gcgttcaaaa cgcaggtggg gaaaactgtg aaaaactatc atattacccc 4260 aaaattgaga aaggaggtag tagaaatggc agtgctgcgt agtgcgaata agcagttaaa 4320 acgaaaaatt gcgaaaaacg accctaaagt cagtcgaata acgcatatat tggaagaatt 4380 agaagcagtt ggctctacgg agttcctaac ggattataga ttgataagag cgatgcaact 4440 gtctgacaag atagtcggta aagcccttaa ttcgaagtca agtaataaaa tgccgggacc 4500 gaaagataga cgagcggaaa cgttgagagg cacaacgttt ttgagtcaag aagggccagg 4560 cattttaaga aaaaagaaga tcatggtcgg gacgagaatg gtagaaccga tcgtgatacc 4620 ggatgggtac gccgaagcag tagtcgaaat attccacaag ggaaattgcg gaacgatcac 4680 aaggttgacg aatatcttga aatcggaagt gtggtttaac aggttggcgg cgttggtgca 4740 agaaagagtg aagagttgct taaattgtac gtacatgaga aaccagccaa agatagtaac 4800 tgcacaacgt gagtatgacg ataaggaccc ggaatatatc ggcgagacgt tgttcgcaga 4860 tgtgataacg agaaacgctc atagagcgca tgatgaaacg acaatgaagt tttgggtagt 4920 aagcgattgt ataagcagtt atacgaggct ttatccggtc ggaaataaag aaaataacgc 4980 ggcaagagct acggaaatac tgacggaaag tatcgcggac tttaacagag gaataagcgg 5040 acagataccg atcaaaatca tcatggatgg ttcgtcagta aatcagagtt tgaagaaacg 5100 ggagatttgg aaggagctta atgtcgaaat tgtaattacg gaaaaaacag gagggtcgaa 5160 gaattacctg gcgccactcg actcgcgaat agctaaatta agtcccgttt tacaaactga 5220 aatctggaat aagcttaatc caaaagttac agcgttaaag gtagaaagta aaattaactc 5280 gacgaaaggc gggcacggct acagcgcgtt cgagatctgg aacaacaggt cgcagttcag 5340 taacgttccg ttgagtatca acataaagga aatacgagat tatataaaag agagtcgaaa 5400 gttgagcagg cacgcacgag aaagaaatga gattaaggga agagtgcgag caccttacgt 5460 gctaaaaccg tttgagaaaa acgataagta cggaggagac ctagtgtcgc caataaaact 5520 aggcgatatg gtcttaatag aaggaaactt cgacaaaaat aaacagcatc catggttcca 5580 ggtcgttgaa gatgataaat ttccgacagg gatcgattgg gaacacggta taattttcac 5640 gcaaagaatg ggtatcacac gcaaaagtcg atataaatgg tcgttcaaag ctataaaagc 5700 gattattgac ggaagaacaa acaatgcaga cgcgatgcag actcataaag cagcgatgcg 5760 aaggttcaga gagcaaattc cagagttaaa catgatcgag catttaaaac caagaagcaa 5820 tgtaactgat aaaacgttgt ggctgtacgg ccgatgcatt ccaatttatg agtaacgagt 5880 ttattgcaga gaaaaagcaa ttacagatca cggacatcgt cgaaagagtc gtggaagttg 5940 gaagcgccag ccacgatagg ctgaggaggt agtcggacgg attcgcgagg cacgtcacgt 6000 tgacggggct aaactgagta atgaaaagta cactcaaaag agctacctca ttattggcgg 6060 tcgaccgctt ggcccgatca aagaacgact ttgaggggcg atcagttggt gcagacaagg 6120 aagtctcacc ggtaccgttg gtgcaaggga cgaggcgagc cttctcggct gccttagcgt 6180 taattttttg gatctctaga tcgttcttgg cccgttgaag tcgtagtttg tcttcctttc 6240 tgtcagcttt gatttgttcg tcgcgtcgac gagtctcggc gcgttgttca cgagcagcat 6300 cccgacgctc tttacgagaa gaatcgccac caccaccgag gcttggcaaa tgattccata 6360 aaagagcgca tccgtcttcg cagcaagcta tgcaaagacc gcccatgcaa atgaatacga 6420 acagcacgcc ggcaatggtt aagaacctga aaagagctgt tactttacca tcaaaactcg 6480 taataaggtc gcaattaatt accaagtata cagctcagcg ccaccaggca gccaccgggg 6540 gtcttccggc tcagcagtag gacgaaccat gtgaagcaga gtctcgatgg gtacatgtac 6600 gccagtgcca cgacgtggac gagcagattc gctggtagaa ggcgtaggag atgcgccacg 6660 gcgtcgtcga gaaagtacgt cgtaagtttg cggataaaac gtgtattggt aagagtcaga 6720 accggatttc tctcgtacgc agtcagctcg ctgaacattc gagccagctt cgaaggtcag 6780 tgtcacggta cggaaatgct tgtattgtgg aaagtctgtc aaagcttcct tcgcccagtt 6840 ctccaaaaca gagtcgcaga cgtcagttgt ggcgtaatgg aaaagctcat ggccagcacg 6900 agcgagaggt cgagccagat tggacgtttt gtcaagaggc gtagtattag acgataaaac 6960 ggaagaaacc gagtcagcag cggtctgcaa gataggaccg acatggttga ttacggcgga 7020 agtgtggttt gcagcgcgag acacggaacg agaaatgctc ttggatgcag cctccttgac 7080 cttcggagag gtaatccaca tcacgtagcg gtcgatcttc tccataagac ccagctgatc 7140 tacggtcact agttcgccag cggcatttag tacgtagtca gttccgttga tcagttgata 7200 caggccggtc tgctcgtaga agttcggcat tcctccagaa acggcgtcaa taagaccagg 7260 catgatgggt tgcattggag gaggattagt tcgccaaaaa gcctgtagaa ataaagtaga 7320 acggaaaatg agagtttaac ttacgatttt ggcgaaaagg aaaacgaagc cttggtaaag 7380 ccatatgatg acgttgagaa ttccggtgta ccacaagacg aaaaccagaa gcgcaaagca 7440 gatgaggcca aaaagcgtgt acgtaccgaa agtggcgcaa atcatgctcc aggcagaagg 7500 accacgacaa actggacgct tgttaccttt ccagcgacgt tctcgacgct cggcgcgagc 7560 ttgtttcttc tttagcttgc gtttacgttg gtttatgtac atcgggtgaa gtacagcagg 7620 tccgggaaga tcgggagtct catggtcgta acgctctagg cggacgcggt aatgttcgga 7680 gtccttgact aaatagtggt cagtagttgg aacactcata tcgaatgaat ggtcgaataa 7740 agtgcagttc ggttaaatac agagagaaag aggcccagtt tcctccaaca ctaggcggtg 7800 aaaggattca gtaaggttta cgatggagcg ctgcagcttg tttacgttag aatcgaagca 7860 ggtcggaaaa cagcgggaca aataagcgtc agcgcaaaaa tatttatgct cgtactgcaa 7920 aaacaggtca agcgtctcag caaatagatg caggaactcg tcaaagggcg tatccgtgcc 7980 atctggcgtg acaacactcg gtgggaaatc gagacaaatg tcaaaaatga gcgggcgacc 8040 gatttcgatc aaataggtcg ataaataccg cataaatcca aaataggtaa aatatggaga 8100 acgattacga agagtcgaaa cgtagagaag tcgtaagctt ggattgaaat ttcctacggt 8160 aaagataagc agcgtgatta acgtagaggt cgtcggcgga tcggaagcgt ttcagaaaat 8220 cgtaattaaa ctcggggaaa aggttgtcgc aataatgaag ttgcacactg aagtcatgag 8280 gtgaaagtta aatgagaact tagccctacc cgtgagcggt ccgaggagca aaatccggaa 8340 gagaagcgag cataacgtcg gtagcttggg catactttaa cgcgttcagc ttcaagtcgc 8400 gcccgactag tcgccgagcg ctgcgtgcta ccacttcgcg gcaaggcgtc cctggcaaat 8460 catcgtattc gctttcgtca ggatcagcgc aagaagacaa cggagagaac tcgtcgagac 8520 aggaaatgtc catcccggga gtgacgaggc cggataaatc aacgttcggg tcgaccggga 8580 cagatcgtgg tcgtttggga gtaagcgaaa tcattgtaga ttcggcgtag cggcgtttag 8640 tcgacagaac tttggctaaa agagggatat acgcaagaat cacgcaagat cttacgcaag 8700 acgtggcgtc gtttatggca aatccgcttc acagtcggcg cggggccaat agtcgagtcg 8760 gcgtcgcgag cgcgttttga gttgcgcgtt attctcatct gaaaagacaa attacgtaag 8820 gatacgtagg cagtacgatc agcagacacc ggtcgggcca gttagcacac acaatgagcg 8880 gaacgtttaa aataacggaa aaggattttt gtggggaatg cctgagcgcg ttctgcggaa 8940 tccgagctag agaagccgtc aaccacccat gcgggtgcgc tatctgctac aagtgcttct 9000 cggagcaagt caagtactgc aaagtatgcg aagcgcacgt ctaccgagta gtgaaacgga 9060 tctactgcag taaggagtat cgccgtcgta agtggacgta caaccagctg cagaaaatta 9120 cgaatgaaaa cgacacaaag gcctctcttt tgaaaaaact tgcggataaa aacttactgt 9180 gatcgctctc gcaaattgtc tcctgtaata cagtcgagca acactctgaa aaattcgatt 9240 ttttagatgt cggaaaacgt aacggaatcg gcgctgaaac ggtgttatga cgaaatcgct 9300 tcgccggaac ccaagatccc atcaccgact tttccgacgg cccgccaaga agagaaatct 9360 aacggaaaag ttggaatcgg acgaaaccga ttggccatgg ataacggaag cagtcataaa 9420 ccggctgcgc catccgaaaa gatcagagca gaacgcggcg atacggatgg ccatgatcgg 9480 cccaaagttg ggaagctcgc cgtggaacgc actcatcctc aatgtcgaga tagacatagt 9540 aactcgggtt atggagaatt taatgaacgc agaatcgaaa ggtgaataga aatgacgaac 9600 aggtttaaca taacattcag tttatacggc gttgatgaga gcacgagcag aaaaagcgat 9660 cgaaaagcgc atcacaggtc agcgaaataa gagacgatcc cctcgacgag gtttttcgtc 9720 aggttcgcga gcgtcaggag cgcgtttaac acgtacagca gccaaaagac catctttcgg 9780 tagaatgtgg cgataaaact ttaaggtccg ccggccgtaa aagagctcag ccgttccgga 9840 cgcaatcaac ggaacagctc cttaaataag ttaaataaaa ttgcgttata acgaagtacg 9900 gacgtggctg agtgtagcaa tcggaagggc agcaatcgta acgctcgccg ttaatacgaa 9960 cagttgcgcg gcgaggagtg ccaaaacgag ttttgccaac aaaatgagcg agaatacgat 10020 aaaaacctga aaataaataa catgtttcgt agcttccctc accgttctct gcgacaatgg 10080 cgacagggga gaagcacatt aagagcccag ttgatggcaa gtccggtcca agcgaagatc 10140 tcgtcctaaa cgcaatttac gagctacggg acgcgctgga ggaggaaaac gaggcggaag 10200 cgcttgagag ctggatcgaa agcatgaaaa ccccaggtaa cttacgtgtc ccaaaaagca 10260 gtgccgttcg tcagtcagga acgcatacat agacagataa acagccatat ccaaattttc 10320 tttgggccac gagtcgaaat ttgcctttag ttgcatccca acttaaagca tcattaataa 10380 cgaaaattat agcaaagctg aataccaagc tgttgcccac cgtgttcgaa cgggatgacg 10440 cgtgcaaagg gacgccggac taccgaacaa ttgaagagct tgtatgaacg aattcaaaca 10500 ggtactgcgg tcggtaaaat aggcactcgg gactataaaa gcagacggct ggattttgtt 10560 cttaggatga cggaaccgta tcataaacta aaataacgtg aatattcagc tctcaaggac 10620 gagaacgaac ctgacgccat ttaacgatat aaagaaataa ggcaaggggc acgtccatgc 10680 caaatgacga gtgtgagttg ccgcaaattt tcccctcatg tcgataaacc cctaaaaaga 10740 gcgtattagt cgttaaataa agtaaattac cggtaaaaga gtagcattcg agtagctcag 10800 tagatgtcat ggtgatccat ttcagcattc ggttcaataa atgacggcga gtgtaaacac 10860 gctgatcgtg cgacaacgca tcatcatgtt ctagcaaaat ttcctcccca gaagcgtcac 10920 tatcttcgtt atcagagagc ccgtcaacgg cagaaatctc gttcgtaaca tcgcagtatt 10980 gttgaaacag atcaacaacg ttgagagtgc gtatcgtaga accgattggt acgacaaggg 11040 aaatagttcg atattgtacc ttgcgcgtag cgaaatcgag aaatttacca acaacagtaa 11100 gctttccagc acgcaagtca tagtcggtga tgtttgactg atgaaaacat cggttcatgg 11160 tttccaaact ggaaaaccta gccttgtact cgtcatgacg caaagaataa cggacacggg 11220 gagtaatatc ggcaaattcg taactcttat aatccacgcc ggaagctcgt tgatctgtca 11280 gcttgtgtaa agaacgcaac actcgatcgg ggtccaggcg ccgaaggtcg tcaacaccac 11340 taaaggaaaa gggattaaga atgaaagtgt tcgaaggtaa cggttcgcta gcggaacggc 11400 tggacaacat ggcgaacgcc ttcagaggcg aacaacaaca cgccgaggcc gaaagaatgg 11460 acgatcaacc ggtggatcca ataccgcccg tgagagcccc agtggagcca aacgcttttg 11520 atccaggctt cctcggagta cgcccggaag tggcgaatga acgagaagga acgcacgcgc 11580 aacaagcgca aatgctggaa gcgatagccg actcgcgtga cgaaaaaacg cataagcgcc 11640 ttgctcaagg tgtgcgtcca ggcgagttct tcgaactatt cgcctcgctc ggtaagaaat 11700 caatttagtt gggaaataaa aactaaagca atttaggatg ggtgaagaaa actcgcacta 11760 acgcaatctg tgcggcgttt ttagagtctc tcaagtcgtg gtgtgaccac atgcgagacg 11820 ttgacatggg catgaatgcg ctgccggaat cggtaaccga acaatttaaa ataaaaactt 11880 acgtttgaat aatagaagca gcacatgccg ctatcgatgc cattgacggt attcctgaat 11940 tgggcgcaga aggcgatctg gaagttgtgc ttggacgctc ctagccagct cgcagggttc 12000 gtagttcacg ttaatttcaa ctacacgccg ggatttacga agcaaaccgg cactagagtt 12060 gtcttggaag ggctggattc gatcgcggaa gtacgtagat tgataaacga gcgatcgtct 12120 gaaaatctga acgagagtat cgtctctgcc gaattggctc ggctcgagaa ctttaaacgc 12180 gagatcggat tcgcccacaa ggtgagtgga gagcgtacct ataaaagcgt aacgaattca 12240 cagctgacgg agtacccagc tgacggatct gatgcggata ttatcgcatg ggcgctcaag 12300 gcgaacgaag aaatcggaaa aagcgccaag aaaatgtccg attttacgaa catggtcgca 12360 tccacggaca aatggacgct ggacggtaca tataagtacc ttacgtcttg ctccatgctc 12420 aaagagctgc aaggagtcgc acgattcgac aatctcagct cgaacacgat cgtcacggtg 12480 tcggacacag gtagtctccg agccttcgtc tacaacgacg gaacagtgtc aagaagtacg 12540 tatctaaaat caaaacaatt ttgggaaaaa ttctcgattt gtaaaaataa tttcagccaa 12600 taagctgcca gcatactgca ggtcttggcg cctagttacg ggtggcagaa gcaacataaa 12660 ctacgggaat gcaggaggct tcttcccgga aggaggatat cgaagagaag aacgtgatgg 12720 cgatcatcga gagccaccgg caaagcgacg ttggggtcga ggcggaggct cgtaccgagg 12780 aaacagtaac agaggcgcgt atcgaggacg cggcagagga ggtcaccatt aagaaaggca 12840 tttcaggtaa aaa 12853 // ID Crack-29_AAe repbase; DNA; INV; 5437 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-29_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5437 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1245-1245 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 1124..2179 FT /product="Crack-29_AAe_1p" FT /translation="MEGEGSICVECNKVERDASKLITCMYCFAEAHFKCRN FT LVGNAARRIKDKMYFCSHNCSSIYQRITEMQNQKSLIVDTLAAELKGAVSS FT AVSQEMKNVRSEVHQITTAIERSQQFLSDKFEAIVSDFQDLKKENEHLKLE FT IDRLKHTQHTLSNTVHKLEHTVDKNARLANCNNAVVFGVPFYPGENTTEIA FT NKILACYGVNVGSDSILSAERLSGNNKTKNALIPIRVSFKDSGMKETVFDK FT KKEYGVLLSSAINTNYLINGKPTSITIRDELTPLSLELLGKMREYQEKLKI FT KYVWASRGGNILVKKNEHSKPEIIKTRDDLHELINRYCNHSPEKTTPSPKR FT KCGNNSSNK" FT CDS 2233..5115 FT /product="Crack-29_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MANVINAYHDCINEFNLNYPECSNDQFRIAQWNIRGM FT NDLRKFDNIPLFLESTKVPIDVFVVGETWLKSSNSGLYNITDFNAVFSCRE FT TSSGGLAVYIKSGLSFAVLNNVSCDGFHLIHIEVKKNGIYYEVVGLYRPPS FT FDFYRFHEEIENILSVQNSRLRFIVGDINIPVNLSNNNVVVRYKNLLESYN FT YSCSNTFVTRPISNNILDHLVCNNDVLGRIRNDTIFTDVSDHLPIVSSLEI FT YGPKESVILTKQIIDRDKMNALFTQFLNNFVCGDNVNDSISTITNTYSSIL FT EQCTKIKSEQVNFKSKYCPWLNHYVWQWIKLKRKYRKKLKSDPHNEYLKSM FT FIYVCKKTDNAKKKCKKEFYGNLLENTCHAKLWKNLNTIMGRYKAKTCIEL FT NIDGRKTSSISEVCDIFNNFFTQIGTNLASKISISNLNPLSNVKRVGRSIF FT LKPTNANEVIDIIKELNAKKSCGPDNFPACVFKNNATTFSYIIVDLFNKML FT QQGVFPDCLKIAKVIPVYKSGDASDPSNFRPISTLSTFSKIFEKLLVNRFI FT SFINQNNILYKYQYGFRKGCSTTTATVELVEFLLSKIDSKCIVGGLFLDLK FT KAFDTLNHRILLQKLECYGFRGLANDIIKSYLTDRQQFVAIDGCRSSLQTV FT NVGVPQGSNIGPLLFLIYINDLGNLPLQGIPRLFADDTAIFYPSNDTPSII FT SSINSDLQILMQYFDCNLLSLNLLKTKYMLFHSPRKKVTTHMNPSVGQVAI FT EKVLKFKYLGLVLDANLSWGSHIEHIQGKIASLCGLMYRVRPFVPRNALLK FT FYFGCIHSHLQYLIIVWGHACKSKLKKLQVLQNRCFKIIYSLPQLYSTNQL FT YTNLTHKALPIRGLCELQSCLFIYDVIKNPNMHHNLILNTRIHGHNTRHAN FT NLVRSRASSNLGQMRISFYGPTVYNLIPEQLKLINSRLLFKTSLKQFFRSK FT INSFLL" XX SQ Sequence 5437 BP; 1760 A; 883 C; 1017 G; 1775 T; 2 other; ctggcaacac tgttgtttgg ttgatacgcg ttatatcttg catattttgt gattgatttt 60 atagagtgta atttattttt ttaattatat ttcatcattc gatttgaacg tgagtggcat 120 atatttcatt gtttgtaagt cagtcatatt gattatgtgc taaatataga taaataagca 180 attttaaaat agawgggtat gatatcttca cagaagagat gaatatatgc ttgtactgag 240 gtgatgctga tgttgtattg ttgactgctg ctgttgttgt tgctgtgtga ttggttgttg 300 gctctgtggc acgmaaataa aatttactaa accttgtgta tgagtgcgga agtgattcag 360 gttgtgcttg ctaagtggtt ctatcagctg caacagggga gaaatacatg caaattattt 420 tggtcgacag atttaacgaa aatcaggcga acgggcgtct tacacaatca agccgttggt 480 tcgtgcttgg tgtactggtg ctgtagtagt agggttagta gtacactttg tggttctgtt 540 acagaagaaa tgcatgttga gtatctctag ttcagcaggt agattcggtg aaatacgagc 600 gaacgggcgt cttgcacatt gtagccgttg gttcgtgcta ggtgtattgg tgctgaagta 660 gtttgtatag taacacactt gttctgtata ttcaaggagt gaaattattt gcatgcaaga 720 cttgcacgca aatttgagta attctcgctg ggtggacaga aatagccaat cttaataacg 780 agtacccaat actgtacaat taattcgcaa tcctaacgtg caccttagat acttatgaac 840 tttttcatgg tagcttcata tgcagtatgt tagcaacaat actttgattt tgatgtgtgt 900 atgcaaatgg aaaatggttt atgatatcat caaattctat taacaggctc caatatctat 960 ttttagccgt ttgcaaatag agtactgact acgatagccc ttgcataact tttatttttt 1020 aaagtggtgt gtgtaacggt tcatttctat ttgcagcagt ataaagttgt ttaccttggt 1080 tgattgattt tggacctgtg actgttgtat caaatctgta tcaatggaag gcgaaggtag 1140 catttgcgtc gaatgcaata aggtcgaaag agatgccagt aaattaataa catgcatgta 1200 ttgtttcgct gaggcgcatt ttaaatgtcg taatttagtt ggcaatgcgg cccgccgtat 1260 taaagacaaa atgtattttt gctcacacaa ttgctctagt atttaccaac gaattacaga 1320 aatgcaaaat caaaaatctt tgatagttga cacgcttgcc gctgagctta agggagcagt 1380 gtcgagtgct gtgtctcagg agatgaaaaa cgttaggagt gaggttcatc agattacaac 1440 tgcgatagag cgatcacagc aatttttgtc agacaagttc gaagcaatag tatcagattt 1500 tcaagatttg aaaaaggaaa atgagcattt gaaactcgaa atagataggt tgaaacatac 1560 gcaacacact ttgtcgaata cggttcacaa actggaacac acggtagata aaaatgctcg 1620 tcttgctaat tgcaacaatg ctgtagtttt cggggttcct ttctatcctg gggagaacac 1680 gacagaaata gctaacaaaa tattagcctg ctacggtgtg aatgttggtt ctgattcgat 1740 tttgtcagcc gaaaggctca gtggtaataa caaaaccaaa aatgcactaa ttcctattcg 1800 cgtttcattc aaggacagtg gtatgaaaga aactgttttt gataaaaaga aggagtatgg 1860 ggtgctcctt tcatctgcga tcaatacaaa ttatctaata aatggcaaac caacctctat 1920 tacaatacgc gatgaattaa ctccgttgtc cttggaattg ttgggtaaaa tgcgcgagta 1980 tcaggaaaaa ctgaaaatta agtacgtctg ggctagtaga ggtggaaata tcttagttaa 2040 gaaaaatgag cactcaaagc cggaaataat taaaacaagg gatgatttgc atgaattgat 2100 taatcgttac tgtaaccact caccggagaa aacaactcca tcgcccaaaa gaaagtgtgg 2160 caacaatagt agtaataagt aatttaagat aatgtgtttt cttatgtatg tttatatatt 2220 tgtttgtatt caatggctaa cgtaataaat gcttatcatg attgtatcaa tgaattcaat 2280 ttgaactatc ctgaatgtag taatgatcaa tttcgaatcg ctcaatggaa tattagaggc 2340 atgaacgatc tgcgaaagtt tgataacatt ccattatttt tagaaagtac taaggttcca 2400 attgatgttt tcgttgttgg tgaaacgtgg ttgaagtcaa gcaatagtgg tttatacaat 2460 ataactgatt ttaatgctgt tttttcttgt cgagaaacat cgtctggtgg tcttgctgtt 2520 tatattaaaa gtggtttgag ttttgccgtt ttgaataatg ttagttgtga cggatttcat 2580 ttgatccata tcgaagtcaa gaaaaatgga atttattatg aagtggttgg cttatatagg 2640 ccaccatctt ttgacttcta ccgatttcac gaagaaattg aaaatatttt gtccgttcaa 2700 aacagtcgtc ttcgtttcat agttggtgat ataaacattc ctgttaacct gtccaataat 2760 aatgtcgttg tgcgttacaa aaacctcttg gaatcataca attatagctg ttcaaatacc 2820 tttgttaccc gtccaatcag taacaatatc ttggatcacc ttgtatgtaa caatgatgtt 2880 ttgggtcgaa taagaaatga tactattttt acagatgtca gtgaccactt acctatagtg 2940 tcttcactcg aaatctatgg acctaaggaa tctgttatac tgactaaaca aataattgat 3000 agagataaaa tgaatgctct ctttacacag tttttaaata attttgtttg tggtgataat 3060 gttaatgatt ctatctctac tattacaaat acatacagta gtattttgga acaatgcaca 3120 aaaatcaaga gtgaacaagt gaattttaaa tcaaaatatt gtccttggtt aaaccattat 3180 gtatggcaat ggataaaact taagcgtaaa taccgtaaaa aattgaaaag tgatcctcat 3240 aacgaatact taaaaagtat gttcatttat gtttgtaaaa aaactgataa tgcaaaaaag 3300 aaatgtaaaa aggaatttta tgggaattta ttagaaaata cttgtcatgc caaattgtgg 3360 aaaaatttaa acacaattat gggacgctac aaggcaaaaa cttgcattga gcttaacatt 3420 gatggacgca aaacatctag catctcagaa gtctgtgata tattcaacaa tttctttact 3480 caaatcggta caaatcttgc ttcaaaaata tcaatatcca acttaaatcc gttgagtaat 3540 gttaaaagag tagggagatc tatttttctg aaaccaacga atgcaaatga agtaattgac 3600 ataataaaag aactcaacgc gaagaagagt tgtggtccag acaactttcc agcatgtgtc 3660 tttaagaaca atgcaactac attttcatac ataattgttg atcttttcaa caaaatgtta 3720 caacaggggg tattcccaga ttgccttaaa attgcaaagg taatacctgt ttacaagtct 3780 ggtgacgcat cagatccaag taattttcgg cctatttcca ctctctccac atttagtaaa 3840 attttcgaga aattattagt gaatcgtttt atcagcttta taaatcaaaa caatatttta 3900 tataaatacc agtacgggtt tcgaaagggg tgtagcacaa caacagcaac agtcgaactt 3960 gtagaattcc tgcttagcaa aattgacagt aagtgtatag ttggtggcct ctttcttgat 4020 ttaaagaaag cgttcgacac gttaaaccat agaattcttc tacaaaaatt ggaatgctat 4080 gggttccgag gtttggcaaa cgacattatt aaaagttatt taacagacag gcaacaattc 4140 gtagctattg acggttgtcg tagctctcta caaacggtta acgttggagt cccgcaaggg 4200 agtaatattg gaccgctatt atttctaatt tacatcaatg atcttggaaa cctaccactt 4260 caaggaatac ccagattgtt cgctgacgat acagcaattt tttacccaag taatgatact 4320 ccatcaatta tatcttctat aaacagtgat ttacaaattc tcatgcagta ttttgattgc 4380 aatttacttt ctttaaattt attaaaaact aaatacatgt tatttcattc cccacgtaaa 4440 aaagttacta cacacatgaa cccttctgtt ggacaagttg ctattgagaa agttttgaaa 4500 tttaaatacc taggcttagt gcttgatgcc aatctttcat ggggaagtca catagaacac 4560 attcaaggaa aaattgcttc tctttgtggt ttgatgtatc gagttagacc atttgtccct 4620 cggaacgcac ttcttaaatt ctattttggt tgcatacact cccatcttca ataccttatc 4680 attgtctggg gccatgcctg caaatccaag cttaaaaaac tacaagtcct acaaaacaga 4740 tgttttaaaa taatttattc tctaccacaa ttgtactcta caaaccaact ttacacaaat 4800 ttaactcaca aagcacttcc aatacgtgga ttatgcgaac ttcagtcttg tttatttatc 4860 tacgatgtca taaaaaatcc caacatgcac cataatctga ttctgaatac cagaattcac 4920 ggacacaata caaggcatgc taataattta gtgagatcca gagcttccag taatcttggc 4980 caaatgcgaa tatccttcta tggtcctact gtatataatt taattccaga gcaattgaaa 5040 ctgataaata gtagattatt gttcaaaaca agtcttaaac agtttttcag gtccaaaatt 5100 aattccttcc ttttataaaa ccatatcaac tgccagattc tagctacatt cttttgaagt 5160 acactctata aaatgcgata aattgttgtg ttaaatatgt catattattt ctgttcaatt 5220 aaatttctag cgaatcaggg atccctttaa aggaaatcaa ttccactggg tatccctagt 5280 taagttatta ttgaagtata ccttacacac accgataagc ttcttttttt gttttcttgt 5340 tagtttttta gaatttaagt gaagatgagt ccactaccag ggggctcatt aacagagctt 5400 tttggtgtgg gggtaagcgg cgggtaaaaa aaaaaaa 5437 // ID Copia-106_AA-I repbase; DNA; INV; 4204 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-106_AA_; KW Copia-106_AA-LTR; Ty1_copia_Ele116; Copia-106_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4204 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1535-2038] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1091..3328 FT /product="Copia-106_AA-I_2p" FT /translation="MLIGERRQPAVVKDVLYVPGLQCNLFSVRRVEDNGMA FT VKISNGSVMISKGERVMATGRRAGQLYKMNIELRGSAGASSEPGAMAAKQD FT CFDLWHRRLGHLGEASMKTLWKHGMVETSTKVLDWPNQCICEVCIQAKQTR FT NPFDDVAERRATRPLEVIHSDVCGPFTPRTWDNKRYFLTFIDGYSHFTVVY FT LLHSKDEVQEKFEEYYARVTAFFGTRVARLRCDNGGEYLGGSFQKFCRSKG FT ISIETTVPYSPQQNGVAERMNRTLLEKSRAMIQDAGLPKSMWGEAVLTAAY FT VTNRSPTAALKEKKTPFEAWYRKKPNIDKMRVFGSVAHTWIPKEKRTKLDP FT RSEKNILVGYTTNGYRIWNAKKKKIFISRDVVFDEKRSGSEQEKPVTQVLY FT RSYAESEVSEDRSVKELDMTEPTDAIANQPEEKLTDEEGSEVTDDDGEGNH FT DDDEIDSALPSQPERDNCPAEVITRRSERERRLPGKLLDFITGSRATTAAE FT CFSGVSDGFGESTMAFALNAEMFVDNLPTTIDGLRRRDDWNDWKQAIGSEL FT ESLRKNETWELVPRPQGKNIVDCRWVFRIKRDEAGNVDRYKARLVAKGYSQ FT RKGFDYDETYAPVAKGATVRALLAVANQMRYHIHQMDVKTAFLNGKLTEEI FT FMRQPEGLEPTDPGLVCRLKKSLYGLKQAPRSWNSEFHNFILQLGFQRSNA FT DSCLYWRRQGNEVVYLLLYVDDIILVSGRLDLIAEIKKKLSSTNLK" XX SQ Sequence 4204 BP; 1156 A; 959 C; 1230 G; 858 T; 1 other; ataggttatc ggcccaggac agaaaggaat cagttcttca agatgtcgga agaaaagctt 60 tggctgtttg acggaaccaa ctttggtaat tggaagttcc ggattgaagt acttctggaa 120 gacaggaaca tgctggaatg tttgaccaag gccatcgacg aggaggagta ttcgaaggat 180 ctggcgacgg acacagcgga ggtgcgtgcc gagaagaaga agaagctcga agctcgatgg 240 gccctggatc gaaaatgcaa gaacctgatc atcaaccgga tcgcggaaga ccaattggag 300 tacgtcaaag agaagaggac ggccaaggat gtctgggatg ccctacggaa caatttcgaa 360 caaatcggaa tcgctggtaa gttgatcctg aggaagcaat ttcaagagct gaagctgtcc 420 gaaggtggtg atgtgaagca atcctgctga agtttgaaaa ggttctgcga gagctgcggg 480 cggcmggagt gaacattcag gaagaagacg tcgtttgtca gcttctgctg gcgcttccga 540 agtcgtatga tgcgctcatt actgcactgg agacgatcca accgcaacag ttgactctgg 600 agtacgtgaa gaagcgccta ttggacgaac aagcgaagcg agcgaatcaa tcaagccgcg 660 gcggagattt cggccaggcg agcgactcgg cgttcgtcgg aaagaaaatg gggcttaagt 720 gctttggatg cggtaagctt gggcacaaga gagccgaatg tccggaaaat tgtcctccaa 780 acgaatctgg tcaccaagga aaccgaaaaa gcggagggaa ggttaaccgg aagcagaaat 840 atcgagcgaa cgtcgctgag gagtcggaag tcgcgttcgt cgctaccgcg acgggagatt 900 gtttgtcggc ttccacggca acggaagacg tgaaaatcga ttgggttctc gactcgggag 960 ccacggacca tatgctgagg agcaaggaat ttttcgagga actacacccg ctagctgaga 1020 aggtacgtat agcagtggcc aagtgcggcc aggcgctcta tgcgagtacg cggggaccgt 1080 gagagtgaac atgctgatcg gagaaagaag gcaaccggca gtagtgaaag acgttctgta 1140 cgtaccggga ctgcaatgca atctcttctc ggtacggcgt gttgaagaca acggtatggc 1200 ggtgaagatt tcgaacggct ccgtgatgat ttcgaagggt gaacgggtga tggctaccgg 1260 tcggagagcc ggacagttat acaaaatgaa tatcgagtta cgtggaagtg caggtgcatc 1320 atccgaacct ggcgcgatgg ccgcaaagca agattgcttc gatttgtggc accggcgtct 1380 cggccatctg ggagaagcaa gtatgaaaac tctttggaag catgggatgg tggaaacgtc 1440 gacgaaggtt ctcgactggc ccaatcagtg catctgcgag gtatgtatac aggcgaagca 1500 aacccgaaac ccgtttgacg atgttgcaga gaggcgtgcg acgcgcccgt tggaagtaat 1560 ccactcggac gtgtgtggcc cgttcactcc gaggacctgg gataacaagc gttactttct 1620 gacgttcatc gacgggtata gccattttac ggtcgtgtac cttctacaca gcaaggatga 1680 ggtgcaggaa aaattcgagg agtactacgc tcgggttacg gctttcttcg gcactcgtgt 1740 ggcaagactg aggtgcgaca acggcgggga gtatcttgga ggatcgtttc agaagttttg 1800 tcgctcaaaa ggtatcagta ttgaaacaac cgttccatac agcccacagc agaacggtgt 1860 agcagagcga atgaatcgca cccttctgga gaagtcccgt gcgatgatac aggatgctgg 1920 tttgcccaaa tccatgtggg gtgaggcggt actaaccgcc gcatacgtga ccaaccgtag 1980 tccaacggca gcgctgaaag agaagaaaac tccatttgag gcctggtata ggaagaaacc 2040 gaatatcgac aagatgcgtg tttttggatc agtggcgcac acttggattc caaaggaaaa 2100 gcggaccaaa ctggatccac gttctgagaa gaatattctg gtaggctaca ccacgaatgg 2160 ctatcgcatt tggaacgcaa agaagaagaa gattttcatt tcaagagacg tggtattcga 2220 cgagaagaga agtggttctg agcaagagaa gccggtcact caagtcttat acagaagtta 2280 tgcggaatcg gaggtatcag aagaccgttc ggtaaaggaa ctggacatga cggagcctac 2340 ggatgcgatt gcgaatcaac ctgaagaaaa gctaaccgat gaggaaggca gtgaagtcac 2400 tgacgatgat ggcgagggca accacgacga cgacgagatc gacagcgcgc tcccttcgca 2460 accagaacgt gacaactgcc cagcagaagt gatcacgagg cgcagcgaac gggagcgcag 2520 acttcccggt aagttgttag atttcattac cggctcaaga gcaactactg ccgctgaatg 2580 tttctcaggt gtttctgatg gcttcggtga atcgacgatg gcgtttgcgc tgaacgccga 2640 gatgtttgtg gacaacctgc cgaccacgat tgacgggctg cggagacgag acgactggaa 2700 cgattggaag caggccatcg gaagtgagct ggaatcgttg cgcaaaaacg agacttggga 2760 acttgtgccg cggccacaag gaaagaacat cgtcgactgc agatgggtct tccgcatcaa 2820 gagggacgaa gctgggaatg tcgaccgtta caaagcccgt ttggtcgcaa agggctattc 2880 ccagaggaag ggcttcgatt acgacgagac ctacgcacca gtcgcaaagg gagccacggt 2940 gcgtgcgcta ctggcggtag caaatcaaat gcggtatcac atccatcaaa tggacgtaaa 3000 aaccgcgttc ctcaatggaa aattgacgga ggaaatcttc atgcgacaac ccgagggact 3060 ggagcctacc gaccccggcc ttgtctgtcg gctgaagaaa tcgctctatg gactgaagca 3120 ggccccaaga agctggaatt cggagttcca caactttatc ctgcaactgg gcttccaacg 3180 ctcgaacgcg gacagctgtc tctattggcg gcgacagggg aatgaagtgg tctatctcct 3240 tctttatgtc gatgatatca tcctggtgtc gggcagactg gacctaatcg ccgaaataaa 3300 gaagaaacta tcatcaacca atttgaaatg acggacattg gcgagatgaa gacgtttctg 3360 ggcctgaaga ttgatcgaga ccgacgaaag ggactactga agataagtca acccaagtac 3420 atcgctgacc tgctgcgacg ttttggaatg aacgactgca aaccaacaac gacaccactg 3480 gaaccgaatc tcaaattgga gcgatgcaaa ggtgaatcac tgactaccga gccctaccgc 3540 gagctgatcg ggtgtctgtc atatctggcg ctttcgtcaa gaccggatat ctgtgcagcg 3600 gtgaacttct tcagcaaatt tcaatcagca ccaacggacg cacactggag taatcttaaa 3660 cggattcttc gctaccttaa gggaaccgca aatcacggac tggttttcca aagacagcaa 3720 gcttcgaagc ctctggaggg atacgctgat gctgactggg gcaacgatcc ggatgaccgc 3780 aggtcgatct caggaaatgt tttccaggtt ttcggtggga ccgtctcgtg gatgactcgc 3840 aaacaagcta cggtggcatt atcttcgacg gaagcggagt acgtctctct cagcaatgcc 3900 gcgtgtgaag ccatctggtt gcgaaatctc atcctggaac ttggtgtcga actgcagcat 3960 ccagtgccct tattcgaaga caaccaatcc tgcatctgta ttgccgagga gccccgtgac 4020 cacaagcgaa tgaaacacgt ggacatacgc tacaatttca tccgtgaaaa gctgcaggaa 4080 gggctgttca agatccacta cataccaaca ggccaacagg tagcagacct gttcaccaaa 4140 ggtttggcgc gtggaccatt tgagacgctt agagataagt taggattgtt cggttgagcg 4200 ggcg 4204 // ID Gypsy3_MH-I repbase; DNA; INV; 3687 BP. XX AC ABLG01001411; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_MH; KW Gypsy3_MH-LTR; Gypsy3_MH-I. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-3687 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1522-1522 (2009). XX DR Genome; ABLG01001411; Positions 11220 7534. XX CC Positions [805-1257] - Reverse transcriptase CC Positions [2398-2871] - Integrase core CC 'GCCAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 58..3465 FT /product="Gypsy3_MH-I_1p" FT /translation="MALLSSVKKWWKSHECKEFSEQSSDILHMRISEKPKW FT LTKKVLVNGSEVEFILDSGAQITCVSENTWKCIGSPKLIGMPCKGKSFTGN FT QFKMLGSFIASIEVDGVQEVLETHVTSENWNLFGLPWIIEFESKLNYPIVS FT SIKKSIEPFEVLKIEKDESEISEIVKRLSVKFKKVFEEGLGHCTKMKAHLH FT LKPGVKPVFVRPRPVPIGVKDAIEKELNRLSEMGAIKPIEFANWAAPILAI FT KKANGKTRVCMDYSTGLNNAIELDRHPLPKPSEIWAEIHGSKVFSQLDLRD FT AYLQIELDESSKKLTCINTHKGLFEVQRLPFGVKSAPGIFQRFMDKLISGI FT PGVFAYLDDVIIVSRNIEEHKVRLFEIFDRIEKWGLKIQLEKCNFFKEKLK FT FLGHIVSKSGIEPDPEKKKKISDLARPKDVKELKSFMGTINYYGRFVSEMH FT KLRGPLDKLLKKEVKWEWSKEQEDAFQSVKKVLSSNLLLTHFDPELEIIVT FT ADASNYGVGAVISHRFPDGKEKVIEYASKSLNAAEKNYSQIEKEGIALVYA FT VQKFHKMLYGRKFVLRTDHKPLLAIFCKNKGIKVFSASRLQRWALLLTNYD FT FKIEFVRTDHMGMADTLSRLISESNTSEDKVIALVTKEESSEDEDSVEESA FT KYVLNIMLNEIPVNNEVIKSETEKDTELSVVKSYIKNGWPKLMSEPKLNPW FT SNRRLNLEVIEDCIIFNGKIVIPKSLIKSVLKVLHETHPGVNKMKGVAREY FT MYWPSMAKDIEEYVGDCLKCQAAAKKPIKAELRPWPKTDRNWERVHIDFAG FT PCKDGKLYLIVIDANSKWPEVFGNMTTSAKDTIKCLEWLGTHYGYPETIVS FT DNGSPFRSNEFKVYCENRGISQVFSAPYHPQSNGQVERFVDYFKRMMIKNS FT GKKDWLQEVLLFYRASPHVALEGKSPAEVFLGRKLRLKLARLMPRKVNIRN FT SDRTLQQLKMKKWFDSHHGVKNRKIEKEVHFLNYRNGRSNWLQGEVIKKTG FT VIYKIYSPVLNAIVTRHENQIRNKSVSSKSEELPTWWRNSENNFSNNSNIV FT KKPEYRQNKYQLRTNIKPTQRLVVTRDSVKKYAEAPVYTVPRTFQGSISSG FT PDHNSSVAGTSKRGHDENDKDDNKMMSC" XX SQ Sequence 3687 BP; 1364 A; 458 C; 839 G; 1026 T; 0 other; gtggcgcggt cgacaacaac atccagtgtg gtcagcagcc gacaacagca acaaaaaatg 60 gcactactta gttcagtgaa aaagtggtgg aaaagccacg agtgtaagga gtttagtgaa 120 cagagcagtg atattctcca tatgaggata agtgaaaaac ctaaatggtt gacaaaaaaa 180 gtattagtga atggcagtga agtggagttt attttagatt caggagcgca aattacttgc 240 gttagtgaga atacatggaa gtgcattgga agccctaagt tgattggaat gccttgtaaa 300 ggaaaaagtt ttacaggaaa tcagtttaaa atgttgggca gttttattgc aagtattgaa 360 gtagacggag tacaagaagt attagagacg catgtgacaa gtgaaaattg gaatttattc 420 ggattgccat ggattattga gttcgaaagt aaattgaatt atccaattgt ctcaagtatt 480 aagaaaagta tagaaccttt tgaagtactg aaaattgaaa aggacgagag tgagattagt 540 gaaattgtaa aaagattgtc agtgaaattc aaaaaagtgt ttgaagaagg cctgggccat 600 tgtacaaaaa tgaaagcaca tttgcattta aaaccaggag tgaaaccagt atttgtaaga 660 ccaagaccag taccaattgg agtgaaagat gctattgaaa aagaattgaa tagattgagt 720 gaaatgggtg cgataaaacc tattgagttc gctaattggg cagctcctat attggctatt 780 aaaaaggcaa atggcaaaac aagagtgtgt atggattatt ccacaggatt gaataacgca 840 attgagttag ataggcatcc attaccaaag ccaagtgaga tatgggcaga gatccatgga 900 agtaaagtat tttcgcagtt ggatctcaga gatgcctatt tgcaaattga attggatgaa 960 tcctcaaaaa aattgacgtg tattaacacg cataaaggat tgtttgaagt tcagagattg 1020 ccatttggag tgaagtcagc accaggaata tttcaaagat ttatggataa attgataagt 1080 ggaattcctg gagtgttcgc atatctggac gatgtaataa tagtgtctag aaatattgaa 1140 gaacataaag tgagattgtt cgaaattttt gatagaattg aaaagtgggg attaaaaatc 1200 caattggaaa agtgtaattt tttcaaagaa aaattgaagt tcttgggtca tattgtttct 1260 aaaagtggta ttgaaccaga tccagaaaag aaaaagaaaa taagtgattt ggctagaccg 1320 aaagacgtca aagaattgaa gtcctttatg ggaactataa attattatgg acgttttgtc 1380 agtgaaatgc acaagttaag aggtccattg gataaattgt taaaaaaaga agtgaagtgg 1440 gagtggagta aagaacaaga agatgcattt cagagtgtta aaaaagtatt gagttcaaat 1500 ttattattaa cgcatttcga cccagagtta gaaataatag tgacggctga tgcatcaaat 1560 tatggagtag gcgcagtgat ttctcataga ttcccagatg ggaaagaaaa agtgattgaa 1620 tacgctagta agtctttaaa tgctgctgag aaaaattata gtcagattga aaaagaaggc 1680 atagcattag tgtatgcagt gcaaaaattc cataaaatgc tatatggccg taagtttgta 1740 ttgaggacag accataagcc cttattggcc atattttgca aaaataaagg aataaaagta 1800 ttttcagcat ctagattgca aagatgggca ttgttattga caaactacga ttttaaaatc 1860 gagtttgtaa gaaccgatca tatgggcatg gcagacactt tgtcaagatt gattagtgaa 1920 agtaatactt cagaagacaa agttattgct ttagtaacta aagaagaaag tagtgaagac 1980 gaagattcag tggaagaatc agctaagtat gtattgaata taatgttaaa tgaaatacca 2040 gtgaataatg aagtaataaa aagtgaaaca gaaaaagata ctgaattaag tgtagtgaaa 2100 agttatatta aaaatggttg gccaaagtta atgagtgaac caaaattaaa tccttggtca 2160 aacagaagat tgaatttaga ggtaattgaa gattgtatta tttttaatgg aaagatagta 2220 attccaaagt cattgataaa aagtgtattg aaagtattac atgaaacaca tcctggagtg 2280 aataaaatga agggtgttgc tcgtgagtat atgtattggc caagtatggc aaaagatatt 2340 gaagagtatg tcggtgattg tttaaagtgt caggcagcag ccaaaaaacc aatcaaggcc 2400 gaattacgac cttggccaaa aaccgacaga aattgggaaa gagtgcatat tgattttgcc 2460 ggtccttgta aagacggaaa attgtacttg attgtaattg atgctaattc aaagtggcct 2520 gaagtgtttg gcaatatgac cacatcagcg aaagacacaa taaaatgtct tgagtggctg 2580 ggaacccatt atggttaccc tgaaacaata gtgtcagata atggaagtcc atttcgctct 2640 aatgagttca aagtgtattg tgaaaatagg ggaattagtc aagtgtttag tgccccctac 2700 catccacaat ctaatggtca agtggaaaga tttgtggatt attttaagag aatgatgatt 2760 aaaaatagtg gtaaaaaaga ttggcttcaa gaagtgctat tattttatag agcaagccct 2820 catgtggcat tagaagggaa atcccctgca gaagtgtttc ttggtagaaa attgagattg 2880 aaattggcaa gacttatgcc tagaaaagtg aatattcgaa atagtgatag aaccctgcaa 2940 caattaaaaa tgaaaaagtg gtttgatagt catcatggag tgaaaaatag aaagattgaa 3000 aaggaagtgc atttcttgaa ctatagaaat ggaagaagta attggttgca gggtgaagta 3060 attaagaaaa cgggagttat atacaaaata tactccccgg tattgaatgc tatagtaact 3120 cgacatgaaa atcaaatcag aaataagagt gttagtagta aaagtgaaga gctccctact 3180 tggtggcgaa acagtgaaaa caattttagt aataatagta atatagtgaa aaaacctgaa 3240 taccgtcaaa ataagtatca gttgagaaca aatatcaaac caactcaaag attggtagtg 3300 acaagggata gtgttaaaaa atatgcagag gccccagtat atacagtgcc aaggacattt 3360 caaggatcca tcagttcggg accagatcac aacagttcag tggcaggtac cagcaaacgc 3420 ggacacgacg aaaacgacaa agacgataac aaaatgatgt cttgctgaga ggagccaatg 3480 tataatagga taagattatt gttatgtaaa tattttttta ttgtaatttt ttcttgttcc 3540 tattgcccca tttccccatt gtttgtttgt atatttattt tgttttgtgc catggctggc 3600 gtagtatttt attgtattgt ttattgaagt attgtattgt aaggcttcgg cctaaagtgt 3660 gagccgaacc tttttcaagg ggggagg 3687 // ID Poseidon-4_HM repbase; DNA; INV; 2017 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Poseidon-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2017 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata (Poseidon RT group)."; RL Repbase Reports 8(12), 2087-2087 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 369..1817 FT /product="Poseidon-4_HM_1p" FT /translation="MRPVVSTIGTPPYGSSEYLVKIIQPTLNKNKTRLXNS FT SSFVNEAKSWVIYPNEIQVSFDVIALYPSIPIDKAIPVIIDILNNDKDDLE FT NRTKLTLTDVHQLIELCLSQCYFLYKNEIRTIPNSGPIGLSLMVVVAEAFL FT QHLETKTLRVAEINLFSPKSFKRYVDDGHARFDSLEKHDKFLELLNQQDPA FT IQYTSEIENERKELNFLDITIINTNNFHYDFKIHRKPAITNIQIKPNSNIN FT PAITSGVFKGFISRASKICSPKYLQDEIQFIINMFVENGHKKSELELITKN FT YLKIKNNPVGVVADQTKNGNITIKLPWIPRIGSKLRKELKRYGAKVIFTTP FT PTLKRILCNNKSKLLPNSKPGVYKLTCSCGGVYIGETKKKIIQRSIEHQIN FT SMKGNWHASGATEHCKNCHGIFNWMHPKTIAFKSNYHERKIRESLEINYAQ FT TNYEIRHFPPLLNKDNGVKIASNSWRPLFKKIIEISK*" XX SQ Sequence 2017 BP; 795 A; 334 C; 287 G; 596 T; 5 other; aaatcttgac aagttctcta aaaaggaaat taccagacaa tctcacaagg caacaaaggt 60 tcgccttaaa taatytaaaa aagaataaag acgctctaag attttacccg tttgataaag 120 gttcaggatt tgtagtaatt aacgaatcag atgcttttaa gaaattagat gatgaaatta 180 aaaagtccgt aattatcaat tatgacccta cacaaaccat cacaacaaaa tttcaaaagt 240 ttttacgtaa gttaagaaaa gaaaacaaat ttgataaaaa aacttatttt caactatacc 300 cttctgattg cattcctcca cgactatatg gcgtcataaa agctcataaa tcagataaaa 360 actatccaat gcgaccagta gtttccacta tcggtactcc gccttatgga tcttcagaat 420 atcttgttaa aattatacaa ccaactttaa ataaaaataa aacaaggtta mtgaattcgt 480 cttcatttgt caacgaagca aaatcmtggg taatttatcc taatgaaatt caagtctcgt 540 ttgatgtgat agctttgtac ccctcaattc caattgacaa agcaatacct gtaattattg 600 atatactcaa caatgataaa gatgatcttg aaaatagaac taaattaact cttactgacg 660 ttcaccaatt aattgaactc tgtctaagtc aatgctactt tttatataaa aacgaaatta 720 gaactatacc gaattctggt cctattggat tatcacttat ggtcgttgtt gcagaggcat 780 ttttgcaaca tttagaaaca aaaactttga gagttgcaga aataaattta ttttcaccaa 840 agtcattcaa acggtatgta gacgacggcc atgcgagatt tgattcatta gaaaaacacg 900 acaaattttt ggaattgtta aatcaacaag atcctgcaat tcaatacaca tcggaaatag 960 aaaacgaaag gaaagaacta aactttttag atataacaat aataaataca aacaatttcc 1020 actatgattt caaaatccat cgaaaacccg caattacgaa tatacaaata aaaccgaatt 1080 ctaacattaa tcctgcaata acatctggag tcttcaaagg ttttatatca agagcctcaa 1140 aaatttgttc tccaaaatac cttcaagatg aaatacaatt tataataaat atgtttgtcg 1200 aaaatggcca taaaaaatct gaattagagc ttataacaaa aaattatctc aaaatcaaaa 1260 ataatcctgt tggtgtagtt gcagatcaaa caaaaaatgg taacataaca ataaaactcc 1320 catggattcc aagaatagga tcaaaattaa ggaaagagct caaragatat ggcgctaaag 1380 ttatatttac aacaccacca actttaaaaa gaattttgtg taataacaaa tccaaactat 1440 tgccaaatag caagccgggt gtgtataagc ttacgtgttc atgcggtggc gtttatattg 1500 gagaaacaaa aaagaaaatt atacaaagaa gtattgaaca tcarataaac agtatgaaag 1560 gtaattggca tgcttccgga gccacagaac actgcaagaa ctgccacggt atatttaatt 1620 ggatgcaccc aaaaacgatc gcattcaagt ctaactacca tgaaaggaaa ataagagagt 1680 cacttgaaat taattatgcc caaacaaatt acgaaataag acacttccca ccgcttttga 1740 acaaagacaa cggtgttaaa attgcgtcaa atagctggcg acctcttttt aaaaaaatca 1800 tagaaatttc aaaatagttg tttacgttac gtcagctcat ctttgtaatg tcttctataa 1860 cggtttttta catacgtttt tttttgtgag atttttattg tatttaattt tacgtctgat 1920 gacggtctga actattcaga ctgaaatatt acataaataa attaaaatat tagtacttag 1980 ctgtattgta tgtgttataa attaaataaa ttaaagt 2017 // ID EHINV2 repbase; DNA; INV; 225 BP. XX AC X61182; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE E.histolytica inverted repeat downstream of rRNA genes. XX KW EHINV2; Inverted repeat; Repetitive sequence. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-225 RA Bhattacharya S.; RT "EHINV2."; RL Direct Submission to Genbank (01-AUG-1991)S. Bhattacharya, RL Jawaharlal Nehru University, School of Environmental Sciences, RL New Delhi 110067, INDIA. XX RN [2] RP 1-225 RA Mittal V., Sehgal D., Bhattacharya A. and Bhattacharya S.; RT "A second short repeat sequence detected downstream of rRNA genes RT in the Entamoeba histolytica rDNA episome."; RL Mol. Biochem. Parasitol 54(1), 97-9100 (1992). XX DR GenBank; X61182; Positions 1533 1757. XX SQ Sequence 225 BP; 96 A; 21 C; 18 G; 90 T; 0 other; aatgtttctt agtactatta acatttgaat tgaaaatagt ttaaataata ttatagtttt 60 ttaatataat ggatcttctt aaaaatagaa atttaaaata gataacactt ccttcatatg 120 tcttgaataa aaatgtatct actatttgaa tataaactat tctataaata gaacattaaa 180 taaaatattt tataatatct cagttataca acgttcagaa ttaat 225 // ID BEL-234_AA-I repbase; DNA; INV; 5519 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-234_AA_; KW BEL-234_AA-LTR; BEL-234_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5519 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 923-923 (2011). XX DR [1] (Consensus) XX CC LTRs are 98% similar to each other. The consensus includes a CC non-autonomous DNA transposon (deleted). XX FH Key Location/Qualifiers FT CDS 101..2260 FT /product="BEL-234_AA-I_1p" FT /translation="MQRTPVNKNVVASIPRVNDSISGEQHDRNKTVSESLR FT SCASGSGASKRSHKSKVSSVSKTQSQIRALELVDELEEEELRVSLEQDRIE FT AEQRLEALKIQKDLELKREQKKAEFIRKKRERAIQISELRSCSGASSCGST FT QRVQAWLNENENTANDDVDNGATGNAPRGAAATGTAVNEALCHALKALHSR FT NVRDLPQFSGNIMDWPIFENEFKTSTAEYKLTDRENLRRLNNALQGKARKT FT VECLLSAHENVDLIMRMLKSNFGRTEWVVANRLELLRNLDYVKDGNIESFR FT SFYNAVIGTAVALKNVKAEAYLINPELISQLAEKLPVFSKQMWVRHKAFLL FT KQDLIIGFQEFSRWLEDEMENQLASMNPVFSNRKERKDVSFIRPRPPVLNV FT NARAAQVQGKCPLCSSNDHRSLAKCEQFTKLSVNQRRSAARSCKVCYICLR FT QDHSRRECRSDKTCSVCDKNHHDLVHSDEEYKKPVNTFEKKQAEDLCHVNG FT RNERTLLRVGKVRIRSQGRVQDVFALFDEGSSTSMIDSNLAEKLDLRGPVS FT PVTYRWTNGITHNDSESMMLSFQIAGPSDQAKWYEVDNVRTIKNLNLPSVD FT FDIAKIMSLYPLLDEEKMSVLQNASPKMLIGSNNAGLIVPLKTVQYSVRGL FT QLTRCHLGWTIHGEIEPGTVQVADQFHVFLCDNNQDLELTELIKQQYKVED FT FGIRQQVQRCPKKMRELWTS" FT CDS 2191..3522 FT /product="BEL-234_AA-I_2p" FT /translation="SRRFWNQTTGPTMSEEDERALDIMNRTLKRCEDRFEV FT GQVYRYNNFSFPDSKPQALRRLTIMERKMDADPKFAEQYCQKIEDYIEKGY FT ARKLRPEELNETSNTWYLPHFSVMTANKFRLVMDAKAKSHGFSLNDLLLKG FT PDFVPSLTAVLMRGRQKKVAFMADIKEMFHQVRIREQDQNSQRFFWRGMNR FT STPPEVYVMMAMIFGAVSSPSIAQFIKNFNAKELEERLPGVLRPIVKQHYV FT DDYFDSTDTEQQAIDLIKNVITAHEHGGFKLVKFVSNSEAVLKSLDPSLRA FT EPKTDVRVLGLLWKLSADEIVFSFDFPNLDEALRSGVDIPTKRQLLQFMMG FT IFDPLNILSPVTIHLKILFQDLWRLQIAWDDQIPEGLIPKWKEWLRKTAQL FT KEITIPRYYFPGVPSFRSVELHAFSDASDKAFATVIYMVHRYVVIALPGI" FT CDS join(3479..4435,4439..5479) FT /product="BEL-234_AA-I_3p" FT /translation="FTWYIDMLLSLCRVSDKSHVALVLAKSRVAHLKSRTV FT PRLELQGCVLSSQMMKVVQEELEVEITSTYFWTDSKICLAWLCTKEKLTAY FT VGSRVCKIKENGHNIGMWNWIPSELNVADLATKSSKIESMTDWLRGPEFLK FT HRQSEWPNQACVTMSSEEAVNFHLGLEADEESVCHQVYGKSDAVDMPDIGR FT FSDYNRLIRSTAYYFKMRKILALPKKEKPKKFMINVHDMEEAKLAWFRKVQ FT XESFAEEICSLRTSGHVKTSSRLHAHSPVLVDGIIRMKGRIQDKSKPFDVN FT NPVILPDNHSFTNLLIGTYHNANGHQGETIVNNLKHRHRILKIKSQVKKFG FT NLCVKCRELRAKPNVAQMGNLPPERTTPFAFPFTHTGIDYFGPLTVKVGRH FT VEKRWVVLFTCMTSRALHLEVVPSLDANSCIMAIRCFMAIRGIPQKILTDN FT GTNFTGANQELKKLVKELDQQQIEETLSVRGIEWSFIPPGAPHFGGCWERL FT VRSVKTGMRAMLKERNPTDLVLRTTLCEVMNVVNNRPLTEVSSDPQEPEPI FT TPNMLLLGRNNHMQYDHDFIESSLDCRAAYKQAQIYADRFWRKWVSAYRPE FT LMKRQKWQDNRNYYEFAIGDFVMIIDENSHRGCWPKGIIEKVFYGSDGKVR FT TVTIRTSRSTYTDQLRR" XX SQ Sequence 5519 BP; 1655 A; 1149 C; 1351 G; 1363 T; 1 other; tttggtcctt cgataaccaa cacccagttt atatggaaga gcacctgcag cgaccgaaaa 60 ggtgcaacgc aggtacggtc ccttagaaag aaaatccaca atgcaacgaa cccccgtaaa 120 caaaaatgta gtagctagta ttcctagagt taatgatagc ataagcggtg aacaacatga 180 tagaaataag acagtatctg aatcgttaag aagttgcgcg tcaggtagtg gtgctagcaa 240 gcgatcgcat aaaagcaagg tgtcatcagt gagcaaaacc cagagccaga ttagagcttt 300 ggaactcgtc gacgagctgg aggaagagga actgcgtgtc tccctagagc aggataggat 360 tgaagcggag cagcgacttg aagctctcaa gatccagaaa gatttggagc tcaagaggga 420 gcagaaaaag gccgaattca tcaggaagaa gcgtgaacgt gcaatccaga ttagtgagtt 480 gcgttcgtgt tctggtgcat cgagttgcgg ctcaacgcaa cgtgttcaag catggttgaa 540 cgaaaacgag aatactgcca atgatgatgt tgataacggc gccactggaa atgcgccgcg 600 cggtgcagct gctaccggta ccgccgtgaa tgaggcattg tgtcatgctc tcaaggccct 660 ccatagccgg aatgtcagag atctaccgca gttttccgga aacataatgg actggccgat 720 attcgaaaat gagttcaaga cgtctaccgc cgagtacaag ctgacggaca gggagaacct 780 cagacgattg aacaacgcat tgcagggaaa agctcgcaaa actgtggagt gtctgctgtc 840 tgcccatgaa aatgtcgatc tgattatgcg aatgctgaaa tccaatttcg gacgcactga 900 gtgggtcgta gctaatagac tggagctact acgtaacctg gattatgtta aggatggaaa 960 catcgaatct ttccgttctt tctacaatgc cgtcatcgga acagcggtgg cgcttaaaaa 1020 tgtaaaagca gaagcgtacc tgatcaatcc tgaactgatt tcccaacttg ccgagaagct 1080 gcctgttttc agtaagcaaa tgtgggtgcg tcacaaagcg tttctattga agcaggatct 1140 gatcatcgga ttccaggagt tttcgcggtg gttggaggat gagatggaaa accaactggc 1200 aagcatgaat cccgtcttct caaaccgaaa ggagcgcaag gatgtatcgt tcatcagacc 1260 aaggccacct gttctaaacg tgaatgcgcg agcagctcaa gtgcaaggga aatgccctct 1320 atgcagttca aacgatcacc gaagtctagc caagtgtgaa cagttcacca aactgtcggt 1380 aaatcaacga cgttctgctg caagatcctg caaagtttgt tacatatgct taaggcagga 1440 tcattctcgt cgtgaatgtc ggtccgataa aacctgttcg gtatgcgaca agaaccatca 1500 cgatctggta cattcggatg aagagtacaa gaagccagtt aacacgtttg agaagaagca 1560 agctgaggac ctttgtcatg tcaacggaag aaacgaaaga acgctgctac gtgttggtaa 1620 ggttcgaatt cgcagtcaag gcagagtaca ggacgtgttc gctctgttcg acgaaggatc 1680 atccacctcc atgatcgact cgaatttggc tgaaaaattg gatctacgtg gtccagtatc 1740 ccctgtaaca taccgttgga caaatggaat tacacacaac gattcagagt cgatgatgct 1800 atcattccaa attgctggac caagcgatca agccaaatgg tacgaagtgg acaatgtaag 1860 aactatcaag aatttgaatc ttccaagtgt ggacttcgac atcgctaaga taatgagttt 1920 gtatccactt cttgatgagg aaaaaatgtc cgttttgcaa aatgcttctc caaaaatgct 1980 aattggatca aacaacgctg gtttgatcgt tcctttgaag actgtgcagt attcagtgcg 2040 aggattacaa ctaacgcgct gccatctcgg ttggacaatc catggagaaa ttgaacccgg 2100 gacagtgcaa gttgctgatc agtttcacgt tttcctctgt gataacaatc aagatttgga 2160 gttgactgaa ttgatcaaac aacagtataa agtcgaagat tttggaatca gacaacaggt 2220 ccaacgatgt ccgaagaaga tgagagagct ttggacatca tgaatcgcac gttgaaacga 2280 tgtgaagacc gtttcgaggt tggccaggtg tatcgttaca acaacttttc gtttcccgac 2340 agcaagccgc aagcgctccg ccggctaacc atcatggaga gaaaaatgga tgcagatccg 2400 aagtttgccg aacagtattg ccagaagatt gaagattata tcgaaaaagg ctacgcgagg 2460 aagctgagac cagaggagtt gaacgagacg tctaacacct ggtacttgcc gcatttcagc 2520 gtgatgactg cgaataagtt tcggctggta atggatgcta aagccaaatc gcatgggttc 2580 tccctgaacg accttttact gaagggacct gatttcgttc catctttgac cgctgttctt 2640 atgagaggta ggcagaagaa agtggctttc atggcagaca tcaaagagat gttccaccag 2700 gtgcgtattc gcgagcaaga tcagaattcc caacggttct tctggcgtgg catgaatcgt 2760 tcaacccctc ccgaagttta tgtgatgatg gcgatgattt ttggtgcggt atcttctccg 2820 tcaattgccc agttcattaa gaatttcaat gcaaaagaac tagaagaacg acttcctgga 2880 gtactgcgtc caatagtcaa acagcactac gttgatgact atttcgattc aactgacact 2940 gagcaacaag caattgatct gattaagaat gtaatcacgg cgcacgagca tggtggtttc 3000 aagctggtta aatttgtatc caactctgaa gctgtactga aatcgctgga tccatcgttg 3060 agagctgaac cgaaaacgga tgttcgtgtt cttggtctgt tgtggaagct cagcgccgat 3120 gagatagttt tttccttcga ttttcctaac cttgatgagg cactccgctc cggagtagat 3180 attcctacaa agcgacagtt actacagttc atgatgggaa ttttcgatcc cctgaacata 3240 cttagccctg ttacaatcca tctgaagata ttatttcaag acttatggcg acttcaaatt 3300 gcttgggacg atcaaatacc tgaaggactg atcccaaaat ggaaggaatg gcttcgaaaa 3360 accgcgcagt tgaaggagat aacaattcct agatattatt ttcccggcgt tccgtcattt 3420 cggagtgtag agcttcacgc tttttcggat gcaagcgata aagccttcgc taccgtgatt 3480 tacatggtac atcgatatgt tgttatcgct ttgccgggta tctgacaaat cgcatgttgc 3540 actggtacta gccaaatcac gtgtagctca cttgaaatca cgaactgtcc ctaggcttga 3600 actacaagga tgtgtgcttt ccagccaaat gatgaaggtg gtacaagagg aactggaggt 3660 ggaaataaca tctacgtact tctggacaga ttcaaaaatt tgtttggcgt ggctctgtac 3720 caaggaaaaa ctaactgcct acgtaggatc gagagtctgc aaaatcaaag aaaatggaca 3780 taacatcgga atgtggaact ggattccttc ggagttgaac gttgcggacc tcgcaactaa 3840 atcttcgaaa attgagagca tgactgactg gctgcgtggt ccggaattct tgaaacaccg 3900 tcaaagtgaa tggccgaatc aagcatgcgt aacgatgtca tcagaagaag cagtgaattt 3960 tcatttggga cttgaagctg acgaagaatc tgtttgtcat caagtttacg gaaagagtga 4020 tgcagtagat atgcccgata ttggaagatt ttcggactat aaccgtctga taagatcaac 4080 agcctattac ttcaagatga ggaaaatcct ggcgcttccc aaaaaggaaa aaccgaagaa 4140 attcatgatc aatgtacatg atatggaaga agcaaaattg gcgtggttca ggaaagttca 4200 amtggaatcc tttgctgaag aaatctgcag tttgagaacg tcagggcatg tgaaaacatc 4260 aagtcgattg cacgctcatt ctcctgttct ggttgatggt attataagaa tgaaaggtcg 4320 tatacaggat aagagcaaac cctttgatgt taataaccct gtgatacttc ctgacaacca 4380 tagcttcaca aacttgctga ttgggacata tcacaatgcg aacggtcatc aaggatgaga 4440 aacgattgtc aacaatctca agcatcgtca ccgtatcttg aagattaagt cccaggttaa 4500 aaagtttgga aacctttgtg tgaagtgcag agaactacgg gcgaaaccga acgtggctca 4560 aatgggtaat ttacccccag aacggacgac tcctttcgcc ttcccgttta ctcatacagg 4620 aatcgactat tttggaccac taacggtgaa ggttggacga cacgtcgaaa agcgctgggt 4680 cgttctgttt acgtgtatga ccagtcgagc gttacactta gaagtggtgc catcattaga 4740 tgcaaacagt tgcataatgg ctattcgttg ttttatggcc attcgtggta tacctcagaa 4800 aattctaact gacaacggca caaacttcac gggtgcaaat caagaattga agaagttggt 4860 caaggagctc gatcagcagc aaatagaaga gacactaagt gtcaggggaa tagagtggtc 4920 tttcatacct ccaggtgcgc cccattttgg cggttgttgg gaacgattag tgcgttcagt 4980 gaagactgga atgcgtgcta tgctgaagga aagaaatcct acggacttag ttcttcgaac 5040 aacactttgc gaggtgatga acgttgtcaa taatagacct ttaacggaag tctcaagtga 5100 tcctcaagag cctgagccaa taaccccgaa tatgctgttg ctcggaagaa acaaccacat 5160 gcagtacgat cacgacttta tcgagagcag tttggattgc agagcagctt acaagcaggc 5220 acaaatatac gccgatcgtt tctggcgaaa atgggtgtca gcttatcgtc ccgaattaat 5280 gaagaggcaa aaatggcaag acaaccggaa ctattacgaa tttgccatag gagatttcgt 5340 gatgatcatc gatgaaaact cacatcgtgg atgttggccc aaaggaatta tagagaaggt 5400 tttctatggg tcagacggca aagttcgaac agtaacgata aggacttcac gatcaacata 5460 cacagaccaa ttacgaaggt aattctgtta cagggcagtt ctcttggtgc cccggagga 5519 // ID Copia-5_DPu-LTR repbase; DNA; INV; 304 BP. XX AC scaffold_118; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_DPu_; KW Copia-5_DPu-LTR; Copia-5_DPu-I. XX NM Copia-5_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-304 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 674-674 (2010). XX DR Genome; scaffold_118; Positions 81962 82265. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 304 BP; 80 A; 68 C; 50 G; 106 T; 0 other; tgttgatttc aaatattatt ctcgcactgt tgcctaaacc tatttgtgtt gcggaaaccc 60 acaacacctc cctccccaag gtttgacggt cttgcacaac cgtcgtctgc atttttactt 120 gcttacttca cgtgttctga actggtgtgt acctgtctct gtccgaaaag gtgacttaga 180 gttttgtaaa aagcaagtta atctgtaccc actttgttaa actactcagg tattaataca 240 gcaagtgcta ttcaatagtt cgtatcttat ttctgttagc ttaatactaa agttaaactc 300 aaca 304 // ID CR1-41_HM repbase; DNA; INV; 4312 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-41_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4312 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1869-1869 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 64..804 FT /product="CR1-41_HM_1p" FT /translation="MEITIKNIEKIITNKFEEQKRSLLKETERLLKDQEKN FT FTQIVSGNIKIITDRLDKIENELNINKVNMRNFEKDVNDVKESLNFQEEKI FT KEELTQIRKRYDLEIKNLNKKNTDLENRSRRNNIRIDGLKDMPGESWSDCE FT KSVKNIFTNNLKISSEVIVERAHRIGSYKEDKTPRTIVVKLLNYQDKNKIL FT SSLKNLKGSGIYINEDFAKETMEERKRLWEEVKNLRNQGKYATIKFNKIYC FT REFRK*" FT CDS 861..3944 FT /product="CR1-41_HM_2p" FT /translation="MAKQTKDFETQHFNVFKSANNALLDNFYDPDLHFFNE FT NNFNSRFFDLKTFKNELKSLNNNFTVIHINIRSIINNIDKLKHFLVECNFL FT FSMICLTETWCSDESSRINSNLLIPNYKLISYERKVQKRGGGIMTYIHNDL FT TTKVRDDLSVSDADGEVFTIEIINQKLKNILVPTCYRPPEGDIKKFSNHLK FT QIFLKNNKEHKILFCIGDININCLQYNEDAVTKIFFDDMFQHYIFPIINKP FT TRVTSTSITAIDNILTNSFYDSSLKAGVIKTDISDHFPIYFSLFQDTKTNN FT SKTKVYKRKINKFTIQQFKDSLSAVRWDKIYQKCNLGHTNSAYNEFLEIFL FT KHYNNHFPIIEQQLKVKYLQCPWITRGIKKSSKKKQKLYIKYLKNRNEKNL FT NTYKQYKNLFEKIKKNSKKNYYSKQIKNATGDIKKTWNIIKEIIGNKKTKS FT NNLPARIIIDNIEYSDKNKIAEKFNEFFVNIGPNLASKIEYPNNSFESYLN FT DPQSELKYKELSYQELNVALNSLKLNKTPGIDDICSRIVANVFTVINKPIF FT EIFKSSIKTGVVPDKLKVAKVVPIFKTGESYLINNYRPISVLPTFSKLLER FT VIYNRLYEYLIQNKILNKKQFGFQKQYSTEHAILDLINSISDSFDKKQYVL FT GIFIDLSKAFDTVNHNILLKKMESYGVKNITLDWFKNYLSNRQQCVISDNY FT KHSKLLRIECGVPQGSILGPLLFLLYINDLTKTSKKLDVIMFADDTNLFYS FT SNSIKNLYECMNKELEKLNIWLKLNKLSLNTEKTKYILFHSKKQKNNIPTI FT LPPLKIDKVNIKRTETTKFLGILIDENITWRAHISTINTKISKNIGILYKV FT KPMLSQDNLKSLYFSYIQSYLTYANIAWASTHKSKLNTLYLSQKHASRLVY FT NKYKLTHAEPLLKTLNALNVYQINIYQNILFMLKYNLGLVPSHFLDNFFQI FT NSNRYTTRATGNFTLPKKTTKFSRFSISYRGPYLYNKIISENIELKTLNNS FT ATLKKKLKHLILNIKNFIDMY*" XX SQ Sequence 4312 BP; 1832 A; 664 C; 543 G; 1273 T; 0 other; aaaaaagttt ttgccgcgaa cgcacgtgtt tttgactgct tttcaagaat attaatttta 60 aaaatggaaa taacgatcaa aaatatcgaa aaaattatta ctaacaagtt tgaagaacaa 120 aaaagaagtt tgctcaaaga aacagaaaga ctgttaaaag accaagaaaa aaacttcaca 180 caaatagtaa gtggaaatat aaaaataatt actgatagac tcgacaaaat tgaaaatgaa 240 ttaaatatca acaaagtaaa tatgcgaaat tttgaaaagg acgtaaacga tgtgaaagaa 300 agtctcaact ttcaagaaga aaaaataaag gaagaattaa cacaaataag aaagagatat 360 gatcttgaaa ttaaaaattt aaataaaaaa aatactgact tagaaaaccg ttctcgtcgt 420 aacaacataa gaatcgatgg tctgaaagat atgccaggag aaagttggag tgactgtgag 480 aaatctgtga aaaatatttt tacaaataat cttaaaattt caagcgaagt tattgtggaa 540 cgagcacatc gaattgggtc gtataaagaa gacaaaacac caagaactat cgtcgtgaaa 600 ctattgaact accaagataa aaataaaatc ttaagttcac ttaaaaatct aaaaggaagt 660 ggaatatata taaatgaaga ttttgctaaa gagacaatgg aagagcgaaa gagactttgg 720 gaagaagtta aaaatcttcg caatcaaggt aaatacgcaa cgattaaatt taataaaatt 780 tattgtcgcg aatttagaaa ataagaaaat aagttttgcg taagttaaag cgatcgaagc 840 aaagttaatt ctttttactt atggctaaac aaacaaaaga tttcgaaacg caacacttta 900 atgttttcaa atcagccaat aacgctttac ttgataactt ttacgaccca gatttgcatt 960 tttttaatga aaataatttt aactcgcggt tttttgattt aaaaactttt aaaaatgaat 1020 taaaatcatt aaataataac tttacggtta tacacataaa cataagaagt ataattaaca 1080 atattgacaa gctaaaacat tttcttgtag aatgtaattt tttgttcagt atgatttgct 1140 taacagaaac ctggtgttct gacgaatcat ccagaataaa ttcaaactta ttaattccga 1200 actacaaact aatatcttac gagcgaaaag ttcaaaaacg aggtggtgga attatgacat 1260 atattcataa tgatctgaca actaaagtta gagacgatct ttctgtttct gatgccgatg 1320 gtgaggtctt tacaattgaa ataataaacc aaaagttaaa aaatatactg gtgcccacct 1380 gttacagacc acctgaaggt gatataaaaa aattttctaa tcatctaaaa caaatatttc 1440 taaaaaacaa taaagagcac aaaatattat tctgtattgg agacattaac ataaactgtt 1500 tacagtacaa tgaagatgct gttaccaaaa ttttttttga tgatatgttt caacactaca 1560 tcttccctat aataaacaaa cccacccggg taacatcaac ttcaatcact gcaatagata 1620 acatattgac taattcattt tatgattctt cattaaaagc aggtgtaatc aaaacggata 1680 tatccgatca ctttccgata tacttttctt tgtttcaaga tacaaaaaca aacaactcaa 1740 aaaccaaagt ctataaacga aaaattaata aatttactat tcaacagttt aaagactcac 1800 tatcggcagt gagatgggat aagatttacc aaaaatgcaa ccttgggcac accaactctg 1860 cttataatga gtttttagaa atattcttaa agcattataa taatcacttc ccaattatag 1920 aacaacaatt aaaagtaaaa tacttacaat gtccatggat tactagaggt ataaaaaaat 1980 cttcaaaaaa aaagcaaaaa ctttacatca aatatttaaa aaatagaaat gaaaaaaacc 2040 taaataccta caagcaatac aaaaatttat ttgaaaaaat taaaaaaaat tcaaaaaaaa 2100 actactactc gaagcaaata aaaaatgcaa ctggtgacat taaaaaaaca tggaacatta 2160 taaaagaaat aattgggaac aaaaaaacta aatcaaataa tttgcctgct cgaattatca 2220 tagacaatat agagtacagt gacaaaaata aaattgctga aaaattcaat gaattttttg 2280 ttaacattgg ccctaatctt gcctcaaaaa ttgaatatcc taataactcg tttgaatcat 2340 atctaaatga tccccagagc gaactaaaat acaaagaact aagctatcaa gagcttaacg 2400 ttgcactaaa ctccttaaaa ttaaataaaa ctccaggaat agacgatatc tgtagtagga 2460 tagtggcaaa tgtctttaca gtgataaata aacctatatt tgaaatcttc aagtcttcga 2520 ttaaaacagg agttgtacca gacaagttaa aagtagctaa agtagtacca atatttaaaa 2580 caggtgaatc atacttaatt aataactata gacctatctc ggtactccct accttctcta 2640 agcttctcga acgagtaatc tacaacagat tgtacgagta tttaatccaa aataaaatcc 2700 taaataaaaa acaatttggc ttccaaaaac aatattcgac tgaacacgca atcctagatc 2760 taattaatag tataagtgat tcttttgata aaaaacaata tgttctgggg atctttatag 2820 acctgtcaaa agcctttgac acagtaaatc ataatatctt acttaaaaaa atggaaagct 2880 atggagtaaa aaacatcact cttgactggt tcaaaaacta cctaagcaat aggcaacaat 2940 gtgttatttc agacaattat aaacactcaa aactactaag aatagagtgc ggtgtccccc 3000 aaggttccat tcttggaccc cttctgtttc ttctatatat taacgacctt acaaaaacct 3060 cgaaaaaact tgatgttata atgtttgccg acgacacaaa tttattttat tcctcgaact 3120 caatcaaaaa cctttatgaa tgtatgaaca aagagcttga aaaattaaac atctggttga 3180 aattaaataa gttatcacta aatacagaaa aaacaaaata tatattgttt cattccaaaa 3240 aacaaaaaaa taatatacca actatacttc ccccactaaa aatagataag gtaaacatta 3300 aaagaacaga aacaacaaaa tttcttggga tacttattga cgaaaatata acttggagag 3360 cccatattag tacaataaac accaaaattt caaaaaatat tgggatactt tacaaagtta 3420 aacctatgtt gtcccaagat aatcttaaat ctctttattt ttcttacatc caaagttatc 3480 ttacgtacgc taatattgca tgggcaagta cacacaaatc caagttaaat actctttatt 3540 taagtcaaaa acatgcatca agattagttt ataataaata taaactcacc catgctgaac 3600 ctttactaaa aaccctaaat gcactgaatg tttaccaaat caacatctat caaaatattc 3660 ttttcatgct taaatacaat cttggactag tcccatcgca ttttttagat aacttttttc 3720 aaattaactc taatagatat accacaagag caacagggaa cttcacatta ccaaaaaaaa 3780 caacaaagtt ctcacgattc tctatttcct accggggtcc gtacctatat aacaaaataa 3840 tatccgaaaa tatagaactt aaaacattaa ataattctgc caccttgaaa aaaaaattaa 3900 aacacctcat acttaatatc aaaaatttta ttgacatgta ttaaacttaa caacaagatt 3960 tatactaaaa tgtttatgtg tatcactgac tgtgtattgt ttaaccactt ttaactatat 4020 ataaaaatta attaaattta tttaaccatt tgaaaaggta cttgatgata agacttcata 4080 gtcttctgcg agtctccttt acaacaacat gtttatattt gaagaaacaa tagtttttat 4140 ttaaaaaaaa aaaacagtgt ttattaatct aacgttatat tttattataa ttgttaaatt 4200 acgtattctt atgttatgtt atatatttta ctctttgcat taaattatat gatagaattg 4260 taacgtaatg aagtgtatat tgaagttgta aaaaaaaaaa aaaaaaaaaa aa 4312 // ID P-8_HM repbase; DNA; INV; 3170 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3170 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 354-354 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(835..954,1030..2643) FT /product="P-8_HM_1p" FT /translation="MSNMNSSNILKQKYSPEVIVRAFEYFAQSRSLYDQLR FT NDFCFFKTQQNSQKNCMLLLDEVYVKPMLTYHGGQLFGKAVNNESNFATTI FT LAFMVVCFYGGTKFIVKMLPVNNLDADFVYDQIIILINQIKKTNGKVIALI FT NDNNRVNQAFFKRFDCLSLWLTTDGIFLLFDFVHVLKCIRNNWITEKSGEI FT EFSYRGKKYNAKWDHIKKLQKLEEGELVKMSKLTYVAVNPRPIDRQKVDTC FT LKVFCEETINALKCHQGMKDDDIDGTIIFLAKVVQFWKIVNVKSLSEEIHL FT KDNLRDPISSVNDLQLNDLIKFSEMLKPTLIQGKRIKSITMDTSNALYQTC FT NGLVKLSKHLLDNSQMYVLLGNFTTDPIERAFGKLRQGSGGTYFINTQQVI FT EKWNINKTKLFLQKTINISSAEFTSGHICDNCFYKLSEDECEMFNHLHELL FT DQLSSNIKSSLVYIAGYISRKDNEDDNDDTYLYYSNYGNFTSALDRGGLKI FT PSDTFCEWVFLSYILFNFVGTSNFCRYSMSCLFMDIACIYGFTIITRNHCY FT ILTNILFNNFCYMHSPASDKETKVKALKLFC" XX SQ Sequence 3170 BP; 1142 A; 433 C; 495 G; 1100 T; 0 other; caaggcgtac ttaaataaca ggccttctca tcgcgcaacg gcttaaaaaa tgtggaggct 60 gatttagtta gcggcaaaag ttactacgaa taacttgtat ccctaccaga agacttaaaa 120 atgcccagaa aatgttgtgt tactggttgt aactctaact atgactcagt aaaagataag 180 acaactactt tcagaattac cgagagatcc agtagagcgt ctaagatgga ttaaagcaat 240 cccaagagaa aatataccag acaaacatga cactgtagtt tgtgcaaaac attttccacc 300 aaacttccaa gttattaaag taaaaggaag agaaagacct cgagatccac ctagtgtatt 360 cgaacaaata cccaagagtt tagtgccaac tcctccacca gaaaaaagaa caacaacaaa 420 atccaaaagc tctgtaagag agcttagaga gtctctcaga tcaataaaag aagatgaact 480 ttcaaaattc gaagagcttt acaaaattga ttcattcgaa aatttttgta attcgttatc 540 tctagaaaaa cttaatgatt cagttgttta tctatttata attgtttaaa tagctgttgt 600 attcaatcaa aagagcatga aaacaatact ggtattaata aattttgttt aatagtattt 660 aatgacttat catatgaagc ctatcatgct ggagtaagat gcactattac ttccttagtt 720 aaaaatcgta ttcataagtt caaatgttgg tcgcaagttc atgaggcatt acgatatttg 780 atttcctttg aaaagtcaga aaagaaaaca gttttaattc agcaatttga atgcatgagt 840 aacatgaact caagtaatat tcttaagcaa aaatattcac ctgaagttat tgttcgagct 900 tttgagtatt ttgcacagtc acgtagttta tatgatcaat taagaaatga tttttaactt 960 cctagtattg ctacccttac acttataact tcaaaagttt cacactgatg ataatacttt 1020 tatttgtagt gttttttcaa aactcaacaa aatagtcaaa aaaactgtat gttattacta 1080 gatgaagttt atgttaaacc tatgctaact tatcatggag gacagttgtt tggtaaagca 1140 gtaaacaatg aatcaaattt tgcaacaact atattggctt tcatggttgt atgtttttat 1200 gggggaacaa aatttatagt taaaatgtta ccagtaaata atttggatgc agattttgta 1260 tacgatcaaa ttattatcct aatcaaccaa attaagaaga caaatggaaa agttatcgct 1320 ttaattaatg ataataatag agtaaatcaa gcatttttta aacgatttga ttgcttatct 1380 ctatggttaa caactgatgg cattttttta ctttttgatt ttgtgcatgt attaaagtgt 1440 atacgaaaca attggataac tgaaaagtct ggtgaaattg agtttagtta tagaggaaaa 1500 aaatataatg ctaaatggga tcacatcaaa aagcttcaaa agttagaaga aggtgaattg 1560 gttaaaatgt ctaagctgac ttatgttgct gttaacccaa gaccgattga tagacaaaag 1620 gttgatacat gtttaaaagt tttttgtgaa gagactataa atgcattaaa gtgtcatcaa 1680 ggtatgaaag atgatgatat tgatggaact attatttttt tagctaaggt ggttcaattt 1740 tggaaaatag tcaacgttaa aagtctatct gaagaaatcc atttaaaaga caacttacga 1800 gatcccatat catcagttaa tgatttacaa ttaaatgatt taattaagtt ttctgaaatg 1860 ttaaagccaa cattgattca aggaaaaaga attaaaagta taaccatgga tacaagcaat 1920 gcattatatc aaacttgtaa tggtttagtt aaactaagta agcatttgtt agacaattct 1980 caaatgtatg tactgcttgg gaattttaca acagacccca tagaaagagc atttggaaaa 2040 ctcagacaag gtagtggggg aacttatttt ataaatactc aacaagtaat tgaaaagtgg 2100 aatatcaaca aaaccaaact atttcttcag aaaactatca acatttcaag tgctgagttt 2160 acatctggtc atatttgtga taattgtttt tataaacttt cagaagatga gtgtgagatg 2220 tttaatcatt tacatgaact tttagaccaa ctttcatcga atattaagtc atctttagtt 2280 tacattgcag gttatatttc tagaaaagac aatgaagatg ataacgatga cacttatctg 2340 tattatagta attatggaaa ttttacatct gccttagatc gtggtggtct gaaaataccc 2400 agtgatacat tttgtgaatg ggtattctta agttacattt tgtttaattt tgttggcaca 2460 tcaaactttt gtagatattc tatgtcctgt ttattcatgg atattgcatg tatttatggt 2520 ttcaccataa ttacaagaaa ccattgttat attcttacaa atatattatt taacaatttt 2580 tgctatatgc attcacctgc atcagacaag gaaaccaaag ttaaagcttt aaaactgttt 2640 tgttagcttt tgtgaataaa tataccataa aaagtttcca atttaacttt agatgatatt 2700 tcgtttaaaa attgttaatt tatattttta ctttttaatt ctttctgtta atgtttcatt 2760 gtatgcaaat acatagaagt agcttggtta tcttttgtta ttgttatttt tgaacttaca 2820 aggtgttgtt ttttttttac tttcttgaca aaataacaat tatttttgtt ttataaacgt 2880 ttttgtaata aaaatatcga cttctttata aaatgataaa atattattta gtaaaaaatg 2940 gcatggttta agccttacat tcttcttatc cttagttttt tatttaatca ggttatgcat 3000 ttagagaaat taatcaaatc acaagctcct actatattgt taaaagaatt tttaaatatt 3060 taaaacttaa aaaaaaattg tttaatagtt gagagcatca aaaccaaatc agcctccaca 3120 tttttcaagc cgttgcgcga tgagaaggcc tgttatttaa gtacgccttg 3170 // ID Gypsy-85_CQ-I repbase; DNA; INV; 5081 BP. XX AC AAWU01006083; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-85_CQ_; KW Gypsy-85_CQ-LTR; Gypsy-85_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5081 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 549-549 (2011). XX DR GenBank; AAWU01006083; Positions 31634 26554. XX CC Positions [4103-4570] - Integrase core CC 'CCAAG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 796..2829 FT /product="Gypsy-85_CQ-I_1p" FT /translation="MFVRRHLECRIFIELAKAVVIQVNLKRSLIGVIVFFQ FT MEDARPVPQFRCEEIEKNRLYNEWKIWKRALECYFDAYDIKDQKKMRAKLL FT HLGGPQLQRVFENLPDRENFPVVSTKKKKWYTVAINALDGFFQPCRQDCLE FT RHKLRQLRQKEGERFADFLLRLRQQVADCGFEKYAPETKNVLVEIFLIDVI FT VEGCTSPELRRRILQQDRTLSEIETLGVALEGAELQVKDFNAKPVEPVTEQ FT KAFKVTVRPKPFQPAYHNSGNFARNKASPYPQRVIRCFSCGRPDHLSTDRN FT CPARNIRCRKCKQMGHYEPYCRQREKKFIRRQDTHQDTASSRKVHLVEETK FT VKAEAEQHNGSTKTYYTFFSGNETNVVLCTVGGVELDLLVDSGSDVNLIPD FT TVWESLKQKSVEVRQCVKGCTKVLKAYASNSPLAILGSFVAEVKVGQRAVQ FT AEFHVVAGGQRSLLGDRTSKELRILRIGVDVCSVASAMEPFTKIKDVQVQI FT LMNPEVKPVCQPIRRVPIPLEDAVNKKLDQLLAKDIIEVKHGPAPWVSPLV FT VVGKSNGEPRICLDLRRVNEAVLRERHPMPVVDDYLARLGKGKYWTKLDIK FT DAFLQVELAPESRDITVFITNKGLFRFKRLTFGLVNAPECFQKVMDQILVG FT CEGAFWYLDDVIIEGSRREELEERVAKVM" FT CDS 3170..5050 FT /product="Gypsy-85_CQ-I_2p" FT /translation="MLTRKDSKFVWSVVHQTAFDRIKDAIADVMKLGFFNK FT DQRTTVMADASPIGLGALLIQTDDAGNSRVVTCASKSLTDTERRYCQTEKE FT ALSLVWAVERFQMYLYGRRFDILTDCKALVYLFTERSRPCARIERWVLRLQ FT AFEYLVSHIAGNRNLADVLSRLSTIVPVPFDIREELFVRQISLSAATAVAL FT RWEEITKASLEDPEIQDVLECIDNGNIYQMPIAYRVVANELCRFGDVLLRT FT DRIVVPTALRERVVCIAHEGHLGIRTMKAHLRSAAWWPKMDMAVESYVKRC FT RDCLLVSSPDPPEPMVRKELPSGPWEDIAIDFLGPLPNKETLLVVVDYYSR FT YVEVCEMKSTTAKETIAQLSKIFCRFGVPNTMRADNGPQLNASCEEFTNFC FT GEVGVRLVNTIPYWPQHNGEIERQNRSFLKRMKIAHEAGRDWRMELNKYIL FT SYHATPHPTTGRSPAELMFGRKIRSKLPQLPREASFDEEVRDHDKLQKEKG FT RVYADTKRKARTSEIEVGDRVLAKRMRKDNKLSSDFGPEEFEVIRKSGADV FT TVSSAQDGVQYRRNVRHLKRILKGSEDQTFHPASGDEGQDESGSTEYPINH FT EEQNTSFEQAASSKRARREPTKFRDYVAH" XX SQ Sequence 5081 BP; 1334 A; 1054 C; 1412 G; 1281 T; 0 other; attctggcga cgagggtgaa ctaaaggtga gctaatttgt tcgattttaa gaatgttaga 60 acggtttttc tggtcgcttg aggtttgcca agatggctgc ctcgggctga agatgccgat 120 ttcttcacat aaaggtgtgt tggtgtgttt tcaaaaagaa ggggggagag agagacgtaa 180 atgtattcgt tttctctccc tggtgtagac agcagactct gagtgacgga atgtcattcc 240 ttggagtgcg tttagtgatg caatcaattc aaagctctgt ttttatttta ttgttgacgt 300 atgactacaa cgttttgacg taggataaga atttggtgtt tattttaaat agtttgattc 360 ctgatgcttt tcagggagtg gacaaaacga ttgttcacgg gtagaatgtc cctgtaacag 420 tcgttacagg tggtggacat acgttgttca cgggtagaat gtccctgtaa cagtcgttac 480 aggtggtgga catacgttgt tcacgggtag aatgtccctg taacagtcgt tataggtggt 540 ggacatacgt tgttcacggg tagaatgtcc ctgtaacatt agttacaggt ggtggactta 600 gttgttcacg ggtataatgt ccctgtaaca atcgttacgg gtggtgaatt gaagttcttc 660 acgggtagaa tgtccctgta gctgtcatta caggtggtga acatattgtt gttcacgggt 720 agatgtcctt gtaatatttt acaagtggtg acaacaattt gttgttcacg ggtgtttctg 780 tgacgggtgt tacagatgtt cgttcgtaga catttagaat gtcgaatttt tattgaatta 840 gcaaaagctg ttgtaataca agtgaattta aaacgatcat taatcggagt tattgtgttt 900 ttccagatgg aggacgctcg gccggtacca cagtttcgct gtgaggagat cgagaagaac 960 cgtttgtaca acgagtggaa aatttggaaa agagctctcg agtgttattt tgatgcgtac 1020 gacattaagg atcaaaagaa gatgcgggct aagcttcttc atcttggagg accacaacta 1080 cagcgcgtat tcgaaaacct acccgatcgt gagaattttc cggtggtatc caccaaaaag 1140 aagaaatggt acaccgtcgc aataaacgct ctcgacggtt tctttcaacc gtgtcgtcaa 1200 gactgtttgg agcggcacaa gctaagacaa ctgagacaaa aagagggcga aaggtttgca 1260 gattttctgc tgcggttgcg ccagcaggtg gctgactgcg ggttcgagaa gtacgctccg 1320 gaaacaaaga atgttctcgt ggagatcttc ttgattgatg tcatcgtcga gggttgcaca 1380 tcgcctgagt tgcgtcgacg gattctccag caagatcgta cgctgtccga aattgaaacg 1440 ctcggagttg cgttggaagg tgctgagctg caggttaagg actttaatgc taagcctgtg 1500 gaacccgtca cggagcagaa ggcgttcaaa gtcacagtta ggccaaagcc ctttcaaccg 1560 gcctaccaca acagcgggaa ctttgcacgc aacaaagcat ctccataccc tcaacgggtt 1620 atccggtgtt tcagttgcgg aagacccgac cacctgtcca ctgaccgaaa ctgcccggcg 1680 aggaacattc ggtgccgcaa atgtaaacag atgggacatt acgaacccta ctgtcgtcag 1740 cgagagaaga agttcatcag gcgacaggat acccaccagg atactgcatc atctcggaag 1800 gtgcacctgg tcgaagaaac gaaggtcaaa gcggaagcgg aacaacacaa cggctcaacg 1860 aagacttatt acacgttctt ctccgggaac gaaacgaacg ttgtgctttg caccgttggc 1920 ggtgttgaac tcgacctgct cgttgactcc gggtcggatg ttaacctgat tccagatacc 1980 gtatgggagt ccttgaagca aaagtcggtg gaagttcgtc agtgtgtgaa aggatgcacg 2040 aaggtgttga aagcttacgc gagcaactcc ccgttggcaa ttctcggttc atttgttgct 2100 gaggtgaagg ttggacaacg agcagtacaa gcggagttcc atgttgttgc tggaggtcag 2160 cgatctcttc tgggtgatcg cacatcgaag gagctgagga ttttgagaat cggagtcgat 2220 gtttgcagcg tggccagcgc catggaaccg ttcaccaaga tcaaggacgt gcaagttcag 2280 atcttgatga acccggaggt gaaaccagtt tgtcaaccca tccgtcgagt cccaatccca 2340 cttgaagacg cagttaacaa aaagttggac cagcttctag cgaaagatat aatcgaagtc 2400 aaacacggtc ctgcgccttg ggtttctccg ttggtagtgg tgggaaaaag taacggagaa 2460 ccgagaattt gcctcgattt gcgtcgcgtc aatgaagcgg tgctccgcga gagacatccg 2520 atgccggtcg tagatgacta cttggcacgg ttgggaaaag ggaagtactg gacgaaactg 2580 gacattaagg atgcgttcct gcaggtcgag ctggccccgg agtcacgcga catcaccgtg 2640 ttcattacga acaaggggtt atttcgtttc aaacgtctca catttggcct agtgaatgcc 2700 ccggagtgtt ttcaaaaggt catggatcaa attcttgtcg gctgcgaagg ggcgttctgg 2760 tacctggacg atgtcataat cgaaggcagt cgtcgtgaag aactcgaaga acgtgttgca 2820 aaggtaatgt gactttgtgt gctagtcgcg tttcttgaaa tgactgtatc aacctaacag 2880 gacaaataaa cgatgaattt ctattatttt gtttcattag gtccttaggc gattcaagga 2940 acgaaacgtt gagttgaact gggacaaatg cgagtttggg ttgacagaaa tcgagttctt 3000 gggacatcgt ataactgctg acggtatcgt gcctaccaac gacaaggtga aggccataaa 3060 atcattccgg cggccggaaa acgaagccga agttcggagt tttttgggcc tggccaacta 3120 ccttaacaag tttatacccg atctggccac attggacgaa ccgctgcgca tgctaacacg 3180 gaaggacagc aaatttgtgt ggtcggtggt gcaccaaacg gcattcgacc gcatcaagga 3240 tgcaattgcg gacgtaatga aactgggatt cttcaacaaa gaccaaagaa cgacggtgat 3300 ggctgacgca agtccaatag ggctgggcgc tctgttgatt caaactgacg atgctggcaa 3360 cagcagagta gttacctgtg cttcgaagtc cttgaccgat acggaacgga gatactgcca 3420 gaccgagaag gaggcgttgt ctcttgtttg ggctgtcgaa aggttccaga tgtacttgta 3480 tggaagacgt ttcgatattc tgacggattg caaggcgtta gtgtatctgt ttacggaacg 3540 atcgcgaccg tgtgctagaa tagagcgctg ggtactccgt cttcaagcct ttgaatatct 3600 ggtcagccat attgccggaa acaggaactt agcagacgtg ttgtcacggt tgagtacaat 3660 agtgccggtt ccgtttgaca tcagggagga gttgttcgtt cggcagattt cgttgtcagc 3720 cgccactgct gttgctttac gctgggaaga aatcacgaaa gcatcgctgg aggacccgga 3780 gattcaggat gttctggagt gcattgacaa tggcaacatc taccagatgc caatagcata 3840 ccgtgttgtc gcaaatgagt tatgccgatt tggagacgta ctgctgcgta cggacaggat 3900 tgttgttccg actgcgcttc gggaaagagt tgtctgcatc gctcatgaag gacatttggg 3960 cattcgaact atgaaagctc atctccggag tgcggcgtgg tggcccaaga tggatatggc 4020 tgttgagtca tacgtgaaga gatgtcgaga ttgtcttctc gtttcctctc ctgatccacc 4080 cgaacctatg gtcaggaagg agttgccaag cggaccttgg gaggacatcg ctatcgactt 4140 cctcggaccc ctgcccaaca aggagacatt gttggtggtc gttgactatt acagtcgata 4200 cgttgaagtc tgtgaaatga agtctaccac cgcgaaggaa accatcgcac agttgagtaa 4260 gattttctgt cggtttggtg tgcccaatac catgcgagcc gataatggtc ctcaattgaa 4320 cgcatcttgt gaggagttca ccaatttttg tggtgaggtc ggggtaaggt tggtcaacac 4380 tataccatac tggccacagc ataatggtga gatagagcga cagaaccggt cattcctaaa 4440 acgaatgaaa atcgctcatg aggcgggtcg cgattggaga atggagctta acaaatatat 4500 tctctcgtac cacgccacac ctcatccaac cactgggcgt tctccagccg agctcatgtt 4560 tggtcgtaag atccgttcta agttgccaca gctgccacgt gaagcgagct ttgatgagga 4620 ggtacgcgat catgataagc tgcagaagga gaaaggtaga gtttatgcag atacgaagcg 4680 taaggcgcgc acgagtgaaa ttgaggttgg ggaccgagtt ttggcaaaac ggatgcgtaa 4740 ggacaacaaa ctatcttcag attttgggcc ggaagaattt gaagttatcc ggaagagtgg 4800 cgcagatgtg acggtgagtt cagcgcagga tggtgtgcaa taccgcagga atgtccgaca 4860 tctcaagaga atccttaaag gaagcgagga tcaaacattc catccagcga gcggtgacga 4920 aggtcaagat gaatccggat cgacagagta cccaatcaac catgaggagc agaacacgtc 4980 gttcgagcag gctgccagtt cgaaacgggc aagaagagaa ccgacgaagt tccgggacta 5040 tgtggctcat tgagcttttc aaattttaaa caaggggaga t 5081 // ID MERLIN4_SM repbase; DNA; INV; 1002 BP. XX AC . XX DT 10-FEB-2008 (Rel. 13.02, Created) DT 04-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; MERLIN4_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1002 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(2), 156-156 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 90..1001 FT /product="MERLIN4_SM_1p" FT /translation="MKVDEFFANISTDEKAFNLAKSLGLITSNVRFCSRCG FT GKMNLEKGKTRHGIDVRLRCIKRYCRKTASVFENSIFEYGHLKMRTVFMLF FT YCYSEKKTRYETSSICEIDMSTVTTYFSYFREAILKYSPFVSVARGEAPDE FT YEVDETHLYSFKNHQGRVLRGQKYWVIGIINRRTKAVNLLVTRRRTKVICH FT DFIVRHAPVGSTIYTDKWGGYHGLEAKGFIHKTVNHKIEFVNRNNRSIHTN FT RIERLWRSLKEYIRKSLRLPALIKAVQEFQIIWNLKIKTANERLDLLIDSI FT KKQSINFNPLNY" XX SQ Sequence 1002 BP; 362 A; 145 C; 188 G; 307 T; 0 other; ggaaattggg gtgattaaaa tttatagatt aaaaatgata gattaaattg atcaaaaatt 60 tttataaaaa ttctaatatt ttagatagca tgaaagttga tgaatttttt gcaaatatct 120 caacagacga aaaagcattt aatcttgcta aaagtcttgg tttaatcacc agtaatgttc 180 ggttttgttc tcgatgcgga ggaaaaatga accttgaaaa agggaaaact agacatggaa 240 tagatgttag actcagatgt atcaaaagat attgtcgaaa gacagcatcc gtttttgaaa 300 atagcatatt tgaatacggt catctcaaaa tgagaacagt ttttatgcta ttttactgtt 360 attctgagaa aaagactaga tacgaaacta gtagtatctg tgaaatagat atgtcgactg 420 tgacgacata tttttcatat tttcgagaag caattctcaa atattcgcct tttgtttctg 480 ttgctcgtgg tgaggcgcct gatgaatacg aagtagatga aacacatctt tatagcttta 540 agaatcatca gggaagagtc cttagaggcc aaaaatactg ggtgattggt attattaaca 600 ggcgaaccaa agctgttaat ctgctggtca ctagaaggcg tactaaagtc atttgtcatg 660 actttatagt aagacatgcc ccagttggct ctactattta cactgataaa tggggaggat 720 atcacggact agaagctaaa ggattcattc ataaaacagt gaatcataaa attgagttcg 780 tcaacagaaa taatcggtct attcacacaa atcgaattga acgattgtgg agaagtctaa 840 aggagtacat ccgaaaaagt ttacgactgc cagctttaat taaagctgtt caagaattcc 900 aaatcatatg gaatttaaag ataaaaactg ctaatgagcg tttagacttg ttgattgatt 960 caataaaaaa acaatctatc aattttaatc ccctcaatta cc 1002 // ID BEL-43_AA-LTR repbase; DNA; INV; 218 BP. XX AC AAGE02017492; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-43_AA_; KW BEL-43_AA-I; BEL-43_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017492; Positions 28083 28300. XX SQ Sequence 218 BP; 68 A; 49 C; 33 G; 68 T; 0 other; tgttctcgct ataataatcg ctaaagcaaa tttgattcag tgttatccta ccaaactaca 60 taaccttgat tataattcaa cttcccatga aatttgccaa cactgtatga aacagaaaat 120 aaagcagtca ttgttaagaa gctcacgcga gcggtcgttt tttatttcgc taacgctgag 180 ccctattcga ttatacagtc cactctcttg gagctaca 218 // ID CR1-9_BF repbase; DNA; INV; 3670 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-9_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-9_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3670 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3670 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1580-1580 (2009). XX DR [2] (Consensus) XX SQ Sequence 3670 BP; 1236 A; 935 C; 760 G; 739 T; 0 other; agataggtgg cgcttttaca gcggccgcat tactaagtat gtccctgagt aagctctaca 60 aaagtcgatc taaagtgcta gttcggtgcc atcggacaag tatcggttca gagtacgaca 120 gatgaattgt cattacatcc cctgaacacc aagatctgac ggaaagagag cgttatcgtc 180 aaaaatcttt caccaaaagc cgacaaccag ccaaggaggg agggcgacgc agaaacaaca 240 aaggaccctc aacacgtacc atcctttaga cggtccaaac aagccccgac tacaaaccca 300 gcaaacgtca acgtacttca gggcaggttc gaacacttat acttactatt ttgtaatcat 360 cgtgggccaa tgtagcccct gaaacaggag atcgaacaag tttgtccttt gtttacctga 420 cgccggggag gttactccag gtcagacccg acagaaatgt tcttgaaccg catactctcg 480 tacacttcaa gcactattcg tgctcttcgg acaaacatta aacccactcc tctggttaaa 540 cgagctatct tcgattgtgg cataaacaaa taccaccgcc gtcctagagg atgcagaggg 600 ggaaagaacc ttacccgcca taaccccaca cttactgata catctatcgc accacctgct 660 acgtacgaag tagagaacag ggatcgatgg atgaacctta cgaataatga caaatgccct 720 cccgatagac acaaagataa gaaactggaa ctgtggtaca cgaacatccg gggcctcaga 780 gcaaaaagag agcatctaag tgtgcgacta gcagaatctt acaccaaacc agatatcatt 840 atggtcaccg aatctaagct cgactcttca atattagaca atagtcctga catcaacatc 900 gatgggtact taatagagcg aaaggatcga aacactggaa aggactgggg tgggtgtctg 960 atctactata aatctggcat tggactgcac agaaggcatg atttggaacc tcctgaacat 1020 gaattgatga tatgcaccat caagctacat acaggtacac ttctgctatc actcctgtac 1080 agaccaccag gacaaagggt catagactgt gcgcccattg actggtacac tgaaaaccta 1140 gaccatctcc gcacaaagac caaagctatg gggacaatcc tagccggtga ctataatgca 1200 catcataaag aatggttact aagcaaaaaa actgacccac ctggaaggca cacactcacc 1260 ctttgtacca ctcatggcct aactcaactt gtcaagggcg ccacccacca aaaaggcaat 1320 agactggacc tcatcatgat tgatacaccc aacatctgct cggacgtaca tatagaacct 1380 gaaataggaa agtcagacca cttcctgcta actacgtcga taccatgttc tccagtccta 1440 gagaaaacaa cacctaggaa aatctggatc tacaagaaag ccgactggca tgctctccga 1500 gaggaacttg cccagacaaa ctgggatgac ttgcttgacc cagaaaaccc agaacgctca 1560 tgtacaaacg taacaaaggc tatccaagag gccatgaaac accacattcc gagaagagaa 1620 ctcaagacct ttgatggaaa acctgagtgg tacaatggtg cgtgtgagag agccctacag 1680 aaaaaactga aatcttggca ccagtacaaa gcaaacgaaa cccaagagac acgaaccaga 1740 tacaacaagg ccagaaatga ctacacctat gtcaccagaa aagcaatgaa agcccacaaa 1800 aagagggtca aggcaaaaat gaccacagga cttaaaaatg gaagcaaaag ctggtggtgg 1860 acggcaaaga ggctgatggg gcaaggtgga caatgtgata ttcctttact gacttctggc 1920 aaccagactt acatacaccc tgaggagaaa gctgagtgct ttgctagcat attcgccgac 1980 aaatccacca tatcacaaga agaaaatgag aaagaagtcc ctctggtaac gaccagaact 2040 acatccagcc tagaggaagt aaccttctgc cctgatcaag tgcacgaaga gttgtctacg 2100 ctcgacacca acaaggctac tggtccagac tccatacccg ccagagtact aaaacaagct 2160 gcacctgagc tagccgggcc tctagccaga ctcttccagc ttctactaga caagcaccac 2220 atgccgaaac agtggaaaat tgctaatgtt attccagtcc acaagaagaa caacaaacaa 2280 gatccaaaca actatagacc tatctccctg ctgagcataa tcagcaaggt aatggaagct 2340 ctcataaaca aagctctctg gtcacacatc aataagaaca gactcataag caacaaccaa 2400 tttggattta gggcaggtca ctcgacaaca gatgcgctga cctttgtaag ccaacttcta 2460 catgacacca aggacagacg gcaagagagc agattgatct gcctggatat aagcagggct 2520 tttgaccgag tatggcatag ggggctcatt gctaaactga acgccatcgg tgtcaaagga 2580 agcttactaa aatggattga ggattacctg tcaaatagag aactgaaagt agtaatcagt 2640 gggaaaactt ccacctccaa agtcatcaac gctggagttc cccaaggctc catcctaggc 2700 cctctcctgt tcctcatctt catagacgac atcaccgaaa agataagaaa cactgccatg 2760 ctatatgctg acgacacatc tcttatgaac atcataagga aaagacaaga aagaacccta 2820 gccgcccaat cactcaacac agacctcatg gggatccaga attgggcaaa agactggaat 2880 gttctttttg gggccacaaa gtgcaaaagc atgatagtaa gtaacttgaa ggatgtcgaa 2940 ggaaaccacc ctgatctcac ctttatggat acaatcctca ctgaggtgga agaagtagac 3000 ctgctaggac ttacaataag gagaaatctg acctggtcac accacataga caaaatgtca 3060 actgatgctg gaaaacgact tggcctgcta agaagagtct ccccatacct aagtcctgaa 3120 cagagagcta ctatttacaa gtgcatggtt aggtcctcga tggagtacgc ctctactgtg 3180 tggatgggtg caagcgccac atcccttagc tcgctagacg ccatacaaag aagagcaaca 3240 aagataatcg acatgcctca agactcattg gacagtattc agatccagcc tctagaacag 3300 cgaagaaatg ttggggccct ctcactccta caccgaatgt atcaccaaga cgcaccaaca 3360 ctactgaaca accttctacc cgagccctac gtacaccgcc gagaaacacg tctgtcaaca 3420 tcccaacaca gtgcagccct agaacctgtc aagtcaacct cctcctgtca caaaagaact 3480 tttctcccag ccacagttaa gctatggaac tgtctaccac aagacattgt aaatatcaga 3540 gacttgaaga actttaaacg aatagttaat gcccacctca ccgatattcg ccagcgtgat 3600 gcattgtaaa tgtcagcaca gctgctaaag atgaggtgtg gtcacagcat agataaaaaa 3660 aaaaaaaaaa 3670 // ID TTAA5_AP repbase; DNA; INV; 442 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA5_AP. XX NM TTAA5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-442 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1785-1785 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 442 BP; 141 A; 73 C; 83 G; 144 T; 1 other; gaggacgtca cacccgcatg tgttgtctcc gtcttacaca cgtacgacat agacaaaacg 60 cgtttacgca gtctgagtag aactcgctca attttggttc tagagtaaaa atacctatta 120 taaaattgaa ttgtgacaat atttctgagg gcaagtcgtt gcgttttttt taaaaattta 180 aattttgaaa agttaaaggt aggtatttta aaaatgttga aattttttct atttctaaac 240 gatattatra acgttatatt gggtatataa tggaaaaaac gcaacgactt gccctcagaa 300 atattgtcac aattcaattt tataataggt atttttactc taaaaccaaa attgagcgag 360 ttctactcag actgcgtaaa cgcgttttgt ctatgtcgta cgtgtgtaag acggagacaa 420 cacatgcggg tgtgacgtcc tc 442 // ID L2B-3_AAe repbase; DNA; INV; 4581 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L2B clade non-LTR retrotransposon family from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2_Ele7; KW L2B-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4581 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4581 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as L2_Ele7. CC [2] Consensus update and re-classification. This consensus is CC generated from 20 sequences with >95% identity, and ~97% CC identical to the original sequence in [1]. It is closely related CC to CR1-1_AG and L2B-1_CP, and belongs to the L2B clade. Thus CC renamed. XX FH Key Location/Qualifiers FT CDS 297..1682 FT /product="L2B-3_AAe_1p" FT /translation="MSVQCDVCCKAISAAKERVFCFGGCGQVLHAKCADLT FT SAGETALRENLSIKYMCHDCRKKQVSLNDMVGKCDSILNAINEIKNRLEKI FT EAKLERNGCDEAVKQCEQNVKMVVEESAKLHGEQLKILESQIAISANSPAI FT GKGQAEGDEYPSGGSFVEVVRRKKNKRDTGSVLRSGRVRNRSVATPDKNEN FT TSRSAKAQPISNANVKEVELSVSNKKFGCKVRVKPIATQSNHQTKKDVRNI FT INPTQMGIKSVRNGVNGSIIVECGNEDEAEGFVKMINEKLSNGYSVDIEQP FT KRPRIKILGAXGNYDSNELIDIFRDQNDIEDVEFLKVLKCIPTKNNPENKC FT SLICEVDANTFERVVRKGKLNIDFERCRVLESIDIFRCFKCCGYGHKSGEC FT KNNLHCAKCADRHDVKDCSSDKEFCVNCINSNRERKTQFDVNHSAWSVDCP FT IYLKKVTISKSFINYNA" FT CDS 1686..4487 FT /product="L2B-3_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSRRATDVVLLNIAGITSHFNELEIIVSRQKPKIVML FT TETHLTSDIGISEYAIKNYHMICCHSASRHTGGVVMYIHEQVKFHVVDNST FT CGSNWFIAIKVVKGLKVGVYGVLYHSPSANQQEFLLHLEQIWLERVIDDKV FT INLIGGDFNINWKNVSDKRKLQNLMQCFNLNQKVNDITRRTIRSQTIIDLV FT FCNDDELNVTINQGNKISDHETIQIQFNECSEPVEDYITFKCWKKYSKAAL FT ISLLRRSMSDNSERELDEKADILSTLLKENINKLVVIKSIKCNSRKKWYTI FT ELKSLQLARDDAYRKASTSWDEADWRQYKVLRNEYAYSIRKAKAEFTQRKI FT EQNRGNSKQLWKTLKSLWKSREKPPIRLTFDGEVVENDHEICERFNSYFVD FT SIQQINSSIEDVPNCFEDESDHFAETWNVFHRISYDVLCRSIHKIGSSSGI FT DNVNLQVLKDSLEVTGEYLLGIINESLEQGNFPRSWKQSTVVPIPKVSGTT FT KAEEHRPINMLPIYEKVLEIIVKDQLLEYLNEQKILISEQSGFRQNHSCES FT ALNLLLYKWKRMIEEKKTIIVLFLDLKRAFETISRPEMLKVLKKYGIGGQV FT LKWFESYLSSRTQICQYRNSTSVPKSVPFGVPQGSVLGPILFILYINDMKK FT AIKHCDINLFADDTVVFIAEKNEKTAIRKMRDDVELLSKWLKIKKLKLNVE FT KTKCMIISNKKQLNYTELKMDIEGIEVERVNVFKYLGVYIDDKLKFKEHID FT NVIKKVARKYGMLVRLGSQLTFWSKIFLYKTLVAPHIDYCSSVLFLASDTH FT LSRLQRLQNKFMRYILNCSKYTPILGMLEALQWLSVKERIVFNVLTIIFKL FT TNNQMPEYLTNIIERGRNIHTHQTRRNYDLRVVPFTMTSTQKSIYYRGIRI FT FNELPNEIKHAQSIAEFKRKCSNWIKNRYR" XX SQ Sequence 4581 BP; 1682 A; 616 C; 972 G; 1308 T; 3 other; attcgtttgt caaagcatgt gtgaacaagg tgtgaatatc gagctctaca ataaaaacat 60 ttttttactg ttttaatcag ttctattcgc gacaattgtg atatcattag aagcgtaata 120 agattttcta tccaacggtg tattgcgtgt tcttaggaac gttgctatat taatgttata 180 gtgcaattaa aatttgcaac atagtgctat ttccaaacgg ggatcagtga aagcaataaa 240 tttacaaagc ggcctacggc cgtgcgaaaa caatacacgc ctaagcggtg acaaaaatgt 300 cggtgcagtg cgacgtttgc tgtaaggcaa ttagtgcggc gaaagaaaga gtgttttgct 360 ttggtggttg cggtcaagtg ctacatgcaa aatgcgccga tttgactagt gcgggagaaa 420 ctgctttgcg tgaaaactta tcaattaagt atatgtgcca cgactgcagg aaaaagcaag 480 taagccttaa tgatatggtg ggcaagtgtg acagtatact caatgcgata aatgagataa 540 aaaatcgttt agaaaagatt gaagcaaagc tggaaagaaa tggttgtgat gaagctgtga 600 agcaatgtga gcaaaatgtg aaaatggtag tggaagagtc tgcaaaattg catggagagc 660 aattaaaaat cttggaatca caaattgcga tttcagccaa cagtcctgct ataggcaaag 720 gccaagcaga gggggatgaa tacccatctg gtggcagctt tgttgaggta gtsagaagaa 780 agaagaataa gcgtgatact ggctctgttt tgcgttctgg tcgtgtccga aatagaagtg 840 tcgcaacacc agataaaaat gagaacacaa gtagaagtgc gaaggcacaa cctataagca 900 atgcgaatgt gaaagaggtg gaactgagcg tttccaataa gaaattcgga tgtaaagtgc 960 gggttaagcc wattgcgaca cagtcaaatc atcaaacaaa aaaagatgtg aggaacataa 1020 tcaatcctac ccaaatggga attaaaagtg tgcgaaatgg tgtgaatggt tcaataattg 1080 ttgaatgtgg aaatgaagat gaagccgaag ggtttgtaaa aatgatcaat gaaaagctaa 1140 gtaatggtta ttcagttgat attgagcagc ctaagagacc aaggatcaaa attcttggtg 1200 cggawggtaa ctacgattca aatgaattga tcgacatttt tagagatcag aatgatattg 1260 aagatgttga gtttttgaaa gtactaaaat gcattcccac gaagaataac cccgagaata 1320 aatgttctct tatctgcgaa gttgatgcaa atacgtttga gcgcgtggtg cgtaaaggta 1380 agctaaatat tgatttcgag agatgtcgag ttctagaaag tattgatata ttcagatgct 1440 tcaaatgttg cggctatggg cataaatctg gtgaatgcaa aaataattta cattgtgcta 1500 aatgtgcaga cagacatgat gtaaaggact gttcatctga taaagaattt tgtgttaatt 1560 gcatcaattc gaatagagaa agaaaaacgc agtttgacgt taatcattcg gcatggagtg 1620 tggattgtcc catttattta aaaaaagtaa cgatctcaaa gagttttata aattataatg 1680 catagcaatc aagaagagcg acggatgttg ttcttctgaa cattgctggc attacgtcgc 1740 atttcaatga gttagaaatt atagtcagca ggcaaaaacc caaaattgtt atgctaacag 1800 aaactcattt aacttcagac attggaataa gtgagtatgc tataaagaac tatcatatga 1860 tatgttgcca ctctgcctcc aggcatacag gtggggtggt tatgtatatc catgagcaag 1920 taaaatttca tgtagttgac aattcaacat gtggatcaaa ttggttcatt gctataaaag 1980 tagtcaaagg gctaaaagtt ggagtttatg gtgtattgta tcattcacca agtgcaaatc 2040 agcaagaatt tttactgcat ttggagcaaa tttggcttga aagagtaatt gatgacaaag 2100 taataaacct aatcggtgga gactttaata tcaactggaa gaacgtcagc gataaaagaa 2160 aacttcaaaa tctaatgcag tgttttaatc taaatcaaaa agtgaatgat atcacgcgtc 2220 gtactattag atcacagact atcatcgatt tagtattttg taacgatgat gaattgaacg 2280 tgacgattaa tcaaggaaat aaaatttccg atcatgaaac tattcagata caatttaatg 2340 aatgttcaga acctgtagaa gattatataa ctttcaaatg ttggaaaaaa tattcaaaag 2400 ccgcacttat atctctgttg agaagaagca tgtctgacaa tagtgaaaga gaattagatg 2460 aaaaagcaga tattttgagc actttgttga aagagaacat aaataagctt gttgttatca 2520 aaagcataaa atgcaacagt agaaaaaaat ggtatacaat agagttgaaa agtttgcaac 2580 ttgcaagaga cgatgcgtat agaaaagcta gtactagttg ggatgaagct gattggcgtc 2640 agtacaaagt tttaagaaat gagtatgcat actctatcag aaaagctaaa gcagaattca 2700 ctcaaaggaa aattgaacaa aatagaggaa atagcaagca attatggaaa accctaaagt 2760 cgttatggaa gagtagagaa aaaccaccaa tcaggttgac atttgatgga gaggttgtag 2820 aaaacgatca tgaaatatgt gaaaggttta atagttattt tgtggatagc attcagcaaa 2880 ttaacagcag tattgaagat gttccgaact gctttgaaga tgagagcgat cattttgctg 2940 aaacatggaa tgtgttccat cgtatttcat atgatgtgtt gtgtagatct attcacaaaa 3000 ttggtagttc atcaggcatc gacaatgtaa atttacaagt gctaaaagat tcattagaag 3060 ttaccggaga atatttgctt ggaataatta atgagtctct agaacaggga aatttcccta 3120 ggagttggaa acaatcaacg gtggtaccaa taccaaaagt gtctggaaca acgaaagctg 3180 aagaacatag acctataaat atgctaccaa tttacgagaa agtgttagaa attatagtta 3240 aagatcaatt acttgagtat ttaaatgagc aaaagatatt aataagtgaa caatctggat 3300 ttagacagaa ccattcgtgt gaatctgctt tgaacttact tttgtataaa tggaaacgaa 3360 tgatagagga aaagaaaacg attattgttt tatttttgga cctcaaacgg gcattcgaaa 3420 ctatatcacg cccagaaatg ttaaaagttt tgaaaaaata tggcatagga ggacaggttc 3480 tcaaatggtt cgaatcatat ctatccagtc gcacacaaat ttgtcagtat agaaactcta 3540 cttcagttcc aaaatcagtg ccgtttggtg ttcctcaggg aagcgtttta ggaccgattt 3600 tatttatttt atacatcaac gatatgaaga aagctataaa gcattgtgac ataaatttat 3660 tcgcagacga taccgttgta ttcattgcag agaagaacga gaaaacggca attagaaaaa 3720 tgagggatga cgtggaattg ttgagcaagt ggttgaaaat taaaaagcta aaattgaatg 3780 tggaaaaaac taaatgcatg attataagta acaaaaaaca attaaattat actgagctaa 3840 aaatggacat tgagggaatt gaagttgaga gagtgaatgt ttttaaatac ctaggggttt 3900 atattgatga taaattaaag ttcaaagagc acatagataa tgtaattaaa aaagtagcaa 3960 gaaaatatgg catgctagtt agattaggta gtcaattgac gttttggagt aagatattct 4020 tgtataaaac attggtggcg ccgcatatcg attattgctc ttcagttttg tttttagcca 4080 gtgatacgca tttgagtagg ttgcagaggt tacaaaacaa attcatgcgg tatatattga 4140 attgcagcaa atacacgccc atattaggaa tgttggaagc tttgcagtgg ctttcagtga 4200 aagaacgcat tgttttcaat gtattgacaa ttattttcaa attaactaat aaccaaatgc 4260 cagaatactt gacaaatatt attgaacgag gacgaaatat tcatacgcat caaactagac 4320 gaaattatga tctacgtgtt gtgccgttta ctatgacaag tacacaaaaa tcaatatact 4380 atagaggaat aagaattttt aatgaattgc caaatgaaat taaacatgca caaagtatcg 4440 ctgaatttaa aaggaaatgt tcaaattgga ttaaaaatag atatagataa taagatgatg 4500 ttttgtactc cgtattttta tttgtaaatt gtaattatca aaagataaat aaatgaacta 4560 ttattattat tattattatt a 4581 // ID BEL-14_CQ-LTR repbase; DNA; INV; 562 BP. XX AC AAWU01030126; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_CQ_; KW BEL-14_CQ-I; BEL-14_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-562 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 182-182 (2011). XX DR Genome; AAWU01030126; Positions 18245 18806. XX SQ Sequence 562 BP; 139 A; 147 C; 150 G; 126 T; 0 other; tgtctgagtg cgacgaagca cgcagccact gaccaggggt cagtcgattg gagccattag 60 ccgaggttac ttcccatctc ttctaatcac cttggcaacg ggagcgacca ttgggagcga 120 ctgggagcga ctgagagcga tctgcgaacg atcgagaggg cgatcgagac tacgagcgag 180 caactgatcg gagctgtagt tcagttccgc gccgaccata cccgcagtcg ggcctcaaag 240 ggaggacccc cgttcaattt ctatttgtac ctagttataa gaagaaatat agatgttaat 300 gtagtctagt gtaataaaaa agtgtttttt gtacaaagtt gtgcgtttaa tttatttaac 360 cgggttgaaa ctgagcccat cggtgctcct gctggccgcg catcggccga acctgtttcc 420 tgcccaatcc ggccacacac ggcggaatta tcgaggtttt tcacccggcc gaggaatcac 480 caacgcacta aacgtccgcc atcgccgcca acggcgatcc actctgcgct atcgaagaag 540 gtgagtttgg aggaccacta ca 562 // ID Gypsy-3_CQ-LTR repbase; DNA; INV; 206 BP. XX AC AAWU01007771; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_CQ_; KW Gypsy-3_CQ-I; Gypsy-3_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-206 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 386-386 (2011). XX DR GenBank; AAWU01007771; Positions 8117 8322. XX SQ Sequence 206 BP; 53 A; 41 C; 44 G; 68 T; 0 other; tgttgcgtta cgtagtttgt gaacgactag agttaagcct tttgttgacg ttccctccga 60 acgtcattca ttgaagcgcg cgttctcgct ctttctcgac gattcattca ttccgtttcc 120 gagtgcgcag caataaagac gttgttgaat aaacgctgtt ttgtgaagtt aagcagaagt 180 attttatttg acacagaaat aaaaca 206 // ID BEL-65_AA-I repbase; DNA; INV; 5328 BP. XX AC supercont1.275; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-65_AA_; KW BEL-65_AA-LTR; BEL-65_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5328 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.275; Positions 742532 747859. XX CC 'TGTAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1978..5295 FT /product="BEL-65_AA-I_1p" FT /translation="MLNNHAVVNLHVDLDLNRILQKFWEQEEISKPLPLTP FT SEQQVTEYFGSTLQRDETGRFIVRLPFDDSKPALGDSLAVATKRMKSIERR FT FHLDPEFKRQYTSFMHEYQNLGHMEEVPAEQIQKDSSECYYLPHHAVLKSD FT SLTTKLRVVFDASCASTSGVSLNDKLLAGPNKNADLFAVALRFRSHRIAFC FT ADVAKMYRQVLVHPADRDYQRIVFRDEPDQPLRHYRLCTVTYGTKTAPFLA FT IESMREAAKPYEAVEAIKFDFYVDDFLSGADDEEEASQLKEQVVEILSSAG FT FELRKWSSNNKKLLPTNDEDEAVPMKMLNDSETVKALGINWHPSEDVYGFK FT VNFSPLSINTKRQMISDSARLFDPFGWLAPVTVKIKILYQHLWLFDVFWDD FT PLPAAQIRIPRCIFPNRGRIQLRGFSDGSEQAYAAVVYARTTDEQGDVSVV FT LVAAKSRMAPIKQISLPRLELNGAHLLADLMHQITEALPHVQVEHWAWTDS FT SIVLQWLASHPRKWKTFVANRTAAILEYLPRSCWRHVSSKDNPADLASRGL FT SPAELVQNQLWFQGAPWLKEDESEWHQDPSLEEIDEDVMEVKRIKSLHIST FT TKVSTSFEIEKRLFVKFSSFAFMVRLLACVHRAIHNFKAVVRNNDIDDRRT FT GERTEDELNRAKIQLLQAAQHDEYGQKIELLRKGKSLQPKHILSALHPFLD FT ADGTMRVGGRLQHSAQPYDKKHPIILPPKHRATELLVRELHLRNLHAGSAL FT LAATLRQEYWVVGCQTVVRKVVQGCTRCVRLKGKTAGQLMGSLPPARVLAT FT RAFSHVGVDYAGPVRLKATCVRGVKITKGYLVVFVCLSTRAVHLEVASDLS FT SDTFIGALKRFISCRGYPVEIRSDNGTNFVGADRKLREFVEQILTHSKDAS FT RYLSNLGINWVFNPPSAPHMGGIWEAAVRSVKKHQVAELGESAITFEHLST FT LLCQIESCLNSRPLCPLSTDPDSAEALTPGHFLIGQPINLVPEPSVKQLPA FT NRLDQWQEVQQQTERIWNRWKDEYLASLQPRSKWRTAQPNINVDQLVLVKN FT DNASPAQWELARVEKIHPDSSGAVRVVTLRRGSTVYQRPIHKLCVLPFD" XX SQ Sequence 5328 BP; 1398 A; 1312 C; 1385 G; 1233 T; 0 other; tttggacctt cgatccgaat aatgtcggac ctgcagaagt tgctaatccg ccggaacgcc 60 ctgatggttc aggtgcgtgg agagctttca gtcgcagacg ctatcaagga gcggaagccg 120 tccctaaatg aggttaggga ccgactgaac cagctaaagg agcttgctgc caatttccgg 180 gagacacaag gatgcatcga ggagaagcag gagaatcccg aagctatcgc gtctgtccat 240 aacgttcggg aggagttttt caccatgttt taccgagcaa aggatatttt cgagggcttc 300 ctagacgtcg atgatgtttg atcatccgtc agccagcgaa cggcagtcga aaccaccgat 360 tggaaagagg ctatgcattt gctcattgag acgcagcgga tgctactgct ggatcaagaa 420 cggaccacga aaaccgttca aaatctgacg cagttgtcga atagtgcacc tctggatggt 480 agtcaggtgt cgaatccttc accccagttg gctgttcgat tgcctgctat caatattcag 540 ccattcagcg gagaacgtaa acattggatg acgtttaagg atatttacgt ttcgactatc 600 cacaaccgag ttgacatttc ggatgcgctg aaaatgcagt atcttttttc atatcttgaa 660 ggagatgcta agagattagt gagcaagttt acaatctctg gagcaaacta tcaaaatgca 720 aggagcactt tgattaccca ctttgacaag aagcgctaca cagtgttttc gttggtgcat 780 gaatttctgg ctcaatctcc ggttacacaa gctactccac agtcactggg cagattggta 840 acaacatccg acgagattat ccagcagctg gacgcattgg gcgatgagtt caccggtcgg 900 gacccatggc tgattcatct tattctggag aaattggaca aggagacaag agcaagctgg 960 tcacaagaag ttgtcgacac ggaggatccc acttttgaag atttgttggg atttctcaag 1020 aaacgttgtg aggtgctgga aacgtgttcg gcatttccga agaaaccagc gaatgaaccg 1080 aagaaggagg cgaccaaacc ggtgccggcg aagctgaaga cgctgcatac aacagtggag 1140 aaaaaagtgc gcgaagtgct ctagtgagca caatacgtat cagtgtgatg aatttaagaa 1200 catgagtgta aaagacagac gagaactagt gcaaaaagcc aagttatgtt ttaactgcct 1260 tcgaccatca cattcggtga aatcgtgttc gtcgaagtcg gtatgtcaca accccgattg 1320 taaacagcgc caccacacgc tgctgtgttc cgtggcgaag agggctgaac aagtggaagc 1380 gaaaaaagaa gaagacgtag ttccgtcgcc ggaagtgccc gcaaaggagg aaggtgtcgt 1440 cgttgcattg acagccggaa tgccgatgag tgtgagcaag tcgttgctac ctacggcggt 1500 agtgcaagtg cagcaagcgg acgggggttt cacgtcggcg agaattctga tcgattctgg 1560 atcgcaggca tcactggtga ctgaagcttg tgttcggaag ctgaagctgc cgcgacgaaa 1620 cgggaaactc gtggtcaacg gtcttggcca ccaagaagtg ggaacaacgc gtggtttggt 1680 tactctgcgt ttggcatcgc ggttcaacga caccgtagtc ttgaccaccg aggcctacat 1740 tctcggaaaa ttgaccacca ctatcccatc ccagcgaaca tgaagctctt ggacgaccta 1800 ggagaactag ccgacccgga attcaaccgg ccgggtgcta tcgatatcat cctaggtgca 1860 gacgtttttc ttgcacttct ggaaggagga caagtcaaaa acgaaagcgg ccaaacggtg 1920 gcccaatgaa cgattttcgg atggatcgtt gccggacaat atgacgcgtc ggaagtcatg 1980 ctcaacaacc acgcggtcgt aaatctgcat gtcgaccttg atctgaaccg catcttgcag 2040 aaattttggg agcaggagga gatttcgaag ccactaccgc ttactccttc tgagcagcag 2100 gtcaccgagt atttcggttc gaccctgcaa cgagacgaaa cgggtcgctt catcgtcaga 2160 cttccctttg atgattcgaa gccagcgctc ggcgattccc tcgccgtagc cacgaagcgg 2220 atgaagtcca tcgagcgacg attccatctg gatcccgagt tcaagcgcca gtacacttct 2280 ttcatgcacg aatatcaaaa ccttggtcat atggaagaag ttccggctga gcagatacag 2340 aaggacagca gcgagtgcta ttatctgcca caccatgcgg tactaaaatc ggatagtttg 2400 actacaaaac tacgtgtcgt ctttgacgcg tcctgtgcgt caacctcagg agtctcgctg 2460 aacgataaac tgctagctgg acctaacaag aacgctgacc tttttgctgt ggcattgcgg 2520 ttccgttccc atcgaatagc cttttgtgcg gatgtagcca aaatgtaccg ccaggtactg 2580 gtgcatccgg cggaccgcga ttatcaacgc attgtgttcc gagatgaacc agatcagcct 2640 ctaagacatt atcggctctg taccgtgaca tacggtacca agactgcccc gttcttggca 2700 atcgagtcca tgagagaagc agccaaaccc tacgaagcgg tggaagcaat caagttcgat 2760 ttttacgtcg atgattttct gtcgggtgcc gacgacgagg aggaagcaag ccagctaaag 2820 gaacaggtag tcgaaatcct ttcgtctgct ggtttcgaat tacgcaaatg gtcgtctaac 2880 aacaagaagc ttctaccgac taacgacgag gacgaagcag tgccaatgaa aatgttgaat 2940 gatagcgaaa cagtgaaggc cctcggaatt aattggcacc cttccgaaga cgtatatgga 3000 ttcaaggtca acttttctcc gctcagcatc aacacaaagc gccagatgat ttcggactca 3060 gcaagactat ttgacccttt cggttggttg gctccagtaa cggtcaagat taagattctc 3120 taccagcacc tttggctatt cgacgtcttt tgggatgatc ctttgccagc cgcacagatc 3180 aggattccac gttgtatttt ccctaatcgt ggtcgcattc aacttcgcgg attttccgat 3240 ggctccgagc aggcatacgc agcagtggta tatgcgagga caaccgatga acaaggcgat 3300 gtgtcggtag ttctagtcgc tgccaaatca cggatggcac caatcaagca aatttcactg 3360 ccccgcctgg aactcaacgg tgcacatcta ctcgcggatc tgatgcacca aatcacggaa 3420 gctctaccac acgttcaggt cgagcattgg gcgtggactg attctagtat tgttctccag 3480 tggctggcgt cccatccccg aaaatggaaa accttcgtgg ccaaccgcac tgctgctata 3540 ctagaatacc taccacgaag ctgctggcgc cacgtgtcta gcaaggacaa ccctgcagat 3600 ttggcatcgc gaggtttgtc acctgccgaa ttagttcaaa accagttatg gttccaagga 3660 gcaccgtggt tgaaggaaga cgagtccgaa tggcatcaag acccatccct ggaagagatc 3720 gacgaggatg taatggaagt caagcgtatc aagtcactgc acatctcaac gacgaaggtt 3780 tctacaagtt tcgaaatcga gaagcgactt ttcgtcaaat tctccagttt tgcatttatg 3840 gttcgcttgt tggcttgtgt acatcgagcc atccacaact tcaaggccgt tgtgcgcaat 3900 aatgacatcg acgatcggcg caccggtgag cgaactgaag atgaactaaa tcgagcgaag 3960 attcaactac tacaagccgc ccaacacgat gagtatggcc aaaaaataga acttctgcga 4020 aagggtaagt ctttgcaacc caaacatatt ctttctgctt tgcatccctt tctggacgct 4080 gatggaacta tgcgtgtggg tgggcgtctc caacactcgg cccagccata cgataagaaa 4140 caccccatca ttctacctcc aaaacatcga gctaccgagc ttttagtgcg tgagctacac 4200 ctacgtaatc tccatgctgg atcggctctc ctggcagcta ctcttcgtca ggagtattgg 4260 gtagtcggtt gccaaacggt cgttcgtaag gtggtacaag gctgtacacg atgcgtacgt 4320 ctcaagggga agacggctgg tcagctgatg ggaagcctac cgccagctag ggtgctggcc 4380 acgagagcgt tctcccatgt tggtgtggac tatgctggtc ccgtaaggct taaggcaacg 4440 tgcgtccgag gagttaaaat caccaagggg tatctcgtag tgttcgtgtg cctatcgacc 4500 cgtgcggtgc acttagaagt ggcaagcgat ctttcatccg atacctttat cggcgctctc 4560 aagcgattca tatcatgccg tggttaccca gttgagattc ggtcggacaa cgggaccaac 4620 tttgttggtg cggaccgtaa gctacgagaa ttcgtggagc aaattctgac gcacagcaag 4680 gacgcaagtc gctacctttc aaatctggga atcaactggg tcttcaaccc cccatcggcg 4740 ccgcatatgg gcggcatttg ggaggccgcc gtacgaagtg ttaaaaagca tcaagtagct 4800 gaacttggag aatcagctat tacgttcgaa catttgtcaa ctctgctatg ccagattgag 4860 tcgtgcctca attcacggcc gttgtgtcca ttgtccaccg atcctgacag cgccgaagct 4920 ttgacacccg gacatttttt gatcgggcag cccatcaatc tggttcccga gccaagtgtg 4980 aagcaactgc cagccaaccg gcttgatcaa tggcaagagg tgcagcagca aaccgaaaga 5040 atttggaatc gatggaagga tgaatatctg gctagtcttc aaccacgaag caagtggcgt 5100 accgctcaac cgaatatcaa cgtagaccaa cttgtgctag ttaagaacga taatgcctcg 5160 ccagcgcagt gggagctggc acgtgtcgag aagattcatc ctgattcatc gggagcagtt 5220 cgagtggtaa cgctacgcag aggttctaca gtataccagc gaccgatcca taaactctgt 5280 gtgctaccat tcgattgatg ccccttcgtg gcctcaaggc ggggagga 5328 // ID Gypsy-147_AA-LTR repbase; DNA; INV; 856 BP. XX AC AAGE02030242; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-147_AA_; KW Gypsy-147_AA-I; Gypsy-147_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-856 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02030242; Positions 25892 25037. XX SQ Sequence 856 BP; 232 A; 203 C; 238 G; 183 T; 0 other; tgttacattt aaaataaatg tcccataggt actactcaat tatcgttaaa ttgtgtttgt 60 cagagtgtag gagttagatt atttattgta aaaaaatccc cccttttttg tgccgtaaac 120 tgctgagccc cactccgaca acgaaaacgc gatgacataa gagggaaacg caaaaatcga 180 ctcttcggtt gggagtttcc gcccatcggc gatcggtgag ggtctcgagt taacgagaca 240 gacagcgatc gtcctcttgt gctcgaaccg ccgaaggaga cagatcgaaa acttagtccg 300 ccgtgttgaa ataccactcg cccatcttct gggttaagtc cgcgtgctat agtgttggca 360 aggagaagat tctgatccct ttggggccga agaactagtg attgtgtaga gaagcgtcgt 420 gagtgcgagg ccaagacggg ccacgattgg aagaaacccc cccccgaaca agtgaaagct 480 gccacccgcg agtgcatagt agcgagtaag agttgccgcg tgcgagtgca taatagcgca 540 attaggccac gaaccccaag ctgacagtgg acgtagtgtg aatatcacga agacagtgag 600 tgagtgagtc gactggggtg agaaaagtgt gaggacggcc ccctcccagt gatagcgtta 660 gtgagtccgt cctacggaga gaaggacgcc gctgccgcgt gcgagcgtga gtgtgggcca 720 tagaccggtc tggaaagcgg cggaaggaac cccttcccaa cctaattaat accctcgatt 780 agggaagcca ggaagccccc gagatgagtc actgaggtaa gtcgcatgca tcctactgag 840 ttaaccctaa cttaca 856 // ID Gypsy-29_CQ-LTR repbase; DNA; INV; 187 BP. XX AC AAWU01023199; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_CQ_; KW Gypsy-29_CQ-I; Gypsy-29_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 438-438 (2011). XX DR Genome; AAWU01023199; Positions 38138 37952. XX SQ Sequence 187 BP; 61 A; 35 C; 56 G; 35 T; 0 other; tgtggtgcat tggcaccaag atcaacaaac acacctatac catataatct tggtaagcca 60 acgtagcggg gggagataaa ccattgtagc ggtggatagg cagacgtgta cggaaagcga 120 gagaaggcag acgcgcggcg gagaaccgtc gtggattaaa gttggaataa agagtttagc 180 tatcaca 187 // ID Nimb-1_DPu repbase; DNA; INV; 5821 BP. XX AC . XX DT 26-FEB-2010 (Rel. 15.03, Created) DT 26-FEB-2010 (Rel. 15.06, Last updated, Version 2) XX DE A family of Nimb non-LTR retrotransposons from Daphnia - DE consensus sequence. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; I group; KW Nimb-1_DPu. XX NM Nimb-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5821 RA Kapitonov V.V. and Jurka J.; RT "A recently active family of target-site specific Nimb non-LTR RT retrotransposons from daphnia."; RL Repbase Reports 10(3), 243-243 (2010). XX DR [1] (Consensus) XX CC The consensus sequence was derived from multiple alignment of CC several copies less than 2% diverged from each other. The 3' CC terminus is composed of the (AACC)n microsatellite. Usually, CC Nimb-1_DPu elements are inserted at the same target site in the CC same unclassified repetetive element and are flanked by 10-16-bp CC target site duplications (TSD): CC 5'-AAAAAATCGTATAAAAGGAGAGAGAAAAGGC[TSD]Nimb-1_DPu[TSD]-3. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 250..1956 FT /product="Nimb-1_DPu_1p" FT /note="Zink finger (zinc knuckle; three CCHC FT motifs)." FT /translation="MNEEEIRAAKAAAASARASLAPLMLNEQGIAEEPEPD FT SMEEDLFEELRTEEQEATTEAEKEEQRRRRKAAADKWLRDKGFTMVQRGKK FT GEPSNKRGRKDIETPSPAGETESSSLPSRGKPPPIPPRRSEATTESSQQVK FT QPPANRMTTSFETRLVAKENDMIVDLKKVSPILINKRLSEQVSKGSVSSWK FT INPGGSLTVVATGAAEILEVMAIQTMGSWTISVVPKTCKGVVHGVHHLITK FT EEFLEEASAVNQGNVVKIVNAARIESRPGGPYKSTSSVIVTFEAAELPDSV FT KILNSIHRVTQYIPEPTQCYKCRRPGHIAKECGGKQRCARCGGVHLTKDCT FT VTREHWKCPNCKGPHSAAFRGCPQWKEAKAVIRIAVVENIPRSAAVAKVRS FT GQTYAAAAKATDQSDTQKEKTSEPNNAGSSDKSPDKNAQTTEPLQKVAQQK FT GRKDKLSSAEKKLEAFMKIMAQFMDILSAYTNNQHLRTLAGAAKALLTHES FT SDNSDSSKDDQSSDSDDVEEEEITPALFTTGIQKTPEKKKKNKKKKKNKRK FT KEDDPAADSTASVQTSQSKDAT" FT CDS 1956..5696 FT /product="Nimb-1_DPu_2p" FT /note="ORF2: AP endonuclease, RT, RNase H, CCHC." FT /translation="MMTRSTAFSFNNPKTQIRILQWNARSLNSNGTEFVKF FT LSEMTSPPNIICIQETWLDESKEFEIDGFDHIRADRRDRIGGGVATFVAIG FT TPFRQIDLPPSTLEAVAVEVFTDEKSISICNIYHSSQENDVQVFENILRDL FT PPDSTPFLCGDFNSHHEMWGGKKNDHKGLSLVSFIEENGLVALNDGNITYR FT SSSGASSVLDLTITTPDIAAKCSWSVVSESTFGSDHHPVMTNIAIPVPVER FT SNNSRWNFKKADWVGFAQTCADISYMDLFSTDIDVFNENVTHAIILAASNH FT IPVVRQGNHKPGVPWWNAACADAIKEKEKAWVRANASRNPDDYKIYSTLRN FT KSRSVMRNAKKEHWRAHCSTINKNTTSRDLWEKTNQMLGRNNKTKPISVLL FT DKNGACVTESSAKANLFAKHYKAVSSDTNLSESFLQHRRISEQSMEDGEAD FT EPDGRIGDAETNADFSMQELTTALKEAKDTAVGCDRISLSMLRHLPDTALR FT TLLALYNESWKQGTLPKGWKHSLIIPLLKPNKTASKPESYRPISLTPVPCK FT VLERMIKIRLSWYLEKNLLLAPNQSGFRRNRGTMDNLVRLENSVQSALNNK FT GYVMAVTLDLEKAYDMIWIKGLIYKLKKLGIRGNMLKWITSFLVGRTAQVS FT LNGTQSELFSCLNGTPQGSVISPLLFIILINDIAKKRRSCMSGMYADDIFI FT WQKHRNIHFLRKKVEKDTQDVIVELRLWGFLVSAAKTGAIVFSKRNIPENL FT SITVDGASIQVQKSIKLLGMTLDSKLNWCKQIDDVVSKCSKVLNFLRLISG FT TKWGAHAKPMLQIYQALIRSKLDYGGELIESASITAKKKLDKIQAQALKTV FT VGAPRDTATEALLRETGEMPLHLRRNLASVKHYLRCYEVKSDLLAEKEEWF FT KNAKQCFLSRVESVTNTLDHHPKSVEKLSISKHPPWQALLPEVTTYTPNDL FT KWQNVVHIYTDAAKSKDGRCAVAFVVPEKVIVEQKRLVDGLATAKCELVAI FT HLAVKWAEAHAALGSFVILTDSKKALQILNSSRENNSVKTSTVDTWRALSR FT AGTTVSFSWVKGHSGISGNEQADRAAKNGLELKPSLSCRRDTNDIRENVED FT LLLKKWQGLWDKPKTECGRFTHRHSPVVSRIASLFGQSRSEQVFLSQIRLD FT MLPLNHRLFKRKKHPTGLCSNCDAEEKENVEHVLLACPAYSKEREEMVYVV FT SENTPTLQELLNFNDEVTLKAVLDFFNAIGIKARLGF" XX SQ Sequence 5821 BP; 1906 A; 1489 C; 1358 G; 1068 T; 0 other; tcgtctgcta tcgacaggtg tacacctgtg tctcgcagct cgtaataatt ttgtgaaata 60 tagtgtaaca agtttcgatc agtgcgtaat catagaagaa acaaagtgtg ctttctcttg 120 tagttccaca agttgggaca ataaagggag gcatcaagtc ttaatttctc gtgtttgttt 180 ttcccccagt gaaatcaaaa acccaccccc cacttccact gtcactgcca cctgacggta 240 gcaggtgaca tgaatgaaga ggagatcagg gctgcaaagg cggccgccgc tagcgctcgg 300 gcgtccctgg ccccgttgat gctcaacgag caggggatcg cagaagagcc ggaacccgat 360 tcgatggaag aagatctctt tgaggaattg cgtacggaag aacaggaggc cactacggaa 420 gctgaaaaag aagagcaaag gagacgccgc aaggcagctg ccgataagtg gcttcgagac 480 aagggcttta ccatggtgca aagaggaaaa aaaggagaac cgtccaacaa gcgaggcaga 540 aaagacatcg aaaccccaag cccggcagga gaaacagaat catcttctct cccctcaaga 600 ggcaaaccac caccaattcc tccccgtagg tcagaagcga ccacggagtc aagccaacaa 660 gtcaaacagc caccggccaa cagaatgaca acctcattcg aaacccgcct tgtagcaaaa 720 gaaaacgaca tgatcgtaga tctaaagaag gtgagcccga ttctaatcaa taagcgcctt 780 agcgagcaag taagcaaggg ctcagtctcc agctggaaga taaacccggg tggctcgtta 840 accgttgtcg caacgggcgc agctgagata ctagaggtga tggccatcca aaccatggga 900 tcatggacca tatcggtggt accaaaaacg tgcaaaggcg tagtacatgg agtgcatcac 960 ctcattacta aggaggagtt cctggaagaa gcaagtgcag tcaaccaggg caacgtcgtc 1020 aaaattgtaa acgctgcgcg catagaaagc cgacccgggg gcccatacaa gtctacaagc 1080 agcgtcattg tcacgtttga agctgcggag ttacctgact cggtcaaaat actaaatagt 1140 atacaccggg taacccaata catcccagag cctacccagt gctacaaatg tcgccggccg 1200 ggacacatcg ccaaggaatg cggaggaaag caacggtgtg caagatgcgg aggagtgcac 1260 ctcaccaaag actgcacggt aacgagagag cactggaagt gcccaaactg caagggcccc 1320 catagtgcag ctttccgtgg ctgcccccag tggaaagaag caaaggcggt catccgaata 1380 gcagttgtgg aaaacatccc tcgatctgca gcagtggcca aagtaagaag tggtcaaaca 1440 tatgctgcgg cagcaaaagc aacggatcag agcgatacac aaaaagaaaa aacctcggaa 1500 ccgaacaacg caggctcctc cgacaaatca ccagacaaaa atgcacaaac aaccgagccc 1560 ctccaaaagg tggcacagca aaaagggaga aaagataaat tgtccagcgc agaaaaaaag 1620 ctggaagcct tcatgaagat aatggcgcaa ttcatggaca tcctgagtgc atacaccaac 1680 aaccaacacc taagaaccct cgccggagca gccaaagctc tgttgacaca cgaaagcagt 1740 gacaacagcg acagcagcaa agacgaccaa agcagcgaca gcgacgatgt cgaggaagag 1800 gaaattacac ctgctctctt cactacagga atccaaaaaa caccagaaaa gaagaaaaag 1860 aacaaaaaga aaaaaaagaa caaaagaaag aaagaagacg accccgctgc agatagcacc 1920 gccagtgtac aaaccagcca gtcaaaagac gccacatgat gaccagatct acagcattct 1980 cttttaacaa ccccaagact caaatccgga tactccaatg gaatgcaagg tcccttaact 2040 caaacggaac cgagttcgta aagtttttat ctgaaatgac atctcctcca aatatcattt 2100 gcattcaaga gacgtggcta gacgagtcta aagagttcga aattgacggc ttcgatcaca 2160 tcagagcgga cagaagagac aggattggag gaggagtagc gacttttgtc gccatcggca 2220 cccccttcag acagattgat ctgccgccat ccacactaga agcagtcgca gtagaggttt 2280 tcaccgacga aaaaagcatc tccatttgca atatctacca ctcaagtcag gagaacgatg 2340 tgcaagtttt tgaaaatatc ctcagagatt tacccccaga ctccactcct tttctgtgcg 2400 gagacttcaa cagccaccac gaaatgtggg gaggaaagaa aaacgaccac aaagggctca 2460 gtcttgtatc tttcatcgaa gaaaacggac tagttgcact taacgacggg aacataacgt 2520 acagatccag ttcaggagca tcctccgtcc tagacttgac cataacaacg ccagatattg 2580 ctgcaaagtg cagttggtcg gtcgtatctg aatcaacgtt cggcagcgat caccacccag 2640 taatgacaaa catcgccata ccggtccccg tggagcgcag caacaactct aggtggaatt 2700 ttaaaaaagc tgactgggtc ggctttgctc aaacgtgcgc agacatatcc tacatggacc 2760 tcttctccac cgatattgat gttttcaatg aaaatgtgac tcacgccatc atcctagcag 2820 ccagcaacca catcccagtc gtgcgacaag gaaatcacaa gccgggcgtc ccatggtgga 2880 atgcagcgtg tgcagacgcc atcaaagaaa aagaaaaggc ctgggtgaga gcaaacgcat 2940 caagaaatcc agacgactac aaaatataca gcactttacg taacaaaagc cgatcggtga 3000 tgcgaaacgc gaagaaagag cactggcgag ctcactgtag cacaataaac aagaacacaa 3060 cgtcccgtga tctctgggaa aaaaccaacc aaatgctggg gcgaaacaac aaaacaaaac 3120 cgatctcggt gctactcgac aaaaatgggg cgtgtgttac ggagtcatca gcaaaagcca 3180 acctatttgc aaaacattac aaagcagtaa gtagtgacac caacctttca gaatcctttc 3240 tccaacaccg ccgaatctcc gaacagtcca tggaagatgg tgaagcggac gagccggacg 3300 gacggatagg agacgcggaa acgaatgcag atttctcaat gcaagagctg acgaccgccc 3360 taaaagaagc aaaagataca gcagtgggat gcgaccgaat atctttatca atgctacgcc 3420 acttaccgga tacagcacta cgtacgctac tggccctgta caacgagtcc tggaaacaag 3480 gcacgctacc gaaaggatgg aagcactcgc tgatcatccc cttgttgaag ccaaacaaaa 3540 ctgcatcgaa acctgagtca taccggccga tctcgcttac cccggtgcca tgcaaggtcc 3600 tagagagaat gattaagatc agactctctt ggtaccttga gaaaaatctc ctcctcgccc 3660 caaatcaaag tggtttccga cgaaacagag ggactatgga caatttagtg aggctcgaaa 3720 actcggtcca atcagcccta aacaacaaag gatatgtgat ggcagtcacg ctcgatcttg 3780 aaaaagccta cgacatgatc tggataaaag gactaattta caagctcaaa aagctcggca 3840 taagaggcaa catgctcaag tggattacct cctttttggt cggtcggaca gcacaagtat 3900 cgttaaatgg aactcagtcc gagctcttct cctgcctaaa cggcacacca caaggaagtg 3960 tcatcagccc gttattgttc attatcctca ttaacgacat tgccaagaaa cgccgctcct 4020 gtatgtcggg aatgtacgcc gatgacattt tcatctggca gaagcaccga aacatacact 4080 tcctgagaaa aaaagtagaa aaggacacac aagatgttat cgtcgagctc agactatggg 4140 gcttcctagt ttcggcagca aaaaccggag caatagtttt ttccaagcgg aacatacccg 4200 aaaatttgag catcacagta gacggagcta gcatccaagt gcaaaagtca atcaaactac 4260 tgggcatgac actcgacagc aaactaaatt ggtgcaaaca aatagacgac gtcgtcagca 4320 agtgctcaaa agttttaaat tttctacgcc tcatatctgg aacgaaatgg ggcgcacacg 4380 caaagcccat gctacaaatc taccaggccc tgatccgctc aaaactcgac tatggaggag 4440 aactcatcga gtccgcctct atcacagcaa agaaaaaact ggacaaaatc caagcacaag 4500 cgctgaaaac agtggtgggt gcgccaaggg acactgcaac agaggccctg ctacgagaga 4560 cgggcgaaat gcccttgcac cttaggagaa atctcgccag cgtcaaacat tatttaaggt 4620 gctacgaggt gaagtctgac ctgctagcag aaaaagaaga gtggttcaaa aacgccaagc 4680 agtgttttct ctccagagtc gagtccgtga caaacacctt ggaccatcat ccgaaatcag 4740 tcgagaaatt atccatcagc aagcacccgc catggcaggc tctgcttcca gaagtcacga 4800 cgtacacacc aaacgacctg aaatggcaaa atgtggtcca catttacaca gacgcagcaa 4860 aatcaaaaga cgggcgatgt gctgttgcct ttgtggtacc agagaaagtc atcgttgagc 4920 agaagcgtct ggtggacggt ttggcgacag caaagtgcga actcgtagca atacacctgg 4980 cagtaaagtg ggcagaagca catgcagcac tggggagctt tgttatccta acagactcaa 5040 aaaaggcatt gcaaatactc aactcaagta gagaaaacaa cagtgtcaaa acaagcaccg 5100 tggacacctg gcgagcactt tccagagctg gcacaactgt cagtttttcg tgggtaaaag 5160 gccacagtgg aatatccggt aacgagcaag ctgatcgtgc tgccaaaaat gggctggaac 5220 taaaacccag cctcagctgc agaagagaca caaacgacat ccgtgaaaac gttgaagacc 5280 tcctcctgaa aaaatggcaa ggcctctggg acaagccgaa aacggaatgt ggacgtttca 5340 cccacagaca cagtccagtc gtgtcaagaa tcgcctccct gttcggccag tcaagaagtg 5400 agcaagtgtt cttatcgcag attcgcctcg acatgctgcc gctaaaccat cgtctgttca 5460 aacgcaaaaa acatccgacc ggtctctgca gcaactgcga cgccgaagaa aaagaaaatg 5520 tcgaacatgt actgctggcc tgcccggcgt actcaaagga gcgagaagaa atggtgtacg 5580 tagtgagcga aaacactccc acactgcaag agctgctaaa tttcaacgac gaggtaactc 5640 tcaaagccgt tttggacttc tttaatgcca taggcatcaa agcccgactc ggtttctagc 5700 aagagaaacc aaaaacctaa agccggtgcc cagctttctc cctaaaacgc atgtgcaagt 5760 ggcgtaaata ggcctgccgg cctggaaacg tcaaacccca tctccaaaaa accaaccaac 5820 c 5821 // ID Poseidon-6_HM repbase; DNA; INV; 3211 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Poseidon-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3211 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata (Poseidon RT group)."; RL Repbase Reports 8(12), 2089-2089 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 880..2946 FT /product="Poseidon-6_HM_1p" FT /translation="MKEKFYKLKNDISYKINFEKNRPNQILKPAILNLTNI FT NLDKATTELLNLGPKFVPIQKNIPYMDIITNIETCALELEQQNKQTQAEIL FT RQNCSKILTSSLKLKIKDNLTKQQRLSLQNLKKHSHLKIYPFDKGTGFALL FT YENEAKXKLEEQIKNSKIIDYDPTPTFTTKFQKLLCKLRKQGKLDNATYFK FT MYPSDCVPPRIYGMVKAHKPEKNYPMRPVVSTINTPPYGTSEYLVKIIQPT FT LNKNITRLINSRNFVNDAKEWKIDPNEIQVSFDVINLYPSVPIDEAIPVII FT NILNDDLEDLKTRTKLTLVDIHQLIELSLSICYFLYEDKIRILPNSGPIGL FT SLMVVIAEAFLQNIEKRALDIAFVNSFQPITYKRYVDDSHARFDSKEKQEL FT FLKALNEQNPSIKYTVELENKRKQLNFLDICITNTMNGFYEFQIHRKDAIT FT NVQIKPNSNINPSITIGVFKGFLCRAKQICSQKYLSQEIDFLINTFFENGY FT NKNNLIKITKNYLDGISKNTTSNQLDQKFVKLPWIPIIGPKLRREFKKQNI FT RVIFTSAPSLSNILCNNKTKLPPNSNPGVYQLKCSCGNIYIGETKKKIISR FT SAEHQRACKNQKWSSSGATEHSKTCRGNFDWLNPKTLAVVQEYKIRKIRES FT LEINKAKVRREIGIGEGVLNRDDGDKVKTRTWGPILTKI*" XX SQ Sequence 3211 BP; 1235 A; 524 C; 442 G; 1007 T; 3 other; tttaatatgt aagttgattt taataaaagg cagcgtataa ctatttttaa atatattttg 60 atataaaaat atatcgatat ttcgctgacc tattgtggtc agcgtcatca ggtaataata 120 aaaaacgtta caaaataaca taaaaacaaa ccgttaaaaa gaagacatta aaaagcgtca 180 aatataaaaa ccaattaaat ataataatga cgtcagtaaa tttttgtcaa aatgggtccc 240 caagttcttg ttttaacctt atccccgtca tctcgattta gaacaccttc cccaatacct 300 atctcacgtc gaacttttgc tttgttaatc tccagagatt ctctaatttt tcgtatcttg 360 tattctkgaa cgactgccaa ggtcttagga tttagccaat caaagttacc acggcatgtt 420 tcggaatgtt ccgtggctcc tgaactcgac catttttgat ttttgcaagc tctttggtgt 480 tctgcacttc ttgaaatgat tttctttttg gtctcaccaa tatatatatt tccgcatgag 540 catttaagtt ggtaaacccc cggatttgag ttcggaggta gtttagtttt gttgttgcat 600 aatatattat ttaaactagg agctgaagta aatataactc tgatgttttg ttttttaaat 660 tctcttcgga gttttgggcc aataatagga tccaaggtag tttaacaaac ttttgatcta 720 actgattaga tgtagtaaga aagattatat tcaagtaaga aattaattga tgaattaaat 780 ttatatcttc aatctaaaat aagatctaat gactatgaga taattaataa tataacgaac 840 aaatctaaaa attaccatta ttttataaaa aaaactaaaa tgaaagaaaa attctataaa 900 ttgaaaaatg acatcagtta taaaatcaat tttgaaaaaa atcgtcctaa tcaaatttta 960 aaaccagcta tactaaattt aacaaatatt aatctggata aagcaacwac tgaactactt 1020 aatcttggtc caaaatttgt cccaatacaa aagaacatcc cttatatgga cataatcact 1080 aatattgaaa cttgcgcctt agaactcgaa caacaaaata aacaaacaca ggcagagatt 1140 cttcgacaaa actgttccaa aattctaacc agttctttaa aattaaaaat aaaagataat 1200 ttaacaaaac aacaacgcct ttctttacaa aacctgaaaa aacattcaca tctaaagata 1260 tatccgtttg acaaaggaac aggttttgct ttattatacg aaaatgaagc aaaagygaaa 1320 ttagaagaac aaataaaaaa tagtaaaatt attgattacg acccaacacc cacattcact 1380 actaaatttc aaaaactatt atgtaaatta agaaaacaag gtaagttaga caatgctact 1440 tattttaaaa tgtatccatc agattgcgtt ccacctagaa tctatggaat ggttaaagca 1500 cacaaaccgg aaaaaaacta cccaatgcgt cctgttgttt ccactattaa cacacctcct 1560 tatggaacat ctgagtatct ggttaaaatt atccaaccaa cattaaacaa aaatataact 1620 cgactaatta attcaagaaa ctttgtcaac gatgctaaag aatggaagat agaccctaat 1680 gaaattcagg tttctttcga tgtaattaat ttatatccat ccgtaccgat cgatgaagca 1740 attcctgtta ttattaacat attgaacgat gatcttgaag atttaaaaac taggactaaa 1800 ttaactctcg tagatataca ccaattaatt gaactatcgt taagcatatg ttacttttta 1860 tacgaagata aaatccgaat tttacctaat tcaggtccaa ttggtttatc attgatggta 1920 gtcatagctg aagcattttt gcaaaatata gaaaaaagag cacttgatat agcctttgtt 1980 aattcattcc aaccaataac ttataaaaga tatgtggatg atagccatgc tcgcttcgat 2040 tcaaaagaaa agcaagaatt atttcttaaa gctttaaatg aacaaaaccc ctccataaaa 2100 tataccgttg aacttgaaaa caaaagaaaa caacttaatt ttttagatat ttgtattaca 2160 aatacaatga atggatttta tgagtttcaa atacaccgta aggatgcgat aacaaatgtt 2220 caaataaaac caaactccaa catcaacccg agcataacca ttggcgtctt taaaggtttt 2280 ttatgtcgag caaaacaaat ctgctctcaa aaatatctct cacaagaaat tgatttcctc 2340 ataaatacat tttttgaaaa cggctataac aaaaataatc ttattaaaat aaccaaaaac 2400 tatttagacg gcatttcaaa aaatactaca tctaatcagt tagatcaaaa gtttgttaaa 2460 ctaccttgga tacctattat tggcccaaaa ctccgaagag aatttaaaaa acaaaacatc 2520 agagttatat ttacttcagc tcctagttta agtaatatat tatgcaacaa caaaactaaa 2580 ctacctccga actcaaatcc gggggtttac caacttaaat gctcatgcgg aaatatatat 2640 attggtgaga ccaaaaagaa aatcatttca agaagtgcag aacaccaaag agcttgcaaa 2700 aatcaaaaat ggtcgagttc aggagccacg gaacattcca aaacatgccg tggtaacttt 2760 gattggctaa atcctaagac cttggcagtc gttcaagaat acaagatacg aaaaattaga 2820 gaatctctgg agattaacaa agcgaaagtt cgacgtgaga taggtattgg ggaaggtgtt 2880 ctaaatcgag atgacgggga taaggttaaa acaagaactt ggggacccat tttgacaaaa 2940 atttaactga cgtcattatt atatttaatt ggtttttata tttgacgctt tttaatgtct 3000 tctttttaaa cggtttgttt ttatgttatt ttgtaacgtt ttttattatt acctgatgac 3060 gctgaccaca ataggtcagc gaaatatcgt tgatatattt ttacatcaaa atatatttaa 3120 aaatagttat acgctgcctt ttttattaaa atcaacttac atattaaata tatatatata 3180 tatataacaa gaaatgactt ttaaagatat a 3211 // ID L1-8_CQ repbase; DNA; INV; 4633 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4633 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 138-138 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 146..1294 FT /product="L1-8_CQ_1p" FT /translation="MTSVRPRENTFKVDLSVFPKRPSFEEIHSFVHDVMGL FT RIDQVKRLQMNHVQNAAHVKCDTLKTAQDAVEEHDGRHEIELNKVKYKVRL FT QMDDSTVEVKVHDLSENVRDEELIGFLRHYGDVHCIKELVWGENFAYKGIS FT SGIRVVRMTLRKHIQSFVTVQQEKTLVTYRGQPQTCRHCSRLSHPGITCTD FT NKKLVGQKSDLSDRLKAAQSTDSTSYATVVDKGTAIINSLLPNFVATNLNQ FT LNQAASESKQKELDTTAQQPSCSHLSTATDGVEQSSSSTITPAVAEDDSMS FT DETIVPVANETELMDQDLQQRGYDDSAQFDATSPVDVSGSPFKSPPLPLPI FT KQVHSSVSESDESSAEGSEFQKVKPRRGRGRPKKPRTDSV" FT CDS 1383..4577 FT /product="L1-8_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNHPTMSYNICTINTNAIANNNKIESLRSFIRQTDAD FT VVLLQEVENSNLTIPGFEVLTNVDDSKRGTAIAIKANIPYSNVQRSLDSRI FT ITVKLGTSTTVCNIYAPSGTQNVSSRENLFKNSLPFYLQNSAEHLVLGGDF FT NSVIRNEDSTGTSNFSISFKTLVGNLDLHDTWSSLNRNQSAFSFVRSNAAS FT RLDRIYVSSSLVSHLRVSEYLVTSFSDHKAFKVRCCLPDLRVLQGRGYWSI FT RSHILTNENVEEFGLRWNRWLRERRNYNSWVSWWIKCAKPKIRSFFKWKTN FT EKFRQFNANNEHLYRQLKEAYEGLLGNPSRREDINRIKSQMLQIQRNFSKV FT YERLNDQTIGGERISGFQLGERTRRKKTSFIKSIQHQNRQLTDPALIENHV FT FEYFKSLYSKQPVEPNVNFPSNRSIPAGSDSNDRLMDEITTSEIFFAVKSS FT ASRKSPGCDGIPKEFYLKTFEIIHPQLNLILNEMLQGNISEELVEGVIVLC FT KKKTNESSIKSYRPISLLNFDYKLLSRILKQRLEKVMVENNLVNSEQKCSN FT SERIIFEAVQAIKDRIVEINCTNRAGRLISFDLDHAFDRVDRGFLLGVMRS FT MGFNERFLLLLERIMSASYSRLLVNGKLTQKFPIERSVRQGDPLSMHLFVL FT YLHPLLVKLLSLCNHPSDLVVAYADDISVIVRDDCILDTLKQAFVEFGECS FT GAVLNLHKTLGMNIGRNRSREIPPWPSMAESLKILGVTFFNSLKETTKHNW FT NELVRKTSRLMWLYKPRSLTLVQRVVVVNTFVTSRLWYLASVLSITNVMIA FT RITSQIGYFIWAQYPHRIAMEQLALPVCKGGLNLHLPVHKCKALLVSRYIA FT DREYTPFARNFEQQMSNPPNVMGIPALYPCLKVVAKLLPYIPTNLTANPSA FT STLHEYYRENLRVPKVMEENPNIDWNRVWRNTRSKALNSAERSTYYLLVNG FT KIPHAALMFRQNRAMNPNCEHCPNSIEDLDHKFATCSRVRHLWRHLRLKLG FT TILGRRIQIKNFLLPELKHCEVRSKQKALKTFINYVNFILDPKSSLSITAL FT DFHLDCNV" XX SQ Sequence 4633 BP; 1365 A; 1066 C; 1026 G; 1176 T; 0 other; tagttgggtg caagcttttg cccagaccag acgtatttac cgcgcgttct gcaaaagttt 60 gttttttgcc cgtgtgcagc cacacttcgg tgtgaggaaa atcttctctt tcattctcgc 120 taccagcaaa caattgccag cgacgatgac ctccgtgcga ccccgggaaa ataccttcaa 180 ggttgacctc agtgtgttcc ccaagcgccc aagcttcgaa gaaattcact cctttgtgca 240 cgatgttatg ggactacgca tcgaccaggt gaaacgactg cagatgaacc acgtgcagaa 300 tgctgctcac gtgaagtgcg ataccctgaa gaccgctcag gacgctgtcg aggagcacga 360 tggacgccac gagattgagc taaacaaagt gaaatacaag gtgcgattgc agatggacga 420 ttcgacggtg gaggtcaagg tccacgacct gtcagagaac gtgcgtgacg aggagcttat 480 cggtttcctg cgacactatg gggacgtgca ctgcatcaaa gagttggtgt ggggggaaaa 540 ctttgcgtac aagggcatct cttccggcat tcgcgtggtg aggatgacac tgaggaaaca 600 tatccaatcg ttcgtgacag tccagcagga gaaaaccctg gtcacttaca gaggtcaacc 660 ccaaacttgc cgccattgtt ctcgtctgtc gcatcccgga atcacatgta cggacaacaa 720 aaaactcgtg ggacagaaga gtgacctgag tgatcggctc aaggccgcgc agtcaaccga 780 ctcgaccagt tacgccaccg ttgtggacaa agggaccgct attatcaact cgcttttgcc 840 gaacttcgtc gccaccaacc tcaaccagct caaccaagcc gcctctgagt cgaaacaaaa 900 ggaactcgac accacagcgc agcagccgtc ctgctcgcac ctgagtaccg caactgatgg 960 agtcgagcaa tcctcctcct caacaatcac tccagccgtg gcagaagacg actcgatgtc 1020 ggacgagacg atcgtaccgg tcgcgaacga gacggagctg atggatcaag atctgcagca 1080 gcgtgggtat gatgattcag ctcaattcga tgctacctct cctgtcgatg tgtcagggag 1140 tccgttcaaa tcccctcccc ttcctctccc tatcaaacaa gtgcatagta gtgtttctga 1200 atccgacgag tcgtccgctg aagggtcaga gtttcagaaa gtgaaaccaa gacgcggtcg 1260 gggtcgtccg aagaagccga gaactgactc ggtctaataa catcgattaa aacatcccta 1320 ctaacctatc ccatttacta atctcgagtc ggttgcgagt attcaacccc acaaaactaa 1380 tcatgaacca tcctacaatg agctataata tctgcacaat caacactaac gcaattgcaa 1440 ataacaacaa aattgaaagt ttgcgatctt ttatccgcca aacagatgca gatgttgtac 1500 tgctacaaga agtcgagaac tcaaatttga ctatcccggg gtttgaggtt ctcacgaacg 1560 ttgacgattc caagagaggt acagctattg caataaaagc aaacattccc tattcaaacg 1620 tccagcgtag cttggacagc cgcatcatca ctgtaaaact tggaacttcg accactgttt 1680 gtaatattta cgcaccatct ggaactcaaa atgtgagctc tcgtgagaac cttttcaaaa 1740 actctctacc attctacctc caaaattctg ctgagcacct tgttttgggt ggggatttca 1800 atagcgtgat tcgtaatgag gactcaactg gtactagtaa ctttagtata tcattcaaaa 1860 cattagttgg aaatttggac cttcacgaca cgtggagttc tttaaatagg aaccagtcag 1920 cattcagctt cgttcgctca aatgctgctt ctcgtcttga tcgaatatat gtgtcctcct 1980 ctcttgtttc tcaccttcga gtctctgagt acctggtgac gtcattctct gatcacaaag 2040 cttttaaggt tcgatgttgt ctccccgatt tgagagtgtt gcaaggaaga ggttactggt 2100 caatccgttc acacatactc acaaacgaaa atgttgagga gtttggtctg agatggaatc 2160 gatggttgag ggaacgaaga aattacaaca gctgggtcag ctggtggatt aaatgtgcaa 2220 aacccaaaat acgtagtttc ttcaagtgga aaacgaatga aaaattcagg caatttaatg 2280 caaataacga acatctctac cggcagctta aggaagctta cgagggactc ttgggtaacc 2340 caagtagaag ggaagacata aatcgaataa aatctcaaat gctacaaatt caacgaaatt 2400 tctcaaaagt ttacgagaga ctgaatgatc aaactattgg gggggaaaga atatctggtt 2460 ttcaacttgg ggaaagaact cggcggaaaa agacaagctt tataaagtcc attcaacatc 2520 aaaatcgtca gctaactgat cccgctctga ttgagaatca cgtgttcgaa tattttaaat 2580 ccttgtactc aaaacaacct gtagaaccaa atgttaactt tccgagcaat cggtcgattc 2640 cagcagggtc agattcaaat gatcgtttga tggatgaaat aacaacatca gaaatctttt 2700 ttgctgtaaa atctagtgcc tcgagaaagt cgccagggtg tgatgggatt ccaaaggaat 2760 tttacttgaa aacctttgag atcattcatc cgcaactcaa tctaattttg aatgaaatgc 2820 tccaaggaaa catttctgag gaacttgttg aaggagtaat tgtactttgt aagaaaaaga 2880 caaacgaaag ctcgatcaaa tcataccgac caatttcttt attgaatttc gattacaaac 2940 tgctttctcg catattgaag caaagattag agaaagttat ggtagaaaac aatctcgtga 3000 attctgaaca aaagtgctcg aactctgaaa gaatcatttt tgaggcagtg caagcaatca 3060 aagaccgaat tgtcgagatt aattgcacta acagagctgg gagacttatt tcgtttgatc 3120 ttgatcatgc ttttgatcgc gttgatagag ggtttcttct gggggttatg agaagtatgg 3180 gttttaacga aagatttctg ctgctcttgg aaagaatcat gtccgcttcg tactctagat 3240 tgctcgtcaa tgggaaactc actcaaaaat ttcccattga acgctctgtt agacaaggtg 3300 acccattaag tatgcacttg tttgtcctct accttcatcc tctcctagta aaactcctct 3360 ccctctgcaa tcacccctct gacttagtag tagcttatgc ggacgacatt tccgtgattg 3420 ttcgtgatga ttgtatactg gacacactca agcaagcatt tgttgagttc ggagagtgct 3480 ctggagctgt gttgaacctc cacaaaacat tggggatgaa cataggcagg aaccgctcac 3540 gtgagatccc accgtggcct tccatggcag agtcgctcaa aatactcgga gtgacttttt 3600 tcaattcact gaaggagact acaaaacata attggaacga gcttgtccgg aagacgtcga 3660 ggctgatgtg gttgtacaag ccacggtcac tcactctcgt gcagagggtt gtcgtagtaa 3720 atactttcgt aacgtcgagg ctatggtacc tggcctccgt tctgagtatt acaaatgtga 3780 tgatcgctag aataacttcg caaattgggt attttatctg ggcacaatac ccacacagaa 3840 tagcaatgga gcaattggcg ttacccgtct gtaaaggagg gttaaacctg catctccctg 3900 tgcacaagtg caaggcgttg ttggtcagcc ggtacatcgc cgatcgagag tacaccccct 3960 ttgcacggaa ttttgagcag caaatgtcca acccacctaa cgtaatgggt atccctgcac 4020 tctacccgtg tttgaaggta gtggccaagt tgctcccgta catccccacg aacctgactg 4080 cgaacccgtc ggcgagtact ctgcatgagt actatcgaga gaacctgcga gtaccgaaag 4140 ttatggagga gaaccccaac atcgactgga atcgagtatg gagaaacacc cgaagcaaag 4200 cactcaactc agctgaaaga tcaacctact accttttggt caacggaaag atccctcacg 4260 ccgcgctaat gttcagacaa aacagagcca tgaatccgaa ctgtgaacac tgtcctaatt 4320 ccattgaaga tcttgatcat aagttcgcaa cgtgcagtag ggtaagacat ctttggcgtc 4380 accttcgttt aaagttgggg acaattttag gtagaaggat tcaaattaaa aattttctct 4440 tgccagagct aaaacactgc gaagttagga gcaaacaaaa ggcgctgaaa acatttataa 4500 attatgtaaa tttcattcta gaccctaaaa gctcgctatc aattaccgca cttgatttcc 4560 acttagattg taatgtatga aatgatgctg taactcaaaa gtctgaataa acgtttttac 4620 aaaaaaaaaa aaa 4633 // ID Hovi1 repbase; DNA; INV; 2690 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Hovi1 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hovi1. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-2690 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 628..2526 FT /product="Hovi1_1p" FT /translation="MILPLFYRMETPNDISEKIRNGTYVLLPKRMGRSPVW FT DLFADVRKSDGTILDGYLCCRKCNRLFKYDGKHTSNLNRHRCFYVGVEDHK FT PITMEDKKAALEACAQWIIADCRPYSIIEGAGFHKLAKFLVQIGAKYGENI FT KIDDLLPEAGTISRQLNKTAQMKKVEMKAFLEQRMENRIEASVTMDMWTDM FT YVKRNFLCASLHFQKDNRLQNIALGIKSMDLSRTNDELILSKLKNMLAEFG FT IKHLDKLIFVTESEDQKTKPFRNLTRINCSSELLSNVLELAEDASSELKGI FT IQNCKKLVKYFKNSNSYHNLPSILRSPGTTRWNSNFVMFKFITEHWVKITK FT ILADLNQDPLLGDINISTIQSLLKLYQDFDTILHKLQDASYPSLCFVLPSI FT NKLKVLCAPDPNDIAAIVEFKQNILSNLSTWTSKLSVYHRVANFLYPPAKE FT YIEDKHDGAKEFCRKELLERLSQSSNNIKKEKSDTETQTMDSHTNDIKDDN FT TNSWKSRDLNEENSNTSMQSMDSNNPEDKYTDFKFFFSHLIHTPKLSLEDW FT VNYEIERYSSHIEQYDRYFDVMTWWQQNQTNFPYLSKLAFFILSIPASIAA FT SQGIRNLAGNLIVTGKSNCIAPKAVDSTVFLNLAP" XX SQ Sequence 2690 BP; 911 A; 493 C; 530 G; 756 T; 0 other; catgccaaag gcaaggcccg agcgtcgtta ttcctttagc ggtgcaccca gcttgcagac 60 cacggaggac acgctggccc agttgtgcgg ttcctatgaa attgtcgacg tctatgatca 120 aaataatagc acatctatgg acctgcccaa gaaatcgata aggaataacg ctgacaaagc 180 gaaatgtgat caatccgcag atttggatga cgatgttatg gacaagaatc gcaaacgcaa 240 tcggcacacg gaaaatctgg aggagctctg tagtgcgcaa aatgcattaa tgacgcgcgt 300 tgccgacagc ttggacgcaa tacgctccaa catggccgag cagacgaatg taatgctgca 360 gcatttcaaa agaatggagg aaatcgaatt gaaaaagctg gaggcgatca aatgttttaa 420 gaaatagcat atttttattg tccttagtta gtatgtttct tttaaatatc tataagcttt 480 taataaatgt gtacatatct atttgcatat attcatgtgg aatagtttta gttttggtta 540 gattgtagaa tacccttccc tttttaataa gtttatgtat aaactgataa gtatttatat 600 tttcaataat atttgcttat tataaccatg attttgcctt tgttttacag aatggaaaca 660 ccaaacgata tttcggagaa aatacgaaat ggaacatacg tgcttctgcc caagcgcatg 720 ggcagaagtc ctgtttggga tctatttgca gatgtacgta aatccgatgg caccatactg 780 gatggttatc tgtgttgccg taaatgcaat cgactcttta aatatgatgg caaacataca 840 tcaaatctga atcggcatag atgcttctat gttggcgtgg aggaccataa gcccataaca 900 atggaggata agaaagccgc tcttgaggct tgcgctcagt ggataattgc cgattgccgt 960 ccctatagca ttatagaagg tgccggtttt cataagctgg ccaagtttct cgtacaaatc 1020 ggtgcgaaat acggtgagaa tataaaaata gatgatttgc tacccgaagc gggaaccata 1080 tcgcgccagc tgaacaaaac ggcgcaaatg aaaaaagtag agatgaaagc atttttggag 1140 cagagaatgg agaatagaat tgaggcctcc gtcaccatgg acatgtggac agatatgtat 1200 gtaaagcgta attttctatg tgctagtctt cactttcaaa aggacaacag gctgcagaac 1260 atagcactcg gcattaagtc catggacttg agtcgtacaa acgatgaact cattcttagt 1320 aaattaaaaa atatgttggc cgaattcggc atcaagcatc ttgataaact gatatttgta 1380 actgagagtg aagaccagaa aacgaaaccc ttccgcaatt tgacacgcat aaattgttct 1440 agcgaattgc tttcaaatgt tttggagctc gccgaagacg cttcaagcga actgaaggga 1500 ataattcaaa attgtaaaaa gctggtaaaa tattttaaaa attcgaatag ctatcacaat 1560 ttaccatcaa tattgagaag tccgggcacg acacgttgga attcaaattt tgttatgttc 1620 aaatttataa cagagcattg ggtaaaaata acaaaaatac ttgctgattt aaatcaagat 1680 cctctacttg gtgatattaa tataagcaca attcagtctc tactcaaatt atatcaagat 1740 ttcgatacaa tattgcataa attacaagac gccagctatc cttccctgtg ctttgtcctg 1800 ccatcgatta acaaattgaa agtactttgt gcaccagatc cgaatgatat tgcggcaatt 1860 gttgagttta aacaaaatat tttaagtaac ttgagtacgt ggacctcgaa gttgagtgtc 1920 tatcaccgag tcgctaattt tttatatcca cccgccaagg aatacataga agacaagcac 1980 gatggggcaa aagaattctg cagaaaagaa cttttagagc gtctcagcca aagttcaaat 2040 aatatcaaaa aagaaaagtc ggatacagag actcaaacaa tggactccca tacaaatgat 2100 attaaagatg ataatacgaa tagctggaaa tcgagagact tgaacgaaga gaattcaaat 2160 acgtcaatgc aatcaatgga ctccaataac cctgaagata aatatacaga tttcaagttt 2220 ttcttctctc acctaatcca tacgcccaag ctttcacttg aagattgggt taactatgaa 2280 attgaacgtt actcaagcca tatagagcaa tatgaccgat attttgatgt tatgacctgg 2340 tggcagcaaa accaaacgaa ttttccctat ttatcgaagc tggcattttt tatactttca 2400 atacccgcat ccattgcagc ttcacagggt atacgcaatt tagctggaaa cctaatagta 2460 actggcaaaa gcaactgcat agcgccgaaa gctgttgaca gcacggtatt tttaaacttg 2520 gctccttaaa ttaactattt ggccacaaaa tgttaagaaa cttttcaaat ttgataaaag 2580 aaaatgccaa atggtattcc aagaaatgta taaaaaataa agatttaaca ttcattttgg 2640 atttatatat atatatctat atatagttta gccagattgc atttggcatg 2690 // ID Harbinger-1_BF repbase; DNA; INV; 4812 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-1_BF autonomous DNA transposon - consensus. XX KW Harbinger; DNA transposon; Transposable Element; Harbinger-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4812 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4812 RA Kapitonov V. and Jurka J.; RT "Harbinger-1_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 796-796 (2008). XX DR [2] (Consensus) XX CC This is a young family of autonomous Harbingers characterized by CC 15-bp TIRs and TAA TSDs. The consensus codes for two proteins: CC transposase (2 exons) and DNA/protein-binding SANT/Myb protein (4 CC exons). XX FH Key Location/Qualifiers FT CDS join(644..783,1168..2152) FT /product="Harbinger-1_BF_1p" FT /note="transposase." FT /translation="MLGFYDTLMNELMREHRCDFKSFVRMEPQMFYDLLMR FT VGPSIEKSRNVRPALPAALKLAITLRFLATGNSYQSLEFAFRVAHNTISQF FT IPQVCKAITTEYGDEVFKTPSTPDEWRQVAQGFQKRWQFPHVCGAIDGKHV FT RIRKPKKSGSVYFNYKGFFSIVILALADHNYKFLWANVGSPGSNSDCGIFN FT NCSLEPSLSGGTIGFPDADPITNDDRDTGYFIIGDDAFPLRTYILKPYAQR FT YLTIEERIFNYRTSRARRVVENAFGIMAMRFRCLLTTLSVAPETATRITEA FT CLTLHNLMRMRYPGLQNMDLDYEDEHHQLMPGSWRNEAVLQEVEDEGRGPR FT ATADGKKLRAYFRKYFNSNAGSVPWQLHAIGQ" FT CDS join(4077..3717,3571..3387,3148..2957,2694..2488) FT /product="Harbinger-1_BF_2p" FT /note="SANT/Myb." FT /translation="MPKGRKAAGKDKGKQPRRPYTRRKKVLKTSEELDAET FT SGSELSEPGGGRGESNVDGSSDEPLSSSQQPSQKASITEDQDEKIASFIES FT HQAFYDMSTPEYKNKQKKDAWLKDVAPEIGLTPKQILTRFQTMRTDYFKLK FT RKLAGKSGQGQTRVTPLQDFKLRRYKFLDAHYRGRASSTELGSVKQPVFSV FT DEEEEEESSDGPRHDVSKASILPKNKKSKKGLGDVLVELLADSQKELKQSQ FT ATLAESSSSASSSTRGGSERDAFAQWLARVQHDIPDDRWRSYQREVFDVAL FT RYSTGTQPPQTRQHEPLDPPASP" XX SQ Sequence 4812 BP; 1188 A; 1182 C; 1163 G; 1279 T; 0 other; tgcctacgtc acattaccta gggtgcccgc gcggtgttta ggcgatgtag gccccggccg 60 agctctgaag ccgggaagac atcggctcac gtagtcacaa atgaatttgt tttcacgcgt 120 cccgttagag gttccggggg ggtccggcgg gcattccggc ggcttttgtt caaaagtctt 180 atgctttaag tttttcatcg ctcgatgttc gataggcacc cgtaagctgc cctggggatc 240 tcccgtgtac tccgtagggg tctggcaggg ggccggccgg tgcaccccgg gtggataact 300 actaactgtg ttacgcctcg agaaccacaa ccaggcatcc tgcgggctgt tcgggcaatg 360 gtattttttt tggggggggg ggggggggga cataagtact ggttgcctca tcttcgttcg 420 cttttgcacg gtgtgctacg tggcaccaac actagtagga gcgggaatgc cactgcacct 480 caaccgacat gtgccgcttg ctcaactcag gctgaggttg gcgacggctc gtcaccgttt 540 catccttgtt cttgctgcat gggtagcgga agaggagaga caagcggaac ggcatcgcca 600 gcgaagaaga tggtgggttc ggccctggtt gacgcgtcga cccatgcttg ggttctacga 660 tacgctgatg aacgagctaa tgcgagaaca tcgttgcgac ttcaagtcct tcgtgcgcat 720 ggagccccag atgttctatg acctgctcat gcgtgtgggg ccttctattg agaagagtag 780 aaagtgagtg tttttactgc ttatatttcc ctgtcgacaa aattaaaatg aaagtaaaca 840 accatgatga caaattagat tatgacatgg tcacattcat aaaggcaggg gcctggccgg 900 gctgtttgtg gaatcgaaga acaaaacttt atatcgggaa atgtacagag attagactac 960 tgactaatgt gagagtcatt gttacgtttt atgcgttttc ttgtccttta tgtcatactt 1020 gtccttcccc aaaattaacc gaccgagccc cggccttgca atgtgaccat gtcattacca 1080 tgtgactacc cagtgtccat gccattatca gatttccatg tcgtttgcca agccgtatca 1140 aatatttttc tttcattttt cttgcagtgt tagaccagcc ctgccagctg ccctgaaatt 1200 ggcaattact cttcgattcc tggcgaccgg caactcgtat cagtccttgg aatttgcctt 1260 tcgagtagct cacaatacta tatcccagtt cattccccaa gtttgcaaag caattacaac 1320 ggagtacggc gacgaagtct tcaaaacccc ttcgacccca gacgagtggc gtcaggtggc 1380 ccagggcttc caaaagcgct ggcagttccc ccatgtttgc ggcgcgattg atgggaaaca 1440 cgtgaggatc agaaagccaa agaagtctgg aagcgtctat ttcaactaca aaggattctt 1500 ttccattgta atccttgctt tggctgacca taactataag tttctctggg caaacgtggg 1560 ctcgcctggg tctaactctg actgcgggat tttcaacaac tgttcgttgg agccatctct 1620 ctctggtgga acaatagggt ttcctgatgc tgacccaatc accaatgacg acagggacac 1680 aggttatttc atcatcggcg acgacgcctt cccgcttcgt acatacatcc tgaaaccata 1740 cgcccaacgc tacctcacca ttgaagaacg aattttcaac tacaggacgt cacgtgccag 1800 aagagtggtg gaaaacgcgt ttgggatcat ggctatgcgc ttccgatgcc tcctaaccac 1860 cttatctgta gctcccgaga ctgccacacg catcactgag gcctgcctca cattgcacaa 1920 tctcatgaga atgcgctacc cgggcctgca gaacatggac ctggactatg aggacgagca 1980 ccatcagctg atgcctggat cctggagaaa cgaggctgta ctgcaagagg tggaagacga 2040 gggccgagga cctcgagcaa cagcagatgg aaaaaaactg cgggcctact ttcgaaaata 2100 cttcaacagc aacgcaggaa gcgtaccatg gcagcttcac gctataggac aataagctgt 2160 gaatttatac ggttattcat gttgatgttt gggtagtaca tggatagtgg caacaaacgc 2220 ttcttcaact tgcttgcgaa tagttatgtg gaaagagatg agatatagtg ttacttgctc 2280 ctttataagc acgccaaggt aattcagtct tattcttgta ttgtacattg tagttttctt 2340 tgttgatttt agttcaaata ctgcatcaaa atgagacaac aaatatagga aagaaaatgt 2400 gtttattcca tgaaacactg aataatgaca aagagagatg aaaagtggtg ttgaattttt 2460 aggaactttt cagcttgttg gcttctaagg actcgctggt gggtcaagcg gctcatgttg 2520 gcgggtctga ggaggctgcg tccccgtgct gtagcgcaga gccacgtcga agacctccct 2580 ctggtatgac cgccacctgt cgtcagggat gtcatgctgc accctggcca gccactgtgc 2640 aaacgcatcc ctctcacttc ctccacgggt gctgctacta gcgctgcttg atgactataa 2700 taacaatgga aaaggaacat ttagttttat atatttctat atataattca tgataacgaa 2760 gtcgattcta gaactaatta tttgttgaca acgattattg aaatgaatcc gcaataaagc 2820 ccaaactggc aaaagacctt gggagatttt taacctaata atccattata agtctttaaa 2880 cgttttgcaa gaatgtatct attgcattta caaagtacaa ttacagaaat aacaacatta 2940 acgagcacta ctgtacctct gctagggttg cttgagattg cttgagctcc ttctgtgaat 3000 ccgccaaaag ctcgacaagt acatcaccta gtcccttctt ggacttcttg tttttgggca 3060 ggatcgacgc cttgctgacg tcatggcgtg gtccatcact gctttcctct tcttcctcct 3120 cgtccacgct aaagacaggc tgcttaacct gaaagatatg caatgaaaag tacaggatgt 3180 ttagtaaaag taacgaaaag tttaatgggc ttagccgcca caccaactac atcaattaag 3240 acaccctcat attgtcacag tcattacaca cctgtactgc gcagacatgt aaagtagggc 3300 tgtggtattc atgctacgtc agactattca aatttgcatg taaactcagc tcgttaaggg 3360 tgtactacat acaaatagat gcttaccgat ccgagctctg tggatgaagc cctgccccgg 3420 tagtgcgcat ccagaaactt gtagcgacga agtttaaagt cctggagagg tgtcaccctg 3480 gtctgtccct gtccagactt ccctgccagc tttctcttca gtttaaagta atctgtcctc 3540 attgtctgga accttgtgag gatctgtttg gctgtaatta gatacaaggg gaaagataag 3600 ataatccaaa acgtatatcg aacgttcaat aatcaacttg tatgacataa actatcaaat 3660 aaagtttcag atctcaaaaa caagcatgaa gtttatgaag tcagaaagaa acatacgcgt 3720 caatcctatt tcaggggcca catccttcag ccaggcgtcc ttcttttgtt tgtttttgta 3780 ttccggagta ctcatgtcat agaaagcttg atggctttca atgaacgaag caatcttctc 3840 atcttgatcc tccgtaattg acgctttttg tgatggctgc tggctggagg ataatggctc 3900 atccgaggag ccatccacat tagattcacc cctccctcct cctggctcgc tcaactcact 3960 gccgctggtt tctgcatcca attcctcgct ggttttcaag accttcttcc ttcttgtgta 4020 aggtctcctg ggctgcttgc ccttgtcttt acctgctgcc tttcttccct ttggcatggt 4080 tcagttctag aacacgcgta cgtgcggtgg gagttctaga acacgcgtac gtgcggtggg 4140 atgtttcgta ccacctgatc tgtaaggtca caaaattgac ctcgtcctca gctggacacc 4200 gacgggtgcc ctctcggtgt gttcataatt agattacggc ttggctcggg atgttccccc 4260 ctccgggccc tggttcattt taagtttgac ttaaaatgaa ccagggcccc gcccgagacc 4320 aaaaataagc cggcagccgc tgggagaccc acaggacccc gaccggaagt ggcaaaaata 4380 gcccgtaaat tacgcgcccg ggtccctatg tggaaatgtg acgatggcca aaattgtcct 4440 gtttgatgac caagggaccc cgtcagcggc cctggcggtg ttccgggatg acacaacgga 4500 tgcaatagtt tgtaatgtaa aaaagttaga cctgtaccgt cgagtaccgc cggggtatct 4560 cgcgggtatc gggcgggtag tgtaggggac cctgccgatg tgttcacgat tagttgacgg 4620 ctttgattcg gcgtattttc ctgtccgggc tctgattgat tttaagtccg acatacaatc 4680 aaccggggcc ccgtccgagc tccaaagtaa cccggcagcc gccgggggtc tcgcggggcc 4740 ccgcccgaaa gtgcaacaaa tcagccccta acctacccgg ccgggcccct atgtgcaaat 4800 gtgacgttag ca 4812 // ID piggyBac-1_BM repbase; DNA; INV; 2467 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE A piggyBac DNA transposon from a silkworm - a consensus sequence. XX KW piggyBac; DNA transposon; Transposable Element; KW horizontal transfer; piggyBac-8_SM; piggyBac-1_BM. XX OS Bombyx OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae. XX RN [1] RP 1-2467 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 535-535 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-1_BM is a very young family of piggyBac transposons, CC characterized by 15-bp TIRs (one mismatch) and TTAA target-site CC duplications. The consensus sequence was reconstructed based on CC multiple alignment of 7 copies that are ~99.9% identical to the CC consensus). The transposase is probably incomplete due to CC deletion of its short N-terminus. Surprisingly, the complete CC piggyBac-1_BM DNA sequence is 88% identical to the planarian CC piggyBac-8_SM (Schmidtea mediterranea). The high nucleotide CC identity, including non-coding regions, between these transposons CC in two species diverged from their last common ancestor over 500 CC million years ago, is a first clear evidence of horizontal CC transfer of piggyBac transposons. XX FH Key Location/Qualifiers FT CDS 725..2263 FT /product="piggyBac-1_SMp" FT /note="piggyBac transposase." FT /translation="MPNKRTRRCMRFPSSSEDEDGSNVPNQQTEIAADGTI FT WTRIEEGGVVGRLPIHSAFKDVHGPTAHAKRNIMKGNLSSAFLLLIDNHIL FT EHIRICTELEASRVLGKTWTITQEKLKAFLAILYARGAYEGNTLRLQYLWN FT KKWGPSFFSSTMSRRDFTDILRYIRFDKRNQRSQRLQTDKFALVSAVWDKF FT IENSQNCFKPGAYITVDEQLFPTKARCRFTQYMPNKPHKFGIKFWLASDVE FT TKYVVNGFPYLGKDETRNASTPLSEFVVMKLLEPYTMKGRTVTTDNFFTSI FT PLALKLRSKSTSLLGTIRANKRELPKICKLKKDSMARFSTLLYQSNGCTLT FT VYKSKPNKKVLLLSTKHKNIKIDKAAKKIPETVLFYNKTKFGVDVTDQMAR FT KYTVKSGCRRWPLQVFFNILDLAGINSWILYKNTTGENISRKDFLFRLAEE FT LASEYQTSRQKPHEADISTTAGTVSVRKWCQIGYCNNNKTTNICNKCKKSL FT CGKCTKSKIYICRNCER" XX SQ Sequence 2467 BP; 885 A; 397 C; 458 G; 727 T; 0 other; cactagattt accagaccag tcattttgac tggtcgtgca acttcaattc aaaattacta 60 ctacattaaa tatttgcttt ccaaaatgat gttatgactt ttgtaggtat aagtagagaa 120 tatattttat gaacttaatt ttactttata taatgttagt aacaaatata ttttttgtta 180 taatgattta tacaagaacc agtcaaaatg actggtgcat gtttctagga agaaatgtat 240 gatttcggga gaatcatgag accgtgaaaa tacaaatagt gcccatgctt cttttgttca 300 ttacgttatc gttatcgagc cgatttgggc gattaccgta tctatacgat ttttttacga 360 gtacttagtc caagcgatct ccaaacgtaa tcattcatct ttacccatac agtgacacgt 420 aataaacatt tatgagtatt tgaaaaatat tcaaatatat gttcttcagc aagtagcctg 480 cgtatcgtta ccccctaaaa atgtcaaaac gaataaaagt ttctacatat ttagaaatgc 540 aacaaacata cctcaggatt gttcagatgt ggaaagtgaa ttatttgata gcgacggtga 600 attattagat acacaaattg ctccacaagg caatgatgaa aatagtgcta atatcttgag 660 tagtgatgag gaagttcttg atgaagacaa tataaatcag tcaagtgaca gtgattgcga 720 aaatatgcca aacaaacgca cgagacgatg catgcgattc ccttccagtt cggaagatga 780 agatggaagt aacgtaccaa atcagcaaac tgaaattgct gcagacggaa ctatttggac 840 gagaattgaa gaaggaggtg ttgttggtag attaccaatt catagtgctt tcaaagatgt 900 acacgggcca acagcacatg ctaaaagaaa cattatgaaa gggaatctaa gtagtgcgtt 960 cctattattg attgacaatc atattttgga acatatacga atttgcacag agttagaagc 1020 ctctcgagtt ttggggaaaa cctggacaat tacgcaagaa aaattgaagg catttcttgc 1080 aatattgtac gcacgcgggg catacgaagg aaatactttg agacttcaat acttgtggaa 1140 taaaaaatgg ggaccatcat ttttttctag cactatgagt agacgagatt ttacagatat 1200 tttacgatac attcggttcg ataaaagaaa tcagaggagt caacgcttgc aaacagacaa 1260 attcgcttta gtctcagcag tttgggataa atttattgaa aacagtcaaa attgcttcaa 1320 accgggagct tatattactg tggatgagca actttttcca acgaaggcca gatgcagatt 1380 tactcagtat atgccaaaca aaccccataa atttggcatc aaattttggt tagcgtctga 1440 tgtagaaaca aaatatgtgg taaatggctt tccatattta ggaaaagacg agactcgaaa 1500 tgcatcaacc cccctaagcg aatttgtcgt aatgaaactt cttgaaccgt acaccatgaa 1560 gggtagaact gtaacaactg ataatttttt tacaagtatt cctttggcgt tgaaattacg 1620 atctaaaagc acttcgttac ttggaacaat acgcgcaaac aagagggaac tgccgaaaat 1680 ttgcaaactg aaaaaagaca gcatggcacg tttctcgacg ttgttgtacc aatctaatgg 1740 atgcacactt actgtttata agagcaaacc aaataaaaaa gtacttttac taagtacaaa 1800 acataaaaac atcaaaattg ataaagcagc taaaaaaata cctgaaactg tattgtttta 1860 taataaaact aaatttggcg tcgatgtgac tgatcaaatg gcacgaaaat atacggtgaa 1920 gtctggttgc agaaggtggc cacttcaagt gtttttcaat attttagatt tagccggaat 1980 aaatagctgg atattataca aaaacacaac aggagaaaat atctcacgga aagactttct 2040 gtttcgatta gcagaagaac ttgcttcaga atatcagact tcaaggcaaa aaccacacga 2100 agctgatata tcaactaccg ctggcacggt ttctgtgcgc aaatggtgtc aaataggata 2160 ttgcaataac aataagacta caaatatttg caataaatgc aagaaaagtc tatgcggaaa 2220 gtgtacaaaa agcaaaatct acatatgtag aaattgtgag cgataaactg taaaaattaa 2280 gtcttttgtt atttgacaat ttccataaat ccatttttgt ataaaaatta atattgttga 2340 attcttttta ccatttttta agagtgaata aagactggaa actcagttaa gaattttttt 2400 tatacaccag tcatattgac tggttatggt agaaatacgt atatttaaat ggctggtaaa 2460 tctagtg 2467 // ID CR1-27_BF repbase; DNA; INV; 3702 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-27_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-27_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3702 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3702 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1598-1598 (2009). XX DR [2] (Consensus) XX SQ Sequence 3702 BP; 950 A; 893 C; 839 G; 1020 T; 0 other; aactttcgca aggtggcgct cttaggggca gccgcggttc ttgcacctct gccccaaaag 60 agcccctttt gtaccattta ctacagcaat atccgggaat gtgcaatata tgtgttacta 120 taagtgttgt aagtatctgt ggaccgtcta agacaactat ctaaggtata gaggagaatt 180 gtcagtgtaa acgtgactgt taccgagtct ggctgtgtgt agaccactgc gagcaatggc 240 gtcgtccagt atccgcaggt attctatgga tttcctccgg agcctgagtc gcgtggtccc 300 caggatcgat ccactgacta aggagagtat acgcctcaac ggactcttca agagaagcag 360 accaaggggg acaaaagcgg ggaaacacat tttacgtgag atttccacag tcttaggcca 420 tcgtccctcc tctttccaac agtcggatac ttccttcata tgcaaatcaa cgcgccatcg 480 ctacctgtcc accgtcaaaa caatcagaca tgatgatcca tcattggctg cggtaagtga 540 caacaaccgt actaagccat tccctatgcc taagatcctg ttgaccaatc caaggtctct 600 agtaaataag ttactcaaag tcatttacct caatgcgaga agtgttaagg cagttaatcc 660 tcatagaaat aagttggtac agctccagaa cttgatgcat ctaaactgtc cagacattct 720 ggctattact gagacttggc tcaacgatga tgtccgcgac caagaagtca ttccacccgg 780 ttttgtaacc taccgtaagg accgtcatca tgttcatgcc ggcagggccg gtggtggcgt 840 ccttctggcc gttcgttcag acgtatgtag tatgcgacgg tcagacctcg aacccaagga 900 tgaaattctc gtctgtgaag ttcagcctcc cggtttgggc aagttggcca ccgtgctctg 960 ctacagacca ccctctgggg atttgtcaac attcaccgtg aacttgtctt cagtcttaga 1020 gaaagttcag agtgagtacc gaatgtgttg cgtactttgg gactttaatc tcccacgggt 1080 tgactggacc agttgtgccg tgacggacaa gggcaaagaa tctgactttt gtgatgtaat 1140 aaacgatcac ttccttcttc agcacaacac aattccatcc aactccagca acaacatgtt 1200 ggacttagtc ttcagcaaca tccccgaaag ggtgtcggac ataagtgaac ttccatccga 1260 attcgatacg gaccatacta tcttagaatt cggtatccat tgcaaactac agtccaagcg 1320 tggcctctct cggaaggttt acaactacag tcgtgccgac tgggatggtc ttaggtcaca 1380 tctcgcctct aggcaactag ctgaagccgt tcaacagtgt caggacattg actccgcctg 1440 ggaattctta tcatctatga tacggtcagc tgtcgaccag ttcgtgccct ggcgcagagt 1500 taaggactca acaacgcctc catggataga cggggaagtt cgagacatcc agaaccgcaa 1560 gcagacggca tggcgcagag ccaaacggac agactctcca gcacactggg cgaagtatcg 1620 agtgctccgc aacaaactga agaacgttct gtctcgtaag catgtgggct atttagacag 1680 cctatcatct accctgcagg acgcaccgaa gcgcttctgg acatttgttc gtgcgaaatc 1740 gaaatcaaga actttgccat ctgtcgtata tcttgaggac tctgttgcca aatcggctgg 1800 cgacaaggct agcctgttca acaagtactt tttctccact ttcagtcaag ttgaccagaa 1860 tgttagtgca cctactattg acatcaaagt tattgacagt ctttgtagcc ttcaatttaa 1920 tgtagactca gtccgagatg tactttctaa tctggacaca agtaaagccg ttggtcctga 1980 cagtatttcc ccacatgtgt taaagaattg tgcccagtca attgcacttc ccctgacttt 2040 gctgttcaac caatccgtta gttcaggtgt agttccttct aactggaagg aggcaaatgt 2100 gtccccagtc ttcaaaaagg gggacaaaca agtcgtctca aactacaggc ccgtttcttt 2160 attgtctatc gttagcaaag tcatggagag gtgcatatac gacatagtgt ttccaattct 2220 ccacgattct attcatggcc tgcagcacgg gttcattaag gggcgttcaa caaccactca 2280 actgttagag gtataccata acgtaggctc tatcctagac agaggtgggc aggtcgatat 2340 gcttttcttg gactttgcca aggcgttcga ctcggttcca cactctcgtc tgctccacaa 2400 gctacagatg tatgggttta acggtaaact gctgtcatgg tttaactcgt accttactga 2460 ccgaaagcag cgagtggtgg ttgaaggcag tcactcagag tggctacccg ttacttccgg 2520 tgtaccacag ggctcgattc taggcccaat gttattttta ctgtatatta atgacttacc 2580 tagcactgca aagaactcca ttgtagcgtt gtttgctgac gattcgaaat gctacagaga 2640 gattcgcaac ctagatgact gtcataaact acaggccgat atttcgtcta tgtatgattg 2700 gagccttcgg tggggtatgt cattccaccc ctcgaagtgc aaggtgctcc gtttgactcg 2760 ctccaagagc ccaataacct ttgcatacta tatgtctgac attgcactgt ccacagtcag 2820 tagtatgaat gacctgggag ttcttgccca gtctgacctg ttgtggaaca gccatattgt 2880 gaacattgta aagaaagcca attctatgat tggatttatc atacgcacag taggctttga 2940 ttccagcctg gaggtgcgca aggcattgta tgtttcactt gttagatcag tacttgaata 3000 ttgttgccct gtgtggtctc cactgtcacg taatcacatg tacctgttgg aaggcgtgca 3060 gaggcgggcg actaagttca ttctgcgtgc caatgacaat gtaaatgccg gtgaacttga 3120 ctatagagat agattactgt gcctcaacat gttacctctc tcatacagga gagagatcgc 3180 tgatatcatg ttatttgtta agtccctagc caacatgaat gatctggacc tgtctaacta 3240 tgtatcattt tctgctagac caacaaggag tagtcggtcg ttcatgctga tgccatgtcg 3300 ctgtaaaaca tctacatttg ctatgtcata cgtaccacgg ttagttaccg aatggaacaa 3360 gcttgatgtc acagttcgca gcataggagc tgtgtcgtcg caccagtccg acctattgtc 3420 tttcaaacag ctacttgtga aatctactgt tgatagattc agaagccact ttgtatccga 3480 taacctctgt acctggtcta ctgcatgtaa gtgcgcctcc tgccaggatt taagggctcg 3540 gtagtctgat cgattgttta tgtcgttttg tattaatctt aatctgctgt atctattgta 3600 tttttatgtc ggggcgaggg ccagtaaagg tgtcaccacc tgttcccccg ccccttccac 3660 ttgtgtggtg aattttaata aacaaataaa caaataaaca aa 3702 // ID hAT-44_HM repbase; DNA; INV; 3103 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-44_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3103 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2032-2032 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 417..2363 FT /product="hAT-44_HM_1p" FT /translation="MSEQIYQHISFFGKGSTRKIYCKYCDKQITAKSTLRW FT EQHLRGCRKTSDEVKLCFKKIKKEVQSVIVETKGYDNMASQTSENLVFKES FT GQPAKRTMLNSYFDTISKDDIXKIDKSYARTFFHTGVPFALADSSAWKQHH FT ANLRPAYKPPSSKCISGRLYKDALNDEKHAIKKYVDSSEYVSIVTDGFSNI FT NANHLVSYSIHVENRTMKPIAYKIEPTGQEQQTGINIAKRIENVILEIGVD FT KVTSIVTDNASNMRAAWDIIEKKYPKIFCNGCAAHTINLLVKDICLLPEFV FT DILQKSGKLTAFVKQRTSLIDQFRIIQNRVKQENNLKKMRALSHVVLTRWY FT SHHTSVARNLENKLVHLNLINSGAFSRVSISKKKSDYVNIVQDNFFWEECQ FT RFINVMKPLSKLVGKLESDSCLLSEVYIGFVNLMEIWKEDSVLKTLVLSRW FT EFIHTPSMGFAYFLDPRNHGGRNMYTESPPSSKSDLVIVLELLPKYIVETR FT GLCNYEISKKEIKSFQRLCANPTPDFATQIKLMDPDIWWGVEGASKFPNLA FT KVARIVFTIPTSQAASERIWSLYDFIHSKRRNRLAKDKAIGLVLMYANATL FT HEKEANIADIMLGNTSDHDESDESDEDGVNVDFTLETSVDTSDLSSQLXN* FT " XX SQ Sequence 3103 BP; 1106 A; 413 C; 508 G; 1074 T; 2 other; tagagatcgc cggttacagt atacggtata ctaactttac ggtatacttt accgtatagt 60 tggtatacta aaaattacta taccaattag tatacctgtt atttatgaaa tattttttaa 120 atgattttat aattcaataa tgatattatc ttaattgatt aaataattgt attatatgta 180 ttacgtgatt acttttttag ataaataata caatatttat taataataaa tatagtatta 240 tttatagtag tattaaaaga attatcattt attaaagtgt tgtttttttg taatttaact 300 tttgaaaggt ttttgaattt atttatttaa attttaattg ttttttgttc ttttttattt 360 gtttattgta tgtataaaag taacatgttt actcttacct tgaagtaaaa tatttaatgt 420 cagaacaaat ttaccaacat atctcttttt ttggaaaagg ttcaacaaga aaaatttatt 480 gcaaatattg tgataaacaa attacagcaa aatcaacatt aagatgggag cagcatttga 540 gaggatgtag aaaaacgtct gatgaggtta agttgtgctt taaaaaaata aaaaaagaag 600 ttcagagtgt tatagttgaa acaaaaggat atgataatat ggcttcacaa acctctgaaa 660 atttagtttt caaagaatct gggcagcctg ctaaaagaac tatgctaaat tcatactttg 720 acactataag caaagatgac attgawaaga tagataaatc atatgcaagg acgtttttcc 780 atacaggagt accttttgca ttagctgatt cttctgcatg gaagcaacat catgctaatc 840 ttagacctgc atataaacct ccatctagta aatgtataag tgggcgtctt tataaagatg 900 cattaaatga tgaaaaacat gcaataaaaa aatatgttga tagttctgaa tatgttagta 960 tagtgactga tggattttca aacattaatg cgaatcatct cgttagttat tcgattcatg 1020 ttgaaaatag gactatgaaa cctattgctt acaaaattga gcctactggt caagaacagc 1080 aaactggcat taatattgca aaacgcatag aaaatgtaat tcttgaaata ggtgttgata 1140 aagttacaag cattgttact gataacgcat caaacatgag agctgcatgg gatattattg 1200 aaaaaaaata tccaaaaata ttttgtaatg gttgtgctgc ccatacaata aatcttttag 1260 taaaagatat ttgccttctt cctgagtttg tggatatttt acaaaagtct ggaaaactga 1320 cagcctttgt taaacaaaga acatcactta tcgatcaatt tcgtataatt caaaatcgtg 1380 ttaaacaaga aaataatttg aagaaaatgc gagcattatc gcatgtagtt ttaacacggt 1440 ggtatagcca tcatacatct gttgcccgta atcttgaaaa caaactagtt catttgaatt 1500 taattaactc tggcgccttc tcgcgtgtat ctatatcaaa aaaaaaatca gattatgtaa 1560 acattgtaca agacaatttt ttttgggaag aatgtcaacg ttttattaat gttatgaaac 1620 cattatcaaa attagtaggt aaactggaat ccgatagttg tctactttct gaagtttata 1680 taggatttgt taacttaatg gaaatatgga aggaagactc agtattgaaa actcttgttt 1740 taagcagatg ggaatttatt cacacgccat ctatgggatt tgcatacttt cttgatccac 1800 gaaatcatgg tggcagaaat atgtatactg agtctcctcc atctagcaaa agtgatcttg 1860 ttatagtact tgagttgctt ccaaagtaca tagttgaaac aagaggactt tgcaattatg 1920 aaatttcaaa aaaagaaata aaatcatttc agaggctatg tgcaaatccg actccagatt 1980 ttgcaactca gattaaatta atggatcctg atatctggtg gggagttgag ggtgcaagta 2040 aatttccaaa tttagcaaag gttgcacgaa tagtctttac tattccaact tctcaagctg 2100 caagtgagcg aatatggagt ttgtatgatt ttattcattc aaagcgaaga aaccgacttg 2160 caaaagataa agcaattgga ttggttctta tgtatgcaaa tgcaacactc cacgaaaaag 2220 aagcaaatat tgctgatata atgcttggca atactagtga tcacgatgag agtgatgaga 2280 gtgatgagga tggtgttaat gttgatttca ctttagagac ttcagttgat acaagtgatt 2340 tatcgtcaca gttggytaat taaatgcttt atttaattac taagaagttt taaaaatctt 2400 tatattacat cgacaaatct ttaaatcatc gaaaaatatt agtcgaattt tgtttaaatg 2460 aaaataaaaa tgtttttgtt tactttagta tttatttatt cttccacctg attatgttta 2520 ttttgctgtc tcaaaggcat cctaaaaaag tagagagcag acaaagtcca gatggccaac 2580 aaatctataa caaggttagg gccccagagt gcaaaaactg caaaaaattg aacaagggag 2640 gagttttgtg tcatgtgata agtttctgct gtttctattc ttctaaaatt atttgaaaac 2700 accagaactt atttttaaaa tgcttgttct agtaagtggt ggattttgat caacgctttt 2760 tatttaatta cttaagttat tagttttaaa aatctttaaa tatatttttg ttcttacttt 2820 aaatcagtta aatatcaaat ccaataaaaa cattcaacct aatgctaatt tgaatattta 2880 gctaaggacg gggttgggaa attaaaaaaa tttgtctggt ccccttgcga ctgaacaaat 2940 aacatttttt aacttctctt atagtttcgc taacaaattt attaagtaaa aaaagttgtt 3000 tattgcaatg acgtcattgg tatacttaac agtgtacttt acggtatact atacagtaaa 3060 cggtatacta aaattttagt ataccgtaaa ctagcgatcc cta 3103 // ID Gypsy-9_SI-LTR repbase; DNA; INV; 201 BP. XX AC AEAQ01022575; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_SI_; KW Gypsy-9_SI-I; Gypsy-9_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-201 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022575; Positions 5142 5342. XX SQ Sequence 201 BP; 46 A; 31 C; 52 G; 72 T; 0 other; tgtaaagtgt agatcgatta aatgaagaga ctgtcgcttg ttagcaacgt gttgttgtta 60 tgatcggact tttggttttg acatttcgtc ttgccaataa cgtcgttttt tctaataaaa 120 gcctgttttt ttattagtgt gcgcgtgtgt tactgaggtg tgcgagtgcg aaagtacgag 180 gcgattcacc cggtcgtaac a 201 // ID Homo4 repbase; DNA; INV; 3423 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo4 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo4. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-3423 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1717..3000 FT /product="Homo4_1p" FT /translation="LFKIVIFVLYGALHKGWGSCGAFNGHEIYERNDMHWF FT VIIIYLFIFFIITFIYLYFFHYMYLGLNIRAKIVEILTQFGCDLELDNPVI FT VTDRGSNMVKAFCGFEKINCANHLLNNAIEKAIDAVPAVGNIVSMCTKLVK FT YFKKSGINFSLGLTLKSFCPTRWNTVYYLLKSIEINWIEITTILRDKNQIA FT RIEGINVSEVGSIVRILEAFETVSKKIEASQRPTIHLILPNLNKLHKICEI FT DYTDIENIQDLKLKLQTQISITILSNLSKYHNIALFLFPPTNKLKQFTDTE FT KDTIINECKNIMKHFRAESIATTDITEAEEDEFAEYIEHQQVDSTTDIIGQ FT EIAGYSNLNIAYHSNFDVLAWWNMHQHIFPLLHKTSCKIFCIPASSAASER FT VFSKARNLITEKRCLLATNPDHINKIMFLNSNMN" XX SQ Sequence 3423 BP; 1115 A; 621 C; 590 G; 1097 T; 0 other; tagagagctg catggcagca ctcataacct agcaaaattg agtccacttg aattgagact 60 gagtgagtga atgagggaat acaaactagt tgatagaatg tgagttcgta tggactcaat 120 gcctcatatc tcactccatt aagtaaaaag gatgagaata aataagacgt aaataggaat 180 acaacattta accgaagcat acaggcgtcg aggcacttga gtttctgcat gcatatatct 240 tcgtgtcaac acttcgtatt tgttcggctc cgtcgcttga gggttgagca gtgaaaaaat 300 ctaatgccaa aatcagtgcg ctgcgagcgg tcaagcagaa agcacgcatt ttaatatgag 360 gcctggttgc gcacttattt tttcttattt tccttctctc tctctgccct ttccatttag 420 agagtatatc gaacttttga aagggtttgc cttgctcatg catgttagtg ttttcatgtc 480 attatgttat gtttcgatct atccataatc ttattaaatt ttatattaat taattataat 540 acaaagtatc taattgacca ttcagaattt actaagcgga ccaaaaagat ttatgcgggt 600 cataaaattc atacttatgt actatttaaa aatgccagga tgcttatgcc aatcgcttat 660 gtatttgctg tttattctcg tatgagacac taatatttgc taatcttttt tggtactcta 720 tctatctccc tctctcaata tgagcgaaaa ttgaaattcc ttcgcgacga tcccgttata 780 aacacactgt tgcaatctct tgctatatgt aacgcctgtg tgcgttagtc ttgctgccta 840 tgtaatatgc ctgtagaaaa tgtgagcgaa aattgaaatt ccttcgcgac gatcccgtta 900 taaacacact gttgcaatcc cttgctatat gtaacgcctg tgtgcgttag tcttggtgcc 960 tatgtacaat gtaaattcga aaatttaccg tttgttttgt ctacttacaa tgcttaatag 1020 tattaagaaa aagttcattc cttcgacacg ttgtcagcaa gcagaccttt cagctatgga 1080 cggagaaaat aattctatcg aaattgtatg taaactgtgg ttgtttaaat tgacttgaaa 1140 taaacatatg tacatatgta tgtaggttcc tgaaaacgca gaggatatta agcaacatct 1200 tctgagtggc atatataaac tgactgcgaa gagaggtcgt agcgaagtgt ggaattattt 1260 ctcagtaatt gaaaaagaaa atggaacgca attggctgac actgtagcct gtaagatgtg 1320 ttattctgtt tttaagttta caggaagcac gtcgaattta gtcaaacata aatgttataa 1380 agtgaacgcg aaagaaatac gcaaatcttt agctatcgaa gtaaatgctg aaacaaaaca 1440 ggaatgtgta actatcgcaa ctgaatgggt tattaaaaac tgtcgccctc taaaaatgat 1500 agacgattct ggtctaaaaa agttttcatc atttttaatt aatgttggcg caacgtttgg 1560 ttccaatgtc gacgtagaca aattactgcc acacccaaca acaatatccc gaaatattat 1620 aacattgtat gagtcacatt ttggcccaat aaaagccgaa atacaaaaat ataaatcctt 1680 cggttatgct attaccactg acttatggac ggatagcttt ttaaaatcgt catatttgtc 1740 ttgtacggtg cattacataa aggatggggt tcttgtggag cgtttaatgg ccatgaaatc 1800 tatgaaagaa acgacatgca ctggtttgtt attataattt atttatttat tttttttatt 1860 attactttta tttatttata tttttttcat tacatgtatc taggtttaaa tattcgagcc 1920 aaaatagtag aaatactaac tcaattcgga tgcgatttag aattggataa tcctgtaata 1980 gtaacagatc gcgggtcaaa tatggtgaaa gcattttgcg gcttcgaaaa aattaattgc 2040 gctaaccacc tgctcaacaa tgctatagag aaagctattg atgctgtccc cgcggtagga 2100 aatattgttt ccatgtgtac gaagcttgta aaatatttta agaaatccgg tattaacttt 2160 tcattaggat taacattaaa aagcttttgc cctactcgct ggaatacagt ctattaccta 2220 ttaaaatcaa ttgaaattaa ttggatagag ataacaacca tattaagaga taaaaatcaa 2280 atagccagaa ttgaaggaat taatgtaagt gaggtaggtt ccatagtccg tatactagag 2340 gcatttgaaa ctgtgtcaaa aaaaattgaa gcttctcaac ggccaaccat acatttaatt 2400 cttccaaatt taaataaatt acacaaaatc tgtgaaatcg attacacaga tattgaaaat 2460 attcaagatc tcaagctaaa actccaaacc caaatttcaa ttaccatcct ttcgaattta 2520 tctaaatatc acaacatagc cctatttctc ttccccccca caaacaaatt aaagcagttc 2580 acagacactg aaaaggatac tataataaat gaatgcaaaa acataatgaa acattttcgt 2640 gcagaaagta ttgctactac tgacatcaca gaagcagaag aagatgaatt tgctgaatat 2700 attgagcatc agcaagttga tagcactaca gacattatag gacaggaaat agccgggtac 2760 tcgaatctaa atattgcata ccattcaaac tttgatgtct tagcttggtg gaatatgcac 2820 caacatattt tcccattatt acataaaact agctgcaaaa ttttctgtat tccagcgagt 2880 agcgccgctt cagagagagt attttctaag gctagaaact tgattactga gaaacgctgt 2940 ttactagcca cgaatccaga tcacataaat aaaattatgt ttctcaattc taatatgaac 3000 taaacaaata aatatattgt attacaaaat tacatagtct ttcgtttgtt tattatttat 3060 gagaagggta tgcaaccatt cgtacgcata aagaatgtgt tgtcgctacg aaattgcaac 3120 gctaaatttg ccgtgtgtct gccgtctggt ttgcacacac acaattacat agcccttcgt 3180 gttttccttt tttttttttt ttcgttgctg gcagcgcagc aactcataca ctcggcaaag 3240 caactcatat actcagactc aaacttattt attctttttc gctcaataca atattgtgca 3300 tccaagtcat atacgagcgc actcacacac tcatacaatt ttcgttttgc gctcggtgcc 3360 actcatctca atatgagcat tgttaattcg aagtcgtttt ttcggcactc atgcagctct 3420 cta 3423 // ID hATm-21_HM repbase; DNA; INV; 2905 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 09-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-21_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2905 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1915-1915 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 309..2633 FT /product="hATm-21_HM_1p" FT /translation="MHFNINILSFSQIIMTGLTKISTRMATDQLAFGQPAE FT LPVSVLPTELDIARSFLYEKQKWMIETSGKKEPDNSIIAEIIVSQVESVYK FT RASLPTVRKDTIKDRVLKVYEKRQSIIRIPTARRYKNPKCPNPNDLSDYFK FT NKKVEFRKEISGLFEVVDNENVPRMERDFLEDQRSIRKMIISSTVDKEATR FT KQQQKESSKKADDTRGVKERQRRESQFETVASVSVKIGRDEDENSEEDNST FT TSPDINTDSDFQNKPSTSTSASSTASVVEVALKNRGRDLRQIAEAKVRNLI FT SDRATASIVNATLRIYNIPEIVDKSKVRRAVNVILEEDDAVPLVVCCGIGY FT DGRKDVTLTQERVGEKHYQSRKTEEHVTIVSLPDGKFLGHVSPTDGKAITV FT SKSLVQFLQEKNIIDQIEALACDGTNTNVGAEGGINHLIEVSIGRALQWNV FT CLLHANELPLRHIIIELDGPTAGANTFAGPIGKLLPDVLNYPVIKYKRFGR FT SQPLSKMPNEVWKDLSTDQKYMWRITSALISGQFDKDLERLKIGPCSHSRW FT LTTACRICRLYASTTKPSTVLNILTGYIVNVYAPMWFTVKYYELGINGPQN FT LFFLMQRSELIEDGAAKKIVQRCIQRNAFYAHPENILLAELASSHKPDRID FT AVQTIIEARANHQDQASSVRLFRVPELNFKASKWQLMIKWSETDILEPPLI FT RNMSNSELLALVAEPLIVPKFKCHTQMVERAVKEVTKASQKVVGTKRRDST FT IKTSMVNRAKYPKQESKKDFIPK*" XX SQ Sequence 2905 BP; 984 A; 512 C; 587 G; 821 T; 1 other; ttaggcggtt cataaaaaac ttttttttga aaatgtccgc tccaaagttg ttaatttact 60 tcaatggagt acacatagct atattccaaa tttcatcaaa atcggttcat atcttgaccc 120 tcctccattg atttgaaatt tctgactttg cgcaacagga aaatggccca aaaaactaca 180 atatttttca ctttattaca aattgaacaa tattaagcag gaatttcaat gtttttaaag 240 aacaataaaa gatgcaggat ataatttcat gactaacatc tttatttgat ccacgttttt 300 ctcatagtat gcattttaat atcaatattc tttctttttc acagattata atgacaggtt 360 taacaaaaat atctacccga atggcaacag atcagcttgc ctttggtcag ccggcagaac 420 tgcctgtaag cgtacttcca acagaattag atattgctag aagcttcttg tatgagaaac 480 aaaaatggat gatagaaacc tctggaaaaa aagaacctga taattctatt attgcagaga 540 ttattgtgtc acaagttgaa agcgtttata aacgtgcatc cttaccaaca gttagaaaag 600 acacgattaa ggaccgtgtt ctcaaagttt atgaaaagcg tcaaagtata atccgaatac 660 caactgccag aaggtataag aacccaaagt gtccaaatcc caatgatctc tcagactact 720 ttaaaaataa gaaagttgag tttcgcaaag aaatttctgg tttgtttgaa gttgttgata 780 atgagaatgt tcccaggatg gagagagatt ttctcgaaga tcagcgatct attcgaaaga 840 tgatcataag tagtacagtg gataaagagg ctacaagaaa acaacagcag aaagaatcaa 900 gtaagaaagc cgacgacacc aggggtgtaa aagaacgaca acgacgagaa tcgcagtttg 960 aaactgttgc ttccgtgagt gtgaaaattg gcagggatga ggacgaaaac tccgaggaag 1020 ataacagtac tacaagtccg gatattaata cagattcgga cttccagaat aaaccaagta 1080 catcgacctc agcctcaagt actgctagtg ttgtagaagt tgctttgaag aatcgaggaa 1140 gagatcttcg acagatagca gaagcaaagg tgcgtaattt gatctctgat agagcaactg 1200 cttcgattgt taatgccact cttcgtatat acaatattcc cgaaatagtt gacaaatcaa 1260 aagttcgcag agctgttaat gtaatattag aagaagacga tgcagtacca ttagtggtgt 1320 gttgtggaat tggttacgat ggtcgaaaag atgtgacttt aacgcaagag agagtaggtg 1380 agaaacacta tcaatcaagg aaaactgagg aacatgttac aatagtttca ctccctgatg 1440 gaaagttcct gggtcatgta tcaccaaccg atggtaaagc tataactgtt tctaaatctc 1500 tcgtccagtt tctacaagaa aaaaacataa ttgatcaaat tgaagccttg gcatgtgatg 1560 gcactaacac aaatgttgga gcagaagggg gaataaacca tttgattgaa gtaagcattg 1620 ggcgtgcttt acaatggaat gtttgccttc tgcacgcgaa tgagttgcca ttaaggcaca 1680 tcatcattga gttggatggt ccaactgcag gtgcaaacac ttttgctgga cctattggca 1740 agttactacc agatgtcttg aattatcctg tcatcaaata caaacgcttt ggtcgttctc 1800 aacctctatc taaaatgccc aacgaagttt ggaaggattt gtcgacagac cagaaataca 1860 tgtggagaat tacatcagcg ctcatttccg ggcaattcga taaggattta gagcgtctga 1920 agattggacc atgtagccac agtaggtggc ttacgactgc atgccgtatc tgtaggctgt 1980 atgcgagtac aacaaaacca tcaacagttt taaacatttt gactggatat attgtgaatg 2040 tttatgctcc tatgtggttt actgtgaagt attatgagct tgggattaat ggtcctcaaa 2100 atctgttttt cctaatgcaa agatctgagt taatagaaga tggagctgct aagaagatcg 2160 tacagcgctg cattcaacgg aacgcttttt atgcccatcc agaaaatatt ttgctagctg 2220 aactagcctc atcccataag ccagaccgca tagatgctgt tcaaacgatc attgaagcca 2280 gagctaatca ccaagatcaa gcttcttctg tacgtctttt ccgggtgcct gaactgaatt 2340 ttaaagcatc aaagtggcag ctaatgatta agtggagtga aactgacata ctagaacctc 2400 ctttgattag gaacatgtca aatagcgaat tgttggccct ggtagcagaa cctctcattg 2460 taccaaaatt taaatgtcac acccaaatgg ttgaacgagc tgtgaaggag gttacaaagg 2520 ccagtcagaa ggtagtaggt acgaagcgac gggattcaac aataaaaaca tcaatggtga 2580 atcgcgccaa atatccaaag caggaatcaa agaaggattt tatcccaaaa tagtcatatt 2640 ttcccttaat aaaaaatatt tttttatttt actatagact atgttgtata acttttaata 2700 aaacgtwaaa ataagtgaaa aatattgtag ttttttgggc cattttcctg ttgcgcaaag 2760 tcagaaattt caaatcaatg gaggagggtc aagatatgaa ccgattttga tgaaatttgg 2820 aatatagcta tgtgtactcc attgaagtaa attaacaact ttggagcgga cattttcaaa 2880 aaaaagtttt ttatgaaccg cctaa 2905 // ID R2A_TM repbase; DNA; INV; 1410 BP. XX AC AF015817; XX DT 26-AUG-1999 (Rel. 4.07, Created) DT 26-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Tenebrio molitor retrotransposon R2 reverse transcriptase gene, DE partial cds. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2A_TM; R2_TM. XX OS Tenebrio molitor OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tenebrio. XX RN [1] RP 1-1410 RA Burke D.W., Malik S.H. and Eickbush H.T.; RT "R1 and R2 Provide an Estimate of the Age and Stability of RT Retrotransposons."; RL Unpublished. XX RN [2] RP 1-1410 RA Burke D.W. and Eickbush H.T.; RT "R2A_TM."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015817; Positions 1 1410. XX SQ Sequence 1410 BP; 441 A; 334 C; 357 G; 278 T; 0 other; gcctacgcgg acgatatcat cctgcttgca gataacgtcg agactgcgca aagacttctc 60 aacgcctcca caagattctt tgaagaccgg gggctcgaca tcaaccccgc gaagtgcgag 120 tctctagtga tgcgtactct cccgtcaaag aagaaggtct tcactgtgac aactccgcaa 180 ttctacgtta aggctgaacc gatcaagcca atagaagttg gaaaggcttt taaatatctg 240 gggcagcagt tcacatgcac gggttcaacc aactgctcaa ccaaagatct aagcgaacag 300 ctacttagaa taaagaaggc cccactgaag ccccaacaaa aaataaacat catcaaaacg 360 tatctgatgc cggcctacat ccactcaatg cagaacccag ccgtcaacaa aaagatatta 420 cgagaagtgg ataggaaaat cagaatggtc gttaagggga tactacatct tccactccat 480 ttgtccaaca ccgcaatata tgccccggcg aaaatgggag gtctaggaat gttttccttt 540 tcaagaaaga tccccatcat tgtactaaaa cggttaaaca acttaagccg tacttgctcg 600 aacttccacc ttgtgttgcg agaggccgcc ccatgggtca atagattaaa gaaaatggta 660 aggccagacg taacgacaaa agagcaagtg gacagagcaa atggggtgga acatgagggt 720 tcgtactacg gcggcggtac gatgcaatgt cggaacgact ctgcttcaaa tacttggatc 780 aacacccccc caaggtattg gaccggatcg gattacgtga aagcggttca gctaagactg 840 aattgcctgg ccaccagagg ccttccctat aacccgccgg aacaaagaag atgcagagcg 900 ggatgcgaca gggttgaaag cctgtcgcac gtcttacaga aatgccctct aggacacaca 960 atgaggatga gaaggcacaa ttacatcgtt aaaaggctta agaccatggc tctgaagaag 1020 ggatggacgg tggaggagga gccgatatgc gatgcgcgcg gtgtactgag gaagccagac 1080 cttatcctca gcaatgaaga gacggcccta gtagtggacg ccatggtgtc ctgggagtgc 1140 cccagagacc ttgcagagac attcgattgc aagaggctgg tatatgacca gcctgagttt 1200 accagcgtac tcgaatctaa ataccatcca aggaatatca gagtactacc cttcatcatc 1260 ggcgcaagag gtatgtggtg cagacagagc accgaaacac tagaagcgat gaaggtgaac 1320 aatgtgagca atcgaaggga actggtccat acaacactga gaggaagctg gagtatacat 1380 cgagaattta cgaggagaag ctgggaatag 1410 // ID BEL-645_AA-I repbase; DNA; INV; 5996 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-645_AA_; KW BEL-645_AA-LTR; Pao_Bel_Ele219; BEL-645_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5996 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4989-5507] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 61..2172 FT /product="BEL-645_AA-I_1p" FT /translation="MASDSDSVNQRVLDFTETPCGICGPSTQDEEMIGCDG FT CSAWFHSRCVNVVSGKLPKKWFCQNADCQVKAEEYRKQREARKQQRSRKEV FT DESDKSSVNSRLGASSVEAKVRALEERQKRQYEEMEAELQLRKREREMQRA FT FERKKMELELQMRAEEEEEQKAWQAEMLRKKAEQIERMKANQESFEKKMAA FT MDKQLAEFSIQKGPTSASKLGGDGRASKVNEKVLKLSKENVKKLAADDDDL FT DEDSEDDEESEVSSQSSESSDEECPIPPSRHLKRKGMQKVDEVSQNGLEQQ FT RTGPTKAQLAARKGLTYKLPKFSGKPAQWPLFYAAYKASNEACGYMNHENL FT MRLHEALEGDALELVSGQLLLPQSIPRVIEKLRRHFGRPEQLLESLLDKVN FT RLQPPKADNLRSFVPFGNTVEQLCSHLEAAELHQHLVNPLLIKSLVDKLPD FT REKREWIHFRRGRGETTLRTLTDFLMDIVADACEANVDVEFKPPQQPKAVP FT QSGKKLHKEKGGLYLHNEASNPAATSSEVKKLKPCKMCQGTNHRLRHCADF FT KKLRYVNRLRLVTREKLCHVCLNEHDGQCKFRIRCNVGECREPHNPLMHPV FT ENTVGLSAHIRTNCTVLFRIVPVQLHCGGKPLRCSHFSTKVPPSRWWRKSS FT PIAWVLSEYKSDSPSGGRETYREWRKPAEPACGYRAQTPVPTTRCSCTRFI FT RLVN" FT CDS 2912..5134 FT /product="BEL-645_AA-I_2p" FT /translation="MLVPLVKVLAGFREWSIAFGGDLKEMFHQLKIRAEDK FT QKQRFIFRKHSEDPPSVYVMDVATFGSTSSPCSAQFVKNRNAEEFASQYPE FT ASAAIIHRHYVDDYFDSVDTEEEAVKRAQEVRLVHQKGGFEIRNWVSNSPE FT VLRSLGEEKPAAPVHFGRDKQTSNERVLGVIWDPDMDQFAFSTTHREELLP FT YLYEGKRPTKRLVASCVMGFFDPLGLLSPFTIHGKIVIQYLWRSGCDWDQE FT IDPDSWQLWKRWTALLPVVEAIRIPRCYIGGAKSAEVDSLEVHIFTDASEH FT GYGCVAYLRAVINGAVHCSLMMSRAKVAPIKRQSIPRLELMGAVLGARISQ FT TVLDMHSYQICRTVFWTDSRTVYSWLHSDQHRYKQFVAFRVGEIQELTKVT FT DWRWIPTKLNVADVLTKWGQGPPLQNDGEWFNGPSFLYRPEEQWPTQEATI FT EETHEEARGVVLFHGVIDVEPISRWTKLLRVTASVVRFIANCRRKKRGEPI FT VTTRATAKQRQMIEATAANYAALHAPLQREELQRAETLLWRQAQWDSFPDE FT MSALTNNLKRQPGRPVENIKKSSPIYKSSPVLDDEGVLRMDGRLANSEESS FT FDKKHPIILSRAHEITQRLIQHYHEEFGHANSETVFNEMRQRFQIPKMRPA FT IQQVVKNCVWCKVNKCRPRSPRMAPLPVERVTTQHRPFSSVGLDYLGRWKS FT RWGVERRSVGWQFLPVWRCGGTPRGRAQPYHGIVPDGDHAL" XX SQ Sequence 5996 BP; 1577 A; 1536 C; 1798 G; 1083 T; 2 other; ttctcaaaaa taaaktgaca tacgattgtg tccttgtgag aagccaatct tgccccaaat 60 atggcttccg attccgattc cgtcaaccaa cgcgtccttg atttcacgga aactccgtgc 120 ggtatctgcg gtccgtcgac tcaggacgaa gaaatgattg gttgtgatgg ttgttccgcg 180 tggtttcatt cccgttgtgt caatgtggtg tccggaaagt tgccgaaaaa gtggttctgc 240 caaaacgcag actgccaggt gaaagctgag gagtaccgga agcaacgaga agccaggaag 300 cagcagcgca gccgcaagga agtcgacgag tcggacaaat ccagcgtcaa ctcccgctta 360 ggcgcatcta gcgttgaagc aaaggtccga gcgctagagg aacgtcaaaa gcggcagtat 420 gaggagatgg aagcggagct gcagctgcgg aaaagagaaa gggagatgca gcgtgcmttc 480 gagcgaaaga agatggagtt ggagctgcag atgcgtgccg aagaagaaga ggagcaaaag 540 gcttggcaag cggaaatgct ccgaaagaag gcggaacaga tcgagcggat gaaggcgaac 600 caggagtcgt tcgagaagaa gatggcggcc atggataagc agttggcgga gttttcgatc 660 caaaaggggc caacatcggc gtcaaaactc ggtggcgacg gtcgtgccag caaagtcaac 720 gaaaaggtgt tgaagctgag caaggagaac gtcaagaagc tagcggcaga cgacgatgac 780 ttggacgagg atagcgagga cgacgaagaa tccgaagtgt cgagccaatc ttccgaatcg 840 tcggacgagg aatgtcccat tccgccatcc aggcacctca agcggaaagg catgcagaag 900 gtggatgaag tgagccaaaa cgggcttgag caacagcgga ctggaccgac caaggcgcag 960 ctagccgcga ggaagggact aacctacaaa ctcccaaaat tctcgggcaa accagcgcag 1020 tggcctctgt tctacgccgc ctacaaagct tctaacgagg cctgtggtta catgaaccac 1080 gaaaacctta tgcgactgca tgaagcgctg gaaggcgatg ccctcgagct agtttccggg 1140 cagttactgc tccctcaatc aatcccgagg gtcatcgaga agctgcgtcg gcatttcggc 1200 cgcccagaac aactactcga gagcctgctg gataaggtga atcgtctgca acctccgaag 1260 gcggataatc tccggagctt cgttccattc ggaaacacgg tggagcaact ctgtagtcat 1320 ctggaggccg cggaacttca tcaacacctt gtcaacccac tgctgatcaa gtcactggtc 1380 gataagctgc cggatcgcga gaagcgtgag tggatccact tccgaagagg ccgtggcgaa 1440 acgacgttgc gaacgctgac agacttcctg atggacatcg tggcagatgc ttgcgaggcc 1500 aacgttgacg tcgaattcaa gccgccgcag cagccgaagg cagttcccca atccgggaag 1560 aagctgcaca aggagaaggg tggactatac ctccacaacg aagccagcaa cccagccgct 1620 acgtccagcg aagtgaagaa gctcaaaccg tgcaaaatgt gtcagggaac caaccaccga 1680 ctgcgacact gtgcggactt caagaagctg cgatacgtta accggttgag actggtgact 1740 cgagagaagc tgtgccatgt ttgcctcaac gagcacgacg ggcagtgcaa atttagaatt 1800 cgctgtaatg tcggcgaatg cagagaaccg cacaacccgc tgatgcaccc ggtcgaaaac 1860 acggtcggat taagtgcaca cattaggacc aactgcacgg ttctattccg gatcgtgccg 1920 gtacaactcc actgcggtgg aaaaccgcta cggtgctcgc atttctcgac gaaggtgcct 1980 ccgtcacgct ggtggagaaa aagctcgccg atcgcctggg tgctttcgga atacaagagc 2040 gactcaccat caggtggacg ggaaacgtat cgagagtgga ggaaacccgc agaaccagct 2100 tgtggatatc gagcacaaac gccggtgcca acgacaagat gctcctgcac acggttcata 2160 cggttggtaa attgatgcta ccgcgtcaga agctggacag cgaggagctc gccgctgaat 2220 acgggcacat gcgaggcttg cctatcgagt cctacgacgg acagccgcag ttgctcatcg 2280 gagcgaataa catccattcg tttgctccga tggaggcaaa gataggcact acaatggagc 2340 caatcgcagt ccgaactaac ctcggatgga cggtgtacgg gccgaggcaa agcaccacgg 2400 ttgcatcggg caactacctt ggctaccacc agagaatcac caacgatgat ctgcacgagc 2460 tgctgaaaag ccactacgcg ttggaagagt ccgtcgtggc gattccgcaa gaaacagcgg 2520 aggagaaacg tgcccgggaa atattggagc gcaccaccaa acgcgtcggc gatcgcttcg 2580 agaccggatt gctgtggaaa acggatgacc cacgattccc ggacagttac ccaatggccc 2640 tgcggaggat gaagcagctg gagaagcgac tcgagaaaaa cgagaagttg cgggaaaacg 2700 tctgcagaca gatcgacgag taccagcaaa aggggtatgc acacctcgcc accgcggaag 2760 agctgaacgg cactgcctcc gatcaggtct ggtacctccc gctcaacgtc gtccaaaacc 2820 caaagaaacc tgaaaaggtt cgcctcgttg ggacgccgcg gctacggtac aaggaatttc 2880 cctgaactcg caactgctag caggaccgga catgctagtt ccgctggtaa aagttcttgc 2940 tggtttccgc gaatggagca tcgctttcgg cggcgacttg aaggaaatgt tccaccagct 3000 gaagattcgc gccgaagaca aacaaaagca gcggttcatt ttccgaaaac attcggagga 3060 cccaccgagt gtctacgtca tggacgtagc gacgttcggg tcgacgagct ctccctgctc 3120 ggcccaattc gtgaagaatc ggaacgctga agaattcgcc tcacaatacc cagaagcatc 3180 ggcggcgatc attcaccgac actacgtcga cgactacttc gatagcgttg atacggagga 3240 agaagctgtc aagcgagctc aggaggtacg actggtccac caaaaaggag gattcgagat 3300 ccggaactgg gtgtcgaatt caccagaagt cctgcggagt ctgggagagg agaaaccggc 3360 ggcgccggtg catttcggcc gtgacaagca aacgtcgaat gagagagtct tgggagtcat 3420 ctgggatccg gacatggacc agtttgcctt ttcaacgaca caccgcgagg agttattgcc 3480 gtacctctac gaaggcaaac ggccgacgaa aagacttgtc gccagctgcg tgatgggatt 3540 tttcgatccg cttggactgc tgtcgccgtt taccatccac ggaaaaatcg tcatacagta 3600 tctgtggcga tccggctgcg attgggacca agaaatcgat cccgactcct ggcagctgtg 3660 gaagcggtgg acagccctgt taccggtggt tgaagctatc cggattcccc gctgttacat 3720 cggtggcgct aaatctgccg aagttgattc gctggaagtc cacatcttta ccgatgccag 3780 cgaacatgga tacggatgtg tggcatattt gcgagcggtt atcaacggag cggtccactg 3840 cagtttgatg atgtctcggg cgaaggtggc gccaataaaa cgacagtcga ttccccggct 3900 ggaactgatg ggcgcggtcc tgggagcgcg aataagccaa actgtgctcg acatgcattc 3960 gtaccagatt tgccgtactg tcttctggac ggattcccgc accgtttata gttggctgca 4020 ctccgatcaa caccgctaca aacaatttgt tgccttccgc gtcggcgaga tccaagagct 4080 gacaaaggta acggattggc ggtggatacc aacgaagctc aacgttgcgg acgtgctaac 4140 gaagtgggga caaggccctc cactgcaaaa tgacggcgag tggttcaacg gaccatcgtt 4200 cctgtaccgg ccagaagagc agtggccaac acaagaagcg acgatcgaag agacccacga 4260 agaagcgaga ggcgttgtat tgttccacgg agtgatcgac gtcgaaccga tatcacgctg 4320 gacgaagttg ttgcgggtga cagccagcgt agtgcgcttt atcgccaact gccggcggaa 4380 gaaaagagga gagccgatcg ttactacacg ggctacggca aagcagcggc agatgatcga 4440 ggcgacagca gcgaattacg cggcgctgca tgcaccactg cagcgagaag agcttcagcg 4500 agcggaaacc ttgctctggc gacaggcaca gtgggacagc tttcccgacg agatgagcgc 4560 gctcaccaac aacctcaagc gccagccggg cagaccggtg gagaacatca agaagagtag 4620 cccgatctac aaaagttctc cggtactgga cgacgaggga gtgctgcgca tggatggcag 4680 acttgccaat tcggaggaga gctccttcga caagaagcat ccaattattc tgtcgcgggc 4740 ccacgaaatc acgcaaaggt tgatacagca ctaccacgaa gagttcgggc acgccaattc 4800 ggaaaccgta ttcaacgaga tgaggcagcg gtttcaaatt ccgaaaatgc gtccggcgat 4860 acagcaggtg gtgaaaaatt gcgtctggtg caaagtcaac aagtgtcgac cgagatcacc 4920 gagaatggct ccccttcccg tcgaacgagt caccactcaa catcggccct ttagttccgt 4980 cggcctagat taccttggcc ggtggaagtc acggtggggc gtcgaaagga gaagcgttgg 5040 gtggcagttt ttacctgttt ggcggtgcgg cggtacacct cgaggtcgtg cacagcctta 5100 ccacggaatc gtgccggatg gcgatcacgc gctttagtag caagtatgga aagccacagc 5160 aaatcttttc cgataacgcc acatgctttc ggggagcgaa caacgagatg gtcaaactgg 5220 agaaaatcaa ccaggagtgt gcggaaaccg tatgcagctc gactaccgcc tggcacttta 5280 ttccgcctgg aatcccacac atgggtggtg tctgggagcg gatggtgcgg tcggtgaaag 5340 aagcgatgcg ggcactcgac gacggacgaa aattgaccga cgagatcctg gtgacaactc 5400 tggctgaagc ggaagacatg atcaacacgc gcccgttgac ctacttgcca caggattcga 5460 gcgaatgcga agcactgacg ccgaaccact tccttcgagg aacggtgtcc ggtgcagacc 5520 gaaggtggac ggaggatcga cggctcccgc agaggcactg cggaacgtat acaagcggtc 5580 gcagttccta gcagaccgaa tgtgggagag atggacgaag gagtaccttc cgacgatcaa 5640 ccagcggtcg aaatggttcg acgatcagaa accgctcgaa gtgggcgatt tggtcttcgt 5700 agtcgacgga aagaatcgga agtcctggag acggggcgtc gtcgaagcgg tgatcaaggg 5760 ctcggatggc cgagtccggc aagcggacgt gaggacggct gacggtaagg tgcagagacg 5820 gggcgtagta aacctggcga cgctggaggt aaggtaaatc cgggaatgcc ggatgttacg 5880 ggccggggtg ttgcgacgcg gtttgaaccg gcctaacaag tgaacaggtc tagtcgcgag 5940 cataaccgca acatgacagt tcgcgtgagt atgtagtggt aggcgaagga gagaga 5996 // ID CR1_Ele31 repbase; DNA; INV; 4939 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele31. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4939 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4939 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 13 CC sequences with >96% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 379..1191 FT /product="CR1_Ele31_1p" FT /translation="MAKNCSKCSEAINGIDYVVCRGYCGAFFHMNACSGVT FT RALLNYFTTNKKNLFWMCDKCAELFENSHFRTISTNADQTSPLNLLTTAIT FT ELRTEIKQINAKPKAQFSPAVGWPSLTERRTTKRPFEQVVARASENCRVGS FT KQPQANVVTVPVCQKEDNQFWLYISRIRPDVTTEAVQAMVKANLNVDDDPT FT VVKLIPKEKDISTLTFVSFKIGLDPSMKPKALDPETWPEGLLFREFEDYGA FT QKFRVPLKHRKPMTPSLSVSSPITPVMDLS" FT CDS 1089..4868 FT /product="CR1_Ele31_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RLWSPKISSTSEAQETNDTVTVSFFSHHTCNGPELST FT INPGRTTLSTLEAPEPPSTVVSFLPALSSRPGPVFWSGEGVFQSVAIGKYS FT FNIDNIVPDALVDSSQLLQFRHSSSGSIVAEEGCMPVSLMEASYPLISVES FT VPPSFISHPCPEFECGEEVFQTTTTGKYAYSLNNYVPDYSGASSRSLYSCL FT SGSARIVPAPGCTPASLMEAPNSLNTVEPFPPALRSHPGPVFENEEGVFQP FT VNAGKFANVRNTALPVISIASSYQTPDDFRSIDDVIEINDRPCTGSTAPSS FT SNLTPNVGTRLAGISSRTLSVYYQNVRGLRTKISRLRLLLSSCDYDVLVFT FT ETWLRGDIESNEISPDYAFFRCDRSESSSQHLRGGGVLIAVKNYLKCEYFP FT LDNCETLEQVAVRINLHHRTLYVIAVYLPPNSSQDLYTAHSCAVQLIADRL FT MESDILLSAGDFNFPHLRWHLDEDINGYVPVNTSTDTELNFTEAMFASGLR FT QINSFINTNGRLLDLMFTNLPEYLDVIAPCSPLLPVDSHHMPVIILLDESD FT TQTFPDNLETCFRFDYSACDFELLNSVFENTDWGRSSNYATVDQMLSSFYE FT SLAGVISRYVPRKRQAFISIYNKPWWNPELRHLRNILRKMRNRYFNTKNDH FT DKTRLRDFEQQYKESLYASHESYLASIQRNVKQDPSRFWDFVKQRKAGIAI FT PSSVSFDGVTAHSNTEVADLFATFFESVFSRASPVQRRDSFEQIPVHNITL FT PFVQFSVEEVQNALEDLDSTKGPGTDNLPPLFWKNCAATLAMPITAVFNRS FT LRDRRFPSAWKTASITPIHKSGSYNCVANYRGVSILCSLSKVFEKLIHAVL FT YRAAAPIISNNQHGFMKHRSTTSNLMCYVSAVSRELESRRQIDAVYVDFAK FT AFDSVPHDIICRKLSHLGFPAWLTDWLCSYLTDREAFVKVKSTRSRTFNIP FT SGVPQGSVLGPLIFIIYVNDLYELLSSFNLSYADDLKIFRVIASSADCVEL FT QEDINRLLIWCDDNGMRVNSKKCKVISFSRSNSISLHQYNMGLDFLERVDS FT ICDLGVTIDSKLRFNEHISIITAKAFTVLGFIRRHASGFTDIYCLKTLFCS FT LVRSILEYASPVWSPFYVKHNLAIERIQKSFLRFALRHLPWNDPINLPSYP FT ERLKLINLESLSARRIRSQRLFIFDLITNNIDCPDLLELIPWNVPSRRLRN FT SVLFVIPFHRSNFGYSNCFHMCLRSFNDVCDEFDFNMSKNVFSVRIRGLD" XX SQ Sequence 4939 BP; 1308 A; 1144 C; 1028 G; 1459 T; 0 other; tggcaactct gttcgattgg atatatgtag gatctgaatc gtgaatattt tttattactt 60 ttccgttagt catttatcgt ttttattcgt tattttcatc acggccgatc aattcagtgt 120 ttcgtttcaa cgtgcgggtg ttgtatcgta tgataattgt gtatcaagtg cgttttcgcg 180 atctatgaca cagctgtgat tgtgcacata aacaatactg caccataacc ctcgccgcga 240 ttttttcgct cgaagcgaca tctgcgtatc aaaatcggaa tcgaatattg catgcgattg 300 caaacttaac ttgtccaagt tcgtcattca acgaggagca tatcttgaat tggattgcat 360 aaccgtaggc gcatcatcat ggcaaagaat tgctcaaaat gctcggaagc tattaatggc 420 atcgattatg ttgtctgtcg tggatactgt ggtgcattct tccacatgaa tgcatgttca 480 ggcgttacac gtgcgttgct taattatttc acgacaaaca agaaaaatct attttggatg 540 tgcgacaagt gtgctgagct tttcgaaaac tcacatttcc gcactatttc gactaatgct 600 gatcaaacat cgccactcaa tttgcttacc actgcaataa cggagttaag aaccgagatt 660 aagcaaatca atgctaaacc taaagctcaa ttctcaccag ctgttggctg gccctcatta 720 acggaacgaa ggacaactaa aagacctttt gagcaagtag tcgcacgtgc ttcggaaaat 780 tgtcgtgttg gtagtaagca gccgcaggct aatgtcgtga cagtacctgt ttgccagaag 840 gaagacaacc aattctggtt gtacatctcc cgaatccgac cggatgtgac cacagaagct 900 gtgcaggcta tggtcaaagc aaatctcaat gttgatgatg accctactgt ggttaaattg 960 attccgaaag agaaagacat tagtactctc acatttgtgt ctttcaaaat cggtctcgac 1020 ccgtcaatga aaccaaaggc tcttgatcca gaaacatggc ctgagggctt attattccgt 1080 gagtttgaag actatggagc ccaaaaattt cgagtacctc tgaagcacag gaaaccaatg 1140 acaccgtcac tgtcagtttc ttctcccatc acacctgtaa tggacctgag ttaagcacca 1200 tcaacccggg acgcactacg ttaagcactt tggaagcccc tgaaccaccc agcacagtcg 1260 tgtcattcct gccagcgctc agcagtcgtc ccggccctgt gttctggtct ggagaggggg 1320 tcttccagtc cgttgccata ggcaagtatt cattcaatat cgataatatt gtacctgatg 1380 cgctcgtcga ttctagtcaa ctacttcaat tccgccattc atcatccgga tctatcgttg 1440 ctgaagaggg atgcatgcct gttagcctca tggaagcctc ttatcccctc atctcagtcg 1500 agtcagtccc gccatcgttc atcagtcatc cctgtcctga gtttgagtgc ggagaagagg 1560 tcttccaaac aaccactaca ggcaagtacg catacagttt gaataattat gtacctgatt 1620 attctggcgc ttccagtcgg tcattgtact cctgtttgtc tggctcagcg agaatcgttc 1680 ctgccccggg atgcacgcct gcaagtctta tggaagcccc taattccctc aacacagtcg 1740 agcccttccc gccagcgctc cgcagtcatc ccggtcctgt gtttgagaac gaagaagggg 1800 tcttccaacc agttaatgca ggcaagtttg caaatgttag gaatactgct cttcctgtaa 1860 tttccatcgc ttctagttac caaactccag atgatttccg atcaattgac gatgtgatcg 1920 aaattaacga tcgtccttgc actgggtcca ctgcgccttc atcaagcaac ttaacaccca 1980 atgttggcac cagattggca ggtatttcaa gtcgtacact ttcggtgtac taccaaaatg 2040 tcagggggct acgcacgaaa atttcccggc tacgtttgct gttgtccagc tgcgactatg 2100 atgtactcgt tttcaccgaa acttggctcc gaggtgatat cgaaagtaat gaaatttcgc 2160 ctgattatgc tttctttcga tgcgatcgca gtgaatcaag cagtcagcat ttgcgaggcg 2220 gcggagtact tattgccgtt aaaaactacc tgaaatgcga gtattttcca ctggacaact 2280 gtgagaccct tgaacaagtc gccgttcgca taaatttgca tcatcgcacc ctctatgtga 2340 tcgcagtgta tttgccacct aattcaagcc aagatctata tacagctcat tcatgtgcag 2400 tacagcttat tgcggatcgt ctcatggagt ccgacatact gttatcggca ggagacttta 2460 actttcctca tctgcgatgg catcttgacg aggatatcaa tggatacgtt ccagtaaata 2520 cttcaactga caccgagctc aactttactg aagcaatgtt tgccagtggc ctgagacaga 2580 taaatagctt catcaacacc aacggcagac ttttggacct catgtttacc aatctcccag 2640 agtatctgga cgtaattgca ccttgttctc cgttgttacc ggttgatagc catcacatgc 2700 cagtcattat tttacttgac gagagcgaca cgcaaacatt tcctgacaat cttgaaacgt 2760 gttttagatt tgactattcg gcgtgtgatt ttgaactact gaactctgtt ttcgagaata 2820 ctgactgggg acgatcatcg aattacgcta ctgtagatca gatgttatca tcattttacg 2880 aatcgctagc tggagtgatt tctcgttacg ttcctcggaa acgacaagca tttatttcca 2940 tttacaacaa accatggtgg aatccagagc ttcgtcacct tcgtaatatt ctccgtaaga 3000 tgcgcaaccg ctattttaat acgaaaaacg atcacgataa gactagactg cgtgactttg 3060 agcagcagta caaagaaagt ctttacgcat cacatgaaag ctatttagca agcatccaaa 3120 gaaacgtcaa gcaggatcct tcccgtttct gggattttgt aaagcaacgc aaggcaggta 3180 ttgccattcc gagctctgtc tctttcgatg gagtcaccgc ccattcaaac accgaagttg 3240 ccgatctctt cgccaccttt ttcgagagtg tttttagtag agcatcgccc gttcaacgtc 3300 gtgatagttt cgagcagata ccagttcata acataactct accgttcgtt cagttttctg 3360 tagaggaagt tcagaatgct cttgaagatc tggattctac aaaaggacct ggtacagaca 3420 atctgccccc tttgttctgg aagaactgtg cagctacgct cgcaatgcct attactgctg 3480 ttttcaaccg ctcgcttcgt gacaggaggt ttccaagtgc gtggaagacg gcatctataa 3540 caccgattca caagtccgga agctacaatt gtgtcgcaaa ttatcgcggg gtgtctatac 3600 tttgcagcct aagcaaagtg tttgagaagc tgatacacgc agttttgtac cgagctgctg 3660 ctccgatcat ctctaacaac cagcatggct tcatgaagca ccgctccaca acatcgaacc 3720 ttatgtgcta tgtgtcagct gtatcccgtg aacttgaatc aaggcggcaa atcgatgcag 3780 tgtatgtaga ttttgcaaaa gcttttgatt cagtaccgca tgatatcatc tgcagaaaac 3840 tcagtcatct tggttttccg gcgtggctta ctgactggct ttgctcctac ttaaccgatc 3900 gtgaagcttt cgtgaaagtg aaatccacac gctccaggac attcaatatt ccatccggcg 3960 tcccgcaagg aagtgtcctc ggtccgttga tttttatcat ttatgtaaac gacctttatg 4020 agctgctttc ttcatttaat ctttcatatg cggatgactt aaaaattttc cgtgtgattg 4080 cctcctctgc agattgtgtc gaactacaag aagacataaa tcgactgctt atctggtgcg 4140 acgataatgg catgcgcgta aacagtaaaa agtgcaaagt aatttccttc tcacgatcca 4200 atagcatttc gctccatcag tataacatgg gactagattt tttggaaagg gttgactcaa 4260 tatgtgatct gggtgttacg attgactcaa aattgagatt taacgagcac ataagcatca 4320 taactgccaa agcatttacc gtgcttggtt tcatccgtcg ccatgcttcc ggttttaccg 4380 atatttattg tcttaagacg ttattctgtt ctctagtacg cagcattctt gagtatgcat 4440 caccggtttg gtcgccattc tacgtaaagc ataatctggc aattgaacgt atccaaaaaa 4500 gctttttaag gtttgctttg cgtcatcttc cttggaatga tccgataaac ttaccgagct 4560 atccggaaag acttaagtta attaacttag aatctctctc tgcaaggcgc atcagatcgc 4620 agagattgtt tatttttgac ctcatcacaa acaatataga ctgccctgat ctgttagaat 4680 tgattccgtg gaatgttcct tccagacgtc ttcgtaattc agtgttattc gtgattcctt 4740 tccatagatc taattttgga tacagcaact gttttcatat gtgtttgcgt tcgttcaatg 4800 atgtttgtga cgagtttgat tttaatatgt ccaaaaatgt attcagtgtt agaataagag 4860 gtttagatta agtagttatt aagaaatcag tctgtacggc gtagccgaag atggtgcata 4920 aataaataaa taaataaat 4939 // ID hAT-48_HM repbase; DNA; INV; 4352 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-48_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4352 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2036-2036 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1152..1610,1690..3315,3473..3802) FT /product="hAT-48_HM_1p" FT /translation="MATSSRSKNPLWNYFKVSDEDVGKAICILCKKILSRG FT SKDAHNMSTTNLQNHLKKVHYNIHSMILSQKKIINHESLPNPNEKSIKQFC FT TTRTICSDRTSSAESVQIITSTATDETQGSSRVSNNTIAFFQKASTQVTQL FT TVAEFIYYSVFMILYYNKCSGEGGKIHTSIRHLDPSLAIVSYKLLSNFFNS FT LNLFQVSLREILQRKELWQLDNPKAQAITKLIGEMICLDLQPYCIVEDKGF FT TRLLKNLAPNYTIPSRKYFSTKVIPLMYETIKAKVKYELDQADFLSLTSDG FT WTCQHTIQSYYSLTARFVTHEFTVKHVILQVTHFPESHTGHNISKFINEAL FT QSWEIPHEKIHAVLTDNAANVMAAIKESNLGDKHLPCLIHTLQLCIQKKIF FT REQRTASDTIAVFRALAGHFHHSSSAVAKLKEIQSQLKLPEHNIIQDVSTR FT WNSTYYMLERFIEQKKAITLYCITRNQTNAKNPTENQWQLAEMLVCILKHF FT ESTTKDMSKETACISEIIPFIYAMEKFLDYACDTATEIKTVVDELKKDFNC FT RFQKYKDNVDLKIAMMLDPRFKLKFVEDKNHNMFKEILHLEFYRFCSKSHL FT EQVKEIEFGVARGSGSDSEESLSSFSESKIHESDSDFDGHGSPLKKKKLMS FT MDIYSLFHQIGANEYALQNSENSDGRRKKKKNTSSMGKTLTVKEKWKFKPK FT IITYILCISTDEVNFYLSLPILPKNECPYKWWSAYRYTSSDKDTYKLNLSK FT FSKKVLCAPPSSVKSERLFSTSGNIFEAKRNRLLPEHGEQIAFLNCNMPAF FT IE*" XX SQ Sequence 4352 BP; 1555 A; 702 C; 692 G; 1403 T; 0 other; ttagggatgt accgaagctt cggcttcggc ttcggtttgc cgaagcttcg gcaattttgc 60 cgaagcttcg gttcggccga agccatgggc gttttttgcc gaatcattgg cttcggcaaa 120 aaaatggaca ttactaagct ggcattcgac tgaggaagac attttttaaa cgttcacacg 180 ttgtctgttt tccgaataaa acgtagttga acgttcaaaa ttcttctttt tcttttcttc 240 ttctgagaaa tttaaataag cctctgatct atacagttgt gaagattaaa tgaatccagc 300 aacggctttt ggaggtggca aggccgtgtg tagggcggcc aaaaacggca cgatagcgga 360 tagaaagtaa agttacttaa tgaaacacat tgattttttt ataatcactc aatataagct 420 gatttaaact gtaacattat agtaaacttt atataattaa ataccataac cctaatattt 480 agttatgatc actcgagctt agcaataaaa tttatttatg cattagttgg tagtaaaata 540 atgaaaaatt attaaaattt aattttttta gcatttagtt attagttagc atcaacttct 600 aaaatccgag ttatgttaaa ttatatattt tttaatatta aattaaaatt aaaaattaaa 660 cctggaaata aatcttcatt taaaattact tagatgtaac taaaacatag cgcacgagag 720 ttaagagagt taactattct cgaaattttt aaaaatctat acaataagtt aattttaacg 780 cacgaaacaa tttttattca gaaagtttgt tttactaagt cacagacaga acacagaaag 840 acactacaca gaaagacact acacagaaaa gacactacac agaaaagcac aattgaaatg 900 taagtcaatt gcgtttgttt ttgattttgt acgctggatc attagtaaaa gtaaatatac 960 ctgctctaca aatataggtt aattggttat ttacgacttc gatttttttt aattcaaata 1020 atatttgaat caaaaaattg aacgaatatt gtttatttca ataaatatta ttaaaggtaa 1080 gaaaataatt aaactttcta tttcaatttt cagaattgtt attataactt taagttatcc 1140 attagcgcat tatggctacc tcatcgcgaa gtaaaaatcc cttatggaat tattttaaag 1200 tgagtgatga agatgttggc aaagccatct gcatactttg caagaagatt ttgtctagag 1260 gaagtaaaga cgcacacaac atgtcaacaa caaatttaca aaaccatctg aaaaaagtac 1320 attacaatat acatagcatg attctttctc agaagaaaat tattaaccac gaatctttac 1380 caaatccaaa tgaaaagtct ataaaacaat tttgcactac tcgaacaata tgctcagata 1440 gaacaagctc tgctgaatca gtgcaaatca ttaccagcac tgccacagat gaaacgcagg 1500 gaagctcgcg tgtgtctaat aacacaattg ccttttttca aaaagcatcg acacaggtaa 1560 ctcagctaac tgtagctgaa ttcatttact attcagtatt catgattttg tgactggatg 1620 ggggcatgtc ttgtcgaaaa attatctttt gtatagaaaa atgttaactt aacttatcta 1680 agcttataat attataataa atgttctggt gaggggggaa agatacatac ctctatccgc 1740 cacttagatc cgtcgctggc cattgttagt tacaagttgc taagtaattt ttttaattcg 1800 ttgaatttgt ttcaggtctc actgcgggag attttgcaaa gaaaagaatt gtggcaactt 1860 gataatccta aggctcaagc tattaccaag cttattggag aaatgatatg tttagacttg 1920 cagccatact gtattgttga agataaaggt tttacccgac ttttaaagaa tttagctcca 1980 aactatacaa tacctagtcg aaagtatttc tccacaaaag tcattccact gatgtatgaa 2040 accataaaag ccaaagtaaa atatgagctt gatcaagcag atttcttaag tttaactagt 2100 gatggttgga cctgtcagca cacaattcaa tcgtattaca gtttaacagc tagatttgta 2160 acgcatgagt ttacagttaa acatgtcatc ttacaagtta cacacttccc tgagtcgcac 2220 acagggcaca atataagtaa attcataaat gaagcgctac aatcatggga aattccccat 2280 gaaaaaattc atgcagtttt aactgataat gcagccaatg taatggctgc cattaaagag 2340 tcaaatttag gcgataaaca tcttccatgt ctgatacata ctttacaact ttgtattcag 2400 aaaaagattt ttagagaaca aagaacagca agtgatacaa tagctgtttt tcgagcatta 2460 gcaggacatt tccaccactc atctagtgct gtggctaaac taaaggaaat tcaaagtcaa 2520 ctaaaattgc cagagcataa cataattcaa gatgtttcca caagatggaa ttcgacatac 2580 tatatgcttg aaaggttcat tgagcagaaa aaagctatta cattgtattg cattaccaga 2640 aatcaaacaa atgcaaaaaa tcctacggaa aatcaatggc aacttgctga aatgcttgta 2700 tgcatattaa aacacttcga aagtactaca aaagacatga gtaaagaaac tgcatgtata 2760 tcagagatta taccatttat ctatgcaatg gaaaagtttc tagactacgc ttgtgacacg 2820 gcaactgaaa taaaaactgt tgttgatgaa cttaagaaag attttaactg tcgttttcaa 2880 aaatacaagg ataatgttga tttaaaaatc gcgatgatgc ttgatcctcg tttcaagtta 2940 aagtttgttg aagacaagaa tcataatatg tttaaagaaa tactccattt agagttctac 3000 cgtttttgtt ccaaatctca tttagaacaa gtaaaagaaa ttgaatttgg tgtagcaaga 3060 ggtagtggat cagactcaga agaatcatta agttcatttt cagaatctaa aattcatgag 3120 tccgactcag attttgatgg ccatggctct ccattaaaaa aaaaaaaact aatgagcatg 3180 gacatatata gcttgttcca ccaaatcggt gcaaatgagt atgctctaca aaattcagaa 3240 aattctgacg gacggagaaa aaagaaaaaa aacactagtt caatgggaaa aactctaact 3300 gtcaaagaaa aatggtaaga aaacagaatc tatttatgat ttttatattg tttatacatt 3360 aaatattaaa tatttttttt atctctattc tcattctcca ttttttttat ttcaaaagac 3420 ggataacagc attcttttat tttgtttagt tgattaaatt gtaatatttt aaaaatttaa 3480 acctaaaatt atcacctata ttttatgtat cagcaccgat gaagtcaact tctacttgag 3540 ccttccaata ctgccaaaga atgaatgccc ttataagtgg tggagtgcat accgttacac 3600 ttcctctgat aaagatactt acaagttgaa tttgtcaaag ttttcaaaga aagttttgtg 3660 cgcgcctcct tcatctgtaa aaagtgaacg cttatttagc acatctggaa acatatttga 3720 agcaaaaaga aacagactat tacccgaaca tggagaacag attgcatttt taaattgtaa 3780 tatgcctgct ttcattgaat aaatttttat taactaaata atagactgta gcttcttagt 3840 tcaactgatt caacttttta acaaaattta aaacaaatac caaattttgt gttcaaatta 3900 aaataatgac tacttttata tttttacatt ttacgtgttt acatttttat aaattctttc 3960 aaagaataga ccataattta ccttaagtat cagtaaatgt ctaagattta gtgatgtaat 4020 ctttcctata cctgctagat atatagagtt aagactctaa atttcactct ctaaattttt 4080 gtacaaaatt tgtttaaatt ttttataata ttggttttgt tttataattt tcctaaaatg 4140 ggcatgtagg ccaaaacatt gaaacctaaa acattgatat ataattttga aagcaacttg 4200 attattcaaa ccaccgtggt gaagtaaatt gagctttaaa ttacgtatag cttaagccga 4260 agcttcggct tcggccgaag cttcggcttc ggcaaagtgg cttcggccga agcttcggct 4320 tcggcaaaat tgcagcttcg gtacatccct aa 4352 // ID BEL-30_CQ-I repbase; DNA; INV; 6272 BP. XX AC AAWU01042468; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-30_CQ_; KW BEL-30_CQ-LTR; BEL-30_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 213-213 (2011). XX DR Genome; AAWU01042468; Positions 1450 7721. XX CC Positions [5321-5878] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..6271 FT /product="BEL-30_CQ-I_1p" FT /translation="MSAPGVHNLTGYECVLCEEHSNADSQMVFCEKCQGFF FT HLQCAGVTEADRDLSFTCKKCVGPGEKVDQTDDENDGATVVDDSKKADETL FT PGKPDPSVLNKPNGRIEQEDAELQHQREMQQLQEAFENQLRREQEKRMMIL FT RLERQMLEQKAALNAEFNKRKNELYDEFKHTAGEVEPEEGAVGGEIVPVND FT HRGAFPKYSTPIKHPDVRTGKTEESSFPRPLGPKPLSTPAPNPFPWPEPSP FT PAPASNPVPWPAPPPPAPAPNPAPAPNPAPAPNPAPAPNPVPAPNPGPAPN FT PVPAPNPAPALNPAPAPNPAPAPNPAPAPNPAPAPNPAPAPNPAPAPNPAP FT APNPAPAPNPAPAPNPAPAPNPAPAPNPGPAPNPAPAPNPAPAPYPDPVPP FT IPEDIRNNRGDQQDPDFYGDRELTRAQIAARKGPFANLPVFTGRPEDWPMF FT ISSFNNGNRACNWSNLENLGRLQTSIQGRALERVQTSLLYPESVPRVIETL FT RLLYGRPEQLLHCLMQKARKAESPRMDRLSTIINFGVVVQQLCDHLVASGL FT VDHLVNPMLIAELVEKLPDPTKVEWVRYKRQFAAVNLSTFADFLSERVSEA FT TEATAYSEPQEERRPNRERHEKKPRGRENEGFLNTHLGSEQLSNQTKGDAS FT GGRRPCRACNRTDHRLRFCDDFLKLAWDARMKIAEDWDVCLLCLNEHGQTR FT CRFKGHCNVGSCKERHHPLLHPPSPTVPLSANCHVHNSEQQPVIFRVVPIK FT VHHDGRSFEVVAFLDEGSSFSLMDSSVADQLKLEGTRKPILVKWTAGMSRM FT ERDSRSVDLSISAADSQDRFLLRNVHTVQQLQLPNQKMQYSAVAARFKHLR FT GLPIADCDNGPPQILIGLKHIHVYAPLESRIGNPGEPIAVRTQLGWTIYGP FT QTGDDVVSGYTGHHAAGNLSDHDLQELLRRHFTLEDSGLAVAVLPVSAEDC FT RARELLEKTTIRVGDRFQTGLLWRDDNPQLPDSFPMALKRMTELERKLAKN FT PALQKNVDEQISEYQRKGYAHIATAKELSQAVPGKIWYLPLNVVLNAKKPE FT KVRLVWDAAASVQGKSLNSALLKGPDLLTSLPTVLSRFRERPIAFGGDIAE FT MYHQLQIQPCDKSAQRFLYRPTGSNQPVIYVMDVATFGSACSPCSAQYVKN FT RNAEEYAEKYPDAVEAIVGSHYVDDYLDSTFTIGEAIKRASEVAFIHSKAG FT FQLRKWVSNSPEFLQHFGVQGADQLVPLGTDKNGGMQRVLGMSWNTFDDVF FT VFVTNLRDDLQPYLTEGKLPTKRILLSIVMSFFDPLGLWALFTIFGKIIIQ FT DLWRNGCPWDQVLDETSAAKWFKWIALLPRVQAMTVPRCYFLGVKLSDYVN FT LELHVFTDASEEAYGAVAYFRVWVYGKPKVSLVTGKAKVAPLQYMSIPRLE FT LQAGTLGARMAAAIKANHTFPIRQTVMHTDSATLLSWIRSDHKKYKQFVAH FT RIGEILSHTDVDDWQFVATKFNIADVLTKWGKHGPPLDGDSEWLGPEELFI FT PEEDLVLRELPEPNVREELRAFYLFHEIEFASCLIDPSHFSRWKTMVRTVA FT CVFRFITNLRRKQKKQPIHTLQATFNLERFVLGTESLVKVPLTRDEYQRAE FT NHLWQAAQQEGFPDEKKIMLKNREQEQQKWHSIERSSPLHRLAPFLDEYGV FT IRMEGRSAYAEFLPFEQRFPIVLPKGHDITNKLIDLYHRKFGHANRETVVN FT ELRQRFYIQNIRAAVLKVMQTCLKCKNNKCQPAAPRMAPLPVQRLTPQLRP FT FSYVGVDYFGPVVVTIGRRTEKRWICLFTCLVTRAIHMEVAHSLSSASCMM FT AIRRFICRRGAPLEIFSDNGTNFQAASKELAQTVKRIELECADIFTDARTR FT WNFNPPAAPHFGGVWERLVRSAKDAIKALHDGKKLTDEVLLTVLAEAEDMV FT NSRPLTYMPQESADDEALTPNHFIRGLPAGERDEANSPTSSAEALRDNFKR FT SQQLADMLWQRWLKEYVPTINHRTKWCTEQDSVKEGELVYLVDGNNRRTWV FT RGIVERVILGADGRVRQAMVRTSKGVYRRPVAKLAVLECRSKSGQDLGPGP FT ELREGE" XX SQ Sequence 6272 BP; 1574 A; 1786 C; 1702 G; 1210 T; 0 other; aacttctcaa aaattagttg agatgagcgc acccggcgta cataacctca ccgggtacga 60 gtgtgttctt tgcgaagagc acagcaacgc ggatagccag atggttttct gcgagaaatg 120 ccaaggcttt ttccatctcc aatgcgctgg ggtgaccgaa gccgacaggg acctgtcctt 180 cacgtgtaag aagtgcgttg gtccgggtga aaaggtcgac caaaccgacg atgagaatga 240 cggagcaacc gtcgtggacg actccaagaa agcggacgaa accctgccag gcaagcccga 300 cccgtcggtc ctcaataagc cgaatgggag aatagagcag gaagatgcgg aactgcagca 360 tcagcgcgaa atgcagcaac ttcaggaagc ttttgaaaat cagctccggc gagagcaaga 420 aaagagaatg atgattctcc gcctggaacg acagatgctg gagcagaagg cagcgctaaa 480 tgccgaattt aacaagcgaa agaatgaact ttacgacgaa ttcaagcaca cggccggcga 540 ggtagaacct gaggagggtg ctgtgggtgg agaaattgta cctgtcaacg accatcgtgg 600 agcgttcccg aagtattcaa cccctatcaa acatcccgat gtaagaactg gcaagacgga 660 agagagctca tttcccagac cgctggggcc gaaaccacta tcaacgcctg ctccaaatcc 720 attcccgtgg ccagaaccat cgccaccagc gccggcttca aatccagtcc cgtggccagc 780 accgccgcca ccagcgccgg ctccgaaccc tgcgccggcc ccgaaccctg cgccggctcc 840 gaaccctgcg ccagctccga acccagtgcc ggctccaaat ccagggccag ctccgaaccc 900 agtgccggct ccaaatccag cgccggctct gaaccctgcg ccagctccaa acccagcgcc 960 ggctccgaac ccagcgccgg ctccaaaccc tgcgccagct ccgaaccctg cgccagctcc 1020 gaaccctgcg ccggctccga acccagcgcc agccccgaac ccagcgccag ctccgaaccc 1080 tgcgccggct ccaaaccctg cgccagctcc gaaccctgcg ccagctccga accctgggcc 1140 ggctccgaac cctgccccag ctccgaaccc tgcgccggct ccatatcccg atcccgtccc 1200 gccgataccg gaagacatcc ggaataaccg cggtgaccag caagatcctg atttctacgg 1260 cgatcgcgaa ctgactagag cacagattgc tgcacgaaaa ggtccattcg ccaatctacc 1320 agtgttcacg gggcggccag aggactggcc tatgtttatt agcagcttta acaacggcaa 1380 ccgagcgtgt aactggagca atctcgagaa tctcggtaga cttcagacaa gcatccaagg 1440 ccgagcgctg gagcgggtgc agaccagcct actttacccg gaatctgtcc caagggtgat 1500 cgagacgctc agactgctct acggcaggcc ggaacaactg ctgcactgct tgatgcaaaa 1560 ggctcggaaa gcggagtccc ctcgaatgga tcgcctttcc acgatcatca attttggcgt 1620 agtcgtgcaa caactgtgcg accatctggt ggcctcggga ctggtggacc atctagtgaa 1680 cccaatgctc atcgcggaac tggtggaaaa gttacccgat ccaacaaagg ttgagtgggt 1740 gcggtacaag cgtcagttcg ctgcggtaaa tttaagcaca ttcgcagact ttctctctga 1800 aagggtctcc gaagcgacgg aggccacagc ctactccgaa ccgcaagaag aacgacgccc 1860 gaaccgagag cgacatgaaa agaaaccgag aggtagagaa aacgagggtt tcttgaatac 1920 gcacctggga agcgagcagc tgtcgaatca aactaagggg gacgcatctg gaggtcgcag 1980 accatgccgt gcgtgcaatc gaaccgatca ccgcctccgg ttctgcgacg atttcttgaa 2040 gcttgcctgg gatgcccgaa tgaaaattgc tgaggattgg gacgtatgtc tgctctgcct 2100 taacgagcac ggtcaaacgc gttgccgttt caagggtcac tgcaatgttg gaagttgtaa 2160 ggaacgtcac caccccctct tacacccccc aagtccaaca gttccgcttt cagcgaactg 2220 tcacgtgcac aattcggaac agcaacccgt aattttccga gtagtcccga tcaaggtcca 2280 tcacgacggc cgctcgttcg aagttgttgc ctttcttgac gaaggttcgt cgttctcttt 2340 gatggatagc agcgtagcgg atcagctaaa gctggaggga accagaaagc cgatacttgt 2400 caagtggaca gctggtatga gcaggatgga gcgtgactca agatcggtag acttgtcgat 2460 ctccgcggcg gattcccagg accggtttct gctgcgcaat gtgcacaccg tacaacaact 2520 gcagcttccc aaccaaaaga tgcagtactc ggcagtggcc gcccgcttca aacacctgcg 2580 tggcctgccc atagctgact gtgacaatgg tccaccgcag attctgatcg gtctcaagca 2640 tattcacgtg tacgccccgc tcgagtcgcg aattggcaat ccaggagagc ccatagctgt 2700 acggactcaa ctcggttgga cgatttacgg cccccaaact ggcgacgatg tagtgtccgg 2760 gtatactggt catcacgccg cgggtaatct atcggaccat gatctacaag aactcctacg 2820 aagacatttc acattggaag actccggact ggccgtcgca gtgcttcctg tatccgctga 2880 ggactgcaga gcgcgagagc tgctagagaa aactacaatc cgagtcggcg atcgctttca 2940 aacgggcctc ctgtggcgcg atgacaatcc acagctcccg gacagtttcc cgatggcgct 3000 gaagcgcatg acagaactgg aacggaagtt ggcgaagaat ccggccctac agaagaacgt 3060 ggacgagcag atcagcgaat atcagcgaaa aggctacgct cacatcgcta ctgcaaaaga 3120 gttgagccaa gctgtgccag gcaagatctg gtatcttcct ctgaacgtcg ttctgaacgc 3180 aaaaaagccg gaaaaggtcc ggcttgtctg ggacgccgca gcgtccgtcc aaggaaaatc 3240 gctgaattct gcgctgctca aaggaccaga tctactgacg agcctgccca ccgtgttgag 3300 tcgcttccgg gaacgtccca tagctttcgg tggcgacatc gccgaaatgt atcaccagct 3360 gcagattcaa ccgtgcgaca aatccgcaca acgatttctt taccggccaa cggggtccaa 3420 ccaaccggtg atttacgtga tggatgtcgc cacgttcggc tccgcgtgct cgccgtgctc 3480 ggctcagtac gtgaaaaaca gaaatgcgga agagtatgca gagaaatatc cagacgcggt 3540 ggaggcaatc gtcggcagcc actacgtcga cgattatttg gactcgacgt tcaccattgg 3600 agaggcgatc aagcgagcaa gcgaagtagc atttatccac tccaaagctg ggtttcagct 3660 acgaaagtgg gtttcgaaca gcccagagtt cctgcaacac tttggagttc aaggtgcaga 3720 ccagctggtg cccctcggca ccgacaagaa cggcggtatg cagcgagtct tgggaatgtc 3780 ctggaacacg ttcgacgacg tattcgtatt tgtcactaac ctgcgcgacg atcttcagcc 3840 gtacctaaca gagggaaagt tgccgaccaa gcgaatcctg ttgagtatcg tcatgagctt 3900 cttcgatcca ctcggattgt gggcgttgtt tacaatattc ggcaaaatca taatccagga 3960 cctgtggagg aacgggtgcc cgtgggatca agtcttggac gagacctcgg cagccaagtg 4020 gttcaaatgg atcgccctcc tacctcgcgt gcaagccatg accgttcctc gctgttactt 4080 tctcggcgtg aagctgtcgg actacgtgaa cctcgaactc cacgtgttta cggacgcaag 4140 tgaggaggct tacggcgcgg tcgcctactt tcgagtctgg gtgtacggca agccgaaagt 4200 atcgctggtg acggggaaag caaaggtggc cccgctgcag tacatgtcca ttccccgcct 4260 cgaactacaa gccggaaccc ttggagcccg gatggcagcg gcgattaagg cgaatcacac 4320 ctttccaatt agacaaaccg taatgcacac cgattccgcg acgctgctct cgtggatccg 4380 gtccgaccac aaaaagtaca aacagtttgt agcccaccga atcggagaaa tcctcagcca 4440 cacagatgtg gacgactggc agttcgtagc aaccaagttc aacatcgctg atgtgctgac 4500 caaatgggga aaacacggac cgccactgga cggcgacagc gaatggctag gtccggaaga 4560 gctcttcata ccggaagaag acctggtcct gcgggagttg ccggaaccca acgtacggga 4620 agaattgcga gccttttacc tctttcacga aattgagttc gcctcgtgcc ttatcgaccc 4680 ttcccatttc tcgcgctgga agaccatggt aagaacagtt gcctgcgtgt ttcggttcat 4740 caccaatctg cgccggaagc aaaaaaagca gccaattcac accttacaag ccacattcaa 4800 cctggaacgg ttcgtgttag ggacagaatc gctagtgaaa gtccctctca cccgcgacga 4860 gtatcaacga gcagagaacc acctgtggca agctgcacaa caagaaggtt tcccggacga 4920 gaaaaagatt atgctgaaga accgcgagca ggagcagcag aagtggcatt ccatcgaacg 4980 ttccagccca ctacaccggt tggcgccttt cctggacgag tatggagtta ttcggatgga 5040 aggacgttcg gcctacgccg agttcttacc cttcgaacag cggttcccca tcgtgttacc 5100 aaagggacac gacatcacca acaagctgat cgatctctac caccgcaagt ttggtcatgc 5160 caatcgagaa acggtggtga acgagctgcg ccaacgcttc tacatccaga acatccgagc 5220 tgcggttctg aaggtgatgc agacctgcct gaagtgtaag aacaacaaat gccaaccagc 5280 agccccgaga atggcccctt taccggtgca gcgtctcact ccgcagctgc gtcctttcag 5340 ttacgtaggt gttgactatt ttggcccagt agttgtgaca attgggagac ggaccgaaaa 5400 gagatggatt tgcctgttta cgtgtctggt gactcgagca atccatatgg aagttgccca 5460 tagcttgagt agcgcgtcct gcatgatggc gatcagacga ttcatctgcc ggcgaggtgc 5520 acctctggaa atcttttccg acaacggcac taactttcaa gccgccagca aggagctggc 5580 tcagacggtg aaacgcatag agctggaatg tgcagacatc ttcaccgacg caaggactcg 5640 gtggaacttc aatcctcctg cggcgccgca ttttggtggt gtttgggagc ggcttgtaag 5700 gtcggcgaag gacgcgatca aggctctaca cgatggaaag aagctgacgg atgaggttct 5760 tctcactgtg ctggccgaag ccgaggacat ggtgaactct cggcccctca cctacatgcc 5820 ccaagaatcc gcagatgacg aagcgctcac gcccaaccac ttcatccgcg gcctgcctgc 5880 tggggaacgc gacgaagcca acagcccaac tagttctgct gaagctttgc gtgataactt 5940 caaacggtct caacaactcg cggacatgct gtggcagagg tggttaaagg agtacgttcc 6000 cactatcaac catcgtacga agtggtgtac cgagcaagac tcggtcaaag aaggtgagct 6060 ggtgtacctg gtcgacggaa acaaccgcag gacctgggtt cgtggaatcg tcgaaagggt 6120 catcctaggt gcagacggta gagtacgaca agcaatggtg agaacatcga aaggtgtcta 6180 ccggcgtcca gttgccaagc tagcggtact ggaatgtagg agtaaatctg gccaagacct 6240 tggtcctgga ccagagttac gggaggggga at 6272 // ID CR1-1_TCa repbase; DNA; INV; 2416 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.04, Created) DT 04-APR-2009 (Rel. 14.04, Last updated, Version 3) XX DE CR1-type retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW CR1-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-2416 RA Jurka J.; RT "CR1-type retrotransposons from Tribolium castaneum."; RL Repbase Reports 9(4), 735-735 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 485..2296 FT /product="CR1-1_TCa_1p" FT /translation="MALRAARKNYYGQRIQNADNKSKCAWNIVNKICGRSK FT KCCEIQIEGDPKQIVNDINQYFINSAPFVRSKSNNNTISSNAKTMYXYPVT FT SQEIITIIQNLKNKKSAGFDEISNNIIKYCNEEISAPLSFVIQNSLDYGIF FT PEQLKRAQVLPIYKKGDPTLISNYRPISLLSSFSKIFETVISSRITNFLMK FT FQLLNSSQHGYIKGKSVDSAIFSFTNALLDKLEEKKVVLGIFLDFSKAFDC FT LDHNILISKLKKYGIRGIMLKLITSYLNNRYQQVTITKNSTVYTSNKLLIK FT QGVPQGSILGPLFFVIYINDLLNILTVSDNNSAVNFADDTNILIWGENMTD FT TMKEAKICMNKVINWTKENKLVLNFEKTYGIHFAICNHKNTEKPNKIVISS FT KEIELIEKTNFLGICIDHKLSWTNHIDNLLKKLNSVHFTIRVLKKYINIEQ FT LRIIYFANFQSLLSYGIIFWGQSGKISKVFVAQKQVMRTMFNLKYNETCRN FT IFKSNNFLTVTGLYIYRLLLFFFKQNLLFNVTNDHKYDTRKVDLEYPRHKL FT TLTEKNSYYAGIKLYNSLPVHLRTPCNLNLFKKKIFDYILDLEPYTLNEFY FT NRQVF*" XX SQ Sequence 2416 BP; 972 A; 336 C; 331 G; 775 T; 2 other; attctaagca ttgtaaaagt attttttatc atataatatg tatgaaatgg ttcaagaaaa 60 cacaagaata accgcacgca gtaaaacaag aatagacaat gtttttacaa atatgacctt 120 taataacttt aaaatagaag ttcttgaacc acacatatcg gatcacatgg gtatacagct 180 aactacgagc ataaaaaata atactaataa tactgatgca gtaaaaaacg ttagaatttt 240 aaatgtatat gcatgtaatc gcaatgatat caatgctcag tggagcaatt ttataaatat 300 tttcaatgat aaatttaacg aatgttttcc tctaaaacaa ataaaagaaa aaaaaacaag 360 aaaatattta taaaaatgat cctaaagtaa aatattataa atattatcta gatttgttat 420 ctgtgataag taagtgtgac gataaatatg ttattcttta taagtatttg aaaaaaaata 480 cgacatggct ctacgagcgg cacgtaaaaa ttactatgga caacggattc aaaatgcaga 540 caataaatca aaatgtgctt ggaatattgt taacaaaatt tgtggcagaa gtaaaaaatg 600 ttgtgaaatt caaatcgaag gtgatcctaa gcagattgtt aatgacatta accagtattt 660 cattaattca gcgccctttg tacgaagtaa gtcaaataat aatacgatct catccaatgc 720 taaaactatg tacntatacc cagttactag tcaagaaatt attactataa ttcaaaattt 780 aaaaaataag aaaagcgcag gctttgacga aatatctaat aacataatta aatactgtaa 840 tgaagaaatt agcgcacctt tatcttttgt tattcaaaac tctttagatt atggaatttt 900 cccagaacaa ttaaaacgag cgcaagttct tcctatatac aaaaaaggtg acccaacatt 960 aatatcaaat tataggccaa taagtctgct gtcatcgttt tctaaaattt ttgaaactgt 1020 tatatcttcc agaattacta actttttaat gaaatttcag ctactgaatt catcccaaca 1080 cgggtacatt aaagggaaat ctgttgactc agctatattt agcttcacta atgctctgtt 1140 agataaatta gaagaaaaaa aggtggttct tggaattttc ttagatttct ccaaagcctt 1200 cgattgccta gatcataaca tcctaataag taaattaaag aaatacggca taagaggaat 1260 tatgttaaag ttgatcacat cttatcttaa taataggtat cagcaggtca caataacaaa 1320 aaacagtact gtatatacat caaataaact tctaattaaa caaggtgtac ctcaaggtag 1380 tatactaggg cctttatttt tcgtaattta tataaacgac ctattaaaca ttttaacagt 1440 ctcagacaac aatagtgcag ttaactttgc tgatgacaca aacatattaa tttggggtga 1500 aaatatgaca gatacaatga aagaagcaaa gatwtgcatg aataaggtaa taaactggac 1560 taaagaaaat aaattagtgt taaattttga aaaaacctac ggcattcatt ttgcaatatg 1620 taatcacaag aatacagaga aaccgaataa aattgtaata tcatctaaag aaattgaatt 1680 aatagagaaa actaattttc tgggaatatg cattgaccat aagttaagct ggactaatca 1740 tatagataat ttattaaaaa aattaaactc agtacatttt acaatacgtg ttttaaaaaa 1800 atacattaac attgaacaac taagaataat atacttcgca aattttcaat cattactaag 1860 ctacggtata atattttggg ggcaaagtgg aaagatatca aaagtatttg ttgctcaaaa 1920 acaagtcatg agaactatgt ttaacttaaa atataatgaa acttgtagga acatctttaa 1980 aagcaataac tttttgacag taactggttt atacatatat agattacttt tatttttctt 2040 taaacagaat ctactattta atgtcactaa cgaccataag tacgacactc gcaaagtaga 2100 cttagagtat ccaagacaca aacttacgct gacggaaaaa aattcttatt atgcaggaat 2160 aaaactatac aatagtttac ctgtgcatct acgcacacct tgtaatttaa atttatttaa 2220 aaagaaaata tttgactaca ttttggatct cgaaccatat actttaaatg aattctataa 2280 tagacaggtt ttttgactta tgtcttaaaa ttatttttat gttattgtta tttctgtgta 2340 taatactgac gtttttcgta tcaattttgt tactacatct gtataaattg atagaaataa 2400 attattatta ttatta 2416 // ID CR1-19_HM repbase; DNA; INV; 4550 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-19_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4550 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1847-1847 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(805..1524,1454..4474) FT /product="CR1-19_HM_1p" FT /translation="MTNLTPAQLKQVEDAIKKLIHAETAKWSAKISELEQV FT IKQQQLIIDKLIQNPNELTNTTTSSIESSWVTVASKNIKKKQPIEQLKIIN FT TFAIEQQQKKKREKNVIIYGIKESAKLSVADKHIDDKETVQNILVTISSDA FT KPKSVKRIKSTSSKPGPIIVELSDVSERNPLLTAAKKLKNSDNYKNIFISP FT DLTEAERYLNYQLRMKRNELNSNINLKTSPFRYGIRSSGVVKLKNIQTTTK FT HRPLDTVYEAAESSNSKIYKPLHKHYYPSSTLLPANLKYKSKNLHNKKTNY FT SVNCYYTNATSLDNKMADLISISRTFDYHVIAISETWFRNDSVPNIDGYSL FT YRKDRQDGRRGGGVAIYITDVFNSHEVNQFYTNLLEQVWVSVKFNNKHILI FT GCIYRPNDASLIKYVAESFKLAHSLIKKPYDDCLIVGDFNFPNIKWSNDGI FT ESFNPNNPTEYDFWNIFQNSLFTQHITFPTFQTDYGSSNNMLDLLFTYNSN FT SILNLSPSTILGNTTKGHFSITWRFLVSELFTESESQSFNYKKGDYDSISN FT FIASIDWEFLFLHKTADQMFNILISNYRTACTYFIPKINKTKKKYDPWVTP FT HIKNLIREKKSAWHINCSTKWENSELKIKYNRLNRSVKTEIFKAKKAHEKE FT IISKSQNNPKIVYKYINKQLKIKDRIRSLKDTSGRIIEALPDIVDLLNSQF FT KSMFTLESSENFPLLNKSGILSECIYNENLDFSISDIIQELADLKPYKSFG FT SDGLHPLILKICAKSFAIPLTKIFHKSFNSGVVPTQWKLANISPIYKKKGD FT KLLAENYRPISLTSVACKIFEKLVKKKILKHINDNKMLSANQHGFLYHKSC FT VTNLIETFDIISSSIASHQFIDVIYLDFAKAFDSVPHNRLLLKLRSFGING FT KLLQWIKSFLVGRKQRVTINNTSSAWVDVLSGVPQGSVLGPLLFILYTNDL FT AKLTETSEIFADDTKLFSKTLSQSNFVDLQDRLRLVYEWSNTWLLKLNTSK FT CCVIHYGRNNPHYNYFLQNNQGINQSLRSSTKERDLGVIFTSNLKWSAHIQ FT TVVSKSLQILGMLKKTFTFIDEKMALKLYKVFIRPHLEYGVSIWAPFLKAD FT MALVESVQRKATKWAYSLRHLQYCERLKRLNLPSLEFRRLRCDLIQIFKLI FT HRFDIVNDKYLPKYINSVSRGHHSKIRREYVLNCLPRYNHLYNRSAKYWNN FT LPEQITLYSTVLSFKNRLDSWLWKKFGTNIYKTGCL*" XX SQ Sequence 4550 BP; 1617 A; 805 C; 599 G; 1529 T; 0 other; cttaagcctt tgagccgata agacgtgttt ttgtttgagc gtaagttttc aggaatttta 60 aatttttttc tttttttttt tttttttttt tccgtattct ttatctttct tttttactta 120 tttttctaac tttttctaac cactacttct ttttctactt attgctttat ctaattagct 180 tattaattac ttttctactt aaaattatag taatatatta actattcgtg tctataatac 240 agttttttgt tgtacttcgc atacctaaat taaatataat tcttatattt tatacctata 300 cttatatttt attatatata tctttattta tatattacta ttataaaaat agttataact 360 gagatttttt tttatgcttc agttcgctta attttttttt tttttttttt tacagagtta 420 taaaaagttt tttttttttt tttttttttt tttttctttt tcttcgccaa aagttgcgtt 480 tgcacctgtt ctttgttttt agttttctga acaataagct aaatttttgc gctctgacac 540 aaactatatt agtaaattat atatccatat attatatata aaatacatat aaaaaagtta 600 tatataaaca tatacaaata atataaagaa attaaatata tacaaataaa tatgataaca 660 cggaaaaaaa cacatactga atctttttag tccctgtgta ataagttatt tgtcataatt 720 gtaacaaata aatttgcaat aattgtaaca aataaataaa cataaaaaca tcactgatat 780 aaaaaaaaaa aaaattcatc tgtgatgact aatttaacac cagctcaatt aaagcaagtc 840 gaggacgcaa ttaaaaaact tattcatgct gaaactgcga aatggtcggc caaaatatct 900 gaactagaac aagtaataaa gcaacaacaa ttaataatag ataagctaat ccaaaatcct 960 aatgagctaa ctaataccac aaccagtagt atcgaatcca gctgggttac agttgcaagc 1020 aaaaacataa aaaaaaagca gcctattgaa caactaaaaa ttatcaacac atttgccatt 1080 gagcaacagc aaaaaaagaa gcgtgaaaaa aatgttatta tttatggcat taaagaatct 1140 gcaaagttat cagttgctga taagcatatc gacgataagg aaacggttca gaatatctta 1200 gttactattt catctgatgc aaaaccaaaa tctgtcaaac gtattaaatc tacgtcctct 1260 aaaccaggac caattatcgt cgaactttct gatgtctccg aaagaaaccc attattaaca 1320 gcagcaaaaa aattgaaaaa ttcagataat tataaaaata tttttataag cccagatttg 1380 actgaggctg aaagatattt aaactatcag ttaagaatga agcgtaatga attaaatagt 1440 aatatcaatt taaaaacatc gccctttaga tacggtatac gaagcagcgg agtcgtcaaa 1500 ctcaaaaata tacaaaccac tacataaaca ttactaccct tcttccactc ttttaccagc 1560 aaacttaaaa tacaaaagta aaaatcttca taacaaaaaa acaaactata gcgttaattg 1620 ttattatact aatgctacat ctctcgataa taaaatggcc gatttaattt caatatctag 1680 aacgtttgat tatcacgtta ttgctatctc tgaaacatgg tttcgaaatg actcagtacc 1740 taatatcgat gggtatagcc tatatcgtaa agatcgtcaa gacggacgtc gaggtggcgg 1800 tgtagctatc tacattacag acgtatttaa ctctcatgaa gttaaccaat tttatacaaa 1860 tcttctcgaa caagtctggg tatctgtaaa atttaacaac aaacatatac ttatcggttg 1920 tatatacaga ccaaatgatg ccagtttaat aaaatatgtc gctgaatcct tcaaattagc 1980 tcatagttta attaaaaagc cctatgatga ctgtttgatt gtcggtgact ttaacttccc 2040 gaatataaag tggtctaacg atggtattga atcttttaac ccaaacaatc ctaccgagta 2100 tgatttctgg aatattttcc aaaacagcct ttttactcaa catatcacat ttcctacttt 2160 ccagactgat tatggttcct caaataatat gttggacttg ttatttacct acaacagtaa 2220 ttctatttta aacttgtctc ctagtactat actaggtaac accactaagg gtcattttag 2280 tatcacttgg agattcttag tttcagaact gtttaccgaa agtgaatctc aatcgtttaa 2340 ttacaaaaaa ggtgattacg attcaatatc taactttatt gcatcaatcg actgggagtt 2400 tttattcctg cataaaactg cagatcaaat gttcaatata ctaatttcta actatcgcac 2460 tgcttgcaca tattttattc ccaagattaa taaaacaaaa aaaaagtatg atccttgggt 2520 taccccacat attaaaaacc tgatccgaga aaaaaaatcc gcttggcata taaactgctc 2580 taccaaatgg gagaatagtg agcttaaaat aaaatacaat cgccttaatc gttcagtaaa 2640 aactgaaata tttaaagcaa aaaaagcaca tgaaaaagaa ataatatcaa aatcccaaaa 2700 caacccaaaa atcgtttaca aatatatcaa caaacaacta aaaattaaag atagaattcg 2760 ctctctcaaa gatacttcag gcagaattat tgaagctctt cctgatatag ttgatcttct 2820 gaactctcaa tttaaatcta tgtttacact tgaaagctct gaaaattttc ctttgctaaa 2880 taaaagtggt attctttctg aatgcattta taatgaaaac ttggatttct ctatctcaga 2940 tattattcag gaacttgccg atctaaagcc ttataaatcc ttcggatctg acggcttgca 3000 tcccttaata ctcaagatat gtgccaaaag ctttgcaatt ccattaacaa aaatctttca 3060 taaatccttt aattctggag tagtacccac tcaatggaaa ttagcaaata ttagtccgat 3120 ctataaaaaa aagggagata aattacttgc tgaaaactat agacctattt cactaacatc 3180 tgttgcctgt aaaatctttg aaaaacttgt aaagaaaaaa attttaaagc acattaacga 3240 taataaaatg ctctctgcaa atcaacatgg cttcctttat cataaatcat gcgtaactaa 3300 cctaatagaa acttttgata tcatttcgtc ctccatagca tctcaccaat ttattgatgt 3360 aatctatctt gactttgcca aagcgtttga ctctgtacct cacaacagac ttttgttaaa 3420 attaagatct ttcggaataa acggcaaact cctacaatgg attaaatctt ttttagtggg 3480 cagaaagcaa cgcgttacaa ttaacaatac cagctctgca tgggttgatg ttctcagtgg 3540 tgttcctcaa ggttctgttc tcggcccgct cctcttcatt ctctacacaa atgatctcgc 3600 aaaactcact gaaacttctg agatatttgc cgatgatact aaactttttt ctaaaacctt 3660 atcgcaatct aattttgttg atctgcagga tcgtctacgc ttggtatacg agtggtcaaa 3720 cacttggtta cttaagctaa acacttcaaa atgctgcgtc attcattacg gtcgaaacaa 3780 cccccattac aattactttc ttcaaaataa ccaaggaatc aaccaatctc taagaagctc 3840 tactaaagaa cgtgatttag gagtcatttt tacatctaac ctcaagtgga gcgctcatat 3900 tcaaacagtt gtctctaagt ccttacaaat tctcggtatg ctgaaaaaga cattcacttt 3960 cattgatgaa aaaatggctc tcaaattata taaagtattt ataagaccac atcttgaata 4020 tggagtgtcc atttgggctc cttttttaaa agctgatatg gctttggttg aatctgttca 4080 acgcaaagct acaaaatggg cttacagctt gaggcatctt caatactgcg aaaggctaaa 4140 aagattaaac ctaccatcac ttgagtttcg acgccttcga tgcgatttaa tccaaatttt 4200 taaattaata catcgttttg atatagttaa tgataaatat ctaccaaaat atattaactc 4260 cgtgtctcgt ggtcaccatt ctaaaataag gcgtgagtat gttctaaact gtcttccaag 4320 atataatcat ttgtataata gatcagcaaa atactggaac aatctaccag agcaaataac 4380 tttgtactcc actgtacttt ctttcaaaaa tagactagac tcctggttgt ggaaaaagtt 4440 tggaacaaat atttataaaa ctggttgttt atagtggggt gtttcccctg ctcgctgtat 4500 gtgtacattc tatgtaaatt tacggctaca actataataa taataataaa 4550 // ID Gypsy-59_CQ-I repbase; DNA; INV; 11272 BP. XX AC AAWU01037375; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-59_CQ_; KW Gypsy-59_CQ-LTR; Gypsy-59_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-11272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 497-497 (2011). XX DR Genome; AAWU01037375; Positions 26390 15119. XX CC Positions [4705-5166] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 284..1714 FT /product="Gypsy-59_CQ-I_2p" FT /translation="MTDRILELRSRGIQFVTELKRRRSERQRRSYSFSEPP FT EDPDHNPGPLIHSFSAEFLFAKGVQANSSLLDKSKFEADLEEISFKDPEIP FT VPKEELLTFLEEHSVLDPEVDPEVEDLPNVTFNELQASFIETSLNFEKLLF FT KMAFAFVKAVTCIPDFDGSYEQLQTFLDVVDIFARQAIPAAPAAAAAGDNN FT NDAAENVNANEIQLLAAVRMKLKGKALDKASEIMKPTWALTRASLQEQFQS FT VISMESISRMINNLKQEYAESFVDYKERADKIYSYVRLLGENEFADRQLRL FT NFISGIRLASMRQFATTLDSATYADLAKELKKKGDFMDEVTECWQKKNKAR FT IAFDSNFNPNSSRTNDQSSRNNSQRQNYSYNNNNNNNSRNNNNNSRRDNTN FT QNHSFSNTNSPNQNSSNRQNSNVSDLNNSSNFRPSLNSTGFNHTRQNQNRN FT NNSTLFTQNPNQVEIPFCEPSTSYMTAMSSKN" FT CDS 1786..5517 FT /product="Gypsy-59_CQ-I_1p" FT /translation="MGLNKKVKIDIFHSPNKHFLIEAITERCPNTKTTFLL FT DTGAHRNFLKASLLSQLKIKSKDIDKADIIPVIGVSNTQVMSIGSVWVKLC FT IEGVTFPIKFHLLENLLAAPAIIGAEFLKTHTTFVGNHFEYMIWQRFPDVT FT VEYEKEHCPTLSNDFTQYDESFTGIVNETVSQEEWEENLLDEILDLEKNTD FT CDFIEYKNTEPIEGHERLQELAKLIDLSHLHPENFAGVRDIIIKHKDIFFL FT AGDRLSTTYAAKHEIETTTNVPINKRQYRFPEATKSHINEEIEEMLKQGII FT KPSTSPWNAPVLCVPKKPDPNGNKRYRIVVDFRALNTITKSFIYPIPLINE FT ILDNIGESSYFSSLDLKSGFYQVPIDPRDAQKTAFSTPRGHFEFTRMPMGL FT KNSPSTFQRLMNSIIYEIGDVQAFVYLDDIIIFGKNAKEHNHHLKKVLNAL FT RKHNLKIEPGKCQFLKNEIQYLGHIISKEGIKPTNANIKVIQGLKPPKTIK FT EVRSFLGTVNFYGKFIPDIAGKRKPLNDLLKKNVKFVWSDDCQRAFEELKE FT SLISEPLLVRPNYKDTFVLTTDASDYAVGTVLSNEKTIDRPIAYASKALKD FT SEKRYHTIEKELLAIVWAIEYFRHYIFNQKFIIYTDHRPLIAIDRLKETSP FT ILTRLRLKLIGIECDIRYKQGRENIVADFLSRINTPEQEEIQAAVTTRAQN FT KKNEEGKDKEVISADQSRLTPPNDSDENNPLTILPYVNNDIRETENKAKED FT LYSCFEYAVTNYAFMDTNITPSDNFFTQEDCDGRFVIANSKNLHNELSELC FT KLPHGSRQFAREKVINIPENKLWGIVVHGNSRALIESRKFFDLFVTHFLEC FT FKNFKINSKIQIIAFRQIRQPEILQMIEFVAWKLNMNIHLYNANAERINVR FT PDEVETVLREFHDAPLGGHIGAKRMRKRVGLVYHWKNMRRDIENYVRQCDS FT CQRNKIGKSNKIPMKITSTASEPFEKIYMDIVVLPESECGNKYGLTVQDDL FT TRYLNIIPIANQESVTIARALICKFGTPLEVVTDRGTNFMSNLMKEVCKLL FT KIKKICTNAYHPQANLVERSNRELKTYLRQFVCNRPKMWDQQIPYFLFEYN FT TAFNSSTGFSPYELIFGKSPRIPASIYVVKEGMTYNEYVLEMKKIFADIHT FT KALENLVVSKTKSKQIYDESANEWQPMWGEKVYVHNVPTGTGQKLQSYWRG FT PYEVVELKSEQTTVLKNGNKLEEVHNNRLKRYVD" FT CDS 5768..7522 FT /product="Gypsy-59_CQ-I_3p" FT /translation="MKLRLSDLVTGGKLLNNSILNFQSTCDTLVNQSSTIG FT CSKISKNLESIVTQISDRNDYLKTFVKKRAKRSWHTLGIMNTDDLARIDND FT LDRLRGNEEQIKNSINRQTEVIDSIYNFINNSMEQLNSKILSLHIKVKNVT FT HLTNEDKKFHNKIRETLNLENELMGLGLWIQLVSEELKQQQQIFTQLILNV FT EQSIDTTLVTKLVEPKLLLNILMSQTNTLPVDTTFPRRRDNQIYPEIVNLI FT EISSKVGSNLEIHVSLKIPLVNRKIYETYEAETEPGLLNDLVSFIKVDPNV FT LLLAEGTNWGHIISSTEFDNCQQLGDVAMCQLQAIEEDLADKNECLTMFYF FT KNSTATCEIRVLKATTNLWFKSREPNVWTYVAPNKTDIELLHGKNITRLSV FT YGTGKIRLNVNMQIRTKHVRIDYVNYNESEITELIIHAPNQTYLNLSNFNY FT KINDIPLIEEGKNIYSIIDHKSLFNLGIDVKDLQNYKTTLKNIIYDPMDHP FT WEFTGLLSGVALGMGGLICLILCNFKGSVVVNKTRVVPNTQTFDLSDTMFQ FT GKIFTEKPATIIDCGTNMHTVKPSNYGVKPSYYGIKIQ" XX SQ Sequence 11272 BP; 4292 A; 2256 C; 1962 G; 2762 T; 0 other; cgtggtgaca gcgcaaaccg gctcaaggag agtcacggaa cgccattcct gaccaggaca 60 aaaccagatg agatccctca taatactgtg gtaagtaact attttttttt tatctttttc 120 tgtgtgatga aaaaaaattg taaagtgaaa attgcagtga taatttgtgt gtaacgttta 180 aagagaaatt aatgaaattt gtgtttttat tctctcttgt cattaaattt atttttcatt 240 ttaatgacga aatcacgcaa taaaaaattg gattacattt taaatgacgg acaggattct 300 tgagcttcgt tctcgtggaa tccagtttgt tacagaatta aaacgacggc gaagtgaaag 360 acaacgtcgg tcttactctt ttagtgaacc gccagaagac ccggatcata atcctggacc 420 tttaatccat tctttttctg ctgaattcct ttttgcgaaa ggagtgcaag caaactcttc 480 cttattagat aaatctaaat ttgaagcaga cctcgaagag atctctttta aagatccgga 540 aataccggtt cctaaagaag aacttttgac ctttctagaa gagcacagtg tcctcgatcc 600 agaagtagat ccagaagtag aagacttacc gaacgttaca ttcaacgagc ttcaagctag 660 ctttattgaa acaagcctta attttgaaaa attattattc aaaatggcct ttgcatttgt 720 caaagccgtc acttgtatcc cggatttcga tggctcctac gaacaactac agaccttttt 780 ggatgtagtt gatatttttg caagacaggc catccctgcg gctcccgcgg cagctgcagc 840 gggagataat aataatgatg cagccgaaaa tgtcaacgcc aacgaaatcc aattacttgc 900 ggcagtcagg atgaagttga aaggaaaggc cctggacaaa gcttcagaaa ttatgaaacc 960 aacatgggcg ctgaccagag caagtcttca agaacaattc caatccgtaa tctcgatgga 1020 gagcatctcc aggatgatta ataacttaaa acaagaatac gcggaaagct tcgtcgacta 1080 caaggaacga gcggacaaaa tatattcata tgtgagattg ctgggagaaa acgaatttgc 1140 cgacaggcag ttacgtttaa atttcatatc aggaattaga ctagccagca tgcgacaatt 1200 tgcaacaacg cttgactctg caacttatgc agaccttgcg aaagaattaa agaaaaaggg 1260 agatttcatg gatgaagtca cagaatgttg gcagaaaaag aataaagcca gaatagcctt 1320 tgattcgaat tttaacccga atagctcaag aacaaacgat caaagctcca ggaataactc 1380 acaaagacag aattatagtt ataacaacaa caataacaat aacagtagga acaacaataa 1440 caattccaga cgggacaaca cgaaccagaa ccactccttt tcaaacacaa acagtccaaa 1500 ccagaattct agcaatagac agaattcaaa tgtttcagat ttaaacaatt catctaattt 1560 tagaccctca cttaacagca cggggttcaa ccacacaagg caaaatcaga atagaaataa 1620 taattccaca ctctttactc aaaatcccaa tcaagtggaa ataccttttt gtgaacctag 1680 cacatcctat atgactgcta tgagttcaaa aaactaattg aagggggctt tctggatgag 1740 gaacatcgaa agcctcctca aattgaacaa tttgtaggtt gtactatggg actaaacaaa 1800 aaagtaaaaa ttgatatttt tcattcccct aataagcatt ttttaataga agcgataaca 1860 gaacgatgcc ctaacaccaa aacaacattt ctgcttgaca ccggtgcaca ccgaaatttt 1920 ctgaaggcat cattactatc acaattaaaa ataaaatcaa aagacataga taaagcagat 1980 ataataccag taataggagt gagtaacacg caagtgatgt ccataggatc agtttgggta 2040 aaactgtgca tagagggagt cacattccct attaaatttc acttgcttga aaatcttttg 2100 gcagcaccag caattatcgg ggctgaattt ttaaaaactc acacaacatt tgtgggtaat 2160 cacttcgagt atatgatttg gcaaagattt cctgatgtaa cagtggaata cgaaaaagag 2220 cactgcccta cgttaagtaa tgatttcacg caatatgatg aatcatttac gggcattgta 2280 aacgaaacag tatctcaaga agaatgggag gaaaaccttc tagatgagat acttgattta 2340 gaaaaaaaca cagattgcga ctttatcgaa tataaaaata cagagccaat tgaaggacac 2400 gaacgtttac aagaactagc caagctgatt gacctaagcc atttacatcc agaaaatttt 2460 gccggagttc gcgatattat aataaaacat aaagatatat tctttttagc aggcgacagg 2520 ctatcaacca cttacgccgc taaacatgaa attgagacaa ccacgaatgt tcctatcaat 2580 aaaaggcaat atcggttccc agaagcaacc aaaagccata taaatgaaga gatagaagaa 2640 atgctgaaac aaggcataat taagcccagt accagtcctt ggaatgctcc agtactttgt 2700 gtaccaaaga aacccgaccc aaacggcaac aaacgttatc gaatagtggt agatttcaga 2760 gctttgaaca ctattacgaa aagttttata tacccaatcc cgcttatcaa tgagatttta 2820 gataacattg gggaaagctc atacttctct tcactagatt tgaaatcagg cttttaccag 2880 gtaccaatag atcccagaga tgcgcaaaaa acggcattct ctactccaag agggcatttc 2940 gagttcactc gaatgcctat gggtctgaaa aacagcccaa gcacttttca acgtctaatg 3000 aactcaatta tttatgagat cggggacgta caagcttttg tttacttgga tgatataatc 3060 atttttggaa aaaacgcaaa agaacataat caccatttaa agaaggtttt gaacgcatta 3120 agaaaacata acttaaaaat agaaccggga aaatgtcaat ttcttaaaaa tgaaatacaa 3180 tatctcgggc atattatatc aaaagaaggt ataaaaccca ctaatgcaaa cataaaagtt 3240 attcaaggcc ttaaaccgcc caaaacaatc aaagaagtaa gatcttttct gggaacggta 3300 aatttttacg gtaaatttat cccagatata gcagggaaaa gaaaacccct aaatgacctt 3360 ctgaaaaaga atgtaaaatt cgtttggtca gatgattgtc aacgggcatt tgaggagcta 3420 aaagaatcac ttatctcaga accattactg gttcgcccca attataaaga cacatttgtg 3480 ctgacaacag atgcaagtga ctatgccgtc ggtacagtat tatcaaacga aaaaaccatt 3540 gatcgcccga ttgcgtacgc aagcaaagct ttgaaagatt cagaaaaacg ttaccacaca 3600 atcgaaaagg aacttttagc gatcgtttgg gcaatagagt attttcgtca ttatatattc 3660 aaccaaaaat ttattattta cacagatcat aggccattaa tagccattga tcgactcaaa 3720 gaaacatccc ctatacttac acgtctaagg ctcaaactaa taggcatcga atgcgatatt 3780 cgatacaaac aggggaggga gaacatagtg gcagattttc tgtcccgcat taatacccca 3840 gaacaagaag aaattcaagc agcagtaacc accagagccc aaaataaaaa aaatgaagaa 3900 ggaaaagaca aagaagtcat ctcagcagac cagagcaggt tgacgccacc gaacgactcc 3960 gacgaaaaca acccactcac tatacttcct tatgtaaata atgatatccg agaaacggaa 4020 aacaaagcca aagaagacct gtactcttgc ttcgaatatg cggtaacaaa ctatgccttc 4080 atggacacaa acataactcc ctcagataat ttttttacac aggaagattg cgacgggaga 4140 tttgtcatag caaatagcaa aaatctgcac aacgagctat cagaattgtg caaattaccc 4200 catggaagtc ggcaattcgc cagggagaaa gttatcaata taccagagaa taaattatgg 4260 ggcatcgtag tccacggtaa cagtagagcg ctgatagaaa gtcgaaaatt tttcgatctc 4320 tttgttactc attttttgga atgttttaaa aattttaaaa ttaactcaaa aatacaaatt 4380 attgcattcc gccaaatacg tcaaccagaa attctgcaaa tgatagaatt cgttgcatgg 4440 aaattgaata tgaatatcca tttatataac gcaaatgccg agaggattaa cgttagacca 4500 gatgaggtcg aaacggtatt acgagaattt cacgatgctc cgctaggggg acatatcggg 4560 gcaaaaagaa tgcgtaaaag agtgggtcta gtataccatt ggaagaacat gagacgggat 4620 attgaaaact acgtcaggca gtgcgattca tgccaaagaa acaaaattgg aaaatcaaac 4680 aaaatcccaa tgaaaatcac ttcaacagcc tcagaacctt tcgagaaaat atatatggat 4740 atagtcgtcc tcccagaatc tgagtgcggt aacaaatatg gattaacagt gcaagatgat 4800 ctaacaagat accttaacat tatcccaata gcaaaccagg aaagtgtcac aatcgctagg 4860 gctttaattt gcaaatttgg cactccccta gaagtagtga cagatcgggg aacaaatttt 4920 atgagtaatc taatgaagga agtgtgcaaa ctgcttaaaa tcaagaaaat ttgtacaaac 4980 gcgtaccatc cgcaagcaaa tttggtagaa agatctaata gggaattaaa aacctatctg 5040 agacaatttg tctgcaacag accaaaaatg tgggaccaac agatacccta ttttttgttt 5100 gaatataaca cagcctttaa ttcatcaacg ggtttttctc catacgaatt aatcttcggg 5160 aaatcaccta gaatcccagc ttcaatctac gttgtgaaag aaggcatgac atataatgaa 5220 tatgttctgg aaatgaagaa aatttttgca gatattcata caaaggcact ggaaaatctt 5280 gtcgtgagca agactaaaag taaacagata tacgacgagt cggcaaatga atggcaacca 5340 atgtgggggg agaaagtata cgttcacaat gtccctactg gaactggtca aaaattacaa 5400 tcatattgga gaggtccgta cgaagtggtt gagcttaaat cggaacaaac caccgttttg 5460 aaaaatggga acaaattaga ggaagtccac aataacaggc ttaaacgtta tgtggactaa 5520 atattcaaaa aattatataa acgaaaaaaa aaaagggtaa ttaaaaaaaa aaaaccatat 5580 gcaatataaa aataaaaaca ataaaagcaa taacactact taacaaaaaa acaaagttga 5640 cacactactt tgttctcctt tttacaggtt ctgcatctcc atagcggaac caggagaaat 5700 aaacccatca cttatcctga taagaaggga gaaggtccta gttaaacaag gacactatga 5760 attagggatg aagctcagat tatcagatct agtaactggg ggtaagttac taaataactc 5820 aatcttaaat tttcaatcta cttgcgacac acttgtcaat caatcatcca caatcggatg 5880 ctcaaaaata tctaaaaacc tagaatcaat agtcacacaa atttcagaca gaaatgatta 5940 tttaaaaacc ttcgttaaaa agcgcgcaaa acgttcatgg catacactag gcattatgaa 6000 cacagatgac ctcgcccgca tcgataacga tttggatcga ttgagaggaa atgaagaaca 6060 gataaaaaat tctatcaatc gacagaccga ggtcatagat tcaatataca atttcatcaa 6120 caactcaatg gaacaattga actccaaaat tttaagccta cacatcaaag taaagaacgt 6180 tacacactta acaaatgaag ataaaaaatt ccacaacaaa atccgagaaa ccctaaattt 6240 ggagaatgaa cttatgggcc taggtctatg gattcaattg gtctccgagg aactaaaaca 6300 acagcaacaa atttttacac aattgattct taatgtggaa caaagtatag acacaacact 6360 ggttaccaag ttagttgaac caaaattgtt attaaatata cttatgagtc agacaaacac 6420 ccttccagta gacacaactt tcccacgacg cagagacaat caaatttatc cagaaattgt 6480 aaatcttata gaaatttcta gcaaggtagg gagtaactta gaaattcacg tatctctaaa 6540 aatacctttg gtcaaccgta agatttatga aacttacgaa gcggaaacag aaccgggttt 6600 attaaatgac ctggttagtt tcattaaagt tgatcccaat gtattgcttc ttgcagaggg 6660 caccaattgg ggtcacatca tcagcagtac agaattcgat aattgccaac aactcgggga 6720 tgtggccatg tgccaattac aagcaattga ggaagattta gcggacaaaa acgaatgctt 6780 aacaatgttt tattttaaaa attctacagc aacatgcgaa atacgcgtgt tgaaggccac 6840 aactaatctt tggtttaaat caagagaacc aaatgtatgg acctatgtcg cacctaataa 6900 aactgatata gagctattgc atggtaaaaa tataacacga ctgagtgtat acggaacagg 6960 gaaaatccga ctaaacgtca acatgcaaat ccgtactaaa catgtacgaa tagactatgt 7020 aaattacaac gaatcagaaa taacagaact aataatacac gcacccaatc aaacgtatct 7080 aaacttatcg aacttcaatt acaaaataaa tgacattccg ttaatagaag aaggcaaaaa 7140 catatattca atcatagacc ataaaagcct gttcaattta ggaattgacg taaaagattt 7200 acaaaactat aaaacaactt taaaaaacat catatatgac cctatggatc acccttggga 7260 attcacagga ttactttcag gagtagcctt aggcatggga ggactgatct gtctaattct 7320 gtgtaatttt aagggatcag ttgtagtgaa taaaacaaga gtggtcccta atacacaaac 7380 cttcgacttg tcagatacga tgttccaggg aaaaatattt acagaaaaac ctgcaacaat 7440 catagattgc ggaactaata tgcacaccgt taaaccatcg aactacgggg ttaaaccatc 7500 gtactatggg ataaaaatac agtgaaaaaa aaaattacac cgaaggtata atagcaacta 7560 atcgaaacat aaaccaaaaa aaaaaacaaa attcacttac tgttttcaaa tttatattac 7620 ccaaattaaa ataactacat tttcatcctt ctgggcaata tttttttagc aacattccta 7680 aaaaaaaaaa aaaaataaaa taaaattata ataataaaat aataataatc aagcgagaat 7740 acgtcataga aatcgaactt accaagttct aacaacgccc ccaataaatc taccttcatc 7800 acgtaaggcg cgaacgatgc actaggacca ccggcatcgg attcagcatc ctcctcgctg 7860 gaagtatccc accaagggcc tcccaatgtt tcaagaacag ggggagtaga ccgtacggaa 7920 tgatcttaaa aataattata gttgtagatc atccactaca catcatcaca ctaaatcaca 7980 tgcactttta gaatcttacc agataagtcc agccaatctt caggcccagg aaaaccggga 8040 aaagaacggc taatggctac gtcctcccga aaatcaattt cggaaaggtc cgaaccggcg 8100 gagctgtaac ctgcatcgat ccacgtgtcg gccatctgaa cacacttttt tgaatgaaac 8160 tggctcaaaa agtcatcata taacacagcg attccaaacc ttattctagg gaccccgttg 8220 agaatcgctg acataaaaca tccatacacc tgcatgcatt tttttttagc acaactaatc 8280 caccactaga tttgcacaac tattttcctc acatataaac gtcaccacac atccatatta 8340 tcgcataaaa ctcatcactg catacaagat atctgataac acgaccatac catcacaatt 8400 atcacacgta aaaataaaat aaacacattc actaaatgca cgtattatta ccaccacaca 8460 gagcaccgaa tcaaagcagc aacagacgcg ataaggaaat tctaaacctg ccgggcgccg 8520 acgaaaacta caaggacgtg aagcagaaaa cagaacgttg caatttttaa ccctcaccgc 8580 aacatcataa atataaaaac cgaaaaaaaa gtaagtatgg cccaacagcg aattgacatt 8640 ttaaacaaaa tgctaatgtg tagcctgtgc cgcgttaaca cgccatggct catcaattta 8700 aatcaacgac cggacctaga gcgggacatc gtagaattta tcggtgtccc aaggtacgag 8760 accggccgta ttacgcgata cgcttgtcca tattgtctgg acaagatccg tatcgcgtta 8820 acaagaaaaa ggcaatttct ctcaacagaa tttgctataa ttcaatttgt tgagcaggga 8880 ataaatgcct cgccaaacaa tatcgatggg gtcttgcccc caattccata tcctgggact 8940 caattttatg gaacattgcc tacagcaaca tttccaaaca cccaaacaac aacatttcca 9000 aacagtgttc tgccagcggc acttcgctcg ttaacaccac ttcagacttc tcaaccgcta 9060 aagcggacca aaaacattaa aaaaaaaatg aacccaaatt aaagagatca cgcccttctg 9120 ggacacccaa aactatacac acccgtcaat caaaaaccaa aactaaaaca cagataaatg 9180 ttcagagctg tgagccagct aataccagct taccccccga gaacaactcg gtggtacgtt 9240 caccccgtca caattggacg aaggaggcag gaccattaga cacaatatcg catactaata 9300 ccacacgcaa tcctgaatct gaacatcaac caagcgaatt agaggcattc gatgcatttt 9360 tggcatcaca tcattattat aatcaacaaa ctcaggagaa ccgtcgtact ccgagtaatc 9420 caaatattat gcgatacact agcctctaaa gaaaagaagt acatgacaca ttctcctgac 9480 tattatacca gtgcgaatgc tgtcaatggc aaaatagatg tggttggatt tgtacaattt 9540 ggggcttata aaaccgtctt gtacaaacat tcaaatattt acaaaacttg aagaagagga 9600 taggataaat caatgatggc attcggttta gcactcagtt ttttagaatt aacacatgga 9660 tatatacaat tttttttccc ctttctcttg tctttattta caaaaccaaa ttagaaaatg 9720 actaccttta acaaggagcc aattgaagtt gtgataaaac aaatatttga catggtagag 9780 gaggaaaaca aaaaaattga ggtacaaatc agggaaggaa agatagatgg acaggtgata 9840 gggccaaccg aaaaggcgga tttcatatac gaaggtctta tggaagactt caggaaagag 9900 aggaaaatga aagaagcaaa gaggagaaga aacggacgca aattcgggtg ctggtagcta 9960 ggatgatagc taaggatcga tcggtgttca aaaaggcaag cgcatgatgg gtggcgacag 10020 tccgaaggcg ggccagagag acgattcatt ggatggcaga atgaacaaca tgaaggctag 10080 aaagagacta cgctgctagc agacgaacca agtgcaacta acggatacgt gaaaaccaaa 10140 gatgccaaat gaagataacg taccatgaca gaaatacaca cccaacgcat tctgtcaaaa 10200 aactcgcagg aaaaggaaaa aaaaaggagc gcttttcaaa cgcgcatact gaaaccaagt 10260 caacggggct cgattgttgc aagccaaacg caacatgtcg gcccacaata catgaatgat 10320 gcaagctcaa agcatcatcc acaatcaacc gtaaggtacc ccacaatcat tttcatcaaa 10380 acgtactgtt cagcccacaa cgcacaaaca tggttatcat acaactgtta aggaagtaca 10440 tatcaaacgt acttgcccta accacatggc aaaactatgg aattagccca acaggccaat 10500 cgtccccagc gcagtaaagg aaggatacaa ttatcaactg gattgaagaa tcatctctaa 10560 ggccagccca acgacctagc cattccagga gcggagccta agtaaaccga tcacacgcaa 10620 ccgaagcaaa catcgagacg acatcgagac gacatcgaga cgacgacgct tcacatggaa 10680 attctttctt tgaccgcaaa ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 10740 aaaggaggag gagacaaatt aattcacctc aaccagaagc aagaaaatta ctacatgagc 10800 aattattcaa tttttctttt aaaaaaaaaa cacatttttt atattacaca ttataaccac 10860 ttatattcaa tttttataaa cacatttttt atattacaca ttgtaaacac ttattccatt 10920 tttcttaaaa aacacatttt ttataataca cattataaac acttattcca tttttcttta 10980 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa acacacactt tttataaaac acattataaa 11040 cacttcttca attaataaac acttattcaa ttttataata cacatcataa acacttattc 11100 aatttttaat aaacacattt tttatattac aaattataaa cacttattcc atttttcttg 11160 aagaaacaca tgagaaaaac ataaaatcag attagcgcct tattcaaaaa cctgcgcgaa 11220 cacgaagatc gatctttcag taacgatcat cgagctgata agtatggggg ag 11272 // ID DNA2-3_TCa repbase; DNA; INV; 1361 BP. XX AC . XX DT 22-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-3_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1361 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 664-664 (2009). XX DR [1] (Consensus) XX CC TSD is TA. Based on that, it is classified as putative CC non-autonomous Tc1/Mariner. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1361 BP; 479 A; 194 C; 199 G; 481 T; 8 other; cagggtggtc cagctcagct cccgtcttct gtagctcagt tattagtaaa gttggacttt 60 tgatattttg tagacgttat ttatactagg acacgtttca ggaaaatatt tttgattata 120 cagagtgtcc caaaaaagta tgacgtcata ataataattt tttaatggca tgctattatt 180 tttttgtgct cattaggatg cctttttagc ttcttacatg tctctaattt tttttaaatc 240 ggttcaggga ttactttaaa aaaaatcggt tttacacctt tagttttaaa aattaataaa 300 aaatagaaac agtgcttttt tgatgacatt attastacgt acgtcttgaa caaggtctaa 360 cagacttatt aattaagttt gattgatctg aattactaca gggtgtttac aatttatttt 420 caaacttytt aaattttaga gccttttatt ttgcttattt cgaatggcaa ctcctgttta 480 tttttacatc attcgatgca gaattaaaaa acaacatttt ttattattac aaccytaacg 540 taaatcctaa cacattcgga gttatttgcc aaaaacytta accatatttg attaacawaa 600 cgagttttta gcaaataact ccgaatgtgt taggatttgc aataaggttg taataacaaa 660 aaattgtatt ttttwattct gcatcgattg atataataat aaacagagtt gaaactatct 720 tcgactttta ttcaataaat ttgtcattaa aatacttgac aaaacaagtt tttagcaaat 780 aactccgaat gtgttaggat ttacgttagg gttgtaataa gaaaaaaakg tcgttttcta 840 attccgcatc gaatggtata aaaataaaca ggggttgcca tttgaaataa gcaaaataaa 900 aggctctaaa gtttaaaaag tttggaaata aattgtaaac accctgtagt aattcagatc 960 aatcaaactt aattaataag tctgttagac cttgttcaac acgtacctag taataatgtc 1020 gtcaaaaaag cactgtttct attttttatt aatttttaaa actaaaggtg tataaccgat 1080 ttttttggtt tttatttttt taacagtaat ccctgaaccg atttgaaaaa aaattaggga 1140 catgtaagaa gctaaaaagg atcctaatga gcacaaaaaa taatagcatg ccattaaaaa 1200 aatattatta tgacgtcata cttttttggg acactctgta taatcaaaaa tattttcctg 1260 aaatacgtcc tagagtataa ataacgtcta caaaatatca aaagtccaac tttamtaata 1320 actgagctac agaagacggg agcttgagct ggaccaccct g 1361 // ID hAT-58_HM repbase; DNA; INV; 3726 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-58_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3726 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2046-2046 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 801..3380 FT /product="hAT-58_HM_1p" FT /translation="MESTHNKEKNVFKTHNNKKPSGAQYRKRKAEKVDNLK FT KNYKKLENFFTSENSDCNKIKEGENVAHVLSDTSTSDEEDGIVSVGNVHYD FT EKDIDNENDDGQDGDVNDRTKETAIDYSDCGLWPIHRNIQFVDYVINIGAI FT QVNLDTYPRDNRRRHFSNAYYNRKLSNGEVISRRWLVYSVSKDAVFCFCCK FT LFDFNMNLKLNGNGFNKWKNLTEALKIHENSKSHRIAYQLWIETEIRMKAG FT ETIDKQEQKLIEKDSLRWRSVLERLMNITLYLATNNMAFRGSSDKLYAVNN FT GKFLGLVQLLAKFDPIMLNHVTLALKGDISDHYCGKTIQNEMIDIMASKVT FT NIIISKALKSTYYSIIADCTPDVSHKEQLSLTIRIVNISEYPIKINEHFLG FT FFNVNDTTGLGLTEIIIGALKDFGLNISFCRGQGYDNGANMKGKKIGLQKR FT ILDLNPLAFYLPCGSHSLNLVICDAAQSSLNSINVFGIIQRLFTLFSASTS FT RWNVLLSHTTNFTLKRLCDTRWEAKIESLKAIRYQISSVHDALITIYETET FT KTPDIAHEAQTLAEQLKDYSFLVSLIAWYNVLFQINVVSKAMQAKDMDLVQ FT CAEMLKKCITFLENYRTFGFKQATVDAKELAEELNIDAIFKPLKRVRRVKR FT HYDENAFDEPIQNPEKKLEVTFFNPLLDTCLSSINERFEQLNEYVNIWSFL FT FNLDNLPIKNELLKLCKQLQEKLTVNTKSEIDGALLCDELISVQFFIKDKL FT SNVENEINTKEMTPLFVLNLIKKHCLQELYSNTWIVLRILLTIPVTVASGE FT RSFSKLKLIKTYLRNTMLQDRLNSLSMLSIEQDIAENLDFSSLIRDFADKK FT ARKVRF*" XX SQ Sequence 3726 BP; 1355 A; 551 C; 624 G; 1196 T; 0 other; cagtgccggc cttagggggg ggcgaggttg ggcgccgccc aagggcgcca ggaaactggg 60 gcgccaaacg ttgtatgcaa actttaacaa tataataaat agaaaaaata agtcaaggga 120 tcgtccataa attacgtaac gcaaagtttc acaaaaaaca aaacaaaaaa agatcaaaag 180 ttttccctcg cacagtctta atttaaataa acaatgaaaa taggaaatta agtcaaatga 240 aaatcgccgc gtaaaagagc ctaatatctc ctactcctgt tcttaaaata tattttgcgt 300 tgcgtagttt atggacgatc cccaacctaa attttttcga tttcgatttt cgattttttg 360 ttattttttc gaatgtttgt tttttttata aaaataattt ctttaaataa tttatttatt 420 gtattattac actgttaaaa attgtttatg caaaattaat aatttgatta aaagtgtttc 480 aatcatagca tatattttgt tacttaaaat cattttttta attgtatctt tctcggcgcc 540 aattattttg cgttccattt atttaaagtt taaattttaa acattatttt cggtacgagt 600 tgcgagcgcg tcgtttatta atttgtagaa gaaatttaaa gttgtatttg tgattcaaat 660 tcagtttttg tcaagttttc aaattttgcg gtaataaaaa gcaggtaagt ataaataatg 720 aaattgttta tgaattatta gcttgcgagt aattttttat tttaaaattt tagacaaata 780 gaaactgtca gcagcaatca atggagagta ctcataataa agaaaaaaat gtttttaaaa 840 ctcataataa taagaaacca tctggtgctc aatacagaaa acgtaaagca gaaaaagttg 900 ataatttgaa gaaaaattat aaaaaacttg aaaatttttt tacatcggaa aatagtgatt 960 gtaataaaat aaaggaagga gaaaatgttg cgcatgtact tagtgataca tcaaccagcg 1020 atgaagaaga cggaatagtg tcagttggaa atgttcacta cgatgaaaaa gatatcgata 1080 acgaaaatga tgatggccaa gatggtgatg taaatgatcg caccaaagaa acagcaatcg 1140 attactcaga ctgtggtttg tggccaattc atcgcaacat tcaatttgtt gattatgtaa 1200 ttaatatagg tgcaatacaa gtgaatttgg acacatatcc acgcgataat agaagacgac 1260 atttttcaaa tgcttattac aaccgaaaac tttcaaatgg tgaggttatt tcacgacgtt 1320 ggcttgtgta ctcagtttct aaagatgcgg tattttgttt ttgctgcaaa ctttttgatt 1380 tcaacatgaa tttaaagtta aacggaaatg gattcaataa atggaagaat ttaacagaag 1440 cattaaaaat tcacgaaaat agtaagtcac acagaattgc ttatcaacta tggattgaga 1500 cagaaatacg aatgaaagct ggtgaaacta ttgataaaca agaacaaaaa ctaatcgaaa 1560 aggacagttt aaggtggcga agtgttcttg aacgattgat gaacattact ttatacttgg 1620 caacaaataa tatggccttc agaggttctt ctgataaatt gtatgcagtt aataacggca 1680 aatttttagg attggtacaa ttactcgcta aatttgatcc tatcatgtta aaccacgtca 1740 cgctagctct taaaggagat atttctgatc attactgtgg aaaaaccatt caaaatgaaa 1800 tgatagatat aatggcatca aaagtaacca acataattat atccaaagct ctaaaaagta 1860 catattattc tattattgct gactgcactc ctgacgtatc tcacaaagaa caactctcgc 1920 ttacgattag aattgtaaat atatcagaat atcctataaa aataaatgaa cactttctag 1980 ggtttttcaa tgttaacgac acgacaggtc ttggtttaac agaaatcata attggagctt 2040 taaaagattt tggactgaat ataagctttt gtcgagggca aggttacgat aacggtgcaa 2100 atatgaaagg taaaaaaata ggtctacaaa agcgaatatt agatttgaat cctttggctt 2160 tttaccttcc atgtggaagt cattctctca atttggttat ttgtgacgca gcgcagtctt 2220 cactaaattc cattaacgtt tttggcataa tacaaagatt attcacatta ttttctgcat 2280 caacttcacg ctggaatgtt ttacttagtc atacaacaaa cttcactctt aaacgattat 2340 gcgacactcg ttgggaagcc aaaattgaga gcctcaaagc catccgatac caaataagca 2400 gcgtacatga tgctttaatt actatatatg aaactgaaac caaaactccc gatattgcac 2460 acgaagcgca aacattagca gaacaattaa aagactatag ttttttagtt tctttaattg 2520 catggtacaa tgtactattc caaataaacg ttgttagtaa agcaatgcaa gccaaagaca 2580 tggatctagt acagtgcgct gaaatgttga aaaaatgcat cacatttttg gaaaattaca 2640 gaacgtttgg ctttaaacaa gcaacggttg atgccaaaga gttagctgaa gagttgaaca 2700 tcgatgcaat atttaagccg cttaaaagag tgcgacgagt gaaacgtcat tatgatgaaa 2760 atgcttttga cgaaccaatt caaaatcccg agaagaaact tgaagtgaca ttttttaacc 2820 ctttattaga tacatgtctg tcgtcaataa atgaaagatt tgaacaactt aatgaatatg 2880 taaatatatg gagtttttta tttaatttgg ataatttgcc catcaagaat gaacttttaa 2940 aattgtgtaa acaattacaa gaaaaattaa ctgtaaatac aaaatcagaa atcgatggtg 3000 ctttgctatg cgacgaacta ataagtgtgc agttttttat aaaagataaa ttaagcaatg 3060 ttgaaaatga aataaataca aaagagatga caccactttt tgtactgaat cttattaaaa 3120 aacattgtct acaagagtta tattcaaata catggattgt tctgcgcatt cttcttacaa 3180 tacctgttac tgttgctagt ggagaacgta gtttttctaa actgaagtta atcaaaacat 3240 atttaagaaa taccatgttg caagatcgac ttaattctct atctatgtta tcaatcgagc 3300 aagatatagc tgaaaattta gatttttcta gtttgataag agatttcgcg gacaaaaaag 3360 ctagaaaagt tagattttaa aatttaataa ctcttgttta tatagttaga ggatttatta 3420 tatgtttaat aaaaacattt gttattttta aagcagcgtt gcgttgtcta tttatttagt 3480 cactcatttc tataaaccca tatcatgaaa agtctaagtt tataatttta ataaaaaatt 3540 aattaaataa gaaacttgat gcacaatatg gatctctttg cattaaacta tatattccct 3600 ttagcactct tgttcatcat tttagagcat tggtctcgtt gttaatggta acaacctacc 3660 catatatacc taggggcgcg ttcacaagtt ttgcccaacc tgtcaaaaaa gctaaggccg 3720 gcgctg 3726 // ID TE-1_AAe repbase; DNA; INV; 1263 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE A putative non-autonomous transposon family from Aedes aegypti. XX KW Transposable Element; nonautonomous; TE-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1263 RA Kojima K.K. and Jurka J.; RT "Putative transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1439-1439 (2011). XX DR [2] (Consensus) XX CC ~96% identical to consensus. Both termini are TATATGA. Many CC elements of this family contain an insertion ~94% identical to CC DNA-TA-8_AAe at the TA dinucleotide (686-687), and some copies CC do not have the insertion. This insertion is excluded from the CC consensus. XX SQ Sequence 1263 BP; 401 A; 237 C; 223 G; 402 T; 0 other; tatatgatag cttacccgac gtggctagcc acgttccaac gaaatcttaa gggtgtgagc 60 cgtttcaccg attcgtttaa cttttattta aaatttgaca gctcgatagt tatttcccct 120 ccggttgaaa gtaaccctgc tctgaagcat gattaaactt gacagttcga aagttatttt 180 ccgctcccga gaatttgaga ataactaacc ctctgtcaac tttgttcaga aataactact 240 cgactgtcaa agctcaaatg ggttgaccgc gaagttcaat ttaaccattt gagggaaaat 300 aactatcgaa atgtcaaatt ttaactaata gttaaacgaa ttggtgaaac ggttcacagc 360 ctaaagcgaa aagctaccta taatctttca aatcgcatga tactgatgac ctcaaaatcg 420 cctaaattta tgctcaggca atactccatc gagcacgtgg cctcaatact gctgtagtgt 480 tctaatctga aataattata gccttatatg ctgttgtagt gtttcttgtg ttgtgaaata 540 tttcaaaagt tccagaagat tctatgaatt ggagagtaga atcgtaccaa aaaatctagc 600 atatcctccc aaatagctgc tgtagtgtag tatgattcat tgtcattttt caaaattaca 660 gttgtgaaaa ctgtcgactc agagaaatac catgtcatgg aacaggaatg ttttaagcaa 720 tcatcatagt aagcataaaa ttttgtatta agcttgcatc catgagtcga tatcgagtca 780 ttgagcatcg actcttggag gtttcaatgt aaattaataa aactcgaata tttggcgcac 840 attcatctca tttatgttta tcctgctttt gcatatgtcg aactcgtttt tgaacctgag 900 atggaactgg agaaaccgtt tcacattttc aactcacacc cgattcaagt tcgactgaac 960 tgcaatttgc ttgaagttga tttattgacg ttttgatcac caaccaaatt tcaaatcaga 1020 cgacaatgat gaaatgttcg agaatattct acgaattgga atgaacaaag ttccagaaaa 1080 ctccagaata ttctgccaaa gagctattgt agtactccac aatttgcaat tgtaatgttc 1140 taatcttaaa atgtactgtt gtagtgctct taaattgctg ttgtagtgct caacgaaact 1200 tttcgatgaa acatttcaaa gcgttactga cagacagaca actctgggat tttatatata 1260 tga 1263 // ID Gypsy-222_AA-LTR repbase; DNA; INV; 1006 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-222_AA_; KW Gypsy-222_AA-I; Gypsy-222_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1006 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1048-1048 (2011). XX DR [2] (Consensus) XX SQ Sequence 1006 BP; 287 A; 230 C; 277 G; 190 T; 22 other; tgtatgacag gwaatgtttg tattagttgt gcaggtttcc ccagacagac ccactaggca 60 tacaastagg gcagtagtat gtcwtatgac cttaaccacg caggctacca cccaccgagc 120 tcccaaaacc caccaccaga tgtcckgkag cataacgttc gggcggccat gaggggtggc 180 gaaatggggg ccatcgttga taagacccga atgaaagggs aatgaccggc samscaggtc 240 aagaacctsg amggcmaaaw wtaaaccacw ggtcgttcac ttgcccggak wcwgctgaag 300 aagcwagtcg twaagcatta ccaaaattta aggaaaagca gtgaaaaagt ttgagaggaa 360 gaagtgaacg agaagtggaa gtcaggtaac cgccgagaga ccacttaatt tctattactg 420 tggtttaata tcgcatctaa tcccatacca ggaccccgcc gttttttggt atagaaacca 480 ccgcatttct attaccgccc tacgacgtcc tagagcccta ttccgggtac tttccggcct 540 cgatattagc gccgaagacc gatccgtcac caaatcggta gcctaggacg cgtaggaggg 600 tcccagcagg ctgcagccga agtcccccca agccaacccg aggagagcgt ggccagacgt 660 ggccgaagca ggagagcgcc gaaggccgat accaggccgw gaggaaggag tggacwggtg 720 gtagctgcga gaagaagggg cccctaacag gtcagattag cacataagaa agtgggagga 780 aagtgtagaa aagaagtgga tagagagagg gttgtagccc aaataaaact gtatctagtg 840 aaggaaaata aagtaggagt tttgaacagt gaagtgctgt gggttttcct aaatttctag 900 tgcgaagatt ccctgtccgc ggtaaattaa ggtaagcgtg ccgaccctga ggcagttgag 960 tcggggagtc tccaagacgc tcccgattgg ctgaccgaaa cttaca 1006 // ID BEL-161_AA-I repbase; DNA; INV; 3133 BP. XX AC AAGE02024744; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-161_AA_; KW BEL-161_AA-LTR; BEL-161_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024744; Positions 49603 46471. XX CC 'GTAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 27..3089 FT /product="BEL-161_AA-I_1p" FT /translation="MPNSTRSTRSRNAKVCCQACNDPDNDQMVSCNICGSR FT WHSACVNYSDISGQDLAFVCPKCQKSPAVVHAAVSSVGCKSLNKAGSKTSS FT ISTRASSRTKKARLQLEQLEAQKALAMKRLELELREQERKLEHQTKEAEVL FT LQHRLEQEKQRKEIEIEQRQLELEQTILEESFRLREIIALDDDEEEDDGKS FT VISEQSTFSKVNKWKTLSSTMVASTGRVHETGLNNDEAATGTRMPYRQKGT FT QHPSSITQGVLETALAGISLGQSYLEPSLGGLIGRTENTILPVVKENPLFV FT SQPANPIGSTQLATNNISQTNILLDPARSSFSQQPTHNCPPAVAHPGGVCQ FT RNEEEMLRRPLPPRTMDGYVPTRTQLEQRLPISFEPQRSFGQQQHSTIRQN FT REQPESSELPADWEGPSARQLAARQVMARDLPTFSGNPEDWPLFISSYNNS FT TRACGFSDVENLARLQRCLKGHALESVRSRLLMPAGVPHVIATLETLYGRP FT ELIIHTLLQKIRNVPSPKQDRLETLIAFGMAVQNLSDHLEAGRQLAHFNNP FT MLLFELVEKLPAHMKLDWSLYKQRFADANLRTFSQYMQTLVRAATDVTMHY FT DPTQNLQRMSKEKLCKEKTFCGAHSKDDTLEGMSREEEDQDMIHPVVPACL FT ICKNPDHRVKDCVEFDRKTIDERWKLTQQLGLCRLCLGAHGKRPCKVRKQC FT DFEGCQRRHHPLLHSCPDPRDVRTNSKPNEVNAHKDGDDQTSCEIMPSGSQ FT AITNHHSTDKKTLFRIIPVTLYGNNRSISVYAFLDDGSEKTLIDEEVIKEL FT GVVGEPQKLCLQWTANVKRAEINSKRVNLEISGNNPDTRHLLTGARTVRNL FT DLPKQSLRFAELAREFPYLKGIPVEGYRDAIPKILIGNDNAHVTSTLKIRD FT GQPGEPIAAKTRLGGTIYGYNQDNLEANTYSFHIREYALEKTIKGAVGLDE FT VEEESSSKGVYVGDTSQLRGPSCDCSNDILITKGKGYCCRVRGNLLKSGNR FT WFGKKVNASVTKPLE" XX SQ Sequence 3133 BP; 990 A; 682 C; 761 G; 700 T; 0 other; atctaaaaga ctgagatcgc atcagcatgc cgaatagcac gagaagtacc cgcagtcgca 60 atgctaaggt gtgttgccaa gcctgcaatg atccggacaa cgatcaaatg gttagctgta 120 atatttgtgg ttcgcgatgg cattctgcct gtgtgaatta cagcgatatt tcgggacagg 180 atctggcgtt cgtttgtccc aagtgccaaa aatcaccagc cgtagtacac gctgcagtta 240 gttcagttgg ctgtaaaagt ttaaataaag ctggaagtaa aactagttct atttcaacac 300 gcgctagctc acgtacgaaa aaagcccgcc tacagcttga gcaactagaa gcccagaaag 360 ctttagcaat gaaacgtctg gaactggagc tccgagagca ggaacgtaag ttagagcatc 420 agaccaagga agctgaagta ttactacagc atagacttga gcaggagaag cagcgcaaag 480 aaatagaaat agaacagcgg cagctagaac tggagcagac catattggaa gaatctttcc 540 gtttgcggga aattattgca ctagatgatg atgaagagga agacgatggc aaaagtgtta 600 tctctgagca aagcacattc agtaaagtga acaaatggaa aaccttaagt tccacaatgg 660 tagcctcaac aggaagggtt cacgaaactg gcctgaataa tgacgaagca gcgactggga 720 ctagaatgcc atacaggcag aaaggcactc agcatccgtc tagcatcaca cagggtgttc 780 tagaaacggc gttagcgggt atttcattag ggcagtcata tctagagcca tctcttggag 840 gactaattgg tcggactgag aacaccatat tgccggttgt taaggaaaat cctctcttcg 900 tttcccagcc tgcaaaccct atcggctcta ctcagttagc cacgaacaat atatctcaaa 960 ccaacattct cttggatcca gctcgttcat cgttcagcca acaaccaacg cataattgcc 1020 ccccggctgt tgcacatcca ggtggtgtct gtcaacgaaa tgaagaagaa atgttgagaa 1080 gaccacttcc accacgaaca atggatgggt acgtaccaac caggacacag ttggaacaga 1140 gacttccaat atcattcgaa ccacagcgat ctttcggtca gcagcaacac tcgacaatcc 1200 ggcaaaaccg agagcagcct gaatcatctg aattgccagc ggactgggag ggaccatccg 1260 cacggcaatt ggccgcaaga caggtgatgg ccagagactt gcctacattt tcgggtaatc 1320 ccgaagactg gccgttgttc ataagctctt acaataactc gacaagagcg tgtggcttct 1380 cagatgtgga gaatttggcg aggctacaac ggtgcttaaa agggcacgct ttagagtcag 1440 tgcgaagccg actgttgatg ccagctggtg taccgcacgt tatcgctact ttggaaacat 1500 tgtatggcag accggaattg attattcaca ccttgttgca gaagattcga aacgtgccat 1560 caccgaaaca agataggctt gagactctca tagcatttgg aatggcagta caaaatctca 1620 gcgaccatct ggaagccgga agacaattag cacatttcaa caacccgatg ctactattcg 1680 agttggtaga gaaactccca gcgcatatga aacttgattg gtcgctatac aagcaacgtt 1740 ttgctgacgc gaatcttcga acgttttcgc aatacatgca aacgttggtg cgagctgcta 1800 cagatgtaac catgcactac gatcctaccc aaaatctaca gcgcatgtcc aaggaaaagc 1860 tttgcaaaga gaagacgttt tgtggtgctc attcgaaaga tgacacacta gaagggatgt 1920 cacgagaaga agaggatcag gatatgattc atccagtagt accggcgtgc ctaatatgca 1980 aaaatccaga ccatcgcgtg aaagactgtg tcgaattcga taggaaaaca atcgacgaac 2040 gctggaagtt gactcagcaa ctgggtttgt gtcgactatg tctcggtgcc cacggcaaac 2100 gtccttgcaa agtacgcaag caatgtgatt ttgagggatg tcaaagacgt caccacccgt 2160 tgttacattc gtgtccagat ccaagagatg taagaaccaa cagcaaaccc aacgaagtaa 2220 atgcccataa ggacggagat gatcaaacat cctgcgaaat aatgcccagt ggttctcaag 2280 caattactaa tcaccatagc acggataaga agacactttt ccgcatcatt cccgtaacac 2340 tatacggaaa taatcgctca atatcagttt acgccttttt agatgatgga tcggaaaaaa 2400 cgttaatcga tgaagaggtg ataaaagagc taggtgtagt aggagagccg caaaagctgt 2460 gtctgcagtg gactgctaat gtgaaaaggg ctgagattaa ctcaaaacgt gtgaatctag 2520 aaatcagtgg aaataatcct gatactagac atcttctgac tggagcacgt actgtgagga 2580 atttggactt gccgaagcaa tcactgagat ttgcagaact cgcaagagag ttcccttatc 2640 tgaaaggtat accagtcgaa ggataccgtg atgcaattcc aaaaatctta ataggaaacg 2700 acaacgcgca cgtgacgtcg acgctaaaga tccgtgacgg ccaacctggg gagccaattg 2760 ccgccaagac acgtttaggg gggacgatat atggatacaa tcaagacaat ttggaggcaa 2820 atacctacag cttccacatt cgagaatacg cactagagaa aacaataaaa ggtgctgtgg 2880 ggttggatga agtagaagaa gaaagtagta gtaaaggagt ttacgtgggt gatacatctc 2940 aactgcgagg accttcttgc gattgtagca acgatatact gatcaccaaa ggaaagggtt 3000 attgttgtag ggttagggga aacctactta aatcaggcaa tagatggttc ggtaagaaag 3060 ttaacgcttc tgtaacaaaa cctttggagt aaagctagat cggacccctt tctagcctta 3120 cgggtcgggg aaa 3133 // ID DNA-AATT-1_CQ repbase; DNA; INV; 324 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA-AATT-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-324 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 48-48 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. TSDs are likely AATT. XX SQ Sequence 324 BP; 112 A; 50 C; 56 G; 106 T; 0 other; gggtcctaaa atgaagctta gattgctgat attattgttt acagcgataa agcttatttt 60 tctgagtaca atgaaccttt gtacgaccac aaagagttta aaatggattt ttaaatcaat 120 tttgaaaaat taacctcgcg gtccttcttg acagaaaagt tcctacttga cagctcgttc 180 caaggggacc atagttgatc catcgaaaaa atgttgtcta gtcaaaaaaa attttgcatt 240 aaaatgaaaa aaagtgatca gaaattgttt ttaatcgtgt tatttaccgt tgtacataaa 300 aattgacata gggctttagt accc 324 // ID Howilli2 repbase; DNA; INV; 2847 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Howilli2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Howilli2. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2847 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 537..2234 FT /product="Howilli2_1p" FT /translation="MKAFSKVGFFATNGKVLKFVSNQTSNLSRHKCCLSLR FT QPAETKKVSNMDKTTATKKCLEWVVQDCRPFSAVNGTGFRSLVEFFVTVGA FT TYGANVDIDDLLPDATTLSRNAENDAEEKRAIVSNDIKEAVENNAVSATID FT MWTDQYVNRNFLGVTLHYLKEFKLNDIILGLKSMDFQKSTGENILKKLQSI FT FAQFNVDNITNIKFVTDRGTNVKKALENNIRLNCSSHLFSNVLEKSFDEAR FT ELKDILYACKKIVKYFKKASLQHRLTTTLKNSCPTRWNSNYNMVSSIVKNW FT TTVNNILSETEGGQKLIAVNISTLKIILVLLEDFERIFKELQTCSSPSLCY FT VLPSIAKIKILCEPNTEDISLISVLKARILVNLNTIWKENLGIWHKAAFFL FT YPPAANMEHEDSLEIKTFCIDQMNNYVSFPNEPSQNNSFTSESNPASPNCQ FT LEVIPKKVTFVPKTTFFFSQLVAQSNNNLKSPLEELDNYCSERVSLTEDFE FT PIEWWKTKENCYPLLSKLALQLLAIPSSSAAAERVFSLAGNIITEKRNRLG FT PKTVDNLLFLHSFFKNNNS" XX SQ Sequence 2847 BP; 961 A; 497 C; 513 G; 876 T; 0 other; cagagaactg caagtgtgtc aatttttgcc accttactca cactcaatga aatttgtgtg 60 ccggtgccac tcatcaccgg ctcagcaaat catgaacaca cacttataca aaattatgtg 120 agtgccacat ctaccacata taccatctgt cgctcacaac aatacaccgt tttctttttg 180 catggcaacc aagtgttttt cggcaacgcg tgccggttgc actcacactt aaagcgaacg 240 tagagaaaaa aacaaaagtg tgaaaaagac aacatgcaat aacgcaagtg gtcaaaatac 300 atgtatatca cccaagtgtt ttataggcac catggtgtgt gtgttgaatt gtgtgctaat 360 tgcttctgaa ctgaatcagt tcttagcttt gattagtgcg tgtttgattt tgccatcgaa 420 aacaatgaga gcagaagacg tgaaaaattt aataaatcgt ggcatttata aagtagcccc 480 aaaacacaaa ggcaagagcg taatttggtc aatattgtgt gacatatata aagaagatga 540 aagcgttctc gaaggttggg ttttttgcaa caaatggaaa agtcttgaaa tttgtttcca 600 accagacatc aaatttgtcg cggcacaaat gttgtctatc attacgacag ccagctgaaa 660 caaaaaaagt ttctaacatg gacaaaacca cagcaacaaa gaagtgtctt gagtgggtgg 720 ttcaggattg tcggcctttt tccgcagtaa acgggactgg ttttcgcagc cttgtggaat 780 tttttgtgac agtcggtgct acttatggag cgaacgttga cattgacgac ctactgccgg 840 atgcaacaac attgagtcgg aatgcggaaa atgatgcaga agagaagcgg gcgatagtat 900 ctaatgatat taaagaagct gtggaaaata atgcagtatc ggctaccatt gacatgtgga 960 ctgaccaata cgtaaacaga aattttttag gtgtcactct gcattattta aaagaattta 1020 aattgaatga catcattttg ggcctaaaat cgatggactt tcaaaaatcg acaggggaaa 1080 atattttaaa aaaattgcaa tccatttttg cacaatttaa tgtcgacaac ataacaaaca 1140 taaaatttgt gactgatagg ggcaccaatg taaaaaaagc cctggaaaac aatattagat 1200 taaattgtag cagtcattta ttttcaaatg tgttagaaaa atcgtttgac gaagcaagag 1260 agcttaagga catattgtac gcctgcaaaa aaattgttaa atattttaaa aaagctagtt 1320 tgcagcacag attaactaca acgttaaaga actcatgtcc tactcgatgg aactccaatt 1380 acaacatggt ttcttcgata gtgaaaaact ggacaacagt aaacaatata ttaagtgaga 1440 cagagggagg tcaaaaactt atagctgtaa atatatctac gttaaaaata atactagtac 1500 ttttagaaga ttttgagaga atctttaaag aattacaaac atgtagctct ccttcattgt 1560 gctatgtttt gccctcaata gcgaaaataa agattctttg cgagccaaat actgaagata 1620 tctcgttaat atctgttctc aaagcaagaa ttttagttaa tcttaacacg atttggaagg 1680 aaaatttagg catttggcac aaggcggctt ttttcttata tcctccagca gcaaacatgg 1740 aacatgagga ttcattagaa attaaaacct tctgcattga tcaaatgaac aattatgttt 1800 cttttccaaa tgaaccttcg caaaataatt catttacatc agaatcaaac ccagcaagtc 1860 caaattgtca acttgaagta attcctaaaa aggttacgtt tgtgccaaaa accacttttt 1920 ttttctccca actggttgca cagtcgaata acaatttgaa atcgccccta gaggaattag 1980 ataattattg tagtgaaaga gtttctttaa ctgaggattt cgaacccatt gagtggtgga 2040 aaaccaaaga aaattgttat ccattactat ccaagttagc attgcaactt cttgcaatac 2100 catccagtag tgctgcagct gaaagagtat tttctttagc gggaaacatt ataacagaaa 2160 aaagaaatag attgggtccg aaaacggtag acaatttgct ctttttgcac tcttttttta 2220 aaaataataa ttcgtaattt aattcctatt ataacttcat tgaaggaaaa aatttttttt 2280 ttatttgctt tgaagcgata taatgtttat atttccctta tgttaatttt gttaattata 2340 ttaaatgaat gcactgaagc aagttaatct ttatgtaata tattaaatga ataaaataag 2400 ttatttttga gtttttgtta attatattaa atgaatgcac tgaagcaagt taatctttat 2460 gtaatatatt aaatgaataa aataagttat ttttgagttt aatattttct tttattgttt 2520 actattttcc tgttgtacgc actaatgcag ctacttgcac aaaagacacc gcaacatcgg 2580 cgacactcta cacattctat gactcaattt ttgtgtatgt cttttcaccc atcttttttt 2640 tcgaggctca agcgccggca catgctgtac acacacatgc gaaaaaatac ctactcactt 2700 ttgcagcttg tgtgcacaaa cactcaagta ttttcgtgag gcgagcattg tacacatttg 2760 tgcggtggtt gaaaacaccc gggtgtcaaa atcacccaac atttacccgg tccaccttga 2820 gtaccggtgg cttgttgcag ttctctg 2847 // ID Sola2-1_BM repbase; DNA; INV; 3705 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola2-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1113..3308 FT /product="Sola2-1_BM_1p" FT /translation="MVGRSKYSLICSSYMMPRSSRSRLQLSEEQKKKRRRE FT QKKLSMRRARSKLDSVALEERRKKDRERYHRKKEEGLIKTIKDLKPRDQRQ FT LRKMWREKAKLRREKEKIKRTTEQMLHENTPPSSPQPSSSTSSFSRIRSGK FT AVGARNKRRLKAKNEYLINRLIVLERKLAKYRMRLHRIKKGQKRGNTVLVK FT RKIHDFFIDDEHSRLTAGKKETITRRKVKKQIRLLNDTILNLHKIFNNKTG FT LNISYETFRRHRPFWVIFPKTTSRNTCLCRQHANNDYIASALHQAKIISFS FT NATDVARSLCCDNILRVSCLERTCTLCFEKTLDYTVINGNDTILYQRWVTK FT KVPQIIKGNEKLCQKTLKECVRTSHQLLVNKFNISLSTFMQHLANIMNQYK FT AIRYIKQNLSPSQSLLHIDFSEKYSCKYGSEVQSAHFGGSKSQLSLHTCVY FT YSVDSQPPTNLIKTTSICTVSENLRHDPVLICAHLKPVIEKIKLITPDLTE FT LHILSDGPATQYRNKTMFHMLANYVSKISNVETIVWHFSEAGHGKSAPDGV FT GGCVKRTCDKAVANGQDISGIDSFVDCVKGTCKGIDIIRINNDVSDIQKIA FT DANKVRPFKGTLKIHQITWSSKSPNIIHCRRLSCLLCAPHIRCTHFKIGQI FT QIEYISRDTKESSPTISSSILSRTGTPRPVSSIDTASASTVTGPRTPSPLE FT SFLNTPSPELTTRIVRQPLTPRKQNIVYTDSG" XX SQ Sequence 3705 BP; 1287 A; 671 C; 623 G; 1124 T; 0 other; gggtcataca catacataag ttcagggtgt atgttgttta aaaagaatgg attttattca 60 aattaaatta tatcgggaca tgaattacag ttaccatttt aaatacaaat gataatcaag 120 caaaaaatta ttaaattacc tgtgtaactt attttttagt ttcctattat ccctttctgt 180 ggactgttcg caaaggcagt tgctacattt cgatatcatt tattgttgtt tggtgagagt 240 tatcgtataa taaaatgcaa atatatcttt agtatatatg tcaaatattt aaaaatatgc 300 atataattat tataaataga ttaatttata ataaaatttt aaataaacat ttagttacac 360 gcaaccgcat cacttctgtg gacttaaaat ttcgccaaaa aaatcaatat ctactaaaag 420 tactactcac cacatttttt ttatactggc cacattataa agctagcatt ggccattttg 480 gatacagcta ctgagcgttg gcgatcgttc ggaatatggt ttataaattc agtggtgatt 540 tcgtaaaaga tcggtagaac ttttctgttc ctgatatatt tcggtaagta attattgatt 600 tcaacaatta tcacgtagcc gtatggccgt tcatggagta atttcggaaa tgtttttgtc 660 ttattttcta accttaaaaa ctaaaagagc attgtgattt ttttattgaa tgtcctttct 720 gtggacccct tttatgtgga cttgaatgtc cacgcgaaag acaatgtctg cagaaaatat 780 cacattattg atttcaaaac taaaacactg tcttacttat ctcttaacta gatgacgttg 840 actcctttgg tcgtgcatgc tattaaatct acataacatg aattttaaaa gaggttgtct 900 tgtattttgg ccaatgccaa cacgaaagaa agtgtccaca gaaaatcgtt cttaagtttt 960 ccaactgatt cagcagcttg tttgtaaacg taaacaatgt gcattgtgta tcatcggtag 1020 ctgatttata tcccctgcta gacctctgct ccaagacacg cttcttgcgt ataaactaaa 1080 agggtttggc ttatttgtat ctattgcatg gtatggtagg tagaagtaaa tatagtttaa 1140 tttgttccag ctacatgatg ccacgttcaa gtagatctag gttacaactg agtgaagaac 1200 aaaagaaaaa acgtaggcgt gagcaaaaaa aactaagtat gcgcagagct agatcaaagt 1260 tagattctgt tgctcttgaa gaaagacgca aaaaggaccg agaaaggtat catcgtaaaa 1320 aagaagaggg gctgattaag accattaaag acttgaaacc ccgagaccag cgacaattac 1380 ggaaaatgtg gcgagagaaa gcaaaactta gacgtgagaa agagaaaata aaaagaacga 1440 ctgagcaaat gttacacgaa aatacacccc cttcaagtcc acaaccatca tcttcaacat 1500 catctttttc acgaattcgg tcaggaaaag cagtaggtgc acggaataaa cgaagattga 1560 aggctaaaaa tgaatattta ataaatagat taattgtttt agaacgaaag ctagctaaat 1620 acagaatgcg tctacacaga attaaaaaag gacagaaacg aggtaacact gtactagtca 1680 aacgaaaaat acatgacttt tttattgacg atgagcatag cagactcaca gcaggcaaaa 1740 aagagactat cacacgacgt aaagtaaaaa agcaaatacg tctgttgaat gacacaattc 1800 tcaatctaca caagatattt aacaataaga cgggtttaaa catatcatat gagacatttc 1860 gtagacatcg tcctttttgg gtgatatttc ctaaaacaac ttctagaaat acctgtttgt 1920 gtcgtcaaca tgccaataac gattacattg ctagcgctct acatcaagct aagataatat 1980 cattttctaa cgccactgat gttgccaggt cgctttgctg tgataacatt ctgagggttt 2040 cttgtcttga acgaacatgt accctatgtt ttgaaaaaac tttggactat acggtcatca 2100 atgggaatga cacgattctt taccaaagat gggtcactaa gaaagtgccc caaataatta 2160 aaggcaatga aaaattatgt caaaaaacac tcaaagaatg cgtcagaact tctcatcaat 2220 tacttgttaa taaatttaat ataagtttgt cgacatttat gcaacatctt gcaaatataa 2280 tgaaccaata caaggctatt cgatatatta aacaaaactt atcgccatca caaagtttac 2340 tgcatattga tttttcagaa aaatattcct gcaaatatgg atcagaagtt cagtcagccc 2400 actttggtgg atcaaaatca caactctcgt tacacacttg cgtttattat tctgttgatt 2460 ctcagccacc aacaaacctt ataaaaacaa catccatttg cactgtctcg gagaacttac 2520 gacacgatcc tgtactcatt tgcgcccact tgaaacccgt aattgaaaaa attaaattga 2580 ttacaccaga tttgacagag ctgcatatat taagtgatgg acccgctact caatatcgta 2640 ataaaacaat gttccatatg ttagcaaact atgtgagcaa aatttcaaat gtagaaacta 2700 ttgtgtggca tttcagtgag gctggacatg gcaaaagtgc tcctgatggc gtaggtggct 2760 gtgtaaaacg cacatgtgac aaggctgttg caaatggcca agacatatca ggtattgaca 2820 gttttgttga ttgcgtaaaa ggcacctgta aaggcattga cattattcgt attaataatg 2880 atgtatctga tatacaaaaa attgccgatg ctaacaaagt acgccccttc aaagggacct 2940 taaaaattca ccagatcaca tggagctcaa aatctcctaa cattattcac tgcaggcgac 3000 taagttgctt attgtgtgca cctcacatta ggtgtacaca ttttaaaatt gggcaaattc 3060 agattgaata tatttcacga gatacaaaag aatcttcgcc tacgatatca tcatctattt 3120 tatcaaggac gggtaccccc agaccagtat cttctattga tactgcatca gcgtcaaccg 3180 tcactggacc aagaacacca tctccgctcg aatcgttttt aaacacaccg tctcctgaat 3240 taacaacgag aatagtccgt caaccgctaa ccccacggaa gcaaaatata gtatatactg 3300 actcaggatg actcagcaac ccccaaaaaa tccagattcc atacattctt tgataccagc 3360 gacacggagg actcaattcc atccccaaaa aaagttaagt gtcagtcgct ttttgattat 3420 agcgatgaag aaaaatttta aagtgttttt tattttaata ttaataaagt actgtttata 3480 cataagtata ttttttcaat tttgattctt ttctgtacga gtctgtccct tctgtggtca 3540 agaaaaaaat taaaaaccct ctatttcagc aaaacctgaa aaacgatctc gttttatatt 3600 ataatttaga tactatgatt aataactaat aaatgaaact gattttgtaa aaatacgaac 3660 gatgtttttt tttccaaaaa ttttaccttg ttttgtgtat gaccc 3705 // ID CR1-118_AAe repbase; DNA; INV; 4803 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-118_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4803 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1206-1206 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 425..1288 FT /product="CR1-118_AAe_1p" FT /note="PHD zinc finger at the N-terminus." FT /translation="MACPRCQKVTSDNDYIICRGYCGASFHTFCAKVDQPA FT LETLKQYDQNIFWMCDDCASLFSNGHFRNLASRCGVDHASPNTDAINSLKD FT DLEKLNQAVKSLTAKVDSQPKTPAGSLKFQYNDVLKTPFSSKRRRISESST FT PQVVTRPVNNRGTSKTLCESVKTVQLKDDLFWIYLSAFDPSTTDDEIVEFV FT RKCLNISIEQPIPKVVKLVSKVRDVSTMRFVSFKVGVAKSLSEIALCADSW FT PENIYFREFENRPKNVPPIVKINMNPNPPAKDVETTSAVPLDQQSQQ" FT CDS 1153..4722 FT /product="CR1-118_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="EYLLQGIRKSSKKRSTDSKDQHESQSACERCRNHFGC FT SVGSAKSTMTTFYASSAAVTPSTPGRIACSTEGDPYPPNPVVPVADCSCHH FT SRPGPVVLVCDGISRQHPSGKYTCNRTVALPNYTSCSSEQQYFASLLSSES FT QSSSHTSSPGRTVYSTLEAPESLDPVVNAAVPCHHRRPGPVVGDDEGVFQP FT VTTGKYWSSCKFLSPDISSSCSSREPALDRRLAPAPASRDLSAKLRIYYQN FT VRGLRTKIDDVFLSTSDLDYDAYVFTETWLDDRIQSRQLFNNNYMVFRVDR FT SSTSSSRRRGGGVLIGVHKKFGSSQVEIATEVRVEQLWVKVSIRATDIFIG FT VLYIPPDKSQDRTFVQYHLDSVSNICTLQNGSSSMILFGDFNQPRLFWKTN FT NRYAFVDPLQSHVSSSSQILLDGMAFHGLRQRNLNRNCSERILDLVFSTDS FT IMTNGFASLAIEKVVPLDRYHPALEFAVAALPPVSFYEDIDLSERDFRRGD FT FNVLDQMLLQADWSGVYQSNDVDGAVASFNAILHNCIMQTVPLKRPPRSRF FT GQYIPSFLKRRRIKLLQQYCRRRCMNTKRLFLEASRIYRGYNRFLYKLYVK FT RQQEDLRQKPKNFWNFVNTKRKENGLPVTMHLGDIEASTAEEKCTLFAQQF FT SSVFANRVATTAEINEAVRAVPRDMVDLDVFHINEGMVDVAFKKLKMSYRP FT GPDGIPSCIFKKCSTSLIAPLVAIFNKSMHLQQFPTAWKTSHMLPVFKKGD FT KTDIVNYRGITSLSAGSKCLEIIVNNVMFTSCCSYLSEKQHGFYPKRSVES FT NLCDFTSTCICAMDNGVQVDAVYTDIKAAFDTVNHDILLAKLLRLGVSARM FT CQWLKSYLSYRNLCVKIGPVVSNPFHPACGVPQGSNLGPLLFSLFFNDVTI FT LLANGGLLIYADDLKLFLVVRTEADCRELQDLLDTFARWCCLNFLIISVPK FT CCVISFRRSKSPILYDYSINGQKLERVDKVKDLGVLLDCQLTFKLHYSAIV FT DKANRQLGFVFKMAKDFDDPLCLRALYCSLVRSILEFASTVWSPYEAVWIA FT RIEAVQRKFVRRALINLPWRHPEMLPPYEDRCALLGIEPLRVRRRIDRAVY FT GAKIIRSEVDCAALLERIRLYAPERVLRSRQVIQVESRNTNYGANDPVNSI FT SRCFVEGYELFDFNSSVESFRQRLYRSRLFN" XX SQ Sequence 4803 BP; 1289 A; 1033 C; 1028 G; 1451 T; 2 other; atggaactga aatgctatga aaatctattc atggagacta tcgtattgac cataacagtt 60 gatggmaccw ctgttgatgg ttatatcgaa gcttaacgac agctcttctg ttttttgata 120 ttttacctat gaaatcggtt tttattattg tgaccaatag ttctatgttt taatgaatat 180 tgtgctcaag tttttatgtt gttgatatga actatagtgt gttaattgta gtgattttgc 240 gctgtgatag tgattgaagc attattgttg ttaccgttta caccgtcccc gcatcgtcac 300 cagaagttgt aactgacagc aaatagctga ttccatcgct ttcttgcaag tttgctttca 360 acacatccat acggtctgtg acataatcgt gcctacatct cggcgtttcc tgctcattgc 420 aattatggct tgtcctcgtt gccaaaaagt tacttcggat aacgattaca tcatttgtcg 480 tggttattgc ggagcttctt tccatacctt ctgcgcaaag gtggatcaac cagcgcttga 540 aacgttgaag caatatgatc agaatatttt ttggatgtgt gacgactgcg ctagcttatt 600 ctctaacgga cattttcgta atctggcttc gcgttgtggt gttgaccatg catctcccaa 660 taccgatgca atcaattcat tgaaagatga tctggaaaaa cttaatcaag ctgtgaaatc 720 tcttacggcc aaagtcgatt ctcagcctaa gactccagct ggttctttga aattccagta 780 caatgacgtc cttaaaacac cgttttcatc gaaacgccga cgtatctctg agagtagtac 840 gccccaggta gtcactagac ctgttaataa tcgtggcacc agcaaaacgt tatgcgaatc 900 agttaaaact gttcaattaa aagacgactt attttggatc tacctttcgg catttgaccc 960 gagtaccact gacgacgaaa ttgtggagtt tgttcggaaa tgtctcaaca tcagcattga 1020 acaaccaatt ccaaaggtgg tgaagctggt gtctaaagtc agggacgtat ccactatgcg 1080 tttcgtctct tttaaagtgg gagttgctaa atccctaagt gaaatcgcac tttgtgccga 1140 ttcgtggcct gagaatattt acttcaggga attcgaaaat cgtccaaaaa acgttccacc 1200 gatagtaaag atcaacatga atcccaatcc gcctgcgaaa gatgtcgaaa ccacttcggc 1260 tgttccgttg gatcagcaaa gtcaacaatg acaacttttt atgcatcctc tgccgccgtt 1320 acaccatcta ccccaggacg cattgcctgc agtactgagg gagatcctta tccgccaaac 1380 ccagtcgtgc ctgtagctga ttgttcctgt catcatagtc gtcctggtcc tgtggtcttg 1440 gtttgtgatg ggatctcccg acaacatccc tcaggcaagt atacgtgcaa tagaacagtg 1500 gccctgccta attatacctc atgttccagc gaacagcaat attttgcttc acttctctcg 1560 agcgaatcgc aatctagcag ccatacgtca tcaccaggac gcacagtgta cagtactttg 1620 gaagccccgg agtccctcga cccagtcgtg aatgcagccg ttccatgcca tcatagacgt 1680 cctggccctg tggtcgggga tgacgaaggg gtcttccaac cggtgacaac gggcaagtat 1740 tggtcttcct gcaagtttct ttcgcctgat atatcttcga gttgcagttc acgtgaaccc 1800 gctttggaca gacgattggc gccggctcca gcttcaaggg acttgtcagc taaactacgt 1860 atttattatc agaatgttcg gggcctaagg acaaaaattg atgacgtttt cctttctaca 1920 tctgatctgg attatgacgc ttacgttttc accgagacat ggttggacga tcgtattcaa 1980 tctcgtcaat tgtttaacaa caactacatg gtattccgtg ttgatcgctc ttctacgagt 2040 agctcccgtc gtcgtggtgg cggcgttctt atcggcgtac acaaaaagtt tggatcatcg 2100 caggttgaaa ttgctacaga agttagggtt gaacaattgt gggttaaagt ttccattcga 2160 gcaactgata tctttattgg cgttttgtac attcctcctg ataaaagcca agacaggacg 2220 tttgtacagt atcatcttga ttctgtgtcc aatatttgca cattacaaaa tggttccagt 2280 tctatgatcc tgttcggcga tttcaatcaa cctcgcctat tctggaaaac taacaaccgc 2340 tatgcttttg tcgatccact tcaatcacac gtatcttcgt cgagccaaat ccttcttgac 2400 ggtatggctt ttcatggatt acgacaacgg aatttgaatc gcaactgcag tgagcgtatt 2460 ttggatctag tattcagcac tgactcgatt atgaccaatg gatttgcctc gttggcaatc 2520 gaaaaggttg ttccacttga tcgctatcac cctgccctcg agttcgctgt agctgcgctg 2580 ccaccagtat cattttatga agatatcgat ctatctgaac gcgattttcg tcgaggtgat 2640 tttaatgtat tggatcaaat gcttttgcag gcggattggt ccggagtgta tcaaagtaat 2700 gatgtggatg gtgctgttgc ttcgttcaat gctatccttc ataattgcat catgcaaact 2760 gttccgctga agagaccacc aagaagccgg tttggtcaat acatacctag ctttctgaaa 2820 cgccgtcgca ttaaactgct acaacagtac tgtcgccgta gatgtatgaa cacaaagcgt 2880 ctatttctcg aagctagtcg catttaccgt gggtataacc gtttcctgta taaactttat 2940 gttaaaagac agcaagaaga tctacgacag aaaccgaaaa acttttggaa ttttgtcaac 3000 acgaaacgaa aagaaaacgg actacctgta actatgcatt taggtgatat cgaagccagc 3060 actgctgaag aaaagtgtac gctgtttgcc caacaatttt ccagtgtttt tgccaatcga 3120 gttgcaacga cagcggaaat caatgaagca gttcgagccg taccacgaga tatggttgat 3180 cttgacgtgt ttcatatcaa tgaaggtatg gttgatgttg catttaagaa actcaagatg 3240 tcttatcgtc ctggaccaga tggtatccca tcatgtatat tcaaaaaatg tagtacatcg 3300 ttaatcgcac cgctggttgc catctttaac aaatcgatgc atcttcagca gttcccaacc 3360 gcttggaaaa cgtcgcacat gttgccagtg ttcaaaaagg gtgataaaac ggatatagtt 3420 aactaccgcg ggataacttc cctttcagct ggttctaaat gcttagaaat catcgtgaac 3480 aatgtcatgt ttacttcatg ttgttcctac ctcagtgaaa aacagcacgg cttctaccca 3540 aagcgttcag ttgaaagtaa tctatgcgat ttcacgtcga catgcatttg tgctatggac 3600 aacggtgtgc aagttgacgc tgtatacaca gacatcaagg ctgcttttga caccgttaac 3660 cacgatatcc ttctggcaaa actacttcgg cttggtgtat cagcaagaat gtgtcaatgg 3720 ctcaaatcgt atctcagcta tagaaatctt tgcgtgaaaa ttggaccagt tgtatcaaat 3780 ccctttcatc cagcatgcgg tgtaccacag gggagtaatt tagggcccct attattttcg 3840 ttatttttca acgatgtgac tattttgctc gctaatggtg gattgctgat ttacgcagac 3900 gacctgaagc tatttttagt agtgagaact gaagctgact gcagagaact tcaagacctt 3960 ttggatacct tcgcacgttg gtgttgccta aacttcctca taataagcgt tcctaaatgt 4020 tgcgtcatct cgttccgtcg aagcaaatct ccaattctgt acgattacag tataaatgga 4080 cagaaactgg aacgcgttga taaagtcaag gatcttgggg tgttgttaga ctgccaattg 4140 acgtttaaat tgcattattc cgccatcgtg gataaagcaa accgacagtt ggggtttgtc 4200 ttcaaaatgg ccaaagattt tgacgatcct ctttgcttgc gcgctttata ttgctctctt 4260 gttagatcta tcttggaatt tgcatctact gtgtggtcgc catatgaagc cgtgtggatt 4320 gcaagaatag aagctgttca aagaaaattc gtgaggcgtg ctttgataaa tctaccatgg 4380 cgacatcctg aaatgctacc accttatgag gatcgttgtg ccttgctggg aattgaaccg 4440 ttacgtgtac gtcgcagaat cgatagagct gtatatggag cgaaaataat tcgtagcgaa 4500 gtagattgtg cagctttgtt ggagcgtatt cgactctatg cccctgagcg tgttttacgg 4560 tccagacaag ttattcaggt tgaatccagg aatactaatt acggagcaaa tgatccggtg 4620 aattctatta gtagatgttt tgtagaaggt tacgagctgt tcgattttaa ctcttctgta 4680 gagtcattca gacagcgatt gtacagatct cgtttattta attaattatt ttaagatttc 4740 ttcattaaga ccccgatgtc agatggaaat aacaacaata aagaaacaat aaagaaagaa 4800 gaa 4803 // ID DNA-TA-2_CQ repbase; DNA; INV; 390 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-390 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 52-52 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. TSDs are TA. XX SQ Sequence 390 BP; 98 A; 88 C; 90 G; 114 T; 0 other; cacccaaagt taaagaacga ggggatagtc ctcacgaaaa gaaacgtgag gaaagtgcta 60 ttttgcgaga gtatagtcct ctcgcgtgtt cattcgttca ttggaacagt tcatcctatt 120 gagtggctta tatggacaaa tgaccgccac ttagtcaccg aaccatggtg gcccatacgg 180 caaaggcacg gttcaatatg ccgaaggtct tgggttcgag tctcggtacc ggtacttttt 240 ttttgataga tgaacttttt ttgaagatga acccatgagt aaagtactct cggtaatttc 300 gggatttatc ctctcagtcc gcacacagta ccattttact accgaatcgt gctctttatc 360 ctctcgtatg ccgatgccca gttctgggtg 390 // ID EnSpm-4_HM repbase; DNA; INV; 8857 BP. XX AC . XX DT 29-DEC-2008 (Rel. 13.12, Created) DT 29-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-8857 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1905-1905 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2862..4946 FT /product="EnSpm-4_HM_1p" FT /translation="MSYFYSQRTKKRKIAVVVSQILNSACLSIPDSHVSTC FT VSEANSINLPINMVQISSSTVPINYLPLLDVSSSNDDLNYLLDDSCNTEEF FT NDLPEWVARINDDSDEEDTDYTEELDKSETLKDKLVSWAVKYNISQTACTN FT LLGFLHKLHPDLPKDARALLSTCRNVNVRNVAGGEYFYFSLQYWLSIFTER FT YPLNNNINQLNLHINIDGVPIFKSSTNSMWPILCSVKNTGWCLFPIAIYFS FT KSKPTSLTEYMEDFVSEMIMLEKTGFKKSGKCYSIKLDAIICDAPARAFIK FT CIKPPNAYNSCERCVQTGEWLGKVVMPNLFAVLRTDSGFRNNADPAHHCIS FT FVSPVLQLNIGLVSGFPLDYMHLVCLGVVRRLINQWIHGLSAYRLSRTTID FT FISNKLVSMKTYTPREFARKPRSLLEYKHWKATELRQFLLYSGVVVLKGIL FT PKELYVNFLCLSVAIRILISPSLCNIYVDYSQELLKYFVSNFCMLYGNDQC FT VYNVHSLIHLPDDVRRFGVLDNISSFPFESYLGRLKKLIRSPQSPITQIVC FT RLSEGHLQHSETNGVDFKSKFKKIHFKGPVPLPLCHLSQYKKYFGSQFLVS FT NENGNNCFAIDNKVCLVKNILAENDSNESDAIVVYVEFERKEPFFTDPLDS FT SLLSVFYVEKLSYLVKVYSLKKLRTKYYLLPHKNGFVVIPQMHFY*" XX SQ Sequence 8857 BP; 3081 A; 1154 C; 1321 G; 3299 T; 2 other; cccagccaac caatatatat gggtaccgta tgggattcgt ccggttatct ggtatgggat 60 acataagggc atttttggtg ggatccgtct gggacccgtg ttgaaattat tctcgattac 120 cagaagttaa tattgaggca aatataaaga agtatattta aaataaaaat ggctcttcgt 180 ttagcgatat gtgtaattta tttaaacaat tttatgaatt attcatgcat ttattaataa 240 ttatagttgt ttttgtttgt aaatatttat ctaattaata tttttgattt atgcatcata 300 aaaattataa acgaaaaatt tttaattttc atttgttttt cggatttgtt gttatgcata 360 aactaagttt tattaataaa aattcttttt cgtgaacatt taagatgttt tattttaata 420 ccacttttta attattgaaa cataactaaa aaatggtcat gatctaatcc ataaaataag 480 tttaaaaaaa ttaatcttga ttaactttat caagattgtt ttctttaaag ataactttaa 540 taaactagat aaactttatc aagattgttt tatttcataa agattatgat aaaataatta 600 aactcacgca tggtccatag cgtggcattt aacgcatggt ccatggcatg gatcatgtca 660 cggaaaggta gaataatgca ttcacaaagt aatgtattta aaataatgca ttcacaagtt 720 atgcattaac ataatgcatt cacaaagtta ttaacaggaa ataattgttt cctgtaaata 780 acttagtgaa tgatgtctac cattatgatg tctatctttc catgcgttaa tttaattaca 840 ttatcaagat ttctttttaa acttttttca tggattacat catgacaact acttggctga 900 gcctcattaa aattaactta ccatgcagca tggtaaatta attttattga agcacattaa 960 agctaaattt agtcatttag ctttatatcc ttacttaagc agttgatttt tatgtatttt 1020 atgtgaagaa ttgtttttaa aagatttaat gttgttaatt aatctttaaa actgaaatta 1080 aaaagtgagt ttttgaataa aatattatac attattttca gaaagatcgt agtaaattgg 1140 tttataaggt aacattttat tattattaca acaattctaa ttttcaaaac caaataatta 1200 tgcaactcat tgcgtgcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 1260 tgtgtgtgtg tgttatatat atatatatat atatatatat atatatatat atatatatac 1320 atatatgcat acacatatat atatacacgt atgtgcacat atatatatac acacacactt 1380 gaaaaaaaaa gattacaaaa aaatatatat tatatatgta aagtgtgtta gtgtaattta 1440 cataatccct gcttaatgat cttaaataac agagcagtaa taaattagta aataaacact 1500 aatctatctt tttcttctgt aaattgcgaa gaaaaagtta gtaactatag ttggataaac 1560 taatattgta ttatgttgtt aatggtgtaa tattgtatat cctattgtat aaattgcatg 1620 ctgttagttt gttaacttta aggacaaagt agctcaataa tacatccagt ctcaatataa 1680 gattttgtgt ttgataaagc atatcattaa tgtttatcag taaaatgact cccagacaga 1740 atttaattat aatattttta tagggtttag tcagatataa ttaggtaaat caaattagtt 1800 gacataaact acctaacaaa actttagcaa aactctaaaa aaatcaattg aaaaattgac 1860 ttgcacaatg ctgattcgca cttttaccac ctaaaattta tacctatttc ataaattaag 1920 caatttacct agttttgttg tgaaacttaa cctatatcat atttttttaa attcttttat 1980 ttgataaaaa ggtgcaattg aacctttaat gaaagaatta aatttctttt agaattgctt 2040 ttatgttatg ttactcaaat tttgatgaat aaattttcac gactgtttac aataaacaga 2100 tttcactcta attttataat atgaatattt ctactttatg tacctaaact atcaaagtaa 2160 aaaagccaac tcttatattg cataattgaa agtaatttaa aagttcttta tatttttgat 2220 taattttgaa caaaagcatt tgcatatggc ctattccaca taaaaccttt ttattcttca 2280 gactctgtaa aataaaataa gctaaaaagt gttttgattc tggcaaaaag agggcgaact 2340 tattaccgaa ataaaattta acttacattt ttcatggaat cggccgtata tttagactga 2400 tttttatatt ttatttgtta aaacattttg aattaaaaat ataaatctaa acaatgacat 2460 gtctgtatag gttatttgaa actaagtata tagctgtata twttgtcatc catctttgta 2520 actgcaataa aaagttatat aaaattgagt tatacaaaga agacyttttt tcttgatgtt 2580 actgatttta aaatgaactt aatcaatcat aaagagttgt ccattgtttt taatgtcagt 2640 ccttacgttt tacttttatc tttttttaat acctatatgt tttgtttgct atttcaatca 2700 atatgttttt attacctatt ttatctaaat tgtgtgacct ataatatgca ggtaagcaat 2760 tatctgcaca gtgtaattat taaacaatca tttttagctt gtactaataa tttttttcca 2820 tttagaaatt taaatctaat gtataataga gtttatttgg aatgagctat ttttattcac 2880 aacgtacaaa aaagcgtaag atagctgttg tggtttcaca aatattaaat agtgcatgtt 2940 taagtatacc tgactcacat gtatcaacat gtgtttctga agcaaatagt attaatttgc 3000 caataaatat ggtacaaatt tcaagttcca ctgttccaat taattatttg ccattacttg 3060 atgtgagtag ttctaatgat gatctcaatt atctgctcga tgactcttgt aacactgaag 3120 agtttaatga cttaccagaa tgggttgctc gaattaatga tgacagtgat gaagaggata 3180 ctgattacac tgaagaactc gataaaagtg aaactttaaa agataagtta gtttcttggg 3240 cagtaaagta taacatttct caaacagcat gtaccaattt gcttggtttt ttgcacaagt 3300 tacatcctga tttgccaaaa gatgcaagag cattacttag tacttgtaga aatgttaacg 3360 ttagaaatgt tgctggtggg gaatatttct attttagcct acagtattgg ctttctattt 3420 ttacggaaag atatcctctc aataataaca taaatcaatt aaatttgcat attaatattg 3480 atggtgtccc aatttttaaa agctcaacaa atagtatgtg gcctatactt tgctcggtta 3540 aaaacacagg atggtgtcta tttccgattg caatctattt tagcaaaagc aaaccaactt 3600 ctcttactga atatatggaa gattttgttt cagaaatgat tatgttggag aagacagggt 3660 ttaaaaaaag tggcaaatgt tactctatta aattagatgc aattatttgt gatgcaccag 3720 cacgggcctt tataaagtgc attaaacctc ccaatgctta taattcttgt gaaaggtgtg 3780 tgcaaactgg agagtggctt gggaaagttg ttatgcctaa tttatttgca gttttgcgaa 3840 ccgacagtgg tttccgaaat aatgcagacc ctgctcatca ttgtatctcg ttcgtatctc 3900 ctgtattgca actaaatata ggcttggtta gtggatttcc tctagattac atgcatttgg 3960 tttgtttagg agttgtacgt agacttataa atcaatggat acatggtttg tctgcttatc 4020 gactatcaag gacaacaata gattttatat ccaataaact tgtttcaatg aaaacttata 4080 ctcctcgaga atttgcacgc aagcctagat cactattgga atacaaacat tggaaggcta 4140 cagaactaag acaatttctc ttatattctg gtgttgttgt tttaaaaggc attttgccta 4200 aagaattata tgtaaacttt ctgtgtctgt cagtagctat tagaatactt attagtccat 4260 ccttatgtaa catctatgta gattattcac aagaattatt aaaatatttt gtatctaatt 4320 tttgcatgtt atatggaaat gaccagtgtg tgtataacgt ccattcattg atacacttgc 4380 cagatgatgt tcgaagattt ggagtgctag acaatatatc ttcttttcct tttgaaagtt 4440 atttaggaag gcttaagaaa ttaattcgca gtccacagtc tccaattaca caaattgtat 4500 gtagattgtc agagggccac ctacaacatt ctgaaacaaa tggtgttgat ttcaaatcta 4560 aatttaaaaa aatacacttc aaaggtccag ttccattgcc attatgccac ctttcacaat 4620 ataaaaaata ttttggttct cagtttcttg tttctaacga aaatggaaac aactgctttg 4680 caatagacaa caaagtttgt ttagttaaaa atattcttgc tgaaaatgat tcaaatgaat 4740 ctgatgctat tgttgtttat gtagagtttg agcgtaagga gccttttttt acagatccac 4800 tcgactcatc tttactgtct gttttctatg ttgaaaaact ttcatatctt gtaaaagttt 4860 attctttgaa aaaattaaga actaaatatt atttgttgcc acataaaaat ggttttgttg 4920 taatccctca aatgcatttt tactaaagtt aatattgtgt ataaaaactt gagttttaaa 4980 taataaataa tttatgtatt aatgtttgtt atatatttag tatttattaa taatcatgtt 5040 ttttttggta ttattacatt ttcaatggca tttatatata tatatatata tatatatata 5100 tatatatata tatatatata tatatatata tatcactttt tttttttgaa agttgtgctt 5160 gtcttttttt tatttttaag ttgttttttt taaaatttca actattatga aaaaaattct 5220 atttagaaag tatttattaa tagaaagttt acttttaaaa actttaatta taaaaacatt 5280 ataataaaaa gatacataat tattatattt aaaagcttta aagcatttct tatttcatat 5340 cgtataaaaa gtatagtggg aaatcttttt tgtttttttt tgtttgaaag tttacttttg 5400 tgtaaacaaa aaataaataa aaatccttaa gttaagaact ttaaatttac atttaaagtt 5460 cttaaagtaa gtcttaaagc tagtttatag taaaattaat tgagtaaaaa tatgtttgca 5520 attgttgcat ttgataccac ttctgagaca gactatgttc cattaaaatg gcttgaggga 5580 gtggatgtta ataatatcca attaatgatt acaaataaca catcagtaaa gtgccattgg 5640 ccaccattca aaaatcccaa cactgttact aaagctaaaa ataacagtta tgatgcagag 5700 atgaattggc cattctacat agcaagggtg ctgggattag ctagtaaatt ttttttgcat 5760 ataggttatt ttcttcattt aatgtatatt tatacatgac tatttatcag tttttatttt 5820 gaaacaaatt tttagacaat ctgaacaatg ctcgtcaaaa agccaaaata gctgaagaca 5880 catctagttt ggaaaatgct tttgaggaga cagaagaaag ttcaggaact ttcaggaaaa 5940 acaaaaaaag acggtatgca tttaatgact tttttgttta tctataaata aatcaaaaca 6000 ttttatttta aatgagcaca ataattttac tttaaaaaac ttgattattt aatttttgtg 6060 tcattgagct gttaataata taataaagca atgatagttt tttttatgaa tatatgctca 6120 acctattttt tgttgtatag taaatcaatc aatatgtcag attatatttt aaaaaaatgt 6180 tatgttcttg tcttttttat attgatgttt ttagagtgac aaagatcagt tatgacagag 6240 aaagtgaaaa tgaggattct gaaaacgatc tattgcacag tgtagaacca aagtaatttt 6300 ttttttacaa gtttggtttt tttgtttttc cttaattata gatatattta tatttattac 6360 ctgtagacac taacattgtt tgagattatt tttgtatttt aaaagaaaga agatgttagc 6420 aaattcaact cttgtcatta aaaagcccca gcaaccctat tacaaactct catcaaacag 6480 caaattcaga gttgatcatc agttgcaaca gtgagtttca ttatttgtgt gtataccaag 6540 atttatttat atataattta atataatgtt ctaaataaat gatatagaaa ttcataaagt 6600 tgcaaaaatt tggctcaaac ttttcataat gtaattttca ttaaaaaata atgttattac 6660 cacctttcca aaccacacaa agtatatagc cgtagatgca taccatgcaa agtatggatc 6720 cctgaatgtt tcaaaaaaat ttgaaagtat ttgtattaaa gcttcaaatt gattatctta 6780 cttttagtaa tgttttaaaa attttcatgt gtgtgttaag taaattaaaa agtaacatgc 6840 tggtgaaaaa atatttgtat aattagtttt tattatcaat ttaattagtg atgaacattt 6900 taatgacttg aatatatgga atagcgaaca attttcgccg cctattgctt taactcctga 6960 atgcgcaaca tcagttccaa tgttgagttc tactattgca ggtaattcta aagtaacaac 7020 ataatttaca ctattttaaa tcttgacaaa ctgctaaagt tcaaaaggtt taatatgaac 7080 cattctctga atgggctaaa tacaaatgtt ttaaaaattc aaaaaaaaaa ggtttgtttt 7140 gagtgctaca ttattaaatc tgcagtactg gttaaagata actgtttgca ttaatttttt 7200 tatttagcct ccggtttaag tcaaactgct aacctgcaaa ctatatttaa tttgataaca 7260 actttggtta gcaatgttga ggagataaaa aaaactctaa aagtacatac atcactactg 7320 cattctatta aacacaaagt gcgagtgaat gaagaagaaa tatttgattt gccagaagat 7380 ataaaattgc cactgacatc tatagctgaa gttgacacat tggaggaaaa gttactttgc 7440 gcagagaaaa aaaaaattct tgtgagttta aaatagttta caaatttaac tcaagtttag 7500 taaggattaa ttttaaaaag taacatgggt gggttgaact ctgatggtgg actaattata 7560 tcattactca atcaggggta atgatataat aatcagaagt caggaaggag aaataaaata 7620 attttttttg ctgatttctt actgataaga gttttaaact tattttagcc gcatcacttg 7680 atgtgctcaa gattaaattt tttaattcta attaaactta acttgtgtat taaaatttgt 7740 tttttaacat tttataaaaa actttttcgg tttaaaatgt tgtcgagtaa ggcttgttta 7800 aaagaagtgt gatcgaacgt gacaattaca tcttttttat tttatgtcat ttctttaagg 7860 caaaacattt atcggacatt ggtggtgaag accttcgaaa ttttattatg agagcgatgc 7920 ctgttctact tgatggcctt cttgctcgcc aattcaattt gacagggcaa aaaggaaaga 7980 aatcgtttaa agcactattg ctttgccaag ttttttttag taagtaatat aaatttatgc 8040 ataaccctag ttattattga ttactttaat tgattaattt aatacagtgt agtttcattg 8100 caatcatagg tgctgtgaaa ctgaacccaa acacaaaaca atgcactatt aaagatattg 8160 aaattgaatt gtcgaaatgg ttttctaacg ctagagatcg tggcactgat ggtcgtcgat 8220 ttgcaatcac gaaaattaca aagacggctc tcagttgaga ttgatggtca aaaaaagcat 8280 ctaccgatct ctttcaatat gatttgtatt ttgtttaatc ttttttttat aaatgaagaa 8340 aaaatttatc aaaatcggta gtattttatg taagagttac ttttttttaa taaagtcttc 8400 aaaatatcga gctgatttgc gtaatcaata attttaatat acgcatttat gcctacatcc 8460 atttgtaaat tgcacatgtt cataaacaat aataaatata gagttcaata ttttttgttt 8520 acgtacagac attgtatgat tgaattgtgt ataaaaacat taaaacacgt attataagta 8580 aaccgtatgt gaccaattag tggaaaagaa atcggaaact tacgggatct ttatgggaat 8640 cccagcgaaa taacagacag agcccatatg ggaatcgtaa gggcaaacaa atggtaaatg 8700 tctgggagcc ttatgggcag cccagcggca acccttacgg ttcccatatg gggcccgtgg 8760 acaacacccg ctgggatccc gttgggaaac ccttatgggt cccgtatcca agcccatata 8820 agtcccttac gggacccatg tacactggtt ggctggg 8857 // ID Penelope-1_AAe repbase; DNA; INV; 2465 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Penelope-like element family from Aedes aegypti. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2465 RA Kojima K.K. and Jurka J.; RT "Penelope-like elements from the yellow fever mosquito."; RL Repbase Reports 11(4), 1435-1435 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >96% CC identity. CC Both termini are uncertain. XX FH Key Location/Qualifiers FT CDS 3..2030 FT /product="Penelope-1_AAe_1p" FT /note="reverse transcriptase." FT /translation="MGSRFALPYTNMNELPIYHLIADVENIIQTNSTTEVQ FT ERNRLPTKSKIFSTXKEERASMIRQPTSTGVPSKQRTFLKEHPDLVVIEAD FT KGNXTVVMKREEYECKMQRMIDDDDTYRKLNRDPTTTYQRRNNNFAKRLAD FT LKLIDRATEMRLKTYKATAPRIYGAPKAHKEGLPLRPVVPCMTSPAYTLSQ FT YVGKIIQKSITGKYNATDSFTFCEYINNVELPPDYVLISLDVVSLFTCIPK FT DLVLRDIINNWHSIKQHTNINLDLFLEMTEFCIDCSYFRFKGQFYQQVFGT FT AMGNPLSPIIADLVMETLLDNVVARVSFPMPVLKKYVDDLILAVPKDKVEE FT VRRMFNEYHDRIQFTVEEENDRKIPFLDLLLVRQTDQTVKTEWYMKPIASG FT RFLNYHSAHSFKLKVNVATNFIHRVKTFSTNIDPNTARGIIRSQLKLNDYP FT PTIINRLIDRSRERRVNTTTNDDVTTPETTVYRSMVHIGQLSGKIQKFLKK FT DYPNVTISSKNAKAVGNILPPVKDVEDKINKSNVIYSIPCADCPACYVGMT FT TNKLQTRIASHRSCSNKLQSMWEEGKTTEDLEVAQLRERTALLDHSAASHH FT TFAFDRTQIVDSSFKKQNLHILETCHIINTHNTVNKRTDTDNLSNTYAGVL FT HTLKCSXANRQTRHTTQTESTQQESTQ" XX SQ Sequence 2465 BP; 821 A; 599 C; 516 G; 518 T; 11 other; gtatggggtc aagatttgcg ttaccataca cgaatatgaa cgaacttccg atctaccact 60 tgatagcgga cgtggagaac atcatccaaa caaacagcac aaccgaagtc caggaacgca 120 acaggttgcc aaccaaatcc aaaattttct ccacgcsgaa agaggaaagg gcaagcatga 180 tccgacaacc aacttctacc ggagtaccat caaagcaacg aacgtttttg aaggaacacc 240 cggatctagt ggttatagaa gcggataaag gaaacckaac ggtggtgatg aaacgggagg 300 aatacgagtg caaaatgcaa cgcatgatcg acgatgacga cacgtaccgg aagctcaacc 360 gggacccaac aacaacgtat caaagacgca acaacaattt cgccaaacgc ttggcggatt 420 tgaagctcat cgatcgtgcc accgaaatga gattaaaaac ctacaaagcg acagcaccgc 480 gaatatatgg tgccccgaaa gcacacaagg agggcctacc attgagacca gtggtaccgt 540 gtatgacgtc accggcatac acactgtcgc aatacgtggg taagataatc caaaaatcca 600 tcacagggaa gtacaacgcg acggattcgt tcacgttttg tgaatacata aacaacgtcg 660 aactaccgcc ggactacgtt cttatttccc tggacgtagt gtcgctgttc acttgtattc 720 ccaaggactt ggttctccgt gacatcatca acaactggca ctctattaag caacacacca 780 acatcaacct ggacctgttt ctggagatga ccgaattctg cattgactgc agctatttcc 840 ggttcaaagg acagttctac cagcaagtgt ttggtacagc gatggggaac cccttgtctc 900 ccataatcgc ggatctagta atggaaacgt tactagacaa cgttgtcgcg agagtcagtt 960 ttccgatgcc tgtgctgaag aagtatgtag acgacttgat cttggcggtg cccaaggaca 1020 aagttgaaga ggtgagaaga atgttcaatg aataccacga cagaatccag tttactgtag 1080 aagaggaaaa cgaccggaaa attccgttcc tagacttgct tttggttcga caaaccgacc 1140 aaacagttaa gactgaatgg tacatgaagc caatcgcctc aggaaggttt ctgaactacc 1200 attccgcaca cagtttcaaa ctgaaagtaa acgtggcgac caacttcatt catcgagtca 1260 aaaccttctc caccaacatt gacccgaaca cagcacgtgg catcattcgc tcccaactaa 1320 aattgaacga ttatcccccc acaatcatca accgcctcat cgatcgatcg agagaacgac 1380 gggtcaacac caccacaaac gatgacgtca cgaccccaga aacaaccgtt tatcgatcaa 1440 tggtacatat cggacaatta tcgggaaaaa ttcaaaaatt cctgaaaaag gattatccga 1500 atgttaccat tagttcgaaa aatgctaaag cggtaggaaa cattttaccg ccggtaaaag 1560 atgtggagga taaaatcaac aagtcaaacg tcatctacag catcccatgc gcagactgcc 1620 cagcgtgtta cgtaggtatg acgacaaaca agctgcaaac gaggatcgcc agtcaccgct 1680 cctgttccaa caagctacaa agcatgtggg aagaaggcaa gacgacggag gacctggagg 1740 tagcacaact tcgagaacga acagcgctgc tagatcactc cgctgccagc catcatactt 1800 ttgcgttcga tcgcacacaa atagtggatt ctagttttaa gaaacagaat ttacacatac 1860 tggaaacatg ccatattata aacacacaca acacagtcaa caagcgcact gatacagaca 1920 atttgagcaa cacatatgcc ggcgtattgc acacacttaa gtgtagtgaw gccaatagac 1980 agacaagaca cacgacacaa acagaatcaa cacagcaaga gtccacacaa taagaataaa 2040 agtggcgata aacaggcaac tktatcgwag ttctgcgacc cggatcgttt ttcggattct 2100 acgcccgtgt taagaccaga acgggcaaag taagtaaccg agacgataaa agtgctccag 2160 cactccctgc tcgcctgttm gtttawattt ttatctgact cttgcttgta gattctgtat 2220 gttgacttat gctgcctatg ctgtgggcaw gttaacgaga aaacggaaca cagttagtaa 2280 gtkaactcaa cataatcaaa cataaacaca acataaaatg ttcattgtcc acagactcct 2340 tgaaaaaggc acaataaatg tgtccgaaac gtcggatgaa aacktaaaat ccgtttttga 2400 gcataataga ctgaaggcca taacctacaa cataatcatc acagtcgtat ccccgagaaa 2460 gwttc 2465 // ID Gypsy-212_AA-I repbase; DNA; INV; 4749 BP. XX AC supercont1.5; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-212_AA_; KW Gypsy-212_AA-LTR; Gypsy-212_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4749 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.5; Positions 732451 737199. XX CC Positions [2160-2702] - Reverse transcriptase CC Positions [3774-4256] - Integrase core CC 'CAGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 444..4676 FT /product="Gypsy-212_AA-I_1p" FT /translation="MSESESDLSFEEALGRLSVGGGSDQRVSGTVKIEKDN FT DLPPNVVDRSKKNKHSYKLKMEEQYKAEIEALRTELAQVKLAARSASQAVA FT STSATMNTRMPDFRELKEYVSTFDPKVPSCLSADIWVKSIDDTGDVYEWTN FT AVRLHCARLNLGGCAKLWLESCPNAMRDWATFKAEIVKGFPSKKNPIYYHN FT LLSSRKWKPGEIVEEYVYEMLALGRKGGFDEETTVTYITSGLRQYVKRSGM FT TIGKVCTVETLLEELRWIDSVDAVAASKVADPSPAVQSAQRKSESFSDVCF FT QCHQAGHIARRCPRVMCHLCKKEGHMKKECPSVAFKREAPRNQTPKPMRVI FT DQKSAFVKNVLVGGVSMRALVDTGGKVSTIQEKFAKNVGEIKPSQKVLRGF FT GKKEIAVTSKVCAELQVDGVSLPVELQIVPTWVQDTAVILGEDIIDSEGLV FT MMKRKGDVRFEWDGAQQQRQSPREPRPRDSSVESSIANMYTIDVESKGQAI FT TENQINSDGSVEYNTHLCRLIANYRDCFALNMREMGRATSAEMKISLTSDE FT PVYVKPRKLEYARESVLAEIVHDLLEAGIIAETESPYNSQVVLVPKKNNEF FT RMAIDYRLLNSRTIKDKFPMPDIESCLQKLAGADLFITIDLYSGYYQIPLE FT PESQNFTAFSTVDGHYRFLRMPFGLVNGCAVFQRAMNKIVEKLRKQKIIIV FT AYIDDLILPGKTEEELLHKLERLLVALREEGFTINLKKSYFFMRQVDFLGF FT EVGKEGVRPGERKTTAVAEFPVPETVHTVQQFLGLSGFFRRFVQNYSLIAE FT PLFRLLRKEAEFVWEAEQQQAFDKLKELLVERPVLVLYDPNAEVELHTDAS FT SVGVAGILLQKVNDVWKPVSYFSKKNSKTEVNYHSYELEMLAVVASVDRFR FT QYLVGRFFVIRTDCSAIRDAYAKKEMNKRVARYFLKLLEYDFRIEHREGSR FT MQHVDALSRSPTEEPRELETVADCIMVLELSNSDFLVSMQRQDPRLLAVID FT KLANDPRCDEDRQIQQNYVVENNRLYRKIGDRKCWVVPNSVRWRIVKSYHD FT DKGHFGEQKVLSMLQDLFWFPKMRKYVRAYIAACPKCAFFKSKPGRPEGFL FT NPIPKTPVPFHTVHMDHLGPFPRSAKGNEHILVVICGFTKFLLMKAVKTTN FT AQPVVTMLEEISCVFGLPSRIITDRGTAFTSNVLQKFCDDYGIDHVLVAVG FT TPRGNGQVERSNRTILTAIRTMVDSGDRKWDEKVRAVQSAINTAPNATTGL FT SPTSLVLSYRPKDMVQNEIVSVIAVESENPQVTTDELRERVQHATQVSQTR FT QKKYYDDHRREAQKYEVNYIVLVAKDQYIPGGSRKLEARFKGPFIVSEVLT FT NDRYRVTTVPGFETARWFSTVYSADRMKRWCSIADLEDTVHSSDEEEY" XX SQ Sequence 4749 BP; 1345 A; 883 C; 1324 G; 1197 T; 0 other; tcctgtagac aggatagaga tctcgggtaa aaccgtacta gtgagttagt cggtggtgat 60 cgcgtagata tattccgtgt ggggatatcg cggtgtgata cacttcgtgt tagtgcgtga 120 cacgggatcg agacttgcgg aagaagcaat atactttcag tgccatgagc cctcatgaaa 180 accgcgtggt tggaccatta caatagctta gtgtccgggg acagtgcgat tcaatagcta 240 ttatctacgg agtataatag gtcaaaagct ccatcgtgcc gcgtccgtgt agttgagtgt 300 tcttttgtcg ggaacgactc tgttgctgtc tcaggtgagg gtagctgagt gagatcgctg 360 gaaatcggct atcgtgtgag atctttggag agctggactt tagatgccac gcagtggtga 420 aactttagtg aagagtgagg gttatgagtg agagtgaaag cgacctctca tttgaagaag 480 ctttaggccg cctgagtgtc gggggtggta gtgaccagag agtttctggg acagtgaaga 540 ttgagaaaga taacgatcta ccaccgaacg ttgtagaccg cagcaagaaa aacaaacatt 600 cgtataagtt gaaaatggaa gagcagtata aagccgaaat cgaagctctc agaactgagc 660 tcgctcaggt caaactggct gcccgaagtg cttcacaagc agtagccagt acgagtgcta 720 ctatgaacac acgaatgcct gattttaggg aactaaagga gtatgtgtca acgttcgatc 780 ccaaggtacc ctcgtgttta tctgctgata tatgggtaaa gtccatagac gacactggag 840 atgtatatga gtggacgaat gcagtccgcc ttcattgtgc aagattgaat ctcggtggat 900 gcgccaaatt atggctggaa agttgtccaa atgcgatgag agattgggca acctttaaag 960 ccgagatagt gaagggtttc ccctccaaga agaacccaat atattatcac aacttgcttt 1020 cttctcgtaa gtggaagcct ggagagatcg tggaagaata tgtctacgaa atgttggcat 1080 tgggacgaaa aggtggattt gatgaagaaa ccacggtgac gtacattacg agcggtctac 1140 gtcaatatgt aaagcgatct gggatgacta tcggcaaagt gtgtacagtg gaaacattac 1200 ttgaagagtt gagatggatt gacagcgtcg atgcggtggc tgcctccaag gtggcggatc 1260 ctagtccagc agtgcaaagt gctcaaagga agagtgagtc gttttcagac gtctgtttcc 1320 agtgtcacca agctggacat attgctcgtc gttgtcctag agtgatgtgc catctttgca 1380 aaaaggaagg acatatgaag aaagaatgtc cctctgttgc gttcaaacgc gaagccccca 1440 gaaatcaaac cccaaaaccg atgcgcgtga tagatcagaa aagtgccttc gtgaaaaatg 1500 tgcttgtggg tggtgtttcg atgagagcgc tagtggatac gggcggaaaa gtttccacta 1560 ttcaagaaaa gtttgcgaag aacgtcggtg agattaaacc gagccagaaa gtgcttcgcg 1620 gtttcgggaa gaaggagatt gctgttactt cgaaagtgtg cgccgagttg caggtggatg 1680 gtgtgtccct gccagtggag ttgcaaatcg taccaacctg ggtacaagac acagcggtga 1740 ttttgggcga agatattatt gatagtgagg ggttagtgat gatgaaacgc aagggcgatg 1800 tgcggtttga gtgggatgga gcacaacaac aacgacaatc gccacgtgaa cctcgccctc 1860 gtgatagcag tgtagaatcg tcgattgcga atatgtatac gattgatgtg gaatcaaagg 1920 ggcaagctat aaccgaaaat cagatcaaca gtgatggatc ggtagagtac aatacgcatt 1980 tgtgccggtt gatagctaac tacagagact gctttgcgtt gaacatgaga gagatgggtc 2040 gtgctacgtc tgccgaaatg aagatcagtt taacgagtga tgaacctgta tatgttaaac 2100 cacgaaaact tgagtacgcc cgagagagtg ttttggctga aatagtgcac gatctgttgg 2160 aagctggtat aatagctgaa actgagtccc cttacaatag ccaagttgtg ttggtaccaa 2220 agaagaacaa tgagtttcga atggctattg actatcgtct tctcaactcc cgaacaatca 2280 aagataaatt tccgatgccc gatattgagt cgtgtttgca aaaattagcg ggggcggatt 2340 tgtttatcac cattgattta tatagtggtt attaccaaat tccactagaa ccagaaagcc 2400 aaaactttac tgcgttttca acggtagatg ggcactaccg ttttcttcgt atgccctttg 2460 gactggtaaa tgggtgcgcg gtgtttcaaa gagcgatgaa taaaatagtg gagaagttga 2520 gaaagcaaaa aattattatt gtggcgtaca ttgatgattt gattctgccg gggaagacgg 2580 aggaagagtt gttacacaag cttgagcgac tgttagtggc attgcgcgaa gagggtttca 2640 ctataaattt aaagaagagc tactttttca tgcgtcaagt tgatttcttg ggatttgaag 2700 ttggtaaaga aggtgtgcgg ccaggtgaac gaaaaacaac ggctgtcgca gagtttcctg 2760 tgcccgagac agtacacaca gttcaacagt tcctgggatt gtcgggattc ttcagaagat 2820 ttgtgcagaa ttacagcctt atcgcggaac ctctattccg cttgctacga aaagaggccg 2880 agtttgtgtg ggaggcagaa caacagcaag cgtttgataa acttaaggag ctgttagtgg 2940 aaaggcccgt gcttgtgcta tatgatccaa atgcggaagt ggaattacat accgatgcgt 3000 caagcgtcgg cgtagcagga attttgctgc agaaagtgaa cgatgtgtgg aagccagtga 3060 gttatttcag caagaagaat tcgaagaccg aagtgaacta tcatagctac gagctggaaa 3120 tgttagctgt ggtggcgagt gttgatcggt tccggcaata cttggtagga cgtttctttg 3180 tgattcgcac agattgctcg gcaattcgtg atgcttatgc gaaaaaggaa atgaataaga 3240 gagtggcgcg atattttttg aagcttctag aatatgattt ccgtatcgaa catcgtgagg 3300 gctctaggat gcaacatgtc gatgcactga gcaggtcgcc tacagaagaa ccgcgcgagc 3360 ttgaaactgt ggcggattgc attatggtgt tagagttatc gaattcggat ttcttggtca 3420 gtatgcaacg acaggaccct cgtttgctag cagtgattga taaactggcc aatgatccga 3480 ggtgtgatga agatcgacag atacagcaga actacgttgt ggagaataat cggttgtatc 3540 gaaagattgg tgatcgaaag tgctgggtcg tcccaaacag cgtgcgctgg cgtatagtga 3600 aaagctacca cgacgataaa ggacattttg gcgagcaaaa agtgttatcc atgctgcagg 3660 atttattttg gttccctaaa atgagaaagt acgtccgtgc gtacattgcg gcttgtccaa 3720 agtgcgcgtt ttttaaatca aaaccaggtc gaccagaagg gtttctcaac ccgataccga 3780 agacccccgt gccctttcat acagtgcaca tggaccattt aggtccattc cctaggtcgg 3840 cgaaaggcaa cgaacacatt ttagtggtga tttgtggatt cacgaagttc ttgctgatga 3900 aagcggtgaa aactactaac gctcagccag tagtgacaat gctggaagaa atttcctgtg 3960 tttttgggtt gccgtctcgt ataattactg accggggtac cgctttcact tcaaacgtgc 4020 tacagaagtt ttgtgacgat tatggaatag accacgtatt ggttgctgtt ggaacgccga 4080 gaggaaatgg tcaagtggag cggagcaacc gaaccattct taccgccatt cgaaccatgg 4140 tggattcagg cgaccggaaa tgggatgaga aggtgagagc agtgcagagt gccatcaata 4200 ctgcaccgaa tgcgacgaca ggcctctcgc caacgtcatt ggtgttgtcg tacaggccga 4260 aagatatggt gcaaaatgaa attgtgtcag tgattgcggt tgaaagcgaa aatcctcaag 4320 ttactacaga tgagctacgg gaacgagttc aacatgctac gcaagtgagt caaacacggc 4380 agaagaagta ctacgatgat catcgacgtg aggcccagaa gtacgaggta aattatatag 4440 ttttggtggc aaaggatcag tacattccgg ggggaagtcg caagttggag gctcgattca 4500 agggcccatt cattgtatcg gaggtgctga ccaacgatcg ttatcgcgtc acaacggtgc 4560 ctggctttga aacggcaaga tggttcagta ccgtttactc agcggaccgg atgaagcgat 4620 ggtgttcaat tgcggatttg gaggacactg tacatagcag tgatgaagaa gagtactgaa 4680 aactttgaat ttgttaatga aagtgaatag aatgacgatg aggacatcta tgaagtctgg 4740 gtgtccgaa 4749 // ID BEL-2_ASu-I repbase; DNA; INV; 6607 BP. XX AC AEUI01005845; XX DT 08-APR-2011 (Rel. 16.04, Created) DT 08-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the pig roundworm genome: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_ASu_; KW BEL-2_ASu-LTR; BEL-2_ASu-I. XX OS Ascaris suum OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-6607 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Ascaris suum genome."; RL Direct Submission to RU (07-APR-2011). XX DR Genome; AEUI01005845; Positions 893 7499. XX CC Positions [4459-5022] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 388..5544 FT /product="BEL-2_ASu-I_1p" FT /translation="MTHLASCLHGNAKEAIAGLPITNESYQEAKDVLAEKF FT GNPVIVRRHLYRTLQMLPSCSDKLSDIKKFSEALSRACKQLRALKEPLDNP FT PLVLSVLQKLPLSILEEMMRDKEAQEWDMELVKRALDSYLRRKEEIASLMD FT GGRRAPHTEFQRSSASVATAFAATKNYTEKGCALCAQPHFADECFTYRGIA FT DREQRITDLKLCFRCLREGHKSRNCKFTRPCFYCRGNHHSALCPQRERVNL FT SWRSPAMQVNNQGGMLQDERYQVRRPSINWPQRESVTRARPEWASRSVVEL FT TEGVNNDSASAGLKTINQRRPSTSPNQQVTYQTNAVVVERAEVKGMASSPQ FT VPTAKDFKGRLEGECRNLPARDTKVYLPTTPVEVYNPKDEASRTVETYALF FT DSGSQRSFITTLVAKQLNLSPVTSEVISLNTFASKRTKKLESQLVEVGLCL FT EKGNTEVIEVNVVPRLVRELTCRTPPKQEKKGENLGQELIVAPGILIGSDF FT YWQLIGSGAPERLANNLYRISTKVGPMIGGRLVEEKEIHSDCGVNVIGGAS FT NAEDRWAEVEKFWKLESIGVNELSKDQENDRALECFDKTIEFINGRYHVCW FT PWKYEETLESNYGLTFGRLRSLLKRLQGDPELLAMYNQNLHEQENLGIIEE FT VPSGSRPDGPVRYVPHQPVITPSKNTTKLRIVFDASAKASKREKSLNECLY FT SGPALLPDLGGMMLRFRLPKIIMIADVEKAFLQLALKKTERDVTRFLWLDQ FT PERGLNNENLKTYRFTRVAFGVVASPFLLGATLRFHLDKVAREAKEQDKEK FT AERSRALTEQIKENTYVDNVVLSAETPKEAVETYHAAKTIFDKASMNLRQW FT ASNDTYVTSQFQETDLMPEGTTNVLGLGWDRQEDTLTVRLAVKDRNEDVVL FT TKRRVLSHLSANFDPLGLIAPVLLKGKLFFQTLWKEGYQWDEILKPEHRVS FT WTNIVTNWKEYHTFVTERRVIRSGSKGLQLHAFVDASQHAFAAAIYLRSEG FT TRNIESHLVFSKTRLSPLRDISIPRLELMACVIGVRALTYVQKQLKVAIEK FT KVLWSDSKCALAWISSSRALPIFVANRVQELNKQDGIVYRHVGGKENPADL FT ASRGVNPKALANNNIWWKGPDWLVETEEKWPLGNDQMKGEVEKKDMISSVL FT ATLTTLERDHEKEEDWLNKVAARVSTWRKLKGVIGYALRWRRIKNEVRHQH FT AMLSMEELQCAERKIMDVMQSKHFHELREALQHNKPHQLRRQLGVIEIDGL FT LRCKGRYEHEDIPAHTKWPILVPSKCRVTELIVEEAHKKNFHHGLQKTLCD FT VRERFWIPKGRAVVKRVLRRCPTCKKYEGGPFMLPPMPALPAYRIRRAFPF FT QHTGVDYFGPMLTKDKGQTTKIWGCLWTCLVTRAVHLDTVSSLLAEQFIQT FT FRRFVARRGMPERILSDNATTFTSAGKILVSEHQKKELNEGLAQYGSENGL FT RWDFITSHAPWKGGAYERMIGLLKRSLSKTLGRKVLRIEELQTLLCEIEAI FT LNSRPITYVYESIHEGKAIRPIDFFSPNVDLLISSSEDKDELSDPSYRPES FT EINALQEKYRVSLYVLDDFWRRWRDEYLLSLREQQKMEHLNPRNAVVREPE FT VGEVVIIYEEIVPRGQWKTGIITCINRSEDNQIRKVVLKTITGENLERPIN FT HLYPLELKACEQREKKKGNDPQTHRSQPENSSNWVLRSRIVQKKAVGRSN" XX SQ Sequence 6607 BP; 2156 A; 1306 C; 1678 G; 1467 T; 0 other; agagacctta cgagactaga aacggatgtg gaaaagataa atgcgagcca tgctgcgtgg 60 agagagctcc tcgtaggttt aactggcgaa aagcaacaaa aggagagtga gcggtatgaa 120 aaagaatgtg aggcccaaaa cagtttcctg gaggtagttg ataaggcaga tgaggcagta 180 gctactctga agtgcaagct tgcagagcta gatttagcag agaaaacatt gccagttctg 240 gaagataaac aagaaggaaa gggatggaga acagaattac ctcgagctac tctcccccaa 300 ttccacggcg atcccctaga atggagaggg ttttgggatt cctttgaagc ggtggtagat 360 gggcagccga ttccgccagt gataaaaatg acacatctgg cgagctgttt gcatggaaat 420 gcaaaggaag ctattgcggg acttcccatt accaatgaaa gttatcaaga agctaaagac 480 gtcctagcag aaaagtttgg taatccagtt attgtgagga ggcacctata ccgaacactg 540 caaatgttac cctcatgttc agacaagtta agcgatataa agaaatttag cgaggcctta 600 tctagggctt gcaaacagtt gagagccctc aaagaaccac tcgacaaccc tccattagta 660 ttgtcggttc tgcaaaagct ccctctttcc attttggaag agatgatgag ggataaagaa 720 gcacaagagt gggacatgga gctggtgaaa agggcgctag actcctacct gagaagaaag 780 gaagagatag ccagcctaat ggacggggga cggagagcgc cacacaccga atttcagaga 840 agtagtgcaa gcgtcgcaac ggcgtttgca gccacaaaaa attatactga gaaaggttgt 900 gcgctatgtg cgcagcccca ttttgcggat gagtgtttta catatcgagg gattgctgac 960 cgagaacaac gcatcacaga tcttaagtta tgctttagat gtctgcgaga aggtcacaaa 1020 agtaggaatt gtaagttcac cagaccatgc ttctactgtc ggggcaatca tcacagtgca 1080 ctttgtccgc agagggagag ggttaatcta tcgtggcgga gcccagcaat gcaagtaaat 1140 aaccaagggg ggatgctaca agatgagcgc tatcaagtac gcagacccag catcaactgg 1200 ccacagagag agtcagttac aagggcgcgg cctgagtggg cttctcgatc ggtagtagaa 1260 ctaactgaag gtgtcaacaa tgactctgcc agtgcagggc tgaagacaat caatcagcgc 1320 cgacctagca cttcaccaaa tcagcaagtg acatatcaaa ccaatgccgt ggtagttgag 1380 cgagctgaag tcaaaggtat ggcttcctct ccacaggtgc ccacagccaa ggacttcaag 1440 ggaaggctcg agggagagtg tagaaaccta ccagctcgcg atacaaaagt ctaccttcct 1500 actactccag tagaggtata taatccgaaa gatgaggcaa gccgtacagt ggagacttac 1560 gcactgttcg atagcggttc gcaacgatcc tttatcacca cgttggtggc aaagcagctc 1620 aatctaagtc ctgttacgtc agaggttatt tcgcttaaca ccttcgcttc caaacggacg 1680 aagaagttgg aatcccaatt ggtagaagtt gggctttgtc tggagaaggg gaacacggag 1740 gtaatagaag tgaacgttgt cccgaggctg gttagggagt taacctgtcg gacaccgcct 1800 aaacaagaga agaaaggaga aaatttggga caagagctga tagtggcccc agggattctg 1860 ataggatcag acttctactg gcaactcata ggaagtggcg cacctgaaag actcgcgaat 1920 aacctctacc ggatctcgac aaaggtagga ccgatgatag gaggacgact cgtagaagag 1980 aaagagatcc acagcgactg tggggtgaac gtcatcggag gtgctagcaa tgctgaggac 2040 cgctgggcag aggttgaaaa attttggaag cttgaaagta ttggggtgaa cgaattatca 2100 aaagaccaag aaaacgaccg agcattggaa tgtttcgata agacaatcga atttattaac 2160 ggtcgatacc acgtctgttg gccatggaag tatgaagaaa cgcttgaaag caactacgga 2220 ttaaccttcg gtcgtctcag gtcactcctg aagcgactgc agggagaccc agaattgtta 2280 gcaatgtata accaaaatct gcatgagcaa gagaatctag ggataatcga agaagtacct 2340 agtgggagta ggccggatgg gccggttagg tacgtcccgc atcagccagt tataactcca 2400 tcaaaaaata ctacaaaatt aagaattgtg tttgatgcct ctgcaaaagc gagcaagcgg 2460 gaaaagagtt tgaatgaatg tctttattcc ggacctgcac ttttacccga tctaggcggg 2520 atgatgttga gattccgtct tccaaaaatc attatgatag cagatgtaga aaaagcattt 2580 cttcaactag ctctcaagaa aactgaacga gatgtgacta ggttcctgtg gctagatcaa 2640 ccggagagag ggctaaataa tgagaatttg aagacctatc gctttacaag agttgcgttc 2700 ggtgtagtag ccagcccctt cctgcttggg gcaacgttaa gattccattt ggataaagta 2760 gctagggaag caaaggaaca ggataaggag aaagcggaga gatcaagagc actaaccgaa 2820 cagatcaaag aaaacacata cgtagacaac gttgttttaa gtgccgaaac cccaaaggag 2880 gcagtggaaa cctaccacgc tgcaaaaaca atatttgata aagcgtcgat gaacttgagg 2940 cagtgggcct ccaatgacac gtacgttacc agccagtttc aagaaacgga cttaatgccg 3000 gaaggcacaa caaacgtgct tggattagga tgggataggc aggaggatac cctgaccgta 3060 aggctcgcag tcaaagatag aaacgaggat gtagtgctaa caaaaagaag agtcctttca 3120 cacctgtctg caaactttga ccccctagga ctaatagcgc cagtacttct aaaagggaaa 3180 ctcttttttc aaacgttatg gaaagaagga taccaatggg atgaaatact taagccagaa 3240 catcgggtga gttggaccaa catagtgacg aactggaagg aatatcatac gtttgtaacc 3300 gaacgtagag ttattaggag tggcagtaaa ggattgcaat tgcacgcatt cgtagacgct 3360 tcgcagcacg cctttgccgc ggcgatctac ctacgtagcg aaggtacgag aaacatagag 3420 agtcacttgg tgttctcgaa gacgcgacta agccccttac gagatatctc catacccagg 3480 ctggagctaa tggcatgcgt tatcggggtg agagcattga catatgtaca aaaacagttg 3540 aaggtggcca ttgagaagaa agtactatgg agcgactcaa aatgtgctct agcatggata 3600 tcgagtagta gagcgctacc gatattcgtg gcgaaccgcg tacaagaact aaacaagcag 3660 gacgggattg tgtatagaca tgtgggtgga aaagaaaacc ctgctgacct tgcatcgaga 3720 ggggttaacc caaaagctct ggcaaacaat aacatttggt ggaagggtcc ggactggtta 3780 gtagaaacgg aagagaagtg gcccttggga aatgaccaaa tgaagggaga ggtcgaaaag 3840 aaggatatga tcagcagcgt actagcaacg ctaactactc tggagcgcga ccatgaaaag 3900 gaggaagact ggttaaacaa agtagcggcc cgagtcagca cgtggaggaa gttgaaggga 3960 gtaataggat atgctctgag gtggagaaga ataaaaaacg aggtcagaca ccaacacgca 4020 atgctgagta tggaggaact acaatgtgcg gaaaggaaga tcatggatgt catgcaaagt 4080 aaacacttcc atgaactaag agaagcactg caacacaaca aaccgcacca attacgaagg 4140 cagctaggag tgatagaaat agatggatta ctaaggtgta agggaagata cgagcacgag 4200 gacatccctg ctcacacaaa gtggcccatt ctagtgccat ccaaatgccg agtcacagaa 4260 ttgatcgtag aagaagctca caagaaaaac tttcatcatg gactacaaaa aacgttgtgc 4320 gatgttagag agcgcttttg gatcccgaaa ggtcgagcgg tagtaaaaag agtgctcaga 4380 aggtgtccaa cttgcaaaaa atatgaaggc ggcccattca tgctaccccc catgcctgct 4440 ttaccagcgt atagaataag acgggcattt ccctttcagc acacaggagt ggactatttc 4500 ggccccatgt taaccaaaga taaaggacag accacaaaaa tttggggatg tctgtggact 4560 tgcttggtca ctagagcagt ccacctggac acagtcagca gtttattggc agaacaattt 4620 attcaaactt ttcgaagatt tgtggctcgg agaggaatgc ctgaaagaat tttatccgat 4680 aatgccacga cattcacatc tgcgggaaag attttggtga gtgaacacca gaagaaggaa 4740 cttaacgagg ggctagccca atatggaagc gaaaatggcc tacgatggga ttttattacc 4800 agccacgcgc catggaaagg aggagcttat gagagaatga ttggattact taagcggtca 4860 cttagcaaaa ccctaggcag gaaagttttg agaattgaag aattacagac tttattatgt 4920 gaaattgaag caatcttaaa cagtagaccc atcacttatg tctatgaaag tatccatgaa 4980 ggtaaagcca taaggccaat cgatttcttt agcccaaatg tagacctctt gatctccagt 5040 tctgaggaca aggatgagtt gtccgaccca agctacagac ccgaatcgga aataaacgct 5100 ttgcaagaga aatacagagt aagtctgtac gtccttgatg acttctggag gagatggcgg 5160 gatgagtact tgctaagcct aagagagcaa caaaaaatgg aacaccttaa cccacggaac 5220 gcagtggtga gggaaccaga agttggggaa gtggtaatca tctacgagga aatagtacca 5280 cgaggtcagt ggaagacagg tataattaca tgcattaaca gaagtgaaga taatcaaata 5340 cgcaaggtcg ttctgaaaac aattactggc gaaaatctgg aaagaccaat aaaccacctt 5400 taccctctgg agttaaaggc atgcgaacag agggagaaaa agaaaggaaa cgacccccaa 5460 acccaccgat cgcagccaga aaattcaagt aactgggtgt tgaggtcacg tatagtacag 5520 aaaaaagcgg tagggaggag taattagaaa cagctaccgc cctttaccac cataagatta 5580 accgtttaaa taaactttat caaaaactca tgagggggcc ggtgggagtg tcgcggctgc 5640 gcgcgaagaa ataaaatatg cgggagtcga ataaagctgc agaattagtt tctgcagaag 5700 aaatgagaaa attagcaaac aagggagtga agttgttgac aggcactcaa atatgatgag 5760 cacaaaaatg catgagcgag cggtaccagt ggcacaaaga ggcctttgtc tcccaaccac 5820 ataaaagaaa cgaagccgct ataaacacat cattaaagtc atatgcctca aactataaac 5880 acatcataaa agtcatatgc ctcaaattaa cggaaaaaca actctttttg gaattaaaca 5940 aaaacacaag gaagaggcgg ggcagccatc agaaggtgtc aacaagaggg ctatataagc 6000 aaacccaaag tttcggcaat tgttgagtac taaatttgga tcggctttgc gactccagcg 6060 tttttcactt atgtatatgt gacgtgtttc ctgtcttact gtagtgttgt gctatgaccg 6120 tgtttaatga gataagcgtg tgatgtgttc cctgtagtac tgtagagttg tgctatgaca 6180 gtgtctgata aatatgcgag ttgtttcttt cgggttagaa tgtataagaa tatcacggac 6240 atcaccgata attatatatg acctttcttt gttacgtttt aatctaagca ctgatatgct 6300 atgtacatat cggacatgta tacgaattgt ttaatcaagt tataatctat tagcatatca 6360 cgcaatctcc catcattata cgtgaatttt tagccatgcg ttaatctatc gtcatcatat 6420 acgtctctga taatcaaatg ttcttcgtca cgttttttta ttatctatat acgataataa 6480 aggtttaatt acccaagttt gaggtgtggt tcacaaagca gtggctaggt gcacacgggt 6540 cgttgcacag cagaatctga tagtggtatt ataaaggacg aaccacctca gcgcaagtgt 6600 atattct 6607 // ID Gypsy-24_AA-LTR repbase; DNA; INV; 110 BP. XX AC supercont1.380; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_AA_; KW Gypsy-24_AA-I; Gypsy-24_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-110 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.380; Positions 564978 564869. XX SQ Sequence 110 BP; 41 A; 18 C; 13 G; 38 T; 0 other; tgttagcaag ccataattta taattaatta atcatagcaa gccttgtatc ctaggaaatt 60 aaataaaaca ttctagtttt cgccttaatt cagtctagac gtatataaca 110 // ID I-64_AAe repbase; DNA; INV; 6423 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-64_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6423 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1335-1335 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 22..1227 FT /product="I-64_AAe_1p" FT /translation="MTQLIDGTSVQITEHPTLNSTRCVVSCRDVIDVTETV FT LLEELKDQGVKEVRRITRRIGNERENTPSIILTCRGTNRPDHIDFGYIRCR FT TRPFYPSPMQCFNCWCFGHTKLRCKSNAATCGKCSENHPIPEDKACSNGNF FT CKQCQTNEHALSSRSCPFYKMENTIQRVKIDQGLSYPAARRVVEGDNGGRS FT YANAVESDNNDAITELSGRIDQLTAVVDSKDKEITELRNALAARDAPPPVE FT RSEIESLQAIIASQSKQIEALTNQLSTFLKMVMPAGSIVTSDIVVPATNSS FT PVLEDPLALIVNEFPSPSNPKSDENSEYDSTSPDHSPNSTPRPTRASTTKP FT ANKFARPGTPVPVAIKSASSPTRTPNKRSLTRIELTNLQQQKRVKHKNVSE FT GTGTISKR" FT CDS 1304..6085 FT /product="I-64_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MPSHTQSRNMLQRNDAQPADLSRQETGEPEAQFQPES FT PGRPMHSIQQQYLHVYLDSRGASVPEATPQPESLDRPRHSENGSDEEMHCA FT SDSRGIDGPDARDPPGRSNQLQHPMDEKGTKSSRMKNPMISPGSLGPSSAD FT DTSIPELVDPPWRQENVGRGTEKGWLSYTPKQDDVGISSRNEGKESTSSRK FT PGRFHNYAPERKSFPDDRNVEDQFNSFNKMQLPIDALTNAARYDIHIKKAL FT GCGAVVPPGPSWLQRTGKKTIETSVLITDSALFCPEYNQRTTQDILADKSN FT FNEGTSLSPLAAPFFPVVETTLPITAAPSSSISEQANQPDITVGKSSASSD FT SPGFPVPSSGMVFPRTDVSNTTLNNLNTSASTSQSSSSLPYPAPTLPKPVN FT FLLQWNINGYYNNLANLELLTHGSSPWCLTLQEINKISIPQLNKSLGGRYQ FT WTIKKGSNFRHSVGIGVLKTIPFEPLDINSDLPIIGXQLQGPICVNIINAY FT LPCSTIPDFSKRMAKVFEAIPGPILFVGDTNSHHQAWGGDKSDSRGVALLS FT LFEVADMVVLNNGSDTFFNGNFSSAIDVSAVSRSLLSKLLWGINSDTYGSD FT HFPIQISFSTVSPETTRRPRWKYEKADWQSYDLNVRLRLQDNPPNTLPEFT FT RLVYDAANASIPRTSNKPGRKALHWWSEDVRKAVKARRKALRAAKRLPADH FT PERAEVLNRYRTLHVVCRKIIRNSKLASWESFLESLNGSQSSAQLWGSINA FT LSGKRKTTQMSLSINGAYISDPPVVAAALGDYFSHLSSRSNYNDSFMRRVH FT PCSASLPNFHVPNDPSNDLVNTAFSLNELKFALSCCSGNSAGPDNVGYPLL FT KNLPATGVIKLLELINQSWLSDKYPDEWHESLVIPIPKANCCSKDPTKYRP FT IALTCCLSKVMERMVNRRLKQKLESDRRLDQRQHAFRSGFGTNTYFAALGD FT VLHKANAEGLHTEVISLDISKAFNCTWTPLVLQQLVDWGLSGHIVHFCKNF FT LVNRFFRVAIGDTTSESYPEETGVPQGSVIAVTLFLVAMNSVFSVIPRGVH FT IFVYADDILLVVSGSTRGRTRIRAQAAVSSVVKWATSVGFSLSANKSVRCH FT VCKYRHQYDRTPIRVDGNPIPSKKTVKVLGIIIDRHLSFKPHFTNVKNNCQ FT TRLNLLRTISRPHRSNNRGIRFRVAEAIVDSRLLYGLELTCIAYNQLVEIL FT APIYNSYIRTISGLLPSTPADSACVEAGRPPFRHFITKAICTKAAVHAAKT FT SGRRKMFLLEEGNKIFRRVANRNLPPVAKYHWYGDNSWHKETPKIDDEIKR FT RFRAGDNSQTLRMSVLEWLQTKYSGYEHRYTDGSLSQVGVGIGIIGHSLEI FT SKSLQPWYSIFSAEAVAVFIAATTVSVRPILVLTDSASVISALQSDTPQHP FT WIQGIIKNSPHSTVFAWIPGHCGIPGNVAADRLAGIGHAERRYSSTAPLDD FT VKRHIRKKFRDHWNVEWASSHSSYIRKIKQDTSAWDDRKSLREQRVVSRLR FT TGHTRLSHNFDGADFNVICSTCNVRNTVEHFLCVCPQYEFSRQTYGLSSSI FT REILSDDASATTSLICFLKDAGLFYRI" XX SQ Sequence 6423 BP; 1844 A; 1693 C; 1357 G; 1528 T; 1 other; cagtttaaac gtctgttgtc catgacccaa ctaatcgatg gaacatccgt acaaataaca 60 gaacatccaa ctctaaattc tacacggtgt gtagttagct gtcgtgacgt gatcgatgtc 120 accgaaaccg ttttgttgga ggagctcaag gatcaaggcg ttaaagaggt acgcaggata 180 acacgacgaa tcggaaatga aagggaaaat accccttcaa ttatcttgac ttgtcgagga 240 acgaatcggc cagaccatat agatttcggg tacattcgtt gtcggacaag gcccttctac 300 cctagcccga tgcaatgttt taattgctgg tgtttcggcc ataccaaact acgctgcaag 360 agtaatgcgg ccacctgtgg aaaatgttca gagaatcatc ctatccctga agataaggcc 420 tgttctaatg gtaacttctg caagcaatgt cagactaacg agcatgctct ttctagccgt 480 tcctgtccgt tctacaaaat ggaaaatacc attcagcggg ttaaaataga ccaaggattg 540 tcatatccag cggctcgtag agtagtagaa ggcgacaatg gcggtagatc ttacgccaac 600 gccgttgaat ccgataacaa cgatgctatc acggaactta gtggaagaat cgaccagctg 660 actgccgtag ttgatagcaa agataaggaa attaccgagc ttcgtaacgc cctggccgcg 720 cgtgatgctc cgccgccagt agaaagatct gaaattgaga gtttgcaagc aatcatcgct 780 agccaatcga agcaaatcga agcacttacc aatcaacttt ccacgttctt aaaaatggtt 840 atgccggctg gatccattgt cacatctgac atagtagttc ctgccactaa ctcaagtcca 900 gttttagaag atccactagc tctcattgta aatgaatttc cttcgccatc caacccgaaa 960 tctgacgaaa actctgagta tgattcaacc tcgccggacc acagtccaaa ctcaactcca 1020 cgtccaacaa gagcctcaac tacaaaacct gcgaataagt ttgctcgtcc gggaacaccg 1080 gttcccgtag ctatcaaatc tgcctctagc cccactcgga ctcctaacaa acgatcgctt 1140 acacgtattg aactgactaa tctacaacaa caaaagcgtg taaagcataa aaacgtctcg 1200 gagggtacag gtaccatctc gaagcgttaa cgccactctc tatcgataaa tccagttttc 1260 tccaccagcc aatcgccata aagcaaatac cgatagccaa atcatgcctt cccacactca 1320 atcacgcaac atgctccaac gaaacgacgc acagcccgcg gatttgagtc gacaggaaac 1380 cggcgaaccg gaagcccagt ttcaaccgga atcgccgggt agacctatgc actcaatcca 1440 acaacaatat ctgcacgttt acctggatag tcgaggcgcc agtgtaccgg aagctacacc 1500 ccaaccggaa tcactggacc gacctcgaca ttcggagaac ggatcagatg aagaaatgca 1560 ttgtgcctcg gatagtcgag gcatcgacgg accggacgct agagatccac cgggacggtc 1620 caaccaactt caacatccga tggacgaaaa aggaacaaaa agcagcagaa tgaagaaccc 1680 gatgatatct cctggtagcc taggcccctc cagtgcggac gacacttcca taccggaact 1740 ggtggatccc ccttggcgcc aggagaatgt cggaagaggg acggaaaagg gatggctatc 1800 ctatacccct aagcaggacg acgtgggaat atcttcaagg aatgagggaa aagaaagcac 1860 aagctccaga aaaccaggcc gtttccataa ctatgctcca gagaggaaat catttcctga 1920 tgacagaaac gttgaagatc agttcaactc cttcaacaaa atgcagcttc cgattgatgc 1980 actgacgaat gcagcgaggt atgacatcca catcaaaaaa gcactaggat gtggagctgt 2040 cgttcctcca ggtccttcct ggttacagcg aacaggcaaa aagaccatcg aaacatcggt 2100 actcatcact gatagcgctc tattttgccc ggaatacaat cagagaacaa cccaagacat 2160 cctagcagac aaatcgaatt tcaatgaagg tacatctctc tcaccacttg cagcaccttt 2220 cttcccggta gttgaaacaa ccctgcccat aacggctgca ccttcatcgt ccatttccga 2280 gcaagcaaat cagccagata ttactgtagg taagtccagc gcctcctctg attctcctgg 2340 attcccagta ccttcgagtg gaatggtctt cccccgtacg gatgtgtcca acactacttt 2400 aaacaacctc aacacaagcg cctctacaag tcaatcatca agcagcctcc catatccggc 2460 tcctacccta cccaaaccag tgaattttct tctccagtgg aatatcaacg gatattacaa 2520 caacttagcg aatctcgaac tcctaaccca tgggtcctct ccttggtgtt tgacgctgca 2580 ggaaatcaac aaaatctcta tcccccaact caacaaatca ctcggtggta gatatcagtg 2640 gaccataaaa aaaggtagca acttccgaca ctcagtaggc attggtgtgc taaaaacaat 2700 accattcgaa ccgcttgaca tcaactcaga tcttccgata attggagwgc aacttcaggg 2760 ccccatatgt gtcaatatca tcaacgcgta tctaccatgt tcaacaatcc cagactttag 2820 caaacgcatg gccaaggtat ttgaagccat ccccggtccg atattatttg ttggagacac 2880 caactctcac caccaagcat ggggaggtga taaatccgat tccagagggg tcgctttact 2940 aagcctgttt gaagtggccg acatggtggt cttaaacaac ggttccgaca ccttcttcaa 3000 cgggaacttt tcttctgcca ttgatgtctc agcagtcagc cgttcacttc tcagcaagtt 3060 gctctggggc ataaactccg acacctatgg aagcgaccat tttcctattc agatcagctt 3120 ttccacagtc tctcccgaaa caaccagaag gcctaggtgg aaatacgaaa aagccgattg 3180 gcagtcctat gatttgaatg tgaggctgcg tcttcaagac aatcccccaa atacattgcc 3240 ggaattcacc agactagtct atgacgctgc aaatgcctcc atccctcgaa ccagcaacaa 3300 gcccggccgt aaagcccttc actggtggtc agaagacgtc cgcaaagccg tcaaagcacg 3360 tcgaaaagca cttcgagcag ctaaaaggct accagcagat caccctgaaa gagcagaggt 3420 gctcaatcgt tatcgcacat tgcacgtcgt atgtagaaaa atcatcagaa actcgaagtt 3480 ggccagttgg gagtctttct tagaaagcct gaacggatcc caatcttctg ctcaactttg 3540 gggtagtatc aacgccctca gtggtaagag gaaaaccact caaatgtccc tttcaatcaa 3600 cggcgcttat atatcagatc cacctgtggt agctgcagct ctcggagact atttctcgca 3660 cctttcctcc agaagcaact ataacgattc cttcatgcga cgcgttcatc cctgctctgc 3720 atcgctgccc aatttccacg tccccaacga tccaagcaac gatttggtta acacggcttt 3780 ctcgctcaac gagctaaaat ttgctctcag ctgttgttct ggtaactcgg ccggcccaga 3840 taatgtcggt tatccactat tgaagaacct tcccgcgaca ggtgttataa aacttctaga 3900 attaattaac caatcatggc tctccgacaa atacccagat gaatggcacg aaagtttagt 3960 aattcccatc cctaaagcca actgctgttc taaagatcca actaaatacc gtcccattgc 4020 ccttacatgt tgtctctcca aggtgatgga aagaatggta aatcggcggc tcaaacagaa 4080 gcttgaatcc gatagacggc ttgaccaaag gcagcatgca ttccgatcag gttttggtac 4140 taacacgtac tttgcagcat tgggcgatgt gcttcataaa gcgaatgctg aagggctaca 4200 cacggaagtg atatctctag atatttcaaa ggcttttaat tgtacctgga caccgctagt 4260 ccttcaacaa cttgttgact gggggctatc agggcacata gtgcactttt gcaaaaattt 4320 ccttgtcaat cgctttttcc gtgttgcaat cggagatact acctcggagt cttacccaga 4380 agaaactggt gtcccccaag gatctgttat tgctgtgacc ctcttcctcg tagcgatgaa 4440 cagtgttttc tccgttattc cccgaggtgt ccatatcttc gtgtatgctg acgatatatt 4500 gttagtcgta tctggcagca cccgtggtcg caccaggatt cgggcccaag ctgctgttag 4560 ttcagtcgta aaatgggcga catcagttgg tttcagcctc tccgccaaca aaagtgtacg 4620 ctgccacgtc tgtaagtata ggcatcaata cgacagaacg ccaatccggg tggatggtaa 4680 ccctattccc tccaagaaga cagtgaaagt actcgggatc ataattgatc gacacctgtc 4740 ttttaagccg cattttacca acgttaagaa taactgccaa acaagattga atctacttcg 4800 gacaatctcc cgtccacatc gttccaataa tcgtgggatt cgattccgag tagctgaagc 4860 cattgttgat agccgtctcc tgtatggctt ggagctcacc tgcatcgcct acaaccagct 4920 ggtagaaata ttagcgccca tatacaacag ctacattcgt actatttccg gactcctccc 4980 gtctacgcca gccgactctg catgtgtcga ggcgggtcga ccaccttttc gccatttcat 5040 aaccaaagca atttgcacta aggctgcagt ccatgctgca aaaacctccg ggcgccgtaa 5100 aatgtttctc cttgaagaag gaaataagat tttccgtagg gtggccaata gaaatctccc 5160 ccctgtggct aagtaccact ggtacggaga caatagttgg cataaagaaa caccaaaaat 5220 cgatgacgaa atcaagcgtc gatttagagc gggtgacaac tcgcaaacgc ttcgaatgtc 5280 agtgcttgaa tggttgcaga caaaatattc aggctacgag caccgttaca cagatggctc 5340 cctctcccag gtaggagtag gtattggaat catcggacat tctctggaaa tcagcaaaag 5400 tcttcaaccg tggtactcaa tcttctcggc ggaagctgtg gcggttttta ttgctgctac 5460 tactgtcagc gtgcgaccaa ttttagtgtt aacggactcc gccagtgtca tctctgccct 5520 tcaatctgat accccacagc acccttggat acaaggaatt attaaaaatt caccacacag 5580 cactgtgttt gcctggatac ccggccattg cggaatccct ggaaacgttg cagcggatcg 5640 actcgctgga atcggtcacg ccgaacggag gtatagctcc acagccccgc tggatgatgt 5700 caaaagacac atcaggaaaa aatttaggga ccactggaat gtcgaatggg ctagctccca 5760 ctcatcatat atccgaaaaa tcaaacaaga tacgtctgcc tgggatgatc gtaaatcctt 5820 acgtgaacaa cgggttgtct caagattgcg tactggacac acgcgtctct ctcacaactt 5880 cgatggagct gatttcaacg ttatctgctc aacgtgcaac gtgagaaaca ctgttgagca 5940 cttcctatgt gtctgtcccc agtatgagtt ctcacgtcaa acatatggac tctccagcag 6000 tatccgggaa attctaagtg acgatgcatc cgctacaacc tccctgatct gtttcctcaa 6060 agatgctggc ctcttctata ggatctaact gtgggcagcc cgagatcatt ttcgttgtta 6120 tagtggcctc accacaaatt aatagggcaa atcttgtgca aacactttgg atttggaaca 6180 ttagcgcaaa taggtatact agaataacaa attataacac agtgaagtaa tccgggaacg 6240 tcagtaactt taagagatta gctacgaaac ctttggcgag accccacggt tgggctcgaa 6300 gacggaccct tatggttcgc cttactttcc cccgttacac gagtgccctt caggcacctc 6360 atgtggcagt gatgaactag ccaaacgagt taaaaatcac tttaataaag aaaaaaaaaa 6420 aaa 6423 // ID hAT-5_HM repbase; DNA; INV; 4303 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4303 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1994-1994 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(931..1263,1358..1987,2173..3333,3363..3896) FT /product="hAT-5_HM_1p" FT /translation="MSGFKKKDSGCQVRKNAAQRTANEEKNKRTLQDCGIT FT VTKKGEDESPTASNSSLPSQSIQVVNSFPRERIFSHLINSMLYKFLTYFYR FT LELKTMSLTFLKARESPYQKSERNLYSQVLPTEEPIATIPSNDPALWAAHL FT SKVERDSVLLQGLPRNPSAFPKDSNKKKVPESIFYETSLNGEKTCRDWLVW FT SVSKKSFICFPCSLFGSKQSFGIGHQSHLLRWNDGISCNWHKLPEKVKSHQ FT NNAQHRNFYIEWKTALESLENQSGIDAALENSIRNEAARWREILRCILDVT FT LFLASRNLSFRGKSIYHYSIVISTIFKSIFLGSSKMIGDDDNGNFLATLEL FT LAKHNKTLQLHLEEVSRCQQEGNKMNAHYLGWSTQNEFIKECGGIVHGAII FT NEAHMAIYYSILVDGTPDVSHTEQIAFVLRFVYFGTDKRWTVKERFLRVEN FT LEKKIGADIAKLIMDVLEQNGIDLKNCRGQGYDNGANMSGIYKGVQAIILQ FT KNPQALYMPCSAHSLNLAGVHSAESSVEVKNYFGRVQSLYNLFSGSPSRWK FT VLIETTGLSLHQTSQTRWSARIEAVKPLVKRPREILESLKKLRDFDLTADQ FT LNEVKSLEKWVHSFEFIVMTTFWYKTLQSINYVSLALQSENISLDDEMKLI FT KTLIEDLNRLRSSWTSILNEARLIASGLASFGFQSEFVKKRTKKEEDLSRR FT GEEHRSFPAYSIPLLILLFSRSAIDSKLLRRRRICFRSLWSSKSLVMSNEE FT ESEESVEAPIQLEEKCKVLAQIYATDVEEEKLIEEVRHLDALKRSNLFGPK FT ESLTSMTLLNGIYQKGLQPLFESVCILLRIFNTIPVSVAEGERSFSKLALV FT KTALRSTMSQERLTNLLVISIEHDLAKKSVLR*" XX SQ Sequence 4303 BP; 1403 A; 782 C; 819 G; 1299 T; 0 other; cagggccgcc gtgagggggg ggggtctact gggttgtttt gtcccgggcg ccaggtcgac 60 aggggcgcac gaagccgaat ataaataatt ttttttttca tgtttattta aaaaaaagat 120 gaagtactgg gaagcagcgt ggaaaataag aaatcaaaag cagccggtca atttgaccag 180 ctaaaacata taatagacgg tcaattgctt tttttggccg gtcaattttt tttagtttaa 240 attttcaact ttgtaatgaa gtgcttcata attggacttt ttaaataaaa aaagaaatta 300 atgctttaaa actaagaaat tagcattact ttcaaaacat tagttttaaa taaagcgatt 360 ctttttatat agcggaaact cgcaaataaa gtgaatacta atgattaatt taacaacccg 420 ctgaacttca gcgggttgtt aaattaataa aaagttattt taaacattta cttcagcggc 480 tgaagtaaat gtttaaaata actttagtgc acggagattt tcattcaatt aaaaacggag 540 attttttact ttattaaaaa aagagattag attaaatcca agtagtttga tattgatata 600 aaatattaaa acaattttat taaatttatt tctctaattt ttaaaatctc tcaaatttga 660 aacaattttg aaacacgtga aaatatacac gtttgtgctg tgttctacaa taaataactc 720 atttaaaaag tcagatactg catatttatc gtgtatatca tcacttctta atgtaagtat 780 aaacaatttg agaatttaga ataactttgg catcataaaa cagtaaataa cgggtttttt 840 atcgatgtga tagtctgata gacccaatca agaagaaatt caaacagttc tttgcttatg 900 agcggatgct gtttcaaaaa gaattaaaac atgtccggtt ttaaaaagaa agattctggt 960 tgtcaagtaa gaaagaatgc tgctcaaaga actgcgaatg aggaaaagaa caaacgcact 1020 cttcaagact gtggtattac agttacaaaa aaaggagaag atgaaagtcc tactgcttca 1080 aactcttccc tgccaagtca atcgattcaa gtagtaaata gttttccccg cgaaaggata 1140 ttttcccatt tgattaattc aatgttatat aaatttttaa cttattttta taggttggaa 1200 ctgaaaacga tgtcactgac atttctgaaa gctcgggaga gcccctatca aaaatcggaa 1260 cggtagctgt gaatgaagag aatattttgg taagttctac ctctttaaaa gaaatgatgt 1320 taataaactt ttaagctaat aaaataagat tttttaaaat ctttattctc aggtgttgcc 1380 aacagaagaa cccatcgcta ccattccttc caatgatcct gctttgtggg cagcacatct 1440 ttccaaagtg gaaagagatt ctgtgcttct acaggggctt cctcgaaatc cttcagcttt 1500 cccgaaagat tctaataaga agaaggttcc tgaatcaatt ttctacgaaa cttctctcaa 1560 cggggagaag acatgccgag attggctggt ctggagtgta tctaagaaat catttatttg 1620 ctttccgtgt tctctgtttg gaagcaaaca atcttttggg atcggacacc agtcgcacct 1680 tctaagatgg aatgatggaa taagctgcaa ctggcacaag ctacctgaga aagtcaaaag 1740 tcaccagaat aacgcccagc atcgaaactt ttacatagaa tggaaaacgg cgctagaaag 1800 cttagaaaat caaagcggaa tagatgcagc tcttgaaaac tcgataagaa atgaagcagc 1860 caggtggcgt gaaattctac gatgcatctt agatgttact ctttttttag cgtcacgaaa 1920 ccttagtttc agaggtaaat ctatttacca ttattctatt gtgatatcga ctatctttaa 1980 atctatttag cttatattta ttagattaac tagatatttt ggaaatcatt aaataaaaca 2040 aatccttctt tattctttta aataaatatg ttataatatt cataaatcat aatataaaat 2100 catcatagaa tcatcataaa ataaaacaaa tgcttcttta ttgttttaaa taaatctgtt 2160 ttattattat aatttttagg ttcatcaaaa atgattggag atgacgacaa tggcaacttt 2220 ctagctaccc tagagctctt ggccaagcac aacaagactc tccaactaca cttagaagaa 2280 gtttcccgct gccaacaaga aggtaacaaa atgaatgccc attacttggg ctggagcact 2340 caaaatgaat tcatcaaaga gtgcggagga atcgttcacg gtgccatcat caacgaagct 2400 catatggcca tctactattc cattcttgtg gacgggactc cggacgtctc tcacaccgag 2460 caaatcgcct ttgttctccg ctttgtatac tttggtactg acaaaagatg gaccgtgaag 2520 gagcgctttc tgagggtcga gaatcttgaa aaaaagatag gtgctgacat tgccaagctt 2580 attatggacg tcttagaaca aaatggaatc gatctcaaaa actgcagagg tcaaggatat 2640 gacaatggag cgaacatgtc cggtatatac aaaggagtac aggcgattat actgcaaaaa 2700 aatcctcaag ctctctatat gccatgcagc gctcatagtc tcaaccttgc tggtgttcat 2760 tctgctgaat cttcggttga agtcaagaac tactttggcc gagtccagtc actttacaat 2820 cttttcagtg gaagccctag tcggtggaaa gtcttgattg aaaccactgg tttgtccctt 2880 catcaaacgt cgcaaaccag atggagtgcg cgaatcgagg ccgtgaagcc gttagtcaag 2940 cgacccaggg aaatcctcga atctttgaaa aagctccgcg attttgatct aacagctgat 3000 caattgaatg aagtaaaatc tctggagaag tgggttcatt cattcgagtt catcgtaatg 3060 acaacctttt ggtataaaac tcttcaatca atcaactacg tgagccttgc actccaatca 3120 gaaaatatct ccttggacga cgagatgaaa ctcattaaga ctcttattga agatctaaat 3180 cgactgagat catcttggac cagcatcctt aatgaggcac gtttgatagc atctggtctt 3240 gcttcttttg gttttcaatc agaatttgtg aagaagagga cgaaaaaaga ggaagacctt 3300 tcacgaagag gcgaggaaca ccgctcattt ccatgaagac gaagcaaaag agtttgaggt 3360 gagcgtattc aataccgctc ttgatactct tattcagcag gtcagcgata gattccaagc 3420 tgctgagaag acgacgaata tgttttcgtt ctctgtggtc atcgaaatct cttgttatgt 3480 caaatgaaga agaatcagaa gaaagtgttg aagcacctat ccaattggag gagaaatgca 3540 aggtcttggc acaaatctac gcgaccgatg ttgaggaaga gaaacttatt gaagaggtcc 3600 gccacctcga tgctctgaag cgatccaatc tttttggtcc aaaagaatct ctcacttcca 3660 tgacgttatt gaatggaata taccagaaag gtcttcagcc gctcttcgaa tcagtctgca 3720 ttttgctgcg catcttcaac accatccctg tttccgttgc ggaaggcgag agatctttta 3780 gtaagcttgc tctagttaag acagctttaa ggtctacaat gagccaagaa cgactcacga 3840 atcttctggt tatttccatt gagcatgatc ttgccaaaaa gtctgtgcta cggtgaagtg 3900 atttccaaat tcgccatgag taaagctcga aagatcaatt tcctctgaag cattgaaaca 3960 acaatagctg ttgtattcta gacaaattta cagttgctct taacatccaa tagtgtttaa 4020 tgattgtctt tggtctgttt ctaccatgtt ctagtgtcga ctttatcatt ttatgtatga 4080 gtgtataacc aagtcaaaag accattgcaa ttatctagaa ataaactgtt caaatgttca 4140 ttaattaaaa ttaaaaaatt ctatattctt ttatcctctt tgatttgtta tcaaatggct 4200 tattatatgt attgaattat ggatcattag atcagctttc actttcagca gggcgcttgg 4260 gaaattttga ccccgggcgc cgcaaaggct cacggcgggc ctg 4303 // ID Gypsy-61_AA-I repbase; DNA; INV; 5319 BP. XX AC AAGE02020262; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-61_AA_; KW Gypsy-61_AA-LTR; Gypsy-61_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5319 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020262; Positions 27560 22242. XX CC Positions [4063-4533] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 871..2502 FT /product="Gypsy-61_AA-I_1p" FT /translation="MSALIGSIDHYVRGSSFASYMKRMNILYQLNNVTDNN FT KKNLFLALSGSVIFDEIELIYPGIDIADIDYGDMISKLKERLDKVQPNMMH FT RHKFHARVQGVDEPAENYVLALKLLASHCGFGAHREEAIKDKIVFGLRDQD FT LKHKLLMKDDMTLEEVEQMVIRTELAKCRAKELEEKNEDPREVNSVKYRLG FT SQSNFNDMNRDGRQGSSNYSRRFSNRSRSTSRDRERTRFNHNRYDYYKRQR FT EEYQPRNSYSDYRGQGDGNPHQNVICNFCKMRGHIKRNCYKLKNRKNVNFV FT EADPVEINSYDFKRLQIRDSENEDDDYPCMMIASNRFSEPCLVKVIVEGIE FT LSMEIDCGAAVTVVSLSVYRMYFSHIKVSKCSSRLVVVDGQQLSIFGEISV FT KVMVNEIQHQMTLIILDCTRRFVPLFGRNWMDVFYPSWRKTFGNTMRINAM FT NKDADNSLKLDMGNIESDIKSRFPKVFDGDFSNPITGYEADLVLREDRPIF FT RKAYDVPFKLREKVVGHLDSLEKQNVITPLQVSEWASPVVIVPKKIMT" FT CDS 2508..3878 FT /product="Gypsy-61_AA-I_2p" FT /translation="MVIDCKVSKNKQIIPNTYPLPLAQDIFASLAGCKWFC FT CLDLAGAYTQLKLSERSKQFVVINTIKGLYTYNRLPQGASSSAAIFQRVMD FT QILIGLKHVRCYLDDVLIAGETKEECLSNLYLVLERLQKANLKVNFKKCNF FT FLNSLTYLGHLVTEKGLLPSPEKLLTIEKAKVPADTTELKAFLGLINFYGK FT FVPHLSAKLNCLYALLRKDTKFVWNEKCQQTFEDSKKALLTANFLEFYDPA FT KPIIVVSDACSYGLGGVIAHMVDGKEKPISFASFSLNPAQKTYPILHLEAL FT ALVCTVKKFHKFLFGQKLTIYTDHKPLLGIFGKNGKHSLCVTRLQRYVMEM FT SICEFDICYRPSANMGNADFCSRFPLEQGVPKKLDSGCIKSINYFGDFPLD FT YSLIAKETKTDNFLSRVAICIENGWPNKTDNDLRAINIFFAIRSRGSGRLC FT HIPGSSFHTRSS" XX SQ Sequence 5319 BP; 1788 A; 811 C; 1172 G; 1548 T; 0 other; ttataagtgg acgacgagta gaaaactttt ttgatttttg tttcaaagtt tcattacaga 60 aagcattgaa gctttttttt ttccgtatcg gtcagtgaaa gcaggaagcg tgcattagtg 120 gaacgtgcag cgaagataat tgtcgtttac aaccatagta tcaggcagta tggaaagtca 180 gtgagctggg tggaattgta gttaaagcag ctggcataag gaaaagttaa aaaaaactgt 240 tgcgaaaaaa aaagtgtaaa gaaaaattgg acggaagctg tgccacatgg tctatattca 300 gcaaaatagc cattgcaatt tttcattgtc ccgcttggtc cgccatttca aggtgtagaa 360 gtggagttga gctttaattg aattgcttta agagagagtg agcgaaggaa ctcaaaggta 420 ctgtataaga aacataagtg atattgagtg gttgaaatca gacggcggtt ttagctgagt 480 gtagcgactg tcgaaacaac acagatttgt tctggcagtt gtgagcatag tcaatcgaga 540 gatcatcagt gatatagtgt ggttgagaga tctacagcaa agtgatagaa gagagtgaga 600 acaccaaaat aacataaaga gcttgcaggc agctgaatta gtgagaaaaa agagcgaatt 660 gggaagtcaa cggctgagag gactgccggc gagaccaaca ttggttagga cgagcatcgg 720 tgagacgaga gtcatttgaa taacattggt gagacgagct tcgtgagacg attcgattgt 780 ggtgagcaga aaagagtatt aagctgtgta tcaagtaagt cattttagac catttcatct 840 cattcaggtg tgtttatcga agtcgtcagc atgtctgcgt taatcggaag cattgatcac 900 tatgtgcggg gctcaagctt tgctagttat atgaaacgaa tgaatatcct ttaccagttg 960 aacaatgtta cagataataa caagaaaaac ctattcttgg cgctaagtgg gtcagttatt 1020 ttcgatgaaa tagaacttat ttacccggga atagacattg cggatattga ttatggtgac 1080 atgatcagta aactgaaaga aagattggac aaagttcagc ctaatatgat gcacaggcac 1140 aaattccatg cacgtgtgca gggtgttgat gagcctgctg aaaactatgt attagctttg 1200 aaattactcg ctagtcattg tggttttggt gcacacagag aagaagcaat caaggacaaa 1260 attgtttttg gtttaaggga tcaggatctg aagcacaagc ttttaatgaa agatgatatg 1320 actttagaag aggtagagca gatggtaatt cgaactgaat tagctaagtg ccgtgcaaaa 1380 gaactagaag aaaagaatga ggatccaaga gaggtgaact cagtgaagta ccgtttagga 1440 agtcagtcga atttcaatga catgaataga gatggaaggc aaggttcaag taactacagt 1500 aggcgtttta gcaatagaag tagaagtact agcagggatc gtgagcgtac gagattcaat 1560 cataatcgct atgattacta caaacggcaa cgcgaagaat accaaccaag aaacagctat 1620 tcagattatc gtggtcaagg agacggaaat ccacatcaaa atgtaatttg caatttttgt 1680 aaaatgagag gtcatataaa acgtaattgt tataaattga aaaatcgcaa aaatgtaaat 1740 tttgtggaag ctgatccggt tgagatcaat agctacgatt tcaagcggtt acagataaga 1800 gattcagaaa atgaagatga tgattatcct tgtatgatga tagcaagcaa tcgttttagc 1860 gagccgtgtc ttgtgaaggt tatagtagaa gggattgagc taagcatgga aatagattgc 1920 ggtgcggcag tcacagttgt tagtctatca gtatatagaa tgtatttcag ccacattaag 1980 gtttctaaat gcagtagtcg tttggtcgta gtagacgggc agcagttatc aatttttggc 2040 gaaatttcgg ttaaggtgat ggttaatgag attcagcatc agatgacttt aataatactt 2100 gattgcacaa gacgttttgt cccgttattc ggcagaaatt ggatggacgt tttttatcct 2160 agctggagga aaacttttgg aaataccatg agaatcaatg ctatgaacaa ggatgctgac 2220 aatagtttga agctagacat gggtaatatt gaatcggaca taaaatctcg ctttccaaaa 2280 gtttttgacg gtgatttttc aaaccctatc acaggatatg aagctgacct tgttctgaga 2340 gaggatagac ccatatttag aaaagcttac gacgtgccgt tcaaattgag agaaaaggtt 2400 gttggtcatc tggattcgtt agaaaagcag aacgtaatca caccattgca ggttagtgag 2460 tgggcgtctc ctgttgttat tgttccaaaa aagataatga cataagaatg gtaattgatt 2520 gtaaagtgtc caaaaacaaa cagatcattc ccaatacata cccgctacct ttagcacaag 2580 acatatttgc gtcattagca gggtgcaaat ggttttgctg ccttgattta gcaggcgcgt 2640 acactcaact caagctatcg gaaagatcaa agcaatttgt tgttataaac acaattaagg 2700 gattatacac ttataaccgg ttgccacagg gtgcatcttc tagtgctgcc atatttcaaa 2760 gggttatgga ccaaatttta ataggattga aacatgttcg ctgttactta gatgatgtac 2820 tgattgcggg agaaactaaa gaagaatgtt tatcaaattt gtatttggta cttgagagat 2880 tacaaaaagc taatctaaag gtgaatttta agaaatgtaa tttttttttg aattctctaa 2940 cgtatctcgg acatttggtt actgaaaaag ggcttcttcc atcgccagaa aaattattga 3000 caattgaaaa agctaaagtt cctgcagata ccactgagct caaagcattt ttagggttaa 3060 taaattttta cggaaaattt gttcctcact tatctgccaa attgaattgt ttatatgctt 3120 tattacgaaa agatactaag tttgtttgga acgagaagtg tcagcaaact tttgaagata 3180 gtaagaaagc tctattaact gcaaattttc tcgaatttta cgacccggca aaaccaatta 3240 ttgttgtatc tgatgcatgt agctatggat taggaggagt gatagctcat atggttgatg 3300 gtaaagaaaa acctataagt tttgcatcat tctctttaaa ccctgctcaa aaaacatacc 3360 caatcttgca tttagaggca ttagctctag tatgcactgt taaaaaattc cacaaatttc 3420 tttttggaca aaagttaaca atttatacag accacaagcc gcttttggga atttttggca 3480 aaaatggcaa acattcgctt tgtgttacaa gattgcaaag atatgtcatg gaaatgtcta 3540 tttgtgagtt cgacatttgt tacaggcctt ctgccaatat gggaaatgca gatttctgct 3600 caagatttcc tctagagcaa ggtgttccca agaaactgga tagtggttgc atcaagagta 3660 taaactattt tggagatttt cctttggatt attctttgat tgcaaaagaa actaaaacag 3720 ataattttct ttcaagagtg gcaatttgta ttgaaaatgg gtggcctaat aaaacagaca 3780 atgatttgcg agcaataaat attttctttg cgattcgatc tagaggtagt ggaaggttgt 3840 gtcatatacc aggatcgagt tttcataccc gttcctctta ggaatgatat tttgaagctg 3900 cttcattcca atcacaatgg gattgttaaa atgaaacaaa cagctcgccg atctttgttt 3960 tggttcgatt tgaataaaca catagaatta tttgtaaaac actgcgaacc atgtataaaa 4020 atgtcagtgg tacctaaacc ggtttgtagc acttcatgga ctccaacaaa tcggccattc 4080 agtcgtattc atgctgattt ctttttcttt gaatcaaaaa catatttttt agttgtggat 4140 agttatacaa aatggcttga agtggatata atgagatatg gaacagatgc aaacaaagtt 4200 attaagaagt tcacagctat ttttgctaga tttggactac cagacgttct agtgacagat 4260 ggagggccac cattcaattc tagtcatttt gttaaattta tggaacgtca gggtatacgg 4320 gtcatgaaaa gtcctccgta taatcctagt agcaacggac aagcagagag aatggttaga 4380 ttagttaaag atgtattaaa gaaattcctt ctggatcctt tgataaaatc acttgatgat 4440 gatgatagat taacttattt tttagcgaac tatagaaaca cttgttcttc gtcagatgaa 4500 agatttccaa cggagaaact tttgagctac agacctaaga ctttagttga cattttgaac 4560 cccagacata gttataaaga ttttctggta cgaaaagatt tgtccccatc aagtaaataa 4620 aaaaataaag acattttagc ttcaaataaa gagacaaaat atgacatccc gttgtccgaa 4680 gatccatttc tcaatttagt gctaggggat agggtttggt ataaaaataa tgacaaacat 4740 gcaattgaga aatgggtcga ggcaaaatac gtcaaaagag tatcacctaa tgtgtttgaa 4800 atctctttcg gcaggcacaa ttgcaatgca catcggaatc aactaaagat tgtgagtccg 4860 cgacaatcta ggtcgacaat acgactacca atacagcgac agaaacgacg tagggagtct 4920 attgactctg aggatgactt cttagggttt tcggacgaga cgaacgcagc taggaaggat 4980 cttgacaata cgaaacaaaa gcatgcaagg acaagcccca ttagaacgcg aagtttttcg 5040 cgatcgaaaa gaaagaagga tgaagaagaa aatcgaaata attgaaagcc ccagtagcaa 5100 tgaattgata ttgagtcctg ggatttgtat tcagaaatta caacttctag catgccaaca 5160 agaaaaaaaa aagatgtatt tcattgtcag acatttggcg ctcaaagtta taggatatta 5220 agttaagagc atgtttgtta tgaattcgaa taagattggt tgtcatcaat ttattataaa 5280 gtaattatga attaaggtca gttcttaagg ggaaagaat 5319 // ID Copia-21_AA-LTR repbase; DNA; INV; 196 BP. XX AC AAGE02020628; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_AA_; KW Copia-21_AA-I; Copia-21_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-196 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020628; Positions 29563 29758. XX SQ Sequence 196 BP; 58 A; 49 C; 29 G; 60 T; 0 other; tgaaaagtat tgtattaaca tagtagcaat cgccctaact ctaggtgtta ctatcatagc 60 aaccgccctc tgttattttt cctaacaagc aattccatcc caatactcca accagttaag 120 acgtgttatc aataaagcta gtttgttcgt tacgttacgt gttcgtgtct tcactacaaa 180 cgagccaaga ccctca 196 // ID Kolobok-22_HMa repbase; DNA; INV; 2375 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE Kolobok-type DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-22_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2375 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 793-793 (2010). XX DR [1] (Consensus) XX CC >95% identical tp consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 2375 BP; 803 A; 309 C; 371 G; 886 T; 6 other; ggtggtccta tactaaaaaa aatcaaaaaa atcgattttt caaaaatngc atattcttga 60 agcccaaaca ctctncattc caaatatata tagatcatca tagtgagtaa atttaaaaat 120 gctctaaaat agcaagtttc atagtgaagt taaataattt tcttagcaac ggccttagca 180 acgggttgtt tatattanaa aataggtgtt ttagagtgtt gcgccattta ctataagttt 240 ttgacactgg tgccagtaac tgtttcatgt ttatttttct aaataagcca tgacctttaa 300 gtacaattac tatgtttaca tatattactt atagaaaaac tatcagcctc gtgttatact 360 ttatttgctc cgtgcttgtt tttaatatgg gaaaaattaa atgtgattca agtttaagaa 420 ggaaagcagg tggtaacagg attttcaaaa aaaaaatact ttaaaggaaa tcaattttcc 480 tcagcatcaa caattagcat taatttaact tctcaatcgg aaatcactaa cttaactcaa 540 tcaaaaatct tgacatcaac atcttcaaaa aagttatcac atagcagtaa aaatagtaat 600 gcaaaaatta atgattcaaa taaattgcct tcatgcttta tacttattga cacagacatt 660 ttaaaatcaa ttgttacttt gattggacat tgtccttaat gccataacaa ttcaattaaa 720 atctcaactg atttatcaaa aaagaaaggc ttggcagtgt ttttaaacat ttcttgtgca 780 acaccttcgt gcgattggaa acagtctttt tattctagca aagagattga aaagcaaggt 840 ccaggaatgt cactatttga aataaattgt agatttgttg ttgcaatgag agagatagga 900 aaaagtcatt ctggtttgga aaaattttgt ggtttaatga acttacctcc tccaatgaac 960 ataaaagcat ttaatgatgt tcaagataaa attcanatca tatatcaaat tgtagctgat 1020 gtgaaaatga aaaatgcagc aaatgaattt cgatttctta acaacgatat ccaaccaaat 1080 gattttgata ttaatgtaat agctgatgtt gttgtttcct gtgatggaac ttggcaaaaa 1140 aggggtcatt cctcacttaa tggtgttgtt actgttattg ctagtgactc agggaaatgt 1200 gttgactanc gagttctttc aaaaatatgc aatgcctgta catcatggga atcaaaaaaa 1260 gatagcgatc caaaacttta tgaaaatttt ttggaaaccc atgattgttt aaataatcac 1320 gaaggttctg ttggatctat ggaagtttct ggaattgtag attgttttat gaaatctata 1380 gtcaaaaggt atattaaaca tatttgagta tttaaatgtt acaccaggta catttgcctc 1440 agtactgaac caaaatagac caagaaagaa taaagatgat ggaaaaaaag ttgcttaaca 1500 caataaaaca aaaaagaaag ctgcttcgtg cagagcgaaa aggttttaca gatatagtaa 1560 atgaaagtga aggtgtatct tatgaagcag gagtttttta gatantagtt ttttgttact 1620 tttttaatta aatttatgtt aaggcagtta ttgtattttt taacgtagtt attgtttgtt 1680 atctttttat attcaagctt tttttcattg attgaataat tctattttgg ttaaattttt 1740 ttcaattatt agagttttca catttttctt ttttctgtgt ttttagtaat gctcaagtcg 1800 ttttgttttc aataattttt tgaccttttt atatattaat tttttttttt ttatattaaa 1860 aatctttatt attgtttttc tcaatatatg agttttttgc acgctgcgat aaatatcttg 1920 agattggttt gtttgattgt gttgaaattt tcaggagttg tttattatat atatgcacat 1980 tcaccttacc aaaagaatta atgttagtta actggttcta aaaataaggt ctaggttcta 2040 agccttttcg gggctttttt taggccttaa actggttttc ttacaaaagc ttcaatagtt 2100 tgttaaattg tttgattttg gcagggtgag tgtaatcatc tgatattaaa aagctgtgaa 2160 aatttcatga tcataccatt attagtttac aggctgctta tcgcagcgcg tagtccatat 2220 ttagggtcca tggacccaaa attttgacca aaaaaaaatt tttttaattt tttttttctt 2280 tactgtgctt taaattcaaa atactatttc ttatatgaaa atcataaaat atgaatgcat 2340 ttttaaaatt ttggttttat tagtatagga ccacc 2375 // ID Gypsy-35_NVi-LTR repbase; DNA; INV; 228 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-35_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-228 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1383-1383 (2009). XX DR [1] (Consensus) XX SQ Sequence 228 BP; 61 A; 43 C; 52 G; 71 T; 1 other; tgttgtgacc gagcctgaac gttgcgcgat tcgaacacca agaatcgcgc atagcgaccc 60 ttagcaacga gataaacgac gtcgcggatg ttattgttta tgtttagtct ggttctgtca 120 gctgttgaga acagttgtgt ttacgatttg aataaagagt ccgagttcaa gagtgatttt 180 ctataaataa cgtgttgtta tttttattac ccgaacccgr tcataaca 228 // ID Gypsy-21_DPu-I repbase; DNA; INV; 4916 BP. XX AC scaffold_12; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_DP_; KW Gypsy-21_DPu-LTR; Gypsy-21_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4916 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_12; Positions 1366390 1361475. XX CC 'TGGAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 677..4651 FT /product="Gypsy-21_DPu-I_1p" FT /translation="MVSSCNFGTAAVIESVLRDQIVFGVASEHVREKLLFE FT TDLKLAGACNIVRACESASSKLTQMAPRGESTVHRLHDSQPKGKQGMSSYN FT KSRQPGGNMQQYVNCQDCGRRHRKDQCSAAKVMCFSCQQVGHFANRCPNGR FT SQQGTSHPRKMAPPPAPPTDSRQQTMRPAQRGTFMQQQLHAVEEEDFDGQL FT TGANGFLGEDYVTHQLTCTEEKVDEWYEDMAVDGKATIRFKLDSGATCNVL FT PYELYASVCPNGAPLEPGPRVRNYSANGGYLNVLGVYKGQVVRRGIAYVLR FT FVVVNEPGQPAILGLPACKLMKLIKRVHSITVSQPQLQPPIVKEFADVFNG FT IGKLPIEHEIRLTTGPSHVDPVVSAAGRIPFSLEKKVFDKLDQMVADNIIA FT PVVEPTEWVSRMLVVGKPDGDVRICLDPSDLNKAIQRQHFMVPTVEQLFGK FT IGKAKYFCSLDAASGFYQIPLSNRSSYLCTMATPKGRYRFLRLPFGLVSAP FT EVYLQAMSELFGDLLGVLIYFDDFLVMGETMEELECNLRRVLVRCREKNLK FT LQLKKCQFFVQSLPWLGHVIGNGSLKPDPEKVEAIVKMPAPTDKNGLIRLL FT GMVTYLDKFCKDLAVLTRPLRDMLKQDAAWVWDAQQEQALSAFKSAISSLL FT VLRLFDVYKPLVVSVDASPIGIGAVLLQDGQPVAFSSTSLTETQKRYCQIE FT KELLAVQFGLLRFRQYVYGQKVTVESDHKPLVGLLEKPIATCSPRIQRMRL FT QLQRFDFRLVYKPGKELFIADTLSRAPSPRLFTDDVTQDSEDQVHHVLHSL FT VTSVSTRKRYAEATALDPTLQLLKTVIQKGWPEKRAQCPAAVKPYWSVRSE FT LSMVEGILLCGSRLVVPMSLRRETMEGIHDGHFGETKSVLRAKSAVYWPGW FT EDQVKNMVASCSVCQENRGRNPKLPLHPVRLPDYAFQLVSADLFEFERVNY FT ILLVDSYSKWPCVVPLKSTTSSAIIEEMSRFFCDFGRPEELESDNGTQFSS FT AELREYCASLNIKQVTSSPEFAQSNGLVERHIQTVKRTLLKMVAEGKSLWE FT ALAAIRSTPVSGSLPAPSVLLQGRNLRGVLPFLDASLSPKLVPASFVRQEL FT SRRQQTAAFVQPRPVSVRSSALTVGQRVRALIKGTWQVGVVNVVCPEPHSY FT IVRLIDGRMFRRTRWAINVDNANRPTAVQRTMQPSRPQFSRGPVVVPLTQP FT PALVQPSGSSAVAVQSAVAGTARPVAGVNTPSGQSSVFQSSAQQESTPVRS FT VEAPSRPVREIPASPARLFVSRIPVRDRVVWLPPSTSSGQHAPVAALGVTR FT SGRRYTKPPPSSQ" XX SQ Sequence 4916 BP; 1105 A; 1146 C; 1303 G; 1362 T; 0 other; tggtgtcaga agtgctcgag taatttttca atttctcgcg tgtggtgtgt ttcttgatgt 60 ttcacactga gttggaactg gtaattgtcg ttccctagcc ctggttgtgg ggtgattcgt 120 ggtaattacg tggtaattac gtctctgtcg tgccaacgct atccgtttgt atctcaagca 180 tccgttctcg cacgtggttc aactaaactg tatccaggcc aatcgagggc atccgtcctc 240 tctcgtggtt gatttcaaca ttttctgttg cgtatgtatc ctggttatcg tttatttgtg 300 tttccgacgt ttgaaattgt gcagccatgg cccaaggctt gaagttccca gattcgtttg 360 cgttcgtaat ttggccttgg agtgggcgca atggcgtcgt cagttcgagt ggtatattaa 420 agccacgcgt aaggacgagg acgatgaaga agttttgatt ggagtgttgt tgtccctgct 480 tggaagagag ggagtcaaga tttatgagac gttaccattg acagcggcaa atgcaaagaa 540 gattgctgaa gtgttgactg catttacaac gtattttgag cccctaaaaa gcgaggtgtt 600 cgatcgtttt ctgtttcatc gtcgtgtcca gcagccgggt gaatctttcg acacgtggct 660 ggtggagtta cgcagtatgg tatcgtcatg taattttggt acggccgcgg ttattgaatc 720 tgtattacga gaccagattg ttttcggcgt ggccagtgaa catgtgcgcg aaaagctttt 780 atttgaaact gacttaaagt tagccggggc ctgcaacatt gttcgcgcgt gtgaatcagc 840 ttcgtcaaaa ctgacccaga tggcgccacg aggagagtct actgttcatc gcctacatga 900 cagccagcca aaaggaaagc agggcatgag ctcttacaac aagtcgcgtc aacctggcgg 960 gaacatgcag cagtatgtta attgtcagga ttgtggcaga cgtcatcgaa aagatcaatg 1020 ttctgcagca aaagtgatgt gcttctcgtg tcaacaagtc ggccattttg caaataggtg 1080 tccgaatggg agatctcaac aaggtacgtc acaccctcgc aagatggcgc caccacctgc 1140 acctccgacg gattcccgac aacagacaat gcggcctgca caacgcggaa cttttatgca 1200 acagcagttg cacgccgtcg aagaggaaga ttttgatggc cagttgacag gagccaacgg 1260 attcttggga gaagattacg tcactcatca actcacctgc acagaagaaa aagtggacga 1320 gtggtatgaa gatatggcgg ttgatggaaa ggcaactatt cgttttaaac ttgactcggg 1380 tgctacgtgt aatgtgttgc catatgaatt gtatgcgagt gtgtgcccga atggtgctcc 1440 tttggaacct ggtcctcgag taagaaacta cagcgcaaat ggtggttatc tgaatgttct 1500 gggcgtgtac aagggacaag tggttcgtcg tggaatagcc tatgtgcttc gatttgtggt 1560 ggttaatgaa cctggtcaac cagccatctt agggcttccg gcgtgcaagt taatgaagct 1620 catcaagcgc gttcattcca tcaccgtgtc gcagccacag ctacagccgc caatcgtgaa 1680 agagtttgca gacgtgttca acggcattgg aaagcttccc attgaacatg aaatacgtct 1740 gacaactggc cctagtcatg tggatccggt ggtgtcggcg gcgggtcgta ttccgtttag 1800 tttggagaaa aaggtgttcg acaaactgga tcagatggtt gctgacaaca tcattgctcc 1860 agtagtcgaa ccgacggagt gggtgagcag aatgctggtg gttggaaaac ctgatggaga 1920 cgtccggatc tgtcttgatc catctgactt gaacaaggct atacagcggc aacacttcat 1980 ggtgccgacg gtggaacagc tgtttggaaa gattggtaag gcaaagtatt tttgtagcct 2040 cgacgctgca tctgggtttt accaaatacc cctgtctaac cgctcttcat atttgtgcac 2100 catggcaaca ccaaaaggaa gatatcgttt cctgcggctc ccgtttggac tcgtctcggc 2160 tcccgaagtt taccttcagg ccatgtcgga gctgttcggc gatctgctcg gagtgctgat 2220 ttactttgat gactttttgg tgatgggaga aacaatggag gaactggagt gtaatttgcg 2280 ccgagtgctg gtacgttgcc gagaaaagaa tttaaagttg cagttaaaga aatgtcagtt 2340 ctttgttcag agtcttccct ggttgggcca tgtgatcgga aatggatctt tgaagcctga 2400 tcctgaaaaa gtggaagcta ttgtgaagat gccggctcct acggacaaga atggcttgat 2460 acgcttactt gggatggtaa cgtacttgga caaattttgt aaagacttgg ctgttctaac 2520 gcgtcctctt cgtgacatgt taaagcagga tgcggcatgg gtgtgggacg cccaacaaga 2580 acaagcctta agtgccttca agtccgccat ttcctcattg ctggttttac gtttgtttga 2640 tgtttacaaa ccgttggtgg tatcagtgga tgcttcgccc atcggcattg gcgcagtgtt 2700 gttgcaagat ggccagccgg tggctttttc gtccacgtca ctgaccgaga ctcaaaaacg 2760 ttactgtcaa atagaaaagg agctgttggc ggttcagttt ggtttgctgc gattccggca 2820 gtatgtttat ggacaaaaag tcacagttga gtcggaccac aaaccgcttg ttggtctgct 2880 ggaaaagccg atagcaactt gttccccaag gatccaacga atgcgacttc aactgcaaag 2940 gttcgacttt cggcttgtat ataagcccgg caaggaactt ttcatcgccg acaccttgag 3000 ccgtgcacct tcgccgcggt tgttcacgga tgacgtcacc caggatagtg aagaccaagt 3060 gcatcatgtg cttcatagtc tcgtcacgtc cgtgtccaca cggaagcgtt atgccgaagc 3120 cacggccttg gatccaactc tgcaactttt gaaaacggtg attcaaaaag gatggcctga 3180 gaagcgtgct cagtgtccgg ctgcagtcaa accgtattgg tcggtgcgga gcgaattgtc 3240 aatggtggaa ggcattttat tgtgtggcag tcgtttagtg gttccaatgt cccttcgtcg 3300 cgagaccatg gagggtatac acgacggtca tttcggtgaa acgaagtctg ttttacgcgc 3360 gaagtcagca gtgtactggc ccggttggga ggaccaagtg aaaaatatgg tggccagttg 3420 ttcggtttgc caagaaaacc gtggtcgtaa ccctaagctg ccgttacatc ccgtccggct 3480 tccggattac gcctttcagc tggtgtctgc tgatttgttc gagtttgagc gcgtgaacta 3540 tattttgctg gtggactcat acagcaaatg gccgtgtgtc gttcccctta agtcaacaac 3600 gtcgtcagcc atcattgaag agatgtcacg atttttctgt gactttggac ggccagagga 3660 gttggagtct gacaacggga ctcaattttc cagtgcggag ttacgtgaat attgtgcatc 3720 attgaacatc aagcaagtga cgtcgagtcc cgagtttgcc caatccaacg gactggttga 3780 gcgacacatc caaacggtaa aacgcacttt actgaagatg gttgctgaag gaaagtccct 3840 gtgggaggcg ctggcagcca ttcgttcgac cccggtgtcc ggctcattac cggccccgtc 3900 cgtcctactt caaggacgta atcttcgtgg tgtgctgcct tttctcgacg catccttatc 3960 tccaaagctg gtgccggcgt cgtttgtacg tcaagagttg tctcgtcgac aacaaacggc 4020 ggcgtttgtt caaccccgcc cagtgagtgt tcggtcgtct gcgttgactg taggacaacg 4080 tgttcgtgcc ttgatcaaag ggacgtggca agttggtgtc gtaaatgtgg tgtgcccaga 4140 accacactct tatatcgtcc gcttaattga tgggcgaatg tttcgtcgta cacgctgggc 4200 aatcaacgtc gataacgcga acagaccgac agccgttcag cggaccatgc aaccttcacg 4260 tccgcaattt tcacgtggcc cggttgtcgt tccactcact caaccgccgg ctttggttca 4320 gccgagtgga agttcagcag tggctgtcca gtcagccgtc gcagggacgg cccgtccagt 4380 cgctggtgtc aatactccgt ctggacaatc ttccgtgttc cagtcgtctg ctcaacaaga 4440 gtctacaccg gtccgttctg ttgaagcgcc cagccgccct gttcgagaga ttccagcctc 4500 acccgctcgt ttatttgttt cccgtattcc cgttcgtgac cgagtggttt ggttgccccc 4560 ttcgaccagc agtggtcaac atgctcctgt ggccgccctt ggtgttaccc gctcgggccg 4620 acgctacacc aagcctccac catcctctca gtaaatgacg caagtcgaca ttcaatcctc 4680 ccgttcgatt tatcttcatg taatttcagt tcattgtcgt gtcattattc atcgtttcac 4740 tactcatcgt tcatgcgttt attttcatct cacctactca ttgttcattc atcttcgttc 4800 acatcctcat tgatcattca tgttggttta ttttcacatt ctcgtcttac attttggttg 4860 cattcatttg tgttccgttt gggggctatt gttcgtatat ggttaagggg gggaga 4916 // ID Gypsy-12_IS-I repbase; DNA; INV; 4104 BP. XX AC ABJB010305691; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_IS_; KW Gypsy-12_IS-LTR; Gypsy-12_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4104 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010305691; Positions 9465 5362. XX CC Positions [3080-3595] - Integrase core CC 'GTTCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 65..4054 FT /product="Gypsy-12_IS-I_1p" FT /translation="MATGKPAGSGAGPVTVGVPSLRPPPQFDFGNPGAWRQ FT WRQQFEDYCYASGLYAASDEVRVRTLLYCMGPKARDILSSLQVEEADFSEF FT NHIAEKFDGYFIHPANELYESARFHRRVQQAGESADSFFTVLGNFVKKCNY FT PSREVEERLIRDRFVVGLRDVKLSDRLCRNPRLTLEEALIQVRQAEDAERE FT RRIRDSTGASAASAEAKAISIDAARVTGNRPPKTARNVAVGNAGSSPCGFC FT GRDFHPRSECPASRALCSKCHKKGHFAAVCRSGTKLERHSKRLASVELGAV FT SSDGSTRAKFVEVKVDGMPLSFKVDSGAEVTVIPSTFSGIPARLEPPEGQL FT TGPAGQPLDVLGAFVATLEWKNKTSRQKLYVVRSQVVPLLGFPGIQALGVV FT KFVEPVAAVQQGRNDTSPTGLFQGLGELKEEYIIRLQPDAIPFSLHVPRRV FT PIPLRDVVKTELDKMEAQGVIRRVSNPTPWCAGMVVVPKPSGAYRICVDLT FT RLNKVVLRERHILPTVDQVLGLLGEARVFSKLDATSSFHQVKLAEDCQELT FT TFITPFGRYCFRRLPFGITSAPEFFQRQMSYILEGQDGVVNMIDDVLVFGR FT DSAEHDRRLAEVLDRLARAGLTLNKAKCKFGVTSVGFLGVIVSGNGIAPDP FT QKVAAVQRMEPPVDVGGVRRILGMVNHVGRFLPHLSDVTAPIRELLNKRSV FT WLWGPSQQAAFQRLKDMISSDVCMANYHPEHPTIISADSSSFGLGAVLMQD FT QPTGERRAVAFASRALTPTEQRYSQTEKEALAVVWAVQRFDEYVRGLRFTV FT ETDHQPLTALLGEMDVDVLPPRIQRLRIKLMRYQYRIFYVPGKLLATADTL FT SRAPLTSAVEASSVELYVQNVIHSIQEGSPVSPEDVRRHQASDGECVTLEK FT FCNHGWPQRNKLPSHVMKYWNFRGELSVCEGLLLKGGRLVIPAALQPEVLQ FT LIHEGHQGVNRCKARARDSVWWPHVGQQIEAMVQTCDRCASTRTQRAEPLL FT PTPPVDLPWKQVGADLFHLDGQDFLLLVDYHSRYPEVVTLRSTSSQAVIAA FT VKSSMARFGIPEVLRSDNGPQFSSHDFASFARDYGFQHITSSPGYPQSNGE FT VERAVRTVKDLFRKSDDVFLALLSYRDTPGVSGFSPAQLLMGRRLRSRMPR FT AKEKLRPEVPALGAFRERDSAARRQQAVDFNCRHGVQVLRDLSPGEEVWVT FT DAQCSARVLNGAQRPRSYVVETQRGMLQRNRRHLVPYGAHAPGAPVPGEPV FT ADPPIPEEALAGLPAPPSETSAEPQQRRSAATSANPWVNGSSTSGDFRQHE FT HRYTRSGRRVIPPVRLDL" XX SQ Sequence 4104 BP; 872 A; 1121 C; 1272 G; 839 T; 0 other; tggtgtcagg agtggttcgc taggctgcct cctacggcag cggccgcggc gaagttcgga 60 gaccatggct acaggcaagc cggcggggtc cggggcggga cccgtaacag tgggcgtgcc 120 aagcctacgc cccccgccgc agtttgactt cggtaacccc ggagcatggc gacagtggcg 180 gcagcaattt gaagactact gctacgcatc aggcctctac gccgccagtg acgaagttcg 240 tgtgcggaca ctgctttact gcatgggacc caaggcacgc gacatcctga gctcgctgca 300 agtggaagag gcggacttca gcgaattcaa ccacatcgcc gaaaagttcg acggctattt 360 tatccatccc gccaacgaac tgtacgagag tgcacgcttt catcgccgcg tccaacaagc 420 cggtgagtcg gccgactctt ttttcacggt tttgggaaac ttcgtaaaaa aatgcaacta 480 tccttcgcgg gaagtggagg agcgtttgat acgggaccgc ttcgtggtcg gtcttcgaga 540 tgtcaagttg tcagacaggc tttgtcgcaa tccgagactg acactcgaag aagccctgat 600 tcaagtacgt caggctgaag atgccgagag ggagcggagg attcgcgatt ctaccggagc 660 ctcggcggca tctgccgaag cgaaggcgat aagcatcgac gcggcgagag tcacggggaa 720 ccgcccacca aaaacagcgc gcaacgttgc cgtagggaat gcgggttctt cgccgtgcgg 780 gttctgcgga cgagactttc atccgcgttc cgagtgtcct gcaagccgag cactctgcag 840 caaatgccac aagaagggcc acttcgcagc ggtatgtcgc tccggaacaa agctcgagag 900 gcacagcaag aggctcgcgt cggttgagct gggtgccgtc agttcggacg gcagcactcg 960 ggcgaagttc gtcgaggtga aagttgacgg gatgccacta agtttcaagg tcgacagcgg 1020 ggccgaagtc acggtcatcc ccagcacatt ttctgggatt ccggcgcggc tcgagccgcc 1080 ggagggacag ctgaccggtc cagcgggcca gcctttggat gttctcgggg cgttcgtcgc 1140 cacgcttgaa tggaagaaca agacgagcag gcagaagctt tatgtggtcc ggtcccaggt 1200 tgttccgctc ctgggatttc cgggtatcca ggcgctcggc gtggtcaagt tcgtcgagcc 1260 ggtggcagcc gtccagcaag gaaggaacga tacttcgccg acggggttgt tccaagggtt 1320 gggtgagctc aaagaagagt acatcatccg ccttcagccc gatgccattc ctttctcgct 1380 tcacgttcca agaagagttc ctatcccctt gcgcgacgtt gtgaagacgg agttggacaa 1440 aatggaagcc cagggcgtca tccggcgggt gtcgaatcct acgccgtggt gcgcgggaat 1500 ggtcgtcgtg cccaagccgt ccggcgccta ccgcatctgc gtcgacctca cgaggctgaa 1560 caaggtggtg cttcgagagc gccatatcct tccgacggta gaccaggtct tgggacttct 1620 tggtgaagct cgagtttttt cgaagctgga cgctacatcc agtttccatc aagtgaagtt 1680 ggccgaagac tgccaggaac tcacgacgtt tataaccccg ttcgggcgtt actgtttccg 1740 ccggttgccc ttcgggatta cgtcggcgcc agagtttttc cagcgccaaa tgagttacat 1800 cctcgagggt caagatggcg tcgtcaacat gatagacgat gtcctcgtct tcggccggga 1860 tagcgcggaa cacgatcgcc gactggcgga agtcttggat cgtctggcga gggcagggct 1920 gactttgaac aaggccaaat gcaagtttgg cgtcacaagc gtcggtttcc ttggcgtgat 1980 cgtgagcgga aacggaattg caccagatcc gcagaaggtg gcagccgtgc agcgtatgga 2040 accgccggtc gacgtcgggg gcgtgcgccg gatcttgggc atggtcaacc atgtggggag 2100 gtttctgcca catctctccg acgttacagc accaatccgc gagctactga acaagcggag 2160 tgtttggctc tggggtccaa gccagcaagc ggcattccag agactgaaag acatgatttc 2220 ttcagacgtc tgcatggcca actaccaccc agaacaccca accatcatat ctgcggattc 2280 cagctcgttt ggacttgggg cagttttgat gcaggaccag ccaacgggtg aacgccgagc 2340 cgtggctttc gcatcccgtg cactcacgcc aaccgagcag aggtatagtc aaaccgaaaa 2400 agaggccctg gcggttgttt gggccgtaca acgcttcgac gagtacgtca gaggtctgcg 2460 attcaccgtg gagacagacc accagccttt gacagcccta ctgggagaaa tggacgtgga 2520 cgttttgcct ccaaggatcc agcgcctgag gatcaagctc atgaggtacc agtacaggat 2580 cttttatgtt cccggaaagc tgttggctac ggccgacact ctgtccaggg ccccattgac 2640 atcagccgta gaagcgagct ccgtggagct ttacgtgcag aacgtcatcc attccatcca 2700 ggaaggttct ccagtcagcc ccgaagatgt ccgtcggcat caagcatcag acggcgagtg 2760 cgtaacgctg gagaagttct gcaaccacgg ctggcctcaa aggaacaagc ttcccagcca 2820 tgtcatgaag tactggaatt tccgtgggga gctgagcgtc tgcgagggtc tgcttctcaa 2880 gggtggccgt ctagtgatcc cagcagctct gcaacccgaa gttttacagt tgatacatga 2940 aggacaccag ggggtgaaca ggtgtaaggc gcgggcccga gactcggttt ggtggccaca 3000 tgttggacag cagatcgagg ccatggtgca aacgtgcgac cgctgtgcat ctacccggac 3060 gcaacgagcg gaacccctgc tgccaacgcc tccggtggac ctgccatgga aacaggtggg 3120 agcggatttg tttcatctgg acgggcagga ttttctttta cttgtggact accactctcg 3180 ttacccagaa gtggtaacat taagaagtac atcaagccag gctgtgattg cagcggtcaa 3240 gagttccatg gctcggttcg gtataccgga agtactaaga agcgacaacg ggccacagtt 3300 ttcttcacat gacttcgcaa gtttcgcaag agactatggt tttcagcata tcactagtag 3360 ccccggctac ccccaatcga atggggaggt ggagcgcgct gtgagaaccg ttaaggacct 3420 tttccgcaag agcgacgatg tctttttggc actgttgtca taccgggaca ccccaggcgt 3480 ttcaggtttc agcccggcgc aactgctgat ggggcggcgt ctgcgttctc ggatgccgag 3540 ggccaaggag aagctgcgtc ctgaggtgcc ggccctgggt gccttccgag agagggacag 3600 tgcggcgagg cggcagcaag cagtggactt caactgtcgc catggtgtgc aggtgcttcg 3660 agacctctcg cctggagagg aggtctgggt aaccgatgct cagtgctctg ctcgggtgct 3720 caacggtgcg cagcgtccac gttcctatgt cgtggaaact caacggggca tgctgcagcg 3780 aaaccggcga cacctggtgc cttacggtgc tcacgctcct ggggctccag ttcctgggga 3840 gccggttgct gatcctccga ttcccgagga ggctttggct ggtcttccag ccccgccttc 3900 agagacgtcg gctgagccgc aacagcgacg atcggcagcg acgagtgcaa acccatgggt 3960 gaacggaagt tcgacatcag gggacttcag gcagcacgag catcgctaca caagatcagg 4020 gcgacgggtt attccgcctg tgagactgga cttgtgattg ttgttggtct agctaagaga 4080 agggcatcct tctggaaggg ggga 4104 // ID Tx1-4_CQ repbase; DNA; INV; 4997 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4997 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 636-636 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 143..1339 FT /product="Tx1-4_CQ_1p" FT /translation="METRKNTLKLQFVTSAKVPSHLDVLKFMAKGVKIPAA FT DVHSVYKDENDQKFYVKFIDETSYNRFCNTVEDQYWFHYEDGSRTPVQLEL FT ASRQFKYVRLFNLPPETEEKEIAAALAKFGKIRQHVREKYPADLGYHVFSG FT IRGVYMEVEKEIPANLYIAHFRARVYYEGLKNKCFFCKAEGHMKVDCPKLA FT SLRSSAETGGQPSYSSVTANLKIATGSAKETASPLLNMTLIPVPAQRTKTK FT EQAQQEEAGVVPSTSTVQPPVQANPDLALPQAKPAEGQGEEQKEASPIDAI FT VANDPSDXSAEQTDAETDDETEKCNDGQGEAGGSASDAMIVDGDQQNANDT FT KQEEAIDREEWRLQQKQQLKRPIAESLSTDSNAGQGKGLLKKKPRGGKGKR FT GSKGK" FT CDS 1572..4823 FT /product="Tx1-4_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MAYIRKCSTVNLNAISSTLKLSLLKDYVWNNDLDFVF FT LQEVATENFRFLPSHNALINISSDGKGTGVLIRNSIEFSNVCMNPNGRIIS FT VVIDDINFINIYAHSGSKYKKERDLLFSEDLLIHLSEHKENVILGDFNCII FT DKNDSNGLNKNISNGLKTLVTSLGLIDIELKKNNTRTFTFMRCNSKSRLDR FT VYSSIDFLEKVRSLETVPVPFSDHHSIIIKFEINRQFMFFHGRGYWKMNSS FT LLYDLDILNKFKNLYSNLKQRNSFQNLSYWWNNDVKTSIKKFFKTENFITN FT QQSHREKSFYYNCLKEIIEKQRANIDCSQELALVKSKLIELEQQRLNKIKY FT KIKPESLQNDEKLSLYQITSFINRNTSSKQLKLRIDGRETSDYNLLRNEIF FT NNFSQKFKKQPVNNDNTDEILNTLTAHLDDHEKHNLAKPIEMSELETALRF FT SSKRSSPGPDGISYEFYTHCFDIIKNDLLKLFNTYYINEEYPPGLFTSGII FT TLIPKKGDKLDLQNKRPISMLNTDYKLFTKILWNRIQPILEKLIGPGQAAC FT VKDSSCIHNLKLLRNVLIKANKSKKFKGAILSLDLEKAFDRVDHEFLWKIL FT KKFNFPDNFISCLRKLYKNATSSVLFNGFLTSPFAILSSVRQGCPLSMVLF FT ILYLEPLIRLIDRNLKGVLIDNNFVKVVAFADDINIFIRDDSEFDMTLQLI FT HYYSVYSKIKLNSKKSQFLRFNSCRLGPQQVKEVEEMKILGITFKKDYVET FT IEKNYKDLIQSIIVSLILQQSRRLNLFQKVTILNTFILSKLWYVAQIFPPE FT NKHICTLRSICGKFIWKGLFYKVERKELYLPVLEGGLSLIDVEAKCKALFV FT KNILFSRSNDVVNIDKFMLEQIHTKTITRNTREWLKDADLFMNETHLNTCK FT KXYDTILSRMKITVKKRIELPNKNWQNLFENTNKNFLTSDSKSVLFMILRD FT IIPCNAKMFRHGVRGVESPNCDCCGVPDSVEHRIKNCCSSXKVWSWLNEIL FT KTRFKLTLCDADELLSCNISETNSKEKAALWLTIESMCYCLQNSKSGSIEE FT LKSHIRESRWNKRELFKKHFKHFLNLW" XX SQ Sequence 4997 BP; 1716 A; 808 C; 989 G; 1480 T; 4 other; cagttcgctt ttggacttcg gacagtcaag tcaagttctc gaagtgcgcg cgaagaagct 60 gtaaacgttt tgtcgcactc tgtgcgaccc acagctatct ctattttttt caagcggaat 120 ctaatctacg gatagataaa aaatggaaac ccgaaagaac acgttgaagc tgcagtttgt 180 gaccagtgcc aaagttccgt cgcatctaga tgtgctaaag tttatggcca agggagtgaa 240 aattccagca gccgacgtcc attccgttta caaggacgaa aatgaccaga aattttacgt 300 gaagttcatc gatgaaacca gctacaaccg attctgtaac accgtggaag accagtactg 360 gttccattac gaggatggtt cgaggacacc ggttcaactc gagttggcaa gccggcagtt 420 caagtatgtc cgtttgttca acttgccacc ggagacggag gagaaagaaa tcgcagcagc 480 gttggcgaaa tttgggaaaa tccgacagca cgtgcgggaa aagtaccccg cggacctagg 540 ctaccacgtg ttcagcggca tccgcggcgt gtacatggag gtggaaaagg agataccggc 600 caacttgtac atcgctcatt tccgtgcccg ggtgtattac gaaggactga agaataaatg 660 cttcttctgc aaagcagaag ggcacatgaa ggtcgattgc cccaagctgg ccagtctcag 720 gagcagtgct gaaactggtg gtcagccttc gtacagcagc gttacagcga atctgaagat 780 tgccaccggc agtgcgaaag agacagcaag cccgttactc aacatgacct tgatcccagt 840 tccagctcaa cgcaccaaga ccaaagaaca agctcagcaa gaagaagccg gcgtggtacc 900 aagtacgagc actgtgcagc cgccagtaca agcgaacccg gatctggcgt tgcctcaagc 960 gaaaccagcg gagggacaag gagaggagca gaaggaagct tcgccgatcg acgcgatcgt 1020 cgcgaacgac ccgagcgacc mgagcgccga gcagaccgac gccgaaacgg atgacgaaac 1080 cgagaagtgc aacgatgggc aaggcgaggc ggggggatcc gccagcgacg cgatgatcgt 1140 cgacggagat cagcagaacg cgaatgatac gaagcaggag gaggcgatag atcgggagga 1200 gtggagactt caacagaagc agcagttgaa acggccgatt gcggaatcgc tgtcgactga 1260 ttcgaacgca gggcaaggta agggactttt gaaaaagaaa ccacgtggcg gtaagggaaa 1320 gcgaggtagt aagggcaagt agaaaccaat tgaagttttt ttttctcttt tctttctaat 1380 tttcttctaa attttatttc taattttatt ttattattta tttattttat tttgattcca 1440 tttttattca attttttttt gttctcaaac tttagtttca agaaatgttt ttttttaata 1500 actcgatatg gaattgttga atattatatt cgtgactttg gctttgattt ctttgatatt 1560 ctaaaggtat aatggcatac atacgcaagt gttcgacagt taatcttaat gcaatcagct 1620 caactttaaa actctcgtta ctaaaggact atgtgtggaa caacgattta gattttgtgt 1680 ttttgcagga agttgcaacc gaaaattttc gttttttacc gtctcataat gctttaatca 1740 acattagcag tgatggaaaa ggaactggtg ttttgataag gaactctatt gaattttcaa 1800 acgtttgcat gaatcctaat ggaagaatta tctcagttgt aattgatgac ataaatttca 1860 ttaatattta tgcacattct ggttcaaaat ataaaaaaga aagagatttg cttttttctg 1920 aagatctgct tatccatttg tcagaacata aggaaaacgt tattttaggt gattttaact 1980 gcataattga taaaaatgat tcgaacggat tgaataaaaa tataagtaac ggattgaaaa 2040 ctttagtaac atcattagga ctgattgata ttgaattaaa gaaaaataac acaagaactt 2100 tcacttttat gagatgtaac tcaaaatcca ggctggatag agtgtacagc tcaattgatt 2160 ttttagaaaa agttagatct ttagaaacgg tacccgtacc gttttcggat catcatagta 2220 ttattataaa atttgaaata aatcgacagt ttatgttttt tcacggaaga ggttattgga 2280 agatgaattc atcactttta tatgatctag acattttaaa taaattcaaa aacttgtata 2340 gtaatttaaa acaaagaaat tcgttccaaa acttaagtta ttggtggaac aatgatgtga 2400 aaaccagcat caaaaaattt ttcaagacgg aaaactttat aaccaaccag cagtcgcacc 2460 gtgaaaaaag tttttattac aactgtctaa aagagattat tgaaaaacaa agagctaata 2520 ttgattgctc tcaggaatta gcgttagtca aatcaaagtt aattgaacta gaacaacaaa 2580 gattgaataa aatcaaatat aaaatcaaac cagaatcact tcaaaatgac gagaaactat 2640 cactttatca aataacatct tttatcaatc gcaatacatc ctccaagcaa ttgaaacttc 2700 gaattgacgg cagagaaaca tcagattata acttacttag aaatgaaata ttcaataatt 2760 tttctcagaa gttcaaaaaa cagccagtaa ataacgataa caccgatgaa attttaaaca 2820 ctttaactgc gcatttagat gatcacgaaa aacataattt agctaaaccg atagaaatgt 2880 ctgaattaga aactgcttta agattttcat ccaaacgttc ctctccagga ccagacggaa 2940 ttagttatga gttttatact cattgttttg atattattaa aaatgactta ctgaaattat 3000 ttaacactta ttacatcaac gaagagtacc ctccaggatt atttacgtcg ggaattataa 3060 cattaattcc taagaaaggt gataaattag acttgcaaaa taaaagacca attagtatgc 3120 ttaacacaga ttataaatta ttcacaaaaa ttctatggaa tcgtatacaa cccattctag 3180 aaaagttaat tggaccaggc caagcagcat gtgttaaaga tagttcttgt attcataatt 3240 taaaattatt gagaaatgtt ttgataaaag caaataaatc gaaaaagttt aaaggagcga 3300 ttttaagtct tgatttggaa aaagcctttg atcgtgttga tcatgagttc ctgtggaaaa 3360 ttttgaaaaa gttcaatttt cctgacaatt ttattagttg tttacgcaaa ctgtataaaa 3420 acgcgacatc atccgtatta tttaatggtt tcctgacttc accctttgct attcttagtt 3480 ctgtaaggca aggttgtccc cttagcatgg ttttatttat cttatattta gaaccactta 3540 ttcgattaat tgatagaaat ttaaaaggtg ttttaattga taacaatttt gttaaagtag 3600 tagcttttgc ggatgatata aacattttta taagagatga tagtgaattt gatatgactt 3660 tacagttaat acactactac agtgtatatt ctaaaattaa attaaattct aagaaatcac 3720 agtttttaag atttaacagc tgtcggttgg gtccccaaca agttaaagag gtggaagaaa 3780 tgaaaatttt aggaataact tttaaaaagg actatgttga aaccatcgaa aaaaattata 3840 aagatttgat tcaaagcatc attgtaagtt taattctaca acaatctcga cggttaaatt 3900 tatttcaaaa agtaacaatt ttaaacactt ttattctttc gaagttgtgg tatgtagctc 3960 agatttttcc accggaaaat aaacatattt gcacattacg atctatttgt ggaaaattca 4020 tttggaaagg actgttttat aaagtagaaa ggaaggaact ctaccttcca gtattggagg 4080 gaggtctatc gcttatagac gttgaagcaa aatgtaaagc tctttttgta aaaaatattt 4140 tattttcgcg tagtaatgat gttgttaata tcgataaatt tatgctggag caaattcata 4200 ctaaaactat cacaaggaat actagggaat ggttgaagga tgcagattta tttatgaatg 4260 aaactcattt aaatacctgc aaaaaaatkt atgatacaat tttgtcaaga atgaaaataa 4320 cagttaaaaa gcgaattgag ttaccgaata agaactggca aaaccttttt gaaaatacga 4380 acaaaaattt tcttacatct gatagcaaat ctgttttatt catgattctc cgggacatta 4440 ttccatgcaa cgctaaaatg tttcgacacg gtgtaagggg agttgaatct ccgaactgtg 4500 actgctgcgg tgtgccagac agtgtggaac acagaattaa gaattgctgc tcctcgaasa 4560 aagtttggtc ctggcttaac gaaattttga aaacaagatt caaattaact ttatgtgacg 4620 ctgatgagct tttatcttgt aatataagcg aaacgaatag taaagaaaaa gcagcattgt 4680 ggttgactat cgaatcgatg tgttattgtt tgcaaaatag taaaagcgga agtattgaag 4740 aattgaaaag tcacattcgc gaatcgcggt ggaataaaag agaattgttt aaaaaacatt 4800 tcaaacattt cttgaatcta tggtgaagag gttttgttgg aggattagga tttgttagga 4860 ggttttgttt tttttttttc tttgctttct gtacagtaaa ttaaagtaaa gtgccacgtg 4920 gttaggagtt ggaagtaaat tgtatttgta gtttgattag acgtcaawaa cgcttaaaaa 4980 aaaaaaaaat aaaaaaa 4997 // ID BEL-632_AA-I repbase; DNA; INV; 6060 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-632_AA_; KW BEL-632_AA-LTR; _Pao_Bel_Ele55; BEL-632_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6060 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5106-5657] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(24..2348,2352..6008) FT /product="BEL-632_AA-I_1p" FT /translation="MAYKYPCCDEGEDFDASVRCERCEEWYHFCCVNVDDS FT VAEVVWYCGTCLGEDRDIRTSLGAPGNTAISPVVFNHPRQIEGDRAGIGKS FT TRSKTKGRITRDQAQDEIVEDVEVGNSTQAEDQETIDSERKQEEEEANKDD FT RDAFLHALTEAFGRYCNQVKRSESVPVKKPDRVRESVKKSASLSARVSSPE FT STRTRAWRELQELEEWNRLEEQEEENQRKILELERKRIARKKKVLERRSEL FT RKIIDQGRKVEEFVFNEDEFPEMEAAVEFDEDLELEMFPKDVGPLQRRKNT FT RPPKSQLLDVRLPRPSFEEELGVRHTAFDGECSEERGAIGRSLSRIKMPEP FT GPSRLQIASRQVFPRNLPKFNGAAEEWPIFISAYEQANQSCGFTNAENLVR FT LQEALKGKALETVRNRLLLPENVPLIIEKLKKRFGNPEVLSTRLANRIQQL FT EGPKVESLESVIEFGSAVEEFTQHLKAAGLVNHLKNPILMQSLVQKLPSYY FT AMEWVEYKRRAKAVDLETFGAFMECLVEKALEATFEKSEEIERSKSRDKSR FT TKAFVHATDGGSTQSGQKSVTEKSKSNLPPQSLRRDLKCSVCQEPGHFGRN FT CQELLKQDLEGRWKTVEKLKLCPLCLYDHGQRSCRIRMQCKVEGCQECHNA FT LLHGRRKQAARVQAQCNSHRHETRAVLFRMIPVMLYNGKKKVDTLALIDEG FT SSVTMIDSELAKMLGVDGPIEPLEMTWTNGVQRTEYESKKVVLEISAINSR FT ERYELVKAHTVHRLDLPTQKMDARLVDEFRHLDHIDIPSYGRSVPKIMLGI FT DNLHVIAPLDSRIGECGEPVAVLCKLGWAVYGQQAKVVDRVHFLGHHRCSC FT EECSKTDKGLDELLKQHYQLEEIGVSPVRLESDEDRRAREILEKTTRRVGE FT RFETGLLWKEDDVTFPESFDMALRRMRSLEARMNRNPGVRQILEKQLGDYL FT VKGYAHRITEQELNETPKGKEWYLPLNYVTNPKKPGKVRLVWDAAAKAHGV FT SFNDMLLKGPDMITSLPAVINGFREKRIAFGGDIKEMFHQALVRPEDRQAQ FT RFLFPGADGEVEIYVMDVVIFGSSCSPCSAQFVKNLNAQEHAEQYPEAADA FT IVRKHYVDDYFDSADTEEEAVRKASDVCRVHAKGGFEIRNWVSNSEEVLRQ FT LGGPKCAAKPLEVDKNCPTERVLGVLWDPEGDNFVFSTEIRQDLQAYIREN FT AWPTKRIALRCIASMFDPKQFLAPLLIQGRIIMQDVWRSCIGWDDKLTEEH FT YERWKKWTSLFHLIDEIRVPRCYLGGLESEAYQTVQLHVFTDAGEKAYGCV FT AYFRFENGGVVQCAFIEAKAKVAPLQYLSIPRLELEAAILGARMMKSICEN FT HSFPVLKRFLWCDSDTVVSWVKSDQRRYRPFVAYRIGEIISTTNPEDWRWV FT NSKSNPADDLTKWGKETEIKSSNRWYKGPEFLYQGEENWPKQNRPRVEVAE FT ELRASVLFHYIALPEGLMERMQHISRWSVMVRTIATMFRFVSNCRRRIQKK FT PIEALRKADGGGISSDEVPLRQEEYHAAEVLLWKAVQADEFADEVRTMLKN FT KELPLSEAIPLERSSSLFRLAPFLDEDGVVRMEGRTGAAEYAAFDARFPIV FT LSKDHVVTKRLLEHYHRQFGHSSRETVVNEIRQRFYIPTLRVQVDKVIRGC FT MWCKIQKAKPTVPRMAPLPGQRLAAKVDPFSYVGIDYFGPLDVSIGRRKEK FT RWVALFTCMTVRAVHLEVAYSLTTESCKMAIRRFVKRRGSPTEIFSDNGTN FT FVGANRDLVREINAGCADTFTSSKTRWTFNPPSAPHMGGAWERMVRSVKEA FT LKAFTDGRKLTDEILQTVLVEAEYLVNSRPLTYVSTNVKEDQEALTPNHFL FT RGCSTLECLPSRDPVDLADTLRSSYNRAQFLTDGFWDRWQKEYLPTLNKRT FT KWFVDRRQVAVGDLVFVAEGDKRSSWERGIVKEVFTGKDGRIRSANVQTSR FT GVKTRPVAKLAVLEIDSEE" XX SQ Sequence 6060 BP; 1644 A; 1280 C; 1860 G; 1275 T; 1 other; ttcctcaaaa tctatctacc accatggcct acaaataccc gtgctgcgat gaaggggagg 60 atttcgatgc cagcgttcgt tgtgaaaggt gtgaagaatg gtaccacttt tgctgcgtga 120 acgtggatga tagtgtcgcc gaagttgtct ggtactgtgg gacctgccta ggagaggatc 180 gagacatccg tacttcgttg ggtgccccag gcaacaccgc catatctcca gtggtgttca 240 accatcccag gcagattgag ggtgataggg caggtatcgg taagtcgacg agaagcaaga 300 ccaaaggtcg catcactagg gaccaagcgc aggatgagat tgtcgaagat gttgaagtag 360 gcaactcaac tcaagcagaa gatcaggaga caatagattc agaaagaaag caagaagaag 420 aagaagcaaa taaagatgac cgcgacgcgt ttttgcatgc tttgacggag gcattcggtc 480 gttattgcaa ccaggtcaag cgttcagagt ctgttcctgt gaagaagccc gatcgtgttc 540 gtgagagtgt caagaaaagt gccagtttga gtgccagagt gtcgtcgccg gagtcaacgc 600 gtacccgtgc gtggagagaa cttcaagaac tcgaggagtg gaaccgattg gaagagcaag 660 aggaagagaa tcagcgtaaa atcctggagc tggagcggaa gagaatagcc cgcaagaaga 720 aagtgctgga aagaaggtcg gaattacgga aaatcatcga ccagggaagg aaggtggaag 780 aattcgtgtt caacgaggat gaattcccag aaatggaggc tgcagttgag ttcgacgagg 840 acctggaact ggaaatgttt ccgaaggatg ttggaccatt acagcgtcgg aagaacacga 900 gaccaccgaa atcccaactt ctggacgtca gacttccaag gccgtcgttt gaagaagaat 960 taggtgttcg tcatactgct ttcgacggag agtgtagtga agagcgaggc gctatcggac 1020 gcagtttgag tcggatcaag atgccagagc caggtccatc gcggcttcaa attgcttcga 1080 ggcaggtgtt tccaaggaac ctcccaaagt tcaatggcgc tgccgaagaa tggccaatat 1140 tcatcagcgc ttacgaacaa gcaaatcagt cctgcggttt caccaatgct gagaatttgg 1200 tccgtctaca ggaagcattg aaagggaaag cattggaaac ggtccgaaat cgactgcttc 1260 tgccggagaa cgtcccgttg ataatcgaga agctgaagaa gcggttcgga aatccagaag 1320 tcttgtcgac gaggttagca aatcggattc agcaacttga aggaccgaag gtcgagagtc 1380 tcgagtcggt gatagagttc gggagtgccg tagaagaatt cacccaacac ctgaaggcgg 1440 caggacttgt gaaccacctg aaaaacccga ttttgatgca gagtttggtc cagaaactgc 1500 catcgtatta cgccatggag tgggtggagt ataagcgacg agccaaagcc gttgacctcg 1560 aaacatttgg cgcgttcatg gagtgtctgg tggaaaaggc attggaggca acctttgaga 1620 agtcggaaga gatcgagcgt tccaaatcgc gggataagtc gaggacgaaa gccttcgttc 1680 acgccaccga cggaggaagc actcaatccg gtcaaaagag cgtgacggag aaatcaaagt 1740 cgaatttgcc gcctcagtct cttcgccggg acttgaaatg ttcggtttgc caagaacccg 1800 ggcattttgg gagaaactgt caggaattgc tgaaacagga cctggaaggg cggtggaaga 1860 cggtcgagaa attgaagctg tgtcctctct gtttgtacga ccatggccag cgatcgtgtc 1920 gtataaggat gcagtgcaaa gtggaaggat gtcaagaatg ccataacgct ctgctacatg 1980 gacgtcgaaa acaagcagcg agagttcaag cacagtgcaa tagtcatcgc catgaaaccc 2040 gagcagtgtt gttcaggatg attccggtaa tgctgtacaa tgggaagaag aaggtcgaca 2100 cgcttgcgtt gatcgatgaa ggatcgtccg ttaccatgat cgacagtgag ttggcgaaaa 2160 tgctaggtgt tgacgggccg atagaaccgt tggagatgac ttggacgaac ggtgtgcagc 2220 gtacagaata tgaatccaag aaggtggtgc tggagatatc cgccataaat tcgagggagc 2280 ggtatgagct ggtgaaagcc cacacggtgc ataggttaga cctccctaca cagaagatgg 2340 acgcaagama gctagtggat gagttcaggc acctggacca catcgatatt cccagctatg 2400 gcagaagtgt accgaagatc atgttaggaa tcgacaacct acatgtgatc gctccgctag 2460 attcgcggat cggagaatgc ggcgaaccag ttgccgtgct ttgcaagctt ggttgggcag 2520 tgtatggaca gcaagcgaag gtcgtagaca gagtccattt cttgggccat catcgatgct 2580 cgtgcgaaga gtgtagcaag acagataaag ggttagatga gctgctgaaa caacactacc 2640 agctggaaga aattggagtt tcccctgttc gcctggagtc ggatgaagat cgcagagcac 2700 gcgagattct ggagaagact acccggaggg ttggcgaaag gttcgagacc ggactgttat 2760 ggaaggaaga tgatgttaca tttccggaaa gcttcgacat ggccttgaga aggatgcgga 2820 gtttggaggc gcgaatgaat cggaatcctg gagttcgtca gattcttgag aagcagttgg 2880 gagattatct ggtcaagggt tacgcacatc gtatcactga gcaagagttg aacgagacgc 2940 cgaaaggtaa agagtggtac ttgccgctga attacgtaac gaaccctaaa aagccgggca 3000 aggtacgatt agtctgggat gcagccgcta aagcgcatgg agtgtcgttc aacgatatgc 3060 tactgaaagg accggacatg attacttcgt taccagcggt gattaacggc tttcgcgaga 3120 agaggatagc attcggcggt gacataaagg aaatgtttca ccaagcactg gtgcgaccgg 3180 aggatcgtca agcacaacga ttcctgttcc ctggtgcgga tggagaagtg gaaatctacg 3240 tgatggatgt ggtgattttc ggatcgagtt gttcgccttg ctctgcgcaa ttcgtgaaga 3300 atctgaatgc tcaagaacat gctgaacagt atccagaagc agcggacgcg attgttcgga 3360 agcactacgt cgacgactat tttgacagcg ccgacaccga agaggaagca gttagaaagg 3420 ccagtgacgt ctgccgagta catgccaaag gaggtttcga gattcgcaac tgggtcagta 3480 attcggagga agttctgcgg cagctcggag ggccaaagtg tgcggcgaag ccgctagaag 3540 tcgacaagaa ctgccccacg gagagagttc tcggagtatt gtgggacccg gaaggagata 3600 atttcgtgtt ttcaacggag attcgacaag accttcaggc ctacatccga gaaaacgcct 3660 ggcctacgaa aaggattgca ttgaggtgta ttgcaagcat gtttgacccg aaacaatttc 3720 tcgctcccct gttgatacag ggaaggatca tcatgcagga cgtgtggcgt agctgcatcg 3780 gctgggacga taagctcacc gaagagcatt atgagagatg gaagaagtgg acaagcctgt 3840 tccacctgat tgacgaaatt cgagttccac gatgttatct cggtggacta gagtcggagg 3900 cgtaccagac agtgcagcta cacgtgttta ccgacgccgg agaaaaggcc tatgggtgcg 3960 ttgcctactt caggttcgag aacggcggag tcgttcagtg cgcattcatc gaggcgaagg 4020 cgaaggtggc tccacttcag tacttgtcta tcccgaggtt ggagttggag gccgctattc 4080 ttggtgctcg gatgatgaag tcgatttgcg aaaatcattc ctttcccgtg ttgaagaggt 4140 tcctgtggtg tgattcagat acagtggtgt catgggtgaa gtccgatcaa cgaagataca 4200 gaccgttcgt cgcttatcga atcggagaga tcatcagcac cacgaatcca gaagattggc 4260 gttgggtcaa ctcgaagagc aatccagcgg acgatctgac gaagtgggga aaggaaacgg 4320 aaataaagtc gagcaaccga tggtacaaag gaccggaatt cctgtatcaa ggagaggaaa 4380 attggccgaa acagaaccgt ccgagagtag aggtcgccga ggaacttcga gcaagtgtgc 4440 tgttccatta cattgcgttg cccgaaggtt tgatggagag gatgcaacac atctcaagat 4500 ggtcggtgat ggtccggacg atagcaacga tgttccgctt cgtatccaac tgtcgtcgtc 4560 gtattcagaa gaagccgatt gaagcattgc ggaaagccga tggtggaggt atctcgagcg 4620 atgaagtacc gttgcggcag gaagaatacc atgcggcgga ggtcctcttg tggaaggcgg 4680 tgcaagcaga cgagttcgcc gatgaagtga ggacaatgct gaagaacaaa gagttgccgt 4740 tatcggaagc gatacctctc gaacggtcaa gttcgttgtt tcgcctggca ccgtttctgg 4800 atgaagacgg agtcgttcgg atggaaggtc gaaccggagc agcggagtat gcagcgttcg 4860 atgcaagatt cccgatcgtt ttgtcgaagg accacgtggt gacgaagaga ctgttggagc 4920 attaccaccg gcagtttggc catagcagca gagaaaccgt ggtgaatgaa atccggcagc 4980 ggttttacat cccgacgtta agagtgcagg tcgacaaggt gatacgtgga tgcatgtggt 5040 gcaagattca gaaggcgaag ccaactgtgc cgagaatggc accattacca ggacagcgtt 5100 tagcggcgaa ggtcgatccg ttctcctacg tcggtatcga ctacttcggc ccactagatg 5160 tgtccattgg acgaaggaag gagaaaagat gggtggcact cttcacttgc atgacagtgc 5220 gggcagtgca tctggaggtg gcgtatagcc tgacgacgga gtcgtgcaaa atggcgatca 5280 ggaggttcgt gaagcgacgt ggcagtccaa ccgagatttt ttcggataac ggcactaact 5340 tcgtgggtgc aaaccgagac ctagttcgag agataaacgc gggatgtgca gataccttca 5400 ccagttcgaa gacccgttgg acgtttaacc caccgtcagc accacatatg ggtggtgcgt 5460 gggagagaat ggtgcgctcg gtgaaagagg cgttgaaggc gttcaccgac ggacggaagc 5520 tgacggatga aatcctgcag acggtgttgg tagaagcaga atacctggtg aactcgcgcc 5580 cattgacgta cgtgtccacc aacgtgaaag aggaccagga agcgctaact ccgaaccact 5640 ttctgcgtgg ctgttcgact ttggagtgtc tgccgtcaag agatcccgta gatttggccg 5700 atacattgcg gagcagttac aatcgagcac agtttctgac ggatggtttc tgggaccgtt 5760 ggcaaaagga gtacttgccg acgctgaaca agaggacgaa atggttcgtc gaccgacgac 5820 aagtggcggt cggagacttg gtgtttgttg ctgaaggtga caagcgaagt agttgggagc 5880 gaggcattgt gaaggaagtg ttcaccggaa aggacgggcg gattcgttcg gcgaacgtgc 5940 agacgagcag aggtgtgaaa actaggccgg tagcaaagct tgccgttctg gagatcgact 6000 cggaggagta atccctgttc cgcacccgag aagcagccac agggtttacg ggtgggggga 6060 // ID Copia-2_RP-I repbase; DNA; INV; 4076 BP. XX AC ACPB02046047; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_RP_; KW Copia-2_RP-LTR; Copia-2_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4076 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02046047; Positions 35139 31064. XX CC Positions [1562-2059] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 110..2476 FT /product="Copia-2_RP-I_1p" FT /translation="MDSRGIIVIDKLLGQSNWKDWKFQVRIQLQCNDAMQA FT IDGTLLKPTQLPLKATSKDVEEYNKSLIAYNRIEYYAQGLIASSVTPEVRK FT LINMCTTSKHMWDKLRSIYEQRTEQRQDRLFNEFFGIREKDTSDNIAGHIA FT RLEKLWEELKDETWKEDKVKLPDSLFLNRVLNTLPGQYFEFINAWESVSKK FT DRTIDLLRERLCTVELRFNERSSVCSGDVQKDSVALVVSKEKNVSKDYRTS FT RRGPRCYFCSEFGHVVKYCPNKHGKDAKSDQGARDSAFFGAFEANTSGSDE FT WFADSGATHHMTMNKNFLFSYQLFPALQPVLIGNGQHMMAIGKGNVKVEMQ FT VNQGWKKGTLYDVWLVPDLSKNLFSISSVVSKGLKAVFDSRGCTIMRKNTV FT AATATRSNGIFKVDMRVVLPETLLGTSPQANLANKLDSLQVWHERLSHQNK FT KYVGKFLNENGVNFVDDDFFCEACLYGKQSRMSYGQRTVKPMKPGELVHAD FT VCGPMPVASLGGSQYFLILKDDYSKYRSVYFLKRKSEVSDKLKIFLAEVRT FT RGHTLKEVLTDGGTEFNNGDVKEIIEKAGLSHRVTMPYSPEQCGGAERENR FT TIVEAARSMIHAKGLPESLWAEAVNTATYVINRTGPTPVDNKSPYELWFGR FT KAAISHLKVFGTECFVHVPKQQRKKWDKKSKKGYLVGYCHDKDGFRVWIPQ FT RNDVITSRDVVFKNEETSLCSAESKILEERLEEETGPDLVYIPVPTMGNVE FT QVSTVDDEMVEEMSSPKRLRDRTALRKPIHLEDYVSCV" XX SQ Sequence 4076 BP; 1212 A; 692 C; 1044 G; 1128 T; 0 other; ggttatgggc ccaggacagt tagaatagaa gaacaattcg gagtgtttgc gtttaagttt 60 tgctttgcac tgtgtattat cggatccttt ttgttttggt atattggcaa tggattcacg 120 aggaattatt gttattgaca aattgttggg ccagtcaaat tggaaagact ggaagttcca 180 ggtgaggatc caattacaat gtaacgatgc aatgcaggct atagatggca cacttttaaa 240 gccgacacag ctgccactta aagctacatc caaagatgtg gaagaatata ataaaagttt 300 gattgcttat aacagaatag agtactatgc tcaaggcttg attgcttcga gtgtaactcc 360 tgaagtaagg aaacttatta atatgtgtac tacatcaaaa cacatgtggg ataagctacg 420 cagcatttat gagcagcgta ctgagcagcg tcaggatcgg ttgtttaatg aattttttgg 480 cataagagaa aaggatacat ctgacaatat tgcggggcat atagccaggc tcgaaaagtt 540 gtgggaggag ttgaaggacg agacgtggaa ggaagataag gtaaaacttc cagattccct 600 gtttttgaat agagtgctaa acaccttgcc tggtcaatat tttgaattta ttaatgcctg 660 ggagtccgtg tccaagaagg accgtaccat tgacctattg agggagcggt tatgcacggt 720 tgaattgcgg ttcaatgaga ggtccagtgt gtgctcgggt gatgtacaga aagactccgt 780 ggcattggta gtatcaaaag aaaagaatgt ctccaaagac tatagaacgt ccagaagggg 840 gcctcgctgt tatttctgca gcgagttcgg acatgtcgtc aaatattgtc caaataaaca 900 tggaaaggac gccaaatctg accaaggtgc gcgggattct gctttttttg gagcctttga 960 ggccaacact tctgggagtg atgaatggtt tgcggattct ggtgcaactc atcacatgac 1020 tatgaataaa aattttttgt ttagttacca actttttcca gctttgcaac ccgtacttat 1080 tgggaatgga cagcacatga tggctattgg taaaggaaac gttaaagttg aaatgcaagt 1140 caatcaaggt tggaagaagg gtaccttgta cgatgtttgg ctggttcctg atttgagtaa 1200 gaatcttttt tctatttcct ctgttgttag taaaggatta aaggcggttt ttgactcaag 1260 aggctgtact attatgagaa agaacactgt agctgccacg gcgacaagaa gcaacggtat 1320 tttcaaagtg gatatgaggg tagttctgcc tgagacacta cttggaactt ctcctcaagc 1380 gaatttggct aataagctgg acagcctaca agtatggcat gagcgtttga gtcatcaaaa 1440 caagaagtac gtaggaaagt ttttaaatga aaatggcgtg aattttgtcg atgatgattt 1500 cttttgtgag gcttgtcttt acggcaaaca gagccgcatg agttatgggc aacgaaccgt 1560 gaaacccatg aaacctggcg aacttgtgca tgcggatgta tgtggcccta tgccagtcgc 1620 atccttgggt ggatcacaat attttttaat actcaaagat gactacagta agtataggag 1680 tgtctatttc cttaaacgaa agtctgaagt atcagacaaa ttgaaaatat ttttagccga 1740 agttcgtaca cgcggtcaca cattgaagga ggtgctgacg gatgggggta ctgaatttaa 1800 caatggtgat gttaaggaaa ttattgagaa ggcgggcttg tctcatcggg tgacgatgcc 1860 ctactcgccg gaacagtgcg gtggcgcaga gagagaaaat cgcacaatag ttgaagctgc 1920 acggagtatg atacatgcca aagggttacc ggagagtctg tgggctgaag ctgttaatac 1980 tgccacatac gtgattaaca ggacggggcc cacaccagtg gataacaagt ctccctacga 2040 attatggttc ggtaggaaag ctgctattag tcatctgaaa gtttttggca ccgaatgttt 2100 tgtgcatgtc ccaaagcagc agagaaagaa atgggataaa aaaagtaaaa agggatacct 2160 cgttggatat tgccatgaca aagacggttt tagagtttgg attccccaga gaaatgatgt 2220 aattaccagt cgcgacgtgg tgtttaaaaa tgaagaaaca agtttatgtt ctgcagaaag 2280 taagatacta gaagaacgcc tggaagaaga gactggtcct gatcttgtgt atattccggt 2340 tcctacgatg ggcaatgtgg aacaagtttc cacagttgat gatgagatgg ttgaagaaat 2400 gtcgagcccc aaaagactga gagacagaac tgcgctgaga aaacccattc acctggaaga 2460 ttatgttagc tgcgtgtgag gagccgctaa cactggaaga ggcactggaa tccgaggagg 2520 agccgagctg gagagaagcc atcaaagaag aacttgcatc gctggatgaa aatcaaacat 2580 ggactcttgt tgatctgcct gaaggaagga aggcaataga caatcgatgg gttttcaaga 2640 gaaagtacaa ttctgatgga actataagac ataaagccag attagttgct cggggtttca 2700 cccagaagcc tggggttgat tatagagaaa cgttttcacc agtagtaaga tgggatactg 2760 ttagggcgtt tttaagcgta gttgaatcta gaaattttat tttgggtcaa tttgacatta 2820 aaacggcttt tcttaatgcg aatatagatg aagaaattta tatgaggcaa ccgaaaggtt 2880 ttagtgatgg ctcggagcgt gtttgcctgt tagagaaaag tttatacggc cttaagcagg 2940 catctaggaa gtggcaccaa aggttttgtc gattttttgg agaactgtgg cctagtttgt 3000 agtgacgaag atacatgctt gtttatttcc tcagatcact ccttgatctt gattttgtat 3060 gtcgatgatg ggcttgtggc tgtcaacaaa cgagaagttt tggatgtttt ctttttgaaa 3120 ctgcagaagg aatttaaagc aactgtgagt tacagtgtga accaattttt aggtatggaa 3180 gtaagtcaga tacaggatgg ctctatcttc ataagccaac agacgtatgt gaagaaggtt 3240 attagtaagt ttggcatggt tgaggcgaac cctgtactga cacctgttga tcccagccat 3300 ctcgccgaag tttgcactga gaatctagat aagtcctatc cttaccgcga gctggtaggt 3360 agcttaatat atttatccat tgtcaccaga cccgatattt cgtatgttgt aggtgtgttg 3420 tcacaggtat tggacaaacc caaactcaaa cattggaata tggttaaacg tttgctgcga 3480 tacattaagg ggacgcaaga ttttggaata cactaccagc aggactcaaa ccggaaattg 3540 gaagtatata ccgatgctga ctatgcaggt gattcaggtg atagaagatc aactacagga 3600 ttatttttcc gctattgtgg aggggctgtt agctggagga gtagcaagca gaagtgtata 3660 gcactttcca ccacagaagc cgaatttatt gctgccagtt caaggagcga aggaagctat 3720 atggctgtcc ccgtttgtat gcctctttgg ttggctcagc tgatgttcca gcgctaatga 3780 ttgataatca gagtgctatt cgactgatta agaacccgga gttccatagt cgcactaaac 3840 atattgatgt acgatacaag ttcgtcagag aaaagtacca ggagggtggc ctacaggtgc 3900 aatatgtacc tagtgaatac cagattgcag atatctttac gaaaggactt tcaagacccc 3960 gcttcgagag catgagggag agtttgggac taatttctaa gggcgctctt tcatcaacct 4020 tttaaattgg actttaaagg gggggttcaa attaagaggg agtattagaa ttaatt 4076 // ID Gypsy-167_AA-I repbase; DNA; INV; 7458 BP. XX AC supercont1.341; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-167_AA_; KW Gypsy-167_AA-LTR; Gypsy-167_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7458 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.341; Positions 277074 269617. XX CC Positions [3577-4080] - Reverse transcriptase CC Positions [5134-5613] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 467..2758 FT /product="Gypsy-167_AA-I_2p" FT /translation="MAYASNQIGLGAYINPSDLLPDEVQYELELRGVNMSR FT APLVEQRGMLRNLQLDELRSPKLIKSKKTILDELAVIQVKLITIRQEFEIQ FT GVRQYLISRLRHVRLRVLRLNAINAEQEKVREDLLEEIDFLFDHFGNPDDR FT TPVMVEERAAAQGGTVRIPRDPTLEFPPPPGVARSSGIATVVKSNLAGNTT FT TFLGGLNRPRVAFQSSLSSGSPGLTNAAFSSHPSQGIISIPHEEGEAANLP FT AMIPPPAEELGVEVVSNEPEFPTPPLLDFNFGIEDTLHHPRASQSMPENNL FT GPEGPSSHSGPPQPSAQFQSTQNFNRILDQQIPPPTADPPHESPVTPTITE FT SIRLRAEVSRIVEDVLTNQLEGIMIRIAQGMQEFQSRMSQSEGRRVENVPQ FT NPNPYTTVQREPLEQRVPLAGNTTTRQGPTEMPSYRIQNATSTPVPRIERE FT IRPPQAPRNPSAEVHPPYNAAANLGVDTRMFQNRIPINKWPIRFSGDPKGM FT SVEEFLKRVDVLARNNQVSEMELLSKANFLFRPESVAEIWYYTFSHKFASW FT DTLKYHLRLRFEVPNKDKVIEREIRERKQLPNETFVTFMGEIERLCQQLSK FT KMTEKTRKGILLDNMRDWYRPHLAFIDTEGISIETLCNLCYELDKSVYRSY FT TQRPRPYQVHCLDEENLDDVSAQEDPEINAIARGKSREQVGVMERKETSRE FT QVTSSTTPILCWNCRQFGHYWRDCAKNKRVFCHICGESDVVAMNCPSNHRS FT SHIGPKNELSEGN" FT CDS 2830..5991 FT /product="Gypsy-167_AA-I_1p" FT /translation="MNIQTKENRCPYVTVNILGTEVKGLLDSGAAASILSD FT QNLLQKHNFKKEPTNLKIKTADGTSHECANLIYIPYSFHGTTKVVPTLYVP FT NLAKKLILGNDFWKIFQITPAFEVNGYLTRLETNCVISDLVDLNCIDQYFS FT EGVEINLILEQSSDRVMPVENEDVSLDLPSVEEPQPLNSSDIVTEHQLNPE FT EQKELTKIIDMLKGDGKLGRTVVLEHRIEILEGEKPKRPPRYRWSPAIERE FT MEKEIQRMKDLDVIEESTSDWCNPLLPVKKSSGEWRLCLDCRRINEVTKKE FT AYPFPDMQVILGRIEKARYFSIIDLSKAYWQIPLAPESRDFTSFRAGKSLY FT RFKVMPFGLTGAPVTQTKLMNRVLGYDLEPQVFVYLDDIVVTSRTVEEHFK FT LLRTIAERLRAANLSISIEKSRFCQKRISYLGYILSEEGLSIDSKKLDPIL FT NYPSPTTLKEVRRLMGMIGFYKQFIPNYSTILAPITDLLRKNRSKLIWTET FT AEQALGTVKGILTSPPVLANPNFNLQFVIESDASAVAAGAVLVQYQEGVRR FT PIAFFSRKFSATQRKYSATERECLAVILAIERFRHFVEGTQFRVVTDAQSL FT KWLNKISVEGNSARLARWALKLQQHDILLEYRKGKLNVVADALSRAVEVDV FT LSVDLDLERLKKNITAKSDQYKDFKVSHGKVYKYVPSGRSEDRRFEWKLLP FT SKSERIQILQAEHGIAHLGFYKTLRRIQEKYYWPGMSVEVKRYCRGCETCM FT ISKHPNHPKIPPMGRPKLASMPWQVISVDYMGPFPRSKSGNTSLLVVTDHF FT SKFVLIQPMREAKTETLVKFLESMVFLLFGVPEILISDNGPQFKSLLFETL FT IKKYNVSHWKNANYHPANNPTERVNRVIVAAIRSYLKGGNQKACDENLQKV FT AMAIRTAVHESTDFTPYFINHGRNYVSSGQEYAMIRNSSSDELDYDPKTLN FT QEMKEIYDQVRTNLKKAYEKYSKYYNLRSRAKVSSFKVGDIVLKKNFFLSS FT KANNFNAKLAQTYSPAKVIAKRGMHCYELEDLAGNNIGVFRSADLQSK" XX SQ Sequence 7458 BP; 2339 A; 1547 C; 1597 G; 1975 T; 0 other; cttggcgcat aagtggatta agttgttatg acattctgac ctcgtgacgc gtaaccgtta 60 cattttggcg cccaacgtgg ggccctttga aaaattaggg ggcttgaatc aatttaggaa 120 caaattttgc attcgaatac gactttgatt tcatttcatt tcttttcgat ttcgattatt 180 ctaagaatca tagtttaagg atcaatactc acgtgaatta aagaaaatcg gtttattttg 240 taggtttagt ttttcgtttt tttttttcca agttctttag ttgaagttat ctaaaatagg 300 ggaaaagtat cgttaggact aagaattgaa aaaaaattag ctggagtctt ctttaatgta 360 tttttttctc tgggttggtt tgtattccct tttgtttttt tttgcattga ttcacattga 420 ttaatctcct tttttgaata ttgaattgac tttctattta cgtgcgatgg cttacgcgag 480 caatcaaatt ggtttagggg cgtatattaa ccctagtgac cttctacccg atgaagttca 540 atacgaacta gaacttcgag gggtgaacat gtctcgagct cctttagtcg agcaacgagg 600 catgctacgc aatttgcaat tggatgagct tcgtagccca aaactgatta aatccaagaa 660 aactattttg gacgaacttg cagtgattca agtgaaactg atcaccataa gacaagagtt 720 tgagatacaa ggagtacgac agtacttgat atctcgctta cggcatgttc gtttgagagt 780 gttacgactg aatgccatca atgcggaaca ggagaaagtt agggaagact tgttagagga 840 aatcgatttt ctgttcgatc atttcgggaa tccagatgat cgcacacctg tgatggtaga 900 agagagagcg gccgctcaag gagggacagt gagaattccc agggacccaa cattggaatt 960 tcctccaccc ccaggcgttg cgcgttcatc gggaatcgcg actgtggtta aatcgaactt 1020 agctggtaac accacgacat ttttgggtgg gttaaatcgt cctcgagtcg cttttcagtc 1080 gagtctgagc tcaggatcgc ctggtctcac caacgcagca tttagcagtc acccatcaca 1140 agggatcata tcgatccccc atgaagaagg agaagcagcg aacctcccag caatgatacc 1200 tccacctgca gaggaactag gggtcgaagt ggtgtctaat gagccggaat tcccaactcc 1260 accacttttg gacttcaact tcggaatcga ggatacactc catcatccga gggcatcgca 1320 atcaatgcca gaaaataatc ttggacctga aggtccgtca agtcattctg gaccacccca 1380 accatctgcg caatttcagt cgacacaaaa cttcaatcgc attctcgatc aacagatacc 1440 tcctccgaca gcggatccac ctcacgagag tccggtgacc ccgacaatta ctgaaagcat 1500 aaggcttcga gctgaggtta gtcgaatagt cgaagacgtt ttgactaatc aactggaagg 1560 aataatgata cgaattgccc aaggaatgca ggaattccag agtaggatgt cacaatctga 1620 aggaagacgc gtagaaaacg ttcctcagaa tcccaatccc tatacaacag tccagagaga 1680 accgttagaa cagagagttc cgttagccgg aaataccact actcggcaag gtccaactga 1740 aatgccaagc taccgtatac aaaatgcaac ctcaactccg gtaccacgaa tcgaaaggga 1800 aatccgacca ccgcaagctc ctcggaaccc gtctgcggaa gttcatcctc catacaatgc 1860 cgcagcgaac ttaggtgtcg acacccgaat gtttcagaac cgaattccaa ttaacaagtg 1920 gccgattagg ttcagtggag acccaaaagg aatgtccgtt gaggaattcc tgaagagagt 1980 cgacgtatta gcacgcaata atcaagtctc ggagatggaa ctgcttagta aagcaaactt 2040 cctattccgg ccagaatcag tagctgaaat ttggtattac accttcagcc acaagtttgc 2100 atcatgggat acgttgaagt accatttaag gcttcgtttc gaagtgccca ataaggacaa 2160 ggtcatcgaa cgggagatcc gagagcgaaa acaattgcca aacgaaacgt tcgtcacttt 2220 catgggagag atagaacgtt tgtgtcagca attgtcgaaa aaaatgacgg agaagaccag 2280 gaagggaatt ttgctcgata acatgcgtga ttggtatcga ccccacttgg ccttcatcga 2340 caccgaagga attagcattg agaccttatg taatctctgc tacgagttgg acaaatcggt 2400 ctataggtcg tatactcagc gacctagacc ataccaggta cactgccttg atgaggagaa 2460 cttggacgat gtttcagctc aagaggaccc ggaaattaat gccattgccc gaggaaaatc 2520 gagagaacaa gtaggagtca tggaaaggaa ggaaacatca agagagcaag ttacgtcgtc 2580 tactacacct atcttgtgtt ggaattgcag acagtttggg cactattgga gggattgtgc 2640 caaaaataaa agagtatttt gtcacatctg tggcgaatca gatgtagttg ccatgaactg 2700 cccgagtaat caccgaagtt ctcatatagg tccaaaaaac gaactctcgg aagggaatta 2760 gggagcgacc cttccaacag ctcgaatacg cctccccctc cgctcgtgta tcaaaaattt 2820 gcccctataa tgaatattca aacgaaagag aacagatgtc cgtacgttac agtcaatatt 2880 cttggtacag aggtcaaggg acttttagat tctggagcgg ccgcaagcat tttaagtgat 2940 cagaacctgc tccaaaaaca taattttaag aaagagccca ccaatctgaa aattaaaacc 3000 gcagatggaa catcacacga atgtgccaac ctgatttaca ttccgtactc ctttcatggg 3060 actaccaagg tggttccaac tttatatgtc ccaaatcttg ctaagaaatt aattcttggc 3120 aatgacttct ggaaaatatt ccaaattaca cccgcttttg aagtaaatgg atatctaact 3180 cgattggaga ctaactgtgt aatttctgat cttgtggact tgaattgtat cgaccaatat 3240 tttagcgaag gcgttgaaat caaccttatc cttgaacaat cctccgatag ggtaatgccg 3300 gttgaaaatg aagacgtgag tttagattta ccgtccgttg aagaacctca accattgaat 3360 tcttcagaca ttgtgaccga acaccagctt aacccggaag aacagaaaga acttacgaaa 3420 attatcgata tgctcaaagg ggatggcaaa ctaggaagga cagtcgtatt ggaacaccga 3480 attgaaattt tagagggaga aaagcccaag aggccaccca ggtacaggtg gtcaccagct 3540 atcgaaagag aaatggagaa agaaattcag cgtatgaagg atttagacgt gatcgaagaa 3600 tcgacatccg attggtgtaa ccccctcctc ccagtgaaaa agtcatctgg tgaatggagg 3660 ctgtgcctcg attgccgtcg gatcaatgag gtcactaaaa aagaggctta cccattccca 3720 gatatgcagg tgatcttagg tcgaatcgaa aaagccaggt atttttcgat catcgaccta 3780 tcaaaagcct actggcaaat ccctttggct ccggagagcc gagactttac gtcctttaga 3840 gccggaaaat ccttgtatcg attcaaagtg atgccgtttg ggttaaccgg tgctcccgta 3900 acacaaacca aactaatgaa ccgggttcta gggtacgatc ttgaacctca agtcttcgta 3960 tacctagatg atattgttgt tacatctagg accgtcgaag agcatttcaa gttgttacga 4020 acaatcgcgg aacgattgag agctgcgaat ctgtcgataa gtatcgaaaa atccagattt 4080 tgccaaaaaa ggatatctta tctaggttac atattgtcgg aagaaggact ctcgatcgac 4140 agcaaaaaac ttgatcccat tctgaactac ccctcaccca caacactaaa ggaggtcaga 4200 aggttgatgg gcatgattgg tttctacaaa caatttatcc caaactattc tacgatcctg 4260 gcacctatta ccgatctgct aagaaagaac agaagtaagc ttatatggac tgagacggca 4320 gaacaggcat tgggaactgt caaaggaata ttgacttcac cacctgttct cgccaatcca 4380 aattttaatt tgcaattcgt gattgagtca gacgcgtccg cagttgctgc tggagctgtt 4440 ctagtacaat atcaagaagg agtgcgtcga ccaatcgcct ttttctcaag gaaattctca 4500 gcaacacaga ggaagtactc cgcgacagaa cgcgagtgtt tggcggtaat cctcgccatt 4560 gaacgattca gacacttcgt agagggaaca cagttccgag ttgtaacgga tgcccagtcc 4620 ctaaaatggc tgaataagat cagcgtggaa ggcaactctg cgagattggc tcgatgggca 4680 ttaaagcttc aacagcacga tatcctctta gaatatcgaa aaggaaaatt aaatgtcgtt 4740 gcagatgctc tctctagagc agtagaggtt gatgtactct cagtagatct cgatttggaa 4800 aggttgaaaa agaacatcac tgcaaaatca gatcagtata aggacttcaa agttagtcat 4860 ggaaaagtgt acaaatatgt tccttcgggt cgctcggaag atagaaggtt tgaatggaag 4920 ttacttccat ccaagtcaga acggattcaa attcttcaag cggaacatgg aatcgctcat 4980 ttagggttct ataaaacact tcgtcgaata caagaaaagt attactggcc aggtatgagt 5040 gtggaagtta agcgatactg tcgtggatgt gaaacatgca tgatatccaa acatccaaac 5100 cacccaaaaa ttcccccaat gggtcgacca aaactcgctt ccatgccatg gcaggttatc 5160 tctgtggact atatgggacc attcccacga tctaaatcag gaaatacatc acttcttgta 5220 gtcaccgacc acttctcgaa attcgtcctt atccaaccta tgcgcgaagc caaaacagaa 5280 acactggtaa agttcctaga gtcgatggta tttttactgt tcggggtccc tgagatttta 5340 atatccgaca atggacctca gttcaaatcg ttattattcg aaacgctaat aaaaaagtat 5400 aatgtgagcc attggaaaaa cgcgaactac caccccgcta acaatcccac agagagggta 5460 aatcgtgtca ttgttgccgc tattaggtcc tatttgaagg gaggcaatca aaaggcctgt 5520 gatgaaaatt tgcagaaagt tgcgatggct ataaggactg cggtacacga gtctactgac 5580 tttactccat attttataaa ccatggaagg aattatgtta gctctgggca ggaatacgct 5640 atgataagga actcttcgtc agacgaatta gattatgatc ctaaaacatt gaaccaggaa 5700 atgaaggaaa tttatgatca agttcgcacc aatctgaaaa aggcttatga aaagtacagc 5760 aagtactata atctgcgatc aagagcgaaa gtctctagct tcaaggtagg agatattgtt 5820 ttaaagaaaa atttctttct atccagtaag gctaataatt ttaacgccaa actggcccaa 5880 acgtactcac cagccaaagt gatagccaaa cgcgggatgc actgctacga gctcgaggac 5940 ttagccggta acaacattgg agtcttcagg tcagctgatt tgcaatccaa gtgagcaaat 6000 tgtgttccag ctatgacggt gttccagcaa cacataaaca aaatttccat tgagaaagat 6060 aaaatactct ttgaggcttg cttatccaac gttgagatgt ccttcgtgtt tcctccgttg 6120 tggacaatga atccgtttat gttaaggatt tgcactgtat tcaaaaccat taaaaaaaaa 6180 gcccttacgc accataaaca ctctgggagc tgataacgct ccggtattgc ttgtttacag 6240 aggtgtaaac ctacaaagaa agccatcact tttcaatatt cttcctattt tctttaatca 6300 tttcttcgtc tcaatagact tcatcactta actcaataat atcaaggatt tcccactttt 6360 cacatttggt ttcactaaaa ttgtagatca tttttgattg aactttccgc tccatatttc 6420 ttcttccacg aatcaacaac ttttgttcgg acattctgaa ctaggaacaa aagttgctaa 6480 atttttcaat ggagtcaatg acctccgatc caatcaaccc agcagcacgg caagatagtt 6540 atctccaacc agacccattc gctaaaaaga aaagaagacc aaatcagcct acataaggac 6600 agataccagt tcaaaattca tgaccttttc gaattccatt ccatagcaca gtaatattta 6660 ctaattattc acacgtaaaa taatcatatt actagcactt tagttctaat ttaaacactt 6720 ttcaaacgct tcgcgacgaa ccgacaccat ttagactcgc gacggatagg tagactaact 6780 gacagcctat cgttgccatg gaagagagta gtgagtagta agagaaaatg agtggcttga 6840 ggaatgctgt cagttgcaga gggtaccgtc gttgggtgta ataccggatc acgatcagcc 6900 agtataatta tgaaggaaat agggaggagt ataaggacga tcatttagat tttttgtgag 6960 tagcttttcg gatagttatc tgaagtagag aaccaagagt tccagttcgg taccgttcgc 7020 cgagttgtat ggttgagtgg atactcgact acgggatgaa gacccttgtt gttcagattc 7080 gttttttttt gtggcggata gtaatctgcg ttgatccata tacatgaagt aggatcgaag 7140 atgtggaaaa ggaataaaat aagaagaagg tctagcgcga tgtgagtgat ccgtaataac 7200 gttgtaaaaa tggttccgtg aaaaatcgaa aagcttcatt gtacatagta atcttaaaaa 7260 tctaataata tataaagtaa aatctaagta atcaaacatt tcactttcgc acatatttca 7320 actcaataca ataagttgag agtcataaac aaataaggta agacaaagat tttgtttttt 7380 ttttggttcc ggttaaaatt taatacaaat atcaaaaata cattttcaaa tatatttttt 7440 agttgagata gggggaaa 7458 // ID SMAR3 repbase; DNA; INV; 1992 BP. XX AC . XX DT 24-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR3. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1992 RA Jurka J.; RT "SMAR3: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 992-992 (2007). XX DR [1] (Consensus) XX SQ Sequence 1992 BP; 793 A; 277 C; 307 G; 614 T; 1 other; tacagtaaaa ctcgtaattg aacctcggat aaatgagcaa cccctctaat tgaacatata 60 taaccttaat tggttggttc aggcaatttt aatatataaa ttaattgctt aattgaacac 120 tcttttaaat gaacaactct gattaatgaa caccaattaa ggcccttatt tgttttataa 180 aacttttaat tgaacattaa ttttttttat taattggatg tcattatttt acataattat 240 aaatacttaa acccaatcaa aaaattatct atttctatta aataattttt gtcaattaaa 300 atgtccaaac gaaaagcagc aacattagaa gaaaaatacg aaataattaa gcgaattgaa 360 tcgcaaaaag tttcgcaaat ccaagtttct attgaaacar gagttccacg aagcacaata 420 gcaagatggt gtgggcttga aaaagataaa attatagagg catatgaagc aagcaatcaa 480 agtcctgcaa gaaagcgtct cagaggttca aaatatgaaa aaatcgatga atgccttttt 540 gaatggttca aagaaaaaag aaaagttaac gttccaataa atggtcctat tcttcaagca 600 aaagcagaag attttgcaac acttcttgga caagacttca aaccatgtaa tggatggctt 660 gaccgatgga aaacaagata taatgtatcc ttccaaattc ttaatggaga atcggccaaa 720 gttaatgaat caactgtaga agaatggtta agtaaacttt caaacatcac aaaagatttc 780 aagtttgaag acatttataa tgcagatgag acagggctat tttataaatg tttgccaaat 840 aaaacattta atataaaagg cgaaaagtgc tttaatggtg aaaagtcaaa agaaagaatt 900 acaatacttt ttggatgcaa tgcaacagga tcagataaac tcacaccttt agtaatcggc 960 aaatctctaa aaccaaggtg cttcaaacgc ataaatatgt caaatcttgg agtttactac 1020 cggagataat agaaaagctt ggatgaccga agaagttttt cgtgaatgga ttgcaaaaat 1080 tgacaaaaag tttcaagctg aaaatagaaa aatacttcta tttattgata acttttcagg 1140 acatgctttg aaatcacaat ttaacaatgt aaaagttgtg ttttttcctc caaattgtac 1200 ttcaaagctt caaccaatgg atcaaggcat aattgcaaat ttcaaactta aatatagatc 1260 tgcgatggtt agaagaattt tggagaatct tgatggtgaa gttgatcgtg aaaatattga 1320 agaaataaat gtcaaacaag caattgattt tatcgtatca gtctggaagg atatgaattg 1380 ttcaatcata caaaattgct tcaaaaaatg tggatttgat atcgaaatag cgtcaaataa 1440 tgataataac gatatagaac ccattaagtc tgcaattgtt cagcttcaat caaacaatat 1500 aataccagag aacttcaact ttgaagatta tgttaattgt gatgaagaac ttgctgtatc 1560 agctatcatt gatgaccagg aaatcgttga aaccatcttt gaaaaagata aagaaattgc 1620 agacaaaact gatgagcaaa ttgaggaaga tgaagataaa tttcaacgtc caactaaaaa 1680 acaagcatta gacgcaatca acacattaag atattatttt caatgtgatg aagctgatac 1740 aaaagatgaa cttgaaatta ttcacaatag aaaaagataa gtatttcaaa taaaaaacaa 1800 acaaaaatat ctgatttttt taaacgttaa atataaatgt attctttgct tttatttttg 1860 tatttggttt ttataataat taattaaata aatttttttc ttaatttttt tgtaactctt 1920 aattgaacac cctgctaaat gaacatgatt ttcgtgcccc attgaaagtt catttaaacg 1980 agttttactg ta 1992 // ID Copia-23_AA-LTR repbase; DNA; INV; 204 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-23_AA_; KW Copia-23_AA-I; Copia-23_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-204 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 950-950 (2011). XX DR [2] (Consensus) XX SQ Sequence 204 BP; 53 A; 49 C; 35 G; 67 T; 0 other; tgttggcgta gtaggcaatc ggacgtcgcc tccctaaatt gagttgttag gatcccctag 60 caacaaccag ttacctgttg gtgaaaatta acaaactggt ttattccata atcaaactcc 120 gttcagttaa cacctaataa acctaagctt ttcgtttctg ttaaactgtg cctattattt 180 ggagtttccc tgccattccc ttca 204 // ID P-32_HM repbase; DNA; INV; 3157 BP. XX AC . XX DT 07-JAN-2009 (Rel. 14.02, Created) DT 07-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-32_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3157 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 9(2), 444-444 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(152..331,335..733,702..2840) FT /product="P-32_HM_1p" FT /translation="MPGCAAVGCHNRPEKGFIMKCFPRNEKQRAIWTSKVK FT RLNWIPTNTSFLCEVSIRKYSFMLILEMSYFRIHTNTLYSFNFFIKVHFDA FT DQWEKTREDGSRKLKHNAIPTVFKHVRAKHKRKNPLERFPIPKKKKISCIE FT SGGEVVENVLIEVNNNEVSESVCTTEVDKSLLIQDLNAKLERLEKKVINQK FT KKNKKKSLIKKKKINILKSKARINTIQKYNHIKRDKFRVLKTIFKEDQILS FT LSRKNMKFIKWSNDTINEALELKFTCGQSGYENLLRLNLPYPSLRTLRRRL FT ENLKFQSGILTEVFEFMKTKVNAMKIHEKECVLILDEMSITESVDYDTSTS FT SYYGVVTLPEHYGQANHALVFMLGGISTRWKQTVAYYYTGSSVNGGVLKNI FT VLSIIKYTEEIGLKVNSVTSDMGAINQAMWKSFNISCTKSGSISSSIKHPC FT DPTRTIDFLADVPHLLKNIRTSLLCNKHFIITAKIQEKYKLQSPIIDSKHL FT NKLVDFQEGMDLKFAYKLTSEHLDEKHFNKMKVSRATSVMNHDVSCGLKFI FT ASEIKDDSILTTAWFIGIVDKWFTLMTSRNPGMALSMKNDKVYNETITFLN FT EVIEIFNGLKILGQTGKIRWKPIQSGVILSTTSVINLQNKFLLEKKFDFLM FT TSRFTQDCLENLFSLVRAKQVIPTALQFKNNLKLICVAQFLKKSSKGSYDI FT DDRQFLSGFLDIVNSKVHKPQQRECILLPKDWDKTSNLILSNAEQNCLFHI FT AGYIISRVKKIEKVCTICFESLVSKNRVNADYTKFVTDKEHKVGSLFFANE FT TTFAFFLQMELIFRSFEPYLNSQKNCNTMLLEKINSDEQIKSLTLYPSCHQ FT IKLKLKNRFITFRLKIFSLKKKRLLCKKKKTKKSSCELGSKSMTMHALALK FT V*" XX SQ Sequence 3157 BP; 1195 A; 431 C; 497 G; 1034 T; 0 other; tagagagcct atatactcag agagcggaca tacgtgtgat cgtttttgat tggttgttcg 60 atttttatga ttttagacgc catctttttg ccaaaaatac taactgcgat taaagtctac 120 tactaaattt gtattaaata tttgtaaaag tatgcctggt tgtgctgcag taggctgcca 180 taacaggcct gagaaaggat ttattatgaa atgcttccca agaaatgaaa aacaacgggc 240 gatttggact tctaaagtta aacgacttaa ctggatacct acaaacacaa gctttttatg 300 tgaagtaagc ataagaaaat acagttttat gtaattaatc ctagaaatga gttattttag 360 aatccataca aatacattat attcgttcaa tttttttatt aaggtacact ttgatgctga 420 tcaatgggaa aaaaccagag aagatggatc aagaaaactc aaacataatg caattcctac 480 tgtctttaaa catgttagag caaaacataa gcgaaaaaat ccactcgagc gttttccaat 540 accaaaaaaa aagaaaataa gttgtataga aagtggagga gaagttgtgg aaaatgtttt 600 aattgaagta aataacaatg aagtatctga atcagtttgc acaactgagg ttgataaaag 660 tttacttata caagatttaa atgcgaaatt ggagagatta gaaaaaaaag tcattaatca 720 aaaaaaaaaa aattaacatt ttaaaatcga aagcaagaat aaatactatt caaaaatata 780 atcatataaa gagagataaa tttagagttt tgaaaacaat atttaaagaa gatcaaattt 840 tatccttaag tagaaaaaat atgaaattca ttaaatggtc aaatgatact ataaatgaag 900 cgttagaact caaatttaca tgtggtcaaa gtggttatga aaatttactt cgtttaaact 960 taccataccc atcattacgt accctacgac gaagattaga aaatctaaaa tttcaaagtg 1020 gcattctaac agaagtcttt gagtttatga aaactaaggt gaatgccatg aaaatccatg 1080 aaaaagaatg tgttttaatt cttgatgaaa tgtcaataac tgaaagtgtt gactatgaca 1140 catctacaag ctcttactat ggagttgtta cattacctga acattatggc caagctaacc 1200 atgctttagt ttttatgctt ggaggaatat ctacaagatg gaaacaaact gttgcatatt 1260 attatactgg aagttctgtt aatgggggtg ttttaaaaaa cattgtccta tctataataa 1320 agtatactga agaaatagga ttaaaagtta atagtgttac ttctgatatg ggtgccatta 1380 accaggcaat gtggaaatct tttaatatta gctgtactaa aagtggcagt atttctagtt 1440 caattaaaca cccatgtgac cctactcgta caattgattt tttagcagac gtaccacatt 1500 tattaaagaa tataagaaca tctcttttat gtaacaaaca ttttataata acagctaaga 1560 ttcaagaaaa atataaactt caatcgccaa ttatagattc aaaacacctt aacaagcttg 1620 tagactttca agaaggtatg gatttgaagt ttgcttacaa gcttacatct gaacatcttg 1680 atgaaaaaca tttcaacaaa atgaaagtat caagagcaac atctgtaatg aatcatgatg 1740 taagttgtgg attaaaattt attgcaagtg aaataaaaga tgattctatt ctcacaactg 1800 cttggtttat aggaattgtt gataaatggt ttacattaat gacttcacga aacccaggta 1860 tggctcttag catgaaaaat gacaaagttt acaatgaaac aataactttt ttaaatgaag 1920 tcatagagat attcaatggt ttaaaaatat taggccaaac tggaaaaata aggtggaaac 1980 ctattcaatc tggtgttata ttatcaacaa catcagttat taacttgcaa aataagtttc 2040 ttctagaaaa aaagtttgat tttctaatga catcacgatt tacgcaggat tgcctggaga 2100 atcttttttc cctggtacgt gcaaaacaag taattcctac tgctcttcaa tttaaaaata 2160 acttgaaact tatttgcgtt gcccaattct tgaaaaaatc atcaaaaggc agttatgata 2220 tagatgaccg tcaattctta tctggttttc ttgatatagt taatagtaag gtgcataagc 2280 cacagcaaag agagtgcatt ttgttaccta aagattggga taaaacctct aacttaatct 2340 taagtaatgc tgagcaaaat tgtttgtttc atattgcagg ttatataatt tctagagtta 2400 aaaaaattga aaaagtttgc acaatttgtt ttgaatctct agtttctaaa aatagagtta 2460 atgctgatta cacaaagttt gttacagaca aagagcacaa agtgggtagt ttattttttg 2520 ctaatgaaac aacatttgca ttctttttac aaatggagtt aatattcaga tcatttgagc 2580 cataccttaa ttcccaaaag aattgtaata ctatgctttt agagaaaatt aattctgatg 2640 aacaaattaa aagtctaact ttgtatccta gttgtcatca aattaaattg aagctaaaaa 2700 acaggtttat aacattcaga cttaaaatat ttagccttaa aaaaaagcgt ttattatgta 2760 aaaagaaaaa aactaaaaaa agttcttgtg aacttggcag caagagtatg acaatgcatg 2820 cattggcact aaaggtttag gttcaactaa tttataaaat aaactgtaaa gcttttttca 2880 aattacttat tgattacaac tacttttttg ctgtgttttt ttttaaaggt atttcctaaa 2940 taaataaata attttgtcga aaaataattt tgaaaaaata aaacataaaa aggtttaatt 3000 tttaaaaata tatttctgtt taaaatatta ttgtgctcat tttctttcaa ttgtttttac 3060 ttgaattcgt gaatgtcata cgctagtttg gcaaaaacat ggccgccgtt cttccgtatt 3120 ttccgtatgt ccgctctctg agtatatagg ctctcta 3157 // ID Transib-1_HM repbase; DNA; INV; 4186 BP. XX AC . XX DT 29-JAN-2008 (Rel. 13.01, Created) DT 29-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; RAG1L_HM; KW Transib-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 2687-3457 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX RN [2] RP 1-4186 RA Kapitonov V.V. and Jurka J.; RT "Transib-1_HM, a family of autonomous Transib transposons with a RT transposase closest to the RAG1 core."; RL Repbase Reports 8(1), 1-1 (2008). XX DR [2] (Consensus) XX CC Transib-1_HM is a young family of autonomous Transib DNA CC transposons were active in the hydra genome just a few million CC years ago (they are ~4% divergent from their consensus sequence). CC The consensus sequence was obtained based on a multiple alignment CC of ~50 copies; it codes for a 669-aa Transib transposase. Among CC known sequences of Transib transposases, the Transib-1_HM CC transposase is most close to the RAG1 core. A portion of it (pos. CC 2687-3457 in the consensus) was reported previously as a RAG1L_HM CC protein [1]. Like other Transib transposons, Transib-1_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats (they follow the 12/23 rule [1]). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1152..1559,1702..2200,2353..2678,2780..3553) FT /product="Transib-1_HMp" FT /note="Transib transposase." FT /translation="MATLEEYCLTEHQLFTLIIDNNVTHVSDLTTIIENET FT QARLGENEVGILKNILRKFQHLKERKFKKLKLEKDAFLKYGSKEVIIRISP FT KTKESSSVFEPADKNDEETSPYRMIKHWSALSRKHQLRRSQKLWENVQDFA FT KNETMDVKSVCEFLLTRCEGKNKTLKLKIPTVTAAAICYESNLGRMTYTQQ FT RKMLKANSIDVFPTWNEIQLFRNSITPLTTSYLPDPFVGVYASFIESTKIT FT IERILRLEGISVSSRENSLTLEVKFGYDGSGGHKIYNQLHNVNTSNIIMAM FT FCPLKILNHEKSTQRPLMIQMGKESNENIQSLSLFNEDIQCMKEDGIIVKH FT NQNDYHVKVNIVAHSLDRKASNIYQGLGGSYCDLCDLSKEQCENIQTVQDG FT FIINRDVKSMQQIFLDLVLHGLLRSFDHFMKVVVHMAAGVLTWKESKLNNN FT TTLLKEEKARLQEIIRVKTGIKWDFAESTGQKGNTTTGNVARKLLHNNVMR FT KVITDHISSQTDRKLVEKYGLLLSTILRAMSSGKKINIELYKQMCQELNIL FT LLQEFYWVSITPTLHKILAHSWELIEINDSTGLKSWSEEGMEANNKRLRYF FT REKLSRKINQLVNIEDCFKRLWLGSDPLVAEERSKGLLFCKNCLDRGHSKR FT SCMRRSASADDILNNLFVCK" XX SQ Sequence 4186 BP; 1607 A; 607 C; 633 G; 1339 T; 0 other; cacagtggtt tattttgcac tttttgtgga catttgacta tacttaggaa aaatctcttt 60 gtgcgacatg gaaaaacaaa acatggtcat gatatatgtt tttagcttaa tttttgaagt 120 tccatcatta aataatactc ctgagggcat accataactc taggacacca aaataacaca 180 ctgcaaaata ccctaaaaaa agtcagtaat aacatttttt cagataggta tacatagtca 240 aaattcatgt tttaagggta acctttcaag ctttatccat taaaaatact cttatgttca 300 tcaccatgat tgcactatag actctaggac acctaaaggg cactttaaag catgggttac 360 aagatataat acaggattct tatataaagt ctaaatacat gtttttacgt cagttggata 420 aaaaaatatt ccgctaaaaa tattcacaaa atcataaaat atataaagga cacaaagaac 480 acacactgaa aaacataaaa aaagacacta aaaaggaaag tcattaactt ttctaaaata 540 attatatagt cgcaatatgt ttttgaaata tttttttaag cttcatgaat taaaaaaatc 600 atataataac tgtaggacac caaaatgttt gatttaatta gttattgtct attaaaataa 660 agttaatata cgatgttaca tgtgttttta attgccatac atctaaagcg ataaatctag 720 ggccccataa aatttttaaa taaatgaaac attttttcta ttatatgtca aattttaaaa 780 gttcgatgat cttgttgtgg ttttaaatac gaagatttga agactaaaaa cttttaaatt 840 aaaagcacgt ttaagtagtt aagcagttaa gtccgtcatc agcaaactaa ttgtggtaat 900 tgaacgcatc ggagaaattc caatctcaca attaggaaac aatcagatac caggcaatta 960 ttataaaaca atttatttct accatcaaaa ttcatatttt tatttgtatg atcatttaaa 1020 cttatataaa tagagtaaat acttttatga taacattatt ttgaaataag aaagtaaaat 1080 taattataaa caattaaaag cacagtaaaa tatttaaaaa agaaaaataa taataacaaa 1140 aacaataaaa tatggcaaca cttgaagaat attgcttaac agagcatcaa ttatttactt 1200 taataattga caacaatgtc actcatgtat ctgatttgac taccataata gaaaatgaaa 1260 cgcaagctag gttaggagaa aatgaagttg gaatactaaa aaatattttg cgaaaatttc 1320 aacatttgaa agaaagaaaa tttaaaaagt taaaattaga aaaagacgct tttttaaaat 1380 atggatcaaa agaagtcatt ataagaataa gtccaaagac taaagaaagt agctcagtct 1440 ttgaacctgc tgataagaat gatgaagaaa ccagtccgta taggatgata aaacattggt 1500 cagccctatc aaggaaacat cagttaagaa gatcacaaaa gctatgggaa aatgtgcagg 1560 tattataatt agttgcatat aatatatata tatatatata tatatatata tatatatata 1620 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatatc 1680 atgttttatc attatttata ggatttcgct aaaaatgaaa caatggatgt taaaagtgtg 1740 tgtgaatttc ttctaacaag atgtgaagga aaaaataaaa cattaaagtt gaaaataccg 1800 acagtgacag cggctgctat ttgttatgag tcaaatttag gtcgaatgac ttatactcag 1860 cagagaaaaa tgctaaaagc aaacagtata gatgtatttc caacctggaa tgaaattcaa 1920 ttgttccgaa attctatcac tccattgaca acaagttatc ttccagatcc ttttgttggt 1980 gtctatgcaa gttttattga atctaccaag attactattg agagaatttt gaggcttgaa 2040 ggtatttccg tttcttctag agaaaacagc cttacactag aagtaaaatt cggctacgat 2100 ggaagtggtg gacacaaaat ttacaaccaa ctgcacaatg ttaacacaag caacattatt 2160 atggcaatgt tttgtccact taaaatttta aatcatgaaa gtattagatt gtgggaacag 2220 caatacccta actctactaa gtctaacagc aataccctaa ctctactaag tctaacagca 2280 ataccctaac tctactaagt ctaacagcaa taccctaact ctactaagtc taacagcaat 2340 accctaactc agagtctact caaaggccac ttatgattca aatggggaaa gagtcaaatg 2400 aaaatatcca atctctttct ttatttaacg aagacataca gtgtatgaag gaagacggaa 2460 ttattgttaa gcataatcaa aatgattacc atgttaaagt aaacattgtc gctcatagtt 2520 tagacagaaa ggcttcaaat atttatcaag gccttggagg ttcatattgt gatctttgtg 2580 acctttcaaa agaacaatgt gaaaatatac aaacagttca agatggtttc atcataaaca 2640 gggatgttaa atccatgcaa caaatatttc ttgacttagt aaatgaggac gggaccattt 2700 tgaaaaaaaa aaggattacg aaattcgaca cggtcaaact acatgtccaa ttgcatgttc 2760 gaatgttagg tccattcagg ttttacatgg tttgctcagg agctttgatc attttatgaa 2820 ggtagttgtt cacatggcgg caggtgtctt aacttggaaa gaatctaagt taaacaataa 2880 cacaacattg ctgaaagaag aaaaagcaag actacaagaa attatccgag ttaaaacagg 2940 aataaaatgg gattttgcag aatcaaccgg acaaaagggc aatacaacaa ctggaaatgt 3000 tgccagaaaa cttttacata ataatgtaat gcgaaaggtt attacagatc acatatcaag 3060 tcaaactgac cgcaaactag ttgaaaagta tggcttgctt ctttccacta ttctgagagc 3120 aatgtcatct ggcaaaaaaa ttaatattga gctatacaaa caaatgtgtc aggagctaaa 3180 catcttgctt ctacaagaat tttattgggt aagtataact ccaacgctgc acaaaattct 3240 tgctcatagc tgggagttga tagaaataaa tgattccacc ggtttgaaat cttggagtga 3300 agaaggaatg gaagctaata ataagcgtct cagatacttc agggaaaaac tatcaagaaa 3360 aattaatcaa ttggtcaata tagaagattg ctttaaaaga ctttggctgg ggtctgaccc 3420 attagttgca gaagaacgga gcaaagggct gttattttgc aaaaactgct tggatagagg 3480 gcattctaaa cgctcatgta tgcgcagaag tgcatcagct gatgatattt taaacaatct 3540 ttttgtttgt aaataaatta attttgtaac atatgtgtat ttgttaattt ttcaaaacca 3600 agttcttaga ctttaacctt aagatggcaa ttttaaaact catttttttt aaaaaccgct 3660 aatggggcca acttaagcat agcagcaaat gaaagctttt attttgtatt tcggaaatca 3720 tataaggttt agtatggcgc cattagcggt ttttgagata ttcaccgaaa tgttcaaaaa 3780 ctataacaaa atcatgtgca aaaattcaca gtttttatat acattttatg aaaattatcc 3840 aattttcttt ctgaaatact attattatta gccagaaaaa tattttttta tgtaggctct 3900 ttatccctta ttgttaaata ataattttcc aagttttaga tcaataatct tttttttttc 3960 atcacttttt tccgagtgat actcaaaaac tctgttgggt tttaaagtgt gttttttctt 4020 ccaaattgtt caagtttgca tggccgtaac tttttaaata gacaaaaatt ttaacttttt 4080 ttttatagct taaatctgca tgctatttaa gtatacaaaa attaagttaa attttaattt 4140 caaaaaatta aattttttac tcatgtccac catttaaacc actgtg 4186 // ID Gypsy-29_AA-LTR repbase; DNA; INV; 228 BP. XX AC supercont1.18; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_AA_; KW Gypsy-29_AA-I; Gypsy-29_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.18; Positions 752756 752529. XX SQ Sequence 228 BP; 57 A; 56 C; 59 G; 56 T; 0 other; tgtgtgtatg tggaacaacc ctacataatt ggagatgtga gatgtgtcaa catcacatct 60 gtaacgcatc gggtggaacc aatacaaatg caactcaggc ctcgtgcccg attacgcatt 120 gggctcggcc tctttcttgt ttcagccatc aaccggagta gacgtatcta aggtgagtag 180 cagcgctcca tgatgggatt ccgctggcgt catcggcgga atcacaca 228 // ID Gypsy-6_AC-I repbase; DNA; INV; 4133 BP. XX AC AASC02015807; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_AC_; KW Gypsy-6_AC-LTR; Gypsy-6_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02015807; Positions 95002 99134. XX CC Positions [3091-3429] - Integrase core CC 'GGAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 55..3978 FT /product="Gypsy-6_AC-I_1p" FT /translation="MAEHDTDVNIHQAGKTTVHRRVSLPPPKPFDGKAESW FT PRWRQRFNRFRSCTGLQNKSQPEQVSTLLYTMGEVADDILTTLVVEESTST FT YTEVIEAFDTHFDARKNIIFARAKFNKRTQLPGESVDIFIQDLHRLADDCG FT YGALKDELIRDRIVVGVRDDDLSKQLQLKLKLTLSEAIQISRQAEARDESQ FT TLVRPRVELVTTKPHNMRSASSASRPLHKPHTPHTGSKRPCGYCGRLPSHK FT RDACPAKSAVCGSCGKRGHYKAVCRSHPKVQEVGINSYHIAEAPDFLGSIT FT VNEVDSPDEWSAIIKLNSVPTKFKLDTGAAVSVIGDRSVDTSQPLNSCGKI FT LKGPGDTTLSTLGTFEADLKFKEKAMKETIFVVKDQPHALLSRSACVKLGL FT IARLNAVHETSSDFKREFPDVFKGLGQLKDPYTIKLQEGADPVCLYTARKV FT AHPLLPKVKTEIDRMLAEGVISPVTEPTEWCSGMVVVPKRNGSVRICVDLT FT SLNKAVRREVHPLASVDEQLAKLAGSTVFTKLDATSGFWQIPLDPDSRRYT FT TFITPFGRFCFNRLPFGISSAPEIFQRKMCNILDGLPGVICHIDDILIHAP FT DQDTHNKRVRQVLQRLQRAKITLNDKCEFSQPSIKFLGHIVNNTGLKVDPA FT KVDAIRKFPQPKNVTELQRFLGMVNQMAKFSPNLATATEPLRALLKKDSLW FT TWGEHQEASFQCTKDGLTTTPILAHYSPDRETIVAADASMSGLGAVLLQIQ FT DDGSRRPVSYISRALTDAEKNYAVIEKEALAATWASERFSEYILGKQYTLE FT TDHKPLVPLLSTKELHKMPPRIQRFRLRLMRFSPNVLHVSGKHQITANALS FT RAPTNLPSDAEALFVDEVADFAQQAIDSLPASPQRIQNIISQQKSDPETSE FT VRTYCSRGWPAYMPENPLLKQYWTNRHHFSIVDDILLFDDRLVIPRDLRMD FT ILSRLHESHLGITKCRALAQTSVWWPNITSQIEDMVKKCNTCAKLRPVQKE FT PLLPSSFPDRPWSRLAMDLFDLKGQTYIVVVDYYSRWVELRLLEQLNSVFV FT INKLKSIFATHGIPETIVSDNGPQFSSASFQAFAKEFGFIHVTSSPRYPQS FT NGEAERAVQTVKLLLKKANDPYTALLLYRATPLQNGYAPSELLMGRKLQTK FT VPIMQDNLHPSYVDLPSIRQREEKRKEIMRLQYNKRHGVKSLPELCPGDYV FT HIRDLGRPGIVNNRHPNPRSYTVTTEKGSTVRRNRSHLVATPEPTTTQEPG FT DNTPSSTRQPDTPSPARESAQTPAAPVQTPRVSRYGRKIRVPTRLDL" XX SQ Sequence 4133 BP; 1175 A; 1104 C; 929 G; 925 T; 0 other; tggtggcagc ggttgaacac ggtatataag tgtcatttct ttactttttg aatcatggct 60 gaacacgaca cggatgttaa cattcatcag gccggtaaaa caacagttca tcgccgcgtg 120 tctctacccc cgccgaagcc ctttgatggg aaggcagaat cgtggcctcg ttggagacaa 180 cgtttcaaca ggtttcgttc gtgcacgggt ctgcagaaca agtcgcagcc agagcaggtc 240 agtactctac tgtacactat gggggaggta gccgacgaca tcctcacgac tttggtagtc 300 gaggaaagca cctctacata cacagaagtg atagaggcgt ttgacactca ctttgatgca 360 cgcaaaaata tcattttcgc tcgtgcaaaa ttcaacaaac gcactcagtt accgggagag 420 agtgtcgata tattcatcca agacttgcac aggctcgcag atgactgtgg ctatggagcc 480 ctcaaagatg agctcataag agacagaata gttgtcggag taagggacga tgacctgtcc 540 aaacagctcc aattgaagct caaactcacc ctcagcgagg ctatacaaat aagccgacaa 600 gctgaagctc gtgacgaaag ccagacactc gtacgacctc gagttgagct agtcacaaca 660 aagccacaca acatgcgtag tgcgtcgagc gcttcaaggc ctctgcacaa accacacact 720 ccacacacag gttccaagag accttgtgga tactgtggtc gcctccctag ccacaaacga 780 gacgcctgcc ctgccaagtc ggcagtctgt ggttcctgtg gcaagagagg ccactataaa 840 gcagtttgcc gtagtcaccc caaagttcag gaagtcggaa tcaactccta ccacatagct 900 gaggccccag actttttagg ttccatcact gtgaacgaag tcgacagccc agatgagtgg 960 tctgctatca tcaagttgaa ttccgtccca acaaagttca agcttgatac tggtgctgcg 1020 gtctctgtca tcggagacag aagtgtggac acaagtcaac ctctcaatag ctgcggcaaa 1080 attctaaaag gccctggtga cacaactctg tctactcttg gcacctttga ggctgatcta 1140 aagttcaaag aaaaagccat gaaggagaca atattcgttg tcaaagacca gccacatgcc 1200 ctactcagca ggtcagcttg tgtaaagctg ggtctcattg ccagactcaa tgcagtccac 1260 gagacatcat ctgatttcaa acgagagttc cctgatgtct tcaaaggact aggacaactg 1320 aaagacccat acacgatcaa gctacaagaa ggcgctgatc ctgtgtgctt gtacacagca 1380 cgaaaagtcg cacaccctct actgccaaaa gtcaagactg aaattgaccg aatgctggct 1440 gaaggcgtga tctcacccgt aactgagccc acagaatggt gttccggcat ggttgttgtg 1500 ccaaagagaa acggctcagt gagaatatgc gtggacctca ccagtctaaa caaggcagtg 1560 cgacgagaag tccatcctct ggcatccgta gacgaacagc ttgccaagct tgcagggtct 1620 acggtattca cgaagttgga tgcgacgagc gggttctggc aaatccctct cgatccagac 1680 tccagaaggt acactacctt cataacaccg tttgggagat tctgctttaa ccgtcttccc 1740 ttcggcatat cctcagcccc agaaatcttc cagaggaaaa tgtgtaacat tctggacggc 1800 cttccaggtg tcatttgtca catagatgac attctcattc atgcacctga tcaggacact 1860 cacaataagc gagttagaca agttctgcag cgtctccaga gagctaagat cactctaaac 1920 gacaagtgtg aattttcaca gccatcaatc aagtttctcg ggcacattgt gaacaacaca 1980 ggcctcaagg tagatccagc aaaagtggac gccattcgaa agttccctca gcccaagaac 2040 gtcactgagt tacaacgctt tctgggaatg gtgaaccaga tggcaaaatt ctctcctaac 2100 ttggctactg caactgagcc tttgcgagca ttgctcaaaa aagactcatt gtggacctgg 2160 ggagaacatc aagaggcatc ttttcaatgc accaaagatg gcctcaccac aacaccaata 2220 ctggcacact actcgcctga ccgggagact atcgttgctg cagatgcttc catgtctggc 2280 ctcggggcag tacttctaca aatacaagat gatggttctc gcagacctgt cagctacatc 2340 tcacgagctc tgacagatgc agaaaagaat tacgcagtca tcgagaagga agccctagca 2400 gcaacatggg catcagaacg cttcagtgaa tacatccttg gaaagcagta tacgctggag 2460 actgatcaca aacccttggt ccctctcctc tctaccaagg aactccacaa aatgccacca 2520 aggatccaac gattccgtct gaggctgatg cgattttcac caaatgtgct gcatgtttca 2580 ggaaagcatc agatcactgc gaacgcactt tctcgtgctc cgacaaatct gccttcagat 2640 gctgaagcac tcttcgttga cgaagtggca gacttcgctc aacaggctat tgacagcctc 2700 cctgcttcac cacagagaat tcaaaatatc atctcacagc agaagagcga cccagaaacg 2760 tcggaagtcc ggacatactg ttccagaggc tggccagcat acatgcctga aaatccactt 2820 ctgaaacagt actggactaa tcgtcaccat ttcagcattg tggatgatat ccttttgttt 2880 gacgatcgtc ttgtcattcc acgggatttg agaatggaca ttctcagtcg acttcacgaa 2940 agccatcttg gcatcacgaa gtgccgagct ctagctcaaa cttcagtctg gtggcccaac 3000 atcaccagtc agatcgagga catggtgaag aagtgcaaca cttgtgcaaa acttcgacct 3060 gtccagaagg aacctctgct cccttcctcg tttccagacc gaccttggtc tcgattggcc 3120 atggatctgt ttgacctgaa aggtcagacc tacatcgtcg tggtcgacta ttactctcgt 3180 tgggtagagc tacgcttact cgaacagctc aacagcgtat tcgttatcaa taaactcaag 3240 tccatcttcg ccactcatgg catcccagag acaatcgtgt ccgacaatgg ccctcaattt 3300 tcaagcgcca gcttccaagc ttttgcaaaa gagtttggtt tcattcacgt caccagctct 3360 ccgagatatc ctcagagcaa tggtgaggca gaaagagcag tgcaaacagt gaagcttctc 3420 ctcaagaaag ccaacgatcc gtacactgcc ctcctgctat accgggctac tcctttacaa 3480 aacgggtatg ccccaagtga gctcctcatg ggaaggaaac tccaaaccaa agttcccatt 3540 atgcaggaca acttacaccc aagttatgtt gatctaccct ctattcgtca aagggaggag 3600 aagaggaaag agatcatgcg tttgcaatac aacaagagac acggtgtcaa atctcttcca 3660 gaactttgcc ccggtgatta tgtccacatt cgagacctag gccgaccggg catcgttaac 3720 aacagacacc ccaatccgcg ctcctacaca gtgaccactg aaaaagggag caccgtccgc 3780 cgtaacagaa gtcatctcgt ggctacccca gagcctacta caacacaaga gcccggagac 3840 aacacaccgt cttcgacacg gcaacctgat accccaagcc ctgctcggga gtcagcgcaa 3900 acgccggcgg ccccagtgca gacaccacgg gtcagccgtt acggcagaaa aatcagagta 3960 ccaacgagac tagacctgtg acccagtgac tgtgtgacct atatttagct ctgtgcttta 4020 aattctaatc taagtctgaa gaactcttta attaagtgtt gtaaacaagt gaaacttata 4080 gagatctatt aaaatagatg attaagtact gggacaataa cttcgggggg aga 4133 // ID BEL-172_AA-I repbase; DNA; INV; 6170 BP. XX AC supercont1.269; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-172_AA_; KW BEL-172_AA-LTR; BEL-172_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6170 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.269; Positions 1510876 1517045. XX CC 'AATTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 22..6168 FT /product="BEL-172_AA-I_1p" FT /translation="MSNASSNPDTIYGCAGCDRPDTADDIVQCDKCAAWWH FT YSCAKVDASVADRTCWMCAKCTTPPPPPRSTTIRTTSSNRRALLAINLQRL FT AEEKELRKKELEISMEKEFVKAKYDLLEQCALDDEAESHSVRDRIEEFEAR FT DRERHVNDWVGQVAASLQKSRSLTDPKILSIPKADSTPADPFHLGKATAVD FT TTKTNVVLDVIDDRLSEVNDRHKNKEPPRRAVVVKRSQNRRDENATANENQ FT KQLEMQLQKCQEISISDHQIIKDLQNQLRRCQLQLQQRSEEPQVPDPQSIW FT PLRDVSRGAVPKSKPTIQTMPAPVESQHEQSRTNFARSTEIDLTSIPQQPQ FT LDNYGSHEQSRTNFARSTEIGLASTPQQTQSSNHGGHEQMRTNFARSAEIG FT LANSPQILQSSSYNQVADSLFLEVRRPSPEQLAARQVMPRDLPDFYGDPEE FT WPLFISSLRNSTSACGYSRVENLARLQRCLKGNALKSVRYNLLDPESVPEV FT IRTLQTLYGRPEVIISKLIKSVRDAPAPKSERLETLIDFGMAVRNLVCHLI FT AADQRSHLSNPVLLQELVEKLPASVKMQWAQHLIQFPDMTLQTFSNFMSSV FT VESVSKVVLYTGSQSHRPEKQRSKEKGYSHAHVEAIEPGQQKIDSPKPCPV FT CGKADHRIKDCDSFKRSSVDSRWKTVSSLKLCRCCLGQHGRRVCRSSARCE FT VDGCQYRHHRLLHPPKQTNSYNASTTNTTESHTYHHCEQSILFRIIPVTVT FT GPSKTIDTFAFLDEGSSATLVEQSLAEQLELQGPVIPLCLKWTADVSRSEE FT RSQIVSLEISESGRARKYRLDNARTVERLNLPTQTLCFEELQEKYQHLAGL FT PIRSYTKAIPRLLIGLRNLSLAVPQKIREGKGGPIAVKTRIGWCVYGNLTE FT NREQNHFSYHICECDAGDKLDNLIREYFDAEDTGLRFTEQIESDEMQRAKS FT ILERTTKRDKNRFTTGLLWKYDHIELPDSFPMALQRLKCLERRMARDPILK FT DNLHKQLKEYEHKGYAHQASTAELNEADPRRTWYLPLGTVVNPKKPTKVRL FT IWDAAAKVDGVSLNTFLLPGPDLLVSLPAVLFRFRLYPVAVCGDIKEMFHQ FT IGVIAADRHAQRFLWRETPEEPPKVFLMDVLTFGSASSPSSAQFVKNRNAK FT EHEAQFPRAAEGIVKCHYVDDYLDSFEDEVDAKLVSEQVRHVHLNGGFEIR FT NWSSNSMVVLEHLGEASKVAMKDLTAAGNGESEKVLGMLWITETDMLCFST FT TFRADIDELIRSKARPTKRQILRTVMSLFDPLGLLASFLVHGKILMQDVWR FT SGIKWDERVDDRTEQRWHNWIELFARVDEIKIPRCYFVTASTERYGTLQAH FT LFVDASEAAYSAVVYFRIIDGEGNPQCSLVTAKTKVAPIKYVSVPRLELMA FT AVLGARLLAFVGDNHTVPIQQRFCWSDSCTVLAWLRSEHRRYKQFVACRVG FT ETLSLTKVNEWRWVPSRLNIADEATKWGKGPCFDADSRWIQGPKFLSLPEE FT HWPRMEPAELLTDEELRPCHLYHEVVTPLIEFERFSNWNRLLRSVAFVFHP FT FTVYKARKLNINVKQGPTHEDLKAAEILIWRTVQSTVYPDEITTMSQNRNL FT PSSQQRPLEKSSPLYKLTPMMDEDGVLRIDSRTGAARVSTFDLKYPVILPQ FT KHHVTRLLIDHYHRKFLHCNAETVVNELRQKFYIPRIRVAVKTESRLCQWC FT KVPKMGPLPEARLSPGVRPFSYIGIDYFGPILVKVGRSSAKRWVCLITCLT FT IRAVHVEVAFDLSTQSCIACIRRFVCRRGAPLEIYSDNGRNFVGADGVLRE FT QMKRIDEEAAATFTNAQTKWCFIPPSAPHMGGSWERLVRSIKAALINIPQD FT RKLDDEALLTYLAEAESIVNSRPLTYLPLDTPEQEALTPNHFLLGSSSGVK FT QPTRDLGCPTRLLRNTWDTLQVAIDQFWQRWIREYLPTLTRRTKWFGDHKS FT VDSGDLVVIIEDMKRNGWVRGRVLEVIKGKDRRIRQAVVQTSSGVFKRPVS FT KLALLDVAVSSKADVDTHPYGEG" XX SQ Sequence 6170 BP; 1730 A; 1502 C; 1555 G; 1383 T; 0 other; aaactctaag aaattcgacc gatgtcgaat gcaagctcca atcccgatac catctatgga 60 tgcgctggct gcgatcgccc cgatactgcc gatgatatcg tgcagtgtga taaatgtgct 120 gcttggtggc actattcctg tgccaaggtc gacgcatccg tagcagatcg cacttgctgg 180 atgtgcgcca aatgtacgac cccaccgcca cctccgcgat ctaccacgat cagaacgacg 240 tcatcgaatc ggagagcctt gctagccata aacctacagc ggctcgccga agagaaggaa 300 ctgcggaaaa aggaattgga gatcagcatg gagaaagagt tcgtgaaggc caaatacgat 360 ctactggagc agtgtgcgtt agatgatgaa gccgaatctc acagcgtacg cgatcgaatc 420 gaagaattcg aagcccgaga tcgagagagg cacgttaacg attgggtagg tcaagtagca 480 gcgtcgctcc agaagagcag atcgctcact gacccgaaga tcctctctat tcccaaagcc 540 gacagtactc ccgccgatcc tttccacttg ggaaaagcga ccgcagtaga cacaacaaaa 600 acaaatgttg tactcgacgt aatcgatgac cgattgagcg aggtgaatga ccgccacaaa 660 aataaagagc cacctagacg cgctgttgtt gtgaaacgga gccaaaacag acgagatgaa 720 aacgcaacag ccaacgaaaa tcaaaaacaa cttgaaatgc agcttcaaaa gtgccaggag 780 atttccatca gtgatcacca gatcataaaa gatttacaaa atcagcttcg tagatgtcag 840 ctgcaactgc agcagcgcag tgaggaacca caagtacctg atccgcagtc gatttggccg 900 ttacgagacg tttcaagagg agcagttccg aagtcaaaac cgaccatcca aactatgcca 960 gcaccagtag agtctcaaca cgagcagtcg cgtaccaatt tcgcgcgctc aacggaaatc 1020 gatctcacca gtattccaca gcaaccacag ttggataact acggtagtca cgagcagtcg 1080 cgtaccaatt tcgcgcgctc aacggaaata ggtctagcca gcactccaca gcaaacacaa 1140 tcgagtaacc acggaggcca cgagcagatg cgtaccaatt tcgcgcgctc agcggagatc 1200 ggccttgcca atagtccgca gattcttcag tcgagtagct acaatcaagt agcagattct 1260 ttgttcctgg aagtgcggcg accgtcacca gagcaactcg cggcgaggca agtgatgccc 1320 agggatctgc cagacttcta tggagaccct gaagaatggc cgttgtttat aagcagtctt 1380 cgcaacagta catccgcttg cgggtacagc agggttgaaa accttgctag gctgcagcgt 1440 tgcctaaaag gcaacgcact caagtcagtg cgatacaact tgctcgatcc ggagtctgta 1500 ccggaagtca ttcggacgct gcaaacacta tatgggcgtc ctgaagtgat tataagcaag 1560 ctgatcaaaa gcgttcgcga tgccccagca ccaaaatccg agcgattgga gacgctaatc 1620 gattttggca tggcagtcag aaatttagtc tgccatctaa tcgccgcaga tcagcgatcc 1680 cacctctcca accccgtgct gttgcaagaa ctggtggaaa aactgccagc cagtgtgaag 1740 atgcagtggg ctcaacatct tatccagttt cccgatatga ctctgcaaac cttcagcaat 1800 ttcatgtcat ctgtggtgga atcggtgagc aaagtggtgc tgtacactgg tagccaaagt 1860 caccgaccgg aaaagcagag atccaaggaa aagggatatt cgcatgccca tgttgaagcc 1920 atcgaacctg ggcaacagaa gattgattct ccgaaaccct gtccggtatg cggaaaggct 1980 gatcatcgga taaaggactg tgactcattc aagaggagca gcgttgacag tcgatggaaa 2040 acggtttcat ctttgaagct ctgtcgctgt tgtttgggtc aacatggacg aagggtttgt 2100 agaagctcag ctcgatgtga ggtcgacggt tgccaatatc gccatcaccg actattacac 2160 ccgccgaagc aaacgaatag ctataatgct tcgaccacta acacgacgga aagccacacg 2220 taccatcact gtgaacaatc catacttttc cgcatcattc cggtgactgt gaccggacct 2280 tcgaaaacca tcgacacttt tgccttcttg gatgagggat cttccgctac attagtggaa 2340 caaagtttgg cggaacagct agaactccaa ggtcctgtaa ttcctctttg tttaaaatgg 2400 acagccgatg tatctcgttc ggaagagagg tcacaaatag tttcgttgga aatctcagaa 2460 tcaggacgcg ctaggaaata ccgcttggat aatgccagaa cagtggaacg cttaaatctt 2520 cccactcaaa cgctctgctt tgaagaactt caagaaaagt accagcacct ggctggattg 2580 ccgattcgta gttacacgaa ggcgattcct cgactgctga tcggactgcg gaacttgtcg 2640 ttggccgtgc cacaaaagat cagagaaggg aaaggaggtc caatagcagt gaaaacacgt 2700 atcggttggt gcgtttacgg aaacctgact gagaatcgtg aacagaatca tttcagctac 2760 catatctgcg aatgtgatgc gggagataag ctggacaacc tgattcgcga atatttcgat 2820 gcagaagaca ctggtctacg ctttacagag cagattgaat cggacgaaat gcaaagagcc 2880 aaaagtattc tcgagagaac tacaaagagg gataaaaacc gcttcactac cggactgttg 2940 tggaaatacg atcatatcga gcttcccgac agtttcccaa tggctcttca gcggttgaag 3000 tgcttagaga ggcgtatggc acgcgatccg atcctcaaag acaacttgca caagcagctt 3060 aaagaatacg agcacaaggg ttacgctcat caagcctcaa cagccgagct aaacgaggct 3120 gacccccgac gaacgtggta tcttccgttg ggtaccgtcg tgaacccaaa aaagccaact 3180 aaggtacggc ttatttggga tgctgccgcg aaggtcgacg gggtgtcatt gaatacattt 3240 cttctgccag ggcccgatct tctggtatca ttacctgccg ttttgtttcg atttcgccta 3300 tatccggtag cagtctgcgg tgatatcaaa gaaatgtttc accaaattgg tgtcatcgca 3360 gctgaccgac atgcccagcg ctttctttgg cgagaaacgc ctgaagaacc tcccaaggtc 3420 tttctgatgg acgtgctaac ctttggctcg gcgagctcac cttcgtctgc acagttcgtc 3480 aaaaaccgaa atgcgaagga acacgaagcc caatttccta gggcagctga ggggattgtg 3540 aagtgtcatt atgtcgacga ttatctcgac agttttgagg acgaggtgga cgcgaaacta 3600 gtttctgaac aagttcggca cgtccatttg aacggtggat ttgagatccg aaactggtcg 3660 agcaacagca tggtggtact cgaacatctc ggtgaagcat cgaaggttgc aatgaaggac 3720 ctaacagcag cgggaaacgg agaatcagaa aaagtactcg gaatgctctg gataaccgaa 3780 acagacatgt tgtgtttttc cacgacgttc agggcggaca tcgatgaact tattcgctcg 3840 aaggctagac cgacaaaaag gcaaatacta cgcacggtaa tgagtctatt cgacccacta 3900 ggacttctgg catcatttct agtgcatggt aaaatcctca tgcaggatgt ttggcgaagc 3960 ggtataaaat gggacgaacg cgtagacgac cgaactgaac agcggtggca caactggatc 4020 gagctgttcg cgcgagtaga tgagatcaag attcctcggt gctacttcgt gacagccagc 4080 actgaacggt atggaactct acaagcacac ctgttcgtcg acgccagcga ggctgcatat 4140 tctgccgtcg tttacttcag gatcatcgat ggagaaggca atccacaatg ttcgctggtg 4200 actgctaaga ctaaagtcgc acccatcaaa tatgtatccg ttccgcgttt ggaattgatg 4260 gccgccgttt tgggagctcg attactagca tttgtaggag ataaccacac ggttccgatt 4320 caacaacgtt tctgctggtc tgattcctgt accgtattgg cctggctgcg ttccgagcat 4380 cgcaggtata aacaattcgt agcgtgtcgt gtaggagaaa ctctttcgtt gacgaaagtt 4440 aatgaatgga gatgggtgcc aagcaggttg aacattgcag acgaggccac aaaatgggga 4500 aagggaccct gtttcgacgc agacagtcgt tggatccaag gtccgaagtt cttgagctta 4560 ccggaggaac attggccgag gatggaacca gcagaactct tgactgatga ggaactacgt 4620 ccatgccatc tgtaccacga agtagttact cctctgatcg aatttgaacg gttttccaac 4680 tggaatagat tgttgcgatc agtggcattc gttttccatc ctttcaccgt ttacaaggcc 4740 cggaagttga atatcaacgt gaagcaagga ccaactcatg aagatctgaa agcagccgaa 4800 attctgatct ggagaacagt gcagagtacc gtctatccgg atgaaataac cactatgtca 4860 caaaaccgaa atttgccgag cagccaacaa cgcccactgg agaaatcgag ccctctctac 4920 aaactcacac ctatgatgga cgaagacgga gtcttgcgta tcgatagccg cacaggagcg 4980 gctcgggtga gtacgtttga cttgaagtat cctgttatac ttccacagaa acaccacgta 5040 acccgtttgt tgatcgatca ctaccatcga aagttccttc actgtaatgc agagactgtc 5100 gtgaatgagc ttcgacagaa gttctacata ccccgaattc gagttgccgt gaagactgaa 5160 tcaaggctgt gccaatggtg caaagtgcca aaaatgggtc cgctacctga ggctcgctta 5220 tctcccggag tacgaccgtt cagttacatc ggaattgact attttggacc gatccttgtg 5280 aaagtgggtc gctcaagcgc aaaaagatgg gtctgtctca tcacctgctt aactattcgc 5340 gcggtgcacg ttgaggttgc ctttgaccta tccacccaat cttgcatagc ctgcatccgc 5400 agatttgtct gtcgcagagg agcaccgttg gagatatact ctgacaatgg acgaaacttt 5460 gttggagctg atggagttct acgtgaacag atgaagcgca tcgatgaaga ggctgcagca 5520 acgttcacta atgcacagac gaaatggtgt ttcataccac catctgctcc ccatatggga 5580 ggatcatggg agcgccttgt gcgttccata aaggcagcat tgatcaacat accacaggac 5640 agaaagctag atgatgaagc tctgttaaca tatctagcag aagcagaatc catcgttaat 5700 tctcgtcctt taacgtactt gccgcttgat acgcccgaac aagaggccct cacgcctaac 5760 cactttcttt tgggcagctc aagtggcgtg aaacaaccga caagagactt aggatgccca 5820 actcgcttgc ttcgcaacac ctgggatacc ctgcaggtgg ccatcgacca gttctggcag 5880 cgctggatcc gggaatacct ccctacactt acaagacgaa cgaagtggtt tggtgatcat 5940 aagtccgtcg attctggtga tttggtggtc atcatcgaag acatgaaaag gaatggatgg 6000 gttcgaggac gagttcttga agtgattaag ggaaaggaca gaagaattcg ccaggctgtt 6060 gttcaaacat cgagtggagt gttcaaaaga ccggtttcca aactggcttt gctggacgtc 6120 gcagtaagca gtaaggcaga cgtggacact catccttacg gggaggggaa 6170 // ID CR1-45_HM repbase; DNA; INV; 4445 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-45_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4445 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1873-1873 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 71..763 FT /product="CR1-45_HM_1p" FT /translation="MATKNFTSAQLKDILEIHENTIMKMFNDRIEKLENKL FT NSVIDENKELKKELGELNKAVEFVSNNYDKIVADHQTSKTSLINEYANSNM FT INKLAELEDRSRRNNLRFCGIEEVENESWEDSENKVREFLNNKLQLHGKFE FT IERAHRVGKKLPENNKKNRSIVVRFLNYKDKATILRKYTTMKLWTQKVYIN FT EDYSDHTMELRKKLLNEAKDLRAKGKYAKVIYNKLITRDL*" FT CDS 810..3893 FT /product="CR1-45_HM_2p" FT /translation="MDSNETLNFESSFYNFFQTNDFLLDDESDPDLNYFSN FT SGALQNKCNYFYTHEIKGFLDQNNINTIHINIRSLKKNFEIFRNFIEETCN FT LFNIICLTETWCSSDDVNFFTNFELPGFNVISLARKTNKRGGGVLIYVKNN FT LRYFTRHDMSISDADKEVLTIEILTNKIKNKILSCCYRPPSGEIKNFNSFL FT CNDVIKKSNHENKFIYLVGDLNLDCFQYHVNNNIKRFYNGIFENGAIPLIS FT KPTRITQSSASLIDNILTTDVFNESLKKGIIKNDISDHFPIFFSINIDNKL FT VPNEKRVFKKRFFTNENLKSFKEQLSLIDWSIINTSDDINLVYNSFFKSFY FT DIYETNFPEVNLNLKAKSIKSPWITKGLRKSSKIKQKLYINYLKSKTNENK FT IIYKNYAKLFESLKKKQKKNYYLNLLNIYKLNFKRTWFIIREITGSKKCTS FT HSLPNTVKHNNEFLYDKRQITEEFNKYFVSVGPNLAKRIPIANGLMNDLRF FT PLNSYLNSFELSFEEFENAFKMLKSNKAVGPDGINGNIIISSFDVLKDILF FT KIFSISIKQGIFPHALKLAKVIPILKSGDIENISNYRPISLLSVFSKVLER FT ILYNKIYSHLTLNNLLYSNQYGFQKNNSTEHAILQFTRNISDSFENSQFTL FT GVFIDLAKAFDTIDHEILFKKLEWYGITGKILIWLKSYLNNRKQFVYANDN FT VSSSLLNISCGVPQGSILGPLLFLIYINDLPKASNLMTIMFADDSNLFLSH FT NNIFTLFSNMNIELVKISEWFRLNKLSLNIDKTKWILFHPYGKKHQLPSNL FT PFLFIDNIVINRVLVTRFLGVYIDENLTWKCHIANLCSKISKSIGILYKIR FT NVLDKNTLIQLYYSLIHCHINYANIAWGSTYKSKLKPLYRQQKHVARLINF FT KDRFAHAKPLLYDMKALNIYELNVFNILCFMYKCKTNLSPVSFRNLYLQRD FT RNKYILRNDNLIRQPFSQTNFGKFFISFRGPFLWNSIVLNTSKDFSQECNF FT DSFKQNLKKLIFSTENILIYF*" XX SQ Sequence 4445 BP; 1735 A; 591 C; 565 G; 1554 T; 0 other; ttcaccgcga acattcaccg cgaacggacg tgtttttttt tttattacta gttatcaaaa 60 aaatttcaaa atggctacta aaaattttac atctgcacaa ttgaaagata ttttggaaat 120 tcatgaaaat actattatga agatgtttaa tgatagaatt gaaaagcttg aaaacaaatt 180 aaactcagta attgacgaga ataaggaact aaaaaaggaa ttaggcgaac ttaataaagc 240 tgttgaattt gtaagtaata attatgataa aatagttgct gatcatcaaa catcaaaaac 300 atcattgata aatgaatatg caaattctaa catgattaac aaattggctg aacttgaaga 360 cagaagtcga cgcaacaact taagattctg tgggattgaa gaagttgaaa atgaaagctg 420 ggaggacagt gaaaataaag taagagaatt tttaaataat aaacttcagc tacacggaaa 480 atttgaaatt gaaagagcac atcgagtcgg aaagaaacta cctgaaaaca ataaaaaaaa 540 cagatcgatc gtagtcagat ttcttaacta caaagacaaa gccacgatat taagaaaata 600 cacgacgatg aaattgtgga ctcaaaaggt ttacataaat gaagattata gtgatcatac 660 gatggaattg cgaaaaaaac tcctgaatga ggcaaaagat ttaagggcaa agggtaagta 720 cgctaaagtt atatataata aattaatcac acgggattta taaaagaaat tctttgatct 780 tatttcgatt atgaaaaaac aaaaataaaa tggattctaa cgaaacctta aattttgagt 840 ctagttttta taattttttt caaacaaatg atttcttatt agatgatgaa tctgatcctg 900 atcttaatta tttttctaat tctggtgctt tgcaaaataa atgcaactac ttttataccc 960 atgaaattaa aggatttctt gatcaaaata atattaacac aattcacatt aatatccgca 1020 gtttaaaaaa aaattttgaa atctttcgta attttattga agaaacttgt aatctattta 1080 atataatttg cttaacagaa acatggtgca gttcggatga cgtaaatttt tttaccaatt 1140 ttgaactgcc gggttttaat gtaatttcat tagcacgaaa aacaaataag cgtggcggag 1200 gcgttcttat ttatgttaaa aataacttac gatattttac ccggcatgac atgagcatct 1260 ctgatgctga taaagaggtt ttaacgattg aaattttaac caataaaata aaaaataaaa 1320 ttctaagttg ttgttatcgc ccaccttcag gtgaaattaa aaattttaat tcgtttttat 1380 gtaatgacgt tattaaaaaa agtaaccacg aaaataaatt tatctactta gtgggtgatc 1440 tgaatttaga ttgttttcaa taccacgtca ataataacat taaaaggttt tataatggca 1500 tttttgaaaa tggtgcgatt cccttgatta gcaaaccgac aagaattact caatcaagtg 1560 cctctttaat tgataatatt ttaaccactg atgtttttaa cgaatcctta aaaaagggca 1620 taattaaaaa tgacatatct gatcattttc ctattttctt ttctataaat atagataata 1680 aattagttcc taatgaaaaa cgagttttca aaaagcgatt ttttacaaat gaaaacctta 1740 aatcttttaa ggaacaatta tctctaattg attggagcat tattaacacc tctgatgata 1800 taaacttagt ttataactcc ttttttaaat ctttctatga tatttatgaa actaatttcc 1860 ctgaagttaa tttaaatctc aaagctaaaa gtattaaatc gccttggatt acaaagggtt 1920 tgcggaagtc ttcaaaaatt aaacaaaaat tatacattaa ttacttaaaa tctaaaacaa 1980 atgaaaataa aattatatat aaaaattacg ctaaactttt tgaaagcctc aagaaaaaac 2040 aaaaaaaaaa ttattattta aatttactaa atatatataa attaaatttt aaacgcactt 2100 ggtttataat aagagaaatt actggcagca aaaaatgtac atctcactct ttgccaaaca 2160 cggttaaaca taataatgaa tttttatatg ataaaagaca aattacggaa gaatttaata 2220 aatattttgt gtctgtagga ccaaatctgg caaaaagaat tcctatagct aacggtttaa 2280 tgaatgatct acgtttccct ctaaactctt acttgaactc ttttgaatta tcttttgaag 2340 agtttgaaaa tgcctttaaa atgttaaaat ctaacaaagc agtaggccct gatggtatta 2400 atggcaacat tattataagt tcatttgatg ttttaaaaga tatactcttt aaaatttttt 2460 caatatcaat taaacaggga atttttccac atgctctaaa attagcaaaa gtcataccaa 2520 tattaaaaag tggtgacatt gagaatataa gtaactatcg tccaatttca ctcctttctg 2580 tattctctaa agttttagaa agaattttgt acaataaaat ttatagtcat cttactttaa 2640 acaatttatt atacagcaat caatatggat tccaaaaaaa caattctact gaacatgcca 2700 ttctacaatt tacaagaaat atatctgact catttgaaaa ttctcaattt actttaggtg 2760 ttttcattga cttggcgaaa gcttttgata ctatagatca cgaaattctt tttaaaaagt 2820 tggagtggta cggaattact ggaaaaatat taatttggct aaaaagttac ctaaataatc 2880 gaaaacaatt tgtttatgcg aatgataacg tatcatctag tttgttaaat atttcatgtg 2940 gagttcctca aggatccata ttaggacctc tcctattttt aatatatatt aatgatctac 3000 caaaagcttc taacctaatg acaattatgt ttgctgatga ttctaaccta tttctttctc 3060 ataataacat ttttacactt tttagcaata tgaacattga actagtcaaa atttccgaat 3120 ggtttagatt aaacaaacta tcactaaata ttgataaaac taaatggatt ctttttcatc 3180 cttatggtaa aaagcaccag ctgcctagca acttaccttt tctatttatc gataacatag 3240 ttattaatag agttttagtg acaagatttt taggtgtata tatcgatgaa aatcttacat 3300 ggaaatgcca cattgctaac ttgtgcagta aaatttcaaa gagtataggc attttataca 3360 aaataagaaa cgttcttgat aaaaatacct taattcaatt atattattcg ttaattcatt 3420 gccatatcaa ctatgcaaac attgcttggg gtagcactta taaaagtaaa cttaaacctc 3480 tctatcggca acagaagcat gtagcacgcc ttataaattt caaggaccgt tttgctcacg 3540 ccaagcctct tttatatgat atgaaagcac tcaatatata tgagttaaat gtttttaata 3600 ttctttgttt tatgtataaa tgcaagacca acctatcacc cgtttctttt cgtaacttgt 3660 atttacaaag agatagaaat aaatatattc taaggaacga taacttaatt cgacaaccat 3720 tttctcaaac taattttgga aaatttttta tttcatttcg cggaccattt ttatggaata 3780 gtatagttct aaatacttct aaagattttt ctcaagaatg taattttgat tcttttaaac 3840 aaaatcttaa aaaactcatt ttttcaactg aaaatatact aatctacttt taaatttttg 3900 aaaatcactt gtctatatta gtatatgaat atatttaatg tacattcttt tttatttatt 3960 tattttttta tcaacaagca attatttaac atgttttatt ttagtgaatt atttaacatg 4020 ttttagtttt taagttttag aactcttcta ttgtaatatt gtatgtttat ttataatatt 4080 atattacgaa ctttatatac ttttattaat aaatttaatt ttgtaataaa aagaaactga 4140 acaaaaataa ataaatacaa agaatgaaaa cattgaaaaa aaaaaaataa taataataat 4200 aaaaaaaaaa aaaaaaatta aaaaaaaaaa aaaaaaacgt tcaaaagcgg tttctctgtg 4260 acaagacctg atggtcttct ttgagtatcc gcgttcttta tatttattaa tattctttat 4320 attattatat tattcttaac gatatcttga cttgttacct tttctttttt tattttttat 4380 attatttgta tctgtatcat aaatattgta ataaagaaca aaaaaaaaaa aaaaaaaaaa 4440 aaaaa 4445 // ID BEL-619_AA-I repbase; DNA; INV; 7217 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-619_AA_; KW BEL-619_AA-LTR; Pao_Bel_Ele177; BEL-619_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7217 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4390-4965] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 152..2638 FT /product="BEL-619_AA-I_3p" FT /translation="MSHIKLITNKNGHCRLCTEDDKAGNMAVQCDDCDRWF FT HQHCVNLEQLPSKKERWICPKCTEQEVEKQKLTQELNMLKNPVTMGQFEQL FT IARLNIKNEPEQNHLSILVKRQAMMSLPRFSGTAKEWPRFKSAFESTTDEG FT EFSNVENLNRLQQALEGNALKAVSHFMFEPENVPKIMERLEENFGRSNQIY FT EEYLNNLMKTQGNSNNSVVEISDALDAFVAHIGIMEKPEYLKDFRLIDEIV FT KKLPFNLQVQWIEVLQRNAESPTLKHLSEWLMRIAKLYRAVQVNTSQTKDK FT KVRNNVHHQQPDKERKQPFEKTTYNKRSENPTQGKRLPLTCGYCKSNHTIF FT NCESFNAMNAAERRQVVKEKTLCWSCLRPNHQSKNCPSARSCGIGTCKATH FT NRLLHTKNTPSIESSAKPTPIVETGSENVQSINHHQDSKTIKYFQILPVTL FT RNGDNIIKTYAFLDAGSSLTLLDEDTANGLGLEGRPETLKLLWTQDISKAT FT NTKIVQLEINGINQKRFALKGVRTVKNLQLPMQTLDFGSMEKKYPYLKDLP FT IKNYSAAKPTILIGIKHSHLLVPLQTVIGGEHEPIALRTKLGWIMFGNAVS FT QIDNEYAMMIHQDDEMREMMKQHFSVEDFGVKIVNETVESVETERSKEILK FT NTLKRSGDRYEVGLLWKYDKSNFPDSYGNAYNRLLSLEKTLNKQNNVELLH FT WAKKTFKEYEDKGYIRKLTETEIENPGTDRVFYLPHFIVTNTNKIPPKPRL FT VFDAAAKIKGVSFNSELLSGPDATSSLFGVLLRWREGMVAVTGDIKEMFHQ FT VKIRPEDQQAQRLYGGIATQVKIQIRT" FT CDS join(3232..4260,4264..7146) FT /product="BEL-619_AA-I_1p" FT /translation="MDLIKSAENLSIPRCITTEAITELELHVFVDASEDAF FT AACIYSRSKTHYGFCIRLLAAKSRVSPIKPISIPRLELQGALLGTRLTNQV FT KEKLRVKITDITMWSDSKTVLAWIKTEKRKYHQFVGHRIGEILDSTTKDQW FT RWVPSGLNPADEGTKLVKGKSIWLDGPDFLRQPEDIWPAKSSHEPTTAEIV FT AFHNEPETYSFIRENDYSSWRRLVKGLIPLKKFATWLQCKEALAKGMKYSE FT WKCIENALYRKAQNDAFPDEMEALISGKPFPKTSSILGFRPFLDEFGVMRC FT KGRLERANILPLATRIPIIMPQHHRLSKLLVRSYHEEYLHQEDGAVMTALH FT KFWIINLKSVLKNVKNCCQKCILTRSKPIEPLMAPLPPPRVQGYVRPFTNC FT GVDYFGPYKVNITRNITDKRWGVLFTCMASRAVHIEMAPKLDTDSFLLCLR FT NLQNRRGKVAKLYSDNGTNFVGADAVLKSLVKEINAKMESGTAAEMEISWH FT FNPPGAPHFGGAWESLVKLAKQAIREMMDKWKENVLPKPESLQAAFTQAEY FT ILNSRPLTDVPINCAEDEAITPFHILTGKGGKYVPPRMPTTTKDEKEQWKL FT VQFYQKYFWDRWKAEYIPRLLKRTKWQQKVKPIKVDDIVVLADQDGLPGSW FT LKGRIVKVFPGKDGQVRSADIQTQRGILTRRAGKIAVLDVSNKEPSLTTAS FT ENLITHNEIEDNQPKRKLVDTTFPNWGKRIKHDEDEVKSRARSLNDSHRQT FT DQRKIRYTTPGTVAGVFVTAAALLSLTDALIVKPIESDGLVFDHHGTCVLK FT RGIWKTKLTTNLYPEEDISLLNEIHRNISKALESMEGMVKDYTLNQTVQTI FT GRHCDETIMDITQFSRPKRSKGIFGYIKDFIFGGDDVEEDIAAMRIHDDRL FT FHDISDTMAKINGKTAHMGEQFNDRIFRLHQDINRLHNDARFEMIETKTLE FT TTILAREMIDEIGSKYRKLRQNPLPREVQQQLKQNISASLPSGYTTLDHEL FT SNDIKFAMINGSLIIIVETIVVNKDLYELFNVYAVPNSVNFSQIIIDERSI FT AINHHKEYFYPEENDMVKLNATHLLITRATIKRKHDCISASIMHDITNIKC FT MTQIRSSPYSHLIQLSESNKVLFYKSDNEKVVVHCNSVVSSVPYEAGIITI FT GPDCRIESAYSTIYGSISGESHKPITFFKPRAQLYYHIEKPHDANANVTSE FT IEPENPYLNDVITVVGTPEVIYRNSNTVIPLLVIIILGLIIGVYVYCKYFN FT NKVHASNITESQSNQDVNQRRSINLLPLPMVRYRNDPVVEEAV" XX SQ Sequence 7217 BP; 2591 A; 1377 C; 1479 G; 1768 T; 2 other; attggtggct ccagagagga aaatcgtgca tcatcaagtc atcaagttgc ctgttgcttg 60 ttgcaaatca ggaacaccag cgcgttatca gcacgatcat cagccccaag gtgcaaagga 120 aatttctcag ttgaagaaat aaggaagcga aatgtctcac atcaaactaa taaccaacaa 180 aaacggacac tgccgtttat gtaccgaaga tgataaagcc ggaaatatgg ccgttcaatg 240 cgacgattgc gatcgttggt ttcaccaaca ttgcgttaat ctggaacaac ttcctagcaa 300 aaaggaacgt tggatttgcc caaaatgcac ggagcaagag gtggagaaac aaaaattaac 360 gcaggaacta aatatgctga aaaatcccgt tacaatggga cagtttgagc aattgatcgc 420 tagattgaac attaaaaacg aaccagaaca aaatcatcta agtatccttg tgaaaagaca 480 agccatgatg agtctgcctc gtttttccgg tacggccaaa gaatggccaa gatttaaaag 540 tgcatttgaa tccacgaccg atgaaggaga attcagtaac gttgaaaatc tgaatcgcct 600 tcaacaagcc ctagaaggaa atgcactgaa agcagttagc cactttatgt ttgaacccga 660 aaatgttcca aaaattatgg aacgtttgga ggagaatttt ggtcgttcaa accaaatata 720 tgaagagtat ctcaacaact taatgaaaac tcaaggcaac tccaataatt ctgtcgttga 780 aatatccgat gctctggatg cttttgtagc acacatcgga atcatggaaa aaccggaata 840 tctaaaagat ttccggctca tagacgaaat cgtcaaaaaa cttccattca atttacaagt 900 acagtggata gaagttcttc aacgaaatgc agaatcacca acattgaagc atctctctga 960 atggctaatg agaatcgcaa aactctatcg agctgtgcag gtaaatacat cgcagacaaa 1020 ggacaaaaaa gtaagaaaca acgtccatca tcagcagcca gataaggaga gaaaacagcc 1080 atttgaaaag acgacgtata ataaacgatc ggaaaatcca acacaaggaa aaaggttgcc 1140 ccttacctgc gggtattgta aaagtaatca taccatcttt aattgcgaaa gtttcaatgc 1200 aatgaatgca gctgagaggc gacaagtggt aaaggaaaaa accttgtgtt ggtcatgcct 1260 tcgtccaaat caccaatcca aaaattgtcc atcagcaaga agctgtggaa tcggaacctg 1320 caaagcaaca cataatcgtt tgctccatac aaagaacact ccatctatcg aatcaagtgc 1380 aaagccaact ccaatagtcg aaacagggtc agagaatgta caatccatta accaccacca 1440 agattcgaaa acaatcaaat atttccaaat tcttccggtg acgttaagaa atggagataa 1500 cattataaaa acctatgcgt tcttggatgc aggatcatcg cttacacttc ttgatgaaga 1560 caccgcaaat ggtttaggcc tcgaaggaag gccggaaaca ttgaagttgt tatggactca 1620 agatatatcc aaggcgacaa ataccaaaat cgttcagctt gaaattaatg gtataaatca 1680 aaaacgattt gcattaaagg gtgtaagaac ggtgaaaaat ttacaactac ccatgcaaac 1740 tttggatttt ggctcaatgg aaaaaaaata tccttaccta aaggatttgc caatcaaaaa 1800 ctattctgca gcgaagccaa caattctgat cggaataaaa cacagccatt tattggtccc 1860 attacaaacg gtaataggcg gggaacatga gccaatagcg cttagaacga agctcgggtg 1920 gataatgttc ggtaatgctg tctcgcaaat agataacgaa tatgccatga tgatacatca 1980 agacgacgag atgcgtgaaa tgatgaagca acatttctcg gttgaagact tcggcgtcaa 2040 gatagtcaat gaaaccgtag agtctgtaga gaccgagcgg tccaaagaaa tattgaagaa 2100 cacgttgaaa agatcaggcg atcgctatga agtcggatta ttgtggaaat acgacaaatc 2160 gaattttcca gacagttacg gaaacgcata caatcgtctg ctatctttgg agaaaactct 2220 taataagcaa aataacgttg agctcctaca ttgggctaag aaaaccttta aagaatatga 2280 agacaaaggt tatattcgga agcttacgga aactgaaatt gaaaaccccg gaacagacag 2340 agttttttat ctgccacatt tcatagtgac aaatacaaat aaaattccac cgaagcccag 2400 actagttttc gacgcagccg caaagattaa aggagtttcc tttaattcag agcttctttc 2460 tggaccagat gcaacatcat ctttgtttgg agtactatta cgatggagag aaggcatggt 2520 agcagtcact ggagatataa aggaaatgtt tcatcaagtc aaaatcagac ctgaggatca 2580 acaggctcaa agactctatg gagggattgc gacccaagta aaaatccaga tacgtacgtg 2640 atgcaagtca tgactttcgg gtctacatgt tcaccagctt cagcgcaagc ggtaaaaaac 2700 accaatgctg aattataccg agaaatctac ccggatgctg tagattccat tatagacggt 2760 cattatgttg acgatttatt ggacagcttc aacaacgcgg tgactggcat taaagttgtt 2820 agacaaatca cagaaataca tgaccatgca ggatttcata taagaaattt tgcttccaat 2880 agtcctgaat taatggattt aattcctgaa aacagacgta tggtggcaaa tgttaaatca 2940 ctcgatgaaa aggaatcaaa tgtggaaaag gttctaggca tctactggaa cactctccaa 3000 gactcatcgg gtacaaattg aacatcaata agcttggaga agaagtgttg aaaaacatca 3060 gagcccccac catgagagaa gtactggcat tcataatgtc aattatgatc ctttgggatt 3120 aatcagcaac attacattca aggcaaaatt ctatatcaag aactgcacgt ggcatcacta 3180 gaatgggatg attgtatacc agaccaactg ctggtgccat ggagagattg gatggacctg 3240 ataaaatcag cggaaaattt gtcgatacca cgctgtatca ccacggaagc gattactgag 3300 ctggaactac atgtatttgt agatgcttca gaagacgcat tcgccgcatg tatatactcc 3360 agaagtaaaa ctcattacgg cttctgtatt agacttctcg cagccaaatc gagagtaagt 3420 ccaatcaaac ctatatcgat tcccagattg gaattacaag gagcgttatt aggcacacgg 3480 ttaacgaacc aagttaaaga aaaactgcgt gttaaaataa cagatataac aatgtggtca 3540 gattcgaaaa ctgttctcgc ctggatcaaa acagagaaaa ggaaatatca tcaattcgta 3600 ggacacagga taggagaaat actagattcg accacgaaag atcaatggcg ttgggtccca 3660 tctggtctaa atcctgcaga tgaagggacg aaactagtca aaggaaaatc tatttggttg 3720 gacggaccag atttcttaag acaaccagag gacatttggc ctgcaaagtc atcacacgaa 3780 ccaactactg cggaaatagt tgcctttcat aacgaaccag aaacatattc gttcatcagg 3840 gagaatgatt actcaagctg gaggagatta gtcaaaggct taattccact aaagaaattt 3900 gcgacctggc tacagtgtaa agaagccctt gcgaaaggca tgaaatatag tgagtggaag 3960 tgcattgaaa atgcacttta cagaaaagct caaaatgatg cttttcctga cgagatggaa 4020 gccttaatca gtggtaaacc atttccaaaa acaagttcaa tacttggttt tcgaccattt 4080 ctggatgaat ttggggttat gcgctgtaaa ggcagactag aaagagcaaa tatactacca 4140 ttggccaccm gaatcccaat tatcatgccg caacatcatc ggttgtcaaa gttattggta 4200 cgttcgtatc atgaagagta cttacatcag gaagatgggg cagtaatgac agcacttcat 4260 camaaattct ggattatcaa cttaaaatca gtattgaaaa atgttaaaaa ttgttgccaa 4320 aaatgtattt taaccagatc aaaaccaatt gaaccactta tggcaccatt acctccgcca 4380 agagttcaag gatatgtgag accatttacg aactgcggtg tggattactt tggaccgtat 4440 aaagtaaaca taactcggaa catcaccgat aagcgatggg gtgtattatt cacatgcatg 4500 gcatcaagag cagtacacat cgagatggct cctaaattag ataccgattc ttttctgcta 4560 tgtctacgaa acttgcaaaa ccgacgagga aaggttgcga aattgtacag tgacaatgga 4620 acgaatttcg ttggagccga tgcagtacta aaatctctgg taaaggagat caacgcaaag 4680 atggaaagcg gaaccgcagc tgaaatggaa atatcatggc atttcaatcc accaggagca 4740 ccacattttg gaggtgcttg ggaaagcttg gtgaaattgg caaaacaagc tataagagag 4800 atgatggaca aatggaagga aaacgtacta ccaaaaccag aaagtcttca agctgcattc 4860 acgcaagcag aatacatttt gaattctcgt ccattaacag acgtaccaat taactgcgct 4920 gaggatgaag caataacacc attccatatt ctaactggaa aaggtggaaa atatgttcca 4980 ccacggatgc ctacaacaac aaaggatgaa aaagagcaat ggaaattagt acaattctat 5040 caaaaatatt tctgggaccg atggaaggct gaatatattc cacggttatt gaaacgcaca 5100 aaatggcaac aaaaggttaa accaatcaaa gtggacgata tagtagtgtt agccgatcag 5160 gatggtcttc caggatcttg gttgaaaggt cgtattgtga aagtatttcc cggcaaggat 5220 ggccaagttc gttctgcaga tatacaaact cagaggggaa tcctaacgag acgagccggt 5280 aaaatagctg ttttagatgt cagcaataaa gagccatcat taacgacagc ttcggagaat 5340 ttgattaccc ataatgagat tgaagacaat caaccaaaac gaaaacttgt tgatactacg 5400 ttcccaaatt ggggaaaaag aataaaacac gacgaagatg aagtcaaatc aagagcaaga 5460 tcacttaacg attcgcatcg tcaaaccgat cagcgaaaaa taaggtatac aacacctggg 5520 accgtcgcag gagtttttgt aacagctgca gcattgcttt cacttacgga tgcgctgata 5580 gtcaagccaa ttgaaagtga tggattagtg tttgaccatc atggaacatg tgttttgaaa 5640 cgaggaatat ggaaaacaaa attaacaacg aatttatacc ctgaagaaga catatcgctt 5700 ctcaacgaaa tacaccgaaa tattagtaaa gccctggaat ccatggaagg tatggtgaaa 5760 gattatacct tgaatcaaac cgtacaaaca attggaagac actgtgacga aacaatcatg 5820 gatataacgc agttctctag gccaaaaaga agcaaaggaa tatttggata tatcaaggat 5880 tttattttcg gtggagacga cgtggaagaa gacattgcag ctatgcggat tcacgatgat 5940 cgattgttcc atgacatttc tgatacgatg gcaaaaatca acggaaaaac ggcacacatg 6000 ggtgaacagt tcaatgacag gatattccga ttgcaccagg atataaaccg tttgcataat 6060 gatgctagat ttgaaatgat cgaaactaaa acattggaaa caactatatt agctcgtgag 6120 atgattgacg aaattggttc caagtacagg aaattacgtc agaatccatt accacgtgag 6180 gtgcaacagc aattgaagca aaatatatcc gcatcactac caagcggata cactacattg 6240 gatcacgagc tatcaaatga tataaaattc gctatgatta atggatcact tatcattatt 6300 gttgaaacca ttgttgttaa caaagatctg tacgaacttt tcaacgttta tgctgttcct 6360 aactcagtta atttttcgca aataattatc gatgaacgca gtatcgcaat taatcaccat 6420 aaagaatact tctatcccga agaaaacgat atggtaaaat tgaacgcaac ccatttattg 6480 attacacgag caacaatcaa acgtaagcat gattgtatat ccgcttcaat tatgcacgat 6540 attaccaata tcaaatgtat gacacaaata cgcagcagtc catattcaca tttgattcag 6600 ctatcagaat caaataaggt gcttttctac aaaagcgaca acgaaaaggt ggtagtgcat 6660 tgtaattcag ttgttagttc agttccttac gaagctggaa tcataacaat tggaccggac 6720 tgtaggatcg aaagcgcgta cagcacaatt tacggaagca tttcaggaga atcccacaaa 6780 ccgatcacgt tttttaaacc acgggcgcaa ctttactacc acatcgaaaa accgcacgat 6840 gcaaatgcaa acgttacatc cgagatagaa ccggaaaatc catatttaaa cgacgtaata 6900 actgtagttg gaactcccga agtaatatac agaaactcaa atacagtaat accgctgcta 6960 gttatcataa tattaggatt aattattggc gtatatgttt actgcaagta ttttaataac 7020 aaggttcatg catcaaatat tacagaaagt caatctaatc aggatgtgaa tcaaagaaga 7080 agtattaatt tgttgccatt acccatggta agatacagga atgatccagt agtagaagaa 7140 gcagtgtgaa tatgatagat aatttggagc tcattttttt aattcatgta actctatcga 7200 attacggagg ccggaat 7217 // ID CR1-15_CQ repbase; DNA; INV; 3411 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-15_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3411 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 18-18 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 16..3369 FT /product="CR1-15_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MATPTPSDTAAPPFAAADLTCPGPVVGVRDRSCQTCP FT TGKYPYISNTSSSERLPIFSPSTVPDSTRRLPSAEAVMVSQLMDPAPLDNP FT SHRAALPSPGRSLDGTMEVPNRSDAVALPASGQPSRPGPAFGSREGVFQQQ FT TPGEYSESTVFSSPDNPLACSPPRHLTIYYQNIRGMRTKTEKLRLALMSSD FT YDVVVLTETWLHGNILDSEFSANYKIFRLDRNTATSSAARGGGALVAVKSD FT LGAREVELANCERLEQSTVRINFHSFSLFVCGIYVRPRTTPDVYTLHAESV FT QQILEKATDQDVVVVVGDYNLPDLVWVFDEDVGGFLPANASSDAEVALTES FT FLANGLVQINFLLNNSNRLLDLAFVNDAAAFELLQPPNSLLPIDAPHPPFV FT LKLEICAHTPASESIDPVAEEFDFKRCNFETLNSRLEAVDWSTMDTAGSLD FT DAVTEFYDQLLNVLRDTVPVRARRFRCPSRTPPWWNSQLRNLRNQLRKARK FT RFVNRSSWGNKVTLSYLETEFAEQQVTSYQNYIAGVQDNLQSNPKNFWSYV FT KERKQVGEIPTDVSYREEKSTSPEAAANLFARFFETVHSSQDPTVPPEQLN FT ELATYNINLPLLTVSVAEVKKALGSVDAAKGPGPDRIPPSVVKNCCSSLAR FT PVTSIFNRSLSEGAFPSEWKVASITPIHKSGSRAKVENYRSISILSCLAKV FT LEKLLVDHIYPSVKNIISEYQHGFMKQRSTTSNLMAYTNWIIRRMEKRQQV FT DAVYIDFAKAFDRVPHKLTIAKLTALGLPDWVTRWLNSYLVNRSAYVKIAG FT SLSTCYEIPSGVPQGSHLGPLIFVLFINDLCSRLQSCKLLYADDLKIFRRI FT GDSDDVRALQEDINVLLRWCVQNGMEVNEKKCKLISFYRIRSPNLAEYNMG FT RSTLERVHSIVDLGVTIDCKMEFNQHVSISVAKSYAMLGFLRRNAAGFTDV FT RVLKTLYFSLVRSVLEYAVPVWAPYYAVHQQKIESIQRRFVRFAARVLPWN FT DPVTLAPYTNLCALVGLPTLQHRRVLLQRLFVFDVLRNNIDCSDLLEEVHI FT RVPSRELRNHQLLDIPRHTTNYGHHNPLDACCRKFNHSTILENFDFNVCKS FT VFRSRILLLS" XX SQ Sequence 3411 BP; 806 A; 989 C; 837 G; 778 T; 1 other; acgaagcaag gcactatggc gactcctact ccttccgaca cagctgcacc cccgtttgcc 60 gctgccgact taacctgtcc cggtcctgtg gtcggagtgc gagataggag ttgccaaact 120 tgccccacag gcaagtaccc gtatattagc aacacttcct cttccgaacg cctcccaatt 180 ttcagcccaa gtaccgttcc cgattccacg cggcgtttgc cgtcggctga agctgtcatg 240 gtgtcacagc tgatggaccc cgcgccgttg gacaacccat cgcaccgtgc cgcgctgcca 300 tcaccgggac gctcgctaga cggtaccatg gaagtcccca atcgctccga cgcagtcgcg 360 ctcccagcca gcggccagcc aagtcgtccc ggccctgcgt tcggaagtcg agagggggtc 420 ttccaacaac aaactccagg cgagtactct gagtcaacag ttttttcttc tcctgacaat 480 cccttggctt gcagccctcc acgacacttg accatctatt accagaacat tcgcggcatg 540 cgaacaaaga cggagaaact gcgcctcgcc ttgatgtcca gtgactacga cgtagtggtt 600 ctcaccgaaa cctggttgca cggcaatatt ctggactcgg agttctccgc gaactacaag 660 atcttccgac tcgaccgaaa caccgccacg tccagcgcag ctcgtggagg aggagcactt 720 gttgctgtca aaagcgacct tggagctagg gaggtagaat tggctaactg cgaacggttg 780 gagcagtcaa cggttcgtat caacttccat tccttctctc tcttcgtttg cgggatttac 840 gtccgaccac gcaccactcc cgacgtgtac accttgcatg ctgaatccgt ccagcaaatt 900 ctcgaaaaag caacggatca agacgtggtg gttgttgtcg gggactacaa ccttccggat 960 ctcgtctggg tcttcgatga agacgtcggt ggctttctcc ccgcgaacgc gtcgtccgat 1020 gcagaggtgg ctctcaccga gtccttcctg gccaacgggc tggtacaaat aaacttcctg 1080 ctgaacaact ccaaccgctt actagacctc gcctttgtga atgacgctgc ggcctttgag 1140 ctgttacagc ccccgaactc actcttaccg atagacgccc cgcatcctcc attcgttctg 1200 aagctagaga tttgcgcgca cacacctgct tccgaatcta tcgatccggt tgctgaggag 1260 ttcgatttca aacgttgcaa ctttgaaacg ctgaattcaa ggctagaagc tgttgattgg 1320 agtacgatgg acacagcggg ctcactggac gacgccgtga cagagttcta cgaccagctg 1380 ctgaacgtgt tacgcgacac agtgccagtt cgagccaggc gatttcgttg cccttccagg 1440 acgccgccgt ggtggaactc gcagcttcgg aacttacgca accaactgcg gaaggcacgt 1500 aagcggttcg tgaatcgaag ctcgtgggga aacaaggtca cactctcgta cctcgaaact 1560 gagtttgctg agcaacaggt tacgagctac cagaattata tcgcaggagt tcaggacaat 1620 ctccagtcta acccgaagaa tttctggtcg tatgtcaagg agaggaagca ggtgggagaa 1680 ataccaacag atgtgtcgta tcgcgaggaa aaatcgactt ctccggaggc agccgcaaac 1740 ctatttgcac ggtttttcga aaccgttcac agctcgcaag accccacggt tcccccagag 1800 cagctaaacg aactagcaac ctacaacatt aacctcccgt tgctcacagt ctcggttgcg 1860 gaagtcaaga aggctcttgg ctcggtggat gcagcaaaag gccctggacc ggaccggatc 1920 cctccatctg tcgtcaagaa ctgctgcagc tcgcttgccc gtcccgtcac aagcattttc 1980 aaccgctcgc tctccgaagg tgctttcccg tcggagtgga aggtcgcctc aattaccccg 2040 atccacaaat cggggagtcg cgccaaagtt gagaactacc gttcgatctc aattctgagc 2100 tgtttggcaa aagtattgga aaagttgctt gttgaccaca tctacccgtc cgtgaaaaac 2160 atcatctcgg aataccagca cggcttcatg aaacagcgat ctacgacgtc caacctgatg 2220 gcgtacacga actggataat ccggaggatg gagaagcgtc aacaagttga tgccgtctac 2280 attgatttcg ccaaggcctt cgatcgagtc ccccacaagc taactatcgc caaactaaca 2340 gcgcttggcc tgccggactg ggttactcgt tggctcaact cgtacctggt caaccgttcc 2400 gcctatgtga agatcgctgg ttctctctcg acctgctacg aaataccttc cggcgtcccg 2460 caaggaagcc accttggccc gttgattttc gtgctgttca tcaatgacct gtgttcccgt 2520 ctccagtcat gcaagctgct gtacgccgat gacctcaaaa tmttccggcg aattggagat 2580 tctgacgacg tccgtgctct ccaggaagac atcaacgtgc tgcttcgctg gtgcgttcag 2640 aacggaatgg aagtgaacga gaaaaagtgc aagctgatct cgttctacag gattcggagc 2700 ccaaatcttg cggagtacaa catgggccga tccacactcg aacgcgttca ctccatcgtg 2760 gacttgggcg tcacgattga ctgcaagatg gaattcaacc agcacgtctc catctccgtg 2820 gccaaatcgt acgctatgct cggtttcctt cgaagaaatg ctgctggttt caccgacgtg 2880 agggtcctca aaacgctgta cttctcgcta gttcgcagtg tgttggagta cgccgtaccg 2940 gtgtgggcgc cgtattacgc cgtccatcag cagaagatcg agagtatcca gcgtcggttc 3000 gtaaggtttg cagctcgagt gttgccctgg aacgaccctg tgacgctggc accctacacg 3060 aacctgtgcg ctctggtcgg cctgccaaca ctccaacaca gaagagttct ccttcaacga 3120 ctgtttgtgt ttgacgtctt gcggaacaac atcgactgca gtgacctact cgaagaagtc 3180 cacatccgag ttccctcaag agaacttcgg aaccatcagc ttctcgacat accaagacac 3240 acgacgaact acggacacca caaccctctt gacgcctgct gtagaaagtt caatcactcg 3300 actattttgg aaaattttga ctttaatgtg tgtaaatctg tgtttagaag tagaattttg 3360 ttgcttagtt agtaagaaac cctaaagtct gcaaagatga taaataaata a 3411 // ID BEL-179_AA-I repbase; DNA; INV; 6665 BP. XX AC supercont1.1; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-179_AA_; KW BEL-179_AA-LTR; BEL-179_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6665 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1; Positions 528693 522029. XX CC Positions [5551-6183] - Integrase core CC 'GCATC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1258..5145,5149..6576) FT /product="BEL-179_AA-I_1p" FT /translation="MPPKKPTLKLLITKLVEAQSSLSDIWQFIADFKEDTK FT LSQIEVRMEKLEELWEEFSETLVEIKSHDDYNEEEGYDKERREFNNRYFEA FT KSFFMDQVKEKREPQALDQSFRNHDSSIHGNAALDNVRLPQIKLPTFDGND FT DEWLGFRDLFTSLIHWKPDLPEVEKFHYLKGCLLGEPRSLIDPLKITKANY FT QVAWDMLLKRYNNSKQLKKRQIQSLLSLPTLSKESVAELHTLLEGFERIVQ FT TLDQVVQPAEYKDLLLVNILTSRLDPVTRRGWEEFSATKENDSLDDLKDFL FT NRRMHVLESLPSKSTDTRGVQPTAQLKQKSSFTKTSFTSAQASGGRCFACS FT GNHPLFHCNTFQRLSVSDRDGLLKTHSLCRNCFRTGHQARECQSKYSCRHC FT KGRHHTLVCFKSEKGGETKVTSVTKGNAPSSNKETPGVSNPSSSQVANTVA FT TNTTVSNAAYQYSSQVLLATAIVVMVDDDGNQHPARALLDSGSESNFVTER FT LCQRMRVSRDKVDISVLGIGQASARVKQRIRAVVRSRVSPFSREMDFLILP FT KVTVNLPTISVNIDGWAIPSGIELADPAFFESKGVDIVLGIESFFDFFETG FT RRISLGKQLPTLNDSVFGWVVCGGTSTTTTSLQISCNTSTSDRLDLLLEQF FT WECEEIGSEKPYSPEERRCEELYQRTVRREQDGRYTVALPRNNDILPSLGE FT SRDIAFRRLQGTERRLARDASLRQQYIAFMEEYHAMGHMRKVDQTTQGMVH FT RCYLPHHPVVKEASTTTKVRVVFDASCKTSSGVTLNDALLVGPVIQEDLRS FT IVLRCRTRQIMLVSDVEKMFRQINVCEEDRPLQCILWRNSPIEEVDEYELN FT TVTYGTKPAPFLATRTVKQLVEDEKTRYPLAARAASEDVYMDDVITGVDDV FT ETALALRDQLEKMMSGGGFRLRKWASNCPKVLEGIAPEDLAIKQSAEINLG FT SEPSVKTLGLTWVPKTGVLKFNFLIPALDVQKEFTKRQILSIIATLFDPLG FT LLGAAITTAKIFMQQLWSIEDENGKKLDWDQPLPLTVGEDWKKLYNQLPAF FT NEISIDRCVIIPGATSMEIHCFSDASEKAFGACVYLRSEDADGKVLVRLLT FT SKSRVAPLKCQCIPRLELCGALLAAQLYEKVQASIKGSMKTYFWTDSTCVL FT RWIHATPTTWTTYVANRVAKIQAITENGYWRHVPGVENPADLISRGIQPQE FT IVNNRFWWQGPSWLEGGPEQWPNLPQNFEEEGANERRRTVVASVASPYAEF FT NEEYTSKFASYNDMIRRTAYWLRLMDLLRKPRSERNGSFLSTLELREAENV FT LIRRVQKEVFSREWKAIINQTAVPSNSPLRWYSPTISEDGIMKVGGRLNHS FT QETEDSKHPIPLPARHAFTRMLLQHYHERLLHAGPQLMLAVVRLRFLPLGG FT RSVAKQLVHNCIKCYRSKPTPIQQYMGDLPSARVTPARAFLRTGVDYFGPV FT YLRPAPRRPAVKGYVALFICLCTKAVHLELVTDLSTDRFLQALHRFVALRG FT KCSDIYSDNGTNFVGARNKLQEFLRLLKDQTHRNIVSKDCSTQGIQWHFNP FT PSAPHFGGLWEAAVRSAKKHLLKTIGEEPVSPEDFETLLTQVEACLNSRPL FT TPLTEDPEDLQPLTPAHFLVGESLQAISEPNLEDIPLNRLDKYQLMQKQLQ FT HFWRRWRQEYLCQLHARTKRWKPPVQVEIGKLVVIQDDNLPPMRWRMGRIL FT ELHPGDDGVVRVVTLKTASGKLTRPVEKLCMLPVPDCNDDPEPATTMSSNQ FT A" XX SQ Sequence 6665 BP; 1773 A; 1621 C; 1651 G; 1620 T; 0 other; ttgtggtcct tcgaaccgga tcaggaccag ggaaggatca acgaagctcg atttggaaac 60 ggtcgccgct gggtcgacga tacggacgct gttcggtgcc attttgaccg ccaaggctgg 120 cttcaatgtg gtgattgtta tcgcgaataa aaaggatcgc cgcaataaag gatcagattt 180 ggatttctcg gatcggattt cggatactgg attcgttcac acggcttgag gcggattgtg 240 accactattc gatgccaatt ggattgcatg gttcctagtg gaataggacg acgaccacgg 300 ataaggttcg gttcatcacc ggaggacatt cactggacaa gcgccgaaac ttgtctcaca 360 ggtacgtgtt aatgaacaag tatatgtcca gtcgaagcgt tcctttcacc ctacttacac 420 ccaatttcga ccatttttct ggaaatttct gcgacggagc cacgccggac aggacgtctg 480 gcgcagacag acacattgtt ggaatctgtg gtgccgtttt tccaaaccag gacgattcgt 540 ggcaacggac tgccatttcg tcatcgcttg aggagtttca ctcggtggtg gttccattca 600 ctgcccaccc atccattcat aaattcgatt ccttcccact gaagatcgcc accagtgatt 660 tcggaacaat cactgccacc gtgttcaccg tagatcgtcg aagcccatcc acgtgtgagc 720 gaagaggtcg cgccaacaag tttccgtcca ggtaaataca ctttatggta tatgtcgccg 780 aagcaaggca tacagtaaat acacctgttg atcgaaattt atatcctctt cgactcatca 840 tttacacacg caagccctgt agacccaatt gggtctgttg attccatttc gttactttgt 900 gaaaaaaaaa atttgggaaa tttgcgaata ttgcccgctg gattggttcc actgtttggt 960 tgtctgggta agccaccaca gtatatgtcc agccgtggca tttattttga atactacatc 1020 atatttcccc cctcttcatt cctcttctgg tcgttggttg ctgagacgaa ctggttaccg 1080 ttacggacgt cttattggac attttgactg gtattggcag ttcaattggg cttaatcgcc 1140 tgtttcgtca ggttagtgaa cacgatacat agtatatgcc agtcaagcta ggcgtatacg 1200 cggagtacta caccaccatc tctctcgttc cgttaccggt aacttgtgac gtcaaccatg 1260 ccgccgaaga aacctacgtt gaagctactg attacaaagc tcgttgaggc gcagtcttcc 1320 ctcagcgata tttggcagtt tatcgcggac ttcaaggagg acaccaaact ctcccagatt 1380 gaagtaagaa tggaaaaatt ggaggaactt tgggaagaat tctccgagac gctagttgag 1440 ataaaatctc atgacgacta taacgaggag gagggctacg acaaggaaag gcgggagttc 1500 aacaaccgtt attttgaggc caaatccttt ttcatggatc aggttaagga gaaaagggaa 1560 ccacaagccc ttgatcaatc tttccggaat catgattcgt cgattcacgg taatgctgcg 1620 cttgataatg tacgtttgcc ccaaatcaag ttgccaactt tcgatgggaa cgacgacgag 1680 tggctagggt ttcgggacct atttacctcc ctcatacact ggaaacctga cctgccggag 1740 gtggaaaagt tccactattt gaagggctgc cttctagggg aaccacggag cctgatcgac 1800 ccgctgaaga ttaccaaggc aaactaccag gtagcatggg atatgctgct gaaacgttat 1860 aacaacagca agcagttgaa gaaaaggcaa attcaatcac tgctgtcgtt gcctacactc 1920 tccaaggaat ccgtcgcgga attgcacacg cttctggaag gctttgagcg aatagtgcaa 1980 accctcgatc aggtggtcca gcctgccgag tataaggact tgttactggt taatatccta 2040 acgtcacgct tggatccagt gacgcgacga ggttgggaag agttttccgc aactaaagaa 2100 aacgattctc tggatgacct caaggatttc ctcaaccgac gaatgcacgt gctggagtcg 2160 ctaccatcca aatctaccga caccaggggt gtccaaccaa cagcgcaatt gaagcagaaa 2220 tcatcgttta ccaaaactag tttcacttca gctcaggcgt ccgggggtcg ctgttttgct 2280 tgttcaggaa atcatccact gttccactgc aacacattcc aacggttatc agtctcggac 2340 agggacgggc tgctgaaaac acattcccta tgtcgcaatt gtttcaggac ggggcatcag 2400 gcacgcgaat gccaatccaa gtactcgtgc aggcattgca agggtcgtca ccatactttg 2460 gtctgcttca agtcagaaaa gggtggtgaa accaaggtca cgtcggttac aaagggcaac 2520 gctccatctt ccaacaagga aacaccaggg gtttccaatc caagttcctc tcaagtggct 2580 aacacggtag ccactaatac aacggtttca aatgcggcat accaatactc atcccaggtc 2640 ctgctggcaa cggcgatcgt cgtgatggtg gacgacgatg gtaatcagca tcccgctcgc 2700 gctctcttgg attccggttc cgagagcaac tttgtaacag aacggttatg ccaacgaatg 2760 agagtgagtc gagataaggt ggacatctcg gtcctaggca tcggacaggc ttcagcaagg 2820 gttaagcaac gaatccgagc ggtggtacgc tctcgagttt cccctttttc acgggaaatg 2880 gacttcctaa ttttacccaa ggtgactgta aaccttccga caatttcggt caatattgac 2940 ggatgggcaa ttccaagcgg gatcgaactg gccgatcctg cgtttttcga gtcaaaggga 3000 gtggatatcg ttctcgggat agaatcgttt ttcgatttct tcgaaacagg caggcggatt 3060 tcactaggca aacaacttcc gacgctcaac gactcggtat ttggatgggt cgtttgtgga 3120 ggcacgtcga ctaccaccac ttcacttcaa ataagctgta acacatcaac ttcagacagg 3180 ctggatttat tgttggagca attttgggaa tgtgaagaaa tcggatcgga aaaaccatat 3240 tctccggagg aaagacgatg tgaggaactt tatcagcgga cagttcgtcg agaacaggac 3300 ggtcggtata cagttgccct tccaaggaac aacgatattc ttcccagctt aggtgaatcg 3360 cgagatatcg cctttcgacg actccagggg acggaacgta gattggcaag ggacgccagt 3420 ttgcgacaac agtacatcgc atttatggag gaataccatg cgatgggtca tatgcggaaa 3480 gtcgatcaaa ctactcaagg gatggtccat cgatgctatt tgccgcatca tcctgtggtt 3540 aaggaggcaa gcaccaccac aaaggtacga gtagtattcg acgcctcctg taagacgtcg 3600 tcaggtgtca cattgaacga tgctttgctg gtagggccag tgattcagga ggacttgagg 3660 tcgattgtct tgcgatgccg tacacgtcaa atcatgctgg tttctgatgt ggaaaaaatg 3720 ttccgccaaa tcaacgtttg cgaggaagat cgaccgctcc aatgcatcct atggaggaat 3780 tcaccgatag aggaagtcga cgagtatgaa ctgaacaccg ttacgtacgg taccaagcca 3840 gcacccttct tagccacacg tacggtcaaa caactggtgg aggacgagaa aacccgatat 3900 ccgctggcag ctcgggcagc tagcgaggac gtatatatgg atgacgtcat cacaggcgtg 3960 gacgatgtcg agactgcact agcactacga gatcagcttg aaaagatgat gtctggtgga 4020 ggatttcggc tcagaaagtg ggcctcaaat tgtcccaagg ttttggaggg tattgcgccg 4080 gaggatctgg ccataaaaca gtctgcggaa attaaccttg gttcggaacc gtcggtgaaa 4140 acattaggac tgacgtgggt gccgaaaaca ggcgttctca aattcaactt cctaataccc 4200 gctctcgacg tgcaaaagga gttcaccaaa cgtcagatct tgtccataat cgccacgtta 4260 ttcgatccat tggggctact aggagccgct attaccactg caaagatttt tatgcagcag 4320 ttatggtcga ttgaagacga gaatggtaaa aaactcgact gggatcagcc acttcctctc 4380 acggtgggtg aggattggaa gaaactatac aatcaactgc ctgcattcaa cgaaatctct 4440 attgatcgat gcgtcattat tcctggagca acttcgatgg aaatacactg cttctcggat 4500 gcttcagaga aggctttcgg agcttgcgtc tacctgcgaa gtgaggacgc cgacggaaag 4560 gtattggttc gtcttcttac ttcaaagtcc cgagtagcac ccttaaaatg ccagtgcatt 4620 ccacgattgg agctgtgtgg agcattactt gcagctcagc tctatgaaaa ggttcaagca 4680 tccatcaagg gttcgatgaa gacctatttc tggaccgatt cgacgtgcgt gctgcgttgg 4740 attcacgcca caccaacgac gtggactact tatgtagcaa accgagtggc aaaaatacag 4800 gcgatcaccg aaaacggata ctggcgacat gttcctggtg ttgaaaaccc cgcagaccta 4860 atatccagag gaattcaacc gcaggagatc gtgaacaacc gattctggtg gcaaggacct 4920 agttggttgg aaggtggacc ggaacagtgg ccaaatctgc cacaaaactt cgaagaagaa 4980 ggtgcgaatg agagacgtcg gacagtagtt gcaagtgtag cttcaccgta tgccgagttc 5040 aacgaagaat acaccagcaa attcgcatct tacaacgaca tgatccgccg gaccgcatac 5100 tggctacgct taatggatct gcttcgcaag ccacggagtg aaagataaaa cggatctttc 5160 ttgtcgacat tagaattacg agaggcagag aatgtgttga ttcgacgagt tcagaaggag 5220 gtcttcagta gagaatggaa ggcaataata aaccaaactg cggtgccatc aaactctccg 5280 ttgcgatggt attctcccac gatttccgag gatggaatca tgaaggttgg cggtcggttg 5340 aatcattcac aggaaacgga agacagcaaa cacccaattc cgctcccagc acgtcatgct 5400 ttcacacgaa tgctcctaca acattaccat gaaaggttgc tacatgctgg tccacagctc 5460 atgctggctg tagtaagact acggtttcta cctttgggcg gaagaagtgt cgccaagcag 5520 ctggtgcaca attgtataaa atgctatcgc tcgaaaccaa ctccaatcca acaatatatg 5580 ggggaccttc catctgcgcg tgttacacct gcgcgagcgt ttcttcgtac aggtgtcgac 5640 tattttggac cggtgtacct tcggccagcc ccacgacgac ccgcagtgaa gggttacgtc 5700 gcccttttca tttgtctgtg taccaaggcg gtgcaccttg aattggtgac cgatctgtcg 5760 actgaccggt tcctccaggc tctgcatcgc ttcgtagctc tacggggaaa gtgcagtgac 5820 atctattctg ataatggcac aaatttcgtt ggagccagga ataaattgca agaatttctg 5880 aggctactta aggatcaaac ccatcgaaac atcgtttcaa aggactgctc aacccaaggg 5940 atccagtggc attttaatcc gccaagcgct cctcatttcg gtggtctctg ggaggccgct 6000 gtccgttcgg ccaagaaaca ccttctaaaa accatcggcg aagaaccagt atcaccggaa 6060 gatttcgaaa cgcttctcac acaggtggaa gcatgcctaa actcccgtcc cctcacacct 6120 ttgacggagg atcctgagga tttgcagcca ttgactcccg cacatttcct tgttggcgag 6180 tcactccaag cgatatcaga acccaacctt gaagatattc ctctcaaccg cctggacaag 6240 taccagctca tgcagaaaca gcttcaacat ttctggcgaa ggtggcgcca ggaatattta 6300 tgccaacttc atgcccggac caaacgctgg aaacctccag ttcaagtgga aattggcaaa 6360 ttggtcgtta tccaagacga caatcttccc ccgatgcgat ggagaatggg acgcatactt 6420 gaactccatc ctggtgacga cggtgtggtt cgagtcgtta cactcaaaac agcatctggc 6480 aagctgactc gtccggtgga gaaattatgt atgttaccag taccagattg caacgacgac 6540 ccagaacctg ctactacaat gtccagcaac caagcgtaat caccgttcca ttcccctacc 6600 ttgtcgaaga ggatattcct gttttttctc ttttcagaaa tgtggcattt ctgggtgggt 6660 gagga 6665 // ID Gypsy-206_AA-I repbase; DNA; INV; 5639 BP. XX AC supercont1.44; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-206_AA_; KW Gypsy-206_AA-LTR; Gypsy-206_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5639 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.44; Positions 1395213 1400851. XX CC Positions [4305-4772] - Integrase core CC 'CTCTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1188..5477 FT /product="Gypsy-206_AA-I_1p" FT /translation="MTLTSIADPYVPYSMPFSQYQEQLEWIFKYNELPEDR FT YKTSFLAVCGKEVFTELKRLFPGRDFNELTYKQMTDELKKRYDKNDSAVVH FT SYKFWTKRQGRNESLEDFVITVKNLAERCDFGEFKERAIRDMLVIGVNDTQ FT LQKRLCDEEDLSAAKAERLILNSEISASRTKQLNHDDDRRVSVVARLGQRS FT DVSRSRNRFRNRSRSFDRNRSFSARSRSNSRYPSKRGGVSGKPLYLCSFCK FT KTGHTRKFCYRLHGKSPRKNQASVKFVDSPKPSSSSKDATLFKRLKKDLDS FT DSDYSDGMPCLMISSVNKISDPCYVEVLINRKRLTMEIDCGSAESVISEEL FT YLRNFANCEMQYCNKKLVVIDGNKLKVVGKILVEVQLAGNRKQLSMVVLRC FT NNDFIPLVGRSWLDEFYTGWRNTFTNPILTVGNIDVADEQKVVEEVKSKFP FT TIFTKDFSSPIVGYKGDLILKEDIPIFKKAYEVPLRLRQQVVDYLADLEQQ FT GVITPVEASEWASPVIAIVKKDQSIRLVIDCKVSINKVLLPNTYPLPVAQD FT LFATLSGSTIFCSLDLEGAYTQLLLTDRSKRFMVINTIKGLYTYNRLPQGA FT SSSAAIFQKVMDQVLCGLENVSVYLDDVLIAGKDFSDCKKKLLLVLERLAK FT ANIKVNWKKCKFFVSQLPYLGHVLTDKGLLPCPDKVKTIREAKAPRNVSEL FT KAFLGLVTYYSKFIPNLSTRISCLYNLLKKNVTYCWTDKCEKAFNDCKGFL FT LNPNLLEYFDPYKPIVVVTDACSYGLGGVIAHLVEDEEKPICFTSFSLNSA FT QKNYPILHLEALAVVSTVKKFHKFLYGMHFTIYTDHKPLIGIFGKEGKNSI FT SVTRLQRYVMELSIYDFDIVYRPSSRMGNADFCSRFPLSDEVPKELAREYI FT KNLNFSDEFPINYKEVARETLNDEFLQSILKYLKQGWPQRLERRFVDVYSH FT YQELEEVEGCVLFQDRVVIPERMKNKILKMLHMNHSGISKIKQLARRTVYW FT FGLNKDVEDYVKTCVICHQMTAINKKAPYSQWIPTKKPFSRIHADFFHFDR FT KVFFVVVDSFTKWIELEYMRYGTDCNKVLKVLLGIFARYGLPDVVVTDGGP FT PFNSDKFVKFLENQGILVMKSPPYHPESNGQAERTVRLVKDVLKKFFLDPD FT MRRLDIDEQIAYFLSNYRNICLDKGQFPSERLLSYKPKTMLDLINPKNNFK FT HHLTNSHDDPDFVDEKGDKKPDAFTKLKNGDLIFYKNFNTTDIRRWLPATF FT LRQISDATFQISLGGRIVLAHKRQLKLSSDTQRKGTLVFPFESVNTPVKDT FT ALLLPPNPEAATVSSSLTNQEASILSSVPLSESLLSSSPVRNNFRTTGKRR FT REDEEHDSEDSSSPEFEFYGYPADSFIFANDEPDFENQESDPLPVVSNRRS FT KRRRNKKRRKSDFVYY" XX SQ Sequence 5639 BP; 1728 A; 944 C; 1229 G; 1738 T; 0 other; gtgacgtacg aggatagatt tttttttttt ttgaaaaagt gatagttttt cattacttaa 60 tttttttcgc gtgtgaattt tgtgaaaatt tttagtggat cgaggcaatt tccacaagtt 120 ttatccaagg agctcggcgt aagtaaattt tccctccttg tgaaaagaat tttttccacc 180 atatttttat gcttggttgg ccattgttct gtgttttatt gaaatcaatc ttttgtgcca 240 gaacaataac ctcgctttgt tctattgttg gtgatgaaaa gctgaaaagt gacaattttc 300 ttaaattata tttggtgtag acaaaagcca ttataaaaac gccattttgt caaaacgtgg 360 taaaattttg tcgacgataa atttttctac acacactttt tcgcattaca tatttacagc 420 atatttttta cgagaatcac aaacaccagc agaaattttc tcagtgtgcg aatcagcgta 480 ccagaatatc ccagtgactg ggtacgcaaa agggaaaatc gacgaaaatt ttgacagttc 540 cgaagaagta gtgtgcgtga gttgagcgga ttgcgaagga ctgaattcac cactaccgac 600 aactcgagcg acagagtttg gtttacgtct gaagtcaaca gagaaaagtg catccaaagt 660 gcatcagaga tcaacgaact agtggcctgc tgtgaaaacc cgctagccgg cgaagaattg 720 gttgtttttt gaggactgcc gtttggaaag tgcgttgatt ttctgcagtg gaagccattc 780 atctggttgg atttgccatc aaggacatgg tgtacgtgct gctgggattc gccatcgttg 840 ctactgttgc tgttgtggac tgtggatgca gaagtgcatc tgaaggatcg ttcgtttggt 900 gctacaggga ttttgcatgc agaagtgcat ctagaggaga ttttggtgcg gatttggttg 960 ctgctacagt ttttgagcgg attttgcttg cggaacgcaa cgttttatcg agagtgcttt 1020 ttgtcatcat aaatcaggta cgtggttgtt ttttttgttt acttttttga tcttttttgt 1080 gcaagagcaa gattattttt tgattttatt ttattttgac tattgattga gttcgatttt 1140 aatttgtttt tttttgtgta aaattcttgt gcatttcttt ggaaaggatg accttgacat 1200 caattgcgga tccttacgtg ccgtattcaa tgccattcag ccaatatcaa gagcagctgg 1260 aatggatttt taaatacaat gaattgccag aggatcgtta caaaacgtct ttccttgcag 1320 tttgtggaaa agaagttttc acagaactaa aaaggctttt tcctggaaga gatttcaatg 1380 aactaacata caagcaaatg acagatgaat taaagaaacg ttatgacaag aatgattctg 1440 cagtagttca tagctacaaa ttttggacga aaagacaagg caggaatgaa tctttggaag 1500 attttgtgat aacagtaaaa aatttagcag aaaggtgtga ttttggagaa tttaaggaaa 1560 gggctatacg cgatatgctc gtgataggag tgaatgatac tcagttgcag aagagacttt 1620 gcgatgaaga agatttatct gctgcaaagg cagaacggtt gattctcaac tctgagattt 1680 cagctagtag aacgaagcaa cttaatcatg atgacgacag gcgagtaagc gtagtagcaa 1740 gattgggtca gcgatcggat gtttccaggt ctagaaatag atttagaaac agaagtcgta 1800 gttttgacag aaatcgatca ttttcagcca gaagtagaag taacagtaga tatccaagca 1860 aaagaggtgg agtttcaggt aaaccacttt acctttgttc cttttgcaag aaaactggac 1920 acactagaaa attttgctac cgtttacatg ggaaaagccc tcgcaaaaat caagctagcg 1980 ttaaatttgt agattctcca aagccatctt ctagctctaa agatgcaaca cttttcaaga 2040 ggttgaagaa agatttggat tcagattcag attattctga tggaatgcct tgcctcatga 2100 tttcttccgt caataagatc agcgatccct gctatgtgga ggtgttgatt aataggaaac 2160 gtttgaccat ggaaatcgat tgcggttcag cggagagtgt gatttcagaa gagttgtacc 2220 tgcgaaactt tgcgaattgc gagatgcaat attgcaataa aaaactagtt gtgattgacg 2280 gtaacaaact caaagttgtt ggaaaaattt tagtggaggt acagctcgcc ggaaatcgga 2340 aacaactcag catggttgtt ttacgctgca acaatgattt tatcccgctg gtaggccgct 2400 catggttgga tgaattctat accggatgga gaaacacttt tacgaatcct attttgactg 2460 ttggaaacat tgacgtagca gacgagcaga aagttgttga agaagtaaaa agtaagtttc 2520 cgacaatttt taccaaagat ttttctagtc caatagttgg ttataaggga gatttaatct 2580 tgaaggaaga catacccatt tttaagaagg cctacgaagt tccattacgt ttgagacagc 2640 aagttgttga ttatttggca gatttagaac agcaaggagt aataacgcca gttgaagcca 2700 gcgaatgggc ttcgccggta attgctattg tgaagaagga tcaaagtatt cggttggtga 2760 tagactgtaa agtctccatc aataaagttt tgcttccaaa tacgtatcct ttgcctgtag 2820 cgcaggatct gtttgctaca ctttcgggtt caacaatttt ttgttctttg gacctagagg 2880 gtgcttatac tcagctgctt ttaacggatc gttcgaagag gtttatggtt ataaacacca 2940 taaagggtct ctacacctac aatagactcc ctcaaggagc ctcatcaagt gcagctatat 3000 ttcaaaaggt catggaccaa gtcctttgtg gtttggagaa tgtttcagtt tatttggacg 3060 atgtactgat tgcgggtaag gacttcagcg attgcaagaa gaaacttttg ttggttttgg 3120 agagacttgc caaggccaat ataaaagtaa attggaaaaa atgcaaattt tttgtttcgc 3180 agttgcctta cttgggacat gttttaacag acaagggttt acttccctgt cccgataaag 3240 taaaaactat ccgtgaagcg aaagctccac gaaatgtttc agaattgaag gcatttttgg 3300 gacttgtaac ttactattct aagttcattc ctaatttgtc cactcgcatc agttgccttt 3360 acaatctttt aaagaaaaac gtgacatatt gttggacaga caaatgcgaa aaagcattca 3420 acgattgtaa aggatttctt ttaaacccaa accttttaga gtattttgac ccgtataagc 3480 ccattgtcgt agtaacagat gcttgcagct atggtttagg aggtgtaatc gctcatttgg 3540 ttgaagacga agaaaagcca atatgtttca cctcattttc tttgaacagc gcacagaaaa 3600 actaccccat tttgcatctt gaggctttag cagttgttag cacagtgaag aaatttcata 3660 aatttcttta cgggatgcat tttaccattt acaccgatca caagccttta atcggtattt 3720 ttggcaaaga gggtaagaat tccatttcag tgacgcgttt acagagatac gttatggagc 3780 tttctattta cgattttgat attgtgtatc gaccttcttc aaggatgggt aatgccgatt 3840 tttgtagtcg ttttccttta tccgacgaag ttccaaaaga gttagcaagg gagtacatca 3900 aaaaccttaa cttttccgac gaatttccaa taaactataa agaagttgcc agagaaacct 3960 tgaacgacga atttttacag tcaattttga aatacctgaa gcaaggttgg cctcagagat 4020 tggaaaggcg cttcgtggat gtgtactctc attaccagga gcttgaggaa gttgaaggat 4080 gcgttttgtt ccaggatcgc gtggttattc ctgagagaat gaagaacaaa attcttaaaa 4140 tgcttcatat gaatcactca ggtattagca aaataaagca actagcacgg aggacagttt 4200 actggtttgg actgaataag gacgtggagg actatgtaaa aacctgcgtg atatgccatc 4260 agatgacagc gataaacaaa aaagctccgt attcccagtg gatcccaacg aaaaaaccct 4320 tcagcagaat ccatgcggat tttttccact ttgataggaa agttttcttt gtagtcgtgg 4380 acagctttac caagtggatc gagctagagt acatgcgata tggaacggat tgcaacaagg 4440 ttttgaaggt tcttttggga attttcgcca ggtatggttt gccggacgtt gtggtcacag 4500 atggaggacc acccttcaac tccgacaaat ttgtaaagtt tttggaaaat caggggattt 4560 tggtgatgaa aagtccaccg tatcacccgg agagtaatgg acaagcagag agaactgtta 4620 ggctggtcaa ggacgtcttg aaaaagtttt tccttgatcc agatatgaga agattggata 4680 ttgatgaaca aatagcatat tttttgtcta attatcgtaa catttgttta gacaaaggcc 4740 agtttccttc tgaaagatta ctctcttata aaccaaaaac tatgttggac ttaattaatc 4800 ctaagaataa ttttaaacac cacttgacta actcgcatga tgatcctgat tttgtcgatg 4860 aaaaaggtga taagaaacct gatgcattta ctaaactcaa gaatggagat ctgatttttt 4920 ataaaaattt taacactaca gacatcagaa gatggttgcc agccactttt ttaagacaaa 4980 tttctgatgc tactttccag atttctcttg ggggaaggat agtgttggcg cataagcgtc 5040 agcttaagct atcgtcggac actcagcgca agggaacttt agttttccca tttgagagtg 5100 tgaatacccc agtaaaagat acagcactcc ttttaccgcc aaatccggaa gcagcaacag 5160 tttcgtcatc tctaaccaat caagaagcat caattttgtc atcggtacca ttatcagaat 5220 ctcttttgtc atcgtctccg gtaaggaaca attttcgtac tactggtaag agaagaagag 5280 aagatgaaga acatgattcc gaagatagtt ctagtccaga attcgaattt tatggctacc 5340 cagccgattc attcattttt gcaaacgatg agcctgattt tgaaaatcag gagtcagatc 5400 cacttccagt ggtttcaaat aggagatcaa aacgaagaag aaataagaag cgtagaaaaa 5460 gcgattttgt ctactattaa tagaatcgaa taagcaagaa atctataaat aaattgtaaa 5520 tataatttag tgtatacatt taagcataat gtgcgttcga aggtttttaa attgaattag 5580 ttggatttta aataaattta gaataaatga attattctcg aactaaagga tggaggagt 5639 // ID MOGWAI1_EI repbase; DNA; INV; 3526 BP. XX AC MOGWAI1_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Mogwai-Ei1 (MOGWAI1_EI), a new member of the Tc1/mariner DNA DE transposon superfamily from the single-celled eukaryotic DE reptilian parasite Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; MOGWAI1_EI; KW Mogwai-Ei1. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-3526 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; MOGWAI1_EI; Positions 1 3526. XX CC Mogwai-Ei1 is a member of a new clade of Tc1/mariner elements CC found in E. invadens. The TIRs of Mogwai-Ei1 are 26-bp long and CC are flanked by TA TSD. The element appears to contain two genes, CC although both have accumulated nonsense mutations. The first CC gene may encode a reconstructed putative protein of 694-aa. The CC N-terminal region of the protein contains a domain of ~400 aa CC with 21% identity and 43% similarity to a member of the CC Fen1/XPG/RAD21 family of structure-specific endonucleases from CC Encephalitozoon cuniculi (Genbank NP_597617). The central region CC of the protein display a DD33E motif and is most closely related CC to the putative transposases encoded by Ant1 from Aspergillus CC niger, TBE1 and Tec elements from ciliates and various CC prokaryotic insertion sequences of the IS630 family. The second CC (pseudo)gene may encode a 309-aa protein that contains a domain CC ~20% identical and ~40% similar to Ulp1 proteases from plants and CC animals. Interestingly, members of other Mogwai families also CC harbor a second gene that is distantly related to those of CC Mogwai-Ei1 and share some similarity with Ulp1 proteases, CC suggesting that it is an integral component of Mogwai transposons CC and not merely a transduced host gene. Together this data CC suggests that Mogwai elements belong to a clade of Tc1/mariner CC elements distinct from previously established eukaryotic clades CC of the superfamily (e.g. mariner, Tc1, pogo?). XX SQ Sequence 3526 BP; 1386 A; 467 C; 542 G; 1131 T; 0 other; tattcctttt tgatttaaaa aatgaataaa atattctaca catctattaa tagtaaaaat 60 gatgtatttc aaacacttat gggaattatt attgtattga tcatttttga tgaaatttgt 120 ttttcaaaat tgttataaaa ataatttaaa aataaaaccc tataaaaaga taataaacat 180 ataatatatt gaaatataaa ctttagaatt attcaaaatt ttattaactt aaaaatcaaa 240 tgaaaaaatt tgattgggca ttcccattaa caatggaatg tagaaaaggc ttatcaatgg 300 atctccaatt aaacttttct tttggagctc aagccacacc atttaacatt tgtaaagttc 360 ctccaaaacc aactttcttt gaaacggaac cagaaatcac aaatgaaact aaatatttcg 420 aaaatccaaa tgaaaaagat cccatattta tcagaaatgc atttttagca tcaaaacata 480 ttgatcaaca ctttgaaatg tacaatgcca ggatgaaaga aatatctaaa caatactacg 540 aagaaaatga tattcatgaa gtattgtttc ctagacaaat tgttccatct actcaattga 600 atctcaaaag aaaaaaaata ttacaatgaa agaaaaaacg acttgtatac acaaatatat 660 atttattatt ctgattgtaa aaattataat agaactgcca aacaattcaa attgcctcct 720 tcaaccatcc acaatatcat tgaaagggtt aaaaatggag tgtctctgac tggtaactaa 780 aaagggagga aatcggggca caattacaaa tttgattata atacaataac aatgatactc 840 gaagaaataa gcgcgaacaa acacacaacc ttaaaattga taacaaaaca tgtaaacgaa 900 aaaaaaagag atgaatggat ggaagctata gctcggtacg aatcttattc aatgagagat 960 gaacacgaaa ttaaagagat gttgaaacag aaaaaagtca gtttaagtgg tgtacacaga 1020 atattaaaga gactaggagt gacactgaaa ctagtcaaaa aagaacaatt taatcgaaac 1080 gtcaaagttc ggattgaaca aagaaaaatg tatgctgatc atatcgaata tttgaattat 1140 ttccaattcg tttatgcata tatagacgaa gttggagtca actttaatca cacgagaaga 1200 cagggttatt cgtttatagg ccaatgttgt agtgttgaat ccaaaaacat aagagagcca 1260 aatttaacca cgtttgccat ggtggtaccg ggtaaaaatt tggtgtttcg tattagtagt 1320 aaaagctcca atgtcatttt catcaacagc gtcaaagaag tttttattcc acaattaata 1380 aagtggtatg gtccagtttt tatacacttt gtaattgata acgcatcaat tcacaagaag 1440 gatatgatat tggtgtgtca agagtacgga aatatatgta acatatttag taccatattc 1500 acctcaatta aacgctatcg aaaaatgctt ttcagcaatg aagtcttatt tggctgagat 1560 tctaagtgat aacgagtcgt tcaaaaaagc attgaacttt atgaaatgtt acgaatatca 1620 tgatttgata ctcaaggatg tacgaatgac taacataaat ataaaaatgt tttcgtcttt 1680 tgttgcagct tcattgcttt gggtgagttc tcaaagtaca tacaatttcc atattaaaac 1740 gtttgaatgg gtgaaaagag caaaagaagg gtttcgcttt gagaacgacg acaacttaaa 1800 caataaaatt gttcgaattg aagtaccgat ccctgacgaa gaatttgtgt ttagttataa 1860 tgataataac tcacaaagta ctgtaaattt tatatccaga aatgatgtac ttcatgtttt 1920 gacgtatgac ccatttacga ttaaaaatga aaagtccttt acaaattgtg aacgcttttt 1980 tcaagaaaaa cgtcaagaag gagaattgta tcgttataat gaaggaaaaa ataactacca 2040 atttgaacac aaaatcattc ctgatatttc tgacactttt gcaattcagg aaaataatat 2100 aatggaagaa tgtgattatt ataatgaaga aaagttgaaa agtgagtgtt caaaaagtga 2160 atatatgttt tcacaaattt caacaataaa atgttgtgac gatattgata aaaatacaag 2220 tccaaaactc atattacctc gagaaattga tgaatcaaca atcgacttaa ttcatcaaaa 2280 aatcaatgtg atgatgaatg caaatgttgt gttgggactc aaattcaatt agaaaatatt 2340 ttaacccaat tattgacaat acatatcatt caaaatcttg gcaatattgg ggttattatg 2400 atccaaatga ttacagaaga gataccacaa ctacaaatga aattaaagaa aattgctaaa 2460 aatgtttaac aaattcacaa atggaaataa caaaacaaat ttgtcattct tcatttcaaa 2520 atggtactaa tatagaagag ggtcaaacaa tcggaattgg ttggaatttt ggggataatt 2580 tgatggattt gagtgattta gatgatgaag ttataattgt taaaaaagaa atggaagtta 2640 tagacttgga tggttcttta aaatatgaaa atattgaaaa tgcatattcc gaattcttga 2700 attataaaac atcaaaaaaa caaatggaat ttgattattc aagaataaat tcctagttga 2760 tttttatggt taaaccaatt ataaacttgc catattctac tgatatatcc gcaattcaaa 2820 atgtcataac ttctgaaatg cttaatacag aaatactcga tttgtatatt ttgtggatta 2880 aactgagaaa tgtgtgcttg tggaaaagat cgatttgttt aactgattgg aataatttta 2940 ccgatggtgg aattataaag aaaagagtta gagaagcgat caaaaatatg ggtatttgtg 3000 atactttttt gcttcctgtt ttttataatt cacattttat gttgtgtgga gtattttttg 3060 ttgattcctc aacacccatc atagccgagt tcaacagcat taagaattac tcaagtagat 3120 ttcttagaaa agtttccaaa tttcttagag atgaattcga aaaacagaaa agaattaatt 3180 tcaatttaaa atgggttgtt gaaatacaaa ctccaattca atttgactta atatcatgtg 3240 gttcttattc gtcttatttt atagaagcga tattcaaaag caatccaaga tgtgtcaacg 3300 acatccataa gtgcttttct gattcatcag cttttaattt tcggactgaa attttatctt 3360 tgacatcagt cgacaacata caaagagttc ttaatcttat ttcttattgt tgaattaatt 3420 gtgtaatttt attttttttt cttcccaaaa atgcatactt aataaaattg tcctattaat 3480 ataaatatca aggggtagta ttcaattttt aaatcaaaaa ggaata 3526 // ID RTE-17_BF repbase; DNA; INV; 1820 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTE-17_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-17_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1820 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1820 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1715-1715 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 3..1715 FT /product="RTE-17_BF_1p" FT /translation="ETTVSQEDHLSESELEECLKALKSGKAPGYDKIPIEA FT YHHSPAAKQELFRVVNLIWDSEIIPPELVKGIFIMLYKKNDRNNYSNYRAI FT CLLSHAYKLLSAVIARRLHVQLEPLLPDSQAGFRPARGTRDNVCILKWTIN FT MLLRESKPAVVTFIDYAAAFDTESQVFLDEALSSVGASLKVRRIIQAVFSV FT ASGCVCVTSPNGNQELSEPFKISRGVLQGDIFSPVAFIAGLMRIFALHDNP FT DSGVTVGSPPHQVTISRLEYADDAGLLDESVQDASERISAIAIGSRNDAAM FT EISVPKTKAMHIHKKDRVSKTTDAEIAALNLKHRCPECTRDFSTKRGLAIH FT RSRWCLHHPSTANIRSRAGSLADKEVQRRKRLAKEEERDNVVLLDQQLENV FT HTFDYLGSRMQCDGDQKADVKHRMDIAQSRFSSMHHIWRDHRLPQRMKLRL FT YKSSVCSTLTHGCEAWDLTQDVTRMINGFNSRCLQVITKKHFRDTATHPDF FT NLVAAIRRRRLRYLGHVLRMDPSRLVRRTLKAYVCGGENPPEGSLLMDCEX FT LPFELVARQARNRQMWNAKVNKIN" XX SQ Sequence 1820 BP; 530 A; 476 C; 430 G; 383 T; 1 other; gtgagaccac ggtaagccaa gaagaccacc tgagtgaaag tgaacttgaa gagtgtctga 60 aagcccttaa aagcggcaaa gcaccaggat atgacaagat cccaatagag gcctatcatc 120 actcacccgc agcaaaacaa gaactgttta gagtggtaaa cctgatctgg gactctgaga 180 tcataccccc agaacttgtg aaaggcatat ttatcatgct ctacaagaaa aatgaccgga 240 acaactacag caactacagg gcaatctgtc ttctaagcca tgcttataaa ctcctctctg 300 ctgtgattgc ccgtcgccta catgtccagt tggagccgct ccttccagac agtcaggcag 360 ggtttcgacc tgcaaggggc acccgcgaca acgtctgcat cctgaagtgg acaataaaca 420 tgttgctgcg cgagtcgaaa ccagcagtcg ttacattcat tgattatgca gctgcttttg 480 acactgaaag tcaagtcttt ctcgacgaag cgcttagttc tgttggagct tccctcaagg 540 tgcgcagaat catccaagct gtattcagcg tcgccagtgg gtgtgtctgt gttacaagcc 600 ccaacgggaa ccaagagctc tcagagccgt ttaagatctc gcgaggggtt ctgcaaggag 660 acatcttttc tccagtcgcc tttatcgccg gcctcatgcg catatttgca ctgcatgaca 720 atcccgactc aggggtaaca gtaggcagtc ctcctcacca ggtaaccatc agtcgcctgg 780 agtacgctga cgacgcgggg ctgctggatg aaagtgtcca agacgcctcc gaacgcatct 840 ctgcaatagc cattggttcc aggaacgatg cagccatgga gatatcagtc ccgaaaacaa 900 aagctatgca catacacaag aaagaccgtg tatcaaagac gactgatgct gagatagcag 960 ccttgaacct caagcacaga tgcccagagt gcactagaga cttctccacc aagagaggcc 1020 tagccatcca tcggagccgc tggtgcctac accacccctc caccgccaac atccgttcac 1080 gtgcaggtag cttggcagac aaagaagtac agcgacgaaa gagacttgcc aaggaggagg 1140 agcgtgataa cgttgtccta ctggaccaac aactggagaa cgtccataca ttcgactacc 1200 taggaagccg catgcagtgt gacggtgacc agaaagcgga tgttaaacac agaatggaca 1260 tcgcacaatc ccgcttcagc tccatgcacc acatatggag agatcacaga ctgcctcaga 1320 gaatgaaact ccgactgtac aaatcctctg tttgttcaac cctcacacat ggctgtgaag 1380 catgggactt aacgcaggac gttacgagaa tgataaacgg ctttaacagc agatgcctac 1440 aggtgattac aaagaaacac ttccgagaca cagcaacaca cccagacttc aacctggtgg 1500 cagccatccg ccggagacga ctccggtacc tgggccacgt tcttcgcatg gacccctcca 1560 gactggtcag gcgcaccctg aaggcctatg tgtgcggagg agaaaacccg cctgaaggtt 1620 ccctcctaat ggactgtgaa sacttacctt tcgaactggt agctaggcaa gccagaaata 1680 ggcaaatgtg gaacgccaaa gttaataaga taaactgact ttatgtatat attgaaagca 1740 cggtctgtag atattgtagc atagggccca cggcctaaga aatgtgaatt atatatatat 1800 atatatatat atatatatat 1820 // ID Gypsy-11_DPu-LTR repbase; DNA; INV; 410 BP. XX AC scaffold_48; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_DPu_; KW Gypsy-11_DPu-LTR; Gypsy-11_DPu-I. XX NM Gypsy-11_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-410 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 738-738 (2010). XX DR Genome; scaffold_48; Positions 542485 542076. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 410 BP; 108 A; 91 C; 89 G; 122 T; 0 other; tgtcacgtac aaggtgacag agacttgtat gggcttgacg actggaacta aatgcaaacg 60 ttttacgact aaacttatgt ctttcttaga atagcatgta aggtgaccct agacgaaacc 120 cctacgtatc ctgacgattg cctgacggag tggggtcacc cctaaatttt cctatatata 180 tctaaccccg agacctcctc ttgttaatta ttattctccc agacttcttc tgtgttggta 240 taattggtga agggcactgt cgtcttagtg aattagagct agattagatt agagacgaca 300 gtgaaggtgt ttctctgaga ctgagccatt tttatagaca agccagctcc gtgaactact 360 tttattgatc ggccgtgtca acgtgaatcc cgtataacct gcccgcgaca 410 // ID Shinagawa-8_AAe repbase; DNA; INV; 2338 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2338 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 845-845 (2011). XX DR [2] (Consensus) XX CC >97% identical to consensus. 7-9 bp TSDs. TIRs are ~100 bp long CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. The insertion of DNA-TA-8_AAe-like CC transposon around 610 is excluded from the consensus. XX SQ Sequence 2338 BP; 769 A; 441 C; 435 G; 693 T; 0 other; gattctatac cattacccgg aataccatta cccggaatgc aatttaccgg aataccactt 60 accggaatgt accattaccc ggaaaaaccg tttaccggaa tgtaccattt accggaatgc 120 accattaccc ggaaagccaa atcttccatt ttcgacgacc attttcagta catttgtata 180 ttttttttat tctctttcaa tgatgctcag cacaaattat gtcacgctaa ccatggacat 240 tcttgactcc ctcggtttct ataaacactt tttcaaaaga caaattcaaa cagctatgga 300 ccaaatatgt ttttcttctt gccggcgtta cgagtcaata ggaacaaagc tattctagta 360 gaatttctac agttaatagc tgaccacttt tgcattctta catagtttga caagtatata 420 agttctctag ctctgaaacg tcgtcaaatt tgccaactag aaataatttt gaatcggcgg 480 tattcaattc taccaacctt agcttgatca ttatgaatag ctgcactttg ccgcaatgaa 540 tattacgata ccaaaattaa attgcacatg tcatttattt gcgattttat tagaaaggaa 600 tcgcctgata tagatctatg aaaactatgt cttcaaattt aggctgtaga actaaacatt 660 acaggcaagg ataaaaagaa tgaagagaat tatttttgtt ttgtatattt gatgtcaaat 720 attaacgagg tcaatataaa tcgaaaaaaa tcgcctaacg attaaggtgg agatgcatcg 780 aagccaaacc tcaaattttg aagagcacat atctagagaa ctaaacaccc gttcaagctg 840 aaaacttaat cgattggtta catccagcgt gtgatcaatc gatcaagttt tcagctcgaa 900 ccgctgcctg gtttcctaga tttgtgctct tgaaaattta aggattggct tcgatgtata 960 gtcaccttaa ggttgggtgc attattgtaa atagaaaatc tcagcatttg aaaatgtttg 1020 gatttatgat ggttttctta ttcccttcaa ataacaattt agaatccatt ttaaatattg 1080 tatcacaaaa aaactctaat cttgcgaggg aagggacata caacagtgct attctcattt 1140 tagctttaag gctcccccaa atcaacacga ttttttaagc gacatcgaca aaaaatcgac 1200 gcgattttgt tgttgagtcg ctgcaagcac tcttatatgg aaccatctag atcgacacga 1260 cgaaaatcgc tgcgaaattg cgtcgctgtg gcttagacac gctcatcaaa ctgtgcatgc 1320 agcgcatgtg catgtcgctg tgaccgtaaa aaatcgcctt gatttaaggg agcctttaga 1380 ggaaatctaa aacatcaaaa cgaaactaag agtgccttgg tgattgagag tagagacact 1440 attcgccatt gtgacgtcca tctttggatt ccgaaatatt gaacctaatt agatctactt 1500 ttcatttgaa tttgcttgat tttgctatat actacgaatg tcataactct gaatgtctgt 1560 accccaaatc caatgccagg ataatatact agaaagaaca gtaagcaatc atattgtcat 1620 aaatttggtc attttcgact aaacacattt taccaaaaat gccgagatcg gtggaactcg 1680 aaagaaccgc caaccaaata aagaagggtg aatatcagat ggcctgaaat gggtcattct 1740 agacaaaagg aatatacgga ggatttgcat tcggttattg acgtttacag aaacgacatt 1800 cggtggattg acactcggcg aaaaggagca caactaatcg tgttcaaaat tctaatcatt 1860 gcttattaat taacaattgg acgcttgtag aaccgacggt cgaatgtcaa aagtaaagtt 1920 acataattgt aaccaaattt agccagttca gccaagacaa gctgcaaaga taagttgaaa 1980 gaacagctta tttttaaaag aagggtgaat catctatgaa atgcaaatat atgaagatgg 2040 tgcatattga catgatcaag tatgaagata tcttagactc ctcttcgatg cttaggaatt 2100 ttgaaggaag aacatcccct gtccattatc gttacaaaat actgacccga atgaccgcaa 2160 aacggttaaa aggacgttat gaaaaaggcg agaaatagaa aaataatttt ccggtaaatg 2220 gtccttccgg gtaatggttt tccggtaaat ggttcattcc ggtaaatggt tttccgggta 2280 atggtcttcc ggtaaattgc attccgggta atggctttcc gggtaatgtc ggagaacc 2338 // ID SMAR16 repbase; DNA; INV; 2539 BP. XX AC . XX DT 03-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR16. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2539 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1074-1074 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1067..2257 FT /product="SMAR16_1p" FT /translation="MPARTYIAKSEKKASGFKASKDRISLLLCSNASGDRI FT LKPLVINKFNRPRALKSKDLKQLPVNWMANSKAWVTTALFTEWFKNCFLPE FT VEIYLRQKGLDFKVLLILDNAPGHVNIEHNNVQILFIPPNTTSLIQPLDQG FT IIATFKKYYIKSTYESILNKIENETLPLSEIWKKFTILDCINHVANAINQI FT RPMTLNLCWKAIWPECVTGNGTIAEISDEIIALAHSFGGEGFDTFNQNDIE FT ELLTDNALNDDEVIALTMDADDELENNDNVEEENPVPFTEKIIREGLQLCN FT NLENFFILHDTNYERALKFQRDLNNCISGYKELHKQLVTESRKSKQTLITD FT FAVHKSVSKQKRGEIDADFNETESLGSSSENESDFEPRFRKLARRLSFSDD FT SCN" XX SQ Sequence 2539 BP; 927 A; 373 C; 441 G; 798 T; 0 other; tacactagaa cctctttttg tgcgatcttt ttttgtgcga ttttgaagtt atgcggtttt 60 tttaaattca actttaaaaa aataattacg tataaacatg cactattttc atattttaac 120 gtacaaagat tagattttag cttgttcatt tgaaaatgac ctgtaaaaac atgcacaaac 180 atgagtcaca cacatgtgcg taacaggagg gtccaagcgg ttttgtttta ttaaaaaacg 240 ggccattgct aatagaatgc ggccggcgct attcgatatt gaacatccat gtaaattttg 300 tctcccactc tcttctatag tacaagtgca ttgttcagtg tgcccttggc aaattacaag 360 ttacgacgta tcacttcatc ttccttgttt ttcaagtgag tttctatgtt agtttattat 420 ttgtgcatta atatttttat tttataatct taatagattt ttctttaatt taccctagta 480 tggagaaaaa aaataccaac gaaaagcgat ttctttagaa agaaaaattg aaattttaga 540 ttctttgcaa aaaggagaaa gaatatgcga gattagtaaa catttcaact taggagaatc 600 gactgtaaga gcaatcaaga agaatgaaat tgcaataaga aaatcgatca ttgatggcac 660 gaagttaagt tcaaaattat catcctacac aagagataac atattagaaa aaacagaaca 720 agcgctcaga atattcattg aagatttaac aaaaaaaaga atccctataa gcggttatat 780 acttcaagaa aaagcgcgca aattttatga agaaattgaa aaatcagaat catctacttt 840 ttcaagttcc aaagaaaata gaaaatttat cgctagtaac ggatggttaa gtggattctt 900 gaaaaggaat gcatttcata atttgaagat tcgtggtgat gtcgcatccg caaatcaaga 960 agaagccaaa aagtttccag agaaattgat taaaattatc gaagatggag gatattgtcc 1020 ccatcaggta ttcaatgcag acgaaactgg attattttgg aaaagaatgc ctgctagaac 1080 ttacatcgcc aaatctgaaa agaaagctag tggattcaaa gcaagtaaag atcgaatatc 1140 attattacta tgtagtaatg catcgggaga tagaatattg aaacctctag taattaataa 1200 atttaataga cctcgtgctt taaaatccaa ggatctaaaa cagctaccgg taaattggat 1260 ggctaacagt aaggcctggg tgactacagc tctatttact gaatggttta aaaattgctt 1320 tttaccagaa gtggaaatat atctgagaca aaaaggtctt gattttaaag ttttattaat 1380 cctcgacaat gcacctggtc atgtaaatat agaacataac aatgttcaga tactttttat 1440 tccaccaaat acgacaagcc tcattcagcc attggatcaa ggaataattg caacttttaa 1500 aaaatattac atcaaaagta cgtatgaaag tattttgaat aaaatagaaa atgaaacatt 1560 accattaagc gagatttgga aaaaatttac aattcttgat tgtattaatc acgttgcaaa 1620 cgctataaat cagataaggc cgatgacttt aaatttatgt tggaaagcaa tatggcctga 1680 atgcgtcaca ggtaatggaa ctattgctga aatatcagat gaaatcatag ctttagcgca 1740 ttcatttggc ggggaagggt ttgatacttt caaccaaaat gatatagaag aattgctaac 1800 agataatgca ttaaatgatg atgaagttat tgctttaact atggatgctg acgatgaact 1860 agaaaataat gacaatgtgg aagaagaaaa tcctgttcca tttacggaaa aaataattcg 1920 agaaggtcta caactctgca ataacctaga aaactttttt attcttcatg atacaaatta 1980 tgaacgagct ttgaagtttc aacgcgacct gaacaattgt atttctggat ataaagaact 2040 gcataaacaa ttagttactg aatcaagaaa atccaaacaa actttgatca ctgattttgc 2100 tgtacataaa agcgtgtcaa agcagaaacg tggtgaaatt gatgctgatt ttaatgaaac 2160 agaatcgttg gggagttctt ctgaaaacga aagtgatttc gaaccacgtt tcagaaaatt 2220 ggcgcgaaga ttatcattca gcgacgacag ttgtaattaa aatagatttt actttaatgt 2280 aggtatatgt gaatgatagc gaaactccta ttttttttaa taaaataatt aaaaagtaat 2340 tatgtagata tgtgtatttt ttttcacttt ccaaactttt tttatgcgat ttgagcatat 2400 atataaatat tgacagtttc gaagttatgc ggtttttaaa attcttagaa cgcatctcac 2460 tttttctata taagtcgtag ttttttttgt gcgaccttcc gcagaacgca tctaccgcac 2520 aaaaagaggt ttaagtgta 2539 // ID Copia-131_AA-I repbase; DNA; INV; 4390 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-131_AA_; KW Copia-131_AA-LTR; Ty1_copia_Ele140; Copia-131_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4390 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1903-2406] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 370..4377 FT /product="Copia-131_AA-I_1p" FT /translation="MENRLKRVIVPLFKGDEKSFPFWKKRMEQHFKFEGLL FT HTLEKTPEEEDFFQPVAGASAAAETERKDKLQARLKEEDAAVNELLLAIDD FT EPMGHIMHCAYAKDIMDRLGEIFMKRGPMAMLGLRTKLLSLKSRKFTCLKQ FT LFATHQEIIRDLESMGEVIASTEKLTSLLVAIPDEFQHLIGALSVLRKDDL FT DTMSLEQVQRVFLDAEEGKTKRNTPGPSQVAMTAPQQNYRNLKCFQCGRYG FT HKMRNCREARRRKEASKPTFHTHPGGKKRDVALVTRQNGAFMVRIPLDELN FT PDLRRDCRVFQMDFKRLPVPSPTFGGVTLLDSGASNHMFRDAQMFNEMWDY FT VANIETAKEGEMMRTRRAGNVNLRTNKNFNVHLYNALFVPNLNFNIVSVSR FT IESTGKAVLFRNGGVEILDNDGTVILTGKRVNGLYILDIEFVGYANKVFAS FT KMLSDAELWHVRYGHLGAQNLNKLVRDNMVERLDVELSAMSDCPFACKSCI FT YGRQTREVFDNSITSRSNRPLELIHSDVCGQLPEETYDGCRYFVSFIDDYT FT HFTVVYLIRYKSEVLDKFREYEAMATAHFNLRIAKLRTDNGGEYFGGEFVR FT FCRERGIQMTPTAPYTPQQNGVSERMNRTLMEKVRTILHESGCPFMFWGEA FT LYASTYTLNRGPASALSVPKTPYEMWFGVKPDVSKLKVFGCIAYTHVNMEY FT RTKLDKKSRRLCMMGYAPNGYRLWDENFDRIAVSRDVKFDEHHFYFPKVDD FT ELQENPNFCEVEKRPELEIIEVENDLPLNENSEDDFVSVGEPSENEFQDIP FT EQEGRPQRTIRPPAWLRDYETVALSLNGSGDIPQNIDELRKRTDWDQWKQA FT IDEELRALKENNTWTLVDELPEGHKAINSMWIFNIKDGETVRYKARLVAKG FT CSQRPGIDFSDTFAPVAKMTTIRTMLSIAVKKNWIIHQMDVKTAFLNGNLN FT EEVYMKLPRDENGIVKICKLNKSLYGLKQAGRNWNQRFNEVVTQLGFQRLN FT SDTCLYKCPEKDLFIILYVDDILLFGSNISGIKWIKEKLSDYFKMKDLGDV FT KNFLGLEISRNLERGTIELSQQSYVDKILDTFGMKDCRAASTPMDLNCKWV FT RSEKCTDKPFKELLGCLQYLTLMSRPDITIAVSILSQFQSMPGDEHWVGLK FT RILRYLHGTKTYRLVYSREKDDEPLKGYADADFANDIEERKSNSGNVFLVY FT GNIVSWKSKRQQIVTLSSTEAELVSLCEAGKEGVWISNLLREVGISSIPFT FT IYEDNIPCIRISEEPREHQRTKHIDVRYMYVRNLIHEKKIVLEYIRSEDQI FT ADMFTKPLMKSRFNKMCELIKMIN" XX SQ Sequence 4390 BP; 1331 A; 784 C; 1076 G; 1199 T; 0 other; ggttatgggc ccagggctaa gtgactgtct tgaagaattg tttggacgga aattttctcc 60 gtaaacaatt gaatacaagt ttgctcagtc ggagaccttc ttttctgagg ggcaagctta 120 gagattagga gtgagcggtt gttcagacgg agaccctctt ttctgtggga caatcgtttg 180 cgcattcgag tttgttctgt cgttgatctt tccgaggagc aagctcaggg gatacttgag 240 aaaaagtgtt gttcagacgg agaccatctt ttctgtggga cgcactttga tcaaagattt 300 ttgggtgttc taagacagtc tgatcagtga gtatactttc ttccaggagg tttacctcag 360 attacaaaca tggagaatcg tctcaaacga gtgattgttc ctttgtttaa gggggacgaa 420 aaatcgttcc cattttggaa gaaaaggatg gagcaacact tcaagtttga aggtttgttg 480 cataccctcg agaagactcc cgaggaagaa gatttcttcc aaccggtggc tggtgcctca 540 gccgcagcag agactgagag gaaagacaaa ttgcaagcgc gtcttaagga ggaagatgcg 600 gcagttaatg aactgttgct tgcaattgat gatgagccga tggggcacat tatgcattgt 660 gcttatgcga aggacataat ggatcggcta ggagagattt ttatgaagcg tggcccaatg 720 gctatgcttg ggcttcgaac gaaactactt tcgctgaagt cacgaaagtt tacctgtttg 780 aagcaacttt ttgcaaccca ccaagagata atcagggacc ttgaaagcat gggtgaggtt 840 attgcaagta ctgaaaagtt aacttcttta cttgtcgcta ttcctgatga attccaacat 900 ttaattggag ctctttccgt tttgcgaaag gacgacctcg acacaatgtc acttgaacaa 960 gtgcaaaggg tgttcttgga tgctgaggag ggcaagacca agcggaacac gccgggacca 1020 tctcaagttg ccatgactgc accacaacag aattaccgga acctgaaatg tttccaatgc 1080 ggacggtatg gacacaagat gcgcaactgt cgggaagcgc gtcgccgaaa ggaagcatcg 1140 aagccgacgt tccataccca tccggggggc aagaaacgcg atgttgccct ggtaacacga 1200 cagaatggtg cattcatggt gcgtattccc ctcgatgagc tgaatcctga tctcagacgc 1260 gactgccgag tgttccaaat ggatttcaag cgactgccgg ttccatcgcc cactttcggc 1320 ggcgttacgc tgctagattc gggagcctcc aatcatatgt tccgcgacgc acaaatgttc 1380 aacgagatgt gggactatgt tgcgaacatc gagacggcta aagaagggga gatgatgcga 1440 acacgcagag ccggaaatgt aaatttgcgg acaaataaaa atttcaacgt tcatttgtat 1500 aacgctcttt tcgttccaaa tctgaacttt aatattgtct ctgtttcgcg tattgagtcc 1560 actgggaagg cagtcttatt caggaatggt ggagttgaaa ttttggacaa cgatggaacg 1620 gtgattctta ctggaaagag agtcaatggt ttgtatatct tggatattga gtttgttgga 1680 tacgcaaata aggtatttgc aagtaaaatg cttagtgatg ccgagttatg gcatgttcgg 1740 tatggacact tgggagccca aaatttaaac aaacttgtga gagacaatat ggtagagagg 1800 ttggatgtag aactatctgc catgtctgat tgcccatttg catgtaaatc atgtatttat 1860 ggaagacaaa ccagagaggt ttttgataac tcgatcactt cgagatcgaa ccgaccactt 1920 gagcttattc actctgacgt ttgtggtcaa ttaccggaag aaacctatga tggttgtaga 1980 tactttgttt ctttcattga cgattacaca cattttacag tagtgtatct aatccggtat 2040 aaaagtgaag tcctggacaa gtttcgggaa tatgaggcta tggcaacggc tcatttcaat 2100 cttcgcattg ccaagcttcg tacagataac ggtggtgaat acttcggtgg agaatttgta 2160 aggttctgcc gagagagagg aattcaaatg acacccactg ctccctacac accacagcaa 2220 aatggtgtaa gtgagcgaat gaatcgtacc ttaatggaaa aagtaaggac gattctacat 2280 gagagtggtt gtccatttat gttttgggga gaagctttgt atgcttctac atatacattg 2340 aaccgcggtc ccgcgagtgc attatctgtt ccaaaaacac cttatgagat gtggtttgga 2400 gtgaaacctg atgtcagtaa actaaaggtt tttgggtgta ttgcatatac acacgtgaac 2460 atggaatacc gtacaaagct tgataagaag agtagacgtc tttgtatgat gggatatgca 2520 ccgaacggat acagactatg ggatgaaaac tttgacagga tcgcggtttc gagggatgtc 2580 aaatttgatg aacatcattt ttatttccct aaagttgatg acgaattaca agagaatccc 2640 aatttctgtg aggtggagaa aagacctgaa cttgagatca ttgaagtaga aaatgattta 2700 ccattaaatg aaaatagtga agatgatttc gtgagtgtag gagagccatc tgagaatgag 2760 tttcaagata ttcctgagca agaaggtcgt ccccagagaa ccattagacc tcctgcctgg 2820 ctaagagatt atgaaactgt tgccctctcc ttgaatggtt caggtgatat accacagaat 2880 attgatgaat tgcgtaaacg aactgactgg gatcaatgga aacaagccat tgatgaagaa 2940 cttcgcgcct taaaagaaaa taacacatgg actttggttg acgaattgcc tgaagggcat 3000 aaagccatta actccatgtg gatttttaac atcaaagacg gagaaacagt ccggtataaa 3060 gcaaggctag tagcaaaggg gtgctcgcaa cgccctggta ttgatttcag tgataccttt 3120 gctccagttg caaagatgac aactattcgg acaatgcttt cgattgcagt taagaaaaac 3180 tggataatac atcaaatgga tgtaaaaact gcattcctta atggcaatct aaatgaagaa 3240 gtgtatatga aactaccacg ggacgaaaat ggaattgtta agatatgtaa gttaaataaa 3300 agtctttatg gtttgaaaca ggctggaagg aattggaacc agcgtttcaa tgaagttgta 3360 acacaacttg gttttcaaag gctgaacagt gatacatgtc tttataaatg tcccgagaaa 3420 gatttgttta tcatactata tgttgatgat attttgctct ttggaagtaa catatccggt 3480 ataaaatgga tcaaagaaaa gctttcggac tatttcaaga tgaaggattt aggggacgtt 3540 aaaaacttcc taggactgga aatttccaga aatcttgaaa gaggtacaat agagctttct 3600 cagcaatcct acgttgataa gatactggat acgtttggta tgaaagattg cagagcggca 3660 tctactccaa tggacttaaa ctgtaaatgg gtgagatcag aaaagtgtac cgataaacca 3720 tttaaagaac tgttaggttg ccttcaatac ttgacattaa tgtcaaggcc tgacataacc 3780 attgctgtga gtattttgag ccaatttcag agcatgccag gagacgaaca ctgggttggg 3840 ttgaagagga ttcttcgata tctccatgga acgaaaacgt atcgcctggt ttattctcgt 3900 gagaaggatg acgaaccact caagggatat gcagatgctg attttgcgaa cgatattgaa 3960 gaacggaagt ccaattctgg aaatgttttc ctggtgtacg gtaacatcgt gtcctggaag 4020 agcaagcgac aacagattgt gacattatcg tcgacggaag cagaactggt ttcgttgtgt 4080 gaagctggta aagaaggtgt ttggatatca aatttgctac gagaggttgg aatttcgtcc 4140 attcccttca ccatatatga agacaacata ccctgcattc ggatttcgga ggagccgagg 4200 gaacatcaac gaacaaaaca tattgacgtg cggtatatgt atgtgcgaaa tttgattcat 4260 gagaagaaaa ttgtgttaga atacataaga agtgaagatc agattgctga tatgtttaca 4320 aaacccctaa tgaaatcacg tttcaacaaa atgtgcgaat tgataaaaat gataaattaa 4380 ggggacgtat 4390 // ID PAO_I repbase; DNA; INV; 3534 BP. XX AC L09635; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 08-AUG-2007 (Rel. 12.07, Last updated, Version 2) XX DE Bombyx mori retrotransposable element Pao. XX KW LTR Retrotransposon; Transposable Element; KW Long terminal repeat (LTR); PAO; PAO_I; PAO_LTR; KW Repetitive element; retrotransposable element; KW reverse transcriptase; internal portion. XX NM PAO. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3534 RA Xiong Y., Burke D.W. and Eickbush H.T.; RT "Pao,a retrotransposable element from Bombyx mori with a highly RT divergent reverse transcriptase domain and unusual long terminal RT repeat."; RL Unpublished (1993). XX DR GenBank; L09635; Positions 630 4162. XX SQ Sequence 3534 BP; 1281 A; 882 C; 783 G; 588 T; 0 other; tttcatggtc cattcgagcc ggatagagga cacatttcat tattctttcg agcccggtag 60 aggatacgtt ttgaaaagga aactggaaca atgccgataa cgcggtcaac aggaagagga 120 cggatcgaaa ctgaacagtc gccgccctca gaaaccacag ctacaacaca gagcatgtgg 180 acagaaacaa ccgcgaatac cgtcatgagt ctggtagccc ccacaacaga atcatcgtgc 240 gcaacagcca acacggaagc cacgacgaaa ctcgcagaga aacctgggaa ctcaaaaaca 300 gaggccgtca aacagtatat agccaaacag aatgacgtgc caacaaaaca gcgcgccggt 360 acagtcaaaa gtgaccggtc gcgaaacaga aaagaacaga agatagccaa agctagagaa 420 gagctcgccc gcctacaggt ggagttagca gccgcccgat tggccacgct cgaagctgga 480 tctgatgacg aaaacagcga atcagaatac agtaagtcag aactcgacga aagagtgggc 540 acgtggttgg aaacccaacc cacaaaaacg gaaaatcacg accgacataa ggaaacaccg 600 gcgggagcct gcgacaaaca agacttctca gatctaaccg cagcaataac actcgccgtc 660 aaagccgccc gcgaaccaag atacacagaa ttgccattct ttaatggaaa tcaccaagat 720 tggctatcct ttcgtgcagc ctatcacgag acgatgaatt catttacaaa aacagaaaat 780 ataaacagac tcagaaggaa cctgaaagga agggcaaagg aagccgttga cggattactt 840 ataacgaacg ctgatccgtc cgacgtcata agaagtctag aagcgcgatt cggaagaccg 900 gaaacaatag ccataacgga gttagacacg ctacgagcgc tgccaagact aacagaaaca 960 ccaagagaca tttgtatatt ctccagtaag gtgaccaacg ccgtagctac gcttcgtgca 1020 ttaaattgca cacattattt atataatccg gaaactacca aaacaatgtt agaaaaactc 1080 acaccaacac tacgttaccg atactacgac ttcaccgcgg tacaaccgaa ggaggatccg 1140 gatctgatta aatttgaaaa attcatgaaa agagaagccg aactgtgcag cccttatgca 1200 cagcccgaac aggcggggca ctactcgcag cccgcacaac acaacagacg cacacagaac 1260 gtgcacatag tcagtgagaa gccatcacga gctaaatgtc cggtatgtag caacactgaa 1320 cacaccacaa cagactgcta catattcaag aaggcagact caaacacaag atgggacatc 1380 gctaagaata aacacctgtg tttccgatgt ctacagtata agaataaaac ccacaactgt 1440 aaaccgaaga cgtgtggcat taatgattgc aaatatactc acaacaagat gctacacttc 1500 gacagaaaaa ttgaaaaaac agacaacagt gacaaggaaa caacagagaa cattaattcc 1560 gcttggaccg gaaaacagaa acagtcctat ttgaaaataa tcccagtcca agtacaagga 1620 ccgataggca cggttgatac atacgcgctg ctcgacgatg gatcaacggt aacactgata 1680 gacgaaatca tctgcaagaa gactggaaca acaggaccaa tcgatccgtt acacatacag 1740 gcgattaaca acataaaatc aacggaaaca aggtcaagaa gagtcaacct cacgctcaga 1800 ggcctcaaca gtcgaaaaga aataatacaa gcgagaacag ttaacgacct acaagtaaca 1860 gcacaaaaaa taccaaagga acagatagac gagtattcgc acctacaaga catcagtgac 1920 atcatcacgt acgagaacgc gaaacctgga atcctgattg gccaagacaa aaactacact 1980 ggcacatgtt actagcttcg aaagttagac gaggcaacag gaatcagcca atagcgtcac 2040 tgacacctct aggctgggta ttgcatggag gtcgcactcg taccttaagc caccacatta 2100 aatcatgcta gcgaaaccca ggaagatgat aaaatagaaa atctggtaaa acagtatttc 2160 gctatggatg cgctgtgcat cacaccaaga agaccaaaaa cagacccaga ggaacaggcg 2220 cttcgcatcc tcaacagcaa tacagtccac acaacagatg gaagatacga aactgctctg 2280 ctctggaaaa cagataatgt cagtctacca gacaactaca ataactcgtt aaagcgactg 2340 ataaatatag aaaacaaact cgatcataat ccggaactga aacagaaata cacagaacag 2400 atggaagcac tcgttgcgaa aggctacgcc gagcccgctc caaaaacaaa aacagagaac 2460 agaacgtggt atctacctca cttcgccgtc gtgaaccccc cgaagccgga aaaactccga 2520 gtcgtccacg acgccgccgc cagaacaaga ggggtagctt taaatgatat gctgcttaag 2580 ggaccgaacc tactccaatc actgccagga gtgataatgc gattcagaca gcataatata 2640 acagcaacag cagacatcaa agagatgttc atacaagtaa aattgagacc tgaagacaaa 2700 gacgcgctcc gttatctctg gcgcaaagat cagcgagata acaagccccc agaagaaaac 2760 agaatgacct cgttgatctt cggggcgtca agttctcctt ccacagcaat atatgtaaag 2820 aacttgaacg cccagaaaca tgaagccacg cacccggagg cggcagccac aatacagaac 2880 agacattacg tagacgacta cttggacatt tttaaaggtt taaaagatgc agtactcgta 2940 acaacagact ttcgtcgaaa acacgaaaga aagccgacct cgaagacctt ctggatcgac 3000 agtgagatag tgttaagatg gacaagaacg gaatcacgct cgtacaaacc atacgtcgcc 3060 caacgcctga cagctataga agacagttca acaataaacg agaggcgatg gttacccacg 3120 aagcacaacg tagccgacga cgtgacccga cacgtcccaa tgtcgtacca gaatgaacat 3180 agatggttca gatggacaga attcctacgc caacgacaga actcctggcc gacggaatcg 3240 gcgtcagaaa ctacagaacc gatgggtgaa gtaaacatag cagctgcagt accggcggga 3300 gcttcatggc cgaggcgtcg ccatgaaaag tggaagtgcc agccacgaaa tacacgcatg 3360 cgagggaaaa gtgatagcga catatccagg tcccgacaac gtggggcgca tcgtagacat 3420 cagaaccaag ggtggagttc tacggagacc agtacgaaaa ctactgatcc tgcccatcga 3480 agaagaccat cctgcaccga gaagaatgcg actgactcgc acggcgggag taat 3534 // ID BEL-117_AA-LTR repbase; DNA; INV; 348 BP. XX AC supercont1.311; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-117_AA_; KW BEL-117_AA-I; BEL-117_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-348 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.311; Positions 482952 482605. XX SQ Sequence 348 BP; 118 A; 72 C; 65 G; 93 T; 0 other; tgacagttct caaactgatc atgacccaaa cacaggcagt gctgtcagtc ataagcctta 60 tgctacaaac ttgcatgcaa gtgcagcatt aaaattcaat tgcgcaatac atatgttata 120 cacaacatag gataaataga cgatgaattt gtaataaagt cagttagtaa aagttcaacc 180 tccgaagaat aaagttcatt acaataagaa gtgtttctaa atcaaacata agttttaata 240 gactctggtg atcttgagtg cccaagcctt gcctaagtgg agtagtgcat tgtgtcctat 300 tgtgaaccca gagtgtgtcc cgccggaaat cacgatcctg ccccaaca 348 // ID BEL-82_AA-LTR repbase; DNA; INV; 539 BP. XX AC AAGE02026656; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-82_AA_; KW BEL-82_AA-I; BEL-82_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-539 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026656; Positions 12458 11920. XX SQ Sequence 539 BP; 199 A; 88 C; 96 G; 156 T; 0 other; tgtgacgacg agagcccccc ggcgctacat catcaagatt taccacaggg aagtgaaggc 60 gtgtcggcat atgtgtcggc agacgggaat gaaccctata cagcggagta gtgataagag 120 aagaaaatgt aaacataatc gaacaaaaag atctattgca gtattgaaca catgcaggtc 180 taaaatttct ataattcatg caaattaagt tcataaatct atcaatttgt tcctaacttt 240 attactatta tgtacaattg gttctggtga acagtgaaaa ctatacagta agtgaaatac 300 tttattatca cgataatttg ttaacttaat gaatttgttc ctatgtacag ctttctactt 360 aacctaaatt tgataaaacc taaaggagaa tagtacggca aaactgataa atttgtgagt 420 aacactattg aattatttat aaattgtaaa atcctaaatt gcaataaaaa ttctagctaa 480 agcagttccc aaacaacaac tacgagttct gtggggcgct aagaaaatca gttgtaaca 539 // ID Gypsy-162_AA-LTR repbase; DNA; INV; 251 BP. XX AC AAGE02018483; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-162_AA_; KW Gypsy-162_AA-I; Gypsy-162_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-251 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018483; Positions 15826 15576. XX SQ Sequence 251 BP; 86 A; 47 C; 43 G; 75 T; 0 other; tgatatgttt tcattttgag tgtatccatt ttgccttacg catactctga tataaacgaa 60 atgcgatgtt ttcaagaaca atcagaagcg accaacgaga ggtctgttgg tcaaacggaa 120 aagagaaata aatcattact attcgagtgc tagctagaaa cagttgtcta cctcttatac 180 cttcaaacca atcacggacc ttgatctaac ttagttgaag ttcaattaag cttagtaaag 240 caaatctaac a 251 // ID Copia-14_CQ-LTR repbase; DNA; INV; 184 BP. XX AC AAWU01015579; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_CQ_; KW Copia-14_CQ-I; Copia-14_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 344-344 (2011). XX DR GenBank; AAWU01015579; Positions 12556 12739. XX SQ Sequence 184 BP; 50 A; 43 C; 33 G; 58 T; 0 other; tgttggagat ttaagctgtc cacttgtctg tggtagatta gtaaacaaga agaaatgagt 60 aaacgccctt tagcgtgtag ctcaactcct gtcagatatt ttcactctgc tgtaaacact 120 ctcgaataca actttgacaa acgtacacgc ggtttttatt tccactgccg ctccaccttt 180 taca 184 // ID TEC1 repbase; DNA; INV; 698 BP. XX AC M29914; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE E.crassus Tec1 transposon-like element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Repetitive sequence; TEC1; transposon. XX OS Moneuplotes crassus OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Spirotrichea; Hypotrichia; Euplotida; Euplotidae; Moneuplotes. XX RN [1] RP 1-698 RA Jahn L.C., Krikau F.M. and Shyman S.; RT "Developmentally coordinated en masse excision of a highly RT repetitive element in E. crassus."; RL Cell 59(6), 1009-1018 (1989). XX DR GenBank; M29914; Positions 725 28. XX SQ Sequence 698 BP; 235 A; 93 C; 148 G; 222 T; 0 other; tttgtagtac gattagagag aatttgctgg aaatgataaa gtctggcgca gacttattca 60 ccaatgttat aaatcatgca ttagttagcc ttagcatggt ctatggatat catgtggata 120 gtagttagtg gtctaggcta acctgcccat atggagtcca gacgatcgct acaggcttag 180 gaggttgtcc attgaaatac ctcatattta ggattattac atgtaaatct aggataattt 240 gctatcgaat tttcagcaaa gtcctttgta aatagagaat ttgtaaatat ccattaggct 300 ataattcgcc aagccagttg ctctaaggca tgagattggc atagtcagat agtctgaaga 360 gttctctaca atatagacta atatggtcat cttaaagtcg ccatgaacct ggaaggggtt 420 ggggttgacc aaaataaagt cagcgcgcag gcaaaccaga gagctattta gaggctattt 480 ggcataattc attagagttt ggtaatattt tgaggatagt aaagggttta aaaagacaaa 540 ttaagtgatg atcataagga tgcaaaatga aagcttgttt atgtggaaac ttaataaggt 600 gagaaaggta gatagaaaga gtattattta agattaaatt gttcaattat taattataat 660 tattaattta tattgtttca atataaaact ccctctat 698 // ID Gypsy-7_TCa-LTR repbase; DNA; INV; 137 BP. XX AC chrUn_33; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_TCa_; KW Gypsy-7_TCa-I; Gypsy-7_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-137 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_33; Positions 55482 55618. XX SQ Sequence 137 BP; 38 A; 32 C; 23 G; 44 T; 0 other; tggtacgaac gtcgtcgcat atagacgtac ctactccggg cagagcgata agtttctcct 60 caaattgttc actttttatt tattcattaa actattaaaa gtacctatgt gtcttattgg 120 ataagcaccc gccacca 137 // ID Gypsy-41_OD-LTR repbase; DNA; INV; 195 BP. XX AC CABV01004654; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_OD_; KW Gypsy-41_OD-I; Gypsy-41_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004654; Positions 2567 2761. XX SQ Sequence 195 BP; 36 A; 59 C; 40 G; 60 T; 0 other; tgtagtggct cgcagaaagt gctcgtcgat tctcatcagc taaccgtttt gcccgcgcgc 60 tcccatctcg cagggcgccg ggactttccc tctcactctt gttttgccag ttttacgtgc 120 aactcctcaa taaagttctt attttctcca gctagctacc gtgttcagtt cttagcatct 180 gacgtagagc caaca 195 // ID Gypsy-4_DPu-LTR repbase; DNA; INV; 104 BP. XX AC scaffold_118; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_DPu_; KW Gypsy-4_DPu-LTR; Gypsy-4_DPu-I. XX NM Gypsy-4_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-104 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 724-724 (2010). XX DR Genome; scaffold_118; Positions 231943 231840. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 104 BP; 26 A; 24 C; 29 G; 25 T; 0 other; tgttgtatct tgtcgtgttg ggcagagccc ctagcggtaa tgagccgtag ttcaatacag 60 agaggctcag cctcagttgc acacagacag gctagtgcat aaca 104 // ID DNA8-11B_AP repbase; DNA; INV; 483 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-11B_AP. XX NM DNA8-11B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-483 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1962-1962 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 483 BP; 113 A; 80 C; 80 G; 209 T; 1 other; cagggcttga aaccgattga aaaaatttcg gtttcggttt cgattttgta ataccggttt 60 ccaatttttt ccgattttgt tttcggtttc agtttttaaa cggtttatac cggtttctaa 120 aatcggtttc aaggcctgga ctatttacgt tcattcttag tttattaatt tntttgaatc 180 gccggatcga gcgttcgaat cgccttttgt gtaaagtaaa ctttccaaaa tgttcaagat 240 accaacgatt ttaaaaaaaa taccgttttt taaattatat tttcggtttc ggttttaaat 300 tataataccg gtatccaatt ttttccggtt tcggttttga atttaatacc ggtttccaat 360 ttttttcggt ttcggttttg aatttaatac cggtttccaa ttatctcgcg tttcgttttt 420 gatttacttt tcggtttttt tcacggttta taccggtttt ataaataccg atttcaagcc 480 ctg 483 // ID DNA-TA-2_AAe repbase; DNA; INV; 511 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-511 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1271-1271 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. TA TSDs. The sequence shows a CC similarity to PONY_AA and MITE_AA. XX SQ Sequence 511 BP; 169 A; 99 C; 82 G; 161 T; 0 other; taccgttttg attcatatta cggacactta aggcttcagt gaacttcaac taatcgaaac 60 atatataaat tgaatcgttc cctatacaac tttagcgtaa tcaggccttg caaacactta 120 aatttcgaat gctgggtgaa gatttggatt ccacagcttt aattcacttt aatcaacaaa 180 aatgttcggc ctcttcatga tttttattcc ggacacaacc acacttttgc ttcatattcc 240 ggacacattg attcgaattc cggacagctc atgaaaaaca caattagaat agtcgaatca 300 ttaactaaat gcactaatcc gttagaaaga cgtcaaaact agttgaacat tgtaaatttt 360 catggattgc tatgtaaaat gtttataaaa atcatctttc aaattgggaa ctttttgacg 420 gcccattttg aaacatttcg agtgaaatct ttcccataca aagtagagtg tccggaattt 480 gaagctgtcc gggattcgaa tcaaaacggt a 511 // ID Rehavkus1_AP repbase; DNA; INV; 4934 BP. XX AC . XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 3) XX DE DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; Rehavkus1_AP. XX NM Rehavkus1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4934 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1363-1363 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(690..1340,1344..2486,2490..3095) FT /product="Rehavkus1_AP_1p" FT /translation="MHVYRSYCQSTSYPKIIIDATGSLIKNFKKFGMNKTK FT TIYLYEALVYDESKLHSFTVSNMISERHTTLAIYNWLANWLNFNVPSPRET FT VCDQSMALLSACVKCFTQYSSLKQYIRVCAKLALGKLSPDPIWLLNCFIRT FT DVAHFIKLVSKWVPLKNTQGRVKEVILRSIGVIIKNQSISEIYSIVLSLFV FT VLTNESDGINAETGRDTPCEYHKKKIIITSTGFVNYEEWYNEVFSTVDSEE FT EIRDFIEEDETKNIDLDNDKNSFQSWAEEIFEKSKEYIEEGNGINVMYLPK FT LVPLIIKCMHFLPLWSGFMIPIFKYGKLTASSAGVESSFKKLKIVTFKDMD FT LPTNIDLFLERHIISLRGNSLLRSSNYTHTSDNFQEVIPINQTNELHEDDF FT IRIDENNINDIIMDIENEIPIKKSITVNSNNCEENTAKEGWDRKSKRQRIT FT NSYLNSNSHLRHIDLNNSRSMTSLPNLKNGSKVTDLKTRNLKDYGKVAISN FT TCAFDTLASIFMVAYCTSKRYTEEINCLDNTNEFLSFVSSIVKKGINPTTY FT KERAQIMLSYLEPEKTKIDYDITLVSCHATTEFVIKKLFLDIPTAFDYTLC FT NPKCDYSTENQKPVTYVTFHTENSLDGLQNFLLERLSIDYLTCGHINQYQT FT HPCSGDKTIRTEASTKHLFIEILKWEGTYSKYINKNITYNHYIQLFMLSYF FT LITGDITFYQDQSSEAAPQLKIPLKDIPQVLTHNNTTYELRGVCDYRKGLS FT RLRTSVGHYVAYCKRGPNNWELFDDTNKKSKAVSQNTEVLCELLIYTV*" XX SQ Sequence 4934 BP; 1794 A; 776 C; 780 G; 1584 T; 0 other; cacgaccgtc tcaccagaac cgcccattca aaaatttcaa tttttttttt tacagggaat 60 tagcacgtaa ttagcatgaa tttgcacgac tatcgccaaa atttacacct gtatgacaaa 120 aaatgatatg cggtgctgga gagacgtctc gccagcaccg cccaacatct aaaattatta 180 ataataacag acttgatatg ttaaaatgta gcacgtaatt agcatgaatt tgcacgacag 240 accccgaaat tcacacgtgt ctggttcgcg agaaaaaccc gtttgactat ttcgcgcgtc 300 tcgccagcac cgcccaaaat ggaaattttg cgaaaatcga aaaactgttt gcgccaatat 360 ttagcacgtg attagcatga atttgcacga ctgactccga aattcgcacg tgtctggttc 420 gcgagcaaaa accgttcgac tttttcgcgc ttctcgccag caccgcccta aagtgttaat 480 aaccttcaaa tataataaaa cactacacat ttaaaaaaag aaaatcaccc gctaataatt 540 tgcaaaataa tattctctta tcgcatcact atagttgtgg attacataat attatgtact 600 attaattttt taacgaaaat gagtataaag acattctaca tgatttaagt tgggaaccat 660 tttatatcca ttaccatagt ggcgaacaga tgcatgtcta cagaagttac tgtcaaagta 720 cttcttatcc taagatcata attgatgcaa ctggatcatt aattaagaac tttaaaaaat 780 ttggaatgaa taaaactaaa acgatttatc tatacgaagc tttggtgtac gatgaatcaa 840 aactacacag ctttacggtt tcaaacatga taagtgagcg gcatacaact ctggctatct 900 ataactggct agcaaactgg ttaaatttta atgttccttc acctagagaa actgtttgtg 960 accaatctat ggccctgttg tcagcatgtg tgaaatgttt tacccagtat tcttcactaa 1020 aacagtatat cagagtttgt gctaagttag cattaggtaa gctatcgccg gatccaattt 1080 ggttgctcaa ttgttttata cgaacagatg ttgctcactt tataaaatta gtgagtaaat 1140 gggttccatt aaaaaatact caaggtagag ttaaagaagt aattctacgc agcataggag 1200 taataattaa aaaccaatcc atatctgaaa tttattcaat tgtattatcc ctatttgtcg 1260 tgttaactaa tgaatctgat ggaattaatg cagaaacagg aagagacact ccatgtgagt 1320 atcataaaaa aaaaataatt taaataacat caactggttt tgtcaattat gaagaatggt 1380 acaatgaagt gttttcaact gttgattctg aggaagagat tcgtgatttt attgaagaag 1440 atgaaacaaa aaatattgat cttgataatg acaaaaattc atttcagagt tgggctgaag 1500 aaatatttga aaagagcaaa gaatatattg aagaaggtaa tggcattaat gttatgtatt 1560 tgccaaaatt agtgccatta ataattaaat gtatgcattt tttaccctta tggtcaggct 1620 tcatgatacc aatttttaaa tatggaaaat taacagcgag ttctgcaggg gtagaatcaa 1680 gttttaaaaa actaaaaatt gtaactttta aagacatgga ccttcctaca aatattgatt 1740 tgtttttaga aagacacata atatctcttc gtggaaattc tcttttgcgg tcttcaaatt 1800 acacacatac aagtgacaac tttcaagaag taataccgat taatcagact aatgaattgc 1860 acgaagatga ctttattcga attgacgaaa ataatattaa tgatattata atggacattg 1920 aaaacgaaat cccgataaag aaatctataa cggtaaatag taataactgt gaagagaaca 1980 ctgctaaaga aggatgggat agaaaatcca aaagacaacg aataacaaat tcgtacttaa 2040 actcaaactc acatttgcgt catatagact taaataattc acgcagcatg acatcactac 2100 ccaatttaaa aaatggttca aaagttactg atctaaaaac tcggaattta aaagattatg 2160 gtaaagtggc aattagtaat acttgtgcct ttgatacact ggcttcaata ttcatggtgg 2220 cttattgtac aagtaagcga tatacggaag aaataaattg cttggataac acaaatgaat 2280 ttctttcttt tgtatcgtcg attgtcaaaa aaggaatcaa ccctacaaca tacaaagaaa 2340 gagctcaaat aatgttaagt tatctcgaac cagaaaaaac aaaaattgac tatgacataa 2400 cattggtttc atgccatgca acaacagaat ttgtcataaa aaagctattt ttagatatac 2460 ctacggcatt tgattatacg ttatgttgaa accctaaatg tgactactca acggaaaacc 2520 aaaaaccagt gacatacgta acatttcaca cagaaaatag tctagatggt ttacaaaatt 2580 ttttattaga aagattatcg atagattatt tgacttgtgg acatataaat caatatcaaa 2640 cccacccatg tagtggtgat aaaactatta gaaccgaagc atctacaaaa catttattca 2700 tagaaattct taaatgggaa ggtacttaca gtaaatatat taacaaaaat ataacctaca 2760 accattatat acaattattt atgttatcgt attttttaat tacaggagac attactttct 2820 atcaagacca atcatcagaa gctgctccac aactaaaaat acctcttaag gatataccac 2880 aagttctaac acataataac acgacatatg aattgcgagg tgtatgtgac tatcggaaag 2940 ggctaagtag attgcgcaca tctgtaggac actatgtagc gtactgtaaa agaggaccta 3000 acaattggga actatttgat gacaccaaca aaaaatcaaa agcagtcagt caaaatacag 3060 aagtattgtg tgagcttctc atttacacag tataaaaaat gtactattaa cttaccacta 3120 aatgtacaaa gaaaatattt ttacttaata aatttaaatt ttataggtac tactttaaaa 3180 taacttgaaa gtatattatt atctattaaa aattattcca gcctccaaat atatccaatt 3240 aaaaaaaaaa aaattcaata agactgtcag aaatttctat taaacgtttg aaaaactctg 3300 tgtgaaattc taataaatac taaataatac tgcagcagtg taatgagctg ggaaggaaaa 3360 tacttcatac atgtacttat ttatacaata agtacccata cccttggata tataatattt 3420 atgatgcaat cggcttcagt cgcccgagaa accgaaaact aatttaattg tgactataaa 3480 gaaatattat aagttgattt tgaatttacg tggatgcgtc tgagtaaagt taaagttaaa 3540 ttaatgaaaa catcaaacag gtaacattat ttcaaaattt ctaaccagaa attacatttc 3600 aaaaatcttt tatttaaaaa tataaacttg gtgcagctcg ttactaggat aatagattct 3660 tcttttttaa tataaaccaa tggcaagtac gtcttgcaat caatcaggaa cgtctcgata 3720 ccatactata attatttatt tcgtaataaa ttatgtataa tgtatattgt aaataagaca 3780 aacatttttg ttcagaattt tattaaattg tattattgcg ggcaccttga tataggtaac 3840 atcagtagtg tcataaaaat gtcatacaat ttttaataat attttcacaa acatttaata 3900 gttaaaaatg gttagctaat ttatataata tactattaca aaataaatta tagatttcta 3960 ttgtttttta aacttaatct attattaatt ttcaatatta gtattttatt ttataatatt 4020 tatttttaag tcaataaaaa tgagatgtgt aagttcctaa ttcatacact gtaaaaaaaa 4080 gttgatcatg ttagatttta taaacatatt attatattta tctaattatg aatgtgtatt 4140 atatgtgatt attttttttt ttttttttta ggacatatta atcagttaga gaacatttga 4200 tatataaatt aatatagttt ttactgtgta tgaatttata tatataattt ttccgccgtg 4260 taggtataag aactaacata cgtacctacc tgctatttac catgtgggaa cttatattat 4320 gtacctttta atattgtaag agcttaaatt tgtatttttt ttacccacta gggctcccgt 4380 aggaacttac attcacccga taattcataa tatgctaatc gtgtactgaa tttcgtctta 4440 acaattttaa gacttacgtg gttaggcgag acgagcgaaa tagtcgaacg ggttttgctc 4500 gcgaaccaga cacgtgcgaa tttcggagtc agtcgtgcaa attgatgcta atcacgtgct 4560 aaatattggc gaaaacagtt tttcgatttt cgcaaaattt ctattttggg cggtgctggc 4620 gagacgcgcg aaatagtcaa acgggttttg ctcgcgaacc agacacgtgc gaatttcggg 4680 gtctgtcgtg caaattcatg ctaattacgt gctacatttt aacatatcaa gtctgttatt 4740 attaataatt ttagatgttg ggcggtgctg gcgagacgtc tctccagcac cgcacatcat 4800 tttttgtcat acaggtgtaa atgttgtcat gcaaattttg gcgatagtcg tgcaaattca 4860 tactaattac gtgctaattc cctgtaaaaa aaaattgaaa tttttgaatg ggcggttctg 4920 gcgagacggt cgtg 4934 // ID Gypsy21-LTR_Dpse repbase; DNA; INV; 230 BP. XX AC Unknown_group_247; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy21_Dpse; KW Gypsy21-I_Dpse; Gypsy21-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-230 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1121-1121 (2009). XX DR Genome; Unknown_group_247; Positions 30327 30098. XX SQ Sequence 230 BP; 92 A; 45 C; 73 G; 20 T; 0 other; tgtaagaagg agtgggccag ctggcctatt gaaggggcag cggggagcag caacagcagt 60 aataacagca gcagaggaac agctgatgag cagtaacagc agcagcaaca gcagtagtaa 120 cagcagacag agaagaagag gaacagagcg aagcagtgaa aaggagagaa cagcagcata 180 cggacgcgac tcaagcagca gcagaaaatc agcagaaata agccgtaaca 230 // ID Gypsy-21_DWil-LTR repbase; DNA; INV; 269 BP. XX AC scaffold_181130; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_DWil_; KW Gypsy-21_DWil-I; Gypsy-21_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-269 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181130; Positions 40640 40372. XX SQ Sequence 269 BP; 96 A; 44 C; 59 G; 70 T; 0 other; tgtaaagacg agatgggcac actgatctga gattgcccct attttgatga ggcgtgtagc 60 agcggagagt gagaagcatc taagagagag agtcgaaggg cagtcgtcgt cgagaacggg 120 caagatataa acgggttacg agagcgttcg gcagttatga aatcaattat tattctcaaa 180 gaaatataca tatgtactac acctcaattt aaacatttca gttaatatta tatacaataa 240 actattacaa atataatgtt aacctcaca 269 // ID Penelope-1_CQ repbase; DNA; INV; 1702 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Penelope-like element family from Culex quinquefasciatus - DE consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1702 RA Kojima K.K. and Jurka J.; RT "Penelope-like elements from the southern house mosquito."; RL Repbase Reports 11(1), 600-600 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. The consensus is severely truncated. XX FH Key Location/Qualifiers FT CDS join(1..576,573..1295) FT /product="Penelope-1_CQ_1p" FT /note="reverse transcriptase and GIY-YIG FT endonuclease." FT /translation="VQHDIIYSWAEIAPHTQICLDLFLEIVAFLMDASYFV FT FRNQHYLQIQGTAMGNPASPVFADLTMETLIDNVLHTISCPITTVHKYVDD FT LFLVIPEDKIEEVLATFNAYHDSIQFTYEVERDGRLPYLDMTLVRQADGTI FT RTEWYTKEIASGRLLNFLSIHPLPQKINIATNFINRVYNLTFVGGDKDKEN FT DKQIVHKYLSLNDYPRSLINRCLNRKNNPNPQNRPNHPNSEELQRYRSIPY FT IPTLTPRIAKLLQQDYPNIVVAPRSTHTVNNLYTRIKDPIPMLNRHNVIYC FT IKCDQCECRYIGMTSNLLKNRLSGHRSDVNKLDQLIQSGHTYTDVEVQAHK FT EKTALVSHCIDTGHRFDTTQAKILDQSFKRSSLPLLEMIWIHNTADTVNKR FT TDTDGLNSTYAGILHVLHKHKTKTKPQKHTTQSQLNQST" XX SQ Sequence 1702 BP; 584 A; 456 C; 294 G; 366 T; 2 other; gtacaacacg acatcatcta cagctgggcc gagattgctc cacacacaca gatctgtctt 60 gatctgttcc tggaaatcgt agccttccta atggatgcga gttacttcgt gttccgtaac 120 caacactacc ttcaaatcca gggtaccgcg atggggaatc cagcatcgcc cgtgtttgca 180 gacctgacga tggaaacgct gatcgacaac gtgttacaca cgatcagctg tccsatwact 240 acagtgcaca aatacgttga cgatctcttc ctcgtcatac ccgaagacaa gatcgaggaa 300 gttctggcca ctttcaacgc ataccatgac agcatccagt tcacctacga ggttgagcga 360 gacgggagac tcccatacct cgatatgaca ctggtaagac aagcagatgg taccattcgg 420 acagagtggt acacgaaaga gatagcatca ggccgcctcc ttaacttcct atccattcat 480 cctcttcccc aaaaaatcaa tattgcaacc aattttatca accgtgttta caatctcacc 540 tttgttggtg gtgacaaaga caaagaaaat gacaaatagt tcacaaatac ctttccctaa 600 acgactatcc ccgctcatta atcaatagat gcctcaaccg aaaaaacaat cctaatcccc 660 aaaatagacc caatcaccct aattccgaag aactccaaag atacagatcc attccttaca 720 tcccaacact tacaccccgt attgccaaac tactacaaca agactacccg aacatcgtag 780 tagcacccag atccactcac actgtcaaca acctctacac acggatcaag gacccgatcc 840 cgatgctgaa ccgacacaac gtgatctact gcatcaaatg tgaccagtgt gagtgtagat 900 acatcggcat gaccagcaac ctgctcaaaa acaggctgtc tggccatcgc agcgatgtaa 960 acaaactgga tcagctgatc cagtccggcc acacgtacac cgacgtagag gttcaggctc 1020 acaaggaaaa aaccgcgctt gtgtcccatt gtatcgatac cggccaccgg ttcgatacaa 1080 cacaagccaa gatactggac cagtccttca agagatcatc tctccctctc ttggagatga 1140 tctggatcca caacacagct gacactgtca acaagaggac cgacaccgat ggcctgaaca 1200 gcacatacgc tggaatacta catgtcctac acaaacacaa aaccaaaaca aaaccacaaa 1260 aacacacaac acaatcacaa ctgaaccaat ccacctgaca gttagcaatt agacgttgta 1320 cagtacaaag tgacccagca gttgaatgtt gttttacaac aatttatttg gacaaacatg 1380 ttagtatttc acaatagttt acaaaacaat attaaattaa ttctttaaac agtgatgtgc 1440 agcaaacgta aaggacagtt taacaatagg actaaaacat ttttgacaag aaacagacaa 1500 ctagtaagta cacaaaacac aaccaaacac acacaaattg taaaagcttg acatttttag 1560 taaaccccct gaagaaggcc tcaaaataaa ggccgaaacg tcggtgaagt agagaaaatc 1620 tgtttttctt tttcgacaag actgctcgcc aaaaccagaa agataaaaat aatcagtcgt 1680 tgcgaacccg gatcaaaata aa 1702 // ID REP-7_CQ repbase; DNA; INV; 1022 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A repeat family from Culex quinquefasciatus - consensus. XX KW Repetitive element; nonautonomous; REP-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1022 RA Kojima K.K. and Jurka J.; RT "Repeats from the southern house mosquito."; RL Repbase Reports 11(1), 610-610 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. No TIRs. XX SQ Sequence 1022 BP; 319 A; 189 C; 187 G; 327 T; 0 other; aaaaaaaaac gttacttaat ccaccgatgg tggttggtgc cttcctcaca attatgtaca 60 tatttgtttc tgtagtaaga atgtcaaacc tacaaatttt atctaaattt tacttcttac 120 aacatatcta catggagtat ggggtctaga atcccaaaaa tcattggacg taatattaga 180 acaacaccta cggggctgtt tcatgaaaat tgttcacttt caatagttat attactttaa 240 agttttttaa gccttaaggc agtaaaaaaa tatatctgtt catttttccc gggagtttca 300 aatcaacaaa atcccgggag ccagatctcc ggatagatcc gaacaacttt ttcatcaaaa 360 tatttgagat ccggcctctg aaatgtgtac aaatgacact taggtgctca taacttttga 420 tagggttatc agatcttcaa tcttttgggc ttgttggaaa ggtcttttga ttacctatcc 480 aacgatgggt cgcatgatag atccggacaa ctttttcatc aacatatttg agatccggcc 540 tctaaaatgt gtacaaatga cactaaagtg ctcataactt ttgatagggt tatcagatct 600 tcaatgtttt gggctcgtta gaaaggtctt tcaaatacct ttctaaaaat gtatgatcag 660 acgggttttc ttacaaaaac cacccttttt acaatctttc aaactttagc cagaatcgtt 720 tttttagcat aacttttgaa atacttaaca aaacttcata atagttaata taggtcttgt 780 gggaccctaa gacggatcga atgagaccag aacggcccaa atcggttcag ccagtccgga 840 gataatcgag tgcatttttt tcggtgcacg gacttacaga catacacacg cacagacatt 900 tgctcagaat ttgattctga gtcgataggt atacgtgaag gtaggtctac gaggtcgaat 960 taagaagttc atttttcgag tgattttata gcctttcctc agtaaggtga ggaaggcaaa 1020 aa 1022 // ID piggyBac-15_SM repbase; DNA; INV; 2357 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; KW horizontal transfer; piggyBac-N1_BM; piggyBac-15_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2357 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 534-534 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-15_SM is a family of piggyBac transposons, characterized CC by 14-bp TIRs (2 mismatches) and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of 10 copies, which are ~97% identical to the consensus CC sequence. The silkworm genome contains >100 copies of the 242-bp CC nonautonomous piggyBac-N1_BM transposons, which is a deletion CC derivate of piggyBac-15_SM. These transposons are 93% identical CC to each other. Given that planarians and insects have split from CC their last common ancestor more than 500 million years ago, CC evolution of these two families of piggyBac included horizontal CC transfer. XX FH Key Location/Qualifiers FT CDS join(446..1148,1222..2156) FT /product="piggyBac-15_SMp" FT /note="piggyBac transposase." FT /translation="MSKQTRQYKFLEDINNIGEEFSQIGEDEFSENEDILV FT SNESELDSIDTDDEKENTVNICCGRKRIRLLSSFEESEVEINKQHVANDGS FT VWEEIKVGGTPGRIPLHNIFRQEVGPTGYAKRHIMKGNVSTAFSLIIDHRI FT MNHIRSCTIEEAKRVLGSDWSLSQEKLDAFIAILYARAAYGANNLNISFLW FT NNIWGPNFFSETMSRNNFTEILRFIRFDKKKSAEPTFENRQIRYDEQLFPT FT KVRCRFTQYMPNKPDKFGIKFWLASDVDSKYVINAFPYLGKDENRPSSVQL FT SEYVVLKLMESFTGCGRTVTTDNFFTSKSLATQLLAKNTTLVGTIRSNKRE FT LPLIAKQKKDDMERFSSKIFTTDNCTLTIYKSKPNKKVLLLSSKHKFVTVE FT NNDKRLPETVSYYNKTKFGVDMTDQMARKFTTKSKSCRWPLQVFFNILDLA FT GINAWILYKQTTGENISRQDFLLKLAVELGADFREAREQPKKRTTTKDSLP FT KSTTDASQRKRCQIGYCKENKTNKICSKCKKYVCGKCTMDTLICKKCDEQ" XX SQ Sequence 2357 BP; 851 A; 351 C; 419 G; 736 T; 0 other; cactagacat accataacgg gtcaaatgac ccattttgaa cttttgcatt gaaaatgcac 60 gtatcatttt attgcttttg cacatttgac ttcatgactt tttcgaacta attaatctac 120 agttaatata gtattttttt tatttttttt tgttttgttt atgttgagta atacttgtcg 180 tatactgtta tctaccacag cgggtcattt gaccctgaga gcattcgttt gttttaagca 240 gttttagttg cacatttctg caaaaactgt tgcgtaaagt agttcaatca aaatagctga 300 gaataatttc aagaaaatat caaaatttta tattggattt agctttttag tgtttgaaaa 360 ggtagcaaaa attgaaagtt ttggtaagta gcttatattt tattttgatg gctttagaaa 420 tacaggtcgc tttttaccca gaaaaatgtc gaagcaaaca agacagtaca agttcttaga 480 agatataaat aatattggcg aagagttttc gcaaattggt gaagatgaat tttctgaaaa 540 tgaagatata ctggtttcaa atgaaagtga gttggattct atagatactg acgatgaaaa 600 ggaaaatact gtgaatatat gttgtggacg aaaaagaata agattactct caagttttga 660 agaaagtgaa gtagaaataa ataaacaaca tgttgcgaat gatggatctg tttgggaaga 720 aataaaagta ggtggcacac ctggaagaat accacttcat aacattttca ggcaagaagt 780 aggcccaaca ggatatgcaa aacgtcatat aatgaaaggt aatgttagta ctgcattttc 840 attgataatt gaccaccgta ttatgaacca tataagatca tgcacaatag aagaagcaaa 900 gagagtgtta ggatccgatt ggagtttatc tcaagaaaaa ttagatgcat ttattgcaat 960 tttgtatgcc cgagctgcat atggagcaaa taatttgaat atttcttttt tgtggaataa 1020 catatggggg cctaattttt tttctgaaac tatgagcaga aataatttca ccgaaattct 1080 tagatttatt cgttttgata aaaaaaagtc agcggagcca acgtttgaaa acagacaaat 1140 tcgctatggt atctacgatt tggaatattt tcattgaaaa cagccaaaac tcttataaac 1200 cgggtggtaa tattactata gacgaacagc tatttccgac taaagttaga tgtagattta 1260 cgcaatacat gcctaacaaa ccggataaat ttggaatcaa attttggttg gcatcggacg 1320 tcgatagcaa atacgtcata aatgcatttc catatctagg aaaagatgaa aataggccct 1380 cttcagtaca actctctgaa tatgttgtac tcaaactcat ggagtcattt acgggttgcg 1440 gaagaactgt taccacagac aacttcttca caagcaaatc cctagcaaca caactcctag 1500 caaaaaacac tacgttagtt ggaactattc ggtcaaataa aagggaatta ccactaattg 1560 cgaagcaaaa gaaagatgat atggaacgat tttcatcgaa aatttttaca acagataatt 1620 gtactcttac aatctataaa agtaagccaa ataaaaaggt acttttgctt agttctaaac 1680 ataaatttgt aacagtagaa aacaatgata agcgtttacc tgaaactgtt tcatattaca 1740 ataaaaccaa atttggtgta gacatgaccg atcaaatggc gagaaaattt actacaaaat 1800 ctaagtcttg cagatggcct ctccaagttt ttttcaatat ccttgattta gctggcataa 1860 atgcctggat tttgtacaag caaacgacag gagaaaatat ctcgagacaa gattttttac 1920 tcaagctagc agtagaactt ggcgccgatt ttcgagaagc ccgtgagcaa ccaaaaaaaa 1980 gaacaactac gaaagattcg ttgcctaagt ctactacaga tgcttctcaa cgtaagaggt 2040 gtcaaatagg atactgtaaa gaaaataaaa ccaacaaaat ttgctctaaa tgtaaaaaat 2100 atgtttgcgg aaaatgtaca atggacacac ttatatgtaa aaaatgcgat gagcaatgaa 2160 ttaaaaattc ttatttttta cgtttgtaat agaaatggtc aatatactat ttatttttaa 2220 ttataataaa ttttattttt atgttaattt ccttcctaac attcatattt catacattca 2280 agcaggaaac gtgaagtcag gtcatttgac ccgctatggt agaaataggt atctaaaaag 2340 tgttggtatg attagtg 2357 // ID Gypsy-42_OD-I repbase; DNA; INV; 12003 BP. XX AC CABV01001282; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_OD_; KW Gypsy-42_OD-LTR; Gypsy-42_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-12003 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001282; Positions 13865 1863. XX CC Positions [2942-3418] - Reverse transcriptase CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 119..5980 FT /product="Gypsy-42_OD-I_1p" FT /translation="MQPSVDIFESTAGHSYYSGNKPRKLAFSSSDDAIRHK FT RCYAWRDHLCAILAANIRKNDGDNRFRDQIASMPLTWGDNNDKMPSLAGRL FT SIPTDLREFEDWRAAGRYDGLAELENCVVNDNHDNFVLTNGQTAAQSLAVF FT QEHFFPGTRLPAVNYFALVNGSSIAIMSDICLNCEDTTQSDVRNGYVKNLN FT FYVDILVKDHCQESHKNDFLRVVKFVKFVGRGELAHIKTNTLYDIIKRMLR FT RVMSDNAAGGRFPDGTPMGGSRFRSQFTIQPDRFGPFAETMTLLLCCVGAS FT PSNWETIIKSIATKRRIEYREVNNRDIIMEKRLFLDEIEKLCRATTHAELS FT QVEISIDLSTNNFREMLSQISNQQRVAVIKAQKRETHNFNQITEEIEGSPE FT TQAQNDPDEVNAFNAKFGNQKVKTWQDSNQRQQSRGFSKFVNLKDIKFNKN FT ARIDFDTGRVLEFTQTKLNSNDLKEAIKSSPHLVTRTRTRERRPDVRRKVN FT NFIARKGTRRSPHNQDTYVLATEVTCNELMALQDRLEFSSDDEIDFQSYAV FT ENTNGTGDSYAAFNIPGEIFAKFEEINNSNLRAKLEKYDESYDAKVERTTL FT SIWFCLPDCEKYVLNKKEHEWYRILLDTGASVSLISESLFRLLKAGNIRVE FT SAAGGAGRTAGGGKISFLPIAVSFTVRLKQNVKLTFTRARVCAGNDKTVLL FT SLGDLARNLGNIKTYESEDGFKIRVSVRGVDISDLGQKTGVGVRHLQNAVH FT CFNRYEFVRNDRLLPAERCPTLVPFDKSEFIDMKPTDRHNPDSKYAYGRKL FT EKIHQANREINTWRECSIDPSNTACKVIPEFEKFKTKIWTILENNKPLFEG FT TQGCVPGDDMVVEAEISGNTAGKIVNQSSNARRTTSEKDALIEKLDNEAAE FT GVIGFVPKGMTQLHRVTFFPVGKKDAKTGKVELCSGSIRVVADCKRAKINE FT DTVYLARPTDCIRNVLQGVARFTETGLIACIDISSMFYCFRISDKMRAHFC FT FEHPYQDDHCYRRLPMGWISSPALAREKLNRILYKHRDYCFVYVDDIVIGA FT DTPEEFLENFAGVLHTLCVANLRLKGKKCDILGKDMKILGRRVVNGVIKAS FT PHVLTKIADLTPDKVTTVKCMKRVLGYITYIGDSLPFRTELLAEMHQAASS FT KRKLSEKIAWTEQLTKSYLKAQKAINETLLGLYPVAKKLDTFLVVDSSNKG FT TGAFLYQVGADKKVRINKIYSKKRPDAENETQWSSCLVELHGILSAAIHFE FT WEIDIVEGVVTVITDSSSVEKLWKRCLNGQELASDKKINERLMKLMAYNIR FT IIYKSCRSADIDLADIISRSDEMLKPCDENCKTCLDVNKNGYLTEKDSRCF FT YNLESFDLHPSVSISRFERIFDAQAAANIKLELKHCREKFYFCGDELDIGI FT NVMQTRARAREICVPAKKESESTPWIRDARAIVEELNEFFRTNRLTDFFKN FT RGLIRKIQLTDKSIQKALESKREGGKKPYADKTAEALRNSCQLEVKDNVGI FT LRKNAMFVETKMVEPIVIPATFLELILRKLHGASGCTSLAAYKKIVKCDFW FT AAGMHKIIEKVHRSCTGCTYYRKHSKIEVIPQEYDDTAPTELGQVFFSDVI FT TRNTHGMKENHAEPTFKFYVVSEAVSGLCKIYPISNKDNNGEVGTDVILQA FT LGDFARGPLKEKKIRVFVDGCSVNKNIAKRLVWNEFEVKIILPVAFSKSKN FT YVSPIDSRIGKITKYLVAEISKKGSPTRIAVATSNRANCTPGRHGFTPYEI FT YYERDRYNKKITVDLEKLIKYIKECREKGRQAQLRNQTKGRTRRPLKLTPF FT EEGDSYESEFEKPIKVGDLILIEGDWNKNDLNPYFKVVATDAYPSGVDWFE FT GVVYTHKLGVSRKHVHVWSLTAIRAIVDGREAGDVENAGLMMVAHNDELVL FT PEFMHSRTQVFNRELWSKPLISYKRLIARK" XX SQ Sequence 12003 BP; 3673 A; 2467 C; 2917 G; 2946 T; 0 other; gtttacgtgg tgactgagag ttccttggaa acagattccc tgatctggag accagggact 60 ccggcacgga cttagccgaa ccaaacgcaa acctcgagtc gggatcgcag ataccgagat 120 gcaaccaagc gtggatattt tcgagagtac cgcgggtcac tcatactatt cgggaaataa 180 gccaagaaaa ttagcatttt cgtctagcga cgatgctata aggcataaac gatgttacgc 240 gtggcgtgac catctgtgcg ctatattagc ggcaaacatt cgcaagaatg acggggacaa 300 taggttccgc gatcagatcg cgagcatgcc gctaacttgg ggtgacaaca acgacaaaat 360 gccttcactg gcaggtcgtt tgtcaatccc aactgatctg agagaattcg aagattggcg 420 agcagcaggc cgttatgacg gcctggcgga attggaaaat tgcgttgtga acgacaatca 480 tgataatttc gtgttaacga acggtcaaac agcggctcag agtctcgcgg tttttcagga 540 acacttcttt cctggtacgc gtttacccgc tgtaaactac ttcgcgctag tgaatggttc 600 ttcgattgca attatgtccg atatttgttt gaactgtgag gacacgactc agagcgatgt 660 tcggaacgga tacgtaaaga atctaaattt ttacgtggat attttggtaa aggatcattg 720 ccaggagtca cacaagaatg atttcttacg cgttgtcaag ttcgttaagt ttgtgggtcg 780 cggagagtta gcacacatca agactaacac gctctacgac ataattaaac gaatgttaag 840 gcgagtaatg tcagataacg cggcaggcgg gagatttccc gacgggactc cgatgggagg 900 tagtcgtttt cgatcgcagt tcacaatcca gccagaccgt tttggtccct tcgcagaaac 960 tatgacactt ttgctatgtt gtgtcggcgc gagcccaagt aattgggaaa caataattaa 1020 gtcgattgca acaaaacgca gaattgagta tcgggaagtt aacaaccgcg acatcattat 1080 ggaaaaacgc ctatttctgg acgaaattga aaagctgtgc agagcgacaa cacacgcgga 1140 gctttcacaa gttgaaattt caattgatct atccaccaat aatttcagag aaatgctatc 1200 tcaaatatca aaccagcaac gcgtagcggt catcaaggcc caaaaacgcg aaacgcataa 1260 ttttaaccaa ataaccgaag agattgaagg atcccccgaa acgcaggcgc agaatgatcc 1320 agacgaggta aacgcgttca atgccaaatt tgggaatcaa aaagtgaaaa cttggcaaga 1380 ttctaaccag cggcagcaat cacgcggctt cagtaagttt gtcaacctga aagacattaa 1440 gttcaacaag aacgcgcgta tagatttcga cacggggcga gtgttagaat ttacgcagac 1500 gaaattgaac tcgaatgact taaaggaagc aattaaatcc tcacctcatc tagtcactag 1560 aacgcgtact cgcgaacgcc gacctgacgt gcgacgaaag gtcaataatt ttatcgcgag 1620 aaagggcact cgacgtagtc ctcacaacca agatacatac gtcctagcca cagaagtaac 1680 gtgcaacgag cttatggcac tgcaggaccg cctagaattc tcatccgacg acgagatcga 1740 ctttcagtcc tacgcggtcg aaaatacgaa tggcaccgga gacagctacg ctgcatttaa 1800 cataccaggt gagatttttg caaaatttga agaaataaat aactcaaatt taagagcaaa 1860 actagaaaag tacgatgaaa gttacgacgc gaaagtagaa agaacgacgc tgagcatatg 1920 gttttgttta cctgactgcg aaaaatatgt tttaaataaa aaggaacacg aatggtatag 1980 aattctacta gacactggag ctagtgttag tttaataagt gaaagtttgt tcaggttgtt 2040 aaaagcagga aacattcgcg tagaaagcgc agcaggcggc gcaggcagaa ctgcaggtgg 2100 cgggaaaatc agtttcctcc caatcgcggt gtcgtttaca gttcgcctaa aacaaaacgt 2160 taaacttacg tttaccaggg cacgcgtttg cgcaggtaac gataaaacgg tattattgtc 2220 actgggagat ttagcgcgta atttgggcaa cattaaaact tacgagagtg aagacggttt 2280 caaaattcgc gtatcggtca gaggagtaga catcagtgat ctcgggcaga aaaccggagt 2340 tggagtgagg catttgcaga atgcagtcca ttgtttcaat cgttatgagt ttgtcagaaa 2400 cgaccgtctt cttcccgcgg aaaggtgtcc aacactcgtg ccttttgata agagtgaatt 2460 tattgacatg aaaccaacag accgtcacaa tccagattcg aaatacgcgt acggccgaaa 2520 gctcgaaaaa attcaccaag caaatcggga gataaacacg tggagagagt gtagcattga 2580 tccttctaat accgcgtgta aagttattcc tgaatttgaa aagtttaaaa ctaagatttg 2640 gactatattg gaaaataata aacctctttt tgagggtacg caggggtgcg taccggggga 2700 cgatatggtc gttgaggcgg aaatttcggg caatacggcg ggcaaaattg tcaaccagtc 2760 cagtaacgcc aggcgaacca catctgaaaa agacgcgctg attgaaaagc tcgacaatga 2820 ggcggctgag ggcgtgattg ggtttgtccc gaaagggatg acacagttgc acagagttac 2880 cttttttcca gtcggaaaga aggacgcgaa gacaggtaag gtggagcttt gttctggctc 2940 catacgtgtc gtcgcggatt gtaaaagggc gaaaatcaat gaagatacag tttatctcgc 3000 ccgtccaact gattgtattc gtaacgttct ccaaggagtt gcgcgtttca ctgaaacggg 3060 tcttatcgcg tgtatagaca tttcaagtat gttttactgc tttagaataa gcgataagat 3120 gcgcgcgcat ttttgcttcg agcatcctta tcaggatgac cactgttatc gtaggcttcc 3180 gatgggatgg atatcatcgc cagccttggc acgcgaaaaa ttaaaccgga ttctttacaa 3240 acaccgcgac tactgttttg tgtacgtcga cgacatagtt ataggggcgg acacgcctga 3300 agaattttta gaaaatttcg cgggggtgtt acacacatta tgcgtggcga atctgcgttt 3360 gaaaggtaaa aagtgcgata tactcggtaa agatatgaaa attttgggca gacgagtggt 3420 gaatggggta attaaagcga gtccgcatgt attaacaaaa attgcggacc taactcctga 3480 taaagtcact acagtcaaat gcatgaaacg cgttttggga tatatcacat atatcgggga 3540 cagcttaccg ttccggacgg agctcctcgc agaaatgcat caggccgcgt catcgaagag 3600 aaagctgtct gagaagatcg cgtggacaga acagctaact aaaagttatt tgaaggccca 3660 gaaagctata aacgaaacgc tgttaggttt atacccagtc gcgaaaaaat tagatacctt 3720 tttggtggta gattcctcaa ataaaggtac aggcgcgttc ctgtaccaag tcggggctga 3780 taaaaaagtg cgtattaata agatatactc caaaaagcgg ccagacgcgg agaacgagac 3840 gcaatggtcg agttgtctcg tggaattgca tgggattctt tccgcggcta tccatttcga 3900 atgggaaatc gacattgtgg aaggcgtagt aacagtaatt acggactcgt cgagcgtaga 3960 aaagctgtgg aaaagatgtc taaatggaca agaattagcg agtgataaga aaataaatga 4020 acggttaatg aaattaatgg catataacat ccggataatt tacaaaagtt gcagatccgc 4080 ggatattgat ctagctgata taatctcgcg gtcagatgag atgctcaaac catgcgatga 4140 aaattgcaaa acttgtttag atgtaaacaa aaatggttat ttgacagaaa aggattcgcg 4200 ctgcttctat aacttggaga gttttgatct ccatccctcg gtttccatct cgcgttttga 4260 acgaatcttt gacgcgcaag ctgcagcaaa tattaagctc gaacttaaac attgtagaga 4320 gaaattttac ttctgcggtg acgagcttga tattggcatc aacgtaatgc aaacgcgggc 4380 aagagctaga gaaatctgcg tgccagcaaa aaaggagtcg gaaagcactc catggattcg 4440 agacgcgcgg gcaattgtag aagagctgaa tgaatttttt agaactaata ggttaacaga 4500 tttctttaag aaccggggtt taatcaggaa aattcagctt actgataagt caatccaaaa 4560 agccctagaa tctaaacgcg agggaggaaa gaaaccatac gcggataaaa ccgcggaggc 4620 gttgagaaat tcgtgccaac tggaagttaa ggacaatgtt ggaatcctac gtaagaacgc 4680 gatgtttgtt gagacgaaga tggttgagcc aattgtgatt cccgcgacat ttttagaatt 4740 gattctgaga aagcttcacg gagcttcagg ttgtacttca ctggcggcct ataagaaaat 4800 tgttaaatgc gatttttggg cagcggggat gcataaaatc atcgagaaag tgcatagatc 4860 atgcacaggg tgcacctatt atagaaagca ttcaaaaatt gaggtgatac cgcaggaata 4920 tgatgataca gctccaaccg agttgggtca agtgttcttc tccgatgtta taacgcgcaa 4980 cactcatggt atgaaggaaa atcacgcgga accaacattc aagttttacg tggtcagcga 5040 agcggtatcg ggtttatgta aaatataccc aatttctaat aaagataata atggggaggt 5100 tggtacggac gtaatcctac aagcgttagg agatttcgcg agaggcccct tgaaagagaa 5160 gaaaataagg gtgttcgttg acggttgcag cgtcaataaa aatatcgcga agaggttagt 5220 ttggaacgag ttcgaagtga aaattatact tccagtcgcg tttagcaaat ccaaaaatta 5280 cgtcagtcca attgactcgc gtatcggtaa aataaccaag tatttggtcg ctgagatttc 5340 aaagaaaggt tcaccaacgc gtattgcagt agcaacctcg aatagagcca attgcacgcc 5400 gggtagacac ggatttacgc catatgaaat ttattacgaa cgcgatagat ataacaaaaa 5460 gatcacagtc gacctcgaaa agcttataaa gtacataaaa gagtgtcgag aaaaagggcg 5520 gcaggctcag ttacgcaatc aaactaaagg ccgaacgcga cgaccgttga aattaacccc 5580 atttgaagaa ggggattcgt atgagtctga atttgaaaag ccgataaaag taggggatct 5640 aattcttatc gagggagact ggaacaaaaa tgatttgaat ccttacttta aagtcgtagc 5700 gacagacgcg tatccatcgg gcgtagattg gtttgaaggt gtcgtataca cacacaagct 5760 cggtgtatcg cgaaaacatg tccatgtatg gtcattaaca gccatcagag ccattgtaga 5820 cggccgcgaa gcgggcgacg tagaaaacgc gggtctgatg atggtagcac acaatgacga 5880 gctcgtccta cccgaattta tgcactctag aacacaagtg ttcaatcggg aactgtggtc 5940 caagcctctc atatcctata agagacttat tgcaagaaaa tagagcactg ctcagaattt 6000 attgtaagaa aataataaaa cgcaaaaaaa aaaaaaataa attttaatag tcaaattttg 6060 atcttttaat taaataaaaa gcataatttg ggggacattg tttattcttc cgataaaagt 6120 tttatctctc aaataaagtt taaaaataat agtagggcac tgaaattgaa gtgaaaagtg 6180 tgacgcgtaa caatatggat taggtggcag atttgtgaac caaaaactgc aggaagacga 6240 acgctcgtag aagcacacgc ggctgtgggt tgtagtgtag aaaataacgc tgaaaccggg 6300 taatataaaa ttaaaacacg tactcttacg tgagggatgc ttgaaacgcg taatgaccta 6360 ctctccatcg ggcaaaaaca ggttcaggcg tagcaggtga agctacaacg cgatatattt 6420 aaattatatt tttaataaat aacgctaacc tgttatcgtg gtacgcgttt tcgagaacca 6480 gttgaaagtt tgctcgcaaa ggtgagaggc gccaatgacg aggctctgag cctattcggt 6540 agatcctcag catatagacg gtcgagaaac agatgtgcca tttcctgaag ccgcaaagca 6600 tgagcactcg tccgttgctt gcaaaagcac ccgccgcggt agaattctga aagtgaagtg 6660 tggtagaaac taagcagatc aacgcaggcc ttggccagtt ccgcgtcaat gttcgtagat 6720 tgaagcttga acacaggcgg tagatcaagt ttgcactcca gaactagttc cactcgagcc 6780 gtgcggaaag tttttcctga aaataataaa atataagtac aagattaact ttacctatat 6840 aaaaaagtag gcttcgcgtt tcggtaaaag gattttcgcg ttccatcaca tcgtaaaagt 6900 gtcgcttccg cagatgcact aacgcggacg aaactggcaa cctcgcggag gtataaaggt 6960 tgagtaagtc aatctcgaac tcaggaaaaa gctcagttgg caaattgcga gcagaatagc 7020 ttatttatcg ataattataa gttattttaa tgatagaacc tctgggacgt ctttgtagct 7080 acgcggtaaa ttcggcggta tgagcgcgca gatgaggtgg actcgcaaga cgatggctca 7140 gtcatatcta caacgccgtc caaactgttg agttgacgcg ttatatcagc gcgagtaaca 7200 agcggagtcc ggaaaaccgt gtcattttgc tccacattat acccagagaa ggttgtctct 7260 ggtcgtctgt ctggtaatgc tgccaaacgc attaaagtag tttcggcagg cggtcggcat 7320 gacacggaag gaggcaaaga attgtaaacc gtggagccag tagaaaattc gcgggagtcc 7380 tggtttgaca cgcgtaaaga acacggctcc tcatccaagt gcactgtcgg tttgtcgagc 7440 gcggatgaag ctccgcctgg aagttttgta gattcgtggg gtggcggcga ctcaactaaa 7500 atatatataa cgcgtgttac gccgaaacgt acaaacagaa tgcgctctcg gacaagggaa 7560 caggctcctt gtacataaga gcagtctcgg aaattgcgcg catctttttg tcgggtgaat 7620 ttatatcgaa tgcctcatta ttaagattcg cttcgcttcg gtctcgtttt ttgctctcca 7680 tcgtattcat tggaaaattg aattacgcgc gttaaatgca atgtgtgcgt ggatagaaat 7740 acgcaaagat tggcaccggt tggccaatat cgcacgaaag atacaatctt agttgtaaaa 7800 cgcgttttca acggacaaag tccgacagcg agttatttat ctaatagaga aaattttaag 7860 aatgttcaaa ttcgctgatc gcgattcatg gctctcgaat ctaccagatc ccgtcatagg 7920 gccgtgcggt ctttgtcgac gcgaattgga cttctccgaa cggataggtc gcgtctgtaa 7980 gtgctcagcc agttactgtg aagactgctg gacagatacc gccacgacaa aatgtgctaa 8040 atgcggagca atccgagcac aggtcagcgc acgcttggca accataaaat tcgccacgcc 8100 ggcggaggtc ttggtgacaa aaacgcggaa gtctatctgt gacctggcaa aaggtctaaa 8160 ataagtccaa ctgtataaaa tcaattttaa gtagcgtttg agatacaacg cattggacaa 8220 gatcaaataa cacgagcaaa agaaaacgcg gacatcagcc ttgaaaagct ggtttttgac 8280 gcgaaaatgg gagtttgtgg agctgaagtt ccacatgggg tacgcaacaa acgccgcgga 8340 ctctcaaccc gaccgcaaaa ggttttatat agttgaaaca ttaaagatca aaccagttgg 8400 tttagcttaa gtttaccgac cttgatgacg gaggtgaaac gcggcatcca gtacagtctc 8460 ctcgcgagtt cgagtttccg aacaatgcac tcgactgggc tgaacgaaaa ggcgattatt 8520 aaacttgccc tacaaacaca cgcgtacttt gaaataagaa tgaaactaaa aattgaaact 8580 tatagaatgc cgaaacgtgg attgaattcc ccgcgagatt ataaattttc ggcgcctaaa 8640 aaaccttgtc gcagctcagt acgcaaaaga tcactttctg gtaatgtatt gagttcgaag 8700 gtaaaattta cagttaaatt agaaagcgca gtgggagatg cattgttgca attcgcggat 8760 tggccaaact cggcagatga tacggcgttg gaatggttga aagacttctc agacgcgtca 8820 gggggagatt atcgtcgagc gcgtacggta ctgtagccac cgaggggact tggggtgcga 8880 gctgatttcg cgatctgtcg gatatgcggc ttgtggtcag gccacatagg catctgagcg 8940 tgtgtaaaaa ggccacttac gtagtcgggc tcaaggtcga cttccaagag cggcaagttc 9000 agcccgtatt ttggcgtagt cgcgtactga tgagctagct ctcggcctag ccagaactcg 9060 gcagtaccgc ggatgaacat cttcttttga tctaaaagtt taggttaatt ttgaatgaac 9120 accgattttt ccactaaccc gcgtaataat attggtcctt cggcatggac tcgacagcgg 9180 gacctcggcc tagccggata gaacgcgatt tgtagcccgc gatcttcgcg tcgaccatga 9240 gcttgataag cattcgccac actccgttgc gcatgatgat aatcggggac gcgtagacac 9300 aagtcaggaa atatgaaccc gcgagattta tgacggtaat ctcgtcctaa aatacagatg 9360 agaacgagct gccaaatact gattcaccgg taaaaatagg tacttacaat tcctagaaat 9420 gggtgctccg tggattccat gtatttgtac gcagcgagca tctgcataaa cgcgatgaca 9480 ggccgagtca agaattcttc gtgctcgccg ttgaaaattc gaagtcgcac taaaatagtt 9540 tgttttagtc tgcgatgatg gagatcaact taccgtgacc ttttatgtcc gggaagttaa 9600 ccttgctggc atgcgcgtct ctgcggtaga accgctcacc aagacaggtt tgcagaattc 9660 gcgaaatgat gcgctgtggc gatcgctcct tgttcagctc ctcggtgtcg aaaatgagga 9720 cctgctccga cagactcggt acagattcat cttcatatag atcgccgcgt aaatctcgaa 9780 ccttttcgac ccgctaaatt tagttttaga taaataagtt gtaatttaat tacctttcgc 9840 agatattcgg tccaaattaa gaactcagca tatggttctt tctcgaatga gaagtaatgg 9900 tcgtcaccat catgagagca gtcgccaaag ttgagcgctt tgacataata attagccgcg 9960 tcgtgaagtc taactgtcgc gtatttgaat actttcagaa agaacaggca taaagcttgc 10020 tgtttcgact gatcgtcgac atcgtcatgc tcgatttgct ccatgattct gtagaaaacc 10080 cctagaaaag gcccaagtgg ctgtttatcc tttccctttt tcggaaattc aaaaaagcgc 10140 cgaagtcgca cttcgtcgtg ttgaccggcg tttaggacga tgggaatgtc aactccgccg 10200 cgttcaagcg acttttcgtt tacagacgcg ctcaggacaa tagacatgtt gcgagaaccg 10260 ttgataattt gcccgtcaac acggaacgct ttccgcgcgc cctcgaaagg ctgaacacgc 10320 cattccagtt ccttgaactc cttttcgcac aggctgtcgt tgcggttctc ctcaaatttg 10380 tcgaataacg cgtttatttt cgtcaggtcg cgctcatata tcgcatcgaa attactaaaa 10440 tgaaaaattt ttaattaaaa taaatgaagt tttcagaacg cgttctaatg gaaaagagct 10500 tcatggagaa gacaacactt gtaacgcgtt tcaaagcagt ttgaaaaaca ttcacttaat 10560 aatttgtcaa aagattatct aacagcgcga tataaaatta aattatgaac ataagaatgg 10620 cggaaactag cggtatggat tggaaagcaa catcatttct cgccctcgga ccagatgacg 10680 aggttgcggc ggaatcagca aggccagaga tgttctctag ttctgctcca aaaaacttga 10740 ctgcgagtca ggctaaagca agcgcccacg cgaaaatttt ggctgagtca aataccgcgc 10800 tagccgagct aaaatcaggc gtaaaaattt cgcagtttat gccaatcttg cgtaaagttt 10860 acgaattccg agaaaacaag gatgatccgg aagtcgcgaa atttatgacc aacctcggcg 10920 agacttgcga aaaagtagct gagacggaaa aggatgtcct ttttctcacg gaagcggtaa 10980 attaaaaatt agcaattaaa aaattttttt tttgtttgtt tgttaaaaaa taataaccaa 11040 aaataccgcg tacttactcg agtgttaaag agatggtctc ggcagcgatc agttccgtgg 11100 tcgcttctat gcctgcgggc atcgcaagat attctcggtc tcttcctgaa tgcgccaaat 11160 cggtatgaga aaagtgatct ataaaaacaa cgcgaaaata tttagcgtgg gcggatttta 11220 ctcctacgtg gtggatcaga ccgatctact tttcccgcgc aactcagggt tgaaaatgcg 11280 cctctaccgc gtggaaaatg tggccgcatt gaaagccaaa ctttggactc aatcaacgga 11340 agtaaagtgc gaagatttgg ttatggatgc aaagaccagc ctaaacgcta tttatagcag 11400 cctcgcgcct acttgggtaa ataccgttga cccgaaaaaa taatttaatc ggctatatag 11460 atgaccgatc cgccggagat tgaacgcgag gacgtagaaa ccgaccaggc taaatggata 11520 acaactgcct gcaacctcgg caatctgtca aagaaaactg gcgaaacatt catcaaccag 11580 gtggaacaga tgagacactg gtcgctggac gatttgatca gctacctcaa gacttggtca 11640 gacaagactc gtctaacaaa tgtcctgtct tatgaccatt tacgcgatgc tacattagtc 11700 agcatacaag cggacggaga gctacgcgca ttatcaacgc gatacgatat taactcccgc 11760 cgttcggtcg ttaacagggt caacctgcca tattggtgcc agggctggac gatcgtgacc 11820 aatggtcgaa gcttcaacaa cagcgaggga ggattcgcgg ctggaaacca gtacagaaaa 11880 ccccgcggag gtagaggcca ttcaaactcg ccatacagtc gaggaggcgg aggcggatac 11940 agaagttcgg gatatcgccg atattaggta tatacttcaa ccctaatttg ggggacgtaa 12000 ggc 12003 // ID Gypsy-6_PPc-I repbase; DNA; INV; 5133 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_PPc_; KW Gypsy-6_PPc-LTR; Gypsy-6_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-5133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1004-1004 (2010). XX DR Genome; chrUn; Positions 71222703 71227835. XX CC 'GATAT' target site duplication. DNA transposon insertion masked CC by "n". LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1786..4485 FT /product="Gypsy-6_PPc-I_1p" FT /translation="MCNKWSQPKDNQSRYKKVHMVYGDLNEGQQHFNAAVE FT MNGRVIEMLVDTGSDITLISESTWREIGSPEKKAHDAFPKCANGTPLELTG FT KCHLTLELNGITVQGSVYTAKADANILGRDFIVNFFRLVPNRVTVNAISTE FT DDYYVNWIKNEFYEICKEGLGTCTKSDGSLTLKEGAKPVFCKRREIPLAML FT PKVNEELDRLLEIEAIEPVDHLEWAAPILTVPKANGKWRVCVDFSTGLNEN FT LEQNKYPLPVMAEIFARLEGCTVFSQIDLSDAYLQIPVEESSRNMLGVSTH FT RGLFRFTRLPFGVSVAPGIFQKIMDSMMTGCENAQAYLDDIVVGGRTKDEH FT DMNLKKVLMRIQEYGFRIRPEKCSFGLNQISYLGFIIDKAGRRSDPMKVRA FT VREMPEPQDQSSLRSFLGMVAYYGPFISGMHKLRGPLDNLLKDGVDWEWSH FT ECAKVFKEIRPVLASDLNLIHYDPDKEIVLASDASEKGIGAMIAHRVNGRL FT IPIAHASRTLKDAEVKYSQIEKEGLGLIFGVTKFHRYLFGRKFVMQTDHKP FT LLWIFGSKTGVPAHTARRLFHWCTILLGYDFTMEYVNTESFAYADALSRLI FT ADSRGEKDKEYEIDEVERIVCNAVTCNLEKLPITTKDIRTESLNDPVLCHV FT RSYHLSRWPDERTIKKNGLYSRILPFWTKRKEIAVVNDCLMVGDRVVIPQR FT LKNEVMRMLHSGHPGIVRMKSVARQACYWYGIDGDIEKCVQSCIECAAAAK FT RPAKAPLEPWPKANTPWERIHLDYCGPVDGQYLLVMVDAYSKWPEIVSTTT FT ITAGVTIRILNESISRNGLPRVIVTDNGTQFNSDAFNRYCSKRGIQHLNSP FT AYHPQSNGRKICRYGKEKSGETEGRKTSGGGTTVVPDELQENTESTM" XX SQ Sequence 5133 BP; 1435 A; 820 C; 1189 G; 1101 T; 588 other; gtggcgttca ggagtctcag tactcacgat cgtccttggc cttgtgtttg ttgtgataga 60 ggcagagcct ctcacttcaa gtgatagttg agtgtcgggt gtggcagggc caccccttca 120 caagagcagt gatagaggta ggacctctca ctttcaatca ataagcgtgt cgggtagagg 180 cagggcctcc ctttcacaga acgagagaaa tcattgatta atagacggaa gtgctgctga 240 aatcggcaat tatcgatcgt gaacatcaca caaagatgga tgaatcatcg aatttcgagt 300 tgatgaaggc gatgatggaa cagctcaaat cacagaatga tgcaattcag gagcttttga 360 aagagagaaa tgagagagct atgggagctg agatcagagg aagaatgggt ccatctattg 420 atgctcttga gaagcagatt agaacattct catacagacc agaagaaggt ttgacgtttg 480 atcaatggat ggagagacac agagagattt tcgagaatga tttcagtgga ttggaggatg 540 gtgaaaaggg tagaatattg ctgaggaaag tcgatgataa ggtcgacaaa cagcttagaa 600 atcacatccg tcccaagagt ccagctgatc tgaaattcga cgaaatcgtg gctgtaatga 660 aagatctatt cggagataaa cgatcccagt tttaaaagag actcgatctg ttccaactaa 720 agatgtcaaa gttgagatgc gaggatctga aagagttttc aggaattgtg aacagagtgt 780 acgaggatgc taatatcagt gatatgaagc cggaggaatt caaagcaatg attatgctga 840 gtggaattga cctacctcgt tacactgcta ctctctttca tgtgatgaac cagattggag 900 acgagactcc cacaatggaa tctgtcttga aagtggctga tagctacagg aaagtatacg 960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnngt atacggagat 1560 gcgcaggcag ccactggtca gaatggacag tgtactcctg cagtgaatgc agtctccaaa 1620 tacaatccaa gatggtgcaa tcgagagtct ccgaagagtg gagacaacag tcagacgaaa 1680 gagtcggtgt gttctcgatg cggaagagac agtcatagag aagatggctg tcgttacatc 1740 aacacagagt gccacaaatg tcatcggaaa ggacatctga gtgtgatgtg caataagtgg 1800 agtcagccga aggacaacca atcaaggtac aagaaggtcc atatggtcta cggtgatttg 1860 aatgaaggac agcaacattt caatgctgca gtagagatga atggtagagt gattgagatg 1920 ctggttgata ccggatcaga tatcactctc atctctgagt cgacgtggag agaaatcggt 1980 tctcctgaga agaaggcaca cgatgcattt ccgaagtgtg ctaatggaac accactggaa 2040 cttaccggaa aatgtcatct cacgctcgag ctcaatggta tcactgtaca aggaagtgtg 2100 tacacagcga aggctgatgc gaatatactc ggaagagact tcattgtcaa tttcttccgt 2160 ctggttccca atcgagtcac tgtgaatgcg ataagcacag aggacgatta ttatgtgaat 2220 tggataaaaa atgaattcta tgagatttgc aaagaagggc tgggaacgtg cacgaaatca 2280 gatggaagtc tgacactgaa agaaggtgcc aaaccagtat tctgcaaacg acgagagatt 2340 cctttggcaa tgttgcctaa agttaacgag gaattagata ggctactaga gattgaggct 2400 atcgagccag ttgatcatct cgagtgggcc gcgcccattc ttacagtgcc aaaggccaat 2460 ggaaagtgga gagtgtgtgt tgatttctca actggtctga atgagaatct agagcagaac 2520 aaatatccat tgcctgtgat ggccgagata ttcgctcgtt tagaaggatg cacagtattc 2580 agtcagattg atctgagcga tgcgtatctg cagattccag tagaggagtc gagtcgcaat 2640 atgctgggcg tgagtaccca tcgaggatta ttcagattta cgagactacc attcggagtc 2700 agtgttgctc ctggaatatt ccaaaaaatc atggactcaa tgatgacagg atgtgagaat 2760 gcacaggcat atctcgatga tatagtagtt ggaggaagaa ccaaggatga gcatgatatg 2820 aatctgaaga aggtgctcat gagaatacaa gaatatggat tccgtattcg ccccgaaaaa 2880 tgctcttttg gactgaatca gatcagttac ctcggtttca taatcgataa agctggtcgc 2940 cgttcagatc ctatgaaagt gagagcagtc cgagagatgc cggagcctca ggatcaatca 3000 tcattgagat cattcttggg aatggtagca tattatggtc catttataag tggaatgcac 3060 aaactgagag gacctctcga taatttgctg aaggatggag ttgattggga atggtcgcat 3120 gaatgcgcta aagtattcaa ggagattcgt ccagttctgg catcggatct caatctcatt 3180 cattatgacc cagacaaaga gatagtgctg gcttcagatg caagtgagaa aggaatcgga 3240 gcaatgattg cacacagagt caatggcaga ctcattccca ttgctcatgc atcgagaaca 3300 ctcaaggatg ctgaagtgaa atactctcag attgagaagg agggactagg actcattttc 3360 ggagtcacca agtttcatag atacctcttt ggaaggaagt ttgttatgca aacggatcac 3420 aaaccgttac tgtggatctt tggatcgaaa acaggtgttc ctgctcatac agcgagaaga 3480 ctatttcatt ggtgcactat ccttcttgga tacgatttca ctatggagta tgtgaataca 3540 gagagcttcg cttacgcaga tgctctatct cgattgattg ccgattcgag aggagagaaa 3600 gataaagagt atgaaatcga tgaggtagag aggatcgtat gcaatgccgt gacatgcaat 3660 ttggagaaac ttccaataac aacgaaagat atcagaacgg agtctctgaa tgatcccgtt 3720 ctctgtcatg tgagaagcta tcatctcagt agatggccag acgaaagaac cattaagaag 3780 aatggtctgt actctagaat tcttcctttt tggacgaaaa ggaaggaaat agcagttgtg 3840 aatgattgtc tgatggttgg agatcgagtg gtcattccac aaagacttaa gaatgaagta 3900 atgaggatgc ttcattctgg acatcctgga atagtgcgaa tgaagtcggt tgccagacag 3960 gcgtgctatt ggtacggaat cgatggagac atcgaaaagt gtgttcaatc gtgtattgaa 4020 tgtgcagctg ctgctaagag accagcgaaa gcaccgctgg aaccatggcc aaaggcgaat 4080 actccatggg agagaatcca tctcgattac tgtgggccag tggatggaca gtatctgctt 4140 gtgatggtcg atgcatactc gaaatggcca gagattgtga gcacaacgac gattaccgca 4200 ggagtcacca ttcggatcct gaatgaatcc atttcgagga atggtctccc tcgagtcatc 4260 gtgactgata atggcacaca attcaactct gatgctttca atcgatactg cagtaagaga 4320 ggcattcagc atttgaatag ccctgcatac cacccacaga gtaatggccg aaagatttgt 4380 agatacggta aagagaaatc tggagaaaca gaagggagaa agacctctgg aggaggcact 4440 acagttgttc ctgatgaatt acaggaaaac accgaatcca caatgtgatg gaaagtcacc 4500 aggagaagtg ttccttggta gaaagattcg atcagaaatc gatcttatga ttcctattat 4560 gagattcgac tcaaacgatc agtgtataaa tgatccgatg aaggatcagt tcgacaggaa 4620 gaacggtgta aagacaagga agattggaat gggagatgaa gtgatgtatc acatgcacgt 4680 acctccgaat ggtttcaaat gggcaaaagg cactgtgatt ggaaagaaag gaaaagtaat 4740 gtatgaggta caattggaga acagaaagat cactgcccat gccaatcagt tgagagtgag 4800 acaagaatca gatatggaaa ttgaatatca ggaggaaaat gaatttccga tgaatgagaa 4860 tatcctagaa aaggaattcg atatgaatga tggtgaggtt atcaatgatg agaccggatc 4920 gattgagaaa gaggatgaga gtgttcctca atctacagaa gtggtagatc tcccggtaga 4980 accgaggaga tcaacacgag atcgtaagca gactaagcat ctcgacgtcg atccgtccaa 5040 gaaatcctac agaaaggaat gatccatcga atcactatta ttgtaattaa tgtttgttca 5100 gactctcccc aagctcgagt cttggagggg agg 5133 // ID BEL-1_SI-LTR repbase; DNA; INV; 404 BP. XX AC AEAQ01000373; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_SI_; KW BEL-1_SI-I; BEL-1_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-404 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01000373; Positions 3025 3428. XX SQ Sequence 404 BP; 102 A; 99 C; 107 G; 96 T; 0 other; tgtagcgtaa acagttcgac acaatgttgt ctcgttaaaa ttcgtaacat tggtcttgca 60 taggcgaatc cgtttcctat gcacgtgttt atatgtcgcc aagcaacgac gcgttcggct 120 cattggcccg tcgcgaatgg gcgcgagccg tgtattactg cgcgcggaat agaaagagaa 180 ggcgagacaa gacgccagat tcgtgtaagc cgtggccgag ctcacaagga aagcgagcga 240 gtttttctgt gcgacctttg taccggtgaa gagtgattaa aatacatacg tacaaattga 300 ggtgattacg catttacctc cctcacatcg ctcccgatcg caatctttcc ggtagcacgt 360 gtggtaggcg gacgaagcac gctcacgcgt tatcgaacgc caca 404 // ID Gypsy-608_AA-LTR repbase; DNA; INV; 399 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-608_AA_; KW Ty3_gypsy_Ele137; Gypsy-608_AA-I; Gypsy-608_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-399 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 399 BP; 124 A; 109 C; 59 G; 107 T; 0 other; tgtgctgtgc ctgcatcatt tcacattata ttattgccta cgaagaacca ttcgcgtttc 60 ccacactcca cttgataaca tttcgaaacc cacttgagtt gacaaacctt tcgaacccaa 120 atcattccac acctcgccaa catgtggcgt gggaagttaa aaccaaaaca taaagtaaca 180 aactgtaata gaacacccgg caatgcatcg ccacgcattg caaggcattg cacttaactg 240 acctacatct ggttctcgct aacgatcgtc actcacccgt agctcactgg cacagcgcaa 300 ctctatccat ctctctttga tatattctgt ctctatcgat atgcataatc aatacatgta 360 acttagctcg taagatttga caaaataaag aaacattca 399 // ID Gypsy-174_AA-I repbase; DNA; INV; 6935 BP. XX AC supercont1.141; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-174_AA_; KW Gypsy-174_AA-LTR; Gypsy-174_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6935 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.141; Positions 157998 151064. XX CC 'AAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 571..2577 FT /product="Gypsy-174_AA-I_1p" FT /translation="MDSMYSRQYRSMNVNHLLDDELEYELALRKVEFTSGE FT SRDIKRRKLRNALKEERESNSFVSRRISEEARESELRLVEEKLAMIRDSFE FT NRKAKKTELPQFETRLVHLYFRLLRLKKVFNTGGLEQIALGLLNSNFSRNS FT RDPEVDGEYSGGLVNQPGHESAQNPESIDSEESEEEVEEESNPRTDNEKEE FT SSEKSDVLDFTVRNANSRRSSTPRAKEDKVVNADELFDRIMQQVDRAITSK FT LKTLNIGTRVEDKEVKKIEKLSSRPSVRETKPYKAVAKVNRQPEKEGEQRD FT AGRSKKKRNRVYSESSNESRASERDSDRNSESADTETEVEEQTDNGRRLKR FT RPRPVCDWKLKYDGRDDGRGLNKFVSEVEFMAEAENISKRSLFNEAIHLFS FT GDARAWYIEGRRNREFRNWNELVVELKLEYQPPDMDYHYEQQAAARRQKRG FT EKFQDYYYAVKDIFEQMASPPTDQRKFEIIFRNLRSDYKNSLLVKGVRTLR FT MLKVWGRKLDSANWFLYRKQEGEQGSKSAQVHEITTASPAREQRPPRNNWK FT ESGFKSRQSWRNPNSGGYQGPRNDPRGFRKPSPESSKSDQQTKPKQMQSNE FT DPKEGSSKELLQKRVDAYRIPERTVCYNCRGNYHHSSACLKAKEVFCVACG FT FQGFLKPDCPFCGKNGKRSM" FT CDS 4325..5698 FT /product="Gypsy-174_AA-I_2p" FT /translation="MNSKWKPSSKLSRWSMLLQQYDMVVKHRKGSENVVPD FT ALSRAVETIDLSKEDWYNRMYKKVSDTPDNFPDFKIEQDKLYKLVSSPTDA FT MDYQFEWKLCLPENLRLEVIKQEHDQSLHPGYEKTIHRLKIRYYWPKMAMQ FT VKKHVRSCSVCRQCKPSTLPTAPVMGNQRVSSKPFQILALDFIQNLPRSRN FT GKSHLMVLMDLFSKWTLLLPIRRIESKEVCRLVEDCWFRRFGTPEVVISDN FT ATTFTGKEFQELLVRRGIQHWPNSRHHSQANPVERINRTINACLRTYMEKD FT QRLWDTRIAEVEELLNTTIHSSTGLSPYRIIYGHEKITMGNEHRLERDTKD FT ISLAKRDEIRQKLNENIFKIVADNLKKSHEKVRKTYNLRHRKFCPTYHVGQ FT KVYKRNFRQSSAAERYNAKYGPLYMPCVIVARRGSSSYEVADSSGKILGVF FT SAADLRPGDDEKP" XX SQ Sequence 6935 BP; 2261 A; 1230 C; 1641 G; 1803 T; 0 other; attggcgacc aacgttaaag tcgtgaacgg taatcggagg aacgattagt atttgaaaaa 60 aatcttgaaa ttttttaaac gggcgacagg tgaatgcaca atcaattagt tgattgtgca 120 atccgacatt tgatatccgc taatcatagt tcgcaagtcg acaaatggaa gcgtgaagac 180 attggctaca ttcggttccg ttaagagtgc gaaaaagaag aagcaacaaa agcagtaaaa 240 agttgatgga attcggagac tgggacaagt tttgaaattg gtgctggtga ctaaccttga 300 aggacataca cttaaccaaa aacacttgaa catttcttaa aggtcctgaa tgaacttgaa 360 tacaaaatac aaaacaacac aaaagatagc gcatttattg aattattttt gaatttcttc 420 tcatatacta cttcttattt tctagcttaa gtttttatta tattttattt attgttgaat 480 ttgcattgat aatctagagt taattagcat atgttgcact gaattattga atttttattg 540 aactttacta aatcactgta tttgaacaaa atggattcca tgtattctcg ccaatatcgt 600 tcgatgaatg taaaccacct cctggacgac gagttagaat atgagcttgc attacgcaaa 660 gttgagttta ctagtggtga atctagggat ataaagcgaa ggaaattacg taatgcattg 720 aaagaagaac gagagtctaa tagttttgtt tcacggagaa tatcagaaga ggcacgtgaa 780 tcagaattac gcctggtaga agagaaatta gccatgatca gagattcgtt tgaaaataga 840 aaggcaaaga agacagaact tccccagttc gaaacgcgtc tggttcacct gtattttcgg 900 ttgctacgtt tgaaaaaagt gttcaacact ggtgggttgg aacaaatagc cctcggtttg 960 ttgaacagta atttctcacg gaatagtagg gacccagagg ttgacgggga atactctgga 1020 gggttagtca accaaccagg acacgaaagt gcccaaaatc cagagagcat agacagtgaa 1080 gaaagtgaag aggaagtaga agaagaatcg aacccacgaa ctgataacga gaaagaggaa 1140 agtagcgaaa agagcgatgt acttgatttt accgtgagaa atgcgaatag taggagatcc 1200 agtactccca gagctaaaga ggacaaggtc gtcaacgctg atgaactgtt cgacaggatc 1260 atgcagcaag tggatagagc cataacgtcc aagttgaaaa ccttgaacat aggaactaga 1320 gtagaggata aagaagtgaa gaaaattgaa aagttgtcaa gtaggccgtc agtaagggaa 1380 actaagccgt acaaagcagt agcaaaagta aacaggcaac cggaaaagga gggtgaacag 1440 agagacgcag gtaggtcaaa aaagaaacga aatagggttt attccgaatc atctaatgag 1500 agtagagcta gcgagcgaga tagtgaccgt aatagcgaga gtgcagacac agaaaccgag 1560 gtcgaagaac aaacggataa cggaagacgg ttgaagcgaa ggcccagacc agtctgtgac 1620 tggaaattga aatatgatgg tagagatgat ggaagaggtt tgaataaatt tgtgtctgaa 1680 gtagagttca tggctgaagc ggagaacatc agtaaacgca gtttattcaa cgaggctata 1740 cacctgtttt ccggtgacgc tagagcctgg tatatagagg gaaggaggaa tcgagagttt 1800 cgtaattgga atgaacttgt agtcgaattg aagttagaat atcaaccacc cgatatggac 1860 taccattacg aacagcaagc ggccgccaga aggcagaagc gaggagaaaa attccaggat 1920 tattattatg cggtaaagga catttttgag cagatggctt ctccgccaac ggatcaacgg 1980 aaatttgaaa ttattttcag aaatctccgt tcggactata aaaactcatt gttggtaaaa 2040 ggagtccgaa cgttgagaat gttgaaggtg tgggggcgta aattggactc agccaactgg 2100 ttcttgtatc gcaagcagga gggtgaacaa gggtcgaaat cagcacaagt gcatgaaatt 2160 accacagcgt caccagcccg tgaacaaagg cctccgagga acaattggaa ggaatcgggt 2220 ttcaaaagta gacaatcgtg gaggaatccg aactctggtg gatatcaagg tcctagaaat 2280 gatcctcgag ggttcagaaa accttcaccg gaaagcagca aatcagacca acagacgaaa 2340 cccaaacaaa tgcagtcaaa tgaagatccg aaagagggga gtagcaaaga acttctgcag 2400 aagagagtgg atgcgtatcg cattccagag aggacagtgt gctacaattg tcgtggaaat 2460 tatcaccatt caagcgcctg tttgaaagct aaggaagttt tttgcgtagc ttgcggattt 2520 caaggatttc tcaaaccgga ttgcccattt tgtggaaaaa acgggaagcg gtcgatgtaa 2580 gaggtcgtcg acgcaaacct aaaaagcctc taacgaaatt tgtttcttct tctcgagaaa 2640 tagaagagtt gattgtgcag gttgacggag ataatcgtcc ttttgcaaaa gttgatatct 2700 tgggattttc agtgatcggc ttgttagatt gcggagcaca gatgacggtt ttaggagttg 2760 gttgcgataa acttctacgt gacctaaaat tgaagctgct gccaactgat ttgaagctga 2820 cgactgctga gggatctcgt ttaaacgtta aaggatacgt caatctacct ataacattca 2880 atggcaaaac tcgactggta acagccatcg ttgcgcccac cctgaaccga cgccttattt 2940 taggaatgaa tttctggaat gttttcagaa tagaaccatc aataaatcag gaagaatgtg 3000 ctgtagagga gataagatgt gagctggatg aggaactaaa atttactgag gaagaagaac 3060 agatgttaga ggatgtaaag aaggagttta aggttttcaa agaaggagat cagttggagg 3120 taacgcccct gatatcacac aaaattgaat ttgaggactc gttcaaaaat gctggtccaa 3180 tacgtttgaa cccgtaccct tggtctccag aaatccagaa atgcgtaaat gacgagttgg 3240 ataaatggtt ggcgtctgga gtagtggaac gatcaaacag tgattgggct ctgcttatag 3300 ttccagtgac caaaaagggt gaatccggag ctgagggacg cataaaggta cgcatgtgct 3360 tagatgctag gaaattaaat gaaagaaccc gtagagatgc atatcctctt ccgcatcagg 3420 accgtatact aggaaggctg agtgcttcta aatatttgtc aacgatagat ttatcaaagg 3480 cgttttggca aattccgtta caaccagaat cccggaagta tacggcattc cgagtgtttg 3540 gaagaggact tttccagttc actagactac cgtttggatt ggtcaacagt ccggcgactt 3600 tatagcgttt aatggaccaa gtgttgggtt acggtgaact ggagccaaac gtgtttgtgt 3660 accttgacga catcgtcata gtaagcaaca cgttggagga acatctccgg aaccttaaag 3720 aggttgcaaa acgtcttaaa gcagcaaatc tttcaatcaa cattgaaaaa tctaaatttt 3780 gcgtgcaaga gcttccatac ttaggatttg tattatccaa aaatggcatt cgacctaatc 3840 ctgataagat tgaagccatt gtgaacttcg agcgacccac ttcagtaagg tccctaagac 3900 gttttctggg catggtaaat tactaccgac gatttatatc ggattttagt gaagtgacgg 3960 ctcctttaac taatttattg aagggaaaac cgaaaattgt tcactggaat gatgaagctg 4020 aaaatgcatt cattattttg aaagaaaaac tgatcacagc tccaatctta gcctgtccgg 4080 atttcggaaa gccatttaca attcagaccg atgcgagtga caccgctata gcgggagttc 4140 tcacacagga tgtggatggg aaggaacacg taattgcgta tttttcccgg aagcttacga 4200 cctcacagcg ttcatggaaa gcggcagaga aggaaggagt ggctgctcta gaggcgattg 4260 aaaaatttat gtagagggag ctcggtttac cttgattacg gattcgccag cgctttcatt 4320 cataatgaac tctaaatgga aaccgtcatc caaacttagc cgctggagca tgttacttca 4380 gcaatatgat atggtcgtca aacacaggaa aggatccgaa aacgtagttc cagatgcact 4440 ttcacgtgca gtggaaacta tagatttgag caaagaagat tggtacaacc ggatgtacaa 4500 aaaagtctca gatactcctg ataattttcc agacttcaaa attgaacaag ataaactgta 4560 taaacttgtc tcttcgccca cggatgcaat ggattaccaa tttgaatgga agctttgttt 4620 accggaaaat ttacgcctcg aggtaataaa gcaggaacat gatcaatcat tgcatccagg 4680 gtatgaaaaa actattcata ggctgaaaat acggtattac tggcctaaaa tggccatgca 4740 ggtcaagaaa cacgtgcgtt catgcagtgt atgcagacaa tgcaaaccct cgactttgcc 4800 aacagcgcct gtcatgggaa accaaagagt gagcagcaaa ccgtttcaga ttttggcact 4860 cgactttatt caaaatcttc cccgcagtcg caatgggaag agtcatttga tggtcctaat 4920 ggatttattc tcaaaatgga ctcttctgct accgattagg aggattgaga gcaaagaggt 4980 gtgtcgtcta gtagaggatt gctggttcag gagatttgga accccagagg tcgttatttc 5040 cgataacgct accactttca ccggaaagga atttcaggag ttgttggttc gcagagggat 5100 acaacactgg cctaactcgc gtcatcacag tcaggccaat cctgttgaaa ggataaatcg 5160 cactataaat gcatgcttac ggacttatat ggagaaggat cagcgattat gggacaccag 5220 aatagctgag gtcgaggaat tgctgaacac taccatacat agctccactg gtctatctcc 5280 ctatcggatt atatacggac atgaaaaaat cactatgggc aatgagcatc gattagagcg 5340 agatacaaaa gacatctctt tagcaaaaag agacgaaata cgtcagaaat tgaatgaaaa 5400 tatattcaaa attgttgctg acaatttgaa gaaaagtcac gaaaaggtcc gaaaaactta 5460 taacctacga cataggaaat tttgtccgac ttaccatgta ggacagaagg tctacaagcg 5520 taattttaga caatcgtcgg ccgcagagcg gtataatgcg aagtacgggc cgctgtatat 5580 gccatgtgtc attgtggcga gacgtggaag cagctcctac gaggtggctg atagctcggg 5640 taaaatccta ggggttttct ctgctgcaga ccttcgtccc ggggatgatg aaaagcccta 5700 aaacacaaac tagggttaat accaacagca caatattgta tcggcatcgt tgtacaaagc 5760 ctgtagttgc cgcttatgga tcctcatctg gtattcataa acataacggg gtcatcgttc 5820 gacctagggc tgctgtgcac tcttgtgcgt agagatcgtt catgctcatg gttgtgctct 5880 cgcgtatcca catttggagc gagagatggg taactcatga gtgcagggag tttaagagcc 5940 aatatgaaag tatctggttt ctgcgatcgt ccagttgttc gatcgataat tgctagcaaa 6000 aagagaaagc gagttggcat cagtatgatt gatatagggt gacgtcacat aggagttagt 6060 attagggatc attggggaag aaaggagaga tgttcttttg tttaggcgcc gtacaaaaga 6120 accgtgctag cacgttgagg aggaatgctt tcctaaatat aagggagaga gataataatt 6180 caagataaac gtttaattgc atgtaatgaa tagtaaatta taaataacct gtttgtaata 6240 agtattaaat taaaagaaaa acgatcgttg aaacagtcaa acagagcact taccctaaac 6300 aatagaagag tcagtttagc gtccaataga catgataggc tcgttctcag cataattccc 6360 tattagtcgg gtggccaacc ggctgaatca tgctgggaat aatccatatg tttccaatgt 6420 ttgtccatga tcagttagtt cgtccaatag attcgccact attttaatat atcttgtcca 6480 gtttgattct tttgtttaat tgtcccaatt gtagatttcc aattttagtg tccgtccaga 6540 tagaatgtgt aataataact attcaaccat ctccttttca ttagctaccg ttggtagtat 6600 tggataaaag agagtaatgg tagacttatc gtttaaccat aatgggtgcg tgcttatcaa 6660 cacacatact aacaatataa aatcacacgt gattgattat ggcaagttcc ttaggggaac 6720 ttggccatta gttgtcagcc gattttaaaa tgttgtgtca tgtctcaaaa aaacttttaa 6780 tcaattttat actgcttcgc gattcgaaaa aaataagaaa attattatag catttgtctt 6840 tgtggtttca atttgttaat aagcgtttta tgtaggtaaa aaaaaatata caaataattg 6900 aagatcttca attatttacc gtggtgggag gatag 6935 // ID CR1-125_AAe repbase; DNA; INV; 4611 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-125_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4611 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1213-1213 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 217..1062 FT /product="CR1-125_AAe_1p" FT /translation="MAMEKKHCKECKLEVNDIEPVRCGFCDAFFHISQQCC FT GFNHRANRDILSQGKAMFICNDCRSELNGRSIKRYLQDQLDSQNSTHADGD FT ASDNLTSQVQLLADAVGKLSKKVDVLSMGQMSGSKTNLLLTPTFRKWPKLG FT VKRPRMETEQYESTALNSDRGTRNIDLGDLSIDTIMPVPTPPKFWLYLSGF FT QPLISTDDVQKIVTRCMDLSSPCDVVRLVPKGKDVSNMSFVSFKIGFDPSV FT KEQALQASTWLNGLTFREFVEQPKNYRRTVANPMDINQTPV" FT CDS 1164..4529 FT /product="CR1-125_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEALEPPTTVEPFQPPFSSRPGPVCGSGVKGFQIDIA FT GKYDFSMISMTSDPFIDSSLSASASSRSTIQTSPLGRNDVSSMEVPKPPTA FT VEFFLPAFSSRPGPACGRGVRDFQPDLEGKFNTVMQSTILDSSIASSQFPT FT EPVRFCSDDVDHDMTTEPMLQICNDDEQGTGVSIYYQNVRGLRTKITDFYT FT SVTASLYDVVVLTETWLDEVIPSCMLFGNDYVVYRGDRTPLTSTKKRSGGT FT LIAVKRTIPSNLIPINDEGLEHVWVSLRMKRSSLAIGAAYIPPDKASDVQL FT TNRHITSMESVIPKHDAVAVFGDYNRPGLRWIRNVNHTCSVDVTSSSLTQS FT NAALLDGMSSNNMWQLNYIKNCYGNTLDLLFVSEELLNGLTIEEVDDPLVP FT IDICHKPFSCTIENCEPSQNTFEDEFDINSLNFRKTDFIGLSQHLTLVDWA FT VVIECPTINEAVERFTTILQNAFAIFVPRRRSPKKPAWSNTELRRWKRRCS FT TAKRLYRKNRTAANKLAFQLASRQYRSLNCRLYKEHTASTEQNLKHNPKCF FT WNFVKSKRKETGLPSSMYLDQKRANTSLDKCELFASFFSSMFDSNSGSTDA FT AMELTPAGCVEVATFEVSIEAMQKATKKLKSSYLPGPDGIASCILKKCSQE FT LLEPLVHLFNLSLRVCTFPTRWKASFMFPVHKKDSKNDVRCYRGITSLCAC FT SKLFEIIVSDYMYCQTKHYISTSQHGFYPGRSVSTNLSEFVSFCLRNIEQG FT RQVDAIYTDLKAAFDKVDHNILLAKLGKLGCSPSFVEWLRSYLCDRKLAVK FT IGSSISRWFVSSSGVPQGSNLGPLLFSLFMNDAIIHIGDGFCLVYADDLKL FT YVLINGKEDCHKLQTLLNKFVSWCRVNRMQLSISKCFVVSFHRKMSHVSFD FT YTIDGELLNRTQIIRDLGVCLDSTLSFRNHHEEIIDKARRQLGFMCKLSKE FT FRDPYTLKSLYVSLVRPILETASIVWDPYHSTVASRIESIQKRFIRFALRN FT LPWSDPLNLPPYEGRCLLIGIETLEQRRKRAKALFVAKLIAAEIDAPNLLQ FT MISFNTPNYAIRRPEFLRLPFRRRDYASQEPVRSMMECFNEVVHLFDFNIT FT TKKFKTRLLRYVYN" XX SQ Sequence 4611 BP; 1282 A; 1002 C; 1022 G; 1305 T; 0 other; cactgtgtat ttgttgtaaa cagtcacgta tcttagtcgc cggctatttt gatgtttttc 60 gtcgtgaaaa cgtagtttaa attttgttca atttatcaca tttgtcgttt cgtgaaaagg 120 tgaaccattt gtgtgtattg gacaaccaaa aatcgaccgt gaaattgctg aaaacacgga 180 attaatttca ctcgatctcc cgactaacca agagcaatgg cgatggaaaa gaaacattgc 240 aaagaatgca aactggaggt gaatgatatt gaacccgtac ggtgtggctt ctgtgatgcc 300 ttttttcaca ttagtcagca gtgttgtgga ttcaatcatc gagccaatcg agatatactg 360 tcgcaaggaa aggcgatgtt catttgcaat gactgccgat ctgaactgaa cggtcggagt 420 ataaaacgct acctacagga ccaattggac tctcaaaact caactcatgc tgatggtgat 480 gcttcggata atctgacatc ccaagttcag ttgctcgctg atgccgttgg taaattgagt 540 aagaaggttg acgtgctgtc gatgggtcaa atgagtggaa gcaaaacaaa cctgctgctt 600 acacctactt ttcgaaaatg gcctaaattg ggtgtgaaac gtcctcgcat ggaaaccgag 660 caatatgaat ccactgcttt gaattccgat cgtggcacta ggaacattga tcttggcgat 720 ctgtccatcg acactattat gcccgtacca actccaccta agttctggtt atacctgtct 780 ggtttccaac cactgatctc gaccgatgat gtacagaaga ttgtgacacg ctgtatggat 840 ctttcttcgc cgtgtgatgt cgtacgttta gtgccgaaag gaaaagatgt ttcaaacatg 900 tcgtttgttt ccttcaaaat tggctttgat ccgtcggtga aagagcaagc actgcaggcg 960 tcaacttggt tgaatggact gacatttcgg gagttcgtgg aacaaccaaa aaactacagg 1020 cgtacggtgg ccaatcccat ggacatcaat cagacacctg tttaacgtgt tgcatttcga 1080 actatgaaga ccgtggaatc gccattctcc cttatccgtc atcgcaaatc aagctaccac 1140 tgggacgcat cgatgttggt cttatggaag ccctggaacc ccccaccaca gtcgagccat 1200 tccagccacc gttcagcagt cgtcccggtc ctgtgtgtgg gagcggtgta aagggcttcc 1260 aaatcgatat tgcaggcaag tacgattttt caatgatctc gatgacttct gatccgttca 1320 tcgattccag cctatctgca tcagcttcat ctcgttcaac aattcaaact tcaccactgg 1380 gacgcaacga tgttagctct atggaagttc cgaagccccc caccgcagtc gagtttttcc 1440 tgccagcgtt cagcagtcgt cccggccctg cgtgtgggcg tggtgtaagg gacttccaac 1500 ccgatctcga aggcaagttc aataccgtca tgcaatcaac gattcttgat tcgtccattg 1560 cttctagcca atttccgact gaaccagtcc ggttctgtag tgacgatgtt gaccacgata 1620 tgactactga accgatgctt caaatctgca acgacgacga acaaggcact ggggtctcga 1680 tttattatca aaacgtcaga ggactacgga caaaaatcac ggacttctac acttctgtta 1740 ctgcttcgct ctacgatgtg gtcgttttga ctgaaacctg gttggatgaa gtgattccat 1800 catgcatgct gtttggtaat gattacgtcg tttatcgtgg tgatcgtaca ccgctaacaa 1860 gcaccaagaa acgttccggc ggtactctta tcgcagtgaa gcgcacaatc ccgtccaatt 1920 tgatcccgat taacgacgag ggacttgagc atgtgtgggt gtccctgaga atgaagagaa 1980 gtagtcttgc tattggagca gcatatattc ctccggataa agcgtcggat gtgcaattga 2040 cgaatcgcca tattacaagc atggaatctg ttatccctaa acatgatgca gttgctgtat 2100 ttggtgatta caatcgacca ggtcttcgat ggataagaaa cgtaaatcac acatgttcag 2160 tggacgtcac ttcatcttct ctgacgcagt ccaatgcggc attgctagac ggaatgagct 2220 caaacaatat gtggcaattg aactacataa aaaactgcta cggaaacacg ttggatttgt 2280 tattcgttag tgaagaattg ttgaacggtt tgactattga ggaagtcgac gatcctctgg 2340 ttccaattga catttgccac aagccttttt cttgtaccat cgaaaactgc gaaccctcgc 2400 aaaatacctt cgaggacgag tttgacatca attcgttgaa tttccgaaaa accgacttca 2460 ttgggttgag ccaacacttg acgctagtgg actgggcggt tgtcatagaa tgtccgacta 2520 taaatgaagc tgttgaaaga ttcacgacta tccttcagaa cgcttttgct attttcgttc 2580 cacgtcgacg ttctccgaaa aagccagctt ggtcgaacac tgagctgcgt cgttggaaaa 2640 ggcgttgcag cacagcaaaa agattgtatc gcaaaaatcg cactgcagca aacaagttgg 2700 catttcagtt ggcgagtcgc caataccgca gtttgaattg cagattgtac aaagaacaca 2760 ctgcctccac agaacagaat ttaaaacaca atccgaaatg tttttggaac ttcgtaaaat 2820 ccaaacgcaa agaaactggt ttgccgtctt caatgtatct ggatcaaaaa cgtgccaata 2880 cttcgttgga caaatgcgaa ttgttcgctt ctttttttag ttccatgttt gattcgaact 2940 ctggaagcac tgatgctgcc atggaattga ctcctgctgg atgtgtagaa gtagctacat 3000 tcgaagtctc aattgaagca atgcagaaag cgacgaagaa gttgaaatca tcatatcttc 3060 ctggacctga cggaattgcc tcgtgtatcc tgaaaaaatg cagtcaagaa ttactggagc 3120 ctctagttca tcttttcaat ctttcactga gagtctgtac cttccctaca cgctggaaag 3180 catcgtttat gtttccggtg cataaaaagg acagtaaaaa tgatgtacgg tgttatcgtg 3240 gtattacgtc gctttgcgcg tgctcaaagt tatttgagat tattgtgtcg gattacatgt 3300 attgccagac gaaacattat atctctacga gccagcatgg attttatcca ggaagaagtg 3360 tatcaacaaa tttgagtgaa ttcgtctcgt tttgtctacg gaatatcgaa caaggccggc 3420 aggtagatgc aatttacacg gatctgaaag cagcgtttga caaggttgat cacaacatat 3480 tgcttgcaaa gttgggcaag cttggatgtt cgcctagttt tgttgaatgg cttcgttcgt 3540 acctctgcga taggaagctg gctgtaaaaa taggatcttc tatttcacga tggtttgtga 3600 gcagttctgg tgttcctcaa ggaagcaact tgggcccgct tttgttctcg ctgtttatga 3660 acgacgctat tatacatatt ggagacggct tctgcctggt atatgcagat gatttgaagc 3720 tttacgttct gataaatggt aaagaagact gtcataagct ccaaactctc cttaacaagt 3780 tcgtatcgtg gtgtcgtgtc aacaggatgc agcttagtat ttccaagtgc ttcgttgtta 3840 gcttccatcg aaaaatgagt catgtgtcgt ttgactacac tatcgacgga gagttgttaa 3900 acagaactca aattattcga gaccttggcg tgtgtctgga ttccactctt tcattcagga 3960 atcaccatga agagatcatc gacaaagcac gtcgccaatt agggtttatg tgcaaattga 4020 gcaaagagtt tagagacccg tacacgctca aatcgttgta cgttagtctt gttcgaccaa 4080 tccttgaaac cgcttcaatt gtttgggatc cctaccacag cactgtagct tcccgtatag 4140 aatccattca aaaacggttc atcagatttg cactacgaaa cttgccatgg agtgatccac 4200 taaatctacc accgtacgag ggtcgctgtc ttctgatagg aattgaaaca ttggagcaaa 4260 gaagaaaacg ggctaaagct ctgtttgtag ctaaactaat agctgcggag attgatgcgc 4320 cgaacttgct tcaaatgatt agcttcaata ctccaaacta cgccattcgc cgtcccgagt 4380 ttcttcgatt gccattcaga cgacgagact acgcttccca agaaccagtg cgatcaatga 4440 tggagtgttt taacgaagtt gttcatctgt tcgattttaa tattacgacc aagaaattca 4500 agacaaggct tttgcgatat gtttataact agttctaaga ttttattcat taagactcaa 4560 aagtcggatg aaatatacaa ataataataa taataataat aataataata a 4611 // ID Gypsy-624_AA-LTR repbase; DNA; INV; 283 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-624_AA_; KW Ty3_gypsy_Ele75; Gypsy-624_AA-I; Gypsy-624_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-283 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 283 BP; 63 A; 62 C; 69 G; 89 T; 0 other; tgttggggac aactgaaatg cggatagccc ctacctaccg ccgccgctgg actcgcgggt 60 gactgtgtta tctgttaccc tttgatgtcg acacagcgca tacatgtatg aaagtgaatg 120 ttatgtatgt tgtccaaatg taaataaagt tagtttgtta ttgtttgttt taccgctaac 180 aacgcgtttt aattcgtcgt ccgaattagt agttttaatt tgttttctcg cgctcaagtc 240 cgcgatatgc gagttcggcc accacgtacg ggcgcgggat cca 283 // ID Gypsy-141_AA-I repbase; DNA; INV; 4735 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-141_AA_; KW Gypsy-141_AA-LTR; Gypsy-141_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4735 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1015-1015 (2011). XX DR [2] (Consensus) XX CC Positions [3731-4198] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 497..1744 FT /product="Gypsy-141_AA-I_1p" FT /translation="MESDMRAIPPFCCEHIERSRLAREWKSWKGTLECYFE FT AHNVQDQKIMRAKMLYLGGPQLQRVFANLPDTEKFPLVSIEKRWYDLAVEK FT LDDFFKPVRQDTLERHRLRDMKQMKDERFAQFVLRLRQQVAECGFEKYSSE FT IAKVARFYFNLLNLISFNLCCNSKVLTEITLVDVIAQGCLSNELRSRILKE FT DQTLTQIEALGAMIESVEEQVKGLSNLQSSGENVYRIEERKIGGRNFESSV FT EKFNRTGNNQNTRDRAACFSCGRHGHFSNSPHCPARNQECRNCKMRGHFET FT VCRKGRKRAAYENESKPVKKIRLIESTETPLENQAKETTEADKTYYAFYSG FT NKSNMMTCCIGGVNWDMLVDSGADCNLITAEAWNKFKEVGIKVHSSTKGCT FT RKLKAYGSENLLNVSGTFVADI" FT CDS 2600..4516 FT /product="Gypsy-141_AA-I_2p" FT /translation="MFKVKELEFLGHNVTADGIHPSKVKTDAILSFRDPKN FT EAEIRSFLGLANYMNKFIPNLATIDEPLRKLTQKGTFFEWKPQHAEAFEEI FT KKIMSRVGQLGYYDVNDRTAVTADASPVGLGAILAQYDSQGQVRIISYASK FT SLTDTESRYCQTEKEALALVWAVERFQVYLLGKPFDLVTDCKALQFLFTPR FT SKPCARIERWVLRLQAFEYNVVHISGENNAADALSRLAVLDPKPFDIHEEL FT LIREIATSAANSMALNWEEIEKESIEDEEIQNVQELIGTDQQLELPLAYRV FT IAKELCQVGNVLLRTDRLVIPGRLRNRVLSLAHEGHPGSRMMKTHLRTHVW FT WPKLDQDVDRFVKSCRGCCLVSAPEAPEPMVRKEMPSAPWEDVAVDFLGPL FT PDGLYLLVVVDYYSRYVEVKEMISINVQAVIEELSVIFGRYGLPVSLRADN FT GPQLSENCNELKTFCLENGIKLVNTIPYWPQQNGEVERQNRTILKRLRIAQ FT ELGKSWRKELSQYLLVYHSTNHPTTGKSPSELMFGRRIRSKLPQVPVYHSN FT DEEVRDMDRLQKEKGREYSDKKRKANHSKIEVGDEVLIKRMKKNNKLDTDY FT INEEFVVVRKQEADCTLKSIKTGREYRRNVAHLKKIEKNYK" XX SQ Sequence 4735 BP; 1547 A; 751 C; 1110 G; 1325 T; 2 other; atctggcgac gaggtgaaag cggaaggtat tttatttatt tttattacat taacattacc 60 ggtttatttg ataaattcaa aatgtcgttt tcgacgaaac agtgaaaaac ggaggttctg 120 ccatttgtgg tagcgattaa aatttcaagt tagaaaatgg agtttctacc tttttggtag 180 cgatcggatt atgcattcaa agtggagttc ctactattta agtagcgttc ggatttagga 240 tccatagaat ggagtttcta ccatttcggt agcgatcgat ttagagctcg aattaagaac 300 ggagattcca ccttgatggt ggcgataaaa tcttatatac tcggagtatc tgccttttcg 360 gtagcgaaaa agacgtaaac tgttaaaaaa aaactatggc gacggcagac gattcagttg 420 aatttattta aaggatttgg gttgtctgaa accagtttgg gataattgtt tatcatgtgt 480 ttttaaaata ttacagatgg aatctgatat gcgagctata ccaccgtttt gttgcgaaca 540 catcgaacgt tctcgattgg ctagagagtg gaaatcttgg aaagggacat tggagtgtta 600 tttcgaggct cataatgtac aagatcaaaa gattatgaga gccaaaatgc tgtatcttgg 660 tggtccgcaa ctgcaacgag tgtttgccaa tctaccagat actgagaaat ttcctttggt 720 atctattgag aaacgttggt atgaccttgc ggtagaaaaa ctggatgatt ttttcaaacc 780 tgtgcgacaa gatactctag agagacatcg tttacgagac atgaaacaga tgaaggatga 840 acggtttgcg cagtttgttc tccgtttacg ccaacaggtg gctgaatgcg ggtttgaaaa 900 atattcctct gaaatcgcaa aggtagcacg tttttatttt aacttattaa acttaatttc 960 atttaatctt tgttgtaatt ccaaggtgct aactgaaatt acgctagtag atgttattgc 1020 acaaggttgt ctttcaaacg aattacgaag tcgcattttg aaggaagatc aaacgctgac 1080 gcagatagaa gctttgggag ctatgataga aagtgtggaa gaacaagtga aaggtctgtc 1140 caacttacaa tcatccggcg agaatgttta tcgtattgaa gaacgtaaga tcggtggaag 1200 aaattttgaa agttctgttg aaaagtttaa tcgtaccggt aacaatcaaa atacgcgaga 1260 tcgtgctgca tgttttagtt gtggaaggca tggtcacttc tccaattctc cacattgtcc 1320 tgcgcgtaac caggaatgca gaaattgcaa aatgcggggt cattttgaga cggtgtgccg 1380 aaaaggaagg aagcgtgctg catatgagaa tgaatccaag ccagtgaaga aaatacgact 1440 aattgaatct acagagacgc cattggaaaa tcaagcaaaa gagacaactg aggcagataa 1500 aacttattat gcattttact ctggtaacaa gtcaaacatg atgacctgtt gtattggcgg 1560 agtcaactgg gatatgttgg tggattcggg agctgattgc aatttaataa cagccgaagc 1620 atggaataag tttaaagaag ttggtatcaa ggttcattct tcaacgaaag gttgcacgcg 1680 gaaactgaag gcttatggaa gtgagaattt gttgaatgtt tccggtacat ttgttgctga 1740 cattsaagta ggtcacaaaa gtgtagaagc tgaatttttc gtggtaactg gwggccaaca 1800 atgtttgttg ggagatgaaa cctctaagca gttgggtata ctcaaggtcg gattagatgt 1860 gaataaagta acggaagaag tgaagccatt ttcaaaaatt tctggagttc aaatcaaaat 1920 tcacactgat ccggaggtca aaccagtttt ccaaccactg agaagggtcc cgataccact 1980 agaatctgct gtaaagacaa aattggaaca attgcttgca agagacataa ttgaagtcaa 2040 gactggacca acgagctggg tatctccact ggtagttgtg gggaaagcaa acggcgatgt 2100 taggctgtgc ttggatctgc gtcgagttaa tgaagcggta ttgagggagc gtcatccgat 2160 gccaatagta gacgagtatt tggctcgtct gggaaaagat atgatacgca gcaaattaga 2220 catccgtgaa gcatttttgc aagtagaact tgaacctgat tcaagagata ttacaacctt 2280 cataacaagt caagggctat tcagattcaa gcgattaccg tttggtttag ttactgcccc 2340 tgaagctttt caacgaacca tggatgaaat actcactggc tgtgaaggaa cgtattggta 2400 tttggacgat gttatcattg aaggatccac tgaggaagag catgatcgtc gcgtaaataa 2460 ggtagtttaa tgtttagtac tatttggttt ttttttttgt tttaataatc gaaaataata 2520 attaaataaa tcaatttgtc gtgtttaggt tcttaaccgt ttgaagggac gtaatgtgga 2580 attgaattgg gataaatgca tgtttaaggt caaagaattg gaatttctcg gacataatgt 2640 cacggcggat ggtatacatc cttcaaaagt taagacagac gctattcttt cgtttcgcga 2700 tccaaagaat gaagctgaga tacggagctt tctgggtctc gctaattaca tgaacaagtt 2760 tattcctaat ctagcgacta tcgatgaacc tcttcggaaa ttaactcaga aaggaacctt 2820 tttcgagtgg aaaccgcaac acgcagaagc ttttgaagaa attaagaaga ttatgagtag 2880 agttggacag ctggggtatt acgacgttaa cgatcgcact gctgtaacgg cggatgctag 2940 tccagttggt ctaggagcca tcctcgctca atatgatagc caagggcaag tacgtataat 3000 cagttatgct tcaaaatcat tgactgatac ggaatcacga tactgtcaga cagaaaaaga 3060 agcacttgct cttgtctggg cagttgaaag gtttcaagtg tacctattgg gaaaaccatt 3120 tgatctagtg acagattgca aagcattgca gtttctgttt actcctaggt ctaagccatg 3180 cgctcggata gaaagatggg tactccgttt acaagcgttt gagtataatg tggttcatat 3240 ttctggcgaa aataatgcag ctgatgcttt gtccagatta gccgtgttgg accccaagcc 3300 tttcgatatt catgaagagt tgttaattcg agaaatagcg acatccgcag ctaattcgat 3360 ggcgttaaat tgggaagaaa tcgaaaagga gtctatagaa gatgaagaaa ttcaaaacgt 3420 acaagaattg atcggaacag atcagcaatt ggaacttccg ttggcataca gagttatagc 3480 caaagagttg tgtcaagttg gaaatgtact acttcgaaca gatcgtctag ttattcctgg 3540 aaggcttaga aatcgggttc tgtcattggc gcatgaaggg catccgggta gcagaatgat 3600 gaaaacgcat ttacgaacac atgtatggtg gccgaaactt gatcaagatg tagatagatt 3660 tgtgaaaagc tgtcgaggtt gttgtttggt atctgcacct gaagctccgg aaccaatggt 3720 aagaaaagag atgccttcag cgccttggga agatgttgca gttgatttcc ttggtccatt 3780 gccagatggt ttatatttat tagtagtagt tgactattat agtcgctatg ttgaagtaaa 3840 ggagatgata tctatcaatg tccaagctgt gattgaagaa ttaagcgtca tttttggtcg 3900 ttatgggctt ccagtatctc tcagagccga taatggaccc cagttgagcg agaactgcaa 3960 tgaattgaaa acattttgtt tggagaatgg aattaaattg gtcaatacca taccatattg 4020 gcctcaacaa aatggagaag tggaacggca aaatagaaca atacttaaaa gattgcggat 4080 tgctcaggaa ttaggaaaaa gttggaggaa ggaattgagt cagtatttgc ttgtttatca 4140 ttcaactaat catcccacga ctggaaaatc accatccgag ttgatgtttg gaagacgtat 4200 ccgaagtaag ttaccacaag ttccagtata ccactcaaat gatgaagaag ttagagatat 4260 ggatcgatta caaaaggaaa aaggaagaga atacagtgat aagaaacgga aagcaaatca 4320 cagtaagatt gaagttggcg atgaagtact gataaaaaga atgaagaaaa ataataaatt 4380 ggatacagac tatatcaacg aggaatttgt tgtggtgcgg aaacaggaag cagactgcac 4440 attgaaatcc atcaaaaccg gaagagaata taggcgtaat gttgctcatc ttaagaaaat 4500 agaaaaaaac tacaagtaat gagggatctg tgtcacagta tggcaatgaa gaaggcgaaa 4560 gtaatgtaaa taataacgaa gcactaacag aacagaatag aacaaacaga acacgcaagg 4620 agccacatca tttcaaagac tatgtaccat attaacataa taattatcta aaccgtaatt 4680 tttaaatata atgaattctg aacgtaacat aatttctata aaaacaagga aggat 4735 // ID Tx1-2_AAe repbase; DNA; INV; 4945 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Aedes aegypti. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4945 RA Kojima K.K. and Jurka J.; RT "Tx1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1458-1458 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. It is positioned at the deepest branch of the Tx1 CC clade, and does not show sequence specificity. XX FH Key Location/Qualifiers FT CDS 124..1185 FT /product="Tx1-2_AAe_1p" FT /translation="MDGLVKNTLMFRFPDGAPGPSVVEIARFVKSFDADKF FT TMESSYKISEERCICIKFMNERAMKDALMQNPENHVFQYSSGETVEVKMSV FT AGGCSKYIRIFDLPPEVPDQEIGNVLGKYGVVRRMVREKFPAQLELDLTTG FT VRGVYIEIKKEIPATLFFLNRRGRIYYEGVKHKCFLCKQEGHLKADCPRNT FT SNNKRNTEVVENAEVGRPDSAGSSSVSSPPQPPSFAAMLKSKPPVVGTEGS FT KMTLLVPAAKMVAEPVCNSEEGESSHVVKAPATQNADKTIGATVDTDDSDG FT LPNMEVDCSATKRQHEGSSTEDEGNTRASRSRKQKKDSDPLKIIESEPIQS FT KGKDRRARSKN" FT CDS 1548..4802 FT /product="Tx1-2_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNVIRKVASININAIRVDVKKSLLKEFILTNDLDIVL FT LQEVAFDDFSFLPSHLAVVNRDGKCSGTALLVRKSIEVHDCMKSPCGRIIS FT MIIDGVNFVNIYAHAGSQFKAERDDLFEVQTITHMSKVAATDTIVGGDFNC FT ILENDDTRGNFKNVCRGLKTLTNRFNLMDVELKIKGHKKQFTFFRNDAASR FT IDRFYSTNNLFKKISKFETNAIPFSDHHAIQFCYQIEPNLRSPVVGRGYWK FT IKNFLLQDADVTEEFDNVYNSLKSRNKYNNNFAEWWSFDVKNKIRQFYKSK FT FFEFSNRNRTQKDLCYAKLKQLEIKQMNGQNVSDELAKTKSDIINIETNRL FT KSISSNFKPNTMLENEKMGIYQVSKIVKMQSNPNVLQLKNDKGVLCDQLET FT KQIVYDHYKDLFAGRHDNVAADCMLNSLTKVLSANSKQDLIEPITENELLD FT TIKRCTLKKAPGPDGITYEFYLVHYTTLKTDLLKLFNGIFNGSVKPLESFS FT DGIVTLIPKIGNKNDINNYRPISLLNTDYKLLCKIIANRLKKCLEELIDEG FT QTAGIKNKSCTDNLDIIRTLVIKAQQSKKFKFALLSLDLEKAFDKVCHERL FT WEIMLKFDIPQQVVNCIKQLYKNAFSKVLVNGFLTQSFKIEKSVRQGCPLS FT MLLFTLYIEPLIRHLNKSLSGILILNRFVRVLAFADDLTLIIRSDQEIDAV FT MGILENFSKFAGIKLNLRKSGFLRFNNCAIGPQKITEKDELKILGIFIGTN FT YRKVVETNYRKVIQAINATIQAHMSRRLNLLEKVVVLNTYILSKLWYVAQI FT LPASNIHIAQIRKITGSFLWGYHRIFKIERSQLYLHYFKGGLNLIDVDTQC FT KSLFVRSLLFKGATAISHYLIRTHNLKGLSLNTQRLIQKAKEIKDMTHLVT FT NKQIYEYMLTKLNIEIKVEKKYPQIHWEVVWENINQNFLSLSARSTLYEIF FT NDIIPNKIKMFNNLANIDSFSCDVCGKPDNNVHRFMNCIKTADIWNWVSDI FT IKNRLKIKITSPHLLLYRGIGKHNCRLKAALWLVCEAIVYNVKNHKNPSLY FT MFKKDIRDYRWNNRILFCKHFKNYLNIC" XX SQ Sequence 4945 BP; 1688 A; 823 C; 999 G; 1434 T; 1 other; cagttgacga tcagatctcg ttcagtatag acgctttcta ttagatcgga ggctttgcac 60 aaaagagctk gttaacagtg gttgtatatt gtgttggaaa caaagtgaaa caatagctag 120 acaatggacg gtctcgtgaa gaatacgctg atgttcagat ttccggatgg tgctcccggc 180 cctagcgtcg tggaaattgc gcgtttcgtc aaatcgttcg atgcggataa gtttacaatg 240 gagagcagct acaagatttc cgaagaacgg tgcatttgca tcaaattcat gaacgaacga 300 gcaatgaaag acgctctcat gcaaaatcca gagaatcacg tgttccagta ctcaagtggt 360 gaaacggtcg aagtcaaaat gtcggtcgcc ggaggatgta gcaaatacat ccggattttt 420 gatctgccgc cagaagttcc cgaccaggag attggtaatg tgctcggaaa gtatggagtt 480 gtacgtcgga tggtgcgaga aaagtttcca gctcagctgg aactggacct aaccacggga 540 gtccgcggtg tgtacatcga gatcaagaaa gaaatcccgg caactctatt ctttctcaac 600 cggcgaggaa gaatctacta cgaaggtgtg aagcataagt gtttcttgtg caaacaggag 660 ggccatctga aggcagactg ccctcggaat acatcgaaca acaaacggaa cacggaagtg 720 gtcgaaaacg cagaagttgg ccggcccgat tctgctgggt cctcttctgt gtcttcaccg 780 ccacaaccac ctagtttcgc tgcgatgctg aagagtaagc caccagtcgt tggaactgag 840 ggaagtaaga tgacgctgct agttccagct gccaaaatgg tggctgaacc cgtgtgcaac 900 agtgaggaag gcgaatcgag ccacgtggtt aaggctcctg caacgcagaa tgctgacaaa 960 acgattggcg caacggtcga cacggacgat agcgacggcc tgccgaacat ggaggttgat 1020 tgctcggcaa cgaaaagaca acatgaggga tcttcaacag aagatgaagg taacacgcga 1080 gcttccagaa gccggaagca gaaaaaggat agtgatccgc taaaaatcat tgaatccgag 1140 cccattcaat cgaagggaaa agatcggagg gcaaggtcga agaattgaga ttagggatgc 1200 cgtcttttgc gaacttttaa gtgatgatga ctttggctat ggttttggag ttttgattat 1260 ggatggtttt tatacttctg ctgggatttc cttcgtattt tatggctgcc ttgaaaatct 1320 tttgctacgt ttatatggac atggtatcgg aaaatgatat gtacttctat tgcttttatg 1380 gattggaact ttggctacgt ctcaggttat gagcccggat aacgatttac actttaatag 1440 ggatcacctt ccgttttatt attatgacaa cttcagcaac gtttactgtt atgatttttg 1500 gataacgatt aattactttg aaaatttggc gttatattta ttctgaaatg aacgttattc 1560 ggaaagtagc ttcaattaac ataaatgcta taagagttga tgttaaaaag tcgcttttga 1620 aagagtttat tttgaccaat gatttggata tagttcttct acaagaagtc gcatttgacg 1680 atttttcttt tttaccatca caccttgccg ttgtaaaccg ggatggaaaa tgctctggaa 1740 cggccttgct agtacgaaaa tccattgaag tacatgattg tatgaaaagc ccatgtgggc 1800 gaattatttc gatgattatt gatggtgtta attttgtaaa tatttacgcg catgctggtt 1860 ctcagttcaa agcagagcgg gatgatttgt tcgaagttca gacaattact cacatgagta 1920 aagttgcagc taccgatacg atagtaggcg gtgatttcaa ttgtattttg gaaaatgatg 1980 atacccgtgg gaacttcaag aacgtttgcc gaggtttaaa aacccttaca aatcggttca 2040 atcttatgga tgttgagctg aagatcaaag gccataaaaa gcaatttacc ttttttagga 2100 atgatgccgc ctcgcgcatc gatcgatttt attcaacgaa caatttgttt aaaaaaatta 2160 gtaaatttga aactaatgcg attcccttct ctgaccacca tgcgatccaa ttctgttacc 2220 aaattgaacc caatctaagg tcacctgtcg tagggagagg gtactggaag attaaaaatt 2280 tcttgttgca agatgctgac gtaaccgaag agtttgataa tgtgtataat agtttaaaat 2340 ccagaaataa gtacaataat aattttgcag aatggtggtc tttcgacgtt aaaaacaaga 2400 taaggcaatt ttacaaatca aaattttttg aattcagtaa tcgaaaccga actcaaaaag 2460 atctttgtta cgctaagctg aaacagttag aaatcaaaca aatgaatggt caaaatgtga 2520 gtgatgaatt agcaaaaacc aaatcagata ttataaatat agaaactaat cgcctaaaat 2580 ccataagctc caattttaaa ccaaacacga tgttggagaa cgaaaaaatg ggtatttacc 2640 aagtatctaa aattgtgaaa atgcaatcta accctaatgt gctacagcta aaaaatgata 2700 aaggtgttct ttgtgatcaa ttagaaacta agcaaatagt ttatgatcat tataaggatt 2760 tgttcgctgg aaggcatgat aatgttgcag ctgactgtat gttgaattca ttaaccaaag 2820 ttctaagtgc aaattcgaaa caagatctta ttgaacccat aacagaaaat gaattgctag 2880 atactattaa acgctgtact ttgaaaaaag cacctgggcc agatgggatc acctacgagt 2940 tttatctagt tcactacact acattaaaaa ccgatctact aaagcttttc aacggaatat 3000 ttaacggttc agtgaaacca ttagagagtt tttccgatgg gattgtgaca ctaatcccga 3060 aaataggaaa caagaatgac ataaacaatt atagaccaat tagtctattg aacaccgact 3120 acaagctttt atgtaaaatt atcgcaaacc gtcttaagaa atgtcttgaa gaacttattg 3180 atgagggcca aacagctggt atcaaaaaca aaagttgtac agataatctc gacatcattc 3240 gtactcttgt aattaaagct caacaatcaa aaaaattcaa atttgctttg ctcagtctag 3300 atctggagaa agcttttgac aaggtttgcc atgaacgttt gtgggaaatc atgcttaagt 3360 ttgatatacc gcaacaagtc gtaaactgca ttaagcagct atacaaaaat gccttttcta 3420 aagttttagt aaacgggttt ttgacacaat cctttaaaat agagaaatca gtacgacaag 3480 gttgtccttt gtcaatgtta cttttcacgt tatacattga gccattaata agacatttaa 3540 ataaaagttt atcaggaatt ctgattttga acagattcgt acgtgttctt gctttcgcag 3600 atgatttgac tttgatcatt agatctgacc aagaaatcga tgcagtgatg ggaattttag 3660 aaaatttctc taaattcgct ggtataaagc tgaatttaag gaaatcaggt tttttgcgtt 3720 ttaacaattg tgcgatagga ccacagaaga tcactgaaaa agatgaattg aaaattttag 3780 gtatttttat tggtaccaat tatagaaaag tagtcgaaac caactacaga aaagttattc 3840 aagcaataaa tgctacaatc caagctcata tgtctagaag acttaatctt ttggaaaagg 3900 tcgttgtttt aaacacctat attctctcga aattgtggta tgttgctcaa atcttaccag 3960 ccagcaatat acacattgca caaatacgta aaataacggg aagtttttta tggggttatc 4020 atagaatctt taaaatagaa cgctcacaat tatacttgca ttattttaag ggagggttga 4080 atttaataga tgtagacact cagtgtaaat ctcttttcgt gagaagttta cttttcaaag 4140 gtgccactgc tataagtcac tatttgataa gaacacacaa tttgaaagga ctaagtttaa 4200 atacacaacg tcttattcag aaagcaaaag aaatcaaaga tatgacgcat ctagtaacta 4260 acaagcaaat ttatgaatac atgttaacta aactaaatat tgaaattaag gttgaaaaaa 4320 agtatcccca aatacattgg gaagtagtat gggagaacat caatcaaaac tttttaagtt 4380 tatcagctag gtcaacattg tatgagatat ttaatgacat cattccgaac aaaattaaaa 4440 tgtttaacaa tcttgctaac atagatagct tttcatgcga tgtgtgtgga aaaccagata 4500 ataacgtcca tcggtttatg aattgtataa aaacagctga tatatggaac tgggtgtctg 4560 acataatcaa aaaccgatta aaaattaaaa taacttcgcc acatttgcta ctttatcgtg 4620 gaataggaaa acataattgt agattaaaag cagcattatg gctagtatgt gaagcaatag 4680 tgtacaatgt caaaaatcat aagaatccaa gtttatacat gttcaagaaa gatatacgag 4740 attatagatg gaataatcgc atactgtttt gtaaacattt caaaaactat cttaacatct 4800 gttgaacacc aacagaattt agttaatgat tagaattaag gtacatttta ttaaaaaaaa 4860 acactatgta tgttagacta tacattgtta tgtttgtgaa tgttgtaaaa agttaaataa 4920 acagtaaaaa tgaaaaaaaa aaaaa 4945 // ID Gypsy-29_DPu-I repbase; DNA; INV; 10357 BP. XX AC scaffold_68; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_DP_; KW Gypsy-29_DPu-LTR; Gypsy-29_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-10357 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_68; Positions 629373 639729. XX CC Positions [5347-5847] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1934..3070 FT /product="Gypsy-29_DPu-I_4p" FT /translation="MANTALDDLITALNRRDDIQAIPLFSGTATDQYISSW FT LSEADSIATIHKWSAEIKKANLASRLRGPALKWHTQRLIQAPNEEYTAWKQ FT ALKDHFKHPADRDKQIIKLENLQQTPNVPIRNFIDKINALYNAIYNDGRNP FT QNADPELDSLKNDKLVQILLKGAIKPIRELIFQRLPSSNVTWKQAQEAAIM FT AESIMFKQLALEPQTSSTATQQNPNVDLLTVTIMQQQQKKLEELEKRLNSI FT NFLGDKSNTTESSVNYMGDDSRSRTSQRQYNNSNSSSVQWKDQQQKQYHNT FT DNQQHNRQRSNSGNYRSSSGQRDKPQDDQRNRDRSKSPFRRPPTPKEDGNK FT PPRDKSQVRCYRCEGIGHYASECRTRDPRKFRKQEK" FT CDS 3028..4077 FT /product="Gypsy-29_DPu-I_1p" FT /translation="MSHQGPQEVPKTREINFKHKSYSLLRVPVAFSHLKTF FT ALIDTGAAASFISDEYLAMIPDAAVLNEFACTTKRIFQSASGESMPVTGIF FT QLLLQLSPECNVHHTFYVLPKLEEGCILGIDFLHLHNITLDVTKKEMRLGT FT NENVKIIKLNQLKKKTFPLYRVVDQPRIDFDIKHIKDLPVRNKFLGVLVKF FT MSIFASKLTELGKTNVLKHTIPTNGTAVSMRPFKTPFALRPFLEKQLNEME FT RHGIIERSTSPYRAQLLMVKKKSGELRVCNDFRNLNSVTIKDRYPLPLIED FT ILHLLHGAKLFTTLDLFSGYWQIEIADEDKFKTAFSCEFGHFQYARMPFGL FT CNAPSSF" FT CDS 4729..6618 FT /product="Gypsy-29_DPu-I_2p" FT /translation="MVHDGKRSVRNLPHGENIFPYLFGTFTIVTDHASLKY FT LMGKREPTGRLARWSLYLQQFDMEIHYRPGKLHQNADALSRSPVYAIVTPK FT FFVDDWIIAQQEDKFCKMLMEKQTGKEHPDHFGEEDSFKILPSGLWATSRE FT KIVVPMEFQKEIMVRYHDHKLAAHMGIEKTLANIRNKYFWPKMARDVRIHV FT TNCLICAKRKAAKACKAPLQPFPVAEYLWQRVAMDIVGPVTESYRGNKFIL FT VLMEYVTRYVIAFPLKETTAQTIVKKFIKHVITKEGIPAQILTDQGSNFQS FT ATMAELCKQLGTKQLRTTSYHPQTDGAVERFNQTLGNMLTTHTHNNPREWD FT EHLGYIVAADNTTPHSSTGDTPFFLLKGRDAIEPTDLRPPLRNRYLEDQNN FT VYAQQWQEAIELAKANLIVAQARQKHYYDRNLQECSFEPNDVVLLKILKAQ FT KGKFKMRWNGPYVVIEKLSNLNYLIRHQNDTYPVVVHVNRMRKWRGETKFN FT NENSDTENENSETDNEDPATTTPTTPERENTAKSSSNDPENNENSATAGHQ FT PTIANANDAVEPINGANTEVQQTTETTVENLIQTAPAEANEITTPVVLPEE FT PQGMIKRRRGRPRKNPTAPNSTIIPTPHTHNL" XX SQ Sequence 10357 BP; 3933 A; 2408 C; 1648 G; 2368 T; 0 other; attggcgacc ttgccatacg gcccatcatc tcttccggac aaccacacaa ctctacattc 60 aacaccacct acgtccagct gctcccacag cgggattggc taccaatagt agtaagtcaa 120 tgtcaaatcc atcacatttt gacccagttg accctcacgg ggcatgcgct cccagagaga 180 acgaaatcag agccattaca taccctgacc acgaacccca tacaattagt agcccaaccg 240 catatcccaa tttgaaagaa atatcattcc aaccaataca ttccatcgaa tccccacctc 300 aatactctgc cataaccttc ccaatcccta ccacctaccc ggatctatca atcgttcgag 360 caaatctccc tatagcagta cgcccacgta taataagacc tccaactccc ttaagaaatc 420 aacccagccc gctaccaatt aaacgctccc gtagtgctga tagcccacaa tcagactcgg 480 atagaacaac aatatcacaa tattattcgc aagacatcta tccaaccgaa aaaagaaaca 540 gactaatttt agaatttaac acataccaaa cagctttagc cgaatccctc tcaacgttag 600 cagaagctaa aataaaagct gccgaataga aactacttcg aaaaaatata gaaaaaaatt 660 aaatactgaa atcaaatcaa aagaaaaaga acaaagaaaa ctaactaaaa aattacatac 720 tgcagaacta gaaataaacc gcctagagta tcgttctcag acgtaccagg tagcagcaga 780 atctgcattt cgaacgtctg aaatccagaa aactaatctc gattcacagc tacaggtcac 840 aatatcagaa aacaggaatt taatctcagc cttagaactc aaaaccaggg ccgcagctca 900 attagaaacc gatctcttaa cagaaaataa atcacttcag gcagatataa aactacttaa 960 aaattcaaac caatctaaat ctaatcaaga acaaaccctt caagcggaac ttcaacaact 1020 ccaactacaa ttatcaaaga gaaactcaga attacaagac acctccagca cagtaattat 1080 ccttcaagcg caaataactg acctccaaaa agaaactcag caactagaaa aggctcacaa 1140 agaaatcctt caacaaaatc aaagcataca atcagaacta gccgttagca aaaacgaaat 1200 ccaattaaaa tcttcagaca actctacatt agaaaaagct cataagaagc tactccaaca 1260 gctccaaact actatatctg acttaggaaa taccaaactc caactaattg atacaaataa 1320 agaattaaaa gatatacagc attcacacaa agaactctta caactgttta acacaactaa 1380 atcagaactc gatactacaa atagctttag tgaaacaatt aaattaaaaa ataatactct 1440 cagcgaacaa gtatccttac ttacccaaca gtctgaacga cttcaaaacc aactagcagg 1500 agaaaaccaa acagccgcca caaattatac tttcagccta gaaaacgaca tcattacatt 1560 aaaacaatca aacacacaac tccaaatttc caattccaac taccaaaaac aagctacttt 1620 tcaagaaaac caaattgaag aactagaaac tcgagtaaac aaattactcc aggtagagca 1680 agcactccta atcgaaaaaa acgaaactag aacactttca cttcaactaa tttcattaaa 1740 cacaacatta accgatatac aacaagaaca cataaaatta caaaacatca ttcaagggta 1800 tcaagaccaa ttagacgttg atttaccttc aatctcctcc actgaagcta ataacgtatc 1860 aaatcaacaa tcaccatcac ctacccccaa cgtcctattg aacgagcaac ccctaccaga 1920 cgacatcgac gacatggcaa acaccgcttt agatgacctc atcacggcat tgaatcgacg 1980 agacgacata caagcaatac cacttttcag cgggacggca acagaccaat atatttcttc 2040 ttggttatca gaggctgact ccatagctac catacacaaa tggtcagcag aaataaaaaa 2100 ggcaaactta gcatcacgac tacgcggacc agccttaaaa tggcatacgc aacgactaat 2160 acaggctcca aatgaagaat acacggcttg gaagcaagct ttaaaggatc acttcaagca 2220 tcccgccgac agagataagc agataattaa attagaaaac ttgcaacaaa ccccaaacgt 2280 accaataaga aatttcatag ataaaattaa cgccctatac aacgccattt acaacgacgg 2340 tcgcaaccct caaaacgctg acccagaatt agattcatta aaaaacgaca aattagttca 2400 aatacttctt aaaggcgcaa ttaaacccat ccgtgaactc attttccaac gtctaccttc 2460 ttccaatgtc acgtggaaac aagcacaaga agcagctatt atggccgaat ctataatgtt 2520 taagcaattg gcattagaac cacaaacatc ctccactgcg acgcagcaaa accccaacgt 2580 ggatctgtta acagtcacaa taatgcaaca gcaacaaaag aaattagagg aactagaaaa 2640 aagacttaac agcataaatt tcttaggtga caaatcaaac acaactgaat catccgtaaa 2700 ctacatggga gacgactcca gaagcagaac atcacaacgc cagtacaaca actcaaattc 2760 cagttcagtc caatggaaag accaacaaca aaaacaatac cacaatacag acaaccaaca 2820 acacaaccgt caacgatcta acagcggcaa ctaccgaagt tcaagcggac aacgggataa 2880 accacaagac gatcagagaa atcgtgaccg atccaaaagt cctttcagac gcccccctac 2940 accaaaggaa gacgggaaca aaccccccag agataaatca caggttagat gttaccgttg 3000 tgaaggaata ggacattacg cttcagaatg tcgcaccagg gaccccagga agttccgaaa 3060 acaagagaaa taaattttaa gcacaaatca tatagccttt tacgtgtacc cgtagctttt 3120 agtcatttaa aaacttttgc cctaatagac actggcgcgg ctgctagttt tatttcagat 3180 gaatatcttg ccatgatccc cgacgcagca gtattaaatg aatttgcatg taccacaaaa 3240 agaatttttc aaagtgcatc aggagagagt atgcccgtta caggaatttt tcaattatta 3300 ttacaactgt cccccgaatg caatgtgcac catacttttt acgtattacc aaaattagaa 3360 gaaggatgta ttttgggaat agattttttg catttacata acataacgtt ggacgtaact 3420 aaaaaagaaa tgcgattagg cacaaacgaa aatgttaaaa taattaaact taatcaatta 3480 aaaaagaaaa cattccctct atatcgtgta gtagaccagc cgcgcataga ttttgatata 3540 aaacacatta aagacctccc tgttagaaat aaatttttag gagtattagt aaaattcatg 3600 tcaatttttg caagtaaatt aacagagcta ggtaaaacaa atgtattaaa acacacaata 3660 ccaacaaacg gtaccgcagt atcgatgcgc ccatttaaaa cccccttcgc tctacgccct 3720 ttccttgaaa aacagttaaa tgaaatggaa agacacggca taatcgaaag aagtaccagc 3780 ccctatcgtg cacaattgtt aatggtaaag aagaaatctg gagaacttcg agtttgcaac 3840 gacttcagaa atctcaacag cgtaacaatc aaagatcgct acccgttacc actcatcgag 3900 gacatccttc atctactgca cggagcgaaa ctgtttacca cactcgatct tttcagtgga 3960 tactggcaaa tagaaatcgc agacgaagac aaattcaaaa ctgccttctc atgcgaattc 4020 ggtcatttcc agtacgcgcg catgcctttc ggactatgca acgcacccag ttcattttaa 4080 cgtgcaatgg aaatcatctt acgtcctatc attaacaaat ttgtcatggt atacatcgac 4140 gacattattg tgttcagcaa aaatattcaa gaccacatat accatctcga acaagttttt 4200 actttgttac tagatgctgg cttgaaaatt aaaatacaaa aatgtaaatt tgccaaaaac 4260 gaagtggaat atttgggaca catagtttca gaagaaggtg tgaaagtcga tccggctaaa 4320 gtaaaagccg ttcgcaactt ccccctcccc aagaaaatgg aacaacaaaa accgatcgtt 4380 tttaggaatt gcaggttatt acaggaaatt catcgaacaa tttgcagaca tcgtccatgc 4440 actcacccag ctcacacgca ggaaagtgca atggcaatgg ggaactgcgg cacagacagc 4500 gttcgacaag ataaaggagc tcctctgctc agcgcccgtg ctcgcgtatc caaactttgc 4560 gcaacccttc atcattcata cggatgcgtg cggctacggc gtcggcggaa ttttatcaca 4620 aatgccgagt gcccccagtg aaccagagtc tgtacaagag tccatcatgg actcaaaaga 4680 gcatccaatt gcatatacat caaaacatct aaacgatcta caaataaaat ggtgcacgac 4740 ggaaaaagaa gcgtacgcaa tttaccacac ggtgaaaaca tttttccata cctttttgga 4800 actttcacta ttgtgactga tcacgcttct ctaaaatatt taatgggtaa aagagaacct 4860 acgggaagac ttgctcgttg gtctctctat cttcaacagt tcgatatgga aatacactat 4920 agacctggaa agctacacca gaatgcagat gcattaagta ggagtcctgt ctacgctatc 4980 gttaccccta aattcttcgt cgacgactgg atcattgcac aacaagaaga caaattttgc 5040 aaaatgctca tggaaaaaca aaccggaaaa gagcacccgg accacttcgg ggaagaagat 5100 tcctttaaaa tattaccaag tggcttgtgg gccacatcac gagagaagat agtcgtgccg 5160 atggaattcc agaaagaaat tatggtaaga taccacgacc ataaactagc agctcacatg 5220 ggtattgaga agacgctagc aaatatccga aataaatact tttggcctaa aatggctagg 5280 gatgtcagaa ttcatgtaac caactgctta atctgtgcaa aaagaaaagc tgctaaagca 5340 tgtaaagccc ccctccagcc tttcccagtt gccgagtacc tctggcaacg agtagcaatg 5400 gacatcgtgg gacctgtgac agaaagctac agaggaaaca aattcattct ggttctgatg 5460 gagtacgtca ccagatatgt aattgctttc ccattaaaag aaactactgc acagactata 5520 gtaaaaaagt tcataaaaca tgtaattaca aaagaaggca ttccagctca aattttaacg 5580 gaccaaggtt caaacttcca atccgcaaca atggctgagc tatgtaaaca gttaggaacc 5640 aaacaactca ggactacatc ataccaccct caaactgacg gagcagtcga aaggttcaac 5700 caaactctag gaaatatgct aaccacacac acgcataaca atcctcgaga atgggacgaa 5760 cacctcggtt atatcgtcgc agctgacaac acgacgcctc attccagtac aggagacaca 5820 ccctttttcc tactcaaagg aagagatgcc attgaaccaa cggatttaag gcccccttta 5880 aggaacaggt acttggaaga tcaaaataac gtctacgctc aacaatggca agaagccatc 5940 gagttggcaa aagcaaattt aatcgttgca caagcgcgac aaaagcatta ctacgaccgt 6000 aacctccaag aatgttcttt cgagccaaac gacgtagtgc ttttaaagat attaaaggct 6060 caaaaaggta aattcaaaat gagatggaac ggaccatatg tagtcattga aaaactgtct 6120 aacctaaact atttaatacg ccaccaaaac gatacgtatc cggtagttgt acacgtaaac 6180 agaatgcgaa agtggagagg tgaaacaaaa tttaacaacg aaaattcaga caccgagaac 6240 gagaactcag aaaccgacaa cgaagaccca gccactacaa cacctacaac gccggaaaga 6300 gaaaatactg caaaatcatc atcaaacgat cctgagaaca atgaaaatag tgcaactgct 6360 ggccaccaac ctactatagc caacgcaaat gacgcagttg aacctataaa tggtgccaac 6420 acagaagtac aacaaaccac cgaaaccact gtcgaaaact taattcaaac agcaccagca 6480 gaagcaaatg agataacaac cccagtagtg ttacctgagg agccccaggg tatgataaaa 6540 cgaagaaggg gtaggccacg aaagaatcct acagcaccta actctaccat aatccctact 6600 cctcatactc ataatttgtg aaaaacaata cgtttaccac aataagaatt acattctctt 6660 tacagatgta ttcttccgtt ctactaattc tactcttcgc atctctttca tctacactaa 6720 caatcgacac gtgtaattgt tctcagccaa tcgaaaaagg tatcatcgat ctagaagatc 6780 cactatactg ttcccaccga acctgtaagt aaaccaaaga aagtacaata taaattatgg 6840 acgaaaaaca aggaccccat aacatggaca ggctacgctt gcacacaatg gctgtctcaa 6900 aaagaaatat ccacaaattt tttattgtca catgacacca catttaagaa acaagtgctt 6960 ttaagtcagc gccaatgatt gctgggcgtc tgcacagtac ccggaaatgt gtgacacaaa 7020 tccgatgaca aaagatggca atacgcttaa atccttacta gaaccagaag gcgaaggata 7080 ttggatgacc actcaatcat gcaaagcaaa gagttgcatc acacaaatta tcaaactgac 7140 taaacaatgt cacgactgtc cagtaacatc accattcgga attttaggca attctagcga 7200 cgcagtattc gccaaacata acgatttaac aatcgtatgg aaaaagccag actcaagtca 7260 ggagccagac tgcgacatta aactattttt acaggaagcg gtaacctcac tgacggaatt 7320 aaacaatcca aattagaaga tgccaactcg caaatagaaa taatcttaaa caacaacatc 7380 acgcggctat gcaacaatgt aacggcattt tcagtaatgg gaataccaga cacccacatt 7440 gaaatcactt ctgaaaagag aagacgaaaa cgaagtatcg aaaataagcc atccggaatc 7500 ctccgactag ctcacagagc taaccgttgc ctagcgtaca cgcgaaatga atatattaca 7560 aagatgtccg ctgaaatttg tgcaatttca ggaaatctta cggatcgtgc cctgtatgca 7620 aacactgtgg atcacggaca aagttttcaa tttctggagt caggctacat tctggctaca 7680 gaagaagact actgtctcca cgcatgggaa cccgaccaaa tcgccgtaaa gcagtgcatc 7740 gattttggac gacaaaccaa aactatgcac ggtccgcaaa catggatgtt aacagcaaac 7800 cccgaggaaa acccacttac cccattaatg ataataaacg tagataaaaa ttaatgcctt 7860 acaacagcca acaataacca cacagtttac ttatcaccct gcaatgcaag cgaatatcaa 7920 tattatatat ttgaacacat cctcccagag aacagctatt tcgaattaaa tttggcaaaa 7980 gaaacgacca tttcagcaaa cggattctta gaattcaaat ccaatacgct aaataaccca 8040 tttgggcacg caccacaatt ttacggcaac atcttttctc gtttattacc tgaatactgt 8100 ctatccagct tggaagataa cagtataaca attaaattat gtaaaaccag tatttttaaa 8160 actgtttacc cacagcaaga attttcttat tacaataaaa cgctaacctt gctaggtaca 8220 acttcttgcg ttacacctct aacaacggta gtttgctcag attcaaacat agcagaagct 8280 aaatggaact acgatttggg tctattcaca ttaacagaca gctcgacttt aaaatgccta 8340 acagtagtca acgactcaac aatcgaaatg gcaccgtgca aatcattaaa taaaaaccaa 8400 cagtgggact ttcaacacaa cgtcgtcact taccctttcc ctttagaata cattccaaca 8460 atgaaagagc tcaaacaaaa acgagaccga tcagacagac taaacagaag aaaaccaaaa 8520 attattgaaa caagaacaca acaaaacaca ggaccaatcc cacaaaacca aactcaacca 8580 ataagtagcg acgcgccaaa tgtgcctaaa ctatcaaacg aagaccgtga aattctctca 8640 atagaaaatc aacaattcgt ggaagcacaa gcgataaaac acgaaacaca actggctaat 8700 gaagtacgtc aactttactg cagaatcact acactacaaa gaaaccaggc aatgatttta 8760 gctcagacaa acggtctgtt agcaggaaga tctttaaacc tcggaaagtg tagtagagtc 8820 ttaggaagcg gaaggacact tattctgcaa cagtgcatga ttatcccagt taaaatcgaa 8880 gcacttctca cccagtgcgg atatcaaccg tttttccaga caccaaacgc aaatttcaca 8940 gttggcaaag acggttggtc cctccatcct tttcaagatt gtttttggaa caacaaatat 9000 gtaactttta tatatattaa ctcacttatt ttatatatat gtaactcttt atcagggtta 9060 gtactagata gcaagtcacg ttccagtttc tgggacatga ctagctggat tgaaataata 9120 aagtacggat ttctcgcatt aataatattt ctactatcaa cactagtatt atggatatgt 9180 tttactgttg tcccatttta taaaattttt gctctatgta agaggaaacg gaaacatgtt 9240 cagactgaac aagaaattcc gttaactcaa agaaaatcta gaaaatcaca ctctcatcgt 9300 acaacaaaaa tggaccctga aaaaggtctc tgttgggacg atggatgtgt aatccaaaaa 9360 atggaatcgt aaaaataatt aatgacaacc gcattcttct gaccacattc tttaaaaatt 9420 tttgtcttaa atagttttaa agtagaaaat tatttccaaa tttaaaaatt tacactaaaa 9480 aaaaacaaca acaaaacaaa caaacaaaca aaaatccata aacaattcaa gactatttat 9540 tgactacaca aattcaagtt ggctcacacg aaattaaata ccccattcag tattctaaat 9600 gaaataaaat caattattaa ccagccattc ctccaactgc cattttacac gacgcacctc 9660 attcaaatcg tcaaatggat attgaagctc tccatataca attccgcatt cccgtcaaag 9720 ggatagaaaa tatctttccc tgattccatc caccacaata gtcttgaaca cccgaatgga 9780 taaaaattac gataaaaggc ataacctttg tatcgactaa aaaaaaaaaa aaaaaaaaaa 9840 aaaaaaaaaa aaacaaacaa acaatatttt aattaaacat gaaactaaca atcaaaaata 9900 ttatgtctaa atgtgaaatt cttacttgta ataaaaatcg ttaacgtgca ccgcactgaa 9960 atgatgccga caagccggat acagcatccc aaggtcgaca gcgtcgaaac cttccaggaa 10020 cagcacacga tgcctatcct ctaactttcc ccaaaagtta tgaccatcga aagccgaaag 10080 agccacaaaa gaacaactgg acggcgtaac tgacatgttg caggatttaa agacgaacaa 10140 caactgatct tcacctgcaa gcagtttctt ctttaaaaag ggaaggagga gaagatgcac 10200 cctaacaacc atatatctat gtgtagtata attatagtaa aaatgtagat gcatattctt 10260 ctggtaaaga aaaactatgg gcaaatgaaa aaaaatatat atattctgtt gttaccccta 10320 tcatatagtt ttcattcttc tttaaaaagg gaaggaa 10357 // ID I-8_AAe repbase; DNA; INV; 5599 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5599 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1362-1362 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 25 sequences with 87-97% CC identity. XX FH Key Location/Qualifiers FT CDS 255..1760 FT /product="I-8_AAe_1p" FT /translation="MDQDSDHCEEPPAKIKPPDPNGXVQHTQPILLSQIPS FT PDSALSVPPLSAHAAPVVPPPPMPPSSSHVAPPSSSSSPSLQSISIPKPPQ FT SKRHRCYPESSKGPYEVFVRQKDKPVKVLLVSAEVHKHFRSVKEVKQIGFT FT KVRIIFSDRLEANNVVYNELLSRLYRVYIPSERVEIDGVINQAEMDLNYLT FT NEGYGKFKDPRVPQVSILECQQLAEARVEDNVKTYSPSNEIRVTFEGTILP FT DYVEIDKALIRVRVYTPKVMLCAKCKRFGHTEIYCSNQACCGSCGKRHAEG FT ECTIQQPACLYCKGTHLNKNECPTYKKQMMLAKARVVQKSKLSYAAIVKAP FT VSPNFEHENEYSSLSELSDLEEEAHHSSYVPPKRKKLRFDNVAKRTKQHND FT KVPPPVGQSAKQARPPLEIPGFKKDAFPPLPDRNGGLDSNRRANESSNSNP FT CTDKFNVPQLIHSFAEAFDLGKFWIDLLLKLTPIFKTVLSKIITCLPLLGS FT FLSIDG" FT CDS 1764..5438 FT /product="I-8_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="SESKKRFSILQWNCRSILPKIGNLRSFLNRIEADCFA FT LSETFLVESLFFSIPSFNIVRLDRETRSGGVLIGIKSNYSFERVTIRNSLP FT IEIVACSVQIDTQNVTIASIYIPPSAKIDRNKLRVVIGQIPQPAFILGDFN FT SHGTEWGESVDDSRANIIYEILDEYNLAVLNTGEITRIACPPHKCSRVDLS FT LCSSAIALSSSWSVMNDPADSDHLPIHITFGTFEHQQTTVQFDLCRHINWN FT DYSSLILQAIDDNNELQPDELYSFLTSIINEAAMKAQTKPIPKRPRKSKTP FT TIWWDKDCTEMINQRSEAYKMFRRSGATEHFLRYKKVEAKTKRLLKQKKRS FT FWRTYVESLDSQTALSSLWSTARKMRNYVPPPSNLXKFDSEWIQDFAQIIC FT PAFAXVKRPNFLNESSVXDGLDLIHDYSLSELEHALTACNNSSPGLDNVKF FT PLIQKLPEKAKLLLLKSFNWIVQNNAIPDSWSIVKIVAIKKPGRDPGSADS FT YRPIGLLSCYRKTLEKMIQFRLESWAEKNNILSPTQYGFRQNRSTRDCQIL FT LATDIQLAYTSKEELAAIFLDIKGAYDSVLIDILCKKLNELGVPGQLASFL FT YSLLSYKEMHFEQNNKISLTRCSFAGLTQGSVLSPLLYNMYVSDIDSCIHK FT NCYLVQYADDSCFWTFVKNIELAKEYLQTTLNNLIKWANDIGFTFSPNKSE FT TVVFSRKRFPAKVEVQLGSEKIPTSSSTKYLGVWYDQKSTWKMHVEYLVTK FT CQKRINFLRTISSTWWGAHPNDMIKLYKTTILSVIEYGCFVFHQTAKVHFL FT KLERIQYKSLRICLGLMTSTHTMTVEILAGVPPVDVRFLELNSRYLVAKHI FT SSSPILNNIDKLFEMNPXSTYLKAYRLCLTIPSGPQEHLHYFDIDRPSAFP FT DIPIDKSLISAVRDAPSAERRMILAICNEKLSGFDEQNIFYTDGSLLDGHV FT GFGVYNVNHQVYFRLVEPCSVYMAELTAIWYVLDYVGSLPPGEFLICSDSL FT SSLEAIKNIRLSSKTSXMVLKIRNGISELQGRNFKISFLWVPAHSGIXGNE FT YADALAKEGTVRGEEFHRALEPHDFNTVIKHTMIQIWQNQWSHSDLGRFCY FT GILPNVKLKPWYTGINCDRTFIKNMSRLISNHFTLNAHLFRIKITQTNLCE FT CNEDYEHFDHILWNCNKHTDSRITMQTALSRIGRPLNTPIRDVLGTMDIPA FT MRIINNFILNAKIKI" XX SQ Sequence 5599 BP; 1754 A; 1201 C; 1064 G; 1569 T; 11 other; cagttgcttc tcacctctcg acgagtacgg ttgtgtttta tatccagctg tggttcagta 60 cagaaaagta tactagtagc gtgtgatctt tgggatctca tttttcgtgc tatcttcctt 120 ccaccttcgt ttttcggttc gcagataagt acccttttga ttgattattc tcttttgagt 180 gactacagca actgctgcaa caaaaaaaaa gttttccgtt tgacgctctg gaagctaagc 240 ttccatattg aaccatggac caagacagcg atcactgcga ggaacctcca gctaagatca 300 agcctcctga tccgaatggt cakgtacagc atacacagcc gattttgcta tcacagattc 360 cttctcctga ctctgctctt tccgtaccgc cattgtccgc tcacgctgct cctgttgttc 420 ctcctccgcc tatgcctccg tcttccagtc acgttgctcc accgtcgtcg tcgtcgtcgc 480 catcactcca gtcgatctca ataccaaagc ccccgcaatc aaaacggcac agatgttacc 540 cagagtcctc caagggtccg tacgaagtgt tcgttcgtca gaaagataaa ccagtcaaag 600 ttctgcttgt atcggccgag gttcataaac attttcggtc tgtcaaggaa gtcaaacaga 660 tcgggttcac taaggttcga atcatctttt ctgatcgtct agaagccaat aatgttgttt 720 acaacgagct tctcagtagg ctgtacagag tttacatccc tagtgagaga gttgaaatag 780 atggtgtaat aaatcaagcc gagatggact tgaactacct aactaatgaa ggttacggaa 840 agttcaagga cccacgtgtt ccacaggtgt cgattcttga atgtcaacaa ctagccgaag 900 cgcgggttga agacaacgtc aaaacctatt ctcccagtaa cgaaattcga gtcacattcg 960 aaggtaccat tcttccagat tatgtggaga ttgataaagc tctcattcgt gtgcgcgtct 1020 acaccccgaa ggtcatgctc tgtgccaaat gtaaaagatt tggtcatacc gaaatctatt 1080 gttcgaatca agcttgttgt ggtagctgcg gtaagcgcca cgctgagggt gaatgtacga 1140 tacagcaacc agcttgcttg tattgcaaag gtactcacct taacaagaat gagtgcccta 1200 cctataagaa acaaatgatg ctagcaaagg ctagagttgt tcaaaagtcc aaattgagct 1260 acgcagcgat agtgaaagct ccagtgtctc ccaattttga gcatgaaaac gaatacagct 1320 cgttgagcga actgtccgat cttgaagaag aagcccatca tagcagttat gttcctccca 1380 aaaggaaaaa acttcgattc gataatgttg ctaagcgtac aaagcagcac aacgataaag 1440 ttcccccccc agtcggtcaa tctgccaagc aagctaggcc tcctctagaa attcccggat 1500 tcaagaaaga tgcatttcct ccactcccag atcgtaatgg tggtttggac agcaaccgac 1560 gcgcaaatga gtcctctaat tcgaacccat gtacagataa gttcaatgtt cctcagttga 1620 ttcactcgtt cgcagaagcg ttcgatctag gcaaattctg gatcgacctg ctgttgaaac 1680 tgactcccat tttcaagacc gttctttcaa agataataac gtgcttgcct ctcctgggtt 1740 cattcctgtc tatcgatggc taatccgaat cgaagaaaag attctccatc ttgcaatgga 1800 actgtcgaag cattttaccg aaaataggta atcttcgctc ttttctcaat cggatagaag 1860 cagattgttt tgcactctct gaaacatttc tagtagaaag tttgttcttc agcattcctt 1920 cgtttaatat tgttcgtcta gatcgtgaaa ctcggtcagg aggagtattg atcggtatta 1980 agagcaacta ttcattcgaa cgagtgacca tccggaatag cttgcccata gaaattgtcg 2040 cmtgttccgt ccagatagat acccaaaatg tcacaattgc ctccatttat attccaccct 2100 cggctaaaat agatagaaat aagctgcgtg tagtgattgg tcaaattccg caaccggcat 2160 tcatattagg tgatttcaac tcccatggca ctgaatgggg agagtcggta gacgattcca 2220 gagcgaacat tatctatgaa attttagatg aatacaatct tgccgttctg aatacgggag 2280 aaataacccg aatagcttgt ccaccccata aatgtagcag agtcgacctt tctctttgct 2340 cttcagcaat tgcgttaagt tcatcctggt ctgtcatgaa tgatccagca gacagtgacc 2400 accttcccat tcacatcaca ttcggtactt ttgaacatca acaaacaaca gtacagtttg 2460 atctttgtcg ccacataaat tggaatgatt attcgtcatt gattttgcaa gctattgatg 2520 acaacaatga gttgcaaccg gatgaattat attcttttct cacttcgatc ataaacgaag 2580 ctgcaatgaa agctcaaact aagcccatac cgaaacgccc acgtaaatct aaaactccta 2640 caatttggtg ggataaggat tgtactgaaa tgatcaatca aagatcagaa gcttataaaa 2700 tgttcagaag atctggtgcg acagaacatt ttctgcgtta taagaaagtt gaagcgaaaa 2760 ctaaacgcct tcttaaacag aagaaacgaa gtttttggag aacatatgta gaatcacttg 2820 atagtcaaac tgctctcagt tctctatgga gtacagcacg caaaatgcgt aattatgtcc 2880 ctccaccgag caatcttgan aaattcgatt ctgaatggat tcaagatttc gcccaaatca 2940 tatgtcctgc ttttgccscc gtgaagagac caaactttct gaatgaaagc tcggttatkg 3000 atggcttaga tctaattcat gattacagtt tatctgaact ggaacatgca ttaacagctt 3060 gtaacaactc gtcacctgga ttagacaatg ttaaatttcc tctgattcaa aaattgcctg 3120 aaaaagctaa actccttcta ttaaaatcat tcaattggat cgttcaaaac aacgccattc 3180 cagattcatg gtcgatagtg aaaatcgttg caatcaaaaa acctgggaga gatccaggca 3240 gcgcggattc atatcgtccc attggattat tatcatgtta ccgaaaaaca ttagaaaaaa 3300 tgattcaatt tcgattagaa agctgggcag aaaagaataa tattctttcc ccaactcaat 3360 atggatttag acaaaataga agtacacgtg attgtcaaat actattagca accgacatac 3420 aattggccta cacctccaaa gaagaactgg cggctatatt tttagacata aaaggtgctt 3480 atgattcagt tttaattgac atactttgta aaaaactaaa cgaactcgga gttccaggtc 3540 agctagcaag ttttttgtac tctttgcttt cgtataaaga aatgcacttt gaacagaaca 3600 ataaaatatc gcttacacgt tgtagttttg caggactcac acaaggttct gtgttaagcc 3660 ctctacttta taatatgtat gtaagtgata ttgatagctg tattcacaaa aactgctatt 3720 tagtgcaata cgctgatgac agttgttttt ggacgtttgt taaaaacata gagttggcga 3780 aagagtattt gcaaactaca ttgaataacc tcataaaatg ggccaatgac atcggattta 3840 cgttttcacc aaataaaagc gaaactgtag tcttttcaag aaaacgtttc cccgcaaaag 3900 ttgaagtaca acttggatca gaaaagattc ctacatcatc aagcacaaaa taccttggag 3960 tatggtacga tcagaaatct acatggaaaa tgcacgttga atatcttgtc acgaaatgtc 4020 aaaagcggat aaactttctt cgaacaatat cwagtacatg gtggggtgca catccgaatg 4080 atatgattaa gctttacaag acaaccattc tgtcagtaat agagtacggt tgcttcgtgt 4140 tccaccaaac agcgaaagtc cactttttga aactagaaag gattcaatat aaaagtctac 4200 ggatatgctt aggactaatg acttccaccc acacaatgac agtcgaaata ctagcaggag 4260 tgcctccggt cgatgtaaga tttctagaat tgaactccag ataccttgta gctaagcaca 4320 tatcttcgtc acctattctc aataacatcg ataaactgtt tgaaatgaac ccgsagtcca 4380 cttatttgaa agcatataga ctctgtttaa cgatcccttc tggacctcaa gaacatttac 4440 actactttga catagaccgc ccatctgcct tccctgatat accaatcgat aagtctttga 4500 tatctgcagt tagagatgcg ccaagcgctg aacgtagaat gatactagct atctgcaacg 4560 aaaaattgtc tggttttgac gaacagaaca tattctacac cgatggatcg ttgttggatg 4620 gacatgtggg ctttggggtt tacaatgtta atcaccaagt gtatttcaga ttggttgaac 4680 catgttcagt atatatggct gaacttacag caatatggta cgtwttggat tatgttgggt 4740 ctttaccacc aggcgaattc ttaatatgct cagatagttt gagctctttg gaagccatta 4800 aaaatattag attatcctcc aaaactagtm acatggtact caagatcaga aatggtattt 4860 ctgaattgca aggaagaaat ttcaaaattt cattcctttg ggttcctgct cattccggga 4920 tamaaggaaa cgagtatgcg gatgccctgg caaaagaagg cacagtacga ggagaggagt 4980 ttcatcgagc attggaacca catgatttca atactgttat aaagcacact atgatacaaa 5040 tttggcaaaa ccaatggagt cacagcgatc tgggaagatt ctgttatggt attcttccta 5100 atgttaaact aaaaccttgg tacacaggaa tcaattgcga ccgcacgttc atcaaaaata 5160 tgtcaagact catttctaat cacttcactc tgaacgctca tcttttcaga attaagataa 5220 ctcaaaccaa cctatgcgaa tgcaacgaag actacgaaca cttcgaccac atcctttgga 5280 actgcaacaa acatacggat tcaagaatca cgatgcagac cgctctgagt cgaatwggga 5340 gacctcttaa cactcccatt agagatgttc ttggaacgat ggacatccca gctatgcgaa 5400 tcatcaataa cttcatatta aacgccaaaa tcaaaatata atgtctagta gtaataagaa 5460 aaaaaaatgc ttaaccatat ataaaattca actattttat aaaaaaaaaa ctcgatgtat 5520 cggctacgaa atggttacaa ctaactaaaa ttgcctaata aatcgtgggt tttgtgccca 5580 caagaaataa aaaaaaaaa 5599 // ID Gypsy-17_AA-LTR repbase; DNA; INV; 214 BP. XX AC supercont1.336; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_AA_; KW Gypsy-17_AA-I; Gypsy-17_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.336; Positions 890636 890423. XX SQ Sequence 214 BP; 80 A; 38 C; 33 G; 63 T; 0 other; tgtaatatac gaagtatcca caattgctgc ataggaatat cttgcataca agaaggaagt 60 ttttatgaat atggcaataa agcactgaac tgaaattgta tactaacact tactaattaa 120 tgatgagctc atctacaata tgaacaataa actgacttga attacggccg gctaactgat 180 cagacgtttt atattgatcc tcccaaatat caca 214 // ID Harbinger-N5_BF repbase; DNA; INV; 324 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N5_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N5_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-324 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-324 RA Kapitonov V. and Jurka J.; RT "Harbinger-N5_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 818-818 (2008). XX DR [2] (Consensus) XX CC This family is and old putative family of non-autonomous CC Harbinger transposons: copies are only 81% identical to the CC consensus. The Harbinger-N5_BF elements have generated numerous CC minisatellites made of their termini. It can be that this CC transposon belongs to the Kolobok superfamily. XX SQ Sequence 324 BP; 75 A; 101 C; 73 G; 75 T; 0 other; agcccctgtc acacattcag gaccttccca cgactttgct cccgaccact ccccaaccaa 60 ggtcggcgct gggttgggag tagcctggaa tccatcctat tctagctcca gttcgctcct 120 cattgcatcg catccaagca gaatatgact ttcccgactt tgatttttca aaattcatag 180 tcggctggcg cagacgactt ttaaagacta gtaggcaacc tcgccgacca actcccaacc 240 acagatgacc ttgctcacga ctaactccca accatggtct ggagaaggtc gggaggccat 300 ggtcggctat gtgtgacagg ggct 324 // ID CR1-112_AAe repbase; DNA; INV; 4600 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-112_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4600 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1200-1200 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 161..1435 FT /product="CR1-112_AAe_1p" FT /translation="MSCEICACDTSVDSVLWSCVGCPRKFHANCIGVTVQR FT GSLRRRDRKVVDVNSYVLPCCESCQELIQAKLDFNGLLEQQKVLEKQLNAN FT TEVMHRLSLHQEKPNVVHEAIEGMEILLTSVRNELAAINKNSSLAGSVVAI FT KNHITGLLDTAITATTENMYSTLKTVTSDISTDLRNINDEISHLSQLSIDT FT AASCAMNSNPMLGLDILDELKSLSANILTNKNTSAPPSIEYPSLEAELNNK FT IAEVSGWRLLGNRKVWKADWTQYDMRKSFHKRQQNMADKAKKRRKRRIRNA FT NKNCNKNNINNNDVNINHHCHNNVPSKIHHKVSKQQNNNHGNFAFNNSQER FT SHQLPPDRELLAAAKHHFSRPPTNYRPTMQFKRGEILNPYPAREAPRQSVA FT VPPPNWTTEGSSTGGSCEACGACRHSCFLRN" FT CDS 1399..4470 FT /product="CR1-112_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRCMPAFVFSTELTDPSEAENNSIQPGEANQHLVTNN FT NDNSPYGLTDLNNELPIISSETQPTEILIYCQNFNRMKSAFKINEIHKNIL FT SSSYSIILGTETSWNDDVKSEEIFGSNYNVFRNDRDLLLTQRRSGGGVVVA FT ISSNFSSEIIDSPKFKEFEHVWVKSNIGNETHIFVSVYFPPDQACKSTYES FT FFLIAEEIITQLPPENKVHIYGDFNQRNADFIPDFENESILLPVVGDNETL FT QFIFDKTASLGLNQINHVKNKQNCYLDFLLTNMHEDFCVSESISPLWKNEA FT FHTAIEYSMFMHIYHTPHEYVYENIFDYSRANYTNIRLKLDNANWQSVLRN FT QNNIECAIEIFYNLLWETIREEVPVKKRRRNHNSKNPIWFNKQIINLKNRK FT QKAHKVYRRYKKPDDLEKYLVICDQLNLAISTALTEYNIKTENEIKSCPKN FT FFNYVKTKLKSCNFPSTMTLDEKVGNNSEDICNLFAKFFQENYSTFSENDR FT DYSYFSHFADFPSDVGVNSINVQDILFGLKNLDATKGSGPDEIPPGFIKNL FT ATELTTPLFWLFNMSLQTGQFPKVWKKSFLIPIYKSGKKSDIRNYRGIAIM FT SCIPKLFESIVNKNMFAQIKNRITNAQHGFFKGRSTTTNLLEFVSYSLSAM FT DKGYFVEALYTDFSKAFDKLDIPMLTFKLEKMGIEMSLLKWIKSYLNDRQQ FT IVKFNGKRSNPIHVTSGVPQGSHLGPLLFILYVNDVSYILNKLRVLIYADD FT MKLFLEIKNDDDHNVFKNEIQIFYTWCCKSLLELNIKKCNLISYSRKRTTP FT NMSIVLGNEHVKKCDKIRDLGVILDSKLSFVDHYNAIIHRAGNMLNFIKRF FT GQHFRDPYTLKILYVAYVRSILEYCSIVWSPYTKKHEERIESIQKQFLLYA FT LRNLGWSVLPLPSYESRCMLINMKSLKVRRDYAMVSFVNDIVSQRIDSADI FT LSKLNFYTPTRHLRNRYLFTVGHHRTNYAKFSPLNQMMTIYNQHCEMIDVN FT MSRTKLKKYFFDIQNNSN" XX SQ Sequence 4600 BP; 1629 A; 828 C; 805 G; 1338 T; 0 other; tttaagtcgt caaccagttc ggacgcgttt gttaattgct ccagcccacc tgtgcgcgtt 60 ttcatttgtc attaaatttt tattacatca gctttaagta gctacaggtg ccgtgcgcga 120 tattgtttat aatcgaatca gttttttttt gtcacgcgta atgagttgcg aaatttgtgc 180 ttgcgacact tctgtcgact cggtcttgtg gtcatgtgta ggctgcccgc ggaagttcca 240 tgcaaactgc attggtgtca ccgtgcagag gggttcactg cggagaagag acaggaaggt 300 agtggacgtt aactcttacg tactaccatg ctgtgaatcc tgtcaagagc taatccaggc 360 gaaattggat tttaacggac tattggagca acaaaaagtt ctcgaaaaac agctgaatgc 420 caatacggaa gtgatgcaca gactctcgct tcatcaagaa aaaccaaatg ttgtccacga 480 agctattgaa ggtatggaga ttttgcttac tagcgtgaga aatgaactgg cagcaataaa 540 taaaaacagt agcttggctg ggagtgtcgt tgccatcaag aatcatatta ccggccttct 600 tgataccgca ataacggcaa ctactgaaaa tatgtactcc acgctaaaaa cggtgacatc 660 cgatatatct actgacctgc gcaacataaa cgacgaaatt agccatttaa gtcaattatc 720 aattgacaca gcggcaagtt gcgcgatgaa ttcaaatcca atgctggggc tggacatcct 780 cgatgaatta aagtctttgt ccgcaaatat attgacaaac aaaaatacgt cggctccacc 840 atcaatagaa tatccaagtt tggaagccga attgaataat aaaattgctg aagtgtcagg 900 atggcgccta cttggaaaca gaaaagtttg gaaagccgat tggacgcaat atgacatgcg 960 caaaagcttt cacaaaaggc agcaaaatat ggccgataaa gcgaaaaagc gtagaaagcg 1020 gagaatcagg aacgctaaca aaaattgtaa taagaacaac attaacaaca acgacgtgaa 1080 cattaatcac cattgccata ataatgtccc atccaaaatt catcacaaag taagcaaaca 1140 acagaataac aatcacggaa atttcgcctt caacaatagc caggaaagga gtcatcaact 1200 tcctcctgac agagagcttc ttgcggcagc gaaacatcat ttctctagac cgccaacaaa 1260 ttatcgacca accatgcagt ttaaaagagg agaaatttta aacccatatc cggcgaggga 1320 ggcgccccgg cagtccgttg cagtaccacc cccgaactgg acgacggaag gatcttcaac 1380 tggtggatct tgtgaagcat gcggtgcatg ccggcattcg tgttttctac ggaattgacg 1440 gacccctcag aggcagaaaa taattcaatt caacccggtg aagctaatca gcacttggta 1500 acaaataata atgacaattc accatatggt ttaacagact taaataacga actaccaata 1560 atttcttctg aaacacagcc aactgaaatt ttaatctatt gtcagaactt taatcgcatg 1620 aaaagtgctt ttaaaattaa tgaaattcat aaaaatatat taagttcatc gtactcaatc 1680 atattgggta cagaaacaag ttggaatgat gatgtgaaaa gtgaagaaat ttttggtagc 1740 aattacaacg tatttcgcaa tgatcgtgac ttacttttga cacaaagaag atcgggcgga 1800 ggagtcgtcg ttgcaatatc atcaaatttc agttctgaaa tcattgattc acctaaattt 1860 aaagaatttg agcatgtgtg ggttaaatca aatattggta atgaaactca tatttttgta 1920 tcagtttact ttcccccaga tcaggcttgt aagtcaacat acgagagttt tttcctgata 1980 gcagaagaaa tcataactca acttcctccc gaaaacaagg tgcatattta tggcgatttc 2040 aatcaacgta atgcagattt cattcctgat tttgaaaacg agagtattct actccctgtt 2100 gttggcgata atgaaacttt gcaatttatt tttgacaaaa ctgcatcctt aggccttaat 2160 caaataaatc acgtaaaaaa taaacaaaac tgttacctag attttttatt gacaaatatg 2220 catgaagatt tctgtgtaag tgagtcaatt tcaccattat ggaaaaatga agcgtttcac 2280 acggcaatag aatattctat gtttatgcat atttatcata ctcctcacga atatgtgtat 2340 gagaatattt tcgattatag tagagccaat tatactaata ttagattaaa attagataac 2400 gcaaattggc aatctgtttt gagaaatcaa aataatattg aatgtgcaat agaaatcttt 2460 tacaatttat tgtgggaaac cattagggag gaagtacctg ttaagaagag acgaaggaat 2520 cacaactcaa aaaatccaat ttggttcaac aagcaaatca taaatttgaa aaatcgaaaa 2580 caaaaagctc acaaagttta cagaagatat aaaaaaccag acgatttaga aaaatatttg 2640 gttatttgcg accaactcaa tttagccatt tctacagcac ttaccgagta caacataaaa 2700 actgaaaatg aaataaagtc atgtccaaag aattttttca attacgttaa aactaaactc 2760 aaatcgtgca actttccatc aacaatgact ttggatgaaa aagtaggaaa taattctgaa 2820 gatatttgca atcttttcgc aaaatttttt caagaaaatt attcaacatt ttcggaaaac 2880 gatcgagatt attcttactt ttcacatttt gctgactttc cgagtgatgt tggcgttaat 2940 tctataaatg ttcaagacat tttgtttggt cttaaaaatt tggacgccac taaaggatca 3000 gggccagatg aaattccacc cggattcata aaaaacttag caactgaact cacaactcca 3060 ttattttggt tattcaatat gtctcttcaa actggccaat ttccaaaggt atggaaaaaa 3120 tcattcctca taccaatata taaatcaggt aagaaatcgg acattcgaaa ttatcgcggt 3180 attgctatta tgtcatgtat tccaaaactt ttcgagtcaa ttgtaaacaa aaatatgttt 3240 gcccaaataa aaaatcgtat aacaaacgct caacacggct ttttcaaagg tcgttcgacc 3300 actacgaacc ttctggaatt tgtaagttac tcactgagtg caatggataa aggttacttc 3360 gtagaagctc tttacactga ctttagtaaa gcatttgata aacttgacat tccaatgttg 3420 actttcaagc ttgaaaaaat gggaatcgaa atgagtctcc ttaagtggat caagtcctat 3480 ttaaacgacc gtcagcaaat agtaaaattc aatgggaaaa gatcaaatcc aatacatgtt 3540 acatctggag tacctcaagg ctcacactta ggccctcttc tttttatttt atatgttaat 3600 gacgtttctt atattctaaa caaactgagg gtccttatat atgctgacga catgaagcta 3660 tttttagaaa taaagaatga tgacgatcat aatgttttta agaatgagat acaaattttc 3720 tacacgtggt gctgcaaaag tttattggaa ttgaatataa aaaaatgtaa cctcataagt 3780 tatagcagaa aacgaaccac accaaatatg tctattgttt taggaaacga acatgtaaaa 3840 aaatgtgata aaataagaga cttaggagtt atcttagact caaaactatc atttgtagat 3900 cactataatg caataattca tagagcagga aatatgctca atttcataaa acgatttggc 3960 caacactttc gtgatcctta cacattaaaa atactttatg ttgcatatgt aagatcaata 4020 ttagaatatt gtagtattgt ttggtcacct tacacaaaaa aacatgaaga acgtatagaa 4080 tcgatacaaa agcagttttt attatacgca ctacgtaatt taggctggtc agtacttcct 4140 ctaccatcat atgaatcacg atgcatgctt atcaatatga aatcactgaa agtgcgtcgt 4200 gattatgcta tggtttcttt tgttaacgat attgtttcac agcgcattga ttctgctgat 4260 atactttcaa aactgaattt ttatactcca actcgtcact tgcgcaatcg ttacttgttc 4320 acagtaggtc atcatcgcac gaactacgct aaatttagtc cgttgaatca gatgatgact 4380 atatataatc agcattgtga aatgattgat gttaacatgt ctcgaacaaa actgaaaaaa 4440 tacttttttg atatacaaaa caatagtaac tgaggaatgt attatgaatc gagtatactc 4500 aacaaattta taatatataa acacatagat attaagaaca caatgtaatt tagaatggtc 4560 tacaaaatgc ttgacgtcaa ataaataaat aaataaataa 4600 // ID Copia-23_CQ-LTR repbase; DNA; INV; 184 BP. XX AC AAWU01016733; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-23_CQ_; KW Copia-23_CQ-I; Copia-23_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 362-362 (2011). XX DR GenBank; AAWU01016733; Positions 9017 9200. XX SQ Sequence 184 BP; 58 A; 57 C; 22 G; 47 T; 0 other; tgttagcaga tgtccaaccc tgcagtaatc aacatcacac gcacacaacc acacaactaa 60 tgagacagcc ctagctgtca aacatgactt taaataaaat caatccaagt tattcgctca 120 cacagtacgg acgtgtttac tcttttctaa gcctcattcc ctctctccac tcgcaattcc 180 taca 184 // ID hAT-5_AP repbase; DNA; INV; 3133 BP. XX AC Contig22772; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-5_AP. XX NM hAT-5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3133 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(8), 1789-1789 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Aphids have very diverse TEs often present in young single copies CC or very small families. Therefore original sequences are often CC more representative than consensus sequences from this genome. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 3133 BP; 1157 A; 434 C; 494 G; 1048 T; 0 other; gggtggtcct tattttttga attcaaaaaa ttgttgacga gtaccccaaa ttttgttcta 60 tgtattgcaa aaataattat gataaagttt gggcatgata agtctacccc ttcccgtgcc 120 tcaaaggggt tgaagttttt ggtgatgata tatcggtaaa tctatttttg ctctccgata 180 atataatatt tagactatta gctgtgtatt tttagaccac aaaaccacaa acacatttta 240 tctttgacta taacacttat cgcattttta tctacatgcc atagccgtat atttcataat 300 aaattgaata cgtgatagtg attaatagaa ataaaacagt aaaacataat attaaattac 360 cattattagt taatctgtcg gtttgttcag acgttcagtg ttgttcacaa tagtgtcata 420 agttagtgcg acttttctac acaaagtgtt acacatttgt tttttttttt agttttacgt 480 ttgatttaaa aatgtcccgt caatatattt tccttttagg taatatttcc aatcaaatta 540 ctggaaataa gttaccttcc aatggagatt gcctttgggt attattttat aacatgcgag 600 ttgtaaattt aaatttgaat gacagttcgg gacttgttgc tgatgaatgc ttattgtttt 660 ggaaaaaggc aagaatacct actcaatatc attgtgatat agttcgaaaa ataaaaaatt 720 tgtatgaaag ttggcgaagt ttagataaaa ataaaacaag aaagtctgca acacaacagt 780 gcaaagaaaa caattttaaa aactctttaa ataatttgtt cgatattgcc cacaaagatg 840 ctttcaatat aattaaaatt gatgaagaca ggcagtttct aactttacaa agacaagaag 900 gtcgtgtcgg gtatatggca ggtgtggaca aaaaattatg tgcagttgaa gaacgaaaat 960 gtgaacgaga gcaaaggaga gaattgttca agcaaacatc catgatgcga cctggtaaat 1020 taatttttat attaaaaaaa taaataattt atctgattat aataacatta atttacataa 1080 tatacaatat aaaattgact atatctatta ttttatttgt aatattattt ttttcttagg 1140 cccatctcac actgttgatt ttgaggagag tatatcaagc gatgatgaac gtgatgtgct 1200 taaatctaaa gattccgaaa tgtgtgaaag tattataatg ccaattaaaa ataaacgagg 1260 taggaaagaa attatgacta gtcgtttagc atcagcttta gacaaatgta aggttagtga 1320 cagggatgca gttcacttat taatagcatg tgccgaggta tttaatgtaa atgtaaatga 1380 ctatgcaata aatcgctctt cagttaaaag aagtcgtgaa agttttcgtt accaaatatc 1440 ttccgaaatt aaaacagaat ttcatcaatt aaatcttaat tttgcagttg tccactggga 1500 ctcaaaaata cttcctaatt tgattggaac cgaaaatgta gacagattac cagttattat 1560 aacagcacca aacgtagaac aacttcttgg ggtaccacat ttgtcatctg gtactggaaa 1620 ggaaatttct tctgcagtgt atgacacatt aaaagattgg agtatgttgg aaaaagtgca 1680 agcgtttgtt tttgatacaa cagcatccaa cagcggtaga ttaaatggat catgtgtgct 1740 attagaacaa atgttaaacc gaccaatact gtttttggct tgtcggcatc acatatttga 1800 aattattttg caatcagttt tctcatactc aaaacttaca acaatgtctg gtcccgaaat 1860 tcctattttt aaacgtttta aaaataactg gaatcaaata gatcaaacga aatacaacac 1920 ttgggtaagt gataatgagg tcaaaaaaat attgcataaa gtagccgatg atgttataat 1980 attttgtaaa gatactttaa accaaaatct accaagggat gattacaaag aatttttgga 2040 attggtgata atttttttgg gtggggtacc acctaaaagt attcatttta aacgtccagg 2100 agcttatcat ctagctagat ggatgtgtaa gggaatatat tgtttgaaaa tatacatatt 2160 tcaagaacaa tttaaactga caaaagcaga aataacttca ttaaaaacta tatgctgttt 2220 tattgtgaaa tgctatatag agttttggtt taggtctcct aatgctattg aagcacctta 2280 taatgatgta ctttttttaa gaaaacttga agactataag tccgatgata agaaagtggc 2340 tgagcttgct ataaaaaaat ttataaatca tttatggtat ctgggagaag aaaccgcttg 2400 tttttcgtta tttgatgaca ggattgaaaa tcacgtgaaa aaacaaatgg ctcaacaatt 2460 attagaaaat gacgaccttc aagaagacga attgaccaca gaaatacaaa aaaaatatgt 2520 attaaaaaat ggtgatgtct cacaattttt aaaacaagac ttaccacttg aattaataca 2580 caataataca atacataaat aatcattata ttatatatta tttttttttt taatttacag 2640 gaaactttca acctcacaac atatttgtaa gaaactttta tttagtaaat aaatattata 2700 tatagcaaca attatagtac ttcatgtttt gtgcgtaaca ttaatattta agagattaga 2760 ggtcaaaaac taaaaactta taaaatttcc taaaatatta attatgacca gttttttttt 2820 aaatttacgg gattccatca acttaacaac atatttataa gaaactttta tatagtaaat 2880 actaaatata tatagctaca gtaatagtac tttaagtttt ttgcgtaata tcaatattta 2940 agagagtaca cgtaaaaaat ggaaattcat ccccaattag taaaacttca acccctttga 3000 ggcacgggaa gaggtagact tatcatgccc aaactttatc ataattattt tttcaataca 3060 tagaacaaaa tttggggtac tcgtcatcaa atttccaaaa aaaaaaattg acctctataa 3120 ctaaggacca ccc 3133 // ID Gypsy4-NVi_LTR repbase; DNA; INV; 246 BP. XX AC AAZX01001018; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4-NVi; KW Gypsy4-NVi_I; Gypsy4-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-246 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1123-1123 (2007). XX DR Genome; AAZX01001018; Positions 26351 26106. XX SQ Sequence 246 BP; 68 A; 69 C; 57 G; 52 T; 0 other; tgccaagcgt cgaagtgcgc ctgcgcacgc acagaagtag cgagcgacga gaacgaagcc 60 gcgagctcgt gccgagtcta acctacgagc agcgtcacag cgtgtacatg gccgttactc 120 cgaatagaac gtctcaaaag gtgtattagg agcatcgtct gaataaagag gcatcacagc 180 atattcatct gtgtcgacct tttcttctcc ttccctaacc ctcaagtatc acgttcaaat 240 acaaca 246 // ID Gypsy-75_AA-I repbase; DNA; INV; 3938 BP. XX AC supercont1.331; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-75_AA_; KW Gypsy-75_AA-LTR; Gypsy-75_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3938 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.331; Positions 588425 584488. XX CC Positions [1343-1843] - Reverse transcriptase CC Positions [2987-3475] - Integrase core CC 'AGAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 44..3934 FT /product="Gypsy-75_AA-I_1p" FT /translation="MNVKPPEFNVGDSWPLYQERLERFFVAYELNDEADDE FT RRAAFLLTSVSMEVYQIIKNLCFPDKPETKNFAQLCELMRQRFTPTVVVFR FT ERSRFFEARQGDSETVVEWATRLKKLAADCDFGANLSDFIKNMFVVGLRRG FT PIFERVCEEEPSVAYEDLVKLAMKKESTLQQRGMLEVHRIQGDNRKKNETV FT KCFACGKGDHDFRKCQYKSYVCKICDKKGHLAKVCPSREKDAAANRSSGGK FT KRSPKVNHLRLNKVDFLPPVLLKICVNERQIDFEMDTGSPVNAISTGLFHK FT LFSNVKLNTDVQEEFVCYNGSSFRAVGKFYARMRYKAHDSKEEIFVFDGDR FT HPLLGRQTMSSWGLKIDFCALTVAEDHIKKLETILCKHEAVFTGELGCLKD FT MKVHLPLKDEAVPRFCKPRKVPLALKEKVEAELDRLERTGIITKASTADVE FT WGTPLVPVLKKDSSIRLCADYRVTVNPFLKDDHHPLPIIDEIFAALQGGKH FT FSKLDLKNAYYQLEVDEETKKLLSWSTHRGVFHMNRLPFGTKTACSVFQAT FT MERVLQGCRGTVSYLDDVMVTGSTTEEHLQNLDHVLQRFEDAGFLLNKEKC FT EFFKEEVDFLGHYIDKEGLHKDPHKVKAILNVKPPTDAKEVRSFVGLVNYY FT AKFCPHLAHSLKPLYELLKEGVSFSWTKKQQQAFEEAKNLIAGETVLVHYN FT RDLPIKLYCDASNAGIGAVITHVFPDKNERPISFASRVFKKHEGNYSTIDR FT EALAIYYGVTKFYSYLAGRLFILMTDHKPLIGLFSTKGVPETAARRLQRWA FT VFLSSFDYEIQHVKGVDNIPADFLSRFPLASVEDEDKFNDDFAEEDPTIFL FT NFLEQETRSLVERRQLKVESRRDKTVSRVVEYLKSGWPSNILEEEIKKFYS FT KRDELSVEEGVLLWGYRIVIPTKLRKNILMELHSVHLGIVKMKSLARSYFW FT WPSMDKEIEELGRKCELCIQLRPERTDPISPWRLTSSPGDRVHIDHFQFRG FT ADFLVMVDSYSKWIEVFPVRTLTSKETIEKVCEYKSRFGHISTLVSDNGTA FT FSSDEFKNFCMNRGTKHLRTAPYSPCSNGAAENAVKTVKSALTKLSSDPAF FT QKKTVAHRLFSFLEMYRATKHATTNESPFKLMFGREMHIRFDVLKTDNTRR FT QQEASFNQDRTRSVTFEVGEVVYARDYRDPKKPAWVRATVVKKLGTVLYEC FT EAASIGLIKRRSHQLLKYPYDDFDDEHSGATTNARERENAESSDESVYGDD FT FVEVEEPPRVGQRQPDGTYVTRSNRVVRPPRYLEGQ" XX SQ Sequence 3938 BP; 1090 A; 821 C; 1065 G; 962 T; 0 other; gtttggcgac gaaaggaaaa aatcaaattt tcaagattcc gtcatgaatg tgaagcctcc 60 ggagttcaac gttggcgatt cctggccact ttaccaggaa cgcctggaaa gattcttcgt 120 ggcctacgaa ctcaacgatg aagcggacga tgagaggcga gctgcgttcc tgctcacatc 180 cgtgtcaatg gaggtatacc agattatcaa gaatctgtgt tttccggaca agccagaaac 240 caagaatttt gcccagctct gcgaattgat gcgccagcgg tttaccccta cggtggtggt 300 ttttcgagag cgttccagat ttttcgaagc acgtcaaggc gacagtgaga cggtggtcga 360 atgggctacc cgtttgaaaa agttggcagc cgactgcgat tttggtgcca atttgagcga 420 cttcatcaag aatatgttcg tcgtaggatt gcgtcgtggc ccaatttttg aaagagtttg 480 tgaagaggaa ccttcggtgg cctacgaaga tttggtgaaa ctggctatga agaaagagtc 540 aactctacag cagcgtggta tgctagaagt gcaccggatc caaggagata acaggaagaa 600 gaatgaaacc gtgaagtgtt ttgcttgcgg aaagggagac catgattttc ggaagtgcca 660 gtataagagc tacgtttgca agatctgcga taaaaaggga catttggcca aggtttgtcc 720 atcaagagag aaggatgcgg cggcgaatcg gagttccggt gggaaaaaaa gaagcccgaa 780 agtgaaccat ctgcgtttga acaaggtgga cttccttccg cccgtgctgc tgaaaatttg 840 cgtcaacgag agacagattg acttcgagat ggacacagga agtccagtga acgccatttc 900 tacgggactc ttccataagc tattttccaa cgtgaagcta aatacggatg tccaagaaga 960 gtttgtgtgc tacaacggca gtagtttccg ggcggtcgga aagttctacg ctcgtatgag 1020 gtacaaggcc cacgattcca aggaggaaat tttcgttttc gacggtgacc ggcacccatt 1080 gctgggacga cagaccatgt ccagctgggg tctgaagatt gatttttgcg cattgacggt 1140 tgctgaggac cacatcaaga agctagagac gattttgtgc aagcatgaag ctgttttcac 1200 gggagagcta ggctgtttga aggatatgaa ggtccatttg ccactgaagg acgaagcagt 1260 accaagattt tgtaaaccaa ggaaggtgcc attagctttg aaggaaaagg tggaagccga 1320 gttggatcgt ttggagcgta caggtatcat taccaaagca tccacggcgg atgtggaatg 1380 gggaacacca ctagtacccg ttctgaagaa ggattcatcg attcgtcttt gcgctgacta 1440 tcgtgttaca gtcaatccgt ttctgaagga cgaccatcat ccgttaccta tcatcgatga 1500 gatttttgcg gctttgcaag gaggtaagca cttttccaag ctggacctca agaatgctta 1560 ttaccagctc gaggtcgatg aggaaaccaa gaagcttttg tcgtggagca cccatcgagg 1620 cgtgttccat atgaatcgtt tgccgtttgg aacgaagacc gcttgttccg tgtttcaagc 1680 tacaatggag agagtgttgc aaggctgccg tggaactgtc agttacctag acgatgttat 1740 ggtcaccggg tccactaccg aggaacatct acagaacctg gatcatgttt tgcaacggtt 1800 cgaagacgct ggatttctgt tgaataaaga aaaatgtgaa tttttcaagg aagaagtgga 1860 tttccttggc cattacatcg acaaggaagg gttgcataag gacccacata aggtgaaggc 1920 catattgaac gtgaagccac caacagacgc taaggaggtc cgttcctttg tgggactcgt 1980 gaattactac gccaagtttt gtccccattt ggcacacagt ctgaagcccc tgtacgagct 2040 gctgaaggaa ggtgttagtt tctcctggac taagaagcaa cagcaagcgt tcgaggaagc 2100 taagaatctc attgctggag aaacggtgct ggttcactat aatcgagatc ttccgatcaa 2160 gctgtactgc gatgcatcga acgcaggaat cggagctgtc attactcacg tttttccgga 2220 taaaaatgag cgtccgatat cgtttgcttc aagagttttc aagaaacacg aaggcaatta 2280 ttcaacaata gaccgggaag ctttggccat ttactacgga gtcacgaagt tttacagtta 2340 tttggccggt cgacttttca ttttgatgac cgatcacaag ccgctgattg gattgttcag 2400 cacaaagggt gttcctgaaa cagcagcgag acgacttcaa cgatgggccg tgtttttgtc 2460 aagcttcgac tatgaaatcc agcacgtgaa aggtgtggac aatatcccgg cagattttct 2520 ttcccgtttt ccgttggcca gtgtcgaaga tgaagataag tttaacgacg attttgcaga 2580 agaagacccg acaatatttc tgaacttttt ggaacaagaa actcgttcgt tagtcgaaag 2640 gaggcagttg aaggtcgaga gtcgtcgtga caagactgta agccgtgtgg tggagtattt 2700 gaagtctggt tggccatcga acatcctgga agaagagatc aagaagtttt attccaagcg 2760 agatgaactc agcgttgagg aaggtgtcct gctttggggt tatcgcatcg taattccaac 2820 caaattgcgg aagaacatcc tgatggagtt acactcggtt cacttgggca tagtcaagat 2880 gaagagcctt gcgcggtcat atttctggtg gccgtcaatg gacaaggaga ttgaagagct 2940 tggacgaaaa tgcgaattgt gtattcagtt acgaccagag cggacggatc caatatcacc 3000 gtggagactt acaagttccc ctggggatcg agttcatatt gaccatttcc agttcagagg 3060 tgccgatttt ctggttatgg tggacagcta cagcaagtgg atagaggtgt tcccagtgcg 3120 cacgctgaca tcgaaggaga cgattgagaa ggtctgtgaa tataaatcac gttttggaca 3180 catttccact ttagtctcag acaacggtac ggcgttttcc tcggacgaat tcaagaattt 3240 ctgtatgaat cgaggaacca aacatctgcg gacggcacca tatagcccgt gctccaacgg 3300 ggcggctgag aatgcagtga agacggttaa gtcggccttg acgaagcttt cttctgaccc 3360 agcatttcag aaaaagacag tcgctcaccg tttattttcc tttttggaaa tgtaccgtgc 3420 aacgaaacat gctactacga acgaaagccc tttcaagctc atgttcggga gggaaatgca 3480 cataaggttc gatgtactga agacggataa taccaggcgg caacaggaag cgagttttaa 3540 tcaagatcgt acaagatccg tgacttttga agttggtgaa gtggtgtacg caagagatta 3600 ccgtgatccg aagaaaccag cctgggtgcg agcaacagtc gtgaagaaac tgggtactgt 3660 cctgtacgag tgcgaagcag cgtcaatagg tttgatcaag agaagaagcc accagctgct 3720 gaagtatccc tacgacgatt tcgacgacga acacagtggg gctacaacca atgcgaggga 3780 acgtgaaaat gctgaatcca gcgacgagtc ggtgtacgga gatgatttcg tcgaagtaga 3840 agaaccacct cgagtcggtc aacgacaacc agacggaaca tacgtgacta gatcaaatcg 3900 agtagtgcgc cctccgcgct acttagaggg gcagtagt 3938 // ID Gypsy6-I_AP repbase; DNA; INV; 4577 BP. XX AC Contig9985; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6AP; KW Gypsy6-I_AP; Gypsy6-LTR_AP. XX NM Gypsy6-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4577 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 447-447 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [2007-2513] - Reverse transcriptase CC Positions [3573-4049] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 837..4328 FT /product="Gypsy6-I_AP_1p" FT /translation="MCRELQLNFCETKQQIIEGLYSRELCLYLMSRSHSHE FT NELLNDIVIFTKINDSRSTRFKNLSHPTHPEATTKNSNCQPEVATNPSTTN FT TNSKLSFQKKKTRCFNCGSFDHISTGCTQPKRRPGSCFTCGSIEHQITACP FT QGKSKPQAVKTQNCSSSATMMLHPSDMVTPAYFINVDLKISDKYVSNVLAM FT IDTGSPVSLLKEKLYPLECTPLMPPSNSGIVGINGSELIVLKQSFVDIYPP FT DTNEPINIKLNIVPDNTINYDCLLGRNFLSHPRVLFTINDGKFEIEFKRND FT IIPFSEIYNLEPTNEQSNIKEPELDIETTLSYNTQNKIKDIIITKYISPAT FT TNHELNTVNVPFSEIKIELKDLSVFYFNPRRLSYFEKNKLQDIIDNLLEKK FT IIRPSCSEFSSPIVLVKKKNGELRLCIDYRELNKRTVRDRYPLPLIDDHLD FT LLRDKNYFTCIDLKDGFHHIVVEENSRKFTSFTTPLGQYEYCRMPFGLCNG FT PSKFQRYVNNIFSEQIKAKKIVVYFDDIVIATETVEAHLDILSDVLSLMKL FT YNLQIRFDKSQFLKKEIIYLGYLVNSSGIRPNPRNVSVILKYPIPCNQKAL FT HSFIGLVSYFRRFIPNFSTIAKPLYDLLKKDTVFVFGEDKLAVFETIKQKL FT SEQPILCLYNPNAETELHCDASSLGFGSILLQKQADNKFHPVFYFSQRTTS FT VQSRYHSYELEMSAIINSIKRFHVYLQGIKFKIVTDCNSVTLTLKKKEINP FT RIARWALFLQNYDYEIQHRSSNRMQHVDALSRNHHILVLEGCTFNQMLAIN FT QCTDPAIKEIYKLLQNSENQFYELRNGLVYRKSDGRLLFYVPVNMRDQVIR FT SCHDDMGHVGMNRTIELIKRVYWFPKMTDCVKSHIENCLKCIVFSPKVGKA FT EGFLKLIEKGTKPFHTIHIDHYGPLNKTVGCFRYIFVIVDAFSKFLTLYPV FT RTVNTKEACSKLIEYFSYYSKPIRIVSDRGSCFTSHAFKDFCAIHDIQHVL FT IAAGSPQANGQVERYNHTLKVMLSKLLHEEDQNWNKHLNKVQFAINNTFNR FT AIKNSPSNLLFGMNQHGDTHDYLRLILESDNLQNCERDLGKIRDVAQDNNL FT DSQLKNKAYYDSGHRPAHQYSIGDLVMIKNVDTTPGSSKKHIPKFKGPYRV FT KKSIR" XX SQ Sequence 4577 BP; 1586 A; 790 C; 752 G; 1449 T; 0 other; aaattcagaa gttggattcg aaggagatat cttattgtta cggattctca caattgaagt 60 ttgaacagtg aatagtaagt tgacgaattg ttatgtcaat ctattcatta ttgctaatta 120 atgatggacc aacacgtgga ttcagaaggt aagattgacc ataacctaca aaattgtttc 180 gaccataatt acactagaag taatgatgct aacaacattg ctatcagtgg tgcagccaat 240 agtccgaatt tttcttcgac gtcgtccagt gttgctgcac ctgatgtacc caacaacact 300 cgtaatccgt tatcaaacga tgtcaatgtt aacaacaaca gcttaccccc acaatcgaat 360 aatatgatat cagatacaaa ttctgttgtg atggcactaa ttgaacaaaa tcgattactt 420 atggaacaac tgatgtcaca tcgttccaca acaccaagtt catctattca aagtaacata 480 tcgaacggct attatgtgat gcctaatttc catgaatcat tgccgaactt catcggaact 540 gagtcataca ttgaagcatc aaattggatc aagagcataa aatcgaccgc tgatcttcat 600 aactggcctg attcattcaa attggagatc atacggacta aacttaaggg tgctgctcat 660 aactggtacc ttggacgaac attctctgac tgggagcaat ttgaaagaca gtttaaagag 720 actttcattg gtacacaaac ttcaacagta gaacgtacaa aattgttaat tgctcgtcac 780 cagcggaaag gtgaaatgat aatagaatat ttccatgaca atattttacg gcacgtatgt 840 gtcgtgaact tcaacttaat ttttgtgaaa ctaagcaaca aattattgaa gggttgtatt 900 cacgtgaatt atgtttatat ttaatgagcc gaagtcattc tcatgaaaat gagcttttaa 960 atgatattgt tatttttaca aaaatcaacg attctcgaag cacgcgattc aagaatttaa 1020 gtcatccaac acatccagaa gcaacaacca aaaatagtaa ttgtcaacct gaagtagcaa 1080 caaacccatc taccactaat acaaactcca agctgtcgtt tcaaaagaag aaaacgcgat 1140 gtttcaactg cggttcattt gaccacatct caactggatg tactcaacct aaacgacgac 1200 ccggttcttg ttttacatgt gggtccattg aacatcagat cactgcctgt ccacaaggca 1260 aatcaaaacc tcaagcagtc aagacccaaa attgttccag tagtgctaca atgatgctgc 1320 atccatcgga tatggtaaca cctgcatatt ttattaatgt tgatttaaaa atttctgata 1380 agtatgtctc aaatgtttta gctatgatag acactggtag tccagtgagt ttacttaaag 1440 aaaaattgta tcctttagaa tgtactcccc ttatgccccc atctaattcg ggtattgtag 1500 gaattaatgg atctgaactt attgtactta aacagtcctt tgtcgatata tatcctcctg 1560 atactaatga accaattaat ataaaattaa atattgtacc tgataacact ataaattatg 1620 attgtttgtt aggtagaaat tttctgtcac atccgagggt tttgtttaca attaatgatg 1680 ggaaatttga gattgaattt aaacgtaatg acatcatacc ttttagtgag atatataatt 1740 tggaacctac taatgaacaa tctaacatta aggaacctga actagatatt gaaacaacac 1800 tgtcatataa tacccaaaac aaaattaaag atattattat tacaaaatat ataagcccag 1860 ctactacaaa tcatgaattg aatactgtta atgtaccttt ttctgaaatc aaaatagaac 1920 ttaaggatct tagtgttttt tattttaacc ctagacgttt atcttacttt gaaaaaaata 1980 aactacagga tattattgat aatctattgg aaaagaaaat cattaggccc agttgttctg 2040 aattcagtag tccaatagtg ttagtaaaaa aaaaaaatgg tgaactacgt ttgtgtattg 2100 attataggga actgaacaag agaaccgtaa gggataggta tccccttcca ctcattgacg 2160 atcacttaga cttattgcgt gataaaaatt attttacttg tattgatctg aaagatgggt 2220 tccatcatat agttgtagaa gaaaattcta ggaagtttac ctcatttacc acacctttag 2280 ggcaatatga atattgtaga atgccttttg ggttatgtaa tggtcccagt aaattccaac 2340 gttatgtgaa taatatattt tctgaacaaa ttaaagctaa aaaaattgtt gtttattttg 2400 atgacatagt gattgcaacc gaaacagtag aagcgcattt agatatatta tctgatgttc 2460 tatccctaat gaaactatat aatttacaaa taagatttga caaaagtcag tttttgaaaa 2520 aagaaataat ctaccttggt tatcttgtca attcttctgg cattcgtccc aatcccagaa 2580 atgtatctgt tattttaaag tatccgattc catgcaacca aaaagcgctc catagcttta 2640 ttgggttagt gtcgtatttc cgtaggttta tccctaattt ttctactata gcaaaaccat 2700 tatatgatct tctaaagaaa gatacagtat tcgtctttgg tgaagacaag ttagcggttt 2760 ttgaaaccat taaacaaaag cttagcgaac aaccaatttt atgcttatac aacccaaatg 2820 cagaaactga gcttcattgt gatgcctcga gtttaggatt cggatcaata cttctccaaa 2880 aacaagccga taataagttt cacccggttt tctattttag ccaacgcact actagtgtac 2940 agtctcgtta tcacagctat gagttagaaa tgtcggcaat aataaactca ataaaaagat 3000 tccatgttta cttacaaggc attaaattca aaattgtaac tgattgcaac agcgtaacat 3060 tgacattaaa gaaaaaagaa ataaatccta ggattgctcg ttgggcactg ttcctacaaa 3120 actatgatta tgaaatacaa cataggtcat ctaaccgaat gcaacacgtt gatgcgttaa 3180 gtaggaatca ccacatacta gttcttgaag ggtgcacatt taatcagatg ctggcaataa 3240 atcaatgcac tgacccagct attaaggaaa tttataagtt gttacaaaat tctgaaaatc 3300 aattttatga acttcgaaat ggattagtct atcggaaatc cgatggaaga ttgttattct 3360 atgtaccagt taatatgcgt gaccaggtca tcagaagctg tcatgatgac atgggccatg 3420 taggtatgaa tagaacgata gaattaatta aacgggtata ttggttcccc aaaatgactg 3480 attgtgtaaa aagtcacatt gaaaactgtt tgaaatgtat agtgttttct ccaaaagtag 3540 gcaaagcaga aggtttcctt aaactaattg agaaaggcac taaacctttc cacacaattc 3600 acattgatca ctatggaccc ttaaataaaa cagttggatg ttttagatat atatttgtaa 3660 ttgtagatgc atttagtaag tttttaactt tatatccagt ccgtacagtt aataccaaag 3720 aagcctgttc aaaattaatt gaatattttt catattacag taaacctatc cgtatagtat 3780 ctgatcgcgg atcttgcttc acctctcatg cgtttaaaga tttctgtgct attcatgaca 3840 ttcaacatgt ccttatagct gctggttccc ctcaggctaa tggccaggtt gaacgttata 3900 atcatacact taaggttatg ttgtcaaaat tactccatga agaagatcaa aattggaaca 3960 aacatttaaa taaggttcaa tttgccatta acaatacttt taaccgtgca ataaaaaatt 4020 cacctagtaa tcttttattt ggtatgaacc aacatggtga cacccatgat tatctaagac 4080 ttatattaga gtcagataat ctacagaatt gtgaaagaga cttaggaaaa attagagatg 4140 ttgcacaaga caacaaccta gactcacagc tcaaaaataa ggcatattat gattcaggtc 4200 atcgtcctgc ccatcaatat tccattggtg acctggtgat gataaaaaat gttgacacaa 4260 ccccaggttc tagtaaaaaa cacattccta aatttaaagg gccttacaga gttaaaaaaa 4320 gcattaggta acgataggta tgtcctcaat gatgtcgaag ggtttcaagt cactcagact 4380 ccctttgatt cagtatatga gagtaaacac atgaaactct ggattaaaat tcaaagttag 4440 tattttctat acccacagtt ctattatgtt ttgattatct caaattctta tttaatattg 4500 ttttgttaat tgatataatt tgtttattta atattgttat tgtttaaatt agttgcttgt 4560 taattgtata ttgataa 4577 // ID CR1-84_AAe repbase; DNA; INV; 4386 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-84_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4386 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1172-1172 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 311..586 FT /product="CR1-84_AAe_1p" FT /translation="MVKANLALDANPEVVKLVAKEKDINTLSFVSFKIGLD FT PSLKTKALNPNTWPEGLLFREFEDYAQKFRFPLKSRRPMTPLLHPPAVTPV FT MDLS" FT CDS 604..4281 FT /product="CR1-84_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSKQPSSMQTSFANQPSPGCMPASCMEAPSPLITVEP FT LLPATLSHPGPVYEVGGGVFQTPNAGKYSYVLNNSLPEFSDVCRHHLSYKD FT NASLHSVGNIPLRSTQPGCMPASCMEAPSPLITVEPHLPATRSHPGSVYDV FT GGGVFQTPNAGKYPYVLINSLPELSDLHRHHLRNRDNASHQSVENYQLHST FT QPGCTPASLMEAPNPLPTVEPLLPATHSHPGPVCEVGGGVFQTPNAGEYSS FT DMSNSLPEFFNIRSFSSCADSNSIHQSAMNRQLSQNRTLHSPDDVMIYYQN FT VRGLRTKLDDFYTAVNSSSYDVLVLTETWLDDVIESRMLFDERYTTYRCDR FT NSLTSNKKRSGGVLIAVLNKFPSSVIPVSDLSTEHVWISVKLSRYSLIIGG FT IYLPPDVSNDHLVIRRHVACVEEISMSMKDGDNIIIFGDYNLPGLRWTRSN FT RLYCTVDIPNSTLSRSCTIFLDGMYSNGIEQINDMKNHAGNTLDLVLLKDL FT IDGKFDLEEAVDPIVPVDPAHPPLVLTIHGATGALISTADECDLSAFDFKR FT ADFETLTHRLIGIDWSTVYRCPDIDGCVDKFTQILHHTFTECVPLLRPPSK FT PPWSCRELRIRKKERNAAHRKYRTDRSSLNKATFKACSQRYKVLNNRLYRR FT YAKKVERNMKRNPKQFWNFVNSKRKDNGLPSVMHMDDRMANTSDEKCFLFA FT EFFESVFENSDTNVSNERALTHVPRDAIDANTFHITAEDLTKALCKLKLSY FT QPGPDGVPACILKKCFTGLIVPLLHIYNMSMQQCRFPTLWKSSFMFPVHKK FT NDKNNVRNYRGITSLCACSKLLEIVVSSYMFSRVKQYISVKQHGFFPGRSV FT TTNLVEFSSLCVKNIGRGKQIDTVYTDLKAAFDRVSHSILLAKLEKLGVSV FT QLVAWFKSYLCDRKLAVKIGSSVSSWFSNGSGVPQGSNLGPLLFSLFINDV FT VLVLGEDFCILYADDMKIYKIIEHISDCYELQSLLDRLIEWCGANKMTLSV FT PKCSIISFHRKKQPIMFRYAFGEEFLSRVDFIRDLGVILDTEFTFKYHYEE FT IVKKARRQLGFMSKITKEFRDPYTLKSLYVGLVRPILESSSIVWDPYHATM FT IDRIEAVQRKFLRFALRLLPWNDAVNLPPYEARCQLLQIEPLQLRRENAKA FT VFISKVLTGELDAPNLLGSLDINVPAYTLRSNDFFRLPRRLHAYDSNEPIR FT SMMNIFNQMYHNIDFI" XX SQ Sequence 4386 BP; 1216 A; 1038 C; 913 G; 1219 T; 0 other; ccactggtct acgcttacta ctgcgatctc tgagcttcgg aatgagatca aacaaatcaa 60 tgcgaaacca tcaccccaat tattgacgcc caataacaac gcttggccct cgatagattg 120 gcgccgaccc agtaaacgat tgcgtaatca tgacaccgca acaagggcat ctgagacatg 180 tcttgttgga agcaaatagc cgctagctga tgtcgtttcg gttccaatat gtgacaataa 240 ccgtgatcaa aaattttggc tatatttgtc gaggattaga ccggacgtgg ctaacgatac 300 agtttgtgcc atggttaagg ccaaccttgc acttgacgct aatccggaag ttgtgaagct 360 ggttgccaaa gagaaagaca tcaacacact aagctttgtg tctttcaaga ttggtctaga 420 cccatcactg aaaactaagg cattaaaccc aaatacttgg cctgagggtt tgttgttccg 480 cgaattcgag gactacgctc aaaaatttcg gtttccgctg aaatccagaa gacctatgac 540 tccgttactc catccacctg ccgtcacccc cgttatggat ctaagctaaa aattgaaatc 600 atcatgtcaa aacaaccatc ttcgatgcaa acatcgttcg caaatcaacc ttcaccggga 660 tgcatgcctg ccagttgtat ggaagcccca agtcccctca tcacagtcga gcctctcctg 720 ccagcgaccc tcagtcatcc cggtcctgtg tatgaggttg gaggaggggt cttccaaacc 780 cctaatgcag gcaagtactc atatgttttg aacaattctc ttcctgagtt ttccgacgtt 840 tgcagacatc atttgagcta taaagacaac gcttcacttc attctgtggg aaatattcca 900 ttacgttcaa ctcaaccggg atgcatgcct gccagttgta tggaagcccc aagtcccctc 960 atcacggtcg agcctcacct gccagcgacc cgcagtcatc ccggttctgt gtatgatgta 1020 ggaggagggg tcttccaaac cccaaatgca ggcaagtatc catacgtttt gatcaattcg 1080 ctgcctgagt tatctgatct tcacagacat catttgagaa atagagacaa cgcttcacat 1140 caatcagtgg aaaattatca attacattca actcaaccgg gatgcacgcc tgccagcctt 1200 atggaagccc cgaatcccct ccccacagtc gagcctctcc tgccagcgac ccacagtcat 1260 cccggtcctg tgtgtgaggt tggaggaggg gtcttccaaa caccgaatgc aggcgagtat 1320 tcatctgaca tgagcaattc gcttcctgaa tttttcaaca ttcgtagttt ttcatcgtgt 1380 gccgatagta actctataca tcagtcagca atgaacagac aactcagtca aaaccgaacc 1440 ctgcattccc ccgacgacgt gatgatatat taccaaaatg ttcgagggct gagaacaaaa 1500 ctggatgact tctacaccgc cgtgaacagc tcttcgtacg atgtccttgt actcactgaa 1560 acgtggctag acgatgttat cgaatctcgg atgctattcg acgaaaggta tacgacgtat 1620 cgttgcgacc gaaattcatt aacgagtaac aaaaaacgat ccggtggagt gctaattgct 1680 gtgctcaaca aattcccttc gtcagtaatc ccagtgtcag acctgtctac agagcatgtt 1740 tggatctccg ttaagctcag tcgatatagt ctaattatcg gcgggatcta tctaccaccg 1800 gatgtgtcaa atgatcattt ggtgattcgc cgtcatgttg cctgtgtaga ggagatttcg 1860 atgtcgatga aagacggtga caacatcatt atctttggag actacaactt acctggttta 1920 cggtggacac ggtcgaatag gctttattgt actgttgaca tacccaactc gacactttca 1980 cgatcatgca cgattttttt ggacggaatg tattccaacg gtattgagca aattaacgat 2040 atgaaaaatc atgcaggtaa tacgctcgat ctcgttcttc taaaggattt aatcgatggg 2100 aagtttgatc tggaggaagc tgtcgatcct attgttccgg ttgacccggc tcatccacct 2160 cttgttctca ccattcacgg cgcaaccggg gctcttatta gtacagcgga tgaatgtgac 2220 ctctccgcgt ttgattttaa aagagccgac tttgagacat tgacacatcg gctgatagga 2280 atcgattggt ccactgtata ccgttgtcct gacatcgatg gttgtgttga caaattcacg 2340 cagattcttc atcacacttt cacggagtgt gtgcctttgc tacgcccccc ttcgaagcca 2400 ccatggtctt gtcgtgaact tcggatacgt aaaaaggaac ggaatgctgc tcatcgaaaa 2460 taccgcactg atcgatcatc gttaaacaag gccactttta aagcttgcag ccagagatac 2520 aaggttctga acaaccgtct ttatcgacgt tatgctaaaa aagtcgaacg caacatgaaa 2580 cgtaacccta agcagttttg gaactttgtg aattccaaac ggaaggataa tggtttgccc 2640 tcagttatgc atatggatga ccgcatggct aacacttctg acgaaaaatg cttcttgttt 2700 gctgagttct tcgaatcagt ttttgaaaac tcggacacta atgtgagtaa tgaacgtgcc 2760 ctaacgcacg taccgagaga tgccatagac gctaacactt tccacattac tgcggaggat 2820 ctcaccaaag cgttatgcaa gctcaagctt tcctatcagc ctggtcctga tggagttcct 2880 gcctgcatcc tgaagaaatg tttcacggga ctaattgtac ctcttctgca catctacaac 2940 atgtcaatgc agcaatgtcg ttttccaaca ctgtggaagt catcgttcat gtttccggtg 3000 cacaaaaaga atgacaaaaa taatgtccga aactaccgag gaatcacttc tttatgtgct 3060 tgctcaaaac tgttggaaat agtagtgtcc agctacatgt tttctcgtgt aaagcagtac 3120 atcagtgtga aacagcatgg ctttttcccg ggacgaagtg ttactactaa cttagtggaa 3180 ttttcgtcac tttgtgtgaa aaatatcgga cgtggcaaac agatagacac tgtttacact 3240 gatttgaaag cagctttcga tcgtgttagc cattcgattt tactggcaaa gctagagaag 3300 cttggtgttt ctgttcagct cgtagcttgg ttcaaatcat acctgtgtga caggaaactc 3360 gcggtcaaaa ttggctcatc tgtctccagc tggttctcga acggttcagg tgtacctcaa 3420 gggagtaacc ttggcccatt attgttttcc cttttcatca atgatgtcgt cttagttctc 3480 ggtgaagact tctgcatact ttacgccgat gatatgaaga tttacaagat tattgagcac 3540 atttctgact gttacgagtt acaatcgctt cttgaccggc ttattgagtg gtgtggggca 3600 aacaaaatga cactcagtgt tcctaaatgc tctatcatta gtttccatcg taagaaacaa 3660 ccgatcatgt tcaggtatgc gttcggtgaa gaatttcttt cacgagttga ctttatacgt 3720 gacctgggcg ttattcttga caccgaattc acctttaagt atcattacga ggaaatcgtg 3780 aaaaaagctc gtcgtcaact aggatttatg tccaaaataa cgaaggaatt tcgcgaccca 3840 tacactctta aatcattgta tgtaggtctg gtccgtccta tactggaatc atcctctatt 3900 gtttgggatc cctatcatgc aacaatgatt gaccgaattg aagctgtgca gcgcaaattc 3960 ttaagatttg ctttacggtt gctcccttgg aatgatgctg tgaacctgcc tccgtacgaa 4020 gctagatgcc agcttctgca aatcgaacca ttacaactac gcagagaaaa tgccaaggct 4080 gtgttcatct cgaaggtcct gactggggaa cttgatgctc ctaacttact gggatcgtta 4140 gatataaacg tgccagcata tacgctccgc tccaatgatt tctttcgctt accacgacga 4200 ctccacgcct acgattcgaa cgagccaatt cgatcaatga tgaacatttt caatcaaatg 4260 tatcataaca tagattttat ttaataatga cgtttatact ctaactatag attcattaag 4320 acacacaata ttgtcagatg aatacaccac caaataaata aataaataaa taaataaata 4380 aataaa 4386 // ID L2-1b_Cis repbase; DNA; INV; 5653 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE CR1 Non-LTR Retrotransposon from Ciona savignyi. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-1b_Cis. XX NM L2-1b_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5653 RA Smit A.F.; RT "L2-1b_Cis - CR1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000020, Ci000056, Ci000120; 94% identical to L2-1b; on average CC 1% diverged only. XX FH Key Location/Qualifiers FT CDS 112..792 FT /product="L2-1b_Cis_1p" FT /translation="MNTMESALMKKLEELDTALAQIKSKQDKYESNCNLRT FT DTAPTVPVSDSSHPKELQEIRKSLNDLEASLKDLKAVVRDTDERLDNLEQY FT GRRNCLIIHGCKRIPQDNFLSYVLSILNRLKLPYTISKAAIDIAHVLPSKR FT DSTPIIVKFVQRMVRNDVYDSKKHLKGTGMSMTESLTLRRLRIVEKAREAF FT GFRSVWTNNGVIFTVHQNTRRVIHRLSDISSILARSK" FT CDS 864..4202 FT /product="L2-1b_Cis_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MQTISIRKNACLNCSKTIRVNQNFIYCDNCNANFHCK FT CLPTPPKQQLNIKKNVTTLHHKWFCDNCQDPPCLPFSNIPDLELSSLLSIS FT SSQTNPDQVTRLTLDPEKLNNFFDNLNENMLDEEDELTSIRCPESYVNTNQ FT CKSFLFEKHQGLTILSLNIRSLANPNNFTKLEALVASLQFKPEVIAITETW FT IVDNRSGHYSNLPGYVFVGNSRSKSRGGGVGMYIRSHLEFQIKDCISIMEE FT KVFESLFVVFPTLTPHXKPNPKSSLICGVIYRSPKLDKQSNSVFMHKLSES FT LEKIDNRNCNCIIVGDFNYDLLNINNNTVTDFTNTMQDFCYESIINKPTRF FT TDSGATAIDHIWTTIKPPLVKACILTDTLSDHLPVILALRPNSENNSKSNG FT SQSSSRSFSDANIVSFNNNLQHLNTDNIFCESDPNLAYSNLIEQYNTEFEN FT SFPLVSNKAPKYHLKWFNEEVKELNKTKQKLYKKYILKRTVSSKLEYNKAR FT NLYHHKIQAAKKCQYRKIFKTNRNNIKATWRTINELLGKSKPQISKSFEID FT GLSTSNPIHIANHFNDHFSNVATELVNRIPPSTSHHTDYLKSQSSSTMYIF FT PTSPHEIKTIILDLKSKSSSGLDHIPSKVVKSTPVNILQVLSYIFNLSLQS FT GKFINDFKIAKVIPVFKKGKHSLINNYRPISLLPSFSKIIEKLMYNRTHSF FT LSHNNVLHNNQFGFRKAHSTSHASNLLVNIISDHLENKKSVIGVFLDLSKA FT FDTIDHDILLNKMSHYGIRGVALDWFTSYLSNRQQIVDFNGTLSSNLNPVK FT LGVPQGSILGPLLFLIYINDLPNCLTHSKAIMFADDTTIFTPGRNQSITCD FT NANADLDRLYTWLSSNKLVLNTDKTKYMYFSNSTKSTSLPPPLISINNYKI FT EQVNSFKFLGLTLNDNLSWKPHMQILHKKLNTNLLMVRKIKPLVDQPSLLT FT LYHSLILSHIHYCISTWCYGNNQMLTKLQRVCNKFIRLIFNLGKRENTVSI FT MRQHGLLTIHDMHKLEILSIMHKCHNNTLPPALHNCIPNKPKTTMKTRSNS FT QFLIPFCNKSLSQQSLKYIGPKFWQQLPNNIREIKSLKKFTKVVKQHLLED FT PHPLP" XX SQ Sequence 5653 BP; 1832 A; 1193 C; 822 G; 1804 T; 2 other; attctatttg aataaggttg catgagcgaa taggattttt caaataactg ggcatattta 60 ttaaatcttc tgataccgtg ctgaaagtga actctctgac aagcaagaaa tatgaataca 120 atggaatctg cacttatgaa gaagttggag gaactggaca ccgcactagc tcaaattaaa 180 agtaaacaag ataagtatga gtccaattgc aatttaagga cagatactgc tcctactgtt 240 cctgtttcag attcttctca tccgaaagaa ctgcaggaga ttcggaagtc gctgaacgat 300 ctggaagcct ccctaaaaga tctaaaggct gtggttagag acacagacga gcgacttgac 360 aacttggagc agtatggcag gaggaactgt cttattatcc atggctgcaa aagaattccc 420 caagacaatt tcttaagtta tgtgctatcc attctaaacc gccttaaact gccatatacc 480 atttcaaagg cagcaatcga catcgcacac gtgctaccat cgaaaaggga ctccacacct 540 ataatcgtga agtttgtaca acggatggtg aggaacgatg tgtacgactc caagaaacac 600 ctgaagggga cggggatgtc gatgacagaa tcactgaccc tgcggcggct ccgaatcgtg 660 gagaaagcca gagaggcctt tggctttcga agcgtttgga ccaacaatgg ggtcatattc 720 actgtacacc aaaacacaag gcgagttata catagattaa gtgatatatc ttctatcctt 780 gctaggtcta aataatttaa tcatttgtca atttcagtgc catatttttg cactatgcct 840 agtataaatc ctagctcctt attatgcaaa ccatatccat aagaaagaat gcctgtttaa 900 actgcagcaa gacaatccgt gtaaatcaaa actttattta ctgtgacaat tgtaacgcaa 960 attttcattg caaatgcttg ccaacaccac caaagcaaca gttaaatatt aaaaagaatg 1020 tcacaacttt acaccacaaa tggttttgtg ataactgcca agatccaccg tgccttccat 1080 tctcgaatat tcccgatctg gaactttctt ctttactttc catctcctcc tctcaaacca 1140 accctgacca agtaacccgc ttgaccctcg acccagaaaa attaaataac ttttttgata 1200 atttaaatga aaatatgcta gatgaagagg atgagttgac ttccattagg tgcccagaat 1260 catatgtcaa taccaaccaa tgtaaatcct ttttatttga aaagcaccaa ggcttaacaa 1320 ttttaagtct taatattaga tcactggcca acccaaacaa ctttaccaaa ctagaggcat 1380 tagttgcatc cctacagttc aaaccggaag taattgccat tacagaaaca tggatagttg 1440 ataatcgtag cggtcactac tcaaatttac ctggctatgt ttttgtaggt aatagccgat 1500 ctaaatctag gggcgggggg gtgggaatgt acattcgttc tcatttggag tttcaaatta 1560 aggactgtat ttctattatg gaggagaaag tttttgaatc actatttgta gtttttccca 1620 ctttaacacc acacntgaaa cccaacccta aatcatcgtt aatctgcggt gttatctata 1680 ggtctccaaa actggataaa caatcaaatt cagtctttat gcacaaatta tcagaatcac 1740 ttgagaaaat tgacaaccga aactgtaact gtattatagt tggggatttt aattatgacc 1800 tcctcaatat taacaataac actgtcactg atttcactaa tacaatgcag gacttctgct 1860 atgaatcaat cattaacaaa ccaacccggt ttacagactc aggtgcaact gcaattgatc 1920 atatttggac cactattaaa cctccattag taaaagcttg cattctaaca gatacattat 1980 ctgaccattt accagttata ctggcactga ggcctaattc agaaaacaat agcaaatcta 2040 atggatctca atctagctcg agatctttca gtgatgcaaa tatagtctca tttaacaaca 2100 acctacaaca tcttaacacg gacaatatat tttgtgaatc agatcccaac cttgcatatt 2160 ccaacctaat tgagcaatac aatacagagt ttgaaaatag ctttccacta gtatccaaca 2220 aagcacctaa ataccactta aagtggttca atgaagaagt aaaggaatta aataaaacta 2280 aacaaaaact atacaaaaaa tatatactga aaagaacagt ctcttccaaa ttggaataca 2340 ataaagctcg aaacttgtac catcacaaaa tccaagctgc aaaaaaatgt cagtatagaa 2400 aaattttcaa gactaacaga aataacataa aggctacatg gaggactata aatgaactac 2460 tcgggaaatc taaaccacaa atttcgaagt catttgaaat tgatggatta tctaccagca 2520 acccaataca cattgctaat cactttaatg atcatttttc aaatgttgca actgagttgg 2580 taaaccggat tccaccatcc acctcacacc acacagatta cctaaaatcc caatcatctt 2640 ccaccatgta tatattcccc actagtccac atgagattaa aacaataata ctggacttaa 2700 aatctaaatc cagcagcggc ttagatcata tcccctcaaa ggttgtaaaa tccacaccag 2760 tcaatatatt acaggtacta tcttatatat ttaacctatc cttacaatca ggaaaattta 2820 taaatgactt caaaattgca aaagtcatcc cagtgtttaa aaaaggcaag cactcactga 2880 taaataacta ccgaccgatt agtctacttc cttcattttc taaaataata gaaaaattaa 2940 tgtacaacag aacccactct ttcttaagtc acaataatgt acttcataac aatcaatttg 3000 gatttaggaa agctcattca actagccatg ctagcaatct cctggttaat ataatatctg 3060 atcatttgga aaataaaaaa tctgttattg gtgttttttt agacctgtca aaagcattcg 3120 acaccattga tcatgacatt ttgcttaaca aaatgtctca ttatggcatc cggggtgtag 3180 cactggattg gttcaccagt tatttatcaa accgacaaca aattgttgat tttaatggaa 3240 ctctatcttc caacctgaac ccagttaaat taggagttcc tcagggatcc attctaggtc 3300 ctttactatt ccttatatat atcaatgacc tgccaaattg cctaacccat agcaaagcca 3360 ttatgtttgc agatgacacc accattttta cccccggccg caaccaatcc attacctgtg 3420 ataatgccaa tgcggatcta gacagattat acacatggct ttccagcaac aaactggttc 3480 ttaacactga taaaacaaaa tacatgtact tttctaactc cactaaatct acttcgttac 3540 caccacctct catttcaata aataactata aaattgaaca agtcaacagc tttaaattcc 3600 ttggtctaac tttaaatgac aacctttcct ggaaaccaca catgcaaata cttcataaga 3660 aacttaacac taatttactc atggtacgta aaataaaacc ccttgttgac caaccctcac 3720 ttcttacatt ataccactca ctaatactaa gtcacattca ctattgcata tccacctggt 3780 gttatggtaa taatcaaatg ctaaccaaat tacagcgagt gtgcaataag ttcattcgcc 3840 taatatttaa ccttggtaaa cgtgaaaata cagtttctat aatgagacaa catggacttt 3900 taacaattca tgacatgcac aaattagaaa tattatccat aatgcacaaa tgccataaca 3960 acacacttcc acctgctctc cacaattgca taccgaacaa acctaaaact acaatgaaaa 4020 caagaagcaa ttcccaattt ttaattcctt tttgtaacaa atccctatcc caacaatccc 4080 tgaaatacat aggcccaaag ttttggcaac agttaccgaa caacattcgc gagattaaat 4140 cgcttaaaaa gtttaccaaa gttgttaaac aacacctctt ggaagaccct catccgcttc 4200 cctagtttcc attaatataa tattatcgtt attattatta gttttttttt tttttttttt 4260 tttttttttt ttaaataaat aaatctcctt tgccatttta aaatttcatg tacctcttgc 4320 cctaataaac tcaaatttac ttctttacct tatctatctt caagttttgc atatatttta 4380 cacatgcatg cattttcgtt gacttgtttt cacatttccc taataaaatc acatgcatga 4440 ttagtttcca aattttgttg tttttaaatt atgatcacca aattaaatca cagtcttgac 4500 tttctttgca tgttaacaat attgatttgc ctacacaaac acaagtttta tttatgatga 4560 tgttttttac agtattctct gttacttggt aagccgtggg cacatttggt tttgtttttc 4620 tgtgaaatag tttttgccgg ccctaacaga gttctgtatt tttaaagctt atcacttcag 4680 tagttttact tgtttttggt gtataaaata tctttcacca tctgtgcaca acacaccctg 4740 tcccctgatc tctttttatt tctatntact tttctatttc tttaatgtct tgaagcattt 4800 ttacaaccat actgccaaga cattattgtt ccagatatcc ggacacggta atttcatcta 4860 cgaggagaag gtcgtccgtt ttttcttgcg tggctcctcc tcctcaaatt tactgttcga 4920 gagttggcgg tgtgatccta gagacctaga tgaagatggc attcaataat acactaacat 4980 gtcacaacca attttcattg atataatcgg atgcttgctg caaccttttg tatttttcat 5040 tttctagcaa cccttttttg gcttgctatt tatttatcta tttattatta tttcttattt 5100 atttacttta acattttgtt tgttttgttc ttttgttttg ttatttatgg tgcactaatg 5160 tagtgctcct ttttcacact cctaaataag taataaatca aatgcattaa tgttatgcaa 5220 tttgtgtgtt ataataacaa tatttaatgt aaatatgccc cggtaagtgt gcttgaaact 5280 aatagtgtca tctgacatta gtggaatttc attttagggg gtttttcttt ttccactctg 5340 gtccatgctt tcgggccaca ccttctccta aagccggctc ccccactgta ctggtcgcgc 5400 tttaccgcac actcttcttc agggctctaa atcgtttagt cttaattaaa tttcgattgc 5460 taaatattgt tcatttacct aggtttggcg gccgcgtccg ctatattcat taatttatat 5520 cctttttttt actgacgtgt tctaacgact gttcgttttg ccgtcttttc ccgttcctgt 5580 atcgtgatta tattcgtttg ccatttttaa aaactggggc aaaaaatgaa taaactaaac 5640 taaactaaac taa 5653 // ID EnSpm-10_HM repbase; DNA; INV; 4817 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4817 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 381-381 (2009). XX DR [1] (Consensus) XX CC The 3'-end is incomplete. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1173..3116 FT /product="EnSpm-10_HM_1p" FT /translation="MLYQNSYNYLAEDTLKSSNVEFGDISYFDENQPSNLT FT NSSDDDCNITDDEHGYYPSCSSDNEEAQDEVTLKELLICWSVKHNVTHSAL FT SDLLKILSKQHPDLPKDSRTLLKTKSSSDIIIMQDMFGQKAEFVYFGLKHQ FT LISLLVFNSIESCKVSLLFSIDGLPLYKSSGKQFWPILCSVTIGENIYKPF FT VVSIFCGNSKPKSSAIFLASFVAEFNVLKEQGLFIGENHFTIHAKGFVCDA FT PARAFVKCIKGHNGYYGCERCIQKGTRVDNRTVFPDVNAMKRTDASFALFQ FT QVQHHKPDAVPPLLELNIGHISQFPLEYMHLVCLGATRRLLLHWLRGKRAV FT KISTLIADLISNGLKNLAAYVTVEFARKPRSLKDIDRWKATEFRLFLCYLG FT PLVLRSHLTNNLYQHFMLLHVAINILANPNMCWTHVDYAEKLLNIFVMQMP FT DFYGSSSINYTMHSLCHICDDVRQYGSLDEYSAFPFENALGIMKRLLRSGH FT MPLQQLCRRLSERINSNTYSISRNMHLIPSLKRLHSNGPTLGYYGKQYLKV FT EYAGFTYFSANSNNCALLDSGKVVLIENVIDADNIMIITRVFGLKENFYTY FT PCESSLLNIFKVSQLSDQFFAFPITSIVKKCLLLPRETYFVSFALLHRD*" XX SQ Sequence 4817 BP; 1707 A; 641 C; 730 G; 1731 T; 8 other; cccaagtagc aaacaatatt ggcacaatat tggcgcgata ttgggtatct tggcctagat 60 tggccaagat acgcaatatt gcgccaatat tagcaaacaa tattgcgcca atattggcat 120 atcaatattg gccaatatta gcataccaag attacgccaa tattatttta ttgctattgg 180 aatataatat tggcgcaata ttgacatgcc aatattggcc aatattgata tgcaaatatt 240 ggtgcaatat tgtttgccaa tattggccca atattgtttt gtttttatta ctcaaatatt 300 ggccaatatt gagattttta catattggta cgcaggcgta acattattgc ttattcataa 360 tgtcctttag ttttgtcctt tgatgtccac aatactttta atagaatatt tacagattga 420 atcacgagaa gaagtggttt taatttattt aaaaacagat tgtatcacga gaagaagtgg 480 taaaagtaaa gaaatccttt taattgaaaa tttagagatt gtatcacgag aagaagtggt 540 tgtaatttat ttaaaacaga ttgaatcacg agaagaagtg gttgcagttc atttagaaac 600 agattgtatc acgagaagaa gtggaaaaat atttctgtgc tctaagataa agattgaaat 660 agtaagttta cttttatata aaaacagttt gagttaatta catctgcaag atttgtaaag 720 tctattttat tttgttgaat atattcaatt aagttaaaga tattcaatta agtaacaaat 780 acgtcatata gtttaccttt taataagttt gtaaatgaat aaattgtaaa taaggtatta 840 ttaaatcttg atttaataat atatgtatat atatatatat atatatatat atatatatat 900 atatatatat atatatatat atatatatat atataatttt attttatgct taagtgtaat 960 ataatgttat aggtaatgaa aaaatcaata aaaacaactc gatctgctaa atcacattat 1020 attagaagag tagtaaagca acatttaaat gaaattcact caacaagtaa tgttgaaaat 1080 acgaatactc aacaagtaat gttgaaaata cgaatgaatt acttaactgt ggaaatgcaa 1140 taaacttcag tgatgagtct attaaccata atatgttata tcaaaattct tataattatt 1200 tggctgaaga cacattaaaa tcctcaaatg ttgagtttgg tgacatatca tattttgatg 1260 aaaaccaacc ctcgaatttg actaatagtt ctgatgatga ctgtaacata actgatgatg 1320 agcatggcta ctatccttca tgttccagtg acaatgaaga ggctcaggat gaagttactt 1380 tgaaagaact gcttatttgc tggtcagtca aacacaatgt aacacatagt gctttatcag 1440 atttactaaa aatcttgtca aaacaacatc cagatttgcc caaagatagt agaaccttat 1500 tgaaaacaaa aagcagttct gacattatta tcatgcaaga catgtttggg caaaaggcag 1560 agtttgttta ttttggtttg aaacatcagt taattagttt gctagtgttt aacagcattg 1620 aaagctgcaa agttagttta ttgtttagca ttgatggatt gcctttgtat aagagctctg 1680 gaaaacagtt ttggccaata ctttgcagtg ttacaattgg tgaaaatatt tataaaccat 1740 ttgtagttag catattttgt ggcaacagca aacctaaaag ttcagctatt tttctagcta 1800 gttttgtagc tgaattcaat gttttgaagg aacaaggttt atttataggt gaaaatcact 1860 ttaccattca tgctaaaggc tttgtgtgtg atgcacctgc acgygctttt gttaaatgta 1920 taaaaggaca taatgggtat tatggttgtg aacgatgcat tcaaaaggga actagggtag 1980 ataatagaac tgtatttcct gatgtaaatg caatgaagag aacagatgct tcctttgcat 2040 tattccagca agttcagcat cacaaaccag atgcagtacc cccactgtta gagcttaata 2100 tcggtcacat ctctcaattt ccactagaat atatgcatct agtttgtctt ggtgccacac 2160 gaaggttact yctgcattgg cttcgtggaa aacgtgcggt aaaaataagc acgctaattg 2220 ctgatttaat ttcaaatgga ttaaaaaatt tggcagcata tgttactgtt gaatttgcac 2280 gtaaaccaag atcactaaaa gacattgatc gatggaaagc cacagaattt cgtytatttt 2340 tgtgttacct tggacctctt gtattacgaa gccatcttac caataatctg taccagcatt 2400 ttatgttatt gcatgtcgca attaatattt tagctaatcc aaatatgtgc tggacgcatg 2460 ttgattatgc agaaaaattg ttaaatatat ttgtcatgca aatgccagat ttctatggca 2520 gcagctcaat aaactacaca atgcacagcc tttgtcacat ttgtgatgat gtaagacagt 2580 atggttcact tgatgagtat agtgcatttc cttttgaaaa tgctttaggc attatgaaac 2640 gattgttacg aagtggccac atgccattgc agcagttgtg tcgacgtttg tcagaaagaa 2700 ttaatagcaa cacatattct atcagtcgca atatgcacct gataccaagt ttaaaaaggt 2760 tacacagcaa tggtcctacc ttagggtact atggaaagca atatctaaag gttgagtatg 2820 ctggtttcac atatttttct gcaaattcca acaactgtgc tttacttgat agtggtaaag 2880 ttgtccttat tgaaaatgtt attgatgctg acaatataat gataattaca cgtgtttttg 2940 gacttaaaga aaatttctac acctatcctt gtgaatcttc tttgctcaat atttttaaag 3000 ttagtcaact atctgatcaa ttttttgctt ttcctataac ttcaattgta aaaaaatgct 3060 tattattgcc tagagaaact tattttgtta gttttgcttt acttcatcgg gactagtaaa 3120 caaactaaat gttagggtgt tattttaatg ttataaattt taataatttt tttaaagaaa 3180 gatataataa ttatatttac taaagtcaat tttgttttag cttgagttta attttatagt 3240 agttttttaa tcttcaaaag taaagttaca attgactgta catttactgc agtatgccag 3300 ttaactaaat ctttttattt aaaaatgaat atgaaattag tttgtaccaa ttaatttttg 3360 ttttcttttt tttagtgttg catttaaaca aagttaagga gagctcataa cgatttgcat 3420 ttcctaacaa ctattatttt tgtttttaaa ctccaacttt ttgtgtcttc tgtctatttt 3480 tgactaatac tgttacatat ttttaaaaag tagaaaactt ttttaargtt atttaaaagc 3540 tgaaagtttt tttttaatga tctttttcaa tcaagtgtga aactgtcaat aaaagagaca 3600 taacacaggc agcatttata aaacaaatgt gttaagtatt tacaaaaaat gtcactttta 3660 tttaaaagta aaaagtctca ttgtttccag tattaaaact tttgataaat tgctcaaaac 3720 ctcacatttt actgtaatta gtaaaaatgt taaaatctaa gttttcacat taattatccc 3780 ttcactatgc tgaccacaca agtagtgaag gatcgtggct gatatttgac tggggccggg 3840 gcagggtttg gtaaaacata gctggggtca aggttattgc cggggctgcc ttaatcttga 3900 caaacccttt aaatgaaatg agtctttaat cttgtattaa aatttacctt atatatatat 3960 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 4020 atatatatgg tgatttaaaa aataatrttt tttaatgaga aacaaaaata aactagtaat 4080 tataataaca ataataaaaa taaraagaat aagaatatta acaataataa taatgataat 4140 aacaataata ataacaacaa caacaataat aaaaacagta atagtagtac aataaarttt 4200 aaatattttt aatcactttt tttattgcag cagtctagaa attgtgaatg cctaactact 4260 ttaatttttg gctagtacaa gcagaacatt tatcataatg agccatttgt ctgggaaaag 4320 tttatactat tagtatttta atacttttgt cattccaaaa gaaaaacaat gtgtctgact 4380 tccttatttt tgactggtat aaaaagagta tttttttatc agtccattac caaagctatt 4440 tggaaaagtt rttttaaggt ttgtaaaatt ttttttattt taaataaata tctttttatt 4500 taagattctt tcttaaatgt aattaattat taaattttat attatttttt atttataatt 4560 aaatataaat ataaaaaaat atataaattc acctccaagc tcaagtacaa gtctacatga 4620 gttttaagtt tccaaatcaa tacaacaaga atgtattaga ctcaaaacaa aaaatcttga 4680 aacacttttt ttttaaaaaa atataatata agagcacttc atataataca tttttatgtg 4740 gggctcctga agactttgta ccacctaatt caaactatta cgatgtatgg gagcctgaat 4800 cttcaggctt ctacaaa 4817 // ID Gypsy-138_AA-LTR repbase; DNA; INV; 274 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-138_AA_; KW Gypsy-138_AA-I; Gypsy-138_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-274 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1010-1010 (2011). XX DR [2] (Consensus) XX SQ Sequence 274 BP; 96 A; 54 C; 49 G; 75 T; 0 other; tgttatgtat ctagatatca caatgtaact catcgatatt gatctaggga agaaactcag 60 cgataattaa tatccgctaa tgttagatga ggatccaatc aggcaacact attgaaagac 120 tataaaagct atcagagata ttgtgtagct ctcttttatc tattgctctt cgagcgaaca 180 acaccgttgt aagtatagtt ggtaagaata aataagtgaa accaacaacg cgtgtgaaag 240 ttactcccag cctccgaact ccccgaatac aaca 274 // ID Gypsy-15_AA-LTR repbase; DNA; INV; 173 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_AA_; KW Gypsy-15_AA-I; Gypsy-15_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-173 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1000-1000 (2011). XX DR [2] (Consensus) XX SQ Sequence 173 BP; 59 A; 38 C; 25 G; 51 T; 0 other; tggcaacact gccgttcagt atattacaag acattcaatg agataaaccg ctatttcctc 60 gagaaagttt taaattcgtt ttaaagtccg tttaattaac tgttctttga ataaagagta 120 ctaaatgtaa tcgcgatagc ctactctacc ccaagatccc gaaataccca aca 173 // ID Copia-37_CQ-I repbase; DNA; INV; 5382 BP. XX AC AAWU01006939; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-37_CQ_; KW Copia-37_CQ-LTR; Copia-37_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5382 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 377-377 (2011). XX DR GenBank; AAWU01006939; Positions 4674 10055. XX CC Positions [2545-3075] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2428..5382 FT /product="Copia-37_CQ-I_1p" FT /translation="MGHLGNSSLKMLRNGLVSGVQFKDAVADNCVVCAKGK FT QTRLPFPKKGHRANDILELVHTDICGPMEETSVGGSRYYVSFVDDKTRRIA FT VYFLETKSESEVLHAFDEFRTLAERQTGRKLKVLRSDNGKEFVNRSFQKRL FT RELGIKHETTVEYTPEQNGMAERVNRTCVERARCMLFEAKLPKSFWAEAVS FT AAIYLINRSPTKGHNLTPEEAWSGRKPDLSHVRVFGIKAMAHIPKQKRKKW FT DEKSKETILVGFDEQTKGYRLYDPVKKSIFKSRDVIFIGEAGRPETAKPAE FT SSNPKRVRTFVSLEFDNISEPVAEPIVQEMPLAANPEVPDAERDDVPEQPA FT ALAHSGPEVVPPATEPETSEDEESESEAEFFSQVESSVGESSDDSDVTILA FT LPPRKTLYPPESQVLRRSGRERAVPGKYKNYVLPSKGLPSHHSTDNPSSDD FT FAEAGPSSVKHGLKASKGATKKSTDPRTVAEALRSDEADLWRTAMGEELQA FT LMDNNTWELVQLPADKKAIGCKWLFKTKRDEHGRVVRHKARIVAQGFKQKY FT GTDYDEVFAPVAKQVTFRVLLTIASRRNSVVKHVDVKTAYLNGELEETIFM FT RQPEGYTTGDERTVCRLRRSLYGLKQSARVWNRKVDSTFKSIGFQQSKSDP FT CLYMRRQNDTFAYILIYVDDMVIVTQTEEEFNAIFESLHRNFTVTNLGDIR FT HFLGMEVERSDAGYKLNQATYIRKLVQRFNLEQAKPSKVPLDPGYLQQKEE FT ECKPLPNNQDYLSLIGGLLYVAVHTRPDIAVSTSILAQKSSKPNQQDWNEA FT KRVLRYLDSTINHKLQLGATNEELQMFVDADWAGNSRDRKSNSGVLLQFGG FT GLVAWCTRKQTCVALSSTEAEFVALAEGCQELIWTRRLLEEVGEAALGPTR FT VNEDNQSCIKLVESDKIERRSKHIETRYFFVRDLQEKGTIRLQYCPTENML FT ADVMTKPLARIKLEKLRSAIGIRPDQFEEE" XX SQ Sequence 5382 BP; 1461 A; 1286 C; 1469 G; 1166 T; 0 other; ttctaaaagg ttatgggccc agattcaaga attgaaagaa gatgaatccg caagtcaacg 60 gtcaagttcc agacaaccct caagccccgc agcaagttcc ccatcactca aacatgctca 120 gtctcccgtc gattgaacat ttaaccggcc gggacaattg gtcgacgtgg aaattcgctg 180 ttgagacgtt cttggaactg gaggagcttt gggaagttgt gaagccaacg ttgaacgctg 240 acggaacctt gccaccgatt gacgaacgga agtgccgcaa agcgcgcggc aagattatcc 300 tactactaga cccaacggtc tatgtgcacg tgaagggcgt caagacagct cgtgaagcct 360 ggtcgaagct ggaagcggct ttcgaggacg cgggactgat gcggcgcgtt ggtcttctac 420 ggaagctgat tacgacgacc ctcacggcat gcggttctat ggagatctac gtgaacgaaa 480 tcgtgtccac cgcccaccag atccgcggcg ttggatttga tgtcagcgag gaatggatcg 540 gaactttcct actcgctggt cttccggaag agtacaagcc gatgctgatg gccctggaaa 600 attctggtct cgccattacc ggggacagca tcaagacgaa gttactccag gaaatgcatt 660 cggtcccgga gaagactgcg ttcgtcggac gaaaaccgta tccgccgaag cagaagccca 720 gcgcagccca aggacggtca gctgagcaac aaccaaaagg accgaagcac agagtaatcc 780 agaggacgca gtccgaacaa atgtcatcga attttgtggt tgcgttacgg ttgatgttgt 840 tggattttcc taagggtagc aggcgttata tttttttcag gcgttatgtt tttgaacgga 900 tatttgaatt taatctagta gccaataatt aaaaataatt tcctgtcaaa ctgactttcg 960 tttcgttttg caaaaaaaaa aatcgacgaa ccgatcgggg ggtcctgttt ttaatccacc 1020 gggcaaactg acctcgtctc ggggccgtcg gaaagtcggc tttgaagaag tgtcctgttt 1080 aacggaagga ggaaatccgc caggctcgga ttgtgaaaaa aattgtatgt gttttttaaa 1140 tgtttgaata taatatggca aataaaattg taaacggcta taaatttaat ttatataatg 1200 ggaattgttt cttattttca aattaaaacc gaaaggaggc aatatataat gcgcgggaca 1260 tgttttttgg gagctgcggg gttttcgata gcaagaaaac tgtttaaatc atcaaaagtt 1320 ggtcacatca cctcctcttc tcccctccaa ggccgagaac acaaacgcaa tatatcacaa 1380 atcaatagaa tttggaactt gagcgatggc acttttgaga gtggattgaa agatgggctg 1440 cgaatcgata acacaactcc tactcgaaat tgacctctcg gtagcatttc aatcatgcaa 1500 ttaatcagcc actgcagcca tggcgacgta cgcaaacctg tgctagcaac tttcccatct 1560 gcatgcattc cgcggtgttt tttccaagac gttaagacgc cctgcggctc gggaaacgcc 1620 ttatccaata tttaccggaa aaacagataa cagtttgaaa gcaacctaaa aaatatgaat 1680 tttcaactga ttgagtcgga ttgagtaatg ttttgaattt ctgacaaaga aaactgtcat 1740 tgaaaaacgt aacgcctctg aaaatccagt gaactaagat agcaaaaaat agcttctcat 1800 gttaggtttt tcctgaaggg gggatgtacc acgatgtacc gcgtaacgca accataaatt 1860 aaataccagt gctccctctg gattactctg tgaccgaagt gcaagcgttg ccatcgacac 1920 ggacacgttg ccagggactg ccaatcaaga actggttcag ccttcgcaac ggtgttggcg 1980 acgaacgaag gttttgaaga tgacagctgg atttttgatt cgggagcatc ggagcatttc 2040 acgaagaatc gtagcctgct gactggtgaa cgggcggcga gcggtatcgt tatggcagct 2100 gacaagcaac cgatgaacat cgtagccacc ggaaccgtcc tgctcaagcc gaagtgtaac 2160 tccgaaagcc tgccgattga aatcaaccac gtcaagttca tcccttcgat gtccaacaat 2220 ctcttgtcgg tgagtcaaat cgttcgacga ggtttcgagg tacgttttac caaccgtggc 2280 gtcagaatcg aggatgcgga tggcgatctc atcgctaccg gccgccacga cgtgaagaac 2340 gggttgttcc agttcgagga aaaggagagg gatgttgcgt tgaccctagc agcttcacca 2400 ccaagcatgg acgtctggca ttgaaggatg ggacatctcg gcaattcaag tttgaagatg 2460 ctgagaaacg gcctagtctc tggagtgcag ttcaaggacg ctgttgctga caactgcgtg 2520 gtgtgcgcta agggtaagca aaccagacta ccatttccga agaaaggaca ccgtgccaac 2580 gatatcctgg aactcgtgca cacggacatt tgcggaccga tggaggaaac gtcagttggc 2640 gggagcaggt actacgtgtc gttcgtggac gacaaaacga gacggattgc agtctacttc 2700 ttggagacga agtccgagtc tgaagttctt cacgcgttcg acgagttccg cactctcgct 2760 gagagacaaa ccgggcggaa gctgaaagta ctccgcagcg ataacggtaa ggaattcgtt 2820 aaccgttcct tccagaagcg tctacgtgaa ctcggtatca agcacgagac cactgtagag 2880 tacaccccgg aacaaaacgg tatggccgag cgggtcaacc gtacctgtgt ggagcgagca 2940 cgctgtatgc tattcgaggc gaagctaccg aagtcatttt gggcggaggc ggtgtcagct 3000 gcgatttatc tcatcaatcg ttcgcccacc aaaggtcaca acctcacacc ggaagaagct 3060 tggagtggac gcaagcctga cctgtcgcat gttcgagttt tcgggatcaa ggcgatggca 3120 cacattccaa agcagaaacg caagaagtgg gacgagaaat cgaaggagac catccttgtc 3180 ggtttcgacg agcaaaccaa ggggtatcgt ctctacgatc ctgtgaagaa gagcattttc 3240 aaaagccgag acgtgatctt cattggcgaa gcaggacgac cagagactgc aaaaccagcc 3300 gaaagcagca acccgaagcg agtgcgtact ttcgtgagtc tggagtttga caacatctcg 3360 gaacctgtag ctgaacctat cgtgcaggag atgccacttg cggctaaccc cgaggtacct 3420 gacgctgaac gcgatgatgt cccggagcaa ccagctgcgt tggcacattc cggaccggag 3480 gttgtgccac ctgcgaccga gccagagacg tctgaggatg aagagtcgga gtccgaagct 3540 gaatttttct cgcaagttga atcaagcgtt ggcgagagct cggacgacag tgacgtgaca 3600 atcttggcgc tccccccgcg aaaaactcta tatccacccg agtcacaggt gttgaggcgc 3660 agcggcaggg agcgcgccgt cccaggcaag tacaaaaatt atgtccttcc gagcaaaggc 3720 ttgccgtccc atcactccac agataaccca tcttctgacg attttgcgga agccggcccc 3780 agcagcgtta aacatggact caaggcgagc aagggtgcta ccaagaagtc gacggaccct 3840 cgaacggtgg ccgaggcgct gcgcagcgac gaagctgatc tgtggcggac cgccatgggc 3900 gaagagctcc aggcgctgat ggacaacaac acgtgggaac tggtgcagct tcctgcggac 3960 aagaaggcga tcggttgcaa atggttgttt aaaaccaaac gtgacgagca cggacgtgta 4020 gttcgtcaca aggcacggat cgtggctcaa ggatttaagc aaaaatatgg cacggactac 4080 gatgaagttt tcgcacctgt ggcgaagcag gtgactttcc gagtgttgct aaccattgcg 4140 agccgcagga actccgttgt caagcatgtc gacgttaaaa cggcttatct gaacggcgaa 4200 ctggaggaaa caattttcat gcggcaaccc gaaggttaca ccaccggcga cgaacggacc 4260 gtttgtcgtt tgaggaggag tctgtacggt ttgaagcagt ccgcacgtgt ttggaaccgt 4320 aaggtggact caactttcaa gtcgatcgga ttccagcagt cgaaatcgga tccttgcctg 4380 tacatgcggc gccagaatga tacgtttgcg tacattctaa tctacgtcga tgacatggtc 4440 atcgtcacgc agaccgagga ggagttcaac gccatcttcg agagtctgca tcggaatttc 4500 accgtaacca acctcggtga tatccgacac tttctcggaa tggaagtcga gagaagcgat 4560 gccggttaca agctgaacca agcgacgtac atccggaagc tggtacaacg gtttaatctg 4620 gagcaagcga agccgtccaa ggtaccactc gaccccggct acctgcagca gaaggaggag 4680 gagtgcaagc cacttcctaa caaccaggac tacctcagct tgattggagg tctgctgtac 4740 gtagcggtgc acaccaggcc agatattgca gtgagcacgt cgatcttggc ccagaagtcc 4800 agcaagccga atcaacagga ctggaacgaa gccaagcgag ttctgcgcta tctggattcg 4860 acgatcaacc acaagctcca gctgggtgcc acgaacgaag aattgcagat gttcgttgac 4920 gcagattggg ctggaaattc acgggaccgg aagtcaaact ctggcgtact gttgcagttc 4980 ggaggtggac ttgtcgcatg gtgcaccagg aagcaaacct gtgtggcgct gagctcaacc 5040 gaagcagagt tcgttgccct agctgaagga tgtcaggagc tcatctggac ccgcagattg 5100 ctagaagaag ttggagaggc agctcttgga ccaaccaggg tgaacgagga caaccagagc 5160 tgcatcaagt tggtggaaag cgacaagatc gagcgccgaa gcaaacacat cgagacgcga 5220 tacttttttg ttcgagatct gcaggagaag ggaacgataa ggttgcagta ttgccccacg 5280 gagaacatgc tagctgatgt gatgacgaag ccgcttgcga ggatcaagtt ggagaagctg 5340 cgttcggcga ttgggattcg accggatcag ttcgaggagg ag 5382 // ID Copia-4_DWil-I repbase; DNA; INV; 3657 BP. XX AC scaffold_181075; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_DWil_; KW Copia-4_DWil-LTR; Copia-4_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3657 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181075; Positions 979131 982787. XX CC 'ATAA' target site duplication CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 91..1908 FT /product="Copia-4_DWil-I_1p" FT /translation="MSTQFSIEKLVGENYSVWSVKMKSLLIHSDLWSVTCG FT RLKKEASDTTEQKELFDSKDEKTLASILLCVSPSQFNGIKHCKTASEAWAK FT LSAIHMPSGPARKIQLFKQLTDSIREHLNNFSNVVEKLNEIQEQVPDEMLV FT IILLASISEDFENFVVAIETRDSLPSLSALRIKLLEEGERRKIGEVAEKSE FT NKMYSSRQENGKEYKAFREPKNIECWLCGRRGHFASKCGSKERKDNASEKN FT SKKSFSVFSTSGTKIGLSKNAWCVDSGATAHLCADKSMFTSMREHKEKIKL FT AGNSYIEAEGCGTVELKCKNITIELVDVLYVILQCNFISVSKAIENGLKVI FT FVKNKVFVKDSNHRDLMIAEKHNGLFLFESKTKTVFAAETNVNQLVKWHDR FT FGHLNFDSLNKMIKQKMVYGLNAKVYSTNDLVCETCAKSKICVKKFPKFSE FT SRSKELLGLIHTDICRPMRTASQGGARYFATFIDDKSRYVSVYMLKNKSDI FT FQKFKDFKMLAENQTGKRKKAIRSDNGREYLSDVFEQFLSSHGIKRQLTVP FT HTPQQNGVAKRANRTLVEMARSMIVHAGVGESFWGDAIMTAAYLRNRAETA FT TLNFETSS" XX SQ Sequence 3657 BP; 1209 A; 595 C; 890 G; 963 T; 0 other; gcagaaagta aaactgtgcg ggttagtcgt aaagacttgt aagagattcg ttttgttaat 60 taaagtttgt gattaactaa tttgttcaag atgagtacac aattttcaat cgaaaaacta 120 gttggtgaaa attatagtgt atggtcagta aagatgaaaa gtctgttgat tcattctgat 180 ttgtggtcgg tgacatgtgg tcggttaaaa aaagaggcgt ctgatacgac tgaacaaaag 240 gagttgtttg acagtaaaga cgagaaaacc ctcgcaagta tattgttgtg cgtaagtccg 300 tcacagttca acggtataaa acattgtaaa accgcgagcg aagcatgggc aaagttatcc 360 gcgattcaca tgccctccgg tccggctaga aaaatacagt tgtttaagca attgacagac 420 agtattcgag agcatttgaa taatttttct aatgtcgtag aaaagttaaa tgaaattcag 480 gaacaagtgc ctgatgaaat gctcgttata attttgttgg caagtataag tgaagatttc 540 gaaaattttg tagttgccat agaaacgcga gactcgttgc catcgctaag cgctctaaga 600 attaaattgc ttgaagaggg agagagacgc aagattggcg aagtagccga gaaaagtgag 660 aataaaatgt attcgtcgcg tcaagaaaac ggtaaagaat acaaagcatt tagagaacca 720 aaaaacattg aatgttggtt gtgcggtcgt cgcgggcatt tcgcgtcgaa gtgtggttcg 780 aaagagagaa aagacaacgc gagcgaaaag aacagtaaaa agtctttttc ggtattctct 840 acttcgggaa caaaaatcgg tttgtcgaaa aatgcgtggt gtgtagacag cggggctaca 900 gctcatttgt gtgcggacaa atcaatgttt acgagtatgc gtgagcacaa agagaaaatc 960 aagttagcag gcaacagtta tatagaagcc gaaggttgcg gtacagtgga actaaaatgc 1020 aaaaatatca cgatcgaatt agttgatgtg ctatatgtaa ttttacaatg taattttatt 1080 tcggtatcaa aggcaataga aaatggctta aaagtgattt ttgtcaaaaa caaagtgttt 1140 gttaaagaca gtaaccatcg tgatttaatg atcgcagaga aacacaacgg tttattttta 1200 tttgaaagta aaacaaaaac agtgtttgct gcagaaacaa atgtaaatca gctggtcaaa 1260 tggcatgatc ggttcggaca tttaaacttc gatagtttaa ataaaatgat taagcagaaa 1320 atggtctatg gtctcaatgc taaagtgtat agtacaaatg accttgtatg tgaaacgtgt 1380 gcaaaaagta aaatttgcgt taagaagttt ccgaagtttt cggaaagtcg ctcaaaggaa 1440 ttgttgggct tgattcatac agacatttgt agacctatgc gtacagcgtc acaaggtggt 1500 gcgcgctatt ttgcaacttt catagatgat aaatcacgtt atgtatcggt gtacatgtta 1560 aagaataaaa gcgatatttt ccagaaattc aaagatttca aaatgctagc agaaaatcag 1620 acgggtaagc gtaaaaaagc catacgaagc gacaacggtc gtgagtacct gagcgatgta 1680 tttgaacagt tcttatcgag tcatggaata aaaagacaac tcactgttcc ccacactcca 1740 caacaaaatg gtgtggcgaa gcgcgccaat agaaccttgg tggagatggc gcgaagcatg 1800 atagtgcatg ctggagttgg cgaatctttt tggggcgatg cgataatgac ggctgcatat 1860 ttacggaata gagcagaaac tgcaactttg aactttgaaa caagttcatg aaagagttgg 1920 agcacggtcg agaatcatca tgtgagtcag atgtggtaat ctttcagcca agtgaaaaac 1980 ttgattgcgt gcataatgat gaaatccaag aaaattctga ggagcagcca attccaattg 2040 ttgatgagca aagtggtgac gaagaaaatg ctgaggagga gcaaccatat atccgaagag 2100 gaccagggag accaaagatt gttcgcactg ggcgtgctgg tcgtccacga agagagtaca 2160 atatgctgaa tgtactagat gcgaaagacg ttattgttcc tacctgtata aatgaggcat 2220 ttaagtcaga gcagtcggaa aactggaaat cagcaatgca gaaggagtat gacaacctag 2280 aaaatattaa gacatggagt ttggttgatc tttctgcaaa caaaaaggtt atcggttgta 2340 aatgggtttt taccaccaaa cgagatgaga gcggaaaagt tcagcgtttc aaggcaagat 2400 tggtcgcaaa agggtgtggt caacaatatg gaattaacta caaggaaaca ttttccccag 2460 ttgcacgtta ttcctcaatc agactggtga ttgcgttggc ggtggagcac gagatgcact 2520 tgcaccagat ggatgtctct gcagcatacc tcaatagtga gctacaagat gacgtttata 2580 tgcgacaacc tgaaggtttt gtcgatgctg atcatcccaa gcgtgtactc aagctacata 2640 aatcgttgta cggcttgaaa cagagtgggc gtgagtggaa ccaaaagcta gatagcattc 2700 ttttgaagat tggtttcgtt ccgtgtgcaa gcgaaccgta tatacacgag caaagtcaat 2760 gggtatttat gtattattgt tgtaaaagtt gacgatttta tacttgcatg taccagaaaa 2820 gacgatgtgg cctacataca tgcagtacat cgaaggcttg ttgcatgaat atggtatgca 2880 ggactgcaag ccgaacgcga ctcctctaga ggttgggttc cagacgaagt gcgatagcga 2940 tgattgcggt caggtggaca agacacgcta tcaatcactc attggttcgt tattatattt 3000 ggcgttgact acacggcctg atataataca ctctgtggct aagttggcgc agagaaatgc 3060 tgatccgcac aaggaacatg aggttgcagc caagagggta ttgagatatc taagatgtac 3120 ctcagatgtg agacttcatt atagcaaaac tggtgttcct attcactgtt ttgttgaagc 3180 ggactgggct ggcgattgca acgatcggaa gtcgttcacc ggttggtcat tccttattgc 3240 tagagcggcg gtgtcatggg aatcaaagaa gcaaaatctg gttgccctta gtagcacgga 3300 ggccgagtac gtagcacttt ctacagcagc caaagaagcg atatacatca gaaagctgat 3360 caacgagatg ggattcgggc cgatggccaa gctgctaatt tatagtgaca accagagcgc 3420 acaatgtctt gctaaagatg ctaaatttca ttcacgtagc aagcatactg aaataaaata 3480 tcattttgta agggaaatgt ataaagaaaa tgtaatagat atcaaataca ttgccacaga 3540 tagtatgacc tcagatatat taacaaagaa cttatgtaag gtcaaacatg taaagtttac 3600 agaaatgttg ggattaaaat aatttttgta taagatagac ttcgcgttga gaaggag 3657 // ID hAT-9_HM repbase; DNA; INV; 3074 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3074 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1998-1998 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1084..2550 FT /product="hAT-9_HM_1p" FT /translation="MYDELFKDIKIKISAANYLSLTTDIWTADTAKIAFLS FT ITGHWIDLTKFSQETAVLRVIHFPEKHTGVHIKEYLQKGLETFEIPLSKIH FT LIVTDNASNMKAAVKNSGMSSIPCFIHTLQLCIHDSIFSQETIKEILTLCR FT GITTHFNHSPIACAKLKSIQQQLSTVPHKMKQDVPTRWNSSFEMLQRTFEQ FT KVPLATYAAEYDEIKLPTNYQWGIVEKLVHILAPFENLTKKCGRRDETCAQ FT IIPSVLALKVLLQKASSSDIYAGIMTMIDELIKSVTKRLDKFLLNKMLCVS FT TFLDPRYKLLYQHADENVIKEWVEEAWKEMQGAQDVVDSDSDNAPIVKAPA FT SNGFISIDDCFKDVASMRTGRKQSERRSEAERELVDIEVVTTDIIKKQTIL FT RSAITQEIDNYLSLPLLENKSSPFLWWSKCGMQFEKLKKMALKYLTAPPSS FT IESERLFSAGGDIYEATRSRLKADNGEYLMFVHYNLKLIKQLK*" XX SQ Sequence 3074 BP; 1099 A; 508 C; 520 G; 947 T; 0 other; ttagtgatgt gccgggtacc cggttcggac ccggagaccc ggtaattcgg cagttttacc 60 gggtacccgg tatcggccat tttgttgcag tacccggcac cgggtatctt caaattgatt 120 ttttctttag tttaaacttt aaccattaaa cttcttgata taagtctaag agtgttatgt 180 agtcgataac cacaaagttc tattaactta aaacaatctc acttaaaata ttttttaata 240 cacttttgca aaaaaattta tacggataca aaaactgtaa cgctaaatga aataattcga 300 gaatacgaat aaaaagaatg ggaaaaaaag ttttagaata gcttcttttt attctgttag 360 aacacaagat aatttttagc aatttagttt tgaatgaaac gcttataatg ctttcgaacg 420 ctgatagcac ttttcattcg tatcaagcaa aaatcataaa agtagagata taaggtaaat 480 attatacaca tttttcatat tttaatttcg agcgtattat taaaaaaatt atcgttttat 540 ataaaataaa cgaaaacttt gatatcaaaa ctagtttcat tatctttata gaaattttat 600 ttgaaagaaa aattatggca aaaaaaagta tagcgtggaa gttctttgat ataagcacga 660 ctcctcatat gagtaagtgc aaactatgtg atagtttaat tacgagagga ggaaaatctt 720 tcaagtctta tggaactaca gcaatggtaa aacatttgcg tttaaagcat tccaaagagt 780 ttgaacttgc ggaagaaaaa aaaaaggttc aatcgttgcc gtctatttct gaagcctcag 840 ttgcttctac caccaataca gtacaaagta acattattca ggctttacac aaaaaaaaaa 900 cagtggaata ttgatgacca cagatctatt cgaatccata aaataatagg aaaacttatc 960 acattggata ttcagccatt ttctattgtt gaagatacgg ggtttaatga actgattaaa 1020 gatgcttacc ccaattacaa attaccatgt agaacatatt tcagccaaaa cgtaatccct 1080 agcatgtatg atgaactttt taaagatata aaaataaaaa tctccgcggc aaactatttg 1140 tctctgacga cagatatttg gacagcggat actgcaaaaa tcgcattttt aagcattact 1200 ggacattgga tagatctaac taaattttcc caagaaactg ctgttcttcg agtaattcat 1260 tttccggaaa agcacacagg tgttcatata aaagagtatt tacaaaaagg tttagaaact 1320 tttgaaatcc cgttaagcaa aattcatctt attgttaccg ataatgcaag caatatgaag 1380 gctgcagtaa aaaacagtgg aatgtcatca attccatgct ttatccatac tttgcagcta 1440 tgtattcatg actctatatt tagccaagaa accatcaagg aaattttaac tttatgcaga 1500 ggaataacga cacattttaa tcattcgcca atagcatgtg caaaattaaa atcaattcag 1560 cagcagttaa gtactgtacc tcacaaaatg aaacaagatg ttccaactcg ctggaacagt 1620 tcatttgaaa tgctacaaag aacgtttgaa caaaaagttc ctttggccac ctatgcagca 1680 gaatatgatg aaattaaact tcctactaat tatcaatggg gaatagtcga aaaacttgtt 1740 cacattcttg ctccatttga aaacttaaca aaaaaatgtg gtaggaggga cgaaacttgc 1800 gcacaaatta taccatctgt tcttgcatta aaagttttat tacaaaaagc ttcatcaagt 1860 gatatatacg ctgggattat gacgatgatt gatgaactaa taaaatctgt tactaaaaga 1920 cttgataaat ttttgcttaa taaaatgtta tgtgtatcaa catttttgga tccacgatat 1980 aaattacttt atcaacatgc agatgaaaat gttattaagg agtgggttga agaagcatgg 2040 aaagaaatgc aaggagcgca agatgttgtt gactctgact ccgacaatgc tccaatagta 2100 aaagctccag catctaatgg attcattagc attgatgatt gttttaaaga tgtagcttct 2160 atgagaaccg gaagaaagca gagtgaaaga cgttccgagg cagaaagaga attagtagat 2220 attgaagttg ttaccactga cataattaag aagcaaacaa tattaaggtc tgcaattact 2280 caagaaatag acaactattt atctctgcct cttttagaaa ataaaagttc tccgtttttg 2340 tggtggagta aatgtggaat gcaatttgaa aaactaaaaa agatggcatt aaaatatctt 2400 acagcacctc cgtcatcaat agaaagtgaa aggttattta gtgctggtgg ggacatatac 2460 gaagcaacaa ggagcagact aaaagcagat aacggagaat acctaatgtt tgtgcattac 2520 aatctaaagt taataaaaca attaaaataa gatacagcaa ataagtgttg tttaagttct 2580 ttttatttgt ttaaatttgt tttattttaa tcactctaca gtcgagaagg tctacctttt 2640 tttgcaattc tttatatttc ttaatctttt gctttccata atataacttt caatggcgga 2700 acttcaaatt ttgccccttt ccccacaaaa tattatttag gcccgctcca ccaggaaaga 2760 aaaataattt taaactaaat atagttatta aaaaactgtt gcacaacttt taaaaaaagt 2820 taggccctac aactcaggcc ccttaccctg cattgcgggg ttttgctaaa tagtacgtcg 2880 gccacataac tttttgttag ttagcgtaaa aagcgtattt tgtgtgaata gttcagccct 2940 agagatttca caaattttta ttccaacttt taaaaaatgg actcggtgcc gagtacccgg 3000 actcggaccc ggttttttca gccgggtacc cggaatcgga gactcggcat aaactggttc 3060 cggcacatct ctaa 3074 // ID BEL-642_AA-I repbase; DNA; INV; 7229 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-642_AA_; KW BEL-642_AA-LTR; Pao_Bel_Ele132; BEL-642_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7229 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5091-5675] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS join(618..1982,1986..6056) FT /product="BEL-642_AA-I_1p" FT /translation="MSVNDDYRLLCKQERSLRISLENMKEFAHAHQAGADR FT HDVDLRIAKLDEIWEKFVGVRLRIEMLTEDVGDGDISTDTEESEEAKLQRH FT IRLQKQHDRDNARIIKDFENEVYQLKKAFFSLLQAGPSCVQQVQQYQNAAV FT PQSKVKLPELKLPTFSGRMSDWVTFRDTYKNLIHNDNCLSDMDKFTYLRTS FT LTGDALQEIASIEMSAVNYAIAWEALENAFENKKLLVMTYLDSLFALESLR FT QENFESLSRLVNGFEKNLQMLTKIGEDTEGWSTLLQYMLCKRLHPTTLRQW FT ESHYNSKEVPKYKDLIKFLKSHCSVLQSITPGKQFQAESRKPVRPSISHAG FT VQSTSCCPFCGESAHSAFKCAKFSKMRIAERVDAVKKHSLCLNCLSSGHIA FT RFCTRGSCFHCGQRHHSLLHANSSTTANSSTKSGQSNGNQPIKKPQGQNSQ FT PPQAKPTPNGSTNTQPTQSGRTDPQNSTNASNAHSHTSATDCPSTSYHTAP FT LSTTRHVPHTVLLSTAVVSLCDQFGNTILARALLDSGSQRCYMSEAISQKL FT KFKRTREHLPIAGIGGARTASTKAVFAEVHSLVSKYVTNLKFHVLPRVTVD FT LPTRSIDIRSWNVPKELILADPTFHESGAVDLIIGAEVYLELMIAERQIKL FT GDSGPVLQNTLLGWIVSGGIPNDSITLPSVSACATERIEEELARFFELESC FT RTTSTLSLEESACETHFEKTTKRDPSGRFIVQLPRKQFLVERLGDTQAIAT FT RRFLALERRLDMDPAVKKMYIEFIDEYLRMQHMREVSPRELSTFPVTYFLP FT HHAVLKPDSTTTKLRVVFDASCLSSSGVSLNEALMVGPVVQEDLTSITLRF FT RLRKYAMTADIEKMYRMIKMHPLDHPLQCILWREESAKPIRFFVLTTVTYG FT TSSAPYLATRCLKKLAEDEKANFPAATETIIYDFYVDDMLKSVDSVEEAGQ FT LSRDLIHVLGTAGLTLRKWSSNSREVLDQIPPYLRDERSSLDLELSTPTVK FT TLGIKWEPRLDIFRFTVPQWNAATEITKRIILSDFAKLFDPLGLVGPSLVP FT AKVFLQDLWRTKCSWNDTLPDELQTWWREFRESLEGLTHLQVPRWIAFGSD FT TISAELHMFCDASKKAYGACIYLRCTSFDGKVTSALLTAKSRVAPLEDLEK FT KRKQTSIPRLELSSALTGAHLYEKVVQSIKITAQPYFWTDSMIVKCWIAAV FT PSRWAMFVANRVSEIQHITRGGIWNHIAGLDNPADVLSRGMAPDQLKDYGM FT WWQGPPWLCLDKASWPKTATINSDDLDPALLEERTTVSAPAQVVEPSPIFG FT LRSSLQDLVRTVSLIRRFTYNCRNTSDRRIGFVTHEERAEALHQLLSLAQQ FT ESFPEDLAALRRTGEVKSTSRLRKLAPRIVNGILLVGGRLQNANISAGRKH FT PVILDNHHPLSILIAVHYHQKNLHAGQQLLVASMREKYWPLSARNLARKVI FT HRCVRCFRARPTAHEQLMADLPAERVTPAPPFMRVGVDYCGPFYIQYPYRK FT GGPIKCFVAVFVCLAVKAVHLEVVGDLTTQAFIAALRRFVSRRGRPEILMC FT DNATNFVGARRELDELRKLFNNPDFIKAVTEETSWENINFKFIPAKSPNFG FT GLWEAAVKSMKEHLKRTLGNTVLVSDEMATLVAQIEACLNSRPITPLSSDA FT DDLEFLTPGHFLVGRPLTAMPEPSLLDINESRLSKWQRVQYFLQRIWNRWS FT TQYLSNLHARTKWTRQRNNLFVGVMVLLCEDNVPPLRWRMGRVTEIHPGKD FT GNIRVVKVRTKDGDFLRAISKVCVLPIQDNEGARSATREED" XX SQ Sequence 7229 BP; 1870 A; 1797 C; 1772 G; 1759 T; 31 other; gtgcgtttaa gtccgggacc ggaacgaaca tttttggtcc gaatcgaacc ggatcgtcgg 60 gaawcgcgma agtgmagtgg aaaaagktgc cgtcttctga sccgccgcgt ggtcgctgat 120 tgttctccac gagggagaac aataggacac gggcgtcctc aacattgagt gtgctkctgc 180 cwtcgtccat cwttaccckt tgcctatgcg gagagctgck gcggatcgag gmcgccwtcg 240 gagctwccag gcgasaccgg acggatttgc caggacamca ggataagtac tgtgggcagg 300 tttgtctgtg gcgcgggtct twgttgcatg tgagaccgcg gccgacggaa aktsaattga 360 gccaacgtga attgaaattg agtgagmttc aataaaaaga ttmtttaatt gcttgcatga 420 tcaaattgtg tcgtccataw cattttkcgc tgagctgagc cctgaccagt ccgccgtctc 480 cmtwggtttg tccgcttcgt wggttggttt gcwttgtcck ttccccagtc aaaaccccca 540 ttgtccagtc aaaccgcgtg gtcgatccga gtgtcgcgtt cgagtgaagt gcaagtgtga 600 tacaatagtg ttcsattatg tcagtgaatg atgattaccg actgctgtgt aagcaggagc 660 gttctctgcg gatttctctg gagaacatga aggagtttgc tcatgcccat caagcagggg 720 ccgacagaca tgacgtcgat ctccgcatcg ccaaactgga tgaaatatgg gagaaatttg 780 ttggtgtgcg actgaggatc gagatgctga ccgaagatgt tggagatggt gatatttcca 840 ctgataccga ggagtcagag gaggccaagc tgcaacgtca catcaggttg cagaaacagc 900 acgatcgtga taatgcgaga atcattaagg atttcgagaa cgaagtttac cagctgaaga 960 aagccttttt cagtttactt caggctggtc caagctgcgt gcagcaggtt cagcagtacc 1020 aaaatgcagc tgtgccgcag tctaaggtta agctccccga gctaaaactg cctaccttta 1080 gcggaaggat gtcggattgg gttacattcc gtgacaccta caagaactta atccacaacg 1140 acaattgcct ttccgacatg gataaattca cttatctcag aacctctctc acgggcgacg 1200 cactgcaaga gatcgcatcg atcgagatgt ctgctgtcaa ttatgccatt gcgtgggagg 1260 cattggaaaa tgcattcgag aataagaagc tgctggtaat gacgtatttg gattcgttat 1320 ttgcactcga gtctcttcga caagaaaatt tcgaatcgtt gagtaggcta gtgaacggtt 1380 ttgagaaaaa ccttcagatg ctcaccaaga ttggtgaaga cacggaaggt tggagcacgc 1440 tgttgcagta catgctttgc aagcgtctgc atcccacaac attaaggcaa tgggagtcgc 1500 attacaactc taaagaagtt cccaagtata aggacctgat taagtttttg aagagtcact 1560 gttcggtgct gcagtctatc actcctggaa agcagttcca agctgaatcg aggaaacctg 1620 tacgaccgtc gataagtcat gctggtgtcc aatcaacaag ttgctgtccg ttttgcggag 1680 aatctgctca ctccgccttt aagtgtgcaa agttttccaa aatgcgaatt gcggaaaggg 1740 ttgatgccgt gaagaaacac tcgctttgct tgaattgcct gtcatcggga cacatcgctc 1800 gtttttgcac tcgaggatcg tgcttccatt gtggtcaacg tcaccactcg ttgcttcacg 1860 cgaactcgtc gaccaccgcg aattcatcaa caaaatctgg acagtcaaat gggaatcaac 1920 cgataaagaa accacaagga cagaactcgc aaccaccaca agctaagcca acaccaaacg 1980 gtcastcaac aaacactcag cctacacaaa gtggacgcac agatccacaa aactccacca 2040 atgcaagtaa tgctcattca cacacttccg ccacagactg ccccagtaca agttaccaca 2100 ctgcacctct ctcaacaaca cgccatgttc cacacactgt tttgctctct accgctgtcg 2160 taagcctttg cgatcagttc ggtaacacca tactcgctcg tgcgttgctg gactctgggt 2220 ctcagcgatg ctacatgtct gaagccatct ctcaaaagct caagttcaaa cgaactcgtg 2280 agcacctacc gatcgctggt atcggtgggg cgcgaactgc atccactaaa gcagttttcg 2340 ctgaagttca ttcgctcgtt tcaaaatacg tgacaaacct taaatttcac gttttgcctc 2400 gagtaactgt cgatctccca acaagaagca tcgatattcg atcctggaac gtcccgaagg 2460 agttgatttt ggccgatcca acattccacg aatctggagc agtggatctg ataattggag 2520 ctgaagttta tctagagctg atgatcgccg aacgtcaaat caagctaggc gattccggtc 2580 cggtactgca gaacacgctg ctgggctgga tcgtgtccgg tggaattccg aatgattcaa 2640 tcaccctgcc atccgtgtca gcatgcgcta ccgaaagaat tgaagaagag ttggcgcggt 2700 tctttgagct ggaatcgtgt cgcaccacta gcacgctgtc tttggaggaa tcagcatgtg 2760 aaacgcattt tgagaaaacg acgaaaagag atccaagcgg ccgattcatc gtccagttgc 2820 cgaggaaaca gtttctggtt gagcgtctgg gagatacgca agccatcgcc actcgtcgat 2880 tcttggcact cgagagaaga ttggacatgg atccagccgt gaagaaaatg tacatcgagt 2940 tcatcgacga gtacctccgg atgcagcaca tgcgcgaggt ttcaccaaga gagttgagca 3000 catttccagt cacgtacttt ttaccacatc atgcggtgct caagcccgac agcacgacta 3060 ctaaattgcg tgtcgtgttc gatgcgtcat gcttgagttc ctctggagtt tcattgaatg 3120 aagcgttaat ggtgggtcct gtagtccagg aggatctcac ctcaatcaca ttgcgttttc 3180 gcctgcggaa gtacgcgatg acagccgaca tcgagaagat gtataggatg attaagatgc 3240 accctctgga ccatccactg caatgcatat tgtggagaga agaatctgca aagccaattc 3300 gattttttgt gctgactacc gtcacatacg gcacgtcatc tgcgccgtac ctagcaacgc 3360 gctgcctgaa gaagttggcg gaggatgaaa aggctaactt tcctgctgcc actgagacaa 3420 ttatctacga tttttatgtg gacgatatgc tgaagagcgt cgacagcgtt gaggaagcag 3480 ggcagctttc aagagacctg attcacgttc tgggaacggc cggacttacg ctgaggaagt 3540 ggagctccaa ttctcgagaa gtgctggatc agatcccgcc ttacttacga gatgaacgtt 3600 cgtcattaga tctcgaactc tcaaccccca ccgtcaagac ccttggaatc aaatgggaac 3660 ctcgattgga cattttccgg ttcactgtac ctcagtggaa cgctgctact gagattacta 3720 agagaattat tctgtccgat ttcgcgaagc tcttcgatcc acttggtctg gttggaccta 3780 gcttagttcc agctaaggtt tttctccagg acctttggag aacaaagtgt tcctggaacg 3840 atactctacc ggacgagctt caaacttggt ggagagaatt tcgagaaagt ttagagggcc 3900 tcactcatct ccaagtccct cgttggattg cgtttggaag cgacacaata tccgccgaac 3960 tccacatgtt ctgcgatgcg tccaagaagg cgtatggtgc ctgcatttat ctgcgatgca 4020 catcgttcga cgggaaggtt acatcagcac tgctgacggc aaaatcccga gtggctccac 4080 tcgaagacct agaaaagaag cgaaaacaaa catctattcc tcgcttagaa ttatcgtctg 4140 cactcacagg tgcacatttg tacgaaaagg tcgtgcaaag cataaagatc acggcacaac 4200 cgtatttttg gactgactca atgatcgtca aatgctggat tgctgctgtt ccgtcgcgct 4260 gggctatgtt cgtggccaac agagtatccg agatccagca catcacccgt ggaggtatct 4320 ggaatcatat cgcgggacta gacaatcctg cggacgttct gtcaagggga atggctccag 4380 atcagctaaa ggactacgga atgtggtggc aaggaccacc ttggctgtgc ttggacaaag 4440 cttcctggcc aaaaactgca accatcaact ctgacgattt agatcctgcg ttactcgaag 4500 aaagaacaac ggtatcagct ccagctcaag ttgtcgaacc cagcccgatc tttggtctac 4560 gatcctcact tcaagatcta gttcgcactg tgtcccttat tcggagattc acgtacaact 4620 gtagaaacac tagcgatcgt agaattggct tcgtaacgca cgaggaacga gcagaagcct 4680 tgcatcagct tctctcacta gcgcaacaag agagcttccc agaagatctt gcagcgctgc 4740 gcagaactgg cgaagttaag tcaacgtcaa ggctgaggaa gctagctcca cgcatagtga 4800 atggaatcct cctggtcggt ggccgattgc agaatgccaa catttcagca gggcggaagc 4860 atcctgtcat cctagacaat catcacccac tatcgatcct tatcgccgtt cactatcatc 4920 agaagaatct gcatgctgga caacaactcc tggtggccag tatgcgggaa aagtattggc 4980 cattgtcagc acgtaacctg gctcggaagg tcattcatcg ttgcgtcaga tgttttcgcg 5040 cacgaccgac ggctcacgag caactcatgg cggacttacc agcagaacgt gtcactccag 5100 ctccaccgtt catgcgcgta ggagtggact actgtggtcc gttttatatt cagtatccct 5160 atcgcaaggg cggtccaata aaatgtttcg tcgcagtatt tgtgtgcctt gctgtgaagg 5220 ctgttcattt ggaggtggtt ggagacctca cgactcaagc cttcatcgcg gcattgagaa 5280 ggttcgtctc tcgccgtggt cgacccgaaa ttctgatgtg cgacaacgca acaaatttcg 5340 tcggagcacg acgtgaatta gatgagcttc gtaaattgtt caacaatcca gatttcatca 5400 aggcggttac tgaggagaca tcctgggaga acatcaactt caaattcatc cctgcgaagt 5460 cgcccaactt cggaggcctt tgggaggccg ccgtgaagtc gatgaaggaa cacctaaagc 5520 gtaccctagg caacacagtt ctcgtgtcgg atgagatggc cactcttgtg gcccaaattg 5580 aagcatgcct caattcgagg ccaattacgc cactttcaag cgatgctgat gacctggaat 5640 tcctaacccc cgggcacttt ctggtgggta ggccacttac agcaatgccg gaaccatctc 5700 tgctagacat caacgaatcg aggctttcaa aatggcagcg cgtgcagtac ttcctccaac 5760 gcatttggaa tcgctggtca acccagtatt tgtcgaactt gcacgctcgt acgaaatgga 5820 ccagacagcg gaataatctt ttcgtcggcg tcatggtgct gctctgcgag gacaacgtgc 5880 cgcctctgag gtggcggatg ggcagagtca ccgagattca cccagggaag gacggcaaca 5940 tccgtgtcgt caaagtccgt acgaaggacg gagactttct gcgagcaata tccaaggttt 6000 gtgtcctgcc aatacaggac aacgaaggag cgcgatctgc aacgcgcgaa gaggattgaa 6060 tccttttcca taaagcgccg tccgggcgct gcggaggcct tcgggtctcc gcgttccagt 6120 taagtttttg ttttttacat tacgatcaaa aagctcaggg aatgttctgt cctccatagt 6180 tcatccacgg cggccggagc cgtccacccg atttcgtgaa tttccgtatg ccaatccatc 6240 gatttctgtc catccgagaa gttcatcaac taaacaaaca tcaactcaat tcctgtaatg 6300 ctcaccaaag aagttgttgc gtcaagtatt tcgtgggggt tgaggaatcc ccacacaaag 6360 ttacgtgttc tgtattttgt tcattcattc gtgcaatcaa acaccattcc gtttgatcag 6420 cagggttttg tggagaagat catcgctact gggttaatcc gacgaacccc tacgtctgca 6480 gtgagcccga gatgactaga gctgggtttc cttcggttcg catcgagcat catgcgacaa 6540 acgccagtgc catgggcaac attcctacac tcgggacaag tcagcagggg tgtcgacggc 6600 gccttggcgc ctgcggaggc ggacggcctc cgcaacccag ggaaktcatt ctatatttct 6660 attttattgt taaaaaatcg cctctcatct gtttcatagc tacctatcga ggatccctca 6720 gccgaggccg aagtgtccca ccggagacga gtcgtggaga aatccgtgga cggtcctcac 6780 gattgatcaa tagatactac gcatgcctag ccgaatacga aaaatgacga gacgacgaag 6840 atcagcagta acagaaaatc ccacaccagt agtcgataac tgattctaga aagtaaccag 6900 cgaaccttat gtatagatat ctttatttct gtcaacgcac tagaatataa twagctatta 6960 gtttgttgaa attaggtcat ttcaaggcgg ccggtatgtt tgcgccgatc gcccactgta 7020 ttgcgctgat gcgatacaga tcatatagta gataactaga ttttgcattt caaccactac 7080 tactacaaga tcactaaatt aaccattagt gtccgagtta gaagatatag ctgctctact 7140 tttccccaag caaaatgtga acctatttca tgctataccg atcgcttcac gatcgccgat 7200 aggcgcgacc agagataacg atcggacag 7229 // ID Gypsy-30_OD-I repbase; DNA; INV; 6577 BP. XX AC CABV01002978; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_OD_; KW Gypsy-30_OD-LTR; Gypsy-30_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6577 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002978; Positions 12409 18985. XX CC Positions [2415-2969] - Reverse transcriptase CC Positions [4296-4745] - Integrase core CC 'ACGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(573..2657,2661..4745) FT /product="Gypsy-30_OD-I_2p" FT /translation="MSFNQDQIKTVKDALELASTSRAQKEALEKLPQSILE FT NQEVKNEIAEYEKIIDQCTKLVMSGTLPLGAPVVQSSAGSSVISASVINTA FT FTRHFGHSKFQGTSPKETDHFLGLADSFQKSYIINRATGEPAKDKEQFFLS FT CLRAHLDVAPSRAIGDALDKVADYAAARKLINKHFGGKLNHLLVISEAAAI FT ECDADLPLHDYAGKVQQKLNEAYEACIREFKRVNADKNMTVQDCFYLFGTH FT LLSEKLRLQMPDLYNKLIDKMSDSFRPDDLAAAAETLRRNMTPTDANCHYL FT GKRRGAYAKHDNKNKNNRRSGRGQANRGHSGGHHKNKDSGSGDQQVKQTTA FT HAADEEEATEDISAISLACFRAAPVYQSLPTLVPDPYVTSIKFSHDTCSFA FT TDVEIDSGSFATILPKKVIPQKILSQLKKAPLTISGLNGAAQSPLGYLEMK FT IVFDTGPELEAKVYVLEDAPCLLGRDVLRCRDVRSVELGKTYLRLNFENRY FT KSVPPQKVQLRKNYRVGVAKPNDGNEVQQVHNGSPILWVKQNLGIDLPTHA FT PLEHREKIAEDLKELKAAFATSSQSLGEFPEEAELPTVPGEVRNRKQIPIP FT QAQHSLVDEHIRNMLEDKVIAPCDDPKGWNSPIILVPKKDKTTRFCVNFAP FT TINKVASDKSDPFIVARMDETISSLGEGQLYFTVADLKNGYWQVKLEEDQV FT KTAFQWRSKTYKFIRVPFGYTFSGSIFSRCVAKMLDHVKMRRNVQSYVDDL FT IQFGGSFTEYRESLRQLLKAVIKFGVKLKASKCQFLQREAHFLGRVITKAG FT VQTDPAYTRSLLSMPPPTNHRELRSLVGSLTWLKEFAEARMGEEISSHLFA FT HVMRPITALLVTCKRGVIPPPFQWTPEADSAFTQLKTRLANPPVISFPDFR FT HTFILHTDASDLACGGILTQIINGKTKLVAAVSHTFTRAEANWSVSEKECF FT GILWSVEKLSRLLKGTKFIIHTDHYSLTYMDKTAFRNSKIARWQSRLAEYD FT FVLQYIKGSKNNFADWISRPFGTDNLKSRDTGPVENAGRFLNIGNSDLVVY FT IPSWCTEQTNLPITARKLIASVNVAKIIRPSPDPEMEGEMAQFAIHQQDDP FT FLAKITRAVRKARATSSKVDLESIIDKNDHRRVELLKIANRLSICRTSNCL FT VINDRRGPRAVVPEALRAAFVRRAHDLQAHCGLPRMKENLKMLWWIDMDKD FT CENYVRSCVSCLKTKGAHGRPQAPPSGQVQKGRFPGDILNIDYVMMKEPSN FT GYRYMLTCICSFSRYLWAIPVRRDNALSAAQGLTSICLQYDFWPRLIHSDR FT GLHFVNSTIDEFCKSNEILHSLTCAWRPEANGVVERAHRTLKNGLYSTCHS FT ENMTWTKALPYVTRAMNASICKQCSKF" XX SQ Sequence 6577 BP; 1880 A; 1703 C; 1413 G; 1581 T; 0 other; agtggtgtca tcgtaactac gtgaagacag gacgactcct cgagtaattt tcaaccggta 60 aacttaggat cttagaactc aaacgcctca aaatcaaccc tcgtcgtacg tcaaaaatct 120 tcctggaagt cgccaagtcc attccaaggc cgcgctttcc gcctgtggaa tccaacggcg 180 aaaaatctct tcgcgcactc cgctgatcgc atcagcaaaa ccgctgtcta caacttacgc 240 tccaaaggcc cccaaaaagt gatctcaatc agagttcact ggggtcaggc caggtacact 300 cgtctcgctg ctgatatttc gcgcattcta gtaaccgccc gagcttccgt tgcacttctt 360 cgctaccgtc cgtcaatttt ctcagttgtt ggtctatcta ttggggattt ctgttgcact 420 cagcgcgcct actatcccta taatttgacc tagagcttaa ccgtctagcg tcaaagggat 480 cggcccaggt acaaaactca ccatacagat aaccagtcgc ctaagtctct tcctcgccgt 540 cgcgccaact agaatcttcg tcgccaaaaa aaatgtcgtt caaccaggac caaatcaaaa 600 cggtcaaaga cgctttagaa ctcgccagta cgtcccgggc acagaaggaa gcgctcgaaa 660 agctcccaca gagcatttta gaaaaccaag aggtcaagaa cgaaattgcc gaatacgaaa 720 agatcatcga ccagtgtaca aaactcgtca tgtcaggcac cttacctctc ggcgctcctg 780 ttgtccagtc ctctgctgga agttccgtca tttctgcttc cgtaatcaac actgcattta 840 cgcgccactt cggccactca aaatttcaag gcacaagtcc gaaagaaaca gaccattttc 900 ttggtctcgc ggactctttc cagaaaagct atatcatcaa ccgcgccacg ggcgagcctg 960 ccaaagacaa ggaacaattc ttccttagct gcttacgagc acatttggac gtcgctccct 1020 ctcgtgccat cggtgatgcc ctcgacaagg tcgcagatta cgctgccgcc cgcaagctga 1080 ttaacaagca cttcggcgga aagctgaacc atcttctggt gatctcagaa gctgctgcca 1140 tcgagtgcga cgccgatctg ccgctgcatg attatgctgg taaagtccaa caaaaactaa 1200 acgaagctta cgaagcgtgt atacgcgaat tcaaacgcgt taacgctgac aagaacatga 1260 cagtccaaga ttgcttttac ctgtttggta ctcaccttct ttcggaaaag cttcgccttc 1320 aaatgccaga tctttacaac aagctgattg acaaaatgtc cgacagtttc cgccccgacg 1380 acttagctgc tgccgccgag acgctcagaa gaaacatgac tccgaccgac gccaactgcc 1440 attatttagg aaaaagacgc ggcgcatacg ccaagcacga caacaagaac aaaaacaacc 1500 gtaggtctgg caggggtcaa gccaaccgcg gtcattcagg tggccaccac aaaaacaaag 1560 attctggatc cggtgaccaa caagtcaaac aaaccacagc tcatgccgcc gacgaagaag 1620 aagctacaga ggacatttcg gccatttctc tcgcttgttt cagagctgct ccagtgtacc 1680 aatctttgcc aacacttgtt ccagacccat acgtgacaag catcaaattt tcacatgaca 1740 catgctcgtt cgccactgat gttgaaatcg actccggatc gtttgctacg attctgccta 1800 agaaggtcat tccacagaag atattgtcgc agctgaaaaa ggctccgctg acgatctcag 1860 gcctgaatgg cgctgcgcag agtccacttg gctacctgga gatgaaaatt gtcttcgaca 1920 ctggcccaga actagaagca aaagtttatg tgctggagga cgccccttgc ctgctcgggc 1980 gcgatgttct ccgttgtcga gatgttcgct cagttgaact cggcaaaact taccttcgcc 2040 tgaatttcga gaaccggtac aagtcagttc cgccgcaaaa ggtccagctc aggaaaaact 2100 accgagttgg agtcgccaaa cctaacgacg ggaacgaagt tcaacaggta cataatggaa 2160 gtccgattct gtgggtgaaa cagaatttgg gcattgacct gcctacacat gcacccttgg 2220 agcaccgaga aaagattgca gaagatctga aggagctcaa ggcggcattc gcgacatctt 2280 cgcagtcact tggcgaattt ccggaagaag cggagttgcc cactgtacct ggcgaggtac 2340 gaaatcgtaa gcagatacca attccgcagg ctcaacattc actggttgac gagcacatcc 2400 gtaacatgct cgaagacaag gtaatcgctc cgtgcgatga ccctaaaggt tggaattcgc 2460 cgattatact cgtccctaaa aaggacaaaa caactagatt ttgcgtgaac ttcgctccga 2520 caatcaacaa agttgcttcc gacaaaagcg atccattcat cgtcgcccgc atggacgaaa 2580 ctatctccag cctcggtgaa ggtcagctct acttcacagt agcagatttg aagaatggct 2640 actggcaggt gaagctataa gaagaggatc aggtgaaaac tgccttccaa tggcggtcga 2700 aaacatataa attcatccgc gtgccgttcg ggtacacttt ttctggatcc attttcagca 2760 gatgtgtcgc caaaatgctc gaccacgtaa aaatgcgtag aaatgtacag tcgtacgtgg 2820 atgatctgat ccagttcggc ggttctttca cagaatacag agaatcgcta cgccaacttc 2880 tcaaagccgt catcaaattt ggcgtaaagc tcaaagccag caaatgccag tttcttcaaa 2940 gagaggccca ttttctgggg cgtgtcatca caaaggcagg agttcagacc gaccccgcat 3000 atacaaggtc gctgctgtcg atgccgcctc ctacaaatca tagagagtta aggtctctcg 3060 tcggatcact tacctggtta aaagagttcg cagaagctcg aatgggcgaa gaaatcagct 3120 cacatctgtt cgcgcatgtc atgcgcccga taacagctct tctagtcacg tgcaagcgcg 3180 gcgtaattcc accaccattt caatggactc cagaagccga tagcgcattt actcagctca 3240 aaactcgact cgcaaatcca ccggtgatca gctttccaga ttttcgccac acttttattc 3300 tgcacacaga tgcaagtgac cttgcgtgcg gtggcattct tacacagatc atcaacggaa 3360 aaacgaagct tgtagctgca gtttcacata cattcacgcg cgcagaagct aactggtccg 3420 tctcggagaa agagtgtttc ggcatattgt ggtcagtcga aaagctcagt agactcctga 3480 aaggaacaaa atttatcatt cacaccgacc actactcgct tacctacatg gataaaacag 3540 cgtttcgcaa ctcgaagatc gcccgttggc agtccaggct agccgaatac gattttgtac 3600 ttcagtacat aaaaggatcg aaaaataact tcgctgactg gatttcaagg ccatttggta 3660 cggataactt gaaaagccgc gacaccggtc cagtcgaaaa tgcaggccgt ttcctgaaca 3720 ttggcaacag cgacctcgtt gtgtacatac caagctggtg cactgagcaa accaatcttc 3780 caataaccgc tcgcaagctc attgcgtcag taaacgtcgc gaaaataatt aggcccagcc 3840 cagatccaga aatggaagga gaaatggccc agttcgcgat tcaccagcaa gacgatccat 3900 ttcttgccaa aataacgagg gcagttcgca aagcccgagc aaccagctca aaagtagacc 3960 tcgagtcaat catcgacaaa aacgaccacc gccgagtaga attgctcaaa atcgccaatc 4020 gtctcagcat ttgtcgaaca tcgaactgtc tggtcattaa tgaccgtcgc ggcccacgcg 4080 ctgtcgttcc agaagctcta cgcgcagcgt ttgtccgtcg tgctcatgat ctgcaagctc 4140 attgcggtct tccacgcatg aaggagaatc taaaaatgtt gtggtggatt gatatggaca 4200 aagactgcga aaactacgtt cgatcatgcg tctcatgcct aaaaactaaa ggagctcacg 4260 gtcgccctca agcgccgcct agcggtcaag ttcaaaaagg ccggtttcct ggcgatattt 4320 taaatatcga ctatgttatg atgaaggaac catctaatgg ttatcgctac atgttaacat 4380 gtatttgcag cttcagtcgc tatttgtggg ctattccagt ccgccgagat aatgctttat 4440 cagccgcgca aggactaaca tctatttgcc tacagtacga tttctggcct aggctgattc 4500 acagtgatag aggtttacat tttgttaatt caacaatcga cgaattttgt aaatcgaacg 4560 aaattctgca ctcactcacc tgtgcttgga gaccggaggc caatggcgta gtagaaagag 4620 cccatagaac gctgaaaaat ggtttataca gtacttgtca ctcagaaaat atgacttgga 4680 ctaaagcctt accgtacgta actcgcgcga tgaacgccag catctgtaaa cagtgttcca 4740 aattttgagg aaaactcgtg tttccctgtt tccctgtttc cctcgtattt tgagaattgc 4800 tgtttccctg tttccttgtt tccctctaat ttccagaact gctgtttccc tgtttccctg 4860 tttcccaggg aaacagtttc cctgtttccc ttgtttccct cttatttttt ttctcacata 4920 taaattattt ttttcttaaa tttttgatct atatgacatt ttaaccggta atttttgcct 4980 aagttttatg taagcattca attttacgct tattcaaacg tatttaaact ttgatacttc 5040 ataaaaattg aatttgctct agaacgaggc ttattttgtt tacgaatatt ctcgattttt 5100 ctggaacatt gagttaaaaa acaacaaatt tgtgcgaaat ataagttcta gaaccatttt 5160 ttgggtattt ttactaaaaa aacaacaaga aggaaacagg gaaactgttt ccctgtttcc 5220 ctttttttaa cgatagttgt ttccttgttt ccctgtttcc ctttcaaaat caaaaacgct 5280 gtttccctgt ttccttgttt cccggaaaca gggaaacagg gaaacgaaac agggaaacag 5340 gtaaacaggg aaacaaaaat ttggaacact gtctgtaaat cgacaggtca gattccaaaa 5400 gaggcctggt tcggaaagcg caaagctcat acagataacc agtcgccgaa acagttcgca 5460 gctggaattc gtcaacgcat agagcgcgta agccagctca taaaagtttc ccaagaagca 5520 gccgccgcag atactgagcg tcgaaatagt aaaaagctgc caccgacacc gctcatcgaa 5580 ggtcagctcg tttatgtcaa acgagaattg aattccgccg gaaaatcagc agggctgaaa 5640 tgggtgggcc cgttgcgcct gatacgcagt aattccagcg tttgcttggt cgaagacgcc 5700 aaaaagaagc gcgactggat cttcagaggc cacgttgcgc cagcagacga gcgtcacgag 5760 cacctaaaag agcttagatc gattcacgct gacgaagaac tttacatttg ggcacatttg 5820 tccgaactcg tagctaatcg cgcagcagat tctgtctcac cttcgtctag gggggatata 5880 tctagtaaaa cgaagctcga tcacgcccag cagcagccta aaattgttag ctccgaaatt 5940 caaggagtac aggaacagaa caaaattgaa aaactggcag agaacgccac agaaagtcaa 6000 catcagcttc aacctgaacc gatggaaata tcacagagca ctgaacatca gtcaacgatc 6060 gtcgatttat caaattcaac ttctctcggg gaaaagacaa ttgcagataa aactgctgat 6120 tcaacttcca aagcggaaaa atcaacgttg aagatggcaa cagataccac tacttctcaa 6180 ccggaagact gcgctgacga tgccactttt ctctccgcaa gatctttcca gccagatgat 6240 tccattcttg aatcagactt gagcttaatt gctggaagcc ttgtcccacc gccggccaat 6300 cagcagctgc gtccgccgct gccgtcgaca ccgaaaaatg ttcccacgct cccgccagca 6360 tcactgaaag cgccgcgcaa gcgttcgtcc gtcaaagcga cgccaccata tcgacctaag 6420 agaaattgcc gctctcctga tcgaattaat attggttcaa atcgaaccaa gaaatactag 6480 agcaatttcg aaaagacctc gcccagctac agtacagaag ccgacttgcg atgttaattc 6540 tttaaaactg taccctcaaa gtaaactcta gggggga 6577 // ID BEL-617_AA-I repbase; DNA; INV; 5848 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-617_AA_; KW BEL-617_AA-LTR; Pao_Bel_Ele194; BEL-617_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5848 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4892-5449] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 410..5848 FT /product="BEL-617_AA-I_1p" FT /translation="MEEERRQLEEEKALMEKERAIRRRELAAEEKFIKAKY FT EVETRLAEEDGSARLSVAAGMVTGKVHDWLKENQMQLNVALKVPQNDREAK FT PVESLKVPVRDHDSDSKSNPKSTIADFHPERSNLTERVERDEELEDLNPCC FT RQEAHCIAGQGARSISREMGPTAEQLSARQIWPKKLPSFSGDPEDWPLFYS FT SYETGNTACGFSNVENLIRLRECLRGPAREAVLTKLMFPHSVPSIMETLRR FT LYGRPELLVKNLLNKVRRLETPKPERLDTLISFGMAVNQLCDHLEAANLEG FT HLFNPTLLEELVEKLPATVKLEWVRFKRLFRHPSLKEFGEFMETMVADASE FT VTTLVQPKSGPCKSDKAPRREKGQVFTHSSTTVESNSERLPCPICSGTDHR FT VRNCEKFKQMNVESRIKAVERWQLCGVCLHDHGKWRCRTKIRCDVTGCQGR FT HHPLLHRPLTSTQGAVQVHQRMQKSVLFRIIPLTLHHGTRRYDTYAFLDEG FT SNMTLVETSLVRNLGIEGVPEPLELKWTSDIGRKEYSSQRADIIVTGKGPT FT NSYCLNSAHTVDSLNLPSQSLSIENLHERYPYLRDIPIASYEKAVPRVLIG FT LDNIELFSPLECRVGGSGEPIAVRCLLGWTIYGPVTSESATPGIVSVHRCG FT CDRDQDLTEMIRKQYLLEDAGISPFTLPEPAEEKRAREILERTTVKVDGHY FT ETGLLWKSDDIKLPDSRPMALSRLRSLETKLAKNPELRLNVHKQINEYIRK FT GYAHKATEEELAAMNKQQVWFLPLNVVTHPRKPDKKRLVWDAAARVNGVSL FT NSQLLKGPDQLVTLPSIICKFRERSIGFGGDVREMFHQLKIRPSDKRFQLF FT LFRFDTTQPPDVYVMDVATFGATCSPCSALYVMRLNADACKEEFPAACVAI FT KEKTYMDDYYDSLDTPEEAGVRAMQVREIHARAGFDMRNWVSNSPQVLEVL FT GEEGNKLLPIAADKQASERVLGMTWEPTADVFLFSVNLREGLTDYIIGNRR FT PTKRAALRCIMSFFDPLGLLSPYLVHGKIIMQDLWRSGIDWDTEIGDAEYA FT KWSEWTNLLKKLDNVKIPRCYFQNIRPESLLDLQLHVFTDAGEQAYGCAAY FT LRYAIGKNVKCSLVMGKSKVAPLKALSIPRMELQAALLGSRLMESICSNHS FT LKITGRYLWTDSSTVLSWIRSDHRRFKPYVAHRVGEILSLTQPEQWNWIAS FT QDNIADCLTKWSKETEPVSDGRWFNGPDFLYTSEECWLAKDIGPSKADEEL FT RPCFVLHHMTSFKQGSIIDVSRFSKWETLLRTLALVRRFISNCQLRIKKLP FT IEAVNVDERVLKSLSQSLPAIHVPLRQREYVEAENLLFRIAQAETYPDETE FT ILLNNRDKACDKWLNLEKSSVLFKVSPFADEHGVLRVEGRTANAMYAAFDA FT RFPIIMPKNHPLTLLLLDHYHRNYGHANRETVVNQVRQRFEIANLRTTIDK FT VRKRCQLCKIRKCKPLPPRMAPLPEQRLTPYIRPFSYVGLDYLGPLEVAVG FT RRREKRYVAVFTCLVVRAVHLEVAHDLTTSSCIMAIRRFVRRRGSPVEIFS FT DNGTNFVGASRILADQIRRINSDCADTFTDAKTKWTFIPPGAPHMGGAWER FT MVRSVKEAMRALDDGRKLDDEILLTVLAEAEGFINSRPLTYMPQESGGTEA FT LTPNHFIFGNSSGAHNPIRTPVDLAEALRNSYKRSQYLSDAVWDRWLKEYF FT PAVNKRSKWFCDAKPVKVGDLVYVAEGKRRTWVRGKVVELIANKDGRVRQV FT IVKTASGTFKRPVVKLAVMEILSNGESRANHPEMPLDPRGGE" XX SQ Sequence 5848 BP; 1713 A; 1292 C; 1489 G; 1348 T; 6 other; attctcaaag atgagcgagg atgccgcwac tmataattgt ggagcctgta aaaagccmga 60 ctctgccgat gcgggcatgg tagcctgcga tgactgctcc gtctggtttc attactcctg 120 cgcaggagta tcacctggag tagctgaacg wtcgtggaaa tgcaagacct gtctcgcacc 180 ccttggaaca acaccactga acactggggt gagcaaaaag aacccgaaat ccgccgcagc 240 tgggaagaag ggcgatgatg gtaaaagtat tckaagcaac gtaggatcta aaacagcaaa 300 gaatgttcct ccaccgagat cgatcaggac cacctcgtcc aacgctaggc ttcaagcgca 360 gttgacctta caacgcctgg aggaagaagc cttgttggaa aaacggaaga tggaagaaga 420 gcgtaggcag ttggaagagg aaaaggcctt gatggagaag gaacgagcaa ttcgtcgcag 480 agaactggct gccgaggaaa aatttattaa agcaaaatac gaggtggaaa cgcgtttagc 540 tgaagaggat gggagtgctc gactgagtgt cgcagccggt atggtcacgg ggaaggtcca 600 cgattggctc aaggaaaatc agatgcaact caacgttgca ttaaaagtcc cgcagaatga 660 cagagaagca aaaccagttg aaagcttgaa agttcctgtg cgagatcatg attctgattc 720 taaatcgaat ccgaaatcaa caattgccga ttttcacccg gagcgaagca accttacaga 780 aagagtggag cgagatgaag aattggaaga tttgaatcca tgttgccgtc aggaagctca 840 ctgtatcgca ggtcaaggtg camgaagcat ttcaagagaa atggggccca cagcggagca 900 gctatcagca aggcaaatct ggccgaagaa gttgccgagt ttttctggag acccagaaga 960 ttggccactg ttttatagca gttatgaaac tggaaacacc gcttgcgggt tttcaaatgt 1020 agaaaacctg attcgacttc gagaatgtct tcgcggtccg gctagagaag cagtcttgac 1080 aaaattaatg tttccccaca gtgtcccttc gatcatggaa acacttcgtc gtctatacgg 1140 aaggccagaa ctactcgtaa aaaaccttct taataaagtg cggcgtttag agacacctaa 1200 accggagagg cttgacactc taatcagttt cggtatggcc gtcaatcaac tgtgcgacca 1260 tctagaagcg gccaacctgg agggtcactt gtttaaccca acgcttctcg aagaattagt 1320 tgagaaatta ccagcaacag tgaaattaga gtgggtccga ttcaaaagat tgttcaggca 1380 cccgtcgctc aaagagtttg gtgagttcat ggaaacgatg gtagcagacg ccagtgaagt 1440 aacaacccta gtccaaccca agtcaggacc atgtaagtcg gataaagcac cacgtcgaga 1500 aaaaggacaa gtcttcacac attcttccac aacggttgag agtaacagcg aaaggctacc 1560 ctgtcctata tgctcaggaa cagatcaccg agttcgcaac tgcgaaaagt ttaagcaaat 1620 gaacgtggaa agcaggataa aagcagtgga gcgatggcaa ttgtgtggag tatgtctcca 1680 cgaccacggc aagtggaggt gccgtacaaa gatacgttgc gatgtgaccg gttgccaagg 1740 acgacaccat ccgctactac atagaccact cacttcaacg cagggtgccg tacaagttca 1800 ccagcgaatg caaaaatcgg tgctctttag aatcatacca ctaacactcc accatggcac 1860 tcggcgctac gacacatatg cattcttgga tgaaggttcc aacatgacac tagtagagac 1920 tagcttggta cgaaatttgg gcattgaagg tgtaccagaa ccgctggaat tgaaatggac 1980 ctctgacatc gggcgaaagg aatattcatc ccaacgggca gatatcattg tcaccggtaa 2040 agggccgaca aatagttact gtttgaactc agctcacact gttgactccc tgaatttacc 2100 tagccagagt ttgtcgattg agaaccttca cgaacgttat ccctatcttc gtgatatacc 2160 tatagcgtcg tacgaaaaag ctgttccaag agttctaatt ggactcgaca acatcgaatt 2220 gttttcgcct ctggaatgcc gcgtaggagg atccggtgaa ccaatagcag ttcgatgtct 2280 cctagggtgg acaatctacg gacctgttac gtcagaatcc gctacaccag gaatagtaag 2340 cgttcaccga tgcggctgtg atagagatca agatctgact gaaatgatac gtaaacaata 2400 tttgcttgaa gatgccggaa tttcgccatt tacactgccg gagcctgccg aggaaaagcg 2460 cgcccgagaa attttggaac gtactaccgt taaggttgac gggcactatg agacaggact 2520 cctatggaag tcggacgata taaaactgcc agatagccga ccgatggctt tgtccaggtt 2580 acgaagtttg gagacaaagc tggcgaagaa tcctgagttg cggttaaacg ttcacaagca 2640 aatcaatgaa tacatacgca agggctacgc tcataaagcc acagaagagg aactcgctgc 2700 gatgaacaaa caacaggtat ggtttttacc attgaacgtg gtgacccatc ctagaaagcc 2760 tgataagaaa cgtctagtct gggacgctgc agctcgcgtg aacggcgtct cgttaaactc 2820 ccaacttttg aaaggcccgg atcaattggt tacactaccc tcgattattt gcaaatttcg 2880 agaacgcagc attggattcg gaggggacgt gagagaaatg tttcaccagt tgaagataag 2940 accctccgac aagcggtttc aactatttct cttcagattc gacactaccc aacctcctga 3000 cgtttatgta atggatgttg cgacttttgg agcaacgtgc tcaccatgct ccgctttgta 3060 tgtgatgagg ttgaacgctg atgcctgcaa agaagagttc cctgcagctt gtgttgccat 3120 caaagagaag acatacatgg atgactatta cgacagcctt gatacaccgg aggaagcagg 3180 cgtacgagct atgcaagtga gagaaatcca tgcacgtgct ggtttcgaca tgagaaattg 3240 ggtgagcaac agtccacagg tcctcgaggt gttgggagaa gaaggaaaca agttactccc 3300 aatcgctgca gacaagcaag cttcagagag ggtgctagga atgacctggg agcctaccgc 3360 agacgtcttc ttgttctctg tcaatttgcg agaaggtttg actgactaca tcatagggaa 3420 tcgacgcccg accaaaagag cagcgttacg gtgcatcatg agtttttttg accctctcgg 3480 cctcctgtca ccatacctag ttcatgggaa aatcataatg caggatttat ggcgctctgg 3540 catagactgg gatacagaga taggagacgc ggagtatgcc aaatggagcg aatggacaaa 3600 tcttcttaag aaactggaca atgttaaaat cccgcggtgc tattttcaaa atatacgtcc 3660 cgagagtctt ttagatctgc agcttcatgt gtttacagat gccggagaac aagcctacgg 3720 atgtgctgct tatttgagat atgctattgg aaaaaacgta aaatgttcct tagttatggg 3780 caaaagcaaa gtggcaccgc ttaaagctct ttcgatccct agaatggaac tacaggctgc 3840 gcttctagga tcgaggctaa tggagagcat ttgttccaat cattcgttga aaattactgg 3900 tcgatacctg tggacagact ccagcacagt gcttagttgg attcgttctg accatcgaag 3960 atttaaacct tatgtcgctc accgcgtggg agaaattctt tctcttaccc aaccagagca 4020 atggaattgg atcgcatcgc aagataatat tgcggattgt ctgacaaaat ggagcaagga 4080 aacggaacct gtatcagacg gtagatggtt caacgggcca gacttccttt atacttcgga 4140 agagtgttgg ttagcgaagg atatcggacc gtcaaaggcg gatgaagagt taagaccttg 4200 cttcgtccta caccatatga ccagtttcaa gcagggcagt ataatcgatg ttagccgctt 4260 ttccaaatgg gagacgttac ttcggacgct agctttggtc aggcgcttta tctccaattg 4320 tcagctacga ataaagaagc tacctattga agctgtgaac gttgacgaaa gggtgctgaa 4380 gtccttaagc caatcattgc ctgctataca tgtgccgtta cgccaaaggg aatatgtgga 4440 ggccgaaaat ttgttgttca gaatcgctca agctgaaaca tatcctgatg agacagaaat 4500 actgttgaac aaccgagaca aagcttgtga caagtggttg aacttagaaa agtcgagcgt 4560 gttattcaaa gtatcaccgt ttgcggatga gcacggtgta ctaagagtag aaggaagaac 4620 tgcaaatgcc atgtacgccg cattcgatgc tcgtttccca atcatcatgc ccaaaaacca 4680 tccactaact cttttgctac tagatcacta tcatcgcaac tacggccacg cgaatagaga 4740 aaccgtagtc aatcaggtgc gacagcggtt cgagattgct aacctacgca ccacaatcga 4800 caaagtgagg aagagatgtc aattgtgcaa aataagaaaa tgtaagccgc taccaccgcg 4860 gatggctcct ttgccagaac agcgtctaac accttacatc cgtcccttca gttacgtggg 4920 gctggattat ctaggtccgt tagaagtagc agtaggaagg cgtagagaga aaagatatgt 4980 agctgtgttc acttgtctgg ttgtccgcgc agttcacctc gaagttgccc atgatttgac 5040 cacatcgtct tgcataatgg cgatacgtcg gtttgtcaga agaagaggtt ctccggtcga 5100 aatattctcc gacaacggga ccaatttcgt cggcgctagt cggatattgg cggatcagat 5160 aagaagaatc aatagcgatt gtgcggatac gttcacagac gctaagacta agtggacctt 5220 tatccctcct ggggcacctc atatgggtgg cgcctgggag cgcatggtga gaagcgtgaa 5280 ggaagccatg cgtgccctag acgatggaag aaaactggac gatgaaatct tgttgacagt 5340 tttagcagaa gccgaaggtt tcataaattc acgtcctctt acctatatgc ctcaggagtc 5400 ggggggtact gaagcactta ccccgaatca tttcattttc ggaaactcct caggggccca 5460 caatccgata agaactccag tggacttggc tgaagctctg cgcaacagct ataagcggtc 5520 acagtattta tctgatgctg tatgggaccg ctggttaaag gaatacttcc ctgcagtcaa 5580 taagagatca aaatggtttt gtgacgcaaa gccggtgaaa gtcggcgatt tggtatacgt 5640 agcggaaggc aaacgaagaa cgtgggttcg tggtaaggtg gtcgaattaa tcgccaacaa 5700 ggacggtaga gttcgccagg tgatagtaaa gaccgcttca ggaacattca aacggccagt 5760 agtgaaactg gcagtaatgg agattctgag taacggtgaa tccagagcta atcatcccga 5820 gatgccactg gatccacggg gaggggaa 5848 // ID hAT-N3A_BF repbase; DNA; INV; 362 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N3A_BF autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; hAT-N3A_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-362 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-362 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 912-912 (2008). XX DR [2] (Consensus) XX SQ Sequence 362 BP; 107 A; 90 C; 79 G; 85 T; 1 other; cagggttcta gccagcatcc gtccttccgt cctttgacgg aattttgcag ctggggacgg 60 acaaaaattt tgcccattcc gtcctctgtg acggacaaaa atcggcatgt aaagatatta 120 aacaaatgaa attttgcaaa gatctccaga aattcagcac agatgccatt gaaccaagct 180 gagtctactc taccagcaaa atttgcccct caaaatgcag gaaatagcgt ttcagagggt 240 cttagatttc aaaattttcc cggacccccc taggaccgtc gcgyggaagc ctctggcgct 300 cgacaggatt tccattcaaa atccagggga cggaaaaaaa tttcaggctg gctagaacac 360 tg 362 // ID Copia4-NVi_I repbase; DNA; INV; 4731 BP. XX AC AAZX01004118; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia4-NVi; KW Copia4-NVi_I; Copia4-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4731 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1110-1110 (2007). XX DR Genome; AAZX01004118; Positions 6664 1934. XX CC Positions [1714-2244] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1807..3081,3085..4398) FT /product="Copia4-NVi_I_1p" FT /translation="MGKAKYIITFLDDLSRKTFVYFLKTKDEVPAVVQNFI FT KLVNNQTQKKVKIFRSDNGKEYVNQRLKAALEQLGVKHETSIPYVPQQNGR FT AERVNRTLLEKMRCMLAEAKLSKMFWAEAISTACYISNRSPRRCLNGRTPE FT EVWPGSQTDLSSLRVFGCRARAYVPSYQRSKIDLTSKPAILLGYCEDQKGY FT RLWNESERKVFTSCNVQFFEEPHGIAQKSKSVYFPVESENLDESTIPQNVV FT EPEKETIDVDLPVKKKIVEPVNKKDVEPAVEPVKKKIVEHVKKKIVEPVNK FT IICKNVPVEESEETRCVPEKENRKRDSQGLNHSSEPVIKRTKQTDSDILTL FT NPAAEKCSRSTQRPNTRSAGRQAKSVDEGSSDEDFSSVPIRKSMRRKLKPK FT HLDEFVTYSAISETGEPQSYQQEINGPEVEQINAMREEYQSIIKNSTWELC FT DLPHGQPVIGSKWILTTTYKARLVARGFSQVRGINYDETYSPVVRYTSLRL FT LFAYAARRDFDIFHLDVETAFLHGEMGKTVFLQQPQGFIKRGHETKVCRLK FT KAMYGLKQGSRNWNLKLDSAFKSLNLVQSNYDSCVYTYKTVNKSIIVALFV FT DDILVFTDSTDFFNILKDGLKSVCTIKDLGPVRRCLGINIHHERTLGVIEL FT AQKDYIATLLENFGVDECRPTTTPMDSNHTQAQNSFSEESLPVNQIPYQNA FT VGSLLYLAQATRPDLAYAVSTISKYNQNFNESHWNVVKRIFRYIQGTKDLR FT FRYSKQDNPSIIGYCDASWATDPDDSRYTSGYVFTCQGSAISWNSRRQATV FT ALSSTEAEYLSLSAATQEAMWLRRLSIELSIQPEEPFWFTVTTKVQSISPT FT ILVLVHVPSI" XX SQ Sequence 4731 BP; 1596 A; 943 C; 1048 G; 1144 T; 0 other; tagtgtaatt gagttgcaaa ttcattaaag ataatatatt cttaagagat agtgagtgac 60 tttattttac ctggcttcac acacctgctt ccatacctaa agcatcaggt tatgggccct 120 gccccattca aagaactccg aaaaagtttt ctttgccgtg atatgtgcga aaccgaggtc 180 gaacagtgcc ggctataaat atagcgaatt acgcaacact acagttagag tgcagaattt 240 gaaaatggcg agtcgcagtt caaacggtac ataccaactc atcgacaaat tagttggccg 300 tgaaaattat agacagtggt cagtagccat gagggcttac ttggaagtcg aagatctatg 360 ggacaccata gaagctcgca cagacggcac cctgtctaca gatccgaaaa agatgcagaa 420 aacgcgtggg cgcataattc ttgcagtgga atccgaagta tacgcgtaca tcgaaaatac 480 gacatcaccg caagaggtct ggaatgagtt ggcaaaaact tatgatgata agggactgac 540 cagaaaagta tcgcttctac aagcaactac cacgactaga ctgaaaaatt gcagatgtat 600 ggaagagtac gtatcaaata ttatttctgc tactaaaaag ctcagcgcaa gtggcacaaa 660 attgcaagaa gatttagtgg gtgcgctttt gctgtcgggt ttaccgtcat catatcagcc 720 tatgataatg gcgcttggaa gctccggagc agaaataacg gccgatttgg tcaaaaacaa 780 attattagaa gaaaactaaa atggcgtccg tgaaagcgac tccgaaacgc tcggactcta 840 cacgcaccag tcgcatcact caagcttcca gtctcgaaat tcgaacaaca gagggcgcca 900 cccatccaag caaaagcagc gtaatcagta tggtcacttg gcgaatagat gtacagtcca 960 gaatcgcaaa tcccaaaaag cttgcacagt agctacatca gctgacgaga gcgaatgtga 1020 ggacactgtg tgaccacgtt aaccatgcgt gatactcact cagccgtcgt cgacactgtt 1080 cttgccgatg ctgaagccca aaccacgata cctgtggaga cgttgtttgt agtggatcta 1140 gtgagtaatg caagcaaatc agaatggatt ttagactccg acgcgtcaat ccacatgtgt 1200 tgaagtgacg cgcatatgtc taacgtgcga aaaccaaaga ctgctacagt tacagcagct 1260 aacaaagcta aggtgccagt aaaagcagaa ggtgaattgg tcttacagta ttatgatgac 1320 aacaaatact ccaaagtgcg tcttgagaat gtgctttgcg tacctgacat aacatcaaat 1380 ctgatatcag tcagtgccct agtcaagaaa ggatttagag ttttatttaa agatactaag 1440 tgtgaggtga taggcaagaa caacgtggtt gtgctccaag gccaactaac aaacaacaac 1500 atctacaaga taaaccttag accaaattca caccaggaag caaccaagtc agtaatagct 1560 ctcaaggtca ctgaaaagtg tagcatagat ctgtggcaca gaagaatggc gcatttaaat 1620 cagtaatatt taaatcagct tcgccaggta accacaggta ttgactttga taagagccag 1680 gttggcaagt gtgaaatttg tgtcacagga acgctcgtgc aaaagccttt tcatcaaaac 1740 aataaaagta ccaacagtat attggacctt gtgcatagtg atgtctgtcc agtagatcac 1800 ctgtcaatgg gcaaggccaa gtatattatt acattcctag atgatctttc gagaaagacg 1860 tttgtttatt ttttaaaaac taaagatgag gttccagcag tggtacaaaa ttttattaaa 1920 ctagtcaaca atcaaaccca gaagaaagtc aagatattta gaagcgacaa cggtaaagag 1980 tatgtcaacc agagactgaa agcagcattg gagcagctcg gtgttaaaca cgagacctcc 2040 ataccgtatg ttcctcagca aaatgggaga gcagagaggg tcaacaggac cctattagaa 2100 aaaatgaggt gcatgttggc agaggccaag ctatcaaaga tgttttgggc agaagctatc 2160 tctacagctt gttatatttc taacagaagc cccagaagat gcttgaacgg acgtacgcct 2220 gaagaagttt ggccaggaag ccagacagac ttgtcgtctc tcagagtatt cggatgtcgt 2280 gccagggcct acgtgccaag ctatcagaga tcaaaaattg atctcacatc taaaccagct 2340 attctactag gctattgcga agatcaaaag ggctacagac tctggaatga gtcagaaaga 2400 aaagttttta cctcttgcaa cgttcaattt tttgaagagc cacacggcat agcccagaag 2460 tctaaatcag tatacttccc tgtagagtca gagaacttag atgaaagcac gataccgcag 2520 aatgttgttg aacctgaaaa ggagacaatt gatgtagact tacctgtaaa gaagaaaatt 2580 gttgaacctg taaataagaa agatgtagaa cctgctgtgg aacccgtaaa gaagaaaatt 2640 gtcgaacatg taaagaagaa aattgttgaa cctgtaaata aaattatatg taagaatgta 2700 cctgtggaag aatcagagga aactagatgt gtacctgaaa aagaaaatag aaagagagac 2760 tcgcaagggc taaatcacag ctcagaacca gtgatcaaga ggactaagca gactgacagt 2820 gacatcctaa ccttgaatcc cgcagcagaa aagtgtagca gaagcacaca aagaccaaac 2880 accagatcag ctggtagaca ggcaaagtca gtggatgaag gaagctccga tgaagacttc 2940 tcatcagtac ctattcgcaa atctatgcgc aggaagctta agccaaaaca tctggatgag 3000 tttgtgacat attcagctat tagcgagaca ggagaaccac agagttatca acaagaaatc 3060 aatggacccg aggttgagca gtgaataaat gccatgagag aagaatacca gtcaattata 3120 aagaattcta catgggaatt atgtgattta cctcatggac aaccagtcat cggatccaaa 3180 tggattttaa ctacaacata taaagctaga cttgtggcac gaggtttctc ccaggtgaga 3240 ggaataaact acgatgagac ctactctcca gtggtgcgtt acacatcact gagactgctc 3300 tttgcctatg cagccagaag ggacttcgac atctttcact tggacgtgga gacagccttc 3360 ttgcacggtg agatggggaa gacggttttt cttcagcagc cccagggatt tattaagaga 3420 ggacacgaaa ctaaggtatg cagattgaag aaagctatgt atggtctaaa acaaggaagc 3480 agaaactgga atttgaagct ggacagtgct ttcaaaagtc tcaacctcgt tcaatccaac 3540 tatgattcat gtgtttatac atataaaaca gtaaataaat ccattatagt ggcattgttt 3600 gtcgatgata ttttagtttt tacagacagc acagatttct tcaacatact caaggatgga 3660 ttgaagagtg tctgcacaat aaaggacttg ggacctgtcc gtagatgcct tggaatcaat 3720 attcaccatg agaggaccct gggtgtaatt gagttggccc aaaaggacta catcgccaca 3780 cttctggaga attttggcgt ggacgaatgt agaccaacta ccactcccat ggacagcaac 3840 catactcaag ctcaaaatag tttttctgaa gaatcgctcc cagtgaacca aattccttac 3900 cagaatgctg tgggatcgct tctgtatctg gcacaagcta cgaggccaga cctggcctac 3960 gctgtcagca ctataagtaa gtacaaccaa aactttaatg agtcacattg gaatgtggtc 4020 aaaaggattt ttagatacat ccaaggcacc aaggacttaa gatttaggta cagcaagcaa 4080 gataatccaa gtattatagg ctactgcgat gccagctggg ctacggaccc agatgactca 4140 agatatacat ctggctatgt ttttacctgt caagggagtg cgatcagctg gaacagcaga 4200 aggcaagcca ctgtggcact ttcaagtaca gaagcagaat acttatctct gtctgcagcg 4260 actcaagaag caatgtggtt gagaagactc agtatagaat tgtcgattca gcctgaagag 4320 cctttttggt ttactgtgac aacaaaggtg caatcgatct ctccaacaat tctcgtttta 4380 gtccacgtac caagcatata aacgtgagac accattttat aaaagagatc attcaaacca 4440 aacagataag ggtaagattc gcgagttcat ctcatattct agctgattct ctaacgaagg 4500 cagctactcc caggaaaatt caagattttg tcggtgcagt gggactcaag acaaatcaag 4560 agcagaagaa atctcaaggt cataacaagt gatcaagact gaagatagat tataaatgta 4620 tattttattc gattgaacct ttaaaaaaaa aaatttttat aatgtaattg tccgagtatt 4680 agtaattgtg atattgtact gcgaatgaca attatttagt tgagaggagc g 4731 // ID Gypsy-34_OD-I repbase; DNA; INV; 8760 BP. XX AC CABV01001480; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_OD_; KW Gypsy-34_OD-LTR; Gypsy-34_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-8760 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001480; Positions 61355 52596. XX CC Positions [3729-4196] - Integrase core CC 'ACCAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1881..5348 FT /product="Gypsy-34_OD-I_1p" FT /translation="MKELSIGDAPTKYRRQLEELISQYTDVFAIEDEKLGT FT TDAMSYKIDTGDAAPVASQRYKTPYYLRKELKKIIDANLESGLLEPCSSPW FT AAPVLLVKKANGKWRLVCDYRKLNTVTIANQYPLPDIDGLIDQMAESTVFS FT TADLFTGFHQIPCDPETKQKVAITTDFGQFTWTAMPMGGKNAPAVFQRMMD FT KLFSTIPNNRLAIYLDDLCLHSKTYEDNLFTIEQMLRILRKNNLKIRAAKT FT EFLKPRIKFCGAILENGFRYLNPDKTRAVRELGRPQNKKEAASVFGLLNYH FT RTFIPHFASKAAPINKAMSKGFKWTHEADEALELLKAEIGNFVDALKIPNP FT NTGEFAIETDASEKGIGAVLLYKKDSNSHFQPAAYLSQKFDEAQKNYNISE FT KELLAGKKAMEKWSHYLLGRQFLWFTDNSCVNWAHRIMSRKLKIAKWLAEI FT SDFDFKTVLKPSKQMVVSDCLSRFQAADPERKIEVNMVKPSEFIFLQESDP FT VLCEIMDYQTKDRWPTIQNEHIKPFSRLRSKLMYGKHGELGISKNGFKAIP FT PKALHKDILEEYHDNTGHPGVSQTLAEIENKYFYPSMRETVAAHIKSCTDC FT QRIKPVNNPLNAPLGHVKAPAQPFERYSVDLIGPLPLTDNHMRYICVSTDL FT FSKRTNATALRTKEPDEVLAALKNEWLRNPHLPREVLMDNGGEFKEVKDYC FT VSKGLKVSLSPAYHPQTNGECENRNRTLKSRLKLACKLENWDLFLPEAIHQ FT MNSAKHSVTKLSPFEIETGYPGENPNDKYKITQNRRDVNLQNIEEKIQNNH FT NKRKSEDEKIHDFKKNDLVLCKNTSPYEKIYKWMGPMRITEVRKQGLSFTL FT LNLESGRELTRHISHIKPFIQRESSEISDGIITSEKRPKDHKNEIKTHKKR FT ERIPYTIMTRSIKRDIERRNAPLASSREHQETILNTDSRESEASQTEEPAA FT SDQQTPNDSLTPINDPSNETVFYDCDSPTTTQEDNQSTDSNGHEVEFLTNK FT MEEEPKPRKLISPIVQRIAELHDRALDKFIKDYKCNIKITSWSALQNKKAK FT KLEKINEWIRLNHKDWEKDDDGFYLIKHNALMLDAKCYLSSFTILELKVLA FT KHMGLDIETTLRPKTAILAEFKTKAKEKFPFMKCTPSGNLIIDPAHFT" FT CDS 6828..8735 FT /product="Gypsy-34_OD-I_2p" FT /translation="MKIIFLFLANVAAHDYTFDHKNGLLFQENDPVWVYDA FT RVPVDVNIMLLSPREKFRAAFKADCGEEYLENKLEQNLFGRNDTNSTSFRE FT SSEDCLTAFRTFDNVILSFLGEDLVTSRQYNRTTADKRRRLYEEQYDREVK FT ERKLMEEKIRRGRRSIDPVSGLLIAGTLGYAISIDARSIQRDNLITQQINY FT ERERISELEEVVETVNDKLDLAIKRIRKSRRPIVSWGGLAIPDDAKAVKMI FT MEGADAEINQYFAQQSATLGREITQSVLTLQNHRLPLNPVFLDAIKAQCIA FT HQQTSEEEAKEFCTNYAFHSTRWDTRLRFNGMGISTWERKDGKALGKDDME FT IKQVVISVRIEIPRMKLKADKYTAINLGYFKDDQSRWTVEVPQHLVVMPSK FT EVLEMRPSDCDVFTPTYACSAVSLVPNQCAESILLHNSTKYCETREIDNKK FT CGYFEDTTRAFVSMRESGIAQFFHHAPSENVNKIDSVKKTKFPGVLDCGPV FT ILRISASMKAERNTTMIRYIDPIQIKMRSVQDEEMDSMNNKILHNLDTVKS FT MGNTILKMNMTTLEMMKTTAIVESKNAAEEAKDYIFKTFIKPLIGTMGTLA FT GIVLLAFTIYTLTCLRRKKRKTIIFGGLTRTNRSECSV" XX SQ Sequence 8760 BP; 3111 A; 2040 C; 1669 G; 1940 T; 0 other; tggtgaccac gagcgtccga cacccaattg aacttcttga agccttcgaa cacaatacag 60 agccaacgac aaagataacg atcaagacga acaaacgaca attaaagagc acaataacga 120 caatcaagag taaaattgac cgcatgcaac ttgtcttcca gacatcactc aaccttcacg 180 acatcaaaag cgaggagaga gactccttcg aaagagagtt taaggcaaca cgacgagaaa 240 tttgtctttc atacctcgat tcaagtttca gagaacgcta cgaggacgaa ttcgaagaca 300 taaaaggatc aagtgatttt tacactctgt taaacgagat cattggggga ggtgaatccc 360 gaagaagaag cacgagaagc acgatcaaga attaaggaaa tctcgagaag agtagatgaa 420 aatgaaacct tttcacgatt ttacacaaga cttgaaaaac tagcaaccac ggcgagtaaa 480 tcaaatgaaa cactcaaaaa acactatctc gatgaagcct tccattcaaa tcttacccca 540 gatctccgac gctaccttct tgatcaaggc agatcaaaag cttctacaaa agcgatcgcc 600 gactatctcg actcgatgaa gaaacacgaa aagagaatcg aaataaaagc agtcaccgca 660 gaagagacac ttctcagaga acaagtgtcg gctctcacac atcaattcac ccacctccca 720 cagctgctgg agaactcgct aggctcatca ctaaaatcat tgattgacgc aaaaatcgac 780 aatattcgaa gagaatttgc cgacattaac aaaatccagc cgagaaacga acctacacga 840 ggcaataagc aacaaccaac atacgaacaa agaaatgcaa tcaacacgac atctcatttt 900 acgagacaga atgtaacgac cgaccccgac cagaaaccct tgagacagct cgaaagacac 960 caggatggaa caccaatcac ctgtcacgcc tgcggagtta agggccactc aaagaagaac 1020 tgccgaggaa ctgtcatctg cagaaactgt ggccagcgag gacatattgc gggtatctgc 1080 cgaatgtcaa aaaactaaga aatggacttt atgaatcgtc acatagagtc catgcctccg 1140 tctcgtccat ctcaaagcga atcggaccaa aacttctctc tcacgcaatt atctacggtc 1200 agcgcattat ttttcaaatc gacactggag ctcaagtgtc ttgcctacct cattatctca 1260 tcccgtcatc aatgaaatca tcaatcgtgc cgtcaacaat tcaacttcag tcctataatg 1320 gtcaagatat ccaagtctac ggatgccttg ctgccgacat cacccttgga gaaatccaac 1380 ttccgaactg catctttcag attgttgcag acaactgctc accaatcctg gggactccag 1440 agcttgctga aaatggtatt gaaatcgatt ttcaaggagg acttattaga aaaggccttc 1500 aacaacaaca cttcacactg tgcgatagtc cagctgccgc aatcaagatc atctaaaaat 1560 caagtttctt cgagacatca gcaaccgccg ccgagtcact tacgataaaa ccacactcct 1620 caacctttat caacgtcaat ctttcagcct gtccgacaag ctatatttgt gccattcctg 1680 aacgaatctc caaaaataaa tcatttgaag tttacgacca atgtctacag ttacacagtc 1740 ttcaagaagc aactaagatt caaatctcca ataattcttc tttacctatt cacatcaagc 1800 aaaaggcaac gatctgcaaa ataagtgaag ttgaagtttc agtgccagat caagctgtta 1860 aaggtaaatt cgagacaatc atgaaagaac tttcgatcgg agacgcgccg accaagtatc 1920 gcaggcagct ggaagaactt atttcccagt acacagacgt atttgccatc gaagatgaaa 1980 aactgggaac aacagatgcg atgtcgtaca aaattgatac tggcgacgca gcacctgtag 2040 cttcccagag gtacaagacg ccatactacc tacgcaagga acttaagaag ataatcgatg 2100 caaacctaga gtcaggactc ctcgagcctt gctcgagtcc atgggctgca ccggtacttc 2160 ttgtaaaaaa agctaatggt aaatggcgcc tcgtctgtga ctacagaaag ctcaacacag 2220 ttacaatagc caatcaatat cccttaccag acatcgacgg actaatcgac caaatggctg 2280 agtcaactgt cttttcgacg gcagatctgt ttactggatt ccaccaaatt ccttgtgatc 2340 cggaaacaaa gcaaaaagta gcgataacta ctgattttgg ccagttcaca tggactgcaa 2400 tgcccatggg aggcaagaac gccccggcag tattccaacg aatgatggac aaactcttca 2460 gcactattcc gaacaaccga ctcgcaatat acttggacga cctgtgctta cactcgaaaa 2520 cgtatgagga caatctcttt acaatcgagc aaatgctgcg aatcttacga aaaaacaatc 2580 tcaagatccg cgccgcaaaa acggagttcc ttaaacctcg aatcaaattc tgtggcgcta 2640 tcctcgagaa tggattcaga tatcttaatc cggacaaaac aagagcagta agagaacttg 2700 gtagaccaca aaacaaaaag gaagcagcca gtgtttttgg tcttcttaac taccaccgaa 2760 catttattcc tcattttgct tcgaaagcag cgccaataaa taaggcgatg agcaaaggat 2820 tcaaatggac acacgaagcc gacgaagcac tcgagcttct caaagctgaa attgggaact 2880 ttgttgatgc actgaagatc ccgaatccga atactggaga atttgcgata gaaacggatg 2940 ctagtgaaaa gggcatcggc gcggttcttc tctacaaaaa agacagcaat tcccattttc 3000 aacccgcagc ttacttgtct caaaaattcg acgaggctca gaaaaattac aatattagcg 3060 aaaaagagct cctcgctggc aagaaagcga tggagaaatg gagtcactat cttcttggac 3120 gccaattttt atggtttaca gacaactcat gcgtcaattg ggctcacagg attatgagta 3180 gaaagcttaa aattgctaaa tggttggcgg aaataagcga ttttgatttc aaaacggtac 3240 tcaaaccgtc gaagcagatg gtcgtttctg actgcctgtc aagatttcaa gcagctgatc 3300 cagaacgaaa aattgaagtt aatatggtca aaccgagcga gtttatcttc cttcaagagt 3360 cagacccagt tctttgtgaa ataatggatt accaaacgaa ggaccgctgg ccaacaattc 3420 aaaacgaaca cattaaacct ttctcacgtc ttagaagcaa attgatgtat ggaaagcatg 3480 gagagcttgg tatttcaaaa aatgggttta aagcaatacc accgaaagca cttcataaag 3540 acattcttga agagtaccat gacaacacag ggcacccggg agtttctcaa acacttgccg 3600 aaatcgaaaa caaatacttt tatccatcga tgcgcgagac ggtagcagct cacataaaat 3660 cttgtacaga ttgtcagcga ataaaaccag tgaataatcc gcttaacgca ccacttggac 3720 acgtcaaagc acctgcacaa ccatttgaaa gatactcagt cgatcttatt ggaccactgc 3780 cgttgacaga caaccatatg agatatattt gtgtgagcac tgatcttttc agcaaacgta 3840 caaatgcaac tgcactccga acaaaagagc cagatgaagt actcgctgca ctaaaaaacg 3900 aatggcttcg aaacccacat cttcctcgag aagttttaat ggataatgga ggcgaattca 3960 aagaagttaa agactattgt gtatcaaaag gtctcaaagt atcactgtca ccggcatatc 4020 acccgcagac taacggcgag tgtgaaaatc gtaaccgcac actgaaatca agattaaaac 4080 tagcttgtaa acttgaaaat tgggatcttt tcctccctga agcaatccac caaatgaatt 4140 cggcaaagca ctccgtcaca aaactgtctc cgttcgagat cgagaccggc taccccggag 4200 agaatcccaa cgataaatat aaaatcacgc agaaccggcg agatgtcaat cttcaaaaca 4260 tcgaggaaaa gattcaaaac aatcacaata agcgaaaatc ggaagacgaa aaaatccatg 4320 actttaagaa aaatgacctc gttttgtgca agaacacctc accatacgag aagatctata 4380 aatggatggg accaatgcgt ataactgagg tccgcaaaca aggattaagc ttcacgcttt 4440 tgaatcttga atcaggaaga gaactgacgc gacacataag tcacattaaa ccattcattc 4500 aacgagaatc aagcgaaata tctgacggaa ttataacaag cgaaaagagg ccaaaagatc 4560 ataaaaatga aataaaaact cataaaaagc gtgaaagaat cccatacacg attatgacaa 4620 ggagtataaa acgtgatatt gaaagaagaa atgcaccttt agcaagcagt cgagagcacc 4680 aggaaaccat cttaaacaca gattcacgag aaagtgaagc aagtcaaacg gaagaacccg 4740 ccgccagtga ccaacagaca cctaacgata gcttaacgcc aataaacgac ccttcgaacg 4800 aaaccgtctt ctatgattgc gactcaccaa cgacgacaca agaagataat caatcaacag 4860 acagcaacgg acacgaagta gaattcttaa cgaacaaaat ggaggaagaa ccgaagccac 4920 gtaaacttat ctcgcccata gttcaaagaa tcgcagagct tcacgaccgg gcacttgaca 4980 agttcataaa agactacaaa tgcaacatca aaataacatc atggtctgct cttcaaaaca 5040 aaaaagcgaa aaaactcgaa aaaataaacg agtggatccg ccttaatcac aaagactggg 5100 aaaaggatga tgacggattc tatttgataa aacacaatgc gctcatgctc gacgccaagt 5160 gctatctcag ctcatttaca atattagagc ttaaagtact ggcaaaacac atgggcctcg 5220 acattgagac aactttaaga ccaaaaacag caatactcgc agaattcaag acgaaggcga 5280 aagaaaaatt cccatttatg aaatgcacac caagcggcaa cttaattatc gatccagcac 5340 atttcaccta aataaaaatg ttttcttaac gccgcttctc caaatctagt tcaccgaaga 5400 caactacgat gcaaaatcaa aattttgact tctttacaat accatatcac aaaaacgaag 5460 cttttccaca agtttttcct acacttgttt taaagccagt tatctacgct caactttaag 5520 cactttaaca attgtctcca aaagatcgac acgaaaagag agccagccga aatctactat 5580 taatttttaa tagtagaacc cacctttttc tcgaaatcct tacaaatcgc ctccgaaatg 5640 tctgattgca tgactaaatg gctttttgga aagcgctaag cgtcagcttt cttctgaaca 5700 cttcagcaac aaaatgtgac atctaaagaa aaaatcgaaa ttttggcact ctacttcacc 5760 gaaaacgatg gaaaatcaaa aatttcgatt tctcttcaat accatatctc aggaacgaag 5820 catttcaaca agtttctttt acacttgttt tgaagccagt tatctacgct ccattttaag 5880 cacgttcaca agtgtgtaca aaagatcaac acgaaaagag agccagccga aatctactat 5940 taatttttaa tagtaggacc cgtcactttc ccgaaatctt caaaaatcgc ctcagaaatg 6000 tatgattgca tgactaaatg gcttattgga aagcgctgag cgtcagcttt attctgaaca 6060 ctccagcaac aaaatgcgac atctaaagaa aaaatcgaaa ttttggcact ctacttcacc 6120 gaaaacgatg gaaaatcaaa aatttcgatt tctcttcaat accatatcac aggaacgaag 6180 catttcaaca agtttctttt acacttgttt tgaagccagt tatctacgct ccattttaag 6240 cacgttcaca agtgtgcaca aaagatcaac acgaaaagag agccagcaga aatctactat 6300 taatttttaa tagtgggagc caccattttt ctcgaaatct tcataaatcg cctcctaaat 6360 gtctgattgc atgactcaat ggctttttgg aaagcgctga gctttcttct gaacacttca 6420 gcaacaaaat gcgacatcta aagaaaaaaa tcgaaatttt gaccccattt ggggaaaatg 6480 ttaggattcc taacattttc ctcaaagacg gcataattca tcgctgaaac acccgaatac 6540 atcaccaaag ggctttttaa aaaagctttt tctgcactct tcactttaag ccgcttccga 6600 caatcgagct ttaaatcaaa catagcgtga aaagacgaaa acttcaagca gtaaatcggc 6660 taagcgggat accgaccgtg aacagtaaca aacagctcgc tacacacagc tgacccgatt 6720 acgtttcaca catgacctgc tgattggaat caacgacaga tggcgcgcag cccaaaaacc 6780 aatataaagc cagtccaact cgaaagaaag catcaatcaa aaaaaaaatg aaaatcatat 6840 ttttgttttt ggcaaacgtt gcagcccacg actacacctt cgaccacaaa aacggcttgc 6900 tctttcaaga aaacgatcca gtctgggtct atgatgcaag agtacccgtc gacgttaaca 6960 ttatgctgct ttcaccaaga gaaaaattcc gagctgcctt caaagcggat tgtggagaag 7020 aataccttga aaacaagctc gaacagaatc ttttcggtag aaacgacacc aactccacat 7080 cgttcaggga gagcagcgaa gattgcttga cggcttttag aactttcgac aatgtaatcc 7140 ttagttttct tggtgaagat ctagtgacat cccgacaata caatagaacc accgcggaca 7200 aaagaagacg gttatacgaa gaacaatacg acagagaggt caaggaaaga aagctgatgg 7260 aagagaaaat aagacgaggc cggagaagca ttgatccggt ttcgggactg cttattgcag 7320 gaactctcgg atatgctatc tcaatcgacg caagatcgat tcaaagagac aacctgatca 7380 cacagcaaat caactatgaa agagaaagaa taagcgagct cgaggaagtc gtagaaacag 7440 tcaatgacaa attggacttg gctataaaac gcataagaaa atcaaggagg ccaatcgttt 7500 cctggggcgg tcttgcaatt ccggacgacg cgaaagcagt aaagatgatc atggaaggag 7560 cagacgcaga gatcaaccag tatttcgcac agcaaagcgc aacactagga agagagataa 7620 cacaatctgt tctaactctc cagaatcacc gtctgccact caacccagtc ttcttagatg 7680 caatcaaggc tcaatgcatc gcccatcaac aaacgtctga agaagaagca aaggaattct 7740 gcaccaatta cgcctttcat tcaacaagat gggacacccg acttcgattt aacggcatgg 7800 gcatctcgac ctgggaacgc aaagacggca aagcactggg aaaggacgat atggaaataa 7860 aacaagtagt catttcggtc cgcatcgaaa taccaagaat gaaactgaag gccgataaat 7920 acacagctat caatctaggt tatttcaaag acgaccaatc aagatggaca gtggaagtcc 7980 cacagcatct ggtagtgatg ccaagcaagg aggttctaga aatgcgcccg agcgattgcg 8040 atgtcttcac acctacgtac gcatgctcag cagtgtcact tgttccgaac cagtgcgcgg 8100 agtcaattct gctacacaac tcgacaaagt actgcgaaac acgagaaatc gacaacaaaa 8160 aatgcggcta cttcgaagac actacgagag cgttcgtctc aatgagagaa tcgggaatcg 8220 cccaattttt tcatcatgct ccatcagaaa acgtcaacaa gatagacagc gtcaagaaaa 8280 caaaatttcc aggcgtactt gattgcgggc ctgtaatatt gagaatcagc gcgagcatga 8340 aagcagagag aaacacgacg atgataagat acatcgaccc catccagata aagatgagaa 8400 gcgttcaaga tgaagagatg gactctatga acaacaaaat ccttcacaac ctcgacaccg 8460 ttaaaagcat gggaaacaca attctcaaga tgaacatgac tactctggaa atgatgaaaa 8520 caacagcaat cgtcgaatcg aaaaacgccg ctgaagaagc aaaagactac atattcaaga 8580 cattcatcaa accgcttatc ggaaccatgg gaacgctcgc tggaattgtc cttcttgcat 8640 tcaccattta cacccttact tgcctacgaa gaaaaaaacg aaaaacgata atttttgggg 8700 gactcactcg aacaaaccgc tccgaatgct cagtttgaaa acccgccgac tgacgatggc 8760 // ID Gypsy-12_SI-LTR repbase; DNA; INV; 597 BP. XX AC AEAQ01024794; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_SI_; KW Gypsy-12_SI-I; Gypsy-12_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-597 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024794; Positions 268 864. XX SQ Sequence 597 BP; 159 A; 140 C; 192 G; 106 T; 0 other; tgtaagaaat atgcgagccg cggcgaaagc gacaaagcga acggcggccg accgatggcg 60 ccaggtgatc ggcgaagcga gcgggagcgc gagctgtcat gctccgaaat catgacacgg 120 tacgtcatag cggaccgtta aacgaagggc caatagaata tcgatagggg cgcgagctcc 180 aaaatcgcgc gaaggagaga tttcgaggac tcgagagaga gcgagcctgg ggaccggtcg 240 tgtcccaagc cggagtcgtg tcgtgccgtg ccgaaagccg tgcagccaaa ggctgaatag 300 taaagcttgt gagagtgccg aaaataaagt gtcgtcgcaa cgcatcttgt cgtatcattc 360 accctgctcc cttccctgga cgttcaagca agaaccagag tgccgggcca gccatctctt 420 gcattggagc acgtggcgag agagtggagc cggcttacag gaggacaagg agcagaggta 480 ttaagtcgtg tcgtgtcgtg tcgtgtcgtg tcgtgtcgtc agagatagga aaagcgtgag 540 agaagccatc cgggaaacat ttaaagtcga gttcaccaaa aggaaacgta tctttca 597 // ID BEL-85_AA-I repbase; DNA; INV; 5928 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-85_AA_; KW BEL-85_AA-LTR; BEL-85_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5928 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR [2] (Consensus) XX CC 'AGAGG' target site duplication CC LTRs are 99% similar to each other. CC An internal insertion of Gypsy LTR was removed. CC Therefore, the sequence is labeled as "consensus". XX FH Key Location/Qualifiers FT CDS 40..4431 FT /product="BEL-85_AA-I_1p" FT /translation="MPETDRNDCKVCRKSNNLDNMVLCPRCEEWYHYRCAG FT VDESVADRSWICANCLILPPVDPPGQTSTPVSNNPVGLTAPSANHPGIVTS FT APFSSSSLTTVATTASNGQSILTEQARASLRWIQEQREILEREMEENHRQE FT MERKRMELKKLANKAMMGIIDTTKDGLANSLDEQPGAAAAGICKVQNWMQQ FT VVGEMKDLSVAQPSGRAEVITSGQPTFDISSVPTTASVNLSSFDPISSSTL FT HGVRDRSSIVPPTTFASAQLCSVASSQSVNLPPFTISSVPQTTVAPPQLIS FT SVQAEFDSSVRLGTMPGGLVSRVPTIPGWGINAPTNVVTACSTMPALLNYS FT GLMPSERQPVVPPAPASTFLPGFSTNGREVNSSQFSQQLPLPGYTAAFGPA FT GAPPNIPNNVPAFPRLYDPVGQHRAPTQHQLMARQVMPKDLPPFRGDPEDW FT PLFYSAYVNSTTACGYTDVENLARLQRALQGRALDAVKSRLLLPACVPQVI FT NTLYMLFGRPELIIQTLLNKIREIPPPRSDRLDTLISFGMAVQNLCDHLEA FT AGQLAHLCNPTLLLVEKLPAQQRLDWALYKRQFVAVDLRTYATYMSTLVAA FT ASDVTVLGDIKLSRASKGDRAKEKNFLNAHSVAELPKRESKEETTPIFLCL FT ACNGAGHKVKDCGVFKKWGPESRWKIVQDHHLCRICLGKHGRRPCKSQTCC FT GIEGCQQRHHQLLHTQSHKSHPPAEPKAAKPGNGSGEGVNAHRTEEKSTLF FT RILPVQLSWNGKSVETFAFLDDGSSMTLVEQSVADCLGIDDGESLPLCLTW FT TSNVNRKIPNSQRVSLKISSPGKSEQYALIDTRTVGNLKLTTQTLKYEELA FT RKYAHLRGLPIKSYDSVVPGILIGSNNAGLIATLKLREGQLGDPLAAKTRL FT GWTVYGFAADHTKTENFSFHICECHMEQGTDQELHDLVKQHFTLESVGVSA FT DRTPESDEDRRARCILEETTKRTSKGFETGLLWRTDSVELPNSYPMAVRRL FT ECFERRMNKDPGTRDSVQRQIQEYLENGYIHEVTPHELEYTDPKKVWYLPL FT GVVRNPKKPNKIRLVWDAAAKVGGVSLNDMLLKGPDLLVSLAAVLCGFRQY FT RVAVCGDLKQMFHQFRIRQEDVHSQRFLYREHPSKPIKIFVMDVGSFGATC FT SPSQAQYIKNLNAEEHESEFPQAAVAIKKKHYMDDYLDSFDTEDEAIKVAL FT EVKTVHDRGGFEMRNWHSNSTTLLERVGEPKQQQMKAISIDTESEAERVLG FT LLWLPTEDSLAFAADLKLDGTVPTKRNILRWVMSVFDSQGILSHITVQGRM FT IIQDTWRSQVTWDEEVNDATQTQWRKWIKLLEEVNKIRLPRAYFPGLSAVD FT IGMVDLHIFTDASEEAYAATAYFRTVVNGKVYCALVMAKAKVAPLKAVSVP FT RLELMGAILGARLAKAVCEYHTIPIRRRVLWTDSLATLAWIQSQHRRNSN" FT CDS 4943..5926 FT /product="BEL-85_AA-I_2p" FT /translation="MAPLPDARLAATVRPFSYVGLDYFGPIQVKVGRSCVK FT RWVALFTCLTIRAIHLEIAHSLSTESCKMAVRRFVARRGSPLEIYSDNGTN FT FQGAGRELREQIEAIGKGLEETFTNTSTKWIFNPPSAPHFGGMWERLVRSV FT KMALGSLCTDRNPDEETLATVIAEAESVVNSRPLTFIPLDDFAQEALTPNH FT FLLLSSNGVAQPPKALAKLTKVTRSNWTTNQQLIDQFWKRWVREYLPTIAR FT RTKWFNECDEVKVNDLVVVVNENQRNGWARGRVLSVIPGRDGRIRQAMVQT FT SGGVFRRPVSKLAVLQVRDMSNAQSSVNPEVRYGPG" XX SQ Sequence 5928 BP; 1606 A; 1460 C; 1586 G; 1276 T; 0 other; aaagcttgaa gatattgacg taaacaactc ggtcacagga tgccggagac ggacaggaac 60 gactgcaagg tttgccgcaa gtcgaacaac ttggacaata tggtcctctg tccacggtgc 120 gaggagtggt accattatcg gtgcgctgga gtagacgaat ccgtcgcgga tcgaagctgg 180 atatgcgcga attgcttgat tctgccacca gtcgatccac ccggccaaac aagtaccccg 240 gtgagtaata atccagtcgg gttgaccgcc ccctctgcta accatccggg aatcgtgacg 300 agtgcgccat tcagttcatc atcgcttacc acggtggcta ccacagcgag taatggacag 360 tcgatactga ctgagcaggc aagggcgagc ctccgctgga ttcaggagca acgcgagata 420 ttggagcgcg aaatggagga gaaccatcga caagagatgg aacggaagcg gatggaattg 480 aagaagctgg caaacaaagc gatgatgggg attatcgaca ccacaaagga cggattggca 540 aacagtttgg acgagcagcc aggagcggct gcggcgggaa tctgtaaagt gcagaactgg 600 atgcagcagg tggtcggcga aatgaaggac ctctcagtcg ctcagccgag cggaagggca 660 gaggtgataa catcgggaca acctacgttc gacatcagtt ctgttccgac aaccgccagc 720 gtgaatttaa gctcgtttga tccgatttct tcatcaacgc ttcatggcgt tcgagatcgg 780 tcatcgatag tgccaccaac aacgtttgct tctgctcaac tgtgttcagt tgcatcctcc 840 caatcagtaa atctcccccc ttttacaatc tcctcagttc ctcaaaccac ggtagcccct 900 ccgcagttga taagcagtgt acaagcggag ttcgattcct ctgttagatt gggaactatg 960 ccagggggat tggtcagtcg ggtgccaact ataccagggt ggggaataaa tgcaccaaca 1020 aacgtagtga cagcttgttc tacaatgcca gcgctattga actattccgg attgatgccg 1080 agtgaaagac agccggtggt acctcctgca cctgcctcga cattcttacc aggattctcg 1140 acaaatggga gagaggtaaa tagtagccag ttctctcaac agctgccgct acctgggtat 1200 acagcggcct tcggtccagc tggcgctcca ccaaacatac cgaacaacgt cccagcattt 1260 ccgagactct acgatccagt gggacagcac cgtgctccca cacaacatca attgatggcg 1320 aggcaggtta tgccaaagga tctccctccc tttcgcggtg atccagagga ttggccattg 1380 ttttatagcg cttacgttaa ttcgacaacg gcgtgcggct atacagacgt ggagaactta 1440 gccagactgc aaagggcgct gcaaggtagg gcgctagacg ctgttaaaag tcgcctactg 1500 cttccagcgt gtgtgccgca ggtcatcaac acactctaca tgctgtttgg tcgtccagag 1560 ctgataattc agacgcttct gaacaagatc cgcgagatcc cgccaccgag atcggatcgg 1620 ctagatacac tgatctcatt tgggatggct gtccagaatt tgtgcgatca tctggaggca 1680 gcagggcagt tggcgcactt atgcaacccg actctactac ttgtggaaaa gcttccagca 1740 caacaacgtt tggattgggc attatacaaa aggcaattcg tagcagtaga ccttcgtacg 1800 tatgcaacct acatgtctac attggtggca gcggcctcgg acgttacggt tctcggagac 1860 attaaactat cacgagcgag caaaggagac cgagccaaag agaagaactt tctcaatgct 1920 cattcagtag ccgagctgcc gaagagggaa tcaaaagaag agaccactcc tatatttcta 1980 tgtttggcgt gcaacggtgc cggtcataag gttaaggact gtggcgtctt caagaagtgg 2040 ggtccagaaa gcagatggaa gattgtgcag gatcatcacc tgtgtcgaat atgcttggga 2100 aagcatgggc gtcgcccgtg caagtcgcaa acctgctgcg gaattgaagg ctgtcaacag 2160 cgccaccatc agctactcca cacgcaatca cataagtcac accctccagc tgaaccgaaa 2220 gcggcgaaac caggaaatgg ctccggggaa ggagtaaacg ctcatcgcac tgaggagaaa 2280 tcgacgctgt ttcgaattct tccggtacaa ttgtcttgga acgggaagtc tgtcgagacg 2340 ttcgcttttc tggacgacgg atcatccatg acgttggtcg aacaatccgt tgccgattgc 2400 ttgggaatcg acgacggtga gtctctacct ctatgcttaa catggaccag caacgtgaac 2460 cggaaaatac ccaactctca acgagtctcc ctaaagatct ccagtccagg gaaatccgag 2520 caatacgcac tgatcgacac tcggacagtt ggcaatttga aactcacaac gcaaaccctg 2580 aagtacgagg agctggcacg aaagtacgcg catctacgtg ggttgccgat taaaagctac 2640 gactcggttg taccagggat tctgatcggg tcaaacaacg ctggtttaat tgcaacgtta 2700 aagctgcgtg aaggtcaact aggcgacccg ttagcagcta aaactcgtct agggtggacc 2760 gtttatggat tcgctgctga tcatacgaag acggaaaact tcagcttcca tatttgtgag 2820 tgccatatgg aacaagggac agatcaggag ctacatgacc tggtgaagca gcatttcacc 2880 ctcgaaagtg tgggagtatc agcggatcga acccctgaat cggatgaaga cagacgagct 2940 cgctgtatcc tggaggagac aacgaaacgg acttcgaagg gcttcgaaac aggtctactg 3000 tggcgaacag actccgtgga acttcctaat agctatccaa tggcagtgcg ccgtctagaa 3060 tgcttcgagc ggagaatgaa caaggaccca ggcacgcgtg atagcgtgca gcgtcaaatc 3120 caagagtatt tggagaatgg ctacatccac gaggtaacgc cacacgaact ggagtacacc 3180 gatccaaaga aggtctggta cttaccactg ggggttgtta ggaatcccaa gaagccgaac 3240 aagatcaggc ttgtctggga cgcagcggca aaggttggag gtgtatcatt gaatgatatg 3300 cttctgaaag gtcccgacct gctagtttct cttgctgcag tgctgtgtgg tttccgacaa 3360 taccgagtgg cagtctgtgg cgatcttaag caaatgttcc accagtttag gatacgccag 3420 gaagacgtgc acagtcagcg gttcctgtat cgtgagcatc catcgaaacc aatcaagatc 3480 ttcgtaatgg acgtgggtag ttttggtgcc acttgctccc ctagtcaagc gcagtatatt 3540 aaaaacctca atgcggaaga gcatgaatca gagttccctc aggcagctgt cgccatcaaa 3600 aagaagcact acatggatga ctaccttgat agcttcgata cagaggatga agcaatcaaa 3660 gtagcgttgg aggtaaagac ggtacatgat cgcggcggct tcgaaatgcg gaattggcac 3720 tcgaattcga cgacactgct ggaacgggtc ggagaaccga agcaacagca aatgaaggcg 3780 ataagcatcg acaccgaaag cgaggcggaa cgagtgttag gcttgttatg gctaccgacc 3840 gaggatagtt tagctttcgc agcagatctg aagctggacg gaactgtccc tacgaagcgg 3900 aacattctac ggtgggtgat gagcgtattt gattcccaag gaattctgtc gcatattacc 3960 gttcaaggac ggatgatcat acaagatacc tggcgaagtc aagtcacttg ggacgaagaa 4020 gtcaacgatg cgacgcaaac ccaatggcgc aagtggatta agctattgga ggaggtgaat 4080 aaaatcagac tccctcgagc ctactttcct gggctctccg cagtggacat tggaatggtc 4140 gatctacata tcttcacgga tgccagcgaa gaagcttacg cggctaccgc gtacttccgc 4200 accgtggtga acggtaaggt ctactgtgca ttggtgatgg ctaaagcaaa ggtagcccca 4260 ctgaaggctg tgtcggtacc gcgattggaa ctgatgggcg cgattttagg agcaagatta 4320 gcaaaagcag tctgcgagta ccacacaatt cccattcgac gtcgagtatt gtggaccgat 4380 tcattagcaa cgctggcttg gattcaatcg cagcatcgaa gaaacagcaa ctagtcactg 4440 gctgtttgaa aaaggaggag cttcagcaag ctgaaaacag tctctggcgt atggcccaag 4500 cagcggcgta cccggacgag atagcagttt tgaaacaaca gctcacaaac tccaagaagc 4560 aaattgaaaa atcaagcccg gtctataaac tctcaccgta tattgacgaa cattgtgtaa 4620 tgcgcatgga tagccgtatt gggatgcttc catatatcgc ttatgatttc aagtttccag 4680 ttatcctgcc acgtaatcac cgactaacaa ttctcctcat caactgcttg taacgttggg 4740 cgccaattgc aatggatgat gggcaaagaa ccaaaatggt aacaactggt accaccgtag 4800 gtacttgcac gcgaacaacg agacagtgca caacgaaatc cgccaacgat ttcatgtttc 4860 gaatcttcgt acagtggtga ggcaagtggc aaaaaactgt caagcttgca aggtttccaa 4920 agccttgccg gtgactccca agatggctcc actacccgac gctcgtctcg ccgctactgt 4980 tcgcccattt tcctacgtcg gactcgacta ctttgggcca atccaggtaa aggtgggacg 5040 cagttgtgtc aagcgatggg tggccctttt cacgtgtctg acgatcaggg caatccactt 5100 ggaaatcgcg cattcgttgt ccacagagtc gtgcaagatg gctgtgcgac gtttcgtagc 5160 tcgccgagga tcgcccttag aaatctactc ggataacggc accaattttc agggtgcagg 5220 ccgagaactg cgagagcaaa tcgaagctat aggtaaagga ttggaggaga ctttcaccaa 5280 cacgagcacc aagtggatat ttaatccacc gtccgcgcca cactttggag gaatgtggga 5340 gcgcctcgtc agatcggtca aaatggcatt ggggtccttg tgtaccgatc gaaatccaga 5400 cgaagagacg ctggcaaccg ttatcgctga agcggaatcc gtggtcaact cacggccact 5460 aacctttata cccttggacg acttcgccca agaagctcta acaccaaacc acttcctcct 5520 gttaagctcc aatggggtag cacaacctcc caaggcatta gcgaagttaa cgaaagtgac 5580 gcgatcgaat tggaccacga accaacaact aattgatcag ttttggaaac gctgggtgag 5640 ggagtacctc ccaacgattg ccagacggac taaatggttc aacgaatgcg acgaggtgaa 5700 ggtaaacgac ctggtggtcg ttgtgaacga gaatcaacgt aacgggtggg ctagaggccg 5760 agtgctctcg gtgattcctg gccgagacgg ccgcatccgt caagcgatgg tgcagacctc 5820 aggcggagtc ttccgtcgac cagtatcaaa attggcagtg ctgcaagtgc gagacatgag 5880 taacgcacag tcttctgtga acccggaagt gcgctacggg ccggggga 5928 // ID BEL-3-I_HM repbase; DNA; INV; 5627 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5627 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 434-434 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 459..5627 FT /product="BEL-3-I_HM_1p" FT /translation="MSSIAGKRRVRTITVTAIKNIFERANECMTNFSQDSN FT SFEELSSFKEVLIEKYNKVKAIDDDLINLIEDDEELLNEENLANEFSLFFK FT KNIKIIEKFNNNIYKSINESQSSSLKSFSNNNIKLPPLKLSTFSGKPEEWQ FT TFYENFECAIHNNNDLSPIQKLNYLRNLLDGKALKTISGLALTNDNYHTSL FT ELLKERFNDKQLLISTHMKSLLSLERVQSINNISLLRRIHDNIEVQIRSLE FT NLGIDSTMYGPLLIPIIMQKIPEELNLIIARNFDSCDYWDIKYVLKSLKAE FT LQAREKAHSGRECNFDPSTAEVLHSSASFKPQYELKCIFCSKSHKSESCKT FT VTNINSRRNILRDKRRCFNCLKSGHMSKDCRSRISCYECSGKHHVSMCFKK FT FPLHIDDSKESNNKLTSVATTEALKSKNVLLQTAMVTVKNKDKTLRCRLLF FT DSCSQLSYISPSLRKKLFLPTLGRKQLNIKTFGNQSQTQXLEMVKLSILTL FT KNSLIPITCFVKPICAPLSGQYIEEAVDSFEHLKGINLADSNVDCKSLEVD FT VLIGADYYWSFFNGKVIRANTGPVALQSSVGYILSGNVNASSGKDCCVFST FT HTLKVETEFINEIPQFNENIKKFWELENSGVEDNIFDHFKSNIYFDNKECR FT YTVKLPFKNNHALLSDNYSLCIKRLNVILNKLKNDSELLNEYNKIIIEQLN FT SGVIEKVNNYDSSIGEVHYLPHRPVCRKDKTTTKVRIVFDASSTLSGPSLN FT DCINAGPSLATPLFFILLRFRANKYAFIADIEKAFLQIALDKNDRNYLRLI FT WFDDIYNINNSNLFSSALATYRICRVPFGVTSSPFLLNATLIYHAERYCLN FT DNNISSKLLQSLHIDDLISSCVTIEEGVLFFNKCKDILKEGGFNLRKFESN FT SSAFEKQINGDNYERQTNTRVLGLKWNKTEDSIIYSFEDLLNVASIVPTKR FT EVMSFIASIYDPIGLINPVVVTCKNLFQRICVSKLGWNENLYGDLLSTWNL FT ILSDFKSVFQIAVPRWYMSPGLGNINNVKFELHGFSDASVKSFGCCVYLRF FT FNANFSRASLIASKSRVAPLGKNTMPRLELSATLLLAKLLASIYDQLISIY FT NISNIVYWTDSTICLHWIFNTNNTYEQFVQNRLNKIRELTLICNWNYIESF FT RNPADIISRGSSLKKLNNNELWFYGPNFLNDINIKWPYYEHVNHSNETLEV FT LCNVVHVKVNVNLDFINVDKFSDFRYLLRVTSWILRFINNAKDKKKNIFKT FT GLITAEEIDNAKRLWIKFSQINLIDENKFKQLRKDLRLFVSDGIYRCRGRI FT EHADLEYDTKFPIFIFNNCYFAKLLILHFHKLVKHNGVKETINALRSQYWI FT PCCRQLAKSVIRECTTCRRIEGRPYSYPPAPPLPESRLNSDFAFKCIALDY FT AGPLYIKDIYGDCTLNKAWIFLFTCCSSRSILLDLVADCSSRSCIMGIRRF FT IGRRGVPEIIYSDNGSQFVSTETQYFAANHSIKWKFNAPSAPWWGGMFERL FT VRMTKRCLKKALKNSKSSYEEVRTLLSEIEMVLNNRPLTYLYNNQGDEALT FT PNHLVFGHKLKLESSLSACNDIEQDVHIRNSMLSNTLNHFRSRFKSEYLTE FT LREYHKSKRSNGCNGIRVNDIVLIESDNCKRQLWKLARVEELLYSNDGVVR FT VAKVCYKQENGSSCLITRPINKLYPTEYSDNEKIVTVAFINEKNIPLLVNG FT GRE" XX SQ Sequence 5627 BP; 1900 A; 734 C; 1006 G; 1986 T; 1 other; tttggagccc acggagatat ttgttgctga tcctaaagtt atcagctatc gttcaagttt 60 attgctgatc ctaaagttat cagctatcgt tcaagtttat tgctgatcct aaagttatca 120 gctatcgttt aagttgctgt tgatcctgaa gttatcagct gtttttaaag ttgttgctgt 180 tctgaaagtt accagctgtt attaaatttc gttgctgatt ctcaagttcg aacaaagatt 240 aaagtttgtg agaagataaa gttgaagtta agtactaatt gatattgtac ttgttttttg 300 ggtcaatcct ttttttgggg ggttgtttga tgcacgtgtt tttattagca ttaaaaaatt 360 atatatattt gtaatttatt tgtgtttgtg tcagaattga ccatagtttt ttttaagtaa 420 gcattatttg ttgttgacat ttatagggtt tttatataat gtcgagtatt gctgggaaac 480 gacgtgttcg taccattacg gtcactgcta ttaaaaatat atttgaaagg gccaatgagt 540 gtatgactaa ttttagccag gatagtaact cttttgagga actatctagt tttaaggaag 600 ttttgatcga aaaatataat aaagttaaag caattgatga tgatttgatc aatttaattg 660 aagatgatga ggaattgtta aatgaagaaa atttagctaa tgagttttct ttatttttca 720 aaaagaacat aaaaataatt gaaaaattta ataataatat ttataaaagt attaacgagt 780 cgcaatctag cagtttaaaa agtttttcaa ataataacat taaacttcca cctttaaaat 840 taagtacctt tagtggaaag ccagaggaat ggcagacctt ttatgaaaat tttgaatgtg 900 ccatccataa taataatgat ctttccccaa tacaaaagct taactatcta agaaatttac 960 ttgacgggaa agctttaaaa acaatttcag gcttagcatt aacaaatgac aactatcata 1020 catcattgga attacttaag gaaagattta atgataaaca actgttaatt tcgacacata 1080 tgaagtcact tctatctcta gagagagtgc aaagtattaa caacatttct ttattaagac 1140 gtattcatga taatatcgaa gttcaaatta gaagtttaga aaacttaggt atagattcaa 1200 ctatgtatgg acctttattg attcctatta taatgcaaaa aattccagag gagttaaatt 1260 tgattattgc tagaaacttc gattcctgtg attattggga tataaaatat gttttaaaat 1320 ctttaaaagc tgaattgcaa gctagggaaa aagctcactc aggtagagag tgtaattttg 1380 atccttcaac agcagaagta ttacattcta gtgcttcctt taagccacaa tatgaattaa 1440 aatgcatttt ttgtagtaaa agtcataaaa gtgaaagttg taaaactgta acaaatataa 1500 attcaaggag aaatatttta agagataaaa ggcgttgctt taattgcctg aaatcaggtc 1560 atatgtctaa agattgtcgt tctagaataa gctgttatga gtgctcaggg aagcatcatg 1620 tttctatgtg ttttaaaaag tttcctttgc atattgatga tagtaaagaa tcaaacaata 1680 agttaacgtc tgttgctact acagaagcat tgaaatctaa gaatgtattg ttacaaactg 1740 ctatggttac agtaaaaaat aaagataaga cactaagatg tcgcctttta tttgacagtt 1800 gttctcagct aagttatatt tccccatcat tacgaaagaa gctctttcta cccacattag 1860 gtagaaagca attaaatatt aaaacttttg gcaatcagag ccaaacccag amcttagaga 1920 tggttaaact ttctatactg actttaaaga attctttaat acctataact tgttttgtta 1980 agcccatatg tgctcctctt agtggtcagt atatagagga ggctgtcgat tcctttgaac 2040 atttaaaagg aatcaattta gctgatagta atgtagattg caaatcattg gaagtagatg 2100 tacttattgg agcggattat tattggtcat tttttaatgg caaggtaatt cgcgccaata 2160 caggtcctgt tgctttacaa tcatcagttg gttatatatt aagcggaaat gtaaatgcat 2220 catctggaaa agattgctgt gtcttttcca cacatacttt aaaagttgag acggaattca 2280 ttaatgaaat tccccaattt aatgagaata ttaaaaagtt ttgggaacta gagaattcag 2340 gagtggaaga taatattttt gatcatttta aaagtaatat ttattttgat aataaagagt 2400 gtagatatac agttaaatta ccatttaaaa ataaccatgc attattgagc gataactata 2460 gtttatgtat aaaacgtctg aatgtaattc ttaataaatt aaaaaatgat tctgagctat 2520 taaatgaata caataagata attatagagc aattaaattc aggggtaata gagaaggtaa 2580 ataattatga ttcgagtatt ggtgaggtgc attacttgcc acatcgccca gtatgtcgta 2640 aagataaaac tacaacaaaa gtgagaattg tttttgacgc aagttccact ctatcaggtc 2700 catcattaaa tgattgtata aatgctggac caagtttagc tacacctcta ttttttatat 2760 tattacgttt tcgtgctaat aaatatgcct ttattgcaga catagagaaa gcatttttgc 2820 agattgccct tgataaaaat gatagaaatt atttgcgtct aatatggttt gatgatattt 2880 ataatattaa taatagtaat ttgttttcat ctgcattggc tacttatcgt atatgtagag 2940 tgccttttgg tgttacttcg tctccatttc tcctcaatgc cacattaatt tatcatgctg 3000 agagatattg ccttaatgat aataatattt cctcaaagtt gttgcaatcc ttgcatatag 3060 acgatttgat ttctagctgt gttaccattg aagaaggtgt tttatttttt aataaatgta 3120 aagacatatt aaaagaagga ggttttaact taagaaagtt tgagtctaat tcttcagctt 3180 ttgagaaaca aattaacggt gacaattatg agagacaaac gaatacaaga gttttaggcc 3240 tcaaatggaa taagacagaa gattcaataa tttattcatt tgaggattta ttaaatgtag 3300 cctcaatagt tccaacaaag agagaagtga tgagttttat tgccagtatt tatgacccaa 3360 taggtcttat taatccagta gttgtaactt gtaaaaattt atttcaaagg atttgtgttt 3420 ctaagttagg ttggaatgaa aatttgtatg gtgatttatt gtctacttgg aatcttattt 3480 taagtgattt caaatctgtt tttcaaatag cagtacctcg atggtatatg agtcctgggt 3540 tgggtaatat aaataatgta aaatttgaat tacatggttt ttctgatgca agtgttaagt 3600 cgtttgggtg ttgtgtttat cttagatttt ttaatgctaa ttttagtcgt gcctctttga 3660 tagcctcgaa gtctagggtt gccccattag gcaagaacac aatgcctcga ttagagcttt 3720 ctgcaacttt acttttagca aagctattag catctattta cgatcaattg atatctattt 3780 ataacatttc aaatatagta tattggactg attcaacaat ttgtttgcat tggattttta 3840 atactaataa tacctatgaa cagtttgtac aaaatcgttt aaataaaata agagaattga 3900 ctttaatttg taattggaac tatattgaat cctttaggaa tcctgcagat ataatatcac 3960 gaggatcttc gcttaagaaa ttaaataata atgaactttg gttttatgga cccaattttt 4020 taaatgatat taatattaaa tggccatatt atgaacatgt taatcattca aatgaaacac 4080 ttgaagtttt gtgtaatgtg gtacatgtta aggttaatgt taatcttgac tttattaatg 4140 ttgataaatt tagtgatttt aggtatttac ttcgagtaac atcgtggatt cttcggttta 4200 tcaataatgc gaaagataaa aagaagaaca tcttcaagac tggtttaatt acagctgaag 4260 aaatagataa tgcaaagcga ttgtggatta aattttcaca gataaatcta atcgatgaga 4320 ataaatttaa acaattacga aaagatttac gtttatttgt tagcgatggt atatatcgat 4380 gtcgtggcag aattgaacat gctgatttag aatacgatac aaaatttcct atttttattt 4440 tcaataactg ttattttgct aaattattaa ttttacattt tcataagctt gtcaagcaca 4500 atggtgtgaa agaaacaatt aatgctttaa gatcccagta ttggatacct tgttgtaggc 4560 aattagcaaa gagcgtgata cgagaatgta caacttgtcg acgtatagag ggtaggcctt 4620 attcataccc acctgctcct cctttgcctg aaagtcgttt aaattctgat ttcgctttta 4680 aatgtattgc tttggattat gctggtccgc tgtatattaa agatatttat ggcgattgta 4740 cgttgaataa agcatggatt ttcttattta catgttgcag tagtcgttcg attttattag 4800 acttagtagc tgactgttca tctagatcat gcattatggg tatccgtaga tttattggaa 4860 gacggggtgt tccagaaatt atttattcag acaatggttc acaatttgtt tcaactgaaa 4920 ctcagtactt tgctgcaaac cattctataa aatggaagtt taatgcgcct tcggcaccgt 4980 ggtggggagg gatgtttgag cgactagtac gaatgactaa aagatgtttg aagaaagcct 5040 taaagaattc gaagtcttct tatgaagaag tacgaacgtt gctttctgaa attgaaatgg 5100 tactaaataa ccgaccatta acttatttgt ataacaatca aggtgatgaa gcactcacac 5160 ctaatcactt agtttttgga cataagctta agcttgaatc aagtcttagt gcttgtaacg 5220 atatagaaca agatgtgcat atacgaaaca gtatgttatc taatacattg aatcatttta 5280 gaagtagatt caaaagtgag tacttgacag agttgcgtga atatcataaa tcgaagcgga 5340 gtaatggatg taatggtatt cgtgttaatg acattgtatt aattgaaagt gataactgta 5400 aaagacaatt atggaaatta gcacgtgttg aagaattgtt atattcaaac gacggagttg 5460 ttcgcgttgc taaagtttgt tacaaacaag agaatggtag ttcgtgttta attactcgac 5520 ccattaataa actatatcct actgaatatt cagacaatga aaaaatagta actgttgcct 5580 ttattaacga gaagaatatt cctttacttg taaatggtgg ccgggag 5627 // ID Gypsy-605_AA-I repbase; DNA; INV; 5353 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-605_AA_; KW Gypsy-605_AA-LTR; Ty3_gypsy_Ele44; Gypsy-605_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5353 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3021-3521] - Reverse transcriptase CC Positions [4659-5126] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 639..1793 FT /product="Gypsy-605_AA-I_1p" FT /translation="MPDGHDDDSEEDHSNVVAVDKLSENHDELAVARNDSN FT LLSTMSSWTLSTINIPECTPSQGEDEIDKRAYEYWKDTFVSSIKLINVTDE FT LVLFGLFKVKAGSKLREIYQTTVSDPGMSNEISCPFSNAMERLDEYFGSRT FT YILAQRGKLMNMTQTPTETSIQFVRRVGTAAKLCNYTEEEEMEAVVRVMTK FT GAMDARVRVLAHRNWVKQGSMKDLIDLVRDRELERANEEEFQRSHRVEETK FT AIAALSQQPREFPRQTQNYRPNLSENWRFGFRGRHGSNRGRGRGLPNRGFQ FT HNRSYAPTNNCWRCGSVYHQANQCPVIPKVCRNCGIKGHIARVCSEVPGQM FT WQSGPRKRHNVVKEEAISPKIAAIEDVGDRKDEVREIEEVSP" FT CDS 2931..5234 FT /product="Gypsy-605_AA-I_2p" FT /translation="MPPSRNVYTHIPAVYKDLTKKKLNDLLCAGIIEKVTT FT DMDRSFCSSLLVVPKGKHDIRLVTDLRGPNRCIYRTPFKMPTFESILLELH FT GAQWFSTIDLSSAFFHVELNEDSRHLTNFFAGDQIYRYRRLPFGLCNAPDI FT FQEALQTIVLAGCKGTVNYLDDIMVSGRTKAEHDENLAKVMACLKNHNVKI FT NEEKCVFGKQSVNFIGHTVSSKGWMISDDKIAAIRGFRNPETMCEVKSFLG FT LINFVDRFIHNRADKTQRLRELAKSDHFYWNKDLADEFEYLKNDALNVITK FT LGYFSKDDKTEIYVDASPYGLGAVLVQFDEHSVPRVIGCASKALSETEQRY FT PQTQREALAMVWGIERFSMYLMSLNFTVRTDAESNEYIFGGLHRIGKRAVS FT RAEAWALRLQPYQFDVERVPGKTNVADALSRLVEKSQTAESFDDTNEKHLL FT FLLDSGMLEISLDDIESHARDNEEIIRVKSSIETGFWETGLRRFECQSKEL FT RVFGSLVYRDEQIVLPDTLRQKAIDAAHQGHVGVGSTKRILRDYFWWPRLS FT KDAEARLKQCETCLRLSKKNRPIPLSSRSLPSGPWEILQIDFFTDKSFGTG FT EFLVIVDTYSRYLHVIEMKCMDAETTNAALNKVFLNWGYPLIIQSDNGPPF FT QSSKFVKIWEDKGIKIRKSIPLSPQTNGAVERQNEGIKKALAAAKLDNVNW FT RQALDNYVHVHNKVRPLSRLGVTPFELLVGRKFRGTFPCMWSGASETLDRQ FT DISEKDAMSKLIKQYAD" XX SQ Sequence 5353 BP; 1643 A; 1029 C; 1295 G; 1383 T; 3 other; atggcgcagt catccacatc gtgaggtgag ttgaagagaa caatgaagtg tgatagtgcc 60 acgaatcaaa ttccgatgga aatggacgga atgttgtaat cgaaaaaaaa aatcattttg 120 ctttgctatc ccgattgaaa taggcggaat agatgtcgat tggtgcttgg tgttcggtgg 180 aaacagacga atacaattca agcgttaaac gtgtgcgtaa gcctctccgg tggaaataga 240 cggagagaga tcatattttt wtattccgat ggaaatgggc ggaatacttt ttcaatacag 300 tggttgaaaa cttccgtgga aaaggacgga gtattttact aaaagtgttc ggtagaaaag 360 acgagcacga cgagcaaaaa aaaaaatacc tatattcatt tgtcaattgt agtccgattt 420 tcgtagaaaa cggaacgcat acatttagtg cattgaaact cattttgttt ccacctggac 480 ctgtgtgata gcattggatt atacggttag actagcttat ttttttttct ctcgaatcta 540 atttcagaga aacgaaaacc gccatcaaga agcgattgcg tgccgctctt gaagaaaacg 600 attctttgcg ggcgcgtatt gctcagctcg aagagtcaat gcccgatggt catgacgatg 660 acagtgaaga ggaccattcc aatgttgtgg ctgttgataa gttgtcagaa aatcacgatg 720 agcttgcggt tgcaaggaac gattcgaatt tgctgtcaac tatgagcagc tggacattga 780 gcacgatcaa tatacccgaa tgcaccccat cgcagggaga agacgaaatc gataaacgtg 840 cctacgaata ctggaaggac acatttgttt cttccattaa gctgataaat gttactgatg 900 aactcgtatt gttcgggctt ttcaaagtta aggcgggatc gaagttgcga gaaatttacc 960 aaacaacggt gtcagaccca gggatgtcca atgaaatctc gtgtcctttc tcgaatgcta 1020 tggagcgtct ggacgaatat tttggttctc ggacatacat ccttgcccag agaggcaaac 1080 taatgaatat gactcaaaca cctacagaaa ccagcattca gttcgtaagg cgtgtgggaa 1140 ccgcagccaa attgtgtaat tacaccgagg aagaagaaat ggaggccgtt gtacgagtta 1200 tgacaaaggg cgcgatggat gctagggtcc gagttcttgc tcaccgaaat tgggttaaac 1260 agggatcgat gaaggacctc atcgatctcg taagagatag agaactcgaa agagcaaatg 1320 aggaagaatt ccagcggagt cacagagtcg aggaaacgaa agcgattgca gctctttcac 1380 aacaacctcg agaatttccc agacagacgc agaactatcg cccaaaccta tctgaaaact 1440 ggcgcttcgg attcagaggt cgacatggat caaaccgagg acggggtcga ggattaccaa 1500 atcgaggatt tcagcacaat cgatcctatg cccctaccaa taactgttgg cgttgcggca 1560 gtgtttacca tcaggctaac cagtgtccgg tgattccaaa agtttgccgc aactgtggwa 1620 ttaaggggca tattgcgcga gtttgttcgg aagtgccagg tcagatgtgg cagtcaggac 1680 ctcgtaagcg tcacaacgtt gtcaaagagg aggcgatcag tcccaagatc gcagctatcg 1740 aggatgtagg agataggaaa gatgaggtac gagaaattga agaagtttct ccttgattga 1800 tttttwttgc tgtttttttt tctttgggaa aatttgaagt aaataaatga actgttgatt 1860 ttttttttgt actgattaca gagaaaacag attattgcat ctgaaatgga aatcgctcaa 1920 caaacttcaa aatccgccga aaaaaccata caatttgata atttccttcc gaagcagtcg 1980 tcccacgata ttcagagaat cgttaccgaa agcatcgctt acattgatac tcatgtggcg 2040 aaaacacccg tgaaattttt cattgattca ggagcgcagg tcaacaccat cataaagaag 2100 gattttgaga gaattctagc taatactgac ggcagagata acattctggc gttggaatat 2160 tctacggaca agcgtcttaa agcgtacgct tctgataatg aaatcaaggt gattgctaac 2220 ttttcggctg aactagttat ctctgaagac agaccagtat cgattgaaaa attttatgtc 2280 gtggatgaaa tacgatcact tttgggattt aatacggccc ttagatacag tgttctcgaa 2340 gtggggttgg atgttgcagt taatgctctt cgcgcgaact cctcttggaa gtgtgagctc 2400 gacaacgatt tgtgctgtcg tgctgtcacg tgcttgatca gcaaaaggag ttcccgaaat 2460 ttaacataac tccggtatca ttgagatacg ataagaccat gccgccctca cgtaatgttt 2520 acacacacat accggcggtg tacaaagatc tcactaagaa gaagttgaac gacctgctgt 2580 gtgccggaat aattgagaag gtgacgaccg atatggatcg atcgttttgt tcatcgctac 2640 tagttgttcc caaaggaaaa cacgatattc gtttagtgac cgacctccgg ggacctaacc 2700 ggtgtattta tcgaacacca tttaagatgc cgactttcga gtcaattcta ctggagcttc 2760 atggagcaca gtggttttca actattgatc tgtcaagtgc ttttttccat gtggagctta 2820 atgaagattc gcgccatctc accaattttt ttgctggtga tcaaatttat cgataccgac 2880 gtcttccatt cggcctgtgc aatgcaccgg acatctttca ggaggcttta caaactatcg 2940 tattggccgg gtgcaaagga accgttaatt atttggacga tattatggtc tccggaagaa 3000 ctaaggcaga acatgatgaa aacctcgcaa aggttatggc atgcttgaaa aatcataatg 3060 tcaagattaa cgaagagaaa tgcgttttcg gtaaacagtc cgtgaatttc ataggacaca 3120 cggtttcatc caaaggatgg atgatttctg atgacaaaat tgcagccata agaggatttc 3180 gaaatccaga aacgatgtgc gaagtgaaaa gttttttggg actcataaac tttgtggatc 3240 gtttcattca caatcgagcg gataaaactc aaaggctccg agaactggca aaatcagacc 3300 atttttattg gaataaggat ctcgcggatg aatttgaata tctaaaaaac gatgcactga 3360 acgtgattac caagttggga tatttcagca aggacgacaa aacggaaata tatgtggacg 3420 cttctccgta tggcttgggt gcagttcttg tgcagttcga tgaacattca gtgcctaggg 3480 tgattggatg tgcttccaaa gcattatcgg agacagagca aagatatcct cagacacaac 3540 gagaggcatt ggctatggtt tggggaatcg agcgtttctc tatgtacttg atgagtttaa 3600 acttcacagt aagaacagat gcagagtcta atgagtacat tttcggagga ttgcaccgta 3660 ttggcaaaag agcggtatcg agagcagaag cttgggctct gagactccag ccataccaat 3720 ttgacgttga acgtgttccg ggaaaaacga acgtggccga tgcactctcc agactggttg 3780 aaaaatctca gacagcggaa tccttcgatg atacgaacga aaaacattta ctgttcctgc 3840 tggattcagg aatgctggaa atctcattgg acgatattga atctcacgca cgagataacg 3900 aggaaataat tcgagttaaa agctccatcg aaactggttt ttgggaaact ggtttgcgac 3960 gtttcgaatg tcaatcaaaa gagctacgtg tctttggatc attggtatac agagatgaac 4020 agattgtact cccagacact cttcgacaaa aagcaattga tgctgctcat caagggcatg 4080 ttggagtagg ttccacaaaa agaattctgc gagattactt ttggtggcca cggctgagca 4140 aagacgcaga ggctcgtcta aaacaatgcg aaacgtgtct tagattgtct aagaagaatc 4200 gacccatacc tctgagcagt aggagtcttc caagcggccc gtgggaaata ctgcaaatag 4260 attttttcac ggataaaagt tttggaactg gagagttcct agtgatagta gatacgtatt 4320 cacgctacct tcacgttatt gaaatgaaat gtatggatgc agagactacc aacgcagcac 4380 tcaacaaagt tttcttgaat tggggatacc ccttaattat acaaagtgac aacggccccc 4440 cattccaaag ctccaaattt gttaagatat gggaggataa aggaattaaa attcgaaagt 4500 caattcccct gagtccacag accaacggcg ctgttgagag gcaaaacgaa ggcattaaaa 4560 aggcgttagc ggcggccaaa ctagacaacg tgaactggag acaggccctt gacaactacg 4620 tacacgtgca caacaaagta agacctcttt caaggctggg cgtaaccccc ttcgagcttt 4680 tggtaggaag gaagttcaga ggcacgtttc cgtgtatgtg gtcgggcgca tccgaaacac 4740 tcgatcggca ggatattagc gaaaaagatg ctatgtcaaa gttaattaag caatatgcag 4800 actaaagagc aaatcttcag atttaatcgt gggagacaga gtgtttttaa ctcagttgaa 4860 gcgatacaaa tccgacccaa actttggcgg cgagaggttt acagttgtag ccagagatgg 4920 agcaaagata gtggtacgca gcgatcgagg aatacttttc gccagaaatg tagcagacgc 4980 gaagagaatc gaggatatca tggatggtac aaatgtcgac gatgattcac gttcggaaaa 5040 caatacggca tttggtaagt aaaaaaaaga agaagaaacg ttttgagcgg ttttgggtag 5100 ttctttatcc tcaatttttt tttttaattt tctgcagact tgcttcacaa agagcatcag 5160 gagatggaaa acgtgagtga tcggcgttca gagggaacat cggaagtgcc cgaagaaggt 5220 agcctgctcc aggatcaatg taaacccgat gaagtttcat gcggaagacc caagcgtaat 5280 tatcgaaaac cgagaaaatt tgatgatatg gttctgtata cgattttcga ataagtgtag 5340 aggtgggaag aca 5353 // ID Mariner-29_HM repbase; DNA; INV; 4191 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-29_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4191 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1963-1963 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1740..3398 FT /product="Mariner-29_HM_1p" FT /translation="MVRNYKPKRQKNYNAEIFQAALQTIIDGETISTAAKQ FT FNIPRTTLIDKLRGKHQTHSIGGKTIFFKNQEETLVQRLLYMADRGFPLTI FT MWLRSAAYFYAKSLCRRHLLQKSIPKSWHNKCLASRDWFLSFRKRHTQLTL FT RIPEGLSRARAEAFNENRVQTFFNDLQPFYEQLDIKNYPNLIYNCDETGLS FT SVPNVSVKVIALKGSHEVQKLTIGERGTLTTYLATVNAAGDSLPPFFIFKG FT TKIPDASKFPLGTKVHCSQSGYIDQEIFIAYLHHFNEFCTKVNGKKVLLFL FT DGHKSHVTIEAIELAIKMDIEIICIPPHSTHRLQPLDTHVNKVVKHMWSEA FT LSNYLAENDDISLPRCNFHIIFNKIWPNILTKRGLIVNAFYHCGLHPLRNP FT VKDFEYKKAEVFNKATIHQGDQNLSILRTIIPSPEKKRKPTHSKRHIALIT FT SPGNVKLIKEPPAKKAAIKNVVSVKQKSKKDHDKSVKTKSKKTFFSADSDH FT PSSSSFILKNNTDKDFCCVCAAVWAEASEDWLKCLNCNQWACEGCFGVDTC FT ANCVQ*" XX SQ Sequence 4191 BP; 1534 A; 546 C; 597 G; 1514 T; 0 other; gggtaaggcg gggtgaattc gtcacggagc gaatccgtca cttatgcgac tcagctttaa 60 aataataatg cataaaaact aagaaatcta ttaactaggt tttttcgaaa ctttttcaac 120 tgatatttgt agaatattac taaataaatg ttttatatat ttacctccga ttattcttta 180 aatattgcaa ataaacattg aagttgaaaa aatatttttc tgcttaaatg cctatcgtca 240 aaaaattatc aaaatacgca atgttttgac cacaaagtat atattttttg attgattgat 300 gtcagtagtt tttaaataat aatttaagtt actatgtttt tgcctttgga atcgtcacgt 360 aatgattgtt agcacatgct caaatgaggg gcaacttcgt cgttgaagaa aagttaatac 420 gaaaaataaa ttaggttgtt tcatcatgca aatttttaaa ctaccgagcg ccgtttgaaa 480 gaaaaacaaa ttgtgatcaa aatgcgctcc aaagctttag acatttataa gagaaaggct 540 cttcagaata tattaattta cagttagctt ttaaaacttt ttgaagtttt aattataaac 600 acgactttga tgtgaaacat gtaaacaaac ttattataat gtatagtctg aaatatatat 660 acagtaaata ataagtatat aaaacatata tgttatatat ttttaggcta tccactgcct 720 atttttatac atatttaaaa tatgtataag tattaaatat tgttaaaata ttattgaaat 780 tcaataaata ttattagaat agttaatatt atttactaat tttacattaa aataagactt 840 ttattatata ttatatagca aagatcttat tctattttaa aaaatatata taatattata 900 ctatttagta taactattat atattgtaaa aagtaaatag taagtatata gtataaaatg 960 tgtgtacagt ataaaatatc tgttataagt attttttgta gtatttatta taaacaaatg 1020 tatcattcat tggaaaatat aaagcattta tatatatata tatatatata tatatatata 1080 tatatatata tatatatata tatatatata tatatatata tattttttgt gtatatatat 1140 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 1200 atatatatat atatatattt atatatatat acatatatat atatatatat atttttttta 1260 tatttttacg tatatatata ttatatagca aagatcttat tctattttaa aaaatatata 1320 taatattata ctatttagta taactattat atattgtaaa aagtaaatag taagtatata 1380 gtataaaatg tgtgtacagt ataaaatatc tgttataagt attttttgta gtatttatta 1440 taaacaaatg tatcattcat tggaaaatat aaagcattat atatatatat atatatatat 1500 atatatatat atatatatat atatatatat atatatattt ttttgtgtat atatatatat 1560 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatgt 1620 atatacatat tgtatatata tatatttatt tatatatata ccaatataaa aaaaactaag 1680 aatgtattat tacttatggt agattttaat aacattattt ttcatgttaa aacgtataga 1740 tggtacgtaa ttataaacca aagcgacaaa aaaattataa tgcagaaatt tttcaggctg 1800 ctctgcaaac cattatagat ggagaaacta tatcaactgc agcaaaacaa tttaatattc 1860 cacgtacaac gttgattgat aaattacggg gaaaacatca aacacattct ataggtggta 1920 aaacaatttt ctttaaaaat caagaagaaa ctcttgttca aaggctgctt tatatggctg 1980 atagagggtt tcctctcaca ataatgtggt taagatctgc tgcttatttc tatgccaaaa 2040 gcttgtgtag acgtcattta ttacagaagt ccattccaaa aagttggcat aataaatgtt 2100 tggcatcacg tgattggttt ctctcattcc gaaaaaggca tactcagttg acactaagaa 2160 ttcctgaagg gctttctcga gcaagagcag aagcatttaa tgagaatcgc gttcagacat 2220 tttttaacga tttacaacct ttttatgaac agcttgacat aaaaaattat ccaaatttga 2280 tttataactg tgatgaaact ggcctttcat ctgtgccaaa tgtctctgtt aaggtaatag 2340 cattaaaggg ttctcatgag gtacaaaagt taactattgg tgaacgtggc acactaacaa 2400 catacctggc cactgttaat gcagcaggag acagcttacc accatttttc atatttaagg 2460 gaacaaagat tccagatgca tctaaatttc ctttgggtac aaaagtgcat tgtagtcaaa 2520 gtggatatat tgatcaagag atttttattg cttatttaca tcattttaat gagttttgta 2580 caaaagtgaa tgggaaaaaa gtattgttat ttttggatgg ccacaaaagt catgtgacta 2640 ttgaagcgat tgaattagca ataaagatgg atattgaaat aatatgcatt ccacctcata 2700 gtacacatcg ccttcaacca cttgacacac atgttaataa agtagttaaa catatgtggt 2760 ctgaagcctt gtctaattac ttagctgaaa atgatgatat ttctttgcct cgttgcaact 2820 ttcacattat ttttaacaaa atatggccaa atatacttac aaaaagagga ttgattgtta 2880 atgcgtttta ccattgtggt ttgcatcccc taagaaaccc agtaaaagat ttcgaatata 2940 aaaaagcaga ggtttttaat aaagcaacaa ttcaccaggg agaccaaaat ttatctattt 3000 taaggactat tataccttct cctgaaaaaa aaaggaaacc tactcattca aaacgccaca 3060 ttgctcttat tacatcacca ggtaatgtta agttaataaa agagcctcct gctaaaaagg 3120 ctgcaataaa aaatgtggtc agtgttaaac agaagtcaaa aaaggatcac gataaatcag 3180 ttaaaaccaa atcaaaaaag actttttttt cagctgattc tgatcaccca agttcttcaa 3240 gttttattct aaaaaataac actgataaag acttttgttg tgtttgtgca gctgtttggg 3300 cagaagcaag tgaagattgg cttaaatgtt taaattgcaa tcagtgggca tgcgaaggat 3360 gttttggagt tgatacatgt gcaaattgtg tgcaatgaac atataaacgt cttgtaaatt 3420 tttaaatagt catagtagtg atatgatttt tgttatttaa atagtcatag tagtgatatg 3480 attttttgtt gctttaacgt gctatttttt gcgctagcgc ggttatacgg ttttcattta 3540 tgccaaaatc attatttgat attttcttgg aaaaagttta acaattgatg tccttttaga 3600 tgtccttttt gatgtccttt ttgatgtcct tttaccctag gccactgcag tcgaggaggc 3660 tactttagtt gtgtttacaa ccctctctca actctataac tccgaaacac gaaccttgac 3720 gatcaaggtt cgtgtttcgg agttatagag ttgagagaaa aacaagttga gcgcagtact 3780 accagggacg cggtgggaat cgaactcgca acttctaggt ttacttctag attagggaaa 3840 tcatcctact gtgacctcaa aaatgtttta tgtcaaagtt tttttctcct aggtaagttg 3900 cctttaccac tagaaaggaa tttcgtctat cgcccgagtc acgaattttt aaattagttt 3960 agctgtactg acgaattcgc cccgtaattg acgaattcgc ccctcatgac ttgatgatga 4020 aaaaatcaaa ttatacgcta aagtaacttt agaattcatt ttgattcaat acattaacaa 4080 aatagagacc ctaattaata aaaaaaagtt tttaaactag aaaagttcac ttatttatat 4140 gcgaattcat tttgttaaag taaaacatga cgaactcgcc ccgccttacc c 4191 // ID hAT-7_HM repbase; DNA; INV; 4597 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4597 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1996-1996 (2008). XX DR [1] (Consensus) XX CC This is a relatively young family: youngest sequences are >99% CC identical to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2396..4444 FT /product="hAT-7_HM_1p" FT /translation="MSKSIFISTLVNGEKINRTFLIYSESKGSIFCAPCYL FT FGGTTSFATVGFSDWKKGEEKIKRHENSQHHKLCVMNMKERKEVLNRVDQK FT LKYQVETEINYWKNVLTRVVAVVKSLSSRGMSFRGDDDRFGSVHNGNFMMS FT LELIAQFDPFLAQHIEKFGNKGKGSTSYLSFNIYEQFISIMADNVIQQMVK FT EIKEAKYFSISIDSTPDISHVDQLSFIFRYVQKNGCPVERFLGFLPNSGHK FT SEELADAVFLVLESHGLDINNCRGQSYDNASNMSGRYTGLQARIKKVNPLA FT TFVPCSAHSLNLVGECAVDCCIYASEFFILLQNIYKFFSASTYRWEILQKN FT LIKTENVSLKKLSDTRWSARSDASISLNKNWIEIINALSFIKDDKTEKSIT FT RSEANGLYLKLDSLETAIMATLWGDILERFNKTSKQLQSVEIDLETVVSLY FT ESLIRYVLDLRSMFDIYEERAINKSGHKTYRKIRKRKRKLTADEKREEETN FT FKIRDSLRINTYYAIIDKLHSELERRKLCYDEANKKFNFLFQIIKLSPSEV FT YKKAEVLQNVYKNDLSSSFANECIQFRSYLMSLTENLRPKTIIDVYKMIRT FT EKLQELFPYVDIALRIYLCCPTSNCSAERSFSALKRVKSYLRSRMTSDRLN FT RLAILSIESVLTIDMNFNDIINTFAKQNSHRKL*" XX SQ Sequence 4597 BP; 1686 A; 627 C; 686 G; 1598 T; 0 other; cagagacgta tttacggggg ggccaggggg ggctttggcc cccgggcgcc atgttttgac 60 agggcgccag ctttcagaat gttagaattt tagaaaattg taaaatagtc tatttatatc 120 atttcacgaa aatttttata ttattattat gttcaaatca tagacctact tatcgactta 180 ctaaatatct ctagttgatt ttagaaaaat ttagttagca gaagaaaata tcaatcaaaa 240 ggagatcaaa ttaaacttgc aaattataca taattatgcg ccaattattc atctaaaaac 300 gttgatttga tactgaaagt cacgtgtcta aacaaattat ttatttatta taaatatttt 360 aatgatacgg tatgtagcga tagcttaacg gttaagctca tcactctgac gctggcgctc 420 cgaggtctac tcctacgagt gttcttattt aatatactat ctaatcttga tataaataaa 480 atttatattt ttttttgaat aaatgattta taatttttat taatcaaata cttgatcctt 540 atttattaat tagtaaataa tatactagaa aaaaataaat ggtgcgtttt acctaacatt 600 ttcattttac tcttaattga ctttattaac agaaatactt aggtatagac acagctattt 660 acttttaaaa aatgaaataa ttcttcaaag attctcttta cattttctaa aaaaaaaaaa 720 ctctttaaaa atactttaaa ctttaaataa tcagtttgtt ttttaattta tttttatata 780 atttgaagtg atgagcagaa taaattacgc attaattata ttaaattagc aatttgactt 840 tgatcgcctt ttttgctaac catcttgtat ataccacgtg tgcacaatat aattttgaat 900 ttttaagatt gctgtcagtt ttttttattg ttggcctaag tattttgttc ttcggtatgt 960 tgtctgctta atcccttgta ttacagtata cagtcttgaa aaaggtaaat attcgtattt 1020 cgattattgt ttttagtttt agtattaaca ttattgttat catatatata tgataacaat 1080 atatatataa tacacacttt ttaggattct tattttaaaa agcaatgatg attcatagta 1140 aaaagcttag tggcgcagaa aaccgaaaac gaacccatgc aaaaaatgaa gctcaagcaa 1200 aaatactttt aaagactgct aagttagaaa catattttag ctttacagat aaacagcttc 1260 acacatcgcc gggtaagttt atttagctag aaaaccatac gttttattaa ttttgctttc 1320 tagcaaatat atatacatat aggtacatat tgttctgtta tttattgttt atacttcata 1380 ttgttatgtt atttattatt aaatgtaaat aatttatatc taaaatgtag atatgtgtag 1440 taatcgttat ttatgctttc agttgaaaat gtaaataata caagcgatga aaatgatcac 1500 agctttattg ttaatgagaa taacctatca aatccagacg atttaatatt aatattatca 1560 agtaagagct gtactgaaac tagtaaaact ctattgactc cagaatccaa agaactaaat 1620 gaaacaccct gtttgtccat gcatgaacat aacaagcccg aatgtttatt gtaaaatctt 1680 tatttaaaat ttgatataat ttataaataa tatcttttgg tacatgcatg ttatacaaaa 1740 tttattatat ttttctctat agaaaccaaa atgatcacag attcactgtt aataaagata 1800 atctaacaaa tccagacaat cttatattac taaataacag cagcacagaa acgagtaaaa 1860 ctctattgtc tacagaatcg gaagaaataa atgatacacc acgtttgcca atggttgaac 1920 ataacagtct tcctaaacaa agtattgttc ctgagtaagc tttttttaat ttttttcatg 1980 gtaggttaca attataaaca tgtacaaagg tagactattt ttttaagtcg cttgtcatgt 2040 aaaattatgc tggaaattaa attattaaca aatatagtac actctgtaca ttttaaacga 2100 tacctatatc ctctagttat attatattat gcatttatct ctattaccta atgcttgaat 2160 gtttttttta tacaatttgt tataaacttt gaataatatt ttttggtttt tgtatattaa 2220 acaaatatat tttcccttta gaaatgatcc tgccaattgg ataaaaaatc aagaaacgat 2280 agattattta tcaatcaatg gatttaacca aaatttagag aacaataatt tccaaaagtc 2340 aaaaagaatt tacactcaaa taattggtgg ggtaagacgt actcgttttc gatatatgtc 2400 aaaaagtata tttatatcta ctcttgttaa tggtgaaaaa attaaccgaa catttttgat 2460 ttactcagaa agtaaaggat caatattttg tgctccctgc tatttatttg gtggcactac 2520 atcgtttgcc acagttggtt tttcagattg gaaaaaagga gaagaaaaaa tcaagcgtca 2580 tgaaaattca caacaccaca aattgtgtgt aatgaatatg aaagaaagaa aagaagtatt 2640 aaatcgagtt gatcaaaagc taaagtatca agtagaaaca gaaataaatt attggaaaaa 2700 tgtattgaca agagttgtag ctgttgtgaa atccctttct tctcgtggaa tgtcttttag 2760 aggtgatgat gaccgttttg gatctgttca taatgggaac tttatgatga gcttagagct 2820 tattgctcag tttgatcctt ttttagcaca acacattgag aagtttggaa ataaaggaaa 2880 aggttcaaca tcatacctat catttaatat atatgagcag ttcataagta taatggcaga 2940 taatgttatt caacaaatgg tgaaagaaat aaaggaagct aaatactttt ccatcagtat 3000 tgattctact cctgacatta gccatgtaga tcaactgtca ttcatttttc ggtatgttca 3060 aaagaatggc tgtccggtag aaagattttt agggttttta cctaattctg gtcataaatc 3120 tgaagaatta gcagatgctg tatttttagt tttagaatcg catggattag atattaataa 3180 ttgtcgtggt caaagttacg acaatgcgtc taatatgtct ggtagataca caggacttca 3240 agctcgcatt aaaaaagtta atcctttagc aacatttgta ccatgctcgg cacactcttt 3300 aaatcttgtt ggtgagtgcg ctgtggactg ttgtatttat gcttctgaat tttttattct 3360 gcttcaaaat atttataaat ttttcagtgc ctctacttat agatgggaaa tacttcagaa 3420 gaatttgatt aaaacagaaa atgtatctct taaaaaacta tcagatacca gatggtcagc 3480 aagatctgat gcaagtatta gtttaaacaa aaattggatt gaaataatta atgcactgtc 3540 ttttatcaaa gatgataaaa cagaaaaatc tattacaaga tcagaagcta acgggctata 3600 tttaaaactg gatagtcttg aaacagctat aatggccaca ttatggggcg atatactaga 3660 gagattcaac aaaactagca aacaattgca gtcagttgag attgatttag aaacagttgt 3720 aagtttatac gaatctttaa ttcgatatgt attagattta aggagtatgt ttgatatcta 3780 tgaagagaga gcaataaata agtcaggaca caaaacatat cgaaaaatac gcaaaagaaa 3840 aagaaaacta actgcagatg aaaaaagaga agaagaaaca aattttaaaa ttcgagactc 3900 tttaagaatt aatacatatt atgctataat tgataaactt cattctgaac tagaaagaag 3960 aaaattgtgc tatgatgaag ccaataaaaa atttaacttt ttatttcaaa taataaaact 4020 ttcgccatca gaagtttaca aaaaagcaga ggtacttcaa aatgtatata aaaatgatct 4080 ttcgtcttca tttgctaacg agtgtataca attcagaagt tatttaatga gtttaactga 4140 aaacttaaga cctaaaacta taatagatgt ttataaaatg atcagaactg aaaaacttca 4200 agagctgttt ccttatgtag atatagcatt aagaatatat ctatgctgcc caacgtccaa 4260 ttgttcagcc gagagatcat tttcagcatt aaagagggtt aaatcttatt tgaggtcgcg 4320 aatgacaagt gatcgtctta ataggttagc aattctatct atagaatctg tacttaccat 4380 agatatgaac tttaatgaca ttataaatac atttgcaaaa caaaactcac atagaaaatt 4440 ataattttct atagtgatta atataactta atatatgttg gtatagaaaa tgtttttttt 4500 tttaatgatc accccatatc aaagttgaca ttattcttaa aaaagggcgc ttaaaggaac 4560 agtgccccag ggcgccaaaa attcaaatac gtctctg 4597 // ID Harbinger-N11_BF repbase; DNA; INV; 350 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N11_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N11_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-350 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-350 RA Kapitonov V. and Jurka J.; RT "Harbinger-N11_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 803-803 (2008). XX DR [2] (Consensus) XX CC It contains 18-bp TIRs and is flanked by TWA TSDs. XX SQ Sequence 350 BP; 85 A; 96 C; 83 G; 86 T; 0 other; agccccgttt acaattcatg cggacgccgg ccgaatttgt agatcggtca gagctcgccg 60 aaaatcaaca tcgcccgagc atcggcttta ctttttatca aataatctgc accgtcacaa 120 attggacggg gctcggtgtt tccctccgga aagttcaagt cattgtcttt cctgcttttt 180 tggtgcacaa ctgacatcac acggaggttt aaatcccaac ggacgagctg ttaaaagtcg 240 cccgaagccc gcgtaatcgc ccaaacgccg gtcatactcc ggccgaaaat gcccatttgg 300 agctcgctga caagtcggcc gagctgaatt attgatttgt aaacggggct 350 // ID Hoyak2 repbase; DNA; INV; 2561 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 22-MAR-2010 (Rel. 15.04, Last updated, Version 2) XX DE Hoyak2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; transposase; HOBO; KW Hoyak2. XX NM Hoyak2. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2561 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 295..1797 FT /product="Hoyak2_1p" FT /translation="MCIKCGKTYQTSGNTTNLSCHIKRIHPNLTVSDLPQR FT QQLSSRSFIEKKYETSSNRKQTLDSALCSYITSDLRPFSVLENKGFRNFVS FT HLDPRYELPSRTTLRNVSMANAYAEKKAKLFELLKNVKHCAITSDCWTSRA FT NECYVTVTCHFVTADFELRNAVLVTEKLIDETTHSSENIANSIRAVLEEWS FT VLGKVTAMVTDNAKNMIKACEILQIRHVPCFAHSINLTVQDCLASENVKPI FT LEKCKRIVAFSKSSTISFVKFREAQDCVRPYSLKQECPTRWNSAFYMVERI FT FITKVAIASVLLNTSKGLMPLTSDEISILEDLKALLSPFEHATVHTSSSTS FT VTVSSVIHVPCGILHNLSALKSKLQTKVVCDDLLQGVNKRLVPYESRTVTR FT MTTILDPRFKKEGFWKTFNSSQGVKSLEEELSHIYAQTKSSTSLNPPTPEP FT TAESNLLFSFLQANKNQKIQSDCVDSILALRHYLNGTNLEPAQNPLDYWKV FT QKKII" XX SQ Sequence 2561 BP; 861 A; 488 C; 491 G; 721 T; 0 other; gtcggggaac tcgactatag cgttctctct tgttttttta tcaattcaat aaaattaact 60 taaaatatga tcagatttaa cagattgtgt ttaatattgg tgttggcggc atttgggatg 120 aagaagatat tgaaagagca gaacagcaat ggtaagtaga ctatttagtt acaattatat 180 gatttgtaat gtaaattttt gccaaagtag ttgccgggaa agacgaacaa agcgacgcgg 240 ccgagggtat gctcacaaat aagaggaaaa aatatgggac atcttctgtt ttaaatgtgt 300 attaaatgtg gcaaaacata tcaaacaagt gggaatacga ccaacttgtc ttgccacata 360 aagaggatac atcctaattt aacagtaagt gatttgccac aaaggcaaca gctgtcaagt 420 cggtcattca ttgagaaaaa atatgaaaca tcgtcgaacc gaaaacaaac actggacagt 480 gccttatgct cctacattac ctcagacttg cgcccttttt cagtgctgga aaataaaggc 540 tttagaaatt tcgtaagcca ccttgatcca aggtatgaat tgccttcacg aaccacatta 600 agaaacgttt cgatggcaaa tgcatacgca gaaaagaagg ccaagttgtt cgaactttta 660 aaaaacgtaa agcactgtgc aattacttcc gattgctgga cgtcaagagc aaacgagtgc 720 tatgttactg tgacgtgcca cttcgtgaca gcggacttcg aattgcgcaa cgctgttttg 780 gtcactgaga agctcatcga cgaaacgacc cactcttcgg aaaacatagc aaactccata 840 cgcgccgtgc tcgaggaatg gagtgtgtta ggaaaggtca ctgctatggt cacagacaac 900 gccaaaaata tgataaaggc atgtgagatc ctgcaaatta gacatgttcc atgctttgcg 960 cactcaataa atttaacagt gcaggactgt ttggcatctg aaaacgtcaa acccatttta 1020 gaaaagtgta agcgcattgt tgccttttct aaaagtagca caatttcctt tgtcaaattt 1080 agagaggcgc aggattgcgt acgcccatat agtttgaagc aagagtgtcc aaccaggtgg 1140 aacagcgcct tttacatggt ggaacgaatc ttcattacga aggttgctat cgcaagcgtt 1200 ttactaaaca cgtccaaggg gttaatgccg ctaacctcgg atgaaatatc cattctggag 1260 gacttgaagg cattactttc accgtttgag catgcaacgg tgcacacatc ctcaagtacg 1320 tctgtgacag tgtcatcagt gattcatgtt ccttgcggta tattacacaa cctgtcagcc 1380 ctgaaatcga aactccagac aaaggtggta tgcgatgact tgcttcaagg ggtcaataaa 1440 agattggtcc cctatgaatc tcgtactgtg acacgaatga ccacaatatt agaccctcgc 1500 tttaaaaagg aggggttttg gaaaacattt aattcctctc aaggagttaa atctttagag 1560 gaggagctgt cgcacatcta tgctcaaaca aaatcatcta catctttaaa cccgccaaca 1620 ccagaaccca cagcagagtc caacctgtta ttcagtttct tacaggccaa caaaaatcaa 1680 aaaattcaaa gcgattgcgt ggattctata ctagctctac gccattattt gaatggtact 1740 aatttggagc ctgcacaaaa cccattagat tattggaagg tacaaaaaaa aataatttag 1800 atggtaacca acttaatcca atttcttctt ttaacagata tcaaatgaca ccgcattcaa 1860 aacatgcgat tttaaagtac ttctgcgttc cggccacttc caccgaaagt gaacgaatgt 1920 ttagcaagga aggactggtt ttgagcgaaa agcggagctc gctaaaagcc aaaaatatta 1980 atgttatttt atttttaaac aaaaatgact ggataaatta atttaagtta tgtatctttt 2040 ttttttgagt agaactcaat gttatttttt attttatcaa ttgaattgtt attttatttt 2100 aatttttttt tttcttttta tgtttaagaa gtaaaaacta taaggaaact atggacctct 2160 aataaagctt tttttttata aacaacttgt atccatttga tttatttatt cggaataaaa 2220 tacactttac aaaaaaaaaa accctaaaaa gttcgatgta tcgatacatc gatgtttttt 2280 tctgaaaaac atcgaaacat cgatgtaggt aagcagaatt tctgccacaa gtgtcaaaaa 2340 aacgggaaat gaacacacga cccagacagc agccgtgtgt tccccgagaa aagccagcga 2400 accaagatgt caattaaaaa agcaatttga ggaattagac gtagttactg acagcaattc 2460 cccgatcttt tggcctcaga tttttaagaa taaaacaata ccagatataa cgattaattt 2520 aaataaaaac aagagagaac gctatagtcg agttccccga c 2561 // ID Gypsy-15_DPu-LTR repbase; DNA; INV; 169 BP. XX AC scaffold_27; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_DPu_; KW Gypsy-15_DPu-LTR; Gypsy-15_DPu-I. XX NM Gypsy-15_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 746-746 (2010). XX DR Genome; scaffold_27; Positions 603186 603018. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 169 BP; 34 A; 58 C; 33 G; 44 T; 0 other; tgtgatgacg ccgtctggca acacagcccc catcctcctt gcagaatctt gttcggcccc 60 aaagctgtcc ttgccggcca gtctattgta gctctccttg gccgacggta acgtcaaccc 120 gaatacagat ttcgtaacct gactctccgc tcagtttcac ttcatcaca 169 // ID Mariner-1_TV repbase; DNA; INV; 2954 BP. XX AC . XX DT 07-MAY-2009 (Rel. 14.06, Created) DT 07-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Mariner transposons from Trichomonas vaginalis. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-2954 RA Bao W. and Jurka J.; RT "Mariner transposons from Trichomonas vaginalis."; RL Repbase Reports 9(6), 1142-1142 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 638..1753 FT /product="Mariner-1_TV_1p" FT /translation="MITAHRHPVAGLIISPQDCQTIDMLRRSTCDANHIRQ FT FFLSKHRYFARRKKFIANMDETMLYSKRRYKVLTAGRNRPVRAEKSQLPHL FT TGVCTIFADGTTMKPMVILPQKKTLDAELEDLDAFFVRSESGWMNKYLFMV FT YCIMFICKVQEKRSQMNVFDAQVPFLLIVDGHPSRLSFLAVRLLSAFNIEL FT LVLPGHTSHILQPLDVGIFSPLKAQFKKLFDMATIRYDQQGHMVINYVWNA FT RELRYIMVRSFLDACSVSCTATNIENAFKATGIYPLDMNKPLASRYIVANG FT VPPARPNFVNDKVLTDPNVMMQVFQMEYGRPMQQQDWYFNLFVAMQSVLAW FT NGAYGQVLSRELHLLYPTAGGFQKIQLK*" XX SQ Sequence 2954 BP; 970 A; 548 C; 595 G; 841 T; 0 other; ccgtaggttg aatagcccga gaccaatttc aaaggtcgtg taccaataaa aaattttgaa 60 tcaatttaca caacgtgccc attttactca ctcaatattt gagaaaataa ttcaattttg 120 agaagaatgg tcctctgcgt tttgagtgag tgatgaaaaa aaatctcaga aaaattgaat 180 gagtttttat ggccatgaac caaatttgtg aagttcctgt ttaaaaaaat gcaaccaaga 240 acacgaattc ccgagtttgc agaattagaa aactataaaa atcttgggct attgacccag 300 atgcaactcg atttactcta tcgaagagta aatggcgagt cttatcaaca aatacggaat 360 gtttactcaa tttcaaagac gacagtagca cgcgcaatca tgcgcactgc tacttgtcgt 420 tcatggacaa aaggtcagtc agggggtggt atgacccttt tgtcactacc agatgaaatg 480 cagttcaaaa aacttgttca agagatggca gacgacttga actgtattac aacatcagtg 540 gccattgctg tctgtacgga attacaaaat aggagattaa aatttgcggc gcgggtacta 600 atcgctgcaa gatgcccgca tcttctagcg aagctcgatg attactgccc atcgccatcc 660 cgtggctggc ttaatcatat cgccacaaga ttgtcaaact attgatatgc tccgacgctc 720 tacctgtgac gcaaaccata ttcgccaatt tttcctctcc aagcatagat actttgcccg 780 caggaagaag tttattgcga atatggacga gacaatgctc tattccaaac gtcgctataa 840 agttcttact gctggccgga atcgcccagt tcgcgctgaa aagtctcagc ttccacatct 900 tacaggggtc tgtaccatct ttgcagatgg tacaactatg aagccaatgg tgatacttcc 960 acaaaagaag acccttgatg cggaattgga agatctggac gccttttttg ttcgctccga 1020 gagtggatgg atgaacaaat acctttttat ggtatactgt atcatgttca tctgcaaagt 1080 acaggagaaa aggtctcaaa tgaatgtgtt tgatgcccag gtaccttttc tcctaattgt 1140 agatggtcat ccatccaggc tctctttctt ggccgttcgc ctcttgagcg ccttcaatat 1200 cgagcttctt gttctcccag gccatacatc tcatatctta caacctcttg atgttgggat 1260 tttttctccc ttgaaggccc aattcaagaa gttgtttgat atggcgacta tccggtacga 1320 tcagcaaggg catatggtga tcaattatgt atggaacgcc agagaactgc gctatatcat 1380 ggttcgcagc ttccttgatg catgtagtgt gtcttgcaca gctacaaaca tcgaaaatgc 1440 ttttaaagca acaggaatct atccattgga tatgaacaaa cctcttgctt ccagatacat 1500 tgtagcaaat ggagttcctc ctgcaaggcc caattttgtc aacgacaaag tccttacaga 1560 tccaaatgtg atgatgcaag tcttccaaat ggaatatgga cgccccatgc agcaacagga 1620 ttggtatttc aatctgtttg tggctatgca atctgttttg gcctggaatg gagcatatgg 1680 ccaggttctt agtcgtgagt tacatctcct ttatcctact gcaggtggat ttcaaaaaat 1740 ccaattaaaa tgatataaag ctgaggagct ccagggtgaa gaggaattga tgaagagatg 1800 aagcatcttg atgaagagag gaagcacctc gatcaagaga ggaagtactt tgaagaagag 1860 aggaagcatc ttgctgaaga gaggaagcat cttgatgaag agaggaggta ctttgaagaa 1920 gagaggaatc actttgttga agagaggaag cactttgatt tagagaggaa gctgcatgat 1980 gaagagagga agcactttct gcatgatgga gtgaggaagc atcttgatca agagaggaag 2040 tactttgaag aagagaggaa tcactttgtt gaagagagga agcatcttga tgaagagagg 2100 aagcatcttg atcaagagaa gaagctgcat gatgtagtga ggaagctgca tgatgaagag 2160 aggaagcact ttgatgaaga gaggaagcgt aagaaccgat aacatttaca aaatattaaa 2220 tttattttgt aaattttcca aaagccttag taccaatgtg atataaatgc tccaaaaata 2280 ccatccctcc ttccaaaagc gtaagaaccg ataacattta caaaatatta aatttatttt 2340 gtaaattttc caaaagcctt agtaccaatg tgatataaat gctccaaaaa taccatccct 2400 ccttccaaaa gcgtaagaac cgataacatt tacaaaatat taaatttatt ttgtaaattt 2460 tccaaaagcc ttagtaccaa tataatataa atgcactaaa aacccaactg ccaatagcat 2520 gtattcaaaa aatgcaattc aatacaaatt tgattgcatt ttttgagatg gaagtgtatc 2580 tgaattcaca attgtaatcc acgaatacac ttccatctca ttgataagag gctaatgttt 2640 ttctcattaa taaggcgctg ttgacctcta tccatagccg aagtgatatt aacgtatgtt 2700 aacgtacaaa gtaaaaacat gttttttctt tttaaaatca tttagtaatt aataatattt 2760 agcactcata gtaaaaacat taaaatttat aagaatcgtt atgaagaaaa attgagaact 2820 tatgattttt atttaagaaa atgattaaat aatatataaa taaccatata aacaacattt 2880 tagattcctc tgaaaagggt atctcaaatt ggtctatgac caattgaatg gttcgccggt 2940 tattcaaggt acgg 2954 // ID Proto2-6_CS1 repbase; DNA; INV; 4464 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-6_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-6_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4464 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1561-1561 (2009). XX DR [1] (Consensus) XX CC Proto2-6_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1_SK) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in Proto2 CC elements from all species mentioned above. ORF2 codes for a CC protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 84..1322 FT /product="Proto2-6_CS1_1p" FT /note="ORF1." FT /translation="MSGDNTPADLRSRSGSIVDGSEEKDRSSDPCPCNVNA FT SSSVCNSVLGYLVGSLKKSPRDVVKKIALSHFCPDEIKTAKQALWSSSACV FT PAIGPWKPRKGSSNKPRQEFEIDDIFDALSKLDEASVKPVIHVPALDLDLL FT PPMNPAQLLPVIFLDKMNEMEAKMAAMQAQIDNLGKREQVSGHGRNEMAHR FT STSRSDSRRSTGQQTKTRPAVPNPAGPAPPSMSAGSHAKSPPQDPERWPDR FT QDTASTSVKRVDTGEEETKQEHSNEWTEVVKRKSKKNPLKERVRKAAKEAE FT VIVGTKVDEGLSASNPIKHLFVNRLNNYHGVDDMKSFLLKNQVQARGIKKT FT SKDEWMKASFKISLNEGDFEKVFTPGFWPEGVQCREWIGYVPPVSKSDKKA FT ENSQPVEEVNLTTDDGPTE" FT CDS 1309..4371 FT /product="Proto2-6_CS1_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MDPQSDMNKLKLCTFNVHGVHNKWTYLQELLVEHDIL FT FIQEHWLRECQFHLFNDNLVNVNFASVTPMDDGLLLSGRPHGGCAVMWRTT FT IAHRVSPIIHSNTRISAAMIEINNVSYLCCSIYMPCDDNTNIEVYESVLHD FT VMTIALEHSADHLIIGGDLNTDLSRSHSLHTMSLKLFVSNEDLVIIDDSLG FT IPYNVDFTFESTTNASRSILDHFIVSHNIVPHVMSIECNHDVNNSSDHVPL FT TIELNVNTKYVKAESTGTRPKPQWHKATNEDLDAYRSTLDVLLQNIVIPEA FT ALECSSSQCTNEMHREELLKLHDSLIQASLRASEHAIPSSSTDLAKGRPTV FT IPGWNEYVAPLKSTASLWHFIWKECGRPQNGIVADIRRRTRAQYHRMIRQV FT KRNEATIRSAKMAEAFHDTRNHRRDFWSEVKKVKGGSGATPATVDGVHGAD FT SIANTFCEKYADLFNSIQTTDEEMSCISDVLSSKVHNHEQADECSGHFISF FT EDVRTKCAELKSNKHDGYLGQYTDHIKNASHRYLCILSLLLNSILVHGVIP FT DAFKVSTVIPIPKNKRKSLNSSDNYRAVALSSVVCKLLEKIIMSKFNDVFY FT SSKYQYGYKEKHSTTHCSFVVNEIINYYAQNNSCTYVALLDASKAFDRLHY FT GTLFRKLIDKKLCPIVTRFLLEMYKSQLMRVKWCNSLSRTCEVSNGVKQGG FT VLSPLLFTVYLDELLERLKSTGFGCHIGTTFAGCFAYADDIVLLSPTLLGL FT KKMLHTCAAFSADYYVKFNPNKTKLVVCNSHSNVSPVVNFLGSNIEVVNDE FT KHLGTLLGNVSAQTRINEGVHNLLMKVNHFKCHFKTLPSDSAYSLFKTHCM FT AMYSSPLWDLTDRSIECFYVAWRKSIRRLFDLPYRTHSRLLHLICNDLPIA FT SQLLVRFHKFILSLQNTSNSLTRLCLQLALSGSRSAVSNSMSALSSRLHIS FT RLDMLDLDSSTVYKALFSMHLDLLDEDTAVHALLLKDLINMKFDAITSNEY FT NLSEICHSINYYCVL" XX SQ Sequence 4464 BP; 1325 A; 936 C; 944 G; 1259 T; 0 other; gccacgcatg agcgcgcgtc cctgaactct gctctgtggc ctgcactctt tgactgcggt 60 ttgtttataa acacagtcag ctgatgtcgg gcgacaatac gcctgcggac ctccgaagca 120 ggtcggggtc cattgtggat ggctctgagg agaaggaccg atcgtccgat ccatgcccat 180 gcaatgtaaa tgcctcttct agtgtctgca attctgtact cggctattta gtcggatctt 240 tgaagaagtc gccgagagac gttgtcaaga agatagcgct ctctcacttc tgccctgatg 300 aaattaagac tgcaaagcaa gccttatggt catcatctgc ctgtgtacca gcaattggtc 360 catggaaacc aaggaaaggg tcttcgaata agcccagaca agagttcgaa attgacgata 420 ttttcgacgc cctcagcaag ttagatgaag caagtgtcaa gccagtcatt catgtgcctg 480 cccttgatct tgatctgcta cccccgatga atcctgccca gttgctgcct gtcatcttcc 540 tagacaaaat gaatgaaatg gaggccaaaa tggccgctat gcaagctcaa attgataacc 600 tgggaaaaag agaacaagtg agtggccacg gaagaaatga gatggcccat cgatctacat 660 ctaggtcaga ttctcgaaga tcaactggac agcagaccaa aacaaggcct gcggttccaa 720 atcctgctgg acccgccccg ccaagtatga gtgcaggtag tcatgctaag tcaccccctc 780 aggaccctga aaggtggcca gaccggcaag acacagcctc tacaagtgtt aaaagagtcg 840 atactggcga agaggagaca aaacaagagc attccaacga gtggaccgaa gttgtcaaac 900 gaaagtctaa gaagaaccct cttaaagaga gagtacgcaa agccgcaaaa gaggcagagg 960 tcattgtcgg gaccaaagtt gatgagggtt tgtcagcaag caacccaata aagcatctct 1020 ttgtaaatcg cttaaataat taccatggtg ttgatgacat gaagagtttc ctgctgaaga 1080 atcaagtcca agccaggggg atcaaaaaga catcgaaaga tgaatggatg aaagcatcct 1140 tcaagatatc actcaatgaa ggtgattttg agaaggtctt cactccaggt ttctggccag 1200 aaggcgtaca atgcagagag tggatcggat atgtaccgcc agtatctaaa tccgacaaga 1260 aggctgagaa ctcacagccc gttgaagaag taaacctaac aactgacgat ggacccacag 1320 agtgatatga acaaactaaa actttgcact ttcaacgtgc acggtgtaca caataagtgg 1380 acatatctgc aggagctact tgtggaacat gatatactct tcattcagga gcattggctc 1440 cgggaatgtc aattccatct cttcaatgac aatctggtta acgtgaactt cgccagcgtt 1500 acaccaatgg acgatggact tcttctttct ggacggccgc acggcggatg tgcagtgatg 1560 tggaggacta caatagccca cagggtgtcg cccatcatac attcgaacac tagaatcagt 1620 gctgcaatga ttgaaatcaa caatgtttca tatttatgtt gctctatcta catgccttgt 1680 gacgataaca ctaatattga agtatatgaa tctgtattac atgatgtcat gactatagct 1740 ttagaacata gcgccgatca cttaataatt ggtggtgatc tcaacacaga tctttctcga 1800 tctcattcct tgcatactat gtctctcaag ttgtttgtct cgaatgaaga cctggtcatt 1860 atcgatgatt ccttaggtat tccatacaat gttgatttca cttttgaaag tacgactaat 1920 gcaagcaggt caatcctcga tcattttatt gtatctcata atattgtgcc ccatgttatg 1980 tctattgaat gtaatcatga tgtcaacaat tcatctgatc atgtcccctt gacaattgaa 2040 ctaaatgtga atacaaagta tgtaaaggca gagtcgactg gaacccgccc caagcctcag 2100 tggcataaag ctacaaacga agaccttgat gcatatagat ctactcttga tgtcctgctt 2160 caaaatattg tcattcctga agcagcgttg gaatgctcat cttcacaatg cacaaatgaa 2220 atgcatcgag aggaattgct aaaattacac gacagcttaa ttcaggcatc tctaagggcc 2280 agtgaacatg caatccccag ttcttctact gatctggcca agggaaggcc gactgtcatt 2340 ccaggctgga atgagtacgt ggcccctctt aagagcacgg catctctttg gcattttatt 2400 tggaaagagt gtggacggcc acaaaacggg attgtggcag acataaggcg acgcactaga 2460 gcccagtatc atagaatgat acgccaagtt aaaagaaacg aggcaactat acgcagtgca 2520 aaaatggcag aagcatttca tgataccaga aaccaccgac gggatttctg gtccgaagtt 2580 aaaaaggtca aaggcggctc aggagccact ccagccacag tggacggggt acatggtgca 2640 gactctattg ccaatacttt ttgcgaaaag tatgcagatt tattcaatag catacagaca 2700 actgatgaag agatgtcgtg tatatctgat gtactcagta gtaaggttca taatcatgag 2760 caagctgatg agtgttctgg tcacttcata agttttgaag atgtaagaac caagtgtgct 2820 gaattgaaaa gcaataagca tgatggttat ttgggtcagt atacagacca catcaaaaat 2880 gcaagtcata ggtatttgtg tatactctct cttctgttga atagcatact tgttcacggt 2940 gtaatcccag atgctttcaa agtaagcact gtcatcccaa ttcctaagaa caaacgtaaa 3000 tcattgaatt catctgataa ttatagggcc gttgctctga gcagtgtcgt atgcaaacta 3060 cttgagaaaa tcatcatgtc aaaattcaat gacgtttttt atagctccaa gtatcaatat 3120 ggatataagg agaaacattc aaccacacat tgttcatttg ttgtaaacga aattatcaat 3180 tattatgctc aaaataattc atgtacatat gttgcattgt tggatgccag caaggcattt 3240 gatcgcttac attatggaac cctttttcgt aagctcatag ataagaagct atgccctatt 3300 gtaacaagat ttttgttgga aatgtataaa tcacaattga tgcgtgtcaa atggtgcaac 3360 agtttaagtc gcacgtgtga agtgtccaat ggagtcaagc agggtggggt tctctcaccc 3420 cttctattca ctgtctatct ggacgagtta ctcgaacgtc tcaaaagcac tgggtttggg 3480 tgtcacatag gcactacatt tgcaggctgc tttgcatacg cagatgacat tgtcttgctt 3540 tcccctactc tgcttggcct aaaaaagatg cttcatacgt gcgcagcgtt ctctgcagat 3600 tattatgtta aatttaatcc taacaaaacc aaacttgttg tatgtaatag tcatagtaat 3660 gtcagtcctg ttgttaattt tcttggttct aatattgaag tggtgaacga cgagaagcac 3720 cttggtacct tactagggaa tgtttctgca cagaccagaa ttaatgaagg tgtacataat 3780 ttattgatga aagttaatca ctttaaatgt catttcaaaa ctctgccaag tgattctgca 3840 tattctcttt ttaagactca ttgtatggca atgtacagca gtccgctgtg ggacttgact 3900 gatcgctcta ttgaatgttt ttatgtcgca tggaggaaat caataagaag gttgttcgat 3960 cttccttatc gaacacacag tcgcttgctg catttgattt gtaatgatct acccattgct 4020 tcgcagctcc ttgtaagatt tcacaagttc attctttcgc tgcagaatac ctctaattca 4080 ttgactagac tctgcctaca gttagcttta agtggaagtc gctcagctgt aagtaacagc 4140 atgtctgctc tgagctcacg tcttcatatt tctcgtttag atatgcttga tttagattct 4200 tcaactgttt acaaagctct gttctctatg caccttgatc tgttagatga agatactgct 4260 gtccatgcct tattattaaa agacttaatt aatatgaaat ttgatgccat cacttcaaat 4320 gaatacaatc tgtcagaaat ttgtcactct atcaattatt actgtgtcct ctaaattgta 4380 acaatgacgt aatgacgctt tttttcacca catttttttt tactgttgac atgtttatgt 4440 gaataaactg tataataata ataa 4464 // ID Kolobok-1_TV repbase; DNA; INV; 1929 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-1929 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 115-115 (2007). XX DR [1] (Consensus) XX CC Kolobok-1_TV is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the T. vaginalis genome CC in a last few million years. The Kolobok-1_TV transposon is CC characterized by 15-bp imperfect terminal inverted repeats, TTAA CC target site duplications, and it encodes the 401-aa CC Kolobok-1_TV1p transposase. Kolobok transposons, including CC numerous families of non-autonomous elements, constitute >2% of CC the T. vaginalis genome. See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS 338..1540 FT /product="Kolobok-1_TV1p" FT /translation="MLGACIMTGMPYTKGARFLSLCGTKPPVKSGVMRQQR FT FCDDKIRRLKSISLMLSRKSFSGYLSIDARWTHRRNSPSCTVTALDAVTKR FT VLACVNINHIGGNRQHAQYSGASNNMESAGTRIILKQLKKYNILKDVKEII FT KDRDNKSVSVFKEFGVSHLERFDPGHVSKNISKDFTKFSASHKTVEVFNDK FT TQKIEKIERPFWGLNASLSMWLWSCFAEENIDKRTKMWENCVYHYVGNHSF FT CEPHNYKCFEWQKGVNSIQLQFMLYDWVHKWTPIVSKVSSIGSTCMNEAFN FT SKIACYLDKSRAWKNINIRVNVAILEWNDPEHFLGDIQKALHLPPLPEDCV FT KSFEENCYSKMILKQKTVTKGSKKQNKYKKSIQSVPGGDYQVFKSEWESED FT EDIDDFD" XX SQ Sequence 1929 BP; 648 A; 315 C; 328 G; 638 T; 0 other; gggtatagtc cactaaaaat ttttaagttt tgacaatgaa aataagctct gtgaccatgt 60 aaatatatat ctatgcaaaa cgagaggtta tgccataatt ttttttagtt atcaaatttt 120 cctttttttg agagaagtct ctcaataatt tgagaagaat ctttgatata aagagataca 180 catttgtaga acgaagtaag tgttttccaa acattttata tgttcatcct gtataacaat 240 ataaagctgc atagtgccat cttgtgcgtt tgctatttta tgtacttttc attgaaacat 300 tttactccac gaacgtcgtc gactctcagc gcgacagatg cttggtgctt gcataatgac 360 cggaatgcca tatacaaaag gtgctcgctt tctttctctt tgtggaacta agccccctgt 420 gaagagtggt gtcatgagac aacagagatt ttgtgacgat aagattagaa gacttaaaag 480 tatttcccta atgctgtcga gaaagtcctt cagtggttac ttatccattg atgcaagatg 540 gacgcacagg cgaaactcgc caagttgtac agtaactgcg ctcgatgctg tcacaaaaag 600 agttcttgca tgcgtgaaca tcaaccatat tggtggaaac agacagcatg ctcagtattc 660 cggagcttcc aacaatatgg aaagtgctgg aactcgaatc atcttgaagc aattaaagaa 720 atacaatatc ttgaaagatg ttaaagagat tattaaagac cgagataaca aaagtgtctc 780 tgtctttaaa gaatttggcg tctctcatct tgaaagattc gatccaggtc atgtatcaaa 840 aaatatctca aaagatttta caaagttctc agcttctcat aagacagttg aagttttcaa 900 cgacaagact cagaagattg aaaaaataga gagacctttc tggggtctga atgcaagcct 960 aagtatgtgg ctttggtcat gttttgccga agaaaatatt gacaaacgta cgaaaatgtg 1020 ggagaattgt gtctaccatt acgtaggcaa tcattctttt tgcgaaccgc acaattacaa 1080 atgctttgaa tggcaaaaag gtgtcaattc gatccaactt caattcatgc tctatgactg 1140 ggttcacaaa tggacaccca tagtttccaa agtttcttcc attggatcta cctgcatgaa 1200 tgaagcgttt aactccaaaa ttgcatgcta tttggacaaa tcaagagcct ggaagaacat 1260 taacattaga gttaacgttg caatcttaga atggaatgat ccagagcatt ttctcggaga 1320 tatacaaaaa gcattacatc ttcccccact tccagaagat tgtgtcaaat catttgaaga 1380 gaattgttac tccaaaatga tattgaaaca aaaaactgtt acaaaaggtt ctaaaaaaca 1440 gaacaaatac aaaaaatcca ttcagagtgt accaggagga gattatcaag ttttcaagtc 1500 cgaatgggaa tcagaagacg aagacattga tgactttgat taacgaaaat tacattttat 1560 ttattgtttt tgtttttaat taaattatta ttatgctcat tttcttattt tgattttttc 1620 tcattatgat ataaataatt tattaatgca tttatccaat aattatgcta tgatattttt 1680 atgccttatt tcatttctct tatatttgat aatctggtaa aatatagttc agactataat 1740 cattttagat tattgatata tttatatagg ttagaataaa attagtttca ttttctactt 1800 ttattttgaa ttaataattt ataatatcaa atattatggc gtaacctctc ctttaaaata 1860 tagatatata taggcctata gagtttattg gcactctcaa aactctagaa ttttgagttt 1920 actataccc 1929 // ID Gypsy6-SM_LTR repbase; DNA; INV; 318 BP. XX AC Contig1272; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6-SM_LTR; KW Interspersed repeat; LG_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-318 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-318 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 754-754 (2007). XX DR Genome; Contig1272; Positions 32712 33029. XX SQ Sequence 318 BP; 134 A; 15 C; 41 G; 128 T; 0 other; tgtgaggagt taaaattaaa atactaaatt aaattagttc acgattaaaa tttgtaaagt 60 tttaattgaa acttaaatta ttttaagata ttaaaattga aatgataatt gaaattttaa 120 aataaaataa tttttaataa aattgaaatt caagtatgaa tgcaatggta tgttgatatt 180 tggttgaaat taaaatttag aaaatgtccc aagaatattt gtttatataa tcagattttt 240 atgaaatata attttggatt taatagtctt agtaatttat tttgatgact gaagaattaa 300 atgttatagc ttctaaca 318 // ID CR1-66_HM repbase; DNA; INV; 4339 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 14-SEP-2009 (Rel. 14.1, Last updated, Version 2) XX DE CR1-type family - consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; CR1-66_HM. XX NM CR1-66_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4339 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1893-1893 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 85..861 FT /product="CR1-66_HM_1p" FT /translation="MALTLQEIKKMFKEMFCEFKKEMFNEFKKETEAMFKK FT HEQTVLDIISGNLKILNERLDKLETNANENVIKIKKIEKEIEEISDSINFN FT ESIIDKKIEYNSKQLEKELIENEILKEKHRKLEDRSRRNNLRIDGLYEDEK FT ETWGQTEEKVHLFFKQKLGVEDIEIERAHRTGQKKNGKPRTIVLNLQRYKD FT KVKILKELHRIKGTNIFVNEDFSLETVAIRKKLFTDVKQRRLNGENVAVRY FT DKIICFKNSFYDNNRKK*" FT CDS join(912..2168,2143..4008) FT /product="CR1-66_HM_2p" FT /translation="MAALNNFESLAFNFFEKEFFSQDENSDPDNNFYCETF FT SDCSYIFPSELEEYLFKNVIDKNSNQIRILHLNIRSLNSNFEKLLNLIEET FT KNLFNIICLTETWITSKDLNSHFQIPHFNIISLERQINKRGGGVLIYAHET FT LRLVFRGDLSISDSDKEVLTIEIESNQTKNLLISCCYRPPAGMSENLSMFL FT HNVIKKGDTEKKKNILLGDFNMNCFLYNNDYKVKSFYDTFFETGSIPLINR FT PTRVTINSATLIDNIISTDIFNKGIKKGILKTDITDHFPIFVTIDTYTAKN FT IDNKKVLKKRIINQNNLNLFKDQLSLLHWNNININNNANDIYETFFKTFYS FT VYDANFPVVEIIAKPKSLNSPWITKGFKKSSKIKQKLYINFLKTKTTANEK FT IYKNYKYLFEKIRKNLKKKILLGITKKKYYSELLNKFKYNTKRIWEIEMKE FT IIGKQKSCCFGFLPQMIKIENTSIFEPNAISHEFNKFFTEIGPKLSNKIPI FT TRTLFSDFLLSLEKCICSDELSSDLSIEELEKALKSIKKNKSCGPDEIDGN FT VIIDCFKQLKDVLFKVFNASIKQGIFPEQLKIAKVTPIYKDGDQSQITNYR FT PISVLSVFSKILERIMYNRIYKHFDKNNLLYVNQFGFKKDSSTEHAIIQLV FT NEISKSFEQSKYTLGIFIDLSKAFDTVDHHILLKKLKYYGVNNQALRWFRS FT YLTNRKQFVVGNDNYHNNCLSITCGVPQGSILGPLLFLIYINDLNKATNLM FT SIMFADDTNLFISKSDISELFATVNKELKLLSHWFKSNKLTLNTDKTKWTL FT FHSNRKKKFLPLNLPPIFIDEIEIKRDSVIKFLGVYLDENINWNQHINYIC FT TKVSKNIGVLYKARTYLNKKHLRQLYFSFIHCYINYANIAWGSTEKSKLLR FT LYRCQKRAIRIINFADRFSHCKPYFIEMKILNVFELNVLKILCFVYMWKNN FT LSPPVFKDFFSLKPSNKYTLKRKNFFNEPFCQTNFNQFCLAYRAPHLWNKL FT VLPNVDFDRPNSFCLFKNKIKNLIFSLDNILEYY*" XX SQ Sequence 4339 BP; 1731 A; 652 C; 575 G; 1381 T; 0 other; aaagaaagaa aaaagtttta aaaacatata aacatttaaa tacatatctt cttataaatc 60 aattcagcat ccttgtaaat cattatggca ttgactcttc aagaaataaa gaaaatgttt 120 aaagaaatgt tttgtgagtt taaaaaagaa atgtttaatg agtttaaaaa agaaaccgaa 180 gctatgttta aaaaacacga acaaacagtt ctagacataa tcagtggaaa tctgaaaatt 240 ttaaacgaaa gattggacaa gcttgagacc aatgctaatg aaaatgtaat aaaaatcaaa 300 aaaattgaaa aagaaattga agaaataagc gatagcataa actttaacga aagcattatt 360 gataaaaaaa tcgaatacaa tagcaaacaa ctagaaaagg aattaatcga aaacgaaatt 420 ttaaaagaaa aacatcgaaa acttgaagat cgttcacgta gaaataatct acgaatagat 480 ggattatacg aagacgaaaa agaaacctgg ggacaaacag aagaaaaagt ccacttattt 540 ttcaaacaaa aacttggtgt tgaagacatt gaaattgagc gtgcccatcg cactggacaa 600 aagaaaaatg gaaaacctcg gacaatagtc ctcaaccttc aacgatataa agataaagta 660 aaaatactga aagaactaca tcgaataaaa ggaacaaaca ttttcgttaa cgaagacttt 720 tcactcgaaa ccgttgccat caggaaaaaa ttgtttactg acgtaaaaca aagacgttta 780 aacggagaga atgttgcggt tcggtatgat aaaattattt gttttaaaaa ttctttttat 840 gataacaata gaaaaaagta aactaattca cactcttaaa aagaactctt ttaaacatat 900 ataaaacaat catggctgca ttaaataatt ttgaatctct cgcgtttaat tttttcgaaa 960 aagaattttt ttcccaggac gaaaactcag atcccgacaa taatttttat tgtgagactt 1020 tttcagattg ttcgtatatt tttccaagcg aactagaaga gtatcttttt aagaatgtta 1080 ttgacaaaaa ttcgaatcaa attagaattc tccacctcaa tataagaagt ctaaatagta 1140 actttgaaaa acttttaaat ctaatagaag aaacgaaaaa cctttttaat ataatatgct 1200 taacagaaac gtggattaca tcaaaagatc taaatagtca ttttcaaatt ccgcatttta 1260 atattatttc tttggagaga caaataaata aacgaggtgg tggagtacta atttatgcgc 1320 acgaaacact taggcttgtt tttagaggcg atttgagcat ttctgatagc gacaaagaag 1380 tcttaacaat tgaaattgaa agcaaccaaa caaaaaactt attaattagc tgttgttatc 1440 ggccacctgc tggcatgagt gaaaacctta gcatgttttt gcataacgtc attaaaaaag 1500 gcgataccga aaagaaaaaa aatattttgc ttggggattt taatatgaat tgctttcttt 1560 ataacaatga ctataaagta aaaagttttt atgacacatt ttttgaaaca ggatcaatcc 1620 ccctcattaa tcgccctacc agagtaacta taaactcagc gactttaatc gataatatta 1680 tttcaactga tatttttaat aagggtataa aaaaaggtat attaaaaact gatataacgg 1740 accattttcc catttttgta acaattgata catatacagc aaaaaacatt gacaacaaaa 1800 aagtattgaa gaaacgtatc attaatcaaa ataaccttaa tttattcaaa gaccaactat 1860 ccttactcca ttggaataat attaacataa acaataatgc aaacgacata tatgaaactt 1920 tcttcaaaac tttctattct gtatacgacg ctaactttcc cgtagttgaa ataatagcaa 1980 aacctaagag tttgaattct ccttggatta ccaaggggtt taaaaaatct tccaaaatca 2040 aacaaaagtt atatataaat tttctaaaaa caaaaacaac tgcaaacgaa aaaatttaca 2100 aaaactataa atatttattt gagaaaattc gtaaaaactt aaaaaaaaaa atattactcg 2160 gaattactta acaaattcaa atataacacc aagcgcattt gggaaataga aatgaaggaa 2220 attataggaa aacaaaaatc atgctgcttt ggttttctgc cacaaatgat taaaatagaa 2280 aacactagca tatttgaacc aaacgctata tcacatgaat ttaataaatt ttttactgaa 2340 ataggtccta aactatcaaa taagattcct atcacaagaa ctttgtttag tgacttttta 2400 ttatccttgg agaagtgcat atgctctgat gagctatcct ctgatttatc aattgaagaa 2460 cttgaaaagg cgctcaaatc tattaaaaag aataaatcat gtggaccaga cgaaatagat 2520 ggaaatgtga taatagattg ctttaagcaa ctaaaagatg ttcttttcaa agtttttaat 2580 gcctcaatta aacaaggaat ctttccagaa caattaaaaa ttgccaaagt tacacctatt 2640 tataaagatg gtgaccaatc ccaaataact aattatcgcc ccatctctgt gctctctgta 2700 ttttcaaaaa tcctagaaag aattatgtac aacagaatat ataaacactt tgacaaaaat 2760 aacctactct atgttaatca atttggtttt aaaaaggata gctcaacaga gcatgcaatt 2820 atccaacttg taaatgaaat ttcaaaatcg tttgaacaat caaaatatac attaggtatt 2880 tttatcgacc tatcgaaggc cttcgatacg gttgaccatc acatcctact caaaaaactg 2940 aaatactatg gagtaaataa ccaagcgtta aggtggttca gaagttactt aacaaataga 3000 aaacaatttg ttgttggtaa tgacaactat cacaacaatt gtttaagtat aacctgtgga 3060 gttccacaag gttcaatcct cggacctctc ctttttttaa tttatataaa cgatttaaat 3120 aaagcaacta atctgatgag catcatgttt gctgatgata ctaatttatt catctctaag 3180 agcgatatta gcgaactttt tgccacagtg aataaagaac ttaaactttt atcccactgg 3240 ttcaaatcca acaaattaac tttaaatact gacaaaacta aatggactct ttttcattcg 3300 aataggaaaa aaaaattttt acccttaaac ttgcctccaa tctttattga tgaaattgaa 3360 ataaaaagag actccgttat taaattttta ggtgtttatc tcgatgagaa cattaattgg 3420 aatcaacata ttaactatat atgcactaaa gtctctaaaa acattggagt tttgtacaaa 3480 gctcgaactt atctcaataa gaaacactta aggcaacttt atttctcatt tattcactgc 3540 tacataaact acgctaatat tgcctgggga agtactgaaa aaagtaaatt acttcgtctt 3600 tatcgctgtc agaaacgagc aattcgcata attaactttg ctgatcgttt ttctcattgt 3660 aaaccttatt ttattgagat gaaaatttta aatgtatttg aactcaacgt tttaaaaatt 3720 ttatgctttg tttatatgtg gaaaaacaat ctatcccccc cagtctttaa agattttttt 3780 agtttaaaac ccagcaacaa atacactctt aaaagaaaaa atttttttaa cgaacctttt 3840 tgtcaaacaa attttaatca attttgtctt gcctatcgtg caccgcacct atggaataaa 3900 cttgttttac ctaatgttga ttttgatcga ccaaattctt tttgtctttt taaaaacaaa 3960 attaaaaatc taattttttc cctcgataac attttagagt actattaatt aatttatcta 4020 ttatttaaat ttttattatt ttttccctaa agttttgttg ttaacaacta tttatattta 4080 aagatttgct aatgtaaaag ttctatatat atattttact ttactgtaac gtatacttta 4140 caaggtattc gtaaattata ttgtttataa atgtttttta aggttccgat gataagatct 4200 tagtgatctt ctttcggaaa cctagtttac actgttatca cgatttgtat atatatatat 4260 ttgtaaagta attttaatag tttattgtat tgtaaaacga ctaactttgt aagctgaaaa 4320 aaaaaaaaaa aaaaaaaaa 4339 // ID Gypsy-11_CQ-LTR repbase; DNA; INV; 169 BP. XX AC AAWU01009286; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_CQ_; KW Gypsy-11_CQ-I; Gypsy-11_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 402-402 (2011). XX DR GenBank; AAWU01009286; Positions 2181 2013. XX SQ Sequence 169 BP; 40 A; 38 C; 45 G; 46 T; 0 other; tgtagggtcc tgagttatgt tttaaccttt tgtatacatt cttgaaacca cacctgcgat 60 gataagtctt gcgcgttgtg acgtcatgat gatttgacag ctcgcttggg ctgctcagtc 120 attacggagc tgacgacgcg aacagtggac gccgagcaag gacgcaaca 169 // ID Gypsy-8-LTR_HM repbase; DNA; INV; 117 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-117 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1983-1983 (2008). XX DR [1] (Consensus) XX CC The LTR of this family is characterized by unusual 5'-TG and CC AA-3' termini. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 117 BP; 45 A; 10 C; 21 G; 41 T; 0 other; tgtaagaatg tgctccgaga tggagatctt aattaaaact atatttacgt tattgtaact 60 ttatgaaggc attatataga aaataagcga taagaaatgt tgtagtttat cattaaa 117 // ID Nobel_I repbase; DNA; INV; 6294 BP. XX AC . XX DT 09-MAY-2009 (Rel. 14.06, Created) DT 09-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Nobel_I: A Bel-like LTR retrotransposon in Drosophila persimilis. XX KW BEL; LTR Retrotransposon; Transposable Element; gag-pol; Nobel_I. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-6294 RA Austin S. and Styles P.; RT "Nobel_I: A Bel-like LTR retrotransposon in Drosophila RT persimilis."; RL Repbase Reports 9(6), 1151-1151 (2009). XX DR [1] (Consensus) XX CC Nobel_I is an element belonging to the Bel superfamily, found in CC Drosophila yakuba, D. ananassae, D. pseudoobscura and D. CC persimilis, with a copy number of 11, 54, 2 and 53 elements, CC respectively. The consensus sequence is 6294bp in length. Nobel_I CC encodes a single gag-pol polyprotein which starts at position CC 1107 and ends within the LTR, which is interrupted by a premature CC stop codon between positions 3534 and 3536 in the consensus CC sequence. The open reading frame is only intact in one copy of CC Nobel in D. persimilis. Flanking LTRs can be up to 100% identical CC suggesting that Nobel is still actively retrotransposing in D. CC persimilis. There is some substructure to this family, with a CC group of 5 elements in D. persimilis sharing 37 mutations from CC the consensus, and a group of 5 elements in D. ananassae sharing CC 5 mutations from the consensus. Despite a patchy distribution CC across the Drosophila phylogeny, there is no evidence for CC horizontal transfer of Nobel between the four species in which it CC is found. XX SQ Sequence 6294 BP; 1721 A; 1466 C; 1491 G; 1616 T; 0 other; tggcgccccc gggtggactc gcaatttaaa cgtatatatt aattcaagtg gaaaggcaaa 60 aagtgacagt gacacagtga agatacatac ataaatagcc aaaatctatt gtacatacat 120 acatacatgt acatgcatat gtgctgcagc taaattgcca aagtgaccgc gacgtgtgtg 180 caaataatta aacaaaccaa actaaaaaaa atcaccgcgt cggtcgtgca ataccctgtc 240 tgcagctaca tatacaagag gagagtacag cacctggaat ttcctatctt cccaagaggt 300 atttgtgcaa aaaattgaaa gctcgagtga agacgaaaga ttaaaaaact cgcaattatt 360 gtcttgtgct gaaacaattg cacctcgctt tcaattcttg gcacatacgt atgacatagt 420 agagctagat tgcatgctgt agttagagtt gagcatatat attaatacat atatacatac 480 atatgtacat ctaaccctct tggacgcaat aactacacat atacataact catttttact 540 gcaatcgctg tcgccccttt cccctttctc gttttcgttg tgtcaaatat atgtatagtt 600 gggtcgttgg ccaagcacgc ataccaatac gcagctgcag cggtgaattt tttcgtctcg 660 ctttcgctct tcaccgttgc ttatcgttgt tcttttgttt ggcgcgcttg atccaaacca 720 aacaccaagt gacgcgctac cctgttttcg tcgggtatat tcgtttggtg tcagtaggat 780 ccaactatca attaagacaa agtcgtgcct cgcaaatcaa tttagcgtcg atcgttgtcg 840 gtgcatacgc atacacaaca ttaattcttc agaactgcag cggtgaaatt tttcgttttc 900 gcttgtcacc agtacagttc gttggtattc ttgtgtggtt ttcgtgtacg cgccagccaa 960 gcgacgctct accctaaatt tttcgggtat attggttttt cacctcgttt gggtacaatt 1020 acccattatt tcattgagtg cctcggcatt aaatacgatt attagaaacg atctttcgcc 1080 ttcagttgct tgtattgaga atcaatatga cctcaaaagc caacaatatc gtccttgaag 1140 agctgaagaa aaagcgcagt tgccgtgtca aaggcattcg tttattagcg gaccgtttag 1200 aagcggggga tattccgctg gtattgacgg agctcgagtg cagactcgaa acgctgatgc 1260 gtagcattga cacggcacgc gaacttcaag agaagattga agatatagac attgacgatg 1320 aatttggagt cgagttggaa gaactctcaa ttgtcacaaa ggcaaaatta atgtctctca 1380 tttgcgcatt aaagacagct gatgagacaa tacaggcagt accagcttat ggtggcccaa 1440 ctcgtgctcg ccttccgaag ttagctctgc cgcagttcag cggcaactat tcggacttta 1500 aaaacttcat aggtttattt gagactctag ttcataatga ccaaagcatt ccaattatcg 1560 aaaagttcaa ccatttgctt tcgtgtctct ccggtgaagc gttgggaaca gttagggctt 1620 tccaagtaac cgagaccaac tacgagaaag caatggctag cttaaaacgg gtctacgata 1680 atgattgtct gatctttaaa aatcacattg atgccctatt taatcttccc aaaatgaccc 1740 agcaatcagc atcgtcctta aggaacctta tcgataccgc ctcatccata tatggttcgt 1800 tgttgtcgat aggcgatgat aaaaagatct cgaatgcgat gattattcat ctcgtcttga 1860 gcagagtcga tccggtgacc aaagagaaat gggaagaaca gctcgattat gacaaactgc 1920 cactttggac ggactgtgaa aggttgctta atcgcagaca tcagcatctt tcggccgaaa 1980 attcggacaa accaaagcag gagcagaaag ctgtgccaag caagccgcat aatcggagct 2040 cattcgcgtg ttccaccgcg aatactaaaa atcaacaatg cagctactgc aatgcgaaag 2100 gccatttgtt gacagagtgc agtccatttg gtcggctagc agtaatgcaa aggttcgaat 2160 tcgcaaaaac tgcatccttg tgcatcaatt gtctgcagcc aggccactct gtagtacgat 2220 gcaaagctaa gaagtgtcga gtgtgtggct gctcacacca taccttattg cataggtaca 2280 cagtggcgaa taaaagtcta gcgttaccat cacccccaga gaattcttct cctcctccca 2340 atcagaatca accttccact tcgcatgctc tccatgcgac agctatggat agagttattc 2400 tggctacgtc gatcgtaagc gtccgaacaa ataatggtga acatgttttg gcccgagctc 2460 tgcttgactc ggggtctcaa accaatttta taactgagga cttagcacag cgcttacaga 2520 tccgtaggga ggagtcgtgt atcaacttgc ttggtattgg tgaatccaat tcacaagtca 2580 agaaaaagat acacacggtg gtgaagtcgc gaattactgg tagtgagttt tcggttgatt 2640 tctggattct aaggtccatt tcggggtatc accctgatca gacacttaac gtgagtgact 2700 ggaaaatccc gaagaactta ccacttgcag acccatattt cttcaagcca cagaagatag 2760 atatgttgat aggtgcagaa acatttttcg aacttctatc tgttggtcag attcgacaag 2820 gtcctgacta tccaactctg cagaaaacgc ttctcggctg gattgtgtcg ggcagataca 2880 ctccaaaggt gactgccacc caagaagcta ggaattgttt gagttgtcga gaagagtcag 2940 tgatccacat cgacaataca ttacaaaagt tttggtcact ggaagaattg ccgtcttcta 3000 aaaagtcttt atctcctgaa cacaagttat gtgaaaagca ttatagaagg actactcagg 3060 tcgtgccttc aggtcggttt gaagtaagac tgccgttcaa atcagatccc agtatcttag 3120 gcaactcctt tgaagtggct aaacggcggt ttttatcgct tgaaaaaaga ttgtctcgtg 3180 accccgagtt gcagaaaatg tacttggagt tcatggaaga gtacctctcc ttgggtcaca 3240 tgtctaccac agacaacaca atccccaaaa ctccacatta tttcattcca catcagtgtg 3300 tgttgaggcc tcaaagcacg tcgaccaagc tccgcgtcgt tttcgatgcg tcttgtaaaa 3360 cgtcctctca ggtggccttg aacgacatat tgatggtcgg tcctacgata caggatgagc 3420 tgtattcgac attgctccga ttcagactac acaggtacgc cttgacagcc gatgttaaga 3480 agatgtatcg ccaagtctgg gtcgctgatg cggatagaca attccagctc atatagtgtg 3540 gagaagagac ccgtctgagt ccttgagaat ataccagctc aacaccgtaa catatgggac 3600 tggaccagcc ccatttttgg caattcggtg tttgaagagg ttgagtgagt ctgcaaaact 3660 ctcattccct aaagctgctg aagttatcga ctccgacttc tatgtcgatg acatgttgac 3720 tggtgctggt tgcgtagagg agttaaagac aattaagtct gatgtggctc aggttcttca 3780 aaccgctggg tttgaattga ctaaatggtt ttcgaactca cctgaggtca ccgcatccga 3840 gagcacagtt aaaccgataa cgatctcgga ctcagagtca actaaggcgt taggaatatc 3900 gtggctgcca cacgaagacg tctttaagtt ccaaattgac acgtcagtta tgggtctccg 3960 agcgacaaaa cggaacatct tatcggtgac atcaaagctg tttgaccccc ttggcttgtt 4020 aagtccttta gtaattaagg gaaagatcct actgcaggag ctttggctca acaagctaga 4080 ttgggatgag tcaattccaa tgaacttgga aacagcatgg aacttattaa aagaatcctt 4140 gggtcagttg gagaagatta ccatacctcg atttgtgcat actgatccca tgtcaccgat 4200 tcaagtacat gcctttgccg acgcatccat gagagcttat ggagcctgcg tctatatccg 4260 tagaaagact gcagaaggtt tcaaggtctc cttgttgact gcaaagtcga aagtagcgcc 4320 actcaagacg aaaacgctcc caaggcttga gttatgtgcc gctcaccttc tagcagacct 4380 ttgtcaccga atcaaatccc tgctcaaggt tccaattgat aaaatgatat tctggtcaga 4440 ttcagaagtt actcttcatt ggatcagatc ccacccttcg tcgttgtcaa cattcgttgc 4500 gaaccgagta gcggagattc aagagtggtc aagtgaagct acgtggcggc atgtgccaac 4560 aaaacagaac ccagcagaca tagtttcgag aggttgtgac gtagaggaaa tcgtgcagtc 4620 gatctggttt ggaggaccag agttcttaaa gtttgaagaa gagagctggc ccagaaatcc 4680 acacttcgag ctttccgaag aagagataca gatggaaagt cggaagaaat cggtcggact 4740 gaccgtcgcg gccaagccaa attacttagt ggatgtaatc gagggatact cgtcacacct 4800 caaactgcta agagtgttcg ttttcgtgtt ccgcttcatc cgaaaatgca aagacaaaag 4860 tctcaacttt ggaaaaattc cgtcatccgt agaatacgac gaggcgtttc taaaaatagt 4920 cgaaataaca cagaaaaatg agtctcaaga agatattgaa agggttcgta aaggcaccaa 4980 gttaggcccc agtcttcagc gcttgaatcc cttcatccat gaagaggctg ggacatggtg 5040 ctctttttcg ttgttgcgag ttggggggcg actagttaac gctcctatgt catataatgc 5100 caagtttcct ttgttattga caaaacgctc gcagtttgtg cagacatacg ttcgctatct 5160 gcatcaaacg aattttcatg ccggcccacg agcactcgtg agtatcctta gacagcgcat 5220 ttggatagtg aatgctcagg cggtctgcag ggcgacagtt aggtcgtgta ttcgttgctt 5280 taaatgcaag ccgctactcc agacccaaat gatgggcaac ttacccgcag accggctccg 5340 tgctctccgc ccattttcag tatgtggcgt tgatttttgt ggcccagtct atacgactct 5400 gaaaattcgt ggaaggcccc cttataagtc ctacatagcg ttatttgtct gttttgcgtc 5460 caaggcggtt catttagaaa ttgtctccga tttgacgact aattccttct tgttggcatt 5520 ccaaaggttc gtcggtcgtc gaggatgtcc gcaacgcgtg aactgcgaca acgcgacaaa 5580 tttcgtcgga gcaagtcgcc acttcagcga actgcggagg aagatggagg cggaagcgga 5640 cgcgatacgc gaatttgcgt caagaagcgg gtgcgagttt gccttcatac cgcctcgagc 5700 accgcacatg ggcggacttt gggaggccgg tgtgaagtct gccaagggcc tactcctacg 5760 cgccatcgga agcgctctcc tcaccgcaga ggagctggag accgtgctgg tcgggatcga 5820 ggcagtgctc aactcgcgcc cgctaggacc cctaagccca gaccccaagc gacggagacg 5880 cgctgactcc cgggcacctg ctgacaggcg ggccgctcat cgcaccccca gcacccagga 5940 ccccggacca ggagggtctg agctgcttaa agcgatggcg gcttgtctcg tcagccaggc 6000 aaatgttctg gcagcgatgg tcccgggagt atgtgctggg attacaaatc agatgcaagt 6060 ggcaccagga ggagccaaac ataaaggaag gcgaccctag taatcgtcgc cgaggacaac 6120 ctgccccctc aacagtggct cctaggaagg gtggtcggca caaccgccgg gcaggacgga 6180 agggtcagag tggtcgagct aaggacgagc agcggagcca cgttcaggag gccgatacac 6240 aaattggcgc ttctgccaat ggtttgaagc cttccaggcc ttcaacgggg ccgg 6294 // ID Gypsy-34_DPu-LTR repbase; DNA; INV; 292 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_DP_; KW Gypsy-34_DPu-I; Gypsy-34_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-292 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 292 BP; 67 A; 76 C; 70 G; 77 T; 2 other; tgttgcggat tggccgcaag gagcaggcag atcgaccgtc agagggacac ckgcgagttc 60 gtctgtctcc caagccaggg aggacagtcg gtttgagtcg agcgtgtgat cctggtwaca 120 gcaattaaga gagagccctg ttcacccact ggctacctct cgcgaagaaa ccccctttca 180 tctgagtgta atttccttgg tttcccaata caaactgtaa gtcactgaat tcccatttat 240 ttaagtttat ttattcgttg tggttgcatt caaccgctcg gcctccacaa ca 292 // ID BEL-68_CQ-I repbase; DNA; INV; 6044 BP. XX AC AAWU01019429; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-68_CQ_; KW BEL-68_CQ-LTR; BEL-68_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6044 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 289-289 (2011). XX DR GenBank; AAWU01019429; Positions 52731 46688. XX CC Positions [5078-5467] - Integrase core CC 'CGTC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 32..5467 FT /product="BEL-68_CQ-I_1p" FT /translation="MSNKSDSGVNAGAGSTAKKWKTCGKCGKPGQSRMVTC FT SECQSSFHYECVRITSEVLDGPWVCNGCLRRTIEANNNVLQIKLQEEQQKR FT MQHDQQQRQLERQWRELNPDLNPQQSRNVEFLRLGDQIVKIVRQGDSPLAE FT STPVNLDSELAPRQDWMEARLRQLDNYNQDLLNQIDMLQRLNPQGTESGAR FT NPLYEFPPPNNGSGGKTDAVRGRPSGLLDPIPERKEPAPSTSESVGSIATK FT VSRGSVALRAQRLKALEERQALERKQLEEKLALEQELLEQETVAGSDVEQD FT GSASKSDRNSSTDSNFLNITNWLEDLERSGEGAVSTPRGNRTKEEGRSRPD FT SSEQRDSVSLPTDQHHPPPKNPARLNSNQLAARHTVKDLPRFNGDPEDWPR FT FIAAYERTTRMCGFGNDENLDRLERSLYDRALFYVKGLLLYPENVPLIIKT FT LEIHCGNPELIVETMVQRVRAMSPPKADQMETIIDFGIAVQNLCATMTACR FT MNECLYNVALLQELVERLPPTIKINWALHRKSKKTVSLADFGEWLGTLVEA FT LSKVTRPPAGPKHQIGQRDRRNEQIHVHSDDVKERNASKGCHVCKEDCQSL FT EKCGEFGKMSPLSRWKLIKEMDICRKCLKKHIRACDSKVPCGTNGCPFLHH FT KLLHDDAKHKLPEQVTQDKPKEDCNAHQCCLGSVLFKYVKITLHGNGKSVP FT TYAFLDSGSTCSLMEHSLWEELGLDGEKQPLCIGWTAGQGRYEANSVRCTV FT DVSGTTGKRRNRLKKVHTVESLQLPEQTMNIDDLSEHYGHLANLPVESYAN FT VRPRILLGMDNAYLDQPLEAREGGENQPVAALTRLGWTVYGPCSVAEETQA FT ATEFNYHICQCETLQAEMKQYFMLDSLGTQIPSKPLLSKDDERAVDMLKKR FT TILDGDRYRTGLLWKFDDVKLPDSRPMALNRLRCLEKRMIREPELATALQA FT KIEDYEQKGYIRKLTDAEERADRDRVWYLPIFVVTNPNKPGKLRIVWDAAA FT EVRGISLNSVLLKGPDQVTSLVDVLLRFREYRTAVTGDIKEMFHQVRVHPD FT DQHCQRFLWNNGAPGSTPSVYVMQVMTFGASCSPSCAQFVKNINAERFAED FT SPAAAKAIINDHYVDDMLSSVETEQEAIKLASSVRDIHGQGGFEIRGWRSN FT SKAVLAALNSQECEDKNLSEASQFSTEKVLGMWWDSSTDTFTFRLPTKPDK FT DLLNGLRVPTKKEVVRVLMSIFDPLGLLANVLMFLKVLIQEIWRSGIGWNA FT TITPEQYEKWLVWLKVIEKVEAVSIPRCYRQITSSSSKTNVQLHTFVDAST FT NGCATVAFLRFEEEDRIECAFVAGKTRVAPNKLTSIPRLEIDAGTMGVRLA FT QKIMEGLRIVIHQRFFWTDARDVLCWLHADHRNYSQFVGFRVAEILEKSNL FT AEWNYCPSKLNPADDGTKWTKIPDLSEKSRWLHVICFIQQKMSEWPNKLIH FT IGTTTTELKHSVGVHCVTAQVINPEDFSTWKRLLRYTVHCKRYVRNLKETT FT SNRPRTTGEITQEELLAAEAYLHRLAQEAEYIEEIAILIKAGKGNPNGLQI FT PKGSPIFKLSPYLDQFGVLRMQGRTVACEFVDHNAACPVILPRNHHITQLI FT IRSMHFRYHHLNHETVVNELRQKYRIPQLRRACFAVRASCQDCKNRYARPS FT PPLMAELPPARLAAFTRPFSYTGVDYFGPISVAVGRRVEKRWGVLLTCLVT FT RAVHIEIAHTLTTDSCILALRNFMAIRGTPLELISDQGTNFIGADRELKKA FT YQKVDQNQLIREFTTTNTKWTFNPPSSPHMGGSWERLVQSR" XX SQ Sequence 6044 BP; 1706 A; 1532 C; 1573 G; 1233 T; 0 other; tcttaatttt tccgtttaat caactgccgg aatgtcaaac aaaagcgatt caggtgtcaa 60 tgcgggagct ggatccactg ccaagaagtg gaaaacgtgt ggaaaatgtg gcaaacccgg 120 tcaatcgcgg atggttacct gtagcgaatg ccagtcttcg ttccactacg agtgtgttcg 180 gattacgtcc gaggtactgg atgggccctg ggtctgtaac ggctgcttga gaagaaccat 240 cgaagcgaac aacaatgttc tccagatcaa gctgcaagag gaacagcaga agaggatgca 300 acacgaccaa cagcagcgcc agttggagcg gcagtggcgt gaactaaatc cggatctgaa 360 ccctcagcag tcaagaaacg ttgagttcct gagacttgga gatcaaatag ttaaaatcgt 420 gcgacaaggt gattctccgt tggcagagtc tactccagtt aacttggatt ccgaactagc 480 gcctaggcaa gactggatgg aagcacgttt gcgccaactg gataattaca accaagattt 540 gctcaaccaa attgatatgc tgcagcgatt gaacccacaa ggaactgaat cgggcgctag 600 aaacccattg tacgagtttc caccacccaa caatggatca ggtggaaaaa cggacgcggt 660 tagaggcaga ccgagcggtt tgttggatcc gatcccggag cgaaaggaac cagcaccatc 720 tacatcagaa tcggttggat cgatcgcaac gaaagtgtct agaggatctg tggcactgcg 780 agcacaaaga ttaaaagcgc tggaagaaag acaggcactg gagagaaaac agctcgaaga 840 aaagctagcg ctcgaacagg aacttcttga gcaagaaacg gtcgctgggt ccgatgtaga 900 acaggacggc tcggcgagta aaagtgaccg aaacagctca actgattcca actttctgaa 960 catcaccaat tggctcgaag atctggaaag atccggcgaa ggggccgtat cgacgccacg 1020 aggaaaccgg accaaggaag agggccggtc aaggccagat agctctgagc aacgtgactc 1080 ggtgtcactt cctactgatc aacaccatcc tccccccaag aatcctgcac ggctgaacag 1140 caaccaactg gcggcacgcc ataccgtgaa ggatcttccc agatttaatg gtgatcccga 1200 ggattggccg cggttcatcg cagcctacga gcggacgaca aggatgtgcg gattcggcaa 1260 cgacgagaac ctggaccgat tggaacgcag cctgtacgac cgcgctctat tctatgtcaa 1320 agggctgcta ctctatcccg aaaatgtacc actaataatc aagacactcg aaatccactg 1380 cgggaaccct gaacttattg tggagaccat ggtgcagcga gttcgagcga tgtcaccacc 1440 gaaagctgat cagatggaga caatcatcga tttcgggata gcggttcaaa acctatgcgc 1500 tacgatgacg gcttgccgaa tgaacgagtg cttgtataac gtggccttgc tgcaggagct 1560 cgtcgagcgt ctaccgccaa cgataaagat caactgggct ctacatcgta aaagcaaaaa 1620 gaccgtgtca ctggcagact ttggtgaatg gctcggtaca ctggtggagg cgctaagtaa 1680 ggtcacaagg ccgccagctg ggccgaaaca ccagatcggc caacgggacc gtaggaatga 1740 acaaatccat gtccattcag acgatgtcaa ggaaagaaac gcttcaaagg gctgccacgt 1800 gtgtaaggaa gactgtcagt cgctggaaaa gtgtggcgaa ttcggcaaga tgtcgccact 1860 atcacgctgg aaactgatca aggaaatgga tatctgccgc aagtgcctga aaaaacatat 1920 cagagcgtgc gactctaaag tcccttgcgg tacaaacggt tgcccattcc tgcaccacaa 1980 acttctccac gacgacgcca agcacaagct tccggaacaa gtaacccagg acaaaccgaa 2040 agaggattgt aacgcacacc aatgctgcct aggatcagtc cttttcaaat atgttaagat 2100 aacacttcac ggaaacggaa agtccgtccc tacatatgct ttcctcgaca gcggatcgac 2160 atgctcattg atggaacaca gtctctggga ggagctcgga ttggacggcg aaaagcaacc 2220 gctgtgcatt ggttggaccg ctgggcaagg tcgatatgaa gctaattcgg taaggtgcac 2280 ggtcgacgtg tccgggacca ctgggaaacg acgcaatcgg ctaaagaaag tacacacggt 2340 cgagagtcta caactacccg agcagacgat gaacatcgac gacctctcgg aacactatgg 2400 acacttggcc aacctaccag tagaatcgta cgcaaacgta cgcccgagga ttctcctggg 2460 gatggacaac gcctatctcg accaaccgct cgaggcccgc gaaggtgggg agaatcaacc 2520 tgttgctgct ctcacaagac ttggatggac tgtatacggc ccttgctcgg tggcagagga 2580 gacacaggca gctacagagt tcaactacca tatttgtcag tgtgagactt tgcaagctga 2640 gatgaagcag tatttcatgc tcgacagcct cggaacacag attcccagca agccgctgtt 2700 atcgaaggac gacgaacgag cagtggacat gttgaaaaaa aggacgattc ttgatgggga 2760 tcgatacaga actggactac tctggaagtt cgatgatgtt aaattgcccg attcgagacc 2820 catggctctg aaccgtctgc ggtgtttgga gaagcggatg attcgggaac cggaactcgc 2880 cacagctctg caagctaaaa tcgaggatta cgaacagaag ggatacatta gaaagctgac 2940 ggatgcagaa gaacgggcgg atcgagatcg tgtgtggtat ctccctatct ttgttgtaac 3000 caacccaaac aaaccgggaa aactgcgcat tgtgtgggac gctgcggcag aagtaagagg 3060 catatcttta aactcagttc tcctgaaagg cccggatcaa gtcacgtcac tggtagacgt 3120 tctgctgcgt ttccgagagt accgtacggc tgttaccggt gacataaaag agatgtttca 3180 tcaagtgagg gttcacccgg acgatcaaca ctgtcagcga ttcctgtgga acaacggcgc 3240 gcccggctca actccgtccg tctacgtcat gcaggtgatg acattcggtg ccagctgctc 3300 tccaagttgc gcgcagttcg tcaagaacat aaacgcggag cgctttgcag aggattcacc 3360 agcagctgcc aaggccataa taaacgacca ctacgtagac gacatgctga gcagcgttga 3420 aacagaacaa gaggcgatca agctagctag cagcgttcga gacattcacg gacagggtgg 3480 gttcgaaata cgcggatggc gatcaaactc aaaagcggtt ctcgcggcgc taaactcaca 3540 ggaatgcgaa gataaaaatc tcagcgaagc ctcgcaattc tcaaccgaga aagtactggg 3600 catgtggtgg gattcgagca cggacacgtt cactttcagg ctgccaacaa aacctgacaa 3660 ggacctgctc aatggcctcc gcgttccaac gaaaaaagaa gttgttcgtg tgctgatgag 3720 tatcttcgac cctcttggcc tgctggccaa cgtactgatg ttcctgaagg ttctaatcca 3780 agaaatttgg cgctctggaa taggatggaa tgctaccata actccggagc agtacgagaa 3840 gtggctcgtt tggctgaagg taatcgaaaa ggtggaggcc gtgtcaattc cgagatgcta 3900 tcgccagata acgtcgagct cgtcgaaaac aaacgttcaa ctgcacacct tcgttgatgc 3960 cagcaccaac ggatgtgcca ctgtcgcttt cctgcggttc gaggaagaag accgtattga 4020 gtgcgcattt gttgccggca aaactcgagt tgctccgaac aaactgactt caatacctcg 4080 gctggagatt gatgcaggaa ccatgggagt cagactagcg cagaaaatta tggaaggact 4140 tcgaatcgta atccatcaac gattcttttg gaccgatgca cgagacgtcc tgtgctggct 4200 gcatgcggac cacagaaact acagtcagtt tgtcggtttc cgtgtcgctg aaatactgga 4260 gaagtccaat ctggccgaat ggaactactg cccaagtaag ctgaacccag ctgatgatgg 4320 gacaaagtgg acgaaaattc ccgatctttc ggagaagagt cggtggctcc atgtaatctg 4380 cttcatccag caaaaaatgt cggaatggcc caacaaactg attcacatcg gtacaacaac 4440 aacggagcta aagcactcag tgggagtaca ctgcgtgacc gctcaagtaa tcaaccctga 4500 agatttttct acctggaagc gactcttgcg atatacggtt cattgcaagc ggtacgttcg 4560 caatctgaag gaaactacga gcaatagacc acggacaaca ggtgaaatta ctcaagaaga 4620 actcctcgcg gcggaggcgt acctacaccg gctggcacag gaagctgaat acatcgagga 4680 aatagcaata ctcatcaagg cgggaaaagg taacccgaac ggtctccaga tcccaaaagg 4740 cagcccaatc ttcaaactct cgccatactt ggaccagttt ggtgtactgc ggatgcaagg 4800 gagaacggta gcatgtgaat tcgtggatca taacgctgct tgtccggtta ttcttccaag 4860 gaaccaccat atcacacagc taattatcag atcaatgcac ttccgatacc atcatctaaa 4920 tcatgaaacg gtcgtcaacg agctgaggca gaagtacagg attcctcaac tacgccgtgc 4980 gtgctttgcg gttcgagctt cgtgccaaga ttgtaaaaac cgttatgctc gcccgtcgcc 5040 tccgctcatg gccgaactcc ccccagctag actagccgca ttcacgaggc cgttttctta 5100 tactggagtg gactattttg gtccgatttc cgttgcagtg gggcgacgag ttgagaagcg 5160 ttggggtgtg ttgctcacct gtctcgtgac aagagccgtc cacatcgaga tcgcacacac 5220 acttactacc gactcatgta tacttgcttt gcgcaacttc atggccattc gtggaacacc 5280 actggagctc atcagcgatc aagggacaaa tttcatcggt gcagatcgcg aactcaagaa 5340 ggcctaccaa aaggttgacc aaaatcagct catcagggaa ttcacaacta ccaacaccaa 5400 atggaccttc aacccaccaa gttcgccaca catgggagga agctgggaga ggctagtcca 5460 atcgcgttaa aagtcctgaa ccaaatgaaa ctcccgcgaa acccctcgga tgaagtgcta 5520 agaaacaccc tgctggaaat ctcgaacatc gtcaactcac gccctctaac ttacatccca 5580 gtcgaggacg ataacacccc cgcattgaca cctaaccact tcttgcttgg ttcatccagt 5640 ggttccaaac cacttgtcgc attcgacaat gggcacaccg cgctacgtaa caactggaag 5700 gcgtcacaga tctacgccaa cttgttctgg aagaaatggg taaaggagta tttgcccaca 5760 atctgccgac gcacgaagtg gcatcaacca gtgaaaccta tccaggtggg cgacgttgtt 5820 gtcatcgcag atcccgatca ccctcgcaac agctggccca tgggacgtgt cgtcagcact 5880 aacaccagca aggacggcca ggtacgtagt gcagtcgtgc gtacgagcga gcggttctac 5940 gaacggccgg cggtaaagct tgctgttctt gacgtcggtg ttaatgaagg taagctgcca 6000 caagttccca gcgtaccggg ggggaatgtt acgtcacttg gacg 6044 // ID RTE-2_BM repbase; DNA; INV; 3238 BP. XX AC . XX DT 29-APR-2010 (Rel. 15.07, Created) DT 29-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-2_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3238 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1054-1054 (2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. The ORF is interrupted by mutations. XX SQ Sequence 3238 BP; 1304 A; 628 C; 574 G; 730 T; 2 other; ggtaacgaga cctcacaaac aaacaagaag aacaagactc aaaagcccgg tacccccgcg 60 ataagatcca acagcatctc cgagggagtt gttaaaccgg gcatgctaaa cttcttggtg 120 aataaaaaca ccataacggc tagtaatcat aatccaggaa aatcaacata gcaattgcca 180 ccctcctcca attcaaattc aaatttagta cttaagaaca atcaaactaa acatcttcca 240 aaacggctgg tcaccgagga agattacgac tacaaccctc caagtgtaaa cattaacaga 300 cagcttataa gtaaaacaaa taaagataat gaacaaaaac tttatataat aactttcaac 360 gtgaaaacac tatcttctta tgaaaaacta atagaattaa cagaatcact aaaatcaata 420 aaatcagaca ttattggaat ggcagaagtg agacgtcttg gggaaaaaat agaggaatat 480 gaggatttta ttctttacta cacgggccag acacctggac agttcggcgt gggctttttg 540 gtgagaaaaa atcttaaagt aaacatagag aacttcacgg gcttatcaga acgagtcgct 600 atattaaact tacaattcca tgggcaaaaa ctatcaatca ttcaagtcta cgcacccaca 660 gaagcagcaa atgaagaaga tatttcttat ttttatgaga cagtcagaag ggctagggag 720 ttagcgcata aagactatat cataatgggc gacttcaacg ctaagatagg taaaccaaga 780 cacgaagaat tcttgattac aaaaccattt ggtcttggta taagaaatga cagaggccaa 840 aaattgatag atttcgctct agaaactaag acggcaataa tgaacacata ctatcataag 900 aaaccaaacc gcagatggac atggcgctca ccaggcggcg catataaaaa tgagatcgat 960 tacattctat ctaacatacc gcagaagttt cataatgtgg aagttctaaa tatcaactac 1020 ccatcagacc acaggccagt gagagcaacc atatctcttg aaaaaataaa aatatccagg 1080 actagataca agaccaacca gtgtgcgagt ctaaaatacg aaagcgaaat actcagattc 1140 agattaaatc tccaactcca cctgctacgt gccgatgacc agaacttgaa agatactgat 1200 aatatacaga ctcaatacaa caaaataacc aatgccatct cgagaagcct agaagaaacg 1260 cacaaggaac acaattcgta cagacacaaa aaaatatcgc tacgcacaga aaaactcatg 1320 actagaagga aagaattaca gaagactaaa aacaaaacca gaagtataag aaacgaaata 1380 tcagctttat acaaactcat tagcaagtat atcaggcaag attatgcaaa atacagacag 1440 gataccataa ctaagcactt acaagtttca ggcagcacga aaaaggctta taaagagcta 1500 gcacaaaata aaacttggat cgaaggactt aaatcatcag gaaaacaatc tataataaca 1560 aaccgcaatg atatcattaa tatggccact gaattttata aatcattata cagtaaaaaa 1620 acatccacat atgtagacat aacaccaagc agtaatatat caaacaacga aaacgtgcaa 1680 ccaatcacgg aatctgaact tattacagca ataaaaaacc tgaaaaccga caaaagccca 1740 ggcccagatg gcattncaaa cgaagctcta aaggcagctg acactctatt caactctcat 1800 tcttgaacct ggtgacaccc ccccaatggt cagagtcaaa tataatctta cttataaaaa 1860 aggagaccca atgacatggg aactacagac cgataagcct accctcgact gtacaaactt 1920 ttttctccat aatataacga agaataagta gtgcactaga gaaaggcaac ccgtgaacag 1980 gcgggcttca gaaaaggatt ttcaacacgt tgaccacatc acacgctaga gctcctgata 2040 gaaataccaa gaaaaacaga gaacactata cattgcatac atagactacc aaaaggcntt 2100 cgacaccgtg tcacacgata gcgtatggaa gtctttaaaa aaccaagggg tgaatgataa 2160 atatatacag ataattaaaa acatttataa aactaataca agtagaataa aactagaatc 2220 tataggccca agattttcta tcaaaagagg tgtaagacag ggagacccca tgtctccaaa 2280 aattttcatt gccatacttg aatcaatatt tagacagcta gattggaaaa acctcggctt 2340 aaatattgaa ggtaaatatt tgactcatct ccgctttgca gacgacttag tagtgctggc 2400 agaatcaagt tcagaacttc aatacatggt tgagtctctc cacagagcca gcattaaagt 2460 aggcttagaa atgaatacaa caaaaactat ggtaatgaca aatagtgcta gaaagaaaat 2520 aacagtcgga aacgaactat tgaaatacac agaccactat atttacttag gaaaacagat 2580 cagctttgaa agaaaaagca atgagatgga gattggaaga agaatccaac acacctggaa 2640 caagtactgg aatttacgtg aagtatttaa aagtacgttg ccaatatact taaaaaccaa 2700 ggttctaaat tcctgcattc taccatgctt gacttatggc tgccaaactt ggaagtttac 2760 atcaaaggcc aataatctaa tagtaagctg ccaacgagga atggagcgaa gtctgctgaa 2820 tataagaaag attcaaaaaa tcagacacac aaaaatcaga gaaaaaacaa aatctataga 2880 tgctcttgaa tatgccagaa acttgaagtg gaagtgggct ggtcatgtag ccagacttaa 2940 ggaccagaga tggactgcaa gaataactgc atgggaaggg ccacaaggga aaagaaaaaa 3000 gggtcgcccg atatcaaaat gggaggatga tataaaagct atcgctggcc ctcgatggat 3060 acaggtagcc caaaatagaa ccaaatggaa ggacttggag gaggccttca ccatgtttaa 3120 tgggttcctg ccaaacataa gtgaagaaaa ttaaaattat tatatgtata cctatttact 3180 ttgtatttta gcaggaaata aaaggctttt tattttattt tattttattt atttatta 3238 // ID Gypsy-39_DWil-LTR repbase; DNA; INV; 909 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_DWil_; KW Gypsy-39_DWil-I; Gypsy-39_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-909 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 2534463 2533555. XX SQ Sequence 909 BP; 286 A; 146 C; 228 G; 249 T; 0 other; tgttatgggc caaataattg ataaacaaca atacagctgt tatcgatggt gctggctcgt 60 tagctagaaa attagctggg aatttggcgg gaaaaaacca agggaaccga gctagttctg 120 ggagcggttc cggttccggg aacaaaattt gaatttttag gaatctaaga gagtggagga 180 gttgagtttt ttttggaaaa taagaggaag aggacgagaa acagaaaaaa ggagaaagaa 240 gttctagaag gcacgtggag ttgttaagtt ggagaagaaa aaaaactaaa aagtgcgtgt 300 gccgagaagg agcatctcac ccatccttga agaccaagaa ttgcatgcga gacgcaggac 360 ggacacacac aggagccttc ggacagccaa ttcgacggga tccaatcggc cacccagatc 420 cgggtgctcc aggttttttt tccatttaac ctcaacatta ggctgttaga ataagtaatt 480 gtattttgtt tatttatatt atttatttaa tttatatttg tattcgtccc agtggtaccc 540 tggttaggct cagttattat tgtaaaaaaa aaagtgaaat tgtttaatat aatatatgtg 600 tgataaaaaa aggagaaatg tctgagtaga gtcttgtcta tgtgggggtg agatgacgac 660 gggttagtcc acagaacatg ggggctgcca gggagtacgg ggcccaaggt agggtgggta 720 gatcctaatt aatataccag ctaattgata attccataat ccatcttccc gtccgagtgt 780 aacgtttttt tgtataattt ttttgttacc cgattaatcc atccatcgag gtcgaatcca 840 ttcctgagcc atgcgagtta gccggtgatc gatgatggat tgcagaagaa aaataatcgg 900 atcataaca 909 // ID Sola1-2_AC repbase; DNA; INV; 3386 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aplysia californica. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-2_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3386 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(807..1193,1054..1329,1333..2460) FT /product="Sola1-2_AC_1p" FT /translation="ALSVAPESTNTESIRHNDFETDETQSSNERLLESFEV FT EGSARQASPKKRKRINKKNDAKRKRQSDEEYVSAGTKKVVPARIMRKGCVG FT MECTRAGRQCSGITETDRREIFQDFYKLADLQKERVHCEAYERAALEWNVR FT ELAVSVLESQKRIGGRYFRTFINWLTCRRREFIARHIETTEPKVGKPNSRR FT SKTHAYYLTVNGKRLKVCRNTFLATLSISEKFVRVVLSKITETGVVEEDKR FT GGRQSELMKTREEQNREKISRHIDRFLRVESHYCRASSSKEYLSPDLTLSK FT MYDMFVKENPCNDIPSFHTYRRVFKTKSLAFHSPKNDQCSLCNSFRKGDEA FT QKNELREIYNRHEAEKQKVRQLKEDCKKMAQADHSILAGNFDFQQVIYLPI FT SHEDALFYKRRLSNFNLTFYNLGDHTCDCFIWHEGQSRRGSSEISTAVCRA FT LREYDQRGIKTAYLFADGCPGQNKNTIMPTMMLHMINTSAHMEELSLRYFE FT SFHGQCEGDSVHSAISTALGNAGDVFLPSQLFPIIALARRKQPYKVHPLEH FT GDFLNYKKLSEDLKVLNVRKDNQDSGVPMNWNKVMEVRVNKAHPTTIFF*" XX SQ Sequence 3386 BP; 1093 A; 720 C; 782 G; 791 T; 0 other; cgtcgcctgt cacacgaaag ggcgtgttcc aggattttga cagtttttga gttgttactt 60 ctagtaggat agcgctgcct cagatctatg ttcacacgaa acggcgtata cctagcttta 120 aaataacagt actagcagta aaaaaacgaa aagggcgtac gctaacactc aacgcgaccg 180 aatcagacga aacggcgtac gcaggcttgc gccctttcgt gtgcaaaaaa ttgatacaaa 240 gttgtatcaa atctattcgc acgaaaggac gtataccttg gttacgcccc ttcgtgtgaa 300 cgaccgttac acttttgtct ttttttttct ggcataggcc aaggccgtat gacgtaatat 360 agtaatactg gccctcctct aatgatacaa taggcctgta cacagcgcat gcactgtgtt 420 gcatcgaatc ctgcttgtgc aagagagaaa acaagaccgg ccttattgtg ttgttccagt 480 tgcttactgg tgcttgcggc ttgctgtcac atacaacctg tctttcaaac tgacctgctt 540 accttctgac gattggaaac cattcgaaaa agcaaacttg ggtttgatca aatttaaata 600 agatctactt tttgtccgac gattcacatg aatattgcta agctatatgt tcttccttta 660 gcgtattatg ctacactcat taactagaag gctacgctat ctagacaccc ccctccccca 720 cccccggcag cgctactgac tgaagtgata aaagcatcag gactttgcct ttgttatatt 780 gatctgaaga catgtctaca aagtgagcct tgtcagtagc gccagaatca acaaacactg 840 aatccataag gcacaatgat tttgagaccg atgagaccca gagtagcaac gaacggctac 900 tagaatcatt tgaagttgaa ggatcagccc gccaagcgtc accaaagaaa cggaagagaa 960 tcaacaagaa gaacgatgca aaaagaaagc gacagtctga cgaagagtac gtgagcgccg 1020 gcacgaaaaa ggttgttcca gcacgaataa tgagaaaggg ctgcgttgga atggaatgta 1080 cgagagctgg ccgtcagtgt tctggaatca cagaaacgga taggagggag atatttcagg 1140 acttttataa actggctgac ctgcagaagg agagagttca ttgcgaggca tattgagacg 1200 actgagccca aggtcgggaa accaaactct cgacgcagca aaacgcatgc ttattacctc 1260 acagtcaatg gaaagcgctt gaaagtgtgt agaaacacat tccttgcaac cttgagtatt 1320 tcagagaaat aatttgtccg agttgtttta tcaaagatca cggagaccgg cgttgtggaa 1380 gaagacaagc gagggggcag acaatcggag ctgatgaaga caagagaaga acaaaacagg 1440 gagaaaatca gtcgacacat tgacagattt ctacgcgtgg aatcgcacta ctgtagggcc 1500 tcttccagca aagagtatct cagcccagac ctcacattgt ccaaaatgta cgacatgttc 1560 gtgaaagaaa acccatgcaa tgacatccca agttttcata catacagacg agtgttcaag 1620 acgaaaagcc tggcctttca tagtcctaaa aatgaccaat gctcactctg caactctttc 1680 cgaaaagggg acgaagctca gaaaaatgaa ctcagggaaa tttacaacag gcacgaagca 1740 gaaaaacaaa aggtgcgaca acttaaggag gattgcaaga agatggctca agctgaccac 1800 agcattctgg cagggaactt tgatttccaa caggtcatat acttgccaat ttcacatgaa 1860 gatgctttgt tctacaagag gagactttcc aatttcaatt tgacatttta taatttgggt 1920 gaccacacat gtgattgctt catttggcac gaaggccaaa gcagaagggg cagctcagag 1980 atatcaactg cggtgtgcag ggcactgagg gaatatgacc aaagaggcat caagactgcc 2040 tacttgttcg ccgatggctg cccgggacaa aataaaaaca caatcatgcc caccatgatg 2100 ctccacatga tcaacacttc agcacacatg gaggaactgt cactcaggta ctttgaatcg 2160 ttccatgggc agtgcgaagg tgactcagta cacagtgcta taagcacggc cttgggaaat 2220 gctggggatg tgtttctccc atcacagctg tttcccatca ttgcattggc ccgacgcaaa 2280 cagccataca aggtacaccc gcttgagcac ggtgactttc taaactacaa aaaactgtca 2340 gaagatctga aagttctcaa cgtccgcaag gacaatcagg attcaggtgt tcccatgaac 2400 tggaacaagg tcatggaagt gagggtgaac aaagcccacc caacaacaat atttttttaa 2460 aaacagtcac ctggaagagc agtacaggtc catttcgctg aagcgccaac tttcacacct 2520 aattcattgt gaagttaaac aactaaatga agaaccaaac caaattagca aggaaaaata 2580 cggcgatttg atgtctctgt gctctgggaa cacacccgtc atccggaatg ccgagcacaa 2640 agctttcttc tgtcaactac cacacagcga atgagatgtc tcacaaaatt caagctgaga 2700 aatacaaaca tgtttgatca aagagagcct accaaagcat ggtggcatcc atcactaatt 2760 tgtttgtttt tttaaataat aaatttggaa ttggaacgaa aggaaactgc ttgtctttct 2820 aaaacatgat gcctgtacct gagagtatgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtga 2880 gagagagaga gagaaagata cagacggact gacagacaga cagactgaca gacagacaga 2940 cagacagacg gacgaacaga gaggtattgg aaggaaaaga gaaagggaaa gatttggcgg 3000 gtggatgggg gagtgaggag gaattttacc ctctactggt cagtctgaag actgaaatta 3060 tttgtgtgaa cactcaaaca cagggcgtat acatgggcgt atacactgat ttaggacagg 3120 gggaaacgtc gaacaatcac acgaaagggc gtatgctcac agatggcaca ttttcgcagt 3180 gaaattataa aaaaaagtgc aaaactaatc ttgctgattg tttgatatct tatcatgcac 3240 acaatatcca aatatcgcga tggtacgatg caaattccta taatgacaag ggaaaaggat 3300 aaagcaacct aagtctgaaa atctttaaat cgttctcctg aaaaagtaag aaaacaggga 3360 tacgcccttt ggtgtgatag acgacg 3386 // ID L1_Ele17 repbase; DNA; INV; 4594 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele17. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4594 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4594 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 6 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 179..1306 FT /product="L1_Ele17_1p" FT /translation="MNTRRIRENTFRVEFSNLPKKPTSEEVHAFVAQQLGV FT SRTQLVRLQLSHTDECAFVKVIDLNTAQRIVQTHDNQHDYEMKGVKYKIRI FT RMADGGVDVRLYDLSEHVSDEQVSRHMSAYGEVLSVSELLWSDKHHYAGMA FT TGIRQVKMVLREPIKSYITIDGETTYVQYAKQRPTCRHCGEFMHTGISCVQ FT NKKLLAQKVSVNDRLKRSSYAGAVRGASEGFSANETEEQRSVPLDNNAPFP FT QVQAQVEVPSTSNAAVFPPLPNKESPLAVPQMDVASGTDLVPTQQALEDEP FT STEMNQETPQRTVSIGDISSMSTDESCNGNTLESSIVCNPNGELVVVTTEK FT PLTEGTGKHKNKNSNDTNPFILAKRGRGRSKKH" FT CDS 1275..4502 FT /product="L1_Ele17_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="PNAAEGGPKSIKVDNHSYKIGTVNINNITNQTKTSAL FT HNFIKTQDLDIIFIQEVENDKLDLPGYNIVCNVDNARRGTAIALKHQIKFS FT NVERSLNSRIIALRVHDTITLVNIYAHSGSQYRREREELFNFTLSYYLRHH FT TQHIILGGDFNSVIHSSDATGVSNFSLALKNMVQQLQLHDVVNVLNRENKQ FT FTYITHNSASRIDRIYVSSGLLNNLRRADTHVCSFTNHKALTVRLALPNLG FT REKGRGYWSLRPCVLTTENIEELQLKWDYWTRQRRLYNSWIEWWVHFAKPK FT LKTFFRWKTNQMYAEYNRKYQILYTELREAYDRYLGDAGVLVTINRVKARM FT LKLQREFDTLFIRVNETRVSGEKLSTFQMGECRRKRTTIDSLDIDDNRTIE FT RSEEIEEFIFDYFKNMYAGEEHEVNDEWMCDQTVSPNNEANASCMNDITTG FT EIYSAIKTSASRKAAGPDGITKDFFLKAFDVIHRHLNLALNEVLQGNIPEK FT FLDGIIVLIRKKNCGNTVKAFRPISLLNFDYKVLSRILKARLELILKESEV FT LSPSQKCSNFGKNIHQATLAIKDRIVQLRETKRKGKLISFDLDHAFDRVDH FT HFLFQSMRATGLNPALIELLSNVYARSRSKILVNGHLSPEIQIKCSVRQGD FT PISMHLFVLYLQPLVHKLDQVCDDRDDLVVVYADDITVVTTCIDKIERIKQ FT LFISFGRCSGALLNLNKTYCIDIGHFRTSEPVNIDWLQTSEKMKILGITFC FT NSIRLMQKLNWDALVSKCAKNIWLHKMRDLNLQQKVVLLNTYITSKIWYVS FT SVLAMNKFHAAKLTSLMGSFLWAGIPIRVSMNQLALPYDRGGLKLQLPSIK FT SKALLINRHLKEATALPYYTSILMNAQNPPYLNSLPSNSPCLKLVYQEISY FT LPIQMRDNPTSNALHTYFLNQMDCPKVVSKYQNCNWVAVWRNIKSSQLSSL FT ERSFYYILVNEKIAHAELLFRMQRVDSAICPNCLSGAESLAHKISTCPRIA FT ESWRILQQELLKVYPDKTFNVEELIRPSMKINRVKKIKILKLFVNYVIYIN FT EFNSNIDNTALRSLLGNII" XX SQ Sequence 4594 BP; 1460 A; 939 C; 995 G; 1200 T; 0 other; cagttggcct tcgacagccg aagtgttaac acgttgtcga ttcggagcgc aagcatacgg 60 tcgagttttt tttttctgtg aaacgtttcg tcgcggtaaa tccgtgttcg agtgtagtgc 120 cggtctcagc accggatgta gtgtttgtag gagtcatctc agaacgttga gtctcgagat 180 gaatactcgc aggatccgcg agaacacatt ccgtgtggaa ttctcgaatc ttccgaagaa 240 accgacctct gaagaggtac acgcgttcgt cgctcaacaa ctcggtgtaa gccgaacaca 300 actagtgaga ctacaactta gccacacaga cgagtgtgcg tttgtcaagg ttattgacct 360 caacacagca caacgaatag tgcagaccca cgacaaccaa cacgactacg agatgaaagg 420 ggtaaagtac aagattcgga tccgtatggc ggatggcggt gtggacgtta ggttgtatga 480 cctttctgag cacgtttcag acgagcaggt cagtcgccac atgtcggcgt atggcgaagt 540 tctctcagtt agtgagcttc tctggagcga caaacaccac tatgctggta tggctactgg 600 catacgtcag gtcaagatgg ttctacgtga acccatcaag tcctacatca caatagatgg 660 agagaccacg tatgtacagt acgccaaaca aagacccaca tgtagacatt gtggcgagtt 720 catgcacaca ggcatatcct gcgtgcaaaa taaaaagttg ttggcgcaaa aggtgagtgt 780 caacgatcga ctgaagagaa gttcatacgc tggagcggtg agaggagcgt ctgaagggtt 840 cagcgcaaac gagacggagg agcaacgaag tgtgccactg gataacaacg cacctttccc 900 acaagtgcag gcacaggtgg aagtgccttc gacttcaaac gcagcggttt tcccgccact 960 tcccaataag gaatctccct tggctgtccc acagatggat gtggcatctg gtaccgatct 1020 ggtgcccact cagcaggcgt tggaggatga gccttcgact gagatgaatc aggaaactcc 1080 gcaacgaacg gtctcaatcg gtgatatctc atcgatgtcc accgatgaaa gttgcaacgg 1140 aaacacgctg gaaagctcaa tcgtatgtaa cccgaacggt gagttggtag tggttactac 1200 tgagaaaccg cttacagagg gtacaggtaa acacaaaaat aagaactcaa acgacactaa 1260 cccatttatc ttagccaaac gcggccgagg gcggtccaaa aagcattaag gtggataatc 1320 acagctacaa aattggaacc gtaaacataa ataacataac taatcagact aaaacttctg 1380 cactacacaa cttcataaaa acacaagatt tggacatcat cttcattcag gaagttgaga 1440 atgataaatt agatttacca gggtacaaca ttgtttgtaa tgttgacaat gctcggaggg 1500 gtactgccat agcgcttaaa catcagatca aattttcgaa tgtggaacga agtttgaact 1560 cgcgtataat agctctaaga gtacatgata cgattacgtt ggtcaatatc tatgctcatt 1620 ctggctctca ataccgacgt gaaagagaag agctttttaa cttcactctc tcgtattatc 1680 tcagacacca cacacaacac atcatcttag gtggagactt taattcagtg atccattcta 1740 gcgacgccac aggtgtcagc aatttcagtt tggcattgaa aaatatggtt caacaattac 1800 aactacatga tgttgtaaat gttctcaatc gggaaaacaa gcaatttacc tacataacac 1860 ataactctgc gtcgcgaatt gatagaatct atgttagttc agggctactc aataatctta 1920 gacgtgcaga tacgcatgtt tgctcattta ctaatcacaa agcgctaaca gtgagactcg 1980 ctttaccaaa tctagggaga gaaaagggaa gagggtattg gtctcttcgg ccttgcgtgc 2040 ttactacaga gaatatcgag gaactgcaac tcaaatggga ttattggacc agacagagac 2100 gactatacaa ttcatggata gaatggtggg tacactttgc aaaaccaaag cttaaaactt 2160 ttttcaggtg gaagacgaac cagatgtatg cagaatacaa tcgtaagtat caaatactct 2220 ataccgagct ccgagaagct tatgaccgat atttgggaga cgcgggagtg ttggtgacta 2280 ttaatcgggt taaggcacgc atgttaaagc tgcaacgcga gttcgacaca cttttcattc 2340 gggtcaatga aacacgagtg tcgggggaaa agctttcgac atttcaaatg ggtgaatgta 2400 gacggaagag aacgactata gatagtttag acattgatga caatcgaacc attgagcgtt 2460 ctgaggaaat cgaagaattc atttttgact attttaaaaa tatgtatgcg ggggaagaac 2520 atgaagtgaa cgacgagtgg atgtgtgacc aaacagtgtc accaaacaat gaagccaatg 2580 ctagttgcat gaacgacata actactggag agatttacag tgctatcaag acaagtgcat 2640 ctcgaaaggc tgcgggtcct gatggcatca cgaaagattt tttcctcaaa gctttcgatg 2700 ttattcacag acatttgaat cttgcattga atgaagttct gcaaggaaac attccagaaa 2760 aattcttaga tggaatcata gttctcatta ggaaaaaaaa ttgcggcaac acggtgaaag 2820 ctttcaggcc aatatctttg ttgaattttg actataaagt cctctcgagg attctcaaag 2880 ctcgtctgga actcattctc aaagagagtg aagtgcttag cccctctcaa aagtgctcaa 2940 attttggaaa aaatattcat caagcaacgc tcgctatcaa ggatcgaata gtgcaattgc 3000 gtgaaaccaa gagaaaagga aaattgattt ccttcgatct ggaccacgcc tttgataggg 3060 tcgaccatca ttttctgttt cagtcaatga gggcgacagg tttaaatcca gcacttattg 3120 aacttttgtc aaatgtctat gctcgatctc gatcaaaaat ccttgtgaac ggtcatctgt 3180 caccagagat tcagataaaa tgctcggtcc gccaaggtga tcctatatcg atgcaccttt 3240 tcgtgctata tttacaaccg ctagtacata agcttgacca agtttgcgat gatcgcgatg 3300 atttggtagt agtctacgca gacgacatta ccgttgttac aacgtgcatt gacaaaattg 3360 agcgaatcaa acagcttttc atttctttcg gcagatgttc cggagctctt ctcaatttga 3420 acaaaacata ttgcattgac attggtcatt ttagaactag tgagcctgtt aacatagatt 3480 ggttgcagac gtccgaaaaa atgaaaattc taggtatcac gttttgtaac tcaattagat 3540 tgatgcaaaa attgaactgg gatgccctag tttctaaatg tgccaaaaac atatggcttc 3600 acaagatgag ggacttgaat ctgcaacaga aagtcgtgct gttgaacacc tacatcactt 3660 ctaaaatctg gtatgtttcc tcggtactag caatgaacaa attccatgca gcaaaactaa 3720 catcacttat gggatccttt ttgtgggctg ggataccaat ccgtgtatca atgaatcaat 3780 tagcactccc atatgataga gggggactta aattacaact tccgtctata aaatctaaag 3840 cacttctgat aaatcgccat ttaaaggaag ccacagctct tccatactat acctctattt 3900 taatgaatgc tcagaatcca ccgtatttga actcacttcc aagtaactct ccgtgtttga 3960 agcttgtgta ccaagaaatc tcttatcttc cgattcagat gagagataat ccgacatcga 4020 atgcattgca cacctacttc ttaaaccaga tggactgccc taaggtggta agtaagtatc 4080 aaaactgcaa ctgggtagcg gtttggcgga acatcaagtc aagtcagcta tcgtcactcg 4140 aaagaagctt ctattacatc ctcgtgaatg aaaaaattgc tcatgcagaa ctattgttca 4200 gaatgcaaag ggtagactct gcaatatgtc caaattgttt gtctggagca gaaagcttag 4260 ctcataaaat atcaacttgt cctagaatag ccgaatcgtg gaggattctc cagcaagagc 4320 ttttaaaagt ctaccctgat aagacattca acgtcgagga gctgatccga ccatcaatga 4380 aaattaatag agttaaaaag atcaagatac tgaagctttt tgtaaactat gtcatatata 4440 ttaacgagtt taatagcaat attgacaaca ctgcactgcg gtccttgtta ggaaatataa 4500 tatagaagtt gtagtatata attagatata attacttaaa tgtataaatg tattaactga 4560 accaaataaa gattttataa aaaaaaaaaa aaaa 4594 // ID DNA8-6_CQ repbase; DNA; INV; 1107 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1107 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 83-83 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >88% CC identity. 8bp TSDs. XX SQ Sequence 1107 BP; 397 A; 158 C; 159 G; 393 T; 0 other; tagggcgtcc aattttcccg ggttttgaat ttcccgggaa acgggaaaaa tatttttcaa 60 atcccgggaa ttcccgggat cccgggaaat tttttaaaca gtcataaaat ctatgttttc 120 attaaatttg ttatgttttt caagcttaca atcatagaat tagctcaata ctgttaattg 180 gtgataatct ttacttcaat ctgaacaaga acagcagttt ttagtaagtt taaaaatatt 240 aaaaattgtt gttttttgtt ctttgttatg ctttggattt ttaatgaatc ataatttcta 300 atccaatgtg attcaaatta tttcatatta ataatcataa aattcttatt aatatatttc 360 ttttatattt ttatatgaaa tgactacaaa agctaaaaaa tttcattgaa actgtaaaaa 420 aaacaagaaa cgaaaacaaa tagaatgaaa ccatatgacc atccaatgta aaattttaga 480 aaagtttaga atttcatgtt ttatataaaa catattttac aaactatctt taagtcacta 540 ccgccaactg ggggaaatcg ggactgcagt ctgaataggt acaggatatt ttagagcaat 600 aaatttgaaa attggtgtac taattgtttt gttaattaaa aaaatatcag aacattatgc 660 tgtcttaatt cgttactatt gtcctaattt gctccagatg ccgatgtctg atgaaaaata 720 attaaatatg ttttttttca agtttaaaat catcattaaa cgtaaatatt taaaaaataa 780 ataaaattta atgaaaaata tttaaagcgc taggcatttt gcggatccgt aaactaaaat 840 tttaataatt ttcccttata agccaaataa atgttgatat atgccgaata caaatttgct 900 aatgctttaa aattgttatc gattactaag tattcaactg ttcatctatt tccacaacca 960 aatctgagtt ttcagaccaa aaaatgattt tttttcaatt tcgggaattc ccgggacaaa 1020 atatcaaaaa tcccgggatt cgggaattcc cggtttaggg aaaatcccgg gatttttgtc 1080 ccgggaattc ccgggatgga cgcacta 1107 // ID hATx-4_SM repbase; DNA; INV; 2857 BP. XX AC . XX DT 22-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-4_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2857 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1041-1041 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 995..2344 FT /product="hATx-4_SM_1p" FT /translation="MTGYCNAERILDMINLKLKTFGISFTNHIVAVTTDGA FT SVMXKLGSTCFAENMLCLNHAIHLAVVDVFYKVESNSKCNKGKTVSSDSDD FT FEDDDDGYIAEDDDIDEDDNDDRVSIEDDTDVNDDDVNSIKIVLSRKDVGD FT AIKHTRNIIKYFRKSPTKNEILQKYVKEEFSREYCLLLDVKTRWNSLVLMI FT ERFLKLKNCVKKSLIDLNLTNIWKEEHISILETTFNILKPIKVAVEALSRK FT DTNILTADAILNFLLKKLKNMDTEIGSLMFNTMLTRVEERRNMLSFSLAAY FT LHTSEIPKNNDFLSYSSKQAMIAYAGKLMDRLYPSENCIENVDDVAIVTDI FT KKIETIEDELQIAIKDATTKATNDKKNHDKTKKLKTEISVLESTGQRTENL FT NLLYNSLLTIKPTSTDCERVFSVAGNFVSKTRSRLSDNSIDXITFLKSHFL FT HEKEK" XX SQ Sequence 2857 BP; 1039 A; 375 C; 478 G; 959 T; 6 other; tagggattct atcccgggac gacccgagaa ataattcccg ggaatttgat aaatttctca 60 agtctagtgt cccgggaaaa ttttctcgag acccgggaat ttatttttca agatttatcc 120 atttcagcta tgtaaagtct caccgcaccg tcttttgatg cacgcagcac gcacgtgttc 180 attcttctag ttgcaacctt gcaacttaca atgccaaccc ggctcaggcg tatttcatgc 240 ttataaacat tttcaagtta taacttgtta gacgttatgc tctagatcaa aataagtgtt 300 attcagtttt aattgctttt ctttattaat ttttaacagt taaagataag agtatgattc 360 ctagttaatt ttattattat aaatttactt ttttatttag taatccttat gtcgaaaagt 420 ggagtgtgga aacattttat gaaatcaaaa aatgatgaag aagcacagtg tatgcattgt 480 ccaaaacaat tggcatgcaa aggctctaac acttcaggat tgttgcgaca tttgaaaaga 540 attcattcca ttaacgaaga atgtgagaac acttccacta atataaaaag aagtaaactg 600 tcagatccaa gcactagcaa tacttcatca ataataaaat ttttaaaaaa ccaaatttaa 660 atgaaacggt tgtgaaattg gcatgtgaag acggcattcc attcaaaaca ataactaaaa 720 gttcatttat tcgcaattcc atgaaaaatt tgaattttaa tttgccaaaa aatcaaacaa 780 gtgttwtgaa tttggtgtat aagtattagt atattatgaa gtgaaaaaat ggtttttaaa 840 ttaaaagcgc aaataacaca agcgattacc gcaggcgaaa gatttagctt gtctctggat 900 gaatggtcca gtgtcggcaa taaacgttat ttaggaattg tgttacatag taaacaggaa 960 caaaatagat tcagttagtt taggattagt tcgtatgact ggttattgca atgcagagag 1020 aatattagat atgataaatt tgaaacttaa aacttttggt atttcattta caaatcatat 1080 agtagcagtt actaccgatg gcgcaagcgt gatgraaaaa ttgggaagca cttgctttgc 1140 tgaaaatatg ttgtgtctaa atcacgccat tcatctcgcc gtcgtagatg tcttttataa 1200 agttgagagc aattcaaagt gcaataaagg taaaacagta tctagcgata gtgatgattt 1260 tgaagatgat gacgatggct acattgcgga agatgatgat attgatgaag atgataatga 1320 cgatagggta tctattgaag atgacactga tgtcaacgac gatgacgtaa attcgattaa 1380 aatcgtatta tctaggaaag atgttggtga cgcaataaaa cacactcgga atataatcaa 1440 atattttcga aaaagcccaa ctaaaaatga aatcttgcag aagtatgtga aggaagaatt 1500 ttcaagagag tattgtttgt tattagatgt caaaactcgg tggaattcac tagtattgat 1560 gattgaaagg tttttgaaat taaaaaattg tgttaaaaag tcattgattg acttaaattt 1620 aacaaatatt tggaaagagg aacatatttc catattggag actacattta acattttaaa 1680 accaattaaa gttgcagttg aggcactaag tcgcaaggat acaaatattt taacagcaga 1740 tgctatttta aactttttgt taaaaaagtt aaaaaacatg gacacagaaa ttggtagttt 1800 aatgtttaat acaatgctta caagagttga agaaagaaga aatatgttaa gcttttcttt 1860 agctgcatat ttacacactt ctgaaatccc caaaaataat gattttctgt cctattcatc 1920 aaaacaagct atgatagctt atgcgggaaa attgatggat agattatatc catctgaaaa 1980 ctgtatagaa aatgttgacg atgttgcaat tgtaacagac attaaaaaaa ttgaaacaat 2040 tgaagatgag ctacaaattg ccataaaaga tgctactact aaagcaacaa atgataagaa 2100 aaatcatgat aaaacaaaaa aattgaaaac cgaaattagt gttttggagt caactgggca 2160 aaggacagaa aatttaaatt tattgtataa ctcgttatta acaattaagc cgacaagtac 2220 cgattgcgaa agagttttct cagtggcggg aaattttgtt agcaaaacca gaagtcgctt 2280 atccgataat tcgatagack cgataacgtt cttgaagagc cactttcttc atgagaagga 2340 gaaataaatt tttgggttaa cttgcattac gcccccagca tttaggaata tttttcataa 2400 aaaacatact aaataaggaa ttatttttaa taaaaagatt tttttaaaaa aatttctaat 2460 atagtaaata aaatacttan tataaaattw wtttattttt gataattaaa attttatact 2520 attaatagat ttattttttt taaaaaatct tttattaaaa ttaattcgtt atttagtatt 2580 ttttttatga actatattcc taaaatgctg ggggtggtaa tgtaagttaa ccaatttttg 2640 tttaaattta atagttaatt tgttttattt tcattttttc gttcgatgta tttaatattt 2700 atttaacaaa attttatgtt ttaaatattt tttgtgtttt aaatatatgt atttatatta 2760 atgttttact attttttttt gtcgagaaat ccctggattt ttaccattat agtctcgaat 2820 cccgggattg aaaaacatcg agaaattgga atcccta 2857 // ID Copia-1_DYa-I repbase; DNA; INV; 4336 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DYa_; KW Copia-1_DYa-LTR; Copia-1_DYa-I. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4336 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 929389 925054. XX CC Positions [1933-2274] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 101..1669 FT /product="Copia-1_DYa-I_2p" FT /translation="MSGQGLCAIDKLDGENYAVWSVQMKSVLVHSGLWSLV FT CGRRVKQESDTAEQGAQFDELDEKALATILLCVKASEINLIKNCASSKEAW FT DKLAAVYKPRGPARKITLFRKLLRLSLSESACVQNYINEFVDIVEKLKEID FT MDISEDILTILMLSGLGKKFENFVVAIETREQLPTLSNLKVKIIEGGERQR FT VSSDETELCVQQAFVSYPSGKGDKKNRSAQKNSDAKKNKKCFRCGREGHFI FT AQCKVKDKNDEIETSAKEKSFTMLASVNKNDVLTKTMWCLDSGATAHMCCE FT RSWFADIKEHKEKILLAGENYIFSDGVGTVAVKTKNGSVNLKNVLYVKNLQ FT SNFISVAKAVDKGYKVIFHKHSAYIKNGQNEEVLQARKAGNLFVVDTPNNR FT CMYLCANDELWKWHYRYGHLNMASLKRMASNNMVNGLKSIKNVVDIDRITC FT ETCAKGKICVKPFPKMAENRADKVLGLVHSDICGPMNKVSAGGARYFVTFI FT DDYSRYMFVYFLKTRDELLPTLKDFVA" FT CDS 2131..4311 FT /product="Copia-1_DYa-I_1p" FT /translation="MARSMLIHSGVNEFLWGEAVRTAAYLRNRSETRSLKS FT QTPFELWSGKKPSISHLKIFGCKAIALNKKHAGKFKPKGIECMMIGYSTTA FT KAYRLMRIGSNQVIESRDVVFLEETIGYPKNGNSENETALIDEIGVDCSQI FT LAVDNNKEGAEVLEEEQIDSEVSEADSSDVYASAKSDDSEEREIRGPGRPK FT KIKTGKPGRPRKQYNLLNMMKVQDVKVPITFAEATQSEISQEWVASIQKEI FT GALEANQTWELQDLPEGKKAIGSKWVFGIKGDKDGNVMRFKSRLVAKGCAQ FT QYGIDFFETFSPVVRYSSVRLIISLAVEHNLFLHQMDVSTAYLNSELSEEV FT YMKQPDGFDDKHHGKVLRLKKSLYGLKQSGREWNAKLNKVLQKLHFVPCAS FT EPCVYTRNEKDNINIIAVYVDDLLIASSCKNDLIGIKESIGNEFKVVDSGQ FT LNHFLGIEVEREGETGSISIGHKTYIESMLHTWGMLDCKSAATPLEANFQV FT KCTDPNCKPVNEKDYQSLIGSLMYLAITTRPDIMHSVAKLAQRNVNPHKEH FT EVAAKRILRYLKGTSNLKLHYQSTGKPIHGYVDADWANDTSDRKSYSGYVF FT INAAGPISWESKKQSLVALSSTEAEYVALSIAAREAVFIKKFLKEMGFLID FT HSILIFCDNQSALCLAKNPVLHNRSKHIDIKYHYVRELYSKKEIDVKYICT FT NDMLSDILTKNLQKNKHLKCITEMNCF" XX SQ Sequence 4336 BP; 1455 A; 722 C; 1050 G; 1109 T; 0 other; ataggttatg ggcccaggag tagtaaagac tttaataatt gtgtgtgatc aagtggaaaa 60 taattatttg ctgtaattag cttgtacatt ttttataaac atgagcggac aagggctctg 120 tgcgatcgac aaattggacg gcgaaaatta cgcagtatgg tcggtgcaaa tgaagagtgt 180 attggtgcac tcaggtttgt ggtcgttagt gtgcggaaga cgcgtgaagc aagaaagtga 240 tacagctgag caaggagccc aattcgacga actagacgaa aaagcattag caacaatatt 300 gctgtgcgtg aaggcttcag aaattaattt gattaaaaac tgtgctagtt ctaaagaagc 360 ttgggacaag cttgcagctg tgtacaagcc tcgtggaccc gccagaaaaa ttaccttgtt 420 caggaaatta ctccggttga gtttgtcgga aagtgcgtgc gtgcaaaatt acataaacga 480 atttgtggat atcgtcgaaa aactgaaaga aattgatatg gacatttctg aagatattct 540 tactattttg atgctgtcgg gactcggtaa gaaattcgaa aatttcgtgg tagcgatcga 600 aacgcgtgaa cagttaccga ctctctctaa tttaaaagtg aaaattatag aagggggaga 660 gagacaaaga gtgagttcgg acgaaacgga actttgtgtg cagcaagcat ttgtgtcgta 720 tcccagtgga aagggagaca agaaaaatag atcggcgcaa aagaacagtg atgcaaaaaa 780 gaataaaaaa tgctttcgat gtggtcgcga gggtcacttt attgcgcagt gcaaagtgaa 840 agacaaaaat gacgaaatag aaacatctgc aaaagagaag tcgtttacta tgcttgcaag 900 tgtaaataaa aatgacgttc taacgaaaac gatgtggtgc ttggacagtg gagcaactgc 960 tcacatgtgt tgtgaaaggt cgtggttcgc ggacattaaa gagcataaag agaaaatatt 1020 acttgcagga gaaaactata tattttcgga tggcgtcggt acggtggccg ttaaaacaaa 1080 gaatggttcc gttaatttga aaaatgtgct ttacgtaaaa aacttgcaaa gtaatttcat 1140 atcggtagcg aaagccgttg acaaagggta taaagtgatt tttcacaaac atagtgcgta 1200 tataaagaac ggtcaaaatg aagaagtgtt acaggcaaga aaagctggaa atctttttgt 1260 tgttgatact ccgaacaatc ggtgcatgta cttgtgtgca aatgatgagt tgtggaagtg 1320 gcattatcgt tatggtcatt tgaacatggc gagcctaaaa aggatggcaa gtaacaatat 1380 ggtcaatggt ttaaagtcga tcaaaaatgt tgtggacata gaccgaatta catgtgaaac 1440 ctgtgctaaa ggtaaaatct gtgtgaaacc attcccaaaa atggcagaaa atcgtgcgga 1500 caaagtgcta gggctagtgc attcggacat ttgcgggcca atgaacaaag tatctgccgg 1560 tggagcaaga tatttcgtaa catttattga tgactactcg cgctatatgt tcgtatattt 1620 tttgaaaacc cgtgatgaat tgcttcccac gttaaaagac tttgtcgcgt gaccggtgaa 1680 aaacttaaag caattcgcag cgataacggt cgcgaatata taagccacga gtttcagaag 1740 ttcctatctg aaaacggtga aaccattccc aaaaatggca gaaaatcgtg cggacaaagt 1800 gctagagcta gtgcattcgg acatttgcgg gccaatgaac aaagtatctg ccggtggagc 1860 aagatatttc gtaacattta ttgatgacta ctcgcgctat atgttcgtat attttttgaa 1920 aacccgtgat gaattgcttc ccacgttaaa agactttgtc gcgtgaccgg tgaaaaactt 1980 aaagcaattc gcagcgataa cggtcgcgaa tatataagcc acgagtttca gaagttccta 2040 tctgaaaacg gtattaagcg gcaattgaca gtaccgtaca ctccgcagca aaacggggtt 2100 gctgagcgcg ctaaccgaac acttgtggaa atggcaagga gcatgcttat acattcagga 2160 gtgaatgagt tcctgtgggg agaagcagtt aggactgcag catatctacg caacagatcg 2220 gagacaaggt cgttaaagag tcaaacacca ttcgaactgt ggtcaggtaa aaagccttcg 2280 atatcgcatt taaaaatctt tggatgcaaa gcgattgcat taaataagaa acatgcaggt 2340 aagttcaagc caaaaggcat tgaatgcatg atgatagggt attcaacaac agcgaaggcc 2400 tacagattaa tgcgcattgg atcgaaccag gtaatcgaaa gtagagacgt cgtttttcta 2460 gaggaaacta tagggtatcc gaagaatgga aacagcgaga acgagacggc acttattgac 2520 gaaattggag ttgattgttc gcagattttg gcagtggaca acaacaaaga aggtgcagag 2580 gtgctcgagg aagagcaaat tgacagcgag gttagtgaag ctgatagttc tgatgtttac 2640 gcgagtgcga agagcgatga tagcgaagaa cgggaaattc gtggtcctgg tcgtcctaag 2700 aagataaaaa ctggaaaacc cgggcgccca cgaaaacaat acaacttgtt gaacatgatg 2760 aaggttcaag atgtcaaggt acctattacg tttgctgaag cgacacagtc tgaaataagt 2820 caagagtggg tagcatcaat tcagaaggag attggcgccc tcgaagctaa tcagacatgg 2880 gagctacagg acttaccaga aggtaaaaaa gccatcggta gtaagtgggt ttttggcatc 2940 aaaggagata aggatggcaa tgtgatgcga tttaaatcga ggttggtggc taaaggatgt 3000 gcacaacaat atggtatcga cttttttgaa acgttttcac cagttgtcag atattcgtcg 3060 gtaagactta ttatatctct ggctgttgag cataacttat ttctacatca gatggatgta 3120 tcgacagctt acttaaatag cgaattgagc gaagaagtct atatgaagca acccgatgga 3180 tttgatgata agcatcatgg aaaagttttg agactaaaga agtcactgta cgggctaaaa 3240 cagagcggta gagagtggaa cgccaagctc aataaagtct tgcaaaaact gcactttgtt 3300 ccctgcgcca gcgagccgtg tgtgtacaca cgcaatgaga aggacaatat taatataatt 3360 gcagtgtatg tcgacgatct tcttattgca agctcatgta aaaatgattt gattggtata 3420 aaagagtcaa ttggcaatga gtttaaagta gtagatagcg gtcagcttaa tcatttcttg 3480 ggcatcgaag ttgagcgaga aggtgaaact ggatccatat ccataggtca caagacatat 3540 atcgagagta tgctacacac ctggggtatg ttagactgca aatcggcagc aacgcctttg 3600 gaggcaaatt tccaagttaa gtgtactgat cctaactgca agcctgttaa cgaaaaggac 3660 taccaatcac ttataggatc actaatgtat ttggcaatta caaccaggcc agatataatg 3720 cattcagttg caaaattggc tcaaagaaat gtgaacccac ataaggagca tgaagtcgct 3780 gccaagcgta tactaaggta tttgaagggc acctcaaact taaagctaca ttatcaatca 3840 acaggtaaac cgattcacgg ctacgtcgac gcagattggg caaatgacac atcggatcgt 3900 aaatcctaca gtggatatgt cttcataaac gcagctggtc caatatcctg ggagtccaag 3960 aagcaaagtc tggtggcatt atccagcacg gaggccgaat atgttgcatt gtccattgca 4020 gcaagagaag ccgtatttat aaaaaagttt ctgaaggaaa tgggtttttt gatagatcac 4080 tcaattttaa ttttttgcga taatcagagt gctttatgtt tagctaagaa tccagtatta 4140 cataatagaa gtaagcatat cgacataaag taccactatg ttagggaact ttattccaaa 4200 aaggaaattg atgtaaaata tatatgtaca aatgacatgt tatcagatat attaacaaaa 4260 aacctacaga aaaacaagca tttaaaatgt ataactgaaa tgaattgttt ctgaatatgt 4320 tgttgagaag gagtat 4336 // ID piggyBac-1_SM repbase; DNA; INV; 2434 BP. XX AC . XX DT 29-MAY-2008 (Rel. 13.05, Created) DT 29-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-1_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2434 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 520-520 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-1_SM is a young family of piggyBac transposons, CC characterized by 12-bp TIRs and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of four copies (they are ~97% identical to the CC consensus). The genome contains several hundred copies of CC nonautonomous deletion derivates of piggyBac-1_SM. CC In addition, we report here other 14 families of the freshwater CC planarian (flatworm) piggyBac. Evolution of two of them, CC piggyBac-8_SM and piggyBac-15_SM, involved relatively recent CC events of horizontal transfer between a flatworm and insect (see CC comments on these transposons). XX FH Key Location/Qualifiers FT CDS 476..2245 FT /product="piggyBac-1_SMp" FT /note="piggyBac transposase." FT /translation="MTSKKFLNIEDIETELENFDDNNSFDGFEDDEILEAT FT VSVPDQSNLSCPQSTAIDNLMEVCDLSDNESLSSSLPEENMEVDSDSEESV FT WNKGELQHERDLPIDFDTVNVTPTYNFSVTDGPVVYFQRFFDAQVITDLVN FT QTDLYAQQVKIKNWVPVTLEELNAFIGILIGMALHDLPSIDSFWSSDPLFL FT VKPIADCMPIKRFKKILEALHCNDNSQAPAVSCANRDKLYKVRPLIDILNT FT KFEIECHSSSSQSIDEAMIKFKGRSGIKQYMPMKPVKRGYKVWARADSNTG FT YLYQFDVYTGKLEDGVEEGLGSKVIKKLSTSLKNKNCHIVFDNFFTSYTLM FT EELYEHKIYATGTVRSSRKELPALAKKTQKMNKHESKYRIRNHTGYVKWKD FT TKIVSVLSTAFRPNIIGEAKRTQKDGSTKIVPCPQSIIEYTKRMGGVDRFD FT QKRTYYGIGRRSKRWWLRIFYFMIDAAIANANILYNVNHHNATLSGLDFRI FT RLHRSLISNYTSRKRPSLSNTPFIKKQKHNSTHKDLGVPDEIRLSKVGVHM FT PEKLTTFRRCRFCSSKTNNKRSNIGCTTCNVALCINCFKKFHK" XX SQ Sequence 2434 BP; 818 A; 418 C; 441 G; 757 T; 0 other; ccctttctaa ggtggtggca atatactgta caccatttta aactaataaa aatatctaag 60 tgtaacgctg tcggtaatca tttttacttt tggtacaact actgacatat attgagccac 120 aagagaacaa atttatagta tttgaaacca tagatgtgtt ttcccgctat atttattgac 180 agctgacaga caagtgtgtg tgacctacta cataaatctt catttatata attgatcttc 240 gattcaaaag gtatattatt tattattttg atgcgataca taatgcagta ttcctatgaa 300 attattattt tccttctact atatctatat tttgaaaatt ttgtagcaaa actttgggtg 360 gcaagatagt gtacattgcc cgtttgatgg tagcattggt ggcaacatac ttgccactgc 420 ctttgaatgc cactatatat tcaacctttt atttcagcta ttcaattttt tagaaatgac 480 ttctaagaaa tttttgaaca ttgaagatat cgagacggag ttggaaaatt ttgatgacaa 540 taattcattt gatggctttg aagatgatga aatattggaa gctactgtca gtgttcctga 600 tcaatccaac ttatcctgcc ctcaatctac cgcaattgat aatttgatgg aagtatgtga 660 tttgagcgac aatgagtctc tttctagttc tttacccgaa gaaaatatgg aagtagactc 720 tgacagtgaa gaatccgtat ggaataaagg tgaactccaa catgaaagag atttaccaat 780 cgattttgat actgtaaatg taactccaac atacaatttt tctgtaacag atggaccagt 840 tgtatatttt caaagatttt ttgatgccca agtaattaca gacttagtga accaaactga 900 cctttatgca caacaagtaa aaattaagaa ctgggttcct gtcactttag aagaactaaa 960 cgcatttatt ggcatcttga ttggtatggc gcttcatgat ttaccatcca ttgattcgtt 1020 ttggtcatca gacccattgt ttttagtaaa gcctatagct gactgcatgc ccataaaacg 1080 attcaaaaaa atattggaag cattgcattg caatgataac agtcaagctc ctgcagtgag 1140 ctgtgctaat cgcgacaagc tatataaagt acggcctctt attgacatat tgaataccaa 1200 atttgagata gagtgccact catctagttc gcagtcaatt gatgaggcta tgattaaatt 1260 caaaggccgt tcaggcatca aacaatatat gcctatgaaa cctgtaaagc gtggatataa 1320 agtttgggct agggcggact cgaacacagg ctatttatac caatttgatg tttataccgg 1380 aaagcttgaa gatggggtcg aagagggcct aggaagtaag gtgataaaga aattatcaac 1440 ttcactaaag aataaaaact gtcatattgt gtttgataat ttttttacgt cttacacact 1500 catggaggaa ctatatgaac ataaaattta tgctactggt acagtgaggt catcaagaaa 1560 agaacttcca gctttggcaa aaaaaacaca gaaaatgaac aagcacgaat caaaatacag 1620 gataagaaat cacactggct atgtaaaatg gaaggataca aaaatagtgt cggtgttatc 1680 gactgccttt agaccgaaca tcattggaga agcgaaaagg actcaaaaag atggctccac 1740 taaaattgtt ccatgtcctc aatccataat tgagtataca aagagaatgg ggggtgtaga 1800 tagatttgat caaaagagaa cgtactatgg aattggtagg cgatctaagc gatggtggct 1860 tcgaattttt tactttatga tagatgcagc tattgcaaac gcaaatattt tatataacgt 1920 aaaccatcac aatgcaacat tatctggcct tgattttcgt ataagacttc atagatctct 1980 tatatccaac tacacatcga ggaaacgacc gtcactatca aacactccgt ttatcaaaaa 2040 acaaaaacat aattctacgc acaaagatct tggagttccc gatgagataa ggttgtcaaa 2100 agtcggagta cacatgccag aaaaactaac aacattcaga agatgcagat tttgtagctc 2160 gaagacaaac aacaaaaggt cgaatattgg atgcacaaca tgtaatgtag ctctgtgcat 2220 aaactgcttc aagaaatttc acaaatagct tatcctggaa tttcttgatg caatgtcact 2280 gaaaggtagt ggcaatcata ttgccacccc ttctccgtaa aagcctcttg tttttttctg 2340 tttttatttt tttatgatca tttgtaacta tataattgtt atcaataaac tttgagtttt 2400 acatcacttt tagtttcctt tgccttagaa aggg 2434 // ID Gypsy-25_DPu-LTR repbase; DNA; INV; 147 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_DP_; KW Gypsy-25_DPu-I; Gypsy-25_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-147 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 147 BP; 37 A; 46 C; 26 G; 36 T; 2 other; tgacagatcc agcagaggct tccctacacc ccggctttgc ccttactccc aagtcgactt 60 tgtccctgat gtcacgwcca gcccactctc ttgtaamtgt acgggtactg acatagtcgt 120 taatacatcc aagtaaagaa cctaaca 147 // ID R2_DYa repbase; DNA; INV; 3548 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE yakuba. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-3548 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 136..3306 FT /product="R2_DYa_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="FERRIFPKGLVPLTKDNHIGTTNLQNEPRIFTNDLLT FT TRPSVDHVPEDQYEPNAAATLSRVPCTVCDRSFNSKRGLGVHMRSRHPDEL FT DEERRRVDIKARWSEEEKWMMARKEVELMANGFKHINKQLAVYFANRSVEA FT IKKLRQRGDYKEKIEQIRGQSALAPEVANLTIRRRPSRSEQDHQVPTSEAS FT PITPLEQSNREILRTLRGYSPVVCPSKWRAQELQTIIDRAEFEGKETTLQC FT LSLYLQGIFPVQGVRHTLTRPPRRPRNRRESRRQQYAVIQRNWDKHKGRCI FT KSLLNGTDESVMPSREFMEPYWREVMTQPSPSSCNGEVIRTDHSLETVWSA FT ITEQDLRASRVSLSSSPGPDGVTPKTAREVPSGIMLRIMNLILWCGNLPHS FT IRLARTIFIPKTVTAKRPQDFRPISVPSVLVRQLNAILATRLTSSIDWDPR FT QRGFSPTDGCADNATIVDLVLRHSHKYFKSCYIANLDVSKAFDSLSHAAIY FT GTLRAYGAPKGFVDYVQKTYEGGGISLNGEGWCSEEFVPARGVKQGDPLSP FT ILFNLVIDRLLRALPSEIGTKVGNAMINAAAFADDLVLFAETRMGLQTLLD FT KTVDFLSTVGLKLNADKCFTVGIKGQPKQKCTVLEAQSFCVGSREIPTLKR FT TDEWKYLGIHFTASGRVRCNPAEDIGPKLQRLSEAPLKPQQRLFALRTVLI FT PQLYHKLSLGSVTIGVLRKTDKLIRFYVRRWLNLPSDVPIAFVHAPPKCGG FT LGIPSLRWVAPMLRLRRLSNIKWPHLVQSEEASSFIEAEKQRARGRLIAEQ FT NELLSRPAIEKYWANRLYLSVDGGGLREAGHYGPQHGWVSQPTRLLTGKEY FT LDGIRLRINALPTKSRTTRGRHELERQCRAGCDAPETTNHIMQKCYRSHGR FT RVARHNCVVNRIKRGLEERGCVVIAEPSLQCESGLNKPDLVVLRQNHIDVI FT DVQVVTDGHSMDEAHQRKINRYDRPDIRTELRRRFEAAGDIEFHSATLNWR FT GIWSGQSVKRLIAKGLLSKYDSHIISVQVMRGSLGCFRQFMYLSGFSRDWT FT " XX SQ Sequence 3548 BP; 1005 A; 801 C; 916 G; 826 T; 0 other; gggaacatgg ggtaaaggtg agtagagggg gagtattttt tatactctgc aactcataag 60 tcttgccttt actcaagtcg actcaaaacc tcctcgtggt gtttcccggt aatgttaaac 120 ttgtttagca gctaatttga gcggcgaatc tttccgaaag ggttggttcc cctgacgaag 180 gataatcata ttggtaccac aaatttacaa aacgagcctc ggatatttac taatgatctg 240 ttgacgaccc gaccctccgt ggatcacgtc ccggaggacc aatatgaacc aaacgcagcg 300 gctactctat caagggttcc ctgcacagta tgtgaccggt cctttaacag taagagagga 360 ctcggtgttc acatgcgatc tcggcaccca gacgaacttg atgaagaacg tcgacgtgtc 420 gatataaaag caaggtggag tgaggaagag aagtggatga tggcgagaaa ggaggtcgag 480 ctcatggcaa atggttttaa acacataaac aagcaactag cggtgtattt tgcaaaccgt 540 agcgtcgaag ccattaaaaa gctgagacag aggggcgatt ataaggagaa aatagagcag 600 ataagagggc aatccgctct cgccccagaa gttgctaatc taaccataag gcgccgccct 660 agtagaagtg agcaagacca ccaagtacca acgtcagaag catctccaat cactccgctc 720 gaacagtcga acagggaaat tttgcggacg ctgcgtgggt atagccccgt agtatgccct 780 tccaaatgga gagcccaaga actacaaact atcattgata gggcggaatt tgagggaaag 840 gaaaccactc tccaatgctt atcgctctac ctccagggaa tttttccggt acagggtgta 900 cgacacacgc tgacgaggcc tcctcggaga cctcggaata ggagggaaag cagaaggcag 960 cagtacgctg tcatccagcg aaactgggat aagcataaag gaaggtgcat taagtccctg 1020 cttaatggaa ctgatgaatc ggtaatgcca agccgagaat ttatggagcc ctactggaga 1080 gaagtaatga ctcagcctag cccaagctct tgcaatggag aagtgattcg tacggatcac 1140 tcgcttgaga cggtatggtc tgcaataacg gaacaagacc ttagggcatc aagagtttca 1200 ttatcttcat ctccggggcc tgacggggta actccaaaaa ctgccaggga ggtgccgtca 1260 ggtattatgc tacgcataat gaacctaatt ctatggtgcg gtaacttacc tcattctatc 1320 cgactggcca gaaccatctt catcccgaag acggtgacgg caaagcgacc gcaagacttt 1380 cgtccaatat cggtgccttc cgtcctggta agacagctaa atgccatctt ggcaacccga 1440 ttgacctcat caatcgattg ggacccgcgc cagcggggct tctcaccaac cgacggttgc 1500 gccgataatg cgacgatagt tgacttagtc ctgaggcata gccataagta ctttaaatct 1560 tgctacatcg ccaacttaga tgttagcaag gcatttgact cattgtcaca tgcagcaata 1620 tatgggacat tacgagctta tggtgcgccg aagggttttg ttgactatgt acagaagacg 1680 tacgagggag gtggtatcag tctcaacggg gaaggttggt gttcagagga attcgtgcct 1740 gctagaggag tgaagcaggg cgaccctttg tcccccattc tatttaactt ggtcatcgac 1800 cggttactta gagccctacc tagcgagatt ggtaccaagg tcggaaatgc catgataaac 1860 gctgctgcat ttgcagatga tttggtacta tttgcggaaa ctcggatggg acttcaaact 1920 ttgttggaca agactgtgga ctttttatcc accgtcggcc ttaaacttaa tgctgataag 1980 tgctttactg tcggtattaa gggacagccg aaacagaagt gtactgtgct agaggcacag 2040 agcttctgcg taggctcgag agagattcca acactgaagc gtactgacga gtggaagtat 2100 ctcggtatac atttcactgc aagtgggagg gttcgatgca atccggcaga ggacattggt 2160 ccaaagctac aaagattgtc agaggccccc cttaagccac aacagaggtt gttcgccctt 2220 cggactgtcc tgatcccaca actctatcac aagttatccc ttgggagtgt gacgataggc 2280 gtcttacgaa agactgacaa gctaatacgt ttctatgtgc gaagatggct aaatcttccg 2340 tcggatgtgc cgatagcatt cgttcatgcc cccccaaaat gtgggggtct cggaattcca 2400 tcactaagat gggtagcacc aatgttacga cttagacgat tgagcaacat aaaatggccc 2460 cacctcgtac aatccgagga agccagctcc ttcatcgaag cggaaaaaca aagggcccga 2520 ggtagattga tagctgaaca aaatgaattg ttatcgcgtc cggcaataga aaagtattgg 2580 gcgaacaggt tgtacctctc cgttgatggt ggcggactcc gtgaagcggg ccactatggt 2640 ccccaacacg ggtgggttag tcagcccacg cgtttactaa caggaaagga atatttagac 2700 ggtattcggc tgcggataaa tgccctaccc acaaagtctc gcactacgag gggaaggcac 2760 gaattggaga ggcagtgtcg tgcaggatgt gatgctcccg aaacaacaaa ccacattatg 2820 caaaaatgct atcgttcgca tgggaggcga gtagctagac acaactgcgt agtaaatcga 2880 atcaagcggg gacttgagga gagaggctgc gtggtcattg ccgaaccaag tctgcagtgc 2940 gaatccggcc ttaataaacc ggacctggtg gtactccgac aaaatcacat tgatgtgatt 3000 gacgtgcagg ttgtgacaga cggacattct atggacgaag cgcatcagcg caaaatcaat 3060 agatacgaca gaccggacat acgaactgaa ttgcgtcgcc gattcgaagc cgcaggtgac 3120 atcgagtttc attctgccac cctaaactgg agggggattt ggagtggtca atccgtaaaa 3180 cgattgattg cgaagggtct cctcagcaaa tatgatagtc atatcattag cgtccaggtt 3240 atgagaggca gtcttggctg ttttagacag ttcatgtacc ttagcgggtt ttctcgtgat 3300 tggacgtagc ttaaaacgtt tggttcacat acatctgcct gctgccttgg cacaatatca 3360 aaaaggcata aacatcgcac atattggtta tttacggcta tgaggatggt tttagtacgt 3420 aggcgttgcg gaacttcggt tcggatagag caatgaatcg tgcatgctag gaactgacca 3480 aataacagca gccctagtat ctttcgaaga tttccatacc tttgcgatca aaaaaaaaaa 3540 aaaaaaaa 3548 // ID Gypsy-130_AA-I repbase; DNA; INV; 4230 BP. XX AC supercont1.8; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-130_AA_; KW Gypsy-130_AA-LTR; Gypsy-130_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4230 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.8; Positions 429315 433544. XX CC Positions [3384-3722] - Integrase core CC 'CATAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 91..3081 FT /product="Gypsy-130_AA-I_1p" FT /translation="MTSVEGSSKGENSKSAIPVIVNRGGMIGSIEPYVPGE FT CFGEYKERLELFFELNDVIESKRVAMLITLIGPETYKILKSLVIPAEPKTK FT SFEELVTALTGHFAPTVNVIAERYKFHQCVQASSESIAEYIVAIKARAQSC FT KFDNFLDDALRDRFVAGIQSSGLRAVLLKEKNLTFQSACDLALNYEMAESG FT NTMLQHSSQQYVKRLGFKQSRVKEEKTVPQVSKSDIRKTCHLCGKSHEPKD FT CPARNWECFSCGKKGHTSLVCRSKPKKNVKSVLKKSSRIHEVKQKVADSEE FT EGEFFAMKLIEEEECPDPASVNVVSEVCKPSSSASPVAKMEVLIEEKSVVM FT EVDTGASASVIPLERYLSLFSEIPVQKCSKTFVTLTGDKIQISGQIKVRVR FT LPGSSRAIELELVICECKKPSVSKMPLIGRPWLDSLVPEWRQLFDSGVSRI FT QLLSSKSVDLVSIQKRFPNVLSRSLSDSIVGYEAEIVLKPDAVPIFHKAYT FT VPFGIRERVEHELDRMCREGVIVPVKSSPWASPIVVVPKSDSVRICIDCKV FT TINKFIETEHYPLPRSDDLFASLANFNCFCVIDLSRAYLQLKVSEKSRQYL FT TINTHKGLYQYTRLPFGVSSAPSIFQSVIDRVLQGVPGTKVYLDDIIIGGS FT TLEECKARLSEVLSRLNVHHVKINLEKCKYFEESVDFLGHTISFNTIRPNS FT EKVVAIVNAPVPKSVVELQAYLGLLNYYGRFIPRLSEELRVLYNLLKKGTK FT FNWSPLCQSTFVRSKQLILDHDVLTLYDPDKEIVVCADGSPLGVGAILSHV FT VDGVERPVLFASSTLSPAERNYSQLHREALAIIFAVTRFHKYLYGKSFTLC FT SDSEALKEIFNPTKGTSIVSASRLQRWSVTLSMYKYKFIHRPAREMRHVDA FT LSRLPLPDPTGIDDESIHRLSENCPLDFEKIRDAQKRDQLVQKLVACVHDD FT WPKKSLIVCKFITKSVTTLALKMMLCSTLIGLLFQKASKEVF" XX SQ Sequence 4230 BP; 1289 A; 763 C; 991 G; 1187 T; 0 other; gttggcgacg aggtaatcgg tagcgcgtgt tcgtagtttg tgtagtgagt cggtcagtgc 60 gtctaaagtc aatcacgtgg tcgtgaagtc atgacaagtg tcgaaggaag cagtaaaggt 120 gaaaattcga aatctgcaat accggtgatc gtaaatcgtg gaggaatgat cggaagcatt 180 gagccttacg tccctggtga gtgcttcgga gagtacaaag agcgtctcga actatttttc 240 gagttaaacg atgtgatcga aagcaagcgc gtggctatgc taatcacatt gatcggtccg 300 gaaacgtaca agattttgaa gtcgttagtg atccccgcgg aacctaaaac gaaaagtttt 360 gaagaactgg tgaccgcgtt aacaggccat tttgcgccaa cagtgaatgt gattgcagaa 420 aggtacaagt tccaccagtg tgtgcaagca tcgtcagaat ctatcgcgga gtacattgtc 480 gcgataaaag cacgtgctca gtcgtgtaag tttgataatt tcctggatga cgctttgcgg 540 gataggtttg tggctggaat ccaaagttcg ggtttacgag cagttcttct gaaagagaaa 600 aacctaacgt ttcagtccgc gtgtgattta gccttgaact acgaaatggc tgagtccggc 660 aacaccatgt tgcagcattc gtcgcagcag tatgtgaaaa ggttaggttt caagcagtca 720 cgtgttaaag aagaaaaaac ggtaccgcaa gtgagcaaga gtgatatccg gaaaacgtgc 780 catttgtgtg gaaaatcgca cgaaccaaag gactgtccag cacgtaactg ggagtgtttt 840 tcgtgcggga agaaaggaca tacgtcgttg gtatgtagat ccaagccgaa gaaaaacgtg 900 aaaagtgtgc tgaaaaaatc atcgagaatc cacgaggtca agcaaaaagt ggccgatagt 960 gaagaagaag gagaattttt cgctatgaag ttgattgaag aagaagaatg tccagatcca 1020 gcgtccgtca acgtagtcag cgaggtctgc aagccgtcaa gcagtgcgag tcctgtagcc 1080 aagatggagg tgctcataga ggaaaagtcg gtcgtcatgg aggtcgacac tggagctagt 1140 gcgtcagtca taccactaga aaggtattta tcgctattca gtgaaatccc agtccaaaaa 1200 tgttcaaaaa cgtttgtaac gctaacagga gataagattc agataagtgg tcagattaag 1260 gttagggtta ggctaccagg aagttcacgt gctattgagt tagaacttgt catttgcgag 1320 tgtaaaaagc cgtctgtatc gaaaatgcca ttgattggtc gtccttggct cgatagtttg 1380 gtccccgagt ggaggcaatt gttcgattct ggtgtgtcac gaatccagtt gttgagcagt 1440 aagtctgtcg atttggttag catccaaaaa cgattcccaa atgtgctttc tcgtagtcta 1500 tctgatagta tcgttggtta tgaggcggaa atagtgctaa aaccagatgc cgtcccaata 1560 ttccacaaag cgtatacggt cccttttgga attagagaaa gagtggaaca tgagctagat 1620 cgaatgtgtc gtgaaggtgt gatagttccg gtaaagtcta gtccgtgggc ctctccaatt 1680 gtagtagtgc caaagtccga ttctgttcgt atttgtatcg attgtaaagt cacgatcaat 1740 aagtttatcg aaaccgaaca ttaccccctt ccgcgcagcg atgatctgtt cgctagtttg 1800 gcaaatttca attgtttctg tgtgattgat ctctcaagag cctatctaca gctcaaagtg 1860 tctgagaagt cccgtcagta tttaacgatc aacacacata agggtttgta tcagtatacg 1920 cgattacctt ttggtgtgtc tagtgcccct tcgatttttc agtccgttat tgatagggtg 1980 ttgcaaggtg tccctggtac aaaagtttac ctcgacgaca taataattgg tggtagcacc 2040 ctagaggagt gcaaagctcg actctcggaa gttctaagtc gtttgaatgt ccatcatgtg 2100 aaaattaatc tcgagaagtg caaatacttc gaggagtctg ttgatttctt aggtcacaca 2160 atcagcttta acaccattcg tccaaattca gaaaaggtcg tggctattgt gaacgcccca 2220 gttcctaagt ccgttgtaga gcttcaagca tatttaggat tgttgaatta ctatggcaga 2280 ttcatcccta gattgtcaga ggaattgcga gtgctgtaca atttgctcaa aaaaggcacc 2340 aagtttaact ggagtcctct ttgtcagtcc acttttgtga gaagtaaaca gttgatactg 2400 gatcatgatg tcctaactct gtatgatcca gataaggaaa tagtagtctg tgccgatgga 2460 agtccgcttg gtgttggtgc aatactatca cacgtcgtag atggtgttga acgtccagtc 2520 ctatttgcat ctagtaccct ttctccagct gagcgtaatt attctcaatt acatcgagaa 2580 gcacttgcga taatatttgc agtgactcgt tttcacaaat acttgtacgg aaaaagcttt 2640 actctttgct cagattctga agccctgaaa gagatattca acccaacaaa aggtacatct 2700 attgtttctg cgtcaaggtt acagcgttgg tcagtcacat tgtctatgta caagtacaag 2760 ttcatccatc gtccggcacg agagatgcgc catgttgatg cattatctag attgccgtta 2820 ccagatccta ctggtattga tgatgagtct atacatcgtc tgtctgaaaa ctgtccactt 2880 gattttgaaa agattagaga tgcgcaaaag cgtgatcagt tggtccagaa gttagttgcc 2940 tgcgtacatg atgattggcc caaaaagtcc ctgattgttt gcaagtttat tacaaaatcc 3000 gtaacaactt tagcattgaa gatgatgctt tgttctacat tgatcggatt gttgttccag 3060 aaagcctcaa aagaagtgtt ttaactgttg tacatgagcc tcattgcggt atagtgcgta 3120 tgaaaatgaa tgctaggtcg tatgtgtggt ggaaaaacat caatgatgac attgacaatt 3180 ttgttgcttc atgtcttact tgtcagcaaa cacaaaactc aaaatgcgcg agagataccg 3240 ttgagtggcc agagtcggtt agaccgtttc aaagaattca tgtcgatttc tttcattttg 3300 agaagtttac ttgtttggtc atagtcgatt catattcaaa gttcattgac gtgaaagaaa 3360 tgaacaaaac taatgctgca taaacaatat gcaagatgag ggaattcttc gcgtattttg 3420 gtttgcctga tcagattgtg tctgacaacg gtcccccttt tggttccaag gaattcgttg 3480 agtttggaga gcaaaatggt atcaaaatga ctcgaatacc cccgtaccat ccagagagta 3540 aaggattggc tgaacgggct gtacagactg tcaaaggagt tttgaaaaag gtatatctgg 3600 attcatcgaa ctgtcgaaag cttcccctgt ccgaaataat ttgcaagttc ttgatcactt 3660 acaataatac cccctcaaca gtgacaggga aagctccaaa tgacatgatc tactcatata 3720 agccaagaac attactaact aaaattaatc caaagcataa tgagaaaata attaatgaaa 3780 caggttgtaa atcaaaagta atttatgtac aaaaacaaga aaagaagatt aatgacgaaa 3840 cattcaaaat taatgaaaaa gtaatgtaca gaaatgtgtt aaaaaattat gttaaatgga 3900 ttcctgcaag agttgtgaag agagtcagtc tttgtacata tttgattaat gtaaatgaca 3960 atgttaaatt tgtgcacaga aatcagatta ggaagtctag tttagatgac aagttccatc 4020 ctgaatattt gattcataat aatgatttta ttagaaaaga aacgacaaat gaagatactg 4080 agctcaagca taaaagaaaa gaagcgaaat tgaattcgcc taaaaaggtc agaagatcaa 4140 gtcgaattag gaaacggcct ttgagatttg gtttcaaaag ttatatgaaa tattgatagt 4200 cgaatataag aatcttcaaa agtgggagga 4230 // ID Gypsy4-I_Dya repbase; DNA; INV; 7680 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_Dya; KW Gypsy4-LTR_Dya; Gypsy4-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-7680 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1046-1046 (2009). XX DR Genome; chrU; Positions 13099601 13091922. XX CC Positions [2021-2443] - Reverse transcriptase CC Positions [4455-4931] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 77..1351 FT /product="Gypsy4-I_Dya_1p" FT /translation="MGRSWIYRLKKEDFAYVAQRLRISLSGRTEDMRKALA FT EYYAETENDPKLAAIWTELEATYPDRPGPSIKLTNPEGEDLIASLNMEEMF FT REGQRRETSRERRPSPERPRPSPPDYAKVAKQVREWSFRFDGEEKPLEFLE FT QVEWSANTYGLDLNMIPRAMPELMQGKALKWYIANNKHWRTWAEIIDSFQT FT YFLPRGYFTKLADKARQRKQGFREAFKDYMVEMQTLIRPLGYGKKETLELI FT KENCTPDLRIALRSYHVDDLEALMILADEYEELHREREAFAEEHKYSRNKA FT PAATHVTCRRCEDSGDPGTHTREQWAQYTPVQARRQNTTLWRPPVTTQRHT FT GTTSGGNPGQNTTHIANPQEACRRCGGHGHWARGCREQRLLFCWMCGKVGV FT RSTECCQRAGNDQRSQPQRDGPGSQDAASPN" FT CDS 3057..5375 FT /product="Gypsy4-I_Dya_4p" FT /translation="MGIGKYHWEESSRGFTAFTVPGKGLFQWKVMPFGLHS FT ASATFQRALDQVIGPEMSPHAFAYQDDIIVIGRTLEEHKRNIREVFIRLKK FT ANLKINPEKCQFFKQELLYLGHRVTSQGIGTDPEKVAAIAQLEPPSSVREL FT RQYLGVASWYRRFVPDFARIVRPLNDLLRKGTKWTWSQDHQQAFEEVKARL FT VTDPVSACPDFGRTFILQTDASDYGIGAILTQDTEDGERVISYSSRTLNGA FT EKNYSTTEKECLAIVWAIRKLRPYLEGYHFKVVTDHMALKWLNSIESPSGR FT IARWALELQQYDFEVAYRKGRLNVVADALSRQPLPESVSAEPDMLRRTQDK FT EAESECSWIKEMREKLKAQPEKFSDYVWEGSTLYRQIPHRAGNEDVVSWKL FT CVPRQLRETVLRENHDAPAAGHVGSRKTIARLAARYYWPGMQRDARAYVRK FT CEDCARFKPNQRQAAGKMLTQIPEEPWATVCADFVGPLPRSKHGNSMLLVL FT IDRFSKWTELVPLRTATAETLQKAFRERIVSRFGVPKVVITDNGVQFTSRA FT FKKFLNDMGIKQQFTAPYTPQENPTERVNRTVKTMIAQFAGQNQRNWDERW FT PEIMLAVNSSVSDSTGYSPAFLTQGREPRLPNALYDRETLGTGRQPESPEG FT NAEKMREVFEIVRRNLERVSQDQARHYNLRRRQCKPSVGEVVWAKEHHLSK FT AAEGFAAKLAPRFDGPYQIVDFISPVICKIRHRMNKKERTVHVSDLKQQDQ FT KEKGGKNVESDPSEGMQENDNT" FT CDS 1580..3046 FT /product="Gypsy4-I_Dya_2p" FT /translation="MSLLILSGIVDALVLGWNFLTQVGAEIKCAGHEIRIP FT ARGRHNGWLEEKLSVALVQKDGEEDHVNKFLETELAEFQAMTGTSSIAEHM FT ITMKDDKPIKQRYYPKNPKIQGEINAKVDELLKMGMIEHSRSPYSSPIVTV FT KMKTGKWRLCVDFRQINANSIRDAYPIPRINYILDQIREARFISSLDLKDG FT YWQIPLEESSRGFTAFTVPGKGLFQWKVMPFGLHSASATFQRVLDQVIGPE FT MSPHAFAYQDDIIVIGRTLEEHKRNLREVFSRLKKANLKINPEKCQFFKQE FT LLYLGHRVTSQGIGTDPEKVAAIAQLEPPSSVRELRQYLGVASWYRRFVPD FT FARIVRPLNDLLRKDTKWTWSQDHQQAFEEVKARLVTDPVLACPDFGRTFI FT LQTDASDYGIGAILTQDTEDGERVISYSSRTLNGAEKNYSTTEKECLAIVW FT AIRKLRPYLEGYHFKVVTDHMALKWLNSIESPSGRIARWALELQ" XX SQ Sequence 7680 BP; 2366 A; 1692 C; 2123 G; 1479 T; 20 other; gacaaataaa atcagttcgt tacatctggc gcccagataa cgtgattatc ccggcagtaa 60 acacaaataa agcagtatgg gtagaagttg gatctaccgt ctcaaaaagg aagatttcgc 120 gtatgtcgca cagaggttgc gaatatcgct gtccggaagg acagaggaca tgcggaaagc 180 cctagcggag tactacgcag agacagagaa cgacccaaag ctcgcagcaa tctggacaga 240 attggaggca acatatccgg acagaccagg tccaagcata aagcttacga accccgaggg 300 tgaagactta atcgcaagtc tgaacatgga ggagatgttt cgagaagggc aacggagaga 360 gacaagcagg gaaaggaggc caagtccaga aagacccagg cccagcccgc cagactacgc 420 gaaagtagcg aagcaagtaa gagagtggtc ttttcgcttc gacggagaag aaaaacccct 480 cgaattcctg gagcaggtgg aatggtccgc aaacacatac ggtttggacc tgaacatgat 540 tccccgagcc atgccggaat taatgcaggg aaaggctctg aagtggtaca tagccaacaa 600 caagcactgg aggacttggg ccgagatcat agacagtttc cagacgtatt tcctgcccag 660 gggatatttc acgaagcttg cggacaaggc aagacaaagg aagcaagggt tcagggaggc 720 attcaaggac tacatggtcg agatgcagac cctgataaga ccccttgggt atgggaagaa 780 agaaacgctg gagctcatca aggagaactg cacgccagac ctccgaatag cgttgagatc 840 ctaccacgtg gacgacctag aggctctcat gatcctggca gatgaatacg aggaactgca 900 cagggaacgg gaggcatttg cggaagagca caaatacagc cgcaacaaag ccccggccgc 960 aacacacgtc acgtgcagaa ggtgtgagga ctcaggagat ccaggcacgc atacccggga 1020 acagtgggca caatataccc cggtccaggc aaggcgacaa aacacgaccc tatggaggcc 1080 gccagtcacc acacagagac acactggaac aacgtcaggg ggcaacccgg gtcagaacac 1140 aacccatatt gccaatccac aggaagcgtg tcggaggtgc ggcggacacg gccattgggc 1200 aagaggctgc cgagaacaga ggctgctgtt ctgctggatg tgcgggaaag taggagtccg 1260 aagcacagaa tgctgccaga gagcgggaaa tgaccagcga tcccagccgc agagagacgg 1320 gccggggtcg caagatgccg cctctccaaa ctaaccggaa aactaatcga ggaggagcag 1380 cagttgtccg cagcggtaac aatcgggaca cgtacaaggc cacgatcgac accggggcaa 1440 cagcaagctt cataagtgaa gaggtgccca aggagcgatt acaaggatac gacggcaggt 1500 taggttggca gatggaaggt gtggcgaaat caacgcaaag ctcgaaatag gagtggagtt 1560 cggcaacagg cgactcgtca tgagcctgct aatattatca gggatagtgg atgcgttggt 1620 gctggggtgg aatttcctca cgcaagtcgg ggccgagatc aagtgcgccg gtcatgaaat 1680 ccgaataccg gccagaggca gacacaacgg gtggctcgag gagaagttgt cggtcgcatt 1740 agtccaaaag gacggcgaag aggaccacgt gaacaaattc ctggaaacag agctcgcgga 1800 gtttcaagcc atgaccggaa cgtcgagtat agccgagcac atgataacca tgaaggacga 1860 taagccaatc aagcagagat actaccccaa gaatccaaag attcaagggg agatcaacgc 1920 gaaggtggac gaactgctga agatggggat gatagagcat tcaaggagcc cgtacagctc 1980 acccatagtc acggtgaaga tgaaaacggg aaaatggaga ttatgcgtgg acttcaggca 2040 gataaacgcg aactccataa gggatgccta cccgatacca cggataaact acatcctcga 2100 tcaaataagg gaggcgcgat ttataagcag cctggaccta aaggatgggt attggcaaat 2160 accactggaa gagtcaagcc ggggcttcac ggccttcacg gtcccaggaa aagggctgtt 2220 ccaatggaag gtgatgccat ttgggttaca ctccgcgtcg gccacatttc aacgggtcct 2280 ggatcaggtg attggaccgg aaatgtcgcc gcatgcattt gcgtatcagg acgacatcat 2340 cgtgattggc cgcaccctag aggaacacaa gagaaacctc agggaggtat ttagtcgcct 2400 gaagaaagcg aacctgaaaa taaacccgga aaaatgccag ttcttcaaac aagaacttct 2460 gtacctagga caccgcgtca ccagccaagg gataggcacg gacccagaga aggtagcagc 2520 gatagcccag ctagaaccac cgtcatcggt ccgagaactg aggcaatacc ttggagtcgc 2580 atcgtggtac cgacgtttcg tgcctgactt cgcgcgcatc gtaaggccac tcaacgacct 2640 cctccgcaag gataccaagt ggacatggtc acaggaccac cagcaagcct tcgaggaggt 2700 aaaagccaga ttggttaccg acccagtctt ggcgtgcccc gacttcggga gaacgttcat 2760 cctgcaaacg gacgccagtg actacgggat tggggcaatt ctcacgcagg atacagaaga 2820 tggcgaaagg gtcatttcat actctagccg gaccctaaac ggagccgaga aaaactactc 2880 gacaacggag aaggaatgct tggcaattgt gtgggctatt cggaagctaa ggccgtacct 2940 tgagggatac cactttaagg tggtgacgga tcatatggcg ctgaagtggc tgaacagcat 3000 agaaagccct tcgggaagaa ttgccagatg ggcgttggag ctgcagnnnn nnnnnnatgg 3060 gtattggcaa ataccactgg gaagagtcaa gccggggctt cacggccttc acggtcccag 3120 ggaaagggct gttccaatgg aaggtgatgc catttgggtt acactccgcg tcggccacat 3180 ttcaacgggc cctggatcag gtgattggac cggaaatgtc gccgcatgca tttgcgtatc 3240 aggacgacat catcgtgatt ggccgcaccc tagaggaaca caagagaaac atcagggagg 3300 tattcattcg cctgaagaaa gcgaacctga aaataaaccc ggaaaaatgc caattcttca 3360 aacaagaact tctgtaccta ggacaccgcg ttacaagcca agggataggc acggacccag 3420 agaaggtagc agcgatagcc cagctagaac caccgtcatc ggtccgagaa ctgaggcaat 3480 accttggagt cgcatcgtgg taccgacgtt tcgtgcctga cttcgcgcgc attgtaaggc 3540 cactcaacga cctcctccgc aagggtacca agtggacatg gtcacaggac caccagcaag 3600 ccttcgagga ggtaaaagcc agattggtta ccgacccagt ctcggcgtgc cccgacttcg 3660 ggagaacgtt catcctgcaa acggacgcca gtgactacgg gattggggca atcctcacgc 3720 aggacacaga agatggcgaa agggtcattt catactccag ccggacccta aacggagccg 3780 agaagaacta ctcgacaacg gaaaaggagt gcttggcaat tgtgtgggct attcggaagc 3840 tgaggccgta cctggaggga taccacttta aggtggtgac agatcatatg gcgctgaagt 3900 ggctgaacag catagaaagc ccttcgggaa gaattgccag atgggcgttg gagctgcagc 3960 agtatgactt cgaggtagcg tacaggaaag gccggttgaa cgtggtagca gacgcattgt 4020 cgaggcagcc actgccagag tcggtctcag cagaacccga tatgctgaga agaacgcagg 4080 acaaggaggc cgaatcggag tgcagctgga tcaaggaaat gcgagaaaaa ctaaaagcac 4140 agcctgaaaa gttctcagac tacgtgtggg agggtagcac cctttacagg cagattccgc 4200 atagagcagg gaacgaagat gtagtgagct ggaagctatg tgttccgcgg cagctgagag 4260 aaacggtctt gcgtgaaaac cacgacgccc cagcagccgg tcatgtaggc agtcggaaga 4320 cgattgcacg gcttgcagcc cggtactact ggccaggaat gcagagagac gcaagggcct 4380 acgttcggaa gtgcgaggac tgcgccaggt tcaaaccgaa tcagcgacag gcggcgggaa 4440 agatgctgac acagattccg gaggagccct gggcgacagt gtgcgccgat ttcgtcggac 4500 cactaccgag gtcgaaacac ggaaacagta tgttgttggt actgatcgac aggttttcga 4560 agtggacgga gttggtgcct ttgcgaacag ccacggcgga aacgttgcag aaagcgttca 4620 gggaacgcat agtctcacga tttggagtac cgaaggtggt gatcacggac aacggcgtgc 4680 agttcacgag ccgggctttt aagaagttcc tcaacgacat ggggataaag cagcagttta 4740 cggcaccgta taccccacag gagaatccga cagagagagt aaaccgcacg gtgaagacaa 4800 tgatagcgca gtttgccgga cagaaccaga ggaattggga tgaaaggtgg cccgaaatca 4860 tgctggccgt gaattccagt gtttcggatt ccacaggata ctcaccggca tttttgacac 4920 agggccgcga gccaagacta ccgaacgcac tctatgacag agaaacgcta ggtacgggaa 4980 ggcaaccaga gtcaccggaa ggaaacgctg agaagatgag agaggtcttc gaaattgtgc 5040 gcaggaacct agagagggtg tctcaggacc aagcaaggca ttacaatctg cggagaaggc 5100 agtgtaagcc aagcgttggt gaagtggtgt gggccaagga gcaccatttg tctaaagcgg 5160 ctgaagggtt tgcggccaaa ctagctccga ggttcgatgg accgtaccaa attgtggatt 5220 tcatatcgcc ggtgatatgt aagattcgcc accggatgaa caaaaaggaa cgcacggtac 5280 atgtcagcga cctaaaacag caggatcaga aggagaaggg tggcaagaac gtggagtcgg 5340 atccatcgga aggcatgcag gagaacgaca acacctgaga acggactggg ttcggcagcc 5400 gttctagcgt caggccgagg cgagagtcaa cggatgcaga ctgcgagtct gactgaatct 5460 aaaatcacga tctgactaag tcagaggcag tttgccagta ataacgatat tgcatttaat 5520 ttcaggtaca ccgataacaa taggtgaaaa tgtaatcaag acatttctct gcgagactat 5580 tgacaagtct cttttacgta tcgagtggga gaagtggcta agagccttcg aaatttatct 5640 catagctgaa gaaattgaca cacctattcg gaaaagaagt aaactacttc atgtaggaga 5700 accacaactt cagacggtag cgttcacatt acgcgatgcg gtgttagaaa atgccacaga 5760 taataatata gatatataca aggtccttgt tgaaaaattg aatgcataca tctcgccaaa 5820 gcagaactcc agttttgaaa ggcatctgtt taggaatctc tgtccatcag aaggagagac 5880 ttttgctaaa tttgtattaa ggctacgtca ccaaattcag aagtgtaatt tcgggctact 5940 aaagcagaag tcgaggagat ttgtctaaag gacaaaataa ttgacagctg ggccggagta 6000 gatcttaaaa agaaattgtt gggtaaaaga atacaatttg tctcaaatgt tagatgcctg 6060 tcaagtcgat gaggaaatta ataaacaatc ggaaatgatg cgatcaaacc cagaattggg 6120 gacaagtaaa aacctagaag ccgagagtat aagaaaaatc tcaacaggga tcatgaacat 6180 taataaatcc atcgaatgca gtagttacag ttctttgagc ccagccaaaa atcaaaagtg 6240 caacaagtgc acacgaattg gtcatttcgc cattaaatgc acaacaaact taaaacgcga 6300 gagaccccaa gggcataatg aaatctttaa acgccaacgc accaaagtac gtcgggtaga 6360 gggagaacgc gatgtgaaat ctgaagaaag ttgttttaaa attcagagtg atgaggagga 6420 tgaatttatt aaatgtcgaa ttggtgggcg cgaagttttc ctagtaattg actcgggctc 6480 caaattcaat ctgctaagcc cgaaagactg gtcatatctc cagacaggaa agacagtttt 6540 attcaacgtt cgaccaaatt caaataataa atttagagga tatgcttcac acgaattgct 6600 tcatgtaatt tgtgtgtttg aagcccctac atccattggc ttgaaccctg aggtgatggc 6660 ttcgttttta aacggaacac aatcactgct ggggaaagac acggcactgg aacttaatgt 6720 tttacgcctc ggactggaag ttaagaaaat cgattcagta actccgtttc caaaatggat 6780 ggggccgccg gtaaaattat caattgatac aaatatcgat tttcagcaac caattagaag 6840 gataccaatt gcacttgagg ataaagtaat tgctaaactg gaagaggatg ttgcccttga 6900 tataatcgaa ccagtaatag ggcattctcc ctggatataa cccatggtca tagcgtttaa 6960 agaaaatggc gacttaagga tatgcataga cttgaagctt gccaataagg caatcttaag 7020 gaaaaattat ccgctgcctg tatttgaatc gtttatgacg aaactaaggg gcgcacaata 7080 cttcacccgc ttagacctaa aaagcgcata tcatcaattg gcgctagacg aatccagccg 7140 atacataaca actttcatta cacccctggg tttgttcagg tacaaaagat taatgtttgg 7200 agttaattct ggcgctgaaa tttttcaaag aagactggag gagctgttag cacaatgcaa 7260 aaacgttatg aattatattg acgacatcat tatttttgaa acaaattaaa aagatcacga 7320 ttacgccgtt cagaagatca gagatattct aaaagtaaat aacgtatttt tgaatgagga 7380 aaaatgtatt tggaaaacac aacaagttaa atttctgaga cacatactat cgaaaaaagg 7440 catcgaagta gactctgggc tggagaaaga gagttgggag actctagggg aagagagttt 7500 ggagactcgg gaaatggaga gagagaggat ggagactccg gacttgctga aaagagagtt 7560 tggagagtca gcaggcttaa aggcaattaa cgggacaaat cgaactttgg accggccaac 7620 ggtaaaatct tggactttgg atccggagan nnnnnnnnng cgcggtgcgt cggaaggaac 7680 // ID Sola1-1_AP repbase; DNA; INV; 4723 BP. XX AC AC202225.3; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-1_AP. XX NM Sola1-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4723 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1537..3282 FT /product="Sola1-1_AP_1p" FT /translation="MTPKEKVQRKRQRKVSDWKDVRAKAELNKGIEHINRS FT GKKKNAVEMGPPCKCKLRCFDKVSEDLRKKIFIEYWSLGDHSRQWDFIARY FT VTTVEKKVTTLSTESRRSYSRKYSLPINNDKIKVCKVMFLKTLSISERVVS FT TVSQKLMISPVIPLDQRGTHHTRPNRIQREVLDTIREHISMFPSVDSHYRR FT ADSQKQYLESDLSISKMHRLYLEWVKDKTVCVKAKSATLRQYTDVFNTFNL FT SFYKPKKDLCDKCEQFKLANEEEKLILQSEHENHLKNKDIARDKKNYDKLR FT AMNNHELCVAVFDLEKVLTTPQGEAGSFYYKRKFAVYNFTVYDIGNKQGYC FT YMWDESEGKRGSNEIGTCLLKFIESMKEKGYKEFSFYSDNCGGQNRNRFIY FT SMWEYASFTFKVKITHTFLERGHTQSEGDSMHSCVEHAKKGKSIYVPAQWV FT TLVRCSKVKGNPFRVIEVSYDEFLDFKPLVEDKQFNFKTSTNGDIIKWNSV FT KEVFVSFESPFDLCLKYDLNSTDFIHVNILKSKQRGRYQPHTKPSKAYKDK FT LPIEKSKHNDLLSLCSMGLIPSTYHSFYKDLLSK*" XX SQ Sequence 4723 BP; 1713 A; 611 C; 682 G; 1717 T; 0 other; cgcctaccgc aacgaaaaag gtcgtatctc gatttttgtc atttttgagt tacgaccata 60 ttcgtatagg aaataatttt ctctatcgcc cacgaaaacg gtcgtatctg tagctatcca 120 aggaggttag atattactta aattaaattt aattgatttt tgtatttccc gcgaaaacaa 180 acgtatgttt atatacgacg ctttgattac aaatatttcc cgcgaaaaca gacgtatgtt 240 cagataagac gctttgataa caagacgtat agtgttagta atcaaaatat aattttattc 300 aattatcaat ttaatatttt aataatatta taatgtaaaa taatatagtt aattaattgt 360 ttaactcctt tcggctttct attgtacaat attatattat tataattgaa gtagttatcg 420 caatattatc gtcattataa ttattctatt ttctatttcg aaagatgaca cacgaataca 480 aattgtctac agaaatgaaa ggtgtaagta caaattttaa ttctataata tcagtgtaaa 540 ctgtataatt taacaaaatt aaaacaatgc aatttgtgat tcgaaaatag actacaatat 600 tataatatta ttattattat tattgtatta tttactatgt atttttttga acaggaaaat 660 aataataata ttggtttaat tgattccgac tgcgacagtg tattagatac atattattat 720 ggatgtctaa gtagtgatga tgatgttatt caaccaattc ctgttaataa tgatgtatca 780 gcagcaatta tcaatcatca ggcaatatat tttaatcact taatgctaaa ttaatatttt 840 attaaaataa agtataggta ataggagtac ctacttttgt ttaaaattta atatacattt 900 tttaattaca ggatttgtgg tgtattgatg atagtaatag tgaacactat gatagtgata 960 atgacccaga gtacacacca cagttggaca atttcaattt agttgatgaa gctcttgtta 1020 atgatgacgt gcctgtcgct gataataatt atgatgctct aaatgatgta aatagtctta 1080 ttagtattta tggttgctat ctagggttgt atctagggtt caatttttat agctaaggat 1140 tgaaaattta aaacaaggtt ccataaatac taaataggtt ataataatat ataaattact 1200 ttaggtattc acaataatat caaatatact tattaatttt atttataact tgtaactagt 1260 aaataataaa taaaatatta aaaacttgta ttttttttaa atttttggtt aaaaaaaatc 1320 ccctaaatga aaattatact caatttttat tagaatatta ctcacttcat tgcaaaatat 1380 aatactcact attctcactt tttttaagga aaaaatgagt agcaaccaat aattataata 1440 taatattatg aaattgtaca cacacatttt attttatgct aggaccttgc aaatgttgga 1500 aataataatg atattacaac aaaaaaaaag cgtgttatga cacctaaaga aaaagttcag 1560 agaaaacgtc aaagaaaggt atcagattgg aaggatgtta gagcaaaggc tgaattaaat 1620 aaaggtattg aacatatcaa tcgatctggt aaaaaaaaga acgcagtaga aatgggtcca 1680 ccatgtaaat gtaaattaag atgttttgat aaagttagtg aagatttacg taagaaaata 1740 ttcattgaat actggagcct aggggatcat agtcgtcagt gggactttat agccaggtat 1800 gtaacaacag ttgaaaaaaa agtaacaaca ttatcaactg agtcacgcag atcatattct 1860 agaaagtatt ctttacctat taataatgac aagattaaag tttgcaaggt aatgttttta 1920 aaaacacttt ctatatctga aagagtagta agtacagttt ctcaaaaact aatgatttct 1980 cctgtaatac cacttgacca gagaggtaca catcatactc gacctaatcg aattcagaga 2040 gaagtacttg atacaatcag agaacatatt tctatgtttc cttcagttga ctctcattat 2100 aggagagcag attcccaaaa acaatatttg gaatcagatt tatctatttc aaaaatgcat 2160 aggctttatc ttgaatgggt gaaagataaa acggtttgtg ttaaagctaa aagtgctaca 2220 ctaagacaat acacagatgt tttcaatact tttaacctca gtttttacaa acctaaaaaa 2280 gatctgtgtg acaagtgtga gcagtttaaa ctagcgaatg aagaagaaaa gttaatacta 2340 caatctgagc atgaaaatca tttaaaaaac aaagacatag ccagagacaa aaaaaattat 2400 gataaactac gagcgatgaa taatcatgaa ttgtgtgtgg ctgtttttga cctagaaaaa 2460 gtgttaacaa ctccacaagg tgaggctggt agtttttact ataaacgaaa atttgctgtt 2520 tacaatttca ccgtttatga cattggcaat aaacaaggat attgttacat gtgggacgag 2580 tccgaaggca aaagaggttc caatgagatt ggaacttgtt tgttaaaatt tattgaatca 2640 atgaaggaga aggggtataa agaattttct ttttattcgg acaactgcgg tggtcaaaat 2700 cgcaaccgat tcatttactc aatgtgggaa tacgcttcct tcacttttaa agttaaaata 2760 acgcatacat ttctagaacg aggccacacc caaagcgagg gtgatagtat gcactcttgt 2820 gtagaacatg caaaaaaagg aaaatctatt tatgtgcctg cacaatgggt aaccctagtg 2880 agatgctcaa aagttaaggg taatccattc agagtaatag aagtttctta tgatgaattt 2940 ttggatttta agcctctagt agaagataaa caatttaact tcaaaacatc taccaatggt 3000 gatataatta agtggaatag tgtcaaagaa gtttttgttt catttgaaag tccatttgat 3060 ttatgtttaa aatatgattt gaattctaca gattttatac atgttaatat attaaaaagt 3120 aagcaaagag gtaggtatca gcctcatact aagccttcaa aagcatataa ggacaaatta 3180 ccaattgaaa agtcaaaaca taatgattta ttatctctat gttccatggg ccttatccca 3240 tctacttatc attcctttta taaagatctt ctatcaaaat aataattata ttttgatatt 3300 ttttttttta ttatttttaa taataatgca taatttaagc aatatgcata caagttggtt 3360 tattttattt taacatttta acattttgtt tattttggtg ctaatccaat tatcacgatt 3420 aataatcttt taccaaatgt cttatgtttt tactttatta ttttgcaaaa tgcataaaag 3480 ttgttttatt ttatttataa taatatacat acaagttggt ttatattttt tattttaaca 3540 tagaattttt atttcaattt agtaattttg tactgttatt aattttgtaa tgaatataat 3600 attttggact taatccaatt gtcactatta ctagtctttt accaagtttc ttaatatgtt 3660 tttactttat tattttgcaa attgcataaa agttgtttta ttttatttat aataatatac 3720 atacaagttg gtttatattt tttattttaa catagaattt ttatttcaat ttagtaattt 3780 tgtactgtta ttaattttat aatgaatata atattttgga cctaatccaa ttgtcactat 3840 tactagtctt ttaccaagtt tcttaatatt atgtttttac tttattattt tgcaaattgc 3900 ataaaagttg ttttatttta tttaatattg taattttctt tattataata ttttaacata 3960 agactgttat tagattcata attggtacta ataactacta ctagtctttt aacaagtctg 4020 aatgtttttt tacttaaata tttctataat tattacacta attattataa tgtaaaaagt 4080 ttaataaaaa attttggttt aattctataa cttttgattg ttaaatttat ttatcttcta 4140 ttttaaatgg tatattatga ctaatcatta tgcatttcca caatatgagg gtcacttagg 4200 tatttctctt aagggggggg gggggggctg taaaataagt gtttttttaa cacaaaaaaa 4260 aaattggttg gcattgacgt ttcagatttt tttttggggg ggagtgctcc cccacccacc 4320 atactaccac accaaccaca atcttcgcac tacccccccc ccccctaatt gacgcctttg 4380 cattttcaca caaaaaatgt tgtaggtatg tagtagcggt ataataataa taatattatt 4440 attcctataa atggtataca atttcccacg aaaaaagtcg taagtataga tacgacctaa 4500 agaaatattt ttttaaggtc gtatctaaaa aaaaaaaatg aaggtttaat tttttttcaa 4560 aatcgagaac ataataattt agagtatcct aaatgatgaa aatagtgatc ttcagatcta 4620 aataaaaata aaaagcataa taacaaatta aaatcccaat tcattttgct cctcctgaaa 4680 agttgataaa atcgagatac gacctttttc gttgcggtag gcg 4723 // ID Copia-22_CQ-I repbase; DNA; INV; 4791 BP. XX AC AAWU01004224; XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_CQ_; KW Copia-22_CQ-LTR; Copia-22_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4791 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 359-359 (2011). XX DR GenBank; AAWU01004224; Positions 5110 9900. XX CC Positions [2132-2659] - Integrase core CC 'CCGAG' target site duplication CC LTRs are 100% similar to each other. CC Contains a SINE insertion (masked). XX FH Key Location/Qualifiers FT CDS 1532..4774 FT /product="Copia-22_CQ-I_1p" FT /translation="MVDAEQAVRCSVSGASSLWYVDSGASQHMTCNKNFFT FT ELTEDVQIGITLANGNTTHSQGMGRGLLRCVDDEGRERQIQVQNVLYVPEL FT DGSLLSVSTIVNNHFVVSFTARGCKVLDESGAVAAVADLVGGMYVLKTNEK FT SMVSATARHTEFCPHTWHRRFGHRDIDVIDQIKRNDLATHMKVIDCGLQSV FT CECCLECKLSRKSFPKVAENRAKSVLDLVHTDVCGPFQNVTPGGKRYFMTI FT IDDFTRYTIVYLLANKSEVVNKIIEFVRLVENQMGEKPRVIRSDNGGEYAT FT KALEEFYAAEGIKAEYTCPYSPQQNGVAERKNRYLQEMALCMLHDAGLPKL FT YWGEAVMTATFLQNRLPSRSIGCSPYEKLFNKKPAVKDLKVFGSEAFVYIP FT AERRRKFENRARKLIFIGYCTQNKAYRFLDRKTGRIIVSRDAQFLELGAEI FT GEESASDKTSTNDVEVVIGEKVEKVREELQAPEDLDVSDDSFYDPEEDASE FT EAVEESAASGGSAEPAGPSAAGEPAASAQNGADGNRHSQRSNFGVPPQHLN FT DFVVGLAAQCSDEPGTYREAIAGPEKKQWLAAMKEELRSLEENNTWSLVTP FT PAGCEPIGSRWVFKKKEDAEGKVLRFKARLVAQGFSQKFGRDFDQVFAPVA FT MQSTFRVLLSIASQKKMDVYHVDVKTAYLYGTLEEEIFMRQPPGFVVPGQE FT KCVCKLNRSIYGLKQAARVWNSTIKDFLIAIDFVQSSSDACLYMKKLSDRE FT WMYLLIYVDDMVIVCREKNQMLRVEEELKKRFRITSLGPVHLFLGIAVNQD FT ADGVYSINQSSFIRKIAKQYGLDESKKSKIPMDTGYYKFAEKSEVLPNNTE FT YHSLVRSLLYVSTNTRPDIAAGVSILSRKIQSPSQADWTELKRVVRYLVGT FT ADHRLQLGASEADRMSALEGFCDADWASDTSDRKSNSGFVFRFGEATISWA FT SRKQTCVSTSTMEAEYVALAEAVQEVVWLRRLLAELGEKQQTPTTIFEDNK FT SCLDFVALERQNRRSKHIDTKFHYIKDAATSGQIVLRYCATTDMLADIMTK FT PLGAVQVAKFVGMLGLKAGGSGHR" XX SQ Sequence 4791 BP; 1105 A; 1138 C; 1314 G; 934 T; 300 other; ggttattggc ccggaaagtt ttgagtatcg tcgggaagtt ttgtgcgctg cggagttcgc 60 tcccgaggaa tcttcgtctc ccgtgaccgc gtggtcgctg cctgttcctg cgatggcgga 120 ggctgtgaag ttcaacgttg ccaagctcgg caacggcaat tacccgtcct ggaagttccg 180 gatggagatg ctgctcgtcc gcgaggagct gttttacgtc atctgcgatc cgcgggacga 240 tcccgtcacg gaagcgtgga caaaggacga ccggaaggcc cgcgcaaccc taggcctgtg 300 cgtggaggag agtcaatttg ggctcatcaa gggcaaggcc accgcgaagg agatgtggga 360 aagtcttcaa gagtaccacg agaagcactc cgtttcgtcg cgcgtgttgc tgctgaagaa 420 gctgtgcagc aagaatttgc aagaaaatgg cgacatggaa gcacacctgc aggacattga 480 ggttatgttc gaccgcctgc aaggtgccaa tctcaagctg gacaaggagt tgaaggtcgc 540 gctgctgctg cgaacactgc cggactccta caacggtctt gtcgtggccc tggagagccg 600 cccagatgcc gatctgaccc tggactttgt caaatcccgg ctgctcgatg agcaccagaa 660 gcggctggag cgggggaaca ttgccggcgg cagtaagctg ctgaagacgg ttgcatccaa 720 gcccggatag agcctcaatc gcgcagatga cgaggaaatc ttggctgatt tgatgcggcc 780 ggattggcgg ctcctaagaa cttaggagct gcaggttggg acacaaactg gctgtgtcct 840 tcgaagattt taagatttgc tgattcttgg tgagagcttt ccatcattat ttgctaaatg 900 attatattcc cttatattat tatactataa attcaatagc cctcctaccc cttcctcact 960 ttcccattcc ttaatcccat tttgcccaat accacttcct ccctaaaaat ataattatct 1020 gccaaatttx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1080 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1140 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1200 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1260 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1320 xxxxxxxxxg acggttgcat ccaagcccgg agagaagaca tgtttctttt gtaagaaacc 1380 tggccacttc cgtaaaaact gccggaagta ccaggcgagc ctgaagggcg actccggcga 1440 gaccagacac cagtacgcac caagggcgaa gaaggcggcg gaagcatctc cgttgctgtt 1500 ttggacgaag gtgggaaccg catcggcgag gatggtggat gctgagcagg cggtacgctg 1560 ctcggtgagc ggcgcttctt cgctgtggta cgtggacagt ggtgcatccc agcacatgac 1620 ctgcaacaaa aacttcttca ctgagctgac ggaggacgtg cagatcggaa tcaccttggc 1680 gaacggcaac acaactcatt cgcaaggtat gggacgtggc ctgctacgct gcgttgacga 1740 cgaaggacga gaaaggcaga tccaagtaca aaacgttttg tatgtacccg aactcgatgg 1800 tagtctgctc tctgtaagca cgatcgtaaa caatcatttt gttgtgtcct tcaccgcgcg 1860 cggctgcaaa gtgctcgacg aaagtggcgc tgtggctgct gtcgctgacc tggtcggcgg 1920 aatgtacgtg ctcaagacga acgagaaatc gatggtgtcg gccaccgccc ggcacacgga 1980 gttctgtccc cacacctggc atcgtcggtt tggtcaccgc gacatcgacg tcatcgacca 2040 gatcaagagg aacgacctgg caacacacat gaaagtgatc gactgtgggc tgcaaagtgt 2100 gtgtgagtgc tgcctcgagt gcaagttgtc ccgcaaatcg ttccccaaag tggctgaaaa 2160 ccgtgcaaaa agtgtgctgg acctggtgca cacggacgtt tgtggaccgt tccaaaacgt 2220 cacgccggga ggtaaaagat attttatgac gataattgac gactttaccc gctacaccat 2280 tgtttacctc ctagcgaaca aatcggaagt tgtcaacaaa atcatcgagt tcgtccgcct 2340 cgtggagaac cagatggggg aaaagccacg ggtgatacgg tcggacaacg gcggcgaata 2400 cgcgaccaag gcgctggagg agttctacgc ggcagagggg atcaaagccg agtacacctg 2460 tccctactcc ccgcagcaga acggtgtcgc tgagcgcaag aaccgttacc tccaggagat 2520 ggcgctttgt atgctgcacg acgcaggctt accgaagctg tactggggcg aggccgtgat 2580 gaccgcaacg ttcctgcaga accgactccc ttctcgatcg attggctgct caccgtacga 2640 aaagctcttc aacaagaagc ccgcggtgaa ggacctgaaa gtgttcggct cggaagcgtt 2700 tgtttacatt ccggccgaaa ggaggagaaa gttcgagaac agggcgcgga agctgatctt 2760 catcggttac tgcacgcaga acaaggccta ccgtttcctc gaccgtaaga ccggtcggat 2820 catcgtcagc cgggacgcgc agttcctgga gttgggcgct gaaatcggag aagaaagtgc 2880 aagtgacaaa acgtcaacaa atgacgtgga agttgtgatc ggagaaaaag ttgaaaaagt 2940 gcgcgaagag cttcaagctc cggaagacct agacgtctcg gatgatagtt tctacgatcc 3000 ggaggaagac gcaagtgaag aagctgttga ggaatcggcg gcatctggcg gttctgcaga 3060 acccgctgga ccaagtgctg ctggagagcc ggcggcgtcg gcgcagaacg gtgctgatgg 3120 taatcggcat tcgcaacgaa gcaattttgg tgtcccgccg cagcatctca acgatttcgt 3180 ggtcggactg gccgcgcagt gcagcgatga acccggaacc taccgtgaag ccatcgctgg 3240 ccccgagaag aaacagtggt tggccgccat gaaggaagaa ctccgatcgc tcgaagagaa 3300 caacacctgg tcgctcgtga cgcccccagc cggttgcgaa ccgatcggct ctcgatgggt 3360 gttcaagaag aaggaagacg cggaagggaa agtgttgcgc ttcaaagcaa gattagtggc 3420 acaagggttt tcccaaaagt tcggtcgcga cttcgatcaa gtgtttgcgc cagtggcgat 3480 gcaatccaca tttcgtgttt tgctctcgat cgccagccag aagaagatgg acgtgtacca 3540 tgttgatgtg aaaacggcgt atttgtatgg taccctcgaa gaagaaatct ttatgcgaca 3600 accccctggt tttgttgtcc ctggccaaga aaagtgtgtg tgcaagctca atcgaagtat 3660 ttacggcttg aaacaggccg cgcgagtgtg gaacagcacc atcaaggact ttctgattgc 3720 tatcgatttt gtgcaatcgt cgtccgatgc ttgcttgtac atgaagaagt tgagcgatcg 3780 tgagtggatg taccttttga tctatgtcga tgacatggtg attgtgtgcc gcgagaagaa 3840 ccagatgcta cgcgtcgagg aggagctcaa gaaacggttc cggatcacat ccctggggcc 3900 ggtgcatctc tttctcggaa tcgctgtgaa ccaggacgcc gacggcgtgt actcgatcaa 3960 ccaatcgtct ttcatccgta agattgcaaa acagtacggg cttgacgagt cgaagaagtc 4020 gaaaattccg atggacaccg gctactacaa gtttgccgag aagtcggaag tgctgccgaa 4080 caacaccgag taccacagcc tggtcagatc gctgctgtac gtgtccacaa acaccagacc 4140 ggacatcgca gcgggcgtga gcatcctgag ccggaagatc cagagtccgt cacaggcgga 4200 ttggactgag ctgaaaaggg tagtgcgtta cctcgtcgga acagcagacc atcgtctgca 4260 actcggagca tcagaagcag accggatgtc ggcgctggaa ggtttttgcg acgcagattg 4320 ggccagtgac acgtccgatc gaaagtcgaa cagcggattt gtgtttcgat tcggagaagc 4380 gacgatcagc tgggctagcc gaaagcaaac ctgcgtatca acgtcaacga tggaggcgga 4440 atacgtggcc ctggcggaag cagtccaaga ggtcgtgtgg ctacggcgtc tgctggcaga 4500 gctcggcgag aagcagcaaa caccaacaac aatcttcgaa gacaacaaga gctgcctgga 4560 ctttgtcgcg ctggagcggc agaatcgcag atcgaagcat atcgacacaa agttccacta 4620 catcaaggac gctgctactt ctggtcaaat tgtcctgcgc tactgcgcca ccaccgacat 4680 gctggcggac atcatgacca aaccgcttgg agctgttcaa gttgcgaagt tcgtcgggat 4740 gctgggactc aaggctggag gatctggcca ccgctagaca tcgaggagga g 4791 // ID Polinton3_SM repbase; DNA; INV; 2836 BP. XX AC . XX DT 02-MAR-2008 (Rel. 13.03, Created) DT 31-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Polinton-type family of transposable elements. XX KW Polinton; DNA transposon; Transposable Element; Polinton3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2836 RA Jurka J.; RT "Polinton2_SM: Polinton-type element from the planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(3), 374-374 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 496..2301 FT /product="Polinton3_SM_1p" FT /note="DNA_pol_B_2." FT /translation="MVYVGGLRIRFIDSYQFVNDSLAKAVKTLNNLPLTKS FT VFNGTIVDSKGIFPYNFATSLEVLETTYELPPIWEEVTEDEYQKAKKIWKE FT TNCNTLLDYMMVYLKLDVFLLADFFQQFRAKSIAHNRLEPLNFFGIPGMSW FT NSALMTLDEPIELLQDMEIYNFYEGGIRGGLTFVNKHHVKTTEDTELLYID FT INNLYGWALSQSLPCGKFSWIHDNLDSILEDCKNLDLEKXSYGYTVEVDLE FT IPNEIHDLLDDFPIGAEKKSAEKKCVSKVEKLLLTHNPKQNYVVHWRLLKM FT FLLLGVQVVNIHRAIRFKQEPIFKNYVDTNTKLRAESTNTLDKNYYKLLNN FT SLYGKTVENLKKRMNLRLCNNQQKMITYASKPTFRKSIKIADDLIAAILDK FT DSICLDRPSYIGQTVLDLSKLRMYELQYVELAKYRNQFNCSINIVAGDTDS FT FFLEIKNCKLDKLLPAMISDNLLDTSNYDINHPLYSKKLDSVIGKFKDESK FT GIGYKEWIFLRPKCYSLLGNTCTTKAKGVTLKDTDIKHQSYLDCYNGIEIP FT RVSQFRIGTSNHQLYSFKTTKVALTNNDDKRVWVGKNESLAYGHYMIENIN FT QFTE" XX SQ Sequence 2836 BP; 1112 A; 336 C; 432 G; 954 T; 2 other; tttaatttat tgacttattt aattttaatt tttattaatt tatatatttt aataatttat 60 ttatttttat ttaattattt attaattatt tattactatc aatttgttgg tcaaaactgt 120 gtttatgatt ttttgaagaa ggtttcagat ataggaacaa acattgtgct accatattac 180 aaagaagagg gtcacaaact gatttcacta cttactttgg atgaagaaaa ttcttttaat 240 atcagtgaat attgttattt gtgtgagaaa aagatagtag ttaaggttaa agatcatgat 300 cattttacag gaaaatatat tggtcctgca tgtagcaagt gtaatttatc taggaaaatt 360 atagaattat tacctgtagt ttttcataat ttacgggggt atgacttaca tcatatcctt 420 aagtatggat taaatcaatt tcccaattgg aatttatcat gtattccaat tagtagtgaa 480 aagttcttat ctcttatggt atatgttggt ggacttagaa tacggtttat tgattcatac 540 cagtttgtaa atgattcatt ggctaaagct gttaagacat tgaacaatct acccttaaca 600 aaatcggtat tcaatggaac tatagttgac tcaaaaggaa tattcccata taattttgca 660 acttctttag aagttttgga aaccacatat gaattaccac ctatctggga agaagttaca 720 gaagatgaat atcaaaaggc aaagaaaata tggaaagaaa caaactgtaa cactttattg 780 gattacatga tggtttattt gaaattggat gttttcttgc ttgccgactt ttttcaacaa 840 tttcgggcga aatctatagc tcataataga ttagaacctt tgaatttctt tggtattcca 900 ggtatgtcat ggaattcggc attaatgaca ttggatgaac ccattgaatt attacaagac 960 atggaaattt ataatttcta tgaaggggga attcgcggtg gtttgacatt tgttaataag 1020 catcacgtga aaactactga agatacagag ttattrtata tcgacatcaa taacctatat 1080 ggatgggctt taagtcaaag tttaccctgt ggaaaattta gttggattca tgataacctt 1140 gatagtattt tagaagattg caagaatttg gacttagaaa agwtgtcgta tggatatact 1200 gtggaggttg atttggaaat tccgaatgaa attcatgatt tattggatga tttcccaatc 1260 ggtgcagaaa agaaatcagc agaaaagaaa tgtgtttcaa aagtagagaa attactctta 1320 acgcataatc caaagcaaaa ttatgtagtg cattggcgat tgctcaaaat gtttttatta 1380 ttaggtgtgc aagttgtcaa tattcatcgg gctatcagat ttaaacaaga acctattttc 1440 aaaaactatg tcgatactaa caccaaactc agagctgaat caacaaacac acttgataaa 1500 aattattata agttgctcaa caatagtcta tatggtaaga ctgtagaaaa cttaaagaaa 1560 agaatgaatt taagattgtg taataatcag caaaaaatga ttacatatgc atcgaagcca 1620 actttccgaa aaagcatcaa gattgcggat gatttaattg cagccatact ggataaagat 1680 tcaatatgtt tagatagacc aagttacatt ggtcaaacag ttttggattt atcaaaactt 1740 cgaatgtatg agcttcaata tgtagaatta gcaaaatatc gaaatcagtt caattgcagt 1800 atcaacattg ttgctggaga tacagattcc ttcttcttag aaattaaaaa ttgcaaatta 1860 gataagcttt tacctgcaat gatttcagat aatctactag atacgtctaa ttatgatata 1920 aatcacccat tgtatagtaa aaagttagac tctgtcattg gaaaatttaa agacgagagt 1980 aagggtatag gatacaaaga gtggatattt ctcagaccaa aatgttacag tttattaggt 2040 aatacttgta caacaaaagc aaaaggggta acattgaaag ataccgacat aaaacaccag 2100 tcttatctcg attgttacaa tggaatagaa atacctaggg tgtctcaatt tcgaataggt 2160 accagcaacc atcagttgta tagtttcaaa acaactaaag tcgcacttac aaataatgac 2220 gataaaaggg tttgggttgg aaagaatgaa agtctggcat atggtcatta tatgatagag 2280 aatatcaatc agtttacaga ataatattca aactgtgcat ggtctgctct tcaaatcgat 2340 catcaagcta ccaatcaaac ggatcatcaa actgttcatc aaattgctca tatttatcaa 2400 acttttcatt gacatatttg tgattagcat tgttgtatct atttaataaa tgtatattgg 2460 tttctgatgt taattactta ttgttatatt cgtaataaaa attttgtata taaactgtat 2520 gtgtatgtta aaaataaata aataaaaata ataataataa taataataat aataataata 2580 ataataataa taataataat aataataata ataataataa taataataat aataattaca 2640 ataaaaatat taataataat aaatacaaat aaataaaatt aaataaaaat aaaaaaataa 2700 aaaaataatt aaattaaaaa taagttaatt aaataaataa ttatataaaa ttaataaata 2760 atcaaataaa ataaataaat tattaaaata tatataaatt aataaaaata aaaattaatt 2820 aagtcaataa attaaa 2836 // ID Copia-29_CQ-I repbase; DNA; INV; 4421 BP. XX AC AAWU01004852; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-29_CQ_; KW Copia-29_CQ-LTR; Copia-29_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4421 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 367-367 (2011). XX DR GenBank; AAWU01004852; Positions 83162 87582. XX CC Positions [1640-2143] - Integrase core CC 'TTTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 170..3253 FT /product="Copia-29_CQ-I_2p" FT /translation="MEYEQNPRVYLYNGKNWPNWSLRMEAYLRELKIFHCV FT QRTLEQEPFFPAVEADDAAEQARKEALRNVRREEDAKCVSVLLHKIADSQL FT ERIRGKRSPKAIWDTLQKAFDKKGVSGVFLLLNQFTTLRYDEEQDMEEHIL FT EFERVVRELEAANIKFVEPVQVFFLLQSMPKSYAQLITVLKTLPVEQCSME FT FIKSRLLNEDVERHHLAVKVEPKAEPSSAFAGKGGKFTFKCHACGKPGHKK FT ADCPEKEKAEQAAGSSHRRKKGKAHAAETKDGANDDVSFVSVAQEANSVET FT HEKFRWVLDSGASEHMVGEKGLLVNVRRMESPTVINVAKAGVTLEGRYVGD FT VKMKADIGTRTLNCTVRDVLYVPGLLTNLFSVKKVAERGMEVIFGQKGARI FT TKDGEVKCTAKQNGRLYELDVKVVRGSAMVGESESKLSLWHRRYGHIGNTG FT LMKLIRNEMVEGLDVGKNAKPEKEICECCMKGKQARLPFEEATKPRSSRPL FT ELIHSDVCGPFNPASWNGKKMFVSFIDDYTHFAAVYILEAKSEVLDAFRKY FT AAMATAHFGTKIARLRSDNGREYINTEFKKYCNDAGIVMEPTVPYTPEQNG FT KSERFNRTVMERARAMMEDAEIDWNMWSEAVLAAVHLINRSPTAALRDAKT FT PYEMWFGRKPNVSKLRVFGSKAYTHIPKEKRSKLSAKSHVCVLVGYGMNGY FT RLWDPVKKQVIVARDVKVEEKVMHKQVYTDDHMVSDSIVRQQEHEELDSDD FT DPDPEQEEEEVVEVLVDSEAESESDDEVFDQEVPEEEDQVPEENAVPEQEE FT AAAERRSGRARKRPARFSDYELEQKRPARHDHDVCVAFALNAEAYVDEIPD FT TIEALQKRDDWPLWKRAIDDELRSLEKNRTWDLVEAPAGRRVVSCKWVFRI FT KKKEDGTAARYKARLVAKGCSQRAGYDYQETYAPVVRMSTVRTLLAVAVQQ FT NLHLHQMDVRTAFLNGHLNETVYMRQPPGFERGSKVCKLNKSLYGLKQAPR FT SWNERFNQFILKLGFKRSPYDC" FT CDS 3142..4422 FT /product="Copia-29_CQ-I_1p" FT /translation="MQAKQIPVRLEAGAKELEREVQPVHPQAGVQAVAVRL FT LNETVYMRQPPGFERGSKVCKLNKSLYGLKQAPRSWNERFNQFILKLGFKR FT SPYDCCLYTKVTKDVKLYLVLYVDDIVLASNSLEELKNVKVQLAKEFEMED FT LEELRNFLGLKIERDVEKGTLTINQSQYVSGLLKRFGMEQCKPLGTPLEAN FT VKLARKKDEEAVTEHPYRELIGCLTYLMLSSRPDISIAVNFLSRFQSAATD FT THWIHLKRVLRYLQGTKEFSLEYKRSETADPLVGFADADWGSDIDDRRSTS FT GFVFKVFGNTVSWTSRKQATVSLSSTEAEYVSLSQAACESIWLMNLLSDFG FT IVLDAPAVIHEDNQSCIHIAEEPRDQKRMKHLDIRYNYIRECIQDELIKVQ FT YIPTKDQLADVFTKGLPAVSYKKHRSTLGLRGG" XX SQ Sequence 4421 BP; 1109 A; 1075 C; 1405 G; 832 T; 0 other; ggttgtgtgg cccagcacat cgcgtggcgc gttgccatag taaccgcgac cggaaggaat 60 tttttttttt tccacgtgcg cagagttcgg gaaggtgacg ccattttttg cgattcgttt 120 tgagttaagt tttttcggga agtaagacgc ggaaagaaac gtgctcgcga tggaatacga 180 acaaaatccg cgagtgtacc tgtacaacgg gaagaattgg ccgaattgga gtcttcggat 240 ggaggcctac ctccgggaac tgaagatttt tcactgtgtt cagcggacgc tggaacagga 300 accgtttttc ccggcggtgg aggcggatga tgctgcggaa caggcgcgga aggaagcgct 360 tcggaacgtc cgaagagaag aggacgcgaa gtgtgtgtcg gtcctgctcc acaagattgc 420 cgactcgcag ctggaaagaa tccgtggaaa gcggtctccg aaggccattt gggacaccct 480 ccagaaggcg ttcgacaaga agggagtttc cggcgtgttt ctgctgttga accagtttac 540 caccttgcgg tacgacgaag agcaggacat ggaggagcac attctggagt tcgagagggt 600 ggtccgggag ctggaagcgg ccaacatcaa gttcgtcgag ccggtgcaag tgtttttcct 660 gctgcaatcg atgccgaagt cctacgccca gctgattacc gtgctgaaga cgttaccggt 720 cgagcaatgc tcgatggaat tcatcaagtc acggcttttg aacgaggacg tggaacggca 780 ccatcttgca gtgaaagtcg agcccaaggc ggagccgagt tcagcgtttg cgggaaaagg 840 cggcaaattt acgttcaagt gccacgcctg cggcaaacct ggacacaaga aggcggactg 900 cccggagaag gagaaagcgg aacaggccgc cggaagctcg catcgccgga agaaggggaa 960 ggcacacgcg gcggaaacga aggacggtgc caacgatgac gtgtcgtttg tgtcagtagc 1020 gcaagaagcc aacagcgtgg agacacacga aaagttccgg tgggtgctgg acagcggtgc 1080 gtccgagcac atggttggcg agaagggtct actggtcaac gtacgacgga tggaatcccc 1140 cacggtgatc aacgtggcca aggctggagt gacgctcgag ggtcgctacg ttggtgacgt 1200 caaaatgaag gccgacatcg gaacacggac gttgaactgc acggttcgcg acgtcctgta 1260 cgtgcctgga ttactgacga acctgttctc cgtgaagaag gttgccgagc gcggcatgga 1320 ggtgattttt ggccagaaag gggcgcgaat cacgaaggac ggagaggtca agtgtacggc 1380 gaaacagaac gggcgtttgt acgagctgga tgtgaaagtc gtgcgtggat cggccatggt 1440 tggtgagtcg gagagcaagt tgtcgctgtg gcatcgccgg tatggacaca tcgggaatac 1500 tggtctgatg aagctgatcc ggaacgagat ggtcgaagga ctggatgttg gaaagaacgc 1560 gaagccagag aaggagattt gcgagtgctg catgaagggg aagcaagcga ggttgccgtt 1620 cgaagaagcg accaagcctc gttcctcgcg tcccttggag ctcatccact ccgacgtgtg 1680 tggcccgttc aacccggcat cctggaacgg caagaagatg ttcgtgagct tcatcgacga 1740 ctacactcac ttcgctgctg tttacatcct tgaggcgaag agtgaggtgt tggatgcgtt 1800 ccggaagtac gcggcgatgg ccactgctca cttcggaacc aaaattgccc gactgagaag 1860 tgacaacggc cgtgagtaca tcaacaccga gttcaagaag tattgcaacg acgctggtat 1920 cgtcatggaa ccgaccgtac cgtacactcc ggaacagaac gggaaatcgg agcggttcaa 1980 ccggacggtg atggagcgtg cacgtgctat gatggaggac gcggaaatcg actggaacat 2040 gtggagtgaa gcggtgctgg ctgcggtgca cttgatcaac aggagtccca cggctgcgct 2100 tcgggacgcg aaaacgccgt acgagatgtg gttcggacgc aagccaaacg tgtcgaaact 2160 gcgtgtcttc ggcagcaagg cgtacaccca cattccgaag gagaagcggt cgaagctgag 2220 cgcgaagagc catgtgtgtg tcctggtcgg ctacgggatg aatggctatc gtctgtggga 2280 tcctgtgaag aagcaagtga tcgtcgccag agacgtaaaa gtggaagaaa aagtgatgca 2340 caagcaagtt tacacggacg atcatatggt ctcagattct atcgtgcgcc aacaagaaca 2400 cgaagaactg gacagtgacg atgatcctga tccggagcaa gaagaagaag aagttgtgga 2460 agtgctagtg gactcggagg ccgaatccga gtcagatgac gaagtctttg accaagaagt 2520 gcccgaagaa gaagaccaag tgccagaaga gaacgcagta ccggagcagg aggaagctgc 2580 ggctgaaaga agaagcggac gtgccaggaa gcgtccggcc cgattctccg actacgagtt 2640 ggagcaaaag cggcctgcga gacacgacca cgacgtctgc gtcgcctttg cgttgaacgc 2700 cgaggcctac gtggacgaga ttcccgacac cattgaagcg ctgcagaaac gcgacgactg 2760 gccgctgtgg aagcgtgcta tcgacgacga gttgcggtcg ctagagaaga accgcacgtg 2820 ggacctcgtc gaggctcctg ctggtcgccg agttgtgtcc tgtaagtggg tgttcagaat 2880 caaaaagaag gaagacggta ccgctgccag gtacaaagca cggctcgtcg cgaaggggtg 2940 ctcgcaacgt gcggggtacg actaccagga aacgtacgcc cctgtcgtcc ggatgagcac 3000 cgttcgcacg ctgctggctg tagccgttca gcaaaatctt cacctgcacc agatggatgt 3060 ccgtacagca tttctgaacg ggcacctgaa cgaaaccgtg tacatgcgac agcctccagg 3120 atttgagagg gggagtaagg tatgcaagct aaacaaatcc ctgtacggct tgaagcaggc 3180 gccaaggagc tggaacgaga ggttcaacca gttcatcctc aagctggggt tcaagcggtc 3240 gccgtacgat tgctgaacga aaccgtgtac atgcgacagc ctccaggatt tgagaggggg 3300 agtaaggtat gcaagctaaa caaatccctg tacggcttga agcaggcgcc aaggagctgg 3360 aacgagaggt tcaaccagtt catcctcaag ctggggttca agcggtcgcc gtacgattgc 3420 tgtctgtaca cgaaggtcac gaaggacgtt aaactgtacc tggtgctcta cgtggacgac 3480 attgtgctgg cgtccaactc tctggaggag ctgaagaacg tgaaggtgca gctggccaaa 3540 gagttcgaga tggaagactt ggaggagctt cggaacttcc tagggctgaa gatcgagcgc 3600 gacgtcgaga aaggtacgtt gaccatcaac cagtctcaat acgtttccgg gttgctaaag 3660 cgatttggga tggagcaatg caagcccctc ggaacgccgc tggaagcgaa cgtgaagctg 3720 gctcgcaaga aggacgagga agcagtcacg gaacatcctt accgtgagct aatcggatgc 3780 ctcacgtatc tgatgctgtc gtcgcggccc gacatcagca tcgcggtgaa cttcctgagc 3840 cgtttccaga gcgccgccac tgacacgcat tggatccact tgaagcgtgt cctgcggtac 3900 ctgcaaggaa cgaaggagtt cagcctggag tacaaacggt ccgagaccgc cgacccgctg 3960 gtgggatttg cggacgcgga ctggggaagc gacatcgacg atcgccgctc aacatctggg 4020 ttcgtgttca aggtgttcgg caacacagtc tcgtggacaa gcaggaagca agccactgtg 4080 tcactctcgt cgaccgaagc cgaatacgtg tccctcagcc aagctgcctg cgaatcgatt 4140 tggctgatga acctgctgag cgatttcggg atcgtgctgg acgcgccggc tgtaattcac 4200 gaggacaacc agtcgtgcat tcacatcgcg gaggagccac gagaccagaa gcggatgaaa 4260 cacctggata tacgctacaa ctacatccgt gagtgtatcc aggacgagct catcaaggtg 4320 cagtacatcc cgacgaagga ccaactcgca gatgtcttca ccaagggcct cccagctgtt 4380 tcttacaaga agcatcgaag cacgttagga ttgagagggg g 4421 // ID Copia-47_AA-LTR repbase; DNA; INV; 268 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-47_AA_; KW Copia-47_AA-I; Copia-47_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-268 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 968-968 (2011). XX DR [2] (Consensus) XX SQ Sequence 268 BP; 55 A; 81 C; 49 G; 83 T; 0 other; tgctacacat ccgcacgccc ctcggtgtgc ccaactacag agtgcaactc catgcgactg 60 cagaatccat ctctgctaca ccggcaacag cagcaaatgt tggcggttat tttccactac 120 tgtttttctc atcgacgcgt ccaaacgtgt tttcatttcg ctattaaata aaagttgttc 180 aaaagtattt ctcgcgagtt tatcttttcg tgcttcctat cccgattccc gtacgctgtt 240 ctctgctgcc gtttgtccac ctgcgcca 268 // ID BEL-192_AA-LTR repbase; DNA; INV; 183 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-192_AA_; KW BEL-192_AA-I; BEL-192_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-183 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 876-876 (2011). XX DR [2] (Consensus) XX SQ Sequence 183 BP; 60 A; 26 C; 39 G; 58 T; 0 other; tgtagggaga ttgaaacgtt cgaaggctga atttgtaatt cgaagactaa attcgtaagt 60 agtaaaacaa acttgtaacg ttgtgaatgc gaaatttaat gaatatattt ttagttttga 120 gtgtgtcgaa aagcagtact tcaaaaggta gttttatcga tatgctgccc cgaactcctc 180 cca 183 // ID Gypsy-32_AA-LTR repbase; DNA; INV; 781 BP. XX AC supercont1.22; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_AA_; KW Gypsy-32_AA-I; Gypsy-32_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-781 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.22; Positions 632456 633236. XX SQ Sequence 781 BP; 226 A; 177 C; 210 G; 168 T; 0 other; tgtaacatgt gcatgtccca tagagtaaat taaaaaaata aaaattatgt gcgttgcgtt 60 tgattaggag ttagaattag aattgttaaa attttccctt aaaaatgccc aatcgacaca 120 ccgtcactgc actaatcccc ataaaatgcg cgctccatcg gcaataatgc ggataatact 180 ggttgcgaat gggacgagag agaggagtcc cgaaaattat accagtggac gaggctcctt 240 ccggagtcaa acagaaaggc acgggctctt ttgccttgag ctgtcgtgga agaaacaagt 300 gctccagtga agatctccga tccagtgaga aattagtgga cgaaaattaa agtgcgggtg 360 ttagaccctc aacgcggtac cctattaagg gagtgaatag cccgtgtccg tcagtgctga 420 ggaagttaga acctcgaagg ggcgtaggtg gtgattagtg ccggtaaaag atattcgatt 480 ccacgcggga ggctcaacct agagggtcga acgtggagac ccaagccgcg tgtgataaaa 540 accgcgttcc gcggaagtga agggcgaaaa acctgtccga aactgcaggt gccccggata 600 gttgtgcgta gcgtgtgaac cgagtgcgaa gtgcctagaa gatcgaccca gcggccgacg 660 gtggtgtggg ccatcctaaa tcccccccag caaatactct catcaggcta ggcgcccctg 720 ggatgagacc aggtaagtcc atgctccctt cttcctatta gatgcaaaaa gcaaacttac 780 a 781 // ID Jockey-N7_CQ repbase; DNA; INV; 1814 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1814 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 592-592 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >92% CC identity. This family encodes a protein similar to Jockey ORF1p CC but does not encode ORF2p. Thus it is a non-autonomous non-LTR CC retrotransposon derived from Jockey, like HeT-A. XX FH Key Location/Qualifiers FT CDS 113..1507 FT /product="Jockey-N7_CQ_1p" FT /translation="MGRNTRGRGSSSAPRGRGRASSCSSIGKTSSSGSKRS FT TNLLKLPSNYVPARSPIRTRGLAALANPDQPGTSGLSTNKPGTSSTAVPTS FT NGFAALSDDNKEDDDDDVSGGDGVTTTPSQAKGTSTAGEQEXKKSNKKMPP FT ITVPGRPAVEIEGALADADCQYLMRINKSSVNIITRDRPRFEKVLKTLKDA FT NIQYYTHDSPENLPVKVVVTGFSILVPPKEFVDVILAKKNIYPREAKVLSH FT KVTEVGDQILWLLYFERGSVKIQDLRKVKSLEGFMVSWRYFSKRPSDAAQC FT HRCQRFGHGSRNCTLAPKCVKCSAAHLTAACTLPKKASLGKDNQAEQNKRN FT VKCANCDGNHTANYRGCTARKTYLEALEKRKKPAPHSSRTSTGQPSTEPRQ FT TNPPGFGRTYASVAATANGPTPDSNSDGDLFTLTEFLSLARDMFTRLNTCR FT NKLEQFFALQELMGKYLCPA" XX SQ Sequence 1814 BP; 477 A; 478 C; 440 G; 418 T; 1 other; cattggggtc gttaccctgt tcgcagcaac atcgcctgtg ttgatgctcc gcgaattttt 60 ttcgatttac ggtgaaagtg cgttgtatta atcgcgagtt cttccccgca cgatgggcag 120 gaacacccgc ggccgtggca gctcgagtgc acctcgagga cgtggccggg ccagttcctg 180 cagcagcatc ggaaagacct cctcgtctgg ctccaaacgg tcaacaaacc tgctgaagct 240 gccatccaac tacgtacctg cgcgtagccc catacgtacg cgtggactag cggcacttgc 300 caacccggat caacccggaa ccagtggatt gtccaccaac aagccgggaa cttcctcgac 360 ggcggttcca acctcaaacg gtttcgctgc gttgtctgat gacaacaaag aagacgacga 420 cgacgacgtc agtggtggcg atggtgttac tactacaccg tcgcaggcta agggcacctc 480 caccgcgggg gagcaagagt saaaaaaatc gaacaaaaaa atgccaccga ttacagtgcc 540 tggtcgccca gctgttgaaa tcgagggagc attggcggat gccgactgtc aatacctgat 600 gcgtattaac aagtcgtccg taaacatcat cacacgagat cgcccgcgtt tcgagaaagt 660 gctgaaaacg ctgaaagatg caaacattca gtactacacg catgattcgc cggaaaacct 720 tccggtaaag gtagtggtga ctgggttctc catcctagtg ccacccaagg aattcgtgga 780 tgtaattctc gcgaaaaaaa atatttatcc gcgagaagct aaagtgctct cgcataaggt 840 aaccgaggtc ggagaccaga ttctctggct gctgtacttc gaacgcggtt ccgttaagat 900 ccaggacctg cgcaaagtta aatcgctgga aggattcatg gtgagctgga gatacttcag 960 taagcggcct tccgatgctg cccaatgtca ccgatgccaa cgttttggac acggttcccg 1020 gaactgcact ttggcgccga agtgtgtcaa gtgcagcgca gcccacctca cggctgcatg 1080 cacgctgccc aagaaagcat ccctgggaaa ggacaaccaa gccgagcaga acaagcggaa 1140 cgtgaagtgt gctaactgtg acggtaacca cactgccaat taccgcgggt gcaccgctag 1200 gaagacctac ctcgaggcgc tggagaagag gaaaaagcca gcaccacatt cctcaaggac 1260 ttcgacaggt caaccctcga ccgaaccccg tcaaacgaat cctcctggat tcgggcgtac 1320 ttacgccagc gtggcggcta ccgcaaacgg tccaaccccg gacagcaaca gtgatggtga 1380 tttattcacc ctcaccgaat tcctctccct tgcgagggac atgtttacgc gactcaacac 1440 ttgccgaaac aagctggagc agttcttcgc gctgcaagag ctgatgggga agtacctatg 1500 ccctgcatag gtgcaagctc acacactgtg gaacagtctt cgccttcagt aacgaactga 1560 ttttttttct gtgatatata tttaagcttt cctatcctcc ttctctttcc atagtaatag 1620 tagcagttat tttagaaagt tttttctgct ctcgcttaac cgactgaaac tgaatttatt 1680 tcatccaatt gattatattt gaatttattt gtagttgtaa ggatctccaa aactctgcat 1740 tctgttatta gttaagcaaa ttgtattctg attagtacaa caaataaaca tgaattgaat 1800 tgaattgaat tgaa 1814 // ID BEL-606_AA-I repbase; DNA; INV; 6154 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-606_AA_; KW BEL-606_AA-LTR; Pao_Bel_Ele199; BEL-606_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6154 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5181-5738] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 48..6152 FT /product="BEL-606_AA-I_1p" FT /translation="MPETDLFDCQMCELANNIDDMVQCEGCTKWSHYGCVG FT FDDGKKEENWRCAGCIAKSSSNSTGGDSNVQATDGQQKTRGSTGGAESISD FT LAQLNLKLLEERKAVLLREIELQHSTQLEQRKLQLEKEAWQAKYDILNAKC FT ESTSSTVGSGGLGNWISRMNQVAVSQYQQTSVSASTVTTSLNPRTRQQHTA FT AGGGNATSVVTTSSRPGCSLYGSESTSFLPQITSTIALSANQPSSTYAVGQ FT ATSYMGDFQPGVGANRPGSSTPVTSVNWVNPGANPTSSAQCLPPYVSSMEQ FT LGGHPMPSGYVSHPYQVTSSLNNFAPVGQVAFSQYAHSSVGSGNPYLDSIN FT TRYSYAPPPLPTSLPQTNPHVSQAPFSAQTIPDAQIDNSSPTPRQLAARHV FT MPKELPSFSGNSEEWPMFFSAFNTSTEACGYSNVENLGRLQRCLKGGALEA FT VRSRLLLPSSVPQVMQTLQLLYGRPEQIIYALLQKVRDVPAPKADNLCTVV FT TFGMAVQNFCEHLEAAGQVLHLSNPVLLQELVDKLPANLKLDWVTFKRQFL FT VTDLRVFSRFMTNLVSAAAEVTLTLDQKGPKPKKEEKQKGFVNAHATPSSI FT VVTEKPKTEFVKAASPITCLVCGNPDHRVKECPIFKKMDSDDRWKVVQTHY FT LCRVCLGKHGRKPCRSSTRCDVQGCQSRHHRLLHNETASSSPATKATTQGG FT EMTPCKNPTEGVNAHHTQSSTLFRMLPVRLFSKGRSVETLAFIDEGSSVTL FT LERSIADALQAKQTEMRLCLTWTSNISREEEGSCQVEVEISSVGGGKRYPL FT KDVRTVESLALPAQTLHYKELADRFEHLRKLPVSDFESKAPGILIGAKNTH FT LTATQQLREGRVGEPIAAKTRLGWAIYGSMPNGSNIASCNLHICGCDSDNT FT LHELVKQYFTVENVGVSVDRSPDSDEDKRARALLEKTTRRIEGGFETGLLW FT RHDIVEFPDNYAMAVRRLRCLERRFNTDPTLFESVQRQIMQYQQKGYIHEA FT TNEELDAVDPRRLWFLPVGIVRNPKKPNKLRIVWDAAATVNGVSLNSMLLK FT GPDLVQPLPNVLCGFRERKIAVVGDIMEMFHQLKIRKEDRYSQLFLWPNET FT GPDPKIFVIDVATFGSTSSPCSAQFVKNLNATEHAEAYPRAAEAIVRRHYV FT DDYLESFDSEEEACQVIEEVKLVHRLGGFTIRNWLSNSTAVLRRVGEAETK FT TIKIVDDSGSQVERVLGMQWRSTEDVFVFSSEVDVESVIPTKRGILRCVMS FT QFDPLGLLSHFLIHGRVIIQDIWRSKAGWDDTINEEILERWRVWTTKFKEL FT EQIQISRAYFPGVTSSDIEDLQLHVFVDGSETAYACVAYFRATVNGKFECA FT LVGGKAKVAPLKALSIPRLELQAAVIGCRLMKTLCSSHSLPISKRMLWTDS FT KTVLSWIHSDHRRYRQFVACRIGEILSKSNPEEWRYVPSKANVADDGTKWA FT RGPNLQADSRWFKGPEFLWKPEESWPRQTEPEGTNVEIRSCNVHQIMESSE FT IVNWDRFSKFERLRRTVAYLYRFANNCRRKAKQQPLQYSHITREEVQEAEN FT AIWRFVQQAEYQEEIIQLQKMKEGHKCRLKRTSPIYKLSPFIDEYGVIRMD FT SRISGAIFIPYDAKFPIILPKGHRVTKLVVDWYHRSYLHANNETVVNEIRQ FT RFHVSCLRVLVRNLSRNCMQCKVLKAKPLSPREAPLPEARLSPYTRPFSFV FT GLDYFGPIQVKVGRSLVKRWVALFTCLTIRAVHLEVVHSLSTEACKMAIRR FT FVVRRGAPLEIRSDNGTNFLGASHELIQQIQEINQHLSAVFTNATTKWVFN FT PPSAPHMGGSWERLVRSIKIAFSALTTTKNPDDETLLTLMIEAEGIVNSRP FT LTFVPLETESQAALTPNHFLLLSSRGVAQPPMTIPERPESLRTNWRMTTNL FT VNQFWHRWVREYLPTIAVRTKWYEDTKDPKVGDLAIIVDPSVRNGWLRGRI FT LSVIIGRDGRCRQVLVKTSSGVLRRPVTKVAILDIKPAERGDVEGNAVPPE FT VLDLHYGSG" XX SQ Sequence 6154 BP; 1694 A; 1507 C; 1604 G; 1346 T; 3 other; caaatctcca aaattgtggt gaamcacgca ggaacgcttg caccaggatg ccggagaccg 60 atctatttga ttgccagatg tgcgagctcg caaataacat cgatgatatg gtgcaatgtg 120 agggatgtac taagtggtcg cattatggat gcgtcggctt tgacgatggg aaaaaggaag 180 aaaactggag gtgtgccggw tgtattgcga aatcatcttc caatagcacc ggcggagatt 240 csaatgttca agctaccgac ggacagcaga agacaagagg ctctacagga ggggcagaat 300 ccattagtga tctcgcacag cttaacctca aactgttaga ggagcgcaag gccgtcctgt 360 tgagggagat cgaactgcag cactcaactc agttagaaca acgtaaactt cagctggaga 420 aggaggcatg gcaagcaaag tacgatattc tcaacgcgaa atgtgaatca accagtagca 480 cggtaggaag cggcggtctc ggaaactgga taagccgtat gaatcaggtc gctgtcagcc 540 agtaccaaca aacatccgtg tcggcgagta cggtgacgac ttcattgaat cccaggacga 600 gacagcagca caccgctgct ggaggaggaa atgcgacatc agtagtaacg acgtcatcgc 660 gaccaggctg ttcgctgtat ggaagcgaat cgacatcctt tcttccgcaa attacttcta 720 cgatcgcctt gtctgcaaat caaccatcaa gtacgtatgc ggtgggacaa gcaaccagtt 780 atatgggaga cttccaacct ggagtaggag cgaatcgtcc gggatcatcg acgccagtga 840 ccagtgtaaa ttgggtcaac ccaggggcga atccaaccag ctccgcccag tgtttgccac 900 catacgtcag ctcgatggaa cagctaggag gtcatccgat gccatcggga tacgtaagcc 960 acccgtacca ggtaaccagt agcttaaata atttcgcacc agtaggccag gtagcttttt 1020 cacaatatgc tcacagtagc gtagggtcag gtaatcctta tttggattca ataaatacac 1080 gctattccta cgctcctcct ccccttccta caagccttcc tcaaaccaac ccacacgtct 1140 cacaagcgcc attcagcgcg caaacgattc ccgatgcaca aatcgataat tcgtcaccaa 1200 cgcctcgcca actcgcggcg cgacacgtta tgccgaagga actgccatcg ttcagtggta 1260 attcagaaga gtggcccatg ttctttagtg ctttcaacac ctccacagaa gcatgcggct 1320 acagcaacgt ggagaacctt gggaggcttc aacgttgttt gaaaggtgga gccttggaag 1380 cagtccgtag ccgcttactg ctaccctcat ctgtgccaca agtgatgcaa acgctgcagt 1440 tattgtacgg tcgcccggag caaataatat atgctctact acaaaaggta cgagatgttc 1500 cggcaccgaa agcggataac ctgtgtacag tggtgacctt cggaatggca gtgcaaaact 1560 tctgcgagca tctggaggcg gctggccaag tgcttcatct atccaaccct gtcttgctac 1620 aggagcttgt ggataagctt ccagcgaacc tgaaactcga ttgggtaaca ttcaagcgac 1680 agtttctggt gacggacctc cgcgtattca gtcgtttcat gacgaacctg gtatctgcag 1740 cggctgaagt cactctcacg ttggatcaga aagggccaaa accgaagaag gaggagaagc 1800 agaaaggatt cgtgaatgct cacgctacgc cgtccagtat agtggtgacg gaaaagccga 1860 aaacggagtt cgtcaaggca gcatcgccaa ttacctgttt ggtctgtgga aatccggatc 1920 atcgggtgaa ggaatgtcca atcttcaaaa agatggattc tgacgatcgg tggaaggttg 1980 ttcagactca ttacctttgt cgggtatgtc tcgggaaaca cggaagaaaa ccatgtcggt 2040 cctccacgcg ttgtgacgta caagggtgcc aatcacgaca ccatcgactt ctgcataacg 2100 aaacagcaag ttcatcgcct gcaaccaagg cgacaacaca aggaggtgaa atgacgccat 2160 gcaaaaatcc taccgaaggt gtgaatgctc atcacactca gagctccacg ctgttccgta 2220 tgttgccggt gcgcctgttt agtaaaggaa ggagcgtcga aacactggcc tttatcgatg 2280 aaggttcttc ggtcacgctg ttggaaagga gcatagcaga tgctctgcaa gccaaacaaa 2340 ccgaaatgcg tctctgccta acctggacca gcaacatcag ccgtgaagag gaaggttcat 2400 gccaggtgga ggtcgagatt tcgagcgtcg gaggaggtaa acgatatcct ttgaaagacg 2460 ttagaaccgt ggagtcctta gcgctaccgg ctcaaacgct gcactacaaa gagctcgctg 2520 atcgtttcga gcatctccga aagctgccag tctcggattt tgagtccaaa gctcctggaa 2580 tactcattgg agccaagaat acgcatctca ccgccactca acaattacgt gaaggtcgag 2640 taggagaacc gatagcggcg aaaactcgtc tcggatgggc aatctatggg tcaatgccga 2700 acgggtcgaa catcgcgagt tgtaatttgc acatctgtgg ttgcgattcc gacaatactc 2760 tgcacgagct agtgaagcag tacttcacgg tggagaatgt aggagtttcg gtggatcgga 2820 gccctgattc ggacgaagac aagcgggcca gagcattact cgagaaaacc actagacgga 2880 tcgaaggcgg ctttgagacc gggttgctgt ggcgccacga catcgtggag tttcccgata 2940 actacgccat ggcagtgaga cggctacggt gtctggaaag gcgctttaac acggatccaa 3000 ctttgttcga aagcgttcag cgtcagataa tgcagtatca acagaagggg tacatccatg 3060 aggcaacaaa cgaggagctg gatgctgtcg atccgagaag gttgtggttt cttccggtcg 3120 gaatagtgcg aaatccgaaa aagccgaaca agctacgtat tgtgtgggat gcggcagcaa 3180 cagtgaatgg agtatccttg aacagcatgc tattgaaggg cccggatcta gtgcagcctc 3240 ttccaaatgt tctctgtgga ttccgagaac gcaagatcgc ggtggtggga gacattatgg 3300 aaatgttcca tcaactgaaa atccgaaaag aagaccgata cagccaactg tttctttggc 3360 ccaatgaaac cggtccagat ccgaagatct tcgtaatcga cgttgcgaca ttcggctcca 3420 ccagctcccc atgctcggcg caattcgtta aaaacttaaa cgccactgag catgcggaag 3480 cttatccaag agcagcggag gcgattgtac gtcgacatta tgtcgacgat tacctggaga 3540 gcttcgacag cgaggaagaa gcgtgccaag tcatagaaga agttaagtta gtgcatcggc 3600 tgggcggatt cactatccgc aactggctct caaactcgac agcggttctc cgacgggtcg 3660 gggaagccga gacgaaaaca ataaagatag ttgatgatag cggatcccaa gttgagcgag 3720 tactgggtat gcagtggagg tcgaccgaag acgtgttcgt attctccagc gaagttgacg 3780 tagaatcagt tatccctacc aagcgaggca tcttacgatg cgtaatgagc cagtttgatc 3840 cgctggggct cctgtcacat ttcctcatcc acggacgtgt gataatccag gatatctggc 3900 gaagtaaagc tggttgggac gacacgataa acgaagaaat cctagagaga tggcgtgtat 3960 ggacgaccaa attcaaggaa ctcgaacaga tccaaatatc acgtgcctat tttccgggcg 4020 ttacgtcatc agacatcgaa gatttacagt tgcatgtctt cgttgacggc agtgagactg 4080 cttatgcctg cgtcgcctat ttcagagcca ccgtcaacgg aaagttcgaa tgcgcacttg 4140 tgggaggaaa agcgaaagta gccccgttaa aagcgctctc aataccaagg ctagagctcc 4200 aggcagcggt cataggctgt aggctaatga aaacgctctg tagcagtcac agtctaccaa 4260 tatcgaagcg aatgctttgg accgactcga aaaccgtgct ctcctggatt cattccgatc 4320 accggcgcta ccgacaattc gtcgcttgca ggataggaga aattctttcc aaatccaatc 4380 ctgaagagtg gagatacgtt ccatcaaaag caaatgtcgc agacgatggc acaaagtggg 4440 ccagagggcc gaatctgcaa gccgatagcc gttggtttaa gggtccagag ttcttatgga 4500 agcctgagga gtcttggcca cgtcaaactg aacctgaggg cacaaatgtc gaaattagat 4560 catgcaacgt ccaccagata atggagagca gcgaaatcgt caattgggac cgcttttcca 4620 aattcgaaag gctcaggcga acagttgcct atctttatcg ttttgccaac aactgcagac 4680 gaaaagcgaa gcagcagccg ctacaatatt cgcacattac ccgtgaagaa gtacaagaag 4740 ccgagaacgc catatggcga ttcgtgcaac aagccgagta tcaagaagag attatccaac 4800 tccagaaaat gaaagaaggc cataagtgtc ggttgaaacg aaccagccct atctacaagc 4860 tgtcgccctt catagacgaa tatggagtga tccgaatgga ctcgcgaatt tctggagcaa 4920 tcttcatacc ttacgatgct aagttcccca taattctccc aaaaggacat cgcgtgacaa 4980 aattggtcgt tgactggtac catcgctcct acctgcacgc aaataatgag acagtcgtca 5040 atgaaattcg acaacgcttc cacgtttcat gtcttcgggt gctggtgcgg aatttatcaa 5100 gaaattgtat gcagtgtaag gttctcaagg ccaaaccatt gagcccacgg gaggcacccc 5160 ttcctgaagc gcgcctcagt ccatacactc gacctttcag ctttgttgga ctggactact 5220 tcggtccaat ccaggtcaag gtcgggcgct cattggtcaa acgttgggtg gcactgttta 5280 catgccttac catccgtgct gtccatctag aggtggtaca ttcgctatcc actgaagcgt 5340 gcaagatggc aatacgacga ttcgtcgtgc gacgaggagc tccattggag atcaggagtg 5400 acaacggtac caacttctta ggcgccagtc acgaattgat tcagcagatc caagagatta 5460 atcagcatct gtccgccgta ttcaccaacg ctacaaccaa gtgggtattt aacccaccgt 5520 cagcccctca catgggtgga tcgtgggaga gactggtcag atctatcaag atcgcttttt 5580 cagccctaac gaccaccaaa aatccagatg atgagacact tctgacgttg atgatagagg 5640 cagaaggcat agttaactct aggccactaa cgtttgtgcc tctagaaaca gaatcccaag 5700 cggctctcac accgaatcat tttttgttgt tgagctcgcg tggagtcgct caaccaccaa 5760 tgaccatccc agaacgtcca gagtccctta ggaccaactg gcgaatgacg acgaacctgg 5820 tgaatcaatt ttggcaccga tgggtgcggg aatacctccc aactatcgct gtccgtacca 5880 agtggtacga agatactaag gacccgaagg ttggagacct agctatcatc gtagatccat 5940 cggttcgaaa cggctggctg cgtgggcgaa ttctttccgt tatcattggt cgggacggaa 6000 gatgcagaca ggttcttgtg aagacatcta gcggtgtact acggcgtcct gtgacaaagg 6060 ttgccatcct ggatattaag ccagcagagc gaggtgatgt tgaaggtaac gcagtaccgc 6120 cggaagtgct ggacctgcat tacgggtcgg ggga 6154 // ID hAT-74_HM repbase; DNA; INV; 5114 BP. XX AC . XX DT 14-JAN-2009 (Rel. 14.02, Created) DT 14-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-74_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5114 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 414-414 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1079..3751 FT /product="hAT-74_HM_1p" FT /translation="MPSNNGKYYFKKKYKKIQSKTIVKLFEKHKETSVQNY FT SQQCTSIVSESPHQSTSIVSESPHPSISSKQSYHQSTSVVSESPHRSISAV FT NECTYQSTSVLRQKPPQSALAVSESAHHSTSVVYQSPHQPTSTVSENNENT FT ILLTSVLGKRKNKKVKPAQIPIKNVEDLRLKTTQKYEVLFPDFFYSSSKQG FT WFCKVCISFAPGKKSSRAFIENPGKFADHPSERANDHLNTHRHTIALQNKQ FT AFDEMSKQDSNIWKMLRENTLAKETLKTSTSRFVIKTFFRVTHLLIIKNWA FT HTHNFSDIIELIAACGCKEVKAHVLSSPANATYMSPEYIAKYINIIFDFIK FT NPLLQSLKNGGSFSFFSDETADITSIEQLTIYATFEHNSIVKEHFIGLVPL FT SKLTGSSLSAPNVFDVIQKFFRANNISFKNARFACMDTTSVNSGIKGGLKA FT YIEDAVPLCVWVGCTSHKLALCFKHILKDFPTIIEVDIFLLNLWKFFKYRT FT LAMHLLQECSEIYGELETIVPVCPSVTRWTAHDRACLVVFKGYKPILQALS FT TCYTERKEAEALGLFIQASSQSNIATILMLLDVFEAIRPLVLALQSSQVGI FT SLSDIPTYVNQANHSLVDLISGTRRKYFTQKNFIEMKSVTDELAFSLPATS FT RLKKNNNFDLSEFELNVFISFVNSFKSELKNTFSQMDFWQSLSIFDPRQLP FT KTLNQLSSYGKESLEQLITHYGRPKSSNKFGALHIQVPDISETETKNEFES FT FKIILFTKRREWENSFDIRISKEKDQEVVNNLIKSRNNFTPQELWKVINND FT PVLISVYANCMFLTKLLIIFPLSVACVERFFSKLKIVKNRLRNQLSHSTLE FT RLMMIATEGPKECFDDDLLEYFVDELKGKNQNMRIKL*" XX SQ Sequence 5114 BP; 1817 A; 825 C; 756 G; 1716 T; 0 other; cagggtgtta gccagcctga tggcaccgcg tataacgcgg aattcccaat ttttttaaat 60 ttcccaaatt gttattaata agcttagatt tagagagtta cgcaagactt gtagaaatac 120 tacgaaacga agaaaataag aaagtacacg aaacgaagta aatgattcag tccattgaga 180 acaattactt tctttataaa ataaaaaata gcgttgcaag tcattcattc aaaaataaag 240 ttaaaacttt attttttaac tgacgtcttg ttttcttttt gcttaaaacg tagctttttt 300 attcgctttt tttattcaaa ctttttgttt acaataaata attttaaaat aacacatgct 360 gtttatcttt agttggaaaa gtaacaaaga aattagcata gtgtcatttc gaaatttctt 420 ttaaaatttt acatttttaa aatttacatt tcgaaattac ttttaaaaaa agataaaaaa 480 agttatttat taagtaagct atatttgctt gtgacgcatt ttagtgtaag cattttaata 540 ttaagttaac aaatcatttg tgcttataaa tgttttcaag atttgcaatt tttaccaatt 600 tttttttaaa tttacaaatc aaaattttta ataaacaaat aaaagaagtt tagtttagta 660 aaacaaaatt tacaaataca aaaatcaatt tttattaatt tgcacatact aaagttacca 720 ctttagctaa atttatcgct aaatatttac ttgcgcgatc aacaaatctt ccccggcgtt 780 agttgtttat tgaatttaat tctaaaatac catgaagcag tgaaatttct tatcaataac 840 tttttattcc atagaaattc tattcgatta caaaaaattc actcaaaaga cattgacgtt 900 ctatttgtct aaagcaacaa aaactttttt aacaacgaaa actttttaat ttttaacatt 960 ctatttgaaa aaattctatt gaaaaacgtt tcgcttcaaa gaccttaata ttcttgaatg 1020 agtttgacgt tctattagta atttagattc aagtcaataa aataaaagcg gcaacaaaat 1080 gccttcaaat aatggaaaat actattttaa aaagaaatat aaaaaaattc aatcaaaaac 1140 tattgtaaag ttatttgaga aacataaaga aacaagtgtg caaaactata gtcaacaatg 1200 tacatcaata gtaagtgaaa gtcctcatca gtctacatca atagtaagtg aaagtcctca 1260 tccgtctatc agtagtaagc aaagttatca tcagtctaca tcagtagtaa gcgaaagtcc 1320 tcatcggtct atatcagcag taaatgaatg tacttatcag tctacatcag tattgaggca 1380 aaaacctcct cagtctgcat tagcagtaag tgaaagtgct catcattcta catcagtagt 1440 ataccaaagt cctcatcagc ctacatcaac agtaagcgaa aacaatgaaa acacaattct 1500 tttaacttca gttcttggaa aaagaaaaaa taagaaagtc aaaccagcac aaataccaat 1560 caaaaatgta gaagacctac gactcaaaac aacacaaaag tatgaggtgt tatttcccga 1620 ttttttttat tcgtcatcaa aacaaggttg gttttgtaaa gtttgcatat cgtttgcccc 1680 tggaaaaaaa agtagtcgtg catttattga aaaccctggc aaatttgcag atcacccctc 1740 ggagcgtgct aacgatcact taaatacgca taggcatact atagctttac aaaataaaca 1800 agcttttgat gaaatgtcta agcaagactc gaatatctgg aaaatgctgc gtgaaaatac 1860 ccttgcaaaa gaaacattaa aaacaagcac ttctcgcttc gttattaaaa ccttctttcg 1920 agtcacgcat ttgttaatca ttaaaaactg ggcacatact cataattttt ccgacattat 1980 tgaacttatt gctgcgtgtg gttgtaagga agttaaggcg cacgttttgt ctagcccagc 2040 aaatgcaaca tacatgtctc cggaatatat tgctaagtat ataaatatta tctttgattt 2100 catcaaaaat cctctactcc agtcacttaa aaatggtggt agtttttctt ttttttcgga 2160 tgaaactgct gatattacgt cgattgaaca gttaactata tatgccactt ttgaacacaa 2220 cagcatcgta aaagaacatt ttataggatt agttccttta agtaagctca ccggctctag 2280 tctatcagct ccaaatgttt ttgatgttat tcaaaaattt tttagagcta ataatatctc 2340 atttaaaaat gctcgctttg cttgtatgga cacaacaagt gttaattctg gtataaaagg 2400 aggtttaaaa gcctacatcg aggatgctgt gccactgtgt gtttgggttg ggtgtacaag 2460 tcataaactt gcgttatgtt ttaaacatat tttaaaggac tttcctacta ttatagaggt 2520 agatattttt ttacttaacc tatggaagtt ttttaagtac cgcacactag ccatgcattt 2580 gctgcaggaa tgctccgaaa tctacggaga actagaaacc atagttccag tgtgtccaag 2640 tgtaacacgt tggactgcac acgatcgagc ttgtttagtc gtgtttaaag gttataaacc 2700 gattttacaa gcattgtcaa catgctatac cgaacgaaaa gaagcagaag cgctcgggct 2760 atttatacaa gcgagttcac aatctaacat cgcaacaata ctaatgctct tagatgtctt 2820 tgaagcaatt cgcccgcttg tcctagcgtt acaaagttcg caagtgggaa ttagtctcag 2880 cgacattcca acttatgtga atcaagctaa tcatagttta gttgatttaa tctcgggtac 2940 tcgacgaaag tatttcacgc aaaaaaattt tatcgaaatg aaatcagtga cagatgaatt 3000 agctttttcg ttaccagcga cgagccgatt aaaaaaaaat aataactttg acttaagtga 3060 gtttgagtta aatgttttta tatcttttgt taacagtttt aaaagtgagc taaaaaacac 3120 tttctcgcaa atggattttt ggcaatcgct ttcgattttt gacccaagac aattgcctaa 3180 aacactcaac caactgtcta gttatggtaa agaaagctta gagcaactta taactcacta 3240 tggcagacca aaaagcagca acaaatttgg agcattacac attcaagttc cagatatctc 3300 agaaaccgag acaaaaaacg aatttgaaag tttcaaaatt attttattta ctaaacgccg 3360 agaatgggaa aatagttttg atatacgtat atcgaaagaa aaagatcaag aagtagtcaa 3420 taatttaatt aagtcaagaa ataattttac gcctcaagag ttatggaaag tgattaataa 3480 tgaccctgtt ttgataagtg tttatgctaa ttgtatgttt ttaactaagt tactcattat 3540 atttcctttg agtgtagctt gcgttgaaag atttttttca aaattaaaaa tagttaaaaa 3600 ccgtcttaga aatcagctaa gccatagcac attagaaagg ttgatgatga ttgcgacaga 3660 aggtccaaag gagtgttttg acgacgattt gttagagtac tttgtcgacg agttgaaagg 3720 caaaaatcaa aatatgagaa ttaagttgta gatttaaaat ttacaacgta aatctcattt 3780 ttattttttt atttgagctt acagacaatt agtgaatatt taatttttgt agttttttca 3840 cggaatcttc tcgttgattt cttgtattcc agccgtgttt ttacaatttt tgtttgtcgt 3900 agttaactga acttttattt gtatgtaaaa taaaagcttt ttgttcccac aggaagtcct 3960 aaactttaga gctaaatggt tttaataatc gttatcttgt gcattagata ttattagacg 4020 acttcttgca acatcagcta gatttcttaa acccaatcat tacttaattt tcttaaaccc 4080 aaccgttacc taaataaaca atcgtaaata aataaagatc gtacgtcaag aaacgctcgt 4140 ttaaaacgcg catagcggtc caagtttatc tgaaagctaa ctagtttttc ttatcgccga 4200 tgacgtttaa ggcaaagttt agaacaatga agacttaaaa tttacactag caaattattc 4260 atcgataaaa gttagattat aaaattagct tgcaaaagta tgaatatctg tttaaatata 4320 aaaatggtta aaaaaaaggt tttaagtatt ttaaacaaat taataaaact ttttttgaaa 4380 gaagataaac gttaaaacta gataaacatt aaactgttaa aaagttcagt ggaaaaattt 4440 aaattttaca tttactataa aaaaacgcta ccgtcgcttt aatttttttc ttaaagtcac 4500 tgtagtgatt tctaattggt tgattttttt ttttaattta agaaatgcat gcccaaacca 4560 aaacccttaa ttgatgggta ggctctcttg tcaaaccgtt attcaaaaac aatataatat 4620 atattgtttt tttgcatttt cggtccttaa ggtccgaaaa tgcaaaaaaa cgacttcatt 4680 taaaagagaa cccgcctacc ccaaaccctc agtcgttgtt tgcacattcc gctgcatgtt 4740 ctctctaaaa atcagtcggt gtatgtgcac accgcagcgc aaatccctat ataccctatt 4800 ttgcctattt tctagaaata caagctcgcg gacgcaaagg attaataatt acagatatac 4860 acgctctaaa tcatcgttta aacttttttt ataaaatata atatattttt ttatgatatt 4920 aaatattact ttaaataata ttccttttaa tattatttaa aagcgatatc aaagtatcac 4980 accgttaatt ttttatttga aaataaaaga tgcgagcttt tttttttttt ttttttcact 5040 caacagttga cggcaaccta aattcccaat actatgaaaa cgctgtatat attttattcc 5100 tggctaacac cctg 5114 // ID Gypsy-102_AA-LTR repbase; DNA; INV; 248 BP. XX AC AAGE02017444; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-102_AA_; KW Gypsy-102_AA-I; Gypsy-102_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-248 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017444; Positions 51367 51614. XX SQ Sequence 248 BP; 70 A; 37 C; 54 G; 87 T; 0 other; tgtagtgtct tagttggaca cgctctgaat tatataaata tgattttgta ctgaattcga 60 tttagagtgt aactgatgtc attagatgtg ctatgaatgt agcgcgagtt ttgtatgtgt 120 gctggaaaga taaaatacag accagctcga agagcagggc tcattcgtgt tctggatagc 180 aaacttgaaa gttgtgtttt ttataatttc ctctaaagaa gttttcccca cagttcccag 240 atatttca 248 // ID REP6_TT repbase; DNA; INV; 6107 BP. XX AC AY371731; XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Tetrahymena thermophila micronuclear non-LTR retrotransposon DE REP6, complete sequence. XX KW Non-LTR Retrotransposon; Transposable Element; REP6_TT; KW endonuclease domain; reverse transcriptase domain. XX OS Tetrahymena thermophila OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Oligohymenophorea; Hymenostomatida; Tetrahymenina; Tetrahymenidae; OC Tetrahymena. XX RN [1] RA Fillingham S.J., Thing A.T., Vythilingum N., Keuroghlian A., RA Bruno D., Golding B.G. and Pearlman E.R.; RT "A non-long terminal repeat retrotransposon family is restricted RT to the germ line micronucleus of the ciliated protozoan RT Tetrahymena thermophila."; RL Eukaryotic Cell 3(1), 157-169 (2004). XX DR Genbank; AY371731; Positions 1 6107. XX SQ Sequence 6107 BP; 2393 A; 1306 C; 968 G; 1440 T; 0 other; aaaaactttc caaaaacttt aattaagatt cgaaaaacga agctaaaaag taaaacagag 60 aagtacaaat taattaaaaa tcagaggaaa aacaagaaaa agtcaaacaa actattaaaa 120 gatatccaaa taacgagcaa aacctactga aaatgtctca aagaagatca agcaataagg 180 aagcaagaga gcttagcaaa caagaaagaa caggttcaaa caaccaccgt acctcctagg 240 aacgcatcag cggacactta aaacatagaa ctcaacgaaa catcttagaa aaccactcaa 300 ccaagcacca atcaaaaacc taatccagaa atttctgaaa agaaggtaga tggcaggaag 360 agaggtttca gatcggccta caagaataag gtagaaaaac cataataaga agagcagcaa 420 ccggtgtcaa ctccactccc atctgaagag aattgtttaa aactgctgta agaaccccaa 480 atggtcataa ataagtagcc aagacccgaa gccaaggagg ttccaatcag cagcaataca 540 tcaatctcct aaagatcaaa tagcaatgat agtagcagtc taaaacagaa ccacaaccat 600 tagcatcgac actcaagcaa ctctagacaa gaaagtaaac cagccttgca agaaccgaaa 660 ccggaacttc caacctccat cgatatcggc aataagatct ggaaattgtg tagatttgac 720 gcaggtatga atggatgcag aaatagttcc taggattgca agtttatcca tttggacttg 780 atagaacagg tagctctaag cgacaaaaag aagctagcca atgtagcaat ctggaactgg 840 gactaagcta agctcgacag caagtatcaa ccttcggaca aggaccttct caactcctgg 900 gtaaactcaa attcaaacga gggaaatccc ctcttcacca aatacaaaac aatgattgag 960 gagaaggatc ctgaattcgt ctaactcact aaagatctca cagctgagta aatggaactt 1020 gaagaactct aaaacgcatt ccaagtcaag aagcagcgct acatgtcaga gctttcttca 1080 agactaagac agaaaattct atccaaaagc aggcaaaagg cctttgataa actgctttca 1140 gagtcagccc agaagagagc agcagcatct gaagcagcac tatctcaaga aagtagtagc 1200 atagaatatc tgccaccatc ccagtagaag tactcaaatc gatatccgat gataccacat 1260 cacaaggacc aatatctacc ctttgaaaat tcacagtatt cctcactcca gtagtagcca 1320 aaatctccaa tacgaacttc tcagtagcaa cattattact agcaaaacaa tactcaaagc 1380 taccagccac agcactacgc ccataactgg aatccaacct attagcacta gcaatctaac 1440 aattagagat ttgaacccag ctacaagccg tcctactact gaagaaatac ataaaaaact 1500 ccttgaaatg aaatattaac ttctcatgca tcaataagga agtgttttac ctctattaga 1560 gatgagtttt gagtaaactc ttattcaaag aatttggaaa gttttaaaaa gaacgtatca 1620 aacaaaaacc ctgtgtctcc ttcatattta catattgtag tacctcaaca acgcttaaat 1680 acagctcatt ctctatcccc aataattagc aacagcaaat tacaataggc tttgctctta 1740 ttataaaaat aatatataaa cagatataaa atttactatt ttttaggtat tattctaaaa 1800 ggaaattagg aaggccatca aataaagcaa tcaacaaaag aacatctata aaatgattag 1860 atcaaatcca atttatagat caagcaagaa gtcacaaaca caaacctcac agacaattaa 1920 gctagattca agactcatct cgtaaataaa ctcacacctc aacaaacatg attctaacct 1980 aaaaagtaac tcctgctcaa accaatcaaa atcttccaaa cgatgtatta acaaagactc 2040 gataaagaaa gtaaatcaat acgttggctc gaacgaaatt tctccctctt aaaaatcacc 2100 acaacaacat ttaaaaagca aaattattgg ctctcagctg acaattacag gcttcaatac 2160 tcaaggagta gcaaaagcta acaaaagaaa taagcaatac tctacacttt acctccaaaa 2220 gcttttcgct gaaagcgatg ctactatact tctggagaca aattgcaaag aaaatatgcg 2280 agtggatatc cataatccag attacattca aatgaacaat cactgcaaag tacgctaaat 2340 gggtgcagga acagcctcta ttcaccacaa ggaaatcaaa ctagcaccat acgtcgcaag 2400 cctcaataat gaaaaagccc ttgcccaagt attaattctc gagaatcaat agagagtttt 2460 aatcatgggg ctgcatctcc aactctaggc gaaccctatt gaagaatagg ctctactcga 2520 agaaataatc tcacaagtgc tgcaagacaa caggctcaac catatactaa tctatgggga 2580 tttcaacctc gatgtttcaa cgtccaactc ctctcactca gatgcaagga aagttggaca 2640 aatcagaagg ctcaaggact tcttgaaatc aaaaaactta ttcatacaca gcacaaatag 2700 gcacaccaga gaagcattta agcagaacaa aatcataaaa actcaagtgg acttcttcat 2760 aagctcgttt gctcctaata gcattctcaa aatcgaaaca attacacacg acgaaagcaa 2820 cagaggcagt gatcactatc ccatccgaat tcaaattgac ttgcaagttg caaaggcgag 2880 agaaatcaaa aaagtcatca acttaaagca gctagatctc ctatcctaga agcttcaaaa 2940 attcattcgc agcagtccaa agaatgatac agaaatcaga aataaaatta acagcattaa 3000 aggtaaactt gatctcaaat agcaaaactc ttaacaacta aggcggaaca taaacgaaag 3060 aatcaacgca gttatcaggg cgatccataa gcatgaaaag gtgctgatga atgattaaaa 3120 ctcagcaaaa actagagcag aacacatgta aacacgtaga aaaatcatac agagcaagga 3180 caccataatc cagcataagg tcggtgagat actcaaagaa tactatgcgg aatcaacgaa 3240 aaaagtccaa tcactcttta aaacaaacat taagcgatat tattagaaca tcaagagagt 3300 atgcaactta ggcagcaagt ccaatgttct aaacacttca agcggaccag ttgaaaactc 3360 gaacaaagaa ctcttatttg atagcagaga catctaaaac gaaataagca aatacttcaa 3420 agaacactac aaatgtgaag cacacgtcaa ttactatcac accattgggc aacttagcag 3480 cagcgaaata gacactctca ttcagtcttc agaaactctc tacagcagca ggaacaaggc 3540 cttttccacc gattgtatca aagatcagtg ctttttccca ccaaattata gcgaaatctc 3600 taacactttg caaagcattc catgctccaa aactaccaag aagcaaatga tttcggagaa 3660 aatgctctct caaaagtagc atagagacgc aaacctcaga accctcctca gagacatcct 3720 cactaaccca gactccttca ggcagacctt caatgcgagg taaatctacc tcctaaagag 3780 cgaaagtagg taaatcatca actgtagacc catcacgatt caatcctcag ctattaaaat 3840 cctggaaaat gcaatgctcc agcatgtcca aaagctcaaa gacgaaggga aggttaaaga 3900 tttccacatc agctagtgtg gcttccaaaa gaatcggagc acaatcatca acatttgcag 3960 aactattgct cttatccaaa gcacagtagc aaagaaagaa aacagagttg ccatcttcgt 4020 agatgtcaag agtgccttcg atagcgtcaa ccacgaacag ctgttctaag cattacgaaa 4080 tcaaggcttc gacgacattt tcatcaaatc tgtcgccttt ctctactagc actgccgcat 4140 caatgggtac caaatcggga gaggagtcat ccaaggaggg aaactatcgc ctatactgtt 4200 caattacctc tatgaagaag tcagagtgaa aatcctggaa atatggaaga gtaagaagct 4260 ggattgccaa gacctccact ttgagctctt tgccgatgac atgctaatca ttctcaaaaa 4320 gtacaagctc acagcaaccc tcctcgaagt tctcaagcaa gcatatcagg aaatcaatct 4380 ccaaatcaat gaatctaaga cgaaaatcat gctaatcggt aagcaggaaa catacatcaa 4440 ggactctatc agactaaggc aggcgaaggg catagatctc gttatggagt tcaaatatct 4500 tggcttagtc atcaacaacg tgggcaaaat tcacaaggat gtagctaaga aagtagataa 4560 ggcaaagaat ctcacgtaaa tgctcaagag gtggcgctac aacaagctag gcattattcc 4620 ttgtcttctt ctctggcagc ttttcattaa atcgcagcta taatatgcct ctgcaatcat 4680 gatcacatca gcctaaagcg aaaaatgtat gagaagcctt cgtaacatgt acaattcttc 4740 ccttagggac actctcggac taaacaggaa cacttcaatc tcaactatct gttacgttct 4800 tggtatagaa aatattgaat ctatgatact caacagctac gacaatctca tgtgccaaat 4860 taacgctgat aaacgaatcc aaatgcaatt cagagtcttc actaaaagca tcaaataaga 4920 cattgcaagt aggcatcagc agtttctaca agcatctact catagggaca acaaagttaa 4980 acaaagattc tggtggcagc tcaacaacaa gcatgatgaa ctcattaaac tcaatatata 5040 tctctagaac ccagctgggc aattctgctg gaaatgtcaa aggttctaca agggtgacag 5100 ctggctctac tgtcaacata aaaaagaagc ttggattgaa ctctaaaaga tactaaatac 5160 cacctccaag gcagaaatag tcagtgtgct caatcttagg aacatgagtt ggtaacagtt 5220 tccaaatgat gtacgaacaa gaatttaaga gtaggcaaga ctgctggtct attaagacat 5280 ttctaaggtt agatgaaata aaataagcct cgaaaattta attattctca ctaaaatatc 5340 cttttttgta cccccttaag tttctctctt tcgatgataa aaataaaaaa ataaaattaa 5400 aataaatatt aataataaaa ataataatca aataataatc aattattgaa gatattttca 5460 ttgaaatgct agatttacta ataaactaaa aaattagctt taataagcaa agctaaatta 5520 ataatattta atgcttttta ttcaaaagat ataatatgct caaattgtcc aaatcaacaa 5580 aaataaaaat aaataaaaaa gggggtgctc tcatgatctt tcgttaatta ataatttaaa 5640 aaataatgat cgaatctacc aagatatttc taaagttaga tgaacaaata agaaaactac 5700 accccccccc ccctcccccc ctcccattaa actcatccaa cttatgaaac tcatatggaa 5760 ctcatgtagc tcaagtaact catgtaattc atggaactca tgaagctcct gaagaaactc 5820 ctgcaactcc agctcctaca actcctgaaa ctcctgaagc tccggaagct acagaagcta 5880 cagaagctac ggaagctccg gcaactaaat ggagtgacgt tgatactata tataatattt 5940 gaattaaata aataaacaaa ttgctttgaa aacccttatc tagacctaca gattaacaca 6000 aaaaaataaa taaaataaaa tatataaaat tgtttttcta taatataatt taattaagat 6060 gtattctgag tgcatattta tgtgcgagta ttccattcca ttccacc 6107 // ID LOA_Ele4 repbase; DNA; INV; 6400 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A LOA clade non-LTR retrotransposon family from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; Lian; KW LOA_Ele4. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6400 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6400 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 20 sequences with >94% identity, and ~100% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 1080..2621 FT /product="LOA_Ele4_1p" FT /translation="MEFTRENDLIEKESLEGVSSCSSSILNTPEREDMYVD FT DDDYDDGINVTLLSSSVIMESNKDSLQNTVEEKQMEASIATLNSTNNGDEK FT GCSKLPSTAVATSDKKQKRLNGAGKKRFKYLVAHGHSPNDARILAEXPFRV FT VQPDPSKRRRNADLSESTSSDTNPPKKLARQVVRGQSSKAKSSVQQRLEQA FT SKGGVSLDPKEAEGNNRLNKPLFSEVVNCTRIGILPEGFPNTEFTTQQLVT FT IQNAILKKVAEQRKEPIKPKFGNCLFRPGHMIIICKNQDTVNWLKAKISNI FT KPWENASLIAVEEKDIPRPEILVGFFPRSEQDSNEEILAFIESQNEDLIVD FT AWRILKRYTVKQHHVELIFTVDAVSMKSLENCKFIIDYKFGVAYIRKRNSK FT AEVIEDNEEKASKDETGKSRQHFEGARDVEMSEPGPIGVDNTLKSAENNAH FT CTLTTSTTIVNQCSPHSKEKKNDKSTVVHSISQPDHTKKRNNNLEYRQIES FT HKQTLKKEADKLPANQNK" FT CDS 2635..6267 FT /product="LOA_Ele4_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTKRSIKFIQENLHHAKAASAILSKTFIKGKLDVALL FT QEPWTKNHTILGLQTTGCKLIYDNTQLAPRAAILLNCNVKFTPISEFIGRD FT IAAISLEVPTAKGKTIVYVASAYFPGDVDEVPPPEVAAFVSHCKKQNKAFI FT IGCDANAHHTIWSSTGINSRGELLFDYVSQNDIDICNKGNSPTFSNAIREE FT VLDLTLCSSKLSDNINNWHVSDEISLSDHKQILFEFCAKDILRETFRDPRK FT TNWELFCSRLKYQKESISDTINSSLELEKAADCLTSSIINAYNESCPAKER FT STCKDVPWWNYRLEKLRKNARKQFNRAKLTRNWSLYKEALTNYNKEIRRSK FT RKNWRYMCEGIESSHEAARLRKVLSKDHNNGLGTLKKDDGSYTADTQESLE FT LMVKTHFPGSIPITNRSRNQDISTSLNGSDRSNIEVADIFTYSKIEWALDS FT FESFKTPGLDGIFPAHLQKCKETIIPILMTLFKSSFLLGYLPTKWRQVRVV FT FIPKSNKKDKTVPKSFRPISLTSIFLKIMEKLIDEYIKSNILKNKPLSRAQ FT FAYQTGKSTLSALHSLVSKIEKTFDTKEILLAAFLDVEGAFDNASFTSMKN FT AMRNRDFPEKMIEWIEEMLSRREISAYLGDSVVKVTAVQGCPQGGVLSPLL FT WSLIVDDLLVKLKLQGFEIIGFADDIIIIVRGKFDKTVSERMQTALNLTLA FT WCNNEGLSINPDKTTIVPFTRRRKVCISALKIGDTKLSLSLSAKFVGVILD FT CKLVWNKHIEQQIEKARNAFWGCKRTFGKKWGLKPKMICWIYSAIVKPIIS FT YASIFWWEKTKQVATQAKLNKLQRLATISITGAIRNTPSNALNAMLNLLPL FT HDYIQMDAVKSAIRVRRSLDSADSELKGHMTILKHFKISPVYLVNEDYMLK FT RNFFDHRFWVADVTRQDWDTGGPTFRSGSLIFFTDGSKQDNMVGSGVFGPG FT VNVSYALGSWPTVFQAEIYAILECADICLKRRYRNANICICSDSKAALNAL FT KSRTYTSRLVWECSKLLQQLSYRNKVNLYWVPGHCGIEGNEKADQLAKQGS FT SMQFIGPQPFFGMASCALKMQLKSWMKAKIKLNWKNAMDSRQSKRFIEPDA FT YQTLRLLNLNKHDLSTYTGLLTGHCPSKRYLKIIGKLPEDMCRFCKLESES FT TEHLLCKCVALYHKRCRYLDKGLLEPNEIQYYHPKQVLHFIRNVVPDWDAR FT Q" XX SQ Sequence 6400 BP; 2187 A; 1127 C; 1202 G; 1881 T; 3 other; tttttttttt ttwtctctgg caactctgct tattactgtg ttcgactact atcgcccacg 60 ttatactttc gctgattttc atagtgtaaa ccgccggtaa attcgcgcgt aactgtttaa 120 cagtgagatt tgttggtgtt ttgagtggtt ctagtgcagt aaggtaggag catgcatttt 180 taatgataat aattaattaa ttattagtgc caaataaagt gctcggtgaa agtactatgg 240 cggtacatac aaaggtataa aagcaatcca atagaggctg cgattgagag aggctgtctt 300 agtattgagc cttgcatgca agtttcgttg taagcatagt gattgcgcat gatatttata 360 cggtgcatac tatgatgtat ctgctgcaaa tataaatgca gttgaaagaa tttattgctt 420 aataaggata tagtaatcgg tcctaatagt gctaatattc cgaaaagaag tactctgtga 480 tagctacaaa taaatcatta aaatccgtac aataaagtgc tactaaatta tatgtgtcag 540 tgtttcccaa tttttgattt gtttgacatg aaaaaaagac ggtgtgcagt gagaaagaaa 600 ttaatattaa aataatttgt tttctccaat agttttttat ttatttattt aattttgtct 660 ttattttatt ttatcatcat ttttctctaa attttttttt tctttatttg tgtacttatt 720 aaattattaa ctttttttgt tatwatattt attttttttt tattaatatt attatttttt 780 ttattaaatt ttaattttta ttttattttt gttttttttt ttcaatatat ttatttattt 840 tatttatatt atttattttt tgttttcttt tttgtttatt gctactgtac atattgcttt 900 tcttatgtca gcgtttttaa cttgttcttt ttccctaaat aaacgttctt cataataata 960 acaataaaaa taagaaaatt aatgcacata tacgtacgac aaacaattaa acataaattt 1020 tgcaacatta aagtacatat agtattcctc actgttacag agctcaagta cggcgtacaa 1080 tggaatttac tagagaaaat gacttaatcg agaaagaatc gttggaggga gtaagctcct 1140 gttcctcatc aattcttaac actcctgagc gggaagacat gtatgtagat gatgacgact 1200 atgatgatgg gattaatgtc actctactat cgtcatctgt tattatggaa tcgaacaagg 1260 attccttaca aaatactgta gaggaaaagc agatggaagc ttctatcgct acacttaact 1320 ctactaacaa tggagatgaa aaaggttgca gtaagctacc atcaaccgct gtagctacaa 1380 gtgataaaaa gcagaaacgt ctcaatggag ctggcaagaa aaggtttaaa tacctcgtag 1440 cacatggaca tagtccaaat gatgctcgta tattagcgga aaamccattc cgcgtggtgc 1500 aacccgatcc atctaagcgt cgcagaaatg cagacctaag tgaatctaca tctagcgaca 1560 caaatccgcc caaaaagctt gctcgtcaag ttgtaagagg acagtcttct aaagcaaagt 1620 cctcagttca acagagactt gaacaagctt caaagggcgg cgtctcgtta gatcctaaag 1680 aggctgaggg aaacaacaga ctcaataaac cattgttctc agaagtagtg aactgtacta 1740 gaattggcat ccttcccgaa ggctttccta acacagaatt tactacccaa caattggtca 1800 ctattcaaaa tgctattctt aaaaaagttg ctgaacaaag gaaggaacca attaaaccta 1860 aattcggcaa ttgtttgttc agaccagggc atatgatcat tatctgcaaa aaccaggaca 1920 cagtcaactg gctaaaggct aaaatctcca atatcaaacc atgggagaat gcaagcctta 1980 ttgcagttga agaaaaagac ataccccggc ctgaaatatt agtaggattc tttccacgca 2040 gtgaacagga ttctaacgaa gaaatcttag cttttataga aagtcagaat gaagacttaa 2100 tagtcgatgc ttggaggatt cttaagagat atacggtcaa acagcatcac gtggaactca 2160 ttttcactgt tgatgccgtt tccatgaagt ctctggaaaa ctgcaagttt ataatagact 2220 acaaattcgg agtggcctat ataaggaaaa gaaattcgaa ggctgaagtt attgaagata 2280 acgaagaaaa agcaagtaag gatgaaacag gtaaatccag acaacatttc gaaggtgcta 2340 gagacgttga gatgagcgag cctggcccaa taggggtgga caacacatta aaaagtgctg 2400 aaaacaatgc acactgcact ctaactacga gtaccactat cgtcaatcaa tgctctccac 2460 attccaagga gaaaaagaat gataaaagca ctgtggtaca ctctatttct caaccggatc 2520 atactaaaaa aaggaacaac aatttggaat accgccagat agaatcccac aaacaaacac 2580 ttaaaaaaga ggcggataaa ctgcctgcaa atcaaaacaa ataatttaag agaaatgacc 2640 aaaagaagca taaaatttat acaagaaaac cttcatcatg caaaggctgc ttctgcaatt 2700 ttgagcaaaa cgtttatcaa aggcaaactg gacgtggctc ttctacaaga gccatggaca 2760 aaaaaccata caattttagg attacaaact actggatgta agttgattta tgataatacg 2820 cagctcgctc ctagagctgc aattctactc aactgtaacg ttaaattcac accaatttca 2880 gaattcatcg gaagagatat tgctgctatt tccctagagg tgccaaccgc aaaaggaaaa 2940 acgattgtgt atgtcgcatc cgcttatttt cctggcgatg ttgatgaagt acctcctccg 3000 gaagttgcag cttttgtgtc tcactgcaaa aaacaaaaca aagcatttat catcggatgt 3060 gacgcaaatg cgcatcacac tatatggagc agcacaggaa tcaacagtag aggtgagtta 3120 ctctttgact acgtatctca gaatgacata gatatatgca ataagggtaa ttctccaaca 3180 ttttccaatg ctatacgaga ggaagtattg gacttaactc tttgtagttc aaaactctca 3240 gataatataa acaattggca tgtatctgac gaaatatctt tatcagacca taagcaaatt 3300 ttatttgaat tctgcgcaaa agatatatta agagaaactt tcagagaccc acggaaaaca 3360 aactgggaat tattttgttc tcgacttaaa taccaaaagg aatccatttc tgacacgata 3420 aattcatctt tggaactaga aaaggcagcc gactgcttaa cttcttccat cattaatgct 3480 tacaatgaaa gctgtccagc aaaagaacga tctacttgta aagatgttcc ttggtggaac 3540 taccggttag agaagcttcg aaaaaatgct cgaaaacaat tcaatcgggc aaaattgaca 3600 agaaattggt ctctgtataa agaggcctta accaattata ataaagaaat aagaagatca 3660 aagcgtaaaa attggagata catgtgcgaa ggcatagaaa gttctcatga agctgcaaga 3720 cttcggaaag ttctttctaa ggatcataat aatggtctag gtacgttaaa aaaggacgat 3780 gggtcctaca ctgcagatac acaagaaagt ttggaattaa tggtgaaaac tcattttcct 3840 ggatccatcc ccataacaaa tcgatcaaga aatcaagata tcagtacctc tttaaatgga 3900 tctgatagat ccaatataga ggttgcagat atttttacat attctaagat tgaatgggca 3960 ttggattctt tcgaatcttt caaaacacct ggtttagatg ggatttttcc ggcgcatctt 4020 caaaagtgca aggaaactat tatccctatt ctaatgactc tattcaaatc tagctttctt 4080 ctgggatatc ttccaactaa atggaggcaa gtccgggtag tgtttatacc gaaatctaat 4140 aaaaaagata aaacagtgcc aaaatcattt aggccaataa gtcttacatc aatattttta 4200 aaaataatgg aaaagcttat agatgagtac attaaatcaa atatacttaa aaataagcca 4260 ttaagtagag ctcaattcgc ttatcagact ggtaagtcga cactttcagc tctccattcg 4320 ctggtttcaa aaattgaaaa aacttttgac actaaggaga tactattggc tgcttttctc 4380 gatgtagaag gcgcgtttga caacgctagt tttacatcga tgaaaaacgc catgagaaat 4440 cgagatttcc ctgaaaaaat gattgagtgg atagaagaga tgctatctag aagggaaata 4500 tcagcatatc taggagattc agttgtaaag gtaacagctg ttcaaggctg tccgcaaggg 4560 ggagttctct ctcctcttct ctggtctctg attgttgatg acctacttgt aaagctaaag 4620 cttcaaggat tcgaaataat tggatttgca gatgatatta ttattatagt ccgcggaaaa 4680 ttcgataaaa ctgtttccga gcgaatgcaa acagcactca acttaacctt agcatggtgc 4740 aataatgaag gattaagtat aaatcctgat aaaactacaa tagtaccgtt tacaagaaga 4800 cgtaaagtct gcatttcagc tctcaaaata ggagatacta aactatcctt atctcttagt 4860 gccaaatttg taggtgttat attagattgt aagcttgttt ggaataagca tatagaacaa 4920 cagatagaaa aagcaagaaa tgctttctgg ggctgcaaaa gaacctttgg taaaaaatgg 4980 ggtctgaaac cgaagatgat atgctggatc tattcggcca ttgttaaacc cattatttca 5040 tacgcatcaa ttttttggtg ggaaaaaacc aaacaagtcg ctactcaagc taagttaaat 5100 aaactacaaa ggcttgctac tatttcaatt actggggcaa tacgaaacac cccttcaaat 5160 gcgttgaacg caatgttaaa tcttcttcca cttcatgact atatacaaat ggatgcagtt 5220 aagagtgcta tcagagtcag aaggtcctta gatagcgctg atagtgaact aaaaggacat 5280 atgactatat tgaagcactt caaaataagt ccagtgtact tagtgaacga agattatatg 5340 ttgaagcgta acttttttga tcatcgtttt tgggttgctg atgtaactcg ccaggattgg 5400 gacacaggag gaccaacttt ccgatctgga tcattaattt ttttcactga tggttcaaaa 5460 caagataaca tggtagggtc aggagtgttt ggcccaggag taaatgtatc ttatgcactg 5520 ggaagttggc caacagtttt tcaggccgaa atttatgcca ttcttgaatg cgctgacatt 5580 tgtcttaaaa ggcgctaccg gaatgctaat atatgtattt gttcggatag caaagcagcc 5640 ttgaacgctt tgaagtctag aacttacaca tcaagacttg tttgggagtg ttccaagcta 5700 ctgcagcagt tgtcctatcg taataaggtt aatctatatt gggttcctgg ccattgcgga 5760 atagaaggaa atgaaaaagc tgatcagtta gcaaaacaag gttcttccat gcaatttata 5820 gggcctcaac cgttcttcgg aatggcctct tgtgctctga aaatgcaatt aaagagctgg 5880 atgaaggcaa aaattaaact aaactggaag aacgcaatgg actctcgtca gtccaaacga 5940 ttcatcgaac cagacgccta tcaaacgctg aggttgttaa acctcaataa acacgatcta 6000 agtacttata ctggcttatt aacaggccac tgccccagca aacgttacct taaaattata 6060 ggtaaattgc cagaagatat gtgtcgtttc tgtaaattag aaagcgaaag cactgaacat 6120 ttgttgtgca agtgtgttgc actataccac aagcgttgta gatatttaga taaaggcttg 6180 ctagagccta atgaaattca gtactatcat ccaaaacagg tactacattt tatccgaaac 6240 gtagtacctg attgggatgc acgccaatag ctgtggagat agttacttca tatagaaaca 6300 tatttctaca gtacggcaaa aacaaagagg ataacaccac aatagatcaa cttaatggtc 6360 gcagtggtat catgtcccca cagaggaaaa aaaaaaaaaa 6400 // ID Kiri-6_CQ repbase; DNA; INV; 2457 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2457 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 125-125 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >94% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 2..2332 FT /product="Kiri-6_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="ALHRIHAAFNLTVLHTGPTRITDTSSTTIDLLITDGP FT QHIKKAKAASVSSVSDHEVVYLVADIRVPRTCQRTIRARNFRNIDQLQLQA FT DFQAKDLRRLFESVDINEKATVLNLELNTLLEVHAPERTITIRDERTPWIT FT QQIKAAVGLRNLALNLYARNPNRRRGDNQWLDYERKRDRASSLIFAAKKRY FT AELHFAHDLPAKKLWSNLKREGIHNTTKQSSSSADINADELNTFFSDGHRR FT LGNAATPSHRNAPAEPSHRTAVDHGANGFSFRHTSVEEICRKIFEIDTNAT FT GTDGIPISFLKMLCPFILPVLCHLFDAIIETRTFPALWKTAIVTPIPKTSN FT PVLPKDYRPISVLPAVSKVLEKILLNQITEHLDNPNNPLLARNQSGYRKGY FT GTTTALAKVTHDIYGNLDAGRCTVMVLVDFSLAFNCVDHQLLAAKLNREFR FT FSRPACELVSSFLGERKQSVRHGNVVSAVREVTDGTPQGSCLSALLFSLYI FT NSLPASLKCNYQLYADDLQIYVSGEIDEVDRLIGTINMDLEAITHWATENR FT LHPNPKKTQAIVFSRSGRVVPQSDIVFKGEVVPVAEKVTNLGLEMDSAMSW FT THQVNSVVQKAYNTLRTFRRFSGVLTGATKRKLVQAVIVPIFTYGDIVYYS FT GLTAALKQQLHRCFKSAVRFVYNLRRRDSTAAVRHSILGHDLPDNYNLRTC FT CFIKRGYDGNLPEYLQEHLVHGQHQRTRSLIVPRHTASSGKSIMIAGVTLW FT NNLPIEAKMIPTLSSFKSALKRSFQN" XX SQ Sequence 2457 BP; 639 A; 702 C; 579 G; 534 T; 3 other; cgcmctccac cgtattcacg ctgccttcaa tctcacggtt ctccacaccg gccctactcg 60 watcaccgat acatcgtcwa ccacgattga cctcctgatt acggatggcc cccagcacat 120 caagaaggcg aaagcggcat cggttagcag cgtctccgat cacgaagtcg tttacctggt 180 tgccgacatt cgggtacctc gcacgtgtca gcggacgatt cgtgcacgaa attttcgaaa 240 catcgaccag ttgcagctgc aagcggactt ccaggcaaag gatctccgtc gactttttga 300 gtcggtggac ataaacgaga aggctacggt gttgaatttg gagctgaaca ccctgctcga 360 agtgcacgcc cctgaacgaa ccatcacaat ccgagatgag cgaactccgt ggattaccca 420 gcagatcaag gcagccgtcg ggttgagaaa cctggccctc aatctctacg cacgaaaccc 480 gaacagaaga agaggcgaca accagtggct cgactacgaa cgcaaacgcg accgtgcgag 540 ctcccttatc ttcgcagcaa agaagcgtta tgcagaactc cacttcgctc acgatcttcc 600 ggcaaagaaa ctttggtcca atctgaagag agagggtatc cacaacacga cgaaacagag 660 ctcatcgtca gcagatataa atgcagacga gttgaacacc ttcttttccg acggacaccg 720 ccgtctggga aacgccgcaa cccccagcca ccgcaacgcg ccagcagaac catctcacag 780 aacagcggtt gatcacggtg cgaacgggtt cagttttcgc cataccagcg tggaggaaat 840 ctgcaggaag atcttcgaga tcgacacgaa cgcaaccgga acagatggta tcccaatttc 900 gttcctgaag atgctgtgcc cgttcattct gccagtactc tgccatctgt tcgacgccat 960 catcgaaact cggactttcc ctgcgctgtg gaaaacggcg attgtcaccc ccatcccgaa 1020 aacgtccaac ccagttctgc caaaagatta ccgccctatc agtgttctgc ccgctgtttc 1080 caaagtgctg gaaaagatcc tgctcaacca gatcaccgaa cacctggata accccaacaa 1140 ccccctacta gcgcgcaacc aatccggcta cagaaagggc tacggaacaa caactgctct 1200 ggcaaaggtc acccacgaca tctacggcaa cctggatgcc ggacgctgta ccgtcatggt 1260 gctggtggat ttttcgctgg cctttaactg cgtggatcat cagctgttag ccgcgaagct 1320 caaccgagag ttccggttct ccagacctgc ttgtgagctg gtttcgtcgt tcctgggaga 1380 gagaaagcaa tcggtccggc atggaaacgt ggtgtctgcg gtgcgggagg ttacagatgg 1440 tactccacag gggtcctgtc tgagtgctct tctcttcagt ctgtacatca acagcctacc 1500 tgcctctctg aagtgcaact accagctcta cgcagacgac cttcaaatct atgtttctgg 1560 agaaatcgac gaagtagaca gactgatcgg aaccatcaac atggacctcg aggcaatcac 1620 gcactgggct actgagaatc gactgcaccc aaacccgaaa aaaacccagg ccatcgtctt 1680 cagtagatct ggtagagttg tgccgcaatc ggatatcgtc ttcaagggtg aagttgtgcc 1740 tgtcgcagag aaggttacga acctgggact ggagatggac agcgccatgt cgtggacgca 1800 ccaagtgaac agtgtcgtcc aaaaggccta caacactctg cggaccttcc gacgattctc 1860 tggagtcctc actggtgcca caaagcggaa gctggttcag gcagtaatag tacctatttt 1920 cacctatggt gatatagtgt actacagtgg gctcactgcc gcacttaaac aacaactaca 1980 ccggtgcttc aagtctgcgg tacggttcgt gtacaatctt cgtcgccgag actcaacagc 2040 agctgtaagg cattcaattc ttggacacga ccttcccgac aactacaacc tgcggacgtg 2100 ctgtttcatc aaacgcggat acgacggcaa cctgcccgaa taccttcagg aacacctggt 2160 tcacggacaa caccagcgca cgcggtcgct aattgtccca aggcacacag catcaagcgg 2220 aaagagcatc atgattgccg gcgttaccct atggaacaac ctgccgatcg aggcgaagat 2280 gataccaaca ctatcgtctt tcaaaagtgc tctgaaacga tcatttcaaa attagttatt 2340 aggccttctg caatttgttt gattttatct ttggatcttt tataacctcc tgttgtttct 2400 tccttgactt gatttaaagt gttatgctaa caagttgacg tgcccaataa aacaaca 2457 // ID BEL-622_AA-LTR repbase; DNA; INV; 296 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-622_AA_; KW Pao_Bel_Ele104; BEL-622_AA-I; BEL-622_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-296 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 296 BP; 78 A; 68 C; 65 G; 85 T; 0 other; tggtcaagca ggacagaaag ggtcaaaagt gagtgaccaa ataagttcca cactacggta 60 tctgttgtta tctcattttt tcgcatgcat tctcttggtc cgtttcattc tgcggagcca 120 taccatcgta ggtaccgtag tgtgtgaaag tgtctctacc cgatcgcttc tcgatcgtgg 180 atcagctggt ccgcacatag caatagatga aatagataac cagaagtaaa tataccgccg 240 cgagtactac acctcgtccg ctttaagttt caataaaagt tgtttctgtt cgcgca 296 // ID Gypsy19-LTR_Dpse repbase; DNA; INV; 325 BP. XX AC Unknown_singleton_87; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy19_Dpse; KW Gypsy19-I_Dpse; Gypsy19-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-325 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1114-1114 (2009). XX DR Genome; Unknown_singleton_87; Positions 2809 2485. XX SQ Sequence 325 BP; 108 A; 48 C; 63 G; 106 T; 0 other; tgttaggttg ttttagtatt ataactctac atttgtattt accttgctgc tagcttataa 60 ttattagtgt taatctacaa ttgtatttac tattaacgtt gtacgaaata tcgtaagtat 120 ctgaataagt atcgatatga ataatcatcc gatagttcta gttctgcgaa ctagaactat 180 gcgagaacta gatcggaata gaaccagaga tcggagctgg gttcaatgag cacttttgcg 240 gtatcaacag gacagagtgg ttggtaaaga attaagtaaa ataaaatcat aaaaagtacg 300 aaacatccgt gtactttctg cgcca 325 // ID I-6_AAe repbase; DNA; INV; 5822 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5822 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1357-1357 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 544..1839 FT /product="I-6_AAe_1p" FT /translation="MDVDVHNSVLKDGDPPPGPSSSTAKTFRIKIYPISFA FT GPFVVFFRKKQIPINVLLISSKIYEKYSSVKEIKKVSLDKLRVVFGSRDDA FT NSLLESHLFAESYRVYAPSNFCEINGVIYDESLECDEVVSHGTGIFKNQAI FT SPVHVLDCHRLSKLFLNDKGSEYVHSNCLKITFAGSVLPDFVKVNNVIFRV FT RLFFPKIMHCERCLLFGHTNQFCSNKPKCAKCGQSHLSSECNKKSDLCIYC FT NQKHNILKDCPVYKNNESKFHQKLKNRNQFSYSEILKNSTDYTAPNIYESL FT TDDEFEDTLNLDNYIYKPPSKRKRIQKNTSKAQVKIDSNAKSFNENFPRLS FT SSSSRIIPGFQRVDSDPSNSNVNENVNTSNDNNNVGENSSILGILEELVEL FT LGFSDFWKKIIKIILPFVASLLDKLNAFGPIFASLFSS" FT CDS 1842..5534 FT /product="I-6_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASNSNNYLNILQWNCRSIIPKIDRLRVMLEHNKIDI FT FCLNETWLSESKIVRIPSFNIIRKDRDIPFGGVLIGIRNGIEFKYINLSLN FT VHFEYIIISVKHTDYHFHIICLYIPPNVNFSLSELKMVLDSVPHPFYILGD FT LNAHNFAWGSDKIDGRGSLILDLIDEYNLNILNDGSFTRIAVPPLQNSCID FT LSLCSNNLSFRSTWSIINDPNNSDHLPILIRMSHCMQNNHQVNSSIPDYSK FT NVDWVKFSDLISFSLINLKDSNTPIENYTKFIKILFNCLLKSQKNKIVCNN FT KKNKRPTFWWDNDCSIALKNKSEAFKKFRRLGSRENYFLYCKAEAWFTRTT FT KFKKRNYWKXFIENLDRDTSLSTLWSVARNLRNYDNSSPVLMEYSDNWIDQ FT FASKICPDFVPQSINFRTKQSINYFPELCSPFSLEELDLALSITNNTAPGI FT DNIKFIVLKNLPYEGKVHLLAIYNSFLLQNVIPLDWRSVKVVSILKPGKDA FT SLADSRRPISLLSCLRKLMERMVLNRLEIWAEHNNILSPCQFGFRKGRGTR FT DCTALLASQIELAFNKKQDTISTFLDVSGAYDSVLIDLLFNKMNSYKIPLI FT LSNFIYNLFSFKIMHFYHNGSSKFTRNSYFGLPQGSCLSPFLYNLFTCDFS FT SVIPNGCFLVQFADDNVISVSGYNREIIRHYMQICLDNIDSWAYDNGFTFS FT VQKTKYIIFSRKHSSVSVDLFLNGYEIEQVFEYKYLGIWYDSKLNWNFQIK FT YIQKICSKRINFLRTITGTWWGAHPSDMIILYKTTIRSVLEYGCFTFGCAT FT QTHFSKLEKIQFRCLRICLKLMNSTHTKSVEILAGVVPLRIRFHELNCKFL FT SQCFTNQHPLIQILSDLHGINPSSKMLNSFNHCSTENILPGFPNTFYNCDI FT NVHSFQTFVDKSLHWELNQIPRHMCVRYAKLCFERKFCGFDQKFIYYTDGS FT LIQNSAGFGVYNIDVVHFHKLKSPCSIFVAELIALYFTCNIIKDHPPNMFV FT ICTDSLSCINSIESSKFNFKTHYLILLIRKLLYDLHSQGFIIKFVWVPAHS FT NIFGNEQADYLAKLGATRGIIYERNVFHSEYYAKIKRNALDSWQIDWDSSD FT KGRWCHSIFPIVDESTWFKKLSVGRNFICMFSRLISNHYICNSHLYRINII FT DSNLCDCGESYQDIDHIVFYCKKYILPRKKIIEKYKSLEITVPISVRDILG FT SKSLPMLKILYNFFRDISYIV" XX SQ Sequence 5822 BP; 1867 A; 810 C; 917 G; 2227 T; 1 other; cattgctgca gtaggcgttg atctggcagg acgttctttt ttcaaccaac tgacattttt 60 ctctgaattt caggagagaa aattttcaag ttgagaggat ttggaccgtt tgtgatacga 120 tttggagaga ggctgattat tgctattcct gtgtggtgcg gtaggagaat atatcctgtg 180 tggtgcggta ggagaatata acctgtgtgg tgcggtgttt gaagatttgg aagcggacgt 240 tattcgtcgg tatttcaaga tctcctgaac aagattcaaa ggagtgttcg actttgggtc 300 atctttcctg tgtggtgcga aagaagtaat tgcctgtgtg gtgcggagga agttattgct 360 gtgtggtgcg cgaggagaaa gttatcaaga ggacgttaca tcaagtttga agcttgaaat 420 ctcatcttcg cgtggaaatt taagtttgga gttcctaagt atctttttat tattattttt 480 atttactgtt ttcattaaaa gttttgattt tctttcaaat tttgatattt ttgcacccca 540 attatggatg ttgatgtgca taattctgtt ttaaaagatg gtgatcctcc accgggacca 600 tcatcttcta ctgctaaaac atttcgtata aaaatatatc ccatttcttt tgctggtcca 660 tttgtggtat tttttcggaa aaaacaaatt ccaattaatg ttttactaat ttcatccaaa 720 atttatgaaa agtactcttc tgtcaaggaa atcaaaaagg tttcacttga caaattacga 780 gtcgtgtttg gttcgcgtga tgatgcaaat tcacttttag aatctcatct tttcgcagaa 840 tcttatcgtg tatatgctcc aagtaatttt tgcgaaatta atggagttat atatgatgag 900 tctttggaat gtgatgaagt tgtaagtcat ggtaccggta ttttcaaaaa tcaagctatt 960 tcacctgttc atgttttaga ttgtcatcgt ttgtctaaac tttttttgaa tgacaaaggc 1020 tctgaatatg ttcattctaa ttgtttgaaa attacatttg ctggatcagt acttcctgat 1080 tttgtaaaag taaataatgt catttttcgc gttagacttt ttttcccaaa aataatgcat 1140 tgtgaacgat gccttctttt tggacatacg aatcaatttt gctccaataa acctaaatgt 1200 gcaaaatgtg gtcaatcaca cttatcatcg gaatgtaata aaaaatctga tttgtgtatt 1260 tattgtaatc aaaaacataa cattctgaaa gattgtcctg tgtataaaaa taatgaatcc 1320 aaatttcatc aaaaattgaa aaacagaaat cagttttctt attccgaaat tttgaaaaat 1380 tcgacggatt atactgctcc gaatatttat gaatctttaa ctgatgatga atttgaagac 1440 acattgaatt tagataatta catttataaa ccaccatcaa aaagaaaaag aattcaaaaa 1500 aatacatcta aagctcaggt taaaattgat tctaatgcaa aatcattcaa tgaaaatttt 1560 ccgcgattga gttcttcatc ttctcgtatt attccaggtt ttcaacgagt tgattctgat 1620 ccatcaaata gtaatgtgaa tgaaaatgtc aatacttcaa acgataacaa taatgttggt 1680 gaaaattctt caattttagg aattttggag gaattagtag aattgttggg atttagtgat 1740 ttttggaaaa aaattattaa aattatttta ccatttgttg cctccctact tgataaattg 1800 aatgcttttg gacccatctt tgcttcatta ttttcatcgt aatggcttca aattccaata 1860 attatttgaa tattttacaa tggaactgta gaagcataat tccaaagata gatagattga 1920 gagttatgct tgaacataac aaaattgata tattttgttt aaatgaaact tggctatctg 1980 aatcaaaaat agtacgcatt ccatcattca atataattag aaaagataga gatattcctt 2040 ttgggggagt tttgattgga attcgcaatg gtattgaatt taaatatatc aatttatcat 2100 tgaatgtaca ttttgaatat attattattt ctgtaaaaca tacagattat cattttcata 2160 ttatttgttt gtatattcct cctaatgtaa atttttcatt atctgaattg aaaatggttt 2220 tagatagtgt tcctcatcct ttttatattt tgggtgattt aaatgcccat aattttgctt 2280 ggggtagtga taaaattgat ggtcgtggat cattaatatt ggatttgatt gatgaatata 2340 atttaaacat tttaaatgat ggatcgttca ccagaattgc tgttcctcca ttacaaaatt 2400 catgcattga tttatcttta tgttcaaata atttatcttt taggtcaact tggagtataa 2460 tcaatgatcc aaataatagt gatcatcttc caattttgat tcgaatgagc cattgtatgc 2520 aaaacaatca tcaagtaaat tcttctattc ctgattattc aaaaaatgtt gactgggtta 2580 aattttcgga tttaatttct ttttcactta ttaatcttaa agattcaaat actccaattg 2640 aaaattacac taaattcatc aaaattttgt tcaattgctt acttaaatct cagaaaaata 2700 aaattgtttg caataataaa aaaaataaac gtcctacatt ttggtgggat aatgattgtt 2760 caattgcttt aaaaaataaa tcggaagctt ttaaaaaatt tcgtcgttta ggatctagag 2820 aaaattattt cttatattgt aaagcagaag cttggtttac ccgtaccaca aaatttaaga 2880 aaagaaatta ttggaaamat tttattgaaa accttgacag agacacttca ttatccacat 2940 tgtggtctgt agcaagaaat ttgagaaatt atgataattc ttctcctgtt ttaatggagt 3000 attctgataa ttggattgat caatttgcat ctaaaatttg tccagatttt gttcctcaat 3060 ctattaattt tagaacaaag caatcgatta attattttcc agagctttgt tcaccatttt 3120 cattagaaga attggatttg gcattatcta ttacaaataa tactgcacca ggtattgaca 3180 atattaaatt tatcgtactt aaaaatttac cctatgaagg gaaagttcat ttacttgcaa 3240 tttacaattc atttctttta caaaatgtta ttccattaga ttggcgttca gttaaagtag 3300 taagcatact taaaccagga aaagatgctt cattggctga tagtcgtaga ccaataagtt 3360 tactatcatg tcttcgtaaa cttatggaaa gaatggtttt aaatcgtctt gagatatggg 3420 ctgaacataa taatatttta tctccatgtc aatttggttt cagaaaagga cgtggtactc 3480 gtgattgtac agcacttttg gcttctcaaa ttgaacttgc ttttaataaa aaacaggata 3540 ctatttctac ttttttagac gtgtctggag cttatgattc tgtacttatt gatttgcttt 3600 ttaataaaat gaattcttat aaaattcctt taattttatc caatttcata tacaacttat 3660 tttcattcaa aattatgcat ttctatcaca atgggtcgtc taaatttact cgtaatagtt 3720 attttggcct tcctcaagga tcttgtttga gtcctttttt gtacaattta ttcacatgtg 3780 atttttcatc tgttatccca aatggttgtt ttttagttca atttgctgat gataatgtta 3840 tttccgtaag tggatataat agagaaataa tacgtcatta tatgcaaatt tgtttagata 3900 atattgattc gtgggcttac gataatggtt ttacgttttc agtgcaaaaa actaaatata 3960 ttatattttc acgtaaacat tcctcagtaa gtgtagattt atttcttaat ggatatgaaa 4020 tcgaacaagt atttgaatac aagtatcttg gtatttggta tgattcaaaa ttaaattgga 4080 actttcaaat taaatacatc cagaaaattt gttctaaaag gattaatttt cttcgtacaa 4140 ttactggtac ttggtggggt gctcatcctt ctgatatgat tattctttat aaaactacaa 4200 tacgctcagt tttggaatat ggttgtttta cttttggttg tgccactcaa actcattttt 4260 ctaaacttga aaaaattcaa tttcgttgtt taagaatttg tttaaaatta atgaattcta 4320 ctcatactaa atctgttgaa attttggcag gagtggttcc actcagaatt cgttttcatg 4380 aattaaattg taaatttcta agtcaatgtt tcacaaatca gcatcccctt attcaaattt 4440 taagtgattt acacggaata aatccttcta gtaaaatgtt aaactcattt aatcattgtt 4500 ctaccgaaaa catattacct ggttttccaa atacatttta taattgtgat attaatgtac 4560 attcttttca gacctttgtt gataaatctt tacattggga attaaatcaa ataccaagac 4620 atatgtgtgt tcgatatgcc aaattatgtt ttgagcgaaa attttgtggc ttcgaccaaa 4680 aatttattta ttacacggat ggatcattaa ttcaaaatag tgcaggtttt ggagtttata 4740 atattgatgt agtccatttt cacaaattga aatctccttg ttctattttt gtagctgaat 4800 tgatagcatt gtacttcaca tgtaacataa taaaggatca tcctccaaat atgttcgtta 4860 tttgtactga tagtttgagt tgtatcaatt ccatcgaatc ttcaaaattt aattttaaaa 4920 cccattattt aattttattg atacgaaaat tgttatacga tttgcattct caggggttta 4980 taattaaatt tgtttgggtt ccagcccata gtaatatatt tggtaacgaa caagcggatt 5040 atttagctaa actaggagct actcgcggaa ttatttatga acgaaatgtt tttcattcgg 5100 aatattatgc taaaatcaaa agaaatgctt tagattcttg gcaaattgat tgggattcta 5160 gtgataaagg tcgatggtgt cattccattt ttccaattgt ggatgaaagt acctggttta 5220 aaaaactgtc tgtcggaaga aatttcattt gcatgttttc aagacttatt tctaatcatt 5280 atatttgtaa tagtcattta tatcgtatta acataattga ttcaaattta tgtgattgtg 5340 gagaatctta tcaagatatc gatcacatag ttttttattg caaaaaatat attttgccta 5400 ggaaaaaaat aatagaaaaa tacaaaagtt tagaaattac agtacctata tctgttcgtg 5460 atattcttgg tagcaaatcg cttcctatgt taaaaatttt atacaacttt ttcagagata 5520 ttagttatat tgtttgatac tgctttgctt tttatctctt tcgtttcaga aatcaagaaa 5580 ataaccttcc cttgtaccct gcaataatcc tccctttttg gttacttcct tttcaagaat 5640 tcggctctga tatggatcac ttccggttga gcctttagtt ttaagattta tttttgtaac 5700 gtaataagaa aagataaaga ggttttgtgc ccttttgaga acgatttcca attgatatca 5760 ctcaaagggg tttttccctc tttcaaaatt tttagttcaa taaataaata aataaataaa 5820 ta 5822 // ID Gypsy-55_CQ-LTR repbase; DNA; INV; 1301 BP. XX AC AAWU01036466; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-55_CQ_; KW Gypsy-55_CQ-I; Gypsy-55_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1301 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 490-490 (2011). XX DR Genome; AAWU01036466; Positions 6931 8231. XX SQ Sequence 1301 BP; 362 A; 320 C; 290 G; 329 T; 0 other; tgtaaccggt tgttacagtc taacatcata aattcaatta actaaaatca ataaataaat 60 aaatatggac attcacagat cttatgctat gtttagtagg gaattaggga atattaaaat 120 aattgttaaa gagttagtag ggattgatta aaatggaaaa taaataatta aataaaaaca 180 cataatttgt caacacactc tacaacacac cgtctcaggg ggaacaaagt cacagacaat 240 gacggggttg tcggagtaca tcgtgttcgt cgatttgggg gtggaaagtg aaatgaggat 300 tgacatctat cctgtgtgac gtcacgccgt gtcacaggcc ccaaaggctt aaagggatct 360 gatttagact ggcaacggtc atttcaacgc aggctagtgc gcggtcctaa accgtttttg 420 tggagctata aaagtgaagt gcagtgagag aaagagtggt ggcagatttt gctggagtat 480 ctcaccccgt tcaggtacgc taaacacccc tatagtacgg ctctatagga cggctggtta 540 ttttaaacat actaattaag tatggcctca cacgcacttc tttgtccttt aggacctttt 600 gtgtgccgtt ttacggccat tttgcgatcc aagcacgccg cttggacgcc ccgatcgcgc 660 acttctcggc ctggttttgt aagtgcggtg gagccgtcta ccacgtctta ccgtcaaaga 720 gaaccacatt cgctcggacc agcgggtcct aatcccaaag gagcacgctc gggcctgatc 780 ggctctaaac cccaaggagc gttgataccc ctctcggtgc aggccaaagt gggcaggtcc 840 ccggccggcc gatcacgtca acgagggaag gcctgttgat ccctttagcc gagctccacc 900 acaagaccac ccccaccacc ccaagcaccc ggaagttccg ggattccagt agcatcgcgg 960 gcgcccgtcg atcagcagaa gctgatcaag cagtgcatcg acagcagtcc agcggtacgt 1020 acagcagcag aaacatcccc cccccccaca ctacacttta caataaacac acactttaaa 1080 tagattttcc ccgagttttc aataaatttt cctttgtcta cctgtaataa ctttgattat 1140 ttttccacca acccatagcc caaagaattg tgtttgtcct agttcatcct cttaacgccg 1200 ccgaccctgt gattaaatta agtccaggtt gtccgtaagc tagtgttggc aagatatctt 1260 tttaaatggg taagtcctaa aagggtatta attggcgccc a 1301 // ID Crack-14_AAe repbase; DNA; INV; 4676 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-14_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4676 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1230-1230 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >95% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 325..1359 FT /product="Crack-14_AAe_1p" FT /translation="MDNEPICCVVCKKQEPERNKLITCMYCFSSAHFKCRN FT IIGSAINRVRANMYFCTTSCSEIYKRIVDMQTNHSSMISSLTSELKATVSS FT VVADQMVNVKNEVRSVTTAIEKSQDFLSAKFDDIVSDFKDLKAENECLKQR FT INELTESHSKLTSFVHQLEANVDKSDRKAISKNAVLLGLPCIQNENVLATV FT HNTIAQVGAQIEHDSIVSATRLFVSNKSNVMIPIQIEFKDVSVKEMVLSKK FT REFGKIVSTNINENFLVNGKPTTVSIRDELTPLSLELLRKMRESQELLKIK FT FVWPGRGGGILVKKHEDSKPDVIKTRDDLNRIMNAYSVAMNQSPSPKRKKN FT VY" FT CDS 1437..4319 FT /product="Crack-14_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDKSLNYYHNHIDEFNVSCGRNSSKYLRVLQWNVRGI FT NDMSKFDNILLSIDHFVVPIDVIILGETWIKADNTGLYKINGYNSIFSSRE FT NSSGGLAVYIRDTIKYNVLENLSSGGFHHISVELKMNGEKYEIHGVYRPPS FT FDFNVFYEFIECWLNKSNSHPCFIFGDVNVPVNLINNNVVLKYKYLLQSYN FT FLCSNSFVTRPASSNVLDHVICKMSDAHRLYNHTILNDVSDHSLILSEFGL FT RFRSDRVVLSKNIVNHDKLHRDFKSFIDTIGVIDNVDTCLSNIIAKYNTLL FT KNCTRTMSKTVNIKGTHCPWMNLELWSLIKLKNNYLKKCRKYPNDSHLAEL FT LKHISSKLDVAKKQTKKRYYENLLNNTTHSKLWKNINSILGKSKTNTELTL FT NVNGQYTSDAKMVCEAFNEHFSTVGCKLAQQIPSTLENPLQYINPISESCF FT LQPASINEVTLLINDLNSNKSSGPDCIPAKIIKNNVVAFSRILADSFNLII FT ETGLYPECLKVAKVIPIFKSGDRCLADNYRPISTLSVFNKIFEKLLVTRLM FT KFFDRNNVLYSMQYGFRSGCSSSIAITELVEKILEETDSKKFVGALFLDLK FT KAFDTLDHNILLSKLDLYGIRGVVNNIIRSYLNTRKQLVSVDGVNSSLKEI FT STGVPQGSNIGPLLFLIYINDICKINLKGTPRLFADDTALFYPNINPELIT FT TNMCEDLRLLQSYFACNLLSLNLKKTKYMIFRSPRKAQPNTPSIIIGTATI FT EKVDSFKYLGLTLDCTLSWDQHIKTVCNKISAMCGVLKRVVSYLPRKALLL FT FYYAHIHSHLNYLTLSWGRACKSKLKKLQTLQNRCLKIIFNLPILFSTVRL FT YTELPHRALPILGICEQQTLQMVHNVLHNPLLHHNITLNIASRIHNTRQFN FT TLSRSRAHSNFGQKRFSFLGPSKYNVLPRHLQQITNSLSFKTNLKEFLRRR FT IPHLLI" XX SQ Sequence 4676 BP; 1513 A; 829 C; 804 G; 1530 T; 0 other; cacacttgtt ggttcaccag tggttcctta cgcggttgtg ctctcaaagc ccatgcatat 60 attagtaatc tagtcataaa ctgaacgagg tatttgctag aataacatgc tcgcaagatc 120 cgtcaatgaa tgatcacctt atcagtacgc aattcatatt aaatatattt atttgaattt 180 tcttgaccgc ttctgtactt cgtaatattg atttaacgta atagtgaaac aattcgttga 240 gcttttaggt tgtccttgtt tcgttctgtt gaatatctgt tttttttttt atcttgcagt 300 cttgcctctg atacgtatat taacatggat aacgaaccga tctgctgtgt cgtatgcaaa 360 aaacaagaac cagagagaaa caaactcatt acatgcatgt actgtttctc ttctgctcat 420 ttcaaatgta gaaacattat tggaagcgcg ataaatcggg tgagagcaaa tatgtatttc 480 tgcaccacta gttgctcaga aatttacaaa cggattgtgg acatgcagac gaaccattcg 540 tcgatgatta gttctttaac atctgagctt aaagcaacgg tatcgagcgt cgtagctgat 600 caaatggtta acgttaaaaa tgaggttcgg tctgtaacta cagccattga aaaatcacaa 660 gattttttgt ctgcgaagtt tgatgacatt gtgtcagact tcaaagatct caaagcagaa 720 aatgaatgtt taaagcagcg tatcaatgaa ctaactgaat ctcattcaaa actaaccagt 780 tttgtccatc agttggaagc gaatgtagat aaatctgacc gcaaagctat ttcaaaaaat 840 gctgttctgc tagggctccc atgtatacaa aatgaaaacg ttttagctac tgtccataat 900 actatcgctc aagtcggagc gcaaatagaa catgactcga tagtatcagc cactaggttg 960 tttgtcagca ataaatctaa tgtgatgatt ccaatacaaa tcgaattcaa agatgtgagt 1020 gttaaagaaa tggttctgtc caagaaaaga gagtttggta aaattgtgtc aactaacatt 1080 aatgaaaact ttttagtaaa tggaaaacca actacagtta gtattaggga cgaattgact 1140 ccgctttcat tggagcttct tcgtaaaatg cgcgagtccc aggaactgct gaagattaag 1200 tttgtttggc ctggaagagg cggagggata cttgttaaaa agcatgaaga ttcgaaaccc 1260 gatgtaatca aaaccagaga tgatttgaac cgtattatga atgcttactc ggtcgctatg 1320 aaccaatcac catcaccaaa gagaaagaag aatgtttatt agtttgttcc atcatcaatt 1380 tcaatgtaac agtagtctta tgtgtgtatg tttatgtctc aatataaatt ttagtaatgg 1440 ataagtcttt aaattattat cataaccata ttgatgagtt caatgtgtct tgtggaagaa 1500 attcttcgaa atatttgcgt gttttacagt ggaatgtccg tggaataaat gatatgagta 1560 agtttgataa tattttgttg tctattgatc actttgtggt acctattgac gttataatac 1620 tgggagaaac gtggatcaag gccgataata ctggattata taaaataaat ggttacaata 1680 gcattttttc tagtcgggaa aattcgtcag gtggattggc tgtgtatatt agagacacca 1740 taaaatataa tgttcttgaa aatctatcta gtggaggatt ccatcatata tctgttgaac 1800 ttaaaatgaa tggggaaaag tatgaaattc atggtgttta tcgaccacca tcatttgact 1860 tcaatgtgtt ttatgaattc attgaatgct ggcttaataa gagtaatagt cacccctgtt 1920 tcatctttgg cgatgttaat gtacctgtta acctaataaa taacaatgtt gttctcaaat 1980 acaaatatct tttgcaatct tataactttt tatgttcaaa ttcatttgtt accagaccag 2040 ccagttccaa tgttcttgac catgttatat gtaaaatgtc tgatgcccat cgcttatata 2100 accatactat tttaaatgat gttagtgatc attcgctcat attgtcagaa tttggtttac 2160 gtttccgttc agatagagtt gtgctgtcga aaaacatagt taatcacgat aaactgcata 2220 gagatttcaa gtcttttatc gataccatcg gagttataga caatgttgac acttgtttga 2280 gtaatatcat tgctaaatat aatacgcttt tgaaaaactg tactagaaca atgtccaaaa 2340 cggttaatat taaaggtacg cattgtcctt ggatgaattt agaattgtgg tctttgatca 2400 aacttaaaaa caactatttg aaaaagtgtc gtaaatatcc aaatgattct cacctagccg 2460 aacttctaaa acatatatcg agcaaattag atgtcgctaa aaagcaaaca aaaaagaggt 2520 attatgagaa cttgttgaat aacaccaccc attcaaaatt atggaaaaat atcaactcta 2580 tcttaggaaa atcgaaaacc aataccgaat tgacactgaa tgtgaatggt caatacactt 2640 ctgatgcaaa aatggtatgc gaagccttta atgagcattt ttcaacagtt ggctgtaaat 2700 tggcccaaca aattccatct acacttgaaa acccattgca gtacataaat ccgatttcgg 2760 aatcatgttt tctacagcct gcttctataa atgaggttac acttcttatc aacgacttga 2820 attcgaacaa aagttccggc ccagactgca taccggcaaa aataatcaaa aacaatgttg 2880 tagccttttc ccgtatactt gcggattctt ttaacttaat aattgaaaca ggtttatatc 2940 ctgaatgcct caaagttgct aaagtcatac caatttttaa atcaggagat cgctgtttag 3000 cagataatta tcggcctata tccactttat ctgtatttaa caaaatattt gaaaagcttt 3060 tggtgacacg tttaatgaag tttttcgaca ggaataatgt attatacagc atgcaatacg 3120 gttttcgctc tgggtgcagc agttcaatag ccataacaga actagtcgaa aagattcttg 3180 aagaaactga ttctaaaaag tttgtcggtg ctctgtttct tgacttgaaa aaggcatttg 3240 atactttgga ccacaacata ttacttagca agctagatct gtatggaata agaggtgtcg 3300 ttaataatat tattcgtagc tacttgaaca ctagaaaaca gttggtatca gttgatgggg 3360 tgaacagttc tctaaaagaa atatctacag gagttccgca agggagcaac attgggcctc 3420 ttctttttct aatttacata aatgatatat gcaaaattaa tctgaaaggt actccgaggc 3480 tttttgctga cgatacggca ttgttttacc ctaatataaa tcccgaacta ataacaacaa 3540 atatgtgtga agatctgcgt ttacttcaaa gttattttgc ctgcaatctc ctttctttga 3600 acctgaaaaa aacaaagtat atgattttca gatcaccaag aaaagctcaa ccgaatacac 3660 ctagcatcat tattggtact gctactatcg aaaaagttga ttcctttaaa tatttgggtc 3720 taacccttga ttgcacatta tcttgggatc aacatattaa aacggtctgc aataaaattt 3780 cagcaatgtg tggagtcctt aaacgagtag tatcctacct tccacgcaaa gctttacttc 3840 tgttttatta tgcccatatt cattctcatc tgaattatct tactctttca tggggtagag 3900 catgcaaatc taagctcaaa aaacttcaaa ccctccaaaa tcgctgcctc aaaattatct 3960 tcaatttacc aatcctgttt tcaactgtcc gtttgtatac tgagctaccc catcgcgctt 4020 taccaatttt aggaatctgc gaacagcaaa cgctgcaaat ggttcataat gttctccaca 4080 atccgttatt acatcataac ataaccctca acattgcttc tcgtattcat aatactagac 4140 aattcaatac tctctctcga tctagagcgc attccaattt tggtcaaaaa cgtttttcct 4200 tccttggtcc ttcgaaatat aacgtcttac ctagacatct ccaacaaatt actaacagct 4260 tgtccttcaa aactaacctt aaagaattcc tgcggcgaag aattccacat ttacttattt 4320 gaattgaaac tatgtcgtaa aaacaaaaaa aagttaatcg tctccaaaca attagtttaa 4380 ctgtttagta ataataattc gtatttattt ataatttcgt atttgtagtg taaaattttc 4440 tttttcatat gtgcttcttt aaaaggattt acaatccact agaagcactt agcttagtta 4500 ttttgtttca aactccctgt tgttgtcatt tttgattatt ctcagcctct tataaatatt 4560 tgttttctta atttagtgtt gcctgtggct gagaattgag tgtccattac cagggggctc 4620 gtcatgagct ttttggtatg ggggagagcg gagggtcact caaaaaaaaa aaaaaa 4676 // ID Gypsy14-NVi_I repbase; DNA; INV; 10944 BP. XX AC NW_001820566; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 20-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14-NV; KW Gypsy14-NVi_LTR; internal portion; Gypsy14-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-10944 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1208-1208 (2007). XX DR Genome; NW_001820566; Positions 131000 141943. XX CC Positions [6527-7096] - Reverse transcriptase CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(5453..6883,6887..9718) FT /product="Gypsy14-NVi_I_2p" FT /translation="MVVETQQNIKSKSEEKSEVALNERFETAKVRFLLAGR FT QPPIVKLNSPQLKEGQGTFYADSGADISVLKRGKLAACYPIDTSKIIKIQG FT VTPGASHTLGQAVIKLQGLACNVHIVPNDFPIENSGIIGWDIIDAHRGCVD FT AADQCLRLGKEQIPFESDERVTIPPRVKMIIGARVKNSDVKVGWVPITNIH FT PDILFGNFVAENRNGRVYAECINTSDEEISIANPVIELLECKTMEENPLYQ FT ADGDGSAESTAKFTANLRRMFDISKKVEKYKEVESLNRRLLADEQARIERV FT EKIRKLADLEGCNDEEIGYIRKIISDFPGVFGLDVEPLPATHLLKHKIVLK FT SDKPIKSNRFRFPPALKENMLRELEKLREQGIVVPSNSNYSWSLWIVPKKP FT DAQGNKRFRLVTDFRALNEETEGSCHPLPFTSDILEHLAAANYITVMDLKQ FT GYHQIEMDPESVHLTAFYAPDGRHGNQLLQFSMAMGLKEATITFTKAMSLA FT MKGLQGEEVEIYLDDLMVFSETLDEHKERLRRVLGRLLEANMTVEPKKCQF FT LKKEAHVLGHIVGGGRIRTDPEKTRVMAKYPVPTDAKKLKQAVGLFSYYRR FT FIKDFAKIARPLFLLLQKNAEFVWGEEQETAFNILRELMSKEPVLKAPDMS FT QPFIVTTDSSDWALGAILSQGKLGADQPCAYASRCLKGSELKYPIYDKELL FT AIVFAKEQFRYYLYGRKFTVVTDHESLKHFHNTKKPDLRFNRLKAALVGYD FT FDIVYRPGEKNANADALSRNPVITEGQINPDLPRAELYKLANKQILESPDE FT EAGAPPGRIFRTRAIKRQGIDKDRKIKLSTTSSASSNKSTRIRAKRKPKRV FT IYKAGEYLAIRNVENSFYIGRILLDVLDGDEVVNLRWLTEIDDCKGTYRRD FT YCGKINVECILACVVIKQVGKNMYRLGPNESKRIESILRESIEFCEAAKIG FT RREVNESLIDETSDSDISMQSICTSVAASSRGGVKLPPDLQKYLSYVEPAG FT PVTRSAKQSAKATELYEPSASSSSDTERAAPPSGKTHADLESLQSFKFSAS FT NSSQARSPVNKASSNVSTDRHISSDESARKSMSIGSPRLEPLLGTKKVSNM FT PASWPWGSDGTKRKLAITVEGQVVNGKNGVQILVHERQSTVAPSCSIPRTG FT GCSENLNNAKDKLNAGPAVIDKNKLGPRKEKNLIPTKQLEYQAIGTGNSST FT FADYEGVGPAESFEGISRNYERKPEPSDENSSASSGNVLIQTYIRKNRCAE FT PLVELVRSDDIPNSPAGNCLFFSLIKLAKLHLSATELRELLLESPMLHACG FT EPAETERILRSESEYGNIDCAFLFAHAFKMNVCIHYDVTQSKRIFCHIVVD FT GATDVAHLNLSGLHFTPYDRVKARQRTAKLPEIRRAPSPSGDSDEEMPRSP FT SARKKTK" XX SQ Sequence 10944 BP; 3427 A; 2637 C; 2683 G; 2197 T; 0 other; cacaagcatg cggtcctagg tgccaaaatt caagtctaat cgaggaagcg acatcagtca 60 cgcgtcaagc ggaacgtgtc ctggatctga tgctcgtcaa cgcggcaaac ggagagcagc 120 atcgcgctcg ccgatctcta ctccccttcg tcggttccat ccataaattt atacgggacc 180 ctagatgagg acgacaagag agaaattcag gcatcaatta aactcatagc cgacgatacg 240 cgccaaacag cggcgctatt agctgagcag acggaggtcg tcgaacgcga gctatctgac 300 ctaggcgata gagtaggcaa actagaggct atcgtagaag ttttagaaga caagaccgct 360 tctattttca gagaattggg ggctctaaac acaatagcag aaatgaggga tgcactcaga 420 cagtttagcc tagactcgga agtcctgacg gatgcaatct tattcgcagc gaaaggaatg 480 gtgcatccgc gcattgtacc cccagataca attcgagacg cagctagaac cgtcgcaaac 540 tccgtaacca atgccagatt tccgatgccg gaggaagggt ttgcaatcat tcctattatg 600 aagatttcca aattaacggt gttgttatca gacggatgtt taatctatca gatagcaata 660 ccgttactcg acattcagag atacaatttg ttcaaagcgt caccgctacc ggcaatccaa 720 aaagttttta atattcctta tctggcagcc tacatctggc cagagtacaa ctacttcgcg 780 gttagtgagt caaatcgcac ttacatgccc ttattaccgg aagaggtaac aagccttcga 840 aggctcgaca agctaatgat cgcggtaaac ccagagccag tgcgggaaat aagaagcaat 900 gccgcctgcg aggtaaagat cgcgtcgggt cgacaagtct ccaatcccga agtgtgcgac 960 atcagattca ggcagttgag agataccttc tggttgagac tgcataaagc gaactcgtgg 1020 atattttccg ccaaatcacc ggaagatata tttatccaat gcctgcgatc cgagcacgta 1080 gcagcgaaaa taagtggtgc aggcatactc gaactgcaac ccggatgcgc agcgcataca 1140 gcaaacgctc gcgtcccgcg ctttctcgac taactcaaat caatcaaaat tcgaaactat 1200 agagtttaac gtatccgaag taattaagct tctaaacgaa tcagaaattg ccgaagcgga 1260 attcagaaac gcaatcgcat ccgaagcagc gaatcgggta aaaagcttgc acggggtcaa 1320 gttagaaaac ctaaaagcag gggcgagatt acacgagatc gcaagtaaag cacgtgaaat 1380 cgcgcacagg aaggcgaccg ccctcgagct cgaaaatttg ggcagcacaa ctgcaatctt 1440 cgggtggtcc ttaacttcag ccgtaatatc tatagctatc ataatcacaa tcggcggtat 1500 aatctatcgg cggcgttcga acggagcaat aaatcgcttt gtcaaacagc aagagtctct 1560 ctttatgctg gagcagatga gcaggtattc acgagagaga aaacgcgccg agcggagtca 1620 agaataaaaa aaaaaaaaaa aaaaaaaaaa aaaaaattct tatccgctct atattttaac 1680 tataactatg attatcaatt acgaataagc caagctgcgt accatcatgg acaacgccct 1740 aatacataat gaaatacaat taagtccgtt tcaattttat acttttgaca atctaacttg 1800 tataatacat aaccatttac agttagattg atagtatata agaaaattga agcgcttatt 1860 aaaacagact gctcagcgat gagcaagcta aaaataaata ctgtatctag gttaattttc 1920 acatatatgc accagaatat gtcaaataat ttttgtgcgt cataaagaca ttctaataac 1980 catgtaagcc tatcgataac aaccaaatat tcataacaaa cattttataa ccttttgtaa 2040 gagctccgct cgaaagaaat gtaagacagc agcggctgcc agggtagaaa aacacccggg 2100 ctcacaacac agacatacat tcagatacta cacacacctt tggattagaa ccccaatcag 2160 cacctcgcga ggctcaagcc tctggcatcc cacgacacac ataaaataga aatttttttt 2220 tttaacactt agaacagata cacactcgag cgtacacgcg acctccgtgc gtcgttcctg 2280 gatagcctgt ttttttttca ggcgacgagg ggcgtcggcc agcgcggtcc gggcgtgtca 2340 cggacgagcc ccaggctctc cggtgacggc tcgggtcgcc gctgccgacg gcgcatgcaa 2400 tgctccgtca gcagagaacg gcggactcgg aaggttcgca cgggcaaagg tgcatcgggt 2460 tgcaccttcg cgcgtgcgac cgctaaccgt cgacgagccg cgtcgatggc aaagcgggca 2520 gttgatcggc ggcagagctc gcgagcggag ctcacaacct gcaataaagt ttactagagc 2580 aaggtcagtc gttgtctttg gggcaggacc ccgtcatcac ccgccacatc tggcatccct 2640 tagcggggcc ccgaaaaacg acacctgcaa tcgatttaaa ttaatcggtt accgagtcta 2700 aataaaagaa agaacgataa gcagtgcttg tgaatattcg agtgtaaatt cgcaagaatc 2760 ttaagaagct cgtaagcctc gtgccgccaa agctccaaaa acaaaactcg gggctccacg 2820 agaaggaagc tgaggtcaca aaaaaaaagt ggctccaact cgcccgtgtg actttgaaga 2880 aaatcagtca aacggcctca gcaccggaca cccgccggcg tgcgttacct gggcctcagc 2940 gaagccaact ggccaccatc ctatctgggc caccgaggac cagccgtcgg tgcgcgggtc 3000 aaccacggaa gcagccacca cgcgcgctca atagtaggcg cggactagcc gtcggcgagc 3060 gcgatccatc ggacaacatc tcgcagataa catccaccat caagacatca tgcgcccaac 3120 cttcgtgctt atcatcgcaa tgtaagttgt ccatgatatt tttacgtagc ttggcttccg 3180 ggtaggtcgc ggtacagccg cattttgttt actttgtttt atcatttccg ggtcttcgtg 3240 taagtttagg gagtctttag cataagctga gtgcgcgtaa cgcagcgaac atggccccgt 3300 ctggcctcat tcacgcggcg tggcaagcgg taaaaagctc taaaaccgaa gagagaatgc 3360 agccgaacac cggaccaaaa aagcagttta aacgggctag agcggttttt agaattacgc 3420 gaaaaatgtc gcccagatcg caaagagcca cagcgactgc gttaaggtcc ccggtcagcg 3480 actccagcag cgcgtgcaat gaccaccttt gttctggtcg atcgaaacgg ggttggacgg 3540 catctcaaag ggacgcaata gaaaagcgca tgaatgaggg ccggtacacc gataaaccgg 3600 aattatcgga cgaagagagt cccgaacaag gcctcagaga aaacagcgtt atgctacggt 3660 ctagcatact cccgaacgat gaaaataact gcgcgctgaa gctcggaagc gcggcacagc 3720 caaatgaatt aaagaacctg gaaaataggc taggcaatct atgcttaagc gagaacaccg 3780 ttcgcgaaaa tttttttacc ggagtagccg gagtgaagta cgcgccgaaa ctcaaatcac 3840 ggattagctt gcgcagtagt ccgacccgca gcaactgtcg cgaagtaaat ttttataccc 3900 tagctcccgc ggagggcaaa tgcgacgcgc agcgggcgat taactaccgg gagaagtaca 3960 gccgccggcc aggcaggaca agcgcgagaa acacgcgagt ccagcgcagt ctaagcgcga 4020 gtgcgtctag cgagtccgac gagtacagct gacatagcgg caacagcaac aacagtcgtc 4080 gtcggcgtag tgatagcgat agcagccgcg gtaacaaaag ggatggcgac gataaatatc 4140 gaaagcgcgg ggacaaaaat aacgagagac ggcgcaagca cggcgacgac gaagatagaa 4200 agcgcgatcg atcaaagcag gacaggagcg cgtccggtaa tcgcgatagg tacaggcgcg 4260 aagatagaaa attaaaaagc gataagcgta ggcgcggcag cgagagtgaa agcgatgcaa 4320 gtcgcaagca ccgagacaaa tccaggcata gccatccctc tcgtcaaaag aagtattcgg 4380 atgacgagga tagcgaagcc gaactagtac atccctcggt tgcagaggct gccgtaaggc 4440 gaatgacctt tcaccaagcg gctagctacg taccttactt tgacggagac cccgacaatc 4500 tatcaatttt ttgcaacgcg gtccgcaaag tgcaaaagaa gtttggaccc gaaaatgagg 4560 agttcctttt gatgcacgtc gctaataggc tgcaaggaaa ggccacggag ggatatcgct 4620 cacgaacagc ccgttataaa actatcgatg aatttttgcg agatctgacc ctacatttcg 4680 cgaacatcgg cgtcgcccac caaatacagg gtgaaattcg cacgcttcaa cagggcgtat 4740 tagaatcggc aaacgattac ggcatgcgtg ccgaaaaatt atataatagg ttgagaacaa 4800 ttataggatg tgccccggac atatccgaat ccgatagggc gtctaaatta cagcaagcag 4860 acatcgaagc cctcgagcaa tttttactcg gcctaaagtc tccgctcaat ttcctggtac 4920 agcttaagca gccaaaaacg ttaaactaag caataacatt cgccatcgaa tacgaaggta 4980 aatagggtac gagaaacgcg cctagtaaca tcgtgagtcc tgtagatccg gtggcacaac 5040 tgcggttatt attaagcgag ggaattcttc ctaaatcagc acagactgtc aacgaaccac 5100 cgccacaaac aagtaccacg caggagaatc agcaaggcgc agagaaaatg tgcggttact 5160 gccaaacaga cacgcacaat acagaagatt gtcgtacgct gatgtttcat gcaaaaaacg 5220 gaatgatctc gcggccccct cgcaaaccta aaaagaatgg caacagggat cgccgctaca 5280 ataataaaaa taacaacaag cagaaaaatg gtaataatag cccggacaac gcaacagaac 5340 acggcaaaaa tttaaactag atagacgttc gccacgttcc ccaggcgacg agtgtcaaaa 5400 agaaaaagct gtgaacgaag cggcaactaa taatcagccc acaaaaaatc caatggtagt 5460 ggaaacccaa cagaatataa aatcaaaatc tgaagaaaaa tccgaagtcg cgttaaacga 5520 gaggttcgag actgctaagg ttagattttt actagctgga agacaacctc caatcgttaa 5580 actaaatagc cctcagctaa aagagggaca aggtacattc tacgcggact caggggcgga 5640 catctccgtc ttaaaaagag gaaaattagc ggcatgctat cctatagaca cgagcaaaat 5700 tatcaagatc caaggcgtaa cgccaggagc atcgcacacg ctgggccaag cagtaataaa 5760 actgcaaggg ttggcttgca acgtccacat agtgcccaat gacttcccta tagaaaactc 5820 gggaatcatt gggtgggaca tcatagacgc gcacagaggt tgcgtagatg cagccgacca 5880 atgcctaaga ctcggcaaag agcaaattcc gttcgagtct gatgagcgtg tgacgatacc 5940 cccaagggtg aagatgataa tcggagctcg cgtcaaaaat agcgacgtga aagtcgggtg 6000 ggtgcccata acaaatatcc atcccgacat cctctttggt aatttcgtag ccgaaaacag 6060 aaacggcaga gtctatgcgg aatgtatcaa tactagcgat gaagagattt cgatagcaaa 6120 cccagtaatc gagcttctcg agtgcaagac aatggaagaa aatccgctgt atcaggccga 6180 tggggatggc tcagctgaaa gtaccgccaa gttcacggca aatctgcggc gaatgttcga 6240 cataagtaag aaagtcgaaa aatataagga ggtagagtcc ctaaatagaa gacttctcgc 6300 cgacgaacaa gctagaatag aacgagtcga aaaaatccga aaactggcag acttggaggg 6360 atgcaacgac gaggaaatcg gttatatacg gaaaatcatc agcgattttc caggagtgtt 6420 tgggctagac gtcgagccac tgccagctac gcacctgctt aaacacaaga ttgtacttaa 6480 gtcagacaaa ccgataaaaa gcaatcgatt tcgatttccg ccagccctaa aagaaaacat 6540 gctcagagag ctagaaaagc tccgcgagca agggatcgtt gtaccctcaa attctaacta 6600 ttcgtggtca ctgtggatcg tgccaaaaaa gcccgacgcg cagggaaaca agcggttccg 6660 cctagtgact gattttagag cgctcaatga agaaacagaa ggcagttgtc accctctacc 6720 tttcacgagc gacattttag aacatttggc cgcagctaac tacattaccg tgatggacct 6780 aaagcaaggg taccatcaga tagaaatgga cccagaatcg gtccatttga ccgctttcta 6840 cgctcctgac gggagacatg gcaatcagct gttacagttt agctgaatgg ctatgggcct 6900 gaaggaggcg actataacat ttacgaaagc aatgtcattg gccatgaagg gcttgcaagg 6960 cgaggaagtc gagatatatc tcgacgacct catggtcttt agcgagacac tggatgaaca 7020 caaagagagg ctgcgtcgag tactagggcg attgctcgag gctaatatga cagtcgagcc 7080 caaaaaatgc caatttttaa aaaaagaggc tcatgtactc gggcatatag tgggcggagg 7140 tcgcattcgc accgatcccg agaaaacacg ggtgatggct aaatatccag tgccaactga 7200 cgcgaagaaa ctgaagcagg cggttggcct gttcagttat tatcggcgat tcatcaagga 7260 cttcgccaaa atcgctaggc cactgttctt gctcctacaa aaaaacgcag aatttgtctg 7320 gggcgaggaa caagagaccg cgttcaacat attgcgagaa ctcatgtcga aagaaccagt 7380 gcttaaagct cccgatatgt cgcagccttt cattgtcacg acagacagca gtgattgggc 7440 cctcggcgca attctgagcc agggaaaact tggcgcagac caaccgtgtg cctacgcttc 7500 acgctgcttg aaaggcagcg aactaaaata tcccatatac gacaaggagc tgttggcaat 7560 agtgtttgca aaagagcaat tccgctatta tctgtacggt aggaagttta cagtcgtgac 7620 ggaccacgag agcttgaagc attttcataa cactaaaaag ccggacttga ggtttaatcg 7680 cctcaaagcc gccttggtcg gttatgattt tgacatagtg tatcgccccg gcgaaaagaa 7740 tgccaacgca gatgctcttt cgcgaaatcc ggtgatcacg gaggggcaaa ttaatccgga 7800 ccttcctcga gcggagttat acaagctggc caataagcaa atcttagaaa gccccgatga 7860 agaagcaggt gcccctccgg gaagaatatt tcggactcgc gccataaaaa ggcaaggtat 7920 cgataaagat cgtaaaataa aattgtcgac aaccagttca gcaagctcaa acaaaagcac 7980 gcgtataaga gcgaagcgca agccgaagcg cgtgatctac aaggcgggcg aatacttagc 8040 gataagaaac gtcgaaaata gcttttatat aggccgcata ctcttagacg tcctcgacgg 8100 cgacgaggtt gtgaatctgc gctggttaac tgagatcgat gactgtaaag gtacgtacag 8160 acgcgattat tgcggaaaaa taaacgtaga atgcattttg gcatgcgtgg tcatcaagca 8220 ggtgggaaaa aacatgtatc gcttaggacc aaacgagagc aagcgcatcg aaagtattct 8280 aagagagtcg atcgaattct gcgaggcagc gaaaattggc cggcgtgaag tcaacgaaag 8340 cttaattgac gaaacatctg actcagacat atcgatgcaa tcgatatgca cctctgtcgc 8400 tgcaagcagc aggggtggcg taaagttgcc tcccgacctt caaaaatatt taagctacgt 8460 cgagccagca ggccccgtaa ctcgcagtgc gaaacaaagc gcaaaagcaa cagagctata 8520 cgaacctagt gcatctagta gcagcgatac cgagagagca gcaccgccct caggaaaaac 8580 gcatgcagac ttagaatctc tgcagagctt taaattctcg gcaagcaatt cgagtcaagc 8640 tagatccccg gtaaacaaag cttcatcgaa cgtaagcacg gaccggcata tttcgtcgga 8700 cgaatccgca agaaagagca tgtcgatcgg gtcgccccgt ctagaaccgc tgttaggcac 8760 taagaaggta tctaatatgc cagcaagctg gccatgggga agcgacggta ccaagagaaa 8820 actggcgatc accgtagaag gacaggtggt taatggaaaa aacggtgtac agattttggt 8880 gcacgagcgg caatcaaccg tcgctccgag ttgcagtata ccaagaaccg gggggtgctc 8940 tgaaaatctt aataatgcta aagataagct gaatgctggt ccagcagtta tcgacaaaaa 9000 caaactcggc cccagaaaag agaagaatct gatccctaca aagcagctcg aataccaagc 9060 catcggtacc gggaattcca gcacattcgc agattacgag ggtgtcggcc cagctgaaag 9120 cttcgaaggg atatctagga attatgaaag gaagccggaa ccatcagacg agaactcctc 9180 cgccagctcg gggaacgttt taatacaaac atatatacgt aaaaaccgtt gtgcagaacc 9240 actggtcgag ttggtgcgct cggacgacat tccaaactct cccgccggta attgcctgtt 9300 cttctcctta ataaaattag caaaattgca tctatcagcg accgagttaa gagagctcct 9360 cctcgagtcg ccaatgcttc acgcttgtgg agaaccggcc gaaactgaaa gaattttgcg 9420 ctctgagtcc gaatacggaa acatcgactg cgcgttcctc ttcgcgcatg cctttaaaat 9480 gaacgtgtgc atacattacg atgtaacaca gagcaaacgc attttctgtc atatcgtagt 9540 ggacggcgct accgacgtgg cacatttaaa tctatcggga ctacacttta cgccatacga 9600 ccgcgtgaag gctcgacaac gcaccgccaa actacccgag atacgacgcg cgccttcgcc 9660 ctcgggcgac agcgatgagg agatgccacg ctcccccagt gctcggaaga aaacgaaata 9720 gaaaacctaa aggatgggtg atagacaatc aaagtaaaac tcgcgagccc gaagataacg 9780 aaaagggcga acgctgcgga agcaatcaag aatcgatcgc taatcaatcg cctgtcgaat 9840 catcaccagg agcaagtccg aacaatcacg gcgatcaacg aagtagcgcg tgacctcccg 9900 ctgacaatcc tcagcgaagc ccaaccgagg acgaggtgta tcctgcacaa gagcccaatg 9960 agtcgccgaa cgagcaaagc ccacacggtc cgctgcagac aaacgcagca gctgtctcgg 10020 catgcgacat cggcgctagt aacaggggcg atgggcgtac cgacgcgatc gcggctgctg 10080 tgatcaaaga cgcactcgac agtagcaaca aacttaagca gtgcgtcgtc gcaaaagcga 10140 gaccgccacg cgagaaacca ccatgggcga tagctaacgg gcaagtgata tcgccgtttg 10200 tacacttgaa atcctacaat gtacacccat atcggtatag agagaatatg gtgtatttgg 10260 tctcggcgga cgattattta ggaactgaag tccagagatc cctcatcgaa aggggttacc 10320 gaaagagcga tgaattgcaa aataaaggat ttaaagtcgg agaaataaac gtaacatcgt 10380 gtcaagggtt ggctctggta ggcgtataca tcaaagcgca catagacatc cgaccgttaa 10440 aagcggaaat ccgtaaatgc ttacgtacgc tcaaaaacgt gctgaggtca aaaaacatta 10500 acagcttcgc cataatccgc gatctcgaaa tactaactca agccgaatgg gacaaatgca 10560 tcgagttatt cgataactta ttcgcaaacg agaaaatagt tgccgtactg tataaagata 10620 acctgccagt accaccggtg aatttgcgct ataaagtaat caagcaatac cacgatgcag 10680 ctatgggcgg ccatcacggc gtgagcaaga cgtacgggaa gatcgccaac gacttctatt 10740 ggaaaaacat gcggcaagac ataaagaaat ttgtcgcccg ctgtccgact tgcatgacta 10800 acaagttagt gcgattaaaa accagattgc caatgctaat tagcaatact ccggccatgc 10860 ctttcgatca gattgccatg gacttttacg ggcccctgga ggcgtcggac aagggaaata 10920 aatatatttt atcaattcag gata 10944 // ID BEL-42_CQ-I repbase; DNA; INV; 5696 BP. XX AC AAWU01000931; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-42_CQ_; KW BEL-42_CQ-LTR; BEL-42_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5696 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 237-237 (2011). XX DR GenBank; AAWU01000931; Positions 32094 26399. XX CC Positions [4708-5316] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 346..5694 FT /product="BEL-42_CQ-I_1p" FT /translation="MPKSSAKAPATSPGSAEFGNSATSSTPVVKKEKKKRK FT LQDKIKMEIETLVHRRGLAKGKVTKILKAIRPTETAEVRQLSEPQVHVYMK FT KLETAQKEYMSNHESILAMVDMEGRKQADKHYEEFDDLHDKVALMLEEQLL FT GFNATRANGTLNATAPPQAPQPPVIVHQPLRMPIPSFDGRYESWPKFKVLF FT KEVVDKAPDPPAVKLYHLEHALVGGAVGSIDAKTISEGNYAHAWEILEEKY FT GNKRHAIDRHINGLLNLTKMNKENHVELRGLVDECTRHVESLKFLEQEFTG FT ISELIVVHLLGAALDKDTRRRWESEVKKGELPQYDQMLEFLKAQCFVMERC FT EDLKPKQPTVQPKPAVKSSQKSYAVSTSESETRSEDRCDICGKGHKNHACP FT DFKALPLQQRMVKVREHNLCFNCLKKGHSSKKCTSPGTCSKCKRRHHSLLH FT SEERPKSEQKVTQAPPEQAPEVKQQEAAQLTTASCHNETPTQEVLLLTAVV FT DIMDRNNKAHPCRVLLDSASQANLISRAMVEALGLKQFPSNVTVAGIDGTK FT THASSGSVVQLRSRYSSFSANIKCLVTERVTADLPTSAVNVRSWELPPGIQ FT LADPSFHQPGKVDLLLGNQLFLKLLMPGEVHLAEHLPMLRETQFGWVVGGV FT CDEEDEAAAVVHSHSATLEDLNRTVQRFWELEEYESDVKVSSEESECELHF FT TETHRRDATGRYVVELPLKESVAELGDSRTMALRRFYALERKLAQQPELKK FT QYVEFMEEYEKLGHCKEIQEGEDPVGLKKWYLPHHAVLRPSNTTTKCRVVF FT DASAKVNGLSLNDVMKVGDLNQSSLVSIVLRFRFPRYVLTTDVAKMYRQVL FT VDERHTPLQRVFWRTDPTKPLRVLELTTVTYGTASAPFLATRALQQLARDE FT KHRFPLAAKIIEEDFYVDNALFGFDDVSQASEAQSQLIQLCKAGGFELHKW FT SSNCPELLEAIPEADREELVSVGGTGTKEVIKALGLLWNPAADELQFAPLP FT SERDGRPTKSQVLSCVASIFDPAGIAAPVVLVGKLLMKGIWEKEIGWNQQI FT PEELVKEWEHFLDAVRDLDKIRVPRRVVATNAAANELHGYADASKHAYGAC FT VYVKSIIPGSPAELNLLCAKSKLVPKTLKTIPRSELLAARLLHRLVKNVLE FT AVDFVFRKIGLWSDSQVVLAWLKKKPDQLEVFVKNRVAEIVATGGIFEWKY FT VRTADNPADIVSRGMSATKLATNEKWKKCAEYQRSEICEMEEPEPLPDDAV FT PELRRAVVVSAAIQYETFQSTFERIESFRELQRIFAYVVRFCRNCKEKVKE FT RRIVSRYPTIPEMRESQKAILRAVQHLEFAAEIASLEKGESFKKLNCLNPF FT LDDGLVRVGGRLRHSRLPYATKHQWILPQHNKVVQNLIRCIHKENLHAGPL FT ALLAAVRRQFWIPNARSVVRKITRSCVQCFKVSPTTAKQFMGDLPTSRCDR FT ALAFQKVGLDFAGPFLIKQAGRKAAPVKGYVCVFVCMVTKGIHLEAVESLS FT TEAFIAALLRFVSRRGLPEELFSDNGTNFVGAKHELHDLYKLLQDQLTERK FT IFEFCQPREIQWSMIPPGAPHMGGIWEAGVKSTKTILKKVCNTALLTMTEF FT ATLLCQIEAQLNSRPLYAQSEDPSDPEPLTPGHFLIDRPLTAIPEPTYEEI FT PTNRLSRWQYVQVLRDNFWKRWSREYLVELQARGKWTRKMVNVRKGMVVMI FT KEDNLPPQSWRLGVVTDTYPGPDKMVRVVDLRTRSGILKRPIHKLAPLPIL FT DNAETTAASGGE" XX SQ Sequence 5696 BP; 1436 A; 1489 C; 1684 G; 1087 T; 0 other; tggtccaaac gagccggatc tggtgaaccg ggagtgcaca gtgaaggaca aaaggcggag 60 ggtcgtttgt atcgcggcaa gtggacaagt tcgcgaaaag tgcgaaaaaa gtgaacgccg 120 cgaccaagtg aaaaagccgc catcttggcg aaaaaagaag cgacgccatc ttgcgtccag 180 aacgttcttg agaggaaaaa agttccagaa gaagaacaag tggaaggaag atagtgaagt 240 gtcggccaag aggacgccat tgtttccccc tagcaccgtg atttccaact tggtggattt 300 ccgtgttagg ttcgggcaaa aggctcgatt ggtggtgtcc ctcgaatgcc gaaaagttcc 360 gcgaaagctc ctgcaacatc acctggaagc gctgagttcg gaaactcggc cacatcgtcg 420 actcctgtag tgaagaagga gaagaagaag cggaagttgc aggacaagat caagatggag 480 atcgagacgc tcgtccaccg ccgaggcctg gccaaaggta aggtgacgaa gatactcaaa 540 gctattcgac ccacggagac tgcggaagtt cgccagctaa gcgagccgca agttcatgtc 600 tacatgaaaa agttggaaac cgctcagaag gagtacatgt ccaaccacga atccatcctg 660 gccatggtcg acatggaggg tcgtaagcag gctgacaagc attacgaaga atttgacgac 720 ctccatgaca aagtggccct catgctcgaa gagcagttgc tcgggttcaa tgccacaagg 780 gcaaacggaa ccttgaacgc aactgcaccc ccacaagccc ctcaaccccc tgtcattgtt 840 caccagccct tgagaatgcc aatcccctca ttcgacggcc gctacgagag ttggccgaag 900 ttcaaggtcc tgttcaagga agtggtcgac aaggcgcccg atccgccagc agtgaagctc 960 taccatctgg aacacgcgct ggtggggggt gcagttgggt cgatcgacgc gaagaccatc 1020 agcgagggca actacgccca tgcctgggaa attctggagg aaaagtacgg caacaagcgt 1080 catgcgatcg acagacacat caacgggctg ctaaacctga cgaagatgaa caaggagaac 1140 cacgtcgagt tgagaggcct ggtggacgag tgcacaaggc acgtagagag tctgaagttc 1200 ctggaacagg agttcacggg aatatcagag ctgattgtgg tccatctgct cggcgcagcg 1260 ctggacaagg acactcgaag acgctgggag tcggaggtca agaaaggaga actaccccag 1320 tacgatcaga tgctggagtt cctgaaggcc cagtgcttcg tcatggagcg ttgcgaggac 1380 ctgaaaccaa agcagccaac ggtccagccc aagccggccg tcaagtccag ccagaagtcg 1440 tacgccgtgt ccacgtccga gtccgagacc aggtcggaag acaggtgcga catctgtggc 1500 aagggccaca agaaccacgc ctgcccggat ttcaaggcac ttccgttgca gcagagaatg 1560 gtcaaggttc gggagcacaa cctgtgcttc aactgtctaa agaagggaca ctccagcaag 1620 aagtgcactt caccgggaac ctgctcgaag tgcaaacgac gtcaccacag tctgctccac 1680 tccgaagaac gccccaagtc ggaacagaag gtgacacaag caccgccgga acaagccccg 1740 gaggtcaagc agcaagaagc agctcagctg acaaccgcga gttgccacaa cgagacgcca 1800 acccaagagg tgctgcttct cacggcagtg gtggacataa tggatcgaaa caacaaggcc 1860 cacccctgca gagttttatt ggacagtgcg tcacaagcta acctcatctc gagggccatg 1920 gtggaggcgc tgggcttgaa gcagtttccg tccaacgtca ccgtggcagg aatcgacggc 1980 acgaaaaccc acgcttcatc gggcagcgta gtccaacttc gttcgaggta ctccagcttc 2040 agcgcgaaca tcaagtgttt ggtcacggag agggtcacgg ccgaccttcc cacgtccgca 2100 gtgaacgtgc gcagctggga gctgccacct ggaatacaac ttgcggaccc gtctttccac 2160 cagcctggga aggtggatct gctactgggc aatcaattgt tcttgaagtt gctgatgccc 2220 ggagaagtgc atttggcgga acacctgcca atgctacgcg aaactcagtt cggctgggtc 2280 gttggtggcg tgtgcgatga agaagacgaa gctgcagcag tggttcactc gcactcagcc 2340 acactggaag acttgaaccg caccgttcaa cgtttttggg agctggaaga atacgaaagt 2400 gatgtgaaag tgtcaagtga agaaagtgaa tgtgagctgc acttcacgga aacacaccgt 2460 cgggatgcaa ctgggaggta cgtggtggag cttccgctca aagagtcggt cgccgagctc 2520 ggcgattcac gtacgatggc gctgcgaaga ttctacgctc tggagagaaa gcttgcacaa 2580 cagccggagt tgaagaagca gtacgtcgaa ttcatggaag agtacgagaa gcttggacac 2640 tgcaaggaga tacaagaagg tgaagaccca gttggactca agaagtggta cctgcctcac 2700 cacgccgttc ttcgcccgtc gaatactacc accaagtgcc gcgtggtgtt cgacgcatcc 2760 gccaaagtga acggcctctc tctcaacgat gtgatgaagg taggtgacct aaaccaaagc 2820 tcccttgtct cgatcgttct tcgctttcgc ttcccccgct acgtgctgac cacagacgtg 2880 gccaagatgt atcggcaagt ccttgtcgat gaaagacaca cgccgctgca gcgcgtgttt 2940 tggagaaccg atccaacgaa accgcttagg gtgctagagt tgacgacggt cacctacggc 3000 actgcatccg caccatttct ggccacacga gcgctccagc agctggccag agacgagaag 3060 catcgattcc cactagcagc caagatcatc gaggaagact tctacgtcga caacgcgctg 3120 ttcggtttcg acgacgtttc acaagcttcc gaagcgcaat cgcagctgat ccaactgtgc 3180 aaggcaggtg gatttgagtt gcataagtgg tcgtcaaact gtccggaact cttggaggcc 3240 attccggaag ccgatcggga ggagcttgtc agcgttggtg gaactggaac caaggaggtg 3300 atcaaggctc taggtctgct gtggaacccc gcggcggacg aactacaatt tgcgccgctg 3360 ccttccgagc gagacgggcg gccaacgaaa tcgcaagtgc tctcgtgtgt cgcgagcatc 3420 ttcgacccgg ccggaatcgc tgcgccagtg gttctcgtcg gcaagttatt gatgaagggt 3480 atctgggaaa aggagattgg atggaatcag cagatacctg aagagctagt gaaggagtgg 3540 gagcacttcc tggatgcggt ccgggacctc gacaagattc gagtgccgcg tagggtggta 3600 gctacgaacg ccgctgctaa cgagctgcac gggtacgccg acgcgtcgaa gcacgcgtac 3660 ggagcttgcg tgtacgtgaa atcgatcatc ccagggagcc ccgcggaact gaacctgctg 3720 tgtgccaagt cgaagttggt tccaaagacg ttgaaaacga tcccacgaag tgagctgctg 3780 gccgcacgct tgctgcaccg gttggtcaag aacgttctgg aggcggtgga cttcgtcttc 3840 aggaaaattg ggctgtggtc tgacagccag gtcgtgcttg cttggctaaa aaagaaacct 3900 gatcaactgg aggtgtttgt gaaaaatcgg gttgctgaga ttgttgctac cggcggtatt 3960 ttcgagtgga agtatgtgcg cactgcagac aaccctgcag atatcgtttc acgcggtatg 4020 tctgccacca agctagcaac aaacgagaag tggaagaagt gcgcagagta ccagcgaagt 4080 gagatctgcg aaatggagga accagagccg ctgccagatg acgccgttcc agagcttcgt 4140 cgtgctgtgg tcgtaagcgc ggccatccag tacgaaacat ttcaatcaac gttcgagagg 4200 atcgaatcgt ttcgagagct acagcgcatc ttcgcctacg tcgtgcggtt ctgccgcaac 4260 tgtaaggaga aggtcaagga aaggcggatc gtgtcaaggt accctacgat cccagagatg 4320 cgcgagtcgc agaaggcgat tttgcgagcg gtgcaacacc tggagttcgc cgccgaaata 4380 gccagcctcg agaaaggtga gtccttcaaa aaactaaatt gtttgaaccc atttctggac 4440 gatgggttgg tacgtgtggg cggtcgcctg cgccattcgc ggctgccgta cgcaaccaag 4500 caccagtgga tactgccgca gcacaacaag gtggtgcaga acctgatcag atgcattcac 4560 aaggagaact tgcatgctgg cccgctggca ttgctggcgg cagttcgccg gcaattctgg 4620 atacccaacg ctcgttcggt ggtacgcaag atcacacgga gctgcgtgca gtgtttcaag 4680 gtcagtccca cgacagccaa gcagtttatg ggagaccttc caacgagtcg ctgcgacaga 4740 gctctagcgt ttcagaaggt tggcctggat ttcgcgggcc cgttcctcat caagcaagct 4800 ggacggaagg cggccccggt caagggttac gtgtgcgtgt tcgtgtgcat ggtaacgaaa 4860 ggaatccacc tggaggcggt cgagagcctg tcgactgaag cattcatcgc tgcgctgcta 4920 cggttcgtgt cccgccgtgg tctacccgaa gagctgtttt ctgataacgg gacgaacttc 4980 gtcggcgcca agcacgagtt acacgacctg tacaagctgt tgcaggatca gctgacggag 5040 agaaagattt ttgagttttg ccagccgcgc gagattcaat ggagtatgat cccccctggc 5100 gcaccgcaca tgggtggaat ttgggaagca ggagtcaaga gcaccaagac gatcctgaag 5160 aaggtatgca acactgcgtt actcacgatg acggaatttg ctacgctgct ctgccagatt 5220 gaggcgcaac tcaattcgcg gcctctgtac gcgcagtccg aggacccatc ggaccccgaa 5280 ccgctaacgc cgggacattt tttgattgac cgtccgctga cggcaatacc agagccaacc 5340 tacgaggaga ttcctacgaa tcgcctgtcc cgctggcagt acgtccaggt gctccgtgac 5400 aacttttgga agcgctggtc acgggagtat ctggtggaac ttcaagctcg gggaaaatgg 5460 accaggaaga tggtcaatgt gcgcaagggt atggttgtca tgatcaagga ggacaacttg 5520 ccaccccaat cgtggagact cggagtggtt accgacactt acccgggtcc ggacaagatg 5580 gttcgagtcg tcgacttgcg aacacgttcc ggaatattga agcggccgat ccacaagctg 5640 gcaccgctgc cgattctgga caacgccgag actacagccg cttccggtgg ggagga 5696 // ID BEL-633_AA-LTR repbase; DNA; INV; 179 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-633_AA_; KW Pao_Bel_Ele82; BEL-633_AA-I; BEL-633_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-179 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 179 BP; 55 A; 35 C; 35 G; 54 T; 0 other; tgttccgtcg agctggcaac acgtcgcagc agactgtgac acacgagatg aaaaacgcta 60 tggttgaatt ttgttctttt atttttgttt tttgattcaa gttcctacaa atacacaagc 120 aaaagagaat aaagtgaaag tatgtacagt ccacacgcgt tttactatta gtcccgaca 179 // ID BEL-594_AA-I repbase; DNA; INV; 6427 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-594_AA_; KW BEL-594_AA-LTR; Pao_Bel_Ele204; BEL-594_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6427 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5466-6035] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 47..3718 FT /product="BEL-594_AA-I_1p" FT /translation="MPRSQHIPHEASADAQCIACRKPNSAEAKMVQCDGCH FT RWYHFSCAGVGDSIEAEDRSYQCALCRPRSSRAPLSEKSTASTSASAIEAR FT LQLEMQHLVEEKELQAKLMAEKAKQDREMQEKALQLETERRAKMMDDLMKL FT EKEFLDRKFQLLHAQLDEGAKTSSRASVHSGVSAAGSADKVRSWIDNHQVI FT LRANSGKDISTGLTSQAGGHVVATRSTLVDGAVAPVAAVSSHEIRSSQAVT FT QPGVFSNSFDDVISPIATVYQGGFTNTANPVPPVLQSTPRTESITLSSIPA FT VHPTVSWQTDIHPSSCAVSQPAPVIGSTVGGSSMLPRVPAPVSSFGRYDQS FT QYVRRTWMDPHVGQHNPYVLPSSTVVSTTSAEVIAGKIGEPITQNCVQPSH FT VFPSTLGQLGYSQISRNEANMPPDSGNYFLSRNQPVDLNPIVSQNAPFPVY FT HMQSEPMSSGGESRPPLRPPVVSVSSYQTPIPAVQNPQLQYGGFQPSYGPN FT AQQLAARHVVPKDLPCFSGDPTEWPMFWSSYQTSTQICGYNDSENLMRLQR FT CLKGDARKAVSSFLLHPANVPEIINTLRILYGRPEAVISSLLAEVRATPTP FT RPEKLGTIISFGLAVRNFCTHLVSTGQQVHLANPILIEELVEKLPANIKLD FT WAIHKQRVQNVDLRTFADYMNVIVTAASSVTPTIVKHEKPKGKAEIHTVNI FT QNEKPEPSSQTRTERPCMVCQKPGHKPKDCSSFKSKTLEDKWKVAQEVSLC FT RRCLYPHGKWPCKASNCGVNGCQQRHHRLLHPGDPRETRPEGSSQVSGVIS FT VHRQQQEILFRIIPVILHANGRSVQTFAFLDGGSDTTLLERSLAEQLGVQG FT PTSPLCMKWTNGVERVEEESQKIQLDISGLSTDRKFTIRRVQTIEGLELPR FT QSVNFEELRSKYPHLRGLPVQSYNDAVPGILIGLDNTRLKTTLKLREGREH FT EPVVAKTRLGWVLYGRSGNDVGSSSQRVLHICTKSRDDELHGLVKSFFSME FT SVGISVTQAKESADDKRAQEILQATTNRTDTGRFETGLLWRFDNVEFPNSR FT PMAERRLKCLERRLLKSPALYANVRQQIASYVSKGYAHKATPRELETMDAK FT RTWFLPLGVVVNPKKPEKVRVVRDAAASVEGVSLNSVLLKGPDLPTSLQAV FT LCRCRQKAIAISGDIMEMFHQVLIRVQDRQLSGSCGEKVHPILFKYTSWTL FT QHSGRRVRQVQPNM" FT CDS 4968..6395 FT /product="BEL-594_AA-I_2p" FT /translation="MRRNGSKDGKATLGSAEWKRAESAVFKLIQANEYPEE FT CSILHRNLTLQPDERERLQKSSRVFKLSPFLDLAGVMRMGSRIANAECVSD FT ESKFPVILPKDHYVTQLVIDWYHRKYRHANNETVVNEIRQKFHVSDLRTAV FT KKVANGCNWCRVYKCKPQVPPMAPLPSARLTPYVRAFTFTGLDYFGPINVR FT FGRGSVKRWVALFTCLGTRAVHLEIAYSLSSESCKMAIRRFIARRGAPQEI FT YSDQGTNFQGASAELRQQIEAVNRGLAETFTNAETQWKHDPPYAPHMGGVW FT ERLVRSVKSGLAYMELPRIPEEETFITALAEVESTVNSRPLTYLPLESEES FT TALTPNHFLLLSSSGVVQPSVRPTEEGAALRSNWKLAQVMLDRFWMRWIRE FT YLPVIARQPKWFGEVKAINVGDLVIVVNEGVRNSWIRGRIIRVYPGRDGRV FT RSADVQTSFGIMRRPATKLAVLDVAREAGTA" XX SQ Sequence 6427 BP; 1715 A; 1542 C; 1753 G; 1417 T; 0 other; atctttaaga ttgatttact gagatcgctt gcaatctcgt agaaacatgc cgaggtctca 60 acacatccct catgaagcat ctgctgatgc tcagtgcatt gcatgtagga aaccaaattc 120 ggccgaagcc aagatggtcc agtgtgatgg atgccaccga tggtaccact tttcgtgcgc 180 cggcgtcggc gatagcattg aagcagaaga tcgtagctat cagtgcgcgc tttgccgacc 240 gaggagttct cgagcgccgt tatcggaaaa atccacagca tcgaccagtg caagtgcaat 300 tgaggcaagg ttgcaactgg aaatgcagca tctggttgaa gaaaaggaac tacaagccaa 360 gttgatggct gaaaaggcta aacaggatcg ggaaatgcag gagaaggctc tgcagctgga 420 aaccgaaagg agggcaaaaa tgatggacga tctgatgaaa ctggaaaagg agtttttgga 480 ccgtaagttc cagttacttc acgcccagct ggatgagggc gcgaagacga gctcgagagc 540 gagtgtacac agcggggtca gcgcagccgg aagtgccgat aaagttcgga gctggatcga 600 caaccaccaa gtgatcctga gggccaattc tggaaaagat atttccactg gactcacatc 660 acaggccggt ggccacgtcg tggcaacgag atcgacactg gtagatggcg cagtagctcc 720 agtcgcagca gttagcagcc acgaaatcag atccagtcag gcagtgaccc aaccaggtgt 780 gttttcaaac tcgttcgacg acgtcatctc gcctatagcg accgtatacc aaggaggttt 840 caccaatacc gccaatccag ttccaccagt tctacaatca acgccgcgga cggaatccat 900 aacgctgtcg agtattccag ccgtgcaccc cactgtatcg tggcagaccg atattcatcc 960 gagttcgtgt gcagttagcc agccggctcc ggttattggt tcaactgtcg gcggcagttc 1020 gatgcttcct cgggtcccgg caccggtatc gtcatttggg agatacgatc aatcacaata 1080 cgtcaggcgc acgtggatgg atcctcatgt tgggcagcat aatccgtatg tactacctag 1140 ttcgactgtg gtgagtacta caagtgcaga agtcattgca ggaaaaatcg gcgaaccaat 1200 cacacaaaat tgtgtccaac catctcacgt tttcccctct acgctagggc agttagggta 1260 ctctcaaata tctaggaacg aagctaacat gcctcctgat tccggtaact atttcttaag 1320 taggaaccag cccgtagatt tgaatcccat agtgtcacag aatgctccct ttcccgttta 1380 tcatatgcaa tccgaaccaa tgtcttcagg gggggaatct cgaccgccac ttcgtcctcc 1440 tgtagtgagt gtgtcgagtt atcaaacacc cataccagct gttcaaaatc cgcagctgca 1500 gtatgggggc ttccaaccat cgtacggtcc gaatgcgcag cagctggcgg cccgccatgt 1560 ggttcccaaa gaccttccgt gcttttctgg ggacccaacg gagtggccga tgttctggag 1620 cagctaccag acgtccacgc agatttgtgg gtacaacgat tcggagaacc tgatgcgatt 1680 gcaacgatgc ttgaaagggg atgcgaggaa agcagtgagc agttttctcc ttcatccggc 1740 gaacgttcct gagataatca acacgttgcg gattctatat ggacgccctg aagcggtcat 1800 cagttctctg ctggctgaag tgcgcgcaac cccgacaccg cgaccagaaa aactgggaac 1860 gatcatcagt ttcggcttag ctgttcggaa tttctgtacg catctagttt cgactggaca 1920 acaggtgcat ttggctaacc cgatcttgat cgaggagcta gtagaaaaac tgccggccaa 1980 tatcaagcta gattgggcga tacataaaca gcgagtgcag aacgtagatt tgagaacctt 2040 tgcggactac atgaacgtga tcgtgacagc agctagtagt gtaacaccaa ctatcgtcaa 2100 acacgagaag ccgaaaggga aagccgaaat acacacggtg aatatccaga acgagaaacc 2160 agaaccatcc agccaaacga ggacagagcg tccttgtatg gtttgtcaga aacccggaca 2220 caaaccaaag gattgcagta gcttcaagtc gaagaccttg gaagacaagt ggaaggtggc 2280 ccaagaagtg agcctatgta gacgctgctt gtatccgcat ggtaaatggc cgtgcaaagc 2340 ctcgaactgc ggagtaaatg gctgccaaca acgacatcat cggttgcttc acccaggaga 2400 tccacgagaa acaagaccag aaggatcgtc tcaggtttcc ggcgtaatct ccgttcatcg 2460 gcagcaacag gaaatcctat ttcgtataat tccggttatc cttcacgcca acggtcgatc 2520 ggtacaaacc tttgccttct tggatggtgg ttccgatacc actttgttgg agcgatcttt 2580 ggcagaacag ttgggtgtgc aaggtccaac atctccgctt tgtatgaaat ggaccaacgg 2640 agttgagcga gtggaagagg aatcccagaa aattcagctg gatatttctg gattaagtac 2700 ggataggaag tttactatac ggagagtaca aaccatcgaa gggctagagc ttccgcgaca 2760 gtctgtgaat ttcgaggagt tgaggagcaa atatccgcat ctgaggggtc ttccagtgca 2820 aagctacaac gacgcagttc caggtatcct gataggcttg gacaacacta gattgaagac 2880 tacgttgaaa ttgcgggagg gccgagaaca cgagccggtt gtagccaaaa ctagactggg 2940 ctgggttctt tacggccgat ctggtaacga cgttggaagc tcgtcacagc gggttctgca 3000 tatttgcacc aaatcgcgcg acgacgaatt gcacggactg gtgaaaagtt ttttctccat 3060 ggaaagcgtc ggcatttcgg taactcaggc gaaggaatca gcagatgaca agcgggctca 3120 agagattctt caagcaacca cgaacaggac tgataccggc agattcgaaa ccggtctctt 3180 gtggagattc gacaacgttg agtttcccaa tagcaggcct atggctgagc ggagactaaa 3240 atgcttggaa cgtcgtcttc ttaaatcacc agcattatat gccaacgtta ggcagcaaat 3300 cgcaagctat gtatctaagg gatacgcaca caaagccact cctcgagaac tggagacgat 3360 ggatgcgaaa cgtacgtggt ttttaccctt gggtgtcgtg gtgaacccaa aaaagcccga 3420 gaaagtcagg gtcgtccggg atgccgccgc atcggtagaa ggcgtatcgt taaactccgt 3480 tctcctaaaa ggacctgatc tgccaacctc tctgcaagcc gtcttgtgcc gttgccgaca 3540 gaaagcaatc gccatcagtg gcgatataat ggaaatgttc catcaagtgc ttatcagagt 3600 gcaagatcga cagctcagtg gttcatgtgg cgagaaagtc catccgatcc tattcaagta 3660 tacgtcatgg acgttgcaac attcgggtcg acgtgttcgt caagttcaac ccaatatgtg 3720 aaaaacctga atgcggagga gtatgcggcg gagttcccga gggcgtccga ggcgataaag 3780 gaaaatcact atgtggacga ctatctggac agtgtcgata cggtcgagga agcggtcagt 3840 agcactggac gtcaaggaag tccacgagaa ggcaggattc cttattcggc actggatgtc 3900 caactcccca gaggtgctga cacgcattgg agaacaaaac acgaagtcag tgaagagttt 3960 tacaatggac aagggcagca acctggaacg agtcttaggg atggtgtggc gaccacaaga 4020 ggatgtcttc gtattttcga tggctttccg agaagatctg caacgtctac tggatgattc 4080 cgtcgtgcca accaagcgtc agctactgag cctcgtcatg agcgttttcg atccattagg 4140 tatggttgct cctttcgtgg tacatggcaa gacgatcgtt caggaagtgt ggagatccgg 4200 aattagctgg gacgagccac tacctgagga tatcgtacac cgatgggagc agtatgccaa 4260 attgctcgga acgatgcata cgtgaagata cctagatgct attttcccgg atacaatcct 4320 gaagcttacg attcgcttga actacacgtt ttcgtggatg ctagcagctc agcatatgcc 4380 gccgtagctt atttccggat tgtggactgt ggaaaagtac gctgtagcct ggtatctgcc 4440 agaaccaaag tagcgccgat caaattgctg tcagttccac ggttggagct gcaagccgcc 4500 gtgatcggag cacgcctcgc aaagtctgtc atttcaaagc actctttgaa gatcaagcgc 4560 aaaactttgt ggagtgattc ctcgacagtg ctgaactggc tacgatctga tccgcggaag 4620 ttcaagcagt ttgtagcgtt tcggataaca gagatcctgg aggaaacaga agtcaccgaa 4680 tggaggaaag tcccaacgaa gatgaacgta gctgatgaag ccaccaagtg gggacatggg 4740 ccatgcttcg atgaacagag caggtggtac aagggcccag cattcctata tgagccggag 4800 tttgattggc cagaggaagc gatcgaacca ttacagcgga agaattacga gtggttcatg 4860 tgcaccagca actaccgacg gaaccgatca tcgagtttga acgtttttcg aagtgggaga 4920 ggctaacccg ctcagtggca ttcgttctac gctttggttg caagaggatg cgacgaaacg 4980 gatcgaagga cggaaaagca accctaggca gcgcggaatg gaaacgagct gagtcggctg 5040 tgttcaagct gattcaagcg aacgagtatc cagaggaatg tagtattctt catagaaacc 5100 ttacgttgca gccagatgaa cgtgaaagac tccagaagag cagccgagtg ttcaaattat 5160 ctccgttttt ggacctcgcc ggagtgatga gaatgggatc ccgcatcgca aacgcggaat 5220 gtgtttcgga cgaaagcaaa tttcccgtca tcttgcctaa agatcactat gtcacgcaac 5280 tagtgatcga ctggtatcac aggaagtatc gacacgccaa caatgaaacg gtggtgaacg 5340 agattcgcca gaagttccac gtttctgact tgaggacagc cgtcaagaaa gtggcaaacg 5400 gctgtaactg gtgcagagtt tacaaatgca agccacaggt tccaccaatg gctcccttgc 5460 cctctgctcg tctcacgccg tatgtccggg ccttcacctt tacggggctg gactacttcg 5520 gcccgattaa tgttcgcttt ggacggggta gcgttaaacg atgggtagca ctctttacct 5580 gtctggggac ccgagccgtg catttagaaa ttgcttatag cctatcatct gaatcgtgca 5640 agatggcaat acgcaggttc atcgcccgcc gaggtgcgcc gcaggaaata tactcggatc 5700 aaggtaccaa cttccagggg gctagcgcag aattgagaca acagatcgaa gcggtaaacc 5760 gaggtctagc ggagacgttc acgaacgccg aaacccaatg gaaacacgac cctccgtacg 5820 ccccccatat gggaggtgtg tgggaacggc tagtacgttc agtcaagtca ggtttagcgt 5880 acatggagtt gccgaggatc ccggaagagg agacgttcat aacggccctt gcggaggttg 5940 aatctacggt gaactcgcga cctctaactt accttccgct ggaaagtgag gaatctaccg 6000 ccctaacgcc gaaccacttc ctattgctga gttccagcgg agtcgtccaa ccatctgtcc 6060 ggccaactga ggagggtgcg gcgttgcggt cgaactggaa acttgctcag gttatgttgg 6120 accgattctg gatgcgatgg atccgagaat atcttccggt tatcgctcga cagccgaagt 6180 ggtttggaga ggttaaggcg atcaacgtcg gtgatctggt catcgtcgta aacgaaggag 6240 tgagaaacag ctggatcaga ggaagaatca tccgggtgta tcctggacgt gatggacgag 6300 tccgcagcgc ggatgtccag acttcttttg gtatcatgcg acgaccggcg actaagctcg 6360 ctgtcctgga tgtagccagg gaagctggta cagcttaagg acacttgaag cagtacgggc 6420 gggggcg 6427 // ID Neptune1_Ren repbase; DNA; INV; 4060 BP. XX AC . XX DT 20-DEC-2006 (Rel. 11.12, Created) DT 09-JAN-2007 (Rel. 11.12, Last updated, Version 1) XX DE Neptune1_Ren is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Neptune1_Ren. XX OS Reniera OC Eukaryota; Metazoa; Porifera; Demospongiae; Ceractinomorpha; OC Haplosclerida; Chalinidae. XX RN [1] RP 1-4060 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Neptune1_Ren is a Penelope-like element (PLE) from the sequenced CC genome of the sponge Reniera sp. JGI-2005. It belongs to the CC Neptune group of PLEs. Its ORF1 has a region of homology to SGNH CC hydrolases, members of a diverse family of lipases and esterases, CC and ORF2 contains regions homologous to reverse transcriptases CC and to GIY-YIG endonucleases. The element appears to be low-copy CC and probably inactive, although the presence of intact copies CC cannot yet be ruled out. Consensus sequence was assembled from CC trace archives. XX FH Key Location/Qualifiers FT CDS 1..1632 FT /product="Neptune1_Ren_1p" FT /translation="MLPNYNNHWQEARNRNTNRQPNQYHVPYNGPVFHRPL FT PPTNRRPPTNPSSYIPPHRRPTYNTAYQSRGNHRPDQQQKRLPAHRRQPVP FT THKEEEPFVKVFFQVLQCLHHLAILTLQQQGQPAFTFKRKVSELDGFIRPA FT MKTSAVDANIRSLNRTWLCKVTQVLIDHYQTTLKEKLTTIRTKSLSSPAVD FT RIVSHALSWARRSYGKRLTQAVISKFYQTINETKTRVTISLPTEMDTTPAP FT SRATTLPNQTPKRARSSPTPSPADIGKRSRREGTPLLFSLSDSSSPSTPSE FT SSPRDPGSANLSSFCAFLDQQTTPRSPSKSRHVSRPSPPRTRSQSVPNATI FT LTTTKPVPITPSTTIISATKPYYHQTKDKSLWKLPALKKSTLIIGSSNLNR FT ITKTSSDTEIHSYPGAQIHHIQSILESYDHDPKPTTIILHVGLNNRDQIPK FT TSFNQAQKLIATANKTFPNSKILFTCIPISNKISKKAQQNLSTLNNMMMES FT SINSATILPPYEEEFETVDNLHWSPETANSIYLFWIHQLQASSKSLN*VFK FT LITGDKNTERPPLGQTVVNLSRKILTPLQHSILDRGLKFIPTPKIANLNYL FT RPSIERFQRNLSCTYFFRNHPPRGARTMFIPKSTWTPPQAAIHPEILKAFA FT SMNSQTVNLSITHEKPNLTRVEREALNLLKKDDSIVIKKSDKGSCCVILNR FT EDYVSEAENQLNNPKHYAILPKPIFKETANMINNILADLVKTSYLKQKQYD FT YLSAKPDCRERHLYTLPKIHKNPTKEWFISNKIPKGRPIISDCGSESYAVS FT EYIDHFLLPLACQHRAYLKDTTDFVNKIKNLRVPSNAILISIDVSSMYTNI FT DNTTGLTAVRKAFNRNPDPNRPDDAILQLLEICLSRNDFTFNGKTYLQTSG FT TAMGKRFAPSYANLFMAEWEREVLPLCPFDPLFYGRFLDDIFMVWTYSIEQ FT FWEFFEILNNHHPSIKLTANVSDKSIDFLDTTVFKGDSIAETNKLDIKVYF FT KETDTHQLLHKSSFHPKHTFSGIIKSQVLRFHRICTRSSDFNQACTIVFSK FT LKERGYSKRFLRNIKNSTLAQIAAKNRLYAEVPPGLRSAPCLSSRCKTCPF FT ISETHYFVSNSTNKRYPIQEHLTCEHSNVIYLISCKKCPKQYVGETKNNLR FT TRFNAHRSDIKLKKDKPVSNHFNLDDHSIDDLTLIPIERIEIDLPEDETTK FT FRQSREAHWIETLNTARPIGLNIHHNVGIAPFVVPFNTTSNRASKIVHAHY FT KSLQEKFPHVYKQKMIVAYSKNKNLNDNIVHSKLKTIST*" FT CDS 1620..3890 FT /product="Neptune1_Ren_2p" FT /translation="VFKLITGDKNTERPPLGQTVVNLSRKILTPLQHSILD FT RGLKFIPTPKIANLNYLRPSIERFQRNLSCTYFFRNHPPRGARTMFIPKST FT WTPPQAAIHPEILKAFASMNSQTVNLSITHEKPNLTRVEREALNLLKKDDS FT IVIKKSDKGSCCVILNREDYVSEAENQLNNPKHYAILPKPIFKETANMINN FT ILADLVKTSYLKQKQYDYLSAKPDCRERHLYTLPKIHKNPTKEWFISNKIP FT KGRPIISDCGSESYAVSEYIDHFLLPLACQHRAYLKDTTDFVNKIKNLRVP FT SNAILISIDVSSMYTNIDNTTGLTAVRKAFNRNPDPNRPDDAILQLLEICL FT SRNDFTFNGKTYLQTSGTAMGKRFAPSYANLFMAEWEREVLPLCPFDPLFY FT GRFLDDIFMVWTYSIEQFWEFFEILNNHHPSIKLTANVSDKSIDFLDTTVF FT KGDSIAETNKLDIKVYFKETDTHQLLHKSSFHPKHTFSGIIKSQVLRFHRI FT CTRSSDFNQACTIVFSKLKERGYSKRFLRNIKNSTLAQIAAKNRLYAEVPP FT GLRSAPCLSSRCKTCPFISETHYFVSNSTNKRYPIQEHLTCEHSNVIYLIS FT CKKCPKQYVGETKNNLRTRFNAHRSDIKLKKDKPVSNHFNLDDHSIDDLTL FT IPIERIEIDLPEDETTKFRQSREAHWIETLNTARPIGLNIHHNVGIAPFVV FT PFNTTSNRASKIVHAHYKSLQEKFPHVYKQKMIVAYSKNKNLNDNIVHSKL FT KTIST*" XX SQ Sequence 4060 BP; 1343 A; 1071 C; 616 G; 1030 T; 0 other; atgctcccca actacaacaa ccactggcaa gaagccagaa atagaaacac taatcgtcaa 60 ccaaaccaat atcacgtacc ctacaacggc cctgtcttcc atcgacctct accacctacc 120 aatcgccgac caccaaccaa cccatccagc tatatcccac cccatcgccg acctacctac 180 aacacagcct accaatcaag aggtaaccat cgacctgacc aacaacagaa acgactacca 240 gcccaccgac gccaaccagt ccccactcat aaggaggaag aaccctttgt gaaggtgttc 300 ttccaagttt tgcagtgcct ccatcatttg gcaatcctta cccttcaaca gcaaggccaa 360 ccagcattta ccttcaaacg caaagtctct gagttggatg gttttatccg tccagcaatg 420 aaaacatcag ctgtcgacgc gaatattaga tccctcaatc ggacatggct ctgtaaggtt 480 acgcaggtcc tgatcgacca ttaccaaacc acactcaaag agaagctgac caccattcgt 540 accaagtctt tatcatcacc tgccgtcgac cgcatcgttt ctcatgccct gtcctgggcc 600 cgtcgctcct atgggaaacg tctcactcaa gctgtaattt ccaaattcta ccaaacaatc 660 aacgaaacaa aaaccagagt aaccatatct ctaccaactg aaatggacac tacacctgca 720 ccatctagag ctaccacgct gccgaaccaa acccctaaac gagcccgaag ttcacctacc 780 cctagtccag cagacatagg taaacgctct cgtagagagg gcacacccct tctcttttca 840 ctttcagact cgtcttcccc ttccacacca tctgaatctt caccacgtga tccaggatcc 900 gccaaccttt cttccttctg tgcgtttttg gaccagcaaa ccacccctcg gtctccatcc 960 aagtcgagac atgtatccag accttcccct cctcgaacca gatcccagtc cgttcccaat 1020 gctacaattc taacaactac caaaccagta ccaatcacac cgtcgacgac cattatatcc 1080 gctaccaaac cgtattatca ccaaaccaag gacaaatcat tatggaaact acctgctcta 1140 aagaaatcaa ccttaataat tggatcatct aacctgaatc gaattaccaa aaccagttca 1200 gacaccgaaa tacattctta ccccggtgcc caaatccatc acatacagtc aatcctggaa 1260 tcatacgacc atgatcccaa acctactacc attattcttc atgtaggcct caacaaccgt 1320 gatcagatac caaagacctc ctttaaccaa gcccagaaat taatcgccac agccaacaaa 1380 accttcccaa actccaagat cctcttcaca tgtataccca tttcaaataa aatctcgaag 1440 aaagcacaac agaacctcag caccctcaac aacatgatga tggagtccag tatcaacagc 1500 gcaacaatct taccacccta cgaagaagaa ttcgaaacag ttgataatct tcattggtct 1560 ccagaaacag caaacagcat ttatttgttt tggatacacc agcttcaagc atcgagtaag 1620 tctttaaact aattactggg gataaaaata ctgagaggcc accgctaggc caaactgtag 1680 ttaacttgag ccgcaaaatt ctcacacccc ttcagcattc aatcttagac cgtggactta 1740 agtttattcc cacaccaaag atagctaact tgaattatct ccgaccatcc attgaacgat 1800 ttcaacgaaa tcttagctgc acatattttt ttcgaaacca ccccccacgt ggtgctagaa 1860 caatgtttat tcctaaatcc acgtggacac cccctcaggc cgccatccat cctgaaatat 1920 tgaaagcatt tgctagtatg aatagccaaa cagtgaacct ttccataaca cacgaaaagc 1980 ctaatctcac acgagtggaa cgtgaagcgt taaacttact taagaaagac gatagcattg 2040 ttattaaaaa gagtgacaaa ggttcatgtt gtgtcattct gaatagggag gattacgttt 2100 cagaagcaga gaatcaactg aataatccta aacattatgc cattttaccg aaaccaattt 2160 tcaaagaaac tgctaacatg attaataata ttcttgccga tttagtcaaa actagttatc 2220 tcaaacagaa acaatatgac tatctgtctg ccaaacctga ttgtcgcgag agacatttgt 2280 acactttgcc taaaatccat aagaatccaa ccaaggagtg gtttatctcg aataaaatcc 2340 ctaaaggccg cccgattata tctgactgtg gtagtgagtc atatgcagtg tcagaataca 2400 ttgatcattt cttactacct ctagcatgcc aacatcgcgc ctaccttaaa gacacaaccg 2460 attttgtcaa taaaataaaa aatcttcgag ttccctctaa tgctatttta atttctatcg 2520 acgtttcttc tatgtatacc aatatcgata acactactgg tttaacggca gttcgtaaag 2580 cttttaatag aaacccagat ccaaacagac ccgacgatgc aattctacag cttttagaga 2640 tttgtcttag tcgaaacgat ttcacattta atggtaaaac atatttacaa acttcaggga 2700 cagcaatggg caaacgattc gccccttcgt acgcaaacct tttcatggca gagtgggaac 2760 gtgaggtgtt gcccctctgt ccatttgacc cactttttta cggacgcttt ttggatgaca 2820 tttttatggt ttggacttac tctatcgaac agttttggga attctttgaa attctcaaca 2880 accaccatcc ttctatcaaa ctaaccgcaa acgtgagtga caaaagcata gactttctcg 2940 ataccacagt ctttaaggga gattccatag cagaaacaaa taaactggat attaaagtgt 3000 atttcaaaga aactgatact catcaactct tgcacaagag ctcctttcac ccaaaacata 3060 ccttttctgg tataattaag tcacaagttt tacgatttca tcgaatttgt acaagatctt 3120 ctgatttcaa ccaagcctgt acgattgttt tctcaaaact gaaagagcgt gggtactcta 3180 aacgattttt acgaaacata aaaaattcca ccttagcaca gatagcagct aaaaacagac 3240 tttatgccga agtaccccca ggtttaagaa gtgctccatg tttgtccagt agatgcaaaa 3300 catgtccctt tatatcagaa actcattatt ttgtaagcaa ctctacgaat aaaaggtacc 3360 cgatccagga acatctaaca tgtgaacatt cgaatgtaat atatttaatt tcatgcaaga 3420 aatgccccaa acaatatgta ggagaaacta agaataattt acgaactcgc tttaatgctc 3480 ataggtcaga catcaaatta aagaaagaca agcctgtttc taatcatttc aacctcgacg 3540 accattcaat agacgatctc accctaatcc caattgagcg tatagaaatt gacttgcccg 3600 aagatgaaac taccaaattc cgtcaatcta gggaggccca ttggattgag acattgaata 3660 cagcaagacc tataggacta aatattcatc ataatgtagg catagcaccc tttgttgtcc 3720 cgtttaatac aaccagcaat cgagcaagca agatagttca tgcccactat aagagccttc 3780 aagagaaatt cccccatgtc tataagcaga aaatgatagt tgcatattca aaaaataaga 3840 acttaaatga taatattgtt catagtaaac taaaaacaat cagtacataa cagatactat 3900 acttctcgcc catcgggcga gttatttttt ctaatttcag atcagaataa gaacagaaga 3960 aaacccaata tcaagaccct taatcctaat ttctaaataa agtgtctgct ccaccctagg 4020 cctgtacatt tgggagaatc aaaccctccc ttccccgaaa 4060 // ID Gypsy-602_AA-LTR repbase; DNA; INV; 131 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-602_AA_; KW Ty3_gypsy_Ele77; Gypsy-602_AA-I; Gypsy-602_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-131 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 131 BP; 26 A; 40 C; 37 G; 28 T; 0 other; tgttggggta ccaccctatg cgcctttgat cacggctgtc acccgacagc gagcccgaca 60 ggtgtcatcc aaccggatgc tgggtggtac ttgtcatcta ggcaatcgta gcagtacacg 120 ccggtcgcac a 131 // ID BEL-627_AA-LTR repbase; DNA; INV; 704 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-627_AA_; KW Pao_Bel_Ele182; BEL-627_AA-I; BEL-627_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-704 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 704 BP; 185 A; 162 C; 178 G; 179 T; 0 other; tgatcgcgag tgctctatcc ccgagctaca accctggtgg attatcaagg tagccgtgca 60 tgtaatgcaa gccaccgtct atctgctagg tatctacatc acctctatca ccggctcgtc 120 aatgcgagcg aggagagata ttctggtagt ctaccatcga gcttcgtccg ttatcagctt 180 gcgcgagaag gagagctgag cattatcagt gtcggagttc aaacgaagct acgaatctcc 240 ggagagctga cccgaacatc cagaagacgg cttgaggtgt tttctccaga agaacagaaa 300 ccagaaacga aaaggtcacg gtccggtccg gtacgagagc acctacgctc gtcggttttt 360 taatctgttg tagatataag tgtagtttat agttaggaat aaaagtcagt gtccagtttt 420 ttaataaatg tagttgtccg taaaaataaa tgtgtaatta aagtgttcat gtgcatttcg 480 tgtttggaaa tgctattggt ccttcgaaag gaattagccg tggttggaac gaagaaaacc 540 acggtcttca acccctggga atttcttgct gcatggtgaa tgcatgcccc cctgcgtttc 600 ccacctcatc atcagccacc ggtcgatgtg cggtaacctg cccggaagga gcgatcgaag 660 gtctggatac aactcggcac caggtaacag ggcagatccg ttca 704 // ID Copia-117_AA-LTR repbase; DNA; INV; 316 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-117_AA_; KW Ty1_copia_Ele78; Copia-117_AA-I; Copia-117_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-316 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 316 BP; 99 A; 50 C; 62 G; 105 T; 0 other; tgttgaaata tcaatttctg caccggtcta atctagatgc tgataatgga ccactatgga 60 caaaagattg tatgccaacc ggctggacaa tagtttcgct ttgccgttcg tctattgtac 120 acacatgaca tgcataaacg aaagagcgtg gataggtttg gagtttgtga taatttgtat 180 atatagtttt atctatcgct atcactccat gtattgtaaa tatgataccg accaaattga 240 attgaataaa ttgtcttttg gatatcgatc tgaagaatac gttaataagg tatattgaga 300 ttgatgtcaa acctca 316 // ID Sola1-N1B_CQ repbase; DNA; INV; 1019 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; KW Sola1-N1B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1019 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 95-95 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >83% CC identity. 32 bp TIRs. 4-bp TSDs. ~77% identical to Sola1-N1_CQ. XX SQ Sequence 1019 BP; 359 A; 166 C; 157 G; 337 T; 0 other; ctgccgctct aatctagtcc gtcccatatg taaaaagtga gtgctgagat aacgctgtga 60 gaagcccaaa aaaagtttca ccatttttct cattctaaac caacattttc tatataattt 120 tttatacagt tatttcgata tcagtctaaa aaacaccact atcataaatt tgatttaaaa 180 aactaacaat ttgaaggtaa atttgtgaaa ttttcaacaa gagaccgact tgactttatt 240 tgtccataaa taaaggcagg tcagtcatgg tatcaaggta ggcttgaaca tttacaccag 300 aggttcaacc tgattcaact gtcatttcca ggtttgtttt ggtagttttc caagtaatta 360 atggtcacca aatgatacaa atttgtttat ttcagatttt tatagaattc caagatacaa 420 aaatatgacc agcaacgaca ccagaaattt tttaaataat acagaggcaa atgacatatg 480 aatgcaaatc tcaactgctt aaccctgaaa ttaatttcaa aaaatccgct tttcggaggt 540 ggaatctttt ccagaaccag gatcaagcct attcattcaa gacaaaatag aactaattgt 600 atggtgtaga tattagccat aacggtatgc attttatttt ttctattcgt ggttttccct 660 tgaagaagaa aaaaagcaaa aaggtttgga ttatttcgtt tttaatattc aaatatttct 720 tcagaaatga acagataatg atattttcaa gctttacgta ttcataaaga catactctgt 780 aaaatttaaa caattctgca tccactcaaa cgatttgcga gacatttcta tatgaggaac 840 tgacttgcct ttattttgca tatgggacag cacgaaagtc atatttattt tggaattttt 900 agaacaaaca ttacttgttt attcaatagg ctaaaataac atgtaaaaac aagacttttg 960 tgaagtttat ctcatttttg acaaaaatac atatgggacg gactagattg gagcggcag 1019 // ID Gypsy-592_AA-I repbase; DNA; INV; 6314 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-592_AA_; KW Gypsy-592_AA-LTR; Ty3_gypsy_Ele160; Gypsy-592_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6314 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3261-3776] - Reverse transcriptase CC Positions [5187-5663] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(997..2232,2235..6008) FT /product="Gypsy-592_AA-I_1p" FT /translation="MDLQEVRSDLELRIAELNSILTNFSKAPRRNYSQKFL FT HAKENRAKIIYNDVNTILAYNEALFHETELTFYTRGVRQKYNEIILLVNSK FT LPFAKADAVDFKTAANVLLFLAKLRTKVNLRRVLPPIMAAPTVDIKLGTSL FT VPVYNGTPEHLESFVDAVNLFSDTVVNTFAGATADQQAAAQVTVTRFVKTR FT LTGVARQVIAEANDLQGILNAVKQHCEPKTTSDNIIAKLKAVKQKDSAESF FT CNEVEKLTAQLKATYVREQIPLDRASIMATKKGVDALIDGTKSSDTKIILK FT AGTFSKISDAVQKLMENDKPTPTTHNAQIFTLRGTNSQYRGRNINHGRGNF FT YNTRGNHNNSHRYRGNYHHQRGNFHNRGNFNRNDQQTRGSYLHRGSYPQRG FT HFNAHHGMFLAQHIPQHPATATKSPANTANFPGIPSNPPKQSKCTSKFFRC FT PPGAVYAINLAQTNFVLLNIEGSEQQCSFIVDCGSDISVLKISKIKSQQYY FT NPNDNCTISGIGNGTVTSYGSIITKLQINEESLQHKIHLVDNTFPIPTDGI FT LGRDFFTAYRCCIDYNTWMLTIKTHHNITSIPIFNEAKTKHIIPPRCEIIK FT KINLPDLSEDMVTISNEIAPGLVYANCIINNDNKFLKFINTTNDNLIIPEN FT FFPNMTPLRNYCYHNKISPHTTNNLRNEQLLRELQLKNIDPKIKPELERLC FT MEYNDIFTLQDDSLTTNNFYEQEILLNDNSPVYIKNYRTPEAQKTEIEVQV FT QNMLDQNIIQPSISSYNSPILLVPKKTASDDKKWRLVIDFRQLNKKIVPDK FT FPLPRIDEILDQLGRARYFSTLDLMSGFHQIPLHQHCRKFTAFTAGNGHFE FT FTRLPFGLNISPNSFQRMMTLALSGLPPECAFLYVDDIIVIGCSLSHHLNN FT LSNVFARMRSCNLKLNPKKCCFFRSEVTFLGHHVSQEGVQPDKTKYSIIKQ FT YPVPKNADETRRFVAFCNYYRRFIPNFAEISHPLNHLTKKDTKFEWNEQCQ FT KAFLTLKHQLLSPRILKFPNFKKDFIITTDASKVACGAVLAQEYDDIELPV FT AFASRTFTKGEANKSTIERELAAIHWAIKYFRPYVYGRKFVIKTDHRPLVY FT LFTMKDPSSKLTQMRLDLEEYDFVVQYIKGKSNVISDALSRVDLDVQELKS FT MYILTRSMTRNKPTPVIVQSASPEPDQFRVYDAITGRDTFDLPKLNFVVNQ FT NTNKVNLVLFNKNYKKSLALEQDLELLNLLPTLQKAFGKIDKGIPTINNTN FT INVRTNKKTKTRNAIKFSSKKPNHFALATNDQIFQYIDVNTFKELGNKILR FT NVTIMLYDRPKIISKSNDIENILHQYHNTPLGGHPGINRLYRKLRPIFYWS FT NMKQTITNYVKKCELCKINKHFTHVQTPAIITTTPIKPFEVVSVDTVGPFP FT LSINGNRYAVSMQCDFTKFVILVPVSDKTAATMAKAVVEHCILIYGPMTKI FT KTDQGTEFKAVFDEMCKTLQITHACSAAYHPQTIGALERNHRCLNEYLRNF FT CNDRSNDWDNWLAYYSFAYNTTPNLDHNYTPFELVFGKKLNSFEIVQNKIN FT PVYNYDSYEKEMKFRLQIAHTQVLDAIRKIKMKRTHKLNENVRNTDIKIGD FT IVYLEQENRTKLDQVSSGPYKVITTTVSNATITNDQNKCIEVHKSRLLKFK FT N" XX SQ Sequence 6314 BP; 2322 A; 1232 C; 1054 G; 1704 T; 2 other; tggcgaccgg actgtgaatc aaagtgaaaa ttgtactaaa atatttgaac acatcatgtt 60 agtgaaagtt ctgttcctga taacaacgat tattctttgg tgctttgctt atacgttgtg 120 gaaaatgcta acagtatatt accagtgaaa atcaaaaatg ggcaaaagca gttcaaaacc 180 aaaaataacc acgagtggaa ctaacgtgga aataaacact attctggaac aacaccaaga 240 actacacgag aaccatgaat tcaagctatg gctaatcctt gtcctagttg caatacaagt 300 gatagttcta tgttataaag aactcaagtt acaaagccgc agaaatgctt taaaggcagc 360 gaaaagtgtg gccaatatcg aaaacgtgta aagtgcaaga agaacataat caacgactta 420 tgcaatctgt aacgtccgta agcagcagga gtgcagtgac ttcaaaagtg tattatgcca 480 atcaatgacc atgtattaca tacatcaacg gctgactatg aagtaatgca atccgtaacg 540 tcagcagcag gagcgcagtg acttcaaaag tgtattaaac ccagggctaa acggcaactt 600 ggagtggccc ggtcgatggc taatcggcaa catggagcgg ccaagggagg atagcgacaa 660 cgggaggtat caacatttca acacccgaca cccstaaatg aatgacgtgt tcttcgatgt 720 taccgattaa caggtgagcc agtgatcgtc ctcaattccc gctatcgcta aacaattgaa 780 aaagtgaaac ttacaaatga cctgcaacat gctaatttaa gttcctttta atggtaatga 840 agaaaaaaaa ccccaaggtc gtgctttgtg ctagatgtga atcattaata ttgagttaat 900 tttaggtaac tttaaaaatt ttgagtaaat tcccaaacca atggatctag ttcaaacggt 960 gtaaatgtat attgattatc gaattcaatc ccatttatgg atttacaaga agtaaggagt 1020 gatttggaac taagaatagc agaattgaac tcaattttaa ccaatttttc aaaagcacct 1080 agaagaaatt actcacaaaa gtttttacat gcaaaagaga atagagccaa aattatatat 1140 aatgacgtga ataccatttt ggcttacaac gaagcattat ttcatgaaac cgaattgaca 1200 ttctacacta gaggggtgag gcaaaaatat aatgaaataa ttcttcttgt taatagcaaa 1260 ctaccttttg ctaaagccga tgcagtggat ttcaaaactg ctgcaaacgt gctattattt 1320 ttggctaaat tacgaacaaa agtgaatttg agaagagtct taccaccaat catggcagcc 1380 ccgactgttg atattaaact tggcactagc ttagtaccag tgtataacgg tacacctgaa 1440 catctggaat catttgtcga cgcagtgaac ctttttagtg atactgtcgt aaatacattc 1500 gccggggcta ctgccgatca gcaggcagca gctcaggtta ccgtaacaag attcgtcaag 1560 actcgactaa cgggcgttgc ccgtcaagtc attgctgaag ccaacgactt acaaggcatt 1620 ttaaacgcag taaaacaaca ctgtgaacca aaaacaactt ccgataatat cattgccaag 1680 ctcaaagccg tgaaacagaa agattctgcg gaatcgttct gcaatgaagt ggaaaaactg 1740 acggcacaac ttaaagctac ctacgtcagg gaacaaatcc ctttagatag agcaagtatt 1800 atggcaacaa aaaaaggcgt cgatgctcta attgatggaa ccaaaagcag cgacactaaa 1860 attattttga aagctggtac cttttcaaag atttctgatg cggtacaaaa gctaatggaa 1920 aatgacaaac caactcccac gacacacaac gctcaaatct ttactctaag aggtacaaat 1980 tcacaatatc gcggtagaaa tattaatcac ggtcgaggaa atttttacaa cactagagga 2040 aaccacaata attcgcatag atatcgtgga aattatcatc atcagcgagg aaattttcac 2100 aacagaggaa attttaatag aaatgaccaa caaacaagag ggtcgtacct ccatcgtgga 2160 tcatatccac agagaggaca ttttaatgcg catcacggaa tgtttttagc tcaacacatt 2220 ccacaacacc cawtgccaca gcaacaaaat ctcccgcaaa tacagcaaac ttcccaggta 2280 tccccagcaa tccaccaaag caatcaaaat gcacctcaaa attttttagg tgtcccccag 2340 gggcagttta tgcgataaac cttgcccaaa ctaattttgt acttttaaat attgaaggtt 2400 ccgagcaaca atgtagcttc atagttgatt gtggctcaga catttccgtt ctgaaaatca 2460 gtaaaattaa atcccaacaa tactataatc caaatgacaa ctgcacaatt tctggaatag 2520 gtaatggtac tgtaacctcc tatggaagta tcattactaa attacaaatt aacgaagagt 2580 ctttgcaaca taaaattcat ttagtggaca acactttccc aatccccacg gatggaattt 2640 tgggaagaga cttttttaca gcatacaggt gttgtataga ttacaacact tggatgctta 2700 caattaaaac tcaccacaat attacttcaa ttccaatttt taacgaagca aaaacaaaac 2760 atattattcc cccaagatgt gaaataatta aaaaaattaa tttgcctgat ctctcagaag 2820 atatggttac catatccaac gaaatagcac ccggattagt gtatgcaaat tgcataatta 2880 ataacgacaa caaattctta aaatttatca acactacgaa tgataatttg attattcccg 2940 aaaacttctt tccgaatatg acaccattgc gaaattattg ctaccacaat aagatttctc 3000 cacacacaac gaacaatcta agaaacgaac aattgctacg cgagttacaa ctaaaaaata 3060 ttgacccaaa aataaaacca gagcttgaac gtctatgtat ggaatacaat gacatattca 3120 cattacaaga cgatagcctc actacaaata atttttacga acaggaaatt ttgctgaatg 3180 acaattctcc tgtttatatt aaaaattaca gaaccccgga agcacaaaaa actgaaatcg 3240 aagtacaagt tcaaaacatg ttagatcaaa atattattca accttcaatt tcttcataca 3300 attccccaat attacttgtt ccaaagaaaa cggcttccga tgataaaaaa tggagattag 3360 ttattgattt tcgacaacta aataagaaaa tcgtaccaga taaatttccg ttacctagga 3420 ttgatgaaat attagaccaa ttaggcagag caagatactt ttctactctt gacctaatgt 3480 ctggatttca tcaaattcct ttacaccagc attgtagaaa attcacagca tttacagcag 3540 ggaatggtca tttcgagttt actaggcttc cgtttggact aaacattagt ccgaacagtt 3600 ttcagcgcat gatgacgctg gcgcttagtg gcctaccacc cgaatgcgca ttcctttatg 3660 tagatgacat aattgttatt ggatgttcgt tgagtcatca ccttaataat ctttccaacg 3720 tttttgctcg gatgagaagt tgcaatttaa aattaaatcc gaaaaaatgt tgttttttca 3780 gatcagaggt cacatttctt ggccatcacg tttcacagga aggagtacaa cctgataaaa 3840 ctaagtactc aattataaaa caatacccag taccgaaaaa tgctgacgaa acacgtagat 3900 ttgttgcatt ctgcaactac tacagacgtt tcatacctaa ttttgccgaa atttcacacc 3960 cactgaacca ccttaccaaa aaggatacaa aattcgagtg gaatgaacaa tgccagaaag 4020 catttcttac attaaaacac caattattat caccaagaat tttaaaattc ccaaatttta 4080 agaaagactt cattatcacg actgacgcct caaaagtggc ttgtggagca gttctagctc 4140 aagagtatga cgatatcgaa ttacctgtgg cattcgccag ccgaacgttc acgaaaggtg 4200 aagcgaataa gtcaactata gagcgtgaac tagccgctat ccactgggcg attaaatatt 4260 ttcgaccata tgtatatggc agaaaatttg taattaaaac cgaccatcga ccgcttgttt 4320 atttatttac catgaaagac ccttcatcca agctaacaca aatgcgattg gacttggaag 4380 aatatgactt cgtagtacaa tatataaaag gcaaaagcaa tgttatttca gatgcactct 4440 ccagagtaga tctagatgta caagaactta aaagtatgta tatattaacc aggtcaatga 4500 ctagaaataa accaacacct gtaatcgttc aatcggcttc accggaacct gatcaattcc 4560 gagtgtatga tgccattaca ggtcgtgaca cttttgactt acctaagctt aacttcgttg 4620 ttaaccagaa tacaaataag gttaatttag ttttattcaa taaaaactac aaaaagagtc 4680 ttgctttaga gcaagatctc gaattgttaa atttattacc cacgctacag aaagcattcg 4740 gaaaaattga taaagggatc cctacaatta ataatacaaa tataaatgtg cgcacaaata 4800 agaaaacaaa aacaagaaat gctattaaat ttagctcgaa aaagcctaat cattttgcat 4860 tggctacaaa tgaccaaatt ttccaatata ttgatgtcaa tacatttaag gaattaggta 4920 acaaaattct taggaatgtt accataatgt tgtacgatag accaaaaata atttcaaaat 4980 cgaatgatat tgaaaatatt cttcaccagt atcataacac tcctctcgga ggacatccag 5040 gaataaaccg tttgtacaga aaattacgac caattttcta ttggtcaaat atgaaacaaa 5100 ctattacaaa ttacgtaaaa aaatgcgaac tttgtaaaat taacaagcat tttactcacg 5160 tacaaacccc agccataatc acgacaacgc ccattaaacc attcgaagta gtatcagttg 5220 atactgtagg accgtttccg ctttcaatta atggaaatcg atatgcagta tcaatgcaat 5280 gcgacttcac aaaatttgtc atattagtac cggtctcgga caaaacagca gccactatgg 5340 caaaagccgt agttgaacat tgcatattaa tatatggacc aatgacaaaa attaaaactg 5400 atcaaggtac agaattcaaa gctgtttttg atgagatgtg caagactctg caaataactc 5460 acgcttgttc cgccgcctac catccacaaa cgattggtgc gttagagaga aatcataggt 5520 gtttgaatga atacctacga aatttctgta acgatcgatc taatgattgg gataattggc 5580 ttgcatacta ctcttttgcg tataacacaa ctcctaacct tgaccacaat tacacaccgt 5640 tcgaactagt atttggtaag aaactaaact cttttgaaat agtacaaaac aaaataaacc 5700 cagtatacaa ttatgattcc tacgaaaaag aaatgaaatt cagattacaa attgctcata 5760 cccaagttct agatgccata cgaaaaataa aaatgaagag aacccataaa ttaaatgaaa 5820 atgttagaaa cacggatatc aaaattggcg atattgttta tttggagcaa gaaaatagaa 5880 ctaaactcga tcaagtcagt tcaggacctt ataaagtcat tactacaaca gtatcgaatg 5940 ctacaattac aaatgaccaa aacaaatgta tagaagttca taaatcacga ctacttaaat 6000 ttaaaaacta aattttctaa aggggaaaga gtatctgatg atcttttaaa atgtagtatt 6060 tacattccaa tctcaagatt aacttttcca tagatcaata acacaacaaa ttatatcgca 6120 accgaaaaac agcatgtgat atacaaagaa aaattaacga aaaaacaact tattttgtga 6180 gaggtacttg gagtcatatg gcatgcttta tatgttctac gaacattttc ctttaggggg 6240 atggtgtggc atctgtattc cacatgtcat ccaatcccgc atagcaacca tagtttctag 6300 aaggaccaca ccag 6314 // ID BEL-191_AA-I repbase; DNA; INV; 7188 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-191_AA_; KW BEL-191_AA-LTR; BEL-191_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7188 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 873-873 (2011). XX DR [2] (Consensus) XX CC Positions [5213-5791] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 4700..6175 FT /product="BEL-191_AA-I_2p" FT /translation="MVALVRRFHYNSNPRNRNAKRTGFLSTVELDQSTEQL FT VKIAQEERFSQDIAEVIRDGQVKKNSRLKALLPRLVNGILRVGGRLRNAPV FT TYNQRHPMILPDRHPLTEMILRFYHLNNLHAGPQLITAAVRERFWPLRIRD FT QARQVVHSCLRCFRCKPEVMEQLMGELPSERVTPTLPFLRSGVDYCGPFFY FT RTSRKGVPIRCYVAIFICMVTKAVHVECVADLSTSAFIAALRRFVARRGKP FT QLIECDNALNFRGAKRELDELVRLFGSQQHQHLVTTSCAEDGITFKFIPPR FT SPNFGGLWEAAVKSLKKHLRCTLLNTILSLDEFQTLLTQIEACLNSRPLTQ FT LTADPNDLEVLTPGHFLVHRPLITMPEPSLADVPMNRLDRYQQTQEYVRRI FT WRQWQSDYLSGLQPRTRWTRQRDNVNVGTMVLLKDDNLPPLKWRLGRVTQV FT FRGDDNYVRVVTVRTQDGEFRRAITKICVLPIQQPGGDPIADEPEGN" FT CDS join(1479..3005,3008..4363) FT /product="BEL-191_AA-I_3p" FT /translation="MKKECPDSLNKLIDEYERNLRMLEKVGEHPENWSTLL FT VFMLSSRLDPATMRHWETHRRSTNVPTYKELIEFLRSHSLILQSVAAPKYQ FT PSDPPRTTPSARNAPIKLTSANSAISPPQKSCAFCKQSPHSPFHCEVFRKM FT TASERFDATKKNSLCINCLSPSHQMKNCSSGACRVCNQKHHTMLHQRPNPS FT NQTSPPTQTQPSKPSSSSSSQSSFSPSAPTNHSQSSHASKPSTSSNENQAP FT TISALSTHHVSSSPSKQHHRVQSTVLLSTALVKVFDPEGQSLWARALLDSG FT SQLNFVSEQLVQKLKLKRSKDFLPISGVGLSSTSSKYSVVARIQSHHADFE FT TSWRFHVLPKITMELPTQAVDVSSLQLPSDLVLADPSFGEPGPIDLIFGAE FT NFYDLLREGHFKAGPYQPNLQNTALGWVVSGKVESAILESSVVNVAYATPT FT IEEQLARFWEIESCQNSSTLSLEETACEQHFSKTTIRDQPDGLSLHCQKES FT QHSPNLVVQRSSPRRFLSLERRLNADPQLKQAYTAFVNEYAELGHMKLVDN FT PETHSPTYFLPHHCVVRPDSITTKLRVVFDASCATDTGVSLNDALMIGPQV FT QDDLVAIILRFRLPQFAIISDIEKMYRQIGMDPVDQRLQLILWRDSPSEPI FT RTYQLTTVTYGTSSAPYLATKCLQTLANIGLEGHPAAAAVVGKDFYMDDLL FT TGADSVEEGQELVLQLLQLMNSAGLQLRKWASNSPQILQVVPEHLRDERTL FT LDLEATSPVKTLGLQWNTRTDEFCFEVPKHSDVHPVTKRIVLSDIARLFDP FT LGLVGPVVVQAKLFMQELWREGKAWDDELSNSAQQRWVEFRENLRDIASLS FT VPRWVFATPSIAAVEIHGFCDASTLAYGACIYVRTVASNGDISANLLTAKS FT KVAPLGNSKRNPTISLPRLELSGAPKPPLRKGRIQYKYPSQEFLLDGLDYS FT LSLAIF" XX SQ Sequence 7188 BP; 1855 A; 1876 C; 1749 G; 1704 T; 4 other; taaaaatctg gtccttcgag ccggatcaag gagagtcggg aacagtgttt gcgattcgcg 60 ttcggttcgg agtgattttc gtgaaaaaag tgcaagtgaa cattagtgaa agtgactgga 120 accctttagt gtcacgaaag atagtgacga agaaagcgtg atcacgatca cgaataaaag 180 tggatccaga aaagaaacga caactgtgac tgcgagctgt gattcgttgc gaacgtgcaa 240 ttcagcgata attagcacag aaagtgatag tgaaaccacc tcggacccac gctggaccac 300 gttggattct ggcgtcatcg agaagcatcg gtctattgtg ctattcatcg ctctccccca 360 ttgtcgtcgg tggcgtggct tgggtgtcga aacgtggtgt ggcttcggtt ctttctctac 420 cccaacggag actgtaagta cacttgcatg cgcatatgac ccgagtggca agatatacca 480 gaagtaagac cagtgtgctg gcttactaaa tttttcgcaa ttttgcatga ctcgagtggt 540 ttcggatgtg agcaattttt gggattttgg gtagtgattg gcactgtgga aggtacgtgt 600 ggaaaacgtg gttcatcgaa tgttctagtg cagttgggcg ggaaagatgg ctactgaact 660 caaaaggctt atcaagcagg agcgattttg cgtagattct ctcatcaatc tagagaagaa 720 cgtacgaaaa tttgatgatt cagcgggcgg gggcccggcc ggcggcgcgc gccgcgccgg 780 gggcggggcg cccccggcgg gccgggggcc gccccgggcg gccgccgccc cgcgcgcggc 840 ggcggcccgc gggggccgcc gcgggcggcc gcccggggcg cgccgggggg gatcaggata 900 tgcttgaagg ttggcaagaa tatttggagg aaatttacgg cgaatttaaa cgggtcaggt 960 tagagttgga gttgtcggat gaatttcgca atgcactaga gcagtctatg aaggtggagg 1020 aaggttcgtc tgataatttg gacctgacaa ttaaggccac acgtgctgaa gtcgagctca 1080 cgtatttgaa gatcaagggg tttataaaat ctaaattaca aaaaccaagt acatcaaccg 1140 ttmwtgctgc ccaaccagtt ccattaccag tgaatactat tcagtctcga gtgaaacttc 1200 cggaaattaa acttccgcat ttcgatggat ccattcggga ctggccaacg ttccgagaca 1260 catttcgttc gcttattgat tccactccac agctcagcaa tgtggacaaa ttttcgtact 1320 tagtctcgtc cctttcgaaa gaagccaaac gtgtgatcga agttattgag gttacttcgg 1380 caaattattc ggtagcgtgg gaattacttg aaaagaggta cgagaacaaa tacctcattg 1440 ttaaagcgta tatcgaagcg ctttttaggg tcgagtcgat gaagaaggag tgtcctgact 1500 cactcaataa gctcatcgac gaatacgagc gtaaccttcg aatgctcgag aaggttggtg 1560 agcacccaga gaactggagc actcttcttg tattcatgct cagttctaga ctcgatccag 1620 ctaccatgcg ccattgggaa acccaccgta ggtcaaccaa tgttcctacg tacaaggagt 1680 tgatagaatt cttgcgaagt cacagcctca tcctgcagtc agtggcagcc cccaaatatc 1740 aaccatctga tcctccccgg acaactccct ctgctcgtaa cgcccccatc aaactaacct 1800 cggctaattc tgccatatcg ccaccgcaga aatcgtgcgc tttctgtaag caatcccctc 1860 attcaccctt ccattgcgaa gtctttcgga agatgaccgc tagcgagcgg ttcgacgcca 1920 caaagaagaa ttcgttatgc atcaactgcc tctcaccttc tcatcaaatg aagaactgtt 1980 cgagtggagc ctgtcgagtg tgcaaccaaa aacatcatac gatgctgcat caaagaccca 2040 acccgtctaa tcaaactagc cctcctaccc aaacccaacc atcgaaaccg tcgtcgtcgt 2100 ctagttcgca gtcatccttt tcgccgtccg ctccaacaaa ccactcgcag tcctctcatg 2160 cgagcaagcc ctcaacatcg tccaatgaga accaagcgcc cacaatatct gccctttcga 2220 ctcaccacgt ctcatcgtcg ccatcaaagc agcatcatcg agttcaatcg acagtgctgc 2280 tctctacagc cctggttaag gtattcgatc ctgagggaca gtctctgtgg gctagagctc 2340 tactggattc ggggtctcag ctgaacttcg tatcggagca acttgtccag aaactgaagc 2400 tgaaacgatc caaggatttc cttccgatca gcggtgtcgg actatcatca acctcttcca 2460 aatattccgt cgtcgctcgt atccagtctc accacgcaga tttcgaaacc agttggagat 2520 tccatgttct tcccaagatt actatggaac taccaacgca agcggtggac gtttccagcc 2580 ttcaattacc ctccgacctg gtcctagctg atccttcgtt tggcgaaccg ggtccaatcg 2640 acttaatatt tggcgctgag aacttctacg acttgttgcg cgaaggccat ttcaaggccg 2700 gtccctatca gcccaaccta caaaatacgg ctttaggatg ggtggtgtca gggaaggtcg 2760 agtcagcaat cctagagtcg tcagtcgtta acgtcgcata tgctactcca accatcgaag 2820 aacaactggc tcgattttgg gagattgagt cttgtcagaa tagcagcacg ttgtcactgg 2880 aagaaacagc ctgtgagcag catttttcaa aaacgaccat ccgagatcaa ccggacggtt 2940 tgtcgttaca ctgccaaaaa gagagtcagc attcgccaaa cttggtagtt caaaggtcgt 3000 cgccamtcgt cggttccttt ccctggaacg ccgcctcaac gccgatcctc agttgaagca 3060 ggcatacaca gcattcgtca acgagtatgc ggagctcggg cacatgaagc tagtggacaa 3120 tcctgaaaca cactcaccca cctacttcct tccacatcat tgtgtagttc gaccagacag 3180 catcaccact aaattgcgag tggtcttcga cgcatcttgc gcaaccgaca caggagtgtc 3240 attgaatgat gctctcatga tcggtccaca agtgcaggat gatcttgtcg caatcatttt 3300 gcgattccgt cttcctcagt ttgccatcat ctctgacatc gagaagatgt ataggcaaat 3360 cgggatggat ccagtcgatc aaaggcttca acttattctt tggagagatt cgccgtcaga 3420 accaatcaga acgtaccaac tcacaaccgt gacgtatggt acatcttctg ctccgtacct 3480 cgctacgaaa tgcctccaaa ctctcgcgaa cattggattg gaaggccatc ctgcagctgc 3540 cgctgttgtg ggcaaggatt tttatatgga cgacttgctc accggtgccg acagtgtcga 3600 agaaggtcag gaacttgttc tacagctact ccagctgatg aattctgctg gattacaact 3660 gcggaagtgg gcctccaata gcccacaaat ccttcaagtc gttcctgaac acttaaggga 3720 tgaacgtact cttctcgatc tcgaagctac ctctccggtc aaaactctcg gtctacaatg 3780 gaatactcgc actgacgaat tctgtttcga ggtaccgaaa cacagcgacg ttcatcctgt 3840 cacaaagcga attgtccttt cggatatcgc acggctgttc gaccccctgg gactagtcgg 3900 cccagtggtg gtacaggcta agctgttcat gcaagaattg tggagagaag ggaaagcgtg 3960 ggacgatgaa ctcagtaact ctgctcaaca acgctgggta gagttccgtg aaaatcttcg 4020 agacatcgca agcttaagcg taccgcgttg ggtcttcgct acaccaagta ttgcggcggt 4080 tgaaattcat ggattttgcg atgcgtcaac tcttgcctat ggtgcctgca tctacgtcag 4140 gacagttgcg tccaatggcg acatctctgc taatcttctc accgcaaaat ccaaggtggc 4200 tcctcttgga aattcgaaaa gaaacccaac gatcagtctg ccgcgcctag agctatccgg 4260 cgcgcctaag ccacctcttc gaaaaggtcg aatccagtac aaatatccaa gccaggagtt 4320 tcttttggac ggactcgact atagtctatc attggctatc ttctaatcct tctcgttgga 4380 aaacgtttgt ggccaaccgg gtctccgaga tccagcgaat cactgctaaa ggaatctggg 4440 cacatgtgcc agggctggaa aatcctgcgg acgtcatctc cagaggaatg cctccaagcg 4500 aactgaagga attcactcct tggtggaatg gtccgactgg cttcgacaac cttcccgctt 4560 ttggccgtcg ttgtctgctc ccgtccaaga cgagtatccg ccggatcagc ttgaagaacg 4620 catcgtagcg ctacctgttc aagtgtgccc gccaaacgag ctgttttcgt tattttcgtc 4680 gttcacgaaa ttagtacgaa tggtagctct agttcgaaga ttccactaca actccaaccc 4740 gcggaatcgg aatgcgaaac gcacgggttt cttgtcgact gtcgaactag atcaaagtac 4800 cgaacaattg gtgaagatcg cccaggagga acggttttca caggatatcg cggaagtgat 4860 tcgcgatggg caagtcaaaa agaactcccg actgaaggca cttcttcctc gattggtgaa 4920 cggcatattg cgtgtcggtg gccggctgag gaacgctcca gttacgtaca accaacgcca 4980 tccaatgata cttccggata gacatccgct gacagaaatg attctgaggt tctaccacct 5040 caacaacctc cacgctggac cacagttaat aacagccgca gttcgggagc gattctggcc 5100 acttcgtatc cgcgaccaag cacgacaagt ggtgcattcg tgcctaagat gtttccgctg 5160 caaacctgaa gttatggaac agctaatggg agaattgcca tcagaaaggg ttactcctac 5220 gctaccattt ctccgctccg gagttgacta ctgtgggccc tttttctacc gtaccagcag 5280 aaaaggagtt ccgataaggt gctacgtcgc catcttcata tgtatggtga cgaaggcagt 5340 ccacgtcgaa tgtgtagccg atttgtctac tagtgcgttt attgccgccc ttcggcggtt 5400 cgttgcccgg cggggtaagc cacaactaat tgagtgcgat aatgcactga acttcagagg 5460 agctaagcgc gagctggatg aactcgtccg tcttttcgga tctcaacaac atcaacatct 5520 agtaaccacc agctgcgctg aagacggtat cacgttcaag tttataccac cgagatctcc 5580 gaatttcggc ggcctctggg aggctgccgt caagtctctc aagaagcatc tccgctgcac 5640 tcttttgaac actatcctat ctttggatga attccaaacc ctcctcactc aaattgaagc 5700 ctgtctcaat tctcgtccgc tgacccagct aactgcagat cccaatgacc tggaggtgct 5760 cacaccaggg cactttttgg tccatcggcc actcatcacg atgcctgagc caagcctggc 5820 tgatgttccg atgaacaggc tcgacaggta tcagcagacc caagagtacg tccgtcggat 5880 ttggaggcag tggcagtccg attacctatc tggtttgcaa ccgcgaacac gttggactcg 5940 ccagcgcgac aacgtcaatg tgggtacgat ggtcttgctg aaggacgaca accttccacc 6000 cctcaagtgg agactaggac gagtgacgca ggtgttcaga ggagacgaca actacgtgcg 6060 agtcgtgacg gtccggacac aagacggcga attccggcgt gcgatcacga agatttgcgt 6120 cttacccatt cagcaacccg gcggtgaccc aatcgctgac gaacctgaag ggaattagta 6180 tacctccaac aatcgcagtg tttaccactg cgcaacgccg gggggtcgaa agacctccca 6240 tccatgtaag tctatgtttt atattgattt tcaaaaagct aatggaatgt ttcatccccc 6300 ataacttccc gacgacgacg tccgctgctg ctgccgcgag ttcttgatcc tacgctaacc 6360 aatgatctaa tgttgtaccc aaacgcatcc actatttgtc ggagagacta gtggttctgc 6420 aacaggggaa gctatgatca aaggaagcaa gatcaactcg kgcttggatt ccgctctgaa 6480 ctgaaggatg tcacgtcatc tacacccgca atcgtctcgc cctcggagga aagtcatcca 6540 acaacgcaca agtacgctca gtagcttgat gccaacgatc atcgtccgtt tgcggacccg 6600 cggaggccgg tagcctccaa cccagttaag tgttgttttt attctcaaat attgttggaa 6660 aagccgccta tgttttgttc cagagaatga ccagccatcg agggtgtcga cgaggaccac 6720 ccacccgacg aagagcacgt gcgacgaagt gatgactcct acaccgccat acatcgataa 6780 gccaagcacg gtcgaggtca ctaagtgttt ccgacaacgc acatggctat cgacgaagat 6840 ttcaggagta ttattattat tatagcttta ttaaggagat tttcagccct aggcatccca 6900 cgagtagaag taaacaaccc gatgcagatt cccgacagca gacgaagtgc ggcgcagcaa 6960 gaccactgga cactacaagc tacaagatca tcgaatgcag acagtgacga catgcgagca 7020 gcaacagaaa ccagactatc taaagttaat tccgaaccat gaattgaaaa atgaaattaa 7080 cgtagattaa gagtagaata ggtacagtta ggctaacgaa cattgaattc caattgataa 7140 gctaggggaa gagtagttaa aaacgcgcct ttttaacggt ggccggta 7188 // ID Gypsy-6_RP-LTR repbase; DNA; INV; 128 BP. XX AC ACPB02021416; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_RP_; KW Gypsy-6_RP-I; Gypsy-6_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-128 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02021416; Positions 1718 1845. XX SQ Sequence 128 BP; 27 A; 25 C; 21 G; 55 T; 0 other; tgtatgtact ctttctttgc gagcgtagtt aagtacgcgc ccttacttat attctctgta 60 gatcagtgtt ttaattaaat cgtattgtgt ttctttttgt gttggtaaat cacctacatt 120 ctcccaca 128 // ID REP-1_CQ repbase; DNA; INV; 2099 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A repeat family from Culex quinquefasciatus - consensus. XX KW Repetitive element; nonautonomous; REP-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2099 RA Kojima K.K. and Jurka J.; RT "Repeats from the southern house mosquito."; RL Repbase Reports 11(1), 604-604 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >92% CC identity. No TIRs. XX SQ Sequence 2099 BP; 598 A; 466 C; 387 G; 641 T; 7 other; ttttgccttc ctcactgagg taaggctata atcctgctck aaaaatgaac ttttgawaaa 60 magctcgtag acccacmttc atgtatacmt atcgactcag aatcgaaaac tgaacaaatg 120 tctgtgtgtg tgtgtgtatg tgtgtgcgta ttccccttgg tgcccaaaat tctcgccgag 180 ttttctcggc actggctgat ccgatttkgg tcaaaccagt tgcattcgac ccggtttggt 240 gccccatact gcgctattga attgtttgaa gatccgataa gtagttcaaa agttacgtat 300 aaaaaagtgt cagagtcgcg aatcggatct cacttaaatg catgtaaact atgtccggac 360 ccatcacccg acccatcgtt ggttaggtaa tcgaaagacc tttccaacga gtccaaaaca 420 ttgaagatct ggcaaccctg tctcgagtta tgaccactta agtgatattt atgtactttt 480 ttgaagccgg atctcactta aatgtatgta aactatgtcc ggatccatca tccaacccat 540 cgttggwtag gtaatcgaaa gacctttcca atgagtccaa aacattgaag atctggcaac 600 cctgtctcga gttatgacca cttaagtgat atttatgtac ttttttgaag ccggatctca 660 cttaaatgta tgtaaactat gtccggatcc atcatccaac ccatcgttgg ttaggtaatc 720 gaaagacctt tccaatgagt ccaaaacatt gaagatctgg caaccctgtc tcgagttatg 780 accacttaag tgatatttat gtactttttt gaagccggat ctcacttaaa tgtatgtaaa 840 ctatgtccgg atccatcatc caacccatcg ttggttaggt aatcgaaaga cctttccaat 900 gagtccaaaa cattgaagat ctggcaaccc tgtctcgagt tatgaccact taagtgatat 960 ttatgtactt ttttgaagcc ggatctcact taaatgtatg taaactatgt ccggatccat 1020 catccgaccc atcgttggtt aggtaatcga aagacctttc caatgagtcc aaaacattga 1080 agatctggca accctgtctc gagttatgac cacttaagtg atatttatgt acttttttga 1140 agccggatct cacttaaatg tatgtaaact atgtccggat ccatcatcca acccatcgtt 1200 ggttaggtaa tcgaaagacc tttccaatga gtccaaaaca ttgaagatct ggcaaccctg 1260 tctcgagtta tgaccactta agtgatattt atgtactttt ttgaagccgg atctcactta 1320 aatgtatgta aactatgtcc ggatccatca tccaacccat cgttggttag gtaatcgaaa 1380 gacctttcca atgagtccaa aacattgaag atctggcaac cctgtctcga gttatgacca 1440 cttaagtgat atttatgtac ttttttgaag ccggatctca cttaaatgta tgtaaactat 1500 gtccggatcc atcatccgac ccatcgttgg ttaggtaatc gaaagacctt tccaatgagt 1560 ccaaaacatt gaagatctgg caaccctgtc tcgagttatg accacttaag tgatatttat 1620 gtactttttt gaagccggat ctcacttaaa tgtatgtaaa ctatgtccgg atccatcatc 1680 caacccatcg ttggttaggt aatcgaaaga cctttccaat gagtccaaaa cattgaagat 1740 ctggcaaccc tgtctcgagt tatgaccact taagtgatat ttatgtactt ttttgatgcc 1800 ggatctcact taaatgtatg taaactatgt ccggatccat catccaaccc atcgttggtt 1860 aggtaatcga aagacctttc caatgagtcc aaaacattga agatctggca accctgtctc 1920 gagttatgac cacttaagtg atatttatgt acttttttga agccggatct cacttaaatg 1980 tatgtaaact atgtccggat ccatcatccg acccatcgtt ggttaggtaa tcgaaagacc 2040 tttccaatga gtccaaaaca ttgaagatct ggcaaccctg tctcgagtta tgaccactt 2099 // ID Gypsy12-LTR_Dya repbase; DNA; INV; 354 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy12_Dya; KW Gypsy12-I_Dya; Gypsy12-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-354 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1087-1087 (2009). XX DR Genome; chr3R; Positions 203620 203267. XX SQ Sequence 354 BP; 118 A; 67 C; 83 G; 86 T; 0 other; tgtgggattg gtttttggct gtttacccta atcccactag tatgaaacat ccctagacaa 60 cgacaacgac gataacaacg acgagatgaa gatgatggcg aattagcgag gtggcgaatt 120 cgcaaacacg acgatcaacg aggagcaaaa gcgcaacgct acgagggcga agtgctttac 180 gacagcggta gagagagagt tgttttttgc tcggagaaga aggaagttgt gcgttctgcc 240 tgtgtggcta cgagatcgac aacaatacaa cgacggtgtc gattaatttc aatatcctat 300 attatttgtt acaaaattca aataaaactt atattataat aataaaaccc caca 354 // ID I-62_AAe repbase; DNA; INV; 5304 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-62_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5304 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1333-1333 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 108..764 FT /product="I-62_AAe_1p" FT /translation="MGISYPAAKRQHDLRHNPQSMASVVAAGNDQRFAELS FT AKVDNLVQEVKRKDDRVETLLTEIRNKDAHIERLEAALKLTPQERLTMVKE FT HGTIKDMVQKIRSLESELARKEKEVAVMRDVYVPKKALNTTANNTAKTPAA FT VSEPCANKTNPTSKKTSTKKKVVKTDDYGLLNKRSKNCVSPIDTSEPMNTS FT INYSPESGDIEISSGDEPMLNASGYISDS" FT CDS 834..5072 FT /product="I-62_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MSSHPTTIDPKHQPSAERTSKPASKLNNSANRKPQLF FT KNHQTSPVEPPSSPGPVSAVVYSTPVLADNPGHSLVTGRGTDKGETSYTPK FT DNAEIVAIATTEAHQETQERTVGAPKVHKPSSTHSITYDSLVAERTPRTTL FT PELARSNAAIYDLLQLNQQNIPSPTAGTSGLTASSKRNLRRKSPVKVTTQS FT TPVRQSQPTTVRHSQNASTKTVLAIQWNMNGYYNNLADLEILVRDQEPLVL FT AIQEPHKISINSLNHSLRRQYTWSLKCNQNFYHSTAIGVLVSVPHSPLQFN FT TDLPLVGVKLGYPWPLTVISGYLPNGNIPDLKQKLVDFFQALDTPILVLWD FT ANGHHPDWGSTTANARGSLILEIAQMFDLVVLNDGAPTFIRGERSSTVDIS FT LASSSIVDRLLWAPGEDPMGSDHYPISISINEQPQAISRRPRWKYNEADWS FT QFQESITPKLCDPVPDNIEEFLNIISDSASLSIPRTSTSPGRRSLPWWSPN FT IKKVIKERRKTLRAAKRLPNDHPAKHDAMETYRSKRNQCRQQIREAKKKSW FT EDFLEGINENQTSADLWNRVNALNGKRRSTGMTLQLPNGPTRDPLIIANAL FT ADYFASLSSLDLYDANFLRRNQVSADSIIKMAIPDDTAGLPINAPFRFDEL FT TFALRRCKGKSAGPDDLGYPMFRNLTPEAKAIFLELLNKIWIGNTLPKAWT FT HSLVVPIPKNGRAATSPSDFRPISLTCCASKILERMVNRRLNRFLEDNQLL FT DHRQHAFRAGHGTGTYFAGLGDVLQDAMNANLHADIASLDLAKAYNRAWTP FT RAIHQLAEWGLGGNILHFIKNFLSGRTFEVVIGNNHSTVRAEETGVPQGSV FT IAVTIFLVLMNSVFDTLPKEIYVFVYADDIVLIVIGRTLKFIRRKLQAAVS FT AVSRWATQAGFILSAEKSIVSHICRSHHRVLSTPVTANGCSIPHKTSTVVL FT GVRLDRKLQFGEHLNETKRNCQTRLNLLRTLSKPHRSCNRDTLLRIARAIV FT NSRIFYGIELFGLAGDALITCLAPVYNQAIRIIAGLLPSTPADAACIELGV FT LPFRYQVTQTLCCRTISYLENTTGDHEVFLLREGNRSLSSLVHQELPPVEQ FT VHWVGARRWDATDFRVDLSILKHFRAGDNSSAMQSHVVELLADKYRNYHLF FT YTDGSKTVNRTGFGVATNDTSYFYRLPDQCSIFSAEAAAILFAITRPTERP FT ICVISDSASVLTTINSPSTRHPWIQAIQMKSSPETVFLWVPGHCGIRGNVE FT ADRLAATGRLGRMLTNLIPGADLKTWCKSTIRLEWAREWIGLRHLFIRKIK FT GETTRWIDSAERHNQQILSRLRVGHTHLTHNMGNRGPFHKVCPACNTKMTV FT EHLLIDCPCYQAARLRHAIPGSIRDALANDPTNEAAILSFLKDAGLHRNI" XX SQ Sequence 5304 BP; 1614 A; 1411 C; 1086 G; 1190 T; 3 other; catccggtgc cttttgcaag agatgcaatt ctgcaaacca ctcgatagct agccgaaaat 60 gccccgccta catcaaggaa gaagaaattc aacacatacg tgtcgacatg ggaatttcat 120 acccagcagc aaaaagacaa cacgacttac ggcacaaccc acagtcaatg gcatccgtgg 180 tggcggctgg caatgatcag cgtttcgctg agctctccgc caaagtggat aacctcgtac 240 aggaagttaa gcgtaaggac gacagagtag aaactctact gaccgaaatc agaaacaaag 300 atgcccacat tgaaaggcta gaagcagcac tgaaactaac tcctcaagaa agactgacca 360 tggtaaagga acacgggaca ataaaggata tggtccaaaa gatacgctcc ctggaatctg 420 agctggccag aaaagagaaa gaagttgctg taatgagaga cgtctacgtt cccaagaaag 480 cccttaatac cacagcgaac aataccgcaa aaacccctgc ggcagtttct gaaccctgtg 540 ctaataaaac caacccaacg agcaaaaaaa ccagcactaa aaagaaagtc gttaaaactg 600 atgactatgg actactaaac aagcgctcga aaaactgtgt ctcgccaata gatacttcag 660 aacctatgaa tacgtcgatc aactattcgc cggaaagcgg tgatatcgaa atctcctctg 720 gagatgagcc gatgctgaat gcatccggtt acatatcaga ttcttaaaga aaaccacctt 780 tccagtcctc tcttgtttct ccgcccaaca aataacgaac attaaccagt ccaatgtctt 840 cgcatccaac cactatcgat cctaaacatc aacccagtgc agagaggact agcaaacccg 900 catccaagct aaataattca gcaaatcgca aaccacagct cttcaagaat catcaaacat 960 ccccggtgga accaccgagt agtccaggcc ccgtcagtgc ggtcgtctac tccaccccgg 1020 tactggcgga caaccctggg cactctttgg tgaccggaag agggacggac aagggagaaa 1080 cctcctatac ccctaaggac aacgcggaaa tcgtagcaat agcaacaaca gaagcccatc 1140 aagagactca agaacgtacc gttggtgcgc cgaaagtgca taaaccgtcg tcaacacaca 1200 gcataacata tgactctctg gtggcagaac ggactccaag gactaccctg ccagaactag 1260 caagatccaa cgcagcaata tacgatttac ttcagctcaa ccaacagaac atcccctctc 1320 ccacagcagg cacttccggt ctaacagctt cttcaaaaag aaatcttcga agaaagtctc 1380 cagtcaaagt tactacccag tcgacaccag tcaggcaatc acaacccact acggttcgtc 1440 acagtcaaaa cgcttccaca aaaaccgtac tggctatcca gtggaacatg aacggttact 1500 acaacaacct agcggaccta gaaattttag tcagagacca agagccgttg gtcctagcaa 1560 tccaagaacc gcataagatt agcatcaaca gtctaaacca ctccttaaga aggcaatata 1620 catggagtct taaatgcaac caaaatttct atcactcaac cgcgatagga gttctagtgt 1680 ctgtacctca ctcgccactt caattcaaca ccgacctccc tttagtggga gtaaaactag 1740 gctatccgtg gcctttaact gttatatcgg gatatctccc caacggtaat attccagatt 1800 taaaacaaaa actcgttgat tttttccagg cactagatac accgatactg gttctgtggg 1860 acgccaacgg gcaccacccc gactggggaa gtaccactgc taatgccaga ggttcattga 1920 tcttggagat tgctcaaatg tttgatttag tcgttctcaa cgacggagcc ccaaccttta 1980 ttaggggcga acgaagctct acagttgaca tcagcctagc tagttcaagt atagtcgatc 2040 gactcctttg ggcaccaggc gaagacccaa tgggtagtga ccactacccg atctcaatca 2100 gcatcaacga acaaccccaa gcaatatcac gtcgtccccg atggaagtac aatgaagcag 2160 actggtcaca atttcaggag tcaataactc cgaaactctg tgatcctgta ccggataata 2220 ttgaggaatt tctgaacatt atcagtgact cggcctcact ctcaattcct cgaacatcca 2280 caagtccagg tcggagatct ctcccctggt ggtccccaaa tataaaaaag gtcataaaag 2340 agaggagaaa aacactaaga gctgcgaaac gscttcctaa tgaccaccct gccaaacatg 2400 acgccatgga gacttatcgc tcaaaaagga atcaatgtcg tcaacaaatc cgagaagcaa 2460 aaaagaagag ctgggaagat tttctagaag gaataaacga aaaccaaacc tcagctgatc 2520 tgtggaatcg tgtaaatgct ctaaatggca aacggagatc cacgggaatg accctgcagt 2580 tgcccaatgg tcctactcga gatccactca ttatagctaa tgctctagca gattactttg 2640 cgtcactttc atcacttgat ctctacgacg ctaattttct tcgamggaac caagtatctg 2700 ccgacagcat aatcaagatg gcaatcccag atgataccgc cggtttgccg attaatgctc 2760 ccttccggtt cgatgaactg actttcgcgc tacgtcgttg taagggtaaa tcagcgggtc 2820 cagacgatct tggctacccg atgtttcgaa acctaacacc cgaagcaaaa gccatctttc 2880 ttgaactttt aaacaaaatc tggatcggaa acacgctacc caaagcatgg acccacagcc 2940 tagttgtacc cataccaaaa aatggacgtg ctgcaacatc acctagtgat ttcagaccta 3000 tatctctgac atgctgcgcc agcaaaatcc tagaacgtat ggtgaaccgg aggctgaatc 3060 gttttttgga agacaaccaa ctactcgacc atcggcaaca tgcctttcgt gccggacatg 3120 ggacmggcac ttattttgct ggtttaggag atgtgcttca ggatgcgatg aacgcaaacc 3180 ttcacgctga tatcgcctcc ttggatctgg ccaaggccta taatcgtgcc tggacaccca 3240 gagccattca tcagttggct gaatggggtt tggggggaaa tatcctccac tttataaaaa 3300 atttcttaag cggcagaaca tttgaagtag tcatcggtaa taaccactcc acagttcgag 3360 cagaagagac aggtgtcccc caaggctccg ttatagcggt caccattttc ttagttctca 3420 tgaatagtgt ctttgacacg ctacctaaag aaatctacgt atttgtctac gcagatgaca 3480 ttgtcttgat agtcatcggc cgcaccttga aattcataag gcgaaaactc caggcagccg 3540 tttctgcagt ttcaaggtgg gctactcaag ctggcttcat actgtcggca gaaaaaagca 3600 tcgtctctca tatctgccga tcgcatcacc gagtgttatc aactccagta acagcaaacg 3660 gttgttcaat accacataaa acatcaaccg tagtactggg tgtgcgactg gatcgaaaac 3720 tacaatttgg tgagcacctc aacgagacca aaagaaactg ccaaacacga ttaaacctcc 3780 ttcgaacgct atctaagcca caccgcagtt gtaacagaga tactcttctc agaattgcaa 3840 gggccattgt caatagccgc atcttctacg gaattgaact gtttggtcta gcaggggatg 3900 ctctaatcac atgccttgct ccggtataca accaagctat tcgcataatt gctggcctgc 3960 ttccgtcaac cccagctgat gcggcctgta ttgaacttgg tgttcttccc ttccgctatc 4020 aggttacaca aactctatgc tgcaggacta taagctacct ggaaaatacc actggagatc 4080 atgaggtctt tcttctcagg gaggggaata gatccctcag cagcctggtc catcaggaac 4140 tccccccggt tgagcaggtc cactgggttg gagccagaag atgggacgcc accgacttcc 4200 gagtagactt gtctatcttg aaacacttcc gagccggaga caactcttcc gctatgcaat 4260 cccatgtcgt cgaactactt gccgacaagt accgcaacta tcatctattc tacacagacg 4320 gatcaaaaac tgtaaataga actggtttcg gcgttgccac aaacgacaca agttacttct 4380 atcggctgcc cgaccaatgc tctatctttt cggctgaagc agctgcaata ctctttgcaa 4440 tcacaaggcc aacagaacgc cccatctgcg taatttctga ctctgcaagt gtgcttacca 4500 ccatcaactc tccatcaaca cgccatcctt ggattcaagc catccagatg aagtcctctc 4560 cggaaactgt cttcctatgg gtaccgggcc actgtggtat ccggggcaat gtggaagcag 4620 accgccttgc cgccacagga cgacttggcc gcatgctcac caatttgatt ccgggtgcag 4680 atttgaaaac ttggtgtaaa tccactattc gcttggagtg ggcccgagag tggataggcc 4740 tgagacatct atttataaga aaaattaaag gggaaactac aagatggatc gattctgccg 4800 aacgacacaa ccagcaaatt ctgtcccgac ttcgagttgg ccacacacat ctaacacaca 4860 acatgggaaa ccgaggacct ttccacaaag tatgcccggc atgcaacacg aaaatgacgg 4920 tagaacacct cttaatcgac tgtccctgct accaagctgc ccgacttcgt catgccatcc 4980 caggcagtat aagagacgct ttagctaatg atccgaccaa cgaagccgca atattatcgt 5040 tcctcaaaga tgccggacta caccgaaaca tttaacatcc tacatcaaac aacaaccacc 5100 gagatatgac gaccctgctt ttcaccctaa attgacttgg attacctatt cgttttttga 5160 atgaacattg atgtatttat gtgtatcgaa gttaattaaa gcaaaatgta caggggggcc 5220 tctcactacg aagccctctc catattctct accggagacg aaccagccaa cggctgaaag 5280 tctcgataat aaagataata ataa 5304 // ID Gypsy-160_AA-I repbase; DNA; INV; 6087 BP. XX AC AAGE02018302; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-160_AA_; KW Gypsy-160_AA-LTR; Gypsy-160_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6087 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018302; Positions 20078 26164. XX CC Positions [2792-3295] - Reverse transcriptase CC Positions [4364-4840] - Integrase core CC 'CTGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 126..2153 FT /product="Gypsy-160_AA-I_2p" FT /translation="MDFNHLTEEEISYELALRHIVNVASMTHRNKVLKLKA FT LTQEEILRDITHSSSDNPIDPTVNIENCSQGIQQLQSLVAHAQTSGNCQDL FT SILASRLTHYKNRLAIIEPPEHLIQALQSLRQIVDTLFVNVNSAMRKTVYQ FT PTERIHSGAIPKGQRQQTSVADQQSPMKPRDEREGDLSVHNSERMSEEDAN FT KFGLPPSSSPLNASRGRGRGTMFQPFRGTNQRNSVAANVAGVDTEPEQHYP FT RRDSEHQYNQLREDLLNRLIQMQLNQQYGNGRMEDRRMQKAIHNWPFKFRG FT ERDTKTLNSFLDRVESFARSEGVSDELLLTSIKHLLQEDALDWYARVEYQG FT LLDSWESFKEQIRKEYLPAQYGQLLRAEAFFRYQGQTESFAKYYRDITTLF FT RFAEPAMSEDEKFFIVKKNMHADYALIITAAKPENMAELADVCSSYDDTRM FT LLNRQQRKMSIPQELLVEPNLATPTAGHRPSQQITSQRFSRINAVELQEIE FT LPTSNNSRPAINQCPPGGNDEDYWTMKINQLMDQVNAIKMQVERRHLRTPE FT NSANGLEQRNHQRSSWRPMQQSPQQTGATFRSSQNHFAQQQRSVGSEHEQP FT DNDQQQVQHGAQHLQAFENNRPITICWNCEEEGHRFADCRKQQAFLFCHGC FT GKRGYTLRNCVTCRTEVGNYPAGTQQM" FT CDS 2198..5212 FT /product="Gypsy-160_AA-I_1p" FT /translation="MTPQINSIVNPGQDNRPHAVISILGKELTALLDSGAN FT CSLLGGKHVNIVEQLGLRKGVVVCGGIKTADGTKHKIDHFVRLPIAYNNRN FT ETIPLLLIPSIPDCIILGMDFWERFGVKPVCYAIEAIIQDDLQKDLSNQEK FT QQLERVIEKFPKSVEGKLGRTPMYEHRIDIGDAPPKKQRYYPMSPYVLEEV FT NKEIDRMLALDVIEEARFCPWNNPLVAVKKKTGKYRVCLDARHLNSIMVNE FT GYPIPQISSIMNNLSGCKYISTIDLNDAFWQLPLHKASRPLTAFTVPSRGH FT FQFKVVPFGLCTASQALARVMTHLFADMEPFVFHYLDDLVICSRTFQEHLN FT TLEEVARRLSSANLTISTEKSLFCRREIKYLGYILNEDGWKVDKDKIECIV FT QFPVPSCRKEVQRFLGICNWYRRFISEFARIATPLTELTKTKNKFRWTEQA FT EDAFLRLKSALVSAPVLAMPNYEKPFNIACDASDVAIGAVLTQDIEGEEHP FT ICYFSQKLNAAERKYSATERECLAVIRGIEKFRCYVEGVKFTVFTDHAALS FT YLRSMKNPTALMSRWILRLNAFNFDIKYRKGCINIVPDALSRIANLTFLGN FT KDTVDDWHTRMINSVESEPDKFPDFRIVNGELFKNCKCRDETGVTTHKWKK FT TVPFASRLDVIRRFHDAPSAAHLGFYKTWQKVQTHFYWPKMQSDISSYVRS FT CAICKASKSPTSIMMPQMGGMKPAKQPWELISIDFVGPFTRSKRGNTVMLV FT VVDWITKYVIVHPMRKADSSRLVEFLEQEVFLKFSRPRIILSDNGRQFESL FT VFKSLLSRQNIIHMKTGYYSPMVNNAERVNQVLITCIRALLDEDHRSWDEN FT LPTITAAINSSKHEATGVSPHFANFGRELLLHTDLYDQQDLNTPDDTQLVQ FT DKRLAEIRRIQEFVLQKIRNNHIKTKQRYNMRTRAITFKIGELVWRRVFQL FT SSKENHINQKLNPKFTPAIVKEVLGTNLYLLEDVSSGKRGRYHAKDIKAD" XX SQ Sequence 6087 BP; 1933 A; 1353 C; 1356 G; 1445 T; 0 other; ttgacgccca acaaaaaata cccacacact taagccacgt tgtttgtttg ctgagcttat 60 atttttttaa tttgatggtt tgagttcgcc tattttcaat ttagtttgtt gactctttat 120 ttatcatgga tttcaatcac ctcaccgaag aggaaattag ctacgaatta gctttacgac 180 acattgtcaa tgtggcatcc atgacgcatc gcaataaagt gcttaaatta aaagcactaa 240 cgcaagagga aatcttaagg gatatcacac acagcagttc ggataatccc atagatccaa 300 ctgttaatat tgaaaactgt tcacaaggca ttcagcaact acagtcactg gttgcgcatg 360 cacaaacatc aggaaactgt caagatctgt caatattagc atcgcgactt actcattaca 420 aaaatagatt ggcaataata gaaccccctg aacatctaat tcaggcacta caatcgctac 480 gtcagatagt agacacactt ttcgtgaatg tgaacagtgc aatgagaaag acagtgtatc 540 aaccaaccga acgaatacac agtggagcaa taccaaaagg tcaacgccaa caaacatcag 600 tcgctgatca acaatcacca atgaaaccaa gggacgaaag agagggggac ttgagtgtgc 660 acaactcgga aaggatgtca gaggaagatg ctaacaaatt tggattgcct ccttcgagtt 720 caccattgaa cgcaagccgc ggtagaggaa gaggaactat gtttcaacca ttcagaggca 780 ctaaccagcg taattcggta gccgcaaacg tcgccggagt tgacacggaa ccagagcaac 840 attatccacg aagggacagt gaacatcaat acaaccagct gcgggaagac ttgttgaata 900 ggctgataca gatgcagctg aatcaacagt acggcaacgg acgcatggaa gacaggagga 960 tgcagaaggc catccacaat tggccattta agtttcgagg agaacgtgat acgaagacac 1020 tcaactcgtt tctggaccgc gttgaatcgt tcgcacgttc agaaggggtt agtgatgagc 1080 tgctgctgac atctattaag catttgttac aagaggatgc tttggactgg tatgcgagag 1140 tggaatatca aggattactt gattcctggg aaagttttaa agaacaaatc agaaaagaat 1200 atctaccagc acagtatggt caactgctca gggcagaagc attcttccgt tatcaagggc 1260 aaacggaatc gttcgccaag tactatcgag atatcaccac tctatttagg tttgccgaac 1320 cagccatgtc ggaggacgaa aagttcttca tcgtaaagaa gaacatgcat gctgactatg 1380 cgctcatcat cacagcggcc aaaccggaga atatggcaga gctagcagat gtttgttcca 1440 gttacgacga tacaaggatg ttactgaacc gccaacaacg caaaatgtcg attccacagg 1500 agttattggt agaacccaat ttagctactc caactgctgg acatagaccg tcgcagcaga 1560 tcacgtccca gcgtttcagc agaattaatg cagtcgaact tcaagagata gaacttccaa 1620 cgtcaaacaa ctcacgtcca gcgatcaatc aatgcccgcc aggaggaaac gacgaggatt 1680 actggacaat gaaaatcaat caattgatgg atcaagtcaa cgcaattaag atgcaagtag 1740 aacggaggca tttgaggaca cctgaaaatt ctgcgaatgg tttggaacaa aggaatcacc 1800 aacgaagttc ttggagaccc atgcagcaaa gccctcagca aacaggagcc accttccgtt 1860 cctctcaaaa ccatttcgca cagcaacaaa gatcagtagg ttcagaacat gaacaaccag 1920 ataacgacca gcagcaggtt cagcacgggg cacagcatct gcaagcattt gagaacaaca 1980 gacccatcac aatttgttgg aattgtgaag aggaaggaca tcgattcgcg gactgcagga 2040 aacaacaagc attcttgttt tgccatggat gtgggaaaag aggatacacg ttgcgtaact 2100 gcgtcacgtg ccgcacagag gtgggaaact atccagcggg gactcagcag atgtagaggt 2160 tggctctccg ctcaacgtaa atcctctaat cgtccacatg actccacaaa tcaattccat 2220 agtcaatccc ggacaggaca acagaccgca tgcggtgatc agcatactgg gaaaagaact 2280 caccgcgctt ttagacagtg gagccaactg ctcattactt ggcggaaaac acgtgaatat 2340 agtggagcag ctaggtctgc ggaaaggagt ggtagtatgc ggtggcatca aaacagcaga 2400 cggtaccaaa cacaagatag accacttcgt tcgtctccca attgcctaca acaatcggaa 2460 tgagacaatt ccacttttgc tcatcccatc tatacctgat tgtatcattc tagggatgga 2520 cttttgggag agatttggcg taaaacctgt gtgctatgct attgaagcca taattcaaga 2580 tgaccttcag aaagacctat ccaatcaaga gaagcagcag ttagaaaggg ttatagagaa 2640 attcccaaaa tcagtcgagg gaaagcttgg aagaacgccg atgtacgaac accgcatcga 2700 tataggagac gcaccgccaa aaaagcaaag gtattatccg atgtctccct atgttttaga 2760 ggaagtgaat aaggaaattg atcgtatgtt agcactagac gtcattgaag aagcacgctt 2820 ttgtccatgg aacaatccac ttgtggctgt caaaaagaaa acggggaagt accgtgtttg 2880 tttagatgca cgtcatctga actcgataat ggtcaatgaa ggttacccga ttccacaaat 2940 ttcctcaata atgaacaacc taagtggttg taaatacata tcaacaattg accttaatga 3000 cgctttttgg cagctccctc tacataaagc atcacgacct ctcacagcat tcacagtacc 3060 gtctagagga cattttcaat ttaaagtcgt tccgtttggc ttatgcacgg caagtcaagc 3120 tctggcacga gtaatgacac atcttttcgc agacatggag cccttcgtct ttcattactt 3180 ggatgatctg gtaatctgtt ctcggacgtt tcaagaacat ctgaacacac ttgaggaagt 3240 ggcgcggagg ttgtcgtccg ccaacctcac tatatctaca gaaaagtccc tcttctgtcg 3300 cagagagata aaataccttg ggtatatact caatgaagac ggatggaaag tagacaagga 3360 caaaatcgaa tgtattgttc agtttcccgt gccttcatgt cggaaggagg ttcaaaggtt 3420 tttagggatc tgtaattggt accgcagatt catctccgag ttcgcacgga tcgcaacacc 3480 tctcactgag ctaactaaaa ccaaaaataa gttcaggtgg acagagcagg cggaagacgc 3540 cttcctgcgg ttaaaatcag cactcgtttc ggcgccggtt ctggccatgc caaactatga 3600 aaagcctttt aatatagcat gtgatgcgag tgacgtcgcc attggggctg tgctaacaca 3660 ggacatcgaa ggggaggagc atccaatatg ttacttttcc cagaaactaa atgctgctga 3720 acggaaatat tcggctaccg aaagagaatg tttggctgtt atccgaggaa ttgaaaaatt 3780 tcgttgctac gtagaaggag taaagttcac cgtgttcaca gatcatgctg ccctcagcta 3840 tcttcgctcg atgaagaacc ctacagcact catgagtagg tggatattac ggctcaatgc 3900 gttcaacttc gacattaagt acaggaaggg ctgtataaat atcgtgccag atgcattatc 3960 acggatagca aaccttacat tcctaggcaa caaagatacg gtggatgact ggcacacccg 4020 catgattaac agcgtagaaa gtgaaccaga caaatttcca gattttcgaa tagttaatgg 4080 tgaattattc aagaactgta aatgtcgaga tgaaacaggc gtaacaactc acaaatggaa 4140 gaaaaccgta ccattcgcaa gcagattgga cgttatccgg cgatttcatg acgctccatc 4200 cgcagctcac ctcggtttct acaaaacctg gcagaaggtt caaacacact tctattggcc 4260 taaaatgcag agtgatatta gcagttacgt ccgcagttgc gccatatgta aagctagtaa 4320 atcacccaca tccattatga tgccacagat gggaggtatg aagccggcta aacaaccgtg 4380 ggaactaatc tcaatcgatt tcgttggccc ttttactcgc tccaaacggg gaaatacggt 4440 gatgttggta gttgtggatt ggataaccaa atacgtgata gtccatccaa tgcgcaaggc 4500 tgattctagc cgattggttg agtttttaga acaagaagtc ttcctcaagt tttcacgtcc 4560 tcgcatcatc ttatcggaca acggtaggca atttgagtct ttagtcttca aatcgctatt 4620 atcgagacag aatatcatcc atatgaagac gggatattac agccctatgg tcaacaatgc 4680 agaacgcgta aaccaggtac tcataacatg tataagagct ctcttggacg aggaccaccg 4740 ctcatgggac gaaaatctac cgacgataac agcggcgatc aacagctcaa aacatgaagc 4800 aacgggcgta agtccacatt ttgcaaactt tggcagagag cttcttttgc acacagactt 4860 gtacgatcaa caagatctca atactccgga tgatacccag ctagtacagg acaagcgttt 4920 agcagaaatc cgccggattc aggaatttgt gttgcaaaag ataagaaaca accacattaa 4980 gacaaagcaa cgctacaata tgcgcacaag ggcaataaca ttcaagatcg gtgaactggt 5040 gtggcgcaga gtatttcagc tgtcatcaaa agagaaccat atcaaccaga agctcaatcc 5100 aaagttcacg ccagcaatcg taaaagaggt gttgggaacg aatctctact tgctcgaaga 5160 cgtctcgagt ggaaaacggg gtcgatatca cgccaaggac ataaaagcag attaggacag 5220 agacacatca atatgctgac tacaagcaac cacatctgca tgcgtcagag atgttggaga 5280 caagctatgt acacagtatg caaggaactg tcaactaaga gcgatttcgc caggaaaata 5340 cgattcacgg atcaggctga aggaccaaca acaacattcg aaacctcgcc gtcccttgcg 5400 caccgatatc ctgccaaagc cagctgcctt tgccatggat tgccaaacta caaactttag 5460 aaagcagtga ggaagcacta tctcatccaa attgcaaaat cgctgctagc caaatctaaa 5520 gtagtcgtaa acaatgagtc agaagcaagt aaatcgggga tcggcaagaa atttcctttc 5580 taaagtaacg cgacccgggg cgcggtaaaa ccccttataa atgggctctt gtcgattctg 5640 gagattaaat cctcctttcc tgaaatacgg cttccatcaa actaattcaa accatcatta 5700 tctcagcact taatttgtaa gttttaatgt tctttgttcg atcagacccc ttcattgcat 5760 aaagggttta agatcgtcct gcgtgttcac catgaaggga gatatgacac aagaagcagg 5820 aacaagaact ttgtcggaga cttctggcca gaaatggcgt tagatcgaaa attttgccag 5880 aattcgctgg tacgattatg aatatgttag cctgttttgc tgagtagtca gttgtattct 5940 tctagggaag ataacactcg aaatctgttc gagcgtccga taatgatcat ttgcgtctcc 6000 gactaacttc ttaaaagaaa gacgaaaaat tttttcatga gccaactgtc gttgcctcat 6060 gaaaaatttt cccttgtggt ggtgtgg 6087 // ID R2_DPs repbase; DNA; INV; 3557 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE pseudoobscura. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_DPs. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3557 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 129..3338 FT /product="R2_DPs_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="SSFGLIVTNLNSETVLWGCQPLGQFSLIGTNMQNTTP FT RIINTNSLTNQIPTVSSLGAQSEHSAQVNPNSGYQCTICESSFRSKSGLGV FT HMSRRHKDEFDQLRLRTDRKAQWSEEELSMMARKEIELAANGERYLNKKLA FT EVFTNRSVDAIKKCRQRERYKTKIEQLKGQAVPLPEALESETIQRRPSIRE FT RDLLVTPPNTLGTTPTELSNREILAVLQGYPPVVCNDQWRVEVLQSIVDGA FT QASGKEITLQRLSTYLMEVFPSQNDRPIQTRPPRRPRNRRQGRRQQYALTQ FT RNWDKHKGRCIKAILDGTEGTATMPSQGIMGSYWRQVMTQTSPTYSGTNTT FT FRTEHPLEGVWSPITLGDLRVHRVSLTKSPGPDGITPRTVRSIPSGVMLRI FT MNLILWCGKLPVSIRQARTIFIPKVGNASRPQDFRPITVQSVMVRILNAIL FT ASRLTSSVDWDPRQRGFLPTDGCADNTTIVDLILRDHHKRCKSLYIATLDI FT SKAFDSVSHAAVSATLTAYGAPKEFVDYVQNSYEVCGTTLNGDGWRSEEFI FT PARGVRQGDPLSPIIFNLIIDQLLRSYPNEIGATIGDHTTNAAAFADDIVL FT FAETRLGLQTMLDTTVDFLSSVGLTLNSDKCFTVGIKGQPKQKCTVVIPET FT FRIGSRSCPALKRTDEWKYLGITFTAQGRTRYSPADDLGPKLLRLTRSPLK FT PQQKLFALRTVLIPQLYHKLTLGSVMIGVLRKCDILVRSTVRKWLGLPLDV FT STAFFHAPHIYGGLGIPSVRWVAPMLRMKRLSNIKWAHLAQSEAASSFLTD FT ELNKARGRTLAGLNELTSRSEIETYWANRLYMSVDGRGLREAGLFRPQHGW FT VCQPTRLLTGQDYRNGIKLRINALPSRSRTTRGRNELERQCRAGCDAPETT FT NHILQNCYRTHGRRVARHNCVVNNLKRILEEKGHTVHVEPSLQLETSVSKP FT DLVCIRDNHACVIDAQIITDGLFLDDVHHRKVEKYKRPEVISALRREFGVS FT GNVEVLSATLNWRGIWSNQSVRRLIAKGLISSGDSNVISARVVTGGLYCFR FT QFMYLAGYTRDWT" XX SQ Sequence 3557 BP; 1019 A; 806 C; 878 G; 854 T; 0 other; caattggaaa gatatgggtc tgaataatag cgtagaaggg gagtcattcc gtaattcgta 60 aatcgtaaaa atcagatcaa gttgattcaa gacctcctcg tggtatcttc tggatgctat 120 tagactgaag ttcttttggt ctaatagtaa ctaacttgaa cagcgaaaca gtcctatggg 180 gctgccagcc ccttggacag ttcagtttga ttggcactaa tatgcaaaat acaacgcctc 240 ggataataaa cactaattcg ttgacgaacc aaatccctac ggtctctagc ctaggggccc 300 aatctgaaca tagtgcacag gttaacccaa acagtggtta ccaatgcacg atatgtgaat 360 cgtctttccg tagcaaaagc ggactaggcg ttcacatgtc acgtcggcac aaggacgagt 420 ttgatcaact tcgtctgcgt accgaccgta aggcacaatg gagtgaggaa gagttgagta 480 tgatggcaag aaaagagatc gagctcgcag caaatggaga aagatatcta aataagaagc 540 tagcggaagt atttacgaac cgtagcgtcg acgctatcaa gaaatgtcga cagagggaga 600 gatataagac caaaatcgaa cagctaaagg gtcaagctgt tcctctccca gaagcattag 660 aatctgaaac catacagcgc cgccctagta tacgcgagcg agatctccta gtaacgccac 720 ctaacactct cggaaccact ccaaccgaac tgtcgaacag ggaaatcctg gcagtactac 780 aggggtaccc acctgtagta tgcaatgacc aatggagagt tgaggttttg caatccatcg 840 tagatggtgc gcaggcctcg ggtaaggaaa ttactcttca gcgcttgtct acttacctta 900 tggaagtatt tccctcacag aatgaccgcc ccattcaaac gagacctcca cggagacctc 960 gtaataggag acaaggtagg agacagcagt acgccttaac acagcgtaac tgggacaagc 1020 acaaaggtcg ttgtataaaa gccattttgg atggaactga ggggacagca actatgccaa 1080 gtcaaggtat catggggtcc tattggagac aagtcatgac acaaacaagc ccaacatata 1140 gtggtacgaa caccacgttc cggacggaac acccacttga aggggtttgg tccccgataa 1200 cactagggga cctaagggta cacagagtgt cattgacgaa atctccagga cctgatggga 1260 ttactccaag aactgtcagg agtattccgt caggagttat gcttcgcata atgaacctga 1320 tactttggtg cggaaagttg cctgtctcca tccgacaggc acgaaccatc ttcattccga 1380 aggtggggaa tgcttctcga ccgcaagact ttcgtccaat tacggtacaa tctgttatgg 1440 taaggatttt aaatgccatt ttggcttccc ggttgacctc atcagtcgac tgggatccgc 1500 gtcagcgagg tttccttcca accgacggat gtgccgataa tacgacgata gtcgacttaa 1560 tcttaaggga tcaccataaa cgttgtaaat cactttatat cgcaacttta gatataagca 1620 aagcatttga ctcggtgtct catgcagcag ttagcgccac tctaactgca tatggtgccc 1680 ctaaagaatt cgttgactac gtacaaaatt cgtacgaggt ctgtggcaca acgctcaatg 1740 gggacggatg gagatcagag gaattcatac ctgctcgagg tgtcagacag ggtgacccgc 1800 tatctcccat aatattcaac ttgatcatcg atcagttgct taggtcctac cccaatgaga 1860 ttggtgccac aatcggtgat cacacaacaa acgcggccgc gttcgcagat gatattgtct 1920 tatttgcgga aactcgttta ggccttcaaa caatgctaga cacgactgtc gattttctat 1980 cttcagtcgg gcttaccctt aactcggata aatgttttac agttggaata aaggggcaac 2040 cgaaacagaa gtgtactgtg gtcatcccag agaccttccg tatcggttcg cgctcgtgtc 2100 ctgcattgaa gcgcacagac gagtggaagt atttaggcat aacattcact gcacaaggga 2160 ggaccaggta cagtccagcc gacgacctag gtccgaagct gttgaggctg acaaggtccc 2220 ccctaaaacc acaacagaaa ttgttcgcac tcagaacagt tcttatccca caactttacc 2280 ataagctgac cctaggtagt gtgatgatag gtgttctgag gaagtgtgac atactggtac 2340 gttcgaccgt aaggaagtgg ttagggcttc ctctggacgt gtcaactgca ttcttccatg 2400 ctcctcatat ttatgggggc ctcggaatcc cttcagttcg ttgggtagcg ccaatgctac 2460 gtatgaaaag attgagcaat attaagtggg cccacctcgc gcaatccgag gcggccagct 2520 catttcttac cgacgaattg aataaggccc ggggtagaac tctggctgga ctgaatgagt 2580 tgacatcgcg ttcggagatc gaaacgtact gggcgaacag gttgtatatg tctgttgatg 2640 gtcgcggctt acgtgaagcg ggactttttc gtccccaaca cggctgggtg tgtcagccca 2700 cgcgtttgct aacaggtcaa gattaccgaa acggtatcaa gctgcgaata aatgccctac 2760 catcgaggtc tcgtaccacg aggggcagaa atgaattgga acggcaatgt cgtgcaggtt 2820 gtgatgctcc cgaaacaaca aaccacatcc tgcagaattg ttaccgtacg catgggaggc 2880 gggtagcaag acataactgt gtagtcaata accttaagag gattcttgag gagaagggcc 2940 acacagtaca cgtcgaacca agtttgcagc tggaaacctc ggtaagtaaa ccagacctgg 3000 tgtgtatccg tgacaatcac gcttgcgtga ttgatgcgca gattataacg gatggactgt 3060 ttctcgacga tgtgcaccat cgcaaagttg agaaatataa aagaccggaa gttatatctg 3120 cactgcggag agaattcgga gtgtcgggca acgtcgaagt cctatccgcg acgttaaact 3180 ggcgtgggat ctggagcaat caatccgtta gaagattgat agcaaagggt ctcatctcat 3240 ccggtgacag caatgtcatt agcgccagag tggtaacagg cggactatat tgcttcagac 3300 agttcatgta tctcgcaggt tacactcgag attggactta gcctatacac tatgttggag 3360 agaagacgct tgctacctag gcataatgtg aaattaggta taaacatcgt ggttgtaaaa 3420 cttgaggtgg gtttttagta cgtatgcgtg attacttcgt aatcatgaat cgtgcatgct 3480 agtggggttt ggcctccact agtatctttg aagattttcc ttcctcagcg atcaaaaaaa 3540 aaaaaaaaaa aaaaaaa 3557 // ID Zator-N1C_CQ repbase; DNA; INV; 471 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A Zator DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Zator; DNA transposon; Transposable Element; Nonautonomous; KW Zator-N1_CQ; Zator-N1B_CQ; Zator-N1C_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-471 RA Kojima K.K. and Jurka J.; RT "Zator DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 650-650 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >84% CC identity. 3-bp TSDs are usually TWA. ~150-bp TIRs. ~86% identical CC to Zator-N1B_CQ. XX SQ Sequence 471 BP; 155 A; 95 C; 64 G; 157 T; 0 other; ggccgttgca aatatttttc aaagtttatg tcgccccccc cccccttcaa agtcggcctg 60 aataatcagg gggcaaaaaa atattttttc gaaaaacttc aaaattttaa tgaaaattaa 120 agtttaatca actgaaaacc agttaaaatg cattttcccg cgtttataat catatttagc 180 atgtttgaac tcctttgaaa acattttaaa ttttcatgaa ataccaatgt acagtcccac 240 gaaaagtttt ttttttgcga aaaaaaaatc cgtcaatacc tcgatatttt caaaactaat 300 gattggaaag caactgtacg cctgtaaaat gcattttaaa acactttttt catccaaatg 360 ttgaaaccta ggcttgtaat ttcaattttt atattttttt tttgtttccc ccccccctcg 420 actttggtca gagccgaggg acataaactt cgaaaaatat ttgcaacggc c 471 // ID Copia-10_SI-LTR repbase; DNA; INV; 238 BP. XX AC AEAQ01016285; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_SI_; KW Copia-10_SI-I; Copia-10_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-238 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01016285; Positions 791 554. XX SQ Sequence 238 BP; 52 A; 56 C; 40 G; 90 T; 0 other; tgttgagatg aacaatgtat aatcgccatg tctgaatgtc ggtaaagtga ctagggtctt 60 ataagtttcg tttctatatt tcttttagtt tatgttttcc cgctacgcgg cggtacccgc 120 tctctctcta ctctctgctc ctgcatccgc actgtcgttg tgtgtatctc tcaaaataaa 180 tcttttgtag tattaattac ccgcgtatac ttttatctca aacccattga ctccatca 238 // ID BEL-601_AA-LTR repbase; DNA; INV; 222 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-601_AA_; KW Pao_Bel_Ele80; BEL-601_AA-I; BEL-601_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-222 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 222 BP; 66 A; 51 C; 38 G; 67 T; 0 other; tgtttattgt agcgacacac cttgccttgc tagcaattga aattgaaact ttgctggcaa 60 ccctggagaa aaatctagta ccacactaac acaccctttg taatacaaca ggaaataaag 120 ccagaatttt ttgtatcgtt cacaagacca gacgcgtttt cttcacggtc gtttggaaga 180 aaattttctc gtttacagtc cacttattcc cgagcttgta ca 222 // ID Crack-11_BF repbase; DNA; INV; 4357 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-11_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-11_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4357 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4357 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 816-816 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 498..3899 FT /product="Crack-11_BF_2p" FT /translation="MNMAGCLLVQFVAFTAIILQLSNPVISLKRVSTQADS FT ITTFNQVSCFKSHHTAVTPEEYVLSLLYPTANTGSRKNVCHVETRVNVVHL FT CLHTLLLLSGDVASNPGPKDPCGVCTKSVRNNQKGICCDLCDKWFHIKCTD FT LDHKTYCNLANSHEYWYCNQCLLPCFSDSFFNSPCNSTMGSSEEENVDLED FT CPSIIPLKGLSIVHLNICSLYSKLDQLSVFMTSNHIDVLTISETHLDGSII FT DNELHIAGYHLYRLDRNRKGGGVAIYVLDSYTHSERKDLQQPGLEALFCQI FT DLPCTKPIIVGSVYRPPSAPVEFYSLLKNSLETWTLSSPNSELYLLGDFNV FT DLGQHKNPGARNISSLSNEYQLKQLIKEPTRVHQHSASIIDHIYCSDLHLV FT TTSGVLQCTLSDHYAVYCVRKATRQRANTKYVTSRKFTKFNEQSYMDDLSK FT VDWNAILHIDKVEEAWTLFKSVLTTVSDIHAPYITKRTKDNQPKWLTSDIR FT KLMLTRDDLKAKARKSGATADWEAYRAKRNVVNKMVQRAKAKYCQEKLEEN FT VSDSKKLWGTIGEILPNKKSIVSKTLTFAGEHLTGLTDIVKGFNKFFGMAG FT RKLAECFGSHNPTPLCPAQYKNLDCTFTFTPLTDQAVHTQLKNLPIRKAAG FT LDNIHPRLLKAAAPIVAQPLTHILNLSLSTGIIPSEWKLARVTPIHKGGDQ FT SDPNNFRPISVLPIVMKIFEKEVHKQLLAYLHSHDILCTNQSGFRPSHSTT FT TTLLDVTDYILHNINSGKLVGTVFIDLKKAFDTVDHSLLLTKLNWIGLRST FT ELDWFTNYLADRQQRVSLNGCTSDSIPITIGVPQGSILGPLLFNLFINDLP FT KAITRSKVTLYADDTAIFFSSNDPRCIESTLNMELKQITSWFKSNRLTLNI FT KKTKWMLMGSTRKTNSTPDLEIKINGDTLERVSSYKYLGVILDSSLSFTDH FT LDEVCSKMSQRLGLLRRLRPYLCTGVANLLYKTLALPLLEYCNVVWDNCSA FT ANKQRLQILQNRGARIILRREPRSNVGELHQALGWSLLQERRNRQICIMVY FT KCLHGLAPNYLLNTFRFNNQIHSHNTRQAQLLHKPNYTSRAGQRTFAYRAA FT EIYNTLKPETKQATTLKAFKQSLIS*" XX SQ Sequence 4357 BP; 1291 A; 935 C; 877 G; 1254 T; 0 other; cgttgttcag aggtcagcca agggaacccc accttccgtg gtctcagcta taacagactc 60 tattttaccc cttttttact ctttgtaaca acatttactc ctaacataat ccaccataac 120 gtcgtcaagg tggtagtgag ctgctggatg ttagtaaaaa gcaaatttct gccaaattaa 180 agcagaactt tgagtccttg ttgtagggag aagcgcggat ggggagggta ccccggtaca 240 tggctccagc acgcagtgag tagaacgctg cccccacctt ttgtagggga acagcttagc 300 agtgtttggt gtacttgtac ctataactgt aattgcttaa tctttattgc ttttttagtc 360 ctatatcgtt ttctcaaatt gcttgacggt ttgcaacctg tgattgtgca ataattgggt 420 tagtgaacta attgggttgg aaagcgccag gtgtgctgta tgccgagtta atctaacatc 480 ataatttagc cttgtgtatg aatatggctg gttgtttact cgttcaattt gtagcattta 540 cggctatcat actacagctt tctaacccgg ttataagctt gaaacgtgtc agcacgcagg 600 ctgactcaat aacaacattt aaccaggtgt cctgttttaa gtctcaccac actgctgtta 660 ccccagagga atatgtacta tctctgttgt atccaactgc aaatactggc agtagaaaga 720 atgtttgtca tgttgaaacc agggtcaatg ttgttcatct ttgcctacat accctgcttc 780 ttctgtcagg cgatgtagct agtaatccag gtccgaagga cccatgtggt gtatgcacta 840 aatcggtaag aaacaaccaa aagggtatct gttgcgatct ttgtgacaag tggttccaca 900 tcaaatgcac agacttggac cataagacat actgtaattt ggctaacagt catgagtatt 960 ggtactgtaa tcagtgttta ttgccatgtt ttagtgactc attcttcaat tcaccatgta 1020 atagcacaat gggcagctcg gaggaggaaa atgtggatct ggaggactgt cccagcataa 1080 ttcctttgaa gggactgagc attgtgcact tgaacatatg tagtctctac agtaaattgg 1140 atcagctatc agttttcatg acctctaatc atatagatgt gttgaccata agtgagactc 1200 acctggatgg ttctattatt gacaatgaat tgcatatagc aggatatcat ctgtatagac 1260 ttgaccgtaa taggaaaggg ggcggggtag ctatttatgt attggattct tacacacatt 1320 cagaaagaaa ggatctacaa cagccaggac tggaggccct gttttgccag attgacctac 1380 cttgtaccaa gcccatcata gttggctctg tatacagacc accctctgct ccagtagaat 1440 tctactcctt actcaagaat tcccttgaga catggactct gtcttcccca aactcagaac 1500 tgtatttact cggagacttt aatgtagacc taggtcagca taagaaccct ggggccagaa 1560 acatttcaag tcttagtaat gagtaccagt taaagcaatt aattaaagaa ccaactcgag 1620 tgcaccaaca ctcggccagc atcattgacc atatatactg cagtgatcta caccttgtta 1680 ccacttctgg tgtactgcaa tgtacactat cagatcacta tgcagtttat tgtgtaagaa 1740 aggccacccg gcaacgtgca aacacaaagt atgtaacatc ccgtaagttc accaagttca 1800 acgaacaatc ctacatggat gaccttagta aggttgactg gaatgcgata ctacatattg 1860 acaaggtgga agaggcatgg acattgttca agtcagtact cacaacagta agtgacatac 1920 atgcccctta cattactaaa cgcaccaagg acaaccagcc taaatggctg acatcggaca 1980 tcaggaaact gatgcttact agggatgact taaaggccaa agcccgtaaa tctggtgcaa 2040 ctgcagattg ggaggcatac agggcaaaaa ggaatgtggt aaacaaaatg gttcaaagag 2100 caaaggctaa atactgccaa gaaaaactag aagaaaacgt ttcagactcg aagaaactat 2160 ggggaacaat tggggaaatt ttacccaata agaaaagtat cgtgtcaaag acactgacct 2220 ttgcaggcga acatcttact ggattgactg atattgtaaa gggctttaat aagttcttcg 2280 gtatggcagg gcgtaagcta gctgaatgct ttgggtcgca taaccctacc ccattatgcc 2340 cagcacagta taagaacttg gattgcacct ttacttttac tccacttact gaccaagctg 2400 tgcataccca gctaaagaac ctacctataa gaaaagctgc cgggttggat aacattcacc 2460 ccagacttct taaagccgca gccccgattg tagcacagcc acttactcac atattgaacc 2520 tgtccctcag cactggtatt ataccatctg agtggaaatt agcaagagtg acgcccatac 2580 ataaaggtgg agatcagtcc gatccaaata actttagacc catttctgtc ctgcctattg 2640 tgatgaaaat atttgagaag gaggtccaca aacaattact ggcttactta cattctcatg 2700 acatactttg taccaaccag tcaggcttca gaccaagcca ttcaacaacc accaccttac 2760 tcgatgttac tgactacata ttgcataata tcaacagtgg taaactagtc ggtactgtat 2820 ttattgattt gaaaaaggcg tttgacactg ttgatcacag cctacttttg actaaactga 2880 attggattgg ccttcggagt actgagctgg attggttcac aaactatctt gcagatcgcc 2940 aacagcgagt ctcactgaat ggttgcacat ctgattccat tcctattact attggggtac 3000 ctcaagggtc aattttaggt ccccttcttt tcaacctgtt tattaatgac ctacccaaag 3060 ctataacccg cagtaaagta accttatatg cggatgatac agctatattc ttttcaagta 3120 atgaccccag atgtatagaa agcacattaa acatggaact taaacaaatt acttcctggt 3180 ttaagtcgaa tcgacttact cttaacatca agaagacaaa gtggatgctt atgggctcca 3240 ctaggaaaac aaattctact cctgatttgg agatcaagat taatggtgat accttggaaa 3300 gggttagctc ttataaatat ctgggtgtca tacttgattc tagcttaagt ttcactgatc 3360 acttggatga agtgtgcagt aaaatgtcac agcgcctagg actattaaga cgcctacggc 3420 cttatctctg cactggtgta gcaaatttgt tgtataaaac ccttgctcta ccactccttg 3480 aatattgtaa cgtggtatgg gacaactgca gtgcagccaa caagcagcgt ctccaaatcc 3540 tccagaaccg gggcgcacga atcatcctcc gacgtgagcc aaggtcgaat gttggggagc 3600 tgcatcaggc actgggttgg agtctgctgc aggagcgaag aaatcggcag atctgcatta 3660 tggtgtacaa atgccttcat ggcctagcac ccaactacct actcaacaca tttcgtttca 3720 ataaccaaat ccacagtcat aacaccagac aagcacaact cctacataaa cctaactaca 3780 cttcaagagc aggtcaacgc acatttgcat acagggcagc tgaaatatat aatacactta 3840 aacctgagac aaaacaagca actactctta aagcctttaa acagtctctg atatcataac 3900 actgacatct gacctctgat cccaatgact gttcaattac cttatgattt ttgtgatgtt 3960 gagttgttat tgtgtatgag ttgttattag taagttgcgt atatgattta ctgatccgaa 4020 ttggattatt ctttcttatg ttcagtccga ttgatcagga caaatggcac tgacatctga 4080 cctctgaccg caatgaccgt ccaatgaccc tatgagtttg tgatcttgag ttgtttttgt 4140 tattaagtat gagttgttat tcttttgagt tttcctaagt tgcttgtatg atttacaaac 4200 ccgaattgga ttattgcttt gttatgttca atcctatgtt tgtatttatc ttttgtactt 4260 tgagttatta ttattgttat agggctccct tggaaatcag ttaccacgtt aactgaaggg 4320 actaccctaa agatcaataa ataaataaat aaataaa 4357 // ID Gypsy-177_AA-I repbase; DNA; INV; 6999 BP. XX AC supercont1.143; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-177_AA_; KW Gypsy-177_AA-LTR; Gypsy-177_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6999 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.143; Positions 550904 557902. XX CC Positions [4577-5056] - Integrase core CC 'ACAA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 354..2156 FT /product="Gypsy-177_AA-I_2p" FT /translation="MATSEGSALLINSHDLSEDEVDYELLIRNQLYEETEK FT QKRDRLRRIFLEEGNDTIVLTEDIFFVNELPLVTKKLEEIEYGLARTIQVK FT WISQLRHWKERVNRSKVVDLAQGEQKVEILFRIKNLIKIYKTTAVRLMNLS FT ESEEELGDSKGSDRTQFKQENKLTKARAPRFAETSVRKENQSFDHDTLSRA FT VNSTIGDYFKQFPNIAGMKISIDRDDKTKKKVIQERESMRNLGAVPKRFER FT LGSIEGQDIEVHEINEKQNRKKSRKTKLPKKPMSDDSSSLDSLSSLLSSES FT SSDSGSIYPSERSDDIGRRDNRVQNHRLDKWGIQFSGDIHGMDVSDFVFQV FT NELMDAERIPNDRFLDHAYILFAGEARRWYFTYKKKYKTWDKFSKQLKIRF FT GDPNKDRKILQDIKDRKQKKGESFVAFCSEIEGMFERMTKQYSERKRLKVL FT RNNMRRWYKTKLSFFKIKNIAHLSMLCQQLDKDSGRIYSKTSLPTRKHIRN FT VDASSDSSSSSSEQEVCAFDKKENRQRRYREPRPDITEPQGRENLLQLNSL FT CWNCRKYGHRWRDCKQPKVIFCHACGMPGVTFMTCPKSHTLPQQAPKNESS FT EEK" FT CDS 2066..5428 FT /product="Gypsy-177_AA-I_1p" FT /translation="MRHAWSYFHDLSKESYASSTGSKKRELGGEVRGVSFP FT ENCLDPKKENLLGDHHSNWKIYELNIKPNKCPHVLVKIVDSNIEALLDSGA FT EISVMNSIDLVRRYNFKIHKTNIRVDTAGTAEHHCLGFVNVPLTFNDLTRV FT VRIYIVLEFSKPLILGVNFWRAFNIVPAIEMTNKEIISIDRLNSQSQIQVC FT PIRLIENYFSADESELALTFSEFVGAVHEPDSHSDDEDLSLEVPTLDFPSM FT SIEEIETEHVLNREERFQLLEAISCFEFSEDDKIGRTAILEHKIQIIPGAE FT IKASPMYRCSPYIQKYVDEELERMKKMDVIEPCDSDYASPLLPIKKANGKF FT RVCLDSRRVNDVTENNAYPIPNLHEVLHRIKKAKYFSVIDLKEAYWQIPLA FT KESRKYTAFRTKSGLYRFKVMPFGLKGAPFTMSKLMDLVLGNDLQPYVWVY FT LDDIILATETLSQHVALIKEVASRLKRANLTISLNKSKFCRKSVRYLGYVI FT SEQGISIDMEKIKPILDYEAPKTVKDIRRLLGLANFYQKFIKHYSDITSPI FT SDLLKKDRKKFSWSSEANEALNKLKQALISPPILANPNFDLEFTIESDASD FT LAVGAVLTQKQHGEKRIIAYFSKKLNSTRRKYAAVEKECLAVLWAIEAFRN FT YIEGTHFRVITDARSLVWLSKVSAEKGSAKLCRWALKLQQFDFSIEYRKGR FT DNITADCLSRSLSTIKVKWEDSDYSGMLNKIKVDPLKYKDYKIVDNVIYKF FT VNDESLITDRRFTWKRIPTIAERVELIREIHEEAHLGTEKTLGKIRDKYYW FT PRMYSDVKRFCEGCIACRKSKSDNVNHVPPMGKQKIANQPWQIIAIDYVGP FT FPRAKRSGNTCLLVITDIFSKFVIIQPLKEAKAKQLSFFMENMIFLVFGVP FT EIVLSDNGAQFTSKEFKNLLQKYSVSHWLTPVYFPQVNNAERTNRVITSAI FT RALIKNEQNEWDANIYKIAHAINNATHSSSKFSPYFVNFGRNQISSGEEYQ FT ILRDLERDLSPEEQEHSSKMKNIFEEVRKKLKIAYDKYSKYYNLRAGKMVQ FT FAIGQKVLKKNYFLSDKGKGFNAKLAPKYSEAVVKKKVGDYCYILEDLNGK FT SLGMYHVSQLKKV" XX SQ Sequence 6999 BP; 2567 A; 1134 C; 1381 G; 1917 T; 0 other; gtggcgccca acttataaac agcacgtccc acttgaatta gaaattttat tgatcaatat 60 ttttcacccc tctttatttt gaattgaact atacattatt aggttaagat agatttagtg 120 gattcgagca ctttaatact attttggtta aattagttta ggataagatt atttgaattg 180 gatatacatt ttcgttcaag gaatacatta ggttaaacaa gattgaatat tcattttttc 240 tttttattga tattattact ctattttaca atattaatca tatttatcct ttcttcatac 300 tgcggttgag taccaatttc ctgaaaaaaa accagaaaag tataaaattt aaaatggcta 360 catcagaagg gtcagctttg ctaataaact cgcatgattt aagtgaagat gaagtagatt 420 atgagttgct gataagaaac cagttgtacg aagaaacaga aaaacagaaa cgggatcgtt 480 tgagaagaat attcctagaa gaaggtaacg atactatcgt tttaacggag gacatttttt 540 tcgttaatga attaccgtta gttacaaaaa aacttgaaga aattgaatat ggtttagcca 600 gaactattca agttaaatgg atttcacaac ttcgtcactg gaaggaaaga gttaatagaa 660 gcaaagttgt tgatctggct caaggagaac aaaaagtcga gattttattt agaatcaaaa 720 atttgattaa aatttacaaa acaactgcag ttagattgat gaatctaagc gagtctgaag 780 aggaacttgg tgattctaaa ggctcagatc gaacgcaatt taaacaggag aataaattaa 840 caaaagctag ggcaccacgt tttgcagaaa cttcggttcg gaaagaaaat caatcatttg 900 accacgatac cttatcaagg gcagtgaatt ctacaattgg agactatttc aaacaattcc 960 caaatattgc aggaatgaaa attagtattg accgcgacga taaaacaaag aaaaaagtta 1020 ttcaggaaag ggaatcgatg aggaatctag gagcggttcc caagcggttt gaaaggcttg 1080 gctctataga aggacaggat attgaagttc acgaaattaa tgagaaacag aataggaaaa 1140 agagcaggaa aacaaaatta cctaagaaac ctatgtcgga tgacagctcg tctttggatt 1200 cgttgagcag tctgctatct tcagaaagta gctcggactc cggatctatt tacccgtcag 1260 aaaggtctga tgatattgga cgtcgtgata atcgggtcca gaatcaccga cttgacaaat 1320 ggggtatcca gttcagtgga gatatccacg gtatggatgt gtctgatttc gttttccaag 1380 tcaacgaatt aatggatgct gaacgcattc caaacgatag atttttagat cacgcctaca 1440 ttctttttgc tggagaagct cgtagatggt attttaccta taaaaagaag tataaaacgt 1500 gggataaatt ttcgaaacaa ttaaaaatcc gtttcggtga cccaaataaa gacaggaaaa 1560 ttcttcaaga catcaaggat agaaaacaaa aaaagggaga gtcgtttgtg gctttttgct 1620 cggaaataga gggaatgttt gaaagaatga ctaaacagta ttcggagcgg aaaaggttga 1680 aggtccttag aaacaacatg agacgctggt acaagaccaa actttcattt ttcaaaatta 1740 aaaacattgc gcatttaagc atgctttgtc agcaattgga taaagacagt ggaagaattt 1800 attccaaaac ctcccttccc acaagaaaac atatcagaaa tgtggatgcg tcgtctgatt 1860 cttcttcctc gtcttctgaa caggaagtgt gcgctttcga caaaaaagaa aaccgacaac 1920 gcagataccg tgaaccccga ccagacataa ctgaacctca aggacgagaa aatttattgc 1980 aattgaattc tctctgctgg aattgccgaa aatatggtca tcggtggagg gattgcaaac 2040 aacccaaggt aattttttgc cacgcatgcg gcatgcctgg agttactttc atgacctgtc 2100 caaagagtca tacgcttcct caacaggctc caaaaaacga gagctcggag gagaagtaag 2160 gggtgtttct tttccggaga attgtttaga ccccaagaaa gaaaacttgt taggggatca 2220 ccattcaaac tggaaaattt atgagttgaa cattaagcca aacaaatgtc cacatgtttt 2280 agtcaaaatt gtcgattcta atattgaagc tctgttagac tcgggagctg aaatctctgt 2340 catgaattca atagacttag tccgtagata caattttaaa attcacaaaa caaatatcag 2400 agtagataca gcaggaacag ctgagcatca ttgcttaggt tttgtaaatg taccattaac 2460 gtttaatgat ttaacgagag tggttcgaat ctatattgtt ttagaatttt ccaaaccctt 2520 aatattaggc gttaattttt ggagagcttt caatatagtt ccagcaatcg aaatgaccaa 2580 taaagaaatc atatccatag atcgattgaa ttctcaaagt cagatacaag tttgtccaat 2640 tcgtctgatt gaaaactatt tttcggcaga tgaatcagaa ctagcactca cattttctga 2700 atttgtcgga gcagtacatg aacctgacag tcatagtgac gatgaagact tgtccttaga 2760 agtacccacg ctagactttc cctccatgtc gatcgaagaa atagaaacag aacacgtact 2820 gaatagggaa gaaagatttc aactactaga ggcaattagt tgtttcgagt tttccgaaga 2880 tgacaaaatc ggtaggaccg cgatcttaga gcacaaaatc caaattattc caggcgccga 2940 gataaaagct tctccgatgt accgctgctc gccatatatc cagaaatatg tggacgaaga 3000 gttagaacgg atgaaaaaga tggatgttat tgaaccatgc gactcggatt atgcaagccc 3060 tttgctaccg attaaaaaag ctaacggtaa attccgcgtt tgcttggatt cgcggcgggt 3120 taatgatgtg accgaaaata acgcgtatcc aattcctaac ttacacgaag tcttacacag 3180 aattaaaaag gccaaatact ttagtgtaat tgacctgaaa gaggcgtatt ggcaaattcc 3240 gctggctaaa gaatctagaa aatatactgc gtttaggacg aagtcagggt tatatagatt 3300 caaagtgatg cctttcggcc tgaaaggagc cccttttact atgtccaagc taatggattt 3360 agtcctcgga aatgacctac aaccctacgt atgggtttat ttagatgaca tcatcctcgc 3420 gaccgaaaca ttatcccaac atgttgcact aataaaagaa gttgctagtc gtttgaaacg 3480 ggcaaactta acgatcagcc taaataagtc taaattttgt aggaaaagtg tcaggtatct 3540 agggtacgtg atttctgaac aaggtatttc gattgacatg gaaaagatta aacctattct 3600 ggactatgaa gcaccgaaaa cagtgaaaga cattagacgt ctactgggac tagctaactt 3660 ctaccaaaag tttatcaaac attacagcga tataacctct cctatttcag atcttttaaa 3720 gaaagacagg aaaaaatttt cttggagctc tgaagcaaac gaggcgctta acaaactgaa 3780 acaggcactg attagtccgc caattctggc caatcctaat tttgacctag aatttaccat 3840 agagtctgat gcgtccgatt tagcagtagg agccgtcctt actcaaaaac agcacggcga 3900 gaagcgaatt atcgcctatt ttagtaagaa attgaattct acacggcgaa aatatgccgc 3960 agtagaaaag gaatgtttag cagtgttgtg ggcaattgaa gcattccgca attatattga 4020 aggaacgcat ttccgagtga ttacagatgc caggagtttg gtctggcttt caaaagtgag 4080 cgctgagaaa ggctcggcta agttatgcag atgggcgctg aaactacagc aatttgactt 4140 cagcattgaa tacaggaaag ggcgtgacaa catcactgca gattgtcttt cgagatcact 4200 cagcaccatt aaagtaaaat gggaggactc tgactatagt ggaatgctta ataaaattaa 4260 agttgatcca ttaaaataca aagattacaa aatagtggac aacgtgattt acaaattcgt 4320 taatgatgag tcattgatta cagataggag attcacatgg aaacgaattc caacaatagc 4380 agaaagggtt gaattaatta gggaaatcca tgaagaggct cacttaggca cagaaaaaac 4440 actaggcaaa ataagagaca aatattattg gccgaggatg tactctgacg taaaaagatt 4500 ttgtgagggc tgcatagcgt gtaggaaatc aaaatccgat aatgttaatc atgtgccacc 4560 tatgggcaaa caaaagatag ctaaccagcc ctggcaaatt attgctatag attatgtggg 4620 accttttcct cgggcaaaac ggtcgggaaa tacgtgctta ttagttataa cggacatatt 4680 ttcaaaattc gtaattatac agcctctcaa ggaagcaaaa gcgaaacaac tttcattttt 4740 catggaaaac atgatatttt tggtgtttgg cgttcccgaa atcgttctat cggacaatgg 4800 ggcgcaattt acatcaaaag aatttaaaaa tttgttgcaa aaatacagtg taagtcactg 4860 gcttacccca gtttattttc cgcaagtcaa taacgcagaa aggacaaatc gcgtaatcac 4920 ttccgctatt agggctctca ttaaaaacga gcaaaatgaa tgggacgcga atatatataa 4980 aattgctcat gcaatcaaca acgcaacgca ctcatcgagt aagttttctc catattttgt 5040 gaacttcgga aggaaccaga ttagctcagg cgaggaatat cagatactta gagatctaga 5100 acgagattta tctccggaag aacaggaaca tagcagtaag atgaaaaata ttttcgaaga 5160 agttcgaaaa aaattaaaaa ttgcctacga taaatattct aaatactaca atctacgagc 5220 aggtaaaatg gtgcaatttg caatagggca aaaagttcta aaaaagaatt attttctatc 5280 agataagggc aaaggattta acgcaaaact ggcaccaaaa tattcagagg ccgtggttaa 5340 aaaaaaagta ggagattatt gttatatttt agaagatttg aatggtaaaa gcctaggaat 5400 gtatcatgta tctcaactga agaaagttta aaaaaataaa ataaataaac ctgtagctat 5460 gtataaaaac taagaaatcc aaatttagaa ccatcaaaac acgcaaagtc ggtaaacaaa 5520 taatgaaaac gttaatatag atataaaatc tagtaaatgt ttattaaaat attgtaaaaa 5580 caatcgagta acaaaataca aacaaaaata acaaatacga aaaaacaaat aaaattaaca 5640 gcataagata tgaatgaaat gagatcatag aaaaaaaaac aaatcctttg catttagata 5700 attaattgaa tatgcattag aaaatttagt tataaaatta gagcgtatta ttgcaaatta 5760 tgtttttttt tataaaagat cgagctatgt acagaaacta tgggtgaagt caaaaaaaaa 5820 aaaaaataat aaaataaaaa aataactgca ccagaaaaat acgcaaagtc ggcataatta 5880 acacaaaaaa aaaacaaaaa aataaaatag cacaggaaga attggaaacg aaaacaaaaa 5940 tcaacttacc aaaaccacga aaggagacta catctcaaga aacaagtaca taattctatt 6000 cgtcttccag tttcgtgcta aaataaaata aacacacatt agcaatagtt taaaatgtaa 6060 ataatgtaaa tagtagtact aattaaataa gaaagctttt catctttcac gaaagagacc 6120 aaaagattca catagaataa gaaagttggg aacactgttc catcataacg tgatagagtc 6180 agtagtttca agaagaattt caaggtcaag gggtcatagt ggtgacttat tggactgaga 6240 agtttataaa ttagaaaaat agaaatataa gaaatgatag aggtcttaat aaggtggatt 6300 aattggtact tacgttttgg aaaggtagac accggtgatc ctgcaagcat gccatatata 6360 gtttctcagc aatagcctca ataccatttc acagtttgac agaatacgca ctatgcgctc 6420 caatttttga caatagacta tgccacagca aagttcaaca aagttctaat agcccggtat 6480 ttcaacatta actgacgaac cgagagagat gagcgagagt taattgtgga tatttgtaga 6540 gttttgatcg tgtccgtata ctgatacgga gtgataagta tgagttagct acattttgtt 6600 aagacgaata tgaccgacga atagttcagg tacatttttc gataattgac atatgtggtg 6660 gttatgagca tgagcctcgg atttcttcca ataaatgtat tggagtaagg tatcgttcga 6720 tatcatttga aaagaattgt ttaccagtta gttaactggt gcggtttcga attaacagtc 6780 gcgtcgtatc gctctcgggc atgttttgat ttcaagtgaa tgataaaaga aaatcgattc 6840 ccgaaaattt gcatgaaagt agtgataagt gtaaaaagta gaatataagt ctaaaatacc 6900 gtagaagtga tgtttttttt cggaaattgg atatatatga agaagaataa taaggtaaaa 6960 aaaaaatatt ccatattttt tttgggtagt tgggagtga 6999 // ID Gypsy-4-I_HM repbase; DNA; INV; 3670 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-4-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3670 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1974-1974 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 17..754 FT /product="Gypsy-4-I_HM_1p" FT /translation="MSSMAKMIGKFDGSGDVVVWLKKVKLVGELQKMGDLS FT LIIPLFLEGDAFALYEQLSENEKASAVDIEKALLDAFAVDLFEAYELFRDR FT RWNEGESVDIYMASLRQLMQLAKVENEELLQRAFITGLPGEISKQLQSQVK FT IFGINLNDVLNHARVLLSKVTKEECLSTTISGSHKNNNRKSASRFNCYNCG FT KPGHVARSCFAKKETVTEGIQCFRCGKTGHIARFCSLKQGNASGKPPAPAV FT SLDN*" FT CDS 649..3651 FT /product="Gypsy-4-I_HM_2p" FT /translation="MFSMWKNWTHCKILLLETGKRKWETSSASCFPRQLTA FT LPICSVHVNTYAGKALVDSGCTRTIISNKMAQKAGVCVKRNEFMVLAVNGE FT LVKTVGEAIVTLMVAGKQLQASVMVLKNVVNGIDAVIGMDIINRCGGVHLY FT DGKIEFGFVSIHTEKQFVDIKDKDFDANFDGMKWTAKYKWKQDEPQLKNFV FT TQYKVKPKLIHQFNEEMDLWVENGWLKPCKIDENNKKGIVPLMAVEQENKG FT KIRPVLDFRELNQYVNSHNASSDACDETLRRWRRVGDKFALLDLRCAYMQI FT HIDPVLWKYQRVVYKEMQYHLTRLGFGLNSAPKIMSSILGKVLSLDAGIGE FT ATDHYIDDIIVDLKKTSVENVVNHLERYGLLTKEPEECDGARVLGLQIFRN FT DKKVLCWKRGNIIPKSEDIENEKITRRRLFSICGHLIGHYPIGNWLRIACS FT FTKRYSQGDAWDDLIGDQAQSWLVEILRRLEASDPVKGTWNADGIQNGAIL FT WCDASSIAMGVAIEYNGVIVEDAAWLRSTNDVMHINMAELDAVVRGINLAV FT KWDIKNLSIFIDSVTVHGWLKSVLTESHRVRSNGLSEMLIKRRLSLLRDLI FT KEYSMNIIVKWVPSSKNKADVLTRVSKKWLNEISRKNVKELCGVNIADEIR FT KCHSKHHFGVNRTMFTVQQRIPQVSRKDVKDCVRKCRQCSSIDPAPIKWEH FT GVLDVPDTWYRIACDVTHYKGSPYLTMIDCGPSRFAVWRKMLHEDTKSILI FT QLDSVFREHGPPHELLMDNGAVFRSNELSNMCKKWCVQKMYRCAYRPSGNG FT IVERNHRTVKRXSARAQICPLQAVFWYNATPMIGTQEDSTPASLKNKYVWR FT LPIHNTESHERPQKDSATYRIGEQVYVRPVGAKCTTIWKIGNVTNILSKTN FT IEIDGVPRHVRDVRIVPESYGEKVSRMNSVEENESDESEFDDLHIARDEEV FT EESVDEESVADEENDGIEVMDNLLPLLRRSTRERRRPTYLNDYLP*" XX SQ Sequence 3670 BP; 1242 A; 582 C; 870 G; 975 T; 1 other; aagctatttc ggcaatatgt catccatggc aaaaatgatc ggaaaatttg atgggtctgg 60 cgatgttgtt gtgtggctga aaaaagtcaa gctggtcgga gagcttcaaa agatgggcga 120 tctgtcgtta ataattccgt tgtttctgga aggagatgca tttgctttgt atgaacaact 180 tagcgaaaac gagaaagcct ccgccgtgga tatcgaaaaa gctttgttgg atgcatttgc 240 ggttgatctt ttcgaagcct acgagctgtt cagggatcgc agatggaacg aaggcgaatc 300 cgttgatata tatatggcca gtttgcgtca acttatgcag ttggccaaag ttgaaaacga 360 agaattgttg caacgagctt ttataactgg attgcctggt gaaatttcaa agcagctaca 420 atcgcaagtc aaaatattcg gaatcaactt gaatgatgta ttgaaccacg caagagtact 480 tttgtcaaaa gtaacgaaag aagaatgtct ttcaacaaca atctctggaa gtcataaaaa 540 caataataga aaatctgctt cacgattcaa ctgttacaat tgcggcaagc ctgggcatgt 600 tgctcgaagc tgttttgcaa agaaagaaac tgttactgag ggaatacaat gttttcgatg 660 tggaaaaact ggacacattg caagattttg ctccttgaaa cagggaaacg caagtgggaa 720 acctccagcg ccagctgttt ccctcgacaa ctaacggcgc ttccgatatg cagtgtgcat 780 gtgaacacat atgcaggaaa agcactggtt gatagtggat gtactcgaac catcatttca 840 aacaaaatgg ctcaaaaagc aggtgtttgt gtcaaaagaa acgaattcat ggtactggct 900 gtaaatggag aacttgtgaa aacagtggga gaagctatag tgacattgat ggttgcagga 960 aaacaactac aagcatccgt aatggtttta aaaaatgttg ttaacggtat tgatgctgtt 1020 attggtatgg atattattaa tcgttgtgga ggagtgcacc tatacgatgg aaaaattgaa 1080 tttggttttg tttcaattca cacagaaaaa caatttgttg atataaagga caaggacttt 1140 gacgctaatt ttgatggcat gaaatggact gctaaatata aatggaaaca agatgaacca 1200 caactgaaga actttgtcac gcagtacaaa gtaaaaccca agttgataca tcaatttaac 1260 gaagagatgg acttatgggt ggaaaatggt tggctgaaac catgtaaaat agatgaaaat 1320 aataaaaaag ggattgtccc tttaatggca gttgaacaag aaaataaagg gaagatacga 1380 cctgtcttgg attttcgcga gttgaaccaa tacgttaatt cacataacgc ttcaagtgat 1440 gcctgcgatg aaacattgag gagatggaga agagttggtg acaaatttgc actgcttgat 1500 ctcagatgcg cctacatgca aatacacatt gatccggttt tgtggaagta tcagagggta 1560 gtctacaaag agatgcaata ccatcttaca cgactcggat ttgggttaaa ttctgcacca 1620 aagattatgt cgtcaatctt aggcaaagtg ctaagtcttg atgcaggtat tggtgaagca 1680 actgaccatt atatcgacga cataatcgtt gatttgaaaa agacatcggt tgaaaatgtc 1740 gtaaaccact tggaacggta tggtttgtta acaaaagagc cggaagaatg tgatggtgca 1800 cgagttttgg gtttgcaaat attcagaaac gataaaaaag tgctttgttg gaaacgcggc 1860 aatatcattc caaagtcaga agacatcgaa aatgaaaaaa ttacacgtcg gagattgttt 1920 tcaatatgtg gtcacttgat tggacactat ccaattggaa actggctcag aattgcctgc 1980 agctttacaa aacgttatag tcaaggagat gcttgggatg acctaatagg agatcaagca 2040 cagtcatggt tggtcgaaat attgcgccgt ttggaagcta gtgatcctgt aaaagggaca 2100 tggaatgctg atggtattca aaatggagca atattgtggt gtgatgctag tagtattgca 2160 atgggtgttg caatcgaata taacggcgta atagttgagg atgcagcatg gctacgaagt 2220 acaaatgatg tcatgcatat taatatggct gaattagatg ctgtggtgcg tggaataaac 2280 ctagctgtta agtgggacat caagaaccta tcgatattta ttgactcggt aactgttcat 2340 ggatggttaa aaagtgtgct gactgaaagt catagggttc ggtcaaatgg actatcagaa 2400 atgctaatca aacgaagact ttcattattg cgtgatttaa taaaggaata cagtatgaac 2460 atcatagtga agtgggttcc gtcttctaaa aacaaagcag atgtattaac aagagtttca 2520 aaaaagtggt taaatgaaat ttcacgaaaa aatgttaaag aattatgtgg ggtaaatatt 2580 gccgatgaaa ttcgtaaatg tcactcgaaa catcattttg gcgttaatcg aacaatgttt 2640 acagtacaac aaagaatccc gcaagtgtcg agaaaagacg ttaaagactg tgtgcgcaaa 2700 tgtcgtcaat gtagttcaat tgatccagca ccaataaaat gggaacatgg agtgctagat 2760 gtaccagata catggtatcg tattgcatgt gacgtaactc actataaagg gtcaccgtac 2820 ctaacaatga ttgactgtgg accaagtcga ttcgcagtat ggagaaagat gctacacgaa 2880 gatacaaaga gtatattaat tcaattggat tccgtattta gggagcatgg cccgccacac 2940 gaactattga tggacaatgg agctgtgttt agatcaaatg agttgtcaaa catgtgtaaa 3000 aaatggtgtg tgcaaaagat gtatagatgt gcatatagac catcaggaaa tggtatcgtg 3060 gagcgtaacc accgcacagt aaaacgcrca tctgcgagag cacagatatg tccgttacaa 3120 gcggtatttt ggtataatgc aacaccaatg attggaacgc aagaagactc tacacctgct 3180 tctctaaaaa ataagtacgt ctggcgtcta cctattcaca acacagaaag tcacgaaaga 3240 ccacaaaagg actcggcaac ctatagaatt ggagaacaag tttatgttcg tcctgttggt 3300 gcaaagtgta cgacgatttg gaaaattggt aacgtgacca atatactctc aaaaacaaat 3360 attgaaattg atggtgtacc acgacatgtg agagacgtga gaatagttcc tgaatcatat 3420 ggtgagaaag tttcaagaat gaatagtgtt gaagaaaatg aaagcgacga aagtgagttc 3480 gacgatttac atattgcaag agacgaggaa gttgaagagt cagttgatga agagtcagtt 3540 gctgacgaag aaaatgacgg tattgaagtc atggataatt tattaccgtt gctcagacgt 3600 tcgactagag aacggagaag gccgacatat ttaaatgact atttgcctta agaacttagt 3660 tcttggggag 3670 // ID Vingi-3_HR repbase; DNA; INV; 3191 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-3_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3191 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 2..3016 FT /product="Vingi-3_HR_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="TNSSGQSVSGGTLRWQTTSTSTSISSCSNTSHPQTQS FT AGQLNLLQLNCNGIKNKQDEISQHLIKHNIHIAAIQESKLTADSKEPVFKD FT YALYRKDRGTGRGGGLIMLIHHSIPYTIKTLPPTSTTESQAITIGANGTDI FT NIINIYIPPQSVCPSGFSASISQYLAIPNVVLLGDVNAHDALWYSSLEDSR FT GESLASEIENSGCGTLNLDTPTRLPSTGKPSSPDISVVSDSLIVNMEWSTM FT TALSSDHLPIHITLLSTFQQTASPKIKFINFAKANWEGFTSKTEAAFTKTP FT FPPNPSVAERSFCHILNVASDHNIPAGRLITHTPGLSKEITALIKKRDDLR FT ANTPTSKDIAILTSEIQTKTDNLKRTRWKDFLGTFNKKTNTKRLWSTIKSL FT SGKPIQYPNNAVSFNNRSTTNIWQMANSFNKQFTAAKRHISNKSNRCTTKA FT IKKLSLDDHPTFTDFQVATAIKSSKPSKSIGPDNLTIFHLKHLGPSGISFL FT TNVFNESIRSCRIPEIYKKAKIIPLLKPGKPVGESKSYRPISLLCPAAKVL FT EKCLLPIITEHLTLANHQHGFRPDHSTTTALALMVTDVANGFNQKRPAHRT FT VLTALDLTKAFDTVNHQILLSDLLSSSLSRSIVRWLSNYLHGRSASTEFRQ FT HLSKQRIIRSGVPQGSVLSPTLFNYYVAKAPAPPQDIKIISYADDFSIYTT FT GPDADILTRRLNCYLPKLYDFFNSRNLEISTSKSTVSLFTSHTSQYNYHPQ FT IKINNQLLPLEHNPKILGLTLDTMLSFSQHHKMAAAKATKRNNILKALSGT FT TYGQDKETLLQTYKAIGRSTIEYACPAWSPLTSNSSLNRLQRSQNAALRLV FT TGCHGITSEQHLHSECQMLTVKQHSELLSSQFLLQCYKPSHPCNIITTQPR FT PHRDMKDTLISRWSSTVVAVLPANINNKSLKAASNTLHSNAAKAAAESYQA FT PILLNGWPPPTPKIDKSEKTLSRATRWTLAQLRSGHSILLNSYKNRQWTH" XX SQ Sequence 3191 BP; 971 A; 1001 C; 539 G; 680 T; 0 other; caccaattcg agtggtcagt cggtgtcggg tgggactctc cgatggcaga caacatcaac 60 aagcacttcc atctcttcct gctccaacac atcacatccc caaacacaat ccgccggaca 120 gcttaacctc cttcaactta actgtaacgg cataaaaaac aaacaagacg aaatcagcca 180 acatctaatc aaacacaata ttcacatagc agctatccaa gagtctaaac taacagcaga 240 ctcaaaagaa ccagtcttta aggactacgc gctgtaccgc aaagaccgtg gtaccggacg 300 aggtggaggt ctcatcatgc tcatccatca cagtattcct tacacaatca agaccctacc 360 tcctacgtca acaacagaga gtcaggccat caccatcggg gccaacggca cagatatcaa 420 catcatcaac atctacatcc ctccacaatc agtctgcccc tccggcttct cagcttcaat 480 atcacaatat ttggccatcc caaacgtggt tctgttggga gacgtaaacg cgcatgatgc 540 gctttggtac tccagtttgg aggattctcg tggagaatct ttagccagcg aaattgaaaa 600 ctcaggctgc ggcacactca atctggatac cccaacccgt ctcccctcaa ccggaaaacc 660 gtcatccccc gatatatctg tcgtttcgga ttccctgatt gtcaacatgg aatggtcgac 720 aatgaccgca ctctcttctg accatctccc aattcatatc accctcctat ccacatttca 780 acagacagca tctcctaaaa tcaaatttat aaactttgcc aaagccaact gggaaggctt 840 cacttccaaa accgaggccg cgttcaccaa aacccccttc cctcccaacc catcggtagc 900 cgagagatcg ttttgtcata ttttgaatgt cgcaagtgac cacaacatcc ctgcaggaag 960 actcatcacc cacaccccag gtctctcgaa ggaaatcaca gccctgataa agaaacgtga 1020 tgatctcaga gctaacactc ccacatccaa ggacattgcc atcctcacct cagaaatcca 1080 aactaaaacc gacaacctca aacgcacgcg ctggaaggat ttcctgggca ctttcaacaa 1140 gaaaaccaac acaaaacgcc tctggagcac cattaagtct ctaagcggca aaccaataca 1200 ataccccaac aatgctgtca gtttcaacaa tagatcaaca accaacatct ggcagatggc 1260 caactcattc aacaaacaat ttaccgcagc aaaacgccat atctccaaca aatctaatag 1320 atgcacaacc aaagccatca agaaactatc tctcgatgac caccccacct tcaccgattt 1380 tcaggttgcc acagctataa aatcatcaaa accatctaaa tcaattggac cggacaacct 1440 gacaatcttc cacctaaagc accttggccc ttcaggaata tccttcctga ccaacgtatt 1500 caacgaatcc atacgttcgt gcagaatacc cgaaatttat aaaaaggcaa aaatcatccc 1560 ccttctcaaa cccggcaaac cagtgggcga aagcaaatcc taccgaccaa tatccttact 1620 atgcccggcc gccaaggtcc tagagaagtg cctgctgccg atcatcactg agcatctcac 1680 tctggcaaac catcaacatg gttttcgtcc agaccactct accaccacag cattagccct 1740 gatggttacc gatgtcgcca atggcttcaa tcagaaaagg ccagctcatc gcacagtcct 1800 cactgctctg gacctcacga aagcgttcga cacggtcaac caccagatcc tcctttctga 1860 cctcctatca tcatcacttt caaggtcgat tgtccggtgg ctctcaaact atttgcatgg 1920 tcgttccgca tcaacagagt tccggcaaca tctttccaaa caacgcatca ttcgctcggg 1980 agtaccccag ggctctgttc tttcacctac tcttttcaac tactatgtag ccaaagcacc 2040 cgctccgcct caagacatca aaatcatctc ctatgccgac gacttctcaa tctacacaac 2100 cggtccagat gcagatattt tgacccgccg gctcaattgc taccttccga agttatatga 2160 tttcttcaat agcagaaatt tagaaatctc cacttctaaa tcaacagttt cactcttcac 2220 atctcataca tcgcaataca actaccatcc ccaaattaag ataaacaacc agcttttacc 2280 gctggaacac aacccgaaaa tattaggact aaccttggac acaatgctgt cgttctcaca 2340 acaccacaaa atggcagcgg ccaaggccac caaaagaaac aatattctca aggctctcag 2400 cggcaccaca tatggccagg ataaagaaac actcctacag acatacaaag ccattgggcg 2460 ttccacaatt gagtatgcct gcccggcctg gtcccctctg acgtccaact caagcctcaa 2520 tagactacaa cgctctcaga atgctgcttt acgactggtt acaggctgcc acggaatcac 2580 gtccgagcag catttacatt cagaatgcca gatgttaaca gtgaagcagc actccgagtt 2640 gctatcctca caattccttc tacaatgtta caaaccaagc cacccgtgca acatcattac 2700 cactcaacca cgacctcacc gagatatgaa ggacactctc atctcaagat ggtcatcaac 2760 ggtcgtagcg gtgcttccag cgaacatcaa caacaagtct ctaaaggccg catccaacac 2820 tcttcatagt aacgccgcca aagccgccgc cgagtcgtac caggcaccga tactgctcaa 2880 tgggtggcca ccaccaaccc ctaaaataga caaatcggag aaaaccctat caagggcaac 2940 caggtggacc ctcgctcaac tccgatccgg ccacagcatc cttttgaaca gctacaagaa 3000 tagacagtgg acacattaac atatgtcctt tctgccaaac acatccggac gatgtgccac 3060 atctgttcag ctgtccacag aatcccactg tcctcaatcc aatagatctg tggagaaacc 3120 cggtcgcggt ggcgaactgg ctcaggcctc gcctggaacc caagtccgac tcctaggaag 3180 ggcaacaaca a 3191 // ID Copia-11_CQ-I repbase; DNA; INV; 4695 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_CQ_; KW Copia-11_CQ-LTR; Copia-11_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4695 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 337-337 (2011). XX DR [2] (Consensus) XX CC Positions [2074-2577] - Integrase core CC 'AGTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 70..4695 FT /product="Copia-11_CQ-I_1p" FT /translation="MSTERGTLGQLPYTALQVRETIFDGREGTFGTWKHRL FT LRRLAKLRLDHCLIRTPLEEACAVDLLDTKVDEAKKEVYRKRLNDDTDALD FT EIVMAVNNDVLNYIIGATCAKEAMGILIHEFQKEGTGCLISMRGRLQTLRE FT RKFDSLSELFYHYDVIVRELDRLGAKMSNAEKIHSLLLAIPGRFNHVRGAL FT TVLPNEELCKKPIAEIKRMFLDAEMAMGEERKNGREERSGPPNVALKTSSK FT KAEPKCFGCGEIGHYKNRCPRRPRKQRNGHGGGRAAGGHAMIARAGRTIRQ FT AKLVKLARAVITRHRENAMPIRGREREQPRENAEPARGREREQPSDRAVPT FT RDREREQPCDHAVPTRGREREQPCNHAVPIHRCAAVKIPPTTPQKDLRERV FT RVLTAKKLPRERELPHKAASPRERHRERTTTHGSSVTTVKSREQALAMGEV FT AREQPRENRRAALKRAIWMGERPVQRKVRFVIDSGATNHMVREENLLEKVE FT EMKQPLIISTAKSGESLRAIKKGKVTLRSVVGNVIKMVNLYNVLFIPGLET FT NLLSVRKATTAGKRVTFQGGEVIFDDKGEVIAKGKLVDGLYSVEFLREFSD FT SALLGKKELDIHRWHKRLGHLSYGAIWTLLNKKMADGFNCAASSSDRDAIC FT EGCLAGKQTSRKFRKMELPRSSRPLELVHSDVCGYMEKSTQEGFRYFVTFI FT DDYSHYTVIYLLKNKSEVFQKFKEYEALATANFGQKLSKLRCDNGREYVGK FT EFQEFCKEKGIQMVLTVPYTPQQNGVSERINRTLMEKVRAMLFDSKLPKSL FT WGEALYCATYLTNRSPTRALLDNKTPFEIWNRKKPNLSNLRIFGCPAYAQI FT PKEKRRKLDPKTKRLIFVGYANNGYRLWNNFTRCVEIHRNVVFDEESMLDV FT EELIYTNFKQSKPEHNSQDRSDYIKGEIPQSSEESADETTPKADPSLTDDD FT DDDEDGNDVSEYDDADDNLDDIETAPEEHFNEEAQVESLESMDDKITKEGE FT VGVQPTLRRSKRTKKTTRNLTVLATSNFEEIPQTISELKLRDDWDSWKEAV FT NDELGSMKANNTWVAVNSVPEGRKSINSKWVFTVKDDGRYKARLVAKGCSQ FT RPGFDYIETFAPVVRMESVRTILALANEFGWLIHQMDVKTAFLNGDLNETI FT FMKLPNDELCRSQVVRLQKSLYGLKQAGRAWNKKFDDEVKKLGFVQLKSDC FT CVYRSSTKQLGMVLYVDDILIIGEKESNINWIKNQLGRLFQMKDMKEIKHF FT LGMDISRNFQKQTLEISQVGYTEKILKRFGMSECNPVGTPLDPNVQWRETA FT DDELTTHPFKELLGCLQYLAITSRPDICAAVSALSKYQAAPSDLHWAGLKR FT ILRYLQGTIETKIVYEKKRDHHIMLGYADADFANDKDNRKSVSGYALVVFG FT NLVAWSTKRQPTVSLSSTEAELISLCTAAKEGLWLTNLLNELSVANTSFTI FT MEDNIPCIRIAEEPRSHQRTKHLDLKYLFIRELIAEGKLQLRHISTNDQPA FT DAFTKGLPKLQHRKLTNLLNVRIEGR" XX SQ Sequence 4695 BP; 1411 A; 1041 C; 1241 G; 1002 T; 0 other; ggttatgggc ccagccttaa ccgtttagag atctatctgc ctttgtttga agaaagttaa 60 gactgagaaa tgtcgactga gaggggtact cttgggcaac tgccgtacac ggcgttgcaa 120 gtccgcgaga ccatctttga tggccgcgag ggaactttcg gcacgtggaa gcaccggctt 180 ctcaggcggc tagccaaact ccgattggac cactgcctca tccgcacccc attggaggaa 240 gcatgtgccg tcgacctgct ggacaccaag gttgacgagg ccaagaagga ggtttaccgg 300 aagcggctga atgacgacac cgatgccctg gatgaaatcg tgatggccgt caacaacgat 360 gtactcaact acatcatcgg agcgacgtgt gcgaaggagg ccatgggtat cctgatccac 420 gagttccaaa aggaaggcac cggttgcctg atctcgatgc gcgggcgtct tcagacactc 480 cgagagagaa agttcgactc tctctcggag ctgttctatc attacgacgt catcgtccgc 540 gagctggacc ggctcggcgc aaaaatgagt aatgcagaga agattcactc gttgctgctg 600 gccattccgg gacgcttcaa ccacgtgcgc ggggcgctca cggttctccc gaatgaggaa 660 ctctgcaaga agccgatcgc agagataaaa cgtatgtttt tagatgcgga gatggccatg 720 ggagaggagc ggaagaacgg gagagaggaa cgctcggggc caccgaacgt ggcactaaaa 780 acttcatcga agaaagccga accaaaatgc tttggctgtg gagagatagg gcattacaaa 840 aatcgttgcc ctagacgccc acggaagcaa cgaaacggac acggcggcgg acgagctgct 900 ggaggacatg cgatgatcgc acgcgctgga agaacgattc gtcaagccaa gctggttaag 960 cttgcacgtg cggtgatcac cagacatcgg gagaacgcta tgccgatccg cggcagagag 1020 cgtgagcagc ctcgggagaa cgctgagccg gcccgcggca gagagcgtga gcagcctagc 1080 gaccgcgccg tgccgacccg cgacagagag cgtgagcagc cttgcgacca cgccgtaccg 1140 acccgcggca gagagcgtga gcagccttgc aaccacgctg tgccgatcca ccgttgtgct 1200 gcagtgaaga taccaccgac gacgccgcag aaggaccttc gtgagagagt tcgggttctg 1260 acggcgaaga agttgccacg agagcgtgag ctaccacaca aggcagcctc accacgagag 1320 agacatcgcg agaggacgac gacgcatggg tcatcggtga cgacggtgaa gtcacgtgag 1380 caggcgctgg ctatgggaga agtagctcgc gaacaaccac gtgaaaatcg tcgagcggcc 1440 ttaaagagag cgatctggat gggagagcgg ccggtgcaaa gaaaggtgcg ttttgtcatc 1500 gactcgggcg ctacaaacca tatggttcgc gaggagaacc tattggaaaa ggtggaggaa 1560 atgaagcaac ctttaattat ttccaccgcc aaatcaggtg agtccctgag ggcgattaag 1620 aagggtaaag ttactttgag aagtgtcgtt ggaaacgtaa taaagatggt taatctttac 1680 aatgttcttt ttattccggg tttggaaaca aacctacttt cggttagaaa ggccaccact 1740 gctgggaaaa gggtcacttt tcagggtggg gaagtcatct tcgatgataa aggcgaggtt 1800 attgctaaag ggaaattggt cgacggtctg tacagcgttg aatttctgcg ggagttttcc 1860 gattctgctt tgcttggtaa aaaggaactg gatattcatc gatggcataa aaggttgggg 1920 catctcagct acggggccat ttggacgtta cttaacaaaa agatggcaga cggattcaac 1980 tgtgctgcca gctcttccga cagggatgcc atctgcgagg gttgtttggc tggaaagcaa 2040 accagtcgaa agttcagaaa gatggaattg ccgagatcat ctaggccgtt ggaattggtc 2100 cactcagacg tttgtggtta catggaaaaa tcaacccaag aaggttttcg ctatttcgtc 2160 acctttattg atgattactc ccattatacc gtcatttacc tgttgaaaaa caagagtgaa 2220 gtttttcaaa aattcaaaga atacgaagct ttggcaactg ctaattttgg tcaaaaattg 2280 agcaaactaa gatgcgacaa cggtcgagaa tacgttggga aggaattcca agaattttgc 2340 aaagaaaagg gaattcaaat ggtcctcaca gttccctaca cgccccagca aaatggggta 2400 agcgaaagaa ttaacagaac gttaatggaa aaggtacgag ccatgctgtt tgatagcaaa 2460 ttacccaaat cgctgtgggg ggaggcctta tattgtgcta cttatttaac gaatcgatct 2520 ccaacgaggg ctctcctgga caacaagaca ccctttgaga tttggaatag gaagaaacca 2580 aatctgagca acttaaggat cttcggctgt cctgcttatg cgcaaattcc gaaagaaaag 2640 cggcgcaagc ttgacccaaa aaccaaacgc ttaattttcg tgggttacgc gaataatggg 2700 tatagactct ggaataactt cacacgttgc gtagaaatcc atcgaaatgt ggtttttgat 2760 gaggaaagca tgcttgacgt ggaagaacta atttacacca acttcaagca gtcaaaacct 2820 gagcacaatt cccaggaccg ttcagattac atcaaagggg aaataccaca aagcagcgaa 2880 gaatcagctg atgaaacaac gccaaaagcc gacccgagcc tcaccgacga cgacgacgac 2940 gacgaagacg gtaatgacgt cagcgaatac gacgacgctg atgataatct tgatgacatc 3000 gagacagcac cagaggagca ttttaatgaa gaagcgcaag tagaatcttt ggaatcaatg 3060 gatgataaaa tcacaaaaga aggggaggtt ggggtacaac cgactttacg tcgatcaaag 3120 agaactaaaa agacgacgag aaatttaaca gtacttgcta catcaaattt tgaagaaatt 3180 ccccaaacaa tttcagaact gaagttgcga gatgattggg atagttggaa agaagccgtc 3240 aatgatgagc tcggatcgat gaaagcaaac aatacctggg tagcggttaa ctcggttcca 3300 gagggaagaa aatcgatcaa ctctaaatgg gttttcaccg tgaaggacga cggaaggtac 3360 aaagccaggc tggtagccaa gggatgctca caacgccctg gattcgacta catagaaacg 3420 tttgcaccag ttgtaaggat ggaaagtgtt cgcacaattc ttgccctagc gaacgaattt 3480 ggctggctca ttcatcaaat ggatgtgaaa acagcttttc taaatggtga cctcaatgaa 3540 accattttta tgaagcttcc aaatgatgaa ttatgcagat cgcaggttgt gcgcctgcag 3600 aagagtttgt acggattaaa acaagccggc agagcatgga acaagaaatt cgacgacgaa 3660 gtgaagaaac ttgggttcgt ccaactgaaa agtgattgct gtgtgtaccg atcaagcaca 3720 aaacaacttg gaatggtact gtatgtggat gacatactca tcatcggaga gaaggagtcc 3780 aacatcaatt ggatcaaaaa tcaacttgga aggctattcc aaatgaagga catgaaagaa 3840 atcaaacatt tcttgggcat ggacatctcg agaaattttc aaaaacaaac acttgaaatt 3900 tctcaggtag gttataccga aaagatatta aaaaggtttg gaatgtccga atgcaacccc 3960 gtcggtacac ccctggatcc gaacgttcaa tggcgagaga cagcggatga tgaattaacc 4020 actcatcctt tcaaagagct tctaggatgc ctgcagtacc ttgcaattac ctctcggcct 4080 gacatctgtg ctgcggttag cgcactaagc aagtaccaag cggctccatc agacttgcac 4140 tgggctggtc tgaaacgaat actacgctat ctacaaggca cgatcgagac caagatcgtc 4200 tacgagaaga agagggatca tcatattatg ctgggttatg ctgatgcaga tttcgccaac 4260 gacaaagaca atagaaaatc ggtgtcgggt tacgcacttg tagtctttgg caacctagtt 4320 gcatggtcta ccaaacgtca gccaactgtc agcctatcgt caacagaagc agaattgatc 4380 tctctatgca cagcagctaa ggaaggtttg tggttaacaa acctcctaaa tgaattgagt 4440 gtggctaaca cttcattcac aattatggaa gacaacatcc cgtgtataag aattgcagag 4500 gaaccccgga gccatcagcg aacgaaacat ttagatctga agtatttgtt catcagggag 4560 ctgattgccg aaggaaagct gcaactgaga cacatcagca ccaacgatca accagctgac 4620 gcgtttacca agggacttcc gaagctgcaa caccgcaaat tgacgaactt actcaacgtt 4680 cgaattgagg ggagg 4695 // ID DIRS_Nvi repbase; DNA; INV; 5921 BP. XX AC NW_001820493; XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE DIRS-type family. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS_Nvi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5921 RA Jurka J.; RT "DIRS families from Nasonia wasp."; RL Repbase Reports 9(3), 679-679 (2009). XX DR EMBL/GenBank/DDBJ; NW_001820493; Positions 3325 9245. XX FH Key Location/Qualifiers FT CDS join(1798..1917,2108..2770,3222..4451,4474..4929) FT /product="DIRS_Nvi_1p" FT /translation="MSDEKMTPTKRKNSFSSLQVMPENCCLIYIFKFQCLE FT GPLKPDKFERRFKLQRPPPSTKHDVHTPSEGAEMTIQQSRAEQELQVEVDA FT PVSKIGGRIRFFEQSWQKITNDPFIISCVRGYTIKFDYNVIQPVVPPEPHF FT SKEENLACESSINILLEKGAVSKCKPCKKQFISTYFLRQKPNGDYRFILNL FT KQLNKFIVASHFKMENVKTALKLLSKDSYMAAMYLQDGYLLVPINKKSRKF FT LRFSYKNTCTNLTLCPSDFQQWWLNNIFLVKNYIRDNKYHLEIFTDASLTG FT WGVFCAGEKVHGWWENGESDHINFLELQAAFNGLKCFAKNEGNCEILLIID FT NTNAISYVNRMGGIKYPKLNNIVRDIWQWRESRNIFLYASYIKSKDNFEAD FT TDSRRLPPETVWEIASWAFNMIIKSFGVPEIDLFASCLNAKCKKYITWKKD FT PDSIEIDAFTIPWTNLKFSAFTLFSVILKVLRNIIIDNAEGIVVVPYWPTQ FT SWFSLYKSLATTPFLYLGPNPKLLSSPYSKSHQLSQKLILVAAKLSCNHTR FT EKRSRAETTLARYGNPVNKWWNYCREKNISLYEADTKDIIEFFTRQSEVVK FT SYSKLNIYRAALSLIMNNRLGEMPEISRFFKGVANQKPKNAKYNEIWDPDR FT VLTFLSILFSNDALSLENLTKKSRENAFIQTLSKIKLDNIKEMENEIKIRI FT TDRIKTSNRVKEQPLLKIPFFTRQENICPAKTLKVYVEKTASLRTEKDKNL FT ILTFKKPHHTATTQTISRWIKDTLKESGIDTSIYSAHSTRHASTSAAARAV FT INIETIRKAAGCSQKSKVFA*" XX SQ Sequence 5921 BP; 1960 A; 1167 C; 1156 G; 1638 T; 0 other; gatagaaacg agtcgtcatc acgatcatct ccctcatatg cggaaaaata taaatgaaaa 60 atttcatttc gatattattc gatcgagata cgacgtgctt cacgcggccc atttccaata 120 catatggacg tacgcgtgac gagtaacaac tctgcccaca gctgcacacg acggattgaa 180 atactcagtg atataaaaaa aggaatatgc aatattgcga gcctcttcaa tttcgctatg 240 tcctcgaccc taattacagt ctctgtatac cgccagtttc gctcaacggg ctaacgcgta 300 agccttcccc gagttatacc agacgcgcaa attgtgtgta tgcaaccgcc gtctatggct 360 atctgcatcc ggacgatcct cgatgtcgac cacgatgtag ccgcttctgt gtcgttcgcc 420 aattcagagc gcgctgaatg aaggttttcg gttttcggtt gaagcttgtg cacgagatcg 480 agtttggatt ttccctgcag gcggatgcgg ttgaaagctg tatacctcta tagtaggatg 540 ttcgttaggg atcaaaggaa gagtcgacgg tttgcttttg aatttcgtgc cactggttgc 600 tcgaattttt cacaccgtca gcactcgacg agcaggtgac ctgcttagat attttaattt 660 tgccttttca cgtctgttta gaagggaacg aaaattgtac ttctctgcct cgtcctctag 720 atcaaattgt agtagcgctc cactgagcgt tctaccacag gatttttaaa gcatgattta 780 agaattcact tgtctcgctt gttgccatgg tattatatct ctgaatgtgc cattacgctt 840 tcctgcgcgt tccaagctac aggcataacc ttatactcgc tcattgaatt tctttttcat 900 caatatgagt tctcatcatc gaaaaaggaa tcggagtata gatcgaagag ataacgaaaa 960 gtcgagctca ttccatcgtt cacggcgcca tacggtgcac tcacgcagca aaagtccgcc 1020 gagaccaaca gcatcgtcag catatcgaca tcaacacgag cggagaaact gtaaacgtgc 1080 acattcgggg caaagacgtg acagccaaca aggaggtaat gcatcagtgc atcgatacaa 1140 aaagcctcga aggagcagtg cgcacaaaaa ttatccgtcc gaaaacgaca gtgagcgtcg 1200 cccgagaggc tcgccgtatc gtagacgtat agtacctcca tcagcagctc gctttccagt 1260 gaatattcat accggcgcag cagctcatat gaaagtcgca cctcgctaag ttactcacgc 1320 tcccctaatc gaagtgtttc gagctgatag acatgctcag ccaccactag gaacgatagt 1380 gagaggaaaa gagccgagag tcaaaaccac cacgacaagg ttaaggtaac cgatactgcg 1440 agaacatcag gtgagtcttc tgctcctaat aaaagtcagc cagaagtaaa ctcggagttg 1500 cttaatctct taggagagtc aatccaagaa ttaaacactg aacctcctat tcatgccgaa 1560 gttgcaaaag tttggcagca aatttgccat aaaggttttg ctaagaggtc aaaagaatct 1620 ttgtttcaaa aatatttaat cccggaaaat tgtgtatttc ttaaatcgcc aaagctcaat 1680 tcggagatta aatcgggagt tggaaaatct atccaatgca gagaccaatt tcaagtagtt 1740 acgcaaaatc agcttggctc agctctttca gcaatagcga aagttttaac aaagctgatg 1800 tcggatgaaa aaatgacacc gacgaagcga aaaaattcat tttcgagtct acaagtgatg 1860 ccggaaaatt gttgcctgat ttacattttc aaatttcaat gtctagaagg gcccttataa 1920 atccagcttt gaagcaagta gcggtcaacg tagcagaaaa atccgaaatc aacgagtatc 1980 tttacggcca gaatttctcg gagaaactta aaattgccaa ggaagtggag aaagtcggaa 2040 gtggcctttc gaaacaagat cggtttgcga gaaaacctga tttttttggc tcaagaagct 2100 cctatagaaa ccagacaaat tcgagaggcg atttaaacta cagaggcccc ctccatcgac 2160 gaaacacgac gtacacacgc ccagcgaggg ggcagaaatg acaatccagc agtcgagggc 2220 agaacaggaa cttcaggtgg aagtagatgc acctgtaagt aaaattggcg gtagaataag 2280 attctttgaa caatcttggc agaaaattac gaatgatcct ttcattatct cttgcgtaag 2340 aggttataca atcaaatttg attataacgt aattcagcct gtagtaccac cagaaccgca 2400 tttctctaaa gaagaaaatt tagcgtgtga atcttcgata aatattttgt tagagaaagg 2460 tgctgtatca aaatgcaaac cttgcaaaaa gcagtttatt tcgacttatt ttttgcgaca 2520 aaaaccgaat ggagattata gatttatttt aaatctcaaa cagcttaata aatttatagt 2580 cgcttcacat tttaaaatgg aaaatgttaa aactgcgcta aaacttttat ctaaagactc 2640 gtatatggca gctatgtatt tgcaagacgg ttatttactc gtaccaatta ataaaaaatc 2700 aaggaagttt ttaagatttt cgtacaaaaa tacttgtacg aatttaacgc tttgcccttc 2760 ggactttcaa tagctcctta tattttcaca aaattgttaa aaccagttgt agagaaatta 2820 cgattagaaa ataccatatt agtgatatat ctggatgaca ttttactgat agctagaaac 2880 cgagaagatt gcttgaggaa tgtttgcatt actcgaaatt tattagaatc attaggtctc 2940 ataattaata caaagaaaag acaattaata ccgagtaaac agtgtaaata tttgggtttt 3000 attctggata gcaaacattt cagtatctat ttgactaatg aaaaacgcga aaaaattaaa 3060 aatgcagttg acaaattata caatgggaaa tcatttaaaa tcagagaagt cgcaaaagtc 3120 ataggtacgc ttgtatgagc gtgtcctgcc gttacatacg gttggctgta cactaagcga 3180 ctagaacgag atacattttt agccctcaat aaaaataata acaatggtgg ctaaacaaca 3240 tatttttagt taaaaattac attcgagaca acaagtatca tctagaaatc ttcactgacg 3300 catctctcac aggatggggc gttttctgtg caggtgaaaa ggttcatggt tggtgggaga 3360 atggtgaatc agaccatatc aattttttag agttacaagc tgcttttaat ggcctcaaat 3420 gtttcgcgaa aaatgaagga aattgcgaaa ttctactcat aattgacaat actaatgcca 3480 tttcttacgt gaatagaatg ggcggaataa aatatccaaa acttaacaat attgtacgag 3540 atatttggca atggcgtgaa agtaggaata tttttttata cgcatcttat ataaaatcaa 3600 aggataattt tgaagcagac acagactctc gtagattacc gccagaaact gtatgggaaa 3660 tagcaagttg ggcatttaac atgatcataa aaagcttcgg tgttccagaa attgatctct 3720 ttgcttcctg tttaaatgca aagtgtaaaa aatatattac ttggaaaaaa gatccagatt 3780 ctatcgaaat tgacgcattc acaataccgt ggacaaattt gaaattttcc gcttttacgc 3840 tattttctgt tattctcaaa gttctacgta acataataat agataacgca gaagggattg 3900 tagttgtgcc atattggcca actcaatctt ggttttcgtt gtataaatcg ctagcaacga 3960 caccttttct ctatctcggg cctaatccta aattactttc gtctccttac agcaaatcgc 4020 atcaattgtc tcagaaactt atcctggttg ccgcgaaatt atcctgcaat catacacgcg 4080 aaaagagatc tagagcagaa acaacattgg caagatatgg aaacccagtg aataaatggt 4140 ggaattattg cagagagaaa aacatctcgt tatacgaagc cgatacaaag gatattatag 4200 aattcttcac gaggcaatca gaagtcgtaa aatcgtacag taagcttaat atttacaggg 4260 ctgcactttc gttgattatg aacaaccgtt tgggagaaat gccggaaata agtaggtttt 4320 ttaagggcgt tgctaatcaa aaaccgaaaa atgctaaata taatgaaata tgggatccag 4380 atagagtact gacatttctc tcaattttat tttcaaatga tgctttgtct cttgagaatt 4440 taacgaaaaa atgagttaca cttttagctt tagtcacggg aaaacgcatt cattcagaca 4500 ttgtcaaaaa taaaattaga taacataaaa gaaatggaaa acgagatcaa aattcgaatc 4560 acggatagaa taaaaactag caaccgagta aaagaacaac ctttgttaaa aattccattt 4620 tttactagac aagaaaatat ttgccctgca aaaactttaa aagtttacgt agaaaaaact 4680 gcaagcctta ggacagaaaa ggataaaaat ttaatcctaa catttaaaaa gccacaccat 4740 acagcgacaa cacaaacaat tagccgttgg attaaagata cgttaaaaga aagcggtata 4800 gatacgagca tttattcggc acatagcacg cgtcatgcgt ctacctctgc agcagctcga 4860 gccgttataa acatcgaaac tattcgcaaa gcagccggct gctcacaaaa atctaaagta 4920 tttgcttaaa tcatgcttta aaaatcctgt ggtagaacgt tcagtggaac gctactacaa 4980 tttgatctat tcggacgagg cagattgtac aattatgatc gagttttcac ttacaggtaa 5040 gttcactcga tcgttcaatt ctcgacaccg gactgcgaag gattttaatt tgcaaagaag 5100 cgaccctgtt gaaaatatat acctggatat ccaggtggat tttccatatg gataatctat 5160 atcgattcat gtggattcca tatggattcc atgtggatat tccatatgga tatccacatg 5220 gatatacgtc tagtttttga aaaagtcaaa acagttaaaa attttgtatg cgcctgttat 5280 gtataggtta acaaccatta atggaagtac cattcggata ataacacctt aactagggtt 5340 gtgagctcca aaatcttaat ttgtttagca ttacaaaaaa ttttaatttc tataaaataa 5400 ttaaaaagtg aatattttgg agctcacaac cccagtttag gtgttatcat ccggatggta 5460 cttccattaa tggatagatg aactatacat aactggcgca taaaaaattt ttaactgtct 5520 tgacttttta ataaactaga cggatattca tgtggatatc catatggaat atccacatgg 5580 aatcccacat aaatccatat ggaatccaca tgaatccata aagattatcc atatggaaaa 5640 tccacctgga tatccatcag gtatatattt tcaacagggg atgaattccg cggattctcc 5700 ctccgcgttg atagcttaat ggccgcagtg tcaagtccga agtatgcgcg ccgttccgaa 5760 gggcctatca agggcctgca tcgagaaaaa tcggcccgca cctgcatgca catacatcag 5820 aattcgaaaa atcgcaacgt cctcgtacgg ccgacccact tcctgcgctc gcgccgccta 5880 taaaacgcgg cctcgtttgg aattcctcgc ctgcagccgc g 5921 // ID Chapaev-3B_HM repbase; DNA; INV; 5001 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-3_HM; KW Chapaev-3B_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5001 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 36-36 (2008). XX DR [1] (Consensus) XX CC Chapaev-3B_HM is a very young subfamily of Chapaev-3_HM DNA CC transposons that was active in the hydra genome less than a few CC million years ago (copies are ~1% divergent from their consensus CC sequence). The consensus sequence of the TPase-encoding region CC was obtained based on multiple alignment of 4 copies; it codes CC for a 954-aa Chapaev transposase (ten exons). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(456..1382,1482..1774,1859..2071,2165..2404, FT 2491..2567,2687..2892,3063..3392,3497..3652, FT 3873..4065,4196..4422) FT /product="Chapaev-3B_HMp" FT /note="Transposase." FT /translation="MATVVATNHILCLMKLCRLCGNYIGHDPVNTENIKIR FT IDQAFFTELEEDRLDTHPPKICMKCYTNLRNIEKRGTTSNFSLSQWPLTCF FT IERCICFWKNPGRKKKKQCGRPSINDKVIWTRESICKIMESFSGLSRLCHD FT SIDVADNPHKDLCVCKICMRLLHKPVLLNNCQHLFCATCIFPYIIGKLETE FT TKCTICFSNITLGSILKATSMQNILDNLFIKCCNNTCKEKFTVKNISFKEE FT HEKTCKVRNKSQYSSPSSSSSLTVTEIYKINEHCEIPKELDYAFAHFAKLK FT MAKNKSHSFELPSGGPRAIQFKVTPKIYKTSNSISQKTIKRRQNQIHQSLV FT ENAGLAKKAKINQAALMLNSFNDNDKINILKKANIPQVQIESEDIVSLKAD FT CGLPWEKMKKMIRFFHTKNVKLPSLSNQRKVSKYWSGDDLIVENKKFLFEL FT KNKKETFEIKDTPTAYINDLPSHLIKLLDKLERYNQLTYEKINQSEIHVKV FT GGDHGGGSFKMSYQIANVSKPNSKNNTTVFNIFEAKDYRANLKIGLSRHID FT EIRQLQTMKWREKNLRVFIFGDYVFLCAIYGITGATGRHCCLFCEITSSEM FT QLNTENRNRQINLRTLQTLKSDYERFKNNGGNIKNAKKFNNVIDEPLFDIP FT IDQIAIPALHISLGIFLKFFDMLEAECHLLDVKLAAFLAMNDKRLDSIEFD FT KYVKKQVEISQLKYVIDDINNKIILIQDTCVVEVLRNPENSDYLQLIYTER FT IALLKSKKIEKEAQIAECCKVKLLEGLGPIIKGIESVLQSCGVQRQAYHSC FT SFIGNHVHKMLKVDNIKKLCDAIPLVLFINIKHQEHPIHVEAINIQQFNIL FT FNNYSKCHNKMNSCYLFSEENIKSFENHMGVILSSWGIGLGLYEEQGGESI FT HAEFNNIGRIYSSMSGTRKLESIIRGHIIRNSIMAKSLKPPIVPRRNKTV" XX SQ Sequence 5001 BP; 1873 A; 651 C; 752 G; 1725 T; 0 other; cacggttgtt ttatcttgat acctatcaga catttatagt acgcatgcgt ctccgtgaat 60 tttctgtttt cctctcgata tttgctataa ttttaatttg tacattgaga cactgaaatt 120 aggaacacat ttttaatcta ttatattttg tttttcttag tgttggaacg ttttgttgga 180 tcgtcaaata ataaaatatt atagttaatt aaaaattttt ttgattcgat tgctttttaa 240 caaactttca agagctgtaa taaaaagtat tatttttaaa attttaagtt ttttctaaac 300 tcttcaactt ttttaattca atttttggaa gcaagaacta aacggtttct taaaatatta 360 tttaaattct tttagaaaac accaatttta tcacccagtc atttttttaa ataatcaaat 420 aattggatta cataagtacg atatcccatc gtaaaatggc tactgttgtt gctactaacc 480 acatattatg tcttatgaaa ctctgcagac tatgtggaaa ctatatcggc catgatccag 540 ttaacacgga aaatataaag attcgtattg atcaggcatt ttttactgaa cttgaagaag 600 acaggctgga tacacaccca cctaaaattt gcatgaaatg ctacacaaac ttgagaaata 660 tcgaaaagcg aggaacaacc tctaatttca gtttaagtca atggccactt acttgtttca 720 ttgaaagatg catatgtttt tggaaaaatc ctggtaggaa gaaaaaaaag cagtgtggaa 780 gaccatcaat aaatgacaaa gttatttgga caagggagtc aatttgtaaa ataatggaaa 840 gtttttctgg actgtctcgt ttatgccatg attctattga tgtagcagat aatccacaca 900 aagatttatg tgtatgcaag atttgcatga ggttgttgca taagcctgta ttacttaata 960 actgtcaaca tttgttttgt gctacatgca tttttccata cataatcgga aaattagaga 1020 cagaaacaaa atgcacaata tgtttctcta atataactct tggtagtatc ttaaaagcga 1080 cgtcaatgca gaatatactt gataatttat ttataaagtg ttgtaataat acatgcaaag 1140 aaaaatttac tgtaaaaaat ataagcttta aagaagaaca tgaaaaaacc tgcaaagtta 1200 gaaacaaatc acaatacagt tccccatcat ctagctcttc attgacagtt actgagatat 1260 ataaaataaa tgaacattgt gaaattccaa aagagttaga ctatgcattt gctcactttg 1320 caaaactgaa aatggcaaaa aacaaatcac attcctttga actcccttca ggtggtccac 1380 gggtaagcta tttttatttg tttaaattaa aatataaata ttaatgtaat aaaataatag 1440 tttgaagaat agtttatagt taaataattg gtcttcaaca ggctatacag tttaaagtga 1500 ctccaaagat ctacaaaact tcaaatagta ttagccagaa aactattaaa aggcgtcaaa 1560 atcaaataca ccaatcactc gttgaaaatg caggactagc aaaaaaagca aaaataaatc 1620 aagcagcact tatgcttaat tcttttaatg acaatgataa aattaatata cttaaaaaag 1680 caaacatccc ccaagttcag atagagtcag aagacattgt atcactaaag gcagattgtg 1740 gtttaccttg ggaaaaaatg aaaaaaatga ttaggtaagt gctttggaac ctgaggatgt 1800 taatttggct cagcgcacac acttatatat ctaattattt acagttactt tatttcagat 1860 ttttccacac aaaaaatgtc aaacttccgt cactttccaa tcagcgaaag gtttcaaaat 1920 attggtcagg tgatgatttg attgttgaaa ataaaaaatt tctgtttgaa ttgaaaaata 1980 aaaaagaaac ttttgaaata aaagataccc ctacagcata tattaatgat ttgccatcac 2040 atttaatcaa gttacttgat aaattagaaa ggtaagttca aaatatattt agatgtaaat 2100 ataaatattt tgtaagtttt tacctttgtt ttataaataa aaaatttctt ctttattatc 2160 gtagatataa ccaactaaca tatgaaaaga taaatcaaag tgaaatacat gtaaaggttg 2220 gtggcgacca tggaggtggt tcatttaaaa tgagctacca gattgctaat gtttctaaac 2280 ctaattcaaa gaataatacg acagttttca atatatttga ggccaaagat tatagagcaa 2340 atcttaaaat tgggctctca agacacatag atgaaattag acagcttcag acaatgaaat 2400 ggaggttatt aatttaatat tttaaaaatt tatttgcttt tcgaattata aaaaatgttt 2460 acaaaatttt ttaatcattt ttaattttag ggaaaagaat ttacgagtat ttatatttgg 2520 tgattatgta tttctgtgtg ctatctatgg tatcacagga gcaactggtc agtctatagt 2580 tttctattca ctaagcttgc attctaaatt aaattactaa aaatgaaatt aaaaaaatta 2640 taaaagtgaa ataaaaacat taaaaattat gtaatttatt taacaggtcg tcattgctgc 2700 ttattctgtg agatcacatc aagtgaaatg caattgaata ctgaaaatcg gaatcgacaa 2760 atcaatttaa gaacgcttca aactttgaaa tcggattatg aacgattcaa aaataatgga 2820 ggcaacatta aaaatgcaaa gaaattcaat aatgtaatcg atgagccatt gtttgatata 2880 ccaattgatc aggtgtgata aataatttgt tgccaaacgt ttaatgcgta taatatatat 2940 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3000 atatatatat attatataac acacctctaa ctattatgtg attgttatta atcatttaac 3060 agatagcaat accagctctc cacatctcat tgggaatatt tctaaaattc tttgacatgt 3120 tagaagctga atgtcatctc cttgatgtaa aactagctgc atttcttgca atgaatgata 3180 aacgccttga ctcaattgag tttgacaaat atgttaaaaa gcaggttgag atatctcaat 3240 taaaatatgt cattgatgat ataaataata aaattattct tattcaagat acatgtgtag 3300 ttgaagtact tcgtaaccct gaaaattcag attacttgca gttaatttac acagagagaa 3360 ttgcactatt aaaatcaaag aaaatagaaa aggtattaaa tagtactaga aataatgatt 3420 tgtttttaga gcattgtata ttttatgggg tgtattatct agattttttc tacaataatt 3480 atttttaatt tttaaggagg cacaaatagc agaatgctgt aaagttaaac tattagaagg 3540 acttgggcca ataataaaag gaattgaatc tgttttacag tcgtgtggag tgcaacgtca 3600 agcatatcat agttgctcat tcattggaaa tcatgttcat aaaatgttga aggtatgaaa 3660 atataaatct gtattttcat gaatttattt catataaact attgtgtgtg ttgtgcaagt 3720 tttcataaat tctgcattaa attttgaatt ataaccaacg ttcttgatct aatatattgc 3780 aatgcatata catatatatg catataaagc atatgcatat aataataact tgttgtacaa 3840 atgtggttta tgatttttgt ttcattgttt aggttgataa cataaaaaag ttatgtgatg 3900 caattcccct agtacttttt attaatataa aacatcaaga acatccaatt cacgtagagg 3960 cgattaatat acaacaattc aatattttgt ttaacaacta ttctaagtgt cacaataaaa 4020 tgaactcatg ttatttattc agcgaggaaa atatcaagag ttttggtaag tttattaata 4080 aatcttgcca aagacaaata tttccaccat ttatttacaa ttactagttt tttttgtaga 4140 aataacagta tgtgaattaa tgaaacattt tcgaagcaca aaatgcatat gttagagaac 4200 cacatgggag taattttgag ctcatgggga ataggtcttg ggttatacga ggagcaaggt 4260 ggtgaaagta ttcatgcaga atttaataat attggaagaa tatactcaag catgtctggc 4320 acgagaaaat tagaaagtat aataaggggc catatcataa gaaatagcat aatggcaaag 4380 tcattaaaac caccaatcgt ccctcgaaga aacaagactg tataatatgt tatatttttc 4440 gtacaataaa atgattttac gcaataaaat tatatagttt atgtcaaata tttgtattta 4500 atatgtttgt ataaagttat tttgagatgt tgtagacata agtcagccat gtatcgaaat 4560 attatcgaac tttgtagcta cgtatcttca tttaaaaaaa tgactggtaa aaatactgag 4620 ttaaatgttt tagctcaact tttatttttt tctatgatat ataatactat atttcaaaag 4680 ttgatattcg taggttttat tgagaaataa aaatggtagt aaaatccgca atgaaaatct 4740 tggtcaactt taccggcgag ttaaggctct aggcgtgaca tatttttttt tcttttaatt 4800 ttttctaata attaaaatgt aagtaaatac aaaaaaaagt ttattagtga gatttaatat 4860 ttcagtatcg tatatatgaa atattgaaaa tacctctaaa ttcgccgttt tcgtagctag 4920 tggaaaaata gttttaaagc gcaatttatt ttatacgtaa tgcgcttgcg cataataaat 4980 gtctattgcg aaacggccgt g 5001 // ID BEL-596_AA-LTR repbase; DNA; INV; 471 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-596_AA_; KW Pao_Bel_Ele208; BEL-596_AA-I; BEL-596_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-471 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 471 BP; 159 A; 93 C; 74 G; 145 T; 0 other; tgttaggatt aatcatcaga atagacccct cgatcgttcg tctaacggat ggatcgcagg 60 cagcttcaca ctaggctaag caaaacccca atacacatac aacatgcaat agagctcctc 120 gtcattgttt gccatcatgt ccgatacaat taattatcag ctaaagtagt aaaagacaga 180 atctgtgtcc aattgttaaa atctattttc tacaccatta gtaagaattt attgctgttt 240 atacctagaa attgttcaac ttatactaaa tttcttctta aactaggttt ccttaagcta 300 acttacctag aagcatattt gttcgctgaa aaatacttaa tgaggagatt aaatgtaagt 360 acaaacgcta ttgttcggca tgcaaatgtt taataaaact atttacagct tcgagctaac 420 tttcgagcaa aaatcgagtt tgcttcggag aatccgaata cttcagcaac a 471 // ID BEL-182_AA-LTR repbase; DNA; INV; 654 BP. XX AC AAGE02029203; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-182_AA_; KW BEL-182_AA-I; BEL-182_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-654 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029203; Positions 16122 16775. XX SQ Sequence 654 BP; 219 A; 124 C; 147 G; 164 T; 0 other; tgtcggcgca ggagtcggca agctggcaac actgccaatc aacaatgctg aagggcaggc 60 gacagctgtc aaagcgccgc ttcatcacga gtgacagatg tcactacgga ggggacgaac 120 cgtcagatgc tgcttccggt cgtccatttc gaagccagtg ctaaacgtgc gtagttgcaa 180 attagatagc tgtagtaaaa gtgaattatt taggcagtga aaattaaatt gagtgaattt 240 attaaactag ttaaagtgaa tttattaaaa ggggactgct aattggtcct gagacaaatt 300 cgcccggaat catcggtccg agaatccaac tctgtgaagt gagtctatag gtccagatcc 360 ggtgatgttc ggtaatccct aggtccgtcc gcaaattccg ctaatcctga aaagagaaaa 420 aaagtaagaa aataatagaa atagaattaa aagaaataac taaaattaat aatttaggtc 480 gtagaccatc gaaacaactg agattctgcc aaagggcgtg aagagagaaa aaggtcaccg 540 aattgtaagt gaatttaacc ttacctgcaa catgtgtact taccctaatt tgttttatag 600 tttgaagcgc tgcaatacat acttgcttac aaaaccggtc atacttttaa tcca 654 // ID SMARN25 repbase; DNA; INV; 782 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of non-autonomous Mariner-type family of DE repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW SMARN25. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-782 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1893-1893 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 782 BP; 253 A; 112 C; 105 G; 294 T; 18 other; cacctaattc tgtttttaca cgaattattt ttacacgaat ttgaaataac acgagtaaat 60 acgaaaaaat atttctnttt ttacacgaca taatttgaaa taacacgaat acaaaaaaaa 120 caaaattact ttttttacat gaattattgg atttaacacg ttttttttca atgaattttt 180 tacaaattat gtatacatac attttaactt cgcattatat tcttcagaat tcaccacgct 240 anantatgtc ganctntacg cgnacgnttt ctntncgagt gggnttcccc gaagtanacg 300 taaagcanca gccgtnttgt actttgtttg tttacacgca atttagttcg gttttgagaa 360 ttgaaangaa aagatcgttt agttgttgct aatgtttaac aagttataag taaaataagg 420 tagaagctcg tataaattta ttaattctgc tcgtatagnt ttgttaaccc aataacaggt 480 tagtttcaaa tatttanttt caaatctttg tattattgat tataattata gattaataaa 540 atatattttg gtagccttaa cacccatgtc acataattcn cctgcantaa aaacgaagag 600 gaaagcgatc agcttagata ccaaaattat tatacttttg tttgtttatt gtacttttgt 660 tgctgtttta ggtatggaac caatctacta tttttccatt cggccaatat ctttttttac 720 acgatttttt tttacacgaa ttttttagga acgtatctat cgtgtaaaaa cagaattagg 780 tg 782 // ID DNA8-86_AP repbase; DNA; INV; 466 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-86_AP. XX NM DNA8-86_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-466 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2022-2022 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 466 BP; 154 A; 70 C; 72 G; 170 T; 0 other; caggggtggc caaacggtcg atcgcgatcg actggtcgat cttggacgat ttctaagtcg 60 atccccttgt tagaatttat taattattct tatgcatttg tcatttagta gattcagtta 120 ttaaatacat atatttcgga ttaacctgtt agtgtgaaat ggcattttct cacatgaaat 180 ttataaaagc tgagtaggtt aggtttagac ttaagcacgg agtcttcgtt tggccatgat 240 atatgcaacg acacgcttct acatcaacag taaaatgtta atacatttta aaattattta 300 gaaatttaat gtgaaacact cattttagat aacttaacac tttaatttta ttatttatat 360 aagtatgttt attattaata ttaaataatt tttttacatt agtcgatcct aaaagaaaaa 420 atataccctc agtagatctt aatcaaaaaa agtttggcca cccctg 466 // ID GQRP1 repbase; DNA; INV; 85 BP. XX AC J01056; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE G.quinquedens highly repetitive DNA fragment. XX KW GQRP1; Repetitive sequence. XX OS Chaceon quinquedens OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; OC Eubrachyura; Portunoidea; Geryonidae; Chaceon. XX RN [1] RP 1-85 RA Christie T.N. and Skinner M.D.; RT "Selective amplification of variants of a complex repeating unit RT in DNA of a crustacean."; RL Proc. Natl. Acad. Sci. U.S.A 77(5), 2786-2790 (1980). XX DR GenBank; J01056; Positions 1 85. XX SQ Sequence 85 BP; 18 A; 28 C; 14 G; 25 T; 0 other; agcttatcac cacctgtaac aacttttttt gtataagtcc ccaagcgctt caccgtgcca 60 cagccctgct tttggccgtc aagct 85 // ID DNAX-6_Tad repbase; DNA; INV; 435 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 3) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-6_Tad. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-435 RA Jurka J.; RT "DNA transposons from Trichoplax adhaerens."; RL Repbase Reports 9(10), 2148-2148 (2009). XX DR [1] (Consensus) XX SQ Sequence 435 BP; 144 A; 67 C; 66 G; 158 T; 0 other; agtttctaac cttaaaaaca gctgaaaagc tgacatttac ataaattaga ataatttaat 60 aacgtaatgt ctttgattga aatctaagtt gtgtactttt aaggtaatca ttgactgatc 120 gctaagtgta aaaaatccat atttgatacc gacggtatag tttcaatcta tctcgaagga 180 gtgcgattgg cacgatttag catcgttttg tgccattttg caaggaaatt ttgccaatcg 240 caccccttcg atgtcgattt aaactacatt gtcgatatca aatctggatt attttttcgc 300 ttaacgatca gtcaatgatg ccttaaaaag tacacaactt ggatttcaat caaaaatatt 360 acattattaa attaaatttt aatttttcta atttatgtaa atgtcagctt ttcagctgtt 420 ttaaggttag aaact 435 // ID Gypsy-221_AA-I repbase; DNA; INV; 6948 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-221_AA_; KW Gypsy-221_AA-LTR; Gypsy-221_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6948 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1045-1045 (2011). XX DR [2] (Consensus) XX CC Positions [5179-5655] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 427..2709 FT /product="Gypsy-221_AA-I_1p" FT /translation="MDLQSAYRHMDVSHLAIDEVEHELLIRNMLFHFDDHD FT SVKRRKLKDRMKEERNMGINSTAFARTWRTAQEEIEIVRSRIQIIRGIFEN FT PKADARQKQKLRTRIIHYRVRISQLARATDARKYLQQISDIEDQIDEIMAT FT HFDAMNSESDKTKAVPLDEKITEVLGEVRTEIATLNDTVATLEGGDEGGEL FT EQASGGAMSVIKQKKHEIELSKKRADEILQKLNEYEPGKEKEFEHLISAFK FT DFVVHTSEQQRLKREQEIVTERQRLEEMASRIKRKQNLEKLLTELNENLSQ FT SKNGSTIMLNTSNASPSDRSSTSENENGMEQVELKASASSSKSGKADSGKG FT NRRSVEFQKNVESSTESSGENYECARTSEKKGKMNKLRKRDSIRSIMKKSR FT KKTVRSSTSSEQTFSTDSSESSESLSSSSSCSMSSTEEERKKKKRAKKHKR FT ARKSIKRIPVAEWRLKYDGKDDGRKLAEFLKEVKMRSRSEDISDRELFRSA FT LHLFTGRAKDWYIDGVDNGDFRNWSELKRELKREFLPPDLDFQLEIQATNR FT RQSRGERFCDYSHDMQKIFQCMTKQLSERRKFDLIWRNMRHDYKNALTGAG FT IRSLSKLKKYGRKIDENFSFLQKQNDTRARPNQVSEITSNREKGNSGNSGN FT NTRVFTNSKNQSKPKLSGEGKEEQPPKVGKESGKGRVEEKTQPMEGSSTGT FT MLAIAEQYRRPPIGTCYNCRTHGHHYVECPEPRYKFCRICGFQDVFTSACP FT VCQKNIKGSA" FT CDS 4783..6054 FT /product="Gypsy-221_AA-I_2p" FT /translation="MFSIIGMSGNSPAGRRRRAARARAAARGRAPPPPGAP FT AGRRARPPAGRPGPPPRRRGRRPACIPEALRKEILEKEHVSAFHIGYEKLL FT DKLRQRYFWPNMAQSAKKYVQSCQQCKEFKPANVSQHPEMGKQRLTTKPFQ FT ILSMDFIQSLPRSKTGNTHLLVLMDMFSKWTMLFPVKKIASEIVIRLVEQH FT WFRRYSVPEILITDNASCFLSKEFKTFLDRYHVQHWANARHHSQANPVERL FT NRNINACIRTYVKSNQRLWDTRISEIEYTINNTRHSSTGFTPYRILFGHEI FT VADGVEHRLDADTEHLSEKERMEQKLKIDHSIFDTVYKNLERAHEKSTRSY FT NLRFRKAAPVYEVGQKVFRRNFAQSSAGEAYNAKLGPMYIPCTVVARRGTS FT SYELMDSTGKNVGIFSAADLRPGNAEERDS" FT CDS 2673..5234 FT /product="Gypsy-221_AA-I_3p" FT /translation="MSSLSKKHQGVSLRRQVDLDLENPPTLKQMAQTLTSY FT GLEPISATDYDIAHDLNEIFVHIQGDGRPFAKVNVMGVEILGLLDSGAQRS FT VLGTGAEKLIRTLKLKIYPTPASVRTAEGKDVPVKGLVHLPITFHNQTRII FT PTLVAPELRRRLILGFEDFWRAFHIRPTLLGEEGLVRVDEVESTPGNSIEE FT STHDLTDEQRAQLETVKTLFKVAIDGETLDVTPLITHKIELREEYRSSPPV FT RINPYPTSPEMQRKINQEIDKLLSQKVIEKSNSDWSLSTVPVVKPSGEVRL FT CLDARRLNERTQRDAYPLPHQDRILSRLGASKFMTTIDLTKAFLQIPLDCH FT SRKYTAFSVVGRGLFQFTRLPFGLVNSPASLSRLMDEVLGYGELEPNVFVY FT LDDIVVVSDTFEAHLCTLREVAKRLQAANLSINLQKSKFCVTELPYLGYIL FT TPEGIRPNPDRIDAILNYERPSSIRALRRFLGMANYYRRFIASFSEMSAPL FT TNLLKKKPKCLIWNNEAEQAFIRLKESLISAPILSNPNFSLPFQIQTDASD FT NAIAAVLSQEHDDGEKVIAYFSQKLTPAQQSYAATEKEGLAVLSAIEKFRP FT YIEGMHFTVLTDASALTYIMNGRWKTSSRLSRWSIDLQGYDFDIKHRRGRD FT NIIPDALSRAVEVLEITVTDWYQDLLNKVKANPEENLDYRIESDKLYKFVP FT TKTDVFDYRYEWKLARGAPPARRARPRRGARPRPAAAGRPGGPARPPPGRA FT PGAPAAAARPPPRLHTRGASKGNSRKRARIRISHRVREVVGQAETKIFLAQ FT HGSVGQEICSKLSTVQRIQTGERFTAPRNGKTAPHHKTFSDPLNGLYPVLA FT A" XX SQ Sequence 6948 BP; 2147 A; 1455 C; 1650 G; 1687 T; 9 other; attggcgccc agcttaaaga aaagcgatac tgatcagaag ttcaamggga gtgagattgc 60 ttagggagca gatattttgc gatagctagc gaaaaataac cttcatttgc tcggattggt 120 gagtaggtag ggtacttgat ctgtgtagtg cgagccacac tttccgcaga gatcactgag 180 agagctgtgc ttctactagt gcgcgaacgt cgtttagagt aattaatttg ataaaaagcg 240 tactaaatag ttgcgcgcga atcaatttta ggtatttttt ctaatttttt tttcattaat 300 gggtcttgac tcaaccaatc acaactaggt aattaacaaa ttttgaccaa cgttatctct 360 cgattctgtg agaatacaag aacatttttc gattggcaag tgtgactgta gtggaaaagc 420 cttgcgatgg atttacagtc ggcttaccga cacatggatg tgtcccatct cgctatagac 480 gaggtggaac acgaactctt gatcagaaat atgttatttc attttgacga tcacgatagc 540 gtaaaacgac gaaaattaaa ggataggatg aaagaggaga gaaatatggg cataaattct 600 acggccttcg ctaggacgtg gagaacagcc caggaagaga ttgagattgt gagatcccga 660 attcagatta ttcgtggaat ttttgagaat cccaaagccg atgccaggca aaaacaaaaa 720 ttgagaaccc gtatcatcca ttaccgcgtg agaattagtc agcttgcgag agctactgac 780 gctcgcaagt acctacaaca gattagtgac atcgaagacc aaatagacga gatcatggca 840 actcattttg atgcaatgaa ctcggagagc gataaaacca aagcagttcc attggacgag 900 aagataacag aggttttagg agaagtacgc accgagatcg caacgttgaa cgatacggtt 960 gctactctcg aaggtgggga tgaaggggga gagttagaac aggcaagtgg aggagcaatg 1020 agcgtcataa aacagaagaa gcatgaaatt gaactttcga agaaaagggc agatgaaatt 1080 ctacaaaaac tgaatgaata cgaaccagga aaagaaaagg aatttgaaca tttgatctct 1140 gcatttaagg actttgtcgt tcatacgtct gagcagcaga ggctcaagcg agagcaagaa 1200 attgtaacag aacggcaaag attggaggaa atggcctctc gtattaaaag aaagcagaat 1260 ttagaaaaac tcttgactga actgaatgag aacctgagcc aatctaagaa tggttccact 1320 attatgttga acaccagtaa cgcgtctcca tcagaccgtt catctacatc agaaaatgag 1380 aatggaatgg aacaggtcga attaaaagcc tcagcktctt ctagtaaatc tggaaaagct 1440 gattcgggaa aaggaaatag aagaagtgta gagtttcaga aaaacgtaga aagttcaacg 1500 gaaagctctg gtgaaaatta tgaatgtgct cgaacttctg aaaagaaagg aaaaatgaat 1560 aagctgagaa aaagagatag catcaggtcg ataatgaaga agagtaggaa gaagacggta 1620 aggagctcca cttcgtctga gcaaactttc tctactgatt cctcggagag ctcagaatct 1680 ttgagttcga gttctagttg ctctatgagc tcaaccgagg aagaaaggaa gaaaaagaag 1740 agggcgaaaa agcataagag ggctagaaaa tcaataaaaa ggatcccagt agccgaatgg 1800 cgacttaaat atgatggtaa ggatgacggt cggaaattgg ctgaattttt gaaggaagtc 1860 aagatgcgat ctagatctga ggatatttca gatcgagagc tgtttaggtc cgcgcttcat 1920 ttattcaccg gtcgtgccaa agattggtat atcgatggtg ttgacaatgg ggatttcaga 1980 aattggtcag agttgaagag ggagttgaaa agggaatttt taccccccga tcttgacttc 2040 caactagaaa ttcaggctac caatcgccgc caatcccgag gggaaagatt ctgtgattat 2100 tcccacgaca tgcagaaaat tttccagtgc atgaccaaac agttatcgga gagacgaaaa 2160 tttgatttaa tctggcgcaa tatgcgccac gattacaaaa atgctttaac gggagcaggg 2220 atcagatccc taagcaaact gaaaaagtat ggcaggaaaa ttgatgagaa tttcagcttt 2280 ctccaaaagc aaaacgacac tcgggctagg ccaaaccagg ttagcgagat cacgtccaac 2340 agagagaaag gcaatagcgg taactcaggg aataataccc gagtttttac gaatagtaaa 2400 aaccaatcta aaccgaaact atctggagag gggaaggagg aacaaccacc caaggtagga 2460 aaagagagtg gtaaggggag agtagaggag aaaacccaac cgatggaagg gtcttctacg 2520 ggtacgatgc tggctattgc agagcaatac cgtagaccac ccattggcac gtgctacaat 2580 tgccgaacac acggacacca ctatgtcgaa tgtcccgagc caaggtacaa attttgtcgt 2640 atttgtggtt ttcaggacgt ctttactagt gcatgtccag tttgtcaaaa aaacatcaag 2700 gggtcagctt gaggaggcaa gttgatttag acctcgaaaa ccctcccact ctaaagcaaa 2760 tggcacaaac cttgacctca tacggcttgg agcccatttc agctaccgat tacgacattg 2820 cacacgattt aaacgaaatt ttcgtccaca tacagggaga tggtagacct tttgccaagg 2880 tgaatgttat gggagtagaa atactcggcc ttttggacag cggagcccag cgatcagtat 2940 tggggaccgg tgccgaaaag ctcattagaa cattaaaatt gaaaatttat cctactccgg 3000 cttcggttag aacggcggag gggaaggacg tccctgtaaa aggtctcgtt catctgccta 3060 tcacttttca caaccagact cgtatcattc ccactctggt ggcgcccgaa ctccgaagga 3120 gactcatttt gggtttcgaa gacttctgga gagcatttca tatacgacca actttgctag 3180 gagaggaagg actggttaga gttgacgaag tagagagtac cccgggaaac agtattgaag 3240 agtccactca cgatctcaca gatgagcaaa gagctcagct cgagacggtg aaaacactct 3300 ttaaggtagc aattgacggt gagaccctcg acgtcacacc cttgattacc cataaaattg 3360 aactgaggga agaatatcgg agctctccac cagttcgaat caatccttac cctacctctc 3420 cggagatgca gmggaaaata aaccaagaaa ttgataagct cttatctcag aaggtgatcg 3480 agaagagcaa cagtgactgg tctctcagta ctgtgccagt ggtcaaacct tctggcgagg 3540 taagattatg cctagatgcc cgtcgtctga atgagcgaac acaaagggat gcctatcctc 3600 ttccccacca agaccgtata ctgagtcggc taggagcaag taaatttatg accacgatcg 3660 atttaacgaa agcctttctc cagatcccgc tcgattgcca ctcacgcaag tatacggcct 3720 tctcagtggt ggggagagga ctgttccagt ttaccagact gcctttcggt ctagtaaata 3780 gtccagcaag tctgtcacga ctcatggacg aggtattggg ctatggtgaa ctggaaccga 3840 atgtgttcgt ttacctcgac gatatcgtcg tggtaagcga cacattcgag gcccaccttt 3900 gcactcttcg cgaagtagcc aagaggctac aggctgcgaa cttatccatc aacctccaaa 3960 aatcaaaatt ttgcgtgact gagttaccct atttgggtta catactaaca cctgagggaa 4020 ttcgcccgaa cccagacagg atcgacgcta tactaaatta tgaacgtcca agctcgattc 4080 gggccttgcg ccgatttttg ggcatggcaa actattacag gcgatttatt gcctctttta 4140 gtgagatgag cgcccctctc accaatctgc tcaaaaagaa acccaagtgt ctgatatgga 4200 ataatgaagc cgaacaggcg tttattcgtt tgaaggagag tttgatctca gctccaattt 4260 tgagcaatcc gaatttcagc ttacctttcc aaatacagac cgatgctagc gataatgcga 4320 tagctgccgt gctgtcccaa gagcacgatg atggtgagaa agtgattgcg tatttttcgc 4380 aaaaactgac tcccgctcag caatcatacg cagcgaccga gaaagagggc ctggcggtac 4440 tctcagccat cgaaaaattt cgaccatata tcgaggggat gcatttcacg gtgttgaccg 4500 atgcgtcsgc actaacttat attatgaacg ggaggtggaa gacgtcatcc cgtctgagta 4560 ggtggagcat tgatttgcaa gggtatgact tcgatatcaa acacaggaga ggcmgggaca 4620 atatcatccc tgatgccctc tctcgagcag tcgaggtgtt agaaatcacg gtaaccgact 4680 ggtatcaaga tttactcaat aaagttaagg caaatcccga agaaaaccta gattaccgga 4740 tcgaatctga caaattatac aaatttgtac cgacgaaaac cgatgttttc gattatcggt 4800 atgagtggaa actcgcccgc ggggcgccgc cggcgcgccg cgcgcgcccg cgccgcggcg 4860 cgcggccgcg ccccgccgcc gccgggcgcc ccggcgggcc ggcgcgcccg ccccccggcc 4920 gggcgcccgg ggcccccgcc gcggcggcgc ggccgccgcc ccgcctgcat acccgaggcg 4980 cttcgaaagg aaattctcga aaaagagcac gtatccgcat ttcacatagg gtacgagaag 5040 ttgttggaca agctgagaca aagatatttc tggcccaaca tggctcagtc ggccaagaaa 5100 tatgttcaaa gctgtcaaca gtgcaaagaa ttcaaaccgg cgaacgtttc acagcacccc 5160 gaaatgggaa aacagcgcct caccacaaaa ccttttcaga tcctctcaat ggactttatc 5220 cagtccttgc cgcgtagcaa aactggaaac acacatttgc ttgttctgat ggacatgttc 5280 tcaaaatgga cgatgttgtt cccagtcaaa aagatcgctt cggagatcgt tatccggctg 5340 gtggaacaac attggttccg tagatattcc gtaccagaga tccttattac ggataatgcc 5400 agttgcttct taagcaaaga atttaagacc ttcctcgacc ggtatcacgt tcaacactgg 5460 gctaatgcac gccaccacag ccaggcgaac ccagttgaga gactgaaccg aaatattaat 5520 gcatgtattc ggacatatgt caagtccaac caacggctct gggatactcg tatctctgag 5580 atcgagtaca cgatcaacaa tacgaggcac tcttctacag gctttacgcc atatagaata 5640 ctatttgggc atgagatcgt tgctgatggt gttgagcatc gtctcgatgc agatactgag 5700 cacctatctg aaaaagaaag aatggagcag aagttgaaga tagatcactc aatttttgac 5760 acggtttaca aaaacctaga aagagcccac gagaaaagca ctaggagcta caatttgcga 5820 tttcggaaag cagctcctgt ctacgaagta ggtcagaagg ttttccgaag aaacttcgca 5880 caatcctccg caggggaggc ttataatgcc aaattaggac cgatgtacat tccgtgcaca 5940 gttgtggccc gtcgtggaac tagttcctac gagctaatgg acagtaccgg gaaaaatgta 6000 ggaattttct cagcggctga tctcagacca gggaatgctg aggaaagaga ttcctaaatg 6060 tagatactag atcataataa attggtaggt agtttgtagg aataatttct ctaatttgtt 6120 atccctaatt gttcttcaag ttacatgtcg ttctacaatg tttcagagtt ccgtagttac 6180 gttatttagt twggttttgt tamatttttc gttttkaaaa tgagataagt cctagagtat 6240 gatgaagttc aaggtctgtc taattggtgt tcgattgtca tgtccagtgt agtagttatg 6300 ttcacaawta tttgtatcac tgattctcgg atcagtttcg tctgaatttt aataggtttt 6360 gctatccgac atcatagtac gcttaatcgg aggaaaactg atcaaggaaa attaccactg 6420 gaaaatggaa aaattcgata agcaccgaaa agttttgaac gttcattttt cctatgatat 6480 cagattgcga cacttcaacc attttccata caacctcact agatcaacac taataacatc 6540 atgaaacccg ccatctactc gggttgcttc agtaacagtc acaatgccaa tggagttctt 6600 gagttccact ttatcatgta cagctctagc caaatatggt aacattcggt cagaattctc 6660 cctctcaatt gagttgagat taaggaagag gaagaccgat taatgagaga taagggtaga 6720 gacgctaata aatggtatct tgtccaaaat ttgacaagat gattctattt agttaactta 6780 gcttcgaatt atttactctt acggattacg aagattgcaa aaggcaagcc caaccctggg 6840 agtacaactg tacagtgatt tactgaaata aattgaggaa aataagaata ttgcacatga 6900 taaaataaat tgataaatca atttattttt acggggagtg tgaggtag 6948 // ID Gypsy-84_AA-LTR repbase; DNA; INV; 258 BP. XX AC supercont1.15; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-84_AA_; KW Gypsy-84_AA-I; Gypsy-84_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-258 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.15; Positions 1929310 1929053. XX SQ Sequence 258 BP; 80 A; 44 C; 72 G; 62 T; 0 other; tgtagggtcc tatcctctgg attacatcga gctgccaatc tataacaggg aatgatgtct 60 gttagcgctg gcaacacagt aacgaggaga gagcttgagc cggggaggga cagggagtag 120 agtgtgtgac gatgaacggt gcgaacgtga ctaccgccgc agtgttgctt taatgcaagt 180 gaaataaacg attaagtcaa acggaaatta acagtgttta atttaatatt gtgaaaggct 240 taaactgcga attcaaca 258 // ID CR1-22_CQ repbase; DNA; INV; 4654 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-22_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4654 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 26-26 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 25..717 FT /product="CR1-22_CQ_1p" FT /translation="MICAKVDVPLREQLDLNQTQVFWXCGECAKLFSNSHF FT RRVVKDHDGGNSEIAESMKTMQNDIANLTSTITDLKEXVQTQSTSLSTPXW FT PNKRQRGSSTDTPVKVAIPTASRGTKAMNSVPVIAAEPNDLWYLWLSSFPP FT SVTDEDIHSMVKECLSVEDDDPIVVKMLVKKGVDVSTLSSVTFKVGIGRDY FT RESSMDAANWPEGLSFREFVDIDRRPAPTVLPSGFSRRRLE" FT CDS 789..4508 FT /product="CR1-22_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MEAPNHPVTVEFPASDQHSRPGPVFGCREGVFQATFT FT GEYHPLSDLSCPYSPSTSSVQPCHWFALHGTTSHIAALSSPGRMLYGRKEA FT PKRSDAVAFPASDQHSRPGPAFRCREGVFQPTFRGQYPITSSNDCFDLDIF FT SDSSQSPEPGRDPNQDSHLTIRPSPGRTVYGTMEAPDHSVTVEFPASGQHS FT RPGPVFGDREGVFQAAPTGEYRSNQVTSCSENSSAFSSRPSTSHQHDRAAR FT EPFTESQAPTPPSSTDTNASGTCQQIRFYYQNVRGLKSKVDEFFLNCLDCD FT YDVIMLTETWLDDSINAVQLFGNAFEVFSTHRNAGNSRRTVGGGVLIAVRK FT SLDSTVCAEAAAVNLEQIFVKIQLKTQKVFVGCFYLPSEKRYEHQLVEDHI FT TCIESVAGLADPSDYIVVAGDYNQSNLVWKALSGNRSFADPLHTHTRCRME FT KTTHEVLIDGVARNSLHQCNLVPNHQNRILDLIFISDGAERTAVSEANDSL FT VPLDRYHPALVFDLSAVLDAEFEDVHDPLDLNFRKTDFSELIRSLENTDWT FT SVLGSSDVDAAVDIFGSVVRELLARHTPCRRPPRKPAWSNASLRELKRQRA FT AALERYTQHRGPITQAVFKAASSRYKSYNKRCYKSYIRKTQENLRRNPSQF FT WSFAKSKYQENGLPSTMVLGETVAKTNAEKCTLFAQHFASVFQQPGPAPAS FT EQHLELVPRDVVDIDVFVASKDMLKRASLKLKSSNAPGPDGIPSVVLKRCF FT EQLSEPLLKIVNLSLQQSKFPTAWKKSVMFPVFKKGAKRLIGNYRGITSLC FT AGSKLLEIVVGEVISFNCRSFICPEQHGFMPGRSVTSNLMEFTNFCIENIA FT AGRQVDAVYTDLKAAFDRLDHTIVLQKLAKLGFSSRMCRWLESFLKERIIQ FT VKIGTSLSAEFSNGSGVAQGSNLGPLVFTVFFNDSNALFVDKCKLLYADDF FT KIFLAIDNLADCYGLQSRLDKFSNWCDVNRLELSVQKCTTISFHRKTRREV FT FSFDYSLGGHVLERLEVVKDLGVLLDEKLTFRQHLSAIVDKAYRQLGFLFR FT MAREFDDPLCLRSLYFALVRSHLEYSSVIWSPYHQNWIDRIERIQKKFVWY FT ACRKMPWNDPTRLPSYEARCNLLGIETLETRRTIAKAVFAAKILTAEIDCP FT HLLSLLNTNISSRNLRPRADQGLLARSFARTDYQRNSPVRSISDAFNDAYR FT FFEFHEPVSKFREKLRAEFKERTRIRLSRN" XX SQ Sequence 4654 BP; 1159 A; 1260 C; 1107 G; 1125 T; 3 other; gggtattgcg gagcaatttt ccacatgatt tgcgccaagg tggacgtccc gctgcgggaa 60 cagctggatt tgaaccagac gcaagtattc tggatktgtg gggaatgtgc caaattgttc 120 tccaacagtc acttccgccg cgtcgttaag gaccacgatg ggggcaactc ggagattgcc 180 gagtcgatga aaacaatgca gaacgacatc gcgaatctta cgtcgacgat taccgacctg 240 aaggagawag tccagaccca atcgacatca ttgtccacac ccwcgtggcc aaacaaacga 300 cagcgcggct ccagcacaga tactcccgtg aaggtcgcta ttccaaccgc ctctcgcgga 360 acgaaggcta tgaattcagt tccagtaatc gcagctgagc cgaacgacct gtggtacctg 420 tggctctcca gtttcccccc gagtgtcacg gatgaggaca tacactcgat ggtgaaagag 480 tgtctctccg tcgaagacga cgatcccatc gtcgtaaaga tgctcgtgaa gaagggtgtc 540 gacgtttcca cactctcgtc ggtgacgttc aaagtcggaa tcggccggga ctaccgtgaa 600 tcgtccatgg atgcagccaa ctggccggaa ggactgtctt tccgcgagtt cgtggacatc 660 gaccgtagac cagcgcccac cgtattgcca tcggggtttt cgaggcgccg cctggaataa 720 tctcgcaacg tcgtagtcag ccatcatttg ccgctctgcc atcaccggga cgcctgttag 780 aaagacctat ggaagccccc aatcaccccg tcacagtcga gttcccagcc agcgaccagc 840 acagccgtcc cgggcctgtg ttcgggtgtc gggagggggt cttccaggcc acattcacag 900 gcgagtacca tccgttatca gatttatcgt gcccttacag tccttcgact tccagcgtgc 960 agccatgcca ctggtttgcc ttacacggaa cgacttctca tattgctgcc ttgtcatcac 1020 cgggacgcat gttatatggt cgtaaggaag cccccaaacg ctctgacgca gtcgcgttcc 1080 cagccagcga ccagcacagc cgtcccggac ctgcgttcag atgtcgtgag ggggtcttcc 1140 aacctacttt cagaggccag tatccaatca ccagctcgaa tgactgtttc gaccttgaca 1200 ttttctcgga ttccagccaa agtcctgaac caggtcgtga cccaaaccaa gacagccacc 1260 tgacgatacg accatcaccg ggacgcacgg tatacggcac tatggaagcc cccgatcact 1320 ccgtcacagt cgagttccca gccagcggcc agcacagccg tcccggtcct gtgttcggag 1380 atcgagaggg ggtcttccaa gccgccccca caggcgagta ccgttcaaat caagtgactt 1440 cttgctctga aaattcttca gctttcagtt ctcggccgag tacaagtcac cagcatgaca 1500 gagctgcccg tgagcctttc acggaatcgc aggctcccac cccgccttcg tcaacagaca 1560 ccaatgctag tggaacgtgt cagcagatcc ggttctacta ccagaacgta cgaggattaa 1620 aaagtaaagt ggacgagttt ttcctgaatt gccttgattg cgactacgac gtcattatgc 1680 tgaccgaaac ctggctggac gactcaataa atgctgtgca gctgtttgga aatgcttttg 1740 aagtgttttc tactcatcgc aacgccggaa acagccgtcg taccgtgggt ggcggtgtac 1800 tcatagctgt ccggaagtcc ttggattcga ctgtatgcgc tgaagctgct gctgttaatc 1860 tagaacaaat cttcgtcaaa attcagctga agacgcagaa ggtttttgtc ggctgtttct 1920 atctaccttc tgaaaaacgc tacgaacatc aattggtaga ggatcacata acctgcattg 1980 aatccgttgc tggactcgct gatccaagcg attacattgt ggtcgccgga gattacaatc 2040 agtccaatct ggtttggaag gcgctatccg gcaatcgctc gttcgccgac cccctgcaca 2100 cgcacacgcg ctgtcggatg gaaaaaacaa cacatgaagt gctaatcgat ggtgtcgcta 2160 gaaacagctt gcatcaatgc aatctcgtac ccaatcatca gaacagaatc cttgacctta 2220 tcttcatcag tgatggcgct gaacgcactg cagtgtctga agcgaacgac tctctagttc 2280 ctctagatag gtatcaccct gcactggttt tcgatctctc cgctgttttg gacgcagaat 2340 tcgaagacgt tcacgacccc ctcgatctga acttccggaa gacagacttc agcgagctga 2400 ttcggagctt ggaaaacact gactggacaa gtgttttagg atcttcagat gttgatgcag 2460 cagttgatat cttcggatct gtcgtacgag aacttctcgc gaggcacacc ccctgccggc 2520 gacctcctcg aaaaccggct tggagcaacg cgagtttgcg cgaactcaaa aggcaacgag 2580 ctgctgccct cgagcgatac acccagcacc gcggaccgat tacacaagct gtgtttaaag 2640 ctgccagctc ccgatacaaa tcttacaaca agcgctgcta caaaagctac atccgaaaaa 2700 cacaagaaaa tcttcgccgc aacccgtcgc agttctggtc ctttgcaaaa tctaaatatc 2760 aagaaaatgg actgccctct actatggttt taggagagac tgtagccaaa acgaacgccg 2820 aaaagtgcac ccttttcgcc caacactttg catctgtttt tcaacaacct ggtccagcac 2880 ctgcctctga acagcatctg gaactggtac cgcgcgacgt agtggacatc gatgttttcg 2940 tcgcttccaa ggacatgtta aaacgtgcat cgctgaaact aaagtcgtcg aacgcgccag 3000 gacctgacgg catcccatcg gtagtgctca agcgctgttt cgaacaactg tcggagccgc 3060 ttttgaaaat tgtgaacctg tccttgcaac aatccaagtt tccaactgca tggaagaaat 3120 cagtgatgtt tccggtgttt aaaaagggtg caaaacgatt gattgggaat tacagaggaa 3180 tcacttctct ctgcgctgga tcgaagcttt tagaaattgt tgttggtgaa gtcatctctt 3240 tcaattgccg cagcttcatc tgtcccgagc aacacggatt tatgccagga agatctgtca 3300 cttcaaacct gatggaattt acaaacttct gcatcgagaa tattgcggcg ggtcgacaag 3360 tagatgcagt ttacactgat ttgaaggcag cttttgaccg cttggaccat accatcgtgc 3420 tccagaaact tgctaagctc ggattctcat ctcgcatgtg tcgctggctg gaatcattcc 3480 tcaaagaacg gataatccaa gtaaagattg gtacatcgct gtcagccgag ttttctaacg 3540 gatcgggtgt cgcacaagga agcaaccttg gcccacttgt gttcactgtg ttcttcaacg 3600 actccaacgc cctttttgtc gacaaatgca agctcctgta tgctgacgac ttcaaaattt 3660 tcttggccat cgacaacctc gctgattgct acggtcttca gtctcgcctc gacaagttct 3720 ctaactggtg tgacgtgaac cgcttggaac tcagcgtgca gaagtgcaca actatttcct 3780 ttcaccggaa aacaaggcgc gaggtgtttt cgtttgatta ctcgctaggt gggcacgtgc 3840 tcgagcggct cgaagtcgtg aaagaccttg gggtcctact tgacgaaaag cttaccttcc 3900 ggcagcatct atcagcaata gtagacaagg cttatagaca gctcgggttt ctgttcagaa 3960 tggcgcgtga attcgatgac ccgctctgcc ttcggtcgtt gtactttgcc ttggtcaggt 4020 cccatttgga atattcgtcc gtaatctggt ctccttatca tcaaaactgg attgaccgta 4080 tcgagagaat ccagaagaag ttcgtgtggt acgcctgcag aaaaatgccc tggaacgacc 4140 ctacacgact gccgagctac gaagcacgct gcaacctgct tggcatcgaa acactggaga 4200 cgcgacgaac cattgcgaaa gctgtcttcg ctgctaagat cctgaccgct gaaattgatt 4260 gcccacatct tttatcccta ctaaatacaa atatttcttc cagaaatctg cgtcctcgag 4320 ctgaccaggg gcttctcgca cgctcgttcg caaggacaga ctatcagcgc aactctcccg 4380 tgaggtccat ttctgacgct tttaacgacg cctacaggtt tttcgaattt cacgaacccg 4440 tgagcaagtt ccgcgaaaag ctgcgcgccg agttcaagga gaggacacga attcgtttaa 4500 gtagaaacta aaatgacaat tttttaacgc gttgtaaatt atgtttgtat tgtgctacct 4560 gtaattgtaa tttattttat ttgtaagagc atcgagcaat acccaaccat ttagaccatt 4620 taggtccgat gggttaataa acaattcaat tcaa 4654 // ID Gypsy-89_AA-LTR repbase; DNA; INV; 1309 BP. XX AC supercont1.279; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-89_AA_; KW Gypsy-89_AA-I; Gypsy-89_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1309 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.279; Positions 1501098 1499790. XX SQ Sequence 1309 BP; 347 A; 317 C; 354 G; 291 T; 0 other; tgttaccaac ttagtggaaa caatagtagt aaccatccct gggaagtaga ataatctgcc 60 agggctatga gttgtttttt tattgatagg agtttgagtt aattagacgc aatggaaaat 120 aactttaaat ttgttctaaa aactataccc ttagtcttca atacccctcg tctagaggat 180 tgtcggagta gcgcatctgc tattctaagc aggtggtatg actgattccg gatgaggttg 240 aggagagaag agagagagag agaggaaact gtagcgaaga gagtagagaa gggagtggaa 300 gaaaggcgat acccggtgaa ggattaattt tggctagggc ggcagaaagt accgaactac 360 cacgaaggtc ggatgggaaa agttaattgg tgcgtgtcga agtgagtccg atagaagaag 420 tgaccgactg gtgaagaaat agaccatccg agtgtagcca gctggtcctc cgaaagcgtt 480 gacgggaaga caatccacgt agtgaggccg gcgacgtggt caagtacgct gagacgacct 540 agggtgatat cccatcccag gaggctgttg gagccaacct tatccaatcc ccgaaggaag 600 ataaccccaa taggcagcgc aacccgcgct tcggtgtagg aaccacctga ttcgacccgt 660 ggaattcgtc cagtggagct actaacttcg gagaggctgg cgagccccgt gctggtgtcc 720 gacgagtccc atcagctatc gtcgaggcac acgccacgtg cgagtagtac cgaccgctat 780 ggaggtgaag gccgagggag gactgatggg ttagtccccg ctctcttgtg cccaacatcc 840 gatgtggtac taccccgtcc gcgagcacga gggacgtcat cccactacca cgtggtctac 900 caaccttacc cccacaagga tccacagaag ctagatcaga acccgaagac ccagaggagc 960 agcgtggtaa caagctacag atcctgcatt cgacgaggcc gcatatctgc cagccggtgt 1020 gtagcagtcg gcaagtaaca accataaatg ttccctcacc tctcgctctc tctcgctaag 1080 catgcacatc tatgggcccc aacgaacccc ccccccaaac aatgtaattc cttcttaata 1140 aataatgttt aagcttactt cttgtgtttg ttgccacttt acgaaagatt gcctactttc 1200 cctagtgctg ggctactctt ggttcacggg tgtggttgtg tgagttctcc ctgagtaagc 1260 tgacgccgac cctgagaggg tatagccgtg gggctagccg taattaaca 1309 // ID Helitron-1_DAna repbase; DNA; INV; 1839 BP. XX AC . XX DT 27-APR-2008 (Rel. 13.05, Created) DT 27-APR-2008 (Rel. 13.05, Last updated, Version 1) XX DE Non-autonomous family of Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; KW Interspersed repeat; Helitron-1_DAna. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-1839 RA Takahara S. and Sato H.; RT "Helitrons in fruit flies."; RL Repbase Reports 8(5), 537-537 (2008). XX DR [1] (Consensus) XX CC This is a consensus sequence of a family of non-autonomous CC Helitron CC transposons in the Drosophila ananassae genome. XX SQ Sequence 1839 BP; 617 A; 320 C; 357 G; 545 T; 0 other; tctatatata taaaagaaca acgtgtttta tgttacacat cttataaatc aagaacggaa 60 gaacagattt gagtgaaaat tggtagggag gtagcttaga gccaggagaa ggacatagga 120 tactttttgt cccgttcgac agcgttcctc cctttcctcc ctggcccata cttttttatt 180 taggcagaca gctaagaaga aaatgtcaaa tcagttacaa acgaagtcaa attcatagtc 240 aggctctcaa atttaacaac gaaccaaaaa gattgtgttg cgctgaaatc aatattaaaa 300 ctatttctat taaataaata attaacaagt ggattgaagt gaagtgaagt ttaattaata 360 atgccgcgac caagacggtc gaatctttcc cgacgaagcc gtaatgcaag aacaatacaa 420 aatattgcat atgaaaggac tgaagaagaa cgagaaattg cacgtgaaca gcgccgcaat 480 agtatggctc gacttcgtgc ttctcaaata gatttcaatt tgcgacgcag atatttagct 540 gatttgaatc gagctgcgtt tcgatacgat tgcagcaatg attacagctt gcatcctagc 600 gtttgcattg ggcaaatgga cgttgtttgc gagtattgtg gtgcattaaa gttttccgga 660 gaaacgcctg gattatgctg cgttaatggt aaagtgaaat tgccagtgtt gactccgcca 720 caagagccat tgtattcatt gctttgcggc gaaacacaag aatcacgcca ctttcttgca 780 aatactcgaa aatacaatag ttgtttccaa atgacgtcat ttggggcaga cattatcgaa 840 gaaggaggtt ttaatccgac atttaaggta tttatttttg tacttacata gaatgtagtt 900 aaagaatctg gtttttattt tgtagataga atcgagcggt ttaatatcat ttccaaataa 960 tttctgtaat tttgtctcat caaaagacga acttatcaac aatgtatttc caaatattat 1020 ttctaactac aaaaataatg aatggttgag tgagcgagca attttagcgg ctaagaataa 1080 agatgtagat gacctgaact acataattca aaataagatc attggaacaa tgcattcatt 1140 caaatctatt gactgcgtca caaatgaaga tgaagccacc aactatccaa ttgaattttt 1200 aaactctttg gacgtgcctg gcttaccacc gcacaattta cgcctaaagg ttggctccgt 1260 agtaatcatg cttcgaaaca taaaccaacg aaaactgtgc aacggtacgc gtttggtggt 1320 tagtaaattg atgaacaatg taatttacgc tacgataatg ataggaaaat tcaaaggtga 1380 ggaagttctc attccgagga tcccgatgat cccaaccgat atgccgtttg aatttaaaag 1440 acttcaattt ccgatacgtc ttgcatttgc catgacaatc aacaaatcac aaggccaatc 1500 cttaaaagtt tgtggtttaa atctagaaca ttcatgtttt tcccatggtc aattatacgt 1560 ggcatgttct cgggtcggaa gaccatctgc gttgtttgtt tttgcgcctg ataataaaac 1620 aaaaaatgtc gtgtatcaca aggtgcttaa gtgaagagca acctatgtac taaaatacag 1680 aaataataat gaatgattaa ggaggagaaa caaagaatat tgtataccca caagtattac 1740 agaatttaat taatttgaaa attattcttt tttgtttgac ttatttttta tgcgtgcaac 1800 aacaatatgc cacagcgaaa cgtggcaggg tactgctag 1839 // ID CR1-57_HM repbase; DNA; INV; 4480 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-57_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4480 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1885-1885 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 60..797 FT /product="CR1-57_HM_1p" FT /translation="MEITMKNIEILISNKLDEHRKSILKETEKLLKEQEKT FT FTLIVSGNLKIISDKLNVMEKEILENKSKIKNIEKDLNDVKDSLNFQEEKT FT LEKLKKIKKYFDNEINVLYTRTTDLENRSRRNNLRIDGLVEKPGESWSECE FT IAVKEIFNKNLKISSEVVIERAHRIGPINNKKPRTIVLKLLNFQDKNKILN FT AVKNLKGTGVYINEDFSQETIEQRRKLWEEVKRLRSEGKYAILKYNKIFSR FT DFKK*" FT CDS join(868..2037,1961..3958) FT /product="CR1-57_HM_2p" FT /translation="MNNETIDFETLHFNVFETANNVILNDLNDADAQIFRG FT INFGSQYFETETFKTELTTLRNKFTALHINIRSINNNIDKLKQFLTECNYL FT FSMICLTETWCSDESSRINSNLEIPNYKLISSERKTNKRGGGIINYIRTDQ FT ATKYRDDLSTSDADSEVFTIEITTNNKSKNILVSTCYRPPDGDITKFSNHL FT KQIFYKNNNEQKKLFCIGDINIDCLQYDKHAKTKFFFDEMLQHYIFPIINK FT PTRVTQSSVTSIDNILTNTVLDVTLKAGIIKTDISDHFPIFFSLSHDTKTT FT NDCKIEIHKRKINKYAIQQFKESLSAINWDKVYQECNLGRTNSAYTNFENI FT FLKNYNKYFPIKVMLVKEKHLKCPWITKGIKKSSKKKTKTLYQILKKNVRG FT SPKELKNLQKKKQKLYIKYLKNRNEANLNAYKQYKNLFEKIKKISKKNYYS FT NKIKNSKGDIKKTWDVIKEIIGNKNCKPNSLPTKVVINNEEYVSSDVISEK FT FNNFFVNIGPNLATKINCPNNSFETYLTNNNNELAFSELKIEELEVAIKSL FT KINKSPGIDDICSNIVIDVFLEIRKPIFEIFKSSIKTGTVPDKLKTAKIIP FT IFKTGETSLLNNYRPISILPTFSKILERIVYNRLYEYLLQNNLLNKKQFGF FT QTKHSTEHAILDLVNSMSNSFDKKQFVLGTFIDLSKAFDTVNHNILLKKME FT RYGVKNISLDWFKSYLSSRKQCVVTDHNKYSDLLEIKCGVPQGSILGPLLF FT LIYINDLPKALKKLDVIMFADDTNLFYSSPSISDLYESMNTDLENLNIWFK FT VNKLSLNTEKTKYILFHSNRLKNFIPDTLPSLKIDNINIKRTEITKFLGII FT IDQNINWKAHINTINTKISKSIGVLYKAKPMLSQDNLKILFFSYIQSYLTY FT ANIAWGSTHKTKLNSLYIHQKHASRLIYNKDKFTHADPLLKNLNALNIYQI FT NIYQNILFMLKYKLGLVPKYFTDSFFHINANKYITRGTGNFTLPLKKTKFS FT RFSLIYRGPYLHNKIILQNMELNKMDNLIALKKKLKHIIINLTNFIEMY*" XX SQ Sequence 4480 BP; 1863 A; 700 C; 573 G; 1343 T; 1 other; tttttttttt tcggcataat ctacgggagt aagacgtgtt ttaagtaaat aaaaaagcaa 60 tggaaatcac aatgaaaaac attgaaatac taatatcaaa taaattagat gaacatagga 120 aaagtatttt aaaggaaaca gaaaagctac tcaaggaaca agagaaaact tttactttaa 180 ttgtaagtgg caaccttaaa atcatatcag ataaacttaa cgtgatggag aaagaaattt 240 tagaaaacaa atcaaaaatt aaaaatatag aaaaggatct taacgacgtt aaagatagtc 300 taaatttcca agaagaaaaa actttagaaa aacttaaaaa aattaaaaaa tacttcgaca 360 atgaaatcaa cgtattgtac acaagaacaa cagatttgga aaatcgatca cgtcgaaata 420 atctcagaat agatggacta gtagaaaaac caggagaaag ttggagcgag tgtgaaattg 480 cagtgaaaga aatctttaat aaaaatctta aaatatcaag cgaagtggtt attgaaagag 540 cccatcgaat tggcccaatt aataataaaa aacctagaac aatagtatta aaactcttaa 600 atttccagga taaaaacaaa atccttaatg ctgtaaaaaa tcttaaagga actggtgtgt 660 atattaacga ggacttttcg caagaaacga ttgaacaacg aaggaaactt tgggaagagg 720 tgaaacgact tcgaagtgaa ggtaaatacg ccattttaaa atataataaa atatttagtc 780 gagattttaa aaagtagcga tgcgtatttt tcttaatttt tctttttttt tctttcaaag 840 cgcttttaat ttctcatttt aagcataatg aataacgaaa caatagattt tgaaacgctg 900 cactttaatg tcttcgaaac ggcaaataat gttatactca acgatttgaa tgacgcagat 960 gcgcagattt ttcgcggaat aaattttggt tctcaatatt ttgaaacaga aacttttaaa 1020 actgaactta caactttaag gaataaattc actgcattac acataaatat aagaagcata 1080 aataacaaca ttgataagct taagcaattt ctaaccgaat gcaattattt attcagtatg 1140 atttgtttaa ccgagacatg gtgttctgac gaatcgtcca gaataaattc aaacttagaa 1200 attcctaatt acaaattaat atcttctgaa agaaaaacta acaaaagggg agggggaatt 1260 attaactata ttcgaactga tcaagccacc aaatatagag atgacctttc tacctcagat 1320 gccgatagtg aggtctttac aattgagata acaacaaata acaaatcaaa aaacatatta 1380 gtctccacat gttatagacc acctgatggt gatataacaa aattttctaa tcatttaaaa 1440 caaatttttt ataaaaataa caatgaacaa aaaaaactat tttgtattgg agacattaac 1500 attgattgct tacaatacga taaacatgcc aaaactaaat ttttttttga cgaaatgctc 1560 caacattaca tcttcccaat tattaataag ccgactcggg taactcaatc atcagtaacg 1620 tcaatagaca atatattaac aaatacagtt cttgatgtta ctttaaaagc aggtattata 1680 aaaacagata tatcggatca ttttccgatt ttcttttctc tgtcacacga tacaaaaact 1740 actaacgact gtaaaattga aattcataaa aggaaaatta acaaatatgc tattcaacaa 1800 tttaaagaat cactatcggc aataaactgg gataaagtat accaagaatg taaccttgga 1860 cgcaccaact ctgcttacac taattttgaa aacattttct taaaaaacta taataaatat 1920 ttcccaataa aagtaatgct agtaaaagaa aaacatctaa aatgtccgtg gatcaccaaa 1980 ggaattaaaa aatcttcaaa aaaaaaaaca aaaactttat atcaaatact taaaaaatag 2040 aaatgaagca aacctaaatg cttacaaaca atataaaaat ttgtttgaaa aaatcaaaaa 2100 gatttcaaaa aagaattatt actcaaataa aataaaaaac tccaaaggtg atattaaaaa 2160 aacgtgggac gtaattaaag aaataattgg gaacaaaaat tgcaaaccaa atagtttacc 2220 tactaaagtt gttataaata atgaagagta tgtcagtagt gacgtaatct cagaaaagtt 2280 taacaatttt ttcgttaaca taggccctaa cctggctaca aaaattaatt gtccaaacaa 2340 ctcatttgag acatacttaa ctaataacaa caacgaatta gcatttagcg aactaaaaat 2400 tgaagaactt gaagttgcga taaaatctct taaaataaat aaatctccag gtatagatga 2460 tatttgtagc aatatcgtta tcgatgtctt tttggagata cgcaaaccga tcttcgaaat 2520 atttaaatca tcaattaaaa caggtactgt accagataaa ttgaaaacag ctaaaattat 2580 accaattttt aaaacaggcg aaacatcttt attaaataat tacagaccga tctctatact 2640 gcctactttc tctaaaatac ttgaaagaat agtttacaat agattatacg aatatctcct 2700 tcaaaataac ctcttaaaca aaaaacaatt cgggtttcaa acaaaacact caaccgaaca 2760 tgccatttta gatcttgtaa atagcatgag taattctttc gataaaaagc aatttgtatt 2820 agggactttt atagatttat ctaaagcgtt cgacacagtt aatcacaata tcttacttaa 2880 aaaaatggaa agatatggtg ttaaaaatat atcccttgac tggttcaaaa gttacctgag 2940 tagcagaaaa caatgcgttg ttacagatca taataaatat tcagacctac tagaaataaa 3000 atgcggagtt ccccaaggtt ccattctggg tcccctttta tttctaatat atatcaacga 3060 tcttccaaaa gccctaaaaa aacttgatgt aataatgttc gctgatgata caaatttatt 3120 ttactcatcg ccgtctataa gcgaccttta cgaatctatg aacactgacc tcgaaaacct 3180 caacatctgg tttaaagtta ataaattatc tttgaataca gaaaaaacaa aatatatctt 3240 gtttcactcc aaccgtctta aaaactttat accagacact ctaccttcac taaaaattga 3300 taacataaac atcaaaagaa ctgaaataac taagtttctt ggaataatta ttgaccagaa 3360 tattaattgg aaagcccata ttaatacaat aaacacaaaa atatcaaaaa gtatcggcgt 3420 actctacaaa gccaaaccta tgctttccca agacaactta aaaattctct tcttttctta 3480 tattcaaagt tacctaacat acgctaatat tgcatgggga agtacccata aaactaaatt 3540 aaattcactc tacatacatc agaaacatgc atcaagatta atttataata aagataaatt 3600 cactcatgcc gaccctttac ttaaaaattt gaatgcatta aatatctatc aaataaatat 3660 ttaccaaaat atcttattca tgctaaaata caaactcgga cttgttccga aatactttac 3720 tgatagcttt tttcatatca acgcaaacaa atatatcaca cgaggaacag gcaacttcac 3780 cytgccttta aaaaaaacaa aattctcgcg attttcttta atataccgtg gcccatactt 3840 acataacaaa ataatacttc aaaatatgga acttaataaa atggataacc ttattgctct 3900 gaaaaaaaaa ctaaaacata tcataattaa cttaaccaac ttcatcgaaa tgtattaaac 3960 aatttgtgaa aaatagtaat aataaaaagc cctactactt ctactatata gtagatacca 4020 aaatacttat gtgtatttgt tgaactagtt cattaaagtt tattttatgt tttaaaggtt 4080 ctcgatgaaa agactatttt agtcttctgc gagtttcctt acaacaaaga aaaatatttt 4140 ttaagaataa ttttatatat atatatatat tcaacaacgt ccacaagcaa gtccctagta 4200 gccctgtggt acgacggact tgcatatatt ttttaaatgc atgggttaat agccagcact 4260 ttatattata aacaacgaat agtttacctt attttgtaca ctgtacaacg cgcatgcgct 4320 aaacagaatt ttgaatttct tttgtcttgt aaaaatttta atggtacaaa aattgtttaa 4380 ttgtacgttt tgtacttata tgtaaaaata tgtaattttg ttttattgaa atatatacgg 4440 atacttgtgt tgtaaaaaaa aaaaaaaaaa aaaaaaaaaa 4480 // ID PrD37E repbase; DNA; INV; 1329 BP. XX AC DQ138288; XX DT 18-AUG-2005 (Rel. 10.08, Created) DT 26-APR-2010 (Rel. 15.05, Last updated, Version 2) XX DE DNA transposon from Philodina roseola. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Interspersed repeat; PrD37E. XX NM PrD37E. XX OS Philodina roseola OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Philodinida; OC Philodinidae; Philodina. XX RN [1] RP 1-1329 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR Genbank; DQ138288; Positions 3869 5197. XX CC This element belongs to the ITM D,D(37)D family, which occupies CC an intermediate position between the Tc and mariner families. See CC also PrD37D. XX FH Key Location/Qualifiers FT CDS 326..1207 FT /product="PrD37E_1p" FT /translation="MKDLPRSGRPFKLSKKKIDCLVESVNNRCGVSQRKLA FT RRFGVHQSTICRNLRRRTSVVIRKRKKAPKMNSTEQETRAQENCGKLCRML FT VNGCDIIMDDEKYFKLSGDNVLGNRYFYSNDPSTAPPDVKFQKKAKFERKV FT MVWIAISSRGISSVYVHKSKQAVRQETYLAECIDKRLLPFIEKHHSDGKYL FT FWPDLATSHYSNIVQQRLRDNHIPYVLRIDNPPNVPQARPIEQVWSLLEQK FT IYENNWEAKDIDCLARRIKQKVKEFDQDMLRRMICSVRKKLLSMWKKGLYS FT VI" XX SQ Sequence 1329 BP; 434 A; 259 C; 264 G; 372 T; 0 other; actatcgatg caaaagtctt tggaaaaatg tctaggtcga tttctcgctg acggacaccg 60 tgggaatgca gtcttttttt tcacaaaaag atagctcgtc tggttctctt tcccacgggc 120 tggcattcgt tttttcctcc tctttcaaaa ttcacatgga caccccgaaa aaaaaagaaa 180 aaacttgccc caaggtaatt tgcaacttcc tttccggaag gtggttaaca aatcagtgaa 240 aacgacagtc aattttttta aaaaacaaaa agtacctcaa agtaccattt attttacatt 300 gaaaaagtat ttacaacacg gaacaatgaa agatttgcct cgaagtggcc ggccattcaa 360 attatccaag aagaaaatcg attgtcttgt tgaatccgtg aacaatcgat gtggcgtaag 420 tcaacgaaaa ttagcacgac gattcggagt ccatcaatcg acaatctgtc gaaatctgcg 480 acgacgaacg tcggtcgtca tccgtaaacg caaaaaggct ccgaaaatga atagtactga 540 acaagaaact cgagcacagg aaaactgtgg caaactttgt cgaatgttgg ttaatggttg 600 cgatatcatc atggatgatg aaaagtattt caaattatcg ggagataatg ttttaggaaa 660 tcgatacttc tactcaaatg atccatctac agctcctcca gacgttaaat tccaaaagaa 720 agcgaaattc gaaagaaagg tcatggtctg gattgccata tcttccagag gcatttccag 780 cgtttacgtt cacaagagta aacaagccgt tcgtcaagaa acatacctgg cagagtgtat 840 cgacaaacga ttgttgccat tcatcgagaa gcaccattct gatggaaaat atttgttttg 900 gcccgattta gcgacgtcgc attactcgaa tattgttcaa caacgtttac gagacaatca 960 tataccctat gtattacgta tcgacaatcc accaaatgtc cctcaagctc gtcctattga 1020 gcaagtctgg tctttactcg aacagaaaat ctacgagaat aattgggaag cgaaagacat 1080 tgattgtttg gctagacgaa tcaagcaaaa ggtgaaagaa tttgatcaag acatgttgcg 1140 gcgtatgatt tgtagtgtac gaaagaaact tttatctatg tggaaaaaag gactatattc 1200 ggttatctga ggatagtttt ttgtagcatt ttcgatgtcc tttcttatga aaaaaattca 1260 actttttacg tcgttgaatg aacgagcagc gacgttcgac atttttccaa agacttttgc 1320 atcgatagt 1329 // ID BEL-48_AA-LTR repbase; DNA; INV; 288 BP. XX AC supercont1.380; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-48_AA_; KW BEL-48_AA-I; BEL-48_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-288 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.380; Positions 372219 372506. XX SQ Sequence 288 BP; 69 A; 70 C; 65 G; 84 T; 0 other; tgtaaagtta cttgctaata aagttgcatg ccaaaagact caagtgcagt gtctaaatct 60 aagagaagaa acttacctgt gttcgctaga ttgtgatgtg tgatagctgt tttttctgct 120 gctgctgttg agtttgtttt gcctcgccgt ctgttaactg tgccgctgtt ggattgccac 180 ccaagctacc acccactcga ttgctgccgg agaacagttt gcgtcgccag tggttcctca 240 tcgccctcaa tttaccacca gaagacacaa cgtaggtaag cgccaaca 288 // ID LmeSINE1b repbase; DNA; INV; 415 BP. XX AC . XX DT 07-JUL-2006 (Rel. 11.06, Created) DT 02-AUG-2006 (Rel. 11.06, Last updated, Version 2) XX DE Coelacanth DeuSINE element - consensus. XX KW SINE3/5S; SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; SINE3; DeuSINE; conserved; LmeSINE1b; CNE. XX NM LmeSINE1b. XX OS Metazoa OC Eukaryota. XX RN [1] RP 1-415 RA Nishihara H., Smit A.F. and Okada N.; RT "Functional noncoding sequences derived from SINEs in the RT mammalian genome."; RL Genome Res 16(7), 864-874 (2006). XX DR [1] (Consensus) XX SQ Sequence 415 BP; 96 A; 98 C; 106 G; 112 T; 3 other; ggtggctcag taggtakcac tgttgcctct gagtcagaaa gtcgaaggtt cgaatcccac 60 ctgaggaacc tggacatatt ccatctrcca acgccccagc atggtgtttg ggggcgctgt 120 gctgctgaag gtgccttctt ttggatgaga cgttaaaccg aggtcctgtc tactttgtgg 180 acattaaaga tcccatggca cttttcgaaa gagtaggggt gttaacccca gtgtcctggc 240 cacaattccc ccadtaatga ctggaagctg ttgtgtggtg tgctggcact aaaatggttg 300 cctcgttcca ccccagaggt ggctgcactt cagtggattg cacaatatct ctgtaaagcg 360 ctttgagatc cttcgggatg aaaggcgcta tataaatcca aaattcattc attca 415 // ID BEL-603_AA-I repbase; DNA; INV; 6978 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-603_AA_; KW BEL-603_AA-LTR; Pao_Bel_Ele73; BEL-603_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6978 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [6018-6587] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 5538..6977 FT /product="BEL-603_AA-I_3p" FT /translation="MPRTGILNSEELLEAERTIICLIQRAEFPEEHAVLNY FT NRQLKNGDLKAIKKISKIYKLCPFIGADGVIRKDGRIGAAPWLTLETKFPA FT ILPKNHYVTALIIDEIHRKFGHSNNETVVNEVRQRYHVSELRVVVRRVAQG FT CQRCKVRKTVPRPPRMSPLPEVRLTPYIKPFTYTGLDYFGPLTVRIGRSQT FT KRWVALFTCLTTRAVHLELAFTLSTESCKLAIRRFVARRGPPAEIFSDQGT FT NFQGARKELKEQIQRMNSELSTTFTNTTTKWKLNPPYAPHMGGIWERLVRS FT VKSGLAAMEISRSADEETLLTALAEVESMVNTRPLTYLPLDAAEDHALTPN FT HFLLLSSNGVCQPAVIPQDEKSALRSNWNHVRVMLDRFWKRWIKEYLPAIT FT RQSKWFGEQKPLKVGDLVISVNENVRNSWDRGVVVNVYPGKDGRIRRADVK FT TTAGIFLRPVTQLAVLDVQDEGGIAEGTTSNTGRS" FT CDS join(1003..2151,2155..5592) FT /product="BEL-603_AA-I_1p" FT /translation="MAINSSVAMASGNTATPSHNQQVFQSIQPVESQQVTV FT ISQSDCVSSLAPTREYDLLFSVIAEFFLSSPLSVPPQIKQPVSSLSMLWCY FT RNPIPTPSASVQEASTIPSTTCDDNNVFVISPPTCTSASYGQLPPVPCSSS FT SQQQSVPQTTSVSQHYPQPSTARHGSAYPAVMSSRSVQHVQHGSLPWNASL FT PPPVHQSTPTSMQSGRHPLIGTVGNQSNPYTSQHQAAYGCHRQPLGANDAA FT YPVVSQYIPTATAVSSQPWFNDMRNMYQNQWVSAPNQNLLTAGSQMFPNIM FT GNISSQGLQGHFPNQINQFTNAMPSNIMPASSVGLNISNFPTQSGSAPCLS FT TGAELGPNAQQLAARHVVPKDLPPFGGNPAEWPLFWSSYESTQMCGYSSSE FT NLLRLQRCLKGDARKAVNSFLLHPSNVGEIMSTLRMLYGRPDAIIGSLLND FT VRNTPAPKPEKLETLVNFGLVVRNLCAHLVSAGQQMHLTNPILLQELVDKL FT PANIKLGWALHKQTVVEADLRSFSDYMSLMINAASSVTSGIGECSKSERQK FT GKAFVNSVRAEANAESSRENNNKKSSGGRNTNNSQALTSPRIRPCAVCQTD FT GHKPKDCPSFKAKNLADRWKTVQEAHLCKRCLYPHGKWPCKAPPCGAEGCQ FT DNHHKFLHPGNPQNSSGFGNVASSPSNTGVVTVHQHSHQNILFRIIPVSLH FT ANGKTAKTFAFLDGGSDSTLLENSIAKQLGLSGPTVPLCMQWTNGVQRTEE FT ESERVELEISGNSSKRYKLSGVHTVSSLDLPRQTVDGNMLQAKFPHLKGLP FT ITSYDAAVPGILIGLDNTKLKTTLVIREGNGDEPVAAKTRLGWTVFGKAGT FT SNVATSNRVLHVCHQSKDTDLHELVKSFFTIEGAGTDSNTLVESKLEKRAR FT EIMEATTVRLDSGKFQTGLLWRYDKIEFPDSRPMAERRLQCLETRLSKNPT FT LYEKVRQQMVEYVTKGYAHKATKQELEQASPGQTWYLPLGVVSNPKKPEKV FT RIVWDAAASVRGVSLNSVLLKGPDLLQSLPTVLCRFRQREVAINADIREMF FT HQVYIQPEDRQAQRFLWRNNPTERMEVFVMDVAIFGATCSPCSAQFAKNLN FT AKEHAAEYPSAATAIVENHYVDDYLDSVDTIEQAIQLATEVKLVHQRGGFD FT LRHWLSNRKEVLEKLGEGYSDESKNFVMGKGCSSERILGMIWLPQEDVFSF FT AIEYRKDLGRLLNGDVYPTKRELLSLVMSIFDPLGLVANYVIHGKVLIQDV FT WRTNTGWDEKIPDEIFPSWQRWVVSLRELAQLRIDRCYFPGYTPAALETLE FT LHVFVDASLAAYAAVAYFRIDDRGTIRCSLVSSKTKVAPLKQLSVPRLELQ FT AAVLGTRLMKSIIGGHTLPIRRKVFWSDSNTVLAWLRADQRRFHQFVAFRI FT GEIQEQTDISEWKKVPSRWNVADDATKWGFDSKALNSSRWVQGPEFLHKDE FT SEWPEQSNDLITEEELRATVATHRRAEYDEIIQFGRFLKGDLASIKCPEPE FT FSTAKSYWRQKGR" XX SQ Sequence 6978 BP; 1985 A; 1636 C; 1761 G; 1593 T; 3 other; agaactttaa gattaactat cagttaatca tcaaatccag tcatgtcgaa ggctggttca 60 catggaaaag ctaaccctag atctaatgca ggtgccgaca caacactgca ccaatgcatg 120 atctgcagcc tcccgaacga agcagatcgg aagatggtcc agtgtgacgt atgtgaccac 180 tggttccatt tcatgtgtgc gggtgtcaat gatagcatcg aagatgagaa ccgaagttat 240 gtatgcgtgg cttgtgctct acccgctccg tctatatcgt cgacgaccag cagtgttcga 300 gaagcgagga tgcgactcga gatgcaacgg ttggaagagg agaagcgact tcaggagcgg 360 ttacgagagg aacgagagaa gactcttcag gaaattgtag gtatcgggta cccgatactt 420 caagaaatga cgatacgttt ggagagagaa cgcggagaga aagcaattgc ggcgatgctg 480 actttggaga aggaacacat tcagcagaag tatagtttac ttcacacgca gctgaacgat 540 gacggagatg taggtagcgt gcgtagccga gcgaaccgcg gcacaacgag taaagttcaa 600 gattggatga accagaatac agccgtgacc ctaagcacca ggcgccgagt tcacaaattt 660 gacgtttgag cggtgtcaaa ttcactggtt tccatggtga cataaataac acggcaccgc 720 taaaatgtca aatagtgaac ccgtgcaaag gtccatactg ggagcaaccc cactgggctt 780 acgttaaacg ccagatagtg aactcgtgca tcggtccata agtccatgca cgtgttcact 840 aatttgacgt ttgagcggtg ccgaattcac tggttgccat tatcacataa ataacacggc 900 accgctaaaa tgtcaaatag tgaacccgtg caaaggtcca tactgggagc aaccccactg 960 ggcttacgtt acaaccagct gtttcagtta attcgtccct aaatggcaat caactcgtct 1020 gttgccatgg catctggcaa cacagcaaca ccatcccata atcagcaagt tttccagtct 1080 atccagccag tagagtcaca acaagttaca gtgatttctc aatcagactg tgtatcgtcg 1140 ttagcaccca cccgcgaata cgatcttctg ttttctgtga ttgctgagtt ttttcttagt 1200 tcgcctctat cggtaccacc acagataaag caaccagtga gttccctgtc gatgctttgg 1260 tgctatagga acccaatacc tacaccgtca gcgagtgtac aggaggcctc aactatacca 1320 tccacgacat gtgatgacaa caatgttttt gttatctcgc ctccaacatg tacatcagca 1380 tcgtacggac aactgccacc ggttccatgt tcatcgagct cgcaacaaca atctgtccct 1440 cagactacat ccgtttcaca acactatccg cagccatcaa cagcgagaca tggatcagct 1500 tatccagcag tcatgtcgtc gagatccgta cagcatgttc agcatggttc attaccgtgg 1560 aatgcgtcgt taccaccacc ggtacatcag tcaacaccaa caagcatgca gtctgggagg 1620 catccgctga tcgggactgt cggaaatcaa tccaacccgt atacatctca acaccaggca 1680 gcatatgggt gccatagaca gcccctaggg gctaatgatg cagcctaccc agtagtaagt 1740 cagtatatcc ctacggcgac ggcggtaagt agccaaccat ggttcaatga catgcgaaat 1800 atgtaccaaa atcaatgggt ttccgctccg aatcaaaatt tgttaacggc tgggtcgcaa 1860 atgtttccaa atataatggg taatattagt agtcagggac ttcaagggca ctttccaaac 1920 caaatcaatc aattcactaa cgcaatgcca agtaatataa tgccagctag tagtgtgggt 1980 ttgaatatct ccaatttccc aacacaaagt gggtcagctc cttgtctgtc gactggagca 2040 gagcttgggc cgaatgccca acaacttgcg gcgcgtcatg ttgtgccgaa agatcttcca 2100 ccattcggcg gaaatcctgc tgaatggcca ttgttttgga gcagctatga aamttcaacg 2160 caaatgtgtg gatattcttc gtcagaaaat cttctgagac tgcagcgatg tctaaaggga 2220 gatgctagga aggcggtcaa cagtttcctg ctgcatccgt ccaatgttgg agagatcatg 2280 tcaacccttc gtatgctcta tggccgtcca gacgccatca tcggctctct gttgaatgac 2340 gtcagaaaca ccccagcacc gaagccggag aaactcgaaa ctctggtgaa ttttggactt 2400 gtggtgcgca acctctgtgc gcatctggta tccgctggac agcaaatgca cttgaccaat 2460 cccattctcc tccaagaact ggtagacaaa ctaccagcca atatcaagct tgggtgggca 2520 ttgcataaac aaacagtagt ggaagcagat cttcggagtt tttcggacta catgagtctc 2580 atgatcaacg ctgcaagtag tgttactagc ggaatcggag agtgctccaa atcggagcgt 2640 cagaaaggaa aggcgttcgt caactctgtt agagccgagg ccaatgcgga aagcagtcgc 2700 gaaaataaca ataagaagag cagcggtggt aggaatacga ataattcgca agcattaaca 2760 agtccaagga ttcgtccgtg cgcggtgtgt caaacggacg gacataaacc aaaagattgt 2820 ccttcattca aagcgaaaaa tctcgccgac cgttggaaga cagtccaaga agcacatttg 2880 tgcaagcgat gtttgtaccc tcatggaaag tggccatgca aggctcctcc atgcggagct 2940 gagggttgtc aagacaatca ccataaattc cttcacccag ggaatcccca aaactcatct 3000 ggcttcggaa acgtcgcttc gtcaccgtcc aatactggag ttgtgacggt tcatcagcac 3060 tcacaccaaa acatattgtt ccggatcatt cccgtttcac ttcatgcgaa cggcaaaacg 3120 gcgaaaactt tcgccttctt ggacggagga tcggattcga ctctgctaga gaattcaatt 3180 gctaaacagt tgggtctgtc tgggccaacg gttccactct gtatgcagtg gacaaatgga 3240 gtacagcgaa ctgaagaaga atccgagcgt gttgagttgg aaatctccgg aaacagttca 3300 aaacgctaca agttgagtgg cgtccatacg gtgtcgagct tagatctgcc acggcaaacg 3360 gttgatggca acatgcttca agccaagttc ccgcatctca aaggattgcc catcacaagc 3420 tacgatgcag cagtgccggg aatactcatc ggactcgaca acactaagct gaagacgacg 3480 ttggtaatcc gggaaggaaa cggcgacgaa ccagtcgcag caaaaacccg actaggctgg 3540 acagtattcg gaaaagctgg tacgtctaat gtagcaacgt ccaaccgagt tctgcacgtg 3600 tgccaccaat caaaagatac ggatcttcac gaattggtga aaagcttctt cacgatagaa 3660 ggagcgggaa ccgattcgaa cacattagtt gaatctaaac tagaaaagcg agctcgagag 3720 attatggagg ctactacggt acgtttggac tcaggaaagt tccagacagg tcttctttgg 3780 agatatgaca aaatagagtt tcctgacagc aggcctatgg cagagagaag gctgcagtgt 3840 ctcgagactc gactgtctaa gaatccaacg ctctatgaaa aagtgcgcca acaaatggtc 3900 gagtatgtca ccaaagggta tgcccataaa gccacaaagc aggagctgga gcaagcaagt 3960 cctggtcaaa cctggtattt accactgggc gtggtgagca acccaaaaaa gccggaaaaa 4020 gtgcgtattg tgtgggacgc cgcagcatcg gttagaggag tctctttgaa ttcggttcta 4080 ctcaaaggac cggatcttct tcaatctctt ccaacagtat tatgccggtt tagacaaagg 4140 gaggtggcca tcaacgctga tattagggag atgtttcacc aagtgtacat acagcccgaa 4200 gaccgccaag cccaaagatt tctttggcga aacaacccga cggaacgaat ggaggtgttc 4260 gtaatggatg tcgccatatt tggcgcaaca tgttccccgt gttctgcgca gtttgcaaaa 4320 aaccttaatg ctaaggaaca cgcggcagaa tatccaagtg ccgcaactgc aatagtcgaa 4380 aatcactatg ttgacgacta cttggatagt gtggacacaa ttgagcaggc aatccagttg 4440 gcaacggaag tgaaattggt ccatcaacga ggaggatttg atcttcgaca ctggctgtcc 4500 aaccgaaagg aagtattgga gaagcttggc gaaggatatt ccgatgagtc taaaaacttc 4560 gttatgggaa aaggctgcag ttcagagcgg attttgggca tgatttggct accgcaggag 4620 gacgtatttt cgttcgccat agaatatcga aaagatctgg gccggctact gaacggagac 4680 gtctatccca ccaaaagaga actcctaagc ctcgtaatga gtatatttga tcccctgggg 4740 ctggtggcta attacgtcat tcacgggaag gtattaatac aagacgtatg gcgaacgaat 4800 accggctggg atgaaaaaat accggatgaa atctttccgt cgtggcaacg atgggtagtg 4860 tcgcttcgtg aactggcaca gttgcgtata gatcgatgtt attttccggg ttatactcct 4920 gcggcattag aaacgctgga attgcacgtg ttcgtggatg cgagtttggc agcttacgcg 4980 gccgtagcct acttccgcat agacgatcgt gggacgatcc gctgtagcct cgtgtcatct 5040 aagactaagg tggcaccact caagcaactc tccgtaccta gactggagct acaagcagct 5100 gttctgggta cgaggttaat gaagtctatt attggtggcc atacccttcc catccgtaga 5160 aaggtcttct ggagtgattc caataccgtc ctggcttggc ttcgtgcaga tcagcgtaga 5220 ttccaccagt ttgtggcgtt cagaattgga gagatccagg aacaaaccga catttcagaa 5280 tggaaaaagg tgccttcaag gtggaacgtt gcagatgatg ccacaaagtg gggtttcgac 5340 tcgaaagctc tcaacagcag tagatgggtg caaggtcccg agtttcttca caaggatgaa 5400 tcggaatggc ctgaacaaag caacgacttg attactgaag aagaattgcg tgcaaccgtt 5460 gccacccacc gtcgagcaga atacgatgaa atcatacagt tcggacgttt cttaaaagga 5520 gacttggcaa gcatcaaatg cccagaaccg gaattctcaa cagcgaagag ttactggagg 5580 cagaaaggac gataatctgt ttgatccagc gtgcagaatt tccmgaggaa catgccgttc 5640 tgaattacaa tcgacaattg aagaatggcg atctcaaagc gatcaaaaag atcagtaaga 5700 tctataaatt gtgccctttc atcggagctg atggtgttat acgcaaagac ggacgcatcg 5760 gtgcggcgcc atggctaacc cttgaaacaa aatttccagc tattctcccw aaaaaccatt 5820 atgtaacggc gttgatcatt gacgagattc atcgcaaatt cggccattcg aacaacgaaa 5880 cagttgtcaa tgaagtcagg cagcgatacc atgtgtccga gctccgagtc gtcgtgagaa 5940 gggtagccca aggctgtcaa cgatgcaagg tgcgcaaaac agtcccccga ccaccgcgaa 6000 tgagtcctct tcccgaagtg cgtctaactc cgtatatcaa gccattcacg tacaccggac 6060 ttgattattt tggtccactt actgtgagaa tagggcggag tcaaaccaag cggtgggtgg 6120 ctttgttcac ctgccttact acaagggccg tgcatctgga actggctttc actttatcca 6180 cggaatcttg caagctggct atccgaaggt tcgttgctcg ccgtgggccc ccagcggaga 6240 tattttcgga tcaaggaacg aacttccagg gagcccggaa agagctaaag gagcaaattc 6300 aacggatgaa tagcgaattg tctacaactt tcaccaatac cacaacaaag tggaagttga 6360 accccccgta cgctccacac atgggtggca tatgggagag gttggtcaga tcggtgaaga 6420 gcggattggc agctatggag atttcaagga gtgctgatga agaaacactg ctcacagcgt 6480 tggcggaagt agagtctatg gtcaacacac gtccgctcac atatttgcca ttggatgcag 6540 cagaagatca tgcattgacc ccgaaccact tcctacttct gagttccaac ggcgtatgtc 6600 agcctgccgt tattccgcag gatgaaaaat cagcgctacg gtcgaactgg aaccatgtga 6660 gggtcatgtt ggacagattc tggaagaggt ggattaaaga gtatctgccg gcgattacga 6720 ggcagtccaa gtggttcggg gaacaaaaac ccctgaaggt cggtgacctg gtaatttccg 6780 tcaatgagaa tgtgaggaat agttgggatc gtggagtcgt tgtcaacgta tatccaggaa 6840 aggatgggag aatcagacgt gcagatgtga aaacgacagc cggtatattt ctgcggccag 6900 tgacccagtt ggcggtctta gacgtgcagg atgaaggtgg tatagctgaa gggactacca 6960 gcaatacggg gcggagta 6978 // ID Lep1 repbase; DNA; INV; 134 BP. XX AC AF009827; D66906; XX DT 06-NOV-2008 (Rel. 13.12, Created) DT 06-NOV-2008 (Rel. 13.12, Last updated, Version 1) XX DE A tRNA-derived short interspersed nuclear element (SINE) that is DE highly prevalent within the genomes of Lepidoptera. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; short interspersed nuclear element; tRNA-derived; KW Lep1. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-134 RA Yang C., Teng X., Zurovec M., Scheller K. and Sehnal F.; RT "Characterization of the P25 silk gene and associated insertion RT elements in Galleria mellonella."; RL Gene 209(1-2), 157-165 (1998). XX RN [2] RP 1-134 RA Van't Hof A.E., Brakefield P.M., Saccheri I.J. and Zwaan B.J.; RT "Evolutionary dynamics of multilocus microsatellite arrangements RT in the genome of the butterfly Bicyclus anynana, with RT implications for other Lepidoptera."; RL Heredity 98(5), 320-328 (2007). XX RN [3] RP 1-134 RA Coates B.S. and Sumerford D.V.; RT "Genome impact of the Lep1 tRNA-derived short interspersed RT nuclear element (SINE)."; RL Molecular Biology and Evolution0-0 (2009)To be submitted. XX DR [1] (Consensus) XX CC This is a consensus sequence from multiple lepidopteran species, CC but first described by Yang et al. (1998). The second mention of CC the Lep1 sequence in the literature was made by Van't Hof et al. CC (2007), wherein it was named the "lepidopteran specific common CC sequence" (LSCS3). XX SQ Sequence 134 BP; 48 A; 19 C; 22 G; 45 T; 0 other; atctatacta atattataaa gaggaaagat ttgtttgttt gtaatgaata ggctcagaaa 60 ctactgaacc gatttgaaaa attctttcac tgttggaaag ctacactatt cccgagtaac 120 ataggctaaa tttt 134 // ID Gypsy-93_AA-I repbase; DNA; INV; 6103 BP. XX AC supercont1.368; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-93_AA_; KW Gypsy-93_AA-LTR; Gypsy-93_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6103 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.368; Positions 476350 470248. XX CC Positions [3267-3764] - Reverse transcriptase CC Positions [5035-5478] - Integrase core CC 'GGCGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1688..2893 FT /product="Gypsy-93_AA-I_2p" FT /translation="MANDQNRIDEEIVQVVQQNVSLFSPSSYNLPHFKYKH FT LPPSEIRNSWVGWIRWFESIMAAWVGGLELQSAFYGIPGFDDIVPNHDPYQ FT SAKEKLDQLFSPKHHDSFERFMFWTMTPEEDEAIDKFALKVQHKAAKCSFG FT KSESESRNIAVVDKIIQFTPNDLRQKLLEKDILTLDDTLKVVNAYQSVRYQ FT ASKMNPKNAISNSTSVNRLYDKPKPNNTVITNLRCIRCGYKRHYDKDRCPA FT LNKTCLKCKKVGHFQSASRSKPPMNNASQDRKRKSAYPNNRDFGNSIKRPR FT NIFNIDEANQRDPIFEDLPVYNVGDNDEELITCRVGGVDIVMLIDSGSKHN FT LIDDTTWELMKLKDVQITNQRVDQEKRFLAYGRIPLKLITTFDATLEIDDG FT DRYLFLIF" FT CDS 2931..4517 FT /product="Gypsy-93_AA-I_3p" FT /translation="MMRSIIQVVKLYKISHLYRVLSTTTSFYVIEKGQQPL FT MGKITAQSLGVLKIGLPSQNNVVDRVETKKEFPKIKGVSLSIPIDRSIPPV FT IQPLRRCPIPLLQKVKSKLDELLEMGIIEKVTQPTSWVSPLVPIIKDNGEL FT RLCIDMRRANQAVQRLNHPLPIFDDLSPKFRNARYFTTLDIKQAFHQVELN FT ENCRDITTFVTNWGLYRYKRLLFGVNCAPELFQNLMESILASCRNTVVFID FT DIMIFGSSEEEHDSCVKEVLKVLNQQGILLNDHKCNFKQRETVFLGHKLSA FT DGIAPADEKVKSILQFRAPQTKEELRSFLGLVTYVAKFIPDLATVNSPLRN FT LLRQETPFDWKAEHQQSFDKLKNQIGSVMNLGYFDPQDRTLVVTDASGVGL FT GAVLIQFKGNQPRIICYASKSLSDVEKKYPIIEKEALGIVWSIERFKMYLM FT GITFELETDHRPLETLFAITSKPTARIKRWILRVQAFKFKVVIVLIRSVSI FT FFSLLTKKSVIVGCVQKRFSKPSRRTVTIKFSC" FT CDS 4549..5478 FT /product="Gypsy-93_AA-I_1p" FT /translation="MYIRHIMVKAISTLSESENREIFDSNTECMIRTVQES FT AALDIVEVITAAENDSEMQALKDCILSGKWSNEALKHYSAFQLEYSFVNGL FT IMRGTKLVTPMTLRQRMCQLAHEGHPGQSMMKRRLRERCWWPGIDRDAVKF FT CETCEGCRLVQIPDPPEPMARRQLPNKPWIDIAIDFLGPLPTGEYVLVVID FT YYSRYMELEIMRKITAQETIKRLKRIFRTWGLPRTITLDNAKQFVSAEFED FT FCKFSRIHLNHTSPYWPQANGEVERQNRSLLKRMRFLMRFTMTGRRNSISI FT CSFIIIHRIRLQELHPVN" XX SQ Sequence 6103 BP; 1976 A; 1086 C; 1364 G; 1677 T; 0 other; aattggcgac gaggtttcag aacccgaatt gaaaattgtg aatttattgt gagttggtga 60 gaatcaaggt aataaaaatt acgacaaaaa gaaaaaatta aattttcaac taaccggttg 120 tctaattcgt gcgtggcaaa gaagagcagc agaatggacg ggattgtggt tgaatggtgt 180 gatcgattaa atacattaaa aaaaagaaaa aacaagttta tacctaatgt gaagaaaaat 240 tgctgaagag tgattgggaa aaataaaaca aacggaaaaa gggaatgtgc gcagaaaaaa 300 aaatacccaa gaaagcagcg tggcgtgatt ttttttttca tgctgctgaa aaaaccaagg 360 tcataagtga cgtgaatgtg tgtgaaaggg tatccgcttg gcctcatagc ctccctgcca 420 gtgttgttgc tctaagcatg tgtgaaagcg gtgggagtgt gaaagctgat gtgatgcgtt 480 ggtttggtgt tggtggtttt cattcggttg cttgctgagc atccacagcc aaatcagaat 540 aagtgagcgt tgtatttctt atccaattga acggtaatta tattttacac aaaggtgatt 600 actgtcatga gtatgctgaa attaaaggta agtgatttta atgcaatttt aacaatgatg 660 gccagctcca tacgacaatg gtttttaatt gttattttaa atcctgctat agctatttta 720 aaaaggtttt caaatgccgt attgtattga aaagattatc tttgagttaa aaataataac 780 tgtgtttcgc tcaaaagaga aagagattag ccggagagtt ttcgtcaata taatacgtat 840 tcatttgaag aagtcaatga tgatggaatg aatcattgag aattggctgc gcatgcactc 900 ttcgatgatg tgcctgtgca tgtgtaagta tagtgaaagc aagaaagtgc gctcttgaat 960 ggtgcgcttg ttcagtgaaa acaaataaaa aaaaaacttg ctcttgaatg ttgcgagata 1020 gagacccata cctatttgtt cgctcttcga tgatgcgtgt gactgtatga tagctggaaa 1080 gctcttcaat gtgctaggaa aaagcttgcc tttggataat gtgtaggtgc gctcttggat 1140 gatgcgtgtg tgttgattga aatgtagttc gacatgcttg aaagctcgct cttaagcgtt 1200 gcgtaggtgc gctcttggat gatgcgtgtg tttcgattga aatgttgttt gacatgggta 1260 cgaactgtat aaatagctct taaatgacgc ttgaaggtcg ctcttaaatg atgtgtaggt 1320 acgctcttgg atgatgcgta tgtattgatt aaaatgttgt tcgtcatggt tacggactgt 1380 atgaattgtt cttaaatgat gcttgaaagc tcgctattaa ataatgcgta ggtacgctct 1440 tggatgatgc gtgtgtgttc gttggaaagt agggctgaat ggtttagaat gattatcact 1500 tttgaatgat gtaaaataat accacaaatg catataatga aatgcaatgt tatgaaaact 1560 aaattgatac aaacagagca aagctgcaaa gaatgtaatg tatgtgtgtg taacatgccc 1620 ttataataaa ctgtgacgta caaggttgaa agttgatttt tttattaaca gcggcagtgt 1680 tgtcagaatg gcgaatgacc agaatcgaat cgatgaggaa atcgttcaag tggttcaaca 1740 aaatgtatcg ttgttcagtc cgtcatccta taatctacca cattttaaat acaaacattt 1800 gccaccctct gagattcgta attcatgggt tggatggata cggtggttcg agagcattat 1860 ggcagcatgg gtgggtggtc ttgaactaca gagcgcattc tacggcatac ctggattcga 1920 tgatatagtc ccaaaccatg atccgtatca atccgcaaaa gagaaactag atcaactctt 1980 ctctccaaaa caccatgata gttttgagag gttcatgttc tggactatga cccctgagga 2040 agatgaggcg atcgataaat ttgctttaaa agttcagcac aaggctgcga aatgttcatt 2100 cgggaagtcc gagagtgaga gtagaaacat tgccgttgtt gacaaaatta ttcagtttac 2160 gccaaacgat ttaaggcaaa aattgctaga aaaggacatt ttgactcttg acgatacctt 2220 gaaggtagta aatgcatacc aatcagtgcg ttaccaagca tcaaagatga accccaagaa 2280 tgccatatcg aattccacaa gtgtaaatcg attgtacgat aaaccaaaac cgaacaatac 2340 agtaatcacg aatctacgtt gcatacgatg cggctataaa cgccattatg acaaagatcg 2400 ttgtcctgct ctcaacaaga cttgcttgaa atgcaaaaag gttggccatt tccagtctgc 2460 cagtagatcg aagccaccaa tgaataatgc ttctcaagat cgtaaacgaa aatcagccta 2520 tcccaacaac cgcgatttcg gcaactcaat caaacgtcct cgaaatatct tcaatattga 2580 cgaggctaat caacgtgatc cgatttttga agatttacct gtctacaatg taggagacaa 2640 cgacgaagag ttaataacat gccgtgttgg aggtgtagac attgtcatgc taatcgattc 2700 tgggtcaaaa cacaatttga ttgatgacac tacttgggaa ctgatgaaat tgaaagatgt 2760 ccaaattacc aaccaaagag tcgaccaaga aaagaggttt ctggcgtatg gaagaatccc 2820 cttgaagctc atcacgacct tcgacgcgac attggaaatt gacgatggtg acaggtactt 2880 atttttgata ttttagctca attaattggc tttgttgccc aacatcagac atgatgagaa 2940 gtattattca ggtagttaaa ttgtacaaaa tttcacattt atacagggtg ttaagcacta 3000 caacatcgtt ctatgtaata gaaaaaggac aacagcctct aatgggaaaa ataacagcac 3060 aaagtctagg agtactcaaa attggcctgc cgagccaaaa caatgttgtg gacagagttg 3120 aaactaagaa agagtttcct aaaattaagg gtgtcagtct aagcattcct attgatcgga 3180 gtattcctcc ggtcattcag ccactgcgtc gttgtccaat accgttactt caaaaagtca 3240 agtccaaact tgatgagctg ttggagatgg ggataataga gaaggtcaca caacccacat 3300 cttgggtttc gcctctagtt ccaatcatta aagataatgg ggaactcaga ctgtgcattg 3360 atatgcgaag agctaaccaa gcagttcaaa ggctcaatca tccactgccg atttttgacg 3420 atctgagtcc gaaattccga aatgctagat acttcacaac gctcgacatt aaacaagcgt 3480 ttcatcaggt ggagttgaat gaaaactgtc gagatataac cacatttgtt acaaactggg 3540 gcttgtatcg ctacaaaagg cttctttttg gagtaaattg tgcgcctgaa ctgtttcaga 3600 atctcatgga gagcatactg gcaagttgta ggaataccgt ggtgtttatt gacgatatca 3660 tgatattcgg atcgtctgaa gaagaacacg acagctgtgt aaaggaagta ctaaaggttc 3720 ttaatcaaca aggaatattg ctcaatgatc ataaatgcaa ctttaaacag cgagaaacag 3780 tgtttctagg ccacaagctg tctgctgacg gaatcgctcc cgcagatgaa aaggtaaaat 3840 caatccttca gttccgggca ccgcaaacaa aagaagaatt gagaagcttc ctgggtttgg 3900 tgacatatgt agcaaaattc atacctgatc ttgcaacagt taactctcca ttgcgaaacc 3960 tactgaggca agaaacaccc tttgattgga aagcagaaca ccagcaatcg tttgataagc 4020 tcaagaacca gataggatca gtcatgaacc taggatattt cgaccctcaa gatcgcactc 4080 ttgtagtcac agacgcttcc ggtgtagggt taggagcggt gcttattcag tttaaaggga 4140 atcagcctcg tatcatttgc tacgcgtcaa aaagtctctc agatgtcgaa aagaaatatc 4200 ctattattga aaaggaagcg ctaggaatag tttggtccat cgagcgattt aagatgtatc 4260 tgatgggaat aacgtttgaa ttggaaaccg accaccgccc attggaaact ttgtttgcga 4320 taacttccaa gccaacagca aggattaaac gttggatatt gcgcgtccaa gctttcaagt 4380 ttaaggtagt tattgttctg ataaggtctg tttctatttt tttttctctt ctaactaaaa 4440 aaagtgtaat tgtaggttgt gtacagaaaa ggttcagcaa acctagccga cgtactgtca 4500 cgattaagtt ctcatgttga agatagtcat tgggttgatg attcagaaat gtacatacgg 4560 catattatgg tcaaggcaat ctctacttta agtgagagtg aaaaccgcga aatcttcgat 4620 tccaacactg aatgtatgat aagaacagtc caagagagtg cagctttaga tatcgtagag 4680 gtcattactg cagcagaaaa tgattctgaa atgcaagccc tgaaagactg tattttgagt 4740 ggcaaatgga gcaacgaagc tttaaagcac tactcggcat ttcagctgga gtattcattt 4800 gtaaatggac ttattatgcg aggaacaaaa cttgtcactc ccatgacatt aaggcagcgt 4860 atgtgtcagt tggcacatga gggacatcct ggtcaatcca tgatgaagcg taggttacgc 4920 gaacgatgct ggtggcctgg catagaccga gatgctgtaa aattctgtga aacctgcgaa 4980 ggttgtagat tggttcaaat tccagatcct cctgaaccaa tggcacggcg gcagttacca 5040 aataagccgt ggatagatat cgcgatcgat tttctgggac cattaccaac cggagaatac 5100 gtcttagtgg taatagacta ctatagccga tacatggagt tggagatcat gagaaaaata 5160 actgcacaag aaaccatcaa aaggctcaaa agaatatttc gaacatgggg tttaccaaga 5220 actattacct tggacaacgc aaagcagttc gtttcggcgg aattcgagga tttttgcaag 5280 ttcagtagaa ttcatttgaa tcatacttcg ccgtactggc cgcaagccaa cggcgaagta 5340 gaacgtcaaa ataggtcttt gttgaaacgg atgagatttc tcatgcgctt cacgatgact 5400 ggaaggagga actcgatcag tatctgcagc tttataataa tacaccgcat acggttacag 5460 gagttgcacc cagtgaattg attcagaacc ggaaagttcg aactaagttg ccacacatcg 5520 atgatctaga aacgacacca tccagcactg agttcagaga tagagatatg gaacacaaga 5580 tgcttggaaa agaacgggaa gatgctgtac gtaaagcaag gaccagtgaa ataaaagttg 5640 gagatgtagt tctcatgaga aatcttctac caaccaacaa gctttcaacg aatttcatga 5700 aagagaaatt cactgttgtt gacaagcaag gctctaacgt gacggttcag tctgatgata 5760 ctggaaagac atatgatcgt aacatctcgc atctgaagct gattccttct tcatcgccgg 5820 aagaagcaga aagtaaagca gaatccaaaa gaacagagtt gcaagatacg cgtcacacga 5880 gaacttcaaa cttcgaacca ttgttaacat cgccaagctc gaaggttcgg cgttcgacaa 5940 gagtagcaaa acctgtgcag aagttcaaca tatcaagata agtataagtt tcagaattga 6000 tttgttttca tatactttga atcattttaa actataagta tagttaactt ttgttgaaac 6060 aatataaatt tacaatacat aactttctac agggaaaagg gga 6103 // ID Gypsy19-LTR_Dya repbase; DNA; INV; 533 BP. XX AC chrU; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy19_Dya; KW Gypsy19-I_Dya; Gypsy19-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-533 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1115-1115 (2009). XX DR Genome; chrU; Positions 1327489 1326957. XX SQ Sequence 533 BP; 179 A; 104 C; 109 G; 141 T; 0 other; tgcttgattt tatttttggg cctaagtaca tttgtgtcat aagggagact ttttgattgc 60 actttcaacc tttttatctt attattacca gtagttctca cctacattcc ttgttgaaaa 120 aattctaagg aaaacatacc ccaattcctt aatacttaaa ggtggcagcc gatcacggac 180 acagagagcg gtgcatcaca aagtgcgacc gcgggagaaa cacagactgt atgatactca 240 tgatagggag agtttggaaa acttaaggac agattatgcc cgaaaactct gccgatagct 300 ctttgagaca tcaagacacc agatgcctta attaagtgag cacagaggaa cagaaaccat 360 gcatcatatg cctttgaagc tgtgaccagt tcccaatttc actccgaact atattggaat 420 caaacatttc gaattctagt tgcaaaggta atttaccagg aattcaaaat ctatggaacc 480 ggcagagggg aaaaagggac tgataaaggt caagaattaa tggaacaaaa gca 533 // ID Gypsy-11_IS-LTR repbase; DNA; INV; 217 BP. XX AC ABJB010103780; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_IS_; KW Gypsy-11_IS-I; Gypsy-11_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010103780; Positions 20426 20642. XX SQ Sequence 217 BP; 40 A; 60 C; 55 G; 62 T; 0 other; tgtggtgtca tcctgaccgc atcatcatca tattgctgta ccatcactga ctgtcattca 60 ctatctgacc tgcgtacgtg actcgcctcg ctctccgctt atcagtggta cggcggttgg 120 cgatatatac gcgcgcggaa taaaggctgg gggttgttct gctgggaact acttggttgg 180 ctgtcacttc cttcatcgtt ccgcgagaca cacgaca 217 // ID Copia-55_AA-LTR repbase; DNA; INV; 212 BP. XX AC supercont1.168; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-55_AA_; KW Copia-55_AA-I; Copia-55_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.168; Positions 1202299 1202088. XX SQ Sequence 212 BP; 56 A; 55 C; 33 G; 68 T; 0 other; tgttgggaat gtcaatttcc tatcccttat agtcaaaccc cattgaccga cccctcgaca 60 gtgagctgtc attccatacc ggaagctgac aatatttttc attctaaaac tgtacctcaa 120 agagaaaaca cgttattcat taaaactcgt tcgttcaact aagttgttgc gtttctcttt 180 gtgtgattcc gttaccactg cccgaattcc ca 212 // ID Gypsy1-LTR_Dmoj repbase; DNA; INV; 245 BP. XX AC scaffold_6123; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_Dmoj; KW Gypsy1-I_Dmoj; Gypsy1-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1029-1029 (2009). XX DR Genome; scaffold_6123; Positions 482 238. XX SQ Sequence 245 BP; 70 A; 20 C; 21 G; 134 T; 0 other; tgacaaaggc aagagattaa cttaagagaa gcttatttaa ctattaagag caattattag 60 ctgctgcgcc gaataattta tttatttatt tatttattta tttatttatt tatttattta 120 tttatttatt tatttattta tttatttatt tatttattta tttatttatt tatttattta 180 tttatttatt tatttattaa tttatttatt tattttgtaa ggcggccttt tttgcccacc 240 ttaca 245 // ID Copia-25_DPu-LTR repbase; DNA; INV; 247 BP. XX AC scaffold_34; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: long terminal DE repeat. XX KW LTR Retrotransposon; Transposable Element; Copia-25_DPu-LTR. XX NM Copia-25_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 714-714 (2010). XX DR Genome; scaffold_34; Positions 253097 252851. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 247 BP; 85 A; 47 C; 40 G; 75 T; 0 other; tgttggtaca aaaaattgcg cgtcagcgtt gctgaaccaa agttggttgc ataaaccgat 60 taaagaaaaa ctgtctgctg tcaccaccag ttgtcaactc tcttctcttc ttgaaaagtg 120 tttccctaaa atatcgcttt cttgtgtttc acaatttcaa ctgtatagaa ggtaaagaaa 180 gacattcaaa cagaataaaa ggaaatcttg tgtgtaaaat caattctaat cagtataact 240 tccaaca 247 // ID Sola1-8_AP repbase; DNA; INV; 3562 BP. XX AC ABLF01000511.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-8_AP. XX NM Sola1-8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3562 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(1451..2461,2465..2854,2797..3159) FT /product="Sola1-8_AP_1p" FT /translation="MARKIKQLGSCRRKCNELFKDNDRKIIFEEYWSMGDY FT AKRVSYVMALMNLKQKMTERKKVDISEKQRNKTFSREYFLRINGEKKKICC FT SCFLSTLGETKGFLNTVVAKCISSKSGIPQNDLRGKREPHNKLSEQDLKNA FT KDHILSIPSYESHYSRRDTSKRYLPSHFTLSSLYEQYKQTTDTPIGRPKYE FT AIFHDLNLSIKKPKKDTCNKCDTFHMRISLADGDEKVNLQIAFRDHLEQAD FT IGYTSKAEDKEEAKTDDKKEKKVLTFDLQQCLPTPHLETNVVFYKRQLWTY FT NLTVHDCSDNPAYCFMWSEVDGGRGANQIASCVAKHLYTWICVDLLHGLPT FT KHVTLYSDTCGGQNKNSHMVAMFMSVIRYHNILEVIDHKFLLAGHTHMECD FT SDHALIEKKKKTSKTIIYHPHDWMQLVRQTGKKNPFIVTSMMHSDFLNYSE FT LLKNQLQMKKKIVMVKNLYGKIKSTTDEKKNSNGEKFVWKNIKWLRFKKTE FT LGILYYKETLDIDAPFKSLSLCSKGRSHYQLTPPLCYNRPVPIAEKKKKDL FT LDLLPMIPNVFHDFCKNLPTDATLKETYPNNETDTEEENS*" XX SQ Sequence 3562 BP; 1365 A; 482 C; 559 G; 1156 T; 0 other; cgatggtgct cgggaaaaac tggctaagta taaaatttgc gaaaattagg ttatgtcata 60 tctagataca gaagaatcag acggtgtttt gggtaaaatg gtctaagtca tacaccgatt 120 ttaccagttg ttatacactt atgaaaaata gaacatttca caaaattaat tttgtgtttg 180 agaaaaacca gttatatgac ttagacgatt ttgtacatac tattatcatt gcgaaatttg 240 gcctatgtca acttagacaa ctttacacca aaataatgat atgtgataaa gtggctaagt 300 gacttggcgg tgtgtagaca taattccttg tagacatagt gaatattacc cataatttaa 360 taaaacaact attccccatt tttgtaacat ctctgactag agagccttta ttatctagga 420 tctctatctg tgactatcaa taatcaataa tatgtattaa aatcatagac cttatactaa 480 tataaggtat atgattaaaa tatattaata atcttatcat agctatattt tatcaatcat 540 aaaaatatta tgattgtaca taatatttta attattttat tagtgttatt ggtaggcagt 600 agctattata caaacgttat actaatttat tacgttataa caataaaact attattactg 660 atctcgtaca tgattgtatt tattattttg gtaaaatatc gtatacaata tagttataat 720 gtttaaatcc agaacagcca aacttattgc aatgtgttat tcagataaca ataaaaacaa 780 ctcaggtaag caaatgtgtt aatgcaagat cactgttttt ttcctaaatg ttgtagttta 840 tttatgtatt aaaataaatg tttagatgaa gaaaatacat cagttgagag caataacgac 900 tctggtgact catatagacc accacttgtt cgatcaaaga gttctaccag tgttgcatca 960 aacattgtaa gtacacttag gacaatgcaa tataatagat atatttaatt ttatttaata 1020 actttttggg tttgtgtaaa ttatcaattt aaactaagaa tttaattttt aatatatgta 1080 actattgtac attgcagaga taaactaact tgtatagtgt aggtacatac atgtcagtta 1140 ccattgtaac aatacattac ccacaaactc ataaactatt aataatttta tgttgtatat 1200 tatcataaat attattatta agtaaaacaa taattttttt tacaggaaga caatagtaat 1260 tatttagcca atatggatgg aaataataat tctcctttta caactgattc atcaactgtt 1320 gatggtaatg ctggtaaaaa aaaaaaagga aaattagagt gtaaaaggac aatccagaca 1380 agtaaggact gatagatgta ataaacgtaa taaaggagaa gagtacaaaa cgggaaaagg 1440 aaaggtgata atggcaagaa aaataaaaca gttgggttca tgtagaagaa agtgtaatga 1500 actgtttaag gataatgata gaaaaattat ttttgaagaa tattggagta tgggagatta 1560 tgcaaagaga gtttcttatg taatggccct tatgaatcta aaacaaaaaa tgactgaaag 1620 aaaaaaagtg gatatttctg aaaagcaaag aaataagaca ttttctagag agtacttctt 1680 gcgtattaat ggagaaaaga aaaaaatttg ttgttcttgt ttcttaagta ctttaggtga 1740 aacaaaaggg tttctcaata cagtagtggc aaagtgcatt tcatcaaaat caggtatacc 1800 tcaaaatgat ttgagaggta aaagagaacc acacaacaag ctgtctgagc aagatttaaa 1860 aaatgctaag gatcatatac taagtattcc ttcatatgaa agtcactatt ctaggcgtga 1920 tactagcaaa agataccttc catcacattt tacactgtca agtttatatg aacagtataa 1980 acaaacaaca gatacaccaa tcggccgtcc aaaatatgaa gccattttcc atgatttaaa 2040 cctgtctata aaaaaaccaa aaaaggacac ttgtaataaa tgtgatactt tccatatgcg 2100 cataagtttg gcagatggag atgaaaaagt aaatttgcag atagcgttta gggatcattt 2160 agaacaagct gacatcggat atacttcaaa agcagaagat aaagaagaag ctaaaactga 2220 tgacaaaaaa gaaaaaaaag tactgacatt tgatttacaa caatgccttc ctacgccaca 2280 tcttgaaaca aatgttgttt tctataaacg tcagttgtgg acgtacaacc tgaccgttca 2340 tgattgttca gacaatccag cttattgttt tatgtggtct gaggtcgatg gaggaagggg 2400 tgctaaccag atagcttctt gtgttgctaa acatctgtac acatggattt gtgtggattt 2460 gtgattacat ggattgccta ccaaacatgt aacactttac tcagatacct gtggggggca 2520 aaataaaaat tcccacatgg ttgctatgtt tatgtcagtt ataagatatc ataatatact 2580 agaagttata gatcataagt ttctattagc aggtcacact catatggaat gtgattctga 2640 ccacgcacta attgaaaaaa aaaaaaaaac aagtaagacc ataatatacc atccacacga 2700 ttggatgcaa ctagtcagac agacaggaaa aaaaaatcca ttcattgtaa cctccatgat 2760 gcatagtgat tttcttaact actcagaatt gcttaaaaat caactacaga tgaaaaaaaa 2820 aatagtaatg gtgaaaaatt tgtatggaaa aatataaaat ggttgagatt taaaaaaact 2880 gaactaggaa ttttatacta taaggaaaca ttagatatag atgcaccatt caaatcacta 2940 tcactctgct caaaaggaag aagccattat caacttacac caccattgtg ttataataga 3000 cctgttccga ttgcagagaa gaagaaaaaa gatcttttgg acttacttcc aatgatacca 3060 aatgtttttc atgacttttg caaaaattta ccaactgatg ctacactaaa agaaacatac 3120 ccaaacaatg agactgatac agaggaagaa aactcttaaa gcaatggact gaaataattt 3180 taaccattta tgtataagaa ttaaaagttt tattattaag tttattttgt ttttgtttat 3240 aatttactaa aaaactatgt actgtatatt tatacatttt tttctataat aataggtaat 3300 aactatctaa gtcaaaaaat attttgtgca atataggcta agtctgccta attaagtcaa 3360 ataaattatt ttgttgtatt aattataatt ttttattctg ttttaatata atttgttgtt 3420 ttttaaaggt tatagaaaac attagtcaat gttatgaatg aaaataaagg ttatcttatc 3480 tcttttttgt aattgatttt ttatttttga atattccaaa ttccaaattt tagacttagc 3540 aagtttttcc cgagcaccat cg 3562 // ID L1-6_Cis repbase; DNA; INV; 6160 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-6_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6160 RA Smit A.F.; RT "L1-6_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000747 4%. XX SQ Sequence 6160 BP; 2299 A; 1298 C; 838 G; 1703 T; 22 other; tctatttgaa taaggtacga attttgctcg gcagaatatt cttcggacca tattcaaata 60 gaacccttga gccttaataa gcgttggatt tatttcatcg taatattcat tggatttatc 120 acattggatt taccagctcc cacaaataga agatcttggt gagtctctgt ttttgcacct 180 tgtagcacat aggccttttt tcaggcgatc gctcgtactc gtttttttct ttttttcaca 240 ttattagtgt tgccctctct cactttagtt taaaattttt tctggcaaca ctaattatta 300 aaaaagcgcc tgtaattatt tttttgcact tcctaggccc taagtgtgca atttcataca 360 gcgaagccct agcaaatttt ttttcaaaca gcgacatcta gcgganatat attttccaga 420 cttagcaatt tttccantaa ttctccctta cgccgtctaa cggcctaatt cacacacgta 480 tttatccaca cggtggtctc tcttatatcg gtacaatatc attccacata tactcaatat 540 atccatacct attatatata acaaaacaca tatcccacaa tatacaacac accacaaaat 600 atacaacaca accacacaat atacatcaaa tatataacat atcgatcaag aagaagaaga 660 agtcatgcat ggacaagacg gacaagtacc tgaactggaa ataactttga cgcgttccat 720 tctaattaag ctttccggaa atgttgagct gatcagcgta tcctccttca tggagatatt 780 tgctgaagac gctgcactcc attatcttgg tccccaggtt gaaggtgtca taaccgaaaa 840 cttccaacaa tcggaattcc tcctcacctt gaaatcaatt gaaaatttag aaccacccaa 900 tatagaagaa gaaatcaaaa taataaatga caaaaacctt acctttaccg gaccaaacgg 960 ctccatcatg gatgtgtcgg ctaaagttcc ctcgcctccg atggagatag taactctcta 1020 tcccgtccca tgcctggttg acgagacaag attgcgtcaa ttaattaaaa aatataaatg 1080 gggcaaagtg gatagcttct attttggaac gcatcgacac tttccaaaca taaagaatgg 1140 atggctcacc atcaagatgt ttgatgtgaa cctcaataat attccaaaag taatgaaagt 1200 cggcggcaga tgggtcacag ttacccgtcc gggtgaatca catctccccc tttgtcgcta 1260 ctgtaaagag cgtgggcacc cccaacaaaa gtgcccaaag aaaggatggt gcaaccattg 1320 taacacccat ggccacctga cccgacgatg tcgatccttt ccgcccacag caggaaatta 1380 taacggatat ccatcccaaa cccgtcaaat aaatcaaaat cagacccggt acaacacccc 1440 tccaaccacc accgaaacat ggacaaccgt aatctcaaga tccaaagcaa aacagcaacc 1500 caaacaaaca ccaattgaaa cttccaaccc attctccagc atcgcaccac aagaaaccat 1560 cgatacattt gaggaaaaca actcaacaat cattcaatcc ggacctgatt tgtcactacc 1620 ccaaaggcca caaaaaacac ccaaacgaaa caaacagaaa ccacaatcgg caaactccac 1680 cccaattcaa ttcgaagata ttatgttgcc aacaataaaa cccagaaccc ctacttcttc 1740 gacaactgga aacattccga caatctatac aaccccacca acaccagcac ctcatcactc 1800 aaacaatttg acagagccaa tngaaaattt aacctccgac ccaacaaaca ccaacgataa 1860 ccctggaccc tccaaacttc aaccaaaatc aacaaccccc accacatcat ctgcagcgtc 1920 aacagacaac tccccggaat cggcgtgcta tattatcaat aaagcacaca aagaatacgt 1980 tagaaaccca aacaccaacc caccaccata catacaaaga accgtcaaag aactaagaac 2040 gcaactnttt gaatttaaca aggataactc acccaaaaaa agaagactcg aagatactca 2100 acaatagtgt tcctaaacac caacaagaac tacaaacaat aaatacatac tttttcaaca 2160 tcggtccata ctattaattt gcaatggatg gacatgaaat agataatcaa ataattaatt 2220 cttatccatt aaatatagca accattaatg tnaatggttt aaaatgtaaa ataaataatg 2280 taaacaacta tatagaaann aataatatag atatcttatg catncaagag actcaccgtg 2340 tggacaatac aactatttca cgactgcgat ctctatctcc atattgtata ttcttcaaca 2400 ccatcatatc gaattccaca caacatcgat gcaatcatgg tactgcgata tatgtacgaa 2460 atcatatatc aaattcgttt actattaatc atactatact ggtggaaaat atgatccata 2520 aaataaccct taccaatcgc aatctccaac ttaatattat taacacgtac ttaccatgtg 2580 ggaatttaaa tagaaataaa cgcagcgaat taatcaactt aattaacgaa aatataatta 2640 cgtcacaata tactcactcg ataatcctgg gtgactttaa tatgcttcta aatccccttg 2700 acatcaaagg aaactttgat catcgctcca aacctgatcg tcgagcctgg cgtaccctgg 2760 aaaaagataa atccattaat gatgcgtttc gtaaattaaa taaactcgaa attaattatt 2820 cacgcataac taacaccacg gcgacacgaa tcgatcgcat atatataagt aaaaatttaa 2880 atagcttcgc aactcgatat attcatgtcc gaaacttctt ctccgaccat aaccactgtc 2940 ccatactctc gattaatata gaaaattcta aaaaatgggg accctctttt tataaactaa 3000 ataactcctt attaactaat aatgatctaa ttgataaaat ggcccttttc tggaaaaact 3060 ggacatatga aaaacgtaaa tatagtaaca tcctaaattg gtgggatgac ggcaaacaaa 3120 tgcttaaaca catcactata gattactcat acggaaaatc gaaacgggaa aaacaaaatt 3180 acgnaaatgc tgtttcaaga ctatctgntc tccagcgcca actggagacg gtatctagaa 3240 acaaanaaat atctatcctt aaacataata taaacatnta cgaatcttct ataaatacgg 3300 gtgccatcat tcgctctaag gtcaaattta ttncaaacga agaatctcca aataaaaatt 3360 tttttgattt tgaaaaatcg caccaaaaaa aagataccat ttttcgagtt aaagataacg 3420 agggaaattt aacatctgat cctaaaacca ctctaaaagc tattgtnaat ttctataccg 3480 acctgtgggg aaattcacca tctaccatca atcccgataa ttatctacgt tgcatcgatc 3540 ctataaattt cgaagatgaa ctctccatat taacgcaacc aataaatata ttagaaatcg 3600 aaactgcnat taatgatatg cgtgataaca gcacccccgg aagtgacgga ctcacatacc 3660 ttttatataa aaaactcttt cctattataa aatacgattt agaagaagta tataataata 3720 tctatttaaa acaaacntta acaaatagca tgaaaacagc gatcgttaaa cttatntata 3780 aaaagggtga caaaacggac ttaaaaaatt ggcgccccat ctctttatta aactgcgact 3840 ataaaatcct tagcaaaata ataagtaatc gcttcgactt aatcattaat aaaataatta 3900 gtaaaaacca aaaatgcggn atcaaaggac gaaaaattaa tgatgccatg tacaatattc 3960 aagcagcaat taactccgca aaacatttta atcatccact cacggcaatt gcgatcgatt 4020 ttgaaaaggc tttcgatcgt actgaccccg aatttattat acaaatatgt ctaaaattaa 4080 atttaccccg aacccttatt aactggatac gtattatata taacgacgta aacagtcgaa 4140 tagagattaa cggcgcattc acgcctaaaa ttcatattaa acggggaata agacagggtt 4200 gtcccctaag tatggtacta tttttaattt ccatggaaat aataacacgt aaacttcact 4260 ataataataa aataatcggg tatcgactaa ataatatcga acttaaatgc gagcaatatg 4320 ccgatgatct aactatacta tctgggcaca attcgtcaat tcccgaaatn tttaaagaat 4380 tggaagaata tgctcaagta tcggatcaaa aaataaattt agtaaaaacc aaagtaataa 4440 gtaatgataa tttatctatt tatactctgc gcagatccta tccatctatc gaaattgccg 4500 aaactattaa aatcctaggt atattcttcg ccttttcacg tgaatgtata gatcaaaatt 4560 gggataaata taaaaatcat ataattaata ctttacgaat aaacaaacgt cgtcatttaa 4620 ctttaaatgg taaaaaaaca ataattaata ccttaatctt accgcaantg aacatggtag 4680 gagatattct tcccatgaat aaggcgaccc ttcagcgaat taattcggaa atttatcaat 4740 ttgtatggca tccctataaa atggaaatga ttcaaagaat aaaacttagc tcttcgtata 4800 aacatggtgg acttcaattc ccggacatta aaacgaaatt agacgctctc aaagccgctc 4860 gcttatatcg cctaaaacat ataactaaaa taacggaact atggcatgaa tgggcacgat 4920 ttaatctagg ctcaactatg tgctccctaa ataaactatt atatagtaac agcctaccaa 4980 acgcggcttt ccccgatcca ttctatcgtg acgttcgaca cgcatattat aaactgaaac 5040 gttgtgatta taattgggaa aatgggaaaa taaaagattt ttataacgca ttaatcgctc 5100 aacttacacc gattcataat atcgaattaa ataataaaat tattccatgg aaatatataa 5160 atctacaatc aaaagcgatt aaaccctttt tctccaacgt cgatcgcgat atcgcatata 5220 aaattgcacg gaacctccta cctcatggtg attggtttcg aacaagaaat atccgccctc 5280 aacacggtga tattatatac atccatgatt gcaaattctg tagaaataaa gacgataatt 5340 attctcatat tttaaccact tgtaatgtaa taagcacggt gatagacgcg atacataaat 5400 acgcgaacaa aatatcgttt aaaacttata atcgaaacga ttccctaatt ttatacaata 5460 aaacggatct cgcccaaaaa gacgagatac ttctaataaa gttattaaca ataactaaat 5520 gcgaaatata ttatgcaaaa cgcaaactgg atattcaaaa taaatacatg tggaataacg 5580 acgaattcgt caggaaaatt ttatggatcg tcaaaacaaa attcaaatta tatatcagaa 5640 aaatgattgt caaatatgat ttcgaataca tatacgataa atttaaatta cttcccgatt 5700 tcaattcata tgtactgtaa ttatttcttc aattataatt acctcctcaa ttatacgcta 5760 caaggtgcat tacagcgact caataattta attatccata aagtagttat ataaaaacct 5820 gaattctata aaagcaggtc ttctatttgt gggagacatt tttccatgca gattttgacc 5880 ttcaccttcg aaactttacc gcccttcaac ttcaccaaat nttcttcagt catcatctgt 5940 taaagattaa atgaattaaa ttataatcgt catgwatata aaattgtaaa tatgtaaata 6000 tgtaaattgt caatatgtaa atatataaat atgtaatatg tatgtaattt gataagtgtg 6060 ctacgtttta ttgattcgcg tttattattt gttgccatta ttatgtacgg ccgggtatcc 6120 ggtatacgtg tatcaggcgt ccaataaaaa aaaaaaaaaa 6160 // ID BEL-61_AA-I repbase; DNA; INV; 3352 BP. XX AC supercont1.17; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-61_AA_; KW BEL-61_AA-LTR; BEL-61_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3352 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.17; Positions 735018 731667. XX CC Positions [2401-2964] - Integrase core CC 'GAAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 868..3327 FT /product="BEL-61_AA-I_1p" FT /translation="MDDKDLDLTPEVSTEKVLGMWWCTASDTFMYKIGWNR FT YDRALMEGCRRPTKREVLRVLMSIFDPLGLIGHFLMFLKVLLQEVWRSGVQ FT WDEGITDDAFTKWKQWLKLLPQVEQVHVPRCYRSPSHDIIDRAELHTFADA FT SECGMSAVVFLRIIRNGTIECRIVAAKTRVAPLKFVSIPRMELQAALVGTR FT LAHTITDSLSIEISKRYYWTDSRDVLCWLNSDHRRYTQYVAFRVSEILDST FT EANEWCWVPSKMNVADDGTKWNSWPDLSPNSRWFAGPDFLRRHSKDWPRQP FT SKNNSTDKELRPNLLANHTPCEASIDAYKFSSWKRMLNVTAFVLRFLANCR FT QKLDGAPTTTGPLTAVELSSAEAYLLRSAQRDGYPEEMSHLEKLGQKHKYS FT ETLSKQSSLYKLTPWLDRQGLMRMRTRIAACHYATEDAKKPIILPRDHPTT FT NLIIAHYHRKFHHQNHEAVINELRQRFNIPRIRSAYAKVRKNCQRCKNDHA FT NPRPPIMADLPSARLEAYSPPFTHTGIDFFGPYEVAVGRRTEKRWGMLATC FT LTIRAVHIEVVASLSTDSCIMAIRNFMSRRGTPRVIYSDRGTNFIGASRQM FT KEAAAAVNVDQVMKEFVNVDTAWSFNPPLAPHMGGSWERLIGSVKRNLQTI FT NPPRNPTDEVLRNLLIEVENIINSRPLTHVPVDDDCAPALTPNHFLLGSSN FT GIKPFSTIDESSAALRQNILASQVLANRFWKRWLSDYLPEITKRSKWFNRT FT DPVTIGDLVVIADPKLPRNCWPKGKVIEVHPGKDGEVRSATVRTSTGVYVR FT AVTKLAILDVRRDESKPV" XX SQ Sequence 3352 BP; 971 A; 807 C; 823 G; 751 T; 0 other; aattaataaa ttcgtttacg cgaacaatgt cgggtcggaa aactgggaag gataaaaatg 60 gttcgaaagg tacgcggtcg aaggaaaagc aattagaagt gttcagtact agtggtgatg 120 caaatgtagt ggtggtggtc actgaagaag gtaaggagtc gtctggagat aagaaagatc 180 ctgagcaagc tggagaacag agctgtacgt tctgccagga ggcggacaat gacgaaatgg 240 tccaatgcga caagtgtgac aggtggatcc actttgcgtg tgttggcgtc acagaaggaa 300 tagcggatga gagctggagt tgtcccaaat gtgttactac aaccgggatc cagcagccgt 360 cttcttctgc cataaatcgc cttccgatca tatattcaag ccgcgcagga caacagaagc 420 cagctgccac aaaagatcag ttcacctcga aagcttctct tgaagtcggt aagaatatac 480 actgccacga taagacggcc cattcgttga agtcatcatc atcgcggaga tcccttttgc 540 aactacagtt acaacggctc gaggaagaac gagaacacga gaagcaacaa gcggaaaaac 600 atcgggctta tctggacaga aaatacgaac tactagaaca gatgcacagt cggaccggat 660 ccgaatatag tggatcacag gaacggatta gacagtgggt tcaggataca aacaatatcc 720 gtacggaaga cgccttagat ccccggtttg ctgaagtttt cgagccacag aggcactcta 780 cgcagaacta ttccggagaa gcacaggtcg gttccactcc accagcgact cggtttcgtc 840 agagcgattg cagaattgaa gaagttgatg gacgataaag atctggactt gacaccagaa 900 gtgagcaccg aaaaagtctt gggaatgtgg tggtgtactg cctcagacac atttatgtac 960 aaaatcggat ggaatcgcta cgaccgtgcc ctcatggaag ggtgcagacg cccaacaaaa 1020 cgagaggtgc tacgtgtgct gatgtctatt tttgatcctc ttgggctaat cggtcacttc 1080 ctgatgtttc tgaaagtact gttacaagaa gtgtggcggt ctggtgtgca gtgggacgag 1140 ggaataaccg acgatgcgtt cacgaagtgg aaacagtggc tgaagttatt acctcaagta 1200 gagcaggttc atgtcccaag atgttatcga tcgccatccc atgacatcat agacagagca 1260 gagcttcata catttgcgga tgccagtgaa tgtggcatgt ccgctgtcgt ctttttgcgt 1320 attatcagga acggaaccat cgaatgcaga atcgtggctg caaaaacgag agttgcgcca 1380 ttgaaatttg tatccatccc tcgtatggaa ttacaagccg ctttggtagg cactagactg 1440 gcacatacca ttacagattc cctgtccata gaaatatcca agcggtacta ctggaccgac 1500 tcccgcgatg tgctttgttg gttgaattct gaccatcggc gatacacaca gtacgttgcg 1560 ttccgtgtca gcgaaattct ggatagtacc gaagctaacg agtggtgctg ggtaccgtct 1620 aagatgaacg ttgcggatga cggtactaaa tggaactctt ggccagacct ctcacctaat 1680 agtagatggt ttgcaggtcc ggatttctta cggcgacact cgaaggactg gccacgccaa 1740 ccatcaaaga acaattctac ggacaaagaa ctccgtccta atttgctcgc gaatcacact 1800 ccatgtgaag cttccatcga tgcttacaaa ttctccagct ggaagcgcat gcttaacgtg 1860 acagccttcg ttctcaggtt tctggccaat tgtcgtcaaa aattggacgg agctccgact 1920 accaccggac cactaacagc ggtcgaatta agttccgctg aagcatatct tctacgatca 1980 gcgcagcgtg atggatatcc tgaagagatg tcccacttgg aaaagttagg acagaaacac 2040 aaatactcag aaacgttatc caagcaaagc agtctgtaca aattaactcc gtggctagat 2100 cgccaaggac tcatgcgaat gcgaacgcga attgcagcct gtcattacgc tacagaagac 2160 gctaaaaagc caataatctt accgcgtgac caccccacta ccaacttaat aattgcgcac 2220 tatcatcgaa aatttcatca ccaaaatcat gaagcggtga tcaacgaact tcgtcagaga 2280 ttcaacatcc cgcgtatccg ctcagcctac gccaaagtga gaaaaaactg ccaacgatgc 2340 aagaacgatc atgccaatcc acgtccacca atcatggcag accttccatc ggcgcgactt 2400 gaagcatatt cgccaccctt cacacacacg ggtatagact tttttgggcc atacgaagta 2460 gcagttggaa gacgcactga aaaacgatgg ggaatgctcg caacttgtct aaccattcgt 2520 gccgttcata tcgaagtcgt ggcctcttta agcactgatt cttgcataat ggcaattcgc 2580 aacttcatgt cgcgtcgagg tacacccagg gtgatttaca gcgatagggg aaccaacttt 2640 atcggagcgt ctcgtcaaat gaaagaggca gctgcagcag tgaatgtgga ccaagttatg 2700 aaggaatttg taaacgtgga tacagcttgg tccttcaatc cacccctcgc accacacatg 2760 ggtgggagct gggagagact gataggaagc gtcaagagaa atcttcaaac gataaaccca 2820 cctcgaaacc caactgatga agtacttcgc aacctactga tcgaagttga aaacatcatt 2880 aactccaggc cgctcaccca cgtcccggtt gatgatgact gtgctcccgc gctgacacca 2940 aaccattttc tccttggttc ctcaaatggt atcaagccgt tctctactat cgacgaaagc 3000 agtgctgctc ttcgtcaaaa tatactagca tcgcaggtgc tagccaacag gttctggaag 3060 cgttggctaa gcgactacct tcccgaaatc accaagcggt caaagtggtt caaccgcacc 3120 gatccagtaa ccataggtga tctggttgtg atagctgacc ccaaactacc ccgtaattgc 3180 tggcccaaag gtaaggtaat cgaagtacac ccaggcaagg acggagaagt tcgctcggcc 3240 actgtacgaa catctactgg agtatacgtt cgagcggtca ccaaattagc aattctggac 3300 gtgaggcgcg atgaaagtaa gccagtctga acgctggcgt acctgggggg ag 3352 // ID Copia-123_AA-I repbase; DNA; INV; 4056 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-123_AA_; KW Copia-123_AA-LTR; Ty1_copia_Ele112; Copia-123_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4056 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1470-1997] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 120..4043 FT /product="Copia-123_AA-I_1p" FT /translation="MADVKVSLEKLNDQNYSIWKFKMELLLVKEELLTFVT FT DPKPDVPTADWSLKDGKTRAMIGLAVEDSQLVHIIRKKTAKEMWDALKQIH FT EASSLSSTLHVLRKLCSLRLSEEGDLPGHLNEMTDLQNRLEVIGEGLKDRV FT FMALILSSLPPSFGSLINVIENKPEQELTVDFVKSKLREEWRRRLECNGGS FT VRGDEKVMQSVKSNKTNNKKKFVCHYCKEEGHFRNDCPKLAKTRQEKKKSA FT DSKANLAAEVPDGMEMCLMAVTTSGANRWYLDSGATSHMTSDKNLLKDVNS FT AKQPEICLADGKRIKSSGAGSGKLVSVTGDGVRMDVTLKDVYHVPSLSGNL FT LSVSKICDLGYKVLFDRTGCEVIKDQKAVLVGERSGGLYRLKNYSQQAFVT FT KAEHPDLCEHLWHRRFGHRNSEAISRIVREELGFGLKMKKCDIQCVCGICC FT EGKMSRIPFPKESGSKSKAVGDLVHSDLGGPMEKATPSGFRYYMTMVDDYS FT GYTVIYLLKAKSDTELKIREYCAMVKNQFGRAPRVIRTDGGGEYSSASLKA FT YLAENGIIHQQTAPYSPQQNGKAERKNRYEVEMVRCLLAESGLDKKYWGEA FT ICTANYLQNRLPTSAVEKTPYELWHGKKPSYSHLRTFGSEAYVHVPKEKRL FT KLDKKAKKMIFVGYAEGRKAYRFLDPEKDWIAISRDAKFLETGVLKSERAA FT ESEQSSPVQQSDSLEIKASAEEAEDKVVLSLGSETPNPGSLPEAVENPESE FT EELEESDPSPEIARRSSRPNKGIAPRRLIEEIFAVGDVSFQDPTEPSGFKD FT AMTGEKCAEWRRAMEAEMESHQVNGTWDLVKLPDGRKPVGCRWVYKLKRNA FT AGDVVKYKARLVAQGFSQKFGQDYDEVFAPVTKQTTLRTLLAVASKKKLIL FT KHFDVKTAYLYGELEEELYMKQPPGFEIKGKEALVCRLRRSIYGLKQSARC FT WNHRLHSVLLELGFEQSTADPCFYTKVVNGKRVYLLIYVDDILVGSGSEAE FT IKKIYEALKKEFEMTDLGDLNFFLGLEITRTEGNYGVSLEGYIDRVAERFG FT LRDAKEAKTPMEEGFTKGKSEGPLLEDATKYRSLVGALLYIAVCARPDIAV FT SASILGRSVSAPREEDWVAAKRVVRYLKATKDWRLQYGEPAGKLIGYSDAD FT WAGDAKTRKSTTGNIFLFAGAAISWASRLQSCVTLSSMESEYVALSEASQE FT AVWLRKLLEDFGEPQLEPVEISEDNQSCIKFVESERVSRRSKHIETREAYV FT KELCDQRVLKLVYRPTEEMVADALTKPLGRIKLQKFSSMLGLVAGNR" XX SQ Sequence 4056 BP; 1074 A; 806 C; 1256 G; 920 T; 0 other; ataggttatg agcccggtgt aacgcggaat ttgtgcggat ttgtggacat ttagtgaaga 60 atcgatcggt gaagtgtttt ttttcgtggt acgaacattt tgcgtcgcga gtgaacaaaa 120 tggcggatgt aaaagtgtcc ttggagaagc tgaacgacca gaactattcg atctggaagt 180 tcaagatgga gcttcttctg gtgaaggaag agctgctgac ctttgtcacg gatcctaagc 240 ctgatgtgcc gacggcggat tggtctttga aagacggaaa aactcgcgcg atgatcggct 300 tggcagtgga agacagtcag ctagtgcata tcatccggaa gaagacagcc aaggaaatgt 360 gggacgcgtt gaagcagatt cacgaggcat cgtctctgtc gtcaacactt cacgtcttgc 420 gcaagctgtg ttccctgcgg ctttcggaag aaggcgatct ccccggacac ctgaacgaga 480 tgacggattt gcagaaccgt ctggaagtga ttggggaagg tctgaaggat cgagttttca 540 tggcgctgat tttgtcgagc ctcccgccat cgtttggcag cttgatcaat gtgatcgaga 600 acaaaccgga acaagaacta acggtggatt ttgtgaagag caagctgcgg gaagagtggc 660 gacgtcgtct cgagtgcaac ggtggtagtg tgcgcggtga cgagaaagtg atgcaaagtg 720 ttaagtctaa caagacgaat aataagaaga aatttgtgtg tcactactgc aaggaagagg 780 gacatttccg aaacgattgt ccgaagctgg ctaaaacgcg acaggagaag aagaaatcgg 840 cagatagtaa ggcgaatctg gccgctgagg taccggatgg aatggagatg tgtctaatgg 900 ctgtgacgac ctctggcgca aaccggtggt acttggactc cggtgctacg tctcacatga 960 ccagcgacaa gaatctcctg aaggacgtga actcagcaaa gcagccggaa atctgcctcg 1020 ccgatgggaa acgaatcaag tcgagcggcg ccggaagcgg aaagctggtg tcagtcaccg 1080 gcgatggtgt gcggatggat gtcaccttga aggatgtgta tcatgttcct tcgttatcag 1140 gaaatttgct ctcagtaagt aaaatttgtg acctggggta caaagtttta tttgatcgta 1200 ctggatgtga agtaattaaa gatcaaaaag cagtgctggt tggtgaacgg agcggtggct 1260 tgtaccgttt gaagaactac tctcagcaag cttttgtgac caaggctgag catccggatt 1320 tgtgtgagca cttgtggcac cgaaggttcg gacatcggaa cagtgaagca atctcgagaa 1380 tcgtacgtga agaacttggt ttcggactga aaatgaagaa atgtgacatt cagtgcgttt 1440 gtggtatttg ctgtgaaggg aagatgagtc gcataccgtt tccgaaggaa tccggtagta 1500 aatccaaagc agtgggggat ttagtgcatt ccgatctggg gggtccaatg gagaaagcga 1560 ctcccagtgg tttcaggtac tacatgacga tggtggacga ttacagtggc tatacagtga 1620 tttacttgct gaaagcaaag tcggacactg aactgaagat tcgggaatat tgtgccatgg 1680 tgaagaacca gtttgggcgt gctccgaggg tgattcgtac cgacgggggc ggtgaatatt 1740 ccagtgcatc gttgaaggcg tacctggcgg agaatggtat cattcatcag caaactgctc 1800 cgtattcgcc acagcagaac gggaaggcgg aacgtaagaa tcggtatgaa gtggagatgg 1860 tgaggtgctt gctggcggag tctggtcttg acaagaagta ctggggagaa gcgatttgca 1920 ctgcaaacta cttgcagaat cggctaccaa cttctgctgt cgagaagacg ccgtacgagc 1980 tgtggcatgg gaagaagccg tcgtactctc atttgaggac ctttggatcc gaggcatacg 2040 ttcatgtgcc gaaagagaaa cgtttgaagc tggacaagaa ggcgaagaag atgatttttg 2100 tcggatacgc agaaggaagg aaagcttacc gttttctgga tccggaaaag gattggattg 2160 cgatcagccg agatgccaaa tttttggaga ctggtgtctt gaagtccgaa cgtgcggctg 2220 aatcggaaca gtcctcgccg gttcaacaat cagactcgtt ggagataaaa gcatccgctg 2280 aagaggcgga agataaggtg gtgttgtcac tcgggtcgga aactcccaat ccgggttcat 2340 tgcctgaagc tgtggaaaac ccggagtctg aggaagaact ggaggaatcg gatcccagtc 2400 cagaaattgc acgacgttca tcgcggccta acaaaggcat tgcaccgaga agattaatcg 2460 aggagatttt tgctgtcggt gatgtgtcat tccaggatcc gacggagcca tccggcttca 2520 aggatgcgat gaccggtgaa aagtgcgcag aatggaggcg tgccatggaa gctgaaatgg 2580 aatctcatca ggtgaatggt acttgggatc tggtgaagct acctgatggc cggaaacccg 2640 ttggatgtcg ttgggtgtat aagttgaaac ggaatgctgc tggtgatgtg gtgaaatata 2700 aggcgcgact agtggctcaa ggtttcagcc agaaatttgg acaggactat gacgaggtct 2760 ttgctcctgt aaccaagcaa acaacgctga ggaccctgct ggctgtcgca agcaagaaga 2820 agctgatcct gaagcatttc gacgtgaaga cggcgtacct gtatggtgag cttgaagaag 2880 agctgtacat gaagcaaccg ccggggttcg aaatcaaagg taaggaagca ttggtctgtc 2940 gcttgaggag gagcatatac gggctgaagc agtcggcccg atgctggaac caccggctac 3000 attccgtcct cctggagcta ggtttcgagc agagcacagc ggatccgtgc ttctatacga 3060 aggtcgtcaa tggaaagcgt gtctacctgc taatttacgt ggacgacatt ctggttggca 3120 gtggaagcga ggctgaaata aagaaaattt atgaagcgct gaagaaggag tttgagatga 3180 cggatcttgg cgatctgaac ttcttcttgg gactggagat caccaggacg gaagggaact 3240 acggcgtctc ccttgaaggc tacatcgatc gtgtggctga gcgatttgga cttcgtgacg 3300 ctaaggaggc gaaaacgcca atggaggaag gattcacgaa gggcaagtcg gaaggtccac 3360 ttctggagga tgcaacgaag tataggagct tagtcggagc cctactctac attgcggttt 3420 gtgccaggcc ggatatagca gtgagcgctt cgattctggg caggagtgtc agtgcaccac 3480 gggaagagga ctgggtggcg gccaagcgag tggtccgcta cttgaaggcg acgaaagact 3540 ggcgtctgca gtacggagaa ccggctggaa aactaattgg ttattccgac gcagattggg 3600 ctggtgatgc caaaacgagg aaatcgacaa ccggcaacat cttcttgttc gctggtgcgg 3660 caatatcttg ggcgagtcgt ttgcagagct gcgtaacttt gtcgtcaatg gagtcggagt 3720 acgttgctct aagtgaggca agtcaggagg cagtgtggct tcggaagctg ctcgaagatt 3780 ttggcgaacc acaattggaa cctgtggaga tatcggaaga caaccaaagt tgtattaagt 3840 tcgtcgaatc ggagagagtc agtcggcgat cgaagcacat agaaacccga gaggcctacg 3900 tgaaggaact gtgcgaccaa agggtgctga agctggtgta ccgtcccacc gaagagatgg 3960 tagctgacgc acttaccaaa ccgctgggca ggatcaagtt gcagaagttt tcttcaatgc 4020 ttggtcttgt agctggcaat cgttgaggag gagtat 4056 // ID Gypsy-18-I_NVi repbase; DNA; INV; 16651 BP. XX AC . XX DT 15-APR-2009 (Rel. 14.04, Created) DT 15-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-18-I_NVi; RRM motif. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-16651 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 773-773 (2009). XX DR [1] (Consensus) XX CC The 3'-end portion encode a RRM (RNA recognition motif) domain. XX FH Key Location/Qualifiers FT CDS join(3621..5171,4829..9100,8982..10316,10327..12345, FT 12363..13424,13544..15277) FT /product="Gypsy-18-I_NVi_1p" FT /translation="MTDETGNRTGENTGADLATTMANSFMLMSCVSGIPSF FT DGKHTKLRDYIQDLKNANELVTEAIRPQFMRNVLQRITGAAKKSLNNKTIT FT TVEDLIKHLKQRFGPGRDFSYYNTRIQTVKMRQGDTVGDFYDELNVLLSSA FT RNALKEEKGSDNLDAMMEPLEMLAVDIYIRGLPANLSERVDLFKPKNLREA FT YEEAVRLETRMEARIIPDSRPRVNRSYYNTERRDSYNNGYRNEGGGYNNRG FT GYNNEEYRNQDRYDNRNRRHDDFIGYMPEEEECAGYVEEEEDYARYDNEDE FT DYVGYVNEPEQRQREFQDNHRRNNYNRNNYNQNGYQGYQRGNGSGNNYNRG FT GYQRGYNNSNRPGGSNWPSWNRNYGQRSQRQIFNHENGRRENYQPSGPDRR FT ENYNNYPNNWRNGNARNYQQNENERENLNSQRARHGERMTSQEQNQQPIQK FT PQQNVDYQKYQQQTQGYNQGRQNPNQSSQNNQVQQKAPVMAIMTREQELPI FT YGQPTARDLKSLIGKMMNQQQLEKRQCKKLSTKRKRTREFKLSKGSSWRKN FT DEPGTEPTTYPETTAECGLPEISTTNTGLQSRQAEPKSEQPEQPGAAKGTS FT DGDNDKGTGTSNIWTADCKGPEEFNWENDEPTAIKVHIPQGTTKPWLTFMI FT DNGASVNLIKLSFIHDDMPINMKDVRNLGGITTGTVPTLGAVYLLIRNTPV FT KFHIVTDDFPTPHDGLLGRNYLKKEEAVISYFNNALMVGGDVMHPMPFLGQ FT EKEYREKRKRAHNSYKTSNVLQVEDKKDDKEAPVTDVSKNEKTPAKTKHVL FT KARTRQVVRINLIRSELKEGYIPRIDVGNENVFLGEGVVINDNNTCKMMAI FT NTSEEDVMIEVDAKELIPFDTEPNFLEETDSEFNGEVIVDRIKRLEKVKES FT IRRSHLNQEELKIVDRIIEDYLDRFLLPGDKLPCTDMIQHHIHLEDDIPIN FT TKQYRHPPKHKQVVRESVEKKLRDKIIRESNSPCNSPIWIVPKKPDSHGNP FT RWRMVIDFREINKKTIRDAYPLPNIADIMDQLGGATYFSIFDLASGFQQIP FT MAPEDCYKTAFTTLNGHYEYTRLPEGLKNATATFQRLMEKALRGLQNIEML FT VYLDDIIVYSKDLQEHEQRIRHMMDRLRLAKLVLQPDKIEFFRREVGFLGH FT IISSRGIEPNPEKVEAITKLPTPKTAKNVRTLLGMFGYYRKYIKDFAKIAK FT PLNDLLKKNVKFEWTEDCEKSYQILKDCLVREPILQFPDFNKEFTLTTDAS FT DYAIGAVLSQEKDGFDHPVQYLSRALNKAERNYSTTEKECLAVLYALHQFR FT PYLLCRKFTLVSDHEPLNWMHSRKDPGQRLMRWMFRFTGYEYTFKYKPGKL FT NKNADALSRNPPEMTEEEINKNLPSIKIMVIEEKQSKQEKAAASVAKPAKG FT IVQPRTRIQSTGDIANKPRGRGRPIGAKTNKEAPKLEHSVIAQRTRARRAQ FT VQSPVGQYIKQGAIPRVPKPVASRKATKPGKPVTAKPTAETSTEASTAETS FT SIQLKEIPESSTDADSDTSMTVNPSRRSWLSSTTNEDTDSEKPPIPDPRYS FT GLRQEDTETTEEETEDDEVFSLEGNQDKTISNIAIEVSAIEGSNTEESSED FT ESIKNLSVKTTLTREEVEEASRKFEESMKRYEMDKDTNTTESGTEIAIQLQ FT IPSHFSDDDEVADEVREAIYTSDPRFQTEEEIDIDNVWRKSVQKLIKRARQ FT KLQETTREENSGSDSDVESLADIIMPPPRKTLSVTPTITSGSPTAKSTPQN FT KTKARRKLGVLSEIEAEDGSIIDIRFSKPPEITMPFLLPTPPDLEPLNKDY FT TPTENDHESEDAPTHITFKKIPQNIITVRECITHKRDNIVHFLSADCDNTW FT PVTRLLVEIGAIDLTKIKSKKPKIGQILVTPLQKTSHIHSNNXRQMLQRNK FT NRKFKQGTSESKRNTHQSRHYKKHHIFTVIXKDKCFNVIKIENLNKGLRNL FT KETLINKKITSFRIARKGDILDQLESPIILDVLYKHFHGSRIRVTMCYGKA FT QVPAEKHRKEIISHLHDSLTGGHKGINQTYQKIRERYYWPGMRNDVQDYIR FT RCAECQEQKIERFKTREPMIITDTPIEAFDKVSIDTVGKLKMTPRGNCHLL FT TMQCNLTKYLIAIPIKNLNATTIADALAKYLICQFGAPRAILSDRGTSFLS FT KIVESLLKLFKINHLTTSGYRPQTNGSLERSHAPLIEFIRIYSERYDDWDH FT LTPFATFTYNTSVHAATNFTPFELVYGRIARFPLRIPSDEKLKTYNVYMRD FT LVLRLEEMKILAGETQIANKIKTKDRYDEKVRAFKGRVGGYARLINEPRVS FT KFDAYRNKPLRIIEFLGRKNVLLEYPNGKRIRKHIDKLKAVEDKSDNEDSD FT SFNPCYRMKILEIIIIIFNLLNVRRCEPMYQIEPLNPNPGIYFEKLDVIRI FT KKATWKLNIYIDVEDFMQTHNATDSYKEIFEICKKVMEERKCRHALAIDLL FT QIKEQELAKTQEKIKETIASLGHTQYPTARIPRSIRTKRLVPLGIIGSISS FT SLFGLVTNDEVDNINKNIDQLFQDQSKMVHLLDENSHIISAKFEELYNITS FT NHQKVLQGFEKELTKTIKTILREKDQMKYQVEVVIYVKRLESTLDHVIKSN FT EKLLEILRKLKEGKVHPDLMKQDMVQQMNMDVKRVSQDLEFPPPPEHMRAE FT ELARISEIDAIHQNGRTLAVLHLPLVDRMPYQLYKMHPINIPQNMKNETMG FT QAFIRPSHEYIAISYDHTRYIKFNEDQRQMCIKTHYADICPILGALRNVPE FT SRDCEITLLLNPSQKAITQCDIRYRLSEQTQWTYLNYDKSWLYSTIRPEIL FT NIICQDKHENKVTIKDAGIVHIAPICIGTTAEATITGEVTRNTEFTYVYKP FT EINLKITDIYPLLNQEDTSLEVHSSENIDVLGPNMANNDGRPLHEIVSKLR FT EIGEHKRQNYSTNTVLYGSMTVQLIIVTIIIIIIIKMKCLRKWRCPKNKKR FT RNPSPRINTKRRKIETKDEEIELEDILGSHKNHAILRLDRENPKTNAQSTT FT KKFALPKQEAECSIFEFPGRLYIAKQCLTNQKFFFSNMIDVIYLQTKEYTM FT DFRPRTNISMFCDVDIPLSNIIREFERVGEPQNARVEKHRYERETKKITLS FT FNSEREIAKAYELRSLLAKQNKLKRNSKRAAPYQTPMITTGPKTAIGDLID FT LTPKEEPTVRSEVHVVNQPSTPQPIRNRLFCLLPWDGNDREKDEMFFRHFA FT QFGTLAYYETVRDKKGRLSYGYVQYFTQEDTKAAKEESDPMYKATFAEPRR FT LPRAAQGTANTKFHERCKQMIEVEIYNLHEITCKKQIAWTQDNVSEDPLSK FT LQPLQETIMELKEKIAKLENQKENNRHPPTPRTNRVSHERPKTPEATLPDL FT DENDSSMSIENNQIHITYNYHTTLFELFSDMDQVLKNIVTSIVFDEVLDHV FT TICFDPAGKLPMVMKLLNSYGPNNTVTFEIIPRGPHHDSARLTIKYRTRTS FT ATRLRSDYKTSRTEDRMTVEQRVEAPTDSDSSDTDDEHSSSDSSTTDGTDG FT TDNTSDTPESDANDENDEVILNSDEDVPQQDNDNNEDNMNSDSDNATGNDN FT DHIPDNNDDNYGSNDIDNADNNDINSDDNVNDNGYPLQYNPQNNVELLKMN FT LTDAYTITAIYRGDTIDAEALFRQYGPCHTEHYMSTTDDTKAISTRYQLDR FT HARKIINLYRQYIQRTDEDVQGLRRQEREQGPPTTEYVRSDDTICVRVPRT FT VSKQELTEIFATFGEIADIRRIKPNDEDERYTAFIQYRHKDSAYKAVTETD FT NTYLPKWAYNRRQRTTKPATTQENAQERQRTEGPQEPAQRQRRWTLDPANN FT TKLYVLAPPDTTQQEFRADFIKYGEAIQIELTNGKKENNEKVGYVIYKDKI FT DAIDAKLNGPRKYKIDWKLTPQTTETLLHFTCGDQIPIQSFTDHTRDCTTT FT TLRRRLQDKKKRTTEYQQTSRRETEC*" XX SQ Sequence 16651 BP; 6590 A; 3298 C; 3215 G; 3547 T; 1 other; aaactggggg ttcgtccggg atctgcttag cttttgttgc ttgcaaaact gtttggtttt 60 ctgtgtttaa aataattttt tttggtgagt gagtgaacat ataaaaaaaa agatcccaat 120 taagtgcgcg taaatacgaa aaatattaaa aagtcataag tgcgacaaag agagaaacga 180 gagaatataa aaagcaagag cattgatagg agtttgaaat tcgagacgcg cgcgcaagca 240 cacaaagacg ccgtgacgtc actgcagcca catatcgggt gggacgcgca gtaacgtcag 300 tgattttaaa ataatttttt ttttattatt tcttgtgact cttacaatta ataagttcgg 360 acgttttact agttaaaaat cattgagacc cagtggatat taggagaggt acccaaataa 420 gacaaaacgc aataatatta cacaatatct tttcttacac cagcaatttt ctttacacac 480 tcaatgcaca agcagtttga tgagtcctgc cacacaacat atacacataa aagttcatta 540 caagtacttt gatattacac acaaagagta gaatcaagat tttgaaaatt ggcatcgcat 600 tcgagtcaca caccaacaat ctacaatata cctccgtagt ttgatatcac aaattattgt 660 tcactaacca gtacacagag agacgcataa aacagtaact tccaaaacat ttacacatag 720 aatcacacca attttcattc aacaattttt ttttttgcat tacattttga ttgccgacac 780 ttgactccaa aaacaggcac agtcctacat tttacagtct aattcagaac taacaaaaaa 840 aaaaaacaaa acacaaattc taattgacac cattcgcctt gacactctgt atttaactaa 900 aattacactt gtaacactgc actatttctc ttgaatacag aacacacgta ggggtgggcc 960 cgttaggaca ctgctcgtgc tccaaaataa ccgaaacacc cagaatctgg tcgacgagta 1020 gcactagagt gcactggcaa gtgatcgcca gaagtcgagc cagggccttg acgaggctgc 1080 tgcgtagagt cttggactcc ggctttcgca ggagaaagtc tgggctcttg cactcgaacc 1140 agctaacttg ggtgttgaca cgcctattga gaacctgtaa ataccaaaaa taattaattt 1200 catgcgtatt tgtaccccag ataataatta ataccatatt ttgaccactg agcgaggaaa 1260 cgctaaaatt tttgtatatg aaacccattc ttacacacat ttatttgcac aactttgcgt 1320 aaggcatatg gattaacaag gacccattgc tagacatagt ttaggggaaa ggcaagaaag 1380 tctaatcaaa aatatatcga gatagcttga tcagaaccag gcgaggacac atatatctaa 1440 cctagtttgg cgtcaaagta tttctgacca aaaccataaa tattaatcat tttgaacaaa 1500 acaaaataaa atcccagggt ttaggttaaa acacatctat tttgcacaaa aagacccaat 1560 aaatattaat aataattgtc aaaaagcgac aattgaccct cagacgcgac aaatttataa 1620 caaagtcagc atatagcaag ctgattatca cctataattt acaggaaaag tattaacaag 1680 caacagaggg agtgtgtgat taataatatt tgcaaaaatt acataattat tgttaatggt 1740 accaaaaact ttaagtatag aagcacggct ctaaataaaa tacggtttta gaacaaggtt 1800 tggactatta aaaataaatt taacattcga ttattcaaca caagcaacat tagaaacacg 1860 aacaataatt tattaatagc taagtgatag aagcacggtt cagaatacat gaaagacgct 1920 tgtcgcacaa atttctactt ttaaattctc caagtctatt gaataaaaac cccaagtaca 1980 cgactgagtt attaatcaca cacgccacaa ttattgttaa ttaaaaaaaa aaaatagaga 2040 gagagcaagt ggtaaaaata tctgcttaca caaaacacac ccatgacgtc atgcagactc 2100 agcactgtcc tccttgatct tgaggatgtt tgtaaaaaaa ttcaagctgc tgctgactgc 2160 cacgaaagct gaaaagaaag acaaaacaag tttacacaaa gcaacagttc tcgggttgtt 2220 tgcaagaaat ttaatattta tttaataaaa aaaaaaaact tatctgagtc agctgataga 2280 gttgcagctg cgacattggg aacaatcctg ggcccgaact ttgacctttg caggaacacc 2340 gtgatagcac gcacacgcac ttttgattga acgatacgag acgacacgca cggttcacct 2400 tttgcaaaac aattagcaat aatttgaagg cgaaaagtac acgcgacgca acacgaggca 2460 acagctgacg caaacggact cgactaggga gtttgactgt agcgagaagt taggcgcgga 2520 gcggtgccac acccaagcta gtgactatag actcgagagc gggcggacgc cattttgaga 2580 gcggcggagc gcgcgcgaat atttgaattc gaatattgga cttcgagact tttcaagaga 2640 attattgtaa agtaaattca gtaattaaac aattaaacta ttgctactgt atttagaccg 2700 gattaacgaa aacaagggcg tgtccattgg aattaaatta tttccttacg actttgaaat 2760 attacgtgac agtttgagac cattcagaaa caatagtttt tttttgcata atagggcttt 2820 tacataaata taaataatct ttacggttaa gactattatt caaacattat atgttaataa 2880 ttataggcga attaatttcg aaaacactat ttagaggaaa ttcatttaca taatcgataa 2940 caacaaatag tcatctatta tttttaaata aataaattaa ttttgaatta tacaaaataa 3000 ttagcataat taaattttct tttcgggaat ttttgcgaag aaaataaatc acgtaaatat 3060 tttaattgaa tttacaaaac cggaccatca ctcttaattt aaatttgact taaattttga 3120 cagaggacac gacacaaatt ttctttttcc cattaaaaaa aatgcacgat ttgacgactc 3180 tcgttggata agcctttagt ttagacatat acccgtgaac acttaatatt aatttagtat 3240 taatatttta ttacaatatt ttattgacta ctggttttat tattaatcat tatcaccatt 3300 tattattatt tatcgaaccc cacaataatt aaacgaaaat taactatcaa aaatcccaca 3360 atttcgttta cgttacccct taatatactt caatatatta tctataccca catcataaaa 3420 ttttaattac aataataata ataacatatg ataaatataa tattgggatt gtgaggagag 3480 gaaataaggc tttaataaaa tagtggcacg gaattacgga ctctctataa agcaagaacg 3540 ttttagaagc atttagagaa ctatctagag gattatttag aattttctac gacaattaag 3600 aggcagtaaa tctattcaaa atgacagacg agaccggcaa taggacgggt gaaaacacag 3660 gagcagactt ggctaccacc atggccaata gcttcatgct tatgagttgc gtaagcggca 3720 taccaagctt tgacgggaaa cataccaagt taagagacta catacaagat ttgaagaatg 3780 caaatgaact tgtaactgag gcaatacgac cacaatttat gcgaaatgtg ctgcagagaa 3840 taaccggagc agcgaagaag agtctcaaca acaaaaccat aacaacagta gaagacttaa 3900 ttaaacattt aaagcagaga tttggtccag gcagggactt tagttattat aacacacgta 3960 tccagacggt caagatgagg caaggagaca ctgtaggaga tttttatgac gaattgaacg 4020 ttttactgag cagtgccagg aatgcattga aagaagaaaa gggaagtgac aaccttgacg 4080 cgatgatgga accattggag atgctggcag tcgatatata cataagggga ttaccagcta 4140 atttatcgga aagagtggac ttattcaaac caaagaatct acgagaagca tacgaagagg 4200 ctgtgcgact agaaacacgc atggaagcca ggatcatacc ggactccagg ccacgagtga 4260 atcgaagtta ttataataca gaaagaaggg atagttataa taatggatac cgcaatgaag 4320 gtggcggata taataataga ggcggatata acaatgaaga atacagaaat caagatagat 4380 atgacaacag aaatcgcaga catgacgatt ttataggata tatgcctgaa gaagaagaat 4440 gtgcaggata tgtcgaggag gaagaagatt atgcaagata cgacaacgaa gatgaagact 4500 acgtaggcta cgtcaatgag cctgaacaac gccaacgaga attccaagat aaccatagga 4560 ggaacaatta taaccggaat aactacaacc aaaatggtta tcaaggttat cagagaggaa 4620 atggtagtgg caacaattac aatagaggag gataccagcg ggggtataac aacagcaaca 4680 gaccaggagg gagcaattgg ccgtcgtgga atagaaacta tggtcaaaga agccaaaggc 4740 aaatttttaa tcacgaaaat ggacgtcgtg aaaattatca acctagtggt ccggatagac 4800 gagagaatta caataattat cccaataact ggagaaacgg caatgcaaga aactatcaac 4860 aaaacgaaaa cgaacgagag aatttaaact ctcaaagggc tcgtcatgga gaaagaatga 4920 cgagccagga acagaaccaa caacctatcc agaaaccaca gcagaatgtg gactaccaga 4980 aatatcaaca acaaacacag ggttacaatc aaggcaggca gaacccaaat cagagcagcc 5040 agaacaacca ggtgcagcaa aaggcaccag tgatggcgat aatgacaagg gaacaggaac 5100 ttccaatata tggacagccg actgcaaggg acctgaagag tttaattggg aaaatgatga 5160 accaacagca ataaaagtac acataccaca aggaaccacg aaaccttggt taacattcat 5220 gatagacaat ggagcaagtg taaatctaat caaattaagt tttatacacg atgacatgcc 5280 aataaatatg aaagatgtga ggaatcttgg aggaataacc acgggaacag taccgactct 5340 aggagcagtt tatttgctta tcagaaatac accagtgaaa ttccacatag tgacagacga 5400 ttttccgaca ccacacgatg gactattagg aagaaattat ttaaagaagg aagaggctgt 5460 catttcttat tttaacaacg ctctaatggt aggcggagac gtgatgcacc caatgccatt 5520 tctgggacaa gaaaaagaat acagagagaa gagaaaaagg gcacataatt cttacaaaac 5580 cagcaacgtg ctacaagtcg aagacaagaa agacgataaa gaagcacctg taacagatgt 5640 atccaaaaat gagaagacac cagcaaaaac gaaacatgtt ttaaaggcca gaactcggca 5700 agttgtaaga atcaatctaa tcagaagtga attaaaagaa ggatacatac ccagaatcga 5760 tgtagggaat gaaaatgtat ttttgggaga aggagtagta attaacgaca acaacacctg 5820 caaaatgatg gcgattaata cgagtgaaga agacgtcatg atagaggtcg atgccaaaga 5880 attaatacca tttgacacgg agccaaattt tctagaagaa acagacagcg aatttaatgg 5940 tgaagttata gtggacagaa ttaagaggct agagaaggtg aaagaatcaa ttagaagaag 6000 tcacttgaat caagaagaac taaaaatagt ggaccgtata atagaagatt atttagatag 6060 atttttatta ccaggagaca agttaccgtg tacagatatg atccagcacc acatacattt 6120 agaggacgac atacctataa acacgaaaca atataggcat cctccgaaac ataagcaagt 6180 cgtcagagag agtgtggaga agaaattgcg agataaaatt attagagaat cgaattcacc 6240 atgcaactcg ccaatatgga tagtaccgaa gaagcctgac agccatggaa acccaagatg 6300 gaggatggta atcgactttc gcgaaatcaa taaaaagaca atcagagacg cctaccctct 6360 gcctaacatt gccgacatca tggaccaatt aggtggagca acatactttt cgattttcga 6420 ccttgcaagt ggattccagc agataccaat ggcaccagaa gactgttaca agacagcatt 6480 cacaaccctc aacggacatt atgagtatac acgtttacca gaaggtctga agaacgcaac 6540 agcgacattt caacgcttaa tggaaaaggc acttagaggt ttgcaaaaca tagagatgtt 6600 agtgtattta gatgatatta tagtctacag caaggacttg caagaacacg aacagcgaat 6660 taggcatatg atggatagac taaggctcgc caaactggta ctgcagccag ataaaattga 6720 attttttaga cgagaagtag gattcctagg acacataata agttccagag gaattgaacc 6780 aaatccggaa aaggtggaag caataacaaa gttacccaca ccaaagacag caaagaatgt 6840 aagaacacta ctaggaatgt ttggatacta caggaagtac attaaggact ttgcgaagat 6900 agctaagcca ctaaatgacc tgctcaagaa aaacgtaaaa tttgaatgga cagaagattg 6960 cgagaaaagc taccagatac tgaaagattg ccttgtaaga gaacccatat tacaattccc 7020 agacttcaac aaggagttca cactgaccac tgatgcgtca gattacgcaa taggagcagt 7080 attaagccaa gaaaaggacg gcttcgatca cccagtacaa tatttgtcta gagcgctcaa 7140 caaagccgag agaaattatt caaccacgga gaaagaatgc ctcgcagtac tttacgcctt 7200 gcaccagttt agaccttatt tattgtgcag aaaatttaca ctagttagtg accacgaacc 7260 actaaactgg atgcacagca gaaaagaccc aggccaaaga ctcatgaggt ggatgtttag 7320 attcaccggg tacgagtaca cttttaagta taaaccagga aagctgaata aaaacgcaga 7380 tgcactatca cggaatccac cagaaatgac tgaagaagag atcaacaaaa atctacctag 7440 tatcaaaata atggtaatag aagaaaaaca aagtaaacaa gaaaaggcgg cagcaagtgt 7500 agcaaaacca gccaagggaa ttgtgcaacc acgcaccagg atacaatcaa caggcgacat 7560 agcaaataaa cctagaggaa gaggaagacc cataggagcc aaaaccaata aagaggcacc 7620 aaagttggag catagcgtga tagctcaaag aactagagca agacgcgcac aagtacagag 7680 tccagtggga cagtacatca aacaaggagc aatacccaga gtacccaaac ccgttgcatc 7740 aagaaaagcc acaaaaccag gaaaaccagt gacagctaaa cccacagctg aaacatcgac 7800 tgaagcgtca acagcagaaa ctagctccat tcagctgaag gaaatcccag aaagttcaac 7860 cgatgcagac tccgatacat cgatgacagt caacccatct agacgatcgt ggttatcttc 7920 gacaaccaac gaagacaccg actcagagaa gccaccgata ccagacccaa ggtatagtgg 7980 actacgacaa gaagacacag agacgacgga agaagaaaca gaagatgacg aagtattttc 8040 tttagaagga aaccaagata aaactataag caacattgca atagaggtca gcgcaataga 8100 aggatccaac acagaagaaa gttccgaaga cgaatcaatt aagaacctat cagtaaagac 8160 gactctaact agggaagagg tagaagaagc aagcagaaaa tttgaagaga gtatgaaaag 8220 atatgaaatg gataaagaca ccaataccac agaatcaggg acagaaattg cgatacagct 8280 acagataccc tcacactttt cagatgatga tgaagtagca gatgaagtac gtgaagcaat 8340 ctacacatcc gacccacggt ttcaaacaga agaagaaata gatatagata atgtatggag 8400 aaaaagcgtg cagaaattaa taaaaagagc acgacaaaag ctacaggaga ccacaagaga 8460 agaaaactca ggaagcgact ccgacgtcga aagcctggct gacataatta tgccgccgcc 8520 aaggaaaaca ttatctgtga ccccgacgat aacaagtggt agccctacag ccaagagcac 8580 accacaaaat aagacgaaag cacgcaggaa attaggagta ttaagtgaaa tagaagcaga 8640 agatggatca ataatagaca taagattttc aaaaccgcct gaaataacaa tgccttttct 8700 cctgcctaca ccaccagacc tagagccact aaacaaagac tacactccaa cagaaaatga 8760 tcatgaatcg gaagatgccc caacacacat aacattcaag aagattcccc agaacataat 8820 caccgtaaga gaatgcatca cgcacaaaag agataacatt gttcactttc tatcggcaga 8880 ctgcgacaac acttggccag taaccagatt attagtagaa atcggagcca tagacttaac 8940 gaaaatcaag agtaaaaagc caaagatagg ccaaatccta gtcacgccat tacaaaaaac 9000 atcacatatt cacagtaata atraaagaca aatgcttcaa cgtaataaaa atagaaaatt 9060 taaacaaggg acttcggaat ctaaaagaaa cactcatcaa taaaaaaatt acaagtttcc 9120 gaatagcgag aaaaggagac atattagacc aattagaatc accaataatt ttagacgtac 9180 tctacaaaca ttttcacgga agcagaataa gagtgacaat gtgttatgga aaagcacaag 9240 taccagctga aaaacacaga aaggaaataa ttagccatct tcacgacagt ctaacaggag 9300 gacacaaggg aatcaatcaa acttaccaga aaataagaga acgatattat tggccaggaa 9360 tgaggaacga cgttcaagac tacattcgaa gatgtgcaga atgtcaagag cagaaaatag 9420 agagattcaa aactagagag ccgatgataa taaccgatac accgatagag gcttttgaca 9480 aagtatcgat cgacacagta ggaaaactta agatgacccc gagaggaaat tgccatctat 9540 tgacaatgca atgcaaccta accaaatatt taatagccat acccatcaaa aatctcaacg 9600 ccacaacaat agcagatgca ttggcaaagt atctcatttg tcaatttggg gctccgagag 9660 ccatactctc cgacagagga accagttttc tgtcaaaaat agtggaatca ttattgaaat 9720 tgttcaaaat aaaccattta acgacgtctg gatatagacc tcagacgaat gggtcactgg 9780 aaagaagcca cgccccactt atagagttca ttagaatata ctccgaaagg tatgacgatt 9840 gggaccattt aacaccattt gcaacattca cttacaacac tagtgtacac gcagcaacta 9900 atttcacccc ttttgaactc gtttacgggc gaatagcgcg tttccctttg agaataccat 9960 ccgacgaaaa actaaaaacc tacaatgttt acatgcgcga cctagtatta aggttagaag 10020 aaatgaaaat tttagcaggc gaaacccaaa tagcgaataa aattaaaaca aaggacaggt 10080 atgacgagaa agtcagagcc tttaagggta gagtaggcgg atacgcgaga ctaataaatg 10140 aaccccgtgt aagcaaattt gatgcgtata ggaataaacc attgagaata atcgaattct 10200 taggtaggaa aaatgtcctt ctagagtacc cgaatggaaa gcgcatacga aaacatatag 10260 ataaattgaa agccgtagag gataagtccg ataacgagga ttcagactct ttcaactaat 10320 cgataaccat gttacaggat gaaaatttta gaaataataa taataatatt taatctactc 10380 aacgtgagga gatgcgaacc catgtaccag atagaacccc taaaccccaa tcccggaata 10440 tacttcgaga agctggacgt gatcaggatc aaaaaggcga cgtggaagct caacatctac 10500 attgacgtcg aggacttcat gcagacgcac aacgcaacgg actcgtacaa agaaatattc 10560 gagatctgca agaaagtaat ggaagaaaga aaatgcagac acgccctagc cattgaccta 10620 ctgcaaataa aagaacaaga attagcaaag acccaagaaa agataaaaga gacaatagca 10680 agtctaggac acacacagta ccccacagca cgaattccaa gaagcatcag gacaaagaga 10740 ctagtaccac taggcataat tggaagtata agtagcagtt tgttcggact cgtgactaac 10800 gacgaagtgg acaacataaa taaaaatata gaccagctct tccaggacca aagcaaaatg 10860 gttcatttac tagacgagaa ttcccacata atctcagcaa aatttgagga attatacaac 10920 ataaccagca accaccagaa agtgttgcaa ggattcgaaa aagaattaac aaagacgatc 10980 aaaacaatac tacgagaaaa agatcaaatg aagtaccaag ttgaagtagt aatatacgtt 11040 aaacgactag aatcgacatt ggaccatgtg attaaaagta atgagaaatt actagaaatc 11100 ctacgaaaat taaaagaagg aaaagtacac ccagacctta tgaaacagga catggttcaa 11160 caaatgaaca tggatgtaaa aagagtgagt caagatttag aatttccacc accaccagag 11220 cacatgagag ccgaagaatt agccagaata tcggaaattg acgccattca ccagaatggg 11280 cgaacattag cagtgctaca tctacctcta gtagatcgca tgccttatca attgtacaag 11340 atgcatccta ttaacatacc acaaaacatg aaaaatgaaa cgatgggaca agccttcata 11400 agaccatcac atgaatacat cgccattagc tacgatcata ccagatacat aaaattcaat 11460 gaggaccaaa gacaaatgtg cattaaaacg cactatgcag acatatgccc aatactagga 11520 gcactcagga atgtcccaga atcaagagac tgtgaaataa cattgctctt gaaccccagt 11580 caaaaggcca ttacacaatg tgatatacga tacagactga gcgaacaaac tcagtggacg 11640 tacctaaatt atgacaaatc atggctctat tcaacaattc gaccagaaat actaaacatc 11700 atttgccaag acaaacatga aaacaaggtc acgataaaag acgcaggaat cgttcacata 11760 gcaccaatat gcataggaac aacggctgag gcgaccatca ccggagaagt taccaggaac 11820 accgagttta cctacgtgta caaaccagaa ataaacttaa aaataacaga tatttacccc 11880 ctactgaatc aagaagacac cagtttagaa gtgcacagct cagaaaacat agatgtgcta 11940 ggaccaaata tggccaacaa tgacggaagg ccattgcatg aaatagtcag caaactaaga 12000 gaaataggcg aacataaacg acaaaactac agtacaaaca cagtactata tggaagcatg 12060 accgtccaat taataatagt aacgataata ataattatca tcataaagat gaagtgtcta 12120 agaaaatggc gctgcccaaa gaataagaag cgcagaaacc catcaccgag aataaacacc 12180 aaaagacgaa agatcgaaac aaaagatgag gaaatcgaac tcgaagacat tctaggcagc 12240 cataaaaacc acgcaatcct gcgactagac cgcgaaaacc ctaaaacgaa cgcgcaatcg 12300 acaacaaaga aatttgcact acccaaacaa gaagcagaat gctcatgagt gaatgaaatt 12360 aaatttttga atttccagga cgcttgtaca tagcaaaaca atgcttaacc aatcaaaaat 12420 tttttttctc taacatgatt gatgtaatct atttgcagac aaaggaatac acaatggact 12480 ttagaccaag aacaaacatt tcaatgttct gcgatgtaga cataccacta tcaaacataa 12540 taagagaatt tgaaagagta ggagaacccc agaacgcccg tgtagaaaaa cacagatatg 12600 aacgagagac caaaaagata acgctatctt tcaactcaga aagagagata gcaaaagcat 12660 atgaactcag atccctatta gccaagcaaa ataaattaaa gaggaacagc aagagagcag 12720 caccatacca aactcccatg ataaccacag gaccaaaaac agcaatagga gatttgatag 12780 acctcactcc aaaagaagaa ccaacagtta gatctgaagt acacgtggtt aatcaaccat 12840 caacacctca accaatcagg aacagactgt tctgcctact gccttgggac ggcaatgata 12900 gagagaaaga cgaaatgttc ttcagacatt tcgcccagtt cgggacacta gcatactacg 12960 aaacagtcag agacaaaaag ggacgtttgt catacggata cgtgcaatat ttcacacaag 13020 aagacaccaa agccgccaag gaagagagtg acccaatgta caaagcaaca tttgccgaac 13080 cgcgaagact accacgtgca gcgcaaggaa ccgcaaatac taaattccat gaaagatgca 13140 aacagatgat agaagtagaa atatacaact tacatgaaat aacatgcaaa aagcaaatag 13200 cttggaccca agacaatgtc agtgaagacc cactgagcaa gttacagcca ctacaggaga 13260 caattatgga attaaaagaa aaaatagcaa aactggaaaa tcaaaaggaa aacaaccgac 13320 atccaccaac acccaggaca aatagagtga gccatgaaag acccaagacc ccagaagcca 13380 ccctccctga cctagatgaa aatgatagca gtatgagcat agaataagac agataaattt 13440 acattttgcg accgtgttat ataaccataa taaacaatag ttaggaccaa tgaccctgta 13500 tgaatgaatg aatgcatgaa cgaaaataca tcgcatgaat taaaataacc aaatacacat 13560 aacatataat tatcatacca ctctctttga attattttca gatatggacc aagttttgaa 13620 aaatatcgtc acctcaatcg tcttcgacga agttttggac catgtcacaa tctgttttga 13680 ccccgcggga aaactcccta tggtaatgaa gctattgaac agttatggac caaacaacac 13740 cgtcacgttt gaaattattc caagaggacc acaccatgac tcagcgagat taactattaa 13800 gtacagaaca cggacaagcg cgacgagact cagaagcgac tataagacat ccagaacaga 13860 ggacaggatg acagtagaac agagagtaga agcaccgact gattcggaca gctcagacac 13920 agacgacgaa catagcagca gcgacagcag taccactgac ggaaccgacg gaacggacaa 13980 cactagcgac actcctgaaa gcgacgctaa cgacgaaaat gacgaagtta ttttaaattc 14040 agacgaagat gtcccacagc aagacaatga caataacgaa gacaacatga acagcgacag 14100 tgacaacgcc acgggcaacg acaatgacca catcccagac aacaatgacg acaattacgg 14160 cagtaatgac atcgacaatg cagacaacaa cgatatcaac agcgacgaca atgttaatga 14220 taacggctac cccttgcagt acaatcccca aaacaacgtg gaattactca agatgaacct 14280 gacggacgct tatacgataa cagctatata cagaggagac acaattgacg cagaggcatt 14340 gttcagacag tacggaccat gccacacaga acactacatg tccacgacag acgacactaa 14400 agccatatcg acacggtacc agttagacag acatgcacgg aaaataataa atctgtacag 14460 acaatatata caacggacag acgaggacgt gcaaggactt cgacggcaag aacgagaaca 14520 gggaccaccg acaacagaat acgttagatc ggacgatact atttgtgtac gagtaccgcg 14580 gacagtaagc aaacaagaac tgacggaaat atttgcaaca tttggagaaa tagctgacat 14640 acgacgcatc aaaccgaacg acgaagacga acgttataca gcatttatac aatacagaca 14700 caaggacagc gcatacaaag cagtaacaga aactgacaat acatacttac cgaaatgggc 14760 ttataatcga agacaacgga cgacaaaacc agcgacgaca caggaaaatg cacaggagcg 14820 acagagaacc gaaggaccac aggaacccgc acagagacaa agacgatgga cgttggaccc 14880 cgccaacaac acaaaactgt atgttctcgc accacctgac acgacacaac aggaattcag 14940 agcggacttc ataaaatacg gagaagcaat acagatagag ttaacgaacg gaaaaaagga 15000 gaataatgaa aaagtaggat acgttattta caaggacaaa atagacgcca tcgacgccaa 15060 actaaacgga ccacgaaaat acaaaattga ctggaagtta acgccacaga ctactgagac 15120 actattacat tttacatgcg gagaccaaat accgatacag agttttacag accacacacg 15180 cgactgcacc acaactacac tacgacgacg gctacaagac aaaaaaaaaa gaacaaccga 15240 gtaccagcag acaagccgac gagagacaga atgctgacac agcagggaca gcaacgcagt 15300 caacaagagc accagtctca gtacacagag ccttatacat cgactacagc tcatcaagtg 15360 atgaagagtt aaatgaagta gaaatagcca gatcaatacg acagcgacaa ctgaccatag 15420 acaatagaca acatcaaaat gagagagcag atgaggcaga agcaccagcg aggcacgagg 15480 acgaagcagc taatgagtta cgacaagaac tggacaaaat tgacttcaag aagatattag 15540 acgttagttt ttaagtgacg acaccgtgac gctagtttaa gttagattta agaaaccaag 15600 agtgccataa aagttttgta gtcaataaaa gatgcagaac ataagtatta ttgaaccaag 15660 ttaccactat tgtcatctaa taaataaaat actgttttgt actacctttc ataaaaaaaa 15720 gagagaacat tttctttcac tgaccaattg acctacatcc aaacaataca ccttaatata 15780 atataaatat taaactatag gaactaatat catatataat ataaacgaaa gcgaatctac 15840 ttactaacac aaatgaaata tagtattagc ataatataca gtagcaggtt aggacatata 15900 tttagattaa tattaggcaa cattaacagt aatagttaaa tttatacagt aaggttaaag 15960 tgtaccagaa catcatcaac gacatgtgtg acagcaaggt gaaaccaaaa caaattacaa 16020 ggtaggacaa aactaaagag aatagattac ccaagataga acgatccatc caaaccaatc 16080 atcccacgca agcaaaaggt gatgcatttg aagaaattcc atgtatcaat gagatacaaa 16140 gaaagcacaa ttgccggttg cgtaccagta gcttccagaa ttcttttctc aataaaagca 16200 gacctgaccg tgagacctct atatgaatgc tctcgacatt caaagagaaa ggatatgttt 16260 tctcttctta atcaagcgcc gcaagggctg attaatatga tcatgtgatc atagaggcaa 16320 gggaccaagg accgcaccaa ttatttaaaa gcataggcac acatcaacag ctacaactta 16380 gaacagctga tagacacctg aaaaagcttg atcctgccca aattccaaaa aaaaaaaaaa 16440 aaaaaccttc caagatacct catacaactc attgcccaaa atgcatctca caacaccgat 16500 attaaatcag taaatagcag gttagaactt taacctcata ggacagttcg ttactcagtt 16560 ttggtttctc attgtgtcac cggtgtctga ccatttaccc ttatatgtgt gtgtttgtcg 16620 aggatgacaa acgtctgcaa gggggggagg a 16651 // ID Kiri-19_AAe repbase; DNA; INV; 2963 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-19_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2963 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 714-714 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 3..239 FT /product="Kiri-19_AAe_1p" FT /translation="INLCHLNAQSLCARQLGKLDEFKSCFVNSKLDIICVT FT ETWLQDSISDSTVGVEGYTILRNDRNYSRGGGVCIYYKNCIN" FT CDS 243..2750 FT /product="Kiri-19_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KQVAASEIVSDSRDFNRTEYLFVEFFVVQIKFLLGVV FT YCPPGVDCSWVLEQKLSELSLNYENIILLGDFNTNLMKNCQRTNRFNDVLT FT NFGLFCVNNEPTHFYNGGSSLIDLLLTNDSNFVLNFNQVAAPGFSQHDIIF FT SSLNITRTPSDRTRMFRDYNRIDYTRLRQSLECIDWSLLYSITEADTALDF FT FNSIIIELYENFVPLRISRPKSNAPWFTNDILNAMVVRDVAYRQWVCSKSD FT ADRNQYKRLRNRVTCLINKAKSDYLSINIESSSSNKETWRRLKKLNVTASN FT NEVEFCNTSDEINSYFSGNFTRESDYHSVSPINVDGFKFTSCSELEVTSAI FT FSISSNAVGLDGVSLRFIKMVLPYVITPITYLFNLFISTSKFPRAWKAAKV FT LPIRKKPKSNRLDNLRPISILSSLSKAFEKILKIQIQGFVQRFELLSPYQS FT GFRAGRSTTTALLKVHDDIHQTIDKKGVAFLLLIDFSKAFDRVPHVKLLNK FT LSTQFYFHRNAVTLIKSYLNLRTQVVDANGELSNPITILSGVPQGSVLGPL FT LFSLFINDLPSVLKFCLIHMFADDVQLYFCTANTNLESVAVLINSDLKRVQ FT EWSEKNLLPINPSKTKVMFISRQQARSPLPNLLINDEIIQYAEKASNLGVI FT FQNNLEWNVQVNSQCGKIYAGLRHLKLTASMVPVSTKLRLFKSLLLPHFSY FT GLELILNSSASDYDRLRVALNHCVRWIFNLNRYSRVTHLQRQLLGCSFYNF FT FKLRCYTTLFKIINHGPKYLADKLQPFRSIRVRHFVLLQHNSSHYGNTFFV FT RAIILWNQLPNEIKVTRSATSFRRECLSWLNGGN" XX SQ Sequence 2963 BP; 923 A; 521 C; 538 G; 981 T; 0 other; ttattaattt gtgtcacttg aatgctcaaa gtttgtgcgc ccgtcagcta ggtaagcttg 60 atgaattcaa atcctgtttt gttaatagta aactggatat catttgcgta actgaaacat 120 ggctgcaaga ttctatttcc gatagtaccg ttggcgtcga gggttacaca attctaagga 180 acgatcgaaa ttatagtcga ggtggaggag tttgtattta ttataaaaac tgtattaatt 240 gaaaacaagt tgctgcttcc gagatagtaa gtgattcaag agattttaat cgtactgaat 300 accttttcgt tgaatttttc gttgttcaaa ttaaatttct tttgggggtt gtttactgtc 360 ctccaggagt tgattgttca tgggtgctgg agcaaaagct ttcagaatta tctttaaatt 420 atgaaaacat aatcttatta ggtgatttca acacgaactt aatgaagaac tgtcagagaa 480 caaatcgttt taatgatgtg ttaactaact tcggactttt ttgcgttaac aacgaaccca 540 ctcactttta taatggtggg agctccttga ttgatttgct gttaacgaac gattccaatt 600 ttgtactaaa cttcaatcag gtagcagctc ctgggttttc ccaacatgac ataattttct 660 catcgcttaa cattactcgt acgccatccg atcgaacaag gatgtttaga gattacaacc 720 gaattgatta tactcgtcta agacaaagct tggaatgtat cgattggtcg ttactctata 780 gtattactga ggccgataca gctctcgatt tcttcaatag tataataatt gaactttacg 840 aaaattttgt tccactgcgc atatctagac caaaatcaaa tgcaccttgg ttcaccaacg 900 atattctaaa tgctatggta gtaagggatg tagcatatcg tcaatgggtc tgcagtaaga 960 gtgatgctga tcgcaatcag tacaaaaggc ttcgaaatag ggtcacttgc ttgattaata 1020 aggctaaatc tgattatctt tcaataaaca tagaatcatc atcttctaac aaggaaacat 1080 ggcgaagatt gaagaagcta aatgttactg cttcaaataa cgaggtggaa ttttgtaata 1140 ctagtgatga aatcaattct tactttagtg gtaattttac acgtgaatca gattaccatt 1200 ctgtatcccc aataaacgtc gatggattca agttcacctc atgtagtgag ttagaagtta 1260 cgtcggcaat cttttctatt tcatcgaatg cagttggatt agatggtgtg tctttgagat 1320 tcataaaaat ggttttacct tatgtcatta cccctatcac gtatcttttc aacttgttta 1380 tttctacatc aaagtttcct cgagcatgga aagctgcaaa ggtcttgcca attcgtaaaa 1440 aaccaaagag taacaggttg gacaatcttc gtccaataag cattctgagc tcgctttcta 1500 aggcttttga aaagatttta aaaattcaaa tacaaggctt tgttcaacga tttgaacttc 1560 ttagtccgta ccaatccgga tttagagcag ggcgaagcac aactaccgca ctactcaagg 1620 ttcatgatga tattcatcaa accattgata agaaaggtgt agctttttta cttcttattg 1680 atttttcaaa agcttttgat cgagtaccac atgttaagct tttgaataag ctgtcgactc 1740 aattttactt tcatcgtaat gctgtaacat taatcaaatc atatttgaat ctacgtacgc 1800 aagtggtaga tgctaacggt gaactctcta atccaatcac tattctttca ggtgtacctc 1860 aggggtccgt cttgggtccc ttgcttttct ctttatttat taacgatctg ccatccgttc 1920 tgaagttctg tctaatacac atgtttgcag atgatgtgca actttatttc tgcacggcta 1980 ataccaattt agaaagcgta gcagtactga taaattctga tcttaagcgg gttcaagaat 2040 ggtctgagaa aaatcttctt ccgataaatc catctaagac gaaagtcatg tttatttcta 2100 ggcaacaggc tcgttcacct ttacctaatt tgttgatcaa cgatgaaatc atacaatatg 2160 ccgaaaaagc ttctaaccta ggagtgatat tccaaaataa tcttgaatgg aatgtccaag 2220 tgaactcaca atgcgggaaa atttacgcag gtcttcgtca cttaaaactt acagcttcta 2280 tggttcccgt ttctactaaa ctacgattgt tcaaatcgct tttgttgcct catttttcgt 2340 atggtttaga attgatttta aattcatcgg cttcagatta tgaccgttta agagttgctt 2400 tgaaccactg tgtacgctgg atatttaatc ttaacagata ttctagagta acgcacttac 2460 aacgacaact tctggggtgt tccttttaca atttcttcaa attacgttgc tacacaacat 2520 tgtttaagat aataaatcat gggccaaaat atcttgctga taagctgcaa cctttcagaa 2580 gcatccgagt acgacacttt gtgttattgc agcataactc gtcgcactac ggtaatactt 2640 tctttgtacg tgcaattata ctatggaatc aacttcctaa tgaaattaaa gttacgcgtt 2700 cagcaacaag tttccgtcgt gaatgtttaa gctggcttaa tgggggaaat tagagtgaac 2760 gatgcagcaa gtcaacgaaa aagagaaaag ttaatgtttg tattgtttga atttattgaa 2820 tttgtaaaga atggaagtgg taaatttgga gaacaaaaat aataccgatt gtagtaaatt 2880 tataagggaa atacccttac gctgcaagta ttttcattaa taaataaaat aaaataaata 2940 aataaataaa atcaaaaata aaa 2963 // ID DNA-2-3_HM repbase; DNA; INV; 3843 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE DNA transposon from Hydra magnipapillata - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3843 RA Bao W. and Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 375-375 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 3843 BP; 1267 A; 449 C; 544 G; 1578 T; 5 other; cccttacaaa aaaacccagc gaacttccag cgaaagttcg ctggaaatta aaagatcact 60 ggaaatttac taagatcact gggttttcgc tggatttcca tggaaacaaa agacttcgct 120 ggaagttcgc tggaaatcca acgaaaaact tccagtgatc ttaaagttcg ctggaagttt 180 actgggtttc ttgcgaaata tttcgcggtg atgtaaaaag cgttgctgtt acaacttctg 240 caagcagtta aaacttggaa aattgataat atagcttctt aaaataatat ttcaaatttt 300 tgtataaaag taatcctttt tattttattt tttgcatcta atagatgatc tatagtgtga 360 gtttcaatta ttattagttc tactcaagtt acaaaacaaa atcatctggt atgttgctca 420 aaatgtttat ttctcttaag tattgtttat tatgcattcg ctttatgcga atgcatttta 480 tttatyaaat tcaactttat tattttataa aaataatttt tataaatgtc tttatgatta 540 agtattttat attaaaaata ataaataaca atctgtttct ctctctcgct gaaagagagg 600 gatatataat atatatccct atatttgtga agttgcttta agtactaata agcaatgtac 660 aaaatgtttg aaatatttta cattatccta tctttataag aaatgcagtt attttcatat 720 tttaaacata aacaaacagc ataaaaacat gtgtgataac aattggatac atttagcaag 780 tcttgaagaa aatgtgcaag caacaattag acttgttgtt aatactgatg gaattcctgt 840 ttttaaatta tcaaagtgga ccttgtggcc tttgctagca tccattcata attttccata 900 ttcattgcaa attactaata ttttgttttt tggtttctgg tccagtttta aaaaaccaaa 960 cgtgcaatta tttttatctt gttttgttga tttttttact aacctatcca ctactggtat 1020 actcttatct aataatgttc gtgttttttt ttcatgtatc tatttttttg tgtgatacaa 1080 ttgcaggagc tatggttttt tgttgtaggt agtttaatga ttataatggt tgccattggt 1140 gttcttctca aggggtaaga gttaaaaaaa ggtaggaggt cttgtatgac ttatccattt 1200 tctgatgtta atgggagttg tagaaaccat acagattttt tttggattaa tgtgattaaa 1260 gggtttttag atttttctac atttgtaaga gtacctaatt ttaattgtga aaatggaaca 1320 gttgttaata atctgcactg tattgagctt ggtgttatga aagctatgtt ttctttctga 1380 tttgatagtt attattctgg ttatattttt tctattagaa gacactgtac tgttgtaaat 1440 aatcttttat ctgcaatcag tgtgccaagc aatataagac agactcgatg atcattaagt 1500 tgtataaagc attggaaagg tattgagttt tttaattttt tagtttttta tggtcctgtt 1560 attttatgtg atgttttgcc tcctcagttt tttgaccatt ttactctttt atctaatatt 1620 gtttttacgt tgaacaataa ggtattatct gaaagtgtaa ttaatgaagc cacattatta 1680 gttaatagat ttttttatga gttttctgat ttgtatggta aagaaaatat gttttatatt 1740 aaagagttat tttaaagcag ttattatata tatacagagc tttttttaaa gcaggtatta 1800 tattttaatg cagtttttat ttaagtataa ctaaaatctt tctctaaatt ttatacagta 1860 tttattatat gtttttataa aattgttatt gatgtccttt aaatattgta taaaccatct 1920 tttttttttt tagctttaaa gacccaaacg aaaatacttt taaagtcaat tttcgcaatt 1980 ctgtttgttt tgttatgcta atttgtcagt agagaaccaa cagaccactg acttcataat 2040 ttttccrtct aaaattatac taaaatatcc taataaatac aatttgaatt agatgatata 2100 agttatcaga gaaagtaaac aaagtacact tattcggaat aaaaaatgta actactgcca 2160 tttggttctc gacttttttg aaatgggcta aaaaacatgc ttaagcatca tattgcctat 2220 aagatatact tactgcaata tttttctttt tctctaatat acaggtttaa gttttgcttt 2280 ataatgacta ttatgttttt tacagattta aatatctata tcagtacaga aaagtagagc 2340 tgtatgaatt tttgttttgt tgatttaaac agttgaaaat ttaaaaaaaa attagtttaa 2400 aaattgtaaa gttccttaca aaaaaaaccc agaaaacttc cagtgaaagt ttgctgggtt 2460 tttaaattaa aattatttag ttattatata gtttttatta ttttatctat aattattatt 2520 tttatgcagg ataaaagtga taaaatgcta ttggttggaa attgataaat taataaaaaa 2580 tgattgattc taattgcgcg aaaatatgtt ggtatgtttt tatctatctt atactgtttt 2640 atttttgctt atcatttcac attttgtctt attttgaata ttatgtttta tataawtata 2700 agttagtgtt tgtaaaacac taactaattt tgattgcttt tagattttgg atactgcctc 2760 taccattaac aaattaaaaa attatattaa taaataatat aaattatcta actaactaat 2820 tattttaata tttttattta aaataatatt aattaatata attatctact attatatatt 2880 agtagaaagt cacttaatag aaaatttttt ataagttttc gatgtaacag tgtaaaatga 2940 aaaaaaattt ttgttaagta attttctact tatttataat tgctctgttc ttttaagaac 3000 attgagcact ctattttgta gaatacattt tgaaattgtt taatattctt tatttattgt 3060 atgcattctt tttttttttt tctttttttt tttagctcta aagacccaaa ccgaaaagaa 3120 tgcctaaagt ggttttcaaa agatgaattg acgaatttct taattctttt tgtttcgttg 3180 tgataatttg tcgtatgaga acgctaacga gaaaaaactt tttaaatata gaaacatgcg 3240 taatataaag taaagatcaa caatacgttt acttatgcct ataaaaaaat atgttttaca 3300 ttaattaata aattaacttt ttagtttctt gttaaaaatt gtattctgta ttraagttta 3360 tttctgagaa tttgtaatta tacctaagat aatctaatcc tacattgtat tttaatttat 3420 aaaactcgat attttaaaat aaatatatta cttttttatc gcgcttaact tctcttattt 3480 cgcgtttatt ggttaaaaaa ttttaaaatg tttataaact tttgaaactt ttatagcaaa 3540 aataaaaaat cytttttaaa aaaacttcca gcgaatctat ttttatacat aaacatgaaa 3600 attcgctaga aaagttcact ggaagttcgc tgggttttgg aacaagttcg ctggaaattt 3660 ccagcgaact tccagtgatc tttaaagatc actggaaaag ttcgctggaa gttcgctggg 3720 ttttggaaca agttcgctgg aaatttccag tgaactttag agatcactgg aaaagatcac 3780 tggaagttcg ctgggttttg gaacattttt tcactggaag ttcgctgggt ttttttgtaa 3840 ggg 3843 // ID Gypsy-108_AA-I repbase; DNA; INV; 4619 BP. XX AC AAGE02028575; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-108_AA_; KW Gypsy-108_AA-LTR; Gypsy-108_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4619 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028575; Positions 24987 20369. XX CC Positions [3452-3946] - Integrase core CC 'TATGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 194..4591 FT /product="Gypsy-108_AA-I_1p" FT /translation="MDPEQFRLFMEQQSNMITQLITTVTQQRPQQLPQVQQ FT AQPPVSSVPVPPPSSLSVEGDMAENFDFFERSWKNYESASGMDKWPVDNNE FT RKVSVLLSVIGETGRKKYFNFELTQQQLADPDQALAAIKDKVVVKRNIIID FT RLDFFTATQFSNELVDDFKTRLRIMAKMAKLDTLENELITYKLVTANKWPH FT LRTKMLAITDITLDKAVDLCRAEEIAARRSQELGVVVSSSEVNKISKGQPK FT QKSPRCKFCGDYHEFAKGVCPALGKRCNRCRKKNHFEKVCKLKSRKSRRVK FT QVKEPDDTSDTSSRESESESEESDEEFEIGKIFDNSNTGGSVLAELNLKFG FT GTWKSVLCELDTGANTSLIGHECLMKLSGLPDFPLKPSKIRLQSFGGNPIK FT VLGQVKVPCRRLDRKYGLVLQVVDVDHRPILSANASKALGFVKFCNTVTFE FT GPNFPSASSKLPQEVLLNVYRVKAQQMVDRHHGLFEGYGRLPGEVTLEIDQ FT SITPSIQPPRRVPIAKRSQLKTELENLEKDGIIVKETSHTEWVSNIVIVQR FT GGPGSGVRICLDPIFLNKALKRPNLQFVTLDEILPELGKAKVFTTVDAKKG FT FWHVVLDEPSSKLTTFWTPFGRYRWIRLPFGVAPAPEIFQLKLQEVIQDLE FT NVECIADDILVYGVGNTMEEALQNHNECLEKLLSRLDQHNVKLNRSKLKLC FT EKSVKFYGHILTDQGLKADESKIAAIREFPQPINRKEVHRFVGMVTYLSRF FT IRNLTSSLTNLRNLISESVPWRWTAIEEKEFNQVKKMVADVKSLRYYDMNK FT PLIIECDASSIGLGAAVFQNDGVIGYASRTLTAAEKNYAQIEKELLAILFA FT CVRFDQLVVGNPKTTIRTDHKPLLNVFQKPLLSAPRRLQHMLLNLQRYNLS FT LEYVTGKENVIADALSRAPLQDVQPNDEYRKRNIFKVFDEVQGVKLSNLIS FT VSSSRLDEIMRETEKDKSIQQLINYIQQGWPETIDRVPDNVKIYFCYRNEL FT STQDGLVFRNDRILIPYALRRKLVDCCHASHNGISATLRLARANIFWPGMS FT SQIMEIVKNCPVCAKFAPSQQNPPMQSHPIPIHPFQIVSMDVFFCEMQGAK FT RKFLITVDHYSDFFEVSLLKDLSPESVVAVCKQNFARFGVPQRVITDNGTN FT FANQKMASFASEWDFELATSAPHHQQANGKAESAVKIAKHLMKKADESEAD FT FWYMLLHWRNIPNNIGSSPAARLLSRSTRCGVPTAATNLLPKVEEDVPAKI FT EENRKRAKLHYDKKSRNLPELHVGSPVYVQLNPESNKHWTQGTVSNKFNDR FT SYLVSANGTEYRRSLVHLKPRKESATQPDREVPSHPDRPVMCEVIPVSSLF FT PGSNEKSATTVLDQISNDNTIGQPPTSTEIATTASAFACASPKQSRTSGER FT TATTPMTTTTAASPVPTGSKSRGVQPITPSKNNRPKREIRIPEKLKDYQIK FT F" XX SQ Sequence 4619 BP; 1377 A; 1039 C; 1148 G; 1055 T; 0 other; tggtgtcaga agcaaacgtg agcttctagt gaattccggc gtcattaagt gaaatcgcgg 60 aaaactacga actttttcga aagtagtgtt tgaaaggcgc gcgtgaacat ttcgagtcgc 120 gttggtggcc attttgtatc gaaagatttt gtttgtttac aatcgagtgt gttgtgcggg 180 attcctcatc gatatggatc cggaacaatt tcggctgttc atggaacagc aatcaaatat 240 gatcacccag cttatcacca ctgtgacgca gcagcgtcca cagcaactgc cgcaagtcca 300 gcaggcacag ccaccggttt ccagcgtacc ggtgcctccg ccgtcgtcgt tatcggtaga 360 aggagatatg gccgaaaatt ttgatttttt tgagcgaagt tggaagaact atgaaagtgc 420 tagtggaatg gacaagtggc cagtggacaa taatgaaagg aaagtgagtg tactgctatc 480 ggttatcgga gagacgggaa gaaaaaagta tttcaacttc gagttgacgc agcagcagtt 540 agccgatccg gaccaagcgc tagcagctat caaggacaaa gtggtagtaa agaggaacat 600 catcatcgat cgtctcgatt ttttcacggc cacgcaattt tcgaacgagt tggttgatga 660 ctttaaaact cgactacgaa taatggccaa gatggcgaaa ctcgatactc tggagaacga 720 gctgatcacg tacaagcttg tgacagctaa caagtggcct cacctaagaa cgaagatgct 780 cgctatcacg gatatcacac tggataaggc tgttgacctt tgtcgcgctg aggaaatagc 840 tgccagaagg tcgcaggagt tgggagttgt ggtttcgagc tcggaagtga acaaaatttc 900 aaaaggccaa ccaaagcaga aatcgccgcg ttgcaagttt tgcggtgatt accacgaatt 960 cgccaaagga gtgtgcccag cattggggaa acgctgtaac cgctgcagga agaagaatca 1020 tttcgagaaa gtgtgcaaac taaaaagccg aaaatccagg agggtcaagc aggtgaagga 1080 accggatgat acatcggata cgtcgtcacg ggaaagtgaa tcggaatcag aagagtcgga 1140 cgaagaattc gagattggga agatattcga taactctaat actggaggaa gcgtcctggc 1200 ggagctgaac ctgaaattcg gtggcacatg gaaatcagta ctatgcgaac tggatacagg 1260 cgcaaacacc agcctgattg gtcacgagtg tctgatgaag ctatctggat taccagattt 1320 tccactgaaa ccgtccaaaa tccgattgca aagttttggt ggcaacccga taaaagtcct 1380 tggtcaagtt aaggtaccat gtcgccgatt ggacagaaag tacggtctcg ttttgcaagt 1440 tgtggatgtc gatcaccgac cgatactatc cgcaaatgcc tctaaggcac ttggattcgt 1500 gaaattttgc aacaccgtta cctttgaagg accgaatttt cctagcgcct catctaaact 1560 accacaggaa gttcttctca acgtgtatcg agtgaaggct cagcagatgg tggacagaca 1620 ccacggtttg ttcgaaggct acggtaggct tcctggagaa gtaacgttgg agatagacca 1680 aagcatcact ccatcgattc aacccccgcg gcgtgtaccc attgcgaaga gaagtcagct 1740 gaaaactgag ttggagaacc tcgagaaaga cggcatcatc gtaaaagaaa cgtcgcacac 1800 tgaatgggtg agtaacatcg tgattgtaca acgaggagga ccgggttcag gcgttcgtat 1860 atgcttagat ccaattttcc tgaacaaggc cttgaagcga ccaaatttac agtttgtgac 1920 tttggacgaa attttgccgg aactcgggaa ggcaaaagtt ttcaccacgg ttgatgcaaa 1980 gaagggtttc tggcacgtgg ttctggacga gcctagtagc aagttaacca ctttctggac 2040 gcctttcggt cgttatcggt ggatacgtct accattcgga gtggcaccag caccagaaat 2100 attccagctt aagttgcagg aggtgatcca ggatctcgag aacgtcgaat gtattgctga 2160 tgacattctt gtgtacggag ttggtaacac tatggaggaa gcgctacaga atcacaacga 2220 gtgtttggag aaacttctgt cccgactcga tcaacacaac gtgaaactca accgatcgaa 2280 gctgaaactt tgcgaaaagt cagtgaaatt ctatggccac atactcaccg atcaaggatt 2340 gaaagctgat gaatccaaaa tagcagcaat ccgagaattc ccacagccaa tcaaccgaaa 2400 ggaagtacat aggtttgtag gaatggttac atacctaagt cgcttcatac gaaacctgac 2460 tagcagctta acaaacctac ggaacctgat ttccgagtcg gtaccatgga gatggacagc 2520 gatcgaagag aaggagttca accaggtgaa gaaaatggta gctgatgtga agtccctgcg 2580 ttattacgac atgaataagc cgctgattat cgaatgtgac gccagtagca tcgggcttgg 2640 agcagcagtg ttccaaaacg acggagtcat agggtacgca tcgaggacct tgacagctgc 2700 ggagaagaat tacgcgcaga tagagaaaga gcttctcgca attctcttcg cctgcgtaag 2760 attcgatcaa ctagtggtcg gcaatccgaa aactacaatc cgtactgatc acaagcctct 2820 cctaaacgtg ttccaaaagc cgctcttgtc ggcacctcgt aggttgcaac acatgctgct 2880 caatctacaa cggtacaact tatctcttga gtatgttacc ggaaaagaaa atgtcatcgc 2940 cgacgcgtta tcgcgtgcac cgttgcagga tgttcagccc aacgacgagt acagaaaacg 3000 aaatatcttc aaggtgtttg acgaagtcca aggagtaaag ctgagcaacc tcatcagtgt 3060 gtcaagttca agactggatg aaatcatgcg ggaaacggaa aaggacaaat cgattcaaca 3120 gctcatcaac tacatacagc aaggatggcc tgaaacaatc gaccgagtac cagacaacgt 3180 gaagatatat ttctgctatc gaaacgaatt gtctacccag gatggattgg tgtttcgcaa 3240 cgaccgcatc ttgattccgt atgctctgcg aaggaaacta gttgattgct gtcacgctag 3300 tcacaacggg atttccgcta cccttaggct agctcgggca aacatcttct ggccggggat 3360 gtccagtcaa ataatggaga tcgtaaagaa ctgccccgtt tgtgcaaagt ttgctccatc 3420 ccaacagaat cctccgatgc aaagccaccc gataccgatt catccttttc agatagtatc 3480 catggatgtt ttcttctgtg agatgcaggg cgccaagaga aagtttctga taacggtgga 3540 tcactattcg gacttctttg aggtcagtct gctcaaggat ctaagccccg aatctgttgt 3600 cgcggtgtgc aaacagaact ttgcaagatt tggagttcct caacgagtta tcaccgataa 3660 tggaaccaat tttgcaaatc agaaaatggc tagttttgca tccgaatggg atttcgaatt 3720 agctacgtcg gcaccacatc accaacaagc caacggcaag gccgaatcag cagtgaaaat 3780 tgccaagcac ctgatgaaga aggcggatga gagcgaagct gatttctggt acatgctgtt 3840 gcattggcgc aacattccaa ataacatcgg ttccagcccg gcagctcgac ttttgtcccg 3900 ttcgacacgc tgtggtgtgc caactgcggc tacgaatcta ctcccgaagg tagaggaaga 3960 tgttccagct aaaattgaag agaatcgaaa gcgagcgaaa ttgcactacg acaagaaatc 4020 acgcaactta ccagaactgc acgttggttc acccgtatac gtgcaactga atccggaatc 4080 aaataaacat tggacccaag gaacagtgtc aaacaagttc aacgatcgat cgtacttggt 4140 gagcgccaac ggaacggaat atcgtcgttc actggtgcac ctgaagccac gcaaggaatc 4200 cgctacgcag ccagatcgtg aagtgccatc tcatccagat cgtcctgtaa tgtgtgaagt 4260 aattcctgtt tcatcgttgt tcccgggctc aaacgaaaaa tctgctacta ctgttttgga 4320 tcaaattagt aacgataaca caatcggaca accgcctact tcgaccgaga tcgcgacaac 4380 tgcatcagca tttgcatgtg catcaccaaa acaatcgcgt acgtctgggg aaagaacagc 4440 aacgacgccg atgacaacaa caacagcagc atcaccggtg ccgacaggat cgaagtcaag 4500 gggggttcag cccattacac cgtcgaagaa taaccgtcca aagagagaaa ttcgaattcc 4560 ggaaaagctc aaagattatc agataaagtt ctaattagtc ttttaaggaa aagggaaga 4619 // ID Kiri-12_AAe repbase; DNA; INV; 4284 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-12_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4284 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 707-707 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 315..1118 FT /product="Kiri-12_AAe_1p" FT /translation="MSLHRTPPSDKSNENIAKRIRSDGDVHEVPQPLTLEA FT VMQAMNHQFAQTLARIEEISVNISGKIDTVKADLDGKLEAVLHDINSFKAD FT CNAKLKSNEDALCALDERVGAISQDIDNLQNRNELIVSGIPYLKDENLISY FT FNAMWKYLDLQKGSIPSVDVRRLRPSTQNDGLIVLQFALRNDRDDFYSGYL FT HKRDLKLCHLGIDSSRRVYVNENLTVAARKIKATALRLKKAGKLSTVYTKL FT GTVMVRRSVDQPPVAVHSDVLLDQFLS" FT CDS 1291..4143 FT /product="Kiri-12_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MMPSLPNHTSMNIPGAVMNAVLPPGKLNVCHGNAQSL FT CARNSSKLEEVKDLLRNSRVSVARFTESWLSARNSNRSISISGFSAIRNDR FT IYRRGGGIVVYYKNHLCCSPIFSTKLSSESEDKTECLAVELRLGSDKILIV FT VVYNPPDNNCTKFLADKLTDFATRYDNILLIGDLNIDLSKPSSKRAQFEYM FT LENFALFSVGEEPTYFHKDGCSHLDLFITSCYDKVLRFNQVGFPGLSQHDL FT IFGSLDFDANPVPSMNTYRDYVNFEARTLQNAILSVPWDTFYEMHDPNELV FT DFFNSNVKRIHDDCIPLRTSKKHKSSNLWFNNEIRKAMLERDLAYTDWIRA FT PTTTKDQARRRYNALRNKTNSMVVAAKNQYLNRFLDSRVPPKELWKRLKGI FT GVGKDKSSTVCEFHPDDVNRIFLSSCVQCDHNSNPRNSSSNSPYSFSFRQV FT EYWEVVNAVCSIKSNATGMDGLPIKFIKIILPLIVQQVTHMFNCIIETSTF FT PSCWKHAKVLPLRKKPHVNALTNLRPISILCALSKAFEKLLKQQMSSYITV FT NNLLTEHQAGFRKGQSIQTAVVHVYDELAKAVDKRGTGVLLLLDFSKAFDT FT IPHRKLCSKLETQFNFAGTAVELISSYLKERMQTVYCGNQQSERGFLTSGV FT PQGSVLGPLLFCCHINDLPTVLKYCSIQLYADDVQLYICRLGPCTRDLISM FT VNMDLERITEWSQRNQLFVNQSKSKAMLVKNRRRNTVQTELLPLISMAGQT FT IEWTESAANLGFTFQSDLQWEGLVNQQCGKIYACLRSLYSCSSGAPISTRL FT KLFKALILPHFMFGELLHGRSCAYSMDRLRVALNSCVRFVYGLNRYAHVSH FT LQRNLIGCPFENFYAHRSCVFLYKLVKTNTPPILHQKLLPFRGRRLQNLMI FT PPNNSMCYSNSLFVRGVVYYNMLPPAVKCSTSEAVFKRGCLEFWNRM" XX SQ Sequence 4284 BP; 1213 A; 944 C; 933 G; 1194 T; 0 other; gaaaattctg gcaacactgg cgctactatg tgttgtgaat agtggctcag ctaagcaaat 60 tcgcgttagt ggaatgtaat cgtttttttt aatatctcga aggaatccca gtaaagtcct 120 gagtgtcttc cgcgatcgtg tacgtgttat ccagttccgc cgtattgtat ctggaccaat 180 aaaacagttg caaaatcctg agccatatgc cgacattgtt ctgtgcatct acaagcttgc 240 cataaaacga tcagtgtgag gtctctttca tacctacaca ttgcgctcct cgataaacac 300 agtacgaaac aaaaatgtcg ctacatcgta ctccaccaag tgacaagagc aacgaaaaca 360 ttgctaagcg catacgatcg gatggcgatg ttcatgaagt cccccagcct ttgacgttgg 420 aggccgtaat gcaagctatg aaccatcagt ttgcgcagac actagcacgg atcgaggaaa 480 ttagtgtcaa catcagcgga aaaatagaca cagtaaaagc tgatctagat gggaaacttg 540 aagctgttct ccacgacatc aactccttca aggcggactg caatgctaag ctcaaaagca 600 acgaagatgc actgtgtgct ctcgacgaaa gagtgggcgc tatatcgcag gatatcgaca 660 atctccaaaa caggaacgag ctaattgtga gtggtatccc gtatctcaag gacgaaaatc 720 tgatatctta tttcaacgca atgtggaagt acttagatct gcagaaggga tccattccat 780 ctgtcgacgt acggagattg agaccgagca ctcaaaacga tggtctaatt gtattacaat 840 ttgcgctacg aaatgaccgt gacgacttct acagcggcta tttacataag cgagatctaa 900 agctgtgcca tctgggaatc gattcttcaa ggcgagttta tgtgaacgag aacctgactg 960 tcgctgcacg caaaataaag gcaacagcat tgcggctgaa gaaagctggg aaactatcaa 1020 cggtgtacac gaagctgggg acagtcatgg tgaggcgctc cgtcgatcag ccgcctgtgg 1080 ccgtgcactc ggatgtactg ctggatcagt ttcttagcta agtaatcgag tttatgtcat 1140 cgtccacttt cgttatttgt taatgtgtat tgaatattgt atttggtttt gtaaattgtg 1200 attgtacgaa gactgttata gtagcaaaat accaatgtcg tctattgtag tttattcaca 1260 gccccctaaa aactgcttcc cttttttcac atgatgccat cgttaccaaa ccacacctca 1320 atgaacattc ctggtgcagt aatgaatgcc gttctccctc ctggaaagct gaacgtttgc 1380 catggaaatg ctcagagtct atgtgcacga aactcatcca agttggaaga agtgaaggat 1440 ctcctgcgta actctagagt atcggttgca cgtttcactg aatcctggct ttcagcaagg 1500 aacagcaatc gcagtatctc tatctctggt ttttctgcaa ttcgcaatga tcgtatatat 1560 cgacgtggag gtggaattgt agtttactac aagaatcatt tgtgctgttc tccgatcttc 1620 agtacgaaat tgtcatccga atcagaagac aaaactgagt gtctggccgt ggagttgcga 1680 ttaggcagcg ataaaatcct gattgtggta gtctacaatc ctccagataa caactgcaca 1740 aaattcctcg cagacaaact gacggatttc gccacgcgtt acgacaatat tctgctaatt 1800 ggagacttga acatcgattt gtctaagccg agcagcaaac gtgcacagtt tgagtatatg 1860 ctagaaaatt tcgctttgtt ctcggtgggt gaggaaccta cgtatttcca taaagatggt 1920 tgctcgcatt tagacctgtt tattacaagc tgctacgata aagtgctgcg ctttaatcaa 1980 gttggcttcc ctgggctatc ccagcacgac ttgatcttcg gctccctaga cttcgacgca 2040 aaccctgttc caagtatgaa tacgtatcgt gattacgtca actttgaagc cagaacactg 2100 caaaacgcta ttctttccgt tccttgggat actttctacg aaatgcacga tccaaatgag 2160 ctggtcgact ttttcaactc caatgtgaaa cgcattcatg atgactgcat cccacttcgt 2220 accagtaaga aacacaagtc atcaaactta tggttcaaca acgaaatccg taaggctatg 2280 ttggaacgtg atttggccta taccgactgg attagagcac cgactacaac caaggatcaa 2340 gcgcgacgcc gttataacgc tttgaggaat aaaacaaact cgatggttgt ggctgcaaaa 2400 aaccaatacc tcaatcgatt cttggacagc cgagtcccgc cgaaagagct gtggaagcga 2460 ctcaaaggaa ttggcgtagg aaaagataaa tcatcaaccg tttgtgagtt tcacccagac 2520 gatgtgaatc gcatatttct ttcaagctgt gttcaatgcg atcataatag caatccgaga 2580 aattcatcgt ccaactcgcc ctacagtttc tcgttccggc aagtggagta ctgggaagta 2640 gttaatgccg tgtgtagcat aaaatccaat gcaacgggca tggatggctt gccgatcaaa 2700 ttcataaaaa tcatccttcc gttgatcgtg caacaagtta cccatatgtt caactgtatt 2760 attgaaacgt ctacttttcc gtcgtgctgg aaacacgcta aagttttgcc gctgcgaaaa 2820 aagcctcatg ttaatgcgtt gacgaacctc agaccaatta gtattttgtg tgcactatcg 2880 aaggcatttg agaaactcct gaagcaacag atgtcatcct acattactgt aaacaatttg 2940 ttaacagaac atcaagccgg ttttcgtaaa gggcaaagca tccaaacagc cgttgtccac 3000 gtttatgacg aattggcaaa agctgttgat aagagaggaa ctggcgtatt actgctgctt 3060 gatttttcca aggctttcga tactattcct caccgaaaac tctgctcgaa gttggaaact 3120 cagtttaact ttgctggtac agctgttgag cttataagtt cctatttgaa agagcgaatg 3180 cagactgtgt actgtgggaa ccaacaatcc gaacgtggct ttttaacgtc tggtgttcct 3240 caagggtcgg tgcttggacc tttattattc tgctgtcata tcaacgattt gcctaccgtc 3300 ctaaaatact gttcgataca actgtatgca gacgatgtac agttgtatat ctgtcgcctt 3360 ggtccctgta ctcgcgatct catcagtatg gtgaatatgg acctcgagag aatcacggaa 3420 tggtctcaac gaaatcagct ttttgttaac caatcaaaaa gcaaagcaat gctggtaaag 3480 aatcgacgga gaaacaccgt tcaaacggag ttgctgcctc ttatcagtat ggctgggcag 3540 actatagagt ggacggaaag tgctgccaat cttggattca cctttcaatc agacctgcag 3600 tgggaaggcc ttgtgaatca gcaatgtgga aaaatttacg catgcctcag atcgctctat 3660 tcttgttctt ctggagctcc tattagtaca cgtctgaaac tgttcaaagc actcattctt 3720 ccgcacttca tgttcggcga actacttcat ggcagatcct gcgcctattc tatggacagg 3780 ctacgtgtcg cattgaattc ctgcgtacgt ttcgtttatg ggctgaaccg ctacgcgcac 3840 gttagtcatt tgcaaaggaa cttgatcggc tgcccatttg aaaacttcta tgctcaccgt 3900 tcttgtgtgt tcctgtataa gttggtgaaa acgaacactc cgccaatact ccatcaaaag 3960 ttgttgccgt tccgaggacg tcgcttgcag aatttgatga tccctcccaa taactcaatg 4020 tgctattcca actcgttatt tgttcgaggt gtggtttatt ataatatgct tcctcctgcg 4080 gtcaaatgct cgacatcaga agcagttttt aaaaggggct gtctagagtt ttggaaccgt 4140 atgtaatgtg aatattgtag aatgtaagag tagtttatat gttagttttt aagtttttat 4200 tgtatttgac tcgttaaaaa cgcaggtggt aaacaaaaaa aggtttcctt tacgccactg 4260 gaaataaaca aacaaacaaa caaa 4284 // ID BEL-63_AA-I repbase; DNA; INV; 5997 BP. XX AC AAGE02019161; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-63_AA_; KW BEL-63_AA-LTR; BEL-63_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5997 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019161; Positions 54637 60633. XX CC Positions [5032-5604] - Integrase core CC 'ATAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 592..5997 FT /product="BEL-63_AA-I_1p" FT /translation="MAIKKKSLEEKVKLIREQSFRGSSRSSNISQSASEVS FT EKVTNWLEKTDTHADQNQDNHLGLSVLSNPALVNSFEKLGLSNQEGEGRTA FT FFGHPSGFHQNLNDFVNEVRSSQGLNAETSRQNVVSGNTRREQERDCSGIN FT VPEQTCLSLGRQNAENEQLYAYDHQAGPTNRQLGARQVMGKDLPVFSGNPE FT EWPIWVSNFERSTMTCGFSQDENLIRLQRCLKGPALEMVRGRLLTPACVPH FT VIKTLQLRYGRPETLIRALTEKIRNLPSPKMDNLESIIDFGMAVDNLVEHL FT KTAKQFAHMTNPSLLHDLVGKLPVEYRMKWAAFKGARADADLRIFGTFMNS FT IVELAFDVADDHPMSSYKSQQKPKERAYIQTHSEPTSAIEIGVDGRQEKMR FT KRMCVTCGKEDHRIHECEKFRLLSVDERLKIVNQNSLCRTCLNQHGRWPCR FT TWQGCGISGCRLRHHTLLHPSAQSISVAVSTSHLDQKQSSNGPLFRILPVT FT LYGPSGKIDIFAFVDEGSQITLLEEEIASQLGLYGPCEPLQLLWTGNITRN FT ESKSRRLLMDITGAQLDQKFKLTDARTVDKLMLPTQTLCYSSLETRYPHLR FT GLPIKDYEKVAPKLLIGLDNLKLTIPLKIREGDWGHPMAAKCRLGWSIYGC FT SPNGEGPITCGFHVGGWTNQDQELNQLVRNYVALDNAGIISPISQLESDED FT RRAREILEATTRRISTGFETGLLWKTDYVQLPNSYGMAYRRLCGLERKLSM FT DEPLYECVRTTIRDYVEKQYAHEATEDELATTSPEKCWYLPLGVVVNPRKN FT KVRLIMDARATVNGVSFNSSLLKGPDMLVSLPSVLSHFRLFQYALAADIKE FT MFHRIKIREEDRQFQRFLWRDRPHLEPSTFVMDVAIFGSTCSPSSAQYVKN FT QNAREFASEFPRAAEAITRHHYVDDYLDSFGTIEEAVTVGREVKEIHARGG FT FEIRNFLSNKAELAESVGSQSTAMEKLFPAGNEEYAESVLGMSWIPSSDNF FT TYSMQFRESLQNIVAEGHVPTKREVLRVAMSLFDPLGLITFYLIHGRILKQ FT NIWASKINWDDVIDNELCERWRLWIGYLPQLDLLRIPRCYFAGGNQQTYST FT LQIHVFVDASEAAFSSVIYFRVETVSGAKVALVSAKSKVAPLKMLSIPRLE FT LQAAVLGTRLLNSVISMHNLQVTRRVLWTDSRTVLSWINSDHRKYHQFVGF FT RVAEILSTTELSEWRWIPTRINVADLATKWGAGPDFSGNSVWFRGPEFLRQ FT PEDMWPQQHREIDSTVEEIRPCNIHVSKPELPIDLSRFSRWERLHRTMGYV FT HRFTDNIQRKRQGAPLEVGNLTQFELVKAERSLWKFAQQEYYSVEVEQLGE FT NGDTVVSKSSKIFKLCPFMDKFNVLRMRGRLELSTYVPYEAKYPVILPPES FT VITNLLADWFHRRYQHANRETVVNEMRQYYQIPKLRALIAKTAKNCMQCRV FT CKALPRTPVMAPLPKVRLTPFVRPFTFVGLDYFGPILAKQGRSNVKRWIAL FT FTCLSIRAVHMEVVHALSTESCVLAIRRFVTRRGSPAEIFSDNGTNFHGAN FT NKLKKQIQERNRTLAAIFTNANTKWSFNPPGAPHMGGVWERMVRSVKVAIG FT TFTEATRKPDDETLETVIIEAEGIINSRPLTYIPVESADQESLTPNHFLLG FT SSNGVKQLPVLPTEYRTTLRSSWKLAQHLADGFWRRWLKEYLPVISRRSKW FT FDNVRDISVGDLVLVINGEVRNQWTRGRVEKVIMGADGKVRQAWVRTAGGL FT HRRPSVKLALLDVAQSCEQDLGHRSRSQEGE" XX SQ Sequence 5997 BP; 1757 A; 1277 C; 1508 G; 1455 T; 0 other; acctttaaaa attttcaccg attcctccag catgtcggtt tccggagcga atgatccaca 60 ggtgcagaag agctgcccgg cctgcaaccg ccccgaccac atcgaagata tggtcgcatg 120 cgacgattgt ggcaagtggc accattatgg ttgtgcgaag gttgacgaag ccatcaaaga 180 ccaatcatgg aagtgcgaac cttgtggact cctcccatta aacactgcat cgattacagg 240 agccggtgca agtctggccg ttcccactgg agccatgaag aagaatgcaa agccggccgg 300 taacaaagca ggatcccgta agtcgaaaaa gaccaacctt agtagaactg taagtgttac 360 atcaagtgca agagcacgcc tagctttaga gttagaagtt cttaacgaac agcaacagtt 420 agaagaatta gaactggaag aggagaagca gattagagcc aggcaaattg cacaagaaaa 480 gatgattaaa gatcgtgagt tagagataga ggctaagaag ctagccgaag agaaggcatt 540 tttggaaaag aagacggccg ggccgaagag ttaaaatttc gtagggatca aatggccatc 600 aagaaaaagt ctttagagga aaaagttaag cttattcgcg agcagtcatt tcgtgggagt 660 agtcgttcgt cgaatatcag tcaaagtgca tcggaagtga gtgaaaaagt gactaattgg 720 ctagaaaaga cagatacaca tgccgatcaa aaccaagaca atcatttggg actgagcgtt 780 ctgagtaatc ctgcgctggt gaattctttc gaaaagctcg gtttatcgaa tcaggaaggc 840 gagggacgaa cggcgttttt cggccacccg tcgggattcc atcaaaatct aaacgatttc 900 gttaatgaag tacggtcaag tcaaggattg aatgctgaaa cttcccggca gaatgttgtt 960 tccggcaaca caagacgcga acaagaacgg gactgtagtg gtataaacgt tccggaacaa 1020 acgtgtctgt cacttggcag acaaaatgct gaaaacgaac aattgtacgc gtatgatcat 1080 caggcgggtc ccacaaatcg tcaacttggt gcgcgtcaag taatgggtaa agatcttccc 1140 gtgttttctg gaaatccaga agaatggcct atttgggtga gcaacttcga aaggtctaca 1200 atgacatgcg ggttttcaca agatgaaaat ctaattcgac tacaacgttg ccttaaggga 1260 ccagcccttg aaatggtccg cggaaggctt ctaacaccag cttgtgtgcc acatgtaatc 1320 aagacgttac aactgcgtta cggtcgccca gaaactttga ttcgagctct caccgagaaa 1380 attcgaaacc tcccttcgcc gaagatggat aacttggaaa gtatcatcga cttcggtatg 1440 gccgtggaca acctagttga gcatctgaag acagcaaaac agtttgctca tatgacaaac 1500 ccgtcacttt tacacgactt ggtcggtaag ctacctgtcg aatatagaat gaaatgggca 1560 gcgttcaaag gtgctcgtgc tgacgccgat ctacgaattt ttggaacgtt tatgaactcc 1620 atcgtagagc ttgcctttga tgtagcggat gatcatccga tgagctcata caaatcgcaa 1680 caaaaaccga aggaacgtgc ttacattcaa actcattccg aaccgactag tgcgattgaa 1740 atcggcgtcg atggtcggca ggagaaaatg cggaagagga tgtgcgtaac atgcggaaag 1800 gaagatcatc gaatccacga gtgcgagaaa ttcagactgt taagcgttga tgaaaggttg 1860 aagatcgtga atcagaattc actgtgcaga acctgcttaa atcaacatgg aagatggcct 1920 tgtcgtacat ggcaagggtg tggaatctca ggctgtagac tgcggcatca cacgctgcta 1980 catccatcag cgcagagcat ttcagttgct gtatctacaa gtcatctaga tcaaaaacag 2040 tcatctaacg gtcctctgtt tagaattctg ccagttacgt tatatggacc gtctggaaag 2100 attgacattt tcgcctttgt tgatgaggga tcccaaataa cgcttctaga ggaagaaatt 2160 gcatctcaac ttggtcttta tggaccctgt gaaccactgc agttattatg gactggaaat 2220 attacgcgta acgaatccaa atcacgacgg cttcttatgg atattactgg agcccagttg 2280 gatcagaaat tcaagctaac cgatgctcga acggtcgata aattgatgct cccaacacaa 2340 acactgtgtt acagcagctt ggaaacacgt tacccccacc ttcgagggct tccgattaag 2400 gattacgaga aggttgcgcc aaagctactc attggtcttg acaacttgaa gctcacaata 2460 ccgttgaaga ttcgtgaagg cgattggggg catcccatgg cagccaaatg tcgtcttggt 2520 tggagtatct atgggtgctc accgaacggt gagggaccga ttacttgtgg gtttcatgtt 2580 ggaggatgga ccaatcaaga tcaagagtta aaccaattgg tcagaaatta tgtagctttg 2640 gataatgctg gcataatatc gccaatatct cagttagagt cagatgaaga tcgtcgtgca 2700 cgtgaaatac tagaagcgac cacacggaga atttctaccg gatttgaaac cggtcttctt 2760 tggaagacgg actacgtaca gctgcccaac agttacggta tggcctatag gcggctttgt 2820 ggcttggagc gaaagctctc gatggatgaa ccgctttacg agtgtgtgcg aacaacgata 2880 cgagattacg tggagaaaca gtatgcacat gaagctacgg aggatgagct tgccacaaca 2940 agcccggaga aatgttggta tctgccactt ggagtagtgg tcaatccaag gaagaataag 3000 gttcggctga ttatggacgc gagagccaca gtaaatggtg tttcattcaa ttcttctttg 3060 ctgaaaggac cggatatgct ggtgtcgcta ccgtcagtac taagccattt ccgtctgttt 3120 cagtacgcct tggctgccga tatcaaggaa atgtttcatc gcatcaagat acgggaagag 3180 gatcgacaat ttcaacgatt cctctggcga gatcgaccac atttggaacc gtcaacattt 3240 gtaatggacg tggcaatatt tgggtccacc tgttctccaa gttcggccca atatgtaaaa 3300 aaccagaacg cgcgggaatt tgcaagtgag tttcctagag cagccgaagc aattactcgc 3360 catcattatg ttgacgatta tctagacagt ttcgggacga tagaagaagc ggttacagtc 3420 ggccgagagg tgaaagaaat tcacgcaagg ggtggatttg aaattcggaa tttcctgtcc 3480 aacaaagcag aactagctga atcggtagga tcccagtcaa cggcgatgga gaaactgttt 3540 ccagcaggaa acgaagaata tgcggaatcc gtcctaggaa tgagttggat cccttcaagt 3600 gacaacttta cgtattccat gcagtttcga gaaagtctgc aaaacattgt ggccgaagga 3660 catgttccca caaaacgaga agtattgcgg gtcgctatga gtctcttcga cccattagga 3720 ttgataacgt tctatttaat tcacggtaga atcctaaagc aaaacatatg ggcgtctaag 3780 atcaactggg acgacgtcat tgataatgag ctgtgcgaac gatggcgttt gtggattgga 3840 tacctaccac aattagattt gctccgtatt ccacgatgtt actttgcagg agggaatcag 3900 cagacgtact caacgttgca aatacatgtt tttgtggatg ctagtgaagc agcattttcc 3960 agtgtaatct atttccgcgt ggaaacagtc tcgggagcga aagtagcgct agtgtcagcc 4020 aaatcaaaag tagcgccgct gaaaatgttg tctattcccc gcttggagct acaggcagct 4080 gtgttgggaa cccggctgtt gaacagtgtc atctctatgc acaatctcca agtaacccga 4140 cgagtactct ggactgattc tcgaacggta ctttcgtgga tcaactccga tcatcgcaaa 4200 taccatcagt tcgtaggttt ccgtgtagca gaaattctgt ctacaacgga attatcggaa 4260 tggcggtgga ttcccactag aattaacgta gctgaccttg ctactaagtg gggtgctgga 4320 cccgatttta gcggaaatag tgtgtggttt cgtggaccgg aattcctacg ccaaccagag 4380 gatatgtggc cacagcaaca ccgagaaata gactcaaccg tagaagaaat tagaccttgc 4440 aacattcacg tatccaaacc agagctaccc atagatcttt cacgttttag tcgttgggaa 4500 aggcttcatc gaacgatggg atacgttcat cgtttcacgg acaacattca gcggaagcga 4560 caaggagcac cgttggaggt tggaaatctc acccagtttg agttggtaaa ggcagaaaga 4620 agtctgtgga agtttgctca gcaagaatat tactctgtgg aagtagagca gcttggcgag 4680 aatggcgata cggtagtctc gaaatcaagt aaaattttta agctgtgtcc gttcatggat 4740 aaattcaatg tacttcgcat gcgaggtcga cttgagttat ctacatacgt accatacgaa 4800 gcaaagtatc cagtaatact tccaccagaa tctgttatta ccaacctgtt ggcagattgg 4860 tttcatcggc gatatcaaca cgcaaaccgg gaaacggtgg ttaatgaaat gaggcagtat 4920 tatcaaatcc cgaaactgcg agcattgatc gctaaaacag ccaagaattg tatgcaatgc 4980 cgagtatgca aggcacttcc taggacacct gttatggctc ccttgccgaa agtgcgcctc 5040 accccatttg ttcgaccatt tactttcgta ggcctcgatt acttcggccc catattagca 5100 aaacagggtc gaagtaacgt taaaaggtgg attgcgctat tcacgtgcct gagcattcgt 5160 gctgtacaca tggaggtagt tcatgccttg tcgacggagt catgcgttct ggccatccgc 5220 agatttgtta cgcgtcgtgg ctcaccagct gaaatattta gcgataatgg aacaaacttt 5280 catggagcta ataataaatt gaagaagcaa atccaagagc gcaaccgtac gcttgctgca 5340 atcttcacta acgccaatac gaagtggtcc ttcaacccgc ctggagctcc tcacatgggg 5400 ggagtgtggg aacgaatggt acgctcggta aaagttgcca taggtacctt taccgaagct 5460 actcgaaaac ccgacgatga aacgttagag acggtaatta ttgaagcaga gggtataata 5520 aattctcgac cgttgacgta cattcctgtg gagtctgctg atcaggaatc cttaacacct 5580 aaccatttcc tactgggtag ctccaatggt gtcaagcaac taccggtgtt accaacagag 5640 taccgaacaa ccctaagaag tagctggaaa ttagcacagc atttggcaga cgggttttgg 5700 agaagatggt tgaaagaata cttgccagta atatcacgcc gatctaaatg gttcgacaac 5760 gttagggaca tttcagtagg cgacctggta ctcgttataa acggagaagt aaggaatcaa 5820 tggacaagag gacgtgtgga gaaggtgatt atgggtgcgg acggtaaagt tcgacaagct 5880 tgggtacgaa cagctggagg attgcatcgt cgaccgtcag ttaagttggc gttgctcgac 5940 gtcgctcaga gttgtgaaca agatctagga catcgtagtc gttcacagga gggggaa 5997 // ID DNA2-1_Tad repbase; DNA; INV; 153 BP. XX AC . XX DT 01-AUG-2009 (Rel. 14.08, Created) DT 01-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-1_Tad. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-153 RA Jurka J.; RT "DNA transposons from Trichoplax adhaerens."; RL Repbase Reports 9(8), 1824-1824 (2009). XX DR [1] (Consensus) XX SQ Sequence 153 BP; 45 A; 31 C; 33 G; 44 T; 0 other; ctgtacagta ggcccagcca taataacccc gattactcag atactcagcg taaatagctt 60 aaacgatgat acattacatt attctaggag tctaaggtat ttacgctgag tatctgagta 120 atcggggtta ttatggctgg gcctactgta cag 153 // ID LOA_Ele6C_AAe repbase; DNA; INV; 5744 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW LOA_Ele6C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5744 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1425-1425 (2011). XX DR [2] (Consensus) XX CC >98% identical to consensus. The consensus is ~77% identical to CC LOA_Ele6 and LOA_Ele6B_AAe. XX FH Key Location/Qualifiers FT CDS 240..1916 FT /product="LOA_Ele6C_AAe_1p" FT /translation="MEPNTESMEIASDMETEEMPKEEDLLQSSSMDDIGGG FT GSISSSILNSEDEEDGVNITIKEIRNDLQSNLQNPEINVRPKLSGAKKKRF FT RKLLLGGHAREEALSMVLTPSNVPTPKRSRNMNNSSNSDGKPNPKKQKGLQ FT VHHSVNNRMENIRQGATAIVPENQQSTYGEVAKWVRVGILPNTYPQIQLTT FT EQMNLAQEAILEKVTVQRRESFKPKFTNCWQRTGHLVINCQDTETADWLDS FT VVPTLCPWEGAELTVVDADDIPRLDVMIGFFPQSVNNDNDTIRIFIESQND FT GLSTDKWRIIQRNTLYEKHVEWYFTVDEASMQHFRASNFLLNYKFGQTTLR FT KKGMFKPDSNGTVTLRDDTKDEPESSVSTGNQQDHSVPGPSGIHGNPSKVE FT SSKNHNQTQPDVNNHQGSKNNVFGHQTGLAEGQFLSEPPKDRNSSGPTKDH FT SRSGSSKDQTISGHLKGQRSSELSKARSSVGPSKEQRSFGPSKEQSSFGPL FT KDQSSFGTLKDQNHSGPHMNQNAYGLVRDQNTPGLRKNHNETEQKHKQKST FT GPKKDQMNSKNV" FT CDS 1935..5648 FT /product="LOA_Ele6C_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MSSNRNHMQIDTNIKCIQINLHHAKGATGILCQRFMK FT EKLSVAFIQEPWAHKNQVLGIPMQTGKIIFDEHSSRPRTALLLKGGLKYVP FT LTQFITKDLVAVQMEIPTTRGKTEACIASAYFPGDVDTVPPPELAAFVRFC FT KSNNKHFIVGCDANSHHTVWGSTNINKRGEDLFEYISSNNIDLCNEGNQPT FT FTTKNRQEVLDITMCSSMISTNIKNWRVSDEESLSDHKHIVFEYRASKTIK FT VTYRNPRNTNWEKYESMLNLSNNISLNDNIETTEQLDKIAENITGKIIRAY FT DTSCPIKEEFSNRDVPWWNKSLENLKKKSRQLFNRAKVSGQWDQYKKALTT FT YNKEIRKSRRKTWKFYCENIEKTPDVAKLQKVLSKDHSNGLGNLKKINGSF FT TVDTKETLEIMMKSHFPGSSLVMSDDTECDKVGETSRPNRTTGNNLDLPDT FT IFTRSKVEWAVNSFEPFKSPGRDGVFPALLQKGGQLLLDSLTNIFKTSLCL FT GHIPKIWTQVRVVFIPKAGKKDKTNPKSFRPISLTSTILKIMEKIIDEYIK FT SKYLQTNPISKYQFAYQPNKSTVTALNELVTKIESTLHRKEIALMAFLDVE FT GAFDNASYKSIENCMVKRQFDRCIKIWIMKMLNDREVSAELGESSITIKTT FT KGCPQGGVLSPLLWSLVVDDLLIKLVEKGFEVIGFADDVVIIVRGKFDNII FT TDRMQSALNCTWQWCQHEGLNINPSKTVLVPFTRRRKITLKNLTINGCILQ FT YSSNVKYLGVILDSTLNWNLHLEQVILKATNALWVSKKTFGKKWGLKPKMI FT NWIYTVIVRPRIVYASLVWWTKTKQHMAQRKLGKLQRLACMAITGAMHSTP FT SKALEAALYKLPLHQYIQKEAEKSALRLKRTHTFLEGNLFGHLSILKDFPI FT NPLVVMNEDWMENRMNLDIPYNVVETNRQVWESGGPNISPGSIIFYTDGSK FT MNDKTGAGITGPGIDISIPMGRWTTVFLAEIYAILECASLCLRRKYRYARI FT CIFSDSQAALKALKSFTSQSKLVWECITLLKQLSVKNQVNLYWVPGHCGIE FT GNEKADLLARRGSSVQFIGPEPFCGVSKCLIQMELKQWEDQMIQSNWIATE FT TLRQSKLFITPNKNVTEKLLSLNKKSLSILIGLLTGHCPTRYHLNKINRCQ FT TGVCRFCDCEIETSQHLLCDCPALYVRRRQYFNKGILSPFEIWLENPNLVI FT GFILRIIPDWSTSHHQSTAFTPNGNSSS" XX SQ Sequence 5744 BP; 2079 A; 1050 C; 1165 G; 1450 T; 0 other; tctatcgcat aacgagagaa agaaagttgc ctgagattgg ttggtgctac cgggccacta 60 ccactcttgt gtgtgaaaga agatcccgaa gatcactcaa tacttgtgcg cggtgtgcta 120 gaagagaaat attcaaaaaa aaaaacggtt gagttcaaaa caaaatcaaa aaataatttc 180 tttttttttt tttttttaaa cataatccaa aaattaaatt aaaggtattg ttcagcgtga 240 tggagccaaa tactgaatca atggaaattg catcagatat ggaaacggaa gaaatgccta 300 aagaagaaga tctactacaa agttcctcaa tggatgatat tggcggtgga ggatctatct 360 cgtcctctat ccttaactct gaggatgagg aggatggagt aaatatcacc attaaggaga 420 tacggaatga tctacagagt aatttacaaa atccagaaat aaacgtaaga ccaaagttga 480 gcggagcaaa gaagaagcgt ttcagaaaac tactgttggg tggacatgca cgggaggaag 540 cactttcgat ggttttaaca ccatctaacg tgcccacacc gaagaggtct cgaaacatga 600 acaacagtag caacagtgac gggaaaccca atcctaaaaa acaaaaagga ttgcaggttc 660 atcactccgt caataatcga atggaaaata tcaggcaagg agcaacagcg attgtacctg 720 agaatcaaca gtcgacatat ggagaagtgg ccaaatgggt tagagtagga atactaccaa 780 acacctatcc acaaatacag ctcacaacgg aacagatgaa tttggcacag gaggccattc 840 tcgaaaaagt aacggttcaa agaagagagt ccttcaagcc gaagttcact aactgctggc 900 agcgaacagg acacttagtg ataaactgtc aagacacgga aacagcggat tggctcgact 960 ctgtggttcc tacgctctgc ccatgggaag gcgctgaatt aacagtagtc gatgcagatg 1020 atatcccaag gttagacgtg atgataggct tttttcctca aagtgtgaac aacgacaacg 1080 acaccatccg gatcttcata gagagtcaaa acgacggcct gagtaccgat aaatggagga 1140 tcattcaacg aaacactctt tatgagaaac acgttgaatg gtacttcact gttgacgagg 1200 cgtccatgca acatttcaga gccagtaatt ttcttcttaa ctacaagttt gggcaaacca 1260 ctcttcggaa aaaaggtatg ttcaaacctg attcaaatgg gactgttact ttgagagatg 1320 ataccaaaga cgaaccggaa agctccgtaa gtactgggaa ccaacaagac catagtgtcc 1380 cgggaccaag cggaatacac gggaatccgt caaaggtaga atcctcaaag aatcataacc 1440 agacacaacc agatgtaaac aatcaccaag ggtcaaaaaa caatgttttt gggcatcaaa 1500 ctgggcttgc agagggccag tttctctctg agcctccaaa ggaccgaaac tcttctgggc 1560 cgacaaagga ccatagcagg tctggatcgt caaaagacca gactatctct ggccatctaa 1620 agggacagag aagctctgag ctatcaaagg ctcggagcag cgttgggccg tcgaaggaac 1680 agcgcagctt tgggccgtca aaggaacaga gtagctttgg gccgttgaag gaccagagca 1740 gttttgggac tctaaaagac caaaatcatt ctgggcctca tatgaaccag aatgcttatg 1800 ggctggtaag ggatcaaaat acacctgggc ttcgtaagaa ccataatgaa actgaacaaa 1860 aacacaaaca aaaatccact gggcccaaaa aggaccagat gaattcaaaa aacgtctaac 1920 tatgagagtt taaaatgtct tctaatagaa atcatatgca aatagataca aacataaaat 1980 gtatacaaat aaaccttcat cacgctaagg gtgcaactgg aatcctctgt cagaggttta 2040 tgaaagaaaa attaagcgta gcatttatac aagaaccttg ggctcataaa aaccaagttc 2100 ttggaattcc tatgcaaaca ggtaaaatta tttttgatga acacagttca agaccgcgca 2160 cagcactatt acttaaaggg ggacttaaat acgtaccatt gacacaattt atcaccaaag 2220 atttagttgc agttcaaatg gagattccca caacccgggg aaaaactgaa gcttgtatag 2280 cttctgcata ttttcctggg gatgtggaca cagtacctcc tcctgaattg gcagctttcg 2340 taagattttg taagagtaat aataaacatt ttattgtcgg ttgcgatgct aattcccatc 2400 acacagtttg gggaagcacc aacataaata aacgaggcga agaccttttt gaatacatat 2460 cttcgaacaa catcgattta tgtaacgaag gaaatcagcc tacatttaca acaaaaaaca 2520 gacaagaagt attggatata actatgtgta gttcaatgat ttctacaaat ataaagaatt 2580 ggcgagtttc agatgaagaa tctttatctg accacaaaca catagtgttt gagtacagag 2640 ccagcaaaac aatcaaagtg acttacagaa atcctaggaa taccaactgg gaaaaatatg 2700 aatcaatgtt aaatttaagt aataatatct cattgaatga caatatagag acaacagaac 2760 aattagacaa aattgcagaa aatataactg gaaaaattat tcgggcgtat gatactagtt 2820 gtccaataaa agaagaattc tctaatagag atgttccttg gtggaataaa tctttagaaa 2880 acctaaagaa aaaatctcgg cagcttttca accgcgcaaa agtttctggg caatgggacc 2940 aatacaaaaa ggcccttaca acttacaaca aagaaataag aaaatcccgt agaaaaacct 3000 ggaagtttta ttgtgaaaac atagaaaaaa ctccggatgt agcaaaatta cagaaggttc 3060 tttccaaaga ccattcaaat gggctgggaa atttgaaaaa aataaacggg agttttaccg 3120 ttgatacaaa agaaacactt gaaataatga tgaaaagtca ttttccagga tcaagtttag 3180 taatgagtga tgatacagaa tgtgataaag ttggagaaac ctctaggcct aataggacta 3240 ccggcaacaa cttagatcta ccggatacta tctttactcg atctaaagtt gagtgggcag 3300 tgaattcttt cgaacctttc aagtctccag gaagagatgg tgttttccct gcacttctac 3360 agaagggtgg acaattactg cttgactctc tcacaaatat ctttaaaact agcctatgtc 3420 ttggccatat accaaaaata tggactcaag ttcgggtcgt ctttattcct aaagcaggga 3480 aaaaagacaa aacaaatcca aaatcgttta gaccaattag tctaacatct acaattctta 3540 aaatcatgga aaagataata gacgaataca taaagtcaaa atacttacaa acaaatccta 3600 tcagtaaata tcaatttgct tatcaaccca ataagtctac agtcacagct cttaatgaac 3660 tggttacaaa aatagaatcc acgcttcaca ggaaagaaat agctcttatg gcatttctgg 3720 atgttgaagg tgcgtttgat aatgcgtctt ataaatctat tgaaaattgt atggtgaaaa 3780 ggcaattcga tcgttgtatc aaaatatgga ttatgaaaat gttgaatgat agagaagtat 3840 ctgctgaatt gggagaatct tcaataacaa taaaaaccac taagggatgt cctcaaggag 3900 gtgtattatc ccctttgttg tggtcgcttg ttgttgatga tctcctcatt aaactagtag 3960 aaaaaggctt cgaagttatt gggtttgcag atgatgtggt aataatagtt cgtggaaagt 4020 ttgataatat aatcactgac agaatgcaat cagcattgaa ctgtacttgg cagtggtgtc 4080 aacatgaagg gttgaacatt aacccatcaa aaacagtttt ggtgccgttt actcgcagaa 4140 gaaagattac tttaaaaaat ttgactataa acggatgtat tttacaatat tcttccaatg 4200 ttaaatatct aggagtaata ctagatagta ctctcaattg gaacttacac ttagagcagg 4260 taatactcaa agccacaaat gctctatggg tgagtaaaaa aacttttggt aaaaaatggg 4320 gactgaaacc aaaaatgatt aattggatct atacggttat agtacgaccc agaatagttt 4380 atgcttcact agtttggtgg acaaaaacaa aacaacatat ggcccagaga aagcttggaa 4440 agttgcaacg cttagcctgc atggccataa caggagcaat gcacagtaca ccatcaaaag 4500 ccttggaagc agctctttat aaacttccgc tacatcaata tatacagaag gaagctgaaa 4560 agagtgctct aaggctgaaa agaactcata catttttaga aggcaatctt ttcgggcatc 4620 tcagtatact gaaagatttt ccaataaatc cattggtagt aatgaacgaa gactggatgg 4680 aaaataggat gaacttagat ataccatata acgtggttga aacaaaccgc caagtatggg 4740 aatcaggggg tccgaatatc tcaccaggat cgataatttt ctacactgat ggctcaaaaa 4800 tgaatgataa aaccggggct ggaattacag gaccaggtat tgatatttca ataccaatgg 4860 gaagatggac cactgttttc cttgcagaaa tttatgccat tctagaatgt gcatctttgt 4920 gtctgagaag gaaatacagg tatgcacgga tttgcatatt ttcagatagc caagcggcgc 4980 tgaaagcttt aaaatcattc acaagccaat caaaactggt atgggaatgc ataacactgc 5040 taaagcaatt gtccgttaag aaccaggtta atttatactg ggtaccaggc cattgtggaa 5100 ttgaaggtaa tgaaaaagcg gatttacttg caaggagagg gtcaagtgtc cagttcatag 5160 gaccagaacc cttctgcggt gtatccaaat gtttaataca aatggaattg aaacaatggg 5220 aggatcaaat gatacagtca aactggattg caacagaaac cttaaggcaa tcaaaactgt 5280 tcataactcc aaataaaaac gttacggaaa aacttttaag tttaaacaaa aaatctttaa 5340 gcatacttat tggacttctc acaggacatt gtccgacaag atatcatttg aataaaatca 5400 atcgatgtca aactggtgtc tgtcggtttt gtgactgtga aatagagacc tcacaacatt 5460 tgctctgtga ctgtccagcg ttatatgtac gtagaagaca atatttcaat aagggcattt 5520 taagcccttt tgaaatttgg ttagaaaatc ccaatctggt tattggtttc attctaagaa 5580 tcataccgga ttggagtact tcgcatcatc agtcaacggc atttacccca aatggtaatt 5640 cgtcgtcctg aaagaatgca aattaaatag gggtatacca caatagatca aactcatggt 5700 cgcagtggtt taaatcccaa caccaaaaaa aaaaaaaaaa aaaa 5744 // ID Jockey-14_AAe repbase; DNA; INV; 4378 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-14_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4378 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1380-1380 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 256..1464 FT /product="Jockey-14_AAe_1p" FT /translation="MDTTDQQNATCSSISYGYNIATSNQFGTLAGSDTNVL FT PITQNKNAKKGKEKIPLVTITETNIPAVQGMALNAGVSMFNVKRTSTGVNV FT FMYTTQDHSKLVSYLKQQKANFFTYVRDEDRMTKIVLTGLPDMNVEEVSEA FT LKEVDVSPKEIRKMTLKRKRNEDHALYLLYFNKGSVQLNAIRKIKTVGHCI FT VRWDFYSTKTKGPVQCRNCQMYGHGSAQCFRRPRCVKCGNDHKTIDCPLTA FT NREDKSIKLPKEQLACANCGQQHTANFHLCEKRQEFINARNHQLLKSKTQR FT SSREGFQFRPEEFPPSGPLGGSFQKPFRQHAQLYSNVVSHREDRTVPPQFR FT QHHNPRYSSNQSTKNDLFSPEEIVDIVSDVFSRLSTCTTKQQQIKVIFEIT FT AQYCFPCNG" FT CDS 1460..4135 FT /product="Jockey-14_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDNNTRNITVSYWNANGVQSKSHDLFRFLINNSIDVV FT LICETFLKPAIKLVHSDYKIYRLDRSGRRKGGVAIIVRSNVAHTLLPSFDL FT EIVEAIGIEVVTSTGPISLVSAYNPGANRDNELFLQDIQKLTKIRNSFFIC FT GDLNARHRLWNCIRANTAGNLLFNELQSGTFIVQHPPLSTYIPTDPNRRCS FT TLDLVLTNGIHSILNIRTVQDLTSDHLPVLFEIDSATANLAHPRSIPDYSR FT ANWRGFKSFLNDNIDLQQLNLSTIHETYQIDNHIEYFTTIMHQAHERFIPL FT IVPDRYKLILPDDILRLIRLRNSRRRQWQRNRRDPYLKSVYDYVCTLVKKR FT IDEFRNKSWSDNLLSMNYDRNNHNKMWKFGKLFKNKHNFIPALXKDGKLLI FT TDKEKCEEIGNNFARAHLTTFHDQSPIEFEVNSSVDNFLANHEQVDLDTRR FT LVKPKEILHFLKKLKNNKSPGFDQINNRCLKRLPRKALVLLTFIFNSCIKL FT NYFPLAWKHAKVIAIPKPNKDHSDPRNYRPISLLSSLSKIFEKVLLNRLNQ FT HISENNIISETQFGFRTEHSTVHQLHRLTRNIKNNREIKKSTGLVLLDNEK FT AFDTVWHKGLIFKMINLNFPTYLVKLIHSYISDRDFVVTIGSETSQLQTVP FT AGVPQGSVLSPVLYNVYAHDIPDFGGCVRYQFADDVAITSSSSDPVEVIIN FT LNASLVQYSNYCKKWKIKVNENKTEAVFFTRFRSPRKLPNRCLNLNGFDIP FT WKDEAKYLGLILDKKLTFQKHSQYILERCEKLIRIMYPFINRKSKLNTRNK FT ILLYKTVFRSTLAYASPVWSECAMSHRKKLQVFQNKCLKMVHNLPPWFSTS FT ELHDISHIETLDQYCNRIRSNYYQKCESSSFDMLRALCN" XX SQ Sequence 4378 BP; 1409 A; 914 C; 786 G; 1268 T; 1 other; tttgaaaccc gctctcgttg tgatcggacg tgttattcat cgcgctttat tctcgatcta 60 ttattcaaat agtcttttgt tttcgtgatc tcctatcgtt gttgcagcat cgatctagca 120 ccggcggccg ctggtcaaaa acggaatagt tccacagtaa tatcaaacgg cgtggagcca 180 aatggcccga cccagaaact tcgtcgcccc ttaacggatt tcccgagcac atctgccctg 240 attgagcttc cgatcatgga tacaacagat caacaaaacg ccacgtgctc ttctatcagc 300 tacggttaca acatcgccac atcaaatcag tttggcactc tagctggcag tgataccaac 360 gtgctgccta tcacgcaaaa taaaaatgcc aaaaaaggaa aagaaaagat tccgctggtc 420 acgattactg agaccaacat ccctgctgta caaggaatgg ccttgaacgc aggcgtctcg 480 atgttcaacg tcaagcgaac atcaactggg gtcaatgtgt tcatgtacac aacccaagac 540 cactccaagc tcgtttccta cctgaaacag caaaaagcaa acttcttcac ttacgtgcga 600 gatgaagaca gaatgactaa aattgtcttg actgggcttc cagacatgaa tgtcgaagag 660 gtatccgaag cactcaagga agtggatgtg tctccaaaag aaatccgaaa aatgacgctc 720 aagagaaaac ggaacgaaga tcatgctttg tacctcctgt atttcaacaa aggaagcgtg 780 cagctcaacg ccattaggaa aatcaagacg gttggccatt gcatcgttag gtgggacttt 840 tattccacaa agacaaaagg acccgttcag tgccgcaatt gtcaaatgta cggacatgga 900 tctgcacaat gtttccgccg gccacgttgt gtcaaatgtg gaaatgatca caagactatt 960 gattgcccgc taacggcaaa ccgtgaggat aagtccatta agcttccaaa agaacagctt 1020 gcatgtgcga attgtggtca gcaacatacc gctaactttc acctttgtga gaaacgtcaa 1080 gagttcatca acgccaggaa tcatcaactt ctcaagtcca aaacccagag atctagccgt 1140 gaagggttcc agtttcgtcc agaagaattc cccccatccg ggccactggg agggtcgttc 1200 cagaaaccgt ttcgtcaaca tgctcagctg tattccaacg ttgtgtccca tcgtgaagat 1260 cgaaccgttc caccccagtt ccgccaacat cataatcccc ggtatagcag taaccagtcc 1320 acaaaaaacg atcttttctc cccagaagaa atcgtcgaca ttgtgtctga tgttttctca 1380 aggttaagta cttgtacgac caagcaacaa cagatcaaag tgatttttga aattacagct 1440 cagtactgtt tcccgtgtaa tggataataa taccagaaac atcacagttt catactggaa 1500 tgctaatggt gttcagagca aatcgcatga ccttttccga ttcctcatta ataattcgat 1560 tgatgttgtt ctcatctgcg aaacttttct caaaccagct atcaaacttg ttcattctga 1620 ttacaaaatt taccgcttag acagatctgg ccgtcgtaaa ggcggagttg caataatcgt 1680 tcgttctaat gttgcgcata cattgctgcc ttctttcgac cttgaaattg tcgaggcaat 1740 aggtattgag gttgtcacat ctacgggtcc aatttcgtta gtttctgctt acaatcctgg 1800 cgctaacaga gataatgagc tttttctgca agatattcaa aagctcacta aaattcgaaa 1860 cagcttcttc atttgtggag atctaaacgc ccgccaccgt ctgtggaact gcattcgtgc 1920 taacactgca ggcaatttgc ttttcaatga actgcaatct ggtactttta ttgtacaaca 1980 tcctccatta tctacgtata ttcccactga tccaaaccgt cgatgctcta ctcttgattt 2040 agtgcttaca aatggtatac attcaatatt aaacatacga actgttcaag atctaacatc 2100 tgaccattta ccagtgctat ttgaaattga ttccgctacg gcaaacttag cccatcctcg 2160 atccattcca gactattccc gtgccaactg gagaggattt aaatctttcc taaatgataa 2220 cattgatcta caacaactta atttgagtac aattcacgaa acctatcaaa ttgacaacca 2280 tattgaatat tttaccacta ttatgcacca agctcatgaa agatttattc cgctaattgt 2340 tccagacagg tacaagctta ttttacctga tgatattctt cgtttgataa gattaagaaa 2400 tagtcgccgg agacaatggc aacgaaatcg aagagatcca taccttaaat ccgtctatga 2460 ctatgtatgt actttagtca aaaaacgtat cgacgagttc aggaacaaat catggtcaga 2520 taatttgctg agcatgaatt acgatcgaaa taatcacaac aaaatgtgga aatttggtaa 2580 acttttcaaa aacaaacata atttcatacc agcacttama aaagacggta aattattgat 2640 taccgataaa gaaaaatgtg aagaaattgg aaataacttt gccagagcac atctcactac 2700 ctttcatgat caaagtccca tcgaattcga agttaattca tcagttgata attttcttgc 2760 aaaccacgaa caagttgatc ttgatactcg gaggctagtc aaaccgaagg aaatattaca 2820 ctttctaaaa aagttgaaaa acaacaaatc tccaggcttt gaccaaataa ataatcggtg 2880 tttgaaaaga ttaccgagaa aagctctcgt gcttttgaca tttattttca attcgtgcat 2940 aaaactgaat tacttcccct tggcatggaa acatgctaag gttatagcaa ttccgaagcc 3000 aaataaagac cattctgacc cgcgaaacta ccgtcctatt agtctattga gcagtttaag 3060 taaaatcttc gagaaagttt tactaaaccg actcaatcaa catatctcag aaaacaatat 3120 aatttcagaa actcagttcg gattcaggac tgagcattct acagttcacc aattgcatcg 3180 actaactagg aatataaaaa acaaccgcga aatcaagaag tctactggac tagttctgtt 3240 agacaatgaa aaagcattcg atacggtttg gcacaaaggt ttgattttca aaatgatcaa 3300 tttgaatttc cctacttatc tggtaaaatt aattcattcc tacatatcag atcgtgactt 3360 tgtagtaacg ataggttccg aaacctctca gttacaaacc gttcctgctg gtgtccctca 3420 agggagcgtt ttatctcccg tgttgtataa cgtctacgca catgatattc cagattttgg 3480 aggatgtgta cgatatcaat ttgctgatga tgtagccata acatcgtcat ctagcgatcc 3540 agttgaggtg attatcaatc tcaatgcgag tctagtccag tattccaatt attgcaaaaa 3600 atggaaaatt aaggtcaatg aaaataaaac cgaagctgtt ttctttacaa ggttcagaag 3660 tccaagaaaa cttcctaata gatgtctgaa tttgaatgga tttgatatac cgtggaaaga 3720 tgaagcgaag tatctaggtt taattttaga taaaaaactt acgtttcaaa aacattctca 3780 gtatatactt gagagatgtg agaaattgat tagaattatg tatcctttta ttaatcgaaa 3840 atcaaaacta aatactcgaa ataagatttt gctgtacaaa actgtattta gatcaactct 3900 agcctacgct tcgccagtat ggtccgagtg tgccatgtct cataggaaaa agctacaagt 3960 ttttcaaaac aaatgtttaa aaatggttca taatttacct ccttggttta gcaccagcga 4020 attacatgac attagtcata tagaaaccct agatcagtac tgcaatagaa ttagatcaaa 4080 ctattatcag aaatgtgaaa gttcttcttt tgatatgttg cgtgctttat gtaattagtg 4140 ttagggtcct agcttttaag ttttttaatt gtttcaagta agaggttttt tatctgaaaa 4200 gattgttttc ctctaaatga aaaatcaata tgatataaca tcatcactat attgtaaaga 4260 tacttaaaaa atcgatctaa tgttgaaagt tgctaacgaa ctcattattt gtagaataag 4320 gaatttattg tataatactc ttttatgatg aataaactgt tgattgattg attgattg 4378 // ID LSU-rRNA_Hsa repbase; DNA; INV; 5035 BP. XX AC HSU13369; XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE rRNA from Metazoa. XX KW rRNA; Pseudogene; LSU-rRNA_Hsa. XX OS Metazoa OC Eukaryota. XX RN [1] RP 1-5035 RA Smit A.F.; RT "LSU-rRNA_Hsa - rRNA from Metazoa (extracted from Genbank locus RT HSU13369)."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR EMBL/GenBank/DDBJ; HSU13369; Positions 7935 12969. XX SQ Sequence 5035 BP; 800 A; 1672 C; 1811 G; 752 T; 0 other; cgcgacctca gatcagacgt ggcgacccgc tgaatttaag catattagtc agcggaggaa 60 aagaaactaa ccaggattcc ctcagtaacg gcgagtgaac agggaagagc ccagcgccga 120 atccccgccc cgcggggcgc gggacatgtg gcgtacggaa gacccgctcc ccggcgccgc 180 tcgtgggggg cccaagtcct tctgatcgag gcccagcccg tggacggtgt gaggccggta 240 gcggccggcg cgcgcccggg tcttcccgga gtcgggttgc ttgggaatgc agcccaaagc 300 gggtggtaaa ctccatctaa ggctaaatac cggcacgaga ccgatagtca acaagtaccg 360 taagggaaag ttgaaaagaa ctttgaagag agagttcaag agggcgtgaa accgttaaga 420 ggtaaacggg tggggtccgc gcagtccgcc cggaggattc aacccggcgg cgggtccggc 480 cgtgtcggcg gcccggcgga tctttcccgc cccccgttcc tcccgacccc tccacccgcc 540 ctcccttccc ccgccgcccc tcctcctcct ccccggaggg ggcgggctcc ggcgggtgcg 600 ggggtgggcg ggcggggccg ggggtggggt cggcggggga ccgtcccccg accggcgacc 660 ggccgccgcc gggcgcattt ccaccgcggc ggtgcgccgc gaccggctcc gggacggctg 720 ggaaggcccg gcggggaagg tggctcgggg ggccccgtcc gtccgtccgt cctcctcctc 780 ccccgtctcc gccccccggc cccgcgtcct ccctcgggag ggcgcgcggg tcggggcggc 840 ggcggcggcg gcggtggcgg cggcggcggg ggcggcggga ccgaaacccc ccccgagtgt 900 tacagccccc ccggcagcag cactcgccga atcccggggc cgagggagcg agacccgtcg 960 ccgcgctctc ccccctcccg gcgcccaccc ccgcggggaa tcccccgcga ggggggtctc 1020 ccccgcgggg gcgcgccggc gtctcctcgt gggggggccg ggccacccct cccacggcgc 1080 gaccgctctc ccacccctcc tccccgcgcc cccgccccgg cgacgggggg ggtgccgcgc 1140 gcgggtcggg gggcggggcg gactgtcccc agtgcgcccc gggcgggtcg cgccgtcggg 1200 cccgggggag gttctctcgg ggccacgcgc gcgtcccccg aagaggggga cggcggagcg 1260 agcgcacggg gtcggcggcg acgtcggcta cccacccgac ccgtcttgaa acacggacca 1320 aggagtctaa cacgtgcgcg agtcgggggc tcgcacgaaa gccgccgtgg cgcaatgaag 1380 gtgaaggccg gcgcgctcgc cggccgaggt gggatcccga ggcctctcca gtccgccgag 1440 ggcgcaccac cggcccgtct cgcccgccgc gccggggagg tggagcacga gcgcacgtgt 1500 taggacccga aagatggtga actatgcctg ggcagggcga agccagagga aactctggtg 1560 gaggtccgta gcggtcctga cgtgcaaatc ggtcgtccga cctgggtata ggggcgaaag 1620 actaatcgaa ccatctagta gctggttccc tccgaagttt ccctcaggat agctggcgct 1680 ctcgcagacc cgacgcaccc ccgccacgca gttttatccg gtaaagcgaa tgattagagg 1740 tcttggggcc gaaacgatct caacctattc tcaaacttta aatgggtaag aagcccggct 1800 cgctggcgtg gagccgggcg tggaatgcga gtgcctagtg ggccactttt ggtaagcaga 1860 actggcgctg cgggatgaac cgaacgccgg gttaaggcgc ccgatgccga cgctcatcag 1920 accccagaaa aggtgttggt tgatatagac agcaggacgg tggccatgga agtcggaatc 1980 cgctaaggag tgtgtaacaa ctcacctgcc gaatcaacta gccctgaaaa tggatggcgc 2040 tggagcgtcg ggcccatacc cggccgtcgc cggcagtcga gagtggacgg gagcggcggg 2100 ggcggcgcgc gcgcgcgcgc gtgtggtgtg cgtcggaggg cggcggcggc ggcggcggcg 2160 ggggtgtggg gtccttcccc cgcccccccc cccacgcctc ctcccctcct cccgcccacg 2220 ccccgctccc cgcccccgga gccccgcgga cgctacgccg cgacgagtag gagggccgct 2280 gcggtgagcc ttgaagccta gggcgcgggc ccgggtggag ccgccgcagg tgcagatctt 2340 ggtggtagta gcaaatattc aaacgagaac tttgaaggcc gaagtggaga agggttccat 2400 gtgaacagca gttgaacatg ggtcagtcgg tcctgagaga tgggcgagcg ccgttccgaa 2460 gggacgggcg atggcctccg ttgccctcgg ccgatcgaaa gggagtcggg ttcagatccc 2520 cgaatccgga gtggcggaga tgggcgccgc gaggcgtcca gtgcggtaac gcgaccgatc 2580 ccggagaagc cggcgggagc cccggggaga gttctctttt ctttgtgaag ggcagggcgc 2640 cctggaatgg gttcgccccg agagaggggc ccgtgccttg gaaagcgtcg cggttccggc 2700 ggcgtccggt gagctctcgc tggcccttga aaatccgggg gagagggtgt aaatctcgcg 2760 ccgggccgta cccatatccg cagcaggtct ccaaggtgaa cagcctctgg catgttggaa 2820 caatgtaggt aagggaagtc ggcaagccgg atccgtaact tcgggataag gattggctct 2880 aagggctggg tcggtcgggc tggggcgcga agcggggctg ggcgcgcgcc gcggctggac 2940 gaggcgcgcg ccccccccac gcccggggca cccccctcgc ggccctcccc cgccccaccc 3000 gcgcgcgccg ctcgctccct ccccaccccg cgccctctct ctctctctct cccccgctcc 3060 ccgtcctccc ccctccccgg gggagcgccg cgtgggggcg cggcgggggg agaagggtcg 3120 gggcggcagg ggccgcgcgg cggccgccgg ggcggccggc gggggcaggt ccccgcgagg 3180 ggggccccgg ggacccgggg ggccggcggc ggcgcggact ctggacgcga gccgggccct 3240 tcccgtggat cgccccagct gcggcgggcg tcgcggccgc ccccggggag cccggcggcg 3300 gcgcggcgcg ccccccaccc ccaccccacg tctcggtcgc gcgcgcgtcc gctgggggcg 3360 ggagcggtcg ggcggcggcg gtcggcgggc ggcggggcgg ggcggttcgt ccccccgccc 3420 tacccccccg gccccgtccg ccccccgttc ccccctcctc ctcggcgcgc ggcggcggcg 3480 gcggcaggcg gcggaggggc cgcgggccgg tcccccccgc cgggtccgcc cccggggccg 3540 cggttccgcg cgcgcctcgc ctcggccggc gcctagcagc cgacttagaa ctggtgcgga 3600 ccaggggaat ccgactgttt aattaaaaca aagcatcgcg aaggcccgcg gcgggtgttg 3660 acgcgatgtg atttctgccc agtgctctga atgtcaaagt gaagaaattc aatgaagcgc 3720 gggtaaacgg cgggagtaac tatgactctc ttaaggtagc caaatgcctc gtcatctaat 3780 tagtgacgcg catgaatgga tgaacgagat tcccactgtc cctacctact atccagcgaa 3840 accacagcca agggaacggg cttggcggaa tcagcgggga aagaagaccc tgttgagctt 3900 gactctagtc tggcacggtg aagagacatg agaggtgtag aataagtggg aggcccccgg 3960 cgcccccccg gtgtccccgc gaggggcccg gggcggggtc cgcggccctg cgggccgccg 4020 gtgaaatacc actactctga tcgttttttc actgacccgg tgaggcgggg gggcgagccc 4080 gaggggctct cgcttctggc gccaagcgcc cgcccggccg ggcgcgaccc gctccgggga 4140 cagtgccagg tggggagttt gactggggcg gtacacctgt caaacggtaa cgcaggtgtc 4200 ctaaggcgag ctcagggagg acagaaacct cccgtggagc agaagggcaa aagctcgctt 4260 gatcttgatt ttcagtacga atacagaccg tgaaagcggg gcctcacgat ccttctgacc 4320 ttttgggttt taagcaggag gtgtcagaaa agttaccaca gggataactg gcttgtggcg 4380 gccaagcgtt catagcgacg tcgctttttg atccttcgat gtcggctctt cctatcattg 4440 tgaagcagaa ttcgccaagc gttggattgt tcacccacta atagggaacg tgagctgggt 4500 ttagaccgtc gtgagacagg ttagttttac cctactgatg atgtgttgtt gccatggtaa 4560 tcctgctcag tacgagagga accgcaggtt cagacatttg gtgtatgtgc ttggctgagg 4620 agccaatggg gcgaagctac catctgtggg attatgactg aacgcctcta agtcagaatc 4680 ccgcccaggc gaacgatacg gcagcgccgc ggagcctcgg ttggcctcgg atagccggtc 4740 ccccgcctgt ccccgccggc gggccgcccc cccctccacg cgccccgccg cgggagggcg 4800 cgtgccccgc cgcgcgccgg gaccggggtc cggtgcggag tgcccttcgt cctgggaaac 4860 ggggcgcggc cggaaaggcg gccgccccct cgcccgtcac gcaccgcacg ttcgtgggga 4920 acctggcgct aaaccattcg tagacgacct gcttctgggt cggggtttcg tacgtagcag 4980 agcagctccc tcgctgcgat ctattgaaag tcagccctcg acacaagggt ttgtc 5035 // ID Gypsy-34_CQ-I repbase; DNA; INV; 4077 BP. XX AC AAWU01012010; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_CQ_; KW Gypsy-34_CQ-LTR; Gypsy-34_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4077 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 447-447 (2011). XX DR GenBank; AAWU01012010; Positions 12702 8626. XX CC Positions [3121-3636] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(73..930,934..4044) FT /product="Gypsy-34_CQ-I_1p" FT /translation="MSDDEKKKTSAVTDAAQVETVTAPRLNPPSMTDSNIE FT SYFMSLEFWFAASGIATQHDGRRYNIVMAQVPVSKLTELKAIIDAVPEVGK FT YTYIKKALIDYFADSQQRRLQRVLSDMPLGDMKPSRLFNEMKRVAGTSLGD FT GVLLDLWSTRLPPHAQAAVIASKGDAADKTAIADAIVDSMALRNINAIAFE FT APRVPSDTTTTLPSSVDGMAAMQREIAELSRKLDQVWNFRDSRSGSRARSR FT TRQTYRSRDREPSNGPCWYHRVYGNDARRCRQPCNFDRPSTTSRPLDLVEP FT VAEIGELSDTATIFRLKISDTSTNMQFLIDTGADVSVIPRGVRSAEVKPSS FT MQLFAANGTPIKVYGEVLLKVNLGLRREFSWTFLIAEVTSGIIGADFICHH FT DLLIDLKRKRLIDNTTRLEAAGILARSSEYSIKTFSTDSPYAELLAKFPLI FT TRLAPPGTVTQSTVYHRIETTGQPTFARPRRLPPDKLQAARAEFEHLMQLG FT ICRPSSSNWASPLHMVKKADGTWRPCGDYRALNAITVPDRYPLPYLQDFTC FT NLHGKTIFSKVDLQKAFHQVPIHPEDIPKMAITTPFGLFEFAFMTFGLRNA FT AQTFQRIIHQVVRGLDFVFPYIDDIFIASTSPEEHLDHLRTLFERLQQHHL FT AVNAAKCEFGRSQITFLGHLVTAQGISPLPERVEAISSFPKPTSVKDLKSF FT LAMINFYRRFIPNAIVDQVPLLTLTPGNKRNDRTVLVWTDETTAAFERCKQ FT QLAQAALLAHPAKSAELSLWVDASNIAAGAVLHQLVAGKAQPLGFFSKKFD FT KAQLRYSTYDRELTAIFLAVRHFKYMLQGRKFHIYTDHKPITFAFRQNLDK FT ASDRQARQLDYIGQMTTDIRHVVGQENVTADLLSRIEAVQAGQPIDFAALA FT DAQKTDSEILDILRGNSSSSLQLKPFTIPGSSKQLYCDCSQNQLRPFVTRG FT FRDQILAATHNLAHPGTRATARLMAERFVWPNIRKDSIAYAKSCLQCQRSK FT VTRHTQSPTVRFAPPDSRFAHINIDIVGPFPPSNGQRYCLTIIDRFSRWPE FT AIPIPDMTATTIAQALVDGWIASFGVPAIITTDQGRQFESTLFAELVRILG FT ITHLRTTPYHPQSNGMIERWHRTLKAAILCHNTERWTEHLPLILLGLRTTY FT KEDIRASPAEMVFGTTLKIPSEFFAANNNIHTDSEFATNLREAMRCLRPTE FT TAWHGQRGVFVHPDLRSCQQVFVRNDSIRPSLSAPYAGPYEDVSRNDNHFK FT VSVNGRMVNISIDRLKPAYTAEDQQQQHQPELGAGPASTESSPLSTRTTRS FT GRRVTIPLRYR" XX SQ Sequence 4077 BP; 989 A; 1176 C; 1047 G; 865 T; 0 other; tattggtgac cccgacgtga tctagttcaa agtttccttc acgtatttcg cgtaattcga 60 acgtttcgcg agatgtcgga cgacgaaaaa aagaaaacct cggccgtcac ggacgccgcg 120 caggtggaaa cggttaccgc gccgcgactg aatccgccga gtatgaccga ttcgaacatc 180 gagtcatatt ttatgtcgct cgagttctgg tttgcggctt ctgggatagc cacccaacac 240 gacggccgtc gttataacat cgtgatggcg caggttcccg tgagcaaact aacggaattg 300 aaggccatca tcgacgcggt tccagaagtc gggaagtaca cgtacataaa aaaggcgctg 360 attgattact ttgctgacag ccagcagcgg cgtctgcaac gtgtcctttc cgacatgccg 420 ttgggggaca tgaaaccgag ccggctgttc aacgagatga aaagggtggc cggaacatct 480 ttgggcgatg gtgtgctgct cgatctgtgg tcgacaaggc ttcctccaca cgcacaggcc 540 gcagtgatcg cttcgaaagg agacgcggca gacaagactg cgatcgcgga cgccattgtt 600 gattcgatgg ctttgcggaa cataaacgcc atcgcgttcg aagcaccacg tgttccctcg 660 gatacaacaa ccactctccc gtcgagcgtc gacggcatgg ccgccatgca acgcgagata 720 gcggagcttt ctcgaaaact tgaccaggtt tggaacttcc gcgactctcg gagtggttct 780 cgtgcccgca gtcgcacccg gcagacctac cgtagtcgtg atcgggagcc gtcgaacgga 840 ccatgctggt atcaccgcgt ttacggcaac gacgcccgta ggtgccgaca accatgcaat 900 ttcgaccgcc cgtcgacaac aagccgccca tgactagacc tcgttgaacc tgtggcagag 960 atcggcgaac tctctgacac agcaaccatc tttcgcctca agatctcgga cacgtcaaca 1020 aacatgcaat ttttaatcga tacgggagcc gacgtgtccg taataccaag aggtgttcgt 1080 tcagccgagg tcaaaccatc atccatgcaa ctgttcgccg ccaacggaac gccgatcaag 1140 gtatacggcg aggtgctgct aaaagtcaat ctcggccttc gtcgtgaatt ctcgtggacc 1200 ttccttatcg ccgaggtgac gtcaggaatc atcggagctg attttatctg ccatcacgac 1260 ctcctgattg acctgaaacg gaagcggctt atcgacaaca ccacgcggct ggaggcagct 1320 gggatcttgg cacgatcaag tgagtactca atcaaaacgt ttagtaccga ctcaccgtac 1380 gctgaactgc tcgccaaatt cccattgata acccgcttgg cacctcctgg tacagttacc 1440 cagtctacgg tgtaccaccg catcgaaacg acgggtcagc caacgtttgc ccgtccaaga 1500 cgtctacccc cagacaagct gcaagctgcc cgagctgaat tcgagcacct gatgcagcta 1560 ggaatctgtc gcccttccag cagcaactgg gcgagtcctc tgcacatggt gaagaaggct 1620 gatggaacct ggcgaccttg tggggattac cgagcattga atgctatcac ggtgccggac 1680 cggtacccgt tgccgtatct gcaagatttc acctgcaacc tccacggaaa aacgatcttc 1740 tcgaaggtcg atttgcagaa agcctttcat caggtcccaa ttcatcctga agacatccct 1800 aagatggcca tcacgacgcc gtttggtctg ttcgaattcg ccttcatgac gttcgggctg 1860 cggaacgcag cacagacctt ccaacgaatc attcaccaag tcgtccgcgg actagatttc 1920 gtgttccctt acattgacga cattttcatc gcgtcgacgt cgcctgagga acatctcgac 1980 caccttcgta cacttttcga gaggctgcag cagcatcacc tggcggtcaa cgcagcaaag 2040 tgcgagtttg gtcgttcgca gattacgttt cttggtcatc tggtcacagc gcaggggata 2100 agtccattac cggagcgagt cgaagcaatc agcagctttc ctaaaccgac gtcggtgaag 2160 gacctcaaaa gttttctcgc aatgatcaat ttctaccggc gatttatacc gaacgctatc 2220 gtggaccagg tgccgctgct gaccttaaca cccgggaaca agcggaacga tcgtacggtg 2280 ctggtgtgga cggatgaaac aacagctgcg ttcgaacggt gtaagcaaca actcgcgcag 2340 gccgcactgc tcgcccatcc ggccaaaagt gcggaactgt cactctgggt ggacgcgtct 2400 aacatcgccg caggagctgt tctccaccaa ctcgtcgcag ggaaagcaca gcctttgggg 2460 tttttctcca aaaagttcga caaggctcag ctgcggtaca gtacctacga cagagaactg 2520 acagcaatct ttttggcagt ccgccatttc aagtacatgc tgcaaggtcg gaaattccac 2580 atctacactg accacaagcc gattacgttt gcctttcgtc aaaatcttga caaagccagt 2640 gatcgtcaag cccgtcaact agactacatc gggcaaatga cgaccgacat ccggcacgtg 2700 gtgggccagg agaacgtgac agctgacctg ctttcgcgca ttgaggcagt gcaagctggg 2760 caaccgatcg acttcgcagc gcttgccgat gctcaaaaaa ccgacagcga aatcctggac 2820 attttgcgag gtaattcgag ttcaagtttg cagcttaaac catttacgat tccaggcagc 2880 tcgaaacaac tttactgtga ctgctcgcaa aaccaactgc ggccatttgt cacacgtgga 2940 ttccgggacc aaatcttggc cgcaactcac aaccttgcac atcctgggac tagggcgacg 3000 gcgaggctga tggcggagcg gtttgtgtgg ccgaacattc gaaaggacag catcgcctac 3060 gcgaagagct gtttgcagtg ccagcgatca aaggtgactc ggcacacgca atcgccaacg 3120 gtccgatttg cacctcccga cagtcggttt gcgcacatca acattgacat tgtgggcccc 3180 tttccgccaa gcaacgggca gcgatattgc ctcaccatta tcgatcgttt ctctcgatgg 3240 cctgaagcga tcccgatccc cgatatgacg gcaactacaa tcgctcaagc acttgtcgat 3300 ggttggatcg catcgtttgg cgtgccagcg attatcacca ccgaccaagg acggcagttt 3360 gagtcaacgc tatttgccga gcttgttcgc atccttggga tcacacacct gcgaacaaca 3420 ccgtaccacc cacaatccaa cgggatgatc gagcggtggc acaggacact gaaagcggca 3480 atcctctgcc acaacactga acgctggact gaacatctgc cgctgatcct gcttggccta 3540 cgcacaacct acaaggagga catcagggct tccccggcag aaatggtgtt tggcacgacg 3600 ctgaaaatcc cgtccgagtt ttttgctgca aataacaaca tacacactga ctcggaattc 3660 gccaccaacc tgcgagaggc aatgcgatgt ctccgtccaa cggaaactgc gtggcatgga 3720 caacgtggtg tattcgtcca tccggacctc agaagctgcc agcaggtgtt cgtccgcaac 3780 gactctattc gaccatccct ttcggcgcct tatgctggtc cgtacgaaga tgtcagcagg 3840 aacgataacc acttcaaagt gtcggtaaac gggagaatgg tcaacatttc aattgaccga 3900 ctgaaacctg cctacactgc cgaggatcag caacagcagc accaaccaga gctcggcgcc 3960 ggcccggcct cgacggagtc ttcgccgtta tcgactagga cgacgcgttc tggacgacga 4020 gtgacaatcc ctcttcgtta tcgatgagat cgtcgcctag tctgcggggg agtactg 4077 // ID hAT-26_HM repbase; DNA; INV; 3138 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-26_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3138 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2015-2015 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 970..2601 FT /product="hAT-26_HM_1p" FT /translation="MIIATCNPVTLVDNASFRKLVNTLDYKFSLPASAKIT FT SLLNEEFTVLTRKLRDFISEGRRFTICLDGWTKKGLTASFLGISVCFFHFK FT SEKPIHALLNLYLVKHPHTGEQIANCFEKCLKYWNISREKILLVISDNGAN FT MVKGIKLSRMKAQAERDILIELESEVNEEQITDLEDDENMTNSEEWEHVDL FT IVDVIENVAFRRLGCIAHVIQLVVKLAYDGKYHGLLLKVRGLVGKVRKSSV FT ALEKIINKCGKTVISDCSTRWNSTYFMVRRFLEIKTSINEVLGDLNIDSLA FT NSEWIMLQDFVNLLEPFANETDILQTDALSLSSVIPSILNLECHLEQFGDA FT KDVAVKMLEDLRRRFAVLLQPNDPNFNPLPCAACLLDPTCAIALLGNEHTQ FT IRECAKKYILSEAKKSVQHDLPLASASTLSVEIAPEINGLTKWKFLAAKQR FT VEKIKLIPGNEAQNELNKYFIEIRNSSMCQNALNFWKMRRAVYPHIAPLAE FT DLIAAPATQAYVERLFSICGLLTNGRRNRMSASLEMRVFLKLNSHLV*" XX SQ Sequence 3138 BP; 1140 A; 458 C; 539 G; 1001 T; 0 other; gcagtgatat tttcgtacac gagaacgaaa acgagaacga aaaaaaaagc ctttcgttta 60 cgaaaacgaa aactaaaacg aaatttattt taagaacgag aacagaataa aaacgataat 120 tttaaatttt cgaacggaat aaaaactgaa tcgaaaatta ttttaaaacg aaatattttc 180 gttctagatt tgaaacgagt tgaattattt gatatatata attattatat tacaatacat 240 tataatgata tattattcga acttaattga attttattca attaagttcg aataagttta 300 gttatgatat agttcgaata tatttattcg aatatatttt ttcgaactaa aaaaaatcaa 360 actgtaagac caacacaact acgcgatgag tttgagtaaa ctttttgctg gtaggaagag 420 gaatagttca gtatggaaat ggtttaaata cgatgcgtca tccgacaaat cagtatgttt 480 agttttactt gatggtgaaa aagtatgcaa tacaaagttg tgtggaaaga acccaacaaa 540 tcttaaagtt agtttatctg ttagaggata tgttttcatt ttgaaacaaa cgactcattt 600 ttatcatttt agagtaatta ttcaataacg tatacgaact tatttataca ttaattgata 660 aacagataga taaataaaca tataaatata caaacaaaaa attatataaa tatgtttatt 720 tttttctagc tacatttagc aagaaaccat aagtctgtac acgttgagct ggaagagtcg 780 gaattaaaga aattaagtga aaaaaagcaa acagggataa agagaaaagc agtggatgaa 840 aatggaccca ctcttttaga atcgtcctca aatcagaacc agaccttgat tcaatgcatt 900 aatcgtagga ataccgcatg gcctattaat tcacacgagc ataaaattcg actaaactct 960 cttattcaaa tgataattgc aacatgtaat ccagtgacat tggtggataa tgcgagtttt 1020 agaaagttag ttaatacctt ggattataaa ttttcattgc ctgcttctgc caaaattact 1080 agcctactta atgaagaatt tactgtttta acgagaaaac ttcgtgactt catatctgaa 1140 ggcaggcgct tcacaatatg ccttgatggc tggactaaaa aaggtcttac tgcgtccttt 1200 ttaggaattt cggtttgttt ttttcacttc aaatccgaaa aaccaataca tgctctgctt 1260 aatttgtatc ttgtgaaaca tccacataca ggtgaacaaa ttgctaattg ttttgaaaaa 1320 tgcctaaaat attggaatat ttccagagaa aaaatattgc ttgttatttc ggacaatgga 1380 gcaaacatgg tcaaaggcat aaagttgtcg agaatgaaag ctcaagctga acgtgatata 1440 cttattgaat tagaaagtga ggttaatgaa gagcagataa cagatctgga agacgatgag 1500 aatatgacaa atagtgaaga gtgggaacat gttgacttga tagttgatgt aatagaaaac 1560 gtagcattta gaagattggg atgcattgca catgttatac agctagttgt aaaacttgca 1620 tatgatggta aatatcatgg cttacttttg aaggtacgtg gattagtggg aaaagttcgc 1680 aaatcttctg ttgcattgga aaaaattatc aacaaatgtg gtaagactgt gatcagcgac 1740 tgttctacaa gatggaacag cacttacttt atggttcgga gatttctgga aataaagaca 1800 tctatcaacg aagtgcttgg agaccttaac attgattcac ttgccaatag cgaatggata 1860 atgcttcagg attttgtcaa tcttttagaa ccttttgcca atgaaactga tattcttcag 1920 acggatgcac tttcactatc cagtgttata ccttcaatac ttaatttaga atgtcacctt 1980 gagcaatttg gagatgccaa agatgtagca gttaaaatgc ttgaagatct acgccgtcgt 2040 tttgccgttc ttttgcagcc aaacgatcct aattttaatc cactgccatg tgccgcatgt 2100 ctacttgacc ccacatgtgc aatagctctt ttaggtaatg agcatacaca aatcagagag 2160 tgtgcaaaga aatacatttt gtcagaggct aaaaaatcgg tgcaacatga tttaccttta 2220 gcaagtgcat ctactctttc tgttgagatt gctccagaaa taaatggatt aacaaaatgg 2280 aaatttttag cggcaaaaca gcgagtagaa aaaataaagc ttattccagg taatgaagca 2340 caaaatgaac tcaacaaata ttttattgaa ataaggaaca gctctatgtg ccaaaatgct 2400 ttaaatttct ggaagatgag acgtgctgtt tacccacata ttgcaccatt agctgaagac 2460 ctcattgcag ctcctgcaac tcaagcatat gttgaaagac ttttttcaat ctgtggcttg 2520 ctcacaaatg gacggagaaa tcgaatgtca gcgtcattgg aaatgcgagt tttcttaaaa 2580 ctaaattctc atctagtcta gatgataatc atataatcac taggcccttt atttgtaaaa 2640 tatatgagta attattaaag tctatacaat aactattcat tagtttttat tgttaccaaa 2700 tttgaggagt tttttttata cctaataatt gaaatttttt taagttttag agcaataagg 2760 acaggttgtt gttgtgtttt ttattaaggg cggtgggcgg ggctctaaac ttctagccat 2820 gggactgggt gtattacatt taattaagtt ctcatttttt tgtttgttta tagaactaag 2880 acttacttct tttaattata tatgcctata ttatttataa attatactaa attataatta 2940 gatgagataa aaaatttata cgacaaacga aaattgaact aaataaaaac taaaactaaa 3000 attattttaa gatgaactga ataaaaacta gaactaaatt tatttactga actgaataag 3060 aactagaact aaaaactttg cctctaaact gaataaaaac taaaactaaa attattttcc 3120 gaacaaaaat atcactgc 3138 // ID P-24_HM repbase; DNA; INV; 3038 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-24_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3038 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 370-370 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(201..992,934..1314,1318..2610) FT /product="P-24_HM_1p" FT /translation="MSVYRFPRAEEEKKCWIKCIPNANLKVTSNTVICELH FT WPKNFEKVKVKGRERPKNPPSIWPNVPLSQIPTSIPSLRSTKRALSSNRTS FT FDEIDLFLAQDNFNYDFFCDTLLNKKRNFSVDIVAFISDSKLCVQSTDFIN FT YATPKFIIKFDSCLKFESYFNGVKRKVHTVSNVVNTWSKFEEILRYLNLLT FT PNSKEIVMNEQIDSMNTNKYSLSVIIRAFEYFATSRTLYKKLRDDFKLPSI FT RTLTRITSKVSKSDDNFFFKRSFHALLLKYRNLMIIFFLKEVFDNLSDDQK FT TCIILHDEVYVKKMLLYHGGALFGKALDNPDSLAKTVLGVMITCMFGGPKF FT LVKMLPVNKLNSQFLYDQIHLTVEAVKESNGNAKVIICDGNRTNQAFFKFD FT TFPEKPWLTTNGTYLLFDFVHLIKNIRNNWLTEKKGELIFYENGVEKTACW FT SHLIKLFEADSNCLMKLSDLNEVSVRPSHIERQSVSTCLKVFSEKTYSALL FT NHPETSNILNINETADFIKMVVTWWKILNVKSVGVDCRFNDDRQAVIRDPN FT DDQLKFIWDFGEMALRMTSNQGKRIKQLTHDTGHSINHTCKGIVELTKTLL FT KETHKYVMLGKFTTDPLEKEFSKLRQGSGGTYFLSVQQIIEKFNISKTSLL FT LSYNINESTPDAGHSCQFCGFLLDESSAEVFDNLPTLEKSITVQSKMSLVY FT IAXYVSRNDDLLNENELLTRTMFYFEKFGQYLKSVDRGGLKVPFDNTVQWV FT FFSFVLFNSIKDKVCRTSLCNTFMLISDHYSFDMEQHHGNILSNIFIKNHC FT KLSTPRSNKEPSLKILKLSC" XX SQ Sequence 3038 BP; 1078 A; 437 C; 458 G; 1061 T; 4 other; catggcgctc tttaagttac aggccgtgac acgcaaaatt tagaaaaaaa tgtgggggcc 60 gatattaact taataatatc gacataactt aagttattat cggcgtwgaa gtacttttat 120 aaaaaaatgg taaaaaatgc tgtgtttatg gctgttctac gaattactta tcacagaaaa 180 aaaaatcaaa cgaatctaaa atgtctgttt atagatttcc aagagctgaa gaagagaaaa 240 aatgttggat aaagtgcatt cctaatgcaa atttaaaagt gacaagcaac actgtaatct 300 gtgaactaca ctggcccaaa aattttgaaa aagtaaaagt aaaaggaaga gaaagaccaa 360 aaaatccacc atcaatttgg cccaatgttc ctttaagtca aataccaaca tctataccct 420 ctttaagatc aactaaacgt gcattatcct ctaatcgaac ttcttttgac gaaatcgatt 480 tatttttagc acaagataat tttaactatg actttttttg tgacacattg ttaaataaaa 540 aacgcaattt ttctgttgat attgtagctt tcatttcaga tagtaaactt tgtgtacaat 600 ccacagattt tattaattat gcaacaccaa aatttatcat taaatttgat agctgtttaa 660 agtttgaatc atatttcaat ggtgtaaaaa gaaaagtaca tactgtgtct aatgttgtta 720 acacatggtc taaatttgaa gaaatcttgc gatacttgaa ccttttgaca ccaaacagta 780 aggaaatagt aatgaatgag caaattgatt ccatgaacac aaataagtat agtcttagtg 840 taatcatcag agcatttgag tattttgcta cctctcgtac actttataaa aaattgcgtg 900 atgattttaa actaccttct attcgaacat tgacacgcat tacttctaaa gtatcgaaat 960 ctgatgataa tttttttttt aaaagaagtt tttgataatt taagtgatga tcaaaaaact 1020 tgtattattc tccacgatga ggtttatgtt aaaaaaatgt tattatatca tggaggtgct 1080 ctttttggaa aagcattaga taacccagat tctcttgcaa aaactgtttt aggtgtcatg 1140 ataacttgta tgtttggtgg acctaaattt cttgtgaaaa tgttacctgt aaacaaatta 1200 aattctcaat ttttatatga tcaaattcat ttgactgttg aagcagttaa agagtctaat 1260 ggcaacgcaa aggtaattat atgtgatgga aatcgaacta atcaagcatt tttttaaaaa 1320 tttgacacat ttcctgaaaa accttggcta accaccaatg gaacctattt gttgtttgac 1380 tttgtccact taatcaaaaa catccgaaac aattggctca cagaaaaaaa aggagaatta 1440 attttttatg aaaatggtgt tgagaaaaca gcttgttgga gtcatttaat aaaacttttt 1500 gaagctgatt ctaattgtct aatgaaactt tctgatttaa atgaagtttc tgttcgacca 1560 agtcatattg aaagacaatc tgtttctacc tgtctaaaag tgttttctga aaaaacatac 1620 agcgctttac ttaatcatcc tgaaacctcc aacatcttaa acataaatga gactgctgat 1680 tttattaaaa tggttgtcac atggtggaaa attttaaatg taaaaagtgt gggtgtagat 1740 tgtaggttta atgatgatcg acaagctgtc atcagagatc caaatgatga tcaattaaaa 1800 tttatttggg attttggaga aatggcatta cgtatgacaa gtaatcaagg raarcgtata 1860 aaacaattaa cacatgatac aggacattcc ataaatcata catgtaaagg aattgttgag 1920 ctgacaaaaa cactattaaa agaaacacac aaatatgtta tgttgggaaa atttacaaca 1980 gacccacttg aaaaagaatt cagcaagtta agacaaggat ctggtggcac atatttttta 2040 agtgttcaac aaataattga aaagttcaat atttctaaaa cttcactttt gttgtcatat 2100 aacattaatg agtcaactcc tgatgcagga cattcttgcc aattttgtgg atttttactt 2160 gatgaatctt cagctgaggt ctttgataat ttaccaacac tagaaaaaag tataactgtt 2220 caatcaaaga tgtctcttgt ttatattgcc rgatatgtgt ctcgtaatga tgatttgctt 2280 aacgaaaatg aacttttaac cagaacaatg ttttattttg aaaagtttgg tcaatatcta 2340 aaatcagtcg atagaggtgg gttaaaagtt ccttttgata atactgttca atgggttttc 2400 ttctcatttg tactatttaa ctctattaaa gataaagttt gtcggacatc cctgtgcaac 2460 acttttatgc ttatatcgga tcattactct tttgacatgg aacaacatca tggtaatata 2520 ctgtcaaata tttttatcaa aaaccactgc aaactttcta caccaagatc aaacaaagaa 2580 ccgtctctga aaattttaaa gttgtcttgt taagttgaca gttatttatt gacacttatg 2640 ctgttatttc tttattgaaa tttattaatt gatttaaagt ttctgtttta tgattttaaa 2700 agtatttctt atttatttaa atcactaaaa tatattccta tttaaaggtg cagttaccct 2760 aacagtaatt tatttaactt ttgtttttta cctgatataa aactttttta tttcattatt 2820 taagcaattt tatgaattga aacaattaaa taaagtttat tgttaaaaac ttaaaaagat 2880 ttataaattg tacaaatttt tacatagttt aatgctttat ataacgggag caacatattt 2940 tgttttaaaa caattatcat attgattttt gaaatatcgg cccccacatt tttttctaag 3000 ttttgcgtgt cacggcctgt tacttataga tcgccgtg 3038 // ID DNA8-62_AP repbase; DNA; INV; 223 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-62_AP. XX NM DNA8-62_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-223 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1996-1996 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 223 BP; 64 A; 42 C; 59 G; 58 T; 0 other; agggccggat tggttaggcg agctaccgag aaaatctcgg tgggccgctt aaagtttggg 60 ccggtcctgt gagtgtgagt gaaaataaat tcatgtttaa aatacgattg gccacctata 120 agctcgtaat aattatagac gtaattttgg gccactgaat atgtaaatgg gccgcttata 180 gaagtgaaaa tctcgatggg ccgatacata ccaatccggc cct 223 // ID Gypsy3-I_AP repbase; DNA; INV; 4801 BP. XX AC Contig4708; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3AP; KW Gypsy3-I_AP; Gypsy3-LTR_AP. XX NM Gypsy3-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4801 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 441-441 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [3749-4225] - Integrase core CC LTRs are 94% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 362..4618 FT /product="Gypsy3-I_AP_1p" FT /translation="MGENSASTSKAGHEVLSPRRLRNGSVLPSSSDVITSV FT NNRANEDTDSQRPSSVLPQSHPVPVDNQLLTNALLSLMEQNKVLLSRLVND FT STPPLVQSCVPNNNGYYVMPDFNHSLTDFSGRESNAEARAWLQSIESVAKL FT HNWPDNFKLETVRSKLIGPSHNWYVGRTFSNWTSFVDQFTTTFVGHEQCTV FT DRVKIMSSRVQLKGECVIEYFHHKARLCREILLPFQEVKMQIIEGLYYHDM FT CNYLMARTHMSEDDLLTDIKSYNNLRNARSSRFRSAETTESFNKNNGFSFN FT RRTDVKNHDTVQRLPPEMKSSVPKDMSNVTCHRCGIKGHIAPACTAKVQRS FT EKRSCFRCNGHGHLARDCPQNRSNQHPASQTSDHGENVSILERPPQDIVPP FT YTLSAKIMYPNGNCSSITALIDTGSPVSLLKLMVVSYTSGIKPPPSGLIGI FT NGSMLNIIDEFFADLHHVDLDTPINLNFKVVPDSTIQTDCLLGRNFLAHPR FT VNLSVIDGQFIITFKQTDNVPFNEILSLDFDLNNYRELDIDLNINDKLSND FT VITSVKRIYINNYVKHCAQDIDDDLPEMHIQLKAHNSFYFRPRRLSFYEKE FT KLQIILDDMLKKHIIRPSTSEYSSPIVLVKKKNGDLRLCVDYRELNKHIVK FT DRYPLPLIDDNLDLLRGKKYFTCFDLKDGFHHIYVARDSIKYTSFTTPLGQ FT FEYLKLPFGLATGPACFSRFIKNIFDEFIRKKEVIVYFDDIMVATETIDEH FT LDILARILTTMKNKKLEIRLDKTQFLKSEVIYLGYRVNADGIQPNPKNVAI FT IDNYPIPSNQKELHSFIGLASYFRRFIPNFALLTQPLYKLLKKNVIYEFGE FT EQLKSFDIIKSKLNAQPLLSIYDPNAQTELHCDASSQGYGAILLQQQNDNN FT FHPVFYYSHRTTDVESRYHSYELEMLAIVNAVKRFHIYLQGVHFKIVTDCN FT SVTLTLRKKDINPRIARWALFLQNYSYEIEHRPGSKMQHVDALSRCRHILV FT LEGCTFNQTLAIRQSTDPEIKNLIKILEKSEHPLFEMRNGLVYRKHGNRLL FT FYVPAGMYEQIIRTCHDDMCHIGVNRTIEFIKQVYWFPKLAEHVKKYINNC FT LKCIIFSPKEGKVEGLLNIIDKDNKPFQTIHIDHYGPLNKTKNRFRYILVV FT VDAFSKFLILYPVRSVTTKETCLKLVQYFSYYSKPVKIISDRGSCFRSDVF FT REFCEQHEIKHIMTAVGSPQSNGQVERYNRSLTVMLSKLMHEHSQNWNEHL FT NKIQFAINNTHNRSINNTPSKLLFGMNQVGNTNDYIKIYLETLNENDNDRE FT FETTRQNAFDITRQNQLKNKAYYDAKHKQANTYSIGDYVMVKNVDTTPNTC FT KKLIPKFKGPYKILKILPNDRYVVADIDGYQVTQIPLNTVIAARDIKPWVT FT FKKND" XX SQ Sequence 4801 BP; 1627 A; 817 C; 846 G; 1511 T; 0 other; gccgttagta gtagtagtag tccagcgaca ttcggcgata cagtaggccg tggtgctcgt 60 cgtttttttt taattgtatt tgagtattaa ttattagtat tttatgccag gtggtctcga 120 tagtggcggg atactggcga gaagtgttgt gtgttaatta tttaatatat atataagtgt 180 aataactttt cataataaga tttaataaat gtcagcgcgc ataactatat aattttgttg 240 ttcctgactt aaatgccctc tgacctacaa attcagaagt ggaattcgtt tggggagtat 300 ttgaattcaa ggtgtttcga aattaaaacg taagtgtgta ttatttcgtt ataagtcgaa 360 aatgggcgaa aattcggctt caacatctaa agctggtcac gaggttttgt cgccgagacg 420 attgcgtaat ggttcggttt tacccagttc tagtgacgtc ataacgtcag tgaataatcg 480 agcgaatgaa gataccgatt ctcaacgacc atcgtcagtg ttaccacagt cgcacccagt 540 accagttgac aatcagttat taaccaatgc actgttatcg ctaatggaac aaaacaaagt 600 gttgttgtca agattggtaa atgatagcac accaccgctt gtgcagtcat gtgtacccaa 660 taataatggt tactatgtta tgccagattt taatcattca cttactgatt tttccggacg 720 tgagtctaac gcggaggctc gagcatggtt gcagtcgatt gagagtgtag caaaattaca 780 caattggcct gataatttta aattagaaac agttcggtcg aagctcatag gcccatccca 840 taattggtac gtcggccgta ctttttcgaa ctggactagt tttgttgatc aatttactac 900 cacttttgtg ggacatgagc aatgtactgt tgatcgtgta aaaataatgt ctagtcgtgt 960 tcagttgaaa ggtgaatgtg ttattgagta cttccaccac aaagctcggt tatgtcgcga 1020 gattttattg ccgtttcaag aagttaaaat gcaaataatt gagggattat attatcacga 1080 tatgtgtaat tacctcatgg ctcgaactca tatgagtgaa gatgatttat tgacagatat 1140 aaaatcgtat aataatttgc gaaatgcaag gtctagtcga tttcggtcgg cagaaactac 1200 agaatcgttc aataagaaca atggtttttc attcaaccgt cgcacagatg ttaagaacca 1260 tgataccgtt caacgtttac cacctgagat gaagtccagt gtaccgaagg atatgtccaa 1320 cgtcacatgc catcggtgtg gtataaaagg tcatattgct ccagcgtgca cagccaaggt 1380 acaacgaagt gaaaaacgat cctgtttccg gtgcaatggt catggacatc tagcacgaga 1440 ttgtccacag aatagaagta accaacatcc agccagtcag acgagtgatc atggtgagaa 1500 tgtgtcaata ttggaaagac ccccccaaga tattgtaccc ccatatacat taagtgccaa 1560 aattatgtat cctaatggta attgttctag cataactgct cttatagaca caggtagccc 1620 tgttagcctc cttaaattaa tggtagtctc gtatacatct ggtattaaac cacctccctc 1680 aggtttaatt ggtataaatg gttctatgct taacattatt gatgaatttt ttgctgattt 1740 acaccatgtc gatcttgata caccaataaa tttgaacttt aaagtagtac ctgatagcac 1800 aattcaaact gattgtctgt taggacggaa ttttcttgca caccctcgag ttaacttatc 1860 agtaatagat ggacaattta ttatcacttt taaacaaact gacaacgttc catttaacga 1920 aatattatct ttagattttg acttgaataa ttatagagaa cttgacattg acttgaatat 1980 aaatgataaa ttaagtaatg acgttattac cagtgtaaaa cgaatttata ttaataatta 2040 tgttaaacat tgtgcacagg atattgatga tgacctccct gaaatgcata tacagctaaa 2100 agcacataat tctttttatt tccgtccacg acgattatca ttttatgaaa aagaaaaatt 2160 acaaataata cttgatgaca tgctaaaaaa acacattata cgtccgagta cgtccgaata 2220 tagtagcccg attgttcttg taaagaaaaa aaatggagat ctgcgacttt gtgtcgacta 2280 tcgcgagctc aataaacata ttgtaaagga taggtatcca ttgccgttaa ttgacgataa 2340 tctggattta ctccgtggga aaaaatattt tacttgtttt gatttaaaag acggatttca 2400 tcatatttat gtagctagag attcaataaa atatacttcc tttacaactc cactaggaca 2460 atttgaatac ttaaaattgc catttggttt agcgacgggt cctgcttgtt ttagtagatt 2520 tattaaaaat atttttgacg aatttatcag aaaaaaagaa gtaattgttt attttgacga 2580 cattatggtg gccactgaaa caatagacga gcatctcgat atattagcta gaatattaac 2640 caccatgaaa aacaaaaaat tggaaattcg cttagataaa actcaatttt taaaatccga 2700 agttatttat ttaggttacc gtgttaatgc tgacggcatt cagccaaacc caaaaaacgt 2760 tgctattatt gataattacc ctataccgtc gaatcaaaaa gaactccata gttttatcgg 2820 acttgcatct tacttccggc gattcatccc aaactttgcg ttattaaccc aaccattata 2880 taaattatta aaaaaaaatg ttatttatga atttggggag gaacaactaa aatcattcga 2940 tattattaaa tccaaactta atgctcaacc actcttatca atttatgacc ctaacgctca 3000 aaccgaatta cattgtgacg caagtagtca agggtatggt gcaatattgc tgcaacaaca 3060 aaatgataat aatttccacc ctgttttcta ttatagccat cgaaccactg acgtagaatc 3120 tagataccac agttacgaac tcgaaatgtt agcgatagtt aatgctgtta aacgatttca 3180 tatttattta caaggtgtcc actttaagat tgttactgat tgcaacagcg ttacattaac 3240 tttacgtaaa aaagatatca acccgcgtat tgcaaggtgg gcgttatttc tacaaaatta 3300 ttcgtatgag atagaacacc gtcctgggtc aaaaatgcaa cacgtggatg cattaagtcg 3360 ttgtcgacac atattggtat tggaaggatg tacttttaac cagacactcg ctattaggca 3420 aagtactgac cccgaaatta aaaatttaat taaaatactc gaaaaatcgg aacacccctt 3480 atttgagatg cgtaatgggt tagtgtatag gaaacacggc aacagacttt tgttctatgt 3540 tccagctggt atgtatgaac aaataatccg aacatgtcac gatgatatgt gtcacattgg 3600 tgttaatcgt acaatagaat ttataaaaca agtttattgg ttcccgaagt tggctgaaca 3660 tgtaaaaaaa tacataaata attgcttgaa atgtattata ttttctccca aggaagggaa 3720 agtagaagga ttgttaaaca taatcgacaa ggacaataag ccattccaaa ctattcacat 3780 cgatcattac ggcccgctca ataaaacaaa aaatcgtttt aggtatattc tagtcgttgt 3840 agacgctttt agtaaatttc tgattttata tcccgttcgt tctgtcacca ctaaagagac 3900 ctgtttaaaa ctagtacaat atttttcata ctacagtaag ccagttaaaa ttatatccga 3960 tcgcggatcg tgctttcgat ctgacgtttt ccgagaattt tgtgaacaac atgaaattaa 4020 acatattatg actgcagtag gatcgccgca gtcaaatggc caagtagagc gatataatcg 4080 tagtctcacg gtaatgctat ctaagttgat gcatgaacat agccagaatt ggaacgaaca 4140 tctaaataag atacaatttg caataaataa tactcacaat agatctatca acaatacacc 4200 aagtaaactt ctttttggta tgaatcaagt aggaaacact aatgactata taaaaattta 4260 tttagaaaca ttaaacgaaa atgacaacga tagagaattt gaaacaacta gacaaaacgc 4320 atttgatatt acacgacaaa accaattaaa aaataaagcc tactatgatg ctaaacataa 4380 acaagccaat acttattcta taggtgacta tgttatggta aaaaacgttg atacaactcc 4440 aaatacatgt aaaaaactta taccaaaatt taagggacca tataagatac ttaaaatatt 4500 acccaacgat agatatgttg tggctgatat tgacggatat caagtaacac aaataccctt 4560 aaataccgtt attgctgccc gagacataaa accctgggtt acatttaaga aaaatgacta 4620 ataattattt taatttgtaa ttgtatactt tatttattat atatataggt attgtttatt 4680 ttattttatt tttttttttt tagtgttatg taaacatttt aattattatt ttgttattat 4740 tataaattgt atagaataag accatgatcg aggtcgatca atattgtcag gaaggccgag 4800 t 4801 // ID Gypsy-28_OD-I repbase; DNA; INV; 9962 BP. XX AC CABV01003315; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_OD_; KW Gypsy-28_OD-LTR; Gypsy-28_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-9962 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003315; Positions 9802 19763. XX CC Positions [5087-5596] - Integrase core CC 'CTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 399..2390 FT /product="Gypsy-28_OD-I_3p" FT /translation="MRLNYTILPTEENLIENIFAENKNLISKNKEGKPINM FT TKDSQFAKLKEIFEKKGIVKIPFRFPTASIPDIIEFYKLDVERLSLSDKIV FT RRDELTLPAPNELRDTTQLIAALEKLDPKFMIDQLAIKMNNKIVVKEENLT FT DLIEKAKDTIESNLQTAHDNMKLKEQIEHLTSACDHNDRTTSECMVELSNL FT EEEHQKVKFENAYLRQENKRAKDELNDASLELAKALAEATTWKAKITKSNL FT QTFEASSRHSDYQENKDSNEKDKQSLPENKRTDNFDAYIGRDETEWTNTFL FT EEKTKPLTNRVNVNNLKVSLKIPIWENSPSIDKTSSLNDFIDKLRYFRSMN FT ILSDAQTIYSSLEASSRIDILRELDRETLKDIDKFVKYLRMAHGGSSLKQR FT SNLESLIQSPIESALSFFKRVIREYYLSKGLEPRDPKDITEKDRQEDIVYH FT FSRGLRNNTTGTQIRMNRLTTNFEKLGEIANHIDQALEPINSINNVQLIEQ FT INKLSLNEPTNHVNAVDRNYNPDYNKSRQCLTCGFYGHQSRDCYANQRTQG FT RYSRQNRSRSQGRNQRNDRSKSPYRRSYDNRGRSPYRGSRDNRGRQNTPYN FT RDRRGRSDSHGSSRGRSASYDKNRKYSRDYKQTRTSGSRERRDSGNRDRQN FT SRDRRSNSRESRAKW" FT CDS 2387..6340 FT /product="Gypsy-28_OD-I_2p" FT /translation="MVTCLPIVHLKILYDNVQYRVNFLLDTGASANILSSH FT MRIKPETESKSEIKSFDGTIKKLNVGSIHADVITNRKDEILDKNIKFIITE FT LEYDGILGMPTLNNFDIKFYDQSIRLINNKSGKEFNLERINLSVCNIESLS FT LKPMESKIILLKENIKINNAYALVGSSAMNQSIPAKIVNNSIEIFNDSSKF FT LISDDHKLYICQFGVTKQKATDIEHDRMPHYRRSLNMPKLKPETFKINKNL FT TAEQKQEILDLLSEFDSIFARSEIDINPGFREPYMYKINTSTPAQQPAKFV FT QTASKYNEELQKHIDMLEQQNVIEQIEFADVINAGFVVVSKKCGRLRFCLD FT LRSLNNVTIPNKNYPIPHMDTILGNLSGNRYFSSLDLTSAYHQFLIDPRDR FT ERYTFIGPNKKIYRYRRVPFGGRLITAWLQALMQQVILKDLENTNAYIDDI FT NIGSKTFEDHKNNLRNTLNRISFHNLELSARKCSFAYEETKVFGYIVNFDG FT YTPDPARIETLKIPMPKNKKELLSTLAGINYYRSTIPKFAELASNLFKFTR FT TTSDFDPEDIETKNEWKKLIDALGKAIMIQKPDLNRQFILRTDASKKAFGN FT MLSQKNDKGKEIILAVESKQFTDQQIVWSIGMKELLSAAHAVKRYTNLIEG FT RKFLLITDCKSVYYLLKNRKEVNLTASNPLTRNFLFLMIYDFEIKWSKGTK FT KDFLLTDLLSRKKIDGSTEIVIGKKSKDPLLSIKLLDGKLFGIDERTEYTE FT KGTKSEIQSVNAINLEPVKYFEIDEILKEIKRSQMDDENLRKKIKRIANGK FT KLETYKTIVEAPYEHPILYHSVNEKWSLMVPKYLIPKVLSKIHRHTSKSFD FT LSKIEKLGLFWNNMAESITSWHQSCTECTISKPNTAPQKHTNGQIGDFPRG FT PFESICIDAIHIRPYLALVSVDHFSGFTSAYAIDNERIETLINATLSLCLR FT FIVPKMIRLDNHRSFKSTKFIDAMNKMDITLSFTTPANSQANGACERRIRQ FT VQERLKTLTMNNVVNGSESMTIMDLQTCIDLIVFEINMSPRNNKICPLMIM FT TGIEPSFQIHTPHILKELEEPHGQRLTDLRNEIQARIAKDYENEASAEITP FT SQEMKIKVGDFVRIQKASVATKTKREQTRYSKEIYQIKEILNRYGTVKIME FT VKETDKIDRRRPEIRIISVKKLKRIIDRSKLIETYEKKTQKEWAKDCEVPK FT EVGHKSTSPKEWAKKSEEVGRRSKSPKKWAETSQEVGRGSLSPKEVGQLDK FT AKTRLKGPEKHVKKVRFDIIENSKNDKKDKSASSQKSSDNPQSRYSLRRRA FT NVDYKD" FT CDS 6379..8715 FT /product="Gypsy-28_OD-I_1p" FT /translation="MREKLKIYILFLFSTIALGYRGHEENALYVRKIDPAE FT SKIAYNAQILASVGIALIRNSNENSVNIRRESWHPIKLVYRLPTLKDFIVQ FT SCPKSLHATLQSRMDNNAEMQLLTKEAESLKNQFLHELTNATPYIAQRKRR FT NTTSANSYNNITSSGKPRVHRVAPNANKSENSANDTINMSHTQHMDWSESS FT LSEIYASSLSSDNSVTNNVESYSSESYSNENEMIKSEPIANETEYEPSGFC FT NYNTNEAPIEALKFFKMECENYICCNQITCKTVNQRTVAANLIEELIDARQ FT YMNQTERDLLKSKLSEISVCFKPQPQRSRKSLGFWSYWSKGGPLTPNSINA FT EIEEVEKLENNQIHALSEKMLSRETFNAAIESEESKLKLLSSTLCTISTEL FT LGSSKITEMRNHYIQLEDKIERAVATCEKGNRPVGLSAKLLISLCMRINPN FT DNRFCRDPTVLSKINCKAKKLEINESEIVFHMKIGMYEFLEDSVTTQIVAT FT PFIDQSGRPFILSDVPKTIVETHRHIISTTCEKTLGETINICSLRNAVDVN FT ANLACVNALTSQSKNLIKISCKKSSYRGDKCITYGLPNGGYAVWSSKGIEI FT KALARDNAFSKTIRMIKPNNLTIISEKDVNFECNAVQYSTNSDTKEISYEA FT NSEINIDLEAKNIINDEFGLFNNRLSEIEKTQIANHKNATSLATSQLSKIL FT NINPKHEPKIIVFFRYLIYGLAAMSTFWIMKKLITFAINKYKKYKRKKRRS FT FTDLINNQVRPDNSRTEITTEQQTTSNL" XX SQ Sequence 9962 BP; 3939 A; 1834 C; 1833 G; 2356 T; 0 other; ttggtgtcag aaaacgaacc aaaaacgcct gacaatcagt gtcaaatcgg aaaatcggaa 60 aatcggaaaa tcgaaaaatc cggaggggag gcgaaccagt cggacttccg actggttcaa 120 ctctttttgt ttcatgaaac gaaatataag tgttgttttc agcaacatct ttcttaattt 180 gagtacattt tcaagctgga atgtaaaatt caagctttga agaataagaa aatcaaagtc 240 ggggcgaacc agtcggaaat ttgtttgttt ttcaaaatca aggcgaccaa atgtgtttct 300 agaactgtta ttcaaagtcg gggtgaacca gtcggaattc cgactggttc gcctccgctc 360 ccgaaaaatc ggaaacgtaa atctgtcaaa cagcaaaaat gcgattaaat tataccattc 420 taccaactga agaaaatctt atcgaaaaca tattcgccga aaacaaaaat cttatttcaa 480 aaaataaaga aggaaaacca attaacatga cgaaagactc acagttcgca aaattgaaag 540 aaatttttga aaaaaaggga atagtgaaga ttccatttcg atttccaaca gcttcaatcc 600 cagatataat agagttttat aaactcgatg ttgaacgttt atcgctatct gacaaaattg 660 tgaggcggga tgagctcact ctcccagcgc ccaatgagct gcgtgacact acacaactga 720 tagcggcact agaaaaactg gaccccaagt tcatgatcga ccaattagct ataaaaatga 780 acaacaaaat agtcgtaaaa gaagaaaact tgactgatct aatcgaaaaa gcaaaagata 840 caatcgaaag taacttacaa acagctcatg ataatatgaa gttgaaagag caaatcgagc 900 acttaacatc cgcatgtgat cataatgaca gaacaacaag tgaatgtatg gtcgaattat 960 cgaatctaga agaggaacac caaaaggtga agttcgaaaa tgcatatctg agacaggaaa 1020 ataaacgcgc taaagatgaa ctaaacgatg cctcgttaga gctagcaaaa gctttagccg 1080 aagctacaac gtggaaagca aaaataacaa aatcaaattt gcaaacgttt gaagcaagtt 1140 ccagacattc tgattatcaa gaaaataaag attcgaatga aaaagacaag cagtcactac 1200 cagaaaataa aagaacggat aattttgacg catatatcgg acgtgatgaa accgaatgga 1260 ctaatacatt tttagaagaa aagacaaaac cattaactaa tcgtgtaaac gtaaataatc 1320 ttaaagtttc actgaaaatc cctatctggg aaaacagtcc gtcaattgat aaaacatcgt 1380 cactaaacga ttttatcgac aaacttcgct atttccgttc aatgaatata ctgtcggacg 1440 ctcaaacgat ctactctagt ttagaagcat caagccgaat tgacattctt cgtgaacttg 1500 acagagaaac gctcaaagac atagacaaat tcgttaagta cttacgaatg gcacatggag 1560 gcagttcgct taaacaaaga agcaatttag aatctttaat ccagtcacct atcgaatccg 1620 cactttcatt ctttaaacga gtaatccgag aatattatct atcgaaagga ttagaaccaa 1680 gagacccaaa agatataacc gaaaaagaca gacaggaaga tatcgtttat cacttttcaa 1740 gaggccttag gaacaatact acaggaactc agataagaat gaatcgactc accacaaact 1800 ttgaaaaact cggagagatc gcaaatcaca tcgaccaagc acttgaacca ataaattcaa 1860 ttaataatgt tcaattaatc gagcagataa ataaactgtc gctcaacgaa ccgacaaatc 1920 atgtcaacgc agtagatcgt aattacaatc cagactataa taaaagtcgc cagtgtttga 1980 cttgtggatt ttatggccac cagagcagag attgctacgc aaatcaaaga acgcaaggca 2040 gatattcacg tcaaaatcga agcagaagcc agggcagaaa tcaaaggaat gacagaagca 2100 agtcaccata tcgtagatcg tacgataaca ggggacgatc tccatatcga ggatccagag 2160 ataatagagg aagacagaac acaccttaca atcgcgatcg gcgagggaga tctgactcac 2220 acggctcatc gagaggtcga agtgcgagtt atgacaaaaa tcgaaagtac tctcgggact 2280 acaaacagac acgaacaagt ggaagtagag aaagaagaga ctcgggaaac agagatagac 2340 aaaactcgag agatcgcaga tcgaactcaa gagagtccag agcaaaatgg taacatgctt 2400 accaattgtt catttgaaaa ttttatatga taatgtacag tatcgagtta attttttact 2460 ggatactgga gcgtcggcga atatattatc atcgcatatg cgtatcaagc cagaaacgga 2520 atcgaaatcg gaaataaaat cgtttgatgg aactataaaa aagttaaatg ttggatcaat 2580 tcatgcagac gtaataacaa atcgtaaaga cgaaatattg gacaaaaaca taaaattcat 2640 aataacggaa ctagaatatg atggaattct tggaatgcca acattgaaca attttgacat 2700 aaaattctat gatcaatcaa ttcgtttgat aaataataaa tcaggaaaag aatttaatct 2760 tgaaagaatc aatttatccg tttgcaacat cgaatcattg tcattgaaac caatggaatc 2820 aaaaataata ttattaaaag agaatataaa aataaataat gcctacgcac tagtgggaag 2880 ttcagcaatg aatcagtcaa taccggcaaa aatcgtgaat aactcgattg agattttcaa 2940 tgattcgagt aaattcttaa tctcagatga tcataaatta tacatctgtc agttcggagt 3000 aaccaagcaa aaagctactg acatagaaca tgatcggatg ccgcattatc ggagaagctt 3060 aaatatgccg aaattaaagc cagaaacttt taaaataaac aaaaatctga cggcagagca 3120 aaagcaagag attcttgatc tactatcaga atttgactct atatttgctc gatcggagat 3180 tgatataaat ccaggattcc gagagccgta catgtacaaa ataaacacct cgacaccggc 3240 acagcaacca gctaaattcg tgcaaacagc aagcaaatat aatgaagagt tacaaaaaca 3300 tatcgacatg cttgaacagc aaaatgttat tgaacaaatc gagttcgcag acgtcattaa 3360 cgctggattc gtcgtagtaa gcaaaaaatg tggaagacta aggttttgct tagacctgag 3420 atcattaaac aatgttacaa ttccaaacaa aaattatccg attccacata tggacacaat 3480 tctgggaaac cttagtggaa atcgatactt ttcatcgtta gatctgacat cagcctatca 3540 tcaatttctt attgacccta gggatagaga aagatataca tttatcggac caaataaaaa 3600 aatatacagg taccgacgtg ttccatttgg aggtcggtta atcacagcat ggctacaagc 3660 acttatgcaa caagtcattc taaaagattt agaaaacact aatgcctata tcgacgatat 3720 aaacatcggt agcaagactt ttgaagatca taaaaataat ctaagaaata cgcttaatag 3780 aatatcattc cataacctag aattatctgc aagaaaatgc tcatttgcat atgaagaaac 3840 caaagttttc ggatacatcg taaactttga tggttatacg ccagacccag caagaattga 3900 aacactaaaa attccaatgc caaagaataa aaaagagctg ctgagcacac ttgcaggaat 3960 aaattattat agatctacta ttccgaaatt cgcagagtta gcatcaaatc ttttcaaatt 4020 tacacgaaca acatcagatt ttgatccaga agatattgaa acaaagaatg aatggaaaaa 4080 actgatagat gcactcggaa aagcaatcat gatccagaag cctgacctaa atcgacaatt 4140 tatcttgaga acagatgcaa gtaaaaaagc ttttgggaac atgctgagtc aaaagaatga 4200 taaaggaaag gaaattattc tagctgtaga atcgaaacaa tttacggatc aacaaattgt 4260 gtggtcgatc ggaatgaaag aattattatc cgccgctcat gcagtaaaaa gatacacgaa 4320 tcttatcgaa ggaagaaaat ttctattaat cacagactgc aaatcagttt actatctgtt 4380 aaaaaaccga aaagaagtaa atctaacggc atcaaatcca ttaaccagaa actttctttt 4440 tctcatgatt tacgatttcg aaataaaatg gtcaaaaggt accaagaaag attttctttt 4500 gacagatcta ctgtcaagga agaaaattga tggtagtaca gaaattgtaa tcggcaaaaa 4560 atcgaaagac cctctattaa gtataaaatt acttgatggt aaacttttcg gaatagatga 4620 aagaactgaa tatactgaaa aagggacaaa atcggaaatc caatcagtga acgcaataaa 4680 tctcgaacca gtaaaatatt ttgaaatcga tgaaatccta aaagaaataa aacgatcaca 4740 aatggatgac gaaaatttaa gaaaaaagat caaaagaata gcaaatggta aaaagctaga 4800 aacctataaa acaattgtag aagctccata tgaacaccca atactttatc actctgtgaa 4860 cgaaaaatgg agtttaatgg ttccaaaata tctcattcca aaagttttgt cgaaaattca 4920 ccgccacaca tcaaaatcgt ttgacctaag caaaattgaa aaactaggac tattttggaa 4980 caacatggcg gaatcaataa catcgtggca ccagtcatgc acagaatgca ccatttcgaa 5040 gccaaatacg gcaccacaaa agcacactaa tggacaaatc ggagattttc cgcgtggacc 5100 attcgaatcg atatgcatcg acgcaattca cataagacct tatctagcac ttgtcagcgt 5160 tgatcatttc tcaggattca caagcgccta tgctatagac aatgaaagaa tcgaaacact 5220 cataaatgcg acgctgagcc tgtgcttacg atttatagta ccaaaaatga taagattgga 5280 caaccatcgt agtttcaaaa gtacaaaatt tattgacgct atgaataaaa tggacataac 5340 tctatcgttt accactcctg caaactcaca agccaatgga gcttgcgaaa gaaggatacg 5400 ccaagtccag gaacgcttga aaacgctaac gatgaataac gtcgtaaatg gaagtgaatc 5460 aatgacaata atggatcttc agacttgcat cgacttaatc gtatttgaaa taaacatgtc 5520 acctcgtaat aataaaatct gtccactaat gataatgaca gggatcgagc cgagttttca 5580 aatacacaca ccacacattc taaaagaatt ggaagaaccg catggacaga gactgactga 5640 tctaagaaat gaaattcaag caagaattgc gaaagattat gaaaatgaag cttcagcaga 5700 aataacacca tcacaagaaa tgaaaataaa agttggtgat tttgtccgaa ttcaaaaagc 5760 atcggtagca acgaaaacta aacgcgagca aacaagatac tcaaaagaaa tataccaaat 5820 aaaggaaata ctcaatcgtt acgggacagt caaaattatg gaagtaaaag agacggataa 5880 aattgatcga cgaagacccg aaattagaat tatatctgtt aaaaaattaa agagaataat 5940 cgatcgatca aaacttatcg aaacgtatga gaaaaagacc caaaaagagt gggccaaaga 6000 ttgcgaagtc ccaaaagaag tgggccataa aagtacaagt cccaaggagt gggcaaagaa 6060 gtccgaagaa gtgggccgaa gaagtaaaag tcccaagaag tgggcagaaa cgtcccaaga 6120 agtgggccgt ggaagtttaa gtcccaagga ggtgggccag ttagataaag caaaaacccg 6180 gttaaagggt cctgaaaaac acgtgaaaaa agtaagattc gatattatcg aaaatagcaa 6240 aaatgacaaa aaggacaaaa gtgcaagttc gcaaaaatcg agtgataatc ctcagagccg 6300 ctactcactg cgcaggagag caaatgtaga ctataaagat tgactgatca acaatctagt 6360 ttagtttcaa atagtgaaat gagagagaaa ctcaaaattt acattttatt tttattttcc 6420 accatagcct taggctatcg tggacatgaa gagaacgctc tttatgtccg caaaatcgat 6480 ccggctgaat ctaaaatcgc gtacaacgca caaattttag cttcagttgg aatcgctctt 6540 atccgtaact caaacgagaa ctcggtcaat attcgcaggg aatcgtggca tccaatcaag 6600 cttgtatatc gtctaccgac gcttaaggac tttatcgttc aatcgtgtcc aaaatcactt 6660 catgcaaccc tacaatcgag aatggataac aacgcggaaa tgcaattact cacaaaagaa 6720 gcagaaagtc ttaaaaacca atttttacat gaactgacaa atgccactcc gtatatcgct 6780 cagcgaaaac gtagaaatac gacctcagca aattcctata ataatatcac atcaagtgga 6840 aaaccgaggg ttcatagagt tgcgccaaat gcaaataaat cggaaaattc cgctaatgac 6900 actataaata tgagtcacac acagcacatg gactggtcag aatcaagttt atcggagatt 6960 tacgcatcat ctctttcatc tgataatagc gttacgaata atgtggaaag ttactcctcc 7020 gaatcttatt caaatgaaaa tgaaatgatt aaatctgaac caatcgctaa tgaaacagaa 7080 tatgaaccat caggattttg taattataac acaaatgaag caccaattga ggctttgaaa 7140 tttttcaaaa tggaatgcga aaattatatt tgctgtaatc agataacatg taaaacagta 7200 aatcaaagaa cagtggcagc aaatctgatc gaagagttaa tcgatgcccg gcaatacatg 7260 aatcaaacag aaagagattt actcaaatca aaattatctg aaataagtgt ttgttttaaa 7320 ccacaaccac aaagatcaag aaaaagcctc ggcttctggt cttattggtc aaaaggagga 7380 ccactgactc caaattccat taacgcggaa atcgaggaag ttgaaaaatt ggaaaataat 7440 caaatccacg cattatcaga aaaaatgtta tcacgcgaaa catttaacgc tgcaatcgaa 7500 agcgaagaat caaaattaaa actattaagt tcaactcttt gcacaataag cacagaattg 7560 ttgggatcga gtaaaataac tgaaatgaga aatcattata ttcaacttga agataaaatc 7620 gaaagagcag tcgcaacatg cgaaaaagga aaccgccctg taggattatc ggcaaaatta 7680 ttaataagtc tttgcatgcg aataaatcct aatgacaaca gattttgccg ggacccaact 7740 gtattatcaa aaataaattg taaagctaaa aaactcgaga taaacgaaag tgaaatcgtg 7800 tttcatatga aaattggaat gtatgagttt ttggaagatt ctgtaacaac acaaatagtt 7860 gcaacaccat ttatcgatca atcgggacga ccgtttattc tttccgacgt gcctaaaaca 7920 attgttgaaa cacatagaca tataatatct acaacttgcg aaaagacgct cggcgaaaca 7980 atcaacatat gcagtttgcg aaacgcagtt gacgtaaacg caaatttagc atgtgtaaac 8040 gcgcttacta gtcagtcaaa gaacttgata aaaataagct gtaaaaaatc gagttatcga 8100 ggtgataaat gcataactta cggtctccca aatgggggat acgctgtttg gagttcaaaa 8160 ggaatcgaaa taaaagctct tgctcgagac aatgcttttt caaaaacaat aagaatgata 8220 aaaccgaata acctgactat aatctctgaa aaagatgtta atttcgaatg taatgcagtc 8280 caatattcaa caaatagtga tacaaaagaa atatcttacg aggctaactc ggaaatcaat 8340 attgacttag aagccaaaaa tatcataaat gatgaatttg gtttatttaa taatagatta 8400 tcagaaattg aaaagactca aatcgcaaat cataaaaacg cgacgtcact agccacaagt 8460 cagctaagta aaattttaaa cattaatccg aaacacgaac cgaaaataat agtatttttt 8520 cgatatctga tatacggatt agcggcaatg agcacatttt ggataatgaa aaagttaatc 8580 acgttcgcga taaataaata taaaaaatat aaacgaaaga aaagaagatc atttacagac 8640 ttgatcaata atcaagtaag gcctgataat tcaagaacag aaattacaac agaacaacaa 8700 acaacttcaa atctttaaat taagaaaaac aattctaata aatataaaaa tctattgctc 8760 ataatcgtca tcgaaaagca aacagtaaaa tcgccataag aggcgttaac accctaaaac 8820 gcgaaaatgg gggcctaaaa agcgcgcacc ctaaatcgtc aaaaagcttt cacaccctaa 8880 aacgcaaaaa tggggggcta aaaagcgcgc accctaaatc gtcataaagc tttcacaccc 8940 taaaagcaca atgcggcgtc aaagcgaggc gatagaatct cctcaacagt aatgctagga 9000 aaataatcgc taaagtcaat cgtcctcact tttataaaaa tcgaaaaagc gaaaagtcaa 9060 aatgtcgaga aatcgagaga atcccccggc tctagatgat cttgaggagc cgatggaaat 9120 cgtcaccgga gtaagagcaa gtaatgcaga tgaactggag agagaactag aagctcctga 9180 agaaattcct ccagtagtgc gcgccgagca ggaaagactc agaagcgtgg aaagatctgt 9240 tcatcagatg cagcacgcac tagagcttct ggtatcgcga actgaaaatg aagagcgtca 9300 taatgaaccg ggaccttcga cccgaagttt ttacaaccct ggaagtcgtg tttaccgaaa 9360 tcgtaacagt tcacgaacca tcttcgctgg tatcatgaga accatacagg aaaagaaaaa 9420 cctgggaatc atcaatggac aggtatacac tgtcagaagc gaaccaatcg caatttatga 9480 cgaaaacgaa agacgagggc acaaaataca ccaggttgca ctgtttattg atggaaaaac 9540 actgctaaca atcgcaagca ccgagcgtct tttaaaaata aagctcttca taatcaaacg 9600 aatcgagtcc gagatcgaaa acgtcatgct ttgtcgctgg acagcggaat tggcggaaca 9660 acgcaccgtc gaattcgtgt ggtatacaag cctcaatttc acggacgcct cagaaagagg 9720 cccacaacaa cagttcaggt caatctcaga gtacaggcag attaatggcg acgagcaaga 9780 agtacatgtc tttcgatata tggcgccaca atggtcattt ggacaaagga ctgagaacag 9840 aaaaaggaca cgattttgaa agacgacaaa tcagaattcc aagctattta tttattaatt 9900 tacttaactc tctgtgctta tctctcttta cttatattac tgtgcaagaa gggggaaata 9960 ta 9962 // ID Gypsy6-SM_I repbase; DNA; INV; 4427 BP. XX AC Contig1272; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6-SM_I; KW Interspersed repeat; LG_I; internal portion. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-4427 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-4427 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 753-753 (2007). XX DR Genome; Contig1272; Positions 33030 37456. XX CC Positions [2045-2587] - Reverse transcriptase CC Positions [3657-4139] - Integrase core CC 'CTAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1..1083 FT /product="Gypsy6-SM_I_1p" FT /translation="MATIPTDLLIGRLLPEKFHRGDDLELFIKECQRFFEI FT TKTPVKTQMVLVITLLDRTLIEEYEAAEGKTVEQKLRAAFHRPTSLIDDLR FT EALNYEQGNDSAEIFIEKISKMTKKLASHTWNEEEIQKCLLTHCVRDKEVR FT KEIEMKDLKTAEQIKETIKKIEKVNKVIEQVNTVRSIRPTTGGRTYRDVVQ FT VGATKREINEWKPETRVKVIECWTCQKPGHSSRECNIKRRFQCYACGVEGH FT IRRECPTIKCHRCNARGHKERECYTNMERRNQGRDRDQRKMSGGRIQRNTY FT QQREDMYQPRNVHQRQWGNQQYNRKDIAAIESDDEMMNTKQSSQPDEYNRE FT NDPNEHAPSNGTLIGAIY" FT CDS 1110..3749 FT /product="Gypsy6-SM_I_2p" FT /translation="MNANGMCNLSMDDDCMIDKDCSVFERDINNNSSVEIK FT KSNDKESQKKFKEECKVNEVVKHEERTIEKIYKFLKKNGDVCINEIVEDTR FT EQNILSVEEQKSPVITYNIMQGRPSTTLDIQGRKVDCLLDTGARINVMAKS FT VIDRLENIEILETRESLRCANNSRLETMGKLNINVKMGSMERNVTFIIVKN FT LIPEIIGGVELQRLFGIELKCILEEHEKRSDFICEIEARFGRIITDEERLR FT HAIDVLKVTGNKRLLEIFQANKNVFMADKWDIGCTNLIKHKIITKGEPIMI FT KPRRQPINLEDKIEEAIKNLENNGIIRKCNSPWNTPLVCVWKKEKKDIRLC FT LDFRQLNKITERQAFPMPNVDEILDILHGSRYFSSIDLGNAYYQVKLDKES FT QEKTAFSTKEGQFCFNRMPFGIAAAPGTFQELMTKVLKDLWKDGVMVYLDD FT ILIFTKTEEDHYNIFGKVLGKIATAGLRLNPEKCQIFRKEVKFLGHIINKD FT GIQTDNTKIEAIQSFQKPKCVKNLRSFLGICNYYRRFIKDYAKKARALESI FT CGKNNEKIRWTEMCEKAFGEMKEALITAPVLVFPDFRKEFILDTDASFDTI FT GAVLSQKDEKGHEHVIAYGSHAMSSHEKGYCITRKELLAIYYFCKHFNHYL FT YGKRFVLRTDHKAITFMVTTKKPITAQFQTWINYLSSLDIKMEYRKGTSHT FT NADMLSRKTCGTCVQCMMEHEDAKTGKIKTRILTVTAEGGYNKWQNDNMEV FT QEIKNKLENKDCKFIMENNTVLTKQGKIWIPSDNRQRMIKEVHVLLCHAGA FT QKVTKYIQNNCDMENLATEVKKVIENCERCQKMKTITTKTKGGNSNNKEYR FT TIREDIYGYMWAIERNVQQKEIYLRNH" XX SQ Sequence 4427 BP; 1879 A; 617 C; 959 G; 972 T; 0 other; atggcgacca ttccaacgga tctgcttatt ggaagattac ttccagaaaa attccacaga 60 ggagatgacc tcgaattatt tatcaaagaa tgtcagaggt tctttgaaat aacgaagaca 120 ccagtgaaga cacaaatggt actagttatt acacttctgg acaggacttt gattgaagaa 180 tatgaagccg cagaaggtaa aaccgttgaa cagaaattaa gagcagcttt tcatcgacca 240 acgtcattga ttgatgactt aagagaagcc ttgaactacg agcaaggtaa cgattcggcg 300 gaaatattta tagagaaaat atcaaaaatg actaaaaaat tggcctcaca cacttggaat 360 gaggaagaaa ttcagaaatg tttattgaca cattgtgtaa gagataaaga agtcaggaaa 420 gaaatcgaaa tgaaagatct caaaacggcg gaacaaatta aagaaactat aaagaagata 480 gaaaaagtta acaaggttat tgaacaagta aacacggtga gatcaatacg acctacaact 540 gggggaagaa cttatagaga tgtagtacaa gttggagcca cgaagagaga aatcaacgaa 600 tggaaaccag agacaagagt taaagtgata gaatgctgga cgtgtcaaaa accgggacac 660 agtagcagag aatgtaatat aaaaagaaga tttcaatgtt atgcatgtgg tgttgaaggt 720 cacatacgaa gagaatgccc aacaatcaaa tgtcatagat gcaatgcacg aggacacaaa 780 gaaagagagt gctacacaaa tatggaaaga cgaaatcaag gaagagatcg agaccaaagg 840 aagatgtcag gaggaagaat tcaaagaaat acctatcaac aaagggaaga tatgtatcaa 900 ccaaggaacg tacatcaaag acaatgggga aaccaacaat ataacagaaa agatattgca 960 gctatcgagt ctgatgatga gatgatgaat acaaaacagt caagccagcc agacgagtat 1020 aacagggaga atgacccaaa cgagcacgct ccgtcaaacg ggacattgat cggagcaatt 1080 tactaagtaa gattgatttc aaccaaagaa tgaatgctaa cggtatgtgt aatttaagta 1140 tggatgatga ttgtatgatt gataaagatt gtagtgtttt tgaacgtgat attaataata 1200 atagtagtgt agaaattaaa aagtcgaatg ataaggaaag tcagaaaaaa tttaaagagg 1260 agtgtaaagt aaatgaagta gtaaaacacg aggaaagaac gatagagaaa atttacaaat 1320 tcctaaagaa aaatggagac gtgtgtatta atgaaattgt agaagatact agagagcaga 1380 atattttgtc agtagaagaa caaaaaagtc cagtaataac atataatata atgcaaggta 1440 gaccgagtac aacactggat atccaagggc ggaaagtaga ttgtttattg gatacaggag 1500 cgcgaattaa tgtgatggct aaatctgtaa ttgatcgatt agaaaatatc gaaatattag 1560 aaacaaggga atcgctaaga tgtgcaaaca acagtagatt agaaactatg ggtaaactaa 1620 acatcaatgt taaaatgggc agtatggaaa gaaatgtaac attcattata gtaaagaatt 1680 taataccaga aattattgga ggagtagaac tacaaaggtt atttggtata gaattgaaat 1740 gtatactaga agaacatgaa aaacgtagtg atttcatttg cgaaatagaa gctagatttg 1800 ggcggattat aacagatgaa gaaagattac gtcatgccat cgacgtttta aaagttaccg 1860 gaaataagag actgctagaa atatttcagg caaacaagaa tgtttttatg gcagataaat 1920 gggacattgg gtgtaccaat ctgataaaac ataagatcat cacgaaagga gagccaataa 1980 tgattaaacc gagacgtcag ccaataaatt tggaagacaa gattgaggaa gcaataaaaa 2040 atctagaaaa caacggaata attaggaagt gcaattcacc gtggaacaca cctttagttt 2100 gtgtatggaa gaaagagaaa aaagacatca ggctttgtct agacttcaga caattaaaca 2160 agataacaga aagacaagca tttccaatgc caaatgtaga tgaaatttta gacatcctac 2220 acggatccag atattttagc tcaatcgact tgggaaatgc ttattaccaa gtgaagttag 2280 ataaagaatc tcaagagaaa acagcattct caacaaaaga aggacagttc tgttttaaca 2340 ggatgccgtt cggtattgca gcggcaccag gaacatttca agaattaatg acgaaagtat 2400 tgaaagactt gtggaaagat ggagtgatgg tatatttaga cgacatccta atattcacaa 2460 agacagaaga agaccattat aacatatttg gaaaagtcct agggaagatc gcaacagcag 2520 gactaagatt gaaccccgaa aaatgtcaaa tatttagaaa agaagtgaag tttctgggac 2580 acataataaa taaagacggc atacaaacag ataatactaa aatagaagca atacaatcat 2640 ttcaaaaacc aaaatgtgtg aagaatctga ggagctttct gggtatctgt aactattatc 2700 gacggttcat aaaagactat gcaaagaagg caagagcact agaaagtata tgcggaaaaa 2760 acaatgagaa aataagatgg acagagatgt gtgaaaaggc tttcggggaa atgaaagaag 2820 cattgataac cgccccagta ttggtatttc cagatttcag aaaagaattt atattagaca 2880 cagacgcgag cttcgatact attggagcag ttctttcgca aaaggatgaa aaaggacatg 2940 aacatgtcat cgcatatggt tcacatgcga tgagcagcca cgaaaaagga tactgcatta 3000 ccagaaaaga attattggca atatactatt tttgtaaaca tttcaaccac tacttatatg 3060 gtaagagatt cgtactgaga acggaccata aagctattac gtttatggta acaacgaaga 3120 aaccaataac ggctcaattc cagacatgga tcaactattt aagcagtctg gatattaaaa 3180 tggaatacag gaaaggaaca agccatacaa acgcggatat gctatccagg aaaacatgcg 3240 gaacatgtgt acagtgtatg atggaacacg aagacgcaaa aaccggcaaa attaaaacca 3300 gaatattaac agtaacagca gaaggcggat ataacaagtg gcaaaatgac aatatggagg 3360 tccaagaaat aaagaataag ttagaaaata aagattgtaa gttcataatg gaaaacaaca 3420 cggtactaac taaacaaggt aaaatatgga taccgtcaga taatagacag aggatgataa 3480 aagaagtaca cgtattgctg tgccatgcag gtgcacagaa agtaacaaaa tatatccaga 3540 acaactgcga catggaaaat ctagcaacgg aagtaaaaaa ggtaattgag aactgcgaaa 3600 gatgtcagaa aatgaaaacg ataacaacca agacaaaagg aggaaactca aacaataaag 3660 agtacagaac cattcgagaa gatatatatg gatatatgtg ggccattgaa agaaacgttc 3720 aacaaaaaga aatatatttg cggaatcatt gaccactata gtaaatacat ctcattaacg 3780 gccataaaca agcaagacga aagaacaatt agtgagacgt tattaaataa atggatatta 3840 aagtttggag cgccaaaaga acttcatgta gattgtggga agaactttga agcaagaagc 3900 ataaaagaac tagcaaagac agctggcatt gaattaattt tctcaagccc atatcaccat 3960 aacacgaatg gtattattga aagacaattc agaacaatca gagagtatat taacgcgtca 4020 ttgaatgaag gaggaaggaa aaactgggct gatatagtgc cagaaataga atatacgtta 4080 aatgcaacag ttcaaaaaac aacgggagta agcccagcag agattatatt taggaggaag 4140 atcgacagaa tgaaatggta ctcgaataaa gaaataaata gagaagatat ggaaaagaga 4200 atagaggata aaacactaaa accaaaaata agcaaaacag tgagaaattt tgaaatggaa 4260 gatgtggtct tgatcaaaca agaaattcga aataaagacg atgcaaggtg ggaagggccg 4320 tacaaagtga taaagaaaat acatgaacgg agctacttac ttaaggatca aaatgggaag 4380 atggtagtca gaaatgttga gaagatcaaa cattttaaaa aaggggg 4427 // ID BEL-200_AA-LTR repbase; DNA; INV; 691 BP. XX AC AAGE02032565; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-200_AA_; KW BEL-200_AA-I; BEL-200_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-691 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02032565; Positions 981 1671. XX SQ Sequence 691 BP; 197 A; 160 C; 154 G; 180 T; 0 other; tgtacagttc tgcactgctg tcccctacga tcctggttga ccgtggcgag cacacatgca 60 ctgagcaaac agcacctagg aaatactgac caaaacaaaa cgcaactatt gttgaccatc 120 gccttgatca tcgtctggat tgctacagta cttatcacga tcaaccgaga gatgagagaa 180 cgacggagaa tcaaactgtt cgaatcccga ttgaccaccg attgttgaac tccaggtagc 240 agtttgtggt tatcgagagt ctctacacga cttctctgga tgaccgcagg ttaaccgatg 300 accgatgata gtttttttta gttcaattcg aatagccggc gcgaaaggtc atcattggcc 360 cagttaccca ttttaatctg tgtagatgta gttgttaacg ttaaataaat gtagttagag 420 tagattgtgt ttaactagaa ataaatgtag tttttgtgaa ttccatccgc gtgttgtgtc 480 cgattctgtg tggaaagcaa ccccaaggcc tattacggtc ccatccaaac agcaatcatc 540 attcgtgcga agaaggaggc tttagacgaa gtgaattgga tcccaagctt cactccatcc 600 ccaattgctg tacaaaccaa ccccccccgt gttggagtcg gaaggcaaac tccgctgtaa 660 tagaaggaag gtaggattcg actaaccaac a 691 // ID Gypsy-50_AA-I repbase; DNA; INV; 4692 BP. XX AC AAGE02019559; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-50_AA_; KW Gypsy-50_AA-LTR; Gypsy-50_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4692 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019559; Positions 5455 764. XX CC Positions [2087-2620] - Reverse transcriptase CC Positions [3712-4173] - Integrase core CC 'CCCAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 674..2479 FT /product="Gypsy-50_AA-I_2p" FT /translation="MDKWDIPAFKFKLMPPNEVRDEWIRYKRQFEYLSLAN FT NVANRTRLKNIFLAKAGPDVQDVFSSIPGADVEERAGVDPFKMAIEKLDEY FT FAPKQHEAYQRFLFWSLTLKEEEPLDKFLLRAMDSAGKCNFGTSKQTAYEI FT CVIDKLIQLAPPDLREKLLQKEKLVLDEVIKIVNSHGTIKHQASQMTTGNV FT GPSSSGFVNRIASLPKDNRFCCSRCGRRGHSGKDSICPARTRNCRICRKPG FT HFAECCRTSRGPAKRETLEEPSNYHKRIKYQHVRTIQDAEDEGKPHSFIFT FT VGDGDEFLWVKIGGVLIQTLIDSGSQKNILDDVTWENMKVQGAKVQNLRSA FT SDQTFRAYGKSSDPLVVKQVFESIIEVEEGQNTISTTATFYVIEGGSQALL FT GRATAKEMGVLALGLPSTEPPNVHRVRSERKRPFPKVKGIKLHIPINTSVT FT PVTQHARRPPLALMDKIEEKLDSLLASDIIERVNEYSPWVSPLVTIVKDNG FT DLRLCVDMRRANLAINRETHFMPTFEDFLPQLRKAKYFSRLDIKDAFHQVE FT LDESSRHITTFITHKGLYRYKRLMFGVSCAPEMFQKVLEQILSGCKNVVRS FT SMTV" FT CDS 2530..4659 FT /product="Gypsy-50_AA-I_1p" FT /translation="MHTLQDKNVLLNHDKCILRVQEVDFLGHHLSEKGIRP FT TDDKLAALRSFRAPQTSEELRSFLGLVTYVGRFLPDLATVCAPLRLLTHIG FT VPFTWKNEHEIAFRRLQTMISDIATLHYFDNSLRTRVVADASPVGLGAVLV FT QFANDRDDSCPRVISYASKSLSPTERRYCQTEKEALALVLAVERFSTYLLG FT RKFELETDHKPLEMIFSPSSRPCARIERWVLRLQSFTFVVRYRKGTSNVAD FT PFSRLATDCTTQDFDSDSPFLILDIMESAAIDTCELELASENDFELSVVKE FT CVRTGKWDRAEAKPFEVFRNELGFVGDLLVRGSKLVVPKLLRSRMLVLAHE FT GHPGETVMKRRLRDRVWWPSMDRDTSAYVTACEGCRLVGLPCKPEPMQRRE FT LPVKPWVDIALDFLGPLPTGEYLLVVIDYFSRYKEIEIMKHITAEETILRL FT QKIFTRLGFPITLTLDNARQFISAAFENYCTHNGIHLNYSTPYWPQENGLV FT ERQNRSLLKRLQISHALGRDWKSDLQEYLLMYYTSPHSVTGKTPTELCYGR FT TIRSKIPSLSDVEVAPSRDEVADRDRLLKKKGKDAEDGKRHAEHSDLKVGN FT TVLMKNLLPGNKLTTTFGRNEYEVMSKDGSRVTVKDKTSGKIYRRNVAHLK FT RIDSSHEGTNDSLASVAPCEPANPELSPSNSQADGSRLNLSEHSRTRRTLK FT LPSRFKDYEC" XX SQ Sequence 4692 BP; 1423 A; 939 C; 1158 G; 1172 T; 0 other; attggcgact gtggatgtgg aattattagg taagaatcgt attgcatatt tattcactcg 60 aaaaaaacga aaaaaaacaa cgttcataga tcggctgcgc tgtgttgata ttgattggag 120 attgaggtta tgtgttgtgc tatgaagggg ataaggcaat ttatctaaaa atcgcaggaa 180 agctgagaat aggccttcga ggctggtggt ccaaacgatc gaaatgattg ataaatgaaa 240 tgtatattga aggtcctgga gactggtaat gtacaggtcc aagagactgg agtcagagca 300 gtgcgttaca gaaagattat ataggtcctg gagactgaaa aaacaggtcc gaaagactga 360 aaaaaaggtc ccggagactg atactaataa agaaaggtct aagagactgc acatttgagg 420 aaatgatatt cgatgtggga ggtccttgag actgaagaaa taaaaatatg agaagtaaga 480 atattacgac gaagaagtat aatacatttt tgctttcagg tgaggatcaa atcgagccaa 540 cgacgtgtta tctgatacaa aaaaacgggc acctatttgg tgaagttaag gtaacactat 600 ttacgaatat ggactaagat tatgagaata aattttgaat attccagtaa cctgtgcaac 660 tgagtcagcg aacatggata aatgggatat tccggcattc aaatttaagc tgatgccacc 720 taatgaagtg agggacgaat ggatccgcta caaaaggcaa tttgaatacc tgagcctggc 780 aaacaatgtg gcaaatagaa cgcgactgaa aaacatcttc ttggcaaaag ctgggccgga 840 tgtccaagac gttttcagca gcattccagg agcagatgtt gaggaaagag cgggagtcga 900 tccattcaag atggcgatcg aaaaactcga cgaatacttc gcgccgaaac agcatgaagc 960 ttatcaaagg tttctatttt ggtcgctgac actgaaggaa gaagagccgt tggacaagtt 1020 cctgttgcgt gcgatggata gtgccggtaa atgcaatttt ggaacctcta aacaaacagc 1080 ctacgaaatc tgcgtaattg ataaactcat tcaacttgct ccacctgact tgcgagagaa 1140 attactgcag aaggagaagc ttgtgctcga tgaggttatc aaaatcgtga attcgcatgg 1200 cactatcaag catcaagcaa gtcaaatgac gaccggcaac gttggaccgt cttcatccgg 1260 gttcgttaac cgaattgcca gtctaccgaa ggataatcga ttctgttgct ctcgatgcgg 1320 tcgtagaggt catagcggga aagattccat ttgccctgcc aggacgagaa attgtcgcat 1380 ttgtcgtaag cctggacatt tcgcagagtg ctgccgaaca tctcgaggcc ctgccaagcg 1440 tgaaactctg gaagaaccaa gcaactatca caagcgtatc aagtatcaac atgtgcgaac 1500 gattcaggat gctgaggatg aaggtaaacc acacagtttc attttcactg tcggcgacgg 1560 ggacgagttc ttatgggtca aaattggagg cgtactgatt caaacactaa ttgattctgg 1620 tagtcagaaa aacatcctgg atgacgtaac ttgggagaat atgaaagttc aaggcgctaa 1680 agtacaaaat ttgcgatcag cttccgatca aacgtttaga gcatacggaa agagctcaga 1740 cccattagtc gtaaaacagg tgtttgaaag cattatcgaa gtggaggaag gacaaaacac 1800 tatcagcacc acagcaacat tttacgtcat tgaaggtggt tcacaagcac ttttgggcag 1860 agcgaccgcc aaagaaatgg gtgtgctagc actaggactt cctagtacgg agcctcctaa 1920 tgttcaccga gttcgttcag aacgaaaacg tccttttcca aaagtaaagg gtatcaagct 1980 tcatatacct attaacacca gtgttacacc agtcactcag catgcccgtc gaccaccctt 2040 ggcattgatg gataaaatag aagaaaagct agattcccta ttggcatcgg atataattga 2100 acgcgtaaac gaatatagtc catgggtttc tccattggtc acgattgtta aggacaatgg 2160 agacctacgt ttgtgtgtgg acatgaggag agcgaattta gcgattaatc gagaaacgca 2220 ctttatgccg acgtttgagg atttcctccc acaactgagg aaagccaaat attttagtcg 2280 gctggacata aaggacgctt tccaccaagt cgagctggat gaatccagca gacatattac 2340 aacgtttatt acacacaaag ggctttaccg atataagcga ctcatgtttg gggtttcgtg 2400 cgcaccggaa atgtttcaga aggttctgga acaaattcta tctggttgca aaaacgtggt 2460 cagatcatcg atgactgtct aatattcggg gaaactttag aagagcacga tgaagctctg 2520 gcgaaggtta tgcatacgtt gcaggataaa aacgtccttt taaatcacga caaatgcatt 2580 ttgcgtgttc aggaggtcga ttttttgggc catcatctgt cggaaaaagg tattcgccca 2640 accgatgata agctagctgc gttgagatcg ttccgtgccc cacagacttc ggaagaatta 2700 agaagctttt tgggactggt cacctacgtt ggcagatttc tcccggacct ggcaacagtt 2760 tgtgcaccac tgcggctact aacccacata ggggttccat tcacttggaa aaatgaacac 2820 gaaatcgcat ttcgtcgatt gcaaactatg atctcggata tagcaacatt acactatttt 2880 gacaattctc tgcgcacgcg agtagttgct gatgcatcgc cggtgggcct tggtgccgta 2940 ctggtacagt tcgcgaatga tcgggatgac tcgtgccctc gtgtgataag ctacgccagc 3000 aagagtctga gtccaactga gcggagatac tgccaaaccg aaaaagaagc gcttgctttg 3060 gttttggcag tagagcggtt ctcaacgtat ttgctgggaa ggaaattcga gctcgagacg 3120 gaccacaaac cgctcgaaat gatattttcg ccatcctctc gaccttgtgc gcgcatagaa 3180 aggtgggtgt tgcggttaca atcattcacg tttgttgtgc gatatcggaa agggactagc 3240 aatgttgctg acccgttttc tcgacttgcg actgattgta caacccaaga ttttgatagt 3300 gatagcccat tcttgatttt ggatataatg gagtcagccg cgatcgacac gtgtgaactt 3360 gaactcgctt cagaaaatga ttttgagtta tccgtcgtta aagaatgcgt cagaactgga 3420 aaatgggata gagcagaggc taaacctttt gaagtctttc gtaatgaact gggatttgtt 3480 ggagatctgc tagtgcgtgg ttcgaaactg gtggtgccga aattgttacg gtcaagaatg 3540 ctagtgttgg cacatgaagg ccatccagga gagactgtga tgaaacgtcg attgagggat 3600 cgcgtctggt ggcccagtat ggatcgtgat acgagtgctt acgtcactgc ttgcgaaggt 3660 tgtcgcttgg ttggattgcc atgtaaacca gaacccatgc aacgcagaga attacctgtg 3720 aaaccttggg tcgacattgc tttggatttc ctagggcctc tcccaactgg agaatacctg 3780 ctagtagtga tcgactattt cagtcgatat aaagagattg aaataatgaa gcacattaca 3840 gcggaagaaa ctattctccg attacaaaaa atatttacac ggcttggatt tccaattact 3900 ctcacactgg ataatgctag acaattcatc agtgcggctt ttgagaacta ttgtacgcac 3960 aatggaatac acttgaatta ctctacccca tactggccgc aagaaaacgg cctggtggag 4020 cgccagaatc gctccttatt gaaacgactc cagatcagtc acgcgcttgg gcgtgattgg 4080 aaatccgatt tacaggagta tttactaatg tattatactt caccacattc cgtcactgga 4140 aagaccccta cggagctctg ttatggacga accataaggt cgaagatacc gtctctgagc 4200 gacgtggaag ttgctccatc gagagatgaa gtagcggata gagatcgact tctgaaaaag 4260 aaaggaaagg atgctgaaga tgggaaacgc catgcggagc actcagactt gaaagttgga 4320 aataccgtgc ttatgaagaa tttgttacct ggaaataagt tgactactac ctttggtcga 4380 aacgagtatg aggtgatgag taaggacgga tcccgtgtta ctgtcaagga taaaacatct 4440 ggaaagatat accgaagaaa cgttgcacat ctcaaacgaa ttgatagtag ccatgaggga 4500 accaatgatt ccctagcttc agtggcccca tgtgaaccag caaaccctga actgtcccca 4560 tcgaatagcc aagcggatgg ttcaaggtta aacctctctg aacattctag gacacgacgc 4620 acactaaaat tgccatcaag gttcaaggac tacgagtgct aagggatttt acgttcacga 4680 gaaaaaggga ga 4692 // ID Copia-27_DPu-LTR repbase; DNA; INV; 180 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-27_DP_; KW Copia-27_DPu-I; Copia-27_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-177 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 180 BP; 40 A; 35 C; 40 G; 65 T; 0 other; tgttgaaaat aggaggcaag cagaggcttc tttcctttga tgctcgtatt cttctttgtc 60 gtctgaaggg cttcatgtca agtgtgttgc cgtctgtcac ctatcggtca gttgttgcaa 120 taaaggtaat cacgctgcac taaattgtcc tgtcatatat tttattgggc aatttctaca 180 // ID Copia-23_SI-LTR repbase; DNA; INV; 293 BP. XX AC AEAQ01023869; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-23_SI_; KW Copia-23_SI-I; Copia-23_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-293 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023869; Positions 482 190. XX SQ Sequence 293 BP; 71 A; 50 C; 64 G; 108 T; 0 other; tggagagtaa caattaaaga tgtcaagtcc gttcaatttc actcattgta tatgcttgtg 60 tctgattggt atgagagatg aacaaggagc gcactctgtc gtgagatgat gcaatgatgt 120 gtgactgtaa aaggagttag gtaacgtcgc gtctccttct tcgtcgtgtc gtgtcgagtc 180 gagacgattc tttgtattaa ctttttgttt agttttccag ttgtcttaat taaatctttt 240 cacttgtagt cattgttcag ttttcctctc tatgcagatc agataacctt aca 293 // ID hAT-35_HM repbase; DNA; INV; 3977 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-35_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3977 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2024-2024 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 717..3248 FT /product="hAT-35_HM_1p" FT /translation="MEAVCSVCRERNTKYTCMKCAVFVCNICATPGKTGDL FT SYSEEIKRIALCSKCSTVEPPKNKIQKTIFSLFKPLLEQSPAAKSINEKNA FT KEETSKITANPRKVSVSTVETWKSELAEYSLSEWLFYDIDSNERVKNLKCK FT FCTEFNDTIKTMPYYTNTFIVGSANYRKSNILDHCKSQPHLKAYTLFLKKS FT NNFQNASKPLLPCKNNATITDGLKKMEKNDLEKTLKKFQLSYFIAKQQLPL FT SKFVPIVKLVELFGVNMGESYISDRQCVVFIDYIARSLQNSLHDSLRNAKF FT YSVLCDGSTDSSVIENEVVYALYFDPQPVGSDAVEIRTSFLDIKYLESGKA FT QGVVKVLEECFVNLLTPIKCIGFTSDGAALNRGDQSSVKTILRERSPWLVF FT VWCVAHRLELAIADALKDTEFDLVDEIILRSYCLYQKAPKKLRQLRELYTV FT LKDGMDTLDESCKPKKASGTRWISYKVNAMKSFLDKWGLYITHLEQLAEDP FT SILAPDKAKMKGYLLKWKKGKIPLLFAFFIDLLEIPSILSMSFQDDQVDHV FT QVMNRLTKAKERWVLFEKKSLENLPRVKDFLSQIKKKNGKHYYHDIELKDF FT EREFKIFKTKTVEYLTVVRDCIIQRVENEKESTLILKYASQVLNCNGWLRE FT VIAESGIITRNLDFADEALNYICKHFQEPLRSAAVVFPDILTQWKEMLSYA FT QEFLSPVETQYRITWRRIFVSPRSKSWTSILTVAELLFTLPISNAKLERMF FT SKMKSVKSNYRTSLTESCVSNILRILEDGPLLKEFNFLPAIELWWSDKGRR FT PNQKPTGIYKDRVNKKRTITSLSDSENDSGEDLVKSLFDDN*" XX SQ Sequence 3977 BP; 1413 A; 593 C; 650 G; 1321 T; 0 other; cagggctcga aatagccgcc ggacatcgga cattgtccga cattggtccg gtaaaaatca 60 tcataaatcg gtcattttgt ccgactaata ttttttgtga aattttctta aaattatttt 120 agttatttaa aagttattga aatgttcagt atattatata taaactttct tttttatttt 180 attttttaac caaaaatgcc caagaaaacg aaagtgtgca caatgtgatg aagttcgtca 240 ttctttgaga taattcagac aaaatggaaa cagtattatg ccgcaagaat tttatttaat 300 gagcaacgct ttttttaacg ggggttaaca tcaacacttt ttttacattt ttcttttaca 360 tttttaaaat atcttccccg taaatattaa tattttgtaa aaaataaaat ttaatacaac 420 atgttacgtt tttaatagtt ttctgtttac tgtgatttaa ttttgaattt gaaattttcg 480 taaggtaaag ctatttgtta cttttaatta aacttatttc ttatcttatt caattttgtg 540 aacagaagtt tttattttat aaatttaaca ataaatatat taataaaatg tataattata 600 tgagtataat tatgtatttt attaatataa ttattcacat ataatatata taagtttaaa 660 atatttatga taatgtgttg tcattattcc aacttagggc gtaatattaa aaaaaaatgg 720 aagcagtttg ctcagtttgt cgtgaaagaa atacaaagta tacatgcatg aaatgcgctg 780 tttttgtttg caatatttgt gcaactccgg gaaaaactgg cgacttatct tatagtgagg 840 aaattaagag aattgcctta tgttcaaagt gttcaacagt tgagccaccg aaaaataaaa 900 ttcaaaaaac tatattttct ttgtttaaac cacttttaga acagtcgcct gcagctaagt 960 ctataaatga aaaaaatgcc aaagaagaaa caagtaagat cacggcaaat cctcgtaaag 1020 tatcagtatc gacagtagaa acttggaagt cggagctcgc tgagtattca cttagcgaat 1080 ggctatttta tgacattgat tcaaatgaaa gagtaaaaaa cttaaaatgc aagttttgta 1140 cagagtttaa tgatacaata aaaactatgc catactatac gaatacattc atagttggat 1200 cagcaaacta tagaaaatcg aatattcttg atcattgtaa gtctcaaccg cacttaaaag 1260 catacactct ttttcttaaa aaatctaata attttcaaaa tgcatctaaa cctttattgc 1320 cctgtaaaaa taacgcaaca atcactgatg gtttgaaaaa aatggaaaaa aacgaccttg 1380 aaaaaaccct taaaaaattt caactttctt actttatagc gaaacaacag ttacctcttt 1440 caaagtttgt tccaattgtt aaattagttg agcttttcgg tgtgaatatg ggcgaatcct 1500 atatttcaga tcgccaatgc gttgtattta ttgactatat tgcaagaagt ctgcaaaatt 1560 cgttacacga tagccttaga aatgctaaat tttatagtgt attgtgtgat ggttcaacag 1620 acagttcggt aattgaaaat gaagtggtgt atgcattata ttttgatcca cagccagttg 1680 ggtcggatgc ggttgaaatc aggacttcgt ttttagatat aaaatattta gaatctggaa 1740 aagcccaagg tgttgtaaaa gttttagaag aatgctttgt aaatttgcta acacctatta 1800 aatgcattgg ttttacttca gatggggctg cactaaatag aggcgatcaa agtagtgtga 1860 aaaccattct acgcgaaaga agtccgtggc ttgtgtttgt ttggtgtgtt gcacaccgct 1920 tagaattagc aattgcagat gctttaaaag atacagaatt cgatcttgta gatgaaatta 1980 tattacgaag ttattgcctg taccaaaaag caccaaaaaa gctccgacag cttcgcgagt 2040 tatatactgt tttaaaagac gggatggata cgcttgatga aagttgcaag cctaaaaaag 2100 catctggaac acgatggatt tcttataaag ttaatgccat gaagtctttc cttgacaaat 2160 ggggtttata cattacccat cttgaacaat tagctgaaga cccatcgatt ttagctcctg 2220 acaaagctaa aatgaaaggg tatctcctaa aatggaaaaa aggaaaaata ccacttctat 2280 ttgcattttt tatcgattta ctcgaaattc cctcaatact ttcaatgtct tttcaggatg 2340 atcaagttga tcacgttcaa gtaatgaacc gtcttacaaa agcaaaggaa agatgggtat 2400 tatttgagaa aaaaagtctc gaaaatttgc ctagagttaa agattttctt tcacaaatca 2460 aaaaaaaaaa cggcaaacat tattatcatg acattgaact taaagacttt gagcgtgagt 2520 ttaaaatttt taaaacaaaa actgtggagt atttaacagt agtcagggac tgcattatac 2580 aacgtgtaga aaacgaaaaa gaaagtactc ttattttaaa gtatgcatcc caagtattaa 2640 attgcaacgg ctggcttcgg gaggttatag ctgaaagcgg aatcataacg cgaaatttag 2700 attttgccga tgaagcttta aattatatat gcaaacactt tcaagagcct ttacgctctg 2760 ctgctgttgt ttttccagac atcttaactc aatggaaaga aatgctaagt tatgctcaag 2820 agtttttatc cccagttgaa acgcaatacc gaattacctg gcgacgcata tttgtatcac 2880 ctagaagtaa atcatggacc tctattttaa cagttgctga gctattgttt actttgccaa 2940 tttctaacgc taagctggaa agaatgtttt caaagatgaa aagtgtcaaa tcaaattatc 3000 gaacctcatt gacagaatct tgtgtttcaa atattttgcg cattttggaa gatggtccat 3060 tgctgaaaga attcaatttt cttcctgcaa ttgaattgtg gtggagtgat aaaggaagac 3120 gcccaaacca aaaaccaacg ggaatttata aagatcgggt aaacaaaaaa agaacaatta 3180 cttcccttag cgacagtgaa aatgatagtg gagaggatct tgtgaaatct ctttttgatg 3240 acaattaaaa gaaaaacata ttttaattgt gtaaaaatct ttaagcttta aaataaatac 3300 aaaatactgg tcttgctttt ataatcaaac atattcctga cgagaaaagt tcgtttatgt 3360 atagacatta cgtttgcaca ataaatagcg ttaaatagag atagttaata ccaatactct 3420 aaagtaaaaa tggttagcgc gatagcaaac acaactattt attattatta ttttattaaa 3480 aactgttttt attcttttag aaatgttgtc gtttttttcc aaaaaagtca caagttttta 3540 acttgtaagt ctaaacatga aagtataaca cgtttttttt ttatactttc ataatctaat 3600 taacaccgca atactaatga ctaagggtgc actgttgcat cggtcttggt aaccatcggc 3660 cgatgccgat tgttttgaaa aatgagtctt tttaattttg tgtttctata ttttataaaa 3720 aaatataatt attatatttt tttatacaat atagaaacaa aaaaataaaa aaaattgact 3780 tttttaaaca attggcatcg gccgatgcat cggtgcaccc ctactaatga ccgcacatac 3840 cagtagtaaa taataaatcg gatattagtt ttatagtaaa aatatttttt gttgaaattt 3900 atcggtcatg tccgacaaac caaacaattt atcggacaaa ctgtccggaa caaaaaagta 3960 cgttatttcg agccctg 3977 // ID Copia-37_DPu-LTR repbase; DNA; INV; 263 BP. XX AC ACJG01004148; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-37_DPu_; KW Copia-37_DPu-I; Copia-37_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-263 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004148; Positions 2539 2277. XX SQ Sequence 263 BP; 70 A; 62 C; 43 G; 88 T; 0 other; tgttgtaatt atcttccaac tcaagcgcac ccaagtggtg ttgcttaacc ttagtgttgc 60 ttaaccccac tcccgttaga tgtctcatgg aagatatagt tgacgtctgc tacctgtcca 120 tgtgacagtc tcttcactct cttaagaacc ggtcataggg ttagacttat ctaaaactat 180 attctgtcat ccatggtatc tgtaatacaa tatcttgttt aaaatcacgc tcatggtcta 240 atgttcagtc actaaactca aca 263 // ID Gypsy-15_IS-I repbase; DNA; INV; 3975 BP. XX AC ABJB010373042; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_IS_; KW Gypsy-15_IS-LTR; Gypsy-15_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-3975 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010373042; Positions 13267 9293. XX CC Positions [1579-2028] - Reverse transcriptase CC Positions [3258-3641] - Integrase core CC 'ACGAC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1486..3039 FT /product="Gypsy-15_IS-I_1p" FT /translation="MERMGVISPVEEPTPWCAPMVVVPKPSGAVRICVDFT FT ELNRYVLREWHPIPSVEHTLGLLQNAKYFSKFDANSGFWQIPLSGDSKQLT FT TFISLFGRFCFNRLPYGISLAPEHFQKQPSSILQGASGVVCHMDDILIWGS FT TPAEHNTRLRIVLQRLCNAGLTLNEEKCSFNVQELTFLGHKINEHGIRPDD FT TKVEAVKAMDPPTSKPELQRVLGMATYLARFVPNLATVLQPLSSMLSTKQE FT FVWGSAQQDAFDKWKAILSSHPVLGVYDPRKETVVTADASSYGVGAVLRQK FT QEGNKYKVIAYASRLLSDTEKRYAQIEKDGLALVWACEKFSDYLVGKFFNL FT ETDHKPLVPLFSTKALDDLTPRLQRLRMRMMRYNYNITYVPGKELLAADTL FT SRAPLKNTFPLDLERDVGAFVQHIRSTLSFKDTCLQWIIDHQKTDLVCQAL FT RKYSQRTWPSRRDLDNDVKPYYQYRHQFTVEEDLLLYASRVVIPVKCRPEL FT LRLLHEGRFGIVKTQARAAGSV" XX SQ Sequence 3975 BP; 1130 A; 1078 C; 944 G; 823 T; 0 other; tggtgtcaga agtgggatca cggccggaaa tcgagaaccg ctaccagcac tgacttccga 60 ggcacccaag gaagcgaccc tgcgctggaa gaaatggcaa ccaccgcagc gggcacatcg 120 ttcgtaatac agccaccgga gccattttct ttctcgtctc cgaacgaatg gccgaaatgg 180 aagaagcgct tccaacgctt tcatagtgct tcgggactca gcacaaagcc tgaagaacac 240 caaatcgacg ctttccttta cgttatgggg gaccaagccg aggacatctt cagcacgttc 300 cagctaaccg aagcagaagc caagaagttc gataccatca tgggccactt cgacgcgtat 360 ctcataccgt gaagaaacgt catctttgag cgatctcgat tcaactcccg ggtccaacaa 420 gaaggcgagt ccatcgaaga gtttgccacg tcactgcacg ccttgtcaaa gtactgcgac 480 tacggcccaa tgcaagacca gttcgtacga gatcggctag tcgttggcat ccgtgacaaa 540 aagctctcca caaaactcca gcttgacgcc gacttaaaac tttaaaaggc cctagagtca 600 gcgaggcaag ttgaaaacgt ccgtaagcag caagtagagc tgaatcaaga catcaaagcc 660 ggcagcagcg tcaacaagct tgacacaagg acaacttcaa ctaaccgctt ctctgggtct 720 agctccaagt caccaggcag actagaacac aaagagtctg gttcgtccaa gacgaacagc 780 caagaatgtc attggtgtgg acgagcacgc catcccagaa agtcctgttc agctcgacac 840 gagacttgta agagttgcca taagaaaggt catttctccg gagtctgcct ttccaagaaa 900 gtccgcttcg ttggacaatc tcaagacgag gatgacgagg ttggctttat tggaggagtg 960 gcacatcctc aagctgacaa gtgggaagtg acggtgacgc ttcaagggga aaccgtaaag 1020 tttaaattgg acacgggcgc cgacgaaaca gtcatctcgc cctccatctt ccggaagctt 1080 tccccacgac cgcgcctggt agcaccacct cgtcagcttc acggaccgga cggaaccctt 1140 ttggcaacac agggggcagc tcgcctcaac atcacgtaca gagaccgcac ttcggtgcaa 1200 gacgtttaca tcttggaaga cctccgtacg ccattactag gaaagcctgc tgtacaaagc 1260 ttgcaattgg gatcggtggt caacgagatc agcccaagct ccaatattct tgacggatct 1320 aaagtcgacg caagaacaga gttccctgcg ctcttcaaag ggctcggcaa gctccctgaa 1380 gcaaaccacg tgcaactcca agagggcgcc aaaccgttcg cactaagttc tacacggcga 1440 gtgccgatcc cgctctacga aaagacgaaa gcagagctct aaagaatgga gagaatggga 1500 gtaatatcac ccgtcgaaga gccaactcct tggtgtgcgc cgatggtagt tgttccaaag 1560 ccaagcggtg cagtgagaat atgcgttgat ttcacagaac tcaatcgcta cgtactcaga 1620 gaatggcacc ctataccctc cgtcgaacac accttaggac tccttcagaa cgcaaaatac 1680 ttctcaaaat tcgatgccaa ctccggtttt tggcaaattc cactaagcgg agacagcaag 1740 cagctcacga ctttcatttc gctgtttgga cggttttgct tcaaccggct cccttacgga 1800 atttctttag ctccggagca cttccagaaa cagccctcga gcatcctgca gggagcaagt 1860 ggtgtcgtat gccatatgga tgacattctc atctggggct ccactcccgc agagcataac 1920 accagactcc gcatcgtttt gcagcgactc tgcaacgcag gtctgacgct caacgaggaa 1980 aaatgctcct tcaacgtcca agaattaaca ttcctgggcc acaagatcaa cgagcatggc 2040 atccgtccag acgataccaa ggttgaagcc gtcaaggcca tggacccgcc tacctctaag 2100 ccagagctgc agagggtgct gggaatggca acgtacctgg ctagatttgt tccgaaccta 2160 gcaaccgtcc ttcaaccact cagctctatg cttagcacta agcaagagtt tgtttgggga 2220 tccgctcaac aagacgcgtt tgacaagtgg aaggcgattt tatcgtcaca tccagtcctt 2280 ggagtctacg accctaggaa agagactgta gtgacggccg atgcgtcttc ttacggagtc 2340 ggcgccgtat tgagacaaaa gcaagaggga aacaaatata aagtcatcgc gtacgcttcc 2400 agactgctga gcgacaccga aaaaaggtac gctcaaattg aaaaagacgg tctagcactg 2460 gtctgggctt gtgaaaagtt cagcgattac ctagtcggca agttcttcaa cctcgagacc 2520 gaccacaagc ctctggtgcc gcttttctca accaaggctc tagacgacct cactccgaga 2580 ctgcagcgtt tgcgcatgcg tatgatgagg tacaactaca acatcacata cgtaccagga 2640 aaagaacttc ttgccgcaga cacgctatca agggcacctc tcaaaaacac attccctctg 2700 gacctagaga gagacgttgg cgcttttgtc caacacatca ggtccacatt gtcattcaaa 2760 gacacatgtc ttcagtggat tattgaccac caaaaaacag acctagtctg ccaagccttg 2820 aggaaatact ctcaacgaac ttggcccagc cggcgagacc ttgacaacga cgtgaagccc 2880 tactatcaat atcggcatca attcactgtt gaggaagatc tacttctcta cgcctcaagg 2940 gtagtgatcc cagtcaagtg ccgacccgaa ctcctgagac ttctgcatga aggtcgcttc 3000 ggcattgtaa agactcaagc cagagcggca ggctctgtat gatggccagg tctacaaaaa 3060 gatgtggaaa gacagtaaaa acatgccatc cctgcattga aaattcaaag aacaggagaa 3120 tgcctttccg actgaatttc ccgacagacc atggcagagg gtggcaatgg acctctttta 3180 ccatgaaggt cactggtggt taattgtgac cgattatttt tctcggtacc cagaaatcgc 3240 acgactacaa agtctgacta cacaggcagt tgtcacacac tgcaagtcaa tattttcccg 3300 acacggtata ccagaggtcg tacgtttccg acaacagacc ccagttttct cgtgcagcga 3360 cgtctgcgtt ttctacattt gcaagagaaa atggatttca acatgtgaca tccagccctc 3420 gttatccaca aagcaatggg ttagcggagg ctgcagttaa aataatcaaa acttcgatgt 3480 caaaaacggg agacctttac aagacagttc tctcgtacca aacgacacct ctaaggaacg 3540 gatatagccc tgcggaactc ctcatgggaa ggcggcttcg aacaaacgtg cctacagcta 3600 gcaagatgtt acttcccgtc agccccgaca ttaatcgact aaagtttgag aaagaaagtc 3660 gggaacaaca agaaaaggca tacaatcgcc gctacggagt tcgcgacctg gcggacatcg 3720 tggatggtac cgatgtctgg gttgtagacc tcaagagaag aggtactgta caacagccag 3780 cccaggagcc aaggtccttc atcgtagaaa cggatgaagg caccgttcgg aggaacaaga 3840 cccatcttgt tccctacacc gatcaaaacc cggcgcctga ctcctcacac gaatgcgaag 3900 ataccacgca tgcccacagc aagagtggcc gctgtctaaa acaacctagg cggtactctt 3960 cttaagaaag ggaga 3975 // ID BEL-61_CQ-I repbase; DNA; INV; 3382 BP. XX AC AAWU01017211; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-61_CQ_; KW BEL-61_CQ-LTR; BEL-61_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3382 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 275-275 (2011). XX DR GenBank; AAWU01017211; Positions 9663 13044. XX CC 'GCGCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 333..2990 FT /product="BEL-61_CQ-I_1p" FT /translation="MWRSITSMKLEDAFPQRAHAKQKIKKLRDIILGLKNP FT SLRQLTRLRQHLWELYDEYGNAHSCIVGNVPDNDLAQQEEEYNDFDHLFNE FT AGVPLEEKLIALRSAAPKTSATYRVEHQQHPWGRSVSNFYGCSSWSTSMMS FT CESGQPELSGQATNLNQRSRLRNLEADVVLRNHHPPTSSPSSPPEPRSRSQ FT NLEPDTALRSCMPPTNGPSCRSKPTSASDVLDTEVVMRDHLQPQTEANPTD FT LPESTTSSEVTKTNAKCVPLQPVQRNPNESRNGRVLPDDGIYTTSAQTPNS FT VLNHEANPIATDNGLESQVETDERRKLAIPTVTTLVNTDRCSEDSGSRPEA FT EKIPTVLPSFPKPDLFDSQRSLMDGDTRTKEREPLSAVNLRNEKTNSLSVE FT HGSSLMIIQWDVVTLWTNAELNPPLPTAANAPDKRRPTIACPNGPNPPPHG FT NRKKHAVNKTAAPNPRKTDGLSPKFPDQLHAVLCRGAPHEQRPTSSTSGDL FT RDTGAPEPNRVNATHQHHGEKPLQKRKSETEQLTKKCARYERKGDSKHALK FT RLQHRAAPKKSIPTSEALNRRPHLDEPPDLPPKRSTIAAVRWTCLKQNRTN FT GSLCCSELKATTVKSKTKPKSPRSRSLWTISSTAAAVLVSMLPFLLAFRHN FT IHAKYCENDFVGRETSFPEIGRGPCEQTVAPKSIEVNRLGVSSHTVRQLRL FT LDGRQRRTIPSQPTESVPLENASRGKKSVQPLHDHTKLFQRNQPNWNEQAC FT MVCSESPQCDLPDPGSFKSSSTKHPNLPQRLQQIASGREKKPPQKQLHARS FT PKTVQPTGKSNQADQRGSKSKCIHTGSQHLKAPQKVFQTSEALDFLLHLCE FT APASPRGRRHIATLCRIRQKGIPTKTRRCWSNL" XX SQ Sequence 3382 BP; 980 A; 1061 C; 813 G; 528 T; 0 other; tatggtccat ccgaaccgga ttggactttc gagtgtgtgc cgccactaga gcaatcggac 60 agaaagtgta gtgtgccatc aagaacgatt ccggaagtga gcttccgggg aagtaaagtg 120 tcaagtgatt cggaacatcc ggctggaagc tcctgtgcca cgaaagcgaa attgaaccct 180 ccctgctgcg gcaggacagt aagaaagaag acatcccggc tgtgaccgaa cagtgcaaaa 240 aacaaacatc cacccgcgga aaccagattc cgccgtcgac gcaacagcag cgatccaccc 300 cgaacaacct gccaggcgat tctccaaaca ccatgtggag atcgatcacc tcgatgaagc 360 tggaggatgc cttcccacag cgagcgcacg ccaagcagaa gatcaaaaag ctccgtgata 420 tcatactcgg cttaaagaac ccgtcgttaa ggcagctaac gaggctgcgt cagcaccttt 480 gggagttgta cgacgaatac ggcaacgctc acagctgcat cgtcggaaac gttccggata 540 acgatctcgc ccagcaagaa gaagaataca acgactttga tcatctcttc aacgaagccg 600 gagtgccgct ggaggaaaaa ctcattgctc tgagaagcgc agctccgaaa acaagtgcaa 660 cttaccgcgt tgaacaccag cagcacccct gggggaggtc agtgtcgaat ttctacgggt 720 gcagcagctg gtcaacctcg atgatgtcct gtgagtccgg ccaaccggag ctctctggac 780 aagccacaaa cttgaaccaa cgatccagac tcagaaatct ggaagcggac gtggttttgc 840 gaaaccacca cccgccgacg agcagcccaa gctcacctcc cgaaccccgg tccaggtccc 900 aaaacctgga accggataca gctttgcgaa gctgtatgcc gccaacaaac ggcccaagct 960 gccgatccaa gccaacatcc gcgagcgatg tcttggacac cgaagtggtc atgcgtgacc 1020 accttcaacc tcaaaccgaa gccaatccaa cggacctccc agagtccaca acctccagcg 1080 aagttaccaa aaccaacgcc aaatgcgtgc ccctccaacc ggtacaaaga aaccccaacg 1140 aatcccgaaa cggccgtgtc ctcccggacg acggcatcta cacgactagt gcgcaaactc 1200 ccaacagtgt cctgaaccac gaagccaacc cgatcgcaac cgacaacggc ctcgaatcac 1260 aagtcgaaac cgacgagaga aggaaactcg caattccgac ggtgacaacg ctcgtcaaca 1320 ctgaccgctg ctcagaggac agcggatctc gacccgaagc tgagaagatt ccaacagtgc 1380 tgccaagctt tcccaaaccc gacctgttcg actcccaacg ttcgctcatg gacggtgaca 1440 cacgaacgaa agaacgcgag ccgctgtcag cggttaatct ccggaatgaa aagacaaatt 1500 ccctcagtgt cgagcacggc agctcactga tgatcatcca gtgggacgtt gtaaccctct 1560 ggaccaacgc ggagctcaat ccgccgctgc caaccgctgc gaatgcgccc gataaacgac 1620 gcccaacgat agcctgtccc aacggtccca acccgccgcc ccacggaaac cggaagaagc 1680 acgcggtcaa caaaaccgca gcacccaatc cacgaaagac tgacggcctt tctccaaagt 1740 tcccggacca gttacacgca gtgctttgcc gaggcgcacc acacgaacaa cgacccacca 1800 gctcgacttc cggcgatctg cgagacacag gagctccgga acccaatcgc gtgaacgcga 1860 cacaccaaca ccacggagag aaacctctac agaaacgaaa gtccgagacc gagcaattga 1920 caaagaagtg cgcccggtat gaacgcaaag gtgactccaa gcacgccctc aaacggcttc 1980 agcatcgagc agcaccgaag aagagcatcc caacgagcga ggcgctcaat cgtcgaccgc 2040 acctggatga accacccgac ctgccgccga aacgaagcac tattgccgca gttcgctgga 2100 cctgtctgaa acagaaccga acaaacggca gtctctgctg ctctgagttg aaagccacca 2160 ctgtcaagtc taaaacgaag cccaaatccc cacgaagcag atcactatgg acaatttcaa 2220 gcaccgccgc tgccgtactt gtgagtatgc tcccattttt gcttgctttt cgccacaaca 2280 tacacgccaa gtactgcgag aacgacttcg tcggtcgtga aaccagtttc ccggagatcg 2340 gccgtgggcc gtgcgaacag accgtcgctc cgaagtccat cgaggtcaac cgactcggtg 2400 tgagcagcca cacagttcgc caactacgat tgcttgatgg ccggcagcgg agaacgattc 2460 cgagccagcc aaccgaaagt gttccgttgg aaaacgcaag taggggaaag aaaagcgtcc 2520 aaccgctaca cgatcacacc aagctattcc aacgaaacca gcctaactgg aacgaacaag 2580 cctgcatggt ttgtagcgag tcaccgcagt gcgacctgcc cgatcccggt tccttcaagt 2640 catcttccac aaaacacccc aacctgccac agcgcttaca gcagattgcc tctggccgcg 2700 aaaagaagcc ccctcagaag caactacatg cacgaagtcc caaaaccgtg caaccgacgg 2760 gaaaaagcaa ccaggctgat caacgaggtt ccaagtccaa gtgtatccac accggatcgc 2820 aacatctaaa agcaccgcag aaagtttttc aaacgagcga agcgctcgat tttctgctgc 2880 acctgtgcga agcgcccgct tcaccgcggg gccgacggca cattgccaca ctttgccgaa 2940 tccgccaaaa gggaattcca acgaaaaccc gtcgctgctg gtccaatctc taagacaacg 3000 tcaaaaacag aacgaagtcc tttttactac gacgcagaat ccctggaacg actgcaactg 3060 caactcaaga gaataggatc cagaccaacc agcggaacac gggaacccca gttaggatgc 3120 tgcgaagcgg caaccggcgc aacaagctca attccaacca cgcacgagaa ggactcgctc 3180 gcaaaccgac cgctcaagac atcggtgagc gaaagttgac gagacattct gagccagagc 3240 accggaagtg ctccgttggt aaacgcaagc tgatattcga ggtgtgtcca ccagtccgac 3300 aagagagaaa ccaagctgca ccgagtcaac gccaacctga tcaccaagat ttgttgaacc 3360 agaggttcaa cggaggggag ta 3382 // ID Kiri-31_AAe repbase; DNA; INV; 4508 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-31_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4508 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 726-726 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 292..1062 FT /product="Kiri-31_AAe_1p" FT /translation="MSDNERTLRSNNTTKGSSLSNDMSVGDLANLMKSQFI FT SYQRSTKEEIKIMGEKLSSEIEGLKREITTDMDKLREENNKIYAELTSTIS FT TLEMDTTHAMETSMRANDLIASGIPFVQGEDLMSYFKVWCCSLGYSEGHIP FT MVDVRRLIGRSGSSGNAPNILIQFAITVQRNDFYSRYLRSRRLVLSDIGFS FT VNKRVYVNENLGPAARNLRSKAIRLKKEGRLHAVFTRNGAVYIKRNEEDRG FT VAVKSSDELDHLLQSS" FT CDS 1528..4347 FT /product="Kiri-31_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDNDTSVNTSSNWSIPGVVMKSALLPGKLSICCLNSQ FT SICARRMTKFIELKQMIIMTAVDIVCVCETWLNSRTDNNVLTIDGYDLIRN FT DRIGRLGGGVLMYVKHGLNHKILDLSENISSTEYILCEMKLHNKKMLFGVF FT YNPPNVECSETLNDLLCRYGTEYSGVYFLGDFNTNLLDVHSSRTARLQEVL FT IGHSTECLGVEPTFFHINGCSQLDLFLTNVPDNVLRFNQIDFPVLSNHDMI FT FASLDIEVYTGVPSSFQYRDYNRVDIGALRESLQLVDWDTFYSIDDSDILT FT EYFNAVICEMHEIFVPVRTVLRNLKSNPWFNNSIAKAIVDRDLSFKQWKVS FT SNAIDHALFRRLRNKVNYLIRKAKTEYYSSQINTNLPPKLLWKRLKNSGIN FT TTTKSCNNFDANDINNFFCETFGDNDRSFRPIHRNSELPAFNFRNIDEVQV FT INALFDIKSEAVGLDNLSLKFLKMCLPSILPQIVYMFNTIICSSRFPKCWK FT QAKVIPIKKKPLLNSIQNLRPISILCVLSKAFEALLKKQVCEHVEEYNLLT FT RFQSGYRSKHSTKTAMMKVLDDIGIVLDNGKPVVLVLLDFSKAFDTICHTK FT LCTKLQTNFGFSFEATQLVYSYLTERFQCVYNNGFFSTFIPTTSGVPQGSI FT LGPIFFALYINDLPNVLKFCKIHIYADDVQLYFDCSEFDANTTSRYINEDL FT ARILDWSVRNKLSINVEKTHAMFLTTRANSENAPMLCLNGSSLRYVEKALS FT LGFTIKNDFSWDDFILQQCGKIYGKLRTLQLTGYFFSAEIKLKLFKTFIFP FT FFVACDFLLPQASVNATNKLRVALNACVRYVYNLSRFDHVSHLQCKLIGCS FT FENFVKLRSCLIIHDIFTQKEPDYLHSKLVLNRSGRSNKFILPRHRTSKYG FT NSFFVRGVAYWNLLPNELTMQLSGMGFRRKCFEHFR" XX SQ Sequence 4508 BP; 1370 A; 780 C; 855 G; 1503 T; 0 other; gttcatacgg tagaatgaag tgcacgtaaa agtggttggt gctcacaatt tgcgaatatc 60 catggtaaaa caccagttaa aaaccgaaaa gctactgttg ccaaacaagt gatcatctat 120 caaacgattt ctatccgcca attaagtgat aagcttcaga ttagagttgt tttttgatac 180 tcttttctag aacaagttct ataatatatc tgcagtagcc aaagccataa ttttcgagca 240 gttattgttc tattccatcc cgtttgatcc attcggattg tactcgtcaa tatgagtgat 300 aacgagagaa cgctgcgtag taacaatacc accaaaggat ccagtctcag taacgatatg 360 agcgtcggcg atctggctaa tctgatgaaa tcgcagttca tctcgtatca acgttcaacg 420 aaagaggaaa tcaagataat gggagagaaa ttgtcctcgg aaattgaagg cctcaaacga 480 gagatcacta cggacatgga caaattgcga gaggagaaca ataagatcta cgctgagttg 540 acttcgacca tcagtacgct agaaatggac accactcatg ccatggaaac atcgatgaga 600 gcaaacgatt tgattgctag tggaataccg tttgtgcaag gagaggatct gatgtcttat 660 ttcaaggttt ggtgttgctc tcttggatac agcgaaggtc atataccaat ggtcgatgtt 720 cgtcgtttaa ttggaagatc tggtagtagt ggtaacgcac caaatattct gatccagttc 780 gctattacgg ttcagagaaa cgatttttac tcacgttatt tacgttcccg acgtctagtt 840 ctctctgata tcgggttctc cgtaaacaaa cgagtttatg tgaatgaaaa tctcggacct 900 gctgctagaa acttacgatc gaaagctata cgactaaaaa aggaaggccg actacatgct 960 gtattcacca gaaatggagc tgtgtacatc aaacggaatg aggaggatcg tggagttgct 1020 gttaagtcct ctgatgaatt ggaccatctc ttacaatctt cgtaacccta ttcctttgac 1080 aatctcatga atacttccca tgtttccaat ccaaaaggtt ttcctttgaa tcctacctga 1140 aagtctctta catattacct tctcctttga agtttttgtt tacttccttc cgatccgttc 1200 ctttaaaatt atcctttggc tcccctccta aaagctaaaa tcttctgttg ctgtatattc 1260 atcactttgc tgctgctgtt gttgctggga tgtcgcctat cggatgctgt tgatgctggg 1320 atgtggaatg ctgttgtttt aatgtccgcg ctgctttatc attcaaataa actctaataa 1380 aattttacgc atcttgtgac acttttgata ataatgaatt gtagtaaatg atttgtatta 1440 atgacttgaa cattttttgt attgtaaatg tagtatgggt gctcaaacta atttccgttt 1500 caatctcgtc tgccttattt attgataatg gataacgata cctcggtcaa cacctcttcg 1560 aattggagta ttccgggtgt ggttatgaaa tctgcgttgt taccaggaaa attatcaata 1620 tgttgtttaa acagccaaag tatatgtgct cgtcgtatga ctaaatttat tgagctgaaa 1680 caaatgatca ttatgactgc tgtagacatt gtttgcgtgt gtgaaacatg gttgaatagc 1740 cggactgata ataacgttct tacgattgat ggctatgatt tgataagaaa cgacagaata 1800 ggtagacttg gcggaggtgt tctaatgtac gttaagcatg gtcttaatca caaaattctt 1860 gatttgtcag aaaatatttc ttccacagaa tacattcttt gtgagatgaa attacataac 1920 aaaaaaatgc ttttcggagt cttctataac cccccaaatg tcgaatgctc tgaaacgttg 1980 aacgatttgc tttgccggta tggaactgaa tacagtggag tgtattttct aggagatttc 2040 aatactaatc tccttgacgt acactctagt aggactgcta gattacaaga agtgttgatc 2100 ggccatagta ctgaatgttt gggagtcgaa ccgacttttt tccatatcaa cggctgttcg 2160 cagctagatt tattcctaac aaatgttcct gataatgttt tacgattcaa ccaaatagac 2220 tttcctgtgc tctctaatca tgacatgata tttgcttctt tagacataga ggtatatacc 2280 ggggttccga gttctttcca gtatcgtgac tataaccgtg tcgatatagg tgctttgcga 2340 gagtcactgc agttggttga ttgggacact ttttacagta tcgacgattc tgatatactt 2400 actgaatatt tcaatgctgt gatctgcgag atgcatgaaa tttttgtgcc tgtacgtact 2460 gttttaagaa atctaaaatc gaatccatgg tttaataata gcatagcaaa agcgatagtt 2520 gatcgcgatt tatcttttaa acaatggaag gtctcatcaa acgcaattga tcatgctctg 2580 tttagaagat tacgaaacaa agttaattat ctaattagaa aagcaaaaac tgaatattat 2640 agttctcaaa ttaatacgaa cttaccccca aaacttctct ggaagagatt gaaaaattct 2700 ggaatcaata caactactaa atcctgtaac aattttgatg caaatgatat aaataacttt 2760 ttttgtgaaa ccttcggcga taatgatcgt tctttcaggc ctatccatag aaactcagaa 2820 ctacctgctt ttaatttccg taatatcgat gaagtccaag ttatcaatgc tctatttgat 2880 atcaaatctg aagctgtggg tcttgataat ttatcgctaa aatttttaaa aatgtgctta 2940 ccaagtattt taccacagat agtttatatg ttcaatacaa tcatatgtag cagccgtttt 3000 cccaagtgct ggaagcaagc aaaggttatt cctatcaaaa agaaaccctt gttaaattca 3060 attcaaaacc tgcgacccat aagtattttg tgcgtcttgt ctaaagcttt cgaagctttg 3120 cttaaaaagc aagtttgtga gcatgttgaa gaatataatc ttttgactcg ttttcagtca 3180 ggttatcggt ctaaacacag tacgaaaact gccatgatga aggttttaga tgacattgga 3240 atagtgttgg ataatggaaa accagtagtt ttagttttat tagatttctc gaaagcgttt 3300 gatacgatct gccatactaa actgtgtact aagctgcaaa ctaattttgg tttttctttt 3360 gaagctacgc aattggttta ttcctatctt acagaacgct ttcagtgtgt atacaataat 3420 ggattttttt ctacgtttat cccaactaca tctggagttc ctcagggatc tatacttggg 3480 cctattttct ttgcattata cattaatgat ttaccaaacg tgttgaaatt ttgtaaaatt 3540 catatatacg ctgatgatgt ccaactctat tttgattgtt cggaatttga cgctaacacc 3600 acttcgaggt acattaatga agatttggca aggatattgg attggtctgt tagaaacaaa 3660 ctttcaataa acgttgaaaa gacgcatgca atgtttttaa ctactcgtgc caacagtgaa 3720 aacgctccta tgttatgttt aaatggaagt tctttaaggt acgttgagaa agctttgagc 3780 cttggtttta cgattaagaa tgattttagt tgggatgact tcattcttca acaatgtgga 3840 aaaatttatg gtaaattgag gacactacag cttaccggat attttttcag tgctgaaata 3900 aagttaaaat tattcaaaac ctttattttt cccttttttg tcgcctgtga ttttcttttg 3960 ccgcaagcat cagtgaacgc tacaaataaa ctgcgtgtag ctcttaatgc ctgcgtgaga 4020 tatgtttata atcttagccg gtttgatcat gtttcacatt tgcagtgtaa attaataggt 4080 tgttcttttg aaaattttgt gaaacttaga tcttgtctaa ttattcatga tatttttaca 4140 caaaaagaac ctgattactt acatagtaaa ttagttttga atagaagtgg aagaagtaat 4200 aaattcatac taccacggca ccgtacgtca aagtatggta attccttttt cgttaggggt 4260 gttgcatatt ggaacttgtt gccaaacgaa ttaactatgc aattatcagg gatgggattt 4320 agacggaaat gttttgagca ctttagataa atagtagttt atttaaacac gtatattaga 4380 atttttcaga aatgcgtgct ttaccgaatg tgtcactttt gttctacaat gcatcaatgt 4440 aacatagaaa agacaaatcg tcttaagtta cttattaaaa taaataaatg aaatgaaatg 4500 aaatgaaa 4508 // ID Gypsy-2_TCa-LTR repbase; DNA; INV; 183 BP. XX AC ChLG3; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_TCa_; KW Gypsy-2_TCa-I; Gypsy-2_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-183 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG3; Positions 29077315 29077133. XX SQ Sequence 183 BP; 49 A; 25 C; 41 G; 68 T; 0 other; tgttagaata tttagttagt ttaggttatg aggttggtag tcagaaagta actaggggag 60 aagctactgt gatggcgtcg gcctgaaatg tgtttttgtt gttaatcttc cgaaatcaag 120 ttcatcatta aacttttgtt ttaacagact gtgttttgtg ttccctgaca ctcaaatcta 180 aca 183 // ID Gypsy-187_AA-I repbase; DNA; INV; 5385 BP. XX AC supercont1.90; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-187_AA_; KW Gypsy-187_AA-LTR; Gypsy-187_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5385 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.90; Positions 1952981 1958365. XX CC Positions [2297-2797] - Reverse transcriptase CC Positions [3932-4399] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1718..5275 FT /product="Gypsy-187_AA-I_2p" FT /translation="MDVLMVSASRRSSTPYLIDAIVGGKNLQMEIDSGSAV FT SVLCRQTYEKKFKNYPISTCRMKLAVVDGAHLMVVGQFDVSVKLNGQRRTV FT QVVVLESKKIFTPLFGRDWLDVFFSAWRDAFQPVSVNYTTVTREVTVEEDI FT KSKFPNVFDNDFSTPIKDFEADLVLKEDKPVFRRSYEVPYRVRDKVIEHID FT SLERDSVITPVDVSEWASPVIIVAKKDNGIRMAIDCKVAINNVILPNTYPL FT PLVRDLFASFSGSKVFCSLDLAGAYTQLRLSENSRKIMVINTIKGLYSYNR FT LPQGASSSAAIFQKVMDQVLYDSEGVSCYLDDVLIAGKDYEECKKRLYLVL FT ERLSKFNIKVHSKKCKFFVSSLPYLGHVLTDQGLLPCPEKIQTIRRAIAPK FT NVTELKAVLGMVNYYGRFIPNLSSHLSTLYNLLKKDSKFVWSKNCDDTFEL FT CKHLLLNSKLLEYYDPEKPIVVVSDACGYGIGGVIAHVVGKEERPISFTSF FT SLNDAQKSYPILHLEALAIVSTIKKFHKYLYGKKFTVYTDHKPLIGIFGKE FT GKNSIFVTRLQRHIMELSIYDFDIMYRPSAQMGNADFCSRFPLPQEIPKDL FT LRDHVKNINITTELPLDWMTIAKETKNDEFLQKGLFFLQYGWPKKLEKRFR FT DVFSQHQDMELLDGCLLFQDRVVIQVSLHAAVLKLLHRNHAGIHKIKQLSR FT RTVYWYGINSDIESFVKNCHTCNQMAVVPKKPSQITWIPTSRPFSRIHADF FT FYFDHKVFLVIVDSYSRWLEVEFMRSGTDAKRVKSKFMSLFARFGLPDVIV FT TDGGPPFNSRYLIDFWEKQGIKVMKSPPYHPSSNGQAERMVRVVKEVLKKF FT LLDPQMRELDIEDQISCFLFNYRNTCTEDSSFPSEKMFKFKPKTILDLVNP FT KSSYKKHLTTPKEEQSIGFNADECKKDKLAKLRTGDPIYFKNFKPTDVRRW FT LEAIFLKQISPNVFQISLGGRTYTAHREQLKLRRKSPNRIVCGTSEMDKER FT SREKMNGQDRNQDQDLDNDFYGYVFDSCLFNIPSGNSLPYSTISGSSLPSS FT SVSGNSLPSSSASDSGLSRSSAHGNSLPVTEHSGNCVPIRTPLTHTDGSTR FT QLNEDSVGRTECPNGPSVTTAVDLLDPINKPLQLPGSRQRKTRRNRNKDEA FT FGNRSDLNLRRSLRSKRKTNSDEYEYY" XX SQ Sequence 5385 BP; 1620 A; 927 C; 1183 G; 1655 T; 0 other; ttataactgg cgtacgaggt gaaatttttt ttttcaattc agttgaagtt taagctgtga 60 ttaagaacaa ttatagtttt gtgaggtgat actagttatt ggtagaattt aattctactg 120 atacagtttc taaagccgtt tcgctcatct caagtgtggt gatgccaagg cggcaaggag 180 tttatgttga aatctcatca cagcagtgtt taattcgcac gttttgcgaa aaagttttcc 240 aatagtttac tacagagttt cttcataaaa aaaagtaaaa cctgtgaaaa gtgtttaaat 300 cgcaaaggac gcgtgtgttt aagatcggct tctactgcat cgcccattgt tcagtttggc 360 gtaatataat agcgttgtgt gtttgaccgg agacaaagcc tggcatcgca gatcagtttt 420 ttgtagctga ccacaatagc tttggaaatt ttcatcgcgt caaccagcat ccattcccag 480 tataagcttg tagtgtgtgt tgaagttttt ggtaccgctt agactggtcc acatcatcgt 540 ttgaagacaa cacgttgact aaccaggtat gttttaaatt gttttatcgt ttttggttta 600 atttgttttg ttgtgattgc gattgtgcgt gtttgttttt ttttccgaag ttatttaaaa 660 acttgagcaa tttaagtttg ttttcttcat catttgtgtc tctctgggag tagtttaaaa 720 tttatgttag tacaacatat tatgtgtttg tagttgatat ttttttcata aaatatcaaa 780 aattatacaa atttagaagt tgattttttt tcgtttctct gtttgtttct aatttttatt 840 aaagtggcaa tgccaattat gagtacaatt gaaccgttcc atccagatgg aagcataact 900 ttctcgcaat atctggatca actggagtgg atttacttgc atcaaaaagt agcagatggg 960 gataagaaga ctacattcct tgcatcgtgt ggtaccgctg tctacagtga gctcaggttg 1020 ctgtaccctg gccgaaactt gaaagatatt gaatacaaag atttgacaga ttcgctacgt 1080 aaacgtttcg acaaaagtga aagtgattta gtccaacgta tcaaatttta tgcgagggtg 1140 cagaaacctg gcgaaaaagc cgtagatttt attctggcgg taaagcagct ggcagagtac 1200 tgtaagttcg atacgtttaa agagacggca attcgtgaca agttgctgtg cggtatttca 1260 agtaaacagc tacaggagcg gctcttagat gaagaagatt taactcttgc tagggccgag 1320 cggatcataa ctaaccgtga agaagcgatg gaaagggcta gtctgatgaa tgaaccagaa 1380 acaggtagag ttagtgccat tgaacgtttt ggaggtagta agagggttga ttttcgtcga 1440 gatggttcac cgccaccgtc taggagtagg gttcggaatc acgatcttcg taaccgtttg 1500 aacgacagcc ggagtagtag caggagccgt tctgaatcca gaaagcgggg ttaccagcca 1560 tactactgta cgtactgtcg cagaaagggt catactcgca agtattgtta cgatttgaag 1620 catcataagc catcggtaaa atctgtagtt gtcgagacga agaaaccaga cctctctgac 1680 cgtttcaaga gagcaactat cggattctga agaggaaatg gacgttctca tggtgtctgc 1740 tagtcggcga tcaagtacac cttacttgat tgatgctatc gtcggcggta aaaacttaca 1800 gatggaaatc gacagcggtt cagcagtgtc cgttttgtgc agacaaacgt atgagaagaa 1860 gttcaaaaac tacccaataa gtacctgtcg tatgaagctg gcagtcgtcg atggagctca 1920 tttgatggtc gttggtcagt tcgacgtgtc ggtgaagctt aatggtcagc gtagaacagt 1980 tcaagtagtg gtcctggaaa gcaaaaaaat cttcacacca cttttcgggc gtgactggct 2040 agacgtattc ttttcagcat ggagagacgc cttccaacca gtttctgtaa actacacaac 2100 cgtgacgagg gaggttaccg ttgaagagga tataaaaagt aagtttccaa atgtttttga 2160 taatgatttc tccactccaa taaaagattt tgaagcagat ttggttttga aggaagataa 2220 accagttttt aggcgatcat atgaagtccc atatcgtgta agagataagg ttattgaaca 2280 tattgattct ttggagaggg atagtgtaat tactcctgtt gacgtcagcg aatgggcttc 2340 tcctgtaatt atagtagcga agaaggataa tggtattagg atggcgattg attgcaaagt 2400 cgccataaat aatgttattt taccgaatac ctatccgctt ccattggttc gagatttgtt 2460 tgcatctttc tcgggttcta aggttttttg ttcactcgat cttgctggag cttacaccca 2520 gttacgtctt tccgaaaatt caagaaaaat tatggtaatt aataccataa aaggtttgta 2580 ctcatacaat cgcctgccgc agggagcatc tagcagtgca gcaatttttc aaaaagtaat 2640 ggatcaagtt ttgtatgatt cagagggggt ttcttgttac cttgatgacg tgttgattgc 2700 tggtaaggat tacgaagaat gtaagaagag gctttatttg gtcttagaac gtttgtctaa 2760 gttcaacatc aaagtacatt caaagaaatg caagtttttt gtttcaagtt tgccttattt 2820 aggacatgtg ctaacggacc agggtttgtt gccatgtcct gagaaaattc agacgattcg 2880 tagagctata gctccgaaaa acgtcacgga attgaaggca gttttgggta tggtgaacta 2940 ttatggtcgt ttcattccca atttatcctc acatctcagc acgttgtata atttgttgaa 3000 gaaggactct aaatttgttt ggagcaaaaa ttgtgatgat acctttgaat tatgcaaaca 3060 cttactatta aactcaaaac ttttggagta ctatgatcct gaaaaaccaa tagtagttgt 3120 ttctgatgcc tgtggatacg gcataggagg agtaattgct catgttgttg gtaaagagga 3180 gcgacccata agttttacat ccttttcgtt aaatgatgcg cagaagtcgt acccgattct 3240 gcatctggaa gcattggcga ttgtcagtac tataaaaaag tttcacaaat atctgtatgg 3300 taaaaagttt actgtttaca cagaccataa gcctttgatt ggcattttcg gaaaggaagg 3360 taaaaattca atttttgtaa cacggctaca acgacacata atggaattat caatttacga 3420 ttttgatatt atgtatcgtc cgtctgctca aatgggtaac gcagattttt gttctcgttt 3480 ccctttacct caagaaattc ctaaggattt gctgcgtgat cacgttaaga acatcaatat 3540 caccactgaa ttaccgttag actggatgac gattgcgaag gagacaaaaa atgatgaatt 3600 tctacaaaaa ggtttgttct ttttacaata tggttggcct aaaaagttgg aaaaacgttt 3660 cagagatgtc ttttcacaac atcaagatat ggaactgttg gatggttgtt tactttttca 3720 agaccgtgta gtaatacaag taagtttgca tgctgcagtg ttgaaactac ttcatagaaa 3780 tcatgcaggc attcacaaaa ttaaacaact atccagaaga acagtttatt ggtacggaat 3840 caacagtgat atagaaagtt ttgttaaaaa ttgtcatact tgtaatcaga tggctgttgt 3900 tcctaagaaa ccatctcaaa taacctggat tcctactagt cgtccattta gtaggataca 3960 tgctgatttc ttctactttg accacaaagt tttccttgtc atagttgata gttactcgag 4020 gtggcttgaa gttgagttta tgcgaagcgg aactgatgct aaaagagtta aatctaaatt 4080 tatgtctttg tttgctagat ttgggttgcc tgatgtcatt gtgactgatg gtggtcctcc 4140 gtttaattcc agatatctca tagacttctg ggaaaaacag ggcatcaaag taatgaaaag 4200 tccaccatat cacccctcta gtaacggcca agcagaaaga atggtacgtg tagtaaaaga 4260 ggtgttgaag aagtttttgc tcgatccaca aatgagggaa cttgacattg aagatcaaat 4320 atcttgtttc ttgtttaact atcgcaacac atgtacagag gactcaagtt ttccctctga 4380 aaaaatgttt aaatttaaac caaaaacgat tctcgatcta gtcaatccaa aaagtagtta 4440 taaaaagcat ctgacgactc caaaagagga gcaaagtatt ggttttaatg cggacgagtg 4500 taaaaaggat aagttggcca agttacgaac aggggatcca atttacttca aaaacttcaa 4560 gccgacagat gtgagaaggt ggctggaagc tattttcctt aaacaaatat ctcccaatgt 4620 tttccagatc tctctaggag gacgcacata cacggctcat cgcgaacagt tgaagttgag 4680 acgcaagtcg cctaaccgca tcgtttgtgg aactagcgaa atggataagg aacgttcaag 4740 ggagaagatg aatggccagg accggaatca ggaccaggac ttggataacg atttttatgg 4800 atatgttttc gattcatgtc tgttcaacat accatctggc aatagtttgc catactcgac 4860 aatttctggc agtagtttgc caagttcatc agtttctggc aatagtttgc caagttcatc 4920 agcctctgac agtggtttgt cacgatcatc agctcatggc aatagtttgc cagtaaccga 4980 gcattcaggc aattgtgtgc ctattcgaac tcctttaaca catactgacg gatcaacgcg 5040 gcagctcaac gaagattcgg tcggaagaac ggagtgccca aatggcccat cggttacaac 5100 tgcagttgat ctactcgatc cgataaataa gcctctacag ttaccagggt ctaggcaaag 5160 gaaaacaaga agaaaccgta acaaagatga agcatttggt aatcgcagtg atttaaattt 5220 aagaagatct cttcgttcaa agcgtaagac gaacagtgat gaatatgaat actattaatt 5280 tgaattttat tcggtctata ttattgtaaa tagtaattcg ttcagttaga tgaaatatga 5340 acatgtttta aatgaattgt aataacaatc tcgagaaggg agaac 5385 // ID Gypsy-20_AA-LTR repbase; DNA; INV; 174 BP. XX AC supercont1.196; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_AA_; KW Gypsy-20_AA-I; Gypsy-20_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-174 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.196; Positions 271781 271608. XX SQ Sequence 174 BP; 60 A; 27 C; 38 G; 49 T; 0 other; tgtggtgccg agtacggccc tggtgaataa taaatacagt atactgttgt tgacaaagga 60 gcgaagaacg gcaggtgccg tgaaagcaaa gtgaagaaat aataaacgtt ttctaaatca 120 ttaaataagg ttttacgtat ttatattgtc caacacaaat cctcggtttt taca 174 // ID BEL-56_AA-LTR repbase; DNA; INV; 539 BP. XX AC supercont1.150; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-56_AA_; KW BEL-56_AA-I; BEL-56_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-539 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.150; Positions 1233454 1233992. XX SQ Sequence 539 BP; 170 A; 86 C; 123 G; 160 T; 0 other; tgttggcaga agcgtagcgc acacagctga acgtgctgaa agttgccata tgcaggaacg 60 gtcacccggt agcgaggagg ctgccgaatg aaacgagccg tcgtgagaga atgtgagaac 120 tagatcgata tgagtcagca gtatagggag aagcagaagg ctacgtatgc tatagttctc 180 gcactgtttg ttggcatagt tgcaattaat atttagaaat aattattgag tatagaaata 240 cggaagttct acaacatatc cattgtgagt atataaaatt taatgaattt attgcgctga 300 attgtaaccg tgaacatcaa attataggtt gaactacaaa acgatatttg ctattgttct 360 tattgtgtcg actagaagta gggtgaattc gacaggtcac cggaattgta agatttttct 420 tatactcgct aggttgataa ctaattgttt gaatatttta gctttaaagc gccgtttaat 480 aaaacctttg aacagctaaa agttccgtga atctgtattc ctacttcgac gtgacaaca 539 // ID DNA3-5_AP repbase; DNA; INV; 134 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-5_AP. XX NM DNA3-5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-134 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1946-1946 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 134 BP; 36 A; 29 C; 32 G; 37 T; 0 other; gggccgattt caccaacaca gttaatcacg gttaaagtta accggagttt aatcggccga 60 atttggccgg ttaaatatct gttatcagcc gattaagcgt taaccgtgat taactgtgtc 120 ggtgaaatcg gccc 134 // ID BEL-46_AA-LTR repbase; DNA; INV; 485 BP. XX AC supercont1.246; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-46_AA_; KW BEL-46_AA-I; BEL-46_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-485 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.246; Positions 157677 157193. XX SQ Sequence 485 BP; 190 A; 98 C; 71 G; 126 T; 0 other; tgtgcaggat tgagcccact gtacgcacta ccactgctcc acgggataaa atctcaacct 60 tggataccag caccatgctc agcatagctg tcagtggtat tatacaaaac aaaaacagaa 120 atactaaacg tttgaacgtg aatttcatct ctattagcta ccttacttat taatctaaaa 180 tactttaaaa ccgtgttaat ttagcacatt tgaattgtaa gtagacaaag attaaaaact 240 attgaactaa aattaactaa actaattgta caacacagaa aattgaaatt gattgcggaa 300 aacaggaaga ggacaaacaa accaaattag ggactataaa cgtaagtgac tataaacaga 360 ctatgatata tgcaattaac taattctaat aaacataccc gtagcttaaa gcattcactc 420 aacaaacaca cgagttttgc taaaaaggcg tccttaattg tcttgctgtc ccaaccaccg 480 caaca 485 // ID Gypsy-151_AA-I repbase; DNA; INV; 4884 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-151_AA_; KW Gypsy-151_AA-LTR; Gypsy-151_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4884 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1027-1027 (2011). XX DR [2] (Consensus) XX CC 'KGCTC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1958..3730 FT /product="Gypsy-151_AA-I_1p" FT /translation="MVEQGVICPVDYPTDWVNNMQIVEKPNGSLRICLDPK FT PLNACIKREHFLIPKTEDILCRLNGKQVFTVLDLRNGFWQMELDRQSSDLT FT TFMTPFGRFRWKRVPFGISSAPEMFQKRMVQLFGDIPGVEVYFDDMSIAGE FT DYEEHDEILAKVLQRARDNNVRFNPAKIQYRTDKVKFMGHIASEDAIAPDS FT SYVQAIIDMPKPSNKSDVLRVLGLLKYLGKFIPNFSQRTAHLRNLTKNDVE FT WNWTDDHDRELKDVLASITEAPVLAIFDGDKQVVLQTDSSKDGLGSVIMQD FT GKPVAYASRTLSKSEQKWAQIEKELLAIVFACERFHHYLYGRRFVVQSDHK FT PLETLVKRDIDDVTPRLQRMFLHLLKYPGMDIVYTPGKNLLVADCLSRASL FT PDDVDFSEELSGMIHAVSQQACMSTDNYNYYVEKLGNDDRYRRIAYYVENG FT WPPFHQLDDLGQPFYKYRDLLHYENGLLFKEHRLVIPTAMQRCISNWLHAP FT HLGIEKTLARARSQFFWPGMTNDITELVKNCTVCEKFTRNTQKEPLLQDSP FT PEYPFQRVGIDLYEYGGRDYVVVIDAYSGYVFSERLDEKTMLHM" XX SQ Sequence 4884 BP; 1396 A; 953 C; 1171 G; 1354 T; 10 other; tggtgtcaga agtggtgagc tcggaatcgt wwataaatcg tcgcgaaaat cgtcgcgtgt 60 ttttktttaa ataagtgcaa ttttgtgctg ttcgcaagct ttgcgaaata aaaaaaaaaa 120 gtttcaagct cacaagaaca cgttcgagaa acttcgagtg aacgtttgtg tattggtgaa 180 gcaagattga agtgcacgaa aaagctccaa acgtgagaga tcgaatgttg actgcacgaa 240 agcttttgat ctttacgtca atcgaccaac tgaagtaaac aaacggtgaa gcgaaagagg 300 aagagtggaa cgagcttttt gttgctgttt tattgatacg gaagcgaaaa tggatcaacg 360 tttaccgcaa ccattggatt gtacgaactt ggccagagag tggcctaaat ggaagcaaca 420 gttctatata tatatgatcg caagtaacaa gaatgctgat gtggagcgta ataaaatagc 480 aacattcttg tggctcattg gtgagcgtgg ggtggaaata tacaacacac tctttccgaa 540 cgccggtgat gttgacagta tgtttggtgg ggcgaccgtt gctgctggag ctgttgctgc 600 tgggctgttg ctgctgggga tgctgctgct gctgttggtg ctgctggggt ttcggctggt 660 aacgtgaacg caactggaac cggaggacaa aatgtacagc gtacattagc agatgttatc 720 acggcatttg atggctattg tctaccgcgg aagaatgttg ccatggaggc atttaagttt 780 aatatgattg ttcaaaagga gaagcaatca tttgccgatt ttgaaacggc tctccgaact 840 caactggcgt actgcgaata tgaatgtgct acgtgtcatt cttcatatgc cgatcgaatg 900 cttcgtgatc gtattataat tggcgttcag gcgataagaa gcttcagctg aagctactgg 960 atggaaaaga tgaaccgttg cgaagtgtgg tggaaacatg caagatattt gaagcggcaa 1020 ctgagaataa acaacttctc gataacaagg cgcaccatga aatgcacgtc gtcgtagaag 1080 aaccagtcaa ggaagatggg tggcagtcgt caagaaagca tcgtgctaca actgtgggca 1140 gaccttcaac gagagacatc gcctcttttg tccagcgaga aaggtgaact gcgacgcttg 1200 tggccgacgt ggccatttca aacggttctg caaggcaaca agatcgggag acaaagggaa 1260 tcggaacgtt catcagctag aggaaaagca taagcccaac agggaggctg tgcacaatct 1320 caactggcgt gaatcaggta attgggaaac agataatgtt aatgcaagtc aaccaaaggt 1380 aacgtcactt ggtacttctg atttacgtta tagaatgaat tcgaacagtg gcaatgatag 1440 ggtatctcgc gtcaggtgga caaagcgata tctgattgaa gacaatccgg tggatttcaa 1500 gctggacaca ggagctgatg tgaattgtat tccaatgaaa tgcgtaaaac ggttgaaggt 1560 cccctacatt cgcaatgcct ccgtgtctaa cgttgtggat tacagttcga ataaattaaa 1620 attcatggta tatttctctt ccctgtttcg atcctgataa aggggtaccc ataatgcaga 1680 attcctggtt gttgatgact catatgagcc tatactaggt cttgagtcat gcgtagcttt 1740 cggtttggtc gaacgtctga atgccgtacg gcctgatcca gaattccctg aaagtaaaac 1800 tgatttcgtc cagcaaaatc gtgctgtttt tgaaggtcta ggtaagttcc caggcacttg 1860 ttctattgtt ctgaaagaaa attccgttcc tacgcttcac tacaagaaga gaattccttt 1920 gagtcttcat gatcgtttga aggctgagct aaagtctatg gttgaacagg gagttatttg 1980 tccagtagat tatccgactg actgggtaaa taatatgcag attgttgaaa agccaaatgg 2040 ttctctcagg atctgtcttg accctaagcc tttgaatgcc tgcataaagc gagagcactt 2100 cctcattccc aaaactgaag atatactatg tcgtttgaat ggtaagcagg tattcacagt 2160 gctggatcta cgtaatggat tttggcagat ggagctcgat cgtcaaagct cagacttgac 2220 cactttcatg accccattcg gtcgttttcg ttggaagcgg gttccttttg gtatcagcag 2280 tgcgcctgag atgtttcaaa agcggatggt tcagttgttt ggtgatattc ccggcgttga 2340 agtatacttt gatgacatgt caatagccgg agaggattac gaagaacatg acgagattct 2400 agcaaaagtc ctacagcgag ctcgggataa taatgtcagg ttcaatccgg cgaaaattca 2460 gtaccgaacc gataaggtta aattcatggg ccacattgca tccgaggatg cgattgcacc 2520 agattcatcg tacgttcagg caatcatcga tatgcctaag cccagcaata aatcagacgt 2580 tcttcgtgtt cttgggcttc tgaaatatct cggcaaattt attcctaact tctcccagcg 2640 gacggcgcat ctccggaatt tgacaaaaaa tgacgttgaa tggaactgga ccgatgatca 2700 tgatcgagag ctgaaagatg ttctagcatc tatcactgaa gcgcctgttt tggctatatt 2760 tgatggtgat aaacaagtcg tgttacaaac agatagctca aaagatggtt taggcagcgt 2820 gattatgcaa gacggaaaac ctgttgcgta tgcctctcga acactttcga aaagcgagca 2880 gaagtgggct caaatcgaga aagaattgtt agccatcgtc tttgcgtgtg agaggttcca 2940 ccattacttg tatggtcgtc ggtttgttgt acaatcagac cataaaccat tagaaacctt 3000 ggttaagagg gatatcgatg atgttactcc tcgtcttcag cgtatgtttt tacatctttt 3060 gaaatatcca ggcatggaca ttgtgtacac cccgggtaag aacttgcttg ttgctgattg 3120 tctttctagg gcatcccttc cagatgatgt tgattttagt gaggagttga gtggcatgat 3180 tcacgcagtc tctcagcaag catgtatgtc cactgataat tataattact acgtcgaaaa 3240 gcttggaaat gatgatcggt atcgtcgcat tgcttattac gtcgaaaacg ggtggccacc 3300 ttttcatcag ctagatgact tgggccagcc cttctacaaa tatcgagact tgctacacta 3360 tgaaaacgga ttactgttta aagaacatcg gttggtcatt ccgactgcaa tgcagaggtg 3420 tatttctaat tggttacatg caccgcattt gggtattgaa aaaacgctgg ctcgtgcwmg 3480 atcgcagttt ttctggccag gtatgacaaa cgacatmacc gaactwgtca aaaactgtac 3540 ggtttgtgaa aagttcacgc gtaatactca aaaggagcca ctacttcagg attctccgcc 3600 ggagtatcct ttccaacgtg ttggaattga tttgtatgaa tatgggggac gtgattatgt 3660 agttgtaatc gatgcgtatt ccggatacgt cttctctgaa cggttggatg aaaaaacaat 3720 gcttcacatg tgattgtaga ctactcgatc gtatgtttcg mttgttatgg ctatccaaca 3780 caagttcgat gtgacaatgt accttttaac tcctcggcat ttgacgcata tgccaataag 3840 tgcaacattg agttcgagtt ctcaagtccc cgatatccgc agagcaatgg tttggcagaa 3900 aaaggtgtgg cgattgctaa aaacattttg aaacgatgtt atgaaacggg tgaagtcgaa 3960 cagtttcggt atcgattgct cgaatacaat actactcctg tagcgagtat gggtctttct 4020 cccgcccaac tgttctttgg acgtcaactg aagacccgcc taccgatatc gagcgaatcg 4080 cttgttcgaa acaaatgttc ttgaagaatg acgatgttaa aagaaagcta attcaaaaga 4140 agcgcgctga tcagaagtac tactatgatc gatctgcgaa acaattaccg tccttgaatc 4200 gaggtgataa agtgatcttt aagaagaatg gtaaagagtg gcattacgga gaaatagtac 4260 ggagtgtgaa cggccggtcg tatgttatta gagataattt tgggaatcat ttccgtcgga 4320 accgacggtt tatcacacga acgataaacg agaattctaa tccaagtgaa ctgttgtcga 4380 agaagatcaa caaagtctga tgaaccgttt tgaaaaccac cacaaaccta atttgccatt 4440 gcaagtaatt cctgattctc ctgtgtccca gccaaatgtt acgagctttc ctccccatgc 4500 tgtagaacaa gaaccagttt tgtcagatta tgagtcttct ggttcatcgt cctattttga 4560 tgctgaacaa ggaagttttk aaagtgattt tgaatcggat gtgtacaaca cctcgggcca 4620 atcaaatcaa tattcaaata ttttcgcctg tccaccgacg caagaaaatc ctcagaatct 4680 atataaaaca cgaagtggtc gaattgttcg gcctccacga agatatgatg agtagstagc 4740 aacccgatcc taattatcct ttcctattat tgttgtatcc tactaacccc gagtcgccgc 4800 gggtgtacgg ctgccctgta acattaggtt taacgaaagg attttttttt gtaacagatg 4860 tagcttaaaa agaagaagaa agcg 4884 // ID Gypsy-7_SI-LTR repbase; DNA; INV; 202 BP. XX AC AEAQ01015416; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_SI_; KW Gypsy-7_SI-I; Gypsy-7_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-202 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01015416; Positions 4342 4141. XX SQ Sequence 202 BP; 62 A; 45 C; 38 G; 57 T; 0 other; tgtggtatat cgagtcacgc ggtatgaact tacttgcata gacgttcacg cacgcgcagc 60 agaactagcg aagccgcata cgacgcttag agtcacgtat attcataaag attcttctta 120 tatattatca gcattaataa agactaatat gtgcttgtaa gactgctagt gtatatgcct 180 taccttccac tcgaacacaa ca 202 // ID CR1-17_CQ repbase; DNA; INV; 1715 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-17_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1715 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 21-21 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 2..1648 FT /product="CR1-17_CQ_1p" FT /note="reverse transcriptase." FT /translation="SSSAKEAAQLCADFFKTTFVQHEDPVSDHHLRFVRPQ FT DLTILQPRLTRQDVLRALSQVNTKKGPGPDRLSPLFLKSCAGSLAHPVSIL FT FNLSLQTGIFPSVWKTAAMVPIHKSGNLHNVENYRGVSILSSLAKLLELLV FT LNAMKEATNPILSDYQHGFRKARSTLSNLMVYVPYVSKALGNRTQVDSIYI FT DFAKAFDRLPHKIIISKLAKLGLPDWLLTWLSSYLENRKAYVKHGRSCSKL FT FSVTSGVPQGSHLGPILFNLFINDVCEVIKSEKVMFADDLKFYREVYNTKD FT CAELQDDLSAMIEWCSANGMQVNASKCKVLSFYRGNSPILYRYLSDGKELE FT RVESIKDLGIVVDKKLTFKEHLSATVAKARTTLGFISRNTQEFKDFQAIKC FT LYLTLVRSILEYGVQIWAPRLIGDANLYERVQRTATRMIIRKLPSNHPVRQ FT AHYEERLQFLRMEKLSLRVSFLRRLFLFDVVNGNSDCQALIALLPPQAPER FT NRRGDVLFRIPNDGWRSYFNPFYECCKCFNSVCVKFTNGMSKLSFKSRIKG FT ID" XX SQ Sequence 1715 BP; 468 A; 396 C; 392 G; 459 T; 0 other; atcttcgtct gctaaggaag ccgctcagct gtgtgcagac ttcttcaaaa ccacctttgt 60 tcagcacgag gatcctgtct ctgatcatca tctcagattc gtcagaccac aggacctgac 120 tatactgcag cctcgtctta cacgacagga cgtgttgaga gcactttcgc aggtcaacac 180 caagaaagga cccggtccag acaggttgtc gcccttgttt ctaaagtctt gtgccggatc 240 gcttgcccat cctgtaagca tcttgttcaa cctttcgctt caaaccggca ttttcccttc 300 tgtgtggaaa actgcagcca tggtccccat tcataaaagc ggaaatttac ataacgtgga 360 aaactatcgt ggtgtttcta tactgagtag cctggcgaaa ctgttagaac tgttggtgct 420 gaatgcgatg aaggaggcaa caaatcccat cttgtctgac taccagcacg gctttaggaa 480 ggccagatct actctctcca acctgatggt ctacgttcca tatgttagca aagccctagg 540 aaaccggacc caagttgatt caatctatat cgactttgca aaagcgtttg atcggctgcc 600 gcataagatc atcatcagta aactggccaa gcttggtctg cctgactggt tattgacgtg 660 gttgtcttct tacctggaga accggaaggc atacgtcaaa catggtcgct cctgctccaa 720 gcttttctcg gttacatcag gagtccccca gggtagccat ctaggtccga ttctatttaa 780 cctgtttata aacgacgtgt gcgaagtcat caagtcggag aaagttatgt ttgccgatga 840 tttgaagttt tatcgagaag tctacaacac caaagactgc gcagaactcc aagatgacct 900 aagtgcaatg atcgagtggt gcagcgcaaa cggaatgcag gtcaacgcta gcaaatgcaa 960 ggtcctgtca ttttatcgtg gaaactctcc catcctctat cgttaccttt ccgacgggaa 1020 ggagctggaa cgtgtcgagt cgattaagga tctcggcatt gtagtcgaca agaagctgac 1080 tttcaaggaa catctctcgg cgacagtagc gaaggcaagg actacattgg gatttatttc 1140 tcggaacacc caggagttca aggattttca agcgatcaag tgtttgtact tgaccctagt 1200 ccgcagcata ctagagtatg gcgtacaaat ttgggccccc aggctgattg gagatgcaaa 1260 cttgtacgaa agagtgcaaa gaaccgcaac gaggatgatt atccgcaaac ttccctccaa 1320 ccacccagtg agacaggctc actacgagga gagactccaa tttctgagga tggaaaagct 1380 ctctttgcgg gtttcattcc tcaggcgtct tttcttattc gacgtagtga acggaaactc 1440 ggactgccag gcactaatag ctttgcttcc gccgcaagct ccagagagaa atcggagagg 1500 tgatgttctc tttagaatcc cgaacgatgg ctggagaagt tattttaacc ctttttatga 1560 atgttgtaaa tgttttaatt ctgtctgtgt caaatttacc aacggaatgt ccaaacttag 1620 ctttaaaagt agaattaagg gtatagatta aattcagtct gtggatattt tttgtatcaa 1680 agacggtgtg actaaaataa ataaaataaa ataaa 1715 // ID BEL-131_AA-LTR repbase; DNA; INV; 626 BP. XX AC supercont1.255; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-131_AA_; KW BEL-131_AA-I; BEL-131_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-626 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.255; Positions 801625 802250. XX SQ Sequence 626 BP; 237 A; 87 C; 115 G; 187 T; 0 other; tgtcaacacc actgggatga cgagaatgcg ccagtctaca aatcggatta gagataagat 60 aatgggaaaa caaaaacaaa acctagattg aatgtgaatt gtgctatatg aaactattgg 120 aaattgttga atttgagacg aaaattattg aattaaaaac taaaatttgg tggtaatcta 180 cgggcattta ttaagtaagt tgttgttcat gaatttgaag gctaaagtta taaaatacat 240 actcaaatga tggtttatgc ttaggtgaac tacgacgtgg ttacgactac tatttttgat 300 aaaaactatc tactaaagtt tggcaactgc aagataaaac cgtttgtaag ttagcaaaaa 360 attcaaagtg agtactctag aactaaaatt ataaatgaat atactaaaac atcaatttac 420 taggctaatg ttaaaccgtt ttctgcgcaa aattggatcc gttcagcgaa attgttccac 480 gaggctagca gtgaaactaa atttgtaagt cacagataaa aagttatgta aattcgtatg 540 acaaatgtaa ttattgattt cagtttgagc tgctctaaat agactgctac agaaaataag 600 tgccgtttct gcatacaatc cgaaca 626 // ID Polinton-1_TV repbase; DNA; INV; 20724 BP. XX AC . XX DT 14-MAR-2006 (Rel. 11.02, Created) DT 01-APR-2011 (Rel. 16.03, Last updated, Version 2) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Polinton-1_TV; Maverick; Tlr. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-20724 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This transposon is characterized by 135-bp terminal inverted CC repeats and 6-bp target site duplications. The consensus sequence CC was built based on multiple alignment of several copies that are CC >90% identical to each other. It encodes a family B DNA CC polymerase (POLB-1_TV), retroviral integrase (INT-1_TV), ATPases CC (ATP-1_TV and ATP1-1_TV), and 6 unclassified proteins (PTV1-1_TV CC - PTV6-1_TV). PTV2-1_TV matches glycoproteins/structural proteins CC from phages, including short tail fibers, responsible for CC adsorption to host cells and interaction with receptors. A CC protein similar to PTV6-1_TV is also encoded by Polintons from CC Entamoeba. XX FH Key Location/Qualifiers FT CDS 765..1712 FT /product="PTV1-1_TVp" FT /translation="MNKDDVINFLYPTSKKEKPKEPDVFPEYYEERIKDDE FT YDEIERLEKIRDEIDEKIKELREVEKYNENEVEGLMNIPASESEVQKSHEE FT RYKNYVLESTQNLLDGDIDETLSEEGTKFIHKHKLQFIDDGDYPCILLSPP FT LDSEKFKKALKNVTYATNTIKKEDHDIYTNYYLNKCTGVDKVFDLLDKISK FT ETIGTYKILCDCGFIIEDTNNVSYERTAPKEDEVERSIPFVIKNIKDMKTY FT KHYIYSFISEKMEATHKTSSHHFIAIHTFLFKVVKLNKTGIRFNVSGYAFL FT SKLKCIRHVADDKNLCVLNSFYGL" FT CDS 17735..16911 FT /product="ATP-1_TVp" FT /translation="MDLNSSTPYEELAGVYNTYVDDMINEALSDNRNALSK FT EKLMKIPYSECYLNSITAAIGKQNKGKTLTILKQIIIIANTSPHSHVLIYI FT NRSGSPSDDTFESLKHLIKIPIIYLSQEEAEDYLKNFLMYKELYNTIKKQG FT LEDKIIDKQRDELFETLSISNFDREFLHSLIFLDDAVKSKLITNEKSYFNQ FT LLTQCRHIQCSFFMAVQYFKALSTNIKSNLSTLFIFSGFSRQQLNVMLYQV FT NLPMSINELYTQYQQLGEHGKIIVDLNKGSVKFD" FT CDS 16867..16127 FT /product="PTV5-1_TVp" FT /translation="MTEIANLLQVPLKQGTNKKGAYNRTMKVPDGMKELVH FT IVAGLQKDVKQMRNVLTLPEAQAYAKRHGPNWEAHEADITGPNGKPDGINE FT VFVTDGQGNIKVINGYKLTISDYPKRKAYNERYPMQFDENGKAVKQYIGEG FT ENGPINSYRHNPYSYFNEELNAIEEGPDGLPRYSNQFDGQYANIRPEISAK FT KFFKVALFDPVFANELKDELKQQFQPIIVARIASGALSQAFKDLIKKPLYQ FT QRGYNL" FT CDS 14713..14006 FT /product="PTV3-1_TVp" FT /translation="MSIINIISNNNCELNLTVSGIYNVTLVSVSGKFNPIL FT LDPSDYISFIKDDHEKTYDMYINNVIAKWDINMILSYLNESFIVWYEENFK FT SKPDKLFELDEINNLIKFPLKIKFCTPHLRSLLGILNFEKDVGDVLYSDTG FT SNFILIKCDKVNTPMRFSHQTLDPNSYPSIQNILSINLNTFSLSYPFQLSG FT NSFILHSKDLSNIKFRITDLFDCDLEFMNEILWQFQIEKIADEITD" FT CDS 13978..12350 FT /product="PTV2-1_TVp" FT /translation="MIKQQQPQQLKAQSTQVLQTITYAIYAYNSKTAEQTQ FT RLKYGDLEIVLKNDHFEFVCKGGAVISNIKEILFMNSKIKGITDDITSIPP FT EEQRYYALNAFPKVLKIGRFNLNEEFIKGFLNDIMKKADKEYVASELMKKA FT NLVYVDEKDNEIKERVEWYHEWTSKIFKTKADTGWVSGCLDLKADKTYVND FT ELEKKADKEYVNEIYQSLIGTTKNIETIVGDFSVNEENKYFRWQFVQSLKN FT GTRCKLIPKNNNEMYVSYKKNDGTQVKVPLDNNCYLNLYGDEQKKYLRVYE FT MADMTLPDPAPAPYYYDYGDDDRTPHVDEQKLTLYLDYYRYKRYNAIEKTR FT IVYYYTDDRNKYYSQDFTYPENIRLYYKKYSDTNQKKPEIVALYFKFKRDN FT PIISHMFDLMNPVGSIYTSMDARGPQVMFGVGAWEQIVDRFLYCANSSKET FT GGSKTISGENLPAHSHYIDLSTSQAGWHKHRYWDWSGMTKGKGYDVKDDVK FT FAINCYWSDTQGEGNHTHFVSGYTQTTGQSKEYMPPYMTVYAWYRVQ" FT CDS 15908..15213 FT /product="PTV4-1_TVp" FT /translation="MRRRFTSQKNTNAKAEITVGYNVNDDKSRIIQNVKVN FT VSPELTRSATQPNLLTRDLPKNEEEQNKEDGDPIILSEPGKEPVEIPTYPT FT YPPPLSSNIENPGIYDSSRNAPYKIDINSELMKIYRDIIINNYDSIINLID FT QSGLIILPYESLIHIIAILCDVQDSDVKIQFFIDDEVSCCGTVAKINPIKD FT ISTIKISKNGTTTELHLTYNDIYNKIVDEYKISLEKFFVKAI" FT CDS 5677..6900 FT /product="INT-1_TVp" FT /translation="MNYINYIILSRKNQRNKKKVQNAQVLQTINEAQALIY FT DSLVKPYSARLLNEDQLLEFILQHKLEAGKYIKPLYTAYLKQMNRIHPELM FT KGGMKSKIKADKIKPLATPKPPTTVPPPPSVNPSSFTLLYPFQTLKKQKDF FT KKDFKILNKPIDPKIQPLKEKFSRPSFAPYPYSYEIDHLEYSKGNVTYLFA FT ININTRYLYCIPVKGKTEQETRRAIQYLLDHERVDNIRGDGDKGFQAAMTH FT YFPQINFYFSSSPYTFHNKIVDAVMRTLRDALGVNGQIYWDGNHDSIIQQL FT VYYYNNTWHRTINMKPVEMKNDISKEWEYIRKQMERLNDVKREQINSGLMN FT FKQGDKVLIHLDYGKTDKSMTKRRRRFDTIAEFVRYINGNALVEVDNKLIE FT IPIYFITPLYLNNI" FT CDS 1791..5255 FT /product="POLB-1_TVp" FT /translation="MNKNGTAKMNDNQIRAEGRRLFKRFYNIPDGVNPKTR FT RSYLNEAIDAYEGFDISSESERLARMLNVNIDFYYCDPQPEDVDINKVDFP FT LVESIMIDPEFETVNILLTQSPCGKLHADRITDVEALTGYKVCPFCKEEVY FT SIRDDPERKNQRRFLKHCEKCKENNGRLIQDVQLQNTQQPYAPHITKQKIY FT QWLLAHNLQEYYKPTRYYITFDFETLETKEELQLSECATLNAYLKPFMVSS FT TVKIKPQSGEVVDRDKTFTRNFCLTSSDSFISDWIEFLFSTCSLVAKSNIE FT QYEELMNTLNEKQKEAFKELLDEEFNKVNIIGFNSGKFDLNLILRDLNTAK FT WKIKSMLGSSSQFKQIIVKKTNVPYELRFIDIRNYIAGGTLDQFTQDFGNN FT KSRVKSFFPYQFITWENYEEELYKEEPFTQDSFYSALTQETISDSDYQIYL FT NDAKNFKNRLEYFIHYCNKDVEIMIDPIDKIIEETFVYKIDMLHNLSLSSN FT ASMIRYALAYKDFNPNEKYPEEEIESSFELTKKYWKYKIESYKKQDEKAER FT DTKNNVSMDDFQYYHDLILSSKCKMCGKAFTYANKPTLDRIDNEKPHTKEN FT CQLMCCYCNVVKSNKDEDVQRLRINLRNYALKNNLPMTLDSVEAYHILRNG FT ITGGLSNVQHRVNLKGITHINKLSYNPETKTVSNEDTTNIMTHFVGVDFNS FT LYPSSFSSNPHDFIQYTDHKMYMPGRLNGVIICDTATKKAKALGIIKKKNT FT LFVAEVKGHIDEKYINDYINFLPIIRNLDIITDEKTIGSFMYNYMKSNNLP FT VDQKQRKLTQLASTHNQYQPFSSYYLWYLIDRFHFIIDDIQSILTFTKNTC FT FNEFANEFMNERQKAELEGNKGKSLFCKISLNGSYGYDAMNTQNYAKTKIM FT NAQKARVACMSNKFKNIREIGEDTYQVMLKDRFYRCDTCLQEAFFTLDNAK FT YWYLVFIYDFMYKCMNVNRFHFIEGDTDSSYWAIAGDPNLPNTQAFQAIVT FT DKQFYDKNIFKFAPFDFFCFDEKFKPKLKNKAEEKAHEKKLLGLAIEKQGD FT NMVALCPKCYTSFNGSIDGSDFKKIAQKMKGVSLRQNKQLTPKNYLDIIND FT KVIFDGQNINLQLKNGSMTRLTIGKTALTGAHTKAVCCENGCCMPFI" FT CDS join(11779..11411,11495..11046) FT /product="Polinton-1_TV_11p" FT /note="KilA-N domain." FT /translation="MMQSKNKVSFKIIFIYFFILYVKXVMSVEAKTFTNKY FT NGETFTKGTYNDIEVLRRDKDGYINATNMCQRFRKDFRKLLENKSWEEYYK FT AFCEEYSNAGIQTIAFYTKFMQEFLMKSNRLEERIQRRNSDNCFLYKIYAG FT IPDEIKQVRGTYIDPRLINYIAMWASPKYCIAVGKIMDSIDKKVHEKLDEE FT ELEDTVENAKHLFEEEVGKMCGKQLEHEREKCYGYRDSPYELDQWEQEDLK FT REFREYELAKIALKTAEKKLKVWGRFVQKYCE" FT CDS 10465..8156 FT /product="ATP1-1_TVp" FT /translation="MTATNNNSNYIVSDEKVIDSLAEELVVAETKTLNERI FT GENMIDLHNYLKHDGGRGRKTGMALLVNKISNLTIIDVDINKSYNDELKET FT VRKDILSKLSDKDVIVKTASGGLHIYCNTNFFYAVSNRMIKCYSCNDYDID FT IMTSMDDSKRSLVVMADSRVRKNATEPINTYSFIRGSYDSTLTRTVNDILN FT DLNIKVRVEQKNEEIKTIMNENKDVNIDDKLAQSIVDGICDFEVHNDGGNM FT NLEKEVTLFTLFQAINSLPNHLINEAYDNVYNFCNLTENAKSNFEKARSRY FT AHLMTSPFVLVKILKLYQKEYYDEYVLPLLRKPKITFDIKLDDSFLITDIR FT RKAENHQYKNSSEVIEDLSRVIRFVDCGNKYFIQKDYNIHDKMNQISFVLQ FT SNMKESLKMIKLFKEEGKTITAWDVLLNQLSSLTVKGVCFKSDSPDVFSTF FT QGYKYNILEKPDYSKIEMFMNFIKEAICDNNDEVYKYLLGWIASMIQHPGI FT KNETAIILKGLQGTGKNRFTDIISELLAGYSCKNITEISELTGNFNSVVEN FT KMFLVLNELKNVGEDRLANFNALKSIITDNEIRINEKNQPRRTAQNVANFI FT FVTNNVYPVKIEVGDRRYVVLRVNGKFKGQFDYFKNLMDSCTKEFYDNLLT FT YFINYDLSSFNVRIIPMTEAKQDLIELSTNPLDVWINTHYDELCAGMTCKN FT ALFCKPSDMKDKNFQMSIKDKCDRKQRRIDGKQTWCYVLKEEMKGLYKQVE FT PNDNEDSFTDDEIPVNDAL" FT CDS 19400..17763 FT /product="PTV6-1_TVp" FT /translation="MTTNYLVPNTKPINGNVFHSFEKINEIFAKNDGRRYY FT RELESSCYDSDTPYFDDKQTHLRITNPAHDVNQLGDTYIKRKVKATLVSNK FT AFTCDAAFKCMRLFVGYKSSNQSFKQLEVETERGDAGYLQMDCARESFAFA FT TYKPKSERKVKKFVHSLYNNVQKYDNSVCGKYIDPSEVFKKANEEVDVTFD FT MIIPVNDLLAFQAFDDYPKSFGDIILKYYVFRDTLVFCQVDPLRVADLQNY FT VYEKITDENYEKIMNHVYKYDRKFQQIGLPAKLITSLAATSADATVDDVII FT SCTKMEVLEDKCITPGYGLIEESVKAIPRLFSKEDPFMIPAQHIDVRSLNG FT GCITPEGISSDFSYALHNVTDVVLTFPKNAMQRTVLENPMLRGLQVTIDSR FT NFPDQAMTTVGDIFFTRMLQASDLDGVNECTEEYEDSLTRPLNADDGTPLA FT RTWKDQTSFLCTIPVQRDGAGIFFDGLETFNSQVTVQVKAYPIIQGENDTY FT CSKVTNPPQVWFTRTTYWTWTPDEGLKYHPTGTPPQYRSSYDEILMNRNV" XX SQ Sequence 20724 BP; 7294 A; 3115 C; 3048 G; 7254 T; 13 other; agtagtagga taggatgggt tgtgtgactt tttttgtaaa cgtaagaatt catacaatat 60 cgagcgattt tgtatcattt ttttaacata tgaagtaaaa aatgtatgat tcttacggtt 120 tttattctta cgtttttgaa acaaataaaa caagattaaa tagattatcg aattttagag 180 gatggtgttc ctatttatat tcaaagattt aaagggaagc aatattatag aattgttaat 240 cctgaatata taaaacattt atttaaaaag gaagactgtg aattattaaa cgttaatgaa 300 tatgaaaata mtaaatcaaa actgagatat aaatatattg gaagtgatga aagatgcaaa 360 gataaagata aagtttatac agtttcatat aatgaatgga aaaccaataa gataagaaaa 420 catttaggag attatggtaa aagttctgaa ttctttaaat ctagtatgta cgagagagta 480 aaagaaatta gaaataatta ttttcatgaa taaaagattc actaccataa tgaaataaaa 540 aataaaaaaa ataaacttta aaaattcatg tcttacattt ttgaataatt tatttttatt 600 ttgtatgaaa tgaggtcaag actaacgacg ttaaaaaatt aggattcaca aacggggaaa 660 taaaggatat attagaggag gcaggttatg aggttgataa atattattta agaaattgtc 720 aaaaattcgc ggagttttta aatgagtttg attatgatgg aaaaatgaat aaagatgatg 780 ttataaattt cttatatccc acatctaaaa aagaaaaacc taaagaacct gatgttttcc 840 ctgaatatta tgaagaacga attaaagatg atgaatatga tgaaattgaa agactggaaa 900 aaataagaga tgaaatagat gaaaaaatta aggaattaag agaagttgaa aaatataatg 960 aaaatgaagt agagggttta atgaatatcc cagcttcaga atcagaagtt cagaaatctc 1020 atgaagaaag atataaaaat tatgtcttag aatcaacaca aaatttatta gatggtgata 1080 ttgacgaaac actttcagaa gaaggcacaa aatttattca taaacataag ttacaattta 1140 ttgacgatgg agattatcca tgtatattac tttcacctcc tcttgattct gaaaaattca 1200 aaaaagcact gaaaaatgta acatatgcaa caaatacgat taagaaagaa gaccatgata 1260 tttacacaaa ttattattta aataaatgta caggtgttga taaagttttt gatttgttag 1320 ataaaatatc aaaagaaaca attggtacat ataaaatttt atgtgattgt ggtttcatta 1380 ttgaagatac caataatgtt tcatatgaaa gaacagcacc gaaggaagac gaagttgaaa 1440 ggtctatacc attcgttatt aagaacataa aagatatgaa aacttataaa cattatattt 1500 attcatttat ttcagaaaaa atggaagcaa cccataaaac atcatctcat cattttatag 1560 ctattcatac attccttttc aaagttgtga aattaaataa gacaggaatt agatttaatg 1620 tatcagggta cgcattcttg agtaaattaa aatgtattcg tcacgtcgca gatgataaaa 1680 atttatgtgt tcttaattca ttttatggac tttagccctg gactctctga gtctcggact 1740 ttaaaaaaaa taaactttat ttttattttt taatttttat tcttacataa atgaacaaaa 1800 atggaacagc taaaatgaat gataatcaaa taagagccga aggtagaagg ttatttaaac 1860 gcttctataa cattcctgat ggtgttaatc ctaaaactcg tagaagttat ttaaatgaag 1920 ccatagatgc atatgaagga tttgacattt catcagaaag tgagaggtta gcaagaatgt 1980 taaacgttaa tattgatttt tattattgtg accctcaacc agaagacgta gacattaaca 2040 aggtagactt tccattagtg gaatcaataa tgatagatcc agaatttgaa acagtaaata 2100 ttttattaac acagagtcca tgtggaaaac ttcacgctga cagaataaca gacgttgaag 2160 cactcacagg atataaagtt tgcccctttt gcaaagaaga agtatattct atccgtgatg 2220 atccagaaag gaaaaaccaa agaagatttt taaaacattg tgagaaatgt aaagaaaata 2280 atggaagatt aattcaagac gttcaattgc aaaacacaca acaaccatat gcaccacata 2340 tcacgaaaca gaagatatat cagtggctat tagcacataa tttgcaggaa tattataaac 2400 caacaagata ttatattact tttgacttcg aaacattaga gactaaagaa gaacttcaac 2460 tttctgaatg tgcaactttg aacgcttact taaaaccatt tatggtatca tcaactgtca 2520 aaataaagcc gcaaagcggc gaggtcgtag accgtgataa aacatttacg aggaattttt 2580 gcttaacatc aagtgattcg ttcatctctg attggattga atttttattt tcaacctgtt 2640 cattagttgc taaatcaaat attgaacaat atgaagaatt aatgaataca ttaaatgaaa 2700 aacagaagga agcatttaaa gaacttctag atgaggaatt caataaagtg aatatcatag 2760 gttttaattc gggaaagttt gatttaaatt taattcttcg tgatttaaac actgcaaaat 2820 ggaaaataaa atcaatgctg ggttcttctt ctcaatttaa gcaaatcatt gtaaagaaaa 2880 caaacgttcc atatgaattg agatttattg acatcagaaa ttatatagcg ggtgggacat 2940 tagaccaatt cactcaagat ttcggaaaca ataaatcacg tgttaaatcc tttttccctt 3000 atcaattcat cacatgggaa aattacgaag aagaattata taaagaggaa ccattcactc 3060 aagattcatt ttattcagca ttaactcaag aaactatatc agatagtgat tatcaaatat 3120 atctcaatga tgctaaaaat tttaagaata gattagaata ttttattcat tattgcaata 3180 aagatgttga aattatgatt gacccaattg ataaaattat tgaagaaaca tttgtttata 3240 aaattgacat gcttcataat ctttctttat catcgaatgc ttcaatgatt cgttacgctt 3300 tagcatataa agactttaat cctaatgaaa aatatcctga ggaagagata gaatcaagtt 3360 ttgaattaac gaaaaagtat tggaaatata aaattgaatc atataagaaa caagatgaaa 3420 aagcagaaag ggatactaaa aataatgttt caatggatga ttttcaatac tatcacgatt 3480 taatcctttc atctaaatgc aaaatgtgtg gtaaagcgtt tacatatgca aataaaccaa 3540 cattggatag aatcgataat gaaaaaccac ataccaaaga aaattgtcag ttaatgtgtt 3600 gttattgcaa tgtcgtaaaa tcaaataaag atgaagatgt tcaacgttta agaatcaatt 3660 taagaaatta tgcattaaaa aataatttac caatgacttt agattcagtt gaagcttatc 3720 acattctccg taatggaatt actggtggtt tatcaaatgt tcaacataga gtaaatctta 3780 aaggaatcac gcacattaat aaactttcat ataatccaga aactaaaact gtatcaaatg 3840 aggacaccac caacattatg actcatttcg ttggtgttga ttttaattca ttatatcctt 3900 catcattttc atctaatcct catgatttca ttcaatatac tgaccataaa atgtatatgc 3960 cgggaagatt gaatggtgtt atcatttgtg atacagcaac aaagaaagca aaagcactgg 4020 gaataattaa gaagaaaaat actttattcg tggctgaagt aaagggtcac attgatgaaa 4080 aatacattaa tgattacata aactttttac cgataataag aaacttagat attatcacag 4140 atgaaaaaac aattggcagt tttatgtata attacatgaa atcaaacaat ctaccagttg 4200 accagaaaca gaggaaatta actcagttag catcaacaca caaccaatat caaccatttt 4260 cttcttatta tctttggtat ctaatagaca gatttcattt tatcattgat gatattcaat 4320 caattttaac atttacgaaa aacacttgtt ttaatgaatt tgcgaacgaa tttatgaacg 4380 aaagacaaaa ggctgaatta gaaggaaata aaggaaaaag tttattttgc aagatttcac 4440 ttaatggatc atatggttac gatgcaatga atacacaaaa ttacgcaaag actaaaataa 4500 tgaatgcaca gaaggcacgc gttgcatgta tgtcaaataa gtttaaaaac ataagagaaa 4560 taggtgagga tacatatcaa gtgatgctta aagatagatt ttatagatgt gatacatgtt 4620 tacaagaggc attcttcact ttagataacg caaaatactg gtatttggtt ttcatttatg 4680 attttatgta taaatgcatg aatgttaatc gttttcattt cattgaaggt gatactgatt 4740 catcttattg ggcaatcgct ggcgacccaa atcttcctaa cactcaagca tttcaagcaa 4800 ttgttaccga taaacaattt tatgataaaa acattttcaa attcgcacca tttgatttct 4860 tctgttttga tgagaaattt aaacctaaat taaagaataa agctgaagaa aaagcacatg 4920 agaagaagtt attgggttta gcaattgaga aacaaggtga taacatggtt gctttatgtc 4980 ctaagtgtta tacatcattc aacggttcaa ttgatggaag tgattttaaa aagattgcac 5040 aaaaaatgaa aggtgttagt ttacgacaaa ataaacaatt aacacctaaa aattatttgg 5100 atataataaa tgataaagtg atatttgatg gtcagaatat taatttacaa cttaaaaacg 5160 ggtcaatgac tcgcttaaca atcggtaaga ctgcattaac aggagcacac actaaggctg 5220 tatgttgtga aaatggatgt tgtatgccat ttatttaata caatttaaaa ttttattttt 5280 aataaaatgc cacagaagag acaaaaggaa ccgttaataa tggttttgat actatgattc 5340 aaagaggtgg gcacccttaa aacatttatt ttatttttat ataaatggat aacgcagatt 5400 taaaaatttt aattgatgat ttagtattta atcaaaattt tcttaattat tatgaattaa 5460 aatcatatct ccaaaaatat aataacaata taacgggtga tgatatggaa tattattatc 5520 catattataa aaatgaaatt ttgaaatatc atgataaaat aatttctaaa attaaagaca 5580 aaattaaaaa taatgaatat acaaagggaa aaactaaatc acaacttaaa cagttgaaag 5640 atttcaaaaa taaagtatta acagaagaag atattgatga attatataaa ttatataatc 5700 ctgagccgca aaaaccaaag gaacaaaaaa aaggtgcaaa acgcccaggt ccttcagacc 5760 ataaatgaag cacaagcttt aatttatgat tcattagtta aaccatattc agcccgactt 5820 ttaaatgaag atcaactttt ggaattcatc cttcaacata aactcgaagc gggaaagtat 5880 atcaaacctt tatatacagc atatttaaaa caaatgaata gaatacatcc agaattaatg 5940 aagggaggaa tgaaatccaa aatcaaagca gataaaatta aaccacttgc aacaccaaaa 6000 cctccaacga cagtaccgcc tcctccttca gttaatcctt catcattcac tttattatat 6060 ccatttcaaa cattaaagaa gcaaaaagat tttaaaaagg atttcaagat attaaataaa 6120 ccaattgacc cgaaaattca accgttaaag gaaaaattta gtcgcccttc atttgcacca 6180 tatccatatt catacgaaat cgatcatctg gaatattcaa aaggaaacgt tacttattta 6240 tttgccatta acattaacac tagatattta tattgcattc cagtgaaagg gaaaacagaa 6300 caggaaacaa gaagagctat tcaatactta ctagatcatg aaagagtaga taatatcagg 6360 ggtgatggtg ataaaggatt tcaggcagct atgactcatt atttcccaca aattaatttt 6420 tacttttcat cctctccata tacgtttcat aataaaatag ttgatgcggt aatgagaact 6480 cttagagatg cgttaggggt taacggtcag atatactggg atggaaatca tgactccatc 6540 atacaacagt tagtatatta ttacaataat acatggcata gaactataaa catgaaacca 6600 gtggaaatga aaaatgatat ttcaaaagaa tgggaatata ttagaaagca aatggaaagg 6660 ttgaatgatg ttaaacgtga acaaattaat tcggggctca tgaattttaa acaaggggat 6720 aaagttttga ttcatctcga ttatggaaaa actgataaat caatgacaaa aagaagacga 6780 agatttgaca caatagctga atttgttaga tatattaacg gcaatgcatt agttgaagtt 6840 gacaataaat taatagaaat tccgatttat tttattactc cattatattt aaataatatt 6900 taattttttt atttcgtaag aaatgagtga caacaaagca gccaagagtc cagacgatga 6960 atataaaaag aaactaaaca gaattaaatc aaaaatatgc tattacaaaa agaagccgca 7020 atgtggtggg gttgaaaacg ataaagagag gaaagaaata atcgaaaaat tagaaactta 7080 ccgttcaatc cttaaactga gtgaagcaaa aataaaagaa tttaatagaa taaataaatt 7140 gattggaaga gatgaattta ataaagatga atttttaaac tcaatccaaa tttaattttt 7200 attcctcttc gacaacatct ctctttccat cattactgca taaccacatt ttctcagttt 7260 tagtggtttg ttatttttaa catgttcttg catcattttt atcatttcct cccttgttat 7320 gattcgtggt aaattttgag cttgtggggt atccatttat gataaaaata aaaatttgtt 7380 atcagttatt ttatcgttaa tgatttcagg ttattcaata attaattatt tttgaagaac 7440 tgaaggtgtc actatgtcac catgtcacca ggttawttaa cttctctgaa aaaatgagac 7500 aacatgacat tttgacaaca caacaggagg aggaaggcag ttatgaatta tttttgaaga 7560 actgaaggtg tcactatgtc accatgtcac caggttaatt aacttctctg aaaaaatgag 7620 acaacatgac attttgacaa cacaggagga tggcagttat gaattatttt tgaagaactg 7680 aaggtgtcac catgtcacca tgtcaccagg ttawttaact tctctgaaaa aatgagacaa 7740 catgacattt tgacaacaac acaggaggaw ggcagttatg aattattttt gaagaactga 7800 aggtgtcacc atgtcaccat gtcaccaggt yatttaactt ctctgaaaaa atgagacaac 7860 atgacatttt gacaacacag gaggaaggca gttatgaatt atttttgaag aactgaaggt 7920 gtcaccatgt caccatgtca ccaggttatt taacttctct gaaaaaatga gacaacatga 7980 cattttgaca acaacacagg aggaaggcag ttatgaatta tttttgaaga actgaaggtg 8040 tcaccatgtc accatgtcac caggttattt aacttctctg aaaaaatgag acaacatgac 8100 attttgacaa caacacagga ggattaatgc aaaaataaat tccaatcata ctttataaag 8160 catcatttac aggtatttca tcgtcagtaa atgaatcctc attatcgttt ggttcaacct 8220 gtttatataa tcctttcatt tcttctttca aaacataaca ccatgtttgt ttaccatcta 8280 ttcttctctg tttcctatca catttatcct taatgctcat ttggaaattc ttgtctttca 8340 tgtctgatgg tttgcagaat agagcatttt tacatgtcat acctgcacac aattcatcat 8400 aatgtgtatt tatccataca tctaaaggat ttgttgataa ttcaattaaa tcttgtttgg 8460 cttcagtcat cggtatgatt cgtacattaa atgatgaaag gtcgtaattg ataaaatatg 8520 ttaaaaggtt atcataaaat tcctttgtgc atgaatccat caaattcttg aagtaatcaa 8580 attgaccctt gaatttacca ttaactctta aaactacgta tcttctgtct ccaacttcaa 8640 ttttgacagg gtagacgtta ttagttacga aaatgaagtt tgcaacattc tgagcagttc 8700 ttctgggttg attcttttca ttaattctaa tctcattatc cgtaataatt gatttcaaag 8760 cgttgaagtt tgcgagtctg tcttcaccta cattcttgag ttcattaagt acaagaaaca 8820 ttttattctc aactactgaa ttgaaattac ccgttagctc acttatttca gtaatgtttt 8880 tacatgaata accagcgaga agttcagaaa taatgtctgt aaatctgttt ttacctgttc 8940 cctgaagtcc tttcaaaata attgctgttt cattcttgat tccaggatgt tgaatcattg 9000 atgcaatcca gccgagaagg tatttataca cttcatcgtt attatcacaa atggcttctt 9060 taatgaaatt cataaacatc tcaatttttg agtaatcagg cttttcgaga atgttgtatt 9120 tataaccctg aaacgttgag aaaacatctg gagaatcaga tttaaagcaa acacccttaa 9180 cggttaatga ggataattga tttagtagta catcccatgc tgttattgtt ttaccttctt 9240 ctttaaataa tttaatcatt ttgagtgact ctttcatgtt tgattgaaga acgaatgata 9300 tttgattcat tttgtcgtgg atattatagt ccttttgaat gaagtatttg tttccacaat 9360 ctacaaaacg gattacacgt gataaatcct caattacttc agaactgttt ttatactgat 9420 gattttcagc cttgcgtcta atatctgtta tcaaaaacga gtcatcaagt ttgatatcaa 9480 atgttatttt tggcttacgt aataaaggta aaacatattc atcataatat tccttttgat 9540 atagttttag aatcttcacc aaaacaaatg gagaagtcat taaatgtgca tatctactac 9600 gtgctttttc aaaattactt tttgcatttt ctgttaaatt gcagaagtta taaacgttat 9660 cgtatgcttc attaattaaa tgattaggta acgaattaat agcttgaaac agtgtgaata 9720 atgtaacttc tttttctaaa ttcatgtttc caccgtcatt atgtacttca aaatcacaga 9780 tgccatcaac tatgctctgg gcaagtttat cgtcaatgtt tacatctttg ttttcattca 9840 tgatagtctt tatttcttcg ttcttctgtt caactcttac cttaatattc aaatcattta 9900 atatatcatt cactgttctg gttaatgttg aatcataact tccacgaatg aatgagtatg 9960 tattgattgg ttctgttgca ttcttgcgaa ctcttgaatc agccataact actaaagaac 10020 gctttgaatc atccatagaa gtcattatat caatatcata atcattgcag gaataacatt 10080 taatcattct gttcgaaact gcatagaaga aattagtgtt gcaataaatg tgaagacctc 10140 ctgaagcggt tttaacgata acatctttat ccgaaagttt tgataaaatg tcttttctaa 10200 ctgtttcttt aagttcatca ttgtatgatt tattgatatc aacatctata atcgttaaat 10260 ttgaaatttt atttactaat aaagccatac cagttttcct tcctcttcct ccatcatgtt 10320 tgaggtagtt atgtaaatca atcatgtttt caccaattct ttcgttgagg gttttagttt 10380 ctgccacaac taattcctca gctaatgaat caataacttt ttcatcagaa acgatatagt 10440 tgctgttgtt gtttgttgcc gtcatttata taggaataaa aaaataaaaa taattttaaa 10500 actaacttta tttttggact gcatcatatt tccagtgatg ttgtcaaaat gtcatgttgt 10560 ctcatttttt cagagaagtt aaattacctg gtgacctggt gacctggtga ccccttcagt 10620 tcttcaaaaa taattaatta ttgaataacc taaaaccatt aacgaaacca ttaacgataa 10680 aaatatattt ccagtgatgt tgtcaaaatg tcatgttgtc tcattttttc agagaagtta 10740 aattacctgg tgacctggtg acctggtgac ctcttcagtt cttcaaaaat aattaattat 10800 tgaataacct taaaccatta acgaaaccat taacgataaa aatatatttc cagtgatgtt 10860 gtcaaaatgt catgttgtct cattttttca gagaagttaa attacctggt gacctggtga 10920 catggtgaca ccttcagttc ttcaaaaata attaattatt gaataacctt aaaccattaa 10980 cgataccatt aacgataaaa atatatttcc agtgttgttg tcaaaatgtc atgttgtctc 11040 atttactcac aatatttttg tacaaaacga ccccaaactt ttaacttctt ttctgcagtt 11100 tttaatgcta tcttagcgag ttcatattct ctaaattctc tctttaaatc ttcttgttcc 11160 cattggtcta gttcgtatgg agaatctcta taaccataac atttctcacg ttcatgttct 11220 aattgttttc cgcacatttt tccaacttct tcttcaaaca agtgtttagc attctctact 11280 gtatcttcga gttcttcttc atctaatttc tcatgaactt tcttgtctat ggagtccata 11340 atttttccaa ctgctatgca atattttgga gaagcccaca ttgctatata gtttataagt 11400 cttgggtcta tatacgttcc tctaacctgt ttgatttcat caggaattcc tgcataaatt 11460 ttgtataaaa agcaattgtc tgaattccgg cgttgctata ttcttcacaa aatgccttat 11520 aatactcttc ccaggactta ttttccaaca actttctaaa gtctttcctg aaccgctgac 11580 acatatttgt tgcattaatg taaccatcct tgtctctacg taagacttca atatcgttat 11640 aagtgccttt tgtgaatgtt tcgccattat atttgttagt gaatgttttt gcttcgactg 11700 acattactrt ctttacatac aaaataaaaa aataaataaa aataatttta aaactaactt 11760 tatttttgga ctgcatcata tttacagtga tgttgtcaaa atgtcatgtt gtcaywattt 11820 ttmaragaag ttaaataacc tggtagcaca tggtaacatg gtaacagtga aatcaaattt 11880 tgttatcagt tattttatca tcattttatt aatttcatct ttagttaata atttgaatgg 11940 attcgctaat gttccacctt gagtcacaac acttgccctg gctaattgtg aagaacctgg 12000 gataagtact ccatgaaatg atttcattgg tatcattaat ttagaaatag ttggcttaat 12060 taattttgtc atttatgata aaaataaaat tacataattt aaattttttc tttttttatt 12120 ttctcatcct gttttggttt cagcaacttt tcaagttgtt gtcttgtcag taactttaat 12180 tttaactttg gatcaaatat tgggtttcca tctttatctc ttccaataac accatatggt 12240 agaatcttca tttatactaa aataaaaatt taaattatgc cgttcgctct tttggagctc 12300 aggactttca gtccgttcta aagtttgttt cacaaacatt gagcgttcat tgaactctat 12360 accatgcgta aactgtcatg taaggtggca tgtattcttt actttgtccc gttgtttgtg 12420 tatatcctga aacgaaatgt gtatggtttc cttctccctg agtgtcactc cagtaacaat 12480 ttatagcaaa cttaacgtcg tctttaacat catatccttt accttttgtc attcctgacc 12540 aatcccaata tctatgctta tgccaacctg cttgagatgt acttaaatct atgtagtgac 12600 tatgtgcagg taaattttca cctgaaatcg ttttacttcc acctgtttcc tttgaagagt 12660 ttgcacaata caaaaacctg tcgactattt gttcccatgc accaacacca aacattactt 12720 gaggacctct tgcatccata gatgtataaa tagaacctac tgggttcatt aaatcaaaca 12780 tatgagatat tataggattg tctcttttga acttaaaata tagtgcaact atttcaggtt 12840 tcttctggtt tgtgtcagaa tattttttat aatataatct tatgttttca gggtaggtaa 12900 aatcttgact ataatattta tttctgtcat ctgtataata atatactatt ctcgtttttt 12960 ctattgcatt atatctttta tatctataat aatctaaata taatgttagc ttttgttcat 13020 ctacatgtgg tgttctgtcg tcatctccat agtcataata atatggtgct ggagctgggt 13080 caggcaatgt catatctgcc atttcataaa ctcttaaata tttcttttgt tcgtctccat 13140 ataagtttaa gtaacagttg ttgtctaatg gaactttaac ttgtgtgcca tcattctttt 13200 tatagcttac atacatttca ttgttatttt tcggaatgag tttacaccgt gtaccatttt 13260 ttaaggactg tacaaactgc catctaaaat atttattttc ttcattgaca gaaaagtcac 13320 caacaatagt ttcaatattt tttgttgttc ctatcaaact ttggtatatt tcgttaacat 13380 attccttatc tgctttcttc tctaactcat crttaacata agttttatct gctttaaggt 13440 ctaaacatcc tgaaacccat cctgtatcag ctttagtttt aaaaatcttc gatgtccatt 13500 catgatacca ttcaaccctt tctttaattt cattatcttt ttcatcaaca taaaccaaat 13560 ttgccttctt cattaactca ctagccacat attccttatc tgctttcttc attatgtcat 13620 ttaggaaacc ttttatgaat tcctcattta aattaaatct tccaatcttc aaaactttag 13680 gaaatgcatt taatgcgtaa tatctttgtt cttctggtgg aattgatgtt atatcatccg 13740 ttattccttt aatttttgaa ttcataaata aaatttcttt tatatttgat atcactgcac 13800 cacctttgca aacgaattcg aaatggtcat ttttcaatac tatctccaaa tctccatatt 13860 tcaacctttg cgtttgttct gccgtctttg aattatatgc atatatggcg tatgtaatgg 13920 tctgaaggac ctgggtgctt tgcgccttaa gttgttgtgg ttgctgttgt ttaatcattt 13980 attatgattt aaaaataaat atttaatccg ttatctcatc agctattttc tctatttgaa 14040 attgccataa tatttcattc ataaattcta aatcacaatc aaataggtca gttattctaa 14100 atttaatatt tgataaatct tttgaatgaa gaataaatga attcccggat aattgaaacg 14160 gatatgataa agaaaatgtg tttaagttga tggataaaat gttttgtatt gatgggtaag 14220 agttagggtc taatgtttga tgtgaaaatc tcattggggt attaacttta tcacatttga 14280 ttaaaatgaa attggaacca gtgtcagaat acaaaacatc cccaacatct ttttcaaaat 14340 ttaatatccc caacaatgaa cgaagatgag gagtacaaaa tttaattttc agcggaaatt 14400 taataaggtt atttatttca tcaagttcaa aaagtttatc aggttttgat ttaaaatttt 14460 cttcatacca tacgataaat gattcattta aataacttaa aatcatgtta atatcccatt 14520 tcgctatcac attatttata tacatatcat acgttttttc atggtcatct ttaataaatg 14580 atatataatc tgaagggtct aaaagaatgg gattaaattt accactcaca gaaacgagag 14640 taacattata aattccgcta actgttaaat ttaattcaca attattattc gaaataatat 14700 tgataattga cattttttat tacgattaaa aatatggtaa agcgttattt tccagtgata 14760 aataaaacta gaataagatt acctcaagaa ttcataaatt caacgaataa gaaattcatt 14820 catgttataa attttagtgt tttcagaaaa tatgagtcga tgtcatattc atcaacaggt 14880 tatatttcac ttcattcaaa tttagttcaa tcaaacccat ataatgacca tttcataaga 14940 ttttcagatg gtattaaaaa tgtaattgat agattaaaat atgaacagat tgacaatatc 15000 gaatttttgg aattttggtt taaatatcct gatggaaaaa agattgaagt taaagataaa 15060 gctgatacta taacaaaaac aattacaatg accgatccag atagtgggga acaaaaagaa 15120 ataaaacaaa cgatagaaac tggtgaaagg tcaattatta ttgattatga ttcaaaggat 15180 tattattcga taatgttaat gcttgaatat taaatagcct taacaaagaa tttttcaagt 15240 gatattttat attcatcaac tattttatta taaatatcat tatatgttaa atgaagttca 15300 gttgttgttc cgttctttga tatcttaatt gtgctaatat cctttatagg atttattttt 15360 gcaactgttc cgcagcatga aacctcatca tctatgaaaa attgaatctt aacgtctgaa 15420 tcttgaacat cacaaagaat ggcgataatg tgaattaatg attcataagg taaaataatt 15480 aaaccggatt ggtcaattaa atttattatt gaatcataat tattaattat tatatcacgg 15540 taaattttca ttaattcaga attaatgtct attttatatg gcgcattgcg actggaatcg 15600 tagattcctg ggttctcgat gtttgatgaa agaggtgggg ggtatgttgg ataagtggga 15660 atttcaacag gttctttacc aggttcagat aaaatgattg ggtctccatc ttctttattt 15720 tgctcctcct cattcttcgg taagtcacga gttaataaat ttggttgagt tgctgacctt 15780 gttaattcag ggctaacatt aactttaaca ttttgaataa ttcgtgattt atcatcattt 15840 acgttataac caaccgtaat ctctgctttt gcgttagtgt tcttttgact agtgaaacgt 15900 cttctcattt atattcgaat aaaaaaaaaa aaatatattt aaacttttta ttttgctgga 15960 acttttgaat ctattatccc ctgtatttgt aactgcacca atcatgaagt tgatttttag 16020 ctgtttcatc gaatcattta caatagaccg aatatattca ttaattgcat tcttataatc 16080 atctgatttc tcaaattttc ttttatctga ttccttaact aaatcataag ttataacctc 16140 gttgctgata taatggtttc ttaattaaat ctttgaatgc ttgagataaa gcacctgatg 16200 caattcttgc tactataatt ggttggaatt gytgtttaag ttcatcttta agttcatttg 16260 cgaaaactgg atcgaataaa gcaactttaa agaatttctt agctgaaatt tcaggtctaa 16320 tatttgcata ttgaccatca aactggttag aatatcgtgg taatccatcc ggtccctctt 16380 caatagcatt taattcttca ttgaaataag aatatggatt atgacggtat gaattaattg 16440 gaccattttc accttctcca atgtattgtt taacagcttt tccattttca tcaaattgca 16500 taggataacg ttcattatat gctttacgtt taggataatc agatatagtt aatttataac 16560 catttatgac tttaatatta ccttgaccat cagtaacgaa aacttcattt ataccatcag 16620 gttttccatt tggacccgta atgtctgctt cgtgtgcttc ccaatttgga ccatgacgtt 16680 tggcatatgc ttgagcttct ggtaatgtta aaacgtttct catttgttta acatcttttt 16740 gaagaccagc aacaatatga actaattctt tcattccatc aggaactttc atagttctat 16800 tatatgcacc ctttttgttt gtcccttgtt ttaatggaac ctgtagaaga tttgctattt 16860 ctgtcatgtt taattataaa taaaaaataa aaattaatta taaaaagtta atcaaattta 16920 acacttcctt tatttaaatc aactattatt tttccatgtt cacctaattg ctgatattgt 16980 gtgtaaagtt cattaatgct cattggtaaa ttcacttgat aaagcataac gttcaactgt 17040 tgacgactaa acccactgaa aatgaaaagt gttgatagat ttgatttaat gtttgtgctt 17100 aatgctttga aatattgaac agccataaag aatgaacact gaatgtgtcg acattgagtt 17160 aataactgat taaaatatga tttctcattt gttattagtt ttgatttcac tgcatcatct 17220 agaaaaatga gtgaatgcaa aaattctcta tcaaaatttg aaatagaaag tgtttcaaat 17280 aattcatccc gttgtttatc aataatttta tcctctaatc cttgcttctt gattgtatta 17340 tataattctt tatacattaa aaagtttttt aaataatctt ctgcttcttc ttgtgataaa 17400 taaataattg gaattttgat taaatgcttt aaagattcaa atgtatcatc tgaaggtgaa 17460 cctgaacggt tgatataaat taacacatgt gaatgggggg aagtgttagc aataatgata 17520 atttgtttta atatcgttaa tgttttacct ttattttgtt taccaattgc tgcagtgatt 17580 gaatttaagt aacattcaga atatggaatt ttcattaatt tttccttaga tagagcattt 17640 ctattatcac tcaatgcttc attaatcata tcatctacgt atgtattata tactcctgct 17700 aattcttcat aaggggtgga tgaatttaaa tccatttaat caaaaataaa tttataattt 17760 taaacatttc tattcatcaa tatttcatca tatgaacttc tatactgtgg tggtgtacct 17820 gttggatgat atttcaaacc ttcatccggt gtccatgtcc aatatgtagt cctagtaaac 17880 caaacctgag gaggatttgt cacctttgaa cagtaagtgt cattttcacc ctgaatgatt 17940 ggataagctt tcacttgtac tgtaacttgt gagttgaaag tttctaatcc atcaaagaat 18000 atacccgctc catcacgttg aactggaata gtgcataaga atgaagtttg gtctttccat 18060 gttctagcta atggcgtgcc atcatctgca ttcagtggtc tagttaatga gtcctcgtat 18120 tcttccgtgc attcattcac tccatctaaa tcagaagctt gaagcatacg tgtaaagaat 18180 atgtcgccca ctgtggtcat cgcttgatcg gggaagtttc tggaatctat agtgacctga 18240 agacctcgaa gcattggatt ttctaaaacc gttctttgca ttgcattctt cggaaatgtt 18300 aacacaacat ctgtaacatt atgaagagcg tatgagaaat ctgagctaat accttcagga 18360 gtgatgcaac ctccatttag tgaacgaacg tcaatatgtt gagctggaat catgaatgga 18420 tcttccttac tgaataaacg tggtattgct ttaactgatt cttcaattaa accgtatcct 18480 ggtgtaatgc atttatcctc taaaacttcc atctttgtgc atgaaataat aacgtcatca 18540 actgttgcgt cagcagatgt agcggctaag cttgtaatta atttggcagg aagtccgatt 18600 tgttggaatt tgcgatcata tttataaaca tgattcataa tcttttcata attttcatca 18660 gttatctttt cataaacgta attttgtaaa tcagcaactc ttaacgggtc aacctgacaa 18720 aatactaaag tatccctgaa aacataatat ttaagaatga tatcaccaaa tgatttagga 18780 taatcatcga acgcctgaaa tgctaacaaa tcatttaccg gaataatcat atcaaaagtg 18840 acatcaactt cttcatttgc tttcttgaaa acttctgaag ggtcaatata tttaccacat 18900 actgagttat catatttttg aacattatta tataacgaat gaacaaactt tttaactttc 18960 ctttctgatt ttggtttata tgttgcaaat gcgaaagatt cacgggcgca atccatctga 19020 agataaccgg catcacctct ttcggtttct acttctaatt gtttaaagct ttgatttgat 19080 gatttgtaac caacgaataa acgcatgcat ttaaaagcgg catcacatgt gaaagcttta 19140 tttgaaacta gagttgcttt aactttgcgt ttaatatatg tgtcacctaa ttggttcaca 19200 tcatgagccg gatttgtaat tcttaaatga gtttgtttat catcaaaata tggagtatca 19260 gagtcatagc atgaactttc aagttctcta taatatcttc taccgtcatt ctttgcgaat 19320 atttcattta tcttttcgaa tgaatgaaac acatttccat taatcggttt agtgtttggc 19380 accaaatagt tagtcgtcat ttattaataa ataaaaaatc aaaattaaaa gatgaaaatg 19440 aatttttaat tttaatgaaa taatcaacta tggctgcagg tatatttaag aaactgtggg 19500 ataaagtaaa aagaggagcg agtaaagttt gggaaacatt gaagaaggga gcaaattgga 19560 tatttaagaa taaagaagca attagtgacg ttgctgaatc aataacacca aaagctgccc 19620 caataagaga agttttaaat aaaggagaaa aagtttatca aagactacaa ccgatactaa 19680 gataaaaata atatttaaat aatttatttt tccactatct acctcctatt ttattaagta 19740 ttccttcagc aattccacca attgcgttac ctgcttgtgc tcccatcgcc ggattacctg 19800 ctttagtacc taatactcca cctatgatag ttcctgccgt tttcgctact ggtgccaaaa 19860 agtttgcaac gggtttaata acctttttat aaacccaacc cgcagctttc ttaattccat 19920 cccataattt tttaaagaat cctgccgcca tttctgattt aatattaatt aaaaaatata 19980 tttcatcact tatattctaa tttttattta atgataaatg aaaccgattt taaaaataaa 20040 tgaaacagag gaggaagaaa taaatgaaag tgaaaatgtt caacaaaaaa taaacgttgt 20100 taaaccaatt actgaaataa aaattttttc atctgttgaa gaatttaatg actattatgc 20160 taataataaa aaattatttg aagaattgac aacatgtaaa ttgaataaaa tgtttaaagt 20220 tgatggattc aaaatttcaa aaataaaagg ggtggttagt ttgaaatcta ttcctcaatc 20280 aagaatcacc tcattaatga aaattgatga tttaacaaaa agaatagaag agattgaaga 20340 aagaatgaat caaataattg atttcataaa ttcaaataat cattagtttc taatttcttt 20400 tactcgttta tgaggtcttt gattatgact tcattgattc cattcaggat ttaaacaatg 20460 tttaattaat cgcggaatat ctgacatcat ctatatatat atagcttctt ttatttttka 20520 ttacattatg tatgggctga gtgtaaatgt cttacaattt tgaatatttt taaatcttga 20580 tttatttata aacgtaagaa taaaaaccgt aagaatcata cattttttac ttcatatgtt 20640 aaaaaaaatg atacaaaatc gttcaatatt gtatgaattc ttacgtttac aaaaaaagtc 20700 acacaacccg ttatatatac tact 20724 // ID Gypsy-45_CQ-LTR repbase; DNA; INV; 158 BP. XX AC AAWU01015815; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-45_CQ_; KW Gypsy-45_CQ-I; Gypsy-45_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-158 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 470-470 (2011). XX DR GenBank; AAWU01015815; Positions 31064 31221. XX SQ Sequence 158 BP; 47 A; 32 C; 42 G; 37 T; 0 other; tgtggtaccc tgtgcgggtt caggagtgcg ctacgtgcag gtgcaccctg tggacgagct 60 gtcagtcgag gagttaagga aaatcaaata acacgtgtgc tactaactac tgtatactaa 120 agttcgaata ataaacgatt gcaaccgaga ttgcaaca 158 // ID Gypsy-596_AA-I repbase; DNA; INV; 4594 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-596_AA_; KW Gypsy-596_AA-LTR; Ty3_gypsy_Ele69; Gypsy-596_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4594 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3509-3994] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1466..4582 FT /product="Gypsy-596_AA-I_1p" FT /translation="MGLILELTCSITLNGMSKQGRCFVSKADDLNLFGAEW FT IELFGLWDVPFNAVCNQVSMKLHPNAVGLVDSLKAKYSNVFNESLGLCTKK FT QVSLTVKPGNRPVYRQKRPVPYASTEKIEDELNRLQALGIISPVTYSEWAA FT PIVAVRKPNGKVRICADYSTGLNERLESNHHPLPLPQDIFAKLAGKKIFSQ FT IDLSDAYLQVEVSEESRKLLTINTHKGLYQFNRLSPGVKTAPGEFQQIVDS FT MIADLEDTSGYLDDIIVASDSLEEHLEQLDRLLARIEEYGFHLKIEKSNFF FT MQQIKYLGLIVDEEGIRPDPEKIKAIVNMPAPHDVQTLRSFLGAVNYYGKF FT VRTIHDLRQPLDALLKKDVKWNWSPACQKSFEQFKNVLQSDLLLTHYNPNL FT EMIVAADASQHGIGAVLLHRFPDGSVKAVCHASRTLTDAERNYAQGEKEGL FT ALIFACTKFHRMIFGRRFTLQTDHKPLLGIFGSKKGIPVHTANRLQRWALT FT LLQYDFKIEFKKTESFGYADVLSRLIGEHSKPDEDYVIASLQIEEDIRNIQ FT SEVLSALPVTNQMIVQETRKDKTLQTILKQVRENWPSVKLPDAVSSFYKRR FT ESFCEADGCLMFMDRIVVPQSLQSPVLKQLHVGHPGMQRMKSLARSYVYWP FT NIDADIEDFVRKCGSCASASKAPVKTTLSSWPIPSTPWSRLHVDYAGPVRG FT KFFLVIVDAHSKWPEMFATNNSTATTTIKKLKECVSRFGCPLTIVTDNGTQ FT FSSEAFAKFCQDFGIDHVKTPPFHPQSNGQAERFVDTLKRALLKIGGEDVE FT DTLQIFLQTYRCTPNPSLPDNKSPAEVLLGRKPRSVLDLMKPSVPEPSHIN FT EVQNMKFNKKHGAKHRSFEAGDAVFAEIHIRNERYWAQGTVIEKKGSVVYN FT ILLEDKRRRGLIRSHVNQLRRRTPDGEHQPTAEQPLPLEILLEEFNCPGQL FT NQVENIENLPILVDNPVPDPGSPRAGTFPEDVMQNPMEDAVPGPSQINQTT FT TDPVPSRKRSLPFRGPSGRKRWLPSHFEYYELF" XX SQ Sequence 4594 BP; 1339 A; 1088 C; 1082 G; 1075 T; 10 other; tattttggcg acgagaattt ttacgaagtc acgcacagtt acggcagcga aatgagtccc 60 gagttcgagg caaatctctt gcggattctg gagaaccaag gacgaatcct tcaagaactt 120 tccgcttcaa gagccgcagc aagtcaacaa gttggaagtg gtgacggtaa tctgcagccc 180 aaccagcagc atccggagga tccacgccga caaaaccaga ctgagttcct gatcgagtct 240 ttgtcaagca gcatcagtga attccattac gacgctgaag cgggagttac gtttgaagcc 300 tggttcgcaa aatacgagga tctgtttgaa gaagacgccc ggaaattgga tggaccggcc 360 aaggtcagac tcttgctccg gagcatcaac gctgcagcgc acaagaagta tgtgaactac 420 atcctaccga agaaaccgaa ggacgtctcg ttcgaagaca ccatcaacac ttgaagtcaa 480 ttttcggtcg gcgcacatcg ctgttcaacc tgcggtacca gtgtctacag ctcaagaagg 540 gtcaggccga cgattncttc acttacgctg gaatcgtcaa cgagaagtgc gaagaattca 600 agctgccsga gatcamkgcc gaccagttca agtgtctgmt sttcgtttsc ggcctsaatt 660 cawgcaagga ckcggatgtc cgaacttccc tgctgtccaa gatcgagaac gctaatccag 720 cgacgccgat gacgctacat tcgttggccg aggattgtca gcggctcctg aatctcaaaa 780 aggatacggc aatgatcgag aagtcaggta acaaatccac agtttgcgcc atcaagcaac 840 caacgaaagt caagacaact ggtcaatatc agcagcagca gcagttctcc caaagacaaa 900 ttaaagacct ccatgtccct tgtagttgcc caaaatttta tggctcatag ggtaatctag 960 aatgccgtga aaagtgtact gcgttctgtc aaacttgggt accttttgca ctaccgaggg 1020 accaaatgca gtattagaag tggtacccaa tctaagcccg agtgcgacgg acccggttct 1080 ccgatgtccc gaacacccct tgctggcgtt gcggagaaat gcactactcc aagaactgca 1140 acttcattca acatcagtgc aagtcatgca agaaagtcgg acataaagag ggatactgtg 1200 cctgcttcca ggcgaaaccg aagaagaaga atcgaggttc gaagaatcaa tcagcaaaag 1260 cccacggaat ttattccata aatcaggtca gtattgcagc taatcggaaa ttcactacgg 1320 tcgaactcaa cggaagccca gtcagaatgc agctaaggat agtgcagcgg acatcagcgt 1380 aatttcccat gaagtctacc agcaattggg ctgcccgacc ggaaaacaac cgtctatcaa 1440 cgtggtgaat gcttccggtg acgatatggg tctcattctg gagttaacgt gctcaatcac 1500 attgaatggc atgtccaagc agggaagatg cttcgtatcg aaggcagacg atctcaatct 1560 atttggagct gagtggatcg aactttttgg actatgggat gtaccgttca acgcagtgtg 1620 caaccaagta tccatgaagc ttcatccaaa cgctgttggt ctggtcgata gtttgaaagc 1680 gaagtacagc aacgtgttca acgaaagtct cggactgtgc acgaagaaac aagtatcgct 1740 gacggtcaag ccaggaaaca gaccagtcta tcgtcaaaaa cgcccagttc catatgcatc 1800 cactgagaag attgaagatg aactgaatcg acttcaagca ctgggaatca tatctcctgt 1860 gacatactcc gagtgggccg cccctattgt agcagttcgg aaaccaaacg gcaaagtgag 1920 gatctgtgca gattattcga ccggattgaa cgagagattg gagtcgaatc accatccgtt 1980 gccgttgcct caagacatat tcgcaaaatt ggctggaaag aagatatttt cgcaaatcga 2040 tctttcggat gcttatttgc aagtcgaagt ctccgaagaa agcagaaaat tactgaccat 2100 caacacgcac aaagggctat atcaattcaa tcgactttct ccaggagtga aaacagcacc 2160 aggcgaattt caacagatcg tggacagcat gatcgccgac ctggaagaca ccagcggtta 2220 tttggacgac atcattgtag ccagtgattc cttagaagaa catcttgagc aactcgaccg 2280 gctgttggcg cgcattgaag aatatgggtt ccacctgaaa atcgaaaaaa gcaatttctt 2340 catgcagcag atcaagtacc tcgggttgat agtagatgaa gaagggattc gtccggatcc 2400 agagaaaatc aaggccatcg tcaacatgcc agccccacac gatgtacaga cgctacgttc 2460 gttccttgga gcagttaact attatggtaa gtttgtacga accatacacg atctccgtca 2520 accgttggat gccttgctga aaaaggacgt gaagtggaac tggagtccag cctgccagaa 2580 gtcttttgag caattcaaga acgttttgca atcggacctg ctgttgaccc attacaatcc 2640 gaatttggag atgatagtag cagcggatgc atcacagcat ggcataggag cagtcctact 2700 tcaccgtttc ccggatggtt ctgtgaaagc tgtttgtcac gcttctcgca cgttgacgga 2760 tgctgaacgg aattacgctc aaggagaaaa agaaggacta gctctgattt ttgcctgtac 2820 caagttccac cgtatgatat ttggaagacg tttcaccctg cagaccgatc acaaaccact 2880 ccttggtata tttggttcga agaaagggat tcctgttcat acagcaaaca gattgcagcg 2940 gtgggcgctc acactgctac aatacgattt caagatcgaa ttcaagaaga ctgaaagttt 3000 tggatatgcc gatgtccttt ctcgtttgat tggagaacat tccaagcccg atgaagatta 3060 cgtgatcgca agtctacaaa tcgaggaaga tatccgtaac attcaatccg aagttttatc 3120 agctcttccg gtgacaaatc aaatgatcgt gcaagaaaca agaaaagaca aaactctcca 3180 gacgattctc aaacaagtac gtgaaaactg gccttctgtg aaacttccgg atgcagtatc 3240 atcgttctac aaaagacgtg aatcattctg tgaagcagac ggatgcctga tgttcatgga 3300 ccgaattgtc gttccgcagt cattgcagtc acctgtcctc aagcagttac acgttggtca 3360 tcctggtatg cagagaatga aatcccttgc taggagttac gtgtactggc cgaacattga 3420 tgcagatatt gaagactttg ttcgaaaatg cggaagttgc gcttcggcat caaaagctcc 3480 tgtgaaaaca acgctttcgt cgtggccgat tccatccacg ccatggtctc gtcttcacgt 3540 ggattatgct ggaccagtac ggggaaaatt tttccttgtc atcgtggacg cgcattcaaa 3600 gtggcctgag atgtttgcga ccaacaattc caccgcaaca acgacgatta agaagcttaa 3660 ggagtgtgtg tctcgattcg gatgtccact aacaattgtg acggacaatg gcacacaatt 3720 cagttcggaa gcctttgcaa agttttgtca agatttcgga atcgatcatg tgaagactcc 3780 tccctttcat ccacagtcta atggtcaagc cgaaaggttt gtggatacgt tgaagagagc 3840 tcttttgaag ataggcggag aagatgttga agacactttg cagattttcc ttcaaaccta 3900 tcgttgcacc ccgaatccgt ctttgccgga caacaaatcg ccagctgaag ttttactggg 3960 aaggaaaccc cgctccgttc tagatttgat gaagccatcg gttccagagc catctcacat 4020 caatgaagta cagaacatga aattcaataa gaagcacggc gctaaacacc ggtctttcga 4080 agctggagac gcagtattcg ctgaaattca tattcgaaac gaacgatatt gggcgcaagg 4140 aaccgttatc gaaaagaagg gtagcgtggt gtacaacatt ctcctcgaag acaaaagacg 4200 tcgaggttta atcagatccc atgtgaacca gttgcgaaga cgcacaccgg acggtgaaca 4260 tcaacctaca gcagaacagc ctttgccttt ggaaatcttg ctagaagagt tcaactgtcc 4320 aggtcaactt aatcaagtcg aaaacatcga aaatctccct attcttgtgg acaaccccgt 4380 tcctgatccc ggatccccaa gagctggtac gtttcctgaa gacgtgatgc agaatccaat 4440 ggaagatgct gtacctggac catcgcagat aaatcaaacc accacagatc cagttccgtc 4500 cagaaaacgt tcactaccct tccgaggacc ttcaggcagg aaacgttggc tcccttctca 4560 ttttgaatat tatgagcttt tctgaggagg gaga 4594 // ID Copia-6_SI-I repbase; DNA; INV; 4105 BP. XX AC AEAQ01011805; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_SI_; KW Copia-6_SI-LTR; Copia-6_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4105 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01011805; Positions 4632 528. XX CC Positions [1574-2074] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 125..2434 FT /product="Copia-6_SI-I_2p" FT /translation="MEFHGIEKLRGAENWNTWKFTVRNLLRGTEDAYEVCT FT GEIEKPKSLEATASAEQRVMYQTNLKIWDKADRAASQIIVKTLETKVTTLL FT VMCECARDMWLKLHAVFEQQTKQAAHTVQSEFFGFNKNQSDDMVSHIAKFE FT NLVLRMQQLNVRPDESSVMVKLLDSLPDDYESLRQAWWARPEEQQTLTNLI FT EILTSDDKRRQQRMDKQEEMVALVAKVQGKNECDKMQRDGKKRQSKKPVES FT TVVKGKNKFTQYKCHNCGGIGHFRKNCPSKKQKQKDDDEAFVCEILNTELD FT DSWIVDSGATDHITHRGEWFSTFEYFKTAEIIHIGNKSTMDALGKGTIKFE FT ALVSGKWLSCRMENVLYVPDARRNLFSVTSALDRGMAFKSSKNGCEFTKNG FT IVKAQGVRVGQLFKMEIRIKPPKTSCVKEVNLSSRDSLRIWHERLGHQNMY FT HVKKILRKHDIEFKDDGQFCGACVEGKQHRNTFKERQQRANEPGEIIHADL FT CGPMECTSLGGAKYFVCFTCDYSRLRIVYFLKEKSETAGKIAEVLQIIKTN FT FGRPMKTLLCDGGTEFKNSKVQELLSSNGVTLVVSNPHTPQQNGCAERTNR FT TVVDIARTMLLAKNLPKHLWAEAVNSAVYILNRTGPSSLDGKTPYELFTGK FT SVHLNKFHTFGTGCFVHVPKVQRKKWDAKGQHGILVGYSDNIDGFRIWLKS FT RSKIIRSRDVVFEPETTEDLLALFLLIQTLSRRLTIRRNVLKTLTLTHPTH FT RVIHQMIQRRIQRWIRRKI" FT CDS 2509..4059 FT /product="Copia-6_SI-I_1p" FT /translation="MLAEMDEPRNYTEATKSEECHHWKAAMEDELASLEEN FT SAWSLVELPTGCRSISNRWVYRIKRDAEGKVSRYKARLVARGFSQREGIDY FT NDTFSPVARFDTIRAILSIAANEQLEIAQFDVKTAFLNGIVKEDIFMDQPQ FT GYEDGTNRVCKLHKSLYGLKQSPRCWNKRFKNVSTNFGLQESSADPCLFYR FT IIGNEKLIVVLYVDDGLVVATKQTDIQEFLIRLKSEFKITVETVGCFLNVL FT INRFEDGSISISQKTYAESILRRFNMEEARFVSTPMEKCHQAEEVIDTKIT FT SAPYREAVGCLMYLAICTRPDITYAVNYVSQFLEKPEERHWTMVKRIMRYI FT KGSLTLGVYYDAHAKGGKLKIYSDADYASDSNTRRSVSGIACNYSGGVILW FT ASRRQQSVSLSTTEAEYVAASEAAKDAVWLNQLYKEISPLETAPVLYVDNV FT SAIKLTKNPSFHKRSKHIDVRFHYVRERVQEGQLKIEYVPSKEQAADILTK FT VIPRIQFERLRKMLGMTNIDR" XX SQ Sequence 4105 BP; 1335 A; 702 C; 964 G; 1104 T; 0 other; atatggtagc agagcgtggt tcgtgaatgg attgaagact tcgcgtgtaa taagtaagga 60 aagaagttaa acgtgaacgt actctctcga atttgatact tgctggaaat tttacacgag 120 caacatggaa ttccatggaa ttgagaagct gcgtggtgcc gaaaattgga acacttggaa 180 attcacagtg agaaatctgc tgcgaggaac tgaagatgcg tatgaggtat gcaccggtga 240 gattgaaaaa ccgaaatcgc tggaagctac cgctagtgca gaacagcgag tgatgtatca 300 aactaaccta aaaatctggg ataaggcgga tcgtgcagcg agccagatta ttgtcaagac 360 tctggaaacc aaggtaacga cattgttggt tatgtgtgag tgtgcaagag atatgtggct 420 caagctacat gcagttttcg aacagcagac gaagcaagct gctcacacgg ttcagtcaga 480 atttttcggt tttaataaga atcaatccga tgatatggta tcgcatattg caaaatttga 540 aaatttggtt ttacgtatgc agcagttgaa tgtgaggcca gatgaatcat ctgtgatggt 600 taaactatta gattctttgc cagatgatta tgagagtctc cgacaggcct ggtgggcaag 660 accagaggaa caacagacgc ttacaaattt gatagaaata ctgacatccg atgacaaacg 720 acgacaacaa cgaatggata agcaggaaga gatggtcgct ctggtagcca aggttcaagg 780 gaaaaatgaa tgtgacaaga tgcagcgcga cggaaagaaa cgtcagtcga agaaacctgt 840 tgaatctaca gttgtaaaag gtaaaaataa atttacacag tataaatgcc ataactgtgg 900 tggaataggc cactttcgta agaactgtcc aagcaagaag cagaaacaga aggatgacga 960 cgaggcattt gtatgtgaaa tattgaatac tgaacttgac gatagctgga ttgttgattc 1020 tggtgcgacg gatcatataa cacaccgtgg tgagtggttt tcgacttttg aatattttaa 1080 aactgctgaa attattcaca ttggtaacaa gtcaacaatg gatgcacttg gcaaaggaac 1140 tatcaagttt gaagctctgg ttagtggaaa atggttatct tgtcgtatgg aaaatgtttt 1200 gtatgtacca gatgctcgta gaaatctgtt ttcagtgacg tcagcacttg atagaggtat 1260 ggcattcaag tcatctaaga atggatgtga atttacaaaa aatggcattg ttaaagcgca 1320 aggcgttcga gttggtcagc tgtttaaaat ggaaattcgt atcaagccac caaaaacatc 1380 atgtgtcaag gaagtaaatt tatcgtcgag ggattcattg cgcatttggc atgaacgatt 1440 aggccatcag aacatgtatc atgtgaagaa gattttaagg aagcatgaca ttgaatttaa 1500 ggatgatggt cagttttgtg gagcctgcgt cgaggggaag cagcatcgta acacttttaa 1560 ggaacgtcaa caacgagcaa acgaaccagg agagatcatt catgcagatc tctgtggtcc 1620 gatggaatgt acatctctgg gaggagcaaa atattttgta tgctttacat gcgactattc 1680 tagattacgc atagtttatt tccttaaaga gaaatcagaa actgctggta aaattgcaga 1740 agtattacaa attatcaaaa ctaattttgg acgtcccatg aaaactctgc tatgtgatgg 1800 cggtacagaa tttaaaaaca gtaaagtaca agaacttctg tcatcaaatg gtgtaactct 1860 tgttgtttct aatccgcata cacctcagca aaatggctgt gctgagcgta cgaatagaac 1920 agtagtggat attgctcgaa caatgctcct ggcgaagaac ttacctaaac atttatgggc 1980 ggaagcagta aattcagccg tttatattct taatcgaact ggacctagta gcctcgatgg 2040 aaaaacgcca tatgaattgt ttaccggaaa aagcgtgcat ctcaacaaat ttcacacttt 2100 tggtacagga tgttttgtgc atgtgccaaa ggttcaacgt aagaaatggg atgcaaaagg 2160 acaacacggt atacttgttg gttactcaga taacattgat ggtttccgca tatggctaaa 2220 gtctaggagc aagattattc ggagcagaga tgtagtgttt gagccagaaa caacggagga 2280 cttattagct ttatttctgt tgatacaaac attgagccgg aggctaacga tcagaagaaa 2340 tgtattgaag acgttaacat tgactcatcc gactcatcga gtaattcatc aaatgattca 2400 gcggaggatt cagcgttgga tccgaaggaa aatttaatac cggaaactcc aaaacgagat 2460 ctgcgagata ggaataagtt gaatccacca aaacgattga ttgaaatcat gctggctgaa 2520 atggacgaac ctcggaatta cactgaagcg actaaatcag aagaatgtca tcactggaaa 2580 gcagcgatgg aagatgaatt agcatcgttg gaggaaaatt cggcttggtc tttggtcgag 2640 ttaccaactg gttgcagatc tatttcaaat cgctgggttt atcgcattaa acgtgacgct 2700 gaaggaaaag tcagccgtta taaagctcga ttagttgctc gaggttttag tcagcgcgaa 2760 ggaattgact ataatgatac attcagccct gttgctagat ttgatacaat tcgagcaata 2820 ttaagtattg ctgcgaatga gcaacttgaa attgcacagt ttgatgttaa aactgccttt 2880 ctaaatggca tagttaaaga agatatcttt atggatcaac cacaaggcta cgaggatggc 2940 accaatcgtg tctgtaaact ccataaaagt ttgtatggtt tgaaacagtc tcctagatgc 3000 tggaacaaac gcttcaaaaa tgtgtcaact aactttggat tgcaggaaag ctcagcagat 3060 ccatgtctat tttatcggat aatcggtaat gagaagttaa tagtcgtatt atatgttgac 3120 gacggtcttg ttgttgctac aaaacaaaca gatattcaag agtttttaat cagacttaaa 3180 agtgaattta aaataaccgt tgaaacagtc ggttgttttt tgaatgtgct catcaatcgt 3240 tttgaagatg gttcgatttc tatttcccaa aagacctatg cagaaagtat tctgcgaaga 3300 tttaatatgg aagaagctcg ttttgtatca actccaatgg aaaaatgtca tcaagcagag 3360 gaagttattg acacgaagat cacgagtgct ccctatcgtg aagcagtggg gtgcttgatg 3420 tatttagcca tctgtacacg accggatatc acttacgcag tgaattacgt ctcacagttt 3480 ttggaaaaac cagaagaaag acattggaca atggtcaagc ggatcatgag gtatatcaaa 3540 ggttcgctaa cgttaggcgt ctactatgat gcacatgcca agggtggcaa attaaaaatc 3600 tacagcgatg cagactatgc gagtgattca aatactcgac gttcagtcag tggaatagct 3660 tgtaactaca gtggtggagt cattctctgg gcaagcagac gtcaacagag cgtttcctta 3720 tcaacgactg aagccgagta tgttgccgct tcagaagcgg ccaaggatgc tgtatggtta 3780 aatcaactat acaaggaaat atctccttta gagactgctc cagtactcta cgtggataat 3840 gttagcgcca tcaagctaac aaagaatcct agtttccaca agcgcagcaa gcacatagat 3900 gtgcgctttc actatgtacg cgaacgagta caagaaggac aactgaagat tgaatatgtt 3960 ccaagcaaag agcaagcagc ggacatcctg acgaaggtca ttccgcgtat acagtttgaa 4020 agactacgta aaatgttggg aatgactaat attgatcgtt aaattgtttt ctttagttct 4080 tttgattttg aacatttagg ggaag 4105 // ID LOA-2_CQ repbase; DNA; INV; 4909 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4909 RA Kojima K.K. and Jurka J.; RT "LOA non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 149-149 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 488..1144 FT /product="LOA-2_CQ_1p" FT /translation="METDQNQNPSASCFPQLEPTRKEGRRSMKSAGREDTT FT RREESAENCRVQFAVSERMRDMHRKVHMWKEVGELSSTKREPSVVLRSNSA FT RDGKVTQQSLTKTRQPHAYFRFFANIAPVQLNILKIHATTSPLEHPRYDEH FT GILPLVNTASQKTLRYRYTIQNRPHVAFQYFMYYTRGTRILNTHQRPSCRT FT LPSCSRIIKTSWRKRQVQLNNEQYPLHPG" FT CDS 1119..4805 FT /product="LOA-2_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MNNIRCIQVNLHHAKGASSILSRRFTKEQLGIALVQE FT PWVNHNKILGLSCQSSRLIYSNTQATPRTAILLSGNIKCTPITEFVQRDIV FT AAMVTVPTTKGKQEMVVASAYFPGDQDDVPPPEVAALVRYCRAVNKPFLIG FT CDANAHHTIWGSTDVNDRGERLLEYLTSNNVNVCNKGNEPTFVTVARQEVL FT DLTLCSAAFADKIKNWHVSEEASLSDHRQIVFDIEASQLKRETFRDPNDTD FT WNAFRGHLCRSKQDAPSRIRTPEELDTAANTLQRRITAAYQASCPKKVREI FT CRDVPWWSESLSSLRKEARRLFNRAKLTSEWDAYKAALTKYNAELRRAKKT FT SLANFCEDISSMSEATRLQKALSKDHSNGLGQVRDEQGTLTVTNKETLQVL FT MSTHFPESTEREEDAASTRTADGVWCRPSRESVHLARRMFNQSSIRWALGT FT FEPLKAAGPDGILPIFLQKAADTIMAELINLLRASFTLGYIPRSWRKVKVI FT FIPKAGRSDPTMPKAFRPISLTVTLLKLMEKITDNHIRAEFLKDFPLHKHQ FT YAYQAGKSTETALHALVSRIENALKYKESALCAFLDIQGAFDNTSYTAINE FT ALRSRNVDGTTASWIHAMLTSREISASLGDTSITITAAKGCPQGGVLSPLL FT WLLVVDSLLRKLTLLGYEVIGYADDVVLIIRGKYDGTLSDRLQSALNCTMS FT WCEQEGLTINPNKTVIIPFTNRRKHDLRPPTLKGTQLTFSSEVKYLGVILD FT KKLNWNAHLDYAVKKATTAIWACNKMLGKTWGLKPKLAFWSYTTIARPRVT FT YASTVWWPKTEQKTCQSKLTKLQRLACLSVTSAMKTTPTAAMEAMLCILPL FT HLHVKQEAALGALRLQRCNNWVEGDGTGHSRIVKAFDISPLATSVSDCMEV FT RPNMDIPYEVIETNRQMWSNGGPTLPEGTIRFFTDGSKMGTSTGAGVFGPR FT TRETISMGKWPTVFQAEVYAIHICARTCIMRNYRHAKIGIFSDSQAALLAL FT KSSKCESKLVWECVASLRELASRHNRVMLFWVPGHCGIEGNEMADELARQG FT SSNIFVGPEPFLGISKSAVKQEITNWGQLQIASIWGEMRGLRQAKTFITPS FT PSIARKLLGLNRRELRILAGLLTGHCPARYHLHKIGRWPNNLCRFCLTELE FT SSAHLLCFCGALVSRRLRFFGSHLLTPYDVWHHTHPKKVIQFIDCIAPNWD FT KPCLQNNPSSSDPMDTSNL" XX SQ Sequence 4909 BP; 1423 A; 1222 C; 1166 G; 1096 T; 2 other; ccgcgtggga gtcgctcaat ctgcattcct gtcacacatg aggtctgcac agctgattcc 60 acgtagattt ttcccgccag taatgacagg aaacgctgac aaacggaata tgtgccaaaa 120 tcataagtgg tttttttaac atataaaggg aaatcatcgt gaactggaac tgccggtcgt 180 tagcccgggt aaagtggaaa cgttgagatg attgttgtcc tctgctctca tctcgcaaaa 240 agccatacag agaaccacgt cccattcccc atccccaatc cctttcccaa atcccaatcc 300 ccaatctaca gtgtcaagcg acccgtgcca agggatgtat gtctggaggg gagcaagaag 360 twtccaggtc tkaacggagc ctgcgtggta ccaagacgcc ccacgcagta tgtagtctcc 420 tctgtgtctt tgcaggcccg agtcacagtt gaagtttgta ggagtcctga aaacaaggtt 480 tatcaacatg gaaactgatc aaaaccaaaa cccaagtgcc agctgcttcc ctcagctgga 540 accaacgcgg aaggaaggaa gaagaagcat gaaatccgcc ggaagagagg acaccactag 600 aagagaagag tcggcggaaa actgtagagt gcagtttgcg gtgtcggaac gcatgagaga 660 catgcacagg aaggttcaca tgtggaagga agtaggagag ttatccagta ccaaacgaga 720 accttctgtt gttcttcgga gcaattctgc gagagatggt aaggttacgc aacagtcttt 780 gacgaaaact aggcaaccac atgcatattt tcgctttttt gcgaatatag caccggtaca 840 gctgaacatt ctgaaaattc acgcaacaac ctcaccgctt gagcaccccc gctacgatga 900 gcacggtatt ttaccactcg tcaacacagc atctcagaaa actctcaggt acagatatac 960 aatccaaaac cgacctcatg tggccttcca gtacttcatg tactatacaa gaggaacccg 1020 catcctcaac acacatcagc gaccaagctg caggacactg ccatcctgtt cccggatcat 1080 caaaacatct tggagaaaac ggcaggttca actgaacaat gaacaatatc cgttgcatcc 1140 aggttaacct ccatcatgct aagggtgctt ccagtatcct tagtcggagg ttcactaaag 1200 agcaattggg gattgctctc gttcaagaac cgtgggtgaa ccacaacaaa atccttggac 1260 tgtcctgtca aagcagtaga ttgatctaca gcaatacgca ggcaacacca agaacggcga 1320 ttctgctgag cggaaacatc aagtgtactc caatcacgga attcgtccaa cgagacatcg 1380 ttgcagcaat ggtgaccgtc ccgaccacca agggaaagca ggagatggtt gttgcatctg 1440 cttactttcc tggtgaccag gacgatgttc caccgccaga agtggccgca cttgtgcgat 1500 actgtagggc tgtcaataaa ccgttcctca tcggttgtga cgcgaacgcc caccatacga 1560 tatggggaag cacggatgtt aacgatagag gtgagcgcct tcttgaatac ttgacgtcta 1620 acaacgtcaa tgtatgcaat aagggcaatg aacctacttt tgttactgtc gcaagacagg 1680 aggtcttgga cctcactctg tgtagtgctg cttttgcaga caaaataaaa aactggcatg 1740 tctctgaaga agctagtcta tctgatcaca gacagattgt ctttgacatt gaagccagtc 1800 aactgaaaag ggaaacgttt cgtgacccaa atgacacgga ctggaacgcc tttcgaggtc 1860 acctttgccg gtccaaacaa gacgctcctt cacgaatacg aacccccgaa gaattggata 1920 ctgcagcgaa tactcttcaa cgcaggatca cggcagccta ccaggcaagc tgccctaaga 1980 aggtgaggga aatctgtcgg gacgtaccgt ggtggagcga aagcctgagc agtctccgca 2040 aggaggctag aaggcttttc aacagagcta agctcacttc cgaatgggat gcctacaaag 2100 cagccctaac gaagtataac gcggagttga ggagagccaa aaaaactagc ctggctaatt 2160 tctgtgagga cataagctcg atgtctgaag caacccgact ccaaaaggcg ttgtcaaaag 2220 accattccaa cggtctagga caggtgagag acgagcaagg cactctcact gtcacaaaca 2280 aggaaacact gcaagtactc atgagtacgc actttccaga atcaactgaa agagaggagg 2340 atgcagctag tacccggact gcagatggcg tatggtgtcg accatcaaga gagtcagtcc 2400 acctagcccg tcgaatgttc aaccagtctt caatcagatg ggccctcggc acatttgaac 2460 cactgaaggc tgcaggaccc gacggcatcc ttccaatctt tctccaaaaa gcagccgaca 2520 ccataatggc tgagttgatc aacttacttc gagccagctt caccctgggc tacatccctc 2580 gaagctggcg caaagttaag gtgatcttca tacctaaggc cggcagaagt gaccccacga 2640 tgccaaaagc cttcagaccc ataagtctga cagtgacgct gctcaagctt atggagaaaa 2700 tcacggataa ccacatccgg gcggagttct tgaaggactt ccctttgcat aaacaccaat 2760 atgcatacca agcgggcaaa tcgaccgaaa ctgccctaca cgcgttagtg tcacgtatcg 2820 agaacgcact gaaatataaa gaatctgcac tttgtgcttt tcttgatatt cagggtgcct 2880 tcgataacac ttcgtacaca gccatcaatg aagcgctccg gtcgaggaac gtcgatggaa 2940 caactgcgag ctggattcat gctatgctta cgagcagaga gatttcagca tcgctaggag 3000 acacatctat cacaataaca gcagccaaag gctgtccgca agggggagta ctctctcctt 3060 tgctttggct gctggtggta gatagtcttc taagaaaact tacactactc ggctacgaag 3120 tcattggata tgctgacgac gtagtcttaa tcatccgggg gaagtacgac ggaacgttat 3180 cggaccgtct acagtcagct ctaaactgca ccatgtcgtg gtgtgagcag gaagggctga 3240 caataaaccc caacaaaacc gtgataatcc cgtttactaa caggaggaaa cacgacctta 3300 ggccgcctac cttgaaagga acccaactga ccttcagttc tgaggtcaaa tatttaggag 3360 taatccttga caaaaagctg aactggaatg cacatctgga ctatgcggta aagaaggcaa 3420 ctactgctat ctgggcgtgc aacaaaatgc tcggcaaaac ctggggcctt aaaccgaaac 3480 ttgcattctg gtcttacacc accatcgctc gacctagggt aacctatgcg tcgaccgttt 3540 ggtggccgaa gactgaacag aagacatgtc agtcgaagct gaccaagctc caacgactgg 3600 catgtctctc tgtaacgagt gcaatgaaaa caacccccac cgcggccatg gaagccatgc 3660 tatgcattct gcctttacat ttacatgtga agcaggaggc agcgcttggc gctctacgac 3720 tacaacggtg taacaactgg gtggaaggag atggaacggg tcactcgcgg atcgtgaagg 3780 cttttgacat ttcccccctt gcaacttcag tctcggactg catggaggtg aggcccaaca 3840 tggacattcc atatgaggtg attgaaacaa atcgccaaat gtggagtaat ggaggaccta 3900 cacttccaga aggaactatt cgtttcttta cggatggttc aaaaatgggc acttctactg 3960 gagctggagt cttcggtcct cgaaccaggg aaaccatatc tatgggaaag tggcctaccg 4020 tttttcaagc agaagtgtat gccatacaca tctgtgcgag gacgtgtatt atgagaaatt 4080 atagacacgc aaaaatcggt attttctcgg atagccaagc tgcactattg gcattaaaat 4140 cctctaaatg cgaatccaaa cttgtttggg agtgcgtcgc ttccctcagg gaacttgctt 4200 ctcggcataa cagagtcatg ttgttttggg tcccagggca ctgcggcatt gaaggcaacg 4260 aaatggctga tgaacttgct agacaaggct catcaaacat cttcgtgggt cccgagccgt 4320 tccttggaat ctcaaaaagt gccgtgaagc aagaaataac taattgggga caattgcaaa 4380 tcgcttcaat ctggggcgag atgcgaggat tgagacaagc taaaactttt atcactccaa 4440 gtccttcaat tgccaggaaa cttcttggct taaaccgtag ggaactacga atactcgcag 4500 ggcttctaac aggacattgt cctgccagat atcacctgca caaaatcggc aggtggccta 4560 acaacctctg tcgtttttgt ttaactgagc tggaaagctc ggcacattta ctctgcttct 4620 gcggagcact tgtgtctcga aggctgcggt tctttggatc gcaccttctt acgccgtacg 4680 atgtatggca ccacacccat cccaagaagg tcatacagtt catcgactgt attgcgccga 4740 attgggataa accatgtctg caaaataacc cctcgtcttc cgatccaatg gatacgagta 4800 atttgtagta tggacagtaa ccagggtaca tcacaaaaga tagccttctc aggtggtcgc 4860 agtgaactta acccaatgcc cattgggcaa caaaaaaaaa aaaaaaaaa 4909 // ID EnSpm-7_HM repbase; DNA; INV; 7439 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-7439 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 378-378 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 800..2833 FT /product="EnSpm-7_HM_1p" FT /translation="MLKINKRKNCGYMKRYMKNYRARSNLLAEMLNTDTLV FT ANKKLLNIKDHLDSDNTSENEEFNSPDTYKDVDSNSMQNDTASLHLYDEEC FT ILEPDDVSPVSSSEDEIKITFKETTFEFLTKLAQWAIESRLPREKINELLK FT ILDNCGRFNKGDIPKDARALLHTPRTVNTVQTSGGHFAYFGLEKSLIQKIL FT SLPFIFKDRTVLKLNINIDGVPIHRSSNLQFWPILCAFHGINTTPFVIAIF FT SGYRKPLNINEYLEQFIDEINFLQKNSMVVDGKIYEIKLNAFICDAPARAF FT VKCVSGHTSKQGCERCTCEGQSVERRIIFSNAGESRTDSLFRLGQYPKHYL FT SKSPALNIDHFDVIKGFPLESMHLLFLGVCRRYLMFLKTGPRNVRLSHAQL FT NTISLRLTELSQYTPSDFMRRPRSLFEVDRWKATEFRQLVLYTGILIFKGV FT LNDQHYDLFKSFFIAVRILHIDNDEYRNKLLGFARCLFQSFVYNAKILCGE FT TFLTYNVHNLLHIVDDVEYFRCSLSYLSSFPFENFLQCLKRTVRDAKTNPL FT TSXTKRFFEASQIQLSTYETMLNMKVSTIRRDQFFISKNEKYCEVIGISGS FT SDNQQLLCLVINPSQLKSFFDVPLDSRQWGILCCDSIRKLKRKTVTLNKAD FT VYKKLYAMKYGDDGLLFVPVLHKNECKQ*" XX SQ Sequence 7439 BP; 2768 A; 858 C; 1007 G; 2797 T; 9 other; cccagccggc acaatgcgtg tcaaataaaa ccattagttc cttacacgtg ttaaaaactt 60 tataaataac gcatgcttaa ctttataaat gaacaggttc taaactcgtg ttaataattc 120 cgtcatttag agtgatcaac aggaattttt ataaataata taaattgcta tagtaattta 180 agaaagaaga cagtaaactt aattaataat aaatatgata gatatctcat agtagcgcct 240 actaaatggc aaattttatg tgtccatagg atgagttcat ttcttataaa cttagaaact 300 tacgaaagtt cttaagttta agttcaaaca ttattaaata ttgttcaaat ataaattatt 360 taacttagct gaatattata tatttattta atattattta ataaatttat aattcaaatt 420 gattatattt tgtatttgta agctgttata aaaatataat aatcctattc tcacaagaag 480 ttgagttata tatttagtta tgtatatata tatatatata tatatatata tatatatata 540 tatatatata tatatatata tatatatata tatatatata tatatatata taaaaagagt 600 tattgtttat gtttatatat atgaacatat acataactaa atatataact taagttattg 660 tcagaatagg ataaatatgt ttttataaca gcttacaaat gcaaaacaat aaattagaat 720 tataaattta taatgtaaat tataatataa atttatactg taattgatat taatcacaga 780 tatcaattac aatttaagaa tgctcaaaat caacaagcgt aaaaattgtg gatatatgaa 840 acgttacatg aaaaattatc gggctaggtc aaatttatta gcagaaatgt taaacactga 900 tacattggtt gctaataaaa agcttttaaa cattaaagat catttggata gtgacaatac 960 cagtgagaat gaagaattca atagtcctga tacatataaa gatgtagatt caaatagtat 1020 gcaaaatgat actgcttctc tgcatctgta tgatgaagaa tgtattttag aaccagatga 1080 tgttagccct gtatcttcaa gtgaagatga aattaaaatt acttttaaag aaacaacatt 1140 tgaattttta acaaagcttg cacaatgggc tattgaatct agattgccac gcgaaaaaat 1200 taatgaactt ctgaagatct tggataattg tgggaggttc aacaaaggtg acatcccaaa 1260 agatgctcgt gccttattac acactcctcg aactgttaac accgtgcaaa cgtctggtgg 1320 tcattttgct tattttggtt tagaaaaaag tttaattcaa aagattcttt ctctaccttt 1380 tattttcaaa gatcgaacag ttttaaaatt aaatattaat attgatgggg ttcctattca 1440 tcgttcaagt aatctccaat tttggcctat tttatgtgct ttccatggta ttaatacaac 1500 tccatttgta atagctatat ttagtggtta tagaaaacca ttaaacatta atgaatattt 1560 agagcagttt attgatgaaa tcaacttttt gcagaaaaat tctatggtgg ttgatggcaa 1620 gatatatgaa ataaagctta atgcatttat ttgtgatgct cctgccagag cttttgttaa 1680 atgtgtatca ggccatacct caaagcaagg atgtgaaaga tgcacttgtg aaggccaatc 1740 tgtagaaaga agaataattt tttcgaatgc aggcgaaagt agaacagatt ctttatttcg 1800 gttaggacaa tatcctaagc attacttaag taaaagtcct gctcttaata ttgatcattt 1860 tgatgttatt aaaggttttc cattagaatc aatgcattta ttgtttcttg gggtttgcag 1920 gagatattta atgtttctta aaacagggcc acgaaatgtt agactcagcc atgctcagct 1980 aaatactata tcattacgct taacagaatt atcacaatat acaccatcag attttatgag 2040 acgaccacgc agtttatttg aagtggatcg ttggaaagct acagagtttc ggcagttggt 2100 tttatacact ggtattttga tatttaaagg tgtacttaac gatcagcatt acgatttatt 2160 taaatcattt tttatagctg tacggatttt acacattgat aatgatgaat atagaaacaa 2220 gttgttgggg tttgcacgtt gcctatttca aagttttgtc tacaatgcta aaatactttg 2280 tggtgaaacc ttccttacat acaatgttca caatttactt cacattgtcg atgatgtaga 2340 atattttagg tgttcactta gttatttatc ttcttttcct tttgaaaact ttttacaatg 2400 tttgaaacga actgttcgag atgcgaaaac taatccctta acatcatrca ctaaaagatt 2460 ttttgaagct tcacaaattc agctatctac ttatgaaaca atgttaaata tgaaagtgtc 2520 aacaattaga agagaccaat tctttattag taagaatgaa aaatattgtg aagttattgg 2580 aatttcagga tcaagtgata atcaacaatt actttgttta gttatcaatc caagccagtt 2640 aaaaagtttt tttgatgtac ctttagatag tagacagtgg ggtatacttt gttgtgacag 2700 tattagaaaa ttgaagcgta aaactgtaac tttaaataaa gctgacgttt ataaaaaact 2760 gtatgcaatg aagtatggcg atgacggttt gctctttgtt ccagttcttc acaaaaatga 2820 atgcaaacaa tagttcaatt gtatttagtt tttagttaat tcaaaaaaaa aatttcattg 2880 ttacaagtat aatgctattt ttaaaacaat taacagttgt ttgcacatat aattaagaaa 2940 tgattttttt gttgttgttg atattatttt catttatttt aactattgac tataatttac 3000 aaaatgttct gtattattta tttttgattg acaaaaaaat taagatttta gaaacttatt 3060 acttgcgctg ttggtaataa tcattgcttc tgaagtcatt gcttcacaga caatttatgg 3120 tttgattawt ttgctttttg tgtgttttac atttgtattt gatgtgtttg ttatatatat 3180 atatatatat atatatatat atatatatat atatatatat atatatatat atayatatat 3240 atatatatat atataatata aatatatgta taatgtataa taycatatta tatataataa 3300 tatatatata tatatatata ttatataata atatatatat atatatatta tatatatata 3360 tatattatat atatatatat acatatatat atatatatta tgtataaaag tatattatat 3420 atatatatat atatatatat atatatatat atatatatat atgtatataa tatataatat 3480 ggatatttat tattaataat aatggaactt atatattaat aaataatatt attaaatata 3540 tatttattat tatatatctc tctctctata tatatatata tatatatata tatatatata 3600 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 3660 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 3720 tatatatata tatatatatg tatgtatata cagtcttgga aaaaagttta agaccacttg 3780 cgaaaaactg aaaatttaat gaaaaaacta ttacagtaca aaaaataatt gggaaattat 3840 tttttgatta tataataata attgggaaac tgtgttatag ttgccaaayw arctatggtt 3900 gccaaataaa aacttataca cgctcaattt tattttaggc accaaagtat agcattttta 3960 tgtagtggtt ttagactata tatatatata tatatatata tatatatata tatatatata 4020 tatatatata tatatatata tatatatata tatatatatg ttgcacttat tctcacttga 4080 tttctaggat aaatattgta ctgcggtatg gaaagaaggg ttaaaaacca tacaagaaaa 4140 cattcctgtt tgttggatag atatgataga gaagtgtgta cgttatcctt ctgagccgga 4200 tagacctgtc attcaaatga ttacagattg cacagagcct gacatacaat ggttttcatt 4260 tccacttcaa aaagtgaaac gttttagtgg tactaatgtt tttgatttta ttgaaaagta 4320 tatattttct attaaaatgt ttaagttatt tttcataaaa attttatttt atacttctaa 4380 agtcatgtac aaatacatta tacataacaa agagcaacac aaataattta tttaattatt 4440 ttgcttacag attctttagt ggaacttgaa aatttatata atgtaactac tgaggaagac 4500 actgggtcag caaatgtaga gctaataaat acaaataaaa acaatgatct tcaatgtttt 4560 tgtaagtttg catataatat tatttctaaa aaaatttatt tttttaatta tttatatatt 4620 ttactatttc aggaaaatgt acgttaaaat ttttatttag ttaaaaagtt tttttttaaa 4680 aaaaagataa ggaaaatatt atttgtggtg gtgattagtg gcattaaagc tataggtttt 4740 gtaaacaatt tctaatatct agacaaagca agtgtagcta acgaaaagtt gacatcaatg 4800 cagcattctt tatctactaa gaaaggtaaa agaaaatttt gtatttattt atttattttt 4860 tttttgcttt cttgtttgtt tgtttgtttt ttttttcctg gtattttatt atgaattttc 4920 attaggtata aaacataaat aacaccatcg tgctaaataa ttttaagtat aagttagtag 4980 gaatgctcct gttatttatg ttttatacct tattataatt tctaattttt ttatgtttta 5040 tagctgatta taatttctaa atacaacaaa tgatgttgca gtacactatt atatctattt 5100 cattgcttat acttattttt tataaattgc ttgtaggaaa agttcttaaa gcatgttatg 5160 atacaataac aggtaaattg atccattcct tgttgtggtt aatagtagtt gttataatgt 5220 aacggttaaa ttcttgattc cattatgact tttaaaagtc ttaaaattca tcaaagtggt 5280 tgttgcaaga gttttacgta atttatgggt tgttaagttt tagtgtttcc atagttaaaa 5340 gtaaaagttc attatcatta tttcaagtta tcaaacttat tatacctttc agaaacactg 5400 gtggattatc ctgtgattca aaattttttg gaaaaaccaa ataatggtta gtattcactt 5460 tcacattaga aacaaatcat gactatgatt tgtttgtaaa taattctgag aatatagttt 5520 acaatgattt aaatacttat aacagtattt acttataaya aaggtgtatt tttaattaaa 5580 gagcattttc ataatccttt aacagacatc tccggcctta tattatttta gctttaagag 5640 cagatattct cttatataag ttttctaagg ttttttaagt ttatcttccc aaagctacca 5700 gaacagggtg taaaacagga ataaaatttt atgtttactt tgattgggaa taaaaattcc 5760 tccacatcaa aaaaaaatcc ataaaaaacc tcacattaat agtttttttc aactattttg 5820 tatttattta ttgagcactt atttgaaaat atttttttaa ggtaaaaaac gagaaggatg 5880 gttggtgaaa tcaaattcac ctaaaaaaaa aaaaaagcgg aacataaatg aagaagctgt 5940 tgacgattca tctggtgagt tttcacacaa aggggatata cattttattt taaataatga 6000 tacatcactt aaaaataaag aagacaaaat aaataattat ctccctgtca agagttttaa 6060 aggtggacga ccaaaaaaaa ttcaagttga ccaatttcct atagacatgg gaagctttca 6120 gcaaaaagtt ttgttttgtt taagcgacct taaagaaaaa atgcaactaa tcgaaaaaca 6180 acaatcaaag attctttcat ttctatcaaa gagtgacaat gaagatgtag atgatttcga 6240 acaaacgact accaccgaag aacttttggc actcgaaaar aaaatagttg acaataaaaa 6300 aattaaacta atgcttcaga ggaaggtgga agttataggt ttaacatgta agtccatgag 6360 cgagcatgtt cgttgttctc taaatgcaat tatgtcatgc aagttgcaaa taaatataaa 6420 tgttatgtat ggtaataatg taccgttaaa gtcatatacg gaccatttgt ctttggtcca 6480 tcatcttcca actttgtatt cattattact cagatcaata tctaatgttt ggaattcctc 6540 agaaaaagaa accaaacgag ttatatcaga gtggcttaaa caagcaggaa aacgttttaa 6600 tagactacca ccatgttagg ctgtcaaact ttttatcaaa tgatgatttg agttatttta 6660 ataataaaag tacttattat ttagttttta ttgtatttac tttattttaa tgagcaaact 6720 tattaaaaat aacaattatt aacatacgtt gatatataaa aatgcatata aaaatttata 6780 tctgacacgt tattaaacgt gtcagatata aattttaatt atttaagata taaaattaat 6840 ttaagtttaa tatgagcata attaaaactt aagttttaat tatgctcata ttatgcgttt 6900 aagattatac ttaattttat atcttaaata attaaaacta tagttttact tataagttat 6960 aattacaact tataagtaaa actaaagttt tacatgtaaa aaaaacttta gggtgagttt 7020 ccactcatcc gttttttcat aggaacggga aaaggaagga aaaaataatc acgtgggcat 7080 gcgtattata agttttaact tgtccgaagt aaaactatgc atgctcacgt gattattttt 7140 cccgttccta tctcgttccc gtagaaaaac gaatgagtgg gaaacccacc tttaaaatca 7200 cgttccaaac actaaataga attaggtttc taactcgtga gaaaagttta caaaatcact 7260 gtgtccgtca cgtaattaaa acttggtttt acacgtgaaa aaaacttttt aaaacacgtt 7320 ccaaacacca attataaaaa ggtttctaac tcgtgagaaa agtttacaaa ttcactgagt 7380 caaccacgtg ttgttcactg gtttttaaca cgtaaatgaa acttaatgtg ccggctggg 7439 // ID Gypsy-255_AA-LTR repbase; DNA; INV; 226 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-255_AA_; KW Gypsy-255_AA-I; Gypsy-255_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-226 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1112-1112 (2011). XX DR [1] (Consensus) XX SQ Sequence 226 BP; 76 A; 40 C; 41 G; 69 T; 0 other; tgtggagtcc atgaaatgac cagatagaag tataaccctt ttatctattt atgatacaaa 60 cacatgagaa tgaagtggtg tgctatgaac acatatttgt tatatacaaa ctctgaataa 120 ataccgctct cgacgttcag tcgagttagt tgattacttg catccaaagt gaaaggctgt 180 tttactgtta aaccgtaaga aaactctaat aaggtctctg acctca 226 // ID Chapaev-3A_HM repbase; DNA; INV; 6151 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-3_HM; KW Chapaev-3A_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6151 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 35-35 (2008). XX DR [1] (Consensus) XX CC Chapaev-3A_HM is a very young subfamily of Chapaev-3_HM DNA CC transposons that was active in the hydra genome less than a few CC million years ago (copies are ~1% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of 10 copies; it codes for a 938-aa Chapaev CC transposase (11 exons). Chapaev-3A_HM is characterized by 4-bp CC target site duplications, 10-bp terminal inverted repeats, and CC 22-bp subterminal inverted repeats (separated by 17- and 7-bp CC regions from the 5' and 3' TIRs, respectively). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(451..1362,1513..1748,1902..2114,2304..2400, FT 2461..2543,2654..2730,2837..3042,4075..4404, FT 4537..4692,4848..5043,5219..5526) FT /product="Chapaev-3A_HMp" FT /note="Transposase." FT /translation="MATATSANHILCLIKLCRLCGNYIGTDSFNVINIIAR FT VDQAFFTEVGEDRTEIHPPKICMKCYTLMRHIEQRGTTSFNFILTSWPHTC FT SLESCICFLKKSGRKKKKPFGRPPSVNKDVWTRKSINEILDSFSEMPRLCY FT ESISIVNNPHKDLCICKICKRIFHRPVLLNNCQHLFCASCIFPNLVGKLET FT EAKCKLCGSNISLGSISKATTMQNILENIVLKCSENSCKSISNKEEHEQVW FT KSGNILPQCSASSSSSLTVTDIYKINTNSDIPKELEYAFAHFAKLKMAKNN FT SHSFELPSGGPRKTIYRRQKQLHQSLIDNAGPTKKAKIQQVALMLNSFNKN FT DKIDILKKSNLSQVEIGPEDIVSLKADCGLPWEKMKKMIRFFQTKNVKLPS FT LSNQRKVAKYWSGNDLIVEDKELLFELKNKKGTFELKDTPTAYINDLPSHL FT IKVLDQLERYNQLTFEKITPSEIHIKIGGDHGGGSFKMSYQAKDYRANLKI FT SLSRHIDEIRQLQAMKWREKCLRVFIFGDYVFLCAIYGITGATGRHCCLFC FT EITSSEMQLNIEARTRPILLRTLATLKSDYERFKNDGGNIKRAKNFNNVID FT EPLFDIPIEQIAIPALHISLGIFLKFFKMLEVECHLLDIKLAGFLAINDKH FT LESSEFDKYVEKQAEIHQLEYDVDDINTKILLIQDTIVVEVFRSPKNTEYL FT QLMYSERIALLKSKRIEKEGKIAECSKFKLIEGMGPILKEIESVLQSCGVQ FT RQAYHSCSFVGNHVHKMLKVDNIKKLCDSISQTVFNNIADREHPIYMEAVD FT IQQKFKALFHKYSKCDNEMNSCHLFNEENIKSFETAVCELMKFFRSTWPNE FT SITPKMHLLENHIGVFLSTWGVGLGLYGEQGGESIHAEFNNIGRIYSTMSG FT TRKLECIMRDHFIRNNLIASSLKPPIVPRRKKII" XX SQ Sequence 6151 BP; 2282 A; 777 C; 891 G; 2173 T; 28 other; cacgatcgtt caatattgat ccccggcgga catttatagt gcgcatgcgt ttacatggac 60 tttagtaaaa tctagaattt ccatattttt tgcaggcgac gagaatagga tcacatttta 120 aaactattat attttatact tattaatgtt gttttatatc attcaaagat tttataaaca 180 tttaaaggac caaagattat atttaaataa aagaaatcct gcttcaatcg gtttttggca 240 atctttgact ggctttaaaa atctatatta aaaaaaaaaa aaatgttttt caaacttgtg 300 taactttttt aaatcaattt tttgtttcaa gaactaaatt gtttattaaa ctaaaattta 360 aatattttca gaaataacta aatttattta ccagccattt ttttaagttt tagaaatttt 420 aaaataaaaa aattcaccga actacctagt atggcgactg ctacttctgc taaccacatt 480 ttgtgtctta tcaaactatg cagactatgt ggaaattata ttggaactga ctcatttaat 540 gtaataaata taatagcaag agttgaccaa gcatttttta ccgaggtagg agaagacaga 600 acggagatcc acccaccaaa aatttgcatg aaatgttata cattaatgag acatattgaa 660 cagcgaggaa cgacctcttt taatttcatt ttaactagct ggccgcatac gtgctcactt 720 gaaagctgta tatgtttttt aaagaaatct ggcagaaaga aaaagaaacc atttggaaga 780 ccaccatcag ttaataaaga tgtttggaca aggaaatcaa ttaatgaaat attagacagt 840 ttttctgaaa tgcctcgctt atgctatgag tctatcagta tagttaataa cccccacaaa 900 gatttatgca tctgcaagat ttgtaaaaga atatttcatc gacctgtttt acttaataac 960 tgccaacatt tattttgtgc ttcatgtatt tttccaaatt tagttggaaa attagagaca 1020 gaagcaaaat gcaagttatg tggctctaat atatctttag gtagcatttc aaaagcaaca 1080 actatgcaaa atatccttga aaatatagtt ctaaagtgtt ccgaaaattc atgcaaaagt 1140 ataagcaata aggaagaaca tgaacaagta tggaaaagtg gaaacatatt accccaatgt 1200 tcagcatcgt cttcctcttc attgacagta acagatattt acaaaattaa tacaaatagt 1260 gatattccta aagaactaga atatgcattt gctcactttg caaagctaaa aatggcaaaa 1320 aacaattcac attcatttga acttccttct ggcggtccaa gggtaagaaa ttaaattttt 1380 ttaacaaaaa tattattagt taaatttatt ataaataatg tatttgcata aagagtaatt 1440 tttttcacta attaggctac acagttttca gtgactacaa agagctttaa aacatcaact 1500 gagattagtc agaaaaccat ttatagacgt caaaagcaac tccaccaatc gctaatagat 1560 aatgcaggac caacaaaaaa ggcaaaaata caacaagtag cacttatgct caattctttt 1620 aataaaaacg ataaaattga tattctaaaa aaatcaaatc tctcccaagt tgagatagga 1680 ccagaagata ttgtatcact taaagcagac tgtggtttac cttgggaaaa gatgaaaaaa 1740 atgataaggt atataacatt attttaataa aaataaatta tgcataaaac aataatcaag 1800 ttatttcaaa ttacaaattg catttatatt tatcagcatt ttgttgtctt tatttttttt 1860 ttcatttaat ttttcaaatt aaaaaaactt ttatatttca gatttttcca gacaaaaaat 1920 gtcaaattac cctcactttc taatcaacga aaagttgcaa aatattggtc aggtaacgat 1980 ttgattgttg aagataaaga actgctgttt gaattaaaaa acaaaaaagg aacttttgaa 2040 ttgaaagata cacctacagc atatattaat gatttgccat cacatttaat caaggtactg 2100 gatcagttag aaaggtaagt tcaaaatata agagactaat gtaatgttta ttaattcttg 2160 ttaaatatgc aagaaaattg aaaattgaat gttttaaaca aaacttaaga agtgtaaatc 2220 ataaattgtg caaatattat atctctgcat aaattgtata tatttttttg ttataaatat 2280 attttctttc tttattatca tagatataac caactaacat ttgaaaagat aactcctagt 2340 gaaatacata taaagattgg tggtgaccac ggaggtggtt catttaaaat gagctatcag 2400 gtagctaatg ttgctacacc caattcaaaa aataatacga ctgttttcaa tatatttgag 2460 gccaaagatt atagagcaaa cctcaaaatt tcactctcaa gacacataga tgaaattaga 2520 cagcttcaag caatgaaatg gaggtgaata aaaaaatatt tcaaaagttt atttggataa 2580 gaaatgtttg taaaataaaa aacatagata ttttaataaa cttttatttg attttaatac 2640 attctcattt tagggaaaaa tgtttgcgtg tatttatatt tggtgattat gtatttttgt 2700 gtgctattta tggcatcaca ggagcaactg gttagcttat agttttgtat ttcttttagc 2760 ttatattctt attttaatta ctaaaaatgt acacaaagag cattaaaagt aatattatga 2820 atatttaatt taacaggtcg tcattgctgc ttattctgtg aaatcacatc aagtgaaatg 2880 caattgaaca ttgaagctag aacgcgaccg attcttttga gaacgcttgc aactttaaaa 2940 tcagattatg aacgattcaa aaatgatgga ggcaatatca aacgtgcaaa gaatttcaat 3000 aatgtaattg atgagccgtt atttgatata ccaattgaac aggtgtgtgt taattgttgc 3060 caaagcatta atgccagtaa tataaatata aatgcaggtc tgtggttagg ctaccccaaa 3120 ccaggggcag caaattgtag aggccccttt ccaaaattta agtagcttaa tttaagtagt 3180 attaaatttg atttattatt tttttagaaa tacattaaat aattgaatgt tttgaagatt 3240 ttattgttat taagattaag attgaggctg tgttttttta gggggggggg gggtcctcta 3300 tataatagta tatacttata tataggcact atatataagt atatatatat atatatatat 3360 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3420 atatatatat atatatatac ccatatagta gacgttttaa tgtaaaaaaa gtctcttacg 3480 actacaattt ttttcktctc ttaagctgtt caaagcaatt aatttacatt ttgtttttta 3540 ttttttaaaa raycgcaatt ataacgactg aaagagaggt gtttttgtta ttgatttaat 3600 aaaaaacatt ttaccgtawa tgttctacat cgactatctt atcctgcgca acagagctgt 3660 acatcytgaa aatagcctac tygcaagtgc tctacataat agtayatact gacaaatagg 3720 cactayatcg actgayawat akcctkacty gcaaagraat gcwgctacat cgactgacaa 3780 atagagakat acatcgactg agrktttggt ttrgccagry atmtatcaat taaaaaaaaa 3840 atttaatttt ctaaaacttt ttctrtwtat taatctttaa aactacatat ataaactttt 3900 ttwgatatgc aagagacatr yaaragtata tatatatata tatatatata tatatatata 3960 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatgta 4020 tatatatata cttatatata tagctattat gcaattgtta aaatcatttt atagatagca 4080 ataccagctc tacacatctc attgggaata tttctaaaat tcttcaaaat gttagaggta 4140 gaatgtcatc tccttgatat caaactagct ggatttctag caataaatga taaacacctt 4200 gaatcaagtg aatttgacaa atatgttgag aagcaggcag agatacatca attagaatat 4260 gacgttgatg atataaatac aaaaattctt cttattcaag atacaatagt cgttgaagta 4320 tttcgcagcc ctaaaaatac agagtacttg cagttaatgt actctgagag aattgcattg 4380 ttaaaatcga agagaataga aaaggtatca acattaaata gtactggaaa ttattttgag 4440 aggatagttt tttttaaggg catttttttt tttaatattt caatgactgt attatctgga 4500 ttttgtatac atctatattt atatttaatt ttgaaggaag gaaaaattgc ggaatgctca 4560 aaatttaagt taatagaagg aatggggcca atattaaagg aaattgaatc tgttttgcag 4620 tcatgtggag tacaacgtca agcttatcac agttgctcgt tcgttggaaa tcatgttcac 4680 aaaatgttaa aggtacatgt ttttgtatac aaatctatgt ttttaggcat taaaattttg 4740 ccatctctgc attaaataat ttagttatca ttccaaataa tagttctact ataataccta 4800 tatataatag tatatatata tgcgtatata tgtattttat ttttcaggtt gataacataa 4860 aaaagttatg tgactcaatt tctcaaacag tttttaataa tatagcagat agagaacatc 4920 caatttatat ggaggcggtt gacatacagc aaaaatttaa agcattattt cacaaatatt 4980 caaagtgcga caatgaaatg aattcatgtc acttattcaa cgaagaaaat ataaagagtt 5040 tcggtaattt ttcatgaaaa tttttattat agaagtagtt ttttatttgc agaagtagtc 5100 tttttttaag gagtattata atctttaata aatagtgatg aagatatttt gccatatata 5160 taaaatatat agtattatta ttattatttt ttttaatact tgttaatttg ttttgtagaa 5220 acagctgtat gcgaactaat gaagtttttt agaagcactt ggcctaatga gtcaataact 5280 ccaaaaatgc acttgttgga aaatcatatt ggagtttttt tgagcacatg gggagttggc 5340 cttgggctat atggggagca aggtggcgag agtattcatg ccgagtttaa taatattgga 5400 agaatatact caaccatgtc tggtacgaga aaactagaat gtatcatgag agatcatttt 5460 ataagaaata atttaattgc gagttcgtta aaaccgccaa tcgtccctcg aagaaaaaaa 5520 attatataat aaaatttata gtttatatat aataaagtat ttatgcaaat tattattatg 5580 caataattat gtaattatca aatttattgt ttaatttatt tatatattta ttatgtttat 5640 ataaggttct taaagatttc gaaataggtc ggccctatta ttttacgatt agtagaaaat 5700 tagcgaaata ttagccaact ttcaagcttc aaatcttttt gtaaaaaaaa tggctggtaa 5760 aaataatatg ctaaataata tagctcaacc tttattcttt caaatgatat ataacacttt 5820 tttataaaaa ttgatcttcg taagttttat taaagaataa gacagatagt ataatctgcg 5880 ataaaatctc ggttaacttg acaagtgaat ttaaggttca tgtgtggttt tttttgttgt 5940 gatttttttt ttttctactt actaaacttt tagttgataa aaataatttt tattaatgag 6000 ttttatttta ttcgtatcgt gtatacctta gatttaaaat acctccaaat gctccgtttt 6060 cgaggcaagt gaaataacag ctttagctgc gcatattaat ttataagtaa tgcgcctgcg 6120 cactataaat gtccattgcg aaacgaccgt g 6151 // ID Copia-9_DPu-LTR repbase; DNA; INV; 254 BP. XX AC scaffold_317; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_DPu_; KW Copia-9_DPu-LTR; Copia-9_DPu-I. XX NM Copia-9_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-254 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 682-682 (2010). XX DR Genome; scaffold_317; Positions 80871 81124. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 254 BP; 67 A; 53 C; 52 G; 82 T; 0 other; tgttggatat tgttagatgc ggagcaaact gaagacattc tcagactgat ctggcaactc 60 tgcctcagtc caacagatgt ctactttgtc cctcccctct ctaaaatggc gactctctct 120 cttagagact gcccgactac gtgtgtgtgg ttttgaataa ggactcaatt ggtggtatga 180 aatacagttt ctttacaagc agaatatata tattgtgtgc actcatttgc ttatctagag 240 ttaggtaatc aaca 254 // ID Pokey_Cis repbase; DNA; INV; 2528 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE piggyBac DNA transposon from Ciona savignyi. XX KW DNA transposon; Transposable Element; piggyBac; Pokey_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-2528 RA Smit A.F.; RT "Pokey_Cis - piggyBac DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000585 TTAA duplication. ORF from bp 371-2203 encodes a CC transposase 36% identical (52% similar) to the Daphnia pulicaria CC Pokey transposase. Copies <0.5% diverged from consensus. XX SQ Sequence 2528 BP; 793 A; 470 C; 517 G; 748 T; 0 other; ccctttcaag actaacggta catatatgta ccaccgacat gatgctccgg tctgagaggc 60 cattattgta ccataaatgc ctttattttg acgttttccg acttttcagg aactacgcat 120 gcttttttcg ctttgtgtca tgggatttag ctatcacgtg acctattttc gcgcctttag 180 cttcatttac catgtggtgt tgagacagat cacagtcttg caggtgtttt acaatgctgt 240 attgcagtta cattggctag cattagtttg gaaagacttg taagtgtttt agaagttaga 300 ttaccgttat attggctagc attagatcga attggaaagg ctgcatttac tatcagtata 360 actggtacac atgtgtacca acagacctaa catagttagc atagtggtaa ctatttgtac 420 caacaggcct ctacacagaa ttagttttgc agctcagtta tatatttttg aggttatctg 480 tagaattacc aatatgacaa gacgttatac ggcacaagaa attattgacg caataccata 540 tctatctgat gcagaagaat cagatttttc agattcttgt tcagagtacc tgccagaaca 600 accacgacgg gaagattctg attctgatga tagtgaattg tcttcagata gtgaacaaga 660 agattcagta agcgaagaca atgatgatgc agagagcaat agcaatagtg gtgattcact 720 ttgtaccggt actacgaaca cggtactatg gaacaatgtg ccacctgggt ttcagccaaa 780 attgaatatc agtcaagaca gaccttgcac aattcatcca gatatcaccg tcacaacaag 840 cattttagac atttttttga agctttttcc ttatagcttg ctgctgcaga tagcttacta 900 cactaacaag agattgagta tctttcaaaa tgctaaaacc aaaagaagaa aattagtccc 960 aactgatgca catgagatta tgaaactttt tggatgtttt ttcgttatga gttataatcg 1020 cttgccgtct ataaataact attggtctaa gcatcagtcg atgggaaacg ctcttatgaa 1080 atctgcattt tcgagggata gattcaaatt gattatgtca aaactgtatg ctgctgaacc 1140 ggataaacct tcacaagcaa gcaagttgta ttacgttgaa gatttgctct tatgcctgaa 1200 atcaacattt atgaagtaca ggcagaacag tccatttcaa agcatagacg aatctatgac 1260 gaaattcaag ggaaggtgct ctttcaagca gtaccttccc atgaagcctg tgaaacgtgg 1320 gataaagctt tggatgagat gtgacccaca cagtggatat acatatgatg tgaatattta 1380 tgctggtaaa gatgatcaag gcctaagcat tccattagga gagagggttg taaagacttt 1440 gctctctact ataccagacg gtgatgacgt taccgtggct tttgatagat ttttcacttc 1500 ggttcatctt atggacagtc tacgctatcc cgcagtggga acagtcatca aaagccgaaa 1560 aaatctacca gttttccaag acaaactcag ccgtggtgat tcagaattta gatgtaacat 1620 gaacggaact ttagcagtta gatggttgga tacgaaagaa gtgctattgc tgagcaactt 1680 ccacactaat acagtgggtg aagttaggaa gaagcaacgt gatggaacaa tcataaacgt 1740 ctcctgccct gatgctatcc ggtgttatag gcaaataatg ggtggtgtcg atagagcgga 1800 ccagatggct ggactttatg atcttgatag aaaatccaca aagtggtgga aaaaagttat 1860 ttatcgtatg ctggcatttt cagctgtcaa tgcatgggtg atttacaagg aattgcacag 1920 gcattataaa aagccatacc ttgatttcct tgtggaacta gcagaagagc taattcaacg 1980 aggagagagt gggtcaactg tgaagcgtcg cagtgctgca ggaagaccat ctaaacgtgc 2040 ttccatgatg caaaatcttg gaacacacct tccgtatgaa ggtaatacca gaagaagatg 2100 caccaattgc ggaaaagata aaaaacaaaa aagaacaaag ctaatgtgca gtgcttgcaa 2160 tttgccatat tgtattgatt gtttcaagcc atgccactcc taaatgagtg cacattcaag 2220 ccattccatt cctgaatgaa ggacaattca atgtaaatat tgtgaacttc aattttcatg 2280 ttgtgaactt cgctttacat tctgtaaata aatttctttg ctggcattat gtctattggt 2340 acatatatgt accaacattt ttagtttaaa aagccttatg gtacatatat gtaccaggcc 2400 gatttacccc cccaaacccc attttttcag acttcacctt acacagttat aaaaatgatc 2460 cccttggaga aaaaaatcat ctgttgtttg gttgttttag caacaaaaaa aaattcgact 2520 tgaaaggg 2528 // ID BEL-128_AA-LTR repbase; DNA; INV; 399 BP. XX AC supercont1.1; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-128_AA_; KW BEL-128_AA-I; BEL-128_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-399 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1; Positions 3321678 3322076. XX SQ Sequence 399 BP; 129 A; 79 C; 87 G; 104 T; 0 other; tgtttaaaac cctcatgaaa agggaatgaa atttccaaca tattatttgt aatatcatac 60 accgcaccca gtgtgcaacc ccactgtgca aagtcaaaca cgcagctgtc aaacagcgca 120 gcatagaaac ttgtaaatag tagcataaga gaaataaatc agttgaaatc gtcacttcgc 180 cgcgtttgca cacacaattt gttgctttcg gtttttgtgg tgtccgaatg gtaaccgcgg 240 gttgataaat cgattcttat cgtcccgttt gtgctgtgta gtgtggtggg aaaacatttc 300 cccaagaaat tgagccaaga agtgcaaaga aaggccaatt ccagtgagag gaacaaaatc 360 ctcatacgtt agtaagaaga gtggttaagt gcttgaaca 399 // ID DNAX-8C_AP repbase; DNA; INV; 151 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-8C_AP. XX NM DNAX-8C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-151 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2063-2063 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC tsd TA or TATA. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 151 BP; 36 A; 46 C; 41 G; 28 T; 0 other; ctccgcgcca cgagagaagg aacccgtgtc ccgaccgaaa cacgttcgat tccgcgcatc 60 gccacgtatt tgacatacac taaatggcac aaagtgggga atgcctacag tctgcatgcg 120 cgaaacgggt tccttctctc gtggcgcgga g 151 // ID Mariner-12_SM repbase; DNA; INV; 1442 BP. XX AC . XX DT 11-MAR-2008 (Rel. 13.03, Created) DT 11-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINER-12_SM. XX NM MARINER-12_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1442 RA Tempel S., Bao W. and Jurka J.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(3), 339-339 (2008). XX DR [1] (Consensus) XX CC TSD : TA. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 160..1272 FT /product="Mariner-12_SM_1p" FT /translation="MARQINQFPASIANITEIVVESDNTAEPAIRNTFRVQ FT RRNSTTSNADRDRVITAYDDGTSIVDISRILSIKRTTIYGILKKYHNTGVV FT EADKRGGDHPQKLTEDQKNQVKEWVDDNCQLSLKVLVEKVFTEFQIRISTT FT YVYNLLSRFHYSMKRLHLIPERRNSTEVIETRKEYALNFSSILSIYSEQEL FT IFIDEVGFNVSMRTTRGRSQIGTPATLNVPTIRSRNISIACAMNKNGLLFY FT KSFNRALNKEKFIEYITELKEVLIDKEIHRCVFIMDNVKFHKSPEVLSAIN FT NENTTVKFLPPYSPFLNPIENMFSKWKEITKRANPRNEDDLMRAIEIGSTL FT ISSQDCSNYVRNMWSYIPRCIEGEIILN" XX SQ Sequence 1442 BP; 551 A; 199 C; 226 G; 466 T; 0 other; cagggtttta agttgtattt tgacaaacat aatatatttt ttgacaatat taattagata 60 ataaacaata ttaaatctat tattgacaat atatactaaa ttaatgacaa taaaaacttt 120 attaatgaca ttataaatat atttttagac attttatgta tggctcgaca aataaatcaa 180 tttcctgctt ccatagcaaa tattacagaa atagtagtcg aaagtgataa caccgccgaa 240 ccagctatcc gtaacacttt tcgagtacaa cgaagaaata gtacaacttc aaatgctgat 300 agagatcgtg ttattacagc atatgatgac gggacttcta tagttgatat ttctcgaatt 360 ttatcaatta aaagaactac aatttatggt attttgaaga agtaccataa cacgggagta 420 gtggaagctg ataagagagg cggcgatcat ccacagaaat tgacagagga tcaaaaaaat 480 caagtaaaag agtgggttga tgataattgc caattatcgt tgaaagtttt ggtggaaaag 540 gtatttacag agtttcaaat tagaatatcc actacttatg tatataattt attatctcgt 600 tttcattatt caatgaaaag acttcattta attccagaga gaagaaattc aacagaagta 660 attgaaacaa gaaaagaata tgcacttaac ttctcttcta tactaagtat atattcagaa 720 caagaattaa tatttattga tgaggtggga ttcaacgttt ctatgagaac aacaagagga 780 agatcacaaa taggcactcc tgcaacatta aatgtaccga ccatacgatc aagaaatata 840 tctatagcat gtgcaatgaa caaaaatggg ctattgtttt ataaatcttt taatcgggcc 900 ctcaataaag agaagtttat cgaatatata acagagttaa aggaagtttt aattgataaa 960 gagattcaca gatgcgtatt cattatggat aatgttaagt tccataaaag tcctgaagtg 1020 cttagcgcta taaataatga aaatacaact gtaaaatttc ttccgccgta ctcccctttt 1080 ctcaatccta ttgagaacat gttcagcaag tggaaggaaa tcaccaaaag agccaacccc 1140 agaaatgaag atgatcttat gcgagcaatt gaaattggaa gtaccctaat atcttcacaa 1200 gactgtagta attatgtgcg aaatatgtgg tcatatatac caagatgtat tgagggagaa 1260 attattctta attaaatgtc acttaatttt tttgtcatta atatattatt attgtcatta 1320 atattattta aattggcatt tattttatta atatcgtcat taatgtattt aacattgtca 1380 aagattatat taaaatattc aaaaaatatg ttatgtttgt cataatacga cttaaaaccc 1440 tg 1442 // ID CR1-53_AAe repbase; DNA; INV; 4596 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-53_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4596 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1140-1140 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 203..1030 FT /product="CR1-53_AAe_1p" FT /translation="MSACDRCAKSIKRADDVITCMGFCEHIVHLRCTNFEK FT SIAKAISDSQNLFWMCDECTKLMKIVCFQNAISSVGNSINELTKNQEAANA FT ELKSVLAKHSEQIAQLSSRIQSTTPTIADSVSRRAMKRRRTEELAPIAKPL FT LGGTKSTDDVSIATVPPPTPLFWIYLSRLHTSVKPDIVEKLTKDSLHCESA FT KAVPLVKQGTDINSLNFISFKVGVDPKYRLSALDPSSWPKGILFREFEDNR FT AKNLWMPDPCSPSIIVTDSLETPQHAPSSMITADC" FT CDS 1034..4537 FT /product="CR1-53_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="HQGARCTASVGASGSSNTVVPYTHCLAPPASISCRPS FT RPGPVFGSKSEVSQLNTPGKYDFIRTCCLPDSFTTSSCYRVHDAMFSSPGN FT AAFNTPCHISVGPSASTTSSTCIRKTGRLQPSLMEVPRPSTAVELVPGFPD FT NPACVSCHQSRPVPACRSGQGVFQTPSSGEFLSCNASTAPDGLPTPRSHSG FT SPPSSTRNTSGFFHARHQECSSPPLQHVTVYYQNVRGMRTKTTDFFLITAT FT CDYDIIILTETWLHPDIQNSELTSNYTIFRCDRNKQTSSLQRGGGVLIAVK FT SALRCSSITLTDCNSLEQVAVSVILPETTIHLCGIYIRPTSPPEVYTTHAS FT AVKKICDLSCVSDTIVVVGDYNLPGLSWFLDDDLNSYLPSNASSEAETTFA FT EMMISTGLHQVNSLLNTNDRLLDLAFVSDAGEVEVLDPPTNLLRLDAHHKP FT FVLRIEVTNNRHLTRPESNSELSFDFNRCDFVTLNQILSSINWYDLLSNES FT SDEAVALFYTKLFDVFRDAVPYKRKRLNLAPKLPWWSPELRHLRNVLRKAR FT KRFFRSRSNEDRENLKSLEARYNECQNLYFQNYINRVESNLKHNPSSFWSY FT VKTRKSRNHLPEQMCYCGSSAANGNESADLFANFFSSVYSTNCPTLSAATR FT QCIRTHDVNLPLHSVTEQDVRAELIKLDADKGAGTDRLPPIVLKECAESLK FT TPISIIFNRSLNEKKFPTVWKTSSITPIHKSGNYTSVENYRGISIICCVAK FT VFEKIIHSVLYNATRHLISDRQHGFMKKRSTVSNLMCYTNFLSREIENRKQ FT VDTVYFDFRKAFDKVPHSLAIEKLEHMGLPVWITEWLRSYLSDRKAFVKFG FT NSQSRVFDTTSGVPQGSVLGPLIFVLFINDLSVRLKSTMLLYADDLKMYRI FT ISSTLDCCELQADIDELHLWCLENGMQLNISKCKTITFTRRLSCFPFEYKI FT NGICLDRVNTINDLGVIIDSKLKFNEHINIITAKALSVLGFVRRNSQLFHD FT VYTLKTLYCSLVRSILEHAACVWSPYHTTLEIRIEKVQRSFIRYALRSLPW FT NDPQNLPHYESRCKLIDLETLSRRRTKLKQMFVFDLITGNIDCSALLNDVM FT FFAPSRNLRGRYLLTTGAHRTSYGQNSPFTSCIQCFNDVNDQFDFNVSKTI FT FKRRIKNIR" XX SQ Sequence 4596 BP; 1273 A; 1148 C; 906 G; 1268 T; 1 other; gctttgtacg cgttttatgc tttaattgcg tgtgttttta tgcctatgtg tccacctgta 60 gtgatataat atcacttttc gtcagtgcat cttacaatat ccagctcatt taacaacaat 120 agacatctgc ctaccgctaa ctgacatcaa cccaacctgt gatatatatt gtggtgccgt 180 aatcataaac aaaatcatcg atatgtcagc ttgtgatcgt tgtgcaaaat ccatcaaacg 240 agccgatgac gtgattacat gcatgggatt ctgcgagcat atcgtccatc tcagatgcac 300 caacttcgaa aaaagtatcg ccaaagccat atccgactca caaaatttgt tctggatgtg 360 tgatgagtgc ackaaattaa tgaagattgt ttgcttccaa aacgctatct catctgtcgg 420 aaattcgatc aacgaactga ccaagaatca agaagcggct aatgctgaat taaaaagcgt 480 gttggcaaaa cacagtgaac aaattgccca actttctagc cgcattcaat cgaccactcc 540 aacaattgcg gactctgtca gtcgtcgtgc aatgaaacgc cgacgcactg aggaattggc 600 cccaatcgct aaacctttac tcggaggaac aaaatctacc gatgatgtta gtattgctac 660 agtgcctcca ccaacccctc tgttctggat ctatctgtct cgtttacata ccagtgtgaa 720 gcccgacatt gtggaaaaac taacgaaaga cagtttgcat tgtgagtcag ctaaagcagt 780 tccattggtt aaacaaggca cagatatcaa ctccttgaac ttcatatcgt tcaaagttgg 840 tgtcgatccc aagtaccgcc tttcagctct tgatccatca tcttggccga aaggaatcct 900 gtttagagag ttcgaggaca atcgtgctaa gaacttatgg atgccggacc cttgttctcc 960 gtcgataatt gtgactgact cattggagac cccgcagcat gctccctctt ctatgatcac 1020 tgccgactgc tgacaccagg gcgcacgatg tacagcttca gtgggagcct ctggttcatc 1080 caacacagtc gtgccataca cccattgtct cgctccgcca gcttcgattt cctgccgtcc 1140 aagtcgccct ggccctgtgt ttggatccaa atcagaggtc tcccaactca ataccccagg 1200 caagtatgat ttcatcagaa cgtgttgcct tcctgacagt tttaccactt ctagttgcta 1260 ccgtgttcac gatgcaatgt tctcatcacc tgggaatgca gctttcaata caccctgcca 1320 cattagcgtc gggccttcag catcaacaac gtcatcaaca tgcattcgaa aaacggggcg 1380 cctgcagcca agtcttatgg aagtccctag accctctacc gcagtcgagc tagttcctgg 1440 ctttccagac aatccagcgt gtgtttcctg ccatcaaagt cgccccgttc ctgcgtgtag 1500 aagtggtcaa ggggtcttcc aaactccttc ctctggtgag tttttaagct gcaatgcatc 1560 taccgcccct gatggtcttc ccactcctag gtcgcatagt ggatccccac catccagtac 1620 cagaaacaca tcgggctttt tccacgcacg acatcaagaa tgtagcagtc cgcctctaca 1680 gcacgtgacc gtttattatc aaaatgttcg aggcatgcgc acaaaaacca ctgacttttt 1740 cctgatcacg gcaacttgcg actacgacat tattatactt acggaaacct ggctgcaccc 1800 cgacatacaa aactccgagc tgacttcaaa ttatacaatt ttccggtgcg atcgcaacaa 1860 gcaaaccagc tctcttcaac gtggcggcgg cgtgctaatt gcagtgaaat ctgctcttcg 1920 ttgttcatct attactttaa ctgactgtaa ttccttggaa caagtcgctg tctctgtaat 1980 actaccggag acaacgattc atctctgtgg tatttacata cgacccactt ctcctcctga 2040 agtttacaca acgcacgcat cagctgttaa aaaaatttgc gatctttctt gtgtatctga 2100 cacgattgtg gttgttggag actataatct gccagggcta tcctggtttc tcgacgacga 2160 tctgaatagc tatctacctt ccaacgcttc atccgaagca gagacaacat ttgccgaaat 2220 gatgatttct acgggactcc atcaagtcaa ttccttgctc aatacaaacg atcgcctcct 2280 cgatttggct tttgtaagtg atgctggcga agtagaagtg cttgatcctc caacgaatct 2340 gttacgtctg gatgcacatc acaaaccctt cgtacttcga attgaagtta caaacaaccg 2400 tcacctgacg cgtccagaga gcaacagcga actcagtttt gattttaacc gctgtgattt 2460 cgtaacattg aatcaaattt tatcatcgat caattggtat gatttgctta gtaatgaatc 2520 atctgacgaa gcagtggcac ttttctacac caaactgttt gacgtttttc gcgatgctgt 2580 tccatacaaa cgcaagcgtc ttaatctagc acccaaactg ccgtggtggt cacctgagct 2640 ccggcatctc cgaaacgttt tgcgtaaggc ccgaaagcga tttttccgtt caaggtccaa 2700 cgaagatcgt gaaaatctca agtctcttga agcgcgctat aacgagtgtc aaaacttgta 2760 ttttcaaaac tacataaatc gagtggaatc aaatttgaaa cacaacccgt ccagtttttg 2820 gtcttatgta aaaactcgta aatcgcgcaa tcatctaccc gaacaaatgt gttactgtgg 2880 atcgtccgct gcaaatggca atgaatcggc tgaccttttc gcgaactttt tcagcagcgt 2940 atacagtaca aattgcccaa cactctctgc agcgactcga caatgtattc gtacacatga 3000 tgtgaactta cctctccata gtgtaacgga acaagatgtc agggcggagc tcataaagct 3060 tgatgctgac aaaggcgctg ggacggaccg tctacctccg atagtcttga aagaatgtgc 3120 agagtctctg aaaacaccca tcagcataat cttcaatcga tctctcaatg agaagaaatt 3180 ccctactgtt tggaaaactt catcgataac tcccatccac aaatctggaa actacacctc 3240 cgtggaaaac tatcgcggta tctccataat atgctgtgtc gccaaggtat ttgagaagat 3300 aatacattcc gttttgtaca acgctacccg gcacttgatt tcggaccgac aacacgggtt 3360 catgaagaag cgttcaacag tgtccaactt gatgtgttac accaacttcc tttcaagaga 3420 aatcgaaaac cgcaaacaag tagacacagt gtatttcgac tttagaaagg cttttgacaa 3480 agtcccgcat agtcttgcaa tcgagaaact tgagcacatg gggcttcctg tctggatcac 3540 cgagtggctt cgctcgtatt tatctgatcg caaagctttc gtcaaatttg gcaactcaca 3600 atcacgcgtt ttcgacacaa cctctggagt cccgcagggc agtgttctag gaccgttgat 3660 tttcgtgctt ttcatcaacg acctgtccgt tcgtctaaaa tcaacgatgt tattgtacgc 3720 tgacgatctt aagatgtata ggataatttc ctcgaccttg gactgctgcg agctccaagc 3780 tgacatcgac gaacttcatc tctggtgttt ggaaaatggt atgcagctta acattagcaa 3840 atgtaaaacc atcactttca cacgtcgcct atcctgcttt ccatttgaat acaaaatcaa 3900 tggtatttgc ttagaccgtg tcaatacgat taacgacttg ggtgtcatca ttgatagtaa 3960 gctgaaattc aacgaacata tcaacatcat aacagccaaa gcgttgtcag ttcttggttt 4020 tgtacggcgt aattcacaat tatttcacga cgtgtacacg ttgaaaactt tgtattgctc 4080 tttggttcgg agcattctgg aacatgctgc atgtgtttgg tcgccttacc atacaacact 4140 ggaaattcgc attgagaaag tacagcgaag ctttattcgt tacgccctca gatcgcttcc 4200 atggaatgat ccgcaaaatc ttcctcatta tgagagccgc tgcaaactga tcgatctcga 4260 aacactatca cgaagacgaa ctaagttgaa acaaatgttc gtatttgacc tcataaccgg 4320 aaacattgac tgctctgctc ttctcaacga tgtaatgttc tttgcaccga gccgaaattt 4380 acgtggaaga tatctcttaa cgacgggtgc acaccgcacc agttacggtc aaaacagtcc 4440 cttcactagt tgcatccagt gcttcaatga tgtaaacgat caatttgatt tcaacgtgtc 4500 caagacgatt tttaaacgta gaattaagaa tattagataa taagaacagt ctgtgggaca 4560 ttaaaggaat ccaagacggt gtcaaataaa taaata 4596 // ID BEL-8_DWil-LTR repbase; DNA; INV; 415 BP. XX AC scaffold_181123; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_DWil_; KW BEL-8_DWil-I; BEL-8_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-415 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181123; Positions 84084 83670. XX SQ Sequence 415 BP; 132 A; 72 C; 87 G; 124 T; 0 other; tgttgtggag gcggtacgtt actgccaatg ccctgaattt aggtagccct gatcgttttc 60 cctttagact tagtatgtat cgacctgaat gcaaccctga cttatcgagg cagaaaccag 120 ctctaagaag tgagacgagg cgtcagtata gaagaaaatt gcacaggaac gatacgttaa 180 aataaacatc cgataattgg cacgaaacga agctcgaggc taggagcctt agggcttaat 240 atatatggat aaactttata atttgttgta tctaagaaac accatcataa ttattctata 300 ctcagtttgt aataaataag tttaaaatca attcgtgcta aatgttctat ttttgtatcg 360 gtaaaatttt cgggcgttaa agtaacgcgt ggtcgagaca ttcttgacgt taaca 415 // ID BEL-32_AA-I repbase; DNA; INV; 5529 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-32_AA_; KW BEL-32_AA-LTR; BEL-32_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5529 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 243582 238054. XX CC Positions [4442-5044] - Integrase core CC 'GCAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 642..1958 FT /product="BEL-32_AA-I_2p" FT /translation="MSHLEGEAKSLVSSYAITDVNYKEVWDTLVEQYDKPK FT FAVSALVQEFCDQPVIKTANLVNLRKLVSTSDEVIRQLKAMGAAYETRDPW FT LIHIVVKKLDDNVRSQWAAHIVDIDNPTFEELLKFLKRKCDTFETCAAFSG FT KQSDYVKKDIFKEERKNPAVKREVSNFGVTQQSCPICSSGDHVIYQCTTFK FT EATVKERRDMALQMRLCYNCLRSNHCAKSCPSKSVCRTSDCQQRHHTLLCQ FT YDKAEVNSNVSTIEDKVVPATTTSETEDNTELVSCSANVKATTSVFIATLP FT TAVVRVRGKDKQFHEVRAMIDSGSQSSLISEHCVTVLGLEKENAKLIVSGV FT GSGATETTRGVVNLEISSRFDDNLICRTKAFVLSKLATNLPSRQIDTSRLK FT CLESLRLADPHFEKPRKIYVILGVDVFLSVVQDGKVKDDMGLQWH" FT CDS 2264..5053 FT /product="BEL-32_AA-I_1p" FT /translation="MHRKFASDPVFKKLYCDFMAEYLQLGHMERIPDAEIE FT MPSDQRYYFPHHAVLRSESLTTKLRVVFDGSCETATGVSLNDRLLVGPKVT FT EDVPVVFTRFRTYVVAFTADVEKMYRQVKVRKEDTNYQRIIWSPEPGKPFE FT HYRLLTVTYGSSCAPYLAIESLRQAARDCQSQYPEAAEHVSKNFYVDDLLS FT GANTLEEAIKLRNDIIQVTSTAGFTLRKWSSNDPRLFDESTETDAAVPLHL FT APDADSVKALGIQWYPATDTFGYQLKMDFNKRNTKRQMLSDSARLFDPLGW FT VAPIIVRIKILFQSLWLQDLLWDDPLPAAIDDEWNKIKNDLQAIEKIRIPR FT WIPHHGGKVQLFGFADASEAAYAAVVYARSTDQNGKIFVSLVASKTKVAPI FT QQVSLPRLELNAAVLLTDLMKQLMQSLSHLDVTCYALTDSEIVLGWLSSHP FT RKWKTYIANRTSMILEFLPRSSWNHVASADNPADCASRGLYPADLVNFSLW FT WKGPSWLYQSDETWNRHPAVMVPDPLLTDQECSSQCLMVASSGSFSSYDVE FT NYLLARFSSLNRAQRVLAYINRFKSNLLASRRGGQHVGGELDPVELHEANV FT QLARCAQHAVFQRDIQCLQKGDQLALKSQIRSLFPFLDANGTLRVGGRLQN FT SDQSFEMKHPIILPKFHRYTELLIINLHLENLHAGPTLVVATLSQKYWILG FT CQTIVRNIINNCVRCLRWKAKTAKQLMGSLPTVRIVGRRAFENVGVDYAGP FT IVLKASVLRTTKTIKGYIAVFVCLGTKAIHLEAVTALSSGAFISALKRFSS FT RRGSPSQIWSDNGTNFVGADHQLKDLLQSVNFNSEINRYLSGLGVKWTFIT FT PSAPHMGGIWESAVKSMKKHLRIVLGKQMITYEDMTTILTQIEACLNSRPL FT CALSSSTDSCEALTPGHFVIGQPLPR" XX SQ Sequence 5529 BP; 1550 A; 1242 C; 1366 G; 1371 T; 0 other; ttttggtcct tctgctccgg atagcagaga atccgcgacc ggtccgggac aatggaagcg 60 cttttgaaga agcgaaatgt tgcgttcgaa cggctgaaac gggaacacgc aggtgcgaga 120 caagtgacac aggaaacacc aaccttcgag ataaatgatc ggctgcaaaa gttggtggaa 180 atgcaggaga atttcgacaa aatccagtgc gaaatcgaag acgcagcaac agaggaggaa 240 ctgccgtcgg tgctgaatgt gcgagaggat tacgagaaac tgttctacat taccaaagga 300 atgtttactc gtatgctgga ggtagagcgg ccgagaggat cctactgcgg aagtagcgac 360 ggtacgtgca ttgaaccgga tagcggactg aaggaagctg ttcgtgtgct attggaaacg 420 caaagggcgt tgttaacaag acaagctgca gcttcaacta ccgtggagga gttagctggc 480 cagtttcgta atatgcgcga aaattcaatc gacacacaac taccatcatt caatcttccg 540 acgttcaagg cgacaggaaa cagtgggcat cgttcaagga catctttgtc agcagtgtgg 600 ataagaaaaa ccttacgaat gcgctcaaac tgcagcttct tatgtctcac ttggagggag 660 aagcaaaaag cctcgttagt agttatgcaa ttaccgatgt gaactataaa gaggtatggg 720 atactctggt cgaacagtat gacaagccga aatttgccgt ttcggcgctt gtccaggagt 780 tttgtgacca gccagtaatc aagacagcga atctcgtgaa tctacggaag ctcgtatcga 840 cctccgacga ggtgattcgt cagctgaaag ccatgggagc tgcgtacgag accagagacc 900 cttggctcat ccatatcgtt gtgaaaaaac tggatgacaa tgtaagatcg caatgggcag 960 cgcacatcgt ggatatcgac aaccctactt tcgaagaact gctcaaattc ctcaagcgaa 1020 aatgtgacac cttcgaaacc tgtgcagctt ttagcggcaa gcaatcggat tacgtcaaga 1080 aagacatctt caaagaagaa cggaaaaatc cagcagtaaa gagagaagtc agcaattttg 1140 gcgtcactca gcagtcgtgt ccaatatgtt catcaggaga tcatgtaatt taccagtgca 1200 ccactttcaa ggaggctaca gtgaaggaac gacgagacat ggcactgcaa atgcgacttt 1260 gctacaactg tttgcgttca aaccactgcg caaaatcgtg tccgtcaaaa tctgtatgcc 1320 gcacatctga ttgccagcaa cgacaccata ctctgctgtg tcagtatgac aaagcggaag 1380 tcaacagcaa cgtctcaacg attgaagata aggtggttcc cgcgactaca accagcgaaa 1440 ctgaagacaa cactgaactc gtttcttgtt cggcaaatgt gaaagcaact acttcggtct 1500 tcatagctac gttgccaact gcagttgttc gtgtgagagg gaaagacaag cagtttcatg 1560 aagtgcgagc catgattgac agcggttctc aatcgtccct aatctcagag cattgtgtca 1620 ctgtccttgg tttggaaaaa gaaaatgcga agctgatagt gtcaggtgtt ggcagcggtg 1680 caacggagac caccagaggg gtggtgaatt tggaaatctc ttctcgcttc gacgacaatc 1740 tgatatgccg aacgaaggca ttcgtgctga gcaagcttgc aacgaatcta ccaagccgac 1800 aaatcgatac tagccgtttg aaatgccttg agtcgttgcg actagccgat ccccatttcg 1860 agaaaccaag aaagatttat gtgattctag gagtggatgt ttttttgtcc gtagttcaag 1920 atggaaaagt gaaagatgac atggggctac agtggcattg aattccgcgt tcggttggat 1980 tgttgctggt caagtcgggg ctacaactga tctaacatgt aacactgcta ttgtcactat 2040 gtgcaccgaa ttcaacgttg ataaaacgct acgacagttt tgggaagtcg aggaggtgaa 2100 caagccgaag ccgataacac cgcagcaaca aaagctgtcg aagtattcca agctacctat 2160 caacgtgacg aatctggccg attcatcgta agtctcccat tcgatgaaac tactgcaccg 2220 ctaggaaaag tcacttccag cagccattca tcgactcaag tcaatgcacc gaaaatttgc 2280 aagcgatcct gttttcaaga agctgtactg tgattttatg gcagagtatt tgcagctggg 2340 ccacatggaa cgaattcctg atgcagagat cgagatgcca tcggatcaac gatattattt 2400 tccgcatcac gcagtccttc gaagcgaaag tttgacaacc aaactgcgag ttgttttcga 2460 cggctcctgt gaaacagcaa ccggcgtgtc gttgaatgat cgtttgttag ttggaccaaa 2520 agtcaccgaa gatgtcccgg tggttttcac ccgtttcagg acctacgtcg tcgcttttac 2580 agctgatgtc gagaaaatgt atcgacaggt caaggtgcgc aaggaagata ccaattacca 2640 acgcattatt tggtctcctg agcccggaaa acctttcgag cattataggc tactaaccgt 2700 cacgtatggc agctcatgtg ctccgtactt ggcaattgaa tcactccgac aagcagcacg 2760 agattgccaa tcgcagtacc cggaagcagc agagcatgtt tcgaagaact tttatgtgga 2820 tgacctactc tccggtgcca acacgctcga agaagcaatc aagctaagaa atgacatcat 2880 ccaggtaaca tcgacggccg ggttcaccct acggaagtgg tcatcaaacg accctcgctt 2940 attcgatgaa agtactgaga ccgacgctgc agtcccactt catttggcgc cagatgctga 3000 ctcggtgaag gcattgggaa tacagtggta tcctgccact gacactttcg gataccagtt 3060 aaagatggat ttcaacaagc gaaatacgaa acgtcaaatg ttatcagact cagctcgact 3120 atttgatcct ttgggctggg tcgctccgat catcgttcgt attaagatcc tttttcaatc 3180 gctgtggttg caggatttgc tgtgggatga tccattaccg gcggcaatag atgacgagtg 3240 gaacaaaatc aagaacgact tgcaagcaat cgagaagatt cgtattccaa gatggattcc 3300 ccatcatgga ggcaaagtgc agctgtttgg atttgctgat gcttcagagg ctgcatacgc 3360 ggcagtggta tacgctcggt caaccgacca aaacggtaag attttcgtct ctctggtagc 3420 aagtaaaacg aaagtggcac caatccagca agtatctctt ccccgcttgg agctcaacgc 3480 tgcggtactg cttactgacc tcatgaagca acttatgcag tctctatcgc atttggacgt 3540 tacgtgctac gcgttgacgg attcggagat cgttctgggg tggctatcgt cgcatccaag 3600 aaagtggaaa acgtacatcg ccaatcgcac atcaatgatt ttggaatttc tcccacgtag 3660 ctcctggaac cacgtagcat cggctgataa cccggctgat tgcgcgtcaa gaggtttgta 3720 tccagcagat ctggtgaact ttagcttgtg gtggaaaggt ccatcgtggt tgtatcagtc 3780 cgacgaaaca tggaatcgtc atcccgcagt aatggttcca gatcctttgc tcacggatca 3840 agagtgctca tcgcaatgtt tgatggttgc aagttctgga tcgttttctt cctacgacgt 3900 ggagaactac ctgttagcac gtttttcgtc actcaatcgt gctcaaagag tgctggcgta 3960 catcaacaga tttaaatcca atttgttagc gagccgacgt ggtggtcaac atgttggtgg 4020 cgagttggat ccggtggaac tacatgaagc aaacgttcaa cttgctcgat gtgctcaaca 4080 tgctgtgttt caacgagata tacagtgcct gcaaaaggga gatcaattag cgctgaaaag 4140 tcagatcaga tctttgtttc cgtttctgga tgcgaatgga acgcttcgtg taggaggacg 4200 actacaaaac tcagaccaat catttgagat gaaacatccc ataatcttgc ctaagtttca 4260 tcgttacacg gagctcctaa tcatcaatct gcaccttgaa aacctacatg ccggaccgac 4320 actggtagtg gctacactta gtcaaaaata ttggatcctt gggtgtcaaa caatcgtccg 4380 caatatcatc aataactgcg tccgatgtct tcgatggaaa gcaaaaaccg caaagcagct 4440 aatgggaagt ttgccgactg tgaggatcgt aggaaggcgt gccttcgaaa acgtaggtgt 4500 cgactatgcc ggtcccattg ttttgaaggc tagtgttctg agaactacca agactatcaa 4560 agggtatatt gcagtatttg tttgtcttgg cacaaaggct attcacttgg aagcagttac 4620 ggccttgtcc tcaggtgctt tcatttctgc attgaagcga tttagcagtc gtcgtggatc 4680 gcccagccaa atatggtcag acaatgggac gaatttcgtt ggtgctgacc atcaactgaa 4740 agatcttctg cagtctgtca actttaattc cgaaattaat cgttatttga gtggtctcgg 4800 agtaaagtgg accttcataa caccgtctgc tccccatatg ggcggaatat gggagagtgc 4860 tgtgaagagt atgaaaaagc atcttcgaat cgtccttggc aagcaaatga taacctacga 4920 agatatgacc actatattaa cacaaataga agcgtgcttg aactcacgcc cactctgtgc 4980 tttgtcctca tctaccgatt catgcgaagc gctgactcct ggtcatttcg tcattggaca 5040 accactgccc aggtaacatt ggtagctgaa taaaagttta ttcagctaag aaaaggctga 5100 ataaacaaca tttgagatga ataagctgtt attcagctac caatgttact tgggtgaacc 5160 ttattcctga acctggcgtg gcattgctcc cggagaaccg cctcgataaa tatcaacgta 5220 taagaaaagt agttgaggat ctgtgggacc gtttcaagac tgagtacgtg tcgtcactgc 5280 aatccaggaa taagtggcaa gcatctgtag aaaatgtgaa ggtaactgac ttggtgctgg 5340 taaaaaacga ttgtactccg ccagcatatt gggaacttgc tcgcgttact gctgttcatc 5400 cagatcgaag cggagtggtg cgtaacgtga ctctacagag aggccaaact acatatcaac 5460 gacctattca caagttggtg gtgttgccga gtaattgagg catccgcctc aaggcggggt 5520 gtttgttgc 5529 // ID Gypsy-21_OD-LTR repbase; DNA; INV; 350 BP. XX AC CABV01001265; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_OD_; KW Gypsy-21_OD-I; Gypsy-21_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-350 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001265; Positions 2029 1680. XX SQ Sequence 350 BP; 113 A; 69 C; 90 G; 78 T; 0 other; tgggggacgt aagattatta gcgaaagaga agccctaaca ggcctcaagg ctaaaagggg 60 aaagtttgtc tgaaggacag cgctcgagtt acgcggccca aaagccggat caagactgct 120 cagtccgcga actcgaggta aaggaaacgc acgtttaaat agagaattaa tttgacgaaa 180 ccaccgatca gtcgagggtt gaagacaaca aagagaatta acgtacgttt aggtttgttc 240 tacaatcaga cgaagcgcag cgaccgactt ctgcaagaat cgatccctgc tcatttctgt 300 gttttgaata aagagttgca gattgaacac gagcgtttat ttatatggca 350 // ID Mariner-2_AP repbase; DNA; INV; 2267 BP. XX AC Contig30729; XX DT 07-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 15.12, Last updated, Version 2) XX DE Mariner-type transposable element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-2_AP. XX NM Mariner-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2267 RA Jurka J.; RT "Mariner families from Acyrthosiphon pisum."; RL Repbase Reports 8(3), 341-341 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(469..1026,1049..1747) FT /product="Mariner-2_AP_1p" FT /translation="MNSDPTKSLRIIRQELSKELGIGATTISTTITEYNNT FT KKVISPCKKRVKTSLLETFEEFERNVVRRHVHSFWFKREIPTVDKIFQVVS FT DDDSLPIISRTNLFRLLKEMDFKYSKRGRNSALTEKPKFCVGVDGFLNNCV FT NTEMRVDIYIIWTRPGLMLVNVPAKHGSILQLNHPETHSYKVCQLVPTPYC FT PEYRFGRWVVPDALLCFESKKNTRDYHDEMNGETFYEWMEGVLPRLKENCV FT IIMDNASYHSVKLDKAPTSQTRKGDIIKWLEDKGEVIDRPMCIPQLLQIVK FT RIKPQHQKYVIDELAKKHNRNILRLPPYHCELNPIELAWSSVKNYVRMNNK FT TYKLHDVKNVLIEGVKRVDADMWKNFISHTKKEEDKFWEIDFVVDEVLSAE FT LESVTLTIGDTSSDDSLGTESDYFF" XX SQ Sequence 2267 BP; 728 A; 381 C; 437 G; 721 T; 0 other; ctatactcta ttccacctaa gagtgaacca ctcagccgac caaaactcgt tttgttcgcg 60 catccccacc acaagacggg cgccgccgcg aacgttggtg tgggtcgtca cgaaactgca 120 gctgcctacg tagtgagctg tattaacaca ctatcacttg tttgattcca ctcgcgcgca 180 catgttcact tttgtattta ccgccgtcat ttattagttt agctatcatt ttccgtagtg 240 aattattttt tttgttcata tattttttat taaataatta ccatggacca agaacaagct 300 gtaggcaagt ctccagtaaa aaaaaatcca tcaggaaagg taagaacatt ttttcaatga 360 atgatttttt aatttaatat gctggtttta tcataataat ttttgttaca atttcagaga 420 gtagattctg gcaagagaca aattataatc aatgcgtaca agtcagaaat gaattccgat 480 ccaacaaagt ctttaagaat tatccgacaa gaactttcta aagaattggg cattggagca 540 acgacaattt cgaccacaat tacagaatac aacaacacta aaaaagtaat atctccttgc 600 aaaaaacgtg ttaaaacatc cttgctcgaa acatttgaag agtttgagag gaatgtggtt 660 cgtaggcatg tacatagttt ttggttcaaa cgtgaaattc ctacggtcga caaaatattt 720 caagtagttt ccgatgacga ctctttacct attatttcga ggacaaatct ttttaggctt 780 ttgaaagaaa tggattttaa gtacagcaaa cgtggtcgaa atagtgcact gactgaaaaa 840 ccgaaatttt gtgttggcgt agacggtttc ttgaacaatt gcgtgaatac cgaaatgagg 900 gtcgacattt atattatctg gacgagacct gggttaatgc tggtgaatgt accagcaaaa 960 catggatcga tactacaatt aaatcacccc gagacgcatt cctacaaggt ctgtcaactg 1020 gtgccgtaaa tccttcagga aaaggtaaac gccttattgt cctgaatata ggttcggaag 1080 atgggttgtc cccgacgcat tgttgtgctt cgaatcgaaa aaaaacacta gagattacca 1140 tgacgaaatg aacggggaga cattttacga atggatggag ggtgtactgc ctcgtttaaa 1200 agagaactgt gtcatcatca tggacaatgc gtcatatcat tctgtgaagc ttgataaagc 1260 acctacgtcg caaaccagga aaggtgatat aataaaatgg ctggaggata aaggcgaggt 1320 tatagacaga cctatgtgta taccacaact tcttcaaatt gtgaaaagga tcaagccaca 1380 acaccagaaa tacgtaattg acgaattggc aaaaaaacat aaccgtaata tactaagact 1440 acctccatat cactgtgagc tcaatccaat agagttggct tggtcatccg ttaaaaatta 1500 cgtacggatg aacaataaga cctataaatt acacgatgtc aaaaatgtat taattgaggg 1560 cgtcaaacgc gtggatgcag acatgtggaa gaattttatt agtcacacga agaaagaaga 1620 ggataaattt tgggaaatcg acttcgtcgt agatgaagta ttgtctgcag agttggagtc 1680 agtaactttg acaatcggag atacatcatc agacgactcc ctcggcactg aatctgacta 1740 ttttttttaa atttttattc tattatattg acgtgtgtat ttttttattt tttttatttg 1800 tattttatta tattgactta tgtatttttt aaatattgta tttgtattat attatattga 1860 cgtgtgtatt tttaaaatat tatattttta ttatattata ttgacgtgta cttttttttt 1920 atttgtattt ttaaaaaaat tgtttttgtt ttatacctat tatattaaat ttattattta 1980 agcaataaaa caatttaatt tataatattt tatatttatg tgtttgtata taatatagtt 2040 tgatttatga tgaaaattca aaataagtgt tgtatagtaa ttaatcgcga aaacagctat 2100 gtttttatga ttctctgtat gcacggcggt cgacaactta cgacgaccgg cgatgactgt 2160 cgaccaatga cagaacgcct agccgtcacc aggggggtca gacaatgatc ggcggctcag 2220 tctgcgtgcg cgaaagcggt tcactcttag gtggaataga gtatagt 2267 // ID LOA_Ele7 repbase; DNA; INV; 5878 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A LOA clade non-LTR retrotransposon family from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; Lian; KW LOA_Ele7. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5878 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5878 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 12 sequences with >92% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 318..1970 FT /product="LOA_Ele7_1p" FT /translation="MSDIGEKQLMEIDVEDVLXSSSVDDVPSCSSSILDNP FT VGDDGYEDDGDFEDGINVTILSSTANIVHDTSAVDKTVLGSSAGKQPVLPG FT DQQRTKLNGAARKRYKRLVAEGIDPEEAYTLSRVPLQTPNSEKRSRNADLS FT GSTSSGENPRKKQNVRPLLEPRTGSNHXSLGKFSVQGRLQINSNVCSGRNE FT TIQSQNKASYSEVVNYVRVGIVPKDYPNVELTSAQLLATRKAILSKVAQQR FT KEKIXPKFGQCLLRTGHLILVCKNQETAGWLKSIASTISVVEDVELTALDE FT KKIPRPEIIIGFFPVSAEDSTDDILELLESQNEGLNTDEWRIKERNIINQL FT HVELIFTVDGASLDAIKKCEFTLDYKFGTAPLRRKIPPKIKMPNQGVNNDL FT VSPSDNLNNRRVLEEQQTTLIGSGVIRQTATPIENLKTTGDSSSASNREIQ FT NMPGPSGIRDASFKGARPKQILEGTSSGLVRKNDTAPQSPGSNGTMESKRN FT VKKTKILPVKDKNPLHKNKYIAKYMKNLGDRVTNTEGGKPPRESDTTMKHE FT NGQH" FT CDS 2099..5749 FT /product="LOA_Ele7_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MRREQQLYFKKNFVHNKMDIALIQEPWAYDRKVLGIP FT TSAGQLIYDENQIVPRAAILISNRMKFLPITQFISKDIAVVLMEVPTTRGI FT AEMYVASAYFPGDLEAIPPPDVVNFVSYCKKNNKPFILGCDANAHHIVWSS FT SDINYRGEYLLEYIITNNIDICNQGCEPTFITSTRKEVLDLTLCSTNISEN FT VKNWHVSNETSLSDHRHILFQYQTNEIFTNEFRNPRKTNWDLYYQNILNES FT HISDEKFNSITELEKASVQVSNLMQHAFQASCPITVRSTSRDVTWWNDNLE FT ALRKKTRKLFNRAKRTMKWEXYKIALTDYNKEIRRSKRMDWRRTCESIENT FT PVVARLQKVLSKDHVNGLGTVRKSNDCLTSNGLETLEVMMQTHFPCSTVIN FT YDENATEDNTSNELSRMTSLGEIHDSKDNLDLLPILNMAAQKADEIFTKNK FT VEWAINSFDPFKSPGKDGIYPAFLQKSNGVLTSALSNMFKASLVFGYIPKS FT WRQVRVVFIPKANKKDKTSPKSFRPISLSSIMLKIMEKLIDHYIKTAYLKQ FT NPISIHQFAYQSGKSTTTALHALVSKIEKTFEAKEVLLATFLDIEGAFDNA FT SHKSMARAMLKHGFDICIVKWISEMLSKREITADLGSVSISVKAVKGCPQG FT GVLSPLLWSLVVDELLTNLESQGFEIIGFADDIVIIVRGKYDSVITNRMQR FT ALDYTIQWCITEGLDINPQKTHIIPFTKRRKLIISGLHMKGTLMTLSTEVK FT YLGIILDCKLNWNSHLKQIIDKATNALWLSKRTFGNKWGLRPIMIHWIYTT FT IIRPRITYASLVWWTKTKEKCAIQKLAKLQRLATMCTTGAMRSAPTKALDA FT ILNLLPLHQFIQLEAEKSALRFQQWKPLLSGDIKGHLSILNDIKINPLVTV FT NTDWMEIQFNFNRMFNVVESDRTVWTMGGPQIRSGSIIFYTDGSKQNGQVG FT AGVTGPGVNLSVSMGRWPTVFQAEIQAILECASICLKRKYKHSNICIFSDS FT QAALSALKSYTCTSKLVWECFLLLQQLCINNTVNLYWVPGHCGIDGNEKAD FT ELARIGSTEKFLGPEPFCGVSTCSIKMELRNWKQSMVITNWNKTAVARQSK FT QFITPHASNTQKLLFLSKKDLCTYTGLITGHCSAKYHLRLMGKVEDDICRF FT CQESCETSEHLLCSCDALFQRRVKFFDKGDIQPFEIWSLPPGKVVRFIRHI FT IPNWDNT" XX SQ Sequence 5878 BP; 2038 A; 1078 C; 1142 G; 1611 T; 9 other; actsawcaat aggttcataa gtgttatacg tttactacat atacgtccat atctttctct 60 gttaacttgc atgcaagatt tmtatatgtt gtggtagaaa taccatcaac sgagtatcaa 120 ctgaagtcgt tgtgaaccat ttgcaatgac tcaaccgttc atactcttcc tcagaacaca 180 atttgttctg cggatatgaa tgtttttctt gtctcgtaca agtggtagta cagtggcaga 240 cccgactgca ctataatata cgattgataa taaatcccac cgttcgtact ttaatcttca 300 cgatttcagc cacaaatatg agcgacattg gggagaaaca gcttatggag attgatgtag 360 aagatgtttt ggwcagcagt tcggtggatg atgttccatc ctgctcttcg tctattttgg 420 ataatccggt tggcgatgat ggttacgaag acgacggcga cttcgaagat ggcattaatg 480 tcaccatact gtcatcaaca gccaacatag tgcatgatac ttctgcagtc gacaagactg 540 tgctgggtag ctctgcagga aaacaaccag tgttaccagg tgaccagcag agaactaagt 600 taaacggcgc tgctaggaaa cgttataagc gattggttgc cgagggaatc gatccagaag 660 aagcttacac cctttctcgt gttccactac agacgcctaa ttccgagaaa agatctcgaa 720 atgctgatct tagtggttca actagcagtg gtgaaaatcc tcgcaaaaaa caaaatgtac 780 gtccgcttct agagccaaga acaggatcaa accaccmgtc attggggaag ttctcggttc 840 aagggcgtct tcaaattaat tcaaacgtat gctctggacg aaacgaaacg atacagagtc 900 aaaataaagc ttcttatagc gaggttgtga actacgttag agtaggaatt gttccgaaag 960 actaccctaa cgtggagctt acatctgcac aattacttgc aacacgcaaa gctattcttt 1020 caaaagttgc acaacaacgg aaagaaaaaa ttwaaccaaa attcggtcaa tgtttgctta 1080 ggacggggca tctgattctt gtctgtaaaa accaggagac agctggctgg ctaaaaagca 1140 tagcctctac aatwtccgtc gttgaagatg tagagcttac tgctttagac gagaaaaaga 1200 ttccacgtcc tgagattata attggatttt tcccagtgag tgcagaagac agtacagatg 1260 atattctgga actcctggag agtcaaaatg aaggtctgaa tacggatgaa tggcgaatca 1320 aagagcgcaa catcattaat cagttgcatg ttgagttgat ttttaccgtg gatggagcct 1380 cgctggacgc aatcaaaaag tgtgaattca ctcttgatta caagttcgga acagctccac 1440 tccgtcgtaa aatccctcca aaaataaaaa tgccaaacca aggagttaat aacgacttgg 1500 tttcaccatc cgataatttg aataatcgca gagtactaga ggagcagcaa actactctga 1560 ttggttctgg agttattaga cagacggcga ctccaattga aaacctgaag actactggtg 1620 attcaagcag tgcttcaaac cgcgaaatcc aaaatatgcc aggacctagt gggattaggg 1680 atgcaagctt taaaggcgca cgtcccaaac aaatccttga aggaactagc agcggactag 1740 tgcgaaaaaa tgatactgca ccacaatcac caggatctaa tggtacaatg gagtcaaaac 1800 gaaatgtcaa aaaaacgaaa atacttcccg ttaaggataa aaatccttta cacaaaaaca 1860 agtacattgc taaatacatg aaaaatctgg gtgatcgcgt aaccaacaca gagggaggaa 1920 aaccacctcg ggaaagcgat actacaatga aacatgaaaa tggccaacat taacgtaaat 1980 aaatcatgta atatccaaag cagcaaccaa aacaatcata gttctatcac aaataccaac 2040 acaaataata ttaatagaca gaataggttc ataaaattca ttcagataaa tttacatcat 2100 gcgaagggag caacagctgt acttcaaaaa aaactttgtg cataataaaa tggatatcgc 2160 gctcatacag gaaccgtggg cttatgacag aaaagtgtta ggaataccga cctctgctgg 2220 tcaacttatt tatgatgaaa atcaaatagt tccaagagca gctattttaa taagcaaccg 2280 aatgaagttt ttaccaatta cgcagttcat tagtaaagat attgctgttg ttttgatgga 2340 ggttccaacc acgcgaggaa tagcagaaat gtatgttgca tctgcgtatt ttcctggaga 2400 tttggaagcg attcctcctc ctgatgttgt caatttcgtt tcatattgta agaaaaacaa 2460 taaaccattt attttaggat gtgacgcaaa cgctcaccac atagtatgga gcagctctga 2520 catcaactat agaggtgagt atctcttaga atatataata actaataata ttgatatttg 2580 taatcaagga tgtgaaccga ccttcattac ttcaaccaga aaggaagtgt tggatctgac 2640 tctatgtagt actaacatat cggaaaatgt taagaattgg catgtatcta atgaaacatc 2700 tctgtctgat cacagacata ttttatttca gtatcaaaca aatgaaatat tcacaaatga 2760 atttcgaaat ccccggaaga caaactggga tttatattac caaaatattc taaatgaatc 2820 tcacatttct gatgaaaaat ttaactccat aacagagcta gaaaaagctt ctgtacaagt 2880 atcaaattta atgcagcatg cttttcaagc aagctgtcca attactgtac gttctactag 2940 tagagatgtg acgtggtgga atgacaactt agaagcatta aggaaaaaaa ctagaaagtt 3000 attcaataga gcaaaaagaa ccatgaagtg ggaaamttat aaaattgctc taacagatta 3060 taataaagag ataagaagat caaaacgaat ggactggagg cgtacatgtg aaagcattga 3120 gaacactcct gttgtcgcaa gactacaaaa agtcctttca aaagatcatg ttaatggctt 3180 aggaacggtt agaaaaagta acgactgcct tacttctaat ggacttgaaa ctttagaagt 3240 aatgatgcag acgcactttc cttgttctac agttataaat tacgatgaaa atgctacaga 3300 agacaatact agtaacgaat tatctaggat gacttctctg ggcgaaattc atgatagtaa 3360 ggataactta gacttactac caatcttaaa tatggctgct caaaaagctg atgaaatttt 3420 tacaaaaaac aaagtagaat gggccattaa ctcattcgat ccattcaaat ctcctggcaa 3480 ggatggaata tatcctgcat ttttacaaaa aagcaatgga gtactgactt cagctctttc 3540 gaatatgttt aaagcaagtc tagttttcgg ttatattcca aagtcatggc gtcaagtgcg 3600 tgttgtgttt attccgaagg ctaacaaaaa agataaaact tctccaaaat cgttcagacc 3660 cataagtcta tcatcaataa tgctgaaaat tatggaaaag cttattgatc actacattaa 3720 aacagcttac ttaaaacaaa atcctataag cattcatcag tttgcttatc aatctggaaa 3780 atctactaca actgcgttac acgcacttgt ttcaaaaata gaaaaaacat ttgaagcaaa 3840 ggaagttttg cttgcaactt ttcttgatat agaaggagct tttgataatg catcacataa 3900 atccatggcg agggcaatgt taaaacatgg ttttgacata tgcatcgtaa aatggattag 3960 tgaaatgtta tcaaaacgcg aaataacagc cgacctcggt agcgtatcta tatccgtaaa 4020 agctgtaaag gggtgtccac aaggaggcgt attatcgcct cttctgtggt cgttagtggt 4080 agatgaacta ctgacaaacc tagaatcaca aggattcgaa ataataggat ttgctgatga 4140 tattgttata atagtccgag gtaaatacga ttctgtcatc accaacagaa tgcagcgagc 4200 cttagactac acgattcaat ggtgtattac agaaggatta gacataaatc ctcaaaaaac 4260 gcatattatt ccatttacca aaagaagaaa attaataatt tctggacttc atatgaaagg 4320 aactttaatg acactttcta cagaagttaa gtacctagga attattttag attgtaaatt 4380 aaactggaat tcacacctga aacaaatcat agataaagca acaaatgcgc tatggttaag 4440 taaacgtact tttggtaata aatggggtct gaggccgatc atgattcact ggatatatac 4500 gaccatcatt agacccagaa taacatatgc ttctttagtg tggtggacca aaactaagga 4560 aaaatgcgcc atacagaagt tagcaaaact tcaaagacta gcaacaatgt gtactacagg 4620 tgccatgcgc agcgcaccca caaaagcttt agatgcgatt ttaaatcttc ttccactaca 4680 tcagttcatt caattagaag ccgaaaaaag tgctcttcga tttcaacaat ggaaacctct 4740 tcttagtggt gacatcaaag gacatttaag catcttaaat gatatcaaaa ttaatccatt 4800 agttacagtc aacacagact ggatggaaat acagttcaat ttcaaccgta tgttcaatgt 4860 ggtagaatca gatcgtactg tttggaccat gggaggacca caaattcgat caggatctat 4920 cattttttat accgatggtt caaaacaaaa tggtcaagta ggagctggtg taacaggccc 4980 tggtgttaac ctttcagttt caatgggtag atggccaacc gtcttccagg cagaaataca 5040 agcgattttg gaatgtgctt ctatttgctt aaaaaggaaa tataaacatt ctaacatatg 5100 tatattctct gatagtcagg cagctttgtc tgctttgaaa tcgtatacgt gcacgtcaaa 5160 acttgtatgg gaatgcttcc tcttgctcca acaactttgt attaataata cggtaaatct 5220 ttactgggtt cctggacatt gtggaatcga tggcaatgaa aaggctgatg aattagccag 5280 gattggatca accgaaaaat ttctaggtcc agagccattt tgtggagttt caacatgctc 5340 cataaaaatg gaactaagga actggaaaca gtcgatggtg attacaaact ggaataaaac 5400 tgctgtggct cgacagtcaa aacagttcat aacgccgcat gcttcgaata cacaaaagtt 5460 attatttttg tctaaaaaag acctgtgcac atacacagga ctgattacag ggcattgttc 5520 agccaaatat cacctaagac tgatgggaaa ggttgaagat gatatttgtc gtttttgtca 5580 agagagttgt gaaacctctg aacatttatt atgtagttgt gatgctcttt ttcaaagaag 5640 agtaaagttt tttgacaagg gagatataca gccctttgaa atatggtctc tgcctcccgg 5700 taaggttgtg cgtttcatac gtcacattat acctaattgg gataatacct aagtcatgcg 5760 ctgataatta cttttatagt aatttatcac tttgacaagg tacattaaaa agggttgtgt 5820 atcacaaaag atcaatcagt ggtcgcagtg ataatatacc caacaaaaaa aaaaaaaa 5878 // ID Gypsy-25_OD-I repbase; DNA; INV; 10004 BP. XX AC CABV01002755; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_OD_; KW Gypsy-25_OD-LTR; Gypsy-25_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-10004 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002755; Positions 15431 5428. XX CC Positions [2901-3497] - Reverse transcriptase CC Positions [4696-5184] - Integrase core CC 'ATGCA' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 71..1822 FT /product="Gypsy-25_OD-I_2p" FT /translation="MSDAFRVISRALHYTSKVSASIRQHLLFYSLVHLDEE FT TQEKYKQTNTIPSNSELVNRLKELNLNYIVYIFAVYKGRYFVIPIRVSESP FT EYRNQIAPFTTLLYDFKHDESLRRRTSILNIEEILQLSDEGLEGHINYSIR FT IIIHIVDSDKSIANSGLEVSVMEEENQPMRRARSEETIFEDAMEDEQEMIN FT ESIRQNVADNNRTFETPIIEDQPFARPQANQQQRPRGYQVNTGEVKQNDQD FT NGDASENWKNHRIPKYVDSGLGVRDWANKIVFMVDFSRKRPMTSKEKCQLL FT LENIPAKSFGSIMQAFSETANQSYDELVDIIEDELQIDEAEASLNLANLKF FT NEERDKDLKRFFEKIKKLVQIKYPNLEKEGVLTTSMELFEKQVPNYVKNSE FT NWGLDVYDDSDPAARVTLANRIFNLNKNRRSVNAISDARKPGNQKKNGQSE FT TRSCSNCGRKGHIRPECRLLPECYSCHKRGHLANECRSRPKPADQNPRRGG FT SNNSRGSGRGYTNNFQYSQQQNQRGQQSNRGNYRPSNNSSQQNRNCFLCGL FT PDHWAKNCPKNKRVNNFNMQGPEQNHEMTNTPYININ" FT CDS 1845..4256 FT /product="Gypsy-25_OD-I_3p" FT /translation="MLPLIKCVLVQYVDNEELTYEISNVLCDTGAETSIIS FT SDDIPSKFIKEESPSITLSNAIERSFASCEEQLVCNIRLPSDNLEITNCKM FT FILKNSAQLAFSAILGMNVLKKVQLRVGHQDNILQINRMNFTMNVLEIKVT FT PSGNIPGLTLENDIYLWPGEAHRIAKVKVYNQTDDTPALNFIRTVPRLEDN FT QIQVENRRYQQNWTHVKLSNHSDKDVFITKGSLIATEKAPQSYEESLLNTL FT VAVKDISNAEKVVHQKELKNWFDRRNKLTKEIPIAKEIDDIVLQAPQKHRE FT KLRQTLEKYQWNFARNTSDNGLSQKFMGELKLTGDHASFSPPYPIKTELVE FT KIENKLCEMEKNGIVEETCSAFNSPVLFIMKPNGSLRTVNNYSSGAESVNS FT RIVMPRFPTINVRVLLQLIGNHISTLRKRRPEQPIVFGSIDLANAFYAVGL FT RESSRPYTAFLYSRRQLQYSRLAQGLSSSPSIFAAFANKVFSGASSEEKGF FT FTMNYQDDLVLLCSEDAMIDAIDTVLKRCRDNNLVVRLSKCEFFRKNLQFL FT GYQLSENGITVPEKRIKILLDFPYPKTVTEAMRYQGAHMYYLRQTPELSAL FT MSPLSREISKGKNYTLTDEIKNGIDKLRENIKNGLRTCHLEYNSNKRNRHI FT FICADTSLYQTGGVIGNCTLENGQIQDVKIAGYSSKRLNEQESMLSSRARE FT LIGIQAALRSFKDMIPTTEDLLIFTDHRSLDGIVHSSGLRGSGSTRTRSAF FT ADLVEYPRSRVFYVPNTSDIIKCVDSISRININDSEIKVDVFNPKGFSYSK FT ER" FT CDS join(5718..7652,7656..8702) FT /product="Gypsy-25_OD-I_4p" FT /translation="MKLRGYRAASSSVLRVIFFITLTSDPSYATSTLSNVT FT TLNGVVTYQLPNDNVYLVNDEQPQNISFAFPSPSIPEKTWTSCSETNSNIT FT SMSIKYVNALLSEWLNINIEQPNREIDYSVTSDGKLIIHDAAEKINEFSTK FT IKFPNGKTKTFDKFTQFHFQQCFVDQNTHEFEFSAYEHFNAVLIQTNSFEN FT IIVDCDWNSRTGNVETQVIANTGITKIMLNSPTKTTKIQIKFVGSNLVNVD FT NQEEVKACLSLIIENIIVVFDNNLHSKVDENIIGTTSVSIPKTTTTKTTTT FT STTTNLSDQDQIIGVITQFGSTLFKDLGETRSIGIANKLWLNTFHDKEYSY FT LPFTKNMTLHNLFENMYRNGQKRHHITLPTNSCSDDFQFQSIIKLRTPINT FT IGVKLQLSEKTQEWLNDHKRTYLDILEIYKLKIRQIRLSSSLMEIYLKDRL FT ELPRRAPFIVSCQTHLYDYLDISCEPPNSFYKGTSNNYFLLFSIRLSKANC FT VTFEKFLVEHGSTTFNVTYQATPRNRKRRQALAAIGVPVILAGGTSYITNK FT FMEKKEEAEIERVRQTIKLSNLQQEDLINLVKSDFDKVDLEFDHVYEKFEE FT ETTKQCQELGMLSDRMFLEVVNDLFIYYRLTVMSFLTDINTQNRFSTQSTI FT MSLCKTYNSNDLSTACSDYFQSNRAKIHGVNPVLNSNGIISVKVIVEIMMP FT RFDFLGKTTRTLTIGLPISSENDQYYFEFPIVPKRISMKDGKVTVIDECKT FT VGDSTFCDYSEILRSSIYERCGRQVITNSDLSSCEKKIYKTSTPCIYQASS FT SKILISSFTDVYLLNDNHDKIADEEPKITLIEKSNAKMMMCGTNMLTLPDI FT TESSEMVLEKEYGSIQSSAESWISQGEYVRIGKFAERSTYFNDEEYDTLAR FT KTDQLVHEKLKPVYENLRPLSFYITSIFGIITILTCIYLAIKIFKCARARF FT LSRTSRPEEEPIPMKAASTSQQRSLQNRRRARSFRNYNL" XX SQ Sequence 10004 BP; 3562 A; 2096 C; 1850 G; 2496 T; 0 other; tctggtgact gagaaaaaag cgaactaacg cccgaagaca ttcaggaaaa gggttttttt 60 atggaaattc atgtcagatg catttcgggt aatttccagg gctttacatt atacgagtaa 120 agtaagcgct tcaatacgtc aacacttgct gttctactcg cttgttcacc ttgacgaaga 180 aactcaagaa aaatacaagc aaacgaatac gattccttca aacagtgaat tggttaatcg 240 acttaaagaa cttaacctca actatatcgt ttacattttt gcggtataca aaggacgata 300 ctttgtcatt cccatcagag tatctgaatc accagagtac agaaatcaaa ttgcaccatt 360 taccacactt ttgtacgact tcaaacacga cgaaagttta agaagacgaa cttctatctt 420 aaacatagaa gaaattcttc aactctcaga cgaaggtcta gaaggtcaca ttaattattc 480 aattcggata atcattcata tcgttgactc cgacaaaagt attgcaaaca gtggcttaga 540 agtctcagtt atggaagaag aaaatcaacc gatgagaaga gcaaggtcag aagaaactat 600 atttgaagat gcaatggaag atgaacaaga aatgattaac gaaagtatcc gacaaaatgt 660 agcagacaac aacagaactt ttgagacccc aatcattgaa gaccaaccat ttgcgagacc 720 acaagccaat cagcaacaaa gaccaagagg ttatcaagtg aatactggtg aggtcaaaca 780 aaatgaccaa gacaatggag atgcatcaga aaactggaaa aaccatcgta ttccgaaata 840 cgtagactca ggcctaggcg ttcgggactg ggctaacaaa attgtcttta tggttgactt 900 ttccagaaag cgtccaatga catcaaaaga aaaatgccag cttctccttg aaaatattcc 960 agcaaagagc tttggcagta tcatgcaagc cttttccgaa accgcgaacc aaagctatga 1020 tgagcttgtt gatattattg aagacgaact tcaaatcgac gaagctgaag caagtttgaa 1080 cttggctaac ttgaaattca atgaggagcg agacaaagat ctaaagcgat tcttcgaaaa 1140 aatcaaaaaa ttggttcaaa tcaaataccc aaatctggaa aaggaaggag tccttaccac 1200 gagtatggag ctttttgaaa aacaagttcc taactatgtc aaaaactccg aaaactgggg 1260 ccttgacgtt tacgatgaca gtgatcctgc tgcaagggtt acacttgcga atcgtatctt 1320 caatctgaac aaaaatcgac gatcagtaaa tgctattagc gatgctagga agccgggaaa 1380 ccagaaaaaa aatggtcagt ctgaaacaag aagttgctca aactgtggtc gcaaagggca 1440 cataagaccc gagtgtcgac tgctacccga atgctactca tgtcataaac gcggacacct 1500 agcaaacgaa tgcagaagtc gtccaaaacc agcagatcag aacccaagaa gaggaggatc 1560 aaacaattct cggggctctg gccgaggata tacaaacaac ttccagtatt ctcagcagca 1620 gaatcaaaga ggtcaacaaa gcaaccgtgg aaactatagg ccaagcaaca actcgagcca 1680 acaaaatcgt aattgctttt tgtgcggatt acccgatcat tgggccaaaa attgccctaa 1740 aaacaaaagg gttaacaact tcaatatgca gggtccggag caaaaccacg aaatgacaaa 1800 cactccatac atcaatatta attgaaaccc gcaaaagact gtaaatgctc ccgctcataa 1860 agtgtgtact tgtacaatat gttgacaatg aagaactaac ctacgaaatt tcaaacgttt 1920 tatgcgacac aggtgcggaa acaagcatca tttcatccga tgacattccc tccaaattta 1980 taaaagaaga gtcaccatct ataacattaa gcaatgcgat cgaacgaagt tttgcaagtt 2040 gcgaagaaca actcgtctgc aacatacgat tgcctagcga taatctcgaa ataacgaact 2100 gcaaaatgtt cattctgaaa aattctgctc agctagcctt ttcggcgatt cttggaatga 2160 acgttctgaa gaaagtgcag ttgagagttg gtcatcaaga caacattctt cagataaatc 2220 gcatgaactt taccatgaac gtcctcgaga tcaaagtgac tccaagtgga aacataccag 2280 gcctgaccct tgaaaatgat atttatcttt ggcctggcga agcacatcga attgcgaaag 2340 tcaaggtcta caatcaaaca gatgacactc cagcactcaa ctttatcaga actgttcctc 2400 gattagaaga caaccaaatc caagtcgaaa acagacgtta tcaacagaac tggactcatg 2460 tgaaactcag caatcattct gacaaggacg tttttatcac caaaggaagc ttgattgcga 2520 cggaaaaggc accacaatca tatgaagaaa gtctactcaa caccctggtt gcggtcaagg 2580 atatttccaa cgcggaaaaa gtggtccacc agaaggaact caaaaactgg ttcgacagac 2640 gtaacaagct tacaaaagaa attccaattg ctaaagaaat cgacgatatc gttctacaag 2700 cgcctcagaa acatcgagaa aagcttagac agactcttga aaaatatcag tggaattttg 2760 caagaaacac ttcggataac ggattatctc aaaaattcat gggtgaattg aagctcacgg 2820 gagatcatgc ttctttctct ccaccgtacc ccatcaaaac cgaacttgtt gaaaaaatcg 2880 aaaacaagct ttgcgaaatg gagaaaaatg gcatcgtgga ggagacttgt tcagccttca 2940 acagccccgt tctttttata atgaaaccaa atggaagcct aagaactgta aataattaca 3000 gtagcggcgc ggaatcggta aactctcgaa tcgtcatgcc acgatttcct acgataaatg 3060 tgagagtcct tttgcaacta atcggcaatc acatctctac actcaggaaa agacgaccag 3120 aacagccaat tgtattcggg tctatagacc tggcgaatgc gttctacgcg gttggtctta 3180 gagaatcatc tcgaccatac acggcatttc tatattcacg acgacagctc caatattcac 3240 gactcgcaca aggcttgtcc tcatcgccgt caatttttgc agcctttgca aacaaagtct 3300 tctcaggtgc atcatcagag gagaaagggt tcttcacaat gaactaccaa gacgatctgg 3360 tactcttatg ttccgaagac gctatgatcg atgcgattga tacagtgctg aaacgttgcc 3420 gagataataa tttggttgtg cgactttcga aatgtgagtt ttttcgaaaa aatctgcaat 3480 ttcttggata ccagctctca gaaaatggaa taaccgtacc agagaagcgt atcaaaattt 3540 tattggactt cccttatcca aagacggtca cagaagctat gcgatatcaa ggcgcgcata 3600 tgtactacct aaggcagacg cctgaactaa gcgcgctgat gtcaccactt tctcgtgaaa 3660 tttccaaagg aaaaaactat actctcacag acgagataaa aaatggaatc gacaaactcc 3720 gggaaaatat caaaaacggt ctacgaacat gtcatctgga gtacaattct aacaaacgga 3780 atagacatat ctttatctgt gcagatacaa gtctttatca aacaggagga gtgattggaa 3840 attgcacatt ggaaaatgga caaattcagg acgtcaagat cgctggatac tcatccaaac 3900 gacttaatga acaagaatct atgctgagca gccgtgcacg agaattaatc ggaatacagg 3960 cagcactcag gtccttcaaa gatatgattc caacaacgga agatctactc atcttcacgg 4020 atcaccgctc tctcgatgga attgttcaca gctcaggtct cagaggctca gggagcacgc 4080 gcacaaggtc tgcattcgcg gatttggtcg aatatcctcg cagtcgtgtg ttctacgtac 4140 ccaatacaag tgatattatc aaatgtgtgg attcgatatc aagaataaat atcaatgaca 4200 gtgaaatcaa agtcgatgtt ttcaacccca aaggtttttc gtacagcaag gaaagatgaa 4260 gaaatcagtg gaaaagataa atcagaagaa gttcaagtaa acgctgtcaa attgcgaaaa 4320 aaagtccaac aagtcgacta tttacaaatc atccgagaac agctggccag caaacgtttc 4380 tcaccaattc agaagaagtt agaaaacgat gaactgatca ccatcgaaga caaaacttac 4440 atgaagaaga ataacgggtt atacttgaaa acaaacagtg gaaagttcct gcttgttatt 4500 ccaaaaacgc tcgcgtacaa tattttggaa aacatacatg tgcattcggc acacgcaggc 4560 gtacaaagta ttttgcgtca aattgaacaa gaagacgtat gggttgaagc gaagaataag 4620 attgctgcgg aagtttgccg agcatgcatt ctttgcaagt tgatttacag aaaacacgaa 4680 aagaagtcca aagaccaaaa aatccgacca agctttagtc cttattccat ggcatatacg 4740 gacattgttg agattcgctc agaaaacaat gcagtattca acgtgctttc atttcaagat 4800 catttcagtc gcaaagtgac atatcgtgtg gtaagaaata aagaatcagt tacggtggca 4860 gaagctctga cagaacttat cacggaagtc ggtgggcaag gtaaacttac actagtcagt 4920 gacaacggcc cagaattcaa gggaaaaggt acagaagaag tgctgttatc actcaatgtg 4980 cggcattgtt atataagtcc tacaaattct cgcagtaaca ttgttgagag gtttcattcg 5040 gaattgcgaa gaattctgaa aacaacaaac atcaacgcac gaaacgccaa acataaaatc 5100 aacatcgcag taagcatata caacaacaaa ccggcggaag cgttggaatt cagaagtccg 5160 aatcaagtcc tttcgaatat tgatccgcct aagtattttt gcttatcaag cccaagtcaa 5220 ccggaggtgg aaccgaaata cgaagatcat gtgacagatt taaaggaaat gcaaggaatt 5280 gttgcagcga ctcatctacg aaattttcta gcggttacga catctgaaca agatattttt 5340 gaagtaaacg acatatgcgt tctcctggaa tcgacaatta ttggccacaa taaaatcaac 5400 gaaggcccat ttattgtcaa acgactaagg caaaacaaca acgttgatgt taaaagccta 5460 gtcaccggaa gaatttatca tcgtaacgta agatatctct ctaaaataca cctcagtgac 5520 gaagataaaa agaagctagt gatagacgat tcaattgtct ttgacccaaa aacttttgaa 5580 attggtccga aagaactcga atctgttaag tcagttcttg attttacgtt taaaaaccca 5640 aataagaaag acaacgaaac acccgaagcg tcgacgaata cggaacaaca acaccgctat 5700 caactacgac cacgaaaatg aaactccgcg gctatcgagc tgcctcaagt tccgttttac 5760 gagtcatttt tttcatcaca ctaacttccg atccatccta tgctacgtct actttatcaa 5820 acgttactac tttaaatgga gtcgttactt atcaattgcc aaatgataac gtttatctgg 5880 taaacgatga acaaccccaa aacatttcgt ttgctttccc ctcgccttct attcctgaaa 5940 aaacatggac atcatgctca gaaacaaact ccaacataac atcaatgtcg ataaaatacg 6000 tgaacgcact tttatcagaa tggctgaata tcaatattga acaaccaaat cgtgaaattg 6060 attattccgt tacaagcgat ggaaaactaa tcatacacga cgctgcggaa aaaatcaatg 6120 aattttcaac aaaaatcaag tttcccaacg gcaagacaaa aacttttgat aaatttacgc 6180 aatttcattt ccagcaatgc ttcgtcgatc agaacaccca tgagttcgag ttttcagctt 6240 atgaacactt caatgcagta cttatccaaa caaactcttt cgagaacatc attgtggact 6300 gtgattggaa ctcaagaacc ggaaatgtgg aaacgcaagt cattgcaaat acgggaatta 6360 cgaaaatcat gcttaacagc ccgacgaaaa ctacgaaaat tcaaatcaaa tttgttggtt 6420 caaacttagt caatgttgat aatcaagaag aagtaaaggc gtgtctatct ttgataattg 6480 aaaacatcat tgtcgtcttc gacaacaacc tccactcaaa agtggacgag aatatcattg 6540 gaacaacttc agtttcaatt ccgaaaacaa caacaacgaa aactacgaca acgagtacga 6600 caaccaattt gagtgatcaa gatcaaatca ttggagttat cacacaattc ggttcaactt 6660 tgttcaaaga tttaggagag acgcgaagca ttggcattgc gaacaagctg tggctaaata 6720 cttttcatga taaagagtat tcatatcttc cgtttacaaa aaacatgaca ctacacaatc 6780 ttttcgagaa tatgtatcga aacggacaaa aacgacatca tatcactcta ccgacaaaca 6840 gttgctctga tgatttccaa ttccaatcga ttatcaaact acggacgcca ataaacacta 6900 tcggtgtaaa gctgcagctt tcagaaaaga cacaagaatg gcttaacgac cacaagcgaa 6960 catatttgga tattttagaa atctataagt tgaagatacg tcagattcgc ttgtcttctt 7020 cgcttatgga aatctactta aaagacaggc tcgaattacc acgaagagcg ccttttatcg 7080 ttagctgtca aacgcatttg tacgactatc ttgatatttc ttgcgaacca ccgaacagct 7140 tctacaaagg aacttccaac aactactttc ttctattttc tatacgactc agtaaagcaa 7200 actgtgtaac tttcgaaaaa tttttggtgg aacatggaag cacgacattc aacgtcacat 7260 accaagctac accaagaaat cggaaacgtc gtcaagctct agcagctatc ggggtcccag 7320 taatcctagc tggtggaaca tcgtacatca ccaacaagtt catggagaaa aaagaggagg 7380 cagaaattga aagagttcgc caaaccatca agttatcgaa ccttcaacaa gaagatctta 7440 taaatctcgt caaatcagat ttcgataaag tagatctcga attcgaccat gtttacgaaa 7500 aatttgaaga agaaacaaca aagcaatgtc aagaacttgg aatgctttca gatcgaatgt 7560 ttctagaagt tgtaaacgat ctattcatct attaccggct tacagttatg agtttcctaa 7620 ctgatataaa cacgcaaaac agattctcga cataacaatc aaccatcatg agcctgtgca 7680 aaacatacaa cagcaatgat ttgtcaacag cttgcagtga ttattttcaa tcaaaccgcg 7740 ccaaaatcca tggagtgaat ccagttttga acagcaatgg aatcatctcg gtgaaagtta 7800 ttgtagagat catgatgccc agatttgatt ttcttggaaa aacaacgcga acgcttacaa 7860 tcggcttacc aattagctct gaaaacgatc aatattattt tgagttccca attgtaccta 7920 aaagaatatc aatgaaagat ggtaaagtca cagttatcga tgaatgcaaa acagtcggcg 7980 attcaacatt ctgcgactat tctgaaattc tccgaagttc aatttatgaa cgttgtggac 8040 gccaggtaat cacaaactca gatttgagta gctgtgagaa gaaaatatat aaaacatcta 8100 ctccttgtat ctaccaagca tcaagttcta aaatacttat atcctcattc acggatgttt 8160 atcttttgaa cgacaatcac gataaaattg ccgacgaaga accgaaaatc acacttattg 8220 aaaagtcaaa tgcaaagatg atgatgtgtg gaaccaatat gctaacgttg ccagacatca 8280 cagaaagttc ggaaatggta cttgaaaaag aatatggtag cattcaatca tcagctgaaa 8340 gttggatcag ccaaggagag tatgtaagaa tcggtaagtt tgcagagcgc tccacatatt 8400 tcaacgacga agaatacgat acgcttgcac gaaaaaccga tcaacttgtt cacgagaagc 8460 taaagccagt ctatgaaaat ttgcgtccat tatcttttta tatcacgtca atctttggta 8520 taataacaat cctgacctgc atctatctag cgataaaaat attcaaatgc gcgagagccc 8580 gctttttaag cagaacaagc cgaccagaag aagaacccat cccaatgaaa gctgcgtcaa 8640 caagtcaaca aagaagtctc caaaatcgac gtcgcgcgcg ctcatttcga aattacaatc 8700 tttgagcaaa atgttaaaaa ctcttcatca aaattaaaac gaacaattat gaaaaaattt 8760 tattagaaaa catcaaaaca aaatggatta gtaaaacttt gttgcagtct tcacaattcg 8820 acccttgaac aatcaatcat ttcagaaaat ttttacacct accgtcattc ttcgttacaa 8880 atgctggcgc tgaactcgct ggaaaaagcg tcatcgattt taaacaactg aagtcgagca 8940 agagaattcg acttgatctc tcttcgaaac cagttatttc caaatccggt gctgttccat 9000 tttgaccaat caaatggtga attcggcaaa tcgacaagac gaatagcgtt tgaattgaac 9060 atgttccagt ccaacagcgg atactctcga taaacgtgtc gaagaaactg ctttatctcg 9120 ctgtatcttg ggaaatcgcc aagcttattc gacgcgatta tcttgttctt caattcgtca 9180 gacgattcca gagacgcctt catcacttca cctttaatgg acttcacgaa aacttcttta 9240 ccaaagaaat cgtcatcaaa ccaattcgaa ttgatttcag aaaacacctt tagaaaattc 9300 aaattacaaa ttataacgct gaaagcaacc tctagggtgt ccaaaaggat ggacgtcgta 9360 tggtaaaatt gtccgtctac acatgcttct tcgacaccca aacaggcaat ataacgacgc 9420 tccgagaaac ttccatctaa atgaccaacc ggagcatcga agcacaaaaa cgaatggatg 9480 taatagtcgg attcaggagg cccattttcg tcgaaatccg cgcgcaccct atcaacctga 9540 gaagcacggc gcgtggacgg ccttatcgct acacagacct cttcttgcta gaagaaaaaa 9600 gtgaaaattt catcaaaaag taaattcgag aatgtaaaaa gaagatcaaa tttataaatc 9660 tgcaacaatt tttctgacaa tttaatttcg ttcacactta ctttttgtta atattttaag 9720 aaaaaactaa cacttaacag cactagaaac agagcgtgaa ttacttttat tcgcgtgttt 9780 cttttggaat cacatcctca atttgcaagg gggaggaagg actgatttaa tcaagtcttt 9840 gtattaataa aacataaaat acgcgagtaa ccaaaaaaat attaaaaatg aaaaaatgcc 9900 aaaaaatgaa cttcatcaca atttatcacc tttacaatat tcataacaga aacaattctg 9960 acttcagatc cgaacgtcaa ctcctcttta atataggggt tgga 10004 // ID L2B-4_AAe repbase; DNA; INV; 5213 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5213 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1409-1409 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 24 sequences with >98% CC identity. Closely related to CR1-1_AG and L2B-1_CP. XX FH Key Location/Qualifiers FT CDS 330..1736 FT /product="L2B-4_AAe_1p" FT /translation="MECGACRGGIAINGLPLFCFRCEKYFHAQCSSVENKA FT AARLITDNANVFFKCDDCLSRPSCGEQNSSVFNIGSMEEDMKKISSLSDSI FT DDIRNQLAAQIGNALKSGMEELVQNVNGTLNEAVNNIEKLINGKMENMTSS FT FFQFNADKMKLATMKSTRDASRSVHLESTSSARKKRKISNTDEIDDFDNDD FT VFEDANPFVTVVNKKARKSKSKGPKPNSPKKSETRPVIVIKPTSNQSSEDT FT RKFLYEKLDPKIHKISSFRNGKDGSVIVKCATGYNVDLVKNGMQNDLGENY FT SAVVPTSVPRLKVLGMSYKYSSDVIIDYLKGHNEDIAINDVKVLNVYENPR FT FRYKQYNAVIEVDVDTYSCLLTAKKVNVKFDRCLVVPAVSVLRCFECGEFG FT HMSTKCNNSTACSKCSGSHKTSECTSTVLKCVNCIKMNNERNMNLNINHPA FT FSKNECMAYKKLFDQKKSSLHFNK" FT CDS 1740..4547 FT /product="L2B-4_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QTSSSSNVNQLDILYFNIAGLSTNYVALRQIVEEKRP FT FLVLLAETHIVDIDAFNQYNIPGYNIAACLSHSRHTGGVAIYVKESVQFKL FT QSNEAHDGNWFLGITVVRGMKMGNYGILYHSPSSSDQRFIEILENWFETFI FT DPSKLNVLAGDFNINWSDANSNHLRRLAEYFNLKQKVTDYTRISRHNRTII FT DHVYSNFDTVNSVTCSDLKITDHETLVINIEDNSSIRDNLVRKKCWRKYSK FT QAVSNLVERSMGLHNSAGSLDHKAAVLTDILKTSTNKLVEQKFVSLENSNS FT WYSLELLRLKRKRDKLYKKFLRRNNDNNWRRYKLARNIYSQTLKKTRCEYI FT QRKIDQHQNNSKELWKILKSLLKPNNCNPRSITFDGTLEQSEQEIACNFNK FT YFVDSVSSINQSIELVAEPDEIKQPVNSRCKLECFHPITFDELRSICFSLD FT KTAGVDNVNARVVQDCFHVIGHNLLDLINESLRTGHVPQVWKESLVIPIQK FT VAGTIKAEEFRPINMLHTLEKILELVVKGQLIKYLNNNNLLIPEQSGYREG FT HSCETALNLVLAKWKEKMECKETIIAVFLDLKRAFETISRPLLLKKIERFG FT ISGSAYKWFESYLCGRTQRTVFNNSVSSPVESNLGVPQGSVLGPILFIMYI FT NDMRRVLRFCDINLFADDTVLFIAAKNLEQAVSNLNEDLRYLSRWLKFKQL FT KLNISKTKFMIISRNRNNEQVSVEIDNEAIDRVREIKYLGVIIDDKLKFAT FT HINNVIKKIAKKYGILCRLKNELTIASKILLYKSIISPHLDFCPSILFLAN FT ETQISRLQRLQNKIMRLILRCDRLTSSLFLLDALQWLSVKQRIVYLTMVFV FT FKVVNGLLPRYLCDRIERGSDIHRYNTRNADEIRTPNFLHGASQNSLYYKG FT INVFNSMPIQIKRATTLSQFKRLCISHVKAVF" XX SQ Sequence 5213 BP; 1688 A; 805 C; 1083 G; 1637 T; 0 other; tgtgatggtg aacattgtgg tttgtcttgc cgtgcggtgt taattaaatt ttaatttacg 60 tgtgaaatca gcagagtgcc gcgaagaaat tacaatctgc gtgtgcctaa aagtgtttgt 120 gatcaggata agtagaagac agtagtctct ccgttgcctg ttgtgtttaa attgaattta 180 atttccctcg ccacggctgc aagtgacggt gccataaggc aatttcgtca gcaagctgtg 240 gttgagtttg tttgtgcggt attgtttcac tgtttgtctc tctgtgtatc gttgtgcgct 300 ttaactcctc ggtagtcatt gcggggggga tggagtgcgg ggcatgcagg ggggggattg 360 cgatcaacgg gctgccattg ttttgtttcc gatgcgagaa atacttccat gcgcagtgtt 420 cctccgttga aaacaaggca gcagcaaggt tgatcacaga taatgcaaat gtttttttca 480 agtgtgatga ttgcctatct cgtccgagtt gtggtgagca gaacagtagc gtgttcaata 540 tcggatcaat ggaagaagat atgaagaaaa tttcttcact ttcggattct atcgatgata 600 ttcgaaacca actggctgct cagataggca acgcgttaaa gtccggtatg gaagaattgg 660 tacagaatgt taacggcact cttaatgagg ctgtaaataa tattgagaaa ttaattaatg 720 gcaaaatgga aaatatgaca agttcatttt ttcaatttaa tgcggataaa atgaaattag 780 ctacaatgaa aagtacgcgt gatgcatcta ggtcggttca tttggaatcg acctctagtg 840 cacggaaaaa acggaaaata agtaatacag atgaaattga tgattttgat aatgacgatg 900 tttttgaaga cgcgaatcct tttgttacag tagttaataa aaaagctcgg aaaagtaaaa 960 gtaaaggtcc taaaccaaat tcaccaaaaa agagtgagac tcgtccggtt attgtgatta 1020 aaccaacgtc caatcagagc agcgaggaca ctcgcaaatt tttgtatgaa aaacttgatc 1080 caaaaataca caagataagt agctttagaa acgggaaaga tggctccgta attgtgaagt 1140 gtgcaacagg atataatgtt gatcttgtga aaaatggcat gcaaaacgat ctgggtgaaa 1200 actacagcgc tgttgtccca acatcggtgc caagattgaa agtgttgggt atgagctata 1260 aatattcctc cgatgtcatc attgactatt taaaaggtca taatgaagat atcgcgatta 1320 atgatgtgaa agtattaaat gtttatgaaa atccacgctt ccgatacaag caatacaatg 1380 ccgtaatcga agtagatgtt gatacgtata gctgtctctt aactgccaaa aaagttaatg 1440 taaaatttga tcggtgcctt gtagttcccg cggttagtgt attaagatgc ttcgaatgtg 1500 gagaattcgg gcacatgagt acgaagtgca acaacagcac tgcgtgctct aagtgcagcg 1560 gaagtcacaa aacatcggag tgtacgtcaa ccgtattgaa atgtgtaaat tgtataaaaa 1620 tgaataatga acgtaacatg aacttgaata ttaatcatcc agctttcagt aagaatgaat 1680 gtatggcata taagaagcta tttgatcaga aaaaaagcag cttgcatttc aataaatagc 1740 aaacaagttc cagtagtaat gtgaaccaat tagacatttt gtattttaat attgctggat 1800 tatctacaaa ctacgttgca ttacgtcaga ttgtagagga aaaacgtcca tttttggtac 1860 ttcttgcgga aacacatatt gtagacattg acgcatttaa tcagtataat attccgggtt 1920 acaatattgc tgcttgtttg tcacattcaa gacacactgg cggagttgct atttatgtca 1980 aagaatcagt tcaattcaag cttcaatcga acgaggcgca tgatggtaat tggttcttag 2040 gcattacggt tgtacgtggc atgaagatgg gtaattatgg tattttgtat cattctccca 2100 gttcaagcga ccagcgtttt attgaaattt tagaaaactg gtttgagaca tttatagatc 2160 ctagtaaact taacgtactt gctggcgatt ttaatatcaa ttggagtgat gcgaattcga 2220 atcatttgag gcgtttggct gagtatttca atttgaaaca aaaagttacc gattacactc 2280 gaatttcccg gcacaataga acaattattg atcatgtcta ttctaatttc gatacagtaa 2340 actcagtcac atgttcagat ttgaaaataa ccgatcatga aactttagtt ataaatattg 2400 aagataacag tagtattcga gataatcttg taagaaaaaa gtgctggagg aaatattcaa 2460 aacaggcggt atcaaatctt gttgaaagaa gtatgggttt gcataatagt gcgggcagtt 2520 tggatcataa agcagctgtt ttaacggata tattaaaaac cagtaccaat aagttagttg 2580 aacaaaagtt tgtatctcta gaaaactcaa acagctggta cagtttagag ctacttcgtc 2640 ttaaacgcaa gagggataaa ctgtacaaaa agtttttaag gagaaataat gataataact 2700 ggagaaggta caagttagcg cgaaatatat attcacaaac gttgaaaaag acccgctgtg 2760 aatatattca gaggaaaatc gatcagcatc agaacaacag caaggagtta tggaaaattt 2820 taaagtcttt attgaaacct aataattgta atccgcgatc cataactttc gatggcacat 2880 tagaacagtc agaacaagaa attgcatgta attttaacaa atattttgtc gatagcgttt 2940 catcgatcaa ccaatctatt gagttggttg ctgaacctga cgaaatcaaa cagccggtta 3000 atagtagatg taaacttgaa tgttttcacc cgattacatt tgatgaatta agaagtatat 3060 gtttttcttt agataaaacg gctggagtag ataatgttaa tgctagagtt gtgcaagatt 3120 gctttcatgt cattggacac aatctgttgg accttataaa tgaatctttg agaaccgggc 3180 acgtgccaca ggtttggaag gaatctttag tgattcctat acaaaaggtt gctggaacga 3240 ttaaagccga agagtttcgt cccatcaaca tgttgcacac attagagaaa attttagaac 3300 ttgttgttaa aggccaactg ataaaatatt tgaataataa taacttgctg ataccagagc 3360 aatcgggata tcgagaagga cattcttgtg aaaccgcatt gaacttggtt ttagccaaat 3420 ggaaagaaaa aatggagtgt aaagaaacta tcattgctgt atttctggat ctgaaacgcg 3480 ccttcgagac gatttctagg cccttattgt tgaaaaaaat cgagcgcttt ggaatttcgg 3540 gttctgcata caaatggttc gaaagctatt tgtgtggaag aactcaacgg actgttttta 3600 ataattcggt ttccagtccc gtggaaagta atcttggtgt tccacaggga agtgtattag 3660 ggcccatttt atttattatg tacattaatg acatgagacg agttttacga ttttgtgata 3720 ttaacttgtt tgcggatgac accgtgttat tcattgcagc taaaaattta gaacaagccg 3780 tttcaaactt gaatgaagat ttgcgttatt taagtagatg gttgaagttt aaacaactga 3840 aattgaacat aagtaaaaca aaattcatga tcatttcgcg gaatcgtaat aatgaacaag 3900 tctctgttga aattgacaat gaggcaattg atcgcgttcg agaaattaaa tatcttggcg 3960 tgattattga tgataagctt aaatttgcca ctcatattaa caatgtcatc aagaaaatag 4020 ccaagaagta cggtattctt tgtcgattaa aaaacgaatt aaccattgct agtaagatat 4080 tgttgtacaa atcaatcatc tctcctcact tggatttttg cccttccatt ttgtttttgg 4140 ctaatgaaac acaaatatcg agattgcagc gcttgcaaaa taaaattatg cgcttgattt 4200 taagatgtga tagactcact tcctcacttt ttttgttaga cgctctacaa tggctgtcag 4260 tgaagcaaag aattgtttat ttgacaatgg tgttcgtttt taaagtagtc aacggtttgc 4320 tgcctcgata tttgtgtgac agaattgaaa gaggaagtga cattcataga tataacacaa 4380 gaaacgcgga cgaaatcaga acacccaatt ttctgcatgg tgcttcacaa aactcattat 4440 actacaaagg aataaatgtt ttcaattcta tgccaataca aataaagcgt gcaacgactc 4500 tgtcacagtt caagagacta tgcatttcac acgtaaaagc tgtattttga gcagccaaat 4560 ggtaattttt taataatgac taattttgta tcttgacgat gtttttacaa acagattatt 4620 tgattgcata atttatttgt tttatttcat tagggtatag acaaactatt atgttcaccc 4680 tgatgatgat gatggatttt tatattgttg acgtagcttt agcttaaaac ctattttttt 4740 gtgaagtcta caaaagtttg agtctcgcgc gcggtacaca ggtataaatg actattttat 4800 gatttgtaaa caaaattgga caattgcact gttgaaggat tctttggatt attagtaggc 4860 tcgctggttg aatgtcggaa acttcttgaa aatgcgaatg ttgtcgatag ttttgaaatc 4920 gtcttgcgga gttttcaatc ctgacaaggt tttggatttt gttggaattt catgatcact 4980 ggttatgctt atgaacatct gtggttaaga tcaatggtgt ttacaaaatt tctttgttgc 5040 tgtatagtat gaaaatttct atgttattta tatctgtaaa aaataatcag accgtttcgt 5100 tttgcaatat ctgattaaat tgatcataat aattatctta aagatatatc gtctagctca 5160 aacctttgta ggggtatgtg gcgggaccat catcatcatc atcatcatca tca 5213 // ID Gypsy-66_AA-LTR repbase; DNA; INV; 178 BP. XX AC supercont1.238; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-66_AA_; KW Gypsy-66_AA-I; Gypsy-66_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-178 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.238; Positions 1608062 1607885. XX SQ Sequence 178 BP; 55 A; 36 C; 42 G; 45 T; 0 other; tgtagtaatg acatacattg agccacccct cgacagagcg agtgaagggt gttggtaaca 60 cggtgagtta gacgacaagt gagtacatta gagtgcacat gtgtgaccgc attcggtaat 120 aaactttgat acgatcttac gccgataatc gatatctaat gatctccgaa ttaccaca 178 // ID otherMITEs_Ele11 repbase; DNA; INV; 692 BP. XX AC . XX DT 12-OCT-2010 (Rel. 15.1, Created) DT 12-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; Nonautonomous; KW otherMITEs_Ele11. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-692 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-692 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (12-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >97% identical to consensus. This consensus CC is ~98% identical to the original sequence in [1]. 8-bp TSDs. CC TIRs are ~300 bp long. XX SQ Sequence 692 BP; 219 A; 134 C; 136 G; 203 T; 0 other; tagtgtggga caaaatttag attctcgctc cagtccactt tttggattgc atttaggtcc 60 cataacaact gtgcaaaatt tcagctcgat cggagaaact atattttagc gccagccgtt 120 caaagtttgt atgggattta ctatgggaaa acttactttt gcaaagaaaa atcgccagag 180 gtcgcccatt gacctctata aaaattctga acacagacct cgataggtat ttttacgatg 240 aagaatattg ccgaagaccg cgaaacaatc cgacacttgt gaaaaaagtt attcaatgaa 300 aacctattgg caacgcgacg ttgattaaca cgtaaaggaa taacaataat aacaaaatcg 360 ggcaaaattt ccgaatagtc tatgcttaat aacttttttc acaagcatcg gattgctttg 420 cggtcttcgg caatgttgtt catcgtagaa atacctatcg agatctgtgt tcagaatttt 480 tatagaggtc aatgggcggc ctctggcgat ttttctttgc aaaagtaagt tttcccatag 540 taaatcccat acaaactttg aacggctggc gctaaaatat agtttctccg atcgagctga 600 aattttgcac agttgttatg ggacctaaat gcaatccaaa aagtggactg gagcgagaat 660 ctaaattttt catataaccg tgtcccaggc ta 692 // ID DNA-8_AAe repbase; DNA; INV; 238 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-238 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1262-1262 (2011). XX DR [2] (Consensus) XX CC ~92% identical to consensus. Present in ~3000 copies in the CC genome. CC TTAA TSD. XX SQ Sequence 238 BP; 64 A; 42 C; 53 G; 76 T; 3 other; ggtgacacgg gagaccgtgt tattttccct atctttcgtc tcactctaac aattatcatc 60 aaaactttgc ggaagcaaat ctcgagtttt agtgaaccga tgaagctgaa aattnantgg 120 gttgtgcact acatatatag aatcatagtg ataaattttt cgcatcgata tatggagtgg 180 tncttgagat ttgcttcttc aaatggatag ggtgattatg atgacgtccc gtgcggcc 238 // ID Transib-7_HM repbase; DNA; INV; 3036 BP. XX AC . XX DT 30-JAN-2008 (Rel. 13.01, Created) DT 30-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3036 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 7-7 (2008). XX DR [1] (Consensus) XX CC Transib-7_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome just a few CC million years ago (they are ~2% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of ~30 copies; it codes for a 663-aa Transib CC transposase. Like other Transib transposons, Transib-7_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 757..2697 FT /product="Transib-7_HMp" FT /note="Transib transposase." FT /translation="MKYIDIYTYILENKLQISDHGVLKDVINGMNLVNPNP FT KLETLLIHMRKRWQDLKGNRICFEKKHKEWLQSEIDVFTHVPVKQDTRGRP FT SIENFDDCSLRTKYRRTNSLQQGHSSNELLFAAKVTLKKENKLDELHAVKN FT IFDSSKKVSQQLSGNEALALLLDTNMTKDGYQLHRNVAMRKNCELYPSYHI FT IKAAKLSCYPNEIEIADYFAKISMQSLVDHTAQRLCQVQNEVIETVASDTT FT IFATLRHKAGFDGSTGQSVYKQTVSESGIRNIKIEESLFLTCIVPLDLSTF FT AGGKKTTLWRNPKPSSTLYCRPLRFQFENESKQVLISEEAQLKEEIRNLNC FT TKIILNNGKELQVKHVIEITMIDGKVHTALSEATNSSQCCSICGCSPKEMN FT NIDASIVRNYQAKGLQYGLSTLHCWIRCFECFLHIGYKLPIKKWQARTAED FT KAKVADTKSHIQKAFLHQMGLVVDQPKSGGSGTSNDGNTSRRAFTDAISFA FT NITGVNKELILKLHIILSTISSGYEINTEKFTVICTKTAKLFVKLYPWYYM FT PQSLHKLLIHGPAIINRMSLPIGMMSEEAQESTNKYFKRFRERHARKASRY FT MANEDILKRFLCSSDPVIASLRQTCTSKKKDFPVEVLELLLAGDNIVNL" XX SQ Sequence 3036 BP; 1100 A; 450 C; 481 G; 1005 T; 0 other; cacagtgggt caaggtgtgc caaaattaaa aaaaacaaat tatgttcaac gtaggtttat 60 tggtaatagt agttctccat acttatcaag cttttaaaaa atgatatttt ttagccacta 120 gcttttatat taaatttttt atggcttgtt aaagttaaat tttttattta gatttgtaag 180 tttttaaatg aaaataaata aaataaatgt gttgctacac agaaactgta aactactttt 240 taatttaaat tatttgcgct ataatttttt tatacattga tagattgatc ttttttaaaa 300 ccaaataact tcgctgcaca tttttttctt gtaagttaga aattttattt tatttttaag 360 atgatatgtt tttacaggtt attttagtgt aaatttttac ttactttatt taattaaaaa 420 atcctccatt atcctccagt ctgaaaatgc attaaaatcc ttacttacta tacatattat 480 aatttaaagt tagtattagc ttcaacaaag taccaccttg tctacaaatt aacttataga 540 tagcaaggta taacaaaaat tttgtccgcg acaaataaat cacaggattt ttaataaatt 600 aatttttaat taatcacttg gtttttttaa ttaaaataaa acacttcatc tttaaaccta 660 atatactttc atttaaatga tttctaatga gtatattttt gaatttatag gataaaatga 720 tttctaatga gtatattttt gaatttatag gataaaatga aatacattga tatatatacc 780 tacatactag aaaataaatt acaaatctct gaccatggtg ttttaaagga tgttatcaat 840 ggcatgaatt tggtaaatcc taaccctaag ctggaaacat tgctaattca catgagaaaa 900 agatggcagg acttaaaagg taaccgtatt tgttttgaaa aaaaacacaa agagtggcta 960 cagtcagaga ttgatgtttt tactcatgtt cctgtgaaac aagatacacg tggcaggcct 1020 tcaattgaga attttgatga ctgttcactt cgaacaaagt acaggcgtac caactctcta 1080 caacaagggc attcatccaa tgaactactt tttgcggcaa aagttactct taaaaaagaa 1140 aataaacttg atgagttaca tgcagtaaag aatatttttg acagtagtaa aaaagtatct 1200 caacaactat ctggcaatga ggctctagct ttacttttag atacaaatat gaccaaagat 1260 ggctatcaac tacatcgaaa tgttgctatg agaaaaaact gtgaacttta tccttcatat 1320 cacataataa aggcagcaaa attatcttgc tatccaaatg aaatagagat agctgattat 1380 tttgctaaaa tctcaatgca gagcttggta gaccacactg cacaaagact atgtcaagta 1440 cagaatgaag ttattgaaac agttgctagt gatacaacca tatttgctac attaagacac 1500 aaagcaggat ttgatggctc aacaggacaa agtgtctata agcaaacagt gtctgaaagt 1560 ggtatccgta atataaaaat agaagagtct ttgtttttaa catgcattgt accacttgac 1620 ttgagcacat ttgctggtgg aaaaaagaca actttgtggc gaaacccaaa accatcatca 1680 actctgtatt gtcgcccatt gcgttttcag tttgaaaatg agtctaaaca agtattaatc 1740 tctgaagaag ctcaactaaa agaagaaatt aggaatttaa attgcacaaa aattatatta 1800 aacaatggca aagaacttca agtaaagcat gtgattgaaa taacaatgat tgatggaaaa 1860 gtacacacag ctttatctga ggccactaac tcaagtcagt gttgttctat ctgtggatgc 1920 tcaccaaaag agatgaacaa cattgatgcc agcatagtaa gaaattacca ggcaaaaggc 1980 ttgcagtatg gtttatctac actgcattgc tggattagat gttttgaatg ttttctgcat 2040 attggttaca aattgcccat caaaaagtgg caagcaagaa cagcagaaga caaagcaaaa 2100 gttgctgaca caaagtctca cattcaaaaa gcttttcttc atcagatggg ccttgttgtt 2160 gatcaaccaa agtctggagg atctggtaca tctaatgatg gcaacacttc tcgaagagca 2220 tttactgatg ccatttcttt tgcaaatata acaggtgtca ataaggaact aattttaaaa 2280 ttgcacatta ttttgtcaac catatcaagt ggctatgaaa taaatactga aaagtttacg 2340 gtgatatgta caaaaactgc aaagttgttt gtgaagctgt acccatggta ctacatgcca 2400 caaagcctcc acaaactgct gatccacggc ccagcgataa ttaatagaat gtcacttcca 2460 attggtatga tgagtgaaga agcacaggag tcaaccaaca aatattttaa acggtttcgt 2520 gaacgtcatg ctcgaaaagc ttcaagatat atggcaaatg aggacatttt gaaaagattt 2580 ctatgctcat ctgatcctgt catagcatca ttgaggcaaa cctgcacttc aaaaaagaaa 2640 gactttcctg ttgaggtttt ggagttgtta ttagcaggag ataacattgt aaatttgtaa 2700 atacttcatt taaatatatg cagacattga tggctgcctt aaagttgtat gagttaaatg 2760 agttgataag acaagaaagt attttttcat aaaattgtat gcctatcaaa aaaattatac 2820 tgtaattttt ttatgtttat tctgtggatt ttgtttgaat tgaaaaactg catatttttt 2880 ttattttttt gctgtatttt taattattaa aaaaaaacaa taaatttttt ttttttattt 2940 agtcacaaat atattaagaa gctcaaagta tgttatttaa atcaatttca taaaaaaaaa 3000 attttggcac aaaaatgggc attttgaccc actgtg 3036 // ID Crack-4_CP repbase; DNA; INV; 4307 BP. XX AC . XX DT 14-JUL-2009 (Rel. 14.07, Created) DT 14-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Culex pipiens Crack non-LTR retrotransposon. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-4_CP. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-4307 RA Kapitonov V.V. and Jurka J.; RT "A family of Crack retrotransposons from Culex pipiens."; RL Repbase Reports 9(7), 1335-1335 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 223..975 FT /product="Crack-4_CP_1p" FT /note="ORF1." FT /translation="MSNRRSNTAKAGKESKSNGDVAQQLEEISAKLSKLDE FT LEKLNRDLLGQLLKLSHRVTEVTVENRLLRQEIEDLKQERLQEEVLIKGFK FT LQGPEHHQEVFHKVCEVIGADVAENVKEVRPLIFKQNRKTDAVLVKFWRVG FT DKHQFIQRVRALKRPITPGDLNIKSINKFVLIQDHLTPEKQRVHRLTWELC FT KLGLQKPWFYKGSTWVTHPETQKRLRVADSKDVDELQFLLVTQGKVNTNSN FT LDNQVTVSDD" FT CDS 979..3960 FT /product="Crack-4_CP_2p" FT /note="ORF2." FT /translation="MEDIEQYLDFQHNNKYFDKIDDYDREFQNFNGISILQ FT VNTRSINKIDRFDNLKTQISKLIFKPQIIVVSETWIPINLIQLYNIDGYES FT FFSCRRDGRGGGLAVFVSKKLNFSVTVNQEVFSNSNSFHFLQLIVTGSSNK FT SLIISAYYRPPDDTILTDFLQHLDNTLDSGDSAHVVVGDINIDSNSSSRKL FT DKYLNIISSFGFAITNTNLTRPSSESIIDHVLFNYHDNFSIINDTIYNPDS FT DHNFIISSIDFTLNNNPPDIRYKLHTNYTKLQQLLEMRFSPAYMGNYEDPN FT AFYDYFINTLTKSISESTTKYLIKNKTSACPWVNDNLQSLYNSSRNLKKKK FT NKLLSQGKNVDHIEIKIEALSDKIKNCSSILYQNYYTDKFSGSKSTKDIWN FT NINDVLGRKHKDKIPKTMTKLTDSGVKVTIDSQQTIANEFNSFFTSIGKSL FT ANNIPKTLNDDINKFRTLKNCQRSFNLEPTNEYEIINLIDSLSNNTSCGHD FT KISAFILKKVKLIIAPILTNILNLSMVNGFYPDQLKIARVTPIFKSGSNSL FT FNNYRPISVLSSINKIFEKILFQRLNHFLISTKFFCTQQYGFRQKSSTTNA FT VIDLVNKIQRHLNDKEDVLGLFLDLSKAFDTVDHNILLAKLERAGVRGIPL FT DLFKSYLSNRSQFVSFDGIKSNIKFIDVGVPQGSVLGPLFFIIYLNDLAFL FT SLKGKLKLFADDSSFLYNNKSASVNDKNLHDDLKTLVEYFRINKLTLNINK FT SNIINIKNSNRSIPNRLTLTKNSFPDVKVIDECKYLGIILDNRLNWSPYIN FT SLLLKLNKITGIIYKIKHKLPLSVLLLIYHSLFSSHLSYITAVWGNACNIL FT INKVQIAQNKFLRIIFNLPIRSHSVDLYTKFKNIYPVRGIYIIQSCCFIYS FT CLANQTHSNTKFEPSTHNHFTRNHDSLQRPNVSTLAGERSITFKGAQLYNF FT FSNKFGDCLSLSIFKKQLTNFLSEPAILEKLLKTYDFHI" XX SQ Sequence 4307 BP; 1376 A; 808 C; 720 G; 1403 T; 0 other; atcttctggc aacactgcta cggaaaggtc ggctttgttt acgctcccgc gatttcggtt 60 aatttattaa tcaatttcaa tttttcatta ctttttccgg taaatacgac cttggggccg 120 ttggatcgtc gctgggattg tggagtctgg tcggagctgc ttaatttgca gaggaaattg 180 actcttcccc ctctccggga ggcgagctga gttgaggtta ggatgtcgaa ccggcgctcg 240 aacactgcca aggcagggaa ggagtcgaaa tcgaacggtg acgttgccca gcaacttgaa 300 gagatctcag ctaagctatc caagctcgac gaactggaga agctgaaccg ggatctgctg 360 gggcagttgc tcaaactgtc gcacagagtg accgaggtca ccgtggaaaa tcgactcctg 420 aggcaagaga tcgaggactt gaagcaagag cggttgcagg aggaggtgct gatcaagggc 480 ttcaagctgc aaggtccgga acaccaccag gaagtctttc acaaggtctg cgaggtgatc 540 ggggctgacg tggcggagaa cgtcaaagag gttcgtccgc tgatcttcaa gcagaaccgg 600 aagacagacg cagtgctggt gaagttctgg agggttggtg acaagcacca gttcatccag 660 cgagtgcggg ctctcaagag gcccatcaca cccggcgact tgaacatcaa atcgataaac 720 aagttcgtct tgattcagga ccatctgacg ccggagaagc agcgtgtgca tcggctgact 780 tgggagctct gcaagcttgg actccagaag ccttggttct acaagggttc tacctgggtc 840 acccaccccg aaacccagaa acgactccgc gttgctgatt ccaaggacgt cgacgagctc 900 cagtttcttc tggtcacgca agggaaagtc aacacaaact ccaacctcga caatcaggta 960 acggtatcag acgactaaat ggaagatatt gagcaatatt tagactttca acataataac 1020 aagtattttg acaaaattga cgattacgac agggaatttc agaactttaa cggtatctca 1080 attttacaag tcaacacgag aagcattaac aagattgatc gatttgacaa tcttaaaact 1140 caaatctcaa aattaatttt taagcctcaa attattgttg tcagtgaaac gtggattcct 1200 attaatctaa ttcaacttta caatattgat gggtatgagt cgtttttctc gtgtagacgc 1260 gatggacggg gagggggatt agctgttttt gtctcaaaaa aacttaattt ttctgttact 1320 gtaaatcaag aggtcttttc taattctaac tcttttcatt ttttgcaatt aatagttact 1380 ggttcttcta acaaatcact cattatttct gcgtactata gacccccaga tgatacaatt 1440 ttaactgatt tcttgcaaca tttagacaat actttagatt ccggtgattc tgcacatgta 1500 gttgtgggag atataaacat tgactctaat tcctcttcaa ggaaactcga taaatatttg 1560 aatataattt catcctttgg ttttgcaata acaaatacta acttaacgcg tccatctagc 1620 gaatcaatca ttgatcacgt tttgtttaac tatcatgata atttttcaat aatcaacgac 1680 accatctata atccagacag cgatcataat tttataattt catctatcga tttcactctt 1740 aataacaatc cacctgatat tagatacaaa cttcatacta actatactaa attgcaacaa 1800 cttcttgaaa tgagattcag cccagcttac atgggtaact acgaagatcc taacgctttt 1860 tatgactatt tcattaatac cttgaccaaa tctatatcgg agagtacaac taaatactta 1920 attaaaaata aaacttctgc ttgtccatgg gtcaatgata acttacagtc tttatacaac 1980 tcatctagga atctaaagaa aaagaaaaat aagcttttat ctcaaggtaa aaatgtagat 2040 cacatcgaaa tcaaaattga agctctttct gataaaatta aaaattgttc aagcatatta 2100 tatcaaaatt actatacaga taaattttct ggttcaaaat ccactaagga tatttggaat 2160 aacattaacg atgttcttgg tcgaaaacat aaagataaaa tacctaaaac tatgactaaa 2220 ttaactgatt caggtgtgaa agtcacaatt gatagccaac aaactattgc aaacgaattt 2280 aactcttttt ttacttcaat tggcaaatca ttagcaaata acattcctaa aactttgaac 2340 gatgatatta ataaatttcg tactttgaaa aactgccaac gatctttcaa cttagagcct 2400 actaatgaat atgaaataat taatcttatt gattctctta gtaataacac tagttgtgga 2460 catgataaaa tttctgcatt tattttaaaa aaagtaaagt taattattgc cccaattctc 2520 actaacattt taaatttatc tatggttaat ggtttctatc ctgatcagtt aaaaatagct 2580 agagttacac ctatatttaa atccggatcc aattcacttt ttaataatta tcgtccaata 2640 tctgttcttt cctcaataaa taaaattttc gaaaagatcc tttttcaaag attaaatcac 2700 ttcttaatta gcacaaaatt cttttgcact caacagtatg ggtttagaca gaaatcttct 2760 acaactaacg cagtaattga cttagttaat aaaattcaac gccacttaaa tgataaggag 2820 gatgtgcttg gactgttttt ggatttatct aaagcttttg acaccgttga tcataatatt 2880 ttgttagcta aacttgaacg ggctggtgtt cgtggtattc cgctagattt gtttaaaagt 2940 tatctatcta atagatccca gtttgtaagt tttgatggta ttaaaagtaa tattaaattt 3000 attgatgtag gtgttccaca aggctcggtt cttggaccat tatttttcat aatttactta 3060 aatgatttag cattcctatc tcttaaaggt aaattgaaac tttttgctga cgattcttct 3120 tttctgtata ataacaagtc tgcttccgta aatgataaaa acttgcacga tgatctcaaa 3180 actcttgttg aatatttccg aattaacaaa ttaactttga atattaataa atcaaatata 3240 ataaatatta aaaactctaa tcgttccatt ccaaacagat tgacactaac aaaaaacagt 3300 tttcctgatg ttaaagtcat agatgaatgc aaatatcttg gaattatctt agataacaga 3360 cttaattggt caccttatat aaattcatta ttactaaagc ttaacaaaat aactggaatc 3420 atttataaaa tcaaacataa attaccttta agcgtccttt tgttaatata tcattcattg 3480 tttagctctc atctttcgta tataactgct gtttggggta acgcatgtaa cattcttatt 3540 aacaaagttc aaattgctca gaacaaattc ttaagaataa tctttaactt gccaatccgt 3600 agtcattccg ttgatttgta cactaaattc aaaaatattt atcctgtaag aggcatttac 3660 atcattcaat catgctgttt tatttattca tgtttagcaa atcaaactca tagtaataca 3720 aaatttgaac cctccactca taatcatttt acaagaaacc acgattcatt acagcgtccc 3780 aatgtttcaa ctttagctgg cgaacgaagt attacgttta aaggtgctca attgtataac 3840 tttttttcaa acaagtttgg tgattgctta agcctatcta ttttcaaaaa acagctaacc 3900 aatttcctgt ctgaaccggc tatcctagag aaacttctca aaacctacga tttccatata 3960 taataatttt tttttttctt ttttcttgct tccatttcgt aactgaatag aactgatact 4020 gttgctctgc ttgttctttt ttatccttga attatttgat cctgctttgc cttttctttc 4080 ttttattcct tttcagatga ttcagccagg agtgtaggag tccagctcaa ccgagctcct 4140 cgcgaagatg gagctctgga tccagccgtg ttatttgtat tttgtcactc atgtatttat 4200 gttttgtatt atgtgtttac aaaaagaagt aggtttttgg tgccactgct ttggtggctt 4260 ttcctgcctt aaaaaaagta aaaataaaat tcaaaaataa aattcaa 4307 // ID hAT-53_HM repbase; DNA; INV; 3752 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-53_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3752 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2041-2041 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1526..2902,2823..3329) FT /product="hAT-53_HM_1p" FT /translation="MAAFEKAIKEGKKNALVKKEKWGSLENVKKDAVQSTL FT EKYGTDKGYESFKINQKSLDQLVVNFVVSETQPLSIVDKPSFVNLVKLGLP FT KNLKVICSKTLKLRINSLYSSMVENITSTLGDIMYVATTADCWSKGKRSYI FT GVTCHWIDQKTLQRNSVSLACSRIKGRHTYIVLASALYKIHTTYKIQNKIV FT ATTTDSGSNFVKAFKSYTVSENSENEEDSNEDDDDDDNDDLYGENMSLYDI FT LDKNIHEQEESDLLVQLPQHFRCASHTLDLIAKNDVEKMVQNSPSNFKKLY FT RKAMGKCSSLWSKQNMSNRVAEKIHNTLDVYLKTPNKTRWNCTYDSLLQIK FT NIFSDSNGHNKMSNIMDFCEIQRFTNQEIQIIQEYCEVMTPLAQSLDFLQG FT EESMFMGYLLPTLYALDKKLTTLQQKNMNFCSPLVNAVKSSVQTRFSAIWE FT KKRANISIVFVTSLNLXFKLDLAQYGKKKELILASCLLPRFKLLWLDGVKR FT FSAEASLKSLFESVDHRSSSGPCEMESSSRTTTGEFKEDFFCLPVESNRVT FT SSGIEELELYLKSQSRELSLLLLYPSVLKAFLELNTPLPSSAPVERLFSTG FT SNVITEKRYKLGDLLFEKLVLLKQNKVEV*" XX SQ Sequence 3752 BP; 1368 A; 482 C; 593 G; 1291 T; 18 other; cagtgtttgg aagaacgcgt tccaaaagga acggattcct ggaacaagtt cctttattga 60 ggaacggagc taggaactgg ttccttttaa aaaaacggaa cgaggaaaga gaacgaattc 120 ttttttttaa ggaacgtgaa cgaaaattta ttattttttg ttcacgttcc tcaaaaaaaa 180 tttttttaat tgttcctcaa aatttaaacc aatagccttt cgttatagtt gattttacag 240 ttcaaaaaac ataacgcttt gtatttggct atagttttag aaatagtacg tttggccaac 300 ttatggcgaa ctttgccgcg gagttacttt aaaaagtaat tttattttac gcggctgtag 360 taagcaatat gaatccagtt attgaaattg ataatgttag tggtggcgca tgtaataatg 420 acactgaagt tagtggtaag ttttaaaaaa aatataccgt tacttttatc tattttatat 480 ttcattgact tgcaaattgt atttctaaag taagtttatt acaataaaac atgtcttgac 540 ttgcagactt aactaaagcc ataatgcgta agtctaagtc aaaatactag actttaacta 600 aaaaattaat tatacttaac actatataat gtttaccttt tcaggtgaat gtttgccctg 660 gaatggttta gagtcatttt ttatattcaa tgaaataaat ggcaacaata taaatgcaaa 720 atgtgttttg tgtggccctt caggaaaata tataagtaca gctaaaaata catcatctaa 780 tctaaaacgg catgttaagg taaataaata attataartt attcttcmaa ytattttaaa 840 attttatcaa mttatatttc aaattgaatt tatttwcaaa ttataatatt ttcattttag 900 tttatacata attattatgt actaataaat tattattatg taataataaa ttattattat 960 gtattaataw attattatta tgtaactaaa atgcatactg ttgtgcctta aaagcatatg 1020 macaattatt gcacattcaa taatttgcta cagtttccat tttttgaatt tgccgaaatt 1080 tctgatgaac tttatattta attattaaac gcaaaaggta tttgctggca gtgccagcaa 1140 ataattctga ggttagttgc cggctggata atttaatata taattttaaa ttttcttarc 1200 tctaatatgc ttgcgtagca cagtggtttt caawaaagmt gagaagtmgg aggcccgcgt 1260 tcgattcctt atagaagact tttttttttt tactgcctga agatggttta taaaaaaaaa 1320 ttaaataaag aaaatgatta tyattatata taatatattt taactgyttt tccacattag 1380 taatacactg ccggcatata cttattacat taaattgtat attatattgt acataaatac 1440 attttttttg tttaaatatt tgtcaaattt ttttattaaa aaaaataaat atttgtttca 1500 tttttagaga gttcatccat ttaaaatggc tgcttttgaa aaagctataa aagaaggtaa 1560 aaaaaatgct ttggttaaaa aagaaaaatg gggttcattg gagaatgtga agaaggatgc 1620 ggtgcaatca actcttgaga aatatggaac tgataaagga tatgagagtt ttaaaataaa 1680 tcaaaaaagt ttagatcaac ttgttgtaaa ttttgttgtt agtgaaacac aacctttatc 1740 tatagttgat aaaccatcgt ttgtaaattt ggtaaagtta ggccttccta aaaacctgaa 1800 agttatttgc agtaaaactt taaaactacg tattaatagt ctttattcat caatggttga 1860 aaatattact tcaacacttg gagatattat gtatgttgca accacagctg actgttggag 1920 caaaggtaaa agaagttata ttggggttac ttgccattgg atagatcaaa agactcttca 1980 gcgtaattca gtatctttag catgtagccg aataaaagga agacatacat atattgtttt 2040 ggcttctgct ttgtataaga ttcatacaac ctataaaata caaaacaaga tagttgctac 2100 aacaacagac agtggctcaa attttgtaaa agcttttaaa tcatatactg tttccgaaaa 2160 ttcagagaat gaggaggata gtaatgaaga tgatgatgat gatgataatg atgatttata 2220 cggtgaaaat atgagtttat atgacatttt agacaaaaay attcatgaac aagaagagag 2280 tgacttactt gttcagttac cacaacattt tagatgtgct agtcatacac tagacttaat 2340 agcaaaaaat gatgttgaga aaatggtaca gaattcacct agtaatttta agaaacttta 2400 taggaaagca atgggcaagt gttcatcgtt atggagtaaa caaaatatgt caaatagagt 2460 tgcagaaaaa attcacaata cactagatgt ttatttaaaa acwcctaata aaacaagatg 2520 gaactgtacc tatgatagct trttacaaat taaaaatatt ttttcagact ctaatggaca 2580 taacaaaatg agcaatataa tggatttctg tgaaatacaa agattcacaa atcaagaaat 2640 tcaaataatt caagaatact gtgaggtaat gactcctctt gcccaatctt tagatttttt 2700 acaaggagaa gaatcaatgt ttatggggta tttattacct acgctttacg ctttagataa 2760 aaaactaact actttgcaac aaaaaaatat gaacttttgt agtccgcttg tcaacgcagt 2820 aaaatcttcw gttcaaactc gatttagcgc aatatgggaa aaaaaaagag ctaatattag 2880 catcgtgttt gttacctcgc tttaaattat tatggcttga tggtgtcaaa cgtttctcag 2940 cggaagcctc gttaaaatct ttatttgaaa gtgtggatca taggtctagt tcaggccctt 3000 gtgaaatgga atcttcttca cgtacaacta caggtgaatt taaagaagac tttttctgtt 3060 tacctgttga aagcaacaga gttacaagtt ctggtattga agagttagaa ttatatttaa 3120 aatcacagtc aagagagtta tctcttttac tattataccc atcagtatta aaagcctttt 3180 tagaacttaa cacacctctt ccatcaagtg cacctgttga gcgactattt tcaacagggt 3240 ctaatgtaat tactgaaaaa agatataaat tgggkgattt attatttgaa aaattagttt 3300 tgttaaaaca aaataaagtt gaagtttaaa atattttttg ttttcttata ttttactgtt 3360 ttctaattac atttttctta tgcttttgaa ctgaaattaa catatatgct tatatatgcg 3420 taactatacg tgccgagccg ttacaaataa aagtaaaatt tcatgcaagc cggcaaaaaa 3480 acaagatact tataaaaaga ttttaaaata ttaataaaaa taaaccaatt ataataagtt 3540 tgcttccatt ttgttgttag tgtaattcgt aatgagtaat tcaaaaataa ttaaaaaaaa 3600 aagaacgaac ttggaacgaa ttcctttttt tgaggaattt ggaaaaagtt cctttttgag 3660 gtaaggaacg aggatctgga acgagttcat attataccaa atgaacgggg aacggaacga 3720 gatcctttta aaaaggaact ttccaaacac tg 3752 // ID BEL-612_AA-LTR repbase; DNA; INV; 549 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-612_AA_; KW Pao_Bel_Ele159; BEL-612_AA-I; BEL-612_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-549 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 549 BP; 196 A; 104 C; 98 G; 151 T; 0 other; tgttgctgcc agcactgccc cgcagtgttg atgcggctga acgatgatga tgagcagcaa 60 tgtcatccat gcaaagatat gcggtatagc aagtcgtgac aaagagaaag aaagttgaga 120 ctgttctttg ttactcttcg aggcagcacc ttgatcaacg taaattagtt aaaattgaac 180 tatcctagtt tctaaaattt gtccctatat ttacagtgaa ttcaaaaatt aaatctaact 240 taaaactagc gctatcacag ttagtttatc acagttcatt atcacagaaa gtaagtggtt 300 gaattacata aaattactaa aactaaccta aatctaaatc acaggttggt agccaaacag 360 acaataggcg aacacagaac aatttgtaac aagctcaaac cgaacaattg taagttaccc 420 tcgtatgtat tttcacaact aaactaatta aaataaacat tgcagcttaa agctgatttc 480 gcaccagaat ctagagtttg cgaactgctt aaaagagtcg gaaaagagct cccgtttgtt 540 ctaggaaca 549 // ID hAT-5B_SM repbase; DNA; INV; 2685 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-5B_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2685 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1836-1836 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 2685 BP; 979 A; 349 C; 438 G; 856 T; 63 other; cagggatggc gaacctatgg cacgcgtgcc caaagtggca cgcagctaaa ttttcgctgg 60 cacgtataat atttttcatt ttcatacata tgtaatgtac tgactaatcc aaatgcattt 120 ttgatattgg aaaaattgtg aactcaattc aaatttaatg cataaatttt gatatttaaa 180 aacaaacttc cagatttctg gcgcctttat tcataacgtt ataatataat tatattattt 240 gcgtatgtag gtataggtat gtacaagtat gaacatgtaa ttttaaatat aaaaaaaaat 300 gttaaaatga acatagtatg ctttggattt tcaaatatat acatacatat atatatgaac 360 atctgcattc tcgccaaaat tgaaattcac taattctaat tcagttcagt taaaaacttt 420 gtaagtgcga ttgaaaattg tgttttctta ttttagtatt attttttgct aataaaaaaa 480 aatctattta tatttttaga agacaattat aattgatttc aaaaggtaat tcattaacaa 540 ttttaatgat aaataaataa tatgtttcaa ctttagaatg tcagaaaata agaaaaggaa 600 aacagaggcc agctctaaca gaccatttca agattggtgg actgataaat acggaatgat 660 gaaaagcaat aacaangcnc tgtgtgcgtt gtgtngnaat acagtagtgt gcagaacatc 720 ctcagtaaaa cgncattatg aaaccaacca taaatttctt ctcgatctaa aagtgattca 780 gaacaaaaag aatatattgc acgtgacgtt aaaaataaaa atgtacaatc aaactcattc 840 actacattnn tgaaacatat cggaantcat tcaaacaccg ctgctgctag ttttgaggtt 900 tcaaaganta ttgctaaaca tgggaaaccn tttagcgatg gagaatatnt taaagaagct 960 tngctgaaat gtgcaccaca tttatttgaa gattttgaaa acaaagaaaa aattattcaa 1020 agaattaaag aattaagtat agctagaaat acggtgaaag anagaatntt gganatggcn 1080 aaaaatgtaa atnttcaaaa aataaatgat ataagtcctg cgattttatt tccatttgcc 1140 ttgacgagng taccgacgta acaggctcng ctcgtttagc aatttttgcn cgttattttg 1200 ctaaaggaga aanaatagtt gaagaattga tttcacnagt ttcattanca acaacaacaa 1260 aaggantaga catttgtaat gcagttatag angcgtttca aaaaggngaa atagatcctt 1320 cnaaaattgt gtcagttaca actgacgggg ctcccgcaat gactggacgg gaaaatggtt 1380 ttataaattt atttacagaa cgcgttggac atccaattct tnattttcat tgcattattc 1440 accaanaagt tttatgtgcg aagncnggtt tgaaagcgtt tgatgatatt ttgagtnttg 1500 taacaaaatt agtcaatttt atatcagcac gagcattaaa taaacgaaaa tttcagaagt 1560 tnttaaacga ggtaaactcg anttataatg gnttacttat gtacaataac gtacgctggt 1620 taagtcgngg aaatgttttg caaagntttg tagantgttt ggatgaaatt atnntgtttc 1680 taaataacga aaacattatt ggacaatata atgaattatt ngatgtagaa tgactggacc 1740 acaaaattaa tgtttttcac tgatttntgc taccatttaa atgaattaaa tattaaactg 1800 caaggtgtcg anaaaacaat tattgtaatg ttcgatttaa tcgaagcgtt cgaggccaaa 1860 ttacaaatct ttaancgcga tattgacanc aataatttta agtattttcc tttgtcgaaa 1920 gatcacctcc aaaaattttg ctattcgtgg agtctatgaa attaatataa atttttanag 1980 ccagagttnt aaatgacatn caatccctga ttcaggaatt ttgctgcgag atttaaacag 2040 ttcaaagaat ttgaagaaac attaaaattt attttatatc cagacacagt atcatttgga 2100 aactgctaaa atggatcaat tngagtggtt aaatttggag gatttagaaa tgcaattggt 2160 ngaatttcaa tcgagttcta tatggaagca aaaatttgtt gatttaagaa anganttaga 2220 aaataatgaa agnaatngat taaatgaana agaatacgaa aacatgaata atatgttgtt 2280 gaagacatgg cgttcgctcc cngaaacgtt taatgcgntg aagaatattg tcnnnggcta 2340 ttctancaat attttcatcg acgtatagct gtgaangtct attttcgatt atgaattnta 2400 taaaatctga tgtacgaaac agacttaccg atgaatgtgg ggaggcttgt atttcattaa 2460 aattgtcaaa ctatgatccg gatatcgatg cattatcgag aaatattcag caacaaaaat 2520 cccattaaaa aatatgtcag catattatag aaatgaatct atatataatt tttaaaagca 2580 tgtattttca tttaataaaa ataatttgta ttggcacgca tgtattttca ttttataaaa 2640 ataatattta ttggcacgca gtcaaaaaaa ggttcgccat ccctg 2685 // ID Academ-2_CS repbase; DNA; INV; 6189 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-2_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6189 BP; 1999 A; 1207 C; 1235 G; 1748 T; 0 other; taggccataa gccaaaagct gtaacgtgac acttcgttcc caaagttgtg gtactacttg 60 atttccggtt aaatttgcgt aaaattatta ttataatgac atgccaaaag ttctccattt 120 aaattcctgc agtattttat tacttttact tctaaaatta ctattaattg tggtccccat 180 tgaaatatta aaatgtagtc aacactgcgc cagcccaatc acaatcgttt attctgacgg 240 tcgtcaactg gttgttacag aagctaaatt atctacgaaa agttttcttg ttttaattta 300 tttcattttt cttttatttt caattacaaa actaaaggaa cacggcaaat tgttgctacc 360 ttaaagtctg aaccgaaaga accgatccac ttttatataa atagggcaat aaaaaatatg 420 acaaatcacg taggcttagg ctacttggtt ccgatatcag ataataataa cctaatttag 480 gattataatt ggtttaagca aatgcctttt tatatttagt caaataccta atattttaat 540 atgacaggtt tctatacatc agttatggaa acattaatat gtattttgca tactgaaagc 600 gcgggacaat caattgatga ggccagaggc gagtttctct catgttagat gggaaacatt 660 acaaaactcg tgtgaaattt ggaaacaatt ggaatgcgaa gaaagcgaaa tagcccgacg 720 gattgccccc cttttcgcta tggagtttac aaatatcccg tccaaccttg ggtatcaccc 780 aaagtgttat cgtcggttca ccgacaaaaa acgacaagaa agcagcaaca aaagaaagaa 840 gccagattcc atctttgggc attctcaaac agttgtaagg aagctccgag gtcagatacg 900 tcatttaact ctctgaacag tcgtagcaga gcagttcttc ctcctgtatg cctaatatgt 960 gaaaaatcag acaaatacat cacagtgaaa tctaaacgag ttaaagatag tctgacgcaa 1020 gcacagacaa tagacggagg tttgtagttt tatgtagtgc aagtaaatac aaatattaaa 1080 ctatctgtac atgtttatat aaaaaaggac cggtagcaat tatgtgatta acaggattat 1140 tgcggcagtc agcagaatta aaacaagacg aaaatattct tcggcatata cgtgataaag 1200 attgcgttgc tctagaagcc atgtaccacc gaacttgtta ccgagcgtac acacgattcc 1260 tttccagatc agaaaccagg taggtctgtt agaattaaga gtcttatatg actaatgggt 1320 ttaaactgct gactgttaag ctcgttttaa ccttaatttt cgacttcatt ttcagtgtta 1380 aatctgaggc acggtacgtt aaaacctatg acgtgtttaa gaaaaacgta gtgagtagaa 1440 aaaaattatt caagatcaat ccgtgtacag aatgtccaaa ctattatcga tgtttaagga 1500 cttagcgaag gagcaggagg atcttaccga cactgactat aggtagattg tatgatattt 1560 gttgcacagt ttgatttatt ttcgttataa ccatggaaac catggtagat atttcaatcg 1620 cgccatgaaa ctataactgt tctgtcgttc agggccgacc acctgaagcg tagattgagg 1680 gcagactacc cacagttgca attccatcta cctgggacac gacgtgaaag tgaaatagta 1740 ttctctgaaa gtatagcagt tgcggatttt attccctgta catcggaaac cgattctgaa 1800 gattcagaca cagaagaaat tgcttcggct acaagtgcac acagtttaaa tcaaacgcag 1860 tgcttgtatc atgcaggtaa actgagttgc atactgaata atgtttaacc ttagaacaac 1920 tttgctctaa tttatatttt tccagcaatg atgatacgaa gatgcataac agaacagccc 1980 ggaatgaatg tagtatggcc tcctatgtca acagacatca ctgtggaaga agtaaaacgc 2040 gttgttccat taaaattgtt tagctttctt acgtggtgca ttggattatc agatgatcct 2100 tcattaacat ccgaggttga atacgagaaa gaaagtaacc gtaaggtact atctatatgt 2160 caggatattg tctatttagc atcgaaaggt agaaaacaaa cgccaaaaag cttaagcctt 2220 ggtcttgccg ttcgtcatct aactggctcc tcacagttgc taaatctgct taacaagttt 2280 ggccattctg cctcgtatga taatacacta cgttgggaaa ctagtcttgc acagctccaa 2340 tcgcttaaca tggacgaact tccaaagggg ttcgacaaag aaacgcccac gatattagtt 2400 tgggacaata ttgattttag tgaagaaact atcactggag ctggtagtag agtagtacac 2460 actttacaaa tggtataatg attcaaagca aaattaccaa tacacaacca aaccacaaaa 2520 gcgttcgaat aactttacca aaatcgcgta gatcattacc acctacgcat tcgacaatag 2580 aacaatacca ttcatctaaa agacagggtc catgtgtcaa ctttaatttc gacgcaattt 2640 caacttctag caatattctt aagctatcgc aactcctgga ttttgcatat gttgctctga 2700 agcacagaga gtgccaagtg ccaagttggc gtggctacaa catgacatta accaatcaat 2760 ctattttaag taaatctgca attcactatt tgccggttat cgaggcttcg cccacggaaa 2820 tggctaccgt atttacaata ctgaaacgaa gtttagagct tgcagataaa ctagcagtgg 2880 agacaattac gttagttttt gaccaagcaa tatattcaaa agcgcaagag attcgctgga 2940 aaaacgattg catacaacgg cgcaccattg taagactggg ggaatttcac acagcaatgt 3000 cgtttttggc agttattggg aaaagattta aatctgctgg tcttcaagac atcatgattg 3060 aatcacaaat agttgctcct gggtctatca atggggtcat ttcaggccac cactacaata 3120 gaagcatacg ttcacatctt accatatacg aggctttgga acgtttgcga ttccaatctt 3180 tcttggacac attatccgat aagaaagtgg atgcatacaa taaactgtgc gactttatgg 3240 gtaggacgta cttatcccct aattttattt gtgacgtatt acagacaagc gaggttatgg 3300 ccatgcaggc tgattacaac gaattcgtga gtaccaactg ttgcgataat ccaacattca 3360 acttttggag ttcgtacatc aatattgtac aacttcttct tacctttgta cgagcaacta 3420 gagaatcgaa ttggaattta cacctagcta gtgtacgaca aatgttgaaa tggttctttt 3480 cgtatgatat gacaaactat tccaggtatc tgcctgttta ttgggtggaa atggaaaatt 3540 tgtcaacaac ccatcctgat gcctatacaa aaataactca acgaggggaa tggactgtac 3600 agcggcagca aaataccgga ttttcttcta ttgcatgtga tcaagcaata gaacaaaccg 3660 taaaccgtga ctcgaaaaca agaggaggca tcaagggcat aacgttacgc ccaggtgcgg 3720 ttcataggtg gatattggca cagccagaaa gagcagcgat attgcgtcaa tgtgaaagct 3780 tagctggtat gacacctgaa tcgcgaatat ctaaacaatt agataaatct caatctgaca 3840 ggcgggagat ttcagttttg tcagtgatgt ctacaatatc tgctatgata aatccgttta 3900 atacacaaaa tgaccacctt gtatgtctga gttccggggc tgtcgctacc gctagcatac 3960 agcatgattt gctgttagcc gaggaaaagg gtgaatctgc tgctatttca tatatcagta 4020 atagaatagc agagggaaaa gaagacatgt tttcaccaat aaaacgtctt aaattgaaaa 4080 ccttcgatga tctgcacaaa cccaaagcgt cacgacaaag ggcaaaagaa gtcattttaa 4140 agcacgacca ggctttattt tctagacttt tggttattgg ccaaacacgc aacattaaat 4200 taaaggaaat tttgacctat tcccttggca ttgtgtcata tccgttagca agcatggatg 4260 ggtcaatggc aaaaaccaac aaatctgatt tgctgcactt actcgaaagt aaatccgtcg 4320 attgccaagt tgacacaaca ttatcggaat ctgcattaat acttgacgcg atggcattca 4380 ttcacgcaat aactacagtc ccagcgacat tcggaggact ggcagagacc atattggaag 4440 caattcttaa attggccacc acgtataaat gtagacgagt cgattttgtg gttgatgtat 4500 atccccaagt aagtatcaaa aatgcagaaa gaggtaggcg agctaactcg tcagggagcc 4560 aattaataaa gatctatgga aaagagcaga aactaccaag gcagtggaag aaatttctgg 4620 caagtggaca aaacaaagag gagttgattg catttttatt cgccacatgg agcaatagtc 4680 gttccaataa gagcctatct gtcgactcgt tacaactatt cgtaacacat ggggtggaat 4740 gtcacctaat gcatatcaaa agcggatgct tagctgtatc tgatgttaag gagttgtttt 4800 gcgaccacga ggaagctgac acgcggatgt tactccatgc tgcccatgcc tcccgcacgt 4860 ataattcggt tataattaaa tcaccagaca ccgatgtggc cattatctgc gtgtctcttg 4920 ctgcaaaaat tgacgccagg atctatttgc tcaccggggc aaaaaattcg ctgcgaatgt 4980 tagacattac aaaaattacc gtggctttat catctcccgt gtccgaggca tgtattggcc 5040 tccacgcatt tacgggatgc gactccacta gcgcctttta tggaaaaggc aagaagaaag 5100 cctttgttat agctattggc cgagatgaat acatcaaggt atttgtacag ttaggtctaa 5160 atttcaaatt agacgaaagc ttgaataccg tgttggaaaa atttgtttgt gaactgtatg 5220 gatacacgct agccactggt gtaaatgacg ctcgatataa gtcattctgc tctgctaaat 5280 ccagtggtca acagatgcca ccaaccagta acgcattggg tttacactct atgcgggcaa 5340 attatcaggc tgctatttac agaagagctc tggagccatt tattaatgca ccccaaccaa 5400 atgggcatgg atgggaaatt caaaatggac aacttggtat tgtttggatg acacaacacc 5460 ctgccccatc cgacatacta cttactgtaa aatgtaagtg taaggtgggt gaatgctcga 5520 cgcagagatg ctcttgcaaa tcctcaacac tccaatgcac cgacttttgt gaatgctcga 5580 actgcgttaa tactagcaag ccagactttc actacgaaac agacaacgat tctgaggacg 5640 agtagacaac actgcattct ttatgttaca tgcgtttaaa ccatacgata taaatacaaa 5700 gcgttggggt atttcacgtg tgattttata gtcataaaaa cactatattt gaggcatatg 5760 caagtaaggg tccatttgta cgataactgc agaattcatt aaaaacatgg tatgtttatg 5820 catgtatact acgtcaccta gcggacaata taggttgtat accctgtaac tacccttaaa 5880 agcaactgtt gcatgaataa tataaaatta gtaagtatat aacgtgtagt gtagcaaaaa 5940 tgacgcttgt gaactgtggg tcacaaacaa aataaacgtt tttttacttc cacttggcct 6000 aaatatccgt tgtactaccg tgcagaaaac cccacctagc ggaaaaactg caggacattt 6060 tgtggagaac ttttagtata ttaataagta agtcaatacg aattcaaaaa tgtatacaac 6120 atgtagtgcc agagatgtgg gaaccaatct attttttagc gcttttttat aggtttggct 6180 tatggccta 6189 // ID CR1-5B_CQ repbase; DNA; INV; 2935 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-5_CQ; KW CR1-5B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2935 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 5-5 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >98% CC identity. CC The consensus of this family is ~81% identical to that of CC CR1-5_CQ. XX FH Key Location/Qualifiers FT CDS 23..2863 FT /product="CR1-5B_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNTRALEYRLACSAISYDVIAVTETWLKNNTLSSQVF FT GNGYEVFRCDREQFINSQKKDGGGVAIAVRQGLNARIVSSDEWKCVEQVWV FT AIDLADHTVYICVVYIPPDQTNNASVFEINSSSVSTVSSMARPYDELVILG FT DFNLPRLIWRPSRDGFLYADPELSTQTLHVTEFLDNYSSALLQQINQFPNE FT NNVLLDLCFVSCPDIAPPIAIAPAPLVKDVRHHPPLHLSLPVHLASEFTPI FT VCEVRYDYRKSDVPSMLELLRSIDWENTLDKDNLDDAVQTFTNIIGYVIDR FT HVPKKPVTNARVPWQTSELRRVKAAKRSALKKYSKHRTLPLHNYYVRLNSC FT YKRMSKQCHADHEVRTQKKLKSNPKGFWKYINEQRKEIGLPSSMFLGDKTA FT SNTEDICQLFAAKFSSMFTQEDLSQQQITAAANNVLPNGLALNRITIDRSQ FT ILEAATKLKSSNTAGPDGVPSIVLKKCIDGLLDPLAHLFSLSLSTGSFPKL FT WKHAYMFPVHKKGDKRNADNYRGISALCATSKLFELAVMDPVFFFCKNLIS FT DDQHGFMPSRSTTTNLLTFTTFVTDSFTAKSQTDAVYTDLSAAFDKLNHSI FT AIAKLERMGIGGVLLRWFQSYLTGRKISVKIGDWLSAVFEAFSGIAQGSHL FT GPLVFLLYYNDCIHSISVPRLSYADDMKIYCRIDNASDAQVLQQQLINFAE FT WCKINRMVVNPTKCSVISFSRKRNPIKFEYQICGTSIPRENCVKDLGVLLD FT TELTYKQHISYIVSKASRQLGFVFRTAKNFTDIYCLKALYCALVRSTLEYC FT CSVWSPYYENSVARIESVQRRFIRYALRSLPWRDPFRLPSYESRCELINLD FT TLAVRRNVCRSLLVADVLTSRVQCPAILRGLNIFAPIRTFRNSPFLRVPIR FT QTNYAMYSALTGLHRAFNFVASLFDFNLSRDVIKRKFLLFFRR" XX SQ Sequence 2935 BP; 753 A; 797 C; 666 G; 719 T; 0 other; atactaccag aacgtcggag gcatgaacac acgggctctc gagtaccgtc tcgcgtgctc 60 cgctatctct tacgacgtaa tcgcggtgac ggaaacctgg ttaaagaaca acacgctatc 120 gagtcaagtc ttcggcaacg ggtacgaggt cttcagatgt gatcgcgagc agtttatcaa 180 cagccagaag aaagacggtg ggggagtagc tatcgcagtc cggcagggac tcaacgcgcg 240 aatcgtttca tcggacgagt ggaagtgcgt tgagcaagtt tgggtcgcca tcgatcttgc 300 tgaccatacc gtctacatct gcgttgttta catccctccc gaccaaacca acaacgcaag 360 cgtatttgaa atcaactcgt cttcggtttc aacagtctct tcaatggcac gtccctacga 420 tgagctagta attctcggcg acttcaactt gcccagactg atatggcggc caagccggga 480 tggtttcctg tatgctgatc ccgaactctc aacccagacc ttgcacgtaa cagagttcct 540 agacaactat agcagcgcac ttctgcagca gatcaaccag ttcccgaacg aaaacaatgt 600 gctgttagac ctttgtttcg tcagctgccc agatattgca cccccgatag ccatcgcccc 660 ggcaccgcta gtcaaggatg tccggcacca tccaccgctg cacctatctt tgcctgttca 720 tttagccagc gaatttacgc ccattgtttg tgaagtccgc tatgactatc gcaaatccga 780 cgttcccagc atgctagaac tacttcggag cattgactgg gagaacacac ttgacaagga 840 caatcttgac gatgctgtgc agaccttcac caatattatc ggctatgtca tcgacagaca 900 cgtcccgaag aaaccagtta cgaacgcccg tgttccatgg caaaccagcg aacttaggag 960 agtcaaggct gctaagaggt cggcactgaa aaagtactca aaacatcgaa cgttaccatt 1020 gcacaactac tacgtgcgac tcaacagctg ctacaaaagg atgagcaaac agtgtcatgc 1080 cgatcacgaa gtacgtaccc aaaagaagct taagtccaat ccgaaaggat tctggaaata 1140 catcaatgag cagcggaagg aaatcgggct gccgtcgtcg atgttccttg gcgacaagac 1200 agcttcgaat acggaggaca tctgccagct gtttgccgca aaattttcga gcatgttcac 1260 acaagaagat ctgtcacaac aacaaatcac cgctgctgcc aacaatgtcc ttccaaacgg 1320 gctggccctg aaccgcatca caatcgatcg ctcgcagatt ctggaagccg caaccaaact 1380 gaagtcatcc aacactgcag gtccggacgg agttccatcg atcgtcctga aaaaatgtat 1440 cgacgggctt ctagatcctc tcgcacatct cttcagtctg tccctctcca ccggctcctt 1500 cccaaaactc tggaagcacg catatatgtt tccggtgcac aagaaagggg acaaacggaa 1560 cgcagacaac tatcgcggga tttcagcact gtgcgccact tcgaaactct tcgagctggc 1620 ggtcatggat ccggtgtttt tcttctgcaa gaaccttatc agcgacgacc agcacggttt 1680 tatgccttcg cggtcaacca cgacgaatct gctcactttt acgacattcg tgaccgatag 1740 tttcaccgcc aaatcgcaga ccgacgcggt gtataccgac ctgtcggccg catttgacaa 1800 gctgaaccat tctattgcaa tcgcaaagct ggaaagaatg ggaatcggtg gggttctcct 1860 gcgttggttt caatcctatc taaccggccg gaaaattagc gtcaagatcg gtgattggct 1920 gtccgctgtc tttgaagcct tttcgggaat tgcacaaggc agtcacttgg gaccacttgt 1980 attcctgctg tactacaacg actgtatcca ttctatctcg gttccacgcc tgtcgtacgc 2040 ggacgacatg aaaatctact gccggattga caatgcatct gacgctcaag ttctccaaca 2100 acagctgatc aacttcgcgg aatggtgcaa aattaaccga atggtcgtta atccaaccaa 2160 atgctccgtt atctcttttt ccagaaaacg caacccgatt aagttcgagt accagatttg 2220 cggcacttca attcctcgtg agaactgcgt taaggatctg ggtgtgctcc tcgacacgga 2280 gctgacatac aagcagcaca tctcctacat cgtctccaaa gcatcgcggc aacttggttt 2340 tgtcttccgt acagccaaaa actttactga catctactgt ttgaaggcac tttattgtgc 2400 gttggtccgt tccacactag aatactgctg ttctgtgtgg agtccgtact acgagaacag 2460 cgtagcgcga atcgagagtg tgcagcgaag attcatccga tatgcccttc gatctctacc 2520 ctggcgggat cccttccgac tgcccagcta cgaaagtcgc tgtgagctga tcaacctgga 2580 cacgctagca gtacgacgaa atgtgtgccg atcactccta gttgccgacg ttttgacctc 2640 gagagtgcag tgtccggcca ttctccgtgg attgaacatt ttcgccccca ttcggacttt 2700 tcggaactcc ccattcctac gtgtccctat ccgtcaaaca aattatgcaa tgtacagtgc 2760 actgacaggg ctacaccggg cgtttaattt tgtcgcttct ttgttcgatt ttaacttgtc 2820 tcgtgatgtt atcaaaagga aattcttgtt gttctttaga cgttaattta tgtattagtt 2880 tgtaaggtaa aacaccattg gggcccgagc ctgttggtgt ccaaataaat aaata 2935 // ID BEL-90_AA-LTR repbase; DNA; INV; 338 BP. XX AC supercont1.287; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-90_AA_; KW BEL-90_AA-I; BEL-90_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-338 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.287; Positions 39824 40161. XX SQ Sequence 338 BP; 124 A; 53 C; 70 G; 91 T; 0 other; tgttagcacc ccccactggg cacaacccta gtgggcttgt cgaacagtga caggtctgag 60 aaatgtcaaa agcaattgaa gtttagggat caaaaataaa agaagaagaa ctggattgat 120 gctgagagtt gctaaaattt ggaatagttg taaaatctat tggaaattgg aatttaaagt 180 gaactgaaaa taaaattaag cctaaagttt tcggatttgc tagaattcac aacgccacaa 240 taaaatattt gtcaggtcac agaaaccaca tggagaacat caataatttt tgagctgtat 300 gctacaaaaa ggtgtatttc ttctacgtgt cctgaaca 338 // ID CVE repbase; DNA; INV; 520 BP. XX AC . XX DT 04-OCT-2002 (Rel. 7.09, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE CvE: putative non-autonomous DNA transposon element from oysters DE (Crassostrea virginica) - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CvE; KW nonautonomous DNA transposon. XX NM CVE. XX OS Crassostrea virginica OC Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia; Ostreoida; OC Ostreoidea; Ostreidae; Crassostrea. XX RN [1] RP 1-520 RA Gaffney P.M., Pierce J.C., Mackinley A.G., Titchen D.A. RA and Glenn W.K.; RT "Pearl, a novel family of putative transposable elements in RT bivalve mollusks."; RL J Mol Evol 56(3), 308-316 (2003). XX DR [1] (Consensus) XX CC Uncommon, structurally similar to CvA. Modular organization CC includes CC subterminal inverted repeats (nt 23-33, 505-515), perfect CC inverted CC repeats (nt 409-416/494-501, 483-491/510-518), an imperfect CC inverted CC repeat (nt 89-97/147-155), self-complementary regions (nt 10-21, CC 494-507), CC and an (ACTG)n microsatellite region (nt 434-473). Putative CC target site CC duplication AA. Individual CvE elements contain several copies of CC a 180 nt CC core repeat unit, the first copy being truncated at the 5m end CC (nt 147-237 CC and 238-417). XX SQ Sequence 520 BP; 172 A; 108 C; 104 G; 136 T; 0 other; aagaaagagt gagcaagctc acatacccca cgctccaaca ttgcttgaga atgctgtata 60 caatgaatgg caattccaag cattagagac aggcaatcgg acaaaaaagg gggatataaa 120 ttgctcctca aaaggatcaa atcagcattt cctgtaaaaa tgcatatcta cacattatgt 180 ccttcataac tacaaagttt cacgcaattc tgttgagcgg tttcagtgga gttgcgctga 240 ccaactgttt cagtagtatc tttcattttc gtcaaatttc taagttcaaa aaggggcatt 300 gctcccagaa aaaaaatgga atcaaaatgt cctgtggata tgcacatcta cacattatgt 360 ccttcattac tacaaagttt caggaacttc tgttgagcgg tttcagagga gttgcgctaa 420 caagaaaaac aggactgact gactgactga ctgactgact gactgacgga caggtcaaaa 480 acattatacc ctccgcaact cgttgcgtgg ggtataataa 520 // ID DNA8-73_AP repbase; DNA; INV; 784 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-73_AP. XX NM DNA8-73_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-784 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2009-2009 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 784 BP; 314 A; 102 C; 94 G; 274 T; 0 other; tagggctggg attttaatgc cctaaaaaat ttcaaaaaat gcccttaaaa aaatgcaaaa 60 atgacttaaa aaactcaaaa aatgcattta aattatattt attatagctt aaaaattaga 120 aaaaaaattt taaattatac tacagtaatg tatttttttt tgtttattgt acgttgttac 180 ttgtttagtt tttttatgct tctgttttct taatcattca ttcaaaaata ataattttaa 240 aaaaaatttt aatattttaa cttgtttttg ctagttgtta ttaatactaa cctgaacaaa 300 atcgctattg cactgtataa tgatatgcct agctatgata aaattgttat cgacgataac 360 ttgtacctat gtatgtatgt atggtgtaat aaaaataata attcataata ggtataacat 420 aacggtgata attttcgtgg aaaatcctag ctatgatgaa atatatgaat atatgatact 480 atatataatt tattattcgt accgtcagca gtttgccgaa agtttcttgt gaatcgcaac 540 aattttcgtt cattttatga agaaattgtg aaaaatgctc taaaaaccaa gaaaatgcac 600 taaaaacgtc gaaaaatgtc ctaaaaaatg catttaaggt taattttgtc aaaaaatgca 660 aaaaaatgca aaataaaaaa tactttaaaa taatcagtat cacccaaaac acattttata 720 cttacggatc aacgtttcaa aacatcagaa aaaaagatgc ctttgcatca aaatcccagc 780 ccta 784 // ID BEL-39_AA-I repbase; DNA; INV; 6721 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-39_AA_; KW BEL-39_AA-LTR; BEL-39_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6721 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 869-869 (2011). XX DR [1] (Consensus) XX CC Positions [5677-6258] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1312..6654 FT /product="BEL-39_AA-I_1p" FT /translation="MTDKKIKEKVKKRERIVDTVKRHASFLGIYDPSIHRG FT EVQSRLDKIEVKFDEFEEIQEEIAELDAEGKFEEDCSKAADEFEKLYYSLR FT AALLAKIPSEDPPADLNNSIGRAGLALGAHTGVRLPQISLPEFNGDIRVWL FT SFKSTYMSLIHESGELSDVQKFHYLKSALKGEAAKLIESLAITSDNYNIAW FT DTVTKRYSNEYLLKKRHLQALMEYPKVEKESSSAIHALVDEFEQRLKILKQ FT LGEKTEHWGAMIVHWMCSKLDMKTLQLWEDHAASTKDPTFAILVNFLEKRT FT RVLEAVSSNVELQCSSQKSEVKRQKVVVHAATDSEQRSGLVCSCCGESHFL FT GRCGKFNQMTLKEKLQFVNSKRLCSNCLKSGHWVRDCNSKFSCRDCGKKHN FT SLIHPGFSPSSSGAGNSDHLAGKAEKKRNGTVATNVATNEEEAEQEIDDQE FT AVGTYNVGTKVGKISNVFLSTVVLVVRDLHGGKQLARALLDNGSQANIMSE FT RLCQMLNLKRRTANVPISGIGQSETRARFIVNTTVSSRIQDFSVGMEFLVL FT QKVTSELPSAHIPVAHWNIPKDIQLADPNFNTSNRIDLLIGGEHFYRFLYE FT REMKRIVLGPGLPMLINSVFGYIVTGKVSETHDYSVSCCLAATPGQLESDN FT LEAQLRRFWEIESSEDRPAWSKEEQDSEDHFVKTFSRTEEGRYVVRLPKHI FT NYGQMLGNSHAMALTRFRRLEKQLERNPDMRLQYNAFMQEYLNLGHMREVN FT EEELSKEMMSDNSKKVCYLPHHAVLKETSTTTKVRVVFDGSASTDSGYSLN FT DALLKGPVIQDELLSLLIRFRKHEVALVGDVEKMYRQVKVHPEDAGLQRIL FT FRFSPEEPIRVFELSTVTYGLTPSSFLAIRALHQLAADEGVAYPEAAKAIV FT HDFYVDDYIGGASSVDEAIRLQGDLDTLMKKGGFALRKWCSNRPEVLTGIP FT ADQLGTNLSISFDISPEEKVKTLGITWEPGTDQLRFIYENVESEQTWTRRT FT ILSSIAKLFDPLGLISPVVIVAKMLMQELASRKIAWDTPVPDDIRKKWTTF FT YSELDKIAELRISRFAFASNWVDIQFHCFADASTVAYGACLYVRTTDQAGN FT VRIELLASKSRVAPLQRLPIPRLELCAAKEAALLHSKVTKALSLGEVRSTF FT WSDSTIVLHWLRAAPNTWQTFVANRVSAIQTSTHPHSWRHIAGKENPADLV FT SRGMTINDFLHSQLWNEGPPWIRDDEDTWPYQVNGFNPPDDQLEIRKVVHQ FT ATVAEPSTNDLFSLRSSLPPLLRIVAYCHRFAHNCRFPNDRRRGIVLTAEE FT IQAAKMALTRLAQAERFSEEFKALQHQQPIHRKSKLNRLFPFLDSDGVLRV FT GGRLRFSTIGYAAKHPAVLPSNHPFTDLVIRYFHYQNFHSGPQLTLSEIRQ FT EFWPIHGRRAVNNVLRKCIRCFRSNPAPIQQPMGQLPAGRVRPGRPFLITG FT VDYCGPFYLKPPRRNAAPPKVYIAVFVCFSTKSMHLEMASDLSTASFLSVL FT RRFIGYRGVPAEIHSDNATNFSGARNELKALYDLLNNSISRTTISNELSQQ FT GIQWKFIPPRAPNFGGLWEAAVRSVKTALKKEIGLQQLTYDNFLTLIVQIT FT ATLNSRPLSPLSDDPTEFEALTPAHFLIGSSMKALPEPILISIPTNRLNHY FT QQTQQMFQRYWQRWSKQYLTQLQVTTRNLPTNPMQVGSIVVLREDNLPPLC FT WPLARIISLHPGSDGIVRVVTVKTATGVYKRAVNRICPLPTEDVNRRTTRA FT SGLPAISDRN" XX SQ Sequence 6721 BP; 1804 A; 1554 C; 1716 G; 1647 T; 0 other; tttttttggt gccgtgacca ggatcagcag ttggataagg ccattatact ccgtagtgta 60 ttgttctggg gaagaatagg ccggccggaa gacatctatc gggcaacgga ttcactgttg 120 tgaggttcct caacgagtgg agaactcact ctccccaacg atagcgtgga ttacttcttc 180 cttcacacgt tgtacggccc atcggccact acaaagtcac gcgacgagtt gtaattgttg 240 cagtcagaag cagtaataaa ttaaatacaa ggcctatatt gcttaggcca cgggtgagtg 300 gcattccaaa tatctctaac aatcacacag gtgcttttga gatttggttt tgtttcatgt 360 tcgctggctt ctgtgactgc atgtgacgtc gcgcgacacg gagttgcatc gtcgatttat 420 tactattgct gggaggattt ctgctacgcc cgtgctgcat acgacttcaa cagtatatct 480 ggacgtgatt cgagctattt tttatctggc atcgttcttc ggttggagat ttagatattt 540 gaaattcaag gcctattgtc gacaggcctc aggtgagtgc ctgtccaatt aaattatctc 600 ttttgggtgg tgctatatgg tacgattatt gatccattct cggtacacgt gatcgttggc 660 tacggttgga gaatcggtac gtgaacgtac ggctacacta ggcaacaaca ttttttcgca 720 tcgccacgaa accgattgga ttggactcga cggacggatt gatacctctt ctgagatcat 780 cgtagatttc atcgagctca actggtggaa gacgttacgc aacactttgc gtgcagaaag 840 gcggaggaag tgtttacgtc gggacaatta gtacaagcga aggttggcat ctacgatcga 900 tgaaggttag tggtatcgaa ccattttcta ttgctgggga gctggttata cttgaggcga 960 tttattaaaa aaaatacaag gcctgtggta gacaggtacc agtggagttt cttgcaccgt 1020 gaggaaatta gttgcaatac agtctacatt gctggatgca catttcggca ttttttggac 1080 gagcctcaat tcgaagtacg aagacgatca tctcactact tgatattgta cgggggctgg 1140 cggaagctgc atttaattaa atacaaggcc ttgttgagac aggcctcagg tgagtacctg 1200 tccaacaatc tcacttggag aattgcagag cttcttggtt ggttttcttt ctcattcttg 1260 atcgtcatcg ttcgtacttc gatttgctga gctggtagtg gttgtgggaa aatgactgat 1320 aagaaaataa aggaaaaggt caaaaagcgg gagcgcattg tcgatactgt gaaacgacat 1380 gcgtcatttc tggggattta tgatccaagt attcataggg gggaagttca gtcaaggctg 1440 gataaaatag aggtaaagtt tgacgaattt gaggaaattc aggaagaaat cgctgagctg 1500 gatgcggaag ggaagttcga ggaggattgc agcaaggccg ctgatgagtt cgagaaactg 1560 tattacagtc tgcgagcagc cttgttggcc aaaataccat ccgaggaccc accggctgat 1620 ttgaacaact cgattggaag agctggcctt gcattagggg cccacactgg agtacgtctg 1680 ccgcagattt cactgccgga gttcaacggc gatatcaggg tatggttgtc gttcaaatca 1740 acctacatgt cgctcattca cgagtccggg gagctaagcg atgttcagaa atttcactac 1800 ctcaaatctg cactcaaggg agaagcagcc aagctcatcg aatcacttgc tatcaccagc 1860 gacaactaca acatcgcttg ggatacggtt acgaagcggt actctaacga gtatcttttg 1920 aagaagaggc atctgcaggc gctcatggag tatccgaagg tggagaagga gtcatcatct 1980 gctattcatg ctctagtgga cgagtttgaa cagcgtttga agatcttgaa gcaactagga 2040 gagaagaccg agcattgggg ggcaatgata gtgcactgga tgtgttccaa gctggatatg 2100 aagactcttc agctgtggga agaccatgca gcttcaacaa aggatccgac gttcgcaatt 2160 ctagtaaact ttttagagaa gcgtacgagg gtattggaag cggtttcatc gaacgttgag 2220 ttgcaatgca gttcacaaaa gtcggaagtt aagcgtcaaa aggtggtcgt acatgctgca 2280 actgatagtg agcagcggag cggtctggta tgcagctgtt gtggagaatc gcatttcttg 2340 ggccggtgcg gaaagttcaa ccagatgacc ctgaaggaga agctacagtt cgtgaacagc 2400 aaacggctgt gcagtaattg cctgaagtct ggccattggg tacgcgattg caattcgaag 2460 ttcagctgtc gcgactgcgg aaagaagcac aattcgctga ttcatccagg attctcacca 2520 agcagcagcg gtgctggaaa tagtgatcat cttgcgggta aagcggagaa gaaacggaac 2580 ggcacggtgg caacaaacgt ggcgactaac gaggaggaag ccgaacaaga aatcgacgat 2640 caagaagcgg ttggaacata caacgtaggg accaaggttg gcaaaatctc aaacgttttc 2700 ctatctacgg tggtattagt tgttcgggac ctgcacggag ggaagcagct agcccgagct 2760 ttgcttgata atgggtctca agcaaacatc atgagtgagc ggttgtgtca aatgctgaac 2820 ttgaagcggc gcacagcgaa cgtgccgatt agtgggatcg gccaatcgga aactcgtgct 2880 agatttatag ttaacactac agtcagctct aggatccagg acttctccgt aggaatggag 2940 ttcctggtgc ttcagaaggt tacgtcggag ttgccatcag cacatatacc ggtagcgcac 3000 tggaacattc cgaaggatat tcagttggcg gatccgaact tcaacaccag caatcggatt 3060 gaccttttga ttggaggaga acacttctat cgtttcttgt acgaaaggga gatgaagaga 3120 atcgtacttg gaccaggact accgatgttg atcaactcgg tattcggcta tatagttacg 3180 gggaaagttt cggaaacaca tgactactcc gttagctgtt gcttggcagc cactccgggg 3240 caactggaat cggacaattt ggaggctcaa ttacggaggt tttgggaaat cgagagtagc 3300 gaggatcgac cagcttggtc gaaggaagaa caggacagtg aagaccattt cgtgaaaacg 3360 ttcagccgaa cagaggaagg tcggtacgtc gtacgtttac cgaaacacat taactacggt 3420 caaatgttgg gtaattctca tgcgatggcg ctaacaaggt tcaggaggct agaaaagcag 3480 ctggagagga atccggatat gcgtctacaa tacaatgcat ttatgcaaga gtatctgaat 3540 ctcggacata tgcgggaggt taacgaggag gagctctcga aggaaatgat gtcggataac 3600 tcgaagaaag tctgctacct accgcatcac gcggtgctga aggaaaccag caccacaacg 3660 aaggttcgtg ttgttttcga tggctcggcc agtacggaca gcgggtattc cctgaacgac 3720 gctctactga aaggtccggt aattcaagat gagctcctca gtcttctaat acggtttcgg 3780 aagcatgaag tggcgctagt tggcgatgtc gagaagatgt atcgacaggt aaaagtacac 3840 ccggaagacg ccggattgca gcgcatcctt ttccgatttt ctcctgagga gcccataagg 3900 gttttcgagt tatcaacggt aacttacggg ttgacacctt catcattctt ggcgatccgc 3960 gccctacacc aacttgcagc agatgaagga gttgcatacc ccgaagcggc gaaagcgata 4020 gttcacgatt tttacgtgga tgattatatc ggtggagcat ccagcgtaga cgaagctatc 4080 cggcttcaag gagaccttga caccctaatg aagaagggtg gatttgcgtt acgcaaatgg 4140 tgttccaatc ggccagaagt tctgactggc atcccagccg atcaacttgg aacaaatcta 4200 tcgatttcat tcgatataag cccggaagaa aaggtgaaaa cactgggaat cacctgggaa 4260 cccgggacag atcaactgcg attcatttac gaaaacgtag aaagcgagca aacttggaca 4320 cgaaggacaa ttctgtcttc tattgctaaa ctgttcgatc ccctgggatt aatatctcca 4380 gtggttattg tagcaaaaat gcttatgcaa gagctcgcgt cacgaaaaat agcatgggat 4440 acacctgttc ccgatgacat aaggaaaaaa tggacgacgt tctactcaga actggacaaa 4500 attgcggaac tacgaattag ccgatttgct ttcgcgtcca actgggtaga tatccagttc 4560 cattgttttg ctgatgcttc cacagtagcg tacggagctt gcctatacgt acgcacaaca 4620 gaccaagctg gaaacgtgcg aattgagcta ctcgcttcaa agtctcgtgt cgcaccgctc 4680 cagagactac caattccacg acttgaactt tgtgcagcca aagaagctgc actgctacac 4740 tcgaaggtga cgaaagcact atcgttaggc gaagttcgtt ctacgttctg gtccgatagt 4800 acgattgtgc tccactggct gcgagctgct ccaaatacct ggcagacatt cgtagccaac 4860 agagtatcag cgattcaaac ctccacgcac ccacattctt ggcgacacat agcaggaaag 4920 gagaatccag cggacctagt atcgcgtgga atgaccatca acgactttct gcatagtcag 4980 ctgtggaatg aaggacctcc atggattcgc gatgatgagg atacgtggcc ctatcaagtc 5040 aatggattca atcctcctga tgatcagttg gagattcgta aggttgttca ccaggctaca 5100 gtcgctgaac catctactaa cgatttattc agcttacgtt cttcgctacc ccctctactt 5160 cggatcgtcg cttactgcca tcggtttgcc cataattgcc ggtttccgaa cgacagaagg 5220 cgcggcatcg tattgactgc tgaagaaata caagctgcaa aaatggcact gactcgatta 5280 gcacaagctg agcggttttc cgaagaattc aaagctctcc agcatcaaca gccgattcac 5340 cgcaaatcta aactgaatag actgttcccg tttttggaca gcgatggagt cctcagagtt 5400 ggcggacgtc tacgcttctc aaccatagga tacgcagcca agcatcctgc ggtattgccg 5460 agtaatcacc cattcacgga tttagttatc cggtacttcc actatcagaa cttccacagt 5520 ggtcctcaac tcacgttgtc cgagatacgg caggaatttt ggcccataca tggccgacgt 5580 gccgtcaaca acgttctccg caaatgcatc agatgcttcc gcagcaatcc agcgcctatt 5640 cagcagccca tgggacagct acctgccggt cgcgtgcgcc caggacgccc attcctgata 5700 accggtgttg attattgcgg gccgttttac ctaaagccac cacgccgaaa tgctgcgccg 5760 ccgaaggtgt atattgcggt gtttgtctgc ttctctacaa agtcgatgca cctcgagatg 5820 gccagcgact tatccactgc ctcattcctt tctgttttgc gacggttcat aggctaccgt 5880 ggtgttccgg ctgaaataca ttcagacaat gccacaaatt tctctggagc acgcaatgag 5940 ttgaaggcgt tgtacgacct actgaacaat tcgattagcc gcaccaccat cagcaacgag 6000 ttgtctcagc aaggcatcca atggaagttt attcctccgc gcgctcccaa ctttggtgga 6060 ttgtgggagg ccgccgtacg ctccgtaaag accgccttga agaaagagat cggccttcaa 6120 caacttacct acgacaactt cctcacatta atagttcaga tcaccgctac tctgaactca 6180 agacctctat cccccctatc tgatgacccc actgaatttg aagcgcttac accagcgcat 6240 tttttgattg ggtcatcaat gaaagctctc cctgagccca tcctaatctc cattcccaca 6300 aaccgcctta accactacca gcaaacccag caaatgttcc agcgctattg gcaacggtgg 6360 agcaagcagt atctaacaca gctgcaagtg acaacgagga acctacccac aaaccctatg 6420 caagttggca gcatcgtcgt gctacgggag gacaaccttc cgcccctttg ttggccgctg 6480 gcacgaatca tcagcctgca cccgggctct gacggaatcg tgcgagtagt gacggtgaag 6540 acagcgactg gagtctacaa gcgggcagtt aatcgcatat gtccactacc taccgaagat 6600 gtgaaccgac ggaccacgag agcttcagga ttgccagcga tttcggatcg taactgaata 6660 tgaaccaaat tgttgaattc atacaattag cttattgaag ccagttcaag agggccggta 6720 a 6721 // ID Gypsy12-I_Dpse repbase; DNA; INV; 10352 BP. XX AC Unknown_group_699; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy12_Dpse; KW Gypsy12-LTR_Dpse; Gypsy12-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-10352 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1084-1084 (2009). XX DR Genome; Unknown_group_699; Positions 3871 14222. XX CC Positions [7959-8501] - Reverse transcriptase CC Positions [3937-4416] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2314..4653 FT /product="Gypsy12-I_Dpse_1p" FT /translation="MPSISMILTNLGTARYFTTLDLKSGYHQITLAERDRE FT KTSFSVNGGKYEFCRLPFGLRNAGSIFQRAIDDVLREQIGKTCYVYVDDVI FT IFSKSQDDHVKHVDWVLKSLYDANMRVAREKSNFFKTSVEYLGFIVTNEGT FT RTDPEKVKAIQEYVEPTTLFGLRSFLGLASYYRCFVKDFASIARPLSDILK FT GENGSISKYKSKKTTINLSEVQRHAFQKLRNILASENVILMYPDYKKPFDL FT TTDASAYGIGAVLSQGGRPITMISRTLKKGERHYATNERELLAIVWALGKL FT RHYIYGVRDINVFTDHQPLTFAVSEGNPNSKIKRWKARIDELGAKLFYKPG FT KENHVADALSRQNINALEDSAPPSDAATIHSEASLTYTIETTDKPVNCFRN FT QIVLEAARFPLKQMFILFGEKTRHIINFTDKNALTGVVKTSVNPTVVNAIH FT CDFPTLACIQHELVRAFPSTKFWHCKNLVTDIFNENERKEIVTIEHNRAHR FT AAQENVKQVLADYYFPKMTKLASEVVMNCRICSRAKYDRHPKKQELGPTPV FT PSFTGEMLHVDIFSTDRKYFLTCVDKFSKFAIVQPIASRTIADIKAPLLQL FT MNIFPNTRQIYCDNEPSLNSETIRSLLRDNFGAHIVNAPPLHSTSNGQVER FT FHSTLLEIARCLKIAKGIVDTVDLFLQATVEYNRSVHSVTNKKPIDILNAA FT AADFHSEVQNRVRKAQIVQNNRVNRSRQNRVFRVGDKVLLKPNRRLGNKLT FT PLYVEETVEADLGTTVLIRGKVVHKDNLR" FT CDS 7641..10103 FT /product="Gypsy12-I_Dpse_4p" FT /translation="MISLKSFVADMQGYTFLKHDADPVPTDEHALIYNVTE FT ESSFVAPPQYRKDVEQMIRNSYQMPTEVTKQSPIQLKIVPDGVIKPFHHSP FT SRLSTDEASAVKKQVEEWNEQGIVRKSSLNVASRVVVVKKKDGTLRVCVDY FT RKLNSMVLLDCFPVPIMEEVLEKLQSAKWFTIMDLENGFFHVPMEEQSKSY FT TAFVTKEGLFEFNKAPFGFKNSPAAFIRFVSYIFQELINSDIMQLYMDDII FT VYAASPEVCMRKTKLVLETAAQFGLKIKWKKCSFMQPRISFLGHIIEDGRI FT WPGKEKTAAVSRFDTPKDIKAVQAFLGLTGFFRKFIPGYAQVARPLTNLLR FT KEAVFNIGEAEQQSLQTLKNLLVIAPVLHLYSREAPTELHTDASKEGFGAV FT LLQQFDGNFHPIYYCSKKTTQAESKRHSYYLEVKAAYLALKKFRHYLLGIK FT FDLATDCAAFKQTTTKADIPREVSQWILYMEDFTFKVVHRPGDRMRHVDCL FT SRFPQTCMFVTTELTARIKKAQQDDDFIKATVEILKQHPYQDYKLKGGLLF FT KSVNGNDVLVVPKLMEREIIQGSHEVGHFSTAKTMHSVQQQYWIPHLERKV FT NKLITNCISCIIFSKKLGKQDGYLHCIDKGDTPLYTLHVDHLGPMDATAKQ FT YKYIFAVVDAFSKFVWLFPTKSTGHEEVVKRLRDWSFVFGFPKRIVSDRGA FT AFTSNAFSEFLNENKVEHVCTTTGVARGNGQIERVNRSILGIIAKLSAQES FT TSRHSRSCLGQRCTDKLKVDYWRFSTRSWLRSLTTSAKNCVTKRNKTLRRR FT KRCTSAIMIKNDDLSTATD" FT CDS 6560..7666 FT /product="Gypsy12-I_Dpse_2p" FT /translation="MEDLENAEKNTEDKIDQMIKLLSLKVLCDEQREMKIA FT PDNFTKVVSDCDGKSVPIEKWFEIFEKNADAYELTVKQKYVQARGKITGAA FT KVFLEAQCVCTYEELKNVLLDEFTCSYNSADIHKLLQERKRKSSESMHEYL FT LQMRKIAAVGHVEDAAVISHIMNGLDIRKEYKYAMYRCKTFRAMKEEFDIY FT DRLNILEKKGNNEQHKTSFKQQSGQSGKKEYCFNCGSGEHKRKDCKAETKC FT FNCNQNGHISRNCPYEADKVDKVRVILDRSRMKEIQINGIVVDCLVDTGSD FT VTIIKESMLKTMKNVKLLKCTTMLRGLGQISTKPVGYFNAEVTVDNLQTTQ FT KFLVVPSSQIDFDALLGHDFIKKFRR" XX SQ Sequence 10352 BP; 3269 A; 2331 C; 2391 G; 2361 T; 0 other; gtaccactaa ccatacagta tactgtatat gtaagttatc cgatccagcc ggatacccct 60 ccctctgcat ttggttctcg accctacagc gggtcaatga gcttaaacac cttcatatga 120 accaaataca gcacatgcct cgtcgcgtaa tagaacacga gagttgcagc gttgacggaa 180 acacttgccc agacggccca acaacaacca tggggccgtg gaacgtgata aactgccgct 240 gctcaacata acaccggcgc gtgtgccatg tcagcagatt aatgcaccgg cgactgcgca 300 gccagcgttc ggggacttaa tattaagact ttcattttta tgtaaattta gattctctta 360 gacctccaac gaggaacaat aaatctcgct tcgacgagtt aataataaag aataaatact 420 acaataatcg tcgattctac ttaacttaaa tggcgcccaa cggaatttta tccaacgcga 480 gtaattattt agtaaaaatt cgtaagctaa cgcttcgggc ttaacgaaaa taaaagaaac 540 ataaaaagtg cttacccaac tcgaggttct actactaaat tagcgagcta acgccgcaag 600 tgtaataaac aacaagtgag gtgaaataaa taaaaaaaac cacacccgga agaagctgaa 660 gggcaatccc tcgctcccgg catcaacgtc tacgaaagaa aaagcaggct gaagggcaat 720 ccctcgcctc aaaaattaaa accaacaagc tgaagggtga tccctcgctt ccctcatcgc 780 cacccacaaa ggaattaata attaatcaaa ggaaactaat aattaagtca aataaaacac 840 ggtataaacc gcagaaaacc atgtaagtga gctcttttta taaaatttta ttcaagcatt 900 ttggtaatcg cgacccaata aaaataaaat aatactgagt ggtgcagctg agatatttaa 960 aatcaaaatc aaattcatta aaaaaaaaag aaaatacaat aaaataagaa gaacattgaa 1020 aacagcagta tttccggcaa gaccgaaaga tctgcagacc gctctggccc ttgccgatcg 1080 cagcgttttg gctgcaaatt acgctagggc catggaggaa aaggcccaaa aaacgggcca 1140 gcgccaaact caagcgcaac tccagggcaa ttttaggcaa actagagctg tggactccta 1200 taaaagtcct cattacgaca aaaggcaaga cgcgtcgatc gcccagcaga tccaaccaac 1260 gatccaatag cggcaggact cgggcccagg gcccggagcc catggatgta gatccttctt 1320 cttccaaact cagactgccc actgcttacc aaaaaccttt cgggtcgggt ggacgcaatc 1380 aaaattttac gaggaaaaca aacgcgtcag agcgcacctc gagaaattgg cagcagagga 1440 tagagcacac tacaaccggg gacaaccaga ccgacacggc ctacaatggg gctgcggccg 1500 aagctgtcgc tgagatagaa gcggagggcg aattagagta cgagagcgac tcagtgaatt 1560 ttttagggga agatccctgc tgccattcgt gtggcggaaa atagcaggga ttaatatgaa 1620 gcttctcata gacacgggtg cgtcaaagaa ttacatcaag ccgtgtaaat ggcttcgcgg 1680 cgtcatccct gtcgaggcac cctttcccgt ctactcaatc aacgggacaa atgaaatttc 1740 gaagaaatgc cttctgtcaa tattcggcaa gacctcccca ttttttttgt tagaatcatt 1800 atcatcattt gacggcatca ttggccttga cctgctatcg caggtagggg cgacattgtg 1860 cttaaagagc ggtacactca gattcggtaa cacgactgag gtaatcaaat tccacaagtg 1920 catagacgtc aaccacacaa acgttattga cgtcgatgct cccccagcca taagggcgga 1980 ctttataaaa atgttaaata agaggagtaa ggcatttgcc gaccctaacg gggctctgcc 2040 ttataatact tcggtcgtgg ccactatccg cactgtggac gacgaaccaa tttactccaa 2100 gctgtacccg taccctatgg gtgtagctga ttttgtcgaa tcagaggtca agagcctttt 2160 gaaagacaaa atcattcgca aatccagatc gccatataat aatcctatat gggtggtcga 2220 caaaaaaggc actgatgaag cgggcaataa gaagaaacgt ctcgtaatgg actttcggaa 2280 gctctttcac aaacagtggc cgacaaatac ccgatgccaa gcatctcaat gatcttaaca 2340 aacttgggta ctgccagata cttcacaaca ctggacttga agtcaggata ccaccaaata 2400 accttggcgg agcgcgaccg agaaaagact tccttttccg taaatggagg gaagtacgag 2460 ttttgccgac tgccctttgg tctacgaaat gcaggtagca ttttccagag agcaatcgat 2520 gatgttttgc gcgaacaaat cggcaaaaca tgctacgtct atgtcgatga cgtaattatt 2580 ttctccaagt ctcaagacga ccatgttaaa cacgtcgact gggtgctgaa aagcctctat 2640 gacgccaaca tgagagtcgc tcgagagaag tccaattttt ttaagacgag cgtggaatac 2700 ctcggcttta ttgtgactaa cgaaggcacg aggacggacc ctgagaaggt gaaggccatc 2760 caggagtacg tcgagccaac aacgttgttc ggcctaaggt cgttcttggg cctggcaagt 2820 tactacagat gcttcgtgaa agatttcgcg tcaattgccc gaccgctttc tgatatttta 2880 aaaggggaaa atgggtccat cagcaaatac aaatcaaaaa aaaccactat aaatttgagc 2940 gaggtccaac gacacgcctt tcagaaatta aggaacattc tggcatcaga aaacgtaata 3000 ttgatgtacc cagattacaa aaaaccgttc gacctaacga ccgacgcctc tgcctatggc 3060 atcggggcgg tcctttcaca aggcggccgc ccaataacta tgatttcaag gacactaaag 3120 aaaggtgagc gtcactacgc taccaatgaa cgcgagcttt tagcaattgt gtgggccttg 3180 gggaagcttc gtcactacat ttacggagtc agggacatta atgtgttcac tgaccaccag 3240 ccactcacgt ttgcggtttc cgaaggaaac cctaactcta aaataaaacg ctggaaggca 3300 cgcattgacg agctcggtgc caagcttttc tataagccag gcaaagaaaa ccatgtcgca 3360 gacgcattgt ctcgacaaaa catcaatgca ttggaagaca gcgcgccacc atctgacgca 3420 gcgacaatac acagcgaagc gtctttgact tacacgatag aaacgacaga caaacctgtc 3480 aattgtttta ggaatcaaat cgtcctagaa gcggcgcgtt ttcctctaaa gcagatgttc 3540 attcttttcg gggaaaaaac gcgacacata ataaatttta cggacaaaaa tgcgttgact 3600 ggggtcgtga agacctcagt aaaccctacg gttgtaaacg caatccactg tgattttccg 3660 actctagcct gcattcagca tgagctagta cgcgcctttc cgtcgaccaa gttctggcat 3720 tgcaagaatc tggtcacaga cattttcaat gaaaatgaaa gaaaggagat tgtcacaatc 3780 gaacacaatc gagcacatag agctgctcag gaaaacgtaa aacaggtctt ggccgattac 3840 tatttcccaa aaatgactaa actcgcaagc gaggtagtca tgaattgcag gatttgttct 3900 agagcaaagt acgataggca tccaaaaaaa caggagctgg gaccaacgcc agtcccttcg 3960 ttcactggag aaatgcttca cgtcgacatt ttttcaacag ataggaagta cttcctaaca 4020 tgcgtggaca aattctcgaa atttgccatc gtacaaccga tagcttcccg cacgattgcg 4080 gatataaagg ctccactctt gcaacttatg aacattttcc ccaacactag acaaatatat 4140 tgcgacaatg aaccatcgct gaattcagaa accatcaggt cacttctgcg agacaatttc 4200 ggagcccaca ttgtgaatgc tcctcccctc cacagcactt caaacggcca agttgaaaga 4260 ttccatagta ctctgttgga aatcgcgaga tgcctcaaga tagcgaaggg aatcgtcgac 4320 acagtcgacc ttttccttca ggcaacagtg gagtacaaca ggtcggtaca ctcggtaacc 4380 aacaagaagc cgatcgatat tttaaacgcc gctgcagcag actttcacag cgaggtacag 4440 aacagggtac ggaaagctca gattgtgcag aacaatcgag tcaaccgatc gcgtcagaac 4500 agagttttta gagtaggtga taaggtatta ctgaaaccca acagacgcct cggaaataaa 4560 ctaaccccgt tgtacgtgga agaaacggta gaagcggact tggggacaac ggtcctaatt 4620 agggggaagg ttgtgcacaa agacaatctt aggtaatgcc cctatgccca tacgtgagcg 4680 ttccactcag ccgactattt tttacgctcc atttacattc atacttagag actcgatcac 4740 agattaccaa gcaacctttt ctttcattcc ttaggttgtt tcacatgata atccttctgg 4800 tggccttggt aaacgctcgc ataacagact actcacattc tgactacgta ccgatattgg 4860 acggcgacat actcgtatgg gacgaaataa attaccttag gcattcgaca aacttaacgg 4920 attacgaacg aatggcggac gagaccgcaa atttgaccga gatgtttcca cagtcgcaca 4980 tgcgaaaact actagtggtg gataccgacc atatccgaaa tatgttagcc acgattagcg 5040 tgcaccacag agttgctagg agcctaaaca tcctgggctc ggtcttaaag gtggtagccg 5100 gaactccgga cgcggacgac ttagaaaaga tacgaataaa cgaggcccaa catattgaat 5160 cgaataacag gcagataagc ataaattcta agtctcagga gcaaatcaac cgcttaacag 5220 actcagtcaa taaattgcta gaggcagcaa agggaaagca aattgactcg gctcacctgt 5280 acgagacctt gctggctagg aacagaatgt tggcatccga ccacgacaac attctgcgaa 5340 acgtcgacaa aggactcctg tgcccaagga ttatattctg gtggcgtggc ccactgccag 5400 tcgcaaccca gccacctcag cgccatcaca ctggtcgacg acggcatcat tatcatcaac 5460 gatcacccag cagcagtaag tttcgacggc agcgcggcgc tcaacatttc agggacgcat 5520 ctaatcacat tcaacgatta tgccgtcata aatggttcac ggtatcagaa tcgcaaaaac 5580 gttcagagcc gatacccagg agttgcatct tcacccttgc taaatgttac cgagcataaa 5640 agggtgctaa gcctgccatt cctccaccag ttgagcgagg agaacctcaa cttcatcaag 5700 gagataaagg aagaggtgtc ctcacgttcc agaccaatct ttgcattctg cctaggacta 5760 ggtatttgtg gcctagtatg tggcatggcc atgctcagac tatatttgac gaagaagcgc 5820 gacgcccggc aaatcaatgg attaatggca agactaagtg cgccggggac ggcgactgct 5880 caaggggggg agtagttatc cgatccagcc ggatacccct ccctctgcat ttggttctcg 5940 accctacagc gggtcaatga gcttaaacac cttcatatga accaaataca gcacatgcct 6000 cgtcgcataa tagaacacga gagttgcagc gttgacggag acacttgccc agacagccca 6060 acaacaacca tggggccgtg gaacgtggga aactgccgct gctcaacata acaccggcgc 6120 gtgtgccatg tcagcagatt aatgcaccgg cgactgcgca gccggcgttc ggggacttaa 6180 tattaagact ttcattttta tgtaaattta gattctctta gacctccaac gaggaacaat 6240 aaatctcgct tcgacgagta aataataaag aataaatact acaataaacg tcgattctac 6300 ttaacttaac ttgtatgtag tatacttact gtataccttg tatatatgta gtatactgta 6360 tggttagtgg cagtacatac ataagcagaa aaaacacgca cagacacgtt gtgtgtgata 6420 cacaaaaaga caagcactcg tcgggtacac agacacaagt cgaaaagagc cgcaaaatga 6480 tgcaaacacc gccgccgttg cattggaaag agagagagaa ggtgctgtac caggaacgtc 6540 gcgcccgagt gcaaacgaga tggaagattt agaaaatgcg gaaaagaata ctgaggacaa 6600 gattgatcaa atgataaagt tgctaagttt gaaagtgctt tgcgatgagc aacgagaaat 6660 gaagatcgct ccagataatt ttacaaaagt tgtgagcgat tgtgatggaa aatcggttcc 6720 catagaaaaa tggtttgaga ttttcgaaaa aaatgccgac gcatatgagc ttactgtaaa 6780 gcaaaagtat gtgcaggcca gaggaaaaat taccggtgca gctaaggttt tccttgaagc 6840 acaatgcgtg tgcacctatg aggaactcaa gaacgtatta ttggatgaat tcacatgtag 6900 ctataacagt gcagacattc ataagctgct gcaagaaaga aagaggaaga gtagtgagtc 6960 gatgcacgag tacttgctgc agatgagaaa aatagcagca gtcggacatg ttgaagacgc 7020 ggctgttata agccatatta tgaacggtct ggacataaga aaagagtata agtatgccat 7080 gtatcgctgt aagaccttca gggccatgaa agaagagttc gatatctatg atcgtttgaa 7140 catattggag aagaagggca ataacgaaca gcataagacg agtttcaagc agcaaagtgg 7200 gcaaagcggt aaaaaagaat attgctttaa ctgtggatca ggagaacaca aacgcaaaga 7260 ctgcaaagcc gaaacgaagt gttttaactg caatcagaat ggtcacattt cgcgtaactg 7320 tccttatgaa gctgataaag ttgataaagt tcgcgtcatt ttggatcgca gtcgcatgaa 7380 agaaattcaa ataaacggca ttgtagttga ctgtcttgtg gacacagggt cagatgtaac 7440 cataatcaaa gagagtatgt tgaagaccat gaaaaacgtt aaacttttga agtgcacaac 7500 tatgttgcgc ggtctgggtc agatatcaac aaagcctgtt ggatacttta atgcagaagt 7560 taccgtggat aacttgcaga ccacacagaa gtttttagta gtccccagta gccagattga 7620 tttcgacgca cttttggggc atgatttcat taaaaagttt cgtcgctgat atgcaaggat 7680 acacgttctt gaagcatgac gcagatcctg tgccaacaga tgagcatgct cttatatata 7740 atgtaactga agagtcatcc tttgtagctc caccgcaata tcgaaaagac gttgaacaga 7800 tgataagaaa cagttaccaa atgcccacag aagttacaaa gcagtctcca attcaactga 7860 agatagttcc cgacggagta atcaagccgt ttcatcactc accgagtcgt ttatcgacag 7920 acgaagccag tgccgtaaag aagcaagtcg aagaatggaa cgagcaagga atcgtacgca 7980 agtcgtcttt aaatgtagcg agcagagttg tcgtcgtgaa gaaaaaagat ggtacactta 8040 gagtttgcgt agactacagg aagttaaaca gcatggtatt gctggactgc ttcccagtac 8100 cgatcatgga ggaagtcttg gaaaagctgc agtcggctaa gtggtttacc ataatggacc 8160 ttgaaaacgg atttttccat gttcctatgg aagagcaaag caagtcgtat acggccttcg 8220 tcacaaagga gggattattc gagtttaata aagcgccttt cggattcaag aactcgccag 8280 ctgcgttcat tcggttcgta agctatattt ttcaagaatt aatcaactct gacattatgc 8340 agctttatat ggacgacata attgtttacg ccgcgtcgcc cgaggtatgc atgaggaaga 8400 cgaagttggt tctggagaca gctgcgcagt ttggcctaaa gatcaagtgg aagaaatgta 8460 gcttcatgca gccacgcatt agctttctgg gccatatcat tgaggacggg agaatctggc 8520 ccggcaaaga gaagacagca gctgtcagtc ggtttgatac gcccaaagac attaaagcag 8580 ttcaagcatt tctgggactc acaggctttt tcagaaagtt catccctggc tatgcacaag 8640 ttgcccggcc gctcacgaat ctactcagga aggaagcagt tttcaacatt ggcgaagcag 8700 agcaacagtc gctacaaacc ttaaagaatc tgcttgtaat cgcacctgta ttacatttat 8760 actcacgaga agcgccaacg gaactccaca cggatgcatc caaagaaggt ttcggagcag 8820 ttttgttgca gcagtttgat ggcaatttcc acccgattta ctattgtagt aaaaagacta 8880 cacaagctga gtccaaacgt cacagttact atttggaggt caaagctgca tacctcgcac 8940 tcaagaagtt tcgtcactac ctgttgggga tcaaatttga tcttgccact gactgtgcgg 9000 cgtttaagca gacaacaacg aaagcagata tacccagaga agtttcccag tggattctgt 9060 atatggagga ctttacattc aaagtagtac accgcccagg ggacagaatg agacacgtgg 9120 actgtctcag tcgttttccg cagacatgca tgttcgtgac gacggagcta acagctcgga 9180 taaagaaagc acagcaggat gacgatttca tcaaagccac agttgagatc ctgaagcagc 9240 acccatatca agactacaag ctaaaaggag gtcttctctt caaatctgta aatggcaatg 9300 acgtattggt ggttccaaag ctgatggaaa gggaaatcat tcaaggatcg catgaagttg 9360 gtcatttctc tacagcgaag acaatgcact cagttcagca gcaatactgg attccgcacc 9420 ttgaacggaa agtaaataag ttgattacta attgtatcag ttgtataatt ttcagcaaaa 9480 agttgggaaa acaagacggc tatcttcatt gcattgacaa gggtgacaca ccactataca 9540 cgctgcacgt tgatcatttg ggaccaatgg atgcgacagc caagcaatac aagtacattt 9600 ttgctgtggt agatgcattc tccaagtttg tgtggctgtt cccaactaag tcaacaggtc 9660 acgaagaagt tgtgaagagg ctgagagatt ggtcatttgt gtttggtttt cccaagcgca 9720 tagtcagcga cagaggagca gcgtttacat ccaacgcatt cagcgagttt ctcaacgaga 9780 acaaagtcga acatgtgtgt acaacgacag gtgtggcgag aggcaacggt cagattgagc 9840 gggtgaaccg ttcaattttg ggtatcatcg caaagctctc agctcaggag tcaacgagtc 9900 gccattcgag gtcatgtttg ggacaaagat gcacagacaa gctgaaagtc gactattgga 9960 ggttctcaac gaggagttgg ttacgcagtt taacaacgag cgccaagaac tgcgtgacca 10020 agcgaaacaa aacattgcga aggcgcaaga ggtgtacaag cgcaattatg ataaaaaacg 10080 acgacctgag cacggctaca gactaggaga catggtcgcg atcaagagga cgcagttcgt 10140 cgcaggacgc aagttggcca gcgagtatct aggcccatat gaagtcacca aggtaaagcg 10200 gaacggtcgc tacgacgtca gaaaggtttc tcaagtcgag gggccaaaca ccaccgcaac 10260 gagcagtgac aatatgaagc tatggcgttt tttctcagag aacgaggata tattatcatc 10320 tgggacagat gaagatgagc aggagggccg aa 10352 // ID Gypsy-119_AA-LTR repbase; DNA; INV; 1107 BP. XX AC supercont1.245; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-119_AA_; KW Gypsy-119_AA-I; Gypsy-119_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1107 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.245; Positions 1156443 1157549. XX SQ Sequence 1107 BP; 319 A; 259 C; 198 G; 331 T; 0 other; tgtaacctct caaaatggtt acatttgtct ttttgttaat attccctgta aatagttttt 60 ctttgtacat tattattatt attattatta ttattattct tattattact cttattatta 120 tccctatcat cattattatt tttcttatta ctacccttat tactattatc atttgtatta 180 aaattatggc tgctaaagat ataatgaatt taaaattgaa atatttaaaa tagaatataa 240 actgaaattg aatagaaaga aatatcgaat ttatccccat acctgaacat ccccatcctt 300 attcccattc tactaccatc cctattccca ttctactacg acacaccacc gctaccagcg 360 acaactgtca cgacgttggt agcaaatcga aagcagcgag gaaataaaac cgtttgtcaa 420 cagacgaacg gtctctaata gtgaactgaa aagccccgat gtgccccatt cattgttctt 480 ggtccggtcg gctagtgtcc agtagtggaa tccagaaagc ttgtgagaat ccgtgcgccg 540 ctagttcgga acaagttcat ccggcacgaa ggaccaccca gtgtgagcgg ccgcgagaat 600 agtttggtcc ggccagcgaa gtggttaacg gcctaccagc tgaagaatcc gtcacgtccc 660 ctgagtgcga agaactccac tgccactgaa gacaccattt tcatccgtcc aggacaaagc 720 taacaccgcg gtggatggtg ccgggaccag cagcgccttg gtagacggtt caggccccat 780 aagaagctgt gtggtaatgt aaatcctatt ttccttcctt ctttgtaaga taaaatgccc 840 ctaattaaaa cgaatcttat aatcaaattg agaacttttg aatattgcta gcaaagccct 900 gtatttcctc catctcctcc agatttcggt tctgtccacc tcgacaacgt aaattttgac 960 gtaggactac gtttgagttt ttttaaaggg aagtttttcc gcccatctcc ccaatttcct 1020 cttggagcaa tttgaaacga ccctgaggcc caggacaaaa gcccggttgg ggacaataga 1080 ggttcccaac ctaaagaatc tctaaca 1107 // ID BEL-94_AA-I repbase; DNA; INV; 6339 BP. XX AC supercont1.289; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-94_AA_; KW BEL-94_AA-LTR; BEL-94_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6339 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.289; Positions 918997 925335. XX CC Positions [5345-5920] - Integrase core CC 'AATAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1349..6339 FT /product="BEL-94_AA-I_1p" FT /translation="MSESYQQNQAIPHQDPVRPTQQQLAARQSLAKELPRF FT SGDPAEWPIFISNYRYTTEACGFTDGENMLRLQRCLSGPALETVRSRLVLP FT AAVPQVIETLRLRFGRPELLINALLRKVREIPAPKSDKLEGIIDFGMAVQA FT LCDHIEAANELAHLSNPSLLQELVAKLPADQRMMWAGYKRGFQHVDLKTFG FT DYMASVVRDATSVVTFEPEKRNCGRERPKNRGFVNSHATEGSSRSVDSSSE FT PEQPKQFVCAHCHKGGHRVRECNAFKQLHVDDRWRRVRTLGLCQNCLYNHG FT RRGCRGRKICDIDGCQFRHHTLLHSPRGPPKSSEHTMQVAENHTHRRPKSS FT MLFRIIPVTLHGTNGSINTFAFLDEGSDLTMVENELVASLGVKGTPLPLCL FT RWTGNTSREEKDSQQVTIEINGIGQDKRHKLMNARTVSSLGLPRQSFDADE FT AVKAHKHLKGIPLTSYRDAMPQILIGVDNLRLALPLKVREGDGTAPVAVKT FT RLGWCVYGPRGSSNRESYSFLICECSCDKTLHESVKDFFAIEGAGAGPAEI FT PLKKEDERALTIMETTTRRVGDRFETGLIWKEDEVEFPNSYPMAVRRLECL FT ERRMDRDPALKENLHRQIREYDAKGYAHKATREELETSDARRVWYLPVGAV FT MNPKKPGKVRVIWDAAAKVEGVSLNTTLLKGPDQLSSLPAILFRFRLYRVA FT VSSDIQEMFHQIRIREEDKNSQRFLWREKPSEKPSVYLMDVATFGSTCSPA FT SAQFVKNLNAEQHRELYPEAAKAIVDDHYVDDYLTSFSTEDEAARIASDVR FT TIHGNGGFKLHNWRSNSSAVLERLGEVQAEAEKQLALVDGAKTERVLGMLW FT TPRTDELSFSTQMSEEVQDLMRTATRPTKRQVLRCVMTLFDPLGLLSPFLI FT HGKVLIQDLWREGTEWDEQVNTEVFAKWQRWTQMIEYIAEIRIPRCYFRHA FT EQTTYRNSELHVFVDASEVAYSCAVYLRTLNHGADPQCCLIAAKSKVAPLK FT PWSIPRLELKGCVLGVRWSKFVSENHGVHVSKMVFWTDSRTALAWIKADPR FT NYRQFVSFRVGEILESTTASDWRWVPSKSNPADEATKWGSGPYFDHDSKWF FT QGPDFLRRPEAEWPRPKEVLTATTEEIRATILFHYSFDPVVDFDRFSSWER FT LLRTIAYALRFLNNSAKAKPKRTGQLQQDELRAAEVTIFRLVQRESYSDEI FT AELSNKTANKPSQDVIGKHSSIYRLIPFLDDNGLLRERGRISAVEGVCYSV FT RHPIILPRKHRVTELLVRRYHQQYRHGNSETVVNEIRQLYNIPRLRLVVRK FT VSRDCNVCKIRRAKPTIPLMAPLPPARLAHHERAFTYTGVDYFGPLLVKLG FT RSNVKRWIALFTCLTVRAVHLEIAYTLSTESCISCVRRFVGRRGPPAEFFS FT DNGTNFQGADRVLRHQIGQGLSETFTGANTKWNFIPPGAPHMGGAWERLVQ FT SVKAAMMEAYSEGKLDDEGLQTLVVETESIVNSRPLTYLPLDSEEAEALTP FT NHFLLLSSNGAKRRSSAAVDIVRCKTNELVRREILGKSWELIERQLDAFWK FT RWLTEYLPVIRRQPKWFEENRTLKAGDLVMVAESTGRNKWERGRIIRTIPG FT PDGRHRQAVIQMGNKALLRPVSRLALLDLKGFREVPENSGLHPGET" XX SQ Sequence 6339 BP; 1675 A; 1676 C; 1721 G; 1267 T; 0 other; aatcttcaaa attcgagcca acatgcatgg aagccaggca ccgggaggat cgaaaggaca 60 cgtgtgcaaa tccagctgcg cggcctgcca gcgcccggac agtgcggagg acatggtcca 120 gtgtgaccat tgcgacatct ggaaacatta ttcgtgcgcc ggggtgaacg aaagtgtcgc 180 tgggagatcg tttgtatgtg gcgattgttc tgtccagcaa actgatgacg tcatctcagt 240 gcgaacaact tcgagtcgat ccagcgcaac atcaacaatc tctgtctgcc taagccagct 300 ggtagaacga caacgattag agcgcgctcg cctggaacta gagctccaac gtcgtcatct 360 ggacgagcag cagcagttaa tcgataaggc catcggcgga gaggaaatgg acgatcgtcg 420 cggtgaaggt tcatctgaag ttgtaaacaa aaccacccca gcggcaccca gcaaacctat 480 cggaacaacg tttcttcccg aaacaccgct gggtccgtcg gatcgtcgat cgaaggcaca 540 gttggacgct cccaagtctg cagcgtcagt aatggttttg attccagtca cagcactgga 600 accaaagact gtgaaatatc ttcaaggaaa gaagaatccc caagccgcct tccaagatct 660 caggagccgg gatgagcagt gcgagaagcg cgccaatcca acagctgacc agctggctga 720 cctacattct cagctggaac tgtgtcgcaa actcctggag gggatccaga ccggagcagc 780 agttaaggaa tcgattcaac agccaaattt ccaaaaacag cagcgagtat gtcacgcgta 840 gtgagcaata ccggtgcaat cccgaaggca aaccgatcgt tgcttacggt ccccgaggaa 900 gatcggaagc gttccgatta tcaatccgac gacggagcag taggaggaac cacatcgccg 960 gaagaaatgt ggccgttgaa gaatccggta agccaaacgg gtcaaccgcc ggtaagtcga 1020 gcagtcgaat cagctgcaat gtgtccgaac gacttggaac ccccaggtaa gcgtcaagcc 1080 ttgagtcact ccattgaaaa tacgatcata tgtgcccctg atcattgccc cacgaccaat 1140 aatccacttg caagagggca gctgcgattt gtcgcgcgcc cagatgaaat ccgatcgcct 1200 ccaaccaacc ttcgagagca gctgcgacag ccactcgcgc actcgaacga tgccattgta 1260 tctgacaacc agcctaagtc tctctcaacc gatttgctaa cgcgtaatcc tccctcgcga 1320 gcggttgaaa cgtctcagcc cctctcgtat gtcagaaagc tatcagcaaa atcaagcgat 1380 tccccatcaa gacccagttc ggcccacgca gcagcagctg gcagcaaggc agtccctggc 1440 caaggaactt cctcgattca gcggtgaccc agcggaatgg ccgatcttca tttcgaatta 1500 tcgctacacg acagaagctt gcggcttcac tgacggtgaa aacatgctcc gcctccagcg 1560 gtgcttatca ggaccagcgc tcgaaacggt gcgcagtcgg ttggtgcttc cagcagcggt 1620 gccgcaggta attgaaacac ttcgtttgcg gtttggacgt ccggagttgc tgatcaacgc 1680 tctgctgcgc aaggtacgtg aaattccagc ccccaagtca gacaagctgg aaggaatcat 1740 cgatttcggg atggcggtgc aggcattgtg tgaccacatt gaagctgcga acgaactcgc 1800 tcacctttcg aacccgtcac tgcttcaaga gcttgtagca aaactccctg ctgatcaacg 1860 gatgatgtgg gccggctaca agcgaggatt tcaacacgtc gatctgaaaa ccttcggcga 1920 ctatatggcg tcggtagttc gcgatgcgac cagtgtagtg accttcgagc cggagaagcg 1980 gaactgtgga cgtgaacgtc cgaagaacag aggattcgtc aactctcatg ccacggaagg 2040 atccagtaga tcagtggact cgtcgagtga accggagcaa ccgaaacaat tcgtttgcgc 2100 tcattgccac aaaggtggac accgagtacg cgaatgcaac gcgttcaaac aactgcacgt 2160 cgacgatcga tggcgccgag ttcgaacttt ggggctatgc cagaactgtc tatacaacca 2220 cggacgacga ggatgccgag gccgtaaaat ctgcgacatc gatggctgcc agtttcgcca 2280 ccatacgctt ctccactccc ccagaggtcc acctaagtca tccgaacaca ccatgcaggt 2340 ggcagaaaac catacccatc gccgtccaaa gtcttcgatg ctctttcgaa tcattccagt 2400 gaccctgcac ggaactaacg gttcaatcaa tactttcgcc tttctcgacg aaggttccga 2460 cctgacaatg gtggaaaacg agttggttgc atcgctcggg gtgaagggta ctccactccc 2520 actatgcctt cggtggactg gcaatacatc gcgcgaggag aaagattccc agcaggttac 2580 cattgaaatc aatgggattg gtcaagacaa gcgacataag ctcatgaatg ctagaaccgt 2640 cagcagcctt ggacttccac gacaaagttt cgatgcggat gaagcagtaa aggcccacaa 2700 acatctgaag ggcattcccc tcacaagcta ccgcgatgcg atgccacaga tcctcatcgg 2760 cgtagacaat ttacgacttg cacttccgct gaaagtgcga gagggtgatg gtacagctcc 2820 ggtggcagta aagaccagac ttggttggtg cgtctacggc cctcgaggta gcagtaaccg 2880 cgaatcctac agcttcctta tttgcgaatg ctcgtgtgac aaaacgctcc atgagtcggt 2940 gaaagatttc ttcgccattg aaggagcagg agcaggaccg gcggagatcc cgcttaagaa 3000 ggaagatgag cgagcattaa ccatcatgga gactactact cggcgagttg gggaccgctt 3060 cgaaacagga ctgatttgga aagaagacga ggtggagttc ccgaacagct atcccatggc 3120 agtacgtcga ctagaatgct tggaacgcag aatggatcgt gatcctgcac ttaaggagaa 3180 tctgcatcgg caaatacggg aatacgacgc gaagggatat gcgcacaaag ctacgagaga 3240 agaattggaa acatcggatg caaggcgcgt gtggtacctg ccggtcggtg cggtgatgaa 3300 tcccaagaag ccgggcaagg tccgagtcat ctgggacgca gcggccaaag tcgagggtgt 3360 ttcgctgaac accacactac tcaaagggcc agaccaacta tcttctcttc cggcgattct 3420 gtttcgattc cgcctgtata gggtggcagt tagctcggat atccaggaga tgttccacca 3480 aatccgaatc cgagaggaag acaagaactc ccagcgtttc ctatggcgcg aaaagccgtc 3540 cgagaaaccg tcagtttact tgatggacgt tgccactttc ggcagcacat gctcaccggc 3600 ctcagcacag tttgtcaaga atctcaatgc tgaacaacac cgtgagctgt acccagaagc 3660 tgccaaggca attgtcgatg accactacgt tgacgattat ctgacgagtt tcagcacaga 3720 agacgaggct gcaaggatag ctagcgatgt acgaaccata cacggcaacg gaggattcaa 3780 gcttcacaac tggcggtcaa acagcagcgc ggtgttggaa cgcttaggtg aagtgcaggc 3840 tgaggcagag aagcagctgg cactcgtgga tggtgcaaag acagaaaggg ttcttgggat 3900 gctgtggaca cctcgaaccg acgagctgag cttctctacc cagatgagcg aggaggtgca 3960 agatttgatg cggacagcaa cgcggccgac gaagagacaa gtactgcggt gcgtgatgac 4020 gttattcgat ccgctggggt tgctctcacc gttcctcatc catggcaaag tcctaatcca 4080 agacctttgg agagaaggta ccgagtggga cgagcaagtc aacaccgaag tcttcgcgaa 4140 gtggcagcgg tggacacaga tgatcgaata catcgccgaa atacgaattc ccagatgtta 4200 ctttcgacac gcagagcaaa cgacgtaccg caattcagaa ctccatgtat tcgtcgatgc 4260 gagcgaggta gcatactcct gcgcggtcta tcttcggacg ctgaaccacg gtgccgatcc 4320 acaatgctgc ttgatcgctg ccaaatcaaa ggtcgctccg ctgaagccgt ggtcgattcc 4380 tagactagaa ttaaaagggt gcgttctagg cgtacgctgg tccaagttcg tcagtgaaaa 4440 ccacggtgtt cacgtatcca aaatggtgtt ctggacggat tccagaacgg cactagcctg 4500 gatcaaggcc gatcctcgga actatcggca gtttgtatcg ttcagggttg gcgaaatcct 4560 agagagcacg accgcaagcg attggagatg ggtgccatca aaatcaaacc cagccgatga 4620 agcgactaaa tggggaagcg gtccgtattt cgaccatgac agcaagtggt tccagggacc 4680 cgactttctg cgccggccgg aagctgaatg gcctcgtccc aaagaagtat tgactgcaac 4740 taccgaagag atacgcgcaa cgattctgtt ccattattca ttcgatccgg tagtggattt 4800 tgaccggttc tcctcttggg agcggcttct gcgaacaata gcatatgctc tccgattttt 4860 gaacaactcg gccaaggcga agccgaaaag aaccgggcag cttcaacagg atgaactccg 4920 agcagcagag gtgactatct tcaggctagt gcagcgtgaa tcctattcgg atgagatcgc 4980 agagctctcc aataaaaccg cgaacaaacc gtcgcaagat gttatcggga aacacagctc 5040 catctaccga ttaataccgt tcctggacga caatggcctg ctgcgtgagc gcggtcggat 5100 cagtgcagtt gagggcgttt gttacagcgt gcgccatccc attatacttc cgagaaaaca 5160 tcgagtcacg gagctactcg tccgcagata ccatcagcag taccgccacg gcaattcgga 5220 aaccgtggtc aatgagatcc gccagctgta taacattcca agactacgcc tggtcgtcag 5280 aaaagtcagc cgcgactgca acgtgtgcaa gatccgtcga gcaaaaccaa caattccact 5340 gatggcacct cttccaccgg cgcgcctggc acaccatgaa cgtgcattca cctacactgg 5400 ggtggactac ttcgggcctc tactggtgaa gctaggacga tccaacgtta aaagatggat 5460 cgcactgttc acgtgcttga cagtgcgagc cgtccaccta gaaatcgcct acacattgtc 5520 tacagagtcg tgcatctcct gcgtccggcg atttgtaggt cgtcgaggac cacccgccga 5580 atttttcagc gataatggga cgaattttca gggagctgat cgagttctgc gacaccaaat 5640 cggccagggg ctgtcggaaa ccttcaccgg cgccaacacg aagtggaact tcataccgcc 5700 cggagcacca catatgggcg gagcgtggga acgcttagta cagtctgtga aggccgcgat 5760 gatggaagcg tactccgagg ggaagcttga tgacgagggg ctgcagacac tggtcgtgga 5820 gacggaaagc atagtgaact caaggcctct gacatatttg ccactcgact ccgaggaagc 5880 agaggcgctt acgccaaacc atttcctact attgagttct aacggggcga aacgacgcag 5940 ttcagcagcc gtagacatag ttcgctgtaa gacaaacgaa cttgttcgcc gagaaatcct 6000 gggaaaatcc tgggaactaa tagagcgcca gctagacgcc ttctggaagc gctggttgac 6060 agaatatctg ccggtgatcc gtcgacaacc aaaatggttt gaggagaatc gaacgctgaa 6120 agcaggagat ctggtaatgg tggcagaatc aacaggacgc aataagtggg agcgcgggcg 6180 aatcatccgt acgattcctg gcccggatgg ccggcatcga caggccgtca tacagatggg 6240 aaataaggct ctgctgagac cggtgtcacg gttggcattg ctggatctaa aaggctttcg 6300 tgaagttccg gagaactccg gactacaccc gggggagac 6339 // ID BEL-208_AA-LTR repbase; DNA; INV; 467 BP. XX AC AAGE02026623; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-208_AA_; KW BEL-208_AA-I; BEL-208_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-467 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026623; Positions 33081 33547. XX SQ Sequence 467 BP; 138 A; 103 C; 129 G; 97 T; 0 other; tgttcagtcc gcagcgcagc gaactgaaga agaagcgcat gcaacggaag ccccgcccaa 60 caggaaaacg atcgtcatcg tacgcattcg ttgcgtaaca aacaaatata aatatgctct 120 acctgctagg gcgagcatca gttttgtgac gcaactcgga gagagcagat agcagcagcg 180 agaagaaagg caaacgtgaa aaggggagaa gtcgagccgg agagccgaga ccggccgaag 240 aaagtgtgtg gaagaatgta gtaaagtgaa taagtaaaat gaattagaag aagaaagctt 300 gtgtgtgtcg ttttgttttc gagccgaaat ttccgtggtc cgccagtcgc ctgcgtttcc 360 accacagtcc ttgtcaggac tagttgctag ggaaaatatt cacttcgctt ggtcatcgct 420 ggtccgttgg tgcagcagtt tacagtccac tgcaacacgt cgcaaca 467 // ID Gypsy-19_OD-I repbase; DNA; INV; 6882 BP. XX AC CABV01002282; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_OD_; KW Gypsy-19_OD-LTR; Gypsy-19_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6882 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002282; Positions 335 7216. XX CC Positions [4421-4897] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 563..1771 FT /product="Gypsy-19_OD-I_2p" FT /translation="MTLATVKATLKDKKLVLSTYESRPPTTDEQRAGMVQL FT ESEIKDLEAQLCSIVLSEESGETAPQAAQGDGSWLESQILVTHLRTMSKFT FT GASAEATTQFIQQVKALTESCPGMKFQTIYGAIRPCLSTSVLKTLNNSEVK FT TFDDFKKVLNTNYGHSLNIFQRWENWNTATKRFGKSYTQWFCEISNSLEPI FT ITSYQQEVIKNKEAMGVKNYTPTFEDGFQLSLVYKMLNMIRSDSNELYSAI FT IIELSSLKTCEALASRAEALNCQAGMLSGAFAGSRSNAKSSNKGNTSSTNQ FT DQPEDTTSNQQRGSSRGKWRGRGRGRGRGRGRGRGSKGNNGRSNDSIYKQD FT ERQAQTSRSQSGNRAQSVGNDHNQAEQPGAFASFHPEDYLPENYDENAEWQ FT DDGEYAPKN" FT CDS 1738..3279 FT /product="Gypsy-19_OD-I_1p" FT /translation="MAGRRRIRSKKLDQSGDDDFVGHSSRYQPALFKSYPL FT VNDNKSAYIPVQFPNSKEWVPALYDPGSFASLLCENVAKDLNLQLKPTAQR FT IRGIGTDANDRCVGKVVVDIKVGWQSNWPKVTWYVLPAGSMIIPAIIGRSP FT LFERCTKIVQNLHSRKFYLGQPGEMPAKVPYVSDPRHEKFYSNFDDGNCNA FT AVDAVATSDLVELVQREIGATINIGDSPDHAREVAQIILTNRDAFTTPTRL FT IGRFKPYEAGIPTQPGKARCIHQWRVPEKHKAPLTDVVNTLKEQGILIPCP FT DSKGWNTPVSAVGKRDGGVRLVMNLNLTINKLLTETDTYSLPYLDESTEIP FT VGMKFFGSLDLASGYYNIAIKQEDQVKTSIHWNGEQLMFTRCPFGMRHSGN FT LFCRALHHALHKMKNRQHVTVFVDDLCIHTPDFQSFCSTLRELLQLIHEYG FT FVVKRAQSLSALSRDPLARSTNIRRGPAPRSRKCRNDSQDEPSDQLQRIAV FT PARNAELGQTVLFHQVRR" FT CDS 3191..5224 FT /product="Gypsy-19_OD-I_3p" FT /translation="MNPPTNFKGLQSLLGMLNWVRQFCSIKSGDDISTQNF FT STLIRPITALVKINRPRGPFTWNREHTAAFNLIKQKLSSPEMIYFPDFSLP FT FVLCTDASSVASGWCLLQIHEGKSRIVRVGSKTFTPAQTRYSATEREALAI FT CTAVGDCRTYIFGTPFTIRTDHQALIYIDAKISKNDKLARWASYLSQYDFV FT ITYLPGEENIVADYLSRPADYDYKRPKNSDAPEPAGVFRDFHGFRIYVPSW FT VRLDRRPKIEFKPSDLCSGDDLVSHVASKPPTDVGINQLNSITAEQYADPA FT CKTALEAITHDRLPRFDQSCEESSFLNRNFSHLSLCPESGSLKFKDRSFVP FT VGIRPSVLIAFHDRRNHAGQARMRETMNHLTWPSVFADIDNYVRSCNCNHA FT KGGRGRCHTPGLVPTPKGSKPFERLLIDFVEFPPARSGHRYAVTILCTFTK FT FLIAVATRRCRAIDAVEALQKHVIEIYPHPKVIASDRGQHFCSSLNALFAK FT MNNIHWHHHISFRPQSCGLLESRHRELKSAIYIATHTLSADWPTVLKRVVF FT VMNSATNKSTRQSPYKAVWGVDPVISKFDKIDPPPSGADIPTFLKNRKQVT FT DLLHDKIRICQAEADEKVRNSRPYIEPEELHPGDEVLLKKERSALAKSTRL FT NWVGPFLVCRSKWSRCFSFRQRRPTRLGP" XX SQ Sequence 6882 BP; 1956 A; 1853 C; 1503 G; 1570 T; 0 other; gtggtgttca gaacgttaaa gctgcgaaat gcacctaccc caaaataatt ttctcaagcc 60 gggggactca aaagtcacga aataagcaaa tttcgtgccg aaaaaatgcc aaccatattt 120 tcgcaagctc actgaccgat tcaaaaatta aaattaagcc attttggcga tggtaaaacg 180 cgcttcttag atcagcaaca gaattataat tctgtgacgg ctctaggggg tgatatttcg 240 cgcgaagttt ttccctcacc caaattttga gtcaaatttc agaaggcgag tcccaagctc 300 agccagtcat gtgaccgcgc atatatattt cgcgaatcat caaaacggat catgagcaag 360 tcaaacaatg attcttagcg acgttagatt ttcgacactc gagaattcca tcgaaaaatc 420 taagcaccaa attctagaaa aaaggtgaat tttagttttg cgcaagtttt ggcccttcca 480 aaaccagagt cccccaggca tagcggcata aaatacaatc cggcgaagag tcaaaccagt 540 tcctacgatt atcatcatca ccatgaccct cgcgacggtc aaagcgactc tcaaagacaa 600 gaagctagtg cttagcactt acgaatctcg tccaccgacc accgatgagc agagagcggg 660 gatggttcaa ctcgagtccg agataaagga cttggaagca caactgtgct ctattgttct 720 ttccgaggaa tcgggtgaaa ccgccccgca agctgctcaa ggcgatggta gctggctcga 780 gagccagatt ctcgtcactc atctacgaac gatgagtaaa ttcaccggcg cgtcggcgga 840 ggcgacaact cagtttatcc agcaagtgaa agcgctgacc gagagctgcc cgggaatgaa 900 atttcagacc atttacgggg caattcgacc ttgcctctcg acatcggtct taaaaacctt 960 gaataattcc gaggtgaaaa ctttcgatga tttcaaaaag gtgttgaaca ccaactacgg 1020 ccattcgctg aatatctttc agcgatggga aaactggaac acagcaacga agcgatttgg 1080 taaaagctac acccagtggt tttgtgaaat cagcaactcg ctcgaaccaa tcattactag 1140 ctaccagcag gaagttataa aaaataagga agccatggga gtgaagaact acactccaac 1200 cttcgaagat ggattccaac tgtcacttgt ttacaagatg ctcaatatga tccgtagtga 1260 cagcaacgag ctatacagcg caataatcat tgagctaagc tcactcaaaa cctgcgaagc 1320 actcgctagc cgagccgaag ccctcaattg ccaggctgga atgctaagtg gcgcattcgc 1380 tggcagccgc tccaacgcca aaagttccaa caaagggaac acgagctcaa caaatcaaga 1440 ccagcctgaa gacaccacct caaaccagca gagaggttca tcgcgtggta aatggcgagg 1500 tcgcggccgc ggacgaggac gcggtcgcgg gcgaggacgc ggcagtaaag gcaacaatgg 1560 gcgcagtaat gattccatct acaagcaaga tgagcgccaa gcccagacca gtcgctccca 1620 aagcggcaac agagcccaga gcgtgggcaa cgatcataat caagctgagc aacccggcgc 1680 atttgccagc ttccacccag aggactactt gcccgagaat tatgacgaaa acgccgaatg 1740 gcaggacgac ggcgaatacg ctccaaaaaa ctagatcagt ccggcgatga cgatttcgtc 1800 ggacattcct ctcgatatca acctgcttta ttcaagtcgt atccactcgt caacgataac 1860 aaatcagcat atattcctgt acaatttcca aattccaaag aatgggtccc ggctctttac 1920 gatcctggct cattcgcaag tctgctatgt gaaaatgtgg caaaagatct gaatcttcaa 1980 ttgaaaccga cagctcaacg gatccgtggt attggcactg acgcgaacga ccgctgcgtc 2040 ggcaaggtgg ttgtggacat aaaggttgga tggcaatcga actggccgaa agtgacatgg 2100 tatgtccttc cggctgggtc gatgatcata ccagcaatta ttggccgctc cccattgttc 2160 gaacgctgca ctaaaattgt acaaaatctt catagtcgta aattttacct ggggcagccc 2220 ggcgaaatgc ctgccaaggt gccatacgtc agcgatccaa gacacgaaaa gttctactcg 2280 aacttcgatg acggcaactg taacgctgcc gtcgacgccg ttgctaccag cgatcttgtc 2340 gaacttgtcc agcgcgagat cggcgcaaca atcaacattg gcgattctcc cgaccatgca 2400 agagaagtcg ctcagattat cctaacaaac agggatgctt tcacaacacc gactcgcctg 2460 attggaaggt tcaagccata cgaagcaggc atacctaccc agcctggaaa ggctagatgc 2520 atccatcaat ggcgtgtccc agagaagcac aaggctcctc ttactgacgt cgtcaacact 2580 ctcaaagaac aaggcatatt gattccatgc ccagattcca agggatggaa cactcccgtg 2640 tccgccgttg gtaagcgaga cggaggcgtt cgattggtga tgaatttgaa tttaacgatc 2700 aacaagcttc taaccgagac cgacacgtac tcgcttccct accttgacga atctacagaa 2760 ataccggtcg gtatgaaatt ttttgggtca ctcgatcttg catcgggtta ctacaacatc 2820 gcaattaaac aagaagatca agtaaaaacg agtattcatt ggaacggcga gcagctgatg 2880 tttacacgct gccctttcgg gatgcgccat tcggggaact tattttgccg agcactgcat 2940 cacgccctcc acaaaatgaa gaatcgtcag cacgtgacag tatttgtcga cgatctgtgc 3000 atacacactc ctgatttcca gtcattctgc agtactctac gcgaacttct acagcttatt 3060 cacgaatatg gattcgttgt aaaaagggcg caaagtttgt ctgctctttc ccgagatccg 3120 ctggctcggt cgactaatat ccgccgaggg ccagcgccca gatccagaaa atgtcgaaac 3180 gattctcaag atgaaccctc cgaccaactt caaaggattg cagtccctgc taggaatgct 3240 gaattgggtc agacagttct gttccatcaa gtccggcgat gacatttcaa ctcaaaattt 3300 ttcaaccctt atccgaccaa tcaccgccct cgtcaaaatc aatcgcccta ggggcccctt 3360 cacctggaac cgcgaacaca ccgccgcttt caacctcatc aagcagaagc tgagcagccc 3420 ggaaatgatt tattttcccg atttcagtct accattcgtc ctctgcaccg acgcgagcag 3480 cgtcgcttcc ggttggtgtc tcctccagat tcacgaagga aaaagtagaa ttgtcagggt 3540 ggggagtaaa acctttactc ctgctcaaac aagatacagc gcgacggagc gagaagctct 3600 cgcgatttgc accgctgttg gcgactgtcg cacatatatt tttggtactc cttttacgat 3660 ccggaccgac caccaagcac tcatctacat cgatgctaaa atcagcaaga atgataaact 3720 tgcccgctgg gcgtcttatc tcagccaata cgatttcgtt atcacttatt tgcctggcga 3780 agaaaacatt gtggcggatt atctcagccg tcccgccgac tacgactata aacgaccgaa 3840 aaattctgat gctccagaac cagctggcgt ttttcgtgat tttcacggat tccgtattta 3900 cgtgccaagc tgggtccgac tcgaccgccg ccccaaaatc gaatttaaac cctcagatct 3960 ctgcagcggc gacgacctcg tctctcatgt cgcctcaaag ccaccgaccg atgtcggtat 4020 aaatcagctt aattcgataa cggccgagca atacgctgac cctgcctgta agacggcact 4080 ggaagcaatc actcacgatc gactaccccg tttcgatcaa agttgcgagg aatcatcatt 4140 cctgaaccgc aatttttctc atttgtcact ttgtccggaa agtggctctc taaaattcaa 4200 ggatcgaagt ttcgttcctg taggaattcg accatctgta ctcattgcat ttcacgacag 4260 aagaaatcac gctggtcagg ccagaatgcg cgaaacgatg aatcatttaa catggccgtc 4320 cgttttcgcc gacatcgaca attacgtcag atcctgtaat tgcaaccacg ctaagggtgg 4380 tcgtggacga tgccacacgc ccgggcttgt tccgactccc aagggctcaa agccattcga 4440 gcgactgctt atcgacttcg tggaatttcc gcctgctcgt tcaggacatc gctacgccgt 4500 caccatctta tgcacattta caaaattctt gattgcagtt gctaccagga gatgcagagc 4560 aattgacgcc gtcgaggctc tccagaagca cgtcatcgaa atttatcctc atcccaaagt 4620 aatagcgtca gaccgtggcc agcatttctg ctcctcccta aatgcattat tcgccaaaat 4680 gaataatatt cattggcatc atcacatcag tttccgcccg cagtcatgtg gattacttga 4740 gtcccgtcat agggaactta aatccgcaat ttacatcgcc actcatactc tatcggccga 4800 ctggccgact gtactaaagc gcgttgtttt cgtcatgaac agtgctacca acaaatctac 4860 tcgtcaaagt ccgtataaag ctgtctgggg agtcgatccc gtaatctcca aattcgacaa 4920 aatcgaccct cctccctccg gagcagatat accgacattt ttgaagaatc gaaagcaagt 4980 tactgacctt ctccatgaca aaatccgaat ctgccaggct gaagcagatg aaaaagttag 5040 aaattcccgc ccttacatcg agcccgagga acttcatccg ggcgacgaag tcctcctcaa 5100 aaaagagcgc agcgcgctcg caaaatcaac ccgacttaat tgggtcggcc ctttcttggt 5160 ttgccgatca aaatggtcac gttgttttag tttcagacag cgaaggccga caagactggg 5220 tccataggtc tacctgcctg aaaaaggttg accgatttcc tcacttgggc gaaattccgc 5280 ccttcccaaa tatgcaaatt cctctgcgta aaacaacccc atctccaatc gtaaattcta 5340 gaccgctcaa ccctcccgtt ccagatcata ttcaagcacc cgaagcagcg ccggaggctg 5400 taccggaaaa agaagttcaa aacactcccg tcggtcacga cgacacagta ttcttcgatt 5460 gcgaaacaga tctcaattca ccccgaaact ctacgaacac gcccggagcg ccgatccccg 5520 aggaaccgca agttccctta aatccgccag aaaacgcccc ttggggcgaa atcgcgccta 5580 tcaatcatat tccacttccc ccagcacgcg cgaatacaac gtctacgaca agacgtaaaa 5640 tcgctccttc gcgccagccc tcgagcgcat ctccagctaa aacgcgatca caagctcgag 5700 taaattctga gcgcgaaatg acagacagag cgctcgcaaa aagattgcaa gcggaaaacc 5760 ggcggcgttc gacacgcgtc gataagagtt attttacata aaattacgag tcagacccgt 5820 ttggggagta tagacacttg taaaacttag tatgagtcgt aatgttatgt aatttgtaat 5880 ttatcaataa aaaaaatcgt ggttgcgtaa cattaaattt aactgtaaaa ttctaagtta 5940 cgtaatcaca aaagtgatca atggcatttt tcagacgatg agcttcccct cactggaaga 6000 tttgatcggt ggcacgagcc gaagtttcga cccgcagacc acacttgtga cctttactcc 6060 cggtcaatta acgccagtgg aaaacgcggt aaattttggg ccccgagtct ttacatctgt 6120 atatgactcg tggaccgaaa attactgggt tttctaactt ttttcagctc ccgccaagcg 6180 actacgatcc agacaaggat tcgctccagc tcacatccga tgagtcactt cctgcgtcac 6240 caaagactgt gcctccacct ccgccgaaaa agcgcaaagc cgatccggct ccagagccgc 6300 ttaccattac acgctaattc tccgacctgg aaacacctcc aggctttagt tcttcggatg 6360 gcgaagcgaa aaaaccgctt tccaaatctc agaagcaaag gagaagaaag aaggctaaaa 6420 agcgacaaaa agcgctggaa aaccagtccg cgcctcagct cgagccgacc ctcgaagtca 6480 atcacgaaat tcgtcgacta agccacgaga tcatggttgt tctccaacgt tcagctcgag 6540 tcatcagtca gagcagcttt ccaaagagca aggatggcaa aaagctcacc aagctgaccc 6600 agcaactctt ctgtggcgat ttggcatttt cggcaaatca gattcttatg ggatccaaga 6660 tgtccttgaa agcattcttg gcagccaaac aaaccctgga cgaataccag ccgatgcggc 6720 cagatgctgt caaagaaaaa gctacctcgt cgaaaaaatg agtctaagga cttgccatcc 6780 tatccattgt acctcaccta gtccaaattg atcccttcac actgccgttt gtcatgagtg 6840 gggggctgtt gggttactcg agttcatttc ataactcgta ag 6882 // ID BEL-29_AA-I repbase; DNA; INV; 6352 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-29_AA_; KW BEL-29_AA-LTR; BEL-29_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6352 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 1598185 1591834. XX CC Positions [5409-5693] - Integrase core CC 'CTAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 21..5693 FT /product="BEL-29_AA-I_1p" FT /translation="MTTERPNGSSTDLEQSEIFTCAACANPEVADDLVACD FT KCDKWWHFTCAGVSDSISSRDWICPRCRPPSIPASLRSTTSSKRADLQRKR FT LAEQQELEKKELELERKQLELLKRHSQEQYELEESIVGAEGDNRSVLSRVN FT EIEARERQVEAWVDQHSSTGLTSAVPPAGSLNLYRQLAQDAGIDISTNKVG FT PVPDPPLQLPIITEPANEVGKSFVDNPVLRTLQQQLAQGGQQPFSAEQLLS FT LEAQLRKCRLEMMQQEEEASGGDALTNRHQDTGANRGARPKNPIARNALEP FT PKFAIESSNQTRLTESGIPDQRHRSSTLLAQEDESKGTSRTGPRIRFAPKQ FT DAVKPPLIESTLIEQTRLRHAHSIPPPLNSTLQPTGRYFQLDAPATRFNTP FT PREDPRHSSPENGAFRRATGNVRGSMRQGDSSFAIPNPGATSFVQPPNPAV FT PPVNFNLGTGFLNRQPSALNPDQINLRRPSPEQLTARQVMPRDLPDFSGDP FT EEWSIFYSSFQNSSVACGFNDVENLSRLQRCLKGNALKSVRYYLLSPESVP FT NVIDTLRTLYGRPEIIINKLIQSVRETHAPKPEKLESLIDFGISIRNLTQH FT LIAAGQQAHLSNPVLLQELVEKLPANVKLQWAQHLMYEPEASLRTFSDFMG FT TVVESVSKVVIYTGAQQARSEKPRNKDKSFVNAHSEDSSSSAPAGENYEVE FT KPCLFCEKVGHRLKSCTKFQQLSIDDRWKNVQRLKLCRSCLNPHGRRPCRM FT SSRCGINGCEFRHHQLLHGKSDAPIQPAGTSAENHVHHHCGQSVLFRIVPI FT TLYGKSKSVNAFAFLDEGSSATLIEQSLVQELGVEGPTVPLCLRWTANMTR FT SEDESQTVSLEVSEVGQKKKFQLMHVRTVENLNLPSQTLRFAELQVGHPHL FT KGLPIRSYEAVTPKILVGLRNLQLAVPQKIKEGNSGIIATKTRLGWCVYGS FT LNRVEKTDEYNYHVCECESEEKLDRLIKNYFNAEDCPTNYPEKLESNEELK FT ARRIMEKTTIRVGDRFATGLLWKSDFVELPDRYSMALRRLECLERRMSRDP FT FLKENIRRQISEYQEKRYAHRATTDELANADPRRTWYLPLGVVYNPKKPGK FT VRLIWDAAAKVDGISLNSVLLPGPDLLTSLPSVQFRFRQYPVAVSGDIKEM FT FHQVQVIERDRPAQSFLFRENPDDKPSVFMMDVLTFGATSSPSSAQFIKNR FT NAQEFSERYPRATEAIRKCHYVDDYLDSFVSTDEAKSVASEVRWIHSKGGF FT ELRNWASNRKAVLEYLGELPKLYTKDLAMKADSERVLGMLWHTEDDVLRFS FT TSFREEIVTLIEAGSKPTKRQVLKCVMSLFDPLGLLACVLVHGKIMMQDIW FT RTAIKWDECIDDQINERWVKWIELLQDVNEVRVPRCYFNGASATLYDSLEA FT HIFVDASEVAYAALVYFRVIRSDETASCALVSAKTKVAPLKYVSIPRLELM FT AAVLGVRLLSFVKESHTFQIKRCVYWSDSQVTLAWIRSEHRRYRPFVACRI FT GEILSTTSDNQWRYIPSKLNVADDATKWGEKLCLDASSRWFSGPPFLQQPE FT NMWPAQKKSIATTDEELRASYLHQDVPVLHNPFTFERFSKWNRLLRTTAYV FT IRFIGILKRTGTGEKRRTSALRQDELHAAEVMIWKLVQSEAYRDEVIVLSK FT NKELSSGKRIELEKSSVLWNLAPKMDDDEILRVDGRISAAKEMSNDVKFPV FT ILPRKHYATHLILEDYHRRLLHANFETVVNEVRQRYHIPRLRTTARSVISR FT CQWCKVHKARPAMPMMGPLPTARISPGTRPFSYVGIDYFGPILVKIGRATA FT KRWVCLITCLTIRAVHVEVAYDLSTKSCIACIRRFVCRRGAPIEIYSDNGR FT NFTGADRVLQEQIRRLEE" XX SQ Sequence 6352 BP; 1749 A; 1566 C; 1646 G; 1391 T; 0 other; aaacttcaag ataatatatc atgaccactg aaaggcctaa tgggtcgtcg acggatttgg 60 agcaatccga aatcttcact tgtgcggcgt gtgccaaccc agaagtagcg gacgatctgg 120 ttgcatgcga caaatgcgac aaatggtggc attttacgtg tgctggggtc agcgattcaa 180 tttcaagtcg tgactggatc tgcccgaggt gccgcccacc ctccattccg gcttcattaa 240 gatcgacaac ttctagcaaa agagcagatt tgcaacgcaa acgcttggca gaacaacagg 300 agctggagaa aaaagaactg gagttggaaa gaaagcagct agaactattg aaaaggcatt 360 cgcaggagca gtacgaattg gaagagtcaa ttgtgggagc agaaggtgat aaccgtagcg 420 ttctgagccg cgtgaacgag attgaagctc gtgaaaggca ggttgaagca tgggtggacc 480 aacactcctc caccggacta acatccgctg taccacccgc cggatcttta aatctgtacc 540 gtcagttagc ccaggatgct ggcatcgata tttcaaccaa taaagtcgga ccagttcctg 600 atccaccact tcagcttccc atcattactg agcctgcaaa cgaggtcggt aaatcgtttg 660 ttgacaatcc agtactacgt actctgcagc agcagctggc tcagggcgga caacagcctt 720 tttcagcaga acaattgcta agcttggaag cccagttgag aaaatgccga ttggaaatga 780 tgcagcagga agaagaagca tcaggtggtg acgccctcac caatcgtcac caagataccg 840 gtgccaatag aggagcaagg ccgaagaatc cgattgcgcg caatgctctc gaaccgccaa 900 aatttgcaat agagagttcg aaccagacga gattgaccga gagtggaata ccggatcaaa 960 gacatcgcag ttcaactcta ctcgcccagg aagacgaatc taaaggaacc agtagaacag 1020 gcccacgaat tcgcttcgcc cctaagcaag atgctgtgaa gccaccgttg atcgaatcaa 1080 cactgattga gcagacgcga ttgagacacg cgcactcaat tcctccacca ctcaacagca 1140 cgctgcagcc gacgggtcga tacttccaat tggatgcccc tgctactcgt ttcaacactc 1200 ctcctcgcga agatccccgc cacagctctc ccgaaaacgg tgcttttcga agagccactg 1260 gaaacgtcag gggaagcatg agacaagggg acagttcctt tgcaattccg aaccccggtg 1320 cgacaagttt tgtacagcca ccaaatcccg ctgttccccc agtgaacttc aacctcggca 1380 cgggctttct caacagacaa ccgtcagcgt tgaatccaga tcaaatcaac ttacgtcgtc 1440 cttcccccga acaactaaca gccaggcaag tcatgccacg agatctcccg gacttctctg 1500 gagatccaga ggaatggtcg attttctaca gcagtttcca aaactcctca gtggcgtgtg 1560 gattcaacga cgtcgaaaac ctttcccgtt tacaacgttg cttgaaaggg aacgcgctaa 1620 agtcggtaag atattatcta ctatcgcccg agtcagtgcc taacgtgata gacactttgc 1680 gaaccttgta cggtcgtccg gagatcatca tcaacaagtt gatccagtcc gtacgtgaga 1740 cacacgcgcc aaagcctgag aagttggaat cccttatcga ttttgggatt tcgattagaa 1800 atctgacgca acatctcatt gcagcgggac aacaagcaca cctgtcgaac ccggtgcttc 1860 ttcaagagtt ggtagaaaaa ttgccggcca acgtgaagct acaatgggcc cagcatctga 1920 tgtacgagcc agaagcgagt ttgagaacct tcagtgactt catggggaca gtcgttgagt 1980 cggtcagcaa ggtggtgatc tacacggggg ctcaacaagc caggagcgaa aagccccgaa 2040 ataaagataa aagcttcgtc aatgctcact ccgaggatag cagtagcagc gctccagcgg 2100 gagaaaatta tgaagtcgaa aagccgtgcc tattttgcga aaaagtcggc caccgtctga 2160 aaagctgtac aaaattccag caactttcga ttgacgatcg gtggaaaaac gttcaacgat 2220 tgaagttgtg tcgcagttgc ctcaatcctc acggtcggcg cccttgtaga atgtccagcc 2280 gatgtggaat taacggatgt gagttccgtc accatcaact gctgcatgga aaatctgacg 2340 caccgatcca accagctggg acctcagcgg agaaccatgt tcaccaccat tgtggtcaat 2400 ccgtcctctt ccgaatcgtt ccgataacat tgtacggaaa atcaaagtca gtaaacgcgt 2460 tcgccttctt agacgaggga tcttctgcga cattgatcga gcaaagtctg gttcaagagc 2520 tgggagtaga agggccaaca gtcccactgt gtcttaggtg gacagctaac atgacgagaa 2580 gcgaggacga gtcacagact gtgtcgctgg aagtgtcgga agtgggacaa aagaagaaat 2640 tccagctgat gcatgtacgc accgtcgaga atctcaactt accatcgcaa acactacgat 2700 tcgctgagct gcaagtgggc catccccacc tgaaaggatt gcctattcgg agctacgagg 2760 ccgtaacacc gaagatattg gtcggcctac gaaacctgca gctggccgtc ccgcagaaga 2820 tcaaagaggg gaacagtggc ataattgcga caaagacccg tttaggatgg tgcgtctacg 2880 gaagtttgaa tagggtagaa aaaacggatg aatacaatta ccatgtgtgc gagtgtgaat 2940 ctgaggagaa actggatcgc cttattaaaa attacttcaa cgccgaagat tgtccgacaa 3000 actacccgga gaagctagag tccaatgaag aattgaaagc acggcgaatc atggagaaaa 3060 cgacgattag ggtgggtgat cgatttgcga cgggactgct ctggaagtcc gattttgtag 3120 agttgccgga ccgttactca atggcccttc gacgcttgga atgccttgag agacgaatgt 3180 ctcgcgatcc ctttttaaag gaaaatatcc gccggcaaat tagcgagtac caggagaaga 3240 ggtatgctca ccgagcgact acagacgagc ttgcgaatgc tgatcccaga cggacctggt 3300 atcttccact aggcgttgtg tacaacccca agaaaccagg aaaagtgcgc ttgatttggg 3360 atgcagcagc gaaagtcgac gggatatctc taaattccgt gctactccca ggccctgacc 3420 ttctgacatc actcccttcc gtacagttcc gctttcgaca gtacccagtg gcagtaagcg 3480 gtgacatcaa ggagatgttt caccaggtac aggtcatcga gcgtgaccga ccagcacaaa 3540 gcttcttgtt ccgtgaaaat cctgacgaca aacccagcgt attcatgatg gacgttctca 3600 ccttcggggc cactagttct ccctcatccg cccagttcat caaaaaccgc aacgcgcagg 3660 agttctccga gcgataccca cgggccacgg aggcgattcg taagtgccac tatgttgacg 3720 actacttaga tagttttgtg tcaaccgacg aagcaaaatc agtggcgagc gaagtaaggt 3780 ggattcactc gaaaggaggc tttgaactgc ggaactgggc atctaatcga aaggctgtcc 3840 tagaatatct aggagagctt ccgaagctat acaccaaaga tctcgctatg aaggctgatt 3900 cggaacgcgt gctgggaatg ctgtggcaca ctgaggatga tgttctacgg ttttcgacat 3960 catttcgtga ggaaattgtc acactaattg aagcgggttc aaagccaacg aagcggcaag 4020 tcttaaagtg tgtgatgagt ctgttcgacc ccttagggct gctggcttgc gttctcgtgc 4080 acgggaaaat aatgatgcaa gatatctgga ggactgcgat caagtgggat gaatgcatag 4140 acgaccagat caatgagcgg tgggtgaagt ggattgagtt gctgcaagac gtcaacgaag 4200 ttcgcgtacc gcggtgctac ttcaacggag caagtgctac tttgtacgat tcgttggagg 4260 cccacatttt cgtcgatgca agtgaggtgg cgtacgctgc attggtgtat ttccgtgtca 4320 tcagatcgga cgaaacagca tcgtgcgctt tagtttcggc caaaacaaag gttgctcctc 4380 tcaaatacgt atcaatcccc cgtttggagt tgatggcggc ggttctggga gtacgtttac 4440 tctcattcgt gaaggagagc cacacctttc agatcaaacg ctgcgtctac tggtcggact 4500 cgcaagtaac gttggcttgg atacggtcag agcatcgacg ttaccgacca ttcgtagcct 4560 gccgcatcgg agaaattcta tcgacaacca gtgataatca gtggagatac atcccgagca 4620 aactcaacgt tgccgatgat gccactaagt ggggtgagaa actgtgtctt gatgcctcaa 4680 gtcggtggtt tagcggtcca ccattccttc aacaaccaga aaatatgtgg cccgcgcaga 4740 aaaagagcat tgcaacgact gatgaggagc tccgagcgag ttacctgcat caggatgtac 4800 cagttctgca caatccgttc acttttgaac ggttctccaa atggaatcgg cttttgagga 4860 caacagccta cgtaattcgg ttcatcggca tactgaaacg cactggaacc ggtgaaaaac 4920 gccgtacctc agctttacgg caggatgaac tacacgcagc agaagttatg atctggaaac 4980 tggtccagtc cgaagcttat cgcgacgagg taatcgtcct ttcgaaaaat aaggagctgt 5040 cgagcgggaa acgaattgag ctggaaaagt caagtgtgct gtggaattta gctcctaaga 5100 tggatgacga tgaaattctt cgagttgacg gtcggatttc cgcagcgaaa gagatgtcta 5160 atgacgtcaa gtttcctgta atcttgccaa ggaaacacta cgctactcat cttatccttg 5220 aagactacca caggaggctg ctacatgcca attttgaaac ggtagtcaat gaagtgcgtc 5280 aacgttacca cattcctcga ctacgtacca cagctcggag tgtaatcagc cgctgccaat 5340 ggtgcaaggt gcataaggcc cgtccagcga tgccaatgat gggaccactt ccgacagccc 5400 gcatatcacc cggaacacgc ccgtttagct acgtcggcat cgactatttc gggcctatcc 5460 tggtgaagat aggtcgagca acagcgaagc gatgggtatg tctgataaca tgcttgacca 5520 tacgtgccgt acacgtcgag gtagcgtacg acctgtctac gaagtcatgc atcgcctgca 5580 ttcgccgttt cgtttgccga cgcggcgcac ctatagaaat ctactccgat aacggccgta 5640 acttcactgg agcagatcgt gttttgcaag aacaaataag gcgcctcgaa gagtaagcgt 5700 ctacgacttt cacgaatact acgacgaaat ggttgtttat accgcccttt gctccccaca 5760 tgggtggagc atgggagcgg atggtacgtt ctgtcaagaa cgcgttaacc agtatgcctc 5820 aggacgacaa actggatgat gaaggactcc aaaccattat agtagaagcg gaagcgatag 5880 tgaactcgcg cccgttgacg tatctaccgc tggattcagc ggagcaagag gcgctcacgc 5940 ctaatcactt catccttgga aattcaacag gagtaaagca acctgcggtg aagctcgaag 6000 attcagaggc agcagttcaa ccttctctga acctgattcg acgaaggatt gatcacttct 6060 ggaagcgttg gatcctagag tacctcccaa ccctgacgag acgagtaaag tggctgcagg 6120 atactaagcc gatgcggatc ggagatttgg tcatcatgat cgacgagact agaagaaacg 6180 gatggatacg agggcgtgtc ttagacgtca ctgcaggtag agacggaaga gttcgccaag 6240 cgttagtgca aacttcaggt ggactttttc gacggccggt atccaaactg gcggtgctgg 6300 atgtagctga acgtgaagtg gaggacccta ccgctcctca cggggagggg ga 6352 // ID BEL-7_DPu-LTR repbase; DNA; INV; 342 BP. XX AC scaffold_26; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_DPu_; KW BEL-7_DPu-LTR; BEL-7_DPu-I. XX NM BEL-7_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-342 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 662-662 (2010). XX DR Genome; scaffold_26; Positions 1027695 1028036. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 342 BP; 71 A; 95 C; 52 G; 124 T; 0 other; tgtcgaaaat tctccaactt taacttcatt tctcttaaaa ctatgttccc ccttttcgtt 60 ttggctttct cacccccctt tgctaatcac ccccgagcta tcgcgattca cccttctttt 120 gttttgtgtt ctgagactga cgagtctccc tgtctagact cgtcagtctg aatcttgatg 180 gctatcgaga atgttccact aagtgtccgc tcaggctcaa tcttaataaa ataacacaac 240 gtaggactca atcacggtgt acgtatttct ttatttacgt gtgcctctcg gtttctcccc 300 ctcaaagtta gtcgtgcaat tctacctcct ttttttctga ca 342 // ID Kolobok-1_Ppac repbase; DNA; INV; 5668 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-1_Ppac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5668 BP; 1684 A; 1248 C; 1152 G; 1584 T; 0 other; gggggagggg cacgaatatt ttttcacata agctgagcct gtggggtaat agggatgcta 60 tacccgatca cagatcactt tatccgaagt tctagaaatg cctctgaacc tcgctatgga 120 tttttagaaa tttcagcccc gtgcaggctc gatcagggta tactgtattc gaagagaaat 180 atcgattgat tgtgccgata tttgtaaaaa tgagtcaatc cgggtagcaa taaatatctg 240 caactgaagg gagtcgaacc aacgggggga tgagcgccag ccaatccctt tagccactag 300 gctatgaagg cagttgagag cctccccttg gaggagatat gtgcacttga ggtgctcaat 360 gatagtcacg ggaatggttt atcggcaaat aaaacaactt gcaatggtga tttcgaacag 420 aataacaagt caagagacat aagggagtga tagacattat gtaaatgtac atgtcattta 480 aaaaattaat catcgaataa agttggggac cgcttccggc ttatcggtat gtcctttttg 540 tgttgcactc gctggcaact ttgtcatcac ttcaccaata atagtgtttt cgatcaattt 600 gcattgtttc ttgtgtaatt tttcatttat tccgtgttgt tctgtcgctt ttgcagatgg 660 acactttccc gtcactcgtt gaagcccaaa ctacagtgaa caagatgaga ttgagggagg 720 aaaaggagca aagagtcttc ccccctagtg tcgattcgga ggaggcagca aactgtcttt 780 gccaatgctg tcctcctaga tcttcacagg aacaagggga ttactgctgt tcatccctct 840 tctcattccc tcttctccgg aatggcgctc tcctcaagga tggattactc tctaaaatga 900 aagaagtcgg ctctcactgc tgcatcacga aggatccgct cttcactgac cacgtactca 960 cggaagctgt gagtgtttaa cttattattt tcctattgta gaatagaata cctttccaat 1020 acttctcaac tgtgatatat catggctacc gtaatccagt atttatattt gaatcaatac 1080 aagaaaatat cgaatgatgt tgctaccgta atcgagcatc gatttttgaa tgtccctgat 1140 cggcctagtt atctaccgta tgagagtagc tatcaactac cgtatctaat ctcataaagc 1200 cctatctcat gaaaaactac cgtattatgc ccgtatcgat tgcaggcagc acgtgcctct 1260 gcagagatgt tctcgatgtt ttctggagag ccgatcacgg atttcaacaa gtgagattct 1320 tttgtcagca gtatttgtac atataattca gagcctatcg atacggttcg tatcgcctct 1380 ttgtcgcatc gaccatcggt catctgggaa aaggagtcag aatcagactt cccgcttgct 1440 ttgtacacgc agtgagacag ctatggccga gcacaaacta tagaggattt tcatcgtcgg 1500 aagcctatga tttataaatg aataaagttg ttacaaataa actgatggtt tttttcggat 1560 aataaaacgt tgatcacgaa catcctctta tcttggcgta tgcttcaatg ctattgatct 1620 attggcattc tcagtgagag atggtcaagt gcgttgcttg tcctacggac tgtgatcgaa 1680 gtctccttca ccagttcact tctaagcctc atcttcgaga caaatggctc actgcactca 1740 caaaagacga ttcggaaaag gcggcattgg atgtcaaact caggactgct cccggtagac 1800 actttgtctg tattgatcac ttcgatgacg atgcattcga ggaaacggaa ttttctcgtt 1860 ttctcaaggc ggatgctatt ccattgtcca aggtcagtac aacaatacat taagaagaaa 1920 atttttatca tactataata atcacaagac attcgacgat taggacctct tttcagagat 1980 ccacgccgag ctacagtaat cctaccgtaa cctacaactc tccaccaccg actcctcgcc 2040 tggttcctct ctcgtcgact ccagtggctc gtcctcttct cgctcgtcct ctccgtctca 2100 ttgacgcgcc agttcctcct tgttgcagat gctgctgcaa aaaggaaccg gttagtttct 2160 gtttcaattt ctgcatttcc actctattaa atttatcaaa catatcccga tttatctatg 2220 tttcaggacg ttgacctgaa gaaggatgtc gattgggctc cacccacccc tacgatcact 2280 aatctccctc tctcgaagta cttcattgtc agtaaggcta gcctagtgca gctgctgtca 2340 cgatgcaact cgtgcccttc tggtcagaat gatctcacat tcaccgaaga tgctcatgca 2400 ctatcgtgca cgtgcaagtg cactagttgc ggggttcaat tcatatggtc gaatagccgt 2460 gtgcttccga ctgcgaactc ttcctcgaag gaaaaactga gagaagtcaa catggatatg 2520 tgcgtcggat ctgctgtcac tgcagttggc actgcggtta gttcacaatt ataaattacg 2580 aaataggaaa attgtcaaat acatattttg taccgtaacc tatttcagag actgaactac 2640 ttcctgaaag ccgtcggcct gaatgttgtg tctaaaagga ctttccacag acacaaaaat 2700 gactatctcc ttcctgctgt cacccaagtt ttcactcaag cccaagagga actattcgaa 2760 agagtgaaag atcgacttag caaagtactg tttcattcgc ctgttctatc caatcaatac 2820 tgcctagggt gagcaattgc acgtagcggg cgatggtagt ttcgacacta gaggctactc 2880 agcagagtgg tgtagatact ttctggtcga tgcacacaca ggagaagcgc ttgttcatgt 2940 catcatgaac aaaaaagaaa ctggcagtag tgggacactc gaggtattgc atttaaataa 3000 aatgggaata attttgtatc tatacaggtg gcctgccttg agaaggcact cgacattctc 3060 tcggacaaaa tcggaggagt tcatctcatt tccactcttg ttactgatcg ccattcgggt 3120 gcaataaaga tggtcaaaca gaagtatccc tccatcaacc acttctttga cccttggcat 3180 tatttccgga atctcactct caatctgatc aaggtaattc gtttggaaat taaaacctac 3240 acataatttt ctacaaccat tctatttctt ccagatctgc aagccgacct acatgcagca 3300 aataagggac acttggtcaa gaatcttaat caacaaggcc tatgatgcag tactaaaggc 3360 gcagggaaac ggcacactcg ctagtgaaat gttcaggtcg tctcttcttt gctgtgcagg 3420 agtgcacaac ttttcgaatg tgagttctag tcgctattcc agactatcta aacactattt 3480 ataggacccc tctttcacag agttcaagca atgtctacat ggtccccctc cttccaattt 3540 cccctatatt ccaatagatg gaagagtgtt caaaaggctt caatcggaga tatatacgga 3600 aaagaatatc caggatattc aaaacgtttc ataccttcta aaaacgtcaa cgaacgaaag 3660 tctaaaccag attgcatggc gttatgcgcc taaagaatgc tacttcgaca ggtatttcag 3720 cccacatctt ttccagcaca atctaatcac atttcagaaa gggtcatgaa ttgagaacta 3780 tggaggcagt tttgcactgg aatgagctaa agagagatga agctaatgga actcgcacta 3840 ttgttagcct tttgatctca attgaaaatc agctcacaaa tctctacagt tactcattaa 3900 tatcagattg gcaagaagag ccattacaac aacacgttga agaagcacgt ttttcgaaac 3960 gtcaaatcta ctgccaaaaa tgcgtggcgc gaaactgtga agaataagtg ctatcaggta 4020 tgacaaccga ttgatctact tgatcccatg actcagttgc gcacctctct ctcatccact 4080 ccctatggaa cggtcaagaa ggaaaaagaa gagctacaga agaagagggt aaaagtgata 4140 ctgattgtca aatctaatat atgctttcta ggatctgtgg aacagtatga acacgcctac 4200 cgtaccccac agcagtgatg ctaaccttga cgaatccgaa gatgaagacg atctcccttc 4260 ttcctcccag gtcgaaggag agccaagtga cgaccttctg caagagatta gattgctcat 4320 cgaagaagat gctattaacg atcaactcag agaagaggaa aatgacgatg agtaaagtag 4380 tcattaggga atatatcatc gttttcatat tgttaacaat tgttacgtat tttgccgatt 4440 cgaactctca tctgatttac ggagactagg acaaatttgt cataccgtac aagataataa 4500 atataattca aaaatccaaa atttgccagt taacttatca ataatatctt gctaatcttt 4560 cgttcctaac ttgtaaatgt aaaccgatat catctaatgg aaggaatggg agaaactcaa 4620 tagacgacaa accttgtagc gtcatgattc tcgtgatttc ctcccttgaa gcaggcggca 4680 cagagtgaca taccttcaaa gagcaactgt ttaactcgat aaatggtcag gtgctcgatc 4740 attggcaaag agtaccgcat cgacacgagt aggctcctgt agtggagtga tgagaccctc 4800 gataagtcga tccagaagac gagctttctt tctgcaacac aaactccttt cgaatacagc 4860 tatttcttgg ataatgatga tgcttaagtc gattgatcgt accttagctc gggagtgaat 4920 ctgtttactc cgatggacga atgggggtag tttgcttctt tgagctcgtc cgctattgcc 4980 ttccattttt gatagtcgtt accgatctgg aagaaccaat atgaacatga atatccgtga 5040 aataagtaac atgacacagt gaataaaacg gtaaaaaata acacactgta cacgcatttc 5100 gatgtgaccg gcgatcaatt ttgactataa tcgcttctaa gaggctcaga agatattaat 5160 ttgataattt aatagcgcgt aaggtgaatt acaatagcga ggatagcaaa tagcttcaaa 5220 aacttgaaac cctcatcatt ttaacacgtt tcagtcatat atcagtcgaa ttctgccatc 5280 gcgaacattt gtttcgataa agcatcatca gatcttgctt aaatatcgca aattatatga 5340 ataagcaata aatatacaac tgaaactcct gtgaacaggg aataacagac tacaatcgga 5400 acaacttttg gaacagccac aaacagagtc atttaagcgg attttgagcg taaaaatgat 5460 gcagttgtgc tctaacagaa tcgaggaatc gttgcacagc taaattgtaa caacttaaaa 5520 aaggtaaaac agcgttaaat caataaaatg aaaaaatcaa aaaagctggg ctcaaagcca 5580 taaccggcaa atagaatcta cggtagtttc gctcgggtgc aggctcttcg tgccaagaaa 5640 atttcgcctc gattcgtgcc cctccccc 5668 // ID CR1-63_AAe repbase; DNA; INV; 4506 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-63_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4506 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1150-1150 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 697..1548 FT /product="CR1-63_AAe_1p" FT /translation="MHTVCEQCANDFTGAVMKCGGFCSXQFCMKCTGMTEE FT THRLLNNNGHLVXMCFACRNILAKTRFSHAISSVNAANQNIIDALKTEIKD FT SILNDIRTEIRDNFKTLVDAVPSTPLTIRPPPLPHSSRNKRQREADVDDDT FT SSRRPPKLLCGTATRNSDVCSVAIDSADTVPDFWLYLSGIQPDVVDDTVRQ FT MVQESLGTNEFKLVKLIPKGKDQRMLTFISFKVGIAADLKEKAMSADTWPK FT GIRFREFENQNLPRAGFWRPSAPVPENTLPQITVTSDXPLNVA" FT CDS 1443..4445 FT /product="CR1-63_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KPKLATSGFLETIRSCSREYITANNSHIRSSTERGLI FT LNTDIEQPSTLPSVPTSSFLVYYQNVRGLRTKTRELFLELCSCDYEVIAFT FT ETWLHPGIADSELTSNYNVYRCDRNNSTSEFQRGGGVLIAVKKHHRCEVVS FT MSNGEDLEQVVVRIRLPTKTIHVCCIYIRPSSAPSVYEQHASCIQDICAGA FT SMNDAILLLGDYNLPHLVWRYDNDINAHLPTNASSESEISLTESVIASGLF FT QVLNIRNHNDRLLDLAFVNDPDNVEVFEPPSPLLKPDAHHKPFLLNVFLPA FT IEDNEKDDVLHFDFRRYDASALSRDLSNVDWNSILNESTVDDAVNVFYDVI FT YNVIREHVPMRKRRTTRSNNQPWWNNELRTLRNRLRKLRKRYFKHRSEENR FT TLLQTMERHFSTRRTSAFREYIRGLECNAKDDPSSFWTFVKSQRKSNGIPR FT EVSYQNVHASSAEESANLLADFLKSVYNDRPTPVTQEVLDGVVTYDLHMPL FT LNVTVEEVFRRFSDIDASKGPGPDGIPPLFVKQCAEALSLPATIIFNRSLS FT SGVFPALWRIASITPIHKAGNLNNAENYRGISILNCISKVLESFVHEALYR FT TVRSVIPEFQHGFVKGRSTATNLMSFVTCVKNRIEKKHQVDAIYIDFAKAF FT DKVPHTLTIAKLRKMGLPEWIIAWLESYLTERFAFVKVHGYESARFTIPSG FT VPQGSHLGPLLFVLFIADLCDLIQSEKLFYADDLKIFKTILSQMDCCVIQR FT DLDTIRNWCSANGMDTNAEKCKVISFTRSRAPIQQSYSIGVHELDRVTSVK FT DLGVVVDNKVTFNEHIATTTAKAFSLLGFLRRITKSFQDVYALKSIYCTVV FT RSVLEYAVQVWSPYHQVQSNRVERVQKSFLRYALRRLPWNDSIRLPPYEHR FT CMLIGLDTLASRRILLQRVFCFDLLTGNIDCGDLLQQLNIHVPNRRLRHRT FT FLYIPTHRTLYGHHNPLHSCCRYFNDVYDKFDFNITKMTFKNRIRH" XX SQ Sequence 4506 BP; 1255 A; 1047 C; 935 G; 1259 T; 10 other; catmcctggc atcactgctg aagtcgtttg ttgttgtcat ctgtgtcgct ctgaagattt 60 ckatctcatc acctgctttc gcctccacaa atcaacgttc gctgcaaatc gcatcccaaa 120 gccatcaatt tatcgttagc tgctgctgtt atcgctcaat ttggtcaggt aggatcgtga 180 ttttaaaata aatccgagtt ggcttgagag tggaagtgcc cgccaccacc gccattgccc 240 ccwmcakaaa atttcccacc tcgccactct ccacctgtac cagctgctgt caaacccaaa 300 cgaacaagct cttgttgatc tgccttctcg ctctcgctgc tgcgtawcgg aaatcaaaac 360 attcatctat tttcaaacct gtacgaatca tccgtcatct aagaaaatag aaagctgttw 420 cccgctccgt gttatcatta catccatccg aaaacgtccc atataatcca gcaagccgcc 480 tacatcgctg ttgtttcatc atttgacctg attaccgctg ccatcgaccg tgatcgtaag 540 tccactggaa gtattttagt gagccatttt gtataccacc tcaactcgct acacccgtac 600 tctttgctgc cttctcgccg ctcgttgcgc catctagttt ctgagcgttg aacctgatct 660 tctgataggg tgatccattt tccgtcaagg ctaaagatgc atactgtctg cgagcaatgc 720 gcaaatgatt tcactggagc ggtgatgaaa tgtggaggtt tctgttcawg ccaattttgc 780 atgaaatgta cgggtatgac ggaggaaaca caccgtttac tgaataataa cggtcatctc 840 gtakggatgt gttttgcttg tcgtaatata ctcgcaaaaa cacgtttctc tcatgccatt 900 tcatctgtga atgctgctaa tcagaacatc attgatgcac taaaaacgga aatcaaggac 960 agcattttga atgatattcg cactgaaata cgcgataact tcaaaacctt ggttgacgcg 1020 gttccatcga cgcctttgac aatacgacca cctccgttgc ctcattcgtc aagaaataag 1080 agacaacgtg aagctgacgt tgatgacgac acttcctcaa gacgcccacc gaagctgctc 1140 tgtggaactg ccactcgcaa ctctgacgtt tgttccgttg ctattgattc ggcggataca 1200 gtaccagatt tctggttgta tttgtccggt atccaacctg acgttgttga tgacactgta 1260 cggcaaatgg tacaggagag ccttgggaca aatgagttta aattggtgaa gcttattcct 1320 aaggggaaag accagcgtat gcttacgttc atttctttca aagttggcat tgccgctgac 1380 ttgaaggaaa aagctatgtc cgccgacaca tggcccaagg gaatccgttt tcgtgaattt 1440 gaaaaccaaa acttgccacg agcgggtttt tggagaccat ccgctcctgt tcccgagaat 1500 acattaccgc aaataacagt cacatcagat cwtccactga acgtggcttg atactgaata 1560 cggacatcga gcaaccatcc acgctgcctt ctgttcctac ttcctccttc ctcgtgtatt 1620 atcaaaatgt ccgtggcctc agaacgaaga cgagagagct ttttttagaa ctatgcagct 1680 gtgattacga agtcatcgca tttactgaaa cttggctgca cccgggtatt gctgattctg 1740 agcttacatc taactataat gtttaccgct gtgaccgcaa caattcaaca agtgaatttc 1800 aacgtggtgg aggtgttttg attgctgtca agaagcatca ccggtgcgaa gtagtttcga 1860 tgtcaaacgg agaagatctt gagcaagtag tggtgcgaat tagacttcct accaaaacga 1920 tccacgtctg ttgcatttac atcaggccta gttctgcccc atcagtttac gaacaacacg 1980 cttcttgcat ccaagatatt tgtgctggag cttcgatgaa cgatgccata ctattactag 2040 gggactataa tctaccacat cttgtgtggc gatacgacaa cgatataaac gctcatctcc 2100 caacgaatgc atcttccgaa tccgaaatct cgctgacaga atcagtgatc gcatctggac 2160 tgtttcaagt tttgaatatt cgcaatcata acgatcgact gctcgaccta gcttttgtga 2220 acgatccaga caatgtagag gtgtttgagc cgccttcgcc tctactgaaa cctgacgcac 2280 accataagcc gtttctgtta aacgtttttc tacccgcaat cgaagataac gaaaaggacg 2340 atgttttaca tttcgacttc cggcgatatg atgcaagcgc tcttagtaga gatctgtcta 2400 atgtcgactg gaattcaatt ctcaacgaaa gcactgtaga cgatgcagtc aacgtttttt 2460 atgacgtgat ctacaacgta atccgcgaac atgtaccgat gagaaagcgc cgaacaactc 2520 gcagcaataa tcaaccgtgg tggaacaatg aacttcgtac attgcggaac aggcttcgca 2580 agcttcgaaa acgatatttc aaacatcggt ccgaggaaaa cagaactctt cttcaaacaa 2640 tggaaagaca tttttcgacg cgacgaactt ccgcttttcg tgagtatatt aggggcttag 2700 aatgcaatgc caaagacgat ccttcatcat tctggacatt tgttaaaagt cagcgaaaat 2760 caaatggtat accgcgtgaa gtaagttacc aaaatgttca tgccagttct gcagaagaat 2820 ccgcaaattt attagccgat tttctgaaga gcgtttacaa tgacagacca acaccagtta 2880 ctcaggaggt acttgatggt gttgtgacct atgatctgca tatgccactc ctcaatgtaa 2940 ccgtcgaaga agttttcagg cgcttctctg atatcgatgc ttcaaaaggt ccaggaccgg 3000 atggaattcc gccattgttt gttaaacaat gtgcggaagc gctctctctt cctgcaacaa 3060 taattttcaa tcgatcgctt tcaagcggag ttttcccagc tttatggagg attgcgtcca 3120 tcactcctat ccataaggca ggaaacctga acaacgcaga aaactaccgt ggcatttcta 3180 tcctcaactg catttcaaaa gttctagaga gtttcgttca cgaggcattg tatagaactg 3240 tacgttcagt gattccggaa tttcagcatg gctttgtcaa aggtagatct actgcaacga 3300 acctcatgtc ttttgttaca tgtgtgaaaa acaggataga gaagaaacat caagtcgacg 3360 ctatatacat tgacttcgcc aaggcgttcg ataaagtacc gcatacccta acaattgcaa 3420 aactacggaa aatgggactt ccggaatgga taattgcgtg gctagaatcc tatctaaccg 3480 aacgatttgc tttcgttaaa gtacatgggt atgagtctgc tcggtttaca ataccgtctg 3540 gggttcctca aggcagtcac ctcggaccac tgctattcgt gttatttatt gcagacctat 3600 gcgatctgat tcaatcagaa aagttatttt atgccgacga tttgaaaatt tttaaaacga 3660 ttctctctca aatggactgt tgcgtaattc aacgcgattt ggacacaata aggaattggt 3720 gcagcgctaa cggaatggat accaatgctg agaaatgtaa ggtaatatca ttcactcgat 3780 cccgagcgcc gattcaacaa agttactcaa tcggagttca tgagctggac cgtgtcactt 3840 ccgtaaagga tttgggagtt gtcgttgata acaaggttac attcaatgaa cacattgcta 3900 caactactgc gaaagctttt tctctccttg ggtttctgcg aagaataacg aaatcgttcc 3960 aagatgtgta cgcactcaaa tcaatctact gtacggtagt ccgcagtgtt ttggagtacg 4020 ccgttcaagt gtggtcacca taccatcaag tccaaagcaa tagagttgaa agagtccaaa 4080 aatcattctt gcgatatgct ttacggcgcc tgccgtggaa cgactcgata aggttgccgc 4140 catatgaaca ccgatgcatg ttgattggac tggacacgct agcgtcgaga aggatactgc 4200 tacagagggt gttctgtttt gatcttctga cggggaatat cgattgcgga gatttgcttc 4260 aacaactgaa cattcatgtc ccgaaccgtc gcctacgcca tcgaactttt ttgtatattc 4320 cgacacatcg cactctttac gggcatcata accctctaca ctcttgttgt cgatacttta 4380 atgacgttta tgataaattc gattttaata ttactaagat gacgtttaaa aataggataa 4440 ggcattaaaa taagtttcag tctgtacaat taaattgaaa attgaagacg tagaataaat 4500 aaataa 4506 // ID L1_Ele26 repbase; DNA; INV; 4708 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele26. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4708 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4708 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 14 CC sequences with >96% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 171..1346 FT /product="L1_Ele26_1p" FT /translation="MASVRPRENTFKVDLTNFPKRPTFEDIHKFVYETLGL FT TVDHVVRLQMNHAQNCVHIKCRDLQTAQDVVQMHNEKHEIEVNKTKVKVRL FT VMDDGGVEVKIHDLSENIRNEEIVAFLKQYGDVITIKDQIWSENYPLKGIP FT TGVRVVKMMLRRHIKSFVTIQGEQTLVTYRNQVHTCKHCTNPSHPGSTCVE FT NKKLLGQKADLNDRLKLAAQSNNNPSTSFASVVNKSTATLMPHFVSLNELA FT SSAPASVATNNTVSTCSLPSVSDQPEIPAATTDGNDMQMKTTGMEEQWIDS FT AQQNIGEASTSGGITAGESAKQPSAGDQSTHQAATAVHDKATTSSVSVFKI FT PSNPSNTPNLTMEISDSDSYESSQDNGSFQKVKRRGRPKKPKIDPXFRS" FT CDS 1458..4658 FT /product="L1_Ele26_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNNPSISYNIGSVNICAMSSVTKIQSMCSFFRLMDFD FT IVLLQEVENSRLSIPGYTVITNVDERKRGTAVALKAHIPYCNVQRSLDGRI FT ICVNVGNCVTVCNVYPPSGTQNTSSREHFFRQSLPFYLQQSTPNLILGGDF FT NCVVSAKDATGSSNHSQSLKQLLDSLKLCDTWDCVHKNQIAYSFVRPNCAS FT RIDRIYVSSSLVSSLRTSEYFVTPFSDHKAYKMRCCLPNLGKPQGRGFWSM FT RTHVLTPENLEEFEIKWNRWLRERRNFDSWMSWWLNFVKPKIISFFKWKTN FT LAFREFHVRNEYLYRRLREAYDNLYANPQGLTEVNSIKGKMLNLQSKFSKS FT FERINETIIAGEKISAFQLGERIQKKNKSTIRSMTHANIRLENPTEIERHV FT CQYFEALYTDENLSVNTEFPCNRFISPNCEFNIGAMNEITTQEIFFAIKSS FT ASRKSPGCDGLPKEFFLKTFDIIHRQLNLVINEALRGNIQQKFVDGIIVLS FT KKKGGDDSIKGYRPISLLNFDYKLLSRILKQRMDKIMTDNNLLNSNQKCSN FT SKRNIFEAVCAIKDRIVEINCKRRSGRLISFDLDHAFDRVNVNFLLNVMRS FT MRFHTDFVSLVGKIMSMSHSRILVNGNLSPEIPILRSVRQGDPISMHLFVL FT YLHPLLEKLTTICNHPLELVVAYADDISIIVVEELKLDLIKKAFMDFGMCS FT GAKLNLTKTTSIHIGAQQPTNRMVSWLKSEEAIKILGVVYFNSLKRTLDFN FT WAEVIKKTSRLMWLFKQRVLSMHQKVALINTFITSRLWFMASVLSIPNSAV FT AKITSRIGGFIWSRYPCRISMEQLSLPVEKGGLNLHLPMHKCKALLLNRFL FT QCGEHTPFAASFLEQMANPPNVSGIPALYPCLKQISKELPYLPESTIRNPS FT APAIHSIFRNMIKSPKIVEENPNIHWARVWKNIRRKTFTSEEKSTYYLVIN FT EKIPHAALFFRQNRIDSPMCNRCPNAVEDLEHKLSTCVRVQHLWNHLRLKL FT ETILNRRVTYKMLSLPELNNTEVNTRNKALKMFIVYVNFVLDVNSCFTIEA FT LDFVLCCNCL" XX SQ Sequence 4708 BP; 1472 A; 1026 C; 948 G; 1260 T; 2 other; cagttaggtt caagctctcg ctgcgaccag acgtctgaaa aacaagctcc gtttcgattt 60 ttttcctttt cttttcgcgc acgcacttcg gtgtgtgaaa gagtatttgc gtcagttttt 120 tgctggcgac cggtatcgtt ttttgctacg ttaagcaaac aaaatctacc atggcctctg 180 tgcgaccgcg tgaaaacacg ttcaaagtcg atctaacgaa ctttccaaaa cgaccaacgt 240 tcgaagacat tcacaagttc gtttacgaaa cacttggtct gactgtggat catgttgtgc 300 gtctacaaat gaaccacgca caaaactgtg tgcatatcaa gtgtcgtgat ttgcaaactg 360 ctcaggatgt agttcagatg cataacgaaa agcatgagat cgaagtgaac aagactaagg 420 tgaaggtacg tctcgtcatg gatgacggcg gcgtcgaggt gaaaattcat gacctgtccg 480 aaaacattcg gaatgaagaa atagtggcgt tcctgaagca gtacggtgat gtaatcacga 540 ttaaagacca gatctggagt gaaaactatc ccctgaaagg gataccgaca ggtgtgcgcg 600 tagtgaaaat gatgttgcgg cggcacataa aatcttttgt gaccattcaa ggggaacaga 660 ccctggttac ttaccggaac caggtgcaca cgtgtaaaca ttgtactaac ccatcccacc 720 cgggttccac gtgtgtggaa aacaagaaac tgttgggtca aaaagcagat cttaacgatc 780 gcctcaagct cgcggcgcaa tcgaacaaca atccttcaac cagctttgca agtgtggtga 840 acaaaagtac agcaacccta atgccacatt tcgtttcact gaacgaattg gcatcttcag 900 ctccagccag cgtagcaacg aacaacacag tcagcacatg tagccttccc tctgtgtctg 960 accaacccga gatccccgcc gccaccactg atgggaacga catgcagatg aaaacaactg 1020 gaatggaaga acaatggatc gacagcgcac agcagaatat cggagaagca tctacatctg 1080 gtggtattac ggcaggagaa tctgcaaaac aaccatcagc aggagatcaa tccacacacc 1140 aggctgcaac cgcagtccat gataaagcca caacgagttc ggtgagcgtt ttcaaaattc 1200 cctccaatcc ctccaacact cccaatctca ctatggagat ttccgacagt gacagttacg 1260 aatcgtcaca agacaacggt tcgtttcaaa aagtcaaacg caggggacgc cccaaaaaac 1320 caaaaatcga tcctwtgttt cgatcctaag cccattaaaa ttcaacatga tcctattact 1380 aacatgttaa ccccgagtag gccgcggtgt atgtcggccc cccaaagcta actttgtata 1440 cctgaaatgg atccaagatg aataatccat ctatcagtta taatatagga agtgtgaaca 1500 tttgtgccat gtccagtgta accaaaattc agtctatgtg ctcgtttttt cgtttaatgg 1560 attttgatat tgttctgcta caagaagtcg aaaattcacg cctatctata ccaggatata 1620 ccgttattac aaacgttgac gaaaggaaaa gaggaactgc cgttgcccta aaagcccata 1680 ttccatattg caatgtccag agaagcctag atgggcgtat tatatgtgtg aatgttggaa 1740 attgtgtaac tgtttgtaat gtgtaccccc catcgggcac tcagaatacg tcgtctaggg 1800 aacatttttt cagacaatct ctgccgtttt acctacaaca atctacacca aatcttatac 1860 ttggcggtga tttcaattgc gttgtgtccg ccaaagatgc aactgggtct agcaatcaca 1920 gtcagtctct gaaacaacta cttgattctc tgaagctatg tgatacgtgg gattgtgttc 1980 ataaaaatca aatcgcatat agtttcgtcc gcccaaattg tgcttctcgc attgatagaa 2040 tatatgtttc gtcgtctctt gtctcgtccc tgcgtacgtc cgaatatttc gttacacctt 2100 tttctgacca taaggcgtac aaaatgagat gttgcctccc aaatctgggc aaaccacaag 2160 gccgagggtt ctggtcgatg cgaacccatg tgttaactcc tgaaaacctc gaagaatttg 2220 agataaagtg gaaccggtgg ttgagagaga ggcgaaattt cgacagttgg atgtcgtggt 2280 ggctgaattt cgtcaaacct aaaatcatca gcttcttcaa gtggaagaca aatttagcgt 2340 ttcgagaatt tcatgtaaga aatgaatatt tgtatcgtcg gctacgtgaa gcgtatgaca 2400 atttgtatgc taatccacaa ggtcttacag aggttaatag catcaaaggc aaaatgctca 2460 acctacaaag taaattttca aaatcgttcg aaaggataaa tgaaacaatc atcgccggag 2520 aaaaaatttc tgcatttcag ttgggagaac gcattcaaaa gaaaaacaaa agcactatta 2580 gatcaatgac gcacgcaaac atccgcttag aaaaccccac tgaaattgag agacatgtct 2640 gtcagtactt tgaagcactc tacactgatg aaaatctatc tgtaaatacg gaatttccgt 2700 gcaatagatt tatctctcct aactgtgaat tcaatattgg agccatgaat gaaataacga 2760 ctcaggaaat cttctttgct atcaaatcaa gtgcttcccg taaatctcca ggatgtgacg 2820 gacttcctaa agaattcttc ttgaagacct ttgacattat acaccgtcaa ctcaatcttg 2880 ttataaatga ggctcttcgt ggaaatattc agcaaaaatt cgttgatgga ataattgtac 2940 tcagcaagaa aaaaggtggt gatgattcaa tcaaagggta taggcccatt tctctgttaa 3000 actttgatta taagttacta tcacgcatcc taaagcagag gatggataag atcatgacag 3060 acaacaatct cctaaactcc aatcaaaagt gttcgaactc taaacgaaat atatttgagg 3120 ccgtttgcgc catcaaggat cgaatcgtag aaatcaattg caaaagaaga tcgggaaggc 3180 taatttcgtt tgatcttgat catgcctttg atcgtgtgaa tgtaaatttc ttgttgaatg 3240 ttatgcgtag tatgcgtttc cacacggatt ttgtgtctct tgttggaaaa ataatgtcca 3300 tgtcccattc gmgaatactt gtcaatggaa atctcagtcc cgaaatccca atcctgcgat 3360 cagttcgcca aggtgatccc atcagcatgc acctcttcgt cctatacctt caccctctcc 3420 ttgaaaaact cacaaccatc tgcaatcatc ctttagaact cgttgtggca tatgctgacg 3480 acatatccat aattgttgtg gaagagttga aattggacct gattaaaaaa gcgttcatgg 3540 acttcggtat gtgctcggga gcaaagctaa atctcactaa aacgacgtcg attcacatag 3600 gagcacaaca acctacgaat cggatggtca gttggctcaa aagtgaagag gccatcaaaa 3660 ttcttggagt tgtctacttc aactcactca agcgcaccct tgacttcaac tgggcagagg 3720 tcataaagaa aacatctcgt ctaatgtggc tcttcaagca aagagtattg agtatgcatc 3780 agaaagttgc actgataaat acgttcatca cctcgagact ctggttcatg gcgtcagtgc 3840 ttagcattcc caactctgca gtagcaaaaa tcacatctcg catcggaggt ttcatctggt 3900 cacgttatcc ttgccgaatt tcaatggaac agctttctct tccagtagag aaagggggat 3960 taaacctgca tctcccgatg cacaaatgca aggctctgtt attgaaccgg tttcttcagt 4020 gcggagaaca cactcctttt gcagcatcgt tcttggaaca aatggccaac cctccgaatg 4080 tgtctgggat tccagcttta tatccttgtc tgaagcagat atctaaagag ttaccatatc 4140 tccctgagag cacgattagg aatccatctg caccagcaat tcattccatc ttcagaaaca 4200 tgattaaatc accaaaaata gtagaagaaa atccaaatat tcactgggca agagtttgga 4260 aaaacattcg cagaaaaact tttacttcgg aggaaaagtc cacctactat ttggtcatca 4320 acgagaaaat accacacgct gctctgttct tcagacaaaa taggattgac agtccaatgt 4380 gcaaccggtg tccaaatgca gttgaagatc tggagcacaa actctcaaca tgtgttcgcg 4440 tgcaacatct gtggaatcat cttcgtttaa aattagaaac aattttaaat agaagagtaa 4500 cctacaaaat gttatcttta ccagaactga ataacactga agtgaatacc agaaacaagg 4560 ctctaaaaat gtttattgtg tatgtaaact ttgttttaga tgtcaacagt tgttttacaa 4620 tagaagcact tgactttgtt ttatgttgta attgtcttta aatgtagtat gaactaaata 4680 aatgtgttta caaaaaaaaa aaaaaaaa 4708 // ID CR1-68_HM repbase; DNA; INV; 3954 BP. XX AC . XX DT 25-DEC-2008 (Rel. 13.12, Created) DT 25-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-68_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3954 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1895-1895 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 187..975 FT /product="CR1-68_HM_1p" FT /translation="MEFDYECLEEPNWPTIPSASAEVTKNMHKWCGKISYI FT VSDLLKRLKILEDEKINNLTATTNKPTTIFTYAQATSIQKNQLKPKETEVV FT MMAKITNEFNNKKRIENNIVISGVADHTTLDKQVNEENDKQTIETILKAIK FT PDFSITNAKRIIRLTKNAKRPEGPPDLILVELENKETQNFIIKNSKKLKDS FT INFKNIFINKDKTANERLFESELRKVRNTRNAALAQEVEGTNGRQRYNVHN FT GKKYYWGIRSGELKWIEIKST*" FT CDS 800..3862 FT /product="CR1-68_HM_2p" FT /translation="MRDCSKVNSEKLEILEMQHLLKRLREQMGGNDTMCTM FT EKNIIGVYDQVNSNGSKLNQHNLHIKIKRTSALNLGIQNVQSAMKKSAIIH FT DIVQNHSLDIFILTETWIKSDDPTAIKEDMAPPGFKTIQYCRSDHRGGGVA FT IIYRDNLNIKPLSLSSTCLSYERTSAKLNLANTIYNIVAIYRSNPIIPTNI FT FFFELSELLEELQTLSGEILLCGDFNCPGTTPITCNSRLNEILETYNMIQR FT VQSPTRISSTTGIANLLDLVINRHDDISLKSIQVTDEGISDHYMIRVELYH FT QPSNTKTVWFSKRNFRKLNLSCFIKELQVMPCCLSPENNVDDYAEQIRSSI FT TTVLDHLAPIRKYSMRIGKHDDQWLSSDAKNKKIRRRKLERCYRKTGSLSD FT KKLYRTACREAAAAIHKSRCSHYKAKLNSAYGDQKETWNIAKKLLHSNSSK FT TNNTIPEAKCDIISQYFIRKLEIIKNTIQERLAKTPISKFISELSPPVDIL FT TTFGTVTPKDVSKIIKTLKPKTSPLDIIPTNILKSCSEIFSPIIAHLADLS FT FKSGIFPTSYKVAQITPIPKKSGLAEFDLTNLRPISNLNTISKILEKLALS FT RLHPHVTSSVNFNPLQSGFRSYHSTETALLKICNDILLNIDDGLTTILLSL FT DISSAFDTIDHSLLITRIKNDFGVTDIALKWLTSYLHSRKSFVSIGSSNSR FT LIASSTGVPQGSVLGPFLFSMFVSPIYRIIAKFGTNHHQYADDTQLYTFLS FT PGKDSINKITECANAVTTWFLENGLLLNPNKTEAILFGSRKQTSKYKNDLN FT IAFSGTTITTINSVKILGVILDSKLSMDKHINNIIKSCNYHIRALRHIRPC FT LTKEAANIIACGIVNSQLDYCNSILSGTYQKNIKKLQRIQNNLSRIVFHSP FT FHLKSEILLKSLHWLPIHQRIIYKISVITYKILLSKSPAYLFELLEVRTSK FT QNTRSLDSCQLTRRQQKTSLGSKSFCMTAPRIWNGLSLNTRNSTSIETFKK FT RLKYELFMNSFSDS*" XX SQ Sequence 3954 BP; 1494 A; 751 C; 552 G; 1157 T; 0 other; cggaaaaaaa acacaaaaaa ctttatgttg aatttgtgtt aataagtccc tgtgtaataa 60 ggtaaatata tatactcaag aggtaatcca ctgttattca gggtggtagc cctctttcag 120 aagacatcag gaggtaatcc actgttattt agtgtggtag ccctctcata tatttgaaaa 180 aaaagcatgg aattcgacta tgaatgctta gaagaaccaa attggccaac aattccaagt 240 gcatctgctg aagttaccaa aaacatgcat aaatggtgtg gcaaaatatc atatattgtt 300 tcagaccttt taaaacgact taaaattctt gaagatgaaa agataaataa cttaacagca 360 acaacaaata aaccaacaac aatattcaca tatgcacaag caacatcaat tcagaaaaat 420 caattaaaac caaaagaaac tgaagtggta atgatggcaa aaattacaaa cgagtttaat 480 aacaaaaaaa gaattgaaaa caacatcgtt ataagtggag ttgctgatca tactacttta 540 gacaaacaag ttaatgaaga aaatgataaa caaaccattg aaacaatact aaaagcaatt 600 aaaccagatt tttcaataac aaatgcaaaa agaataattc gactaactaa aaatgctaaa 660 agaccagaag gaccacctga ccttatctta gttgaattgg agaacaaaga aactcaaaat 720 tttataatta aaaattccaa aaagttaaag gacagtatta acttcaaaaa tatttttatt 780 aataaagaca aaacagcaaa tgagagattg ttcgaaagtg aactcagaaa agttagaaat 840 actagaaatg cagcacttgc tcaagaggtt gagggaacaa atgggaggca acgatacaat 900 gtgcacaatg gaaaaaaata ttattggggt atacgatcag gtgaactcaa atggatcgaa 960 attaaatcaa cataatttac atataaaaat taaaagaacc tctgcattaa acttgggtat 1020 ccaaaatgtt caatctgcca tgaaaaaatc tgctatcatt catgatatag tgcaaaatca 1080 ttctctagat atttttatac tcactgaaac ttggataaag tcagacgacc ctactgcaat 1140 taaagaagat atggccccac caggctttaa aacaattcaa tactgccgat cagaccatcg 1200 tggaggaggc gttgcaataa tatatagaga caacttaaat atcaagccac tctcattatc 1260 atctacctgt ctgtcttatg aacgtacatc agctaaacta aatctggcaa atactatcta 1320 caatattgtt gctatttatc gatcaaatcc aatcattcca acaaacattt tctttttcga 1380 attatctgaa ctcctagagg aacttcaaac attatctgga gaaatacttc tgtgtggtga 1440 ctttaactgt ccaggtacta ctccaataac ctgcaattca agacttaatg agattttaga 1500 gacctacaac atgatacaac gagttcaatc tccaactcgc atatcatcta ctactggtat 1560 agccaacttg cttgatcttg taataaatcg acatgatgat atttcactca agagtatcca 1620 agtaactgac gagggtattt cagatcacta tatgattcgg gtagaacttt atcatcaacc 1680 ttcaaatact aaaactgttt ggttttcaaa acgaaacttt agaaaactaa acttgtcatg 1740 ctttataaaa gaattgcaag tgatgccatg ttgcttatca cctgaaaaca atgtggatga 1800 ttatgctgaa caaataagat caagtataac aactgttctt gatcatttag cacccattcg 1860 aaaatattca atgcgaattg gtaaacatga tgatcaatgg ttatcttcag atgccaagaa 1920 taaaaaaatc agacgtcgta aactggagcg ttgttataga aaaactggta gtttatcaga 1980 taaaaaatta tatcgaactg catgccgtga ggctgcagca gcaattcaca aatccagatg 2040 tagtcattat aaggcgaaac taaattcagc atacggtgat caaaaagaaa catggaacat 2100 tgccaagaaa ttactacatt cgaatagttc taaaactaat aacacaatac cagaagcaaa 2160 atgcgatata ataagtcaat atttcatacg caaactcgaa attataaaga acactatcca 2220 ggaaaggctt gcaaaaactc caatatctaa atttatctca gagttatctc cccctgtaga 2280 tatattaaca acatttggta cagtcacacc taaagatgtg tcgaaaataa tcaaaactct 2340 caaacctaag acatcgccac ttgatattat tccaacaaac atccttaaat cttgttcaga 2400 aatattcagt ccaattatag ctcaccttgc tgacctctca ttcaaatccg gaattttccc 2460 aacctcatat aaagttgctc aaataacacc aattcccaaa aaatcaggac tagcagaatt 2520 tgacctaaca aatctacgcc ccatatcaaa tctcaataca atttcgaaaa ttcttgaaaa 2580 gcttgcttta tctcgtcttc atccacatgt aacatcatct gtcaacttca atcctctaca 2640 atctggtttt agatcttatc actcaaccga aacagcactc ttaaagatat gcaatgacat 2700 cctgctaaac attgatgatg gtttgaccac tattctccta tctctggaca tatcttcagc 2760 atttgataca atcgatcaca gcctgctgat tacccgtatt aagaatgatt ttggtgtcac 2820 agatattgca ttaaaatggc taacatcata cttgcactct cgaaaatctt ttgtttctat 2880 tggatcatca aattctagac taatagcaag ttcaacagga gtaccacagg gtagtgtatt 2940 aggcccattt ctattttcca tgtttgtttc acctatatat cgtattatag ctaaatttgg 3000 tacaaatcat catcaatatg ctgatgatac ccaactttat actttcctca gccctggcaa 3060 agatagcatc aacaaaatca ctgagtgtgc taatgcagta acaacatggt ttctagaaaa 3120 tggacttctg cttaatccaa ataaaacaga agctatttta tttggatcta gaaaacaaac 3180 tagcaaatat aaaaatgatt tgaatattgc gttttctgga accacaataa ctacaataaa 3240 ttcagtaaaa atccttggag tcatcctaga ttcaaagctc tctatggaca agcacataaa 3300 caatattatt aaatcctgca attaccacat tcgtgcactt cgccacattc gtccatgtct 3360 gaccaaagaa gctgcaaaca taattgcttg tggcatcgtt aactctcagc ttgactattg 3420 taacagcatt ttatctggca cttatcaaaa aaatattaaa aaactccaac gaattcaaaa 3480 taacttatct cgtattgttt tccattctcc tttccattta aaatcagaaa tattgctgaa 3540 atctcttcat tggttaccaa tacatcaaag aataatctat aaaatatcag ttatcacata 3600 taaaatttta ttatcaaaat ctcctgcata tttatttgaa ctcctagaag tccgaacttc 3660 aaaacaaaat acaagatcat tagacagttg ccaacttaca cgtcgtcaac aaaaaacctc 3720 cctaggctca aaatcttttt gtatgactgc acctcgaatt tggaatggtt tatctttgaa 3780 cacgcgaaac tcaacttcta tagaaacatt caaaaaaaga ctaaaatatg aactttttat 3840 gaactctttt tcagactcat agtttacttg tttagtgctt ctgaattact tcatcactat 3900 tgtgattgtt gtaaataatg gcactttaat ctctaattat tattattatt atta 3954 // ID ERE1_EI repbase; DNA; INV; 1759 BP. XX AC . XX DT 25-JUN-2008 (Rel. 13.1, Created) DT 25-JUN-2008 (Rel. 13.1, Last updated, Version 1) XX DE Repetitive element from Entamoeba invadens - consensus sequence. XX KW Nonautonomous; Entamoeba; Transposable element; Ei_ERE1; ERE1_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-1759 RA Lorenzi H.A. and Caler E.; RT "GenBank accession number EU099445."; RL Direct Submission to Genbank (09-AUG-2007). XX RN [2] RP 1-1759 RA Lorenzi H., Thiagarajan M., Haas B., Wortman J., Hall N. RA and Caler E.; RT "Genome wide survey and discovery of repetitive elements in three RT Entamoeba species."; RL Repbase Reports 8(10), 1682-1682 (2008). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 147..1127 FT /product="ERE1_EI_1p" FT /translation="MENELTALSALCTNAAKAVSDLNKNKKEYNFVDDKSL FT DIDQRIENAEGHFDRLDEVFDRKKNITTSLNTIVTDLDNASNKIKSEISLY FT STQSDNDKLYEIFNIKIALYEEKCQMDIKPLEDEINKIKQAKNNKIDQIMK FT SEKFINSKKQHDEEEEERIKREHITLEEKAGLEKLTGMKIGSVVFDSDKDD FT WSKDTSVFDEKVMNKSNLCFLVEDTNHNKFGGVFTGNINAVGKWINSGSSY FT IFSLVRSGTLNPKKFGKSGNYTNYEFYLYSQDNQYLFGFGGGCDIGVYKKG FT VNGSYCNPSTYTAKQNELTDNDNFTPKSVVVFQLN" XX SQ Sequence 1759 BP; 653 A; 274 C; 296 G; 536 T; 0 other; gaaattaaaa tcattttatg taagtttgaa tatttttatt tttggttttt aattgaaaaa 60 aaatcattat gtatgaggta ataatgtttt tgaatgagat ttatataaaa ttaaatataa 120 aaacaaataa actctattaa catacaatgg aaaacgagtt aacagcactc tcagcactct 180 gcactaacgc agcaaaggcg gtgtcagacc tcaacaaaaa taaaaaggaa tacaactttg 240 ttgacgacaa gtcacttgac attgatcaac gcattgaaaa tgcagaggga cacttcgaca 300 gacttgatga ggtatttgat agaaagaaga acatcacaac ttcactcaac acaatcgtca 360 ctgatttgga taacgcatca aacaaaatca aaagtgagat aagtctgtat tcaacacaga 420 gcgacaacga caagttgtat gagatattca atatcaaaat tgcactttat gaggagaagt 480 gccaaatgga cataaaacca cttgaagatg aaatcaacaa aatcaaacaa gcaaaaaaca 540 ataaaattga tcaaattatg aaatcggaga aattcatcaa ttcaaagaaa caacacgatg 600 aagaagaaga agaaagaata aagcgcgaac acatcacact tgaagaaaaa gccggtcttg 660 agaagttgac aggaatgaag attggaagtg ttgtgtttga ctcagacaaa gacgactggt 720 caaaagacac ttcagtgttt gatgaaaaag ttatgaacaa aagcaacttg tgctttttgg 780 tagaagacac aaatcacaac aagtttggtg gcgtgttcac tggtaatata aatgctgttg 840 gcaaatggat caacagtggc agttcttaca tcttttcact tgtgagaagt ggcactttga 900 atccaaaaaa gtttggtaag agtggaaatt acaccaatta tgaattttat ctttattctc 960 aagacaatca atatctcttt ggatttggtg gtggttgtga tataggtgtg tacaagaaag 1020 gtgtaaatgg tagttattgc aacccatcca cttacaccgc caaacaaaac gagttgactg 1080 ataatgacaa tttcacacca aagagtgttg ttgtcttcca acttaactga cttttttgtc 1140 attttttttt tttttatttc aaactttgca acacattgac tccaaaagtg tcatttttgt 1200 aatttgccac gtcaccacaa cttgaattaa tgtacacaag ttttaacaaa atctgtcctt 1260 tttgaagggc cggacagacg gaaagaaaac gggttttacc cttcaaacct aacactggaa 1320 cgaacactcg attaggatat ctttgatgtt tcttttttca ggcatctcct ctttgtgtct 1380 tgtatcttcc ggagatggat tcacgacgcc gcaactcgcc gggttcaaca agtttcgaca 1440 gttgcggcat atctactctt tgtgtcccgt tgtatctttc tgaagtatat taattgaaaa 1500 gtagtttggt atttttattt attatgggtt ttttcaaact acaaactata gtaattaact 1560 attaaaatca tattaaagtt aatattttca tctaaaaaca aatagcacaa atgttgatat 1620 aattaattat aatttaatag tttggtaaca caatttattt tttaaagcaa aaaactaaga 1680 aaagatcaat agactttcac ataattaaaa tattacaatt gacaatatat gtaaattgac 1740 tattttaaat aaataagta 1759 // ID Gypsy-134_AA-LTR repbase; DNA; INV; 1833 BP. XX AC AAGE02019934; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-134_AA_; KW Gypsy-134_AA-I; Gypsy-134_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1833 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019934; Positions 9135 10967. XX SQ Sequence 1833 BP; 518 A; 364 C; 397 G; 554 T; 0 other; tgaaacgatc attccgtacc agtcccattt taaattgttc gtacgttgta atgcctttaa 60 tatttattct tagcattttg aatcactttt gtttaacgtg ttatcgtgta gtacaagatt 120 aaaaagcttt tcctaatacg ccacctgtga aaaacaagcg gtaactgtga ttcaaatcgg 180 tatgaaaaaa atccatagta cctatgaaca tgagggaatc tacattcagg cggtttacat 240 tgatctgccc gaaactcggt tcaagtggaa aattccataa tttaattgcg cagtagagaa 300 tatttgaacg gaacagagag ctattttctg tattatcacg atatccaatc aaaaattacg 360 ttgtacgctc tcagtacacc atcgtgaggg aaattgcgtc ttagttttaa ataatatacc 420 aatccgaacg cgatcggcca tcatttcaag tggagttctc aaagtgaaca gttcgattcg 480 aacaagtgaa ttcgagtttc aaataatcga tattggaaaa tatcgtttga atttttatag 540 ctaaattcga agtaaatccg cgattaaatt taaatcttgt gataaattct aatttgaaat 600 ttgaattaca gaagatttac ctttttttga gttcgggtag tttttgttgg tgaacaaata 660 ggcagtagtc cacactgtgc cttagtgtgt gacttgaatc aggtaagctc taatgtgacc 720 agttgtgaag aagtgagttg ttctaaaata ccacgttgtt taggcatagc ccaacaacga 780 gggcagtcag ttttgggcgg ttcaaacgga atacctaagc ccgttcctgg tagatcctgg 840 tcggccttta cgcgatcccg accacgtggc ttcaatccgc attcttcgac cccagattct 900 gccggttagc tgcctgcacg tgccataaag tgatcgtacc gttccaactt aagtgtccct 960 actaccgaac tacgaatacc gttcatcgag gcgaaccatt cgaactgctc gaccgttagt 1020 cgagacacgt gggaattgag cagaagagca tcgaatcacc ttcaagccac actagtgctt 1080 cgtgtgcccg tgacgtcaag ttgtgcgctt tgagccacgt tctaaggctc aggactccac 1140 ctttccgggg acatctttac cccgagggct gtcgcacaca cgggacagat tcggtcttcg 1200 cttagcaagt cggtgaggtc gtcgatcggt gagtagaatc gcatccggct actacgagga 1260 cgacgtggga ccttggccgg cgtaatcagg tcagaataac ttttatgtgc acgaagaaat 1320 tgttctctag ctagagagaa agcttaccta aaatagaaaa ctgttcataa gatttgtttc 1380 cgtgacgtca caagtactga gcataggcgc ctgtcaaata aagcttaata actgcaaatt 1440 ttaaaaaact tgtgaagctc aaaaaaagtt ttcattaaat tttattgatt tttcgtgaaa 1500 taaattgttt tcaattaatt acttcaaatt tagttgaact cctttctttc tcgcgcgctt 1560 gagcttgtga caaattaacg agttggtttt attttagccg agacacactt cctgatcggt 1620 tgtgtaggag gcgtattgcg ttggtgagtt tggaacgtga ttgaattggc tttagttggt 1680 tcgttcaaat tagtaaagtt tgtatactgg aataactata acctgccttg agtagaggaa 1740 tctcatgccg ggatttgatc acagtcttcg gtccttctaa tttggaaggt ggcgcttaag 1800 ccgtagactg ccgtaacagg gacaaacgtt aca 1833 // ID piggyBac-4_BF repbase; DNA; INV; 9533 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-4_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-9533 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-9533 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-9533 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-4_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC A copy of SINE2-9_BF is masked by "N"s. A few kbs upstream of the CC TPase-coding ORF is composed of minisatellite-like repeats. XX FH Key Location/Qualifiers FT CDS 7074..9017 FT /product="piggyBac-4_BF_1p" FT /translation="MASGDDEEIEEDIEMYLREAAVAERAEAAGANGTVGQ FT NIAGEVDSDDSDDFDLYSVTKQVRGARVGEFSRRNKARGRAERPWASVSAG FT EVHPYKPQKFRAPGPVGPRRAEDNRNELSFFLEVLDQYLIRYIVRCTNRYA FT RQRRRVFRKERRSRWSPTSTTELKAFLGCMICAGYTYQPRLDLYWSDDDDV FT GCTLMKRTFPSHRFRQILRYLHYSPEGFCPPGSERLKSFKAREKLDRVMKV FT RKVMERVSRNSCRARHVGEHLVVDEATVAFRGRHSLRRYNPSKPTKYGYCL FT RALAETDGYILSDEVCGGRADPEDMSPDERCMHGIGDEYVSPLDLKPARVV FT NRLTMPFQGCYRTVYCDSYYTGVQLADYLFTKDTYIVGTVRRNASGLPTEG FT HPVPRKPRPGPGRPPLLKFPDPYPKSVPRGTSVRYHDGPVTVVKWQDTKSL FT ILLSTGTSVCAPDELTTRRCKGPSGATRLEVTCPAAMNMYNLHYKAVDTAD FT QLRAGYEFGRPSKKWHRQIFWYAMNKAVVNAFLNWKIYTAGRQDLPAHRVA FT QLSFRRALVRQLIGNFSARHRTPTASVALPGVIGNHRQISQGKPTKACRYC FT AKAGRHMPRAKRCRPFETVYKCEMCNVSVCHVSLRPECWREHLQDMQGDA" XX SQ Sequence 9533 BP; 1795 A; 2726 C; 3169 G; 1843 T; 0 other; cactttatta gtcgcaataa aaaatgggaa aacccggcgg caatcatgcg ggatcgcccg 60 gtgacattgt cccatctccc ggcaacgcgg aattgctgtc cgtaagtgcc agcggcggtg 120 ttgggaccat gcctccccga aagatgaccg agcaagttgt ggtgggtccc gggagctagt 180 gagttcatgg gccggcgaga gctgtgaaac tccggcggcg attatgcggg gtcgcccacc 240 cgatgctagc cttgcaatgt ccgtcacggg ctctagatcg ggcggcgaga gccgtgctaa 300 gtatgcctcc aatgttggtc gggcccgttg ccgatcaaga ccgcggagac acgccggcgg 360 gggcagtggc cggcacgcag agcaaggtcg ctcggtaatg cccgacaggc ggactcgccg 420 gtaaggactc ctccaatctt gatcgggccc gttgccgatc gagagcgcgg ggactcgcct 480 gcgggggcag tggccggcac gccgagcaag gccgctcggt aatgcccggc acacgggcgc 540 cggagtggac cgtaccgggt tcctgttgtt gcagacatgg cgtaatacat taagtattgg 600 ctatgtatag cctctcggcg ccgaggcaaa gcccgtgaac gtccccggtt gcgtaaggca 660 ggtccccggt cacgtaaggc aggtcccagt gaccgctttg gcgggccggg tgtcggtgcc 720 aaaggccggt tggcgccgat cggggtcgac cgggtcgggt ccggacagcc ggcggtgatc 780 aggtcaccag gccccggcac ccaccccgtg gagcattttt caaaaaagtg ctttacaggg 840 tgtctgacca tgtcgtggct cgctcaatct accttgagag agtggtcgtt ttttgactag 900 ctcggctttc cgcggagcct gacgggtttc tgccggtggc tgtcgagtca gggatggatc 960 gacagtgacc gtgggtgtgc ggggaacatg cccacggaga gaaaagtcgg gatggtccgg 1020 tgacttttcc ctctccggcg gtgccaacgg gctctgcacc ggccgagctg tcagccgtgg 1080 gcgggcggtt tgtgtctccg gctccagccg acttcgatcg cggtggtgga gttcccgggc 1140 cgtcaaggca cgagagaacg atgctgccgt gtcaggtagg tccatgtgac cgcactggcg 1200 gacgaagtaa cggcacacgg ggccggagtg gaccgtaccg gctctgactg tgttgcagac 1260 atggcgtaat acattaagta ttggctatgt attagcctct ctgcgctgag gcaaaacccg 1320 tgaccgtccc cggttgcgta aggcaggtcc ccggtcacgt aaggcaggtc ccagtgaccg 1380 ctttggcggg ccgggtgtcg gtgccaaagg ccggttggcg ccgatcgggg tcgaccgggt 1440 cgggtccgga cagccggcgg tgatcaggtc accaggcccc ggcacccacc ccgtggagca 1500 tttttcaaaa aagtgcttta cagggtgtct gaccatgtcg tggctcgctc aatctacctt 1560 gagagagtgg tcgttttttg actagctcgg ctttccgcgg agcctgacgg gtttctgccg 1620 gtggctgtcg agtccgggat ggatcgacag tgaccttgga tgtgcgggga acacgcccac 1680 ggagagaaaa gtcgggatgg tccggtgact tttccctctc cggcggtgcc aacgggctct 1740 gcaccggccg agctgtcagc cgtgggcggg cggtttgtgt ctccggctcc agccgacttc 1800 gatcgcggtg gtggagttcc cgggccgtca aggcacgaga gaacgatgct gccgtgtcag 1860 gtaggtccat gtgaccgcac tggcggacga agtaacggca cacggggccg gagtggactg 1920 taccggccct gatcgtgttg caggcatggc gtaatgcatt aagtattggc tatgtaaccg 1980 agacgggcct gcttggccgg cgggggactt ttaacccggc acccgaggcc cacccctctc 2040 ggtggggctc gccggcaaga actcctccaa tgttggtcgg gcccgttgcc gatcaagacc 2100 gcggagacac gccggcgggg gcagtggccg gcacgccgag caaggtcgct cggtaatgcc 2160 cggcaggcgg actcgccggt aaggactcct ccaatcttga tcgggcccgt tgccgaccga 2220 gagtgcgggg cctcgcctgc gggggtagtg gccggcacgc cgagcaaggc cgctcggtaa 2280 tgccctgcag gctggctcgc tggcagaaac tcctccaatg ttggttggtg ccgttgatga 2340 ccgagagccg ggtcgggtcc ggacagccgg cggtgatcag gtcaccaggc cccgacaccc 2400 accccgtgga gcgtttttca aaaagggctc tacagggtgt ctgaccatgt cgtggctcgc 2460 tcaatctatc ttgagagagt ggtcgttttt tgactggctc gggaaacggc agcgagtggc 2520 gagtgatcgc acactctctc gccagtcgtt ttgctgcccc gccgaccagg ctgccactcc 2580 cagcgagggc agccggaggt cgacatccta cttgtcggcg gtttccataa gtagcacctc 2640 ggacacttcc tccgcggagg ctgagtctgg gtccaccgcc atttctacct gcgtgcgagc 2700 aagcgttgcc aaacagtgtc taattgtggc acacttcgtt tttgagacag gagaggtgaa 2760 gtgatacagc ctgcattctt agccgcaaga caagttcttg ttgtcgttga gcaagaacgt 2820 gagctcgtgg tatggagcgg gcggtccgcc cgcaccgaac cactcttgga acgtagtcgg 2880 gcccgtaaaa agcgcgggcc ccacgttcgg gcgccttgag tgcgcgcccg tcgttcctac 2940 ttgcctttat gaggcgccag tcgttttgct acccactccc cgcgagggca gccggaggtc 3000 gacaaccggc taggaaggcg gccgttcctc cgttcctgcg ggcgaggacg gccgtgccga 3060 gggcgtccgg aagcagcgcc ggggccggcc tccgcggagg cagcgcccgg gtcaaccagg 3120 gacccaaacg gcgacggctc gccccttccg tctgcgtgcg agtacgcctt gcggtcttgg 3180 cagttcgaga cccggccgtc cgtctgctcg gcagacgcgt cggtcgtgag agtgcgggac 3240 cgttccctcg cggggacggc tcgggcgttg aagggaagca acgatcaggt gtggtcggga 3300 gttctgcgct caggctcccg tgggtttgcg gcaccgctgc gcattcccaa tttactgtag 3360 ctacggactt tggcggtggc ggttctcggg tgcttggcgc ggtcccggtc gagacgccta 3420 gagcgtctcc cggagcgact gtgacgcccg cccgccttgc caccagggcg tcctcgctac 3480 ctggttgatc ctgccagtgg tcatatgctt gtctcaaaga ttaagccatg cacgtgcaag 3540 ttcaactgtc ccagtgaaac tgcgaatggc tcagtaaatc agttatggtt cctttgatcg 3600 tcacatccta catggataac tgtggtaatt ctagagctaa tacatgcgga agaagcgccg 3660 acctcacggt ttggcgtgca tttatttatt tatttattta tttgcacacc tcgtggcaag 3720 gagcgggcgg tgcccctggc ctcccgcccc gacccactct tccttaccgc ctcgacaagg 3780 tggcgctccg acggtgctct cgcgtcaatc gcttgtcgtc cgtcttaaga cgacgtcgag 3840 cgcgggcgga gagtcaggag cgcccaggcc gacccgctcg gtggcgggac ccggcagctc 3900 ccttggaacg ttggtcgggg gccttcattg agcgcccgtc gttccccaga ctcaatcagc 3960 gagccgctcc gagtagcctg gtcgaggacg gagttaggtc ctcccggcga aaagtttagg 4020 gcgggcgagc gaaaatggga gcacctatat cctggcggag gggcgacgcg tcgcgggttt 4080 gcgcggaatg gggtcacggt ttagtctcac ggcggggcct aggtgagacg gctcttcccc 4140 ggcaccccgg cgtggcgttt cagtctctga ctcaatgcaa tcggtcttgc ggacgagtgg 4200 cctaacctta agcgtcggac ttcgagcggg cgtcccgcgg tccaaggcct cctttgggca 4260 gaggggagga ctgctgtctc ggtcccgccg gctagccacc ggcggtctcg accggatgct 4320 acactccccg aggtcttaaa gagaagggcg ggcgtcaccg tcccctctgc gtctaaccgg 4380 gtccggccgg ccggcaggga tgctcagggg gcgtgaggcg gtcttcgcag cgggtcccgg 4440 ttcgcggagc gacccccagc ccttccgctc gtttgcgccc ttcaccacga agggcacctt 4500 ccacgtctcg gaaacgatag tgccgaacga caggctttct atgtgttcag cttcctggcg 4560 gagacgctct ccccggtaga gactgtgctc tgtagaatgt cttccctgcg agcccgtgtc 4620 acgggtgtcc aagggggcac ccctcgatac tggggtgtct ccgccgatta gaagatcgga 4680 gggactacag ggttacgcgg gccgggcctg ttgaagacca gacgttcggc gggggcggtg 4740 accggcgccc cgagctcgcc gctcggtaat gcccggcagg cgggctcgcc ggctaagact 4800 cctccaatgt tggtcgggcc cgttgccgac caagaccgcg ggggcacgcc ggtgggggca 4860 gtggccggca cgccgagcaa ggccgctcgg taatacccgg caggcgggct cgccggcaag 4920 aactcctcca atgttgatcg ggcccgttgc cgaccaagac cgaggggaca cgccggcggg 4980 ggccgggccg gcacccccag caaggcctgc tcggtaatac ccggcaggcg ttcctcgccg 5040 gcaaggactc ctccaatgtt gatcgggccc gttgtcgacc aagaccgcgg gcgcacgccg 5100 gcgggggcag tggccggcgc cccgagcaag gccgctcggt aatgcccggt aggcgtgctc 5160 gccggcaaga actcctgcaa tgttgatcgg gcccgttgcc gaccaagcct gaggagacac 5220 gccggcgggg gcagtggccg gcacgccgat caaggccgct cggtaatacc cggcaggcgt 5280 tcctcgccgg taaggactcc tccaatgttg gtcgggcccg ttgccgacgg agagcccggg 5340 ggcacgccgg cgggggcagt ggccggcacg ctgagcaagg ccgctcggta atgcccggca 5400 ggcgggctcg ccggcaagaa ctcctccaat gttcgtcggg cccgttgccg acggagagcc 5460 cgggggcacg ccggcggggg cagtggccgg cacgccgagc aaggccgctc ggtaatgccc 5520 ggcaggcgtt cctcgccggc aagaactcct ccaatgttgg tcgggcccgt tgccgacgga 5580 gagccctggg gcacgccggc gggggcagtg gccggcacgc cgagcaaggc cgctcggtaa 5640 tgcccggcag gcgggctcgc cggtaagaac tcctccaatg ttggtcgggc ccgttgccga 5700 cggagagccc gggggcacgc cggcgggggc agtggccggc acgctgagca aggccgctcg 5760 gtaatgcccg gcaggcgggc tcgccggcaa gaactcctcc aatgttggtc gggcccgttg 5820 ccgaccaaga ccgcggagac acgccggcgg gggcagtggc cggcacgccg agcaaggccg 5880 ctcggtaatg cccggtaggc gggctcgccg gtaagaactc ctccaatgtt ggtcgggccc 5940 gttgccgacg gagagcccgg gggcacgccg gcgggggcag tggccggcac gctgagcaag 6000 gccgctcggt aatgcccggc aggcgggctc gccggtaagg actcctccga tcttggtcgg 6060 ggtcgaccgg gtcgggtccg gatagccagc ggtgatcagg tcaccaggct ccggcaccca 6120 ccccgtggag cgtttttcaa aaaagtgctt tacagggtgt ctgaccatgt tgtaggtcgc 6180 tcaatctaca ttgagagagt ggacgctttt tgacctgttc ggagtcagcc agccttccgc 6240 ggaccctgac gggtttctgc cgatggctgg ccagtccggg actggtcgac agggaccgtg 6300 gctgagcggg caagatttcc gcacggtgaa aagacgggct tgcccggtga cttttccctg 6360 tttggcgggt ccaacgggct ttgcaccggg cagcggtgaa gccttgggcg ggcggtgtgt 6420 gacctttgca ccgcccgact gcaatcgtgt cagaggaagc cccgggctgg ccatgcgagc 6480 gagcacgatg cggcggtggc aggaaggtcc cgccgagcct cgaggtgacg ccaaggcacg 6540 gttcctttgt acaggccttg ttacatcgga tgcatgaccg gatgtcggcc ataagggtga 6600 ccggcagctc acggcggcgg aacgaggtcg gcggcggttg gaggcgcctc tcgccgccga 6660 tgactacctc gtagcgggcg ccgtgtccac cacaggcggg cgcagcgcgc ttttttcggc 6720 ctaagttcgg cctcccggtg ccgagaccaa gcccgagacc gtgtctcggt ggtttttcaa 6780 aaaagtgctc tacagggtgt ctgaccatgt cgtgggtcgc tctgcctacc taggtagcgc 6840 gcgtagtcca gtggttagcg gccttgcctc tggaactaga agccgtgggt tcgatcccgg 6900 ctgtgtcgtt cacccgacat gcacgctacc ggaaagggtt gcagtcctta ggacaggacg 6960 ttaagccgtg gtccattgtt catggtaggg cgattgatct gtgagctgtt gtgggatgtt 7020 gatttgattg ctcacggacc aataaggagg ggaatgttcg tatgttttag gagatggcaa 7080 gcggggacga cgaagagata gaggaggaca ttgaaatgta tctccgggaa gcagcggtgg 7140 ctgagagagc cgaggcggcc ggggcgaatg gaactgttgg gcagaatatc gctggggagg 7200 tcgacagcga cgacagcgac gattttgact tgtattccgt cacgaagcag gtcagaggtg 7260 cccgagtcgg ggagttttca cgacggaaca aagcgagggg tcgagctgaa cggccgtggg 7320 catccgtcag tgccggtgag gttcatccgt acaagcccca gaaattcagg gcacccgggc 7380 ccgtcggccc gaggcgagcc gaggacaacc ggaatgaact gtctttcttt ctggaggtgt 7440 tggaccagta tcttatccgg tatatcgtgc gttgcaccaa taggtacgcc cggcagaggc 7500 gaagggtatt ccgcaaagaa aggcgctcgc gttggtcacc cacctcgacc accgagctga 7560 aagccttttt gggctgcatg atatgcgccg gctacacgta ccaaccgaga ctcgatttgt 7620 actggtctga tgacgatgac gttgggtgta cgctgatgaa aaggactttt ccttctcatc 7680 gttttcgtca aatcttgcgg taccttcact actcccccga agggttctgt cctcctggca 7740 gcgagcgtct gaagagtttt aaagcaaggg agaagttgga ccgagttatg aaggtacgca 7800 aagttatgga gagggtcagt cggaatagtt gccgtgcccg ccatgtcggc gagcatctcg 7860 tagttgacga agcgacagta gcgtttaggg gaaggcacag tcttcggcgc tacaaccctt 7920 cgaagccgac aaagtatgga tactgtctcc gagctctggc tgagacagac ggctacatat 7980 tgtccgacga agtgtgtgga gggcgtgcgg acccagagga catgtcgccg gacgaaagat 8040 gcatgcacgg cattggtgac gaatatgtca gccctcttga tctgaagccc gcccgagtcg 8100 tcaatagact gacgatgcca tttcagggat gctaccgcac cgtttattgc gattcctact 8160 acacaggagt ccaactggca gattaccttt tcaccaaaga cacttacatt gtggggacgg 8220 tgagacgaaa tgcgtcaggc ctaccgacgg aagggcatcc cgttccacgc aaacctaggc 8280 ctgggccagg caggccgcca ttgctgaagt tcccagaccc ctatcctaag agtgtaccga 8340 gaggcacttc tgtccggtac cacgacgggc cggtaacggt tgtgaaatgg caagacacca 8400 aatcgctcat actgctttct acgggaacta gtgtctgtgc tcccgacgaa ctaaccacca 8460 ggcgatgcaa gggtcccagt ggtgctaccc gactagaggt gacatgtcct gcagcgatga 8520 acatgtacaa tctccactac aaggcggtgg acacagccga ccagctccgg gcagggtacg 8580 aattcggtcg gccttcgaag aaatggcaca ggcaaatctt ctggtacgcc atgaacaaag 8640 ctgtggtgaa cgccttcttg aactggaaga tatacactgc tggcagacag gatctcccag 8700 cacatagggt tgcccagctg agctttcgac gggcgttagt gcggcagcta atcggcaatt 8760 tctccgcaag gcaccgaaca cccactgcga gcgtggccct gccaggagtc atcggcaacc 8820 accggcagat aagccaaggg aaacccacta aagcctgtcg ttactgcgca aaagctggcc 8880 gtcatatgcc acgagcgaaa cggtgcagac cgttcgagac tgtctacaaa tgcgagatgt 8940 gcaacgtgtc cgtatgccac gtcagtttgc gccccgagtg ttggcgtgag cacttgcagg 9000 acatgcaggg cgatgcatag atggtttagg accgaggaaa gtgtgttgca agaccggcca 9060 aatgttgtaa agttactggg agaagatgcg attctgtccg gtggcaactt tgtgtctaac 9120 cggcaactac cgataatccg acttgtaaag ttttaggact aaggaaaatg tgttgcaaga 9180 ccggccaaag gttgtaaagt tactgggaga agatgcgatt ctgtccggtg gaaactttgt 9240 gtctaaccgg caactaccga taatccgact tgtaaagtta taggactaaa gaaaatgtgt 9300 tgcaagaccg gccaaaggtt gtaaagttac tgggagaaga tgcgattatg tccggtggaa 9360 actttgtgtc taaccggcaa ctaccgataa tccgacttgt aaagttatag gactaaagaa 9420 aatgtgttgc aagaccggcc aacggttgta aagttactgg gagaagatgc gattctgtcc 9480 gatggcaact ttgtgtctaa ccgacaaata ccgatactct gcgtaataaa gtg 9533 // ID BEL-129_AA-I repbase; DNA; INV; 6146 BP. XX AC supercont1.127; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-129_AA_; KW BEL-129_AA-LTR; BEL-129_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6146 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.127; Positions 1984632 1990777. XX CC Positions [5173-5745] - Integrase core CC 'AGTGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 43..6144 FT /product="BEL-129_AA-I_1p" FT /translation="MMHSETNPSSDRNCTRCSKSDSAENMVGCDLCDAWVH FT FGCAGVTASIENHSWKCDKCKEKILEDTTQRSVSSRTSSVSSKASQRVQLR FT LQMLEEQRQLRLKVLEEEEAARKKRAAEEEAARRKRAQEEAEFIRQKYELL FT MDETPSDLETYGHDGKEQNMGENIEKWLDVGRGLVGPVTSPPPAVEGRQTQ FT QEIMPPSSVPSQPRPDTFGQTQVNAAPTSTSTPQAYRNSVLIPIEASASGF FT APETMTSINQSQMHQSRSAVEGLNTGLRTSFGAEIPKAATVSFNPQISYVA FT NPAADVQPTWMNNYNTTPIVSITQPIITCQSDFIGQQYPTYTAANPTSGPG FT APVGSMFVPGLAPHSHDGSQWYNWNQRQPAVSQTNRFGVTNSYNQPGPSAV FT QIAARQVMSRDLPKFTGDPEGWPMFRSAFYNTTQACGYSHAENLSRLQRSL FT DGPALVSVKSRLLLPESVPQVMDTLEKLYGRPEILIHSLLKKLRDVPAPKT FT ENLRSIIDFGLAVQNLVDHMTIAKLTEHLCNPMLLHELVEKLPAFWKMQWS FT SFKQTRPEVNLATFSSFMSVLVSTASDVTLPDLSTPVSTKIEKGVRNKSKL FT FVHSDTIQSSREAPKEKREEPAMKTCTFCTNASHEIASCSQFKALDIDARW FT KAMRLKGLCRICLVPHRRWPCRSGKVCEIDGCRLRHHALLHSHLPPVTQSS FT AATQGNVAHQNHHLVLPCTLFRYLPIVIENNGRKVHTFAFLDDGSSSTLLE FT AGIAAELGVEGPTDSLFLTWTGNVSREEKQSQRVSIVVSGRGLKNQFKLNN FT VRTVQRLKLPSQNLQYERLIQAYPHLKGLPIQSYSNAVPGMIVGVEHARLL FT STLKIREGRETDPIAAKTRLGWCVFGKQAGGEDSVQQLSFHGELRDENRHL FT HELMKQFFGIEEASVTIKPEAEEDTRALQILQETTQRVSKGFECGLLWKFK FT KFSFPDSFPMAVKRAVALERKLAKNPVLKGKVSELIASYEEKGYAHRITED FT ELQTTQRSKVWYLPLGFVQNPKKPEKVRLIWDAAARVEGVSFNDMLLKGPD FT MLVSLFAVLLRFRQKNVAICGDIREMFHQISIRSPDNQAQRFLYKPSPDRP FT PQIYVMDVATFGATCSPCQAQYVKNQNANDLADRYPTAAEAIIKAHYVDDY FT LDSTDTVDEAVQLLSDVKYVHSQGGFEIRNFCSNSTEVLRRMGQTKSPDYK FT SMNLDQTNDVERVLGMVWRPIADVFSFDISIRPDILKLLDESIVPTKREVL FT RTVMSLFDPLGLLAHYIIQGKILMQDIWRSGTDWDEPITENLRGQWYRWSQ FT LLWKLNTVQVPRCFFSGVSKEALNRLQIHVFVDASDTAYSCVAYLRIVESG FT APRCSLVAAKTKVAPLKPLSIPRLELQAAMIGSRLINSLCTSLNLQIKQRF FT LWVDSSTVLAWLRSDSRRYHPFVAFRIGEILSLTTVNEWHKIRSKRNVADE FT ATKWGSGPQFDPESRWYKGPSFLWESEEDWPAEEQHSVTTVDLRASFLHVH FT VVKELIIDVDRFSTWERLVRTIAYVQRAAKNFRRKTATGILTNNEWRAAEN FT LLWKQVQMQNYPDEYTVLHNNKLNTDKEPMFLSKDSPLHTQSPFLDDDGVI FT RMNSRISAAPTTPYEAKFPILLPKGQRVITLLVDSYHRRFLHGNNETVFNE FT MRQQFRIPHLRSVIKQVARVCQRCRIKKAVPLPPMMAPLPQQRLTPFVKPF FT SHTGVDYFGPVLVKQGRSLVKRWIVLFTCLTIRAVHLEVAYSLSTQSCVMA FT IRRFIARRGSPETFYSDNGTNFIGANRLLIQQLRTVHESCAVTFTNAKTSW FT RFNPPLAPHMGGPWERMVRSVKSAMAEIADYHKHPSDEVLETVILEAESVI FT NSRPLTYVPLDSEEGDALTPNHFLLYGTKGVQQPSRTLTDQMSNLRDSHTL FT AQNMVDRFWQRWVSEYLPDLTRRTKWHQPTRPLQPGDVVLVVDESRRNGWT FT KGRVIDVQKAKDGQVRSAVVRTVQGVIKRPAVKLALLEVAGTEGIPKESQG FT ASELHGRG" XX SQ Sequence 6146 BP; 1789 A; 1371 C; 1517 G; 1469 T; 0 other; aactttaaga tttcaacggt tggattgtgg catctaaatt aaatgatgca ctctgagact 60 aatccgagca gtgatcgaaa ctgcacccga tgtagtaaat cggactcggc ggagaatatg 120 gtaggttgcg acctgtgtga tgcgtgggtg cattttggat gtgcaggtgt aactgcctct 180 atcgaaaatc acagctggaa atgtgataag tgtaaggaga agattttgga agacaccacc 240 caaagatcag tatcttcgag gaccagttcg gtaagcagta aggcctctca acgtgttcaa 300 ttacgactgc aaatgttgga agagcaaaga cagctgcgac tgaaagtttt agaagaggaa 360 gaagcagctc ggaagaaaag agccgctgaa gaggaagccg cccgtaggaa acgggctcaa 420 gaggaagctg agtttattcg ccagaagtac gagctattga tggatgaaac accatccgat 480 cttgagactt acggacatga cggaaaagag caaaatatgg gtgaaaacat tgagaaatgg 540 ttggatgtgg gtcgtggttt agtaggacca gttacgtcac caccgccagc ggtagaaggt 600 aggcagactc agcaagaaat aatgcctcct tcatcagtgc ctagtcaacc tagaccagac 660 actttcggtc aaactcaagt aaacgcggca ccgacttcta cgtctacccc tcaagcctat 720 aggaattccg ttttgatccc catcgaagcg tcagcgtccg gattcgcgcc agaaacgatg 780 acgtcgatca atcaaagtca gatgcatcag tctagatcag cagtagaagg attaaatacg 840 ggattacgta cgtcgttcgg tgcggaaata ccgaaagcag caacagtttc gttcaatcca 900 caaatttcat acgttgccaa tccggcggct gatgtgcaac caacgtggat gaacaattac 960 aatactactc ccatagttag tattacgcaa ccaataatta catgtcagtc agacttcatc 1020 ggccaacaat atccgaccta tacagcagct aatccaacga gtggcccagg ggcacctgtg 1080 ggaagtatgt ttgtacctgg tctggctcca cacagccatg acggttcaca gtggtacaat 1140 tggaatcaac gacaaccggc agtttcacaa acaaataggt ttggagttac aaattcatat 1200 aatcaaccag ggccctctgc tgtgcagata gcagcacgac aagtcatgtc tcgtgattta 1260 ccaaaattca ctggagatcc agagggatgg cctatgttcc gaagcgcctt ttacaataca 1320 acacaagctt gtggttattc gcatgcagag aatttgtcta gattacagcg cagtttggat 1380 ggtcctgctt tggtatctgt aaaaagccga cttcttctac cggaatcagt accgcaggta 1440 atggatacat tggaaaaact gtatggtaga ccggagattc tcatccactc tctgttaaaa 1500 aaattgcgcg atgttccagc accgaaaacc gagaatctga gaagtataat cgatttcggt 1560 ttagcagtac aaaacctggt agatcatatg acgattgcaa agctgacaga gcatttgtgt 1620 aaccccatgc tcctccatga gcttgtagag aagcttcctg ccttttggaa aatgcagtgg 1680 tcgagtttca aacagactag acctgaggtc aacttggcga ccttcagctc gtttatgtca 1740 gtcctggtca gtaccgcctc cgacgttact cttccagacc tgtctacacc ggtctcaacg 1800 aaaattgaaa agggagtccg caacaaatca aagctgtttg tgcattcgga tacgatacag 1860 tcgtcaagag aagctcctaa agagaaaagg gaggaacctg caatgaaaac atgtacgttt 1920 tgtacaaatg cgtcacacga aatagcaagt tgcagccagt tcaaagcgtt agatattgat 1980 gcgcgctgga aggcgatgcg actgaaggga ctatgcagaa tctgccttgt tccacaccgt 2040 agatggccat gccgttcggg aaaagtatgt gaaattgatg gatgtcgtct gcgccaccat 2100 gcactattgc attctcattt gcctcctgtg acccaaagca gtgctgctac gcaaggaaat 2160 gtggcccacc agaaccacca tctagtactg ccatgtacac tttttcggta cttgccaata 2220 gtaatcgaaa ataacggaag gaaggttcac acctttgcgt tcttagacga tggatcatca 2280 tcaacgttac ttgaagctgg aatcgctgct gaattaggag ttgaaggacc aacagactca 2340 ctgttcttga cgtggacggg aaacgtctct cgtgaggaga agcaatcaca acgtgtctcg 2400 atcgtcgttt caggacgtgg tctaaaaaat cagttcaagt tgaacaatgt gagaaccgtc 2460 cagaggttga aactaccaag tcagaatcta cagtatgaga gactaattca ggcataccct 2520 cacctcaagg gtctgccaat ccagagttac tccaacgctg ttcctggaat gattgtagga 2580 gtcgaacatg cccgtttact gtcaacgttg aagattcgtg aaggcaggga gacggaccca 2640 atagccgcga aaactcgtct tgggtggtgc gtatttggta agcaggcagg aggagaagac 2700 tctgttcaac aactgagctt tcacggagag ctgagggatg agaaccgcca tcttcatgaa 2760 ctaatgaaac aattcttcgg aattgaagag gcaagcgtga cgatcaagcc tgaagcagag 2820 gaggacacac gggccttaca aatattgcag gaaacgacgc aaagagtcag taaaggtttc 2880 gagtgcggtc tgctctggaa attcaaaaag ttcagttttc ctgatagttt tccaatggct 2940 gttaagcgcg cagtggccct ggagagaaag ctggcaaaaa atccggttct caaaggaaag 3000 gtgtcggaac taattgcatc atatgaagag aagggttatg ctcatcggat cactgaggac 3060 gagcttcaaa caacacaacg cagtaaagta tggtacttgc ctctcggatt cgtacagaat 3120 cctaaaaagc cggagaaggt ccgcttgatt tgggacgctg ctgctcgggt tgagggtgtc 3180 tctttcaacg atatgctctt gaaaggaccg gatatgctgg tatccctgtt tgcagtgttg 3240 ctacgatttc gtcagaaaaa cgtcgccatt tgtggcgata tcagagaaat gtttcaccag 3300 atatccattc gcagtcccga caatcaagcg caacgtttcc tttacaaacc ttctccagat 3360 agacccccgc aaatctacgt tatggatgtg gcaacgtttg gcgccacctg ctcaccttgt 3420 caagcgcagt acgttaaaaa tcagaacgct aacgacttag cggaccgata tccaactgca 3480 gcagaagcaa taataaaggc tcattatgta gacgactacc tagatagtac ggatactgta 3540 gatgaggcgg tacagttgtt gtctgatgtg aaatacgttc attcccaagg agggttcgaa 3600 attagaaatt tctgctctaa ttcgacagag gttcttcgac gaatgggtca aacaaaaagt 3660 cccgactaca aatcgatgaa tctagatcaa actaacgacg tagaacgagt tcttggaatg 3720 gtctggagac cgatcgcgga tgtgttttcg tttgatattt ccatcagacc ggacatcctg 3780 aaactcctgg atgagtcaat cgtacccacc aagcgagaag ttctacgaac ggttatgtcc 3840 ttattcgatc cgttaggttt gttggcgcat tacataatcc aaggaaaaat cctcatgcag 3900 gatatctgga ggtcaggaac agactgggat gaaccgataa cggaaaatct gcgaggacag 3960 tggtacagat ggagtcagct gttgtggaag ctcaatacag tccaggtccc acgatgtttc 4020 ttcagtggag tgagcaagga agcactgaat agattgcaga tacatgtgtt cgtcgatgct 4080 agtgatactg cgtattcgtg tgttgcctat ttgcgtattg ttgaaagcgg tgcacctcga 4140 tgttctctgg tcgccgccaa aactaaagtg gcaccactaa agcccctttc gataccgcga 4200 ctggaactac aggcagctat gattggtagt cggctaatca acagtctttg tacatcactc 4260 aatcttcaaa ttaagcagcg tttcttgtgg gtggatagtt cgacagtact ggcatggttg 4320 cggtccgaca gtcgccgtta tcaccctttc gtagcattta gaattggaga aattctaagc 4380 ctgaccaccg ttaatgagtg gcataagatc cggtccaaaa ggaatgtcgc cgatgaagca 4440 acgaagtggg ggtcaggacc acagttcgat cctgagagtc ggtggtacaa aggcccatcg 4500 tttttgtggg agtctgaaga ggattggcca gccgaagaac aacattcagt aaccacagtc 4560 gacttacgag cttccttcct acacgtacat gtggtaaagg agttaatcat cgacgtcgat 4620 agattctcta catgggaaag attggtaaga acgatagctt atgtacaaag agcagcgaag 4680 aactttcgtc gaaagacagc aacagggatt ctgaccaata acgaatggcg tgctgcagaa 4740 aatctactgt ggaaacaagt acagatgcag aattacccgg acgagtatac tgttttgcat 4800 aacaataagc taaatacgga taaagaacca atgttcttaa gcaaagatag tcctctccat 4860 acgcagtcac ctttccttga tgacgacggc gttattagaa tgaatagcag aataagcgct 4920 gctcccacca ccccttatga agctaagttt cctattctct taccaaaagg tcaacgggta 4980 attacactcc tcgtcgacag ctaccatcgc cggttcttac acggaaacaa tgagacagtt 5040 ttcaatgaga tgcgtcaaca gttcaggatt cctcatctgc gttcggtgat caagcaagta 5100 gcacgggttt gtcaacggtg ccgcattaaa aaggcagttc cactaccacc gatgatggcg 5160 ccactgccac aacaacgact tactccattc gtcaagccgt tttcacatac gggtgtggac 5220 tattttggac ccgttctggt gaaacaagga aggagtttag tgaaacgctg gattgtattg 5280 ttcacctgcc tcactatcag agcagtccat ctggaagttg catatagcct ctctactcag 5340 tcgtgtgtga tggccatacg ccgtttcatc gcccgccgag gttcaccgga aacgttttat 5400 tcggacaacg gcactaattt tattggagcg aatcggttac taattcagca attgcggact 5460 gtacacgaaa gttgtgcagt cacatttaca aatgcgaaga cgagctggcg gttcaatcca 5520 cctctggctc ctcatatggg gggaccatgg gagagaatgg tgcggtcagt caaatcagcc 5580 atggcggaga tagcagacta ccacaaacat cccagtgacg aagtactcga aaccgtaatt 5640 ttggaagcag agtcggtcat caattctaga cctttaacat atgtcccctt ggattcggaa 5700 gaaggggatg cactaactcc gaatcacttt cttttgtatg gaacaaaggg tgtacagcaa 5760 ccttctcgca cactaaccga tcagatgagt aatctcagag atagtcacac attggcgcaa 5820 aacatggtgg atcggttctg gcagagatgg gtgtctgaat acttaccaga tctaaccaga 5880 cggacaaagt ggcatcaacc aacaagacct ctgcaacctg gagacgtcgt cttagtggtg 5940 gatgaaagcc gccggaacgg gtggactaaa ggcagagtca tcgatgttca gaaggcgaaa 6000 gacggccaag tacgtagtgc ggtggttcga accgtacaag gagtcatcaa acgccctgca 6060 gtaaaactcg cgcttcttga ggtggctggt acagagggca tacccaagga gtctcaagga 6120 gcttcggaac tacacggaag ggggga 6146 // ID Tx1-14_BF repbase; DNA; INV; 5613 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-14_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-14_BF; KW Tx1-14_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5613 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5613 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 851-851 (2009). XX DR [2] (Consensus) XX CC The consensus is of low quality: both ORFs are corrupted by CC mutations. XX SQ Sequence 5613 BP; 1746 A; 1014 C; 1025 G; 1807 T; 21 other; tacttcgcca gatacgagtg actgtgttcc tgtgggtatg cggaagcagg caaaacgggc 60 aagacagagc agcagtgacc aaacgagccc agataagcag ctcggcgcac agatgaaggg 120 cccccctccc acacctacga agatggcgag cggcgaggac cccaactttg acagtatttc 180 accggaacta cgtcctcttg tacacctact gtctacacaa agtaagtcac tgcatgacga 240 ggttatggct gagtttaagt ctgtccgaga ggagctgtcc gtacagatac agagcgtcaa 300 cactaagcta ggaaaaatgg cgtccgaagt ggatgacttg agagacagtg tcagctatca 360 gaacaaggaa atcgaagatt tacagcaagc actaaaagtt gaaaaacgtg aacggatcag 420 ggctacgtta cttgcggaaa gatactcgat gaaatcggac attattatcc gcggtttagt 480 gcaccaccag gatgagtccc cggatcaagt cgtcctcagt ttcctgtccg gccatcttgg 540 tatgagcttg gttccaccgt ttgttgcagt acaccggctg tcccgtccta cacagaacca 600 acccaaccca gctatgctgg tgcgactagt cagtctacaa cacagggagc tgatcctgag 660 cgcatggagg aagctaccag gagacaagaa gaagggccat gctgtacatg aacacttacc 720 acgtccattg cagcaagcca gggccaaact gatgccagag agagacagtg tgattgccaa 780 ggccaagact gataggagag aggcgagagt gttcatcaga gtacctccaa aagagacttg 840 tgccatcttg attgctgatg gtaaaactct gaaaacagtg gatgctgtag accttgtatt 900 ttcatgaact attattgata ccatgcctgt tgatatatct tgtttatccc tgaacaatta 960 ttgtgcaggt gtttgaaaag tccagtttgc atattaaagt tgttaaatat attgcaagtg 1020 tgacctcgtg tcatgtgttt ttctttaagt tatcattcat gaactatgct aaagctctgc 1080 ttaaaattca atattattgc ctatgtttta tttgtttgat ttctttgcaa agaagttgat 1140 tatattatgt actttttgaa cyayaytgat ttcyattatg ttwtgttgkt ttgagtttyw 1200 gaatttrykt katttctgaa gcagagatca tgttatgtta tggcttttga tttccattcc 1260 tgcttgattt gtaccaaagt ggtgattatg attatgttat gtttgttttg atttcttgaa 1320 tcagaccawt tcycatactt aytttstttg atcttattaa gagtaattat gttakgttaa 1380 tatgattttg tttttgyytg tatgatcttt tgtttcttgt ggaatagtac aattaatgat 1440 aagatttatg gcagacacac agtgccgcat ttaaatgatg ctgccaggag acaggaagca 1500 cagggctatt gtgaccctgt acatttatgt ttttgtgaac catccagttg cactttattt 1560 gatttttgtt gatgtttgtt cagttttgtt cctttttgtt agtacagatg ttatgatttc 1620 atcatcccca tcttgcccag atgaatttcc cccatgcctt catactwtag aatactacar 1680 catcttcgcc atcttcccca ccctgcgtga ccagtcccag aagacatgac tcaggtagaa 1740 ttacaaatct caagttataa ttgtaacggc cttagtgatt ctgttaagcg aagggagtta 1800 tttacttggt taagagacaa gccctaccat attatttgct tacaagaaac tcattcctca 1860 atacctgtag aagagagatg gcaaagtgag tggggaggta ccatgatttt ctgtcacgga 1920 acctccaatc aaaggggtac agccatgcta tttaaaaaca acattaaccc ccatatccac 1980 cagacaaagc ttgatgttaa tggtagatgg attattgttg atttgtctat agatgatttt 2040 agattttgta ttgtgaatat ttatgctcct aatgatgata aacctgaatt tttcttgaat 2100 ttatctatgg aactagacaa ttttgctgac agtactgaaa atcttattat tactggagat 2160 ttcaacactg tacaaaaccc cttattggac agattttcta ccaattgtac ttatcacccc 2220 aaagcttatg aagccatttc ggaatttaag agtaaattcg acttacatga tatttggaga 2280 tttagaaatc ctacaacagt gcgttacact tggcgccgcc gcagacaggc cagcagaatt 2340 gattatttcc tgactagttt ttcctttatt aatagagtaa acaaatgtgt catagccgat 2400 aagtttagat ctaatagcct tatcttttgt tattgcacag tttcctaggg ggaagggata 2460 ttggaaattt aatctgtctt tactagatga tagctctttt cagcataata ctgttaaaat 2520 tatgaatgaa ttttttagta taaatcaggg aacagcagat ccccatattg tatgggatac 2580 tgcaaaatgc tttttcaggg gacattgtat taaatttagt agttggaaaa acaaacaata 2640 tttagctagg gaaaaaaaac taacagatga tattaataga ttacaaagcc aaattgataa 2700 tatgccatcc ccctctgaat ctattttaga tgacttagat ggaaaacaaa aacaattaga 2760 attactttat aaccaacggg ctaaaggttt aatagtaatg tcaagagctc gctggatgga 2820 gtggggggat agatgtacac aatacttttt gaaattggca catagaaatt acactaggaa 2880 aaacatgcag aaactgaaaa tatctgataa tgtctttacc agtaatcccc aagaaatttt 2940 agaaaaacag tcagaattct actcttctct atattccttt gaagagcccc caatgcccct 3000 atccccagaa aacatagagc cctttttccc tgcacaatat aataaaaaac tgtccaatga 3060 acagcaactg ttatgtgaag gtcatatcac agatgaagaa ctacaaaaag ccatcaatac 3120 tttctcaaat tctaaatctc ctgggctaga tggtatccca gttgaagttt ataaacattt 3180 ctactcggta tttaaaaccc ccatgttaaa ttgtttcaat ttctcacttt ctaaaggtta 3240 tttgtccaat tcccaaaagc aaggccttat ctctttattg ttaaaggaag ggaaagaagg 3300 tcagcataaa gatcccacaa ttttagataa ttggaggcct ttaacattgt tgtgttgtga 3360 tgtaagaatt ctttcaaaat gcttagccct tagagttaaa acagttataa ctgatattat 3420 tcataaagac cagacaggat ttgtaaaggg gagattcatc ggggacaata taagacggat 3480 attagactta attgattatt atgagaaaga agaaaaacct ggcttaattt ttatttctga 3540 ttacaaaaaa gcttttgata ccataaggtg ggattttatt ataaaatcac ttgaattctt 3600 taattttgga ccacagttta ttgcctgggt tagagtttta tatagtaaca taacaagttc 3660 agttatcaat aatggttata tatctagtcc ttttacctta caccgtggag taagacaagg 3720 atgccccctt agcccctatt tatttatcat cgcaattgaa atgttggcaa ttaaaattcg 3780 tcggaatcat aatatatccg gattatctgt caaaggaaga acctcaaaaa tctctcagta 3840 tgcagatgat tccaattttc ccttcacccc tgatatttct tccttttatg gacttctgga 3900 agattttgat tccttttctc atatatctgg ccttctattg aattttgata agtgcaaggt 3960 tttaaggata ggaactctga aaaacacaaa ttttgtatta ccctcaggtc atcctattaa 4020 gtgggtagat ggaggtatag agatattagg tatttatatc cctgtagact taaccaatat 4080 tgtgactcaa aattttgacc ctcgcttagc caaaatagac aaacttctat tcccatggaa 4140 ggggaaaaaa ctgtcattat catggaaaaa ttacaattat caactccttg ctcgtccctc 4200 aattcacgta cctttttcag accttaccat tgccggacaa acacttcttt aaacactatg 4260 agaaaaattt tttttctttt atctggaatg gtggtcctga aagggtgggg aggaaaactt 4320 tgtacaactc tgtggaaaac ggtggtctga accttgtcaa tctctatgcc cacgcactga 4380 cacttaaagc atcctgggta cccaaactat tttttaaccc agactggttt acagcttgta 4440 tactacatac tcatccctgt attgaaagtg tattgtttcc cttttatcag ttaaaatctg 4500 cacaagacct cacactaagt cctttcttat cagaaatctt tcaagcttgg tacagctatc 4560 aatacaaacc tcctataacc cctgatgaga tcaggcaaca ggttttatgg atgaactcaa 4620 acatacaagt tggaggaaag cctattatct ttaattcttt tgttaataga catattatat 4680 ttgtaaatga tcttttgaat attgatggta caataatgtc ctacaatgaa ttgtgttata 4740 catatggtcc agtatgtaac aactttgtct atctacagtt acttactgca ataccaaaaa 4800 aatggaaatc tactcttata ggagttactc ccactctagt ttgtagaccc caacaacata 4860 gtagtgaatg gttattgaga attaagatta ataaggattt atacaaacat tttcttgtat 4920 ctgaaaaatt gattgatata ccacaaaatg tacaaatggc atggctgtac ctctttgata 4980 cccctattcc atggaaaaca gtatatactt ctgtaatttt ttgtactatt gaccctgtaa 5040 ctagattctt ccaatacaaa atccttcata aattcttacc taccaataga ttattatata 5100 tttggaaatg tgttaacagc cctttgtgct cactttgtca tatggaggag gagacatatt 5160 tacatctctt ctgggactgc ccccaagtaa ttgaattttg ggaaaaagta aaaacctggt 5220 tttctgaaaa aataggccaa acattaattg taaattcttt taatattatt tttggtaatc 5280 tcaatcaagg ggcctctata attggaaatc tgattatttt attggctaaa atgtacattt 5340 acaaatgcag agaatctaaa agagttaact ttaaatcatt tataaattat gtcaatttct 5400 ttgaaaaggt tgaagcccaa attgccaaca ggagagggaa gatggcaaag caccatggca 5460 agtggggaac tttagttaca aatactgcat cagtttctgt ttgatttatg ttttacataa 5520 tagttacgtt tcttttgtat atatgactta aggttttgga tggttcacct tttgtttatg 5580 tgttgtaatg ttatttgtga ataaaaaaaa aaa 5613 // ID BEL-42_AA-I repbase; DNA; INV; 5904 BP. XX AC AAGE02017346; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-42_AA_; KW BEL-42_AA-LTR; BEL-42_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5904 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017346; Positions 8551 14454. XX CC 'AACTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 106..4146 FT /product="BEL-42_AA-I_1p" FT /translation="MVGCDGCNNWYHYRCVGVDGSDIQRQEKWFCPTEACQ FT KAEVIAKKSEENRKRGSKGKKSLGVDNTSDKSSAKSDHLSGSSFEKKLKAL FT EKEQRSKEREIEEERQLWEKRMVMERILKEKRLQMEAEMRQREMKQEEELL FT QKALREKQEHLDRMERMRKEYQAKMADVERKLNTSGFEEMGLEKSQGCKVR FT MASLKLQERAGGINGKGSEQLGKVFEADDQEEEESSDEYSGESTSSEGEGC FT DEEYEEESEVESEADERVSRKKNRKQINKKGVPHGLGRQSAGPTKAQIAAR FT NGISRKLPIFSGKPEEWPLFIGSYEASNKACGYSDIENLVRLQECLKGPAL FT ESVRGQLLLPKSVPKVIEKLRQLYGRPEQLLQFHLEKVNRLESPKAERLET FT FIPFGNAVEQLCDHLEAADLKQHLVNPLLLQNLLDKLPSADKREWVRFKSN FT KRKVNLRTFSDFLSRIVSEACEANACGAVKVNETRNMKSGRVGMKEKGAIY FT IHSTASNSRDSTPLGNPPSGSVKPCKACKRTDHRLRFCQDFRAMTFADRMK FT VVEKGSICQICLNEHGRAPCKFKIRCDVEECNERHHPLLHPVGARVVMNTH FT FQIQGAILFRMVPVTLYSGNRMVKTLAFLDEGASITLVERSLTDKLGVVGV FT HEPLTITWTADVTRDERNSTKMNLWVSGSEEKILLKAVQTVKQLLLPKQSV FT DIGEMSATYRYLRGVPFSSYPSQRPGMLIGLNNLHAIAPMEAKIGQQGEPI FT AVKSKLGWAVYGPTKKLQDGQSFMGYHHTVSNEDLHDLLREHYAMEESVVA FT ISQESKEERRAREIMEQTTVRIGNRFEIGLLWRTDEQQFPNSYPMAVRRMK FT QLEQRLMKTPDLMDNVRKQIVEYQQKGYAHMATPEELAGFDPCSSWFLPIN FT VVLNPKKPGKVRLVWDAAATVNGVSLNSQLLTGPDLLTPLPSVINRFRERP FT IGFGGDIREMYHQILIRKADRRAQLFLFRNTPDQPPSVYVMDVATFGSKCS FT PSQAQYVKNRNASEFSKQFPEAAVAIEKKHYVDDYFDSVDTVEEAINRAKE FT VRFIHSKGGFEIRNWVSNSGTFLDELGEAKVSQAVHLNRDKDSGTERVLGI FT VWDPKDDQFSFSTDHRHDVKPYLLEGVRPTKRIVLSSVMGFFDPLGLLTPF FT TIHGKLVVQELWRTGCDWDEEIDDDSLAKWDRWTGLLPIVSEIRIPRCYFE FT GSLSTSFESLELHIFTDASIHAYGCAAYLRAVIDGSIKCSLVMSRSKVSPL FT KLQSVPRLELMAAMLGARMMHTVKSNHSLPITRQILWSDSQTVLSWIRSDQ FT HRYKQFVAFRIGEIVELTTSSDWRMSHRIKTSQTS" FT CDS 4176..5903 FT /product="BEL-42_AA-I_2p" FT /translation="MSDGPCFNGPHFLYEAENMWPEQVRTEVHNPEEMKAC FT VLFHGTAFSTPLIDVYATSRWRKLVRIAACVVRFVENCHRKRSGEPILVTK FT AHSSLERCIKGNPKRIVQPLCQEELHSGEIILLKQAQHDSFPEEIKVLEAR FT RDFNSTACVEKSSWLYSLRPILDADGIVRMDGRLALSKAIQFDQKFPIILP FT RRHEVTLKIIQSYHEKFGHANRETVFNELRQRFYIPKLRSVITSVVKGCNW FT CRVNRCQPQVPLMAPLPVQRVTPQLRPFCSVGVDYLGPVEVSVGRRKEKRW FT IALFTCLAVRAIHLEVVHSLTTEACQMAIRRFICKRGSPNEFFSDNGTNFK FT GASNEMIRAMERIKMECAENFVSPAIKWHFIPPSTPHMGGVWERMVKSVKE FT ALKAINDGRKLTDEVLLTTLFEAEDMINTRPMTYLPQDSGVEAITPNHFLR FT GIVKEIDQRVVHVEVGEALRNLYKRSQYLADKMWQRWVNEYLPNINRRTKW FT HDERGKLKVGDLVFVVDGNQRKNWSRGVVEDIIEGADGNIRQAMVRTVKGV FT LRRAVANLAVVEIPGKTGTDEAEPELRAG" XX SQ Sequence 5904 BP; 1752 A; 1191 C; 1614 G; 1347 T; 0 other; aactcaaaaa agatatgtcg gatttggaag cctcaaaaat gagtgtgaat ttcacaatga 60 cggcctgcgc tgcatgtaca atgacctcag acaccagatg aagggatggt tggttgcgat 120 ggatgtaaca actggtacca ctaccggtgt gttggtgtag acggatcgga catccaaagg 180 caagaaaagt ggttctgccc gacggaagcc tgtcagaagg cagaagtcat agcgaagaag 240 tcggaagaga accggaagag aggatcgaaa ggtaagaagt ctcttggcgt agataatacc 300 tccgataagt ctagcgccaa gtcggatcac ctttctggct cgtcgttcga gaagaagctg 360 aaagctttgg aaaaggagca acgatcgaaa gaaagggaaa tcgaagaaga acgccaactc 420 tgggagaagc gtatggtgat ggaacgtatt ttgaaggaaa aaaggctaca aatggaggcc 480 gagatgcgtc agcgcgaaat gaagcaggag gaagagttgt tgcagaaggc gttgcgggaa 540 aagcaagaac atttggatcg catggaacga atgcggaagg aataccaagc gaaaatggca 600 gatgtggaga ggaaattgaa tacttccggg tttgaagaga tgggattgga aaaaagtcaa 660 ggttgcaaag ttcgtatggc gtcattgaaa ctgcaggaga gggctggagg aataaatgga 720 aaaggatctg aacagcttgg gaaggttttc gaagcggatg accaggaaga agaagaaagc 780 tcagatgaat atagtggaga gtcaaccagc tccgaaggag aaggatgcga tgaagaatat 840 gaagaagaat ccgaagtgga gtcagaagct gatgaaagag tgtcaagaaa gaagaatagg 900 aaacagataa acaagaaagg agtaccacac gggctggggc ggcagtccgc aggtcccaca 960 aaagcacaaa tagcggcgcg taatggaatt tccaggaagc ttcccatatt ttctggtaaa 1020 cctgaagagt ggcctctttt cataggcagt tacgaggcat cgaataaagc atgcggatat 1080 tctgatatcg agaacctcgt aaggctccag gaatgcctta aaggaccagc tctggaaagt 1140 gtccgcggac agctgttgct gccgaagtcg gtgccgaagg tgattgagaa attgcgacaa 1200 ctgtacgggc gaccggagca acttctgcag ttccatctag aaaaagtaaa ccggttagag 1260 tcaccgaagg ccgaaaggtt ggaaacgttt attccctttg gaaatgccgt agagcagcta 1320 tgtgaccacc ttgaagctgc tgacttgaaa cagcatcttg tcaatccgtt gttgcttcaa 1380 aatttgttgg ataagctgcc atcagcggat aaaagagagt gggttcgctt caaaagcaac 1440 aagaggaaag ttaatttgag gacgttttcc gattttttgt cacgaattgt atcggaggct 1500 tgcgaagcca acgcatgcgg tgcggtgaag gtgaacgaaa cacgaaatat gaagtccggc 1560 agagttggaa tgaaagagaa aggcgcaatc tacatccata gcacagcatc aaattcaagg 1620 gacagcacac cccttggcaa tcccccaagc ggaagtgtaa aaccttgcaa agcttgcaag 1680 cgtactgatc atcgtttgcg gttttgccaa gacttcaggg cgatgacatt tgctgatcgg 1740 atgaaggttg tggaaaaggg cagcatctgc caaatctgtt tgaacgaaca tggaagagca 1800 ccatgcaaat ttaaaattcg ttgcgacgtt gaagaatgta acgaacgcca tcatccgctc 1860 ctacatccgg ttggagctag ggtcgttatg aacactcatt ttcagattca aggagcaatc 1920 ctattccgga tggtcccggt gacgttgtac tctggcaatc gtatggtaaa aactctagca 1980 tttttggacg aaggggcgtc gataacgtta gtggagaggt cgctgaccga taaattaggc 2040 gtagtaggag tccacgagcc gctaacaatc acctggacag cggacgtcac acgggacgaa 2100 agaaattcaa caaaaatgaa cctctgggtt tctggaagtg aagaaaagat ccttttaaaa 2160 gcagtgcaaa cggtaaaaca acttttgttg ccaaagcagt cagtagatat tggggaaatg 2220 agtgcaacgt atagatatct tcgaggtgta ccattctcat cttatccaag ccaacgtcca 2280 ggtatgttga ttggcctgaa caacctgcat gctatcgcgc caatggaagc caaaattggt 2340 cagcaaggcg aacctatcgc tgtgaaatcc aaactcggtt gggcagtgta cgggccgaca 2400 aagaagcttc aagatggaca gagtttcatg ggttatcatc acacagtttc caatgaagat 2460 ctacacgatc tcttgcgcga gcattacgcg atggaggagt ccgtggtggc catatcacag 2520 gagtcgaagg aagaaagacg ggctcgtgaa attatggaac aaaccacagt ccgaatcggt 2580 aatcgctttg aaataggtct gctgtggcga actgacgaac aacaatttcc aaatagctac 2640 ccaatggcag tgcgccgaat gaaacagttg gaacaacgtt tgatgaagac gccagacttg 2700 atggacaatg tacgcaagca gatcgtggag taccaacaga aggggtatgc gcacatggca 2760 acgccagaag agttagcggg gttcgatccg tgttcttcat ggtttcttcc tatcaatgtc 2820 gtactgaatc cgaagaaacc tgggaaagta cgactggtat gggacgcagc agcaacagtg 2880 aatggtgttt cactgaattc ccagctgttg accggtccgg atttgcttac tccgttacca 2940 agcgtaatca accgtttccg agagcgacca atcggtttcg gaggcgatat acgcgaaatg 3000 tatcaccaga tactgatacg aaaagccgac aggagagctc agctattcct tttccgaaac 3060 actccggatc aacctcccag cgtatatgtc atggatgtcg ctactttcgg atcaaaatgc 3120 tccccgagtc aagcgcaata cgtgaaaaac cggaatgcta gtgaattttc aaaacagttt 3180 ccagaagcag cagttgcgat agaaaagaaa cactacgtgg atgattattt tgacagcgta 3240 gacacagtcg aagaagccat caacagagcg aaagaggtcc gatttattca ttcgaaaggc 3300 ggcttcgaga ttaggaactg ggtgtctaac tctggcactt ttctcgacga actgggtgaa 3360 gcgaaagtgt cgcaagcggt gcatttgaac cgcgacaagg acagcggaac agaaagggtg 3420 ctcggtatcg tctgggatcc aaaggatgat caattttcgt tttcgacgga ccatcgacat 3480 gacgtgaaac cgtatcttct agaaggtgtt cgtccgacaa aaagaattgt tcttagtagt 3540 gtcatggggt ttttcgatcc gcttggtttg ctcacgccgt tcacaatcca tggaaaatta 3600 gtagtccaag agctgtggcg aactggttgt gactgggatg aagagataga cgatgatagt 3660 cttgcgaagt gggatcgctg gactggatta ttgccgatag tttccgaaat tcgaattccg 3720 cgttgttact ttgaaggcag tctttcaacg tcgtttgagt cgttggagct ccacattttc 3780 actgatgcaa gcattcacgc ctatggttgc gcggcatact tacgagcagt gattgacgga 3840 agcatcaagt gttcgctggt tatgtcccgc tcaaaagtgt ctccactgaa actacaatct 3900 gttccccgcc ttgaattgat ggcagctatg ctaggcgcaa ggatgatgca cacagtaaaa 3960 tccaatcatt cactgcccat aacaagacag attttgtgga gcgattctca aacggtcttg 4020 agttggatac gatcagatca acatagatat aaacaattcg ttgccttccg tataggagag 4080 atcgttgagc taacgacatc aagtgactgg aggatgtccc atcggataaa aacatcgcag 4140 acgtcgtgac aaaatgggga tcaggaccgc ctttgatgtc tgacggtccg tgttttaacg 4200 gacctcattt cctgtacgag gcggagaata tgtggccgga acaagtgaga acggaagttc 4260 ataatccgga ggagatgaag gcgtgcgtcc tgtttcacgg cacggcgttt tcgactcctc 4320 tcatcgacgt ttacgctacg tcacgatggc gtaaattagt tcgaatcgct gcttgcgttg 4380 tacgcttcgt tgaaaattgt cacaggaaaa gaagcggtga gccaattcta gtgactaagg 4440 ctcatagctc gctggaacga tgtataaaag ggaacccaaa gcgaatcgtg cagccgctgt 4500 gtcaagaaga gcttcattct ggtgaaatta ttctactgaa acaagctcaa cacgacagct 4560 tcccggagga gatcaaggtg ttggaagcta gacgggactt taactcaacg gcatgcgttg 4620 aaaaatcgag ctggctctac tcgttacgtc cgattcttga tgccgacggc attgtcagaa 4680 tggatggcag attagctttg tcaaaggcaa ttcaattcga tcaaaagttt ccgataattt 4740 tacctagacg ccacgaagtg acactcaaga tcattcaaag ctaccacgaa aagtttgggc 4800 atgcgaatcg ggagaccgta ttcaatgagc ttcgacaacg gttctacatt cctaagctac 4860 ggtctgtgat aacgtcagtc gtaaaaggct gtaattggtg tcgtgtcaat aggtgtcaac 4920 cacaagttcc attgatggca ccgcttccag ttcaacgagt cactccgcag ttgcgcccct 4980 tctgctcagt tggcgtcgac tatctcggtc cagtagaagt atcagttggt cgacgaaagg 5040 agaaaagatg gatcgccttg tttacttgct tagcagttag ggcaatccac ctggaagtag 5100 tacatagcct gactacagaa gcatgtcaaa tggcgatcag acgatttatc tgtaagcgag 5160 gttctccgaa tgagtttttc tcggataacg ggacgaattt caagggcgca agtaacgaga 5220 tgatacgagc gatggagaga atcaaaatgg aatgtgcaga gaacttcgtt agtccggcga 5280 tcaagtggca tttcattccg ccaagtactc cccacatggg tggtgtgtgg gagaggatgg 5340 ttaaatcggt caaggaggcg ttgaaggcga taaatgatgg taggaagctt actgatgagg 5400 tgctgttgac cacgttattc gaagcagaag acatgataaa cacacgtccg atgacatatc 5460 ttccgcaaga ttccggagtt gaagcaataa cgccaaatca ttttttgcgt ggaattgtca 5520 aggagataga tcaaagggtt gtgcacgttg aagttggaga agccctacga aatttgtaca 5580 aacggtcgca gtatctggcc gacaagatgt ggcagaggtg ggtaaacgag tatttgccga 5640 atatcaatcg gcggacgaag tggcacgatg agcgagggaa gctgaaagtt ggggacctgg 5700 tgttcgttgt cgatggaaat cagcgaaaga actggagccg tggagttgtc gaagacatca 5760 ttgaaggagc tgacggaaac atacgacaag cgatggttcg gacggttaaa ggcgttttaa 5820 gacgggctgt agcaaaccta gcagtggttg agattcctgg taaaaccgga accgatgaag 5880 cggaaccgga gttacgggct gggg 5904 // ID DNA8-15_AP repbase; DNA; INV; 243 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-15_AP. XX NM DNA8-15_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-243 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1757-1757 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 243 BP; 81 A; 50 C; 41 G; 71 T; 0 other; accatgacaa attatatagg tatcatcaca gcattttgat acgcatcctc cgctctttac 60 ttcgcctata tcgtagttct ccacttctta ttggctgtca atcatataca tacatatata 120 acatagaaca aaatcaacca atgagaagtg gagaactacg atataggccg gaggtgaaga 180 agatgcgtat ctactgcgca aaactagtac aactgtgatg atacctatat aatttgtcat 240 ggt 243 // ID Gypsy-2_DVir-LTR repbase; DNA; INV; 237 BP. XX AC scaffold_7526; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DVir_; KW Gypsy-2_DVir-I; Gypsy-2_DVir-LTR. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-237 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (07-MAR-2011). XX DR Genome; scaffold_7526; Positions 634 398. XX SQ Sequence 237 BP; 52 A; 64 C; 53 G; 68 T; 0 other; tgatatattc tgcttgcaga tcgtagtcgc tccttatata tcgtggtatt ccgtcctacc 60 ttaactctgt ctaccttttt tttctttatt atacttgacg cgtcgagtgt gttagggagc 120 aaaagaaccc acgccacttc cctgagcacg gtcggaaccc gtagacggag tacgctccca 180 atgcacaaca cgagtgttag gcaagacgcg cccagcgtag ccgtcgtgtt cgttaca 237 // ID Polinton-5_NVi repbase; DNA; INV; 18379 BP. XX AC . XX DT 15-APR-2009 (Rel. 14.04, Created) DT 15-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-5_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-18379 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 795-795 (2009). XX RN [2] RP 1-18379 RA Bao W. and Jurka J.; RT "Polinton-5_NVi."; RL Direct Submission to Repbase Update (11-MAY-2009)Polinton-5_NVi RL reach the 5'-end.. XX DR [2] (Consensus) XX CC The consensus may be incomplete at 3'- ends. XX FH Key Location/Qualifiers FT CDS 16719..17384 FT /product="Polinton-5_NVi_7p" FT /translation="MEAMDAYLNHSFTFYQDFGNPSKILLPNHDVTFTNSF FT GARAIGIDERPTPTSAPTAEMTTHNGQQFISSVGGGGGHDEETQQHPPAQP FT ACKKQKKYESPPSVIMHQQTFAGLRQVKECIDLRFKQLEELSGLVNRGVDS FT ILDYIKDALKREEAELCREILKNVNSFKQYYALHKYKIDEIVKTNLNFATV FT SVDVLDVVLTEIYAFGLARIAKDVYDVMFPL*" FT CDS 8968..9630 FT /product="Polinton-5_NVi_3p" FT /translation="MDTRWKHPFTACIAGPTGAGKSWFLKKFLEHLPEMCD FT ARFDRVLLYYTEWQETYRSEFKVGGSAAVPIEFREGLPQRSDYSVDTGKKK FT LLILDDLMRESSNDIVLDLFSRSSHHLNLSVFLVTQNIFHQGKSQRDISLN FT SKYLVIFKNPRDRAQIRHLARQIYPEDPRFLQEAYIDATRGPHSYLFLDLT FT QAAVDEYRVRTCIFPSDDIQYAYVPKFGKK*" FT CDS join(14973..15671,15620..16072) FT /product="Polinton-5_NVi_6p" FT /translation="MTDYVIDIQGFRDINKKFIPKEVGVLSLQSRIVGHWI FT IRAPCSFTELPADMQQTNNYCTLDVHGLEWHDGDISFKNLRRNLCNLVKDA FT RRIYVRGQAKAKFLEKVVARRIINLEDFSAPSFDELSMQFPNVLLCTSHGI FT KKFENKKNFCALRRAYQIRRWIHSIVASPHRSDPYDINSQVMYQAILDYDR FT LQMERYHYPERFRASDVVDGIKKEADEKKDGAVEEEYVDAPSTREKRRCGR FT RRVRRCTINMNTRQIITAIRGLKADSIGVYAANHVPKLLSTPTAIVTNLDT FT SDQPGSHWVAIFIDKNGYGIYFDSYGVAPVSKHHLDRLGKNCTRFDWNKKQ FT VQSVDSKVCGEHCIMFLYHMCSGISLRKYLRFFFIRLCQE*" FT CDS join(2663..3055,3062..3586) FT /product="Polinton-5_NVi_1p" FT /translation="MRVIERASLPRVAEMSSATNEVRDAILRVANALKGRH FT LPVQRKCTREWYVRVHRMCEERLRRELRCLLTHCAYRWIADFYQRYFKHSS FT VVIKIVGSFLVERGARDVYCLRHIIEEHGKCLQCSLQSVFKKKYFIFVLLV FT ARITLQEDAEESDSDATLFVVSSPPSPQRATSSEEEEEEEEQEEQEQEEQE FT DIQAPMEEPMEDDAESEIEDSDHGDDRPGVSEQAVYRAGAPAPLRGSEQTM FT EAAMILLSFRYEPAPEFGPTRAEVFEGFATRDETAEEVAMNPYARFDQYSS FT FDRRINESKRNFF*" FT CDS join(12405..12851,12823..13449,13453..14823) FT /product="Polinton-5_NVi_5p" FT /translation="MLQRKCACVDQHQTTFNEKICRIFGFEDSETRRKNGT FT FVTSSMGSVVALGNRPASLSRAIPDQLYVYTDVCEPHTVGDTQAALLRIVS FT VDSAKYKFGSNIVRHFAPAHYIPLLHHSFRSIVIDIRDQHGVRIPFEYGTL FT TVTLHFKRNRRLRFTLNVIARERKKERERERERERERATLCHDESLHTLLR FT RASRRWRSETRLFRFNVSARTRLGAWLGGLFRKILPYIASGARAVGKEAVR FT TGINVLDDVANNGSNFKEAFKYRTKESGKKLKRKAGEKIAEMMKGSGYKSR FT AMHRRRQLRKTRTSSTIGSSKKRKQHTAGRKKRVTRKTKKRPKKRSTQQQQ FT QSLRSVTDIFGPRRIFGVKMAFLHSHSTECMSSELDLFTLPATQTSIESSS FT FLHYKPVSSLSDDVDAPLEFVVPAGSEHYFDLAHTMLHVQAKIVPADEATA FT TTEDLKVGPINNFMHSMFNQIDVFFNQKIVSPPNNAYPYRAYIETLLNYAP FT AAKESHLTASLWYDDTSGGFDSPANAVSTATAPMIVNKGLENRKYFTQNRR FT YFDMIGHLHHDLFNQDKMLINGVEMRVRLVRSKDAFCLMDATADGKFKLSI FT KEATLIVRRVKISPGVLLAHAQALSKTTAKYPITRVEVKSFTLHSGILGDS FT IDNVIHGQLPKRIILGFVENKAFNGNRALNPFNFQHFSINYISLYVDGVQI FT PSKPLQPRFTGLDKLYIDAFQTLYTGTGVHFLNEGFGINRYNYYKGNFLTA FT FDLTPDLSAHCATHWNLVRSGSIRIEVRFETALLTAINCIVYAEYDNVLEI FT DSSRQIVTDFSA*" FT CDS join(9683..10111,9957..10907,10832..12055) FT /product="Polinton-5_NVi_4p" FT /translation="MAPTTSKKKQAGCCIPVAALDTKHFNRKRVAVLNALC FT HTDGQQRTALLRTADKRLVRCICECALNILQGVIKIKDSHKRRLKKHKNVL FT RKLTICNNKKSVSDWQKKKRVLVQHGGAFLPLLLAPIIGALVSKLFNSKSA FT AAEDRFAIIRNLSAIGRKRNEFSYSTVVLFCRFYSHLLSGLLSRSYLIQSQ FT QQRKTKMDRARKMIIVPEDMFSRMQNVQPAAAAPPVVNESCENSVQTCGDN FT LSRLDSEMHTILHSNKFTDEREKCQNYLQVLRRYLFFKDSERHAEHDYERD FT VDEIEATSAPMTEEIILDSVPKSYAQKTRLLLKHWKTVAGDRLKWDSTGRV FT SIDGRPIQDSNIIELISDVVARRKKTLDQSEVPIGRLPFAKFIKTVDTPIK FT LIGNPEIVKIGKNLTEPNNTAKRRLPSKESIDALMEIHNASSSESPASPAL FT TRNRAKNKWLRLANFQQRITGIACFDEKSSKEQVATSRKLLSMSSSSYEQQ FT YDVSHEAGYAGARNLVRVNRKGKLFSAADTREKERIYEWLSNQDAYTLHRP FT VKRKFPRLSYNVSNIDDVWECDLLQLTTIKEHNDGYCYLLVVVDVLSKHAW FT LEPLRDKTTANVTAAFGRILERSNGRVPILLQSDKGKEFVGSTFQEFLKKR FT DIKFRTARNPDIKAAVVERLNRTVRERMWRYFSHNNTQRYIDVVQKIIEAY FT NHTQHSGTKMRPCDVSIYNAAKARENLAKRARLQSTYKNREKMASGVINKY FT KPGSYVRISRTKNTFEKGYEKNFSEEVFKIKRISNRQQLPTFILEDLNGEE FT IDGFFYLEELAHVGTKRMSDAAEQFKIERVIRTKGRGSKKQLLVKWAGYPD FT KFNSWIKASEVQKI*" FT CDS join(4859..5935,5835..6611,6404..7855,7734..8549) FT /product="Polinton-5_NVi_2p" FT /translation="MLSWVELAIRDIYEYLLTLIKSGSDRVGVSISSEYFA FT NGSAGLSYRLASALTGDDLWSVVSGLVQSNANFRVDESFTVCLTVVEMPVG FT AGGSRRKNLTVDSVSQRSIIQISNRDNLCLPRALVVGEAKIALQTNDTPAM FT RAEWNVIRDGRRQLQHERAVKLCQDAGVLVPREGCGVQELRQFQHYFTARG FT TALVAYEKDSMGSGEAPFFDGRCDACTTVIFLLYSELHYDVIVNIFGAAKA FT KFFCTFCNRGYSNIEDHNCRQLCSRCFVSPPCSENEALQKCASCNRSFFGD FT VCFRNHSVAGSYKAKHKRLCDVLQICLTCCKSVNRQRGKHECGKNWCRHCK FT STHGYNQHCFIQPKKKTGSAVSTSAAKIGVDIASRHTVIINTVLYSRKKND FT ASAKKYLYVFYDLETMQEASYRGDESIKMHVPNVCVAQQVCTECISSDDIS FT EWCPSCGVREHIFTEDPVGQLLQLVVRDKTDFENIICIAHNARGYDAQFVL FT RRLVERKINCAPSIILNGQKILCIRYGRTKFIDSLSYFQMKLAALPATFGL FT GETTKKGCFPTFLTPRLTPNTVAPYRMRTIIRPTLCRRASASSSCCGTARC FT VQRTQYSISARKSSTTVAWMDYEKRLFSHLFNTSANAEYRGAIPDAHYYSP FT DTMSSSEREQFLLWHSEMRAANAVFDFSKEIVDYCRMDVTILRRACVAFRK FT IFLEVGDTDPFVVATTIASACSHLYRKNFLKPQTIGIVPRGGYRRADKHSQ FT KAVEWLLQCEREIGREIVHAGRAREYRLPEGFLVDGFLPSTNPAENPIVFE FT YQGCYTHGCPECFKNNRNKPNAWGRTFDALLESTRAKIAQIQQLGYETREI FT WECEFDRVKRENPEIAKYVSEHPLISKITLNPRDAFFGGRTENFVAVYDAK FT PGERILYTDICSLYPYICKRGRFPLGHPKVYVGEESDELTGGNLNNFSAID FT GLVKCKVLPPRKLYQPLLPIKMHGKLLFALCRTCCEEMRQSDCCHPDASQR FT EFSGTWVADELRKAIELGYTITEIFVIWQYSMTEYDPLTGEGGLFAGYIDT FT FLRLKQEASGFPAWCVDAESKARYVREYRENEGISLVLGQVRTARSRKHRG FT FQLGALTRNRKRVTFASIAKMRGYRSFWGKFGQRDNLKQTVIVKEREDLLK FT LLTAEDKEVLSLLLVNEEVMYASWQYIDDAVESTPYTNVVIAAYTTTLARL FT KLFSYLEKLGKRTLYCDTDSCIFVCNENAQEYRPPLGSLLGDMTNELDEGT FT YITSFLSGGPKFYGYRTVNSRTGEVSEKCKVKGISLNFSNSLRINYPSIKS FT MIENYFKPDEENECINLKFNSIKRLPDHTVVTREEVKSCSIVLKKRRYVTP FT GLSLPYGYKS*" XX SQ Sequence 18379 BP; 5084 A; 4222 C; 4522 G; 4550 T; 1 other; agtagtatat gggctggtat ggagagactg ggcgtggcct agagtgacgt caccagtccc 60 gtggctagcg ccagttgttt atgtaaacag aagcccatgt ttatatcaac agtagctgat 120 aaggttaata aacatgattt acgaccgcta ctgtcataaa cagtctgtca tcgatgggtg 180 acttttattt gtttatttat aaataattat aaataatttt taaacaaaag ctatattacg 240 catacgatgc cagccctccc ccttttaccc caagcagacc cgagcgcgac tctctctctc 300 tttcgctgtc tactccatac cggccccgag ctcgaacgca cgtgcgcgac tctcgctata 360 ccagcccgta gcccccctct ccttttacac tatagtagcc cctgagctct ggcgcggtgg 420 ctcccgccgc tgctgtaccg gacccgcttc tctctctctc tctctctctc tctctctctt 480 ccctctctac gccatactcg cgcgcgtccg ccaaagccgc cgtctacctg tcgctcgcgt 540 atacctgctg tttggctttt attgccctag cgccgactct gatcctttct gcgctgtaaa 600 gagagagatt gttccacgta tataatttat atggtggccc tgaaaagggc cgattgtttt 660 ttgaaatata tattattgta attagcgttt cagcttgctg tagccgtctt tttcggggca 720 cgaaaaggtg tttgtggaaa aggatttttg attctgtgac gaggaggagg aggatttttt 780 agacagatta ttataaccgc ttttctctgg atacgagaaa gtgttggttg agaatgacga 840 ctgattgttc tgctttcttg gctccttgct tctcatgaga ggctgtctct cctcccacgt 900 agaacctttt ttctgctctt tgctctgctc ctgtcccatg gtcgagttgc tgctttgccg 960 gtcgcttgac gaatgattcg catcgaaatg aacgcgtgca tatataactc tactactcgg 1020 tgccgcaaag cttactctaa atttcggctt tggggatcgt tgatgggagt gggggttttg 1080 cacgcataaa aagagaaatt ctggcacaat atttatttac taaaaaacgc gtacgagtcg 1140 aatgactcag cttacaaggt aatgaaataa taaaatgaca cacgagcgcg cgcacacaca 1200 cacacacaca cacacacaaa catacttaca cagccacaca cacacacaca ctctaaattt 1260 ctacgctata aattacaaac aagcgataca cctacacttg cgagtggcgc gttgttggcg 1320 cagttgacgt gtgggccctc ctcgcagtaa accaggcacc aactctttcg gcggagtagg 1380 catccgatga aagttgtaag gtacgctgcg gttgcggcgt gctgtcggtg gcgtaccttg 1440 gacgttgttg tcgtcaccgt cgttgtcgtc gcctgctgat gatggtggag gcgaaggcga 1500 tctcggcgaa tccatatcca ggatgcacgg gcagggctgt ccgttgcact tccagcagcg 1560 aaagagaagg cacctccagc actgctgcca cacgtcgtgc tccgaactgt tggtgatgca 1620 cactctgcac ctcacgttga tcggcacaaa ctgtaccgcc agagaagcct ttcgctcggc 1680 ttcttgctgt ttttgctgca acagcgctga catgtccagc attctctgct ggtagcgctc 1740 gagctggccg ttcagcttct ccattgcacg ttttttagct tcgatacttg cgagaagcgc 1800 atcgcgttgg gctgctaatt ttttcaattc tgccatgatg gtggccacgt gtgtgctctc 1860 tgtagggact cagagcggac tgatcggtgc aagccgtgtc ataaacgtaa gccgttggca 1920 aaaatttctg cacgtaggta gagggggacc gcacctctcc actctttgcc ttttaacgac 1980 gttcgttgtc ttaagaatgc gagaccgttg gtaacgctat actcggactg gattatttgt 2040 atacacttga gctccagcgt cgtgggctta tcagaaaaaa tcggagacgg gtgagagcga 2100 aagaaaggtt ctcaatgtct cgcgaatcag ctccaggaag tatcgtgtga tcatggactc 2160 gggctttgtt ttattatcgc caacggatta gttttcaaaa ctaagcagag aacgaatcat 2220 cggatctccg gtagacactt gagaataagt gtaaactgtt tgtacagaga ttacaggtgt 2280 ccggtaagag agattaaaaa gggtcagccg tagagtcata aaaaactgtt cggtccgtgt 2340 caccggatct agcggtggcg acggtccatt ctctgagacg tccgctatcc gcctaaggtt 2400 tatgactgct acgtgcctaa agcgcaaaca gcgagtacgg tatcggattt atttctttcg 2460 atcactggtg gcagcctgga tttctcagct atggctgtaa tatgcgctgt tttaaaacaa 2520 atccccctct ttaacggttt ttctgctctg cgatcttctt attcgatgaa atcactccta 2580 agaaaggcat gatattttca tatcagccct gagccgtcta caaaactcct ccgtacgcta 2640 gcgcagtcat cagtctctcg aaatgcgtgt cattgagaga gccagcttac cacgtgtcgc 2700 agaaatgtcc agtgctacga acgaagttcg cgatgcgatc ctgcgcgtag ccaacgcgct 2760 caagggtcgt cacctacctg tgcagaggaa gtgtacccgc gagtggtacg tgcgcgttca 2820 tcggatgtgc gaggagaggc tgcgacgcga gctgcgctgc ttgctgacgc attgcgcgta 2880 ccgctggatc gccgattttt atcagagata ctttaaacat tctagtgttg tgataaaaat 2940 cgtcggcagt ttcttggtgg aacgcggtgc acgcgatgtg tactgtctga ggcatatcat 3000 cgaggaacat ggtaagtgtt tacagtgttc cctacaatca gtttttaaaa aaaaataata 3060 gtactttatt tttgttcttc ttgtagctcg aattaccctg caggaggacg ctgaggagag 3120 cgacagcgac gcaacgctgt ttgttgtgtc gtcacctcct tcaccacagc gagccacttc 3180 atccgaggag gaggaggagg aggaggagca ggaggagcag gaacaggagg agcaggagga 3240 tatacaagct cctatggagg agcccatgga ggatgatgct gagagcgaga tagaggacag 3300 cgatcacggc gatgatcggc ctggcgtctc cgagcaggct gtctacagag cgggcgctcc 3360 agcaccgctg agaggtagcg agcaaacgat ggaggctgcg atgattctgc tctcgttccg 3420 ctacgagcca gctcccgagt ttggtcctac gagagccgag gttttcgagg gctttgcgac 3480 tcgggatgag accgctgagg aagttgcgat gaatccgtat gcacgcttcg atcaatacag 3540 ctcgttcgac cgtcgtatta atgaaagtaa gagaaatttt ttttaacttt taaattttct 3600 taaaaaatta ttattactaa ctgttcgaaa attagtttaa tatacgaaca gagaacgaga 3660 aggttcgaga agcttgttat cttattttaa aattcggcgg ttttcgctag accgcacggc 3720 accagtgagg ttgccatggc gacgcgagct cgctcgctcc tactctatct ctctccgcct 3780 gattcgcgac cggtcttctg ccgctttcgc cgatttgtgt tatttagtta tcgtattgcc 3840 gaagcgctgt aaggactcgc ctcacagtga cgaccgttaa acagtatcgt ttgtttatgt 3900 atcagtactt ttaacactat agccttcatt tcttttccgt agtttcaaca tttgtaaata 3960 cactcgatct tacaataaat atcaaagcct tttcttacgg catagtaatt caagttaacg 4020 atggagagtg tttttgttta ttgccagcta gcccttcgag atccacaagt ctgtaacatc 4080 agttcgagag actataaata ctaacaataa taatttttta agttatcaga cgtcgtctga 4140 acgaaggcat ggagccctgt ccggatgcag atccagaaaa tgacgcggct atgatttttc 4200 aagctgctgc gctacgaatc gtgaataacg gtcgtgcgca ggaagtacgt tggttaaaca 4260 acttgcttga aaaatcagca gacgagttat tttctatgac tgcgttcgag atggccgagt 4320 tgttcagcga catcattcaa tgctacaacg gtcgtgcgga agaatgaagg ctcgcttctt 4380 tgcagtcgac gacgataatg tcagcagcag cagcagcagc gaggaagagg gcccatcatc 4440 gccggcaata tcatcggact cttcaacagt gctgagccct acattgtcga tgctgcagga 4500 ggtcgggttc tacgatgcgg gtcacccgat catcgatgac agcacagcga gcggcatatc 4560 gcaggtaggc ggcggcggtg acggcggcga tacctcgagc gaagagcgcc ggcggtgacg 4620 gcggcgatac ctcgggcgaa gaggcaccgt cgccgaacaa tagtattgta agcggttctg 4680 ctgcagcttc tgattcggat caggaggagg gaaggaggag gaggaggagg aagctcgtga 4740 cgatgagcgt tatcgtcgct tcgaaattat cgcgaatacg gtgcatcgcg tacggaaatt 4800 ccgtacagac gctcgccgtc tgacgctacg gattaaaaca ccgccgcgtg ggaccaacat 4860 gctcagctgg gttgagttag caatacgtga catttacgag tatttgctaa ctctaatcaa 4920 aagcgggagc gatcgcgtcg gtgtttcaat ctctagcgaa tatttcgcaa acgggtcggc 4980 cgggctctcg tatcgtctcg cgagcgcgct gactggcgat gatttatggt ctgtggtgtc 5040 tggactcgtt cagagcaacg caaattttcg agtcgacgag tcatttaccg tttgtttgac 5100 cgtcgtagag atgcctgtag gtgcaggtgg ctctcgacgg aaaaacttga ccgtcgacag 5160 cgtttctcaa cggtccatca ttcagatcag caatcgtgac aatctctgtc tgccgagagc 5220 gctcgttgta ggcgaggcga aaatcgcgct tcagacgaac gataccccgg cgatgcgagc 5280 cgagtggaac gtcatccgcg acggtcgtag gcaactacag catgagcgag ctgtaaaatt 5340 gtgccaggat gccggtgttc tcgtacctcg cgaaggttgt ggtgtacaag agcttcgtca 5400 attccagcat tattttactg ctcgtggtac agctttagtt gcttacgaga aagactcgat 5460 gggcagcggt gaagcaccat tttttgacgg acgctgcgat gcctgcacaa ccgtaatatt 5520 tttgctctac tccgagcttc attacgacgt gatcgtcaat atttttggag ctgcgaaggc 5580 aaaatttttc tgtacttttt gcaatcgcgg ctacagtaat atcgaggatc acaactgtag 5640 gcagctgtgt tcgcgctgct tcgtctcgcc accctgtagc gagaacgaag ctttgcaaaa 5700 atgcgcatcg tgcaatcgca gttttttcgg cgatgtatgt tttcgcaacc attcggttgc 5760 gggatcatac aaagcgaagc acaagcgatt gtgcgatgtg ctgcagattt gtcttacctg 5820 ctgcaaaagc gtgaaccggc agcgcggtaa gcacgagtgc ggcaaaaatt ggtgtcgaca 5880 ttgcaagtcg acacacggtt ataatcaaca ctgttttata cagccgaaaa aaaaatgatg 5940 cttcagcgaa aaaatatctt tatgttttct acgatctgga aacgatgcaa gaagcatcgt 6000 atcgaggcga tgaaagcata aagatgcacg tcccgaatgt ctgcgttgca cagcaagtct 6060 gcacggagtg cataagttct gacgacattt cagaatggtg tccctcgtgc ggtgtgcgcg 6120 agcatatatt taccgaagat cccgttggtc agctgctaca gcttgtagta cgcgacaaaa 6180 cggactttga aaatataata tgcatcgcgc acaacgcacg agggtacgat gcgcaattcg 6240 ttttacgacg gcttgtcgaa cgtaaaataa attgcgcgcc aagcatcata ctgaatggtc 6300 agaaaatact gtgcatccgg tatggacgca caaagttcat cgatagtctc agctattttc 6360 aaatgaagtt ggccgcgttg cctgctacat ttggtttagg tgagactacg aaaaaaggct 6420 gttttcccac ctttttaaca cctcggctaa cgccgaatac cgtggcgcca taccggatgc 6480 gcactattat tcgcccgaca ctatgtcgtc gagcgagcgc gagcagttct tgctgtggca 6540 cagcgagatg cgtgcagcga acgcagtatt cgatttcagc aaggaaatcg tcgactactg 6600 tcgcatggat gtgacgattc tgcgtcgcgc gtgtgtggct ttcagaaaaa tttttctcga 6660 ggtcggagac acggacccgt ttgtagtggc tactacaatc gcgtccgcgt gctctcatct 6720 ctatagaaaa aatttcttaa agccacagac tataggaata gtccccaggg gcgggtatag 6780 gcgcgccgac aagcactccc aaaaagccgt cgagtggctg ctgcagtgcg agcgagagat 6840 cggtcgcgaa atcgtgcatg caggcagagc gcgcgaatat cggcttcccg agggattttt 6900 ggtcgacgga ttcttaccat ctaccaatcc agcagaaaat cccattgttt ttgaatacca 6960 gggctgctat acgcatggct gccctgaatg tttcaaaaac aatcgcaaca agccgaatgc 7020 ctggggtcgt actttcgacg cgttgctcga gagtacacgc gctaaaatcg cgcagatcca 7080 acagctcggt tacgagacgc gcgaaatttg ggagtgcgag tttgacagag tcaagcgtga 7140 aaaccccgaa atcgccaagt acgtgtcgga gcatccgctc atcagcaaaa taacgcttaa 7200 tcctcgtgac gcgttcttcg gcggacggac cgaaaatttc gtcgcagttt acgatgctaa 7260 gccaggcgag cgtattctgt atacagatat atgctcgctg tatccttaca tttgtaaacg 7320 aggcagattt ccgctcggtc atccaaaggt ttacgtcggc gaggaatctg atgagctgac 7380 gggtggcaat ctcaacaatt tttcggccat agatgggctg gtaaaatgta aggttttacc 7440 gccgagaaaa ttgtatcagc ctcttttgcc gataaaaatg cacggcaagc ttctcttcgc 7500 actgtgtcgt acatgctgcg aagagatgcg gcagagcgat tgttgccacc cggatgcgag 7560 tcagcgagag ttcagcggaa cctgggtggc ggatgagctg cgaaaagcca tcgaactcgg 7620 ctatactata actgaaatat tcgtcatatg gcagtatagt atgaccgagt acgatccgct 7680 cacaggtgaa ggaggacttt tcgctggcta catcgacact ttcttgcgat taaagcagga 7740 agcatcgggg tttccagctt ggtgcgttga cgcggaatcg aaagcgcgtt acgttcgcga 7800 gtatcgcgaa aatgagggga tatcgctcgt tctggggcaa gttcggacag cgcgataatt 7860 taaaacagac ggtgatcgtc aaggagcgcg aagacctcct taaacttcta acggccgagg 7920 ataaagaggt cttgagtctc ctgctggtca acgaggaggt gatgtacgcg agctggcagt 7980 acatcgacga tgctgtggag tctactccat atacgaacgt agttatagca gcctacacga 8040 caactctggc ccgcttgaaa ctattttcgt atttggagaa gctcggcaag cgaacgctgt 8100 actgcgacac tgattcgtgc atatttgtgt gcaacgaaaa cgcacaagaa tacaggcccc 8160 cgttaggctc gttgctcggc gatatgacca acgagctcga cgaagggact tatattacga 8220 gcttcctctc gggtggtccg aaattttacg gctacagaac cgtgaattcg agaactggcg 8280 aggtatccga aaaatgtaaa gtcaagggta tctcgctaaa tttctcgaat tctttgagaa 8340 taaattatcc tagcattaag agtatgattg aaaattattt taagcctgat gaagagaatg 8400 aatgcattaa tcttaagttt aattctatta aaagattgcc agatcacact gtcgtcacgc 8460 gagaagaggt caaatcgtgt agcatagttc taaaaaagcg taggtatgtt actcccgggt 8520 tatcgctccc gtacggttac aaaagttaag tatagtatgt aagttgtttg tgaataaata 8580 aaagtaaatt taagaatctc gtttttgtat tagtttttaa gtaatataaa ttgttctgcg 8640 ttatgtttaa tatatttata ataaaaaaaa agtattttaa aatttttctt gtaattataa 8700 catccttctc tctcctttac ccactcatca tcgcacgctt ataggcgagc cgtacatgca 8760 cgcgtataga cggctcgtcc gcggatgata ctcgtggccg ctaatatacg cttataagcg 8820 gcccgcgtga gtatacgcgt atacgcggcc cgcgtgagta tacgcgtata cgaggcccgc 8880 gagagtacac gcgtatacgc ggcccgcggt agaggaataa tattaataaa aagcgctttt 8940 cgggaagaga gcgctttagt cgaaaagatg gatacgcgtt ggaagcatcc ttttacggcg 9000 tgcatagcag gaccgacagg agctggaaaa agctggtttc tgaaaaaatt tctcgagcat 9060 ctgccggaga tgtgcgatgc tcgcttcgac agggttctgc tctactatac cgagtggcaa 9120 gagacgtacc gcagtgaatt caaagttggc gggagtgctg ctgtacctat cgagtttcga 9180 gagggacttc cgcagcgaag cgattactcc gtggatacgg gaaagaaaaa attactgatt 9240 ctcgacgact tgatgcgaga atcatcgaac gatatagttc tggatttgtt ctcgcgttcg 9300 tctcatcatt tgaatttgtc ggtctttctg gtgacgcaga atatttttca ccagggaaag 9360 tctcagcgcg acataagttt aaattcaaag tatcttgtta tatttaaaaa tcctcgcgac 9420 agagctcaga ttcggcatct cgcgcgacaa atttatccgg aagatccgag gtttctgcag 9480 gaggcttaca tagacgctac gcgcggacct cactcgtatc tttttctcga tctaactcaa 9540 gcggccgtgg acgagtatag ggttcgcacg tgtatattcc caagcgacga tattcagtac 9600 gcctacgtgc cgaaattcgg taaaaagtaa aaatacaaaa taggtgtgca gagctctcgg 9660 catagacagt aggcgagaag ctatggcgcc aacaacatcg aagaagaaac aggctggctg 9720 ttgtataccc gtggctgcgc tcgacaccaa gcatttcaac agaaaacgag tagctgtact 9780 gaacgctctt tgtcacacgg acggccaaca acgtacggct cttttacgca cagccgacaa 9840 gagactcgtg cgttgcatat gcgagtgcgc tttgaatatc ttgcaaggtg tgatcaaaat 9900 caaagactcg cataaacgac gactcaaaaa gcacaaaaac gtgctgagaa aattgacgat 9960 ttgcaataat aagaaatctg tcagcgattg gcagaaaaag aaacgagttc tcgtacagca 10020 cggtggtgct tttttgccgc ttctactcgc acctattatc ggggctcttg tctcgaagtt 10080 atttaattca aagtcagcag cagcggaaga ctaaaatgga tcgtgcgaga aaaatgatta 10140 tcgtgcccga agatatgttt tcgcgcatgc aaaacgttca gccagcagca gcagcaccgc 10200 cagtagtaaa cgagtcgtgt gaaaactctg ttcaaacgtg cggcgacaat ctctcgagac 10260 tcgattccga gatgcacaca attctacatt cgaataaatt tacagatgag cgtgaaaaat 10320 gtcaaaacta tttacaagta ctgcggcgat atctgttttt caaagattcc gaacgtcacg 10380 ctgaacacga ttacgagcgc gacgtcgacg aaatagaggc cacatcggca ccaatgactg 10440 aagaaattat actcgacagc gtgcccaaga gttacgctca aaaaacgcgc cttctgctga 10500 aacactggaa aaccgttgcc ggcgatcgtc tcaagtggga tagcactggc agagtcagta 10560 tagacggccg gcccatacaa gattccaaca ttatagaact gattagcgac gttgtcgcac 10620 gcaggaaaaa aacactggat cagtcggaag tgcccatcgg ccgtttacct ttcgccaaat 10680 tcatcaagac agtcgacaca ccgataaagc tgattggaaa cccggaaata gtgaaaattg 10740 gtaaaaattt gacagagccg aataatacag cgaaacgacg actgcctagt aaagagagca 10800 tcgacgcgtt gatggaaata cacaacgcta gcagcagcga atcaccggca tcgcctgctt 10860 tgacgagaaa tcgagcaaag aacaagtggc tacgtctcgc aaacttttaa gcatgtcgtc 10920 gtcgtcgtac gagcagcaat acgatgtctc gcacgaggct ggttacgcgg gtgcgcgcaa 10980 tctcgtgcgt gtaaatcgaa aaggcaaatt atttagcgcg gctgacacgc gggagaaaga 11040 acgtatatac gagtggctta gtaatcaaga cgcctatact cttcacagac ccgttaagcg 11100 taaatttccg cgattaagtt acaacgtgag caacattgac gacgtatggg agtgcgattt 11160 gcttcaatta accacgatca aagagcacaa cgacggatac tgttacctac tcgtagtggt 11220 ggatgtattg agtaaacacg cgtggctgga gcctctacgc gataaaacta cagccaacgt 11280 caccgcagcc ttcggacgaa tactcgaacg cagcaacggt cgcgtcccga tactcttaca 11340 atccgacaag ggtaaggagt ttgtaggctc gacgtttcag gaatttttga aaaaacgcga 11400 catcaaattc cgtacagctc gaaaccccga tatcaaagcg gctgtggtgg aaagactaaa 11460 ccgtactgta cgcgaacgaa tgtggcgata cttttctcac aataacacgc aacgttacat 11520 cgacgtggtg cagaaaatca tcgaggctta caatcatacg caacacagcg gtacaaagat 11580 gcgcccctgt gacgtgagca tctacaatgc tgcgaaagcg cgcgaaaatt tagcgaaaag 11640 agctcgtctg caatcgactt acaagaatcg cgaaaagatg gcgtcaggtg taatcaataa 11700 atacaagccc ggatcttacg tacgcattag tcgcacgaaa aatactttcg agaagggata 11760 cgaaaaaaac tttagcgagg aagttttcaa gattaagcgc atatccaaca gacagcagtt 11820 accaacgttt atcctggaag atttgaacgg cgaagagatt gacggctttt tctatctcga 11880 ggagctcgct cacgttggca caaaacgcat gtcggacgct gctgagcaat tcaaaataga 11940 acgcgtaatt cgtacaaagg gtcgtggctc gaagaagcag ctactcgtca agtgggctgg 12000 ctatccggac aaattcaact cgtggatcaa agcctctgaa gtacaaaaaa tttaaaaatg 12060 gaccagaacg atttttacct agttctgcct agcaacagca gtatgatgta ctacaacgaa 12120 aacgccacca gctgctttac aacgcagctc tcgcgcgagc tcagactcag cgggaattgg 12180 tccgttgcgc tcgtcgaaat tcacgtgccg agcactgtca cgcacattca agaatcggac 12240 gccttttaca cgtttaaaca gtcggaaaac gcaaagcagg aaacgcattt tttccgcacg 12300 gtatttacga caagatcgag caactagcca acgaaataaa taaatcagcc gcctttcacg 12360 atcatctgcg tctcgaagtt gcaccctttc aaaagggcta ctacatgcta cagcgcaagt 12420 gtgcctgcgt tgatcagcat cagacaacgt ttaacgaaaa aatttgccgt attttcggtt 12480 tcgaagattc cgagactcgg cgaaaaaacg gcacctttgt cacatcgtca atgggcagtg 12540 tagtcgcgct aggaaacaga ccagcatcgc tttcccgagc tattcccgat cagctctacg 12600 tgtacacgga tgtatgcgag ccacacactg taggcgacac tcaagctgct ctactgcgta 12660 tagtttccgt ggatagtgca aagtacaaat tcggcagtaa tatcgtgaga cattttgcac 12720 ctgctcatta tattccactt ttacatcaca gttttcgctc gattgtcata gatataaggg 12780 atcagcacgg agtgcgaata ccattcgaat atggaacgct gacggttacg cttcacttta 12840 aacgtaatcg ctagagagag aaagaaagag agagagagag agagagagag agagagagag 12900 agagcaacgc tatgtcacga tgagtcacta catacgctac tacgacgagc aagtaggagg 12960 tggaggagtg agacacgtct attccggttc aacgtatcag cgcggacgag gttaggtgcg 13020 tggctcggag gtttatttcg caaaattctt ccgtacatcg cttccggtgc cagagctgtg 13080 ggcaaagagg ctgtgcgcac aggcatcaac gtgttggacg acgttgcaaa caacggctcg 13140 aatttcaagg aggcgttcaa atatcgtact aaagagtctg gaaaaaaatt gaagagaaaa 13200 gccggtgaaa agatagcaga aatgatgaaa ggctctggtt ataaatcacg cgcaatgcat 13260 cgaaggcgtc agttgagaaa gacacgtaca tcgagtacaa tcggctctag caaaaaacgc 13320 aaacaacata cggcaggacg gaagaagaga gtcacgagaa aaacgaagaa gaggcctaaa 13380 aagagaagta ctcagcagca gcagcagagt ttgagaagtg ttacggatat ttttggtcct 13440 aggcgtattt agtttggggt aaaaatggca tttttacatt ctcactcgac ggaatgcatg 13500 tccagcgagc tggatctttt cacgcttcca gccacccaaa cgtccattga aagcagcagc 13560 tttctgcatt ataagcctgt gtcgtctctt agcgacgatg tcgacgcacc attggaattt 13620 gtcgtgccgg caggtagcga gcattacttt gatctggctc acactatgct tcatgttcaa 13680 gctaaaatcg tacccgctga cgaggctacc gcaacgaccg aagaccttaa agtcggaccg 13740 atcaacaatt tcatgcattc aatgttcaat cagattgacg tttttttcaa tcagaaaatt 13800 gtttcgccgc ctaataacgc ctatccttac agagcgtaca tcgagacgct tctcaactac 13860 gctccggctg ctaaagagtc acatcttaca gcgagtctgt ggtacgacga cacgagcggc 13920 ggtttcgatt cgccagctaa tgcggtgagc actgccacag cacctatgat cgtgaacaaa 13980 gggctggaaa atcgcaaata ttttacacag aatcgacgat actttgacat gattggacat 14040 ctacatcacg acctctttaa tcaggataaa atgctgatta atggcgtaga aatgcgcgta 14100 cgtctagtga ggagtaagga tgctttttgt ctgatggacg cgacagccga tggaaaattc 14160 aaactcagca tcaaagaagc tacactcata gttcgcagag taaaaatcag tcctggagtt 14220 ttactggctc atgcgcaagc gttatcgaaa accacggcta aatatcccat aacgagagtc 14280 gaagttaagt cgtttactct tcactcggga atattgggcg attcgatcga caacgttata 14340 cacggtcagc tgccgaagag aataatcttg ggctttgtgg aaaacaaagc attcaacgga 14400 aaccgcgcgt taaacccttt taattttcaa cacttctcta taaattatat ttctctctac 14460 gtcgacggcg tacaaattcc aagcaaacct ttgcagccac gtttcacggg tctcgacaaa 14520 ctctacattg acgcttttca aacactttac acgggcacag gcgtgcattt tctcaacgaa 14580 ggttttggca tcaatcgcta caactattac aaaggcaatt ttttgacggc cttcgatctc 14640 actcctgatt tatcggcaca ctgtgccaca cattggaatc tcgtgcgctc gggtagcata 14700 cgcatagaag tgagatttga gacggctctt ctcactgcta tcaactgtat cgtttacgca 14760 gagtacgaca acgttctgga aatcgactct agccgccaaa tcgtaacgga ttttagcgct 14820 tgatgcgcac gcacactcga tcgcagcgcc tcaaaaaaaw ttttttttga ttcgcgcttc 14880 ataaggcgag cgcgctgaca gtctgcagtt agtctccctc tcgaaacagc cagcacctaa 14940 cgtactcgag cagcaacagc atcatcgtca tcatgacgga ctacgtgatc gatattcaag 15000 gctttcggga tataaataaa aaatttattc ccaaagaagt cggagtttta tctctccaga 15060 gccgtatagt tggtcattgg atcattcgcg cgccgtgcag ttttaccgaa ttacccgcgg 15120 atatgcagca gacgaataat tactgcactc tcgatgtaca cggtctagag tggcacgacg 15180 gagatatctc cttcaagaat ctgcgtcgta atctgtgcaa cttggtgaaa gacgcgcgcc 15240 gtatatacgt gcgtggccag gcaaaagcca agtttttgga aaaagtagtg gctcgaagaa 15300 tcatcaattt agaagatttt agtgcgcctt cgtttgacga actctctatg caattcccta 15360 acgtcttgct gtgcaccagc cacggtatca agaaattcga gaacaagaaa aatttttgcg 15420 cactgcgtcg agcctatcaa atcagaaggt ggatacactc aatcgttgcc agtcctcatc 15480 gaagcgaccc gtacgacata aatagccaag taatgtacca agctatatta gattacgatc 15540 ggctgcagat ggagagatat cactatcccg aaagatttcg tgctagcgac gttgtggacg 15600 gaattaaaaa agaagctgac gagaaaaaag acggtgcggt agaagaagag tacgtagatg 15660 caccatcaac atgaatacgc gtcaaataat cacagctatt agaggattaa aagccgattc 15720 aataggagtt tacgctgcta atcacgttcc gaaattgcta tccacaccga cagctattgt 15780 aacgaatctc gacacgtccg atcagccagg atcacactgg gttgccatct tcatagataa 15840 aaatggatac ggaatttact ttgacagcta cggagttgca cccgtatcaa agcatcatct 15900 tgatcgactc ggaaaaaact gcacgcgatt cgactggaat aaaaaacaag tgcagagcgt 15960 cgactctaaa gtatgtggcg agcactgcat tatgtttctg tatcatatgt gtagtggcat 16020 ttctctgcga aaatatctac gttttttttt catcagactt tgtcaagaat gacgcattag 16080 ccgttaagtt ttatcgaagg ctcatgagaa gaatgcaaaa taaaacaatg cgtcatcgcc 16140 atcatcgagt caacgcattc ccacgcacga gttctacagg acggggttac tgtaatcaga 16200 tctgcacagc caagaccgga tgtttgttat ctgagcagtt tatacaaacg tactaggtgt 16260 gacggttgcg ttaaacaaaa caacttctat cgatatcaaa cgtgtctcgt ttcactatgg 16320 aatgtaaata ctgtaacggc cgtttctaga aagaactgcg actcgttgac gctcgctctt 16380 tgtcgtccca ttctttataa aggtgtcgtg gcgcagtcgc cagccatcag tcgtaagaac 16440 actctcgtca agtagacctc gtgtgtaaaa aatctttgca atggattcag cagcagcagc 16500 agcaccaagt gccgagtctt ggaaaaataa gccgatcacc tgcaacttgt tgtactcgac 16560 gacgtacgcg ctcaacaaat caaactgcaa gcgtgcaacc ataggcctgg agtattacgg 16620 cggttactac agagcagtgt taaaaatttg tgccaatgga tcgccagcca agtatgtgac 16680 actcgacaca cacagcgtgg gaagtgatca aagatcagat ggaggctatg gatgcatatc 16740 tgaatcactc cttcacgttc taccaggatt ttggaaatcc tagcaagatc ctccttccaa 16800 atcacgatgt cacattcaca aacagcttcg gagcaagggc gatcggcatc gacgagaggc 16860 caacaccaac atcagcacct acagcagaaa tgacaaccca caacggtcag cagttcatca 16920 gcagtgtcgg cggcggcggc ggccatgacg aggaaacaca acagcatcca cctgcacagc 16980 ctgcgtgcaa gaagcagaaa aagtacgaga gtccgcctag tgtaattatg catcaacaaa 17040 cgttcgctgg cttaagacag gtaaaagagt gcatcgattt gcgttttaaa cagctcgagg 17100 agctttcggg attagttaat cgcggcgtgg actcaatttt agattatata aaagacgctt 17160 tgaaaagaga agaggctgaa ctatgtcgtg aaattctgaa aaatgtaaac tcgttcaaac 17220 aatattacgc ccttcataaa tataaaatcg acgaaattgt taagacaaac ttaaatttcg 17280 ccactgtaag cgtcgacgtt ttagatgtag ttttaacaga aatctatgcg tttggcttag 17340 ccagaatcgc aaaagatgtg tacgacgtta tgtttccgct ataacttttt ttattcatct 17400 cctatcgtta gtttctaagc tgtaaatatg ctttattgtt ctacgcacac acacacacac 17460 acacgcgcgc ataaatatat catgtattca atttttactg ttttattaaa taaatataag 17520 agtgtatcaa acaaaaattt ctctatttta atgtgtacaa atcccatcaa taaaccccaa 17580 agctgaaatt tagagtaggc tttgcggcac cgtgtagtag gaaatcatac aggttacata 17640 ttattacagc gcagaaagga tcagagccgg cgctagggca ataaaagcca aacagcaggt 17700 atacgcgagc gacaggtaga cggcggcttt ggcggacgcg cgcgagtatg gcgtagagag 17760 gggagagaga gagagaagcg ggtccggtac agcagcggcg ggagccaccg cgccagagct 17820 caggggctac tatagtgtaa aaggagaggg gggctacggg ctggtatagc gagagtcgcg 17880 cacgtgcgtt cgagctcggg gccggtatgg agtagaaagc gagagagaga gagagagaga 17940 gagagagaga gagacaccgc gtcagagctc ggggctacta tagtgtaaaa ggagaggggg 18000 gctacgggct ggtatagcga gagtcgcgca cgtgcgttcg agctcggggc tggtatggag 18060 tagacagcga aagagagaga gagagtcgcg ctcgggtctg cttggggtaa aaggaggagg 18120 gctggcatcg tatgcgtgat atagcttttg tttaaaaatt atttataatt atttataaat 18180 aaacaaataa aagtcaccca tcgatgacag actttttgtt tatgacagta gcggtcgtaa 18240 atcatgttta ttaaccttat cagctactgt tgatataaac atgggcttct gtttacataa 18300 acaactggcg ccagccacgg gactggtgac gtcactctag gccacgccca gtctctccat 18360 accagcccat atactactc 18379 // ID Gypsy5-LTR_Dmoj repbase; DNA; INV; 309 BP. XX AC scaffold_6390; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dmoj; KW Gypsy5-I_Dmoj; Gypsy5-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-309 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1053-1053 (2009). XX DR Genome; scaffold_6390; Positions 57297 56989. XX SQ Sequence 309 BP; 93 A; 58 C; 64 G; 94 T; 0 other; tgtcgtgccc tcacaatatt atgaatatcg tgccatcact attataagca ttgtgtcatc 60 tctatatttt aagattaaga gcgtaaagag gaaagaataa aatgtacttc gcaataacgc 120 ttaatgttgt atgtgtgtct atgcgacggc cgttgtttca attccattcg ttttttcacc 180 gatccgttat cctaacctgg cagacgtttc tgtctgacgc agtctgggcc cgacagtaag 240 ttagagcgag ataagaataa caattgcaat acatgtgagc gagaatgtgt gagtacacaa 300 aaattagca 309 // ID Kiri-4_AAe repbase; DNA; INV; 4392 BP. XX AC . XX DT 21-OCT-2010 (Rel. 15.1, Created) DT 06-JAN-2011 (Rel. 16.02, Last updated, Version 3) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; L2_Ele6; KW Kiri-4_AAe. XX NM Kiritsubo-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4392 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4392 RA Kojima K.K. and Jurka J.; RT "A distinct group of non-LTR retrotransposons from the yellow RT fever mosquito."; RL Direct Submission to Repbase Update (21-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as L2_Ele6. CC [2] Consensus update and re-classification. This consensus is CC generated from 23 sequences with >93% identity, and ~100% CC identical to the original sequence in [1]. This family does not CC belong to the L2 clade and is renamed as Kiri. It could CC constitute a new clade with other Kiri elements. XX FH Key Location/Qualifiers FT CDS 361..1128 FT /product="Kiri-4_AAe_1p" FT /translation="MSQLWLRSRTAEMSKLDSKRKEIVHGSSPSNEISNNE FT LARLITNQGKTMQDQVNKLGNDLRAEFSKGMDEIKADLVEVNSKLCCMRQD FT VFDNANAIARSNLSNDLIISGVPFTENEDLLHYFRCWCKTMGYADGNVPYV FT DIRRLSRTPLNAGNIYFILVQFAITNQRNDFFSCYLRNQSLNLNQIGFQTN FT KRVYVNENLTPTARKIKSKAIELRKAGKLAKVFSRAGEIFVRSKEIDQDVI FT IKSEEDLRRFEHLGG" FT CDS 1443..4274 FT /product="Kiri-4_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MANVRREYTSTDWSVQGIVMKSAFISNKLSICCMNAQ FT SICARRMSKVVELRQIAKVSNVDVICITESWLNINVSSDVLDIEGYTIVRN FT DRVGRLGGGILVYLKKNINFKIIEASSNDIGTPETEFIFVEISIDHRKVLM FT GFFYNPPELDCSQVIHDKLTCLKYQYDHIYLMGDFNTNIINTLSEKVKRFC FT ATLQCLSLSSLGRIPTHFHRTGASLIDMIITNDTGSVLRFNQIEVPFLSNH FT DLLFASLDFDLLPNNNTRTFRDYSRINSSKVIELYNRIEWNEFYSIDNPDY FT LVPFFNNHIENIFDSCVPLKTVHNHPRSNPWFNQDISKAMIDRDLAYRVWK FT VSKCDTDYSLYKSLRNKVTALINTAKKEYWNTKVSNASTSKELWKYFGQMN FT VNSKKTNGDTRFSPDEINVHFSNCYSTSNDVVANNPSLVDSFFSFSAIELN FT QIINATYEIKSNAVGLDGIPIKFVKSILPLLLKPMHHIFNSIIKLGVYPNA FT WKISKIVPINKKSNSNSLDNLRPISILSAISKIFEKLVKNQIVTYISNSNL FT LSSSQSGYRKGHSPKTAMIKVCDDMGFVLDSGGSVVLLLLDFSKAFDSICH FT ETLCHKLESQFGFDFDAVSLVRSYLTNRSQTVFVNNQFSEYLPITSGVPQG FT SVLGPILFTLYINDLPIRLAGCNVHLFADDVQVYFNCSGLTTDAAAELINN FT NLQRILDWAVENHLILNARKTQACYISRFKRMVEKPEIVLNGNRILYSDVV FT TSLGIIVQQNFEWDSFILHQCGKVYAALRTLRSNAYFLSSATKLRLFKSLI FT LPHFITCDFIMTQASRTTMDRLKVALNCCIRFVYNLNRYSHVTPLQNALIG FT CPFNKFPSVRSVLLLHNIIRTQSPSYLFSKLIPLRSVRGKKFVIKRVRTSH FT YASSLLVRGISLWNNLPSHIQNTNSSLGFKKTCLQFFNHE" XX SQ Sequence 4392 BP; 1326 A; 808 C; 845 G; 1413 T; 0 other; gtgtgtcaac agtggtagaa tgaggtgaca tggttatatc gttgcgcctc taaagcgaaa 60 ataacgcgta tatccacctt gttgtgttcg ccatctgcgg tctaccgttg agtataatcg 120 tgttgctgta cattaaaccg ttatctcatc gattggtgaa gtcctttgga agaaaagcga 180 ttaatatcat cagttggaca ttccgcgaga taaagtctac aagttgctgt tgttgctgtg 240 ttgtgcgctc tatctgtatt gctgttgctg ttgtatgctg ttgctgcttc catctgacta 300 ttgctgctgc tgtttgtgtg gtactgttcg tcgtggtgtg cctcggttta ataacagttc 360 atgagtcaac tgtggttacg cagccgaaca gccgaaatgt caaaactcga ttccaagcgt 420 aaggagatcg ttcacggtag ttccccatct aacgagatct ccaacaatga actggcacga 480 ttgattacaa accagggcaa aacaatgcag gaccaagtaa ataaactcgg gaacgatttg 540 cgtgcagagt tttcgaaagg aatggatgag attaaggcag atttggttga agtcaattcc 600 aaattatgct gtatgcgaca agatgtgttc gacaatgcca acgcgattgc tcgatcgaac 660 ctttcgaacg atttgatcat tagcggagtt cctttcaccg agaacgaaga tctactacac 720 tatttccgat gctggtgcaa gactatgggc tatgcagatg gtaatgtccc atacgtggat 780 attcgacgcc tgtccaggac accgctgaat gctggcaaca tctatttcat tttggtgcaa 840 ttcgctataa ccaaccagcg caacgacttt ttctcgtgtt atctgcggaa tcagtctctc 900 aatctgaacc aaattggctt ccaaaccaac aagcgagtgt acgtaaatga aaatcttact 960 cccacggcca gaaaaatcaa atccaaggcc atcgaattga ggaaagcagg gaagctggca 1020 aaggtttttt cacgtgcagg ggaaattttc gtcagaagca aagagatcga ccaagacgtt 1080 atcatcaaat ctgaagagga tctaagacgt ttcgagcatc ttggcggcta atcctatccc 1140 tataagtgct ctctgtcctt cctcacctgt cctaatttct ccatgtatcc tgtcctgaaa 1200 gttgtagcca ataagttttt tttttttaat taaggtttaa catcaatcct tgctattacc 1260 ggtagattgt gttattttta tctttatgtc tagtttgttt cttttttctt tgcgttatta 1320 gtagttttat gcgtagtcca tatcaatgta aatcaggttt ttgtttttgt aaataatgca 1380 atattcgagc gaaaaaggaa tgcgtaaaca tgttttctta gggtgggatt tcaactggga 1440 taatggctaa tgtacgccgc gaatatacct ctacagattg gagtgttcag ggaatcgtaa 1500 tgaaatcagc atttatatct aacaaactgt caatttgctg catgaacgct caaagcatat 1560 gtgccagacg aatgagtaaa gttgttgagc tgcgtcagat tgcaaaagtt tcaaatgtag 1620 acgttatttg catcacagaa tcctggttaa atattaatgt ttccagtgat gtgttggata 1680 ttgaaggata cacaattgtc agaaatgatc gtgtgggtag gcttggagga ggtattcttg 1740 tatacctgaa aaagaacatt aatttcaaaa ttattgaagc atccagcaat gatattggta 1800 cacctgaaac agaattcatt tttgtggaaa tttctatcga tcacagaaaa gtgctcatgg 1860 gattttttta taatccccca gaactagatt gctcgcaagt gattcacgat aaattgacct 1920 gcttaaaata tcaatatgac catatctatt tgatgggaga ctttaacacg aacataataa 1980 atactttatc agaaaaagta aagcggttct gtgctacttt acaatgttta tctctttcaa 2040 gtctaggccg tattccaaca cattttcatc gaactggtgc atcacttatt gatatgatta 2100 ttacaaatga tactggttct gtgctacgct tcaatcaaat tgaagttcca tttttatcta 2160 atcacgattt attatttgct tcactggatt ttgatttgtt gcctaataat aatacgagga 2220 cttttcgtga ctatagtcga attaattcat ccaaagtaat agaattgtat aaccgaattg 2280 agtggaatga attctattct attgataacc ctgactattt ggttcctttt ttcaacaatc 2340 acatagagaa tatattcgat tcctgcgtac cattaaaaac agtccataac catccaagaa 2400 gcaatccttg gttcaaccag gatatatcca aagcaatgat tgatcgtgat ctagcctacc 2460 gagtgtggaa ggtctcaaaa tgtgatactg actattcgtt gtataaaagc ttaagaaata 2520 aggtcactgc tttgataaat acagctaaga aggagtattg gaacacaaaa gtttcaaatg 2580 cttctacctc aaaagaatta tggaaatatt ttggacaaat gaacgtcaat tctaagaaaa 2640 caaacggtga tacacgtttt tcgcctgatg aaatcaatgt ccatttttct aattgctaca 2700 gcactagcaa cgatgttgtt gcgaataatc cgtctttggt tgattcattt ttttctttta 2760 gtgccattga attgaatcaa ataataaatg caacttatga aatcaagtct aatgcagtcg 2820 gtttagacgg aatcccaata aaatttgtga agtctatttt acctctcttg ttgaagccaa 2880 tgcaccatat tttcaacagc ataataaaac tgggggtcta tcctaatgcg tggaaaattt 2940 caaaaatcgt gccgataaat aaaaaatcta atagcaattc tttggataac ctccgaccca 3000 tcagcatatt gagtgccatt tcaaaaatat tcgaaaagtt ggttaagaac caaatagtca 3060 cctatatctc gaatagtaat ttactctctt cctcacaatc tggttatcgt aaagggcata 3120 gtccaaaaac agccatgatt aaagtatgtg acgacatggg ctttgtccta gatagtggcg 3180 gctcagtagt acttctgctg ctagattttt caaaagcatt cgactcaatt tgccatgaga 3240 ctctgtgtca caagttagag agtcagtttg ggttcgactt tgatgctgta agtctcgttc 3300 gttcgtatct gacgaataga tcgcagactg tgttcgtgaa taatcagttt tcagaatatt 3360 tgcctattac ttcaggtgta ccacagggct cagtacttgg gccgatatta ttcacgttat 3420 atatcaacga tctcccaata aggttagcag gctgtaatgt ccaccttttt gcagacgacg 3480 tacaagtgta ctttaattgc tcagggctta ctactgacgc tgctgcagaa ttgataaaca 3540 ataatttgca aagaattttg gattgggctg tggagaacca tttaattcta aatgcacgta 3600 aaactcaggc ttgttacatc agccgtttta agaggatggt ggaaaaacca gaaatagtgt 3660 tgaatggtaa tcgaattttg tactctgatg ttgtaacgag tctgggaatc atcgttcaac 3720 aaaatttcga atgggatagt ttcattctac accagtgtgg gaaagtttat gctgctttac 3780 gtactcttag atcaaacgca tattttttga gttcggctac gaaacttcga ttgttcaaaa 3840 gcttaatact cccacatttc ataacttgcg actttattat gactcaggca tccagaacta 3900 caatggatcg attgaaagtt gcacttaatt gctgcattcg atttgtctac aatctaaatc 3960 gctactcgca tgttactccg ctacagaatg ctttgatcgg ttgccctttt aacaaatttc 4020 cttcagttag atctgtttta ttgttacata acataatacg aactcaatct cctagttatt 4080 tattctcaaa attgattcct ttgagaagcg tacgaggtaa aaaatttgtt ataaagcgcg 4140 taagaacttc tcactacgcc agttcactcc tagttagagg tatatcttta tggaataatt 4200 taccatcgca tattcaaaat acgaattcat ccttagggtt taagaaaaca tgtttacaat 4260 tctttaatca tgaatagata ttaattttcg tagaaagttt tttttaatgt gtcatctcat 4320 actattaatc tacgcttaac agaaaagact atgtcttatg tttatggaaa ataaaaatga 4380 aatgaaatga aa 4392 // ID hATx-25_SM repbase; DNA; INV; 2250 BP. XX AC . XX DT 17-AUG-2009 (Rel. 14.08, Created) DT 17-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-25_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2250 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1860-1860 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 272..1171 FT /product="hATx-25_SM_1p" FT /translation="MKRIPKETQISPKRIRNSNLFTPIDSFYKPNRTSLNE FT IVSKLATIDGMSIRQICSSEFIRKSILSLGFTLSKQPRDVMKLIYDYYDET FT KAIIISNIKKKQLINQKVSISIDEWTSLRNRRFLNIHVYYCDGDSDNLGLV FT RVCGSCTSEVLIELVDQKLKDFEIYFERDVVATTSDGANVMLKFGRLSPAD FT HQLCYNHAIHLAVMSVLYSKITNNQKISDGVDSDSEQEYCLNSSVSDSESN FT GSTSDAEFISEEYLLLTKDMNIAETIRRVRRIVKIFKNSPVKNSILQKYII FT VVEKKSCH" FT CDS 1444..2025 FT /product="hATx-25_SM_2p" FT /translation="LSRQLLTELKYQLSKRRNILSISLIKFLQDPNFKRDR FT GDDFFNFSSKSEIVKIAKVYTARLFRKIGDKENFDLNNEIDLDDMTTENDL FT IIDKNSALNLSQLLEKSIQESVKQPQPFLDDNYSSLTKEINLYEVTGKITP FT NXEMLRXALSTIRPTSTQNERNFSVSGNXVSVRRTRLTDKSIDRLCFLKHY FT FMKK" XX SQ Sequence 2250 BP; 840 A; 322 C; 358 G; 725 T; 5 other; ttctcgagaa atattccatt aaattctcga gaaaactagg aatttataaa aaatgctaaa 60 aatccaatca tctaatggga tatctacaac ttctcttctt tcattagata acttatagac 120 ggcttcttct attaaaaatt atgataattg ttggaatttc tttgaaaaaa tagaaaagga 180 aacagcaaag tgtaatattt gttccaaaca aatatcgtgt aaagcatcag ctacaacagg 240 attgcatcgt cacttggatc atatacattt tatgaaaagg atcccgaagg aaacacaaat 300 ctcacctaaa agaattagaa attcaaattt atttacacca attgattcat tttataaacc 360 aaatagaaca agccttaacg agattgtttc aaaattagct accattgatg gaatgtccat 420 aagacaaatt tgttccagtg aattcatacg aaaaagcatt ttatcattag gctttacctt 480 atccaaacaa cctcgcgatg taatgaagtt aatttacgac tactatgatg aaacgaaagc 540 gattatcatt tctaatataa agaagaagca attgataaat caaaaggtca gtatatctat 600 cgatgaatgg actagcttaa gaaatcgaag gtttttaaat attcacgttt actattgcga 660 cggcgactct gataatttag gattggttag agtttgtggt agttgtactt cagaagtact 720 aattgaatta gttgatcaga aactaaaaga ttttgaaata tattttgaac gtgatgttgt 780 ggcaactacg agtgatggag cgaatgtaat gctgaaattt gggcgtctct ctccagcaga 840 tcaccaatta tgttataatc atgctattca tttagcagtg atgtcagttc tttattccaa 900 aataacaaac aatcaaaaaa ttagcgatgg cgttgatagt gattctgaac aagaatattg 960 cttaaattca tctgttagcg attctgaaag taatgggtca acttctgatg ccgaatttat 1020 ttctgaagaa tatttattac ttactaaaga tatgaatatt gcagaaacta taagacgagt 1080 tcgaagaatt gttaagatat tcaaaaattc accggttaag aattcaatat tgcaaaaata 1140 tataatcgtt gttgaaaaaa agagctgtca ttgattttgg actgtaaaac acgatggaat 1200 agcttattta ctatgctaga tcgtgtagtt cgtgtcattg aggcaattga aaagtcttta 1260 aaagatttga atttatgcca tttgtggagt gtagagaaca caatgacaac taaatgtatt 1320 ttaaacgcat taggacctgt taaaattgca atcgaggctt taagccgaaa ggatacgaat 1380 atactaacta gcgaaggaat actcaaattt ttattcaatg cacttcaaac tgaaaattcg 1440 taattaagta gacaattgtt aactgaacta aaatatcaat tatcaaaaag aagaaatatt 1500 ttatcaattt cncttataaa attcttgcaa gatccaaatt ttaaaagaga tcgtggcgac 1560 gattttttca acttttcttc aaagtcagaa atcgtcaaaa ttgcgaaagt ttatacggca 1620 agattattta gaaagattgg agataaagaa aattttgatc taaacaatga aatcgatctt 1680 gacgatatga ctacagaaaa tgatttgatt atcgataaaa attcggcctt gaatctatct 1740 cagttactag aaaaatcaat tcaagagagc gtaaagcagc ctcagccctt cttagatgat 1800 aattattcgt cactaacaaa agaaattaat ttatatgaag tgacaggaaa gattactcca 1860 aacttngaaa tgctgagaaa ngctttgtca acaatacgac cgacgtcaac gcaaaatgaa 1920 cgaaattttt cngtctctgg aaatnttgtg agcgtgagaa gaacgcgatt aactgataaa 1980 tcaattgatc gcttatgttt tttaaaacat tattttatga aaaaataaag ctaatttgaa 2040 tttaatttaa tttttattaa atataattta cgtacattat accgtaccac tcaatttctg 2100 caatatttaa taactgactt gcatatttct aaaagatgtt taataaagac caaatttttt 2160 caaatattta aagcctctgt gtatttagaa tttctcaaat ttcgcgagaa ttctcgagaa 2220 tggaatttat ttttctcgat ttctcgagaa 2250 // ID BEL-636_AA-I repbase; DNA; INV; 7387 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-636_AA_; KW BEL-636_AA-LTR; Pao_Bel_Ele148; BEL-636_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7387 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [6357-6935] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2028..5549,5553..7301) FT /product="BEL-636_AA-I_1p" FT /translation="MLGAGKSPSLRALQLKLKEVLLMFNDIQHFVSKYRAG FT TSAVQIQVRLSKVDELWKMFSDTLVEIMSHEDYVADEEKYAKERQQFSAQY FT YNIKAFLLEKLEQIQRAAMDHSTNVHDSSTLNISEHVRLPQIKLQTFKGDI FT DEWISFRDLFISLIHSRTDLPDVEKLHYLKGCLMGEPKGLIDPLAITAANY FT NVAWNLITKRYDNSKQLRKRQVQALLNMPYIAKESVSELQTLVEGFERIVQ FT TLDQVVEVAEYKDLLLVNMLSSRLDPITRRSWEEQSSTKDKDTFKDLVDFL FT QRRIQVLASIPARSSDSKVSSTNASRQRAVVKASFNSASQSSTSCACCSEN FT HFLYSCPAFQGLSVRDREGVLRKHSLCRNCFRFGHQGKDCRSKYSCRTCKG FT RHHTLVCFKQNADTVSTVGNSASNCPPDNGEPSCPGTNTVASVATSEISVC FT NAAGAASQVFLATTLILVEDNEGNQYGARALLDSGSECNFVSERLCQLLKV FT VRSKTDVRVQGIGQTTSLVKHQVQIKVRSRVSSFARSVNFLVLPKVTVNLP FT TTTVDTRELVIPEGVRLADPGFFKSSGVDMVLGIELFFSLFDIDRKISLGD FT NMPTLIDSVFGWVVCGTMAMPKAISQVNCNLSVKDRLENLLTRFWECEEVA FT DQIKSSPEETLCEEQYQESVKRAEDGRYIVGLPMDGVKLQRLGESRDIALR FT RFYSTERRLDRDSNLRQQYVAFMDEYLQLRHMKEVQEIASEGSKRCYLPHH FT PVVKEASTTTKVRVVFDASCRTSNGISLNDALYAGPVIQEDLRAIIIRGRT FT RQIMVVADVEKMFRQIWIRPEDRHLQCILWRSSPMEDVRTYELNTVTYGTK FT PAPFLATRTLQQLATDEGRRFPSAAVAVREDTYMDDVITGADNVEDATRLR FT CELQDMMEAGGFHLRKWASNCSKVLEGVPVENLAIPNTEEITLESNPSITT FT LGLVWVPGTDKLRFKFQVPQLNPAELLTKRKILSTIATLFDPTGLIGAVIV FT EAKIFMQRLWTIEGEDGQRLDWDQPVPPKVGEEWCKFYEHLPRLQEVEVHR FT CVIIPAAVIVEVHCFSDASMKAYGACIYLRSQDSSGNILVRLLASRSKVAP FT LKTQSLPRLELCGALLGTQLCQKVREAIRFKGEIFFWTDSTCVLRWIAATP FT AIWTTFVANRVSKIQLSDGCHWGHVKGTENPADLISRGLNASNIIDNALWW FT SGPFWLQHPQEQWPVRDAGTNEEGEEERRRCATANTVSTCDEFNQWYIAKF FT DSYDDLIRRTAYWLRMMELLKQPKDQRKNEEFLTIVERKRAEQVLIRRVQM FT EAFPEEWKALSKGSSVSAKSPLRWFHPYISKEDGLLRIGGRLNHSQEPENK FT KHPPILPARHMFTRKLLQSFHERLLHAGPQLLLATVRLHYWPLGGRSLARQ FT LVHKCLRCYRSQPSTVQQFMGDLPSSRVTVSRPFSRVGVDYFGPVYVRLGP FT RRTAVKAYVALFVCFCTKAIHMELVTDLSTERFLQALRRFISRRGKPSDIF FT SDNGTNFVGARNQLTELMALPRNRDCHEKISKECNAEGIQWHFSPPSGPHF FT GGLWEAAVRSAKKHLLKVLGENVLTFEDMNTLIVQIESCLNSRPLTQLTED FT PEDLTPLTPAHFLVGSSLQGLPEYDYTEIPFNRLKQWHAVQQKFQHFWIRW FT KREYLAQLQGRMKRWKPPVEIETGKLVIIQDDNQPPLRWKMARIHQLHPGD FT DGVVRVVTLKTSNGYLKRPVERICLLPLPNQDVVVQP" XX SQ Sequence 7387 BP; 1875 A; 1758 C; 1910 G; 1842 T; 2 other; tgttggtcct tcgagccgga tcagggaccc ccggtaggac ttggaaccca caaacccgct 60 cattatcacc aaaacgccat ccgctggaac gccatttgca accgccaaat tgtcgcatca 120 attgtcgcct attgtcgttg gatacaaagg ataacgccat catctgtaag gaacatagga 180 gggtgtcacc atcttggacg ctacatctgg cttggacatc agcaccgttg gctattgtgt 240 aggtcagtat agctactata tatgtcctgc cgaagctgga cattccggac tccttctaac 300 actattctct attggtaata cggaatccaa cgacaacgac gtacaagcac ctggaaatcc 360 cctggaatat tcattggatc ggaaacaacc ctatgccctg gaacacactc tttcggattg 420 gaaggttcac attcgaccgc tttgaagaca tcattggact tcggacttgc gctggacctg 480 ttggcagtaa cattttctaa ggtaactgca tgaatgtacg cgcggccccc cggcggccgg 540 ggccgccggg ccgggggggg ggcccgcgcg gggccgcccg gccccgggcg gcccgtatgt 600 atatgtccag ccggagcaat tcgcgcaagt actaacacca cacttcgatt ttcctatgcg 660 ttgagaacca ctggaagcaa caaggattac gtcatcgggg atacccttgc atattcatca 720 ttcgtcatcg aaggcaacaa cattcgacta cttcgggaca tcgagcacgt gcgtttggta 780 cgacgtcatc attacaacgt ggtcaccgag agaggctctc tggcatatca tctacatcgg 840 caacggcttc gtcggactat cgagatatcg cttgctattg gtgtcgggcg agtgaaacac 900 catacggaag gcctgcacaa cacatcgatc gaagaggagg ccagcgtaaa tggtctgtca 960 ggttagtgaa caatagtatg tgttcagcca gcctggacat tgcagactat atacacattt 1020 taattatata atcctcttcg accggtattg ttgttgaccg tttgcacact gcatgagccc 1080 tgagaacaaa gaacttgttt acgtctggat cgaaaacttg tttacgtctg gatcgagatt 1140 cggtacctat tctgctattt tgcagacaac tttcctgtaa ggttagttac tgcagtatat 1200 ggccagccga ggccgactca ttttgatact acatgaatct atccccttac acatgtttgc 1260 ttgatcgaga ttgtattgga tttggactga ttttacctac cgtactgctg cactaccgcc 1320 cccgcggccc ccggcggccg gccccggccg ccccggcgcc ccgcgccccg gccccccgcc 1380 ggggcgcggg ccccgccggc gcggggccgc gcgcccccgg ggcggccgcc gcgccgggcc 1440 ggccgggcgc gccccggccg ggccggggcc cggggccccg ggcgcgcggc gccgcgccgc 1500 cggcccgccg gggggcgggg cgcgcgccgc cgcccgggcc ccgccgcccc gcgcgggccg 1560 ccgccgccgg cgggccggcg ccgccggccg ccggccggcg ccgcgcccgc cccgcccgcc 1620 cgcgccggcc gccgcggcgg cggcccccgc ccgggggcgg tacttcgtga gaaggccaac 1680 agcgcaccgt agcttttgtg aaggtttgtg ctacagtata tggcctgccg aagttaggtt 1740 tattttggat actacattgc ttattttgca ctctttgatc actgctgttc gagtcgtcac 1800 cgtggatgcg acacttgatc gatcggttgg actgctaatg cgatttggat atattttgga 1860 attggacttg gattttcttc agcatttcga aaccttttgg tcgaagtgac tgttccgtca 1920 ggtcagtctt actcagtata tgtctagccg aggctagaca tttattttgc tactacacct 1980 gaatcttttt tcgatttacc ccgaacgtgt gagaaccaac aatcatcatg cttggagctg 2040 ggaagtcgcc gagcctacga gcattgcagc tcaagctcaa ggaggtcctg ctgatgttca 2100 atgatattca gcatttcgtc tcaaaatata gagcaggtac tagcgcagtt cagattcagg 2160 ttagattaag caaggtagat gagttgtgga aaatgtttag cgataccttg gttgagatta 2220 tgtcgcatga ggattatgta gcagatgagg aaaagtatgc taaggaaaga cagcagttta 2280 gcgctcagta ttataacatt aaggcattcc ttttggagaa attagagcag atccaacgag 2340 cggcaatgga ccattcaact aatgtccatg acagctctac gttgaacata tcggaacacg 2400 ttaggttacc gcagatcaaa ttgcaaacct tcaagggaga tatagacgaa tggattagtt 2460 ttcgagattt attcatttcg ctcattcatt ctcgtacgga tttacctgat gtcgagaagt 2520 tacattattt gaaggggtgc ttgatgggtg aacccaaagg attaatcgac cctctagcaa 2580 ttaccgcagc caattacaac gttgcttgga acctgatcac taaacgttac gataacagca 2640 agcaacttag gaaacgacaa gtccaagccc ttttgaatat gccctatatt gccaaggaat 2700 cggtttcgga actgcaaact ttagtcgaag gttttgagag gattgtccag acccttgacc 2760 aggttgttga ggtagcggaa tacaaggatt tacttttggt caacatgctg tctagtcgat 2820 tagaccctat tactcgtcgg tcatgggagg aacagtcttc tacaaaggac aaggatacgt 2880 tcaaggattt ggttgatttc ttgcaacgtc gtatacaagt attggcatcg ataccggcac 2940 gatcatcgga ctccaaggtt tcatcaacca atgcatcacg gcagagggcc gtggtgaagg 3000 ctagcttcaa ttctgcgtca caatcatcta ccagttgcgc atgctgttcg gagaaccatt 3060 tcttgtatag ctgtccggca ttccaaggac tgtcagttag ggatcgcgag ggtgtgctac 3120 ggaagcactc gttatgccga aactgcttca gatttggtca ccaagggaag gattgtcggt 3180 cgaagtactc ctgtcgaaca tgtaaggggc gacatcatac actcgtttgc ttcaagcaaa 3240 atgcggacac tgtttctacg gttggaaata gtgcttcaaa ttgtcctcca gataacggag 3300 agccatcatg tcctggaacg aatacggtag cgagcgtggc gacttcggag atatctgtat 3360 gcaatgcggc aggggcagca tcgcaagttt tcttggctac aacgttaata cttgtcgagg 3420 acaacgaagg caaccaatac ggagcgcgtg cgcttttgga ctctggatcc gaatgcaact 3480 ttgtttcgga aaggctttgt cagcttctca aggtggttcg atcaaagact gacgtcaggg 3540 tacaaggaat cggtcaaact acatcactgg tgaaacatca ggttcagatc aaggttcgat 3600 caagggtttc atcattcgct cggtcggtga attttctggt tttaccgaag gtaaccgtca 3660 accttcctac tactacggtg gatactcggg aattggtcat tcctgaaggc gtacggctag 3720 cggacccagg tttcttcaaa tcaagtggag tagacatggt tctaggtatt gaactcttct 3780 ttagcttgtt cgatatcgat cgcaaaatat cgctcggaga caatatgccc acgttgatcg 3840 attcggtgtt tggttgggtg gtgtgcggca ccatggcaat gcccaaggcg atatcccagg 3900 tcaactgtaa tctttcggtg aaggatcgtc ttgaaaactt actcacacgg ttttgggaat 3960 gcgaagaggt cgcagatcaa atcaaatctt cacctgaaga gacgctttgc gaggaacaat 4020 atcaggaatc agtcaaaaga gctgaagacg gtagatacat cgttggtctt cccatggacg 4080 gtgttaaact gcagcgactt ggcgagtcaa gggacatcgc cttgcgtcgt ttctacagta 4140 cggaacgaag attggatagg gattccaatt tacgtcagca atatgtagca ttcatggacg 4200 aatacctaca gctacgacat atgaaggagg ttcaggaaat agcaagcgaa ggtagcaaac 4260 ggtgttacct accacaccat cctgtagtga aggaggcgtc cacgacaacc aaggtacggg 4320 tcgtgtttga cgcatcctgt aggacgtcca atggcatatc actcaacgat gcgctctatg 4380 ctggtccggt tattcaggag gatttacgag cgatcatcat tcgtgggcga actcggcaga 4440 tcatggttgt ggcagacgtg gagaagatgt tccgccaaat ttggatacgt ccggaagata 4500 ggcatcttca atgcattctt tggcgttctt cacccatgga agacgtcaga acctatgagc 4560 taaatacggt tacctacgga accaaacctg ctccatttct ggctacgagg actctacagc 4620 aacttgccac cgacgaaggt cgtcggtttc catcggcagc agtcgctgtg agagaagaca 4680 cttacatgga cgatgtaatc actggagcgg ataacgtgga ggatgcaaca agattgcgtt 4740 gcgaactaca agatatgatg gaagcaggtg gttttcatct ccgaaaatgg gcgtccaatt 4800 gttcaaaggt gttagagggc gtccccgtcg aaaatttggc tattcccaac accgaagaaa 4860 tcactctcga gtcgaatcca tcgattacta cacttggcct ggtgtgggtg ccaggaacgg 4920 acaaattaag attcaaattc caagttccac aacttaatcc agcagaattg ttgacaaaac 4980 ggaaaatact gtctacgatc gccacacttt tcgatcccac tggactcata ggagctgtga 5040 tcgtagaagc aaaaatattc atgcagcgtt tatggacgat agaaggtgaa gatggtcaac 5100 gactggattg ggatcagccg gtaccgccga aggtgggcga ggaatggtgc aaattctatg 5160 aacatctacc acggttgcaa gaggtcgaag tccatcggtg tgtaatcatc ccagctgcag 5220 tcattgtaga ggttcattgt ttttccgacg cttcaatgaa ggcttatggg gcttgcatat 5280 atttgcggag tcaagattcg tcaggaaata ttcttgtacg gctgttggca tcacgttcca 5340 aggtggcacc tttaaaaacc caatctctac caaggttaga actttgtggg gcgctacttg 5400 gaactcaact ctgtcagaag gtgcgtgaag caattcgttt caaaggagag attttctttt 5460 ggaccgactc aacatgtgtc ttgcgctgga tagccgcaac tccggcgata tggacaacat 5520 ttgtggctaa tagggtgtcc aaaattcaag stctatctga tggatgtcat tggggacatg 5580 tgaagggtac cgagaatcct gcggacctga tttcccgcgg gctgaacgcc tccaatatca 5640 tcgataacgc tctgtggtgg tcgggacctt tctggcttca acatccccag gaacagtggc 5700 cggtaagaga tgcgggaacc aacgaagaag gagaggaaga acgacgtcga tgtgctacag 5760 ctaataccgt gtcaacgtgt gatgagttca accagtggta cattgctaag tttgattcgt 5820 acgacgattt gattcggcgt accgcttatt ggttgagaat gatggaatta ctcaagcaac 5880 cgaaagatca gaggaaaaac gaggagtttt tgactatcgt cgaaaggaaa agggccgaac 5940 aggttctcat tcgtcgtgtt caaatggaag cgtttccaga agaatggaag gcgttgtcga 6000 aagggtcgtc tgtgtctgct aaatcaccgt taagatggtt ccacccctac atctccaaag 6060 aggatggact tctgcgcatt ggaggaaggc tcaaccattc acaagagcct gaaaataaga 6120 agcacccacc aattcttcca gcccgtcata tgttcactcg gaaacttctc cagtcgttcc 6180 atgagagact tttgcacgct gggcctcagt tgttgttagc aacagtaagg cttcattatt 6240 ggcctttagg tggaagaagt cttgctcgtc agttggtaca taaatgcctt agatgctatc 6300 ggtcgcaacc atcgactgtc caacaattta tgggagattt accatcgtca agggtgactg 6360 tatcgagacc attttctcgg gttggagtgg actattttgg tcctgtctat gtaagacttg 6420 gtccaaggcg aaccgctgta aaggcatacg tggctctttt tgtatgtttt tgcaccaagg 6480 ccatccatat ggaattggtg acagatttgt ccacggagag gtttctccag gccttgagac 6540 gatttatttc acgcagaggg aaaccaagcg acattttctc cgataacgga accaatttcg 6600 ttggagcgag aaatcagctg accgaattga tggctttgcc gagaaataga gactgccacg 6660 agaagatatc caaggaatgt aacgcagagg gtatacagtg gcattttagt ccccccagtg 6720 ggccacattt tggcggcctc tgggaagccg cagtacgttc tgcgaaaaaa catctgctca 6780 aggtgcttgg cgaaaacgtt ttgaccttcg aggatatgaa tactctcatt gttcaaatag 6840 agagttgctt gaactcacgg cctttaacac agttaaccga agatcctgaa gatttaactc 6900 ctttaacgcc agcccatttc ctcgttgggt catcactaca aggccttccc gagtatgact 6960 acactgaaat cccctttaac cgacttaaac aatggcacgc tgtccaacaa aaatttcaac 7020 atttctggat aagatggaag agggagtatt tagcacaatt gcaaggcagg atgaaaaggt 7080 ggaaaccacc tgtagagatc gagactggga aactagtcat tatccaagat gataaccaac 7140 ccccattacg ttggaagatg gctaggattc atcagcttca ccctggtgat gatggggtcg 7200 tkagggtagt cactcttaag actagcaatg gatatttgaa acgtcctgtt gaaaggattt 7260 gcctgctccc gttaccaaat caagatgttg tagtccaacc atgaactcca cacgtaactc 7320 gcagcgattc tggtcgaaga ggttcttata ttttcagaaa tcaccgtgct ttcaggatgg 7380 gcgagca 7387 // ID BEL2-I_AP repbase; DNA; INV; 2265 BP. XX AC Contig56303; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2AP; BEL2-I_AP; KW BEL2-LTR_AP. XX NM BEL2-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2265 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 431-431 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR Genome; Contig56303; Positions 2811 547. XX CC Positions [1153-1719] - Integrase core CC LTRs are 96% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 562..2064 FT /product="BEL2-I_AP_1p" FT /translation="MVPEKEVEWFTRFSSLRRMQRVTAIMYRFIRHTRTKR FT VPYVAPKYIHDPISDEEISNAMLPIIRMTQSLHFVSLLRILQVPTTKIVPR FT SIAQLAPFVDKTNIIRVGGRLRNSCVAAETKNPILLPKSSTLTTLIIRHFH FT LNHFHAGPQLTSSLISSCYWIISSRSAIRYVVFRCVVCARHRASSIHPMMA FT DLPSSRVTLCRPFLHVGIDFAGPLIIAEGRRKNARSIKCYLSIFVCMTIKA FT VHIEVVSDLSTSTFLAALQRFVARRGTPSDIYTDCGTNFKGAEQQLRHMML FT DATAKTTYTNAILCKWHFNPPAAPHFGGLWEAAVKSTKYHLKRVIGTQRLT FT FEEMVTLTSRIEALLNSRPITPLSADPNDYRALTPGHFLIGHPMVEVPEKD FT VIDIPQNRLNRWELLRQMYQSLWKRWSTEYLSSLQRRTKWVDNQPNVKIND FT LVLINMPNQPPIHWKLGRIQQVHPRADGVVRVATVRTEHGTLTRPIVKLAI FT LPPDN" XX SQ Sequence 2265 BP; 676 A; 517 C; 414 G; 658 T; 0 other; ctgagaaata tcactactat tttataccgt ttattttgcg ttattattat tatagtaatc 60 tctcttaata cctcacagtg accccataag gctcatcatc accggtaaaa ttatattcat 120 cgcgtggcca acaaaaacca cgttgaccat accacgcttg gagctgtgcg gtgcattatt 180 attagcacaa gtgctacatc gactaacaaa tacattccac ggtaatatat tcatctcagc 240 aacactagcc tggacatact caaccatagt gttatcatgg ctcacctcag tcgagtttta 300 aaatattcgt aaccaacaga cttgcaaaaa tcgccgaaat cctaccaact tgtcaatggc 360 gacatgttgt ttcggattca aacccagcgg actgtgtttc aagaggttta ttcccatcac 420 agattcatga ccaacattta tattggcagg gacctccgtt tttgaaatta ccggaatctg 480 aatggccaat tacatcgttc aaaccaattc aaccatcaca cttgccagat tattcggatt 540 caacaaaacg cgtgtttagt aatggtgccc gaaaaggaag ttgaatggtt cacacgtttt 600 tcgtcattaa gacgtatgca acgagtcaca gccataatgt atcggtttat aagacacacg 660 cgcacgaaac gcgttccata tgtcgcaccc aaatatatac atgaccctat ctccgatgaa 720 gaaatttcaa atgcgatgtt accgattatt cgtatgacac aatcgttaca ctttgtaagt 780 ttgttgagga tcctacaagt acctactaca aagatagtcc ctcgttcaat tgcccaactt 840 gcgccgtttg ttgataaaac caacattatt cgggtgggtg gacgtctccg aaattcatgt 900 gtagcagcag aaactaaaaa cccaatatta ttaccaaaat caagtacgtt gacgacactc 960 attattcgcc atttccactt gaatcatttc cacgctggac cccaattaac gtcatcgcta 1020 atatcaagtt gttattggat aatatccagt cgatctgcca tacgttacgt tgtcttcaga 1080 tgcgttgtgt gtgctcgcca tagggcatcg agtattcatc ctatgatggc cgacctccct 1140 tctagtcgag taacattatg ccgaccattt ttgcatgttg gtatcgactt tgctggacca 1200 cttatcatag ccgaaggccg tcgtaagaac gcacgatcga taaaatgtta cctttctatt 1260 ttcgtgtgta tgactatcaa agctgtacac attgaagtgg tttcagacct ttcaacaagt 1320 acgtttttgg ctgctttgca acgctttgtc gctcgacgtg gtaccccatc agacatttac 1380 actgactgtg gaacaaactt taaaggagca gaacaacaat tacgtcatat gatgttagat 1440 gccaccgcca aaacaacata tacaaatgca attttatgta agtggcattt taaccctcct 1500 gcagccccac attttggtgg cctgtgggag gctgctgtaa aatccacaaa atatcattta 1560 aaacgagtta ttggaactca acggttaacg tttgaagaaa tggtcacatt aacaagtcgg 1620 attgaggctc tacttaattc tcgcccgata actccattgt ctgctgatcc taatgattat 1680 cgtgcactaa cgccaggaca ctttcttatt ggtcatccaa tggttgaagt accagaaaag 1740 gacgtaatcg acatacctca aaaccgcctc aatcgttggg aactcctacg acaaatgtat 1800 cagtcactct ggaaaagatg gtcaacagag tacttatcat cgttgcaacg acgcacaaaa 1860 tgggtagaca atcaaccaaa cgtcaagatt aacgacttag tattgataaa catgccgaat 1920 caaccaccta tacactggaa actaggtcgc atacaacaag ttcatccgcg cgcagatggc 1980 gtggtgcgtg tagccacggt gcgtactgaa catggcaccc taacacgacc aatagtcaaa 2040 ctagccatcc ttcctcctga taattgatta ttcctgttcc tataattttc ctgttgatgt 2100 gggtgaatta tttttattat tattttatat tgtaaattcg tgatttctat atatattata 2160 tttgtattat gtaaattatt accacacgat gcgggaccgt gaagaggacc aatacaccat 2220 caagtattat tgaaagctgg attacccttt caatggcggg agtga 2265 // ID Gypsy-229_AA-I repbase; DNA; INV; 5523 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-229_AA_; KW Gypsy-229_AA-LTR; Gypsy-229_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5523 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1061-1061 (2011). XX DR [2] (Consensus) XX CC 'GTCC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1137..4877 FT /product="Gypsy-229_AA-I_1p" FT /translation="MAQSNINSTLEPYRKGNSFGDWIERLGFFFNMNKVPD FT EEKRDHFITLSGPTIFRELKLLFPNTNLAEVPYTEMITKLKARLDKTESDL FT VQRLKFNLRVQQPDESLEDFVLSVKLQAEFCNFENFKQMAIRDRIVAGIRD FT KSLQQRLLNEEKLTLETAEKLIATWEIARNNAKNMEYSNNVDQIASLKGFP FT GARLNKLAATMELAGSANANVGADRGRGSVKNRLGYSPYHKDEWKQKQNRL FT WRNRGQDKERNRQSNRPDYSQLICDFCGVKGHIKRKCFKLKNMHRDAVNMV FT NQNTSGSSPDDFLSDMVNRMRTDSESENETDGENRNNIFQCLHVSCISKLS FT EPCLISVEIENVLIDMEVDCGSSVTVMSKKQYFESFSKDLMKSQRNLVVVN FT GANLGIEGEVDVFVKFKGISRNMRLLVLNCDNNFTPLLGRPWLDTLFPEWR FT NFFVNSINFKHETDQIVDEIKIKYKDVFIKNFSKPIQGFEADLVLKSEVPI FT FKKAYDVPYRLRDKVLEYLSKLEHEKVITPIKTSEWASPVIVVMKKNNEIR FT LVIDCKVSVNKFIIPNTYPLPTAQDVFAGLAGCKLFCSLDLEGAYTQLSLS FT ERSKKFMVINTLKGLYRYNRLPQGASSSAAIFQQVMEQILNGIEHVSVYLD FT DVLIAGKDLDDCKQKLVIVLERLQKANIKVNWDKCKFFVTELVHLGHVISE FT NGLMPCQDKISTIKQAKVPRNETELKSFLGLINYYHKFIPNLSSKLYYLYN FT LLKTEVTFHWDSNCDEAFEGSKKALVEARFLEFYDPDKPIVIVSDASGYGL FT GGVMAHLIEGVEKPIYFTSFSLNSAQQKYPILHLEALALVCTVKKFHKFLY FT GKKFFVYTDHKPLVAIFGREGRNSIYATRLQRFVLELSIYEFEIQYRPSKH FT MGNADFCSRFPLDNVVPADCDVELVNSINFGREIPIDFVKIACKTKEDEFL FT QNIISFMTNGWPAKVNKQFIDVYANQQDLELIDGCLLYQNRVVIPASLKKS FT VLNLLHANHAGIVKMKRLARRNVYWFGINSEIENFVTACDICNSMMIVPKT FT KTESEWIPTTRPFSRIHIDFFYFEHRTXLLAVDSYSKWVEVEXMXNGTDCD FT KVLRKLVAIFARXGLPDVLVSDGGPPFNAYSFVDFLKRQGINVLKSPPYNP FT SSNGQAERLVRTVKDVLKKFLLDXEFSHLNVEDQINLFLINYRNNCLNIEG FT NNPSQKIFCYKPKTFLDLLNPKSHYKNFYERPKCMIT" XX SQ Sequence 5523 BP; 1798 A; 888 C; 1155 G; 1676 T; 6 other; gtccatttta tataagttcc ggtttacgga catttcagtt ggcgacgagt gtgaaaggga 60 atcatattag tgcaaagtgt tggaaagtga agccatcgga ctagcagcag aagtgtttaa 120 tcaatagcag aacacttgga ccgacgctat ttatacggac gccattctta cgacggattt 180 tctttgcctc gattggcggt agtcagcagc gcgctagtgc aacagactgc atagtgcacc 240 aattgctgga aggctttgta gaagagccgt tgtgacgtac ggcattcgtc ggcaattaat 300 tgaccaaaat caatctggtg aggtaagata gttttggtga tttgtaattc ttagtggctt 360 ggtttagcgc cattttgaag accgaacaat tgggtagaac cataaaagaa ctccatctct 420 ttttcatcaa cagaacaata ccttaaggaa ttgaaaagaa atttttctta acaattacat 480 tgttttattc tttttattaa gagtgatttt cttttattta aatagacagt tggtatagtt 540 gaatccatat ttatagaccc taaactaaat taactataca tgaactactt tactgttgaa 600 aggatctcca aatctcagtg tatgaataat gtatgaaact atctaaagaa acaatcaata 660 aaatgaaatg aaatgaaatg aaatgaaatg aaataaatag acagtttctt ctggagtatt 720 cagcagttgt tgccatcatt gttgagcgtt attgaaccgg cgtctgtagg aaattgggta 780 gttttccaac cgcacgacta gactgaactg gttcgatcaa catcggttcc tatcttacca 840 atcgttctgg ctatcatcga tttcggctcc tatcaatcgt ttcggcgcac ggtaggtcta 900 aaatacgtca actgacggct tgcgtggttg ggtcgtcgaa gggcgaagtc cgcaagttgc 960 aaactccaac tcctaccccg atcggcaaag gctgaaggac attttcggtg agttcaattt 1020 tctttttcaa cctttgggaa ccatatagtg catctttttg taagaaacga ttgttttatt 1080 ctagaaataa ttcgttcaat tgccgttgat tattagtttt tgtgattttc aataacatgg 1140 ctcaatccaa cattaatagc acgttggagc cttaccgtaa gggcaattct ttcggggatt 1200 ggattgaacg tttgggcttc tttttcaaca tgaataaagt ccccgacgaa gaaaaacggg 1260 accattttat tactctgagt ggtcctacaa tttttaggga gctcaaattg ttatttccaa 1320 atactaattt agcggaagtt ccttacaccg aaatgataac caagttaaag gcacgccttg 1380 acaagacgga atctgatctt gtgcaaagat tgaaatttaa cttgagagta cagcaaccgg 1440 atgaatcatt agaagatttc gtgttgtctg ttaagcttca agcagaattt tgcaactttg 1500 aaaatttcaa acaaatggct attcgtgacc gcattgttgc tggtatccga gacaaatctc 1560 tccaacaaag gttgttgaat gaagagaaat taacgcttga aactgcagaa aaactcattg 1620 caacttggga aattgcgagg aataatgcga aaaacatgga gtacagcaac aatgtggatc 1680 aaatagcttc cttaaaaggt ttccctgggg caagacttaa caaattggca gctacgatgg 1740 agttggcagg aagtgcgaat gcaaatgtag gggctgaccg aggccgtggg tcggtcaaaa 1800 atagattggg ttattctccg tatcataaag atgaatggaa gcagaagcag aatagactat 1860 ggcggaatcg tggacaggac aaggagagaa atcgtcaatc aaaccgccct gattattctc 1920 aactgatttg cgacttttgt ggtgtcaaag ggcacattaa acgaaaatgt tttaaattga 1980 agaatatgca tagggacgcc gtgaacatgg ttaatcagaa cacttctgga tccagcccgg 2040 acgatttcct aagcgatatg gtgaacagga tgcgaacgga ttcagagagt gagaatgaaa 2100 cagacggtga gaatagaaac aatatttttc aatgtttgca tgtgtcgtgt attagtaaat 2160 taagcgagcc ttgtttaata agtgtagaaa ttgagaatgt tttgattgat atggaagtgg 2220 attgtggatc gtcagtaacc gttatgagta aaaagcagta ttttgaaagt ttttcaaaag 2280 atttaatgaa aagccaaaga aatttggtag ttgttaatgg agcaaatctc ggtatagagg 2340 gagaagtgga cgtttttgta aaatttaagg gaatttcgag aaatatgagg cttttagtgt 2400 taaattgcga caataatttt acccctttat tagggagacc gtggttggac acattatttc 2460 cggaatggag gaactttttt gtgaattcta taaatttcaa gcatgagacg gaccaaatag 2520 tagatgaaat caaaataaag tataaagatg tttttattaa aaacttttct aaaccaatcc 2580 aaggttttga agccgatttg gttttaaaat cggaggtacc gattttcaag aaagcctacg 2640 atgtaccata ccgattacgt gataaagttc ttgaatacct atccaaatta gaacacgaaa 2700 aagtaataac accaattaag acaagtgaat gggcttctcc ggtaatagtc gtgatgaaaa 2760 aaaataatga aataaggctg gtaatagact gtaaagtttc tgtcaacaaa tttattatcc 2820 caaatactta tccgctgcca acagctcaag acgtttttgc tggtttggct ggctgtaagt 2880 tattttgttc attggatctt gaaggtgctt atacacagtt gtccttgtca gagagatcaa 2940 agaagtttat ggtaataaac actctaaaag gactttatag atacaatcgt ttaccacagg 3000 gtgcttcatc cagcgctgca atttttcagc aagtgatgga gcagattcta aatggaattg 3060 aacatgtttc ggtttacttg gatgatgtgt tgattgctgg gaaagatttg gatgactgta 3120 agcaaaagtt agttatagtc cttgagcggc ttcaaaaggc caacattaaa gtgaattggg 3180 acaaatgcaa gttttttgta acagaattag ttcatttagg tcacgtaatt agtgaaaatg 3240 gattaatgcc ttgtcaagac aaaatttcca ccatcaaaca agcgaaagta ccaagaaatg 3300 aaactgaact aaaatctttc ctaggattaa tcaactatta ccataaattc attccaaatt 3360 tatcttctaa gctttattat ttgtataatt tattaaaaac tgaagttaca tttcactggg 3420 attccaactg cgacgaagca tttgaaggca gcaaaaaagc attggtagaa gctcgttttc 3480 tagagtttta cgacccagat aaaccaatag taattgtttc tgatgcttca ggatatggtt 3540 taggcggagt tatggcgcat ttgattgaag gagttgaaaa accaatatat tttacatctt 3600 tttcactaaa ttctgctcaa caaaaatatc ctatcctaca tttagaagca cttgctttag 3660 tctgcactgt aaaaaagttt cataaatttt tatatggtaa aaagtttttc gtttatacgg 3720 atcataaacc actagttgca atatttggta gagaagggag aaattcaatt tacgctacaa 3780 gactgcagag atttgtgcta gagttatcga tttacgaatt tgaaatacaa tataggccct 3840 caaagcatat gggaaatgct gatttttgtt cacgtttccc tttggacaac gtagttcccg 3900 ctgattgtga cgttgagcta gtaaacagta ttaactttgg tagggaaatc cctattgatt 3960 ttgtcaagat tgcttgcaaa actaaggagg atgagttttt acagaatatt atttcgttta 4020 tgacaaatgg atggccggca aaagtaaaca aacaatttat tgacgtatat gcaaatcagc 4080 aggatttgga gttaatagac ggttgtcttc tatatcaaaa tagagtagta ataccagcgt 4140 cgctaaaaaa gagcgtttta aatcttttgc atgcaaacca tgcgggaata gtaaaaatga 4200 aacgacttgc taggcgaaat gtgtattggt ttggtatcaa ctctgaaatt gaaaattttg 4260 tgacagcttg cgatatttgc aatagcatga tgatagttcc gaaaacaaag acagaatccg 4320 agtggatacc aacgacgagg ccttttagta ggatacacat tgattttttt tactttgagc 4380 accgtacttw tttattagcg gttgatagtt attctaaatg ggtcgaggta gaatkgatga 4440 gmaatggtac agattgtgat aaagttttaa gaaagttagt agcaattttt gctagatwcg 4500 gattgcctga cgtkctggta tcagacggtg gtcctccatt caatgcttat agttttgttg 4560 actttttgaa gagacaggga atcaatgttc taaaaagtcc accatacaac ccttccagca 4620 atgggcaggc tgaaaggttg gtaagaaccg taaaagatgt attgaaaaag tttctacttg 4680 accsagagtt ctctcattta aatgtggagg accagattaa cttgtttcta attaactaca 4740 gaaacaattg tcttaatatt gagggaaata atccttcaca aaaaattttt tgttacaagc 4800 ccaaaacttt cttagatttg ttaaatccaa aatctcatta caaaaatttt tacgagaggc 4860 caaagtgcat gataacctga caacttttaa aaatgcagtt ccagagcatg atgatattat 4920 cataaagaaa gatggaaatg tcccgaataa attaactgat ccttttgaaa atctgacgcc 4980 tggagacgaa atatggtata aaaaccataa ccctcatcat acggctaagt ggcttaagtc 5040 aattttcatt aaaaggcact ctcgtaatac ttttcaggtg caaattggaa gcgtgctcac 5100 catggcacac agcaatcaac ttcgcgtacc taagtgtggg gactcctata ataggcctaa 5160 cgttcgttta attcggcaac agccgatgct tggcactact agccgggagg agttttatgg 5220 cttcacagca gacgagctta gacgtgacag gaaaaggaga tacccggttg agaattcaac 5280 gcatgatcca gataaattgc cggagatgag tcttaggcgt tcaaaaagac caagaaaagc 5340 aaaaactgat gaagatttta tttatttgta ttaattttta gttgaaatat gctgatctct 5400 tttctgagtg ttaattatta aaagttattg taattttcga tttctttgct ctgatatatg 5460 aactaaaaac attgtaaata atttgaattt ataacttcca aagggggaag gactgaggtg 5520 tcc 5523 // ID SIRE_TC repbase; DNA; INV; 867 BP. XX AC AF227592; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Trypanosoma cruzi clone SINE related to VIPER retroelement. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; AF227592; KW LTR; SIRE repeat region; SIRE_TC; retroelement. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Vazquez M., Ben-Dov C., Lorenzi H., Moore T., Schijman A. RA and Levin J.M.; RT "The short interspersed repetitive element of Trypanosoma cruzi, RT SIRE, is part of VIPER, an unusual retroelement related to long RT terminal repeat retrotransposons."; RL Proc. Natl. Acad. Sci. U.S.A 97(5), 2128-2133 (2000). XX DR Genbank; AF227592; Positions 1 867. XX SQ Sequence 867 BP; 235 A; 113 C; 221 G; 188 T; 110 other; ggggggtttn tttggggaan nccccnaaaa tttttaaaac cncnagagnt tgngnggnac 60 nccttttaaa ttnanaanga gaggggtgca attccccnng ggtttgagaa nggggaaana 120 cggaaaattt ngngtttaga aggaaaaana ttgtntttta tnnaancacg gggggggggn 180 aaaatttttt tttggggggg gggggnaaaa naaggngnat ntttnccccg ggggggggan 240 anaanagggn aaagnnattg nganggggcg ggaaacgaaa atnaaaaatn ttanaattaa 300 ataaatgtgg aacccattcc naaacaaaaa ccattctttt ggaattcnna aancaaancg 360 gaggggntcc aanggtcggg angaaccccn nantnggttn attttttaaa agaangnatt 420 ngataaaana ggtaaaaann naatgnatng agggggnnag gtttgtacaa aggtnnaggt 480 cccctccgaa tagngtgagc gggccgtcga cagtggttnt cggatggaaa gataananan 540 acacttcnga gcttttanag tngaggncaa tgtgggaanc aantttcgtt ccattantcc 600 ccnnangant cggatctggt ttgnntncaa nnngaganaa agtgaggagg cgaccagggg 660 cccagaaggg gatnttatcg tgnatnttcc gaagggttcg acttaaggaa ccggggagcc 720 aacanngggg ggatgaaatt atgtatttgt ctttaaccat ccgacccacc ttttttcatt 780 ttttatnctt gcggcgacgt ccatnttggt tggaggaccc caaanttngc canttcnncn 840 gganngattt ttatgagaag nananta 867 // ID Gypsy-41_DWil-LTR repbase; DNA; INV; 442 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_DWil_; KW Gypsy-41_DWil-I; Gypsy-41_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-442 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 1994457 1994898. XX SQ Sequence 442 BP; 151 A; 97 C; 71 G; 123 T; 0 other; tgtagcatgt tcacatatcg aatgatcact gtacctagtt acatacatac ataattcata 60 aataatatca aaaacatatt gtagcacttt gtcacaatat ttatgtatac acattcaagt 120 accatgcgtt tcccgtaaca ttgcattgta ctggacagtg ctgagtcgtc atatctaaca 180 cgcgcaacga ctcaggcacc aagtgaacat gatacgtcct cactctcccg cagcgaacca 240 tacataatta tgtaaacata tatacataag cgtaccgctc ctctgctggc ttcactggca 300 ccgcagtcac gcggacttta gctatactga acaattgtga agatttaaga attcattcaa 360 taaacagaac actaagtgaa aacaccagca actaagcgaa aatataattg ttgtattatt 420 gtgaattgtg ggacatagta ca 442 // ID BEL-166_AA-I repbase; DNA; INV; 6278 BP. XX AC supercont1.390; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-166_AA_; KW BEL-166_AA-LTR; BEL-166_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6278 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.390; Positions 477836 471559. XX CC 'TCTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 140..3775 FT /product="BEL-166_AA-I_2p" FT /translation="MMVGHAKTCGVCTNGGTTASMMGCDMCDSWFHPNCIG FT ASETNIEPDKTWRCSRCATDGEVTEPASACSLKSNRSSASSQARKQLLIQQ FT LEEQRALKLKQRAEEDEIRKRRAEEDEAYLQQKLNIILEEDDDEEKMSRLS FT SRASRKKVLDWLTNEQSIGKAPVTGSSIINVNQTHGTPVTVSGLQIPGAVV FT TTSSTMVVPASTSTPNSETNGAPPPVSHAIVPTEATTTMLDARKPMLAFPV FT TRAGITGYTAPPEHQVSGTFGNIANHPEIHSTKTRVRPALPAIPTGSNYAF FT ESIPSIMPLTTQFATGLSLRSLPAGQPSMGFSAGAPAIQSTTIFSGDVASR FT FNQIPPLSDPTAAVHSRPTIISQDPVPPRPTPSSVTYQPNHVSQPGVGDEQ FT QQLGFPYQSVPTSAQLAARQVMPRELPVFSGDPQDWPLFSSSFYNTTAACG FT FTDAENLARLQRCLKGHALDSVKSRLLMPESVPHVMETLRKLYGRPEVLIH FT TLMQRLREVSSPKAENLQSIINFGMAVRNLVDHMFVAQLVDHLRNPMLLHE FT LVEKLPPQLKMQWAWYKRMQTDVNLATFGEFMTELVDTASDVTLPSQTQLQ FT QSLNKAGRDNHKLYAHAEAEGNPNMTATSHLRRSQVAESSKRACWYCSDEA FT HEVAGCPQFKSLDLDGRWKVIRSKGLCRTCLIPHRRWPCRSLKECGVDGCR FT LHHHALLHSPAASNVAVDHHLALQNHHSTLSICLFRYVPVTLENNGKKVET FT FAFLDDGCQTTLMETELATELEIYGPTETLWLGWTGNISREERGSQRVNVN FT ISGRGLKSKFKLSNVRTVQQLKLQGQTLDYDELQKRYPHMRGLPLYSYVDA FT TPRIIIGIEHVQLLTTLKAREGRPNEPVAVKTRLGWCVYGKQAGECESVER FT LHIHTEKHIENHELHELMKQFFEVDEAAVASPIESADDIRARNILERTTRR FT IEGGFETGLLWKYDQPTFPDSYPLALHRLESLERRFNKDPELRERVVSLIH FT EYEKKGYAHKITTEELESTESNRVWYLPIGVVRNPRKPEKIRLIWDAAARV FT RGLCFNDMLLKGPDMLTSLFDVLLRFRQRSIAICGDIREMFHQVRIISSDK FT QSQRFLFRKHQSEQPQIYVMDVATFGAKCSPCSSQAVKNKNAEEFAAEYPR FT AAEAIINAHYVDDYLDSVDTVEEAVQLVTDVKHVHAKGGFEIRNFSSNSVD FT VL" FT CDS 4237..6276 FT /product="BEL-166_AA-I_1p" FT /translation="MRGLFANCTSRDSSLHSGCSQNQSGSTKTTLHSSAGA FT TSWRNRLSSVEHNLYRIGPTDRVIWTDSATVLSWIKSDSRRYHPFVAFRVG FT EILNSTNVDEWHHVPSKLNVADDATKWGSGPDFNPNSRYYVGENFLYLPMD FT EWPKQTKQKYITKEELRTVFHLHREIPSPLINVSRFSNWNRLVRSTAYVLR FT SLKKMRGAKLFGELTSDELLQAENFLWRQVQLEVYPDEYYSLQYNKQHPDD FT TPKSIDKSSPLYQDSLFLDDAGVIRMFSRIGAAPTAPYEAKYPIILPKDHL FT LTLLLVDSYHRRFVHINHETVLNEVRRRFRIPQLRRVIKRVANSCQRCKVK FT KATPRPPMMAPLPEFRLTPNIRAFSYTGIDYFGPLLVKVGRSLAKRWVALF FT TCLTTRAVHLEVIHSLSTKACIMAIRRFVARRGSPAAIYSDNGTCFKGASN FT ILANQIQGIHEDCAVTFTNARTTWHFNPPSAPHMGGCWERMVRSIKVAMAG FT IAEYPHHPTDEVLETVVLEAESIVNSRPLTYIPLDDMEQEALTPNHFLLYG FT VQGVVQPRSVCEIEGATLRDSWKLTQCLVDHFWKRWVHEYLPTITRRTKWF FT RPVKPLELGDLVLVVDENKRNGWLRGRIIEVKRGSDGQVRSAVVRTKDGTL FT TRPAVKLALLDVRTIDDQDIEERAPVLHGEE" XX SQ Sequence 6278 BP; 1757 A; 1511 C; 1571 G; 1439 T; 0 other; aacttttaag attaactgtt catcgagaaa gtccttcgaa gtaagtgatc ctgcgtagta 60 agcagattag acctgcgaag taagcagtac agtgcccatt tcagaacctc cactgcacac 120 cggaatccct cttttgttga tgatggtggg acatgctaag acctgcggtg tgtgcaccaa 180 cggtgggacg accgcctcga tgatggggtg tgatatgtgt gattcgtggt tccaccctaa 240 ctgcattggt gcaagcgaaa ccaatatcga accggacaag acctggagat gcagtcgctg 300 tgctacagat ggcgaagtta cagaaccggc aagtgcgtgc agtcttaagt cgaatcgaag 360 ttcggctagc agccaagcgc gtaaacaatt attgatacaa cagctcgaag aacagcgtgc 420 cctgaaactg aagcaacggg ccgaggagga cgaaatccgt aaacgacgag cggaagagga 480 cgaagcctac cttcagcaga agctgaacat tatcttggag gaagatgatg acgaagagaa 540 aatgagccgt ttgagcagtc gcgccagccg taagaaggtc ctagattggc taactaatga 600 acaatcaatc ggaaaagctc ccgttactgg ttcatccatt atcaacgtca accaaacgca 660 cggaactcca gtcacggtat ctgggctaca aatcccagga gctgtggtta caacatctag 720 tacgatggtc gtgccagcat ctacatcgac accgaattcg gaaaccaacg gagctccccc 780 acccgtttct catgcaatag tgccgacaga agcgactact acaatgttgg atgctaggaa 840 gcctatgtta gcatttccgg taactcgggc aggcatcacg ggttacaccg caccgccaga 900 gcaccaagtg agtggtacat ttggtaatat tgcaaatcat ccggaaattc actccacaaa 960 aacacgcgta agacctgcct tgccggcaat tccaactggt tcaaattatg cttttgagtc 1020 aatcccgtcg ataatgccgc taacgacaca attcgccacc ggtctatcac ttcgaagttt 1080 acctgctggt caaccatcga tgggtttctc ggctggcgct ccggcaattc aatcaaccac 1140 gattttctcc ggcgatgtgg catcacgctt taaccaaatt cctcctctat cagacccaac 1200 agccgcagta cattctcgac caactataat ttcgcaagat ccggtgccac caagaccaac 1260 accgagctca gtgacatacc aaccgaatca cgtttcacag cctggagttg gtgacgaaca 1320 gcagcagttg ggatttccat accaatcggt gcccactagt gcgcaattag cagcacgcca 1380 agtaatgccc cgggaattgc cggtcttctc cggtgatccg caggactggc cgttgttttc 1440 tagttcattc tataacacta cggcagcgtg tgggttcaca gacgctgaga atctggctcg 1500 actccaacgt tgcttgaagg ggcacgcgtt agactcagtg aaaagtcgat tgttgatgcc 1560 agaatcagtg ccccacgtta tggagacact gaggaaatta tatggacgcc cggaagtctt 1620 aattcacact ctgatgcagc ggttgcgcga agtatcatct ccaaaagccg agaatctcca 1680 atcaatcata aactttggaa tggctgttcg taacttggtg gaccatatgt tcgtggcaca 1740 gctcgtagac cacctccgaa atccgatgct gctacatgag ctagtagaaa aactcccacc 1800 gcaactaaaa atgcaatggg cgtggtataa acgtatgcaa accgacgtca acttagcgac 1860 gtttggtgag ttcatgaccg aacttgtgga taccgcatca gatgtaacac ttccatcgca 1920 aacacaactc caacagagtt taaacaaggc gggccgagac aaccataaac tgtatgctca 1980 cgcagaagcc gaaggtaatc caaatatgac agctactagc catcttagac gttcacaagt 2040 cgcagaatca agcaaacgag cttgctggta ttgctctgac gaggcgcacg aggtagctgg 2100 ctgcccgcag tttaagtctt tagatttgga tggacgatgg aaagtgattc gttcgaaggg 2160 cctttgtaga acatgtctta ttccacaccg tagatggcct tgtcgttcat taaaggaatg 2220 tggggttgat ggttgtcgac tacaccacca tgctctactt cattcaccag cagcgtctaa 2280 cgttgccgtc gaccaccatt tggccttgca gaatcatcac tcgacgctga gcatttgcct 2340 tttccgctat gttccagtga cgctcgaaaa taacggaaag aaggtggaaa cgtttgcgtt 2400 tttagacgac gggtgtcaaa ctacgcttat ggaaactgaa ttggcaacgg aactagaaat 2460 ctatggacca acggaaacac tctggcttgg atggaccgga aacatatctc gtgaagaaag 2520 gggttcacaa cgagtcaatg tgaacatttc cggtagaggg ctaaaaagca agtttaaact 2580 gagcaacgtg cggactgtac aacaactgaa actgcaggga cagacgttag attatgatga 2640 gttacagaag aggtatccac atatgcgagg tctcccactg tacagctacg tggatgcgac 2700 accaaggatc attatcggca ttgaacatgt gcagctcctt accactctca aagcacgaga 2760 agggcgacct aacgaaccag tggccgtcaa gacacgatta ggctggtgcg tatatggcaa 2820 gcaggcagga gaatgtgagt cagttgaacg gctgcatatc catacagaaa agcatattga 2880 gaatcatgag ctgcacgaat tgatgaagca gttttttgaa gtcgatgaag cagcagtagc 2940 ctcgccgatc gagtccgcgg atgacattcg ggcccggaac attctggaac gtacaacacg 3000 tagaatagaa ggaggatttg agactggctt gctgtggaag tatgatcagc caacctttcc 3060 cgatagttat ccattggccc tccacagact agaatcgttg gagaggcgtt tcaacaaaga 3120 tccagaattg cgggagcgag tggtatccct aatacatgag tatgagaaga aggggtacgc 3180 tcacaaaatt actacagaag agctagagtc aacagaatcc aatcgggtgt ggtacctacc 3240 tattggagtg gtgcggaatc cgagaaagcc ggagaaaatt cgcttgattt gggacgctgc 3300 agcacgagtt cgtggtctct gcttcaatga catgctgttg aaaggaccag acatgttgac 3360 ttcattattc gacgtcttgc ttcggtttag acagagatcc atagctattt gtggggatat 3420 tcgagagatg ttccaccaag tccgtatcat ctcaagcgat aaacaatccc agaggtttct 3480 ttttcgtaaa catcaatcgg agcagccaca gatctacgtt atggatgtgg ctacctttgg 3540 cgcaaagtgt tcaccttgtt catcgcaagc tgtgaaaaac aagaatgcgg aagagtttgc 3600 agcagaatac cccagagcag ccgaggccat catcaacgcc cactacgtgg atgactacct 3660 cgatagtgta gatactgtcg aagaagccgt gcaactggtg acagacgtca agcacgtaca 3720 tgcaaaaggt ggatttgaga tccgcaattt ttcgtccaat tcggtggatg tgctatagca 3780 gcgtggagaa gcacgaagtc tggaaaagaa gtcgttgagc atggatgtat cggcatacgt 3840 ggaacgtgtg ctaggcatgg tttggaaacc agccctggat ttgttcacct tcgacacggc 3900 actgaaagat gaactggaac aacttttgga gcagaatgta acaccaacca agcggcaggt 3960 actgcgattg gtaatgtcgc tgttcgatcc atgtggtttt atcgctcatt ttaccgtgca 4020 tggtaaaata ctgatgcagc atatctggcg aagcgggaca gactgggatg agcaaatagc 4080 tgacgattta cgtgaaatgt ggaaggattg gacacgactg ttgaagcagc tgatcgacgt 4140 ccaagttcca agatgtttct ttggagaagg tgaaagtaaa ctacccgcca ctatccagct 4200 ccatcttttc gttgatgcaa gcgaactcgc atatgcatgc gtggcctatt tgcgaattgt 4260 acaagccgag acagttcgtt gcattctggt tgcagccaaa accaaagtgg ctccactaaa 4320 accactctcc attcctcggc tggagctaca agctggcgta ataggctgtc gtctgttgaa 4380 cacaatctgt accgcattgg acctaccgat cgagtcattt ggactgattc agcaacggta 4440 ctttcctgga tcaagtctga cagtcgacga taccacccgt tcgtagcgtt tcgcgtgggt 4500 gaaatcttaa atagcactaa cgtcgacgag tggcaccatg taccaagcaa actaaatgtg 4560 gccgatgacg caacgaagtg gggcagtggc cctgatttca acccgaatag ccgctattat 4620 gtcggtgaaa actttctgta tctaccgatg gatgaatggc caaaacaaac caagcagaag 4680 tacatcacga aggaagagct tcggacagta ttccacctcc atcgagaaat accctcaccg 4740 ctgataaatg tgtcccgatt ctccaactgg aaccggttgg tgcgatcaac tgcatacgtg 4800 cttcgttcgc tgaagaagat gcgtggcgca aaactctttg gtgaattaac tagcgatgaa 4860 ctactacagg ctgagaactt tctctggcgc caagtacagc tcgaagtata cccagacgag 4920 tattattcgc tgcagtataa taaacaacac cctgacgaca cgccgaagtc aatcgataag 4980 agtagcccac tctaccagga ttccctgttc ttagacgatg cgggagtgat taggatgttc 5040 agtagaatag gggcggcacc cactgcccca tacgaagcta agtatccaat tatcctacct 5100 aaagatcacc tcttaacgtt gctccttgtc gacagctatc accgccgatt cgttcatatt 5160 aatcatgaaa ccgtgcttaa tgaggtgcgc cggcgattca gaatccctca actacgtcgc 5220 gtgatcaaac gcgtcgcgaa ttcttgtcaa cggtgcaagg tcaagaaagc tactccgcgt 5280 ccaccgatga tggcaccgct tcctgaattc cgactgacac cgaatattcg ggcatttagc 5340 tacaccggca ttgactactt cggtccgctc ttggtgaagg tgggccgcag cttggcgaag 5400 cgctgggtag cgcttttcac gtgcctcact acacgtgcag tgcacttgga agttatacat 5460 agtctttcaa caaaagcatg catcatggca atccgtaggt ttgtagcccg aagaggttct 5520 ccagcagcaa tctactccga caacgggaca tgcttcaagg gggccagtaa catattagcc 5580 aaccaaattc aaggcataca cgaagactgt gcagttactt tcacaaacgc aaggacaaca 5640 tggcacttta accccccgtc cgctccccat atgggcggat gctgggagcg aatggttcgc 5700 tcaatcaagg tagcaatggc gggaattgca gagtaccctc atcatccaac agacgaagtg 5760 ttagaaactg ttgtactgga ggcggaatct attgtaaatt cgcgaccgct aacgtacatt 5820 ccgcttgacg acatggagca ggaggcctta acaccaaacc attttctcct gtacggcgtt 5880 cagggggtcg ttcaacctag atcagtctgt gaaattgaag gtgcaacact acgtgatagc 5940 tggaaactaa cccagtgttt agttgatcac ttttggaagc gctgggtcca tgaatacctg 6000 ccaacaataa ccaggcgcac caagtggttt cggccagtca agccattgga actaggcgat 6060 ttggtactag tggtggacga gaataaacga aatgggtggt tgagaggacg gatcatcgag 6120 gtgaagcgcg gaagtgacgg acaggtgcgc agcgcagttg tgcgcaccaa agatgggaca 6180 ctcaccagac cagccgtcaa gctggcactg ttggacgttc gtactattga cgatcaggat 6240 atagaagaga gagcgccggt actacacggg gaggagaa 6278 // ID PAT repbase; DNA; INV; 5514 BP. XX AC X60774; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.redivivus PAT retroposon. XX KW PAT; PAT retroposon; Repetitive element. XX OS Panagrellus redivivus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Panagrolaimoidea; Panagrolaimidae; Panagrellus. XX RN [1] RP 1-5514 RA de Chastonay Y.; RT "PAT."; RL Direct Submission to Genbank (04-JUL-1991)Y. De Chastonay, Inst RL of Zoology, Univ of Fribourg, Perolles, 1700 Fribourg, RL SWITZERLAND. XX RN [2] RP 1-5514 RA de Chastonay Y., Felder H., Link C., Aeby P., Tobler H. RA and Mueller F.; RT "Unusual features of the retroid element PAT from the nematode RT Panagrellus redivivus."; RL Nucleic Acids Res 20(7), 1623-1628 (1992). XX DR GenBank; X60774; Positions 3890 4194. XX SQ Sequence 5514 BP; 1448 A; 1419 C; 1515 G; 1132 T; 0 other; aacggcaggt tttgacggac gcttccctcg ttgggagcga aggtgctgca cgcctccgag 60 cgcccaaggc gtcctggagc tgaaagtcct gctgcagtcg gtcgaggaag caggtgaaac 120 ccggctagtc gtcattctag cccgggtttt taggtacgga gtggttgtcc cactcagctc 180 ttaattttcc attgttttcc ccctttttcc ccctaatccc acgtcttatc gtcccagtac 240 cccaggacgc taatagctag ctgcaaggac tatcgagaga ccgagtccga cccaagtttg 300 cctcgaccat agtcgggagg tgtacgaaat cggttggaaa accccagagt ttctctgaga 360 ccgattttat atataatttc ctgccccacc agtcccccat gtacccccgc aggagcaacc 420 ccagacaccg tctcggtcag tacccattca aaagagtcct agcgatctcg ctgacttcaa 480 ttctcggaag tggcggagct cttttgccag ttttatatat aattctaccc gaaagtgcat 540 cagccccctt cccagtccca acccaggccg ctacagggca tattcttccc cctgtcggtc 600 aaggctatcg tttggccgaa gaaaagccga ggaacctgtc cccatttcta ttgatcctgt 660 atatataatc ccgccgtcgc accgcagcat ccaaacccat cccaaccccc atttacccct 720 gtcggtgcga cctgtcttga ggagaagtcc tttccacagt tggtggacga gaaactgagg 780 acgtgggaaa cgagtatacc ccgatcaagg gaagccccaa gccagaagtc ggtctgaggt 840 cacggtgaat tcccgcatct ctctgcgtgt cgtggcggag gaagagtgac gagtggctat 900 atagcccggc gtggccgatt ggcctgtagt aggcaggtat agacctggcc gaagagtgtt 960 tctaccctcg tgacgtgtcc ccaaagggta aagattgtgc gtgtagctta tataagcagc 1020 tcgtgaggag cgtactgtac aagagtactg acagttggtt ggtgttagag gtaaccagcc 1080 aagatggaga agatatccga ggtcttcgag aaactgccca aggaggtctc tgctgccatc 1140 caggagttat cgcccgatgc gatcagggag gtgatcggtg gactcgtcga ccccgagagg 1200 gctcaagcct acctgaaggc agaatccgag aagctcaagc aggcggcgaa ggaagctgcc 1260 gagaccgagc tttgcggccg tgaacaagga gaatgcccag agccaaggac ctcgatcaga 1320 atcggcactg gcaaggagcc cgacgcggcg gcggagaatc gcgagttggt ggccaagtgc 1380 gcggccatga tggccatgat gaccaaggct gcggccgaga aggaaacccc acgaacttgg 1440 tcgacgaaag cttatgagta ccagcacgtc ttcaatgcgt cggtgaaggc ccttatccag 1500 gcgaaggaat tcgataaggc tgtcgtgaag ctcgacgcta gaaatgccga cctggcattg 1560 gctgacaagc agcccggtat cctcgctacc ctggacactg ccaagtccta tgctcagggc 1620 tcaggagaat ccatgtcgaa cgctttggtg agttttttgc ttatacaagc agtaaatgac 1680 cgagatactt gcagctcgca gcgatcattg ccaacccgcc aaggcgtctg gaccggcgac 1740 cgaagcaagc agcgtggtgg atacaagcga tacggccagc agccatcctc gtctacggct 1800 ccccccgcca aatcattccg ctgtttcaag tgcggcagaa acggtcacta tgcctcatcg 1860 tgtcatgcca agaccaccta ggtccccgga ggccatgaat tcccaagtcg atctcagcac 1920 atgtccccca gatatgaacc attgctcgtt gttaaataaa gttgttgtcc aaaatccgtt 1980 tgattctctg gcaagaagtg tgcaggggtg ggaaagtatt ggacacccca gcgaattttg 2040 aaatccatta gtgaaggata tcggctacca tggagaggcg agcagcgccc cgaggaacaa 2100 agaccgaatg cacaatctac catggagcac gcagcattcg tcaccgatga ggatgaaaag 2160 ctgtgctcaa cgggagcggc ggagttagtc ccgagtgaaa ggttaggcga cgtcaaagtg 2220 atcagtgctc tatcggtttc ggtgaacgcg gatgcgaagt gtagattggt catggatctg 2280 accacggtga acccgtatat cacagctaac aaaatcaaac tggagaatgt agccatagca 2340 aagtccctca tccccaagtc aggtttcatg cttacgtttg acatgaaaag tgggtaccac 2400 caagcaagaa tggctgactc cgagttgatc tatctggcgt tcaggtggga aggcaagact 2460 ttttggatga aagctttgcc atttgggtta tcctcggccc ccgaatactt caccaagttg 2520 ttccgacacc cgctggcaac gttgagggga gatggtgtga actgcctttt gtatctggac 2580 gacctattgg tgtggagtga aacctacgaa ggcgcttgtg aagcttcagc caaagtaagg 2640 gcgcttttcg ggaagctagg tgtggttctt aacaacgaaa agtcttcggt taccccgcaa 2700 agagaggtga aatggttagg ggtggtcttt aacctgactc acgggacact caaaatctcg 2760 aaaaacagaa tcgagaacgc cttggcggca gcggccaggc tgctgaacag gaagcgccca 2820 tcggcaaagg acaggctcaa gtttacgggt gcattaaact cgatgcacga tgttctcggt 2880 cccatggcag ccatcagaac aaagtctctt ttctgcttta ttgcatcggt aacccccagg 2940 ctgggggtga gattagccct gtcagaaagg gagaaagcag acatcaaata ctggcagcga 3000 aatctggtcg agaggaatgt ctggagaata caagatacca gaccatcgga atatgtgttc 3060 gcaacggatg catcggcgac aggagtaggc gccgtcaagt tgaacccgaa agatctcaca 3120 gagctgtctt cagcctatcg ggaattcgat gaatacggag ggaacgacct cgagcaccac 3180 agggagttac tggcagtgca gttcgcatta catcattact tggcgtcaaa gaagaacacg 3240 gtagtgacag tcaggactga caatcagaat atcccgagga ttctagctaa gggttcgggt 3300 gtccaggagt tgaatgagct agtcctgcag gtgacagaat ggtgtgaaca aaggaaagtt 3360 gagttgatga cgacttggat cccaagggca atgaacagtg cggcggatag agcgtcgagg 3420 gaaacggacc cagatgattg ggcaatctcg aaagaaatct ttgaaaagtt gacggcaaag 3480 ttccagaagt gccagtgcga ccggttcgcg tcccataaaa caaaacagct agacaagttc 3540 atgagcagag ttccgtgccc aggatcagct ggagtcaacg cgttcgcgta ccagtggaca 3600 gactggtcaa gctggtgtgt acccccaccg gccctactgg tgaggacatg gaagcacatt 3660 gaatcacatg cctgcgaggg gttgttggta tcgccagatt ggccggcaaa tgtagtggct 3720 acggcagcaa gcagggcggt aaggaaagga ttcgcgaagc tagtctacag aatcagagcg 3780 ggcacaaggt gcataacgcc tccggccttc tccacaggtg ctttccagac cccctacgcc 3840 caatcggacc tccttgtcta ccggttcaac acctttccac ggttctagct atctgtttaa 3900 cagattgtat cctaaagaaa tcccgatgcg cgtcccccca tttgagtgac aggagcaatg 3960 gggtgcgaag gagagggagc aacgtagccg cgagggggtc acggtcgaag ttaaaacgat 4020 agttttcacg cagtgtaacg cagtgtaaaa cggcaggttt tgacggagca gcagttccct 4080 cgttgggagc gaagagcaga cgcctccgag cgcccaaggc tcctggagcg aaagcctgcg 4140 cactcggtcg aggaagcagg tgaaacccgg ctagtcgtca ttctagcccg ggttcagcgt 4200 gagccgcgag taatgatcag attaccgaga cagaagaaaa gaagcaaaaa actgatattc 4260 caatcacaat tcaggtatgg attcgacgaa catccggaat gagctagcac tgacgctatc 4320 ggaagcggag gcaaaggatt tggcggaccg tatcgtatca tcagtcagag caccgtcgac 4380 cctcggcctg tacacgccca tcatccagcg atatgaggaa ttcgcagcca attcatctgc 4440 gccgtacagt tcatcagtgt tcgcattcct ggcaaagaac tacctaggca agtcgtcatt 4500 taggacagca gcagcggcat tgaggcttca ttttgtggcg aagggagtct cacctagccc 4560 gttggacagt gatgccatgg caatgcttag agccgcagcg agccgagaag ccgaagccag 4620 tccacaggag caaatggctc acagcagacc tgattggcct ggcaaaccat ctcaaaaact 4680 ccaattccgc cccggataaa aggttgcgta cttggttctc gtgaattacg cgtcgttcat 4740 gagaccatct gaaggtgtcg ccgttagggt ggaagatgtg gcgtttgagg gcaaccaaat 4800 gcagattcga atccagaaga cgaaaacaaa ccacaatggc caccgcaaat agcgagttgt 4860 ggatgcgcat ccggatcacg attgctgccc ggtcaaggcg gtgaaggaat ggctagcaga 4920 tcccgctaga aaggcttcag aatggttgtt cccgaacttc aacctcgtca cccaacacat 4980 caagctggac cgtgcccaaa gcgaaatccg taagttgcgg tcccaaggca tcatcccgga 5040 aggtttcacc ttgcacggcc tcagaggcgg agccaccaca gcatgcatcg aagccggtat 5100 cccgatcgac gcagtccagc gagccggaag atggtccaat cccaactcca tgaagccgta 5160 catcgagagg acgacaagaa ccctccaagg aaccaccaat gcaattgccg caggcctccg 5220 actcgcacaa gacgacgcca gcaaccatcc caacccgaca gaagaatgat gaatccatgc 5280 gcatcagaag aatctcttaa tatactgaat gatttaatgc catgcttttt gtcgtaatta 5340 tcaatgctaa gtgtaatcac gtatacgttt aacagttatg taccgaaata aacccgtggc 5400 gccccccatt tgagtgacag gagcaatggg gccgaaggag agggagcaac gtagccgagg 5460 ggtacggtgc gaagttaaaa cgatagtttt cacgcagtgt aacgcagtga aaac 5514 // ID BEL-12_DPu-LTR repbase; DNA; INV; 318 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_DP_; KW BEL-12_DPu-I; BEL-12_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-318 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 318 BP; 69 A; 97 C; 75 G; 76 T; 1 other; tgttgggaac gaaagacgtc cccaacggcc gacgacgaga tccggccgaa cgaccaggat 60 tccggcccac ttacgggaat acgatcttcg gggcccccga taggtggccc cttgtcaccc 120 gcctgactgt accgattctt tgtgcccccc cccctggtgt caatgtaatg ttccggaggg 180 ggtcgggccc tctgtcttaa tctatcaatc gagtcagtct caatacaagt caggtcgagc 240 aatctcaatt cagtgtcatc ttcattcatt tcatctaata ttcggtaagg naactctgac 300 acgcttccct gccgaaca 318 // ID Gypsy-37_OD-LTR repbase; DNA; INV; 231 BP. XX AC CABV01004218; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_OD_; KW Gypsy-37_OD-I; Gypsy-37_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-231 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004218; Positions 725 495. XX SQ Sequence 231 BP; 69 A; 49 C; 48 G; 65 T; 0 other; tgacgggtcg gaaatgccct tgttagattt agacctgata gttttagact ctaggtcaag 60 tagttgagtt tagactccat ctaggtcaac ctgacctaga ggacactata tatacaacca 120 catttcctgt acacttacga ttcaacttat cggatcactc gagaggaata aactactttt 180 attgagatcc atgtgttaga atgcataagg cgacgcagga ccgaacctgc a 231 // ID ORTE-1_AAe repbase; DNA; INV; 9510 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-LTR retrotransposon family encoding cysteine protease from DE Aedes aegypti. XX KW Non-LTR Retrotransposon; Transposable Element; ORTE-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-9510 RA Kojima K.K. and Jurka J.; RT "A lineage of non-LTR retrotransposons encoding an OTU cysteine RT protease from the yellow fever mosquito."; RL Repbase Reports 11(4), 1124-1124 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >98% CC identity. CC This family encodes OTU superfamily cysteine protease upstream CC of apurinic-like endonuclease. It is positioned at the sister CC lineage of the lineage including RTE and RTEX in RTclass1. XX FH Key Location/Qualifiers FT CDS 580..5331 FT /product="ORTE-1_AAe_1p" FT /note="OTU cysteine protease, endonuclease and FT reverse transcriptase." FT /translation="MDLFNXDINDFQTAVVQKIPAGSFITAIMLQINKINS FT EFKNHQLNEMEFCCLIADFIRCNDHLFSKMWEEDIKEVVHKIFSMESGVEV FT HMAQAIVELFQTKICFYQKKREVEVIQPNSCSNTNMLTEINILFTFEDGTE FT KFYSIQSITDKKSSGKKESGYNEQPRNELYKVKEYESGREMMVKKIKGDGN FT CFIRAIIDQMNKSEDYYRCNLSDIIELRNKIANHMNKNRERYEMYLIESLE FT ENGFLKFIEDIRKPGVYLGHESMVAVTEIFDKCILLYQTDSETLRIGEEKD FT REEDVLRLLYTGENGVKNHFDSIRIVKGQTEENTGTEKTSVEEVKEKESYT FT SMHGKHNDDSEEISTIRYRKTQDTPITGKQIKETGKLVQEEGKQWQWNKER FT NNQKKSDDKKAVEKNMDERGNETRKITREKHKPINKDRAITKDFITDNRKE FT EILDLEEKRRRQRSKEREDRLEQLATQIKTNEERSMLINGSQRQGWEENDS FT QERLDREKQAKTDKNTRGRIQMDVQGKTEEPLSIVNDEQVRKENNTKDVKL FT GDKKENPRISTKQRNDRKRAKNERVDEEGVQENSDTMDGGTKKHRNHVAEE FT RGETRKGQVIESEESRATGVKATGEIGEENEEVKESIMLIATLNIRGCSKL FT EKRQEIDNMLESFNIDVAALQEVNVKAELITTRNYEWRIGGQGQNKSRGLA FT ILIRKDKGIDVMEWKEIGKYGLSISLLSKGRKIILINVHGPNKNGYIFYSS FT LGKLISQDHLRSNLIMVGDWNAQIGKDSVMPEDAECIGNRLGFSNNNDNGE FT EFKMFLTIHKIKNMSTVIGKNTDITWKSGKKESQIDHVLKPREGKIEIRFI FT KGYWTGINTDHKMIVTGIRTREKENKEKGKKKIPIDASVLKYDTIKEKYQE FT ALKKYEIRQEKGNTIEXDFKAVAGKMKKAANEVLRSARAPLTPIRKAALNK FT LKTAINLSNKHPDIFPYRWKLKDRRMEFQAAIRSHNERKIKKFYKELNDFD FT VAVRIKKSYQFLKGFLKRKKHKNVYIPMSQWNEALKESEGPEIQLMDEHDT FT CPVTVPPKEEEMMQIVQQQCNGKSAGADGIRMELLKYADKQTQQELADIWK FT RVWIENHLPEDMEKTIQVPIPKNAKAKGVQDFRRISLCNVAYKPYAKWIKQ FT RLREFTGEPDINQAAFTEGRSTDDHMFVTRRILEEYWNAGNPLYVAALDIS FT KAFDNVSLLQLRSILAKLNVPSHLIDRVLQCIRKEKIRVRWQNQYTTECNR FT GKGVKQGCPLSPFLFNIVMQNVVRKVKEKVPELNLMNTGSLKLPLILIFAD FT DILIITKDEEGLKRILIALEESLREVGLEINNDKSQILIRAPNATKRPPNT FT INLNGRNYDVKTSMRYLGAWLTDTLNRPATTKQRCVNAAKVSKLVVEFCKK FT FKPTWEIGKLIYNTVIAPSILYGTKTATLTKRSRVQLGKYEKQILKDIWNQ FT CRKTDERKFNVRKELKGKTINRRVRVNRISYYGHIQRREETHPIKMAMKLK FT FNKKKHGRPSFTWKKSLEQDFNRYNGITEEEWSQIAQDRDKIKKKAEEIYR FT NNESEISDGEDSTEQDM" XX SQ Sequence 9510 BP; 3679 A; 1317 C; 2015 G; 2497 T; 2 other; acggattaat ggtgtgaatg tgaacgggag gtttcgaaag acaaatgcat gaacggagct 60 agcgagtgat aaatcgggag cggggagacg aagtgagagg ttggtgcgag agagacgaag 120 tgaaaggtgg gtgatagtaa agtaggtgag atgtgtgaga ggtaaatata tagacaatgg 180 agtgaagtga ccattggatt attagtagta gtgtaggtca atcaagaagt ggtgaaggtg 240 gtagtggcga tgacggttag gatgccatag ataatatata aagcgacttg gtggtgttga 300 agaaatgaga agggtgtaga gaagagcgga aaagaaaggt gaattgaaac acaggggtgt 360 tcaactgaat tgggctacgt gacaggattg gagcgataag agaaaaataa gtgaaaggta 420 ttaattattt aaaactaagg taagacgact gatgactgga atgcattcta atacactgat 480 gaattaaggt aagactagat ataagcaaat atatattatg agttttcttt ttctttatca 540 attatagcat aacatttata actacacaaa gagtacaaaa tggatttatt taatwttgat 600 atcaatgatt ttcaaactgc tgttgtgcaa aaaataccag caggttcttt tatcacagct 660 attatgcttc aaattaacaa aattaattct gaatttaaaa atcatcaatt aaacgaaatg 720 gaattttgtt gtttgatagc ggattttatt aggtgcaatg atcatttatt ttcaaaaatg 780 tgggaagaag atataaaaga ggttgttcat aagatttttt caatggaatc gggagtggaa 840 gttcacatgg ctcaagcaat agttgagctt tttcaaacga aaatatgttt ttatcaaaag 900 aagcgggaag tagaagtgat tcaaccaaat tcgtgttcta atacgaatat gcttacagag 960 ataaacatat tgtttacgtt cgaggatggc acagaaaaat tttatagcat tcagagtata 1020 acggacaaaa aatctagtgg taaaaaggaa agcggataca atgagcagcc tagaaatgaa 1080 ttatataaag ttaaggaata cgaatctgga agagaaatga tggtgaaaaa aattaaggga 1140 gatggaaact gttttataag agcgatcata gaccaaatga acaaaagcga ggattattat 1200 aggtgtaatt tgtcagatat aatagaatta agaaataaga ttgccaatca tatgaataag 1260 aacagagaac gttatgaaat gtacttgatt gaaagccttg aggagaacgg ctttttaaaa 1320 ttcattgagg acataagaaa accaggagta tatttaggac atgagagtat ggtagcggta 1380 acagaaatat tcgataaatg tattttgcta taccagacgg attctgaaac attgaggatt 1440 ggagaagaaa aagatagaga agaagatgtg ttacgattac tatatactgg agaaaatgga 1500 gtgaaaaatc atttcgacag catcaggatt gttaaaggac aaacagaaga aaacacagga 1560 actgagaaaa caagtgtgga agaggtaaag gaaaaggaaa gctacacaag tatgcatgga 1620 aaacacaacg atgattcaga agaaatctct acaatcagat atagaaaaac tcaggataca 1680 cccataacag ggaaacaaat taaggaaaca ggaaaattgg ttcaagagga aggaaagcag 1740 tggcaatgga ataaagaaag aaataaccaa aagaaaagcg acgacaaaaa agcagtagag 1800 aaaaatatgg atgaaagggg aaacgagaca cggaagataa cgagagagaa acataaacca 1860 attaacaagg atagagcaat aacgaaagac ttcataacgg ataatagaaa agaagaaatt 1920 ctagacttag aagaaaaaag aaggagacaa agaagtaagg aaagggaaga cagacttgaa 1980 caactcgcga cacagattaa aacaaatgaa gaaagatcaa tgttaataaa tggaagccag 2040 aggcaagggt gggaagaaaa cgacagtcag gaaagattag atagagagaa acaagcaaaa 2100 acagataaaa ataccagagg aagaattcag atggacgtac aaggaaagac agaggaacct 2160 ctaagtatag tgaacgatga acaagtgcga aaggaaaata acacaaagga tgtaaagcta 2220 ggggataaaa aggaaaatcc aaggatatca acaaaacaga gaaatgatag gaagcgagct 2280 aaaaatgaaa gagttgacga ggaaggagtg caggaaaatt cagatacaat ggatggagga 2340 accaagaagc ataggaatca cgtagcagaa gaaagaggag aaacaagaaa gggacaagta 2400 atagaaagcg aagaatcaag agcaacggga gtaaaagcaa caggagaaat aggtgaggaa 2460 aacgaagaag ttaaagaaag tataatgtta atcgccacat taaatataag aggatgctcg 2520 aaactggaaa aacgacagga aatcgataat atgttagaat ccttcaatat agatgtagcg 2580 gctttacaag aagtgaacgt taaggctgag ctgattacta ctcgtaatta tgaatggagg 2640 ataggagggc aaggacaaaa caaatctaga ggattagcga tactcataag aaaagacaag 2700 ggaatagatg tgatggaatg gaaagaaata gggaaatatg gattatctat aagtttgtta 2760 tcaaaaggta ggaaaattat cttgataaat gttcatggcc caaacaaaaa cggatacata 2820 ttctactcga gtttagggaa attaataagc caagaccatc taagatcaaa ccttataatg 2880 gtgggtgatt ggaatgccca aataggaaag gactctgtta tgccagaaga cgcagaatgt 2940 ataggaaata ggttaggatt tagtaataat aatgacaatg gcgaagaatt taaaatgttc 3000 ctaacaatcc ataagataaa aaacatgtca acggtgatcg gtaaaaacac agatataacg 3060 tggaaaagtg gaaaaaagga aagccaaatt gatcatgttt tgaaaccaag agaagggaaa 3120 atcgaaatac ggttcataaa gggatactgg actggaataa atacagatca caaaatgatt 3180 gttacaggta taagaaccag agagaaagag aataaggaaa agggtaaaaa gaaaatacca 3240 atagatgcat cagttttgaa gtacgatacc attaaagaaa aataccagga ggcactaaaa 3300 aaatacgaaa taaggcagga aaaagggaac acaatagagc wagatttcaa agcagtagca 3360 ggaaaaatga aaaaagcagc aaatgaagta ttgagatcag caagagcacc actaacgccc 3420 attaggaagg cagcgctaaa taaactgaaa acagcgataa acctctcaaa taaacacccg 3480 gatatttttc catatagatg gaagctcaaa gacagaagaa tggagttcca agcagcaata 3540 cgatctcata atgagaggaa aataaagaag ttctataagg agttgaacga ttttgacgtg 3600 gcggtaagga ttaagaaatc ttaccaattt ctcaaggggt ttttgaaacg taagaaacac 3660 aagaatgtgt acatcccgat gagccaatgg aatgaagcat taaaagaaag cgaaggaccg 3720 gaaattcaac tgatggacga acatgatact tgtcccgtga cagtaccacc aaaggaagag 3780 gagatgatgc agattgtgca gcagcagtgc aacggcaaat cggccggagc agatggcatt 3840 cggatggaac ttctgaaata cgccgacaaa cagacgcaac aggaactggc tgatatatgg 3900 aaaagggtat ggatagaaaa tcatctgccc gaagatatgg agaagacgat tcaagttcca 3960 attccaaaaa atgctaaggc taaaggagtg caagatttta gaagaataag cttgtgtaac 4020 gttgcttata aaccgtacgc caaatggata aagcagagat taagagaatt caccggggag 4080 ccagacatca atcaagcagc gtttacagaa ggaagatcaa cggacgacca tatgttcgta 4140 acaagaagaa ttctagagga atactggaat gcagggaatc ccttatacgt ggcagcctta 4200 gatatcagca aagcatttga caatgtaagt ttactgcaat tgagatccat attggctaaa 4260 ttgaatgtac catcccacct tatagatagg gtattgcagt gcatacgaaa agaaaagata 4320 agagtaagat ggcaaaacca atacaccaca gaatgtaata gagggaaagg ggttaaacaa 4380 ggatgcccac tctcaccttt tctgttcaac attgtaatgc agaacgtagt taggaaggtt 4440 aaggaaaagg taccggaatt aaatttgatg aacacaggat cacttaaact accattgata 4500 ttaatatttg cagacgatat tcttattata acaaaagatg aagaaggact aaaaaggata 4560 cttatagcac tagaggaaag tttaagagag gtaggactgg aaataaataa cgacaagagt 4620 cagatattga tcagagctcc caacgcaaca aaaaggcctc cgaacactat aaatcttaac 4680 gggaggaact atgatgtaaa gacatcaatg agataccttg gcgcatggct gacagatacc 4740 ttgaacagac ctgcaactac caaacaaaga tgtgttaatg ccgcaaaagt atccaaattg 4800 gttgtagaat tctgcaagaa atttaaacca acatgggaaa ttggaaaatt aatatataac 4860 acagttattg caccatcgat cctttacggg acaaaaacag ctacactgac caaaagaagc 4920 agagttcaac taggaaaata tgagaagcag atactgaagg acatttggaa tcaatgcagg 4980 aaaacggatg agcgtaagtt caatgtacgg aaggaactta aaggaaaaac gataaatcgt 5040 agagtcagag tcaatcggat tagttactat ggacatattc aacggagaga agagacacac 5100 cctatcaaaa tggcgatgaa attgaaattt aacaagaaaa aacatggaag accaagcttc 5160 acgtggaaaa aatcattaga acaggatttc aataggtaca acggaataac agaagaagag 5220 tggagtcaaa tagctcagga tagagataaa attaaaaaga aagccgagga aatatatagg 5280 aacaatgaaa gcgaaatttc tgacggggaa gatagtaccg aacaagacat gtagtaaata 5340 attgaagaag tatcaaaaaa aaaaaaaaac atacacaaaa acctacaaaa tttacacaca 5400 cgcataaaca tattggaaaa taacacatat gagtatgcac aaatacgaat tatacatata 5460 aaaagatacg aaaacaaata tacatacaca aaaacatatg ataactcaca cgaaacacta 5520 atacactcaa tcactcataa acacatacaa aataatcaat aatgaaaatt actctgaaaa 5580 tacacaaaac atagatacta aaataaacac acacataaat tacacataaa cacatataca 5640 ggattgaaca gaacatttgc aacttttccg gttttccata cggaatggcc ggttttggta 5700 ggtttgatct cggttgtttg tgggccgaat tgagtgagat tttcacggag cattggattt 5760 cggcacatgt ttttgaatgg tttttccgat tatgggtttg gatgtagtgg cggtttgact 5820 gaagtgaaat ttgaaattta atacccctac acaaaaactg tcttcagcaa acttgttcat 5880 agcgtcaaaa tagacaactt tgctgaaaat atcatcaagc tattctgtta acgtgttgag 5940 ttatgtcaat aaaaaaaaaa tcgtaataag ggtcgctttg gtcaaaccat taatgctttt 6000 gaactcgtgg ttggggggat cagtcggagg catgtgttga ggttcgggtt gtgcctggtg 6060 ttctgtgggg gtttcattcg ggtcggtccg tggatagctg gggtctggct tgccggggtt 6120 ggtcgtttcg tgtggggacc ggggaggttg cgagtgtttt gtcaaatact gtaaatacat 6180 acaagacaaa ataacacaag cacatgtata tatatatgag aacacatgca gaaatagaca 6240 ttgacttata acacaaatta aagctgtttt aaaaagataa ttactcaatt ttaataataa 6300 acacatagca ttagtacaag aatgacatgc agtcatcagt cgtttgagtt ggtcaaaaga 6360 aaataaaatt gttcgattta aaaaatgcaa ggaatttgaa tatcatataa ttaactagaa 6420 ataataacta cacttgttta ctttcgagga atgattagaa tatctcgcgt tcggtatcgt 6480 ggctgtggtg atcggtttca cttcgtcgat tttcctttta actaattact tggccgggcg 6540 actgttttag actgctattt agattgagat tgtccattgt catgctaaat cgtaaagtgt 6600 taccaaaatc tattgaactt cgtgtattaa cccttactcg acgaagagag atcgatcact 6660 gcgggtacgc ctgtatgatg ctgaacgcaa gttaagaata tagaaaactc ttctcttcac 6720 ttttgcatgg gatgttttag attagcagtg tttgttcgtt ttactggtga ttttagtgat 6780 gtcattgtgt tgtgtgggtt tttcttttat agccagtaga ttgttgcaga gcgcccaaag 6840 gggttcagat tatgatcata gtggaattta ttattttctt gttataaata atataagaat 6900 ataaagaata attacaatta tcttcattgt acaatgcaaa tcataaactg aactccttaa 6960 taataaatgt ctagttattt tttatatgtt aaccaattag aagttttgaa gtagtacaat 7020 aaagtagatt agtaaatgtg tatatgccga aaaaaagtct taatgccaac caaagttata 7080 acaaggttca gtcgtagcat taaaattcag aataacttgt catgccgact gaatgttgtt 7140 tataaaaatc aaagaagaat aatcataata ataaaaataa acatggtgga gggaaatgat 7200 agcaacgaga agcatcataa taattctatg agaaatttat agcaggttat aaataaatat 7260 gatgattatt tctactgtta acttattatt caaatcttta aaaatagtta tagaatgaaa 7320 cattgctatt gcctgatatg aaattaaaat aattgaaaaa acattaataa aatcaataat 7380 tagttttgaa ttcaaaaaaa aaaataaata aatacaaata ttaaaaataa taaagataca 7440 taatgatttg gatgatgtga gggaggatgg gaggtcgttg ctatcatttg tctgtatgta 7500 taaacataaa ataaatgtta aaataaacaa gtatgtataa ttttagtcgc tattaactga 7560 tggatatact ggtaagaata catattcgtc ttgtttgcga gcagtgaaca gtaaattata 7620 catctattgc agtttaacct gatgaaacta cagagtataa tgaccgttac ataaccgaat 7680 tatattaaca ctagtatgtt aaattcaaat ttctaatgaa ttaaatttca tttctattaa 7740 atttatagag aacaaaatag gaataatcga tggatcgatg ttggagctgc aaacagacta 7800 aagctgcata ttgcacacgg tcctctatgg agttatattt cataatttct tctctctata 7860 tataaaacat tttaggggca gcagcaggaa ggtgtgagca tttaaatagt gttaaaattg 7920 aagctctgaa ccgtatttct cttaggtacg aaaagcgtta aaaacaaaac gtaggatgta 7980 cggagtatca taatggttat attatattgt agtttattgt acgagacctt ttggaacaat 8040 aaatcgtcaa ttgggaaatt cggatatgaa gtggtgttgc acgaaagtaa catcagtagc 8100 aacagtgttg tcgtttcata caagggacta gtgtgtaatg ttattttatg ttatattgtg 8160 tagcgttgta ctatatctac tgttcatagg taataaatag gtatacaaat aattcctttg 8220 gcgcataaaa taacagataa ttcttattcc ttatagctac tagtccagtg gaggtgcacg 8280 aattacttat ttataattgt tttgttatac taattcattg gcagaaattc cagaagcgtc 8340 atggagtcat cggagatact aacataatca ctgctattaa tatttcagca gttctattcc 8400 gaaaactgta attcgtcctt actctggaga ctacctaacg aggttcaatt catcacaatt 8460 taagaatgga aattctgata catctatatt catatcatat acaactataa attccattct 8520 atatatttta ttatagttat ttcatttatt atatgctggg tattgagagg gtgcattagg 8580 gcatcagaga agtcacagat ggacgatgta gtagggtgtc agccgaaatc taggagaatg 8640 tgccgtaaca tccttacact acatgctgct acttataaat gactttaatg taattatctt 8700 tgtttaatac agtaggctat agcgaatgaa acgaagcgtc ggatgtatta ttattttgtt 8760 tgtactccat gaacatatgg aagctggaat tttgtttttg atgtatccgg aaactttatt 8820 aacatatcac tgattttttt atttttattc aaattatttg tttattctta tcattcttct 8880 gtcgaccagc ctaggagcat ctgtagcaca aatattcaaa agaaaattat atgtcggccg 8940 atataatact gcatgctgtt tcactgcgtc ataatttatt gtaactacca ttttattttc 9000 atattttatt atcatattat tatatatata tacattataa tcttatctac tatataacat 9060 atcaatataa ttgaattttt catgatttgg gaagggaggg agcatcaggg catcattgaa 9120 attacaactg gacaatgaag tggagtgcca atcggagtca ggaggaaaag tggtagtcgt 9180 tcacgtctag tacttttctc tccaacagtt gtgaaagatt tgacgcgccc ttcttctaac 9240 aaacctttcc gtggaaagct cgacaagtaa tgcaaattct aatactcaat ctgttggatc 9300 atatggctgg aaggagattc tctatttccc gcgggtttcg cgtaagtgtt tctacttacg 9360 ggtccgccac cccccttttg gcccttcata cagcgatatg ttgaatgcag attggtaata 9420 aaatttgacc ttactgttga gtggagccgc atgcagagca agactcaagc aaggattggt 9480 ggctacctac atacatacat acatacatat 9510 // ID Gypsy-29-LTR_NVi repbase; DNA; INV; 409 BP. XX AC . XX DT 12-MAY-2009 (Rel. 14.05, Created) DT 12-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-29-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-409 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 995-995 (2009). XX DR [1] (Consensus) XX SQ Sequence 409 BP; 87 A; 126 C; 109 G; 87 T; 0 other; tgtggcgcca cgcactggac agcgtcggcg ccagcgtaag cagacgctgc agccgaggcc 60 tgacttcgac tagcggagta ccgccggagt cagcctcaag atgtcgtccg cccgacgcca 120 cgctgtggcc agcggggagc acgtacccgg tcgccattct ggaactggcc gccagactga 180 gcacacgctc aattttcttc agtattacta gttaaaataa agtcttcaga gttgagtaat 240 agtacatcgg agtgtgaatc ttttcctcct gtaccctcct gtgcccacgc gactcgcgtt 300 cgtcagtgat acagttaaat tctaaagtgg tgaccccgtt cagcagcacg cttcggagtc 360 gggcacagcg cgaagaaccg acgccatctt cgccgcgtca tctcactct 409 // ID BEL-34_AA-I repbase; DNA; INV; 6681 BP. XX AC AAGE02019757; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-34_AA_; KW BEL-34_AA-LTR; BEL-34_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6681 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019757; Positions 34878 28198. XX CC Positions [5771-6328] - Integrase core CC 'CTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 35..6679 FT /product="BEL-34_AA-I_1p" FT /translation="MNRKSIVRQTQLRTQDGGEGTPNPQNDVDLRENASVM FT SSEANFEASTIDDERANERDCGVCNRPNKAEWFMVQCGDCGKWYHFSCAGV FT THKTVHSKPFSCASCVPRSIPAATSIAERTSTSSSRKAKLVRDMQRLEEER FT ALEEKIQQATLKKLEREREYIARKYEILEQQDEADMSSVHSGRSSRTSRVK FT EWLLDQNNAVEVPRAVTDDPVVEDGENSHVGTLVNRDNVQITSTPLKTGFC FT AIGTNQKGGESRPASSIGGVADSDMEKTFGSITINGNGRPTKAGVTENVLP FT LVDVQSLADLLENTSPKVPEKGVRPKNPIPFYEKWRRETEDLRKEHILRQQ FT QHENEIEVRRKRELELVGKLNQLKGCKEEQFELQRRREADLLRQLQLREEE FT GAEFKKQQQKELNERDEELQRLRKIEQAFRVMQMSSQQKQIDANGSDGASD FT DAMRPQGCSQAGYANNIPIQTTPLVSVPHLSTVSMDSIDTPSNTPHHRNEL FT DESSMATPMGGRSQYSIHTNQSLPMFNHFPADQHKPLSNLPTFPPFVTPQY FT EPWPPFVHPRPSVANYGPLPVQFGPSPQQVAARQVISKELPPFTGDPIEWP FT MFICNYMHSTSACGYTDSENLLRLQRCLKGTAKDAVSSLLLHPSSVPQIIS FT TLQTLYGRPEQIVNNMVSKVRSAPAPKADRLDTWVTFGLMVQNLCGHLRAV FT GLENHLSNPILLQELVEKLPPTVKFNWALYQQSLPTVDLKSFGDYMEKVAT FT ATSSVTTSLINQPMTSRDERARGKDKAFVNAHAVSTTEKADVGESCQEHQH FT YQQHISAKKESSSCPACDGGGHQVANCATFRKLSVDSRWKEVKEKRLCGRC FT LTAHNRWPCRGEVCGVNGCQKRHHRLLHTDMLTEEMKSTGETKDATVSVHR FT TLTSSTLFRVLPVTLYGSKGQVDTFAFLDDGSSVTMVDKSTAEALGLEGNA FT ENLCIHWTGGIQKKICAQQVTVQISAVGSDKRFDASEVYIVEHLGLPEQTV FT NFEEMEQRFAYLKGLPVRSFSTAVPGLMIGLSNIHLLATLKLREGRIGEPI FT ATKTRIGWAVYGSLRGGVDQLPHRQMHIDTRPSDKDMHSFVQDFFALESLG FT ISVVPTTETVENQQAWKILEETTKRTESGRFQTGLLWKNDKIQFPDSEPMA FT EKRLMCLEKRLNRDPELYNTVRRQISEMQEKGYTHKATAAELAKFKRERSW FT FLPLGVVLNPKKPGKVRLIWDAAAKVKGVSLNTELLSGPDLLAPLLRVMFG FT FRERQVAICADIMEMFHQILIREEDRSAQLYKWRDSPDLTMETMIMDVAIF FT GATCSPTQSQYVKNKNADEYEVTHPRAANAIKEKHYVDDYLDSLDTVQEAT FT ELALEVAKVHAAGGFFIRNWISNDKTVLKQIGENSPTTVKSFIGEGQAERL FT LGIVWLPEEDVFTFTLNFRPDIRILIEDVLVPSKREMLRVVMSIYDPLGLV FT AAFVIHGKILIQDVWRTRIDWDQKVPLDVFNRWKLWLAVLLTMTSVRIPRC FT YFPGYEPASYRSLQLHIFVDASEQAFAACAYFRIVDRGRVRCCLVASKTKV FT APLRPLSIPRLELMAAVIGVRLKKTIINDHSLEIQQTFFHSDAGTVLAWIR FT TDPRRYRQFVSFRVSEILSLSTVEEWRWVPTKLNVADEATKWGRGPSFEPD FT SRIMIGPLFLYDDQREWPRDYRSGTEETEEELRPAYVYSHFLSKPLVNFSN FT FSRWERLLRSVAYIHRFMGNLKRRCRQEPGMEGGLTTLELQEAERTLWRLV FT QSDAYPDEVATLKYNLQADKKHLKKLEKRSPLAKLTPVMDKEGIVRIDGRL FT VDSDYVTHDAKYPIILAKEHSVTTLLLDWYHRKFRHANNETVVNEVRQRFH FT VPSVRVQVRAAKKRCMWCQVYKAVPVPPKMAPLPQVRLTPFKRPFTYTGID FT YFGPYFVKVGRSTVKRWVALFTCLVTRAIHLEMTNNLSTDSCKKAIRRFIA FT RRGAPAEFYSDRGTNFVGASRELIAEIKRINVQLSSTFTDHQTQWKFNPPA FT APHMGGCWERMVRSVKVAMGVLPFERKLDEESLATFLAEAEHMINSRPLTF FT VPVESDDHESLTPNHFLMLNSSGVKQPEKTPVDEGMALRGSWNKIQYTLDN FT FWRRWLKEYQPTLTRRTKWFHDVRHVQEGDLVVIADEGVRNRWLRGRIMRT FT YPGRDGIPRRADVRTSDGSVMRDRPVTKKFWRHVGE" XX SQ Sequence 6681 BP; 1934 A; 1490 C; 1736 G; 1521 T; 0 other; gaaactaaag agcttaagga ttaggaacgt caatatgaat cggaaatcga ttgtaaggca 60 aacacaattg cggacacagg acggaggaga aggtacacct aaccctcaga acgatgtcga 120 tctacgcgag aatgcatccg tgatgtcatc agaggcaaat tttgaagcat ccacaatcga 180 cgatgagcgg gcgaatgaaa gggattgtgg cgtctgcaat cgccccaaca aagccgaatg 240 gttcatggta cagtgtggag attgtgggaa gtggtaccac ttttcctgtg ccggtgtgac 300 tcataaaaca gtacattcca agccgttttc ttgtgcgtcg tgtgtccctc gatctattcc 360 tgcagcaacc tcgatagctg agcgaacgag tacatctagc tcgcgcaagg caaaactggt 420 tcgtgatatg cagcgccttg aggaggaacg cgccttggaa gaaaagatcc aacaagcaac 480 attgaagaaa ctggagcgcg agagagagta catcgcacgg aaatatgaaa tcctggagca 540 acaggacgag gctgacatgt cgagcgtgca tagtggtcgg tcaagtcgaa ctagcagagt 600 gaaggagtgg ttactcgatc agaataatgc cgtagaagtt cccagggctg ttaccgatga 660 tcctgtcgtt gaagacgggg aaaacagtca tgttggaact ttagttaatc gggacaatgt 720 acagatcact tccactcctt tgaagacggg attttgtgca atcggcacaa accagaaggg 780 tggtgaaagt cgcccagctt ccagtatcgg tggagtagcg gatagcgaca tggagaagac 840 gttcgggagc ataaccatca atggcaatgg ccgtccaacg aaagcgggag ttacagaaaa 900 cgtccttccg ctggtagatg tacagtcgct tgctgatcta ctggaaaaca cttcacccaa 960 agttcctgaa aaaggtgttc gtccaaagaa tccaatacct ttttacgaga agtggcgtag 1020 agagactgaa gatcttcgga aagaacatat cctacgacaa caacagcatg aaaatgaaat 1080 tgaagtacga cgtaaacgtg agttggaact tgtcggcaag ctcaaccagt tgaaaggctg 1140 caaggaggag caattcgaat tgcaacgtcg acgggaagca gacctcttgc gtcagctgca 1200 gcttcgagaa gaagaaggtg cagagtttaa gaagcagcag caaaaagagt taaacgagcg 1260 ggatgaggag ttgcaacgtc tgcgtaaaat agagcaggct ttccgcgtta tgcagatgtc 1320 gtctcagcag aagcagattg acgcaaacgg tagtgacgga gcgagtgacg atgccatgcg 1380 accgcaaggt tgctcgcaag caggatacgc caacaacatc cctatccaaa ccactccttt 1440 ggtaagtgtg ccccatttat ctactgtttc tatggactca atagatactc cttcgaatac 1500 acctcaccac cgaaatgagt tagacgagtc ttcaatggct acaccaatgg gtggtagatc 1560 tcaatactct atccacacga atcaatcgtt gccaatgttt aaccattttc ccgctgatca 1620 gcataaaccg ctgtcgaatc taccgacgtt tccccctttt gtcacgcccc aatatgaacc 1680 atggcctcct ttcgtccacc caagaccgtc tgttgcaaac tatggtccac taccagtaca 1740 gtttgggcca tcgccgcagc aagttgcagc aaggcaggtg atctctaaag aactgcctcc 1800 attcactgga gacccaatcg aatggccgat gttcatttgt aattacatgc attcaacaag 1860 tgcatgcggg tatacggact ccgaaaacct actgcgactt caacggtgtt tgaaaggcac 1920 agccaaagat gcagttagta gtctcctcct acatccgtcg tcggtccctc aaataatttc 1980 gacgttgcaa accttgtacg gccgacctga acaaatcgtt aacaacatgg tgtcgaaagt 2040 gcgcagtgct cctgctccaa aagcggatcg cttggacaca tgggttacct ttggtctgat 2100 ggtgcagaac ttgtgcggtc atcttagggc ggtcgggctg gagaatcatt tgtccaatcc 2160 aattctattg caggagcttg tggagaaatt gccacctact gtcaagttta actgggcact 2220 ttatcaacaa agcttgccca ctgtggatct caaaagtttc ggggactata tggagaaggt 2280 tgccaccgct acaagtagtg tgaccacgtc tctaatcaac caaccaatga ctagcaggga 2340 tgaacgagcc aggggcaagg acaaagcatt cgtgaatgct catgcagttt caactacaga 2400 gaaagctgac gtaggagaat cttgccaaga gcatcaacac tatcaacagc atatttcagc 2460 gaagaaagaa tcatcgtcat gtccggcctg tgacggagga ggtcatcagg ttgcaaactg 2520 tgcgacattc agaaaactga gtgtagattc ccgctggaag gaagtgaaag agaagaggct 2580 ttgtggccgg tgtttaacgg cgcataaccg ttggccttgc cgcggggaag tttgcggagt 2640 gaatggttgt cagaaacgtc accaccgact acttcacacc gatatgctga cagaggaaat 2700 gaagtcaacc ggagaaacaa aagatgccac tgtttcagtt catcgtacgc ttacgtcatc 2760 aacgctattt cgtgtgctac cggtgactct ctacggtagc aaagggcaag ttgacacatt 2820 cgcctttcta gatgatgggt cttccgtaac tatggtagat aaatcaacag cagaagcgct 2880 agggcttgaa ggaaacgcag aaaatttgtg tattcattgg actggaggaa tccagaagaa 2940 aatctgcgct caacaagtaa cggtacaaat atcagctgtt ggcagcgaca agcgctttga 3000 cgcatctgag gtgtacattg ttgagcatct gggcttaccc gaacaaactg ttaactttga 3060 ggaaatggaa caacgtttcg catatctcaa gggactgccg gtgagaagct tctcgacagc 3120 tgtcccaggg ttgatgatag gactgagcaa tatccactta ttagctacac tgaaactgcg 3180 tgaaggacgg atcggtgaac ccatagcaac gaagacgcgc attggatggg cagtttatgg 3240 aagtctacgt gggggagtcg atcaactccc gcaccgtcaa atgcacattg atacgagacc 3300 gtcggataaa gatatgcatt ctttcgtaca agatttcttt gccttggaaa gtctgggaat 3360 ttcggtggta ccgacaacgg agacggtgga aaatcaacag gcctggaaaa tcctggagga 3420 aaccacgaag cgtacggaga gcggtagatt tcaaacgggg ttgctctgga aaaatgataa 3480 aattcaattc ccggacagcg aaccaatggc tgaaaagcgg cttatgtgtc tcgagaagcg 3540 gctcaacaga gacccagagc tgtacaatac tgttcgtcga cagatatctg aaatgcaaga 3600 gaagggctat acacataaag ccactgctgc tgaattggct aagttcaaac gtgaacgttc 3660 atggttccta ccactaggag tggtattgaa tccaaagaaa ccaggaaagg ttagactgat 3720 ttgggatgca gcggccaaag tcaaaggtgt ttcgctgaac actgaattat taagcggtcc 3780 ggatctacta gcaccgctgt tgagagtgat gtttggattt cgcgagagac aagtggctat 3840 atgtgcagac atcatggaga tgttccacca aatattaatt cgggaggaag accgcagcgc 3900 acaactctac aagtggagag actcgccgga cctgacgatg gaaaccatga ttatggacgt 3960 tgcaatcttc ggcgcgacat gttccccaac gcaatcccag tacgtaaaaa acaaaaatgc 4020 cgacgaatac gaagttacgc atccaagagc agccaacgcg ataaaagaga agcactatgt 4080 cgacgactat cttgacagtc tagatacagt acaagaagcg acagagttgg cactagaagt 4140 ggctaaggtt catgcggctg gcggtttctt catccgaaac tggatatcaa acgacaaaac 4200 cgtattgaag caaataggcg aaaatagtcc aacgaccgtg aaaagtttca tcggagaagg 4260 ccaagctgaa cgcttattgg gaatcgtatg gctaccagag gaggacgttt tcacttttac 4320 tctgaatttt cgtccagaca ttcggattct tattgaagac gtattggtgc catcgaagcg 4380 agaaatgcta cgagtagtga tgagcatcta cgatccacta ggacttgttg cggcatttgt 4440 tattcatgga aagattctca tccaagatgt ctggagaact agaatcgatt gggatcagaa 4500 agtcccactg gatgtcttca atcgctggaa attgtggtta gccgttctgc tcacgatgac 4560 cagcgtaagg attcctcgat gttatttccc aggctacgag cctgcaagct atcgctcact 4620 gcaacttcac atattcgttg atgcaagtga gcaggctttc gctgcctgcg catattttcg 4680 catcgtagat agaggaagag ttcgttgttg cttggtggcg tcgaaaacca aagtagcgcc 4740 gcttagaccg ttatctatac cacgtctcga gttgatggcc gcagtgatag gagttcgttt 4800 aaagaagaca atcataaacg atcattcttt ggagattcaa caaacgttct ttcatagtga 4860 cgcgggtaca gtcctcgcct ggatccggac tgatcctcgt cgttatcgcc aatttgtctc 4920 tttccgagtg agcgaaattt tgagcttgtc gacggtggaa gaatggcgtt gggttccgac 4980 aaagcttaac gtcgcagacg aggcaacgaa gtggggaaga ggaccatcgt tcgaacccga 5040 cagcagaatc atgataggac ctttgttctt gtacgatgat caacgtgagt ggcctaggga 5100 ctaccgatca ggaacagagg agacagaaga agaacttcgg ccagcgtacg tatacagtca 5160 ttttctatca aaaccacttg tgaatttctc gaatttttcg agatgggaac ggttgttgcg 5220 tagcgttgcc tatatacacc gttttatggg aaacttgaaa cgacgctgca gacaggagcc 5280 tgggatggaa ggaggcctca cgacgctaga attgcaggaa gcggaacgta cgctttggag 5340 gcttgtacag tcggacgcat atcctgatga agtggccacc ctaaagtata acttgcaagc 5400 agacaagaaa catcttaaga agcttgagaa aagaagccct cttgcgaaac tcactccagt 5460 aatggacaaa gaaggaattg tccgtataga tggtcgacta gttgacagtg attacgtaac 5520 ccacgatgca aagtatccca ttattctagc gaaagaacat tctgtaacca ccttactgtt 5580 ggactggtat catcgcaaat ttcgccatgc caacaacgaa accgtggtga acgaagttcg 5640 ccaaagattc cacgtgccaa gtgtgagagt tcaagtccga gcggcgaaaa agcgatgcat 5700 gtggtgccaa gtgtacaaag ctgtacctgt tccaccaaaa atggcaccgc tgccccaagt 5760 gcggctgaca cccttcaagc gaccgtttac gtacaccgga attgattact tcgggccgta 5820 tttcgttaaa gtcgggcgat ctacggtgaa aagatgggtc gccctgttta catgtttggt 5880 gactcgggcg attcatctgg agatgacgaa taatttgtca accgactcgt gtaagaaggc 5940 gatacggaga ttcatagcgc gtagaggagc cccagctgag ttctactcgg accgtggaac 6000 caacttcgtc ggtgccagcc gagaattaat agcagaaata aagcgcatca acgttcagct 6060 cagcagcaca ttcacggatc atcagacaca atggaagttc aacccccctg ccgctccaca 6120 catggggggc tgctgggagc ggatggtacg ctcggtcaaa gtcgcgatgg gggtattacc 6180 tttcgaaagg aaactcgatg aagagtcctt ggcaacgttt cttgctgaag ccgagcacat 6240 gataaactcg cgtcctttga ctttcgtgcc ggttgaaagc gacgatcacg agtcacttac 6300 accgaatcac ttcctaatgc tgaattcaag tggagtgaaa caacctgaga agacaccggt 6360 agacgagggc atggcattga gaggaagctg gaacaagata cagtacacgc tagacaattt 6420 ctggcgacgc tggttgaagg agtatcaacc gaccctaacg cggcgaacta aatggttcca 6480 tgatgttcgt catgtccaag aaggtgactt ggttgtaata gcggatgagg gagtgaggaa 6540 tcggtggctc agaggtcgta taatgaggac atatccagga agagatggta taccacgccg 6600 tgcagacgta cgtacatcag atggttcggt tatgcgagat cgaccagtaa ctaagaagtt 6660 ctggcgacac gtgggggagg a 6681 // ID Gypsy-172_AA-LTR repbase; DNA; INV; 444 BP. XX AC supercont1.188; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-172_AA_; KW Gypsy-172_AA-I; Gypsy-172_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-444 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.188; Positions 1616146 1615703. XX SQ Sequence 444 BP; 147 A; 97 C; 78 G; 122 T; 0 other; tgtaagcaaa gctctttaga caaacattga attactcatt taatattcat tctactctca 60 aattacaatt gctacatatg ctaaccatgc actcttcaag tgccggtcaa taaatcaatg 120 caccctgctg agtacattgt actcagcaga cccgaataga cagaaggtca agttacagac 180 ctccactgtt gctgcgtgaa ccaaaccaaa atgcgatatg cagatgttgc atatctgtac 240 agtaaatatg cagtacatac cgatcgtcat tgattggccc gatcgaagct aagcacctcc 300 tccagtcaga atgactatca atgtattagt tttgtagact ttaagaaata ggttagctgt 360 aagatgcaat acaataaacg tagaaatctg attttgaacc tgaaaccgtt cgggtgtttc 420 aatgccaaat gtccgaacga caca 444 // ID Gypsy7-LTR_AP repbase; DNA; INV; 130 BP. XX AC Contig49402; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7AP; KW Gypsy7-I_AP; Gypsy7-LTR_AP. XX NM Gypsy7-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-130 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 450-450 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 130 BP; 14 A; 47 C; 32 G; 37 T; 0 other; tgtggcggcc cgtctacgtg gcgccacaat cgctcctacg ttcacgtcta tccgaacgac 60 acggcctgtc tgccgttccc gtgttcgcgc gctctctctt tgttttgtat tccgtccgtc 120 gcgccgtaca 130 // ID Gypsy-58_CQ-I repbase; DNA; INV; 7699 BP. XX AC AAWU01036277; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-58_CQ_; KW Gypsy-58_CQ-LTR; Gypsy-58_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7699 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 495-495 (2011). XX DR Genome; AAWU01036277; Positions 15018 7320. XX CC Positions [3360-3863] - Reverse transcriptase CC Positions [4914-5393] - Integrase core CC 'TTTT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 433..2481 FT /product="Gypsy-58_CQ-I_1p" FT /translation="MESYIFPSTSTLNDEELNFELKLRNKQIYPNEKLEQK FT QLFVKRLFTEDREKKTKYKGINFYENEKPIIIKNVDEIIEKLNTTFDRRLL FT SRLRHYLIRTNYALTASEFDETDKTYLIKEIEDVFTKFKKKFKYSSEDEGL FT ESEGNGKDGKNDKKKDEPTSKDDKLGVSSKLNASLDQILSKLSLASERITD FT FIEASQLKKLNRTGNDWSDYSNSEDERERGRRNRDSRDNKSHGDSKQPRLG FT EYDKWRRGRKIFQGPEPDPIPGHSRQNFSNHGRGLGGTSFHNDQGRGRKVN FT RGGNLSHIDRQQGYDSSGDYGSGSSSRDVRRQPSKKRVVSRHRNNRGRSLS FT SSDGHADRDGPQRRHRRRSGNRHRRDISDRSSSASRHRRGYRRSRVENWDL FT IFSGDNRSIQVEDFLYRIKKLARHEDVSQGELLRNIHHRLKGEAYDWWFTR FT EENFTRWSRFEDEIRFRYGNPNRDRGIRAQIRELKQKRGEKFVAYVTEVEK FT LTQCLVRPFSPDTLFELIWENMRPHYRSRLSTITVDDLEHLIELNHKIDAN FT DPFFFKPLNGTRNDVNHLTVEESDYSDDEGFQVNAIQKQRTPRQSAPPGGQ FT SRAANISNDRLTSDQQQQPGTAEVPVCWNCQAIGHAWRECTKPKLVFCYAC FT GKLGRTARTCEKNHYPTSQEDQDQDQSTNWRRDA" FT CDS 2664..5759 FT /product="Gypsy-58_CQ-I_2p" FT /translation="MSSLNLIEAHGLKILKSNIKVCTADDTEHTCLGYVNV FT PYTLGNETKVVSTLVVPQIKKPLILGMDFWKAFNIRPVITTNNRTEELDLT FT WTSTLSDTPNVNCLRVENDHALADNEIIALSIHIMEFDVDEDQIHMKSQPE FT EDDSLDIPTLELPEDPEALIDAVETEHPLTVEEKRQLKDVLRQFDCTSEGR FT LGRTTLIEHEIELVEGTNMKELPMYRYSPKIWENVEQELERWKTLDVIEEC FT TTEFASPLVPVKKANGKIRVCLDSRRINSVTKKDAYPMRNMSEIFHRLQKA FT KYFSIVDLKDAYFQIPLRESSRNYTAFRTPKGLFRFKVVPFGLKNAPFTMS FT RVMNLAIGFDLEPNVFIYLDDIVIATEDLNEHFRLLNEVASRLKRAGLTIS FT VEKSRFCRKQVRYLGYLLTENGLAIDSAKLEPILNYPRPKTIREVRRLMGL FT MGFYQKFIHRYSHVTAPITDLLKKTKKFRWSEEANNALSELKSVLTSAPVL FT SNPDYSRPFLIETDASQLAVGAALMQEFDEGKRIIGYFSKKLSSTQRKYSA FT TEKECLGVLLAVENFRHYIEGSTFTVVTDAKSITWLFSISAANANSRLLRW FT ALKLQSYDFVLQYRKGKDNVLADCLSRIETVQIVDQDYVELRKKIQKDPDK FT FKSFRVSGERIYKYIEEPGAFKDKRFEWKYYPPMSERTEIIHKIHDSAHLG FT YDKTLNKLRETYFWPNMPTETKKFCRECIPCKTAKGININPTPAMGSQKKF FT CDHPWQFITLDYVGPFPASGRNRSTCLLVLTDVFTKFVLVQPFRQATASSL FT VHFLEQTVFLLFGVPEMVLSDNGTQFLSKEFSKLLEHYGVKHWLTPSYHPQ FT VNNTERVNKVITTAIRATLKGNHKTWAENIQQIACAIRNSVHESTKYSPYF FT LTFGRNMISNGMEYENLREMNTPTSTTLSEQEREELYKEVRKNLAEAYQKQ FT AKYYNTRSNSKAPTYVVGEKVLKKCTLLSDKSKDFCAKLGPKYVEAYVNRV FT LGDSYELKDKDDKILGIFHATFLKKF" XX SQ Sequence 7699 BP; 2476 A; 1425 C; 1635 G; 2163 T; 0 other; aaaatcttca tagcacggtt aggtcggtta cattggcgcc caacgcaaaa agttttccat 60 agaatttgct tttgttggat gatatttgaa taggaaaaga aagtgccact gtttttttat 120 attttttttc agtttgtttt tttttttttt ttttgaaact gtacatattt aaacgaaatc 180 ggagtgttat gaaatttctc tcagtttttt ttataaaggt tgcgacttct ccgcatgtat 240 caaagatatt tttttgtaga tatatacatt ttttttagtg gttaggtttt ttttcatttg 300 tacatacttt ctattgaaaa tgatctttga acattttttc taaatattag tttttttttt 360 aaatttacat tatcttcata ctgatataac gagttaggct aatagaaagt tatagattta 420 gttagtttga aaatggaatc gtacattttt ccctccacat caactttgaa cgatgaagaa 480 ttgaattttg aacttaaatt acgtaataaa caaatttatc caaatgaaaa attagaacaa 540 aaacaactat tcgtcaaaag attgttcacc gaagacagag agaagaaaac aaaatacaaa 600 ggaattaatt tttatgaaaa tgaaaaacca attatcatta aaaacgttga tgaaattatt 660 gagaaattga acactacatt tgatcgtcgt cttctctcaa ggttgcgaca ttacctaatt 720 cggaccaatt atgctttaac agcatctgag tttgacgaaa ctgataaaac atacctaatc 780 aaggaaattg aagatgtttt cacaaaattt aaaaagaaat tcaaatacag cagtgaggat 840 gagggattag aatcagaagg aaacgggaag gatgggaaga atgataagaa aaaagatgaa 900 cctactagca aggatgataa gttgggagtt tcctccaaac tgaatgcatc acttgatcaa 960 attttaagca aattaagtct cgcctcggaa agaattacag attttatcga agcgagccaa 1020 ttaaaaaagc ttaacagaac gggaaatgat tggtcagatt attcgaacag tgaagacgaa 1080 agggaaaggg gtagaagaaa cagagattcc agggataata aaagccatgg agactctaaa 1140 caacccagac tgggagaata tgataagtgg agacgtggtc gaaaaatctt tcaaggaccg 1200 gaaccggatc cgataccggg acatagcaga cagaactttt cgaatcatgg tcgtggtctc 1260 ggcgggacat cattccataa tgatcaaggt cggggcagaa aggtgaaccg gggaggtaac 1320 ctgtctcata tcgacagaca gcagggctac gatagctcag gtgactacgg ttcaggcagc 1380 tcgagtcgtg acgtcagacg acaacccagc aagaaacgag tggtaagccg ccaccggaac 1440 aacagaggga gatcccttag cagttccgat ggtcacgcag atcgagatgg tccacaaagg 1500 cgacacagga ggcgttccgg aaatcgacac cgacgagaca tttccgaccg tagttcgagt 1560 gcatccagac acaggcgtgg gtaccggaga tcgcgagttg agaactggga cctgatcttc 1620 tcaggtgaca acaggtcaat tcaggtagag gattttctgt atcgaatcaa aaagttggcc 1680 aggcacgagg acgtgtcaca aggcgagttg cttcggaaca tacatcaccg attaaagggt 1740 gaagcgtacg actggtggtt cacgcgcgag gagaatttca ccagatggtc tagatttgaa 1800 gacgaaatcc gtttcaggta cggaaacccg aaccgggaca ggggaattag ggctcagatt 1860 agggagttga aacagaaacg gggagaaaag tttgttgctt acgtaacaga ggtcgaaaaa 1920 ctgactcaat gtctagttcg accattttct ccagatacct tatttgaact aatttgggaa 1980 aacatgaggc cccattatcg ttcgcgcctt tcaacaatta ctgttgatga tttggaacat 2040 ttgattgaat taaatcacaa aattgacgct aatgatcctt tcttttttaa gccactaaac 2100 ggaacaagga atgatgttaa tcatctaacc gtggaagaaa gtgattattc tgacgatgaa 2160 ggatttcagg tgaacgccat tcagaaacag cgaacaccta gacaatcagc accacctggt 2220 ggtcagtcta gagctgcgaa tatttcaaat gatcgtctaa cttcagatca acaacaacaa 2280 ccaggtacag cggaggttcc agtttgttgg aactgccagg ccatcggcca tgcttggaga 2340 gaatgtacaa aaccaaaact agttttctgt tatgcttgtg ggaagttagg tcgaacagct 2400 cgaacttgtg agaagaatca ctaccctaca tcccaagaag atcaagatca agatcagtcg 2460 acaaactggc ggcgggatgc ttgatcggga attcaaccat ctcgccagta gaaacaattc 2520 ccagcgaaga ggaatctact tacttaaata acaactcagt cttacaaatt agggttaatc 2580 ctagtgcttg cccacacatt aaagttaacg tgttaggttc agaaattgtt gcactgcttg 2640 attctggtgc tggaataagt gtcatgagct cgttgaactt aatcgaagct catggactaa 2700 aaattttgaa atctaatatc aaagtctgca cggccgacga tactgaacac acttgcctag 2760 gttatgtgaa cgttccatac acgctaggaa atgaaactaa agttgtatca acactagtcg 2820 tacctcaaat taagaaacca ttaattctag gaatggattt ttggaaagcg tttaacattc 2880 gaccagtaat aaccacaaac aacaggacgg aggagttgga cctgacgtgg acgtccacgc 2940 ttagcgatac gcctaacgtc aactgcctac gcgtagagaa tgaccatgcg ttggcggaca 3000 acgagatcat tgcgttgtca atacacatca tggagtttga cgtggatgag gatcagatcc 3060 atatgaaatc tcaacccgaa gaggacgatt cgttagacat tcccacgcta gagctgccag 3120 aagatccgga agctttgatt gacgcggtgg aaacggaaca cccgttgact gttgaggaaa 3180 aacgtcagct gaaagatgtc ctgcggcagt tcgattgcac cagcgaaggt cgtcttggga 3240 ggacaaccct cattgagcac gagatcgaac tcgtggaagg gaccaacatg aaggagttac 3300 cgatgtatcg ttactcgccg aagatctggg agaacgtcga gcaggagtta gagcgttgga 3360 aaacgttgga tgttatcgag gagtgtacaa ccgagttcgc cagtcccctg gttccagtaa 3420 agaaagcgaa cggaaaaatc cgagtttgtt tggactcgcg ccggatcaat tctgtcacta 3480 agaaggacgc ttacccgatg aggaacatgt cggagatctt tcaccggttg cagaaagcga 3540 agtactttag tatagttgac ttaaaggatg cgtactttca aataccactt cgggagagtt 3600 cgcgtaacta cactgccttc cgcacgccca aaggcctttt taggttcaaa gttgtccctt 3660 ttggcctaaa aaacgcgccg ttcacgatga gtagagttat gaatctcgca attggcttcg 3720 atttggagcc aaatgttttt atatatctag acgatattgt tatcgccact gaagacctga 3780 acgaacactt caggctcctg aatgaagttg cgagtaggct caaacgagca ggtttaacaa 3840 tatctgttga aaaaagccgg ttttgtcgaa agcaagtgcg atatttaggc taccttttaa 3900 ctgaaaacgg cctcgcgatc gatagtgcca agttagaacc gatactcaat tacccgcgcc 3960 cgaagaccat ccgcgaagtg agaagattaa tgggactaat gggattctat cagaagttca 4020 ttcatcgtta cagtcacgtg actgccccaa taactgattt attgaagaag accaagaagt 4080 tccggtggtc tgaagaggcc aacaacgcgt tgagtgaatt aaaatctgta ttaacgtccg 4140 cccccgtttt atcaaacccc gattactcgc gaccgttctt aatcgagact gatgcgtccc 4200 agctcgccgt gggtgctgct ctgatgcaag aatttgacga aggaaaacgt attattggtt 4260 attttagcaa aaaattatcg agtacgcaaa ggaaatattc cgcaacggaa aaagagtgtt 4320 taggagtgct tctcgccgtg gaaaattttc gacattacat tgaaggatcg acatttaccg 4380 tggttaccga cgcgaaaagt attacctggt tgttctcaat ttccgccgcg aacgcgaact 4440 ctcgtttgtt aagatgggct ttaaaattac aatcgtacga ttttgtatta caataccgga 4500 aagggaaaga taacgttcta gccgattgcc tctcgcgaat cgagaccgtc cagatcgttg 4560 accaagacta cgtcgaactt agaaagaaaa ttcagaaaga cccagacaag tttaagagct 4620 ttagagtttc tggggaacgt atttacaaat acatcgaaga acccggggcg tttaaggaca 4680 aacggtttga atggaaatat tatccaccta tgtccgaacg aaccgaaatc attcacaaaa 4740 ttcacgattc agcacacctg ggatatgaca aaacattaaa caaactcagg gaaacctatt 4800 tctggccaaa catgcctacc gaaacaaaaa aattttgtcg cgagtgcatt ccatgcaaaa 4860 ccgcgaaagg aatcaacata aatccaactc ctgctatggg ttctcaaaag aaattctgcg 4920 accacccctg gcaattcata actttagatt atgttgggcc ctttccagct tcaggtagaa 4980 acagaagcac atgtttgctc gttctaactg atgtgtttac gaaatttgtc cttgtccaac 5040 cttttagaca ggcaacagca tcatcattag tacattttct agaacaaaca gtatttttgt 5100 tgtttggggt accggaaatg gttttatccg ataatggcac acaatttttg tcaaaagaat 5160 tttccaaact tttagaacat tacggagtta aacattggct tacaccttct taccacccac 5220 aggtgaataa cactgaaagg gtaaacaaag taataacaac tgcaatcagg gccactttga 5280 aggggaatca caaaacatgg gcggaaaaca ttcaacaaat cgcttgtgcg attaggaatt 5340 cagtacatga atcgacaaaa tattcacctt attttctgac atttggtcgc aacatgattt 5400 cgaatggaat ggaatacgaa aatttacgag aaatgaacac accgactagt accaccctaa 5460 gcgaacaaga acgagaagaa ttgtacaaag aagtacgtaa aaacttagca gaagcttacc 5520 agaaacaagc gaaatattac aatacaagat ctaactcaaa agcaccaact tatgttgtgg 5580 gagaaaaagt tttgaagaag tgtaccttat tatcggacaa atcgaaagat ttttgcgcca 5640 aacttggacc aaaatacgtc gaagcttacg tgaatcgagt tctcggagat agttacgaat 5700 taaaagacaa agacgacaag attttgggca ttttccatgc aactttctta aaaaagtttt 5760 aaatgtacat aaaacaatca aacttattga aactaggtta gtaaatattg ctacattatt 5820 ttacctagcc ttagaaatgt cggaacgaca tttgtacata aactcagcaa aatctactta 5880 gttctatatt agtaaacaaa aacattattc actatcgctc accattttct tcattcataa 5940 atcgtttaag tttagatcca acttttcagc atgtcgacgt agattcattg aggagtattc 6000 ttgaaacact tttttgaagt cgttgagcat aaatttcatt gattaaatcc cttggggcag 6060 cataaatttc attcatccgt atccgtatgt agtattggtg aaatctttag aacgtagtag 6120 aaggtgacac acacactata gaacagcaac gaaaaatctt tccggctctc cctgttgatt 6180 tagtatgata agttacctgt aaatagacaa caatctaata aattatagga taattaacta 6240 tttataccga atactctcca gttttagaga atcaggggga aaaagtgcca atttttcagc 6300 tgaattcgat tttcttagtc cacacgcggc acacgttgtt tttgaggcac tcttgttttg 6360 gttatgacag ctgatttgac gttgtcattc cgggacaggg ctgtgtaagg ttatgctgtc 6420 agttttttcg atgatttttt ttcggcaggg atgaacacgg ttcacggcga ggaaattcga 6480 ggtttaatcg aacaatgttt gagaataaga attttttttt taaattttat ttttccgtta 6540 aatttggatt tgaagcattt ttgaaaattt tgaggtcgaa caaaaattac cattagactt 6600 atgattaagt tgaggtgatt ttgttctgtt tggaggacag tttgattgaa gttttgtcga 6660 tattttgatc gatgtgaagc aatcactcta ctttaccttt ttaacttctt tcctaggatg 6720 gatatgatcc attaggaaat ttggcaaacc agtttgcgtt gttagatttg tttatcgaaa 6780 aaagacatta gtcgaaagaa actgatagta tgaacatctt tggtgaaagt agctctagta 6840 attaagttgt tatctgagag aaatgaagta actaaattta gtatacataa gttttagata 6900 ataatttcgg ttcaggatcc cgtgcacagc tacccagtca cgaaatattc tagcagagct 6960 ggttctgtca gaatcctgct agaacagtgg aacatttctg ctagaatgct gtaagaatat 7020 ccagctctat aagaactctg ctagaatcct gctagaacgt cgtgactggg tagtttttag 7080 gttatgaaaa tttgttcaat tttcaattga acaaattttc ataaaacgag gtggtgtgat 7140 gtaacgtttt ccgttattgc gaattaattt ttttagttta aacttaaagt taataaagaa 7200 ataaaaaaaa atgtaaataa ataaatataa aaaaaaatta ttaaataaat atctagattg 7260 aggcaacgcg cgtaccggta tcagacgaag cgagtgtata ttcacgactc acgtttcgtg 7320 gaattttata gcaaatggga tctgtcacac tgaacgttgg tgagcccgtc tcgggtgccg 7380 ccatattccg gggactcgga acttcggagg tttcctgggt atgagagttt cgggtctcgt 7440 tctttctgcc aacggaacgg ctggtgataa gaagtgcgcg acgcgactag tgactttaac 7500 ctttgccttt gccttttacc ttaataggtt agttgtacat agtgcgagtg ttcttcctaa 7560 agatttattt aattaagttt actttttatt ccattagttt tacctaagtg aattcttgga 7620 tttttgcgtc cgctttctta aatacagttc actttaacat acaataaggt aacgtagttt 7680 gtttgaatta aatatataa 7699 // ID Copia-3_SI-LTR repbase; DNA; INV; 256 BP. XX AC AEAQ01006153; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_SI_; KW Copia-3_SI-I; Copia-3_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-256 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01006153; Positions 126 381. XX SQ Sequence 256 BP; 60 A; 64 C; 44 G; 88 T; 0 other; tgttggagta ttcatcatgt gaggcgctga ccaccgacga cattttttcc ctagaccagt 60 tttgtctctc tcactctatc tctctctcgc tctattcatt cttcccttca tgagcttgct 120 tttgtatcac aagtcgttca ttaaagatat tgcatattga actgtgtaat gtacagtgta 180 cacgacgtgt catcataccc tgtatcctct cgagattcct gcgtgaaagg gaacattcag 240 ctcattaaag tcaaca 256 // ID Gypsy-54_AA-I repbase; DNA; INV; 7046 BP. XX AC AAGE02021233; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-54_AA_; KW Gypsy-54_AA-LTR; Gypsy-54_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7046 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021233; Positions 26723 33768. XX CC Positions [4253-4789] - Reverse transcriptase CC Positions [5963-6430] - Integrase core CC 'GGATT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2109..3317 FT /product="Gypsy-54_AA-I_3p" FT /translation="MLHSNYRRTSKKKLRKALTAAIHENTMLREKLFQLQS FT QQIGSEFASAPSSTLREGASTSTNTGNESLLMTTMNNMTISSLNVPECKPS FT EGETEIDKAAFEHWKEVLMSAMDLISATDERSKMAIFKIKAGSKLLEIFNS FT TETTENMPDEYETPFSNAIVRIDEYFGSRAYTLTQRSKLLIMSQKANETSV FT QFIRRVGAAAKLCNYRSDEENEAIMRTVTKGATDSRVRTLAHRNWVSQRSL FT KDLIDAVRDWEVEQSNENDFQRSQRVATIAAVAQGPNDRFKPSNEFKMRNA FT VQWHSDSRRTGWHNEERRSNFRSGSRGRGGFRQVNPSSQSNSRCCRCGSIY FT HMEAECRMKHKVCNSCGKLGHISRVCRDPSMFNKQTRAQKRNWSNRQRCYC FT LDCQDSKVG" FT CDS join(3521..5389,5393..6865) FT /product="Gypsy-54_AA-I_1p" FT /translation="MHMQKKMGTVLIRYVRCTLQVSSSVSVSADVLGEITI FT QKEVHDIENLPILNHKQTERDAIISAKVAGVEISFFIDSGAQVNTITKHYF FT DEILANEVSAEQLRDLRFSTDKPLRAYASDVNIDVVAQFSAELYISEDRPV FT FMETFYVVDEIRALLGFNTATRYGVLAVGLNVNVCKWTEDPWKCELGNIRT FT VTTPPLKQFPKFNIPPVCLSYDKEMPPARNVYTHIPPAFKDQTTQKLNNLL FT QTGIIEVVTPEMDRSFCSSLLVVPKGKDDIRLVIDLRGPNRCIHRTPFKMP FT TFESIVMQLHGAKWFSTIDLTNAFFHVELDESSRHLTNFFAGDGLYRYRRL FT PFGLSNAPDIFQEIVQVGILNGCKGVVNYLDDFLIFGATEEEHDNNLAEVM FT KRLENHNVQINLKKCAIHQQEVDFLGFKLSSEGWKVESEKYSAIQNFRRPV FT TIAEVKSFLGLINFVERFIPQRADRTRKLRELAKSETLYWNEELEEEFEYL FT RNEALNKISTLGYYSRQDRTELYVDASPYGLGAVLVQFDKDSVPRVIACAS FT KALSTAELKYPQTQKEALAMVWGVERFSMYLMSISFVIRTDAESNEFIFNG FT MHRIGKRAVSRAEAWALRLQPYNFRIILFAKVGILIIYTYTFFHIIIYRVE FT RVSGDMNVADALSRLVIQSQSETFDEANDRHLLFHIDAGSLEISWNEIETF FT AESDEEFSKVRLAIQSGFWEPGFRKYESQEKELRILGSMLFKGDRIILPSA FT LRDKAIQSAHRGHVGIGSTKRILREYFWWPGMSSEAASFVKQCETCLQLSR FT KNPPIPLTSRELPNGPWEILQVDFYTDKEFGHGEFLVVVDTYSRYLHVVEM FT RSIDADSTNTALNQIFEVWGLPLAIQSDNGPPFQSEKFVRTWESKGVRIRK FT SIPLHAQSNGAVERQNKGIKDALAASKLDNINWKLALEQYISVHNKVRPLS FT RLGVTPFELLVGWKCRGVFPCLWETNCSEIDRNEIREKDADSKLQSKQYAD FT FKRGAKESSIKVGDVVIVSVAKRLKSDPTFGSERFTVVARQGAKIVVRSDR FT GVTFSRNLQDVKLAPQQPREISDDRTIDQNDLFDSRPQRSRKLPNRFNDMH FT LYHIFQ" XX SQ Sequence 7046 BP; 2368 A; 1195 C; 1594 G; 1889 T; 0 other; ttggcgcaga agtgcctccc aaaaagtaag tattacaata gattcaaatt aaaggaaact 60 gaagtgatga gaagatgatg ataacgtaag gcgtaaaatg gaagttttgg aacaaaaagg 120 gctgaaatac cggatgcaga gcgggcgaga tcagtaaaga aaataatcgt cggattgtaa 180 gcaaaaataa tgaaagagat ttggttgaaa agttaagcta gaggaaaaag tgttttttta 240 aatatgattg tggaaattag agctgctttt gcaattaatt gttgatcatg aataacgcta 300 aacagttatt gaaaagaagc tggcgagaaa aaaaggactg cttcgttaat attaaatatg 360 caagctggcg agttaaagga ctgcttgcat gatggaaata attaaaacaa tgagatggtt 420 aatgtatacg gcaagcaaga tggcgagaat aaatgaaaag ctggtgagac aaaggactgc 480 ttcttcatta tttaatacgc aagttggcga gtttgaggac tgcttgcatg atagaaaata 540 tataaaaaga ataaatttga ccgatgtgtt tagcagacaa gctggcgagt ataaggactg 600 cttgtcgaaa taattgaata aactgtgtaa attatgtgaa ttagtaactg aagagaagct 660 ggcgagacaa aggactgctt cttcgatatt taaaacgcaa gctggcgagt ttaaggactg 720 cttgcatgat agaaaatata ttaaaagaat taatttgacc aatgtgttca acagacaagc 780 tggcgagaat aaggactgct tgtcgaaata attgagttag gtgtaaatta tataaattag 840 taaccgaaga gaagctggcg agacaaagga ctgcttcttc gatatttaaa acgcaagctg 900 gcgagtttaa ggactgcttg catgatagaa aatatattga aagaataaat ttgaccgatg 960 tgtttagcag acaagctggc gagaataagg actgcttgtc gaaataattg aataaattgt 1020 gtaaattatg tgaattagta actgaagaga agctgtcgag acaaaggact gctttttatt 1080 attgaatacg taagctggcg agtttaagga ctgcttactt gatagaaaat attattaaat 1140 aaaaggacgt tgttttcggc agacaagttg gcgagaataa ggacagcttg tcgaaataat 1200 tgtcttagat gtaaataatg tgaactagta actaaagaga agctggcgag gcaaaggact 1260 gcttctttat catttaatac taaagctttc gtttctaagg actgcttgca tgatgaaaag 1320 tataaaataa gagagatagc cgatgtgact agtaggcaag ctggcgaggg taaggactgc 1380 ttgtcgaaat agtgggtttt aggcgtaaat tatgcggaat agtaactaaa cacatacgac 1440 tgcttcgttc acatttgata tgcaaactgg caagtttaaa gacttctgat ggaagcatga 1500 tggtaactat gaaacaaaaa aaagtgatta cctatgtgtt tgtcaggtaa ggtagtgaaa 1560 gtgagcactc ctggctaaaa cgaaaatttt aacgttatat gcagaacgaa gattcaaaaa 1620 taaaagagat gctgcttcgt ttaaatatta tggcctctgc ttgttatgat aaattatttt 1680 gttgtaatag gtattaactc gagaaaatta ctttatctaa gcggacgaag aataaagatt 1740 gcttgtttat ttgtaaaata tgcaaacgat aataagaact gcttgttggg ataaaacaaa 1800 ttttgcagct atatattaca agaactgata tatatataaa aaaaagagta gcgctaatat 1860 tcaacaacaa aaataaagaa cctaatagaa agctttactg tgattgtaat atatccaaat 1920 ggaatgaact ctagaaaatt taaccatatt tttgaggttc ttaaagaata caattatatc 1980 ccgtcgctat tgatcaacaa aagaagagga atctcctcgt aatttaatta caaaacattt 2040 aaataccttg aagtgtaatt tttttatcaa aatcaaaact tgtgattttg ttttctaatg 2100 caatttttat gttgcattcg aattacagga ggacttctaa gaaaaaactg cgtaaggctt 2160 tgacagcggc gattcatgaa aacacaatgt tgcgcgaaaa gctttttcaa ttgcagtcgc 2220 agcagatcgg atctgagttt gccagcgctc catcgagtac gttgcgcgaa ggcgcaagta 2280 cctcgacgaa cactggaaat gaatcactat tgatgaccac catgaataac atgactatta 2340 gttctcttaa tgtacccgaa tgcaagcctt cggaaggcga aactgaaatt gacaaagctg 2400 cattcgaaca ctggaaagag gtactaatgt ctgctatgga cctgatttct gcgacggatg 2460 agcgatccaa aatggctatt tttaaaatca aagccggatc aaagcttctg gaaattttca 2520 atagtacaga gacaactgag aatatgccag atgaatatga aactcccttt tcaaacgcaa 2580 ttgttcgtat tgacgagtat ttcgggtcaa gagcctatac tttgactcaa cgcagcaagc 2640 tattaattat gagtcagaaa gcgaacgaaa caagcgtaca attcataagg cgagtcggtg 2700 ccgctgcaaa attatgcaac taccgatctg atgaggaaaa tgaagccatt atgcgtacag 2760 ttacaaaagg agctacagat tcacgtgtgc gtactctggc acacagaaat tgggttagcc 2820 agcgttcact caaggatcta attgacgcag tgcgagattg ggaagttgag caatccaacg 2880 agaatgactt ccaaagaagc caacgagtag ccacaatagc agccgtggca cagggcccga 2940 atgatcgatt caaaccatct aatgaattca aaatgcgcaa cgcggtgcag tggcacagtg 3000 atagcagacg cacaggttgg cacaacgaag aaaggcgatc gaatttccga agcggtagtc 3060 gaggtcgtgg tgggttccga caagttaatc caagtagtca gagcaacagc cgctgctgta 3120 gatgtggcag tatttatcac atggaagcgg agtgccgaat gaagcataaa gtatgcaact 3180 catgtggcaa gttgggccat atctcacgcg tttgtcgaga cccatccatg ttcaacaaac 3240 aaaccagagc ccagaaacga aattggagca accgtcaacg atgctactgc ctcgactgcc 3300 aagattcaaa agttggatga agctctgaat ccagaggaac aggtaatgaa aacgtctaac 3360 gatactacct gaatgttatc gtatacattg ttatcaatca tttgatattt gatataagct 3420 cgattttgta actttcaatc attaaaatgt gttttaaata taattatcac aacaaaaata 3480 gtagggttta aatttttcat cgtacataaa atactcgtag atgcacatgc aaaaaaaaat 3540 gggaactgtt ttaattagat atgtacgttg tacattgcag gtatcttcga gtgtatcagt 3600 ctctgcggat gtattagggg aaattactat tcagaaggaa gtccatgata tagaaaatct 3660 tcctattctg aatcataagc aaaccgaacg tgatgcgatt atttctgcaa aagttgcggg 3720 cgtagaaatt agctttttca ttgattccgg agcgcaagta aacactatca ctaagcacta 3780 cttcgacgaa atccttgcaa atgaggtttc agcagaacag ttacgcgact tgagattctc 3840 gacagacaaa ccgcttagag catatgcatc tgatgtaaac attgatgtgg tcgcccaatt 3900 ttcagcagag ttatacattt cagaagacag gccggttttc atggaaacat tttacgtggt 3960 tgatgaaata agagccctgc tgggtttcaa taccgcaacc agatacggcg ttctagctgt 4020 cgggcttaat gttaatgttt gtaagtggac agaagatcct tggaagtgtg aattgggaaa 4080 catacgtact gtgacaactc caccccttaa gcaattcccc aaattcaata tcccacctgt 4140 atgtctgagt tatgataagg aaatgccgcc agcccggaat gtgtatacgc atattccacc 4200 tgcattcaaa gatcaaacaa cacaaaaact gaacaatttg ctgcagacag gaataatcga 4260 agtagttact ccagaaatgg acagatcctt ttgctcgtct ctcctggtcg tcccaaaggg 4320 caaagacgat attcggttag tcatagattt acgaggacca aatcgttgca ttcacaggac 4380 tcccttcaaa atgccaacct ttgaatctat cgtaatgcaa ctgcatggtg ccaaatggtt 4440 ttcaaccatt gatttaacaa atgcgttttt tcatgtggag ctcgatgaaa gctctcggca 4500 cttaaccaat tttttcgctg gtgacggtct gtatagatat cggaggttgc cattcggatt 4560 atcgaacgca ccggatatct ttcaagaaat tgttcaagta ggcatattga acggttgtaa 4620 aggagtcgta aactatttgg atgatttcct tatatttggg gctacggaag aggagcatga 4680 taacaatctc gctgaagtta tgaagcgcct ggagaaccac aatgttcaaa taaaccttaa 4740 gaaatgtgca attcaccaac aggaggttga ttttcttggg tttaagctgt cttcagaagg 4800 atggaaagtg gaatcagaaa aatattctgc cattcagaat ttccgaaggc cagttacaat 4860 tgcggaagtg aaaagctttc taggtctcat caactttgtc gaaagattca tccctcaacg 4920 agcagataga actcgaaagt tgcgtgaatt agcgaaatca gagacattgt attggaatga 4980 agaactggaa gaggaattcg aatatttgcg aaacgaagct ttaaataaga taagtaccct 5040 agggtactac agcagacaag atagaacaga gctctatgta gatgcttcac catacggtct 5100 cggtgccgta ttggttcagt tcgacaaaga ttcagttcca cgggttatcg catgcgcttc 5160 aaaggctctt tcaacggcag agcttaaata ccctcaaaca caaaaagaag cactagctat 5220 ggtgtggggt gtggagcgct tctcaatgta tctgatgagc atttcttttg taatacggac 5280 ggatgctgaa tcgaacgaat tcattttcaa tgggatgcac agaatcggca aacgcgcggt 5340 ctctagagcg gaggcatggg cattaagact gcaaccttat aattttcggt aaataatatt 5400 gtttgccaaa gtaggcatat tgataatata tacatataca ttttttcata tcattatcta 5460 tagagttgag cgggtttcgg gcgatatgaa cgtagcagat gctctatcta ggctggtgat 5520 acaatctcaa tccgagacgt tcgacgaagc aaatgataga catcttcttt ttcacattga 5580 cgcgggatct ttggaaattt cttggaacga aatagaaaca ttcgctgaaa gtgatgagga 5640 gttctccaaa gtaagattag caatacagtc aggcttctgg gaacccggtt ttcgtaaata 5700 tgaatcgcag gagaaggaac tccgaatact cggatctatg ttgtttaagg gagaccgtat 5760 tatcttgcct tcagcgctgc gcgataaagc catacaatcc gcccaccgag gccatgttgg 5820 cattggttcc actaagagga ttttacgaga gtatttctgg tggcccggga tgagctcaga 5880 agcagccagt tttgtgaagc aatgtgaaac ctgtttacaa ctttcgcgca aaaacccacc 5940 gatacctctc acaagtcggg aattgccgaa tgggccctgg gaaatccttc aagtggattt 6000 ttatactgac aaagaattcg gtcatggaga attcctcgtt gtagtcgaca cgtactcgcg 6060 ttacttgcac gtagtggaaa tgagaagtat agatgcggat agtacgaata cggctttgaa 6120 tcaaattttt gaagtatggg gattaccctt agctatccag agcgataacg gtccgccttt 6180 tcagagtgaa aagttcgtga gaacgtggga atccaaagga gtacgaattc ggaaatccat 6240 tccgctacat gcgcaatcaa acggtgcagt tgagcgccaa aataagggaa tcaaggatgc 6300 tttagcggcg tcaaagcttg ataacatcaa ctggaaatta gcactagagc aatatatttc 6360 agttcacaac aaggtcaggc cgttgtcgcg actgggagtc acaccgtttg agctcttagt 6420 gggctggaaa tgccgaggcg tgtttccgtg cttgtgggaa acgaactgtt cagaaataga 6480 tcggaatgaa atcagggaaa aagatgcaga ttcgaaatta cagagcaaac agtatgctga 6540 tttcaaacgc ggtgctaagg aatcgagcat caaggttgga gatgtagtta tagtgtcggt 6600 agcaaagcga ttgaagtcag acccgacttt tggatctgag agatttacag ttgtggcaag 6660 gcaaggagct aaaattgttg ttagaagtga cagaggagtt acgttttcga gaaatttaca 6720 agatgtgaag ctagctccgc agcagcccag agaaatttca gatgatagga cgattgatca 6780 gaatgatctg tttgattcaa gaccacaaag atcacggaag ttaccaaatc ggttcaacga 6840 tatgcatttg taccacattt ttcagtaaat aagtaaaagc aaacagtcat gtaaatgaaa 6900 ataaggaaat cagaaagaaa taaatggaag gataaaagtt cataaaattg ttgaagtttc 6960 tttgattttt ttaaaacaaa atatctaaat agataaactt gattgaaata actgaataag 7020 gaaataagag tagaggtagt aggtat 7046 // ID LOA-9_AAe repbase; DNA; INV; 6136 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-9_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6136 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1419-1419 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 559..2211 FT /product="LOA-9_AAe_1p" FT /translation="MSATDAKLMEIDETEGAVDSCSAESVPSCSSSILDTP FT IDGDDDYEDDDDFEDGINVTILSTTTKSLLEDCSTTENPVVKTSVTDPICT FT DDKQQTKLNGAARKRYKRMIDAGVDPEEAYRLARIPCQPPSSEKRSRNDDL FT NGSNSSDGNPRKKKNLCPVLEPKHGLNRPISGKFSVQNRLQINTNARSGRI FT ETIQSGSNPSFSAVVQYVRVGIVPKDYPDVELSSQQLLATRKAILAKVVQQ FT RKETVKPKFGQCSFRTGHLILVCKNQETADWLKSIASTLSIAGEVELVALD FT ENKIPRPEIIIGFFPVSAEDSTDEILALLESQNDGLNTDEWRIKERKIINQ FT LHVELIFTVDGASMDTIKKCEFLLDYKFGTAPLRRKLPPKHKTPDNSDESV FT DKSGDNLKPAEQHTALSGVGENKQTKIPTEHLKDTGDSARGNKSNPNLNRP FT GTSGIGSTGNLGYPKISKEPRCVDVGSENTSQESSGPKALLGSKRYAKNSP FT INHGQTKTVLDNKDPVHKNKYISKYIKSLGNHETNIGGLKSPQESETSKNN FT ASEK" FT CDS 2198..6013 FT /product="LOA-9_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MQVKSNECNNLNKHILRNNITRTNNTSITNTNIDTNT FT TINNQANKQIKFVQINLHHAKGASAVLHKTFTHSKLDIALIQEPWTYNSRV FT LGIXVSSSKLIYDENQTAPRTAILVNSRIXFLPVTEFIKRDIVAILMEIPT FT SRGVAEMYVASAYFPGDAEDIPPHDVSNFVSFCKRKNKPFIIGCDANAHHT FT VWSSTDINDRGENLLEYIINNNIDICNKGCKATFVNSIREEVLDLTLCSAN FT ISENLKNWHVSNETSLSDHRQILFEYQANYVQTEAYRNPRKTNWDLYYSKL FT LTECKLPDKNINTVQELEKVSVXMLXSVQEAFHASCPITVRSTNRDVSWWN FT KRLEILRKNTRKLFNRAKRTREWDQYKIALTEYNKEIRRSKRLDWRHTCES FT IENTPVVARLQKVLSKDHTNGLGTVKKENNCLTSNAKETLEVMMQTHFPDS FT VLISNNDENMLENIIGNETTEMNSHGEHSNNFSTSCCDAKNIAEQIFTVSK FT VDWAINSFDAFKSPGVDGIFPAFLQKSNGILTPYLCNMFKASIILGYIPKS FT WRQIRVIFIPKANKKDKTSPKSFRPISLSSIMLKIMEKLIDQYVKSTYMRK FT YPLNNNQFAYQTGKSTTTALHALVTKIEKTYEAKEILLATFLDIEGAFDNA FT SHRSMTRTMLKREFDICIVQWINEMLSKREISAKLGSSLISVKAVKGCPQG FT GVLSPLLWSLVVDELLASLEAQGFEVIGFADDIVIIVRGKHDNIIASRMQA FT AINYTIKWCESEGLDINPQKTSIIPFTKKRTFKISDLRMRGTLMTLSSNVK FT YLGVILDRKLNWSLHLSNILDKAANALWISKRTFGNKWGLRPAMIHWIYMA FT IVRPRITYASLVWWPKTLEKCAIKKLEKIQRLATISITGAMRSTPTKALDA FT MLNLLPLYQFVQLEAGKAALRLKRTTVFYDGDIKGHLEILKKVLINPLVTV FT NNDWMETTFNFDRRYNVVEPDRSVWDVGGPQIRSGSIVFYTDGSKQNDQVG FT AGVTGPGVCLSVSMGRWPTVFQAEIQAILECASICLKRKYKHSNICIFSDS FT QAALAALKSYTCTSKLVWECTLLLQKLSVNNTIVLYWVPGHQGIQGNEKAD FT ELARVGSSQHLQGPEPFCGVSPCSIKMELRNWEKAMVNTNWNKTVXARQSK FT KFITPNKSNTQKLLALSKKDLCAYTGLITGHCLVKYHLRLIKRIEDDICRF FT CSEETETSEHLLCNCVALFAVRLKYLDQGFLQPSDIWSLAPFKVVRFIRHI FT IPHFENTEIMS" XX SQ Sequence 6136 BP; 2129 A; 1129 C; 1183 G; 1689 T; 6 other; agtttggcaa ccctgccgtg atatagtggc gatttttatc ggctcgtgtt tatcctcgat 60 ttaccgttga aaacgaaaga tttaatcgca agtgcactat tggcatacta tttggtgaac 120 tgattaacga aaagtgtgtt waaaaggtag ggtttgaagt tattttgcga caaaatacgc 180 ttcttatttt ttcctgacag tgagtttttg ttctgatagt gatacttttg tgcacaatac 240 attgatgcta acaatagtaa ataagtgtta tacgttccac cttgaacata ctttccctct 300 ctatcgtcta cttgcatgca agtcgtttct gtgttgtgga gaaaaagaat accaccaaac 360 ttacaataaa tggatacagt ttagtacatg tcttagttcc atcattcttg gctctgattc 420 agaacaaaat ttgttgttct acgagtatca atgtttacat atgatatatg ttgtttcact 480 actggcagtg gaatttcttt gatccaatgt acttagattt cacaactctg tttaaaacac 540 tatgctttca gcaacagaat gagtgctacc gatgcaaaac tgatggagat tgatgaaaca 600 gagggagcag tcgatagttg ctcagctgaa agtgttccat catgctcatc atctatctta 660 gatactccga ttgatggtga cgatgactat gaagacgacg acgacttcga agatggtatc 720 aacgtcacca tattgtcaac tactaccaaa agccttctcg aagattgttc gacaactgaa 780 aatccagttg ttaagactag tgtgacagac ccaatttgta cggatgacaa acagcaaacg 840 aagttgaatg gagctgcacg gaagcggtat aagcgaatga ttgatgctgg agttgatcca 900 gaagaagcat atcgcctcgc tcgaattccg tgtcagccgc cgagttctga aaagcggtca 960 cgtaatgacg atcttaacgg ttctaacagt agtgatggaa atcctcgtaa aaagaaaaat 1020 ttgtgtcctg ttcttgaacc aaaacacgga ttaaaccgcc caatctcggg aaaattttcg 1080 gttcaaaatc gtttacaaat aaatacaaat gcacgctctg gacgaattga aacgatacag 1140 agcggaagca atccatcttt tagtgccgtt gtacaatacg ttagggtagg aatcgttccg 1200 aaggactacc ccgacgtaga actttcttcc cagcaattgc tagcaacacg aaaagctatt 1260 ttggcaaagg ttgtacagca acgcaaagaa acggtgaaac ctaaatttgg tcaatgttca 1320 ttcaggaccg ggcatttaat tctagtctgc aaaaaccagg aaacagctga ctggttgaaa 1380 agtatagctt ccacactatc tattgctgga gaagtagaac ttgttgcttt agacgagaat 1440 aaaattcctc gtcctgaaat aattatcggg tttttcccag tgagtgctga agacagcact 1500 gatgagattc tagcgctcct ggaaagccaa aatgacggtt tgaatactga cgaatggaga 1560 ataaaggaga gaaaaattat caatcaactg catgttgaat taatttttac agtagatgga 1620 gcttcaatgg acacgatcaa gaagtgtgaa ttccttcttg actataagtt cggaacagct 1680 ccacttcgcc gtaaacttcc tccaaaacac aaaacgcccg ataactcgga cgaaagtgtt 1740 gacaaatctg gtgataattt gaaaccagca gagcagcaca ctgctctgtc tggggtagga 1800 gaaaataagc agacgaagat ccctacagaa cacttaaagg atactggaga ctctgcacga 1860 ggaaataaat cgaatccaaa tctaaaccga cctggtacta gtgggattgg gagtacaggt 1920 aatctgggct acccaaaaat ctctaaagaa cctcgctgcg ttgatgtagg atcggaaaac 1980 acttcccaag agtcttcagg acctaaagct ctattgggat cgaaacgata tgctaaaaat 2040 tcccctatca atcatggtca aacaaaaaca gttttagaca acaaagaccc tgtacataaa 2100 aacaaatata tttctaaata cataaaaagt ttgggtaatc acgaaaccaa catagggggg 2160 ctaaaatcac cccaagaaag tgaaacttct aagaataatg caagtgaaaa gtaatgaatg 2220 caataatttg aataaacata ttttgaggaa taacattacg cgaaccaata atactagtat 2280 cacaaatacc aacattgata ctaacacaac catcaataat caagcaaata agcagattaa 2340 atttgttcaa ataaacttac accatgcgaa aggcgcatct gctgtactac acaaaacgtt 2400 cactcacagc aaactggaca ttgcactgat tcaggagccg tggacgtata atagtagagt 2460 attaggcata scagtatctt ctagcaaact tatttatgat gaaaatcaaa ctgccccccg 2520 aactgctatt ctagtcaata gtaggattaa ktttttgccg gtaacagagt tcatcaaaag 2580 agatattgtt gcaattttaa tggaaattcc tacttctcgt ggagtagcgg aaatgtatgt 2640 tgcatccgct tactttcctg gagatgcaga agacattcct ccccatgacg tttcaaattt 2700 tgtttctttt tgcaaacgta aaaacaagcc cttcataata ggatgtgatg cgaatgctca 2760 tcacacagtt tggagtagca ccgatataaa tgataggggg gaaaacctcc tagaatacat 2820 aatcaacaat aacattgaca tttgtaacaa aggatgtaaa gcaacctttg ttaattccat 2880 cagggaagaa gttttggatt tgacactttg tagtgcaaac atctcagaaa atttaaaaaa 2940 ctggcatgta tcaaacgaaa cttccctgtc tgaccacaga caaatattgt ttgaatatca 3000 agcaaattat gttcaaactg aagcttatcg caaccctcgt aaaacaaatt gggatttata 3060 ttactcaaaa cttttgactg agtgtaaact tcctgacaag aacataaaca cagttcaaga 3120 acttgaaaaa gtttcggtas tcatgttaca wtcagtccaa gaggcattcc atgcgagttg 3180 cccaatcact gtacgttcca caaacagaga tgtttcgtgg tggaataaac gcctagaaat 3240 tcttagaaaa aatactagaa aactattcaa tagagcaaag aggaccaggg aatgggacca 3300 atataaaatt gctctgactg aatataataa agaaataaga agatctaaaa gactggactg 3360 gagacataca tgtgaaagca ttgaaaacac tcctgtagtt gcaagactac aaaaagtcct 3420 atcgaaagat catacaaatg gtttaggtac tgtaaagaaa gaaaacaact gcctaacatc 3480 gaacgcaaaa gaaacgttgg aagttatgat gcagacacat tttcctgatt ctgtactgat 3540 tagtaataat gatgaaaaca tgttagaaaa cattatcggt aatgaaacaa ccgagatgaa 3600 cagtcatggt gaacacagta ataacttctc gacctcttgt tgtgacgcaa aaaatatagc 3660 tgaacaaatt tttactgtat ctaaggttga ttgggcaatc aattcatttg atgcattcaa 3720 atctcctggt gtagatggaa tatttccagc atttttgcaa aaaagtaatg gtatccttac 3780 cccatatctt tgtaatatgt ttaaagcaag tattatactt ggatacatac ccaaatcatg 3840 gcgacagatt cgagtgattt ttattccgaa ggctaacaaa aaagataaga cttctcccaa 3900 atcattcaga ccaataagtt tgtcatctat aatgctcaaa attatggaaa aacttattga 3960 tcaatatgtt aaatctactt atatgagaaa gtatcctcta aataataatc aatttgctta 4020 tcaaactggt aaatcaacta caactgcgtt acacgcactt gtaactaaaa tagaaaaaac 4080 gtatgaagca aaagaaattt tactagcaac ttttctagat attgaaggag cgtttgataa 4140 tgcgtctcat agatccatga cgaggacaat gttaaagcga gagtttgaca tatgtatcgt 4200 tcaatggatt aatgaaatgc tatcaaaacg tgaaatatca gcaaaactcg gtagttcatt 4260 aatatctgtt aaagcagtga aaggatgtcc acaaggaggc gttctttcgc ctcttctctg 4320 gtcacttgtt gttgatgaat tactggcaag tttagaagca cagggatttg aagtaattgg 4380 gtttgcagat gacattgtta ttatagtcag aggaaaacat gacaatatca ttgcaagcag 4440 aatgcaagca gctataaact acacaattaa atggtgcgaa tctgaaggac tggacatcaa 4500 tcctcaaaaa acaagcatca ttccgtttac taagaaaaga acattcaaga tatctgatct 4560 tcgtatgaga gggactttaa tgacactttc atcgaatgtt aaatatctgg gagtcatttt 4620 agatcgtaaa ttaaactgga gtctacatct aagtaacata ctcgataagg cagcaaatgc 4680 actatggata agtaaaagaa cttttggaaa taaatgggga ttgcggccag ctatgatcca 4740 ctggatatac atggctattg tgagacccag aattacgtat gcttcattag tatggtggcc 4800 gaagactttg gaaaagtgtg caataaaaaa attagaaaaa atacaaaggc tggctactat 4860 atccataaca ggcgcaatgc gcagcacacc tacaaaagca ttagatgcaa tgcttaatct 4920 actccctctg tatcaattcg ttcaactaga agctggaaaa gccgcactac ggcttaaacg 4980 aactacggtt ttttatgatg gtgacataaa aggtcatttg gaaattttga aaaaagttct 5040 aattaatcca ttagttacag tcaacaatga ctggatggaa acgacattta attttgatcg 5100 taggtacaat gtagtagagc cggatcgctc tgtctgggat gtgggaggac cacaaattcg 5160 ctcaggttca atagtcttct atactgacgg ttccaaacaa aacgatcaag tgggagctgg 5220 tgttacaggt cccggtgtat gtttatcagt ctcaatgggt agatggccaa ccgtcttcca 5280 ggctgaaata caagctatac tagaatgtgc ttctatttgc ttaaaaagaa aatacaagca 5340 ttccaacata tgcatttttt ctgatagcca agcagcgctc gctgctttga aatcatacac 5400 atgtacttcg aagttggttt gggaatgtac tctattactt caaaaattat cggtaaataa 5460 tactatagtt ctttactggg tgcctgggca tcagggaata caaggaaatg aaaaagcaga 5520 tgagctggca agagttggat catctcaaca tttacaaggg ccagagcctt tctgtggagt 5580 ctcaccatgc tccattaaaa tggagctcag gaactgggaa aaagccatgg taaataccaa 5640 ctggaataaa actgttgwag cccgccaatc taaaaagttt ataacaccca ataaatcaaa 5700 tacacaaaaa cttctcgccc tgtcaaaaaa agacctgtgt gcatacacag gactaattac 5760 aggacattgt ttagttaaat atcatttacg actaatcaaa agaattgaag atgatatttg 5820 tcgcttttgt agtgaggaaa cagaaacctc tgaacacctt ctgtgtaact gcgttgcact 5880 ttttgctgtg agattaaagt atcttgatca agggtttcta cagccctctg atatatggtc 5940 cttagctcca ttcaaagtag ttcggtttat acgacatatt attccccatt ttgaaaatac 6000 cgaaatcatg agttgaggat aactaatcat agtaatctat caccttgaat acggttatac 6060 attaaaatgg gtggtgtatc acaatagatc aaacaaatgg tcgcagtgat tatacaccca 6120 acacaggaaa aaaaaa 6136 // ID HARB-1_AP repbase; DNA; INV; 2390 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; HARB-1_AP. XX NM HARB-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2390 RA Jurka J. and Baney O.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1349-1349 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..684 FT /product="HARB-1_AP_1p" FT /translation="PNCLGAIDGKHXRCKNPXNSGSXFFNYKKYFSXVXMA FT VVDANLNFIXIDVGAYGREGDSNVFKECPFGKKLYAEXLNLPEPXXLPNTN FT NXPQPXVFVGDEAFALHTNLLRPYPGRGLNDXRRVFNYRLSRARRTVECAF FT GVLANKWRVLHTPIXVEPDFADXIVKACCXLHNFVRKRDGINFEDTEANXL FT DDIEARGAGARSQGIXVRDYFAXYFMGPGAVPFQYKVL" XX SQ Sequence 2390 BP; 753 A; 387 C; 387 G; 757 T; 106 other; ccgaattgtc tnggngcnat tgacggnaaa catatncgat gtaaaaatcc naanaattcn 60 ggntcattnt ttttcaatta caaaaagtat ttttctatng tattnatggc tgtagtcgac 120 gcaaatttaa attttattnc natcgatgtc ggngcntacg gtcgagaagg ngattcnaat 180 gtntttaaag aatgtccatt cggaaaaaaa ttatatgccg aanaacttaa tcttccggaa 240 ccaantnntt taccaaatac aaacaacanc ccacagccnt ncgtttttgt tggngacgaa 300 gcnttcgcnc tacatacaaa tttattaaga ccgtatccgg gacgtggnct caacgatann 360 cgacgngtat ttaactatag gttgtcnaga gcaaggcgna cggtcgaatg tgcnttcgga 420 gtactngcna ataaatggcg ngtnttacat acnccnatac nagtngaacc cgatttcgcn 480 gacgnnatcg tnaaagcntg ttgtatncta cacaattttg tncgnaaaag agacggnatc 540 aattttgaag atactgaagc caatancttg gatgatattg aagctcgcgg agcaggagct 600 cgttcncaag gnattgangt nagagattac tttgccgant attttatggg cccnggagcn 660 gtaccttttc agtacaaagt actttaattt atttacttaa ttataatttg taantantac 720 ttattttttt tttatataaa acattataat attttttatt tgtatttata tttagatttn 780 tgtaccgaaa ttattatact aaatatttta atatnattat tataataatt aaataatata 840 taacgtaata tatatctact tacctttatt ttctttttcn gtatttgatt tttcagtcca 900 atctncatac atgctttccc cgatattaag ccaacttttt tntttnacga actttgtcca 960 tgtattcttt tgancccatt tcccaaatag acgggttatt ttgaatacac gatatgaatt 1020 tatcggtatc gaaaaccatt ttaaaataaa atgtttcaat gtttataaat atntaaaaat 1080 aaataattaa ttcaaattaa taatttaaca aaaatatata aaattaaaat attcaaaaaa 1140 cgcagcggcc aaaacgcaca gtgtggcaca cgcgatttgg cgggaancaa cccggcggga 1200 agtatnanca aaacgcatta tttcgtatcg tgcgtgccgc actgtgcgtt tcggccgctg 1260 cgttttttag antattttaa ttttatatat ttttaaaata tttaaanaaa aaatggtttt 1320 tgataccgat ggnttctgat accataaatt attatcgtgt attcaaagta acccgtctat 1380 ttgggaaatg ggtactaaag aatacatgga taagttcgtg aagcaaaaaa gttggntcaa 1440 tatcggggaa agcatgtatg cagaattgga cagaaaaanc ggaacctgaa aaagaaaata 1500 caggtaagta gaatattatg ttatatattt ttaaattttc agcatagtat ttacaatatt 1560 attagtaatt atttttaatt tctattgggt atacgacttt taatctctat caaatacaaa 1620 aaaaaagact aaatctatta cattttaatt tattattant aatttacaaa taataaagta 1680 ataagatctt tttaaataag cagcttataa tactttatac tgaaaatcta cggcnccngg 1740 gcccataaaa tagtcggcga agtaatccct cacntcaacg ccttccgaac gagctcctgt 1800 tccgcgagct tcnatgtcat ccaggctatg ggtttcagca tcttcaaaat taattccgtc 1860 tctcttacgg acaaaattat gtaanataca acacgctttg acgatancgt cagcgaaatc 1920 gggttccact tgtatnggag tatgtaacac acgccattta ttngcgagna ctccgaangc 1980 gcattcgacc gtncgccttg ctctngacaa tcnatagtta aatacncgtc gagtatcggt 2040 naggttacgt cccggatacg gtcttaataa atttgtatgt agagcgaaag cttcgtcccc 2100 aacaaaaacg nanggctgtg gattgttgtt tgtgtttggt aaaagatttg gttccggaag 2160 attnagtttt tcggcatata anattttacc gaatggacat tctttaaana cattggaatc 2220 cccttctcta ccgtaagacc cgacatcgat tgnaataaan tgtaaatttg cgtcgactac 2280 agccatnaat accatagaaa aaactttttg taattgaaaa acaatgatcc ngaattgtta 2340 ggatttttac atcgnatatg tttnccgtca atagcgccta ggcaattcgg 2390 // ID L1-2_CQ repbase; DNA; INV; 4917 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4917 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 132-132 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 185..1606 FT /product="L1-2_CQ_1p" FT /translation="MAQVRSRENTFKVDLSNFPKRPSIEELQKFVLVKLGL FT AVGQVKRFQVNHAQNCVHVKCSELKVAQDTVAMHNGKHVMEINKTKVTVRL FT VMEDGGVEVKIHDLSENVTNDDIVAFLRHFGDVLSIREVSWGEGFPLRGMS FT SGIRVAKMILRRHIKSFATILGESTLISYRGQPMTCRHCTLTQHIGMSCVE FT NKKLLGQKADLSARLKSASGSTSTSYASVLNGVAPDTNTLLPEFSGTILGP FT VIPQRPIRPESTGTAPNAPQSGAATSATGHSADGSLNISGDGTSSVQTSGS FT TDVAEHSAGSEASGGGTSSEVSNTLASETEVSAGSVATAIEGGALIDRPNE FT SELASTSDMILIPLPPAAPAAEGDNKETEEMETESPSAADDAAELNTPHPK FT LPCEEEDAEMPAVSDLASCESVSPFEFPAPIPVPIPHPDLKKTHSLSSISG FT TESEGSARSGEFTEVKKKRRPGRPKKPKV" FT CDS 1639..4833 FT /product="L1-2_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNLPCSYNIATININAISNDNKIQSLKTFIRSTDLDI FT ILLQEVENPNLALPGFNVLTNVDDSRRGTAIALKSHIRFSNVRRSLDTRII FT AVTLRSMVTICNVYAPSGTVNAASRERLFNNTLPLFLQNCSDHCILGGDFN FT CVIASKDATGTSNHSLALKNLTSNLGLKDAWEVLKANQIAYSFIRPNCSSR FT LDRLYVSGAFVPGLRTAEYLVTSFTDHKAFKVRCCLPDQGRPTGNGFWSMR FT AHVLTEENLEEFELKWNRWLREKQNFNSWIEWWIRCAKPRIKSFFRWKTNE FT VFRRFNVANELLYGQLRAAYDNLTNNPAMVPEINRIKGKMLLLQNNFSKAF FT ERLNDKFLGDEKLSTFQLGDRAGRKKNSTISSIKHQGRLLEDSAEVEDHIF FT DYFRQLYSAEDVQLNRDFPSNRLVDPNSEANNRAMEEITTEEIFTAIQTSA FT SRKSPGCDGIPKEFYVKAFNIIHRQLNLILNEALRGGFPEQFLDGVVVLSK FT KKTNLETIKAYRPITLLNYDYKLLARILKVRLERIINENRILNSCQKCSNA FT RRDIFEAVHAIKDRVVELNCKRKTGKLISFDLDHAFDRVDHTYLLSVLRSM FT RINENLVALIGRLMCSARSRIIINGNLSAQFPIQRSVRQGDPLSMLLFVVF FT LHPLLEKLLAICNHPKELVVAYADDISIILVDETKLAEVRRAFDDFGLCSG FT ARLNIAKTTAVNIGPQRENSTGAGWTTVCDSVKILGVIYFNSLKRTIEENW FT RETIRKTSYLMWLFKPRDLSIHQKVTLLNLFVTSKLWYMASVLSMPNACIA FT KVTSQIGQFIWGRYPTRVSIDQLALPVAEGGLNLHVPKMKSKALILNRFIR FT GSSETPFAASFVSQLDNPPNLSGIPALYPCLKAIARELPYLPQRLIGQPTA FT TELHGYFRRTLTTPKIMNEHPAVAWRSVWRNIRSRVLTSAEKTSFYLLVNR FT KIPHAELLFRQNRLDSGTCQHCLGAVEDLEHKFAKCTRVRHLWNFIQSKLG FT AILNRRVSFNLFLIPELNNIEANSKSRALKLFIFYINFVLDPDCSFTLASL FT DFAVSLVC" XX SQ Sequence 4917 BP; 1316 A; 1268 C; 1230 G; 1103 T; 0 other; tagtttgcgt caggctctcg acgcaataag acgtgtgcat cggagctctt ctgcgcgctt 60 tgggttatcg cgtgtgtcgg ccatctgcac cacttgctgg tgcagatttc gcgtgtttaa 120 gcgtcagttt tcaactgacg accggtttta atttgcgatc cggattacat ccgaactcgc 180 aacaatggct caagtgagat cgcgtgaaaa caccttcaag gtggaccttt cgaactttcc 240 gaaacgaccg tccattgagg aactgcagaa gtttgtgcta gtgaaactag gactggccgt 300 tggccaagtg aaaaggttcc aggtgaacca tgcgcagaat tgcgtacatg tgaagtgtag 360 tgagctgaag gtggcacaag acaccgttgc tatgcacaat ggcaaacacg tgatggagat 420 caacaagacc aaggtcacgg tgcgcttggt gatggaggac gggggagtgg aggtaaagat 480 ccatgacctc tctgaaaacg tcacgaacga cgacattgtt gcgttcctgc gccattttgg 540 cgacgtgctc agcattcgcg aggtgtcatg gggagagggt ttccctctcc gtggcatgtc 600 atcaggcatc cgggtggcga agatgatcct tcgccgccat attaaatcgt tcgcaacgat 660 tctaggagag agcacactaa tctcctacag agggcagccg atgacctgta ggcactgcac 720 gcttacccaa cacattggta tgtcgtgcgt ggagaacaaa aaacttctcg gtcaaaaagc 780 cgatctcagc gcccgcctga aaagtgcgag cggatcaacc tcaactagct acgctagtgt 840 gctgaacggt gttgcccctg acaccaacac attgctgcca gagttcagtg gcaccatcct 900 cggtccggtc atcccgcaac gaccgataag acccgagagc acgggcacgg cacccaacgc 960 ccctcagagc ggagcggcaa cgagcgctac cggacactca gctgatggat cgcttaacat 1020 cagcggagac ggtacatcgt ctgttcagac aagcggctcg acggacgttg ccgaacactc 1080 agctggctct gaagctagcg gaggcggcac gtcgtccgag gtttcgaaca ccctcgcaag 1140 cgagacagag gtctcagctg gatcagttgc gaccgccatc gaaggtggtg cgttgatcga 1200 tcgccctaac gagtctgagc tcgccagcac aagcgacatg atactcattc cactcccccc 1260 tgctgcacct gctgctgaag gagacaacaa agaaaccgaa gaaatggaga ccgaaagccc 1320 gtctgctgcc gacgatgctg cagaactcaa cactccgcac ccgaagctgc cgtgtgagga 1380 agaggacgca gaaatgccgg ccgtgtcgga cctcgctagc tgtgagtcgg tgagtccttt 1440 cgaattccct gcacctatcc ctgtccctat ccctcatcct gacctcaaga agacgcactc 1500 gttgagttcc atctctggaa ctgagagcga gggaagtgcg cgctctggtg agttcactga 1560 ggtgaagaag aagcgacgac cgggtcgtcc aaaaaagcca aaagtttagt cacctagcac 1620 cctgcggctc ccactgcaat gaatctgcct tgctcctata acatcgccac catcaatatt 1680 aatgccattt cgaacgacaa caagatccaa tccctgaaaa cctttattcg ctcaactgac 1740 cttgacatta tactgcttca agaagttgaa aacccaaacc tagctctccc agggttcaac 1800 gttttaacga acgtggacga cagtaggaga ggaacagcaa tagctttgaa atcgcacatc 1860 cgtttctcga acgttcgacg cagcctcgac acgcgtatca ttgccgttac cctgcgaagc 1920 atggtgacca tttgcaacgt atacgccccc tccggaaccg tgaacgccgc atcccgtgaa 1980 cgcctgttca acaatacgct tccccttttc ctccaaaact gctcagacca ctgcattctt 2040 ggcggtgact tcaactgcgt gattgcttcg aaggacgcaa ctggtaccag caaccacagc 2100 ttggcgctta aaaacctaac cagcaacctg ggacttaaag acgcgtggga agttctgaaa 2160 gctaatcaga tcgcgtacag cttcatcagg ccgaactgct cctcccggct tgaccgattg 2220 tacgtttccg gtgcatttgt accgggcttg cgcacggcgg agtatctcgt gacgtcgttc 2280 acggaccaca aggcgttcaa agtgcggtgc tgtcttcccg atcaaggacg gccgaccgga 2340 aatggtttct ggtcgatgcg tgcgcatgtg ttgactgaag agaacctcga ggaattcgag 2400 ctcaaatgga atcgctggct acgagagaag cagaacttca acagctggat cgagtggtgg 2460 attcgctgtg ccaaacccag aataaagagc ttcttccgct ggaagacaaa cgaagttttc 2520 cgccggttca acgttgcgaa cgaactactg tatggacaac tccgagcagc ctacgacaac 2580 ttgaccaaca atccagcgat ggttcccgaa atcaaccgta taaaaggaaa gatgcttctg 2640 ctacaaaata atttctcgaa agccttcgag cgcctgaacg acaagtttct gggcgacgaa 2700 aagctgtcga cattccagct aggtgaccgg gccggtcgca agaagaacag cacaatctct 2760 tctatcaaac accaaggaag acttctggaa gactcagcgg aagtcgaaga tcacatcttt 2820 gactatttcc gccagctgta cagtgctgaa gatgttcagc tcaaccggga ctttccgagt 2880 aatcgactag tggacccgaa ctcggaggca aacaaccgtg cgatggaaga aattacaacg 2940 gaagaaatct tcactgctat ccagacgagt gcttcccgga agtccccagg atgtgatggc 3000 atcccgaagg agttttatgt caaggcgttc aacattattc atcgacagct caacctgatc 3060 ctgaacgagg cgctgcgggg aggtttcccc gaacagttcc tggacggtgt agtggtcttg 3120 agcaagaaga aaacgaacct ggaaaccatc aaagcgtaca ggcccattac cctgctgaac 3180 tacgactaca aactgctggc gcgcattctg aaggtgcgac tggaacgaat aataaacgag 3240 aatcgaatcc tcaactcgtg tcagaagtgc tccaacgcaa gacgcgatat cttcgaggct 3300 gttcatgcga taaaggacag ggttgtggag ttgaactgta agcgaaaaac tgggaagcta 3360 atctcgttcg atctggacca tgcgttcgac cgggttgacc atacgtacct gctaagcgtg 3420 ttgcgcagca tgcgcatcaa cgaaaatctg gtggctttga ttggtaggct gatgtgctcc 3480 gcccgatcac ggataatcat caatgggaat ctctccgctc aatttcccat tcagaggtct 3540 gtgcgccaag gtgatcctct tagcatgctg ctctttgtcg ttttccttca tcctctcctc 3600 gagaaacttc ttgctatttg taaccatccc aaagagctag tggtggccta cgcggatgac 3660 atttccatca ttttggtgga cgaaacgaag ctggctgaag tcaggcgggc tttcgacgac 3720 ttcggactgt gctcgggggc caggctgaac atagcgaaaa caactgcagt gaacattgga 3780 ccgcagcggg agaactcaac aggagcggga tggacgacgg tttgcgactc tgtgaagatc 3840 ttgggagtta tctacttcaa ctcgctgaaa cgaacgatcg aggagaactg gagagaaacc 3900 atccggaaga cgtcctacct catgtggctt ttcaagccac gtgatctctc catccatcaa 3960 aaggtcactc ttttgaattt gtttgtgacg tctaagttgt ggtatatggc atcagtgctt 4020 agcatgccaa acgcttgcat agccaaagtc acgtcccaga ttggccagtt tatctgggga 4080 agatatccga ctcgggtatc cattgaccag ctagctctcc cggtagcaga gggaggacta 4140 aacctgcacg tgccaaagat gaaaagcaaa gctttgattc taaaccgctt tatcagagga 4200 tctagcgaaa cgccattcgc agcttccttc gtgagtcagc tggacaaccc accaaacctt 4260 agtgggattc cggcgctgta cccgtgtctg aaagccatag ctcgagaact gccatactta 4320 ccgcaacggc tgattggaca accaacggca acagaactcc acgggtactt caggagaaca 4380 ctaaccacgc caaaaatcat gaacgagcac ccggctgtag cgtggaggtc agtctggcgc 4440 aacatcagga gcagagtact cacatctgcg gagaagactt ctttctattt gctggtcaac 4500 cggaaaatac cacacgctga attgctgttc cgacagaaca gactggacag tggaacgtgt 4560 caacattgtc tcggtgccgt ggaagatctg gagcacaaat tcgcgaagtg tactcgagtg 4620 agacatctct ggaacttcat tcaatcaaaa ttaggagcaa ttttgaatcg aagagttagt 4680 ttcaatttgt ttttgatacc agagctcaac aatattgagg ctaacagtaa aagcagggcg 4740 ctaaagcttt ttattttcta tataaatttt gtgttagatc cagattgttc ttttactcta 4800 gcatcactag atttcgcagt tagtttagtt tgttaaagag ttctttaaaa actacaaaaa 4860 catgtacatt taaacaaaat gatctaaata aacgttttta caaaaaaaaa aaaaaaa 4917 // ID Gypsy-97_CQ-I repbase; DNA; INV; 7137 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-97_CQ_; KW Gypsy-97_CQ-LTR; Gypsy-97_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7137 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 573-573 (2011). XX DR [2] (Consensus) XX CC Positions [4675-5148] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 404..1438 FT /product="Gypsy-97_CQ-I_2p" FT /translation="MDNLKKLDDYFAKEMNFLSNNFPNEQEDIVLKMKEAK FT VTVNSYKDKLIELREISIPKEYDKIVNNFNETVKSFQLTLECFASKLNQEE FT RLIDLKLNVMEDTEFPMDSALKCIPDFYGKSEDLNSFLDQINYFYNKIPKG FT VSLTPLINVIQLKLKGKAKPFTKTILKLTWEEIEKNLLHEFGDKKSSLNIF FT KRIDTLSQYEHETFKAYKNRALEILFDLEIDKNVDSFAMENLRIHFIAGLK FT NTYLQQTARNLTIQNFRDFLSILEEKRISNEEFENLYKDKQRLNSQNIKNE FT FSNQKQDVNDNRNFSTHQNRLNNYQNQYNYSNKRYHQNRFRNNFKQFYTQR FT KN" FT CDS 1885..5502 FT /product="Gypsy-97_CQ-I_1p" FT /translation="MKNLSVPGIIGAEFLKKHTLFIDKNFKFIVLQKPKIY FT FKSNNVESNQIYKKNSFENNESSVTNYQNDTSGFQKDQEVKYEGEILNNVA FT FCENIVRSDETSDGFENKLDDQSCMPTELNLDIDKDGDVTEFKNLKSIKGR FT ERIQKIIDLVNLEHLNKHNFEEIVSMIEKFNKLFFIEGDELTFTDAAIHEI FT ETTTNIPINKRQYRMPEATKVHIDEQIDEMLKLGIIKPSKSPWNAPVLCIP FT KKVGADGKQKYRIVVDFRSLNMITKPFVFPIPLINEILDNIGDAQYFSSID FT LKSGFYQIPINPKDAAKTAFSTSKGHYEFLRMPMGLRNSPATFQKLMNIIL FT YSIQPIKAFVYLDDIIVFGKTIEEHNENLFKILEALYQNNLKVETSKCTIL FT KTQINYLGHTIDKHGIMPTDDNIKTIKELKRPQNVKDVRSFLGTINFYGKF FT IPNMADKRKHLNNLLKKDVKFKWTEECEKAFHDLKHCLISEPILVRPNYKD FT TFVITTDASDYAVGAVLSNAKTNDNPIAYASRALSVSEKKYHVIEKELLAI FT VWAVEYFKHYIFNQQFIVYTDHRPLIAIWRLKETSPTLSKLRLKIQGIGCD FT IRYKQGKENVVADFLSRIKHEENSNANIENVSESSNNNLIAVVTRQQTRQN FT SNLIDNSKPVRNNQSLLQQNSNYTTDNLSDNDDDLDITIDSLQAIDIMDKK FT DDDSWKKFTLDDFFDAQNKEEINFSLLKFTKTMINVNNTDANFVIINSKTA FT FKELSELVDLPHGMKDNINGRIFSFPDNKLWGFILNGSKRSITNREELFEG FT LLESFVKCPDFAKSAKNIQIISFRNLQQLPDVNILRFMAGKFDKMFTLYAS FT NEDRMFVKVEDRETVLKEFHDAPLGGHVGGKRMLQRISPLFTWENMRRDIE FT NYVRQCELCQKNKIWPKNKIPMKITTTSTEPFQKVFMDIVILTPSDDNNRY FT GLVIQDDLTRFLTVAVLPDQESSTVAKAFVEHFICRYGAPLEVVTDQGTNF FT MSNLLKNVCKILKIKKINTSAYHPQANLVERSNRELKIYLRNFIVGNPQTW FT DQLIPFFTFQYNTTINSSTGFTPFELLYGRLARIPNTVYNITDRELNYDDY FT IKEMKANFKNVHDKAVENVIASKHKRKEIYDKDAKEWQPWWGDLVIVETIP FT TGVGKKLQSLWRGPYEVVDLPSEQTTVIKNGNKLEKIHNNRLRKFND" XX SQ Sequence 7137 BP; 2803 A; 993 C; 1211 G; 2130 T; 0 other; aagtggtgac agcagaaaag tttaaagaaa attaaacagg aacgcttttg ctgactagtg 60 aaaaaaccag tgtcgaaccg ctagccaatg ccaaaggaaa catttagtga agagtgaaga 120 tggcagttca aacgccagcc attatgattg aaaaaatgtg acaatccgcc agccaaaccc 180 atggaaagca acttcagttt tgtgacgatt gaagctttaa aatcacgatt atcgaaatat 240 ttttcatcga accgccagcc atggagaaca ttcagaaaac cagcaaagaa aaatgggtaa 300 gttgtaatat tttactattt ttgtaattaa attgaagtgg ataattttgt tcatgggcaa 360 ttaaatgtta ttttgaaaag ggaaattgtt aaattctttc aacatggata atttaaagaa 420 actagatgat tattttgcta aggaaatgaa ttttctttcc aataattttc ctaatgaaca 480 ggaagacatt gttttaaaaa tgaaagaagc aaaagtgaca gtcaattctt acaaagataa 540 attgattgag ttacgagaga tttcaattcc caaagaatat gacaagattg tgaacaattt 600 taatgaaact gtcaaatcat ttcagttaac tcttgaatgt tttgcaagta aactaaatca 660 agaagagcgt ttgatagatt taaaattgaa tgtgatggag gataccgagt ttccaatgga 720 ttctgcgctt aagtgtattc cagattttta tggtaaatca gaagatttaa attctttcct 780 tgatcaaatt aattattttt acaataaaat tccaaaaggc gtttctctta caccattgat 840 taatgtaata caactgaaat taaaaggtaa agctaaacca tttactaaaa ctattttaaa 900 acttacttgg gaagaaattg aaaagaattt gttacatgaa tttggtgata aaaagtcttc 960 tttaaatatt ttcaaacgga tcgatactct ttcgcagtac gaacatgaaa cttttaaagc 1020 atacaagaac agggcactag aaattttatt cgatttagaa atcgataaaa atgttgatag 1080 ctttgcaatg gaaaatctaa gaattcattt tattgcggga ctgaagaata cttatttgca 1140 acaaactgca agaaatttaa caatacaaaa ctttagagat tttttgagta ttctagaaga 1200 aaaaagaatt agtaatgagg aatttgaaaa tttgtacaag gacaaacaaa gattaaattc 1260 acaaaacatt aaaaatgaat tttcaaatca aaagcaagac gtgaatgata atcgtaactt 1320 ttcaacccat caaaatcgat tgaataacta tcaaaatcaa tataactact caaacaaaag 1380 atatcatcaa aaccgattca ggaataattt taaacaattt tatacacagc gaaaaaactg 1440 aatgatgggg ttttcgatca ttttggcgtc cgaaacaccc gacaaaatgc gaatcaatat 1500 aacagacgct acaattatcg tgctcctagg tattatagaa atcgtaacaa tatttttcgt 1560 aataatttta gtttgacttc tcagagagaa ttctacaatc ctcctcatga agtactaaga 1620 ataattttaa ctccttatct tcaatttaga ttaagaattt catctaaaca aaatccattt 1680 cattctattt actatttact tgatacaggt gctagtgtca atgtaataag tagtaatatt 1740 ttgaatcaaa taggttacac tcatgttaat tttaaggaca aaattacttt aactggcatt 1800 ggtaatacat taactgaaac gataggatcg gtggttattg atttattaat cggtaaaaca 1860 atttataaca caaaattcta tgttatgaag aatttgtctg tgcctggtat aattggagca 1920 gaatttttaa agaagcacac tctcttcatc gataaaaact ttaagttcat tgtattacaa 1980 aagccaaaaa tatattttaa atcgaataat gttgaatcaa atcaaatcta taagaaaaat 2040 agttttgaaa ataatgaatc aagcgttact aactatcaaa atgataccag tggttttcaa 2100 aaagaccaag aagtcaagta tgaaggagaa atactaaata atgttgcgtt ttgcgaaaat 2160 attgtacgct ctgatgaaac aagtgatgga tttgaaaata aattagatga tcaatcttgt 2220 atgccaacag aattaaattt agacattgat aaagatggtg atgtaacgga attcaaaaat 2280 ttaaaaagca ttaaaggtag ggaaagaatt cagaaaataa ttgatttagt aaatctagaa 2340 catttgaata aacataattt tgaggaaatt gtttctatga ttgaaaaatt taacaaatta 2400 ttttttattg aaggagatga attgactttt acggatgcag caattcatga aatagaaacc 2460 acaacaaata ttcccattaa taagaggcaa tataggatgc ctgaggcaac aaaagtgcac 2520 attgatgagc aaattgatga aatgctcaaa ctgggtatta ttaaacctag taaaagcccg 2580 tggaatgctc ctgttttatg tataccaaaa aaggttggag ctgacggcaa acagaaatat 2640 agaattgttg tagatttcag atcactaaac atgataacaa aaccttttgt tttcccgatt 2700 ccattgatta atgaaatttt ggataacatt ggcgatgcac aatacttttc gtcaattgat 2760 ttaaagtcag gtttttatca gatacctata aaccccaaag atgcagctaa aactgcattt 2820 tcaacttcaa aaggacatta tgaattttta agaatgccta tgggtcttag aaatagccca 2880 gcaacatttc aaaaattaat gaatattatt ttgtactcga ttcaaccaat taaagcattt 2940 gtttatctcg atgatattat tgtatttgga aaaactattg aagaacataa tgaaaactta 3000 ttcaagattt tagaagcatt ataccaaaat aatttgaagg tagaaacatc aaaatgtacc 3060 attctgaaga cacagattaa ttacctgggt catacgattg ataagcatgg aattatgcct 3120 acagatgata acatcaaaac aatcaaagaa ttaaaacgtc ctcaaaatgt taaggatgtt 3180 cgttcatttc tgggaacaat caatttttac ggcaagttta ttccgaatat ggcagataaa 3240 cgcaaacatt tgaataattt attgaaaaag gatgtaaaat ttaaatggac tgaagagtgt 3300 gaaaaagcgt tccatgattt gaaacactgc ttaatttcag aaccaatttt agtacgtccg 3360 aattataagg atacctttgt cattacaaca gatgctagtg attatgctgt tggagccgtg 3420 ttatcaaatg ctaaaacaaa tgataaccca atcgcttatg ctagtagagc attaagtgtt 3480 tctgaaaaga agtatcacgt tattgaaaaa gaattactcg ctatagtttg ggcagttgaa 3540 tattttaaac attatatttt taaccaacag tttattgttt atactgatca taggccgttg 3600 attgcaattt ggagactaaa agagacctca ccgactcttt ccaaattaag attaaaaatt 3660 caaggtattg gatgtgatat ccgatataaa cagggaaaag agaatgttgt tgctgatttt 3720 ctttctcgta tcaagcacga agaaaattca aatgcaaaca ttgaaaatgt ttcggaaagt 3780 tcaaataata atttaattgc tgttgttact agacaacaaa cacgtcaaaa ttcaaattta 3840 attgataact caaaacctgt tcgaaataat caatctttat tacaacagaa ttcaaattat 3900 actactgata atctttcaga caatgatgat gatttagaca taactattga ttccttacag 3960 gctatcgata ttatggacaa aaaggatgat gattcatgga aaaagtttac attggatgac 4020 tttttcgatg ctcaaaacaa agaagaaatt aattttagtt tacttaaatt tacaaaaaca 4080 atgatcaatg tgaataatac agatgctaat tttgtcatca ttaatagtaa gacagcgttc 4140 aaagagctaa gtgaattggt ggatctacca catggtatga aggacaatat aaatggaaga 4200 attttttcat ttccagataa caaattgtgg ggttttattt tgaatggatc aaaacgttca 4260 attactaata gagaagagtt atttgagggt ttgttagaaa gttttgttaa atgtcctgat 4320 tttgccaaat cagctaaaaa tattcaaatt atttcattta gaaatttaca gcaacttccc 4380 gatgtcaata ttctaagatt tatggcagga aaatttgata aaatgtttac actatatgca 4440 tctaatgaag acagaatgtt tgtaaaagta gaagacagag aaacggtttt gaaagaattt 4500 catgatgctc cccttggagg tcatgtggga ggtaaaagaa tgcttcaacg tataagtcca 4560 cttttcacat gggaaaacat gcgtagagat attgaaaact atgtaagaca atgtgaactg 4620 tgccaaaaga acaagatttg gcccaagaat aagattccta tgaagataac aacgacttca 4680 acggaaccct tccaaaaagt attcatggat attgtaatat taaccccatc tgacgacaat 4740 aatcgttatg gtttagtgat tcaagatgat ctaacaagat ttttaacagt tgctgtattg 4800 cctgatcagg aaagctcaac tgtcgctaaa gcattcgtgg aacattttat ttgtcgctat 4860 ggtgctccac tagaggtagt taccgatcaa ggaacaaatt ttatgagtaa tttactgaaa 4920 aatgtatgca aaattttgaa aataaagaaa ataaatacaa gtgcatatca tccacaagca 4980 aatttagtag aacgatcaaa tagagaactg aaaatctatt tgcgaaactt tattgttgga 5040 aatccgcaaa catgggatca attaattcca tttttcactt tccagtataa tacaacaatt 5100 aattcgtcta ctggatttac accatttgaa ttattgtacg gaagattagc gagaattcca 5160 aatacagttt ataacattac tgacagagag ttaaattacg atgattacat aaaggaaatg 5220 aaagcaaatt ttaaaaatgt ccacgacaaa gctgtagaaa acgtaattgc ttcaaaacat 5280 aaaagaaagg aaatttacga caaagatgca aaagaatggc aaccttggtg gggggatttg 5340 gttattgtag aaaccattcc cacaggtgta ggaaagaagt tacaaagttt atggcgaggt 5400 ccatatgaag ttgttgattt accaagtgaa caaactacag taattaagaa tggaaacaag 5460 ttggaaaaga ttcacaacaa tagattgcga aaattcaatg attaacgagg attttattta 5520 tgaacaagca ttttattata aacaaatgac taagatattg ggaaggtaaa aagatgaata 5580 tagattataa attacaacgt gcagaaaaag aaagattatc aaggtcatga actgaaatta 5640 aatccacgtg gtacataatt ataacatatg tttaaagacg tcaataatta tatttaatgg 5700 gattatttag taaaagagtg cactctactg atcagtacaa taaatgatat cacagtataa 5760 ttgagatgga taaatgatgt tttgatgcgt gattggacaa gggactgcag cccagaatac 5820 atgtgtttgt gtgtgaaaaa taaaatgcag tgtgggtcga gaaaacacaa atagattatc 5880 ccacatcaca tgataaagta tgtatttagt ttcattttag gaaaaaattt aggtccttat 5940 tcaaacaagc ctaattaaac ctggaaaaag agaggggaag aggattttgc aaacaaaatc 6000 tttacaacgc accttaaaca caaatattat cgatgaactg aagtaaaaaa cgtgagagta 6060 agcggagtaa gtataaaaaa ctaatatttt gaaaaacttt ttagccaatg ccagtgtaga 6120 tcaatgagca gtagatgcag gataggctta ggaaattata attataatat actttgctac 6180 aatgattaat atcaacttta aacaaagaat tgttaaattg atagtcactt aggaaattat 6240 tctctgtaac acgacactac acaaatatgt aaaatctcac ctacacgaac acaaatatca 6300 tcaaacgtta acattcggat acagcggata gcagcaatac aactacaaca tttcggacaa 6360 cacatttggc atcaacgaat tattggaagt attgcattca taaattgcag aaaattgaga 6420 caggagatga gttgtataaa aaatggtgga acataacaat gatggagtga agtatatgat 6480 acaaggcaaa gaatggttat gatgctaaat tatattacat ggaatagttt ctgtttgttc 6540 agtcagaagc aaaaacttct caaccggaaa aatatatatc aaaactgaca attaaggcaa 6600 gacatccaaa tacaattctc ggatctaatg gaaaggaaca agtatggcaa aatgatgaaa 6660 catgatctgt acactacaat cagatcttat ggaatggaat gaaaagtttc gattgaggtc 6720 cacgaaaatt ttatcgactg cagaacacaa agaaggactg acgcagatct ctatgcaata 6780 agaaaaaagc atcaaggaaa ggacaagaag gaaaatgtat caaataatgt ttggcagaat 6840 tgatccacta tagtcaaatc aacaaagtca aatgtctcaa tctatgtggt ttattgaatg 6900 ttttcaagtt tcaaacaaaa agtcaagtga agaacaacat cgacgaatta tcaagccagc 6960 tattttatca attatatcaa cgcagaggag acaacagaac gtgaacaaca attatattta 7020 gtactcagat atacaaaaac atatttttca aaatcaacgt ttatcaagaa ccagcgttag 7080 gcttacagta gatgattatc ttagcagaca aacatctagc tgggtagtgt aagggta 7137 // ID Copia-133_AA-I repbase; DNA; INV; 3861 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-133_AA_; KW Copia-133_AA-LTR; Ty1_copia_Ele8; Copia-133_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3861 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1263-1790] - Integrase core CC 'CTTTT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 1086..2210 FT /product="Copia-133_AA-I_1p" FT /translation="MSKEVRHLPDCQHTWHRRFGHRDPAVLDRIQEEKLCE FT GFSMHDCGIRHVCEHCLAGKLSRIPFPKQAQNRAARILDVVHTDVCGPMKN FT VTPGGCRYLMTLIDGYSRYTVVCLLRQKSDAAGCIKRYVAHVKTRFGRAPG FT AIRSDGGGEYANKELKEFYEREGIQAQFTTAYSPQQNGVAERKNRSLQEMA FT TCMLLDAGLPKKYWGEAIMTAGYLQNRLPSRSVDSTPFEKWFGRKPSLKNL FT RVFGSPAYVHVPDVKRSKFDHKARKLLFVGYCDDRKAFRFLNPENDEITIS FT RDARFIELGDGSLDQRDPVSDGSEQFVELDVTAVKSQQRRKIRPNWSHRKM FT KQHFRRSRVRTITIRMRKKVQSAENAGAEAKH" FT CDS 2334..3851 FT /product="Copia-133_AA-I_2p" FT /translation="MAEEYDALMRNDTWSLVQLPAGRVPIGCKWIYKKKED FT SSGNVSRFKARLVAQGFSQRYGVVYDAVFAPVASQATLRILLTVAGYKKMA FT VRHLDVKSAYLHGKLQDEIYMQQPKGFVVPGKEKLVCRLKRSLCGLKRAAR FT IWNTTISELLKGLGFDQSRSDPCLFIKRSPGGGFVYLMLHVDDMIVAGMTE FT KEIDAVEFELRKKITLSSLGEIGHFLGIRITKDKNGFYCLDQENYIKKVAS FT RFGLEHAKGSSVPIDVGYFRSREGSKELPDNERYRSLAGALLYVAVNTRPD FT VAASVSLLCRRISQPTEVDWNELKRVVKYLMKTSNSKLRLCAERGKPLRLS FT GYCDADWGGDTADRKSNTGYLFLLGEASVSWASRKQSSVSLSSMEAEYIAV FT SEACKELVWLRRMLDEMGAEQCGPTVLFEDNRSCMEFVEMERVSRRSKHID FT TRMFYTKDMCEKGLVELRFCPSENMTADLLTKPLGPVKQKKFAEAMGLTEV FT ESWAGRREKY" XX SQ Sequence 3861 BP; 965 A; 826 C; 1215 G; 851 T; 4 other; ggttatgggc tccaggcckt gcgggaactg gaacagaaag tgtttttttt tktgcgaaag 60 tgaatcgttt cggcgattga ggttatgtcg cttggcgaat aaccaccagt gattcgactg 120 tgtgcgttgt ttcatcgatg cgagtgcgaa caagtcgaga caagtttgcg tgcgactctc 180 tcggtgcggg tgcgcgaaag tgaaaacaaa ctttgcgagt gttttgctcg atcgcgggcg 240 tgcgaacagt gaaccgaaaa tgagcgagaa atatttgttc tcccggttga gtgaccagaa 300 ctatcccgtc tggaagaagc gcatggagat gcttcttaag cgcgaagatt tgtggtcggt 360 agtgtcggag gcgaagccgg aaccggcaac cgatgtgtgg tcgaaacgtg atcagaaagc 420 gctagctact atcgttctgt acgttgaaga cagtcagctg aatttagtgc gggatgcggt 480 tacagcggct gatgtgtgga aagtgctccg ggatttccac gaaaaagcga caatgacgtc 540 gcgggtggca ttgctccggc ggatttgcag tctcaacatg gctgaaggag gagacatcga 600 gaaacatctc tacgatctcg aagagctttt tgaccgacta gccactgctg gtcaagctct 660 ggaaacacca ctcaagattg cgatgattta tcgcagtctc cctgaatcgt atggaggtct 720 gataacagca ttggaaagca gacccgatgc agatcagacg ctccagctgg tgaagcaaaa 780 attactcgac gagcaccagc ggcgtgtgga gcgatcaggg gaagtttcgg agaaagccat 840 gaagagtcgg agtaaggaaa aggagaagat ctgctaccat tgtcggaagc cagggcactt 900 ccggcgcgat tgccgtctgt tgaaatcgca gcagaagagt gataccgaac ataaagtgaa 960 gccttccgga agcaaggcga agaaagcggt cacagtgcag tgttgtgaat gaagagggat 1020 aagtggtcgc tacggctgag ctgtgtggca atttgtacat cctgaagacg gcagcggatg 1080 cgcggatgag caaggaggtt cggcatctgc cggattgcca gcatacgtgg catcgccggt 1140 ttggccaccg ggatccggcg gttttggatc gcattcagga ggagaaactc tgcgagggtt 1200 tcagtatgca cgattgcggg atcaggcatg tatgcgagca ttgcctagca ggtaaattat 1260 ctagaattcc ttttcctaag caagcacaga accgtgctgc tcgtatattg gacgtagttc 1320 atacggacgt atgcggtcca atgaagaacg tcacgccggg agggtgcagg tacctgatga 1380 ccctcattga cggctacagt cgctacactg tggtctgcct gctcaggcag aaatcggacg 1440 ctgcgggctg tatcaagcgt tatgtggccc acgtgaaaac gcgattcggt cgagcgcccg 1500 gtgctatacg ttctgatggt gggggcgaat acgctaataa ggagttgaag gagttctatg 1560 agcgagaggg catccaggcc caatttacga ccgcgtattc cccacaacaa aatggcgtgg 1620 cggaacgtaa aaaccgatcc ctccaggaga tggcaacgtg catgttgttg gacgcaggac 1680 ttcccaagaa gtactggggc gaggcgatta tgaccgcggg gtacctccag aaccggttgc 1740 cgtcgcgatc tgtcgattcg acaccgtttg agaaatggtt cggcagaaag ccttcgttga 1800 agaatctgcg tgtattcgga agtccggcgt acgtccacgt ccccgacgtg aagcgaagca 1860 agttcgacca taaggctcga aagttgttgt tcgttggcta ttgcgatgat aggaaggcgt 1920 ttcgtttcct gaacccggaa aatgatgaaa tcacgatcag ccgggatgct agattcatcg 1980 agttgggaga tggttcgctg gaccagcgtg atccagtatc cgacggtagc gaacagtttg 2040 tggagctcga tgtgacagcg gtgaagagcc aacagcggag gaagattcgg ccgaactgga 2100 gtcacaggaa gatgaagcaa cacttcagga ggagtcgtgt tcggactatt acgatccgga 2160 tgaggaagaa ggtgcagtcg gcggagaacg ccggtgccga agcgaagcac scgcggagtg 2220 cttccgaagc gattcgaaga ttacgaagtg gatgcagcca tagcagttga aggagaaccg 2280 gatacctacg aagaggcagt gaacagttcc gaacggattt gtggaaggcc gcaatggctg 2340 aagaatacga tgcgttgatg cggaatgaca cctggtcgct ggttcagctg ccggccggcc 2400 gtgttcccat cggctgtaag tggatatata agaagaagga agacagttcc ggcaatgtat 2460 cgcgattcaa agctcgcctt gtagcccagg gcttttcgca gcgttatggg gtcgtttacg 2520 acgctgtatt cgccccggtt gccagccagg cgacgttgcg gattctcttg acagtggctg 2580 ggtacaagaa gatggcggta cgtcacttag acgtgaagtc agcatacctt cacggaaagc 2640 tgcaggacga aatttatatg cagcagccga aggggtttgt ggtgcctgga aaagagaagc 2700 tcgtatgtcg gttgaagcgg agcctatgcg gcttgaagcg ggctgcaagg atttggaaca 2760 ccacgattag cgagttgctg aaaggattgg gcttcgatca gtcccgatca gatccttgtt 2820 tgttcatcaa acgttcgcca ggaggaggtt tcgtctacct gatgttgcat gtcgatgata 2880 tgattgtcgc cggcatgacc gagaaggaga ttgatgctgt cgagtttgag ctgcggaaga 2940 agattacctt gtcgtcgttg ggagagattg gccattttct gggcatccgg attacgaagg 3000 acaagaacgg attctactgt ctggatcagg agaattacat caagaaggtg gcgtcgcgat 3060 tcggactcga acacgccaaa ggttcgagcg tgccgatcga cgttggttac ttcagaagtc 3120 gtgagggaag taaggagctg ccagacaatg aamgatatcg tagcctcgcc ggtgcgctac 3180 tctacgtcgc tgtcaataca cgtcccgacg tagcagcaag tgtgtcgcta ttgtgcagga 3240 ggatcagcca gccgacagag gtagattgga acgagctgaa gcgtgtggtg aaatatctca 3300 tgaagaccag caatagcaag ctgcgactgt gtgcagaacg tggcaaaccg ctgcgtttga 3360 gtggatactg tgatgctgac tggggcggag acacggcgga tcgcaagtcc aacacgggat 3420 acctgttctt acttggtgaa gcgtctgtta gttgggcgag tcggaagcag tctagcgttt 3480 cgctctccag tatggaagcg gagtatatcg cagtgtcaga agcgtgcaaa gaacttgtgt 3540 ggctacgccg tatgttggat gagatgggag ctgagcaatg cggccccacc gtgctgtttg 3600 aagataaccg aagctgtatg gagtttgtcg agatggagcg cgtctcgcgc aggtcgaaac 3660 acatagacac tcgtatgttc tacacgaagg acatgtgtga gaaaggtttg gtggaactac 3720 gattctgccc ttcggagaac atgacggccg atcttctcac gaagccgttg ggtcccgtga 3780 agcagaagaa gtttgcagaa gccatggggt tgacggaagt tgagagttgg gccggaagaa 3840 gagaaaaata ttgaggagga g 3861 // ID Gypsy-145_AA-I repbase; DNA; INV; 4231 BP. XX AC AAGE02022110; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-145_AA_; KW Gypsy-145_AA-LTR; Gypsy-145_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4231 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022110; Positions 85422 81192. XX CC 'AAACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 17..4186 FT /product="Gypsy-145_AA-I_1p" FT /translation="MAQPDPPPADPAAQGAQQIAGAPPPNFHFDAFDRRKI FT RWSRWVERLETAFVIYGVGAEELRRKNLLLHLMGPDTYEIACDKVAPQNLR FT DMRYQEIVDVLEGHFNPQPLEISENFRFKCRRQGDKNASSPDETIDEYIVA FT LRRIAVTCNFGGYLETALRNQLVFGLKRNDIRSRLLERDIAVGMELSLKGG FT AEIEGAFAKQEVHSLQHPPGKPKPKKLGGKSASEMHCFRCGDKSHLAKKCR FT HQHTECSFCKIRGHLERVCMKKAAAAKGDANRKGAKSGAAKTNYIELRNDS FT GGSAAATGGEVEVGEICTVDSRSLEAKLWLPVKVCGVEVRFEVDTGSPVSI FT INSQYYGKYFRGMKLHTSDVSLVSYCNTDINVKGVFKAAVEWNGTESMLPL FT YVVESGKHPLLGREWLKVLAIDWNSVLRKPAAVNTINPTTTTSAALSQLFQ FT TYGKVFEDSIGRISSVQAHLTLKPNAAPVFLKARKVPFNLMKAVEDELDKL FT VSEGVLTKVDNSNWATPIVPVKKSDNRVRICGDYKQTVNPNLKVDRHPLPT FT VDELFASLAGGEKFSKIDLVQAYLQLEVAPEDREILTLNTHRGLYRPNRLM FT YGLSSAPAIWQRQIEVILQGIEGVTVFLDDIKITGPTEEIHLARLEEVLRR FT LHRYNIRVNKQKCVFFADRIEYCGYLLDREGVHKVKKKMDAVQDMPRPRNK FT EEVRSFVGFINYYGRFFDNLSSLIYPLNNLLKDSVEFRWTKECERSFNAVK FT QQMQSERCLVHYSPELPLVLATDASPYGVGAVLSHVYPDGSERPIQFASQT FT LNRTQQKYMQVDKEAYAIIFGVKKFFQFLYGRRFLLVTDNHAITKIFNEQK FT GLPVMSALRMQHYATFLQSFDYEVRFRKSADHANADAMSRCPLTKADPDNV FT VEESDVVEMNQIDTLPVTAEELSQATSEDKVVKKLMQGLRHGQAVDGKDRF FT GIEQSEFAIQNGCLLRGIRVYVPRTLRERVLKELHSTHFGMTRTKSLARGF FT CWWMGIDHDIEIMISNCADCQSVRPEPVKVLPHPWEPATEPFQRVHADFAG FT PFLDSYFFIVVDAYTKWPEIVVCKSITADSTERMCREIFSRFGIPAVFVSD FT HGTQFTAESFQSFLKQNGITHKMGAPYHPATNGQAERYVQTFKQKLKALHC FT SKSKLDVELANILMTYRKMIHPSTGQSPSMMMFGRQLKSRLDLMLPKNPER FT RPANPVARVFSDGDRVRVRDFLTANKWQFGRVVAKLGKLRYSVRLDDGRLW FT ERHVDHMAGVGSKLAISGEIELPTMVEARAEHNHEFPELVPPVVTTTDRGG FT DRNVQTTAEAAGGPPVAGTSTPTRLTVTPPVQPLANGGQRPQAVVSEDVVQ FT PRQVHGRNDQPMRRSSRVVKPPQRLNL" XX SQ Sequence 4231 BP; 1111 A; 979 C; 1166 G; 975 T; 0 other; gttttggccg acgaggatgg cgcaaccaga tccaccacca gccgatccag ctgctcaggg 60 tgctcagcaa atcgcgggag ctccgccacc gaactttcat ttcgacgcat tcgacagaag 120 gaagattcga tggtcgcgat gggtggagag gttagaaacg gcgtttgtga tttacggcgt 180 cggagcggaa gaattgcgtc gcaaaaactt gctgctacat ttgatgggcc ccgacacgta 240 cgaaattgct tgcgacaagg tggctccaca gaatcttcga gacatgcggt accaggagat 300 tgtcgatgtg ctagaggggc atttcaaccc acaacccctc gaaattagtg agaactttag 360 gttcaagtgc cgtcggcagg gggacaagaa cgccagttcc ccggacgaga caatcgacga 420 gtacatcgtt gcgctgcgca ggattgccgt aacctgcaac ttcggaggat acctggagac 480 tgcattgagg aatcaactag tcttcgggtt gaaacggaac gacattcgca gccgcctgct 540 ggaacgggac attgcggttg ggatggagtt gtcgctcaaa ggaggagctg aaatcgaagg 600 tgcttttgcg aaacaggagg tacattccct acaacaccca ccaggtaagc ctaaacctaa 660 gaaactgggg ggcaagtccg ctagcgaaat gcactgcttt cgctgcggcg acaaatcaca 720 cctggcgaag aaatgtcggc atcaacacac agagtgctca ttttgcaaaa taagagggca 780 tcttgagcgc gtatgtatga agaaggctgc tgctgcaaaa ggtgatgcta accgaaaggg 840 tgcaaaatcg ggtgctgcaa aaacaaatta catcgaactt cgcaacgatt caggtggaag 900 tgctgctgct actggtgggg aagttgaagt gggggagatt tgtacagtgg attctcgttc 960 gttagaggcg aaattgtggc tgccggtgaa ggtttgcggt gtcgaggtgc gttttgaggt 1020 agataccggt agcccagtca gcatcatcaa ctcgcagtac tacggtaaat acttccgcgg 1080 gatgaagctg catacgagtg acgtcagtct tgtaagctac tgcaatacgg acatcaacgt 1140 gaagggggtg ttcaaagcag cagtagaatg gaatgggacg gagtcaatgc tgccgctgta 1200 cgttgttgag tctgggaagc atcctctact gggccgggaa tggttgaagg ttcttgcgat 1260 cgattggaat tccgttttgc ggaagccggc tgcggtcaac acgatcaatc caactactac 1320 tacttctgct gcgttgagtc aacttttcca aacctacggg aaggtgtttg aagattcaat 1380 cggtcggatt tcctctgtgc aggcccatct tacattgaaa ccaaatgcgg ctccggtttt 1440 tctgaaggct cggaaggttc ccttcaactt aatgaaggca gtcgaggacg aacttgacaa 1500 attggtgtct gagggagttc tcaccaaggt ggacaacagc aattgggcca cccccatcgt 1560 tccggtcaaa aagtcggata accgagtcag gatctgcggg gattacaaac agacggtgaa 1620 ccccaacttg aaggtcgatc ggcatccact tccaacggtc gacgagttgt ttgcttctct 1680 ggcaggcgga gaaaagtttt caaagatcga tcttgttcag gcgtatttgc agttggaggt 1740 tgctcctgag gatagagaaa ttctaacgct gaatacgcac cgcggactgt accgtcctaa 1800 ccggcttatg tacggtcttt catccgctcc tgcaatatgg caacgccaaa tcgaggtcat 1860 cctacaaggc atcgaaggtg ttaccgtatt cttagatgac attaagataa caggaccgac 1920 cgaagaaatc catctggcgc gattggagga agtgcttcgt aggctgcatc gctacaatat 1980 ccgagtcaac aagcagaaat gcgtgttttt cgccgatcgc atcgaatact gcggatacct 2040 tttggatcgc gaaggtgttc acaaagtgaa gaagaagatg gatgccgtac aggacatgcc 2100 gcggcctcgg aacaaggagg aggtgcggtc gttcgtagga ttcattaatt attacggccg 2160 ctttttcgac aatttgagtt ctctgattta tccgttgaat aacttgctga aggattccgt 2220 cgagttcagg tggacgaagg aatgcgaacg ttccttcaat gccgtcaagc agcaaatgca 2280 gtcggaaagg tgtttggtcc actactcgcc ggagttgccg ttagtcctag ccaccgatgc 2340 ctcaccttat ggtgtaggag ccgtcttgag ccacgtctac cctgatggtt ctgaacggcc 2400 catacagttc gcttcgcaaa ccttgaaccg tacacaacag aaatatatgc aggtagacaa 2460 agaggcttac gccatcatct ttggcgtcaa aaagttcttc cagtttttgt acgggcggag 2520 attcctactg gtgacggaca accatgcgat tacgaagatt ttcaacgagc aaaaaggact 2580 tccggtaatg tctgcattga ggatgcagca ctatgccact ttcctacaat cttttgacta 2640 cgaagtgagg tttcggaagt ccgcagatca tgcaaatgct gatgcgatgt ccagatgtcc 2700 actgacgaaa gctgatccag ataacgtagt ggaggagtcg gatgtggtgg aaatgaatca 2760 gattgacacc cttccggtta ctgcagagga gttgtcgcag gctacatcgg aggacaaagt 2820 ggtgaagaaa ctgatgcagg gattacggca cggacaggca gtcgacggga aagaccgttt 2880 tggaatagaa caatcggagt tcgctatcca gaatggctgt ttactgcgag ggatccgtgt 2940 ctacgtgcca agaacgcttc gagaacgagt actgaaggag ctgcactcta cacatttcgg 3000 catgacgcgt acaaaatcac ttgcaagagg tttttgctgg tggatgggaa ttgaccacga 3060 catcgagatt atgatttcga actgcgcgga ttgtcagtcc gtccgccctg agccggtgaa 3120 ggttctaccg cacccgtggg aaccagcaac cgaaccattc cagcgagttc atgccgattt 3180 tgccgggccg ttcttagaca gctacttctt catagttgtc gacgcctaca cgaagtggcc 3240 cgagattgta gtctgcaagt caatcacagc tgacagcacg gagagaatgt gtcgagagat 3300 cttcagccgc ttcggaattc ctgcggtgtt cgtaagcgat catgggacgc aatttacggc 3360 tgaatcgttt cagagtttct tgaagcaaaa cgggattaca cacaagatgg gtgcccctta 3420 ccatcctgcg acaaacggtc aggcggagag atatgttcag acgttcaaac agaaactgaa 3480 ggctctacac tgttcaaagt cgaagcttga cgtcgaactt gcaaatatcc tgatgacata 3540 ccgaaagatg attcacccgt caactggaca atcaccgtct atgatgatgt tcggaagaca 3600 actcaaatcc cgcctagacc tgatgctacc gaagaatcca gagcgaagac cagcaaaccc 3660 cgtagctcgt gtgttttccg acggagacag agtaagagtt cgtgattttc tgacggcaaa 3720 caagtggcag ttcggcagag ttgtggcgaa gcttgggaag ctgagatact cggttcgact 3780 cgatgatgga aggctatggg agcgacacgt tgatcatatg gctggcgtgg gatcgaagct 3840 agctatatcc ggagaaattg aactgccaac gatggtggaa gctcgagctg aacataacca 3900 cgagtttcct gaactggttc caccagtcgt tactactact gatcgtggtg gtgaccgcaa 3960 tgtacaaact accgcagagg ctgccggtgg accgccagtt gctggtacat ctacccctac 4020 gagactgaca gttaccccgc ctgttcaacc attggcgaac ggtgggcagc gaccacaagc 4080 cgtggtatcc gaggatgttg tgcaaccgcg ccaggtgcac ggaagaaacg atcaaccgat 4140 gagaagatct agtagagtag taaaaccccc tcagagattg aatctctaac tattttgcat 4200 tatataactt tcttttcacg agggggagag t 4231 // ID Gypsy-197_AA-LTR repbase; DNA; INV; 164 BP. XX AC supercont1.83; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-197_AA_; KW Gypsy-197_AA-I; Gypsy-197_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.83; Positions 2895173 2895010. XX SQ Sequence 164 BP; 45 A; 19 C; 34 G; 66 T; 0 other; tgtgatgaga tgtctaagta tgtttatctg tggtctttac caacacaatg atgtatgtag 60 tacggtgtta cctttatatt gatatacttt gtttgtttac aactgaatac atgattgatg 120 gtagcgaatc agagtagttg gtgttattaa gtttatctat caca 164 // ID Kolobok-19_HMa repbase; DNA; INV; 2575 BP. XX AC . XX DT 15-SEP-2009 (Rel. 14.09, Created) DT 15-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Kolobok-type DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-19_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2575 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1918-1918 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 376..2178 FT /product="Kolobok-19_HMa_1p" FT /translation="MGRRKSGYKPLKRVFHGNRFVVAPLETIKNPISTSEI FT IINEAPCSASGLKLKNNNVKEADLNISYLLIDFNILAETICLYSVCPECNQ FT MGLCTQNLTTEKKGWANKLRLSCTYCQWSKNIFTSKEIILPNKSGQKCFDI FT NTRCVMAFREIGKGHASIQTFSRILSLTDPINLKAYNEINNKMLNSYMTAA FT NESMTNAVRECKANVDEVVECTVSLDGTWQKRGHESLNGVVTAISRTNNKV FT IDYEVLSKKCQACLSWRNKKGTNEYQIWNSSHNCSINHKGSAGSMETQGAI FT AIFHRSIHQNGLRYKNYVGDGDSSSFLKVVESKPYGEDFVINKLECVGHIQ FT KRLGNRLRSLRQSMKGQVLVDGKKLVGKGRLTESAINLLQNYFGMSIRQNS FT EVYSMKKAIIAVLLHCSESDTKAQRHLYCPRTNDSWCKWYKLNEVERLNYE FT PKLSLPAAIRDILKPIFMELSADSLLIKCLHGMTQNVNESLNAMIWSRCPK FT SQYCSQKIIKIGVCSAIIHFNDGFNGLHNVFKFLNITPGASFCINGPKRDL FT CRIKKANKKSSSFCKKQRKVLRAKRKGFIDEQTNTNDCESYMTGAAYFSSP FT FPE" XX SQ Sequence 2575 BP; 936 A; 365 C; 442 G; 832 T; 0 other; ggtggtccta taccattaaa aatagaaaat ttcaaactta attttgtgca aaatttatac 60 tagtttaaga ttttttaata gttttaacta aattcagagc acaagttggt atatttttaa 120 ttttaaacaa agtactgaaa atgttttcca tagttacgcc cctagcaacg gcaatttcaa 180 aatcaaaaaa gtttttgctg atctttttta cccactgcta aaatgaatct ttgaaatgtt 240 aacactgcat ttttaaagta tagtttgcca acttttaaac tcattttttg gctatacttt 300 gcaagaacaa attgacttta acctcgtgct tcatttaatt tgtttgaatt tagtaaattg 360 gtactagagt taaaaatggg tcgaagaaaa tcaggttaca aacctttaaa aagagtattt 420 catggaaatc gatttgtagt tgctccacta gaaacaataa agaatccaat atctacttca 480 gaaatcatta taaatgaagc tccatgttca gcaagtggtt taaaattgaa aaataataat 540 gtgaaggaag cagatttgaa tatatcttat ttgctaatag actttaatat tttggcagaa 600 acaatatgtt tgtattcagt ttgcccagaa tgtaatcaaa tgggactgtg tactcaaaat 660 ttaacaacag aaaagaaagg atgggcaaat aaactgagac tttcctgtac ttattgtcaa 720 tggagtaaaa atatttttac ctctaaagaa attattttac ctaataaaag tggtcaaaaa 780 tgttttgaca ttaatactag atgtgttatg gcatttcgag aaattggcaa aggtcatgca 840 tcaattcaga ccttttcgcg tatactaagt ttaacagatc ctataaatct caaagcttat 900 aacgaaatta acaacaaaat gttaaactca tacatgactg ctgctaatga aagtatgacc 960 aatgcagtta gggaatgcaa agctaatgta gatgaagttg ttgaatgtac tgtatctcta 1020 gatgggacat ggcaaaaaag aggtcatgaa tctcttaatg gtgttgttac tgcaatatct 1080 cgtactaaca ataaggttat tgactacgaa gtactttcaa aaaagtgtca agcttgcctg 1140 tcatggcgta ataaaaaagg cacaaatgaa taccaaatat ggaacagcag ccataactgt 1200 tcaataaacc acaagggctc agcaggctca atggagacac aaggagctat agcaattttt 1260 catcgatcaa ttcatcaaaa cggtttgcgt tataaaaatt atgttgggga tggtgacagc 1320 agctcatttc ttaaagttgt agagagtaag ccatatggtg aagattttgt tattaataaa 1380 cttgaatgtg ttggtcacat acaaaaaaga cttgggaata gactacgttc actacgacag 1440 agtatgaaag gacaagtttt agtagatggg aaaaaacttg ttggaaaagg gcgcttaact 1500 gaatctgcta ttaacttact gcaaaactac tttggcatgt ccatcagaca aaactctgaa 1560 gtttatagca tgaaaaaagc cattattgca gttctgcttc attgtagtga gtctgatact 1620 aaagctcaaa gacatctata ttgtccaaga accaatgata gttggtgcaa atggtacaaa 1680 ctaaatgaag tggaaaggtt aaattatgaa cctaaattat cactgcctgc tgctattaga 1740 gatattttaa aaccaatttt tatggagcta agtgctgata gtttattgat aaaatgtttg 1800 catggaatga cccagaatgt caatgagtca cttaatgcaa tgatttggtc tcgatgccct 1860 aaaagtcaat attgcagtca aaaaataatt aaaattggtg tttgttcagc aattattcat 1920 tttaatgatg gttttaatgg acttcataat gtatttaaat ttttaaacat tactcctgga 1980 gcttcatttt gcataaatgg tccaaaacga gatttatgta gaataaaaaa agctaacaag 2040 aagtcaagta gtttctgtaa aaaacaaaga aaagttttaa gagctaaaag aaagggtttt 2100 attgatgaac agacaaacac aaatgactgt gagtcttata tgactggagc agcctatttt 2160 tcctcaccgt ttccagaata aaatttccta gactttacag tttgatctaa aaatttccca 2220 agttattcgt ttaggtttga tctacaacct gaactaaaat gaaagggtct tagaactaac 2280 gtaactatag taaatttttg atataaatga ttttttatct ataaaattta aactactcaa 2340 taaattttat tgattttagt tcaaattgtt cttcatagta aaacaaaact ttttgtaaaa 2400 ttttattcaa tgagtaaata tggtttaaaa gttattaagg aaacagtaat tggggtattt 2460 cgggtcccaa catggtttat taagcccgta aaatatctgg gcatcctatt aaatgcagta 2520 ataagtgttt taaggtcaaa cgttagtgac acaagtttga tggtatagga ccacc 2575 // ID Sake_BM repbase; DNA; INV; 5140 BP. XX AC . XX DT 17-FEB-2006 (Rel. 11.02, Created) DT 30-APR-2010 (Rel. 15.05, Last updated, Version 2) XX DE Non-LTR retrotransposon Sake_BM - a consensus. XX KW Daphne; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; reverse transcriptase; LTR retrotransposon; KW AP-endonuclease; Sake_BM. XX NM Sake_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5140 RA Schon I. and Arkhipova I.R.; RT "Two families of non-LTR retrotransposons, Syrinx and Daphne, RT from the Darwinulid ostracod, Darwinula stevensoni."; RL Gene 371(2), 296-307 (2006). XX DR [1] (Consensus) XX CC Sake is a non-LTR retrotransposon from the domestic silkworm, CC Bombyx mori, and is a member of the Daphne clade, which is a CC sister clade to the L2 clade, and includes elements from both CC protostomes and deuterostomes. The full-length consensus sequence CC was assembled from 500 contigs in the B. mori (strain Dazao) CC genome assembly at NCBI. The divergence of individual copies from CC the consensus is 1-5%. Sake codes for two overlapping ORFs: ORF1 CC with a CX4CX10HX7CX8CX13LX6LX6L motif and two coiled-coil CC domains, and ORF2 containing the AP-endonuclease and reverse CC transcriptase domains. The 5' UTR of Sake is 290 bp in length, CC and the 3' UTR is 240 bp in length and ends with a (TTTGA) CC microsatellite sequence. The complete 5 UTR region is found in CC ten different contigs, indicating that there may be at least ten CC full- length copies of Sake in the B. mori genome. None of the CC assembled contigs contain the element in its entirety, although CC the low level of sequence divergence may indicate recent CC activity. XX FH Key Location/Qualifiers FT CDS 290..1933 FT /product="Sake_BM_1p" FT /translation="MMTRSIKSRQVDEQLRIALEQLKNTREKYDILLQESE FT QNEEEMLSVISKNTDLKKQLSQLFIEHSEALEINQKLQDTINTFSQCSDEF FT CDSLKITAELRHKLAEADDCISDLRSELCKLKEQQPQSLYSELVKSEPSLV FT AVDTALNFDITTIDLTGSDAETSERSQLVLSKNKVRKYVKINKLIYKTQRK FT LRNYKSQEYIKFICKQRTSLSHQVAVLTEQSQQIHHKHMEEIECLSTQINN FT LKSSLENITLNYELAQRDISEHVLAMDNLIDTCNYNQQRFDSLTMQYAECK FT CSASGCQELPHVEIQKHNVSILSVLESQSNDIMKAAPICIPTTASSQVSRE FT DKIKIKIFSDELGKQMGVALHDLCFGQHVINYCMPGANPAQIFDKILNTTH FT CLNTVIIIMLGRRENVNQKQILHYVEQVNMLNIKKIIMFTFPYLQDLPTEN FT KIRFNINNAIYNAFDNNNCILTSKKYAFKLIDINNIIKFRYLRTSKNDIFK FT LSNKNFRQVATSLQYFIHNFLAINLAKKIPASIEQEKNMTCFKTLESASNL FT N" FT CDS 1906..4878 FT /product="Sake_BM_2p" FT /note="9-aa overlap; AP-endonuclease/reverse FT transcriptase." FT /translation="DFGICFQFKLISKTKESHSCNYKSINTTTRLYPSKDV FT NVSIINIFNLVHQNLQGMMGKELEIEMFLNMHNINVFCVTEHWFRNYELLF FT NFNDHQLSSSFCRENAIRGGSLILVSKCLKCKERKDVVALSVERIIEISCV FT ELEHLIVVCVYRPPDALYDSFENILENVLLKLSVSNKQIFVCGDFNINLLE FT NTNTTIRFRTLLKSYNLPNLFLEPTRTTTTSATCLDNIFTNVTPINKKIIN FT QLTSDHSGQFVSFESNMNSKDKNTLVIVPINKKRIEKFRNNIKQEHSEIIY FT NKNPNLSYQNMFERINLVFTDVFIPKTVTLKNKTVFSEWATTGVYKSRKKL FT YDLYSEKAYNNDETFHQYVRNYSKLFRKVCLAAKSLHLSDLIKNAPDKIKM FT TWNIIGRESGKVRNSHQEFSLKIDDKLVTSHEDVANAFEKFFSDIPVSTTT FT SLNSSPTAAEILLHNHVNKCNEIFKFKKINSSNIIKSFNSLNVKNTADLWG FT ISVKVLKSVIDIIAPHLVSIFNDCIRCGVFPDLMKHSKVIPLFKSGSTDDP FT SNYRPISVLPTLSKIFEKIILTQLLEHFNSNNLLHNKQFGFTRGRSTTDAG FT AYLVKNIFKSWEESHDCLGIFCDLSKAFDCVEHETLVRKLHHYGIRDGALE FT LITSYLSGRIQTVDVKGNRSSGTLLKMGVPQGSILGPFLFLIYINDLPSFI FT ESRHEVVLFADDTSLLFKIKRQLQVYDEVNDAISCVVHWFRINNLLLNSKK FT TKCIKFTLKCIKPSLNVRQVDSNVIVSEESLELVESTVFLGITVDSKLQWG FT PHIHKLASKLSSAAYAVKKIRMLTNADTARLVYFSYFHSVMSYGILLWGNA FT ADVEMIFILQKRAIRAIYNMHSRESLREKFKEIKVLTMPSQYIFENLMYVR FT KHIEEFPKMSDIHNRNTRNKHKLVVPMSRLHKIRNSFGCLSVRLYNKIPQD FT VQNLHIHRFKKTIKEHLCNKAYYKVNDYLEDCTKWE" XX SQ Sequence 5140 BP; 1817 A; 773 C; 878 G; 1672 T; 0 other; tttttttggt cggtaggttc ggttgtgttc ggttcgattt ctctgctccc gtaaataaat 60 taattaaaaa taaactaaca cttcataata tcattaaaac atatcaaagt agtgttaaag 120 gtgtataaat aagtaaacta tattataaat agtgctttaa aacgttattt aattgattgt 180 taacgttata atattgcttg gtgagtgtgt aatattttgt aatttgaggg taatctgacc 240 ggcaacttcg taaattttct ttacatattt tattgtaatt tattttaaaa tgatgactcg 300 cagcatcaaa tctcgtcagg ttgatgaaca gcttcgtatt gctttagaac aattaaaaaa 360 tactcgcgaa aaatatgata tacttttaca agaaagtgaa caaaatgaag aggagatgct 420 gtcagtcatc tctaagaata ctgaccttaa aaaacaattg tctcagctct tcattgaaca 480 cagtgaagct cttgagatca accagaagtt gcaggatacg attaacacct tttctcaatg 540 cagtgatgaa ttttgtgatt cattgaaaat caccgctgag ttgagacata aattggctga 600 ggctgatgat tgtatttctg acttgcggtc agagctatgt aaacttaagg aacagcagcc 660 tcagagctta tattcggaac tggttaagag cgaaccctct ttggttgctg tagatactgc 720 cttgaatttt gacataacta ctattgatct gactggtagt gatgcagaga cctcagagcg 780 ctcccagtta gttttaagta aaaataaagt aaggaaatat gttaaaatta ataaattaat 840 atataagact caacgtaaat tgaggaatta caagtctcag gaatacatca agttcatttg 900 caagcagaga acttcacttt cacaccaggt tgctgtattg actgaacaga gtcaacaaat 960 tcatcacaaa cacatggaag aaatagaatg tttatctaca caaataaaca atttaaagtc 1020 aagtttagag aacattacat taaactatga actagcacag agggacatta gtgagcatgt 1080 gttggcaatg gacaatttaa ttgatacttg taattataat caacagcgtt ttgactcttt 1140 gacaatgcag tacgctgaat gtaaatgctc tgcatcgggt tgtcaggagc taccacatgt 1200 tgaaattcaa aaacataatg tgtctattct ttcagtgcta gagtctcaaa gcaatgatat 1260 tatgaaggca gcacctatct gtatacctac tactgcatct tcacaagttt ctagagaaga 1320 caaaattaaa attaaaattt ttagtgatga gctggggaaa cagatgggtg tggcattaca 1380 tgacctttgt tttggacaac atgtgattaa ttactgtatg ccaggtgcta atccagcaca 1440 aatatttgat aagattttaa ataccacaca ttgtctaaac actgtcataa taattatgtt 1500 agggaggaga gaaaacgtta accaaaaaca aattttgcat tatgttgaac aagttaatat 1560 gttaaatatt aaaaaaatta ttatgttcac ctttccttac ttgcaggact tgccgacaga 1620 aaacaaaatt agatttaata ttaataatgc aatttataat gcttttgata ataataactg 1680 catcttgaca tccaaaaaat atgcatttaa attaatagac ataaataata taataaaatt 1740 tagatattta aggacatcaa aaaatgacat atttaaatta agtaataaaa attttagaca 1800 agtagcaaca tcgctacagt attttataca caatttccta gctataaatt tagctaaaaa 1860 aatacctgct tctattgagc aggaaaaaaa tatgacatgt tttaagactt tggaatctgc 1920 ttccaattta aattaattag taagacaaag gagtcacact cttgcaatta caaatcaatt 1980 aatacaacaa ctagattata cccatcaaaa gacgttaacg tttcaattat taacatattc 2040 aatttggtgc atcaaaattt acaagggatg atgggtaaag agctagaaat tgaaatgttt 2100 ttaaatatgc ataatattaa tgtattctgt gtcactgagc actggtttag aaactatgaa 2160 ttattgttta attttaatga tcaccagttg tcaagttcct tctgcagaga gaacgctatt 2220 cgcggtggct cactgattct cgttagtaaa tgtttgaaat gtaaggaacg aaaggatgta 2280 gtggccctct ctgttgagag aattattgaa atttcctgtg ttgagcttga gcacctcatc 2340 gttgtctgcg tctacagacc tcctgatgcg ttatatgatt cttttgaaaa tattttggaa 2400 aatgtattgc taaagctttc tgtctctaat aaacaaattt ttgtatgtgg tgattttaat 2460 attaatcttt tagaaaacac aaataccact attagattca gaacattgtt aaagtcatat 2520 aacctcccaa acttattttt agagcctact aggacaacga ccacatcggc aacatgttta 2580 gataacatat ttacaaatgt aacaccgatt aacaaaaaaa ttattaatca gttaacatca 2640 gaccatagtg gacaatttgt gtcttttgaa tctaatatga attctaaaga taaaaatact 2700 cttgtgattg ttcctattaa taaaaaacga attgagaagt ttagaaataa tattaagcaa 2760 gaacattcag aaattattta taacaaaaac ccaaatttgt cataccagaa tatgtttgag 2820 agaattaatt tggtctttac tgatgtattt attcctaaaa ctgtaacatt aaaaaacaaa 2880 acagtgttca gcgagtgggc caccaccgga gtgtacaaaa gtagaaaaaa gttatatgat 2940 ctgtactctg aaaaagctta taataatgat gaaacatttc atcaatatgt caggaactat 3000 tcaaaactat ttagaaaagt ttgtttagcg gcaaaatctt tacatttaag tgatttaata 3060 aaaaatgccc ctgacaagat aaagatgact tggaacatca ttggtagaga aagtggaaaa 3120 gtcagaaata gccaccaaga attctcatta aaaatagatg ataaattggt aactagtcat 3180 gaagatgtgg ctaatgcttt tgaaaagttt ttctctgaca ttccagtttc aactaccact 3240 tctctaaatt catcccccac agcagctgaa atattattgc ataaccatgt caacaaatgt 3300 aatgaaattt ttaaatttaa gaaaattaat tcaagcaata taataaaaag ttttaatagc 3360 cttaatgtaa aaaatacagc tgacttgtgg ggaatatcag taaaggtttt gaagtctgta 3420 atagacatta ttgctcctca tcttgttagt atttttaatg attgtattag gtgtggtgta 3480 tttcctgact taatgaaaca tagtaaagtg attcctcttt ttaaatctgg tagtactgat 3540 gacccctcta actatagacc tatttcagta ctccctacgt tgagtaaaat ctttgaaaaa 3600 attattttga cacaactttt agaacatttt aattcaaata acctgcttca taataaacaa 3660 ttcgggttta ccaggggtcg ctctacaact gatgcaggtg cttatctagt caaaaatatt 3720 tttaaatctt gggaggaatc gcatgattgt cttggaattt tctgtgactt atccaaagca 3780 tttgactgtg ttgaacatga aacattggtg aggaaactac atcactatgg tattagggat 3840 ggtgcattgg aacttattac ttcctattta tcaggaagga tacaaacagt agatgtgaaa 3900 ggtaatagat cttcaggcac cttgttgaaa atgggtgtac ctcagggttc cattttgggt 3960 ccttttttat ttctaatata tataaacgat ttacctagtt ttattgagtc ccgacacgag 4020 gtcgtattat tcgcagatga tacatcttta ttatttaaaa ttaaacgaca attacaagtc 4080 tatgacgaag tgaatgatgc gatttcgtgt gtggttcatt ggttccgtat caataaccta 4140 ttattgaata gtaagaaaac taaatgtatt aaatttactt taaaatgtat taaaccatct 4200 ttaaatgtga gacaagtaga tagtaatgta attgtttctg aggaatcatt ggagcttgtt 4260 gagtcaaccg tatttcttgg tataacagtg gactccaaac tgcagtgggg acctcatatt 4320 cataaattgg cgagtaagct cagctctgca gcatacgcag taaaaaaaat tagaatgtta 4380 acaaacgcgg acacggctcg tttagtttac tttagttact tccacagtgt catgtcctat 4440 ggcattttgc tatggggcaa tgcggccgat gtagaaatga tatttattct gcagaaaaga 4500 gctatacgtg ctatttataa catgcactca agggaatccc tgagggagaa atttaaagaa 4560 attaaagttc tcactatgcc atcccagtac atttttgaaa atttgatgta tgttcgtaaa 4620 catattgagg agtttcctaa aatgtcggac atacataata gaaatactag gaacaaacac 4680 aagcttgttg tgccgatgag taggttacat aagatacgaa attcattcgg gtgtttgtct 4740 gtgcgcctgt acaacaaaat cccacaagat gttcagaacc tacatataca taggtttaag 4800 aaaactatta aagaacatct gtgcaataaa gcttactata aagtcaatga ttatctagaa 4860 gattgcacaa agtgggaatg agttgctcgc tccgggcatt tcaatattgt agtaattgtt 4920 atgttattat actcatggta aaaatgaata tttaaaaaaa cctaatatta aaaaaaatat 4980 ctaatattta aaaaaaaaaa acatgcccgc tgagtttctt gccaattctt ctccggacgg 5040 aggctagttc ttgtgaattg gcggtagttc ttttgacgtt caacaagtat gtactttcat 5100 ttatgtggaa taaaaaattt tttgatttga tttgatttga 5140 // ID hATx-10_SM repbase; DNA; INV; 2980 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hATx-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-10_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2980 RA Jurka J.; RT "A distinct, diverse family of hAT transposons from Schmidtea RT mediterranea."; RL Repbase Reports 8(2), 26-26 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(644..1537,1651..2541) FT /product="hATx-10_SM_1p" FT /translation="MNGKKETPWKYFKREKDTGICQIQACGKRIKAAGSST FT SGLHSHLRTKHNINLLKRDHDHINNDAVVETSETIRSGQITSFLRKKIDDS FT LPAVISRMIAVDGLPFISICKSYDIRKLLEGKYENIPKSPSAIQNIVINYG FT KNAKKEVTDELLFRKRKLIKFSLTLDEWTSIRNRRYLNINVHDGTTFWNLG FT LSRIHGKMPAEKCEEILRNKLENFNISLKDIVGCTTDGASVMRKFGSNLNI FT IHQLCMAHGIQLAVVXILYKKDKPTEIDEIENQDQENIEIEDQETIEFEDH FT VGRFSNSNSLRKEKSLLYDSKTRWNSLLTTLERFYELKDCIRKSIIDLNLD FT FAFNQAELKLLHDIISALQPVKVAVEALCGSDINLLSADVVLIFMLDELTA FT LNSALSIKLKSELVYRIQERRTIYSKVLQFLHNPEQQNHNDHSIFKLPSKT FT TITKLIIELIEKLNLYEEENENEDDENPEEVILNEDMPSDNNFCSLKEKLY FT IKMHQNLHYNVRNERVNKSSNLQTTIRKEMKTFCEENIRGKHLEMVYQYLK FT TIRPTSVESERAFSAAGLICTRIRSRLADESLDTLCFLKSYFNTKKIL" XX SQ Sequence 2980 BP; 1141 A; 419 C; 481 G; 938 T; 1 other; tagggattcc attaccggta ataccgaaac cggtattacc ggtttttgat gaaaactaca 60 tcttcggtat taccggtatt acaaaataaa aaaccggtaa ttaccgaaat ttattaattt 120 cgttaaaaat gatatatttt aattatattt taagtcctat ttctttgctg actggcattt 180 catggtctac gtctacatca acacaaaaca gatgggtgta catttctaag aagcttttat 240 attttatgac ccccaatata tctaatttat ctggacggat tggagtcagc gaatggccgt 300 cggaagtaat tccgatacat ttgacggctc ttttgaatag caggtaaaaa aggccggccg 360 gaaaggcaag atctttttcg gtctgtttct caaaagagtc gccaaattta tacagttttg 420 ttttgacatc cttgctgatc agtaatacgc gcagcataaa aagaatcggt tcgtcaagtt 480 cgtcgagatt ttttcgtgtg aacattatta tacacaattc aacaaaaaat acaagtttcg 540 acgttaaaag tttctgtaaa aggtaaattt ttaattattt tttattactt atttattttg 600 tatcgattat atttttttaa ataaataatt attttacagc acaatgaacg gaaagaaaga 660 aacgccatgg aaatacttca aacgagaaaa ggatactggc atatgccaaa tacaagcttg 720 cggtaagcgc atcaaagcag ctggtagcag tactagtgga ttacattccc atttaagaac 780 caagcacaat attaatttat taaaaagaga ccatgatcac atcaataacg atgctgttgt 840 cgaaacatcg gaaactattc gttctggtca aatcacatcg tttttgcgga aaaaaatcga 900 cgactctttg ccagctgtaa tatcgcgtat gatcgctgta gatggtcttc cgtttatttc 960 aatatgtaaa tcatacgata tacgaaagct gcttgaagga aaatacgaaa atattccaaa 1020 atctccaagt gccatacaaa atattgttat caattatgga aaaaatgcca agaaagaagt 1080 aacagatgaa ttactgttca gaaagcgcaa gttaatcaaa ttcagtttaa ctcttgatga 1140 gtggacatcg attagaaatc gtagatattt aaatatcaat gtgcacgatg gcacaacttt 1200 ttggaacctc ggcctcagca gaattcacgg gaaaatgcca gcagaaaaat gtgaagaaat 1260 tttgagaaat aaattggaaa attttaacat atctttaaaa gatatcgtcg gctgcacaac 1320 agatggtgca tcagtaatgc gcaaatttgg atctaattta aatattattc accaattatg 1380 tatggctcac ggtatacaat tagcagttgt ayaaatttta tataaaaaag ataaaccaac 1440 agaaatcgac gaaattgaaa atcaagacca ggaaaatatt gaaattgaag accaggaaac 1500 tattgaattt gaagaccatg ttggacgatt ctctaattaa tgatattaaa gatcctagcg 1560 tcaattgtct aattaataaa attagaacga ttataaaaat atttaaaaaa tcacctacaa 1620 aaaatgatga agttttgcaa aaacatgtga tcgaacagtt taagaaaaga aaagtcatta 1680 ttatatgact caaaaacaag atggaatagc ttgttaacaa cactggaaag attttatgag 1740 ctcaaagatt gtattcgtaa aagcattatc gatttgaacc tagattttgc atttaatcag 1800 gctgaattaa aattattgca cgatattata tccgctttgc agccagtaaa agttgctgtg 1860 gaagctcttt gcggatcaga tattaattta ctttctgcag atgttgtatt aatattcatg 1920 ttagatgaat taactgcatt gaacagcgca ttaagcatca aattaaaaag tgaacttgtt 1980 tacagaatac aagaaagaag aacaatttat tcgaaagttt tacaatttct acacaaccca 2040 gaacaacaaa atcacaatga tcacagcata tttaaattac cttcgaaaac aacgataacc 2100 aaattaataa tagaattgat agagaaatta aatttatatg aagaggaaaa tgaaaatgaa 2160 gatgatgaaa atcctgaaga agttatttta aatgaagata tgccgagtga caacaatttt 2220 tgctcattaa aagaaaaatt atatataaaa atgcatcaaa atcttcatta taatgtgcga 2280 aatgaaagag taaataaaag ctcgaattta caaactacga ttcgcaaaga aatgaaaact 2340 ttttgcgagg aaaacattcg tggtaaacat ttagaaatgg tttaccaata tttaaaaacc 2400 attcgtccta ctagtgtaga gtccgagaga gccttttcag ccgcaggatt aatttgcacg 2460 agaataaggt ctaggttagc ggacgaaagt ttagatacat tgtgtttttt aaaatcctat 2520 tttaatacaa aaaaaatatt gtaaataaat tcattctatt tttttctgtt tctttttata 2580 aaatattttg atttaataaa tgtaatgtaa tgatttgaat gtatgtattg taataatgta 2640 ataattagtt ttaataaatt gtttgacctc ctcattgtat tcaattttat cgattaataa 2700 aacattttca aaaaaattgt attgtgtatc ttttataaaa tttaataagt ttaatgatta 2760 tgaaagtaat taacaattaa ctaacgtaaa ttatcaaatt aattaattat taataaccaa 2820 tagttaacaa attaatgata aatgaaacaa ttgggagaat taataataga agttgaataa 2880 aagaagacgg tattaccggt aaaaaccggt aataccgtcg atagaaatat ttttttcggt 2940 aataccggta ttgcaaaagt cggtattttt ggaatcccta 2980 // ID Gypsy-624_AA-I repbase; DNA; INV; 4559 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-624_AA_; KW Gypsy-624_AA-LTR; Ty3_gypsy_Ele75; Gypsy-624_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4559 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3503-3967] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 87..1298 FT /product="Gypsy-624_AA-I_2p" FT /translation="MSEGNAGQIPSQNANQQPPANMQQQFRPPPVVNVPPA FT AVNFPPPGQVDPYLLWFQQQQQNYVAELFRQQQEVIRQQQEVMQQQQQAFM FT SQQEQLIRNILTSIQVQVPSNPEAILDSLASNIKEFRYDPENNVTFAVWYS FT RYEDLFEKDASRLDGEAKVRLLMRKLGMSEHERYVSFILPKAPKEFDFSTT FT VKKLSALFGAAESVISRRYRCLQISKQPQEDYVSYACRINKSCVEFELSKL FT SEEQFKCLVFVCGLKSDKDAEVRLRLLSKIEERNDVTLEQLSEECQRLLNL FT RHDTAMIEEHSTAVNALRTNQQRFKSGSPSKFHQSGGEKSSFGSSSVKTSP FT ETACWLCGGMHYARECPYKSNKCNDCGKVGHKAGYCRGSEGRAPATEAECR FT GSESSDRQLL" FT CDS 1745..4546 FT /product="Gypsy-624_AA-I_1p" FT /translation="MAPVFRPKRPVAYAMYQAVDNELDRLQREKIITPVDF FT SEWAAPIVVVRKANGKIRICGDYSTGLNEALQPHQYPLPLPQDIFANLANC FT TVFSQIDLTDAFLQVEVDEGCRDLLTINTHRGLYRYNRLPPGVKAAPGAFQ FT QLIDTMLVGLKGVSGYLDDIVVGGVDEEDHNRNLRAVLQRIQEFGFTIRAE FT KCSFGKNQIRYLGHLCDRHGIRPDPAKIEAIQKLPAPTDVSGVRSFLGAIN FT YYGKFVPNMRTLRFPLDELLKAKGEFRWTAECQRSFDRFKEILGSDLLLTH FT YDPRREIIVSADASSIGVGATISHRFPDGSVKVVQHAARALTKAEMGYSQP FT DREGLAIVFAVTKFHKMIFGRKFRLQTDHAPLVRIFGSRKGVPVYTANRLQ FT RWALTLLLYDFSIEYVQTEKFGNADVLSRLIDNHAKPEEDYVIASVILEED FT LRFVADEAVSCLPLSFKLVEQETQTDDQLRKVYRYLREGWPEEAKIDDPEI FT RRFHGRRDSLCTVGKCIMFGERLVIPEKHRQRCLRQLHRGHPGILRMKALA FT RSYVYWPSIDDEIVQYVKACKHCASVAKSPPKAAPVPWPRPTGPWKRVHVD FT FAGPIDGVYYLLAVDAHSKWPEVVATQRITSTATISILRSIFARLGMPETL FT VSDNGTQFTSSEFQKFCSDSGIDHVTTAPFHPQSNGQAERFVDTFKRALKK FT IQEGKIGVGEALDVFLLTYRTTPNRQVEEGKSPSEAMFGRRIRTSLDLLRP FT PPVRPPSEDKEENRRSFSAHDTVYAKVYSNNKWRWAPGTVCEKIGKVMYTV FT WVEDQRMVRAHVNQMRSRAGTPASNSKPQSSALPLDVLLDAWSIPRATPAV FT STSSLLPLNSTEAPSGGSSPLQPVPSTSSSLSTSLSSSTSSVSASPEFASA FT DSEPATPVQLPRRSSRARRPPQWFDPYHLY" XX SQ Sequence 4559 BP; 1096 A; 1135 C; 1339 G; 989 T; 0 other; aaagtggcga cttcgggagt ggcaaatatt gcgaaaatta gtgcaattaa ttagccacgg 60 tgacacccac cgcgaaacgg cgaaggatga gtgagggtaa tgctggtcaa attccatcgc 120 aaaatgcgaa ccagcagccc ccggcgaata tgcagcagca gtttcggccg ccgccggtag 180 tgaatgtgcc tcccgcagca gtgaattttc ctccccccgg acaagttgac ccatatttac 240 tgtggttcca gcagcagcag caaaattatg tggcggaact ttttcggcag cagcaggagg 300 tgattcgaca gcagcaagag gtgatgcagc agcagcagca agcattcatg tcccagcagg 360 agcagctgat tcggaacatt ttgacgtcga tccaggtgca ggttccgtca aatccggaag 420 cgatattgga ctccctagcc agcaacataa aggagttccg atatgacccg gaaaacaatg 480 ttacgtttgc tgtgtggtac agtcggtacg aagacctttt cgagaaggat gccagccggc 540 tcgacggtga agcgaaagtg cgtttgctga tgcggaagct gggaatgtcg gaacacgaaa 600 ggtacgtcag tttcattcta ccgaaagcgc cgaaggaatt cgacttttcg acgacggtga 660 agaaattgag tgccttgttc ggagctgccg agtccgtcat tagtcggaga tatcgatgcc 720 tgcagatttc gaagcagcca caggaggact acgtctcgta tgcatgccgc ataaacaaat 780 cctgtgtgga gttcgagctg tccaaacttt ccgaggagca gttcaaatgt ctcgtgtttg 840 tgtgtggact gaagtcggac aaggatgcgg aggtacgatt gcggctgctg tcgaagattg 900 aggagaggaa cgacgtgacg ttggaacaac tgtccgaaga gtgtcaacgt ttgttgaatt 960 tgcgacacga tacggcgatg attgaggagc actcgaccgc cgtgaatgcc ctgagaacca 1020 accagcagcg gttcaagagt gggtcaccgt cgaaattcca ccagtccggt ggagaaaaga 1080 gcagtttcgg atcgagcagc gtcaagacga gtccggaaac agcttgctgg ctatgcggag 1140 gaatgcacta tgccagagag tgcccctaca agagcaacaa gtgcaacgat tgtggcaagg 1200 ttggccataa agcgggttac tgcaggggca gcgaaggtcg agccccggcg acggaagccg 1260 aatgccgtgg cagcgaaagt agtgaccgtc aactcctgta gcgtgcaaca gcgtcggaag 1320 tacgtgccgg ttggaatcaa cggatcacca gtgcgccttc aaatcgatac cggttccgac 1380 atcaccatca tttcttcgga atcgtggagg aagattggca gcccttcaac gtctgtgtca 1440 acggtcgtcg ccaagtcagc tagtggtagg cccctgcaga ttgaaggtga gtttgcatgc 1500 gatctttcta tcgatggaca agtttgcgtg gcgtagtccg agtgacgaga gagaagttgc 1560 atcttctggg ttccgacatg atcgatgctt tcggattgtg gtcggtccca ttcgactcga 1620 tttgcaacag tgtcagcagt actccagcct ccgtgagcaa gcttcaggca gaatttccga 1680 gagtgttcgc aggtagtccg gactttgcaa gaaagcgaag gtgcagttcc agctgaagac 1740 cgggatggcc cctgtgtttc ggccgaagcg tccggttgcg tacgcgatgt accaggcagt 1800 cgacaacgaa ctggaccgtt tgcagcgtga gaagatcatc acaccggtag acttttcgga 1860 gtgggcagct ccgattgtgg tggtccgcaa ggcgaatggc aagatacgaa tctgcgggga 1920 ctattcgacg ggactgaatg aagcgctgca accccatcag tatccgttac ctctgccgca 1980 ggacattttt gccaatcttg cgaactgcac cgtcttcagc cagatcgatt tgacggatgc 2040 gtttctgcaa gtggaggtgg acgaaggctg tcgtgacctc ctgactatca acacgcaccg 2100 aggcttgtat cgctacaatc gtcttccacc gggtgtgaag gcagcacccg gagcgtttca 2160 acagctgatc gacacgatgt tggttgggtt gaaaggagtg agcggctacc tggacgacat 2220 agttgtcggc ggcgtcgacg aagaagacca caatcggaat cttcgagcgg tgctgcaaag 2280 aatccaggag ttcgggttca ccatccgagc ggaaaagtgt tctttcggca agaatcaaat 2340 ccgctacttg ggacatctgt gtgatcgaca cggaatacgg ccggatccgg cgaaaatcga 2400 ggctatccag aagctgcctg cgccgacgga cgtcagtggt gtccgttctt ttctgggtgc 2460 cataaactat tacggcaagt tcgtgcctaa tatgcgcacg ttgcgattcc cgctcgacga 2520 gctgctgaaa gctaaggggg agttccgttg gacagctgag tgtcaacgtt cgttcgaccg 2580 gttcaaggaa attcttggtt ccgatcttct gttgactcat tacgatccgc gacgagagat 2640 catcgtttcg gcggatgctt catcgatcgg tgtcggcgcg actatcagcc ataggttccc 2700 cgacggttcg gtgaaggttg ttcagcacgc agcgcgagcg cttacgaagg cagagatggg 2760 ctacagtcag ccggatcgcg aaggtttggc gattgtgttc gctgtaacga aattccacaa 2820 gatgattttc ggcaggaagt ttcgtttgca gaccgaccat gcacctttgg tccggatttt 2880 cggatcgcgg aaaggcgttc cggtgtacac agcgaaccgt ttacagcgct gggcgctgac 2940 gttgctgttg tacgattttt ccatcgagta cgtgcaaacg gagaaatttg ggaacgccga 3000 cgtcctttcg aggctgatcg acaaccatgc caagcctgag gaggattacg tgattgcgag 3060 cgtgattctg gaggaggatt tgcggttcgt agcggacgaa gcagtgagct gtcttccgct 3120 cagtttcaaa ttagtggaac aggagacgca aaccgatgat cagctacgca aagtgtaccg 3180 gtaccttcga gaaggctggc cggaagaagc aaaaatcgac gatccggaga ttcgacggtt 3240 ccatgggcgg agagattcgt tgtgtactgt tgggaagtgc atcatgttcg gagagcgatt 3300 ggtgattccc gagaagcatc gacagcggtg cctgcgacag ttacaccgag gacaccccgg 3360 aattctgcgg atgaaggcgt tggctaggag ctacgtatat tggccgtcga ttgacgatga 3420 gatcgtccag tacgtgaagg catgcaagca ctgcgcttcc gtagcaaaat ctccaccgaa 3480 agcagcaccg gttccgtggc caagaccaac aggtccatgg aagcgtgttc acgtggattt 3540 tgctggtccc atcgacggag tgtactacct gctagcagtc gacgcgcatt cgaagtggcc 3600 tgaagtggta gcaacgcagc gaatcacgtc cacagctacc atcagcattc tacgaagcat 3660 ctttgcccgg ctgggaatgc ctgaaacgct cgtcagtgac aacggtacgc aattcactag 3720 cagcgagttc cagaagttct gcagcgatag cggcatcgat cacgtcacca cggcaccttt 3780 ccacccacag tcaaacgggc aggctgaaag gttcgtggac accttcaaga gggcgctgaa 3840 gaaaattcag gaggggaaga tcggagttgg cgaagcgttg gatgtcttct tgctgaccta 3900 ccgcacaaca ccgaatcgac aggtcgaaga aggtaaatca ccgtccgaag caatgttcgg 3960 ccggcgcatc agaacgagtt tggacctact tcgccctcca ccggtgcgtc caccgagcga 4020 agacaaggaa gaaaaccgaa ggagcttctc tgcacacgat acggtctatg cgaaggtgta 4080 cagcaacaac aagtggcgct gggcacctgg aaccgtttgt gagaagatcg gcaaggtgat 4140 gtataccgtt tgggttgaag atcaacggat ggttcgtgcg cacgtcaacc agatgcgcag 4200 tcgggccggc actccggcaa gcaacagtaa gccgcagtcg tcagcgctgc ctctcgacgt 4260 tctgctggat gcgtggagca ttccaagagc tacgccggca gtctcaacat catctctgtt 4320 gccgttgaat tcaaccgagg ctccatctgg tgggtcctcg ccgttgcaac ctgtgccgtc 4380 tacgtcctca tcgttgtcta cgtccttgtc ctcgtcaacg tcatctgtat cagcatctcc 4440 agagttcgca tcagcagata gcgaaccagc tacaccagta caacttccac gtcggtcttc 4500 ccgagctcga aggccgcctc agtggtttga tccgtaccat ctctattgag aagggggga 4559 // ID Sola1-2_LG repbase; DNA; INV; 4396 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version 1) XX DE Sola1-type DNA transposon from the owl limpet. XX KW Sola; DNA transposon; Transposable Element; Sola1-2_LG. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX RN [2] RP 1-4396 RA Jurka J.; RT "DNA transposons from the owl limpet."; RL Direct Submission to Repbase Update (22-MAR-2011). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(1176..2249,2253..3458) FT /product="Sola1-2_LG_1p" FT /translation="MNNSMDTIAGTETFGIWTDNSKEINAANCIDEYNPDL FT KMIESILQSDISMKRKNEPMVLTDNLIEYTTSQEQDNANKSIFDIQENGEC FT LKFRRLEPRVPNDTSLVYTWSPRFTFQSNEIMPSNETATRSMSLPENKERP FT MYSQLSEDELTDLDDSVADPSYRPKRREFLDAYSNEEEEVTIEDENLTPAK FT LKPNKPANPDLWTRNVNMNRRMKGKDYTGSIIENGKSLKVQKPGRQLRPRG FT CTDFCKKSKQRFCDVFSEDDRLTLFNKFWALNWHQKQSVVVHSVSKKKVAE FT RKVEGNTDGIFRKKQSFVYSLQLNEKSYTTCKQMFLTTLDLGEKALYRWLN FT HSESGIPSPAKAPKRRKVLKTTGVQHDQAGDFLQSLCKLPSHYCRSSTSKS FT YLEPIFVSFEDLYRIYKDHCLSNAGTALCKKAFHSVFDELNLALFRPKKDQ FT CDVCCAFENQNLTKDVYDNHIQRKADARLSKVQDKLVAENDESVKVLTMDL FT QAVLLSPRLQASAMYYKTKLACHNFTIYDLATKDVTCFFWYEGEGELTANT FT FASCIFEYVRMMSPKVKTLIIYSDGCTYQNRNTTMSNTLLKCAANNDITIV FT QKYLEKGHTQMEVDSVHSTIEQKIRNRPIYCPQSYIDLIKSVRPKQPCKVF FT YLSHTFFSDCAKLGYYNSIRPGTKVGDPVVTNLRVIQYTPDGIIKYKINYN FT DEYSDLPRRSKISTASPLDAIPRLYSGPIKIKSSKYQHLQQLLSVLPQDYH FT PFYEGLLHD" XX SQ Sequence 4396 BP; 1588 A; 826 C; 766 G; 1216 T; 0 other; tcgtcggtgg gatatcaaaa ggtcgtatat ccggttttgg cgcgaaagcc tttttctgca 60 ctaacgtaac ccccatgaat aggagcatcg actggtacag ataccatatc tctaagccaa 120 ttaaaagtga catattgggt catatgatta aaggatgtat atcgattggc ttaaaataat 180 gtatgtcaaa aagacgtata tctttgtcaa caataacaac tgacgttata aggagcctac 240 tttgatgagg taagaaaagg gaataaaaaa cgtattaagt ctatttattt cattcatttt 300 agcagcttta cttgttctgt gatcacccga ctttggccta tttcaaatac caacttctta 360 aacagttatg ataccgtatc atcgtttaag tatcgctatc gagaacgtca agtgatatac 420 gtcctttttg catggataca tcattctatt tcatttgttc ttgtttattg cagaaaatgt 480 caaaaagacc accttccaga ggacagcagc ttgtattgtt gtccctgaaa tccagatctt 540 aaataaattc aataaccgaa aatggaaaat cacccatcat aacaaataac aaagactcat 600 ctaaaaaaac aaaaaccata gacttacctg aaacaagaca aaacaaagac ttacctgtaa 660 taaaagaaaa caaagactta cctgtaataa tagaaaacaa agacctacct gaaacaagag 720 aacgcaaaga cttacctaaa acaagagaaa acaaagactt aattgaaaca agagaaacca 780 aagacttacc cgaaacaaga gggaacaaag acttgcctga aaccagagaa aacaaagact 840 tgcctgaaac aagagaaaac aaagacttac ctgcaaccag agaaaacaaa gacttacctg 900 aaacaagaga aaacaaagac ttacctgaaa cgagagaaaa caaagacttg cctgaaacca 960 gagaaaacaa agacttacct gaaacaagag aaaacaaaga cttacctgaa acaagagaaa 1020 acaaagactt aactgaaaca agagaaaaca atgacttacg tgtaataata gaaaatggta 1080 actcgcctaa tatatcagaa acaaaggaga gtagtaaaat aacaccgata gaagacattt 1140 ggacagctac ggcaatagca cccatagaaa aatggatgaa taacagcatg gatacaattg 1200 caggaacaga aacctttgga atctggactg acaactcaaa agagattaac gctgcaaatt 1260 gcatagatga atataaccca gatctcaaaa tgattgaaag tattcttcaa tcagatatct 1320 caatgaagag aaagaatgaa cctatggttt tgacagataa cctaatagaa tacacaacta 1380 gccaagaaca agataatgct aacaagtcaa tatttgatat tcaagaaaat ggagaatgtt 1440 taaaatttag acgattggaa ccaagagtac caaatgatac aagtcttgtt tatacatgga 1500 gtccaagatt tacctttcaa agtaatgaaa taatgccttc caatgaaact gccacaagaa 1560 gcatgtcact tccagaaaac aaagaacggc ctatgtacag ccaattatca gaggatgaat 1620 taacagatct tgatgattct gtagcagatc cttcttacag acccaaacgg agagagtttt 1680 tagacgctta cagcaatgaa gaagaggaag ttacaattga agatgaaaac ttgacacctg 1740 ctaaactcaa gccaaataaa ccagcaaatc cggatctttg gacaagaaat gtaaatatga 1800 atcgacgtat gaagggaaaa gattacacag gatctataat tgaaaatgga aaaagtttaa 1860 aggtacaaaa accaggacgg caactgcgtc cacgaggctg tactgatttt tgcaagaagt 1920 ctaaacaaag attttgtgat gtgttctctg aggacgatag acttaccttg tttaataaat 1980 tttgggcttt aaattggcat caaaagcaga gcgtagtagt ccattcagtg tccaaaaaga 2040 aagtagcgga gcgtaaagta gaaggtaaca cagatggaat attccgaaaa aaacaaagtt 2100 tcgtctactc tttgcaatta aatgaaaaat cttacactac ttgcaagcaa atgttcctta 2160 caactctcga tttgggagag aaagctcttt accgctggtt aaaccattca gaatctggta 2220 taccatcacc tgctaaagca ccaaagagat aacgaaaggt actgaagaca accggagttc 2280 aacatgatca agctggagat tttctgcagt ctttatgcaa attaccttca cattactgca 2340 gatcatcaac gtctaaatca tatttagagc ccatcttcgt ttcatttgaa gatctatata 2400 gaatttacaa agatcattgt ttgtcaaatg ctggtactgc tctttgtaag aaagcattcc 2460 acagtgtctt tgatgagtta aatttggcac tctttcgacc aaaaaaagat cagtgtgatg 2520 tttgctgtgc gttcgagaat cagaatctga ctaaagatgt atacgacaat cacatccagc 2580 ggaaggctga tgcacggcta tctaaggtac aagataagct tgttgctgaa aatgatgaat 2640 ctgtcaaggt acttaccatg gatttacagg ctgtattatt atcacctcga ttacaagcat 2700 cagcaatgta ttataaaacc aagctggcat gccacaattt tacgatatac gatttggcta 2760 ctaaagatgt aacatgcttc ttttggtatg aaggcgaagg tgaactaaca gccaatactt 2820 ttgcttcatg catattcgag tatgtaagga tgatgtcgcc aaaagtgaaa acgctcataa 2880 tttacagtga cgggtgtacg taccaaaacc gtaacactac catgtctaac accctgctca 2940 agtgtgcagc caacaacgat attaccatag tccagaagta tttggagaaa ggtcacactc 3000 agatggaagt ggacagtgtt cacagtacaa ttgaacaaaa gatcagaaac agacctatct 3060 actgccccca gagttacatt gatttgatca agtctgtccg ccccaagcaa ccctgcaaag 3120 tattttattt gtctcacacc tttttttctg attgtgccaa acttggctat tacaattcta 3180 ttagacctgg aacgaaagta ggtgatccag ttgtaactaa tttgcgagtt atacagtaca 3240 ctccagatgg aataataaag tacaaaataa actataatga tgaatattca gatctcccaa 3300 gaaggtcaaa aatatctact gcatcacctt tagatgcaat accaagattg tactccggac 3360 caattaagat aaagtcgtct aagtaccagc atcttcagca gttgctgtct gttttgcctc 3420 aggactatca cccattctat gagggtcttt tgcatgatta attgaccaaa tatgagctat 3480 agtgctatgt tgtgttgtaa caaatttaat gaaacgtttc tgattataaa ttgattgaat 3540 tattataatc ctacttgctc ttatctatgt cattttgtca gattaaacag actgtgctaa 3600 tgaaacgcct ctagagccgt atacctttcg aggtcttgca ccatgtatct attatactta 3660 ccttctcgac cttgagccat gtacctgttt tgctttcctt cctggatatt aaaacattta 3720 cgggttatac ttatcttgct ggaccctgca ccatgtacct gctgtactta cctcgtcaga 3780 tactttttat gcttaccttg ctggatctta aacaatgcac ctactatact taccttgctc 3840 gaccttgaac caaaaaccta ctttactttg cttacatcat gtacttgtta gacttatctt 3900 gctgaaactt gaaccatgta cctgttgtac ttacctttct gggccttgaa ccatgaaatg 3960 tgccagttac tttgaacaag ttaggacgta cctcgtcttc catgcacata tttaagaaat 4020 aaaaagacgt atatcctatt tatagcgaca ataaaataaa aacttactat tttaaatcaa 4080 aaagttctta gacaaccgaa aaattcaaat tgatgaatat tttgttgaga tacaatgctg 4140 tttagcatgt atctgcaaaa ggatgtatat ctcaaaatat cagacttttg taacaaaaat 4200 gatgtatgtc tggcacgtgc ataatctgac ttaacatact ttttgatttt taattcatta 4260 agtcaagttt gaaaagatga gcactcggta caattttggt aaaaattcaa ttaattttga 4320 gtctttgggg agttttttgc atttcctgaa aaatttgaaa attcggatat acgacctttt 4380 gatatcccac cgacga 4396 // ID TTAA23_AP repbase; DNA; INV; 586 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA23_AP. XX NM TTAA23_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-586 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2091-2091 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 586 BP; 212 A; 81 C; 98 G; 188 T; 7 other; gggggttgga gcatagcata ttttaaggag taatatttag gcagttcata ttacacgtat 60 atttgaactg tgtcattgtg tgagtagtaa atgaacatta gtgtgtcctt atcaatcatc 120 aggctaatga caatggtaac gattctaaga acagtctcac acaagcacgc gcgtgctata 180 agtttttnaa atcggtagta cacaanacag ctaaaagaga aagatttatt tagttacttc 240 ccgtgacccg atttaaattt tgaaaaaaga tctgtattcg ataacttttt tactaggtta 300 tgtaggaaaa ataattgatg ttttaaaact gnatgtaagt gtataaaaat aaatactttc 360 atgatttttn aaaaagtaaa catattttta agccttattg aatanaaaaa tcaaaaatct 420 gtttcctaca taacctagaa aaggggatga tgagttcata cgtanaanat aataatattc 480 taatgagttc atacgtattt taaaataata atcggacatc gggtagtacg taaactaatc 540 ttccgcttat tggtctaaaa ctacaaaaat atgtgctcca accgcc 586 // ID SMAR8 repbase; DNA; INV; 2729 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR8. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2729 RA Jurka J.; RT "SMAR8: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 997-997 (2007). XX DR [1] (Consensus) XX SQ Sequence 2729 BP; 1017 A; 371 C; 418 G; 923 T; 0 other; tactacagtc ttccgccgaa catagcgcct aggcgttata tgtgaatact gcgcctactg 60 ggcgttttat tcgaacataa cgcctaggcg ttatgttcga ataaaacgtt gtgtgaaaac 120 tctaaatatg acacacaacc aagttttttt ttcatttgta atatgacaca tttgcttatt 180 gggtacttta cattaaaaaa taaaaattat atacgaaatt ccaatctgtt gcgattcaga 240 tttgctgatt acattttttg gttatatttt tttataattt tcgtagtaag caattacaat 300 tcaaaatatc atgttttgac tacttctact tttactgagc aatatatgtt tacgatttga 360 cagcacatga ataaatgtta ctttatgaaa aaaattcaag tttgttttta taaaattgag 420 agtttgtttt tataaaaaaa ttatagaaat taatactcct tttacaaatt aacaaaagtg 480 gatgttttta gcacgctatt tcgtaaaggt agacttttta attttaatta atgcaaagat 540 catttcattt attttaaatt ttatgatttc agccatagcc atagaatgtc aaataaccta 600 aaaagaaaaa attattcaat taaacataaa ctaaaaattt taaatgaata caatgccggt 660 gtgaaaggca gtggctggat ggcattgtca aaaaaataca acattggtac atcaattctt 720 aggggatggc acaatcaaaa agcacaactt cttcaaactt taaacaatca atccgtgcaa 780 ataaaagttg cacgaagatt acctggtggt ggacgaaaat tggaatttcc tgaaattgaa 840 aatgaagtat tgcaatggat acttgataga aattctaagg gcttaagggt taaagataag 900 tacattcaat taaaatcaca agaagtttgt gaacgtttga aaataataga tactgaaaac 960 aattccaagt ggattaattt caaacattca tcaggttggt gtaaacgatt taaaaatcgc 1020 catgagttag tatctcgacg tcaaaccaca acaaaaaaaa tccctgaaaa tgttaatgct 1080 attgtaagca attttttaaa aataattcat gcaaaaattg aggaaaaaca aataaaaatg 1140 tgcaatatca ttaatttcga ccaagtacca cgctatttcg aatccgaacc aaaaacgact 1200 ataacccaga agggatgcag agaagtttta atgagaaaag ctggtacctc acataaacgt 1260 ttcacaacca cattatgtat aactttagag ggtaaatttt taaaaccaca cgttctgttc 1320 tcgaatttaa aaaaaaacca attgtaaaca aaaagtgcct tgtaaatgtg aatagaactg 1380 gaatgtttaa cgatgacatc ttgaaacaat atattaaaga aattattttg tgtaggccac 1440 aaaccagttt agctcgtgat ccagtattgc tgataatgga ttcttacgga agccatgtaa 1500 aattacatga atctaaatat ctggaaaaat tcaatatttt tgttgtgtta gtgccttcga 1560 atttgacaag tctcctgcaa ccattagacg taggagtaaa tcgtagtttt caagcatatt 1620 ataatacacg gtatgatgaa tatatatcag aagcactcaa aaatcctgaa atgcaaactc 1680 gacaaggaaa tcctaaaatt cctaactata aaatagttag cgactgggtt gtggactata 1740 tatcgacaaa ggattctagc tttatagtaa aagcatttaa atgttgcgga attgtaccta 1800 aaatagattt taatattgat gcactccata ctactttaca aaaaattttg caacctactt 1860 ttaattacga tgattggatt catgaaaatg aagatttaat tgctgaaaat tcgaattttt 1920 ttgacaatga agattgtgaa gattggtttt atccagagga agtgtcaaca agttttttta 1980 gatgtgttca tcagataaaa gataaaaaca gttctttttc agtttttatg aataattaca 2040 tgccacaaat aatttcatat ataaaaagta atccagatac atctgacatt ttagatgagc 2100 atgacgaaga acttataagt cgtggcgaaa attcttcttc atatgccgaa atttatgcta 2160 ctgccttaaa ggaaaaatgg tgcataaaag tagtagaagt ggacgaaaat tgtgttcaag 2220 ttgaaaactt tacgtatgat gtagagtctc caacaaaatc tattctgtta gtcagaaacg 2280 aaaatttttt tgctttaaat ttaaaagcat agtctcttta tatttactta attttattag 2340 attacatata tatatttata tatttatgtg gtatatattt atatatttat gtggtatata 2400 tatatttata tatttatgtg gtttaagtaa tgtttatgtt cctatattta atatttttat 2460 ttttatttaa tatacacaca tataggagtt aataatccta aaataatatg tatatttatg 2520 tgaattctga aaagttatgt ttatgttatt tttttttatt aataaaactt aatttcataa 2580 aaaaatcttt tgtttcaaat atagtgccta ggcgttctat tacgctaggc gttctaaacg 2640 agaaaaaaaa tgcaaaaaat ttaaaaccga aataaaaagc tttcccgaat ataacgccta 2700 ggcgctatgt tcggcggaag actgtagta 2729 // ID DNA8-15_CQ repbase; DNA; INV; 2364 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-15_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2364 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 92-92 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >88% identity. CC 8-bp TSD. ~40-bp TIRs. XX SQ Sequence 2364 BP; 950 A; 223 C; 358 G; 833 T; 0 other; caggggcgcc gactagtcca aaatgttggg ggggcacggt tgttttccga aatcgatagg 60 taagagtggc gattagatgt taaagtttct tgtatatcac gaattttaac aatttgaata 120 ttacgatgtg atacgatttt ttttcatccg gaatccattt tttttagaaa aagataattt 180 attaaaacgt gattgagaat gttaattggg gggaattttt tatatgtttg gaaggcgaat 240 tccttctcat tggcatattc tcacttattt tctagaagta acagtcaata ataaaaataa 300 tcagatattt agaaattcaa caatacaaat attctaaaaa taaaaacaaa agtaaaaagc 360 aaaaatttag aacttcagaa atttaaaaat acaaatacac atttaaaaaa acatccattt 420 ttcaaatctg caatttagaa aaaaaataac aaaattcaat attccaaaaa aaaaataata 480 gtttaaaaaa attctagggt taaaaaaact catggcttga aaatgttaag aaatcaaaag 540 ttgaaattca gaaatttaaa aaatcagaaa tttagaaata acaataaaat tcaatgttta 600 aaaaaatcga aaatttaaaa attatgagac atcaaaattt tttaagaatt taagaattta 660 agaatttaag aatttaagaa tttaagaatt taagaattta agaatttaag aatttaagaa 720 tttaagaatt taagaattta agaatttaag aatttaagaa tttaagaatt taagaattta 780 agaatttaag aatttaagaa tttaagaatt taagaattta agaatttaag aatttaagaa 840 tttaagaatt taagaattta agaatttaag aatttaagaa tttaagaatt taagaattta 900 agaatttaag aatttaagaa tttaagaatt taagaattta agaatttaag aatttaagaa 960 tttaagaatt taagaattta agaatttaag aatttaagaa tttaagaatt taagaattta 1020 agaatttaag aatttaagaa tttaagaatt taagaattta agaatttaag aatttaagaa 1080 tttaagaatt taagaattta agaatttaag aatttaagaa tttaagaatt taagaattta 1140 agaatttaag aatttaagaa tttaagaatt taagaattta agaatttaag aatttaagaa 1200 tttaagaatt taagaattta agaatttaag aatttaagaa tttaagaatt taagaattta 1260 agaatttaag aatttaagaa tttaagaatt taagaattta agaatttaag aatttaagaa 1320 tttaagaatt taagaattta agaatttaag aagtttaaga atttaagaat ttaagaattt 1380 aagaatgctt atttcctgtt ttttttggca tattttttga gggtgcagtt aaaatttcta 1440 cggccttttt ttttaaatcg aacataaatt gcagaaggcg atacactttt gttttttaag 1500 attataatgt ggaattgtct cagattaggt tttaatttgt cggtttctca gtaaagcggt 1560 gcgctagttt tcaaaaattt actttttttg aaacaattgt atttttcggc taaaattggt 1620 atagaacggt tagcggttaa caaatgtatt tgaaatcttt tcaaatcctt tctaaaatta 1680 cccaaaaaat tatcgaattt atcagcattt ttgtaaatgt aaagcagtca acaaatgacc 1740 attttgaaaa gtgtttcagt ttatattcgc taaatcctga tctacaatgg caaatcgcag 1800 aatgttgcgt ttgaaaactt gatgattgtt ttcgcaagag tttgcgtaag tttgattgta 1860 gatgaagtgc tatggaaaaa gtacattggt tttgtttttg aaacaacatg cacagataga 1920 aatcttatga gcaaatccgt aaaatctatg ttgacgacat ctctcaaatc atttaccgat 1980 gcctctgtgc tcttgtacta agcttactga ttttcctaga gagcacagtg gtatagataa 2040 aacgttgtcg atttagaaaa aaatgcatct aaatccagaa aattgtaaac gattttgttc 2100 aaaaaatttc agcgatttcg aaaacaacaa atgtttttga actattttac ggattcgctt 2160 acaagaattc tatccgtgtg tacaaacaat gttccattta agttttagat attattttca 2220 attatttctg tgattttttt atatttgata taagaatatt tttcttccaa aatttctggg 2280 gggggcacaa tgtatgggct gcccccccag ccaaatttta gggggggcaa aaagtaccgt 2340 gcccccccag agtcggcgcc cctg 2364 // ID L1-N4_CQ repbase; DNA; INV; 1430 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A HAL1-like non-LTR retrotransposon family from Culex DE quinquefasciatus - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; nonautonomous; KW L1-N4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1430 RA Kojima K.K. and Jurka J.; RT "HAL1-like non-autonomous non-LTR retrotransposons from the RT southern house mosquito."; RL Repbase Reports 11(1), 103-103 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >99% CC identity. CC This family encodes a protein similar to ORF1ps of L1 in CC mosquitoes. Thus it is likely a HAL1-type element. XX FH Key Location/Qualifiers FT CDS 161..1279 FT /product="L1-N4_CQ_1p" FT /translation="MSHVRTRENTFKVILSNFPKRPKFEEIHKFIHVNLAL FT TPDQVKRLQFSHAENCAYVKCSELKIAQETVAKHNGQHVIESNNAKVKVRL FT VMVDGGVEVKLHDLSENVTNEEIVAFLQHYGDVLSIKETSWGSNYTFHGVS FT SGIRVAKMILRRHIKSFVTIQNESTLISYKNQPETCRHCTLIQHIGMSCVD FT NKKLVGQKTALNQRMSNTQPTPSTSYASVVNSGSSTNTLQIKVVPDNPKIE FT SLSNSESVAKAIVIGNSAQTTVVPSELGTSSSILNISPPAGTEGATVQALR FT KDADAAGSSALIGRTDDDEMPIAVFTATSSVDGTFKVPHPFLEPKPHQPMD FT ASSGSDSDHSGSSESKKRKPGRSKKPKLDK" XX SQ Sequence 1430 BP; 419 A; 338 C; 321 G; 352 T; 0 other; aatcagttcg ctttgtgctc gtgaagcgac cggtcgtata tcttgagctc ctcggtacgc 60 gtgttttgtt tcgtctttgt tccacaccac cgggtgtagg agctgagatt tctcgtcagc 120 tcgattgacg actcggtgtt ttcttggtgg aaacgcaaca atgtctcacg tgagaacaag 180 ggagaacacc ttcaaggtga ttctatcaaa ttttccaaaa cgtcctaagt ttgaggaaat 240 acataaattc atccacgtta accttgctct gactccagac caagtgaagc gtctgcaatt 300 cagccatgct gaaaattgcg cgtacgtgaa gtgcagtgag ctgaaaatag cgcaagaaac 360 tgttgctaag cacaatgggc aacatgtcat cgagtcgaac aacgccaagg ttaaagttcg 420 tttggtaatg gtggacgggg gagtggaggt gaagctccac gatctctcgg agaacgtcac 480 caacgaagag atcgtggcct ttcttcagca ttacggagac gtcctcagca tcaaagagac 540 aagctgggga agcaattaca ctttccatgg tgtttcgtct ggaattcggg tggcgaaaat 600 gatacttcgt cgccacatca aatcgttcgt gactattcaa aacgaatcta ctctcatctc 660 ctacaaaaac caacctgaaa cgtgtaggca ttgcacactg atccagcaca tcgggatgtc 720 gtgtgtggac aataaaaaac tcgtcggcca aaagaccgct ctcaaccaaa ggatgagtaa 780 tactcaacca acaccatcta ctagctatgc tagtgtggtc aacagtggat catctaccaa 840 cacattgcaa atcaaggttg ttccagacaa tcccaagatc gagagcttga gtaactcgga 900 gagcgtcgcg aaggcgatcg tgatcggtaa ctcagcccag acaacggtcg tgcccagtga 960 actcggcact tcatccagta ttttaaacat atcgcctcct gctggaacag agggcgccac 1020 agtccaagct ctacgtaagg acgcggacgc tgcaggttca tcggcgctga tcggtagaac 1080 tgacgacgac gagatgccca tcgctgtgtt cactgctact tcatccgtgg acggcacatt 1140 caaggttcca caccccttcc ttgaaccgaa gcctcatcaa ccaatggacg ctagctctgg 1200 tagtgatagt gatcactctg gaagctccga atctaaaaag cgcaaacctg gtagatccaa 1260 aaagcccaag ctggacaagt gaaaacgaac aggattataa tgatgtttat tcaaatgata 1320 aatgtgtttt tgaaattgct gttcaataac tttattatat ttttaattta cttaaaacct 1380 atgtaacctt ctcgaagttc caaataaacg tttttacaaa aaaaaaaaaa 1430 // ID Gypsy-3_TCa-LTR repbase; DNA; INV; 174 BP. XX AC ChLG4; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_TCa_; KW Gypsy-3_TCa-I; Gypsy-3_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-174 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG4; Positions 3677836 3677663. XX SQ Sequence 174 BP; 45 A; 30 C; 52 G; 47 T; 0 other; tgttaggttg gcaactctca cgtgggcgac acgggagttg gaggggggtt tttttggcag 60 agtgagttag ttaggttatg tcgcagaagg ggcctggacg caccctcaat gctccgggat 120 attagtcaat aaacagttat aaggttgaat ggtctaatta caactaacct aaca 174 // ID Mariner-3_HM repbase; DNA; INV; 2890 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2890 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 220-220 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(671..1708,1712..2554) FT /product="Mariner-3_HM_1p" FT /translation="MDNFKQKEERESRIQLAIQAITEKKMSYVQAAKCYNV FT AKSTLFDRTKGSSNNRGAPRKMSNITEAIIVDLLKFMSDIGFGLNRKDVFI FT VVENYLKESNQRSLFKDGKPTRKWYSGFMNKYRKEICPRKVSGMQTIRAVA FT TQPAIIDNWFEQLAVAYNEHNLGDKPFQIFNCDESGLQFDQGKVKIICRKG FT TKNPKKLAPSNEKQMTTILTCCDAFGNYLPHQIIYKGKHVMKDWCKGGAQN FT VYYNSSSSGWMESEHFLSWFETVFLPHANKLSGFKVLILDGHASHMSLELK FT KKALENYILLWRLPAHTSHFLQPLDVGVFKTVKGAWKRIVESYLTKNHFQS FT LTNHFPAMFKDLIQNGGFKPENARSGFKNTGIFPLDRSQISSKKTSIGAVF FT QAVENNNIENLFDSSTVSSPLVMGNATNTPNTPSRMSNREFLLKSFATLNN FT AVQQVGKDINKSVLNMLRDQLTPTLNKRVQKSERIKRPANYNMTSADAIAY FT DEAEQASKKQKLDEQAAKKFTREQKKISTATQKAKNAQKREEKKKLRQSQS FT SKIPGKKGIKTKNQLEDPHFPQASNNILVSDDLLCYSCEINWTNNLDNQWL FT ACEQCSNWSCIECVFDYRPTMEFICNECV" XX SQ Sequence 2890 BP; 1124 A; 424 C; 454 G; 888 T; 0 other; taccgtaaaa acccctaatt ccggatgtaa cctaattccg gatggtttta gtaaaaagtt 60 tcataactcc cttaatataa tgaatttatg aataattttt ttaattaata aacgtcttta 120 ccagacaaat cgaaaaatat aaaaaaaaat tatataaact aatgcaatca tgaaagattt 180 caatttgaaa tctctcataa ctaaaaaaac aaaaaaaagg aaaaaaaaaa ctttgacttg 240 caacaaaaaa attaaaaaag taaaaacact attttgtgca gaattttatg ctgattattt 300 tagccaaaaa aaaattaaat taccttttga aaaaaatgaa aaacaaattt gctaccagcc 360 aaaaaaaaaa gacacctaat tccggataga ggtaattatt attttatgta cctaaaaaaa 420 ataataaaca aattagccag cagcaataaa caaatacact tttgagtccc ggatacagtt 480 aattacaatt ggctatttta ttttttatca ctttcatgca cataatataa tatattatgt 540 tcattaaagt aataaaaaat aaaatagcca caagggaaaa aataattgac actaatccag 600 atacagctat tttttttaac atccataaag ttaacgcatt tgtgttaatt gtattactgt 660 attaaagaat atggataatt ttaaacaaaa agaagagcga gagagtcgaa tccagctagc 720 tattcaagct attactgaaa aaaaaatgtc atacgtacaa gctgctaagt gttataatgt 780 ggcaaaatct acactttttg acagaacaaa aggatcatct aataatcgag gagcgccaag 840 aaagatgagc aacatcactg aagctattat tgttgattta ttaaaattta tgagtgatat 900 aggcttcggc ttgaatagaa aagacgtgtt tattgttgtt gaaaactatt taaaagaatc 960 aaaccaacgt agtctattca aagacggcaa accgactaga aaatggtact ccggtttcat 1020 gaataaatat cgtaaagaaa tttgcccaag aaaagtaagc ggtatgcaaa ctattagagc 1080 tgtggccaca caaccagcta taattgacaa ttggtttgag caattagcag tagcatataa 1140 tgaacataat ttaggcgata agccgtttca aatatttaac tgtgatgagt ctggtttgca 1200 atttgatcaa ggcaaagtca aaattatttg tagaaaagga acaaaaaacc ctaaaaagct 1260 agcaccatcg aatgaaaagc aaatgacgac aatcttgact tgttgtgacg cattcggtaa 1320 ttatttacct catcaaataa tttataaagg caaacatgtt atgaaagact ggtgcaaagg 1380 cggtgcccag aatgtttatt ataacagtag tagttcaggc tggatggaat ccgaacattt 1440 tttgtcgtgg tttgaaacag tttttttgcc tcacgcaaac aaactttcag gatttaaagt 1500 tctaatactt gatggccatg cttcacatat gagcttagag ttgaaaaaaa aagctttaga 1560 aaattatatt ttactttggc gtttgccggc tcataccagc cattttttac agccactaga 1620 tgttggcgtt tttaaaacag taaaaggcgc atggaagaga attgtcgaaa gttatttgac 1680 taaaaatcac tttcaaagcc tgactaattg acattttccg gccatgttta aggatctcat 1740 ccaaaatggt gggtttaaac ctgaaaatgc acggagtggt ttcaaaaata ctggtatatt 1800 tccactcgat cgttcgcaaa tttcttctaa aaaaacttcc attggtgctg tgtttcaagc 1860 tgtcgaaaac aacaatattg aaaatttatt tgattcttca acagtatcta gtccattagt 1920 tatgggaaat gctaccaata cacctaacac accgtcaaga atgtcaaatc gtgaattttt 1980 gcttaaaagc tttgctactt tgaataatgc tgttcaacaa gtgggcaaag atataaacaa 2040 atcagtgtta aatatgcttc gagatcaatt gacaccaact ctaaataaaa gagttcaaaa 2100 aagtgagaga ataaagcgac cagctaatta taatatgact tcggcggatg ccatagctta 2160 cgatgaagct gaacaagcat caaaaaaaca aaaattagat gaacaagctg caaagaaatt 2220 tactagagaa caaaaaaaga tttccactgc tactcaaaaa gcgaaaaatg cacagaaaag 2280 agaagaaaaa aaaaagttga gacaaagtca gtcgtctaaa ataccaggta aaaaaggtat 2340 aaagacaaag aatcaattag aagaccctca ttttcctcaa gcttcaaata acattttagt 2400 atctgatgat cttctatgtt attcttgcga aattaattgg acaaataact tagataacca 2460 atggcttgcg tgtgagcagt gttcaaattg gtcatgcatt gaatgtgtct ttgattatcg 2520 tccaaccatg gaatttattt gtaacgaatg tgtataaaat aatatatatt ctttttttat 2580 taaacgattt cattttttta atggattaaa ataattattt taatgttagt attatttgtg 2640 tatgtgtttt tctgtatttt atacgcatcc ggaattaggg acataaacac ctaattccgg 2700 ataaaaacaa tattttttaa aaataaaata aaagatagat ttatattgca aaaataatat 2760 tttttatatc gtttgatttg tcttgttatg aagtttattt atttaaaaaa attatttcaa 2820 aattctttat attgagctag ttatgacaat ttaagtccta aaccatccgg aattaggggt 2880 ttttacggta 2890 // ID BEL-52_CQ-I repbase; DNA; INV; 6009 BP. XX AC AAWU01015848; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-52_CQ_; KW BEL-52_CQ-LTR; BEL-52_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6009 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 257-257 (2011). XX DR GenBank; AAWU01015848; Positions 34582 28574. XX CC Positions [4989-5567] - Integrase core CC 'CAACC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 657..6008 FT /product="BEL-52_CQ-I_1p" FT /translation="MPPKAKTSLKVLQTRMKGHQTTFDNIYQFMKDWTEAV FT NLTEVKVRLERLDRLWDRINECREEVESHEEYTPEALAGVQDRVTFENKYF FT ELKSFLMDTIRENTEDSVLNTTSLSHQNTLPGSTPHVKLPQIALPKFTGNI FT DEWHTFRDLYTSLIHFQPDLPDIEKFHYLRSQLLGRPLTIVESVRFTKANY FT SLAWELLCKEYSNTKSLHRQQVQALFDLPVLRKESASELTDLLDAFERIVK FT SLDQVTQPSQYKDLLLVHLLSSRLDSSTRRSWEEHSAPKDTDSAKDLTDFL FT HRRVCVLKSLPTRPAEQRNETVAAKPVKKVSTVRSCNTTTQFGPKCVACSD FT NHLLYTCPRFLKMSVQERDGLLRTNSLCRNCFRSDHLARDCRSRFTCRNCR FT GKHHTLVCFKGKSSDPRPMTNTTTTEPTHTPMETAVDDTNTKVVNSATTST FT VTCNTATGSTAGVLLLTAMVVLEDDQGHRVQARALLDSAAECNLMSRHMRK FT QLMLREARSTIQVVGIQGIASKVQAKVNVTVRSRNSQFSQPMEVYVMPKLS FT SQSTTAAVNTSAWNLPAGIELADPRFLESHPIDLVLGAEHFFDYFVTGRRM FT QLGDNLPLLVDSVFGWVVAGRYPVDGPITSVLCDAVVSSRLDELIEKFWEY FT EEVGLENNYSPDEARCEDHFVKTTQRQPSGRYQVSLPRNEELIRSLGDSKV FT TAERRFFQIERRLNRDSSLRDQYVAFMEEYEALGHMRRVSPEEEGADRFYL FT PHHPVVREASTTTRVRVVFDASAKTSTNLSVNDCLLSGPIIQRSLRSIILR FT SRFRPIMVVADIEKMFRQVDVCPEDRRFQSILWRPTPDQPLATYEMSTVTY FT GTKPAPFLATRTLVQLAADEAERYPLASAAVKEDFYMDDAITGADDPITAK FT QLRVELQEMLNSGGFKLRKFASNCPEVLEGLPNEDLSIQSADGIYLDPDPM FT VKTLGLIWMPHTDVFRFQFKAPPLPDTPLTKQKLFSIVASLFDPLGFIGPV FT ITRAKIMMQLSWSLLDENGHKLTWTSPLPPRIEAEFRKFYEQIPRLNELRI FT KRVVVMPGAVLIQLHFFSDASEKALGAVAYVRSEDSIGHILVAPLMSKSRV FT APLKTQSIPKLELRGTTLAAEMYVEVMESLELCHLSPEIFFWTDSEIVLRW FT LAGIPATWTTFVANRVAKIQRLTENCHWNHISGKKNPADLISRGVAPEDLL FT DEPLWWEGEWLLLSMDHWPRRTVFTADGLEEERRKVVAATTADEPSFIDKY FT IAHYSTYTQMVRHTAWWQRCTRNLRTPKDDRYIGPLTSAELLQAETHILQL FT VQGECFARELMAIGKRECVSRSSPLRWYNPFLSRDGLLRVGGRLGNSGECE FT SVKHPIVLPARHTLTKLLMRHYHLNLLHAGPQLMLSTIRLRFWPLGGRNLA FT REVCHQCIVCYRTKPANIHQFMAELPTPRVVPSKPFSTTGVDYFGPIYVRP FT GYRRTAVKAYVAVFVCFSTKAVHLELVTDLSTACFIQALRRFISRRGKCAK FT LFSDNGTNFVGARNQLEDLFKLLRSKEHREAVSRWCADDGMQWSFIPPGAP FT HFGGLWESAVRAAKVHLLKVLGDTAVSYEDMATLLCQVECCLNSRPLTQLS FT DDPNDLEPLTPGHFLVTTSLQALPEEDLTNIATGRLDHRQQIQQCVQLYWK FT RWRSEYLTQLQGRTKWWKPPKEIKPGSLVIIRDDNQPPTRWRMARILEVFP FT GDDGVVRVVSLKTADGVIKRPVTKICILPVESTVADPVENPAENPAEAETG FT ENAGTPAVPEGGR" XX SQ Sequence 6009 BP; 1483 A; 1649 C; 1587 G; 1290 T; 0 other; ttggtccttc gaaccggatt ggtctggtcg aaggacggta ggagccaaaa tcgccatcgt 60 gaaatcccaa accattcacg aaacggcaag gagaacgcca ttttggacgg cgtcggtacg 120 gctcacctgt tcgaggaaat tcagcgtagt catcctacaa ggcacatttc ctatggaaat 180 ctaggtgagt gcggtttcaa aaccttacac ccaaaaatcc agtactgctc gtggtctctg 240 tgtcacaggt accctcgaca cacccaaccc attcatcacc acactggctg aggatcggtt 300 agcaaagctg ctaattgatc gcaatcacca tcccacccac atcgtcggtg gcgagcgcac 360 ggagcgtaaa tcgttcattc gttcgaaccg aactgctgaa acgccgtgcc acatccactc 420 ctgcccatcc cattgtgctc gagacgctct attgtctcat gcccgaagca catggtcggt 480 gctgaatcac ctgtggtcca gctgatgcgt tacgaagcga ccgctggctc tgctgcgaac 540 gaggatcatc tttaaagatt ttacaaggct caaatcctag agccagcagg taattagcgc 600 tccaataaac tcactccccc aagttgtgct actttggtct catttttcga ggaacaatgc 660 cgccgaaagc gaaaacgtcg ctgaaagtcc tgcagacgcg gatgaagggg caccaaacga 720 cgttcgacaa catctaccag ttcatgaagg attggaccga agcagtaaac ctaaccgagg 780 taaaagtgcg actggagagg ctcgaccgtc tctgggaccg gattaacgaa tgccgagaag 840 aagtggagtc ccacgaagag tacacccctg aagcacttgc tggtgtccaa gatcgggtaa 900 cgtttgagaa caaatacttt gaactcaagt cgttccttat ggacacgatt cgggagaata 960 ctgaagattc tgtgctcaat accacctcac tctcacacca aaacactcta cccggttcca 1020 ccccccatgt caagctaccc caaatcgcac tccccaagtt cacgggtaac atcgatgaat 1080 ggcacacatt tcgcgacttg tacacttccc tcatccactt tcaacctgac ttacccgaca 1140 tcgaaaaatt ccactacctt cgtagtcagc tgttgggcag gccgctcacc attgttgaaa 1200 gcgtccgatt cacgaaggcc aactactcgt tggcatggga actcctgtgc aaggaatact 1260 ccaacaccaa atccctccac agacagcagg ttcaggccct ctttgatctc cctgtcctgc 1320 ggaaggagtc cgcatccgag ttgaccgatt tgttggatgc cttcgagagg atcgtgaagt 1380 cgctggacca ggtaacccaa ccctcgcagt acaaggatct gctgctcgta cacctgctca 1440 gctcgcgctt ggatagctcc accagaagga gttgggagga acattctgca cctaaggaca 1500 ccgacagcgc gaaggactta accgacttcc tacaccggcg tgtttgcgtt ctaaagtcac 1560 ttcccacccg gccggctgag cagagaaacg agactgttgc agctaaaccg gtcaagaagg 1620 tctccacggt gagaagctgt aacaccacta cccagtttgg ccctaagtgt gttgcgtgtt 1680 cggataacca tctgctctac acatgcccac gctttctcaa gatgtctgtg caggaacgag 1740 acggcctact gcggactaac tcactgtgcc gaaactgttt tcgcagcgat cacctagctc 1800 gggactgcag atcgaggttt acatgtagga actgtagagg aaagcaccac acactcgttt 1860 gcttcaaggg gaagtcaagc gacccaagac caatgaccaa caccaccacc actgaaccaa 1920 cccacacacc catggaaaca gctgttgacg acaccaacac caaggtggtt aactcggcaa 1980 ccacaagcac cgtgacctgc aacacagcaa ctggatcgac tgcgggagtt ttattgctca 2040 cagcgatggt tgtattggag gacgaccagg gtcatcgagt gcaggcaagg gctcttctag 2100 atagtgcagc ggaatgcaat ctgatgagca gacacatgag gaaacagctg atgctccggg 2160 aggctcggag cacaattcag gtggtaggca tccaaggcat agcttccaag gtccaagcca 2220 aggtcaacgt gactgtccgg tcacgtaact cccaattttc ccaacccatg gaagtttacg 2280 ttatgcccaa actctcgtcc caatccacta cggctgctgt taacacttct gcttggaacc 2340 taccagctgg aatagagtta gcagacccaa ggtttctgga atcccacccc attgatctcg 2400 tactaggcgc agaacatttt ttcgattatt tcgttactgg acgtcgcatg cagttgggcg 2460 acaatttacc ccttttggtc gactctgttt ttggatgggt agtcgcaggc agatacccgg 2520 tggatggccc tattacgtct gtcctgtgcg atgctgttgt ttcgagtcga ctagacgagc 2580 tgatcgagaa gttttgggag tacgaagagg ttggtttgga gaacaattac tcacccgatg 2640 aagccagatg tgaggaccat tttgtcaaga ccacccaaag acaaccatct ggacggtacc 2700 aagtttctct gccaaggaat gaggaactca tccgtagtct aggcgattcc aaggtgactg 2760 cggaacggag gtttttccag atcgagcggc gactcaaccg ggactcgtcg ttgcgcgatc 2820 agtacgtggc ctttatggag gaatacgagg ccttgggaca catgcggcgt gtctcaccgg 2880 aagaggaagg tgcggaccga ttctacctcc ctcaccaccc agtcgtgcga gaagcgagca 2940 ctaccacacg ggtccgagta gtgttcgacg cttcagccaa aacatccacc aacctgtccg 3000 tcaacgattg cctgctgtcc ggaccgatca tccagcgaag cctacggtcc atcatcctac 3060 gcagtcgttt ccgtccaatt atggtggtag ccgacattga gaaaatgttt cggcaggtgg 3120 atgtctgccc ggaagacagg cggttccagt cgatcctgtg gcgcccaacc ccagaccaac 3180 ccttggcaac ctacgaaatg tcaacggtga cctacggaac caaacctgcc ccattcctcg 3240 cgacgagaac cctggtgcaa ctagctgctg acgaggccga acgctaccca ctggcgtcag 3300 ctgctgtgaa ggaagacttc tacatggacg atgcgattac gggggcggac gatccaatca 3360 ccgccaaaca acttcgtgtt gagctgcaag agatgctgaa cagtggagga ttcaagctga 3420 ggaagtttgc gtcgaactgc ccagaggtgc tggaaggcct acccaacgaa gacttgtcga 3480 tccaatcagc ggacggaatc taccttgacc ctgatcccat ggtgaaaact ctgggactga 3540 tttggatgcc ccacaccgac gtcttcaggt tccagttcaa agccccaccc ttgcctgaca 3600 ctccgctgac gaagcagaaa ctcttctcga tcgttgctag tttgtttgat ccccttggtt 3660 ttattggccc cgtgataacc cgagcaaaaa taatgatgca gctttcttgg tcgctgttgg 3720 atgaaaatgg ccataaactg acttggacgt caccactacc accaaggatc gaagcagagt 3780 tcagaaagtt ctacgaacaa attccccgac tgaacgaatt gcggattaag cgtgtggtag 3840 taatgcctgg ggctgtgcta atccaactac atttcttctc tgacgcttca gaaaaggccc 3900 tcggagcggt tgcctacgtc aggtcggaag actcgatcgg tcacatcctt gttgccccac 3960 tgatgtccaa atctagggtg gcacccctca aaacccaaag catcccaaaa cttgagctcc 4020 gtggtacaac cctggcggcc gaaatgtacg tcgaggtcat ggaatccctc gaactctgcc 4080 atctctctcc tgaaatcttc ttttggacgg attcagaaat cgttttgcgc tggctggctg 4140 gtatccctgc gacctggaca acgtttgtcg ccaaccgagt cgctaagatc cagcgtttga 4200 ccgagaactg ccactggaac catatttctg gaaagaagaa ccctgcagac ctcatctccc 4260 gcggagtggc accagaggac ctactggacg agccgctgtg gtgggaaggc gaatggttac 4320 tgctgagcat ggatcactgg ccgaggcgga cagtcttcac ggctgatgga ctggaggagg 4380 agaggcggaa agtggtagca gcgacaacag ctgatgagcc cagcttcatc gacaagtaca 4440 tagcacacta ctcaacctac acacagatgg tgcgccacac ggcctggtgg cagcgctgta 4500 cacgcaacct acgaaccccc aaggatgatc gctacattgg accgttaact tcagctgaac 4560 tactacaagc agagacccac attctacagc tggtgcaagg ggagtgtttc gctcgcgagc 4620 tgatggcaat cggcaagcga gagtgtgttt cgcggtcatc gcccctgcgg tggtacaatc 4680 catttctgtc gcgggatggt ctcttgaggg tagggggtag acttgggaat tcgggagagt 4740 gtgagagtgt gaagcacccg atcgtattgc cagcacgtca cacgctgacc aagctgttga 4800 tgcggcacta ccacttgaat cttctgcacg ctggaccaca actaatgctg agcacaatcc 4860 gacttcgatt ttggccactt ggtggacgca acctagccag agaagtctgc caccaatgta 4920 ttgtttgcta caggaccaag ccagcaaaca tacaccagtt catggctgaa ctaccaacac 4980 caagagtcgt gccatccaaa cccttttcta caactggtgt cgactatttt ggaccgatct 5040 acgtacgacc agggtaccga cggacagcag tgaaggcgta cgtggcagta ttcgtctgtt 5100 tctccacgaa ggcggttcat ctggagttgg tgaccgacct gtcgacggcg tgttttatcc 5160 aagcgttgcg gcggtttatt tcgcgtcgtg gaaagtgcgc caagctgttt tcggacaacg 5220 gcacaaattt cgtgggtgcg cggaatcaac tggaggacct tttcaagctg ttgcgatcca 5280 aggaacaccg tgaggcggtg tcgaggtggt gtgcggacga cggtatgcag tggagtttta 5340 ttccccctgg ggccccccat tttggtgggc tttgggagag cgcggtacgg gcggcgaaag 5400 ttcatctgct gaaggtgcta ggcgacacag ccgtgtcgta cgaagatatg gcgacgcttc 5460 tctgccaagt ggagtgttgt ctcaattcaa ggccgctgac tcaactctcg gacgacccca 5520 atgacctaga gcccctgacc ccagggcact tcttggtcac cacatcgttg caagcactac 5580 ccgaggagga cctgaccaac attgcaacgg gccgattgga ccatcggcag cagatccaac 5640 agtgtgtgca actctactgg aaacggtgga ggtcggagta cctcacacag ctgcagggaa 5700 gaaccaagtg gtggaagccg ccgaaggaga tcaagcctgg gagtctggtg atcatacggg 5760 acgacaatca accaccgaca cgctggcgga tggcgcgcat cctggaggtg tttccggggg 5820 acgatggcgt ggttcgagtc gtctcgctga agacggcgga cggtgtcatc aagcgaccgg 5880 taacgaagat ctgtatcctg ccggtggaga gtacggtcgc ggatccggtg gagaatccgg 5940 cggagaatcc ggcggaggcg gagacaggtg agaatgctgg aactccggct gttccagagg 6000 gggggagga 6009 // ID Gypsy-15_SI-LTR repbase; DNA; INV; 252 BP. XX AC AEAQ01023729; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_SI_; KW Gypsy-15_SI-I; Gypsy-15_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-252 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023729; Positions 190 441. XX SQ Sequence 252 BP; 62 A; 72 C; 35 G; 83 T; 0 other; tgttatgttt ggaattaccg ccatatgcat ttcgcacact acacccgtgt acctctctcg 60 accaataaga agccgccttt cttttcatga tcactgaccc atctatatta tccggtaacc 120 ttccgggcct tcccagagag tcaatctatg tattcagtct tttcttttag tctgtctgta 180 tcccgtaaca ataaactcgg cattgcatta taaatatccc tcttgcttaa ttgtgaatca 240 cccacaccca ca 252 // ID GENIE1_GI repbase; DNA; INV; 3971 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Giardia intestinalis non-LTR retrotransposon GENIE1_GI, consensus DE of complete cds. XX KW Non-LTR Retrotransposon; Transposable Element; GENIE1_GI; KW pol domain. XX OS Giardia intestinalis OC Eukaryota; Diplomonadida; Hexamitidae; Giardiinae; Giardia. XX RN [1] RA Burke D.W., Malik S.H., Rich M.S. and Eickbush H.T.; RT "Ancient Lineages of Non-LTR Retrotransposons in the Primitive RT Eukaryote, Giardia lamblia."; RL Mol. Biol. Evol 19(5), 619-630 (2002). XX RN [2] RA Gentles A. and Jurka J.; RT "GENIE1_GI non-LTR retrotransposon."; RL Direct Submission to Repbase Update (OCT-2004). XX DR [2] (Consensus) XX SQ Sequence 3971 BP; 586 A; 1504 C; 1172 G; 709 T; 0 other; gcgcctggcc cccctggccg tgacccgcgc gcccccgcgc acaggagcat acccctggag 60 ctggggagaa ggcccccctg gccgcgcccc cgccgcgaga ccacgcgggc acgggggctg 120 ggcgccccgg gcgggccctt cccctccgtc gtgaggctgc ctcccctcca atgtccttac 180 cgccttgcgc gcatattacg tagtaaaccg cctctgtaaa aaaaaaaaaa aaccatgcgt 240 tccgcccttt ccgcgccttc gggaaccacg gagatcgggg cttcctcagg cccagagcct 300 cccaaccacg ctccaaccga ctcccacagg cccctaccta ggggcactcc cgatcctatg 360 acgctgggtg acggcaacga tgcctcgacg accgccatgg cccggccttc ctcttgcgag 420 cccctctgcg gctcccacgg gcccccatcc ccccagcctg ggcctggcaa cagccgttcc 480 gccacggaca actctcccaa tgagtcttct cctgtccatg gcttacaggg ctccggggcc 540 tcggtggccc gggacgcctc tcctccctgc ctgtcgaacg accttctgcc ctgtggccag 600 gagcccctcc cgattcccga cgacggcctc ccatacgacc tgggccacga caactccccg 660 tccatcacgg acgagcagga ggcccggggc gcagacctgc ctgctcctgg gggccgtacg 720 actgtctgcg agcgctgcgg agaggtgctg cctgactctg cggcgatcga tgtccacatc 780 gggatgcacc atccgcctcc cggttcacct tctccgtcgc gcagcccttc tccgctgtcg 840 gggtctgccc ctccacctgc agccgccgag ccctctccct cgccaacgcc ggaccctgtg 900 ttcccgtatg catgcagcat gtgccgggcc cgatacaaaa cggagagggg gctgaaggcc 960 catatcgcca ggctcggcca ctacctcccg ctcgacgcca cagtccccga gcgcatcggg 1020 gtgcctagtg gcagcccgtc ccctgagctg cttcgcctcc tcactgacgt attcacagag 1080 atggcggggc gccttcctcc tgtagaggcc ctagcgcgct tcctctcgac gtgccgccgc 1140 atggagggct ctggacggct ccccccggcg cagtcgcttc tccggaaggg gctgctcggc 1200 cgggcatggc aggcgctgtt gagcgagacg tcctctgccg ccagggtccc cgagccccag 1260 ggcaaggagc gggaagcgat tgtccatgag ctccacccgc gccccatgcc tgtctctctg 1320 ccccccgtcc atcacgtgct cggcccgtcg cccaagataa cggccaaggc gctgctgaag 1380 gagctgcagg cgatgaggcc tgtcgcggcc ggaccctctg ggctggggaa gcctcacctc 1440 ctgcaccttt gtggggccgc cggggctgcg gagctcttca cctctgtcct gacgaccctc 1500 ttctccagca ggaactgggc ccagctccag cccctgtgcg agttcaggct gaagcttctg 1560 cccaagtctg gcggccgatg gcgccctatc gcagtgcagg agacgcttct ggtcgccttc 1620 caccgcttgc tcctgcgtcg gactcccgcg ctccgcaagc tcccggcgtg gcagctggcc 1680 tttgagcacc tcgcccagat gaaggcgatc cgcgcggccg aggagctgaa gaggacacac 1740 cacctgctca cggtggatgt gcggaatgcc ttcaacagcg tcccgcattc cgtcatcctc 1800 tttgcgctcc gccgggcagg ggtggtccag ccgaccgtcg cgtacattga gtcgttcctc 1860 gcggctcggc actcctcgga cctccccgcg gtcccagcgg gcgtgccaca gggggacccg 1920 ctgtccatgg cgatgttctg ccagagcctt gtctggccgg tggagacgta cctcggccag 1980 tacaaggtcg ttgcatacgc cgacgacctc gtcattgcgt cagaggaggc gattcccata 2040 gataccgtga agagtgacgc gcagtctgcc ctctcccgca tagggctcac ggtcgagctc 2100 tcgaagtgct cctcgaccca ggccggggcg atctccttca tgggcacccg cgtgctcaag 2160 cactcctcct tcaacctcgc gcagacatcg gcccgccggc tccacgagca tctcgccgtc 2220 ctccgtgctt cgggtctctc actccacgac cgcctcaggc tcctgtctgc ctgcgtcgtc 2280 cctgcagtta actatggacc ccttgttgac gactacccgg gcccgtcccc ctatgccgac 2340 gtcgatgcgc agatagtgga ggaggtcgcg acccttctgg agatccccga accccttgcg 2400 aagacccttg ctctgacgcc ccgcgcgaag tacggcctcg gactggtgct gccccatcac 2460 tactacgacg agatgcacag gcagcgccag gacatgaagg cgggcgtctt ccgcgagctg 2520 aggaagaagc gcctgcagga cacggccgcg ctccgatcct tcctgccgct cgcattgctg 2580 ggctgcgcac ccctggacaa cacgcaggtc ctgttcatag gggactgctt ggcggggagg 2640 taccagcggg gccggccgat gggcacgtgc tgccactgca agcagccctt ccttcccagg 2700 caccatctgg tgtgcaaggc cattaacggg attcacgtgg cgcggcacga caagattctg 2760 gacgccctgc tcgcgtgctc ccgcggtcgc gctgggtctg ttgtgcgcaa tcccacgatc 2820 ccagtggatc acctccagcc tgaccttgtc atcggtgggg gcttcgggga cttggtggtc 2880 actgtcccgt ggaggctgga gcggtcctat gccctgaagg ccgccaagta tcgccccctc 2940 gtgctccagg ggcgggcggc ccacatcctc cccgtcgtgg tcggtgctga cggcgtcctg 3000 caccacctgt ctgcggccgg actcgcgttc gccggcgtgg accttgcgcg cttcatgcag 3060 gaggcggcgc aggtcatcct ctggcactac aggctgtcgg ccctcctgta cgctgggctg 3120 cgagtggaga ggccggtgca ccgccccgcc cctgcagtat ccctgccgga accggctccc 3180 aacgaggctc gggctactcc tcctactccc ctcacccctg ccatgtggag ccctggcacg 3240 tcgccctccg tagagatcat ggccgatctc agccagcatc ctgatgcggc agtccctcct 3300 ccctggggtc ctgctgaggt ctcctcctcg atcacctggg gcaccgatca ccccgcacag 3360 ccggatgcaa acgaacgccc agaccccttc atgtgctcaa gggccccctc cccgcctcca 3420 ctggatgacg acagccccga cgatgcagga gcacccgagc cggtgcacaa ggccctgccg 3480 tccttcttca agcgcgtcgg tcctcccaag ccgcatgccc ctgatggctt ctaccccttc 3540 aaacgcatcg acacggggcg ctagtggtca tgcttgggca ccctgcccgc tgtcggctgc 3600 cccacagcgc gaacggaccc ctgggcctgc gtgccgagct gtctcccctg gtactcctcg 3660 ctcgcacccg aggcctcttg cccctatcat tcgggtgtgc atcacgtcca gacacggtgc 3720 gatagatgcc tagccgcgat gccatacaca ctcgaccagg ccgggccagg ccggacgccc 3780 ccctggatgg ccagacaccc gccctgccta cgtttgggca cttcctcgat gggactcgtc 3840 atctgcccac gccgcagccc ggtgttcgtc tgtctgtcct cccgggcagc ggcggtcctt 3900 gggtgctgag tctggcgagc tggtgctggg ggaagaggct agccccgtgt gtcattcgta 3960 ccatcgcacc g 3971 // ID Harbinger-4_BF repbase; DNA; INV; 4515 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 02-JUN-2011 (Rel. 16.05, Last updated, Version 2) XX DE Amphioxus Harbinger-4_BF autonomous DNA transposon - consensus. XX KW Harbinger; DNA transposon; Transposable Element; Harbinger-4_BF. XX NM Harbinger-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4515 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4515 RA Kapitonov V. and Jurka J.; RT "Harbinger-4_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 801-801 (2008). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 786..1865 FT /product="Harbinger-4_BF_1p" FT /translation="MAELRNEDPTAFHIFMRMPIAMYDELLERVTGRLTKK FT ATFMRDPLDPGLKLALTIQHLVSGNTYASMKFSWRVPKCTISLVVREVCEA FT IIATYLDELMVCPTYTPNQWRQIADRFYQKWNFPHTCGAIDGKHVACQGPW FT NSGSMYYNYKGFYSISLMALVDADYRFIWADIGGLGSASDAQVFSQVFNAS FT ELKECIEDGTIGFPDPEPLPNDTQDVPFFIIGDDAFSLRTTMMKPYSGRGL FT AREERIFNYRVSRARRVAENAFGILARRFRVLLTTMQHHPSTVKLIVTTCV FT LLHNLMRSRYPALQDVELDQAEDVQHEFVPWGMERWSQLTGHHQRDWSQQS FT LQGREDAEEAAQALGQL" XX SQ Sequence 4515 BP; 1165 A; 1043 C; 1132 G; 1175 T; 0 other; ggtaggttta cacctgcgta cattttcatg ccgcatgagt tccgttttga catttttcga 60 cattttgact gatttcttca atacgcggcc gcacgcacaa ttcgcacctc aaacgcatct 120 gaatcgtttt tagaaagttt cacgaacgca ggcaaacgaa aatatacgcc cagtacaccc 180 gatagttttg gacatgcaca aaactatcgt tcgtacgcag ttggttcgcc aaaacgtcat 240 gactacgcag tgaatcggtc ccacgtcggt cgagtgcggt atgaagacgc tttacagtag 300 tttcgctttg tgcgcagtaa tacgctgata tcgcggcatt gcatctacat ggcgcactta 360 cagcgcagtt gcaacgcaca atgaacaacc acataacgat tgccgacata ggccccgagt 420 tagtgccgtt cgttctgcga tcgtgttgcg tatagtactg attagtaacg tctgaaaaac 480 taactgattg cgtacagata gcgtgatcga cgcacactcc agattttgta taaatacgaa 540 gccgcatctg accccaaaca gaacagatct caagatggca gatccaggcc aggaccgacg 600 acacctaatg attgctgtca tccaacacca gcatgacatc atgaatctac aactccatct 660 ccttcgtcaa agacagagag aaagaagaag aagacaagca cgccaaagga ccgtctgggt 720 ccgaccttgg atagggagaa gagaagagtt ggggatatat cgtaagtatt ggggatatat 780 cgttgatggc ggagctccga aatgaagacc cgacggcctt tcacatcttc atgagaatgc 840 ccattgccat gtatgatgaa ctgctggaga gagtgacagg aagacttact aaaaaggcca 900 cctttatgag agaccccctg gacccaggct tgaagctagc actgactatc caacaccttg 960 tctctgggaa cacgtacgcg tccatgaagt tttcttggcg ggtaccaaag tgcaccatat 1020 ctctggttgt gcgagaggta tgcgaagcta tcatagcgac gtacctggat gagctgatgg 1080 tttgcccaac gtatacaccc aaccaatggc ggcagatagc tgacaggttc taccagaagt 1140 ggaattttcc tcacacctgt ggagctatag atgggaagca cgtggcctgc caaggccctt 1200 ggaatagtgg ttccatgtac tacaactaca aggggttcta ctccatctcg ctcatggcct 1260 tggttgacgc ggattacaga ttcatctggg ctgacattgg tggcctcggt tctgcatccg 1320 acgcccaagt cttctctcaa gtcttcaatg caagtgagct gaaggagtgc atagaggatg 1380 gtacaatcgg gtttccagat cccgagcctc ttcccaacga cacccaagat gttccattct 1440 tcatcatagg tgatgatgct ttctccctga ggaccaccat gatgaagcct tacagtggca 1500 ggggcctggc tagagaagaa cggattttca attaccgtgt ttctcgtgca aggagggtcg 1560 cagagaatgc ctttggcatt ctagctagaa ggttccgggt gttgctcacc accatgcagc 1620 accacccctc cacagtgaag ctcatagtca caacatgtgt gctgctacac aacctgatgc 1680 ggagccggta cccagcactg caggatgtag agctagacca ggcagaggat gtgcaacatg 1740 aatttgtacc gtggggcatg gagagatggt cacaacttac aggacaccat caacgtgact 1800 ggtcccaaca gagcctccag ggaagggaag atgcagagga agctgctcaa gcactgggtc 1860 aactctgaag caggctctgt tccatggcag gatggaatga tctagtctgc ttgacacgtg 1920 tatgtaacat atatttgata tctgctgata atgataattt ttgtgtgtga agaagcatat 1980 gatattagga gggcataaaa tttcataatt gtttaattga cgtatttgat aaacatcttg 2040 agtccgtttt atgttgtatg tgtaagtgta tgtcatttta cagtagttga aattgtacat 2100 gcaaaacatt acgttcgccg gttcgtgtaa ttgatttata tacaaagatg atatagagaa 2160 tcatttaagg acaaagtgaa aacaaaaaca atcttaacag tgcatataac ttttaatgta 2220 actgattgcc cacattactg taaatgaaag tacagtgtca ctgcttcaga aataaataag 2280 acaaaactct tcactgtttt ctcatgtttt aatgtttcat ttccaatact cgctaacaaa 2340 attatacata acagaaaatg ttcctctgat aaaaatgttc tgcttatcat caattgcata 2400 agtgttcttc tgacactaaa atcgctcaca aagaaaatta tacataacag aaaatgttcc 2460 tctgactcaa atgttctact tatcatcaat tgttggtgcc ttgggtgtgt tcagaacact 2520 gctggaagtg ggcgtatcag ggttcttagg tgacggcaat atgccgctta gagtcatgtt 2580 agatataccg gagagattgc ttgtgctgcc ccgctcagac aactgcggtg actgctgttg 2640 gtactgctgt gactgcagcc gatactgtgg cgactgctgg cggtggtagg tctgctggta 2700 ctgctgtgac tgctgctggt agtgctgctg atacagtggc ggttcttctt gatactgcgg 2760 ttgcagctgt tggtactgct gtagatgctg ctgcttgtac tgaggctgca ggggagcctg 2820 gcggtactga tccacgtact ggggggttgc agacgaccaa acagaactgg ctggcgcctg 2880 gtgcctccac atgtgaggtg gtggctggta ctgctctgaa ggtgtaataa cagtcggagg 2940 tgcagagttt ggtcttacag gtctgatagc agtaggtgcg aggaacgatg gagcaggagc 3000 agcacattct ttaccaccat cggaagagga agagtcgtta tccatgtcat tcttgcgaag 3060 aatgtcccaa aaagctgcct tggtcactct gtacctgtcc ttgggcatgg tgagtagtga 3120 atccctcacg aagttggcaa agttagctct cctgctgagg ggtgcaggct caaccaactg 3180 cttaagaagt tgaccagact gacgcaggtt ctcctggagg gtctgcatac atggagtatc 3240 cactgaggta tcatcctgct tccgcttctt actcttcttg gtggtggtgg tggtggtgtt 3300 ggtgtctccc gaagtgggag gggtctcctc gtcctcctcc atttctacac gagcagcttg 3360 ggcagccgct gcttctgcct gtgccaagga cttactcccc tgtccggcga ggtgttcttt 3420 cacctgcata aattgcaatg cattatcaat aagaaaacag aacaaagtga catgtctgga 3480 ataaacatga atgcaataaa cgtagtcgta agaaaatagc agtgatattc atgcttacac 3540 ttctcaatgg ctgtcccacg tgtcggacca gtggccgcaa gaagtcacaa cttgtgaaaa 3600 tccatgtttc tctctctgtg agcttccttg ccgcatcccc gctcttcttc ttgtggagcc 3660 ttgtgaaaat gtcctttata gatctccacc agcctcttaa gcagattcct gtcttgtcca 3720 actctttggc cttgtcgttc cacagtgcgt ccttcagatc cgtgcgctta tagctgatga 3780 gttttctgtt ccaaaggatc tcgttctctc ggatccattc caccaaggcc atctctgtct 3840 gaccatccag agaaaagctg ttctttttcg ctcttgtttt tttgggcgcg gacaccgctt 3900 gagcgtcgtc tgaaccagag gggtcggaca acactggagt gttgtctgga gctgggtcgg 3960 acatagctgg agtgtcgtct ggagctgggt cggacacaac tggagtgtcg tctggagctg 4020 ggtcggacac aactggagtg tcgtctggag ctgggtcgga caccactgga gcgtcgttac 4080 agccaccagc aggcggccta gaggagccac ggccacggcc agagccggca tgtcaacatg 4140 aaaatctgca ctggtaatga aaagaaggct ggctgtacct gcatttatac tcacctgaaa 4200 aacgcgtgga atgcaattaa cgccaggtgc gtcaggcatg cgatagacag tcgttgtgtg 4260 cgtcaggaat gcgagaagca aacgcaactc aattgcagct attcgcggca aaggtgttgc 4320 gtccttcata tgcaaatcac gcgcacctca gtcggaatgg agtcgctgaa gtcgtaactc 4380 atgcgccatt tgggcaagat gtataatcta attaatacca caaaaaataa aaatgttggg 4440 cgttcggctg cgtattgaca gaatttcatt cggaactcac gcgcacttga aaatatacgc 4500 aggtgtaaac ccacc 4515 // ID CR1-1_BM repbase; DNA; INV; 3410 BP. XX AC . XX DT 30-JUL-2009 (Rel. 15.03, Created) DT 30-JUL-2009 (Rel. 15.03, Last updated, Version 1) XX DE CR1-1_BM autonomous non-LTR retrotransposon from silkworm - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3410 RA Kapitonov V.V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from animals."; RL Repbase Reports 10(3), 528-528 (2010). XX DR [1] (Consensus) XX CC The consensus sequence is less than 1% divergent from several CC copies of CR1-1_BM. XX FH Key Location/Qualifiers FT CDS 274..3162 FT /product="CR1-1_BM_1p" FT /note="APE endonuclease and RT domains." FT /translation="MIYFFSLSLMANLKIFYQNVRGLRSKTNLFYRNIILC FT DNDIICLSETWLMPGIYDSELFDERFVVYRCDRNYNSRNDSLGGGVLIAVR FT RGITVFNLTCLPSNRNRSADIISIKINIKHNNVSKLLHLYCCYFPQCSSQF FT EDEIGFFEDISDTIITNPGDEFLCVGDFNIRNASWELIGVDGPARLLNGDR FT WDSLVFGLSNFLAFNSFKQFNSISNINKRCLDLVISNCACKVVRSSLPLVS FT EDDHHPALDIQIQGLDVRPLRSPGRSILLFHKSDYSIINEELSKIDWKQSF FT MNLDVDMAVGKFHEILDKIISDNVPVKVVRGNSNFPYWYSSALIKLIKEKR FT KFHRKWKCYGRLADYSSFSVLRERQKALKICCYNAHITKCEDDILKCSKQF FT WRFVKSKRSAGDFPNELFLNDRCSSDGSEICNLFNVHFGSAFLTNNTQSAS FT SLSYTPASYDLNCVDISSFFISVGKVKQYLKNLDISKGAGPDGIPPVFLRQ FT CYAQLCYPLHLLFNHSLSTGVMPRIWKRSFVVPVFKSGDKHNIANYRPISK FT ICAIPKMFERIVYDFLFPLIRPHIIEQQHGFISHRSTETNLCEFLDYVLSS FT MEDGHQVDVVYTDYSKAFDRINFDILIEKLHGLGVHGDLLRWLESYVRDRS FT QAVTFNGFCSSFAPVPSGVPQGSHLGPMLFNIYVNDVSNVFRKSKFIMYAD FT DKKVYKVIKSLNDCLELQDDLNNLFVYCQNNLLTVNTNKCTIITFSRGRSN FT ILYDYSINGQTLVRTTVVRDLGIYLDSSLIFDFHVNEIVNKAFRMLGFILR FT VGKDFKRPSTLILLYNSFVRSILEYGSVTWNPQYNIYIQRLERVQKKFFKH FT LLYHCRRFNSPEPTEFVSLVDRRLLRDQMFLYKIINNEIDSSYLLSKISFK FT CSRISARSKQTFHMSTARTKYASNSFVRRSCRLYDAKFSNADIFCLSLVAY FT RKAVLECL" XX SQ Sequence 3410 BP; 928 A; 528 C; 650 G; 1303 T; 1 other; gactatttca gttttttgag aaacttttaa atgttttttt aactatttta ataacattat 60 tatgtaaggc ttgttatgat atagcaacat acttattgtt tagcattgga gaatttttta 120 ttttgttcgt ttggaacttt tactcctggg gtgggtgttt gattaatgtt gagatctata 180 ttctcgtttg tgtaggtact tatgtattga catgtcaatt attatgctgt ggtttgagcc 240 gtggactatt aatactgttg gttaattatt tatatgatct attttttctc tctctctcta 300 atggctaatt taaagatttt ttaccagaac gttcgtggtt tacgtagtaa aactaatttg 360 ttttatagaa acattatttt atgtgataat gatattatat gtcttagtga gacatggtta 420 atgccgggta tttatgactc ggagttgttt gatgagcgtt ttgttgttta ccgttgcgat 480 cggaactata actctcgaaa tgattctttg ggtggcgggg tacttattgc tgtgcgtaga 540 ggaattaccg tttttaattt aacctgtctc ccttccaaca gaaaccgatc agctgatatt 600 atttcgataa aaataaatat taagcacaat aacgttagta aactactcca tctttactgt 660 tgctattttc ctcaatgttc ctctcagttt gaagatgaaa ttggtttttt tgaggatatt 720 tctgacacaa ttattactaa tcccggtgac gaatttttgt gcgtcggcga tttcaatata 780 aggaatgctt cttgggagct gatcggtgtt gatggacctg ctagactctt gaatggtgat 840 cggtgggact cacttgtctt tggtctttct aattttttag cttttaatag ttttaaacaa 900 tttaattcga tttctaatat aaataaaaga tgcttagatt tagtaataag taattgtgct 960 tgtaaggttg ttcgttcgtc ccttcctctt gtatctgagg atgatcatca tccagcctta 1020 gatattcaga ttcaggggtt agacgtgaga cccctacgtt ctcctggccg gtccatattg 1080 cttttccata aaagtgacta ctctattatt aacgaggaac tttcgaaaat tgattggaaa 1140 caatctttta tgaatttaga tgtcgacatg gccgtgggca agtttcatga aatcttagac 1200 aagattattt ctgataatgt gcctgtgaag gtcgtaagag gtaactccaa tttcccttac 1260 tggtactcct ctgctttaat caagttgatt aaagaaaaaa gaaaatttca caggaaatgg 1320 aaatgttatg gtcgattggc cgactattca agtttttctg ttcttcgtga aagacaaaaa 1380 gccttaaaaa tttgctgtta caatgcacac ataacaaagt gtgaggatga tatcttgaag 1440 tgcagtaagc aattttggag atttgttaaa tccaagagga gtgctgggga ttttcctaac 1500 gaattgttct taaatgatag gtgctcatct gatggttctg aaatatgtaa tttatttaat 1560 gttcattttg gttctgcctt tttaactaac aatacacagt cagcatcatc gttaagttat 1620 acccctgctt cttacgacct taactgtgta gatatttctt cattttttat ttcggtgggt 1680 aaagtcaagc aatatttaaa aaatcttgat atttccaagg gtgctggtcc ggatggtatt 1740 ccgcctgttt ttttgcgaca atgttacgct cagctgtgtt accctttgca tcttcttttt 1800 aatcactcat tatcaactgg tgttatgccc cgtatttgga agcggtcgtt cgtggtgcca 1860 gtctttaaga gcggtgataa acataacata gctaactaca gaccaatttc taaaatttgt 1920 gccatcccta agatgtttga acgtatagtt tatgattttt tatttccatt gattaggcct 1980 catattatag agcaacagca tggttttata agccatagat ctactgagac aaacctctgt 2040 gagtttttgg attatgtatt gagctccatg gaggatggtc atcaggttga tgtggtgtac 2100 acggattatt ccaaagcgtt cgatcgaata aattttgaca ttttgattga gaaattgcat 2160 ggacttgggg tccacggtga tttgctgcga tggcttgagt cttatgttag agatcgcagt 2220 caggctgtga ctttcaatgg tttttgttca tcatttgcac ctgttccctc gggagttcct 2280 cagggctctc atctcggtcc tatgttattt aatatatatg tcaatgacgt ttcgaatgtt 2340 ttcaggaaat ccaaattcat tatgtacgct gatgacaaaa aagtatataa agttataaag 2400 tccttaaacg actgtttgga attacaggat gatttgaaca acttatttgt ttactgtcaa 2460 aataacttac tcacagtgaa tactaataag tgtactatta taacattctc acgtggaaga 2520 tcgaatatct tgtatgacta ttctattaat ggtcaaactt tggtacgcac cacagttgtt 2580 cgtgatttgg gtatttattt ggattccagt ttgatttttg attttcatgt taatgaaata 2640 gtaaataagg cttttcgaat gttaggcttc atacttagag ttggtaaaga ctttaaaaga 2700 ccatctactt tgattctgtt gtacaacagt tttgtacggt ctatattgga atatggctcc 2760 gtcacatgga acccccaata taatatttat attcagcgac tagaaagagt ccaaaaaaag 2820 ttttttaagc atcttttata ccattgtcgt cgctttaact ctccagaacc gacagaattt 2880 gtctctctag tcgatcgcag gctactgagg gatcaaatgt ttttatataa aataataaat 2940 aacgagattg attctagcta tcttctttcc aaaatttcat ttaaatgtag tcgtatttcc 3000 gcgcgttcta aacagacctt ccacatgtcc acggctcgaa cgaaatatgc ttctaactct 3060 tttgtccgta gatcttgtag gttgtatgat gctaagttct ctaatgctga tatcttttgc 3120 ttgtcacttg tagcatatag aaaagctgtt ttggagtgtt tatagggtga tttggcccga 3180 taagtgttgt agttttatag tgtgagttaa tgtataaaat gatatgtgcg tattgtgttg 3240 cttttgtttt tttktttaat ttaaatttct aaatttctct tcttgttcac tgtctactta 3300 tatgtgattt tgattgtgga aacatatttg gttattgttt tagctatatt ttttaaatat 3360 tgtatgtttt gtattgtact gtttgtttcc caaataaata aataaataaa 3410 // ID Gypsy-7_RP-LTR repbase; DNA; INV; 251 BP. XX AC ACPB02048231; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_RP_; KW Gypsy-7_RP-I; Gypsy-7_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-251 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02048231; Positions 3571 3821. XX SQ Sequence 251 BP; 86 A; 22 C; 43 G; 100 T; 0 other; tgtgggcgaa tattttaatt gtttaaaatc atattaatta ttaaataatt ataataatac 60 ggacagatga taaaacatat gtgtgatata ttgaatgtaa aaaaatattg aagctggagg 120 tttttgtttt atattcatat agcgttgtgt ttttttgact gccttaaccc atgttgttga 180 ggaagaaaaa aaataataag gattcatcat tgtagtggtt gttcttattt ctcttagtga 240 ataatccttc a 251 // ID P-2_Hrobusta repbase; DNA; INV; 4115 BP. XX AC . XX DT 10-MAY-2011 (Rel. 16.05, Created) DT 10-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW P; DNA transposon; Transposable Element; P-2_Hrobusta. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-4115 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4115 BP; 1544 A; 500 C; 543 G; 1528 T; 0 other; cattgaatta tatataagaa tggctttata gtgttgacct tcatctcaag gtcaggaaac 60 ttggaagcaa acttttttag caccaaattg tcaatgggtt tttccgtttt ccaatctata 120 tttagcttct tccctgttaa tgtgccaccg gcggcattct tgtaaacgtt ttttgctgct 180 aaaatcattc tacggcagtt ttaatcgatt tttaacttaa taatttaact gaattatgta 240 aaatttttgg aatcatttat tatttgatcc attttgattt tataaagttt atttgtttaa 300 aagttttaaa aatgttacat tgtgccagga gacactaggc ccagactagt agaagaaaag 360 aggctttgct tttttgtatt tttatttaaa atttttgact ttttattttt ttaactatcg 420 ttaattaatg ttcaaatatt ttttatattg tttttatatt atgggtttat tattattatt 480 aggatgccct tgtaaaatgg ctttttgtca taagaaatac agaatacaaa gagtttcttt 540 attttaattt attgggtata tatatatata tattatatat atatatatat atatatatat 600 attatatata tatatatata tatatatatg atgaaaaatt ttaaatgcag ctatttactt 660 tcaaacttaa atcatattta gacgttttgc ttaatggatg cacatctcat caagcaacag 720 caaaggcaat caaagtatta caagtccaga atcacaatgc tgcaggcaac aataagaaaa 780 cttatgaatg aatgtaataa attaaaaaat gaattagcta catacaaagg ttttatgtta 840 tgtaaccata gcaaaatgaa ttgaaatttt gattaatttt ttgttttttt ttgcttctag 900 ataaaaaact cataccattt aaaaagagtt aacaaagtat ataatgatag ttacaagacc 960 ttttatatgc agctgcattt tcatggtcct catgcatacc aatttttaag aaaaagtgga 1020 ctaacactgc cacactcgag aacattgaac aagtaaaatt ttgttaataa tgatattctg 1080 taaactaata ttgctatata aagaaatatg tacagcaaat aaaatattgc aaatgctctt 1140 taatccttga ttcgatatca ataaagcctc aaataacgta taataagaaa attggtaggt 1200 tttttaaaat atattttgtt tgtaaaattg tgagtttaaa atatagcata ttttggtagc 1260 atatgctagt ttattccttg tgtattttat ttatttcttg attttttttt gtttcccatt 1320 tcaccataaa catttgtttt caatttattt aattttattg atgttaaatg ttaagcataa 1380 attatcaaac gcattgttta tttgtgaaga tgactttatt ggatatgtta actatggaga 1440 tgggaatgaa gacatagttg cttcagagat gttgattttt tatgttgtta gtctgagaga 1500 agcttggagg cattcaatag cttattttta tacaaataag ttattatctc acaagttgaa 1560 agattttata gagaaagcca ttgataagtt acataatgct ggagttaggg taaacaacat 1620 ttgtttttct taaagatatg ttaaaatttt tatacattat atttattttt tagggttgtt 1680 tcactggtaa tggatgggtt ggccactaat gtttcggcat gtggattgct tggatgctct 1740 ttcacatttc caaacatgca aacaagtttc ttgtccccaa caactggcca gaaaatatat 1800 atcatatttg atgcatgcca catgctgaaa ttaatgagga atatgtttca ttcttatatg 1860 atattttata ataataatcc aatcaaatgg aatcatatcg taaacttagt taattatcag 1920 caacaaatca aaatgaaact aaaaaataaa ttaaccaaaa atcgtgtaaa ttttgaaaag 1980 cataagatga aagttaaata tgcatcttga gtctttagtt gctctgcggc taaatcattg 2040 gaactcttaa acggatttta tgtttttttc tgacgtttct gaaacaatag cttttgtgaa 2100 aaacatcaat gatcttttcc atatattgaa cagcaaactg atactggata gagaatggaa 2160 gcaatacatt acaaagagca acctgacagc cactatagat aaactttatg aaatgaaagc 2220 ctacttacta agtttgaaaa atttagatgg tgttacattt acaaaactca aggttttttt 2280 ttatatatct caatatctga actaaaactt aattacaaat aatttcagaa aaatggcttg 2340 gacaattgac atatccctca aaaggagtat taaaagtaaa atgatacatt tatttatttt 2400 ctggtaaaaa caataaattt taatgatttt cagatctgcc aagaatcaga aagattatta 2460 aaaataaata attattgttt taagaaaaaa cttgtaattg aaagcaacat actcccacaa 2520 tgtgtcaacg aagatttatt taacatggaa aatcatttca atgatgtaca gaatgcaaac 2580 cactattatg aattaataaa aaaaatcatc atatacaaca ttaagacgtt ttaacttaac 2640 taaagggttt tctatcaaca tgcagtttta ataaaagtaa attaaataat aaaatgcaac 2700 gaaaattata ctatttaaat ttattaaatc aattgtttta ttgaatttgt acagtgcttc 2760 taatagatta catcagaagc aaattctaca accttagttt tatcttatat atatataaat 2820 atatatatat atatatatat atatatttca cttttcaaga tagttgaaag ataaatttat 2880 aattatgtaa atcaaacatt ttgcagttat acaacaaata aacaaaaccg aaatccagaa 2940 ataaaaaaaa gtaagactga tctaactacg agttaagtta attactaatt atagcaagat 3000 agatgtttta taatcaacaa gcaagaaata taacaattta cacataaata taaaagtata 3060 catgttcatg atctagcacc ttcagggtta ttttataaat agtaatgtta aactaaaaat 3120 tattaaacat tccataccta ttctattaaa aatcaacagt tttataaaat ttaattaaat 3180 caatagcata atgaatttca tgttagctta atattaatta aactattatt tttttaatat 3240 attaatgtca attacagttg aaatagttaa gttaaagaat ttgattttaa caaattatta 3300 tataattaat ttaatatatt aataaattat atataaatca tagcatttaa agttaaaaca 3360 gaaaaaaaag attgtttgct tttactacca aaagaaattt taaaaacctc ctctgtgatt 3420 aaatttggtt tgggtatgtt atgttgaaag caaatattcc tttgtattgt aagctttggg 3480 gtgttgaaag attctagttt aggacatatg cacaggaatt acaccttaat ttaaaaatta 3540 ttagttgatt aatttattta taacaatata aattgatcat gagttatatt ttttaaatat 3600 taataacatt tcatttttac aataaaattt ttacccactt cttttcgttt tctagattct 3660 aatttaattt aggaatgatt cttactctaa atgtctaatc atcgtgattc gtgggtgctg 3720 ggaaaatgaa tgaagaaatc tttatttcag aaaaatgagg atcgctaacg aacttattac 3780 aaatcttcga tttttccgat gcatcaaaaa aatggcaata tctatttaat ttaacgatta 3840 aatacggcac attataaaac ctaaaacttc acggaatgtt gaaaacgaaa taaattttaa 3900 caaacgagct gaatgtagat cggcaaagag tcattaacat aacttcggga aggccaagat 3960 tttttctaat tatatctcag atttttctaa aaataaatca aaatgctcat gaataattaa 4020 tttttaggaa cttgcttcca gaggtaactt gcccgttgaa aaaaattttc aatgttaaaa 4080 cgctggtagg gcgcttccag ttatataatt ccatg 4115 // ID BEL-2_DWil-I repbase; DNA; INV; 4144 BP. XX AC scaffold_177039; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_DWil_; KW BEL-2_DWil-LTR; BEL-2_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_177039; Positions 333 4476. XX CC Positions [3326-3766] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 846..3410 FT /product="BEL-2_DWil-I_1p" FT /translation="MLRLQKSLKGKAYELVRDKLLLPILVPDVINTLKTFF FT GRPEQILDRLIDKIRKIYVHKDKLESIVDFAMAVRNACATMEACNLTSHLN FT NPMLLRELIDKLPTQQKLNWAMHPRNESIPVIKSFSDWLYKIAEAAATVMP FT GQSTKANNVNTHSQAHQQPQEGPNNRENGWQQNCKPKLQCVMCASTDHQVA FT LCQSFKRLSVDDRQRIINEAKVCYRCLKSHRRKCFSNRDCGIDNCNSKHHP FT LLHKGTGRTEVVTAHHKDNVVEGQFFRVVPIKLYGANVTFNTFAFLDEGSS FT VTLIDESILDKLKISSTPEPICLQWTGEETRNEEDSVKTHLQISEAGTDKK FT LWLYNVHSVKKLGLPKQTINVEELSKQYPYLRGLPLKSYQNGTPLILIGSN FT NWNLAIPRKIREGRRNEPIASKCTLGWCIQGSTNSSKHVTMHHCECNWSEI FT DKAIKESFTMEPIASRELQSKNDKRAKGILEDTCKKVNGRYEVGLLWKDDT FT VILPESYTAALKRLQCLRAKIKRTPELFDKIQSQIDNLLEKGYASELRPEE FT ISDRKERVWYLPIFIALNPNKPNKIRLVWDAAAKSHGKSLNDFMLTGPDYL FT NPLTSVLMAFRVGRIAVCADIAEMFHQINIQESDMHVQRFLWLNKNDHTPR FT IFVMRAMTFGINCAPCIAHYVRDKNAVEYFSDWKRLYRAVANIMLYAQKLR FT AKCSGKPLPQTLSPENIQAAKSLLCRQAQLATYSNEINSLKDKSVVDKSSS FT LIALNVYLDSEDIIRVKGRSDSITSDGDAIVIPRHSRIAFLIAKDYHEKSH FT HIFHECTINLIRSVYYIPRLRVLYKMSEMHANTAKYPQRSLKCHKWHLYRS FT PD" XX SQ Sequence 4144 BP; 1387 A; 963 C; 859 G; 935 T; 0 other; catttcttta aaaaagttta ttgaggttaa aaagtttccg cgtgcccgaa tttcaccata 60 tgtatcaagt gcaaatacat aaagacacac atatacatgt atattagagc gtaactacac 120 ataaagtgtt caaagccatg catgtgtaaa aaacctttaa gtagtacatg cacaaattgc 180 aggggataaa ttcatagcac aatataatcg gaaaccaata gactaaaaca aatccaaagc 240 atcatgaata aagatcaaga catttgtagt gcctgtaaca agagtcgcct aaacagtagt 300 acccaacatt gggttcaatg cgacaattgc cagaaatggt accactttca atgtgctgga 360 gtcaatagct caatagcaga ggctgaatgg atttgcaaca gctgccaaca agcgctggac 420 aacgtggcat caacaagcac tggtcgtgac ccgtcaacag gagcgcccaa tataccaagt 480 gccaacaaag ccaccgaaca acattgcccc gaatctaacg cggcaaatac gtcttatctc 540 ggcaaactaa acttagtcga gcaaactcaa tcgccaagct caaattattt gatgttggca 600 atgctggagg aaaggcggga agccgaaaaa agatacatcg agcaaaaata caacttgctt 660 gcacagcagc aaggaacgca ggtaacgtgt ttccaattaa atggacctac tgcttctcag 720 ctagccgcaa ggcaagtttt acctgcagat ctacccatct ttaacggaaa tccagaggac 780 tggccaatct ttattagcat gtacgagaca gctgccgtat agcggggttt tcaaacgccg 840 aaaacatgct acgacttcag aagagcctca aaggtaaagc ctacgaacta gttcgtgata 900 agctactgct gccaatactg gttcctgatg ttattaatac gctcaagaca ttcttcggac 960 gcccggagca aatactcgat cgattgatcg acaaaataag aaaaatatac gtacacaagg 1020 ataaattaga atccatcgtt gatttcgcaa tggcggtgag aaacgcatgt gcaaccatgg 1080 aagcgtgcaa tttaacctca catttaaata acccgatgct tcttcgagaa cttatcgaca 1140 aactacccac ccagcaaaag ttaaactggg ctatgcatcc acgtaacgaa tccatcccgg 1200 ttataaaatc atttagcgat tggttgtaca aaatagctga ggccgcagcc actgtcatgc 1260 ctggccaaag cactaaggcc aataatgtca acacacactc acaagcacat caacaaccac 1320 aggaaggacc aaataatcga gagaatgggt ggcaacaaaa ctgtaagccg aagctacaat 1380 gcgtgatgtg tgcttccacg gatcaccagg ttgctctatg tcaaagcttc aagcgactgt 1440 cggttgacga taggcaacga ataataaacg aagccaaggt atgctacagg tgcctcaaaa 1500 gtcaccgccg taaatgcttc tctaatcgtg actgcggaat cgacaattgc aactcaaaac 1560 atcacccgct tcttcacaaa ggcacaggca gaaccgaagt tgttactgct catcataaag 1620 acaatgtagt ggaaggacaa ttttttcgtg ttgtacccat caaactgtat ggagccaatg 1680 taacctttaa cacgttcgca tttttagatg agggctcatc agtcactctc atcgatgaaa 1740 gcattttgga caagctgaag atctcttcaa cgccagagcc aatttgcctg caatggaccg 1800 gtgaagaaac cagaaacgaa gaggactccg ttaagaccca cctacaaatc tcggaagcgg 1860 gaactgacaa aaagctatgg ctctacaacg tccactcagt aaagaaatta ggattgccga 1920 aacaaaccat caatgttgaa gaactatcca agcaatatcc ttatttgcgt ggtttgccgc 1980 taaagtcgta ccaaaacggt acaccactaa ttttaatcgg cagcaacaat tggaatctag 2040 caataccccg caaaatccgc gagggcagac gaaacgaacc cattgcttca aagtgcacat 2100 taggatggtg catccaagga tcaacaaact caagcaaaca cgtaacaatg catcattgcg 2160 agtgcaattg gagtgaaata gataaagcca taaaggaaag tttcacaatg gaaccaatcg 2220 catctagaga attgcaatcc aaaaacgata aaagggcgaa gggcattctc gaagatacct 2280 gcaaaaaggt caatggtcga tacgaagtcg gcctactctg gaaagacgac acagtcattt 2340 tacccgaaag ctataccgca gcgctgaaac gcctccaatg cttacgggcc aaaattaaaa 2400 gaacgccgga actgtttgat aaaattcaat cacaaattga caatctatta gaaaaaggat 2460 atgcatcgga gctcagacca gaagaaatat ccgatcgaaa ggaacgcgtc tggtatctcc 2520 caatatttat tgctttaaat ccgaataagc caaacaaaat acgactagta tgggatgccg 2580 ccgccaaaag tcatggaaaa tcgctgaatg attttatgct tactgggcct gattatctaa 2640 atccattgac ttctgttcta atggccttca gagtaggacg gattgccgtg tgcgcagata 2700 tagccgagat gtttcatcaa ataaacattc aggaaagcga catgcacgtc cagcggttct 2760 tatggctcaa taaaaacgac cacactccaa ggatcttcgt tatgcgcgcg atgacatttg 2820 gaattaattg cgcaccttgt attgcccact atgtacgcga taaaaatgca gtagaatact 2880 tttctgactg gaagcgccta tatcgagcag tggccaacat catgctgtac gcccaaaagt 2940 tacgcgctaa gtgttcggga aagcctctac cacaaaccct aagccccgaa aatatacaag 3000 cagcaaaatc tcttttatgc agacaggccc aattagcgac gtactcaaat gaaatcaatt 3060 cgctcaaaga caaatcagtg gtggacaagt caagctcatt aattgcttta aatgtttacc 3120 tggactcaga agacatcatc cgcgtgaagg gcaggtcaga cagtataact tctgacggtg 3180 acgcaatcgt aatcccaaga cacagcagaa ttgccttttt aattgcaaag gattatcacg 3240 aaaaatctca ccatatattt cacgagtgta ctatcaacct tattagaagc gtatattata 3300 taccacgcct aagagtccta tataaaatgt ccgaaatgca tgccaacact gcaaaatatc 3360 cacagcgaag cctcaagtgc cacaaatggc acctataccg atcgccagac tagcgagaga 3420 aatcttctcc gacaacggaa ccaatttcaa agctacggaa aaactagtta aggaggaact 3480 tgtgaagatt gacttcgatc aaatcgccgt tcactatgaa gctataaaat ggcgttttaa 3540 cccaccagct gcaccccaca tgggaggagc gtgggaacgt ttggtccgga ctgtaaaatc 3600 cgtcctaaag gccatttgcc cctctagcaa ctttaatgac gaaacacttc gaagtgcctt 3660 aatggaagcc gaatttataa ttaactccag gcctctatcg tttgtttctt tagatacagg 3720 tgacgatgaa gcgcttacac ccaaccactt gcttctggga tcaaaagcgg gctacaaacc 3780 agtatcagga aatgtcgccg acctaaggtt acgatggcat caaacgcaat cgtttgctga 3840 tcgcttctgg aaaagatggg tgaaggagta cacaccaaca ttaactcgac gcagcaaatg 3900 gtttactaag cgtccaccag tcgatatcgg agatgtagtt attgtcgtgg acgataatct 3960 gccgaggaac ctgtggccaa aaggccgcat aaccgatgtc gtaaccgcca aagatggcca 4020 agttcgcaga gcaaccatat gtacaccaaa cggcatcatg atacgccccg tttccaaaat 4080 tgctgttcta gacgtcggat taaaggaaag tgaacatcct gaggtcgttc acgggggggg 4140 ggaa 4144 // ID Gypsy-4_CQ-I repbase; DNA; INV; 8328 BP. XX AC AAWU01033212; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_CQ_; KW Gypsy-4_CQ-LTR; Gypsy-4_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8328 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 387-387 (2011). XX DR Genome; AAWU01033212; Positions 25247 16920. XX CC Positions [4723-5196] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 317..1543 FT /product="Gypsy-4_CQ-I_2p" FT /translation="MEELNKLDSYFSLEIANLLEATPKSKEEIEQKTTEIA FT EAVENYKQLLIASRDTLQPKEFNKINNGFKEDHKNLQEALEVFAKKLIIVS FT DDEEKYGEVENLKIIPVSLKIHKNPKNPNSYEFPMDSAINCIPDFFGHAED FT LTSFIDQINYFYSKMPKGVSHTPLINVIKLKLKGRAKFFANSISNLPWEKV FT EQNMIEEFSVKKSSVNIFKTISTISQKQFEKFKSYKNRALEILADLESVED FT FNKNSLVMKNFKLHFIAGLNNIELQRTARNIVETDFRQFLNTLEHHYVSDE FT EFEDLQKHVQKMNIQTPTVKTNFGSQFKHYNNNNNGNLANNAPVFRNNYQQ FT NYHRNNDNRNVTQENNRFRHQQQHRDFNQNHSNNRPNQNFNAHRGISSQQN FT NMSNNSKTYYPQRKN" FT CDS 1933..5376 FT /product="Gypsy-4_CQ-I_1p" FT /translation="MGSVIIELLIGNTIFSTKFYVMKNLPVPGIIGAEFLK FT RHTLFIDKNFNFIVLQKPRLYFNNHYNKNEENLADINCEEISTSSDDENDG FT NEENANIIAYCENNAQTYEIHNSPQSNFDDELCIPKELNLDIDKDCDVIEY FT KKFKYVCGEERLKKLLETINLKHLNNENHDAIVFMIRKFNKLFFLEGDELT FT FTNAAIHEIETTTNIPINKRQYRMPESTKTHIDEQIEEMLKLGIIQPSKSP FT WNAPVLCIPKKVGADGKKKYRIVVKFRALNFITKPFVFPIPLINEILDNMV FT DAQYFSSIDLKSGFYQILIHPKDAAKTAFSTWKGHYEFLRMPMGLKNSPAT FT FQKLMNIILFEIQPVKAFVYLDDIIIFGNTIKEHNDNLFKILNALSRHNLK FT VEPSKCSILKTQIKYLGHTIDKHGIRPTEDNINIIKNLKRPQNIKDVRSFL FT GTVNFYGKFIPNMADKRKHLNNLLKKDVKFKWSEDCEKAFHDLKQCLMTEP FT ILVRPNYNDTFVITTDASDYAVGAVLSNAKTNDNPIAYASRALSISEKKYY FT TIEKELLAIVWAIEYFKHYIFNQQFIVYTDHRPLIAIWRLKETSPTLSKLR FT LKIQGIGCDIRYKQGKENVVADFLSRIKHEDVEEINSENNLIAVVTRQQKR FT QQKDSSNQIGNSKNLIIDDINIDITNDISQNEENLNETIDVLQTIDIGNEK FT EDDSWKKYSLDDFFDAQNKDEIDFKLLKFTKTMLNESDADATFVILNSKTA FT FKELSQLVDLPHGMKDYLYDNILSIPEHKMYGLIINGSVKSVTNIEELFNG FT LVKSFENCPDFIKSVRNIQIVSFRNLQHNPDVNILRFIAGKYKKIFTLYAS FT NDTRMFVKMEDRDTVLKEFHDSPLGGHVGGRRMLQRISPLFTWDNMRRDVE FT NYVKQCELCQKNKIWPKHKIPMKITTTSTEPFQKIYMDIVILTISEDNNRY FT GLVIQDDLTRFLVVAPLPDQESTTVARAFVEHFICRFGAPAEVVTDQGANF FT MSNLLKHTCKILKIKKINTSAYHPQANLVERSNRELKTFLRNYIAGNPQTW FT DQLLPFFTFQFNTTVNSSTGYSPFELLYGRQARIPNSVYNVTETELNYDDY FT VTEMKANFKNIHDKAVQNVLASKHKRKEIYDKNAKTRIMK" FT CDS 5454..6425 FT /product="Gypsy-4_CQ-I_3p" FT /translation="MSNLHIFSIKARESAPICSLTGYGTDHSRIFILEIVG FT FTSPFRKKVSDENYDIFTYYNSKITVVENHIKNQLRNVYRNIMTESCKIDK FT ALLETKLTLARLNPAEFVKSIIKRDGFTAVTAGEVLHIIECKPVFVTMRNA FT ENRCFQEMPISYNNVSMFLAPVTRIIQDKGTEIDCSPLLPVKYVLGGRWYT FT VDSRLRETYNPIILTSDLVTDWNYEAIPNLIETGIYNPESINKMRNMIFGQ FT NAGRIAATIMYRTLAGQNPSHQGFKFDSLLASPIMDHELIRLLKAIMKGIS FT IFSEISSFFLGTYIIIVQVIKIKNKLFGAVIG" XX SQ Sequence 8328 BP; 3293 A; 1238 C; 1364 G; 2433 T; 0 other; aagtggtgac aagcggaaag tttcactctg aaactggaac gtcatttctg ctggcttaac 60 cagttagaaa agttttgtaa tagaaaagta aagtgttaga agtgtcacaa gaatcttaaa 120 atgataattt ggtaagtacc atttttttcc atatttttac taacaccttt tttatcgggg 180 acttagagtt tttctgacca actcaaaact taaaaaaaaa cttgtggatt tctctaattt 240 aggaaacagc ctgagagtct gcagaaaata atttactttt tttaaaagta agaaaaattt 300 ccttccctga atcacgatgg aagaattaaa taaattggac agctattttt cgttggaaat 360 agcaaattta ttggaagcta ctcccaaatc taaagaagaa atcgaacaga agacaacaga 420 aatagcagaa gctgttgaaa attataagca actattaatt gcttcaaggg atactcttca 480 acctaaagag ttcaacaaaa ttaacaatgg gttcaaagaa gatcacaaaa atcttcaaga 540 agctcttgaa gtttttgcta aaaaactaat tatagtttct gatgatgagg agaaatatgg 600 cgaggtagaa aatttaaaaa ttattcccgt cagtttaaaa attcataaga accctaaaaa 660 tccaaattct tacgaatttc caatggattc agcaattaat tgcattccag atttctttgg 720 acatgcagaa gatttaactt catttattga tcaaattaat tatttttaca gtaaaatgcc 780 aaaaggagtg tctcataccc ctttgatcaa tgtaataaaa ctgaaattaa aaggtagagc 840 aaaatttttt gcaaattcaa tttcaaattt accatgggag aaggttgaac agaatatgat 900 tgaagaattc agtgtaaaaa aatcatcggt aaatattttc aaaacaatta gtacaatctc 960 acaaaagcaa tttgagaaat ttaaatcata caaaaacaga gcattggaaa ttcttgcaga 1020 cttagagtcc gtagaagatt tcaataagaa cagtttagtt atgaaaaatt ttaaacttca 1080 tttcattgcg ggtttaaaca acattgaatt acagcgcact gcgagaaata ttgtggaaac 1140 agattttagg caatttttga atactctcga acatcattac gtcagtgatg aagaatttga 1200 agatcttcag aaacacgtac agaaaatgaa catacagaca ccaactgtga aaactaactt 1260 cggaagccaa tttaaacact acaataataa taataacggt aatttggcga ataatgcacc 1320 agtttttcga aataattatc aacaaaatta ccacaggaat aatgataaca gaaatgttac 1380 gcaggaaaat aatagattcc ggcatcaaca acaacatcgt gattttaacc aaaatcattc 1440 aaacaatcga ccgaaccaaa attttaacgc tcatagagga attagctcgc aacagaataa 1500 tatgtctaat aattctaaaa catattatcc acaacgaaaa aactaatttt tggggttttc 1560 gatagatttg gcgtccgaaa tacccggcaa aatggaaaat tcaatagacg ctctaatttc 1620 cgagctcgaa atcgttttag gagacggaat tataatcgga ataatatttg gaataataat 1680 agtttaagag agcaaagaga attttataac ccccctcacg aaattctcaa aataatttta 1740 actccttttc tacaatttag attaagaatt gcttcaaaac aaaacccgtt taattttatt 1800 cactacttgc ttgacacagg tgcaagtgtt aatataataa gaagcaatgt tttaaataaa 1860 ataggttatt ctcatgtcaa ttttaaagat aatatttctt taactggcat caataatacg 1920 tgtaatgaaa caatgggttc agttataatc gaattgttga ttggtaacac aattttcagt 1980 acaaaatttt atgttatgaa gaatttacct gtaccgggaa ttattggtgc agaattcttg 2040 aagagacaca ctcttttcat tgataaaaat tttaatttca ttgttttaca aaaaccaaga 2100 ttatatttta ataatcatta taacaaaaat gaggagaatc ttgcagatat taattgtgaa 2160 gaaatatcta cttcaagtga cgatgaaaat gacgggaatg aagaaaacgc aaatattata 2220 gcttattgtg aaaataatgc tcaaacgtat gaaatacata attcacctca aagtaatttt 2280 gatgatgaat tatgcattcc caaggaatta aacttggata ttgacaaaga ttgtgacgtc 2340 attgaatata aaaagttcaa atatgtatgc ggcgaagaaa gattaaaaaa acttttagaa 2400 acaattaatt taaaacatct aaataatgaa aatcatgacg ccatagtttt catgattcga 2460 aaattcaata aattattttt cctcgagggc gatgaattga cttttacaaa tgcagcaatt 2520 cacgaaatcg aaacaacaac aaacatacca attaataaac ggcaatatag aatgccagaa 2580 tctacgaaaa ctcatattga cgaacagatc gaagaaatgc ttaaattagg aattattcaa 2640 cctagcaaaa gtccttggaa tgctcctgta ctgtgcatcc cgaagaaagt gggagctgat 2700 ggcaagaaaa aatacagaat cgtggttaag tttagagcat taaactttat aacgaaacca 2760 tttgtttttc cgattccatt aataaacgag attctggata acatggtaga tgctcaatat 2820 ttttcttcga ttgatttaaa atcaggattt tatcaaatac tcattcatcc aaaagatgcg 2880 gctaaaactg cattttctac ttggaaggga cattatgaat ttttaagaat gcctatgggt 2940 ctcaaaaaca gtccggcaac ttttcaaaag ctaatgaata ttattctttt tgaaattcaa 3000 ccagtgaagg cttttgtcta tctggatgac attattatat ttggaaatac cataaaagaa 3060 cataacgata atctgttcaa aattttaaat gctttaagtc gtcataactt gaaagtggag 3120 ccgtccaaat gtagcatttt gaagacacaa attaaatatt tgggtcatac tattgacaaa 3180 catggaatta gaccaacgga agataatata aatattatta aaaatttaaa acgtcctcaa 3240 aacataaaag atgttcgttc gtttcttgga acagttaact tttacggaaa attcatacca 3300 aatatggcag ataaacgaaa acatttaaat aatttactaa aaaaggatgt caaattcaaa 3360 tggtcagagg attgtgaaaa agcatttcat gatttaaaac aatgcttgat gacagaacca 3420 attttagttc gtccaaatta taatgataca tttgtgatta ctacagatgc tagtgattat 3480 gctgtagggg cagtattatc aaatgccaaa acaaatgaca atccaattgc atacgccagt 3540 agagcattaa gtatttctga aaagaaatat tatacaattg aaaaagaatt acttgctatt 3600 gtatgggcta ttgagtattt taaacactat atttttaatc aacagtttat tgtctatact 3660 gaccatagac cattaattgc aatttggaga ttaaaagaaa cctcaccaac tctttcaaaa 3720 ttaagattaa aaattcaagg aattgggtgt gacattagat acaaacaagg caaagaaaat 3780 gttgttgcag attttctttc tcgcattaaa catgaagatg ttgaggaaat taattcagaa 3840 aataatttaa ttgctgtcgt gaccagacaa cagaaacgtc aacaaaagga ttcatcaaac 3900 caaattggaa attcgaaaaa tctaattata gatgatatca acatagatat aacaaatgat 3960 atctcgcaaa atgaagaaaa tttaaatgaa actatcgacg tattacaaac aatcgatatt 4020 ggaaatgaaa aggaagatga ttcatggaaa aagtattctt tggacgactt cttcgatgct 4080 caaaataagg atgaaattga ttttaaatta cttaaattta caaaaacaat gctaaacgaa 4140 agtgatgcag atgccacatt cgtaatacta aatagtaaaa cagcatttaa agaattatct 4200 caattagttg acctaccaca tggtatgaaa gattacttgt atgataatat tttatccatt 4260 ccggaacaca aaatgtatgg tttaattatc aatggctcag ttaaatcagt aacaaacatt 4320 gaagaacttt tcaatggatt agtaaaaagt tttgaaaatt gtcctgattt cataaaatca 4380 gttcgaaaca tacaaattgt ttcgtttaga aatttgcaac acaatccaga tgtaaacatt 4440 ttaagattta tagctggaaa atataaaaaa atattcacat tatatgcatc aaatgacact 4500 agaatgtttg ttaaaatgga agacagagat acggtgttaa aagaatttca tgattcacca 4560 ttaggaggtc atgtaggtgg aagaagaatg cttcaacgta ttagtcctct tttcacatgg 4620 gataatatgc gtcgagacgt tgaaaattat gtaaagcaat gtgaattatg tcaaaaaaat 4680 aaaatttggc caaaacacaa aattccaatg aaaattacga caacatcgac agaacctttt 4740 cagaaaattt atatggatat agtaatttta actatatccg aagacaataa tcggtatgga 4800 ttagtcattc aagacgatct tacgagattt ttagtagttg ctcctttacc ggatcaagaa 4860 agcacgacag tggcaagagc ttttgtagag catttcattt gtcgatttgg tgctccagca 4920 gaagtcgtaa cagaccaagg agctaatttt atgagtaatt tattaaaaca tacttgcaaa 4980 atattaaaaa tcaagaaaat taatacaagt gcctatcatc cacaggcaaa tttagttgaa 5040 cgttcaaata gagaacttaa aacgttctta cgaaattaca ttgcaggaaa tcctcaaaca 5100 tgggatcagt tattaccatt tttcacgttt caatttaata ccacagtaaa ctcatctact 5160 ggatattcac cttttgaatt attatatggt agacaagcta gaattccgaa ttcagtctac 5220 aatgtcacgg agacagaact aaattacgat gattatgtta cagagatgaa agcaaatttt 5280 aaaaatattc acgacaaagc tgtacaaaat gtattggctt caaaacacaa aagaaaagaa 5340 atttatgaca aaaatgcgaa gacacggatt atgaagtagt ttttgaaggt ttagtaaaca 5400 aaaccgttga tggaagtaat gaaaatgcca aacactttgc tgtttatagt accatgtcaa 5460 atttgcacat tttttcaatt aaagcaagag agagtgctcc aatttgttcg ttgacaggtt 5520 acggcacaga ccattcaaga atttttattt tagaaatagt gggatttaca tccccattta 5580 gaaagaaagt ctctgacgaa aattatgata tattcacgta ttataattca aaaattactg 5640 tagtggaaaa ccatattaaa aaccaactgc gaaatgttta tagaaatatt atgacggaat 5700 catgtaaaat cgataaggct ttgctagaaa cgaaactaac actagctaga ttaaatcctg 5760 cagaatttgt gaaaagtata ataaaaaggg atggatttac agcagtgact gcaggggaag 5820 tattacatat tatagaatgt aaaccagttt ttgtaacaat gagaaacgca gaaaatagat 5880 gttttcaaga aatgccgatt tcatataata atgtatcaat gtttctagct ccagttacta 5940 gaataatcca ggacaaggga accgaaatag attgtagtcc gttattaccg gtaaaatatg 6000 ttttaggagg tcgttggtac actgtagata gtagattaag agaaacatac aatccaatta 6060 tactaacttc tgacttagta acagattgga attatgaagc aattccaaat ttgatagaaa 6120 ctggcattta taatcctgaa agtataaaca aaatgagaaa catgattttc ggacaaaacg 6180 ctggcagaat tgcagcaacg ataatgtata gaaccttggc agggcaaaat cccagtcacc 6240 aagggtttaa attcgattcc ttattagcat cgccaataat ggatcatgaa ttaataagac 6300 tattgaaagc gattatgaaa ggaatttcaa tattttcgga aatatcatca tttttcctag 6360 gaacttatat aattatcgtt caagtaatta aaattaaaaa taaattattt ggcgccgtaa 6420 taggataaca taaaaaagag ctcaagaaaa tcaaattaaa aaaaatatat aaaaaaataa 6480 caaaaatgag atgcgaactg aatgaatcaa tgaatgaatg agtgtttata tggacaaggt 6540 actgcagacc agaatgaatg tgtaaaagaa tgttttaagt atgataaatg gttcgacata 6600 tccacaaaca aaattataat ttattatttt catttttttt attatataga aggtagttcc 6660 agaataatta gtaaatatac aaaaaccagc gttaacgacc tatttccagc ataaaaagag 6720 agagaggagg atgattccgc aagcgaaatc ttaggtaggc tactctcctt gtaataataa 6780 ttattttttt tattttttta gccatgacaa gaaaggaata tatattatgg cacattaatg 6840 accgagcctc gtaagcggta gatcggggtt caaaatcccg gctcggacca acacaactgg 6900 tgatcttttc ccttctggat tcgattgctt agtaaaggga aggtagtgta tcgtcacaaa 6960 ctggacctta tcacgacatc ataggtcgcc atgttaacat tagttgagtg aaaaactgcc 7020 actgaatccg ctttgtaaat gccggctccg atactcttca cgggtattcc catcaggaac 7080 tggggaagat ttactttaca catttaaaaa aaaggtaaaa tattcaactg ggcatagtaa 7140 ttaataataa taataatgat aagaaaaatc aatacaatgc aacaatacaa tttaaaagat 7200 cgtaaaatgt tttgaaatta agtgaaaaaa aaaaaacaaa tacacgttaa attataaacc 7260 cacatacaca aacactacac ttattaaatt aagattccgg agttcaccaa tgggacatca 7320 agatcaacaa gcatcgcaga agttcctcga gctctaacaa ttaatcgaaa acggttgagg 7380 tatgatagga tttgtgaggt gaaaggtgaa acttgattca aatggattga taatctgata 7440 attggaaaaa cgatggcaac agtgcagcaa taactttgtg tggaggaaag atatatctgt 7500 tcaatcaaaa taaatgtgaa tggaaccgaa tactttatga acgattaact catgagacga 7560 aggatgaaac atgatctgaa catcagatat agatcaaaac aattagatac tatacaatag 7620 ctactcgtta caattggatg aaaccgtata atcacagttt aatgctgatc aaaacaaatg 7680 aatggaactg caattcgcta caaattaaaa acaactgaat ggatagcaca gtgtattcac 7740 acataaagga ttgacgcaaa tctccacaca atttcttcaa atatgctgga aaagaaacga 7800 cataaacaga atggtatcag attaagttga caagaatcat tttcactgaa aaccaagcaa 7860 tgacttgatt attttaaacc gtaagattga tattgaataa aagagtggat atgacaacaa 7920 tataattttt tttcaacacg gaaggaacgt catcagccac aacatcaaca tcaaggttga 7980 tagttttttt tcatcagcca caacatcagc gagatcatca attaataatc atcatcatcg 8040 aacagaatca attgtaataa caatcaaaat taagttaaat attaagtcta accaatttcc 8100 aacatatacc tagctgacag tagatgatcg tctttagcag acaaacatct agctcgtaag 8160 tgtggggata tgaagcgacc ctcgccttac cagaaaaatt cacagtagat tttctctgtt 8220 aaaaaacgaa aatattagga aaaggcgaat caggagagtt gtaagaaaaa ggaagacaaa 8280 taaaaatcaa ttaggagcgt aggacgggaa aacccgacaa cactttta 8328 // ID R2B_DM repbase; DNA; INV; 3446 BP. XX AC AF015685; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Drosophila mercatorum R2 retrotransposon reverse transcriptase DE domain protein gene, complete cds. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2B_DM. XX NM R2B_DM. XX OS Drosophila mercatorum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mercatorum subgroup. XX RN [1] RP 1-3446 RA Malik S.H., Burke D.W. and Eickbush H.T.; RT "Complete R2 element from Drosophila mercatorum."; RL Unpublished. XX RN [2] RP 1-3446 RA Malik S.H., Burke D.W. and Eickbush H.T.; RT "R2B_DM."; RL Direct Submission to Genbank (23-JUL-1997)Biology, University of RL Rochester, Hutchison Hall, Rochester, NY 14627, USA. XX DR GenBank; AF015685; Positions 83 3528. XX SQ Sequence 3446 BP; 941 A; 807 C; 874 G; 824 T; 0 other; agggagtaaa tgggtttctg gtttacttgt aaatcattaa tcaagtcgac tcgtagacac 60 actgtgtccc acaaagttgt catgtggcct cctcgtggtg tctcccggaa gttatttggc 120 taggcgtttc ctggtcaaat aaactaattc gagcggcgaa ctggtcccgt tggttacctg 180 ccctcaggga tcgaaaaact gatggtacaa atgttcaata atgagcctcg gatagctgaa 240 aataactcag ttgaatatac tcctacggtt actcgcctag gagaccatca ggtacggaac 300 gcagatgcta atttgcagac tatgtttccg tgcagagaat gtgaacgatc tttcagaacg 360 aagatcggct tgggtgttca catgagacat cgacacaaag acgaactgga tacagcgcgt 420 cggcgtgtcg atgtcaaagc acggtggaat gaggaagaac tgtctatgat ggcaaggaag 480 gaacttgagc tcacagcaaa tggtgagagg ttcataaata aaaaactagc ggagatcttt 540 acaaaccgca gcgtcgatgc tatcaaaaag tgtcgacaga gagataatta caaggcgaaa 600 atcgagcagc tacaggggca agcagctctt atctcagaag caaatgaacc tcctaccaca 660 cagcgccgcc ctagtttaag tgagctcgag gtaactccct catcgtcaca ttcagttcca 720 atcgctccac caccgataca ttcggatgat atcctcttgc aggagcttca gggcatgtcg 780 cctgtcgcag taagaagatc ttggagagtc gaggtcttgc aatctatcat tgatagagcg 840 catatctcgg gcaaagaagc aactctccag tgtttgtcta actatctctt ggaaatattt 900 ccgaatcgaa acgaccgccc cagttccgcg acggtccccg cccggcgccc tcgcaataga 960 agaattagta gaaggcagca gtacgccaga tgcataaaat ccttgcttga tggaaccgac 1020 gagtcggcat tgccaaacca gagtattatg gagccttatt ggagacaagt catgacacag 1080 ccgagcccca gcctttgcag caacaccgtt ccccgtaaag ggaacatgca agagggggta 1140 tggtcaccga ttacgtctag ggacctacaa gtgcataaag tgccattgac ttcgtcgcca 1200 gggcccgatg gaattacttc acaaactgcc agaagtattc ctatcggaat aatgcttcgc 1260 attgttaacc taatactttg gtgcggagat ttgcccgtac ccttccgaat ggctcgaacc 1320 attttcattc cgaaaacggt aagagcaaac agaccgcaag actttcgtcc gatatcggtt 1380 ccttcgatcg tggtgagaca gcttaatgcc atcttggcat cccgactaac ggccgccgtt 1440 agttgggatc cgagacagcg aggctttcta cccacggacg gatgcgctga taacgcgacg 1500 atagtcgact tagtcctgag ggatcaccat aagcgttatg cgtcatgcta cattgccacg 1560 ctagatgtta gcaaggcatt tgactctgta gctcatgacg cggtattcaa caccgtaacc 1620 gcttatggtg ccccaaagag cttcgtcgac tacgtacgta gatggtacag tggaggtggc 1680 acctacttca atggtggaga ttggaggtca gaggaattcg tccctgctag aggtgttaag 1740 cagggtgacc ctctgtcccc cgtactgttc aacttgatca tcgatcggtt gctcaggtct 1800 cttcccaaag acatcggtgt ccatgtcgga aatgctaaag ttaatgcttg tgcttttgca 1860 gatgatttga tgctgtttgc ttccactcca aaggggcttc aggaactctt gaataccaca 1920 gtaaagtttc tatcttctgt tgggctaacc cttaacgctg ataaatgttt cacgatcagt 1980 attaaggggc aaccaaagca aaaggttacg gtggtcgaac aacgtacctt ttgtataggt 2040 cgcgcacgtg tccagctgaa gcgttcggag gagtggaaat atctcggcat tcatttcact 2100 gcagatggga gggctcgtta caatccttcg gaggacattg gtccaaagtt ggagagatta 2160 atgcagtccc cccttaaacc acaacagaag ttattcgccc tcagaactgt cctggtgcct 2220 caactctatc acaagctcac gcttgggagt gttgctctag gcgttctgag aaagtgtgac 2280 aagctggtac gatcattcgc taggaagctg ctgggtcttc cgttggatgt gtcagttgcg 2340 ttctaccatg cccctcacag ttgtgggggt ctcggtatac cctcggtaag atggatagct 2400 ccgatgttgc gcactaaaag attggcagga attaactggc cccatctcga acaatccgag 2460 gtggccagtg cctttcttag tgaggagctc cgacgggccc gggatagagc gaaggctgga 2520 gttaacgagc tgctatcaca accaaagatt gatacgtact gggcggatag attgtacacg 2580 tctgttgatg ggaatggtct ccgtgaagca aggcgctatg ccccacagca cggatgggta 2640 agtcagccca cgcgtctgat gagcggtaaa gcttatcgga cagggatcca attacgcatt 2700 aacgccctcc ccacgaggtc tcggaccacg cggggaaggc atgaaatgaa tagacaatgt 2760 cgtgcaggat gtgatgctcc gagtcataat cacgtcctgc aaagatgtca tagaacacat 2820 gggagtcgag tgtcacggca taatggagtg gtttcctatc tcaagaaggg gcttgagaca 2880 aggggctaca ccgtctattc agagcagagc cttcacggcc aaaatcgggt atataaaccc 2940 gatatagtag cattccgaca tgacagcaca atagtcgttg acgcgcaggt agtgacagac 3000 ggactggatc ttgacagagc tcatcagagc aaagtcgaga tctataacag acaggactta 3060 cttacgacat tgcggtctgt atatcgggca cgtgagaaca tcgaggttgt ctccgctacg 3120 ctaaactgga ggggtatttg gagctttcaa tccataacaa ggttgaggac tctgggtatc 3180 ctaacagctg gtgacagcaa tgttatcagt tccagggtag tgtccggccg agtctacagc 3240 tttaagacct tcatgttcca tgcagggttt catagaggaa tggcttagca gctctgaaag 3300 cgtatcctac aaacgctgct aacttgcaca acatcttaaa aagttataaa gaaagcacac 3360 gggtgtctta gcttatcaga tatctctggt atagcgagtc atatagcgca acaaaaaaaa 3420 aaaaaaaaaa aaaaaaatag ccaaat 3446 // ID BEL-48_CQ-LTR repbase; DNA; INV; 361 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-48_CQ_; KW BEL-48_CQ-I; BEL-48_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-361 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 250-250 (2011). XX DR [2] (Consensus) XX SQ Sequence 361 BP; 104 A; 90 C; 98 G; 69 T; 0 other; tgttcggcac ggcgccgaat gttttgacct gatttgcgca actgcctccc agtgtgcgtg 60 agtccagccc tgcgtcgctg tcagcgacgt tgctgccaac gtgacagcgc gaaacgacgc 120 aacgagcgaa aacggcgcga accgaagcgc gcgcaaacga aaagaaggag agagacagag 180 tggaaaattt tcactccgtt tcgagacgcg aacgagacac gtcgtgcagt gtaaataaaa 240 atcggaaatt tgaaagtttt gtgaagtaaa aggtgtaaaa gcaagcccca gtgtttttat 300 cactccagaa tcccgaatac agtccacccg cagaatccag ttgggggatt tggccccaac 360 a 361 // ID hATx-5_HM repbase; DNA; INV; 2631 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hATx-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2631 RA Jurka J.; RT "A distinct, diverse family of hAT transposons from Hydra RT magnipapillata."; RL Repbase Reports 8(12), 1824-1824 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 546..2501 FT /product="hATx-5_HM_1p" FT /translation="MSIVNGSKAWQYFCKEKSGESAICKKCNVKIMCKGFS FT TSGLLRHLKNAHKDLDLNMKRLHEDTGSEIKIVQKKITSFVKQQTIEEIVS FT KLATVDGFSINAITKSEFIRKSLSEKGMLLPKNPSHVMGMIKKQYDLAKQT FT VVLEITKEISSGSRFSLSLDEYTSLKNRRYLNVNLHLSTKFWNLGMTPIAG FT SLPAEKIVDLVENKLSDFSLSIKKHIIASVTDGASVMKKFGRLSGIEHQLC FT YAHGLHLAVCDVLYKKSDSSAKYIEFESQFQDSDENLSDYDEFDDVFDCND FT ESYSSTGVTFIDDLQNSVNIALTIQKVRKVVRMFRKSPLKNEVLQMYVKST FT FHKELKLILDVKTRWNSLVAMLERFLKVKTCILKAMIDLKEVLNISEDEYA FT IMNDITISLQPIKVGIERLGRSNATLLDAEGVMIFILDDLAEKKNFIANKL FT LQSVQQRIEERRNANLIGLLKFLNNPISYNQESNINSKLLLPSKHALKATA FT KKIYTRLYIEDASTSSSNNEIVEVSRDNESMVSDPSIPQPVAPNLLGKSIL FT TLEQKLDATICNVKQVEKAALENISNLKNLTKEIKLFEATGKRSESLERIY FT STLMTIKPTSIESERAFSAAGLFMGKIRSRLSDKSIDCLCFLKQYFILNSN FT ID*" XX SQ Sequence 2631 BP; 979 A; 395 C; 459 G; 798 T; 0 other; tagggtttct caaacgggat cccgggatcc cgttttcccg ggatcccggg gataaaagtg 60 gcagcgggat ttcccgggat atcactctta tatcccggaa tataattttt tttataattt 120 tacatatgat aactttgaac taattcgacc aaaaataaga aaaatattat tgcatttagt 180 tgtgttttat gtgttcatac cttatattaa aacctagcac aacaagaaat gttattgaaa 240 ctatcttaaa ctatcaactc aagaaatgca ttgcaatgcg atgttttctt ttttctcgtt 300 attataaaat atgagtaaaa aataaaaaag atttcttgca acgaaaacag taagtttgat 360 ttttaaactt ctaaattatt gaaaagttaa actttttctg tatcgaatta ttgaaatatt 420 taataatttt tgtatttaaa aaacggcaaa caaacactta aaagtcatac gtaagataac 480 atacatatgt aagaaatatt tgtaacttta cttttagttg ttttacgatt caatttagtt 540 ataccatgtc aattgtaaat ggatcgaaag catggcagta tttttgtaaa gaaaaatctg 600 gcgaatctgc aatttgcaaa aagtgcaatg taaagataat gtgcaaagga ttttcgacaa 660 gtgggctcct acgtcattta aaaaacgcac ataaagattt ggatttaaat atgaagcgtc 720 tacatgaaga tactggttcg gaaattaaaa ttgtgcaaaa gaaaatcaca tcgtttgtta 780 aacagcagac tatcgaggaa attgtgtcaa agctagcaac ggtggatggc ttctcaataa 840 atgccataac aaaaagtgaa ttcattagaa aatcattgtc tgaaaaaggc atgttattac 900 caaaaaatcc atcccatgtt atgggtatga taaaaaaaca atatgattta gctaaacaaa 960 ctgtggtttt agagataaca aaagaaatat ctagcgggtc gcgattttct ttatcgttag 1020 acgaatatac gtctttaaaa aatcgacgtt atttaaatgt aaatttacac ttatcaacaa 1080 agttttggaa cctgggaatg actccgatag ctggatcact tcctgcagaa aaaattgttg 1140 atctagttga aaataaactg tcagatttca gtttatcaat taaaaagcac atcatagcta 1200 gtgtcactga cggagcatcc gtaatgaaaa aatttggacg gctaagtggt attgaacatc 1260 agctgtgtta tgcccacgga cttcatttgg cagtatgtga tgttctttac aaaaagtcag 1320 actcttcagc aaaatatatt gaatttgaat cgcagtttca agattccgat gaaaatttga 1380 gtgattacga tgagtttgat gatgtatttg actgtaatga tgagtcatac agctcaacag 1440 gcgttacgtt tattgatgat ttacaaaata gcgtaaatat agccttaaca attcagaagg 1500 ttcgaaaagt ggttcgaatg ttccgcaaat cacctcttaa aaatgaagtt ttacaaatgt 1560 acgtaaaatc tacgtttcat aaagaactta agctgatatt agatgtcaaa acgaggtgga 1620 atagtcttgt tgcaatgcta gaaagattcc ttaaagtaaa aacatgtata ctaaaagcaa 1680 tgatcgattt aaaagaagtt ctgaatatct ccgaagatga atatgcgatt atgaacgaca 1740 taacaatttc tctgcaaccg ataaaagttg gaatagaaag acttggtaga agtaatgcta 1800 ctctacttga cgcagaaggt gtaatgatat tcatcttaga cgatttggcg gagaaaaaaa 1860 attttatagc taataaactg cttcaatctg tgcagcagag aattgaagaa cgcagaaatg 1920 ccaatttaat tggactacta aaatttttaa acaacccaat tagttataat caggaatcaa 1980 atatcaactc caaactactc ttaccgtcca aacatgcatt aaaagcaaca gcaaaaaaaa 2040 tttataccag gttgtacata gaggatgcta gcactagttc atcaaataat gaaatagttg 2100 aagtgagtcg tgataatgaa agcatggttt ctgacccgtc aatacctcaa cctgttgctc 2160 caaatttatt aggcaaaagt attttgacac ttgaacaaaa attggatgcg actatttgta 2220 atgtcaaaca agtagaaaaa gcagctctag aaaacattag caatttaaaa aacctcacaa 2280 aagaaatcaa actttttgag gcaactggaa agcgtagcga aagtttagaa agaatatact 2340 caacattaat gaccatcaag ccgacatcaa ttgaatcaga aagagctttt tctgccgcgg 2400 gtctatttat gggaaaaatt cgttcacgat taagtgacaa atcaattgat tgtttatgtt 2460 ttttaaaaca atattttatt ttaaatagta atatagatta gcgtgctaag ataaatttgt 2520 gtcagaaacc gcttgttatt atattaaaaa ttatgccggg aaatcccggg ataaataagt 2580 tacacaccgg gaatcccggg atgctaaatt tacgggaaat gagaaaccct a 2631 // ID DNA-TTAA-1_CQ repbase; DNA; INV; 733 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TTAA-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-733 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 65-65 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >89% CC identity. ~340 bp TIRs. TSDs are TTAA. Both termini CC show similarity with those of DNA-2_AAe. Likely a non-autonomous CC piggyBac element. XX SQ Sequence 733 BP; 212 A; 157 C; 153 G; 210 T; 1 other; cactcaaacg ctcgggtagt catttgaccc ctattttttt tcttaaaaat ctcataactt 60 ttggtagaat tgaccaattt ggatgcttcc ggttgcaaaa gatccagatt tgtctatatt 120 ttcaactgtc agatggggga caaatatggt ccacttttac cggagatatt ccggattccg 180 ttggggtacc tcggccctcc tatttgggtt tttggtcaat ataacaaaaa ctattccaca 240 gaacaatgcc ggaaatgatg caaaacttca aggcatcatc ctaatacatc atcctgcata 300 gatacatgac atggtcaaga ccttcgggca ccggaacagg ttccaaccgg aaacggttca 360 ctgagctgat gtcgattggt gcccaactgg aaccggctcc ggtgccccgt aggacttaac 420 catgtcaatt atttttgcag gatgatgtat taggatgatg ccttgaagtt ttgcatcatt 480 tccggcattt ttctgtggaa aagtttttgt tatattgacc aaaaacccaw ataggagggc 540 cgaggtaccc caacggaatc cggaatatct ccggtaaaag tggaccatat ttgtccccca 600 tctgacagtt caaaatatag acaaatctgg atcttttgca accggaagca tccaaattgg 660 tcaattctac caaaagttat gagattttta agaaaaaaaa taggggtcaa atgactaccc 720 gagcgtttga gtg 733 // ID Academ-3_HM repbase; DNA; INV; 6114 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6114 BP; 2172 A; 959 C; 1041 G; 1942 T; 0 other; tagtgggggg aataagggta aaaatagttc aaattgtgat accagcataa aacttggcga 60 aattgctgtc aacagctata caagttaaaa gagaaaagga ccccatttat aaatatcttt 120 ggcgttgagt tattgagcct tcatttttta gagaaatgga cttaaagaac tttatatttt 180 agtgtaaaat cagctttaaa taaaagcgga aacgatgcac gtggtcatac tggaaatttt 240 tataacaaat acaattcaat actttttttt aggtgagtgc aaacaaagta tgtaaacaaa 300 taaacgacga tttgttcgta agttggtaaa tgtatctttc atgataagtt aatattcgtg 360 aatatataag tcacgagtta ttttttcagt ttttcctgag ttaatgtaat ttaccaatga 420 atttcctatt tatatatttt tttattttgt taatttttaa ctatttctaa gcttttatca 480 acattataaa cttaataatg ttgattaaaa aaactgaaaa tagctagctt tatgatagag 540 cttgctattt tcagttgttt ttactgttat tttactatat taatgcaaaa aaagtaaaac 600 ctttaaaaat ctatataaaa ccaaataaca aaaataacct atagaagtta ataataaatt 660 gtaatgtctc agaagagttt caaacttgta aatgcaacag gttatgaatt atcaacaaca 720 acaaataatg cgacagagtg ggataaatgt tttatatgtc aagagcaatc atcaaccatt 780 agcgacatta ttaatccttc taaatcaaaa aaacagcaaa aagacactgg atataagaca 840 ttacttagta acctagagaa gttcaaagaa ataggtttga ttatataaca atatgtttgt 900 ttttaataca tctttttttg tcatttgctt ttgacagaaa attagtgagt taaatttatc 960 agatagaggc tgatattgtc ttatcaaatg taaatcattt tctgctttca tattttctca 1020 agtgttgaag caaattttaa ttttttcact ctttaactaa atttaatcca atgtttttta 1080 aaacaaacca tttgttttta tattatatct tgagtggaaa gtatttatct ccaaaaatct 1140 gttaagtaat tttttgctga aaattcagat ttatcttatg caggtagttt gtcagttggt 1200 ttaacttccc gactgcaatg cgaagatttg ctgcaacagc tttgcaacaa caacgcagta 1260 tatcacaaaa aatgttacac taattttgat aactatcact accagagagc atctaaacgg 1320 agaaaaatat tagatgtgac agaagagcga tcaatcccaa accatggtac aaggtcaaag 1380 tttcaggcca acaactttca agaaaattgt ttcttgtgtg agaaggtgga taacaaaaac 1440 ctaagtcaag tgatgactct tgaactagat aaaaaggttc gcaggtgtgc tgaaatttta 1500 tcagattcta agttacttgc aaagctaagt gaaggtgatc ttgtagctac tgaagctcat 1560 tatcataaag cttgtctcag ccaattatat aacagaatac gctctttaca gaaagaccaa 1620 ccagtaataa aaagcgaaag caacattgtt tatggaattg ttcttggaga gataatagaa 1680 tatatctatc agtgctatga aaatgaagca attattcctg tgtttcaatt atcagaacta 1740 tatcaattgt tttgcaaccg tatgaaaagt tatgataaaa gtttcaactg tgaaaacaaa 1800 actaggctaa aagaaaaagt tctaaacctt gtaccagagt taggggaatt taaaaaaggt 1860 agagaagttt tgctgacgtt taaagcggat ctgggaagtg ctatttctga tgcttgttat 1920 ttaacgaatg aggaagaagg aatatgttta gtaaaagcag caactatttt gcgaaaacaa 1980 atttttaaag tgctaagtac tgataacgtc tcaacaattt caaatggttt agaaagtgaa 2040 cttaccccaa acacattaaa aagctttgtt agaatggtgt tatatggaca gaaggtaaaa 2100 gatgctaaca ataattcagc acataccaaa gttgtagcaa ctgtttctca actcatattt 2160 tacaattgtt tgaaaaaaaa tgtcgaagcc aaacgagaag gaaaacatac ccgtcataaa 2220 gcaggtacaa ttccaccgtt tccattatat gttggtttaa acatacattc caaaacacga 2280 aagaaaggaa ttgtaaataa tttagcaaag ttcggagtta gtgtacctca aacaagagtt 2340 gatgccatta aaaccacact ttcaaatcaa ctatgcaagc tttacaatga taacaatacg 2400 gtttgtcctc cttcactcat tgaaaatgtt tttacaactg cagcaattga taacattgat 2460 cacaaccctt cttctagtac agctactaaa tcttttcatg gaacatcaat ctccatattt 2520 caacatgctg agatagactt acctgtaaag aggtatgatt atgatttttc ggaaaaagcg 2580 actaacgcta ttttggagtt accatcatat tacacaacaa tagagcctac taaagaccat 2640 tctgtggagt atcctataca aacaacaaat ttccaccaat tcgagaactt tgatgctttt 2700 agtgattcaa gagaatggct tacagctgta gttaaagttt tacaaaacaa atcagaaggg 2760 gaaggaacaa cgcacatttc ttggacagca tttaattcaa agacaatcat agatgaacag 2820 aaaacaaaaa atacttctat tttgttgcca atcattaatg aatcgatcaa ctcaacatct 2880 atggtaagac aaacttttaa tatagttaaa aaagttctcg caaaaatcaa cccaactcaa 2940 gtgcctataa taactgctga ccaaccagtt tatgcattag gtaaacaagt tcagtggcac 3000 tacccggaac tgtatggcga agataaactt ttaatgatga tgggaggttt acacatagaa 3060 atggcctcct tatctttggt tggagattgg ttagaaggca gtggttggtg tgatgcaata 3120 actaaagctg gtataacaac ttctggacga gctgaatcta tgctcactgg tcgaaaagta 3180 aaacgttcaa ggtatgcagc acaagtttca ttagctggtt tttattctct attgactgaa 3240 gcattccaga aagtatctac aacatccttt gacttgtgga tccatgaaca aagaacagca 3300 tacacccaat ttaattattg gtttactgct atggagttgc aagcaatagt cctgttgctt 3360 gttaagagtt tcagaatggg aaattttaat atgtttatca gtgccttgga acaaattgtt 3420 ccttggatgt ttgctttaga tcacacgcac tatgctagat ggcttccaat ttttcttgct 3480 gacatgaaga tgcttcctta taaacaccct gaggtttaca ctgagttctg caaaggtttt 3540 ttcacatttc aaaagacaag acgccctttt tcatctatgg ctattgacca ggcccatgaa 3600 caaaacatca aaatcataaa aggagatggc ggctcaattg actttttaga tagcccaagt 3660 gcgttgataa agtggatggt gtgtgaacca ttacttgctg aacttctaga tggatttgga 3720 gaagaagaaa aagataatga aaagagtcat cacaaagata cacctgtttt tgaaaagaat 3780 tataatgaac atgtatcaga gtttattgaa gttgtaaaag attttggaaa tccattttta 3840 gtagaagaag atactctcat acatttatca acaaaaatat tacttgacgc agatgctagg 3900 gcatccgtca aaaaagctaa gcaaacgggt attgagtcat ccagatgtta tattcataat 3960 aggttaacaa ctggaaggga ttcaatatat aaaacaatac ccaaaaataa cttacgcctg 4020 tttagagaaa aaaacatcct tgccactcaa aaaggaaggt taaaaacagt aatgctgaaa 4080 gaagaatgca aaatgtttgc atcactttat gttgcatctc aacatagaga tggagattta 4140 gatgattttt ttcgacatga aaatcatgct tatccaccag caatatctga atacggtaaa 4200 ctccgaaaaa ctaataaatc ggagtacctt aaatgccttg aagaatattc agcccctact 4260 ttgacttctc ctgaaggtat cactgcaaag gtgttagacg gaggagctat tgttcatata 4320 tttcatccgg taaattcaaa gacattcaag gagtattctg aaaatgagtt tcgagagaaa 4380 ataattaata ccacatcaac aaaacataaa gaaataaaaa ggattgacat tgtgtttgac 4440 cgttattttg aatgtagttt aaaaacccag actaggaaaa gtcgtgggaa tggtgtaaga 4500 gtacgtgttt ctgattcaac tcccatctgg aaaaactagt cacagttctt acgattggat 4560 gaaaataaaa atgagttgtt taaaatgctt gcaataaact ttgtacaagt agagacattt 4620 gacttacttt tagtaacaac gttagatgaa actgttttat ccaatttcag agattttaat 4680 aaaaaatctc accttgtaac cacgaggaag ctgacactag aattttgcta catgttaaaa 4740 atgcagttga cagtgggcac caagttgtct gtattcaaac agttgacaca gatgttgttg 4800 cgatctcaat ttccgttttt aaacaattaa cgggtatcaa gcaactttgg atagaatttg 4860 gagttggaaa actcaaacga tggcttccaa tacactatta cgcacaaaag ataggtgaaa 4920 aagcagaagc tgtttcattg tggtttgcat tcactggctg tgataccgtg tcatcatttg 4980 caggacgagg aaaaaaactt gcatggaatc tctggaacaa tggagatgaa gaaattatta 5040 gtgcattttg caggtaacct agctaaccta tgtttaaata gtataaaaag gcagtaattt 5100 tagatttgct agcttctgct gctaacttct aaaatcattt tcctataggc tgcttttttt 5160 gtgcaaattt agtccaatat gtggttttat tttatttttt agcttgtcat ctccaagcac 5220 tagagttatt tcacctgata catttctaaa gtttgagcgc tacgttattc tactttatga 5280 taaaacaagt tccactactg acctcaccgt atgccgaaaa atgctgttca ctaaaaattc 5340 acgacagagt gaagggttac cacctacaaa agatgcactc tttcagcatt taaaaagagc 5400 tgtgtttcaa ggaaggtgca tattttttta aatatttaaa tggacttatt attctattca 5460 aaaaatttta ttaaatcaaa tatttaaaaa tgatacttac aaataattat ttatttagta 5520 ttatatgggg acaatccttg gtgccacagc agattttacc cgaagctact aaatggggtt 5580 ggaaaaaaac aaataacgca atgacacctt tgtggatgac tctacctgag gcatcaaaat 5640 catgcatcga gttaataaaa tgtagatgca agaaaagttg cggaacgcaa tgcaaatgca 5700 aacgttcagg attatcctgt acagacctat gcgagtgttc tggtcagtgt tttgaataaa 5760 atgaacaaac ctcatttttg ttaacaagtt ttaaaaaaaa ttagttccat tactctgaaa 5820 tatataaata aaactttttt tatgctttta atcatatttt aattagtaat gtggtatagt 5880 tgtttgttgt ttcttaaaat tctgcttaat gcaaccttat ataatcggtt cagaagattt 5940 tatagtttta cagtcaactc agttgggggt cgccccagga ccaaaaaaat attttcagaa 6000 tggggtcttt ttttttttga attcttttaa tttatagaca acttcctgaa agtttcaagc 6060 tggtatcaca atttgaacta tttcgctaaa tatttacttt tattcccccc acta 6114 // ID BEL-15_DWil-I repbase; DNA; INV; 5068 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_DWil_; KW BEL-15_DWil-LTR; BEL-15_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5068 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 1700269 1705336. XX CC Positions [4143-4691] - Integrase core CC 'ATAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 528..2072 FT /product="BEL-15_DWil-I_1p" FT /translation="MRYSRRKFGRTFVCRSTAASNDSELMARAEPPRKLYD FT LPSFDGTPESWPMFKEAFDMTTVEYGYNNRQNLLRLQKAIVGRAREVVECL FT LIHSQNVPDVMETLKDRFGRPEQLVKSQINNVRSFPNISEDELENLVEFAT FT KVRNLSAFLDAANARHHMSNPTLMDILIGKLPVQQRLSWSNYVMSLNREVT FT IKDLNEWLQTMAKVVYGACLSTSESTSKIKKERTRMFHADQRHQTSDYMEA FT RCIMCNGSHWLRKCNGFANLNHDEKWQFVRKKGLCFNCLREGHRSDKCKVQ FT VKCEVPNCNRRHNTLLHMEKEPIVVVKPSSSKEFSGTATESKGSTLLFKIL FT PVRLHAKGHTVQCFAFLDEGSAVSLMNESLAAALHLKGPKEDLTLQWFGNK FT QVTESTSRFDVGISGTNGYTIYDLKGVRTVKNLKLPKQTINRKDLYFKNRH FT LENIPLQSYSNVEPELLIGLDNVHLGIPRKVKSSVGRGPAVVKTKLGWVVY FT GKIDDHPNAILCYNAIIFG" FT CDS 3894..5048 FT /product="BEL-15_DWil-I_2p" FT /translation="MDTRRPIILPQGSHVSKLLITHYHNKWKHQNENSIIA FT EIRRKFWIPHLRQEVRRTSRTCFICRMERARPAPPLMGQLPSDRVTPFVRP FT FSYTGIDYMGPFAVVRGRGTEKRWICIFTCLTVRAIHLELARDLTADTCLK FT VIHNFMRRRGVPIRIRSDNGTNLVGAERILRQELYKKVAAGLAIEKIEWLF FT NCPLNPHAGGCWQRLVRSVRRAMGHVLLQGNLKEDTLYSLMCEAENIVNSR FT PLTHIAIDSPTGDPITPNHFLLGTANSTQTPHPQEEDYHPTMKNLRIKQQM FT THSFWRKWVNEYLPDLCRRTKWYHSVHPLALGDIVLICDSAQHRVNWQKGR FT IIEVTTGRDGQVRSAKVKTSTGEFRRPACMLAKLDFGESKMD" XX SQ Sequence 5068 BP; 1557 A; 953 C; 1306 G; 1252 T; 0 other; atttttattt cgcggttcag ttggtttcgc ggaccaatcc acctttggac aaaggcgagg 60 atgcttctaa agaggcttgg attgatgaat tgaacgttta tgagaagcac aaggtaagtg 120 cctattagat tgttgatttg gagggatttt ctgtcggatg gtggtaacta gaaaaatggc 180 ggcaaacacg gcgtcaggtt taatcgaccc gatggttgat gaacaaaatg gcgctgttat 240 ggcggatagc gctgcgggca gcagtaatag ggacaatgag gcgccgctga aaagctcgaa 300 cgagatgact ttaatgttga gcactatggc agacatgatg aaacaaatgc aacaatttca 360 gttagaggcg acacgtacaa acgtggttca aatgcaggca tttcaagcgg aaatggtcag 420 gatgaacgaa ctacaaatgc agacaattca ggctggaatc gagagaacgg gcttactggt 480 tggtagggca tttgaacatc agcaggctag agcggcatcc cgtggggatg aggtacagtc 540 ggaggaagtt cgggcgaact tttgtatgcc gaagtacagc ggccagcaac gacagcgagt 600 tgatggcacg agcagaaccg cctagaaagt tgtatgatct tccgtctttc gatggtacac 660 cggaatcatg gccaatgttc aaggaggcat ttgatatgac aacagtggag tacggttata 720 ataaccgaca gaatttattg cgcctacaaa aggcgattgt aggccgggct agggaagtag 780 ttgaatgtct acttattcac agtcagaatg tgcctgacgt gatggaaact ttaaaggatc 840 gattcggaag acctgaacaa ctggtaaaga gtcaaattaa taatgtgcga tcatttccaa 900 acatttcaga agatgaacta gaaaacttgg ttgagtttgc tacgaaggtg aggaacctgt 960 cagctttttt ggatgcagcc aatgcacgtc atcacatgtc aaatccaaca ctaatggaca 1020 ttcttattgg aaagctaccg gtgcagcaaa ggctcagctg gtcaaactat gttatgtcgt 1080 taaatcgcga agtgacaatc aaggacttga acgaatggtt acagacaatg gcgaaagttg 1140 tatatggggc ctgtcttagt acttcggaat caacgtccaa aataaagaag gaaagaacaa 1200 gaatgtttca tgcagatcag agacatcaaa catcggatta tatggaagca agatgcatta 1260 tgtgtaacgg aagtcactgg ctgaggaagt gcaatgggtt cgcaaacttg aatcacgatg 1320 agaaatggca gtttgtacgg aagaagggct tgtgttttaa ttgcttgcgt gaaggccatc 1380 gcagcgacaa gtgcaaagtt caagttaagt gcgaagtgcc aaactgcaac agacgccaca 1440 atacattgct gcatatggag aaagaaccga ttgtcgtagt aaaacccagc agttccaaag 1500 agttttctgg cactgcaaca gagagcaaag gctcaacgct tctgttcaag atactaccgg 1560 tacggctaca tgccaagggt cacacagtgc aatgttttgc tttccttgac gaagggtctg 1620 ctgtttctct gatgaatgag agcttggctg ctgcgctcca tttaaaaggc ccaaaggaag 1680 atctgacatt acaatggttt ggcaataagc aggtaaccga aagcacaagc cgattcgatg 1740 tgggcatatc gggaacaaat ggctacacaa tttatgatct aaagggcgtt cgaacagtca 1800 aaaacctaaa gttgcccaaa caaacaataa atagaaagga cttgtatttt aaaaatcgac 1860 atctcgagaa cataccattg cagtcgtata gcaatgtaga acctgagttg ttgattggtc 1920 ttgataatgt gcacctaggc atacccagaa aggtgaagtc gtctgtaggt cgaggaccag 1980 ctgtggtaaa gactaaacta ggttgggtgg tgtatggtaa gatcgatgac catcctaatg 2040 ctatattatg ctataatgct ataatattcg gttgactcag gagtcagaac agaacaaaag 2100 cctggataag gcatggtacc tgccacactt tggagtcgtc aatccacata aacccgggaa 2160 aattagattc gtattcgatg ctgcagcgga agtcgacggt gtgtcgctga atgcactgct 2220 gatgagaggg cccgaccaat gccaaccatt aagttacgta cttatgaaat ttagacaaag 2280 gtccgtaggt atttgcgcag acatagaaga aatgtttgat acgcaaagat gaccgatggg 2340 ctcaacgcat tctatggaga aaggatgaaa caaatgagtt taaggcatat gagatgatgg 2400 taatgacgtt tggagcgaaa tgctcacccg catcagggca gtttgttaag aataagaatg 2460 catcggaata caaagacttg tttccacagg cggctgatgc tattgtgagg aatcattatg 2520 ttgatgattt cgtccactca tatggcaacg aaacagaagc aataagagtt acggaacagg 2580 tgatacagat acaccgaaag gggggatttg agttgaagaa atttgtatct aactctaaag 2640 tctgcaatga taattttggt gaaggtagca tggatgatgt tctgagcctc gataaagatg 2700 gggcattgta tagtcagaag gtacttggta tgcattggta tacggcagag gataaatttg 2760 ggttcatact ttcgtttaat aaaatcgata aatctctgtt ctctgatgaa agaattccaa 2820 caaagcggga agtactacgg atcgttatgt caatcttcga tcccttcgga atattaatcg 2880 agtatcgttg atggcaaaac tcctccttca gtctatttgg aggaaaaaga tagaatggga 2940 tcaatccata aatcaagaag agtacatcga atggaaaaga tggcttcgtt ttgttaatga 3000 aatagcttta ataagggttc caagatggta tggcacaact tttagtaatg agtgtgtaga 3060 gttacacata tttggagatg cgagcgagat ggcgtatgca gcagtgggct attggcgagt 3120 tgaagatgct gaaggatgga aagtgtcttt tataatggga aagacaaaat gcgcaccgat 3180 gaaactgtct actgtgccaa gactggagtt gcaggctgcg gttatggcga ctaggctcag 3240 aaactcgata ttagagggac atgatattca accaaaacga gtttgtatgt ggtcggactc 3300 atcaactgtt ttagcatgga tccgttcaga tcagagaaag tacaagccat acgtcgccca 3360 tcgagtgaac gaaattctag aaagctctgg cgtcgaagaa tggaattggg ttgccggcaa 3420 ggacaatcca gcggattttg gcacgaagat taaaagagac agcagacagt cactatgggt 3480 aaccgggcca gatttttatt ggatgatctt gaggaatcac cacagcagag caagggtaat 3540 tttagtactg acatggagct gcgcccaaaa ttctgtatgg tcaccacgaa gcaaacggat 3600 gatctctttg gaagattctc atcgtacacg agactcgtac ggatctggct tgggtgctac 3660 gatttagaag aaaagaagtt cgtgaaaatt acctgatatg ctcagagata gactatgctg 3720 aagtagttct gttcagaagt gtacaaaaga gttcttatcc agatgagtac agtgagctcg 3780 ccaggggtaa actcattgac aagggaagta gtcttctaaa actaacaccg gagataaata 3840 atgatggctt gttacgtgta ggcggacgta tagataaagc ttgtgccgtt gatatggaca 3900 ctagacgccc aataattctg ccacaaggga gccacgtgtc gaaactgcta atcacacatt 3960 atcacaacaa atggaagcat caaaatgaaa attcaatcat cgctgaaata cgtcggaagt 4020 tttggattcc tcacctacgc caagaggtac gccggacgtc acgaacatgc ttcatatgcc 4080 gtatggagag agctcgccca gcaccaccct tgatgggaca actgccctcg gatagagtta 4140 ctccttttgt tcgtccattt tcttacactg gaattgacta catggggccg tttgcagtgg 4200 ttaggggaag aggaacagag aagcgatgga tatgcatttt tacttgtttg actgtacgcg 4260 ccatacattt ggaattggct agggacttga cggccgacac ttgccttaaa gtcattcata 4320 acttcatgag gcggcgtgga gtcccaataa gaatcaggtc tgataacgga acaaacctag 4380 tgggagccga gcgaatatta agacaagaat tgtataagaa agtagctgcc gggctcgcaa 4440 tcgaaaaaat cgagtggctt tttaactgcc cattgaatcc acatgcaggc ggatgttggc 4500 aacgtttagt tcgaagtgta agacgagcta tggggcacgt gctactacaa ggaaatttaa 4560 aggaggatac cctctacagc ttgatgtgcg aagctgaaaa tattgttaat tcacggcctt 4620 tgacgcacat tgccatagat agtccgacag gggatcctat aactccgaat cactttcttt 4680 taggtacagc caactccact cagactccgc atccgcaaga agaggattat caccctacca 4740 tgaagaattt gcggattaaa cagcagatga ctcattcgtt ttggcgcaag tgggtaaacg 4800 aatacctgcc tgatttgtgt cgccgcacca agtggtatca ttcggtacac ccgctagctc 4860 tgggagatat cgtccttatt tgcgacagcg cacagcatcg agttaactgg cagaagggca 4920 gaatcatcga ggtgacaacc ggacgagatg gccaagtacg atctgcgaaa gtgaagacca 4980 gcactggaga gtttcgtcga ccggcatgca tgttagccaa actggatttt ggtgaatcta 5040 agatggacta gattcacggt ggcgggga 5068 // ID CR1-106_AAe repbase; DNA; INV; 5553 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-106_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5553 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1194-1194 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >96% CC identity. Sequences around 2500-2650 are unclear due to the CC small number of copies and divergence. XX FH Key Location/Qualifiers FT CDS 389..1483 FT /product="CR1-106_AAe_1p" FT /translation="MDLCQVCLNRLDSGSLINCSGACGKMFHFGCVGMTKP FT HFSSWSAKIGLFWFCESCRLNFDPAVYDREKTIMKALRELLIRTDSIDTRL FT GNYGDNLRKISKSLNGCQRQSKPDNCYQQQSSFLHSINELSLDDTMDDPIN FT RSRSCDDTSFFEVLDEINSSISGQPDKFVVGSSKRVQIIENPTSNAGTSRN FT TSRTNVSTPAAPDKRSIVPSNKPSSSQTTDGIQNPTESTVDKIVTVSASSL FT NNCGNHRDNSSKPPSIPLKVAKTTQLSNESESFYVTPFAPDETEEEVKLYV FT CEISNTHSSLVNVIKLVPRGKTADDLSFVSFKVTVSKSISNVVGDPWYWPE FT GISVRTFEPNPKNGVFSRLPRT" FT CDS join(1417..2535,2557..5355) FT /product="CR1-106_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="GNICPYFRAKSKKRSFFPPSQDLVNTNLYHPYPVLPE FT QATFPVSSGERLNSICPAEAGKDTTPFLNGESNFQYFERAVVQNRQNNDCI FT YTSNECPAMNNIQPGCHTDYPSVVFTPSVDPEDISAYHFAIRPVLSGKLYS FT GICPAVARVDTAPNLNGGSYPQLNHYVSPAIDRTPSDENRDSRVYHLAALP FT VLSGKSDSGICLAVASDDAAPNLNGDSHLQPINRSATSITQSGGHSEDSEG FT IRAYHSEHLPASSGKSNSGICLAEARQNTAPNLNGIPLLQLNDDDSTSPTI FT DPGSTSSCRVSSNKVILEMYYQNVGGMNTTLADYSLACCDTSYDMIALVET FT WLNDKTLSQQIFGPSYSVSVSDHQVHIYFFNLLVSQYDPNLDKTSKKIKNK FT RIFLYFVYRCDRSSSNSVKRSGGGVILAVSSMFKSRIIFPPSGIGVEQLWV FT AITFAGWTLYVCVVYFPPDRIHDSELIDAHLDSLTWVFNEMGINDSIMIIG FT DFNIPSIEWKLNSSGFLHPDSNRSVVHKLSCKLLDGYSTAGLAQINGVTNS FT NGRMLDLVFVSNELLHISDVFEAPAPLVKTSRHHPHLQLALKASMQSSIAT FT VEEIYYDYNRADYRSMNEFLASINWNELLFDCDANAATVRFSNVVLYSIDQ FT FVPIKIVKRSNHPAWFNSALRRLKTKKRHALTVYTKHRNARNKRRYVTING FT IYKRLNKRLYLAYQTGIEEKLKXNPKNFWQHVNKQRKEDGLPSTMVCNNVE FT ASSVGDIAELFRLHFQSVFVAEHLPDLNILNAASNVPQGPAFEQISPISTE FT MVESTGNHLKISTSPGPDGIPSIVIRKCISSIALPLATIFNQSLLSGVFPK FT NWKTSYVFPVFKKGDKRVISNYRGIASLSAASKLFEKLVLEYMMHHCSNLI FT AEEQHGFTPKRSTMSNLVLYTNSILREMESGNQTDAIYTDFSAAFDKVNHQ FT IAVAKLNRLGFNGFFLVWIKSYLTDRSMSVKIGTHLSSSFTVTSGVPQGSH FT LGPLIFLLYLNDVNLLLSCYKLSFADDFKLYLTIKTEADADFLQDQLNIFS FT DWCTSNRMILNSSKCSVITFTRKQNPIIRDYILHGTVLNRESVVKDLGVLL FT DSKLSFKDHISYVTTKASAQLGFIFRITKQFKDVYCLKTLYCSLVRSILEY FT GSVVWSPFYKNGVDRIESVQRKFLRFALRLLPWNDPLNLPNYEDRCKLINL FT DLLEVRRNVSKATFIADLLTSRINCANLLCQLNINIRSRILRTNDFLRVPG FT SRTNYCYNSPLASMCRVFNKCYNVFDFNLSRSTIKKRIKQLLCYS" XX SQ Sequence 5553 BP; 1583 A; 1237 C; 1050 G; 1682 T; 1 other; ttctgggaac actgccgctc gtagttcgtt tgtttgtttt tcttgttcga tttcataccc 60 gtttttgaca atactttcgg ctagttaaaa taccgaaaat aatagaatcg gacggtttgc 120 cataagcttt actttcgtac agaattcgtt tttgtgtgtg gctgctgtac tagtgatatt 180 ttgtgccatc aaattgctat ttgtttaccc gctacaacac ccactttcgc tcatttttcc 240 tttcgaagcg ccatctgcca gattttattg gaaactgtgc aagatattgc atacaagcga 300 ctttgcaata ttcgttcaac cgctccaggt gtctgcatac ccaaaggcgt tacaactagc 360 ttctcgcact tgcagaaatt tcggaaggat ggatttatgt caagtttgct taaatcgctt 420 ggactctgga agcctgatta attgcagcgg agcatgcggc aagatgtttc attttggttg 480 cgtcggaatg accaaaccgc atttctcctc atggtcagcc aaaattggat tattttggtt 540 ctgcgagtcc tgtcgtttaa actttgatcc agcagtttat gatagagaaa aaacgatcat 600 gaaggctcta cgtgaattgc tgattcgtac tgattccata gacacgagac ttggaaatta 660 cggcgataat cttcgtaaaa ttagcaaaag cctaaacgga tgtcaaagac aatcgaagcc 720 agataattgt taccaacaac aatcctcgtt tttgcatagc atcaatgaac tctctttgga 780 cgatacaatg gacgatccaa tcaatcgctc tcgatcttgt gatgatacat cattcttcga 840 ggttttagac gagatcaata gttctatctc gggccaacct gataagtttg ttgttgggag 900 tagcaaacga gtgcagataa tagagaaccc aaccagcaat gccggaacca gtaggaatac 960 atctcgaaca aatgtgtcta cccctgctgc tcccgacaag cgatcgattg tcccttcaaa 1020 caaaccttca tcttctcaaa ctaccgatgg aatccagaat ccgactgagt ctactgttga 1080 caagattgtt actgttagcg caagctcgct gaacaattgt ggcaatcatc gtgataactc 1140 gtccaaaccg ccttcaattc cactgaaagt tgcaaaaacc actcagctat ccaatgaatc 1200 agaatcattt tatgtgacac catttgcacc ggatgaaacc gaagaggaag tgaagctcta 1260 tgtatgcgag atttcaaata ctcactcgtc attagtaaat gtgattaagc tcgttccacg 1320 cgggaaaact gccgacgacc tttcatttgt ctcattcaaa gtaacggtta gtaaatcgat 1380 ttcgaacgtg gtcggcgatc cgtggtattg gcctgaggga atatctgtcc gtactttcga 1440 gccaaatcca aaaaacggag ttttttcccg ccttcccagg acttagtgaa caccaattta 1500 tatcacccgt atccagtttt accggaacaa gcgacgttcc ctgtttcgtc aggtgagcgt 1560 ttgaatagta tatgtcccgc cgaagccggg aaagacacta caccctttct gaatggagaa 1620 tccaactttc agtattttga acgtgctgtt gttcaaaacc gacaaaacaa tgactgcatc 1680 tacacttcaa acgaatgtcc agctatgaat aatatacaac ctggttgcca caccgattat 1740 ccttctgttg tattcacacc atccgttgat cctgaagaca tcagtgcgta ccactttgcc 1800 atccgtcctg ttttgtcagg taagctctac agtggtatat gtcctgccgt agccagggtc 1860 gatactgcgc caaatttgaa tggaggttcc tatccgcagc tcaaccacta cgttagtcct 1920 gcaatcgacc gtacgccatc cgatgaaaac cgtgattccc gtgtatacca cttagccgcg 1980 cttcctgttt tatcaggtaa gtctgacagt ggtatatgtc ttgccgtagc tagcgatgat 2040 gccgcaccaa atttgaatgg agattcacat ttacagccca tcaaccgttc tgccacttcc 2100 atcacccaat ccggcggcca ttctgaagac tcggaaggca tccgtgcata ccactctgaa 2160 caccttcctg cttcgtcagg taagtcgaac agtggtatat gtcttgccga agctagacag 2220 aatactgcac ccaatctgaa cggaattcct cttcttcagc tcaacgacga cgattctact 2280 tcccctacta ttgatcccgg atctacttca agttgtcgcg tgtcatccaa caaagttatc 2340 ctggagatgt actaccaaaa tgtcggcggg atgaacacaa ctttggctga ttactcttta 2400 gcctgctgcg atactagcta cgatatgatc gcattggttg aaacatggct caatgacaaa 2460 acactctctc aacaaatttt tggtcctagc tattccgtat ccgtttccga ccatcaagtt 2520 catatttatt ttttttgaaa ctgttacgtg tcgtaaaatt tgttagtttc ccagtatgat 2580 ccgaaccttg acaaaacatc aaaaaaaata aaaaataaaa gaattttcct ttattttgta 2640 tatcgatgtg accgttcatc ctccaatagt gtcaaacgtt caggaggtgg agttattcta 2700 gcagttagct ccatgtttaa gtctcgcatc attttccctc cttctggcat cggagttgag 2760 caactttggg ttgctattac attcgcagga tggaccttgt acgtatgcgt agtttacttt 2820 ccgcccgacc gtatccatga ctccgaactt atcgatgctc atctagattc gctcacctgg 2880 gtattcaatg aaatgggcat caatgacagt atcatgataa taggtgactt taatattccg 2940 tcaatagagt ggaagctaaa cagttcaggc tttttgcatc ccgattctaa tcgatctgtt 3000 gttcacaaac tttcctgtaa attacttgac ggttacagca ctgccggttt ggcgcaaatc 3060 aacggagtga caaatagtaa cggccgaatg ttggacctcg tctttgtcag taacgaactt 3120 ttgcatatct ctgacgtctt tgaagcccca gcgccattgg tgaagactag caggcatcat 3180 cctcatctac aactcgctct caaggcatct atgcagagct caattgcaac ggtggaagaa 3240 atctattacg actataacag agccgactat cgatccatga atgaatttct ggccagtatc 3300 aactggaacg aactactgtt tgattgcgac gcaaatgctg cgactgttcg tttctcaaat 3360 gtcgttctat attcgattga tcaattcgtc cctattaaga tcgtgaagcg atctaatcat 3420 ccagcttggt tcaattctgc tttgaggaga ttaaaaacta agaaacggca tgccctaacg 3480 gtttacacaa aacaccgaaa tgcccgtaac aaacgtcgct atgttacaat caatggcatt 3540 tataaacgtc tcaacaaaag gctttatctc gcttatcaaa caggaatcga ggaaaaactt 3600 aaatktaacc ccaaaaattt ctggcagcat gtgaacaaac aacgcaaaga agatggtctt 3660 ccgtctacca tggtgtgtaa caacgtggaa gcgtcctctg tcggagacat tgcagaattg 3720 tttcgcctac actttcaaag cgtattcgtt gctgaacacc ttcccgatct caatatccta 3780 aatgcggctt ccaatgttcc tcaaggtcct gctttcgaac aaatatctcc tatttccact 3840 gaaatggttg agtctactgg aaatcatcta aagatatcaa cgagtcctgg tccagatggg 3900 attccttcaa ttgtaatccg taaatgcata tcatcgatag cattaccttt agcaacgatt 3960 ttcaaccaat cgttgttaag cggtgtattc cccaagaatt ggaaaacatc ttatgtattt 4020 cctgttttta aaaaaggaga caaaagagtt atatcgaact atcgtggaat cgcctcctta 4080 agtgccgcct ctaaactgtt tgaaaagctg gttttagaat acatgatgca tcattgctcc 4140 aatttgattg ccgaagaaca acacggtttt acaccgaaac gatcaacgat gagtaactta 4200 gttctctaca ccaactcgat tctccgtgaa atggagagcg gaaaccaaac tgatgccatt 4260 tatacagact tttcagccgc tttcgataaa gtaaaccatc aaattgcagt agctaagctt 4320 aatcgtctcg gcttcaacgg tttcttcctt gtatggatca aatcgtacct cacggatcga 4380 tccatgtctg ttaaaatagg aacgcacctc tcatcttcgt ttaccgtgac atcaggagtc 4440 ccacaaggaa gtcatcttgg gccgctgatt ttcttgttgt acctcaacga cgtgaatctt 4500 cttctgagct gttacaaact gtcattcgct gatgatttca aactctattt aactattaag 4560 actgaagctg atgcagactt tcttcaagat caactcaaca ttttttcgga ttggtgcact 4620 tcaaatagaa tgattttaaa ttcttcaaaa tgctcagtta taacatttac gcgcaaacag 4680 aatcccatta tcagagacta tatcttacat ggtaccgttt tgaacagaga atctgtcgta 4740 aaagatctgg gagttctatt agactcgaaa ctttctttca aagatcatat ttcttatgtg 4800 accacaaaag cttcagctca attaggtttt attttccgta ttactaaaca gtttaaagat 4860 gtgtattgtc tgaaaacttt atactgttcc ttagttcgct caatccttga atatgggtcg 4920 gttgtttggt ctccttttta taaaaatggt gtcgacagga ttgaatcagt tcaaagaaaa 4980 tttttaagat tcgctttaag attattgcct tggaatgacc cgctgaacct cccgaactac 5040 gaagatcgct gtaaactaat aaacctagat ctccttgaag tgcgacgaaa tgtatcgaaa 5100 gcgacattta tcgctgacct gctgacttct cgtataaact gtgccaattt gctgtgccaa 5160 ttgaacatca acatccgtag tcgaattttg cgcaccaacg actttcttcg cgtaccggga 5220 tcgagaacta attactgtta taattctcct ttagctagta tgtgtcgtgt attcaacaag 5280 tgttataatg tgtttgattt taatttgtct agatctacta ttaagaaaag aattaagcaa 5340 ttattgtgtt attcgtaaga tttgaatgtt atttaacaga tatttgttta ttagttttaa 5400 gtcaattgta tcatttggat ttgttaatct gttgatgcga aaagatgaga aggttttatg 5460 cctatttgag agagagttca tttttaagct caactcaaac gggcttttcc ctactcctaa 5520 aataaaataa aataaaataa aataaaataa aaa 5553 // ID DNA8-2_TCa repbase; DNA; INV; 1185 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-2_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1185 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 669-669 (2009). XX DR [1] (Consensus) XX CC 8bp TSD. Unclassified. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1185 BP; 381 A; 190 C; 207 G; 407 T; 0 other; accagtgtga ggaaatttta tggggtggcc atgaggccac aaccgatcag ctgactcgtc 60 gtagcgctct cgcgatgttc atatttatcg atgaaatggt ccgttatagc gcgtgcattg 120 gcgtagttta ctaggggttt atacaattaa tgttaccggt tgattcgtca gaaacgttac 180 gatttgtata tttttttaaa aaggacttta aagaaggatt atttagttat ctacaaaatg 240 tgctcgaggt gaaatcgcag cgtatttccc tttgaaacgt ggtgtggttg aaaacatttc 300 aagggctaaa aatggtaaat aattaaaaaa atattaatgt cacaccaaag aatactttga 360 aatgaattac aatttcgtaa tttagttcta ctgggtcact tttatgcttt ttaaataagt 420 acatgagata aaaatgttct tttgactttt tagatattat taaaattaac agagagttag 480 ggcggttaaa agtaaaggta taatattaat agataagtga cattaattat tttttaatta 540 tttacgaatt acgattttta gctctcgaaa tgttatcaaa ggcgatgaac acgcttcaga 600 gggaaatacg gttcggtttc acctcaggcg cattttgcag taattttaaa ttgcaataac 660 ttggtgagtc ttacagataa cgaaatgatt cttgcaccaa aaatttattt caagtattag 720 ctttatttcg tctcgattat gtgggagtct atcttttaaa ttaaaaaact ttctcgcatc 780 tcgaaaattt cactacacaa ttatgcaacg ttacaagaaa atgtaatttt tttggtttcc 840 ctttatcaaa actcaacctt gttaaactac gtataaatct gcatttattg aaaatttgtt 900 tgcaaaaaaa acggagtttc tagctttatc cagaaggaag atatgagtat gtaaacgcaa 960 aatgtgtcca attttaggtg tttgcaaatt tttagctaaa acgtcttttc agccccttcg 1020 cgagggttga aacaaaaaat tctttgtcta aacatttttt aatactcttt cctaacttat 1080 atcaacaatg caagctgttc cacaacattt cgatttaaag gtttatctga ctgcaagcgc 1140 gcgcccatgc actaaccttc gcccataaaa tttcctcaca ctggt 1185 // ID Gypsy-94_AA-I repbase; DNA; INV; 4805 BP. XX AC supercont1.342; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-94_AA_; KW Gypsy-94_AA-LTR; Gypsy-94_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4805 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.342; Positions 962015 957211. XX CC Positions [2198-2698] - Reverse transcriptase CC Positions [3819-4289] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 617..3571 FT /product="Gypsy-94_AA-I_1p" FT /translation="MALNGLTTMIEPYRKGSSFCDWVDRLAYCFQANAVSD FT EMKKAHLITLGGTVLYSELKLLYPNGALDTAVYADIATKLKARLDKIEPDV FT IQRVQFNSRIRQAGETLEDFVLALKLQAEFCGFGDFKNMAIRDRIVAGVND FT KALQQRLLNEEKLTLETAEKIITTWQIAETNAKALGSRENSWDQIASLKMA FT GANAVGTSMRKLARTYELARQDENSRGSVKNRLGYRQWPVRQGVYQRQPTD FT WGKLRSLNTGEWRNPEHKRLSRFANFTCDYCGIKGHIKKRCFKLKNWKRMN FT MVEDQQAPAVDEDLHDMINRMTTKDSDTEDDDMDVGESENEEIECMNITSI FT NKINNPCLLELTIEGKLLSMEIDCGSAVTVINKRQYFSLFDKPLQKCNKNL FT AVVNGSKLDVLGEAKVFVSFQGKEAILKLLVLDCCNNFIPLLGRPWLDVLF FT PNWRIFFSNQVVINNMVIDNNEVLISDLKQQFSKVFVKDFSTPIVGFQADL FT VLKTEQPIFKKAYDVPYRLKDKVDDYLDRLEREKVITPIQTSQWASPVIVV FT IKKNNQIRLVIDCKVSINKMIIPNTYPLPTAQDLFAKLAGCKLFCALDLEG FT AYTQLELTDRSKKCMVINTTRGLYTYNRLPQGASSSAAIFQQIMDQILHGI FT DGVYVYLDDVLIAGKNFEDCLSKLRIVLAKLSKANIKVNLDKCRFFVELDY FT LGHVISEEGLKPCPDKISTIKNANVPKNVSELKSFLGLINYYQKFIPHLST FT KLFHLYNLLKKDVKFVWNSNCSKAFEDSKNALLATDLLEFYDSNKPLVIVS FT DASGYGLGGVIAHVIDGTEKPISFTSFSLDKCQKSYPILHLEALALVCTIK FT KFHKYLYGKKFFVFTDHKPLVGIFGKEGRNSIFVTRIQRYVLEQSIYDFEI FT RYRPSSKMGNADFCSRFPLPQSIPNELHTDFVKSLNFSKNVPLDHEKIANA FT SKNDEFMCKVISFMKHGWPDRLNKRFCEST" FT CDS 3687..4805 FT /product="Gypsy-94_AA-I_2p" FT /translation="MKRLARQSVYWFGINSQIEEFVKKCDVCNSMASILKQ FT KIDSEWIPTTRPFSRVHVDFFHFSSHTFLLIVDSYSKWVEVEWMKTGTECG FT KVLKKLVTFFARFGLPDVLVSDGDPPFNSFAFTLFLENQGIKVLKCPPYHP FT ANNGQAERFVRTVKDVLKKFLLEPEHCELELEDQINLFLINYRNNSLTADG FT HFPSERVFAYPPKTLIDLLNPKKHYRKFLLKPAKTSNEDNLLEVVSRTKRK FT NPLDDLMEGDVIWYQNHKPNHPAKWLEATYLKQISQNLFQVLIGNVVATVH FT REQIRTSGGSDREMCPNVVLTKQRAIDVTDGPGNCVSESLSDDVNKNRSDK FT YYQEEKFSGKRKKPADWVDEPPRRSKRTLF" XX SQ Sequence 4805 BP; 1538 A; 815 C; 1052 G; 1400 T; 0 other; caagttgacg acgaggcaac cggacatcaa tacaccgtag ccgtaagtaa atgctgtcag 60 ttgtgaggca gctcagttga attgtagacg gcttggtgaa cggtgcgaaa cgcgtgcaat 120 cggcaaaacc tccttttaac cggcgattgg tgagcgtcca gacggaaagt ttacggaagt 180 tattttttat tctcaaaaga aaattcgacc gtggttcttt tcgggtgttt caaattgagc 240 tggtttggaa ataaaagcat cagtgagcat ccatcaagct acggagggca cacagcccca 300 cgcaaattca gtagtgaaaa tacgacatca ggatcggtct ttgtgattgt agcgccaagg 360 agggttcaac gtagttgagt ttaccagctt tggttgaccc agcatctttt cgaacgagat 420 aaccagaagg atcgttcaac cagtatcaac atcaaggcca ttgaccttag ttggttcatc 480 gaccggttgg aacgcaagtg agtgagcagt aaattgtgtt ttaccaattg tgtaatttac 540 tccattttgg ggtctttatt ggtgcctttt gttagtatcc ttttatttga attttagatt 600 gttattcttt ctaacaatgg cgttaaatgg attgactact atgattgagc catacagaaa 660 aggctcttcc ttttgtgatt gggtagaccg gcttgcctat tgctttcaag ccaacgcagt 720 ttccgacgaa atgaaaaaag ctcatttgat aacattaggg ggaacagttt tgtattccga 780 acttaaatta ttatatccta atggggcttt agataccgct gtgtatgcgg acattgcaac 840 caaattgaaa gccaggcttg acaaaattga acctgatgtg attcagcgtg tccaatttaa 900 ttcccgaatt agacaagctg gtgagacttt ggaagatttt gttttagccc tgaagcttca 960 ggctgagttt tgcggcttcg gcgatttcaa aaatatggca ataagagatc gcatcgtagc 1020 tggcgttaat gacaaagccc tccaacaaag gcttctaaac gaagaaaagc tgacgttgga 1080 gacggctgag aaaattataa cgacatggca aattgccgaa actaacgcta aagctttggg 1140 aagtcgggaa aacagttggg atcaaatagc ctctttgaaa atggcaggtg ctaatgcagt 1200 tggaacttct atgaggaaac ttgctagaac atatgagctg gcaaggcaag acgaaaattc 1260 cagaggttct gtaaagaatc gattaggata tagacaatgg ccagtacgtc aaggggttta 1320 ccaaagacaa ccaactgatt ggggtaaatt gaggtcattg aacacaggcg aatggcggaa 1380 tccagaacat aaaagactat cgcgctttgc caacttcaca tgtgactact gtggaattaa 1440 gggtcatatt aaaaaacgat gctttaaatt gaagaactgg aagagaatga acatggtgga 1500 ggatcagcaa gctccagcag tggatgagga tttgcatgac atgatcaata gaatgaccac 1560 caaggactcg gataccgaag acgacgacat ggacgtaggt gagagcgaga atgaagaaat 1620 tgaatgcatg aatattactt ctattaacaa aattaataac ccttgtttgt tggagctgac 1680 tattgaaggc aaattactta gtatggaaat cgattgtggg tcggcggtga ctgtaataaa 1740 taaacgacaa tacttttcgt tgtttgacaa acctttgcag aagtgcaata aaaatttggc 1800 tgttgttaat gggtccaaac tagacgtgct tggtgaagcg aaagtttttg tcagttttca 1860 aggaaaggaa gcaattttga aacttttagt gctagattgt tgcaataatt ttattccatt 1920 attgggtcga ccttggctgg acgttttgtt ccctaactgg cggatttttt tttctaatca 1980 agttgtgatc aacaatatgg tcattgacaa taatgaggtg ttgattagtg acttgaaaca 2040 acagttttct aaagtttttg ttaaggattt ttcgacacct attgtaggct ttcaagctga 2100 tctagtgttg aaaacggaac aaccaatttt taaaaaagcc tacgatgtac catacagatt 2160 aaaagacaag gttgatgatt acttggatag actagaaagg gagaaggtta taacaccaat 2220 acaaactagt caatgggctt ctccggttat cgtggtaatt aaaaagaaca accaaataag 2280 acttgtaata gattgcaaag tttcaataaa taagatgata ataccgaata cttacccttt 2340 accaactgca caagatttgt ttgcaaaatt agcaggatgt aaattgtttt gcgctcttga 2400 ccttgaaggt gcatacactc aattagaatt gacagatcgt tccaagaaat gtatggtcat 2460 taatacaaca agaggtttat atacttacaa tcgactacct caaggagctt catcaagcgc 2520 tgccatattt cagcaaataa tggaccaaat attacatggt attgacgggg tttatgttta 2580 tctagatgac gtactgattg cgggaaaaaa ctttgaagac tgtctttcta aattgcggat 2640 agttttagcg aagttgtcaa aggcaaatat taaagtaaat ttagacaaat gcagattttt 2700 tgttgagttg gactatcttg gtcatgttat tagcgaagaa ggtttaaaac catgtcccga 2760 caagatttct acaataaaaa acgcaaacgt gccaaaaaat gtaagtgaat tgaaatcttt 2820 tttgggtttg attaattatt accaaaagtt cattcctcac ctctctacaa agctgtttca 2880 tttgtataat ctattgaaga aggatgtaaa atttgtttgg aatagcaatt gcagtaaagc 2940 gtttgaggat tctaaaaatg cacttttggc aacagatctg cttgaatttt atgactcaaa 3000 taaaccactt gttatagtct ctgatgcatc agggtatgga ctagggggtg tcattgccca 3060 tgtaatagat ggaacagaaa agccaataag ttttacctcg ttttctcttg ataaatgtca 3120 aaaatcctac cccattttgc atctggaagc tcttgcattg gtctgtacta ttaaaaaatt 3180 tcacaaatat ctctatggga aaaagttttt cgtttttaca gatcataaac cattggtagg 3240 aatatttggt aaagagggac gaaattcaat atttgtcaca agaatacaaa gatatgtttt 3300 ggagcagtcc atatacgact ttgagatccg ctatagacct tcgtctaaaa tgggaaacgc 3360 agatttttgc tcccgttttc ctttgcctca gtcaatacca aatgaattac atacggattt 3420 tgtaaagagt ttaaacttca gtaaaaacgt accgctggac catgaaaaaa ttgcaaatgc 3480 ttccaaaaat gatgaattta tgtgtaaagt tatttctttc atgaaacatg gttggcctga 3540 tagattgaac aaacgttttt gcgaatcaac atgaattgga gatagtagat gaatgtttat 3600 tgtatcagga tcgcgtggtc atcccagcgg tattacaacc gcatgttctg aatctcttac 3660 atggtaacca cgcaggaatt gtaaaaatga aaaggttagc aaggcagtcc gtttattggt 3720 ttggcataaa ctcgcagata gaggaatttg tgaagaaatg tgatgtgtgt aacagtatgg 3780 catccatttt gaaacagaaa attgattcgg aatggattcc cacaacaagg ccttttagca 3840 gagttcatgt ggactttttc catttttcaa gccacacgtt tctattgatt gttgacagtt 3900 actcaaagtg ggtggaagta gaatggatga aaactggcac tgagtgtgga aaagtcctca 3960 aaaaattagt tacatttttt gccaggtttg gtcttccgga tgtgttagtt tcagacggag 4020 atcctccatt caattctttt gctttcacac ttttcttgga aaaccaaggg attaaggtgc 4080 taaaatgccc tccataccat ccagcaaaca acggtcaggc ggagcggttt gtgagaacgg 4140 tgaaggacgt tttaaaaaag ttcttattag aacctgagca ttgcgaactt gaattggagg 4200 accagattaa cttattttta atcaattata ggaataattc attaacggct gatgggcatt 4260 tcccatccga aagagtgttt gcttatccac ctaaaacatt gatcgacttg ctgaatccaa 4320 aaaaacatta taggaagttt ttactaaaac cagctaaaac ctctaatgag gataatctac 4380 tggaggtcgt ttccaggacg aaacgaaaaa atcctctaga tgatctgatg gagggtgacg 4440 tgatctggta tcagaatcat aaacccaatc atccagcaaa atggttggaa gcaacttacc 4500 taaagcaaat ctctcaaaac ttattccagg ttctaattgg aaacgttgtg gccacggtac 4560 acagagaaca gattaggacg tcaggaggtt cagacagaga gatgtgtcca aacgtggtgt 4620 taacaaagca gagggcgatc gatgtaaccg atgggccagg caattgtgtt tccgagagtt 4680 taagtgatga tgtgaataaa aaccgtagtg ataaatatta ccaagaagaa aagttttctg 4740 gtaaaaggaa aaaacctgcc gactgggtgg atgaaccacc tcggcgttct aaaagaactc 4800 tcttt 4805 // ID BEL-145_AA-LTR repbase; DNA; INV; 515 BP. XX AC AAGE02022721; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-145_AA_; KW BEL-145_AA-I; BEL-145_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-515 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022721; Positions 8409 8923. XX SQ Sequence 515 BP; 180 A; 89 C; 113 G; 133 T; 0 other; tgtacagttg taataatcac cagaaatctg atcggatgac cccttgatca tcgctgtgga 60 acctattttt tgaccacagt atatcagatc atgggaaccc aatactatgt tcgatctcta 120 atacagtggc caatgtgtac gaaacgtgta ggaagagaag aaaaaatgca cttttgaaga 180 ggtcaaggga cggagaaata gaatttaagt tagtcataag atgccaatcg cttagagccg 240 ttcggtcatg ggaccaaatt gtaactagaa ttagtgtaat aaatgttgaa tgttagtgta 300 aagtaaattg aatgtttaac gtgtaataaa gtgttatgta attgtaaata aagtgtgtat 360 ggaataaagt gtatagtgtc ttcaataaaa actttggaac cttgactcac ccgagcaagc 420 aagacagccc aatacttggc ctaatcccga ggaaggagcc tcccgaatcg cctgaaatcc 480 acccaaggta aaggaaccag taacaaggtg caaca 515 // ID Gypsy-82_CQ-LTR repbase; DNA; INV; 349 BP. XX AC AAWU01003460; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-82_CQ_; KW Gypsy-82_CQ-I; Gypsy-82_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-349 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 544-544 (2011). XX DR Genome; AAWU01003460; Positions 388 40. XX SQ Sequence 349 BP; 105 A; 72 C; 78 G; 94 T; 0 other; tgtggcgata agaccactca gaggttcttg aaccatacag gttctggaac cttcatctaa 60 ttgtttgatc ggaacattcc acaaggtgtg gaatgcataa tgtaaagttt gtcccgtgac 120 cttatctgaa gaaggttcgt tgcacctcca aataagagag agtgcattag tcttatctct 180 atgcttaggc taatagaccg gtcaaacaac cagagggcta tgaccggtcg cgttcctgac 240 actgttagga acgcatataa ataggtaaga aatttgaaat aaagtcagtc gaggttgaaa 300 ctgcaaaccg tctttagtac atggaattat cacaggcccc tgtctcaca 349 // ID Penelope-11_HM repbase; DNA; INV; 6338 BP. XX AC . XX DT 26-JAN-2009 (Rel. 14.02, Created) DT 26-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6338 RA Bao W. and Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 449-449 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 598..2505 FT /product="Penelope-11_HM_1p" FT /translation="MPQWAGKLNANYNKQLKEKINITRTNMQNKTITNISG FT TQLTKNELTILNKGLKFIYSPQEPDFDLYTKQIKKFKRHIYCKMFFNKQHP FT TSQNPTKKDRTLKRPNPNWNPLADKNMKLKRYMTIIDRETERIMKDPTYTN FT ANNTTINERKALYKLRRNKSITIKKTDKGGGICILDKKTYEEKILQLLEDK FT NTYDELPNDTTEMVTDKIINEIIMMRNARAIPEKVANFILPNTPSRTPLFY FT GLPKIHKQGTPLRPIVSGCDGPTDNLSEYVVKYLQPMAETLPAYFRDTTHL FT LRLLSDINSPESPITLITADVTSLYTNIPHNDGIQTIKNFITEHLHTIKFP FT PELPPIIPTRHFCHLIELILKNSSFMFGDRAFRQKFGTSMGTRMAPPYANI FT FMSTFDKTIHNKFKNSILLYKRFIDDILIIFTGTTQQTEELTTYANTLHND FT IKFTFNTSNDKINFMDITLQINKNNNTLTSKLYRKPTDTLSLLNFHSNHPR FT HQKIGIIYSQALRLNRLISDEDELNKELKNLTITLVTKNYPLNVINHHISR FT ALLKTQTELITQSKPLKLQDNDDTTSNQIPIILPNDNIGRELAQMITKHWA FT IIKNDPDLNTILKPALLKVLSNHKSLNDLLISTRHKA*" XX SQ Sequence 6338 BP; 2438 A; 1340 C; 847 G; 1703 T; 10 other; ttgcacttat ataatatatt tctttcttct cttaagactc ttacggcctg atggttccat 60 atcagtggtg gaactttaat agcttccact gcgaaacaat agcagattta atccctctcc 120 cgtcccacta gtgggaacgg gatgcggaac ttgggcaaca acttcttcgg acaaataact 180 ttatacacat cytgcacttt ttgtatctta ttttacacca tctttttaca ctctttctca 240 actttctaat tcaactcttt tatgaaaacc agacttacgt aaaacccaaa cgaaacttat 300 aacacaaaag gaaccactaa aattacaaga tagcaatgac acttaatact actacgggac 360 actaataagt acacctaata ctaccacgaa cactaaacac cttatacaaa catctaaaac 420 actaataaca tttaaaacta ccaccacgaa cacgacacta ataacacgat attaataatg 480 ccattaatta catatctttc accttagctc atatcactgg cagaaggaaa ccgtaactta 540 tacacaacta gcaacagcca tagaaacttc aaacccctct cttacaaaaa attattcatg 600 ccacaatggg ccggaaaact aaacgcaaac tacaacaaac aacttaaaga aaagattaac 660 atcactagaa ccaatatgca aaacaaaaca attacgaaca tctcaggcac ccaattaaca 720 aaaaatgaac ttactatact aaataaagga cttaaattta tctactcccc acaggaacct 780 gactttgatc tatatacgaa acaaataaaa aaatttaaac gacatattta ctgtaaaatg 840 ttttttaata aacaacaccc aacatcgcaa aacccaacga aaaaagaccg cacgctcaaa 900 cgacccaacc ctaattggaa cccccttgca gataaaaaca tgaaactcaa aagatacatg 960 accataatag acagagagac ggaaagaatc atgaaagacc caacctacac aaatgcaaat 1020 aacaccacaa tcaatgaaag gaaagctctg tataaactaa gaagaaacaa atccataaca 1080 atcaagaaga cagataaggg cggtggtata tgcatattag ataagaaaac atatgaagaa 1140 aaaattttac aacttttgga agacaaaaac acatacgacg aactaccaaa cgacacaaca 1200 gaaatggtaa cagacaaaat aataaacgaa attattatga tgaggaatgc acgagcratc 1260 cccgagaaag tagccaactt catactaccy aacacaccga gtcgcactcc cctattttac 1320 ggactaccca aaatacacaa acaaggaaca ccactacgac cgatagtttc tgggtgcgat 1380 ggaccgactg ataacctctc tgaatatgtg gtaaagtacc tgcaacctat ggcagaaaca 1440 cttcccgcat acttcaggga caccactcac ttgcttagat tactatctga catcaactcc 1500 cccgagagcc cgataacact gataactgcc gacgtaacat cgctatacac taacatacca 1560 cacaatgacg gcatccaaac tatcaaaaac tttataaccg aacatttaca taccatcaaa 1620 ttccctccmg aactaccacc catcataccg actagacatt tttgccacct tattgaactg 1680 atacttaaga acagctcgtt tatgtttggc gacagggcrt tccgccagaa atttggcacc 1740 tccatgggta ccagaatggc accgccctac gccaacatat tcatgagtac ttttgacaar 1800 acaatacaca acaaatttaa aaacagtatt ttactgtaca aaagattcat cgacgacata 1860 ctgatcatat tcacgggcac tacacaacaa acggaagaac ttaccacata cgccaacacc 1920 ctacataacg acatcaaatt tacattcaac acctccaatg acaagatcaa cttcatggac 1980 atcactctac aaataaacaa aaataacaac acactcacct caaaacttta cagaaaacct 2040 acagacaccc tgtccctcct aaactttcac tcaaaccatc ccagacacca aaaaataggc 2100 atcatctaca gccaagcact acgactcaac agactcatct ccgatgaaga cgaactaaac 2160 aaagaactca agaaccttac aattacactc gtcactaaaa actacccact taacgtcatt 2220 aaccaccaca tctccagagc cttacttaaa acccaaacgg aactaataac ccaatcgaaa 2280 ccactaaaat tacaagacaa cgatgacacc acatccaatc aaatccccat catactcccc 2340 aacgacaaca tagggaggga actagcacaa atgatcacaa aacactgggc tattatcaaa 2400 aacgaccctg acctaaacac aatactgaaa cctgcactac ttaaagtact atccaaccac 2460 aaatcattaa acgacctact tatatctaca agacacaaag cttaaacaca cacaaaacta 2520 caaacaaacg accactacca ccaccatcca agtacgacac aaccatgact gctccccata 2580 acctgaccaa gacacaatga atgagaggaa tacaactgta catgaaatga cagaataata 2640 catcaacagg aaaaatctga atatatttac tttgcgaata aaggtttttt aaaaaaacat 2700 caactagaaa tacacccaca taaaatactt gtaaacataa tttacttata aatagaaata 2760 cttataccag accattgtat ttagttataa taatacagac acctaaatca tacacatctc 2820 cctacctcaa tacatcccct aataacacta atgttaataa tccccataat cccctccttc 2880 attaacaaat gattacaaaa cattacaaaa atgatcacca cgaccaaaca taatagaaac 2940 gaatatccta aattataaac gtaaatatga cttaactatt gatcctgata taaaaacata 3000 acagatgttt tgaacacaat gaatacattg cacataagaa atataatcgg ctatccaact 3060 gtctcttcga ctctaggaag gtgtattaaa ctaaatctgc gatatcccac ctatcttaca 3120 aacagataaa acaactaagt ttatcgcgcg tgacagcagc tattgtacta aaaaactaat 3180 acaaatacaa cctaaagaaa aattactaaa aacttcataa atgaaaatac taataaaaac 3240 aatttgattc atttttaatt agaaaaactt tattattaac actaacgcaa gtaaaaccta 3300 aagtataatt attttgttaa atccatatat gaaaaaattt gttagcaaat atgcttcgcg 3360 cagtttaaac ctttatcgcc taaaaaacat ttatgctgct aactaatttt caaaaaatct 3420 aaaaagtgta tatattgtta atgtaaattc caagtattga ttatttaata acaaagaatt 3480 acttatttat attaatatca ttctatagtt ttataaaaaa gaatgatgga acagacagag 3540 tggggttgat cggcaataac aattgtcgct gataggttaa cgatttgagg aggtcggatt 3600 tctgactggc ccttcgctgg ctgattcagt gacccttctt cgttgctctc catctcacaa 3660 gtctagtttc atgatattct tgaaatttaa cggtgatcct gcatcgttga ggtatcagca 3720 tagaagccta gttgccaata agttaacgat ttgaggaggt cggttctccg actgaccctt 3780 cgttggctga tgtggcgacc ctctctcgat gtttttatct taactttcaa gatccagcta 3840 aaacaatgag cagcccaaac acataaatat ttaacaccag ctacacattc aaataaacta 3900 aaagaatttt taaaaaaatt caaatgaaaa atgacgtaaa aagttaaaaa agaaacgtaa 3960 cttaaaaaat tctttgtaaa aaaagcaact ttactattaa cactaacgca agtataacct 4020 aaagtayaat tattttgtya aatccatata tgaaaaamtt tgttagcaaa tatgcttcgc 4080 gcagttaatt aacttttatc gaacttaaaa aacatttatg ctgctaactt aattttcaaa 4140 aaatttaaaa agcgtatata ttgttaattt aaattccaag tattggttgt ttaaaaacaa 4200 agaattattt atttcaatta aaatcattcc atttttataa aaaagactga tggaacagac 4260 aaagtgaggt tgatcagcaa taacacttgt cgctgatagg ttaacgattt gaggaggtca 4320 gttttttgac tggcccttcg ctagctgatt cagtgaccct tcttcgttgc tctccatctc 4380 acaagtctag tttcatgata ttcttgaaat ttaacggtga tcctgcatcg ttgaggtatc 4440 agcatagaag cctagttgcc aataagttaa cgatttgagg aggtcggttc tccgactgac 4500 ccttcgttgg ctgatgtggc gaccctctct cgatgttttt atcttaactt tcaagatcca 4560 gctaaaacaa tgagcagccc aaacacataa atatttaaca ccagctgcac attcaaataa 4620 actaaaagaa tttttaaaaa aattttaaaa gtttttatat tgttaattta aattccaagt 4680 gttggttatt taaaaacaaa gaattattta tttcaattaa tatcattcta tttttataaa 4740 aaagactgat ggaacagaca gagtgaggtt gattagcaat aacacttgtc gctgataggt 4800 taacgatttg aggaggtcgg wtttctgact ggcccttcgc tagctgattc agtgaccctt 4860 cttcgttgcc ctccatctca caggtctagt ttcttgatat tcttgaaatt taacggtgat 4920 cctgcatcgt tgaggtatca gcaaagaagc ctagttgcca ataagttaac gatttgagga 4980 ggtcggttct ccgactgacc cttcgttggc tgatgtggcg accctctctc gatgttttta 5040 tcttaacttt caagatccag ccaaaacaat gagcagccca aacacgtaaa tattcaacac 5100 cagctacact tccaaataaa ctaaaagatt ttttaaaaaa attttaaaag tgtttatatc 5160 gttaatttaa attccaagta ttggttgttt aaaaacaaag aattatttat tttaattaat 5220 aatattctat ttttataaaa aagactgatg aaaaagactg atggaataga cagagtaagg 5280 ctgatcagcg atgacactta gcactgatag gttaacgatt tgaggaggat tgagttttga 5340 taataaaatt taaaaaaagt gacttccgcg tctttaaacg atatattaat ttgattttgg 5400 cgcctatgtt tggtttattt attagatttt tgtattattt attagaattt tagacttaaa 5460 aatacttttt ctttaccttt tacaataact actacgctga ggatagccca tataaattcc 5520 taatagtcta acccgagtga cttaacctaa gaaaaatgat tgctcgcgta acctttcttt 5580 cattacttac cttcctttta agaaccatca cgatttaaca tcaaaaagga cattgaggag 5640 aagggttcga agacgacaaa atattcaaga gagaattgca gctaagcgga acttagaggc 5700 atctatagat agtgagggaa cattagaaaa ggagcagaaa gtggtcgaag aaccaccagt 5760 aaaaataaat aaagaagata cagaagaagt ggaaaataca ttaaggttct tggccgataa 5820 tttagacgtt ttcacgaaca atataattgt aaagtaaggt acatttaaat gataaatata 5880 tacttaaaaa cattagtaac ataattccga atcaccacct attgtaatac caatgatttc 5940 ttacccctat aacccttatc ccaaccggaa cacgtgactt tatccagaat ctacgtgtaa 6000 cccaaaccct aattatcaat aattatcaca acaattatca taaatatatt aaacatctaa 6060 atccccctta gaacaactgc tctaaatata cccactaaga cattgaatta cagagacagc 6120 caatttcaat acttgtctct aaaataaaac cgatgacaat aacaaaaaat aattcccgcg 6180 gaaaatgata gaaaaactca ccttgagaat tacctaactc ttttcgcgga gactaaagaa 6240 gctctgacac gtaaatagaa caatagacca ttgaaaactt acaacgatag aacaatggaa 6300 tttctaaaaa cagaagtttt acatttttat gtgcacac 6338 // ID AFRP1 repbase; DNA; INV; 118 BP. XX AC M30506; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE A.formosa repetitive sequence. XX KW AFRP1; Repetitive sequence. XX OS Acropora formosa OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; OC Scleractinia; Astrocoeniina; Acroporidae; Acropora. XX RN [1] RP 1-118 RA McMillan J. and Miller J.D.; RT "Nucleotide sequences of highly repetitive DNA from scleractinian RT corals."; RL Gene 83(1), 185-186 (1989). XX DR GenBank; M30506; Positions 1 118. XX SQ Sequence 118 BP; 33 A; 23 C; 23 G; 39 T; 0 other; gatcgacaag ttttggtggt ttttcagcca atgccatact ttttgcaaca tttttctaaa 60 aattgggtca aaaccctagt gcacggtact ttgcacaaaa agttgcttat ctcgagga 118 // ID Zator-1_CP repbase; DNA; INV; 5670 BP. XX AC AAWU01037170.1; XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Culex pipiens. XX KW Zator; DNA transposon; Transposable Element; Zator-1_CP. XX NM Zator-1_CP. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-5670 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR EMBL/GenBank/DDBJ; AAWU01037170.1; Positions 5724 55. XX FH Key Location/Qualifiers FT CDS join(442..930,906..1205,1108..1719,2269..2583, FT 2544..2837,2925..3719,3755..4333) FT /product="Zator-1_CP_1p" FT /translation="MDHNEMFGDLFKAFVKAYPEKKKSACQAEAILFWNQL FT KTSEPKDKLPELVRSKIQQLEALALGRQGALMKFWGKAASAASPTPAVSQS FT RASPSEHGASGSGQSSRKGEAKIAQPSSASCLKRPAPVQQKISEELNEVTA FT QVLRMKERESAGLMTEGLKGNSSNRPEGEFKQLKLLTGQRNKLEKIVESKI FT SQQERQKKRRVVEKSAKERLLEEIPEAGKLKKLGFYTSKIKYVYILSQVPV FT SKIKHRPSTPGRAANRFIEGHLNMFIFSAKFLSVRSNIGRPRLEEQQTDLL FT KAICDIAIYGSAAQDKRQLEAIRSIHTLDELKDALFHQGFKLSRSATYLRL FT LPRRSNTVEGERHVNPVPVKLQKATNTKHHDHSDGRFCTSTIWSLEELSSL FT LGPEEVAFLSQDDKARVVIGLTAANKQSPFLMHIEYRVKLPDHNWVVAQRH FT KLIPSVTAGIIMSMSMSMVDCQYDQKVKHYRCLPSCDALRRTYQFLNRVLC FT LTSYRRRAIRATMLWDGLSKRNEIIIYNSTMRISISMHFRSTSRGFARVDS FT LSDRSNELHTTDGPTSLATPRPLFQWPNFSGHTATPFSMISRIFDHFPAGS FT RASIHSPIDPTNTTRLMAQLLWPHRDPFFNDFENFLPLSRGFARVDSLSDR FT SNEHHTTDGPTSLATPRPLFQWPNFSGHTATPFSMISRIFYHFPVGSRASI FT HSPIDPTNTTRLSVTAGIIIKKNGLGKPEAVGYSGPTYVGIRSGKHSSSTA FT YSHALDFKRLTELEEFAPILRHDGIIKPVVVIVVDGGPDENPRYRKVIQVA FT IHHFLEHNLDAIFIATNAPGRSAFNRVERRMSPLNRELAGLILPHDNYGSH FT LDSQMRTTDEHLERQNFKFAGKTLAEVWGGLIIDGYPTVAEFIEPEASEVD FT PESLECRDQCWFSNHVRTSQYMLQIVKCDNRLKHFFAFYFRSCCSPLRSSL FT STVLPGRFLPPPIPFIHSSDVLKAAGKDDHQHFPSLFVAISLDVSKILPKS FT YLGFKKIPYDLFCPSQTNLVNDRICTTCGLYHASLVMLKEHKKEHKVTAAT FT KKIRSKHVVTRRRNELLVTTADGGDLDWLDSDTVDLTGIPMDWAEGVENLN FT DSHPIIDIEKHLLNPWGED*" XX SQ Sequence 5670 BP; 1577 A; 1253 C; 1301 G; 1539 T; 0 other; ggcgtcatcc ataaagtacg tcacgctctt agggggggag ggggtttggc gaagcgtaac 60 aaagtgtgac aaggggggga ggggggttca gatcagcgtg acgtaccgtt agatttattt 120 ttcttttaat ttgtcctaaa caaccaaccg caatgccaaa aagcttgaaa tagaaagcaa 180 ttctaaaacg catccgtagc caactcaaaa ataggtttgt gcaaacttga ttctaagtac 240 acggtttaag cgtgtgaaat caatatacac gattatagaa tttttgaatt tgcatgagtt 300 tcgtcagatt attagtatgc ttcattgaga gtgtactcag ttgacagcta caattttaaa 360 ttttcaactg ttcttttgac gtgtgcggcg caaattgagc ggaagttttt ttttaaacca 420 agtgcaaaga acgacttaaa aatggaccac aacgaaatgt ttggtgacct tttcaaggct 480 ttcgttaaag cgtatcctga gaagaagaaa tctgcgtgcc aagcagaggc gatcctcttt 540 tggaaccagc tcaaaacatc tgaaccgaag gacaaactgc cggaactcgt aaggtcaaaa 600 attcagcagc tcgaagcgct ggcgcttgga cggcagggag ctttaatgaa gttctggggg 660 aaggctgcaa gcgctgctag cccaactcct gcggtatcgc agtcgagggc gtctccatcg 720 gaacatggag catcggggtc cggacagtcg agccgaaaag gcgaagcaaa aatagcgcag 780 ccaagttccg cgagctgttt gaagcgacct gcgccagtcc agcagaaaat ctcggaagag 840 ctgaacgagg ttacagccca agtgctgcgg atgaaagagc gcgaatctgc gggattgatg 900 actgaaggcc tgaaggggaa ttcaagcaat tgaaattatt gactggccaa agaaataagt 960 tggagaaaat agttgaaagc aaaatcagcc agcaggagag gcagaaaaag cgacgagtcg 1020 ttgaaaagag cgccaaggaa aggcttcttg aagagattcc tgaagcaggt aagttaaaaa 1080 agctgggttt ttacacctct aaaataaaat atgtttatat tctcagccaa gttcctgtca 1140 gtaagatcaa acatcggccg tccacgcctg gaagagcagc aaacagattt attgaaggcc 1200 atctgtgata tcgcgatcta cggttcggcg gctcaggaca agcgacagtt agaagcgatt 1260 cgaagtattc atacattgga cgaactcaaa gatgcgctgt tccaccaagg tttcaagctg 1320 agtagatctg caacctactt acgattgctg ccacgtagat cgaacactgt ggaaggggaa 1380 cgtcacgtca atccagttcc ggtcaaactg caaaaagcca ctaacaccaa gcaccatgat 1440 cattcagatg gtcgattttg cacgtctaca atttggtctc tggaagagct ctcttcactg 1500 ctaggaccgg aagaggttgc atttctaagc caggacgaca aagcgcgagt tgtaatcggt 1560 ttaacagccg caaacaaaca gtccccattc ttgatgcaca tcgaataccg agttaaactt 1620 ccagaccaca actgggtcgt cgcacagcga cacaagctta tcccatcagt caccgctgga 1680 ataattatga gcatgagcat gagcatggtt gactgccaat gagctgctac tccgttattg 1740 acggatcagc tgaagttaca caatgaacca acagatgaat agtgggagct aatcatcctc 1800 actgtataac ccctgaagat ctctgcttta agtcaatacc ggcgcccccc caaggagatg 1860 cagttcaaca aaggtaggaa tgttagtccg atagttgaag ttgcagactc atcagacaca 1920 gagtttgtca ctctatacct gttgacaccg catgagaccg ttgaatccac agcatctcct 1980 tcaagcatca cgggaagtgg gggaattgtg ttagtagggg aaggaaaggg aaggtcagga 2040 ttcatcttgg tagatgatat gaccagaatg tgaaacaatc gttgccttcc gagctacgac 2100 gctatgagaa ggtcttatca atcgcgtatt ttttacttct taccgccggc gtgccatccg 2160 agcaacgatg ctttgggaag gactttcaaa acgaaatgaa ttttgaatta acgtattatt 2220 cggtgatgta caatttagga tagatcagga tccatttttg taagatgata tgaccaaaaa 2280 gtgaaacatt atcgctgcct tccgagttgc gacgctttga gaaggactta tcaattcctt 2340 aatcgtgtgt tatgtttaac ttcttaccgc cgacgtgcca tccgagcaac gatgctatgg 2400 gatgggcttt caaaacgaaa tgaaattata atatataatt caacgatgcg aatctctatt 2460 tcaatgcatt tccgatctac ttctcgcggg ttcgcgcgcg tcgattcact ctccgatcga 2520 tccaacgaac ttcacacgac tgatggccca acttctctgg ccacaccgcg accccttttt 2580 caatgatttc gagaattttc gaccactttc ccgcgggttc gcgcgcgtcg attcactctc 2640 cgatcgatcc aacgaacacc acacgactga tggcccaact tctctggcca caccgcgacc 2700 cctttttcaa tgatttcgag aattttctac cactttcccg cgggttcgcg cgcgtcgatt 2760 cactctccga tcgatccaac gaacaccaca cgactgatgg cccaacttct ctggccacac 2820 cgcgacccct ttttcaatga tttcgagaac tttctactac tttcccgcgg gttcgcgcgc 2880 gtcgattcac tctccgatcg atccaacgaa caccacacga ctgatggccc aacttctctg 2940 gccacaccgc gacccctttt tcaatgattt cgagaatttt ctaccacttt cccgtgggtt 3000 cgcgcgcgtc gattcactct ccgatcgatc caacgaacac cacacgactc tcagtcaccg 3060 ctggaataat tatcaagaaa aatggattgg gaaagccgga ggctgttggg tattccggtc 3120 cgacatacgt tggaatccgc tccggaaagc actcgtcgtc aaccgcgtat tctcatgcac 3180 tggatttcaa acgattgacc gagctggaag aatttgcgcc gatcttgcgt catgatggga 3240 tcatcaaacc cgtggttgta attgtggtgg atggtgggcc tgacgaaaac ccgcgctatc 3300 gcaaggttat tcaggtcgcc atccaccact ttcttgaaca taaccttgat gccattttca 3360 tcgcgactaa cgcgccggga agaagtgcct tcaaccgggt cgaaagacgg atgtccccgt 3420 taaaccgaga gctagcaggt ctaatcctcc ctcacgataa ttatggatca catctagact 3480 cacaaatgcg gaccacagat gaacatttgg agcggcagaa tttcaagttc gctggaaaaa 3540 cgcttgcgga ggtttggggt gggctgatca ttgatggata tccgacggtt gctgaattta 3600 ttgagccaga agcgtcagaa gttgaccccg aatcgttgga gtgtcgtgac caatgctggt 3660 tttcgaacca cgttcgaacc agccaataca tgctccagat tgtgaagtgt gataataggt 3720 aaatttagca tatattaatt taaaacaata ataattaaaa catttttttg cattttattt 3780 cagatcctgc tgttcgccac tgcgaagctc tttgtcgacg gtacttcctg gacggtttct 3840 tcctccacca attccgttta tccactcgtc cgatgtacta aaagctgcag gaaaggatga 3900 ccaccaacac tttccatctt tgtttgtcgc catttcactc gacgtttcga agatcctacc 3960 aaaatcttat cttgggttca agaaaatccc gtatgatctg ttttgccctt ctcagaccaa 4020 tctggtcaac gatcggatct gtacgacctg tggactgtac catgcgtctc tagttatgct 4080 taaggagcat aaaaaagagc acaaagttac ggcagcgacg aagaagattc gttccaagca 4140 tgtcgtgacc agacgacgaa atgagctatt agtgacgaca gcggacggcg gtgatcttga 4200 ctggctggat agtgatacag ttgatctcac tggtatcccc atggattggg cagaaggtgt 4260 tgagaactta aacgattcac atcccataat tgatatcgaa aaacatctgc taaatccttg 4320 gggtgaagat tagcattgct agattgattt gcgttgactg ctgaaacgat gatattatat 4380 tgttatattg tggatttagc tcgtagctgg attttctggc atcttctcac ggtcattgga 4440 aacgctcgtg gatacaccta ggggttccca gttggagtga aaaaattcaa tttatagaaa 4500 aattatggat taacattttg cttttttatc acatctccga catcaggaat aattgtttga 4560 acatgctcgc aattctttta ttcggtcgga aaaatattat ttttcttgaa aaaactttca 4620 tttttaatca catgatcacc cctaattctg gctcgatgcc gccatcatgc gaccgcgtgc 4680 tccgtttgtt tgtcaacaaa tttgcagctg gatggtgaga cgaagtgagg ggggaagaga 4740 ttttgacatt tttatgcccg catctggccc gccgcctatt tcgtcggtac cgtcagtggg 4800 ggtgacattg ggtctggggg gtgagattgg gtcaaagtga ttttttacgg attttaccat 4860 ttctcagatt cttttcaatg aaactgaact ctgttaaaag ggttgtgtag gggacatctt 4920 aagatgactt cgctgaaaaa aattcgttcc tagaacatat cctgttattt tggcagatgt 4980 ctaaagttgg ggtacgtttt tggccaaaaa tgacctctcg aaaaatcatt tttctaatat 5040 ttttgttaga attgctcgaa aactacccaa gtgtttgaaa atctccactt caaacattgt 5100 agcgtgtcaa tacgattccc tcgaacaaga aacgctgttg gatgacttgt ttttaccata 5160 tctttgtatt tcagcatcat tttagtttca tttgacccaa tgtcaccccc tctaaggggt 5220 gagattgggt caattttcaa acaattggca tttaaggtag tgttcatcaa aatcgaccat 5280 attttgggaa aatgttagta aactatcgaa gaacaaatct acgttaaaag attttcgatg 5340 ttatttaaat tgtttttgtt attaagaaat ttcgagaggt gtttcgtttt ttgacccata 5400 gtcaccccca ctgacggtac ctgagcagcc ggtcgtttcc agtgaccgag tccagctagg 5460 tactgatcgt acaataacaa tgaaataaaa acaacttgaa ataagataac ttcgtttttt 5520 taaataagat ttttctattg gctgggggag gcgggggggt catgttatgt gacgtacttt 5580 tatagggggg ttaatatgaa gcgtgacaaa gtgtgacatg gggggttaaa aatcgcccga 5640 tttagcgtga catactttat ggatgacgcc 5670 // ID CR1_Ele17 repbase; DNA; INV; 4657 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele17. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4657 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4657 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 26 CC sequences with >96% identity, and ~98% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 771..1622 FT /product="CR1_Ele17_1p" FT /translation="MSDICEKCANHITGDSINCGGFCSSMVCMRCSGIADD FT AYASIKANMHLVWMCTACKNLLSKARFSNSLVSINKANESVIESMKVEIRN FT SVLAEIKYEIRSNFKTLIDSVPRTPASIYRTPHPPSIRSKRLRDNDCDDDA FT ATHRPAKSMCCVGTSATVADLVVSEAAEENRAKFWLYLSGILPEVPESKVV FT ELAESRLKTTNLQVVKLVPRGKDARTLTFVSYKIGIPLELKTIALAPETWP FT RGIRFREFESVGNKKQFFWRPEAVTPNPIIQTETPIASSILPR" FT CDS 1535..4588 FT /product="CR1_Ele17_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="ETVFLEAGSCDTQPNHPDRDTDSIVHPSSLVSEELTT FT TATSVHHAISHIIPPSFPVPPSPTVTPFPRAPIVPLTIYYQNVGGMRTKTN FT DFHLLTASCDYDVIVLTETWLRSDVRNTELSSDYNIYRCDRSSMTSRFQRG FT GGTLIAVKKSVASMAVQLDRSDSLEQVAVKVSSSNKSVYVCGIYLRPNSDP FT GKYAAHSDCVEQIINKTSFGDSVVIVGDYNLPHLLWSYDDDVNSYIPTNAS FT TEQELSFAENMIATGLQQICNVRNSNGRLLDLAFVNSSEVELLEAPSAILP FT TDRHHKPFVLRLATRSASENLDVAVGQDFDFRRCNYEAVIDDLHLLNWDEI FT FNGANINEAVDTFYTLVFGIISRHTPLRRTNPQRSNYQPWWNSDLRHQRNV FT LRKARRRFLRHRTEXNKSFLRSLESEYNECVSSSFREYIFRIQDEVKADPS FT SFWRFFKSRKDAQAIPSEMSYGDSLSYDIEGSVELFADYFKSVYNVETPPD FT PHEMLNSLPSFDIHLPFPTFAVAEVVSALNSVDPSKGPGPDKLPPVFIQRC FT AEVLGPPVCRLFNLSLSEAVFPDAWKVAAITPIHKAENIHDVKNYRPISIL FT CCLAKTLELLVHSRMYAAAMPVISQYQHGFVKNRSTATNLMVFASAVNSSL FT EKRCQVDGVYVDFSKAFDKVPHELAMRKLERLGFPDWLVKWLRSYLRDRTA FT FVNLRSTKSTSFATPSGVPQGSHLGPLIFILFINDLSHRLNSKHLMYADDL FT KLYRIISSMVDCAALQQDIDAVARWCVMNGMEMNALKCKVISFTKSRSQLS FT FNYTANGVSFERVTSIKDLGVIMDRKLNFAEHISKTTTKAFAMLGFIRRNA FT SDFQDVYALKTVYCAIVRSLLEYAVQIWAPYQETHITRIEKVQRCFMRFAL FT RRLPWNDPLRLPPYENRCELIKLEPLRTRRVFLQRMFAFDVLTNRIDCPDL FT LQQANFFVPARRSRPRSLFWTARHRTVFGQNHPMERCFNLLNVDVFDFDVS FT RNKFKARIRLLH" XX SQ Sequence 4657 BP; 1231 A; 1129 C; 1014 G; 1282 T; 1 other; tctggcatca ctgctattgt ggtttgtgtt gtttctgtgt cgctctggat attattcaat 60 ttatcgccaa catctacgtc gaaaaatcat cgttcaccgc aaatccgact cataaaccga 120 aatcattcac ccgcaatcga ttctacgcca aagttggaca ggtaggatag tgattttaat 180 aatccgagct ggcttgagag tggaagttta gcctgcatcg ccattgcctc ctccagcaat 240 tacccacctc gccgctctcc acctgtacct tttggtcgta ctgcccaact gctgccccca 300 agtccaacca aaaatctcac actcgccatc tcacatctca tcgtcaccgc atcaatccga 360 atcaaaacaa acaactataa gacctgctag caattatttc tggagaatat acgtccgttt 420 ctcgtcccgt tttatcattc cgacaaatcc gaacgtctcg acgaccagga attctgcatc 480 gccgcatctg ggaatttttt cgaatgcgtt tttccgttca tccgtttcga atctgtaagt 540 tgtgagcctg tccacatcat taagtgcatc tcgtcaacga acattgccga taacgtcatc 600 gccatcggtt tcaacgaaca ccaatagagg cgtacgcgag aggtgagata gcttttcact 660 gttcacgtat atctgtcatc gttagtgaac ggacatctcg tcgattgctt tgagataggc 720 catctatccg ccactagtat catttctaca gtgttgctag tttgttcgca atgtcggata 780 tttgtgaaaa atgcgccaat cacataaccg gcgattcgat aaattgtggt ggtttttgct 840 cgtcgatggt ttgtatgcgg tgctccggga ttgctgatga tgcatacgcc tcgatcaaag 900 caaatatgca tctagtatgg atgtgtacgg cgtgcaaaaa tcttctatcg aaagcgcgat 960 tttcgaactc tttggtgtca atcaacaagg caaatgaatc cgtgattgaa tcaatgaaag 1020 tggaaattcg aaatagcgtg ctggcagaga tcaagtatga aattcgttca aatttcaaaa 1080 ctttgatcga ttccgtaccc cgtaccccag cttcaattta ccgcacacca catccacctt 1140 cgattagatc caagcgtctg cgtgataacg attgtgacga cgatgcggct acccatcggc 1200 cggcaaaatc catgtgttgc gttggaacta gtgctacagt cgcggatctg gttgtttctg 1260 aagccgcaga agaaaatcgc gcaaaattct ggctgtactt gtctggcata ttgccggagg 1320 taccagagag caaggtcgtc gaactagcgg agtccagatt gaagacaacg aatctgcagg 1380 tggttaaact cgttcctcga gggaaagacg ccagaacgct cacttttgtt tcgtacaaaa 1440 ttggaatacc acttgaactc aaaaccatcg ctctggcacc ggaaacttgg ccacgtggaa 1500 tacgcttccg agaatttgaa agtgttggga ataagaaaca gtttttttgg aggccggaag 1560 ctgtgacacc caacccaatc atccagaccg agacaccgat agcatcgtcc atccttcctc 1620 gttagtttcc gaagagttaa cgacaactgc tacgtccgtg catcacgcta tatctcacat 1680 cattccgcca tcgttccccg ttccgccatc gcccaccgtt acgccgttcc cgagagcacc 1740 tatcgttcca cttactattt actaccaaaa cgtcggcgga atgcgtacga aaactaacga 1800 ttttcactta ctgaccgctt cttgtgacta tgacgttatc gtgctgacgg aaacgtggct 1860 tcgcagcgat gtgaggaaca ctgaactgtc gtccgactat aacatctatc gatgcgaccg 1920 aagcagtatg actagtcgtt tccaaagagg agggggaaca ctcatcgctg tcaagaagtc 1980 tgttgctagc atggctgttc agctggacag atctgactcg ttggaacagg ttgctgtaaa 2040 agtatcctcg tcgaacaaat ccgtctatgt ttgtggaatc tatttgagac ccaacagcga 2100 tcccggtaaa tatgccgcac attccgactg tgttgagcaa attatcaaca aaacaagttt 2160 cggcgattct gtagtgattg ttggtgacta caatctgcca catttactat ggtcatacga 2220 tgacgacgtg aactcctaca ttccgacgaa tgcctcgact gagcaggagc tttcttttgc 2280 tgaaaacatg atcgctactg gactacagca gatttgtaac gtgcgtaatt ccaacggcag 2340 attgctcgac ttggcttttg taaacagtag tgaggtggaa ctacttgaag cgccatcagc 2400 tattttacca actgaccgtc atcataaacc gtttgtgcta cgcctagcca cgcgcagtgc 2460 cagcgagaat ctagacgttg cagtaggcca agattttgat tttcgacgat gtaactatga 2520 agccgttatt gatgatctac atctgttgaa ctgggatgaa atcttcaacg gagcaaatat 2580 caacgaagcg gtcgatactt tctacacatt ggtattcgga atcattagca ggcacactcc 2640 gctccgcaga actaatcctc agcggtcgaa ctatcagcca tggtggaatt cggatcttcg 2700 acaccaaaga aacgttcttc gcaaagctcg ccgtcgcttc ttacgccatc gcactgaaga 2760 maacaagagt tttctccgta gtcttgaatc tgagtacaac gaatgtgtgt cttcttcgtt 2820 ccgtgagtat atttttcgta ttcaagatga ggtcaaggcc gatccttcct ctttttggcg 2880 atttttcaaa agcaggaaag atgcgcaggc aatcccatcc gaaatgtcgt acggcgactc 2940 attaagttac gacatcgaag ggtctgtcga acttttcgcc gactatttca aatccgttta 3000 taacgttgaa actccaccgg atccgcatga gatgctgaat tccctgccgt ctttcgacat 3060 acatttaccg tttccaacgt ttgccgtggc agaagtcgta agtgctctca actctgtcga 3120 tccttccaag ggtcctggac ctgataaact ccctccggtc ttcattcaac gttgcgctga 3180 agtacttgga ccacctgttt gtcggttgtt caacctgtcg ttgtcggaag ctgtttttcc 3240 ggatgcttgg aaagttgctg ccataacgcc cattcacaaa gccgaaaata tccatgacgt 3300 gaaaaactat agacccatct cgattctctg ttgcttagct aagacactgg aactcttagt 3360 tcacagtaga atgtacgctg cggctatgcc agtaatatcg caatatcagc acggcttcgt 3420 gaagaatcgg tcaacagcta ctaacctcat ggttttcgct agtgctgtga actctagctt 3480 ggaaaaacgc tgccaagtgg atggcgttta cgtagatttt tccaaagctt tcgataaggt 3540 gccccatgag ttagcaatgc gtaaacttga gcggctagga tttcccgact ggttggtcaa 3600 gtggctacga tcatatctac gagacagaac cgcctttgtt aatctacgat ccaccaaatc 3660 aactagtttc gctactccgt ctggcgtgcc gcaaggtagc cacctcggtc ctttgatttt 3720 cattttgttc atcaacgact tgagtcatcg actgaactcg aagcacctta tgtacgcaga 3780 cgatttgaaa ttataccgta tcatcagttc catggtcgac tgtgctgctc ttcagcaaga 3840 catcgacgct gtagcacgct ggtgcgtaat gaacgggatg gaaatgaatg cgttgaagtg 3900 caaagtgata agcttcacta aatcacggtc gcagttgagc ttcaactaca ctgcaaatgg 3960 tgtcagcttt gaacgtgtaa cgtcgataaa agacctggga gtgataatgg accgtaagct 4020 gaatttcgct gaacacatct caaagacaac aacgaaagct ttcgctatgc tgggtttcat 4080 tcgtcgtaat gcatccgact ttcaagatgt gtacgccctg aagaccgttt actgcgcaat 4140 cgttcgcagc cttctcgaat atgctgttca aatctgggca ccatatcaag aaacgcatat 4200 tactcgcata gaaaaggtgc aacgttgttt catgcgtttt gcgttacgga gactaccatg 4260 gaacgatccg ctcagactgc ctccttacga gaaccgctgt gaacttatca agctcgagcc 4320 tttgcgaaca cgtcgtgtat ttctccaaag aatgtttgct ttcgatgtct tgacgaatcg 4380 gattgattgc cctgatcttc tgcaacaagc gaactttttt gttccggctc ggagatcccg 4440 cccacgttcg ttgttctgga ccgcaagaca tcgtacggta tttggacaaa atcacccgat 4500 ggaaagatgt tttaacttgt taaatgttga tgtttttgac tttgatgtga gtagaaataa 4560 gttcaaagct agaataagat tattgcatta gattaagacc aatacagtct gtacagcaat 4620 gctgaagacg aagtaaataa ataaataaat aaataaa 4657 // ID Gypsy-9_CQ-I repbase; DNA; INV; 1656 BP. XX AC AAWU01032098; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_CQ_; KW Gypsy-9_CQ-LTR; Gypsy-9_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1656 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 397-397 (2011). XX DR Genome; AAWU01032098; Positions 69627 71282. XX CC 'CGATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 59..1567 FT /product="Gypsy-9_CQ-I_1p" FT /translation="MPEGDNIQPDGNLLYERLLQQNERLEAQNARMMEMLE FT RFNLQDGSSRTSNGPEFIIETLASNIREFIYDPDNGLVFDRWYRKYEDLFL FT KDGAKLDDAAKVRLLLRSLNVAVHDKYVNFVLPKHPRDIEFKETVKKLTEL FT FSVQASLFSKRYQCFQLSKSESDDFVTYAGIVNKHCEDFELKKLTADQFKS FT LLFICGLRSSKDADIRTRLLSMLEVNAEGACTLEALITECHRLQNLKHDTA FT MVEHKPVSSVCAVKQDRDKPSPATGSSSSDSQSMKTQSALPPSPCWCCGDM FT HFVRDCAYKQHVCQDCMQTGHKEGYCSCVPKKSKNKQQTNVNSLYAANRVN FT STSKRKFLTVSINGSQASLQFDTGSDITVISRKQWCDNLDSPPLSPTKQIA FT RTASGKSLSLLGELRCQVTLGGVTRTGTFYVTEKQINLFGLDWIELFELWD FT TPMSAVCSGVTGKVNRVKQQRSLAVKSQGEANMKTGLGATDAASSQSESTA FT WYEEHLR" XX SQ Sequence 1656 BP; 425 A; 481 C; 439 G; 311 T; 0 other; ttggcgacga ggaaaaagca agatttttac ctttgaagaa gtgttgcgga tcacaagaat 60 gccggaagga gacaacattc agcccgacgg taacctgctg tacgagcgac ttctgcagca 120 gaacgagcgg ctggaggccc agaacgcgag gatgatggag atgctggagc gcttcaacct 180 gcaagacggt tccagtcgga cttcaaacgg tccggaattc atcattgaaa cgctggcgtc 240 caacatccgg gagttcatct acgatcccga caacggtctc gtatttgacc gatggtaccg 300 caagtacgaa gatctcttcc tcaaggacgg cgcgaagctc gacgacgcag ctaaagtccg 360 gttgctgctg agaagcctca acgtggccgt gcacgacaag tacgtcaact tcgtgctccc 420 gaaacatccc cgtgacatcg agttcaagga gacggtgaag aagctgaccg agctcttcag 480 tgtccaagcc tcgctgttca gcaaacgata ccagtgcttt caactgtcca agagcgagtc 540 ggacgacttc gtgacgtacg ctggcatcgt caacaagcac tgcgaggact tcgaactcaa 600 gaagctcacg gcggaccagt tcaagagcct gctgttcatc tgcggtttgc gctcttcaaa 660 agatgccgac atccgcacga gactcctgtc catgctcgag gtcaacgcgg agggcgcatg 720 cactttggaa gcactgatca ccgaatgcca tcgcctccaa aacctcaagc acgacaccgc 780 catggtggaa cacaagccgg taagctcagt ctgcgcggtc aagcaagatc gggacaagcc 840 ttcaccggcg acgggttcca gctccagcga cagccagtca atgaaaacac agtccgccct 900 accaccatca ccttgctggt gctgtggcga catgcatttc gttcgagact gcgcctacaa 960 acagcacgtc tgccaggact gcatgcagac cggccacaag gaaggttact gcagctgcgt 1020 gcccaagaag tccaagaaca agcagcagac gaacgtcaac agcctgtatg ccgccaaccg 1080 ggtcaactct acctcgaagc gtaagttcct gacggtgagc atcaacggtt cccaagcatc 1140 tctgcagttc gacaccggct cggacatcac agtgatctcg cggaagcagt ggtgcgacaa 1200 cttggattca ccgccgctgt ctcccaccaa gcagattgcc agaacagcct ctggtaagtc 1260 actgtcccta ctcggtgagc tccgatgcca agtcacactc ggaggcgtca ccagaaccgg 1320 aactttctac gtgactgaaa agcagatcaa cctgttcggt ctcgactgga tcgagctatt 1380 tgagctgtgg gacaccccga tgtcagcagt atgcagcggc gtcactggaa aggtcaaccg 1440 cgtcaagcag cagcgttctc ttgcagtcaa atcgcaaggg gaggcaaaca tgaaaacagg 1500 tctaggtgca accgacgcag cttccagcca gtccgaatcc acggcatggt acgaagaaca 1560 tctccgctga tactgttccc aatatctcac cgcttgaaga tggcgagcga tctggatcac 1620 atcaagtcgc accctgtgct ggtcttaagg gggaga 1656 // ID BEL-213_AA-LTR repbase; DNA; INV; 706 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-213_AA_; KW BEL-213_AA-I; BEL-213_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-706 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 884-884 (2011). XX DR [1] (Consensus) XX SQ Sequence 706 BP; 190 A; 168 C; 169 G; 174 T; 5 other; tgtacgatag ctctacccag aataacaaca gattatttgc tgttcaacaa attgcgacga 60 tagagatgac caaaacaaac ccaccgataa ttggaatagc cggctcaata cacttaccgt 120 ccggcgtgcg gacgatggtc gcctagtttg acccacattt catcccagca acgatagcag 180 ctgttcgcat gcgtatgggt cacccgatgt ccgagatgtg cgamgtagak ctacctagag 240 cagcagagtg caggagagta cagagatgtt gctcagatct cctgttctta ggcagattgt 300 gagtgaccct taggtttatc gatgaccaat ttaaattggg agaagtcagt ctggattgag 360 cgtcgaacgg gtcggtcgwt gtaccgtgat ttttaatctg ttagatttaa gacgatatgt 420 gttagtgtta agtaataaat tgtgtgtata aagattgtgt ttccctccaa ccgcgcgtgt 480 gtttgtgttt ccaagtgaga aggcctacca ascccagaaa tcaccggatc accaggatct 540 tcaggacact ggaattggta tccaacacgt cctattcgac gatactgtgc caccaagccg 600 tcagcttcag tcgcccatca cccgcaccat tcccccaggc gatacawcgt gttgaagttg 660 gtcaccgaga gctatcgacc aaagggtgag gaccaactac gcaaca 706 // ID FR1 repbase; DNA; INV; 189 BP. XX AC X78688; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE FR1 repeated sequence. XX KW FR1; Repetitive element. XX OS Spodoptera frugiperda OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Noctuoidea; Noctuidae; Amphipyrinae; Spodoptera. XX RN [1] RP 1-189 RA Lu J.Y., Kochert D.G., Isenhour J.D. and Adang J.M.; RT "Molecular characterization of a strain-specific repeated DNA in RT the fall armyworm Spodoptera frugiperda (Lepidoptera, RT Noctuidae)."; RL Unpublished. XX RN [2] RP 1-189 RA Adang J.M.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (07-APR-1994). M.J. Adang, RL University of Georgia, Dept of Entomology, Athens, GA 30602, RL GREECE. XX DR GenBank; X78688; Positions 1 189. XX SQ Sequence 189 BP; 54 A; 34 C; 36 G; 65 T; 0 other; gaattcgtgt aaaacgtact ttctttgttt ttctatgaga gaagacattg gttgaccttt 60 ttacaccggt cacaacgaat attcgtgatt gcacttccac tacaatctgt ttcacggatg 120 agttgaagga aatatcggat aagcattctt tgtcggaaat cataatataa aacgtgctct 180 tctatgtcg 189 // ID Gypsy-2_BM-I repbase; DNA; INV; 4805 BP. XX AC nscaf2953; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_BM_; KW Gypsy-2_BM-LTR; Gypsy-2_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4805 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 979-979 (2010). XX DR Genome; nscaf2953; Positions 3438914 3434110. XX CC Positions [3171-3647] - Integrase core CC 'TATGTA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 52..2064 FT /product="Gypsy-2_BM-I_1p" FT /translation="MMFEQMSLFSSGMEAMLQRIAENMSVHRPSQSSPTTL FT VNFDPDEPEADIVNWCTLSEMIIEQKKLDGVDLILTLTHSLKGRAATCLTK FT IQPGKISWPTIKEMLISKFSKPMMMQDHFDLIIKFEIEGKEGPAEAGIRLW FT QLIEKIPDANMPEQVITGFAISILSQCDERIRRELNSVIITTKTQLFRTLR FT GFTLKRKNEDPASSDSEPKRFRSLFRGSCHLCGKLGHRGVECRDRRLLSKN FT FIIPAKPEQPPRPWARSSRPQVSCYVCGDANHVASGCPMRYKKKDDTAPGA FT VAGPSRYVNVCSRASHGVLQISGVDFPFMFDSGSECSLIKQSHSNLLIGKR FT FHEQVTLRGVGSNNIASTLQIKCPTSVQDLRFDVLYHVLPDKNLTESVLLG FT RDILEAGLSVKISDGKLTFAKTKTSNSCSKSEICDLNMIDTDLVNENRDRL FT IRILKKYSGSFIQKLPTARIKTAELKIDLIDPTKIVHRRPYRLSPTELKIV FT NEKVNEMLVANIIRESCSPFASPILLVRKKDGGDRMCVDYRELNSNTRPDS FT YPLPLINDQIDKLHGARFYCSIDMASGFHQLKVHKDSIEKTAFVTPTGQYE FT FLTMPFGLRNAPQVFQRAINAVLKPLNDDKILIYMDDVLSASETIDEGLSR FT LDKLLKALSNSGFSFNFKKCFL" FT CDS 2055..4223 FT /product="Gypsy-2_BM-I_2p" FT /translation="MFFMKQKVEYLGYVISSGEVRPNPRKIEALRNVPKTT FT TVKQVRQLIGLATYFRQFIPNFSNLLKPLYPLTSGKGTISWTTEHDKIHTK FT IVNYLTSEPVLRIFDPSLPIELHTDASSEGYGAVLIQKVDNLPHVVAYFSH FT RTSDAESRYHSYELETLAVVKSVEHFRHYLYGRKFTVFTDCNALKASHSKK FT DLTPRVHRWWAILQSYDFSIVYKEEKNMAHADFLSRNSLISQNGAKEKIVN FT FSELERGWLSIEQQRDSEISDLIVKAKSNELPPEISHTYDVRQGILYRKIE FT RGKTTKWLPILPRSLIWSLISHVHTEIKHLGYEKTLDKLYELYWFQNMAKY FT VKKFVDSCVICKSSKGPSGAQQVRLHPIPKIAVPWHTVHIDLTGKLSGKSD FT QKEYCSVLIDAFTKYVLLKHTLNLNSANAIKSVKDAVHLFGAPKRIIADRG FT RSYDNTGFKDFCKDSNIELHLIATGTSRANGQVERVMRTLKSLLTIAETNS FT DQSWQEQLGDIQLALNSTRCRVTGFTPIELMFGVQGSSLEISKVTTDSDNP FT SRLDLDLVRSNASENIKKMAQAEVARFSRGKAKVKPFSVGDFVFVKSEERH FT QTKLDRKYKGPFKITAILDNDRYELRHINGSNRVYKFSHENLREVPRGPSG FT LLEASENYSDDDVTACTDELIDLVHETDLNNDCSVTDNRSNTLSANSDTMS FT VGSDTISVSSGTSGIGNVEKLEK" XX SQ Sequence 4805 BP; 1600 A; 951 C; 974 G; 1280 T; 0 other; gaagtgtcgg actcacctcc actgcagtca tcaaataact cggccaacaa tatgatgttt 60 gaacaaatgt cattattttc atctgggatg gaagccatgc tgcaaagaat cgctgaaaat 120 atgtcggttc accgacctag ccagagttcc ccaaccacat tggtcaattt tgacccggat 180 gaaccggagg ctgatattgt aaattggtgt acgctgagcg aaatgatcat cgaacagaag 240 aaattggatg gtgtagattt aattttaact ctcacacatt cacttaaggg tcgagctgct 300 acttgcctca ctaaaataca acctggtaag atatcctggc ctactattaa agagatgctg 360 atatcaaaat tctcgaaacc tatgatgatg caagaccatt tcgacctgat tattaaattc 420 gaaatagaag gcaaagaggg accagctgaa gccggtatcc ggctatggca actcatagag 480 aagattcccg acgccaatat gccagaacaa gtaatcaccg ggttcgcaat atcaatactg 540 tcgcagtgtg acgaaagaat tcgtcgagaa ttaaactctg ttattattac caccaaaact 600 caattatttc gcacactacg tggcttcaca ctgaaaagaa aaaacgagga tcctgcttca 660 agtgattcag aaccaaaacg ctttcggagc ttattccgtg gaagctgtca tctctgcgga 720 aaacttgggc atcgtggcgt agaatgccgc gatagacgac tactctcaaa aaatttcatt 780 attccagcaa aacctgaaca acctcctcgt ccgtgggcgc gcagttctcg cccacaagtc 840 agctgttacg tgtgcggtga tgcaaaccat gtagcttctg ggtgtcccat gcgctacaag 900 aagaaagatg acaccgctcc aggagccgtg gcaggtccaa gcagatacgt caacgtgtgt 960 tccagagctt cacacggtgt gttacagata agtggtgttg atttcccctt tatgtttgat 1020 tcagggtcag aatgcagtct cataaaacaa agccacagta atttactcat tggcaagcgg 1080 tttcatgaac aagtcacttt gagaggtgta ggtagtaata atatcgcgtc tacgttacaa 1140 attaagtgcc ccactagtgt acaagatctg aggtttgatg tgttgtatca cgttttgcca 1200 gacaaaaatc tcactgaaag cgtattgtta ggtcgtgata ttttagaggc aggattatcc 1260 gtaaaaatca gtgacggaaa attaactttt gctaaaacca aaacctccaa ctcttgttca 1320 aagtcggaaa tttgtgactt aaatatgata gacaccgatt tggtaaatga gaatagggat 1380 cgtttaatta gaatactcaa gaaatattca ggttctttta ttcagaagct accaaccgcg 1440 cgtattaaaa ctgctgagct gaaaattgat cttatagacc ctactaagat tgttcataga 1500 agaccttatc ggttatcccc aactgaactg aaaatcgtta atgaaaaggt taatgaaatg 1560 ttagttgcca atataattag agaaagttgc tcgccatttg caagccctat cttattagta 1620 agaaagaaag atggaggtga tcgtatgtgt gtggattaca gagagcttaa tagtaacacg 1680 cgccctgata gttaccctct tcccttaatc aatgaccaga tagacaaact acacggagcc 1740 cgattttact gtagtatcga tatggcctct ggtttccacc aactgaaagt ccataaagac 1800 tccatagaga aaactgcctt cgttacacca acaggccaat acgaattttt gactatgccc 1860 tttggtcttc gaaatgctcc acaagttttc caaagagcaa tcaacgcagt attgaaaccg 1920 cttaacgatg ataaaatttt gatttatatg gatgatgtgt tatctgcgtc agaaactatt 1980 gatgaggggc tatcacgcct tgacaaattg ttaaaagcct tatccaactc cgggttttca 2040 ttcaatttta aaaaatgttt tttatgaaac agaaggtcga atacttgggt tacgtaatct 2100 catcagggga ggtcaggcca aacccgcgta aaattgaagc tcttcgaaat gtcccgaaaa 2160 ctaccactgt aaaacaagtc agacaattaa tcggcttggc aacctatttt cgtcagttta 2220 tcccaaactt ctcaaattta ttgaaacctc tatacccttt aacatcagga aaaggtacta 2280 tctcatggac tacagagcat gacaaaattc acactaaaat tgtcaattat cttacatctg 2340 aacccgtgtt aaggatattt gacccttctc tacctatcga gctacatact gatgcaagta 2400 gtgaggggta tggcgcagtc ttaatccaaa aagtcgataa cctccctcac gtcgttgctt 2460 actttagtca ccggacgtct gacgcagaga gccgctatca ttcgtatgaa ttggaaacgt 2520 tggcagtggt caaatctgtg gagcatttcc gtcattatct ttatggccga aaatttactg 2580 tgtttactga ttgtaatgcc ttaaaagcct cccattcaaa aaaagaccta acacctaggg 2640 ttcacaggtg gtgggcaatt ttgcaatctt acgatttttc catcgtgtat aaggaagaaa 2700 agaatatggc tcatgctgat tttttgtcgc gaaactccct tattagtcaa aacggtgcta 2760 aggaaaaaat tgtaaatttt tccgagcttg aaagggggtg gctctctata gagcaacaac 2820 gtgattccga aatttcagat ctaattgtta aggctaaatc caacgaatta cctcctgaaa 2880 tctcgcacac atacgatgtt agacaaggta tactgtatag aaaaatagaa cgtggaaaaa 2940 ctaccaagtg gctacctata ttgcctcgat ctttgatctg gtctcttata agccacgtgc 3000 atactgaaat aaaacatcta gggtacgaaa aaacgctcga caaactgtat gaactttatt 3060 ggtttcaaaa tatggctaaa tatgtaaaga aattcgtaga ttcttgtgtg atatgtaaat 3120 cttcaaaagg cccatcgggc gctcaacagg tacgtttgca tccaatccca aaaatagctg 3180 ttccgtggca taccgtgcac atagacctta ccggaaagtt aagcggtaaa agtgatcaga 3240 aggaatattg ctctgtcctc attgatgcct ttacaaagta cgtattgcta aaacatacat 3300 tgaatttaaa ctctgccaac gcaataaaat cagtcaaaga tgccgttcat ctatttggcg 3360 ctccgaaacg aatcatagct gaccgagggc gttcatatga taacacagga tttaaagact 3420 tctgtaaaga tagcaatata gaattgcatc taatcgctac gggtaccagc agggctaacg 3480 gacaggttga gcgcgttatg cgaactttga agagccttct cacaatagct gaaactaact 3540 cagaccaatc atggcaagag cagttaggag atattcaact cgcccttaat agtacacgtt 3600 gtcgagtaac agggttcact ccaatagaat taatgtttgg cgttcaaggt agctcacttg 3660 aaatctctaa agttacaaca gactctgaca atccctctag attagatctt gatttagtac 3720 ggtcaaacgc ctccgaaaat ataaaaaaaa tggcacaagc agaagttgct cggtttagtc 3780 ggggcaaagc taaagtaaaa cccttttctg ttggagattt tgtttttgtt aaaagtgaag 3840 aacgacacca aactaaatta gaccgaaaat ataaaggtcc ctttaagatt actgcaattt 3900 tggataacga tagatatgag ttgagacata taaatggttc taatagagtg tacaaatttt 3960 cccatgagaa cttacgcgaa gtacctcgag gcccgtcagg cttactagaa gcttctgaaa 4020 attatagtga cgatgacgtg acggcgtgca cagatgagtt aattgatctt gttcatgaaa 4080 ctgatctcaa taatgactgc tctgtaactg ataatagaag taacactctg tccgcaaata 4140 gtgatactat gtcagtgggt agtgatacaa tatctgtaag ctcaggcacc tctggtattg 4200 gaaatgtaga aaaactagaa aaataaaatc ggtctcgcac tatttctgaa gaaactctat 4260 ctaaatgttc tggttaccag gggagtaaaa aaaaaaaaag aagaaaaaaa aagagaaaaa 4320 aaaaaaaaaa aaaaaaaaaa aaaaaaagag aattgaagtc attgaagggt ctatccggtc 4380 caatgacttg gctaaaggga tgataatccg gactagcaaa aaaaaaaaaa aaaaagaaaa 4440 aaaaaaagag aattgtaagt cattgcaggg tctatccggt ccaatgactt ggctaaaggg 4500 atattaatcc ggactagcaa aaaaaaaaaa aaaagaaagg ctatagggat gtttatccgg 4560 actagccgca ataacttgct acagggatta cgatccggtc tagcattatc cggtccagtt 4620 taaaaactga agggataaca tcagggtttc tgacgaatta tgcgaaaccg aaccgaccca 4680 gattgttaat atcatagacc acattttaag ttgcactggt taaagtcagg atgaacgaat 4740 acgcttattg tctacagacc aaccataaaa aggcgcacga ggacgagcgc atgtcagtat 4800 ggccg 4805 // ID P-4_HM repbase; DNA; INV; 3017 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3017 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 350-350 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 112..2691 FT /product="P-4_HM_1p" FT /translation="MVNKCCVVNCRSNYHNYGEVSTVVFSFPKNEELKKYW FT IKFVNRKDWTPTNSSVICIKHFEKKYYKKGNKNQRFRLIKNLKPIPTIFDC FT TNLTEEGSLQLIKSSNSLRKSPTKRIFQPDQYEQFLLNDLINSFNDITESF FT APDGFSFLKYDDHVIFYKLSHSTLSIPEVTECIRVNNEMHVKLFYRGSPLP FT LPQWFCQGRDCRLSCKSMLHNFPTYIKSQAEQFSSVLTELQQIKYMKKPIY FT SANIIRYSLILRYTSMQSYQLLREELPLPSLSLLKKITSGNIDALSCAKRL FT KLEGKISQDICLIFDEMYLQKCEEYFAGEMIGSDESGEMYKGIVCFMIVGL FT KQNIPYVIKTSPEKTINAEWLKEEIFECLRILTECEFNVRVIVCDNHSANV FT SCFKKILCASNQQAEDLFFIYNLKKIYLCFDVVHLIKNIRNNLLNCKRFLF FT PDFNFNGFKDPIIVCGGEIKWSTFHNIYEKDLTLDAGLRKAPKITMKVLHP FT GNFKQNVSLALAIFDETTSAAIHSYFPDLKSSADFLTLFNKWWVLSNSKCQ FT YSACNILGNAVVSGDKKPEFLREMAHWIEVWQSQKIPNCEKFTLSVQTAAA FT LIRTLNSHASLIEDLLNDGYHFVLTARFQSDPLERRFGQYRQMSGGRFLVG FT LKDVTISEKILKIKSIVKESINFDDSLKVPESENNLTDWKNFLLDIDEAQC FT TPEIMTLAPESREVGAHIAGYIAKKLKKRFGTCCKEFLCCVNIDESNPDHT FT YLTIISRGGLTIPSPSLLDYVCTSFAMIDYFYKKIKKFSLHARQALEYLLN FT HFYDSFQTFSCPMHEKTGIKLAGRIVVNIFLNNKRIISTSEVAVDHVKSFK FT KRKFEKI" XX SQ Sequence 3017 BP; 1077 A; 395 C; 480 G; 1065 T; 0 other; catggcctac tttatttata cggccggata ttttacattt tgaagaaaaa aatagaaggc 60 cagttttctg gaaaagtgtt tttaagaaag cagctttgga agttcaaaaa gatggttaac 120 aaatgttgcg tcgtaaattg tcgttcaaat tatcataatt atggtgaagt atctacagta 180 gtattttcgt ttccaaaaaa tgaagagtta aaaaaatatt ggattaaatt tgtaaataga 240 aaagattgga cacctactaa ttcttccgtt atatgcatca aacattttga aaaaaaatat 300 tataagaaag gtaataagaa tcaacgtttt agattaataa aaaatttaaa gcctattcca 360 actatttttg actgcacaaa tttgactgaa gaaggttcat tacaacttat aaaatcttct 420 aattcattaa gaaaatcacc aactaaaaga atatttcagc cagatcaata tgaacagttt 480 ttattgaatg atttgattaa tagctttaac gatattactg aaagttttgc tccagatggt 540 ttttcattct tgaagtatga tgatcatgta attttttata aattgagtca tagtacttta 600 tctattccag aggttacaga gtgcattcgt gtaaacaatg agatgcatgt taaattattt 660 tacagaggtt cacctctgcc attacctcag tggttttgtc aaggaagaga ttgtcgttta 720 tcttgtaaaa gtatgcttca taattttcct acctacatta aatcacaagc cgaacaattt 780 tcatctgtat tgactgaatt acagcaaatt aaatatatga aaaaacctat ttattcagcc 840 aatattataa ggtattcact gatattacga tatacttcta tgcagtcata ccaacttttg 900 cgtgaagaat tgcctttacc atcattatca ttacttaaaa aaataactag tggaaatatt 960 gatgctctaa gttgtgcaaa acgtttaaaa ttagaaggta agatatctca agacatttgc 1020 ttgatttttg atgaaatgta cttgcaaaaa tgtgaggagt attttgctgg agagatgatt 1080 ggtagtgatg agtctggaga aatgtataag ggaatagttt gttttatgat agttggcctg 1140 aaacaaaaca ttccatatgt gatcaaaacg tctcctgaaa aaacaatcaa tgctgaatgg 1200 ttgaaagaag aaatttttga atgtcttcgt attttaactg aatgtgaatt taatgttaga 1260 gttattgtgt gtgacaacca ttctgctaat gtttcttgtt ttaaaaaaat cctttgtgca 1320 tcgaatcaac aagctgaaga tttgtttttt atttacaatc taaaaaaaat ttacctctgt 1380 tttgatgttg tacatcttat taaaaacatc agaaataatt tgttaaattg caaaagattt 1440 ttatttccag attttaactt taatggattt aaagatccta taattgtatg tgggggagaa 1500 attaaatgga gtaccttcca caatatttat gaaaaagacc tcactcttga tgctggtttg 1560 cgtaaagcac caaaaataac aatgaaagtg ttacatcctg gcaattttaa acaaaatgtt 1620 tctttagcac ttgcaatttt tgatgaaact acatcagcag cgattcactc atattttcca 1680 gatcttaaaa gtagtgcaga ttttcttaca ttatttaaca agtggtgggt tttatcaaat 1740 tctaaatgtc agtattcagc atgtaatatt ttaggtaatg cagtcgtttc tggtgataaa 1800 aaacctgaat ttcttcggga aatggcacat tggattgaag tttggcaatc acaaaagata 1860 ccaaactgtg agaagtttac tttaagtgtt cagacagctg cagcactaat aagaacactt 1920 aatagtcatg cctcattgat cgaggacttg cttaatgatg gatatcactt tgttttaact 1980 gcgagatttc aaagtgaccc attagaaagg agatttggac aatatagaca aatgagcggt 2040 ggaagatttt tggtaggttt aaaagatgtc actatttctg aaaaaatatt aaaaatcaaa 2100 agtatagtta aagaaagtat taacttcgat gatagtttaa aagtgcctga gtcagaaaat 2160 aatttaacag attggaaaaa ctttcttctt gatattgatg aagctcagtg tactccagaa 2220 atcatgacac ttgcaccaga aagtagggaa gtaggtgctc atattgctgg ttatatagca 2280 aagaaactta aaaaacgttt tggtacatgt tgtaaagaat ttttatgttg tgtcaatatt 2340 gatgaaagca atcctgatca tacttattta acaattattt ctaggggtgg tctcactatt 2400 ccttcaccat cacttttgga ttatgtttgc acatcctttg ctatgattga ctacttttat 2460 aaaaaaataa agaaatttag tttacatgca cgacaagctt tagaatattt gttaaaccac 2520 ttttatgact cttttcagac tttttcttgt cccatgcatg agaaaactgg gataaaactt 2580 gctggaagaa ttgttgttaa catttttctt aataataaaa gaattatctc taccagtgaa 2640 gtagctgttg atcatgtaaa atcttttaaa aagagaaaat ttgagaaaat ttaataaatt 2700 atatataatt attttatttt taaatgaaaa aagagaaaca tttaaatatt ttttataact 2760 gttcttgttt cttacatcgt ttttactaat actaaatttt aaaataaaca ctatttattt 2820 cttaataaat gttcgtttaa tatctaaaat tatgaataaa tacaactttt gtggactttt 2880 taaatgtaga aagtaaaata atattgaaaa tgttcaaaac gtattaaatt tttgaaacat 2940 tacgatttta aaaactggcc ttctattttt ttcttcaaaa tgtaaaatat ccggccgtat 3000 aaataaagta ggccatg 3017 // ID BEL-11_CQ-LTR repbase; DNA; INV; 235 BP. XX AC AAWU01008510; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_CQ_; KW BEL-11_CQ-I; BEL-11_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-235 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 176-176 (2011). XX DR GenBank; AAWU01008510; Positions 114731 114965. XX SQ Sequence 235 BP; 77 A; 47 C; 48 G; 63 T; 0 other; tgtttcgcgc aaaataacct aagagtgtaa gaaacgttgt agcgttctaa atttgacagc 60 tagtctctag aattttcttg tcgccactgg cggaaatagt taacgaagag gaggagaaga 120 agaaatacaa gtaaaaagct actcgcgcgg taaagacgtg ttttttattc cgcaataaat 180 ctagttttca atacagtcca cgcgttccct tttccactga ccaatccgaa gaaca 235 // ID Sola1-1_MonBre repbase; DNA; INV; 2354 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola1-1_MonBre. XX OS Monosiga brevicollis OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. XX RN [1] RP 1-2354 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2354 BP; 547 A; 649 C; 619 G; 539 T; 0 other; gtgtgaggcg cagtcaaaat tcctagtaac ttgtttttct gacgaatgga gttagcagca 60 cgaccatgag cgttggtgag gtgcgcgcag cctccaaaaa caccgaaatc gatagggtac 120 tcctgtgacg cgccacgacg ctgccaccac gactgccaaa aagtctgaca ttttcatttc 180 gttcgcgttc tcgagttaca gcttaatctt tgatgaggcg acatgttcgc ggagattgag 240 gcggagcttg ctgctattcc ggaagatgac ccagacagcg gagtgggcat acaccaggat 300 gagctggtgg gtgaagactc tcaagtatgt gaaggggccc gacacgctga gctcagtgtt 360 gaagaactgc gctcgcggac gctcgcctgg gcgacaaagc ccggctgctt gtgctcgtgc 420 ggggacaggg agcgaccctg cttcaatgtt gacgaccctg tggatgtgaa atgggcaatg 480 ggttttcggc gcttatccga gcaagaccga acagcagcgg tccgtgcaat gctgtgggcc 540 ctttcatctc cgtcggggat caataatgac ctcagtcgcc gttctggtcc cgccaagaag 600 cgcaagggtg acctaccagt cgccgacaca tcccaatcag atcgagctgg gcgcaccact 660 acagtctacg ctatacgtgg gcggcgagtg tgccttggtg gctttactgc gttcacacaa 720 gtgagtgctt ccttgatcaa cgaccatcga ggccagctgt ccaagctcga tagttttgag 780 gctcatgagt cggcaacagg cagggccagg gcatcaacga ctacataccc aaatcgcact 840 ggcgtttctg gcacgctacg ctgaagaaca cgcagcctac tgtccatcgg gtgccggccg 900 tgaacgcgat caaccaatac agattctgcc aactagcaca acgaaactcc aagtctatca 960 agcctatgaa gccaaatact cggttcttgc cgcgaccttg agcgccaaaa ttcaggcatc 1020 aacaccagat gccgtgcagc cctctgacgc tcccctctcc cgtgctgcct ttcttcgagt 1080 ttggcaagca cacctcccaa cacttcgaat tcgcaagtct ggctctgatt tttgcgatga 1140 gtgtgggcgc ctcaaggccc tcctcagtca atctccaagc attgcggaga cccttgccgc 1200 tcatagagcg agagcacttg tggagcgcac tatctaccag cagacccgta ggcaagctga 1260 ggcatcagga gctccgattc tgcacctcac cttcgacttt gctgagaaag ttttgctgcc 1320 acgagttctc gatcagccag gcaagatgta ctttgcgtct ggactcaagg ttgacctgtt 1380 tggcgtggcc atcagcaaca cgaagcagca gtttaactac gtgcttgttg aaggccactg 1440 gccgatggga aaagatgcca atcgtgtctg ttccatgtta cacgatgccc ttgccgaccc 1500 acgcttggct tctctaccac aagctcgata tcttgagctg catgccgaca attgtgcagg 1560 tcagaacaag aaccgcttcg tgatgcagta tctcgcctgg cgtatcatgg ttggcttgag 1620 cgagaccatc aatctacatt tcatggttct ggggcacacc aagaatcact gtgatgccta 1680 ttttggtctg atcaaacgca agttgcggag acgagacgtg tggaccccaa gagacatgat 1740 ggatgtcgtc agtgccagtt gcccagcttc tatctgcaag ccgggaaatc gagtggaatg 1800 gatcgactgg aaagccattc ttggccaata ctttcttgac aagggcattc cagacatctc 1860 gctcatgtcg cactttgagt tcagcgccag ctggcccgga caagttgcga tccggtgcac 1920 acctgattca gctgaacagc ggattcatct attgcgacct ggtgtgacaa tggatcaggt 1980 ggcagctcgc gcagtagaag acctggcggg tatgaagctc gcgattccct cgctcagctc 2040 cctgcgtgcc accagcaagc tgaacaggca tgagtatctc accgcagtcg tttcacagtt 2100 tcctgggcaa ctgggacaac agacgccaga cttctttggt aatggcgatc cagagacaac 2160 ggccccttcc agctagatca agcataaaac aaccatatct cgccttggag agcaggcacg 2220 accctcattt ttttttctcc agatgcgcaa aaccccagtc gtttgcaggt gtttttggca 2280 tttttgggag taaaaccgct gtttctcgag tctgacaaaa acaagttaat aggaattttg 2340 attgcgcctc acac 2354 // ID Penelope-3_HM repbase; DNA; INV; 2346 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2346 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2093-2093 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 33..2153 FT /product="Penelope-3_HM_1p" FT /translation="MEKINLKYSIKNIPIPSEKEYKLKLLNKTEMLVKKMR FT WKAFFFINNYPRNDSNYTDYGIKSSKCPPQINEMRSFEKDLINLVKSIKFC FT NVNCKFQNNMQKDLKSIRTSHKTLIKADKTSNLYKLSKDEYNKLLTNAITS FT TYKKSMPELKDQIIKEGKNLLIHHEVYNNIEINGSSGCFFTLKDHKDNFIN FT NPSVRLLNPSKNEVGRISKVILSKIIHELKNKLQLNQWQSTHFVIDWFKKI FT KNKHLCKFLIFDINDFYPSISENILTNAISFAEQHVIIKNDDKKIIYHARK FT SLIFNNDEAWIKKRGGLFDVTMGAFDGAEVCELVGIFMLFQLSQHYDKNNF FT GLYRDDGLAIFENKSGQQMEKIKKHFTHIFKSNNLLISIQCNIKIVNFLDV FT TLNLIDSSYQPYCKPNNQLMYVHSESNHPPNIIKEIPRTIELRLSNMAVNE FT TVFNNSIIPYEEALRKSGYNSKLTYQPQINKNQKRHRKRKIIWYNPPFSKN FT VETKIGNRFLALIDQHFPVGHRLHKIFNRNYIKVSYSCMPNVKSLINTHNN FT KILHSDKNKTSKKCNCIDKNICPLNNQCLFSNIVYQATVSSGDPQCKDKVY FT FGISETTFKLRYANHLKSFNAPKYKNDTELSKEIWELKSKHINPVITWKII FT AQCKPYNQASKVCNLCLREKFHILSYKGENLLNKRYEIISKCRHSKKFLLS FT LFDTGD*" XX SQ Sequence 2346 BP; 945 A; 386 C; 316 G; 699 T; 0 other; agttgagcac tcttctcaag tcgtcaacaa aaatggaaaa aataaacttg aaatattcaa 60 tcaaaaatat accgattcca tctgaaaaag aatataaatt aaaactacta aacaaaacag 120 aaatgctcgt aaagaaaatg agatggaagg cgttcttttt cataaataac tacccaagaa 180 atgattcaaa ttatacagac tacggaatca aatcatcaaa atgcccacca caaataaatg 240 aaatgcgttc tttcgaaaaa gaccttataa acttagttaa atcaataaaa ttttgcaacg 300 taaactgtaa gtttcaaaat aatatgcaaa aagacttaaa atccatacgt acgtcgcata 360 aaacattaat aaaggcagac aaaacatcta atttatataa attatctaaa gatgaataca 420 acaaactgtt aacaaacgca attacttcta cctacaaaaa gtcgatgcca gaattaaaag 480 atcaaataat taaggaagga aaaaaccttt taatccacca tgaagtgtat aacaatattg 540 agataaacgg atcgtccggc tgttttttta cattaaaaga ccataaggat aattttatta 600 acaacccttc agttcgcctt ttaaaccctt cgaaaaacga ggtaggaaga attagtaaag 660 ttattttatc taaaattatt cacgaattaa aaaataaact acagttgaat caatggcaaa 720 gcacccattt tgttatcgat tggtttaaaa aaatcaaaaa taaacacctt tgcaaatttc 780 taatttttga tataaatgat ttttacccat ctattagtga aaacattctt accaacgcaa 840 tcagtttcgc cgaacaacat gttattataa aaaatgacga taaaaaaata atatatcatg 900 caagaaaatc tttaattttt aataacgacg aagcttggat aaagaaaaga ggcggattat 960 ttgatgtcac aatgggagca tttgatggag ctgaagtttg tgaactagtt ggtattttta 1020 tgttgtttca gctctctcaa cattacgata aaaataattt tggcttgtat cgtgacgacg 1080 ggcttgctat ttttgaaaac aaaagcggcc agcaaatgga aaaaatcaaa aagcatttta 1140 cacatatttt taaaagtaac aacctactta tctccatcca atgtaatatt aaaattgtca 1200 attttcttga tgtaacattg aacctgattg acagttctta tcaaccgtat tgtaaaccta 1260 ataaccaact tatgtatgtt cattcagagt caaaccatcc acctaatata ataaaagaaa 1320 tccctcgcac tattgagtta agattatcga atatggcagt aaatgaaact gtgttcaata 1380 actcaattat tccgtatgaa gaggctttac gtaaatcagg atacaattca aaactcacct 1440 accaaccgca aataaacaaa aatcaaaaaa gacatcgcaa acgtaaaatt atatggtaca 1500 accccccgtt tagcaaaaat gttgaaacta aaattggaaa tcgcttttta gctttaattg 1560 accaacactt tccagtagga cacaggcttc ataaaatttt taatcgaaac tatattaaag 1620 taagctatag ttgcatgcct aatgtcaagt ctttaataaa tacacataat aacaaaattt 1680 tacatagcga taaaaataaa acatcaaaaa aatgtaactg catagacaaa aatatttgtc 1740 cgttaaataa ccaatgcctt tttagcaata tcgtatacca agccactgta agttctggtg 1800 atcctcaatg caaagataaa gtttatttcg gtataagtga aaccacattt aaattaagat 1860 atgccaacca tcttaaatct ttcaatgcac ccaaatataa aaatgacact gaactttcca 1920 aagaaatatg ggagctaaaa tcgaaacata taaatcctgt gattacatgg aagataatag 1980 cacaatgtaa gccctacaat caagcatcaa aagtttgtaa tctttgtctc cgcgagaagt 2040 ttcacatatt atcatataaa ggagaaaact tacttaataa aagatatgaa ataatatcaa 2100 agtgccggca ttcaaaaaaa tttttactat ccttatttga taccggggac taaaaccttt 2160 gacgtctttt tgttttgttt tgttttgttt ggtaacgtca gaactcgttc caccgtaatt 2220 ttaatttgta atttttaaac ggtttttaag tgacgaaata cacaatggct gatgattgcc 2280 gaaaggcatg aaactcagag ttccatcaaa agttgttttt attcatttaa tataaataaa 2340 tatata 2346 // ID LIN4_SM repbase; DNA; INV; 5873 BP. XX AC . XX DT 09-FEB-2008 (Rel. 13.02, Created) DT 10-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon; consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN4_SM. XX NM LIN4_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5873 RA Jurka J.; RT "Non-LTR retrotransposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 162-162 (2008). XX DR [1] (Consensus) XX CC The 5' and 3' termini are approximate. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 307..1863 FT /product="LIN4_SM_1p" FT /translation="MAPDQTTELKTAETAPDITFNKRDKHIFKEINRIIEH FT KTKQTTDNLTSDFLKQVDEQQKIIIETQQMIRDLNRKLDYSSPRRIMTNDT FT KFTTSHSENPNPPTTPQITDPHSKNSKNISQIYPGFNVNDLEFTPTPAQLN FT PCYDIDDVLFTPTPNQDMNSPIKCHQPSLNNSIETLSEEDDDIDIHKEKTP FT KNRNRIVDLDAQETTPLETYSPLSHTIDSSQSLFSQPKKKLTQSIVMQYPE FT MFSANEAPDLNGHFELYFALEPNVDRHDVANPNNAMTSQPPIISTTTVEPT FT PSEPIKNILLSNEQLITPNTRSPFLLRNRDPIFKLGSEGYLKCVVSNCKTG FT RKIGPLMLDKQHNHMHSYHQKSIKPTDTFECLICNRNKIKPTNISLKNIEN FT HLTEVHKDREFPGTKEQYNDKKKILTFDMNEEGTLVCAYLGKKAQCDIYLT FT FNTDLNEINKHIISKHKKKTFDHENSVRCYCGMTIDIHHTNSHFISHKNNN FT TTIEIENDPPTHIPATNADIGTD" FT CDS 2646..5150 FT /product="LIN4_SM_2p" FT /translation="MTNILHRETPILSPEYRSHTKINAEEMEKNFSQLDTS FT SSPGNDKITYADWRKMDPVFEYLTELFNQIIKNGRSPTAWKTFRTKLIIKP FT GKEHAPHEVSSWRPLAILDTTYRFFASIINNRLLSWIGVNNLLSRNQKAVG FT TPDGCAEHNTVISLAKEWAIRNGSDINIVWLDLADAFGSIPHNLIWHTLSR FT LKLNNTTINLIKEMYTDCFSIYECENKKTKQIRVTNGVKQGCPMSMTLFSL FT SIDFIIKNILIDHPLVINKHNISIMAYADDIVLISKTRTQMREMIKDIIKY FT TDMATLKFRPSKCGYFQLKRNHTDTSLKLYNENIPIIGEDNIYKYLGVDFG FT QKGKHNIDDTLELAIKDTEKLFNSDLHPTQKIQAYKTYIHSRLIFVFRNCH FT INHMILDSNRNKIVQHREKQMGFDQKIKRKIKDTLQDIHQNINNNFIYANI FT KMGGLGIVPSIDEYFIQSIAHLIKILNSTDRGMRDFIIKELINITQGRFPN FT QMSDIDLSLKWLNTEIKGDKYVHKTIFSKFHKSIRRLNEKFNIIIKIILVE FT ERFELEIKGNNFATTIDIDSLKETSNILHDLTSEWYAEQWYLMTCQGHIAK FT TIGNNRQSAYLIKHNVLNDQQFYFLIKARNNMLSLNYNTHRVKESANTLCR FT LCNKEPETQAHIFNHCTQTCNARRNKHNNVMEKASEYLISKGFHVDVEKPP FT PGIETRLRPDLIIKSKRNKLIHILDIKVPYDHLVNFESAKEENYKKYKDLS FT LEIGKANNCHTTVSALVVGTLGSWDCGNNSTLAKIGLNRTEIKKLAKICMT FT TAVISSYRIYMDHVTNRINNISTQKLMPQALSA" XX SQ Sequence 5873 BP; 2268 A; 1304 C; 932 G; 1369 T; 0 other; gtcaacagat caaatgacta tcatgcaaat caaggtcaac agaatgacta caaatctaat 60 aataatggtc aaaagaatga ctatattcaa ttatcacctc tacctaacaa aattaggaaa 120 aatatagata aatgttccaa agaagcagat aggaaagccc caagagcccc gtgcaatcac 180 cacaatatta cccagaccaa aacaattaat aaaagagtcg acatcacgaa ccaatttaac 240 attgttttta gtccaccaaa tcacaaccct cctaccttaa taagcacaga cagaacagat 300 accccaatgg ccccagacca aacaaccgaa ctaaagacag cagaaacagc ccccgatata 360 acctttaata aaagagacaa acatatcttt aaagaaataa atagaataat agagcataag 420 actaaacaaa cgacagataa cctaacctca gactttctaa agcaggtaga tgaacaacaa 480 aaaattataa tagaaaccca acagatgata cgagacctaa ataggaaact agactacagt 540 agcccacgaa gaattatgac caatgataca aaattcacga cctcacactc tgaaaaccca 600 aatccaccaa ccactcccca aataactgac ccacactcca aaaattccaa aaacatctcc 660 caaatatacc ctggcttcaa tgtaaatgac ctagaattca ctccaacgcc cgcccaatta 720 aacccgtgct acgacataga tgacgtacta ttcactccaa ccccaaacca ggacatgaac 780 tctccaatta agtgccacca accttctctt aataactcca tagagacact atctgaagaa 840 gacgatgata tagacatcca caaagaaaaa acaccaaaaa atagaaatag gatagttgac 900 ctggacgccc aagagaccac acctttagaa acctactctc ccctatccca taccatagac 960 tcctcacaat ccctcttctc gcagcctaag aaaaaattaa cacagtcaat agtaatgcaa 1020 taccctgaaa tgttttccgc caacgaagca cctgacctta atggccactt cgagttatat 1080 tttgccttag aacccaatgt agacagacac gatgtagcca accccaacaa cgcaatgaca 1140 tctcaaccac caattatatc cactaccaca gtagaaccta ccccctccga acctatcaaa 1200 aatatacttc tttctaatga acaattaata acgcccaaca ccagatctcc attcctcctt 1260 cgaaatagag accccatctt caaactcgga agcgaagggt accttaagtg cgtggtaagt 1320 aactgcaaaa ctggcagaaa aataggacct ttaatgttgg acaaacaaca caaccacatg 1380 cactcttatc accaaaaatc cattaagccg actgacacat ttgaatgcct catatgcaat 1440 cgaaataaga taaaaccaac caatattagc cttaaaaaca tagaaaatca cctgacagaa 1500 gtccataaag acagagaatt tcccggcacc aaagagcagt ataacgataa aaagaaaata 1560 ctaacctttg acatgaatga ggagggtacg cttgtctgtg cttatttggg gaagaaggcg 1620 caatgtgata tatacctgac ctttaacact gacctaaatg aaattaataa acacattatc 1680 tctaaacata agaaaaaaac ctttgaccat gaaaatagtg taagatgcta ctgcggcatg 1740 acaatagaca ttcatcacac taactcccat tttatctctc ataagaataa taacaccacc 1800 atagagattg aaaacgaccc accgactcat atcccagcta ccaatgctga cattggcacc 1860 gactgactat ctcaacacca catctcgatc tttatgctta cctttgctac tctgccttca 1920 acaatcttca tagatagctt tataggtaag gcatacatta ataagaattt tgaaaccttc 1980 ttctcggcct tccccttcca tagataccct aactggaatg accttctttg gcctatgaac 2040 cctgaaaaaa acctatggat tctcttctat ataaataaaa cctctggcat agcaacaatt 2100 atcgacccga cctgcgaaaa cagcaccttg aaacaccaag caatgacctc atcaatagta 2160 tctgttctga ccaccctgca aaacatcatc gagaaaacca caattaaggt cactgaagta 2220 ccataccctc aatgccagtt aattaaagac tctagcttct acatatgcca ctttgcctca 2280 tgtctagtta agaacacccc cataacacta cccaacatag ggatgtctct gacattacta 2340 agtatagtac atttattaaa gatctaataa tacaaacaaa aactaataac ataacgcctg 2400 acgacagcct aaaagaattc tatagaatcc atgaagcgac taatcctaag aaatataaaa 2460 ccaagcacgc agctccgaat ttcctagtaa acaaagacaa gaagaaacta gtattcgagc 2520 tgcggcaagg ctttgatctt aggcaaaaag tcactatagg taggatactt aaccccttaa 2580 gcaatataag caccacaccc tcacttgacg acttaattaa taacttctca aaagaaccca 2640 aacatatgac caacatactg cacagagaaa caccaatcct atccccggaa taccgatcac 2700 acacgaagat caatgctgaa gaaatggaga aaaacttcag ccaactggac accagctcct 2760 ctccaggaaa tgacaaaatc acatatgctg actggagaaa aatggatcca gtcttcgaat 2820 acctgacgga gctattcaac cagatcatca agaatggacg aagcccaacg gcttggaaga 2880 ccttcagaac caaactgatc ataaaaccgg ggaaggaaca tgcaccgcat gaagtatcct 2940 cttggagacc tcttgccatc cttgacacaa cttacagatt ctttgcctct atcataaaca 3000 acaggctact gtcctggata ggggtgaaca atttgttaag taggaaccag aaagcagtgg 3060 gtactccgga tgggtgcgct gaacacaaca cagttatctc tctggcaaag gaatgggcaa 3120 tcaggaatgg gtcggatata aacatcgtgt ggctagatct cgccgatgct tttggaagca 3180 tcccccataa tctgatctgg catactttgt cgagattgaa actgaataac acaacaataa 3240 acctaattaa ggaaatgtac acagactgtt tctcaatata cgaatgtgaa aacaaaaaga 3300 caaaacagat tagggtgacc aatggagtta agcaaggatg tcccatgtcc atgacccttt 3360 ttagtctatc tattgacttc attatcaaaa atattttaat tgaccaccca ctagttatta 3420 ataaacacaa tataagcata atggcatacg cggatgatat agtattaata tccaaaacca 3480 gaacccagat gagagaaatg attaaagata ttattaaata caccgacatg gccactttaa 3540 aatttagacc aagcaagtgt ggatatttcc aactaaaacg caatcacact gacacctcac 3600 ttaaattata taatgaaaat attccgatca taggagaaga taacatatat aaatacttag 3660 gggtagactt tggccaaaag ggcaagcata atatagatga cacccttgag ttagccatta 3720 aagacacaga aaaacttttc aattctgacc tacacccaac ccaaaaaatc caagcctata 3780 agacatacat tcactcccga ctaatattcg tttttagaaa ctgccacatt aatcacatga 3840 tactagatag taataggaat aagatagtac aacataggga aaaacaaatg ggctttgatc 3900 aaaaaataaa acgtaaaatt aaagacaccc tacaagacat tcatcaaaat attaacaata 3960 actttatcta tgcaaacatt aaaatgggag gtttaggaat agtccctagt attgatgaat 4020 acttcataca aagtatagcc caccttatta aaatattaaa ctccacagat aggggaatga 4080 gagacttcat aattaaagaa ttaataaata ttacccaagg tagattcccc aaccaaatgt 4140 cagatatcga cctatcactt aaatggttaa atacggaaat taaaggggat aaatatgtgc 4200 acaaaacaat tttttcaaaa ttccataaat ccatacgtag gttaaatgaa aaatttaata 4260 ttattattaa aatcattcta gtagaagagc gctttgagct agaaattaag gggaataact 4320 ttgcgaccac tatagatata gattcgctta aggagacctc taatatactc catgacctca 4380 ctagcgagtg gtatgctgaa cagtggtacc taatgacttg tcaaggacac atagctaaga 4440 ctataggaaa caataggcaa tcagcatacc ttatcaaaca taatgtgctt aatgaccagc 4500 agttctactt tttgatcaaa gcccgtaata atatgctaag tctcaactat aatacccata 4560 gagttaagga aagcgcaaat acattatgta gactttgcaa taaggaacct gaaacacagg 4620 cacatatttt caaccactgt acccaaacat gcaatgctag gagaaataag cacaacaacg 4680 tgatggaaaa agctagtgaa tacttaatct ctaaagggtt ccatgtagac gtagaaaaac 4740 caccaccagg catcgaaacg cgactacgac cagacctaat tataaaatca aagagaaata 4800 agctaatcca catcctagac attaaggtac cctacgacca ccttgttaac ttcgaaagcg 4860 ctaaagagga aaattataaa aaatataaag acttgtcgct agaaataggt aaagctaata 4920 actgtcatac aacggtaagc gcactagtgg taggcacact agggtcatgg gattgcggaa 4980 ataatagtac cctagctaaa ataggattaa ataggacaga gatcaaaaaa ttagctaaaa 5040 tctgcatgac aactgcagta atatccagtt acagaatata catggatcat gtaaccaata 5100 ggataaataa tataagcaca caaaaattga tgcctcaagc actgagtgct tagaagcacc 5160 aaaaaatcta cttattgaaa tatcattcct tttgaaaaag agggatgata tttcactagt 5220 agatataata gaatagtaaa atgtacccac atgtacctag tggcatagca tgcttattag 5280 tgcatcccta aagatgaagg aagataaaaa atgtcacttg gggtatggaa tacttccaag 5340 tacattaata aaatattaac agtaatacaa aataaggttg ggttattaat gacacttgat 5400 gtcattaata accttgtgat acgaatatta ttaattgata aaagtgaggg tagaagcact 5460 ggatgctcgc tagcctacac tctcctaccc ataaaaagca aactaatggg tacgagacca 5520 cgcagctccc aatagagcac tggcacttag atgccactgc ctattattta gaagacaatg 5580 agatgatatt tagccaatat catcacatca gcagacataa ccaatcaaag gtatgtctgt 5640 gaagcatcgc taaatgatgc tcaaaattgg ctctgaactg gcatgggctg gaggagggaa 5700 tggcgggagt gctgtcagtt caagcgaagt gttgtgatgg cgacaactga tgatcctcaa 5760 ggggaacaca gtaacatcac taaatagtag aaacacaaaa gaccagcaga aacactcacg 5820 tttcagctta tccgaaagga gaggctggta aaaccaagct tggctcacta tgc 5873 // ID P-26_HM repbase; DNA; INV; 3995 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-26_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3995 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(12), 2079-2079 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 719..3409 FT /product="P-26_HM_1p" FT /translation="MVINCSVYNCKSRFDKSSLISFHRFPLNDEILCKKWA FT TATKCLQITPSKHSRICSKHFKLEDFECQENGKRVLKKNIVPSIFDFHESL FT NKNNKRNQLYNDDTSHIKKTICSISNKEVTELEITNTIKEKILKQNSPKKL FT LLRKKIKILKQKLKRKDKKIASLSKVIQNLEKKKIISIDVAELLENNFSGL FT TKELLKSELKNRGKKKKGYRYSEEVKKFALTLHFYSPRAYEFVKSTFSLPA FT VSSLSNWTSSVDCLPGFFGDVFKFLQNKTFEDSSYKDCALIFDSMHIRAAL FT RYDNVTGSYEGFSSFGNNILAYDPDKLATEALIFMVVGLKGHWKCPIGYIL FT CDKISSTDLSCLIKNALHLCATHNLNVWSITCDGLISNFSAMKNLGCSFGS FT KIKEMCGSFPHPEYDHPIYYIPDPCHMLKLARNALCSIGIFVDEQNQYIKW FT DHIKALHKLQEEEGFKFANKLGNYHIDFRRHKMNVKIAAQTLSGSVADAIE FT FLMVSGHPTFIDAQGTINFIRTLDKLFDLLNSKNLFEKGSKKPLFLNEAPK FT WIETIEKSVNYLVNLSDLSKTSLIKHRRKTFVLGFIVAATSVRDLSLNLLS FT RHENPFEFILTYKLSQDHVELLFACIRGKNGFNNNPDARQLKSALKRILLR FT NSIIGSKFSNCLTFDQKSSGSIFLLKWSKRRSPLVEQNIISSEDSNYFKDL FT AGCLSSVSTSIYKEAILGYIAGFIVRKLYSKITCLTCANSLISPKTLSTDH FT EYSIFCSSSWKSFTTFKNRGGLIFPSCSVLKIVEKCEHVFKVLVCGADPHN FT LKISSKKNLKTYLVHFINQKLANDALFTELNMHDLEHEILTEDMHSSQLLK FT KIIEKYLTLRLHSYGQLYTRNELHKGKIGLRQQSNKLVLFKGL*" XX SQ Sequence 3995 BP; 1472 A; 536 C; 574 G; 1412 T; 1 other; caatgatata tattaactag atgtctaact tccgaaaaaa agtgaagcac attttttttg 60 ccaaccagca tatgattggc taataacttg ttttgccaac cagcatctga ttggttaaca 120 aatattacgg aaagacaaar agcaaaaata aaatccaaac aaaaaatatt tgtgtcagtt 180 aaatcagaaa atgggtttaa aagttagtca ggttggaatg atattgtgag atgctttagt 240 attggataat tttataagtg taggttaaaa gtaacgtcag ttacttttaa ctttctatta 300 taaaaaataa ctgctaagct agtaatttta gtttaaagta aaaagacttt gaaccttcgg 360 aaaatatttt aagtatataa tttttaaaag ttaacatctt agataagttt ttatataatt 420 aattaagtta aaatatctgt tatagtatgt aaacttatcc tatatatttc tttataataa 480 aatacaatat ttctaagggt atatatatac cctcgttatt caagtatata tatatatata 540 tatatatata tatatatata tatatatata tatatatata tatatatatt caaatgatat 600 tatatatgat aaagtctggg ataatgtgtt tgagtgtgtg tgtatgtatt tataaaacat 660 aaaagatata tttttgaaaa caatactttt tattttttta gagtaacaaa taataaacat 720 ggttattaat tgttcagttt acaactgtaa aagcagattt gataaaagta gtttaatttc 780 ttttcacaga ttccctttaa atgatgaaat tttatgtaaa aaatgggcaa cagcaactaa 840 atgtttacaa attacaccta gtaaacatag tagaatctgc agtaaacatt ttaaattaga 900 agattttgag tgtcaagaga atgggaaacg tgtgctgaaa aaaaatattg tgccttcaat 960 atttgacttt catgaatcat tgaacaaaaa taacaagcgg aatcagctat ataatgatga 1020 tacatcacac attaaaaaaa caatttgttc tatctcgaat aaagaggtaa cagagcttga 1080 aattacaaat actattaagg aaaaaatact taaacaaaat tctccaaaaa aactattact 1140 tagaaaaaag attaaaatat taaaacaaaa gcttaaacga aaagataaaa aaattgcttc 1200 acttagcaaa gttattcaaa acttggagaa aaaaaaaatt atttccattg atgttgctga 1260 attacttgag aacaacttct ctggactgac taaagaactt ttgaaatctg aattaaaaaa 1320 tagaggcaaa aaaaaaaaag gttatagata ctctgaagaa gttaaaaaat ttgctttaac 1380 actgcacttt tattctccaa gagcatatga atttgttaaa tctacattct cattacctgc 1440 cgtaagctca ctttcaaact ggacctcatc tgtagattgt ttgcctgggt tttttggaga 1500 tgttttcaag tttttacaaa ataagacatt tgaggattca tcatacaaag attgcgcatt 1560 aatatttgat tctatgcata tcagggctgc tctgcggtat gataatgtta ctggatcata 1620 tgaaggtttt tctagttttg gaaacaatat tctagcctat gacccagata aattagcaac 1680 agaagcactt atttttatgg ttgttggatt gaaaggccat tggaaatgtc ctataggcta 1740 tatattgtgt gataaaattt cctcaacgga tttaagttgt ctaattaaaa atgctttaca 1800 tttatgcgct actcataatc tgaatgtttg gagtataaca tgtgatggac ttatttccaa 1860 ctttagtgca atgaagaatc ttgggtgttc attcggaagt aaaattaaag agatgtgtgg 1920 tagttttccc catcctgaat atgatcatcc tatatattat atccctgatc cttgtcatat 1980 gcttaaatta gctcgaaacg ctctgtgtag tatcggtatc tttgttgatg aacaaaatca 2040 atatataaag tgggaccata ttaaagcatt gcataaactg caagaagaag aaggatttaa 2100 gtttgcaaat aaacttggca attaccacat tgactttcgc agacataaaa tgaatgttaa 2160 aattgcagct caaacgctaa gtggttcagt tgctgatgct atagagtttt tgatggtctc 2220 aggtcatcca acttttattg atgcccaagg cacaattaat tttattagaa ctttagataa 2280 attattcgac ttgttaaact ctaaaaacct ttttgaaaaa ggatctaaga aacctttgtt 2340 tttgaatgaa gctccaaaat ggatagaaac catagaaaaa tctgtaaatt acttagttaa 2400 cttatcagac ctatctaaaa cttctttaat taaacacaga agaaagacat ttgtacttgg 2460 tttcattgta gcagccacaa gtgttagaga tttatccctt aaccttttat ctagacatga 2520 gaatcctttt gaatttattt taacatataa actgtcacaa gatcatgttg aattattatt 2580 tgcttgtatt cgaggcaaaa atggtttcaa taacaatcca gatgcacgtc agttaaaatc 2640 tgctttaaaa agaattttat taagaaactc tatcatagga tcaaaattta gcaactgcct 2700 tacttttgac caaaaatctt ctggctcaat ctttttatta aagtggagta aaaggagatc 2760 tcctcttgtt gagcaaaata taatatctag tgaggattct aactatttta aagaccttgc 2820 tggttgtctt tcatcagtct ctacttcaat ttataaagaa gctatacttg gatatattgc 2880 agggttcatt gtaagaaaac tttatagtaa aattacatgt ctgacttgtg ccaatagttt 2940 aatttctccg aagactttat ccactgatca tgagtattct atattttgca gttcatcatg 3000 gaaaagtttt acaactttta aaaatcgagg tggtttaatc tttccatcct gtagtgtttt 3060 aaagattgtt gaaaaatgtg agcacgtttt taaagttttg gtgtgtgggg cagaccccca 3120 taatttaaaa atatcttcta aaaaaaatct taaaacctat ctggtgcact ttataaatca 3180 aaaacttgca aatgacgcac tgtttactga actaaatatg catgatttag aacatgagat 3240 tttgactgaa gacatgcatt cttcacagct tttaaaaaaa attatagaaa aatatttaac 3300 attgcgtcta cactcatatg gtcagctgta cactagaaat gagttgcata aaggaaagat 3360 tggattacgg caacagtcca ataagcttgt tttgttcaaa ggtttatgaa aaattacaat 3420 aataatattc aatattattt gtaagctatt tatgccaaaa aataaaatta attctttaaa 3480 acaaaaattt tattatttct gaaaattcct taactgtaac tgttaactgt gtgttacagt 3540 tttttgaact aatttaaatt ttttaattga aaacctacat taaaaataaa aatttatacc 3600 tatttatatc agcaacactt ttgatttatt ttcataatgg aatatgtgtt cttttataac 3660 aataacttaa aggtttccaa cagggtactt ttaagttaca gttgtaaaag tcaaaaacat 3720 attttttgcc aacatatatt tatatttgcc aagtaaaaat gtaatatctt tgtattatat 3780 aataacttct aactttcttt atctacaaac gcttttaaat agctgttttt aaataacagt 3840 ttataaatag atatttttaa catttaaaaa gttaacatcg ccaaaatgtt tttttctttg 3900 ctgcggctac cggctttata accttcaaaa acttgcttca tattttctta cggaatttat 3960 acgaagttag acatctagtt aaatatatat cattg 3995 // ID DNA8-77_AP repbase; DNA; INV; 1253 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-77_AP. XX NM DNA8-77_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1253 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2013-2013 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1253 BP; 460 A; 170 C; 173 G; 448 T; 2 other; cagtggcgga tttaagggtt caaagaaagt cggcaaatca ataagagggg cctcctacaa 60 aatttatcag taacatactt acccccattt cacaaaaccg atcgcctcgt ttgcgagaaa 120 taatatcaac caatattcaa ttttatagta acaattaatc ttctaattaa aaaaaaaaaa 180 atgttttagt ttttatttat ttttacaaat aaaacaaaat taaacgtagg tatataaaat 240 gcctgtgtag aatgtagatn ataattaaaa tgtagcttga caaaaaacca atgcaaatat 300 aaaaaaaaag tgaagccata aaattataca taatattata aaggtttaat ttgagttaca 360 cacttttaaa tttcgtacta cacacttcac tttttgtact acacacaatt ttaggagctg 420 ctatatacac aaaagtcgat gataaaatta ccattttata ttataattca caaaaacaat 480 aatgatgtgt gtgtatgttt ntgtgggtga cggaccggtc ggctaggttg gcgagtgaat 540 tggtggtgtg gaggcaatta ctcgttgctt ctatttataa ttttatactt aatttaataa 600 taataatact tattttaaat ttcaatcaat actttatagt tatagtaata gttatatagc 660 agttgtaaga aacaatgtcc taacccgtta aatgtataaa cataataata tgaaaaacat 720 agtgcaaacg cattaaattc acagccctgt ttaccacatt tttcaatatt tatatgtacc 780 tacgttgcga atttatcgcg aaaacggttt ccaataaatc atgatttacg ccgctgatta 840 agtaataaaa agattgttat tatacttaca aaattattgt ttttattttc atgcaatcat 900 gaattatatt atatattaat ttataattat aaataatctt atttgtgatc aatggtagtg 960 ggtagaatgt agaataccta acttagattt acgtaggtat atataagtat ttacttcgat 1020 cttttctact ttatttaatt ataatatttt ctacttttga agtattccta tatatttaaa 1080 ttatttttat tattaaaaca atggtaaatt atattccagt ttgaaatata catttttttt 1140 ggttaaaatt taaaaaaaaa atcaacttta taatttaggt gggcctcagt gcacaattta 1200 ggagggggcc ccgttggcat ttgccgacta aacgaccgtt aaatccgcca ctg 1253 // ID Copia-18_DPu-LTR repbase; DNA; INV; 290 BP. XX AC scaffold_128; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_DPu_; KW Copia-18_DPu-LTR; Copia-18_DPu-I. XX NM Copia-18_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-290 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 700-700 (2010). XX DR Genome; scaffold_128; Positions 321989 321700. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 290 BP; 85 A; 70 C; 52 G; 83 T; 0 other; tgttggaagc agacattcag ccatctagcg gtcaacagac aacttcatcc atatatgata 60 ggaggatgcc gtttagcctc acccgaggct tctctacgaa gcgtctctct gtctggtacc 120 aaccacgctg attgtgatca accctcgttt atattcactg gaattcttag gtaaatgtcg 180 agaagtttat cacttattta ttacttgtat gcaacgcaca tgtacacaaa tacagacctc 240 tatgagagga atcacaaacc ccaagtaata ttgtcgtatt cattccaaca 290 // ID Rehavkus-3_TC repbase; DNA; INV; 18501 BP. XX AC . XX DT 30-APR-2006 (Rel. 11.04, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed Rehavkus-3_TC DNA transposon - a DE consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus-3_TC; Rehavkus group. XX NM Rehavkus-3_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-18501 RA Kapitonov V.V., Gentles A.J. and Jurka J.; RT "Rehavkus-3_TC, a family of Rehavkus DNA transposons from the red RT flour beetle genome."; RL Repbase Reports 6(4), 196-196 (2006). XX DR [1] (Consensus) XX CC Rehavkus-3_TC belongs to the Rehavkus group of MuDR "cut and CC paste" DNA transposons. Transposons from this group are CC widespread in different metazoa, including insects, sea squirts, CC sea urchin and fish. The beetle genome harbors several ~95% CC identical copies of Rehavkus-3_TC. Its 709-bp inverted termini CC are composed of a 283-bp terminal inverted repeat and 167-bp CC subterminal minisatellite-like unit. In the 3' terminal portion, CC the minisatellite unit covers ~11-kb region. The transposon is CC flanked by 9-bp target site duplications and encodes a 1200-aa CC Rehavkus-3_TC transposase. Its C-terminal portion contains the CC PHD finger and Ulp1 cysteine protease. XX FH Key Location/Qualifiers FT CDS 1193..4797 FT /product="Rehavkus-3_TCp" FT /translation="MEAVVYSSSLNFCHPYKITSFVKGVERDLTRNKLMKT FT RPLIFRKDSIRTANMELLTVAGNYQQIKSESVIRKARSEALNLRKRFNDDN FT LDLTSMQKSHAEYIKQVSFPLNIKIFSNEQLLVLQNSIDFNLYFDATGSVV FT RNQSDPKSTVYYYAGVVVTKNHRVCPVLEMVSSEHATNNIQDLFVRFINFC FT KESSMSNIPFETVVTDFSFANIHAVCLSFNEMTLNQYLSTCFDLAIHHKSV FT PKKIKIIRLCAAHFMKNICTHLNEFVGQSKKSHHKSFIKELIACAFNLTTF FT EDVKFWFENACIILMYETYNDAVQIALHNISSIANSLSTNLTKNIGDDDFG FT IDHSVSVSEDFISHTKLRKNSPFYKEFLKIFNAVVAKEEKHNNSNIYCCKE FT FMNYILDRYIPFLPMWTLIIHTKHSERVTNNAIERYFGILKNEVLERQTYL FT APSLFIRHSRDYVLSIYREVKYNIGKEGLARAPKTKKDLSNGTIQSKRLKK FT SPSSMVTDDKVFILSENDKISVASSFGSAGTIESQSDQIYSHKETWKSKPV FT NKRASYFRGTHLKRQLFADKLDTIQEVSEVPEIIDVDKVPDINVMTPVAEK FT SFQINFLPNGLINSIEYYLNNFTVGYRDFIVARFDNALGDKCLNNTEFKKI FT SKHTLTNKVANILNSVLIKKYQLHAATIWSVQRSRMAIVLQKFSEKMPILR FT NITYMILNEVEDKWVLLKIDIREREVGVFKPFGLKEIELQSYLTQFVNFVK FT QYNTKKRNLSTFKMIPEDGWRMSAMMSEYTLDPLCSTANGLMVYWILTKLV FT SKQSKPLDDSRSFDHEKFIKEIQTLLIHFSDDMTNNCLICGKIELDHTVNW FT IRCNTCGRWFHTHHDDINTSQNLDSSKATFACLLCQKFFNIESSTHEIVKS FT HLRENALMKNVDYYVECKAMGKTYTVAKYYNYGIINVLSVEDYQDLHGQQW FT LSSMLIDQYLNLYYKVNCLTKEWFIDYLILESKKSEFIFFNEELQKLPRSF FT FNELPRLAGKKVILIPILRSSHFTLAVVDFENKTFHYLNPMKSNTDHTSQM FT FIRFKKFISEYKKYCSFNADLECFQLKTVEHDYQTDTNNCGIFVLFYIEAI FT LNKNSLKQMPCTPQQYRINVKEFLLENSDDMQHVCMGCAKEVVTSEENFLK FT CTFCLRLTHTYCLYAPWTIQNVCHLCVRNYNPKTSQSAVGT*X" XX SQ Sequence 18501 BP; 5693 A; 2456 C; 3212 G; 5382 T; 1758 other; atttgtgtga agacagctcc caaaaccagc tcctcgaatt tccgaagaga cccgaacatg 60 aagccagttt cagactagca aaataagttt cagtttgctt agatgtatgt ttggttgtca 120 gctttgaaac cgtttaataa aataaaaatc cgaaaaaaca ctaaacaccc tcttaaaaat 180 gacattttaa gaaagtcact atgctaatag cgaaaaactc aattatttac cacaaattgc 240 tttttaattt ttcccacttt ttcacctaat ttcaatattt atggctccgc aaatgcatta 300 ttgtagtttt cgccccattt caccccgaaa tcggttttgt cacacaatgc ccttttgacc 360 actaattcaa caataaaaaa gacgtttttt aaaaataaaa ttatggcctt ttagttgtcg 420 gaacaatgtt aaaaattgtt atgttgccca gctccgcaaa tgcactgttg cagttttcgc 480 cccatttcac cccgaaatcg gttttgtcac acaatgccct tttgaccact aattcaacaa 540 taaaaaagac gtttttaaaa ataaaattat ggccttttag ttgtcggaac aatgttaaaa 600 ttgttatgtt gcccagctcc gcaaatgcac tattgcagtt ttcgccccat ttcaccccga 660 aatcggtttt gtcacacaat gcccttttga ccaccaattc aacaacaaat gtagtcctgt 720 tgatgatgtt cctttaagcg aagacactcc atattttgaa ttgaataaaa cactagatgt 780 tacacgtgaa gacattgaag ttcctttagg actttcggat ctttcttttg taaaagccga 840 tgaaaagcgc cgcctattac cattgggtac taaaaatgtt gagccaattt tgcacactcc 900 tgtcagtcgg aggaaatacg attgctgcaa ttttttagaa ggaaaatttg caatcacaaa 960 agttgtttgg aaattaattt ttgaagatgg tcaatttcat cctagaaatt atccatatta 1020 ctttcgcaat agaatagcta aatacgttaa taacacgtgt aatattgctt tttataaatg 1080 gcgcagtcgt aaaaatgaca cttttgtttt atatgcctac tgcagacata acaactcaaa 1140 atgcaaaaat tttaaaattg aaataaaaaa aaatattaaa ttagatcaat acatggaagc 1200 cgtcgtgtat tcatcttctc ttaacttttg ccatccatac aaaattacaa gttttgttaa 1260 gggtgttgaa agagatttaa cacgaaacaa gctcatgaaa acgcgaccgc ttatttttcg 1320 aaaagattca attcgtacag ctaatatgga acttctgaca gtagctggta attatcagca 1380 aataaaatct gagtcggtga ttagaaaagc aaggtcagaa gctttaaatt taagaaaacg 1440 tttcaacgat gataatttgg atttaacttc gatgcagaaa tcacacgctg aatacattaa 1500 acaggtctca tttccattaa atataaaaat ttttagtaat gaacaattgc tagtattaca 1560 aaattcgata gacttcaatt tatacttcga tgctacaggt tcggttgttc gaaatcagtc 1620 tgatcctaag agtacagttt attattacgc tggtgtcgtt gttactaaaa atcatcgagt 1680 gtgtccagtt ctagaaatgg tatcgtcgga acacgcaaca aataatattc aggacctctt 1740 cgtgaggttt ataaacttct gtaaagaaag tagcatgtca aatataccct tcgaaactgt 1800 tgttactgat ttcagtttcg caaatattca cgcggtatgt ctgtctttta atgaaatgac 1860 gctcaatcaa tatttgtcca cgtgttttga tctagccatt caccataagt ccgtcccaaa 1920 aaaaattaaa ataatccgtt tatgtgcagc tcattttatg aaaaatattt gcacgcattt 1980 aaatgaattt gttggtcaat ctaagaagtc tcatcataaa tcgttcatca aggaattaat 2040 tgcatgtgct ttcaacttaa ctaccttcga agatgttaag ttttggtttg aaaatgcttg 2100 cattattcta atgtatgaaa cttataacga tgccgtgcaa atcgcattgc ataacatttc 2160 aagtattgct aactctcttt ctacgaattt aacaaaaaat attggtgacg acgattttgg 2220 tattgaccac agcgtttctg tttcggaaga ttttatatct catacaaaat taagaaaaaa 2280 tagtccattt tacaaagaat ttttaaaaat tttcaatgct gttgttgcca aggaagaaaa 2340 acataataat tcaaacatct actgttgtaa ggaatttatg aattatattc tagataggta 2400 tattcctttt ttaccaatgt ggacgcttat aatccataca aaacattcag aaagagtgac 2460 aaataatgca attgaaagat attttggaat tttgaagaat gaagtattgg aaagacaaac 2520 atacttggca ccctctttat ttattcgaca ttcgagggat tacgtactta gtatttatcg 2580 tgaagttaag tataatatag gaaaagaggg tttggccaga gctccaaaaa caaaaaaaga 2640 tttaagtaat ggtactatac aatccaaaag gctgaaaaaa tcaccatcat caatggtgac 2700 tgatgataaa gtctttatat tgagtgaaaa tgataaaatc agtgtcgctt cttcatttgg 2760 aagtgctggt actattgaaa gtcaatcaga ccaaatttat tctcacaagg aaacttggaa 2820 aagtaaacct gtaaataaac gggcatctta ctttcgagga actcatttaa agagacaatt 2880 atttgctgat aaattggata ctatccaaga agtctcagaa gtacccgaaa taattgatgt 2940 ggataaagtt cccgacataa atgtaatgac acctgtcgcc gaaaaatcgt ttcaaataaa 3000 tttcttgccc aatggcttaa taaactccat cgagtattat ttaaataatt ttactgtagg 3060 ctacagggac tttatagtgg cacggtttga caatgctttg ggagacaaat gtttaaataa 3120 tacggaattc aaaaaaattt caaaacatac tctcactaat aaagttgcta atattttaaa 3180 ttcagtttta ataaaaaaat atcagctaca cgcggccact atatggtctg ttcaaaggag 3240 tcgaatggct atagtgctgc aaaaattttc tgaaaaaatg cctatcttac gtaatattac 3300 atacatgata ttaaacgaag tggaagacaa atgggtttta cttaaaattg acattagaga 3360 aagagaagta ggagttttta aaccatttgg tttaaaagaa attgaattgc aaagttatct 3420 tacgcagttt gtaaattttg taaagcaata taacactaaa aaaaggaatt taagcacttt 3480 taaaatgatt cctgaagatg gatggcgtat gtcggcaatg atgagcgaat acactttgga 3540 tcctttgtgc tcaactgcaa acggtttaat ggtatattgg attttaacaa aattggtttc 3600 aaaacaatca aaaccgttag atgattcaag gtcatttgat cacgaaaaat ttatcaaaga 3660 aattcaaaca cttcttattc acttttctga cgatatgaca aataactgtt tgatttgtgg 3720 aaaaatagaa ttggatcata cagtaaattg gatccgttgt aacacttgtg gcagatggtt 3780 tcatactcat catgacgaca tcaacacaag tcaaaattta gattcgtcta aagccacatt 3840 cgcctgcttg ttgtgtcaaa aattttttaa tattgaatca tccactcatg aaatcgtaaa 3900 atctcatctt cgcgaaaatg ctctaatgaa aaacgttgat tattatgttg aatgtaaagc 3960 aatgggtaaa acatataccg ttgccaagta ttataactac ggtataatta atgttttatc 4020 agtagaagat taccaggatc ttcatggtca gcaatggtta agtagcatgc taattgatca 4080 atatttaaat ttgtattata aggttaattg cttaaccaag gaatggttca tagactattt 4140 gattttggag tcaaaaaaaa gtgaattcat tttttttaat gaggaactac aaaagttacc 4200 aagaagtttt ttcaacgagc tacctaggct ggcgggaaaa aaagtaattt taattccaat 4260 tctcagaagt tcccatttta cactggcagt agtggatttt gaaaataaaa cgttccacta 4320 tttgaatcca atgaaatcca atacagatca cacatctcaa atgtttattc gatttaaaaa 4380 gtttatatca gaatataaaa aatattgcag ttttaacgct gatctagaat gttttcaatt 4440 aaagacggtt gaacatgact accaaactga tacaaacaat tgtggtattt ttgttctctt 4500 ttatattgag gcaattttaa ataaaaactc attaaaacaa atgccatgta caccacaaca 4560 atatagaata aacgtgaagg agtttttgtt ggaaaattca gatgacatgc aacatgtttg 4620 tatgggatgt gccaaagaag ttgtaacttc tgaagaaaat tttctaaaat gtacattttg 4680 tttgagacta actcatacat actgtcttta tgctccttgg acaattcaaa atgtgtgtca 4740 tttatgtgta agaaattata atccaaaaac atctcaaagt gctgttggca cttaaaaatt 4800 tacaacacat acataaaatt gagaaattta aaagtaacat aattttgtta tgtacttaaa 4860 aaaatccggt ccacagcagg aggattctat agaaaatata ggatccgaat gtttataaaa 4920 tatacctacc gaacctaggg cttatttata gaaacttttt gtttaaatga aacctttaca 4980 tcatgactac acatttttag tgtaatattc ccgtttatat gatcagtact ttatattatt 5040 agatattttt ttcgataaat agctctatta gtattgtacg agtaggtatc agcactgaaa 5100 tattttctgg gtgttatcag cactgaaata ttttctgggt gttatcagca ctgaactact 5160 ttttcggtgt tcgctacact gaaaagtgtt ctgggtgttt tttggaccgg atgataaagt 5220 gcataatttt gagaattatg aaaaaactct aaattctaat attaaaaaga tgtttctttg 5280 atccttgatg tctgtaatgg tttaacatgt aacaaagatg gttgtaacga atacaaggta 5340 tgttgaaata aacgcaataa acttccaatt ataaatataa tgtttttgtt ttacagcatt 5400 tatgttccac ttcggattaa ttcaatcgac agccaggctc taaaggttta ccacgaaaaa 5460 tgagaaaaaa caaaagtggt tatcacgaat acaaggtatt cgtatgttga aataaactca 5520 ataaatttcc aattataaat acaatctttt tgttttacag tatttatgtt ccacttcgga 5580 ttaactcaat cgacagccag gctctaaagg tttaccacga aaaatgagaa aaaacaaaag 5640 tggttatcac gtatacaagg tacttgtatg ttgaaataaa ctcaataaat ttccaattat 5700 aaatacaatc tttttgtttt acagtattta tgttccactt cggattaact caatcgacag 5760 ccaggctcta aaggtttacc acgaaaaatg agaaaaaaca aaagtggtta tcacgtatac 5820 aaggtactcg tatgttgaaa taaactcaat aaatttccaa ttataaatac aatctttttg 5880 ttttacagta tttatgttcc acttcggatt aactcaatcg acagccaggc tctaaaggtt 5940 taccacgaaa aatgagaaaa aacaaaagtg gttatcacgt atacaaggta ctcgtatgtt 6000 gaaataaact caataaattt ccaattataa atacaatctt tttgttttac agtatttatg 6060 ttccacttcg gattaactca atcgacagcc aggctctaaa ggtttaccac gaaaaatgag 6120 aaaaaacaaa agtggttatc acgaatacaa ggtactcgta tgttgaaata aactcaataa 6180 atttccaatt ataaatacaa tctttttgtt ttacagtatt tatgttccac tccgaatcaa 6240 ctcaaccgat agtcagtaat tcgacaactt caagttctcc aagattctcg aagtttaatc 6300 gtcttaaatt tggtgcaaat ttttaaattt gttgttaaag ctataatttc gttattcaga 6360 aatagtattt agtagtcctt atcctctaag atataattag tctattgtgt ttgtatatct 6420 tgtggtaact gatatctatt gaaaagaaac ccgaatttat tcgaacaaat ttttacccgc 6480 acattataac tttatattta attatgatta tgtttttttg atattcgcta ttccgcttgt 6540 aagattcgaa aaaagttaaa ttttctgcaa aaaatctaag cacagttgtt ttctgtccta 6600 agatcttacc tggtcggcag agacagccag tcttttatgt tttattgaaa gggcgcatga 6660 gtcttaacac ttttacaaaa accgtcataa tttcacagat aaggtttatg aaggttgcat 6720 aatcttagaa ggtaaggttt actaattact acactatttt tgttttgaaa gaaggcgttg 6780 tgtaaaacat tctgtattta tgtttaaatt tgaaaggaaa tgtgcaatat aagtttgtta 6840 cttgatttgt tactttattt aattggatag taaattttgt gttgtagtcg acaatactta 6900 aaaaatatag ttttacttgg attataattg ctttaaattt ttcttttaag aaagnnnnnn 6960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnaggcca taattttatt 7020 tttgaaaaac gtctttttta ttgttgaatt ggtggtcaaa agggcattgt gtgacaaaac 7080 cgatttcggg gtgaaatggg gcgaaaactg caatagtgca tttgcggagc tggacaagat 7140 aaattttttt aacattgttc cgacaactaa aaggccataa ttttattttt gaaaaacgtc 7200 ttttttattg ttgaattggt ggtcaaaagg gcattgtgtg acaaaaccga tttcggggtg 7260 aaattgggcg aaaactgcaa tagtgcattt gcggagctgg acaagataaa tttttttaac 7320 attgttccga caactaaaag gccataattt tatttttgaa aaacgtcttt tttattgttg 7380 aattggtggt caaaagggca ttgtgtgaca aaaccgattt cggggtgaaa tggggcgaaa 7440 actgcaatag tgcatttgcg gagctggaca agataaattt ttttaacatt gttccgacaa 7500 ctaaaaggcc ataattttat ttttgaaaaa cgtctttttt attgttgaat tggtggtcaa 7560 aagggcattg tgtgacaaaa ccgatttcgg ggtgaaattg ggcgaaaact gcaatagtgc 7620 atttgcggag ctggacaaga taaatttttt taacattgtt ccgacaacta aaaggccata 7680 attttatttt tgaaaaacgt cttttttatt gttgaattgg tggtcaaaag ggcattgtgt 7740 gacaaaccga tttcggggtg aaatggggcg aaaactgcaa tagtgcattt gcggagctgg 7800 acaacataac aatttttaac attgttccga caactaaaag accataattt tccttttaaa 7860 aaacgtcttt tttattgtta aattgggggt ggggaagggc attgagtgac aaaaccgatt 7920 tcagggtgaa atggggcgaa aactgcaata atgtatttgc ggagctggaa aacataacaa 7980 tttttaacat tgttccgaca actaaaaggc cataatttta tttttaaaaa acgtcttttt 8040 tattgttgaa ttggtggtca aaagggcatt gtgtgacaaa accgatttcg gggtgaaatg 8100 gggcgaaaac tgcaataatg catttgcgga gctggacaac ataacaattt ttaacattgt 8160 tccgacaact aaaaggccat aattttattt ttaaaaaacg tcttttttat tgttaaattg 8220 gtggtcaaaa gggcattgtg tgacaaaacc gatttcgggg tgaaatgggg cgaaaactgc 8280 aataatgcat ttgcggagct ggaaaacata acaattttta acattgttcc gacaactaaa 8340 aggccataat tttcttttta aaaaacgtct tttttattgt tgaattggtg gtcaaaaggg 8400 cattgtgtga caaaaccgat ttcggggtga aatggggcga aaactgcaat aatgcatttg 8460 cggagctgaa caacataaca atttttaaca ttgttccgac aactaaaaga ccataatttt 8520 atttttaaaa aacgtctttt ttattgttaa attggtggtc aaaagggcat tatgtgacaa 8580 aaccgatttc ggggtgaaat ggggcgaaaa ctgcaataat gcatttgcgg agctggaaaa 8640 cataacaatt tttaacattg ttccgacaac taaaaggcca taattttatt tttaaaaaac 8700 gtctttttta ttgttgaatt ggtggtcaaa agggcattat gtgacaaaac cgatttcggg 8760 gtgaaatggg gcgaaaactg caataatgca tttgcggagc tggaaaacat aacaattttt 8820 aacattgttc cgacaactaa aaggccataa ttttattttt aaaaaacgtc ttttttattg 8880 ttgaattggt ggtcaaaagg gcattatgtg acaaaaccga tttcggggtg aaatggggcg 8940 aaaactgcaa taatgcattt gcggagctgg aaaacataac aatttttaac attgttccga 9000 caactaaaag gccataattt tatttttaaa aaacgtcttt tttattgttg aattggtggt 9060 caaaagggca ttatgtgaca aaaccgattt cggggtgaaa tggggcgaaa actgcaataa 9120 tgcatttgcg gagctggaaa acataacaat ttttaacatt gttccgacaa ctaaaaggcc 9180 ataattttat ttttaaaaaa cgtctttttt attgttgaat tggtggtcaa aagggcattg 9240 tgtgacaaaa ccgatttcgg ggtgaaatgg ggcgaaaact gcaataatgc atttgcggag 9300 ctggaaaaca taacaatttt taacattgtt ccgacaacta aaaggccata attttatttt 9360 taaaaaacgt cttttttatt gttgaattgg tggtcaaaag ggcattgtgt gacaaaaccg 9420 atttcggggt gaaatggggc gaaaactaca ataatgcatt tgcggagctg aacaacataa 9480 caatttttaa cattgttccg acaactaaaa ggccataatt ttatttttaa aaaacgtctt 9540 ttttattgtt gaattggtgg tcaaaagggc attatgtgac aaaaccgatt tcggggtgaa 9600 atggggcgaa aactgcaata gtgcatttgc ggagctggaa aacataacaa tttttaacat 9660 tgttccgaca actaaaaggc cataatttta tttttaaaaa acgtcttttt tattgttgaa 9720 ttagtggtca aaagggcatt gtgtgacaaa accgatttcg gggtgaaatg gggcgaaaac 9780 tgcaatagtg catttgcgga gctggaaaac ataacaattt ttaacattgt tccgacaact 9840 aaaaggccat aattttattt ttaaaaaacg tcttttttat tgttgaattg gtggtcaaaa 9900 gggcattgtg tgacaaaacc gatttcgggg tgaaatgggg cgaaaactgc aacagtgcat 9960 ttgcggagct ggacaagata aattttttta acattgttcc gacaactaaa aagccataat 10020 tttattttta aaaaacgtct tttttattgt tgaattagtg gtcaaaaggg cattgtgtga 10080 caaaaccgat ttcggggtga aatggggcga aaactgcaat aatgcatttg cggagctgga 10140 aaacataaca atttttaaca ttgttccgac aactaaaagg ccataatttt atttttaaaa 10200 aacgtctttt ttattgttga attggtggtc aaaagggcat tgtgtgacaa aaccgatttc 10260 ggggtgaaat ggggcgaaaa ctacaataat gcatttgcgg agctgaacaa cataacaatt 10320 tttaacattg ttccgacaac taaaaggcca taattttatt tttaaaaaac gtctttttta 10380 ttgttgactt ggtggtcaaa agggcattat gtgacaaaac cgatttcggg gtgaaatggg 10440 gcgaaaactg caataatgca tttgcggagc tggaaaacat aacaattttt aacattgttc 10500 cgacaactaa aaggccataa ttttattttt aaaaaacgtc ttttttattg ttgaattggt 10560 ggtcaaaagg gcattatgtg acaaaaccga tttcggggtg aaatggggcg aaaactgcaa 10620 taatgcattt gcggagctgg aaaacataac aatttttaac attgttccga caactaaaag 10680 gccataattt tatttttaaa aaacgtcttt tttattgttg aattggtggt caaaagggca 10740 ttgtgtgaca aaaccgattt cggggtgaaa tggggcgaaa actacaataa tgcatttgcg 10800 gagctgaaca acataacaat ttttaacatt gttccgacaa ctaaaaggcc ataattttat 10860 ttttaaaaaa cgtctttttt attgttgaat tggtggtcaa aagggcatta tgtgacaaaa 10920 ccgatttcgg ggtgaaatgg ggcgaaaact gcaataatgc atttgcggag ctggaaaaca 10980 taacaatttt taacattgtt ccgacaacta aaaggccata attttatttt taaaaaacgt 11040 cttttttatt gttgaattgg tggtcaaaag ggcattgtgt gacaaaaccg atttcggggt 11100 gaaatggggc gaaaactaca ataatgcatt tgcggagctg aacaacataa caatttttaa 11160 cattgttccg acaactaaaa ggccataatt ttatttttaa aaaacgtctt ttttattgtt 11220 gaattggtgg tcaaaagggc attatgtgac aaaaccgatt tcggggtgaa atggggcgaa 11280 aactgcaata gtgcatttgc ggagctggaa aacataacaa tttttaacat tgttccgaca 11340 actaaaaggc cataatttta tttttaaaaa acgtcttttt tattgttgaa ttggtggtca 11400 aaagggcatt gtgtgacaaa accgatttcg gggtgaaatg gggcgaaaac tgcaataatg 11460 catttgcgga gctggaaaac ataacaattt ttaacattgt tccgacaact aaaaggccat 11520 aattttattt ttaaaaaacg tcttttttat tgttgaattg gtggtcaaaa gggcattatg 11580 tgacaaaacc gatttcgggg tgaaatgggg cgaaaactgc aataatgcat ttgcggagct 11640 ggaaaacata acaattttta acattgttcc gacaactaaa aggccataat tttattttta 11700 aaaaacgtct tttttattgt tgaattggtg gtcaaaaggg cattatgtga caaaaccgat 11760 ttcggggtga aatggggcga aaactgcaat agtgcatttg cggagctgga aaacataaca 11820 atttttaaca ttgttccgac aactaaaagg ccataatttt atttttaaaa aacgtctttt 11880 ttattgttga attggtggtc aaaagggcat tgtgtgacaa aaccgatttc ggggtgaaat 11940 ggggcgaaaa ctgcaataat gcatttgcgg agctggaaaa cataacaatt tttaacattg 12000 ttccgacaac taaaaggcca taattttatt tttaaaaaac gtctttttta ttgttgaatt 12060 ggtggtcaaa agggcattgt gtgacaaaac cgatttcggg gtgaaatggg gcgaaaactg 12120 caataatgca tttgcggagc tgaacaacat aacaattttt aacattgttc cgacaactaa 12180 aaggccataa ttttattttt aaaaaacgtc ttttttattg ttgaattggt ggtcaaaagg 12240 gcattgtgtg acaaaaccga tttcggggtg aaatggggcg aaaactgcaa taatgcattt 12300 gcggagctga acaacataac aatttttaac attgttccga caactaaaag gccataattt 12360 tatttttaaa aaacgtcttt tttattgttg aattggtggt caaaagggca ttgtgtgaca 12420 aaaccgattt cggggtgaaa tggggcgaaa actgcaataa tgcatttgcg gagctggaaa 12480 acataacaat ttttaacatt gttccgacaa ctaaaaggcc ataattttat ttttaaaaaa 12540 cgtctttttt attgttgaat tagtggtcaa aagggcattg tgtgacaaaa ccgatttcgg 12600 ggtgaaatgg ggcgaaaact gcaataatgc atttgcggag ctggaaaaca taacaatttt 12660 taacattgtt ccgacaacta aaaggccata attttatttt taaaaaacgt cttttttatt 12720 gttgaattgg tggtcaaaag ggcattgtgt gacaaaaccg atttcggggt gaaatggggc 12780 gaaaactgca acagtgcatt tgcggagctg gacaagataa atttttttaa cattgttccg 12840 acaactaaaa agccataatt ttatttttaa aaaacgtctt ttttattgtt gaattagtgg 12900 tcaaaagggc attgtgtgac aaaaccgatt tcggggtgaa atggggcgaa aactgcaata 12960 atgcatttgc ggagctggaa aacataacaa tttttaacat tgttccgaca actaaaaggc 13020 cataatttta tttttaaaaa acgtcttttt tattgttgaa ttggtggtca aaagggcatt 13080 gtgtgacaaa accgatttcg gggtgaaatg gggcgaaaac tgcaatagtg catttgcgga 13140 gctgggcaac ataacaattt ttaacattgt tccgacaact aaaaggccat aattttattt 13200 ttaaaaaacg tcttttttat tgttgaatta gtggtcaaaa gggcattgtg tgacaaaacc 13260 gatttcgggg tgaaatgggg cgaaaactgc aacagtgcat ttgcggagct ggacaagatt 13320 aattttttga acattgttcc gacaactaaa aggccataat tttatttttg aaaaacgtct 13380 tttttattgt tgaattggtg gtcaaaaggg cattgtgtga caaaaccgat ttcggggtga 13440 aatggggcga aaactgcaat agtgcatttg cggagctgga caagataaat ttttttaaca 13500 ttgttccgac aactaaaagg ccataatttt atttttgaaa aacgtctttt tattgttgaa 13560 ttggtggtca aaagggcatt gtgtgacaaa accgatttcg gggttaaatg gggcgaaaac 13620 tgcaatagtg catttgcgga gctgggcaac ataacaattt ttaacattgt tccgacaact 13680 aaaaggccat aattttattt ttgaaaaacg tcttttttat tgttgaattg gtggtcaaaa 13740 gggcattgtg tgacaaaacc gatttcgggg tgaaatgggg cgaaaactgc aatagtgcat 13800 ttgcggagct gggcaacata acaattttta acattgttcc gacaactaaa aggtcataat 13860 tttattttta aaaaacgtct tttttatagt tgaattggtg gtcaaaaggg cattgtgtga 13920 caaaaccgat ttcggggtga aatggggcga aaactgcnnn nnnnnnnnnn nnnnnnnnnn 13980 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14040 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14100 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14160 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14220 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14280 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14340 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14400 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14460 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14520 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14580 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14640 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14700 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14760 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14820 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14880 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14940 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15000 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15060 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15180 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15420 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15540 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15660 nnnnnaaccg atttcgggtg aaatgggcga aactgcaata atgcatttgc ggagctggaa 15720 aacataacaa ttttaacatt gtccgacaac taaaaggcca taattttatt tttaaaaaac 15780 gtctttttta ttgttgaatt ggtggtcaaa agggcattgt gtgacaaaac cgatttcggg 15840 gtgaaatggg gcgaaaacta caataatgca tttgcggagc tgaacaacat aacaattttt 15900 aacattgttc cgacaactaa aaggccataa ttttattttt aaaaaacgtc ttttttattg 15960 ttgaattggt ggtcaaaagg gcattatgtg acaaaaccga tttcggggtg aaatggggcg 16020 aaaactgcaa taatgcattt gcggagctgg aaaacataac aatttttaac attgttccga 16080 caactaaaag gccataattt tatttttaaa aaacgtcttt tttattgttg aattggtggt 16140 caaaagggca ttatgtgaca aaaccgattt cggggtgaaa tggggcgaaa actgcaataa 16200 tgcatttgcg gagctggaaa acataacaat ttttaacatt gttccgacaa ctaaaaggcc 16260 ataattttat ttttaaaaaa cgtctttttt attgttgaat tggtggtcaa aagggcatta 16320 tgtgacaaaa ccgatttcgg ggtgaaatgg ggcgaaaact gcaataatgc atttgcggag 16380 ctggaaaaca taacaatttt taacattgtt ccgacaacta aaaggccata attttatttt 16440 taaaaaacgt cttttttatt gttgaattgg tggtcaaaag ggcattgtgt gacaaaaccg 16500 atttcggggt gaaatggggc gaaaactgca ataatgcatt tgcggagctg gaaaacataa 16560 caatttttaa cattgttccg acaactaaaa ggccataatt ttatttttaa aaaacgtctt 16620 ttttattgtt gaattggtgg tcaaaagggc attgtgtgac aaaaccgatt tcggggtgaa 16680 atggggcgaa aactgcaata atgcatttgc ggagctgaac aacataacaa tttttaacat 16740 tgttccgaca actaaaaggc cataatttta tttttaaaaa acgtcttttt tattgttgaa 16800 ttggtggtca aaagggcatt gtgtgacaaa accgatttcg gggtgaaatg gggcgaaaac 16860 tgcaataatg catttgcgga gctggaaaac ataacaattt ttaacattgt tccgacaact 16920 aaaaggccat aattttattt ttaaaaaacg tcttttttat tgttgaattg gtggtcaaaa 16980 gggcattgtg tgacaaaacc gatttcgggg tgaaatgggg cgaaaactac aataatgcat 17040 ttgcggagct ggaaaacata acaattttta acattgttcc gacaactaaa aggccataat 17100 tttattttta aaaaacgtct tttttattgt tgaattggtg gtcaaaaggg cattatgtga 17160 caaaaccgat ttcggggtga aatggggcga aaactgcaat agtgcatttg cggagctgga 17220 aaacataaca atttttaaca ttgttccgac aactaaaagg ccataatttt atttttaaaa 17280 aacgtctttt ttattgttga attagtggtc aaaagggcat tgtgtgacaa aaccgatttc 17340 ggggtgaaat ggggcgaaaa ctgcaatagt gcatttgcgg agctggaaaa cataacaatt 17400 tttaacattg ttccgacaac taaaaggcca taattttatt tttaaaaaac gtctttttta 17460 ttgttgaatt ggtggtcaaa agggcattgt gtgacaaaac cgatttcggg gtgaaatggg 17520 gcgaaaactg caacagtgca tttgcggagc tggacaagat aaattttttt aacattgttc 17580 cgacaactaa aaagccataa ttttattttt aaaaaacgtc ttttttattg ttgaattagt 17640 ggtcaaaagg gcattgtgtg acaaaaccga tttcggggtg aaatggggcg aaaactgcaa 17700 taatgcattt gcggagctgg aaaacataac aatttttaac attgttccga caactaaaag 17760 gccataattt tatttttaaa aaacgtcttt tttattgttg aattggtggt caaaagggca 17820 ttgtgtgaca aaaccgattt cggggtgaaa tggggcgaaa actgcaatag tgcatttgcg 17880 gagctgggca acataacaat ttttaacatt gttccgacaa ctaaaaggcc ataattttat 17940 ttttaaaaaa cgtctttttt attgttgaat tagtggtcaa aagggcattg tgtgacaaaa 18000 ccgatttcgg ggtgaaatgg ggcgaaaact gcaacagtgc atttgcggag ctgggcaaca 18060 taacaatttt taacattgtt ccgacaacta aaaggccata attttatttt taaaaaacgt 18120 cttttttatt gttgaattag tggtcaaaag ggcattgtgt gacaaaaccg atttcggggt 18180 gaaatggggc gaaaactaca ataatgcatt tgcggagcca taaatattga aattaggtga 18240 aaaagtggga aaaattaaaa agcaatttgt ggtaaataat tgagtttttc gctattagca 18300 tagtgacttt cttaaaatgt catttttaag agggtgttta gtgttttttc ggatttttat 18360 tttattaaac ggtttcaaag ctgacaacca aacatacatc taagcaaact gaaacttatt 18420 ttgctagtct gaaactggct tcatgttcgg gtctcttcgg aaattcgagg agctggtttt 18480 gggagctgtc ttcacacaaa t 18501 // ID Transib1_Dwil repbase; DNA; INV; 3060 BP. XX AC . XX DT 27-AUG-2008 (Rel. 13.09, Created) DT 27-AUG-2008 (Rel. 13.09, Last updated, Version 1) XX DE A new family of Transib elements in Drosophila willistoni. XX KW Transib; DNA transposon; Transposable Element; transposon; KW autonomous; Drosophila willistoni; Transib1_Dwil. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3060 RA Styles P.; RT "Transib1_Dwil: a new family of Transib elements in Drosophila RT willistoni."; RL Repbase Reports 8(9), 909-909 (2008). XX DR [1] (Consensus) XX CC Transib1 is a family of autonomous transposons in Drosophila CC willistoni and Drosophila mojavensis. The 3060bp consensus CC sequence was obtained by multiple alignment of 27 sequences. The CC consensus sequence encodes a 672 amino acid transposase, with CC some homology to Transib transposases, in a single ORF between CC positions 499 and 2518. Transib1_Dwil has 40bp terminal inverted CC repeats. Transib1_Dwil appears to have been active recently in D. CC willistoni. Elements are on average 98.02% identical to the CC consensus sequence, ranging from 96.38% to 99.30%. Two point CC mutations at positions 847 and 848, introducing a premature stop CC codon into the transposase ORF, are shared by six elements, which CC have apparently been propagated non-autonomously. Transib1 CC appears to have been involved in a horizontal transfer event CC between D. willistoni and D. mojavensis. The identity between the CC consensus sequences in these two species is 86.99%. XX FH Key Location/Qualifiers FT CDS 498..2516 FT /product="Transib1_Dwil_1p" FT /translation="MSCIYPHIILEIIFLATMSVCKRALFTIWHTEPDASK FT KTFAVETYIYTNIIDKTKFDKSQCAYIAMKIKNFIKKLTWKWKENHRICQN FT FEVNNQEWLDDDLVLVDQISNVGRPKLSFGDSQGKTKKKKLSDIVKEIPSN FT ELVSAASTSLYNCGKRSDESLPKKVISFIQSDDKQPINYTPEEALALYIDG FT GYTKKAYTDLQRGTKKRNANIFPTYDVLRSAKKLCYPSGLEVTDHSFEVPL FT QNLIDHTVSRIYKAHYKDFPSIENNNGSQLTAIYKWGCDGSSGHSTYRQNF FT NDLNACMIDQHMFSVYLVPLEILRGSEKVWKNEKPSSTRYCRPLKVLCRKE FT TADLVNDVVNNFKSQITTIQPTIVESFEVHHQFYMTMIDGKVFSVLAESSS FT SVCGICGTTPKKMNDLSDIKNLPCKENLYEYGLSSLHAWIRFLDCILHISY FT RLIIKNWQVRSTDKSAVDIRKLEIQKNLRKEMGILVDIPQPGGSGNTNNGN FT TARKFFHDAKLAAKITGVNYLLNSRFSVILRTLASGCEIDTQAFKHYAFQT FT AILFVDKYPWFYMPSSVHKILIHGAEIIEHFALPIGMMSEEALEARNKDQR FT KFRLNHTRKNSRSNTMEDLAHTLFISSDPFITMLAKSTDKFSKKDQFLDKD FT IISLILSKDAQEELSDCTTSDTD*" XX SQ Sequence 3060 BP; 1110 A; 473 C; 527 G; 950 T; 0 other; tgcacagtgg gcgaaacggc tgatttagct ccgattaatt caaatttaaa gtaaaacttg 60 gcgccaaaat tgtttttgca ttcgattact aacatattca atagtccaaa aacattaatc 120 gattcgattt tctaccaata tttgaacagc agtgaagctt tgaagtcagc atatgtttca 180 tcagctgttg ccatcagctt tcccgccaat tctgattgcg tataaaaaaa atttgtataa 240 attagtgttg ttgctttcaa ttgaactttt tgctgtaaat taaagaaaaa ggtatgtagt 300 tactaacaaa ataaatgttt gtgacctgtg attgtgtttc gtcccaaaat gtaatatttt 360 ctttggtact tatatgtgtc ttaacattta ctaataaatt gaactttagt agagtataac 420 ttttaaggtc cgactgggtt cggtatcaat gtttttttaa aaattagtct gtgtattacg 480 aagtgaaatt gtccacaatg agctgcatat atccgcatat tattttagaa ataatttttt 540 tagcaacaat gtcagtgtgc aagcgagcac tttttacaat atggcatacg gagcctgatg 600 cttcaaaaaa aacttttgcg gtagagacgt acatatacac caatattatt gataaaacta 660 aattcgacaa aagccaatgc gcgtacattg ccatgaaaat taaaaatttc ataaaaaaat 720 tgacatggaa atggaaagag aaccaccgaa tttgccaaaa ttttgaggta aacaaccaag 780 agtggttaga tgacgatcta gtattggtgg accaaatttc aaatgttgga cgacctaaac 840 tcagttttgg agatagtcag ggaaaaacaa aaaaaaaaaa attgtcggac attgtcaagg 900 aaattccgag caatgagttg gtctctgcgg caagtactag tttgtacaat tgtggaaagc 960 gaagtgatga aagtcttccc aagaaagtaa tatcgttcat tcaatccgat gataaacaac 1020 caattaacta cacaccagag gaagccttag ccttatatat tgatggtggt tacacaaaga 1080 aggcttacac ggacttgcag cgtggtacca aaaagcgaaa tgcaaatatt tttccgacgt 1140 atgatgtatt gcggtctgca aaaaaactgt gctatcctag tggtctagaa gtaactgatc 1200 attcatttga ggtaccttta caaaatttaa ttgaccacac agtatcccga atatataaag 1260 cacattataa agattttcca agtatagaga ataacaatgg ttcccagcta actgctatat 1320 ataaatgggg ctgtgatgga agtagtggcc actcaactta tcgccaaaat tttaatgatt 1380 taaacgcttg tatgatcgat cagcacatgt tttcggtgta cttggtacca ttggaaatac 1440 ttcgtggatc agaaaaagtt tggaagaatg agaaaccttc atccactcgt tattgcaggc 1500 cattaaaagt gctttgtcga aaggaaacag cagatttggt taatgatgtt gtaaataatt 1560 tcaaaagcca aataacaaca atacagccaa ctattgtaga aagttttgaa gtacatcatc 1620 aattttatat gaccatgata gatggaaaag tatttagtgt tcttgcggaa tcgtcatctt 1680 cagtttgtgg aatatgtggt acaactccaa aaaaaatgaa tgatctcagt gatattaaaa 1740 atttaccatg caaagaaaat ctctacgaat atggtttatc cagtctacat gcttggattc 1800 ggttccttga ttgcattctg catattagct atagactgat aattaaaaat tggcaagttc 1860 gaagtacaga taaatcagct gttgatatta gaaaactgga aatacaaaag aatttgcgaa 1920 aggagatggg tatattggtt gatattcccc agccaggtgg ctcaggcaat acaaataacg 1980 gcaacacagc gagaaagttt tttcatgatg caaaattagc tgccaaaata actggagtca 2040 actatctttt aaattctcga tttagtgtaa tcttacgaac actagcatcc ggatgtgaaa 2100 ttgatacaca agcctttaaa cattatgcat ttcaaactgc aatattattt gtagacaagt 2160 acccatggtt ctatatgcct tcatctgtcc ataaaatttt gattcacggt gcagaaatta 2220 tagaacactt tgcccttccg ataggtatga tgtcagaaga agctcttgag gccagaaata 2280 aagaccagag aaagtttcga ttaaaccaca cacgaaaaaa ttctcgcagt aacacaatgg 2340 aagacttggc tcacacactt tttatttcat cggacccatt tattacaatg ttagcaaaat 2400 ccacagataa attttctaaa aaagatcagt ttttagataa agacattatt agtcttattt 2460 taagtaaaga cgcacaagaa gagttatcag attgcacaac atcagatacc gattaggagt 2520 aaaattgaaa aattaaaaaa aaagtatata aaatattttt gaagttgatt atatttactc 2580 tatattgaat gattaaataa atgtatgttt taagaaagaa acatgttttt tttttaaaga 2640 tacaagcata tattttctcg attaaacata actcttgctg aaaatgagta cacatgcaag 2700 agaatattaa atttccgatg agctttttta ataatatatg tagctgtcca atttatttga 2760 attttttttt atatgtgtat atatccttta taagtaacag caaaaaaaaa aaattttgaa 2820 aatgtgtaac gaaatttctt aaaattgagc gaaaacgacc agacatagaa taaaacaggt 2880 ttacactaac ttggaacaaa taataaaaac aaatattttc actgaatgaa acaatttttg 2940 ttcattagtt ttgatgtgta ataaatttat ctacaaaaaa aaaaccgtac aaaaatagca 3000 taatttttgc agaagttatt aattaatcgc tgctaaatca gccgtttcgc ccactgtgtg 3060 // ID JAM1 repbase; DNA; INV; 3464 BP. XX AC . XX DT 04-OCT-2010 (Rel. 15.1, Created) DT 04-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE JAM1 RTE non-LTR retrotransposon family: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; JAM1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3464 RA Warren M.A., Hughes A.M. and Crampton M.J.; RT "Zebedee: a novel copia-Ty1 family of transposable elements in RT the genome of the medically important mosquito Aedes aegypti."; RL Mol. Gen. Genet 254(5), 505-513 (1997). XX RN [2] RP 1-3464 RA Jurka J.; RT "Non-LTR retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (04-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >99% identity. Probably active. XX FH Key Location/Qualifiers FT CDS 717..3425 FT /product="JAM1_1p" FT /translation="MGDMQRRVIGWWPINERMCKLRIKGRFFNFSIINVHS FT PHSGSTDDDKDAFYAQLEREYDRCPSHDVKIIIGDLNAQVGQEEEFRPTIG FT KFSAHRLTNENGLRLIDFAASKNMAIRSTYFQHSLPYRYTWRSPQQTESQI FT DHVLIDGRHFSDITDVRTYRGANIDSDHYLVMVKLRPKLSVINNVRYRRPP FT RYNLERLKQPDVANAYAQHLEAALPDEGELDRAPLEDCWRTVKAAINDAAE FT SVVGYVERSSRNDWFDEECQEVLEEKNAARAAMLQHGTRQNVERYRLKRKQ FT QTRLFRDKKRRLEEVECQEMELLYRSQETRKFYQKLNTSRKGFVPRAEMCR FT DKDGSILTDGREVIERWKQHYDEHLNGAENTGTEGQDSEGDGYVSTADSGN FT QPAPTMGEVKDAIQQLKNNKAAGKDGIGAELIKMGPDRLAACLHRLIVRIW FT ETEQLPEEWKQGVICPIYKKGDKLECENYRAITILNAAYKVLSQILFRRLS FT PIANEFVGSYQAGFIDGRSTTDQIFSVRQILQKCREYQVPTHHLFIDFKAA FT YDSIDRIELWKIMDENSFPGKLTRLIRATMDGVQNCVKISGEHSSSFESRR FT GLRQGDGLSCLLFNIALEGVMRRAGLNSRGTIFTRSGQFVCFADDMDIIGR FT KFETVADLFTRLKREATRVGLMVNASKTKYMLVGGTERDRTRLGSSVTIDG FT DTFEVVDEFVYLGSLLTADNNVSREIRRRIISGSRAYYGLQKKLRSRKIHP FT RTKCTMYKTLIRPVVLYGHEAWTMLEEDLQALGVFERRVLRTIFGGVQENG FT VWRRRMNHELAQLYGEPSIVKVAKAGRIRWAGHVARMPDNNPVKMVFATNP FT VGTRRRGAQRARWIDQVHQDLESVGHSRGWREAAMNRGNWRNIVGEALSR" XX SQ Sequence 3464 BP; 941 A; 846 C; 1010 G; 667 T; 0 other; ttggcgcggt aaatggctgg gcatggcgta ccattggtac ctcgcgtacc tgcaggaata 60 aaatagaccc ctttgtgcgg tccttagcct cttgcccagc aactcctatc cctacctcct 120 cgtggtactg gccggggtac gagtaacctt ggggaagatc gggtaaccaa cccccggtgg 180 gaactttggt cgtatgctga cagggaaggg ggggtttgct tttgcaaacc tggagcgtct 240 gtactccacg ttaggagcgg ctcacaacag cgtctgttcc ccatgtcagg ggcggctgat 300 catcgtccga gtgccagaga aggactctaa gctaaactgc gcactatggc cctccgaaca 360 tttaggggga atggtcctcc ggaaatctag ggggttggtg tcaggccctg cgagccagcc 420 gtaaaaacac atcagcacag gaacgtcaac gagagaatac ggaccggaac aatcggcaaa 480 gaccacagcg acgaaaatgg actagcgatt ggaaactcgg tacgtggaac tgcaaatctc 540 tcaacttcat tggaagtact cgcatactct ccgatgtact gaagacccgc ggtttcgaca 600 tcgtagcgct gcaggaggtg tgctggacag gagcattggt gcgaacgttt agaggtaatc 660 ataccatcta ccagagctgc ggcaacacac gcgagctggg aacagctttt atagtgatgg 720 gtgatatgca aaggcgcgtg atcgggtggt ggccgatcaa tgaacgaatg tgcaagttaa 780 gaatcaaagg ccgattcttt aacttcagca taatcaacgt gcatagccca cactccggaa 840 gcactgatga tgacaaggac gcattttacg cgcagctcga acgcgagtac gaccgctgcc 900 caagccacga cgtcaagatc atcataggag atttgaacgc tcaggttggc caggaggagg 960 agttcagacc gacgattgga aagttcagcg cccaccggct gacgaacgag aacggcctac 1020 gactgataga ttttgccgcc tccaagaaca tggccattcg tagcacctat ttccagcaca 1080 gcctcccgta tcggtacacc tggagatcac ctcagcagac agaatcgcaa atcgaccacg 1140 ttttgatcga tggacggcac ttctccgaca taaccgacgt cagaacctat cgtggcgcca 1200 acattgactc cgaccactac ctggtgatgg tgaaactgcg cccaaaacta tccgtcatca 1260 acaatgtacg gtaccgacgc ccgccccggt acaatctcga gcggctgaaa caaccggatg 1320 tcgccaatgc gtacgcgcag catcttgagg cagcgttgcc ggatgagggc gagctcgata 1380 gggcccctct tgaggactgc tggaggacag tcaaagcagc cattaacgac gctgccgaaa 1440 gcgttgtcgg atatgtggaa cggagctcaa gaaacgattg gttcgacgag gagtgccaag 1500 aggttttaga ggagaagaat gcagcgcggg ctgcaatgct gcagcatggt acgcggcaaa 1560 acgtggaacg atacagactg aagcggaaac agcaaacccg cctattccgg gacaaaaagc 1620 gccgcctgga agaggtggaa tgccaagaga tggagttgct gtaccgttct caagaaacgc 1680 ggaagttcta tcagaagctc aacacatccc gcaaaggctt cgtgccgcga gctgagatgt 1740 gccgggataa ggatgggagc atcttgacgg acggacgcga ggtgatcgaa aggtggaagc 1800 agcactacga tgaacacctg aatggcgcag agaacacagg cacagaaggt caggacagcg 1860 aaggcgatgg ctacgtcagc acagcggaca gcggaaatca accagctccc acgatggggg 1920 aagttaagga tgccattcaa cagctcaaga acaacaaagc cgctggcaag gatggtatcg 1980 gagccgaact catcaagatg ggcccggaca ggttggccgc ttgtctgcat cggctgatag 2040 tcagaatctg ggaaacggaa cagctaccgg aggagtggaa gcaaggcgtt atatgcccta 2100 tctacaaaaa gggcgacaaa ctggagtgtg aaaattatcg tgcaatcacc atcctaaacg 2160 ccgcctataa agtgctatcc cagattctct tccgtcgtct atcacctata gcaaacgagt 2220 tcgtgggaag ttatcaagca ggtttcatcg acggccgctc gacaacggac cagatctttt 2280 ccgtgcggca aatcctccag aaatgccgtg agtaccaggt ccctacgcac catttgttca 2340 tcgatttcaa ggcggcatac gatagtatcg accgcataga gctatggaaa atcatggacg 2400 agaacagctt tcccgggaag ctcacaagat tgatcagagc aacgatggac ggtgtgcaaa 2460 actgcgtgaa gatctcgggc gaacactcca gttcgttcga gtctcggcgg ggactacgac 2520 agggcgacgg actttcgtgc ctgttgttca atattgcgct tgaaggtgtc atgcggagag 2580 ccggacttaa cagtcgaggc acgattttca cgagatccgg acaatttgtt tgcttcgcgg 2640 acgacatgga tattattggg agaaaatttg aaacggtggc agatttgttc acccgcctga 2700 aacgcgaagc aacaagagtc gggctaatgg tgaatgcgtc gaaaacaaag tacatgctgg 2760 ttggcggaac tgagcgcgac aggacccgcc taggaagcag tgttacgata gacggggata 2820 ccttcgaggt ggtggacgag ttcgtctacc tcggatcctt gttgacggct gacaacaatg 2880 ttagtcggga aatacgaagg cgcatcatca gcggaagtcg tgcctactat gggctccaga 2940 agaaactgcg gtcaagaaag attcaccccc gcaccaaatg cacgatgtac aaaacgctca 3000 taagaccggt agtcctctat gggcatgagg cgtggactat gctcgaggag gacttgcaag 3060 ctcttggggt tttcgaacgc cgagtgctaa ggacgatctt cggcggcgtg caggagaacg 3120 gcgtgtggcg gcgaaggatg aaccacgagc tcgctcaact ctacggcgaa cccagtatcg 3180 tgaaggtagc taaagctgga aggatacgct gggcagggca tgttgcaaga atgccggaca 3240 acaaccctgt aaagatggtg ttcgccacga atccggtcgg aacaagaagg cgtggggcgc 3300 agcgagctag gtggattgac caggtacacc aggacctgga gagcgtgggt cacagtcgag 3360 gatggagaga agcggccatg aaccgaggga attggcgaaa tattgttggc gaggctttat 3420 caagataatt gatgtaaagc caaataagta agtaagtaag taag 3464 // ID Gypsy-72_CQ-I repbase; DNA; INV; 6771 BP. XX AC AAWU01040934; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-72_CQ_; KW Gypsy-72_CQ-LTR; Gypsy-72_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6771 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 523-523 (2011). XX DR Genome; AAWU01040934; Positions 25057 18287. XX CC Positions [3512-3973] - Reverse transcriptase CC Positions [4997-5473] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 366..2741 FT /product="Gypsy-72_CQ-I_2p" FT /translation="MESYGSFPEERNREEAIDTWYSDLRADMLTEDELNFE FT LIVRDILVDRDRSRMRGCLRSKLKSEQEAGQIPPDRTLKNPVRQEIEIARQ FT KLEEIETQIKNGSAEQIAYMRSRLLHLGARLRLALLSAAEMEKDDLTTIFS FT IVMDHLYYFYYSDEARGNETESGTESENEEPTLEQIIQQAGAAETLQVDKA FT VLLRELEKFRKANEEESRLRNELQALQNDISVGRQQSDMGDQGVQTDDLNF FT LFQQWQISSAGLTIPTCTPTTTSSFSPSPGFLGHPKIKQVPKSVSFNVPST FT YGLGTSYSGGFSSYPSYTLLNNPIPSPYPQPPFGYQNSSLPRYQQPHPPFG FT NPQQFYRPPSNFPPYPQNQNFNYSYYQQTQPLPLPLPLPPPPPPQQQPQPP FT NPQPPRKTRPQQQPQQQPQQQPQEHQQNQVPQTSYLKSLPVSKWGIEKYAG FT DDQGLNLNEFLFIVKSHAMSEKVADDELFNSALHLFSGPALNWYQSMRSTG FT RLYNWDHLVWELRKAFVHPDLDSMVRVKVYLRKQQRNESFQSYNHDMEKLF FT RSMTVQIPEIEKKKLVQQNMRSDYRKQLTFLTIPDLPTLNAVGRMIDAGNF FT SLYNKMFGSEKSVNMVSEPPEKKTEVKKTPQLAPQQTHIQPANTNNNNSRQ FT NQQPGNQNNYQPRGNQDNYQARGSGSNPNQGNQYRANPAANATNTFTQNLA FT QPGGSGLNNSNRTASTPSVLARLVDEWRPPNPNVCYNCGRTQHYHEECREP FT KRVFCLICGFKGFETLNCPYCRKNGIQAVASRAPSSNQQRN" FT CDS 2801..5851 FT /product="Gypsy-72_CQ-I_1p" FT /translation="MYAAPDEVNQISLLNSDDNRPYAHVKVYGVHTKALLD FT SGSQSTMINDTLFAKLKSKNLKPLNRQIILKTAGGDSLEISGQFMIPFSFD FT GRIKIIPTLVVPRLAVDCLCGMDFWNKFQISPIIQYTAFAEELISPASPKD FT TILNHSEQEMIENIKRTFKTAMPGQLSTTPLISHNIELKEEWIGKPAVRQF FT PYVMSPKVQALVAEELDRMLELGIIERTHSDWALNVVPVIKPTGKVRLCLD FT ARKINERTVRDSYPLPHPGRILGQLPKARYLSTIDLTEAFLQIPLDENSRK FT YTSFCVQGKGLFRFTRLPFGLINSPATLSRLMDRVLGFGELEPNVFVYLDD FT VVVVSETFEEHVRLLAEVSRRLREANLAINVDKSRFGVSELPFLGYLLSTE FT GLRPNPDKIRPIVDYERPTTITKLRRFLGMANYYRRFLPDFSGIVAPLTEL FT LKTKSKIIQWTDEAEQAFIAVKECLISTPVLCSPNFEKEFVIQTDASDVAV FT AGVLTQIQEGQERVISYFSHKLTTPQRNYHPAEKEALAALLSIQAFRGYIE FT CCHFTLITDSSALTHILTTSWKVGSRCSRWSLNLQQYDMTIIHRKGKDNVV FT ADALSRSVAAVSQGAEDEWYGSMKQSVLENPDDFVDFRVINGQLFKFVSTN FT DSPYDHRYEWKIVVPLGDRESIITTNHDDAMHLGYEKTLSRVRQRFYWPKM FT AKDIRQYVQNCGICKEVKGVTVPVVPKMGEMRVADHPWQIIAVDFIGPLPR FT SKKGNQHLLVVVDLFSKWVSLHAVRKIDSSNLCIVLKDLWFHKNSVPQVII FT SDNASCFTSREFKNLLDRFGIVHWLNSRYHSQANPVERVNRTVNAAIRTYV FT RDDQRLWDTRLAELETILNTSTHSSTKLTPYFVTHGHEMFIKGEDHQLVAN FT QEPLPEETMEKNRKKLFGEIYDQVKGNLTKANEVCRKQYNLRHRRFPNSFT FT PGQLVYRKNMRLSSAVDNYNAKYGPQYLPCKVKAKIGTSSYDLEGLDGKAL FT GVWPAAHLKPG" XX SQ Sequence 6771 BP; 2093 A; 1373 C; 1414 G; 1891 T; 0 other; attggcgcgc agcgtaaaat caaaaatcaa agatttttat cctttcgagt gggggaagaa 60 cgatagttgg cacttttatc gcgatttttt tggatttatt atccacgctt tgggatgtgt 120 gttcgtaaat ggaggttatg tcaaccatat gcttttttct tataggttat attcagaatt 180 acaagtcgat ttgactccct ttttttgttt taagtttccc ttaaggtatt tttgggtgaa 240 aaaagttctt atccataatc cttaattaat ttcactttaa gatttgttta tttttttgct 300 tgattcagtc aaattttgga acttttagtt ttcatcgctt agtagtttaa ttagattatt 360 ttaaaatgga gtcttacgga tcatttccag aagaaagaaa tcgagaggaa gctatcgaca 420 cgtggtattc agacctgcgt gctgatatgt taactgaaga tgaattgaat ttcgaattaa 480 ttgtacgaga cattttggtt gatcgtgatc gttctcgaat gcgaggatgc ttacggtcta 540 aattgaaaag tgaacaagaa gctggtcaaa taccaccaga cagaactttg aagaatccag 600 taagacaaga aatcgaaatt gcccgccaaa aacttgagga aattgaaact cagattaaaa 660 atggtagtgc tgagcaaatc gcctacatga gaagcagact gttgcacctg ggtgctagac 720 ttaggttggc acttttaagt gcagctgaaa tggaaaaaga tgatctgacg acgatttttt 780 caatcgtcat ggatcatttg tattactttt actatagtga tgaggcccga gggaatgaaa 840 cagaatcagg aacagaaagt gaaaacgaag aaccaacctt ggaacaaatt attcaacaag 900 ctggtgctgc ggagactctt caagtagaca aagcagtact acttcgagag ctggaaaaat 960 ttcgaaaagc aaacgaagag gaaagtcgtc tcagaaatga attgcaggcc ttacagaatg 1020 atatttcagt gggacgccag cagtctgata tgggagatca aggggttcag acagatgatt 1080 taaacttttt gtttcaacag tggcaaattt caagtgcagg tttgactatt ccgacctgta 1140 caccaacaac aacgtcgagc ttttctccta gtccaggatt tttaggacac ccaaaaatta 1200 agcaagtccc caaaagcgtt agttttaatg tgccatccac ttacggactc gggacatcgt 1260 actcgggtgg tttttctagt tatccgagct acacactgtt gaacaaccct atcccatctc 1320 cataccccca acctccattt ggttaccaaa attcatccct acccagatac cagcaacccc 1380 atccaccctt tggaaatcct caacagtttt acagaccacc atcaaatttt ccaccatacc 1440 ctcagaacca aaattttaat tattcgtact atcaacaaac ccaaccactt cccctgcctc 1500 tacccctacc tccaccacca cccccacaac aacaacctca accacccaat ccccaacccc 1560 ctcgcaaaac acgtcctcaa caacaacccc aacaacaacc tcaacaacaa ccccaagaac 1620 atcaacaaaa tcaagtacca caaacaagct atctcaaatc tttacctgtg tcaaagtggg 1680 gcattgaaaa gtatgctggt gacgaccaag gattgaactt gaatgaattt ttgtttattg 1740 tgaaaagtca tgctatgtcg gaaaaagtag cagatgatga actatttaat tcggcgctcc 1800 accttttttc gggaccggcc ttaaactggt atcaatcaat gcgttccacc ggaagattgt 1860 acaattggga tcaccttgtg tgggaactta gaaaagcttt tgttcatccc gacctcgaca 1920 gtatggtcag ggttaaagtt tatcttcgca aacaacaacg aaatgaatct tttcaatctt 1980 ataaccatga catggagaaa ttgtttcgat ccatgacggt tcaaattccc gaaatagaaa 2040 aaaagaaact ggttcagcag aacatgcgat cagactatcg caaacaattg acgttcctta 2100 cgattcctga tttaccaacc cttaacgccg ttggcaggat gatcgatgct ggaaattttt 2160 cattatacaa caaaatgttt gggtctgaaa aatcagttaa tatggtttca gaaccccccg 2220 aaaagaagac tgaggtcaag aagacacctc agctggcgcc ccaacaaacc catattcaac 2280 cggctaatac taataataac aactctcgac agaatcaaca acccgggaat cagaataact 2340 atcaacctcg tggaaatcag gataattacc aagcacgtgg gagtggtagt aatccaaacc 2400 agggaaatca gtacagggct aaccctgcag ccaacgcgac aaatactttt actcaaaatc 2460 ttgctcaacc aggtggctca ggcctgaaca attcgaacag gaccgcttcg actccgtcgg 2520 tcttagcacg tttggtggac gagtggagac ctccgaatcc gaacgtttgc tacaactgtg 2580 gtagaactca acattatcac gaagagtgtc gcgagcccaa aagggttttt tgtttaattt 2640 gtgggttcaa gggatttgaa acattaaatt gtccgtattg tcgaaaaaac gggattcagg 2700 cggttgcaag tcgcgctccg tcgagcaatc agcagcgcaa ttaaatgtct cgcttttgga 2760 gtcggagttt tctcaaattt gggaaccagc ctccgaaaaa atgtatgcag ctcctgatga 2820 agtgaatcaa atttccttgt taaattcaga tgataatcga ccctacgctc acgtgaaggt 2880 ttacggtgtt cacaccaaag cgttgttaga cagtggaagt cagtcaacca tgattaatga 2940 tactttattt gcaaaattaa aatcgaaaaa tttaaagcca ctcaaccgcc agatcattct 3000 gaaaacagcg ggtggcgact cacttgagat ttcgggtcaa ttcatgattc ctttctcgtt 3060 tgatggaaga attaaaataa taccaacttt agtcgttcca cggctagcag tagattgttt 3120 gtgtggaatg gatttttgga ataaatttca aatttctcca attattcaat acaccgcgtt 3180 cgcagaagag ttaatctccc ctgcgtcacc gaaagataca attttaaatc actccgaaca 3240 ggaaatgatc gaaaatatta aaagaacttt caaaactgcc atgccaggac aactttcaac 3300 aacaccactt atttcacaca atattgaact gaaagaggaa tggataggga aaccagccgt 3360 tcgtcagttc ccatacgtta tgtcacctaa ggttcaggca ctcgtagcgg aggaattaga 3420 tcggatgctc gaactgggaa taattgagcg aactcattca gattgggctt tgaatgtagt 3480 gccggttatc aaacccacgg gcaaggttag gctctgctta gatgctcgaa agataaatga 3540 aagaactgtg cgtgattcct accctttacc ccaccctggt cgaatcttag gacagttacc 3600 aaaggcgagg tatctctcaa cgatcgattt gacggaagct ttccttcaaa tcccgttgga 3660 tgagaactcc cggaagtaca cgtccttctg tgtccagggt aaagggttgt tccggttcac 3720 cagactccca ttcggtctaa tcaatagtcc agctacgctg tcgagactga tggaccgagt 3780 gttgggtttt ggtgagctgg aaccaaacgt tttcgtttac ttggacgatg tggtagtagt 3840 tagcgaaacg tttgaggaac acgtacgact tctagcagag gtctcacggc gactaagaga 3900 ggctaacttg gctatcaatg tggataaatc gcgttttggc gtttctgaat taccattttt 3960 aggatattta ttatccactg aaggattgcg tcccaacccg gacaaaattc gaccgatcgt 4020 agattacgaa cgaccaacaa cgataactaa attaagacga tttttaggaa tggcaaatta 4080 ttatcgccgt ttcctacccg actttagtgg gatagtagct ccattaactg aactactcaa 4140 aaccaagtca aaaattattc aatggacgga tgaagctgaa caggcattca ttgctgtcaa 4200 agagtgtctg attagtaccc cagtgctatg tagcccgaat tttgagaagg agtttgtcat 4260 tcaaacagac gcttcagatg tcgcggtagc gggggtgtta acgcagatac aagagggaca 4320 ggagagagta atttcatatt tttcacataa acttacaaca cctcaacgga attaccaccc 4380 ggctgagaaa gaggccctag ctgccctgct ttcaattcag gcttttagag gttacattga 4440 atgttgtcac tttacattaa ttactgattc gtcagcgttg actcacattt taaccacctc 4500 gtggaaagtg ggttcacgtt gcagccgctg gagtttaaac ctccagcaat acgatatgac 4560 aattatacac cgaaaaggga aggataatgt agtagccgac gccctgtcac gcagcgtcgc 4620 ggcggtctct cagggtgctg aagatgaatg gtacgggtca atgaagcaga gtgttttgga 4680 aaatcccgat gattttgtag attttcgggt gatcaacggg caacttttca agtttgtttc 4740 caccaacgat tctccgtacg accaccgtta tgaatggaag atcgttgtgc ctttaggaga 4800 tcgcgaatct attataacta ccaaccacga cgatgccatg catctcggtt atgagaagac 4860 tctttcgcgc gtacggcagc gcttttattg gccgaaaatg gcaaaagata ttcgtcaata 4920 tgtacaaaat tgtggaattt gtaaagaagt taaaggtgtt accgttccag tggtaccaaa 4980 aatgggagaa atgagggtgg ccgatcatcc gtggcagatt atcgccgtcg acttcatcgg 5040 gcccctcccc cgcagcaaga aaggcaatca acaccttctt gtagtggttg accttttcag 5100 taaatgggtt tcactacatg ctgttaggaa aattgatagt tcaaatttat gtatagtttt 5160 aaaagatttg tggttccata agaactctgt gccacaagtt atcatttctg ataacgctag 5220 ttgttttaca tcgcgtgaat tcaagaattt gttggacaga tttgggatag tccattggtt 5280 aaactctcgt tatcattcgc aagcgaaccc agtagagcgt gtgaaccgaa ccgtaaacgc 5340 agcaattagg acttatgtca gggatgatca acgcctttgg gacacccgtt tggccgaact 5400 tgagacaatt ttgaacacta gcacccacag ttccacaaaa cttactccat atttcgttac 5460 ccacggacat gaaatgttta taaaaggaga agatcatcaa ttagtcgcta atcaagaacc 5520 actgccggaa gagacaatgg aaaagaatcg aaagaaactg tttggggaaa tctatgacca 5580 ggtgaaagga aatttgacga aagcgaatga agtttgtcgg aaacaataca atcttagaca 5640 ccgtcgtttt ccaaattcgt tcacaccagg acaactagtt taccgtaaaa acatgcgtct 5700 ttcgagtgcc gttgacaatt ataatgctaa atacgggcct caatatttgc cttgtaaagt 5760 taaggccaag attggaacat cttcgtatga tttagaaggt ttagatggaa aagcactcgg 5820 cgtttggccc gcagcacatc ttaagccagg gtaaaaatat ctagtttatc atcaataact 5880 taagttaact tttttggaat taatttggtt ttggagaccc agatccttcg atcgaaatcc 5940 caactagaat aatttaatta taacactttg ctgcttcttg aaaaaagcta aaaaaaaatt 6000 agaaataaat tgatgttgct taaatgcgag cacttcgatg gttaagagca aactatttaa 6060 aataattgca ttacatctgt taaatgcccg atgttggtac tctacatgtc tgcctcgatg 6120 attgtccgag tcagatatgt taatatgtgt gagaagcctc acggtgtatt tttttttgtt 6180 cttttgataa gaatgttaca gtatacacag tagtggtatt ctccaatatg aaacaatagt 6240 tttcgcctca gcctccggac gcggagctga tagcgaacga ttacgatcgt gttgagggtc 6300 gactgataat aaagcgaccg aattggtatt agttaagaaa ttgatggtag gagtccagtg 6360 tgaacgataa agggatttgg aataattgcg atgagtgtga aagagacaaa atttgaatag 6420 cctaggattc gaatttggat gaaagtgata gaatgtttga ttttggaaat gatacgtttg 6480 accgtgttaa ttatttgata tagcacttgt gagagcgaga attatttttt tgttgcccca 6540 gtgcattaga agatttttta tttgtagtat aaattggatt ggataaatga atatgaactt 6600 ttccctcaga aatgcagagg tttgagacag tagagtaaat gtggtcttga ctaagtcgtt 6660 ctgggagtaa tcataagatt tagtatataa ccgatataag tttagtttac gagcatatta 6720 accccaaaaa aaattaaatt ctttaatttt tccccctgac ccagggagta t 6771 // ID HERO-2_BF repbase; DNA; INV; 3666 BP. XX AC . XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE Amphioxus HERO-2_BF autonomous non-LTR Retrotransposon - DE consensus. XX KW Hero; Non-LTR Retrotransposon; Transposable Element; HERO-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3666 RA Kapitonov V.V. and Jurka J.; RT "Young families of HERO non-LTR retrotransposons from the RT amphioxus and sea urchin genomes."; RL Repbase Reports 9(5), 1138-1138 (2009). XX DR [1] (Consensus) XX CC The consensus sequence was built from several sequences less than CC 5% divergent from each other. The HERO-2_BF consensus sequence is CC 66% identical to the pufferfish HEROTn consensus sequence. CC Threfore, a horizontal transfer was involved in evolution of CC these families. XX FH Key Location/Qualifiers FT CDS 279..3581 FT /product="HERO-2_BFp" FT /note="reverse transcriptase and REL endonuclease." FT /translation="MNAVCVCGKVCKNQRGLRIHQTKMACLRRVQAEHRSG FT AVATTVEPVLSASAPGQTEEDQGPEAPHSARNLRATPAPPQGRKSDHHRVK FT WPAANSKEWSQFDEDVDMILESVSRGSTDQKLQSMCTVIMSMGAERFGTIG FT QRKPTDTMKPNRREVKIRQLRQELKSLRRSFKASTSGEERAALAELTHHLR FT EKLRTLRRAEWHKKKGKERARKRSAFITNPFGFTKRLLGQKRSGNLTCPVE FT EINLHLSNTFSDASRDVDLGPCPLLVTSPEPEVHFDISEPTLKEVRETVKA FT ARSSSAPGPSGVVYKVYKHCPRLVVRLWRILKVVWRRGKVAADWRQAEGVW FT IPKEEESSKVDQFRLISLLSVEGKIFFKIVAQRLIKYLLDNQYIDTSVQKG FT GVPGVPGCLEHTGVVTQLIREAKENRGDLAVLWLDLANAYGSIPHKLVETA FT LTRHHVPESIQNLILDYYSNFWLRAGSSTATSAWQRLEKGIITGCTISVPL FT FALAMNMIVKGAEAGCRGPVSRSGTRQPPIRAFMDDLTVMTATVPVCRWLL FT QGLERLITWARMSFKPAKSRSLVLKKGKVAERFRFTLGGTQIPTVSEKPVK FT SLGKVFNSSLKDTASVQQTRSDLTTWLEGIDKTGLPGSFKAWMFQHGVLPR FT VLWPLLVYEVPMTMVEQLERTISRFLRKWLGLPRSLSNIALYGRSTKLQLP FT LSGLTEEFKVTRAREVLMYRDSSDSKVSSAGIHVRTGRKWKAQEAVDQAEA FT RLRHSVLVGSVAVGRAGLGSCPKPRYDKVSGKEKRLLIQDEIRAGEEEDRR FT CRMVGMRKQGAWTRWEHADSRKVTWPELCRAEPSRIKFLISSVYDVLPSPA FT NLHVWGLAETPSCQLCQRRGTLEHILSCCPKALGEGRYRWRHDQVLRVLAD FT TVSNAIQSSRSQQPPKKSIVFVRAGEKTRQQPTSAGGLLSTARDWQLLVDL FT GRQLKFPEHIVATSLRPDMVLVSESTRQVVLLELTVPWEERISEANERKRA FT KYAELVVQSQSNGWRARCVPVEVGCRGFAGQSLAYVLKLLGVRGFRLRKSI FT RDILEAAEKASRWLWFRRGEPWKPHGHRSGNDQPRLGRPGEGVW" XX SQ Sequence 3666 BP; 928 A; 888 C; 1114 G; 736 T; 0 other; ttttcagtct ggctcagcca gtgaccgccg ggaaagtccg gctgactacc acgaataggg 60 tggtgacagc tggatagaca gacgacagct cggaaagacg gcattggggc agtatgggtt 120 ggcaccccta actgcatctc ccctaggaga gcatcccgca acacgctaca aagaaccaca 180 aagagcaata cccccaggga tgcccgagag ggggggagga tgagcatccc attcggacgg 240 tccaatcggt attgacccca gcaaacggag aatcgacaat gaatgcagtc tgtgtgtgtg 300 gcaaggtatg taagaaccag agaggtttga gaatccacca aacaaagatg gcctgcttaa 360 ggagggtgca ggcggagcac cgctcagggg ctgtggcaac cactgtagaa ccagtgttgt 420 cagcatcagc ccctggtcag acggaggagg atcagggccc ggaagctccc cacagtgccc 480 ggaacctccg cgcaacgcct gcccctccac aaggcaggaa gtcagatcat caccgagtga 540 agtggccagc cgcaaactcc aaggagtggt cgcagtttga cgaggacgtt gacatgatct 600 tggagtcggt gtcaagaggt agtacagacc aaaagcttca gtccatgtgc acagtgatta 660 tgtccatggg ggcagaacga tttggcacga ttgggcagag gaaaccgaca gacacaatga 720 agccaaatcg ccgggaagta aagatccgtc aactgaggca ggagctaaag tcgttgaggc 780 ggagctttaa ggcgagtacg tcgggagagg agagagctgc tcttgcagag ctcacacacc 840 accttaggga gaagcttagg accctcagaa gggcagagtg gcacaagaag aagggtaaag 900 aaagagcccg gaagcgcagt gctttcatca ccaacccttt cggcttcacc aagcgactcc 960 tagggcagaa gaggagtggg aacctgacct gcccagtcga ggagatcaac ctccacctca 1020 gcaatacctt cagtgatgcc tcgagagatg tggatcttgg tccttgccct ttgctggtga 1080 cttcacctga gccggaagtg cactttgaca tctctgaacc aactctgaag gaggtcagag 1140 agacagtcaa ggcggcgagg tccagttcgg cgccaggtcc cagtggcgtg gtatacaagg 1200 tctacaaaca ttgcccacgg cttgtggtgc gcctctggag gatcctaaag gtggtctggc 1260 gcagaggtaa agtggcggct gattggaggc aagccgaggg ggtttggatc ccaaaggaag 1320 aggagtcaag taaggtagac cagttccgct taatttctct gctcagtgtt gagggaaaga 1380 tcttcttcaa gattgtggcc cagcgtctaa taaagtacct tctggacaac cagtatattg 1440 acacatctgt gcagaagggg ggagttcctg gtgtcccagg atgtcttgaa cacacgggcg 1500 tagtgaccca gctcatccgg gaggctaagg agaacagagg ggacttggca gtcttgtggc 1560 tggatctcgc gaatgcgtat ggttcgatcc cccacaagct tgtggaaaca gcactgacca 1620 gacaccatgt tccagagtca attcagaacc tcatcttaga ttactacagc aacttctggc 1680 taagagctgg ctccagtaca gcaacttcag catggcaacg gttagagaag ggcatcatta 1740 ctggatgtac gatttcagtg cccctctttg cactagcgat gaacatgatt gttaaaggag 1800 cggaagcagg atgtaggggt cccgtgtcta ggtctggaac caggcagccg ccgattcgag 1860 ccttcatgga cgatctgacg gtgatgactg caacagtccc ggtgtgtaga tggctcctac 1920 agggattaga gcgtctcatt acatgggcac ggatgagttt caagccggcc aagtcaagat 1980 ctcttgtcct gaagaagggg aaggtggctg aaaggttccg tttcaccctg ggaggcactc 2040 agattcccac agtgtcagag aaaccagtca agagtctggg caaggtgttc aacagctctc 2100 tgaaggacac cgcttcagtt cagcagacta ggagtgacct gacaacgtgg ctcgagggaa 2160 ttgacaagac agggctacct ggtagcttca aggcctggat gttccagcat ggagtcttgc 2220 caagggtact ctggcctctt cttgtgtacg aggtgccgat gaccatggtg gagcaactgg 2280 agagaaccat cagcaggttc cttcgcaaat ggttggggct cccgaggtcc ttaagcaaca 2340 ttgccctgta cggtagatcc accaagctgc agcttccctt gagtggcctg actgaagagt 2400 tcaaggttac ccgtgcaaga gaagtgttga tgtaccggga ctcctcagac tccaaggtct 2460 cttcagccgg catccatgtc aggactggaa gaaaatggaa ggcacaggaa gcagtggatc 2520 aggcagaggc aaggttgaga cacagtgtcc tcgtggggtc cgtggcagta ggacgggcag 2580 gactgggcag ctgcccaaag cctcggtacg acaaagtcag cgggaaggag aagcgtctac 2640 tgatccagga tgagataagg gctggggaag aggaggatcg gcgatgcagg atggtaggca 2700 tgcgcaagca aggtgcgtgg actaggtggg aacatgctga ctcccgcaag gtcacatggc 2760 cagagttgtg cagagctgag ccttctcgga tcaaatttct catctcttca gtgtacgacg 2820 tgcttccaag tccagctaac ttgcatgtct ggggcttggc agagaccccc tcatgccaac 2880 tctgtcagag gagaggtacc cttgaacaca ttctcagttg ttgtccgaaa gcactagggg 2940 aagggaggta ccgctggcgg catgaccagg ttcttagggt gttggcagac acagttagca 3000 acgccatcca gagtagcagg agtcagcaac cccccaagaa gtcaattgtc tttgtcaggg 3060 ccggagagaa aacccgacaa caacccactt ccgcaggtgg gcttctctcc actgctagag 3120 attggcagct tctagtcgac cttgggagac agctcaagtt tccagaacac attgtagcca 3180 cgtcacttcg ccctgacatg gtactcgtgt cagaatccac cagacaagtg gttctgctgg 3240 agctaactgt tccctgggag gagcggataa gcgaagccaa cgagcggaag agggcgaagt 3300 atgccgaact ggtagtacaa agccagagta atgggtggag agcccggtgt gtaccagtgg 3360 aggttggttg ccggggtttc gcagggcagt ctttggctta tgtgttaaaa ctccttggag 3420 taagaggttt ccgtcttcgg aaatccatca gggatattct agaggctgcg gagaaagcct 3480 cacgttggtt gtggttccgt aggggggaac cgtggaagcc acacggacac aggtcgggga 3540 atgatcaacc tcggctgggt cgcccgggcg agggtgtatg gtgattaaag acccgaaaca 3600 cccaatgacc ccgggttcat cactgatgat gtgtccctgt tcgcactacc agagtgtatt 3660 ctagag 3666 // ID Gypsy-23_DWil-LTR repbase; DNA; INV; 200 BP. XX AC scaffold_181117; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_DWil_; KW Gypsy-23_DWil-I; Gypsy-23_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-200 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181117; Positions 35872 36071. XX SQ Sequence 200 BP; 63 A; 40 C; 37 G; 60 T; 0 other; tgtagtatac ttttagggat aagattaggg cacatacttt tagggataga ttgtaagtaa 60 ttagaactta agcataatgt ctagctttaa gtaccgttaa ggacactagc cataagacca 120 atgtaaacgt tctcctataa ataccgggct tgcggcccca ataaatcact tcaagtcaaa 180 ccttgccttc gcctgtttca 200 // ID Sola1N-1_AP repbase; DNA; INV; 331 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; KW Sola1N-1_AP. XX NM Sola1N-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-331 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2069-2069 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 331 BP; 114 A; 47 C; 52 G; 118 T; 0 other; ctgtaccaca gaagtcaact aacgcaagtc cattttttga aattttgagt tagttatata 60 atcagaatat gctttttgtg atctttttaa aggtattaaa acgtaagtcc tatatacaat 120 aaatgcatag atagcaagaa gtatactaac ttaagtccta aactatgata ggagtataga 180 aacgtatgtc cagcgtttac ataagaaaag taatacaaaa taatattttt gaagtataca 240 attgtaagtc cattgcaaca aatatttttg ttttgtctct cctgtaaaat ttgctttttg 300 tgacttgcgt tactcgactt ctgtggtaca g 331 // ID BEL-168_AA-I repbase; DNA; INV; 6448 BP. XX AC AAGE02024719; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-168_AA_; KW BEL-168_AA-LTR; BEL-168_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6448 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024719; Positions 33753 27306. XX CC Positions [5498-6055] - Integrase core CC 'AAATC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 38..6448 FT /product="BEL-168_AA-I_1p" FT /translation="MSKSTKSVKSTKSARSTRSAKNHCQACKAIDTDRMVS FT CCHCGSWWHFECVGVNDSIAEPDRVFNCPKCQKPSVQVPVRTTVGKVALSV FT NSSTSSRAAKARLELEKLEAQKALAMKRLELQRREQERKHKEAELLFQQKK FT AQEEASLQQLELQLEETMMNESFRLREIIAAEEEEDDDNESVESVQSSSSK FT VRQWRAFNSTMVPSTEQAPKITPTDIETTTATGIIPAGQDGSKQPTSNQGF FT LEEAMASAVSGISLGQSGMEPTLGGIIGRIESTIAPTGSMSNTGLPAALPG FT FSLSRNYTPLKQASNDSSKTPIVPNPDSLSYLRQSFVPNPSVGTPVDNLSA FT NAQLSGLNAVNQISANQSGSALAVTDRLRGSLPRQSNPERLEQGHQREQSV FT APSQLGPQRNLDPGPSIPAQSRQGQRNAPAEPPCVQWEPPRDPDAPIFERA FT VFPVVPAYCGPTPQQLAARHVMNKELPIFTGNPEDWPLFVSAYYNSTQACG FT FTDAENLARLQRCLKGHAFESVQSRLLLPSSVPHVLATLETLYGRPELLIH FT ALLQKLRRVPAPKQDRLDTLIGFGIAVQNLSDHLEAGRHESHLNNPMLLFE FT LVEKLPANMKLDWSLYKQRCGEVNIRSFAQYMQTLVRAATDVTLQYDPRQQ FT QSQQQRTTKEKSSKDKNFCGAHAAEGAAEPEAVGNATHSSKAACLIYKNPD FT HRVKDCIEFSKKTVDERWKLIQQFSLCRLCLGAHGRRPCKIRKQCDMSGCQ FT LHHHPLLHSEKKHRGKRSLDDSKKSKRDNNSIAPGVKIERSNSVHEPSEAK FT PSGSKVATNYHFSGKTTLFRIIPVTLHGNNRSVTVFAFLDDGSEKTLVDEE FT VVKQLGVVGEKQPLCLQWTANVKRTEANSQRVALEIAGRSCSAKHSLSDVR FT TVSKLELPRQSLRYADLAEAFPYLKGLPIDDYDDAQPRILIGNDNAHVTST FT LKLRDGQPGEPIAAKSRLGWTVYGSSLEKTVDRVHSFHICECQEDNQTLHG FT LVEQFFSVESLGISETPPVESAEIQRSKRILIETTKRVGQRFETGLLWKYD FT YFEFPDSYHMAVRRLQCLERRMQNETVIGESVRKQMSEYQEKGYIHKATKK FT ELDDSDPRRTWYLPLGVVINPKKPSKIRIFCDAAAKVDGISLNTMLMKGPD FT LLNTLLDVLFAFREKRVALCADLKEMFHQIKIRPEDRHAQRLLWREDPAEA FT PDVYLMDVATFGATCSPCSAQFVKNRNAEEHAKEYPAAAEAIIRKHYVDDY FT LDSADNEEEAVKLAQDVKHVHSLGGFNLRNWLSNSKDVLARVGEGDPVPNL FT ITEKCLQLDKHSSTERVLGMFWKPEEDVFTFSTAVASEADHPTKRQALRVV FT MSPFDPAGLLSFFLIHGKILIQDLWRAKTDWDQPIPEKLLGKWKRWINLFR FT YMDQISIPRCYFPQRSIQDIGSLQLHIYVDASEEAYACVGYLRALFPDGIH FT VALVGGKSKVAPLKAHSIPRLELMAAVIGVRLAKTILNGHSLKVDKVFYWS FT DSKTVLAWINSDHRNYRQFVACRVGEILSKSRAEQWRWISTKKNVADEATK FT WGKGPCLSNNSRWFRGEEDLYLPEEQWITKVLPANQTTDEELRSCMVHREA FT LAPQLVKWDRFSKWTRLQRSVAYIHRYVQNLRRTANKKQRMEGLLTQEELS FT NAEATIFRWTQSEMYSDEVTALVRARNERTNQQVRLEKTSLIRKLSPFMDE FT SGVIRSDSRIAAATYVSYDTRFPIILPKEHRVTHLLVEWFHQNYLHANGET FT VVNEKNQRFHISQLRSFVRTVARKCTMCKMKKPVPAIPRMAPLPAARIQAY FT VRPFSFVGVDYFGPISIRVNRTIAKRWVGLFTCMTTRAVHLEVVHTLSAES FT CKMAIRRFIGRRGAPVEIRSDRGTNFVGSSNELKQEMGKINHQLAETFTNA FT NTKWVFNPPGAPHMGGAWERLVRSVKTALAAMDTSRTPNEETLATILIEAE FT SVVNSRPLTYIPLETAQQEALTPNHFLLLSSNGVAQTPRNLTDPKQACRND FT WNLCRTMVDQFWRRWVREYLPTIARRTKWFEEVKPIEVGDLVVIVEEKIRN FT GWVRGRVAKISVGRDGRVREAVVQTANGMVHRPVSKLAKLDVAVSKAEPEI FT PDQPYGSGN" XX SQ Sequence 6448 BP; 1854 A; 1516 C; 1720 G; 1358 T; 0 other; aatctaaaag atcgttcaag atcgtaacga catcaacatg tcgaaaagca ccaagagtgt 60 gaaaagtacg aagagtgcga gaagcactcg gagcgccaag aaccactgcc aggcttgtaa 120 ggcaatcgat accgatcgga tggtcagttg ctgtcactgc ggttcctggt ggcattttga 180 atgcgtgggc gtcaacgata gcatcgcaga gccagaccgg gtgttcaatt gtccaaagtg 240 ccagaaacca tccgtacagg tcccggtacg gacaacagtg ggaaaggtgg cactaagcgt 300 aaactcatcg acttctagca gagcagcgaa ggcacgatta gagctggaga aactcgaggc 360 ccaaaaagct ttggccatga agcgtctaga acttcagcga cgggaacagg agcgcaagca 420 taaggaagcg gagttactat tccagcagaa gaaggcgcag gaagaggcat cactccaaca 480 gctggagcta caattggaag agacgatgat gaacgaatca ttccgcctac gggagattat 540 cgccgcagaa gaggaggagg acgacgacaa cgagagtgtt gaatccgtgc aaagctcatc 600 cagcaaggtt cgacaatggc gagcttttaa ttcgacaatg gtaccctcga cggaacaggc 660 tccgaaaatt acacctaccg acatcgaaac tacgacagcg accggaataa tcccggctgg 720 gcaggacggc agcaagcaac caactagcaa tcagggattt ctggaagagg ctatggcctc 780 ggcggtatca ggtatttcgt tagggcaatc aggtatggag ccaacgttag gaggaataat 840 tggtcggata gaaagcacca ttgcaccgac aggtagtatg agcaacacgg gtcttccagc 900 ggctctccca gggtttagct taagtagaaa ctatactcca ctaaagcagg cgtctaatga 960 tagttctaag actcctattg ttccaaatcc tgactcactc tcgtatctcc gtcagtcctt 1020 cgtgcccaat ccctcggtcg gtaccccggt tgataatctg tcagcaaacg cgcaattgtc 1080 cggtttgaac gctgtgaacc aaatcagtgc gaatcaaagt ggtagtgcat tagcagtaac 1140 agatcgacta cgaggatcat taccacggca gtcgaatcct gaacgccttg aacaagggca 1200 ccaaagggaa cagtccgtag caccgagtca actggggccg caacgaaatc tagatccagg 1260 accatcgata ccagctcaat caaggcaagg acaacgaaat gccccggcgg aaccaccgtg 1320 tgtacaatgg gagccgccga gagatccaga tgccccaata ttcgagcgag ctgtatttcc 1380 agtagtaccg gcgtactgcg gaccaacacc tcaacaattg gctgccaggc acgtgatgaa 1440 caaggagctg ccaattttta ctggtaatcc ggaggattgg ccgctattcg taagcgcgta 1500 ttacaactct acccaagcgt gcggcttcac agacgcggaa aatctggcaa ggttgcagag 1560 gtgcctgaag ggacatgctt ttgaatcggt gcagagccga ctgctattgc cgtcaagtgt 1620 tccacatgtt ctcgctactt tggagacact ctatggcaga ccggagttgc ttattcatgc 1680 gctactgcag aagcttcgac gagttcctgc accgaagcag gatagattgg acacgttgat 1740 cggttttggt attgcggtgc agaatctcag cgaccattta gaagctggaa gacacgaatc 1800 gcatctcaac aacccgatgt tattgttcga gctggtagag aagcttccag caaacatgaa 1860 gctggattgg tcgctgtata agcaacgatg tggagaagta aatattcgct cgttcgccca 1920 gtacatgcag actttggtac gtgctgcaac cgacgtcact ttacagtacg atccaagaca 1980 gcagcagtct caacagcagc ggacaactaa agagaagagc agcaaagata agaacttctg 2040 tggtgcccac gctgcagagg gtgcagcgga accggaagca gttggaaatg cgacgcactc 2100 atcaaaggca gcttgtctaa tctacaagaa tccggatcat cgagttaaag attgtataga 2160 gttcagcaag aaaaccgtag atgaacgctg gaaactaata cagcagttca gcttgtgtcg 2220 actttgtctg ggagctcacg gaaggagacc ctgcaaaatt cgcaagcaat gtgacatgtc 2280 tgggtgccaa ttgcatcatc atccgttgct gcattctgaa aagaaacata gaggcaaacg 2340 aagcttggat gatagcaaga aaagcaaacg agataacaac tccatagcac caggtgtcaa 2400 gatagaacga agcaactccg tgcatgaacc atctgaagcg aagccgagtg gatcaaaagt 2460 cgctaccaac taccatttca gcgggaaaac gacgctgttt cggataatcc cagtgacact 2520 ccacgggaac aaccgttccg tgactgtatt cgccttttta gacgacggat cggagaaaac 2580 attggtggat gaagaagtgg tgaagcaact cggcgttgta ggagaaaagc agccgttgtg 2640 cctccagtgg accgcaaacg taaagagaac ggaagcaaat tctcaacgag tcgctttaga 2700 aattgcggga cgctcatgtt cagccaaaca ttccctgtcc gatgtgcgca ctgtaagcaa 2760 gctggaatta ccacggcagt cgctacgata cgctgatttg gcggaagcat ttccatacct 2820 aaagggtctg ccgatcgacg attacgatga cgcgcaacct cgcatcctca ttggcaacga 2880 taatgcgcat gtgacatcaa cattgaagtt acgggatgga cagccgggcg aaccgattgc 2940 cgccaaatca cggttagggt ggacggttta tggatccagt ctagaaaaaa cagtggacag 3000 agttcacagt ttccacattt gtgaatgtca ggaggacaat cagacgttac acggactggt 3060 ggagcaattt ttctctgtgg aaagcctggg aatatccgag acgccgcctg tagagtccgc 3120 tgaaattcaa cgatccaagc ggattcttat agaaacaacc aagcgggttg gtcagcgatt 3180 tgagacgggg cttctctgga agtatgacta tttcgaattc ccggacagct accacatggc 3240 agttcggcga ctgcaatgtt tggagaggcg catgcagaac gaaacagtca tcggagagag 3300 tgttcggaag caaatgtccg agtatcagga gaagggatac atccataaag cgaccaagaa 3360 agagctcgat gactcggatc caaggcgtac ttggtattta ccgttaggag tggtaatcaa 3420 cccgaagaag ccatccaaaa tccgaatttt ctgcgatgca gccgcaaaag tggatggaat 3480 ctcccttaac acgatgttga tgaaagggcc cgatcttcta aacacgctac tagatgtact 3540 cttcgccttc cgagaaaagc gtgttgctct gtgcgcagat ctgaaggaaa tgttccacca 3600 gattaaaata cgaccagaag atcgacatgc gcagcgcctg ctctggcgtg aagatcccgc 3660 tgaagcaccc gatgtctatc taatggacgt cgccacattc ggagctacat gctcaccgtg 3720 ttcagcacaa tttgtaaaga acagaaatgc agaagaacac gccaaggagt acccggccgc 3780 agcagaggca atcattcgaa agcattatgt ggatgactat ttagacagtg ctgataatga 3840 agaagaggct gtgaaactag cacaagatgt caaacatgtt cactcactcg gaggattcaa 3900 tctacgaaac tggctttcca actccaaaga tgttctcgca cgggtcgggg agggagatcc 3960 agtcccaaac ctcatcacgg agaaatgctt acagctggac aagcatagct ccactgaacg 4020 cgtacttggg atgttttgga agcctgagga agacgtgttc acgttttcca ccgctgtggc 4080 cagcgaagca gatcacccaa caaaacgaca agcgctacga gtggtcatga gtccattcga 4140 tccggccgga ctgttgagct tcttcctcat acacggcaag attctgatac aggatttgtg 4200 gcgagcgaag accgattggg atcagccgat accagaaaaa ctactcggaa agtggaagcg 4260 gtggataaat ttgttccgct acatggatca gatcagcatt ccgcgttgct acttccccca 4320 acgttcgata caggacatcg gatcattgca gctgcacata tacgtcgacg caagcgagga 4380 agcctacgcg tgcgttggat atttacgtgc tctgtttccc gacgggattc acgtagcgct 4440 ggttggcgga aagtcaaagg tggcaccgct aaaggcacat tctattccac ggctggagtt 4500 gatggccgcg gttatcggcg tgcgcttggc caagaccata ctaaacgggc actcgctaaa 4560 ggtcgataaa gtattttatt ggagcgactc aaagacggta ctcgcttgga taaactcgga 4620 ccaccgcaac taccggcagt ttgtcgcatg cagagtggga gagatcctgt caaagtcgag 4680 agcggaacaa tggcgttgga tatctacaaa aaagaacgtt gcagatgagg cgaccaagtg 4740 gggcaagggt ccctgtctgt ctaacaatag tcgatggttt cgtggagagg aagacttgta 4800 tctgccggaa gaacagtgga taactaaagt gcttcctgcc aatcaaacga cggatgaaga 4860 gctgagatcg tgtatggtac accgcgaagc attagcgcca caacttgtca aatgggatag 4920 attctcaaag tggacgcgat tgcagcgatc agtagcgtac attcatcgtt atgtgcaaaa 4980 tttgcgacgt actgcgaaca aaaagcaacg tatggaagga ttgcttacgc aagaagagct 5040 aagtaatgct gaagccacaa tattccgctg gacgcaaagc gaaatgtact cggatgaagt 5100 gacagctctt gttcgtgcaa ggaacgagcg tacaaatcaa caggtgcggc tggaaaagac 5160 cagcttaatc cgtaagctgt cgccatttat ggatgaatcg ggagtaattc gttcagatag 5220 tagaatagct gccgccacgt atgtttcgta tgacacgagg tttcccatca ttctccccaa 5280 ggaacaccga gttacacatt tactggttga atggttccac cagaattatc tgcacgccaa 5340 cggagaaaca gtcgtgaacg agaagaatca gcgcttccac atctcccagc tcagatcgtt 5400 cgtccgcacg gtagctagaa agtgtactat gtgcaaaatg aagaagcctg ttccagcgat 5460 tccccgaatg gccccactac ctgcggcccg gatacaagcc tacgtgcgtc cattttcttt 5520 cgtcggtgtc gattactttg gaccaatcag catcagagta aatcgcacca tcgccaaacg 5580 gtgggtcgga ttgttcactt gtatgaccac tcgagcagtc catctggagg ttgtccatac 5640 gctatcagca gagtcctgta agatggctat caggcgcttt atcggacgcc gaggagcacc 5700 agttgaaatc cgcagtgata ggggcacgaa tttcgttgga tcgagtaatg aattgaagca 5760 ggaaatgggc aaaatcaatc accagctggc cgaaactttc acgaatgcaa atacaaagtg 5820 ggtatttaac ccaccaggtg caccgcacat gggtggggcg tgggagcgtt tggtgcggtc 5880 agtgaaaacc gcacttgccg cgatggatac ttcacgaaca ccgaatgagg agacattggc 5940 aacaatactg atagaggctg agagtgtagt gaattccagg ccgcttacat atattcctct 6000 ggagacagcg caacaagaag ctctcacgcc aaaccacttc ttacttctta gttcgaatgg 6060 agtggcgcag acaccacgaa atttgacaga cccgaaacaa gcttgcagga acgactggaa 6120 cctgtgcaga acgatggttg atcaattctg gcgtcgatgg gtacgagaat accttccgac 6180 catcgcacgt cggacgaagt ggtttgagga ggttaagcca atcgaagtcg gggacttggt 6240 cgttatagtt gaggagaaga tacgcaatgg atgggtgcga ggacgtgtag ccaagatctc 6300 cgtaggacgc gatggacgag tacgagaagc tgtagtgcag actgcgaacg gcatggttca 6360 tcgaccggtt tcgaagctag cgaagttgga tgtagcagtg agtaaagccg agcctgagat 6420 accggaccag ccttacgggt cggggaac 6448 // ID Chapaev-1_BF repbase; DNA; INV; 6963 BP. XX AC scaffold_658; XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6963 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 781-781 (2007). XX DR JGI v1.0; scaffold_658; Positions 159716 166678. XX CC Chapaev-1_BF belongs to the Chapaev superfamily. Hallmarks of the CC Chapaev transposons are 4-bp target-site duplications, terminal CC inverted repeats with the conserved '5-CAC and GTG-3' termini, CC and the Chapaev transposase. The Chapaev transposase is CC characterized by the conserved D-x(60-80)-D-x(220-290)-E CC catalytic triad. Chapaev transposons populate genomes of CC different animals, including sea urchin Strongylocentrotus CC purpuratus, amphioxus Branchiostoma floridae, starlet sea anemone CC Nematostella vectensis, sea hare mollusc Aplysia californica, CC mosquitoes Aedes aegypti and Culex pipiens, and nematode CC Caenorhabditis elegans. The N-terminal portion of Chapaev CC transposase in Chapaev-1_ACa, Chapaev-2_ACa, Chapaev-3_ACa, CC Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, Chapaev-2_NV, CC Chapaev-3_NV, and Chapaev-1_SP is similar to the N-terminal CC portion of RAG1 (100-370 aa in the human RAG1). It includes a CC novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS join(1041..2234,2475..2758,2871..3098,3299..3538, FT 3647..3738,3858..4069,4184..4555,4693..4833, FT 4965..5169,5294..5568) FT /product="Chapaev-1_BFp" FT /note="transposase." FT /translation="MSACSTDSQLQEHIKNLAKLCRFCGEFVQTNQDRAKQ FT IKLLSCVEFADKIKEVWLVDVQKDRSYLHPPSFCHRCRCKLYRKSDTLPTR FT VSWYPHWYGKTCIPCEKVATLTKGGRPVKKKSPGRPKTQKNEKGKNEKGEK FT VEECAKTMSEETFSELLSTLDDFPLDEDRFKPICSLEAYKCGICQCFVYLP FT VQALCAHVFCFECLHKLFQIAGNNNADCAICKAQIKASDLTSVHRTWRSCY FT AQLTFNCRNCCAQLRLDELKYHNECPTNTTQMAAPSHTTAPPLLPGRKYLL FT MVPHTNTQVIPPPVHVPVPVPPSTPRTVLTPPSTTMTTTPAPATCPPPSSR FT DAQPSTSVTSDKVTVGLSEKGPLTPDQQDAFYSMNCRMRSLSPDPRIVEAK FT TPGQPLRLMHVPKPRKSSAEAAPSTVRKKARVMEEVRDRVSGGDPGVLLRE FT ELRRVPRDELRSMLHQLKFDQVVIPSGHQLEAKVKIGLNYNQLGKLRSWLK FT MYNISMESERLTRRAARERLTAYDMHAETLPFCVKGAKKDGSDPTKVQLLP FT CAYISPVGAAIFDYLEKCQESECLTWHGGKIPQDEIWVKVGGDHGGGSFKL FT TFQLLNRDHPNSRQNTVVFSIFEAKDSRENLMLAIGRYAQELGDLQGSKWR FT AKDGKEHTMRVFGTGDYAFLCLWYGLSGASAVHPCLWCDITKMEMNNPESE FT DRRLSIPPRTLATLQQHYRDFMEQANGKLSLAKKHHNVIAPVMFALPVDQV FT IIPGLHISLGLYLRAFKLMETDMHELDLKLQSYLCGVLHEGEVSKEALLAD FT VHLGKFRCYVQAIEQARGLDDEADRVEEELHDQENELAWLACCNGTTEKLD FT EAVFAEACSMVEELVRQKDSLRSKAEEMRLKASIKKGDGPLTSQLDDLLQT FT LRVKRQAFHSNSFNGNHDDAIDELTGMVERTVTKIMDKFPDLPLSLVPRAT FT STAGTYKELFSRFARCHKLYSHGGPMEETAVCELDKAIKDFMSYYRSNVPN FT GTVPIKMHMLEDHVVPCIRRWGFGLGFMAEQGVEHVHALFNSLARPTCTIP FT DPVARLKSTLTSHLIGVCPTNTIG" XX SQ Sequence 6963 BP; 2021 A; 1546 C; 1485 G; 1911 T; 0 other; cacgtcaccg cctaaagttc tcatcctgct cacttagtgt gcgcctgcgc ggcctcgcga 60 ggcactccca aatatggtca tatcgtgtta cgccggcacg ggcagcccac gtgccggcat 120 agcaaccgaa tatttacttg ccgtaactaa ggtgtttttg agcaaataaa ccacatatca 180 acctcaaaga agttgtctta cttgagttta taccctgccc tgtaaggtct cattgatgaa 240 aggagacctg tggggatata taatttctaa tcacaaacat actgggacat gacacgcact 300 tccgggtaca cggcttcacc gggcaaactt ttatcatacc cttgactttg ctcttcttca 360 tctgagcact gctcctctcc cggagttatc caccggtaat ctttgtatct gttgtcaatc 420 tcatacttac cactactaat atccgtgctt cttttcttca tttgctttct ttcctttgtt 480 acgagcggat aaaatgacaa agagtgcacg acctgtaggt catatttaga acaatatcag 540 ttactgtaga gtcttgtatt tcgttctata ctgttctaaa ttaaattgta tccctacgaa 600 aattgttcta cctgaataaa atactcagga gactgatcat atcaaattgt tgttgtattt 660 aagtagataa atatcgctga atgtgccccc ctccccacca tcagagttct ttatgtaatg 720 aatgttacaa agagctgaga gaaaaaacaa cagggagaca tttctttgat gcttattaaa 780 cttttcttgt gacaacaata atacaataga tacaaaatac agactgtctc ctagaattga 840 taatgtgaat actttataga tgagaaacat aggagacatc tctttgctgc actactaaaa 900 ttatgttgtc aaaacaatac tagtagtaca ttggataaac aattcatatt gtctattcat 960 atagtgaata tatacctgta ttgcaatttt cagatgtaga tgtccatgaa ccttgctagt 1020 ttttttctca actgtacaaa atgtctgcct gcagcactga ttcacaacta caagaacaca 1080 tcaaaaacct tgcaaagctt tgccggtttt gtggcgagtt tgtgcagaca aaccaagaca 1140 gggcaaaaca aatcaaactg ctttcatgtg ttgagtttgc cgataagatt aaagaagtgt 1200 ggttggtaga tgtccaaaag gacaggtctt acctacatcc cccatccttt tgccataggt 1260 gtaggtgcaa actgtacagg aaatctgaca cgctaccaac acgagtgtca tggtatcctc 1320 actggtatgg aaaaacctgc attccttgtg agaaggtagc aacactgaca aaagggggga 1380 gaccagtaaa aaagaagtct ccaggtaggc caaaaactca gaaaaatgag aaaggcaaaa 1440 atgagaaagg tgaaaaggtg gaagagtgtg ccaaaactat gagtgaagag actttctctg 1500 aactcttaag cacattagat gacttccctc ttgatgagga tcgattcaaa ccaatctgta 1560 gtttggaggc atacaaatgt ggtatttgcc agtgttttgt atacctgcct gtgcaagcac 1620 tatgtgctca tgtgttttgc tttgaatgct tgcataagtt gttccagata gctggaaata 1680 acaatgctga ctgtgctata tgcaaagcac aaataaaagc aagtgacctg acatctgttc 1740 acaggacctg gaggtcctgc tatgctcaat taacttttaa ctgtagaaac tgctgtgcac 1800 agctgaggct ggatgaactt aagtatcata atgaatgtcc tacaaacaca acacaaatgg 1860 ctgctccttc tcacaccaca gcaccgcctc tgctacctgg aaggaaatat ctccttatgg 1920 tgcctcacac taacactcag gtgatcccac cacctgtaca tgtacctgta cctgtacctc 1980 cttccactcc tcgcactgtg ttaacccctc cctccactac catgactact acacctgccc 2040 ctgctacctg tcctcctcct tcctcccgtg atgcccaacc ttccacatct gttaccagtg 2100 ataaggtaac agttggtcta agtgagaaag gtcccctgac acctgaccag caggatgcct 2160 tttattcgat gaattgtcgg atgagatctc tctcccctga cccgaggatt gttgaggcaa 2220 aaacaccagg acaggtatgt atcgaaagtt ttctctaatt gacattttac atttatcgtc 2280 gtatccattt ttgattatac tgctgctggg atccctgtca caggggtagg cagcagtcgt 2340 aagagcagct atagattctg tctccacccg ccctacctct ttgtcttggt ttgcaaaaac 2400 tctctttccc tcccaccttc caccttcccc cctcccacag ataatgaaat gtgtgtctct 2460 tttttcaatt ttagccccta cgtctcatgc atgtccccaa gcccagaaag tcctcagcag 2520 aggcagcccc ttctaccgtt cgaaagaagg ctagggttat ggaggaggtg agagacaggg 2580 tgagtggtgg ggatcctgga gtgctgctcc gggaggaact gcggagggtg ccacgggacg 2640 agctgaggag catgctgcac cagctgaagt tcgatcaagt tgtcatccct tctggtcacc 2700 agctggaggc aaaagtaaag ataggactca actacaatca acttgggaaa ctcagaaggt 2760 aaaattgaaa accattttct gtagacaaca taagttacat ttacaacatt cttgtacaaa 2820 gaaccttact ggtatacata gtctaaacag agttgtctgt tctgttatag ctggcttaag 2880 atgtacaaca tctcaatgga gagtgagcga ctgacgagaa gggccgccag ggagcgcctt 2940 actgcatatg acatgcacgc agaaacccta cctttctgtg ttaagggagc gaagaaagat 3000 gggtcggacc ccacgaaggt gcaactgctt ccgtgtgcgt acatctcccc agtcggcgct 3060 gccatctttg actacctgga gaagtgtcag gagtcagagt aagaaagcat taattttctc 3120 agcttgcatg tttgtctcat ctcatctaac ccttagtcta tttgacagtt ggtgcaacac 3180 agaagatctg gcaaccagct atggtgtcac cctgtgtaca taaaaacaat gttattgctt 3240 atatatgtct aggttgaaaa gtaaaagtaa tttacaggaa attatctatc ttctacaggt 3300 gccttacatg gcatgggggg aaaatccccc aagatgagat ctgggtaaag gtcgggggag 3360 accacggggg aggctccttc aaactgacct tccagctgct caacagggac catccaaact 3420 ctcggcagaa caccgttgtg ttcagcattt ttgaggcaaa ggacagccgg gaaaatctga 3480 tgcttgccat cggccgttat gcccaggaac tgggcgatct gcaagggtca aagtggaggt 3540 gagcttccag ttagtactta ctttgacaag tagctaaacc tgtgttaact ttgtaccatt 3600 ttcttatgtc acctccaaat gaacctgatt caaattcaca ttgcagggcc aaggatggca 3660 aggagcacac gatgcgtgtg tttggcacag gagactatgc cttcctgtgc ctttggtatg 3720 ggctgtccgg agcaagtggt aagggccctt tccactttct ttcttgaaaa cagagccttc 3780 aaaaatgtta atgacttatt tctcataaag atatatagtt atactatgta ataattgttt 3840 taacactttt accacagctg tccacccatg cctgtggtgt gacatcacca aaatggaaat 3900 gaacaacccg gagtcagaag atcggaggct aagcatccca ccaagaacct tggcaaccct 3960 gcagcagcac taccgtgact tcatggagca ggctaatggc aagttgtcat tagccaaaaa 4020 acatcacaat gtcatcgccc ccgtgatgtt cgcattgcca gtagaccagg taaactaatt 4080 aaaaaatatc aatttgacaa tactgcgaaa tggaccattg cttacttgcg gaacaccaca 4140 taattgttta aacttatggc aaaatctgtc tatttgttca caggtcatca tacctggtct 4200 acatataagc ttggggctat accttcgtgc cttcaagcta atggagacag atatgcatga 4260 gcttgaccta aagctgcagt cgtacctgtg tggagtgcta cacgaggggg aggtgtccaa 4320 ggaggccctg ctggccgacg tccacctcgg aaagttcagg tgctacgtac aggccatcga 4380 acaggcccgg ggtctggacg acgaggctga ccgcgtggag gaggagctcc atgaccagga 4440 aaatgagctg gcttggctcg cctgctgcaa cggaacaaca gagaagttag atgaagctgt 4500 ctttgcagaa gcttgctcca tggtagagga gcttgtgcgt cagaaagatt cactggtaaa 4560 ttcaccatgt acaaagttgg cacatacata ttcttaacaa gatttattaa agatcaatat 4620 atatggatga atgtaacttg attatataat tgggatatgt cttacattat atacatctac 4680 ttattcatgc agaggtctaa ggcagaagaa atgcgactga aggcgtccat aaagaagggg 4740 gatggaccct taacgagcca actggatgac ctcctacaga cccttcgtgt gaagaggcag 4800 gccttccaca gcaactcctt caatggaaac cacgttaaca gaatgttgca ggcaagtact 4860 aagctagtta ttatttttag tttgggagtt gatgtcggtc acacagactt tgtgccttgt 4920 ataatattga gtctattgtg acttattcct ttttttatac acaggatgat gccatagatg 4980 aactaacagg gatggtggaa cgtacagtga ccaagatcat ggacaagttt ccagatctgc 5040 cgctatccct tgtgcccaga gcgacgtcaa cggcggggac atacaaggag ctgttcagtc 5100 gttttgcaag atgtcacaag ctctactccc atggaggccc catggaggag actgcagtat 5160 gtgagctagg tatgtacatg tacaaactat tttcttagaa tattaagtac attcttctac 5220 ttaaccaacg ttgtcaatac aacccgcttc ttaacgtaca ctcatttatg cttactctta 5280 tttgcttttc cagacaaagc catcaaagac ttcatgtcat actacagaag caatgtcccc 5340 aatggcaccg taccaataaa gatgcacatg ctagaagatc atgtggttcc ctgcatcagg 5400 aggtgggggt tcgggctggg gttcatggca gagcagggag ttgagcatgt ccatgcactg 5460 tttaactccc tggcaagacc cacctgtacc attcccgacc ctgtggcacg actcaagtca 5520 acactcacaa gccatctgat cggggtctgc ccaaccaaca ccataggcta ggcttcccat 5580 gatcttacat ttcaatgata cacaagaaat gcacaagtaa aaagtaagat acaaaaacag 5640 tgcaggtact tcggagtgtt ctacatgtaa tgtcaactcc cctgcctgta gtcacgtgaa 5700 aaaatgttgc atgaactgca aattagtacc aatacaatgt cagacagcat gtagactagt 5760 tattactagt tagttagaat tagtgttact catttagtga tttactgtat attatgtact 5820 gtataatgtc aatcggcatg tagtgtgaat tagtattact cattcagtga tttactgtat 5880 actatgtact gtataatgtc aatcgacatg tagtgtgaat cagtattact cattcagtga 5940 tttactgtat actatgtact gtataatgtc aatcggcatg tagtgtgaat cagtattacc 6000 tgcataccat cttcagtcaa ggaaggagtc gtcttctgtc atggactgat cttgtcactt 6060 caggtcatca ttactgtata caaaaattat gtagtgccaa tcagtacgac tgtatacaag 6120 ttacaaatta tgtcaatatt gtgggagaaa tgacaggagt gagtaatctg ttttggaaat 6180 tatagaaaac agagaaccaa tagctaaata tgtaagtact atccgtgcct ttccatgata 6240 ttgtacgtaa cagggagtaa agaatagaca gcattcattc tactactcac cttgctagaa 6300 agcatgtatt taattttttg ttgtcatgga aacttcttga tagaaataaa ttgttccgca 6360 agacgtaaac aaatacttct acaagttcta gcgccagtta tcttcattat gttgtcatat 6420 aaaccaaaac actacaaaat aatatgcatc agggcttcgg atatcaagca atatggcgtc 6480 gaaacgaaca aaacatacct gtgttggtcg atttccctcg tggggtaaaa aaaccaaacg 6540 ctgcagagct gtccacagga tgacgtttgg tagctttgac gaactatatc cgtgcttttt 6600 agcggcatca aaactccgtt tgatgtaaaa atctcgattt tacttctaca catgatcttg 6660 cacgtagttg tgctgacgcg gcacttccag cggcccgatg aacgctaaat ttcggccgaa 6720 acgcgagaaa aatatatcat aacaaagcca ggaacaatgg ctgccgtctt ctccaaaatg 6780 aaggaaccac gtgatcgaaa tcctggtata tcattggtta gaagtgatgg tagcagttct 6840 agacctgcaa tgttgccaaa tgtcgcatgg tggtgcatgg ccaactgacc tcattaaata 6900 tccaatacgt cacctgcgca gtctaggatg cgcagactaa gctgctggta agattcagac 6960 gtg 6963 // ID BEL-238_AA-LTR repbase; DNA; INV; 432 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-238_AA_; KW BEL-238_AA-I; BEL-238_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-432 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 932-932 (2011). XX DR [1] (Consensus) XX SQ Sequence 432 BP; 149 A; 77 C; 81 G; 125 T; 0 other; tgttagctga cgcagtattc acgcatcttc cgtccagacc ttaatcattg ggactcatca 60 accggtgagt ggatgccgat aagggtgttc gctatcaccc agtgagtacc gaactagatg 120 taaacacata aagaagctga gaatagtaat agaaagcaaa aattgataaa tctccataag 180 ctaatagcaa gtcattggta gcgtgaaatc ttttgctaaa atattgaatt cctagtaaaa 240 caataagttc taaattagtt atttaagtgc caaataagct gcgtgtgtta aattcataga 300 ttaggtaagc agaatgtctt caaaccaacc ctacatcaac taaaattgtg ttttataaac 360 caggtcttcg catgtgcttc gataataggc aattactgta ttatcctatt gtaccggtaa 420 ggaaatgtaa ca 432 // ID Helitron-1_NVi repbase; DNA; INV; 6398 BP. XX AC . XX DT 18-FEB-2009 (Rel. 14.02, Created) DT 18-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Helitron DNA transposon from Nasonia vitripennis - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6398 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(2), 481-481 (2009). XX DR [1] (Consensus) XX CC The 5'- and 3'- end are incomplete. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(2..1960,1852..3741,3669..5042,5030..5701, FT 5554..6384) FT /product="Helitron-1_NVi_1p" FT /translation="KRKYYQENREXILKKQRLRYHLNKEKRLASVKXYYES FT NKEKILVQRKNKRHEMSLKRSVVKXKXXLAKKIXKKYKSIXIKQSXKKXIQ FT TKEKYIQNVMQKLDCPSVLDRRIEADKLVTSAFYVRDNHISKVKRELXNXK FT EKIEVSLTRLGGEPIDDDNCSYALTALCGLPKHTSSSEGYFGESTYDFTNG FT QINDKLETHIILNEQGQTMNILPTIESNTKSKIWQCNDYCKKVDPDVLNEL FT NNLFQDLIDVTYINAYNFLKKIDNCSAQSHNSEKQGHPIICYSKPLLCTSK FT FLKLNILSVHYPKLRTIKRFLYKIIDIYKQIEKIESALNNIDIEVLKNIAL FT NAEQAAIFINSTNDEICLNETQIKNKYSQGIKEYTKIDMDPPKIPCVSCER FT LCCERSMESVDKYLDKTNLVEELLDIHIPEKQNYWKRLKELYDSPNFNNGF FT ICKCCSGKLKKYILPSTCVLNELYVKEVPDEIKTLNTYERMLIQRAKAFQT FT IVKMGTVMKKNLPNYCKIDKVTGRTFHLPLPLEETLKKICPDTDPLNINHE FT MFILIRGNPTKGKIIWEDYVDVKKIWHALHWLKRWNPLYFNIQIPSCPEQI FT KDILGNLNLQYEDLAXDNKMIIHKIMSMNTILKIIYLIKMLTKLIRKQTYM FT KSXKNDNTQNNEYEHXSENNISNQNVDKIDTETNIHEKPQIAALLTQKSQS FT DPYYEQYSIYPLHTKXINETDSKLYQMLQIYATALDVRAEDLDLKCFPDLY FT PYGMYGQHEKRSVSLRDYDYIRSRLTSKHXQFRLNIQYLFYLLFNNNIRQI FT SAGIFHKLNVVNPRYKYTVKELTEXLEQDQLDSNLESIFSRLRGTEAYWKQ FT VRNDLECMMNEYGPATFFITLSPGEWMWSALAEYIREANXWENDKRSISEL FT IAADPVSASRYIRNRFQAMLAYILSSAHPIGKVAHYYYACEYQGRGLPHYH FT CLFWIHDAPVYGKAPNKEIQKFILQNITCRIPNKHVSPELHRRVMAYQRHT FT HNNYCMRKKKTKTGFKSACRFGFPRPITPKLVLREVSVAIANKRKLRNKSR FT FYDLPRGPDEQCINDYMPALLVLWEGNNDAQFICESSYLLTKYVSSYGTKG FT EKSTVDFSDIQSNKSLCSRLWSFSMRALNHRECGAIEAADTLLGHPLHATD FT TDTVIRWLDVRMIRNRKLKPFKEIKKLPPDSTDIFYDSLIDSHYPNRPQEL FT NSLCLYDFAKWFDIIKKKPVYKAEYYEMPGGLFCKKRKKPYLINHWKYNPI FT INQKIIIIHYYYYFNLGGIPMNSNNQPEXYYXSLLLLFQPWRDTDELKNGE FT DSFTMAFQKRQCELXDAMKYHDKCGXIREAMEKMNEIIQNQLLETQENKSN FT PDNLPEGCVPIEIENAIKDFKDMAEKVVMNEADIENSISRFNVDQSNIFKK FT ITSVLQSNDKILRIFISGPGGTGKSFLIETIVAWNKXIRKKETAVTAPTGI FT AAYNVSGLTIHRLLHLPVEHGCTAKYKELSSAALKQIRQSLENVDLIIIDE FT ISMVSNITLMYIHLRLTEIFNTSECDDGWFGKIHIIVFGDLLQLPPVREKF FT AFVEVTKKEIEKYIGAINSFNLWTLFEYDELTINMRQKNDNQYNEILSRLR FT LGFXTDTDINVLQQRKLKFSSTNSLEIMQELCAVYEKLPTDTVCLLPTRNM FT CDILNDAMLXRIDSEDINLIANDSFKCPKNLQKKVLKMLDEDKDNACGIER FT VIKIKINSKVMIRRNIDVSLDCLIGLVNGTIGSVISVAKDKGDEIISVRIK FT LKDGTEHSIPKLEYKFVIMDKVYIIRQQFPICNSYGITIHKSQGLSLENAI FT VEAGNSIFCSGQTYVALSRVTKLQGLHLINFDPTNVKADSSAILQYNRLRQ FT KYRPDLQALVIQNDCMFSRPXAVQDRRWTVSKHILDIQANNSKLNIYLTVI FT GLFMVSQMRIMYLVMLTLVYRXCFIVHLLEGFYWRPKVLTFIKYXSDSNWV FT IHGFPNEDNVSCYANACIQSLFHCSSVRRILLEAKGFDLLKHSFNQYIAKT FT PVNIKALREYANEQYSACEQQDAAEFLLHLCNASMFLQSVIKHEINVVTRC FT PNCDERNSITTSNYILXLCLPDPMHIXTLQEVFDYNIDNWKNVEGECSNHC FT ACKRLQKTTVHVKNTIIILQLLLFSVDHQTTXKITNFRMKDIPKTVLKINK FT KKYKVINCIFHHGKSIETGHYTNVVRQNNHWFEINDMYIKQRPWPRGAKDA FT YLLFLEEITGN*" XX SQ Sequence 6398 BP; 2436 A; 902 C; 1049 G; 1945 T; 66 other; caaacgtaag tattatcaag aaaatagaga amaaatttta aaaaagcaaa gattacgtta 60 tcacttaaat aaagagaaac gattagctag tgtcaaaaam tattacgaat caaataagga 120 aaaaatttta gttcaaagaa aaaacaagag acatgaaatg agcttaaaac gaagcgttgt 180 taagwcwaaa aktagsttag ctaagaaaat wrcaaagaaa tacaaatcta tcannataaa 240 acaatctrat aaaaaattna twcaaacaaa agaaaaatat atacaaaatg ttatgcaaaa 300 attagattgt ccatctgtat tagatcgtag gatagaagct gataaattag tcacrtctgc 360 tttttatgtt cgagataatc atatttcgaa agtaaaaaga gaattgcwaa attktaagga 420 aaaaattgaa gtttcyttaa ctagattagg tggtgaaccw atagatgacg ataaytgttc 480 ttatgcactt actgctttat gtggtttacc taaacatact tcttcttcgg aaggatattt 540 tggcgaatca acatacgatt tcacaaatgg acaaattaat gataaactag aaacacatat 600 tattttgaac gaacaaggac aaacaatgaa tattctacca acgattgaat caaatactaa 660 aagtaaaatt tggcaatgta atgattattg taaaaaagtt gatccagatg tattaaatga 720 acttaacaat ttatttcaag atttaattga tgttacatat ataaatgctt acaattttct 780 gaaaaaaatt gataattgtt cagcacaatc tcataattca gagaaacaag gccatccgat 840 tatttgttat agcaaacctt tgttatgcac atcraaattt ttaaaattaa acattttatc 900 cgtccattat ccaaaactcc gcaccatcaa acgattttta tataaaataa ttgatattta 960 taaacaaatt gaaaaaatcg aaagtgcttt gaataatatt gatatcgagg tattgaaaaa 1020 tattgcatta aatgctgaac aagcagcaat atttattaat agcactaacg atgaaatttg 1080 cttaaatgaa actcaaataa aaaacaaata tagccaggga ataaaagagt acacaaaaat 1140 agatatggat ccacctaaaa ttccttgcgt ttcttgcgaa agactttgtt gtgagcgcag 1200 tatggaatct gttgataaat acttggataa aacgaatttg gtggaagaat tactcgatat 1260 acatattcct gaaaaacaaa attactggaa aagattaaaa gaattgtacg attctccaaa 1320 ttttaataat ggatttattt gtaaatgctg cagtggtaag ttaaaaaaat atatattacc 1380 atctacatgt gttttaaatg aactttatgt taaagaagtc cctgatgaaa taaaaacttt 1440 aaatacrtat gaaagaatgt tgatacaacg agccaaagct ttccaaacaa ttgttaaaat 1500 gggtacagtg atgaagaaaa atttaccyaa ttattgcaaa attgataaag tcacaggtag 1560 aacttttcat ttaccgttac ctttagagga aaccttaaaa aaaatttgtc cagataccga 1620 tcctctaaat attaatcatg aaatgtttat tttaatacga ggtaatccga caaaggggaa 1680 aattatttgg gaagattatg ttgatgttaa gaaaatatgg catgctttgc attggttaaa 1740 acgttggaat cctttatatt ttaatataca gataccgtct tgtcctgaac aaattaaaga 1800 cattcttgga aatctaaatt tacagtatga agatttggct amagataata aaatgataat 1860 acacaaaata atgagtatga acacsattct gaaaataata tatctaatca aaatgttgac 1920 aaaattgata cggaaacaaa catacatgaa aagccrcaaa tagctgcatt actaacacag 1980 aaatctcagt cagayccata ttatgaacag tattctattt atccattaca tacaaaaara 2040 attaatgaaa ctgattcaaa attatatcag atgctccaga tatatgctac tgctttggat 2100 gttcgtgctg aagatttaga tttaaaatgt tttccagact tgtatcctta tggaatgtat 2160 gggcagcatg aaaaaagatc tgtaagtttg agggattatg attatataag atctagattg 2220 acatctaaac acycacagtt tagattgaat attcagtacc tattttattt gttatttaat 2280 aataatataa gacaaatcag cgctggaatc ttycataaat tgaatgttgt taatccacgc 2340 tataaatata cagtaaaaga acttacagaa maattagaac aagatcaatt agattccaat 2400 ttagaaagta ttttttctcg tttgcgaggt actgaagcat attggaaaca ggttagaaat 2460 gatttagaat gcatgatgaa tgagtatgga ccagctacgt tttttattac tttaagccct 2520 ggtgagtgga tgtggtctgc attagctgaa tacatacgcg aagctaatkg ttgggaaaac 2580 gataaaagat ctattagcga acttatagct gctgatcctg tgtcagcttc aaggtatatt 2640 cggaacagat tccaagcrat gctagcttat atactttctt cggctcatcc aataggaaaa 2700 gtagctcatt attattatgc ttgtgaatac cagggcagag gattaccaca ttatcattgt 2760 ttattttgga ttcatgatgc accygtttac ggtaaagcac ctaataaaga aatccaaaaa 2820 tttattttac aaaayattac atgtcgaata cctaataaac atgtttcacc ggaacttcat 2880 agacgtgtta tggcttatca acgtcataca cataataact attgtatgcg taagaaaaaa 2940 actaaaacag gtttcaagag cgcttgtcgt tttggatttc cccgtccaat aacacctaaa 3000 ttagttttga gagaagtttc tgtagctatt gcgaacaaaa gaaaattaag aaataagagc 3060 agattctatg atttaccacg aggaccagat gaacaatgta ttaatgatta tatgcctgct 3120 ttgttggtac tttgggaagg taataatgac gcgcaattta tttgcgagtc ttcttacctg 3180 ttaacaaaat atgtatcctc ttatggaact aaaggagaaa aaagtactgt agatttttcc 3240 gatattcaat ccaataaatc tttgtgcagt cggctttggt ctttttctat gcgagcccta 3300 aatcacaggg agtgcggagc aatcgaagca gctgacacat tgttagggca tcctcttcat 3360 gctacagata ctgatactgt aattagatgg ttagacgtta gaatgatacg aaatagaaaa 3420 cttaaaccat ttaaagaaat aaaaaaacta ccaccggact caacagatat attttacgat 3480 tcattaattg acagccatta tccaaaccga cctcaggaat taaattcatt rtgtttgtac 3540 gaytttgcaa agtggtttga tataataaaa aagaaaccag tatataaggc agaatattac 3600 gagatgccag gaggtctctt ttgtaaaaag cgraaaaagc cttatcttat aaatcactgg 3660 aaatataatc caataatcaa ccagaaratt attatyattc actactacta ctatttcaac 3720 cttggaggga taccgatgaa ttaaaaaatg gggaagattc ttttacaatg gcatttcaaa 3780 aaagacaatg cgaattaatk gatgcaatga agtatcatga taaatgtgga aygatacgag 3840 aagctatgga aaagatgaay gaaattattc aaaatcaatt gctggaaacg caagaaaata 3900 aatctaatcc agacaatttg ccagaaggct gtgttccgat tgagatagag aatgccataa 3960 aagattttaa agatatggcc gaaaaagtcg tcatgaacga agcagatata gagaattcta 4020 tttcaaggtt caatgtagat caatcaaata ttttcaaaaa aattacttct gtcttacagt 4080 ctaatgacaa aattttgaga attttcataa gtggcccagg tggtactgga aaaagtttct 4140 tgatcgaaac tattgtcgca tggaataaaa saatacgtaa gaaagaaaca gctgttacag 4200 cgcctactgg gatagcagct tacaatgttt ctggtttaac tattcataga cttttgcatt 4260 taccagtgga gcatgggtgt acagcgaaat ataaagaact atcgtctgca gctttgaaac 4320 aaatacggca aagcttggaa aatgttgact tgattattat tgatgaaata tccatggtat 4380 ctaatattac attaatgtat attcatttac gtcttactga aatcttcaat acttccgaat 4440 gcgatgatgg ttggtttgga aaaatacata ttattgtatt tggtgattta cttcaacttc 4500 cacctgttcg agaaaaattc gctttcgttg aagtaacgaa aaaagaaatt gaaaaataca 4560 ttggagctat aaattcattt aatttgtgga ctttattcga gtacgatgag ctgacaatta 4620 acatgaggca aaaaaatgat aatcaatata acgaaatact cagtagacta aggttaggat 4680 ttmtwactga tackgatatc aatgttctac aacagcgaaa actaaaattt tcaagtacaa 4740 atagtcttga aataatgcaa gaattgtgcg cggtgtatga aaaattacct actgatactg 4800 tatgtttgtt accaacgagg aatatgtgcg atattctaaa tgatgctatg ctgmgtagaa 4860 ttgattctga agatatmaat cttatagcta atgattcttt caaatgtccg aaaaatttgc 4920 aaaagaaagt tttaaaaatg ttagatgagg acaaagataa tgcttgtgga attgagcgtg 4980 taatcaaaat aaaaataaat tcaaaagtta tgataaggag aaatattgat gtctcattgg 5040 attagtaaat ggcactatag gttctgtaat ttctgttgct aaagataagg gagatgaaat 5100 catcagtgtc cgtataaaat taaaagatgg tacagaacat tcaataccta aactagaata 5160 caaatttgtc attatggaca aagtatatat aatacgtcaa caatttccga tctgtaacag 5220 ttacggaata actattcaca aaagtcaagg cttaagttta gaaaatgcta ttgtagaagc 5280 aggtaattct atattttgtt ctgggcaaac ttatgtcgct ttatcacgag taacgaaatt 5340 gcaaggctta catcttatca attttgatcc racaaatgtt aaagctgatt catccgcgat 5400 cctgcaatat aatagattaa gacaaaaata tcgyccygay cttcaagcat tagttattca 5460 aaacgattgc atgtttagta gacccartgc agttcaagat agacggtgga cagtatcaaa 5520 acacatttta gacatacagg ctaataatag taaattaaat atwtatctga cagtaattgg 5580 gttattcatg gtttcccaaa tgaggataat gtatcttgtt atgctaacgc ttgtatacag 5640 agyttgtttc attgttcatc tgttagaagg attctattgg aggccaaagg ttttgacctt 5700 ctaaaacatt cttttaatca atatatagca aaaacwcctg ttaatattaa agcattaaga 5760 gaatatgcaa acgaacaata ttccgcttgt gagcagcaag atgctgctga atttctttta 5820 catctttgta atgcatcyat gtttttgcar agtgttatca aacacgaaat taatgttgta 5880 acaaggtgtc cgaattgyga tgaaaggaat tctataacaa cttcaaatta cattttgyca 5940 ctttgtctac ctgaccctat gcatatacaw acattacaag aagtttttga ttataatata 6000 gataattgga aaaaygtcga aggggagtgc agtaatcatt gtgcatgcaa aagattacaa 6060 aaaactacyg ttcacgtaaa aaatacaatt ataattttac aactactatt gttttccgtt 6120 gaccatcaga cgacawcaaa aataactaat ttcagaatga aagatatacc aaaaactgta 6180 cttaaaatca ataaaaaaaa atataaagta attaattgca tttttcatca cggtaaatct 6240 attgaaacag gacattacac taatgtggta cgtcaraata accattggtt tgaaataaat 6300 gatatgtata ttaaacaaag accatggcct cgtggagcaa aagatgccta cttattattt 6360 ttagaagaaa tcactggcaa ttaaaaaatw aaagttct 6398 // ID piggyBac-3_BF repbase; DNA; INV; 4566 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-3_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4566 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4566 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-4566 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-3_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX FH Key Location/Qualifiers FT CDS join(322..909,1147..2757) FT /product="piggyBac-3_BF_1p" FT /translation="MPRPRYSVHEVHDVLFTEDSDPDFPDSDGELEGNSDS FT DSDSDDNVQPVRHSARGTRGAQGTASADVDHVGDDTDRDRGRGRVRGRGVR FT GRGRGRGAQRGRGAQRGRRGGRGGRSRGRTTRGGTRPAASGRDDADPSTSG FT DDGAGPSTSGNAAQSRSNPRQATLDPRLEDENWQPVDGDTVPQTPHAFTAE FT RGFTEEVTLPDNPTPLDFLDLLVPPSVYKTCTEQTNKYARDFFEDHPVDSQ FT KPRARPRDWPEEGITETDIKVFLALTIAMGLIHHQDIADYWSTDEVLETPF FT FPTIITRDKFQLIYKFFHLCDNDTYVPRGDPNYNPFHKLGDVYPTILERFT FT AVWSPGKEICIDEGMVPFRGNVHFRTYNPDKPDKYGLKAYELCDSNNGYCC FT RFEMYGGKGKEPSAKGLTHDIVTRLMTPYLDKGHTLYCDNYYSSPQLFLDL FT SDAGTNACGTVRNRKGLPDEFKNAKLTAPKERFCMSNGPILALKYKDRRDV FT KMLSTAHSAKIIGTGKRNRDREEITKPECVHYYNRFMGAVDSSDQMVSYLS FT FRRRTLKWWKKAFFHIFSLAILNAYLMYKYHFQQAQASRPREEEEDPSKVP FT KPMLHREFRKQLVKTLISTSGYAHDTRGRRVSASAHLLCRLSERHFLERIP FT PVNAGGRHPARYCVVCSSAAATGAAAGTGTSGKKRSSYQCKQCRKTLCIEP FT CMELFHTRSDYIKAYQRRQSGDYSPVAIPHVTF" XX SQ Sequence 4566 BP; 1283 A; 981 C; 1070 G; 1232 T; 0 other; ccctagtcct gcggcacgct acaattgtgt tgttgcacta gcatgttctg tatgcgacgc 60 gacactattg tagtttcggg ggtatgttta ggatacgcac gtagcgctag cgtgtgttac 120 ttccgggttg ctgacatcgt gtcaggtttg acctctcagg ggttaagggt caacgaacta 180 ttacgcaacc ggaagtacgt catcccataa gccattgcgc ggctaattta ttcaatatgg 240 ccgccggcgt cggctgctac atgtaaatat cttttgtttg ctttgtttgg gtgttttgcg 300 cggcaaattt cactatctat catgccgagg ccgcgatatt cggtacatga ggttcatgac 360 gtgctcttta cggaggattc ggaccctgat ttccccgatt ctgacggaga attggaggga 420 aatagcgaca gtgactcgga ctcggacgac aacgtgcagc cagttcggca ttctgcacga 480 gggacaagag gggcacaagg aaccgcgagc gctgacgtag atcacgtagg ggatgacaca 540 gatcgtgatc gtggccgtgg ccgtgtgaga ggacgtggag tacgaggcag gggcaggggc 600 cggggcgctc agcgaggccg gggcgctcag cgaggccggc gaggggggcg tggtggtcgg 660 agtcgcgggc ggacaacacg tggcggcacg cgccccgcag caagtgggcg tgatgacgcg 720 gatccctcaa caagtgggga tgatggcgct ggtccctcaa caagtggcaa tgctgctcag 780 agccgttcca acccgcgtca agccactcta gatccaaggc ttgaagatga aaactggcag 840 ccagtggatg gtgacaccgt accccagact ccgcatgcgt tcactgcaga aaggggattt 900 actgaggaag taagttgata tatttttgtt cactgacagt tagtgttaca gtactttggt 960 ttttggtcca aataagttgg tcttggtgat ttttgcagct tgtgtataaa acaatattag 1020 gtcaaatttt agctttctat ttttgagatt tactttcttt tctttttgtt actttatgaa 1080 agcatgtaga caattatgaa aacactttat ttgtagaact gataggatgc tgcatttttc 1140 ctgtaggtta cattgccaga taatccaact cctttggatt tccttgacct gttggtgcct 1200 ccgtcagtat acaaaacgtg cacagagcag accaacaagt atgcccggga cttcttcgaa 1260 gatcatccag tggacagtca aaagcccagg gcacgcccgc gtgactggcc agaggagggc 1320 atcacggaga cagacatcaa ggtcttcttg gctctaacaa ttgccatggg cctcatccac 1380 caccaggaca tcgcagacta ctggagtacg gatgaggtcc tggagacgcc gttcttcccc 1440 accattataa cccgggataa attccagcta atctacaaat tcttccacct ctgcgacaat 1500 gacacctatg tcccccgtgg tgaccctaac tacaacccct ttcacaagct tggtgacgtg 1560 tatcctacta tccttgagag attcaccgcc gtttggtctc caggaaagga gatatgtatc 1620 gacgagggga tggtgccatt caggggcaac gtacacttcc ggacatacaa tcctgacaag 1680 ccggacaagt acggcctgaa agcgtatgag ttgtgcgact ccaataatgg ctactgctgc 1740 cgcttcgaga tgtatggagg gaaggggaaa gagccttcag cgaaggggtt gacacacgac 1800 atcgtcaccc ggctgatgac cccatacctg gataagggac atacactcta ttgtgataat 1860 tattactctt ccccacagct gttcttggat ctgagcgatg caggtactaa tgcttgcgga 1920 actgtgagaa accgcaaggg tctcccagat gagttcaaga atgccaagct aactgcaccc 1980 aaagagaggt tctgcatgtc caacggtcct atccttgcct taaagtacaa ggacagacgt 2040 gacgtcaaaa tgctatcaac tgcacacagt gcaaagatca ttggcactgg caagcggaac 2100 cgagataggg aggagattac taaaccagag tgcgtgcact actataacag gttcatggga 2160 gcagtcgatt cctcagatca aatggtctca tatctaagct tccgccgacg caccctgaaa 2220 tggtggaaga aggcattctt ccatatcttc agccttgcca ttctcaacgc atacctaatg 2280 tataagtatc atttccaaca agcccaagca tcaagaccaa gagaggaaga agaagatcca 2340 agcaaggtcc caaaaccaat gctgcaccgg gaattccgca agcagctggt caagacgctg 2400 atcagcacat caggctatgc ccatgatacc aggggaagga gggtcagcgc cagtgcccac 2460 ctgttatgtc gcctgtcaga acgccacttc ctggagcgca ttccaccagt caacgcaggg 2520 ggcaggcacc cagcaaggta ctgtgttgtc tgcagctctg cagcagcgac aggcgcagct 2580 gcaggtaccg gtacatccgg gaagaagcgc tccagctatc agtgtaagca gtgtaggaaa 2640 accttgtgca ttgagccatg tatggagttg ttccacacgc gaagcgacta catcaaggcg 2700 tatcagcgga gacaatcagg tgattactca cctgttgcaa taccacatgt aactttctga 2760 cctaagtttt aattgtcatg tccccaggac tcctcaggaa ctttgtattg ttgttacaat 2820 gtatttactg gtgctgtagg acttagagtg aaggttgtgg cagaatatac tttttattgc 2880 ttttttatga ctatataatc aatagagtag gtgtagacaa ctaaaaatat gattcctagc 2940 aaagtagaga ctctaagctt taatttgata tacagcaaac ctatattgca cctaccatac 3000 agtatcaaaa tgaagttaat tcaacacaga agaagtagtg attccacaga aaagttagtt 3060 cagtaccaag gacaatctcc tgtcatttac tcacctgttg caatacctgt gtattttctg 3120 acctaagttg tattgttgtc atgtccccag gactcctcag gaactttgta ttgttgttac 3180 aatgtatctc actggtgctg taggacttag agtgaaggtt gtggcagaat atacttttta 3240 ttgctttttt atgactatat aatcaataga gtaggtgtag acaactaaaa atatgattcc 3300 tagcaaagta gagactctaa gctttaattt gatatacagc aaacctatat tgcacctacc 3360 atacagtatc aaaatgaagt taattcaaca cagaagaagt agtgattcca cagaaaagtt 3420 agttcagtac caaggacaat ctcctgtcat ttactcacct gttgcaatac ctgtgtattt 3480 tctgacctaa gttgtattgt tgtcatgtcc ccaggactcc tcaggaactt tgtattgttg 3540 ttacaatgta tctcactggt gctgtaggac ttagagtgaa ggttgtggca gaatatactt 3600 tttattgctt ttttatgact atataatcaa tagagtaggt gtagacaact aaaaatatga 3660 ttcctagcaa agtagagact ctaagcttta atttgatata cagcaaacct atattgcacc 3720 taccatacag tatcaaaatg aagttaattc aacacagaag aagtagtgat tccacagaaa 3780 agttagttca gtaccaagga caatctcctg tcatttactc acctgttgca atacctgtgt 3840 attttctgac ctaagttgta ttgttgtcat gtccccagga ctcctcagga actttgtact 3900 gttgttacaa tgtatttcac tggtgctgta ggacttagag tgaaagattg tggcagagta 3960 tcctttttat tgctttttta tgactatata atcaatagaa tagttgtaga caacttaaaa 4020 tatgattcct agcaaagtag agactctaag cttttatttg atatacagca aacctatatt 4080 gcacctacca tacagtatca aaatgaagtt aattcaacac agaagaagta gtgattccac 4140 agaaaagtta gttcagtacc aaggacaatc tcctgtcatt tactcacctg tagcaatacc 4200 tgtgtgattt ctttactcat ataaggaagg ataatgttgt cagagatctt gaaatagttt 4260 gtagtcaaga aattcacatg ataaatgtag gatatgtgag tagaatacaa aggtttcatg 4320 aaataagtgt gttttattgt tttattgccg ctattaacca ttataacaga aagaattgta 4380 gataacagta gcatttctag aaagtacaca ttgttagctt tacaatgata tataccttta 4440 tggggttact gtaaagtaaa gtgactaaaa atgagcaata tcaaactagt accctttgct 4500 tcgatggggt ctgggaaccc cagcagtggg ggcgagtttt ggtatagaaa gccagcagga 4560 caaggg 4566 // ID ITmD37E_Ele9 repbase; DNA; INV; 1296 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37E DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37E_Ele9. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1296 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1296 RA Kojima K.K. and Jurka J.; RT "ITmD37E-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >95% identical to consensus. TIRs are 30 bp CC long. TA TSDs. This consensus is ~96% identical to the original CC sequence in [1]. This family encodes a DD37E-type transposase and CC is similar to Tx_mos from Toxorhynchites amboinensis. XX FH Key Location/Qualifiers FT CDS 152..1168 FT /product="ITmD37E_Ele9_1p" FT /note="transposase." FT /translation="MASNEEIVRQQILGTHFENPNASHRWIGKQLGIAHST FT VSHVIKTFKDRKTTKRKNGSGKKRGTSNPEKARMVLNQFKRYPNISIRDVA FT KKAKVSVCFVQKTMKRSGLHVYKVQKAPNRSEKQNSVAKTRARKLYREWLT FT KPFCVVMDDETYIKADTKQMPGQEHYVATSKLAVPESVRKKKLSKFAKKFL FT VWQAICGCGEMSQPFITSGTVNGQIYKDECLQKRLLPFLRSHDGPTLFWPD FT LATCHYARPVLDWYKDNDVVFVPKDVNPPNTPELRPIEKFWAIMKAKLVKT FT PKLIENELELKKAWLKVSKQVAPIIAQNLMKGVKAKVRAFGLGEEVQ" XX SQ Sequence 1296 BP; 420 A; 268 C; 295 G; 313 T; 0 other; aagactgtat tcgatgaaat ttggacacac gcaattgttc gtaacttttt agcgtgtggg 60 taaaaattat tgaaattttg actgaatata tttcattgtg taatgtttac ataggccgag 120 tttcgtgaaa atcggtcgag tgatttcagt gatggcttcg aatgaagaaa tagtgcgcca 180 acaaattttg ggcactcact tcgaaaaccc aaacgcttct catcgatgga tcggaaaaca 240 gctcggaatc gcacattcaa ccgtatctca tgtgatcaaa acgttcaagg accgtaagac 300 aaccaagcgg aaaaatggta gtggcaaaaa acgtggcaca tcaaatccgg agaaggccag 360 gatggtattg aaccaattta agcgttatcc caacatttcg atcagggatg ttgcgaaaaa 420 ggcgaaagtt tcagtttgtt ttgtgcagaa aacaatgaaa cgttctggat tacatgtgta 480 caaggtccag aaggcaccga accggagcga aaaacagaac tcggtggcca aaactcgtgc 540 aagaaaactg tacagggaat ggctcacaaa accgttctgc gttgtgatgg acgatgagac 600 ctacattaaa gccgacacca aacaaatgcc aggacaggaa cactacgtcg ccacctctaa 660 gttggccgtt ccggagagtg tacggaagaa gaaactgtca aaattcgcca aaaaattcct 720 tgtctggcaa gcgatctgcg gatgtggcga aatgagtcag cccttcatca cctctgggac 780 cgtcaacggg caaatctaca aggacgaatg cctccaaaaa cgacttctgc ccttcttgag 840 atctcacgat ggcccgacac tgttttggcc agatttggcc acttgccact atgcccggcc 900 cgtcttagac tggtacaaag ataatgatgt agtttttgta ccaaaagacg tcaaccctcc 960 aaacactccg gaacttcgcc caattgaaaa attttgggca atcatgaagg caaagctcgt 1020 taaaactcca aaactgatag aaaatgaact tgaactaaag aaagcgtggt taaaagtaag 1080 taagcaagtc gctcctatca ttgcccaaaa tctgatgaaa ggggtcaagg caaaggtgcg 1140 ggcatttggg ttgggagaag aggttcaata aaccaatgat gtcaaaactt aaccatttag 1200 tagtgcttta tttccagaaa gtttgaaaag aatcgggctc aaggaaaatt tttggtgacc 1260 tatttagtgt gtccaaattt catcgaatac agtctt 1296 // ID Gypsy-223_AA-LTR repbase; DNA; INV; 1141 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-223_AA_; KW Gypsy-223_AA-I; Gypsy-223_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1141 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1050-1050 (2011). XX DR [2] (Consensus) XX SQ Sequence 1141 BP; 273 A; 318 C; 299 G; 250 T; 1 other; tgttaccgaa tggtaacttg ataaaaagag tggggatgta gggccaaata ttatatagct 60 agactaacgt tcaatttaaa tgagggggcg tcggcgagtt gacctaatgc casccgatag 120 acatcttggg tggtgaagtt gagcaatagt ctgaagtggg tggcgatgtg ggcggaaaca 180 cggagagtga gcaattttgg gagagaagat cggacgattg gggctaagag tgcgacgtgt 240 gagaaggttc gggatcagtc tacagaagct cgttccagcc gacagaagtt atagaagaca 300 acctccaaag tagcgcgtat tacggtggtg caagtagaat cgcacacatc ctgtgcgtta 360 gtgccgaaag cttcctccag gcccgtaaaa tctcctcgac aaggccagga aaagagtgaa 420 ccgttcattt tccgttgaaa cgacaccacg tttgttacgg accgaaggtc ggctgtctca 480 ccttccaact accgccgctg ggccgttcca cgcttccagt ggctccgtca gttgcgatcc 540 cgcgccaaat tcgtcatcaa tcgctttcaa gccgttcccc gacagcctca acggatattt 600 cgccggtttt cgttggaatt acttggccac tttccggctt cccggcaagc agcatccaga 660 cacgtggacg gttgtccgtt gggctccgct ccacgatcac ccctgcaacg ttattgggca 720 ctccaaccaa tttggtattg ctccacggca accagaacga cctcgaaagt tcttcgcaga 780 gtattcactc ggccagcgtc cggcaacatc atcggccact ctccagccac actctaggcc 840 cacttgttat catccgcccg cccaccaccg gtaagctggg cctgtattcg acgtgtcgca 900 tcactccgct gtgcgacgac ccccagaaga gaaaatataa ggtaattgct gacaaaattt 960 tgtttcagga cttccccgcc gaaaactccc ccacccaata acgccctggg acccgggagg 1020 caaaagaggc cttcggtgcc caccctggaa gctgccgttg gctagtacca gcagcttggg 1080 gcttaggcac gtccccgtct gggacggcgc aatattctcc cggggccaaa aggtagtttc 1140 a 1141 // ID Gypsy-208_AA-LTR repbase; DNA; INV; 307 BP. XX AC AAGE02029057; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-208_AA_; KW Gypsy-208_AA-I; Gypsy-208_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-307 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029057; Positions 69038 69344. XX SQ Sequence 307 BP; 103 A; 54 C; 52 G; 98 T; 0 other; tgagatgttt gttataagcg tgtttgctat aacgattctg attattacct tattatacgt 60 aagctagatt tagagtgtaa aatacaatta catttgtaca taaaatatat accaatacaa 120 agaagcatcg gaggcagacg aaggtctgat gagaacaaaa ataaatcata agagtcattc 180 gattctgaca cctagctgga aacagtcgac tttattttcc ctttgttaaa gaaacccaac 240 agtgtactct ccccagttca accgatttta ttagttcaat ttcaagtccc ttttggagga 300 cttgtca 307 // ID BEL-40_CQ-I repbase; DNA; INV; 2212 BP. XX AC AAWU01013031; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-40_CQ_; KW BEL-40_CQ-LTR; BEL-40_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 233-233 (2011). XX DR GenBank; AAWU01013031; Positions 10722 12933. XX CC 'CTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 277..2160 FT /product="BEL-40_CQ-I_1p" FT /translation="MPPKIPRTPLKKTEEDIRQEELDGLVHIREQKVDRIT FT RMKEAITSVPRGGRTAATIRGNQKKLELVNSEFDSVHQRIILLADAKSRGQ FT HDAKQREFEAMYDELIMTLEQWTDDLKPAVAAGTSNALAPAGPQPVVINQP FT LPKIIPTFDGKYESWDRFKIMFKDVVDRSNELPRIKLYHLEKALIGSAQGA FT IDEKTIQDGNYAHAWEILEEQYGDKRRMIDLHIGGLLRVQKLVKESHEELR FT SLVQNFHGHVENLKFLGQEFTGVSEQIAVYILAHALDEDTYKLWEATIKRG FT ELPKYDEAIKFLKSRVSVLERWETTKLAEPKQRERSSHLTSVSEQPYHIAN FT TAVMSHPKFRCDFCNEHHLTFKCTAFNGLPVYQRMAAVRERNLCQNCLRSG FT HHWKECGSQHSCRKCQRRHNTLLHRQPRVSRITGQKQPQSSPASCENPRAF FT NAILPTALVNLSDNENQPVPCRILMDSGSQVNFISEAMADRLKLQTKAASV FT PICGIGASKTVSKQLVSVKLHSRISSFNANVECLIVPKVTGTVPPSPVDIT FT EWPIPVGTQLADPTFHKPDSIDMILGVTMFFRLLKNGQIELDGNQPDLRET FT HLGWVVAGHGGVDDNLQTRFPVNVGVAISA" XX SQ Sequence 2212 BP; 617 A; 597 C; 585 G; 413 T; 0 other; ttggtccaat cgacccgaat ttggacatcc ggcgtggatg tggtaaagaa gcccggtgaa 60 aagtgcgagg caaacgtgcc tcagagaaca agcggacagt tcccgcgaaa agtcgctccg 120 gtgagagaca cgtagccgcg agtgcaaaag tgcgtttccg gaacattcgc aaacaatcac 180 gacggaggca aaccggccca atcccgttcg cgagtgtcca aagagcaaag aagaagaacc 240 taacctaaaa gaaacaaaca gtgaacagtg ctcaaaatgc cgccaaaaat cccacgtact 300 ccgttgaaga agacggagga agacatccgc caagaagagc tggatggcct ggtccacatc 360 agagagcaga aggtggacag gatcacccgg atgaaagaag cgatcaccag tgtgcccaga 420 ggagggcgca cagccgcgac gataagagga aaccagaaga agcttgagtt ggttaacagt 480 gagttcgaca gcgtccatca gcggatcatc ctcctggcgg atgcaaagtc tcgaggtcag 540 catgatgcca agcagcgaga gttcgaagcc atgtacgatg agctcatcat gacccttgag 600 cagtggaccg atgacctcaa acccgccgtc gcagcaggaa cgtccaacgc cctcgcgcca 660 gccggccctc aaccggtggt catcaaccag cctctgccca aaatcatccc aacgttcgac 720 ggcaagtatg agtcttggga cagattcaag atcatgttca aggacgttgt ggaccggtca 780 aacgaacttc ccagaatcaa gctgtatcac ctagagaagg ccttgattgg aagcgctcag 840 ggtgcgatcg acgaaaaaac gatccaggac ggcaattacg ctcatgcctg ggagattttg 900 gaggagcagt acggggacaa gcgacggatg atcgatctgc acatcggtgg tttgctacga 960 gtccagaagc tggtcaaaga gagccacgag gagttgaggt ccctagtgca gaacttccac 1020 ggtcacgtcg aaaacctcaa attcctgggc caggagttta ccggagtgtc ggaacagatc 1080 gccgtctaca tcctggcaca cgcgctggac gaagatacat ataaactctg ggaggcgacc 1140 atcaagcgtg gagaactccc caaatacgac gaggccatca agttcctgaa aagccgcgtc 1200 tccgtgcttg agcgatggga aactaccaag ctggcagaac caaaacaacg agaacgatca 1260 agtcatctga cttccgtatc cgagcagcct taccatatcg ccaacacagc cgtcatgtcg 1320 catccgaaat tccggtgcga tttctgtaac gaacatcacc tcaccttcaa gtgcactgcc 1380 ttcaatggcc ttccggtgta ccagcgcatg gcagcggtca gagagaggaa cctgtgccaa 1440 aattgcctga ggagtggaca tcactggaag gaatgtggat cccaacactc gtgccgcaaa 1500 tgtcaacgga ggcataacac attgctacac cggcaaccac gagtctctcg aattaccggc 1560 caaaagcaac ctcagtcctc accagcatcc tgtgaaaacc cacgggcgtt caacgctata 1620 ctgcccacag cgttggtcaa cctgagcgac aacgagaacc aacctgtgcc gtgtcgcata 1680 ctcatggaca gcggatcaca ggtaaatttt atttctgaag ccatggcaga tcgtctcaaa 1740 ctgcaaacta aggccgcaag cgttcccatt tgcggcatcg gagcatcgaa aactgtttcg 1800 aagcagttgg tctctgtgaa actccattcc cgaatcagca gcttcaatgc aaacgtggag 1860 tgcttgattg ttccgaaggt gaccggaacg gtgccaccct cgccggtcga catcacagag 1920 tggccaatcc ccgttggcac ccagcttgct gatccaacgt tccacaagcc agactctatc 1980 gacatgatcc tgggagtaac catgttcttt cggttgctga aaaacggaca gattgaatta 2040 gatggcaatc aaccggatct tcgagaaact caccttggtt gggtcgttgc tggacatggt 2100 ggcgtcgacg ataatctaca aacgcgattt ccggtcaacg ttggggtggc aatttctgcg 2160 taatccagca aggcaggtat ggctggaact tcttcccagc ccggggggag ta 2212 // ID hAT-N6B_AP repbase; DNA; INV; 614 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N6B_AP. XX NM hAT-N6B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-614 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2105-2105 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 614 BP; 195 A; 94 C; 98 G; 225 T; 2 other; tagggcagag gacttgtagc atttgcatat ttgttttgct aagtcacaaa aatagcagag 60 tacgaaacaa atacgtttca tgctttcaca aagctagttt tgaaatttta ttttgcatat 120 aaatgcatat tttgagtctt ttttgttttt agggcatatt ttacactttt atacgcaatt 180 ttatactttt tgagacatat ttaacttaaa tcaatttttt ccacaccgac atttttattt 240 caaacaatac agtcaaaatg tttggataaa aatttaaaaa atgtttgtct ttttataata 300 wtattgactg tccgcgggca cacgtttttc ctacaagcgc ttgaggatcg ccgataagcc 360 aataatataa tttgctgtaa ttaattttat taatagatag ataatcgtaa acgatctaay 420 gtcgtacacc atacgacatc accaccacag tgagccgtgt gcgcactatg gattgcgggc 480 ttttattcga ttaggattaa aaaaaaatgt agaaaaattt taaatgcata ttttaagtgc 540 atattatagg gtttttaagt gcataagtgc ttgcatattt agtgcttttt tagagctaca 600 agtcctctgc ccta 614 // ID P-1_Dpulex repbase; DNA; INV; 4707 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW P; DNA transposon; Transposable Element; P-1_Dpulex. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4707 BP; 1429 A; 905 C; 936 G; 1437 T; 0 other; cacggtctac caggagaggt acccaatcgc cgtatccctt tcgtcagcca gaataatcaa 60 gggacttaga agcttccaca aaatgccgct tggaacgcgg gcgtagcttg gtcccaagct 120 gtgtgtaaag cgtctgtcat tgcgtggtgt tctgtggttc tgtcttacag atcttgttgt 180 gcgttttgaa tttttaactg atttcacacc tttaatgcct ttaattttca tttgtcttgt 240 atttcaaaat ggtgaagtgc tttgtgacgg gttgctctat agaatttctc agatgtaaag 300 aaaaaattac aaaatttcgt gctcccagag acgaaaagca atttattcta tggcagaaag 360 ctattccaag aagtgacaga aagctgacca aacaggacta tgtgtgtgca aagcacttta 420 aagataaaga tttaaccaaa gagagaacca ttctcaatga ggtatttcct ctgaagattt 480 ggaagttggc agctgaagcc attcctacgc tcaatctatg taattgcagt atgtctataa 540 ttttgtatct taaataatca ttcatctgtt tcactttttt agggggtgac aacgataaag 600 taaaaaagaa aatgaaggag aatagtaaac ggaatgttga tcctgaaaag tggcaaaatg 660 agcttgattt tcccaacagt cctgagccca ttaaagagcc agacagtcaa gaaactctgg 720 aatcgaacat cacacctgag tttgcagctg atgttgagat gctggatcct ctagaatatg 780 ttgatgagcc catcgtagaa gcagtcaaca accttaattg tgatcccaca agagcaaagc 840 ttcccgaaca ttggaaatgg tgtcttactg atgaagatct acaacaggat gatcccgtag 900 cttacagcaa gataactgct gttagatttg ttttgtaacg acttcacatt tcccatcaaa 960 atgattacag ttgtaggtgg tgaagtgttt tacactgtca agggggttct agtcacacca 1020 cctaatttcc tgccaaaatc attcagcaca gtagaagaac tcactgactt gttggctcgc 1080 tttgactcaa ccaatatttg tggtggtttc aaattggttt gtaatggagc actaacaccc 1140 agcatgaaaa aagagacagt caagcagttt ggtgaacgga gatcgaaatt ctgctcttga 1200 cttctccaga aaggctccac ttgctataga tgcgaacacc tgaataagtt agtggctggt 1260 aaatcaccga ccccaagtcc aatggagaga cttcttgacc gaagccagaa atcaacttac 1320 aaaaatcgtc tgttaaagtg gcagttacaa cgtaaaggaa aactgattaa ggtagcaaat 1380 ataagttatt tatttgtaag acctttttta atcgcatctt gttttgtttt ttcaagaatt 1440 aagacaagaa atcaaggatt taagaaaaga cgtgcgtgcg agagtgccct tcaagataaa 1500 ctgtctagcc ttaatcccaa ggtatttaat catttcaatg agacaggcga tgaaactgaa 1560 tgtggttact gtaactgtta tgtaagagtt gtctaaattc ctaccctact tgtattaata 1620 cctcccattg acttacacga ctacagtttt ccaccaagaa gaataaccaa cgatgtgtga 1680 tcagaacttt ctcgagtatt cttactaatt tttaacaatt aattgacatt agtatcacta 1740 tttttctttc cactgtggta tttatctatt tccccatttt tttttatggc ctgattattc 1800 atacaaagaa ttgaatctca tttatgtctt ttgttaacaa tattctaatc tgtcttagtt 1860 ttatccatag gaagttatgg tgattaaggc tatatttaga aaggctgagc aaaagaactc 1920 aagatgtatg cggtatgatc ctgaatttct tttagaatgc gtgttactcc gcatcaagag 1980 caaatctaca tatgatcatt taagagccac caacattctc cctctccctt gtccagagac 2040 tatccgtcgg ctattaagct gcatgccatg taaatttggc cttaatagtt ttgctttatc 2100 gtcaattaag acgtttcttt ctggaaagcc aaagagtatg tgttacggta gtttggtgtg 2160 ggatgaaatg accatagctg gtgatgtttc ttttgaccca atgaaattag ttttcgaggg 2220 ctttgttgat tacggagaag atgatggatt cagtgaaccc atcacgttaa agaaacacga 2280 gggtgaatta gcggaccatg cactagtttt gattttcaga ccctatcgat attcgtggat 2340 ccagccaatc gcctgctatg cgaccaaagg agcttgtcct ggtggagtta tccaccaact 2400 catggccagg gcaataaccg ttcttcatca aaatggtgcg attgtgaagt cagtcgtttg 2460 tgatggggcg caaaccaata aaacggtaat gcgactatgc agcatcacag gtaaattctc 2520 atcaatccat aatatcttca ctgatggtag agaaaacgag caagatacct gtccgagatc 2580 agctatgtct gacgtcgatg atgacggaaa caaagacaca acttcattcg accaccccac 2640 acttgaaaaa tcattgattt actttttggt tgatgtccca cacctgctaa aaacaataag 2700 aaacaacatt ttgcagtgaa aaccgcccag gtattttacg ttatatttgg atctttaatt 2760 taccattttt tccaattctc gtaaagatat attcaatatc ctatctcttt tagttcaaag 2820 gtaaagcggt gaattcagtg acttcgaatt gctgtttaaa accagcagga tagctggaac 2880 tcccctcagt ggacttcata agctgacaga atctcatctt tacccgacgt cgttcgaaaa 2940 aatgaatgtg agattagccg cacagatttt atcaaaatca gttgctactg catttcagta 3000 ttttcgtaaa ctagaaaata cgaaggcatc ttttgagggt aggttaaatt aaataagtag 3060 aataactgca gcttgacacg taacaataat ttatcttgca gataccattg gaacggagga 3120 aatggtcagc attatcaatg atgcttttga tgtaatgaat ggacgttgct atcaaaaatc 3180 catccgtagg acgtcatgga aaaaagacag aaaggtattg gacaagccta cttaactata 3240 ttgctccagc tttccgtata atcctaactt ttagttaact ggtattattt tgtttattaa 3300 tctgtttcat aaatcttgtt tatcattttg cgtgactaga tattggaatc gttgttggag 3360 gcattgtggg aaactgaact tgcttccgat tcaacaaatc taaagccgtt cgcttcacag 3420 acaacgtttg aggcattaag agtcacagtg ttcagtacca tcgaattgac cgagcacctt 3480 ttttcgagcg agatcaatta tgaattcgtc ttgacaggaa aactaaatca ggattgcata 3540 gaggtatgct tacgttaaaa tttaaatcag atcactgata actaactttt gattttcgtc 3600 agcgattttt cggaatcatt cgtcaggctg gtggtggtac cgagacgcca accgtacact 3660 cctttctggc gctttttcgc atgctttgcg tctactaccc taccaaaacg actattgccc 3720 atgcaaacgt cgaagaggaa cgaatgtcgt tgttaacgtc ctacaaggac tgtatgctga 3780 atcgatttaa aaaagacaag aaggaagcaa aggccagaaa gcaagctctt aaggacagac 3840 ttgcaaacgg aatgtctatg atcgacggaa ctgtgtttaa tatgtccgcg tacaacaaat 3900 gcattgatga tgtggtttac tatcttgctg gctacgtctt gcattcgagg cgcaatctta 3960 ttggatcttg tgatgaatgt tggaaatcgt taacaacgga tgaagacctt cccgaaaatt 4020 cttcgtttcc taactggctt attgttttac gagataaagg tggcctcaaa aaagtaacac 4080 ctaacatgtt tttttctaat ttcggcgatc gagacaatgc tgatgaatca ttttagtcag 4140 gaagggagct acatcagaga ttcatttgaa aaggttattg aaaaagcatc ccattttacc 4200 atctactcaa tctgttgccc tgcccacaaa aaatcccttg tccctagtct tgtatatgaa 4260 tatgtagtca ttcgttttcg attccaagca aagtggaaga agaacgtgga agtgtcgaac 4320 gtaaccagtc aacgacatca gtcaagaaaa ttatcgaaat tgtgaattat tctagtcgaa 4380 atcgaaatgc tgcctgcgta tgtctttatt tacgacatta ttttctttaa ttgtcaaaca 4440 tatttcctaa atgtgatatt gtaacctttt acaagttttt ttgactttca ataaaaaccc 4500 agttataatt gcaaaaaaat tgagaaatgt atttcaaacc atatcatatg attcgaaatt 4560 caattgatcg tagttcgtat acccgccaag ctttggagag ttgggaccaa gctgcgcccg 4620 cgatcctagc ggcaaatttg ggaagcttct gagtcccttg attagtctgc tgccatggcg 4680 tttgggtacc tctcctggta gaccgtg 4707 // ID BEL-44_CQ-I repbase; DNA; INV; 5297 BP. XX AC AAWU01003709; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-44_CQ_; KW BEL-44_CQ-LTR; BEL-44_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5297 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 241-241 (2011). XX DR Genome; AAWU01003709; Positions 19516 24812. XX CC 'ACAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1097..3919 FT /product="BEL-44_CQ-I_2p" FT /translation="MDVAGRRELVLSKKLCFNCLKPLHTANKCPSKSTCRT FT PDCKQRHHTTLCQQRQQHSQEPEKADRDHPPRQPEETEEASSVSANAAQAK FT ATKHEPSSVALLPTVIANLEGKDGKLHQVRCLVDSGSQASLITEACVKRIG FT LKRTNASLEVTGVNGEIVGTTAGAVTLVMSSRFDGETKLTTQAYVLGRLTA FT TLPNQRFNVADLPFLEGLELADPSFNSPGEMDVILGADVFLSILQAGQVKN FT RQGIPVAQRSIFGWMVSGKISQPCRTSSHHAVLNVAQEFDIDRTLRLFWED FT QEIPEGKLLTTEEKRVIEHFDSTLTRSEKDGHFIVRLPLDDSKCKLGESLN FT AATKRLRAMERKFQRDPNFKERYVAFMREYQKLGHMEEIPETEVQADCTKS FT YYLPHHGVVKEDSSTTKLRVVFDGSCATTTGVSLNDLLLNAPNVNADLFDV FT LLRFRTYPVVFTADVEKMYRQVWVHRDDTDYQRIVWRESPDEPIRHFRLLT FT VTYGLKNSGFLAMSALKKAAEDFEQHYPGAAERIKEHTYVDDLVSGADLAA FT EAARLVEQIDEIAGKAGFTLRKWCSNVDETLLPLADVKNAAVPIQFPDERN FT AIKALGIHWLPQEDAFTFKVNMKTDGPNTKHQLLSDSAKLFDPIGWLAPVT FT VRAKILFQQCWLYDMNWHDPLPAAVEQMWIEFKENLPRLEQVKIPRWMSSY FT NGHVELHGFSDASEEAYSAVVYIRTFNEIDGAYVNLVAAKTKVAPIRQVSL FT PRLELNGAWLLARLMKRVAKAFERFKVEMFAYTDSTIVLHQLAFHPRKLDT FT YVANRTASILEVLPRSRWFHVKSEENPADCASRGISPAELVNHPLWWFGPP FT WLKAHASTWKHEMPDEEFDEATLEVRKKFRTFNIVVRLPKTTYEEEKQILD FT QILFTAWSVPTAVTGQPVPLQPATGKQGEAFRFDRPG" FT CDS 3669..5270 FT /product="BEL-44_CQ-I_3p" FT /translation="MRQPGSMRCLTRNSTKQRWRYARNSERSTLWSDYQKP FT RTKRRSRFLIRFSSLLGACRQLSRVNRFLYNLRPGNKEKRSGSIAPAELHS FT ARMQFVRLAQHDVFQKEIKTLAAGREVSPKSKIANLYPFLDGDGTLRIGGR FT LQQSSLPFEVKHPVILPKDHRFTKLLVEELHSQNCHAGPSLLTATVNQRYW FT IQGCQSVIKQVIHNCVRCRRLKAKTAQQLMGSLPAARVTACRPFTHVGVDY FT AGPIQVRCSNTRGTRSMKGYIVVFVCLSTKAVHLEAVSDLTSQGFISALKR FT MIARRGYCCEIWSDNGTNLVGADRLLQEIYEAIMAHSKEAEQFLCNLGIKW FT KFIPPASPHQGGIWEAAVKSAKSLLRPVVGNEKLTFEELSTVLCQIEACLN FT SRPLCPLSTSPDSLEALTPGHVLVGQPLNMIPEPDVMHLKMNQLDRWQKMQ FT RYSTEFWNRWRDEYIATLQPRGKWRASEENIKPGQLVLVKNDNAPPSAWEL FT ARVVAVHPDRAGLVRNVTLRRGKHEYQRSVQKICPLPN" XX SQ Sequence 5297 BP; 1265 A; 1512 C; 1548 G; 972 T; 0 other; ttggtccttc gcggcgcgga tatggatcaa gagcggttga agtccctcac cagcaagcga 60 gaagtcatcc tcgcgaaggt gaagtgggag ctgtccgtcg cgaaagccat caaaactcgc 120 aacccgtccc tcggcgaagt ggcggaacgg cgggacaagt tgaccggcct cgctcagtgt 180 tttgacgacg tacagactga gatcgaggag gccacgccga acctggagga ggtggtcacg 240 gtgttcaacc accgaatcct gttcgaggac gcctacttcc agatcaagga catctacacg 300 gagtatttgg accagcatgt ggaggagccg gaagttaggc cggaagaccg gcaggacgat 360 ctgcgagacg ccgtcaaggc gctccttgaa tcccagcagc agatgctgct cgcgcagtgc 420 caaaaaccac caccacaaac gctggcgcta gcggcgggga gccacgtatc gaacccggta 480 ccccataacg tcgtcaagct gccacaaatc gacattccga agttcaccgg ggagcgtaag 540 cactggcgct catttaagga cctcttcgtg tgcacaattc acagccgtac cgacctgcgg 600 gattcggtaa agatgcagta cctgttctcc tacctggacg gtgaagcgaa ggggaaagtc 660 gactcgttct cgatcaacga tggcaactac cgtgaggcgt gggacgcact cgtgacctac 720 tatgacaaac acaaatacac ggtgtttgcc ctcgtccggg agtttgtcga ccagcaacca 780 gtcaccaacg ccaaggaatt gaagcacctc gtagccacat ccgacgacgt cgtccgccag 840 ttgaaggccc taggacggga gtacgagtca cgggatccgg gggacttttg caaacttcca 900 agagttcctg gagcagcggt gtgacgcgtt ggagacgtgc tcggcgttca gcaagaagac 960 gggccctgaa gtttcgaaga aggagtctca aaagccaacc caaagaaaag tgcaagcgtt 1020 tcacacgagc gctgctgtga gctgcgcgaa atgcagtaaa gatcacccga cgtatcactg 1080 cgacgtgttc aaggccatgg acgtggccgg acggcgggag ctagtgctct ccaaaaagct 1140 gtgcttcaac tgtctgaagc cgctccacac ggcgaacaag tgtccgtcga agtcaacgtg 1200 ccgcacgccc gattgtaagc agcgacatca cacgacgctg tgccagcagc ggcagcagca 1260 cagtcaggaa ccggagaagg ctgaccgcga ccatcctcca cgccaacccg aagagacgga 1320 ggaggcctcg tcggtctcgg ccaacgcagc tcaagccaag gcgaccaagc acgaaccgtc 1380 gtcggtggca ctgctgccca cggtgatcgc caacttggaa gggaaggatg ggaagttgca 1440 ccaggtgcga tgtctcgtcg acagcggatc acaagcgtcg ctgattacgg aggcatgtgt 1500 caagcgcatt gggctgaagc gcacgaacgc ctccctggaa gtcaccggag tgaacggaga 1560 aatcgtcggc accacggccg gtgcagtcac gctggtgatg tcttcgcgtt tcgacggaga 1620 gaccaagctc accacgcaag cctacgtgtt gggaaggctg acggcaaccc tgcccaacca 1680 gcgcttcaac gtagcagacc tgcccttcct ggaggggctg gagttagcgg atccgagctt 1740 caacagtcca ggcgagatgg acgttattct cggagcagat gtcttcctgt ccatcctaca 1800 agccggacag gtcaagaatc gccaaggaat ccccgtggcc cagcgttcca tctttgggtg 1860 gatggtatcg gggaagatct cccagccgtg tcgcacaagc agtcaccatg cagttctcaa 1920 cgtcgcccag gagtttgaca tcgaccgcac cttgcggttg ttctgggagg accaggagat 1980 ccctgaaggc aagctgctga caaccgagga gaaaagggtc atcgaacact tcgattcgac 2040 tctcacgcgc tcagagaagg atggccactt tatcgtcaga ctccccctcg acgattcaaa 2100 gtgcaaactg ggagaatcgc tcaacgcagc aaccaagcgc ctgcgggcca tggaacgcaa 2160 gttccaacgg gatcccaact tcaaggaacg gtacgttgcc ttcatgcgcg agtaccagaa 2220 gctgggccac atggaggaaa ttcccgagac ggaggttcaa gcagactgca ccaagtccta 2280 ctacctccca caccacgggg tcgtgaagga ggacagctcc acgaccaaac tccgggtcgt 2340 tttcgacggc tcttgcgcca ccacgaccgg ggtctcacta aacgacctac tcttgaacgc 2400 acccaacgtc aacgcagatc tcttcgatgt gctgctgcga ttcagaacct acccggtagt 2460 gttcacggcg gatgttgaaa aaatgtaccg ccaggtctgg gtgcaccgtg acgacacgga 2520 ctatcagcgc atcgtctgga gagagtcacc cgacgaaccg atccggcact ttcgtctgct 2580 gaccgtaacg tacgggctga agaactccgg gttcctggcg atgtcagcgc tcaagaaagc 2640 agcagaggac ttcgagcagc attaccctgg ggctgctgag cggatcaagg aacacacgta 2700 cgtggacgac ctggtctccg gtgcagattt ggcggcagaa gcagcgcggt tggtggagca 2760 gatcgacgag attgctggta aggccggctt cactctccgc aagtggtgct ccaacgtgga 2820 cgaaacgctt ctaccactcg ccgacgtgaa gaacgcagcg gtgccaatcc agttccccga 2880 cgaacgcaac gccatcaagg cgctggggat ccactggttg ccgcaagagg atgcgttcac 2940 cttcaaggtc aacatgaaaa cggatggacc aaacacgaag caccagttgc tgtccgactc 3000 ggcgaagctg ttcgacccca tcggatggct ggcgccagtg acagtgcgag cgaagatcct 3060 gtttcaacag tgttggctgt acgacatgaa ttggcacgac ccgctgcccg ccgcggtcga 3120 gcaaatgtgg atcgagttta aggagaacct gccgcgcctg gagcaggtca agataccaag 3180 gtggatgtcg agctacaatg gacacgtcga gctacacggc ttctcggatg cctccgagga 3240 agcctactct gccgtcgtct acatccgaac gttcaacgag attgacggtg cctacgtcaa 3300 cctagtcgcg gccaaaacga aggtcgcacc gatccggcag gtttcactac ctcggctgga 3360 gttgaacgga gcgtggctac tagcaaggtt gatgaaacgg gtggccaagg cgttcgagcg 3420 gttcaaggtg gagatgtttg cgtacacaga ttcaacaatc gttctgcacc aacttgcttt 3480 ccacccacgc aaacttgaca cgtacgtggc caacagaacg gcatcgatcc tggaagtact 3540 gccacggtct cgctggttcc acgtcaagtc agaagagaat ccagccgact gtgcctcccg 3600 tggaatctca ccggcggaac tggtcaatca tcccttgtgg tggttcggcc cgccgtggct 3660 gaaggcacat gcgtcaacct ggaagcatga gatgcctgac gaggaattcg acgaagcaac 3720 gctggaggta cgcaagaaat tccgaacgtt caacattgtg gtcagactac caaaaaccac 3780 gtacgaagag gagaagcaga ttcttgatca gattctcttc actgcttgga gcgtgccgac 3840 agctgtcacg ggtcaaccgg ttcctctaca acctgcgacc gggaaacaag gagaagcgtt 3900 ccggttcgat cgccccggct gagctgcaca gcgcgcgaat gcagttcgta aggctggctc 3960 agcacgacgt cttccagaag gagatcaaaa ccctcgccgc gggtcgcgag gtttcaccga 4020 agtcgaagat tgccaacctg tacccgttct tggacggcga tggaacgctc cgaattggcg 4080 gtcgtttgca gcagtcgtcc ctaccgttcg aggtcaagca cccggtgatt cttcccaagg 4140 atcatcgatt cacgaagctg ctggtggagg agctccactc gcagaactgc cacgccggac 4200 cgtcgctctt gactgctacg gtcaaccaac gctattggat tcaaggctgc caatcggtaa 4260 tcaagcaggt gattcacaac tgtgtgaggt gccgtcgtct gaaggccaag acggcgcagc 4320 agctgatggg cagcctccct gcagctcgag tcacggcgtg ccggccattc actcacgtgg 4380 gggtggatta cgcagggccc atacaagtac gctgcagcaa cacccgagga acacgctcta 4440 tgaaggggta catcgtcgtg tttgtgtgcc tctcaaccaa ggccgtacac ctggaggcgg 4500 tcagcgacct cacgtcgcag ggattcattt ccgcactcaa gaggatgatc gcacgtcgcg 4560 gttattgttg cgaaatctgg tcggacaacg gcactaatct ggttggtgcg gaccgactgc 4620 tgcaggagat ctacgaagcg attatggcac acagcaagga agccgagcaa ttcctctgca 4680 atctcggcat caagtggaag tttattccgc cggccagccc tcaccagggt ggcatctggg 4740 aagcagccgt caaaagtgcc aagagcctgc tgcgtccagt cgttgggaac gaaaaactga 4800 cgttcgaaga actcagtacg gtgctctgcc aaatcgaggc ctgcttgaat tcgagaccac 4860 tctgcccgtt gtcaacatca ccggacagtc tcgaagcttt gacgccgggg cacgttcttg 4920 tgggccagcc gttgaacatg attccagagc ccgacgtcat gcacttgaag atgaatcagc 4980 tcgaccgttg gcagaagatg cagcgctact ccaccgagtt ttggaaccgc tggcgtgacg 5040 aatacatcgc gacgttgcag ccgagaggaa agtggcgcgc aagtgaggag aacatcaagc 5100 cggggcaact ggtcctggtc aagaatgaca acgctccacc atccgcctgg gagctggccc 5160 gtgtcgtagc agttcatccg gatcgagcag gactggttcg gaacgtgacg ctgcgtcgtg 5220 ggaagcacga gtaccaacgc tcggtgcaga agatctgtcc gcttccgaat tgagacgctg 5280 tctcaaggcg gggagga 5297 // ID BEL-63_CQ-LTR repbase; DNA; INV; 648 BP. XX AC AAWU01017436; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-63_CQ_; KW BEL-63_CQ-I; BEL-63_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-648 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 280-280 (2011). XX DR GenBank; AAWU01017436; Positions 46958 46311. XX SQ Sequence 648 BP; 174 A; 143 C; 163 G; 168 T; 0 other; tgttgaagat gtaacttaag aattgtaaat agtacggtta aggggaacat tgggttactt 60 aaaattagtt aatttagaat ttgtaaatat ctacctaata acctaaacat aaatgtaaat 120 ttgtgccaac acgatctgaa agtgatctaa agccgatcac acacatgcaa tacacacatc 180 gtacagcaga tcgctgtaca gctcgaagct ctatagggag aagaattgta cttagagtat 240 cagttcggaa taaacgatca ccgcgtacac accagtttgt gcccttacac gccgcgttct 300 ctctcggaat aaagtttgtt gacccgcaac acttttttct ttgctgtgtc cgaacggttc 360 gatcgcgcgt gaattcgcta agtgccctag aaatcgggac ttagccgatt tggaggcaaa 420 tcgctttgct cttggggatt gtggcgcatc ttgaccgacc tggcgaaagg tggcaagagc 480 ggccagcgct gaagatttgt ggtctgcgga cgaagaaggg cttgtggcga cccaagccct 540 cggtcaccgt ggatagaccg tgtgtgtgtg cgtgtgctta accggtgcta cacaaggtgt 600 agcaaggaaa gcccttcgat cccggcaccg ttcgctctgt gctgaaca 648 // ID Gypsy-66_CQ-LTR repbase; DNA; INV; 1152 BP. XX AC AAWU01038782; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-66_CQ_; KW Gypsy-66_CQ-I; Gypsy-66_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1152 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 512-512 (2011). XX DR Genome; AAWU01038782; Positions 3636 4787. XX SQ Sequence 1152 BP; 331 A; 288 C; 269 G; 264 T; 0 other; tgtaaccact cgttacaagt ccttcattct tcaacatcaa taaaccaaaa ataaatgcga 60 acgttcacga ttcttagcta tggggttagt agggaattag ggaataaata aataaatgta 120 atttagttag tagggttagt aaaaagggga aattaataaa taaaaacaca atttgttaac 180 acactataac aacacacaaa ctgtgggaca aaccaacaac cagcgacagt attgtcttcg 240 tcttaaaaca tcgcaaaggg ggtggtagac aacgcccaaa agtgatgggt gtcacgtgac 300 gtaatgtcgt gtcaccgacc aaaaggcttg aagggacgag aattagaccg ccaacggtca 360 cattctccca ggcaagtgag cgatctatac cgttagaaaa gtagagcgaa ggactagaag 420 tgaaagtggt ggaaatttgt gcgagtgcac ctcctgtcgt tcaggacctt ttgtcctttg 480 gcgtaaacgc cattttgcga gttaagcacg ccgcttgacc gccagatcgc tcgctcttcg 540 gcccagtttg cgtaagcgcg gaggagccgg cttactgcgt ctaatcgtca aagatcatca 600 gttgctctgg tccgtggacc ataatttcca aggagcacgc tcgggccctc gtggctctca 660 ttccaaagga gcgtaaattc cccctctcgg tttggtcaac gtgggcaggt ccccgatcgg 720 ccgatccagt caaggaggga aggcctggtc cctaagccgt gctctaccta ccatccaccc 780 ccaccacgtc gagaaccagc accggtgctt cagcagcgaa cgcgagcgct cgcagccagc 840 ggactggctg agtagcagat cgtcgtcgtc agcggaaaag cagcggtacg taccgagtcc 900 cccacacaca catacactta caaatataca cttttgaacc gacttaaatt ctttaaataa 960 atttcccctg ttgtctccca aaagaggagt ttcattcaac ctgtaacaca agaaaagtga 1020 agttcccaaa attaagttcg cgtatctcct cgttggtgac tagcgcagcc gaccctgcga 1080 ttaccgtgag tctgtgctgt gactagctaa ccgtaggcaa aaactaatgt gggtaagtcc 1140 cctaaagttt ca 1152 // ID BEL-87_CQ-LTR repbase; DNA; INV; 188 BP. XX AC AAWU01006055; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-87_CQ_; KW BEL-87_CQ-I; BEL-87_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 308-308 (2011). XX DR GenBank; AAWU01006055; Positions 2950 3137. XX SQ Sequence 188 BP; 61 A; 48 C; 36 G; 43 T; 0 other; tgttgggacg catttggcaa ccctgcagcg agtgacagct agaagacaga aattttgatt 60 cagtgcattt ccgcttttac aacacactca cactaacaca cagcacgaac aaccagcaca 120 gtttagagac ggcaagagaa taaaccagtc gcaaattaca gtccacgcgt ttttaattcg 180 tctctaca 188 // ID hAT-45_HM repbase; DNA; INV; 3686 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-45_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3686 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2033-2033 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 574..2793 FT /product="hAT-45_HM_1p" FT /translation="MDANKDHRRRLSTVKKWEKLLNCELEYDLNGDEVIRL FT RCSLCKKFEKRISQTKLFSMTWIKPGTISIKKDSLAAHLSSAQHKEATRIH FT QQTTLGSVSLFNHVIHNTPIGRGLRKMAVKDADSLRKKFNIAYYLAKRERP FT FTDYPYLIALEKTNGVTNLGNSYGTDRAAAIFTDYIGTVIKNKLTECLKNC FT RFYCVLCDGSTDSAVSEQELIYILYLKDGSPKVEFLSIETAENSNAVGLKK FT CITDAFTRVGITNSYKHLLGVNLDGANVNLGAYAGLGALLKEGSPWLEVVH FT CFNHRLELALKDAFESLSAFKTVDELLLQLYYLYQKSPKRYRELQGLAEAW FT GNSVPKPTNACGTRWIDHKYKAMKIALENYGVFMTHIESLAITDSQIAKRA FT ELKGFLKRWKNASVPINLSIYLDVLSPLRRLSLSFQNNIHDPVKAVRRVQE FT FTWTMAKLQILLENSLDNFLDDTSVMTHYKRLSKDIETIDSKQCYQNIPLS FT QYTEVNSHVLDQYKLIIANITICMEGRFQSLQKSPLFMHLVALLDVYNWPL FT NVNDGTFGDKAIGEVVEHFSEFLIGVGCDLNNILHEWIILKAYMFPIISNN FT QNEYYLKIWEIVFSKSSLVKECRNILDIFELFLICPFTNAKVERMFSCMNR FT VKNDWRSCLRRDRLESLLRISEEGPEIDKFEPDTAMDEWYNDKVRRLSANF FT HRYPEKRKRKSGKTDIDLATVTISDLEESESEEEYCDNF*" XX SQ Sequence 3686 BP; 1337 A; 513 C; 618 G; 1218 T; 0 other; cagggctcga attaagcgta tcgctatcgc gattcgcgat tacttttcgc accttcgcga 60 ttagtttttt cttaaaattt gttttttcgc gacctttctt tttttcgcga ttattctttt 120 atcattttag aaaaatctaa ttttagataa ttaatattaa tcgcgaatat ttagcgttct 180 tttacagtat aaaactgcag tgacttctgt aacaataaat ttttttaaaa gctgagcata 240 gagtaaaaga aagaaaaata aaatatttag aaaatcaatt agcaaaaatg tattagtaaa 300 aatgtaaaaa aatacctttt cgaatcaagt aataaaagat tttgattaaa aaaagaaata 360 aaagttaatt gattgaaaag tcattttctt aactcgaatt ctcaagggct ttattctgat 420 tttgctcatt cattatgaac aaatttaatg atatttttat aagtttaatt tgcgaagaat 480 tgtggaaggc aatacagact ctaaactagt tattaaaaaa aaactctcta aagtttgtta 540 gtatacttgc taactttaaa ttaatagcaa taaatggatg caaataagga ccacagacgc 600 agattatcta cagttaaaaa gtgggaaaaa cttttgaatt gcgagttaga atacgattta 660 aatggagatg aagttataag attaagatgc agtttatgta aaaaatttga aaaacggata 720 agtcaaacaa aattgttttc tatgacctgg atcaaaccag gaacaatatc aataaaaaaa 780 gactcactgg cagcgcactt aagtagtgcg caacataaag aggccacgcg aattcatcaa 840 caaacaacat taggttcagt ttcattattt aaccatgtta ttcataatac tcctatcggg 900 cgtggtttac gtaaaatggc cgtaaaagac gccgactcgt tacggaaaaa gtttaatatt 960 gcatattact tggcaaaacg agagcgtcca ttcacagatt atccttattt gattgcatta 1020 gaaaagacta atggggttac aaatttagga aattcttatg gcactgaccg tgcagctgca 1080 atatttacag attacattgg caccgtaatt aaaaacaaat taactgaatg tttaaaaaat 1140 tgtagatttt actgtgttct ctgtgatgga agcacagatt cagctgtatc ggaacaggaa 1200 ttgatttaca ttctttattt aaaagacgga agtccgaaag tagaattctt atctattgaa 1260 acggcagaga attctaatgc tgttggcctg aaaaaatgta taacagacgc ttttacaaga 1320 gttggcatta ccaattctta taaacatcta ttgggtgtaa atcttgatgg agctaacgtt 1380 aatctaggag cttatgctgg tttaggcgca ttactaaagg aaggttctcc ttggctggaa 1440 gttgtacact gttttaacca tcgacttgaa cttgctttaa aagatgcgtt tgaaagttta 1500 tctgccttca aaactgttga cgaactcctt ttacaacttt attacttgta tcaaaagtca 1560 ccaaagcgct accgagaact acaaggatta gccgaagctt ggggtaatag cgttccgaaa 1620 ccaacaaatg catgcggaac tcggtggatt gatcacaagt ataaagcaat gaaaattgct 1680 ttagaaaatt atggtgtgtt tatgacacat attgaatcat tggctataac agactcccag 1740 atagcaaaga gagcagagtt aaaaggtttt ttaaagcgat ggaaaaatgc gtctgtgcct 1800 attaatttat caatatatct tgatgtttta tctccccttc ggcgtttgag tttaagtttt 1860 cagaacaata ttcatgatcc agtaaaagct gtaagaagag tccaagaatt tacatggaca 1920 atggccaagc tgcaaattct tctagaaaat tcgttagaca attttcttga tgatacaagt 1980 gttatgaccc attataaaag actgtctaaa gatattgaga caattgattc aaagcaatgt 2040 tatcaaaata ttccactttc tcaatatacc gaagttaata gccatgttct tgaccagtac 2100 aaactaatta ttgcaaatat taccatatgt atggaaggcc gttttcaatc tctccaaaaa 2160 tcaccattgt ttatgcattt ggtagcttta cttgacgttt ataactggcc gttaaatgtg 2220 aatgacggta cttttggcga taaagctatc ggtgaagttg ttgaacattt ttctgaattt 2280 ttaattggtg taggatgtga tttgaataat atacttcatg agtggatcat tttaaaagca 2340 tacatgtttc ccatcataag taacaatcaa aatgagtatt atttgaaaat ttgggaaatt 2400 gttttttcca aatcaagttt ggtcaaagaa tgccgaaata ttcttgatat ttttgaacta 2460 tttttaattt gtccttttac aaatgcgaag gtagaacgaa tgttttcatg tatgaatcgc 2520 gtaaaaaatg actggagaag ttgtcttaga cgagatcgtt tagaatctct acttcggata 2580 agcgaagaag gtcctgagat tgacaaattt gaacctgata ctgctatgga cgaatggtat 2640 aatgataaag ttagacgatt gtcagcaaat tttcatcgtt atccggaaaa aagaaaaaga 2700 aagtcaggaa aaacagacat tgatctagcg acagttacaa tatcagattt ggaagaaagt 2760 gagagcgagg aagaatattg tgacaacttt taattttatt ttagcgcatt gtaacccctt 2820 aaaattaaaa tatcaaataa caactttaaa tagcgtatga tttttagctt cgtaaaaata 2880 cttttacgtt agtaaaaatc ctgaaaaatt ctaagcatta aactagtgaa attattacaa 2940 aacgataaaa atatttacga ttacgttcta atactttttc tattaacaat caaaaaaagt 3000 attagatcgt aaaagattct attatttaga cacatttttt tctttcgctt tattttattg 3060 aatgaaaaaa gtgaacgcta acttttatta aaaaatgcat tgaatgaaaa agcaatagat 3120 aaaaatatca tcggaatttg ataaggttac ttagataaga aaactgactt aataagaatt 3180 taattatcat tcttgtttgt tgcgcaggtt ttcgtcaaaa tgttaagttt taacatttta 3240 tcacaataat gttttcgtca tttttaatag ttttttataa ctgtttcaaa attagagttt 3300 taaataattt ttttttactg ttaaaaattt atagaaaatt aaatattcta gttgttttaa 3360 acagtttaaa atatgtaaat aaaatagaag atgaaataat taattaaaag aaggttagga 3420 atatcaagtt cattgatttt acaaaaatac agccgttaga tttaatgcaa tatgataata 3480 gtaataatgt aaactataac taaagctatt ttacaaccgc taaccgtaaa tgttatgaca 3540 tcttatgcaa aaatgagact tattatgtat gttaatataa ataagttaaa ataaataatg 3600 cccccccccc cacggggagg agggggaagg agttatgagc aacaaaaaaa tcgcgattag 3660 atttttttac gttaattcga gccctg 3686 // ID Transib-4_HM repbase; DNA; INV; 3931 BP. XX AC . XX DT 30-JAN-2008 (Rel. 13.01, Created) DT 30-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3931 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 4-4 (2008). XX DR [1] (Consensus) XX CC Transib-4_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome just a few CC million years ago (they are ~4% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of ~20 copies; it codes for a 656-aa Transib CC transposase. Like other Transib transposons, Transib-4_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1361..3328 FT /product="Transib-4_HMp" FT /note="Transib transposase." FT /translation="MSSINRNQIHLFIRENKLSLSNEKDKLVVFNFISEQL FT GNSKCKVQFSKFTFEYRKRWAASGRYNDIFLDKNLAWLCKDFISPQQTFSL FT CNTETTPKLSLGRPCKSFEESSDKSKKRKVSKLESLASSSNELLFASSCKL FT WKQGRRKHAKVVRNLSKTQTDEHVQNTPNEALALILDNNLSKSDYQRLRNN FT AVRKGCNLYPAYNNVREAKVLCLPENNLWTVTDYSAEVNMQALVNHTALRL FT IECQAEVINSLEDLQRMTLRSKVGFDGSTGQSLYTQITTEEENRNITNEAS FT LFLTCFVPLQLSGYCNGEKKIIWSNPHPSSTLFCRPIRFSYKKETSDVLKE FT EEQFLYNSIEKLFTTKYDKLTVLHKIEITMVDGKVATALSTATKSSQCCSV FT CGCNPKKMNDLNTAKNLPITKTGLNYGLSTLHAWIRTMECLLKISYKLTIQ FT KWSTREPSEKKIIQDRKTSIQKKLRDEVGLLIDFPRTCGSGSSNTGNTARQ FT FFQNAEKTAEILQLDVIIIKMLHVILCTLSSQYVIDSSRFRDFCMQTAERY FT VNQYPWYHMPQSLHRILIHGWQVVDRMALPIGMLSEEAQEARNKDFKKFRE FT SFSRKCSRSKTNEDLLRRLMCSSDPIISNLRTPHHPKKKDFPDGVLELLKE FT VPLKDFV" XX SQ Sequence 3931 BP; 1462 A; 615 C; 609 G; 1245 T; 0 other; cacagtgggc cactgactgg acaaaagtat aaaaatcaaa gtcagaattt tgatgtcgta 60 atggttttaa acctttttat agatattatg aaacaattta aaacctttta tatacatcta 120 aaattgtaat gaatagacta attatgtggt taaagtatat gcaaacgtaa attagaaaaa 180 aaaaattata aagcaaaaaa tgttgtttac aaaatctatg gtaataatgt aacactgtat 240 taaatattgc gcattgaaaa ttattttatt gttgaactgt ctattttctg cattttaaga 300 gttgttagat taaaataggt aagaaaacaa ataagttata ataattttaa atacttcatt 360 aaaagaatat ttgaagtgta ttttttggtt taaaaaatac aaaaaataga agatcttgta 420 gtatcctgtt attttatata catattttga agcttttata tccatattat gaaaatatgt 480 ttaatttagc tgctaatatt tgctaaatta atttttattt gcaatgattt gcaaaaatat 540 tcaggggcgg atccagcatt ttttggtgtt tgagatagct aacttccaga catggagcaa 600 actccaaaca tttatatgtt tatacttata tcaaataagg atggtttgca aaaaattgcg 660 atgattggag cctgggaaca aaaaagcccc aaaatttgag acgtacctgc cccaaacgtg 720 agagcaacaa gttgataaaa tattttttgt actattctca acaagactta tatttatcca 780 taaattttaa atttcattgt aaaatatttc agcgttcaaa aattatgacc atgtaaagtt 840 tgaatccccc tcaattcgag ggggttacaa tttttatatg gtcataattt ttgagcgcta 900 aaatatttta ctatgaaatt taaaatttat tgataaatat aagtcttgtt gagaatagta 960 caaaaaatat tttatcaact tgttgctctc acgtttgggg caggtacgtc tcaaattttg 1020 gggctttttt gttcccaggc tccaatcatc gcaatttttt gcaaaccatc cttatttgac 1080 ataagtataa acatataaat gtttggagtt agctacatgt ctggaagtta gctatctcaa 1140 acaccaaaaa atgctggatc cacccctgat attgtatata aaatataaca tcgcgaacaa 1200 aataatacaa tatgttataa tataatgaat attttgattc aaaaataaga ctatttaatt 1260 atagttaata ttataataat taaaagaaaa tatttctaca taaaataaat gaaaacctat 1320 tgataaattg ttttattttt aactttagac tatttttatt atgtcaagca ttaatagaaa 1380 tcaaatccat ttatttataa gagaaaacaa attatcatta tcaaatgaaa aagataagct 1440 ggtagttttt aattttatta gtgaacaact tggcaattct aaatgtaagg ttcaattttc 1500 aaaatttact tttgaataca gaaagcgatg ggctgcaagt ggacgttaca atgatatatt 1560 tttggataaa aatttagcct ggctttgcaa agactttatt agtccccaac aaaccttttc 1620 tttatgcaat acagaaacta cacctaaatt atctcttggc aggccttgta aatcatttga 1680 agaatcatct gataaatcta aaaagaggaa agtgtcaaaa cttgaatcat tggcaagttc 1740 ttcaaacgag ctgttgtttg ctagcagttg caaactttgg aaacaaggaa ggagaaagca 1800 tgctaaggta gttagaaatt tatcaaaaac tcaaactgat gaacatgttc aaaatactcc 1860 caatgaagcc ttagcactca tcttagataa taatctgagc aagtctgatt atcagcgcct 1920 aaggaacaat gctgtaagaa aaggctgtaa tttatatcca gcttataata atgtacggga 1980 agcaaaagta ctttgtcttc cagagaataa cctatggact gtcactgact attctgcaga 2040 ggtcaacatg caagcattgg tcaatcatac agcactacgt ttaatagaat gtcaggcaga 2100 agttattaac tccttagaag atcttcaacg aatgacccta aggtccaaag ttggctttga 2160 tggatcaact ggccaaagcc tttatactca aataacaaca gaagaagaaa acagaaatat 2220 tacaaatgaa gccagcttat tcctcacttg ctttgtgcct ttgcagttat ctggatattg 2280 taatggcgaa aagaagatta tttggtccaa ccctcatcca tcatccactt tattttgcag 2340 accaataaga ttctcctata aaaaagaaac atcagatgta ttaaaagaag aagaacaatt 2400 cctttacaac tcaatagaaa aattatttac tacaaagtat gacaaattaa cagttctgca 2460 caaaattgaa attacaatgg tggatgggaa agttgccaca gcactatcca cagcaacaaa 2520 atcatcgcaa tgctgctcag tttgtggttg caatccaaaa aaaatgaatg acttaaatac 2580 tgcaaaaaat ctcccaatta ctaaaactgg actcaattat gggctttcaa ctctccatgc 2640 atggatcaga actatggaat gccttctcaa aataagctac aaattaacaa tacagaaatg 2700 gtcaacacga gaaccatcag aaaaaaaaat aattcaagat agaaagacat caattcaaaa 2760 gaaacttcgt gatgaagttg ggcttttaat tgattttcca aggacttgtg gttcaggatc 2820 ttcaaacact ggcaatactg ctcgacaatt ttttcaaaat gcagagaaga ctgctgaaat 2880 tctccaacta gatgtcatca tcattaagat gttgcatgtt attttgtgca ctttatcgtc 2940 acaatatgtt attgattctt ctcgatttag agacttctgc atgcaaacag ctgaacgata 3000 tgtgaaccag tatccatggt accacatgcc acagagcctg caccgaattc ttattcatgg 3060 ttggcaagtt gtggatcgaa tggctttgcc aattggcatg ctcagtgaag aggcacaaga 3120 agcacgtaac aaggacttca agaagttccg ggagtctttt tcaagaaagt gctcccgttc 3180 taaaacaaat gaagacttgt taaggagatt gatgtgttca tctgatccaa ttattagtaa 3240 tttgagaaca ccacatcatc caaagaagaa agactttcca gatggagtgc tagagctact 3300 aaaagaagta ccacttaaag attttgtgta aatatattat attattacat ttttactgca 3360 atgttacttg aattgtgttt ctgttttcat tttttctgct tatctgttta ctatattttt 3420 attgctacaa attaacgatg ttataacgac attttgttta ccccattgta actaaggata 3480 ttgtaactaa gcaatataat tatccatagc aacaaaataa aaaaatagca ccttcataaa 3540 aaatgcacac atcatattag tactcatgcc aaatttagtg aatatagcac tcgtaaaata 3600 tctgcaaata ccaaaaatag gttgttgcta agcaaaataa aggtatccat agcaacaaaa 3660 taaaaaaaaa caccttcata aaaagtgcag acatcatatt agtacccatg ctaattttag 3720 taaatatagc actcgtgaaa tatctggaaa tcccaaaaat aggttgttgc taagcaaaat 3780 aaaggtatcc atagcaacga aataaaaaaa tagcaccttc ataaaaaggg cagacatcat 3840 attagtactc gtgctaattt tagtgattat agccctaata ttaaattagt tatgaatttt 3900 gtccagtaaa agtgaattct ggcccactgt g 3931 // ID DNAX-10_AP repbase; DNA; INV; 224 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-10_AP. XX NM DNAX-10_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-224 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2065-2065 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD unclear. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 224 BP; 64 A; 51 C; 44 G; 65 T; 0 other; taaagccgga ttcacagctg cgtcgtgtgt cgtgtcgtgt cgatcaaatc gtattgtgat 60 tggatattac ttttttggta tcatgtaata accaagttgc aaccgagtat accaagaatt 120 gactactgct caacttttaa accatattgg ttaccagtcg taaccatata ataaccaagc 180 cctcgcacga cacgacacac gacgtagctg tgaatgcgcc ttta 224 // ID Kiri-3_CQ repbase; DNA; INV; 5673 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5673 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 122-122 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 104..883 FT /product="Kiri-3_CQ_1p" FT /translation="MSNRSTRSNSVPSLASLFELENGNQNKRSRKELNKDS FT VPTMSLEDLWTKIESKMDSFKQDFDKRIDGLETQLSQLKTECTARIDDLSE FT AVVEVRADLNLASNWIGRVEKYQDLIITGVPYSPTENLKTVFRDIAAKLAY FT DPLDVPMVDLKRLAKPPIAAGSAPPILCQFAIRNERNAFYSKYLSLRNLNL FT EHIGFNNKNRIFINENLTPQDREIRTAAIKLKKEGRIQQVFSRDGVVHVKS FT RGGPAEACYTVEHLRSCSK" FT CDS 2668..5496 FT /product="Kiri-3_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MASSTTGNASANQYITRAVVNAIFDNTKLNICQINVQ FT SLCARNLVKLEELRRVFKDSKADVVCFTETWLDTSITDSLIAIDGYNLIRN FT DRNRHGGGICVYYREGLQCRVVEKSKSVRDTSSTEFLLLEIECLNDKLLMG FT VYYNPPTVDCSEILSAQISKYSMKYKACFLIGDFNTDPNRSGAKSKRFNDV FT LSGLAFHFANREPTYFYQTGCSLLDLLLTDSPNQISKFNQVSMPGISHHDL FT ILASLEFSFTIPENRSYYHDYNNINYAELNEAFSRINWANYLNCAEPDILL FT EFFNVNMLNLFTQFVPLKEFKVRKNPWFNAVIEKAMIDRDLSYRNWKRSKS FT IEDKNHFKILRNRVNSLIEKAKSDHSNRILNTNVPSKKLWSNIKNLGIASK FT TKSPVVCNSSVENINQYFSSNFSPSQGDFNSNLAAVDGFHFRAVEDYEIVN FT AISSITSNATGLDNIPIRFINLMLPLILPIVKHLFNTIITTCIFPRDWKKI FT KVIPIKKKSSSCDVTNLRPISLLSSLSKAFEKIIKYQLTEHVNRYDLLHPL FT QSGFRKGHSTETALIKVHNDVARCIDRRGVAVLLLIDFSKAFDRVSHTKLL FT NKLATSFGLSRPAVLLIESYLRDRSQSVFLNGFHSSFVDILSGVPQGSVLG FT PLLFSLFINDLPPVLKFCSVHLFADDVQIYLCSDANINIADMQRKINSDLL FT KVQSWSEKNLLAINPAKTKALLISRLKTPPNPPELFLKNELLEFVDHACNL FT GVIFRNNLDWDKQINSQIGKIYGSLKQLSLTTRFLNSHTKLKLFKSLLYPH FT FIFGDFIYPNATVESLNKLKVALNSCVRYVYNLSRYHRVSHLQKNLIGCSF FT SDFYKYRSCLTLFRVMHTGEPNYLSSNIIPLRSSRSKQFLIPQHYSTYYSQ FT SLFVRGISTWNLLPLNIKTSMSIPYFKRKLLAELNNPNQ" XX SQ Sequence 5673 BP; 1823 A; 1199 C; 987 G; 1664 T; 0 other; gtgctgatgt ttacacttcg taaatcatta agttgaactc agccagcacg ccacaagcta 60 tactcccttc agaagtctct ctgagttcca gtgctctgct ataatgagta accgctcaac 120 tcgctctaac tcagtgccgt cactggcatc actgttcgaa ctcgagaatg ggaatcaaaa 180 caaacgatcc cgcaaagaac ttaacaagga ttctgtgcca acaatgtcgc tggaagacct 240 ttggaccaaa attgagtcca aaatggactc gttcaagcag gatttcgaca aacgcatcga 300 tggactggaa acgcaacttt cgcagctgaa gacggagtgc actgcaagga ttgatgatct 360 ttcggaagct gttgtagaag tgcgcgccga cttgaatctc gcttcaaact ggattggacg 420 ggtcgagaag taccaggacc tcatcattac cggcgtaccc tactcaccga cggaaaatct 480 caagactgtc ttccgtgata tcgcagctaa gctcgcatac gatccactcg atgtcccaat 540 ggtagatctc aaacgtttgg ccaagccccc catcgctgcc ggatccgcgc cgccgatttt 600 gtgccagttc gccattcgca acgagagaaa cgccttctac agcaagtacc tcagcctaag 660 gaacctcaac ctcgagcaca tcggcttcaa caacaagaac cgcattttca tcaatgagaa 720 tctaaccccc caagatcgcg aaatccgtac tgctgctatc aagctgaaga aagaaggtcg 780 aatccagcaa gtgttcagca gagatggagt tgtccacgtc aagtctagag gaggtcctgc 840 tgaagcttgc tacaccgttg aacatctgag atcctgcagc aaataatccc tttccaacaa 900 gctactcttt tccttccatt cttcccttgt atccaatccc caaaattccc ccatgttttc 960 ttccgtccta aaagcaaaaa aaaaaaaaga gaaatgcaaa aaaaaaagaa atgcaaaaaa 1020 aagaaatgca aaaaaaaatc caaaaaaaac aaaaaaaaaa ttaacaaaat aaaaaaacac 1080 aaaaaaaata caaaaaataa ctattaaaaa gtccgtaggt gacacaagcc aagtgtcgct 1140 gagatagagg actcagccgt aggagaaacc cgactccttg tgggtcggtc gcctgaccta 1200 caatgcgtcc tgcggtcact ccgtaggggt gcgcaggacg tggtaggtta agtggttact 1260 ccgtaggggc gagctgaacc cccgagccag tggaggccac tgacggctgt tccaaaacaa 1320 acccttttcc ctttacgctt ttctttttcc ttccccaagt ttctcctatg aatccgtttc 1380 ccagttatcc ttgactccat tttcctctcc taaaagtaaa aaaaaaaaaa aaaaaaaaac 1440 caaaaaaaaa caaaaaaaaa attaaaaaaa aagaaaaaaa caaaaaaaaa ttaaaaaaaa 1500 acgccaaaaa aacaaaaaaa aataaccatt aaaagtccgt aggtgacaca agccaagtgt 1560 cgctgagata gaggactcag ccgtaggaga aacccgactc cttgtgggtc ggtcgcctga 1620 cctacaatgc gtcctgcggt cactccgtag gggtgcgcag gacgtggtag gttaagtggt 1680 tactccgtag gggcgagctg aacccccgag ccagtggagg ccactgacgg ctgttccaaa 1740 acaaaccctt ttccctttac gcttttcttt ttccttcccc aagttctcct atgaaccgtt 1800 tcccagttat ccttgactcc attttcctct cctaaaagta aaaaaaaaaa aataaaaaaa 1860 accaaaaaaa acaaaaaaaa taaaaaaaag aaaaaaacaa aaaaaaaacg ccaaaaaaat 1920 acaaaaaaaa taaccattaa aagtccgtag gtgacacaag ccaagtgtcg ctgagataga 1980 ggactcagcc gtaggagaaa cccgactcct tgtgggtcgg tcgcctgacc tacaatgcgt 2040 cctgcggtca ctccgtaggg gtgcgcagga cgtggtaggt taagtggtta ctccgtaggg 2100 gcgagctgaa cccccgagcc agtggaggcc actgacggct gttccaaaac aaaccctttt 2160 ccctttacgc ttttcttttt ccttccccaa gtttctccta tgaatccgtt tcccagttat 2220 ccttgattcc attttccctt cctgaaagtt aagttccggt ggaccctgag acgagtatat 2280 ccccgtgcaa tctggattgt cggatcctgg accatttgcg atgttggcct gttgctgttg 2340 ttgctgctgt gttggtgctg agatgtcggt tgttgctgcc ctcctgacgt ggcgatgtga 2400 tgtttcgttt gctggttcct tcctgcgggc ggctatcgtt gattcagtgc tgatcaaagt 2460 tctttactgt gatcctaacc tctgagtgag ttattgccag caccattcac tagttttttg 2520 gaggagttct ttgatatgtt tctatgcttt ttcaatttaa aaattatagt tttagtttta 2580 gaattaagtt tattttctca aatgttgaat ctgattctgt tgttgtgtac aggttaggtt 2640 tgcgccgtcc ttcgatttac ctttataatg gctagttcaa caactggaaa tgcctcagct 2700 aaccaatata taacaagagc tgttgtaaat gcaatctttg ataatactaa actaaacatt 2760 tgtcaaatca acgtacaaag cttgtgtgca agaaatttag ttaaacttga agaattaaga 2820 cgagttttca aggatagtaa agcggatgtt gtttgtttta ctgaaacatg gcttgatacc 2880 tctattactg attctttaat cgcaattgat ggttacaact tgatacgaaa tgatcgtaat 2940 agacacggag gtggcatttg cgtttattac agagaaggcc ttcaatgcag agttgtcgaa 3000 aaatcaaaat ccgtacgtga tactagttct accgaatttt tgttacttga aattgaatgt 3060 cttaatgata aattgttaat gggagtctat tacaatcccc caactgttga ttgctctgaa 3120 attctttctg cgcaaatttc aaaatactca atgaaatata aagcttgctt cctaatagga 3180 gattttaaca ccgatccaaa tagatctggc gcgaaatcaa agcgcttcaa tgatgtttta 3240 agtggactag cttttcattt tgctaatagg gaaccaacat atttctatca aacaggatgc 3300 tctcttcttg atctactatt aactgattcc ccaaatcaaa tttctaaatt caaccaagtc 3360 tccatgccag gcatttctca tcatgatcta atcctagctt cacttgaatt ttcttttacc 3420 atacctgaaa atcgttctta ttatcatgac tataataata ttaattatgc tgaattaaat 3480 gaagccttca gcagaataaa ttgggcaaac tatttgaatt gtgctgagcc agatatactt 3540 ttagaatttt tcaatgtaaa tatgctaaat ctatttacgc aatttgttcc tttaaaggag 3600 ttcaaagtac gcaaaaatcc atggttcaat gcagtcattg aaaaggccat gattgataga 3660 gatctatcct ataggaactg gaaacgtagc aaatccattg aagacaaaaa tcatttcaaa 3720 attttaagaa atagagtaaa ctctttaatc gaaaaagcta aatctgatca tagcaatcgt 3780 attctcaaca caaatgttcc aagtaaaaaa ctatggagta atattaaaaa cttaggaata 3840 gcaagtaaaa ctaaatctcc agttgtctgc aacagctcag ttgaaaatat taaccaatat 3900 ttctcttcaa acttttcgcc ttctcaaggt gattttaact ctaatttagc cgctgtggat 3960 ggttttcatt tccgcgctgt tgaagactat gaaatagtga atgcaatctc ttcaattact 4020 tctaatgcta ctggattaga caacattcca atacgattta ttaatcttat gcttccactc 4080 atcctaccaa tagtcaaaca tctatttaac acaattatta caacttgtat ttttcctcgg 4140 gactggaaaa agattaaagt aattccaatt aagaaaaaat catcgtcctg tgatgttact 4200 aatttaaggc ctatcagttt gttaagctca ctttctaaag catttgaaaa gataattaaa 4260 tatcagctta ctgaacatgt taataggtat gatcttcttc atccattgca gtctggattt 4320 agaaaaggtc atagtacaga aactgctttg attaaggtac ataatgatgt agcacgttgt 4380 attgatagac gaggcgttgc tgttcttctt cttattgact tttctaaagc gtttgatcga 4440 gtatcccaca ctaagctctt aaacaaattg gccacatcat ttggactttc ccgacctgcc 4500 gtgttgctaa tcgaatctta tctaagagat cgatctcaat ctgtttttct taatgggttt 4560 cattcttctt ttgttgatat tctctccggt gttccgcaag gttcagtact cggaccttta 4620 cttttctcgt tgtttataaa tgatttacct ccagttttaa aattttgttc tgtgcacctc 4680 tttgctgatg atgtgcaaat ttatctttgt tctgatgcaa atattaacat tgctgatatg 4740 caaagaaaaa taaatagcga tttactaaaa gtacagagct ggtcagaaaa gaacttgtta 4800 gcaataaatc ccgcaaaaac taaagcctta ttaataagtc gtctcaaaac acccccaaat 4860 cctccagagc ttttcctcaa aaacgaactg ctcgaatttg tcgatcatgc ttgtaatctt 4920 ggcgtcatct ttagaaataa cctagattgg gacaagcaaa ttaattctca aattggaaaa 4980 atttatggct ctctgaaaca attgagttta actacaagat ttcttaattc acatactaaa 5040 cttaaactgt ttaaatctct gttataccca cacttcattt ttggtgattt tatatatcct 5100 aacgcaactg ttgagtccct aaataaactc aaagtggctc taaattcctg tgtcagatat 5160 gtttacaatt tatctcgata tcaccgagta tcacatttac agaaaaattt aattggctgc 5220 agtttctctg acttttacaa gtacagatca tgtctcacac tctttagagt aatgcacaca 5280 ggtgaaccta attatcttag ttccaatata attcctctaa gatcatccag atctaaacaa 5340 ttcctaatac ctcaacatta ttctacttat tatagtcaat cattattcgt tagaggcatt 5400 tcaacctgga atctattacc tttaaatatc aaaacaagta tgtctatacc ttatttcaag 5460 cgcaaactcc tggccgaact caacaacccg aatcagtgat acatttgaga aaatgtgtag 5520 taaatttgag aaaattagca ttaatttatt taagttaagt gaaattgaat aactccatca 5580 ctaagtgaat tcccgccaca atgtaatgaa ttaaaaggca cgtgccttat attacatgaa 5640 taaataaaca aataataata ataataataa taa 5673 // ID piggyBac-7_SM repbase; DNA; INV; 2588 BP. XX AC . XX DT 29-MAY-2008 (Rel. 13.05, Created) DT 29-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-7_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2588 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 526-526 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-7_SM is a very young family of piggyBac transposons, CC characterized by 14-bp TIRs (one mismatch) and TTAA target-site CC duplications. The consensus sequence was reconstructed based on CC multiple alignment of 8 copies (they are ~99.8% identical to the CC consensus). This transposon is likely currently active; and the CC consensus sequences is a good approximation of the active CC transposon. XX FH Key Location/Qualifiers FT CDS 595..2340 FT /product="piggyBac-7_SMp" FT /note="piggyBac transposase." FT /translation="MTKQMKIEKLLEKIRNIPEGFSDIEEDKLYDSDGDLI FT ESNNGPYGSDDYMFDSGDNDSDHEESSASDSEYENILNIPKRKRNQQLSTS FT SEEEVEEDEIELSADGTVWRKLNEGSSGGRLPLHNIFKDTPGPTGYAKRNI FT MKGSASSAFSLIIDRNIMEYIIKCTEAEAFRVFGKKWTLTEEKLKAFMAIV FT CARGAYEAKNLKLSYLWNKKWGPKFFADTMSRNDFTEILRFIRFDNKNSRS FT QRLQTDKFALISKVWNKFIENSQNCYKPGQNITVDEQLFATKARCRFTQYM FT PNKPNKFGIKFWLAADVQSKYIINGFPYLGKDETRSSSTSLSEFVALKLME FT PYTMKGRTVITDNFFTSVSLASKLLAKRTTLVGTIRSNKRELPKFVKIKKD FT NMQCYSTLLYKANDFILTIYKSKPQKKVLVLSSKHKSVQINKNKKLLPETI FT TFYNKNKCGVDSSDQMARKYSTKSGSRRWPLQVFFNILDLAIINAWILYKK FT TTAINIARKDFMFQLADELAADFVKSRSQIERHNQQPLDESTNTRKQCQTG FT FCNKNKTTLMCDKCKRYVCGKCISVKKIICKRCEE" XX SQ Sequence 2588 BP; 983 A; 376 C; 439 G; 790 T; 0 other; cactagattt accgagaccg tcaaaacgac ggttttcaat tttcttgcta aaaaattaca 60 taattacaat taatattttt tcgaaataat tgcatgactt ttctacatat gtagcggtaa 120 ttaattttat aaattgtttc tatttttaaa tgtatccaaa atgcaaattt ttttgtgcta 180 tacctatttc taccaacacc cgtcaaaatg acggctacta atatttaaga caaatttgat 240 atttgttgta cggtttattg ctatattcgc cagtatgatt ttagaagttg ataaataata 300 ccttgttatc tcaaaaatga atcatttatt ctatatacta catattttct gttgaaaaag 360 aaaacaaatt ggttgtcaca tgatgagtga tctcacaggc tccatatgat ttcgttgaaa 420 actgaggttc tcaccgcata tgctcagtca gtaattttag ccattggcag atatagtagt 480 atttatccgc gccagttcat ctccaccctc acgatttcaa cagaggtaag ataatttaga 540 aaaaataaaa aatgtttttg gttttatatt tttgttattt atcaggccgt aaaaatgacc 600 aagcaaatga aaatagaaaa gttattagaa aaaatacgta acataccaga aggattctct 660 gacattgaag aagataaact ttatgatagt gatggcgatc taatcgaaag taataacgga 720 ccgtacggta gtgatgatta catgtttgat agtggtgata atgacagtga tcatgaggaa 780 tcttctgcaa gtgatagtga atatgaaaat attttgaata taccgaaaag gaaaaggaac 840 caacaattat ctacaagttc agaagaagaa gtagaggagg atgaaattga attgtctgcg 900 gatggaacag tctggagaaa actaaacgaa ggatcttctg ggggaagatt accactacat 960 aatattttca aagatacacc tggtccaaca ggatatgcca aaaggaacat aatgaaaggt 1020 tctgcaagta gtgcattttc tttgattatc gacagaaata ttatggagta tataataaaa 1080 tgcacagaag cggaggcttt tagagttttt ggtaaaaaat ggacacttac agaagaaaaa 1140 ttaaaagctt ttatggctat agtatgtgca cgtggagcat atgaagccaa aaatctaaaa 1200 ctttcctatt tatggaataa aaaatggggg cctaagtttt ttgcagatac catgagtagg 1260 aacgatttta ctgaaatttt aagatttatt cgtttcgaca acaaaaattc tagaagtcaa 1320 cgtttgcaaa ctgataaatt tgctttaatt tcgaaagtgt ggaataaatt tattgaaaat 1380 agccaaaact gctacaaacc agggcaaaat ataacagtag acgaacaact atttgctaca 1440 aaagctaggt gcaggtttac gcagtatatg ccaaataagc ccaacaaatt tggaataaaa 1500 ttttggctgg cagcagatgt ccaaagcaaa tatataataa atggatttcc atatttaggt 1560 aaggatgaaa ctcgttcctc ctctacttca ttatctgaat ttgttgcctt aaaactaatg 1620 gagccatata cgatgaaagg gagaactgta ataacagata atttttttac aagcgtatcg 1680 ttagcatcga agctgcttgc aaaaaggacc acgttggttg gaactatacg tagtaataaa 1740 cgagaactgc caaagtttgt aaaaataaaa aaagataata tgcaatgtta ttcaactttg 1800 ctttacaaag caaatgattt tattctcacg atatataaaa gcaaaccaca aaaaaaagta 1860 ctagtgctaa gctcaaaaca taagtcggtt caaattaaca aaaataaaaa gctattgccc 1920 gaaacaatta cattctataa caaaaataaa tgtggtgtgg attcatctga tcaaatggcc 1980 agaaaataca gcacaaaatc tggatctaga agatggccac tacaagtatt cttcaatata 2040 ctagaccttg ccataataaa tgcttggata ttatataaga aaacaactgc aataaatata 2100 gcaagaaaag attttatgtt tcaactggcg gacgaacttg ctgcagattt cgtaaaatca 2160 agaagtcaga tagagaggca taaccagcaa cctcttgatg aatcaacaaa tacaagaaag 2220 caatgtcaaa cagggttttg caataagaat aaaactactc ttatgtgcga taaatgcaaa 2280 cgatacgtat gtggcaaatg tatatcagta aaaaaaatta tttgtaaaag atgcgaagaa 2340 taaaaatcaa aatcaaacta aattttaaaa aaagaatatc gttcctgaat tattattatt 2400 attttttctt taattgtttc tgttgtaaaa tactataata tgcttgttat aaatgtttta 2460 ttgtattttt attatttaat aaacaattat acatattgcc tctcaaattg tgtaaagtat 2520 ttcgtaaccc gtcattttga ccgctcatgg tagaactagg tatatcaaaa agtccggtag 2580 atctagtg 2588 // ID Harbinger-N1A_BF repbase; DNA; INV; 875 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N1A_BF autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Harbinger-N1_BF; KW Harbinger-N1A_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-875 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-875 RA Kapitonov V. and Jurka J.; RT "Harbinger-N1_BF - a family of autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 812-812 (2008). XX DR [2] (Consensus) XX CC It is a subfamily of Harbinger-N1_BF. XX SQ Sequence 875 BP; 259 A; 175 C; 171 G; 270 T; 0 other; ggccacgttg atttgattat atggatgaca tccgcgctcg caccaatttt cggccatttc 60 caaaaaaaaa aaaaagtttt cgaccacaca ataaattgtg caaagcagct gtaaaatgag 120 cacaaatatt ccgtagattg aaaaaaaagt atcagaaaac agataactaa tgccatagaa 180 tgccaaaatg aatctaaagt caaagaataa gatgtgaaag gacagaaaga aaaaaaatct 240 attgttggtt gggggaggta ccagtacaga aaattcaggt ccagaggatc atttccaggt 300 ccggacatga acctggacct gattgtctgt ggaaacttga gaactggcaa tactcaaaac 360 aatggtaatt ggtcaatttt gctacaaagg aatctgtttg gtggattatt ggactcccac 420 tggcatttta aaatcccatc tccatgtaat aaaggctaac tgtctctgta cctttgactg 480 tagaaacttg gtgcaattga ctataaactc tacttgactt cgctcttgtt gtcttcttgg 540 ccccccggtt agaccccaaa tggctgccga catagtgtgt ccattttgta attggcgaaa 600 cctgttcctc tgtgtttttt tttcaggtct gtttttttcg gattggtcca agaagaaaaa 660 aatgttttgc acccgtatga tgtactggtc cctacaccta actgttggta taccatacta 720 tgtgtgttta gtcctatgtt caaccttcct ggccactttg atttcatact gaaggcaact 780 tttttttttt ggccaaactt tttttttatt cgctcgctcg catcagtttt ggagttcccg 840 gaggatgtca tccatataat caaatcaaca tggcc 875 // ID BEL-158_AA-I repbase; DNA; INV; 2323 BP. XX AC supercont1.323; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-158_AA_; KW BEL-158_AA-LTR; BEL-158_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2323 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.323; Positions 433387 431065. XX CC 'ATAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 343..1986 FT /product="BEL-158_AA-I_1p" FT /translation="MKHLWQATSATVADQSLISVGIQQRTHSAFPKVILKE FT QNLIVNTNERSDMEPDEHQFQSSASLTPLPSFSKERIVAVEVNPNTNTGLA FT KGDANISHRSYRITNQRVQIVAMLINYHAPSMWLRERVDSSIMSTQKTSQP FT MEGEMNEYHIANSESKCVRILLDPDTGQLTETIRVSIFTQWRTKFVNLTLY FT CSTKLDGQGTRIYSAVNFETTISNKAKWKEIKPQIFKPCDVAQPLIVIPQS FT IIDIALHVLHYNTDAGLQSQTKLFCNLMCLLTDIYQRFEFYQLSSLKSDSV FT NIQHQRFPRRFAQKADRSRLSTRKRAYSRMRDHGERVSFHDHEWKSICQQR FT LENRRENVRTDRKRSWTTMDKVHLEVNSTLSLVDISSKTIQQEELRNILCI FT KLTCKNWSYPNCNTKMPMPNWTCGAMPRYRAVRPIRKRDTNVASLSNVAHA FT VHARKPAMVRLASLSTIPMRSDEVVSWVPIVGVQTTLMISEAISCQTLSSK FT WTWLNVTVLRILWVTLANKSGDEINCKMKPDSMDSNHGLDLTNSRQFSVIL FT V" XX SQ Sequence 2323 BP; 745 A; 487 C; 524 G; 567 T; 0 other; aactttaaaa ctttcgtggt tatggactgt acaccgacat cagcgggagg ctgcgtcatc 60 caatcgcgcc ccgaatcaca tcgtcttcct atatcccgaa gcaaaacatc tcgtattgcc 120 ctacgactgc agcaactgga tgaaatgcgc gcaattgagc tgcgccagat agaagctgag 180 aaacgggcac ttgtattaga gcataatgcc atgcaagcaa aatacactct tctgctagaa 240 caaagcaaga aaaaatctcc tcttcacaat agaggtgatg taaacaaacc ttcaactgac 300 cctagttcag tacaaaccga aaaacgagag tgagtgaaac caatgaaaca tttgtggcag 360 gcgacatcgg caacagtagc cgatcaatcc ctaatatctg tcgggattca gcagcggact 420 cacagcgcgt ttcccaaagt cattctaaaa gagcagaacc taatcgtcaa caccaatgaa 480 agatcggaca tggaaccgga tgaacatcaa ttccaatcct ctgcgtcact tacgccgtta 540 ccaagcttct ccaaggagcg tattgtggca gtcgaagtaa atccgaatac gaatactgga 600 ttggcaaaag gagatgccaa catttctcat cgatcgtatc gtatcaccaa ccagcgggta 660 cagatagtcg caatgctgat caattatcat gcgccttcaa tgtggctacg tgagcgcgta 720 gattcatcta taatgagtac gcagaagaca tcccagccta tggaaggaga aatgaacgag 780 taccatatag caaatagcga gagcaaatgt gttcgtattc ttttagatcc ggacactggc 840 cagctgactg agactatccg agtttccatt ttcacgcagt ggagaacgaa attcgtcaat 900 ttaacgttat attgttcgac caaacttgac gggcaaggta cacgtattta ctcggcggta 960 aatttcgaaa ctacaatcag taacaaagca aagtggaagg aaattaagcc tcaaatcttc 1020 aaaccgtgtg atgtcgctca gcctctcatc gttattcctc agtcgattat cgacattgcg 1080 ttgcacgttc tccactacaa cacagatgca ggacttcaat cgcaaacaaa gttgttttgc 1140 aacttaatgt gcctgttgac ggatatctat cagcgtttcg aattctatca actatcgtca 1200 ttgaagtcag atagtgtcaa tattcaacat caacgattcc ctaggagatt tgcacaaaaa 1260 gcggatcgtt cacgattgtc caccagaaag cgagcgtatt cgcgaatgcg tgatcatggc 1320 gaacgagtat cgtttcatga ccacgagtgg aagtccatct gccaacagag attggagaat 1380 cgaagggaga atgtgcggac cgacaggaaa cgaagttgga ctacaatgga taaggttcac 1440 ctcgaagtga acagcactct ttcattggtt gacatctcgt caaaaacaat tcaacaagag 1500 gagctgcgaa atatactctg cattaaattg acctgcaaga actggagcta tccaaattgc 1560 aacactaaaa tgccgatgcc caattggacg tgtggagcga tgcctcgcta tagggctgtg 1620 agaccgatca ggaaaagaga caccaatgtg gcaagtttat caaatgtagc gcacgctgtt 1680 catgcgcgta aaccagcgat ggtcaggttg gcgagtttgt caactatccc aatgcgttct 1740 gacgaagttg tttcgtgggt acccatagtg ggtgtccaaa ctactttgat gataagcgaa 1800 gcaataagtt gccaaacttt gagctcgaaa tggacttggc tcaatgtaac agttttgcgc 1860 atactttggg tgacacttgc taataaatca ggagacgaaa ttaattgcaa aatgaagccc 1920 gatagtatgg acagcaacca cggattggat ttgaccaatt cacgtcagtt ttcagtgata 1980 ctggtctgaa agattattaa aaatttgaaa cgtggcgtct ccggtggttt aggaagtgtt 2040 aacagagaaa atcgtaaaaa tcgttgtaac cactttaacc atggctctaa tgatatagga 2100 acaggctagt taggaggatt tcaggtgtta agtcagaaat gaacattaaa ttagagacta 2160 gtatcaaatt tgtaaacgaa ggtcgaaaag atggtcagcg aatgtgaaat ggatcacaat 2220 tcagttcgaa taggaagaaa gtataacatt gcggtaactt tgaaaatatt taaaaagaaa 2280 aatggtaaaa ctgatccgat ggtcagtctt acggggggga gta 2323 // ID BEL-41_AA-LTR repbase; DNA; INV; 523 BP. XX AC supercont1.245; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-41_AA_; KW BEL-41_AA-I; BEL-41_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-523 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.245; Positions 454696 455218. XX SQ Sequence 523 BP; 181 A; 96 C; 95 G; 151 T; 0 other; tgttgctgcc agcactgctc cgtcgtgtaa ccacaactga atagtctctc acccgtagac 60 aaaggccaga gacgaaagaa tgatcattat caatttgtga tgtggaacgt ggtagaagta 120 catgcaagtg cgtgatgtga aattggttta gttaaactca tgaattaacc ttttctaaaa 180 tctatttaca gttgaactta ttcctaaatt tgtaaagcta aatttatccc taaaaattcg 240 aatcacagag ttattatcac agaagtaagt tccatttgta caatcagaac taaaactaac 300 cttaaaatta atatcacaga tatacgtaga cggaatctac ggcaggaaga aaagtagagt 360 ttcctgagaa aaggctcacc gaaatttgta agttaggttt acatcttaaa aaaccctatc 420 taatttggaa taaaacttct agcttaaagc tttacacgca gcacaaactc gagtttgcgt 480 tctgctgaag acgtcggaaa gagcctcttt tgttctagga aca 523 // ID Mariner-19_HM repbase; DNA; INV; 3670 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 2) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-19_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3670 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1953-1953 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1379..2500,2463..3137) FT /product="Mariner-19_HM_1p" FT /translation="MKIFSVTEEQELVQYCLVSAKMGYGLSPIKLRALTFE FT YALKLEKKCPHSRRGSAKPWETNKRAGVDWFRAFLSRHPELVIRKPEATSI FT GRTSAFNKYNVNLFYNNLQSLLIKYKFSPNDIWNVDETGVTTVQVPEXVLA FT KRGERQVASVTSAERGILVTMCNAVNASGSSITPFYIFPRVHFKDFFLRNS FT IPGSVGTANKSGWMMESTFMEWFNHFIKSVRPSKTNPVLLILDNHESHMSI FT NFIDLASDNGVIVLTIPPHTSHKLQPLDITVYGPFKRYYNREIDSWLVSHP FT GKTVSIYDIAEISGKAWAKVSMPVNIISGFSVSGISPFQPDRWKDEDFFLS FT QVTNRPNPETLNINPVSLEIIRPNPEALNIKKLLDLILKHLTSNRPNPEAL FT NINPVSVEVIRPNPEALNIYPVSVEINRNVKVNKNVSNTLNVESIRPYPSA FT KARLEKTKGARKKLSSSILTDTPIKEALRNQQLQQNLKKQNFSEKQKDAKN FT KRQRKSRKKSSNSKTLKFPLMTYNNDEADDILVERNDNGVKDSLNFCGGIT FT SNYDLCAICEEMGRDNEVWYRCRQCASWAHKACTSADNASQYVCDYCNLII FT *" XX SQ Sequence 3670 BP; 1300 A; 522 C; 587 G; 1259 T; 2 other; ggggagagtg gtactgaatg gaccttgtac tgaatggacc attagaaaaa taaaaactta 60 acaagctgtt aattgatttt ttttaattgc atataatttt tgtatactca aagtttcttt 120 gatatttttg gtttaacgtt attcgtaata atttatcttt tcaatttgtt tcaatttaca 180 ctagattttt aaggtaaagt taactatggt taataataat attatagata catatttttt 240 actttgtata acattagaat gaatgttata tgatttactg attgctgtga ttatatgggt 300 aacatgtaca ttttgattag tgtactggat gaaccaccat tgattgtact gaatggaccg 360 gttcattttt ctgcgaagtt atttatcgcg cttcaktatt ttacttacta aaacaaaata 420 cacaagtttt ttcatctaaa tgaactaagg agatgtgagt taggaaataa accaaaatat 480 ctagttaacg aagacttaag gcgcaagttc aattattcat tattcaatat aatattaagg 540 taagcagata attaagttct aatagcttaa tattaggtta ataattatta atttgtagtt 600 tgtttcatta tttcaataac tgaatgtcat aatattatta cgatatataa ataaggtggt 660 attactttta ataacatcta gcatataaca tttgttatat atactatata taataaaagt 720 tatatgctag atatgttata tggcatataa tatgctagat aagctatatg ctagatatta 780 tataatatat ctgaaatgtt atttatacgt ttttaaaaac aatgacaatt accatggtca 840 gtaattttta caatacaaaa aatcaagtaa tatctaataa aaccaagaat aaaatttcgt 900 aaattaattg ttatatttaa attactaaag cttgttgatc ataataaaaa attattctac 960 attcaagctt ttatataatt tagtttttat acatgttata tgtgtgtgtg tgtgtttaga 1020 taattaaata tatatatata tatatatata tatatatata tatatatata tatatatata 1080 tatatatata tatatatata tataaaatag catcgataat ttttttttat aatatggtta 1140 taaacaagtt ttatttattt cagaagtatt atgagacatc gaaaagcaac agcactaaaa 1200 ggtgattggg atcccaataa tatgatccaa gcagttgaaa atgttttaaa taaaactcta 1260 tctgaaagaa aagctgttga gagtcaaacg ttcaactctt aagaggcgaa taaaagaagc 1320 tcgactatct caaaatggat tgaataaatt gcaatgtgtt ccatatagta acagttctat 1380 gaaaattttt tcagtaacag aagaacaaga gctggttcaa tattgtttag tttctgcaaa 1440 aatgggatat gggttaagtc ctattaagct aagagcacta acatttgaat atgctctcaa 1500 acttgagaag aagtgtccac attctcgtcg aggttctgct aaaccttggg aaacaaataa 1560 acgtgcaggt gtagattggt ttagagcttt tttaagtcgt catccagaac ttgttataag 1620 aaaaccagag gcaacatcaa ttggtcgaac gtcagcattc aacaagtata atgtaaatct 1680 attttataat aatctgcaaa gtcttttgat taaatacaag ttttcaccaa atgatatatg 1740 gaacgttgat gagactggtg ttactactgt acaagtgcca gagmatgttt tagccaagag 1800 aggagagaga caagttgctt ctgttacaag tgcagaacga ggtatcttgg taacaatgtg 1860 caatgcagta aatgcaagtg gatcctcaat aacaccattc tacatatttc cacgtgttca 1920 ttttaaagat ttttttctgc gaaattcaat tcctggttct gttggcactg ccaacaaatc 1980 tgggtggatg atggaatcaa cctttatgga gtggtttaat cattttataa aaagtgttcg 2040 accatctaag acaaaccctg tgctgttgat actagataat catgagtctc atatgtcaat 2100 taacttcatt gacttagcat cagataatgg cgtaattgtt ttaacaatcc caccccatac 2160 atcacacaaa ctgcaaccat tagatataac agtgtatgga cctttcaaac gttattataa 2220 tcgtgaaatt gatagctggt tggtatcaca tccaggaaaa actgtttcaa tctatgacat 2280 tgcagaaatt tctggaaagg cttgggcaaa agtatcaatg ccggttaaca tcattagtgg 2340 tttttcagta tctggaataa gtccttttca acctgatcgg tggaaggatg aggatttttt 2400 tttatctcag gttacaaata gacccaatcc tgaaacactc aacatcaatc cagtgtcatt 2460 agaaattatt agacctaatc ctgaagcact taacatcaaa tagacctaat cctgaagcac 2520 tcaatatcaa tccagtttca gtagaagtta ttagacctaa tcctgaagca ctcaacatct 2580 atccagtttc agtagaaata aatagaaatg ttaaagtcaa caaaaatgtt agcaataccc 2640 tgaatgtgga aagtatacgc ccttatcctt ctgcaaaagc tcgtttagaa aaaaccaaag 2700 gtgctcgcaa aaagctctcc tcttcgattc taactgatac tccaattaaa gaggctcttc 2760 gaaatcaaca acttcagcaa aatttaaaaa aacaaaactt ttctgaaaaa caaaaagatg 2820 ctaaaaataa acgtcaaagg aaatcaagaa agaaatcaag caactcaaaa actctaaagt 2880 ttccattaat gacatataat aatgatgagg ctgatgatat tcttgttgaa agaaatgata 2940 atggtgtcaa agatagttta aatttttgtg gaggtattac atcaaattac gatttgtgtg 3000 ctatatgtga agaaatgggt cgtgacaacg aagtgtggta ccgttgcaga cagtgtgcga 3060 gctgggcaca taaagcttgc acaagtgcag ataatgccag ccaatatgtt tgtgattatt 3120 gtaatttgat tatctgattg agacttcggt ctaaagttgt attaattaat tcagaaatat 3180 caatgtctca acatacttct ttcttgtttt taaactccat ttatcaccct aattgctttg 3240 cataatgtaa cttgttatgc gcaattgttt acaactttta ttataatata caatatacaa 3300 ggatagttag ttaagatatt aatcttgtct cccgaattaa tatatcggct atagttatta 3360 gcgatagaga tattgatctt ttttataaaa aaaaaaactt tttttgtggt ccattcagta 3420 caagggttgg tccattcagt acaatgaaca gtactgaatg gaccactaaa taatttggta 3480 cgggtcacat ttatcgcaca ttttgcctat gttctgtttg cgcaatggtt tacaactttt 3540 aaggattgtt agttaaggta ttaattttga cttctgaatt aatatatccg ccatagttgt 3600 tgacaatatg ggaaaaaatt atttttacaa aaaagttgaa ttttgtggtc cattcagtac 3660 cactctcccc 3670 // ID DNA8-63_AP repbase; DNA; INV; 168 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-63_AP. XX NM DNA8-63_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-168 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1997-1997 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 168 BP; 45 A; 30 C; 47 G; 46 T; 0 other; cataggcgca aatagcgttt gagatttggg ggggctaata gggctaatag ggccttactt 60 tgataatatt gaataagctt aaaacaaaat gtaggaaatg tttgggaggg ctatggacat 120 ttttgggggg gcttagcccc taaagccccc cccctatttg cgcctatg 168 // ID Gypsy-96_AA-LTR repbase; DNA; INV; 146 BP. XX AC supercont1.312; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-96_AA_; KW Gypsy-96_AA-I; Gypsy-96_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-146 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.312; Positions 1190435 1190290. XX SQ Sequence 146 BP; 54 A; 23 C; 26 G; 43 T; 0 other; tgttacatgt ttaacataaa aatgtaatga ctttgttttg aaaaatggtt tggcaacact 60 gatcaaataa aaagcagttt tgaattcctg acgctgtgaa caagtcgtct atcaatccga 120 gctccgaagt aaaatagcat gaaaca 146 // ID Gypsy-8_DPu-I repbase; DNA; INV; 5134 BP. XX AC scaffold_126; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_DPu_; KW Gypsy-8_DPu-LTR; Gypsy-8_DPu-I. XX NM Gypsy-8_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5134 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 731-731 (2010). XX DR Genome; scaffold_126; Positions 190683 195816. XX CC 'AAAAG' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 763..2781 FT /product="Gypsy-8_DPu-I_1p" FT /translation="MPPKPFKPYGTPPHLDLEEFKDNFEIWHQQWKIFLSL FT STINTALPQGERPEYIANIFLSCLSNATLKAVLTMGLSATDLVDADVIIKK FT LQERCNAGRNCHVWRQQFASRVQRDSESVENWLCDLRDIARKCEFEKDCCA FT ACQNTRLLGQIVFGVSDDEVRRKLLELGAKLTLDKAINIIRTAEATRLQSS FT NMKQGATAPVNQIKSSTGKRLNDKQAPPPKSQYRGKPSVRWHPPGCHPYGC FT WNCGSASRHAKEECPAFGKECHSCHKSGHFQSVCTQSNSSPKPETAGSILI FT QSILHNDMVRLSVTPACSVEESFIQMLPDTGASIDAIPACLYRCQFKDIPL FT SSHGPKAITATGAPIVSLGQFPASFVWAASSGGPVVTSIHVLQDLQQPVIS FT KATQKKLGILPAQYPHACVLAAITTSPSDVPKLPDIQSTLRELMKEVPSIF FT DGVCRPMRSAPCHFQLKDDAVPSSIRGSRPISVPLMPKVKRELDSLENQQV FT IAKVSEPTAWVHPIVIVPKADDEIRICGDFTSLNRCIIRPVFEAPTPFQAV FT RTIPPGMKFFTVIDALKGYHQVELDDESSLMTTISTPYGRYKYLRLPFGVS FT LAGDDYGRRLADIFDDFPNCRRVVEDVLVFSATWDEHVSLVRRLFHLAAEH FT QIAINVKKNCFCSTLRVVWRLRRQ" FT CDS 2858..4861 FT /product="Gypsy-8_DPu-I_2p" FT /translation="MRAFHGLCQQVGNFSDELATFLHPLAPLLRKDFVWEW FT TQQHETDFQAARAALSSSSVSKLAFYNPAHPTSLHVDASRLRGLGFILRQQ FT QTNGSWNVVQAGSRFLSDAESRYAMIELECLAAAWAMRKCRQFLEGLPSFN FT LFTDHRPLIPILNDYFLDKLDNPRILRLRLSMQRYSFVASWVPGRNNVMAD FT ALSRSPVDQPSPTDEIAEGPHSASEHVHLMDVMEGSSSNNPDILLSSVSAA FT SAVDPVLISLRHTILQGFPNDKCNLPLDLRPFWQVRSQLYIDGDLILVGPR FT VVIPVSLRQEVLQRLLQMHQGATKIRQRARQSVYWPSIDNDIVMAVKSCPT FT CSELLPSNPSEPLFPHEPVSRPFEFIFADLGTFRGRDFFIVADQFSGWPQV FT YPFPDTNTSTRRIIDALRSFFTCGAGAPIKLWSDGGPQFKSDEYLTFLREW FT NISHGRSSPHHPKSNGYAEAAVKSMKKLIAGSWTSGSFDLDKFGKGLLLFR FT NAPIAGGASPSQVVFDRPTRDLIPAHRRSFAPEWQKAAGILEKRALRAKEL FT RTFHYNRTTRPLPALRVGDNVVIQHHRSKRWTTPGVIVEVGAFRDYLVKTP FT AGRLFRRNRRFLRLHSPTAVPHSPSPAPNSVHPPVSSSSPTTAAPPVPAAP FT ADTSPAGPRRSTRLAAKKP" XX SQ Sequence 5134 BP; 1022 A; 1504 C; 1110 G; 1498 T; 0 other; tggcgcagtt gatatattct tagtgaattc ccgaacttat tccctgactc tcattcagtt 60 ccggtagtgt gtgactgtga ggacattttc aactcgtttc ttgtgaattt cccccctggc 120 tttgtcccgg cgtggtcttg tggaggccgc cattttggtt ttgagtgtgt ttctccctta 180 caccgatcca ttttctatcg ttctttatcg ccggcatccc ttgccatagt ttcatcctcg 240 ttcgaattca ttgatgtcat tttcccccat tttttttgcg ttttcgtgca tctattgtcc 300 gttaaattca caccccattt cattcttgaa gagggcccac atgttgcgtg cccgccattc 360 tttctttcac cgtcaacgtt cggcacattt gtttcctttt cgtcctttca ttttcacatt 420 cacatctttc attttttttt gtttttttct ttcgcccatc agtcggatac cctacatttc 480 cgtcgatttc attttcaacg tttcccttcc gtcatttcat tttcacattt caatatttca 540 ttttttttat gttttctctc gcccaccagt catatacccc tcgccgtact agccactcat 600 tttcattcaa ttctttcatt cagccccgtt caaatacttc attcaattca cttcactttg 660 ttcattcaat tcaattcgct tagttcattc tttttcatcc ggtctccctc gtttgtcatt 720 gcccccccat cttgttggtg attcatttcc cgtggtgtcg cgatgcctcc caaaccattc 780 aaaccgtatg gcacgccgcc ccatcttgac ttagaagaat tcaaagacaa cttcgagata 840 tggcatcagc agtggaagat atttctctcc ctctccacca tcaacacggc gctgcctcag 900 ggtgaacgcc cggaatacat tgcaaacatt ttcctgtcct gtctctcgaa tgcgacactg 960 aaggcggtgt tgacaatggg tctatccgcg accgacttgg tggatgctga cgtcattatc 1020 aagaaactgc aggagcggtg caacgctggc cgaaattgtc atgtttggcg ccagcagttt 1080 gcgtctcgtg ttcagcgtga ttccgagtca gtcgaaaatt ggctctgcga ccttcgagat 1140 attgctcgca agtgcgagtt cgaaaaggat tgctgtgctg cctgccagaa cacacgtctg 1200 ttgggacaaa tcgttttcgg tgtttctgac gacgaagtgc gtcgcaagct tttggagctc 1260 ggcgctaagc tgacattgga taaagccatc aacattatcc gcacggcaga ggctacgcgt 1320 ctacagtcgt ccaacatgaa gcagggcgcc acagcaccgg tgaatcaaat caagtcttcc 1380 acaggaaagc gcctcaacga caaacaggcc ccgcctccga aaagtcagta ccgcggtaaa 1440 ccatctgttc gttggcatcc gcccggttgt catccgtatg gttgctggaa ctgcggatca 1500 gcttcacgtc atgctaaaga ggagtgccct gcgttcggga aggaatgtca ttcgtgccac 1560 aagtcgggtc atttccagtc cgtgtgcact caaagtaatt cgtccccgaa acctgagacg 1620 gccggcagta ttttgattca gtccatcctt cacaacgaca tggtccggct cagtgtcacc 1680 cctgcctgca gcgttgaaga gtcgttcatt cagatgcttc cggatactgg agcgtccatc 1740 gatgccattc cggcttgttt ataccggtgt cagttcaaag acattccgtt atcttctcat 1800 gggcccaagg caatcactgc aacaggagct ccaatagtgt cactgggcca gtttccggca 1860 tcgtttgttt gggccgccag ttccggtggt ccggtcgtca cttcaatcca cgttcttcaa 1920 gatctacagc aaccggtcat ctcgaaagcg acgcagaaaa aacttggaat tctccccgcc 1980 caataccctc acgcttgtgt attggcggct attaccactt ccccatcgga tgttccaaag 2040 ttgccagata tccaatcgac attgagggag ttgatgaagg aggtgccttc aatattcgac 2100 ggtgtttgcc ggcccatgcg cagcgcaccc tgccacttcc agctcaagga cgatgctgtc 2160 ccctcatcca tccgtggttc gcgccctatt tccgttccgc tgatgccgaa ggtcaaacgc 2220 gagttagatt cattggagaa ccagcaggtc atcgccaaag tatctgaacc gacggcgtgg 2280 gtccatccca tagtgattgt tccaaaagcg gacgacgaaa tcaggatctg cggtgatttc 2340 acttctctca accgatgcat catccgtccg gttttcgagg ccccgacacc ctttcaggcg 2400 gtccgcacga tccctccggg aatgaaattc ttcactgtca tcgacgctct caaaggctat 2460 catcaagtgg aacttgacga cgagtccagc ctcatgacga ccatttcaac tccgtatggc 2520 cgctacaaat accttcgcct gccttttggt gtttctctcg ccggtgacga ctacggtcgg 2580 cgtcttgccg atatttttga tgatttcccg aactgccggc gtgtagtcga agacgttctg 2640 gtcttttctg ccacttggga cgaacacgtt agtctcgtac ggcgtttatt tcatctcgcg 2700 gcggaacatc aaatagccat caacgtgaaa aaaaattgtt tttgctcaac cctccgtgtt 2760 gtttggcggc tacgtcgtca gtgagagtgg tttcctgccc aatccggatc ttctcaaggc 2820 cattcgtgag ttcccgaagc caacatgcgt ttcagagatg cgggcctttc acggcctctg 2880 tcaacaagtt ggtaactttt cagacgaatt ggcaaccttt cttcaccccc tggctcccct 2940 cttacgaaag gattttgtat gggaatggac tcaacaacac gaaacggact ttcaggctgc 3000 ccgggccgcc ttatcttcgt cgtccgtttc caaattggcg ttttacaacc cggctcaccc 3060 gacgtccctc catgttgatg cttctcgtct gcgcggatta ggcttcatcc tccgtcagca 3120 gcagaccaat ggcagttgga atgtcgtcca agctggatcc cgcttcctgt cggatgccga 3180 gtcccgctac gccatgattg aactggagtg tttggctgcg gcatgggcga tgcggaaatg 3240 ccggcaattt cttgagggac ttccatcgtt taatctcttt acggatcacc gccccttgat 3300 ccccattcta aacgattact ttctagacaa gctagacaat ccccggattt tgcgccttcg 3360 cctttccatg cagcgatatt ctttcgtcgc gtcgtgggtt cctggaagaa ataacgtaat 3420 ggccgacgcg ttatcccgtt ctcctgttga tcagccttct ccaaccgatg aaattgcaga 3480 aggcccccat tctgcttccg agcatgttca cctgatggat gtgatggaag gttcaagctc 3540 caacaatcct gatattttgc tatcatcagt atctgctgca tcggccgtcg atcctgtgct 3600 gatttctctc cgtcacacca ttctacaagg ttttcctaac gataaatgca accttccttt 3660 agatctccgt ccgttctggc aggtccgcag tcaactttat attgatgggg acctcatcct 3720 tgtcggccct cgtgttgtca tcccggtttc cttacgtcaa gaagttctcc agcggctcct 3780 tcagatgcac cagggagcaa cgaagattcg tcagcgtgcc cgtcaatcgg tgtattggcc 3840 gtccatagac aatgacatcg ttatggccgt caagtcttgc ccaacatgct ccgagcttct 3900 cccttccaac ccgtcggagc ccctttttcc acacgaaccg gtatcccgtc cgttcgagtt 3960 tatctttgcc gaccttggca cttttcgcgg tcgcgatttt tttattgttg ccgaccagtt 4020 cagtggttgg ccccaggtgt acccctttcc agacaccaac acatcgacac gccgaatcat 4080 cgacgccctc cgctccttct tcacttgcgg agcaggcgct cctatcaagc tttggtcaga 4140 cggtggcccg cagttcaagt cggatgaata tctaactttc cttcgagaat ggaacatctc 4200 tcacggccgg tcttctcctc atcatcccaa gtctaacggt tatgcggaag ctgccgtgaa 4260 atcaatgaaa aaactcattg ctggatcctg gacctcaggt tcctttgacc tggacaagtt 4320 tggaaagggc ctcctcctct tccgtaatgc tcccattgct ggaggcgcat ccccttctca 4380 ggtggtcttt gaccgaccca ctcgtgacct aatcccggct catcgtcgtt cgtttgcgcc 4440 tgaatggcag aaggctgccg gaattctcga aaaacgcgca ctccgcgcca aagaactccg 4500 tacattccat tacaaccgca ccacacgccc acttcctgcc cttcgcgtcg gtgataacgt 4560 cgtcattcag catcaccgtt ccaaacgctg gacaactccg ggagtcatcg tggaagttgg 4620 cgcattcagg gactacttgg taaagacccc cgctggtcgg cttttccgtc gtaatcgacg 4680 ttttcttcgt ttgcattccc ctacggccgt gcctcatagc ccgtctccgg cgccaaatag 4740 tgtccatcct ccggtttcat cttcttctcc gacgacagca gcccctcctg ttcccgctgc 4800 tcccgctgac acatcaccgg ccggtcctcg tcgctcgacg cgtctcgcgg ccaagaaacc 4860 gtaattccca tacgctggtg aactagtgtt cttaacggaa tattgtttca ttcattcatt 4920 tcgtgttcat tagtcccctg ctttggctca ttattcgtcc attcattttt ttattgttcc 4980 tttctctttc catcacttcc ccttcagccg tcttctgctt ttgtgtcatc acatgtttct 5040 tttttggttt tcgtttcccc ccccccctga tcatctcttg taacttcatt gtttgtttgt 5100 ctgtttcgtg atttgtccct tggagaaaaa gaca 5134 // ID BEL-156_AA-I repbase; DNA; INV; 6614 BP. XX AC AAGE02017627; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-156_AA_; KW BEL-156_AA-LTR; BEL-156_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6614 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017627; Positions 7225 612. XX CC Positions [4341-4922] - Integrase core CC 'GGGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 91..3528 FT /product="BEL-156_AA-I_1p" FT /translation="MSKLVKNPNGHCKLCTNKDSMDNMVECDDCDRWYHLS FT CAKLDRSPTAEESWICEHCADIQRKWYKQGAAAIEKKIATQKQTENKSPNS FT NEQILQELTNAITSAFKQVKVEPEERTPVNQANDWTIYLKRQALISLPKFN FT GSSKEWPKFEKIFNDTTQEGNFSPLENLNRLQQSLEGIAARCVSQLMMDAG FT NIPQIMERLKLNFGRPEIIYNELVTELSKIRKENRMAVVDISEALGNLVSN FT LELINFQDYLRDPRLVNETVQKLPYNLQVKWAEERQANGPVATLKELYTFL FT TPHAKISRIMQNAATPAPRKTINIHHDIRQQDRQHDNHSRGPQCGNCSGHR FT ITDCQLFRDMDTKKRNEVVRTNKLCIGCLSPFHFFKNCNRSRRCGLNGCNK FT NHHSWLHDKQETRKPPTVPPTEKKDNDQKPLTTTTQNDEKLVGHHRIKRSK FT ILYQIIPVTLKNGSKSVDTYAFLDAGSSATLVDRSIVDTLELEGRNDPLHL FT TWTQDISKDEDNSMNVELQISGSSKKLYTIKNVRTTKNLQLPRQTVDFEKL FT KQKFSYFKDLPITSFTNERPMILIGIEHSHLLIPLERRMRKWNEPIAVKSK FT LGWFLFGNISASVQHEYAMIHHEEEMTKLMKSYFSTEDFGVKISENSLKSE FT EEKRVVNIMEKTVKYTDGRYEVGLLWKDDDIKFPESYGNAYKRLIMMEKKL FT EKDPQLKEWALKTFKEYQTKGYLRKLTSEEVKKTTERTYYLPHFVVVNKNK FT AKPRLVFDAAAKIADKSFNSELLSGPDINEPIHGVALRFRENRIAVSGDIQ FT EMFHQTRITAADQDSQQILWRDCNTDKPAEVYVLQVMSFGATCSPFCAQYV FT KNKNAENFAEMYPEAAVAVKKQHFVDDYLDSFSNSEDAKKITNQVIEIHQH FT AGYYIRNFLSNDKELMKAIPQDRRHKLFTKPMEDKDSHAEKVLGIYWNTEN FT DCLSYLIKWNIYGEKLLQNMLPTKREVLAYIMSIYDPMGLISNLSIHGKIL FT MQELHKATLNWDDEIPNHLHDHWNTWVTTIKEAENLEIPRCLTNGTVDEVE FT LHTFVDASDKAFAAAVYCKSTNSNGTYITLVAGKARVAPIKGLSIPRLELQ FT AAILGTRLTNQIRKDLRLKVSKITYWSDSQTVLSWIKR" FT CDS 3516..6026 FT /product="BEL-156_AA-I_2p" FT /translation="MDKKIDIKFQPFVAHRIGEILDSTAMDQWRWVPSEHN FT PADAATKLNGNKNIWLNGPEFLKTEEVNWPNKSKEQIHVFVIETNKKTGND FT YDEFRLVQENKYDKWWRLVNSVCIIKKFVEWKEDRNNFTKDVTLDTRNAAE FT NIIYKKAQWQGFPEEMEDLHLKGSVSIKSHIRCLKPYLDEFGVMRSNGRLE FT KATNLSKNLKFPVILPQKNVITKMIVRSTHQRYLHQGENAIIAALQQKYWI FT INIRAVVRNEKKCCQKCIIDSAKPEAPMMAALPDFRIKPYVHPFTHCGVDY FT FGPFEVVVKRSTEKRWGVIFTCMSSRAVHIEMAEKLDTDSFIVCLRTFQNR FT RGKITHIYSDNGTNFVGVNNIMAQLVSDINTRMGRSEAAKMEIKWTFNPPA FT APHFGGVWERLIRIIKSALQVLLKSWGERTPRCETLRAAFVQAEFFLNARP FT LTHIPVDNIDDEVLTPFHALIGCSGEYVPPYFTEVNNFQEQCKQVQHAGQQ FT FWNRWKNEYIPTLAKRNKWTNKIEPIKVDDIVIITDDEATPNTWLKGRVIK FT VHTASDGQVRSVEIKTKNGVKKRPSVKVAALDVSGSREKEMDKKFQQSINR FT KRETILHFSDNSSSKSTLDRSFPNWGKRRTIDEEEAKAIRDKTPQNKRVKI FT NSSNSLGKIMLATTTLLSIANAINIKPIEEDGLIFDHEGTCLLKRGSWKTN FT IYTGISPAQDINFIDHVHQNLTIALKEMERQTKDQTIHQLATSIGHQCDDA FT IQEIQQITRKRRSKGIFGFLKDIFFGGDDIDEAVEAMRIQEDNKIHHVSET FT VQKQSELMRKPDCVAASLQHMKNRNYQANIPNSSN" XX SQ Sequence 6614 BP; 2463 A; 1237 C; 1321 G; 1593 T; 0 other; tttggtggct ccagagagga aggtataaga gttagccatt ttttaacgaa taagattcgc 60 atacggattt actcgtattt acgtatcaaa atgtcgaagc tggtaaagaa cccaaacggg 120 cactgtaagt tgtgcaccaa caaggacagt atggacaaca tggtggaatg tgacgattgt 180 gatcgctggt atcacctctc ctgcgccaaa ctggatcgat cacctacggc agaggaatcc 240 tggatatgcg agcactgtgc cgatatccaa aggaaatggt ataagcaagg cgctgctgcc 300 atcgagaaga aaattgccac tcaaaaacaa accgagaaca agagccccaa ttccaacgag 360 caaatcctac aagaattgac caatgcaatc accagtgcat tcaaacaggt caaagtagaa 420 ccagaagaac gcactccagt caaccaggcc aatgattgga ccatttattt gaaaagacag 480 gctttgataa gcctgcctaa gtttaatggt tcatcaaagg aatggcctaa gttcgaaaaa 540 atctttaatg ataccaccca ggaaggaaat ttcagcccat tggaaaattt aaatagactc 600 caacagtcgc ttgaaggaat tgcagcaaga tgcgtaagcc aattaatgat ggatgctggg 660 aatatcccac aaataatgga gcgattaaag ctgaattttg gacgtccgga aataatctac 720 aacgaacttg tgacagagtt gtccaagatt cgaaaggaga accgaatggc tgtggtggat 780 atttcagagg cactcggtaa tctggttagc aacctggagc taatcaattt tcaagattac 840 ctacgagacc ccagactcgt caatgaaacg gttcaaaagt taccgtataa tctccaagta 900 aagtgggccg aagaaaggca agcaaatggc ccagtcgcaa cattaaagga gttatacaca 960 ttcctaactc cacatgcaaa gataagtcgc ataatgcaga atgccgcaac tccagcccca 1020 cgaaagacga ttaatataca tcatgatata cgtcaacaag accgtcaaca tgataatcac 1080 tcaagaggtc cacaatgtgg aaattgctca ggtcatagga ttactgactg ccaattattc 1140 agagacatgg atactaaaaa gcgcaatgaa gttgtgagaa caaataagct gtgcatagga 1200 tgtctcagcc catttcactt cttcaagaat tgtaatcgtt cacgcaggtg tggtttaaat 1260 ggttgcaata aaaatcatca ttcgtggctc cacgataaac aagagaccag gaagccacct 1320 actgtgccac ccaccgagaa aaaggataat gatcaaaagc cactaacgac gactactcaa 1380 aatgatgaaa aattagttgg tcatcatcga attaaacgca gcaaaattct ttaccaaata 1440 attcctgtga ccctaaagaa tggatcaaaa tctgtggata cgtatgcttt tctcgacgca 1500 ggatcgtctg ccacattagt cgatagaagc atcgtcgata ctttggaatt ggaaggaaga 1560 aatgatcctc tacatctcac gtggactcaa gatataagta aagatgagga taatagtatg 1620 aacgtggaac ttcaaataag tggatcatca aaaaagttgt atactatcaa gaacgtaaga 1680 acgacgaaga acttgcagtt gccaagacaa acagttgatt tcgaaaaatt gaagcaaaaa 1740 ttttcttatt ttaaagattt gccaatcaca agttttacaa acgagcgtcc aatgattctt 1800 attgggattg aacacagtca tctgttgatt cctttagaaa gacgaatgag gaaatggaac 1860 gaaccgattg ctgtcaagtc taaattagga tggttcttgt ttggcaatat atctgcatcc 1920 gttcagcacg agtatgcaat gatacaccac gaagaagaaa tgacaaaact aatgaaatca 1980 tatttttcaa ccgaagattt tggagttaag atatctgaga attctctaaa atccgaagag 2040 gaaaaacgag ttgtcaacat catggaaaaa accgttaaat acactgatgg cagatacgag 2100 gttggtttgc tgtggaaaga tgacgatata aaattcccag aaagctacgg caacgcgtac 2160 aagcgtttga taatgatgga gaaaaaactg gagaaggatc ctcagcttaa agagtgggcg 2220 ttgaaaacat tcaaagaata tcaaacgaag ggatatctac gaaaactaac atctgaagag 2280 gtaaaaaaga caacagaacg tacttactat ctaccgcact tcgtagttgt caacaaaaat 2340 aaggcgaaac cacgattagt tttcgatgca gcagcaaaaa tcgctgacaa atctttcaat 2400 tcagagctac tgtcaggacc tgatatcaat gagccgatac acggagtagc attgcgtttt 2460 cgagaaaaca ggatagcagt tagcggtgat atacaggaga tgtttcacca aacaaggata 2520 accgcagcgg atcaagattc tcaacaaatt ttatggagag actgcaacac tgataaacct 2580 gcggaagtgt atgtgctgca agtaatgtca tttggagcaa cgtgttctcc attctgcgct 2640 caatatgtga agaacaaaaa tgctgaaaat ttcgcggaaa tgtatccaga agctgctgtt 2700 gctgtgaaaa aacaacactt tgtagatgac tatttagaca gtttctcaaa ctcggaagac 2760 gccaagaaga ttactaatca agtaatcgaa atccatcaac atgcaggcta ttatattcgg 2820 aatttcttat cgaatgacaa ggagctaatg aaggccatac ctcaagatag acgccataaa 2880 ctatttacta aaccaatgga ggataaagac tcacatgctg agaaggttct tggtatctat 2940 tggaacacag aaaatgactg cctgtcgtac ctgataaagt ggaatatcta tggagaaaaa 3000 ctacttcaga atatgttacc aaccaagaga gaggtgttag catacataat gagcatctac 3060 gacccgatgg gactaatttc taacctatca attcacggaa aaatattaat gcaggagtta 3120 cataaagcca ctttgaattg ggatgatgag attccaaacc acctgcatga tcactggaac 3180 acgtgggtca caacgattaa agaagctgaa aacttagaaa ttccaaggtg cttaacaaat 3240 ggaactgttg acgaggtgga actgcatacc tttgtagacg catccgataa agcatttgca 3300 gcagcagtct attgcaaaag cacaaattcc aacggaacat acataacact tgtagccgga 3360 aaagccagag tcgctccaat caaaggatta tctataccac gacttgaact tcaagcagct 3420 atcctaggaa cgaggctaac gaatcaaata cgaaaagact tacgtctcaa ggtatcaaag 3480 ataacttact ggtcagactc acaaactgtc ttgtcatgga taaaaagata gatataaagt 3540 tccaaccatt cgttgcacac agaataggcg agattctgga ctccacagcg atggatcaat 3600 ggaggtgggt accatcagag cacaatccag ctgatgctgc aacaaagtta aatggaaaca 3660 agaacatatg gttaaatggc ccagaattcc taaaaactga agaagttaac tggccaaata 3720 aatctaaaga acagatacat gtttttgtga tagaaaccaa taaaaaaacg ggcaacgatt 3780 atgacgaatt tcgattggtc caagaaaata aatatgataa gtggtggaga ctggttaaca 3840 gtgtctgtat tatcaagaag ttcgtcgaat ggaaagagga cagaaacaat ttcacgaaag 3900 atgttacttt agacaccaga aacgcagctg aaaacataat atataagaaa gcccaatggc 3960 aaggattccc ggaagaaatg gaagatttac atctcaaagg ctctgtcagt ataaaaagtc 4020 atattagatg tcttaagccg tatttggatg aatttggtgt tatgcgatct aatggaagac 4080 tggaaaaagc gacgaatttg tcaaaaaatc tcaagtttcc agtgattctg ccccagaaaa 4140 acgtcatcac caaaatgata gtaagatcaa cacatcaacg ttacctacat caaggagaaa 4200 atgccataat tgctgctcta caacaaaaat attggataat aaacatacga gcagtagtga 4260 gaaacgagaa aaaatgctgc caaaaatgta taatcgactc tgcaaaaccc gaagcaccaa 4320 tgatggctgc tttgcctgat ttcagaatta aaccgtacgt tcacccattc acacattgtg 4380 gagttgatta cttcggtcca ttcgaggtag tagttaagag atcaaccgaa aagcggtggg 4440 gagtaatatt cacttgcatg tcatcaagag ctgtacatat cgaaatggcg gaaaaattgg 4500 atacagattc attcattgtt tgccttagaa ctttccaaaa tcgacgtgga aagattactc 4560 acatatacag tgataatgga acaaactttg ttggagtaaa caatataatg gctcaattgg 4620 tgtccgacat aaacacaaga atgggcagaa gtgaagcagc aaaaatggaa ataaaatgga 4680 catttaatcc cccggctgca ccacatttcg gaggagtatg ggaaagactg attaggatca 4740 tcaaatcagc tctacaagtt ttattaaaat cttggggtga aagaacaccc aggtgcgaaa 4800 cattaagagc agcgtttgta caagccgaat tcttccttaa cgcaagacct ttaacacaca 4860 tccctgtgga taacattgat gatgaggtat taactccgtt ccacgcatta ataggttgtt 4920 caggagaata tgttccacca tattttactg aagtcaacaa ttttcaagaa cagtgtaaac 4980 aagttcaaca tgctggacaa cagttttgga atcgctggaa gaatgaatac attccaacac 5040 tcgcaaaaag gaacaaatgg acaaacaaaa tagagccaat taaggtcgat gacatcgtga 5100 ttataactga cgatgaagca actccaaata catggttgaa aggacgagtc attaaagttc 5160 acactgcatc agatggtcaa gttcgatcag tcgaaatcaa aacaaagaat ggagttaaga 5220 aaaggccatc ggtaaaggta gcagccttag atgtatccgg ttcaagggag aaagaaatgg 5280 ataagaaatt tcaacaatcc atcaatagaa aaagggaaac gattctacat ttctcagaca 5340 actcttccag caaatcaact ttagatagat cattcccaaa ctggggaaaa cgcagaacta 5400 tcgatgaaga ggaagctaaa gctatacgag ataaaacacc tcaaaacaaa cgagttaaaa 5460 tcaattcatc caactcgtta ggaaaaatca tgctagccac aacaacattg ctatcgattg 5520 ctaatgctat caacatcaag ccaatagagg aagatggtct tatttttgat catgaaggaa 5580 cgtgtcttct gaaaagaggc tcctggaaaa caaacatata caccggtata tctccagcac 5640 aggacatcaa ttttattgac cacgtccatc agaatttaac aatcgcgtta aaagagatgg 5700 aaaggcagac gaaggatcaa acaattcatc agctggcgac atcaattggg catcaatgcg 5760 atgatgcaat ccaagaaatt caacaaatca cccgtaaacg cagaagcaag ggaatattcg 5820 gattccttaa agacatattc ttcggaggag acgacatcga cgaagcagta gaagcgatga 5880 ggattcagga agataacaaa attcatcatg tttcggaaac tgtacaaaag caatctgaat 5940 taatgcgaaa acctgattgt gtagccgcat cattacagca catgaagaac agaaattatc 6000 aagcaaatat tccaaactca tcaaactagc aaaggttaat actgtattat attatacaaa 6060 taatcccaag gagatggtaa tacattgtga caaaacagtt atatctccac catatgaagc 6120 tgctatagta actatggatc cagattgtaa gattaaatca aattcaagaa caatttatgc 6180 ttatttgaga aacgaatcca ggaaaacgct taccttcttc caacctagag ctacaatcaa 6240 tttgaataaa aatattaaaa cattgcaaga cattaagaaa gagaaaaaga aactagatac 6300 tggttacgaa tacatcgata atatagaaga accaattaat gatagttcaa catttcacta 6360 ctacaatcct tactcttgga tagcagttat aataatattg attgccatta ttatattatg 6420 tatcttcata tatttcaagt acattagatg taatgatcca gtagttcaac cgagctttca 6480 aatgaatgaa atttcaagta tacctgtaaa tcgaagaagt aaaatatcag atagacaaac 6540 tattcaagaa gagcagattt aaataatata caattaattt gtaaatcaat ggaatcgatt 6600 tatgagggtg ggaa 6614 // ID Copia-102_AA-I repbase; DNA; INV; 4123 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-102_AA_; KW Copia-102_AA-LTR; Ty1_copia_Ele94; Copia-102_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4123 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1472-1999] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 95..4123 FT /product="Copia-102_AA-I_1p" FT /translation="MERTGIAKLNNENYDSWKLEVEFLLVREGLWKYVSPG FT VKPEPAADGANAAEIAAWEEADQRARATIGLLLSRSQHGHIRNTTTAKAVW FT DNLKQQHEKKTLTSKVQLLKRICDLQYHEGDDIVEHLQEYEDLFEKLANAG FT TKLDNDLQVILVFRSLPSSFDALTTTLENRSEDDLTLELVKGKIVNEVLKR FT KEKTPAEGIAMKAEDKKKNIVCHHCQKAGHKKRFCKLLTKKNKEESARSQS FT ESDRKAEKAKLAKSEEEVFAFGVSDSAVSNSVDGWIIDSGASSHMCTNKGY FT FQSLDTLRDDTPKSVTVADGKSADVKGIGICKLRCYGQDSKVKQIVLSEVL FT YVPDLDMNLVSVSKLVQKGATVTFSETGCTISRGSRIAAVAPRYMGLYHLR FT LAEQAGAAMEQSCGKHCVHEWHRKLGHRDVQAIREMENKQLATGIKVGSCS FT SNITCETCLQGKMTRLPFPKKAKIKSKAVLDLVHSDLCGPMNTVTPGGHRY FT FLTLIDDFSRYTTLYFLHKKSETVEAIKDFVRCMKTRFGKPPKMIRSDQGG FT EYRATELVRFLKDEGILQQFTTAYTPQQNGVAERKNRSLVEMARCMIIDAG FT MHYRYWAEAVSTANFLQNMLPTKPIQLTPYEMWCGEKPDFGMMQIFGSEAY FT VFVPKEKRTKLESKAVKMTFVGYSSQHKAWRFIDTKTNKIVFSRDARFLPT FT KENPDSSTPDAPEDHVVEELPPVSVPATTESHMDRLPDGFASDEEDFYGYE FT EEDFHGYEDEPTMYEDACNSGENVSEHGDESDAVQEEFGEIEEEQSSGDLR FT RSNRTNKGVPPDRFSASSRFARPQSSEPRSYKQATTGTESAQWITAMKDEL FT ISHRENKTWDVVKLPPGKKIIGCRWIYKRKLDEHGRLSSYKARLVAQGHKQ FT RFGLDYDEIFAPVAKQVTFRMLLTIASRRNMLVKHVDVKTAYLNGDLKETV FT YMKIPPGVAAEPNEVCLLRKSLYGLKQSARMWNRKLDETLRQMGFKPAEAD FT PCLYVRWRNKKLTFILVYVDDMLICTSTQEEYEEILEALNSKFQMTALGDV FT KQFLGIQVTKTENGYCLNQQAYINRMVASFGLDDAKGSRIPMDPGYIQLKE FT EVERLPNNDKYQSLVGGLLYVSVHTRPDIGISASILGRRVSNPTVADWTEA FT KRTLRYLKATSGLQLQLGGDAGDLEGFADADWAGNVLDRKSNSGYLFRFGG FT GLISWCARKQPCVALSSTEAEYISLSECCQELMWLKKLMKDFGESVKKPIC FT IFEDNQSCIKQLSQAKTIGKRSKHIDTKYHFVKDLFEQEQIDVTYCPSEDM FT LADILTKPLSRVKLEILRDRIGLRSARDEEE" XX SQ Sequence 4123 BP; 1229 A; 824 C; 1099 G; 971 T; 0 other; ataggttatg ggccggtgtt agccattttg ttttcgtgcg cattgaaaat cgcaattttt 60 cggttcgttt caagtgtgat cgcgaaaatt gaaaatggag agaacgggta ttgcgaaatt 120 gaacaatgaa aattacgact cgtggaagct ggaagtggaa tttctgctcg taagagaagg 180 attgtggaag tacgtttcgc cgggagtgaa gcctgaacct gctgctgacg gcgcaaacgc 240 ggctgagatc gctgcttggg aagaagcaga tcagagagcg cgtgcgacaa tcggattgct 300 gctttctcgg agtcaacacg gccatattcg gaacactacg accgcgaagg ctgtgtggga 360 taatttgaag caacaacacg agaagaaaac gttgacttcc aaggtgcaac ttctgaagcg 420 gatttgtgac cttcagtatc acgagggcga tgacattgtg gagcatctgc aggaatatga 480 agatcttttc gagaagcttg cgaatgccgg aaccaaattg gacaacgatt tgcaagtgat 540 tctagtcttt cgtagcctgc ctagctcatt tgatgcgtta acaactacgt tggagaaccg 600 ttcggaggac gatttgacgt tggaactcgt aaaaggcaaa atcgtcaacg aggtcctcaa 660 gcgaaaggag aaaacgccag cagaaggcat cgccatgaaa gcggaagaca agaagaagaa 720 tattgtgtgc catcactgtc aaaaggcggg acacaaaaag cgtttctgca agcttttgac 780 gaagaaaaat aaagaagaaa gtgcgcgaag ccagagtgag tctgatcgta aagcagaaaa 840 agctaaactg gcaaaatcgg aagaagaagt gttcgcgttc ggtgtaagtg acagtgcggt 900 ctcgaattcg gtcgacggct ggataatcga ctcgggcgca agttcacata tgtgcaccaa 960 taagggatat ttccaatcgt tggacacatt gcgcgatgac actccgaaat cagttacggt 1020 cgcggacggg aaaagtgctg atgtgaaagg tatcggtatt tgcaaattgc gttgctatgg 1080 acaagactca aaagtgaaac aaatcgtgtt aagtgaggtc ttgtatgttc cggatctcga 1140 tatgaaccta gtatcagtga gtaagctggt gcaaaaagga gccacggtta ctttcagtga 1200 aacaggatgt acaatatcaa gaggaagtcg gattgctgca gttgccccca ggtatatggg 1260 cctttaccac ttgagattgg cggaacaagc gggtgcagcg atggagcaaa gttgcggaaa 1320 acactgtgta catgagtggc accgtaaact aggacacagg gatgtccaag ccatccgaga 1380 aatggagaat aagcagctag cgactggtat caaggtaggt agctgttcct ctaatatcac 1440 ttgtgaaacg tgtctgcaag ggaaaatgac tcgtcttcca tttcccaaga aggctaaaat 1500 caaatcgaaa gccgttcttg acctggtgca cagtgacttg tgtggaccca tgaacacagt 1560 cacaccaggt ggtcaccgct attttttaac gcttatcgac gattttagcc gatacaccac 1620 gttatatttc ctgcacaaga agtcagaaac tgtcgaggca atcaaagact tcgtgcggtg 1680 tatgaagaca cgatttggta aaccaccaaa aatgattaga tctgatcaag gcggtgagta 1740 ccgtgcaacg gagctggttc gtttcttgaa ggatgaagga attttgcagc aatttactac 1800 agcctatacg ccgcaacaaa acggcgttgc tgagcgaaaa aatcgttcat tagttgagat 1860 ggctcgctgc atgattattg atgctggtat gcattaccgg tattgggctg aagctgttag 1920 cacagccaat ttccttcaaa acatgctacc taccaaaccg atccagttga caccgtacga 1980 aatgtggtgc ggtgaaaagc cggactttgg aatgatgcaa attttcggct ccgaagccta 2040 tgtatttgtg ccgaaagaaa agcgtactaa actcgaatcg aaggcggtga aaatgacttt 2100 tgtagggtat tcatcgcaac ataaggcgtg gcgctttata gatacgaaaa cgaacaaaat 2160 tgtattcagt cgcgatgcaa ggtttctacc cacaaaagaa aaccctgact cgagtactcc 2220 agacgcgcca gaagatcacg ttgtggaaga actaccgccg gtgtctgtac ctgcaactac 2280 agagagccat atggatagac taccagatgg atttgcatcg gatgaagaag atttctatgg 2340 atatgaagaa gaagatttcc atggttatga agatgaaccc actatgtatg aagatgcctg 2400 taatagcgga gaaaacgttt cggagcacgg cgatgaaagt gatgcggttc aggaggagtt 2460 tggtgaaatt gaggaggagc aatccagtgg cgatctacgc cgctcgaatc ggacaaacaa 2520 gggcgttccg ccggatcgtt tttctgctag cagtagattt gcccgtccac agtcatcgga 2580 acccaggagc tacaagcaag ccacaaccgg aaccgaaagt gcacagtgga ttaccgctat 2640 gaaggatgaa ctcatttcgc atcgtgagaa caagacgtgg gatgtggtca agttgccgcc 2700 tggaaagaaa ataattggat gccgttggat atacaagcga aagctggatg aacacggaag 2760 gcttagcagt tataaagcgc gcctagtggc ccaggggcac aaacaacgat tcggtcttga 2820 ctacgatgag atctttgcac ctgttgcaaa gcaagtcacg tttagaatgc tgctaacaat 2880 cgcaagtcgt cgtaacatgc tggtaaaaca cgtggatgtt aagacagcat atctgaacgg 2940 tgacttgaag gagactgtgt atatgaagat tccaccaggc gttgctgcag aacccaacga 3000 agtttgcttg ttacggaaga gtttatatgg actgaagcag tcggcccgta tgtggaatcg 3060 caaattagat gagactctac gccagatggg atttaagcct gcagaagctg atccgtgtct 3120 gtacgtgcgt tggcgtaata agaagttgac gtttattctg gtgtacgtcg atgatatgtt 3180 gatctgcaca agcacccagg aggagtatga ggaaattctt gaagcgttga acagcaagtt 3240 ccaaatgacc gcccttggag atgttaaaca gtttttggga atacaggtca ctaaaactga 3300 aaacggttac tgtttaaacc agcaagcata cattaaccga atggtggcta gttttggttt 3360 ggacgatgct aaaggatctc gcattccaat ggacccggga tacattcagc tgaaggagga 3420 ggtagaaagg ctacccaata atgataaata ccagagcctt gtgggtggtc tgctttatgt 3480 atccgtacac acccgtcctg acattggaat tagtgcttcg atactaggac gccgagtcag 3540 taacccaaca gtagccgact ggacggaggc aaaaagaaca ttgcgctatc tgaaagctac 3600 gagtggactg cagcttcaac tgggcggtga tgccggagat cttgaaggct ttgctgatgc 3660 cgactgggcc gggaacgtct tagatcggaa gtcaaactca ggatatctgt ttcgctttgg 3720 tggtggactc atctcttggt gcgcaaggaa acaaccatgc gtggcattgt cttcaaccga 3780 ggcggaatac atttcactat cggaatgctg ccaggaactg atgtggctga aaaagttaat 3840 gaaagatttt ggtgaatcag tgaaaaagcc catatgcatc tttgaagaca atcagagttg 3900 cattaagcag ctcagtcaag caaaaaccat tggcaagcgt tcaaaacata ttgacacaaa 3960 ataccatttt gtcaaggacc tttttgaaca agaacaaatc gacgtgacct attgcccatc 4020 cgaagatatg cttgcggata ttctgactaa gccgttgagt cgtgttaagc tggagatact 4080 gcgagacagg attggcctca ggtcagcacg tgatgaggag gag 4123 // ID GLSAT3 repbase; DNA; INV; 312 BP. XX AC M10378; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE G.lateralis 'gc' rich satellite DNA. XX KW SAT; Satellite; Simple Repeat; GLSAT3; KW Satellite repetitive element. XX OS Gecarcinus lateralis OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; OC Eubrachyura; Grapsoidea; Gecarcinidae; Gecarcinus. XX RN [1] RP 1-312 RA LaMarca E.M., Allison P.D. and Skinner M.D.; RT "Irreversible denaturation mapping of a pyrimidine-rich domain of RT a complex satellite DNA."; RL J. Biol. Chem 256(12), 6475-6479 (1981). XX DR GenBank; M10378; Positions 1 312. XX SQ Sequence 312 BP; 49 A; 79 C; 71 G; 93 T; 20 other; agcgcagtgt ggcggctgac gatgcaccgc gatacgccga taccggacag tttccttgct 60 tatgtacgga tgcaataatg ttattgctta attcttcttg ttgttcttct cgcctttctt 120 ctccttgttc ttcttctttc ttctccttgt tacagttgct tgccgcacnn nnnnnnnnnn 180 nnnnnnnncg tgctggaatt ggttgttcct ctcccccttc tgtccttcct taacgacgca 240 ccgtgtaagg cgtaaagtcg ttcgaaggga gaggatggaa cagatcccgt acgcacggga 300 ggagcgctca cg 312 // ID TCORP1 repbase; DNA; INV; 198 BP. XX AC M84610; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Trichostrongylus colubriformis DNA repeat regions. XX KW Repetitive sequence; TCORP1. XX OS Trichostrongylus colubriformis OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Strongylida; Trichostrongyloidea; Trichostrongylidae; OC Trichostrongylinae; Trichostrongylus. XX RN [1] RP 1-198 RA Callaghan J.M. and Beh J.K.; RT "A middle-repetitive DNA sequence element in the sheep parasitic RT nematode, Trichostrongylus colubriformis."; RL Parasitology 109 ( Pt 3), 345-350 (1994). XX DR GenBank; M84610; Positions 724 921. XX SQ Sequence 198 BP; 54 A; 32 C; 67 G; 45 T; 0 other; aaattcatat ctcgcgtcgc agtagagctg ggaccctgaa tgtttaaagt gatagagggg 60 gtggtaaggg aacatcaatg gtgggggaac ggaagtgata gcaactgata agggctgata 120 aggagcctcg agtggctaat gggcgggtaa attcatatct cgccgtcgca gtggagctgg 180 gaccctgaat ttttgaag 198 // ID Copia-110_AA-LTR repbase; DNA; INV; 141 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-110_AA_; KW Ty1_copia_Ele111; Copia-110_AA-I; Copia-110_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-141 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 141 BP; 39 A; 28 C; 26 G; 45 T; 3 other; tgtcggaagt aatgcgacaa ttagtattca tagtaggcta tgataaactg tacatagaaa 60 gtagaaatat tttttttcat tcwtgttcca accgccgtac gtkaaggtag tttgktctct 120 ggctcccacc gcatttcacc a 141 // ID Copia-41_AA-LTR repbase; DNA; INV; 134 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-41_AA_; KW Copia-41_AA-I; Copia-41_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-134 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 964-964 (2011). XX DR [2] (Consensus) XX SQ Sequence 134 BP; 43 A; 24 C; 26 G; 41 T; 0 other; tgttgaagta ggcgcatgtg caagtgcgat attcaagcat gccatggaat aattagagta 60 taagttaatt cgaaataaac tctcagttat cgttagacca tcaaccaagt cggttgtact 120 ttcatttaac tcca 134 // ID BEL-8_SI-LTR repbase; DNA; INV; 276 BP. XX AC AEAQ01025887; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_SI_; KW BEL-8_SI-I; BEL-8_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-276 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01025887; Positions 12766 13041. XX SQ Sequence 276 BP; 58 A; 64 C; 57 G; 97 T; 0 other; tgttcacgct cataattagt catttctatt actttgcatt tattttacgt ctttgaaatt 60 agcgcgccct cagggcagca actcaacctt cagtgatcgc ttcggcgaca gatgtcgtgg 120 cgagcgttgt ctagtttcgt ttagcgagcg acggctctcg tcatggggct agttacttac 180 ttacttactt ctccttgttc tgtacccgag cgtcgagagc ttgtaaaatt tcgcaataaa 240 ggttttgttt tcgaattcaa ttttcgctat ccaaca 276 // ID Gypsy-62_AA-LTR repbase; DNA; INV; 126 BP. XX AC supercont1.345; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-62_AA_; KW Gypsy-62_AA-I; Gypsy-62_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-126 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.345; Positions 165 290. XX SQ Sequence 126 BP; 43 A; 27 C; 16 G; 40 T; 0 other; tgtaacgtat aagcgtaaca tgtgtttatt ggaatacctt cacataacag tatcttgaac 60 cttaccttta taccatgaac catacctgac aataaacatc attgattgac tcgtaactag 120 cttaca 126 // ID Gypsy-180_AA-LTR repbase; DNA; INV; 1654 BP. XX AC supercont1.157; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-180_AA_; KW Gypsy-180_AA-I; Gypsy-180_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1654 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.157; Positions 760947 759294. XX SQ Sequence 1654 BP; 464 A; 389 C; 443 G; 358 T; 0 other; tgttacgttt tgtatgttta gcgatattgt atcaggagaa aactttgaca accattaaga 60 ttataacaca aaaacagtaa ggcattgctc cgccagtcaa agtttaagca ccactgcgac 120 aaccatccgt caccagcctg ccaaataaca tcagcacagc aaatcattct caatacataa 180 gcgcactcaa gtacataagg gaacgaagac gaaacaactc cacacctaac agcacggatg 240 gaagagccgt aggagaagaa gcgatctcta cggtgacgtt gctcagcata tgcgggagag 300 tgattgcccg aaggccgaca cacacacacg ccaaccaaag ccagcacgat tgagctaggg 360 cgagaaagat ttttcatcgt cgttctattg tcagtcccca tgatgcgatc ggtcggttcg 420 gtttaataag cggattttgc cacaaaaagt tagtgccgtt taaagccggc gaaagagaac 480 ttttgcagtt ttccatagtg aaaggttaca tagttgttac cggccgtggc ggaaaacatc 540 cggacgcgaa aggcgccaga gctcatcatc gtcacgtggc catacggtga aagtaaggat 600 cacggttttc gcggcaaata tcaacgacct agaaggcacg aagaagcatt ttagagtggt 660 gagtcaaatt gtagattgac aacgaagtgg gagctaattt tgtacatcat tcaggccaac 720 cgcttcgaaa ccgacgtgat tgtccgatca cgagatcatc gctggcatca ggacgggcac 780 gcattgttca acaataagaa cgccgcaaat ccattgtctg ctgacgctac tgctcggatc 840 gcgattccga cggaaagaaa gggggtaggt acgctacatt gtgtgggcaa aatggccgac 900 taatccgttc ctgatttagg gacgattttg catccaccaa tagcgcatag ccactgcaag 960 agccgtagga ggacctcgaa gcgagccagg ttggctgtcg cttccctcga agcacccaca 1020 cacgacgaga aaccccttat tgggtatgtg tagccggcaa tcaagaacga gcagtcacta 1080 accggcagaa caacagggtt ttttcgttgc agtttcgttc tttggctcat cgaaggaagg 1140 cccagcgagt ctacactcag gggttcccaa caccgcggtg gaggatcgtc cgtagccata 1200 tcagcgtttg acatttgaca catcggagga agagcggact gtgactcacg accaggggtt 1260 cccaacacag tagtgcaagg tcgtcggcgg tgaatgaccg tcgagagcca gccgcttcag 1320 ctgaggtggt gcaggagcac agagcgtgcg acgacggcga gcgcagaccg tcagaaagga 1380 tgcggaggct ggtggagcta aaggcggcgg gagcataatg gccggtgccg tgagtaccag 1440 caatctagga ttaagccact acatgtaaga agggatagaa tataggcctc gaggtcaagt 1500 ttaggagtca ggcagtcatg atggcaccca atatatgcat ttaaaatgaa ttactcgttt 1560 tgtgattgag ttggagccct ttgccttcag tgtttgtgcc gaatcccctt ttgataatgg 1620 tagttatgct ctaccgggtc gggaacttgc aaca 1654 // ID Jockey_Ele5 repbase; DNA; INV; 4129 BP. XX AC . XX DT 22-DEC-2010 (Rel. 16.01, Created) DT 22-DEC-2010 (Rel. 16.01, Last updated, Version 3) XX DE A Jockey clade non-LTR retrotransposon family from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey_Ele5. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4129 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4129 RA Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 87..1337 FT /product="Jockey_Ele5_1p" FT /translation="MGRKRKVRNGKPADITASTPEQSXTRKKRQVDEISLQ FT ETILSRNRYSTGGTAVNDSDLGDFGDVPDEQQQQQQRDPPGKIPPLMVKSV FT GLSKLQATMKALNISAMYKLCRIGVKVILNCKEDYQKAKAYLVRSKAEFFS FT FDMPSEKPFKAVIRGLCDMEPEAIKMELENRYKLKPLAVFPMNRHDRKRVY FT RDALYLVHFKKGTVTLGALKAARAVNDVIITWEAYRGANRDVTQCMRCLHF FT GHGARNCNVTPRCGVCTQQHETRDCPIEEVVDYKCANCGGNHRATDRTCEK FT RAEYKRIRMQASSQNQPRRRMKEAPKFSEENFPPIPSQAGGRPSERQAGTD FT TPPPGLSYRDRLCQRGTSIFQPQQSNAAPHDNSAPLLTPEQLLPIFHDMWQ FT QMQQCRTRADQILCLGEFIIRHG" FT CDS 1193..3982 FT /product="Jockey_Ele5_2p" FT /translation="MQRRTTTRLHCSRLSNFCPSSTICGSKCNSVAPVRTK FT SSAWENSSSAMANGLKVLLWNAHSVKPKDVELLDLAEREEAQVITITETHL FT KPGVNFCLPGYNTVRLDRTATNGGGVAIAVKQPIKFVIQPDYRTSVIEAVG FT VEIATPDGPLLVIAAYCPQQCYEGSGKARQFKNDLVKLTRHRKKFIVACDL FT NAKHEAWGNHRRNRNGNLLFDESQLGYFVVNAPKDPTFLSPAGIPACLDFF FT LTNMEVPLPTTINELSSDHLPVLLRVGGDAEREQRRTRKDYHRVNWVVFQR FT KVDSRVDAEVQLDTVEDIDLAVENLQAAIVAAESECVPRVQIVSSQLQFDP FT ELRSIMRARNVFRRQFQRTGDLSKKSQANYLTKIIALRVQEMRNDRFERDV FT RRMDPFSRPFWKVAKILKSKPQQIPPLKCDENLLVSPLEKANAIASSFSKN FT HLIGSHLVSPMEQAVSDTLLTLNNTPCYVPDNKKISVEEVSAAIKGAKNMK FT APGFDTLFNLVLKNLSVSALSHIANIFNRCLDLNYFPNCWKLAKVVPILKP FT GKDPTSPKSYRPISLLSALSKLFEKVILNRVLDHVNNNNILLQEQFGFRKG FT HSTSHQLYRVTNIINKNKSVAKSTVMGLLDVEKAFDNVWHDGLVYKLYRYN FT FPNYLIKIIQHYLKDRQFRVALSGELSDSLQIPAGVPQGSLLGPILYSIYT FT SDVPQLPDGCYLSLFADDTAIMVKGRNTRTTVKKLQECLNSFQQYASLWKI FT KLNAAKTQVIVFPYNRSPKLLPPDDCKIVMDGVSIDWSTDVLYLGLTIDQK FT LLFRSHVEKTKTKCNKIVKALYPLINRRSKLCLKNKIAIFKQIISPVIDYA FT MPVWQSTAMSHRKTLQIVQNKALKMILNVPYHTKNTEVHDIAGVQMLDQKI FT IANFDKFKAKCQQSEYAMINTLFQQN" XX SQ Sequence 4129 BP; 1222 A; 1036 C; 964 G; 904 T; 3 other; cagtttactt tcgccccttg agggtatcgg acgtttttcg agagcgcggt ttaatatcgt 60 wgttagttcg cggcttgcaa cccagtatgg gacgcaagcg aaaggttcgg aacggaaagc 120 cggccgacat cacggccagc acaccggagc agtccwacac gaggaagaag cgtcaagttg 180 acgaaatctc tcttcaggag acaatcctgt cgcgaaatcg gtacagcact ggmggaacgg 240 ctgtcaacga tagcgactta ggcgactttg gcgacgtgcc ggatgagcag cagcaacaac 300 agcagcgcga tccgcctgga aaaatccctc cgctgatggt aaagtccgtc ggcctcagca 360 aactgcaagc caccatgaag gcgctcaaca tcagtgcgat gtacaagcta tgtcgcattg 420 gagtcaaagt gatcctcaac tgcaaggaag actaccaaaa agccaaggcc tacctggtga 480 ggagcaaagc cgaattcttc agcttcgaca tgccttcgga aaaaccattc aaggctgtca 540 tccgtggtct ctgtgacatg gaaccagagg ccatcaagat ggaattggaa aaccgctaca 600 aactcaagcc attagcggta tttccaatga atcgccacga caggaagaga gtgtatcgag 660 atgccctgta cctcgtgcac ttcaagaaag gaacagtgac ccttggagct ctgaaggcag 720 ctagagcagt caacgacgtc atcatcacct gggaggcata ccgtggagcg aatcgagacg 780 ttactcagtg tatgcgctgc ctgcatttcg gtcacggagc aagaaattgc aatgtgacgc 840 cgaggtgtgg tgtctgcact cagcaacacg agaccagaga ctgccccatc gaagaggtgg 900 ttgactacaa gtgtgccaat tgtggcggta accatcgagc gaccgatcgc acctgtgaaa 960 aacgagcgga atacaagcgg atccgaatgc aagcttcaag ccaaaaccag ccaagacgcc 1020 ggatgaagga ggcgccgaaa ttcagcgagg agaacttccc cccaatccca tcgcaagccg 1080 ggggcaggcc atccgaacgt caagcaggca cggatactcc tccgccgggt ttgtcctatc 1140 gggatcgact ttgccagcgt ggaacatcta tcttccagcc tcagcagagc aatgcagcgc 1200 cgcacgacaa ctcggctcca ttgctcacgc ctgagcaact tctgcccatc ttccacgata 1260 tgtggcagca aatgcaacag tgtcgcaccc gtgcggacca aatcctctgc ttgggagaat 1320 tcatcatccg ccatggctaa tggtctgaag gtactgctct ggaacgccca ctccgttaaa 1380 cccaaggatg tggaattact cgacctcgcg gagagagaag aagcgcaagt aatcaccatc 1440 accgaaaccc atctgaagcc cggcgtcaac ttctgccttc cgggatacaa cacagtgcgg 1500 cttgacagaa cggctaccaa tggaggtgga gtggccattg ctgttaagca gccgatcaaa 1560 ttcgtgatcc agcccgacta caggacatcg gtcatcgagg cagttggggt agaaatagct 1620 actccagatg gacccctgct ggtaattgcc gcctactgcc cgcagcagtg ctatgaagga 1680 tcaggtaagg caaggcagtt caagaatgat cttgtaaagc tgaccagaca tcgcaagaag 1740 ttcatcgtgg catgcgacct aaatgccaaa cacgaggcgt ggggaaatca tcgccgcaac 1800 aggaacggaa accttctatt cgacgagtcc caactggggt acttcgtggt gaacgctccg 1860 aaggacccga cattcctgtc gccggccggc atcccagcct gcctggactt cttcctgacc 1920 aacatggagg tcccactacc gacaaccatc aacgaactga gctccgacca cttgcctgtc 1980 ctgctaaggg ttggtggtga cgctgaacga gagcaacgcc gaacgaggaa ggactaccac 2040 cgggtgaact gggtggtgtt tcaacgaaaa gtcgattcca gagtcgacgc agaagtgcaa 2100 ctggacaccg ttgaagacat cgacctggcg gtagaaaacc tgcaagcggc catcgtagca 2160 gccgagagcg agtgcgtccc acgagtgcaa atcgtaagta gccaactaca gttcgaccct 2220 gaactccgca gtataatgcg agctaggaat gtgtttagga ggcagttcca gcgtactgga 2280 gatttgtcaa aaaagtctca agcaaattac ttgaccaaaa ttattgcttt gcgagttcag 2340 gaaatgagga acgacaggtt tgagcgagat gtcaggagaa tggatccgtt ttcaaggccg 2400 ttctggaagg tagccaaaat cttaaaatca aaaccccaac agattccccc cctcaagtgt 2460 gatgaaaact tactggtgtc tccgttagag aaggcaaatg ccattgcatc tagcttcagc 2520 aaaaatcatc tgataggtag ccatttggtt agtccaatgg agcaagcagt ctcagataca 2580 ctcctcactt tgaataacac cccctgttac gtccctgaca acaagaaaat ctcggtcgaa 2640 gaagtctctg cagccattaa aggtgctaaa aatatgaaag cccctggctt tgatacactc 2700 tttaacctgg ttctcaaaaa ccttagtgtg tccgctctgt cccacattgc caacatcttc 2760 aacagatgcc tagatttgaa ctacttcccg aattgttgga agttggcaaa agtggtacca 2820 atactgaaac caggaaaaga tcccaccagt ccaaagagct acaggccaat tagtctgctg 2880 tcggctctca gtaagctgtt tgaaaaagtg attctaaatc gagtattaga tcacgtaaac 2940 aacaacaaca tcctcctaca ggagcagttc ggcttccgta agggtcactc gacgtctcat 3000 cagctgtatc gcgtaaccaa tatcattaac aaaaacaaat ccgtcgctaa gtccaccgtc 3060 atgggtcttc tcgatgtaga gaaggccttt gacaatgtgt ggcacgacgg gcttgtttac 3120 aagctctacc gttataattt cccgaactat ctaatcaaga taatccaaca ctatctgaaa 3180 gatcgacagt ttcgagtggc ccttagtggt gaactttcgg acagcttaca aatcccagcc 3240 ggagtgccgc agggtagcct actcggtcca atactctata gtatttacac atcagatgtg 3300 ccgcaacttc ccgatggatg ttacctgtct ctctttgccg acgatactgc cattatggtt 3360 aagggacgta acaccagaac cacagtcaaa aaactacaag aatgcttgaa ctcctttcaa 3420 cagtatgcat ccctgtggaa aataaagctg aatgcagcca aaacccaggt gatagttttc 3480 ccatacaatc gctccccaaa gttactccct cccgatgact gcaaaattgt tatggatggt 3540 gtttccatcg actggtccac cgatgtcttg tatctgggac tcactattga ccagaagctc 3600 ctcttccgat cccatgtaga gaaaacaaag accaaatgca acaaaattgt aaaagcacta 3660 tatcccttga taaatcgcag atcaaaatta tgtctcaaga acaaaatagc aatattcaaa 3720 cagattatat caccagtaat agattatgct atgcccgtct ggcaatcaac tgcaatgtcc 3780 catagaaaaa cactacaaat tgttcaaaat aaggctctta aaatgatatt aaatgttcca 3840 taccatacta agaatactga agtgcatgat attgcaggtg ttcaaatgct ggatcaaaaa 3900 ataatagcca attttgataa attcaaagca aagtgtcaac agtcagaata tgcaatgata 3960 aatacgctgt ttcagcaaaa ttaggttagg attgtagtta ggattagttt gtaagtattt 4020 gtaaggaaaa attataacat ttgctaagag cggaaaactc taaactatta taagaaacaa 4080 cacttttgtg gaagaagatc aaagctgaaa tgccatgagc agcatcact 4129 // ID MARIAM1 repbase; DNA; INV; 1282 BP. XX AC . XX DT 28-OCT-2005 (Rel. 10.1, Created) DT 31-OCT-2005 (Rel. 10.1, Last updated, Version 2) XX DE Mariner-type DNA transposon from Apis mellifera (a consensus). XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARIAM1. XX OS Apis mellifera OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Apis. XX RN [1] RP 543-993 RA Robertson H.M.; RT "The mariner transposable element is widespread in insects."; RL Nature 362(6417), 241-245 (1993). XX RN [2] RP 1-1282 RA Jurka J.; RT "MARIAM1: Mariner-type transposon from honeybee (a consensus)."; RL Direct Submission to RR (28-OCT-2005). XX DR [2] (Consensus) XX CC A fragment of this transposon was originally reported in ref. 1. CC Present in multiple copies >95% similar to consensus. The CC consensus sequence was derived from Genome Assembly Amel 3.0 CC available from BCM-HGSC. XX FH Key Location/Qualifiers FT CDS 181..1206 FT /product="MARIAM1_1p" FT /translation="MATDKVHLRHCILYEFQQGRNATEACRNLLKVFGEGT FT VSDRTCRRWYEKFETGDFDLSDKPRSGRPSLIDDDVVKAMLEQDPFLTTSE FT IAERLNSAQQTISDHIRKIGLVWKYSRWVPHELSQKNLDDRVVICTSLLAR FT NKIEPFLNRMITGDEKWITYNNIVRKRAYCEPGKPSPSTSKPNLTLNKRML FT CIWWDIRGPIHYELLKPNEKLNSEKYCQQLDNLKTAVQKKRPAMXNRKDMI FT LHHDNARPHAALGTRQKIAEVGWEILSHPPYSPDIAPSDYHLFLSLQNFLT FT GKKFKNEEDVERALVQFFASKDETFFKNGIYKLPSRWQEIINNNGNYIIQ" XX SQ Sequence 1282 BP; 431 A; 231 C; 262 G; 357 T; 1 other; taggtctacc ggaaagttct gtccgaatct atgacatcat tttcgccacg taagcacatg 60 tttatttatt gcatgttcgg ctctatattt ttatcgctta atgtatacat actgacgtag 120 caaataaact ataataaagt tgattcacat tagtcttaag tgtgaaacga tagtataccc 180 atggcgactg ataaagttca tttacgccac tgtattttat acgaatttca acaaggaaga 240 aatgctacag aagcatgtag aaatttattg aaagtgtttg gtgaaggtac agtttctgat 300 aggacatgca gaagatggta cgaaaaattt gaaacaggtg atttcgacct ttctgataag 360 ccacgttctg ggcgaccatc tttgatcgac gacgatgttg ttaaggcaat gttggagcaa 420 gatccttttc tgacaacatc ggagatcgca gaaaggctta attcagctca acaaaccatt 480 tctgaccata ttcggaagat aggattggtg tggaagtatt caagatgggt gccacatgaa 540 ttaagtcaga aaaatttgga tgatcgagtt gtcatatgca catctctgct tgctcggaac 600 aaaatcgagc cctttttgaa ccggatgata actggggatg aaaagtggat tacatacaac 660 aacattgtaa ggaaaagggc atattgtgaa cccggaaaac ctagcccttc cacctctaaa 720 ccaaatttga ctctgaataa gagaatgttg tgtatatggt gggacattcg aggaccaata 780 cattatgagc ttttaaaacc gaacgaaaag ctcaattcgg agaagtattg tcagcaactg 840 gataatttaa agacagcggt ccaaaaaaag aggccggcaa tgttsaatag gaaggacatg 900 atactgcacc acgataacgc cagaccacac gctgctttag ggactcgtca aaaaattgca 960 gaagtaggct gggaaattct gtcgcaccca ccatattccc cggacatagc accctctgat 1020 tatcacttgt ttttatcctt acaaaatttt ttgacgggca aaaaattcaa aaatgaagaa 1080 gatgtcgaac gagcactggt tcaatttttc gcatcaaaag atgaaacatt tttcaaaaat 1140 gggatataca aattgccctc acgctggcaa gagatcatta ataataatgg caattatatt 1200 attcaataaa gttaattggc ggtaagaaaa aatttgtatt ttgttttatt ccaaaaacgg 1260 acagaacttt ccggtagacc ta 1282 // ID hATw-5_BF repbase; DNA; INV; 6084 BP. XX AC ABEP01037006.1; XX DT 13-JAN-2009 (Rel. 14.02, Created) DT 13-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Branchiostoma floridae. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6084 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Branchiostoma floridae."; RL Repbase Reports 9(2), 518-518 (2009). XX DR EMBL/GenBank/DDBJ; ABEP01037006.1; Positions 89453 83370. XX FH Key Location/Qualifiers FT CDS 1461..5159 FT /product="hATw-5_BF_1p" FT /translation="MDEPAPTITNMMVAEIEAFRTKTKATLPDAKQWLEGL FT LGKGFFDDVKKTSFEYSLTSRREKMNKAKQNKTPKLLDYLKQTYTKPIPRN FT AQPNNPEPSTSNLSQKLEAAKKSKGILQRELKEKDLEIQGMVVKNFSLEKQ FT LEAKDRQKEKEEKRMRKKEKLLEKREAEMKESAIELNIRNWKEKLRRRDSN FT ITKLKELAKGSKNTAAKLQQLKKDKGKGSRQLQYFKKRTRALEEELKDLKK FT KNKTKDKRLSQLEQEVEDLVEQALEKREIKLREGPHKNSAFTEDVRECFMA FT LLTYEVSSHNLSKVAWCVLTILAKIKVKLSDFPGETFCQEMRNEANLTATV FT HVAEVIAQKSNNTIHSDGTSRDQQKVVGLQASCGGKTATMGLQEVTSGVAE FT EQLEAFKFTCNKMAQLTASPERVEARYNELVSSFKNTMGDGAASQKRFNQL FT LEEYRQEILPHVQQNWETLTTQQQESLSKMNHMYCNLHALIGFATYADEAL FT TELETKVWRARDGPLGVEDLREFQDKNGRYSWPLKDSATQRLVRTTCSAFG FT PEGSQSAGRIQAFKDFLHLSSKSVDGEEVQNKKSLVKLLSFRANRFNIVFK FT AAAAVYHHNGDIRRLFEEGFVKADNKLLKAVLADCSCDPLLAGCRALGIIA FT VQITEPYWRLIEMKDVHILDLSSHLQSALAKFKQWAKDATPLLQPDMPPLF FT HTTQPVQPIKDEVFNSLYNNTSDNISTMTKQALEVILSHMIVVLERRFPEQ FT LYGAYSDSSNPTQREEMKGMEKSNRRGENDFGYWSQILTTKPSISLMAAEG FT QLMFKTNKTAEWLRKLSKDNPGRYSAILSLVKKRKNEWKKKYAERENEQQE FT AREAKLQAQAEEHARKIKKKEEQQDGRKKQLEEYGGPASTPAECEEFIATV FT SPLPENKEKLALRAHITYLKEKHPAFAKSNSLFILSSGGRDHKPDILSANL FT RSILRSPELQTQSEAATEPEEATTSAPQQRLTAEDANDKVDRIKAQFAKKA FT AECSKQPRNVERPVEGAENSESIDWDDSEEESDAEESDAEESDAEESDAEE FT SDAEESDAEESDAEESDAEESDAEERNAEESSEEDGEESKQHREEPPAKKQ FT RLSDSEKDEEDEEDIVNGSMVVVAYEDDWALGEVKEVKDDGELVIKYMQRK FT KSKKGKCTFFWPRRGCVYTVEDKFILCTVASSSVLPNKDTLGRDSVISDDD FT FTEIQRKYNIYYNQYFAKGQHR*" XX SQ Sequence 6084 BP; 1991 A; 1193 C; 1443 G; 1457 T; 0 other; taggggtgcc gcacggcccg cgatttccta tcccaacgcg ctgacacgcg ctaaaaccga 60 aaattgcgca atgtttgagg ctcaaagttg ggcatttaac ggacaaaaaa cgcctagatt 120 tttgtatcag tagttgaaga gggttcttat ctgcatatct gtgaaagttt tataaaaata 180 ccttcaacca taagcgagca atcgcccgcc gggaaatgcg tgttttgccc gcggcgccat 240 gaacgccctc ccccgtgcgc agtattgccg ggggccggtt gcgttcgaaa gggattgcgg 300 gatttgtcta caatcgtaaa caagaaagtt agccaccaat ttgtagccga cacttcgagc 360 tacaaaatgc taccaccgtt aaacgggtgc gacgtcggtg tgaaaaaata ttcgcatgta 420 aacttttcaa agtcgattat ctcagcaatg tccctccgta aaaaattttt aacggttata 480 tttttgaaaa attcagaagt ttgcctttct gttggtacca aattcgcata tatttgagcg 540 attacaacgc ccggaacatg gggctacttt tagcattacc gactggattt tctcgatttc 600 caacatggcg tccgcttcgc ccgggggcga agatcgatct cacgatgccg ggatttctgc 660 gacatccgac tcaaatgagg ccgctaatat acccaaaagg ccaggtaagg aatgtttctc 720 gccttttatg tagtttttag gtggaatata tgagttttat agcagatatt gccgtctgaa 780 tatcgaggga aatgtagcac atgcttacct catgtagcgt cccttgtccg gacatgttcg 840 ctggctctgt cgttgctttt attttgattt ttgggccaga actacggtga attgatcctt 900 gttaacgtac attcagtaat ttcaccgaaa aactatgttt tatgcactgt ttctttacaa 960 tatcgaatcg attttatgta ccaaaatctg cagcgttacc agtactactt acgtagtagg 1020 gagagggtgg gaggggggcg gcggctgccc agcgtccccg acctttgaaa gaatattttt 1080 acaatcttat ttcactacaa tattttattt tgtctcttga caagatttaa tgaactttgc 1140 ttaggttttt attgaacctt agactgatct ctttgggatc tatttactat gctgacatta 1200 ctcaatagca agttccatta catatcagct gttacatata ataatatatg atatagtgct 1260 aactatgaca tgaatgtggt caatatttaa atgacattac aatgtacatt tttagagata 1320 agtatttgag ttaaacttga tatttagata acattccatc cttcataatc acattcacag 1380 gaatatctgt tgttgggcca gagtgcgaga aggcaggcat cacacccacg gtgttgcaga 1440 acggatggac cagccagcag atggacgagc cagctcccac gataaccaac atgatggtgg 1500 ctgaaatcga agcttttcgc accaagacga aggcaacact tcctgatgcc aaacagtggc 1560 tggaaggact acttgggaaa ggcttctttg atgatgtgaa gaaaaccagc tttgagtact 1620 ctctgacatc aaggagagag aagatgaata aggcaaaaca aaacaagacc ccaaaacttc 1680 tcgactacct gaaacaaaca tacacaaagc ccatcccaag aaatgcacaa ccaaacaatc 1740 cagaaccatc aacctctaat ctttctcaga agcttgaagc tgcaaagaag tccaagggta 1800 tactacagag agaactcaag gagaaagatt tggaaattca gggcatggtt gtgaagaatt 1860 tcagcttgga aaaacagctg gaagcaaagg atagacaaaa agagaaggaa gagaaaagga 1920 tgagaaagaa ggagaagctg ttagaaaaga gagaagcaga aatgaaggag agtgctatcg 1980 aactcaacat aaggaactgg aaagaaaagt tgagacggag ggacagcaac ataacaaagc 2040 ttaaagagtt agcaaagggc tccaagaata cagcagcaaa acttcagcag ttgaagaagg 2100 acaagggaaa gggtagccgt caactgcagt acttcaagaa gagaactaga gcattggagg 2160 aagagctgaa agacttaaag aagaagaaca aaacaaagga caaaagactt tcccaactcg 2220 agcaagaagt tgaggacctt gtagagcagg cgttggaaaa gcgagaaatc aagctacgtg 2280 agggccccca caaaaacagt gctttcacag aagatgtgag ggaatgcttc atggcactgt 2340 tgacctatga agtgagctcc cacaacttat ccaaggtagc ttggtgtgtg ctgacaatcc 2400 ttgccaagat taaagttaag ctatctgact tccctggtga aacattttgc caggagatga 2460 ggaatgaagc caacctcaca gccaccgtcc atgtagctga agtcattgct cagaagagta 2520 ataacacgat ccacagtgat ggcactagca gagatcagca gaaagttgtg ggtctgcaag 2580 cctcatgtgg aggaaagaca gctacaatgg gtctgcaaga agtgacgagt ggagttgctg 2640 aagaacagct ggaggcattt aagttcactt gcaacaagat ggcgcagctg acagcctccc 2700 cagaaagggt agaagccaga tataatgagc tggtgtcatc cttcaagaac actatgggcg 2760 atggcgctgc aagtcagaag cgttttaacc agctacttga agagtatcga caggaaatac 2820 taccacatgt gcaacaaaac tgggaaaccc tgaccacaca gcagcaggag agcttgtcga 2880 aaatgaacca catgtactgc aacctccatg ctttaattgg gtttgcaacc tatgcagatg 2940 aggctcttac tgagctggag acgaaagtgt ggcgggcaag ggatgggccg ctgggtgtcg 3000 aagatcttag ggagtttcaa gacaagaatg gaaggtacag ttggcccctg aaagactccg 3060 ccacacaaag gctcgtaaga acaacctgtt ctgcctttgg gccagaggga agccagagcg 3120 cggggcgaat tcaggcattc aaggattttc tgcacctgtc ctctaagtcc gtcgatggtg 3180 aggaggtgca gaacaagaaa agccttgtca agctgctttc gttccgcgct aaccgcttta 3240 acattgtgtt taaagctgct gcagcagtct accaccacaa tggcgacatc cgacggttgt 3300 ttgaagaggg gtttgtgaag gcagacaaca aactactaaa agctgtgctg gcagactgct 3360 cttgcgatcc actgcttgca ggctgcaggg cattgggaat cattgctgtc cagatcactg 3420 aaccttactg gagactgatt gaaatgaagg atgtccacat tcttgatctg tcttcccacc 3480 ttcaatcagc cctagccaag tttaagcagt gggcgaagga tgctacacca cttctccaac 3540 ctgacatgcc accactcttc cacacaacac agccagtaca acctatcaaa gatgaggtct 3600 tcaactctct gtacaacaat acctctgaca acatctccac tatgacaaag caggccctgg 3660 aagtgattct gtcccacatg atcgtggtcc tggaaagaag atttcctgaa cagttatatg 3720 gggcatattc agacagcagc aatccaacac agagagaaga aatgaaggga atggaaaaga 3780 gcaacagaag gggggagaat gactttggtt attggtccca gatcctgaca accaaacctt 3840 ccatctctct aatggctgct gagggccagc taatgttcaa aacaaacaaa actgcagaat 3900 ggttgaggaa gttgtcaaag gacaaccctg gcaggtacag tgcaatactg tcactcgtca 3960 aaaagaggaa gaatgagtgg aagaagaagt atgcagaaag agaaaatgag caacaagaag 4020 ccagagaggc taagttacaa gctcaagcag aggagcatgc aagaaaaatc aaaaagaaag 4080 aagaacagca ggatggaagg aagaaacagt tggaagagta tggagggcca gcttcaaccc 4140 ctgcagaatg tgaggaattc attgcaaccg tgtcaccact tccagagaac aaggagaaac 4200 ttgctctcag ggcacacata acgtacctaa aggagaaaca ccctgctttt gcaaagtcca 4260 acagcctgtt catactttca tctggtggaa gagaccataa acctgacatt ctgtctgcaa 4320 acttgcgttc gattttacgc tcaccagaac tccaaactca aagtgaagca gctacagagc 4380 cagaagaggc aacaacatcc gccccacagc agcggttgac agctgaggat gctaatgaca 4440 aagttgacag aattaaggca cagtttgcaa agaaggcagc agaatgcagt aaacagcctc 4500 gtaatgttga aagaccagtt gagggtgcag aaaacagtga gagcatcgac tgggatgaca 4560 gtgaagagga aagtgatgca gaggagagtg atgcagagga gagtgatgca gaggagagtg 4620 atgcagagga gagtgatgca gaggagagtg atgcagagga gagtgatgca gaggagagtg 4680 atgcagagga gagtgatgca gaggagagga atgcggagga gagttcggag gaagatggag 4740 aagaaagcaa acaacacagg gaggaacctc cagcaaagaa acaaagactg tcagactcag 4800 agaaagatga ggaagatgaa gaggacattg ttaatggaag catggttgta gttgcctatg 4860 aggatgattg ggctttgggc gaagtgaagg aagtgaagga tgacggtgaa ctggttataa 4920 agtatatgca gagaaagaaa tctaagaaag gaaagtgtac tttcttttgg ccaaggagag 4980 gatgtgtata cacagtagag gataagttta tcctatgtac agtagcaagc tccagtgtac 5040 taccaaacaa agacacttta ggaagagaca gtgtaatttc agatgatgac ttcactgaga 5100 tccagagaaa atacaacatc tactacaatc aatacttcgc caagggccag caccgatgat 5160 cctcctgaaa aagaaaatac ttagagttat tcctgctcat gatatagcaa caatacagga 5220 tagatataca cattcagtat aacatggtac atagaactat gttgagtgcc ctaaggatat 5280 acagtgaaag atattgtttc acatgattca tatgtatgtc aaatgtaatg tgagcatttt 5340 gttaacctta tcaatgtcat aaaactagta tatgcacatc agatgtgtca attttgatgg 5400 cagatttgga ctcggcatgt gcaaaaaccc cctaggagca aagtttgtgc agattcgtcc 5460 aactactagc gaaatattgt gatttcgcat atttaatgag gggacttagt aattattacg 5520 cccaaaatgt agtcaagtgt cataaaatca tattttgtga tgagatacat caattttaat 5580 gtcatatttg gattcagcac ccagaaaaac ccccaggatg taaagtttgt aggaattgat 5640 caaagtatat gagaaatatt gcaattttgc atatttaatg agctgactta gaaattttct 5700 tgttgacctt cgtcgacgta cctgtttatt ctgtaaatat ttaccatgtt tcgaggttat 5760 gtaactcagg aaatgatgac tctacaaagc tgaagtttct acatgtttta atatgctaac 5820 aataaagtct ttaaaaggtg aattgatggt tgtgtgatat tctgttctca aaatatataa 5880 gtatgaatat ttgataaaat tgcttattta ccgaattcgt taaaacaaag gccccaaaac 5940 gaaacttaaa ttgtaaatat ctcggccata caacatccaa tttgcttcaa acaaaaacta 6000 ttttgtttct ttcagtaagc tctttctgat gaacttaatt gcaaattgct aagaacaagt 6060 tgaatttttt gtccaatacc ccta 6084 // ID Gypsy-83_CQ-I repbase; DNA; INV; 4778 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-83_CQ_; KW Gypsy-83_CQ-LTR; Gypsy-83_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4778 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 545-545 (2011). XX DR [2] (Consensus) XX CC Positions [3731-4045] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 472..3798 FT /product="Gypsy-83_CQ-I_1p" FT /translation="MPNSMTLVNQIEQFIPGTIPFAQYLEQLEWVFAHHQV FT EQDEDKKKAFLAASSREVYTELKKLFPGKDLKKETFVSITTALQKRYDKTD FT SDMIQRFKFYQRKQRENESAEDFILNVKQQAELCDFGQFKDQAIRDRLVCG FT LNDEVLRQRLFDEEDLTLTKVEKLIVNRELATVRAKLVSDNDVYRGNVLNR FT VGDRNRNNRANGAPSFRGRSRSRGRPGGRRERSRSFSAGRRHRSPAATNRK FT FFCTHCRRPGHTRSYCFDLPENKKAVKFLDEPASTSKNVNRRLVNYREDES FT DYDEDMNCLMISSIKKLSEPCLRQVYVDGKQLEMEVDCGAAVSVINSVVYE FT KKFKHIPLMKCDRKLAVINGSRLNVEGQISVEVVFNGIRSNVKLIVLRCSS FT VFAPLLGRDWLDVFTPRWRDAFGISAGIKQLDVFQDHMVEKFKRKFPNIFD FT KSLASPIQGFEAELVLKCDTPIFKRAYDVPLRLKDKVIEHLKSLERDGVIT FT PVDASEWASPVIVVIKKDGGIRMVIDCKVSINKVIIPNTYPLPLVQDIFAS FT LAGAKVFCSLDLAGAYTQLLLSKKSREIMVINTIIGLFVYNRLPQGASSSA FT AIFQRIMEKVLFNIDGVYCYLDDVLIAGKDKEDCVCKLELVLERLSNANIK FT VNLQKCKWFVNSLPFLGHVLTDNGLMPSPDRVETIRRAKIPNNVSELKAFL FT GLINYYGKFIPNLSTRLSCLYRLLKKEVRFVWSSECNRAFEHCKQSLLNSK FT LLEFFDPSKPVIVVTDACSYGLGGVIAHQINGEERPVCFTSFSLNNAQKSY FT PILHLEALAVVSTIKKFHKYLFGKKFTVFTDHKPLIGIFGREGRNSLFVTR FT LQRYVLELSIYEFDIVYRPSAKMGNADFCSRFPLPEEVPAALQREYIKCLN FT ITNEFPLDFALIARETKKDKFLQQVVYFMKNGWPDKLDRNFRDVYAQHQDL FT EEVEGCLLYQDRVCIPVCLQKRILSLLHKNHSGITKIKQLARRTVYWFGMN FT HDIECFVKSCNICCEMNAVSKPVPHSPWIPTKKPFARIHADFFYFDKKVFL FT VIVDSFSKWLEVYYMKFGTDARNVKSKFMSFFCQFWSAGCCCYRWRPTVSF FT S" XX SQ Sequence 4778 BP; 1321 A; 804 C; 1163 G; 1487 T; 3 other; ttaaagtggc gacgacggag aagtactagt aaaggaagga ttcttttctc ttgcggcggt 60 ggttttacgc tgattttctg caccggctgc tgtggtcttc ggtcattctt gacaaggtca 120 tctggtttgt agcagattaa caagtaagtt tcacgttagc agtaagaagt tcgtttgttt 180 ggacgctgaa aatawttatt cgagacacgt ttggtagtct ccaaattagc aacttgggca 240 ccattttgtt ttgccattgt gtgttagaga ataaaagagt tttttgttgt tttgacgcca 300 ttgtttagca cacgcggtag tcatcagctt tttgacgcga cacttgagtg tctgacggtg 360 tttaataatt caaagttgtg tggcatcgca tcgggcttaa caatttgaca gtttggttgc 420 tcctgttgca tagtttgcca agttttaatc ggctgtgttt tattctccaa aatgcccaac 480 agcatgacgc tggtcaacca aatcgaacaa ttcattcccg ggaccatccc gtttgcgcaa 540 tacctcgagc agttggagtg ggttttcgct caccaccagg tggagcagga tgaggataag 600 aagaaggcgt ttttagctgc gagtagcagg gaggtataca ccgagctgaa aaagctattc 660 cccgggaaag acctcaaaaa ggaaactttc gtgtcgatta ctactgcttt gcaaaaaaga 720 tacgataaaa ccgatagtga catgattcag cgatttaaat tttatcagcg caaacagcgg 780 gaaaacgaga gtgctgaaga tttcattcta aacgtaaaac agcaggcgga gctttgtgac 840 ttcgggcagt tcaaggatca agccattcgg gatcgtcttg tttgtggttt gaacgatgaa 900 gttcttaggc agcggctctt tgacgaggaa gatcttacgt taacgaaagt tgaaaaattg 960 atcgtaaacc gggagcttgc tacagtgcgg gccaagctag tgtcggacaa tgatgtttat 1020 agaggcaatg tgcttaatcg tgttggtgac cgtaacagga acaacagagc gaacggtgcg 1080 cccagttttc gtggtcgatc gcgcagcagg ggacgtccgg gaggtaggag agagagaagt 1140 cgttcgttct ctgcgggtcg caggcatcgt tcaccggcgg cgactaatcg taaatttttc 1200 tgtactcact gtcgtcggcc tggtcacacg cgtagttatt gttttgactt accggaaaat 1260 aaaaaagctg tgaaattttt ggacgaacca gcttctactt ccaaaaatgt gaatcgtcgg 1320 ctggttaact atcgtgaaga cgagagtgat tatgacgaag acatgaactg tcttatgatc 1380 tcgtctatca agaagttgag cgagccttgc ttgcgccaag tttacgtcga cggaaaacag 1440 ctggaaatgg aggtggattg cggagcagca gtttctgtga ttaattcggt tgtttacgag 1500 aagaagttta agcatatccc tttgatgaag tgcgaccgga agctagccgt aatcaacgga 1560 agtcggctga acgttgaggg gcagatttcc gtcgaggttg ttttcaacgg catcaggagt 1620 aacgtcaagc tgatcgtgct tcgttgtagc agtgtttttg cacctcttct cgggcgagac 1680 tggctggatg tgtttactcc gcgctggaga gatgcttttg gaatcagcgc tgggatcaag 1740 cagctggacg tttttcaaga tcatatggtc gagaagttta agcgtaagtt tcctaatatt 1800 tttgataaat ctctggcatc accaattcaa gggtttgaag cagaacttgt tttaaaatgt 1860 gacactccaa tttttaagcg tgcttatgat gtgccgttaa ggttgaaaga caaggtgatt 1920 gaacatttga aaagtttaga gcgtgatggt gtgattacac cagtagatgc aagcgagtgg 1980 gcatcaccag tcattgttgt gattaaaaaa gacggcggta ttagaatggt cattgactgt 2040 aaagtttcta tcaataaagt gattatacca aatacttatc ctttgccttt agtgcaggat 2100 atttttgcgt ctttggctgg tgctaaagtt ttttgctcac ttgaccttgc tggtgcttac 2160 acgcaattat tgttgtcaaa gaaatcacgt gaaataatgg taattaacac aattattggt 2220 ttgtttgttt ataatagact accacagggt gcgtcttcca gtgctgctat ttttcagcgc 2280 attatggaga aagttttgtt taacatagat ggggtctatt gttatttgga cgatgttttg 2340 attgcaggca aggacaaaga ggactgtgtg tgtaagcttg aattagtgtt ggagcgtctc 2400 tctaatgcta atatcaaggt caatctgcag aaatgcaaat ggtttgtgaa tagtttgcca 2460 tttttgggcc atgttttgac cgacaatggt ttaatgccta gcccagatag agttgaaact 2520 attcggaggg ctaaaattcc gaataatgtt tctgaactta aggcattttt gggtttgatt 2580 aattactatg gtaaattcat tcccaatttg tccacacgcc tcagttgttt gtatcgttta 2640 cttaaaaagg aagtgcgttt tgtttggagc tctgagtgta atcgagcttt tgagcactgc 2700 aagcaatctt tgttaaactc aaaattgttg gaattttttg acccgagtaa gcctgtgata 2760 gttgtgactg atgcttgtag ttatggtctg ggtggagtta ttgcacatca gataaatggt 2820 gaagagagac cagtttgttt tacatctttc agtttaaata atgcacaaaa atcttaccca 2880 attctgcacc ttgaagcatt agctgttgtt agtaccatta agaagtttca caagtatttg 2940 tttggcaaga agtttacagt ttttactgat cataagccgt taattggaat ttttggcagg 3000 gaggggagaa actcgttgtt tgtgacacgg ttacaaagat atgttttgga gttgagtatt 3060 tatgagtttg atattgttta cagaccctct gccaagatgg ggaacgcaga cttttgctcg 3120 cgttttcctc tacctgaaga agttccagct gcgcttcagc gagagtatat taaatgttta 3180 aatattacca acgagtttcc tttggatttt gcccttattg caagggaaac caaaaaggat 3240 aaatttttac aacaggttgt ttattttatg aaaaatggtt ggccagataa attggaccgg 3300 aactttcggg atgtttacgc tcaacatcaa gacttggaag aagttgaagg ttgtttactt 3360 tatcaagatc gtgtttgtat tccagtttgt ttacaaaagc ggattttaag tttgttgcat 3420 aaaaatcatt caggtattac aaaaattaag caactggcaa gaagaactgt ttactggttt 3480 ggcatgaatc atgacattga atgttttgtt aaatcatgta atatttgttg cgaaatgaac 3540 gctgtttcaa aaccagtccc tcactcacct tggattccga ctaaaaagcc ttttgctcgt 3600 attcatgcag atttctttta ttttgataag aaagttttcc tggttattgt cgatagtttt 3660 tcaaaatggt tggaagttta ttacatgaag tttggaacag atgccaggaa tgtaaaatca 3720 aaatttatga gttttttttg ccagttttgg tctgccggat gttgttgtta ccgatggagg 3780 cccaccgttt cattctcgtg agtttgttga ttttttggag agaaacaaca ttagggtgat 3840 gaagagccca ccgtataatc catccagcaa tgggcaagct gagcgaatgg ttagagtcgt 3900 caaggaaggg ttgaaaaagt ttatgttgga tcccgagatg aagggaatga acatagagga 3960 cgtagtttcg tttttcttgt ttaattatag aaatacctgt ttggataatg gttcgtttcc 4020 gtccgagcgt ttgtttaatt ttaaacccaa aacattatta gatttgttga atccacggag 4080 cagttttaag aaaaatttga cgactaacga gaaagctgaa agtgtacctg ttgctccagt 4140 gaaaagagat gagattgaca atcttcggcc gggagatcta gtttatttta aaaatttcaa 4200 atcaacakaa attcgtcgtt ggttggaagc aaaatttctt agacggattt cttctwatac 4260 atttcaggtg tccgttgggg gccgcgttta cctggcccac agaaatcaga ttaaggtagc 4320 gcgtggtggc tacacccggg gcggtctgct ggtggcgtcg aggtacgaga gaatgcagaa 4380 tcgaaagaga agccgcgaag aagaagaaga cgatgccgac gatgaatttt acgggttcgt 4440 ctccgattct ttcgtcttcc acgagccgat ggacgtcgac caagaaggtt ctcgcgagtt 4500 cgatccccag gagggttcat caagaagtat aaggtgccct gagaatttgc gttcagaagg 4560 tcaacaagat cattcggaat tcaatccgcg tagatcgagc cgattgaaaa aggctaggaa 4620 agatagtcgt tttgtttatt tctaaaaaaa aagcaatgtt gaatttcttg tacaaataat 4680 tgattgcatt tttaattggt gtctttcatt cagaattaaa tatcagtgaa attttgaaat 4740 attaattgta ataaactata actcaaaaag ggaagaac 4778 // ID CR1-73_HM repbase; DNA; INV; 4599 BP. XX AC . XX DT 29-DEC-2008 (Rel. 13.12, Created) DT 29-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-73_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4599 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1900-1900 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 77..826 FT /product="CR1-73_HM_1p" FT /translation="MSVTLVEVKKMLKEMFLEYRKEMETMLKQQQQIFIEI FT LSANTKIINERLDKVEENSKESANKIKIIVKDIEDIKLSLNFHDEVFDKKI FT ATIHKQNNDRVEKENITTQNINEKLRKFEDRSRRNNLRIDGLPESVKETWN FT ETEDKVKLFLVNKLGLNGIEVERAHRTGARKEGSSRTVIMKLLRYNDKTKI FT LKESYRLKGTNIYLNEDFSHETVNIRKRLLAEVKERRDNGENVGLRYDKII FT KLNNIKKK*" FT CDS 915..3971 FT /product="CR1-73_HM_2p" FT /translation="MTMTETKYFEIFDKENTLLLNNNNDPDKNFFENDSFK FT SNYYSIEELSCFINASQNTFSALTLNIRSMSKNFENFKEMLKDISHEFTVI FT CLLETWCKDENTINSNFQLKGYKALHQTRSRGTGGGICFYIHNTIQFKKVE FT SLSLLNANCELFTVELTNQGKANAYITGLYRPPDGNIKSFKKHIIQYLELA FT KNKHLYIMGDTNLDLLKIKTNNNVKRFLNNLLQHGVEPLINKPTRVTLRTA FT TLIDNIFTNTTIKCNIQSGIIKSDISDHFPIFLLLNNLNNKKENKTIITTK FT RQINEKSITKFRSNLSMENWDQVVNCKDTNIAYDIFKHTFTTLYEKAFPKL FT EQLTKLKNLKSPWMTKGLIKSSKKKKKLYEKFLKKKTFKNEKIYKQYKNLF FT EKLKKHSKKNYYCNLLENYCGNTKKTWGVIKEVTGKKKDATKVFPQYLKTD FT ENDTVANKAIDIAEKFNNYFIKIGEKLANKITRGKKCYKLYLEKKASVMNE FT LEITPDELKNAFNTLQENKSPGLDDISVNVVKNVYDIIEYPLLHICNLSLK FT NGVFPDQLKLAKVVPIFKSGDDSIASNYRPISILPCISKILERIMHNRLNN FT YLVNHHLLNNNQYGFQKHHSTEHAVAKLVNEILNGFDNNKYTLGVFLDLSK FT AFDTVDHKILLYKLYNYGIRNNNLKWFRSYLNNRKQALSYNNTVTRLEDIT FT CGVPQGSILGPLLFLVYINDIYLSSNILNFVLFADDTQVFFSHTNISLLFN FT TVNQELENLNEWFKANKLSLNVEKSKYILFTKLSKSETLPLKLPDLLIDKT FT KLKREYSIKFLGVILNEHLSWSNHIKVIENKISKIIGIMYKTKHLLNTNCL FT KNIYFAFIHSYISYCNIIWASTYPSKLNMIYKIQKRAIRLILNANKYASAK FT PLLREIGVPNVYEINILNILIFMFKFKNGMLPSIFQTYFFSLNHKYETKYS FT TNNFIIPKTLSKQADFSISCRGPRLWNLLLTNNIKTLNSTQHFKRATKQHL FT FGFDTKQLLSYY*" XX SQ Sequence 4599 BP; 1887 A; 678 C; 631 G; 1403 T; 0 other; cgctttttaa cagctcttta taaattatta aaataaaaga taaacttatt attattaaaa 60 aaaagataaa tctaaaatgt cagttactct tgtagaagtt aaaaaaatgc tgaaggaaat 120 gtttttggaa tatagaaaag aaatggaaac aatgcttaag caacaacaac aaatatttat 180 agaaatctta agcgcaaata caaaaataat taatgagaga cttgacaaag ttgaagaaaa 240 ctctaaagaa agtgcaaaca aaataaagat aatagtaaaa gatatcgaag acataaaatt 300 gagcttaaat tttcatgatg aagtttttga caaaaaaatt gcaactatcc acaaacaaaa 360 caatgatcga gtggaaaagg aaaacattac aacgcaaaat ataaatgaaa agctaagaaa 420 gtttgaagat agatcgagaa gaaataattt aagaatagac gggttaccag aaagtgtgaa 480 agaaacatgg aatgaaactg aggataaagt taaactattt ttagtcaata agttgggact 540 aaatggaata gaagtcgaac gagctcatcg aaccggagcc agaaaagaag gaagctccag 600 aacagttata atgaagttgt taagatacaa tgataaaaca aaaattctaa aagagtcata 660 tagactaaaa ggtacaaaca tttacttaaa cgaagatttt tcccatgaga ctgtcaatat 720 taggaaaaga ctcctggctg aagtaaaaga acgacgcgat aatggtgaaa atgttggttt 780 aaggtacgat aaaattatta aattaaataa tatcaaaaaa aaataactct tctttcttat 840 atttatattt tacacgtgta tgaatatatt tatatatctt atatttaatt ttttaagcag 900 ttaaattttt aattatgaca atgacggaaa caaaatattt tgaaattttt gataaagaaa 960 acactcttct tctgaacaat aacaatgacc ctgacaaaaa tttttttgaa aacgactcat 1020 ttaagtctaa ttactatagt atcgaagaat tatcttgttt tataaatgcg tcacaaaata 1080 ctttttcagc gctgacatta aacatacgca gcatgagcaa aaactttgaa aactttaaag 1140 aaatgctaaa agacataagt cacgaattta cggtaatatg ccttttagaa acatggtgca 1200 aagacgaaaa taccataaac tcaaattttc aattaaaagg ctataaagct ctccatcaaa 1260 ctcgtagtag aggtactggc ggaggaatat gtttctacat acataataca atacagttta 1320 aaaaagtaga aagcctaagc ttactgaacg caaactgtga acttttcacg gttgaattaa 1380 caaaccaagg taaagcaaat gcgtatatta ctggcttata tagaccacca gatggcaata 1440 taaaatcctt taaaaaacat ataattcagt acctagaact ggctaaaaat aagcacttat 1500 acataatggg agatactaac ttggatcttt taaaaataaa aactaataac aatgtcaaac 1560 gctttttaaa taatctctta cagcacggcg tagaaccctt aataaataag ccaactcgag 1620 taactctccg aacagctacc ctaatcgata acatatttac aaatactacc ataaaatgta 1680 atattcaaag cggaataata aaatctgaca taagtgacca ttttccgata tttcttctgc 1740 ttaacaatct caacaataaa aaagaaaaca aaacaataat tacaacaaaa aggcaaataa 1800 atgaaaaaag tataacaaag ttccgaagca acttatcaat ggaaaactgg gatcaggttg 1860 ttaattgcaa agacaccaac atagcttacg atatttttaa acacactttt acaactttgt 1920 acgaaaaagc gtttcccaag cttgagcaat taacaaaact caaaaacctt aaaagtcctt 1980 ggatgacaaa agggcttatt aaatcttcaa aaaaaaaaaa aaaactttat gaaaaatttt 2040 taaaaaaaaa gacttttaaa aacgaaaaaa tatataaaca atacaaaaat ttatttgaaa 2100 aattaaaaaa acactcaaaa aaaaactatt actgcaattt attagaaaat tattgcggta 2160 acactaaaaa aacatggggc gtaataaaag aggtaacagg aaaaaaaaaa gatgctacca 2220 aagtgttccc gcaatactta aaaacagatg aaaatgatac agtagccaat aaggcaatag 2280 atattgcaga gaaatttaat aactatttta taaaaattgg ggaaaaacta gctaataaaa 2340 taactcgtgg aaaaaaatgc tataagctgt atctagaaaa aaaagcctca gtcatgaacg 2400 agcttgaaat aactccggat gagcttaaaa atgcatttaa taccttacaa gaaaacaaaa 2460 gtccaggact agatgacatt agcgtgaacg ttgtaaaaaa tgtctacgat attattgaat 2520 atccccttct gcatatctgt aatctgtctt taaaaaatgg ggtgttccct gaccagctaa 2580 agctagcaaa agttgttcct atatttaaaa gcggagacga ttcgattgct tctaactaca 2640 ggcctatttc tattttgcca tgcatttcaa aaatattaga gcgcataatg cacaataggc 2700 taaacaatta tctagtaaac caccacctac taaataataa ccaatatggc tttcaaaaac 2760 accattcaac agaacacgct gtagccaaac ttgtcaatga aattttaaat gggtttgaca 2820 acaataaata cactcttggg gtattcctcg atctctcaaa agcgtttgac actgttgacc 2880 acaaaattct tctttataaa ctatataatt atggaataag aaataacaat ttaaaatggt 2940 ttcgctctta ccttaataat cggaaacaag ctttatcata taacaacact gtaacacgac 3000 ttgaagacat tacttgtggg gttccacaag gatcaattct tggtccatta ttattcctcg 3060 tttatattaa cgatatctac ttatcttcta acatacttaa ctttgtcctt tttgccgatg 3120 atacgcaagt gttctttagc cacacaaaca ttagcctatt atttaacact gttaatcaag 3180 aacttgaaaa cctgaacgaa tggtttaagg ctaacaaact ctccttaaat gtcgaaaaaa 3240 gtaaatatat tctttttaca aaattatcaa aatctgaaac cctcccctta aaactacctg 3300 atttactaat tgataaaact aaacttaaac gagaatactc aatcaaattt ttaggcgtaa 3360 ttcttaatga acatcttagc tggagtaacc atatcaaagt aatagaaaat aagatatcaa 3420 aaattattgg tattatgtat aaaacaaaac atctcctaaa cacaaattgt ttaaaaaata 3480 tttattttgc atttattcat agctacatca gctactgcaa cattatctgg gcaagtactt 3540 atccttcaaa gttaaatatg atctacaaaa tacaaaaacg agcaattcgc ttgatattaa 3600 atgcaaataa atacgctagt gccaaaccac tactaagaga aattggagtg cctaatgtct 3660 atgaaataaa tattcttaat atacttattt tcatgtttaa gtttaaaaat ggtatgctac 3720 caagcatttt ccaaacttat tttttttctt taaaccacaa atacgaaact aaatactcaa 3780 caaataattt tattatacca aaaacattat caaaacaagc tgatttttcg atatcttgcc 3840 gagggccgcg gttatggaac ttgcttttaa caaacaatat aaaaacttta aattcaactc 3900 aacattttaa acgtgcaaca aaacaacact tatttggttt tgacacaaag caattactat 3960 cgtattatta aaaacttttg gtaatcttag ttatttaatg tcttataaat tgactcaaca 4020 tcacaattat attgtatggt acttagaaca tgtatgatat tattttattt tacaataata 4080 acgtaatttg taaatgcaat atttattata tcttacgcaa tttgtagatg gaatgtaata 4140 cagatttatt tgatttgtat gtatatatga acggacatat atttgtgtgt acgtgtgcgt 4200 atgtgtgtgt gtatgtgtgt gtgtatgtgt gcgtatgtgt gtgtgcgtgc gtgtataata 4260 aaaataaaag tcgaataaaa tagagaatta aaaaaaaata ataataataa tgagaagaaa 4320 taattaatat attttttatt ttttttattt ttttctcttt ctgtaaataa taatttgatt 4380 tttaagttct tttcttctta aaatcagttc ttgctttagt ttcttttttc ctttttcatt 4440 tttatcggct gaatttttta gttatattaa cggggctgga tgataagatc atgtcttctg 4500 ccagctccgg tcgaaacttt aatgttataa tttttatgtt gtaaaaaaac attgtaaaag 4560 attattcatt taatgacgaa ataaataaat aaataaaaa 4599 // ID TTAA3C_AP repbase; DNA; INV; 425 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA3C_AP. XX NM TTAA3C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-425 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1782-1782 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 425 BP; 150 A; 67 C; 62 G; 146 T; 0 other; gaggacgcca cacccgcatt tgttgtctcc gtcttacaaa cgtacaacat ggcaaatttt 60 acgttcacaa taatacattt tactctgtaa tttttaaaat tagagtgaat tgacctctta 120 taaaatttaa aggtaagatt attatctagg gcatctcgga ggcttttatt gatattttta 180 ttaagaagca agttatgacc atttttaaac tgtacatttt aaaataatca taacttactt 240 aaaaataata atatcaataa aatcctccga gatgccctag ataataatct tacctttaaa 300 ttttataaga ggtcaattca ctctaatttt aaaaataaca gagtaaaatg tattattgtg 360 aacgtaaaat ttgctatgtt gtacgtttgt aagacggaga caacaaatgc cggtgtggcg 420 tcctc 425 // ID Gypsy-4_IS-I repbase; DNA; INV; 4085 BP. XX AC ABJB010633793; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_IS_; KW Gypsy-4_IS-LTR; Gypsy-4_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4085 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010633793; Positions 4212 8296. XX CC Positions [3171-3731] - Integrase core CC 'AGTAG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 3051..4043 FT /product="Gypsy-4_IS-I_3p" FT /translation="MVVMKGMARSLVWWPNMDAEVERCSRQWSQCLQNSPM FT PPKAEPVPWPEPKECWERLHLDYAGPFEGKMLVVLVDAKSKWLEVIIVSSA FT TSEQTVEHLRDIFARFGLPRCIVTDNGTSFTGQPFQDFVQGNGIKHLRTAP FT FHPASNGLAERAVRTVKDGLKKTEGRDLKLRLAQWLLMYRRAPRPSGTSPA FT KQMLAYPMKARLDICIPRRGEEAILEQLPLESRQRAEQEEKKKGLRHKRGN FT KVAVRNFGKGPRWWLGEVENTSGTSIVTVSAPQGKVRRHNDQVKPHLTTLA FT SSPGEQLPPNKDQGYATSKARTPGMRRPTRTVRKPLRFT" FT CDS join(124..1005,1009..2106) FT /product="Gypsy-4_IS-I_1p" FT /translation="MTTFGKLEEYDNKEPWLSYTERVDAFFNANGIEDDEK FT KKWIFLSTVGASTYATLRSLLASAKPKEKKYKDLMTTLNSHFSPPPSEIAE FT SFRFNTRVQLENESVATFVAELRQMAEHCNFGAALNRMLRDRLVCGIRDKT FT VQQGLLVEKNLTLEKAVEISRSAEAAERNATEIQRGHVDATTSGNQEGTTN FT HFKFHRKKNGQQGPKGQKWKDPKATAKKKPGPQDKASACIRCGSRDHKPPE FT CPHINTKCYNCEKVRHLASCCLTKTDKPKKQSTKQVNVLESVGGPSEYYLH FT ALANSTMKPITVTFMVNGLPIEMELDSGSPVSLNSEDTYLIHQRSLPRPAE FT TDIKLKCIRGPIPLQGTLIVNVSLGETKSQQTLLVVSYKSPNLCGRNWMAA FT FDLLPRQVNSTQLGETATDRGLLETLQELEEVFKPGCGAFKGPPVHVDVLP FT DAQPRYYRARSVPHALRDKVELELQRLEEEGIITAIDHSDWASPIVPVLKA FT DGKSVKISGDYKVGVNPAMMTTQYPLPKVEDIFASLQGGVKFSKLDFREAY FT NQVLVDKETSKLLVINTHRGLFAYNRLAFGVSSAPALFQRRMEETLQGIPG FT TSVYLDDVLVTGKTDAEHLENLRKVLQKVKDSGLKLKREKCEFFKESLCYL FT GHEINAEGLKP" XX SQ Sequence 4085 BP; 1213 A; 1044 C; 1034 G; 794 T; 0 other; actggcgacg agaaacggat cgacgacaag ccggagaagc cgccaagcag cagcaccacg 60 ggatgacccg gtaggaattc tacactgcgt gaacttccgc gcgtgaattc cgaaactata 120 gaaatgacga cttttggcaa gctagaagaa tacgacaata aggagccctg gttatcctac 180 acggaaagag ttgacgcgtt cttcaatgca aacggcatcg aagacgacga gaagaagaaa 240 tggatcttct tatcaaccgt cggggccagc acttacgcga cactccggag tctgctagct 300 tctgcgaagc caaaagaaaa gaagtacaag gacctcatga ctaccttgaa cagtcacttc 360 agccctccac cgtccgaaat cgccgagagt tttcgcttca acacacgagt acagctcgag 420 aacgaaagcg tcgctacgtt cgtcgccgaa cttcggcaga tggcagaaca ctgcaacttc 480 ggagcagctc tcaaccgcat gctgagggac cgcctcgtgt gtgggatacg ggacaaaact 540 gtgcagcaag gactgttggt agagaaaaac ctaaccctgg agaaggctgt cgaaatctcg 600 aggtcggcgg aagcagcgga acgcaacgcg accgagatac aaagaggcca cgtggacgcg 660 accacttcgg gaaaccaaga aggtaccact aatcatttca agttccacag aaagaaaaat 720 ggtcaacagg ggcctaaagg tcaaaaatgg aaggatccaa aggccacagc aaaaaagaag 780 ccaggaccac aagataaagc aagtgcctgc attcgttgtg gttcccggga tcacaagccc 840 ccggagtgtc cccacatcaa cacaaagtgc tacaactgcg aaaaagttag acacctggca 900 agttgctgtt tgacaaagac tgataaacca aagaagcagt ctacaaaaca agtgaacgtg 960 ctggaatcag ttggtggacc gtccgagtac tacttgcatg cattgtaagc aaactcaact 1020 atgaaaccaa tcacggtaac attcatggta aatggacttc caattgagat ggagctagat 1080 tcaggatccc ccgtctcgct gaactccgag gatacatacc tgatccatca gagaagtttg 1140 cctagaccag ctgaaacaga catcaagctc aaatgcatcc gtggacccat tcccctacaa 1200 ggaacattga ttgtaaacgt aagtctcggc gagaccaaaa gtcaacagac acttctggtt 1260 gtttcttaca agagccctaa cctttgtggc agaaactgga tggcagcatt tgacctgctt 1320 ccacggcaag taaactccac tcaactggga gaaactgcaa ccgaccgagg gctactggaa 1380 acactccagg aattggaaga ggtcttcaaa ccaggttgcg gtgcattcaa aggaccaccg 1440 gtgcatgtgg acgttctacc cgacgcccaa ccacggtact acagagcccg ctcagtacca 1500 catgcactga gggacaaggt ggaactggaa ctccaacgac ttgaagaaga aggcatcatc 1560 acggccatag accactctga ttgggcatcc cccatcgtac cagtcctaaa agccgatggc 1620 aaaagtgtca agatctccgg ggactacaag gtgggtgtga atcctgcaat gatgacaacc 1680 cagtacccac tgcccaaagt ggaagacatc tttgcttctc tgcaaggagg agtcaagttt 1740 tcgaaactag acttccgtga agcatacaac caagtactcg ttgataaaga aaccagcaag 1800 ctcctggtca tcaacacaca ccgaggactg tttgcttaca accgtttagc ttttggcgtg 1860 tcctcggctc cggcactgtt ccaaagaaga atggaagaga cccttcaggg aattccagga 1920 acatctgttt acctggatga tgttctggtg actggaaaga ccgacgcaga acatctagag 1980 aacctaagga aagttctcca aaaagtcaaa gactctggac tgaagctcaa gcgagagaaa 2040 tgcgagttct tcaaagaatc actctgctat ctgggccatg agatcaatgc agaggggctg 2100 aagccttaaa agaaaaacgc agaagctatc cttgaagcac ctgaacccag ggatgtcagt 2160 gaactgagat cctatattgg acttctttcg tactacggca agtttattcc gaatctgtcc 2220 actgtccttg cacccttgta tgccctactg cacaagaaca cccgctggca atggacagac 2280 actgaacgaa aagcctttgt cgaaagcaag aatgccatca tggaggcaag ggttctgacc 2340 cactacgacc catcgaaaga gcttgttctg gcatgcgacg cctcaccgta tggtgtgggt 2400 gctgtgctct cgcatcggaa ggatggcgtg gagtcgcccc ttgcattcgc atcccggact 2460 ctgacaccgg ccgagaggaa ttactcacag ttggaaaagg aagcactggc cattattttt 2520 ggtgtcacac ggttcagaga ctacctgccg tgtcgaagtt ttgtgctcat cactgaccac 2580 aagccacttg tggggatttt tcgggaagac aaggcaattc cagccatgac agcttcgcgc 2640 atccaacgct gggctcttac tcttggagcc tacaggtaca gcatcgagca ccgaccagga 2700 cgtctcaacg gaaacgccga tgccatgagc aggctgccct tgaagacggc acacaccgat 2760 cctccggaac ctccggaact ggtcaactca atctcgagct tggagaagct ggagatttca 2820 gtcaagcaac tgcaacaatt cacaagagtg acaaggactt aagccaagtg ctacagtggg 2880 tggaggaagg atggccgcaa aagcctccgg acaagtcact acaaccattt tggaacagac 2940 gcgatgagct aagcgtgtat cggaatctcc tgtactgggg aaaccgagtg gttgtgccaa 3000 cccctgctcg ccaacacatc ttagacctgc tgcatgaaac tcaccagggg atggtggtca 3060 tgaaaggcat ggcgagatct cttgtgtggt ggccgaacat ggacgcagaa gtagagcgct 3120 gttcacgaca atggtcacag tgcctccaga actccccaat gccaccaaaa gcagaaccgg 3180 taccctggcc cgagccaaag gaatgctggg agcgtctgca tctagactat gcaggaccgt 3240 tcgaagggaa gatgctagtg gtgttggtag atgcgaagtc aaaatggctt gaagtcatca 3300 ttgtgtcaag tgccacatcc gagcaaaccg tagaacacct acgtgacatt tttgcaaggt 3360 tcggactacc aaggtgcatt gtaacagaca atggaacttc attcactggt cagcccttcc 3420 aagacttcgt tcaaggaaat gggatcaagc atctccgaac tgcccctttt catccagcat 3480 ctaatggctt agcagaacga gctgttcgca cggtcaagga tggactcaag aaaacggagg 3540 gaagagacct caagctccga cttgctcaat ggttgctcat gtatcgccga gcaccccgtc 3600 cgagcggaac gtctccagca aaacagatgc tggcttatcc aatgaaagca agactggaca 3660 tctgtattcc caggaggggt gaagaggcaa ttctggagca gttgccgtta gaatctaggc 3720 aacgggccga gcaagaagaa aagaaaaagg ggctacgcca caagagaggg aacaaggtag 3780 cagttcgcaa tttcggtaaa ggacctcgat ggtggttggg cgaagttgaa aacaccagtg 3840 gaacatctat tgttactgtg tcagctccgc aaggaaaagt gcgccgccac aacgaccaag 3900 taaagccgca cttgaccact ttagcatcaa gtcctggaga acaactacca cctaacaagg 3960 accaaggcta cgcaacgagc aaagcaagga cgcccggtat gagacgaccg acacgaactg 4020 tccgcaagcc cctgagattt acataacctt cgataagaag gcagtcttca ttaagaaggg 4080 aggag 4085 // ID Rehavkus-1_DY repbase; DNA; INV; 5919 BP. XX AC AAEU01000045; XX DT 30-APR-2006 (Rel. 11.04, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed copy of the Rehavkus-1_DY DNA DE transposon - a fossilized copy. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus-1_DY; Rehavkus group. XX NM Rehavkus-1_DY. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5919 RA Kapitonov V.V., Gentles A.J. and Jurka J.; RT "Rehavkus-1_DY, a family of Rehavkus DNA transposons from the RT fruit fly Drosophila yakuba genome."; RL Repbase Reports 6(4), 192-192 (2006). XX DR EMBL/GenBank/DDBJ; AAEU01000045; Positions 10882 4964. XX CC Rehavkus-1_DY belongs to the Rehavkus group of the MuDR CC superfamily of "cut and paste" DNA transposons. Transposons from CC this group are widespread in different metazoa, including CC insects, sea squirts, sea urchin and fish. Its 1217-bp inverted CC termini are composed of a 290-bp terminal inverted repeat and CC 133-bp subterminal minisatellite-like unit. Similar structure of CC termini characterizes all Rehavkus transposons identified so far CC in different species. The transposon is flanked by a 9-bp target CC site duplication and encodes the 969-aa Rehavkus-1_DY CC transposase. The ~1050-bp termini of Rehavkus-1_DY are 79% CC identical with termini of the FB4/FB transposon identified CC previously in the D. melanogaster genome. The TPase-encoding CC region is 86% identical to a portion of the FB/NOF transposon CC from D. melanogaster that codes for a hypothetical protein. Due CC to detection of this protein conserved in diverse Rehavkus CC transposons that were not significantly similar to any other CC transposases, these transposons were reprted in 2006 as a new CC superfamily of "cut and paste" transposases [1]. However, in a CC few years, as a result of massive increase of the number of CC diverse proteins in GenBank, Rehavkus transposase sequences have CC converged (by PSI-BLAST) with MuDR transposases in the same set CC of similar sequences. XX FH Key Location/Qualifiers FT CDS 1902..4808 FT /product="Rehavkus-1_DYp" FT /translation="MPTKSHVDVPALVEAFCNGDIFTETGTLKPRSDNVWM FT DISNKFQGTISAKTLNLYARINRNGIITMVKERCGIKQLDITGNNSTLNST FT SPDDDPEFQIIEAVKNGPLPILYINLELDQELWTSIAPKMDQKTDLRSKSY FT LQRNWTDVIAKLMYEKVPLPCAFNFKKAKLSDEVDNIWLRIEGYCNECSSI FT LKGHCLVKPDEHCGISISVSVPDTRGIPHNSKRRCTGSRRLEIGNELLLKK FT AALWRKEATDNMNDDDPEPSFIPKLPTLRKLREEATNRHLGITKDRDPVSS FT LYLKKYEGELAGCILDIGLDEFFCIYCTGAQIKTYASRIKTIRKISVDATG FT SVVLPIQKPNGDSSYVFLYQIVMEGDDSIFPVFQMLSAKHDTASIQFWLSR FT FISKSGHFPLEVVSDFSLALLNGISLSFNECRIKTYIQKCFQSLLTDERTD FT LPPCFIRLDIAHLIKMICRKNVFKGKLPNIKDFYVRCIGLATTCETKDCFA FT ETIKSVLIVALSQSSGEDEKGHFLSSYRNERNLLTRIATFAAQDHDEKHEE FT NFIPEDQEEVDEDVSEFVLNIKNAAEEEALKCNSVRCRPNPYFLPDLMSPL FT LKLCKYFVLWTNVMKEKFCSKYDVGSSALVEAYFKDLKNTDMSIFHRPVRA FT DKFVVQHIRCIEAVCKLEQAAMKRKSVNIPTFIKEKTPKKLCREETKEFLE FT EILEENEVEYLLQEENWKVKNKTIKPTENVEEDNGNDYINKETELVKRPKE FT KPRGKYLKKCPNVELLYNRPLRRKRDEILNNGGSMGPLWIGKQLLQFKNTC FT PLDSLVEILSTAYIDNFYYKNQLDDFYADNLTIELVKKYAVEGVTSSLYRE FT RGLVLKSFFDEKHQTIHCHANIGAFIEKALNGVPSASSHRIHNKNKNDCEN FT KKKYILSGAIEYVPSPGGAIGHYIAYCRRIIGSWEVHNDMCRQWRKFTALN FT TKMTLHILIYTRKN" XX SQ Sequence 5919 BP; 2083 A; 956 C; 1016 G; 1864 T; 0 other; agctcaaaga agctgggtcc gccaaaatcg aatttttgaa atttgaaagg tggaatcgtt 60 tgaccatcgg ttgaccatgt ttgaccacca attacttttt tttgaccacg tccagttttg 120 aagatatgga atttcgaaaa ttttcgaaaa ttttcgaact tcaaaaagtt gacttttttg 180 aacttttttt tttttaaatc gcaataactt cgtttgacca cgtttgacca ccctttagaa 240 ttttgaaaaa acttttagtt tggaaaatat aagcacttaa gcaattaagc atttttcata 300 cctcaaaact aaatattttc aaacaaaatc gtttgaccat cctttaaaaa tgctttttat 360 cgtttgacca ccctttaaaa tttttttttt tcatttgctt acccttaaaa aaaaatattt 420 tcgtttgccc aactcttaaa actaaatatt ttcaaaaaaa aacttttgac catcctttaa 480 aattgcattc tatcgtttga ccacctttca aaaattattt ttttcgtttg cttaccctta 540 aaaaaaaata aaaacgattt cccacctctt aaaactaaat attttcaaaa aaaaactttt 600 gaccatcctt taaaattgca ttctatcgtt tgaccaccct tcaaaaatta tttttttcgt 660 ttgcttaccc ttaaaaaaaa taaaaacgat ttcccacctc ttaaaactaa atattttcaa 720 aaaaaacttt tgaccatcct ttaaaattgc attctatcgt ttgaccaccc ttcaaaaatt 780 atttttttcg tttgcttacc cttaaaaaaa aataaaaacg atttcccacc tcttaaaact 840 aaatattttc aaaaaaaaac ttttgaccat cctttaaaat tgcattctat cgtttgacca 900 cccttcaaaa attatttttt tcgtttgctt acccttaaaa aaaataaaaa cgatttccca 960 cctcttaaaa ctaaatattt tcaaaaaaaa acttttgacc atcctttaaa attgcattct 1020 atcgtttgac cacccttcaa aaattatttt tttcgtttgc ttacccttaa aaaaaatttt 1080 caaataaaat tcaaataaat cttacaattt tggcaaaata ctctctcttt cttttattgt 1140 ttcttaaaaa taattaattt tgttaacaac aattttttaa caataattat tttgtaaact 1200 ttatattata taaatttgca cctgtgacat agggttaatt ttttaggtaa attgtttttt 1260 tttacattac aggtacttta aagaatccac aattaaaaat ctctttcctc ctcgattctc 1320 ataatatgta tataaatcgt catccgaaac tcataattag aatatatttt ttgtgtgaag 1380 actgaaaaag ttgtgcgtga atttaacaac gttttgtgct gaaaaagtta aagtcgtact 1440 gtttttaaga aaagtgcttg cgttctaata tttgaaccct tctaaatttt gttgacagtc 1500 tttttaaaga cttgcaattt ttggaatatt gatttgtgtt ctcgtgaatt ttgttgacag 1560 tctttttaaa gacttgcaaa attaaaattg accagatttt acaatttagt aaagtttcat 1620 ttgtgtctat tcctttgttt gttggaaaag gaatttaagt gaagtgggta agcacaaaat 1680 tcaaaataaa tttgtgtact tgctttctat tctaaagatt ttctttaaat ctaaagatct 1740 gcattttcaa tattgattta tttttttaca aaccagagtc cttgcttttt ttgcgtaatc 1800 ccttaaacat ttgcagcgca attgtgaaaa gtgctaatta aacattagtg taaaacgaca 1860 atttttttaa ctgcataaaa ggaataaatt ttattaaaag gatgccgacc aaatcgcatg 1920 tggatgtccc cgccttagtg gaggcatttt gcaatggtga tatctttacg gagactggaa 1980 ctcttaagcc aagaagcgat aatgtttgga tggatattag taacaaattt caaggaacaa 2040 tcagcgcgaa gacgctaaat ttgtacgcca gaattaatcg gaatggcatt ataacgatgg 2100 tgaaagaacg atgtggcatt aagcagctgg atatcactgg caataatagc actttaaata 2160 gcacatctcc cgatgatgac ccagagttcc agatcattga agctgttaag aatggaccat 2220 tgcctatttt atatattaac ctggaattag accaggaatt gtggacatcg attgccccta 2280 aaatggacca aaaaacggat ttgagatcaa aaagctatct gcaacgtaac tggacggatg 2340 taatagccaa gctgatgtac gaaaaagttc ctcttccgtg tgcatttaac ttcaagaagg 2400 caaaactttc cgacgaagtg gataatatat ggcttagaat tgaaggctat tgcaatgagt 2460 gcagctcaat tttaaaagga cattgccttg tgaaacccga tgaacattgc ggcatatcga 2520 tatccgtttc ggtaccggac acacgaggta ttccccataa ttcaaaacga cggtgcactg 2580 gatccagaag acttgaaatt ggcaacgaat tgcttttaaa aaaagccgca ttgtggagga 2640 aggaagcaac cgacaatatg aatgatgacg acccagaacc aagtttcatc ccaaagttac 2700 caactcttcg aaaacttcgt gaagaggcaa ccaataggca cctcggaatc accaaggatc 2760 gtgatccagt ttcatcatta taccttaaaa agtatgaggg tgaattggcc ggatgcatcc 2820 ttgatattgg actggatgaa tttttttgca tatactgcac gggagcccaa ataaaaacat 2880 atgcatcaag gataaaaacc attagaaaaa tttctgtcga cgcaactgga agcgtcgtgt 2940 tacccattca aaaaccgaac ggcgactcaa gttatgtatt cttataccaa attgtgatgg 3000 agggcgacga cagcatattt ccagtttttc aaatgctgtc ggctaaacat gatacagcca 3060 gcatacagtt ttggctgagc cgatttattt cgaaatcagg acattttcca ctggaggttg 3120 tgtccgattt ttccttggct ttgctcaatg gaataagctt gagctttaac gagtgccgca 3180 ttaagacata tattcaaaaa tgttttcaaa gccttttgac ggatgaacga acggatctac 3240 caccctgctt tattcgactt gacatcgccc acctcataaa aatgatatgc cggaagaacg 3300 tttttaaagg aaagttgccg aacataaagg atttttacgt aagatgcatt ggtctagcaa 3360 caacgtgtga gacaaaggac tgttttgcgg aaacaataaa atcagtgctg attgtcgcac 3420 taagccaatc ctcaggagaa gacgaaaaag gacactttct ttcgagctac aggaacgaaa 3480 gaaatctgct caccagaata gctacatttg ctgcccagga tcacgatgag aaacatgaag 3540 aaaatttcat accagaggac caggaggaag ttgacgagga cgtttcggag tttgtcctca 3600 atataaaaaa cgctgccgaa gaagaagcgt taaaatgcaa ttctgtccgc tgtcggccaa 3660 atccatattt tctgcctgat ctgatgtcac cattacttaa gttgtgcaaa tattttgtac 3720 tatggaccaa cgtaatgaaa gaaaagttct gctcgaaata tgatgtcggc tcttcggcac 3780 ttgtggaagc ttattttaag gatttgaaaa acacagatat gagtatcttc caccgaccag 3840 tgcgagcgga taaattcgtt gtgcaacata ttcgctgcat cgaagctgtt tgcaagctgg 3900 aacaagccgc catgaaacga aagtccgtta atattcctac ctttataaaa gaaaaaacac 3960 ccaaaaaatt gtgccgtgag gaaaccaagg aattcctgga ggaaattctt gaagagaacg 4020 aagtggaata ccttctacaa gaagaaaact ggaaggtaaa aaataaaacg ataaagccca 4080 cggaaaatgt cgaagaagac aatggaaatg attatataaa caaggaaacg gaattagtga 4140 aacggcctaa agaaaaacca agaggaaaat atctcaaaaa atgtcctaac gtggaattat 4200 tgtacaatcg acctcttcga aggaaacggg acgaaatatt gaataatggt ggatcaatgg 4260 gacctctctg gattggcaaa caattattgc aatttaaaaa tacttgcccg cttgactccc 4320 tcgtggaaat attgtcgacc gcgtacatcg acaattttta ttacaaaaac caactggatg 4380 atttctacgc tgacaactta acgatagaat tggtaaaaaa gtatgccgtc gagggagtta 4440 cttctagttt gtaccgcgaa aggggtctgg tcctgaaaag tttttttgat gaaaaacacc 4500 agacaataca ttgtcacgcc aatattgggg cttttattga gaaagcccta aatggagtac 4560 ccagtgcgtc aagtcatagg atccataaca aaaacaaaaa tgattgcgaa aacaaaaaga 4620 aatatattct aagtggtgcc atagaatacg ttccttcgcc aggaggtgca atcggacact 4680 atattgcata ttgccgcaga ataattggat cttgggaagt acacaatgat atgtgcagac 4740 aatggagaaa gtttacagcg ttaaatacaa aaatgacact ccacattttg atatacacga 4800 ggaaaaatta atatattaat ataattaata aataaaattt atataatata aagtttacaa 4860 aataattatt gttaaaaaat tgttgttaac aaaattaatt atttttaaga aacaataaaa 4920 gaaagagaga gtattttgcc aaaattgtaa gatttatttg aattttattt gaaaattttt 4980 tttaagggta agcaaacgaa aaaaataatt tttgaagggt ggtcaaacga tagaatgcaa 5040 ttttaaagga tggtcaaaag tttttttttg aaaatattta gttttaagag gtgggaaatc 5100 gtttttattt ttttttaagg gtaagcaaac gaaaaaaata atttttgaag ggtggtcaaa 5160 cgatagaatg caattttaaa ggatggtcaa aagttttttt tgaaaatatt tagttttaag 5220 aggtgggaaa tcgtttttat tttttttaag ggtaagcaaa cgaaaaaaat aatttttgaa 5280 gggtggtcaa acgatagaat gcaattttaa aggatggtca aaagtttttt tttgaaaata 5340 tttagtttta agaggtggga aatcgttttt attttttttt aagggtaagc aaacgaaaaa 5400 aataattttt gaaaggtggt caaacgatag aatgcaattt taaaggatgg tcaaaagttt 5460 ttttttgaaa atatttagtt ttaagagttg ggcaaacgaa aatatttttt tttaagggta 5520 agcaaatgaa aaaaaaaaat tttaaagggt ggtcaaacga taaaaagcat ttttaaagga 5580 tggtcaaacg attttgtttg aaaatattta gttttgaggt atgaaaaatg cttaattgct 5640 taagtgctta tattttccaa actaaaagtt ttttcaaaat tctaaagggt ggtcaaacgt 5700 ggtcaaacga agttattgcg atttaaaaaa aaaaaagttc aaaaaagtca actttttgaa 5760 gttcgaaaat tttcgaaaat tttcgaaatt ccatatcttc aaaactggac gtggtcaaaa 5820 aaaagtaatt ggtggtcaaa catggtcaaa cgatggtcaa acgattccac ctttcaaatt 5880 tcaaaaattc gattttggcg gacccagctt ctttgagct 5919 // ID Gypsy-5_DPu-LTR repbase; DNA; INV; 148 BP. XX AC scaffold_64; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_DPu_; KW Gypsy-5_DPu-LTR; Gypsy-5_DPu-I. XX NM Gypsy-5_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-148 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 726-726 (2010). XX DR Genome; scaffold_64; Positions 343177 343030. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 148 BP; 38 A; 27 C; 45 G; 38 T; 0 other; tgttagatct tgatccatgt cttgtatgat gaccgctaga gcccgcccag gttgggaggt 60 aagggcttca gggttagtgt gtgtcgagtc gaggctgagt gaacagctta tggagagtga 120 accaagtaat acaccaggat acttaaca 148 // ID LT1_DPu-I repbase; DNA; INV; 990 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE Non-autonomous LTR retrotransposon from Daphnia (internal DE portion). XX KW LTR Retrotransposon; Transposable Element; nonautonomous; KW LT1_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-990 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC >94% identical to consensus. XX SQ Sequence 990 BP; 318 A; 160 C; 180 G; 332 T; 0 other; cttaatgact ggtgacatct tgcggtggcg actacaactt tttcacctta ggtttaaatt 60 ccactttcca ttcttaacgt gcttgtcatt tacttaacgt gttaagttta aaataacgtg 120 ttaagtttaa aattaattct ttcttgtcga atatttttta ttgaaatata ttcgttatgg 180 atggttaaac atgattaaaa accaccagat gtccatagca tcgccgtgat gacgcaatca 240 ttggtgtccc aaccatgacg caacaactca gatttgtatt taagttttgc agttatttga 300 ctaaaagatg gttattttta aattcaatta ttgaacttta tgtgtaaaaa taggaacata 360 tgagttgagc aaactaaacg taaataactt gcaaggatat tgaactgtga cagaaaaatc 420 ccgttttaag accaaaaagt ctgtgaagaa aagtgtgaat tgtgaataag tccgacttaa 480 caataggccc tgaatttgtc atttaagaca gttttcaact ggcttacgac tatcccttcg 540 cggcgggtaa gctggctagt tcgtcgagtt aataagtgtc agtagcgtaa ggttagatgg 600 ttattcgctt ttctgtaaaa taaatctgat caaataaaaa atttggccag tctaattgga 660 ggtttaacca gcgacccatt gttagcttgt catgtaataa acgtgttaag tttaaaatta 720 gcctttatcg aatattttta tttaaatata ttcgttggta ccgttacatg cacatgataa 780 tgccgtcacc aaagatgtcc atagcatcgc cgtgacgacg caatctttga tgcccaccca 840 tgacgcaata accagattgt tttaagtttt gcagtaatta cttgatgttt agattcatta 900 ttgaactttg ttgtaaaaat aggaaaatat gagttcaaac taaacgtaat aacttgcaag 960 gttattgaac tgcgaatgaa gcaagctaat 990 // ID hATw-1_HM repbase; DNA; INV; 8506 BP. XX AC . XX DT 12-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-8506 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 418-418 (2009). XX DR [1] (Consensus) XX CC The consensus was buit from several copies which are ~97 % CC identical to the consensus. It is characterized by 7-bp TSD and CC 11-bp imperfect TIRs. The hATw transposons belong to a distinct CC hAT group which encodes transposases which are distantly related CC to other hAT transposases. hATw elements identified so far CC produce 7-bp TSD. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1209..4631 FT /product="hATw-1_HM_1p" FT /translation="SLKLNLILILATKVLSEIATCKNILDFLDINDGIISL FT KYPDFVLTNSHFLQICKDKNKHNITLENLSLFFTTYCKYNISRVMLIRILK FT KLKGYIVKLNRAKNLDFKKHFLSSALSFNGAFSDVVGLLPSDACSTPTSIY FT NSEIVPNQVTVFVSSDLNLQITNDELSCPVTPTSIIYIPLENRSEKFLTET FT RFSEENLPVSNLDKKLPLFSGDLTYQIKRYKRRINHLKLQNRQVNRRCYEY FT KRKYQAILSKYNVRNVNKRDVRKDCKINQLLQINLRLEKEIDLARKRSQSD FT RQKAWYLKKKIENTILKTSDETNDSFLKESLNYFENLTEELKERVDIFEKN FT VIDVYEGGRYNDEIRSVYYDLLSKNVSVNNVESVIRTVLQKMAGISCGTLP FT KKSLASEFFSEMNLLAKAQVREAMLNSTNNVLHTDGTKYNFREVGSFQVTT FT SSGSYTFGIEDMFSGEAQSYFGELKNLLTDMSKIFSPDKENYENDLRKLIF FT SFKSLMTDRTIVNSSFFHQFKHWREAILPFVVENYDQLPLNEKLKISQMHH FT VFCGLHVIHNLGIYAEKAIIEWEKVVEQEGSIHGGFKSSQNSRTFDILYEL FT SKLTSYRHGDQRNGKADEWKAFLRKRFSKNFMVSFLHHRFNIIFVLGGATY FT YHKDHLKEFVFNLGGSNFLHESIRHDIDNIIFHASTRALGIFNKLISGPLF FT RIIEEEGHIFSLNTIWLQMFNYFVSCSSDASKMLSGSTFFDVKYLTKDEVF FT DHLFSNCNDLLLDALTQECLEVICCTCSVMIQSQLRDQLPGGKYYNPGDDV FT LEETQNCPRTNVVSERDFASYDRRLKMKPNMTTVAAAGVIMFNNNKTADWI FT DGKSQEEIERLVILARRNRKGYIRKYREKKANILQYKIDEMDKRNLEKEIK FT EQKASEEKEFLTTEIGKDGGLILCEEDFEKILGTENEIVMKLQRQIKFRKK FT VLQQKFPEKWLQLGEKAKGEYYKKFGIDKLKTNLISIFKFLKDVPEQRATA FT IKCSVIRQINERQEMLLTLKKKVRLEGDARLIMETGSKIRTGTSKGGVPKF FT LGKRIKHKMVDNDGKDVWYSGLVVSVLDDNEFDEECEFQILYDGYEDKYDI FT ELVKEWRMKCVVVEGKADGYVEGEVCKKQKCS*" XX SQ Sequence 8506 BP; 2864 A; 1113 C; 1427 G; 3101 T; 1 other; tagcgattcc acggctaagc gaaaaacatt tgacgtttga caaaaaacac gaaattttca 60 tgaagtttgc tctcttatat ctttttgtat aggaaagcaa gcaacctcag gttttttgca 120 tcaattagtg ttactcaacc tcttttcagc catatatgaa aagtactatg ctttatatgg 180 cttcacttaa atattatcca tatcttgaga ccacttttga gaaaaaatct tgaagcggga 240 acgttttttt aaaacgaagc ttttggaggt aaggattccg aagtaccaaa acaaatccaa 300 tttttataaa actttccgta tatattaaaa aaaggttgag gaatctaatg gaatgtgaac 360 actttttaac atttatacat aaatttttag ttctcagtgt ttaaaaactt caaggttaat 420 tattatagaa aaaaaagtga tttttgaaag tgttcaagtg gaaatggcaa attcagcctt 480 tcctcttttt aaaacaagtg tttaattatg ttttcttaaa aaataggtgc ttcggatttg 540 tatattagaa ttatgtatat atacatctat atttcttcgc gtattttatg acgttttttg 600 ttttgatttt gcaattccga aattttaact acaaataaaa ttgaaagtga aagtatattc 660 cttattaatg acttaagttt tcaaggcaat aaaaataaag attttatatt tttaaaggta 720 ttttgtctgg attgctcaac tacaagcttg attctcactg tttctactac tgtttttagt 780 ctgtagattc tctaaactgt ttttttcaaa acaaaaagaa atagtaatca actaaacttt 840 tagtctaaag tttacaaaat ggaaatgtgt aagaatgtta tcattttttt tttcatttat 900 ttttatattt ttttattggc taagtaaaat aaccaatgaa ctttaatttc aaattcaaat 960 tctcttcttt aggctgtgaa cttaacaatc tacgtgaaat ggtggttcaa tctataagtc 1020 aagaatgtaa gtcatacttt gatagacttt ttaaattttt aaatccggaa aaatgtataa 1080 caaacatttt atatttcaat aattttattt atagaatctt aagcttgatg atctaccttt 1140 atttttattt tcacaaagct agatattatc aaacttcaac tctttgatta acaatagatt 1200 tattgtaatc actcaaatta aatttaatat taattttagc taccaaagtg ttgtctgaaa 1260 tagcaacctg caagaatatt ttggattttc tggatataaa tgatggcatt atttctctca 1320 aatatcctga ttttgtgtta accaatagcc attttctaca aatttgcaaa gacaaaaata 1380 aacataacat aacattagaa aatctttcat tattttttac aacctactgt aagtataata 1440 tctctagagt gatgcttata aggatcttaa aaaagttaaa aggttacatt gtaaaattaa 1500 atcgagccaa aaatttggac tttaaaaaac attttttaag ctctgcttta tcttttaatg 1560 gtgctttttc tgatgttgtt ggtttgttac cttctgatgc ttgttctacc cctacttcta 1620 tctacaatag tgaaattgtt ccaaatcaag ttactgtttt tgtatcttct gatttaaatt 1680 tgcaaattac taatgatgaa ttgtcgtgtc ctgttacccc aacttctatt atatacatcc 1740 ctttagaaaa taggtctgaa aaatttctta ctgaaacgcg ttttagtgaa gagaatcttc 1800 cagtttccaa tttagataaa aaactacctt tattttctgg tgatttgacc tatcaaataa 1860 aaagatacaa acggagaatt aatcacctta aattgcaaaa tagacaagtt aatagaagat 1920 gttatgaata taaaagaaaa tatcaagcaa tcttgtcaaa atataatgtt cgaaatgtca 1980 acaaaagaga cgttagaaaa gattgtaaga ttaaccaatt gttgcaaata aatttaagat 2040 tagaaaaaga aattgatttg gctagaaagc gttcccaatc agatcgacaa aaggcctggt 2100 atttaaagaa aaaaattgaa aacactattt tgaaaacatc tgatgaaaca aatgattcat 2160 tcttaaagga gagtttgaat tactttgaaa atcttacaga agagctgaag gaaagagttg 2220 acatttttga aaaaaatgta attgacgttt atgaaggagg tagatataat gatgaaattc 2280 gctcggtata ttacgacctt ttaagtaaaa atgtttctgt taataatgtt gaatctgtta 2340 tcagaactgt actacaaaaa atggctggaa tttcatgtgg cacacttcca aaaaaatctt 2400 tggcttctga attttttagt gaaatgaacc ttttagcaaa agctcaagtg agagaggcta 2460 tgttgaatag tacaaacaat gttcttcata cagatggcac aaaatataat tttagagagg 2520 ttggtagttt tcaagtcaca acatcatctg gtagctacac gtttgggata gaagatatgt 2580 tttcgggaga ggcacaatct tattttggag agctaaaaaa cttactcact gatatgtcta 2640 agatattttc tcctgacaag gaaaactacg agaatgatct taggaaactt attttttctt 2700 tcaaatcttt gatgacagat cgcactattg ttaattcaag tttttttcac caatttaaac 2760 attggagaga agcaattttg ccttttgttg ttgaaaacta tgatcagctt cctttaaatg 2820 aaaaattaaa aatttctcaa atgcatcatg ttttttgtgg cttacacgtt atccacaatc 2880 taggaatata tgcagaaaaa gcaataatag aatgggaaaa agtggttgag caggaaggta 2940 gtattcatgg tggttttaaa agttctcaaa actctcgcac ctttgatatt ttgtatgaac 3000 tttcaaaact tacaagctat cggcatggtg atcagcgaaa tggaaaagct gatgaatgga 3060 aggcattttt gcgtaaaagg ttttcaaaaa attttatggt ttcatttctg catcatcgct 3120 ttaatattat atttgtactt ggtggagcaa cgtattatca taaagaccac cttaaagaat 3180 ttgttttcaa tcttggtggt tcaaattttc ttcatgagtc aattaggcat gatattgaca 3240 acatcatttt tcatgcttca actagagctc tgggaatttt taataaattg atatcaggac 3300 ctctttttcg tattattgaa gaggagggtc atattttctc tttaaatacc atttggttgc 3360 aaatgttcaa ttattttgtt tcttgttctt ctgatgcatc aaaaatgtta agtggttcta 3420 cattttttga tgtaaaatat ctcaccaagg atgaagtgtt tgatcattta ttttctaatt 3480 gtaatgacct acttttggat gccttaacac aagaatgtct tgaagttata tgttgcacat 3540 gttcggttat gattcagagt cagttaagag accaattgcc tggaggaaaa tattacaatc 3600 cgggtgatga tgtgcttgaa gaaactcaaa actgcccacg tactaatgtt gtgtctgaac 3660 gtgactttgc ttcctatgac cgcaggttaa aaatgaaacc taacatgaca acagtagctg 3720 cagctggtgt gatcatgttt aacaacaaca aaacagctga ttggatagat ggaaaatcac 3780 aggaagaaat tgaaaggcta gttatattag caaggcgaaa tagaaaaggg tatataagaa 3840 agtacagaga gaaaaaagca aatatattgc agtacaaaat tgatgagatg gacaagcgca 3900 acttagaaaa ggaaataaag gaacaaaagg caagcgaaga gaaagagttt ttgacaacag 3960 aaatcggaaa ggatggtggt ttaattttgt gtgaagagga ttttgaaaaa attttaggaa 4020 ctgaaaatga aattgtaatg aagttgcaaa gacaaattaa atttcgaaaa aaagtattac 4080 agcaaaaatt tccagaaaag tggttacaat taggagaaaa ggcaaaaggg gagtattaca 4140 aaaaatttgg tattgataaa ttaaaaacca atctaatatc aatttttaaa tttttaaaag 4200 acgttcctga acaacgcgca actgcaataa aatgttcagt tatcagacag attaatgaaa 4260 gacaagaaat gttgttaaca ttaaaaaaaa aagtcagatt agagggtgac gctaggttaa 4320 ttatggaaac tggaagtaaa ataagaacag gcacatcaaa gggaggggtt ccaaagtttt 4380 taggaaaaag aataaaacat aaaatggtag ataatgatgg caaggatgta tggtattcag 4440 ggctcgtggt aagtgttcta gacgacaatg aatttgatga agagtgtgag tttcagatac 4500 tatatgatgg atatgaagat aagtatgaca ttgagttggt caaagagtgg aggatgaaat 4560 gtgttgtggt agagggaaaa gctgatggtt atgtggaggg cgaagtttgt aaaaaacaaa 4620 aatgcagtta atatcatttt tgacacaatg gggatttaat ttattgtatt tatgtttgtt 4680 attattctta ttattaataa tattattgtt attattattt ttattgtcat tcaatttaat 4740 gtgttcttgt ttgtgttctg tttggtcatc tggagcggtg ttggcgcacc gacaattctt 4800 tatatgtata tttgttttct attggacatc tgccgtcatg gtgacatacc cgtaagtcat 4860 gctttctatt tttgtgctga catacctgtc aatatatgta tccttgtgtt tacatttttt 4920 tgtgtttgca tgtatgtttt tgtaggtgtg tctgcaatta tttgcatgtt tgtttttact 4980 cacttgttag tcactatatg catgtttgtg tttatgtatc tgttagtcat tatgtgtatg 5040 tttgtgttct gttggacctc tgaagtggtg ttgacgcaca tgtaagtcac tatatgtatg 5100 tttgtgttta tgtacctgtt actcattatg tgtatgtttg tgttctgttg gacctctgga 5160 gcggtgttgg cgcacccgta agtcactata tgtatgtttg tgtatatgta tccgttagtc 5220 attatgtgta tgtttgtgtt ctgtcagacc tctggagcgg tgttgacaca cccgtaagtc 5280 actatatgta tgtttgtgtt tatatacctg ttactcatta tgtgtatgtt tgtgttctgt 5340 tggacctctg gagcggtgtt aacgcacccg taagtcacta tatgtatgtt tgtgtttatg 5400 tacctatcag tcattatgtg tatgtttgtg ttctgttgga cctctggagc ggtgctgacc 5460 cacccgtaag tcattatatg tatgtttgta tttatgtagc cattagtcat tatgtgtatc 5520 tttgtgttta tctacctgta agtcattata tgtatcttac tcttttgttg gaactgtgga 5580 atgactataa tttcatttga taatgcaggt tagtcatttt gatcaagttt ttgttttttt 5640 ttatttatgt ttgtgttctg ttggtcttgt ggcacagcgt tgaaacaaca gtaagtcatt 5700 gtatgtatat ttgtgagttg ttccaactag gctgcaagca atcattgtag gagttggaag 5760 ttactggaag agaaaagatg aagtttgtag agtaagataa caattgacag atgacttaaa 5820 agattgcaaa ttatatgaaa cagggaagca agataaagga agcgaattcc aaagaactga 5880 tgttcgagga aaaaaataga agaataagag tttttggagc acttagaaac agtcacaaga 5940 gaatgaattt tagtagataa acaagggact ctagctattt agagcagtgc ccatatttgt 6000 agaaaagaga aagagaagca ccattacaac gatgtgataa tggttggagg ttggctgcaa 6060 gagcaggtcc aactatgttt acaatgcatt tttgctcctt gtctaaaaga gaaagggcgt 6120 cattagaata tctgccccag cgttttttgc agcgtttagt gccacagggt ctgtgtgttt 6180 tgtgattgtg atgttattgt tgtgatgttt tgtgttgata gacaacttgg cttaccttct 6240 cgctggtgtc tatcgttaaa aatgattatt aggttgtttc gttactgaaa ctacttctaa 6300 tcattcacta aaagccaaga gataaatttt agacaaacaa tattttttgt tttaatgttg 6360 ttttcatttt ttcatcttag gtgttgtcga tagttcacaa atctcaaaaa ataagtttct 6420 tattttatat ttatttagtt ttaattgtta atttataact ttttttaatt tttaattttt 6480 aaaatgaatt aacaaatatt ttttttaaat tattgtgtat ctaatgttat ttgatgtatt 6540 attgtatgta tttagatgta ttgtttttaa taataatttt attacaattt taggtagagt 6600 tctgctgtta cacctactat tactgttgta agtactcaaa atttcaggaa gttctgttgc 6660 tgtaagttat tttcattcaa atttgaattt tagttttatt aaagaatctt ctgtctgtta 6720 aataggtaat tctgaatgat ttatttcaat gcccttacgc agtgatttac taatcggttt 6780 gtaaataaag caatagtttt aaacagatga attaattgga ttccacgatc ctttatgcat 6840 atagcatgaa gtttttataa tataacctaa atttttttct ttttttttct aaaagattta 6900 aatctaaatt tatttttaaa agactaggta aataaataaa aacatttcag tctgtaaaat 6960 taacaattct ggataatttt gacagatgca aaaaaaatta tggcatttaa ttttcttata 7020 gtatataaga aaattaaatg ccataatttt ttttgtatct ctatcaaaaa tttcaatttc 7080 aaatttcttc agtctgtaaa atagacgatt ctgtatgatc ccaacagatt caataaaaac 7140 tggcacacta tttaaaattt caagtcaaat ataataactg ttcaaatata tcaaatatat 7200 ttaaggtcga tatgcccaat tggtttgtgc atcccttttt ctgtaattta atgaccggtc 7260 tgtaatttat gcaataattt tactgacagg tctgaaaatt atgcaatagt ttttaaatga 7320 tgtatttttt ggattccatg attcctttat gcatatttgc atagtgcaac cataattttt 7380 ttcatcataa cacagttaaa taagtagaaa cccttcagtc tgtaaaataa acaattctgg 7440 ataatcccga cagatttaat aaaaactgga tcactattta aaatttcaag tcaaatgtaa 7500 taacggttcr aaaatatcaa aaattttaaa aaaattacaa attatttaaa gttgatatgt 7560 ttaatttgtt tgaacagccc attttcagca atttactgac cgaaaagtaa tttatatgat 7620 tatttcactg accggtctgt aaattacgca atagttttca aatgttttat tttttggatt 7680 ccatgattct tttatgcata ttcacatggt ataattataa tatcacccca aattttattc 7740 tccaaaacta agaaaataat gtaaaaccct tcagctgtaa aatagatgat tctggacaat 7800 cgcaacagat tttttaaaat aaaattttgg ttttttattt tcttatagtt ttttgatcac 7860 tatgaaaaat ttcaagtcaa aagtaataac cattcaaaag ttattcaaaa gtcagtatga 7920 ccccaaaaat atagtaatta gtaaaaactc tcagtctgta aaataggtga tcctggtaga 7980 tcctgacaga ttttaaaaaa aaaaattttg gttttttatt ttcttatagt tgttggattg 8040 ctatagaata tttcaattca aatgtaataa cgattcaaaa gttattaaaa agtcagtatg 8100 accctaaaaa tgtataaatt agtaaaaatt ctggtctgta aaatagataa tttctggtca 8160 atcctgacag atttctaaaa aaaaaaattt ggaattttat tttcttatag ttaattggtc 8220 actatggaaa atttcaagtc aaatgtatca tcggttcaaa agttattcaa atgtcaataa 8280 tactttcatt ttcggtgcca tttcgcagta aattttccat acttaagccc tatacctttt 8340 gtcaaaaaat tacccgtttt ttcactacaa aaaggtcaat aacttttgat ccgcttaact 8400 tcaaaggcca aatgacccct cattttttaa gtcaaaccaa gctctataca atgacatcaa 8460 ttatatttga atattttaat tataaaaaaa tcgtctggaa tggcta 8506 // ID Gypsy6-LTR_Dpse repbase; DNA; INV; 887 BP. XX AC Unknown_group_213; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6_Dpse; KW Gypsy6-I_Dpse; Gypsy6-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-887 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1060-1060 (2009). XX DR Genome; Unknown_group_213; Positions 15654 14768. XX SQ Sequence 887 BP; 287 A; 227 C; 195 G; 178 T; 0 other; tgtgttccct gcgcaaccaa aagatctacc atccgcactg gctctggcca gggaagctga 60 ggctagcatt gaaagaagca tgtttgcgaa ctcttacgct aaagccttgc aagaccgagc 120 gcaaattgcc gaaactaaca aaaaccgtgc ccaaggcaga caaaccaagt ttaacaaaga 180 tgagcaaaat caggagagaa atccccattt tgaaaaacgc caaaagggcg gtccccagcc 240 tcaggcccct aaagacaatc aatccgaagc tccacagccc atggaagttg actcttcctc 300 cagatttaga caaaaaaccg actacagcag gcgtcagact tacgagtcca acgcagcaaa 360 atgcaagaat tcgtcagagc gttcaacagg tcaaagacgg caacgcatta acaatgtggt 420 acaatcagac tccaaagagg ccgtaacaga attcgagaag gcggccaaag aggcagttga 480 agaattagag agtgagaacg agtacgcacc tggtgacgac tcaattaatt ttttagggaa 540 cactcccggc ttccgtacat tcaacgacgg ttggctggga gaactctgaa gatgttaatt 600 gacactggcg cggcaaagaa ttatgtgaag cccttaacag agctaaaggc cataacgccg 660 gtcgccaagt cattctcggt aaactcgatc cacgggtcta ccgaaatcaa gcgcaagtgc 720 ttgatgaata ttttccagca tacctccccg ttttttctcc tcgataccct aagttctttc 780 gacgccatca taggcttcga cttgctaaca caagccggag tcaaattgaa cttggcaaaa 840 aacactctgg aataccaggg tacatctgaa aaactccagt actatca 887 // ID Copia-26_CQ-I repbase; DNA; INV; 4166 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-26_CQ_; KW Copia-26_CQ-LTR; Copia-26_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4166 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX CC Positions [1473-2000] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1401..2678 FT /product="Copia-26_CQ-I_2p" FT /translation="MADGIEFQEGVLPDCVSCFEGKQTRQPFPSSSSRATE FT LLELVHSDLCGPVEVPSLGGSRYFMTFLDDASHKVTVYFLEKKSQALDAFK FT RFKARAERESGKKLRVLRTDNGREYVNADFRKVLEQDGIGHQTTCPYTPEQ FT NGRAERLNRTLVEKARCMLNEAKLGKEFWAEAISTAAYLVNRCPTRTLGDM FT VPEQAWTGKKPNLKNLKVFGSSVMVHIPKVRRQKFDPKSRRGTFVGYCEGT FT KGYRVYDPETGDFVISRDVVAIEEGSSNKSVACELILKPVEFMELHFVEAM FT QPELPIEREDVVRDRRDEGDQAGVIEIGHEDPPPEEPVEALVDFLEGALPS FT QPERDQLPSSKGSRGAANGSAEPRASTTNSTVMVLFLAKMFPHSQLPHPRC FT NPLLLRWKTRRATRRFCDGQIEIDGLRPCRKR" FT CDS 2579..4153 FT /product="Copia-26_CQ-I_1p" FT /translation="MQSVIVEMEDPKSYEEVLRRADRDRWIAAMQEEMNSL FT SENRTWVLADLPNGRKAIRNKWVFKTKRGPDGVVQRYKARLVVKGCSQRPG FT VDYDEVYSPVVRYATVRFLMALSVKYNLDIDQMDAVTAFLQGELNDEEIYM FT LQPEGFVDAGGRVCRLKKALYGLKQSSRVWNTQLDAALQDFGLHRSAVDPC FT LYWSFQGDKLLFVTIYVDDFLIFTNDLNMKKKLKANLNKRFKMKDLGQASH FT CLGIRITRNREDGKLWLDQQAYVEDIVKRFGMADCYPVAMPAEPSLRLDKT FT MSPGSKEEAEEMHKVPYKEAVGCLSFAAQVTRPDIAFAVNMVSQFAANPGK FT QHWDAVKRIFRYLKGTANKKLEYSKEGADQIRGYTDADWGGDPDDRRSTTG FT YVFTLQGGAISWNVKKQPTVALSSCEAEYMALSRTIQEAMWWNNLRAQMFK FT VEPVDLYCDNQSAIAIASNGSYNPRTKHVSIRYHFVHESLQHGLVKLGYVS FT TVEQPADGFTKPMNSQKQTKLCEAIGISD" XX SQ Sequence 4166 BP; 1051 A; 925 C; 1331 G; 840 T; 19 other; agacgtgttt tctctactct actaaaggtt atgggcccag ctcaaaccta gaagaagttc 60 aagaagttaa ttttggaatt tgaagatgtc gacgagaagg tcaccagttc gagatccagc 120 acgaaacgga aacggaggag ccggcaacgg cgggaatgga gccccgctag cccaagtcgg 180 aaggaacgtt cgggtctacg gtagttcggt ggccatgccg gcgatcgaga aactaaaggg 240 tcgggagaac tactcgacgt gggcgttckc gatgcggatg accctgatcc gcgagggmag 300 ctggcctgcc gtcaagcgac gagcggaagg tgacatcgtg gcggaagacc tgaaggaacg 360 ggcgctggct acgatctgtc tgggtstgga aaacamcaac ttcggtcwcg tscaggacgm 420 ggaagmcscc gaggaagcct gggggaagct ggaggccgcg ttccaggaca gcggttctgc 480 ccggcgagtt ggcctgctca ggaagctcac gtcggtgcac ctcgasgact gcgcgagcgt 540 cgaagagtac gtcgacgagg ttatgactgc gagtcaacgt ctagctgcga tcgggttcaa 600 ggtggacgat tcctggctgg cgggaatgct gttgatggga cttccggagc actacgagcc 660 gatgatcatg ggactcgagg cgtctgggaa ggcgmtgtca gcggatgccg tgaagtcgaa 720 gatcctgcag gacgtgaags wcgagmgkgg gccgacmaas kccgggtgcg gacggwgcac 780 tgtaccaggg gtacaagttc gagtcgaacc agggagccgc tgcagtcaag atcgagaaaa 840 gcgatgcttt gttacaaatg cgacaagccg ggacattttg cggccaagtg tccagagaag 900 cggatcacga aaaatcgaag cctacagcct ggttcagcac ggacacgacg ggtggtgtca 960 cgagcaggcg catggtactt ggactcagga gcatcgtgtc acctggcaac gtcggaacag 1020 aacttcaacg gatgaggagt cggtgcagtt cgtcgttgga acggcgaaca acggttcgat 1080 gacggcggtc tcaagaggaa acgtggcatt ggatgtccag acggcagctt ggacgttcgc 1140 ggagtgctga agatcccaga cctggcctcg aatctgctct cggtgagtac aatctgtaaa 1200 gagggccaca cggtgatctt cacggcaaag cagtgcgagg tactgggcga agacggtaag 1260 caggtcgtct tccgggcgta gaggagggtg gactgtatcg attggaaaga ccggagagat 1320 cgtttctggt gagcaagttg gagctttggc acaagcgact cggccatttg aacgttggga 1380 gtttgcagaa actcaaacgg atggcggatg gaatcgagtt ccaggaaggc gtgctgccag 1440 attgtgtctc gtgctttgaa gggaagcaga ctcgacagcc gtttccaagt agcagttcga 1500 gggcaactga acttttggaa ctggtccact ctgacctgtg tggtccggtg gaggtgcctt 1560 cgcttggagg aagtcggtac tttatgacgt tcttggatga tgcgagtcac aaagtcacgg 1620 tgtactttct ggagaagaag agtcaggcac tggatgcgtt caagaggttc aaggctcgag 1680 ctgaacgaga atctggaaag aaattgcgag ttttacgtac ggacaatggc agagagtacg 1740 tgaatgcaga tttccgcaag gttctggagc aggacgggat tggtcatcaa accacgtgtc 1800 cgtatacacc agagcagaat ggtcgggctg agcgactgaa caggacgttg gtcgaaaaag 1860 cgcggtgcat gctgaacgag gccaaactgg gcaaggagtt ctgggctgag gcaatttcga 1920 cagcggcgta cttggtgaac cgttgtccga cacgaacgct tggagacatg gtgccggaac 1980 aagcatggac cggaaagaag ccgaatctca agaacctgaa ggttttcgga tcgagcgtga 2040 tggtgcacat cccgaaggta agacgacaga agtttgaccc gaaatcaaga agaggtactt 2100 tcgttggata ttgcgagggc acaaaggggt accgtgtcta tgatcctgaa acgggagatt 2160 ttgtgatcag tcgggatgtc gtcgccattg aagaaggttc atcaaataag agcgtggcgt 2220 gtgagctgat tctgaaaccg gtcgaattca tggaattgca ctttgttgaa gccatgcagc 2280 ctgagttgcc tatcgagcgt gaggacgtcg ttcgagaccg gcgggacgag ggagatcaag 2340 cgggcgtgat cgaaattggt catgaagatc cacccccgga ggagcccgtt gaagccttgg 2400 ttgacttcct tgaaggcgcg ctcccttcgc aaccagaacg tgaccagctg cccagcagca 2460 agggttcacg aggcgcagcg aacgggagcg cagaaccccg ggcaagtaca acgaattcaa 2520 ctgttatggt actttttctg gccaaaatgt ttccccacag tcagctgcca catccccgat 2580 gcaatccgtt attgttgaga tggaagaccc gaagagctac gaggaggttc tgcgacgggc 2640 agatcgagat cgatggattg cggccatgca ggaagagatg aactctctga gcgagaacag 2700 aacgtgggtg cttgctgatt tgccaaatgg acgaaaggca atccggaaca agtgggtgtt 2760 caaaacaaag cgaggtccgg acggtgttgt acaacggtac aaggcgcgac tggtggtgaa 2820 gggttgttca caacggcccg gtgtggacta cgacgaagtc tattctccgg tggtgcggta 2880 tgcaacggtt cggttcctga tggcgctttc ggtgaagtac aacctggaca tcgatcagat 2940 ggatgcagta accgcattcc tgcagggcga gctgaacgac gaggagatct acatgctaca 3000 accagaaggg ttcgtggatg ctggaggaag ggtgtgcaga ctgaagaaag cgttatacgg 3060 tctaaagcag tcaagtcgag tttggaacac tcaactggac gcggctcttc aagattttgg 3120 actccatcgt tcggcagttg acccgtgtct gtactggtcg ttccagggcg ataagctgtt 3180 gttcgtgacg atttatgtgg atgattttct catctttacg aacgacctga acatgaagaa 3240 aaagttgaag gcgaatctga acaaacggtt caagatgaag gacctgggcc aggcatcgca 3300 ctgtcttgga atcaggatca caaggaatcg cgaggatgga aagctgtggc tggatcaaca 3360 ggcctacgtc gaggacattg tgaagcggtt cggtatggcg gattgttatc cggttgctat 3420 gccagcggaa ccgagcttgc ggctggacaa gaccatgagt cctggaagca aggaggaagc 3480 cgaggaaatg cataaggtgc cgtacaagga agctgttggg tgtctttcgt ttgcagctca 3540 agtgacacgg ccggatatag cttttgcagt caacatggtg agccagttcg ctgccaaccc 3600 tggaaagcaa cactgggacg cggtcaaaag gattttccgc tacctgaagg ggactgctaa 3660 caagaagctg gaatattcga aggaaggtgc cgaccagatc agaggataca ccgatgccga 3720 ttggggagga gatccagatg atcggaggtc cacaactggt tacgttttca ccctgcaagg 3780 tggagccata tcgtggaacg tcaagaagca gccgacagtg gctttgtcgt cgtgtgaagc 3840 tgaatacatg gcgctttctc ggacgattca agaagccatg tggtggaaca acctgcgagc 3900 tcagatgttc aaggtggagc ctgtggacct gtactgcgac aaccaatctg caatcgccat 3960 cgctagtaac ggttcgtaca atccacgcac gaaacacgtg agcattcgct accacttcgt 4020 gcacgagagt ctgcagcatg gacttgtaaa gctgggatac gtatccaccg ttgagcagcc 4080 agcggacgga ttcacgaagc cgatgaacag ccagaagcag acgaagctat gcgaggccat 4140 tggaatatcg gattagggag gagtat 4166 // ID Gypsy-7_CQ-I repbase; DNA; INV; 4631 BP. XX AC AAWU01000353; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_CQ_; KW Gypsy-7_CQ-LTR; Gypsy-7_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4631 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 393-393 (2011). XX DR GenBank; AAWU01000353; Positions 15443 20073. XX CC Positions [3513-3863] - Integrase core CC 'ACCCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 258..4595 FT /product="Gypsy-7_CQ-I_1p" FT /translation="MDADQFKQFLDQQKSIMERLLAKLSTESTGATGSAAN FT RGQACNVPVPQPSPLSLVGDMAENFDFFERSWNNYVKASGLDKWPATEAPR FT KVNILLTVIGEQARRKFHNFELTEAQQGDPKAALEAIKALVVAKRNIIVDR FT FDFFSAAQSASESIDDFCARLKALAKLAKLGTLEPELLAFKVVTSNKWSHL FT RTKMLTISDITLEKAIDLCRAEEIAAKRSQELGGFLPTPEVNKISRGRGDH FT KQKTLRCKFCGGSHEFTKGACPALGKRCHRCKGKNHFETVCKASSKGRKSK FT RVKEVKDDYYSESDTASGGSESSEESGEECEIGKIYDNSQNGGCVLAELDL FT KFNNSWETVLCELDTGANTSLIGLDCLKKLTGTDNPELLASKFRLQSFGGN FT PITVLGQVKVPCRRPNRKFSLVLQVVDVDHRPLLSARASRELGLVKFCNTV FT TFEEPVQPTPPSPHSEKLFNIYRVKAQEIVSSHESLFTGYGKFSGTASLEI FT DDSVTPTIQPPRRVPIAIRAKLKEELEKLESDGIIVKETKHTEWVSNVVIV FT QRGSGFRICLDPVPLNKALKRPNLQFVTLDEVLPELGKAKVFSTVDAKKGF FT WHVVLDEPSSKLTTFWTPFGRYRWIRLPFGVAPAPEIFQIKLQEVIQGLKG FT VECIADDLLVFGVGDTLEEALADHNRRLEKLLCRLELHNVKLNKAKLKLCE FT RSVKFYGHVLTDEGLKPDESKVAAIRDYPQPKNRKEVHRFVGMVTYLGRFI FT NNLSANLTHLRMLIPESATWKWTSVEESEFNKVKSLVCDIKTLRYYDVNQP FT ITIECDASSIGLGVVVYQRDEVVGYASRTLTATEKNYAQIEKELLAILFAC FT TRFDQLIVGNPKATVRTDHKPLVNIFKKPLLSAPRRLQHMLLNLQRYKLST FT EFVTGKDNVVADALSRAPAGGAEGDDFYKKQDIFKIFEEIQEVKLSSFLGV FT SSAKLNDLMRETEKDTPLQHVIGYVRGGWPASADQVPDAVKIFFGYRHELS FT TQDGLVFRSDRIVVPYILRRKMIESCHASHNGIEATLRLARANLFWPGMTS FT EIKNAVKGCSVCAKFAASQQNPPMKSHEIPVHPFQLVSMDVFFAEQSGVKH FT KFLITVDHYSDFFEVNLLKDLSPESVINACKQNFARFGVPQRVVTDNGTNF FT ANRKMVQFANNWDFELVTSAPHHQQSNGKAEAAVKIAKQLMKKAAESESDF FT WFSLLHWRNIPNNVGSSPASRLLSRSTRCGVPTAATHLVPRVEEDVPAKIE FT ANRKRSKLCYDRKARNLPELEVGSPVYVQLNPETSKLWTQGTITNRLNERS FT YQVNVGGADYRRSLVHLKPRKEPATLPDRQSSSCVMNYREEETTDENDHQQ FT LNTREERREATVDETVCQQAVQPCVLQPETSTLAEMSTPRELAVTPLPIRQ FT ERQERNRTITPQKVERPKRVTKIPEKFKDYKL" XX SQ Sequence 4631 BP; 1241 A; 1199 C; 1251 G; 940 T; 0 other; tggtgtcaga agctgacgca agcttctagg tcttttcggg cgtcatttag tgaaatcgcg 60 gaaaacgacc aggttttttt tcctacgaat aaaagtgttc gtgtcgagtc gcttcggcgg 120 ccatattaaa aaactgaact gaattgacca acgacggcgg ccattttgaa gcaaaagtcg 180 cgtcgttggc cgttgtgaaa aagtgactgt gtgaaaaata aaaaaagtga attgaaaaaa 240 aaagaacgct ttccagaatg gatgcggatc agtttaaaca gtttttggac caacaaaaat 300 ccataatgga gcgtctgctg gccaaactgt cgacggagtc aacaggggcg accgggtccg 360 cggcgaatcg tggccaagca tgcaatgttc cagtacccca gccgtcgccg ttgtctcttg 420 tgggggacat ggccgagaac ttcgatttct tcgagcgcag ctggaataac tacgtgaagg 480 ctagcggctt ggacaaatgg ccggcaacgg aggctccccg caaagtgaat attctgctga 540 cggtgattgg agagcaagcc aggcggaagt tccacaactt cgagttgaca gaagcacagc 600 aaggagaccc gaaagcagca ctcgaagcta tcaaagcgtt ggtcgtggcc aaacggaaca 660 taattgtgga ccgttttgat ttcttctccg ccgcacagtc agctagtgag tccatcgacg 720 acttctgtgc caggctcaag gcgctggcca agctagcgaa gctcggaacc ctggaaccgg 780 aactgctcgc tttcaaagtc gtcacatcca acaagtggtc gcacctgcgc acaaagatgc 840 tgacaatttc ggacataacc ctcgagaaag ccatcgatct gtgccgtgca gaggagatcg 900 cagctaaacg atcccaagaa ctgggaggtt tcctgccaac gccagaagtc aacaagatca 960 gccgagggag aggagaccac aagcagaaga cgttgcgctg caagttctgc ggaggctctc 1020 acgagttcac aaagggagca tgcccggcgc ttggcaaacg ttgccaccgc tgtaaaggca 1080 aaaatcactt cgaaacagtc tgcaaagcta gctcgaaggg tcggaagtcg aagcgggtca 1140 aggaagtgaa agacgactac tactccgagt cggacacagc gtccggcggc agtgaatctt 1200 ccgaagagtc cggagaggag tgcgaaattg gcaagatcta cgacaattcg caaaacggcg 1260 gttgcgttct tgctgaactg gacttgaaat tcaacaactc gtgggaaact gtgctctgtg 1320 aactggacac tggggcaaat accagcctga ttggcctgga ctgtctcaag aaactgactg 1380 gaacggacaa tcccgaactg ctggcgtcga aattccgcct gcagagtttt ggtggaaacc 1440 ccatcacagt attgggccag gtgaaggtgc cgtgtcggcg gccgaacagg aagttttcgc 1500 tggttctcca ggtcgttgat gtcgaccacc gccccctact gtcggcaaga gcttcacgcg 1560 aacttgggct ggtgaagttt tgcaacacag ttaccttcga ggagccagtt caaccaacac 1620 cgccgtcgcc ccactctgag aagctgttca acatttaccg cgtgaaagct caagaaatcg 1680 tcagcagcca cgaaagcctc ttcacgggct acggaaagtt ttccggcaca gcgtcgttgg 1740 aaatcgacga cagcgtcaca ccgacgatcc aaccgcctcg acgagttccg atcgcgatcc 1800 gagcaaagct gaaggaagaa ctcgaaaagc tcgaaagtga cggcatcatc gtgaaggaga 1860 cgaaacacac cgaatgggta agcaacgtcg taatcgtcca gcgaggctct ggctttcgta 1920 tttgtctgga tccggttccg ctgaacaaag ctctcaagcg tcccaacctc caattcgtaa 1980 cccttgatga agtgcttccg gaactgggca aagcaaaagt tttttccacg gtcgatgcca 2040 aaaagggctt ctggcacgtt gtgctggacg agccctccag caagctcacc accttctgga 2100 caccgtttgg gcgttaccgt tggattcgcc ttccgttcgg tgttgcacca gcgccggaga 2160 ttttccagat caaactccag gaagtaatcc aaggtctcaa gggagttgaa tgcatcgcgg 2220 acgatttgct ggtgttcggc gtgggcgaca cgctggagga ggccctagct gatcacaacc 2280 gccgccttga gaaacttctt tgtcggttgg aactccacaa cgtgaaactt aacaaagcca 2340 agctgaagtt gtgtgaacgg tccgtcaaat tctacggtca cgtcctcacc gatgaaggtt 2400 tgaagccgga cgagtccaag gttgctgcga tacgtgacta tccccagcca aaaaatcgca 2460 aggaggtgca ccggtttgtt ggcatggtca cgtacttggg gcgcttcatc aacaatctca 2520 gcgccaactt gacccaccta cgaatgctga tcccggagtc ggcaacttgg aagtggacat 2580 ctgtggaaga gagcgagttc aacaaggtga aatcgttggt ttgcgacatc aaaacactac 2640 gctactacga tgtcaaccag ccaatcacca ttgaatgtga cgctagcagc atcggtctcg 2700 gtgtggtcgt gtatcagcgc gacgaagtgg ttggatacgc gtcgcgaacc ctcacggcaa 2760 ctgaaaaaaa ctacgcgcag atcgaaaaag aactcctggc gatacttttt gcctgtacca 2820 gattcgacca gctgatagtt ggaaatccaa aagcgaccgt caggaccgac cacaagccac 2880 tggtgaacat cttcaagaaa ccgcttttgt cagcaccacg acggctgcaa cacatgttgc 2940 tcaacctaca gcggtacaag ctgtcaacgg agttcgttac gggaaaggac aatgtggttg 3000 cggatgcttt gtcacgtgca ccagccggcg gagcagaagg agacgacttc tacaagaagc 3060 aggacatctt caagatcttc gaagaaatcc aagaggtgaa actcagcagc ttcttaggtg 3120 tgtcgagcgc caaactgaac gatctcatgc gggagactga aaaggacacc ccactccagc 3180 acgtcatcgg ctacgttcga ggtggatggc ctgcctctgc cgatcaagtt ccggacgccg 3240 tcaaaatctt ctttggatat cggcatgaac tgtctaccca ggatgggctg gtcttccgca 3300 gtgacaggat tgtggtccca tatatcctcc gccggaagat gatcgagagt tgtcacgcaa 3360 gccacaacgg catcgaagca acactacggt tggcaagggc gaacctgttc tggccgggca 3420 tgacgtcaga gatcaagaac gcggtgaaag ggtgctctgt ctgtgcaaag tttgccgcgt 3480 ctcaacaaaa ccctccgatg aaaagtcacg aaattccggt acatccgttc caactcgtgt 3540 caatggatgt cttcttcgct gagcagagcg gcgtcaagca caagttcctg atcaccgtcg 3600 atcactactc ggactttttc gaagtgaacc tgctgaaaga tctgagtccg gaatccgtga 3660 tcaacgcctg caaacagaac ttcgcgaggt ttggtgtgcc ccaacgtgtc gtcaccgaca 3720 acggaaccaa ctttgccaac cgaaaaatgg tgcagttcgc caacaactgg gactttgagc 3780 ttgtcacgtc ggcaccacac caccaacaat ccaacggcaa ggctgaggcc gcggtcaaga 3840 ttgccaagca gctgatgaag aaggcagctg aaagtgaatc cgacttttgg ttttcgttgt 3900 tgcattggcg aaacattccg aacaacgtcg gatccagtcc agcatcacgc ttgctgtccc 3960 gttcaactcg ctgtggtgtg ccaactgccg caacccacct tgtgcctagg gtggaggaag 4020 atgttcccgc taaaattgaa gccaatcgga agcgatcgaa gctgtgctac gaccgaaagg 4080 cacgaaactt accggaactg gaagttggat ccccggtata cgtgcagctg aatccggaga 4140 cttccaagct gtggacacag ggcaccatca cgaaccgact caacgaacgc tcataccagg 4200 tgaacgtggg aggtgccgat taccgccgat cgctggtaca tctgaagccg cgtaaagaac 4260 ccgctacgct gcctgaccgc cagtcgtcat cctgcgtgat gaattatcgg gaggaggaaa 4320 caaccgacga gaacgatcac caacagctca acacccggga ggaacgcaga gaagccacgg 4380 tggacgaaac cgtttgtcaa caagccgtgc agccgtgcgt gctgcagcct gaaacgtcaa 4440 cgcttgcgga aatgtcaaca ccgagggagt tggctgttac accgttgccg atccgacagg 4500 agcggcagga gaggaatcgg acgattacac cgcagaaagt cgagcgaccg aaaagggtta 4560 ccaaaattcc agagaagttt aaagattata agttgtagtt ttggaatatt gtttttagag 4620 aaaagagagg a 4631 // ID CR1-42_BF repbase; DNA; INV; 1873 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-42_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-42_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1873 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1873 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1613-1613 (2009). XX DR [2] (Consensus) XX SQ Sequence 1873 BP; 521 A; 387 C; 343 G; 622 T; 0 other; ggggaaaccc cgttgcccga cttgattgat acttctagtg tgccaagtat ctcctcagtt 60 acattttctg tcagcaccat tgaagctatt ctcaaaaacc tggaagttac aaaggccaca 120 gggccagacg accttccagc tatggtcctt aataagtgtg cacctcagtt agctccttcc 180 ttgaaaatgt tgtttgaact gtgtttgaac actgggaaaa taccttccat gtggaagaaa 240 gcaaatgtga tccctgtgtt taagaagggg gacaagagca gtgtgtccaa ctatagacct 300 atctcattgt tatgtataac aagtaaagta tttgaaagat gtatattcaa ttctgtgttt 360 ccgttagtct cgaattctct ataccccctt caacatggat ttgttagggg taagtccacc 420 gcaactcaac ttttagaagt gtatgacgaa attggacagg tgttggacgt atcagggcaa 480 gttgatgtaa tatttttaga cttttgtaaa gcttttgata cagtatccca tgtcttactt 540 gtacacaagc tgaaattgtt tggaattaat ggtcaactac tttctgttat cagtaattac 600 cttaagaatc gccaccaaag aacaattgta gagggtcacc agtctaagta cctacctgtg 660 ttgtctggtg ttcctcaggg ttcgatcctt ggtccattcc tatttctgtt atttattaat 720 gatctacctg accacgtaca aaccggcagg atggccatgt ttgcagacga tgctaagtgt 780 tttaagagaa ttgattcaat ttttgattgc atcagattcc aggctgactt ggatagtctt 840 acaaactgga gtaaaacttg gaaaatgaag ttccatccat caaaatgttc agtgatcagc 900 gtcacacgta aaccactccc tgtaacctat tgctatcata tagatggtac aacacttagc 960 agaactgcta catatgaaga tcttggtctc caggtttgtc aaaacctcag ttggaatatc 1020 catgtacata gaaaagtatc aaagtgcaat tcattactcg gtatgataaa gcgatctgtt 1080 gggtacaatg ctccattaat agtcaaactt aacctgtttc gttctctagt aataccgcat 1140 ttagattatt gctcacaggt gtggtcccct cacacgagat ttcttttacg taaagttgag 1200 ggagttcaac ggagagccac caaatacatc tgtaacaact atgatttgtt atatagggac 1260 agacttgtaa ccactaagtt gttacctctt tgttataggc gtgaactttt tgacatattg 1320 tttctcttta agtgttttcg tggcctatac tccattgata ttcatcggtt tgtacaagta 1380 tcattaccac atagacaact ccgttcaact gaccctttcc aacttgttgt ccgtgtatgt 1440 aaaaccgaaa catttacttt ctcttacttc agccgcattg tacttatctg gaacacttta 1500 cctgcaacaa tccgacaaca tatgtactca aaccttacca ttccatctgt taggaaatta 1560 ctggtcaatc actatacatc ccgcacagat cacaccttct cagctgataa tttttgtacc 1620 tggacttcta tttgccgctg ctctcattgc ttactttcat gattcactag ctattttatt 1680 cccaaccgtt attgaacttg ttaacttgtt atcaatcttt tattgtattt atatgtaggt 1740 ttcgattata ggtttttatt acattgcttg tagatcttct ggggagtcgg cctcgtagag 1800 aaactggatt tctgttgccg gctcccctca gattgtaact gatctgtgaa gtttcaataa 1860 ataaataaat aaa 1873 // ID I-1_DP repbase; DNA; INV; 6085 BP. XX AC . XX DT 28-JUL-2009 (Rel. 14.07, Created) DT 28-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons from Daphnia - consensus DE sequence. XX KW I; Non-LTR Retrotransposon; Transposable Element; I group; KW I-1_DP. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6085 RA Kapitonov V.V. and Jurka J.; RT "New families of I non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1530-1530 (2009). XX DR [1] (Consensus) XX CC The consensus sequence was derived from multiple alignment of CC several copies ~98% identical to each other. The 3' terminus is CC composed of the (TCAAA)n microsatellite. XX FH Key Location/Qualifiers FT CDS 808..2106 FT /product="I-1_DP_1p" FT /note="ORF1." FT /translation="MAENLANSDSLSTPIDHTMEGIEEGNAKRRLNHNHDP FT RPHHSRLVDTLNSPLESGAPVIVNLVSEPQSFDRLSLLEKKQFINGLINTI FT GQVKDGTKWTRQGQLYIYPTSNRQKKQLLELKAVKEFQISCNLAKSEINVK FT GVIYNVPINNSDADLLELVSSQGVTHVHRFQSGPEESKTPLTTVALTFNTQ FT TLPREIVIAHEIFRVKKYIPRPSQCRKCWSLQHQEDTCKVSPICRYCSQQH FT PPTPVCTNAPKCPTCHKSDHAAGTFACPLFANKQNVIRFAYESNIPISEAG FT KLLDRNNVTQTKPIHLPVFEKENPEILALRREVQNLQQQIEKILQSPPITE FT LSERVTALETEVVQIKEQIEPLLTLPASVEKSNQEMKKGFSETQDQLAQLS FT ALIQRSLSVQPSQPGQANRPLPKPISNTTTNTSASTSKK" FT CDS 2110..5904 FT /product="I-1_DP_2p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="PPPPLKIIQWNARSLYKSKLQEFKYNLLLTNPHIVLL FT SETHWKDHYTAQFSAYQTFALNRPTQGGGVAILVKKNIQTSPLPLPQSSDI FT EAIGVTIKTRNNTHLDIISLYCPNGNSCRRGNIEAILNAPRNTTIIGGDLN FT AHSGMWEDGKPENNNGKIISNYLLNQSNLILATPKNLGTRPSHNNTQNSTI FT DLTFTSPALGPTTTIHLGPYWSSDHLPIIININTDLPPTQLINPNWRFNEE FT KWEEWNEEISKTLITNKLSNATSPESAYSILYSSIIEASQIHFTPNRNALV FT REKPKPWWTPQCKKATSMVRRAYNKWRSSLLQSDKTELNRLEAIKKKTIIK FT AKNDAWEKFIGSLDQPGNTTTFWNFAKFMLKEKRTKQSWPTLVDQTNRQLT FT DNLDKANAMLDQLCPNNNHPEDDRSTLYLQKIQDAINNEIPHPINSQFNIT FT ELKMCIKTLPNKAMGADRLHNKMLSKLSDQNLHSLLFVLNYMFRTGYIPEA FT WKHAVVTPILKPGKPPERTDSYRPISLTSCLAKIQEKIINNRLKWYLDKNA FT HLPKHQTGFRRGCTTTDNLIRLEEAISNGFNTSHSTTAVFLDLAKAYDDTW FT LTGLLYKVTKFNIRGVMLNWLSNFLTNRTVNVKIEDSLSTTRTINKGVPQG FT CVLSPILFNIMMADFPLPDPRTNLALFADDILIHTTSPTNPEAERILQRYL FT NKIDYWATSWKFKFSVPKCATLTFSRKRAKEIDVKLFLNKSQINSVNHYKY FT LGITFDSKLNWDNHLNNTLHSVERKANLLKLLTFGKSTLNTKLLVRIYKAT FT IRSKLDYGATVLSSIPKSKMNKLETTQNTILRNIIGALKSTPTPLIYLETG FT LTSIKDRWDLLASRYLNKLNTKPWNPAYSTIQNTQNNIKTWKPRSIPAVIP FT HLNKMRINSEECFSLMPSYNPWIEPIAPWNLFEIKTKLFPLSKKAAINKPQ FT LTQEIFRKLMENEPTDNLTIYTDGSCCELTNISSCAFYIPKLGEKKAWALS FT AFTTSFNAELAAIKQALQFTYPLDWPVITILTDSKSAIQAISNFKWESSTL FT ITEILNIITNQKSAGTQVTFVWIPSHAGIHGNEVADELANRTRTDPDGLVL FT EYTPNISEKLTNLKSKHIQSTFEKIKSSSTNLAVLHRTNMKPIPWFRHNNR FT RIQVALLRLRCGHNRLNHCVSKWNADTTPNCPNGCVETENNTHVLITCPHY FT SRARSPLKNLLASLQLPLDVPTITGMNNSIPKHTQKKIASRLIKFLIETNL FT VNRL" XX SQ Sequence 6085 BP; 2191 A; 1662 C; 928 G; 1304 T; 0 other; tgtggaactt ggtcatttct tcagctgtgg ctctgtgtgg tcgctaacaa aaaataaatt 60 taatttaact gttgcactta tcatctagct tcttgaatta ttacatatat ttccatttta 120 cggtacagcc cataattcaa gagcaagttg tgggtcatgc gtatgcgatg aggtttcatg 180 ggatgtccct tcaatttcaa aaattaatga ttaattacag tttcattaac ataaatgtgt 240 tcaatattat tacctgtcca taataaaaat tcccaatgtc acctaggtta ataaaaaaga 300 tattacaaat ttaagagtta aaatgaaaag tcaccataaa aatgccaacc cacccaccaa 360 gtaggacgtc aattataccc ccgacaacct cattgacaaa tttgacatga aaatcttaat 420 aaaattttga agcaattcaa aaataaaacc cgatttaaat gagaatatag ccacatgtct 480 ttctttttta atgctatgtg ctcttcagtc ttcaaacaac tgtaattctt tgtgatactg 540 atgtgcataa atgacgtccc actagccaaa tattgttact tccacatcaa ttttgtacat 600 caaaatgtca aaatagatag gttaaataaa tttagcaata atatcttagc aataatatca 660 aaactgaaat acaaatcaaa tctggttgtt gcggcgcgct agactttatc cagaacggac 720 gtcaaaatca gtgcgccaca accaattgtt ctcaaacttc ccatccccca acacgttaaa 780 ctcccccact catacttcag gaaagaaatg gccgaaaatc tggctaattc agactcttta 840 tccacaccca tcgaccacac tatggaaggt atcgaagaag gaaacgcaaa aagaagatta 900 aaccacaacc acgatcctcg cccacaccat tcccgtctag tagacaccct aaattcccct 960 ctcgaaagtg gcgccccagt gatagtgaac ctcgtcagtg agcctcaatc attcgatcgc 1020 ctcagccttc tagaaaagaa acagttcata aatggattaa tcaacaccat cggccaggtg 1080 aaagacggaa ccaaatggac tcgccaggga caactctaca tataccctac ttccaatcgc 1140 caaaagaaac agctactaga actcaaagcg gtaaaagaat tccagatctc gtgcaatcta 1200 gctaagtcag aaataaatgt taaaggagta atatataatg tgccgatcaa caactccgat 1260 gcggatctcc tagagctagt ttcatcacaa ggtgtcacac atgtgcacag attccagtca 1320 ggtccagaag aatcaaaaac tcccctgaca acagtggcat taaccttcaa cacccaaaca 1380 ctccctcgtg agattgtaat cgcccatgaa atattccgcg tcaagaaata catcccacgc 1440 ccctctcaat gtcggaaatg ctggtcgctc caacaccaag aagacacatg caaagttagc 1500 ccaatctgca ggtactgctc ccagcaacac cctcctaccc ctgtttgtac aaacgcacct 1560 aaatgcccaa catgccataa atctgaccat gccgcgggaa cctttgcctg ccccttattc 1620 gcaaacaaac aaaatgtgat tagatttgca tatgaaagta atatcccaat tagcgaggcc 1680 ggaaaattac tagaccgcaa caacgtcacc caaaccaaac caatccatct cccagtcttc 1740 gagaaagaga accccgaaat cctagccctc cgtcgggaag tacaaaacct ccaacaacaa 1800 attgaaaaaa tcctgcaatc accaccaatt accgaacttt cagaaagagt gacggctcta 1860 gagaccgaag tggtccaaat caaagaacag atcgagcccc tcctcacact ccccgcctca 1920 gtagagaaat ctaatcaaga gatgaaaaaa ggattctctg aaacccagga ccaactagca 1980 cagctatcag ccctcataca aagatccttg agcgtacagc ccagccaacc aggccaggca 2040 aaccgcccac ttcccaagcc aatttcgaat accacaacca acactagcgc ctctacatca 2100 aagaaatgac cccccccccc actaaagata atccaatgga acgcaagaag tctttataaa 2160 tctaagttac aagaattcaa gtacaatctt ttgttaacca acccccacat tgtcctctta 2220 agtgaaaccc attggaaaga ccattatacc gcacaatttt ctgcctacca aacattcgca 2280 ctaaatcgcc ccacacaagg tggtggagta gccatccttg ttaaaaaaaa tattcaaact 2340 tcaccactgc cactccccca atctagtgac atagaagcaa taggagtcac aataaaaact 2400 agaaacaata cgcacctaga cattatctcc ctttactgcc caaacgggaa ctcgtgcaga 2460 agaggaaaca tagaagccat cctcaacgcc ccacgtaata ccactatcat aggcggagac 2520 ttgaatgccc actcagggat gtgggaagac ggtaaacctg aaaataacaa tggaaagata 2580 attagcaact atctattaaa ccaatctaac ctaatactcg ccaccccgaa gaacctaggt 2640 acccggccga gtcacaacaa cacccaaaat tcaacaatag acctcacctt tacttcacca 2700 gcactaggac caaccacaac aatccatctc ggcccttact ggagcagtga ccatttgcca 2760 ataatcatca atatcaacac tgacctaccc cctacgcaac tgatcaaccc taactggaga 2820 ttcaatgaag agaaatggga ggagtggaac gaagaaatct caaaaacctt aatcaccaac 2880 aaattgagca acgcaacatc ccccgagtca gcctactcca tactatactc ctcaattatc 2940 gaggcaagcc agatacactt tacacccaac cgaaacgcgt tggtaaggga aaaaccgaag 3000 ccttggtgga ccccccaatg taaaaaggca accagcatgg tacgaagagc ttacaacaag 3060 tggcgatctt ccctcctaca atcagacaaa acagaattaa acagactaga agccatcaaa 3120 aagaagacta ttatcaaagc taagaacgat gcctgggaaa aattcattgg ttcactggac 3180 caaccgggaa acaccaccac cttctggaac ttcgccaaat tcatgctcaa agaaaagaga 3240 accaaacaat cctggcccac ccttgtggac caaacgaaca ggcaactcac cgacaaccta 3300 gacaaagcga atgctatgct agaccaactc tgcccaaaca acaaccaccc agaagatgac 3360 aggtcaactc tataccttca gaagatccaa gacgcaataa acaatgaaat tccccacccg 3420 atcaactctc aatttaacat tacagaactt aaaatgtgca ttaaaaccct ccctaacaaa 3480 gccatgggcg ctgacaggct tcacaacaag atgctctcaa aactatccga ccaaaacctc 3540 cactcattgc tatttgtact caactatatg ttccgtactg gatacatacc tgaagcatgg 3600 aagcacgcag tagtcacacc gatactaaaa ccagggaaac ccccagaacg cacagactcg 3660 tacaggccca tatcattaac ctcttgtcta gcaaaaatcc aagaaaaaat tattaataac 3720 agactgaaat ggtacctgga taaaaatgct caccttccaa aacaccaaac gggatttagg 3780 cgaggatgta caacaacaga caacctaatc cgactcgaag aagccatctc aaacggattc 3840 aacactagcc attccaccac agctgtcttc ctagacctgg ctaaagccta cgatgatacc 3900 tggttaaccg gccttctata caaagtcaca aaattcaata ttagaggtgt catgttaaac 3960 tggttaagca actttcttac taatagaaca gtcaacgtaa aaatcgaaga ttctctctca 4020 acaacacgaa ccataaacaa aggagtccca caaggatgcg ttctcagccc catactgttc 4080 aacattatga tggcagactt ccctcttcct gaccctcgca ccaacttggc tctattcgct 4140 gacgacatac taatacacac tacatcccca accaatccgg aagccgaacg tatcctacaa 4200 agatacctta acaaaatcga ttactgggca acctcatgga aattcaaatt ctccgttccc 4260 aaatgtgcca ccctaacctt ctcaaggaaa cgagccaagg agattgacgt aaagttattc 4320 ctgaacaaaa gccaaatcaa ctcagtcaac cactacaagt acctagggat caccttcgac 4380 tctaaattga actgggacaa ccacttaaac aataccctcc atagtgtaga aagaaaagca 4440 aacttgctca agctcctaac ctttggcaaa tcaacactaa acaccaaact attagtaaga 4500 atatacaaag ccacaatacg cagcaagctc gactacggcg ccaccgtcct atcttctata 4560 ccaaaatcca aaatgaacaa actagaaaca actcaaaata caatcctacg caacatcatc 4620 ggtgccctca agtctactcc gacccccctt atctaccttg aaacaggact aacctcaatc 4680 aaagataggt gggatctctt agcatcaaga tacctaaaca aactcaacac caagccatgg 4740 aacccagcct actccacaat ccaaaataca cagaacaaca ttaaaacctg gaaacccaga 4800 agcataccgg cagtaatccc ccatttaaat aagatgcgta tcaactccga agaatgtttc 4860 tccctaatgc cgagctacaa tccctggata gagcctatag ccccttggaa cctattcgaa 4920 ataaaaacca agttattccc actatctaaa aaagcagcga taaataaacc tcaacttacc 4980 caagaaatat ttagaaagct gatggaaaat gaacccacag ataacctaac catctacaca 5040 gatggttcat gctgcgaact cacaaacatt tcctcctgtg ccttttacat acctaaacta 5100 ggggaaaaaa aagcatgggc attatcagca ttcacaacaa gcttcaacgc tgagctagca 5160 gcaattaaac aagctttaca attcacttac ccactcgact ggccagttat cactattctc 5220 acagactcca aatcagcaat acaggccata tccaacttca aatgggaatc cagcacacta 5280 ataacagaga tattaaacat cataactaat caaaaatcag caggtaccca agtcaccttc 5340 gtatggattc ctagtcatgc tggcatacat ggaaatgaag tcgcagacga acttgccaac 5400 agaactcgaa cagaccctga cggactggtg ttggaataca caccaaacat ctctgaaaag 5460 ctcacaaacc taaaaagtaa acatattcaa tcaacgttcg aaaaaattaa atcatcgtca 5520 acaaacctag cggttttaca ccgaaccaac atgaagccaa tcccgtggtt tagacacaat 5580 aacagaagaa tccaagtcgc actactcaga cttagatgcg gccacaatag actaaaccac 5640 tgcgtaagca aatggaatgc cgacacaaca ccaaactgcc caaacggctg cgtagaaacg 5700 gagaataaca cacacgtcct catcacatgc cctcactact ctagagcaag atcgccactg 5760 aaaaatcttc tcgcctcgct acagctccct ctggatgtac ctacaataac cggaatgaat 5820 aactccatcc ctaaacacac acaaaaaaaa atcgcatcac gcctgattaa attcctgatc 5880 gaaacaaatt tagtcaacag actttaaaca gaaactagca caaccccctt ccccccctgt 5940 cttggctcaa aattcttcat cataaacaaa ataagaaact cacgaattat ggtgtataag 6000 ggcttcctcc ggaagcccgt aatctgcaac caatggcaca gccatggcca gaggcaccta 6060 tcctctcaaa tcaaatcaaa tcaaa 6085 // ID Kiri-28_AAe repbase; DNA; INV; 3054 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-28_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3054 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 723-723 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >94% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 28..2871 FT /product="Kiri-28_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MASRSGRDASSNWCLPGTVMKSALVKNKLSVCCINSQ FT SICARKFCKLDELRQIAQTSSVDIICVTETWLNEKIDDSLLTIEGYKIIRH FT DRKGRLGGGIAIFIKEAIHFKVLEKSLDKPESSYSEYAVVEIVTGSEKLCL FT AVMYNPPVVNCALIFDEILSSYGTQFNDILFVGDLNTNLLNHNSTRTTELL FT TSMSIHDMFSIGSEPTFFYNNGSSQLDLMLINCRSRVLRFNQVDVPSFSNH FT DMIFVSLDFDTTHTPKKIEYRDYKSIDIDSLIDDFDSLDWFNYYRSNDANF FT LLSFFNSSILALHEKYVPIRSFVSRNNKNEWFNSTISKAIVDRDLAYKRWK FT NSKSPDDLNLFKILRNQVTMKVRQSKKSHYERKLDTSLPSKELWKRLKNIG FT VTKTAKSPTTLFNSDEINANFANNFTDSSDESEVNSIFNESFRRNRFYFDQ FT INEIDVVNALYHTKSNAIGLDELPIKFLKSINPLIIRPIAHLFNCIIATSI FT FPDEWKKSKIIPFKKKANLNSIQNLRPISILSALSKVFERIIKEQVCLFIN FT RENLLISHQSGYRAKHSTKTAMLKVFDDIGLVIDNGRPVVLLLLDFSKAFD FT TISHRILCRKLTENFGFSSHAMNLIKSYLVGRTQTVLNDGVFSSYLPISSG FT VPQGSILGPILFSIYINDLPNVLKYCQIHIFADDVQLYFGCCDDSSSIISR FT KINEDLTSIAEWSIRNKLLLNANKTCATFLSNSRSNSILKPILKINNTEIE FT FVDQAICLGITIHSSFKFDNFIFGQCGKIYASLRVVRAVSSFLNVDIKLKL FT FKSLILPHFIACDFILSKSSMYAESRLRIALNACIRVVFGLDKYSSVSHLQ FT HYLLGCPFDKFAPLRCCLFLFKLATHKTPGYLYEKLKPLRNSRAKKYAIPR FT HNTSSYGNSFFVRGISYWNSLPNDITLESSLFAFKRKCTEYFNS" XX SQ Sequence 3054 BP; 1011 A; 553 C; 497 G; 993 T; 0 other; taatcgctaa gatcaaaatt tatcctaatg gctagtagat ctggtagaga tgcctcgtcg 60 aattggtgtt tgccagggac cgttatgaaa agtgcactcg ttaaaaataa actttcagtg 120 tgttgtatta acagccagag tatatgcgct cgcaagtttt gtaaattaga cgaactgcgt 180 caaatagctc aaacgagctc agttgacatt atttgcgtaa cagagacatg gcttaatgag 240 aagatcgacg attcattgct gacaatcgaa ggctataaaa ttataaggca tgacagaaaa 300 ggaagattag gaggtggtat cgcaattttt ataaaagaag caatccactt caaagtactc 360 gaaaaatcat tagataaacc tgaatcgtct tattctgaat atgcggtagt cgagatagtt 420 actggttctg aaaaattatg ccttgctgta atgtacaacc cacctgtcgt aaactgtgct 480 ctaatatttg atgaaatatt atctagttat ggcactcaat tcaacgatat actatttgtc 540 ggtgatttga acacaaattt gctgaaccat aattcaacga gaacaacaga acttctaacg 600 tcaatgtcca ttcatgacat gtttagtatc ggatccgaac caaccttttt ttataacaac 660 ggctcatcgc agctcgatct tatgcttatt aattgtagat ctcgagttct aagattcaac 720 caagtcgatg tcccatcatt ttcaaatcat gacatgatat tcgtttcgct cgatttcgac 780 acaactcata cacctaaaaa aattgaatat cgcgattaca agtccattga tatcgatagt 840 ctcattgatg attttgattc tctagattgg tttaattact atcgttcaaa tgatgcaaat 900 tttcttctta gttttttcaa ctcaagtatt ttagcattac atgagaagta cgttcctatc 960 cgttcatttg tttcaaggaa caacaaaaac gaatggttca acagtacaat cagtaaagca 1020 attgtggatc gggatcttgc ttacaaaaga tggaaaaact cgaaaagccc cgatgaccta 1080 aatctattca aaatactccg caatcaggta accatgaaag ttcggcaatc taaaaagtca 1140 cattatgaaa gaaaactgga tactagtttg ccttccaaag aactatggaa aagattaaaa 1200 aatattggtg taactaaaac agcaaaatca cccacaactc tgttcaactc agacgaaatc 1260 aatgccaact ttgctaataa cttcacagat agtagtgatg aatctgaggt caacagtatc 1320 ttcaacgaat ctttcaggag aaatcgtttt tactttgacc aaatcaatga aattgatgtc 1380 gtcaatgctt tatatcacac aaaatctaat gctataggac tagatgaact tccaattaaa 1440 tttctgaaat caatcaatcc attgataatt agacctattg cgcatttgtt taattgtatt 1500 atcgccacga gtatatttcc ggatgaatgg aaaaaatcca aaataatacc attcaagaaa 1560 aaggctaatc taaattctat acaaaatctt cgtccaataa gtatattgtc agcgttatcc 1620 aaagtttttg aacgaatcat caaggagcaa gtctgtcttt ttatcaatcg ggagaatttg 1680 ttgatttcac accaatctgg ctatcgcgcc aaacatagca caaaaactgc aatgctgaaa 1740 gtcttcgatg acataggtct tgtcatagat aatggacgac cagtcgtttt attgcttctt 1800 gacttttcga aagcctttga caccatctca caccgaattc tttgtcgcaa gttaacagaa 1860 aattttggtt tttccagcca cgcaatgaat ctcataaaat cgtatttagt tggacgaacc 1920 caaactgttt tgaatgacgg ggttttctct agttatctcc ctatttcttc cggtgtaccc 1980 caaggttcaa ttctgggacc tattctcttt tctatataca ttaatgacct tccaaatgtt 2040 ttaaaatatt gccaaatcca tatatttgca gacgacgttc aattgtattt tggatgttgc 2100 gatgattctt cgtcaataat atcaagaaaa atcaacgaag atcttacaag cattgctgag 2160 tggtcgataa ggaataaatt attattgaat gcgaataaga cttgtgctac gtttctaagc 2220 aattctcgat ctaatagtat ccttaaacca attttgaaaa taaacaacac tgaaattgaa 2280 tttgtagatc aggcaatctg tttgggaatt acgattcatt ccagttttaa atttgataat 2340 ttcatatttg gtcaatgtgg taaaatttat gctagtctac gagttgtccg tgcggtttct 2400 tcttttttaa atgtcgatat aaagctcaaa ctcttcaaaa gtttaattct tccacatttc 2460 atagcctgtg attttattct ttcgaaatct tcaatgtacg ctgaatctag attacgaata 2520 gctcttaatg cttgcattag ggtagtattc ggccttgata aatattctag tgtctcccat 2580 cttcagcatt acttgttagg atgcccattc gacaaatttg ctccattaag atgctgctta 2640 tttttgttta aacttgcgac acataaaact ccaggttatt tatacgaaaa actaaagcct 2700 ttgagaaatt cccgtgcaaa aaaatatgct attccgcgac acaacacatc ctcttatggc 2760 aattcctttt tcgtgagagg catctcctat tggaattcat tacccaatga tataaccctt 2820 gagagttctt tatttgcgtt taagaggaaa tgtacggagt actttaatag ttagcttgaa 2880 gttattttta agattgtata aattattgaa atgtttgaat ttttattttg aattagtaat 2940 ggtaaacgac acaccttcct acgcattcct aacattatgt aacaattcaa aaagacgcta 3000 agtcttatgt tacgaattat tagtgaaata aataaaataa aataaataaa ataa 3054 // ID MuDR8x_AP repbase; DNA; INV; 1928 BP. XX AC Contig51986; XX DT 25-JUN-2009 (Rel. 14.07, Created) DT 25-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR8x_AP. XX NM MuDR8x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1928 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1357-1357 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(521..601,712..1512) FT /product="MuDR8x_AP_1p" FT /translation="MKNERIISTMFSEKGETLKVVNDFKFGFNSHSRKTRI FT KNFESPKVNNSLKRKAITDVSERPSKLFHRLLKEENVSTLTLTDVTYYIKY FT NMHYARRTSCPKLPKTISDVHSILNSIEMIRGCRFHLGQSWWRKIQELGLA FT SDYIQNNEIGKYLALTFGLSFLEPQEVGDFFSFEMSAIQPDEHKITEFANY FT LVDTYIGENSIFPPDLWAEKSSSSQKTTNACESFHSKFNKFFDSPHPNIYT FT FLEVLKNMQTDTTILIRSSLEKRRVPRKPIRERIQYIENKIMKLQSKIMLT FT Y*" XX SQ Sequence 1928 BP; 730 A; 263 C; 276 G; 659 T; 0 other; ggcgcacacg aattgcgtac caaagcaggt aaacgaacct ttttgaacga attgcgtccc 60 ggacgcaatt catgttactt tttgccttaa tatatttttg gttgttttca attttttttt 120 taattttgat aatattttta aattttttaa caatttaaat atttatcaca aatattatta 180 ttatatgaaa actagatttt agtgggtttg tacttttata gatactttta taaaaatatt 240 tcggactata cggactttcc aatcaatatg taactatgga ttataagcga gacactatat 300 aagataacat cacatatagt atgatgtagg tgatcatgtt aacaccacag tgattagata 360 caggtgcccg attaatatag ataatccact tttcagtcat aactgataaa attagtgaca 420 ccagaattgt ctgacaagta acattcagta gctacattat attggtgtcg tctgcagtct 480 gctcaaattt actatttatt cattcaacgt ttttattaaa atgaaaaatg aacgtattat 540 atcaacaatg ttcagtgaaa aaggcgaaac attaaaagtt gttaatgatt ttaaatttgg 600 ataacacaaa aattattcga atggagatat tagatggaaa tgtacaaata aaaactacag 660 agctttttta cgaatggata cacaagataa actaaaagtt agtgttagta atttaattca 720 cattcacgaa aaactagaat taaaaatttt gaatcgccaa aagttaacaa tagtttaaaa 780 cggaaagcaa ttacagacgt atcagaacga cctagtaaat tatttcatag actattaaaa 840 gaagaaaacg tgtccacgct aacattaact gatgttacgt actatataaa atataatatg 900 cattatgcac gtagaaccag ttgccctaaa ttaccaaaaa ctataagtga tgttcattct 960 attttaaact cgatagaaat gataagaggt tgtcgatttc atctcggcca gtcttggtgg 1020 agaaaaatac aagaactagg cttggcttca gactatatcc aaaataatga aataggcaag 1080 tacttagcat tgacatttgg attatctttc ctagaaccac aagaagttgg agattttttt 1140 tcgtttgaaa tgtctgctat tcaaccagat gagcacaaaa ttacagagtt tgctaattat 1200 ttagtagata catacatagg tgaaaattca atatttcctc cagacttatg ggcagaaaaa 1260 agttcgagct cacaaaaaac tactaatgct tgcgaatctt ttcattcaaa atttaacaaa 1320 ttttttgatt ctcctcatcc aaatatatac acatttttag aagttttaaa aaatatgcaa 1380 accgatacaa caatattgat aagaagtagt ctagaaaaac gacgagtacc aagaaaacca 1440 ataagggaaa gaatacagta tatagaaaat aaaataatga aattacaatc aaagattatg 1500 ttaacatatt agccaataaa tataaataca gacaaaatta aattactaac ctatgatatt 1560 atgattttat aatttaattt ttttttttta aatatgtgaa tataaccatt tattcctatg 1620 atattacaat ttaaattttc taaaaatatg taaatatttg acgatttatt atatatgatt 1680 tgtgaaatta cgtttgctca gttttaattt tttttttttt gaatgatttg tgaatatgac 1740 catttatact tatgaaatta tgatttttta aaaatatgtg aatatatgac gaattattat 1800 atacgatatt atgttttcac tttaagaacc attcaactgt agaaaaaaag gtaacacgaa 1860 ttgcgtccgg gacgtaattc gttcaaaaag gttcgtttac ctgacttggt acgcaattcg 1920 tgtgcacc 1928 // ID TTAA2_AP repbase; DNA; INV; 427 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA2_AP. XX NM TTAA2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-427 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1780-1780 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 427 BP; 150 A; 73 C; 73 G; 131 T; 0 other; gaggacgcta cccatacatt tgttgtctcc gtcttacaca cgcagtcgac atagcaaatt 60 gtcgttcact agtttcaata gtgtgctgtt agttttgata ctagagtgaa ttgacctatt 120 ataaaacttt taggtaagaa cattatctgt gcttaagcgt tgccgatttt tcgaaatttt 180 cattttcaag caagatatgg gtatgtgaaa tatcaaaaat taaaaatgct catatctcgc 240 ttgaaaatta aaatatcgaa taaaagccaa cgcttaagca cagataatgt tattatctaa 300 aaaattgata ataggtcaaa tcactctaat atcaaaacta acagcacact attgaaacta 360 gtgaacgaaa atttgctatg tcgtacgtgt gtaagacgga gacaacaaat gcatgggtag 420 catcctc 427 // ID Transib-1_CQ repbase; DNA; INV; 3362 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Transib DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Transib; DNA transposon; Transposable Element; Transib-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3362 RA Kojima K.K. and Jurka J.; RT "Transib DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 631-631 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >99% CC identity. 5bp TSDs. 11bp TIRs. XX FH Key Location/Qualifiers FT CDS 757..2847 FT /product="Transib-1_CQ_1p" FT /note="transposase." FT /translation="MDNNTVVSMFDVTKHHSENVKTAFEYICKNYNIAEPS FT HDLLREKLRMLFCSFKTRYTKAHRRSSFFESSNNNWLHSEFDLEGFIVENV FT NLTSGSTKKRGREPKEFDAKSDRSVRRTIDNESLDPKVDSIHKALLMARRT FT AFKRHDVNVVKVIGHLLKNQDYFSKMYVQLTNDRQLKTPEEAFALIIEAHL FT TKAQYEIFHFDCPSRYPPYNVIAGAKKTCAPPVEFIEGSASKIKVELQALM FT NHTASRILQVIRQDVTDYLDSHDLDCAELVLLSSWGMDGSTGYSQFNHSLP FT EGSIDDSDVFAATLTPIQLYVHSDGKHILWHNPTPQSIRFCRPIMLQFIKE FT SREIILKTKHDLEKEMCELTPLKIDLPGDKFVLIDFNFVLSMIDGKVLTYI FT TGSSSMQNCPICGATPNIMNSIEKLEEGFTANEDTLYYGISPLHAWIRFFE FT CLLHISYRMEIKQWRVTKDLKLNYLKRKKMVLEALHKAFGFRADQPRSGGK FT GTSTTGNICRKAFSNPELLSSVLGIEKELIERFRNILIAINCQEAINPEIF FT DEYCKDTYRFYLEHYEWYKIPATLHKVLAHAGDIILHSPAPLGVLAEEAAE FT CQHKHLKLFRSHFARKRSREANLMDVFLRALHESDPYLSSMWVSKKHTKKV FT CSSYPTVVRSFMVFEDSESDTSSNIMNDLMDAVDEVDEDFEEDVVDED" XX SQ Sequence 3362 BP; 1144 A; 513 C; 647 G; 1058 T; 0 other; cactatgggg tttcaggcgg ccataaatcg aaaaaccgac atactctaat tatttttttg 60 gtattttttt tcgaaaataa acattctaac taagaaaaat ccggagtttg aaagttgtag 120 gttcaaaatt cactgaattg cagaccttca aaggcaaaaa tggtaagaaa aaacatgttt 180 tttgacaaaa aaatgattat ttttcatagt tgtactgtgt agctgtttat gtattttttc 240 ctgcatcatt atatatattc agaaaggtaa ttgattcaac ttttcaataa aattttacaa 300 aacacacttt tacaggctaa aaaaaataat aaaaatcaaa aaagatggat ttttgttcaa 360 aatttagttt tttttcaact ttgttaatgg agtacttttt ttgtgtgcat ttgtgcgatc 420 tgaaaatttc ccagcttaaa atttgaacca tgtcctaact tgtctccaaa tttcaaagtg 480 cacttatgtg cactttttga gttatgccat tttgaacatt ttgattcatg caagaaaatt 540 aatgatttcg cgcatacgct tggttaaaat cacaatattt acctgcggag gcagaaatga 600 cgtacctgct atgtatgttt tggaattgat ttgatattca tgactttgtt ggtacttagg 660 tacgtttaat aaaattagaa atacctatgc taagtattta acagcaacaa ttcttcgaat 720 attcatatca gttagttgtt gtgttctttt aaaaatatgg ataataatac cgtagtgtcc 780 atgttcgatg taacgaagca ccattctgaa aatgtgaaaa ctgctttcga gtatatttgt 840 aagaattaca acattgcaga accatcacat gatctactca gggaaaaact acgcatgttg 900 ttctgtagct ttaaaacgag gtatacaaaa gcccatagaa gaagttcatt ttttgagtca 960 tctaacaaca attggttaca tagtgagttt gatttagaag gatttatcgt ggaaaacgtg 1020 aatttaacaa gtggatcaac gaaaaaacgt ggacgagaac ccaaagagtt tgatgcgaag 1080 agtgatagga gtgttcgacg taccattgat aatgaatcat tggatcctaa agtggattca 1140 attcataagg ctttgttaat ggcaaggaga acagcattta aaaggcacga tgtgaatgta 1200 gttaaagtga taggacacct cctgaaaaat caggattact tttctaaaat gtacgttcag 1260 ttaactaacg accgacagtt aaaaacgccg gaagaagcat ttgcgctgat tatcgaagca 1320 caccttacta aggctcagta tgaaatattt cattttgatt gcccttcgag atatccgcct 1380 tacaatgtta tagcaggtgc caaaaaaaca tgcgcacctc ctgttgaatt catcgaaggg 1440 tcagcctcga agataaaagt ggaattgcaa gctctgatga atcatacggc atctagaata 1500 cttcaagtaa taaggcagga tgtgacagat tatctagata gtcatgattt agattgcgcc 1560 gaattggttt tattgagtag ttggggaatg gatggatcca ctggatattc tcaatttaat 1620 cattcattgc cagaaggtag cattgatgat tccgatgtct ttgctgcaac gttaactccc 1680 atccaattat acgtgcatag tgatggtaaa catattttat ggcataatcc tacacctcaa 1740 agcatcagat tttgtcgacc aattatgtta cagtttatta aagaatctag agaaatcatc 1800 ctgaaaacaa aacatgatct ggaaaaagaa atgtgtgaat tgacaccttt gaaaattgat 1860 cttccgggtg ataaattcgt gttgatcgat tttaattttg ttttgagtat gatagacgga 1920 aaagtgctga catatattac tggttcatca tcgatgcaaa attgtcctat atgtggggca 1980 acacctaata taatgaactc catcgaaaaa ctagaagaag gatttactgc caatgaagac 2040 acattatact atggaatatc tccacttcat gcttggatac ggtttttcga atgtctgtta 2100 catatttcat atcggatgga gatcaaacaa tggagggtta cgaaggatct taagctgaat 2160 tatcttaaac gaaaaaagat ggttctggaa gcattgcaca aggcatttgg atttagagct 2220 gatcaaccaa gatcaggcgg taaaggtacg agtactaccg gaaacatttg taggaaagca 2280 ttttctaacc cggaactttt aagcagtgta ttgggcattg aaaaagagtt gatcgagaga 2340 tttcgtaata ttttgatagc gatcaactgc caagaagcca ttaatcctga aatatttgat 2400 gaatattgca aagatacata taggttctat ttagaacact acgaatggta taaaattcct 2460 gcaactctgc ataaagtgct tgctcatgca ggtgatataa ttcttcattc tcctgctcct 2520 cttggtgttt tggcagaaga agccgctgaa tgtcaacaca aacatttgaa attatttcgt 2580 tcgcattttg ctagaaaaag atcacgagaa gcaaatttaa tggatgtatt tttaagggca 2640 ttgcatgagt cggacccata tttaagttca atgtgggtaa gcaaaaaaca taccaagaaa 2700 gtttgctctt catacccaac agttgttaga agtttcatgg ttttcgagga ttctgaaagt 2760 gatacaagtt ctaacataat gaatgatctt atggatgctg ttgatgaagt tgacgaagat 2820 tttgaagaag atgttgtgga cgaagattag atttttgaca gatgaagtgc attgagatat 2880 atctagtcat gtaatattta ttaagtgttg gtcaataaat ttttaataat caaattaatt 2940 gcattttcat aaacattcct ttgtttcctc cccttctgtg atggtacaat gttccagaaa 3000 atcctaaaac atcgtaattt tttttaattt gggtggtgag gggggggggg ggtcaaagag 3060 aggtagattt tgactcaaga ggaaggggag ggggattgaa agtaaacgaa aactataaca 3120 aagcataagc cgccattccg tgcatcaaac agacagagca aggatattaa aattcaccat 3180 ttcaatgcat gtgggctcaa atgtataaaa aaatcgaaaa aaatcgaaaa tttaactttt 3240 ttttttgaaa aatatcaaat ttcatttgtt tacatgataa taagtacaaa caacacaaaa 3300 aagtcacaaa aaagtgaaaa aatatttttg gccgctcgct tctatggaaa taccccatag 3360 tg 3362 // ID Copia-97_AA-LTR repbase; DNA; INV; 220 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-97_AA_; KW Ty1_copia_Ele184; Copia-97_AA-I; Copia-97_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-220 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 220 BP; 55 A; 47 C; 39 G; 79 T; 0 other; tgttagagga gcaacgacct cgtaatataa acattttgtg ccagcctaat cattcgtgtg 60 aattgtacca gtctaatgtt tcctctgctg gctatgcaat gcgcgccatt ttctatctga 120 aaagtgaact cgttgtgtgc gcatcgttgt tttaataaat cgttatttaa ttcgttattc 180 agtataactc agacttttat tccactgctg cctaatccca 220 // ID Gypsy-8_DVir-I repbase; DNA; INV; 4420 BP. XX AC scaffold_12963; XX DT 10-MAR-2011 (Rel. 16.03, Created) DT 10-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_DVir_; KW Gypsy-8_DVir-LTR; Gypsy-8_DVir-I. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-4420 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (09-MAR-2011). XX DR Genome; scaffold_12963; Positions 2607611 2603192. XX CC 'TATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 106..4068 FT /product="Gypsy-8_DVir-I_1p" FT /translation="MQMSNSASSTKKSISLPKFNPDIADADADAWCKTVDI FT ILDDNPIEGSTLVMTLSGSLEGSASPWLSQICYTGMNWQNFKELFLQRFVG FT METSAATLMNIFNGRPNDGECLSVYANRLVTSLLSKWKTANMEEIAVSVVL FT AHTSKIDKSLQRMLYTTNIKSQDEMQQHLKAYAYGRSNGGLQPNIPMEPDR FT KRSKTYGIKCHHCGNLGHKMVECRKRKYEEGNVSKNSALPKVRSAVTCFKC FT GDAGHIASSCTKGVSTSGVGSHDKRINICCVSEPKGILRQSGELVPFCFDS FT GAECTLIRESCAEKFSGKRINNLIVLRGIGENPVYSTMQILATVKIDQYSL FT EILFHVIMDKFLKYDVLIGRDILGLGFGVSIEADKFEMYKTKTVDVLIKVP FT DISYVEHLNNDRALSNTDKNRLLELLQNYADKFIEGIPQTRVTTGELEIRL FT LDPNKTVQRRPYRLSIEEKEVVRNKVKELLDANVIRPSCSPFASPMLMVRK FT KNGTDRLCVDYRELNANTVADKYPLPLISDQINRLSEGEYFSSLDMASGFY FT QVPIHPDSIERTAFVTPEGQYEFLAMPFGLKNASSIFQRAIVRALGDLAQT FT YAIVYIDDVLILAKTKEEALERLQTVLENLSRAGFSFNITKCSFLKTSIEY FT LGYLVEAGQIRPNPRKIQALAELPSPQSVTQVRQFIGLASYFRQFVPKFSE FT QMKALYNLTSKNNKFVWKPEYEEIRQKIVSILTNDSVLTIFDPLKPIELHT FT DASSIGYGAILMHKIENRPRVIEYYSKCTTMAEAKYHSYELETLAVYNAVR FT HFRHYLHGQKFIVHTDCNSLKASIKKINLTPRVHRWWAYLQSFDFSIEYRK FT GERMLHADFFSRNPLPTDYKSSAKTVEMHINLAEITDDWLLAEQQRDSDIA FT SITLKLRNHKMGEAISASYELRAGVSYRRIQRNGSTKCLPVIPRAFRWSVI FT NNVHEAIMHLGWKKTLDKMYANYWFEKMSKYVKKFVENCITCRVSKPPSGK FT PQIEMHPIPKDDIPWHTVHIDITGKLSGKNDLKEYIIVQVDAFTKFVHLQH FT TLNINTENCIKAVKETVALFGVPARIIADQGRCFASTRFREFCSSQKIGLH FT LIATGASRANGQVERVMSTLKSMLTAVESSQRSWQDALKDIQLAINCTTNR FT VTKYSPLELLIGKEARPLGMITISDEKTIDKDLIRAQAKDNMVNNAKYDKA FT RFDKNKAKIVKYKLGDHVLLKREERNQTKLDPKFKGPFEIIEILDGDRYLL FT KSLTSKRTFKYAHEYLRAFPSNNVSEELIDLRDDECDNVEDIAQDTVALRE FT SAVGLIHDG" XX SQ Sequence 4420 BP; 1484 A; 828 C; 972 G; 1136 T; 0 other; tcagaagtgg gattaggcga tgcccctcca cgaggcaatt ctgacgattc gtggcgtcag 60 cttttggaat ctcaaaatcg aaatattctt gagattatta aagcaatgca gatgtcaaat 120 tcagcatcct caacaaagaa gagcatctcg ttgccaaagt tcaacccgga cattgcagac 180 gcagatgcag atgcctggtg caaaactgtg gacataatac ttgacgataa tcctattgag 240 ggcagtacac tggtgatgac cctcagcggc tccctagaag gaagcgcctc gccctggctt 300 tctcaaatat gctatactgg aatgaactgg caaaatttta aggaattgtt tttgcaacgt 360 ttcgtgggaa tggaaacatc ggcggcaact ttaatgaaca tttttaatgg ccgaccgaat 420 gatggcgaat gcctctccgt atatgcgaac cgtcttgtga cctcactttt atccaaatgg 480 aagacagcga acatggagga aattgcagtc tcagttgttt tggcacatac atcgaaaatc 540 gacaaaagtc tgcaacgtat gttatatacg actaatatta aaagtcaaga cgagatgcaa 600 cagcatctga aggcctatgc ttatggaagg tcaaatggag gtctacagcc aaatatccca 660 atggaaccag atcgcaagag atcgaaaaca tatggcatta aatgtcatca ttgtggaaac 720 ctcggacaca aaatggtcga atgtcgcaag aggaagtatg aagaaggcaa cgtgtcaaaa 780 aactcagcat tacctaaagt tcgatctgcc gttacctgct tcaaatgtgg cgatgctggc 840 cacattgcgt cttcatgtac gaaaggagtt tctacgagcg gtgttggatc acatgacaag 900 cgtataaaca tctgttgtgt gtccgagcca aaaggaatcc tgcgtcaatc tggtgagctc 960 gttccatttt gttttgactc tggagcggaa tgtacgctca tcagagagag ttgtgccgag 1020 aaattttccg gaaaaagaat taataatttg atagttttga ggggtattgg agaaaatcct 1080 gtgtatagca ctatgcaaat cttggctact gttaaaattg atcagtactc cttagaaata 1140 ttatttcatg taataatgga taagtttttg aaatacgatg tattgatagg ccgtgatata 1200 ctcggattag gctttggagt ctccatcgag gccgacaaat ttgaaatgta caaaaccaaa 1260 actgttgatg tactaataaa agtgccagat ataagttacg tagagcatct gaataacgat 1320 agagcattaa gcaacactga taagaatcgt ctgctagaat tattgcaaaa ctatgctgat 1380 aaatttattg agggtattcc tcaaacgcga gtgactacgg gtgaacttga aatccgcctg 1440 cttgacccaa ataaaactgt acaaaggcgt ccgtataggt taagcataga agaaaaagaa 1500 gtcgtgcgaa ataaagtaaa agaattgtta gacgcaaacg ttatacgtcc tagttgctct 1560 ccatttgcaa gcccaatgct gatggtaaga aagaaaaatg gaaccgatcg actatgtgtc 1620 gattatcggg aactgaatgc taatacggtc gctgacaaat atccgctgcc tttgatttct 1680 gatcaaatta atagactgag tgaaggcgaa tatttctcca gtttagacat ggcaagtggt 1740 ttttaccaag ttccaataca cccagactct atcgagcgca ctgcttttgt caccccagaa 1800 ggacagtatg aatttttggc catgcctttt ggattaaaaa atgcttcctc gattttccag 1860 cgcgcaatcg tgagagcatt gggtgatttg gctcaaacgt atgccatcgt ctatatagat 1920 gacgtgttaa ttctagcgaa gacaaaggaa gaggcacttg aaaggctaca aacggtcttg 1980 gaaaacttgt ctagagctgg attctcattt aacattacaa aatgctcgtt cttgaaaaca 2040 agcatagaat atttaggata cttggttgaa gcgggtcaaa ttcgaccaaa tccacgcaag 2100 atacaagcgt tggccgaatt gccgtcaccg cagtcagtaa cgcaggtgcg acaattcata 2160 ggcctagcct cgtattttag gcaattcgta ccaaaatttt ctgagcagat gaaagcttta 2220 tataatctca cctccaagaa taataaattt gtctggaagc cggaatatga ggaaattcgt 2280 caaaaaatcg tatcaatttt gactaatgac tccgttttga caattttcga tccactaaag 2340 ccaattgaat tacatacaga cgcaagttct attggatatg gtgctatttt aatgcacaag 2400 atagaaaata gacccagagt cattgaatat tacagcaaat gtacgacaat ggctgaagca 2460 aaatatcatt catacgagct tgagaccctc gcagtttaca acgcagtaag acacttccgc 2520 cattatttac atggtcaaaa atttattgtg catacagatt gcaactccct gaaagcaagt 2580 atcaaaaaaa ttaaccttac cccaagggta catagatggt gggcatatct gcaatcattt 2640 gatttcagca ttgaatacag gaagggagaa cgcatgttac atgctgattt tttctcgcgc 2700 aatccgttgc ctactgatta taaatcctca gcgaaaacgg ttgaaatgca cataaaccta 2760 gctgagataa ccgacgattg gcttctagcc gaacagcaac gagattctga cattgcttcc 2820 ataaccttaa aattgcgaaa ccataagatg ggagaagcta tctcagcgtc gtacgaattg 2880 agagcaggag tatcatatcg aaggattcaa agaaatggca gcacaaaatg tttgcctgta 2940 atacctagag cctttagatg gtcagtcatt aacaatgttc acgaagccat aatgcattta 3000 gggtggaaga aaactctaga caaaatgtat gccaactact ggttcgagaa gatgtctaag 3060 tatgtgaaaa agtttgtaga aaattgcatt acctgtagag tgtcaaaacc accatctggt 3120 aagccacaaa tagaaatgca cccgattcct aaagatgaca taccctggca cactgtgcat 3180 attgatataa ccggaaaatt aagtggcaaa aacgacttaa aagagtacat catagttcaa 3240 gttgatgcat tcacaaaatt tgttcacctt caacatacgc tgaacattaa tacggaaaac 3300 tgcattaagg cagtcaaaga aactgttgca ctattcggag tgccggcacg catcatagct 3360 gatcagggta gatgttttgc tagcacaagg tttagggaat tttgttcttc gcagaagata 3420 ggtttacacc tgattgcgac gggcgccagt cgagcaaacg gccaagtaga gcgtgtcatg 3480 agcaccttga agagcatgct cacagcggtt gagagtagtc agcgatcctg gcaagacgct 3540 ttaaaagaca tacagctggc aataaattgc acgacaaatc gtgtaacgaa atatagtccg 3600 ttagaattgc taatcggtaa agaggcaagg ccgttaggaa tgataacaat atcagatgaa 3660 aaaacaattg ataaagattt aataagagca caagcgaaag ataacatggt gaataacgct 3720 aaatatgata aggctaggtt tgataaaaat aaagcaaaaa ttgtaaaata taagcttggt 3780 gatcatgtcc tactcaagag ggaagaacga aatcaaacga agttagatcc caagtttaag 3840 ggaccttttg agataatcga aatacttgat ggagacagat atttattaaa gtcattgacg 3900 agtaagcgca cgtttaagta tgcacatgag tatttgcgag cttttccaag caacaatgta 3960 tcggaggagt tgatagattt gagagatgat gagtgtgaca atgttgagga cattgctcaa 4020 gacactgtgg cactaagaga gagtgcggtt gggctcatac acgatggcta agaccgcacc 4080 ataatactca cgccgtgcac aagtgctcga tcatggcgtg ctgaagcggg tgatattatg 4140 gcgggggtca gaggttgaaa gacctgtatg gatccgctcc atgttatcgt tgctgggcag 4200 caagagccta tagacgtgtg atgacagaac aacaacaaaa aaaaaaaaaa aaaaaaaaaa 4260 aaaataaaaa aataaaacct acaatgaccg atgatactgc tgtttgtaaa gctattctta 4320 aagtaaagta atgactaata aattttaccc aacaattttg actatggcat gatctaagcg 4380 aaatgtacac acgaggacgt gtgaaatgtc aggaaggccg 4420 // ID Crack-36_AAe repbase; DNA; INV; 4006 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-36_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4006 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1252-1252 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. It is positioned at the deepest branch of CC Crack/Daphne CC in the RTclass1 phylogeny. XX FH Key Location/Qualifiers FT CDS 119..1018 FT /product="Crack-36_AAe_1p" FT /translation="MPSTTRNQSKKRVDISPTEGDKSKESTSSENPAKRHW FT KNHNVTMSKDMTLMDLHKLLKSEIQSSKEEIQSSIMDMKGEIRKLSSEVDE FT VRQSQSFINAEFEHFKSGLSKITDEMKKTTSDVALLKSGHAELSSQLATME FT HDLNFVQQGQLSNNLLISNVIKTADEDLCAILCKICEVLEVELFEREVLVI FT TRLATRNTKQIEPILVQFANRFVKDRIISAAKNTTLSCQNIGFTIDQRIYF FT NHHLTPHNQALLQMARAYKRKHGYRFAWFSKGHIYIKRDENSTAVRITSSA FT DLPSEIDE" FT CDS 1072..3882 FT /product="Crack-36_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEHLSDHLTSMDIEHRLNSVSVSSSTLNLLHLNTRSC FT RGKMSDLVQLIDQLRHVHVVVFSETWLYQNEICNMINYDVYHSCRPTRGGG FT VSMFILSSFKHRKIYELVDDMNNFLIIELMEYNLKVFGVYTPGRNVDYFLQ FT KFEETLSNFKRMVVMGDMNINLLDDSNPIVNNYICSLNANGFVVLNKMNSA FT HATRTSNTISTIIDHVITDILDRKMVLVINDTEAHVSDHKTLVFSIQHNVQ FT SKRRIYEKHVIHYENILHENFRAELDEVQNYDEFESLCNHKINTNRSIYTV FT VDDRKPLQPYIDRNLYAEIRKKNKLFKLSKANPNNDALKRNYLLLRNSIRN FT KTKLAKTRYYNAKFENCVGDPKKMWATINEVVFNRPSSKQDNQINLVENGK FT KILCPNENAQIFNEYFLSIGSQNTSNTNANLEQNDNPELFDFLQVTGRDVE FT QTITKLNRTAAVGKDGISVKFLNRTKSFITPKLTDLINDVIATGSFPDKLK FT MSKVIPLFKSGDCTLKTCFRPISILPATSKVFESVLENQMRSYLLNNNIIN FT RNQYGFQKKSNTVAASADMMNYIYRAKDRGLKVICVFLDLSKAFDCISHTR FT LLGKVQKYGFSEHAVKLLRSYLDNRLQYTFVNNCYSAPGIVKSGIPQGSIL FT GPLLFIIYVNDMWNLPLRGRLQLYADDACLVYEASTTEELEFDIQHDMNLL FT LRWLSENGLKLNVEKSSYIALNNNLSVDIQVDSILLKKTDTSSFLGLILDT FT NLSWKAHIEKVIKKVTPYVFAMRKARKFISMDTCWKLYNSFILPHFTYMSC FT LWGCAASIYLKPLEIIQNKVVKIIRNLPIRFPSVELYSESTLSIRSLCKYN FT MMIHIYKVRNNLIKTNVVLFNVSDVHSHNTRQQNSIFVNCPRTQYGWKNLF FT YQGIIMYNSLPNEMKSLNLRLFKRSLKQYLCNM" XX SQ Sequence 4006 BP; 1405 A; 700 C; 758 G; 1142 T; 1 other; cagtttcagt tgagctgtcg atgagtgcac acgtcttttt cggtcgctgc tgtgtgttgg 60 gctctgtttg tccgtattgt ctttttgagc aatcgcgtgt gttttctcat cttgttctat 120 gccgagcact acaagaaatc agtcgaaaaa acgagtggat attagcccaa ctgaaggtga 180 taagagcaag gaatcaacgt catccgagaa tccagcgaag cgtcattgga agaaccataa 240 cgtcaccatg agtaaggaca tgactcttat ggacctacac aagttgctca aaagtgaaat 300 acaatcttcc aaggaagaga ttcagagcag cataatggat atgaaagggg aaatcaggaa 360 acttagttca gaagttgatg aagttaggca atcccaaagc ttcattaacg cggagtttga 420 acatttcaag agcggtctga gtaaaatcac ggatgaaatg aaaaagacca cctctgatgt 480 cgcgcttctg aaaagcgggc atgcggagct ctcatcacag ctggctacaa tggaacatga 540 cctcaatttt gttcagcaag gtcaactatc aaacaacctg ctgatttcca acgtaataaa 600 aaccgccgac gaagatctct gtgcaatctt atgcaaaatc tgtgaggttc tcgaggttga 660 attgttcgag cgcgaagttc tggtcatcac ccgattagca acgaggaaca caaagcaaat 720 tgaaccgatt ctggtgcagt ttgccaacag atttgtaaag gataggatca tttcagcagc 780 aaaaaatacc acactatcat gccaaaatat cgggttcaca atcgaccaac gcatctactt 840 caaccatcac ttgacaccac acaaccaagc attgctccag atggcgagag cctacaaaag 900 gaagcacggc tataggttcg cgtggttcag taagggtcac atctacatca agcgggacga 960 aaactctacg gctgttcgta ttacttcaag tgctgatcta ccttcggaaa ttgatgagtg 1020 atgccacctc cgtagctcca cctctacaaa cataccaata tcgatttcgc gatggagcac 1080 ctgagtgacc atttgacgag tatggacatc gaacatcgtt taaattcggt atcggtctct 1140 tcatctacat taaaccttct tcatctcaac acgaggagct gccgaggcaa aatgagtgac 1200 ttggttcaac tgatcgatca gttgcgccac gtgcatgtag tagtgtttag tgaaacgtgg 1260 ctttatcaga acgaaatttg taatatgatt aactacgatg tgtatcactc atgtagacca 1320 actagaggcg gtggagtttc aatgttcatt ctttcgtcat tcaaacatcg caagatatac 1380 gaactagtag atgatatgaa caattttctg ataattgaat tgatggaata taatttgaaa 1440 gtttttggtg tatatactcc ggggaggaat gtagattatt tccttcaaaa gtttgaagag 1500 actcttagta atttcaaaag aatggtcgta atgggagata tgaacattaa tctgttagat 1560 gatagtaacc ctattgtgaa taattacata tgtagcctaa atgcaaacgg ttttgttgta 1620 ttgaacaaaa tgaatagtgc acatgccact agaacatcga ataccatatc aacgatcatt 1680 gatcatgtaa ttacggatat tctggatcga aaaatggttc tggtaatcaa tgacaccgaa 1740 gcccatgttt ccgatcacaa aacgttggta ttttcaatac aacataatgt acaatctaaa 1800 agaaggatat atgagaaaca tgtaattcat tatgaaaaca tattacatga aaattttagg 1860 gctgaattag atgaagttca aaattacgat gaatttgaaa gcctctgcaa tcataagata 1920 aacacgaaca gatccatata cacagtagtt gatgacagaa aaccactgca accttacatt 1980 gatagaaatt tatacgcwga aataagaaag aaaaacaaat tattcaaact ttcgaaagcg 2040 aatccaaata atgatgcatt gaagagaaac tatctgttac ttagaaattc cattagaaat 2100 aaaacaaaat tagcaaaaac aaggtactat aacgcaaaat ttgaaaattg tgttggtgat 2160 cctaaaaaga tgtgggccac gataaatgaa gtagttttca atagaccaag tagtaagcaa 2220 gataatcaga tcaatttagt agaaaatgga aagaaaatat tgtgtccaaa cgaaaatgca 2280 caaatattca atgaatattt tcttagtata ggctctcaaa atacgtcaaa tacgaatgcg 2340 aatctcgagc agaatgataa tccggaatta ttcgattttt tacaagtaac aggaagagat 2400 gttgaacaaa cgataacaaa attaaaccgc actgcggcag taggtaaaga tggaatatct 2460 gttaagtttc tgaatcgtac aaaaagtttt ataacaccta aattaactga tttgatcaat 2520 gatgtaattg caacaggtag ttttccagat aagctgaaga tgtcgaaagt gatcccattg 2580 ttcaaatctg gtgattgtac cttaaaaaca tgttttcgcc caatatcaat acttccagcc 2640 acatctaagg tttttgaatc agttctagaa aaccaaatga ggtcgtatct gcttaataat 2700 aatattataa atagaaatca gtatggattc caaaaaaagt caaatacagt ggcggcgtcc 2760 gctgatatga tgaattacat atatcgcgcc aaagacagag gtttaaaagt tatttgtgta 2820 tttttagatt taagcaaagc gtttgattgt atctcacaca caagactatt aggcaaagtt 2880 caaaagtatg gattttcaga acacgctgtc aagcttctta gatcgtattt ggataaccga 2940 ctgcaataca ctttcgtcaa taattgttat agcgccccag gaattgtaaa gtcaggtatt 3000 ccacaagggt caatactagg accccttttg tttataatat atgtgaacga tatgtggaat 3060 ttacccttaa gaggacgctt acagctatat gcagacgacg cctgccttgt atacgaagct 3120 agtacaacgg aagaattgga atttgatatt caacatgata tgaatctgtt gttgaggtgg 3180 ttatcagaaa acggtttaaa gctaaacgtt gaaaagtcct cctacatagc actgaataac 3240 aacttatcag ttgatataca agtagactcc attttgctta aaaagactga tacatctagt 3300 tttcttggcc taatattaga tactaatttg tcgtggaagg ctcatattga aaaagtaatc 3360 aaaaaagtta ccccttatgt ttttgccatg agaaaagcta ggaaattcat ctcgatggat 3420 acatgttgga agctgtataa ttcatttatc ttgcctcact ttacatacat gagttgcttg 3480 tggggatgtg cagctagtat atatttgaaa cccctagaaa taattcagaa caaagtagtt 3540 aaaattataa gaaatctacc tatacgcttt ccttcagtcg agctttacag cgaatctact 3600 ttgtctatac gatcactctg caaatataac atgatgattc atatttataa ggtccgtaac 3660 aatttgataa aaacaaatgt cgtactgttt aatgtatcag atgtacactc acataataca 3720 agacagcaaa attcgatctt tgtaaactgt cctaggaccc aatatggctg gaaaaatctt 3780 ttttatcagg gaattataat gtataactcg ctgccaaatg aaatgaaatc tcttaatctt 3840 aggttgttta agcgtagttt aaaacaatat ttgtgtaaca tgtaatttat agttagaaca 3900 gccaactgaa taagtagaat cgctattaga gtagctgata acagtttgtt ataaactttt 3960 agctacaaat tatattaata aataaaaaaa aaaaaaaaaa aaaaaa 4006 // ID Gypsy-38_DPu-LTR repbase; DNA; INV; 185 BP. XX AC ACJG01004157; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_DPu_; KW Gypsy-38_DPu-I; Gypsy-38_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004157; Positions 11608 11424. XX SQ Sequence 185 BP; 31 A; 57 C; 43 G; 54 T; 0 other; tgttatgtac gcgtcttgtg ggtttccccc accgaccggt aggtgtccct cttccccttc 60 ttccccggct atgcccaccg cggggagtct agtttatgcc tcgcttgtac acggtggcgt 120 ctgacgcatc tggctctctt gtaatacaat cacgtcgatt gttaacgacg acttagaacc 180 taaca 185 // ID Gypsy-225_AA-LTR repbase; DNA; INV; 269 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-225_AA_; KW Gypsy-225_AA-I; Gypsy-225_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-269 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1054-1054 (2011). XX DR [2] (Consensus) XX SQ Sequence 269 BP; 79 A; 52 C; 64 G; 74 T; 0 other; tgtggggatt ataccttagg catccctagg tgctccccca gatacacact acaacacaat 60 caacgccgaa tccagcaggg cgaatgctgg tgacgataga gtgagtctgg aacagtgata 120 gcaatcgttc ggtcgtgttt gctagaaaat atatgatttg attttaccat agttttttat 180 gaagttgcga tatccgaagt tgtcgtccag aacgcgtatc tccgtattaa aaagtaggga 240 agtgtagtga aagcatcaag tttactaca 269 // ID Gypsy-21_AA-LTR repbase; DNA; INV; 134 BP. XX AC supercont1.124; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_AA_; KW Gypsy-21_AA-I; Gypsy-21_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-134 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.124; Positions 1276825 1276958. XX SQ Sequence 134 BP; 47 A; 27 C; 18 G; 42 T; 0 other; tgttacatca aggcaactca tgaaatgttg gccttgacag aacaaatcat ttttcaattc 60 attatatgtc caaccaccaa atagaataga cgttaaacta gaaactacct ttgttttatt 120 ctacggacgt atca 134 // ID LOA_Ele6 repbase; DNA; INV; 5259 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A LOA clade non-LTR retrotransposon family from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; Lian; KW LOA_Ele6. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5259 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5259 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 27 CC sequences with >97% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 51..1421 FT /product="LOA_Ele6_1p" FT /translation="MVLVPPKESTPKRPRNINNSSNSDSKPIPKKQKGQSI FT HISVNSRMENIRQDQPTALENQQSTYSEVAKWVRVGILPKNYPQIQLTDEQ FT LYSLQEAILLKVTLQRRESFKPKFTNCWQRTGHLVINCQDRETSDWLNSVV FT PTLIPWKGAELTVVNADDIPRLDVMVGFFPQSVNDENDTIRVFIESQNDGL FT STDRWKIIQRNILYENHVEWFFTVDEASMQHFKTCNFLINYKFGQTKLRKK FT GMYKPGVDGMVISRDEPKELEPVSSVDPGTEQSLSVPVTSGTDGKVPEAKS FT SKNHHQSTQSTGLPKNENSQGKFDDRIYSGPSKNSNHFGLQVDVQCPGLAK FT DQQCSTILMDQKSSERSKDRNTFGLPKDQSGYQKDQNYSGRLKDQNAAGPS FT KDRNCSGSQRDRNTSGLPMDRNTSGLPKARSRSGHIKDRNSSEPSKEQMPS FT GPEKDQHSRKYK" FT CDS 1477..5160 FT /product="LOA_Ele6_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNTDIKFIQINLHHAKGATGILCRRFLKENFSVAFIQ FT EPWVNKNRVLGIAGQIGKLVYDEGNSNPRTAILIKGGIKFVPITQFIAKDI FT VVIRMEIPTTRGKTEAYVVSAYFPGDVEAVPPPELTAFINYCKTNNKHFII FT GCDANSHHTIWGSTNINKRGEDLFEYISSNNIDLCNKGNQPTFITKAREEV FT LDITMCSPLFSTRIKNWHVSDEQSLSDHKHIVFQYNAGEIVKIAYKNPRNT FT NWERYASTMQSECLLLNTTIQSTMQLDRVAKNITEQIIQAYNVNCPVKEEI FT SNREVPWWNKTLNNLRKKSRKLFNKAKISGQWDQYKKALTMYNKEIRKSKR FT RTWRSYCENIEKTPDIAKLQKVLSKDHSNGLGNLKKTNGDFTVNSQETLEM FT LMETHFPGSNLVVSDNEESINVNDNYGPTGTIRYNSLIPDTIFTKSKVEWA FT MNSFEPFKSPGMDGIFPALLQKGGEILLDSITNIFKASLMLGHVPKIWTQV FT RVVFIPKVGKRDKTNPKSFRPISLTSTMLKIMEKIIDEYIKTEYLQKIPLN FT RYQYAYQTNKSTVTALHNLVTKIESTLDRKEIALVAFLDVEGAFDNATFSS FT IKKSMENRHFDPGIIAWIMEMLKNREVSAELGVSSITIKTTKGCPQGGVLS FT PLLWSLVVDDILNKLTEQGFEVIGFADDVAIIVRGKFEHTITDLMQSALNC FT ISRWCQNEGLNINPSKTVLVPFTRRRKVTFKNITIEGCIVQYASSVKYLGV FT ILDDKLNWNLHLEQVISKATIALWVSKNTFGKKWGLKPKMIHWIYTVIVRP FT RVVYAALVWWPKTKQSTAQKKLEKLQRLACMAMTGAMRSTPSKALEAVLYK FT LPLHQHVQMEAEKNALRLRRTNIFLEGNLSGHLSILKDFQINPLVVMNEDW FT MERTLNLDIPYKVVETDRQMWESGGPSISPGSIMFYTDGSKMSDNTGAGIT FT GPGVNVSIPMGRWTTVFLAEIYAILECASICLRRKYRHARICVFSDSQAAL FT NALKSYTCQSKLVWECIQVLKQLAMNNQVSLYWVPGHCGIEKNEKADFLAR FT RGSNTQFLGPEPFCGVSKCVIQMEIKKWERNMIQSNWIGTNTSRQSKLFIT FT PNKQITEKLLNLNKKSFCIVIGLFTGHCPARYHLKKMGRSPTDICRFCDCE FT IETSQHLLCECSALFAQRRQYFNKGILSPFEIWQGNPNKVVEFILRIIPDW FT GTSHYQSTAVTLNSNLSS" XX SQ Sequence 5259 BP; 1865 A; 972 C; 1099 G; 1323 T; 0 other; gtttcaagaa actgttgttg agtggacatg gtcgggagga ggcactgtcg atggttctag 60 tgccaccaaa agagtcgact ccgaaaagac ctaggaacat caacaacagt agtaacagcg 120 acagcaaacc catccctaaa aagcagaaag gacagtcgat tcacatttct gtcaacagcc 180 ggatggaaaa tatcagacaa gatcaaccaa ctgctctcga aaatcaacaa tcgacctaca 240 gtgaggtggc caaatgggta agggtaggta ttctgccgaa gaactatcct caaatccagc 300 tgacagacga acaactgtac agtctacaag aggccatttt gctgaaggtt acgctgcaaa 360 gaagggagtc tttcaagcca aagtttacga actgttggca acgaacagga catttggtga 420 taaactgcca ggatagggaa acgtcggatt ggctaaattc agtggttcct acgcttatcc 480 cttggaaagg cgcggaactt acggtagtca atgcggacga tatcccaaga ctggacgtaa 540 tggtaggctt tttccctcaa agtgttaatg atgaaaatga cactatcaga gttttcatag 600 agagccagaa tgacgggttg agtaccgaca ggtggaaaat cattcaacgt aacattctct 660 atgagaatca cgttgaatgg tttttcacag ttgacgaggc gtccatgcaa cattttaaga 720 catgcaactt cctcatcaac tacaagttcg ggcaaacaaa gctacggaaa aaagggatgt 780 acaaacctgg ggtagatggg atggtaatct cgagagacga gccgaaggag ctggaaccgg 840 taagtagcgt ggatcctggg accgaacaaa gccttagtgt cccggtgaca agcgggacag 900 acggtaaagt gccggaagca aagtcctcga aaaatcacca ccaaagcact cagagtacag 960 ggcttcccaa aaacgaaaac agccaaggga aattcgatga ccgaatttat tctgggccgt 1020 ctaagaactc gaaccatttt gggctccaag tggatgtaca atgccctggg ctcgcgaaag 1080 accagcagtg ctccacaata ctgatggacc aaaaatcttc agagcgttca aaggaccgga 1140 atacttttgg gcttccgaaa gaccaatctg gctatcaaaa agatcaaaat tattctgggc 1200 gactaaaaga ccagaatgct gctgggcctt cgaaggaccg gaactgttct gggtctcaga 1260 gggaccgtaa cacatctggg cttccaatgg accggaacac atctgggctt ccaaaggccc 1320 gaagtcgttc tggacatata aaagaccgga attcatccga accttcgaag gaacaaatgc 1380 cttctgggcc tgaaaaagac cagcacagca gaaaatataa ataacaatcg aaacatgaag 1440 agtgaattac tagtcctcta atgaacattt ataaaaatga atactgatat taaatttatt 1500 caaataaatc ttcaccatgc caaaggtgca acaggaatac tttgtcgaag atttttgaaa 1560 gaaaatttta gtgtcgcgtt tattcaagaa ccttgggtta acaagaaccg agttcttgga 1620 atcgctgggc aaataggaaa actagtttat gatgaaggaa actcaaatcc tcggacagca 1680 atcttaataa aagggggaat aaaattcgta cctataacac agtttattgc aaaagatata 1740 gtcgtgattc gcatggaaat acccacaact aggggcaaaa ctgaagctta tgtagtttca 1800 gcttatttcc ctggagatgt ggaggcggta cctccacccg aattgacagc tttcattaat 1860 tattgtaaaa ctaataataa acactttatc attggttgcg atgcaaattc ccatcataca 1920 atctggggaa gcaccaacat caataaaaga ggggaagacc tttttgaata catatcttcg 1980 aacaacatcg atttatgtaa caaaggaaat caacctacat ttataaccaa agctagggaa 2040 gaagtattgg acataaccat gtgcagtcct ttattttcga cgcgtataaa aaactggcat 2100 gtttcagatg aacaatcttt atctgatcac aagcacatag tgtttcaata caatgctggg 2160 gaaatagtta aaatagcgta taaaaatccc aggaacacta actgggagag atatgcttca 2220 acaatgcaat ctgaatgttt attgctaaat actacaattc aatcaacaat gcaattggat 2280 agagttgcaa aaaatattac tgaacaaata atacaagcat acaacgttaa ttgtcctgtc 2340 aaggaagaaa tctcaaatcg agaggttcct tggtggaata aaacattaaa taatttaagg 2400 aaaaaatcgc gaaaactttt caataaagcg aaaatttctg ggcaatggga ccaatataaa 2460 aaggccctta caatgtacaa caaagaaatt agaaaatcaa aacgacgaac ctggagatcc 2520 tattgtgaaa acatagagaa aactccagat atagctaaac tgcaaaaagt gctttccaaa 2580 gaccattcaa atggtctggg aaatttaaag aaaacaaatg gcgattttac tgttaactca 2640 caagaaaccc ttgaaatgtt gatggagact catttcccag gatcaaactt agtggtaagt 2700 gataatgaag aaagtataaa cgttaacgat aattatgggc cgaccgggac tataagatat 2760 aattcgctta tacctgatac cattttcaca aaatctaaag ttgaatgggc gatgaactct 2820 ttcgaaccat ttaaatctcc gggaatggac ggtatatttc ctgcactgct tcagaagggt 2880 ggggaaatac tgctggattc tataacaaat attttcaaag caagtttaat gttaggccat 2940 gtaccaaaaa tatggaccca agtacgagtt gtcttcattc ctaaagtagg aaaaagagac 3000 aaaacaaatc caaaatcttt taggccaata agtctaacat ctactatgct taaaatcatg 3060 gaaaagataa tagatgaata tatcaaaaca gaatacctac aaaagattcc tcttaacagg 3120 tatcagtatg catatcaaac taacaaatct acagtcactg ctctacataa tttggttaca 3180 aaaatagaaa gtactctcga tagaaaagaa attgctcttg tagcatttct tgatgtagaa 3240 ggtgcttttg ataatgcgac attcagttca ataaaaaagt caatggaaaa caggcatttc 3300 gatcctggca ttatagcttg gattatggaa atgctgaaaa atagagaagt atctgctgaa 3360 ttgggagtat catctataac aataaagacc actaagggat gtcctcaagg aggcgtattg 3420 tcgcctttat tatggtcgct tgttgttgat gatattctta ataaattgac agaacaaggc 3480 ttcgaagtta tcggttttgc tgatgatgtg gccataatag ttcgtggaaa gtttgaacat 3540 accatcacag atttaatgca gtctgcacta aactgcattt cgcggtggtg ccaaaacgaa 3600 gggttgaata taaacccatc gaaaaccgtt ttggtaccgt tcactcgcag acggaaagta 3660 acttttaaaa atataacaat tgagggatgt attgtccaat atgcatccag tgtaaaatat 3720 ctaggagtta ttctagatga taaacttaat tggaacttgc acttagagca ggtaataagc 3780 aaagccacta ttgccctgtg ggtgagtaag aatacttttg gtaaaaaatg gggtctcaaa 3840 cctaaaatga ttcattggat ttatacggtt attgtaaggc ccagggtagt ttatgccgca 3900 ctagtatggt ggcctaaaac aaagcaaagc acagcccaaa aaaagctaga aaaattgcaa 3960 cgtttggcct gcatggctat gacaggagca atgcgtagta cgccatcaaa ggccttagag 4020 gcagttctgt ataagcttcc actacatcaa cacgtgcaaa tggaagctga aaagaatgca 4080 ctaagactgc gaagaacgaa tatctttctt gaaggaaatc tttccggaca tctaagtata 4140 ctgaaagatt ttcaaattaa tccattagta gtcatgaacg aagactggat ggaaagaacc 4200 ctaaacctag atataccata taaggtggtt gaaacagacc gccaaatgtg ggaatcgggt 4260 gggcctagta tttcaccagg atcgattatg ttctacaccg atggttctaa aatgagtgat 4320 aacacagggg ctggaataac aggtccagga gttaatgttt caataccaat gggacgatgg 4380 acaactgttt tccttgcaga aatatacgcc atcttagagt gtgcttctat atgtcttaga 4440 agaaagtata gacatgcaag gatttgcgta ttttcagaca gccaggcggc tctgaacgca 4500 ttaaaatcct atacatgcca atcaaaactg gtttgggaat gcatacaagt attgaagcaa 4560 ctggctatga ataaccaggt gtccctgtac tgggtaccag gccattgtgg aatcgaaaaa 4620 aatgagaaag ctgattttct ggctagacga ggatcgaata ctcagttttt aggaccagaa 4680 cccttttgtg gagtttcaaa atgtgtaata caaatggaaa taaaaaaatg ggagcgtaat 4740 atgatacagt cgaactggat tggtacaaat acatcaagac agtctaaact gtttataact 4800 ccaaataaac aaattacaga aaaacttcta aacttaaata aaaaatcatt ctgtatagtt 4860 attggactat ttacaggaca ttgtccagct agatatcatt taaagaagat gggtcgaagt 4920 ccaactgata tctgtcggtt ttgcgactgt gagatagaga cctctcaaca cttgctttgt 4980 gaatgctcag cactatttgc gcaaagaaga cagtatttca acaagggcat actgagtcct 5040 tttgaaattt ggcaaggaaa tcccaataag gtagttgaat tcattttaag aatcatacca 5100 gattggggta cgtcgcacta tcagtcaact gcagttactc tcaatagtaa cttgtcgtcc 5160 tgaaaatatg cgaaaaataa acaggggtat accacaatag atcaacttaa tggtcgcagt 5220 ggttcaaatc ccaacaaaag aaaaaaaaaa aaaaaaaaa 5259 // ID BEL-616_AA-LTR repbase; DNA; INV; 312 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-616_AA_; KW Pao_Bel_Ele86; BEL-616_AA-I; BEL-616_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-312 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 312 BP; 106 A; 42 C; 73 G; 91 T; 0 other; tgttacacta aaagtggaaa gaattattat tcaatgttcg gtttttgagt caatttttgt 60 tagagtgtaa aacagaatca aatgtctata tttttctctg tgtgctacca gcgcagacga 120 caggaggtgt agaaagaaag cgagggtttg tgtagatttg gcgcgatgcg aacggaagta 180 ggaaggagaa aggaaagaag gaaatcaata aaagtagaaa attccgtcga gttaagacgt 240 gtttgttttc gttaaataaa attcttaaaa tacagtccac ttgcttttcg tccctgaact 300 atccgaagaa ca 312 // ID Crack-32_BF repbase; DNA; INV; 2606 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-32_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-32_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2606 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2606 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 837-837 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..2349 FT /product="Crack-32_BF_2p" FT /translation="EELKFLCDAYDLSNVITEPTRVTEGSSTLIDVILTSN FT PSKHSNSGVYKGSLSDHHLIFTYRGLKNPRPPPKYISTRIFKGFDEGKFRE FT DLATIPWDTLNILDSVQSMWATWKSLFETVCNKHAPLKKFRVRGEDCPPWL FT TQDIREIMALRDKARHTAESTQSQTDWETYKQLRNHTKRLIISSKRTYLTD FT EINNGNTNGMWKSLKSLLGKTKSSNISCLNDANGDMCSSPLEIAQELNKYF FT GTVAEKLARGINTGMSLLSPLQFVHRCLARFTFSQVTTESVQRELLALKTD FT KAVGLDNLNNRLLKAGAEIIAPSLTNLFNESLRRSEFPDDWKKARLTPIHK FT AGDRAAPNNYRPVSILPAVSKILERTVHTQLYKYLTDNSILTEAQSGFRPG FT HSTQSAAHLLTEKWLDAMNDGQLTGAVFIDLSKAFDTLDHNILLQKLFCYG FT IQGGALDWFTSYLSSRQHCTAVDGSTSDFHRMQYGVPQGSILGPLLFLIYV FT NDMPNCAQFCNISMYADDTVIYYCSKDVSSIQENLQNDLNTLSQWFMANSL FT SVNSGKCKFMLIGTDKRLNSCKSPNLSINGAAVTECKMYKYLGVLIDRNLN FT WKCQVNAILCKLRRSLSILKHVRTFVSTPTLLTLYNSICLPHIMYACTVWD FT SAPERELEKLQRMQNRAGKLILRAPYLTPSAEVHSRLGWKDVKSLHKHHKA FT LLVYKALNNKLPPYIRRMFTFCRDRSVRTTRRSASDLLVVPKPVREVYRRS FT LAYSGSLLWNSLVPEARAATSISSFKRXVTR*" XX SQ Sequence 2606 BP; 761 A; 604 C; 553 G; 687 T; 1 other; gaggagctca agttcttgtg tgatgcttat gaccttagca acgtcataac cgaaccaaca 60 agagtgactg aaggatcctc cacccttatt gacgtcatct taacgtctaa ccctagtaaa 120 cactctaaca gcggtgttta taagggttcg ctaagcgacc atcaccttat cttcacatat 180 cgtggattaa agaaccctag gccaccccct aaatacattt ccacaagaat ctttaaaggc 240 ttcgatgagg gaaaatttag ggaagatttg gccacaattc catgggacac tcttaacatt 300 ttggactccg tgcaaagtat gtgggcaaca tggaaatcac tctttgaaac cgtttgcaac 360 aagcatgcgc ctctcaagaa gttccgtgta aggggagagg actgtccacc atggctcaca 420 caagatatac gggagataat ggcgcttcgt gataaagcca ggcatacggc tgagtcgacc 480 cagagccaaa ctgactggga aacctacaaa cagttgcgta atcacaccaa acgcctaatc 540 atctcgagca aacgaacata cctgactgat gaaattaaca acggaaatac caatggcatg 600 tggaaatcac tcaaatctct gttaggaaag accaagtcgt ccaatatttc ctgcctcaat 660 gatgcgaatg gagatatgtg ctcctcgcca ttggaaatcg cccaggaact taacaagtat 720 tttggcaccg ttgccgaaaa gttagcacgt ggtattaaca cggggatgtc acttctcagc 780 ccactgcagt tcgtgcaccg ttgtctggct cgtttcacct tcagtcaagt aacaacagag 840 tctgtccaac gtgaactcct cgctctgaaa acggataaag ccgttggtct agacaacctt 900 aacaacagat tgctcaaggc tggggctgag atcatcgcac catctctgac aaatttattt 960 aatgaatctc tgcgaagaag cgagtttcca gatgactgga aaaaggcgcg tctaacgccc 1020 attcataaag caggggatag agcagcacca aacaattaca ggcctgttag tatactacct 1080 gctgtctcga agattttgga gagaacagta cacactcaat tgtacaagta cctgacagat 1140 aacagtattc ttacagaggc ccagtctggt tttcgaccag gccattctac tcagtccgca 1200 gcacatcttc tcacagagaa gtggctcgat gcaatgaatg atggtcagtt gacaggggcg 1260 gtgtttatag atctctccaa ggcgtttgat acgctggatc ataatattct actccaaaag 1320 ctgttttgct atggcataca ggggggcgca ctggattggt ttacgtctta cctgtctagt 1380 aggcagcatt gcactgccgt tgatggatct acctctgatt ttcatcgaat gcaatatgga 1440 gtaccccaag gctcaatatt gggaccgctc ttgttcttga tctatgttaa tgatatgcca 1500 aattgcgccc aattctgtaa tatatcaatg tatgcagacg ataccgtcat ctactattgt 1560 agcaaggatg tcagtagcat tcaagaaaac cttcagaatg atcttaatac tctttcacag 1620 tggtttatgg caaattcatt atcagtgaac agtggaaagt gcaagtttat gctcattggt 1680 acggacaaaa ggcttaacag ttgcaaaagc ccaaacctgt ccatcaatgg ggcagctgta 1740 actgaatgta agatgtacaa ataccttgga gtgctgattg accgtaattt aaactggaag 1800 tgccaagtta atgccatttt gtgcaaactc aggcgatcac ttagcatttt aaagcacgtc 1860 cgcacttttg tttcaacgcc taccctgtta actttgtaca attccatatg tctgccacat 1920 attatgtacg cttgtactgt ctgggactct gcgccagaac gggagctgga gaaacttcaa 1980 cggatgcaga accgagccgg caagttgatt ctacgtgccc catatctaac tccatcggcg 2040 gaagttcact cacgtctggg atggaaggat gtgaagtcgc tacacaagca ccacaaagcc 2100 ctactggtat acaaggcctt gaacaataag ttaccaccct acatacgtcg tatgttcacc 2160 ttttgcaggg acagatcagt caggaccacc agacgtagtg cttccgatct tctagtagta 2220 cctaagccag tgagagaggt ctacagacga tcactagctt actctggttc acttctctgg 2280 aattcactag taccggaagc cagagctgca acctcaatct ctagtttcaa aagamtagtg 2340 acacggtagc gatcctcgaa cgtgtatgga atcaacgagc caatgattgt catttgttta 2400 cattattatt attctttcat ttatgtatgt atgtatttag aattccttta ttttctgtat 2460 gatcaatagg aattttgtgt caatatgtat attattaggt gacccctcac ctcctccatg 2520 tatgctcgac ctccacgaaa agcggcctag taggccgatg gagtatctat cgagtcaaaa 2580 taaaggcaca aacaaacaaa caaaca 2606 // ID DNA8-107_AP repbase; DNA; INV; 1026 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-107_AP. XX NM DNA8-107_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1026 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2045-2045 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. It contains a piggyBac-like insertions (pos. 123-679) CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1026 BP; 348 A; 159 C; 164 G; 355 T; 0 other; cagtgccgta gccagggggt gggcactggg ggcacgtgcc cccccccgaa atttttttta 60 gcttgtttta tattatatta tatgtttgat aaaaattaaa agttaaggct gtcgaagtga 120 accattttga aggggtcgga tttaggaaat attgtacatc catcgttagt actatatcta 180 tgtgtgtagt gactgatcat tagtgtgtgc gagttccccg aacgctttat gtaatttaag 240 ataaacgacg gctaagattc ttgaacccca gtaaagtaca tacgcgcagt cacgtcacta 300 tattttgtga taaaaaaatg ttcaacttac atgtgccaca aaactcccaa aaagtggaag 360 acaaattcac ccagtaccaa aagccctttt ggctatttta attttgaaaa catatatttg 420 aattccttaa tctcttttct agattatgta ggaaatggat ttatgatctg tagtattgtt 480 taattagtat tatagtacca gagtaatttt cttttttttt tttcaaaatt cacttttact 540 tttgaattac taaaaaaaaa aaaatcgctt ccaacaaaac acacaaaaag cggagaaatt 600 gttatgtttt tttcgaaatt aaaattggat ttctggaagt atattttttt ccgcttattg 660 atctaattgc acgaaaaaac taccagcttt caacagtcct aatatagtct tgtatttttg 720 ttagtaaatt ttaataatat tcgaaccaaa aatggaatgt tttgtttaca acacattttg 780 atataatgat aataatattg atgaatacat taacctatgg tttacaataa tattaataca 840 ttgttatatt aaaatgtagc caatgtaggt accatacttg ttaacataaa tataatatag 900 tcataaaacg agtagtagat acactttatg aacaaagggg tcataattag atattctagc 960 atttcaaagg ctagagaaaa aaaattaagt gccccccccc aacatttttt tctggctacg 1020 gcactg 1026 // ID Gypsy-60_CQ-LTR repbase; DNA; INV; 173 BP. XX AC AAWU01037505; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-60_CQ_; KW Gypsy-60_CQ-I; Gypsy-60_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-173 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 500-500 (2011). XX DR Genome; AAWU01037505; Positions 2181 2009. XX SQ Sequence 173 BP; 60 A; 28 C; 29 G; 56 T; 0 other; tgttgtattc cacaaatatt gtttagcgtg tagggacttt tgcttcacct ataaaatact 60 atcttgaaca aggaaatcga aatgtcatta ataaagtagc gagtgaagat aaaacacgcg 120 gttttcgata cgggatttaa ttcgtctttt attacacaaa atccaatcta aca 173 // ID BEL-74_CQ-LTR repbase; DNA; INV; 219 BP. XX AC AAWU01004636; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-74_CQ_; KW BEL-74_CQ-I; BEL-74_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-219 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 294-294 (2011). XX DR GenBank; AAWU01004636; Positions 4910 4692. XX SQ Sequence 219 BP; 76 A; 64 C; 30 G; 49 T; 0 other; tgattgcgct gcattgaaat ttacagtcca catttttccc agtgtaccca cactaacaca 60 ttgtaaaccc caaacccatc atacctcaaa actatgacac tttgacactg aaaaacacaa 120 aaaaaaaccg ttcttgcacc gagcagacgt gttcccttta ttcgctcgaa accacgaaag 180 aattcgcgaa gtccgaccag tgaaaaattc ccccgaaca 219 // ID BEL-18_CQ-I repbase; DNA; INV; 8327 BP. XX AC AAWU01035939; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-18_CQ_; KW BEL-18_CQ-LTR; BEL-18_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8327 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 189-189 (2011). XX DR Genome; AAWU01035939; Positions 10894 19220. XX CC 'AACCG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 46..1626 FT /product="BEL-18_CQ-I_1p" FT /translation="MATFPQEMILRPCKICQQDGRPDNMMLCFSCKGWFHP FT RCLGPGEKLFIGSWTCAACVAAAELPASESTTIAPNTVQTETSATATKAPA FT GSILTEKAKADLLWIAEQREYVLREVERQRLELDQKKEEMERLSREAAQML FT SMNLQAVQSTSGQQSSANSSEGWVQRIMEEIKNISIGTPGPTGHDTQTSGY FT MHSSTPTKAANIGTLPTNVGTFTVLNRPTIPSTGYSCGTDSFLSLPEHQQQ FT TSQQQPPPQPPHGLSQTWPHPQQTWTHSQQGGYQSNQAWPQPQTSGPQPSA FT PQPHQDAPNTHRSGPEQAWPQPQQVDHQPQPGRSQQLAPQPTQAWQQSQPG FT GQQQVAPQRQQAWPQPQLGRSQQSAPQPQQTWSQPPQGGQQQGPQQPQPQQ FT VQPQPQQAWSQRQQAWPQPQPAWQQPQPNWPQPQQGWPQPQQVWPQHQQDW FT LQQGLQHMGQTPQDWTPPHVQHRSSYWPDRSCQRIYLNFMETQRNGHSSSL FT RLLTPPTLAVIQTWRTWHAFSRRCKEELWNR" FT CDS 1167..2165 FT /product="BEL-18_CQ-I_3p" FT /translation="MVATTTRRTTTGTTAAAASTGSTATATGLVAAATGLA FT STTTGLAAAATELAAATTRLASATTGLASASTGLAATRTAAHGTDASGLDS FT PPCPTPQQLLARQIMPKDLPKFHGNPEEWPLFFTAFTNSTNACGYSNVENM FT ARLQQAVQGRALEQVRGRLLYPTLVPQVMSTLYMLYGRPELIIQTLLDKVR FT DCPSPKAEKLETLITFGMTVQNLCDHIIAAGHVAHLCNPVLLRELVEKLPA FT QQRLNWALYKQQFPATDLSTFAAYMAKLVEAASEVTVITDSKQPRSGRTER FT GVERSYVNVHSTSDSPTRDEHPKRSTEKGDRGHQMLSVQRN" FT CDS 4672..8325 FT /product="BEL-18_CQ-I_2p" FT /translation="MRFPDLARKFTYLRGLPVRSYEAATPGILIGSNNAGL FT IATLNLREGELGDPLASKTRLGWTIYGSSSDGQRTTNFTLHVCNYQEDQHA FT DQELHDLVKRHFTVESIGVSADKGPESEEDKRARLILMETTKRVADGFETG FT LLWRHDETRFPDSFPMAMRRLECFERRMKNDPELQASVQQQIKEYLESSYI FT HEVTPEEIESANPHKIWYLPLGAVRNPKKPGKIRLIWDASAKVGNVSLNAM FT LLKGPDLLTPLASVLCGFRERPVAVCGDLRQMFHQFRIRSEDVHSQRFLYR FT EHFTEPVKVFAMDVGSFGATCSPCQAQYIKNLNAREHEKEFPEAAKAVINK FT HYVDDYLDSFDTEAEAIKVALDVKELHSRGGFEIRNWHSNSEALLARVGEP FT KKVSTRAISIDSESEAERVLGLLWLPEEDLLAFATELQLDGIPPTKRNILR FT CVMSLFDSQGLLSHITIQGRMIIQDTWRGKTNWDDEVIETIRVRWLRWTQL FT FKKVGGIRLNRAYFPGFSAAEIGAVELHIFTDASEEAYACTAYFRVVINGK FT VYVTLAMAKAKVAPLKALSVPRLELMGALLGARLAKAVMEYHSFPICRRVF FT WTDSKTTLAWIQSEHRRYRQFVAFRVGEILSKTDVIEWRYVPTLLNPADEA FT TKWGKGPSTDVNSRWFSGPEFLYLPETEWPAQVSTELDTDEELRPCMVHRE FT HQTEDVYEINRFSRWELLQRTAAYVHRFIGNCRRKMQNQPLAAGCLKGEEL FT RTAERSLWRTVQLSEYPDEVIALKQGGARPKKTIEKTSPLYKLTPYLDEHG FT VMRVDSRIGAVPYVNFDFKYPIILPRRHYLTKLIVDWYHRRYLHANHSTVH FT NEVRQRFHVSSLYTVVLQVAKDCQTCKNNKAVPTTPRMAPLPYSRLAAGVR FT PFSYVGLDYFGPIHVRVGRSSVKRWVALFTCLTVRAVHLEVVHTLSTESCK FT MAVRRFVARRGSPVEIHSDNGTNFQGASRELKDEIEVIGKNLAETFTNSNT FT KWLFIPPSAPHFGGSWERLVKSVKVALRSLCSDRKPDDETLLTVLAEAGSI FT VNSRPLTKIPLEDASQEALTPNHFLLLSSNGVIQPPSSLTQPPKVTRTNWN FT LAKQLVDQFWRRWISEYLPIISKRTRWFSETEALKVDDLVLIVNEGQRNGW FT TRGRVLTVIPGRDGRIRQATVQTAAGILKRPVAKLAVLNVQDSCNATVPAE FT PERRYGSG" XX SQ Sequence 8327 BP; 2274 A; 2197 C; 2180 G; 1676 T; 0 other; ttatcaaaga attacgtgag ttctcgtgaa cagacaaagc acacgatggc gacctttccg 60 caggagatga tacttcgacc gtgcaagatc tgccaacagg acggtcgacc ggacaatatg 120 atgctctgct tcagctgtaa aggatggttt catccgcgct gcttgggtcc cggagagaag 180 ctattcattg gaagctggac gtgcgcagcc tgtgtggctg cggcagaact ccctgcaagt 240 gaatcaacca ctatcgcacc gaacacagta caaactgaaa cctcagctac tgcaacaaag 300 gcgccggctg ggtcgattct caccgagaag gccaaagcgg atttactgtg gatagcagag 360 caaagggagt acgttctccg tgaagttgaa agacaacggc tggaactaga tcaaaagaag 420 gaggaaatgg aaaggctgag ccgcgaggct gcccagatgc tgagcatgaa tctgcaagca 480 gtacaaagca cgtctggcca acagtccagc gcgaacagtt cggaaggttg ggtgcaaagg 540 attatggagg agatcaagaa tatatctatt ggcactcctg gaccgaccgg acacgacacg 600 caaacctctg gttacatgca cagctccact cccacaaagg cagccaatat cgggactctg 660 cctacgaacg taggcacgtt tacagtgctt aaccgtccca ccattccatc gacgggctac 720 tcttgtggca ctgactcctt cttgtcccta cccgaacacc agcaacaaac gtcgcagcag 780 cagccaccgc ctcagccgcc acacggtttg tcgcaaactt ggccgcaccc acaacagact 840 tggacgcatt ctcaacaagg tggatatcag tcaaatcagg cttggcctca gccgcaaacg 900 agtgggccgc aaccaagtgc gcctcaacca catcaggatg caccgaacac gcatcggagt 960 ggaccagaac aggcttggcc tcagccacag caagttgacc atcagccaca accaggtcgg 1020 tcgcaacagc ttgcgccgca gccgacacag gcttggcagc agtcacaacc aggtggacag 1080 caacaggttg cgccgcagcg tcaacaggct tggccgcagc cacaactagg aagatcgcaa 1140 cagagtgcgc cgcagcctca acagacatgg tcgcaaccac cacaaggagg acaacaacag 1200 ggaccacagc agccgcagcc tcaacaggtt caaccgcaac cgcaacaggc ttggtcgcag 1260 cggcaacagg cttggcctca accacaaccg gcttggcagc agccgcaacc gaattggccg 1320 cagccacaac aaggttggcc tcagccacaa caggtttggc ctcagcatca acaggattgg 1380 ctgcaacaag gactgcagca catgggacag acgcctcagg actggactcc cccccatgtc 1440 caacaccgca gcagttactg gccagacaga tcatgccaaa ggatttacct aaatttcatg 1500 gaaacccaga ggaatggcca ctcttcttca ctgcgtttac taactccacc aacgcttgcg 1560 gttattcaaa cgtggagaac atggcacgcc ttcagcaggc ggtgcaagga agagctctgg 1620 aacaggtgag gggccggttg ctttacccga cactcgtgcc ccaggtgatg tcgactttgt 1680 atatgctcta cggaaggcca gaattgatca tccagacgct actggacaag gtgcgagact 1740 gcccgtcgcc aaaggcagag aaactagaaa ccctaataac cttcggcatg acagttcaga 1800 acctatgtga ccacataata gccgccggtc acgtagcaca tctctgcaac ccagtccttc 1860 tgcgagagct ggtggaaaag ctcccagcac aacaacgtct gaactgggct ctatataaac 1920 aacagtttcc ggcaaccgac ctcagcacat ttgctgctta catggcgaaa ctcgttgaag 1980 cggcgtccga ggtcacggtc atcaccgatt ccaaacagcc acgttcggga cgaactgaac 2040 gcggcgtaga aaggagctac gtaaacgttc actcaaccag tgactcgccg acgagagatg 2100 aacaccccaa aagaagcacc gaaaaaggag accgtggaca tcaaatgctt agcgtgcaac 2160 gcaactaacc acaaggtgaa aaactgcgag acgtttcaaa agtgggaact caatttgcgc 2220 tggaagctag ttcaagataa ccacctctgt cgaacatgcc tgggcaaaca cggacgtcgc 2280 ccgtgtaatt taagaacgcg atgcggtatg gacggatgtc aggaacggca tcatgaacta 2340 ctacaccgcg aaactccaaa ggaccaaacg ccatcacagg agggacacaa tcgagccgga 2400 gaaacgaaga tcgcaggaga tgggtttaat gctcaccacg ctacaaagaa gtccacattg 2460 ttccgaatcc taccagtaac actaacctgg atctgtgaac acatacgcat tccttgacga 2520 cggttccgac ttgaccctag tcgaacaatc tatcgcggga caactgggaa tcgacgacgg 2580 tgttcctact cctctatgtc tatcttggac cagcaacgtg acaagacaag aaccgaagtc 2640 ccaacgcgtt cgtctggaga tctccggaga aggcaagaaa gagcggttta ctcaaggggg 2700 ttgacactcg atcattctgg aataattctg ggttaattct ggaatgaacg gaatgacgat 2760 gatgattcta gcaaaaatga tgaagtgcgg gtaagagtgt aactgtcata atatttatct 2820 ttctaaacca aaacaagtta cggttctctg cagcattgag taggacccgg taggacaaca 2880 gaaccggcgg ttgttctgta ctgttccggt gccgtgcgac gtccagagca acggccagat 2940 tggccttcac tggaatctga ccgagctgct gttggccact agacgaggtt ggtattattg 3000 gatttgctaa gtggttgttt agtggatttt taaaactgac tttcaatttc tgatttccct 3060 agacagcatc taagtggacc agctacgtac cggcgaggcc aacgtgctag ttattccggt 3120 ggaggtcgat acagcgcatc caagatcagg ctaggaagcg gcagacggaa ctgaacctgt 3180 tgggggactt gtttgggccg tagatagcaa gggaagggac tgaggtggcg gagattcctt 3240 ggacgagtcg gatgaggagt tcgcgaggag gctgatgccg acgggcagaa cttttttttg 3300 atgatttgaa gacgtcaaaa cgtgcttgcc ttgccggccg agatcgtcat gaacaaaacc 3360 accaagaagc aaacgtaaat tcagatgcag tcgcagaaga tttcggtact gaacacccca 3420 tacagtaact tcgtggacca gttacgcaat ggcgaggtca acgtgctggt cgttcgggtg 3480 gagattgatc ctgcaacgcg ttgggccaca ccgttctcgt cgtgctgttc aacattatcg 3540 accgatttct atccatactg ctcgcactcg gagtggccag tcaggggcaa ccgacgatga 3600 ccaccgaaag aatgcgaaaa ttgacacctc gacccgtacc atctggacga cgacgtcaga 3660 tggtacgggg tcaagagcca caacatggcg gatagtgtaa tgcaatttgg tgatattgtg 3720 aaataaagta taattaaatt gagttagttt gattttaacc aactcggtga cgtaaccgcc 3780 gcatcagtta gtggacatcg cccgcaatct atcggcttgc acaagggcat cccagtccac 3840 tcggcggacc gactgcaggg tggggttgcc attcgacgca ccaacgattt cctcgcaatc 3900 tcgtcggatg atgaactcga agtttttatt cacaaattct ggtcacgcac gccgtgatcc 3960 gaaaggctac gtcggagtag tggcagttgc aagtggtaaa ctcgaatcgg ccgttctggt 4020 cttagcgatg ttctggatga acgaacacgt cgtggttccg gaagtccaag cgcaagcgca 4080 ttctcaagca gctgcatcca gttcgcctcg ttcactatta gaaactgtaa tgcgaaaaaa 4140 aatgtgtgtt gcacacggtc aaatgggccg ccaccggccc ggcatcgcca ccacaatgtg 4200 tggccgcgtc tgcaaccgtt gtaccaccag tggctgatcc gtcctgccgg tgaccacaca 4260 aacagtcacc cccatcggtt gacggccacc gaaaattgtt ccgccatcca gtcaccgaat 4320 atctcgtagg ccagcgcgag gtccactcca gggtatcttg tacgcgttgg aaaacgcctc 4380 cgcagggact tttgcttgcc acaaacgcca cgaacaagga cacggaattt tcaatgtttg 4440 atattggaaa gatgtttact tttccaacaa gagaatgaca gtttcaaatt gctaacacgc 4500 aaaaacaaaa ttaggtagaa acggaattat tatgtttatg ccagcatcat tccagaataa 4560 ttctggaatg aacagaatga ttcgaatgaa aatattcgat gtgtcaaccc cctttggttt 4620 actctgaacg atacgaggac ggtagccagc ttggacctgc cgaagcagac aatgcggttc 4680 cctgacctcg cgcgaaagtt cacctatcta cgaggtctgc cagtcagaag ctacgaggca 4740 gccacccctg gaatcctaat cggctctaac aacgcgggac tcattgccac gctgaacctg 4800 cgcgaaggcg aacttggcga tccactagcc agtaaaacac gactgggctg gactatctat 4860 ggatcttctt cagacggaca gagaacgacc aacttcacgc tacacgtctg caactatcag 4920 gaggaccaac atgctgacca ggaattgcac gacctcgtga aacgccactt cacagtcgag 4980 agcatcggcg tctcagcaga caagggccct gaatccgaag aagacaagcg tgctcgcctc 5040 atcttaatgg agacaacaaa acgtgttgcg gatgggtttg agaccggcct tctctggcgc 5100 catgacgaga ccaggtttcc agacagcttc ccgatggcga tgcgccgcct ggagtgtttc 5160 gaacgaagaa tgaaaaatga ccccgagctg caggcaagcg ttcagcagca aatcaaagag 5220 tacctggaga gtagttatat tcacgaggtg acgccggagg agattgaaag cgcaaaccct 5280 cacaaaatct ggtacctccc cctgggcgct gtgcggaacc caaagaagcc aggaaaaatc 5340 cgactaattt gggatgcatc agctaaagtc ggaaacgtct ccctgaacgc aatgctcctc 5400 aaaggacctg acctgctcac ccctctggcc tcggtgttgt gcggctttcg agaacgacca 5460 gtagctgttt gcggtgacct acgacaaatg tttcaccagt tcagaattcg gagcgaggat 5520 gtacacagcc agcgattcct ctatcgggaa cacttcacag aacccgtcaa ggtattcgcg 5580 atggatgtag gaagcttcgg tgccacctgc tcaccctgtc aagcccagta tataaaaaat 5640 ctgaatgcca gagagcacga aaaggaattc cccgaagcgg ccaaagcggt cataaacaag 5700 cactatgtcg acgactacct tgatagtttt gacacagaag ctgaagccat caaagtcgcg 5760 ctggatgtca aggaactgca ctcccgcggg ggattcgaaa taagaaattg gcactcaaat 5820 tcagaagcac ttctggcacg ggtcggggag cccaaaaagg tctccacgag agccatcagc 5880 atcgactcgg agagcgaagc agaacgagtt ctggggctgt tgtggctgcc ggaagaggat 5940 cttctagcat ttgccacaga gctacagcta gacggaattc ctccaacaaa acggaacatc 6000 ctgcgttgcg ttatgagctt gttcgactcc caaggtttgc tgtcccacat taccattcaa 6060 gggaggatga tcatccagga cacctggcgc ggcaagacga attgggacga cgaagtcatc 6120 gaaacaatac gcgtgcggtg gctcagatgg acacagctgt ttaagaaggt cggaggaatt 6180 aggctaaacc gggcgtactt cccaggattc tctgcagcgg agattggcgc ggttgagcta 6240 catatcttca ccgatgcgag cgaggaagct tacgcctgta ctgcctactt ccgggtcgta 6300 atcaacggga aagtgtacgt cacgctggcg atggctaaag ctaaggttgc cccgttgaaa 6360 gcgttgtcag taccacgtct ggagttaatg ggggctctgc tgggggcacg gctggctaaa 6420 gctgtgatgg aataccactc ctttcctatc tgtcgccggg tcttttggac agactcaaaa 6480 acaacgctcg cctggatcca gtcggaacat cgccggtatc gacaatttgt ggccttcaga 6540 gtaggagaga ttttgagcaa gactgatgtc attgaatgga gatacgtccc aactctactc 6600 aatccggctg acgaagcaac caaatggggt aagggaccca gcactgacgt caactcaagg 6660 tggttcagcg gaccggaatt cctctatcta ccagaaacgg aatggccggc gcaagtgtcg 6720 acagaactgg atacggacga ggagcttcgg ccctgtatgg ttcaccggga gcatcaaacg 6780 gaagatgtct acgaaatcaa ccgcttctcc cgttgggagc ttcttcaaag gacagcagca 6840 tacgttcacc gcttcatcgg gaactgcagg cgcaaaatgc aaaaccagcc tctcgcagcg 6900 gggtgcttga aaggggaaga gttgcggaca gcggagcgca gcctgtggcg cacggtgcag 6960 ctgtcagaat atccggacga agttatcgct ttaaaacaag gaggagcacg acccaagaag 7020 accatcgaga agacaagccc cctatataag ctgacaccat acctggatga gcatggtgtg 7080 atgcgagtcg acagccgtat tggagccgtt ccatacgtca acttcgactt caagtaccct 7140 attatcctgc cgcggcgtca ttaccttacc aagctgatcg tcgactggta ccaccgcagg 7200 tacttgcacg ccaatcacag tacagtccac aacgaggttc gccaaagatt tcacgtctcc 7260 agtctgtaca cagttgttct ccaagttgca aaggattgtc agacctgtaa gaataacaaa 7320 gcagttccaa cgactcctcg aatggctcct ttaccatact cgaggctcgc ggccggggtg 7380 cgtccatttt cgtatgtcgg gctcgactat tttggaccca tacatgtacg tgtcggacga 7440 agctctgtaa agcggtgggt cgcccttttt acctgcctga cagtgcgagc agttcacctc 7500 gaagtcgtcc atacactctc gaccgaatca tgcaagatgg ctgtgcgaag gttcgtggct 7560 cgtcgcggat ctccggtgga aatacattca gacaacggaa cgaattttca aggggctagc 7620 cgggagctga aggacgaaat cgaggttatt ggaaaaaact tggccgaaac cttcaccaac 7680 tctaacacaa agtggctgtt cattccaccg tccgcgccac acttcggagg ttcgtgggaa 7740 agactggtga aatccgtcaa ggtggcactt cgttcattgt gcagtgatcg caagcccgac 7800 gacgagacac tgctgactgt ccttgccgag gcagggtcga tagtgaactc cagaccacta 7860 acgaaaatcc cactggaaga cgccagccag gaagctctca ccccgaatca ctttttgctg 7920 ctaagctcga atggagtcat acaaccacct tcgtcactga cgcagccacc gaaggtaact 7980 cgcaccaact ggaacttggc caagcagctc gtggatcaat tttggagacg gtggattagc 8040 gaatacctcc caataatatc caaaaggacg agatggttca gtgaaacgga agctctcaaa 8100 gtagacgatt tggtgctgat cgttaacgaa ggacaaagga atggttggac aagaggacga 8160 gtactgacgg tgatccccgg ccgcgacggc cgcattcgac aggcgacagt gcaaacggcg 8220 gcagggatac tgaaacgacc agtagctaaa ttggcagtgc tgaacgtcca ggacagctgt 8280 aacgcaacag taccagcgga accggaaagg cgttacgggt cggggga 8327 // ID EdSINE1 repbase; DNA; INV; 601 BP. XX AC . XX DT 11-JAN-2007 (Rel. 12.01, Created) DT 24-JAN-2007 (Rel. 12.01, Last updated, Version 2) XX DE Entamoeba dispar SINE1 consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; Entamoeba; KW Entamoeba dispar; UEE-1; EdSINE1. XX NM EdSINE1. XX OS Entamoeba dispar OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-601 RA Sharma R., Azam A., Bhattacharya S. and Bhattacharya A.; RG School of Life Sciences, Jawaharlal Nehru University, New Delhi, RG India; RT "Identification of novel genes of non-pathogenic Entamoeba dispar RT by expressed sequence tag analysis."; RL Molecular & Biochemical Parasitology 99(2), 279-285 (1999). XX RN [2] RP 1-601 RA Shire A.M. and Ackers J.P.; RT "SINE elements of Entamoeba dispar."; RL Mol Biochem Parasitol 152(1), 47-52 (2007). XX DR [2] (Consensus) XX CC This 601 bp sequence is the consensus sequence of a family of CC SINEs (Short Interspersed repetitive Elements) in the genome of CC the protozoan parasite Entamoeba dispar. PCR primers based on CC this sequence amplify multiple members of this SINE family. CC Northern blotting and cDNA clones confirm that they are CC transcribed but the transcripts are full of stop codons and there CC is no evidence to suggest that they are transcribed. The 5' and CC 3' ends of these E. dispar SINEs are very similar to Entamoeba CC histolytica SINE1s but the remainder of the sequence is very CC different. XX SQ Sequence 601 BP; 253 A; 84 C; 94 G; 170 T; 0 other; ggattcgaag gtagcacgtc agagacacca cacagaaccc taaaatattt ccatccttcg 60 attcccccca gttatggact ggttatgacg ttgcgtttgg ttagaaaaac atcttaccca 120 gtactgcaag aatagcagtg tttattatgt acaatgacag aatatcattg acaataaacg 180 aatgggtatt tataaagatg ttattattac ttgttagaat acaagtataa tttgataata 240 taagaaagat tatcaaagga aaagggtcaa gtggttgagg ctaaaaagaa tattagccca 300 tagcaataca gtagtaatta tatactctcc ttaaaataag aaaataaaga agaagaaatt 360 taaactcatc aaaattaaaa agaaattaaa taatgaatat ttaaataatg gatgtttaaa 420 atactctatt ccacatcaaa aatgagaaaa gaaaattctc acagttcaac aactaaaagc 480 aaagaataag ctaatagttg aagaacaaat acatttaaag cgaagggatg ggattagtct 540 cccctgagca aggaacaata gaaaatattt ctattaatac ttaattaact actttttatt 600 t 601 // ID Gypsy13-LTR_Dpse repbase; DNA; INV; 374 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13_Dpse; KW Gypsy13-I_Dpse; Gypsy13-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-374 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1090-1090 (2009). XX DR Genome; Unknown_singleton_87; Positions 11255 10882. XX SQ Sequence 374 BP; 86 A; 83 C; 108 G; 97 T; 0 other; tgtaacgctt ggatcagcga gcgagctagt tcgcagcgcc atctagtggt cggcaccgta 60 agggccgtgg tagaccaaac accctctatg ggctaggtcg gggagatcac gaaaagtggc 120 ataggaagag tgatagagtt aacagcttgg attgtcaact ttctctggca tttcaccctg 180 tcctttgaga gggcccgagg gttgtagctt gttgaggggc ttggcttcgg tctcttcccg 240 ttttggcgct cgacccgagc gttcgttttc tctgtcaaca gccaacaagt attttttata 300 agtgctaaaa aggaaaaaaa cgtgtaatcg cggcgggagg agaagagctt cagcattccg 360 gcattagttt tcca 374 // ID Gypsy-109_AA-I repbase; DNA; INV; 4636 BP. XX AC AAGE02027654; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-109_AA_; KW Gypsy-109_AA-LTR; Gypsy-109_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4636 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027654; Positions 20580 15945. XX CC Positions [2018-2638] - Reverse transcriptase CC Positions [3644-4105] - Integrase core CC 'CACAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 950..4606 FT /product="Gypsy-109_AA-I_1p" FT /translation="MLRCQEQSNKCDFGKSAEESRSISVIDKVILFAPSDL FT KEQLLQKDNLTIDDVTKIVSSYESVKHQARAISSSTGGVSENAMFGHESAS FT GVNRIRPSTSKECTRCGRKGHYANDISCPARNKECNKCRRSGHFANQCRTN FT VSLKRKFDESAISNKRLKAERIREIRSNQGDTKDRSFIFNINDGDELLWMK FT VGGVLLQVMVDSGCKRNIVDERSWNYLKANGVKTWNQKKDCDEVFLPYGAN FT AKPLTVLGAFDTSVVVEDAGKVMERVATFYVIQGGQQCLLGRTTATELGVL FT FVGLPSKQGVNSVRIDEKRPFPKIKGVKVTIPIDKTVPPVCQHPRRPPIAL FT LAKIEDKLNSLLTNDIIEPVEGGCQWVSPLVTVIKDNGDLRLCIDMRRANV FT AIVRERHMMPTIDDFLPRFTGAKYFSRLDIKEAFHQVELDEDSRYITTFIT FT HMGLFRYKRLMYGIVIAPEIFQRILEQILSRCSEHTVNFIDDILVFAETES FT EHDAALDLVLSTLKEYGVLLNQAKCVFKVQRLDFLGHILSSEGIQPALDKI FT AAIQQFRPPSSQEEVRSFLGLVTYIGRFLPNLATVTAPLRELTCSGIKFEW FT GKDQQRAFSILKEMISNVNLLRFFNNSLRTRVIADASPVALGAVLIQFEGT FT TDDDPRPIAYASKSLTQTERRYCQTEKEALALVWAVERFSPYLLGRKFELE FT TDHKPLEAIFKPTSRPCPRIERWVLRLQSFTFVVKYRKGKSNIADPLSRMV FT EPGITEDFDAENKFMVLAVLESAAIDVQELEESARMDRVLDLVKTSLQTGN FT WDESDVKPYVPFKNELGMVGDTIVRGGKLVVPTNLRQRMLDLAHEGHPGES FT VIKRRLRDRVWWPGMDREVSNRVAVCEGCRLVGLPSKPEPMSRRPLPSKPW FT VDVAIDFLGPLPNGMYLLVIIDYYSRYKEVELMSRITAKDTVCRLDKIFTR FT LGYPRTITLDNAKQFVGIEFEEYSKKHGIHLNHSTPYWPQENGLVERQNRS FT LLKRLQISNALGRDWQQDLQDYLVMYYTTPHSITGRTPTELLYGRTIRSKI FT PAVDDIETAPPLTDFADKDLESKEKGKDQEDKRRKSKRSSVEIGDSVLMQN FT LLPGNKLQTTYMPKKYVVVSRSGPRATVEDPESGKSYDRNVAHLKKLKDPV FT EPPTSLNSGDSTNEADCELNVASSEDEFYGFEDELPSKVAPPERQQRVRKK FT PAKYVDYQL" XX SQ Sequence 4636 BP; 1345 A; 972 C; 1186 G; 1133 T; 0 other; ttggcgactg aggaaaactg gaaagcagga gaaattggaa aaattaggta tgaattcagt 60 aatttatcag aacgattatt cgaacgggaa aagtttgttt tgcgaaatgg aggaaaatga 120 ggttatgttt acatttattt ttctttcttc agactgctat actacgctcg tagtggctgt 180 tccgcgcagc ggattttgta tgctaccaaa gagtagcggc tgtttatcag cgatttacta 240 cgagaaggat aagactactt aaaagtagtg gctgtcttga acagcgatag tggtaggcta 300 ccaagcggta gcggctgtaa acaatcagcg acatttaccc aaccactagt acggaaaggc 360 taccggctgt agcggctgtg gaacagcgtt tgacagacca caaaggtggc cttagagcac 420 aaggaaggct acgtgaaagt agcggctctc agagcggtgt gtaaccacac agtggctgca 480 gctctaaggc ggaaaatcta gaaattgacg gtaaggttga gtttaagata agaaatgaac 540 gaggaaataa tcccaagctt aattttagtt attctgaaac cgttgaaacg ttgcaatgga 600 aaagtgggac attcctcaat tccatttcaa agctcttgag aggaatgcgg tgcgcaacga 660 gtggatgaaa tataagcgaa atttcgattt catagttgct gccactggag agactgataa 720 aactcgcatc cgaaatattt tccttgcgaa gggcggaccg gacctccagg aagtattcac 780 gtctattccc ggagctgacg tgcaggatga tgctgaaaac ggagttgatt cctatgcagt 840 tgctattgcc aaactagacg cttactttgc gcccaaacaa cacgatacat ttgagcgcaa 900 cttgttttga acattaaaac cgagtctgga cgagactctg gtgaaattta tgctccgttg 960 ccaagagcaa tccaacaagt gtgatttcgg gaagtccgct gaagaaagcc gatcgatcag 1020 tgttatcgac aaggttatcc ttttcgctcc tagcgatttg aaagagcaat tgctgcaaaa 1080 ggataatctg accatcgacg atgttacgaa aatagtgagc tcctacgaat cagttaaaca 1140 tcaggctcga gcgatcagct cctcaactgg tggagtttct gaaaatgcca tgtttggtca 1200 tgagtcggcc agtggtgtta acaggattcg cccctcaaca tcgaaggaat gcacacgctg 1260 tggaagaaag ggccactatg cgaacgacat cagttgtccg gcgagaaata aggagtgcaa 1320 caaatgcagg cgtagtggac acttcgccaa tcaatgtcga acaaatgttt cccttaagcg 1380 taagtttgat gagtcggcta tcagcaacaa acgcctaaag gcagagcgta ttcgcgagat 1440 aagaagcaac caaggcgata caaaggatcg gagcttcatc ttcaacataa atgatggtga 1500 cgaattgctt tggatgaaag taggcggcgt actgctacag gtgatggtcg actcaggctg 1560 caagaggaat attgtagacg aacgatcgtg gaattactta aaggcaaatg gtgttaaaac 1620 atggaaccag aaaaaggact gcgatgaagt attcctccct tatggtgcaa atgcaaagcc 1680 cctgacggta cttggtgcgt tcgacacttc tgttgtggtt gaagatgccg gcaaggttat 1740 ggagagggtt gccacttttt acgtgataca aggtggtcaa cagtgcctac tgggtagaac 1800 aactgctacc gaattgggcg ttctgtttgt tgggttacca agcaagcaag gtgtcaactc 1860 agtgagaata gacgaaaagc gtccgtttcc taaaatcaaa ggggtaaagg tcaccattcc 1920 gatcgacaag accgttccac ctgtctgtca acacccaagg cggccaccga tcgctctact 1980 agcaaaaatc gaagacaagc taaactccct tttgacaaat gatatcattg aaccggtgga 2040 aggcggttgt caatgggtgt cgcccctggt aacggtgatc aaagacaacg gtgatttacg 2100 cttatgcata gacatgcgcc gggcaaacgt cgcgattgtg agagaacggc acatgatgcc 2160 cacaattgat gactttctgc cacgcttcac gggtgccaag tattttagcc gtctcgacat 2220 caaagaagct tttcatcaag tagagcttga tgaagacagc cgttatataa cgacgttcat 2280 tacacacatg ggtctgtttc gttacaaaag gcttatgtat ggaatcgtga ttgccccgga 2340 gatatttcaa aggatactcg agcaaatact gagccgatgt agcgagcaca ccgtgaattt 2400 tatcgacgac atcttagtat ttgctgagac cgaaagcgaa cacgacgcag cgcttgattt 2460 ggttctgtct acccttaagg agtatggagt gttgttgaac caggccaaat gcgtattcaa 2520 ggtccaacgg ttggatttct tgggacacat cctttcctca gaaggtatac aacctgcgct 2580 ggataagatt gctgctatcc aacaattccg cccaccttct agtcaggaag aagtccgtag 2640 tttcttaggc cttgttacgt atatcggacg tttcttaccg aatttggcca ccgtcactgc 2700 tccgcttcgg gagttaactt gttctgggat caaatttgaa tggggaaaag atcaacagcg 2760 agcattttcg atactgaagg aaatgatttc aaacgtaaac ctactgcggt tcttcaataa 2820 ctcattacgt acacgtgtta ttgccgacgc ttcgccggtt gcattaggcg ctgttttaat 2880 acagtttgaa ggcacaacag atgatgatcc tcgtccaatt gcgtacgcga gcaaaagcct 2940 gacccaaacc gagcgtagat actgccaaac cgaaaaagag gctttagctc tggtttgggc 3000 agtggagcgg ttttcgccat atctactggg ccgtaaattc gaactggaga ctgaccacaa 3060 acctcttgag gctattttta agccgacatc acgaccgtgc cctcgcatcg aaagatgggt 3120 ccttcgactc cagtctttca cgtttgtggt taaatatcgt aaaggaaaaa gcaacattgc 3180 tgacccgctt tctcggatgg ttgagccagg tattactgag gattttgatg cggagaataa 3240 attcatggtt ctagcggtgt tggagtcggc ggcgattgat gtacaggaat tggaagagtc 3300 tgcaaggatg gatcgagttc tggatctagt gaagacatct ttacagacag gaaattggga 3360 tgaatcggac gtgaagccat acgttccgtt taaaaatgag ttgggtatgg taggagacac 3420 aattgtccga ggaggtaaac tggttgttcc aacgaatctg cgtcaacgaa tgttggattt 3480 ggcgcacgaa gggcatccag gagagtcagt tattaagcgc cgtctccgag accgagtatg 3540 gtggccaggg atggaccgtg aggtcagtaa ccgagtagct gtttgcgagg gttgcagatt 3600 ggttgggctt ccaagtaaac cagagccgat gtcacgtcgt ccgcttccta gcaagccgtg 3660 ggtggacgtg gctatcgact tccttggtcc actaccgaac ggaatgtatt tgcttgtcat 3720 aattgattat tacagtcggt acaaggaagt ggaactgatg agcagaatca cagcaaagga 3780 tacggtttgc aggttggaca aaatatttac gcgattggga tacccacgga caattacttt 3840 ggataacgcg aagcaattcg tgggcatcga attcgaggaa tacagcaaga aacacggcat 3900 tcacctcaat cactccacac catactggcc ccaggaaaac ggcctcgtgg agaggcagaa 3960 tcgatcactg ctaaagagac tgcaaataag caacgcttta ggcagggatt ggcaacaaga 4020 cctacaagat tatttggtaa tgtattacac taccccgcat tctataacgg gccggacgcc 4080 aacggaactc ctgtatggcc gcacgatacg atccaaaatt ccggcggttg atgacatcga 4140 gacagcccct ccattgaccg attttgctga caaggatttg gaatcgaagg aaaaagggaa 4200 ggatcaggag gataagcgac gaaaatcaaa gaggtcatcc gtcgaaattg gagactcagt 4260 tctaatgcaa aatctgctac caggtaataa gctgcagacg acctatatgc ccaagaaata 4320 cgttgtggtt tcccgatctg gtccacgcgc tactgttgag gatcctgaaa gcggcaaatc 4380 atacgatcgg aatgtcgcac atcttaaaaa gcttaaagat cccgttgaac ctcctacatc 4440 tttgaacagt ggtgattcaa caaatgaagc tgactgcgag ctgaatgttg cgtcgtctga 4500 agatgagttc tacgggttcg aggatgaatt gccatcgaag gtagctccac cagaacgaca 4560 acagcgtgta aggaagaagc cagctaaata cgtagattac cagttgtaac tttgaatttt 4620 gaaaaaaggg gagatg 4636 // ID CR1_Ele8 repbase; DNA; INV; 4798 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele8. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4798 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4798 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 15 CC sequences with >96% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 2..856 FT /product="CR1_Ele8_1p" FT /translation="MSVVCAACIGDATESHVRCRGFCTETFHPRCTGISAD FT TFEEVMKNQQLFWLCPSCTSLMKDIRFRNTARAAHDFGHDQAIHSQSDTLQ FT HLKSEILMELKNEIRTNFATLINSNSFTPKTTQRVKMDALPTTRGRRLFST FT ANLSAPKSKPVLLQGTGSTLSPSTEIQTVPAPQPKFWLYLSRVARDVSVEQ FT ITALACQRLNTADIQVIRLVAKGKDISTLSFVSFKIGMNAELKPKALSTST FT WPKGLVFREFSNDNATGNFWRPRPPFAHNDPLSIPMEADNMRME" FT CDS 766..4596 FT /product="CR1_Ele8_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="CNWKFLATKTSICTQRSAEYSDGSRQHEDGIEDIQPA FT TVFDIYDATLLPTHHHQPPGRTLTASLMEAPDLLNTVEPLLPAINSRPGPV FT FETVEGVFQNRNSGKYVFDTNCSPPEKHLNSSHYATCTTLSSTLFHQPPGR FT KLASSLMGAPILLNTVEPLQPAFNSRPGPVFETEEGVFQNRNSGKYDFDTN FT CSLPETPLDSSHNASNLTRSSTLQYHPPGRTLASSLMEASNPLNTVEPLQP FT AFNSRPGPVFETGEGVFQNHHKCKYDIHTNSALPVILPASSLSSDGGQDCL FT FQDVARVPTSATTVPFAQHISAGSNRSDVHHSIRGYSSSESHSELRVYYQN FT VRGLRTKIDQFFVAVADAEYDIIVLTETWLDDCIFSPQLFGNAYSVYRTDR FT SPLNSRKTRGGGVLIAVSSCLQSCIDPSSVSSSLEQVWVKVRLRGFAVSIG FT VLYLPPDRKNDLLDIQQHLESVESVMSGLEQHDFAILFGDYNHPEMLWSPS FT EDGGFKIDSGRTSLSASSSALYDGFCFHGLTQVNSIKNAYDRILDLVLVNE FT RTLPVCRLSQAVESLVVPDTAHPALEIGIRITTPVKFEHVLECRRFDFHRA FT DYDVLSNELASIDWQFLDSVRSLNDMVDTFSNEVLRIIERHVPVQRPPLKP FT PWTNPYLRKLKKRRGSALRKYRRSRVPLLKRRFVLASNKYVSYNRLSYVRY FT VDRTQRNLRLYPKRFWSFVKSKRKEDGLPSVMHYEESTVETAAEKCNLFAQ FT HFKAAFSDVDVSSSQVAEALRNTPSESIDFGIFEITEDHVLEALRKLKYSA FT SPGPDGIPSCLLKRCSAALLVPLMKLFNCSLQRGTFPAAWKTSLLSPIFKK FT GDKCDIANYRGITSLTACSKVFEIIVNNALFESCKNYISTAQHGFFPKRSV FT NTNLVEFTSLCIRSMDAGKQVDVIYTDFKSAFDRVNHAILLRKLEKLGMSP FT GLVRWFESYLSGRSMKVRIGAAHSDSFAIPSGVPQGSNLGPLLFSLFINDV FT AFIVPRGERVLYADDVKIFMVVNSVEDCLALQATLNSFECWSRRNKLELCV FT RKCYSITFSRKHNPIHFDYSLSGQTLERKNEVKDLGVTLDNELTFRPHYED FT VLSRANKQLGFIFKIAEGFSDPLCFKSLYCALVRSILEAAVVVWCPYNRVW FT IDRFEAVQRKFVRLALRNLPWRDRIHLPPYEHRCMLLGLDTLESRRVSMQS FT SFIGKIFKGEIDAPAILHELRIYVPERPLRRRNFLNMDARNSRYGQHDPIR FT YMVARFNEVFEQFDFI" XX SQ Sequence 4798 BP; 1284 A; 1096 C; 1030 G; 1387 T; 1 other; catgtctgtg gtttgtgccg cctgtatcgg tgatgcgacg gagagtcacg tgcgatgccg 60 gggattctgc actgaaacgt ttcatccaag atgcaccggt atatctgctg acacttttga 120 ggaagtgatg aagaatcagc aacttttttg gctctgtcca tcttgcactt cgttgatgaa 180 agatatccgt tttcggaaca ctgctcgtgc ggcacacgat tttgggcatg atcaagccat 240 tcattctcaa agtgacacgc tgcaacacct caaatcagaa attttgatgg agctgaaaaa 300 tgagattcgc acaaattttg ctacattgat caattcaaac tcatttactc cgaaaacaac 360 tcaacgtgtt aaaatggatg ctttgcctac aacaagagga cgtcgacttt tcagcacggc 420 aaatttgtct gcgccaaaat caaaaccagt acttttgcaa ggaactggca gtacattgtc 480 cccatctact gaaatacaaa ctgttccagc accacaaccg aaattttggc tttatctgtc 540 tcgtgttgca cgggatgttt ccgtagagca gataactgcg cttgcatgtc aacgtctgaa 600 taccgctgat attcaagtca ttcgcctggt tgctaaagga aaagatatta gcaccctgtc 660 atttgtatca tttaaaattg gtatgaatgc agaattaaag ccaaaggcgc tttccacctc 720 cacatggcca aaaggacttg ttttccgaga attttccaat gataatgcaa ctggaaattt 780 ttggcgacca agacctccat ttgcacacaa cgatccgctg agtattccga tggaagcaga 840 caacatgagg atggaataga agacattcaa ccagccacag tttttgatat ctacgatgct 900 actttacttc caacgcacca ccatcagccc ccgggacgca cacttaccgc aagcctcatg 960 gaagcccctg atctgctcaa cacagtcgag cctctcctgc cagcgatcaa cagccgtccc 1020 ggtcctgtgt ttgagacagt tgaaggggtc ttccaaaatc ggaactcagg caagtacgtc 1080 ttcgatacga attgttcgcc gcctgaaaaa catcttaatt ccagccatta cgctacgtgt 1140 acaacattat cttcaacgct attccatcaa ccaccgggac gcaaacttgc ctcaagcctc 1200 atgggagccc ctattctact caacacagtc gagcctctcc agccagcgtt caacagccgt 1260 cccggtcctg tgtttgagac agaagaaggg gtcttccaaa atcggaactc aggcaagtac 1320 gacttcgata cgaattgttc actgcctgaa acgcctctcg attccagtca taacgccagt 1380 aacttaacaa gatcttcaac gctgcagtat catccaccag gacgcacact tgcctctagc 1440 ctgatggaag cctctaatcc gctcaacaca gtcgagcccc tccagccagc gttcaacagc 1500 cgtcctggtc ctgtgtttga gacaggagag ggggtcttcc aaaatcatca caaatgcaag 1560 tacgatatac acacgaacag tgcgcttcct gtaatattac ccgcttctag cttgtcgtct 1620 gacgggggcc aagattgtct cttccaagat gttgctcgtg ttcccacttc tgccactact 1680 gtgcctttcg ctcagcacat atcggcggga tcaaaccgga gtgatgttca tcattcaatt 1740 cgtggatact cttcatctga atctcattca gaactgcgag tgtattacca aaatgtcaga 1800 ggacttcgca ccaaaatcga tcagtttttt gtagcagttg cagatgcaga atatgacata 1860 attgtactga cggaaacttg gctcgacgat tgtatctttt cacctcagtt atttggtaat 1920 gcatactcag tctacagaac tgatcgcagc ccacttaaca gcagaaaaac tagaggtggt 1980 ggggtgttaa tagccgtttc gtcctgcttg caatcctgta ttgatccatc ttcagtgagc 2040 agttctttgg agcaagtctg ggtgaaggtc cgtttacgag gttttgctgt tagtattgga 2100 gtactttatt taccacccga ccgcaaaaat gatctgctag acattcagca gcatttggaa 2160 tctgttgaat ctgttatgtc aggtttggaa caacatgatt ttgccattct tttcggcgac 2220 tacaatcatc ctgagatgct gtggtctcct tctgaggatg gaggctttaa aattgactcc 2280 ggacgcacaa gcctcagtgc ctctagtagt gcgctgtatg atggtttctg cttccatggc 2340 ctaacacaag tcaattcgat aaaaaacgcg tacgacagga tcctagattt agtgctggtg 2400 aatgaaagaa cgctacctgt atgtcgtctc tctcaagcag ttgaatcgct cgtggtaccc 2460 gacacagctc acccagcctt agaaatcggc attaggataa ctacgccggt gaaatttgag 2520 cacgttctcg aatgtagaag gtttgatttt catcgagcag attatgatgt gctctccaat 2580 gagcttgcat caatcgattg gcaattcctc gactctgttc gcagcttaaa tgacatggtc 2640 gacacattta gcaatgaagt tctgcgaatc atcgaacgac atgttccggt tcagcgccct 2700 cctttaaaac caccttggac taatccttac cttcgtaaac tgaaaaaacg taggggatca 2760 gcactgcgga aatacagacg cagtcgcgta ccacttttaa aacgacgctt cgtgctggcc 2820 agcaacaagt atgttagcta caatcgtctt tcatatgttc gmtatgttga tcgaactcaa 2880 cggaatcttc gtctttatcc aaaacgtttt tggtcttttg tgaagtctaa gcgaaaggaa 2940 gatggactac cgtctgtaat gcattacgaa gaatctactg tggagaccgc tgctgaaaag 3000 tgcaaccttt ttgcacagca tttcaaggct gctttttctg acgtagatgt ctcgtcatct 3060 caagtagccg aggccttacg caatactccg tcggagagca tcgatttcgg tatctttgaa 3120 ataactgaag atcacgttct tgaagcttta cgaaagctta agtactcagc atcacccgga 3180 ccagatggaa tcccgtcatg tttgctgaaa aggtgttcag ctgccctttt agtgccatta 3240 atgaaattgt tcaattgctc gttacaacga ggaacctttc ccgctgcgtg gaagacttct 3300 ttgctttctc cgatttttaa gaaaggcgac aaatgtgaca ttgccaacta cagaggaatt 3360 acatctttaa cagcatgttc aaaagttttt gaaataatag tgaataacgc tcttttcgaa 3420 tcatgcaaga actacatttc tactgctcaa cacggattct ttccaaaacg gtcagtgaat 3480 actaacctag tcgagtttac ctcgctgtgc attcggtcaa tggacgcagg aaagcaggtc 3540 gatgtcatct atacggattt caaatcagca tttgatcgtg tgaatcatgc tattctcctt 3600 cgaaagcttg agaagttggg gatgtcgcca ggcctcgtta gatggttcga atcgtatctt 3660 tccggccgat ctatgaaagt acggattggg gcagctcatt cggattcttt tgctattcct 3720 tcaggagtgc cacaaggaag taatctagga cctctcttgt tctctctgtt tatcaacgat 3780 gttgccttca ttgtgcctcg cggagaacga gtcctttatg cagatgacgt gaaaattttc 3840 atggtggtca atagcgttga agattgtttg gcattacagg cgaccttaaa ttctttcgaa 3900 tgctggagca ggcgtaataa acttgagctc tgcgtaagga aatgctactc aatcactttt 3960 agcagaaaac ataatccaat ccatttcgat tattcgttgt ccggtcaaac attggaacgt 4020 aaaaacgaag ttaaggatct tggagtgaca ctggataatg agctgacatt caggccgcac 4080 tatgaggacg ttctttctag agcaaacaag cagcttggtt tcatatttaa gattgcagaa 4140 ggtttctcgg atcctttgtg cttcaaatca ttatactgtg ccttggttcg ctctattttg 4200 gaagctgctg ttgtagtttg gtgcccgtac aatagagttt ggatcgatcg tttcgaggct 4260 gtgcaacgga agttcgtgcg actggctctc aggaatcttc cgtggcgtga tagaattcat 4320 cttcctccgt atgagcaccg atgtatgctg cttgggttgg atactttgga aagtaggaga 4380 gtttcaatgc aatctagttt cataggaaag atcttcaaag gcgaaattga tgctccagct 4440 atactccacg agttgcgtat ctatgtccca gaacgccctt tgagacgacg caactttctc 4500 aatatggatg cgcgcaatag ccgatacggg caacatgatc ctattagata catggttgct 4560 cgttttaatg aagttttcga acagtttgat tttatttaat gtatttatct tgtaatatta 4620 ggtttttaac actttgtttg ttagttttag attttaagtg tcttttgttg taatttgttg 4680 tgttacttgt ttcaaaaaga tgcgggtttt atgcctattt caggctttcc cagccatact 4740 tcattaagac ctaggtcaga tgaagaaata catataaata aataaataaa taaataaa 4798 // ID Shinagawa-4_AAe repbase; DNA; INV; 2556 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 2) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2556 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 841-841 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. 8-bp TSDs. TIRs are ~110 bp long CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. CC The region 2229-1960 is an inserted FEILAI-1B-like element (~84% CC identical to the FEILAI-1B_AAe consensus). XX SQ Sequence 2556 BP; 822 A; 478 C; 409 G; 847 T; 0 other; gattgtatga catttgccag aaaaccattt gccagaatca atttgccaga atgatttttg 60 ccagaaaacc attccccaga atgtaccatt cgccagaaag ccattcccca gaatggacca 120 tttgccagaa aaccattccc cagaatcatt ttttgtaatt cgttttcaac cttgatatca 180 attgatcgac atatttgtga agatgcatcg aagccaaacc tcaaattttc aagagcgcaa 240 atctagagac ccaaacagcc attccagctg aaaatttaat cgattggtca ccaccagcgt 300 gtgaccaatc gattaagttt tcagctcaaa ccgctggctg gttctcgagg tttatgctct 360 tgaaaatttg aggtttggct tcgatgcatc ttcacctcaa gcctaccgaa attttcgata 420 tttgtcattt ttgtatcttg aatatactca tattatttgc acttatagca tgaaacattg 480 tgtagtcaac tttcacatta tttacatcaa gaggattgga ctactagcac tgcacagctg 540 cagcgcaaaa ttaattaaaa tttagcgtca cttctacagt acttcactcg catcgtaaat 600 attgctccaa ataatgttta tatctttgta atattataac aagtataaaa caacgcaact 660 gatcgatccg tgttatttag tctatgttgt taaaatcctt cgcatacaat tttatgcaat 720 tttcactact tacatagctg gtctgagaac aaaaaagctg aaacgtttga cacgaattag 780 tttagttgaa taatttcaga aatatgaata gtgtatcttt gcggaaaaaa attgcaaaac 840 aactaaaaag aacagccttt gagtgaaaga agggaaaatt atgtttgaaa tatgatcagt 900 ttagcgccaa taatcattgg aacaatataa ctcaaatgac tacccaatgt tctttcggtc 960 atatatttaa atacattatt cagtccttga aaagaacgtt aaactgtttt actcttaata 1020 ataatttgaa ccgcattatt tattgtttta aagtagtttt acaacatttg cgacaggctg 1080 ccggaaaact cttcaatctt cagtgtttac cctgattgct agttttagct atctggaacc 1140 ggagatattt tgaaattctt agcaggactg ctacgtggcc atgattaatg gtggaaacca 1200 aaaaaacaca catatattgg tttctcatat tctgtcgctt gctctacgaa acaagtcggt 1260 tatgaagatt aatgttttcg gaatcactcg agtaatcttc aatgagaaat tagatgaagg 1320 tcatggtaaa tattccatta ttgaatcact ttcaatttag ttttttttta tttgaggaca 1380 tatattgatt gtccatttcc cgatatttgc atttttttac ataaatcctt tttacagaca 1440 tttttattga cagcagtcct atattgattt ttttagctgt aattatttaa aattgagatg 1500 acataacctt gtttacatta cttccattcc ttctgttaac tgcaggcaga aacacacagt 1560 acaattataa agactagcca ataatataaa gaagggtaaa tctcctatga agcatataag 1620 tccttgaatt agttagccct aaagaacagc ctaatcttca aagaagggaa aatcattaat 1680 agaatgttga aaatacagtt gcctataagg ggttgtccat taacaacgtg gtcaattctt 1740 tcggattttt tacctccacg gggtctttcg tccatacaaa aatttaaaaa gaattgtttg 1800 tatcgtggtc attggccagg cccccttcca tcctcccata aatgatcaca tggttaatgg 1860 acgaccccta atagcttgta taaaattgat gactatttta acattaaaaa aaaacagtta 1920 tttctcagaa actatgaaaa cctcggaaat ctattacacc ttcttcttct tatcttgtgt 1980 tacgtcccaa ctgggacaga gccaaatctc agctttgtgt tcttacgagc acctccacag 2040 ttattaactg agaactttta ttgccaattg atcatttttg catgtgtata acgtattgcc 2100 gatatgaata tgctctatgc cctgggaaat cgagaaaatt ttcaatccaa taaaatcttc 2160 gcccggtgga attcgaactc acgtccgtca gcttgctgaa tagctgcatg tttaccccta 2220 cagctatcta tataataagt ttatgatatc attattagtt taaaacattt tatcaaacaa 2280 aaagcttaaa aacttacgtc acaattattt tgcatataat tatgtatcgt acgctgaaaa 2340 tcataatgat tcgaaagatt ttacacacct aaatattgct acacaaagca aagtaatgta 2400 gtaatatttt ttctgggaaa tggtctttct ggggaatgac tttctgggga atggtccatt 2460 ctggcaaatg gttttctggc aaatgatcca ttctggcaaa ttgattctgg caaacggctt 2520 tctggcaaat ggttttctgg caaatgtcat acaacc 2556 // ID Gypsy-253_AA-LTR repbase; DNA; INV; 196 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-253_AA_; KW Gypsy-253_AA-I; Gypsy-253_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-196 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1108-1108 (2011). XX DR [1] (Consensus) XX SQ Sequence 196 BP; 55 A; 43 C; 32 G; 66 T; 0 other; tgttgtgatt cgtcttacct actctatatg taccttgaat tcgattcacc tatgttacac 60 tacctgtcta tcaaactacc aaccatatga ctgtcaagag aaaatgaaat aaagtgtcat 120 tctttgttgt gaacaagcac gagtaagacg ttatttattt gtctccgtga ttcggccaat 180 ccctgcgttt acgaca 196 // ID CR1_Ele2 repbase; DNA; INV; 4005 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4005 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4005 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (07-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is ~100% CC identical to the original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 226..1056 FT /product="CR1_Ele2_1p" FT /translation="MESICFACTNTVKKEDEILCNGFCRSSFHLQCVHQSE FT EIRNTIADCSQLYWMCKACSKMMSNANFRQAISSTNNVISLMTDEYSKALD FT ELRQEIANNTFKINTILQRTPLPSTPHIPRSQLSVPSRKRPRLIADDSSHH FT DNTSVGTRELAPDESIPLAAPRKADQFWLYLSGFDPQATVPQIEKLVKNNL FT NTDKSFDIVKLVPKGKTLDELTFVSFKVGMDLQLKDTALSGSSWQKGIIFR FT QFDFNHSTSTRQTFRFRPSPRDDHSQQSPNITSYHQ" FT CDS 899..3952 FT /product="CR1_Ele2_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KTPHYPAHRGRRELSSVSSILITPHPLVKRFASDLHQ FT ETTIHNNHQTSHRTINERSSAQLRNTQHTTTPRETITLYYQNVRGLRTKTN FT DVFLALSESDYDIIAFTETWLNNDIHNSELTHNFTIYRCDRNASNSCFQRG FT GGVLIGVRNCLQSTSLTIAECECLEQIVVRITLPDFDLFVCAIYLPPNSDT FT ILYEQHSACVQHLLGLAGDHGRVVVIGDYNLPHLCWMFDEEISSFLPVNAS FT SEQELALIESVVGCGLHQINDLTNANRRLLDLAFVNDDKYVELLEPPQPLM FT KMDNHHKPFVLKFESTMRSECDALDPFDFDFKQCDFSSLNEQIAAVEWNEY FT LNGCHIDEAVATFYDQVYGILRETVPLNQRRRGIGKRQPWWNNELQRLRNR FT LRKARKRYFRSKNEQHKVELRDLESEYNSLHARCFSCYIRRTEENIKHDPK FT SFWSFVRSRKQTNSIPQHVFYKDVNAETPSDSANLFSSFFRSVLSNNRSPS FT SEEYLSSLPRYDLNVAVFNFSTRDVCCKLRGIDDSKTAGPDRLPPLFVKNC FT ADSLTVPATILFNRSLSEGTFPCVWKNAAITPIHKSGNMNNVENYRGISIL FT SCLPKAFESLIHDSLYPHVQHIISEFQHGFVKKRSTTSNLMSYVSSLVKAM FT EKRQQIDAVYIDFTKAFDRVPHSLSVKKLDKMGLPSWLTRWILSYLTERSA FT HVRIAGISSDPFEITSGVPQGSHLGPLLFVLFVNDLCEVIKSQKLMFADDL FT KLFRVVSSLTDCCAVQADIDALVRWCMLNGMEVNVQKCCVITFSRKRNVID FT FGYRMSATNITRVNTVKDLGVLLDSKLNFAQHIAATTAKAYAVLGFIKRNT FT QEFEDVYCLKSLYCSLVRSILEYGVLVWAPYHTTQSNRIERIQRNFIRYAL FT RRLPWTDPVRLPPYEHRCALVHLETLACRRVLLQRLFVFDILSNNVDCSVL FT LTNINLNVPARTTRQADFLRFPVHRTLFGQNNPFYVCCRRFNEVSDRFDFN FT MSKLTFKRSISS" XX SQ Sequence 4005 BP; 1108 A; 921 C; 841 G; 1134 T; 1 other; tgtgtgtgta aggtaaacaa ttcgtttaac agcgttttta ctctgtgcta attctgtgtt 60 ttttccacgt acctgtgtct ttttctgcga attcaactgt acactggacg catctttcgt 120 cgttttaacg gtgaaaactg gagttcaacg attatctgct ttaactgagc tgaatttctt 180 aaaataaagc tacatctcaa gagtgaacgg cttcaactat cagcgatgga gtctatttgc 240 ttcgcttgca ccaacaccgt caagaaggaa gatgagatat tatgcaatgg tttctgcaga 300 tcttcattcc atctgcaatg cgtgcatcag tccgaggaaa ttcgtaacac gatagctgat 360 tgctctcaac tgtactggat gtgtaaggcc tgttcgaaaa tgatgtcgaa cgctaatttt 420 cgtcaggcaa tatcgtccac caacaatgtc atcagtttaa tgaccgacga atactccaaa 480 gcactggatg aattgaggca ggaaattgca aataacacct tcaaaatcaa cactattcta 540 caacgwaccc cactaccatc aacaccacat atacctcgat cgcaactttc agtaccgtcg 600 agaaaacgtc ctcggctaat tgccgatgat tcatctcacc acgataacac atccgtcggt 660 actagggagc ttgcccctga cgaatccatc ccgctagccg cacctcgaaa agctgatcaa 720 ttttggttgt atttgtccgg cttcgaccca caagctactg tgccacagat agaaaagttg 780 gtgaagaaca atctcaacac tgacaaatcg tttgacattg tcaaattagt gccgaaaggg 840 aaaactttgg atgaacttac gttcgtttct ttcaaagttg gcatggattt gcagttgaaa 900 gacaccgcac tatccggctc atcgtggcag aagggaatta tcttccgtca gttcgatttt 960 aatcactcca catccactcg tcaaacgttt cgcttccgac cttcaccaag agacgaccat 1020 tcacaacaat caccaaacat cacatcgtac catcaatgaa cgatcctccg cacagctccg 1080 aaacacacaa cacactacca ctccacggga aacgattact ctgtactatc agaacgtccg 1140 cggacttcga accaaaacaa acgacgtttt cttagcattg tctgaaagcg actatgatat 1200 aattgctttc acggaaactt ggctcaacaa cgatattcac aactcagaac tcacacacaa 1260 ctttacgatc tatcgttgtg atagaaatgc cagcaacagc tgttttcagc gtggtggtgg 1320 tgtgctgatc ggcgtaagaa attgtttaca aagtacgtct ctcactattg cggaatgtga 1380 atgtttagag caaatcgttg ttcgcatcac actaccagat ttcgatttgt ttgtttgtgc 1440 gatctaccta ccacccaaca gcgataccat tctatacgaa cagcactctg cctgtgttca 1500 acatttgctg ggtctcgccg gcgatcatgg tcgtgtggtg gttatcggtg actacaatct 1560 cccccacttg tgttggatgt ttgatgagga aattagctct ttcttgccgg tgaacgcttc 1620 atctgagcaa gaattagcct tgatcgagtc agtggttggc tgcggtctac atcaaattaa 1680 tgacttgact aacgctaatc ggaggcttct cgatctggcg tttgttaacg atgataaata 1740 tgtggagcta ttggagccac cgcaaccgtt gatgaaaatg gacaaccacc acaaaccctt 1800 tgttttgaaa tttgaaagta ccatgcgtag cgagtgtgac gctttggacc ccttcgattt 1860 tgatttcaaa caatgtgact tctcatcgct caatgaacaa atagctgcag ttgaatggaa 1920 tgagtacctt aatggatgtc atattgatga agcagtggca acattttatg atcaggtgta 1980 tggaattctt cgtgagactg ttccgttgaa ccagagacgt cgaggtatcg gaaaaaggca 2040 accctggtgg aataatgaac tacaacgtct acgtaatcga ctgcgcaagg ctaggaaacg 2100 ttattttcgt tcgaaaaatg aacagcacaa agttgaactg cgtgacttag agagcgaata 2160 caactctttg catgcacgat gctttagctg ttacattcgt cgcacggagg aaaatattaa 2220 gcatgatccg aaatcgtttt ggtcgttcgt gagaagtaga aagcaaacaa atagcattcc 2280 tcaacatgtt ttttacaagg acgtgaacgc cgaaacacca tctgattcgg caaatctgtt 2340 ctcgtcattc tttcgaagtg tgctaagtaa taaccgatca ccttcgtctg aagagtacct 2400 gagcagtttg ccacgatacg atttgaacgt tgctgttttc aacttttcga cgagggatgt 2460 gtgctgcaag ctgcgaggaa tagacgactc aaagaccgct ggacctgacc gccttcctcc 2520 tctcttcgtc aaaaactgtg ctgactcgct tactgttcct gcgactattc tcttcaaccg 2580 ttcattgtcc gaaggaacat tcccgtgcgt ttggaaaaat gctgctatta ctcccataca 2640 caaatcgggt aatatgaaca acgtcgaaaa ctaccgtggc atctccattc tgagttgtct 2700 gccaaaagct tttgaaagcc tcattcacga ttccctctac cctcacgtgc aacacatcat 2760 ctctgagttc caacatggct ttgtgaaaaa acggtcaaca acttcaaatt taatgtcgta 2820 tgtgtcttcg ttggtgaaag cgatggagaa gcgccaacaa atcgacgcgg tctatattga 2880 ctttacgaaa gcatttgata gagtaccgca ctcactgtcg gtgaaaaaat tagataaaat 2940 gggtctgccc tcttggttga cacgatggat actctcgtac ctcaccgaac gcagcgctca 3000 tgtacgtatt gctggtataa gttccgatcc ctttgaaatt acatcaggcg tcccccaggg 3060 gagccacctg ggtccgctgt tgtttgtgct gtttgtcaac gacctttgcg aagttattaa 3120 atcccaaaaa ctaatgttcg cggacgattt gaaattattt cgtgtggttt catcgttgac 3180 tgattgttgc gctgtgcaag ctgatattga cgccttggtg agatggtgca tgctgaatgg 3240 aatggaagtg aatgttcaaa aatgttgcgt catcactttt agccgaaaga ggaatgtgat 3300 tgacttcgga tacagaatgt cggcaactaa tatcaccaga gtcaacaccg taaaggacct 3360 cggggttcta ttggacagca aattgaattt cgctcagcac atagctgcaa caactgccaa 3420 ggcatatgcg gtacttggat ttatcaaacg caacactcag gagtttgaag atgtctactg 3480 cctgaagtct ctctactgct cgttagtacg aagtattctg gaatacggcg tacttgtttg 3540 ggcgccatac cataccaccc aatccaaccg gattgaacga attcagcgca actttattcg 3600 ttacgcgctc cgtcggctac catggactga tccagttcga cttccaccgt acgagcatcg 3660 atgcgccttg gttcatctag aaactttggc ttgcaggagg gttttgcttc aacgcttgtt 3720 cgtgtttgac atcttaagta acaacgtaga ctgctcagtg ctgttgacca acatcaattt 3780 aaatgtgccc gctagaacta ctcgacaagc tgattttctc cggttccctg tgcatcgtac 3840 tttgtttgga cagaacaatc cgttctatgt ttgttgtcgt cgttttaacg aagtatctga 3900 cagatttgat tttaatatgt ctaagttaac attcaaaaga agtattagtt cataagtatc 3960 agtctgtgcg acttgttcga agacatgtaa taaataaata aataa 4005 // ID Copia-24_SI-LTR repbase; DNA; INV; 305 BP. XX AC AEAQ01023605; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-24_SI_; KW Copia-24_SI-I; Copia-24_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023605; Positions 800 496. XX SQ Sequence 305 BP; 78 A; 79 C; 55 G; 93 T; 0 other; tgttaaaata gctctcacat ttatttcctt tatcttacta accaggcgca ttttgtaatt 60 cgcggtagtt atacatcgac agaccgatct atgatcggta gtatgcccgt acggcggcgc 120 ttcggttctg cgcaaccaga aaaggcgctg aatgtctctc ccccttctct tcttactctc 180 ttgcgtgcgt tcacgctaac gtcactttcg ttgtatgtat atccaaccga aaggtacatt 240 ataataaaac agtgttcaag atacaaagaa ggcgttagat ccattcacca catccacctt 300 gtcca 305 // ID CR1-42_HM repbase; DNA; INV; 4938 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-42_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4938 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1870-1870 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1084..4353 FT /product="CR1-42_HM_1p" FT /translation="MKNKTLCIFCFKTVTDNHKAIYCDCCSSWVHAKCNQI FT DLLTYNLLSEDSSEWYCFNCISKNMPFGLLTDNELLTTLSCKNPQTASSNF FT LVSPIHSRNILKELNTISGSAINCKFYDVVDFNKAIKPNSDIYLHLNIASL FT PYHIDDLCSLISTLNIIPMAIGITESNLYLNDTNITDITINGFNIEHCPTE FT AKKGGALLYLQSNLNYVVRHDLKIYLPKLLESVFVEIIKPHKTNVLIGCIY FT RHPSMNQNEFNTTFLTPLLEKLSKEQKLIFLMGDFNVDLLSYSESNPTSNY FT LDTLCSYSFNPSIILPTRITTKSQTLIDNIFMNFYSSDHLSGNLTISISDH FT TAQFVCVPGTSNQPNKVKISRCLKRCLKKFDPNKFIIEISKINWELIIKKD FT DNINESINNFLKSFNRVLDQCAPYKTLTKNQIKLKSKPWISKGLLKSISVK FT NKLYKKFVRCKNINTKNELFTKFKFYRNKISNLLKSSKKSYYITYFNNNIN FT NIKNTWKGIKDIINIKGSLHKTPTQLNLNENIITDHATVSNIFNNYFVSIR FT EKLLNNIMPSEYTYSDFLNFPNANSFFIDPVSEDEVSSLISNTLKNDKSYG FT PNSIPTFFLKLTSHIISKPLSIMINNSFKNGKFPDVFKVAQVIPIYKAGSL FT LDFSNYRPISLLSNLSKLFEKAMHNRLYKFLDKFKCLYQHQYGFRSKHSTT FT LALIEITEKIRRALDNKHFACGVFVDLQKAFDTVDHSILLTKLEYYGIRGI FT PLKWFTSYLKNRTQFVSINSTKSDLQNCSNGVPQGSVLGPLLFLLFINDLN FT TSFKFATAYHFADDTNMLLVDKSLKKINKHINHDLGNLVQWLRSNKLSLNS FT NKTELIIFKSNLAPIKKQLNFRLSGQKIYPLDSIKYLGIKIDSNLSFQGHL FT NDLAMKLSRSNGMLAKVRHYVNLETLLNIYHAIFGSHLRYACQVWGQSHKQ FT IILRLSHLQNKALKIIYFQYNNSKSNVLYLISKILKISDLVVFLNCLFVWN FT YHHKNLPLSFNNFFSITNCHYPLRSISSLNLVIPKHQSAKYGNKSIKYQCI FT SSWNKLPPDLKSLNSISEFKIKLFNYLLKKYN*" XX SQ Sequence 4938 BP; 1703 A; 831 C; 517 G; 1883 T; 4 other; tgcgcgatga aggaaagatg gcggtctwaa ttctgagttg aagtttggta tacttttatt 60 ttawcgagtc gaaatttata tttatccatc tattattaaa tcaatgaaaa agataaaaaa 120 ttaataaagt aattagtatc aaaataattt attcgtgatc aatattaata caaagtaacc 180 ctaaagtagc aaacaaaaaa acgagccata tttcattatc catcaaacaa gaagaaaaga 240 agcaagaaga aaagtataac ataaataaaa tgaggagaag aaccgaaaaa aaaaatagca 300 cgaatctatt ggaaagcata agatttaaag atacttaaac ctccttattt ttttcttttg 360 atgatttttg gttagccaca tttgtttatt atttttctgg actatttatc aatatttgta 420 ggttatttat tttcattact ttatagaatt ataattatat ctaaatagct aatatatctg 480 taatataatt tattcatata tataaaatat tattatatac ctatataact atctatatat 540 ctatatagct tttattagtg attatataaa tgtcatataa cttaatcttt cagtttatgt 600 taattatata tttacataac tatcattatt tacataacta tcactattta cataactatc 660 attatttatc atttgaactt tatttattat ttgtacattt ctcttcattt attcttttct 720 ttgtgcttgt tttaacattt tatttgtata tttattttta tttattcttt tcttcatgat 780 tattttaaca ttcttcccaa gtatattatt ttattattat attggcttgt tagtgttttt 840 gttagttgtc aacatctatt taatctcttc aacatcatct acttgatttt gtataaccta 900 cctgtactgt gttatatata ctgttcttat gcaaatccat tttgcctcac aattactctc 960 ttattatatt attatattat tataagttta tcttatttat tatatcctta gtcactcctt 1020 tttttttcat caccttctgt gtttatcaat aataaaacat aataacattc tgcctatcaa 1080 gttatgaaaa ataaaacact ctgtattttt tgttttaaaa ctgtaacaga taaccacaaa 1140 gccatctatt gtgactgttg cagttcatgg gttcatgcta agtgcaacca aatagatttg 1200 ctcacttata atcttctctc tgaagactct tcagaatggt actgttttaa ttgtatatct 1260 aaaaacatgc catttggtct gcttactgat aatgagctct taacaacatt atcatgcaaa 1320 aatccgcaaa ctgcatcatc aaattttctt gtatctccaa ttcattcaag aaacatactt 1380 aaagaactta atactatctc tggttctgct ataaattgta aattctatga tgttgttgac 1440 tttaataaag ctatcaaacc taattcagat atatatcttc accttaacat tgcatcactt 1500 ccctatcaca ttgatgacct gtgcagtctt atcagcacat taaatatcat acctatggca 1560 atagggatta ctgaatccaa cttatacctt aatgatacca atataacaga cataactatt 1620 aatggattca atattgagca ttgccccact gaggccaaaa aaggtggtgc tcttttatat 1680 ctgcagtcta atttaaatta tgtagttcgt catgacttaa agatttactt accaaaatta 1740 ctagagtctg tttttgttga aatcattaag ccacacaaaa caaatgtcct tattggttgt 1800 atatatcgtc atccatcaat gaatcaaaat gaattcaata caacattttt aactccactt 1860 cttgaaaagt taagcaaaga acaaaaattg atctttttaa tgggtgattt caatgtagat 1920 ctactgagct atagtgaatc taatcctact tccaactacc ttgatacctt atgttcttac 1980 tcatttaatc cttccataat cttacctacc cgtataacaa ctaaatctca aacacttatt 2040 gataacatct ttatgaactt ctactcttct gatcatctct caggtaacct aactatttct 2100 atctcggatc acacagcaca atttgtttgt gttcctggca catccaacca accaaataaa 2160 gttaaaattt ctagatgttt aaaaagatgt ttaaaaaaat ttgaccctaa caagtttatt 2220 atagaaattt ctaaaatcaa ctgggagctt ataattaaaa aagatgacaa tattaatgaa 2280 tccatcaata attttttaaa atcatttaat agggttctag accaatgcgc cccctataaa 2340 actttgacta aaaatcaaat caaacttaaa tctaaaccct ggatmtctaa aggcttactt 2400 aaatctatct ctgtaaaaaa caaactctat aaaaaatttg taagatgtaa aaacattaat 2460 actaaaaatg agcttttcac aaaatttaaa ttctatagaa ataaaattag caacttatta 2520 aaaagtagta aaaaatcata ctacataact tattttaata acaacataaa caacataaaa 2580 aatacatgga aaggcataaa agatatcata aatattaaag gatcattaca taaaacacct 2640 actcagttaa atctgaatga aaatattatc acagaccatg ccactgtttc aaatatattt 2700 aataactact ttgtctctat tcgtgaaaaa ctactcaata atattatgcc atcagaatat 2760 acttatagtg actttcttaa ttttcctaat gctaattctt tctttattga tcctgttagt 2820 gaagatgagg tatctagtct tattagtaat actcttaaaa atgataaaag ttatggacct 2880 aacagcatac ctacattttt tcttaaactt acctctcaca taatctccaa acctcttagc 2940 attatgataa ataattcttt taaaaatggc aaattccctg atgtttttaa agtagctcaa 3000 gtcattccaa tttataaagc aggctcctta ctagactttt ctaactatcg acccatttcc 3060 cttctttcca acctaagcaa actttttgag aaggctatgc ataacagact atacaagttt 3120 ctagataaat ttaaatgctt gtaccaacat caatatggat ttcgaagcaa acattctaca 3180 acccttgcac ttattgaaat tactgagaag attagaagag ctctagataa taaacacttt 3240 gcatgtgggg tgtttgtaga tttgcagaaa gcttttgata ctgtggatca ctccatcctt 3300 ctcacaaaat tagaatacta cggaatcaga ggcatcccac taaaatggtt tacttcatat 3360 ctcaaaaata gaacacagtt tgtttccatt aatagcacta aatctgattt acagaattgc 3420 tcaaatggcg tcccacaagg ttcagtttta ggaccacttc tttttcttct ttttattaat 3480 gacttgaata cttcttttaa atttgcaact gcttaccatt ttgctgatga tactaacatg 3540 ctactggttg acaaatcact taaaaaaatc aacaaacata ttaatcatga tcttggaaat 3600 cttgttcagt ggcttcgttc taataaactt tctcttaatt ctaacaaaac agagcttatc 3660 atctttaaat ccaatctggc accaattaaa aagcaattaa atttcagact aagtggacaa 3720 aaaatttatc ctttagactc tattaaatat cttgggatta aaattgattc taatctttct 3780 tttcaaggcc atctgaatga cttagcaatg aaactcagca gatctaatgg aatgctagcc 3840 aaagtccgcc attatgtaaa tctagaaact ctgctaaata tatatcatgc tatttttggg 3900 tcccatctaa gatatgcatg tcaagtatgg ggacaatccc ataaacaaat cattttaagg 3960 ttgtcccatc tccaaaataa agccttgaaa ataatatatt ttcagtataa taactctaaa 4020 tctaatgtac tatacctaat ctccaaaata ttaaaaataa gcgaccttgt ggtattcctc 4080 aactgcctat ttgtctggaa ttatcatcac aaaaaccttc ccctatcatt caacaatttc 4140 ttttcaatta caaattgtca ctatcctctt cgctccatca gtagcttaaa cttagttatt 4200 ccgaaacacc aatctgctaa atatggaaat aaatccatta aatatcaatg tattagctca 4260 tggaacaaac tacctcctga tctaaaatct cttaattcaa tctctgaatt caaaatcaag 4320 ctttttaatt atctgttaaa aaaatacaat taaatatttc taccaactaa tattgacatt 4380 ctttactata tatgtcaatt cgtaatcctt cagtttttaa tttgatattt ttaacatttt 4440 gaaattctct acttatctat ctacatctaa tttctctggt tttctatctg ccatttttaa 4500 actttattat taaactatct atactaatat tattaatttt agtaccaatg ggttggcatt 4560 ctttatttat ctatctattt atctcttttc attcaaaaat tctctatttc tcaactattc 4620 tacttcgtac taaatctttg ttgttgccat cactctttac taaatttttg ttgctgctgt 4680 tattgtcttt atttatgaaa ttcttctctt attataattg ttattatttc gttactatta 4740 ttattattgt tataattatt atttttgttg ttgttgttgc tattattatt cttatcacaa 4800 tcattgttat tattgttatt attgttatta ttattttayt gttattattg ttattactat 4860 tattattgtt attattgtta ttactattat tgttattatt gctattatta gtattattaa 4920 taacagttat tattatta 4938 // ID Gypsy-6_AC-LTR repbase; DNA; INV; 204 BP. XX AC AASC02015807; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_AC_; KW Gypsy-6_AC-I; Gypsy-6_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-204 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02015807; Positions 94798 95001. XX SQ Sequence 204 BP; 60 A; 44 C; 31 G; 69 T; 0 other; tgtggaataa tgtacatcac acaactcttc tgttataact gttaggtacg tatcgggatt 60 tagacagtac ctttcttcta cttagtagtc acttttcggt tacaatgtac tagaggacta 120 taataaaacc aaagagcctt ttcagacgtg cgtttcttca ctctctctat atctgtttgt 180 acccacaaac cgaacaatat taca 204 // ID CR1-105_AAe repbase; DNA; INV; 4962 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-105_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4962 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1193-1193 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 360..1181 FT /product="CR1-105_AAe_1p" FT /translation="MSKTCGKCSEAINGIDFVICRGYCGAFFHINDCSGVT FT RALQSYFTSNKNLFWMCDECAELFGNSHFRVISNQADEKSPLTSLVSAITE FT LRTEIKQLNVKPTAHASPAANIRWPAIDQRRGTKRPRVHETNAPQLEACRV FT GSKKTQDNVASVPISKADVEPRFWLYLSKICPDVTVEAVRAMTKANLNIEN FT DPIVVKLVPKGKAIESLSFVSFKIGLDPSLKNKXLDPETWPEGLLFREFED FT YGAPKFRVPLRINVQSTPLGTNLSPITPVMDLS" FT CDS 1073..4876 FT /product="CR1-105_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="IRRLWSPKISRSSEDQCAINTARNKFISNHSRHGFEL FT SFNPGRNNLSVLEALYPPITVPFFPPAFVSRPGPVFGCGERVFRTECQGEY FT HQIQSNAISVTSNAFSXTVPTHTSTNATSTPECTPARLMEAPDPFVTVEPL FT LPASCSHSGPVYDFGKGVFQPVLTGKYASTLNICXPVTSVGSRNFSNDNSS FT NSQRLFDVFANRDQTVSSSALLSPPGCTPASIVEAPNPLITVEPLLPAFSS FT HPGPVYELGDGVFRTAFAGKYACNSNNSLPVTPTVSSNVTDSFACSVHHST FT VSSASPVCSIAEATQNRATSRSVHRSISVYYQNVRGLRTKLVHLRLLLTSC FT DYDVLVFTETWLRDDIESAEISSDYTLFRCDRSPSTSQYSRGGGVLIAVKN FT LLHCEAIPLSNSEELEQIAVCVKSQFRSLYIVAVYLPPNTNADLYSAHANS FT VQSIVNRTSGRDIILSLGDFNLPNLQWVLDDDINGYIPTNISSEQEQCLIE FT SLFSSGLRQVNNVMNTRGRLLDLVFVSLPEFLDLTEPPVPILPIDTHHMPL FT VLLFDVSDENSIEFESRDDDLRYDFTTCDFNQLKEAFADTDWNSQLQGANV FT DESVSLFYDKIFAVCNSSVPLKRRVVNSMFNKPWWTSELRHFRNVLRKARH FT RFFLTKSENDRINLREVEISYKTLLLSTYEAYLTKVQDNVKQNPSRFWDFI FT KKQKSSARVPNNVTYHDLHASSNVEAANLFASFFESVFSKVSPVWRRDSFA FT HIPSYNIRLPMIQFSPEEVLKAIDDLDIKKGPGTDGIPPVLLKKCSAELSI FT PIAYLFNRSLSERTFPMFWKTASVVPIHKSGNMSRVENYRGVSILCCLSKV FT LEKLMHGVLYTIASPIICDSQHGFMKHRSTTSNLMCYVTNLSREIEGRRQV FT DAVYIDFAKAFDTVPHALVTEKLNRIGFPNWISEWICSYLSGRTAHVVVNS FT ASSRSFSITSGVPQGSVLGPLLFNIFVNDLCYLLSSFKLSFADDLKLFRVI FT RSFHDCAALQEDINVMLTWCSNNGMRVNCKKCKVISFTRSNSPINHQYRMN FT TDILQRVTSICDLGVIVDEKLTFKEHVKTTSAKAFSVLGFIRRHAAEFTDV FT YALKVLYCALVRSILEYASPIWSPYHVTDVLSIERVQKKFVRFALRLLPWN FT DPNNLPPYAERCQLIGLEPLSCRRERSQRLLVFDIITNAIDCPVLLEQTPL FT YIPPRSFRNSPLLAVPYHRTNYGYNNPLDSCIRSFNEICNEFDFDISKNLF FT KNRISRP" XX SQ Sequence 4962 BP; 1324 A; 1205 C; 1005 G; 1423 T; 5 other; attctggcat cactgctatg ttcatgtatg caattaattc ggtcgtgctt ttttattgat 60 tttctttgtg aaattccgag ttttaaatca tcgttttgtt tgccgttacc ttatgtgaag 120 tgaattcgtg gttcctcatt gtgatcgtgt gaaattttag catattcgcg atagtttgtt 180 tttgtttgat ctgttaaacc acaagagcct tctaccatcc acgccgatct tttccaactg 240 gagcgacatc tgctggtcaa cacaaaaaac tgctgttcga ttttgaagca tctgagcttt 300 cgcctgttcg gaggaatatc gaggagtaca ccagattatc tacccgtagg cgcttcacca 360 tgtcgaaaac ctgtggaaag tgctcagaag ccataaacgg tatcgatttt gtaatctgcc 420 gtggatactg tggagctttc ttccacatca atgactgctc tggagttact cgtgctttgc 480 aatcatattt tacatccaac aagaatctgt tctggatgtg tgatgagtgt gcagagctgt 540 tcggaaattc tcactttcgt gtcatttcaa atcaagcaga tgagaaatca ccccttacwt 600 cgcttgtttc tgccatcact gaactgcgca ctgaaataaa gcagcttaat gtgaagccca 660 cagctcatgc atcaccagca gctaatatac gttggcctgc tattgatcag cgtagaggca 720 caaagcgtcc tcgagtgcat gaaactaatg cgcctcaatt ggaagcttgt cgcgtaggca 780 gcaaaaaaac gcaagataat gtcgcttcgg ttccaattag caaagccgat gtggaaccaa 840 gattctggct ttatctctcc aagatttgcc cagatgtcac cgtggaagct gttcgkgcaa 900 tgacaaaagc caacctgaat atcgaaaatg atccaatcgt ggtgaaactt gtgccaaaag 960 gtaaagctat cgaatcactt tccttcgtct cgttcaaaat tggactagac ccttcgctca 1020 agaataaakc gcttgatcca gaaacatggc ctgagggcct actgtttcgt gaattcgaag 1080 actatggagc cccaaaattt cgcgttcctc tgaggatcaa tgtgcaatca acaccgctag 1140 gaacaaattt atctccaatc actcccgtca tggatttgag ctgagtttca acccaggacg 1200 caacaattta agtgttttgg aagccctgta tccccccatc acagtcccgt ttttcccgcc 1260 agcgttcgtc agtcgtcctg gtcctgtgtt tgggtgtgga gaaagggtct tccgaaccga 1320 atgtcaaggc gagtatcatc aaattcaatc aaatgccata tcagttactt caaacgcttt 1380 tagtwcgact gttccaaccc acacttcaac taatgctacg tcaacaccgg aatgcacgcc 1440 tgcacgcctt atggaagccc ctgatccttt cgtcacagtc gagccactcc tgccagcgtc 1500 ctgcagtcat tccggtcctg tgtacgattt cggaaagggg gtcttccaac ccgtcctcac 1560 aggcaagtat gcatcaacac tgaacatctg csctccagta acttctgttg gttccagaaa 1620 tttttctaac gacaacagct ccaattctca acgcttattc gacgtattcg ccaaccgtga 1680 tcagacagta tcatcttcag cacttctctc gccaccggga tgcacgcctg ctagtatcgt 1740 ggaagcccct aatcctctca tcacagtcga gccactcctg ccagcgttca gcagtcatcc 1800 cggtcctgtg tatgagcttg gtgacggggt cttccgaacc gcattcgcag gcaagtatgc 1860 atgtaattcg aacaattctc ttccggtaac gcctaccgtt tccagcaacg taactgattc 1920 gttcgcgtgc tctgttcatc attccacggt atcgtcggca agccctgttt gttccattgc 1980 cgaagcaact caaaatcgtg caacatcgag gtccgtacat cgttctatct cggtgtacta 2040 tcaaaacgtc agaggattgc gcaccaaact tgtccacctg cgactcctgc taaccagctg 2100 cgactatgac gtgcttgttt ttaccgagac gtggttgcgt gatgatattg aaagtgctga 2160 aatctcatct gactatacgc tttttcgctg cgaccgtagt ccatcaacta gtcaatattc 2220 acgcggtggt ggcgtattaa ttgctgtgaa aaatctcctt cactgtgagg cgattccatt 2280 gtccaacagc gaagaattgg aacagatagc agtctgtgtg aaatcgcaat ttcgttcgct 2340 ctacattgtt gctgtttatc tccctccaaa cacaaatgcc gatttgtatt ctgctcatgc 2400 aaattcagtg caatcaatcg tgaatcgcac atcgggcaga gacatcattt tgtccttagg 2460 tgactttaac ctccctaatt tgcaatgggt tttggacgac gacataaatg gttatattcc 2520 aacgaacatt tcttccgaac aagaacaatg tctcattgaa tccttgttct catctggctt 2580 acgacaagtc aacaatgtaa tgaatacgcg cggtagactt cttgatcttg tgttcgtaag 2640 tctaccagag tttttggatt tgacagaacc tcctgtacca attttaccta ttgacactca 2700 tcacatgccg cttgttttac tatttgacgt gagcgacgag aattcgatcg agtttgaaag 2760 ccgtgacgac gatctcagat acgattttac cacctgtgac ttcaaccaac taaaagaagc 2820 atttgcagat accgattgga actcgcaact gcaaggcgct aatgtagatg aatcggtttc 2880 gttattttac gataagattt tcgcagtatg caattcttct gtccctctca aacgaagggt 2940 cgtcaactct atgttcaaca aaccgtggtg gacctctgaa ctgcgtcatt ttcgaaacgt 3000 gcttcgaaag gctcgtcacc gtttttttct caccaaatct gaaaacgaca gaattaatct 3060 acgtgaagtg gaaatatcct acaaaacatt actgctatcg acttatgaag cttacttaac 3120 caaggttcag gacaatgtta agcaaaatcc atctcgtttc tgggatttca tcaaaaagca 3180 aaaatcatct gctcgcgttc ctaacaacgt aacttaccac gatcttcatg caagctcaaa 3240 cgtggaagcc gccaatcttt tcgcctcgtt ttttgaaagt gtgttcagta aggtctctcc 3300 cgtatggcgt cgtgacagct ttgctcatat tccgtcgtat aacatccgcc ttcctatgat 3360 ccagttctct cctgaagaag tattgaaagc catcgatgat cttgacatca aaaaagggcc 3420 ggggacagac ggtattccac cggtgctctt gaaaaaatgc tctgcggaac tatcgattcc 3480 tattgcgtat ttgttcaatc gatcgctcag tgaaagaacg tttccaatgt tttggaagac 3540 agcatctgtt gtgcccatcc ataagtctgg aaacatgagt cgcgttgaaa attatcgtgg 3600 cgtctctatt ttgtgctgtt taagcaaagt tcttgaaaag ctaatgcatg gagtactgta 3660 tacaatcgcc tcacctataa tctgcgacag tcaacatggc ttcatgaagc atcgatcgac 3720 aacatccaat ctaatgtgtt atgttaccaa cttatcacgt gaaattgaag ggaggaggca 3780 agtcgacgcg gtgtacatag attttgccaa agccttcgat actgttccac acgctttagt 3840 cactgagaag ctgaaccgca tcggcttccc gaactggatt tctgaatgga tttgctcgta 3900 tctctctgga cgtaccgcac acgtagtagt taactcagca agctcccgat cattcagcat 3960 tacgtctggc gtgccccaag gcagcgttct tggaccatta ctgtttaaca tattcgtaaa 4020 cgatctgtgc tatctccttt cctcgttcaa gctgtcgttc gccgacgact taaaactttt 4080 ccgagtcata cgttcgttcc acgattgtgc ggcattgcaa gaggatatca atgtaatgtt 4140 aacctggtgt agcaataacg gtatgcgcgt gaactgcaaa aaatgcaagg ttatatcctt 4200 cacccgttca aactccccaa tcaaccacca atatcgcatg aacaccgata tactgcagcg 4260 agtaacatcc atatgtgact tgggagttat cgttgacgaa aaactaacgt tcaaggaaca 4320 tgtgaagaca acgagtgcca aagcattctc agtgctagga tttatacgtc gacatgctgc 4380 ggagtttacc gatgtatacg cattgaaagt gctatattgt gcgttagtac gtagcatctt 4440 agagtacgcc tccccgatat ggtccccata tcatgttacg gatgtactct caatcgaacg 4500 tgtccaaaag aaattcgtac gttttgcatt gcgactgctt ccttggaacg atccaaacaa 4560 tcttcctcca tatgctgaac gctgtcaact gataggatta gagccactat cttgcagacg 4620 tgagagaagt caacggttgc tagttttcga catcatcaca aatgcgattg attgtcccgt 4680 gcttcttgag caaacaccgc tttacatccc tcctcgtagt ttccgaaact ctccattact 4740 agctgttcca tatcaccgga caaattacgg ctacaacaac ccgcttgact cctgcattcg 4800 gtcgtttaac gagatttgta acgaattcga ttttgatata tccaaaaact tgttcaaaaa 4860 ccgtataagt aggccatagg aaatttagat atagttagtt cagtctgtac gacttacagt 4920 cgaagacggt gaaataaata aataaataaa taaataaata aa 4962 // ID Zator-1_Lgigantea repbase; DNA; INV; 3221 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Zator; DNA transposon; Transposable Element; Zator-1_Lgigantea. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-3221 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3221 BP; 1050 A; 546 C; 600 G; 1025 T; 0 other; ggcgacacca atttataaac ttggattacg ggcccgccga cctgcttttt tgacgaatcc 60 caataaaaat aaaaatgaaa aaatctattt tttttttatt cttactgtgt ccgccgatgt 120 aaaaatttca acacgaaatc ctaaagggac gtaacttggt tgataagttg tactcgctta 180 gcgaaagcgt tcactgttga atggtcagtg ttaatgagac aaaaggctag ttccacgatg 240 tttttgataa cggtactaca aagcaagtac gactatgcaa atgaggtgcc tgcaaatttc 300 taaaataaat cttgaactga agctattttc attcgaaaga taaattggtt tgtaatatgt 360 tgattgttgg ctagatagtt ttaaaatttg ttgtattatg agattagttc tggaaatctg 420 gaaaagcctt tcaaaaaata aaataaacca acaacacatg atttgtcatt tggttcgttc 480 cagtctgcaa attttgtaaa aattctctca tcggccatta ttcaattgtt gtgatgaaaa 540 tacatctatg tgaaaatgtg agaaaatgag cattgaactc gatgtttata tcaacgtctc 600 catatgtacc gagggccatg taacaaaacg aactatagtg cgtgcctgat gacgaaaatg 660 ttggtaattt gttgacgcgc tccggatggc aggacacaga agaagtaacc gtctccagcg 720 ttaaagctgg atgaaataga ccatgaaact accgatgtag tgatttgact ggagattctt 780 ctgcatctga aactgaactg attaaaggtg tggaccaaag ggttgcagaa tttctcctca 840 gtactgaaga agaagaaata atctatgacc ttcgtaaatt aaatggagaa aatggtaaaa 900 caaaatttga tgcattttgg gaagagaccc aaaaatatat tgaagaaatg gtcccagcag 960 ttcatgaaag acgtcatggt gaacagttgt acttgcctta catttatcga tcgaagagct 1020 aagaaataca attcaaaaac gacttcctac ttcaactgat attccaagca atgagtggct 1080 tcgtctccag ttctggccaa aaaatgttac aagcttgcgt gctttacaga cacaggacga 1140 tttgatttaa agtttgttgt tcagcagaga ctgcttcggg caacccatcc agatgcaaaa 1200 tattgtgctg tacaatataa atatcttagg aactttgccg taaaactccg taacttttca 1260 aaattgattt atttggatga taaatcaatt gtacctgtag gctaaccaga cagacccatt 1320 tcaactggtg ttagagctca caacaaaagt cttgcttctc aaggattggt ttcccttgac 1380 catgacttcc atatccatgg tattattcca tctgtttgct tcatcagttc aataccggaa 1440 gatataagtg attcattttt tcaaggacag atttctgtta catgtaaaaa caaagtgttt 1500 gaggcatcag aagcaactag gcatggtaca gaaacattaa gagacgtgta tagtgaagat 1560 ggtgtgtatc ttgattcacc aattttgata tgctacacag acggtggccc tgatcacagg 1620 accacatatc cttctgtaca gctagcaagc atagctcttt ttctgcttct tgaccttgac 1680 atgtttgtaa cttgtcgtac agcaccatgt cagagttatg caaacttggc agaatgtgtt 1740 aatgccattt taaacgtgca ttacaaaatg tcgcattgtc aagagataaa atggacgact 1800 ctgtggaagc aagagttaaa catatatcaa ccatgaaaga tctaagatta gcttctgcca 1860 aaagtcaaaa tctgaaacaa gccatgatca agtctgttga accaattatt caaatgttaa 1920 tggagcgctt tcaaagattg aaattaaaaa atgagtcagt aattgcctgc aagtctgcta 1980 ctggtgatga aattgatacc tttattagtg cattaaaact gattgacaac aattttgatg 2040 tagaatcgct ttccaaaaaa tccattgaat cgtgtccaaa aatgattgat ttctttgata 2100 agcactgcag aaaaagacat tatatttttc agataaaaaa atgtagtgaa attgatgccc 2160 caaactgttg gtactgttca atgtctcctt ctcgtctaga caaggatgtg ttcaatgatt 2220 tagatttcat tcccgatcca gttttaaatg aaaataaaac tgaattttta aaatttgagg 2280 atgtgtatgg cacagttacc gatgattcag ctcggccttc attgaaactt agtggtgcat 2340 gtgagaggga caaacttcac aagacaatat tagtaactgc aaaagtgcgt tcaaaaatct 2400 attgtggtga gtgcaaaaag cccagatgca tgtattcaaa caaaaagctt gacttaaagg 2460 aaacaaaagc cataagttca atacaaaata gctctaatta tatctgtgga gattcgttat 2520 ttccagagga taatcagtgt tacgatacta ttattgtaaa agaaggtcta aggtgcaggg 2580 atccaatgga agtatcttat tatagtgcca agctagtgaa atttccagat tgttgttatt 2640 actgtgcatc agaagggcca cttgcaaatg acattgacat tcaacagtta aagttaaatt 2700 actctgttgt cagatctatc tgctgccaat gtaaagctac tggaaaggag ccagttgtaa 2760 ggggagccaa aaatgttaaa aaatatcttg aaagcatata caatataatg ctgtgtgaca 2820 attgacccaa acttttgatg gctatcttgt gtttaattta tgtgcatttt cgtataagta 2880 tgctacttgc actaatttat tttgcattta atatgtgtta tctttcacat tttaaaggct 2940 ttcggccaat tctctttcct gtatctaaga tatattttga gtgtattttg ttttaaaggc 3000 ataattacac acatgacata tcacctttaa tcttttcaat atattcattg aggctcaatt 3060 taagttgttt gtttccaaag gatattaata aagttttaaa tacattcctt tttcgtcttc 3120 ttgaaatgca gatttttttt ttatttttat ttttatcccc ccctgctccc tgcttttttc 3180 aaaagattgc ccgtaattca actaattaaa ttggtgtggc c 3221 // ID Gypsy-233_AA-I repbase; DNA; INV; 7189 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-233_AA_; KW Gypsy-233_AA-LTR; Gypsy-233_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7189 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1069-1069 (2011). XX DR [1] (Consensus) XX CC Positions [5188-5670] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 375..1352 FT /product="Gypsy-233_AA-I_3p" FT /translation="MDGYRINANELLEEELDYELSIRGHSTDDLLTAKRRT FT ARKLLRVLEDPNRVWTSSYDLQQELVETPIKLQEIENLLREGRHDGCLSRL FT VHYHKRIRRYVPQSTKQKDDQIRLMFVIGSVAEKYFDVDFCEATFKVSTSK FT VVPAQAQRPRELTPTTSQLAEYPTPPDRSSQVAPGIDPLMNPTELDGAVGG FT VLRPLSSTAVNSPHRDERAAASRSSEALRREAVEELDNRRRSAPGSLGNPF FT LEGFEDPPRSSGLGNFKQAPDPTLLSNAADCSVRAPVFQTDFLVRRSLGQI FT RSSQQSQHSVEYPPLARQSSGQSDPPMPQARPPQ" FT CDS 1452..2756 FT /product="Gypsy-233_AA-I_4p" FT /translation="MDEYVHTSEIETYVKAYLERIIRRPLDQQTDVSQLAD FT QIANVGLHDTEISMISRGVTGPRRTRFAEPQPPLQLSSQDQPYRSSPNPEI FT DRRTRYEGFSQIPRPPIDPRPPSRPEFNPQPQGYFRNPYDSRPFSDPVVFP FT NNSRRLPHQQCNIIEKWPKFSGDNNLIPVTDFLRQINILCRSYAITKEELR FT MHAHLLFKDSAYIWFTTYEEKFLSWEALEFNLKMRFDNPNRDRLIKEELRN FT RKQRPNELFSAFLTDIESLAQRMIHKMSEREKFDLIVENMKMSYKRRLALE FT PINSIEHLAQMCYRFDALEPNLYHIGGTSKPQVHQMVAESESEDDREADED FT EQVFAINRKADNFSYKSKGRNTDKSGTRTEMPTKSQSQTLCWNCNQLGHMW FT RECDKKKVIFCHICGHADTTAFRCPNRHNLGERNSDDEKNE" FT CDS join(3601..4896,4900..6033) FT /product="Gypsy-233_AA-I_1p" FT /translation="MQRLGVIEECPGPVDFLNPLLPIKKSNGKWRICLDSR FT RLNQSTKKDDYPFPNMMGILQRIQRSKYFSVIDLSESYYQVPLAKDAKDKT FT AFRTNKGLFRFTVMPFGLTNAPATMARLMARVIGHDLEPWVYVYLDDIIIV FT SNSFEEHLRLIRIVAERLTQAGLTINLTKSKFCQTKIKYLGYVLSEQGLSM FT DASKIQPVLDYPTPKCVKDIRRLLGLAGFYQKFIAKYSEIVTPISNLLKKD FT RRKFSWTQEADIALEKLKSALVSGPVLANPDFGLPFIIETDSSDLAIGAVL FT VQMHGEERKTIAYFSKKLSSTQRRYSATERECLAVLLSIEHFKHFVEGKPF FT VVSTDAMSLTFLQSMSIESKSPRIARWALKLAKYDILLQYKKGSENIPADA FT LSRAINSIDVTLEDPYISQLKDMIIKYPSLSGYPDFDIRITNGKIFKFVTN FT TTVPEDHGFRWKYLVPSAERKDIIRSTHEEAHLGFLKTLAKVREQYYWPRM FT ASDIKRFCHSCAVCKESKTPNTNVTPICGKPKLCSRPWEMISMDFLGPYPR FT SKKGNMWLLVVCDFFSKFVVVQCMHAATSSSVCTFVENLVFNLFGAPAVCI FT TDNAKVFTSDLFEQLLRRYQVTHWNLAVYHPSPNPTERVNRVIVTAIRCSL FT NQQKDHREWDKSVHQIAKAIRTSVHDSTGFSPFFINFGRNMVSSGSEYELL FT RESDHNPIQLSEDMKDLFAIVRKNLMKAYQRYSQPYNLRANKTHTFQKGEE FT VYKKNVHLSDKGHNFVGKLANKFSRVRITEVLGTNSYTLETLDGKRIPGTY FT HGSFLKRV" XX SQ Sequence 7189 BP; 2196 A; 1503 C; 1493 G; 1992 T; 5 other; ttttggcgcc caacgtgggg ccgtgaaata gattgcagta gttaaatttt tctagtattt 60 tttttctttg aattcgttaa attaaagtaa ccaatcgttg ccgtacacat tcgtagaatt 120 tagattgatt gaattaaatt tgagatatct tggtcaaata ggattagttc ttaagatttt 180 cgtagttgta agatctgaat tggtattgaa ttggttatat gaattgaagt ttcctagtgg 240 attggtaatt tctttataat ataataggtg tagattatta aattgaatta ttcgttgaag 300 ttcaaagtaa atcacgttta cgaatagaat tttaggacat ttttttcaaa ttttcaatta 360 aatttaatag cacaatggat ggctacagaa ttaatgctaa tgaactctta gaagaggagt 420 tagattatga gttgagtatc agaggtcatt cgactgatga tcttctcaca gcgaaacgtc 480 gaacagcacg aaaacttctt cgggtccttg aagatcctaa tcgtgtctgg acgtcttctt 540 atgatttgca gcaagagttg gtggaaacgc cgatcaagtt gcaggaaatt gaaaacctgt 600 tgcgtgaagg acgccatgac ggatgtttgt cccgtcttgt gcattatcac aagcggattc 660 ggcgatatgt gccgcagtcc acgaagcaaa aagacgatca gataaggttg atgtttgtca 720 ttggaagtgt tgcggagaaa tatttcgacg tagacttttg cgaggctacg ttcaaggtgt 780 cgacaagtaa agtggttcct gctcaggcgc aacgaccgag ggaattgaca ccaacgacga 840 gtcagttggc tgaatatccg acgccaccgg atcggtcgag ccaggtagct cccggaattg 900 atccgctaat gaacccaacg gaattagatg gagctgttgg aggagtcttg cgtccattat 960 cgtcaacagc ggttaactcg ccgcatcgag atgaaagagc tgctgcatcc cgcagttcmg 1020 aagcacttag acgtgaagca gttgaagagt tggataatcg acggcgttct gcacctggat 1080 cactgggaaa cccmttccta gaaggtttcg aagatccgcc acgatccagc ggcttgggaa 1140 atttcaagca agcacccgat ccgacactgt tgagtaatgc tgcagattgt tctgttcgtg 1200 ctccagtgtt ccaaacggat ttcttggtac gaagatcact tggccagatt cgttcaagtc 1260 aacagtccca gcattcggtc gagtatccgc cattagctcg acagagctca ggtcaatccg 1320 atcctccaat gccacaggca cgtccaccgc agcmtaattc gttcttgaat cctaatkttg 1380 gtccagaacc acgtccttct cgtccagctt cctcgtcgac cccgcgaaac cctgataata 1440 gtggtagaaa tatggatgag tatgtgcata cgtcggagat agaaacatat gtcaaagcat 1500 acctggagag aataatccgt cgccctttag atcaacaaac ggatgtgagt cagcttgcag 1560 atcaaattgc aaatgttggt ttgcatgata ctgaaatttc gatgatttca cgtggtgtta 1620 ctggcccgcg tcgaactcga tttgcagaac ctcagcctcc gttacaattg tcatctcaag 1680 accaaccgta tcgtagctct ccaaaccctg aaatagatag acgaactcgt tatgaaggct 1740 tttcacaaat tccacgtcca ccgattgatc cgagacctcc atctaggcca gaattcaacc 1800 ctcaacccca aggttatttc cgtaatccgt atgattccag accattctca gaccccgttg 1860 tcttccccaa taattcgagg cgattgccac accagcaatg caacataatt gaaaaatggc 1920 ctaaattcag tggagataat aatctcatcc ctgtaacgga tttcttgcgc cagattaaca 1980 tattatgcag atcctatgct attaccaaag aagagcttag gatgcatgct catttgctat 2040 ttaaagacag tgcatatatt tggttcacga cctacgagga aaaattcctg tcttgggaag 2100 cgttagagtt caacttgaag atgcgattcg ataatccgaa ccgtgatcgc ctaatcaagg 2160 aagaactaag aaaccgtaag caaaggccga atgaattgtt cagcgctttc ctaacggata 2220 ttgaaagttt ggctcagcga atgattcata aaatgtccga aagagaaaag tttgacctca 2280 tcgtcgagaa catgaaaatg agttataaac gacgcttggc tttagagccg attaactcaa 2340 ttgaacatct tgcacagatg tgttacagat tcgacgcctt agagccaaac ttgtatcata 2400 ttggcgggac gtccaaaccc caagtccacc aaatggtcgc agaaagcgaa tccgaagacg 2460 atagagaagc cgatgaagac gaacaggtgt tcgctattaa tcgaaaggct gacaatttct 2520 cttataagtc taaaggaaga aacacagaca aatctggaac cagaactgaa atgccgacca 2580 aatctcaatc ccaaacttta tgttggaatt gtaatcaact gggacacatg tggcgtgaat 2640 gcgataagaa gaaagtgatc ttctgtcaca tttgtggcca cgcagacacg actgctttcc 2700 ggtgcccaaa caggcataat ttgggtgaaa ggaatagcga cgatgaaaaa aacgagtaaa 2760 tgaggagctt tcagggaatc aagctacctc taatactagc tcatctccga ttcccaaacc 2820 caattttgac cacttcgacc ggatttgcac gatcaatact tccctacgaa aatgtcccca 2880 tttgaaggtt cgaatcctag aggaacaaat agaagcacta gctgataccg gtgctagttt 2940 gtctattatt agttcgcttg ctctaattga taaactaggt ttgaaaattc aaccctcgaa 3000 ccttaaagtg tctacagctg atggtactcc ctatagatgc atcggatatg taaatctgcc 3060 aatatcagtt gaatctgtaa ctcatgtcct gcccaccata gtagtccctg agttgaagaa 3120 ggatctgata ctaggcatgg atttcatgga aaaatttggt tatcgcttga ttggacctac 3180 taaggccgta cctcgaatca gtgaatcggt aaattcagtt gatcttctat ggatagaaga 3240 ccactttacc gatgacgacg aagttgtgtg ttttcaatta actcctgact catctgttgt 3300 tgaaacgmct cctgaacaac caatcgatga aagcttggaa attcccacaa tcgaaatgtc 3360 tgaatacagt ctccaaaatc cggctgattt agagcaccga acgattctct cgcagctacg 3420 gaacatcaag ccctattttc ggcagttcaa aaactacctg ccacgaaaga tggaacccta 3480 ggacgtacat cccttattga acacacgatt gaacttttgc ctgggtcgaa acccaaaagg 3540 tttccgtcat accgatggtc accttccgtt gaagaagtca tcgatgcaga agttcagcga 3600 atgcaacgat taggcgtgat cgaagaatgt ccagggccag tagatttctt aaatcctctt 3660 ctgccaatca agaaatcaaa tggaaaatgg cgtatatgtt tagactctcg tcgtctgaac 3720 cagagtacta aaaaggatga ttatccattc ccaaacatga tgggcatcct ccaaagaatc 3780 cagcgatcga agtactttag cgttatcgat ttatccgaat cgtattacca agtacctctc 3840 gcgaaagacg ccaaggataa aaccgcattc agaacaaata aaggactgtt tcgattcacg 3900 gttatgccat ttgggttaac gaatgcccca gcgaccatgg ccaggttaat ggcacgtgta 3960 atagggcatg atttagaacc ttgggtatac gtctacttgg acgatattat tatcgtatcc 4020 aactccttcg aggaacatct gcgactgata agaatcgtag cagaacggtt gacacaggct 4080 ggtctaacca tcaatctaac aaaaagcaag ttttgtcaaa caaaaatcaa gtatcttgga 4140 tacgtcttat cagagcaagg actgtctatg gacgcgagta aaattcagcc agtgttggat 4200 tatcctaccc cgaagtgcgt taaggacatt cgtagactcc tcgggttggc aggattttac 4260 caaaagttta tcgccaagta ctcggaaatc gtcaccccga ttagtaacct tcttaagaag 4320 gaccgtcgca agttttcttg gacacaagaa gcagacatcg cgttggaaaa gcttaaatcc 4380 gcattagttt ctggacctgt gttggctaat ccagattttg gtctaccttt tattattgaa 4440 acagacagct cagaccttgc cataggtgct gttctagttc aaatgcatgg cgaagagaga 4500 aaaactatag catacttctc caagaagctc tccagcactc agaggcgcta cagtgcgacc 4560 gaacgagagt gtttagctgt cttgcttagt atcgagcatt tcaaacattt tgtagagggc 4620 aaacctttcg ttgtctctac agacgcgatg agtctcacct tcttacaaag catgtcaata 4680 gaatcgaagt ctccacgcat cgcaagatgg gcgttgaaac tggctaagta cgatattctc 4740 ctacagtaca agaaaggatc ggagaacatc ccagccgacg ccttatctcg cgctataaac 4800 tctattgatg ttacgctcga agatccctac attagtcagc taaaggatat gatcattaaa 4860 tatccaagcc tgtcgggata cccagacttt gatatttaac gaataactaa cggcaaaata 4920 ttcaagttcg ttacgaacac tactgtccct gaagatcatg gatttcgttg gaaatatttg 4980 gttccctcgg cggaacgaaa ggatatcatt cgttcaacgc atgaagaggc ccatttaggt 5040 ttcttaaaaa cgctagcaaa agtacgtgag caatattatt ggcctcgtat ggcctcagat 5100 atcaaacggt tctgtcatag ctgtgccgtt tgcaaagagt ctaagacacc caatactaat 5160 gtcactccta tctgtggtaa acctaaacta tgctcccgac cctgggagat gatatcaatg 5220 gactttcttg gtccttaccc taggtccaag aagggaaata tgtggctatt agtagtctgt 5280 gatttctttt caaagttcgt tgtagtacaa tgtatgcacg ctgcaacgtc tagctccgtg 5340 tgtactttcg ttgaaaacct cgtcttcaat ctctttggtg cccctgctgt gtgcattacc 5400 gacaatgcga aggtatttac ctcggatttg tttgaacagc ttctccgaag atatcaagtc 5460 acgcattgga acttagcagt ctatcaccct agtccaaacc ccacagagag ggtaaataga 5520 gtcattgtta ctgcgatacg atgctctctc aatcaacaaa aggaccatcg tgaatgggac 5580 aaatctgtgc accaaatagc caaagccata cgcacctcag ttcatgacag tactggcttc 5640 tcgccctttt tcatcaattt tggaaggaac atggtcagct caggttctga gtatgaactt 5700 ctacgagagt ctgatcataa tcccatccag ctaagcgaag acatgaaaga cctgtttgct 5760 atcgtcagga agaatttgat gaaagcttat cagcgatatt cacaaccgta taatcttcga 5820 gcaaataaaa cacacacatt ccagaaagga gaagaagtat acaagaaaaa cgtacatctg 5880 tcggacaaag gacacaactt tgtagggaaa ttggcaaaca aatttagcag agtacgaatc 5940 acagaagttt tagggaccaa ctcatacaca ttagagacac tggacggaaa aagaatccca 6000 ggaacgtacc atggatcgtt tctaaaacgt gtataaaaga tcaagctatg acggtgttac 6060 tatagagcac acaaaaacaa ctagacaaat ctaatcaaaa tactctttag aggtttattc 6120 cggcgttcga gatgtcatca atatgtttcc tcacgtcgag agtcgtaaat cgttcatgca 6180 caggctatgt tgatgttcga gcccagctcc atcaaaccac ggcaattcgc ttccaaaata 6240 ctcctccgag gcattattcc atcgattgag atgtcaccta cgtgtttcct catcgaacta 6300 tccaccatcc aatccatttt acgaacctag aacaggtaaa taaacaaaaa ttagcatagg 6360 tttagagtaa aagtaaatcg tcagtaaact taccagtatc aatttcagta tgttgtaaat 6420 tacaccgctc ggttcacaat aatgccatac accactttag atgtagtttc aatagcccgg 6480 atccaattaa aacagcagaa ttaaagttaa ctttccactt ttgcacatta acgtttcact 6540 tttgccgacc actttgacag ttatttgttt tgctacatct ttactttctc tcactctctc 6600 agtttgatgt tttctctcgg tattctgaaa aggaaagctt tggacgcaaa gtgagtctag 6660 ggcgcgtttg ttatgagaat gagtgtttag caagtagggt ggttctaaaa ttaaagattt 6720 ccaattttta ttggagtaat tcggtttatt attgtaggcg atttcagatc ggaatctgag 6780 gtatagtttt ccaattatga ttggagttat tcggataaca ataaatttgg tttttcagat 6840 aaatctgaag taaattggtt agtttaagta aaacataatc agatatttct gatgttggtg 6900 tgaattaggt tttgtgtacg aagatggttc agtagaggaa tggaaatcaa aatggttcag 6960 aatatccaag aggaaacttc agtccgttaa taaatgaatt cgagtatatg taattgagat 7020 ttgggtttaa gaagtaattg tattaaaaaa tgacagtagt tgaattaaac atattttaac 7080 taaaatataa gtatgaccaa gaagtaaatt aacgaatatt agcaatcttt tccctataaa 7140 aatttttaaa ttttcaattt aaaaattttt ataattaagc atcggcgaa 7189 // ID Copia-18_CQ-LTR repbase; DNA; INV; 250 BP. XX AC AAWU01015423; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_CQ_; KW Copia-18_CQ-I; Copia-18_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-250 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 352-352 (2011). XX DR GenBank; AAWU01015423; Positions 65984 65735. XX SQ Sequence 250 BP; 52 A; 69 C; 54 G; 75 T; 0 other; tgttgaaagt aaacagtgtt cacctataca cctcctcctt ggcgcgccgt ttctgcacca 60 accttctgtt gcgttgcctc ggcgagtgca cggcatcatt cccgctcaaa acgttgtcgc 120 gtgcggttcg atttctcgct aaataaaata aaacgttctt ttgtaaaagt taaacgcgag 180 tgtgttttcc tttcgccgat tctcgtctcg tcgcggtaat ccgctgtgca ttccggacaa 240 aagtccgtca 250 // ID SIRE7_TC repbase; DNA; INV; 741 BP. XX AC AF227610; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Trypanosoma cruzi clone SIRE repeat region. XX KW SIRE7_TC; short interspersed repetitive element. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Vazquez M., Ben-Dov C., Lorenzi H., Moore T., Schijman A. RA and Levin J.M.; RT "The short interspersed repetitive element of Trypanosoma cruzi, RT SIRE, is part of VIPER, an unusual retroelement related to long RT terminal repeat retrotransposons."; RL Proc. Natl. Acad. Sci. U.S.A 97(5), 2128-2133 (2000). XX DR Genbank; AF227610; Positions 1 741. XX SQ Sequence 741 BP; 214 A; 95 C; 180 G; 171 T; 81 other; aatnaggaaa aagggggggg aagggggngn gcaaangnnn aaaaagagat gggggggaga 60 ngggtnaaat ttnatgaggg ggnancccca ggggcaaaaa aaaggaanan gnnttcnngg 120 ttnggaangn aggggggant tgtggagcnn gataacaatt ttnacaacga ggaaacaggn 180 tangananga tganggaann naaanggact caactatggg gaattnggcc cttggagncc 240 aaagaaanta ggnaacgang gctggcntcn aatnaatttt aaattctgat aataancgnn 300 ttcacttntn tanaanaatt ttttnttttt tatgaangtg ggnagaacan aaatttnagg 360 gggaacagag agcagnaagg gaataggcaa anatttttat ttttatttgg ccatnccacc 420 caaccccntt tgattnccan cangcggngg ggtnttgtgn ttcggnggac cccaaaagtt 480 tgccgcgttg nangtaaaat atccttcagn ttttaantga ggacaaaaga ccatgngaac 540 ggtnaattga atttattgat tggacaaaaa gagaacattg aggcaattaa cgacctttgc 600 ggcaacccca nttttttccc ctaccttttt ttggtttnta aaagaaattt naggattngg 660 ccagnggcac agaagattat ccagtntttt atttnttcca ttcatcctgc cgggagatga 720 atatttgngt gggtnccgtn a 741 // ID Gypsy9-I_Dya repbase; DNA; INV; 4457 BP. XX AC chr2R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9_Dya; KW Gypsy9-LTR_Dya; Gypsy9-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4457 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1073-1073 (2009). XX DR Genome; chr2R; Positions 1397572 1402028. XX CC Positions [1516-1941] - Reverse transcriptase CC Positions [3142-3615] - Integrase core CC 'TATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 382..4086 FT /product="Gypsy9-I_Dya_1p" FT /translation="MFLNMLTKRPTDGECLATHASRMVTELTTKWREMETE FT EIAVSVTLALLASFDNRLQRLTFTSNVQSRGDLQSELKAFTFNKRKSGFVE FT QQTEAHPKRQKPLSMECHFCGKFGHKMFECRLRKQQQHASNNERHDRYDEH FT QRGKPNVTCYKCGERGHISTQCTQKESDKKTFIREKRVEQCSVAVPKGYMN FT HKGKIFEVTFDSGAECSLMKEKISTTFSGKRLNNIVMLKGIGSNGICSTVQ FT ILSNVKIDENCIEILFYVIGDDHMQNDIVIGREILNQGFDIALSSKEFKII FT RSKIINLCTGSEPADIDTELEGQDKIKLNSLLSKYSNCFIKGIPCTKVKVG FT EIKIRLIDPRKTIQRRPYRLSPCERDLVREKINELLKCNIIRPSCSPYASP FT ILLVKKKNGSDRLCVDYRELNSNTIADKYPLPLIQDQINRLRGVKYFTCLD FT MASGYYQIPLNSESIEYTAFVTPDGQYEFLSMPFGLKNAPSVFQRLVMQAL FT GDLANSYVIVYMDDIMITAANQCEALERLEIVLNILTDAGFSFNITKCSFL FT KTSVQYLGFCVSAGEIRPNPQKIVALTALPPPQSVTSLRQFIGLASYFRQF FT IKGFSQLMKPLYFLTSKKNKFIWKTEHEEIRRTVIFTLTDKPVLVIFDPNY FT PIELHTDASCSGYGAILLHKIDNKPQVIEYYSKTTSPAESRYHSYELETLA FT VVNAIKHFRHYLLGRNFTVFTDCNSLKASRNKQDLTPRAQRWWAYLQSFTF FT DIQYREGKRMAHVDFLSRNPISTKPTSAAKVTEEKRVNLAEISSNWLIAEQ FT RRDSDIAEIVNRLKEFSLAEDIAKTYELRAEVLYRKIQRNGRTRCLPVVPR FT AFRWSVINQVHESIMHLGFEKTLDKVYDFYWFDNMSKYIRKFVDNCITCKM FT SKSTSGKIQAELHPIPKINIPWHTIHIDITGKLSGKSDQKEYIIVQVDAFT FT KYVYLYHTLKLDSENCINAVKSSVSLFGVPNRIIADQGRCFTGAKFSQFCS FT DHKINLHLIATGASRANGQVERTMSTLKNMLTAAETGTQSWQDVLGDVQLA FT INCTTNRVTRASPLEMLLGRVARPLGLLPPSDLENDVDLKNVREQAEKNVL FT ATAVYDKERFDKNKAKILRHQVGDFVLLKSEERHQTKLSPKFKGPFEITQV FT LEGDRYILKSLTNKRKYKYAHEDLRKLPEGEVPEELNVFDDYNNNFESKNE FT NNVEDESEKENIENVEVNKECEAYQ" XX SQ Sequence 4457 BP; 1568 A; 743 C; 923 G; 1223 T; 0 other; tcagaagtgg gatctggccc tctaccttca tcagtagatg gaccatggcg tgccatattg 60 gaactacaaa ataaaaactt aattgaacta gtgcgggcca tacaaacaac gacccctgac 120 tcagtgcaaa gtaaatcggt tcaacttccc aaatataacc cagataagac tggcgccatc 180 gcttcatcat ggtgtactac agttgaaatt attttgaaag aaaatccatt aaaaggcagt 240 gcgttggtag tggccttaag ttctgcattg gagggtagtg cttcgcaatg gctatcccaa 300 gtgtgttacg ccgatataac ctggctcgaa ttcaaggaaa tatttataca acgctttgac 360 acaaaggaaa caccagccgc aatgttttta aatatgctga caaagcgacc gactgatggc 420 gaatgtctcg caactcatgc aagccgtatg gtcacagaac ttacaacgaa gtggcgtgaa 480 atggaaacag aagaaatcgc tgtatctgtg actcttgcat tgctggcaag ctttgataac 540 cgactgcagc gtttaacttt cacatcgaat gttcaatcca gaggggattt acagtccgaa 600 ttaaaagcgt tcacttttaa taaaaggaaa agcggttttg tggagcaaca aactgaggct 660 catccgaaac gacaaaagcc attatcgatg gagtgtcatt tttgcggaaa atttggacat 720 aagatgtttg aatgcagact ccggaaacaa caacagcatg caagtaataa tgaacgacat 780 gaccgatacg atgaacatca acgtggaaaa ccaaatgtaa cgtgctacaa gtgcggagag 840 cgcggacaca tatctaccca atgtacgcag aaggagtctg ataagaagac attcatccgg 900 gaaaagcgag tggagcagtg cagcgttgca gtacctaagg gatatatgaa ccataaaggc 960 aaaatttttg aagtcacctt tgattccgga gcagaatgtt cactgatgaa agaaaagata 1020 agtaccacat tttctggtaa aagattgaat aatattgtta tgttaaaagg cataggcagt 1080 aatgggatat gtagtacggt acaaattcta agtaacgtta aaattgacga aaactgtatc 1140 gagattttgt tttatgtaat tggggatgat cacatgcaaa acgatattgt cattggtcgc 1200 gaaatattaa atcagggttt tgatattgca ctttcttcaa aagaatttaa aattataaga 1260 tcaaaaataa ttaatttatg tactggtagc gagcctgccg atattgatac cgagcttgaa 1320 ggacaagata aaataaaact taattcatta ttgtcaaagt actctaattg ttttataaaa 1380 ggtattccat gcactaaagt taaagttggc gaaataaaaa ttaggttaat cgatccaaga 1440 aaaactattc aaagaaggcc ctatagatta agtccatgcg agcgagattt agtccgagaa 1500 aaaattaatg agctgttgaa atgcaatata ataagaccaa gctgctcacc ttatgcaagc 1560 cctatattgt tagtaaaaaa aaagaacggt tctgatagac tttgtgttga ctatagagag 1620 ttaaattcaa acacaatcgc tgataagtac ccgttaccac ttatacagga ccaaataaat 1680 agacttcggg gagttaaata tttcacatgc ttagatatgg ccagcggata ctatcaaatt 1740 ccattaaatt ctgaatccat tgagtataca gcgtttgtga cgccagatgg ccaatatgag 1800 tttttgtcga tgccattcgg gcttaaaaat gcgccctctg tttttcaacg cttagtaatg 1860 caggctctcg gtgatctagc caactcttat gtcattgtct atatggatga cattatgata 1920 actgctgcaa atcaatgtga ggctttagaa aggttagaga ttgttttaaa tattttgaca 1980 gatgctgggt tttcattcaa cataacaaag tgctctttct taaaaacatc ggttcaatat 2040 ttaggtttct gtgttagtgc tggggaaatt cgtccaaacc ctcagaaaat tgtagcattg 2100 actgctcttc cgcctccaca gtcagttacg tcacttaggc aatttattgg attggcatcc 2160 tatttccgac aatttataaa gggattttcg cagttgatga aaccgttata tttcttaact 2220 tcaaagaaaa ataaatttat ttggaaaaca gaacatgaag aaatacggag gactgttatt 2280 tttactctta ctgataaacc agtccttgtt atttttgacc ccaactatcc aattgagctg 2340 catacagatg caagttgcag tggatatggt gcaatcttat tgcacaaaat tgataacaaa 2400 ccccaagtaa tagaatacta tagtaaaact acttcacctg cagagtcgag ataccactca 2460 tatgagctgg aaaccttagc agtagtaaac gctattaaac attttcgaca ttatttgttg 2520 ggtaggaact ttactgtttt taccgactgt aactcactaa aggcctcaag aaataaacaa 2580 gatttgaccc ctagagcgca gcgctggtgg gcttatttac agtcatttac attcgatatt 2640 caatacagag aaggaaaaag aatggcccat gttgattttt tgtcgagaaa tcccatatct 2700 acaaagccaa cttctgccgc taaggttact gaagagaaga gagttaattt ggccgaaatt 2760 tctagcaatt ggctcattgc tgaacagcgg cgagactctg acatagcgga aattgtgaat 2820 agattaaaag agttttctct tgctgaagat atcgcaaaaa cctatgaact aagggcagaa 2880 gttctttatc ggaaaataca gagaaatggc agaacacgtt gtttaccagt agtgccacgt 2940 gcgttcagat ggtctgtcat aaaccaagtt cacgagtcaa taatgcattt aggctttgaa 3000 aagacacttg ataaagttta cgatttttat tggttcgata atatgtcaaa atacatccga 3060 aaattcgttg acaattgcat cacttgtaaa atgtccaaat ccacctcagg aaaaattcag 3120 gctgaactgc atccaatccc gaaaattaat attccttggc acaccatcca catcgacata 3180 acagggaaat taagtggcaa gagtgaccag aaggagtaca ttattgttca ggtagacgcg 3240 tttactaagt atgtctatct atatcatacc cttaagttgg acagtgagaa ttgtataaac 3300 gctgttaaat cctctgtttc attgtttgga gtaccaaacc gaataatagc tgaccaggga 3360 aggtgtttta ccggagcgaa gtttagtcaa ttctgttctg atcacaaaat aaacttgcac 3420 ttgattgcca ccggtgctag tcgggctaac ggtcaagtgg aacggacaat gagcacgttg 3480 aaaaatatgt taacggcggc agagacagga acgcagtctt ggcaggatgt gctaggagat 3540 gtacagttag caattaattg tacaactaac cgtgtaacaa gggctagtcc tttagaaatg 3600 ttattgggaa gggtagctcg gcctttaggc ctacttccac ctagtgatct agaaaatgat 3660 gttgatctga aaaacgtaag ggaacaggct gaaaaaaatg ttttagccac tgcagtttac 3720 gacaaagaac gattcgataa aaacaaggca aaaatactta gacaccaagt aggagatttt 3780 gtattactta aaagcgaaga aaggcatcaa accaaactaa gcccaaagtt taaagggccg 3840 tttgaaatta cacaagtttt agagggcgac agatacatct taaaatcctt aaccaataaa 3900 agaaaataca aatatgcgca tgaagattta cggaaattgc cagaagggga agtacctgag 3960 gagttgaatg tgtttgatga ttataataat aatttcgaaa gtaaaaatga aaataatgtt 4020 gaagatgaaa gcgaaaaaga aaatattgag aatgttgaag taaataaaga atgtgaagca 4080 taccaataag ggtgatgaaa tgaagagtat tagaagcaat ataatgcaag cacaagtgca 4140 aaacttacta gagtctgttt tgatcatggc gtgtggggca ttttattgag ttattctaac 4200 gttgaataaa gaatgttgaa taaagaatgt tgaagaaggt aaaatgatga aataaaagaa 4260 aaataagaaa gagaatggta aagattatgt ttaaatatga aggtttgtca ctataatgtt 4320 aaaggataac taattaagga aataaagttt attgtgctat gaaaaaaaaa aaataaataa 4380 ataaaaaaat aaatgaataa gagtaacttg aattatggaa ctttatacac gaggtcgtgt 4440 tagagtcagg atggccg 4457 // ID Merlin1B_SM repbase; DNA; INV; 1171 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; Merlin1B_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1171 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1894-1894 (2009). XX DR [1] (Consensus) XX CC The youngest copies are >92% identical with consensus. 8 bp TSD. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 156..1040 FT /product="Merlin1B_SM_1p" FT /translation="MNLLQLSALCNDKRSSLQFLQQHGIVHNLRRCSNKHV FT MILSLTDRQDRWRCRQSSCRQDIPVRQGTWLQGSRLSYRQIVLFIYCWSKE FT LTSIRFCETELEIFKSSVIDWNNYIREVCANTLINNPIVIGGPNTTVEIDE FT SLFTRRKNHVGRVLPQQWVFGGICRQTGECFMFTVQDRTAATLLPIISTHI FT LPGTTVISDQWRAYNGITAIAGGGFHHQTVNHSLYFVDPNTGANTQRIERS FT WKAAKERNKRHNGTHRSMLDSYMCEYMWRTRTKVRQLDVFDAILADIVLFW FT PPI" XX SQ Sequence 1171 BP; 386 A; 201 C; 218 G; 366 T; 0 other; ggtacttgtc tgttaattca ccccaatttt tattttttat ttttttacag aaatgaatgc 60 aaaatcattt gttataacat tctgctatgt taaattattt cattaacatt tcaaataaca 120 aaataaaata aataaataat taataaaata ttaatatgaa tttgttacaa ttatccgctt 180 tatgtaatga taaaagaagt agtctacagt tccttcaaca acatggaatt gtgcataatc 240 tgcgtcgttg ttcaaacaaa catgtcatga ttttgtcgct gacagatcgt caggatcgtt 300 ggagatgtcg tcagagttca tgtcggcaag atatacctgt tcggcaaggg acatggttac 360 aaggatcacg gctatcatac aggcaaattg ttttgttcat ttattgctgg tcaaaagagt 420 tgaccagtat acgtttctgt gaaactgaac ttgaaatttt caagtcatct gtcatagact 480 ggaataacta tattagagag gtgtgtgcca atacgttgat caataatccc attgtcatag 540 gtggacctaa cacaactgtt gaaatagacg aaagtctgtt tactcgacgc aaaaaccatg 600 ttggtcgagt attaccgcaa cagtgggtgt ttggaggtat atgtcgacaa actggtgaat 660 gtttcatgtt tactgtccaa gacagaacag cagccaccct tcttccgata atatctacgc 720 acatattgcc aggaactaca gttatatcag accaatggag agcctataat ggcatcactg 780 caattgctgg aggcggattc catcatcaga ctgtaaacca ctctctatat tttgttgatc 840 ctaacactgg agcaaatact caaagaattg aacgttcgtg gaaggcagca aaagaacgta 900 acaaacgtca caatggaaca cacagaagca tgttggattc atacatgtgt gaatacatgt 960 ggcgtacacg tacaaaagtt cggcaattgg atgtatttga tgcgattctc gctgatattg 1020 tgcttttttg gccacccatt taacctgtga caattagtgt tcaaacaatt aaatataaca 1080 tacttagtgt tgtttcatat tagaataaat gattttgcat tcatttctgt aaaaaaaaat 1140 aaaaattggg gtgaattaac agacaagtac c 1171 // ID Gypsy-8_IS-LTR repbase; DNA; INV; 140 BP. XX AC ABJB010050798; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_IS_; KW Gypsy-8_IS-I; Gypsy-8_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-140 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010050798; Positions 4487 4348. XX SQ Sequence 140 BP; 33 A; 29 C; 42 G; 36 T; 0 other; tgtcatataa agcgcctcgc ggtctgggat gcatgtgggg agaaaagaag aagacgagtt 60 gtgtgcagga gcgagggcta aacccgacgg tgttctagtt actcttgtct tggctcttct 120 ttacatagcc ctcggtaaca 140 // ID DIRS-5_DPu repbase; DNA; INV; 4909 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE DIRS retrotransposon from Daphnia. XX KW DIRS; LTR Retrotransposon; Transposable Element; nonautonomous; KW DIRS-5_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4909 RA Jurka J.; RT "DIRS retrotransposons from Daphnia."; RL Direct Submission to RU (09-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 475..3597 FT /product="DIRS-5_DPu_1p" FT /translation="MSSSSARSGSSSSASSESRSGRNRHDRRGEEERDRRV FT REERERMDREDREERERTLREEEERANRDRETEERERRREREERDRESRSR FT SRDRPPRGGLPGIDEAPGVAPIAPAPFSGKFTLDREMGRDLRSWMTAKLKP FT AEGKQLREKYVPSFRSDSFELVCPQLDSSMARRLKDLKSVEATKAESIEKS FT LLAEQYKILDIARPLLFVWENLSKDPALKVSPLSEATGTALQLWGNSFFNL FT TARRRENILKTTDPKFVSLLNEPHRFCKKETGSLFGRSFLRRMVRDADDDR FT KLRNIGRAGGPHQKPYTRDSSNFRSGREHGRYNGRFQPYRGSNEFNKSGSF FT NQQSRGGRYEHFSIPLVKDSTGDIHHLGGRVKLFSEFWPTLTQDRWVLEAV FT SLGVRIPFLERPTVPFYLDNMRMSEKVMAICDEEIKALIEKEAIKEVAGPE FT QRFVSGLFVIPKSSGGYRPIINLKRLNRLVEPKHFKMEGIGVLKELVREGD FT HFCKIDLKDAYLTIPLHPDDQEFLQFRWRGKGYQFLSLCFGLASAPWAFTK FT ILKPVVAFLRRQGIRLIIYLDDILILNQSKEGVEKDFATTVRILEACGFLI FT NTEKSIGEGAKCIEYLGLLINSEELSLSLLPKKIGEIVLLCEKALNAGTIS FT LRDIAKILGNFAWAVQAIPFAQAHFRVIQQLYIEHANRLGENLQSKIILSV FT GARQDLSWWKDNLARVNGKAISASIPDLIIYSDASLSGWGAALNGASASGP FT WTSEDRSRHINELELLAALWALKSFTAGASNIAVQLMLDNRTAVAYVNKSG FT GTRSRNLCSIAARIAEWCEDRLLTVSAVYLPGALNILADRLSRMRPDVSDW FT MLDQSVFRQLQTIWDPQVDLFAANWNRQLELFASWQHQPEAMAVDAFSLNW FT SLFLGYAFPPFSLIQRCLNKIRKDRAEMVLIAPIWPAQPWYPVLLSLVCEP FT PRILPTCSGILSDPSGCPHPLVSSRTLSLAAWRLSGDDMRPRAFREERSSF FT CWEQIVIPQQLHTRAPGTVGVLGVFARVKIPCLLV" XX SQ Sequence 4909 BP; 1232 A; 1047 C; 1369 G; 1260 T; 1 other; tttgtacggg taagtgcacc tcttatccct tctagaaatt ttccagctaa ttgggtattg 60 ggggagggct ggttctgaaa gattctacta aagtgttcaa ttatttcttg gttaatctgc 120 tcaggttggg taattttaca gttatcatta gttttgagct gagattctga gctatcagag 180 acttctggaa attgctcgtt gacttgttca tatgaaaggt agatgtctct tcttcggcta 240 tactttgaac gcgtgaacgt attgcaaatc ccctcaatgc ttctcgctcc caggctagaa 300 actcacgttt caactccatg taacgcgcgt gattgaaatt tacattgtat gaaaattcgc 360 tcacctaaaa ccttgtgttg acccgtttat tagctgtctt tcaaagtgag ttttcttctg 420 gccccctatc ggtcttaccc ctcagacctt tttcagctgt cgttcgaagc ggtcatgtcg 480 tcgagtagtg ccagatcggg ttcaagttcg agcgcaagtt cagaaagtag aagtgggcgt 540 aaccgacatg accgtcgagg agaggaagag agagatagga gagttcgaga ggagagagaa 600 agaatggatc gagaggacag agaagagaga gagagaacct taagagaaga ggaagagaga 660 gctaatcgag atagggaaac ggaggagcgt gaacgacgac gagaaagaga agaacgtgat 720 agggaatcac gcagccgttc aagagaccgt ccaccacgag gcggtcttcc gggcatagac 780 gaagctccag gtgtggcccc aattgctcca gccccgttca gcggaaaatt caccctggac 840 cgcgagatgg gcagagactt aagatcctgg atgactgcaa aattgaagcc ggcggagggt 900 aagcagttaa gagagaagta tgtgccaagc ttcagatcgg attcctttga gctggtatgc 960 ccgcagctgg acagctctat ggcgagaaga ctgaaggatt tgaagagtgt ggaggcgaca 1020 aaagcagagt ccatcgagaa gagtttattg gccgagcaat acaagatttt ggacatcgcc 1080 cggccgctgt tattcgtatg ggagaattta tcgaaagatc cagccttaaa agtctctcct 1140 ttatccgaag ccacggggac ggccttgcag ctctggggca attcattctt caacctgacg 1200 gcgaggagac gggagaacat tttgaagact acagacccca agttcgtgtc tttgctgaac 1260 gagcctcatc gtttctgcaa gaaagagacc gggtctctct ttggtaggag ctttttgaga 1320 aggatggtgc gtgatgcaga cgacgaccgg aagctgagaa atattggaag agccggtggc 1380 cctcatcaaa agccttacac gcgtgatagc agcaacttcc ggagtggcag ggagcatgga 1440 cgttacaacg ggcgtttcca gccctacaga ggttcgaacg agttcaacaa gagcggtagt 1500 ttcaaccaac agtcaagggg aggaaggtat gaacattttt ctatcccttt agttaaggat 1560 tccaccggtg acattcatca tctgggcggt agagttaagc ttttttccga gttttggccg 1620 actttaactc aagaccggtg ggtcttggag gcagtctcat taggagtgcg catcccgttt 1680 ttggagaggc caacggtccc cttttattta gacaacatgc gtatgagtga gaaagtgatg 1740 gcaatttgtg acgaggaaat taaagcatta atcgagaaag aggcgattaa agaggtggct 1800 ggcccggagc agagatttgt gagcggatta tttgtgatac ccaagagttc gggcggctat 1860 cgaccgatta ttaatttaaa acggcttaat cggttggtgg agccgaaaca tttcaaaatg 1920 gaaggtattg gggttttgaa agagctagtg agggagggcg accatttttg caagatcgac 1980 ttgaaggacg cgtatttgac tatcccactc cacccagacg accaagaatt tcttcagttc 2040 aggtggaggg ggaagggtta tcaattttta tctttatgct ttggcctggc gtcggctcca 2100 tgggcgttca ccaaaatttt gaagccggtg gtcgcctttt tgcgtagaca ggggattaga 2160 ctgataattt acctggacga catccttatt ctgaaccagt ccaaggaagg ggtagaaaaa 2220 gattttgcta cgacagtgcg gattttggaa gcttgcgggt tcttaataaa tacggaaaaa 2280 tcgataggag aaggggcgaa gtgtatcgaa tatctgggcc tcctaatcaa ttcagaagaa 2340 ctttccttgt cacttcttcc gaagaagata ggtgagattg ttctgctttg cgagaaagcc 2400 ctgaatgcag gaacaatatc gttgcgggat attgcaaaga tcttgggtaa ttttgcgtgg 2460 gcagtgcagg ctattccatt tgctcaagcc cactttagag taattcagca actgtacatc 2520 gagcatgcga accggttggg tgagaacctc cagtccaaga tcattttgag cgtgggggcc 2580 cgtcaagatt tgagttggtg gaaggacaat ttggccaggg taaacgggaa agctatttcg 2640 gcgagcattc ccgatttgat catctactcc gatgcctcgc tctcgggttg gggcgcagcg 2700 ctgaacggtg cgtcagcgag tggcccgtgg actagtgaag acaggagtcg gcatattaac 2760 gagctagagt tattagcggc actgtgggcg ctgaagtcct ttacggcagg agcttccaac 2820 attgcggttc aattaatgtt ggataatcgt acggctgtcg cttatgtgaa caagtcagga 2880 gggacgcgct cgagaaattt gtgttcgatt gcggcgcgga tcgcagaatg gtgcgaggat 2940 cgcctcttaa cggtgtcggc tgtatacctt ccgggtgctc ttaatattct tgcagaccgg 3000 ttgtcacgga tgcgccccga tgtgagcgac tggatgctgg atcagtccgt cttccgtcaa 3060 ctacagacca tttgggaccc acaagtagac ctgtttgctg cgaattggaa tcgacagctg 3120 gagctcttcg ccagctggca gcaccaaccg gaggcgatgg ccgtggatgc attcagcctg 3180 aattggagcc tcttcctggg atatgccttc cccccatttt ccttgattca gcgttgcctg 3240 aacaagataa ggaaggatcg ggcagagatg gttctgatag ccccgatttg gccagcgcaa 3300 ccctggtacc cagttctgtt gagtctggtg tgcgagcccc ctcggatttt gccaacatgc 3360 agcgggatcc tgtcggatcc gtcgggatgt ccgcatccgc tagtatccag tcggacgttg 3420 tcactagccg cctggagatt atccggggac gatatgaggc ccagggcctt tcgagaggag 3480 cggtcgagct tctgttggga gcaaatcgtg ataccacagc agctgcatac cagagcgcct 3540 ggaacggttg gcgtacttgg tgtcttcgcc agagtgaaga tcccctgtct cctggtttga 3600 ataaggtatt agatttccta gcggggttat gtaacgaagg aaaggcctac aggtcgataa 3660 acgtttatcg ctcgatgttg agctcgactt tgaaggccat tgatggcttt gatgtaggga 3720 agcacccgat ggtgatgaag gtcatgcagg gtatttataa tgttaaaccc ccagctccta 3780 agtacaataa tttttgggac gtaaacgagg ttttgaaatt tttggattgg caggcgaatt 3840 cagagacgcc ctttgccaag ttatcaagta agacggtaat gttattggcc ctgacttccc 3900 tgtgcagagt gtcggaactg gcgtcaatat cgcgtgactc gattgcattt agttcccagg 3960 gagtaaaatt gagcctaacg cgtcccagaa aggcccaacg ccaggcggcc ctaaaggtgt 4020 tcaacttgaa gaggctggac ccacccgctt ttacttgccc ggtggtctgt ctggaggctt 4080 acgtcaaagc gtcggatgta tttaggaaga acgaccagag tcttttattc ttggccacca 4140 ggtctccctt cagatcagtg ggagcgtcaa ctattggtcg atggattaag accttgttgg 4200 cggaagccgg agtagatact tctgtatatt cagcgcattc gacgagagga gcttcagcat 4260 ccagggcggc cagtgctgga gtatcagtcg aaaatatcct tcggagcgga ggttgggcat 4320 cagaatcagt ttttgttaga cactacagac gcgaagtcga ttcagggaga gagtttgcga 4380 ccgcagtgtt gcagacagct tgagtatcga gtatgcttta aagtcacaag gttttaggtg 4440 agcgaatttt catacaattg gaaaattgct cgaagtcgcg tagcgactga agagcagttg 4500 agattgtatg gaaagttaaa gcgaaacctt aaaccttgtg tagaccgaat atcagccgcc 4560 ctccagcccc agttttattc tggcccccta tcggtcttac ccctcagagg acgaggcgga 4620 accttcgcga acggcggtcg ccgcggttca taggattcct cgcgtttttc gtctgggatt 4680 taaagtggct caatttnttt gttgttatgt tatgcctcta aaggtctgag gggtaagacc 4740 gatagggggc cagaagaaaa ctcactttga aagacagcta ataaacgggt caacacaagg 4800 tttaaggttt cgctttaact ttccatacaa tctcaactgc tcttcagtcg ctacgcgact 4860 tcgagcaatt ttccaattgt catgaaaatc gtttctttca gttgttgtt 4909 // ID CR1_Ele4 repbase; DNA; INV; 4538 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 19-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele4. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4538 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4538 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 726..1577 FT /product="CR1_Ele4_1p" FT /translation="MLSVCEQCANEFTGTPVKCMGFCSSLFCKKCSGMTDE FT TQRLIDTNNHLVWMCFACCNILSKSRFHKSVASVNAANEKIIDALKTEIKD FT SILQDIRTEIRDNFKTLVEAVPSTPLAIQPPPFRPSSRSKRQRDTDVDDDS FT TNGRPSKQPCATGTRYSEVGSLAINPTSATPEFWLYLSGIQPDVEEDQVRH FT MVQESLATNDLKVVKLIPRNRDLRTLSFVSFKVGVSADLREKAMSADTWPR FT GIRFREFEDHSLTRQGFWRPTSVPVITVPQTIIIPDPQQPSMG" FT CDS 1580..4465 FT /product="CR1_Ele4_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLNSDTEPPRTSSVLSSFLVYYQNVRGLRTKTSDLFV FT QLSDCDYEVLAFTETWLHPGIANSELSSAYNLYRCDRNASTSNLQRGGGVM FT IAVKKHHRCEIVSFGQGFGLEQVAVRIKLPSKSVYVCCIYISPNSDSSIYD FT QHASCIREVCEKASINDTIVLLGDYNLPHLIWSYDDEINAYLPTNASSESE FT ITLTESVLSVGLFQVLNVLNQNGRLLDLAFVNDAHSVEVLEPPMPLLKVDQ FT HHKPILLNFHLPLKNDDTDYYDIRLDFRHYDVSAVCRSITAINWNVTLNEN FT SVDDAVIEFYNLIHDIIREHVPRKNNQAARSFNQPWWNDDLRHFRNRLRKI FT RKRYFKHRSEENWITLQSMEREYSSMRRTAFREYISGLEHKAKSDPSSFWK FT FVKNQRKPNSVPREISFQNEHASSATDSANLLAAFFKSVFNENSTQVTHEV FT LDGVLSYDLRMPPMEVTVDEVYQRLSDLDTTKGPGPDEIPPLFIKQCAESL FT ALPLSILFNRSLTNRVFPALWKEASITPIHKSGNLTNAENYRGISILSCFS FT KILECFFHEALYRAVLPVIPEVQHGFVKNRSTTTNLMTFVTSVRTKMEKNR FT QVDAVYIDFAKAFDRVPHGLLIAKLQKMGLPEWATSWLRSYLTARWAFVKI FT DGIKSSRFRIPSGVPQGSHLGPLLFILFTADLCGLIQSEKLFFADDLKIFR FT AILSRIDCCIIQRDLDTIVKWCHANGMEVNADKCKCISFSRSRSPMQQNYE FT MASQTLERVSSIKDLGVIVDSRMNFNEHIATTTAKAFSLLGFVRRITKSFR FT DVYAMKSIYCAVVRSVLEYAVQVWAPYHQVQADRIERVQRSFVRYVLRQLP FT WNDPIRLPPYDHRCRLIGLDTLASRRVMAQRIFCFDLLTGNIDCANLLRQM FT NFHAPVRRLRQNTLFYLPTHRTLYGHNNPLHSCCRYFNEVHDKFDFNITKT FT TFKSRIR" XX SQ Sequence 4538 BP; 1241 A; 1068 C; 954 G; 1273 T; 2 other; ttgagaacca ctgttctagt cgcttgcgtt gttgtttttg ttgctccgat catttcccgt 60 attttcacct gctaccgcca ccgcaattca tatttcaccg aaaatcgagt tccaagtacg 120 acgactcatc accagcatct gcaattccca ctatatctgg tcaggtagga tcgtgatttc 180 aagttaaaat ccgagcaggc ttgagagtgg aagtagtttg cctcctccgc cattgccgcc 240 acaaaaactt tcccacctcg ccgctctcca cctgtacccg ttgctgttaa catatcgctc 300 gctgaatcaa tcacatcgtc ctactcgctc gcaccgctgc atccgttccg cataaccaaa 360 ccaaaacaaa ccgcaattgc caatctgtaa aatcccccag agcatgtaat atactgccgt 420 gtactgccct gtgtcatcat ttcaacaatc cgaaacccac cagcatagat tcawcctcag 480 gaaccatccc atttcatcat tggaccatct ttcgccacca atccgtgcgc catcagagta 540 agtcgagtgg aattgttttt gtatcgttgc tgctccattg ttgctgtctc gctaccgttc 600 tgctgtcact gtcaccgctg acaaatcgcc gtttgctgtc ttccaactat tcgctgcgcc 660 atctagtttt tgagcatcgt aactctttcc gatattaggg tgactcaaat tctttgccgc 720 aaaaaatgct ttcagtttgt gagcagtgtg caaatgaatt taccggcacg ccagtgaaat 780 gcatgggttt ctgctcaagc cttttctgta agaaatgctc tggaatgacg gatgaaaccc 840 agcgcttgat cgacacaaat aatcatctcg tctggatgtg ttttgcttgc tgcaacattt 900 tgtcgaaatc acgtttccat aaatccgttg catccgtaaa tgcggccaat gaaaaaatca 960 tcgatgcgct gaaaacggaa ataaaagaca gcattcttca agatattcgt actgaaatcc 1020 gcgataactt caaaaccttg gttgaagcgg taccctctac acctctcgca atccaaccgc 1080 caccgtttcg tccatcttca agaagcaaga ggcaacgtga caccgatgtg gatgacgata 1140 gtactaatgg gcgcccatca aagcagccnt gtgcaactgg cacgcgctac tcagaggttg 1200 gttccctggc aattaatccc acgagtgcaa caccagagtt ttggttgtat ttgtctggta 1260 ttcaacccga cgtcgaagag gatcaggtac gtcacatggt gcaggagagc ctagcgacga 1320 acgacctgaa agtagtgaag ctgattccaa ggaatagaga tcttcgaacg ttatcgtttg 1380 tctctttcaa agttggagtc tctgctgatt tgagggaaaa ggcaatgtct gctgatacct 1440 ggcccagagg aattcgtttt cgtgagtttg aagatcatag cctaacacgc cagggttttt 1500 ggagaccaac atctgttccc gtgattacgg ttccacaaac gatcatcata ccagaccccc 1560 agcaaccgag catgggttaa tgctgaattc cgataccgaa ccaccacgta cgtcgtctgt 1620 cctgtcctcc tttcttgtgt attaccaaaa tgttcgtggt ctaagaacaa agacgagcga 1680 tctgtttgta caattaagtg actgcgatta tgaggttctg gcttttacgg aaacttggtt 1740 gcatccaggt attgccaatt cagagctgtc ctctgcatac aatttgtatc ggtgtgatcg 1800 gaacgcctcg acaagcaatc ttcaacgagg gggtggcgtc atgatagctg ttaagaagca 1860 ccatcgatgc gagatagttt cattcggtca ggggttcggc cttgagcaag ttgcagtacg 1920 tattaagctt ccatcaaaat cggtctatgt ttgctgtatc tacattagtc ccaactccga 1980 ttcgtcaatc tatgaccaac atgcttcctg cattcgagag gtttgcgaga aagcatcgat 2040 caacgacacc atagttttgt tgggcgacta caatctaccg catctcattt ggagttatga 2100 tgacgaaatc aacgcgtacc taccgacaaa tgcatcttct gaatcagaga tcacgttgac 2160 ggaatcagtg ctttcagtcg gactctttca agtgctgaac gtccttaatc aaaatggtcg 2220 tttactcgat ttggcgtttg taaatgatgc tcatagtgtg gaagtattgg agccgcctat 2280 gcctctgtta aaagttgacc agcaccacaa gccgatcctc ctaaattttc acctaccgtt 2340 gaaaaatgac gatacggatt attacgacat ccgactcgat tttcgtcatt atgacgtcag 2400 tgctgtctgt aggagtataa ccgccatcaa ctggaatgta actctgaatg aaaactctgt 2460 ggacgacgca gtcatcgagt tttacaatct aatccacgac atcatccgtg aacatgtgcc 2520 tcgcaaaaac aaccaagcag ctcgcagctt caatcaaccg tggtggaacg acgatcttcg 2580 acactttcga aacaggctac gcaaaatccg taaaaggtat tttaaacata gatccgaaga 2640 gaactggatt acacttcagt caatggaacg ggagtattca tccatgcgtc gtacagcttt 2700 ccgtgagtat atatcaggct tggaacacaa agcgaaaagc gatccttcgt cattttggaa 2760 gtttgtgaaa aatcaacgca aaccaaacag tgtgcctcgt gagatatctt tccagaatga 2820 gcatgctagt tccgcaacag actcagccaa tcttttagcc gcatttttca aaagtgtgtt 2880 taacgaaaat tcgactcaag ttactcacga ggtgctcgat ggtgttttgt cgtatgattt 2940 gcgtatgcca cccatggagg taactgtaga cgaagtttac cagcgtttat ctgacttgga 3000 taccactaaa ggtccaggtc ccgatgaaat tccaccgctg tttattaagc agtgcgccga 3060 atcgcttgcc ctacccttat caattctctt taatcgatcg ctgacgaatc gcgtttttcc 3120 tgctttatgg aaagaagcgt ctattactcc aattcacaaa tcgggcaacc taacgaatgc 3180 tgaaaattac cgtggtattt caatactgag ctgtttttcc aagattctcg aatgcttctt 3240 ccatgaggcc ttgtatcgag ccgttttgcc agtcattccg gaggtacagc acggttttgt 3300 caaaaacaga tctacaacga cgaatctaat gacgtttgta actagtgtta gaaccaaaat 3360 ggaaaaaaat cgtcaagtcg atgccgtata tatcgatttt gcaaaggctt tcgatagagt 3420 acctcacgga ttacttattg caaagctaca aaaaatgggc ctccccgagt gggcaacatc 3480 ttggttacgc tcatatttga ctgcacgctg ggctttcgtc aaaattgatg gtattaagtc 3540 atcccgtttc aggatacctt cgggtgttcc tcagggaagt catctgggac cgctgctttt 3600 cattttgttc actgccgatc tgtgtggtct aattcaatca gaaaaattgt ttttcgccga 3660 tgacctgaaa atcttccgag cgatactatc ccgtatcgac tgttgtatta tacaacgaga 3720 tttggatacg atcgtgaagt ggtgtcacgc caacggaatg gaagtaaatg cagacaaatg 3780 taaatgtatt tcattttcac gctcacgttc cccgatgcaa caaaattacg aaatggcatc 3840 tcaaacactt gaacgtgtca gttctattaa ggatcttgga gtgattgtcg atagcaggat 3900 gaactttaat gagcatattg ctactactac ggcgaaagcg ttttccttac taggattcgt 3960 acgaaggatc acaaaatcgt ttcgggatgt ttatgcgatg aaatcgatct actgtgcagt 4020 agttcgcagt gtgttggagt acgccgtaca agtctgggca ccgtatcatc aagttcaagc 4080 tgacagaatc gaacgggtgc agcgatcttt tgtgcgttat gttttgcgtc aattaccgtg 4140 gaatgatccc atcagattac ctccctacga tcaccgttgc aggctaatag gattggacac 4200 gttagcgtcg agaagagtta tggcacaacg aatcttctgc ttcgacttac tgacaggaaa 4260 catcgactgc gcaaatcttc ttcgacagat gaatttccat gctccggttc gtcgtctacg 4320 acaaaatacg ttattttatc taccaacgca tcgtactctt tacggtcata ataatcctct 4380 ccatagctgc tgtagatatt tcaatgaagt gcatgacaag ttcgatttta atataacaaa 4440 gacgacattt aaaagtagga taagataatt aatttaagct acagtctgtg caaccattga 4500 gttgaaggca aagaagaaat aaataaataa ataaataa 4538 // ID P-17_HM repbase; DNA; INV; 2718 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2718 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 363-363 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(134..895,1023..2333) FT /product="P-17_HM_1p" FT /translation="MPNKCCVYGCSSNYKSKVKDNVYVTMYRFPFKSEDRE FT LWIKKLPNANFVFTDYKRICAQHWPKDAELKKVKGGSFVPVHPPSVFKGIP FT VSCIPIVSRKRDVTGLICSERNNLPDEFENFKKMDVIQFNSIVKNLKSIIN FT NCEEKFYFVEKEKNISLISYEHCGPVPKFSIYITLEDSGDIYFTGYSQLLQ FT IKLSFLNKGVIKYYSQLENAVSYVINNETDSKRASYIRHQVALLNKTNNTP FT LALVHLYYLFSIFVNQKTFQYFNAKKVPAYYLLNDTVHILKCIRNNWITEK FT LQCLKYRLLNDNDTVRLAEWKFIIQLHAADSRIVKLSPLTQKAVSPKPIER FT QNVGLVLKVFCDETVAALRVKFPEAEDTALFIETVVKWWLIVNSKAKGLDI FT RLNDIRRKPITSVNDWQIDFLENYISYFAENLKCVNSKLREKKLTMDTASA FT LQKTSKRLASCAKYLLNCGVSYVLLGHMQSDALEKQFGKYRQGSGGTYLIT FT VQNVIQKFRIDKTRKLLDKHTDFTIFPPAHHHCDKCELDVEFNMSICENCE FT DQLPNECKQGLLYIAGFIAYKLPELHSILEETDTFDMFQKYGTLISSLNRG FT RLREPSDNLVYFVFYCFFIFEVTLVNKVSPLLCKKGLIDIFINCNSKFQLL FT VQSVKKVCCILANIFLNNYCKKHSEDFGKECLIKQAKLSTSK" XX SQ Sequence 2718 BP; 999 A; 367 C; 441 G; 910 T; 1 other; caaggccttc ttaaacaggc cgcgacagct cattttccgt aaaattagcg aaggccgact 60 tttttacgga gaagtatagt ttcaatgagg attttggttg tttgatttaa actataatag 120 ttaaactcgt aaaatgccta ataaatgctg cgtttatggt tgctcctcaa attacaaatc 180 aaaagtaaaa gataatgttt atgtcacaat gtatagattc ccctttaaat ccgaggatcg 240 cgagttatgg attaaaaaac ttccaaatgc aaactttgtg tttacagatt ataagcgaat 300 ttgtgctcag cactggccta aagatgcaga attaaaaaaa gtcaaaggtg gtagttttgt 360 tccagtgcat cctccatcag tctttaaagg tataccagtg tcttgtattc caatagtgtc 420 tagaaaacga gatgtcactg ggttaatatg ctctgaaaga aataatttac ctgatgagtt 480 cgaaaacttc aaaaaaatgg acgttattca atttaattca attgtaaaaa atttgaagtc 540 aataataaat aactgtgaag aaaaatttta ttttgttgaa aaagaaaaaa atatatcttt 600 aatatcatat gaacattgtg gaccagtacc taagttttca atatatataa ctttagaaga 660 ttctggagat atttatttta ctggatatag ccaactttta caaattaaac tgtctttttt 720 aaacaaagga gttattaaat attattctca actggagaat gctgttagtt atgttattaa 780 taatgaaaca gattctaagc gagcaagtta tattaggcat caagtagctt tattaaataa 840 aacaaacaat acaccattag cattagtgca tctttattat ctctttagta tattttgact 900 cgttgaaaag cttgttttgc aaagttacaa caaattttca aacaagcgtg atagaaaaaa 960 taaatcaaca aatcataatt gcaggtggaa agcctyttgg gttaataatg gataactgtt 1020 gagtaaacca aaaaacattt caatatttta atgcaaaaaa agttccagca tactatcttt 1080 taaatgacac tgtgcatatt ttaaaatgta ttcgaaataa ttggatcact gaaaaattac 1140 aatgtttaaa atatagattg ctaaatgata acgatactgt gcgacttgca gaatggaaat 1200 ttataattca acttcatgct gcagatagtc gaatagttaa gttgtcaccc ttaacacaaa 1260 aagctgtgtc accaaaacca attgaacgac aaaatgttgg attagttctt aaagtttttt 1320 gtgatgaaac agttgcagca ttacgtgtaa aatttcctga agctgaagat actgcacttt 1380 ttattgaaac agttgttaag tggtggctta ttgtcaacag taaagcaaaa ggtctagata 1440 ttagattaaa tgatattcga agaaagccta ttacttcagt caatgattgg caaatagatt 1500 tcttagagaa ctatatttcg tattttgctg aaaatttgaa atgcgtaaat tcaaaattaa 1560 gagagaaaaa gttaacaatg gatactgcaa gtgctttgca aaaaacttct aaaagattag 1620 caagctgtgc caagtatttg ttgaattgcg gtgtatctta tgtattgtta ggacatatgc 1680 aatcagatgc actagaaaaa cagtttggga aatatcggca agggtcaggt ggcacatatc 1740 tgataactgt gcaaaatgtt attcaaaaat ttagaatcga caaaactaga aagttgctag 1800 acaagcatac agattttaca atatttcctc ctgctcacca ccactgtgat aagtgtgaac 1860 ttgatgtgga gtttaatatg tcaatatgtg aaaattgtga ggatcagttg ccaaatgaat 1920 gcaaacaagg gcttttatat attgcaggtt ttattgctta caaacttcca gaattacatt 1980 caattttaga agaaacagac acctttgata tgtttcaaaa atatggaact ttgatcagtt 2040 ctttgaacag gggaagacta agagaacctt ctgacaattt agtctacttt gttttttatt 2100 gtttttttat atttgaagtg acacttgtga ataaagtttc accattgctg tgcaagaaag 2160 gattaattga catttttatt aactgcaata gtaaatttca attgcttgtt caaagtgtta 2220 aaaaagtttg ctgcattttg gcaaacattt ttttaaacaa ttattgtaaa aaacactctg 2280 aagattttgg gaaggaatgt ttaataaaac aagcaaaact ttcaacttca aaataaaata 2340 ctttgagtca aaaaagtcaa aataatttta ataaaataaa gttaatatat ttgtttgttt 2400 ttatcaaaaa atttattatt tgtattacaa gtgttaaaca gctaaaataa acatatagtt 2460 ataaaaacga gatttagtca tttttagcta aactccttaa atttttattg taattaactt 2520 atttctaaac gtaaaaaaaa ttcctacaat atttgctttc gaaattcaat caaaatgttt 2580 tgtcctaata aagctcttta tttgtttctg aagtttgttt acaaaagcaa agataaaaaa 2640 taattttaac cgtaaaaaag tcggccttcg ctaattttac ggaaaatgag ctgtcgcggc 2700 ctgtttaaga aggccttg 2718 // ID BEL-65_CQ-LTR repbase; DNA; INV; 383 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-65_CQ_; KW BEL-65_CQ-I; BEL-65_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-383 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 284-284 (2011). XX DR [2] (Consensus) XX SQ Sequence 383 BP; 119 A; 115 C; 88 G; 61 T; 0 other; tgttacgtct gaccgaagac ttgcccccac ccttgcaaca cccgttgact acgccgctga 60 gcaacgccgc agctgtcaac ctcaccgaca actacaacgg cccgcaacga gtttggccac 120 caacgaggtg gagaaaagct cgcctactgc gcactgacta gtcccgatcc acccaccgaa 180 cgattagcac gtgcagcaag aacccatcaa gacagaagca gaaagataga aaggaagaag 240 agttggtgaa gtgatagaag gaggaggaaa attaaagaaa cctagaaaat caaaccctcc 300 cagtgttttg ctaccccgga caaatcttcc accgcgaaac agtccacttc cgtcagttca 360 agaggccgcc gttggtccga aca 383 // ID KAMIKAZE_BM-I repbase; DNA; INV; 7270 BP. XX AC AB042120; XX DT 25-APR-2010 (Rel. 15.04, Created) DT 25-APR-2010 (Rel. 15.04, Last updated, Version 2) XX DE Bombyx mori Pao-like retrotransposon Kamikaze DNA (internal DE portion). XX KW BEL; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; protease; integrase; RNase H; gag domain; KW pao-like retrotransposon; KAMIKAZE_BM; KAMIKAZE_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Abe H., Ohbayashi F., Sugasaki T., Kanehara M., Terada T., RA Shimada T., Kawai S., Mita K., Kanamori Y. et al.; RT "Two novel Pao-like retrotransposons (Kamikaze and Yamato) from RT the silkworm species Bombyx mori and B. mandarina: common RT structural features of Pao-like elements."; RL Mol. Genet. Genomics 265(2), 375-385 (2001). XX DR EMBL/GenBank/DDBJ; AB042120; Positions 156 7425. XX FH Key Location/Qualifiers FT CDS join(1532..3196,3200..6718) FT /product="KAMIKAZE_BM-I_1p" FT /translation="MPSKSQAAIFREATKFITLLKEIALSILAGDRIKYDM FT ADLSARLPRLDTNKSKLEDVYYEILDDEDLSEEQEKMYTKNYETSIVDYHD FT ILVAWEKINSKPASSPSSQPSTSSTLTTVLPKIALPKFSSKVEDWPSFITI FT FKSLTDDMLTLSDSVKLHYLFSCLSGEALGMVSHLKITNENFSVALDILTR FT RYENRRVLIDRFVDIIMSLPNIHSRSNIRTLFLTPLISAQSALSNLDLPMK FT DCDYIFVSIVVRKLKGELRTLFERKHGSRQSLPTLKNLISFLEEHARCNET FT EWSNTTYSQYNQPSQKSGFSRRQSPTSQPLQVSRQTQYQQRRPFTPNNSRP FT EYLPRTTAPPWQINSAHPTQLSQNNNLYCPYCKTKGHKLITCTRYNDQPIQ FT GRWDFIRARERCQRCLGPHYENECKTTGVCKECGSANHHTSLHRHGASSPI FT QSSAHLLPLNLREQPRQRPTHCPRARSSHATSHTPSPSHTPQVATQSTARD FT DVTALKAERDALARALAAERQRAATAPWLYKEADRPVSPQEYQTHGSSLHQ FT SRSPRRNQLSTITEHDQEEKYAQLMHTHTTHTRPTVSTQTITSAQHENHNP FT NTPILLPTVTLEIADVHGQFQRARALLDCGSMVTLITQRMARTLQLPLKNT FT SLQISGVGNQRTPYSKASIKIVCRPTHSETPTITATAHILKHVTGYLPLGK FT VQNISHMVDQSTIPLADQGYHTPAPIDLLLGSDILGQVLDGTKVSLGPGRP FT IAFGTIFDFTLLGPIHDLYTAPVSKTESAHIVSAQPELEVSTHDIRKSLEK FT FWESEEPQLHVETTPLQDQCEEIFRTTTTRKPSGQYTVTLPFLHNMPELGS FT THAIALRRFLNLEKKLQADPYLRVKYIDFMADYQKLGHMSPCHPSTFAHKP FT HFYIPHHGIFKSGSDKLRTVFDGSCKSSNGVSLNDCLHTGPALQQDIVDII FT LSFRTHPVVFSTDICMMFRNILIHPDQRRYQLILWRSSPDQPLLTYALNTV FT TYGLRSSPYHAIRTLIQLADDEGHRYPAAAQVLRKSIFVDDILTGHDSVVK FT AQALQNDLINLLALGGFQLSKWTSNCPQLLERFPDDQCDMPKNFDISPDSN FT SIKVLDIQWIPQSDELTYRISLPSIKQITKRSILSTVASFYDPNGWVTPVI FT FRAKLLLQKLWLLKLEWDEHTPVEVQTEWNRISQDLPQLSTLRLPRLICTN FT KLKSTYSLHGFADASEASFAAVIFYLHELDEHRCVNVYLVIAKSRVAPVRN FT RLTIPKMELSAASLLTQLMIRVSTQLLSHITIKQHICWSDSTIVLAWLNTP FT PHRLQVFEANRVAKITSNPITSTWKHVPTNLNPADCASRGMSAQSLSAHDL FT WWSPSWLKEPPDTWPKMPPALGHHALPGLKPKKVPAHIAVPDLDLDLLTRF FT SSLDKLVGVTACIKRFIFNCRHNSTDRCSGPLTVGERRDALLFWVRSVQHN FT EFAEDIYRLQVGKICTVRLQRLSPLMKDDLLRVGGRLTHAPIRYDAQHPLV FT LPSSSPLVDLIIDHYHRINCHPGTDTLHAILRQQFWILSARRVIRHRVYKC FT IRCFRYRAQARAPLMADLPADRVTPQPVFSQVSTDFAGPFLIKSSTLRNAK FT LMKAYFCVFVCLSTKAVHLELVSSLSTEAFLAAMQRFVSRRGTPTLIRSDC FT GTNYVGAKNHLIEVQDFLAQNNDTITHRLANQHITWLLQPPTGPWFTKPWP FT SRNCGQEH" XX SQ Sequence 7270 BP; 2103 A; 1787 C; 1320 G; 2060 T; 0 other; ttttttggtc cttcgagctt tgaaccctaa atcctttgga tacaatccat tgccctccat 60 agacccaccc aaattttgat atacaaacgg ctcaaaaaaa aaaaaaattg gcccttttgt 120 aggtttcaat cggttataat cagcccatta tttaactcta aaacttattt actataataa 180 tatcttcatt ttattttatt tttatttatt ttattaggct cacaacacac atcacaagct 240 aaagatacac aaaaaacaca aataaaaaga aacaaaatct ggcttgcaac ataagacatg 300 tgcgcacgac aaaacaagac atatagtgct gaatttgtat aacaatgttc cttacaatta 360 tgacaaaaca acacaataac aaaaactaat ctaacgaata atgtaaggaa gaaaaaaaaa 420 aaaaaaatcg caatatagta cattaaacaa ttacattaac ctacctatta ataaagtgga 480 ggatgccctt gaacattttt ctaaacgtaa ctaacctaca agtaaaaata tctaactcag 540 gtgacgacct tgccagttca ttataggtac gacacattcg ggtaatggga ttataataag 600 atatattgtt tttataaatt agcaatgcaa acaaattgtt ttggttgacc ctccgagcac 660 taaatttaat gtttaatgaa atctgagata acagttcagg cgaatccaac agagagtgaa 720 ttatcttata taagtatatc agatcagtgc atgtacgtcg atcttctagt gactgcatgt 780 tatagtttaa tagtctttgc tcgtatttaa gttcagagcg gcctggttgt atacgataac 840 ttaataatct taagcacttc atttgcacac gctcaatagt gttgctcaat ttcgttgttt 900 ataataatta ctaacacggc aaaagtatat aaattgtctg cccattatac aatcacttca 960 ttccatttta cccacaacaa taaaaagtta ttaaaagtta aaacgaagaa cgtaaacaag 1020 tagggactta tcgtctcggc tcatcgaatg aatattatat atacctattt aaatatattt 1080 ctaatcatat agacctcatt agtacattag tttgattttt aaaagtgtgg aacattgatg 1140 atttttttca ttaaatttgt tgtcaaagcg tatcgtgatt tgtgtcattt cattatactt 1200 taacttttat tgtaattaca aacttgtagg tacttacaat tcctacgata tttaggtgct 1260 tgcgcgccac agtacagttc ataacagtaa atcaactgta actttgattt ttgccatagt 1320 tcggtgtaga acagtaggtc aaaaggtttt tgaaaaaaaa aaaaattgtt ttcatttaca 1380 ttatcattac acagggtagg ttcagtatac agggtgcccc attttgcctt tttattataa 1440 tttttttttt tttttttaga acacactttt tttttccttg tctttttttt tctcttcgcc 1500 tttgtttagt taagtccata tattgttatc aatgccgtcg aaatctcagg cagctatttt 1560 tcgagaagca acaaaattca taacattatt aaaggaaatt gccttgtcca tactggctgg 1620 ggacagaata aaatatgaca tggccgattt gtctgctcgc ttgcctagat tggataccaa 1680 taaatctaaa ctagaagatg tatattatga gatcctcgat gacgaggatc tctcagagga 1740 gcaggaaaaa atgtacacaa aaaattatga gacctcgatt gttgactacc atgacatctt 1800 agtagcatgg gaaaaaataa actcgaagcc tgcttcctca cccagctctc aaccctcaac 1860 ttctagtacg ctgacgactg ttctgcctaa aattgccctc cccaagttca gctcaaaggt 1920 cgaagattgg ccttcattta ttacaatttt taaaagtctc acggatgata tgctcactct 1980 ctcagattct gtcaaactcc attatttgtt ttcatgccta tcgggcgaag cgcttggtat 2040 ggtgtctcat ctcaaaatta ctaatgagaa tttctccgtt gccttggaca ttttaaccag 2100 gcgctacgag aatcgccgcg tgctgattga taggtttgtg gacatcataa tgagcctccc 2160 aaatatacat tcacgctcta atatccgcac actctttttg acacctttga tttccgcgca 2220 atcagccctt agtaatctag atctgcccat gaaagactgc gattatatct ttgtctcgat 2280 agtggtgcga aaactcaaag gtgagctccg cactctattc gagcgtaaac atgggtcgcg 2340 tcagtcttta cccactttaa aaaatttaat ctctttttta gaagaacacg cgcgctgtaa 2400 tgagaccgaa tggtctaata caacatactc ccaatacaat caaccttctc aaaagtctgg 2460 tttttctcgt aggcagtctc ctacttctca gccccttcaa gtttcacgac aaacacaata 2520 tcaacagcgt cggcccttta ccccaaataa ttcccgccca gagtatctac cccgaactac 2580 tgccccacca tggcaaatta actcagctca tcctactcag ctctcccaaa ataataactt 2640 atattgtcca tattgcaaga caaaagggca caaactaatt acgtgtactc gctacaatga 2700 tcagccgata cagggtagat gggactttat tagagcccgc gagcgttgtc aacgctgtct 2760 aggaccgcac tatgaaaacg agtgcaaaac cacgggtgtg tgtaaggaat gtggaagtgc 2820 taatcatcac acctcacttc accgccatgg cgcatcgtct ccaatacagt cttctgcaca 2880 tctccttccc ctgaatttac gggagcagcc taggcagcgc cctactcatt gtcctcgggc 2940 acgctcgtca catgcgacct cgcatacgcc gagcccatca cacacaccac aagtggcaac 3000 acagagcacg gcaagagatg acgtgacggc tctgaaagca gaaagagacg ccctcgccag 3060 agcattggcg gcggaaagac aacgtgcggc caccgcgccc tggttgtata aagaggcaga 3120 taggccagtc tcaccccaag agtatcaaac tcatggttct tcgctacatc aatcgcgatc 3180 accacgccgg aaccaatgac tatctactat cacagaacac gaccaggaag agaaatacgc 3240 tcagctcatg cacacccata caacacacac tcgtcccacg gtatcaacac aaaccataac 3300 ttcagcacaa catgaaaacc acaaccctaa tacacccata ttattgccta ctgtcacctt 3360 agaaatagca gatgttcatg gtcaatttca aagagcacga gccttattgg attgtggcag 3420 tatggtaaca ttaattactc agcgcatggc tcgaacttta caactacccc taaagaatac 3480 ttcattacaa atttcgggag tgggaaatca gcgcacaccg tactccaagg catcaatcaa 3540 aattgtatgt cgacccactc atagcgaaac acctacgata acggctacag ctcatatttt 3600 aaagcatgtt actggttatc tgcctctagg gaaggtacag aatatatcac atatggtaga 3660 ccagtcaacg atccccttag ctgatcaagg ataccataca ccagctccaa ttgatctgtt 3720 gttaggctcg gatatcctcg gacaggtttt agatgggact aaagtctcat tggggcccgg 3780 caggccaata gccttcggca cgatatttga ttttacattg ttaggaccca tacacgattt 3840 gtatactgct cctgtaagca aaactgaatc ggctcacatt gtcagcgcac aaccagaact 3900 cgaggtatct acacatgaca ttcgcaagag ccttgagaag ttctgggaat ctgaagagcc 3960 ccaacttcat gttgaaacta cccctttgca agatcagtgt gaagaaatat tccgcactac 4020 cactactcgt aaaccctcag gtcaatatac ggtaacttta ccctttttac ataatatgcc 4080 cgaactagga agtacccatg ccatagcatt gcgcaggttt cttaatcttg aaaagaagtt 4140 acaagctgac ccatatctcc gtgtgaaata cattgatttc atggcagatt atcagaagct 4200 gggtcacatg tccccatgcc acccgagtac ttttgctcat aaacctcact tctatatccc 4260 tcatcatggc atcttcaagt cgggatcaga caagctacgc actgtttttg atggtagctg 4320 taaaagctca aatggtgtca gcctaaatga ctgtttacac actggtccag ctcttcaaca 4380 ggacattgtc gatattatct tgtcgtttag aactcatccc gtagttttca gcactgacat 4440 ttgtatgatg ttcagaaaca tactcataca tcccgatcag aggcgttacc agctcattct 4500 ctggcgctct tcgccagatc agcctctgct tacatatgcc ttaaacactg ttacatacgg 4560 gttacgatca agcccatatc atgctatacg cactttaatc caattagcgg atgatgaagg 4620 tcatcgctat cctgcagctg ctcaagtgct gcgcaagtcc atattcgtcg atgatattct 4680 cactggtcac gattctgtag taaaggctca agctctccaa aacgacttga ttaacttact 4740 agccttaggt ggttttcagc tcagtaaatg gacctcaaac tgccctcagc ttctcgaaag 4800 atttcccgat gatcagtgtg acatgcccaa gaactttgac attagcccag actccaactc 4860 cataaaagta ttagatatac agtggattcc tcaatcggat gagttaactt atcgtatttc 4920 tctcccctca ataaaacaaa ttaccaaacg ctcaatcctc agcacagtcg cctcttttta 4980 tgaccctaac ggttgggtta ctcctgttat cttccgcgca aagcttctcc tacagaagct 5040 gtggcttcta aagttagaat gggacgaaca cacccctgtt gaagtgcaga cagaatggaa 5100 ccgcatatca caagatctgc cccaattgtc aacccttcgc ttgcctcgcc ttatttgtac 5160 aaacaaactt aagtctactt actcgcttca cgggttcgca gacgcctcag aggcaagttt 5220 tgctgccgtt atattttatc tccatgaact cgatgagcac agatgtgtca acgtatacct 5280 cgtaattgct aaatcacgag tcgcacccgt tagaaatcgt cttaccatcc ccaagatgga 5340 gttatctgct gcaagtctct taacacagct tatgatacgt gtctcgactc aattgttgtc 5400 ccatattacc ataaagcaac acatatgctg gtcagacagt accattgtcc tagcatggct 5460 aaacacacct cctcatagac ttcaggtctt tgaagctaac agggttgcca agataacatc 5520 caatccaatc acctccacat ggaaacatgt tcctacaaac ctgaaccctg cagactgtgc 5580 gagcagggga atgtctgccc agtcactatc ggctcatgat ctttggtggt ctcctagctg 5640 gctgaaggag cctcctgaca cctggcccaa aatgcctcct gctctaggcc accatgcatt 5700 accaggtctg aagcccaaga aagtaccagc tcatatagca gttcctgacc tcgaccttga 5760 cttgcttact cgattcagtt ctctcgacaa gctcgttgga gtgacagctt gcataaaacg 5820 ctttattttc aattgtagac acaactcaac tgataggtgc tctggtccct tgacagttgg 5880 agaacgtaga gatgctctgt tattctgggt tcgctctgtt caacacaatg agtttgcgga 5940 agatatttac cgtcttcagg taggcaagat ttgtaccgtt cgacttcagc gtctttcacc 6000 cttgatgaaa gatgatctct tgagagttgg tggtcgcctc actcatgcgc caataagata 6060 cgatgctcag catcctctcg ttctacccag ctctagccca ctagttgatc tcatcataga 6120 ccactatcac agaataaact gtcatcctgg gactgacaca ttacacgcta tcctccgtca 6180 gcagttctgg atactctcag cacgtcgtgt catcaggcat cgcgtgtaca aatgcattag 6240 atgtttccga tatcgcgctc aggccagggc tccgttgatg gctgatttgc ccgcagacag 6300 agtcacaccc cagccagtat ttagtcaggt ctctacagat tttgctggcc cgtttctcat 6360 taaatctagt actttacgta acgccaaact catgaaggca tacttttgtg tctttgtttg 6420 cttatctact aaggcggttc acttggaact cgtttcgtca ctcagcactg aagcgtttct 6480 tgctgcaatg caacggtttg tatcgcgtcg cggcactcct actcttatcc gttctgactg 6540 tggcacaaat tatgtaggag ctaagaatca tctcattgag gtacaagatt ttctcgcgca 6600 aaataatgac acaatcacgc ataggctcgc taatcagcac ataacttggc tgctccaacc 6660 gcccacggga ccctggttca cgaaaccctg gccttcacga aattgcggtc aagagcacta 6720 agaagttact ttatcatgtt ataggtgaac agcacctcac atttgaagaa ttttccacac 6780 ttttaactcg agtcgaggca gtacttaatt ctagacctct ctgcccactt tcgtcagatc 6840 cctccgattt tgagccacta acagctggcc actttcttat tggctgtcca ctcaccgctc 6900 ttcctgagcc atcttttgat gataggccat tatcagctct taagcgtttt cagctcatcc 6960 aagctctctc tcagcgtttt tggactctgt ggaccaaatc ctatctccat actctccaaa 7020 cccgctccaa atggctttcc acatcctcac ctcctcaagt tggagagctc gtcctcatca 7080 aagaggacaa tcagcccccg ttgcaatgga aacttggaag gattctccgc acgttacccg 7140 gcaaggatgg agtgatcaga gtcgtggatt tggacacgaa gcatggaagt ctgcggagac 7200 ctgtgttcaa gttagccagg ttgccgctgg attcagcaca tgaagaagtt cagccccggc 7260 cgccccagga 7270 // ID BEL-2_BMa-I repbase; DNA; INV; 5789 BP. XX AC AAQA01000514; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Brugia malayi genome: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_BMa_; KW BEL-2_BMa-LTR; BEL-2_BMa-I. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-5789 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Brugia malayi genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AAQA01000514; Positions 2307 8095. XX CC Positions [4317-4892] - Integrase core CC 'GCAAA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3858..5789 FT /product="BEL-2_BMa-I_3p" FT /translation="MIRTTVRVLKFLKKLSKGKLTWLSSVSERNPIIPEDY FT NLAAELLIKQSQSEGITAEEVEKYNLHYADRYWRFQSRLQLPNDREYSSQT FT IYLPRHNRITELIIQHHHETTHHGGIPQTISRIRKLYWIPKGRAEVKKVLN FT RCMSCKRWKAKPFKLPPMPNYPQERITESRTFGRIGLDYLGPITVKIQLEK FT AKRWIALFTCFTTRAVHLEMVDDLSAESFLTALRRFVARRGYPELILSDNA FT TQFQAVFQAIKTQESQLSNFLAEKGIKCKTITPRAPWSGGIYERLIGLTKR FT SLRRAIGRRLLKEGELITLIIEVESILNTRPLTYVNFDDSIILRPIDFIIP FT NASLSILNSNNDREDEFSPYSLTTQERLLQHWSNTLETLKTFWEIWKDEYL FT NSLKDRYQLEHKSPRNVEKRQPIEGEVVLLDEPHAPRGTWKLARIKKLNVA FT NDGYVRSAQIETSSGKLLNRPVSTLYPLEVTPEEQSTPENPANPSERAEPI FT IQNRNTTTNTKISSESGEEQPIAHRTRSQTNLQKREKSNFVLTNLLLTVAI FT SIILQAAETKECKWTSGVPFNMPQKWNCKTIETRNATPIRITVYTHVHVKI FT PTTIPLHLIQTCTTHNQGLESVEDHNVLSYKPQIYPLEYFSLFPRE" FT CDS join(1032..2102,2106..3383) FT /product="BEL-2_BMa-I_1p" FT /translation="MENQSPIKGKPRPRHNPNETSALTTSVPEKPIIKALS FT RPCALCNKNHWDEECQIYPTLKQRLQRLKKLNACLNCFKIGHMAARCKNKK FT RNCFHCKGQHNSALCPNKFKEMPNKPEDPDSTTTTNIIVEKEKKRDKRILL FT LCKQINVFNPTNPNTQQRTLALFDVGSQLSFVSKNLAKQLMLNETDEGKLN FT IASFGKKDPNTHLTTRIEIGIKIGKDRTVLLEANTVDYLTNKLQVVNLSES FT DIRSLKGKSNKLISTVNGSNRDILIGANHFFQFINFDKAENLQSGFTLLHT FT KVGPILAGSGYTKKISPNSRMSSQIVYAANTINVPDLDQFWKLELIGIQDK FT PEINDDEEALKQFKTIRKSDGRYQVAWPWKDTKLSDKLSDNFELCLGRLQS FT LIKRLHRNQHLLTTYNETINEQLRSNVIERVSHFTNHQGIIHYLPHHEVVK FT SDNTTTKVRIVYDASAHLRGRKSLNDVLYRGPITLPDLAGILIRLRTMKII FT IIADVEKAFLQLELLPTERDCTRFLWVKDIQKQVDRDNLICYRFKRVPFGV FT ISSPFLLSATLNYHLEHYETETAREIKKNLYVDNIILSASHEKEALNKYEE FT TKSIFKDASMNVREFLSNDQTFNEQLPKSDRARMSQIKKILGIKWNPHKDH FT IQITLNPWTEQTTTKRTILRFIASQYDPLGFLVPLIVPFKIFLQELWRKNI FT HWDQPLDNQLIETWNNLVVECPTHIKEIPRFTIDLSQQITFHVFTDASTVA FT YAAVVYAHQNTHTSLLFAKSRLAPIKGMTIPN" XX SQ Sequence 5789 BP; 2104 A; 1293 C; 1098 G; 1291 T; 3 other; aacttaatgg tgccacgagc cgggaacatg tcatccatca tcatcagaac cgcagaccaa 60 acaaaacgaa gattgtgtga agtcctcaac gaaatcaagc gtctgaactt ggattcggtc 120 ggccaaatga caacaaccga cgaaacaatc cgtcttcatc aggaacggag aaggatcacc 180 caagaaaaaa tcatgcatat ccaaatatac atagaggcgt tggaatcggt caatacagag 240 tggaaacaat tcatccagca gatttcagtc tcaaccaaaa gaaaagaaga agaagatcga 300 taccttcaga taaacgatga tgaatcaggt attaccaact taattttaca agcaaaacaa 360 actgtagtca ttttaaaaat gtatcaaaat gattcggagt tcatattaca acggttaaat 420 cagtctatgg taagggagga aacaactcaa ccccaaggaa cccacacaag cgggtaaata 480 ccctacagta aatctcccac aactacccac tgcccacatt cagtgggaat ccaagacgct 540 ggagggaatt ttggaatagc tttgatacag ctatacatca acaagcaatt cccgatatcc 600 ataaactaaa ctatctcatc gcatgtctca aaggagacgc cctccaagct atcagaggtt 660 atgacataac tccggagaat tacagtatcg taagaaatgt cctcgtcgaa aagtttgggc 720 agccttacgc catcaaacga gccttgttca ccgaattata ctccatcaaa aagaacgata 780 gagagtggaa agcaacgctt gaggccattg agcgaacact tcgacaattg gaagccatcg 840 gtgagaattt agagcagtcc ggtatcgaaa tagctgtgga agggaaattg ccaacctgga 900 tcctagaaca ggtttacaga agaaagaggc aggaatcgtg gtctgtcacc aaactccgaa 960 acttcctcac ggagcttgca tctgtgaacg aagaagtagt acaaagccag tccctatggt 1020 ttaaagggaa tatggagaat caatctccaa taaaaggcaa gccaagacca cgacacaatc 1080 ccaatgagac ttcagcctta acaacctctg ttccggagaa acccatcatc aaagccctat 1140 caagaccttg tgcactatgc aacaagaatc actgggatga agagtgccag atttatccca 1200 ctctcaaaca aaggctgcaa cgtctgaaga aactaaacgc ttgcctcaac tgcttcaaga 1260 tcggacatat ggcagccaga tgcaaaaaca agaaacggaa ttgcttccac tgcaagggtc 1320 aacacaattc cgcgctgtgc cctaacaaat tcaaggaaat gcccaataaa ccagaggatc 1380 cagactccac taccacgaca aatataatag tcgaaaaaga gaagaaacgt gacaagcgca 1440 tcctactctt atgcaagcag ataaatgtgt tcaacccaac taatccaaat actcaacaaa 1500 ggactttagc cctattcgac gttggatcac aattatcctt tgtttcaaag aacttagcca 1560 aacaactaat gcttaacgaa acagatgaag ggaaattaaa catcgcttca ttcggcaaaa 1620 aggacccgaa cacgcaccta accaccagaa ttgaaatagg gataaaaatc ggaaaggacc 1680 gtacagtgtt gctcgaagcg aacaccgtgg actatttaac aaacaaacta caagtagtca 1740 acttaagtga aagcgacata cgatccctca aaggaaagtc caacaaattg atttccacag 1800 tgaatggaag caaccgggat atattaatag gagccaacca cttctttcaa ttcatcaatt 1860 tcgacaaagc agagaatctg caatcgggat tcacactgct ccacactaaa gtgggcccga 1920 tcctggctgg gtctggctat acaaagaaaa tttctcccaa ttcacgaatg tcctcacaaa 1980 tagtttacgc cgctaacacg atcaacgttc cagatttaga tcaattctgg aaactggaac 2040 taataggcat acaagacaaa cccgagatca atgacgatga agaagcctta aaacaattta 2100 agmaaacgat tagaaaatcc gatggacgat accaggtcgc atggccatgg aaagatacca 2160 arttaagcga caaattaagc gacaatttcg aattatgtct cggacgctta cagtcgttaa 2220 tcaagcgact tcaccgcaac caacatctac ttaccacata caatgaaaca atcaatgaac 2280 agttaagatc caatgtcatc gaaagggtaa gtcatttcac aaaccatcaa ggtattatcc 2340 attacttgcc ccatcatgaa gtggtaaaat cagataacac cacgacaaag gtcagaatag 2400 tttatgacgc ttccgctcac ctcagaggca ggaaaagctt aaacgatgtc ttgtatcgag 2460 gccctatcac ccttcccgat ttagctggaa tattaattcg actaagaaca atgaaaatca 2520 tcataatagc tgacgtcgaa aargccttcc tacaattaga gttgctgcca acggaacggg 2580 actgcacgcg ctttctatgg gtaaaagata ttcagaaaca ggtggacaga gataatttaa 2640 tatgctatcg attcaaaagg gttccctttg gagtgatatc gtcacccttt ttattatcag 2700 caactctcaa ttatcatttg gaacattatg agactgaaac tgcacgtgaa atcaagaaga 2760 atctctacgt cgacaatatc atcttatcag cgagccatga aaaggaagcc ttgaataaat 2820 atgaagaaac aaaatcaatt ttcaaggatg cgtcaatgaa cgtgcgagaa ttcctctcaa 2880 atgaccagac attcaatgag caactaccca agtcggaccg agcacgaatg agtcaaataa 2940 agaaaatatt agggattaag tggaatccac acaaggatca tatccaaata acactaaatc 3000 catggaccga acaaacaaca acaaaaagaa ctattctacg gttcatcgcc tcacaatacg 3060 acccacttgg attcctagta ccactgatag taccgttcaa gatatttctc caagaactct 3120 ggagaaaaaa catacattgg gatcaacccc tcgacaatca attgattgaa acctggaaca 3180 atcttgtggt agaatgccca actcatatca aggaaatccc gcgattcaca atcgatttat 3240 cacaacaaat tacctttcat gttttcaccg atgcatctac agtggcatat gctgcagtag 3300 tatacgccca tcaaaatacg catacgtcgt tactctttgc taaatcacga ttagcaccta 3360 ttaaaggaat gacgatacca aactagaatt actagctatt ctcatcggtg tacgcgcagc 3420 tcagtttgtc atcaaacaaa tggaattaga aaatccaaag gtagtggtat ggtcggactc 3480 tcgctgtgca ctacattgga tacaaaacaa ttctcgtctc ctaccaaaat ttatacagaa 3540 tcgtgtggaa gaaatacaaa tggcgaaatt cgcatatcgg tatatcccaa gcgaacacaa 3600 tccagcagac attgctacca gaggaaccat tccgactcgt ctaataagtt acgagccttg 3660 gtggagcggg gcccacatgg atcaatggta atgaaccgaa ttggcctcat tgggaatacg 3720 acgttccaca agaagaggat aacgaagaag ataagaatat agataagaat atagtagcaa 3780 caacaaccat agaagaaatc atcaatacaa atcgaatcaa tctcctggag gcaaaacgat 3840 tcagtaaatg gacaaagatg ataagaacaa cagtaagagt actaaaattt ctcaaaaaat 3900 tatctaaagg aaaattaacc tggttaagca gcgtatcaga aaggaatccg attatacccg 3960 aggattacaa tctcgctgcg gaactactaa taaaacaatc ccaatcggaa ggtataacag 4020 cagaagaagt ggaaaagtac aatctgcact atgcagacag gtactggaga ttccaaagca 4080 gactacaatt gccgaacgac agggaatact catcccaaac aatatatctc cccagacaca 4140 atcgaataac agaactaatt attcaacacc accacgaaac gacgcatcat ggaggtatcc 4200 ctcaaactat ttccagaata agaaagcttt actggattcc gaaaggaagg gctgaagtca 4260 agaaggtgtt aaaccgctgt atgagttgca aacgctggaa agccaagcca ttcaaattgc 4320 cacctatgcc aaattatccg caagagcgga tcactgaatc aagaacgttt ggacgaattg 4380 gcctcgacta tttggggccg attactgtga agatacaatt ggaaaaagca aagcgatgga 4440 tcgccttgtt cacatgcttc accaccagag cggtacattt ggaaatggtg gacgatctat 4500 cagcggaaag tttcctaact gcattaagaa gatttgttgc gcgacgcggc tatcccgaac 4560 tcattctttc ggacaatgcc acccaatttc aggcagtttt tcaagccata aaaacgcaag 4620 aatcacaact ctctaatttc cttgctgaga aaggaatcaa atgtaagact ataacaccga 4680 gagcaccatg gagcggtgga atctacgaaa gattaatcgg cctaacaaaa aggagtttaa 4740 gaagagctat cggtagacga ctgcttaagg aaggggaact aatcacactg attatcgaag 4800 tggaaagcat tcttaatact cgccccttaa cttacgtaaa tttcgacgac tcaattattt 4860 tgcgaccaat agacttcatc attcccaatg cttctttgtc tatactgaat agcaataacg 4920 acagagaaga cgaattctca ccttatagtc taaccacaca agaaaggtta cttcaacact 4980 ggtcgaatac cctagaaaca ttgaaaacat tttgggagat atggaaagac gaatacttga 5040 atagcttaaa ggatcgatac caattagaac ataaatctcc aagaaatgta gaaaagagac 5100 agccgataga aggagaagtg gtactgttag atgagcctca tgcaccacgt ggtacatgga 5160 aactagcaag gataaagaaa ttaaacgtag ccaatgatgg gtacgtcagg agcgctcaga 5220 tcgaaacatc gtcaggaaaa cttctaaatc gacctgtaag cacgttgtat ccacttgaag 5280 ttactccaga agaacagtct actccggaga accccgcgaa cccatcagag agagcagaac 5340 caattatcca aaaccgaaac accacaacta acacgaagat ttcaagtgaa agtggtgagg 5400 agcaacctat tgctcatcga acacggagcc agacaaattt acaaaaaagg gagaagtcaa 5460 atttcgtttt aaccaactta ctgttaacag tagcaatatc gataattcta caagcagcag 5520 aaacgaagga atgcaaatgg acatcaggtg ttccatttaa catgcctcag aaatggaatt 5580 gcaaaacaat tgaaactcgt aacgccacac cgattcgaat tacagtctac acgcatgttc 5640 atgtgaaaat acctaccacc atccctttac atttaattca aacttgtact acccataatc 5700 aaggtttaga aagtgttgaa gatcacaatg ttttatccta taaaccccaa atctatccat 5760 tagaatactt ttcgctcttc cctcgggag 5789 // ID CR1-73_AAe repbase; DNA; INV; 5095 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-73_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5095 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1161-1161 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 161..1978 FT /product="CR1-73_AAe_1p" FT /translation="MTKKCDGEPCXNGSARGGRAACSKCHKQFHLKCVNLT FT GNQYKAVRECPGAFWFCVVCRNGDITSGSRSESYETDNALILQRITSVLKL FT VGIQIDVSRSICRAISSLYSRPKCCNSATKVVSPARPTQNDSRNFLDELER FT MQFEFTEVFNSFINNDPPSSTTGGNKRDRTSSSTSSFPRNDKRMRVDVSVG FT TSDNTDIIVPLLADMDEFSRNNPPDVESTAAVQQLATTQKSNAATTNQAIA FT TTNATSATTTNVASATANATTNATAFTSAITTTTAAVTYPPTINSQVSTIT FT AVNSTASLTTNTNQATSNAPTALPHRDHAASTIYFDRNTAENASSTAPCST FT LPQVPLPHRVMFSTSTVSTSTNATSSRIPSGQHAMPTHMATSESDNNQNAS FT LAHYLTIPPQQSNVVFNSSQNTAITGNYRLNTDIMPPPVAVLSVAPPSTPL FT KWYYLTRFQPHETADNIISFIVSKTKCDPTSIICHKLVRDNRNANFPLTFI FT SFKLSVPATIESFVTTPHFWPTGISIKPFFARNTSNDKPFLGTARQRFLVQ FT NTPSPRYRAALSTRHQQQMSSPLPSPFVNQNSGILPHMSNQQQLIAAQSQL FT NRITSTMV" FT CDS 1681..4878 FT /product="CR1-73_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KLCHNATFLAYWYFHQAFFCQKHFERQAFLGNSSTTF FT PRSEYPVTALPRRPLHQTSTTNVFTASIPVCESEQWDIAPYVQSTAAHCSP FT ISTQQDNFDNGVNRKLLIYYQNVGGINTRVAEYHLACSDSSYDAYAFTETW FT LNDQTISSQIFGTSYNVFRLDRNLNNSSKKSGGGVLLALRANLKARQLSIP FT SSEAVEQIWVAVSFESHTXFICVLYIPPDRTQDLTLIDQHINSLYWVTTQM FT KINDSLIIMGDFNLPTIKWKLSPSNYLYPDPSHSSISSSTVKLLDSYNVAR FT ISQLNTVLNNNDRMLDLCFGSFEGDVRYVLMEAPSPLVKTSVHHPPLLIEA FT NGAISSTFKEPVEMLYYDFKHGDYRSMNTFLSHINWHDFIDLDLNSSTVTL FT SNILLYAIDQFVPKRSRKSSPQPPWSNNRLKQLKRLKRSLLRKYSKLKTPR FT AKQNYHAANSRYKTLNKNLFLSHQRAVQQKLRHNPKGFWNYVNEQRKESGL FT PTVMFKGNVECSSTEGICDLFLNQFSSVFTNESLQEHQINKAASAIPVHAP FT VGDHPIIDAXTVADVCSSLKASTSSGPDGVPAFILKKCSSSLSLPLSRLFN FT LSLQVGTFPNHWKKSYIFPVFKKGNKQEVSNYRGIAALCAISKLFEKVVYD FT FLFHNCKLFISEHQHGFMPKRSTNTNLVLYTSFIAQTLQKGHQVDSIYTDF FT SAAFDKINHRIAVAKFNRLGFSGSLLSWLQSYLSCREMAVKIGDTTSNFFP FT ATSGVPQGSHLGPLIFLVYLNDVHLQLKSLKLSFADDFKLYWAVKDVKDAQ FT FLQNQLDVFTEWCDTNRMDLNASKCSVISFSRKQSTFHFDYRIRSTTLKRE FT SVVKDLGVLLDIKLTFKEHISYITSKASRTLGFIFRVAKHFKNIQCLKSLY FT CTLVRSILECSSVVWSPYYQNSILRIESIQHKFLRFGLRHLPWSNPHNLPS FT YEDRCKLIGLELLSVRRDVSKSLFISDLLQSKIDCPSLLSQLTFNIPYRRL FT RSCTFLFLQSARTNYGHNEPFISMCRIFNRCSSEFDFHLSRYTLRKKYSQI FT LAASVSRSS" XX SQ Sequence 5095 BP; 1438 A; 1272 C; 878 G; 1502 T; 5 other; ttacgcgaac aaaccgattt attatctacg aagactcagt tctgttttgc attgctttca 60 accatcataa ttcattcgct ataccatacc gccgcctgcc gttcattgcg tgtactagca 120 cgacagctcg actgtttwca tctgcatctg gtttcacacc atgactaaaa agtgtgacgg 180 tgaaccttgc ttmaacggat ctgctcgagg cggacgggct gcttgctcca aatgccacaa 240 gcaatttcat ctcaaatgtg tcaatctgac gggaaaccaa tataaagctg ttcgtgaatg 300 ccccggtgcc ttttggttct gtgtcgtatg tcgtaatggc gatatcactt ctggaagccg 360 ctccgaatcc tacgaaactg ataatgcgct tattctccaa cgaatcacat cggttctgaa 420 actcgtcgga atccaaatcg acgtatctcg ctctatttgc cgtgctatca gttcacttta 480 ctcgcgaccg aaatgttgta attctgcgac taaggtcgtg tctcctgctc gcccaacgca 540 gaacgattcg aggaatttcc ttgacgaact cgagcggatg caatttgagt tcaccgaagt 600 cttcaattca tttattaata atgatccacc ttcttctacc actggaggca acaaacgtga 660 ccgcacctcc agctctacgt catcatttcc gaggaacgac aaacgaatga gagttgacgt 720 ttcagttggt acttcggata acaccgatat tatagtcccc cttcttgcag atatggatga 780 gttttcgcgc aacaacccac cagatgttga atctaccgcg gcagttcaac aacttgccac 840 gactcagaaa tctaacgctg ccaccaccaa tcaagccatc gccaccacca acgccacctc 900 cgctaccacc accaacgtcg cctccgctac cgccaacgcc accaccaacg ccaccgcctt 960 tacctccgcc atcactacta ctactgctgc tgttacctat cctcctacaa tcaacagtca 1020 agtttctact atcactgcgg tgaactctac agcaagcctc accacgaaca ccaaccaagc 1080 tacctccaat gctcctactg cccttcctca cagagatcat gccgcaagta caatatactt 1140 cgaccgaaat actgctgaaa acgcatcttc gaccgcccca tgctctactc tgccccaagt 1200 gccattgcct catcgagtta tgttttctac atcaacggta agcacttcaa caaatgccac 1260 ctcatctaga atcccgtcag gccaacatgc tatgccaacg cacatggcaa catctgaatc 1320 agacaataat caaaatgcat cacttgccca ctaccttaca ataccgccac aacaatcaaa 1380 tgttgttttt aacagctctc aaaatactgc aattactgga aactatagat taaacaccga 1440 tattatgcct ccccctgttg ctgtcctttc cgtcgctcca ccatcaacac cacttaaatg 1500 gtactacctt actaggttcc agccacacga aactgcagac aatattatct cgttcattgt 1560 aagcaaaacc aaatgtgatc ctacttcgat tatttgccat aagctagtgc gcgataaccg 1620 taatgccaat tttcctctga cattcatttc attcaaatta agtgttccag caaccattga 1680 aagctttgtc acaacgccac atttttggcc tactggtatt tccatcaagc ctttttttgc 1740 cagaaacact tcgaacgaca agcctttctt gggaacagct cgacaacgtt tcctcgttca 1800 gaataccccg tcaccgcgct accgcgccgc cctctccacc agacatcaac aacaaatgtc 1860 ttcaccgctt ccatccccgt ttgtgaatca gaacagtggg atattgcccc atatgtccaa 1920 tcaacagcag ctcattgcag cccaatctca actcaacagg ataacttcga caatggtgta 1980 aatcgtaaat tgttgattta ttatcagaac gttggtggga ttaacactcg tgtcgctgaa 2040 taccatctcg cttgttcgga ctccagttat gatgcatatg cgttcactga aacgtggcta 2100 aatgatcaaa cgatatctag ccagattttc ggtacttcct acaatgtttt tcgattggat 2160 cgtaatctta acaatagtag taagaaatca ggcggaggag tgctgttagc tctgcgcgcc 2220 aatctcaaag ctcgtcagtt atcgattcca agttctgaag ctgtggagca gatttgggta 2280 gctgttagtt ttgaaagtca tactmttttc atttgtgttc tgtatattcc acctgaccgt 2340 actcaggact tgactttaat tgatcagcac attaattcac tatattgggt caccactcaa 2400 atgaaaatta atgatagcct gataattatg ggtgacttta atctgcccac catcaaatgg 2460 aagctgagcc cctcgaacta tttgtatcca gaccctagtc actcctcgat tagctcatct 2520 acagtgaaac tcctagatag ttacaatgtg gctcgtatat cacagttaaa tacagtactt 2580 aacaacaatg acaggatgct tgacctatgc tttggtagtt tcgaaggtga tgtacgttat 2640 gtattaatgg aagccccttc acctcttgtg aaaacatcgg ttcaccatcc accgttatta 2700 attgaggcta atggtgctat ttccagcact ttcaaggaac ctgttgaaat gctgtactac 2760 gatttcaaac acggagatta tcgtagtatg aatactttcc tctcccatat caactggcat 2820 gacttcatcg accttgacct caattcctca accgtaactc tatccaatat cttgttatat 2880 gcaatcgatc aatttgtccc aaaaagatct cgcaaatcgt cacctcagcc accatggagc 2940 aacaatcgct taaaacaatt aaagaggcta aaacgatctc tcttacgcaa atactcaaag 3000 ttaaaaactc cccgtgctaa acaaaattat catgctgcca attctcgtta caaaaccttg 3060 aataagaatt tatttctttc acaccaacga gcagttcaac agaaattacg acataatcct 3120 aaaggattct ggaattatgt caacgagcaa cgtaaggaat ccggtcttcc tacggtcatg 3180 ttcaagggaa atgttgaatg ttcatctaca gaaggcattt gtgatttgtt tttaaatcaa 3240 ttttctagtg tgtttacgaa tgaatctttg caagaacacc agatcaataa agctgctagt 3300 gctattcctg ttcacgctcc cgttggtgat caccctatca tcgatgcaga macagtcgca 3360 gatgtttgct cttccttgaa agcttcaaca agctccggtc ctgatggtgt gcctgctttt 3420 attttgaaaa agtgctccag tagtctttct ttacctcttt ctcgtctctt caatctatcg 3480 ttacaagttg gaaccttccc taaccattgg aaaaaatcat acattttccc tgttttcaag 3540 aaaggaaata agcaggaagt aagcaattat cgtggtattg ccgctctctg tgccatctca 3600 aaattatttg agaaagtggt gtatgatttc ttattccaca actgcaaact tttcatctcc 3660 gagcatcaac atggttttat gccgaaacgc tcaacgaata ccaacttggt tctctacact 3720 tcgtttatcg ctcaaacact gcagaaaggt catcaggtag actcgattta tacggacttc 3780 tcggctgcat ttgataaaat taaccatcgt attgctgttg ctaaattcaa tcggctcggt 3840 ttttctggat cattgctcag ctggcttcaa tcttacctga gttgtcgtga aatggccgtc 3900 aaaattggag acactacatc taacttcttc ccagcaacct ccggagtgcc tcaaggtagt 3960 cacttggggc ctctaatttt tttagtatat ctcaacgatg ttcatttaca actcaaatcc 4020 ctaaagctct catttgctga tgacttcaag ttgtactggg ctgtcaaaga tgttaaagat 4080 gcacaatttc ttcaaaatca gctcgatgtg ttcactgagt ggtgcgacac taaccgaatg 4140 gatctgaatg caagtaaatg ttcggtgatt tcattttccc gcaaacagtc aacgtttcat 4200 ttcgactata gaatcaggtc tactactctt aagagagagt cggttgttaa ggacttagga 4260 gtgttactag acattaaact cactttcaaa gagcatatta gttacataac atckaaagcc 4320 tctagaacac tcggctttat tttccgtgtc gcaaaacact ttaaaaatat tcagtgcctt 4380 aagtctcttt actgtacact agtgagatca atattagaat gctcttcagt tgtctggtcc 4440 ccttattacc aaaatagtat tcttcgaatc gagtcgatac agcataaatt tctgcgcttc 4500 ggactgcgtc atctcccttg gagtaatccc cataatttac ctagttatga agaccgttgt 4560 aaacttatag ggttagagct gttaagtgtt cgtcgcgatg tatccaaatc tctgtttatc 4620 tccgatcttc ttcaatcaaa aattgattgc ccttcgttgt tatcacaact taccttcaat 4680 attccatacc gtcgtcttcg atcctgcaca tttctgtttc ttcagagtgc tcgcactaat 4740 tatggacaca acgaaccgtt tattagcatg tgtcgaatct tcaaccgttg ttcgtctgaa 4800 ttcgatttcc atctctctcg ttacaccctc cgtaaaaaat atagccaaat acttgcagcc 4860 tcagttagtc gaagtagtta atctttgtca gatagcttag gatataacta gttgtatttt 4920 agatgtaatt atagatgtaa gccaaaattg tatcatttgg aaaccatgta aattctgttg 4980 atacgaaaag aagagaaggt tttgcgccca tttgagaaag acgctaatgt tgatctactc 5040 aaatgggctt ttccctactc caaataaaga aataaagaaa taaagaaata aagaa 5095 // ID MuDR-2_TV repbase; DNA; INV; 2997 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE MuDR DNA transposon from Trichomonas vaginalis. XX KW MuDR; DNA transposon; Transposable Element; MuDR-2_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-2997 RA Kapitonov V.V. and Jurka J.; RT "MuDR DNA transposons from protozoans."; RL Repbase Reports 8(12), 1812-1812 (2008). XX DR [1] (Consensus) XX CC The MuDR-2_TV consensus sequence was derived from multiple CC alignment of 10 copies that are <1% divergent from it. MuDR-2_TV CC copies are usually flanked by 10-bp TSDs (sometimes TSDs are 9 or CC 11 bp long; several copies are not flanked by TSDs). MuDR-2_TV CC contains imperfect 30-bp TIRs and codes for a 559-aa MuDR CC transposase. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 1205..2881 FT /product="MuDR-2_TVp" FT /note="MuDR transposase." FT /translation="MQHPPPDPPCEVDQAIGYLEYSRFVIAAELHGNIRFV FT KDPQKLSIGTGVLIYHRCEYEGCPAGFKFIKNFDNYIFKSANLTHIHSGPP FT PQHKNTPTSGTYRAWIKKFLMNHGSPLNATQEVNKTLEIPKDHTCIHMTMK FT QTAINQLKYYLQQKDLMRSLPDISLNQFQSLQNYVNQWEDQHDDLIYFDIQ FT GEDTDFPDKLVFIYSDNDMISQIHEKPPIYHLDSTFKLIIHGFPFYVLATK FT FANTHSIPLCYFIIYPDNSENISFCLSKYFETTHTEPEFIMSDCALNIFNG FT IRNSFPECNIFWCALHVIRALKKNLSKINDEEIRSEVEKFMNILCYYRDCT FT EEDAAKMYKEHIIDKIQDQLEFNQYFTRQWDIHKQQWIAAARPNELTVVNN FT VSESLFKKIKYHDFGCVKNQRIDVFVKNLLEEVAPNYFYRIKNDLLQTGFI FT PSIRIREPRLTDYKTKLKREVQSRLYQVLNFVQNQEANLNPLRFLLENAEE FT KLSQINEKIQNSITYLRSFEIPNELLDTYMLEINEFCYNSPGYILEHFEEF FT LEDFKEKNNLID" XX SQ Sequence 2997 BP; 1043 A; 504 C; 426 G; 1024 T; 0 other; gaaacaataa gaattaccga aaaagtgaca aaacatcaaa aacagggagg gggtctcaat 60 ttttattact tatattattt tacaatattt atataaatac tctttataat atattttact 120 ataaaattat acctcaaatt caaatattta aaatattatt tacataatta tctcaaaatt 180 atatttttat atattatttt tcatatgtat aaatcatata atctatatct atgcaaatct 240 catctaaaat tttttgaaac aaaccgctac cattttataa tggtccatct taaattcatc 300 ataaattctg gtacatgtta gtaccagagt cttcaaatct caaacactct ggtaccaaca 360 tgtaccagac gcgagtttcc aatgccctcg tacgattttc tattaagatc ttcttgaatg 420 agaaggtctt catagaaact cgtacgaggg cttctctttt aaatactctg gtacctgctg 480 gtatcggagt cgactttaca gccctttccc caattttgaa gctggggaaa gttttcaagc 540 aatgaaaggg aagctctcgc tatgaggtga aaatctagaa tttttacctc atagcgagta 600 atcctttgtt aattgcttga aaactttctt caaattatca aaatttcgag gaaagtttcc 660 ttgcttcaaa gtagagatgg agccctttga gccactgcat ataggtcagt gactcaaaat 720 actccatctc tactagaaat cgccaaaatt tgggtcgcaa tctttgaaat gcgacgcaag 780 ttttggcaat ttcgcgaggg acttgctcca tttggtgcat agattcacgc tctatgcacc 840 agatggaaca attctttcgc tgacagcggg aaactttgtc tgaatagttt aacattcaga 900 caaagttttc cgctgtcaga gagggaatgg aagggtacaa tcatagccaa caattcttcc 960 ttttaaccaa agataattga ttcgtatgta aaatattaag ctaaaatttc tctttctttt 1020 agtacacatt cttccatttc gtgtacatgt atcatatatt aagtactagt tatctattct 1080 atatccttta tcccaattta gatactcaaa gataattctt tttcagaaga aaaaaaatat 1140 tttctattaa accgttattt tttcaatgtg agttttcaat tttatttttc ctttttgctg 1200 ctcaatgcaa catcctccac ccgatcctcc atgcgaggtt gatcaagcca taggatattt 1260 agaatattct cgttttgtta tagctgcaga gctgcatggt aacataagat ttgtcaaaga 1320 cccacaaaaa ctatcaattg gaacaggagt attaatctac caccgatgcg aatacgaagg 1380 ctgcccagca ggctttaaat ttattaaaaa tttcgacaat tacattttta aatctgcaaa 1440 cttaactcat attcactcag gaccaccacc acaacataaa aatacaccta caagtggtac 1500 ttaccgtgct tggattaaaa aatttctcat gaatcacgga agtccactta atgcaacaca 1560 agaagtcaac aaaacgcttg aaatacctaa agatcatact tgtattcaca tgactatgaa 1620 acaaactgca atcaatcaat tgaaatatta tttacaacaa aaagatctta tgagatcttt 1680 accagacata tctttaaatc aatttcaatc tttacaaaac tatgttaatc aatgggaaga 1740 tcaacatgat gaccttattt attttgacat tcaaggtgaa gatactgatt ttccagacaa 1800 attggttttt atttactctg ataatgatat gatctctcaa attcatgaaa aaccaccaat 1860 ttatcactta gactctacgt ttaaattaat tattcatgga tttccttttt atgtattagc 1920 taccaaattt gctaatactc attctatccc tttgtgttat tttataatat atccagataa 1980 tagtgaaaat atttcttttt gcttaagtaa atattttgaa acaactcaca cagaacctga 2040 atttatcatg tctgactgtg cactaaacat tttcaacggg attaggaatt catttccgga 2100 gtgcaatatc ttttggtgcg cgctacatgt aatacgcgca cttaaaaaga atctctcaaa 2160 gataaatgat gaagagatta gatcagaagt cgaaaaattt atgaacattt tatgttatta 2220 tcgtgattgc acagaagaag atgctgcaaa aatgtataaa gagcacatta ttgataaaat 2280 tcaagatcaa ttggaattta atcaatactt tacacgtcag tgggacattc ataaacaaca 2340 gtggatagcc gctgccagac caaatgaatt aaccgttgtg aataacgttt ctgaatcttt 2400 atttaaaaag ataaaatatc acgattttgg ctgcgttaaa aatcaaagaa ttgatgtttt 2460 tgtcaaaaat cttttagaag aagttgcacc caattacttt tatcgtatta aaaacgattt 2520 acttcaaaca ggttttattc cttcaattag aattcgcgag cctcgattga cagattataa 2580 aacaaaattg aaaagagaag ttcaatcacg tttataccaa gtattaaatt ttgttcagaa 2640 tcaagaagcc aatttgaatc cattaagatt tttattggaa aatgctgaag aaaaactttc 2700 tcaaataaat gagaagatac aaaattcaat tacatattta agatcgtttg aaattccaaa 2760 tgaattgtta gatacgtata tgttagaaat taatgaattt tgttataatt ctccaggtta 2820 tattttagaa catttcgaag aatttttgga ggatttcaaa gaaaagaata atttaataga 2880 ctaaaaaatt tattattttt atttattgtg aattttaaat tgtcactttt taagtgttac 2940 tccctattga aagtgtcact ttttgggtgt cactttttcg gaaattctta tatactc 2997 // ID Kolobok-3_HM repbase; DNA; INV; 2649 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 2) XX DE Kolobok-type family. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2649 RA Jurka J.; RT "A distinct subgroup of Kolobok-type DNA transposons."; RL Repbase Reports 8(12), 1819-1819 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(347..1408,1318..2157) FT /product="Kolobok-3_HM_1p" FT /translation="MVKADRIKVRTKRAGKRKKIFNRKVISSSVSVNNVNF FT PSLSVNIESPVNSQTDIFDTPSSSLRKXEAVISSTPQKYTNTKNKVLTGYR FT IIDCSILSDVISVLSCPTCFQTTLAIIENKSKKQGLACELSIICLKCKYQN FT DFYASKLVNKKGNFDINNRTVYTMRTLGLGHSGIERFTTLMNMPKPMTPKN FT YDKLVLKITNITEEVAQETMTDAVSDLRLQCQNDNEILDVGVSCDGTWQRR FT GFSSLNQKLSKKXPTAYVQWXNSHICKMNYVGSAGGMECEGASRIFQRSIK FT KHKLQYINFLGDGDSKSYNSVKDVYPSTQVNKLECVGHYQKRVGTRLRKLK FT KKLRDWVVVVDIRMCWTLPEACWNQTPKIEKKVKGLGGRGRLTDATIDRLQ FT NFFGVAIRQNTGNLIAMKSAALATLFHVASSKANNWHYPHCPTGVSSWCKF FT NRDKANSTNTYKPGPGLPLEVVYKIRPVFEELTKEDELKKCLHGKTQNANE FT SFNGKIWDRIPKTKYVSLTLLKFGVYDAVANFNIGMKSSILIYEKLNMVPG FT FYTLNGSNDINKTRIKWSIYKLNSHNKLHRQKLRAKKLKKNDKTLKKKVKH FT MKQEVFNIKYSLXYNVTYICRSFIVYFHFLCVFSV*" XX SQ Sequence 2649 BP; 926 A; 395 C; 451 G; 871 T; 6 other; ttaaggggta tactccccca gaaattttta aaaaatcaaa aaaaaaaaaa tgatttaaac 60 aattaaatat atgggggttt cagaaatgtt aatsgtttta cctaaaaata ttctaaatac 120 caaaattttt cgtttttgag agttgtacaa aaaagagaaa attttttacc cctaaaaaaa 180 tcacttgtaa accaaaacaa cgtattgatt tcacaagtta aaagcatttt tggcttcttg 240 ttagacttgt tagcacttta tttttggtat atgaagacta agttcgtttt gttgttttga 300 agcttatttt ttgtcccaga aattttaaat tttttagttt tcagacatgg ttaaggctga 360 tcgaattaaa gttagaacta agcgagcagg caaaagaaag aagattttta atagaaaagt 420 gataagtagc agtgtgagtg taaataatgt aaattttcct tctttaagtg taaatatcga 480 atcacctgta aatagtcaaa ctgatatctt cgatacgcca agctcttctc ttcgcaaakt 540 tgaagcagtc attagctcaa ctccgcaaaa atataccaat actaaaaaca aagttctgac 600 aggatataga attattgatt gttcaatttt gtctgatgtt attagcgttt tgagctgtcc 660 gacttgtttc caaacgactt tagcaatcat tgaaaacaaa agcaaaaagc aaggacttgc 720 atgtgaactt tctattattt gtttaaagtg taaatatcaa aatgactttt acgcttctaa 780 gttagttaat aaaaaaggaa actttgatat aaacaatcga acagtatata caatgaggac 840 tcttggatta ggacattccg gaattgaaag gtttacaacc ctcatgaaca tgccaaaacc 900 aatgacgcca aaaaactatg acaaacttgt tttaaaaatt actaatatta cagaggaagt 960 tgcacaggaa acaatgactg atgctgtatc tgatttaaga ctacagtgtc aaaacgataa 1020 tgagattctt gatgttggtg tatcatgtga tggcacatgg cagcgtaggg gattttcctc 1080 attaaatcaa aaactttcta aaaaaratcc tactgcatat gtccagtgga raaattccca 1140 tatttgcaaa atgaattatg taggttctgc aggtggaatg gaatgcgaag gagccagtcg 1200 tattttccag cgatctatta aaaaacacaa attgcaatac attaactttt taggtgatgg 1260 agatagcaaa agttacaata gtgtcaagga tgtttatcct agtactcaag tgaataaatt 1320 agaatgtgtt ggacactacc agaagcgtgt tggaaccaga ctccgaaaat tgaaaaaaaa 1380 gttaagggac tgggtggtcg tggtagattg acagatgcaa caattgatcg actgcaaaat 1440 ttctttggtg tggcaattag acagaatact ggtaatttaa ttgccatgaa atcagcagcg 1500 cttgctactc tttttcacgt tgcatcatca aaagcaaata attggcacta tccacattgt 1560 ccaacaggtg taagtagttg gtgtaaattt aacagagaca aagcaaatag cactaacacc 1620 tacaagccag gacctggtct acctttagag gttgtataca agattagacc tgtgtttgaa 1680 gagctcacaa aagaagatga gcttaaaaaa tgtctgcacg gtaaaacaca aaatgcaaat 1740 gagtctttca atgggaaaat ttgggatcgc ataccgaaaa caaaatatgt ctcgctgact 1800 ttgctgaagt ttggtgttta tgatgctgtt gccaatttta acattgggat gaagtcttca 1860 attctaattt atgaaaaact gaacatggtt ccaggttttt atacactaaa cggttctaac 1920 gacataaata aaacacgcat taaatggtcg atctataaac taaactccca taacaagttg 1980 catcgtcaaa agttaagggc taaaaaactg aagaaaaatg acaaaactct aaaaaagaaa 2040 gtaaaacata tgaagcagga ggtttttaat attaaatata gtctartata taatgttacg 2100 tatatttgta gatcttttat cgtttatttt cactttttat gcgttttctc agtttaaggt 2160 ttttcaatat gccggccaaa attmcaagag aaccgcttga gctttttatc tgaaattttc 2220 aacaaatttt cattttaata aaagctgtgt tttaaacgga acagaatttt gatatactta 2280 tgaataaaac agtatggcta atttaatttt tcaaaaaatg aacttttttc tatcgtcaac 2340 tttagagcac aagtcttcag gcccagaact ttttttaaaa aatctgttcc gtttaaaaca 2400 ctcatacacc tattctaaat gtgttaccaa agttttgttt agcctttttt attagttttt 2460 gctaaaactt ggccagcgca aatcctattt tctgcttatt ttgggggttt tttggggtta 2520 ctgcagaaac cgttgaccaa tttttttttt ttttttttaa atttctatat ttttaatatg 2580 gataatcgtt ttatttcaaa agtttttttt tattgtctat atttaaaaat gggggggtat 2640 accccttaa 2649 // ID Gypsy-20_IS-LTR repbase; DNA; INV; 187 BP. XX AC ABJB010875357; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_IS_; KW Gypsy-20_IS-I; Gypsy-20_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010875357; Positions 7785 7971. XX SQ Sequence 187 BP; 42 A; 33 C; 60 G; 52 T; 0 other; tgtagtgtta ctgaatcatt attgggttta agtggttcag cgaacgtccg ggagtgggag 60 cggtggcgtg ctggtttcga gcgctgctgc tgcgctgcga ctgtatcgtt agggctagga 120 aaataaagag tcagcgagtg ctgtctgaga cacaacggag tacggtcata cttttattta 180 tacgaca 187 // ID hATm-2_HR repbase; DNA; INV; 3194 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hATm-2_HR, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; KW Autonomous DNA transposon; hAT superfamily; hATm group; KW hATm-2_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3194 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1050-1050 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM, hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-2_HR is a young family of hATm autonomous DNA transposons CC identified in the leech genome. The consensus sequence was built CC based on multiple alignment of 12 copies that are ~2% divergent CC from the consensus. TIRs are 12-bp long. This transposon is CC target site specific: hATm-2_HR copies are usually inserted into CC (TA)n. XX FH Key Location/Qualifiers FT CDS 419..2746 FT /product="hATm-2_HRp" FT /note="transposase." FT /translation="MASNSNCGALTTRSMTNIWLIGQLQSELPVNVLPTTG FT DVLRMFFYYHQVEKKTVPESAKLSSDKVMDMWNKARIPTTYHTHVVEKVKS FT VVDDYKLIKKNKNRQSEAQRAREKEFEEKEHMLFDIAHQHAMQLIRIQEDI FT EFLEDQRGSRRMQMSGIDKDLTKKEERTEQRKCKEEERKEREQERMTESTS FT LHLSSSDDSDSEQQDTEDNYEIEIPVYYKKQISEVEDSLSSSASKKTRTVQ FT DMLSSPDVASALDRINLSDRKFTVLAAAIAQASGQDIRDTSLSRSTVHRKR FT QQHRSTIDSSIRQQFQDRDRNPLLVHWDGKIMNDDPHSRTDRLAVVVTGCN FT VEKILGIVKIASSTGQAQANATFQLLKLWDVSEDIIGMCFDTTAANTGTSS FT GACVLLEKLLHRNLLHFACRHHVHELIIGEVFTVLFGPSRGPNIGMFERFR FT AYWPNINQSNHKPLDDDRMNHSLLQMMRSDIVPSLTCFLSADSSYIPREDY FT KELVELCLLVLGYPMQTDGKYHFRVPGAYHMARWMAKVIYCLKMYLFRNEF FT KLTATETKSLTEFCLFATHIYVPAWMLCPIPSDAPVNDLQLLSRIEQYSEI FT NKNVASAATKKLKNHLWYLGTEMVWLSLFSNKVANAEKQLIVEAMTAVDSD FT WSVRGVKYPATELEQVKSKPLHDLVSGTSRAALLSLGMDVAILRETEPDSW FT NDLPLFQKVANVVKSLKVVNDTAERTVALMTNFNQSITKNETELQKLIQVV FT EDNRTRIPDSSKRTLASAQAAAAMF" XX SQ Sequence 3194 BP; 1087 A; 579 C; 652 G; 876 T; 0 other; tagggtggtt cacatttgaa tggggaaaaa aattttttgt ctgatttgag tgcacaccct 60 ctgaaaagtt tccattgggt cctaaaaact tacaaaccaa aaataagaca aatcaaatta 120 aatttagagg ttgctcatgc atctcaaaat tatataaaaa tgtgataatt cattatttcg 180 aaaccaaata caaatataac tggaaaagta tcaaaagttg gttacttact taattcaaaa 240 catagatatt agatatttta aaatgcatta ttattattaa atattctatg gtcaccatat 300 ttgttgtcag caaaaagtaa aataggataa atatctgcat aatttaactg tgtcgattta 360 ctagtaattt gaggtgattt tgaaataccg gtactgtaca tatttaacta cgtataaaat 420 ggcttctaat tcaaactgtg gtgctcttac aacacgcagc atgactaaca tatggctgat 480 tggacagtta caaagtgaac ttccagtaaa tgtattgcct acaacaggtg atgtcctgag 540 aatgtttttt tattatcatc aagtagaaaa gaaaactgtg cctgaaagtg caaaactgtc 600 gtcagataag gtcatggaca tgtggaacaa ggctagaata ccaacaacct accatactca 660 tgttgtagaa aaagtaaagt cggtagttga tgactacaag ctaataaaaa agaacaaaaa 720 cagacaaagt gaagcacaac gagcacgaga gaaggaattt gaagaaaagg agcacatgct 780 gtttgatatt gctcaccaac acgccatgca gctcatccga attcaggaag atatagaatt 840 tcttgaagac cagaggggca gtcgacgaat gcaaatgagt ggcattgata aagacctgac 900 caagaaagaa gagcgcacag aacaacgtaa atgcaaagaa gaagagagaa aagagcgaga 960 gcaggagcgg atgacagaaa gtacctcact acatttatct agttctgatg acagtgatag 1020 tgaacagcag gacacagagg acaactatga aattgaaatt ccagtgtact acaaaaagca 1080 gatcagtgaa gtagaagaca gtctgtcaag cagtgccagc aaaaaaacac gaactgtgca 1140 ggatatgttg tcgtcgccag atgttgcttc agcactggat agaataaacc tgtcagatag 1200 aaagttcaca gtattagcag cagccattgc acaagctagt gggcaagaca tcagggatac 1260 atctctgtca cgttccacag ttcatcgtaa acgacagcaa catcgatcaa ccattgacag 1320 cagtattcgt caacaatttc aggatcgtga cagaaacccc cttctggttc actgggatgg 1380 gaagatcatg aatgatgatc cacactccag aactgataga ctggctgttg ttgtgactgg 1440 ctgtaatgtg gagaaaatct taggaattgt taaaattgca tcaagcacgg gacaagcaca 1500 ggccaatgca acttttcagt tgctcaaatt atgggacgtt agtgaagaca ttataggcat 1560 gtgctttgat actactgctg ccaatactgg gacgtcaagt ggagcatgtg ttttgctgga 1620 gaaactgctc catagaaatc tcttgcactt tgcttgtcgt catcatgtac atgaattgat 1680 aatcggtgaa gttttcacag ttctgtttgg tccaagccgt ggccccaaca ttggaatgtt 1740 tgaaagattt cgagcctact ggccaaatat aaatcagtca aaccacaagc cacttgatga 1800 tgacaggatg aaccactcat tgctacagat gatgcggtct gacatagtac catctttgac 1860 atgttttctt tcagctgaca gctcatacat cccacgagag gattacaagg agttggttga 1920 actctgtctc ttagtgctgg gttatccaat gcagactgat ggcaagtacc atttccgtgt 1980 tcctggagca tatcacatgg ccagatggat ggccaaagtg atctactgtc tcaaaatgta 2040 tttgttccgc aatgaattca agctaacagc aactgaaaca aaaagtttaa ctgaattctg 2100 cctatttgca acccatatat atgtgccagc atggatgttg tgcccaatac ccagtgatgc 2160 accagtcaat gacctgcaac ttctgagcag aattgaacaa tactctgaaa tcaacaagaa 2220 tgtagcaagt gctgccacaa aaaagttgaa gaatcatctg tggtatctcg gtacagaaat 2280 ggtttggctg tcattgttct caaataaggt tgcaaatgct gagaagcagt tgattgtgga 2340 agcaatgact gcagtagatt ctgactggag tgtgcgaggt gtcaaatatc ctgcaactga 2400 actggaacag gtgaaaagca aaccactgca tgatctggta tcaggtacat ctcgtgctgc 2460 attgttatca cttggaatgg atgttgctat cttgcgtgaa actgaaccag actcatggaa 2520 tgatcttcct ttatttcaaa aagtagcaaa tgttgtgaag tcacttaagg tcgttaatga 2580 cactgcagaa cgcactgtgg cactcatgac aaattttaac cagtctataa ccaaaaatga 2640 aacagaactg caaaaactga tacaggtggt ggaggacaat cgaacacgta ttccggactc 2700 ctctaagcgc acattagcca gtgctcaagc agctgctgct atgttttgag tctgacatac 2760 tatgagtgat taacagaaaa acataaactg agtacattaa ttgcctcaag ttaatttcag 2820 tttctcaatc aaagattaac agtactgaac tttcgtttgg cataatcaag ttgcctttac 2880 cttgtgagaa ctgtttgttt agtgttgaat attttgttat tttgagtata aatttgtaga 2940 actgttaaat cttacatgat tttcacaaga atgttgttca ttttactgca aaataaacat 3000 taaacacaaa attgtagcac tgaccaatat tcgcttatat gctaaatatg gtgaaaaact 3060 gaatgtcatt gagcaacctc taaatattgt cagaattgaa tgaaattttg cacacatgct 3120 aactttcagc taaggaaaaa atataggggg tgtgcactta aatattaaaa aaaaattttt 3180 tttgaaccac ccta 3194 // ID CR1-12_CQ repbase; DNA; INV; 3978 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-12_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3978 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 14-14 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 16..684 FT /product="CR1-12_CQ_1p" FT /translation="MIKHTKEIPSLSAAASSRSAPVVSHPYPAHGDRAGGF FT RPGFSGKYSCCPSNSPLDKSXVSSESSSRQPLQNSILXIPSARSKSVRNFS FT DYGGALAPPTTLQLSGPAIVSNLIVSAEVSRRISVAPNSIETASPGRFLLS FT TVGALNSPAAADTIAEPAISKTQLAGHLSRPGDVCGYGARVPKSSARASTH FT VAEPFLSLTITRIPAPTEDIHPLALPRYFTRDA" FT CDS 531..3803 FT /product="CR1-12_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="CVRVRGXGSQIFSTGKYSRCGTFSLVDDHPHSSSDRR FT HSPARAAALLHSGRLIQSPVEAPERPVAVLPHVNAQQSRPEPAYGHREGVF FT QRQLSGEYPVLEHMSRADDLLVSSTSLRTTSAAEAPNRLDTQRALSFYYQN FT VRGLRTKIDDLFLAVIDCDYDVIVLTETWLDDEIFSPQLFGTGFVVYRNDR FT DTVRTGKKTGGGICIAVSKKFDSTEFPVPTDASSLEQLWVVINGVADQKLY FT VGSVYISPDLARNSAVIESHFNSASAVAEAMRPNDLHLLLGDYNQPKITWA FT KFNSRHAHPSSSSASYPLSSSTLLDGMALLGMKQLNDVSNNLNRTLDLVLI FT DDDSVNYCSVAEAPEALLRIDPPHPPLEVTVSLRVPVPFIDEPDTRSYNFY FT KADFPALNRALLGVNWNALNETQNVDEAVAFFNSTLKTAFRDHVPTRAPPR FT KPPWSNDRMKKLKRRRAAALRKYSARRNPVTHRAFRLASKRYTHYKRHRYG FT IHVRRTQENLKRNPKLFWSFVNEKRKETGLPAKMFLGEANSTSTEETCNLF FT AQHFLSVFRPSPATPDQVRSALRAVPADALHLSAVPITEEDVRMAIAKLKS FT SSSAGPDGIPSIVLKSCAVAVTKPLQRIFNLSIQSETFPNCWKSSYMFPIF FT KKGDKRDVSNYRGITSLCAGSKLMEIIVNTLLINATKGYISTAQHGFFSGR FT STSTNLVDFISFCKRNMEGGGQVDVVYTDLKAAFDRIDHRILLAKLDHLGV FT SCKVVNWVASYLTGRTLSVKIGASESVPFCMTSGVPQGSNIGPSFFSVYYN FT DVTFVLPPGSRILYADDLKIYQPIHNVDDCRVLQQLIESFTQWCDSNLLSV FT SFSKCSIISFTRKKHPICWPYTIGDQPLERATVVKDLGILLDEMLSLSAHY FT NAVISKANRNLGFVFRIANEFRDPACLRALYYALVRSHLETAVVAWSPHTD FT EWVTRIESVQRKFTRFALRFHSWPDQVVAPSYEQRCEALGMETLSKRRQFL FT RAAFVGKLLLGAIDAPNILARINFNVIPRPLRTWNALRLDFHRTQYGQQEP FT IRAMCDVFNDFDDLFDYSMSVDSFISNLRRTIFV" XX SQ Sequence 3978 BP; 963 A; 1125 C; 907 G; 978 T; 5 other; gtsggaatag gatgcatgat caagcacact aaggaaatcc cctctctctc cgctgcggct 60 tcaagtcgct ccgcgcctgt tgttagtcat ccttacccgg cgcacggaga tcgagctggg 120 ggtttccgac ctggcttctc aggcaagtat tcatgttgtc ccagtaattc cccacttgat 180 aaaagttkgg tttccagcga atcatcctca cgccagccac tgcagaactc aatcctakgg 240 attccwtcag cccggtcgaa gtctgtacgc aacttttcgg attacggggg agcccttgcc 300 ccccctacca cactccagct atctggcccg gctattgtca gcaaccttat cgtcagcgcg 360 gaagtctcca gacgcatctc tgtcgcaccg aattccatcg agactgcttc tccaggacgc 420 ttcttgctaa gtaccgtggg agccctcaac tctcccgccg cagccgatac catcgcagag 480 ccagccatat cgaagaccca acttgctggc catttaagtc gtcctggtga tgtgtgcggg 540 tacggggcgm gggttcccaa atcttcagca cgggcaagta ctcacgttgc ggaacctttt 600 ctctcgttga cgatcacccg cattccagct ccgaccgaag acattcaccc gctcgcgctg 660 ccgcgctact tcactcggga cgcctgatac aaagtcctgt ggaagccccc gaacgacccg 720 tcgcagtcct gccgcatgtc aacgctcagc aaagccgtcc cgaacctgcg tacggccatc 780 gtgagggggt cttccaacgg caactctcag gcgagtaccc cgttcttgaa catatgtccc 840 gcgctgatga tctgttggtt tccagtacct ccctgaggac tacatcagcg gccgaagctc 900 ctaatcggtt ggacacgcag cgcgctctga gcttctacta tcaaaacgta aggggacttc 960 gtacaaaaat cgacgatctg ttcctggctg tgatcgattg cgattatgat gtcatcgttc 1020 ttacggagac ctggctagac gatgaaatct tctcgcccca gttgttcggg actggctttg 1080 tagtctaccg caacgatcga gacactgttc gcactggaaa gaaaactggc ggtggaatct 1140 gcatcgctgt ctcgaaaaag tttgattcaa cggaatttcc tgttccgacg gatgccagca 1200 gcctcgaaca gctttgggtt gtcatcaatg gcgtcgcaga tcagaagctg tacgtcggct 1260 ccgtctacat cagcccggat cttgccagaa attcagcagt tattgagtcg cactttaact 1320 cggcctccgc tgtggccgaa gcgatgcgtc cgaacgattt gcacctgctt ctcggcgatt 1380 acaaccaacc taaaataact tgggcaaaat tcaactcgag acacgcgcac cccagttctt 1440 cgtctgcatc ttatccgctt tccagttcta cactactcga cggcatggca ctgcttggaa 1500 tgaaacagtt gaatgatgtt agcaataact taaatcgcac acttgatttg gtgctcatag 1560 atgacgattc tgtaaactac tgctctgtcg ctgaggcccc ggaagcgctg cttagaatcg 1620 acccgccaca tcccccactg gaggtaactg tttcgctgag ggtgccagtg ccgttcatcg 1680 acgaacctga tacccgatca tacaacttct ataaggcgga ttttcccgcc ctcaatcgcg 1740 cacttctcgg agtaaactgg aatgccctca atgaaacgca gaatgtggat gaagctgtgg 1800 cattcttcaa ctctacactg aaaaccgcct tcagagatca cgttccaact cgtgcacccc 1860 ctcggaaacc tccctggtcc aatgatcgca tgaaaaaact gaaacgacgg cgtgccgctg 1920 ctttgaggaa gtattcagct agacggaatc ctgtaaccca tcgcgccttc cgtctcgcca 1980 gcaaacgtta cacgcactac aaaagacacc gttatggaat ccatgttcgc cgtactcaag 2040 agaatctgaa acgtaatcca aaacttttct ggtcttttgt gaacgagaaa cgcaaagaaa 2100 ccgggctccc agctaaaatg ttcctcggcg aagctaactc tacttccacg gaggagacgt 2160 gcaacttgtt cgctcagcac tttttgagcg tatttcgtcc atcgcctgca actccggatc 2220 aagtgcgcag cgcactccgc gctgtccccg cagacgcact tcacctgagc gctgtaccaa 2280 tcacggagga ggacgtacga atggcgatcg caaagctgaa gtcctcatcg tctgccggac 2340 ctgacggaat cccatcaatt gtgctgaaat cctgtgctgt cgccgtcaca aaaccgttgc 2400 agcgaatttt caacctgtcc atacaatccg aaaccttccc gaattgctgg aaatcatcgt 2460 acatgttccc aatcttcaag aaaggggata agcgggacgt ttccaattac cggggaataa 2520 catctctttg tgccggttcc aaactgatgg aaatcatcgt aaacacgctc ctgataaacg 2580 caacgaaggg ttacatctca acagcacagc acgggttctt ttcgggacgc tcgacaagta 2640 caaacttagt cgacttcatc tcattctgca aacgaaacat ggaaggcgga ggacaagtgg 2700 atgttgttta caccgacctt aaggctgcgt tcgacaggat cgaccaccgc atcttactcg 2760 ccaagctcga ccacctcggc gtttcatgca aagtcgtcaa ctgggttgca tcgtacctga 2820 ctggcagaac actctcggtc aaaatcggtg cttccgaatc agtgccgttc tgtatgacgt 2880 ctggcgttcc tcaagggagt aatataggac caagcttctt ctctgtctac tacaacgacg 2940 ttacctttgt gctcccaccc ggaagcagaa ttctctacgc cgacgaccta aagatttacc 3000 agccaattca caacgtagac gactgccggg tgctccaaca gctgattgag tcttttactc 3060 agtggtgtga ttcaaatttg ttgtcagtaa gtttctctaa atgttctatt atctctttca 3120 cacgcaaaaa acaccccatc tgctggcctt acaccatcgg agaccaaccg ctggagaggg 3180 cgaccgtcgt taaggatctt ggcatactcc tcgatgaaat gctcagcctc tccgcccact 3240 acaacgccgt tatctccaaa gcaaaccgca atctcgggtt cgtttttcgg atcgcaaacg 3300 agtttcgtga tcctgcctgc ctacgagctc tttactacgc actcgttcgt tctcacctcg 3360 agactgctgt ggtggcttgg agcccgcaca ctgatgaatg ggtgactaga atcgagtcgg 3420 tacagaggaa gttcacaaga ttcgcattgc gcttccactc atggcctgat caagttgtag 3480 cgccatccta cgaacaacgc tgcgaggctc tagggatgga aacactctct aagagaaggc 3540 agttcttacg ggcggctttc gtggggaaac tgttattagg tgccatcgac gctcctaata 3600 tcctcgccag gatcaacttc aacgtcatcc cgcggccgct cagaacgtgg aacgcactcc 3660 gcctagactt tcatcgtact caatatggac agcaggaacc tatccgggcg atgtgtgacg 3720 ttttcaatga ttttgacgat ttatttgatt actcgatgtc tgtagattcc tttatttcta 3780 acttacgccg tactattttt gtttagatct aagtttatat tgtatctcgt gaaatgtatt 3840 gaccatgttt aaaagacggt gggttttatg cctattcgag agtggcgaat ttagcgatcc 3900 aactcgaacg ggcttttacc acccagtcca ttaagacaga taaatgtcgg atggacaata 3960 aaataaaaat aacaaata 3978 // ID Copia-26_AA-LTR repbase; DNA; INV; 235 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-26_AA_; KW Copia-26_AA-I; Copia-26_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-235 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 953-953 (2011). XX DR [2] (Consensus) XX SQ Sequence 235 BP; 52 A; 56 C; 55 G; 72 T; 0 other; tgtcgggatg tgcgacatct gtccccatat ctatgtgcgg caatgtgtga gagaagggaa 60 gtaaccaata tcagttcact gttaatgtcc gtggaataaa gacgttcaaa cgtacgttcg 120 ttgtttacca ctgtcgcgtt aaattaccga aagttacccg gattgccccc tgtatttccc 180 tcgttggtag tgttccgctg cgtgacctgt ccaccgtagt ttactctgtt ctaca 235 // ID hATx-4_HM repbase; DNA; INV; 3181 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hATx-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3181 RA Jurka J.; RT "A distinct, diverse family of hAT transposons from Hydra RT magnipapillata."; RL Repbase Reports 8(12), 1823-1823 (2008). XX DR [1] (Consensus) XX CC Very young family. Copies nearly identical to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 955..2745 FT /product="hATx-4_HM_1p" FT /translation="MVSEAWKYFNKVVVNDKKYGDCKKCHKRIACPGSSTN FT GLIGHVKSCEKIDLKPFSSRPVLTKMSLNDLLAKLTALDGYSIRSITQSET FT LRMLFENYGHSLPKSASAISEKIKMFHAEAKYLMIAELAALKAKGEKFGLT FT GDEWCSNRGRRYYNLNVHHDRTYCLGMMRLPSPSSAETLNILIMEKLKEFG FT IDSEDIAGKTTDGCSWMVKLGTIFKFPHQICQNHSIHLSVCDLIYAKKDCR FT KIVNKETESIIETETETDGESEGEFLIEYSENEIEYEENIEKNLQKLRILI FT KFLKYSSFRYEVLKRKIHEIYGSSHDKGLLFDNRTRWNSVFKMIKRFLDCL FT EAVKKTLEELNCLNMLAEIDLDFLKSLSDLLEVVADAAVRLEKPNTTILEC FT DIIMETLVRKLKVAETDISSNFAETLISRYEDRKNVMLVSLSRYLHNPDFF FT KNKNHTFFYASKKDIQIYASKLFDQLFATKENSLQSKSDDHAKNKQEEPSQ FT STSFFSEIQEVLIQNQSAAKKPKKVKTFTDQLKEYEVTGVKSETVLKFELA FT IHRIQATSSEAERVFSSSSRFVTKFRSSLSDEMIDVLIFLKYYLKRSSV*" XX SQ Sequence 3181 BP; 1186 A; 452 C; 506 G; 1037 T; 0 other; cagtggatcc attcggaact ccgtctcgac ggatttttct cggtttttcg gagtttcgga 60 gttcgagaca aatcgagaaa cttcgacgaa acttcgacga agtttttaga aaataccaat 120 tttttattcg taaaataatt gaattcttat tttatttcta cgtggttttt acaattgtaa 180 attgtaattt taaaaattgt ttcaaatgta agtgttctct tcgtttaata acaacactaa 240 tatacatcgc aaagcaattg gtaatgcaca gtgggccagg tgatgatggt gaaaaaaacc 300 tcaaaaaaag tttattctat tcaaaaccat atattaaaac atgctcatac ttttccattg 360 tactatttca aaaactcttt tgtgtatatt agtatctttt tagtattagt atttttttaa 420 atccacaaaa atcaattgcg gtataataaa atgtaatgca taataaaaag caagaatttt 480 ttaattttta agcaaagata taataaacag taacttttaa aatattatgc gaatgtgata 540 ttagtttata tattttaact actcaatagt ttacctttac aaaaaaaaag tttagtcaag 600 atcattattt ataatataaa caaagactgg tcgtaaaata acgaggtaag tttttaacat 660 ataaaacata taaaactttc attttataca agtagtgtcg tattttgtat gatcaaaata 720 attttgtcca ccataagtct aaaatccggc ccactgagca ttgtggcaaa attgtgcaat 780 aaaatgttat cagtgaaaat aaataataat aataattaat gtttaattat acttacctat 840 cagaataaat aacttaataa cttaaaatgt gataactttt cacaattaaa ataaaattaa 900 aaacaaatat gggaggtcat tatcaaattg ataatttttc tttcatagaa aaaaatggtt 960 tcggaagcgt ggaagtactt taataaagtt gtggtaaatg ataagaagta tggagattgc 1020 aaaaaatgtc ataaaagaat cgcttgtcca ggaagcagca caaacggcct tattggacac 1080 gtcaaaagtt gcgagaaaat agatttgaaa ccattcagtt ccagaccagt tttgactaaa 1140 atgtcattaa acgatcttct tgccaagttg acagctcttg atgggtattc gattaggtcc 1200 ataactcaat ctgaaacact aagaatgctt tttgaaaatt acgggcatag tttgcctaag 1260 tcagcgtccg caatttcaga aaaaattaag atgtttcatg ctgaagccaa atacttaatg 1320 atcgctgaat tggctgcatt gaaagcaaaa ggagaaaaat ttggactaac aggtgacgaa 1380 tggtgtagca atagaggtag aagatactac aatttaaatg ttcatcacga cagaacgtat 1440 tgtttaggaa tgatgagatt gccatctcct agttctgctg aaacattaaa cattttgata 1500 atggaaaaat taaaagaatt tggtatagat agtgaagata ttgcaggtaa aactacggat 1560 ggctgtagtt ggatggtgaa attgggtacc atatttaagt ttccacatca gatctgccag 1620 aatcattcaa ttcatttgag tgtttgtgat ttgatatacg ctaaaaaaga ttgtcgcaaa 1680 attgtgaaca aagaaactga atcgattatt gaaacagaaa cagaaactga tggagagtct 1740 gaaggagagt ttcttattga atattcggag aatgagattg agtatgaaga aaatatcgaa 1800 aaaaatctac agaaacttcg aatactaatt aagtttttga aatattcatc gtttcgatat 1860 gaagttctta aaagaaaaat acacgaaata tacggatctt cgcatgacaa aggactgttg 1920 ttcgacaacc gaacaagatg gaacagtgtt tttaagatga ttaaaagatt tctggattgc 1980 ttagaagctg taaaaaaaac actcgaagag ctaaactgct taaacatgtt agctgaaatt 2040 gacttggatt tcttaaaatc gctgtctgac ttattggaag tcgttgctga cgctgcagtt 2100 cgcttggaaa aaccaaatac aacaattttg gagtgtgaca ttattatgga aacgttagta 2160 aggaaattga aggttgcaga aactgacatc agttctaatt ttgcagagac cttaatctcc 2220 agatatgaag atcgtaaaaa cgttatgttg gtatcattga gtcgttattt gcacaatccc 2280 gattttttta aaaataaaaa ccacacattt ttttacgctt ccaaaaaaga tattcaaatt 2340 tatgcttcaa aattatttga tcaactattt gcaacaaaag aaaatagtct ccaatcaaaa 2400 tcagatgatc acgctaaaaa caaacaagaa gaaccatcac aatcgacatc ttttttctcc 2460 gaaattcaag aagttctaat tcaaaatcaa tcagcagcta aaaaaccaaa aaaagtcaag 2520 accttcaccg atcagttgaa agaatatgaa gtaactggtg tgaaatcaga aacagtttta 2580 aaatttgaac tagccatcca cagaattcaa gcaacttcct cggaagcaga aagagttttc 2640 tcctcgtctt ccagatttgt cacaaagttt cgtagttcgc tttctgatga aatgatagat 2700 gttttgatct ttttaaaata ttatttaaaa agatcttcag tataaatact ttttttttca 2760 aatttttttt ttaattttta ctataggtgc cccaacaagt ctttattgca tagcaacgca 2820 gaagggcatt taactgaaag ttcacgcctt cttcctaacc gatgccggtt agaaagccaa 2880 ttattagata taacttgttc gttaaaaaat tgtgataaaa aaaatattta tgtactaaga 2940 aatataaaaa tataaatttt ttaaaatttt attatgaatt ataaatcgaa atatttaaaa 3000 ttaaaaaata aaaatacaag aacaatgttt aattatcaaa tagagagttt aattttttta 3060 tcaaatagat agtttaattt ttataacaaa atattatcga aactccgatt ttttctcgac 3120 tccgacggag ttttttctcg atttctctcg actccgtttt ttctcgattt tggatccttt 3180 g 3181 // ID piggyBac-11_SM repbase; DNA; INV; 1984 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-11_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-1984 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 530-530 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-11_SM is a relatively old family of piggyBac CC transposons, characterized by 14-bp TIRs (1 mismatch) and TTAA CC target-site duplications. The consensus sequence was CC reconstructed based on multiple alignment of 15 copies, which are CC ~94% identical to the consensus sequence. XX FH Key Location/Qualifiers FT CDS 241..1773 FT /product="piggyBac-11_SMp" FT /note="piggyBac transposase." FT /translation="MADFVEHIADNGSTLSSDDEFDMETDIEMENDDENDI FT WTNSSFVENKFEYSLQSRMCIAHIDNMQPIDYFMNFFDETLMNIIVCETNL FT FQRQNPIPHREHMKKWQDTTLQEMYIFLGIILLTGMIGKRALRDYWTTDFL FT FQTPVFAKIMSRNRFQDLLRCLHFNDNRKKDNDRLFKLNPIVNYLKQKFRF FT CTVPSQKLCIDESLLLWKGKLSFKQYIPSKRSRFGIKMFNLVDCSTRVLLD FT FIIYTGKETNIRIDKSIGFSGSIVMTLLLPFLNKGHILYVDNWYSSPSLFT FT LLLKKRTGAVGTVRKNRKHLPKLQAKLAKGQTQQLNNGKLLYMKWLDKREV FT FMLSSIHQPTMAPTGKTDRNNQAIVKPTCIIDYNQNMGAVDQLDMQISFTE FT VIRKTVKWYKKVFFHLLDLAAFNSYALYRKKEKSKMTHLEFRKTLVRQIFE FT KFNVFENLPSGPSVRAQHFPGQYTERDNSNRIIKHRCNVCRSLTTWNCETC FT EIALCLPGCFKTKHS" XX SQ Sequence 1984 BP; 695 A; 320 C; 324 G; 645 T; 0 other; ccctttgagt cccgcttgta tctccggaga tcatttgata ttaaaaaata actgtttgcc 60 tcatccttga acgcgatacc tagatctttt ctactaaacg agagctatct tttgacaaaa 120 taaaaaagta aaaaagttgt ttagaaaaat tttggtgagt tggagaaaag tgaaatttga 180 caccattttc tttctttgtt gaatataaaa taagtttatc taaaaaagtg atcttgaatt 240 atggctgatt tcgtagagca tattgctgac aatggttcaa cacttagttc tgatgatgag 300 ttcgacatgg agaccgatat tgaaatggag aatgacgatg aaaatgacat ttggacaaat 360 tcgagtttcg tggaaaataa atttgaatat tcgttgcaat cgagaatgtg tattgcccat 420 attgataata tgcagcctat tgattacttt atgaactttt tcgacgagac tttgatgaac 480 attattgtat gcgaaaccaa tttatttcaa aggcaaaatc ctattccaca tcgagaacat 540 atgaagaaat ggcaagatac tactctacag gaaatgtata tctttcttgg aataatttta 600 ctcactggaa tgattggcaa aagagcacta agggattatt ggacaactga ttttttgttt 660 caaactcctg tatttgccaa aattatgtcg agaaatcgtt tccaagactt attgagatgc 720 cttcacttca atgataatcg aaaaaaggat aatgatcgcc tattcaagtt gaatccaatc 780 gtcaattact tgaaacaaaa gtttagattc tgcaccgtcc catcacaaaa attatgtatt 840 gatgaaagtt tgcttctctg gaaagggaaa ttatcgttca agcaatatat tccttcaaag 900 agaagcagat ttggaatcaa gatgttcaat ttggttgatt gctctacacg agtattgttg 960 gatttcataa tttataccgg taaagaaacg aatattcgaa ttgataaatc tattggattt 1020 agtggatcga tcgtaatgac tcttttatta ccgtttttga acaaaggaca catcctttat 1080 gttgacaact ggtattcaag tccgagcctt ttcacattat tactcaaaaa gagaacagga 1140 gctgttggaa cagttagaaa aaatcgaaaa cacctaccga aattgcaggc taaattagca 1200 aaaggacaaa ctcaacagtt gaataatggg aaacttttat atatgaaatg gttagacaag 1260 cgagaagttt ttatgctatc atcaattcat caaccaacta tggctcctac tggaaaaact 1320 gatcgaaaca accaggctat tgtaaaacca acttgcatta ttgattacaa ccaaaatatg 1380 ggcgctgttg atcaactaga tatgcaaata tcgttcacag aagtaattag aaagactgtg 1440 aaatggtaca aaaaggtgtt cttccacctt cttgatctag ctgctttcaa ctcatatgca 1500 ctttatcgta aaaaagaaaa gtcgaaaatg acacatcttg aattcagaaa aactttggtt 1560 agacaaatat tcgaaaagtt caatgtattt gaaaatctac cttctggtcc atcagttcga 1620 gctcagcatt ttcctggaca atatacagaa agagataatt ctaatagaat tataaagcat 1680 cgctgtaacg tttgcagatc acttacaact tggaattgcg aaacttgtga aattgcatta 1740 tgtctacctg gctgtttcaa aacaaagcat tcataatcgc gttgccttag tattatattt 1800 agtattattt tatcaatttc tttctttttt cataaaaaaa attaattttc aaattttcaa 1860 aaaaaaattc aaactttttc taaacttttt ttaatacctt tttcttatca aattcgttat 1920 acttattcat tttcgacaat atttaaccat taataaataa aattagaccc cgggactcaa 1980 aagg 1984 // ID R1_Ele8 repbase; DNA; INV; 5677 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An R1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele8. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5677 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5677 RA Kojima K.K. and Jurka J.; RT "R1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. Both termini are incomplete; CC thus target sequences are unknown. This consensus is ~97% CC identical to the original sequence in [1]. ~79% identical to CC Waldo-6_AAe. XX FH Key Location/Qualifiers FT CDS 1..1428 FT /product="R1_Ele8_1p" FT /translation="GDPTKMENISDNTVMEVEAAEGENANAFARSGKLQRS FT PVAQQETPASSSRALQGENSRLQAGLGFQSPSFTPKSSNSICQEGLTGNSK FT LMEAKKKVNELYEYMKDRTNVHHKIRNLVTSIKSAINAAENEQKALKMRAE FT AAEKALQNAKEQTVADTPKTPKAHANKPSKKRDRDSPGEEAGSKKQKNEQG FT NYLLNGAENDEEWQTVRSQKNRGKKHGEGREKKKEGTKKXTSRPRREWSKG FT DAILVKANDQTTYAAILRKVREDPKLKDLGENVVRTRRTQRGEMLFELKND FT PVIKSSAYQELIAESLANEASVKALTQEAVVECRYLDEITSNDDLQQELRS FT QCDIGDVAMTIRLSKSYDGTQLATIRLPVAAANKLVEKEKVKIGWSVCPLR FT LVSRAERPPMRCFKCMDFGHQAAICNGPDRSELCRKCGEKGHFGKDCTKRP FT KCLLCKPEEGNAHVTGGFKCPAYKKAIASRQ" FT CDS join(1407..1952,1873..4449) FT /product="R1_Ele8_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="EGDRITTVVEVTQINLNHCDTAQQLLWQSTTEARCDV FT AIIAEPYRVPLDNVNWVADSTGIAAIQVMGRFPIQEVVSSTKEGFVIAKVN FT GVYVGSCYAPPRWTLDEFRRMLEELTDELVGRKPVLIGGDFNAWAVEWGSR FT VTNARGYSLLEALTKLDLQLCNEGTVSTFRKKRTSIDHRSYMVISSCAMKV FT PLAHLGKNGRASIIDLTWCSPSLAGNMNWRVSEEYTHSDHQAIRYTIGRRD FT CAATPRTTTIVQQWKTKAFDKDLFVEALRVDSDIPDLDAVELTRAMVRACD FT ITMPRKRRPSNNRRPVYWWNEALCTLRAACLKARRRYQRARHEEVREERRI FT VFQQARTAFKREIKLSKSNCFKEPWGDAYRVIMAKIKGPSTPAETCADKLR FT VIVEGLFPKHDPTMWPATPYGNEEVGNAEARQISNEELIAAVKALKLNKAP FT GPDGIPNVALKAAVLAYPDMVRKVMQKCLDEGQFPEIWKIQKLVLLPKPGK FT PPGDPGSYRPICLLDTLGKLLERVILSRVMKCTESENGLSERQFGFRKGRS FT TVDAIRTVLERAEKALKQKRRGDRYCAVVTIDVKNAFNSASWEAIATALHR FT MRVPDYLCQILKSYFQNRVLRYETDAGPKELRITAGVPQGSILGPTLWNAM FT YDGVLSLELPKGVEIVGFADDVVLTVIGETQEEVEMLATEAIGLVEDWMEG FT VKLKIAHHKTEVLLVSNCKAVMHGEITVGGHAIASQRKLKHLGVMLDDRLN FT FNSHVDYACEKAAKASNALTRLMPNIGGPRSSKRRLLAVVSTSLLRYGAPA FT WGAALQTKRNRDKLNSTFRLITMRVVSAYRTISSEAACVIAGMIPICIILS FT EDIDCYQRRETRQARKLARIRSLAKWQQEWDASEKGRWTHRLIPNLSAWVN FT REHGEVNFHLTQFLSGHGCFKQYLHRFGHASSPFCPECMEIEETPEHVVFD FT CPRFNIERNTVAAAFGEYFSVEGVVHGMISDPDLWNMMTNMVVQIMSGLQR FT NWREEQRNETQRAALDVGRRGADEPEANLPPELSD" XX SQ Sequence 5677 BP; 1618 A; 1367 C; 1684 G; 1005 T; 3 other; ggggacccaa caaaaatgga gaacatctca gacaataccg taatggaggt tgaagcagca 60 gagggtgaga acgcaaacgc gttcgcgaga agcggaaagc tacaacgttc gccagtagca 120 cagcaggaga caccagcaag tagcagcagg gcgttgcaag gtgagaattc tagactccaa 180 gcaggactag ggtttcaaag cccatcgttc acaccgaaat cgagcaacag catttgccag 240 gaaggcctaa cggggaattc gaagttaatg gaggccaaaa agaaggtcaa tgaactctac 300 gagtacatga aggacaggac taatgtgcat cacaagatta ggaatctagt aacgagcata 360 aaatccgcca taaacgcagc cgaaaacgaa cagaaagcgc taaaaatgag agccgaggca 420 gccgaaaagg cactacaaaa tgctaaggaa caaacagtgg ctgacacgcc taaaacccct 480 aaggcacatg ccaacaagcc gtcgaagaag cgtgatagag actcgccagg agaagaagcg 540 ggctcgaaaa aacagaaaaa cgagcagggt aactacctgt tgaacggagc agaaaacgat 600 gaagaatggc aaaccgtcag gagccagaag aacagaggga agaaacatgg cgaaggaaga 660 gaaaagaaaa aggagggtac gaagaaagwg actagccgcc cacggcgtga gtggtcgaag 720 ggggacgcca tactagttaa ggcaaacgac caaaccacgt acgcagcgat tcttcgtaag 780 gtcagagagg atccgaagct aaaagacctc ggagagaacg tggtcagaac gaggcgtact 840 cagagaggag aaatgctctt cgagctgaag aacgatccgg tgatcaaaag ctcggcctac 900 caggagttga ttgctgagtc gttggccaac gaggcaagcg taaaagcctt aactcaggag 960 gcagtggttg agtgcagata cctggacgag atcacatcca atgatgatct gcaacaggag 1020 ttgcgatcac agtgcgacat tggagatgtg gccatgacaa tccgactttc aaagtcgtac 1080 gatggcacac agctagcgac gattcggcta ccagtggccg cagccaacaa actagtggag 1140 aaggaaaagg tgaaaattgg atggtcggtt tgtccgctga gactcgtctc ccgagccgag 1200 aggccaccga tgaggtgctt caagtgcatg gacttcggac atcaggcggc aatctgtaac 1260 ggtcccgaca gatccgagtt gtgcaggaaa tgcggggaga aagggcactt cgggaaagat 1320 tgcacgaagc gaccaaagtg tttactctgt aaaccggagg aaggaaacgc ccatgtaacg 1380 ggcggtttta agtgtcctgc gtataagaag gcgatcgcat cacgacagta gtggaagtaa 1440 cgcagatcaa cttgaatcat tgcgacactg cacagcaact gttgtggcag tcaacaacgg 1500 aagcaaggtg cgacgttgcg attatagctg agccgtatcg agtaccgctc gataatgtca 1560 actgggtggc ggacagcacg ggaatagcgg cgatacaggt gatgggtagg ttccccatcc 1620 aagaagtggt ttctagcacc aaagaaggat tcgtgatcgc gaaagtcaat ggtgtctacg 1680 taggcagctg ctacgctcct ccgaggtgga cgctcgacga attcaggcgg atgttagagg 1740 aattaaccga tgagcttgtc ggtcggaagc cagtactaat tggaggagac ttcaatgcat 1800 gggccgtgga gtggggtagc agggtaacca atgccagagg ttacagcctg ctggaggccc 1860 tgacaaagct agatctccag ctgtgcaatg aaggtaccgt tagcacattt aggaaaaaac 1920 ggacgagcat cgatcatcga tcttacatgg tgtagcccat cgctggcggg caacatgaac 1980 tggagggtaa gtgaggagta tacccatagc gatcaccagg caatccgcta taccattggc 2040 cgacgggact gcgcggcaac gccgagaacc acaactatcg tacagcagtg gaagacgaag 2100 gccttcgaca aggacctttt cgtcgaagca cttcgtgtag acagcgatat cccggatctc 2160 gatgcggtgg agctgacaag ggcgatggtc agggcgtgtg acataacaat gccaagaaaa 2220 cggaggccat caaacaaccg gcgtcccgta tactggtgga acgaggcact ctgcactctt 2280 cgcgccgctt gtcttaaagc gaggagacga tatcagagag cgaggcacga ggaagtccga 2340 gaagagcgaa gaattgtgtt ccagcaagcc agaacggctt tcaaacgaga gattaagctg 2400 agcaagtcca actgcttcaa agagccttgg ggggatgcct accgagtaat aatggcgaaa 2460 atcaagggcc cgtcgacgcc ggccgaaaca tgtgcggaca aactgagagt catcgtcgag 2520 ggtctattcc caaagcatga tcctacaatg tggccggcca caccatatgg caatgaagag 2580 gtaggaaatg ccgaagctcg tcagatctcc aacgaggagc ttatagcagc tgtgaaagcg 2640 ctgaagctga acaaagcccc gggtccggac ggaatcccca acgtggcact caaagcagcg 2700 gtccttgcgt atccagatat ggtaaggaag gttatgcaga aatgcctgga cgaaggtcaa 2760 tttccagaaa tatggaaaat tcagaaactg gtactgttgc cgaagccagg aaaaccgccc 2820 ggggatccag gatcatatag gcctatatgt ttactggata cactcggtaa actcttggaa 2880 cgggttatcc tcagcagggt gatgaaatgc acagagagcg agaacgggct atcagaaagg 2940 cagtttggat tccggaaagg aaggtcgacg gtggacgcaa ttcggacagt gctcgagagg 3000 gccgagaaag cattgaaaca aaagcgtaga ggagatcgtt actgcgccgt ggtcacgata 3060 gatgtgaaga acgcgttcaa cagcgccagc tgggaggcca ttgccactgc gctgcacaga 3120 atgcgggttc ccgactactt gtgccagatt ttgaagagct acttccaaaa ccgtgtactg 3180 aggtatgaaa ccgacgccgg cccgaaggag ttaagaataa cggctggagt cccacaaggt 3240 tccatactgg gaccaacgct gtggaatgcg atgtacgacg gggtcctatc gctggagcta 3300 cccaaggggg tcgagattgt cggcttcgcc gatgacgtcg tcctaacggt aataggcgag 3360 acgcaggagg aagtagaaat gctggcaacg gaggcaattg gtctggtcga agattggatg 3420 gagggagtca agctgaagat agcccaccac aaaacggagg tgctactagt tagcaactgc 3480 aaagcagtca tgcacggtga gatcactgtc ggggggcatg ccatagcatc acagcgaaag 3540 ctgaaacacc tgggcgtaat gctagacgat cggctgaact tcaacagtca cgtcgattat 3600 gcatgcgaga aggcggcgaa ggcgagcaat gcgctcacaa gacttatgcc caacataggt 3660 ggtcctagaa gcagtaagag gcgtcttctg gctgtcgtat caacatcgct actaagatac 3720 ggtgcgccag cctggggcgc ggcgctacag accaagcgga atcgggataa gttaaacagc 3780 acgttccgac tcatcaccat gcgagtggtg agcgcgtaca ggactatatc gtcggaggcg 3840 gcatgcgtaa tcgctgggat gatcccgatc tgcatcatcc tgtccgaaga catcgactgc 3900 tatcagcgaa gagaaaccag gcaggcgagg aagttggcaa gaattcgttc gctggctaag 3960 tggcagcaag aatgggatgc ctccgagaaa ggcaggtgga cccacaggct catcccgaat 4020 ctatcggcct gggtgaacag ggagcacgga gaagtgaact tccacctgac acagttcttg 4080 tcaggtcacg gctgcttcaa gcagtatctg catcgattcg gccatgcgtc ttcaccattc 4140 tgtcctgagt gtatggagat tgaagaaacg ccagaacatg ttgtcttcga ctgcccgagg 4200 ttcaacatag aacgaaacac ggtggcggct gcttttggtg agtacttcag cgtagaaggt 4260 gtggtccacg gaatgataag cgatcccgat ctttggaata tgatgaccaa catggtggtg 4320 caaataatgt ctggcttaca gcgcaactgg cgagaagagc agcgaaacga gacgcagaga 4380 gcagcgcttg acgtcgggcg tcggggcgct gatgaaccgg aagccaacct cccaccggaa 4440 ttgtcggact gacctcggca ctcgacaagt cagtctgttg atgagaggag accatcgggc 4500 gcgatcggag taggctagat cctccgtcgg ggactagacc gagtagaacg agcgtaacgt 4560 agtatcggta ataagtcgtc gaggcgcctg tgaaccggaa gtcaccctcc aaccggaatt 4620 gcagaaccga ccctgacact tggccgacca ccatcgagtc ggagtaggct agatcctccg 4680 ccggggacta gctgagtaga tcgcgaaaca gcaccggcaa atggtcgtcg gggcgcctgt 4740 gaaccggaag tttccctcca ccggaatcgc aggaccgacc tcggcatctg cccgccagct 4800 tcgagtaggc tatccaccgt cggggactag gagtagatgg agcaccggca aatgtgtcgg 4860 ggcgccgtga ccggaagtcc ctccaccgga tcgcaggacc gacctcggca tctgcccgga 4920 tagcatcggt tcggaagatc ttccgccgcc ggggaaatct tcgtcggagt aggatagatc 4980 caccgtcggg ggctattctg agtagtgcga gcgtatcacc ggcttgagtc gtcggagcgc 5040 cagtgaaccg gaagccatcc tccaaccgga atctctgggc cgacttcggc actctaccgg 5100 ccaacaacgg agtatgaaca acgcgtaacg ttaccggttt cgggagatat tcctccgtcg 5160 aggtactctc catcggagta ggctagcttc accgtcggga actamgccga gtagtatcga 5220 tcacgacgaa ccgccggctt tgggtcgtcg gggcgcttgc gaaccggaag tcacccctca 5280 accggaatcg caagaccgac ctcggcatcc aaccggtccg ccgtagagca tgacgagcag 5340 agaaatggag tcaagcgcac caacaaaaca cagcagcatc catcgaaaaa gtctcacatg 5400 gcccaggcgt gtggtcggcc ccaaaatgtg agtgacgttg aagagacaaa ctgcagccag 5460 atcgaataga gcaagaggtg acgaaaaggc gtagaggcat caacaagccc tctgccccct 5520 gaagtaatac catgaggtag ttccaggggs acatcggact tgtagcccaa gccaaagaaa 5580 agtggcgtgg tacagagttg tttccctttt atctctgtta ccgcactcga gacgtgcatc 5640 cgcagagaga ccgtaataga ccgttaagtg agaacca 5677 // ID DGLT-P_DD repbase; DNA; INV; 6017 BP. XX AC AF298205; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 1) XX DE Dictyostelium discoideum complex repeat DGLT-P, complete DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; DGLT-P_DD; KW gypsy-like LTR retrotransposon; reverse transcriptase. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RA Glockner G., Szafranski K., Winckler T., Dingermann T., RA Quail A.M., Cox E., Eichinger L., Noegel A.A. et al.; RT "The complex repeats of Dictyostelium discoideum."; RL Genome Res 11(4), 585-594 (2001). XX DR Genbank; AF298205; Positions 1 6017. XX CC Complex repeat DGLT-P containing reverse transcriptase domain CC similar to Gypsy-like LTR; retrotransposons. XX SQ Sequence 6017 BP; 2371 A; 893 C; 938 G; 1815 T; 0 other; gcatctacca tcgttacatc aggaacaaat ttgatatgag gtataggtgg ttaaatgaag 60 aagttatcaa caagcaatcg ttggttacat taacctctga tatcggtaga aacgaaaccg 120 atgctactaa caaattttat tccgttttag attcttatgt tccggtgaag gaagtaagaa 180 taaatcaaat acaaaattat agattcaata tccaatagtc aaggtagtgc cagtggtttc 240 aataatcaag atgtatctaa tggtcaaaga tttacaaatg aataaccaag gaggttcaaa 300 taattcaaga aacaatacat tcaagaaaag aactatcgga agtttcaaat gtttcaagtg 360 tggttaagaa gatcatatcg ccagagatcg tagagtagca tcaaccaaat ttcaatcagt 420 caatgtagta tttagtgata gtgttccaaa tcaatgtata gataatgaaa gaaagccacc 480 agtggaaaat gaagatcaac agccatcaaa atcaagaaga attaataatg aagaagcaga 540 agaaagtaag ccacaggata cttttatgtt agccaatggt gtctactgaa gttcaatggc 600 accatccgct gctgccatca atccaacatc tgtaagaata ttaggggata gtggtgccga 660 tatcaacata atcccaaaag cagcattaac aatggaacaa ttgaaatcaa gagaagaatc 720 aaatgtaatt tttccgtcag cttgtggtaa ttcagtcact tggttcaata gcttgttcca 780 actgttgcaa ataagaccgc tgcaatggct gccaagtttg agattagtag tgatgtagtg 840 gaaatcattc ttggttccat gggtagtaca tacaattgta aggatgatat atggtcaatt 900 caagttagta agaagcaaca tcaagtaaat cagtacgtcc gtccaattag tatatgtctt 960 attagtgaac tggatgaaaa atttaccaag aaaggtatca agaaggaaat taggaactgg 1020 atcaacaacc tcaagaccat gtacaatatt caaatcaatg atcaagaggt tttcaatcat 1080 caaccaacgt accaagttca caaagttaat catcagtcga atcaagaaaa atacaaggta 1140 gtggaagtaa aacaccaaga atggaaggaa gttcaagatc aattcatcat caaccatctg 1200 aaacaaaact acaaggaagt cttcgattca ttaaaccaag ttccgccatc tcggaaaggt 1260 ttcaatttat aagggaagtt aaaagatggt gcacaggttc caaaatcaag atgttatcca 1320 gtaccattgt caatggaaca agaattgaaa gatcaactta ctgaaaggtt aaagaagtta 1380 tggattgaaa gaaacaaatc agaattcggt gcaccagtga tattcgcaaa gaagaaaaat 1440 ggtaagtggc gaatgtgcat cgataaccgt tcattaaatg actttactat ttatgatagt 1500 tatccacttc caaatacaaa ggtactcatc cagaagacca aaggtgcaaa gttaatgtcc 1560 aggatcgatt tagccgatgg tttccaccag atccaagtag aacctagaga tcggtcaaag 1620 acagcattcc atacaccttt tggtacattc cagtggaggg gaatgccatt cggtgccaag 1680 aatgcaccgt atacttttca aagatttatg taacaaatat tggccgatga aatcgattga 1740 ggactcgtcg tggtctatat cgatgatatt ttagtactaa caaaatcttt ttattatgat 1800 caacattaac aagatttatc acaggtattc aataagctca agtatcataa cctcaatgta 1860 atgtagtaca actcttcatc cgtgtcaaat cgtcgagtgg ccagaaccaa ttgataagac 1920 taccgttaga taatttatag gaaccgttaa ttattgcaag gatttcatcc gaaatttagc 1980 cattatagct gatccactct atcgtctcac taagaagaat tctgctttca catgggatac 2040 tgaggcgcaa ttggcttttg atcaagttaa atcagctgta gccaatgctg taagtctcaa 2100 ccttccagat accaagtatc cattctacct tgaatgtgtt agaaaaggaa gcctattcgt 2160 gaaatcatta gaaatgaatc acttcactat ctttgggttc aaggtaatca tcataataat 2220 atttaattcc taagaaacca aagggtcctt gggttaaatc aaagaatcaa cagatggatg 2280 cagaagattg atgcctgaaa agtcacactt gaatacatca aaggcgtcac caattcagcc 2340 gctgatggtc tttcagatca cattataata ataatattta tcacataact tcgatcgaag 2400 taaaatggac ttctctcaaa ttactaactc tcttaaaaat tctttttgtg ataaaaatta 2460 ttttaataca ggtatcattc ttgatgaaat gaagatcgtt aaggaaattt caaaaggtta 2520 tgaattgggg agttagctaa acttattaaa tctgaaacca tcaacgacca gtataaaagt 2580 ttaagtatca catatcagaa tggtatcgaa tggatcaaga ttgaagatgt catacagtta 2640 gtcatttcaa gttatcatga ttctgctatt gctggacact ggagtaatcg taaaacttct 2700 gaattagtca agagaaattt ttattggtat ggaatgatgg aatatatctt aatttactgc 2760 aagagttgca atgtttgttt atgcgccaca gatgaacatg ggattatact tgccatctca 2820 tttacctgtc tgatgcttta tggaaatcaa tatggacttt atatccgggt tggatgcctg 2880 tactgttgat caagttgagt ataatgatac tttgttcaca tcaacacctt ggaagaaatg 2940 tattcaagtc aacaatatta ctcacaatac atgtctacct taccaccatc atggtaatgg 3000 tcaagctcaa atcatggtca gaattgtctc gaatgttctc cgaaagtgtc taatcaattc 3060 caaatttgtt tctaatccag atatgtcatc caatatacaa aggaaagaag ataatcagtt 3120 gcatgtaaaa gaaaataaat cagtcacaga ttttatcaat caaataaatg ttagaaactt 3180 aaaaccaaat caatataatc atttagcaac ttatttaatg atgattataa tgaattattt 3240 gggactgaat taataattaa taatatcaaa cttaaacata ttattgttaa tggaaatgaa 3300 aaagaacaaa aatatatcat aaatttccgc aaaaaaaaaa aacaaaaaaa aataaaaaaa 3360 aaaaaatttg tttttttttg ttaaattttt ttttttgttt tacaatataa aatttaatat 3420 gaaaaagttt acaactgaat gttttatatg taattgtgta attgaatttt cagccaaaac 3480 gattgaatta attcaagaac atgttcataa atgtataaat gttaaactag atgaaaatga 3540 aattgaaaga tttccaacaa ttgaacaatt taatgataaa caaaaacaga tagaagaatt 3600 atcaaaaaaa gttgaagaat taacttttaa attgaacttt gttgaataaa atatgaatag 3660 ttgttcactc tcaagtgagg aaaggtggaa aagaaaaaga acaaatgaat taaaactaaa 3720 tttcttcaaa gatgaatcac agctagttca aacatcaaaa tcagtccaaa caaccaaacc 3780 aaatttagtt gaaccaatcg aaaccattga aaacattgaa cccattcaac agatcgaatc 3840 gactcaacaa acaaaacaaa tatcaaatgt aaattcacca gtaaaattga aacaatatac 3900 acaaaccatg gagacaatta taagaagatt ctccccaaaa gaaaaagaaa aagaaaaaga 3960 aaaagaagaa aaagatgaaa aatcaaaaga taaaaaagaa ccaattaaaa caagatcaaa 4020 atcaaaaaat gatgaataag aagaagagga tgaacaagag ttggttcaag gaaaaccaaa 4080 aggtaattcc aaaaatatca aaacaaaaaa ataaaaataa aagacaattt ttaacaaata 4140 tttgtattat tatttttgta tccattgaaa aattaaagga taaattttta tctttaattg 4200 gaatatcata cgaaaatttc aatgactaca ttttaaattt agaaaagttg gttgttaaaa 4260 atggtgaaaa aaggggtcga aagaaactat attgcatttc agagagattt tttattgcaa 4320 tggtttattt tcgccactat acatgacctg ttataatgga agttcttttt aatattgata 4380 ccagaacttt aaatagatat gttgatgaat atatattata tacaaaacaa ttagcaatta 4440 ataatttcca atcaatattt aatgatagaa aagataaatc aaatagttgt gactattatg 4500 atggtgttaa acaatataca gtaacaatgg tggtcgatgg atcggaacaa caaataacta 4560 ttcccaccaa tgaaaaatta agaaatatgt tatacagtgg taaaaaatgt aaatcaactt 4620 taactctact agtttattgt tgtcccgata gtgggaaaat attacacata gggtacccaa 4680 gtggtggttg taaaaacgat attaatttat ttcaaagaga tttagaattt attaaatcac 4740 tagatgaaaa tgttgattcc gttttggtga caaaagattc aggggtttaa caaaacattt 4800 caaaaatata tttacgatcc catcaggtca aagaactgat taccaaaaat aaaaagataa 4860 caatcaaaaa tctattagaa taataataga aaacgtattt acaagaatta aacaatttta 4920 aatagcaagt ctaccactta gatataaatt taaaagctat acaatagata tggataaagt 4980 tttggaaaga catcacgatg tgtggtgtgt cattggctat ttagtcaata aataccaaac 5040 aatcagaagt gatttatcaa atgaaacaaa acttgttgat tttggtgctc tttttgatat 5100 tcaataaatt taatttttcc aagttaaaaa cttcatatta aattaatatg aaaaaaacat 5160 ttttaacaaa aaaaacaatt tttttttttt ttttttcatt ttttgaattt tttcttgcgg 5220 aaatttatca aacttcaaaa agttactaac caatatcaac aattaaacac ttatccatta 5280 gatcaatgct taaaagataa tgaaaatatt attgtaataa taccattaga gacttttgta 5340 attaatcaag taatttccat acatcatgat tcagcactag caggtcattg aagtagtttc 5400 aaaaccattg attaaattaa aagaaacttc ttttgaaaag gagtgattgc tgatatcaaa 5460 gattattgta gtaattataa aatatgtatt tgtgcctccg attcaaggaa acatcaaatt 5520 ggattactat cacctctttc accatcaaga tgttttagtg aaattagcat ggatttcctt 5580 tcaagtttga acaaatgtac aattaatcaa attgaacttt aatagaatat taattgtagt 5640 tgaccgatta tccaaatatg tcgtactcat accaattcca tcatctacta attcaaaaga 5700 tatttttcca ttgttacaag atcaagttat gtttacattt agatttccat ccgtaattat 5760 ttcagataat gatccgttat tctcaagtaa ctcatggtcc aagttcttaa ttaataataa 5820 tgtcaaacat cacacatgct taccctatca tcatcaggcg acggtcaagc agagatataa 5880 gttagagtct tagcaaacgc tttacgtaaa actttattac aagccaaacc tctgcaaatc 5940 cagagcttga tatgactgat attaataata ttcataatag ttcctggaca ttgtacttaa 6000 aagtggttca atttttc 6017 // ID BEL4-LTR_AP repbase; DNA; INV; 280 BP. XX AC Contig12244; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL4AP; BEL4-I_AP; KW BEL4-LTR_AP. XX NM BEL4-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-280 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 436-436 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 280 BP; 89 A; 55 C; 46 G; 90 T; 0 other; tgttcggtca ctactcatta tatattatta ttatcggttg tactgttatc ggtgactcga 60 ccgggacacc ggatctataa aattattgta agtcagtcga catagtatcg cgcaaccgag 120 tgcacgtcgt tcgaccctct aagctttttc gtattattaa ttgtaatatt tcaagttatg 180 tttgcatcag ctaaataaat ttatacccaa agtaaaaaat caagtcgagt aattagtttc 240 atgagtacct acccaaacca cgtaacatga gctaaaaaca 280 // ID Copia-24_CQ-I repbase; DNA; INV; 2270 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-24_CQ_; KW Copia-24_CQ-LTR; Copia-24_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2270 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 363-363 (2011). XX DR [2] (Consensus) XX CC 'CTCGA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 171..2177 FT /product="Copia-24_CQ-I_1p" FT /translation="MEHEFVSHAFDGTNFSCWSFRIEADLKAQKLHHCIER FT TLEEESFFTVVAEEGVAERVRKEALQKSRKVEDEKCKLFLLKTIAEPQREV FT VRGMSSPKLMWEALKELSDRKAVRAGSNGTFPSKCHICGKPGHKKVDCPLK FT KKAKHAAGSVQQPKKWKAHAAESSAGAKEDVPFEPEAEEANSEEADGKFRW FT VLDSDVCEHMVGDRNLLVNVRRMETPKVVNVAEAGVPLESRYVGEVKMKAV FT VGRRTLNCTVHDVLYVPGLTQNLFSVEKEAERGMEVSFGRNHARISKDGDV FT KCTATRNGSLYVLDVELEMCGSAVVAELESQEEVPEDKMCTDDELDSEQSD FT PDDTVQEEEEVPQENLVSVPEENAVPDLEETAGERRGRVEKHPAGFSELEL FT ERKWAARPDHELKLKWPARPDHDVCVAVALNTEAYVDEILDTNEALQKRDD FT WPLWKRAIDDELRSLEKNRKNRKWDLVEAPAGRRVVCCKCVFRIKKEEDGT FT AARYKARLVVTGCSQRAGYDYKETYARNVVRELPAGAVEQLHQTKERKPPG FT FERGIKVRKMNKSLVGAKALQNRKKRYRRRMLFKKISKLEGAQTERDRRSA FT ARSRSPLNIRFCLRTDRNRHQLSGGKSRGEQVHYEEDLQDRKRMNLGASAT FT TTSVIAGQEELTKLPVPPDK" XX SQ Sequence 2270 BP; 609 A; 512 C; 758 G; 391 T; 0 other; ggttgtgtgg cccagcaaat cgcgtggcgc gtcgccaagg aaaccgcgaa cggaaggaaa 60 aagttttttc catgtgcgcg aacgtccggg aaggtgacgc aattttcgcg atcggaaagt 120 gagttgaatt ttttgggaag tgagacgcga aaagaacgtt ttctagtgcg atggagcacg 180 aatttgtgtc gcacgcgttc gacgggacga atttctcgtg ctggagtttc cggatcgaag 240 cggatttgaa ggcgcagaaa ctccaccact gtatcgagcg gacgctggag gaggaatcgt 300 ttttcacggt ggttgcggag gaaggcgtgg cggagcgggt gcggaaggaa gcactgcaga 360 agtcccggaa ggtagaggac gaaaagtgca aattgttcct gctcaagacg atcgccgagc 420 cgcaaaggga ggtcgtacgt ggaatgtcat cgccgaagtt gatgtgggaa gccctgaagg 480 aattgtcaga ccggaaagca gtgcgtgccg gaagtaacgg aacgtttccg tccaagtgtc 540 acatctgcgg gaaaccgggg cacaagaagg tggactgccc gctcaagaag aaggcgaaac 600 acgctgccgg aagtgtgcag cagccgaaga agtggaaggc gcatgcggcg gaaagcagtg 660 ccggtgccaa ggaagacgtg ccgttcgagc cagaagcgga agaagccaac agtgaggaag 720 cagacggaaa gttccggtgg gtgctggaca gtgatgtttg cgagcacatg gttggtgatc 780 ggaacctgct ggtaaacgtg cgtcggatgg aaaccccgaa ggttgtcaac gtggcggaag 840 caggagtgcc gctcgaaagt cgctacgtcg gagaggtaaa aatgaaggcc gtcgtcggaa 900 gaaggacgtt gaactgcacg gttcacgacg tcctgtacgt gcctggattg acgcagaacc 960 tattctccgt ggagaaggag gctgagcgcg gaatggaagt ctcgtttggc cggaaccatg 1020 ctcgcatctc aaaggacgga gatgtcaagt gcactgcaac gcggaacgga agtttgtacg 1080 tgcttgacgt agagctcgag atgtgtggat cggccgtggt tgctgagttg gagagtcaag 1140 aagaagtgcc ggaagataaa atgtgcacgg acgatgaact tgacagtgaa caatcagatc 1200 cagatgacac agtgcaagaa gaagaagaag tgcctcaaga aaacctagtg tcagtgccgg 1260 aagagaacgc agtaccagat ctggaggaaa ctgcgggaga aagaagaggt cgcgtggaga 1320 agcatccggc tgggttctcc gaactcgagt tggagcggaa gtgggcggca aggcccgacc 1380 acgagttgaa gctgaagtgg ccggcaaggc ccgaccatga cgtctgcgtc gccgttgcgt 1440 tgaacaccga ggcctacgtg gacgagattc tcgacaccaa tgaagcgctg cagaaacgcg 1500 acgactggcc gctgtggaag cgcgctatcg acgacgagtt gcggtcgcta gagaagaacc 1560 ggaagaaccg caagtgggac ctcgtcgaag ctcctgctgg tcgccgagtt gtgtgctgca 1620 agtgcgtgtt caggatcaag aaggaagaag acggtaccgc tgccaggtac aaagctcgtc 1680 tcgtcgtgac ggggtgctcg caacgtgcgg ggtacgacta caaagaaacg tacgcccgaa 1740 acgtcgttcg tgagctgccg gcaggagcgg tggaacagct gcaccagaca aaagaacgga 1800 agcctccagg atttgagagg gggattaagg tgcgcaagat gaacaaatcg ctggttggtg 1860 cgaaggcgct gcagaaccgg aaaaagcggt acaggcgtcg aatgctgttc aagaagatct 1920 cgaagcttga aggagcgcaa acggaacgag accgtcgatc cgctgctagg agtcgatcgc 1980 cactcaacat tcggttttgc ctgcgaaccg atcggaaccg ccatcaactc tccggtggta 2040 agtcaagagg agagcaagta cactacgaag aagatctaca agaccggaag cggatgaacc 2100 tcggagcatc cgcaacaaca acatccgtga tagctggcca ggaagaactc accaagttgc 2160 cagttcctcc tgacaagtga ccagctcgcc gatgtgttaa ccaagggccg aagctgtttc 2220 gaccgagaag ccaacgagaa gcttcgaagc actttaggtt tgagaggggg 2270 // ID Crack-2_CP repbase; DNA; INV; 4815 BP. XX AC . XX DT 22-JUL-2009 (Rel. 14.07, Created) DT 22-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Culex pipiens Crack non-LTR retrotransposon. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-2_CP. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-4815 RA Kapitonov V.V. and Jurka J.; RT "A family of Crack retrotransposons from Culex pipiens."; RL Repbase Reports 9(7), 1333-1333 (2009). XX DR [1] (Consensus) XX CC The Crack-2_CP consensus sequence was derived from multiple CC alignment of 4 copies of Crack-2_CP that are 97% identical to CC each other. XX FH Key Location/Qualifiers FT CDS 304..1356 FT /product="Crack-2_CP_1p" FT /note="ORF1." FT /translation="MSHGEDAIVCAGCEKEEADENKVIECVECHRYWHTKC FT KKLYGSTARRARSKPFLCSTECSELRSSVENDKKAEGLIAKVLSEVQCMRQ FT EHAESNRELRNAFKELEKSQSFLAEKFEGINNDIKDLKLGQHFLKGQVDEV FT HERYESVGATVERLEKEVDQHNRANIKKNAVILGVPATKDENITAVIEAIA FT EAINCQLPEDAVFEAKRIGDQKDGGKVAPIRVVFANEETKEKLFDCKRTHG FT VLKAAQLGSNFASASGRIVLRDELTPYGINLLREVRDLQESLEIKFVWPGR FT DGVILMRRTENSKIERVRDKNDIRKLNQRRAKRSLDETNSSFNMSTNSSPR FT IQQESKKR" FT CDS 1405..4311 FT /product="Crack-2_CP_2p" FT /note="ORF2." FT /translation="MSVLNYNYECIEECITNKAELGFHASFRVFQWNIRGM FT GSLTKFDVIKEFLDRYNERIDVIVIGETWIKEGCQDMYVLDGYKAVFSCRR FT SSHGGLAMFVREDLHLNLTEVRHTEGFHIIDADVQACGRDIHVIAVYRPPG FT YPFTSFLDIIDDKLSRTKKDQQFFIVGDVNVPTNLPADNVVREYSRMLSSH FT NLKVTNTTVTRPASGNILDHVVGSIGLGANLLNETIECEVSDHSMILSSVN FT FKVPKVTQILEKRIINHTLLNQRFANVLANLRSDLDANEKIKEISARYNDV FT LNASSKVVTIEVKTKGYCPWMSYDLWYLMRMKENLRRRSKTHPSDSHLKDL FT LKHVSKTLQDKKEKAKREYYHNLIHNSSEKNSWKLINEVMGRKTKTKQEIS FT LKMNGETISDTSQVANALNEFFCSVGAELASEIDSDKNIWKFQSLPSCNLS FT LFLRPATLEEIILLIKDLNVNKAPGPDGVPATVIKTHHLAFANILLEVFNE FT IISTGQYPESLKLARVVPVFKSGEATNVNNYRPISTLSAMNKIVEKLIAKR FT ISAFLSANGLISNFQYGFRQGSSTLTATNELLEDIYGHLDNRRFSGALFLD FT LKKAFDTVNHELLLRKLERYCIRGIPNKLLRSYLSNRLQFVSVNGFSSDHL FT HIATGVPQGSILGPLLFLIFINDLPRMTWRGQIRLFADDTVLLYGNADSTR FT IAEDIKADLQCLLEYFKSNLLSLNLGKTNYMIFHSAWRNVPVLPAVLVEDV FT EIQRVSVYKYLGLILDSKLNWELHIDHLKQKIAPICGALWKMSSFVPRTWL FT LKLYYTMVHSRLQYLTAVWGSARQVRIRELQSLQNRCLKVVFNLPRLYSTQ FT MLYEDPRHSALPIRGLHAIQNITLTHNMINDPETHHNLELPTLNSGRPTSS FT TGDLRLTRPNTELGKNRFTYIGYKLHNLLPVPLKLVTSKQSFKQRLRSLLK FT STVSCFLAANDVRL" XX SQ Sequence 4815 BP; 1442 A; 1127 C; 1095 G; 1151 T; 0 other; gagtgctgat tcttaccaag tggctgcatt gttcttcggg gatggaaaac cgaccccgga 60 aaagcaagtt ctacgctggt gtttggagac atttttgccg caaactatcg ggattcgtgc 120 agggagtcac tggtggtatt atccgacgca ggagaaagca ggttgtacaa aaaatcatca 180 ccaaaatttc gacataatta caatagtcca cactttgcac tgtcaaccat cgggaggttc 240 atcgattttg ctgcagtcta tcgtcactac cttcttatct caccatctga gacctttctc 300 actatgtctc acggtgagga cgcgatcgtg tgtgccggct gtgagaagga agaagctgat 360 gaaaacaaag tgatcgagtg tgtggagtgc cacagatact ggcatacgaa gtgcaagaag 420 ctgtatggta gtactgccag gagagctcgc tcgaagccct tcttgtgcag cactgagtgc 480 tccgagctga gatccagtgt tgagaatgat aagaaggcag aagggttgat agcgaaggtg 540 ttgagcgaag tccagtgtat gcggcaggag catgcagaga gtaatcggga actgaggaat 600 gccttcaagg agctggaaaa aagtcagtcg tttctcgctg aaaagttcga gggcataaac 660 aacgatatca aggacttgaa actaggccaa catttcctga aaggacaggt cgatgaggtg 720 catgagaggt acgagagtgt tggtgccact gtggaaaggt tggaaaagga ggttgaccag 780 cacaaccgag ccaacatcaa gaaaaatgct gtcatcctgg gagtaccggc aaccaaggac 840 gaaaacatca cagctgtaat cgaagctata gcagaggcaa tcaattgtca actacccgag 900 gatgcagttt tcgaagctaa acggattggg gatcagaaag acggagggaa agtggctccg 960 attagggtgg tttttgcgaa cgaggagaca aaagaaaagc tgtttgactg caagcgaacg 1020 catggtgtcc tgaaggcagc tcaactagga tcgaactttg catcagcttc gggcagaatc 1080 gtcttgaggg atgagcttac gccgtacggg ataaatctgc ttcgtgaagt acgtgatcta 1140 caggaaagtc tggaaatcaa gttcgtctgg ccgggaagag atggtgttat cctgatgaga 1200 agaaccgaaa attcaaaaat cgaacgtgtt cgcgataaaa acgatatccg gaagctgaat 1260 caacgtcgag ctaaaagaag tcttgatgaa actaattcct cgttcaacat gtccacgaat 1320 tcctcgccgc gcattcagca ggagtccaaa aagcgttaat tgtatgttgc cttttcatta 1380 tgaaatgtat cttttttctt taaaatgtct gttctaaatt ataattatga atgtatcgaa 1440 gaatgtataa caaataaagc tgaattgggg ttccatgcgt ccttcagggt gtttcagtgg 1500 aatattagag ggatgggatc tctgaccaag tttgatgtaa ttaaagaatt tctggatagg 1560 tataatgagc gaattgatgt gattgttatc ggagagacat ggataaaaga gggctgccaa 1620 gatatgtacg tgctggatgg atacaaggct gtcttctcct gtcggcgttc ctcacatggt 1680 gggttagcca tgtttgtgag ggaggatctt cacctaaacc taacggaggt tagacatacg 1740 gaaggatttc acataattga tgcagatgtt caagcttgtg gccgtgacat ccatgtaatt 1800 gctgtttatc gaccgccggg gtacccattt acgagttttc tggacatcat tgatgacaaa 1860 ctgtctcgaa ccaaaaagga ccaacaattc ttcatcgttg gtgatgttaa cgtgcccaca 1920 aacttgcctg ctgataatgt tgttagagaa tattcccgta tgctttcatc gcacaactta 1980 aaggttacca acaccactgt caccaggcct gccagtggga acattctgga ccacgtcgtg 2040 ggatctattg gactaggtgc aaacttgctg aatgaaacga tcgagtgtga agtcagtgat 2100 cattcgatga ttctgtcttc agtcaacttt aaggtgccga aagtaaccca gattctggag 2160 aagcgtataa tcaaccatac gctgctgaat caacgttttg ccaatgtctt agcaaacttg 2220 cggagtgatc ttgatgcgaa tgagaaaatc aaggaaatat ctgcaagata caacgatgtt 2280 ttgaatgcgt cctcaaaggt tgtcaccatt gaagtcaaaa caaaaggata ctgtccatgg 2340 atgtcctatg acctttggta tctaatgagg atgaaagaaa acctgcgtag gagaagtaaa 2400 acccacccca gtgactccca tctgaaggat ttattgaagc acgtctccaa gaccctgcag 2460 gacaagaagg aaaaggcaaa aagggaatat taccacaacc tgattcacaa ttcatcagaa 2520 aagaactcct ggaagctgat caacgaagtt atgggccgga aaaccaaaac taagcaagaa 2580 atatcattaa agatgaacgg cgagactatt tctgatactt cgcaagtggc caatgcactc 2640 aatgaattct tctgcagcgt gggagctgag cttgcatctg aaatagacag cgacaaaaat 2700 atctggaagt tccaatctct accctcatgc aacctctccc tgttcctgag acctgctacc 2760 ctggaggaga ttatactgct gatcaaggac ctcaacgtta acaaagcacc tggtccagat 2820 ggtgtaccag cgactgtaat caagacgcac caccttgcat tcgcaaatat attgctggag 2880 gtattcaacg agatcattag caccggtcag taccctgaat ccctgaagct ggcacgcgtt 2940 gttcctgttt tcaaatccgg agaagcgacg aacgttaaca actacaggcc aatatcaacg 3000 ctttctgcca tgaacaagat cgtggaaaaa ctgatcgcaa aaaggatatc tgcattcttg 3060 tctgcaaacg ggctgatctc caactttcag tacggatttc ggcaaggaag tagtacattg 3120 acagcgacca acgaacttct ggaagatatt tatgggcacc ttgacaaccg ccggttctct 3180 ggagcgctgt ttcttgatct caagaaagcg ttcgatacgg tcaaccacga actgctgctt 3240 cgaaagttgg aacgctactg tataagagga attcccaaca agctgttgag aagttacctg 3300 tcaaaccgcc tccaattcgt ctcggtaaat ggattttcta gtgatcatct acacattgcc 3360 actggagtgc cccaaggaag cattcttggt ccactgctgt tccttatctt catcaacgac 3420 ctgcctcgta tgacctggcg tggacaaatt cgtttgtttg ctgatgatac ggttcttctg 3480 tatgggaacg ctgacagtac aaggattgct gaggacataa aagctgatct acagtgcctt 3540 cttgagtact tcaagtcaaa cctgttatca ctaaacctgg gaaaaacaaa ctacatgatt 3600 ttccactctg catggaggaa cgtgcctgtc ttgcctgcgg tacttgtaga agatgttgag 3660 atccagaggg tttcagtata caagtacctt ggtctaatac tggacagtaa gctgaactgg 3720 gagttgcata tcgatcatct gaaacaaaag attgcaccca tctgcggagc tctatggaaa 3780 atgtcgtcgt tcgtgcctcg aacctggctg ctgaagttgt actacacaat ggttcactcg 3840 agactccagt atcttacagc ggtatggggt tcagcgcgcc aggttcgaat tcgggaactg 3900 caatcgctac aaaaccgctg tttgaaagtg gtgttcaacc ttccccggct gtattccacc 3960 caaatgctgt acgaagaccc gaggcactcc gccttaccta tccgaggact ccatgccatt 4020 cagaacataa cactgacgca caacatgatc aacgatcctg agacccacca caaccttgag 4080 cttccaacac tcaatagtgg acgccccacc agctctactg gagaccttag attaacacgc 4140 cccaacacag agcttggaaa aaatcgcttc acgtacattg gctataaact gcacaacctt 4200 ctgcccgttc ccctaaaact agtgacttcg aagcaatctt tcaagcaaag actccgtagt 4260 ttattgaaat ccactgtcag ttgctttctt gctgccaacg atgttcgcct ctaatgccac 4320 cctatctgct ctaactctcg ttttccaatg ccgccaccgc ccgccgagac cgccaaccgc 4380 caaccgccca ccaccaaccg cccaccgcca accgcccacc accaaccgcc caccgccaac 4440 cgccaaacgc ccaccgccag ccatcaaccg cccatcgcca atcgcccacc acccaccgcc 4500 caccgaaaac cgccaaatgc ttaccaccca caccccacca cccaccacca tcaattgtat 4560 tcactaacac cagtctctaa ctcttaaaaa aaaaaaaaat taaaaaaaag aatcgcaaaa 4620 aaataaaaaa aaagaagaaa aaatcctctt aaaagagcaa aattgctcac tgaggatttg 4680 tacagttaag cgagctggaa aaaaaaaatc caccagcaac aaccaccgag cctaacctct 4740 tccaggtagg tgaatgtttc cagctgcttg ctaggaagtg tgaaattcgg tattgagtaa 4800 aaaaaaaaaa aaaaa 4815 // ID Copia3-NVi_LTR repbase; DNA; INV; 290 BP. XX AC AAZX01004352; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia3-NVi; KW Copia3-NVi_I; Copia3-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-290 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1130-1130 (2007). XX DR Genome; AAZX01004352; Positions 15033 14744. XX SQ Sequence 290 BP; 75 A; 57 C; 56 G; 102 T; 0 other; tgttgtagtg taatttttcg tattgtctat atagagcgta tgtgtatgta tgtgctagag 60 agagagcgct cctaaacgct ctctatctct ttctcgcgga gcgtcgcttt taactgttct 120 gtgtagaata agaattcaaa ttagttagtt gagcacgagc gtctagagtg agcacacgtc 180 tggtgcctct cttgtgtact tatagctaga ataaatctca atcaaacaga cattgtctct 240 ctctcaataa acctttaata tcctgcttat atccgtgctc atatttaaca 290 // ID Kolobok-5_TV repbase; DNA; INV; 3489 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-5_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-3489 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 122-122 (2007). XX DR [1] (Consensus) XX CC Kolobok-5_TV is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the T. vaginalis genome CC in a last few million years. The Kolobok-5_TV transposon is CC characterized by 20-bp terminal inverted repeats, TTAA target CC site duplications, and it encodes the 833-aa Kolobok-5_TV1p CC transposase. Kolobok transposons, including numerous families of CC non-autonomous elements, constitute >2% of the T. vaginalis CC genome. See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS 467..2965 FT /product="Kolobok-5_TV1p" FT /translation="MEYSIPIMEGYTIEEILITYKADDCQKRVYHRTVYSN FT KKPFSHFINLNQLNSKQDHSQSNVKEETIFNVKFKPGVPYSDVVKNLHTYI FT SLISGAGFSKHIELASLLNFHCPTETTLLAHLKSILPDLEKVATASCEANI FT QEILKHDSEYKFNVSLDCAWSSKRDALHAVITLIDINTNKILDFRIVSRHP FT ELIKFKSPKILVEPKISPGLMESYALRDMLKSDLWKKIKVFCSDGDVKDST FT MIENSGAQLSHVRDPNHVIKSAFSKIVTKYPKFEKIFRILFACPTLKPEQK FT VQRWMGLINRYPGESPEHIEKREIITNIAPLFLRISKQYHTNFNEAFNSLK FT AHLVPKNLRWIIGFPARIFVSVFAYNEDNWKTKLRRLMGIDDSYFDEDILQ FT VILNFENKSDYDHIRRTSENYRNHNRCVHYIPDEIDPEFADLVPHKPSKDG FT SRSEKIEYTDDEDPPADEEPSESENLSLPTDYPKFNKKSIFGSDIKVPPDW FT KRFSFFFKAKEIPDDPQEFQNLLSKFKQDLYNTYSSLIRINEENISAVAVY FT DSQQAKEIMNSAFTDFHNFEKIKINAINYLNVLSHNPPSLEMIADFHPDLI FT QLPQTLRKTHDKPKKQKLGIKLGKNLSFLAVIVQLITFRPLMHKLFDELFS FT NPNEQQLIYKEILSLTNRNFSENADVTYLYHLYSKRQKIHQADPIDAFKQL FT QESLACEARQLDLIDYFKKFCWGFTDKIDIPLVNNNELQVTPTFSDYLFID FT VAGFTFPLDDYTWNNNPAVKSKLVAIITYHSRQYNIIILDENINSWILFCN FT TKISYIRDEELANNLNKGAYNTPKYLIFEKI" XX SQ Sequence 3489 BP; 1271 A; 560 C; 517 G; 1141 T; 0 other; gggtgcccta tagacgatag accaccctca gaaaagcact tttaaggtta tttctgttaa 60 gcttttatct cgttcaagag agagattttt tgaactttac catatcataa tgcttagtta 120 tattatgacc aatctctacg attttgaaga tttaaattac gttgagaatt catcgatttt 180 gtgatttttt gatttttttt cgagtttttt tgattttttc aaaaaaataa aaaagaaaga 240 aaataaggaa ggcatatagt gttttcatta aaaacaatta ttacagtaaa caatgtcgcc 300 aattattggt tttttgaaat aataaatttg aaaaatgaaa tattgagcat atattctacc 360 caatatcaat tgactgatta aatacaatat atatctttgc tttattcact ttgattactt 420 ttgtattaaa aaaaccgtaa aatttggcgg gattttagct cttcctatgg agtattctat 480 tccaatcatg gaaggataca caattgaaga gatattaata acatataaag ctgatgactg 540 ccaaaaaagg gtctatcatc gaactgttta ttcaaataag aaaccttttt cgcatttcat 600 taatctaaac cagcttaatt caaaacaaga tcattcacaa tcaaatgtta aggaagaaac 660 aatattcaac gtcaaattta agcctggagt tccgtatagt gacgttgtca agaatcttca 720 tacttacatt tctcttatat ctggtgcagg attttcaaaa cacatcgaat tggcatcact 780 cctaaacttt cattgcccaa cagaaactac attacttgcg catctaaaga gcatattgcc 840 agatcttgag aaagttgcaa cagcaagttg tgaagctaat atacaagaaa ttcttaaaca 900 tgattcagaa tacaaattta atgtttcatt agattgcgca tggagttcta agcgcgatgc 960 tttacatgcc gtcattacgc ttatagatat aaatacaaac aaaatattag attttagaat 1020 agtttcaagg catccagaat taataaaatt caaaagtccc aaaattttgg ttgaaccaaa 1080 aatttcacca ggattaatgg aaagctatgc attacgtgat atgttgaaat ctgatttatg 1140 gaaaaaaata aaagtttttt gttcagatgg tgatgttaaa gattcaacaa tgatagagaa 1200 ttctggtgca caattatcac atgtaagaga tccaaatcat gtgataaaat cagcattttc 1260 aaaaattgtc actaaatatc ctaaatttga aaaaatattt aggatccttt ttgcatgccc 1320 aacattaaaa ccagagcaaa aggttcagag atggatggga ttaattaata gatatccagg 1380 agaatcacca gagcatatcg aaaaaagaga aattattaca aatattgctc cattgtttct 1440 aagaatcagc aaacaatatc atacaaattt caacgaagct ttcaattcat tgaaagccca 1500 tttagtccca aaaaatcttc gttggattat tggatttcca gccagaattt ttgtttctgt 1560 atttgcgtat aatgaagaca attggaagac aaaactaaga agattaatgg gtatagatga 1620 ttcttatttt gatgaagata ttttacaagt tatattaaat ttcgaaaata aatccgatta 1680 tgaccatatc agaagaactt cggaaaacta cagaaatcat aacagatgcg ttcattacat 1740 tccagatgaa attgacccag aattcgctga cttagttcct cataaaccgt caaaagatgg 1800 atccagatcc gaaaaaattg aatacactga cgatgaggat ccgccagcgg atgaagaacc 1860 gtcagaatct gaaaatcttt cacttccaac agattatcca aaattcaaca aaaaatctat 1920 atttggttcc gatatcaaag taccaccaga ttggaaaagg ttttcatttt tcttcaaggc 1980 gaaggaaatt ccagatgatc ctcaagaatt ccaaaattta ttatccaaat ttaaacaaga 2040 tctttataat acgtactcgt ctttaattcg tattaatgaa gaaaacatct cggctgttgc 2100 agtttacgat tctcagcaag caaaagaaat tatgaattca gcttttactg attttcataa 2160 ctttgagaaa attaagataa acgcaatcaa ttatcttaac gtcctcagcc acaatccacc 2220 atctcttgaa atgatagcag attttcatcc tgatttgata caacttccac agacactcag 2280 aaaaactcat gataagccta aaaaacaaaa gcttggtatc aagcttggaa aaaatctcag 2340 ttttcttgcc gttattgttc aacttattac atttcgacca ttaatgcaca aactatttga 2400 cgaattattc tctaatccca atgagcaaca gttaatatac aaagaaatat tatcgctcac 2460 aaatcggaac ttttcggaaa atgcagatgt tacttatctg tatcacctgt actcgaaacg 2520 gcaaaagata catcaagctg atccaattga tgctttcaaa caattacaag aatcgcttgc 2580 atgtgaagct aggcaactag atcttattga ttatttcaaa aaattctgct ggggatttac 2640 agataaaatt gacattcctc ttgttaataa taacgaactt caagttactc ctactttttc 2700 agattatctt tttattgatg ttgctggttt tacgtttcca ttggatgatt acacatggaa 2760 taacaatcca gctgtaaaat ccaaactagt agccataata acttatcaca gtcgacaata 2820 taatattatt attttggacg aaaacatcaa ttcatggatt ctcttttgta ataccaaaat 2880 atcttacatt cgtgatgagg aattggcgaa taatttaaac aaaggcgcat ataatacacc 2940 gaaatatttg attttcgaaa aaatttaggt ttttcaaaaa ttaaatcttt catttgtaat 3000 gtatgaataa aatgtatgta tttgatataa ttgatacatt cgaaattttt attctgaaaa 3060 aaataataaa caatttttgg cttttgtgat ctgataatct gtaagaataa ataccgacaa 3120 attttattaa tattcatcgt aaaagtattc ggctcttgat gtaaataagg aatatttttt 3180 ttcaaaagtt gcttttgaag actcttttat caagtagctg catttattca ttgataagaa 3240 agtatgctgt accaaaaatc gataattcac cacattaacg gccaattttg ttttgcattt 3300 cgacaaaaaa gctcctttta tataacatta ctgggcgaaa attagagtat ctacgatgac 3360 aaaacaaaat acatcaactt gattttttgc cgagaaaaat gtttttggag gtaccaaaat 3420 aggtctcgaa aagcattggt tttatttaag gttatttttt ctgagggggc tatcgtctat 3480 agggcaccc 3489 // ID BEL-22_AA-LTR repbase; DNA; INV; 609 BP. XX AC supercont1.310; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-22_AA_; KW BEL-22_AA-I; BEL-22_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-609 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.310; Positions 123114 122506. XX SQ Sequence 609 BP; 190 A; 90 C; 123 G; 206 T; 0 other; tgttgccaca tcactcggag agatggcatc cctgccatcg aatgcagtag tgtgtgacta 60 aaggatttga ctgatgtcaa aatagattac ttgcaatttt gaagtgcttt gcgacagaag 120 ttaacccatt gaatttattg gatttatgta gaatttatta ttatttcaat ttccctttat 180 taccgaatta ctagaagtaa ggctacgtta gtgctatcaa aggtatgcag cataagtctg 240 tgggttatgg tatattagag ctgatttcat ttgcaggaac taaatactcg ctgagcgtat 300 tgaatttgtg gatcgcggta gatacgggat tattattggc aggtgaaata tatccaaatt 360 gtaattaaat gtataccaaa tatgtataac tagaatgtcg ctttacacag aatttagttg 420 cgaacgggct attgaacaac agtaacgcaa gatcatgcat gtatggtctc actgaattgt 480 aggttaaact actatggatt aaaattacca tgaactaatt tgttaatctt ttttctagtt 540 tgaagcataa aataaacggc tatcaatatc agtggaagat acagttcgtt tttgttctcg 600 tttgcaaca 609 // ID hATx-6_HM repbase; DNA; INV; 3233 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hATx-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3233 RA Bao W. and Jurka J.; RT "A distinct, diverse family of hAT transposons from Hydra RT magnipapillata."; RL Repbase Reports 8(12), 1825-1825 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 553..2883 FT /product="hATx-6_HM_1p" FT /translation="MSQIKTIWNYFKKSNKGENAKCMLCSAILKISQRSTK FT GLLIHLKTKHGTDLKLNVLQQDGASTSKQHESVELESKIYIDCEDEEVILS FT KKVKTIDTFFVKELSMEKMVSRMVAKDGFTFNSFCTSSDLRYLFSKSGFRL FT PTSPNTIRSMVVKFSDSVKADIMQKFISLKKQNQKFSITFDEWTSQNSQRY FT LNLNVHMEAKHFNLGLIRIHGSCNAEYCVKLVAGRLNSFNIDLQKDIIAMT FT TDGASVMVKVGKMMSCYHQLCYAHGIQLAVIDILYKKNNGEVEESFTTNEN FT CEEEDEHENTDDDEVINKEYGLTIILDRPSAELVSNYKDLMKKVRKVVKIF FT KNSPVKNDTYLQKYILHEHGKKLKLILDCKTRWNSMFDMLERFHRLKVCIE FT KALIDVGSEIRLTQEEWSQINDLIICLAPLKLGLELLCRRESNLVTAETTF FT KFMLDKLQKQSSLLSIDLVKNDTYLQKYILHEHGKKLKLILDCKTRWNSMF FT DMLERFHRLKVCIEKALIDVGSEIRLTQEEWSQINDLIICLAPLKLGLELL FT CRRESNLVTAETTFKFMLDKLQKQSSLLSIDLAAALRERFTQRRNKNLNGL FT LVYLQNPKKYEQEIKKNENTFSMPKRTVMRQEIKKILTRVYTIERNKDEEN FT ELEEKEAVDSINNSQILTPTILSLKKELENHIQKDKKSWTENVKEIPNLDS FT LDFDKLLKQEMSAYENNGVRGKFLDIAHNYLMTISPTSVEAERAFSAAGYI FT CSPLRSRLCDDTLSRVCLLRAHFLQ*" XX SQ Sequence 3233 BP; 1209 A; 480 C; 569 G; 975 T; 0 other; tagggattgc aatcccgcat tgcgtgatcc cgcaatcccg cgggatccca cattatttta 60 gcgcatgcga gatcccgcaa aaactactgc gggatccgcg ggatttgcgg gatccgcagg 120 atttgctgag ttttgaaatt attgctttta aataaaaaaa agcaattcta taaacaaaga 180 cgaaaaagat atcaaattaa agtatttaat gtatttgcat aacaaattca tttaaaagtt 240 tactagtcat taaatgcaca ataaaatgta tactgagaaa ctcgttcaaa tttttttttg 300 ttttgttaat gattagtcat aaaatagtca ttaagtgcac aatgaaatgt atactgagaa 360 actcgttcaa attttttttg ttttgttaat gactatgttg aaatttttgc taatgagtat 420 attgaaattt ttgttgatga gtatgttgaa aacgttaaat tgaaaacgtg attaataaaa 480 agcgcactac tttttaaaaa gacttcaaaa attatttaca tcaagatttt aaaacaaaag 540 tacctactaa gcatgtcgca aattaaaact atatggaatt attttaaaaa aagtaataaa 600 ggagaaaatg cgaagtgcat gttatgttcg gctattttaa aaattagtca aagatcaaca 660 aaagggttat taatccattt aaaaactaaa catggaactg atttaaaatt aaacgtttta 720 caacaagacg gagcttcaac ttcaaaacag catgaatcag ttgaactaga gtctaaaata 780 tacatcgact gcgaagacga agaggtaatt ctatcaaaaa aggtcaaaac aatagacacg 840 tttttcgtaa aagaactttc tatggaaaaa atggtatcgc gcatggttgc taaagatggc 900 tttacattta atagtttttg cacatcttca gatttgcggt atttgttttc aaagagtggt 960 ttccggctcc ctacttcacc taatactata agatcgatgg tagtaaaatt ctcagattct 1020 gtaaaagccg atataatgca aaaatttata tcactaaaaa agcaaaatca aaagttttca 1080 ataacgtttg acgaatggac atcacaaaat agtcagagat accttaactt aaatgtacat 1140 atggaggcaa aacactttaa tcttggattg attcgaatcc atggatcgtg taatgcagaa 1200 tattgcgtca agttagttgc aggaaggcta aatagcttca atatcgacct ccaaaaagac 1260 attatcgcca tgaccactga tggagcaagt gtgatggtga aagtcggtaa aatgatgtct 1320 tgctatcatc aactatgtta tgctcatgga atacaacttg cagttattga tatcttatac 1380 aagaaaaaca acggagaagt cgaagaatcg tttactacaa atgaaaattg cgaagaagaa 1440 gatgaacatg agaacactga tgatgatgaa gttataaata aagaatatgg tttgacaata 1500 attttggacc gaccatctgc tgaattggtt agcaattaca aagatctcat gaaaaaagtg 1560 agaaaggttg tgaaaatatt taaaaactct cctgtcaaaa atgatacata tttacagaag 1620 tatatcctac atgaacacgg caaaaagctt aagttgattc ttgattgtaa aactaggtgg 1680 aacagtatgt tcgatatgtt agaaagattt catagactaa aagtgtgtat tgaaaaagcc 1740 ttaattgatg ttgggtcaga aattcgtttg acccaagaag aatggtccca aattaatgac 1800 ttaattattt gtctggcacc actaaaactt ggtctcgaac tattatgcag gagagaatcg 1860 aaccttgtca ctgcggaaac aacttttaaa tttatgttag ataaattaca aaagcaatca 1920 tcacttttga gcattgattt agtcaaaaat gatacatatt tacagaagta tatcctacat 1980 gaacacggca aaaagcttaa gttgattctt gattgtaaaa ctaggtggaa cagtatgttc 2040 gatatgttag aaagatttca tagactaaaa gtgtgtattg aaaaagcctt aattgatgtt 2100 gggtcagaaa ttcgtttgac ccaagaagaa tggtcccaaa ttaatgactt aattatttgt 2160 ctggcaccac taaaacttgg tctcgaacta ttatgcagga gagaatcgaa ccttgtcact 2220 gcggaaacaa cttttaaatt tatgttagat aaattacaaa agcaatcatc acttttgagc 2280 attgatttag ccgcggcact tcgtgagagg tttacacaac gtcgtaataa gaatttaaat 2340 ggactactgg tttacttaca aaatcctaaa aaatatgaac aagaaattaa aaaaaatgaa 2400 aatacttttt caatgccaaa gagaaccgtt atgcgtcaag aaataaaaaa aatattgaca 2460 agagtttata caattgaaag aaataaagac gaagaaaatg aactcgaaga aaaagaagct 2520 gtagattcta ttaacaattc ccaaatactt actcctacaa ttctatcgct gaaaaaagaa 2580 ttggaaaatc atatacagaa agacaagaaa agctggacag aaaatgtaaa ggagatacca 2640 aatttagatt ctttagactt tgataaatta ttgaagcagg aaatgtctgc ctatgaaaat 2700 aacggagtaa gaggcaagtt tctagatata gctcataatt acttgatgac catatcacca 2760 acaagcgtag aggccgagcg agctttttca gctgctggct atatatgcag tcctttgagg 2820 agtagattat gtgatgatac tttaagtagg gtctgtttgt taagagcgca tttcttacaa 2880 tgatactttc tcatacttat ttttaaaaat aagtaaagct aaaagacgct ctacgtaaag 2940 ctaaaagacg ttttacgtat ttttatctta tccttttgtt cattaaatta agaattatat 3000 tgatttacaa atgaattgtg tttttggctt catttacctc atcttatcaa tttaataagt 3060 tgagtatatt ttttctgaaa taattgaaag aaaaatacaa ttatcatttt gccttttata 3120 acgtgcataa gaaatctacc aatcccgcgc gggatcccgc gggatccgca cataaattaa 3180 aaaatccgcg cggatcccgc gggattgaaa agtgtgcggg attgcaatcc cta 3233 // ID Ginger2-N1_AP repbase; DNA; INV; 1530 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.12, Last updated, Version 2) XX DE Nonautonomous Ginger2 DNA transposon from Acyrthosiphon pisum. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Nonautonomous; KW integrase; Ginger; Ginger2; Ginger2-N1_AP. XX NM Ginger2-N1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1530 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TIR is 17-bp long. XX SQ Sequence 1530 BP; 529 A; 235 C; 254 G; 510 T; 2 other; tgttaccgga aaagatgaaa attcaggaat tgtcggtata tcctagtcat tgttgacaat 60 gactggaaag gttaggcaat gttggatatt tcktagtttt atcaacattt ccgtattata 120 ttaggcaatg tccgaatcga aacaaatata aacacagcgc tttatcggtg atcgatctga 180 ttatggaaca tagctatcta tactattact tattagtata taatatggta aatcctgttt 240 tatgttatcg tttttattgc ttataactta taagttataa taaaattgac tatacagaat 300 tatcgtagct ggtatcgtta tggwcacgta ctcacatccc gcaagtgact gtacagtact 360 gtttgtgcat tgaagtagtg aaacaacgtg taataacatt gttttttttc aacaatctgt 420 ttaaaaatgg atttaaatga aatgcgcata aaatttgaca acgcaattct attattaatt 480 gaatctaaac gtaacgacaa aaactttttt ttgaataatg atgaatattg tgaacgaatc 540 gaagaaataa aacaatccaa aattacgcta tctacggctg gagtaaaaaa gtgtacgaaa 600 gactatagga atgttcgtaa gtacgatatt cttgttatca acggtaaaga tcgtcttatt 660 aaaactataa ccaatgcgtc agagggtgtg cgttattatg tacaaaatga agagttgttt 720 gatatcattc attctacaca tacagcaatc ggttacggag gccgagaccg tataatggct 780 gaacttaaat taaaatacgt gaatgtgaca aaggaaacaa taatggttta tttaagtcta 840 tgttctgatt tccataaaaa atcttcaaat ccaaaaagag ggcttgtatc aaaacctatt 900 ttacattcgg catacaactc acgtgctcaa ctggatttaa tagatatgca ataacagagc 960 gtaaatgatt tccgatttat aatgaattac caagaccacc taacaaagtt cgttgtactt 1020 aaaccattaa aaacagcgta attttagagt gttttatttc ttcgattcgt tcacaatatt 1080 cattattaat caaaaaaaag ttgttgtgtt acgtttagat tgtatttaat agaattgcat 1140 tgtcaaattt tatgcgcatt tcatttaaaa ccatttttaa acagattgtt gaaaaaaaca 1200 atgttgcttc actgcttcaa tgcacataat ttaccagata tttatacaac atttggagca 1260 ccagccatac tgcacttaga caatggaaga gagtttgtca ataatactat aaacgaactt 1320 catgctatgt ggggtgatgt aaaaattgtt catgggaaac cttgacatag tcagagccag 1380 aaacttagct ctggttatca tcaatgcctt cttttattag gaattgtcat caatgactag 1440 ttattgacgt taattcctta ctaaatcagg cattatcgtt attgactagt cattgtcaac 1500 actgactgaa tgtcatcttt tccggtaaca 1530 // ID BEL2_Cis_I repbase; DNA; INV; 3206 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of BEL LTR Retrotransposon from Ciona savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; internal portion; KW BEL2_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3206 RA Smit A.F.; RT "BEL2_Cis_I - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000175 2% div; ORF from bp 610 to 3135 corresponding to CC Pao/Bel-like pol protein with a big internal deletion only CC leaving the C-terminal part of the integrase domain. Note the CC obnoxious internal 13mer repeat it carries around everywhere. XX SQ Sequence 3206 BP; 847 A; 874 C; 715 G; 767 T; 3 other; tcgttttcaa attccttgaa ctaccgtcgt cgttcccgtc cagtcgtcgt tcccgtcccg 60 tcgtcgttcc cgtccngtcg tcgttcccgt cgccgatccg tcgtcgttgc cgattcccgt 120 tgccgattcc cgtcgccgat tcccgttgcc gattcccgtc gccgattccc gttgccgatt 180 cccgtcgccg attcccgttg ccgattcccg ttgccgattc ccgtcgccga ttcccgttgc 240 cgattcccgt tgccgattcc cgttgccgat tcccgtcgcc gattcccgtt gccgattccc 300 gttgccgatt cccgtcgccg attcccgttg acgattcccg ttgccgattc ccgttgacga 360 ttcccgttgc cgattcccgt tgacgattcc cgttgccgat tcccgtcgcc gattcccgtt 420 gccgattccc gttgccgatt cccgtcgccg attcccgttg ccgattcccg ttgccgattc 480 ccgtcgccga ttcccgttgc cgattcccgt tgccgattcc cgttgccgat tcccgtcgcc 540 gattcccgtt gccgattccc gttgccgatt cccgttgacg attcccgtcg ccgattcccg 600 ttcctgaacg ttttcaattc cgttattcct gctgataagc cgcaaaatga agattccgaa 660 gactttaaat tgtctcaaac aactttacgt ggatgccgaa aggctaatgc tttctccagt 720 aggtaggacg tcactacgta agctcatacc aataatcgac cgcgtatttg acgagctaat 780 ggacttatgt gtaaattttc cagaagatca accgcttgaa gatgtttaca tatcaagaac 840 tcagttaaag agtcagaaac gtgagttcga tacgcatgtg cgtcagtggt taaataaaac 900 tgaattgcaa ttagcaagga ctaaccgcga cacgtcgagc acaatttcta ctgcaaactt 960 tatccggagt gaacgccgaa aaagccaaat caggttgaaa ctcgccgaac ttgcctacaa 1020 gcaagctatg gacaggttgt tcgaaacacg caagcgtgca aaggagcaaa cagatcgtgc 1080 tgctgaaagg acacgcgtag cagcgaaaga agccaatcaa aaagctcgcg aatctataga 1140 agcagagcgt gtcgccaaca aaaaacaacc agaatttgaa tacgcgaagg cagtttctaa 1200 agctcggaaa gatgaagaaa tcttggaggc tctagactcg acctcaccaa cattacaacc 1260 caacgttgtt gccgtagcta caacgaacaa ccgcactaca gcacggcctt cgcttgcaga 1320 cgctcgatcc aacttaagcg atcacaagaa atcagttgca gaattaacgt cgccacgaca 1380 agtacaaaat actgtaccgg taagccgtag acaccaaccc ccgccccgtg tgtacaagcc 1440 cgagcgtatc gaaataagcg cagctgttga aatcgatgaa tgctgcacca gtatcaacac 1500 ggccaaggtt tatataaacg ttgttcccgg taaagtccgt tacgcagaca aagaggttgc 1560 tacctgtgca tttatcgatc aaggttccac gacaacacgc cgtgaggagt cgatcgtaag 1620 ttcaccaaat gctgctggta attcacgttg tattacgttg caaaccctaa catcatctcg 1680 tgttttagat acagtgtccc ttgcactctc tgcccagcct gtggagggtg aagattggaa 1740 taaccttccc gacgtcgttg ctgtcgataa gatatataag aatcctaatg tattacatga 1800 tccgcaggcg ttgcagcgac accatcattt gcaaggtgtg aaaattccac agatcgaaga 1860 tggatcagtc aagctgctaa tcggggcgaa cgntccccaa gttttccgtg tcgagtccat 1920 gagaagcgga aatggtcgat gtcccgacgc tgtcaaaacc cgtctaggat ggtcacttct 1980 aggcccctgt ggtgataaaa gccgttccaa ttgtaaggcc agcgttatgt ttttaaagga 2040 tgaaagcgat ctcaaaccaa tgatcgacga tctcgatgac ctcaacgaca gctttcttcg 2100 gttcccgacc tcgattgaag accgaagagc acacgacctt atgaaggaat cagttaagtt 2160 tgttgataat cattatgaac tacccctccc atngcgccac gattatgaga ttctcccaga 2220 caatggcgcc atggcgatgg ggcgccttaa agttttagca aagcgacttg tgcaaaaccc 2280 aaatattaaa caaaagtatg tcgagcaagt gaacaacatg atgacaaagt gctattcaga 2340 gaaggtgcca tatgacgaaa tcgtcaccga ccgcagaatc tggtacctac cgcaccaacc 2400 cgttactaac cctcacaaac cggacaaact atgtgtaaat caaatacatg aatttcttct 2460 acaacgcgaa attcaatgga atttcaaccc tccctgcgct tcacacatgg gtggagcctg 2520 ggaaagaatg attagatccg ttcgtcggat tttactttcc atttcaggtg aaaagacatt 2580 gaacgatgat caactgacaa ccctgctgct ggagtccgag gcaatattga attgtcgacc 2640 tctcactccc gttacactag atatcgatgg cgagacgcca ttaacaccga accacctgct 2700 caaggtgaac ccgtcaagtg agctgcctcc aactttgacc aatgaacggc aaagttacgc 2760 tcgacgccgc tggcgctacg ttctacatct agccaaccga atttgggcca gatggtcggg 2820 agagtacctc cgaactatca tcgcacgcca gaaatggcac aaaagaaaag aaaacgtgaa 2880 gataggagac gttgtatttc tggtggacaa cacaacacca cgatcacaat ggtccattgg 2940 aagaatcacc tccgtgtacc cggacagcca cggtgtagtt cgaaatgtgc tagtcaaagc 3000 acgtgacacc gaattcaaga gaccgataca caaactatgt atcatagtac ccgctacgga 3060 cgtgcgcaat gagagtgttg accctgctac agctctcaac gagattgcgg gcattagccc 3120 ggtcattgaa tgaatcttaa cacgcgacgt aaatctatta aattgtaaat tgctcaattg 3180 tacgatctcg caatttgggg ggggag 3206 // ID Proto1-6_NG repbase; DNA; INV; 2903 BP. XX AC . XX DT 21-JUL-2009 (Rel. 14.07, Created) DT 21-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Proto1-6_NG is a non-LTR retrotransposon from the Naegleria DE gruberi amoeboflagellate genome - incomplete consensus sequence. XX KW Proto1; Non-LTR Retrotransposon; Transposable Element; KW Proto1-6_NG. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-2903 RA Kapitonov V.V. and Jurka J.; RT "Proto1-6_NG non-LTR retrotransposon from the Naegleria gruberi RT amoeboflagellate genome."; RL Repbase Reports 9(7), 1553-1553 (2009). XX DR [1] (Consensus) XX CC Proto1-6_NG is a very young familiy of non-LTR retrotransposons CC that belongs to the Proto1 clade of non-LTR retrotransposons. The CC 5'-terminal portion of Proto1-6_NG is not known. XX FH Key Location/Qualifiers FT CDS 2..2848 FT /product="Proto1-6_NG_1p" FT /note="incomplete ORF2: contains the RT and RNase H FT domains." FT /translation="TKSQPNQSDFQIHRRINSLPIHRLIKEINVLTFEYSN FT NNKQQTLDELNKKKDELKYKQKEFFKKERDRWLHMINNTSIGFLYRFDADL FT NTKNINWKNLPDSVNKDSTLQFFTQKFKKEYEEIKIKLPTSTVKEGPQEIT FT INKDDILSTLKRLKSKSTGTDFITNDLLKEMTETQATQLAIEFNKMISETK FT DIPIIMKEGWIKLIPKKDSIVTHKDIRPITIMSTLYRLLFNIIARKIGKWA FT KDNINIRQQAFTSNREVNNHILVIQALRVKYKKTNAFVITNLDIENAYDGV FT EHCVLRLAMKHCKFPPNLTTFIINAYSNHKAYIEWNDILINPLTKTRGVPQ FT GCPLAPLMYNTITQIIIDYQIENWKIPGTPSFKIEEIGTLNFADDCSLVTQ FT PNKKHNTKLKQLNKWIGNLKMKINASKSITSVTPERIALKDFTVKLDDTLI FT PRQTNQRTLGNYITEDHLFTKEIDKREARVTQILKILPIHLIEPENLKRII FT MGKTISSFIHLARGNSISNPRLQTIDSTIRGNLKRSLKIDAKVKPSTMYAQ FT FERKGLEIPNLEYIANSILCNNLINKFHSSNPLVREAVQYGINNCNTNKTK FT LHNIFEKFHYILKQHKLTTTICNTNHQLLEKEKDEQYSYSEFTIWTDGSKS FT ENKVGFSVIIQNNEFTKGYKYKLHSYHSNNMAELSAITTALHIIPPNSNAK FT VLTDSEIAVKIINNTNYKGQFKQIKDEITNLISTKKLQASIEWTKAHTGIT FT DGNNYADKKAKQASRIGHRIAPQHLLTQGQLIITNSKKVWKSRIKGVLSEK FT LYNQGTSELINRTNLNYLKEHTTPTHKYSIWRLMINGHLRSLIVDNPQCPT FT CNKNLDTEHLFIECPEMNSARNWMIEEIKKNTNLEPVLSTERTEILTPNSF FT QLNIFGILTEPIQGIIKEQWPKIQGGLSLFGGRVQKQYNKILKIQ" XX SQ Sequence 2903 BP; 1281 A; 608 C; 395 G; 619 T; 0 other; aacgaaaagt caacccaacc aatcagactt ccaaattcac agaagaatca actcactccc 60 aatacataga ctcatcaagg aaatcaatgt actcacattc gaatattcaa ataataataa 120 gcaacaaaca ctagatgaac tgaataagaa gaaagatgaa ctgaaataca aacagaagga 180 attttttaag aaagagagag atagatggct acatatgatt aacaatacaa gtattggatt 240 tctatataga tttgatgcag atctgaacac caagaatatc aattggaaaa atctaccaga 300 cagcgtaaac aaagatagca ctctccaatt cttcacacag aaattcaaga aagaatatga 360 agaaatcaaa atcaaactcc caacatcaac agtcaaagaa ggtccacaag agatcacaat 420 taacaaagat gatattctaa gcacactcaa gagactcaaa tctaaatcta caggcacaga 480 ctttattaca aatgacttac tcaaggaaat gacagagaca caagcaacac aattagcaat 540 agaatttaat aagatgatta gtgaaacaaa ggatatacca ataataatga aagaaggttg 600 gatcaaatta atcccaaaga aagactctat tgtaacacac aaagacattc gtcctatcac 660 aataatgagt acactataca gactactatt taacatcata gcaaggaaaa taggtaaatg 720 ggcaaaagat aacatcaata ttagacaaca agcattcaca tccaacagag aagttaacaa 780 ccatattcta gtaatccaag cattaagagt caaatacaag aaaacaaatg catttgtaat 840 caccaacttg gacatagaaa atgcatatga cggtgtagaa cattgtgtcc ttagattagc 900 tatgaaacat tgtaaattcc caccaaatct cactacattc ataatcaatg catacagcaa 960 tcacaaagca tatatagaat ggaatgacat attaattaac cctctcacca aaacaagagg 1020 tgttccacaa ggatgtccac tagcaccact aatgtataac acaatcaccc aaattataat 1080 agattaccaa atcgagaatt ggaaaatacc tggcacacca tcattcaaaa ttgaagaaat 1140 aggaactctt aattttgctg atgactgctc attagtaaca caacccaaca agaaacacaa 1200 taccaaactc aaacaactga acaaatggat agggaatctc aaaatgaaaa tcaatgcttc 1260 taaatcaata acttctgtca caccggaaag aatagccctt aaagacttca cagttaagct 1320 tgacgatacg ttaataccaa gacaaaccaa tcaaagaaca ttaggaaact acatcacaga 1380 agaccatctc ttcacaaaag aaattgacaa aagagaagct agagtaacac aaatactaaa 1440 aatacttcca atccatttga tagaaccaga aaatctcaaa agaatcataa tgggcaagac 1500 aattagtagt tttatccatc tagcaagagg aaattcaatt tccaatcctc gcctacaaac 1560 aattgactca acaattaggg gtaacctaaa aagatcactg aagatcgatg caaaggtaaa 1620 accatccaca atgtatgcac aattcgaaag aaaaggacta gaaattccaa acctcgaata 1680 catagccaac tcaatccttt gcaataactt aatcaacaaa ttccactcaa gtaacccact 1740 tgtcagagaa gcagtacagt acggaattaa taactgtaac acgaacaaaa caaaactaca 1800 caacatattt gaaaaattcc actacatcct aaaacaacac aaacttacca caacaatatg 1860 taacaccaat catcaacttc tggaaaagga aaaagatgaa caatactcct actcagaatt 1920 cacaatatgg actgatggat ccaaatcaga aaacaaagta ggattcagtg taatcattca 1980 gaacaacgaa tttacaaaag gatacaaata caagctccac tcataccact caaacaacat 2040 ggctgaatta tcagcaatta caacagcact acacatcata ccaccaaaca gcaatgcaaa 2100 ggtactcact gacagtgaaa tagcagtcaa aataattaac aacacaaatt acaaaggtca 2160 atttaagcaa atcaaagatg aaattaccaa cctcatcagc actaaaaaac ttcaagcatc 2220 catcgaatgg acaaaagctc atacaggaat aacagatgga aataactacg ctgataagaa 2280 agccaaacaa gcctcaagaa taggacatcg aattgcaccc caacatctac taactcaagg 2340 gcaattaatt atcaccaatt cgaaaaaagt ttggaaaagt agaatcaaag gagtacttag 2400 tgaaaagctc tacaatcaag gaacatccga actcatcaac agaacaaatc tcaattacct 2460 caaagaacat accaccccaa ctcacaaata ctcaatctgg aggctcatga taaatggaca 2520 cttaagaagc ctcatagtag acaacccaca atgcccaaca tgtaacaaaa accttgacac 2580 agaacacctc ttcatagaat gcccagaaat gaattcagct agaaactgga tgattgaaga 2640 aatcaaaaaa aataccaatc tcgaaccagt actaagcaca gagaggacag aaatattaac 2700 accaaactca ttccaactga atatctttgg tattctgaca gaacctattc aaggtattat 2760 caaagaacaa tggcccaaga tacaaggagg cctctcactt tttggaggaa gagtacaaaa 2820 acaatacaac aaaatcttaa aaattcaata agacgttcca aatggacgaa cccgtcgggt 2880 tcattcgatt aataaacgtt taa 2903 // ID DNAX-1_TCa repbase; DNA; INV; 1517 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1517 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 670-670 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1517 BP; 553 A; 222 C; 185 G; 557 T; 0 other; tgtacagggt gctgcatttt ggatgtccga ataggggatc tccgaaacta agaggtttag 60 agaaaaacga gtgacacatt ctcgggtccg ttttttgaga aaataatttt ggtcaaaacc 120 gcatctcgct atcgtctttt gttttcgact tataaacaaa agttgacatt ttagcgaaat 180 cttaaaaaat tcatatcttc cttattataa aagatacaca tttgaaacaa agacattata 240 caggcacttt ttttcagaga atttaataat gttatcagcg atttttttca accattcgtt 300 taactgtaat aacataaagt tgtatttttt taaataggaa tcatagttgg ctatgacatt 360 ttcgaaaagc ttattttttt ctgatttgat atgtctattt gtgtaataca ttaattttaa 420 atatttaaga aaaaacaaaa catttcgctt tttctttata gtaggccatg aaaaaatttt 480 ggatttttaa ttaaaactat gaaaattgta ttacccaaca agtatttata atttaatgat 540 tttttaaacc ctaattaatt caaaaacagc ataataaatt attattagat caaatttttg 600 caattaataa aaattatttg tttttatatt taaaatggca agcttacgtt gtcattctat 660 tttccaattt taaacatgaa atgtcggaag aagaaaaatc ttattaattg tatgattgcg 720 tatcaataaa attttatttt acaaaaaaat gtcaatccaa acaccaaaat caaattccat 780 ctgcggtaaa aaaatacaca acattatcaa cgtacacctc cattgcactt cactaaatca 840 taaaaaagtg tcatcaaatg gtagtgacac atcgttttca attttctgca cattttttat 900 gtaattaaaa tctgcgacga tcgtatattt gttcaattga aaataacttt aacatagctt 960 atttttataa tttcacattt tttactttaa taaccgttca caaaatttct agattattca 1020 caccttttat gtgaaataat ttttccatgg ccaactactt tcaaatattt taaatatatt 1080 ttttatcaaa agtatataaa aatatcaaac tatatttata ttgttttgag ttcagaaaaa 1140 aataagcttt tcgaaaatgt catagccaac tatgattctc atttaaaaaa aatacaactt 1200 tatgttatta cagctaaacg aatgattaaa aaaaatcgct gataacatta ttaaattctc 1260 ctgtaaaaaa gtgcctgtat aatgtctttg tttcaaatgt gtatctttta taataaggaa 1320 gatatgaatt ttttaagatt tcgctaaaat gtcaactttt gtttataagt cgaaaacaaa 1380 agacgatagc gagatgcggt tttgaccaaa attattttct caaaaaacgg acccgagaat 1440 gtgtcactcg tttttctcta aacctcttag tttcggagat cccctattcg gacatccaaa 1500 atgcagcacc ctgtaca 1517 // ID RTE-5_BM repbase; DNA; INV; 3040 BP. XX AC . XX DT 30-APR-2010 (Rel. 15.07, Created) DT 30-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-5_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3040 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1057-1057 (2010). XX DR [1] (Consensus) XX CC ~96% identical to consensus. XX FH Key Location/Qualifiers FT CDS 692..2554 FT /product="RTE-5_BM_1p" FT /translation="MTFGAWNVRTLLDRDGNACPERKTAIVARELHRYNVD FT VAALSETHLADEGELVEVGAGYTFFWKGTAASETRHSGVGFAIKNHLVKRL FT EEYPVHISDRVTTLRVHLDKDNYLNVISVYAPTLDKSDDIKDKFYGEVTLC FT LDSINAREQVLLLGDFNARVGRDYEAWPGVLGRHGVGNMNSNGQLLLSLCA FT QYGLAITNTMFRLAAKYKTTWMHPRSKHWHLIDYAIVRQRDFSQVQITRVM FT RGAHCWSDHRLIVTKLRLRLRRPRRSPMKKLASLDIEKLRDPEVRKNYAEA FT LSKKLLSVNETGDVDADWGTLSSHIMDTASIALGRKSRRNEDWFDGNDGVL FT RAALDKHRDLLRQHRRRKGSVAAVKASDLELRKLSRQIKDKWWQDKTSHMQ FT WLTDTNQLGEFYSEARKLMGTSNHAKVPLKSLDSRHLLTSKEDVLKRWAEH FT FNALLNVDRSADLQSIALMPQLPLALELDEPLLRDEVVAAIRQQKDKRAAG FT ADLIPGELIKYGGEELYTLVWELFVRMWEEERVPDSFKVSRITALYKNKGD FT RSDCNSYRGISLLSAPGKVFARVLLNRLKDLSEKILPETQFGFRPDRGTCE FT AIFSVRQLQEKSREQGRQLYLWH" XX SQ Sequence 3040 BP; 799 A; 754 C; 808 G; 677 T; 2 other; ggcagggggc gctcaaatat actatctgtc cggtagtata ttactgagtg cacgggcacg 60 cgatcccgcc ttccctgagc tgctatagac cacaacctac tcttccgtgg ctacagtatc 120 ctgtccttcc tactcctaac tccctcttct cctcatctct ttaataccca tccctacacg 180 cggctccctc ccgcttaaca cacaaagtct gacttggcga ggaaggcaac acaacgcaga 240 tatgcaacgt gacttggcgg actcggcggg tcagcaaggc aaggtggatg gcctagggcg 300 acggtggagg gcatatagtc accgtcggcc agacacaaca tccaacccct tcttcagtcg 360 tagcggtttc ccgttccact gatccaagaa gggataggac gagggtgtgc gcctgcgtga 420 gcgcaaacgt ttgcgcacct aaatatatcc tgcgtatctg accgcttccg ttaatggact 480 ccgaacaacg gcgtgggttc ggcagcaccc gagatgacaa agcagccctg tttagggagg 540 cactgctttc cccgctcagc cggggagggg gctagaaaag gtgccctaaa aattgctcgt 600 cccatttgat aagtggtagt aaccgcggct gccactgcat cagtcaaatc gtggtcgacg 660 ttgcaaaaga aatattataa aaacatctat catgacattt ggcgcgtgga acgtgagaac 720 gcttcttgat cgagatggca acgcctgccc tgaacgcaag accgccatag tagctcggga 780 acttcatcgc tacaacgtag atgtggctgc tctcagcgaa acacaccttg ccgatgaagg 840 agagctggtg gaagtaggtg ctggttatac cttcttctgg aaaggtaccg ctgcctctga 900 aacgcgacac tcaggcgtag gatttgccat caagaaccac ttagtaaagc gattagagga 960 gtaccctgta catatctcgg accgcgttac cacactgcgn gttcacctgg acaaagacaa 1020 ttaccttaat gtcatcagtg tctatgctcc aacgcttgac aagtctgatg acatcaagga 1080 caaattctat ggggaagtga ctctttgcct tgacagcata aacgccagag agcaggtact 1140 attgctgggc gacttcaatg ccagggttgg tcgggactat gaggcttggc ctggagttct 1200 gggtagacac ggagtcggca acatgaacag caatggtcag ttgctgctca gtctttgtgc 1260 tcaatacggt ctagcaatta cgaacactat gtttagactt gccgctaagt acaagacaac 1320 atggatgcac ccaagatcca agcactggca tttgatcgac tatgctatcg taaggcaaag 1380 agatttcagc caagtgcaga tcacccgtgt tatgcgtggt gcgcactgct ggtctgacca 1440 ccgacttatt gtcactaagc tacgactccg cctccgccgc ccgcgtagat ctcctatgaa 1500 aaagcttgcg tctttggaca tagagaagct aagagatcct gaggtgagga agaactatgc 1560 tgaagcgttg tctaaaaagc tattatcagt taatgagaca ggcgatgttg atgctgactg 1620 gggaacttta tcatctcata tcatggatac cgcttcaatt gcactgggta ggaaaagccg 1680 tcgcaatgaa gactggtttg atggaaatga tggagttttg cgggcggcac tcgacaaaca 1740 ccgtgatctc ctgcgacagc acagaagacg taaaggaagc gtggcggcgg tcaaagctag 1800 tgacctagaa ttgcgcaaac tgtcacgaca aataaaagac aagtggtggc aggacaaaac 1860 tagtcatatg caatggctca ctgacacaaa ccagctcggt gagttttata gcgaggcgcg 1920 taagctgatg ggtacatcca accatgcaaa ggttccttta aagtctctag atagtaggca 1980 cctcttaaca agcaaggagg atgtactaaa gcgctgggca gagcacttca atgccctgct 2040 gaatgtggat cgatcagcag atctgcaaag catcgcttta atgcctcaac tccctcttgc 2100 ccttgagctg gacgagcctc tattgcgtga cgaggttgtt gctgccatca gacagcaaaa 2160 agataaaagg gcggctggcg ctgatcttat accaggagag ctgatcaagt acggcggaga 2220 ggagttgtac acgttggtgt gggagctgtt tgttcgtatg tgggaggaag agcgcgttcc 2280 ggatagcttt aaagtatcgc gcataactgc tctctataag aacaaaggcg accgatctga 2340 ctgcaactcc taccgcggta tttcgctcct atcggccccc ggaaaggtct ttgctagagt 2400 actcttaaac cgcctaaagg acttatctga aaagatcctg cccgaaactc agtttggctt 2460 ccgcccggac cgaggcacat gcgaagcgat cttctctgtg cgtcaactac aagaaaagag 2520 cagagagcag ggtcgtcagc tgtacctctg gcattgaggc ctatttaatg cgtcggcagc 2580 ttcgntggtg cggtcatgta tcacgcatga cggaggaaag agtggcgaag cgcatcttct 2640 tctctgaatt gcaggacggc aagcgaaagc atggcggaca actcctgcgg tacaaggatg 2700 tcgtgaaacg acacatgaag agatgtgata tagagccctc tcaatgggag cgcttggcag 2760 cacagcggcc agaatggcgc aggatggtga acggcaaagt acgcgagttc gaggatcagc 2820 gtaaagctga ccttgattac aaacgcgacc agttgaaggc ccgcccacct gctgccataa 2880 cctataatta tgagaacggc gtgcttacgt gccctcaatg cgcaaggagt tttgccgcga 2940 agataggcta tataagccat cttcgagcgc acgagcgcca aatcgacgga taggagtcaa 3000 agtggtcgcc atggccgaaa tcggtcggat gaatcatcat 3040 // ID DNA8-76_AP repbase; DNA; INV; 455 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-76_AP. XX NM DNA8-76_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-455 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2012-2012 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 455 BP; 167 A; 72 C; 70 G; 146 T; 0 other; cataggcgtg tccagacctc atccgcaggt cgtgcacaac agtgacatat ccaggggagg 60 ggttactgca ccccccccca aaaaaaaaaa ttagattaag gtttgggaat atgaccccca 120 aattttgaaa taagctttat catctgataa taattaaatc agttttactt tttttattag 180 gaataaaaaa caatattcct tttatcagat aaataaataa ctaggtaggt tatatgaaat 240 tgtatttagt aattttaaaa tataatttta atactaataa tactttttat aaatgtttat 300 ttttacgtga tgaaaatcaa aaaacttaat acgaaatcag aaacttagat aaaaatgttt 360 atattatgca ccgcaattta taatgtacct aagttggatc aaatttcgca ggtcatgcct 420 gtgcatgaca ggcatgacct gtagacacgc ctatg 455 // ID Copia-1-I_BF repbase; DNA; INV; 5195 BP. XX AC ABEP02005391.1; XX DT 16-JUN-2009 (Rel. 14.06, Created) DT 16-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE LTR retrotransposon from Branchiostoma floridae: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1-I_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5195 RA Kapitonov V.V., Bao W. and Jurka J.; RT "LTR retrotransposons from Branchiostoma floridae."; RL Repbase Reports 9(6), 1166-1166 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 112..4329 FT /product="Copia-1-I_BF_1p" FT /translation="MATCERTRLPLFDLTLKTYEQYRFEIQCWNQVTKIPK FT KQRGIEILLSLPEGPKDEFGTREFLSSRLAMEDLTAEDGYDKVLAKLDEHL FT RRDDMGRLWESFVNFDKFQRTKDMSVSEYISRFDILYHQLNKTGDVTLPAS FT VLGLLLIRRANLTPDESKLVLTGLDYTKEDTTLFVQAKASLRKFSDSLVAP FT CTLASSSDMGAFSMPRVKQEIVNVADRVPDREVYQTARGGFTKWSRKPNKN FT KSVGRGQSVQNGGAHATRDSRSGGQRSARRDSGFNPKSRNGDYLRCYDCGS FT IKHLLDSCPHAQVQKANLTENDAEFAEDPQFAVLFTGASKCLMSELSTEAS FT LCAVLDSACSSTVCGLSWLKKYLSALGEKDLKHVQQEASSNVFKFGGGEKL FT VSLGKYRIPAKLANNDVLIVTDVVDSDIPLLMSLDAMKKADIVLHTREDRA FT TIFGKNVNLNLTTSGHYCVSLVRDPNIEVSEVLQVDLGEDVSQTKTKLLHL FT HRQFAHPNAEKLTQLLKGANSWRSHYAAILQEICESCDVCKRFKRGLARPV FT AALPKSTRFNQVVAMDLKKWRSQYLLYFVDEFSRLTVAKRIARKHPKEVVD FT AFMGLWMACGYGIPEQIMVDNGGEFTAEEILEFSSRLNIKVNTTAGHSPFS FT NGVCERNHAVMDVMLEKLVYENPKTPIDQLIAWACTAKNAMCMFAGYSPYQ FT IVFGRNPVLPGFETHPPSTNDIEGDILLKHLTALAAARRAFTEAESSERVR FT RAXRHKIRIAERHYAPNDHVYYKRDDSSEWFGPAKVVVQDGKIVFLRHGAY FT VIRVHVNRVVLSGQEYEKQTPTAVGTSPPAVNDPTTDDNGQERQNWTFPKT FT VDDVGDHRGFHGAGQTYDPDERRAGEQGGAESWKRDEHGRLYSVPTGDSES FT PDVLCVSTGNVVLHTKRVDINVQRARNEELQKLKDFETFEVVDDAGQERIS FT TRWVDTIKEDGSRKSRLVARGFEEHDHIQSDSPTISKSVLRVFIVLCHMYG FT WTVKTMDIKSAFLQGETLSREVLVQPPRGYEMVGKLWKLRKCLYGLNDAAR FT AFYMSVKHTLLQLGCQTSDLEPAMFLYQVNGTLRGFILTHVDDFLYGGDDL FT FEDKIIKPLSAKYHTSRLESGTFRYVGINIVQGTDKVVLHQNDYLQGLSDD FT DLPTPTRESDRDLNSVEYSLFRSLVGKLNWLSNGTRPEMSFKTIDLSTKFA FT HATTAHLRDAVKTVKKLQVSEGHLVYPRLHGFQDVGLTVFADASLANLQNS FT GSCGGHLIFMSDSYGKCALVAWHSGRIKRVCRSTLAAETLAMANALEEAVY FT LREILQTSTATNVPITAISDNKSLVQAVASTSLVLEKRLRIEVSTIKELVE FT LSNVTLKWVPGSHQLADVLTKKGVTADSLLTVITTGNMIKLP*" XX SQ Sequence 5195 BP; 1459 A; 1112 C; 1324 G; 1299 T; 1 other; aatggtagca gagggtgcaa ctctagacag ttagagtcaa cggtccgggt cgacagtgcc 60 cgatttctcg gacgttttct gtgcaccggc ttcgaagacg acctttccaa gatggcgacg 120 tgtgaacgga ctcgtctccc actcttcgat cttactctta aaacttacga gcaatacaga 180 tttgaaatcc agtgttggaa ccaggtcaca aagattccca agaaacaacg tggcatcgag 240 atccttctca gtcttcccga agggccgaag gatgagtttg ggacaaggga attcctctcc 300 tcccggcttg ccatggagga cctgacggcc gaggacggct atgacaaagt tctcgccaaa 360 ctggatgaac atctccggcg tgacgatatg ggaagattat gggaatcctt tgtgaacttt 420 gacaagtttc aacgtaccaa ggacatgtcg gtgagtgaat atatatcccg atttgacatt 480 ctatatcatc agctaaacaa gacgggagac gtgaccctcc ccgcaagtgt attgggatta 540 ttgcttatcc gccgggcgaa tctcacccct gatgaatcga aactggttct caccggtctg 600 gattatacaa aggaggacac gacgcttttt gtccaagcca aagcctcttt gcgcaaattt 660 tcggacagtc ttgtagcgcc ctgtacgtta gcctccagtt cggatatggg tgctttttcg 720 atgcctcgtg tcaaacaaga gatcgtgaac gtggcagacc gggttccaga ccgggaggtg 780 taccagaccg cccgtggggg gttcacaaaa tggagtcgga agccaaataa aaacaagtca 840 gtcgggaggg gtcagagtgt acaaaatggc ggcgcccatg ccacccgtga ctctaggtca 900 gggggtcaga ggtcggctag acgggattct ggatttaacc ccaaatcaag aaatggggac 960 tatcttcgct gctatgactg cggttcgatc aaacatttgc tcgactcctg cccacatgca 1020 caggtacaga aagcgaatct gacagagaat gacgcagagt tcgcagaaga tccacagttt 1080 gcagtattgt ttacaggtgc ttcgaagtgc cttatgtcag aactgtcaac agaggcgtca 1140 ctttgtgctg ttctcgattc ggcctgttct tccactgtct gtggcctttc ctggttgaag 1200 aaatatctat ctgctttagg ggagaaagat ctaaagcatg tccaacagga ggccagctcc 1260 aacgtgttta agtttggcgg gggcgagaaa ttggtttctt tgggcaaata tcgtattcct 1320 gccaaactgg ctaacaacga tgtcctgatt gtcacggatg ttgtagacag tgacattccg 1380 ctcctcatgt ccctcgatgc tatgaagaag gctgacattg tgttacacac gcgtgaagac 1440 agggccacca tctttggtaa aaatgtcaac ctcaatctga ccacctcagg tcactactgt 1500 gtgtccttag tacgtgatcc taatatcgag gtcagcgagg tgttgcaagt agacctggga 1560 gaggatgtgt cccagacgaa gacaaaattg ctccatctac atcgccagtt tgcccatccc 1620 aatgctgaga agctgactca gctgctcaag ggtgcaaatt cgtggcgaag tcattatgca 1680 gccattctgc aggagatatg tgaaagctgt gatgtatgta agcggtttaa gagagggctg 1740 gctcggccag tggcagcttt acccaaatcc accaggttca accaagtcgt ggctatggac 1800 ctaaagaagt ggcgttcgca atatctgttg tatttcgtgg atgagttttc gcggctgact 1860 gtagcaaaac ggatcgctcg aaaacatccc aaagaggtgg tagatgcctt tatggggctc 1920 tggatggcct gtggatacgg catcccggaa caaatcatgg tggataatgg aggggaattc 1980 acagccgagg aaatcctgga gttcagcagt aggctgaaca taaaagtgaa taccactgcg 2040 ggacattcac ctttctccaa tggtgtgtgt gaacgtaacc atgcagtgat ggacgtcatg 2100 ctggaaaagt tggtttatga gaaccccaaa acacccatcg atcagctcat tgcatgggca 2160 tgtacagcta agaatgccat gtgtatgttt gctggatatt ctccatatca aatcgtgttt 2220 ggtagaaatc cggtgttgcc cggttttgag acgcatcctc cgtctacaaa cgacattgaa 2280 ggtgacatcc tgctgaaaca tctgacagct cttgccgcag ccagaagagc attcacagaa 2340 gcagagtcct ctgaacgagt acgccgagct ntacgccaca aaattagaat tgccgaacgt 2400 cactatgctc caaacgacca tgtgtactac aagagagatg attcatcaga atggtttggc 2460 ccagcaaagg ttgtagttca ggatggcaaa attgttttcc tccgacatgg cgcatatgtg 2520 attcgggtcc atgtcaatag agtggtcttg tcgggtcagg aatatgagaa acaaaccccc 2580 acggccgtgg ggacatctcc acctgcagtg aatgatccca caacagatga taatgggcag 2640 gaacgtcaga attggacctt tcccaagact gtagatgacg tgggtgacca cagaggtttc 2700 cacggtgcgg gtcagacata tgatccagat gagagacgcg caggtgaaca aggtggtgcg 2760 gaatcctgga aacgtgatga gcacggcaga ctgtacagtg ttcccactgg agactccgag 2820 tcacccgacg tgttgtgtgt gagtacaggc aatgtagtgc tccacacaaa acgggtagac 2880 atcaatgtcc aaagggccag aaatgaagaa ctgcaaaaac tgaaagattt tgagactttt 2940 gaagtcgtag atgacgctgg gcaagaacgc atatcaacca ggtgggttga taccattaaa 3000 gaagatggct cgcggaaatc gcgtttggta gcgagaggct ttgaagagca tgatcatatc 3060 cagagtgata gccccaccat cagtaagtca gtgttaaggg tgttcatagt actatgtcat 3120 atgtatggat ggactgtaaa gactatggac atcaagtctg ctttcctcca aggagaaacg 3180 ctcagtcgtg aggtccttgt tcaacctcct cgcgggtatg aaatggtggg taagctatgg 3240 aaactccgga aatgcctgta tggcctcaat gatgctgctc gggcgttcta catgtcagtg 3300 aagcacacac tcctacagct agggtgtcaa acctcggatt tggagcctgc catgttcctg 3360 tatcaagtaa atgggacctt acgcgggttc atcctgactc acgtggatga tttcctatac 3420 gggggtgatg acttatttga ggacaagatc atcaagccac tttccgcgaa gtatcacacc 3480 agtagactgg aatccggaac cttcagatat gtgggcatca acattgttca gggaactgac 3540 aaggtagttc tacaccaaaa tgactacctc cagggtttgt ctgacgatga cttacccacc 3600 cctacgcgtg aaagtgacag agatctcaac agtgtagaat attccctatt ccgctcactg 3660 gttggcaaat tgaactggtt gtcaaatggt acgcgaccag agatgtcatt caagacaatt 3720 gatttgtcta caaagtttgc acatgccaca accgcccacc tgagggatgc agtcaagact 3780 gtaaagaagc tccaggtatc agagggacac ttggtgtatc cgagacttca tggctttcag 3840 gatgtaggcc tgacagtttt cgctgacgca agcctggcta accttcagaa tagcggtagc 3900 tgtgggggcc acttaatctt catgtctgac tcctacggga aatgtgccct tgtagcctgg 3960 cacagcggca ggattaaaag ggtgtgccga agcaccctgg cagcggaaac ccttgcaatg 4020 gcgaatgcgc ttgaagaagc ggtctatctt cgggagatcc tgcaaacaag caccgcgaca 4080 aatgtgccta taaccgccat ctcggacaat aaaagtctgg tccaggcagt tgcgtccaca 4140 agccttgtac tggagaagcg tctgcgaatt gaggtttcta caatcaaaga actcgtggaa 4200 ctatccaatg taaccctgaa atgggtaccg ggaagtcatc agttggcaga tgttctgacc 4260 aagaaagggg tcactgccga ctctttgcta actgttatca ccacgggtaa catgatcaag 4320 ttgccataga acttttgcat gcaaacaacc aacaaaacca aaagtgagga cactagtgaa 4380 gatactttct agctatgtaa tcaccagagg gtgaacagac attagaggaa ggacagtgaa 4440 gtttaatggg taagagactt ggcaagaagt ctacaagtag aggaaggaca gtgaagttta 4500 atatgcaaga gacttggtaa gaaagctaca gtcactgtat tacgctgttg tgaagatagt 4560 gaagtttaat gtgtaagaga cttggcaaga agtctacaaa tacaagtatc gcattacgct 4620 gttgtgaaga tagtgaagtt taatgtgtaa gagacttggc aagaagtcta caagtatcgc 4680 attacactgt tgtgaagata gtgaagttta atgtgtaaga gacttggcaa gaagtctaca 4740 agtatcgcat tacactgttg tgaagatagt gaagtttaat gtgtaagaga cttggcaaga 4800 agtctacaag tatcgcatta cactgttgtg aagatagtga agtttaatga cttggcaaga 4860 agtccacagg cattgttatg taaatctgat gttgaagttg agaaaacagt taatgtttgt 4920 aacaaacttc atctaccact ttatgtagtt catgtagttt acttaatgta gtaagaacgt 4980 tttatgtatc tactttgtta taaagttatc tagtctctgt agaacgcagc ctttaaatac 5040 agttgatgta actttgtagt catgtaacct ctatagttct aaatatagtt ctatatgctc 5100 tatagttctt tagacaatgt taaaaatatt tggtagactt caatgtttca acttatgtaa 5160 cgtcatctag agtagaaaag aagaaaaagg gggaa 5195 // ID BEL-57_CQ-LTR repbase; DNA; INV; 300 BP. XX AC AAWU01004587; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-57_CQ_; KW BEL-57_CQ-I; BEL-57_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-300 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 268-268 (2011). XX DR GenBank; AAWU01004587; Positions 81583 81882. XX SQ Sequence 300 BP; 82 A; 74 C; 64 G; 80 T; 0 other; tgttgggaca cgctagtgcc caaacacgaa gcccgaatag ataagagaga atttgaattc 60 ggttacatta gtattagcac actatcacat tatcacaacg ctcggccagc ggcccgatca 120 cgagagaatt tggctaccta ctttgcgaca aggtcaagtt cgtggtgatc gccactcaca 180 agttgagttg aaatatagag ttcaaaccag cctcacgcgt ttttttgatt ctcgactctt 240 ctatcgaaga agtgttcgtt agccaccgat attccgcgtt ttgacgaccc cttcgtaaca 300 // ID BEL-607_AA-I repbase; DNA; INV; 6427 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-607_AA_; KW BEL-607_AA-LTR; Pao_Bel_Ele58; BEL-607_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6427 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5084-5668] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 737..6046 FT /product="BEL-607_AA-I_1p" FT /translation="MTGKKDKKLAEDKLILKRVLATRDVVERFISEYDHER FT DSGEVVVRLETLNRTNKEFLHIQSEIEKLDDEENFEHHMAMRNEFENRFCK FT MKGFLLAKRPTERNHPLLNSTMTSSTLAPLHNVTGFHHRLPKIDLPKFSGD FT ESRWISFRDNFLSMIHGNDDIPMVNKLQYLLQSLEGDARKPFESVDIQADN FT YAPTWDALMKRYDNKRFLRKELFRRLFELPSMKRESAQELNTLVDDFQRHV FT KALAKLGEPIQHWDTPLTFILCNKLDSYTLRAWEQETRQKDEVKYDELIEF FT LTQQVRMLKSVSSDLQYRSQGPSIKVAGPAPKKSSTVKSIVNSATNEPKSN FT VPQCHACPQKHQLFQCPTFANMPVLQRRELVSQRSLCWNCFRPNHQAKACK FT SKFSCRTCHAKHHTLLHDQVAPARSTPNPAGSSNTSMQQPIPSTSANVGTA FT GSSNPPEVNLSVQSSRSTVLLETVVLSIVDKHGKEITARALLDSAAMSNFI FT TKKLANALATRQAVVDIAVAGIGESMKQIKRQISATIKSQSKPFSTTLEFL FT IMKKPTANLPTIAINTASWNIPKLPLADPNFNVPGMIDIIIGGECYHEIHT FT GNRLSLGDGLPFLVDTLFGWTVSGKTAINSSIAPPVCYLSTVDRSLETALQ FT RFWELEAVDHGPTYSTEEKRCEEIYANTTTRTHSGRYVVRLPRSEDPQVTL FT GDSRTIATRRFYSLERRLEKDSALKQTYHEFIDEYLHLGHMRKLDVVDDDV FT PHCYLPHHPVFKESSTTTKVRVVFDASCKTSSAVSLNDTLLVGPVVQQPLV FT NIIIRFRFHVVAFVADVEKMYRQVLHHSVDQHFLRILFRRNPSEPLATYEL FT LTVTYGTAAAPYLATRTLLQLAQDEGDKYPDAVQAVVEDFYVDDLLSGADN FT VEAAIKKRCQITNMLASAGFPLRKWASNEPDALAEVPPEDLAIEPLHDLQD FT DQSVSTLGLVWDTRTDTLRFNVEMPLPAPILTKRKIISYIARIFDPLGLVG FT PVIATAKMFMQRLWKLKTEDNHPYEWDRPLPPKLQEEWKKYHATLDILSEL FT RIPRFVSFAGAPSIQLHFFSDASELAYGACCYVRSEGDDRITVRLLTSKSK FT VAPLATKHSIARLELSAAVLSTRLFQKVEQSLRTTADVFFWTDSTTVLQWL FT NSAPNRWKTFVANRVSTIQSNTEGHSWKHVPGIDNPADELSRGLMPTELPN FT QSRWWNGPAWLSQSPSHWPDASIPQEESSLVAEEARKVTLVSLESEGVQFA FT DHLFSRFSTYTKLRRVVAYCIKYIRSLRAVVQQSTEDRSLILTTVDLMKSD FT QALARLAQQQTFSEELSELHATGVVKGAPLKWLKPRLSDDGILRVGGRLEN FT ANASEASKHPIILSAKHPFSRLLADHYHKTLLHAGPQLMLSTLRQRYWILG FT GRDLTRSTYHRCVKCFRIKPNLIQQSTADLPISRVTPARPFSISGVDYCGP FT VYLKSPVRNRPPTKAYIAIFVCFSTRAVHIEPVSDLSTPAFLSALRRFIAR FT RGRIQELHSDNGTAFKGAANSLRRLYEMLKINNSDREAIIRWCADQEIQWK FT FIPPRAPHFGGLWEAAVKSRKHHLLREVGQTSLSWEDMTTLLAQIEMCLNS FT RPLTPIPTESTDLEALTPGHFLVGSNLQAVPEADVANIPDNRLDHWQRTTK FT QLQRIWTRWYPEYLAQLQARANKGCKSPVNLQLGRIVVIKEENLSPAQWPL FT GRITKLHPGKDGIVRVVSLKTATAENVVRPVAKVALLPIQTEEDHQSTEP" XX SQ Sequence 6427 BP; 1634 A; 1781 C; 1527 G; 1484 T; 1 other; ttggtccttc gagccggatc ggaggaagca tccaagccca aatcgacgcg cgaattggtg 60 gtattcgcca ttgtggtaag tggtcagagg acgttgacca cagaaggtag gccattcaag 120 ctatctctat tgatcagcaa attgctgcac acacgatcgg cgccaacctt tgcccccagt 180 acagaagccc atagtacgta ctgtgtgagg aataacgtgt aacgttgcat gcttgtattg 240 ctaggcgaag aacgatcgaa tcgtttgtat gtagaataag cgcacgtaga agtttggcaa 300 aatcatatgc ggtggcatat aggctcaccc aattagcctg ttagataatt gccttaggtg 360 cgctatacag tgtcgcacac atttgttggt tcggtggcaa acccaaataa atcatttaat 420 ttttaattga ttttcaattt cctacgattg cgcggtggcg tttttagcgg atccgcgaat 480 cgtttaagga gacacgtaac cgcgtcggga aggattattc gttacgattt tacgtgttct 540 ctatcggtcg ttcgttccgt gttcgcgtag aagcatcacg tgacgcgacg ggaaggagat 600 acacgttcga ttatccgtgg tgtttccgtc gttcgcgtac gagcagcaca acgcgacggg 660 aaggagttat tcgtgcgatt ttgtgttgtt cacgtttcgg ttcgcgcagt caagtgattc 720 gtggtggtag taagtgatga cgggcaagaa ggacaagaag ttggcggagg ataagctgat 780 tctcaaacgg gtattggcaa cgcgagatgt ggtggaaagg ttcattagcg aatacgacca 840 cgaaagggat tctggtgaag tcgtcgtgcg gttggaaacg ctgaacagaa ccaacaagga 900 gtttctgcat atccagtccg agatcgagaa gctagatgat gaggaaaact tcgaacacca 960 catggcgatg cggaatgagt ttgagaaccg gttctgcaag atgaagggct ttcttctcgc 1020 caagcgaccc acggagcgga atcatccgct tcttaattca acgatgacgt catcaacact 1080 agcgccgctt cataacgtga ctggattcca tcatcgtttg ccgaaaatcg acctgccaaa 1140 gttctctgga gacgaatcgc gctggatttc gttccgggat aattttctat ccatgatcca 1200 cggcaacgat gacattccta tggtgaacaa gttgcaatat ctgctgcagt ctttagaagg 1260 agatgccagg aagccgtttg agagtgtcga tatccaagcg gacaactacg caccaacctg 1320 ggatgcgctc atgaagagat acgacaacaa gcggtttttg cggaaggagt tgtttcggcg 1380 attattcgaa ctgccatcga tgaagcgtga gtcggcgcag gagctgaaca ctctggtgga 1440 cgacttccaa cgacacgtga aggctttagc gaaacttggt gaaccaattc agcattggga 1500 caccccgctc acgttcattc tctgcaacaa gctcgattcc tacacgttgc gtgcctggga 1560 gcaggaaact cgacagaaag acgaagtcaa gtatgatgaa ctgatcgagt tcctcacgca 1620 acaagtccgg atgctgaagt ccgtctctag tgatctgcag tatcgttccc aagggcccag 1680 catcaaggtg gccggcccag ctccgaagaa atcctccaca gtcaagtcca tcgtcaacag 1740 tgctaccaac gaacccaaat ccaacgttcc acaatgccat gcttgcccac agaagcacca 1800 gctgttccaa tgtccgacgt ttgccaatat gccagtcttg caacgtcgag agttggtttc 1860 acagcgaagt ttatgttgga actgtttccg cccaaaccat caagcgaagg cgtgtaagtc 1920 gaagttttcg tgcagaacgt gccacgccaa gcaccatacg ttattgcacg accaagttgc 1980 accagcaaga tccacgccga accctgccgg ctcgtcgaac acgtctatgc agcaaccaat 2040 tccatccacg tcagcgaacg ttgggaccgc tggatcgtcg aatccacccg aagtgaatct 2100 atccgttcaa tcaagtagaa gcacggtcct attggaaacg gttgttctct ccatcgtcga 2160 caagcacggc aaagaaatca ccgcccgagc acttcttgat tccgctgcga tgtcaaactt 2220 catcaccaag aagctggcaa atgctcttgc cacccgtcaa gccgtcgtag atattgcagt 2280 tgctggaatc ggagaatcaa tgaagcagat caagcggcag atctccgcaa cgatcaaatc 2340 ccaatctaaa cccttttcca ccacactcga attcctaatc atgaagaagc ctactgccaa 2400 tctacccact atcgctatta atactgcatc atggaacata ccaaagcttc cactagcaga 2460 tccgaacttt aatgttccgg gcatgatcga cattatcatc ggcggtgaat gttaccatga 2520 aatccacaca ggtaaccgtc tgtcactcgg tgacggtcta cctttcttgg ttgacaccct 2580 tttcggatgg acagtctctg gaaaaactgc catcaactcc tccatcgcac caccggtgtg 2640 ctacctgtcg acagtcgatc gatctctaga aacagcgctt cagagattct gggaactcga 2700 agccgtcgat catggcccta cgtattccac tgaggaaaaa cgatgcgaag aaatctatgc 2760 caatactacc acccgaactc actccggaag gtacgtcgta cgccttcctc ggtccgaaga 2820 tccgcaagtc accctcggcg actctcgaac aattgccact cgccgcttct acagcctcga 2880 gagacgcctt gagaaagatt ccgccttaaa gcaaacctac cacgagttca tcgacgaata 2940 ccttcacctc ggtcacatgc gcaaactcga tgttgttgac gacgatgtgc cccactgtta 3000 cctgccccac catccagtgt ttaaggagag cagcaccacc accaaggtgc gcgtagtgtt 3060 cgacgcgtcc tgcaagacat cttccgctgt gtctttgaac gatacactcc tcgtcggtcc 3120 agttgtgcaa cagcctctcg tcaatatcat cattcgcttc cgattccacg tagtcgcatt 3180 tgtggcagac gtggagaaaa tgtatcgaca agtcctccat cattccgtcg atcagcattt 3240 tctgcgcatc ctgtttcgtc gaaatccatc cgagccgctc gctacctacg aacttctcac 3300 cgtcacatac ggtacggcag ctgctccata cctggccact cgaaccctgc tgcagttggc 3360 ccaggatgaa ggagataagt atccagatgc cgttcaagcc gtcgttgaag acttctacgt 3420 cgatgatttg ctctccggag cagacaatgt ggaagctgcc atcaagaaac gttgccaaat 3480 caccaatatg cttgcatccg ctggattccc gttgaggaag tgggcctcca acgaaccaga 3540 tgcactggcc gaagtaccac ccgaggatct ggcaatcgaa ccactccatg atctgcaaga 3600 cgatcaatcc gtctcaacat tgggacttgt ttgggacaca aggacggaca cgctacgttt 3660 caacgtcgaa atgccacttc cagcgccgat cctaaccaaa cgcaagatta tttcgtacat 3720 cgccagaata ttcgatccgt tgggcctcgt cggccctgtg atagctaccg ccaagatgtt 3780 catgcagcgg ctttggaagc tgaagaccga ggacaatcat ccctacgagt gggacagacc 3840 gcttccacct aaattgcagg aagaatggaa gaaataccac gccacgttgg atattctttc 3900 cgaacttcgg attcctcgat tcgtttcctt tgccggtgca cccagtattc agctacactt 3960 cttttccgat gcgtcggagt tggcctacgg tgcctgttgc tacgttcgct ccgaaggcga 4020 tgatcgtata accgtgagac tgctcacgtc caagtcgaaa gtcgcacctc tcgcaaccaa 4080 gcattccatc gcccgtctcg agctcagcgc agccgtcctg tccaccagac tgttccagaa 4140 ggttgaacaa tcattacgca caacagctga tgtcttcttc tggacagatt caaccaccgt 4200 tctgcaatgg ctcaattccg ctccgaatcg ctggaaaaca ttcgtggcaa atcgagtatc 4260 gacgatccaa tcaaacacag aaggacattc ctggaaacat gttcccggaa tagacaatcc 4320 tgccgatgaa ctatcccgag gcctgatgcc aaccgaactg ccaaaccaat ctcgatggtg 4380 gaacggtccg gcatggctct cacaatctcc cagtcattgg cccgatgcaa gcatcccaca 4440 agaagagtct tcgttagtcg ccgaagaagc acgcaaggtt acgctggtat cgttggagag 4500 tgaaggagtc cagtttgccg atcatctgtt ttcacgcttc tccacgtaca ccaaacttcg 4560 tcgtgtggtt gcgtactgca tcaagtacat ccgttccctc cgagccgtag tacagcagtc 4620 taccgaagac agatcgctta ttctaaccac ggtagatctg atgaagtccg accaagccct 4680 agctcgttta gcacaacaac aaaccttttc ggaagaacta tccgagctac atgccaccgg 4740 cgtcgtgaaa ggcgcaccgc tgaaatggtt gaagccgcgt ctaagcgatg acggcatcct 4800 tcgcgttgga ggccggcttg agaacgctaa tgcatcagaa gcaagcaaac atcccataat 4860 cctgtcagcc aaacatccat tttcccgctt gcttgccgac cactaccata aaactctcct 4920 ccatgcgggt ccccaactga tgttatccac gttgcgccaa aggtattgga tccttggagg 4980 gagagatctg actcgttcga cctaccatcg ttgcgtcaag tgcttccgca taaagccgaa 5040 cctcatccag caaagcactg cagatttgcc gatatcacgt gtcacaccgg cgcgcccttt 5100 ttccatttct ggagtggatt actgtggccc agtgtacctc aagtctcccg ttcgaaatcg 5160 accgccgacc aaggcttaca tagccatttt tgtgtgcttc tccacgagag cagtgcatat 5220 tgaaccggtt tccgacctgt ccacgccagc atttttatcc gctctacgtc gcttcatcgc 5280 tcgccgcggc cgtatccagg agctccactc cgataacgga accgcattca agggtgcagc 5340 aaattcgttg cgccgtcttt atgaaatgct gaagatcaac aacagcgatc gtgaggcaat 5400 catccgttgg tgtgcggacc aggaaatcca gtggaagttt attccgccac gtgccccaca 5460 tttcggaggt ttgtgggagg cggcagtgaa atcgcgcaag catcatcttc tccgagaagt 5520 cggtcagacg tccctcagct gggaagacat gacaaccctg ctggcacaaa tcgaaatgtg 5580 tctgaattcc cgtccactaa ctccgatccc aaccgaatcc accgacctcg aagcgctcac 5640 acctgggcac tttctcgttg gctcgaacct gcaagcagtt ccggaagctg atgttgccaa 5700 tattcctgac aaccgtctgg atcactggca gcgtacaacc aaacagcttc agcgtatctg 5760 gaccaggtgg tatccggaat acctcgccca gcttcaagca cgagccaata aaggatgcaa 5820 gtcaccggtg aaccttcagt tgggacggat tgtggtcatc aaggaggaaa acctttcgcc 5880 ggcgcagtgg cctttaggcc gcattacaaa gcttcaccca ggcaaggatg gaatcgtccg 5940 tgtagtcagt ctgaagacag ctaccgcgga gaacgtcgta cgcccagtcg ccaaggtggc 6000 cctcctgcca attcaaaccg aagaagatca tcaatcaacc gaaccgtaag cagaccctac 6060 aaagatcggt agagcagtcc gaaactgtct caacgcatgt ttggatagct aacaggtgat 6120 tagcgttcca acccttttac cagtccctgt tcgttctacc gggtctctct tttcacagac 6180 acctttctgc tctcgtcatc tcatcaactc gccgctctcg ttcgcatccc gtacgccgtc 6240 tgctgaatga gcccattcag gcagcatcca actgatkaat gagatccgac ggcgcgacga 6300 agctcatcgt gttcgccacc cgaagtcagc cgatttcgga gagagtagtg agagaggaag 6360 catgctgaga gagttgaaac agtgtggccg tttcaaggtg gccggaatga tcagtacgct 6420 gatcaaa 6427 // ID Gypsy-21_SI-LTR repbase; DNA; INV; 1019 BP. XX AC AEAQ01024002; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_SI_; KW Gypsy-21_SI-I; Gypsy-21_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-1019 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024002; Positions 2460 3478. XX SQ Sequence 1019 BP; 242 A; 346 C; 260 G; 171 T; 0 other; tgtgacgatg caccccgtat agaaaaacat ccccacggca aacatcagcc gaaaagaccg 60 ccgggccata accggggatg ccgtgggcac ccactcttaa gaatccaagg ccgcgatacc 120 gcctggtgac aaacatcgcc agaagaaacg cgccgagcca tcgcacgcgc tgagtcggcg 180 cgcacgcact ctcggaggac acgcggtcct ccctaccccg acgactccgc cgtcaagcgc 240 cgagaaagat aatggaggcg agcgaagtcg ctcgatataa atagggccac cgcgacgaga 300 aacgacactc tgccaccgac cagcttcccg atcacggtcc gctgcgtcgg cgatatcctg 360 ctccttaaag ttggaaaccc ggtcgaagag tgcttggccg cggcgagagg gtactccgcc 420 tccgacgcaa atccattcga catacaccag cctccgcgaa ccgaggacca cgagtctcgg 480 tccacgagcc ggtctccggc gacccagagg ctgccctcct ctcctcgcga cacccttcaa 540 aatcgcgacc gccgtgttcg ggctctcggg tcaaggtact tgacagagtg tcgcgcgtcc 600 gatctgcctg cgacgattcc gtcgcgaaac cgccatatcg cgcgggaacc cgatccgccc 660 aagataatcg cgtcacaaca gctagcccgc gaatcgtacc gtaggccacg cctaccgcgt 720 cgccgaagca cgccacacag gccgcggggc cgcacctaca cggaatcgct ccgaaacgct 780 cccgtgtagt ctagcgccgg gacgcagcta tacgtaagca cctgtatata tgtaaacagc 840 tcaaacgtgt acatatcatt tagcaaatat atctatgtaa ctgcattcga actcatatct 900 cgcatcgagt aattcaaacc gctccgacga ctttcctttc tcccgcgtcg aaccattttc 960 gcggtctcgc accgggagtc cgacgagctg atccgcgcgt ggaaagtcgg taaattaca 1019 // ID BEL-218_AA-I repbase; DNA; INV; 2455 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-218_AA_; KW BEL-218_AA-LTR; BEL-218_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2455 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 893-893 (2011). XX DR [1] (Consensus) XX CC 'AAGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 369..2318 FT /product="BEL-218_AA-I_1p" FT /translation="MSAAERKLRGLKNRKRSIITSFTAIKAFISGYQAERD FT KCEVPVRLENLIALWDEFSSVQTELETIEEDDDALELYLKERMELERIYYR FT AKGTLLLWNQPATQLAPTQPRENANQACVKLPDIKLPVFSGQFDEWLNFHD FT LFISLVHSSSNLSTIEKFYYLRSSLAGEALQLIQTIPISNEQYSVAWNLLE FT SHFQNPRRLKRTYVQSLFDFPVMKRETAAELHSLIDKFQTNVKILKQLGEE FT TEHWDVLLIHLLSTRLDSVTRRSWEEYSESNNATKFQQLTDFLQHRVNVLE FT TVNGLTDAQFYARKSIVSRAASFDASDHNFRPCPACSNQHFLYQCSVFLEM FT TIDGREQLVRRHQLCRNCLRRGHGLSECLSESRCRKCSERHHTKLCSISDS FT VRNSSYATSSHQDQFSEFSQTSDAARPSSPPPAAYTGVTSCTSQQGSRSKV FT LLATATVVMVDDDGREHHVRALLDSGSECSFATERLAQRMKVHRQRTNVPI FT AGIGQASTEVRSKFWTCVKSRVSKYTKKIELFILPKVTVDLPSIPVKVTNW FT PLPTGIHLADNGFYQPAPIDVVFGADLFFDIFNVDGRIPLGESLPTLVNSA FT FGWVVSGRISSDQDVPSAVCNLAAVNWTTEGTSRSYRCPYVKSRIFRRRIT FT H" XX SQ Sequence 2455 BP; 698 A; 580 C; 527 G; 650 T; 0 other; tggtccttcg agccggatgg gtgtcgagtg aagtccaccc aagccagtaa aacccgccat 60 cgtactacgt aatatcgccc agtgaataca ctattctgct ccacaaaaaa taaggcatcg 120 ttaatgcctc acataggtga gtgcggttcc atccaactta tattgttgag ggttttctac 180 cgtgatcttt tttcatgtac caatcatcga gtggttaact gcgacgatca aatcagtgcc 240 attgaattga atgaagtaag aatactccat ctactacgat tgcgaatcgc aacccgcgag 300 agaagaactt tggaatttga tacaagcaat caattgcctg ggcgcagaat tcatcagcaa 360 cgcttgtaat gtcggctgcc gagcgaaaac tgcgcggttt gaagaatcgt aagagaagca 420 tcattacgtc attcactgca atcaaggcct ttatatcggg ataccaagcc gaacgtgata 480 aatgtgaagt tcccgtccgg ctagagaacc ttatcgcact ctgggatgaa ttcagctctg 540 tacaaactga actggaaaca atagaagaag atgatgatgc actagaactc tacctcaaag 600 aacgcatgga actagagcgc atctactacc gtgccaaggg tacgctacta ttgtggaatc 660 agcctgccac acagcttgca cctacacaac cgagagagaa tgcaaatcag gcctgcgtca 720 aactgccaga cattaaatta cctgttttca gtggacaatt cgatgaatgg ctcaactttc 780 atgatctttt catctctctt gtgcattctt cgtcaaatct ttccacaata gaaaagtttt 840 attatctgcg ctcgtcgttg gccggtgaag ctctgcagct catccaaact ataccgatca 900 gcaatgaaca atactccgtt gcttggaacc ttctagaaag tcactttcag aacccacgtc 960 gtttgaagcg cacatacgtt caatcgctat tcgactttcc tgtgatgaag cgcgaaacag 1020 ccgcagaatt gcactctctt atagataagt ttcaaaccaa cgtaaaaatt ttgaaacagt 1080 tgggagagga aaccgaacac tgggatgtac tccttattca tctattaagc actcgactag 1140 attctgtgac cagacgtagc tgggaggagt attcggaatc gaacaatgca acgaagtttc 1200 aacagctcac tgatttcctt cagcatcgtg tcaacgtcct ggaaacagtc aacggtttaa 1260 cggatgctca attctatgcg aggaaatcaa tcgtctcacg tgcagcaagt ttcgacgcct 1320 cggaccacaa ctttcgccct tgcccagcat gttcgaatca acattttctc taccagtgta 1380 gcgtatttct tgaaatgacc attgacggaa gggagcaact ggttcgacgt catcaacttt 1440 gtcgcaactg tttacgacga ggacacggct taagcgaatg tttatccgag agccgctgcc 1500 gcaaatgtag cgaacgtcat cacaccaaat tgtgcagtat ttctgatagc gtcaggaata 1560 gtagttacgc cacttcttct catcaggatc aatttagcga atttagccaa actagcgacg 1620 ctgctagacc atcttctcca ccaccagcgg cgtatacagg agttacgagc tgtacttcac 1680 agcaaggatc ccgaagcaag gtactgttag caacagcgac tgttgtcatg gtagatgatg 1740 acggtaggga acaccatgtc agagcattac ttgattctgg cagtgaatgt agttttgcaa 1800 ctgaacggct cgctcaacga atgaaggtgc atcgtcagag aacaaacgtt cctattgcag 1860 gtattggtca agcgtcgacc gaagttcgtt ccaaattttg gacttgcgtt aagtctcggg 1920 tttctaaata cactaagaaa attgaactat tcatccttcc caaagtaacc gtggacttgc 1980 cgtcaattcc tgtcaaggta actaactggc cactccctac gggtatacac cttgctgata 2040 acggattcta tcaacccgcc ccaatcgatg tcgtattcgg cgcggatttg ttcttcgata 2100 ttttcaacgt cgatgggcga attccgcttg gtgaatcttt gccaaccttg gtaaattctg 2160 cgttcggttg ggtagtcagt ggcaggattt caagtgatca agatgttcca tcagctgttt 2220 gcaatctagc tgctgtaaac tggaccacag aaggcaccag cagaagttac cgatgtccat 2280 atgtcaaatc acgaatattc cgacgtcgga ttacccatta atttctttga gcatcaatca 2340 atgccttgtg gatattatca caatgtgtca acatcaaccg taacctacgg aagcaaaaat 2400 taatagtctg ttgcatttaa taaattcaga aacagttatt tctgcggggg cagta 2455 // ID I_Ele4 repbase; DNA; INV; 8314 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele4. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-8314 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-8314 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >88% identity, and ~96% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 1010..2698 FT /product="I_Ele4_1p" FT /translation="MAASSAGPPTNSWWDGPANDNFGDRVDGSFPGPTLPP FT FMDPEGLHGELHLLRISGVNGPLPNKPFHIRRSVERFVGGKIDGAFPEANK FT SSYALKVRSQRQFNKLLDMNQLLDGTPVLVAEHPTLNSTRFVVSCRDVVDM FT SEQELLEELKEQGVKEVRRITKRKGQARENTPALVLTCRGTIRPETINFGY FT IRCRTRPYYPSPMQCFNCWLFGHTKLRCQAKVATCGTCSGDHPIAENRDCD FT IGDFCKTCNSNEHRISSRSCPRWQYENTIQKVKVDQGISYPAARRXVEQNR FT GSXTFATMASYATKXPXQPRERNEELSAVLAAKDAEIAELRAALATRXAPS FT XTVSSEIETLKXIVXDQARQIQLLTEQVSVFLKAVMPAASXTSPTNXAPTT FT KALTXSSATXXEETGXSEPSATTNXASXTTLNNSPTTSKNGVHEPIVDPEA FT XKDTISAPLSXSVTXNEQHXDKXLDXDPSDSDTASSDRSANIEFFSPHQSK FT NTLSPXTPKPPRTRALLPRGSGSNVDPTKTPGKRPISSISRTETLFQQQKK FT SKIKSAEGHAAVSNKR" FT CDS 2695..7797 FT /product="I_Ele4_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="KVSPSTLSIFSPPPQRHKPVSHCTDAHHAAAQINPQR FT NIIFLATDRRSAGELEATSEPESPIRLRHHAQNNSPAVSLDSRGANVPDVL FT PLPESLGRLRRLEDDMDEELPLHLLRCPAKTPGSQSPVGAVVSHSGTDGQP FT LALGEDLEEGNQVAKPTSLDDVGSQVFQVIGTRRALPATPKDNVGSQVFFH FT QANTVAYTSNPTIYDAVSSTSVSVDSFVKNPGHPGHSNPGGKSHPVMSTIH FT SERPFPDPTVDQLPPLEPPGPSFSDVRSQSSYASSGHLGNSNPSGSSTKLL FT KNSNPDVIETIAQHTIPSAPTTWPPRSLLMDDQPRSFSAMYSENLNPGAIA FT VAPPPPVLLPPSSGTPGPFSLDNRPLFSEQINSDGTTAALTLAVRPISPLL FT GDHHRCLDTSGARRLGSKDYSGFPSYSFDEVPLANANPPASSRSPEYPSGG FT DSLNRSAAITPPNFPRHHSENSKDTSSSSTHSKNSPSENCDFRSRFPTPTL FT PDPTKFCLQWNINGFYHNLSNLELLTHVSSPWVIALQEVNRTSTTAMNRSL FT SGKYRWTMKKSRNVRHTVALGVLLSVPFEIIEIDSDLPIVAVRLVGSFRLT FT IINCYLPCGSLSGIKEKILNIIRTTPEPRLLLGDFNAHHQIWGGDRTDPRG FT TAILNALEDSDMVSLNDGSPTFYNGRVASSIDLSMASRSIAYKFLWQACAD FT LHGSDHYPIQISAAVTSPPITRRPRWCYDRADWQIYQSNINHLLELEEPAS FT LTSFSNVITKAAASAIPKSSSRPGRKALRWWSDDTRSIVKARRKALRALKR FT LPNDHPEKEDKLRIYRAVNIKCRSTIRTAKRLSWEEFLDSMNPSQTSADLW FT GKINALSGKRKITPISLDIQGSTITEPSAVAEALGNYFGSLVSIDAYDPVF FT RQRIANNQSSLSNFVIPEDHGTSQLNCSFSMRELNFALGSSTGKSAGPDGI FT GYPLIKNLPLGGKAKLLELINSMWHQRSFPDEWRESLVIPIPKTNVASRDT FT SKYRPISLTCCLSKVVERTVNRRLKEQLEADGRLGHRQHAFRPGYGTDSYF FT ASLGDLLHDARTKGQHADLVSLDISKAFNRTWTPLVLKQLTEWGFSGNLIG FT FLKNFLLHRSFRVAVGNHSSSAFAEETGVPQGSVIAVTLFLVAMNGVFENL FT PPDIYIFVYADDILLVAIGHTPRRTRVKAQAAVTAVHKWASSVGFSLSAAK FT SVRGHVCNSRHKLVSPGIKIDGQPIPNRKTVRTLGILIDRNLLFKEHFNAV FT KTNCRSRLNIIKTISKPHRSNNRSTRFRVAQAIVDSRLLYGLEITCLSQQN FT LLEILSPIFNGYVRTISGLLPSTPADAACAEAGILPFRHRVHAAICRKAAT FT YASKTAGNDRVALFDQADRILREVANSSLPPVARVHWHGASSWQLPPPIVD FT EQIKKTFRSGDDSTVLRASVLELLRTKYPEHIHRYTDGSLSQRGVGIGISG FT ADLTLCQSLPTECSVFSAEAAAIFYAAISPASAPILIITDSASVLSALQSE FT RPSHPWIQGIRAHTPETVTFTWVPGHCGIQGNVAADLLAGTGHQMSRFTLS FT VPPPDVKRWIAKSIRDYWAHEWHQNTSAQLRKVKGDTSSWEELSSLRDQKI FT ISRLRTGHTRFAYNFDGRSFRQECEQCGVHNSAEHVICHCPLYGHHRSTNN FT IPGSIRDALANDPASLSALICFLKETRLYYRT" XX SQ Sequence 8314 BP; 2100 A; 2359 C; 1866 G; 1949 T; 40 other; cagtttgcat cgagccggcc ttgattgcac acgcttttta ttgcggtcgt tttacgttgt 60 tttaattgtt cgttacgtaa tcgtgtttag ttgtttttta aacgttacta gtttttcacg 120 gggtacgtgt gtgaaattag tgccgaatct taattgtcgt tgccgcgaaa tagcgttkag 180 ccgagccctm aacgggcgag ttccctcttg ccggcgcgtg ttttttcacc cgggagaggg 240 gaaaagttag tgaatttttg tcacgttttt acgtggccgt gtacactctg tgacccctat 300 agtgcttgaa tagtgtcacc aagaagtctt aggggggagt agcatctact ctcagccctt 360 cggggctaag aaaagccctt tcccagagtc ctctttggga aacaaatcca aagtgtgtgg 420 caaaattttt tgacttcgcg aaattgaacg ccgctttacc agtgcataac agtggaagaa 480 gcacacattt gaccggcagt gaccgtaaaa caagggtatt gtttcggtgt cgcattgata 540 ttgcctgcct gagtttctca tagccgggcc tacagaaagt gatcggtggt ggactatata 600 tattgctgtt gggaagaatt atatagtgct tcacgggcaa accctccccg caccccgcgg 660 tgatagagag gtgttttttt acagtgctag tgttataccc acggtgtgcc tacaaccgtt 720 gttaacaaaa aaagggaagt agtttctctc ctacgcattg gggtgcaagg atcaaaatag 780 acccttgaca agagggttcg taggactcag tgcgcgattt gttttttttt tctttttcct 840 cccccacgtc ttggggtttm gagwtgagac atttcagtag tgcgcgagtg cagatcgagt 900 gaaccgttgc ttcctgttta gtgtgttcag ttgccgttca tctgctgcct tcaacagggt 960 ttctccaggt gagttgtggg tggtggtgtt ttccgtgtga ggggccgcca tggcggctag 1020 ttcggccgga ccacccacaa attcttggtg ggatggtcca gcgaatgata attttggtga 1080 cagggttgat ggctcttttc cgggaccaac gcttccgccg ttcatggatc cagaagggct 1140 ccacggcgag ctgcaccttt tgagaatttc tggtgttaac ggtcccctcc ccaacaagcc 1200 cttccacatc cgacgttcgg tggaaaggtt tgtcggtggt aaaattgatg gcgcatttcc 1260 agaggcaaat aagtcatctt atgctctgaa ggtgagaagt cagcggcaat tcaacaagct 1320 gctggacatg aatcaactac tggacggtac cccggttctg gtcgcggaac acccaacttt 1380 gaattcaacc cgttttgtgg tcagttgtag agatgtagtt gatatgtcag agcaggaact 1440 actggaggag ttaaaggagc aaggtgtgaa agaagtcagg cgaatcacca agcgaaaagg 1500 acaggcccgg gaaaacaccc cagctttagt actgacttgt cgtggcacca ttcgaccaga 1560 aacaataaat tttggttaca tccgctgcag gacccgaccg tactacccga gtccaatgca 1620 gtgttttaac tgctggcttt ttgggcacac caaactgcgt tgccaagcaa aagtcgcgac 1680 atgtgggacc tgttccgggg atcacccgat tgccgagaac agagactgcg acattggcga 1740 tttttgcaaa acctgcaact ccaacgaaca caggatmtcg agccgctctt gccctcggtg 1800 gcaatatgag aacaccattc aaaaagtgaa ggtcgaccag gggatttcgt accctgcagc 1860 ccgccgcwtg gttgagcaga accgaggcag tgsaactttc gccackatgg ctagctacgc 1920 taccaaggwt ccccwacaac caagagaacg aaacgaagaa ttatcagccg ttctagcagc 1980 maaggacgck gagatagcag agctccgggc wgcgcttgct actcgtsmag ctccttcggw 2040 cacmgttagt agcgaaatag aaacactgaa gwccattgtc kctgaccagg ctaggcaaat 2100 ccagctccta accgagcaag tctcagtwtt cctgaaagcc gtcatgccgg cagcmagcwt 2160 tacaagccca acwaatcktg caccaacaac caaagctcta acwmcttcct ctgcaaccgw 2220 cawtgaggaa actggcasct ccgagccatc tgcaactacc aacwccgctt ccgmcactac 2280 ccttaacaac tcgcctacca ccagtaaaaa cggagtccac gaacctatcg tcgacccaga 2340 agctkccaag gacaccatct cggcaccttt gtctksaagc gtcaccwcca acgaacagca 2400 ccmagacaaa wtgcttgaca amgatccctc agactctgac actgcgtctt ccgaccgcag 2460 cgcwaacatt gaattcttct ctccccacca gtctaaaaac acactctcac cacawactcc 2520 caaaccgcct cgcactaggg ccctactccc tagaggtagc ggaagcaatg ttgaccccac 2580 caaaaccccc ggtaaaaggc caataagctc aatatcccgc actgagaccc tgtttcagca 2640 acagaaaaaa tcgaagataa aatctgccga gggccatgca gcggtctcca ataaaaggta 2700 agtccatcaa ccctatccat cttctcacca ccaccccaac gccataaacc tgtctctcac 2760 tgcactgatg cccatcacgc agcagcccaa atcaatcctc aacgcaacat cattttcttg 2820 gctactgatc gtcggagcgc cggtgaactg gaagcaacgt ccgaaccaga atcgccgatc 2880 cgactccggc atcatgccca aaataacagt cctgccgtct ctcttgatag tcggggcgcc 2940 aatgtaccgg acgttttacc cctaccggaa tcgttgggcc gactccggcg tcttgaagac 3000 gacatggacg aagagctccc gcttcactta cttcgttgcc cagctaaaac ccccggtagc 3060 cagagccccg tcggtgcggt cgtttcccac tccggaactg acggacaacc tttggcgctg 3120 ggggaagacc tggaagaggg aaatcaagtt gccaagccta cttccctgga cgacgtggga 3180 agtcaagttt ttcaagtaat tgggacgcgt cgggctctac ccgctacccc aaaggacaac 3240 gtgggaagtc aagttttctt tcatcaagca aatactgtag cctataccag caacccaacc 3300 atctacgatg ccgtatcgtc aaccagcgta tcagttgatt ctttcgtgaa aaatcctggg 3360 caccccgggc actccaaccc gggtggtaag tcccaccccg tcatgagcac aatacattcc 3420 gagcgcccct ttccggaccc taccgtagat caactgccac ctttagagcc tcccgggcca 3480 tccttttcgg acgtccgatc ccagagtagt tacgcctcat ctggacacct cgggaactcc 3540 aaccccagtg gatcgtccac aaagctcctc aagaattcga atcctgatgt catcgaaacc 3600 atagcgcaac ataccattcc ttccgctcca acgacatggc cgcccaggtc acttttaatg 3660 gatgaccagc ctcggagctt cagtgcgatg tattctgaga accttaaccc cggcgccatt 3720 gcagtggcac caccaccgcc cgttttattg ccaccatcat caggcacccc cgggccattt 3780 tctttggata accggcccct tttctccgaa caaatcaact ccgatggaac cactgccgct 3840 ctcacattag ctgtacgccc tatttcaccg cttctgggtg accaccaccg atgccttgac 3900 acctccgggg cccgccgcct aggaagcaag gattattccg ggttcccttc atatagtttc 3960 gatgaagttc ccctagcaaa tgcaaacccg cctgcgtcgt caagatctcc ggagtatccc 4020 tctggtgggg actcgctgaa tcgttccgct gctataactc ctccgaattt tcccagacat 4080 cactctgaga acagcaaaga tacatcatct tcatcaaccc actcaaaaaa ctcaccatcc 4140 gagaactgtg atttccgatc ccggttcccc acaccaactc tacccgatcc caccaagttc 4200 tgcctccagt ggaacattaa cggtttttac cacaatctga gcaatctaga gcttttgaca 4260 cacgtcagtt ccccttgggt tattgctctg caggaggtga atcgtacgtc aactaccgct 4320 atgaaccgct ccctcagcgg aaaatacaga tggacaatga agaaaagcag gaatgtgcgg 4380 cacaccgtgg ccctcggcgt cttgctgagt gtccctttcg aaattataga aatcgacagt 4440 gacttaccca ttgtagctgt tcgattagtt ggtagcttta ggctaacgat cattaactgc 4500 taccttcctt gcggctccct ctctggaatc aaagagaaaa tcttaaacat aatcaggact 4560 actccagagc cccgactgct ccttggagat ttcaatgcgc accatcaaat ctggggtggc 4620 gataggacag atccacgcgg aacagctatc cttaacgccc tagaagattc ggacatggta 4680 tccttaaatg acggctcacc caccttctac aacggccgtg tcgcttcgtc catagattta 4740 tcaatggcaa gtcgctccat agcttacaag ttcctatggc aagcttgcgc cgacttgcat 4800 ggtagtgatc attatccgat tcagatttct gcagcggtca cgtccccgcc aatcaccaga 4860 cgccccaggt ggtgttacga tcgggcggac tggcaaatct accaatcaaa catcaatcat 4920 cttctcgaat tagaagagcc tgcttcttta accagctttt ccaatgttat tacaaaggca 4980 gctgcctcag ccatccccaa aagcagcagc agaccaggaa ggaaagctct tcgctggtgg 5040 tctgatgaca cccgctccat tgtcaaagca cgccggaaag ccctgagagc acttaaaagg 5100 cttcccaacg accatccgga gaaagaagat aaactcagga tttaccgagc cgtcaacata 5160 aaatgtcgtt ccaccataag gacagccaag cggttatcat gggaagaatt tcttgattcc 5220 atgaatcctt cccaaacctc ggcagattta tgggggaaga ttaacgccct aagtggtaaa 5280 aggaaaatca cgccgatttc cctggacatc caagggtcaa cgatcaccga gccttctgcg 5340 gttgccgaag ccctgggaaa ttactttggt agtctagtca gcatcgatgc atacgatcca 5400 gttttccgcc aaagaatcgc taacaaccaa tctagtttat ccaatttcgt gattccggag 5460 gaccatggca cctctcagtt gaattgctcc ttctccatgc gagagctcaa ctttgccctc 5520 gggtctagta ctggtaaatc cgccggcccc gacggtattg gctaccccct cataaaaaac 5580 ctccctctcg gaggtaaagc gaaactgtta gagctaatca atagtatgtg gcaccagcgc 5640 tcattcccgg acgaatggag agaaagtctc gtcatcccta ttcccaaaac caacgtggcc 5700 tctcgggaca catctaaata tcgtcccata tctctaacgt gttgcctctc gaaggtggtc 5760 gagagaacgg ttaaccgacg attaaaagag cagctcgaag cggatggtcg cctcggacat 5820 cgacagcacg ccttcaggcc gggatacggc actgactctt acttcgcctc ccttggagac 5880 ctccttcatg atgcgcggac aaaaggacag catgcagacc tcgtttcact ggacatttcc 5940 aaggcgttca atagaacatg gacgcctttg gtactaaaac aactgacgga atggggtttc 6000 tccgggaacc tgatcggctt ccttaaaaac tttcttctgc atcgatcttt tcgggttgca 6060 gtagggaacc acagctcaag cgccttcgcc gaggaaactg gggtcccaca aggttcggtc 6120 atcgcggtta cgctgttcct ggtagcgatg aacggagtgt tcgaaaactt gccgccagac 6180 atttacattt ttgtatatgc tgacgacatc ttgctggtag ccattggtca tactcctagg 6240 cgcactagag tcaaagcgca agcagcagta acggcagtac acaaatgggc ttcttctgtt 6300 ggtttttccc tctccgcagc taaaagcgtc cgcggccacg tttgtaactc gcgacacaaa 6360 ctagtgagtc caggaatcaa aattgacggg caacccattc caaacaggaa aactgtcagg 6420 actttaggaa ttttgattga ccgtaacctg ctgtttaagg agcatttcaa tgctgtcaaa 6480 acgaactgtc gttcccgtct gaacatcatt aaaacgatat ccaagccgca taggtccaac 6540 aatcgatcca cgcgcttcag ggtagcacaa gctatcgtag atagcaggct actttacggt 6600 ctcgagatca cctgtttgag ccaacaaaat ctgcttgaaa tcctctcacc gattttcaac 6660 gggtacgtcc gcactatctc cggtttactt ccatcaaccc ccgccgacgc agcctgtgca 6720 gaagctggta tcctcccatt ccgccatcga gtgcacgcag ccatctgccg caaggcggct 6780 acttacgctt caaaaacagc tggaaacgat agagtcgctc tctttgatca agcagatcgt 6840 atcctccgtg aggttgccaa ctccagtctt ccccctgtgg ctagagtaca ctggcatgga 6900 gcgagcagtt ggcagcttcc ccctccaatc gtagacgagc agataaaaaa gaccttccgt 6960 agtggcgatg actctacggt gcttagagcg tccgtactgg aacttctgcg cactaaatac 7020 ccggaacaca tacaccgtta cacagacggc tctctttcgc agcgtggagt aggtatcggc 7080 atctctggcg ctgacctgac actctgtcaa agccttccaa ctgagtgttc ggttttctct 7140 gccgaagcag cggccatctt ttatgcagcc atctcaccag cgagtgctcc aatcctcatc 7200 atcaccgatt ccgcaagcgt cctatcagcc ttgcagtcag aacggccgtc ccacccgtgg 7260 atacagggta tacgagccca tactccagag acagtcacct ttacctgggt ccctgggcat 7320 tgcggtattc aaggaaatgt ggctgcagac ttgttagctg gtacgggaca ccaaatgtct 7380 cgttttaccc tttccgttcc acccccagac gtaaaacgct ggatcgctaa atccatcaga 7440 gactattggg cccatgagtg gcaccaaaat acttcggctc agcttcggaa ggtaaaagga 7500 gatacctcta gctgggaaga gctctcctcg ctccgcgacc agaaaataat ttcgcgcctt 7560 aggaccggac atacccgttt tgcctataac tttgatggca gatccttccg tcaagagtgt 7620 gaacaatgcg gtgtacacaa ctcagcggaa catgtaatct gccactgccc tctctacgga 7680 caccacagaa gcactaataa cattcccgga agcatccgcg atgccctcgc aaatgacccc 7740 gcatcattgt cggccctaat ttgcttcctt aaggaaacca gactctacta ccgcacgtga 7800 cgactctaca acctgatctc aacttaatcc acgtcagaga atacggattg gacgatcggt 7860 gggctccctg aagacctttc tttcgggggt aagtcccatc caaccaacgt aacccctaac 7920 gcaaccgcac catcatttca atgaacctgc aagaatcttc acaccgccgc gaccggtggg 7980 ctccccggaa gattccgttt tcggggataa gacccatcca acaacaaagt catcaatcgc 8040 aaccacgcca acgatccttt ctctttttct cggctgcaga agacgtcggc ccagcaaatg 8100 cgcacgtacc cccttttttt aagttgtttt tatcgtccta cgtgaaaaac gcacaatggt 8160 gagaccccat ggttgggctc aaaggctggt ccctctgacc atccttcttt actcattttt 8220 cgtgggatgc ccatctggca tttccagtca ggtgaagaac tagccattcg tcaatgtgct 8280 aaaattcacc ttaataaaga aacaaaaaaa aaaa 8314 // ID Gypsy-14_RP-I repbase; DNA; INV; 6777 BP. XX AC ACPB02011988; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_RP_; KW Gypsy-14_RP-LTR; Gypsy-14_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-6777 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02011988; Positions 8848 2072. XX CC Positions [4341-4802] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 448..1491 FT /product="Gypsy-14_RP-I_1p" FT /translation="MTSLPLTYDVLRCVPEYDGNSYELYHFINTCEELLNT FT YCKPNIKENEHNHWLLLKTALNKIKGPAKIVVYNNNCQTVNEVIAALRKNF FT ADNRTVPDLFSEINSMKAKPREHPLEFLNRLDEKRNVILTRYRLDGISGIV FT LNELTRQLDAHLVRIFLYGIHPSLGAHLQSLQCQALDDTRLKLINDCGIIL FT SQLKLPNTPENNTDTITNNHHNNHRQKKPWRQPHTYSKNNYTSQPFHRYNH FT YYPPNLNHHSPQIPQITHPNYNQLTYPQIQFNRPNPSNFPSKPFNPHSQNT FT VSMKTVRPRHELTITEMNQKNEVSELKKQIETLNKSVSQLTEHFLELGTHH FT HPPDP" FT CDS 2130..5231 FT /product="Gypsy-14_RP-I_3p" FT /translation="MELDEVESAYTTDSVTNCPDELAEVKKEIPKLIRTDH FT MNKEEKVKIENLIKEYPEIIKRDKDKLTSTTLIKHKINTKDDTPVYTRNYR FT HPIAFKADIHKEIEKLLENKIIQNSNSPYNSPIWVVPKKPDASGKRKIRMV FT IDYRKLNEKTIEDKYPLPNIEDLFGKIGRATYFSAIDLASGFHQIEMDPES FT IPKTAFSTESGHYEFLRMPFGLKNAPPTFQRAMNLIFADTPNVLVYMDDII FT VFSDNLTEHLKHLQKVFQKLKDNNLKIQLDKTEFFKKELLYLGHIISNKGI FT APNPDKVSTIKNFPLPKTTKQIKQFLGLTGYYRKMIKNYAKIAKPLTNALR FT QDGKIDTKDQEYIDSFNTLKEYLQNSPILQLPDFQKEFHLTTDASNLALGA FT VLSQNFEGKDLPIAYASRTLNKAEERLSTIEKELLAIVWSCKHFRPYLYGK FT KFTIYTDHKPLQWLHNMKEPTSKLLRWKCTLLDYEFDVKYIPGKTNLVADA FT LSRIPQEISTIDNPPEERNSENLSEVNPEEVIDQFLREYPPQIDNDNSSLA FT TVHSQESSSEQVTLMDKDKILNIEQHQILIERGSPNIQIIKVFNKNRIKLT FT ISNINIEDQLKEFVLQYLQPKTTYGIYCQTSEILQQEYDHIFLTLQTIITR FT DFSSIKLRRYYKLLIDIEDKPEQLETIGNYHNGKTYHRGISESYEHIRRRY FT YWPGMFKDISDFINQCQTCLKVKYDRKPIKAQYQITPTPNKPFEKIAIDVF FT IFNSQKFLTIIDLFSKKLTVYPIKTHNSIEIQGKLQTYFSLFPLPTTIQMD FT NGKEFQNIGIKNLLSLYNINPYYTTPGHSQSQGTIERVHSTLIELLNAIQI FT ENKNNTIGKNMTLAVIAYNNSIISKLKLTPMEVTFGTNNIAPQNVQVNILE FT EKTRQYHQDLELIHKIIKDNIEKEKIERTQKLNKTREPNINLPKTLHIKST FT KHKKHVPKYFPVKHNPSTKMEKLKNSKTCKEFKIHPNRVKRPRKVTKPLSI FT TGNHDPTLQPSTSRDPNISYSTDSSPDH" FT CDS 5158..6753 FT /product="Gypsy-14_RP-I_2p" FT /translation="MTPHFNHLLVATLTLATLLTAHQIIDLTKSNGIIALR FT IQPSYIINNTYTYIHNINIEEINEEINQIENNLNLINPKNRNPHYINIINS FT FILYSKDKLSHIYLNKIPRQKRGLINGLGTAISWITGNMDADDKERYDKII FT KQIANNEYSLQHNVQNQISMNQDIINKFNADIDIIRANNNKAKLYLQILSN FT ETTAVQLAEQYNFYLINLQLLINKINDIDNSIEFCKTSIVHSSIITRRELS FT RIVKKTGVQFISDDPEILWQVGSVRCALHDNFISYFLELPLSSDSYETIFF FT LSHPFKHGTETAAIYAHPSMVLRKYNNLYYSKDCILIKNIYYCKKAKEIKN FT NECIENILKNKNHDCKTIVLDNTKPFVKYVNVINKYLFYNYEDIMLNIKNT FT TNITIYSKYCYLLELDENEFIYNISQPYTYWESKLLDLGSISINYPSTTSN FT FSFDNLHKLNLKSRPLEFLEELPIVNTHSIILYVIVGIIIIVLLCYMLSKT FT NFSKRLRKTDTATAVGIKLEIPSGTTLNPSPRTASS" XX SQ Sequence 6777 BP; 2708 A; 1420 C; 962 G; 1687 T; 0 other; taactggcgc ccgattgcag gaacagcaat acttcttaac gtttcgatca cttatccaaa 60 ctttaagtcg accgtagtat ttaccgcaac cctttaaagt tacgaccaat tgtccagctg 120 aagtggaagc agtacctact acagattgct gtcgaacagt gtggcctatt gccccgctaa 180 acacctaaaa gaagacaacg tgtgaccagt cgtcccacgg aaaggaagct ggagaaatca 240 agaacacaac aaagcaaccc ttacacctac tggaccgact catccgaaga cgaactaccc 300 ctcaagcctt ccaaggagcg tacttcaagg agagcgaagt ctacggtacg gtggaaagct 360 gctacaacta cagcacttgc agaacaacca acaacaagtg ttactacagc cacatctcca 420 ggtaactcaa ttgacataac tatcaacatg acgtctttac cactaactta cgatgtactt 480 agatgcgtac cagagtacga cgggaatagc tatgaattgt accactttat aaacacttgt 540 gaagaactac taaacactta ctgtaaaccc aatataaaag aaaacgagca taaccactgg 600 ttacttttaa aaaccgcgtt aaacaaaatc aaagggccag ctaagatagt agtctataac 660 aacaattgcc aaactgttaa tgaagtcata gctgctctaa gaaagaattt cgcagacaac 720 cgcacagtac ccgacctatt ttctgaaatc aatagcatga aagctaaacc tagggaacat 780 cctctagagt ttctcaacag gttagatgaa aagagaaatg taatcttgac tagatacaga 840 ctagacggaa tttccgggat agtgctaaat gaactgacca gacaattaga tgcccatctc 900 gttagaatat tcctatacgg aattcatccc tcattaggtg cacatttaca atcattgcaa 960 tgccaagcat tggatgatac acgattaaaa ttaataaatg actgtggaat aattttaagc 1020 caattaaaac tacccaatac ccctgagaac aacacagaca caattactaa taatcaccat 1080 aacaatcaca gacaaaagaa accatggcgt cagccacaca cttattcaaa aaacaattac 1140 accagccaac catttcatcg ctacaaccac tattaccctc caaatttgaa tcatcattca 1200 cctcaaatac ctcaaattac acatccaaac tataatcagc ttacatatcc tcaaattcaa 1260 tttaaccgac ctaacccctc taactttcct agtaaacctt ttaaccctca ttcgcagaat 1320 accgtgtcaa tgaaaacagt tagaccaaga catgaactaa ccattacgga aatgaatcaa 1380 aaaaatgaag tcagcgaatt gaaaaaacaa atagaaacac taaacaaatc agtatcacaa 1440 ttaacggaac attttttaga attaggaaca caccaccacc caccagaccc ttaacctttg 1500 aactaaaatc catttccaag tccttaccct acttcctaga tgaaaatgca ctcaaaaacc 1560 tcatagacac aggctcacaa aaaaactaca taaatccaaa catcatccca ccaaacacac 1620 aaagacacaa agaaaacttt actgtcaaaa cccctacagg agaagaaacc ggaacggaat 1680 acatattttt aactttagac aagacctttc ctaatacttt agacttacct actaaattct 1740 atgtcttccc cttctctgac agatacgact tattattagg ttacgaaact cttaaagtta 1800 ttaacgccac tatagatttt aaggctagtg ttttaaggta tgataatgga catcgcaatt 1860 tattgtttaa tataaaagat aaaataaatg aaaaaggtaa tgataaaata aataaaacaa 1920 agaaagtaaa agaaaatata aatttaaaaa agggattaaa tataatcgaa atagaagtaa 1980 aaaataataa ttgtaagata ggtttaatag ataattacga tttaaaaaat gagaaagtag 2040 agttaaaaaa gggattaata aaaaataata aaggcaaagc aaaatgctta gtctatgccc 2100 acgaggtcac gaccatttgt cctgaaccca tggagctgga cgaagtagag tcagcctata 2160 ctaccgattc cgtgaccaat tgtcccgatg aattggcaga agtaaaaaag gaaatcccca 2220 aattaataag aaccgaccac atgaataaag aggagaaagt taagatagaa aatttaataa 2280 aggaatatcc agagataata aaaagagata aggataaact cactagtacc accttaatta 2340 aacacaaaat taatactaaa gacgataccc ccgtctacac tagaaactat cggcacccca 2400 tagcctttaa ggcagatata cacaaagaaa ttgagaagtt gctcgaaaat aaaataattc 2460 aaaacagtaa ctccccatat aattcgccca tttgggtagt cccaaaaaaa cctgacgctt 2520 caggtaaacg aaagataaga atggtaatag attatcgtaa acttaacgaa aaaaccatag 2580 aagacaagta tcctctccca aacatagagg atctattcgg aaaaatagga agagcgacct 2640 acttctccgc tatagattta gcatctgggt tccaccagat agaaatggat cccgaatcaa 2700 tacccaaaac cgctttcagt acggagtccg ggcattatga attcttaaga atgcccttcg 2760 ggttgaaaaa tgcacctccc acttttcaga gagctatgaa tctcattttt gccgacaccc 2820 caaacgtcct agtatacatg gacgatatca ttgtgttctc cgacaactta actgaacact 2880 taaaacattt gcaaaaagta ttccaaaaat taaaagacaa taacttaaaa attcaattag 2940 acaagactga atttttcaaa aaagaactac tctatctggg acacataata tccaataaag 3000 ggatagctcc caatcccgac aaagtaagta ccatcaaaaa tttcccctta cctaaaacca 3060 caaagcaaat taaacaattt ctcggattga caggatatta cagaaaaatg attaaaaatt 3120 acgcaaaaat tgctaaacca ttaaccaacg cattaagaca agacggtaaa atagacacaa 3180 aggatcagga atacatagac tcattcaata cattaaaaga atacctacag aactccccca 3240 tactccaact accagatttc cagaaagagt tccatttaac cactgacgcc tccaatttag 3300 cattaggggc agtcctttcg caaaatttcg agggcaagga tttacccata gcctacgcct 3360 cgagaactct aaataaggct gaagaaaggc tgagcaccat tgagaaggag ctgttagcca 3420 ttgtttggtc ttgcaaacat ttcagacctt acttgtacgg aaaaaaattt accatatata 3480 ctgaccataa acctcttcag tggctccata atatgaaaga gcccacgtca aaactcttga 3540 gatggaaatg caccttattg gattatgagt tcgacgtgaa atatatccca gggaaaacca 3600 atttagtagc cgacgccttg tcgcgtatcc cccaggagat atccaccata gacaaccccc 3660 ctgaggaaag aaattctgaa aacctcagtg aagtaaatcc tgaagaagta attgaccaat 3720 ttctaagaga atatcctccc caaattgata atgataacag cagcctcgct actgtacact 3780 cccaagagag ctcatccgag caagtaactc tcatggacaa agataaaata ttaaatatag 3840 agcaacatca aatactaata gaaaggggat cgccaaatat acaaataata aaagtattta 3900 ataagaatag aataaagtta actataagta acattaatat agaggaccaa ttaaaagaat 3960 ttgtcttgca atacttacaa ccaaagacaa cttatggaat atactgtcag acctcagaaa 4020 tattacaaca agagtacgac catatcttcc ttacgttaca aacaattata actagagact 4080 tttcttctat taaactgcga cgatactata aactcctaat agacattgag gacaaacctg 4140 agcaattaga gactatagga aattatcaca atggtaaaac ttatcatcgc ggtatatccg 4200 aatcctatga acatatccgc cgccgatatt actggcccgg catgtttaag gatatatctg 4260 actttataaa tcagtgtcaa acctgcctaa aagtgaaata cgacagaaag cctataaaag 4320 cccaatacca aattacacca acccctaata aaccttttga gaaaatagcc atagacgtgt 4380 tcatttttaa ctctcaaaag tttttaacca taatagatct cttcagtaaa aaactgactg 4440 tgtacccaat taaaacccat aactcgattg aaattcaagg caagcttcaa acatatttct 4500 cactgttccc attacccaca acaattcaaa tggataacgg gaaagagttt caaaacatcg 4560 gcattaagaa tctcttgtcc ctctataaca ttaatcctta ttacactaca ccaggtcatt 4620 ctcagtctca aggtacaatc gaaagagtac actctaccct tattgaatta ttaaacgcca 4680 ttcaaataga aaacaaaaat aatacaattg gaaaaaacat gacactagct gtcatagcct 4740 acaacaatag cataatctcc aaattaaaac tcacccctat ggaagtgacg ttcggtacta 4800 acaacatcgc accccaaaac gttcaggtta atatactcga agaaaagacg agacagtacc 4860 accaagacct cgagctaata cacaaaataa taaaagataa tatcgaaaag gaaaagatcg 4920 aaagaaccca gaaattaaac aagacacgag aacctaacat aaaccttccc aaaacactac 4980 acataaaatc aaccaaacac aagaaacatg tccccaaata ctttccagta aaacacaacc 5040 catcaaccaa gatggaaaaa ctcaaaaact ccaaaacttg caaagaattt aaaatccatc 5100 caaacagggt aaagagaccc agaaaagtaa caaaaccact ttctattaca gggaaccatg 5160 accccacact tcaaccatct actagtcgcg accctaacat tagctactct actgacagct 5220 caccagatca ttgatctaac caaatccaac ggaattattg ctttacgaat acagccatcc 5280 tacataatta acaatacata cacctacata cacaacataa acattgaaga aattaacgaa 5340 gaaatcaatc aaatagaaaa caacctaaac ttaattaacc ccaagaatag aaatccccat 5400 tatatcaata taattaattc tttcattcta tactctaaag acaaattgtc acatatatat 5460 ttaaataaaa taccaagaca aaagcgtgga ctaattaacg gactgggaac ggcaatttcc 5520 tggataaccg gaaatatgga cgcagatgat aaagaaagat acgataaaat aattaaacaa 5580 atagccaaca acgaatattc cctacagcac aacgtacaaa atcaaatatc tatgaatcag 5640 gacattataa acaaatttaa cgccgacata gacatcatac gcgcaaacaa taacaaagca 5700 aagttgtatt tgcaaattct ttctaatgag accacagctg tgcaactagc tgagcaatac 5760 aacttctatt taataaactt gcaattacta ataaataaga taaacgacat agataacagt 5820 atagagtttt gtaaaaccag tatagtgcat tccagtataa tcacaaggag ggaacttagt 5880 agaatagtaa agaaaacagg tgttcaattt ataagtgatg atccagaaat attatggcaa 5940 gtaggctcag tcagatgtgc cctacatgat aatttcatat cttatttttt agaactccct 6000 ttatcttctg actcatacga aacaatattc tttttatccc atccttttaa acatggtaca 6060 gaaaccgctg caatttatgc gcatccctct atggtcctta gaaaatataa taatttatat 6120 tatagcaaag attgtatatt aattaagaat atatattatt gtaaaaaggc taaagaaata 6180 aaaaataatg aatgtattga aaatatattg aaaaataaaa accatgattg taaaactatt 6240 gtattagata atacgaaacc atttgtaaaa tatgttaatg taattaataa atacttgttt 6300 tataattatg aagacattat gttaaacatt aagaatacaa ctaacattac aatctactcc 6360 aagtattgtt atcttttaga attagacgaa aatgaattta tttataatat ctcccaacca 6420 tatacttatt gggaaagtaa attgttagac ttaggtagca taagtataaa ttatccttct 6480 actacttcta atttttcttt tgataattta cacaaattaa atttaaaatc aagaccctta 6540 gagtttctag aggaactgcc tattgtaaac acgcactcta taatacttta tgtaatagtt 6600 ggtataatca taattgtttt attatgttac atgctatcaa aaaccaattt ttccaaaaga 6660 cttaggaaaa ccgacacagc cacagcagtt ggtataaaat tggaaatacc atccggtaca 6720 accctcaacc cttcgccgag gacggcttca tcttaaggag ggaggagtta cgaacgc 6777 // ID MuDR6x_AP repbase; DNA; INV; 2386 BP. XX AC Contig9458; XX DT 25-JUN-2009 (Rel. 14.07, Created) DT 25-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR6x_AP. XX NM MuDR6x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2386 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1355-1355 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(1166..1462,1440..1877,1881..2039) FT /product="MuDR6x_AP_1p" FT /translation="MLKETDDLFGDRTFKSVPSIFSQLYTIYCYARSLVIP FT LVYILMTSRTKETYVEVLRQLLILQPDLNPKSIKIDFEQPFISVFKQIFPN FT AHVSGCFFHFGVDVFFILASVYGVNYNLFGLQKKYAEDSTFALQVKQLCAL FT AFVPVLDVVESFNTILESQYFIDNEQLFEPLVDYFEETWIGRTVRNRKRKP FT LFQIEMWNCYNNVINDHPRTNNIVEGWHHAFNASLGCHHATIWKFISFLKQ FT EQGLEAKIEKIILGENNPIKKRKYKNIDQCLKNTVENYETTNINTYLKGIA FT HNFNF*" XX SQ Sequence 2386 BP; 896 A; 340 C; 341 G; 809 T; 0 other; gtcatttggt ccggtaaata atatccggcg acaattgatc cgcgacattt gatacggttc 60 aaaaagaacc ggtaccaaaa atatccggct aaaaaacgtc cggtaaaaga ataatataaa 120 atacaaatat atataaatta gaactattat tttagaaaaa taatctatga tattgtagta 180 tattatagaa tgtaataagc gaatagtata ttaccttatg gcttatatac agaaaaggat 240 cggaaaaacc ttgaataaag aatgtaatca ggtctcaacg atagaaatta ttattataac 300 ggaaatagcg taatatcaga ctacacgata ctgtacgata ctatttatat atatgtattg 360 tataccgtac actatacgat acatataagt tttttcgatt ttagtacagg tgtacaccta 420 gccgtatgtg gacatgttta agtaacgtta atactaaaga atgccactta cattcatcaa 480 aagctaaaag gcataaatat gcttttaaat aaaagttttt tatatagtat tcattatgtg 540 tacgaaacaa aagttgtttg acgttgtatc gaatataaaa agttgtcatg taaaagccga 600 tgtcacataa caacgaaaga ggaaacgggt atgtacatca aaatctgtat accaaattaa 660 tagaatttaa ttaattagat taatttttaa aaaaatattt aataattatt tttaagtacc 720 tagactgtat ttaaaattgt gcaaccacag atcattccac aaaaacaata caattttttt 780 gttttttttt ccataggaga tcttaaaaga aactgatcat tcgcatgctc caaacttagc 840 caaaattgag gacaaacaac tggtgaatac tttaaaggat actgctataa cagaacaaac 900 ttctgtaaga aatattattt caaacgtgtt aggccaaact tctgttccag taattggaca 960 attaccagtc ataaaatcat tatcaaggac tattcaacgg actcgagtaa caaaagaaaa 1020 tgctccagtt aatccaagta ataccaaaga tctaattata ccagaaacat ataaattaac 1080 caacaaaaat gaattttttt tggcatacaa tagtggtagt agtgaaagta gaattttaat 1140 ttttactact caaaacaatt taaatatgct taaagaaacg gatgacttgt ttggtgacag 1200 aacttttaaa tctgttccct cgatattttc ccagttatat acaatctatt gctatgcacg 1260 tagtttagtt attccattag tgtatatatt aatgactagt cgtacaaaag aaacctacgt 1320 tgaagtttta aggcaattgc taatattaca accagatctg aacccaaaat caattaaaat 1380 agattttgag caaccattta tttcggtgtt caaacaaatt tttcctaatg ctcatgtaag 1440 tggatgtttt tttcattttg gctagtgtgt atggcgtaaa ttacaatctt tttgggctac 1500 aaaagaaata tgcagaagac tcaacgtttg ccctacaagt caaacaatta tgtgctttag 1560 cattcgtacc tgtccttgat gtagtagaat cttttaatac cattctagag tcccaatact 1620 ttattgataa cgaacaacta tttgaaccat tagtagatta ctttgaggaa acatggatag 1680 gtcgaacagt tcgtaatagg aaaagaaaac cattatttca aatagaaatg tggaactgtt 1740 ataataatgt aattaatgac caccctcgca caaacaatat tgttgaaggt tggcaccatg 1800 ctttcaatgc atcattggga tgtcatcatg cgactatttg gaaatttata tcttttttga 1860 agcaagaaca agggctttaa gaagcaaaaa tagaaaaaat tattcttggt gaaaataacc 1920 ctataaaaaa aagaaaatat aaaaacatag accaatgtct taaaaataca gtagaaaact 1980 atgaaactac aaatatcaat acatatttga agggtatcgc tcataatttt aatttctaaa 2040 catttgtatt ctataagact tttataatta cattatatca ttttatatta ttatgcctta 2100 acaataatta atagtttata tattaactga ttattatttt ttttataatt tctatataat 2160 atttttttta ccgacgtttt ttagccagtg attattatta ttttttacct gacattttta 2220 aaagttatga aataaaataa taattctaat ttatatatat ttgtatttta tattattctt 2280 ttaccagacg ttttttagcc ggatattttt ggtacaggtt ttttttggac cggatcaaat 2340 gtcgcggatc aattgttgcc ggatattatt taccagacca aatgac 2386 // ID BEL-60_CQ-I repbase; DNA; INV; 5970 BP. XX AC AAWU01016932; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-60_CQ_; KW BEL-60_CQ-LTR; BEL-60_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5970 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 273-273 (2011). XX DR GenBank; AAWU01016932; Positions 40001 45970. XX CC Positions [5018-5578] - Integrase core CC 'GATAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 947..5970 FT /product="BEL-60_CQ-I_1p" FT /translation="MPTPRGNRSLPALSSISDRRGSPTPPPKPQDLSSGSR FT LTSSQIAARHIVKDLPRFGGNPEDWPRFIAAYERTKKMCGFENDELLDRLE FT RCLYDKAQLAVQNLLLHPNKVPLIIERLRTLYGNPEIIVETMVQRVRMMPR FT PKADRMDTIIDFGIAVQNLCATMEACQMDECLYNVALLQELVEYLPPSIKV FT NWALHRQGKLKVSLGDFGDWLGNLVEALSKVTRPQPAVKGHGVHRDRNNEH FT VHVHSNTLDDQTVHETCLACGSDCQSLDKCAKFLKMSPKHRWEQIREKSIC FT RKCLKRHFKPCDTKVPCGKNSCKFLHHELLHDNSKHKLPEELKATNVQSQE FT ECNAHHCCLGSVLFKYAKVTIHGKGKSITTYAFLDSGSTCSLMEHSLWKEL FT GLEGEQYPLCIGWTAGQGRYEAGSVKCAVDISSIHSGHRNRLKKLHTVESL FT QLPAQTMNIDDLSQHYRHLSNLPIDSYENVRPRILLGMDNAFLDHPIEARE FT GGENQPVAALTRLGWVVYGPCSVEENSKKNHHQEFNYHICQCDTMFAEMKQ FT YFMLDSLGIQTPSKPLMSKDDERAMELLRSQTVREGNRYRTGLLWKFDEVK FT LPDSRAMALNRLRCLEKRMIREPELATALQAKIEDYEQKGYIRKLTDEEER FT AHRDRVWYLPIFVVTNPNKPGKLRIVWDAAATVRGASLNSFLLKGPDQLAP FT MVNVLYRFREYRTAVTGDIREMFHQVGVHPEDQHCQRFLWNNGRPGSTPSI FT YVMQVMTFGATCSPSCAQFVKNTNAERFNEDHPAAVNAIINDHYVDDMLSS FT VETEQEAIELATSVRDIHAQGGFEIRGWRSNSSSVLEALSAEQLNEKNLNV FT IAQFSTEKVLGMWWDSCTDTFTFKLPTKPDKELLNGSRVPTKKEVVSVLMS FT IFDPLGLLANVLMFLKVLIQEIWRSKVGWEDPITATQFEKWLTWLKVIEQV FT ENVTVPRCYRQITSSSSKTNVQLHMFVDAGVQGCATVGYLRFEEADRIECA FT FVAGKTKVAPNKLTSIPRLEIDAGTMGVRLAQKIMEALRIVIHQRFFWTDA FT RDVLCWLHSDHRQYTQFVGFRVAEILEKSDLSEWNYCPSKLNPADDGTKWT FT KTPDLSKKSRWMHALCFIQEKTSNWPEKLTHIGTTQLELKHSVGLHCATAQ FT VIRWENFSTWKRLLRHTALCKRFVRNSKASTDKQLRTTGEITQEELQAAEA FT YLHRQAQEAEYLEEIAILTKAANCDEPDRYHIPKSSSLFKLSPHLDERGVL FT RMSGRAGACEFIEPEVSCPVILPRNHHITVLTVRSMHFRYHHTSNETVVNE FT LRQRYRIPHLRRVCSQVRWSCQDCKNRSARPAAPLMADLPQARLAGFSRPF FT SYTGVDYFGPMCVAVGRRVEKRWGVLLTCLVTRAVHIEIAHSLNTDSCIMA FT IRNFAALRGTPLELFSDQGTNFIGSDRELREALQKIDNDKLFKEFTTPNTK FT WTFNPPSSPHMGGSWERMVQSVKKILTQLKLPRNPTDEVLRNTLLEIANVI FT NSRPLTYIPIDDENSPALTPNHFLLGSSSGSKPLVPFSDDIATLRNNWKAS FT QLYANIFWKRWVKEYLPTICRRTKWHQPVRPIQVGDVVVVVDPDLPRNCWP FT IGRVVSTNTNNKDGQVRSAVVRSNGRYYERPVVKLAVLDVGINTSKLDNDS FT SVLGGT" XX SQ Sequence 5970 BP; 1658 A; 1578 C; 1528 G; 1206 T; 0 other; ttttaaattt tccgtttaac taaaccggaa tgtctgacaa aggcgggccg gagaaacccg 60 gcgcgatatg tggcaaatgc gacaaaatgg gttggaagcg gatggtgaaa tgcgcgaaat 120 gcagactctg gtatcactat gaatgcgtgc gtgtgacttc tggagtgctc gacgccccct 180 gggtttgcaa tgactgcctt agaagatcta tcgaggaaag caatgaagtc cttgaactgc 240 atctgcgtga ggaacggcgg aaggtgtatc gcagcctaca gcagaaacta gaaaagcagt 300 ggcgtgaaca aaaccctgga actgatcaca cgcagtctga gaacgttgag tacgttaagt 360 ttgggaatac cgtcatccaa attgagcgac gtggtggaat tcctctggct acatccagtc 420 cttctgccgg tgcggaacca gcaatgaaac ccggtcagtc cggaacactt cagcgcctcc 480 gtgaactgga agaagcaaac aggaagctac gtagacagat agaattggcc caacaatcgg 540 aaacatcaag aggattcact tcccacacgg aaccggcgga atcactgtac gaatatccgc 600 tgcccagctc aagactagaa cgtatccttg acccaacaaa gatcaatcca accggatttc 660 tgagtcctgt cgttgagaga acagaatctt cccaccacgg gatctgaagc tgaaagcgct 720 agcggaaagg caggcgctag agaagaagca gcttgaagaa agacaagcac tggagcaaga 780 acttctcgac caacaagacc ttgacgaaga cttgcgatct gtacgaacac gcagcagtga 840 tcaccactcg tccggatcaa acttcttgaa tgtcagcaaa tggctagatg aactgcaaat 900 ctctctcaac cctgtgccag ccgcgagcga aaccccctcc ggaagcatgc caactccccg 960 cggaaacaga tctttgcccg cgctgtcttc tattagtgac cggcgaggtt cacctacacc 1020 tcctccaaag cctcaagatc tatcgagcgg ctcgcgattg accagcagtc aaatagctgc 1080 ccgccatatt gtgaaggact tgccaaggtt tggcggaaac ccggaagact ggccacggtt 1140 catagccgcg tacgaacgaa caaagaagat gtgtggattc gagaatgatg aactcctgga 1200 tcgactggaa cgttgcctgt acgataaagc acaacttgcg gtccagaacc tattgctaca 1260 tcccaacaag gttccgttga tcattgagcg cttgaggacc ctttacggga atccagagat 1320 cattgtggag acgatggtac agcgagtccg aatgatgccc cgaccaaaag cggaccgaat 1380 ggacacgata atcgatttcg gaattgcagt acaaaacctt tgtgctacca tggaggcttg 1440 tcagatggat gagtgcttat acaacgtggc tctgctacaa gagctcgtcg agtacctgcc 1500 gccgtccatt aaggttaact gggcgctgca ccggcagggc aagttaaagg tgtctctggg 1560 cgacttcgga gactggcttg gtaacctggt cgaagctcta agtaaggtga caagaccgca 1620 acctgcggta aaaggccacg gtgtccaccg ggatcgtaac aatgagcacg tccatgttca 1680 ctccaatacc ctggacgacc aaacagttca cgaaacctgc ctcgcatgcg gaagcgactg 1740 tcaatcgctg gacaaatgtg ccaaattcct aaaaatgtca cccaaacatc gatgggaaca 1800 aattagggaa aaaagcattt gtcgcaagtg cctaaaaaga cattttaaac catgcgacac 1860 gaaagttccg tgtggtaaga atagctgcaa attcctgcac catgaacttc tccatgacaa 1920 cagcaagcac aagttacctg aggaactgaa agcaacgaac gttcagtcgc aggaggagtg 1980 taatgcgcat cactgctgcc tcggatctgt tctattcaag tacgcaaagg tcaccatcca 2040 cggcaaggga aagtccatca ccacctacgc attcctcgac agcggctcga cttgctcact 2100 gatggaacac agcctctgga aagaacttgg acttgaagga gagcagtacc cactatgcat 2160 tggatggaca gcaggacaag gtcgatacga ggccggctcg gtcaagtgcg cggtggacat 2220 ctcgagtatc cacagtggac atcgaaaccg attaaagaaa cttcacacgg tcgaaagcct 2280 acagctgccg gcgcagacga tgaacattga cgatctctcg caacactacc ggcatctttc 2340 gaacctgccg atcgactcct acgaaaacgt tcgccccaga atcctcttgg ggatggacaa 2400 tgcgttccta gaccacccaa tcgaagcacg agaaggaggg gagaaccaac cggtcgctgc 2460 gcttacccgt ctaggatggg tcgtttacgg gccttgctcg gttgaagaaa actccaagaa 2520 aaaccatcac caagaattca actaccacat atgccaatgt gacacgatgt ttgctgagat 2580 gaagcagtac tttatgctgg acagtcttgg aatccaaacg cccagcaaac cgctgatgtc 2640 caaggacgac gaacgggcga tggagctgct gagatcgcag acagttcgag aaggaaaccg 2700 gtaccgaacc ggtcttttgt ggaagttcga tgaagtgaag ctacccgact cgcgagctat 2760 ggcgctgaac cgactgcgct gcctagagaa gcgcatgatt cgagagccgg aacttgctac 2820 ggcgctgcag gctaaaatcg aagactacga gcagaagggt tatataagga agctaaccga 2880 tgaagaggag cgagcacatc gtgatcgcgt gtggtatttg ccaatatttg tggtaacaaa 2940 cccaaacaag ccgggaaaac tacgcatcgt atgggacgcg gctgcaaccg tccgtggcgc 3000 atcgcttaac tcttttctgc tgaaaggccc tgaccagctt gcgccgatgg tcaatgtcct 3060 gtaccgcttt cgggagtacc gcactgctgt gacaggtgac atccgtgaga tgttccacca 3120 agttggtgta caccctgaag accagcattg ccagcgattc ctgtggaaca acgggcgccc 3180 cggatcaacc ccttcgatct acgttatgca ggtgatgacg ttcggcgcga cctgctcccc 3240 aagctgcgca caatttgtca aaaacaccaa tgccgagcgc ttcaacgagg accatccggc 3300 agcggtaaac gcaatcatca atgaccacta cgtggatgat atgctaagca gcgttgaaac 3360 ggagcaagag gcaatcgagc tggcgactag cgtccgagat atccacgcgc agggaggatt 3420 cgaaatacga ggttggagat caaattcgtc ttcagttctc gaagcactca gtgcggagca 3480 gttgaacgag aagaatttaa acgtcatcgc gcagttttca accgaaaagg tgttgggaat 3540 gtggtgggac tcctgcaccg atacatttac gttcaaactg ccgaccaagc ccgacaagga 3600 actgttgaac gggtctcgtg taccgaccaa gaaagaggtt gtcagcgtgc tgatgagcat 3660 cttcgatcct cttgggcttc tggccaacgt gctgatgttc ctgaaagttc tcattcaaga 3720 gatttggcgc tcaaaggtcg ggtgggaaga tccgataaca gcgacacagt tcgaaaagtg 3780 gcttacatgg ctgaaagtca tcgaacaggt agaaaacgta acggtgccga gatgttaccg 3840 ccaaataact tcgagctcgt cgaaaaccaa tgtgcagctg cacatgttcg tagacgctgg 3900 cgtgcaaggt tgcgcgacgg tcggttacct gcggttcgaa gaggctgatc gcatcgaatg 3960 tgcatttgtc gctggaaaga ctaaagttgc cccaaacaag ctaacgtcga taccacgact 4020 tgaaatagac gctggaacca tgggagtgcg cctcgcacag aagataatgg aagcccttcg 4080 catcgtgata caccaacggt tcttttggac cgatgctcga gatgtcctgt gctggctgca 4140 ttccgaccat cgccagtaca cccaattcgt cggtttccgt gttgccgaaa tactggaaaa 4200 gtcggacctg tccgaatgga actattgccc cagtaagctt aatccagcag acgatggaac 4260 caagtggacc aaaacaccag atctgtcgaa aaagagccgc tggatgcacg cattatgctt 4320 cattcaagaa aagacatcaa actggccaga gaagcttact cacatcggaa cgacacagtt 4380 agaacttaag cactcggtgg gactgcactg tgcgacagct caagtcatca gatgggagaa 4440 tttctcaacc tggaaacggc tacttcgaca caccgctttg tgcaagcgct tcgttcggaa 4500 ctctaaagct tcaacggaca aacaattacg cacaactggc gaaatcaccc aagaagaact 4560 gcaagcagcg gaagcatacc tgcatcgaca agctcaggag gcagagtacc tggaagagat 4620 agccattctc accaaagcag caaactgtga tgaaccagac aggtaccaca tccctaaaag 4680 cagctcgctc ttcaagttat cgccgcacct ggacgaacgt ggcgtgctgc gtatgtctgg 4740 tagagcgggt gcttgcgagt ttatcgaacc agaagtcagc tgcccagtca tcctcccacg 4800 aaaccatcat ataactgtgc tcaccgtacg gtcgatgcac tttcgctatc atcacaccag 4860 caacgaaacc gtcgtaaacg agctccgaca gcgatacaga attccacatt tgcgccgggt 4920 ctgttcccag gtccgatggt cctgccagga ttgcaaaaac cgcagtgccc gccctgctgc 4980 tccgctcatg gccgatctac cacaggcaag gcttgccggt ttctcgagac cgttctcata 5040 caccggggtg gactactttg gtcccatgtg cgtcgccgtg ggtcgccgcg tagagaagcg 5100 ctggggcgtg ctcttgacct gcctggtaac gcgagcagta catattgaga tcgcgcactc 5160 tctaaacacc gactcttgta taatggccat ccgcaacttc gctgcactgc gcggaacgcc 5220 tctcgagctg tttagtgatc aaggcactaa cttcatcgga tcagaccgcg agctaagaga 5280 ggcgctccag aaaatcgaca acgacaagct gttcaaggag ttcacaacac ccaatacgaa 5340 gtggacattt aacccaccca gctccccaca catgggtgga agctgggaga gaatggtgca 5400 atccgtcaag aaaatactga cccaactcaa gctaccccgc aacccgaccg atgaagtact 5460 tcgcaacacg ctccttgaga tcgctaacgt catcaattcg cgacccctaa catacatccc 5520 tattgatgac gagaactctc ccgcactgac tccaaaccat tttctgctcg gctcctcaag 5580 cggcagtaaa ccgctggtgc cattcagcga cgacatcgct acgctcagaa acaactggaa 5640 ggcctctcaa ttgtacgcaa acattttctg gaaaagatgg gtgaaggagt atcttccaac 5700 aatctgtcgc cgaacgaaat ggcatcaacc tgtgcggccg atacaagtcg gcgatgtcgt 5760 cgtcgtagtt gacccggacc tgcctcgcaa ctgctggcca atagggcgag tggtatcaac 5820 gaacacaaac aacaaggacg ggcaagttcg cagcgctgtt gttaggtcta atggccggta 5880 ctacgaaaga ccagtggtga agctcgcggt tctagacgtc ggtatcaata ctagtaagct 5940 ggacaacgat tccagcgtac tcggggggac 5970 // ID RTE-8_BF repbase; DNA; INV; 3427 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-8_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-8_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3427 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3427 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1706-1706 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 218..3394 FT /product="RTE-8_BF_1p" FT /translation="PGHRLATGKTSGHKTGACPVLFEMESLHHLKFRVGSL FT NVGTLKGRSSEVVETLTRRRVDLCCLQETRWSGGLDANQARFVKGKDSRYK FT FYWCGNKQGQGGVGIMLAEKWVENVFEVQRISDRILLLRLVIGKSVFTFLS FT VYAPQIGLSDAEKERFYDQMQSTIAKIPASETVFPIGDWNGHVGEDAQGFE FT EVHGGHGFGERNTEGERILEFAVANDLLIGNTLFVKRESHLVTYTSGNHRT FT QIDYILFRQSFRKAVSNIKVIPGEECARQHQLLVCDFVVRTPAVRKRKFTP FT RLRTWKLRDTAVAREFRETFVSKVESVTTDATGGEVEDLWSRLKTPLLEAA FT ASVCGYSKNHLWKPETWWWDDHVEEAVSKKRARFKVFNSLRKQGKTAVAMV FT AKTAYNEAKRLAKHAVWLAKSAAEKETFAVIDPQGADVYRVAKQMERSNQD FT VVGEMCVRNDAGELSLNDEDKMKAWVEHYNRLLNVEFDWPMDELPEAAPVV FT GPAPPVTTEMISKALSKMRFGKAVGPSGINAEMLKAAGEEGIELTRQLTEA FT VFRNGTVPVEWEKSIILSLYKGKGEALDRGNYRGLKLTDHVMKLLERVLDS FT AIRKMVNIDDLQFAFVPGRGTTDAIFIVRQLQEKFIAANKPLYFAFVDLEK FT AFDRVPRRVLWWALRSLGVEEWAVRVIQAMYANARSRVRVNGQYSEEFGVG FT VGVHQGSVLSPLLFILVLEALSREFRTGVPWELLYADDLVIIADTLEECIA FT RLKAWKSGMERKGLRVNMGKTKIMISGQGLNKLKETGSFPCAVCRSGVGAN FT SIQCTVCNFWVHKRCCGIRGSLLAVQNYTCPRCRGEARPLDGRPTTQVTVD FT EVVLDVEASFCYLGDMLCAGGGCELAVTTRSCVAWGKFKKLLPILTSRHLP FT FKTRGKVFDCCVRAAMLHGSETWAPTTSDLQRLRRNDRAMIRWICGVKPRD FT ETPSSSLLDKLGILDIISVLQSRRLRWFGHVERSSDCINKITKLKVSGNRG FT RGRPRKTWRDCVSRDLKECGLTGVDPLDRAKWKRSVKTGRLLPTPASGNPA FT AV" XX SQ Sequence 3427 BP; 821 A; 825 C; 1014 G; 767 T; 0 other; ggactaggga aaagatctga cattcccccc cccccccttc cccctactca gggatagcga 60 gttggccaga caggggtcat agctgaagtg gaagcagcct gtctagggac gagccccccg 120 aaacaaagct cctacctgac ctggggtagc tccattgcgg cagctgacga taaactgaca 180 gtctggtgtg atggatgatg ttgtgagagc cctgtgaccg ggccacaggt tggcgacagg 240 aaagacatcc ggccataaaa cgggcgcatg ccctgtccta tttgaaatgg aaagtcttca 300 ccacttgaag ttcagagttg gttccttgaa tgtgggcacc ctgaagggac gtagcagtga 360 agttgtggag acactcacta gaagaagggt ggatctctgt tgtctgcaag agaccaggtg 420 gtccggaggc cttgatgcta accaagcccg gtttgttaaa ggaaaggact cgcgctacaa 480 gttctactgg tgtgggaaca agcaaggtca aggcggtgta ggcatcatgc ttgctgagaa 540 gtgggtggag aatgtctttg aggtccagcg tatctctgat aggatcctcc tgctgcgact 600 cgtcattggc aaatccgttt tcacgttcct ctccgtctat gcgccccaga tcggactctc 660 ggatgctgag aaagagcgtt tctacgacca gatgcagagc accatcgcca agatacctgc 720 ctctgagacc gtgtttccta tcggagactg gaatggtcat gttggtgagg acgctcaggg 780 gtttgaggag gtccatggcg gtcatggctt tggtgaacgg aacactgaag gggagaggat 840 attggaattt gccgtggcaa acgacctgct catcggaaac actttgttcg tcaagaggga 900 gtcccacttg gtcacctaca cctctggtaa ccacaggacc cagattgact acatcctctt 960 ccgccagagt ttcagaaagg ctgtctccaa cataaaggtc atccccggtg aggagtgtgc 1020 tcgacagcac cagctgttag tgtgtgactt tgtagtacgc acccctgcag ttcgtaagcg 1080 caagtttaca cctcggctgc ggacttggaa gttgcgtgac accgctgttg ctagggagtt 1140 ccgggagact tttgtatcca aagttgagtc agtcactact gatgccactg gtggggaggt 1200 agaggatctc tggtcaaggt tgaagacccc gctcttggaa gcggctgcga gcgtgtgtgg 1260 gtactccaag aaccatctgt ggaaacctga gacctggtgg tgggatgacc atgtagaaga 1320 agctgtgtcc aagaaaaggg cacggtttaa ggtcttcaac tcccttagga agcaaggtaa 1380 aactgcggta gcaatggtcg caaaaactgc ctacaatgaa gccaaacgcc ttgctaagca 1440 cgctgtctgg ctggccaaat ctgcagcaga gaaagagact tttgctgtca tcgatcctca 1500 aggtgcagat gtgtaccgcg tagccaaaca gatggagcgt tccaaccagg atgttgtagg 1560 cgaaatgtgc gttcggaacg acgcaggtga actctccctc aacgacgagg ataagatgaa 1620 ggcctgggtc gaacactaca acaggctctt gaacgtggag tttgactggc caatggatga 1680 gcttccagag gctgctccag ttgtaggacc tgctccacca gtcactactg agatgatcag 1740 caaggcactg agtaagatga ggtttggaaa agctgtgggc ccttctggca tcaacgctga 1800 gatgctaaag gcagctggcg aggagggcat cgagttgact agacaactga ctgaagcggt 1860 cttcaggaac ggaactgtgc cggttgagtg ggagaagagc atcattctca gtctctacaa 1920 ggggaaaggc gaagctctag accgaggcaa ctacagaggc ctcaagctca ccgaccatgt 1980 catgaagtta ctggagcgtg tgctggactc agctattcgt aagatggtca acatcgacga 2040 ccttcagttt gcctttgtgc ctggcagagg caccaccgat gccatcttca ttgtgcgcca 2100 gctgcaagag aagttcatag ccgcaaacaa acctctgtac ttcgcctttg tcgatctgga 2160 gaaggccttc gatcgagttc ccagaagggt gctgtggtgg gcactgagaa gtcttggtgt 2220 tgaggagtgg gccgtgaggg tgatccaggc catgtacgca aatgccagga gccgtgtgcg 2280 agtgaacgga cagtacagcg aggagttcgg cgttggtgtt ggcgttcacc aggggtctgt 2340 cctcagtccc ctgctgttca tcctcgtttt ggaagctctg tctcgtgaat tccgcactgg 2400 agtcccctgg gagctcctgt acgcagacga cctggtgatc atcgcagaca ccctggaaga 2460 atgcatcgct agactgaagg catggaagtc gggcatggaa cgcaaaggcc tgcgggttaa 2520 catggggaag acaaagatca tgatttctgg tcaaggactg aacaaactga aggaaactgg 2580 atctttcccg tgtgctgtat gccgttccgg ggtgggtgct aactccatcc agtgtacagt 2640 ttgtaacttc tgggtccaca agagatgctg tggcataaga gggagtctgc tagctgtgca 2700 gaactacaca tgtccgcggt gtcgagggga agctaggcca ctggatggcc gccctacaac 2760 acaagtcact gtggatgaag tggtcctgga cgtggaagcc agcttctgtt accttgggga 2820 catgctctgt gctggtggcg gatgtgagct tgcagtcacc accagatcct gcgttgcctg 2880 gggtaaattc aagaagctct tgcccatcct tacttccagg cacctaccct tcaaaacccg 2940 tggcaaggtg tttgactgct gtgtacgtgc ggccatgctt catgggagtg aaacctgggc 3000 acctaccacc tcggatctac agcggttgcg ccgcaatgac cgtgccatga tcagatggat 3060 ttgtggcgtc aaaccccggg atgagacccc ttcatctagc ttacttgata aacttggtat 3120 tctggacatc atctcagtgc tgcagtcccg ccgtcttaga tggtttgggc acgttgaaag 3180 atctagtgac tgcatcaata agatcaccaa actgaaagta tctggcaaca gaggtcgtgg 3240 tagacctagg aaaacctgga gggactgtgt cagcagggac ttgaaggagt gcggtctgac 3300 aggggtcgac ccactggaca gagccaaatg gaagcggagt gtgaagactg gccgactgct 3360 gcctacccct gcgtcaggga acccagcagc agtataatct aaccatggat agtgagtgag 3420 tgagtga 3427 // ID Gypsy-234_AA-LTR repbase; DNA; INV; 127 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-234_AA_; KW Gypsy-234_AA-I; Gypsy-234_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-127 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR [1] (Consensus) XX SQ Sequence 127 BP; 35 A; 26 C; 29 G; 36 T; 1 other; tgtagaaacg ttgcatcttg cgacccmcga gagttgaagt gtagaattgg agaataaaga 60 tcacttgtat ttcgacgcgc atccgtgaca cttgtattct gtgtcttatc cgagatccga 120 tacaaca 127 // ID Merlin8_SM repbase; DNA; INV; 1078 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; Merlin8_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1078 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1898-1898 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 99..968 FT /product="Merlin8_SM_1p" FT /translation="MNIFELPRTEATAIAFLQVNGLLPSIRLCKNGHEMKL FT TISTDVKWRCRKQGCTQTIKMRIGNFFERSRLPFVTAVRFIYSWALAYSSG FT KFCRRECGMDEGTTADWSNYMREACVMHLLKKSLRKIGGTSLIVEIDESMF FT TKRKNNAGRSLPQQWIFGGICRETRQCFLVMVPDRSADTLLRAIEENIEDA FT SIIYSDSWRGYRTSDLEKANFEHFKVNHRYNFVDPETGAHTQTVERMWGSA FT KWRNKKHRGTARHHLESYLAEFMCRQEAKDEDPFDWILKLLREFWTPEE" XX SQ Sequence 1078 BP; 288 A; 239 C; 264 G; 287 T; 0 other; ggtactatag ggttatccgg gaccaatttt acaagtttca taattttaca ttttctatcg 60 aatttttgtt tgattttcat ttattttaaa tttaatccat gaatatcttc gaactgccgc 120 gtactgaagc tactgcgatc gctttcctgc aagttaatgg actcctgcct tcgattcgct 180 tatgtaagaa cggccacgaa atgaagctaa caatcagcac tgatgtcaaa tggcgatgcc 240 gaaagcaggg atgtactcag acaatcaaga tgagaatagg taacttcttc gagagatcgc 300 gtctcccatt cgtcacggcc gttcgcttca tctacagctg ggctctggcg tactcgtcgg 360 gcaagttctg caggcgcgaa tgtggaatgg acgaaggcac tactgcggac tggagcaact 420 acatgcgtga agcctgcgta atgcatcttc tgaagaagtc actgcgaaag attggtggta 480 cgagcttgat cgtcgagata gatgagagca tgttcactaa gcgcaagaat aacgccggaa 540 gaagccttcc tcagcagtgg attttcggag gaatctgccg tgagacgcgc caatgcttcc 600 tcgtcatggt gccagatcga tctgctgata cgctactgcg tgctattgag gagaatatcg 660 aagatgcctc tatcatctac tcagacagct ggagaggtta caggacttct gatctggaga 720 aagccaactt cgagcatttc aaagtgaacc atcgatacaa cttcgtcgat ccggaaactg 780 gagcgcacac gcagaccgtt gagcggatgt ggggttccgc gaaatggagg aacaagaagc 840 atcgtgggac tgctcgccac cacctggaga gctacctagc tgagttcatg tgccgccaag 900 aagccaaaga cgaagatccg tttgattgga tcctgaagct actgcgtgag ttttggacgc 960 cggaagagtg actatttgct attattctgt tctttcgaac agcaagaata aatatttatc 1020 ttttatctac tcgattttta taatttgtaa aattggtccc ggataaccct atagtacc 1078 // ID TransibN3_DP repbase; DNA; INV; 1547 BP. XX AC AADE01000709; XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 13-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE TransibN3_DP is a nonautonomous DNA transposon - a fossilized DE copy. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; TransibN3_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1547 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR GenBank; AADE01000709; Positions 29631 28085. XX CC TransibN3_DP belongs to the TRANSIB family of DNA transposons. CC This element is characterized by the CACTG target site CC duplications. CC TransibN3_DP has imperfect 29-bp terminal inverted repeats (5 CC mismatches). XX SQ Sequence 1547 BP; 547 A; 292 C; 328 G; 380 T; 0 other; cactatggga aatttgacaa ctttttgtga ccacaaacca ttcttccaaa ctcattaaaa 60 gaatatatat ttagccgatc atcaatctcc agtgagcgca ccactgttaa gttaacagat 120 gtcaaaaata gctgtttaat agcgattagc gagtgttttt gcaagttgaa aagaacaatt 180 caaggagatt ttttacaatg gattcaaatc agtgagttat gtagtatata tttctgactt 240 tgaactgtag gaatggtcct tatattttgt gtaattaata atcgtttaac gagtatcatt 300 ttgtgaccaa aatgagtttt tgaggaggcc tatacaatga gtgcaaccta ttccaaccat 360 aagatttaat gcatattggg gaatattcaa atcatctcca tttaacactc aaagaggact 420 tcatattgaa ttgttttaca gactgaacag ttgtggttgt tgttctgagg gcgatgaggg 480 catcttatcg cttttgctat cgtttttaaa gaattggcga catcctaccg acatccggcg 540 tacttttagt caccccaaaa tctacgcaga tattactggt attgatttaa agctgattca 600 aaatcctggg actcttttga tgtgtcactc aagacaattg tcaacagata tggctaaatt 660 gcaaaccgga gattattatg ggcgaatatc cttggctgac aatgactgca acagtgcaca 720 aaatacttgt acactcaagt gatattagcg cgttattctt ttgagagaag gttccgaatg 780 acggaataaa attcatgcag ttgagaagat aatttcacag aagtgttcca gaatgcacaa 840 gctaacaacg gggaggggaa cagtagtaac agcagattag cagcagagta acggcagttc 900 aacagcagtt caacagcagt tcaacagcag atcaacagca gacaaacagc agagtaacag 960 cagatcaaca gcagatcaag agcagaataa cagcagttca acagcagatc aacagcagat 1020 caacagcaga gcaacagaag atcaactgca gttacagcag cgtaacagca gcaacagcag 1080 atcaagagcg gaataacagc agacaaacag cagatcaaca gcagatcaag agcggaataa 1140 cagcagagta acagaagatc ggcagtggag taacagcaga taaacagcaa cggcagaagg 1200 agagcagaag aaagcagtcg caaaaagaga atcgcagcag ccagaagtaa ctttcagcgg 1260 cagcggcagc agcagcacga atcagcagag atcaagccgt gatgtgaaaa cctattgctc 1320 ttgcaattac cagacttgca tctaaataaa attcatgcct attcttgaca gggcgcttgc 1380 aagtgcaaag aaaaagtccc tcgttactgg gagggaaagt ttagtcaaaa atttagatgc 1440 gttttcgcct taaaaaaaaa gagttcccct tcaaaaacga aaaacaaaat attgtttcgc 1500 ttggaagtta ttaattgcca caaaaagtgg caacatttcc cagggtg 1547 // ID BEL-623_AA-LTR repbase; DNA; INV; 556 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-623_AA_; KW Pao_Bel_Ele155; BEL-623_AA-I; BEL-623_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-556 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 556 BP; 180 A; 101 C; 98 G; 161 T; 16 other; tggatagcgc cggtccctcg aataactctc ggattgatag wwscgcgtcg cttcgatctt 60 ccacgcctgt caacacttct gctgctgaca ctttgaagcg agtgtcaaac cctckattmg 120 tkagaaaaac agaagsaaac cttgaackac acgaataatt gcggaatctt gcattttgcc 180 aagcaattta atatttatct aaacctaawa tttccttatc gtaagtactc tgcttaaaac 240 gaattatata aaactwattc cgaatgaatt taattatcta gattgggswc caaatctgtc 300 catcagaaag gctgtattgt atgactcagc agcttagaag tttatggttg taagtaaccc 360 aaaatttgaa atttataaag aaactattga aatgaaatcg catttatagt aatttagagc 420 taacagggaa ttcctgattg tttgttggcc aagggtagga gaacggacca aattgtaggt 480 aatccscaaa ttaattatgt cgaacawata tkaataagaa atgwatttag ctttagcgct 540 ttcatcaccc ggaaca 556 // ID BEL-45_AA-LTR repbase; DNA; INV; 580 BP. XX AC AAGE02018048; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-45_AA_; KW BEL-45_AA-I; BEL-45_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-580 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018048; Positions 22899 23478. XX SQ Sequence 580 BP; 199 A; 84 C; 121 G; 176 T; 0 other; tgttaaagaa ccacagggca gcaacgtttg gtgagttccg tatcacccgc agttgtcgaa 60 ttgacggctg gtaaatgaaa gcgtagatgg gagaaagaat gacgtagcag gcagacgaat 120 gatgaggcag aattgagaat cgaacaaata gcaaatcaag tgcaggtgga attgaactga 180 agaatttcta gttagtgact aggtaaattg tttatcctac ttaaatatta cgtaaaattg 240 ttctaattat tattacattt aatagtatct taaacctagt agtacggaat tgaagcgtat 300 gaattcgtta tttcctatag ctacattcta aaagtaagta atgcaataca attgaattct 360 aaagactaat tgttcgatga tggttgatat taggttaatg caataacgca ttgcggtttt 420 ttgaattgtt tcggctagaa ataagaagga cacgaaaatg tgagtaaatt gatttgtgca 480 aattatctcc taaattactc attgcaatat aattgcagct ttgaagctga agtaactgct 540 atcaagacgt gtcacgacct tccttacgtt atccgaaaca 580 // ID DNA8-71_AP repbase; DNA; INV; 551 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-71_AP. XX NM DNA8-71_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-551 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2007-2007 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 551 BP; 196 A; 54 C; 74 G; 227 T; 0 other; taaagaccgg atttatatgt ttttacatat cgttactggg tctatgcagt cttatgagag 60 taaaaaatgt ggacgattcg ttcaattctg aatgcgtaga aaaaagtttt ttttgcatat 120 ttgagaatat ttcatgggtt agagaatatt taacccattt agcatatttt ggaatatttc 180 gtaaattgtg agaaaaaatt tgtatttatt tttaatttta atattacggt acccagaaaa 240 attttttttg aacatttaca attttttttt tatcatttcc aactcttact aaagtgcaat 300 agtattccat tagaatacgc aacttttaat acctaattaa tacctcacta ttaaaataat 360 gttgaaaaaa aaaaatttta aaatgcatat taaagcaaat aatgatgttt attgattatt 420 tttgcatatt ttgcatattt tagcatattt tttaaatttt taagagaata taatgagaat 480 atttgaaggt tttttagaga atattagcat catattttag gtattttaag agaatataaa 540 tccggtcttt a 551 // ID Gypsy-178_AA-I repbase; DNA; INV; 7717 BP. XX AC AAGE02025061; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-178_AA_; KW Gypsy-178_AA-LTR; Gypsy-178_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7717 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025061; Positions 60076 52360. XX CC Positions [5609-6064] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3275..5605 FT /product="Gypsy-178_AA-I_1p" FT /translation="MSVLCKKRANFDVEGRRMSQKKPSTVFVPYKPEISEL FT KLDVEGDTRPFAIIDVMGVELVGLLDSGAQISILGCGSKNLLTKLKLKLLS FT TETKLTTAGGSRLDVLGYVNLPVTFHGQTKIVTTLVAPAMQRRLILGMNFW FT RMFNIEPTIQGQLGGIAEAMEAEVEKHDDAEILTADQQARLEEIKVKFKVF FT REGDKLGVTPLISHTIEFSEEFKNAKPVRLNPYPWSPEVQKYVNEELDKWI FT DSGVVERSNSDWAMLIVPVVKKNIGEDGEEAIKVRMCLDARKLNERTKRDA FT YPLPHQDRILGRLGAARYLSTIDLSKAFWQVPLNPESRKYTAFRVFGRGLF FT QFTRLPFGLVNSPATLSRLMDQVLGYGELEPNIFVYLDDIVVASNSFEEHI FT QCLEEVARRLNDANLSINIEKSRFCVPELPYLGFILSREGVRPNPDKVEAI FT VNFERPSSVRSLRRFLGMVNYYRHFVEGFSEITAPLTDLLKGKPKVITWND FT QAEIAFNILKEKLICAPILANPNFDLPFKIQTDASDMAIAGILTQENEGKE FT HVVAYYSRKLTTPQRSWKAAEKEGLAALEAIEKFRPYVEGTQFTLITDSSA FT LSFIMNSKWKPSSKLSRWSMLLQQYDMKVQHRKGTENVVADSLSRAVEAVE FT VAAQNDWYSQQYRKVEENPESFADFKIEENKLFKFVAAASDVLDYRYEWKM FT YVLEGLRKSVLKEEHDDALHPGYEKTIQKLKTRYYWPKMAIDTKRYVQACD FT VCKQCKPSSVSSNPPMGKQRLTDRPF" FT CDS join(1383..2387,2391..3311) FT /product="Gypsy-178_AA-I_2p" FT /translation="MNAEFELKYQSMNVAHLLVDEVEYELKIRQVPFNVGD FT SRDVKRRRLRQKLKEQKEKNNFEIVLEMSPEECKRDLNIVDEKIAKIHDQL FT ENRRTRKSELPELQTRLVHLYNRLMRLKEKVDTDGLEGIALNLYQEHFTVL FT TSDPDVRKRVEENLSKQLIDLSVQDRSEESTLKETEEENSSESEDRGNQNK FT DKNKELSGNTSTPKERRKSKKQKDSEEMFKKLLEHVASYLESRLSSISEGR FT RKGDSSCSEDVGIEEARKSRRKEFRGRNRRDSSSGEDQEDNVLRSQRRSEI FT ASKKKSRREVSPPKRKSRNQRRTDYEEASNEDSSSGEESEEPHVRRRPRSV FT AEWKMKYDGKDEGKKLNKFITEVEFMAEAENLGKRDLFNEAIHLFSGEART FT WYIEGKKNRDFRNWRELVTELKLEFQPPDMDFHYEQQAAQRRQKRSEKFQD FT YFNAVMEIFQFMAVAPSEQRMFDIVFRNLRSDYKNVLVVKGIRTLKGLKQW FT GRKLDSANIWMYRNREGDAVPKNAQVHEVRRDFPRNDNRPAEKKWKSAPYN FT GKSAKPDQKEGVTNISKRQEVPQDNQRPGTSKADLESRIANYRVPDRLTCF FT NCRGKYHHHRTCLLEKEVFCMVCGFHNYTKEKCPFCAKNGRTST" XX SQ Sequence 7717 BP; 2498 A; 1422 C; 1811 G; 1986 T; 0 other; cttacacgct aagatcagtc tacccaaata tgggtaaagt ttacccataa tcgctgaaaa 60 gtgcacacac tcattttatg ggtaaagtgg actctaccca taaattggta aactcctaaa 120 acccataaat gggtatatga ttattagcaa aaatatagga aaagtaccca tatatgagta 180 ctttattttt ctttattttg ttgattttca atgataatac taaaataaaa gttctaatta 240 taaatacaaa caacatgatt ttattgaaaa ttatgaatag tggggtcaaa aggatagtca 300 aatataagat gaggtgttat tattctgcac ataaaccaac atgcgcgaac actgcttctt 360 ccaagtattt atacgagccg ctgtattgaa catgatgctg ctgttgatcg gaaaaatctg 420 atcccagtgg agcgatttaa aaatcacctt caaccacatt tccacatgat gaagagtttt 480 gattccgcaa taaatgtaat ctgaaacaaa gaatagctca agtagtaatc attgttgcta 540 tttttctgtt tttgcattat aaataataat atttacttac cttaaaatta ttaatttcaa 600 tagtttttga acaatgctgt ttaaagacag caacgtcgcg cgattattgt tgatgtttta 660 tttttcatcc aaatatgggt aaacgataat acccatattt gagtaaagca actttactca 720 taatatgagt aaactttacc aattttaatt agtaaattct acccatatta taggtttaat 780 atgaaaaact caaaaatggg taaataaact acctatttat gggtaaactg atcttagcgt 840 gtagttagtc acaattggcg cccaactaca acacgcgaaa ggtaagctga ggatcgatta 900 aaattagaaa aatctaacga ttttttatag gcggcaggtg agtacacaac ttgcacagtt 960 gtgtaggatc tttgctctgt tttgcaccga ccaaaattgt tctattctct tggaaacttt 1020 aagacattgg ccgttcgcgt ttcagtttcg cttgaaaggc aaaagaatta aaaagtgcct 1080 agtgaatagc agttagactt gcaaacaaga ttgatcacaa aaatatttac acttaatgaa 1140 gagtaattaa cagcaattga aaaaggtcct gaacgaaatt gaattagcca ctacactaca 1200 caaacacttg aattacattt atttgaattt ttaccctgtt tgttagtatc taagtttatt 1260 cgtatttatt ttgaaatttt ttgaattcgt ctactatagc tttaatctag aattagtata 1320 tttgaatttt attctgaatt attttgtttt tacttgaatt tggttggttt ttcttgagca 1380 acatgaatgc tgaatttgaa ctcaaatatc agtcgatgaa tgtagcccat ctactcgtag 1440 atgaggtgga gtacgaactc aagatacgcc aggttccgtt caacgttggt gattctcgtg 1500 atgtgaagcg tcgacgattg cgtcagaaac taaaagagca aaaggagaaa aataattttg 1560 aaattgtttt ggaaatgtcc cctgaagagt gtaaaagaga tttgaatata gttgacgaaa 1620 agatagcaaa aattcacgat cagttagaga ataggagaac gaggaaatca gaattgccag 1680 agttgcaaac aaggctcgtc catttgtaca ataggttgat gagattgaag gagaaagtag 1740 acacagatgg acttgaggga attgctctaa atctctacca agaacatttc acagttctta 1800 catcagatcc agatgtgagg aaaagagtag aggaaaattt aagcaaacag ctaatagact 1860 tgagcgtaca ggatagatca gaagagagta cattgaaaga aactgaagaa gaaaacagtt 1920 cagaaagtga agatagaggt aaccagaata aggacaaaaa taaagaattg agtggtaata 1980 ccagcactcc caaagaaaga agaaaaagta agaagcagaa agattcagaa gagatgttta 2040 aaaagttgtt ggagcatgtg gctagctatc tggaatccag actaagcagt attagtgaag 2100 gaagaagaaa aggagatagt tcttgtagcg aagatgtagg tattgaagaa gctaggaaaa 2160 gtagacggaa agaatttcgt ggaaggaata gaagagatag ttcgtctggc gaggatcagg 2220 aagataatgt tcttagaagt caacgccgtt cggaaattgc ctcaaagaaa aagagtagaa 2280 gggaagtcag tccacctaag aggaaatcta ggaaccaacg aagaacagac tacgaggaag 2340 caagtaacga ggatagcagt tccggagaag agagtgagga gcctcattga gtacggagac 2400 gaccacggtc tgtggcggaa tggaaaatga agtacgatgg aaaggacgaa ggcaagaagt 2460 tgaataaatt cataaccgaa gtagaattca tggcggaggc agaaaacttg ggaaagcggg 2520 atctgttcaa cgaagcgatt caccttttca gcggagaggc tcggacgtgg tacatcgaag 2580 ggaagaagaa tcgagacttc cggaattgga gagagttggt gacagaactt aaactagagt 2640 tccaaccgcc agacatggat ttccactacg agcaacaagc ggctcaacga aggcaaaaac 2700 gttctgaaaa gtttcaagac tatttcaacg cagtcatgga aattttccag ttcatggctg 2760 ttgcgccgag cgaacaacga atgtttgaca tcgtcttccg gaatctgaga tccgactaca 2820 agaacgtttt ggtggtcaaa gggatacgta cgttgaaggg tttgaagcag tggggacgaa 2880 agctcgactc ggctaacatc tggatgtatc gtaaccgaga aggcgacgca gttcctaaaa 2940 atgcacaagt ccatgaggtg cgtagagatt tcccccggaa tgataatagg ccggctgaga 3000 aaaagtggaa atctgctccg tataatggaa aaagtgcgaa gcctgaccag aaggaaggcg 3060 ttaccaacat ctcgaaaagg caagaagttc ctcaggacaa tcaacgtccg ggaacgagta 3120 aagcggatct ggagtcgaga attgcgaact accgagttcc agacaggcta acttgtttca 3180 actgtcgagg taaatatcac caccatcgga cttgtctgtt ggaaaaggag gtattttgta 3240 tggtttgcgg gttccacaac tacacgaaag aaaaatgtcc gttctgtgca aaaaacgggc 3300 gaacttcgac gtagaaggtc gtcgaatgtc gcagaaaaag ccttcaactg tttttgtgcc 3360 gtacaaacct gaaatcagcg agttgaagtt agacgtggag ggggatactc gtccgtttgc 3420 catcatagat gtcatgggag tagagcttgt tggtctgttg gacagtggag cgcaaatttc 3480 aattcttggg tgtggcagca aaaatttgct aaccaagttg aaattgaagc tgttgtcaac 3540 tgaaactaag ctcacaaccg caggaggttc acgattagac gttttggggt acgtgaacct 3600 accagtcacg tttcacgggc aaacgaaaat tgtcaccacg ttggttgcac cagcaatgca 3660 acgtagactg attttaggca tgaacttctg gcgaatgttc aacattgagc cgacaattca 3720 aggacagttg ggaggaatag cagaagccat ggaggctgag gtagaaaagc atgatgatgc 3780 ggagattctg acagcagatc agcaggcaag gttggaggag ataaaagtta aatttaaagt 3840 tttcagagaa ggagataaac tgggagttac gccgttaatc tcccacacca tcgagttcag 3900 tgaggagttt aaaaacgcaa aaccagtaag attgaatcca tatccctggt cgccggaagt 3960 tcaaaaatat gtgaacgagg aactcgataa gtggattgac tctggtgtgg ttgagaggtc 4020 aaacagcgac tgggcaatgt tgattgtgcc cgttgtgaaa aagaacattg gagaggatgg 4080 agaagaagcg attaaggtca ggatgtgttt ggacgctagg aagttgaacg agcggacaaa 4140 acgagatgcc tacccgttac ctcatcaaga taggatcttg ggaaggctag gggctgcaag 4200 gtacttatct accattgacc tttccaaggc attttggcag gtaccattga acccggaatc 4260 ccgtaaatac acagccttcc gagtgttcgg acgtggctta tttcaattta cccgtctccc 4320 atttggcctg gtgaacagcc cagcaacgct ttcacgattg atggaccagg ttttaggcta 4380 tggtgagcta gagccgaaca ttttcgttta tctcgacgac atagtagtcg cgagtaattc 4440 gttcgaggaa cacattcaat gtcttgaaga ggtggcaaga aggttgaacg atgccaattt 4500 gtcgatcaac atcgaaaagt cacgattttg cgttcctgag cttccgtatt tgggcttcat 4560 cttgtccaga gaaggggtga ggcctaatcc ggataaagtg gaagcaatcg tgaattttga 4620 gcgaccatcc tctgtgcgtt ctcttcgccg gtttttaggg atggtgaatt actatcgcca 4680 ctttgttgag ggattcagcg aaattacagc accgctaacg gaccttctta aaggcaagcc 4740 gaaggtaatt acctggaacg atcaagccga gatagcgttc aacattttga aagaaaaatt 4800 aatttgtgct cccattttag ccaacccaaa tttcgatctt ccgtttaaaa tacagacaga 4860 cgccagtgac atggcgatcg ctggcatttt aacccaggaa aatgagggta aagaacatgt 4920 cgtagcttat tactcaagga agctgacgac ccctcaaagg tcctggaagg ctgctgagaa 4980 agaaggtctc gcagctctgg aagcgatcga gaaatttcgc ccgtacgtcg aaggaaccca 5040 gttcaccctg attacggact cgtcggccct gtcgttcatc atgaactcca agtggaagcc 5100 atcgagcaaa ctcagccgct ggagtatgct gcttcagcaa tacgatatga aggtccagca 5160 tcgcaaaggg acagaaaacg tcgtagcaga ctcactctca cgtgctgtag aagccgtaga 5220 ggtcgctgcg cagaatgatt ggtactctca gcaatatcga aaggtggaag aaaatccaga 5280 atctttcgct gattttaaaa ttgaagaaaa caaacttttc aaatttgtag cggccgcttc 5340 tgatgtgttg gattaccgct acgagtggaa aatgtacgtg ctggaaggat tgaggaagtc 5400 ggttcttaag gaagaacacg atgatgctct ccacccagga tacgagaaga caatccaaaa 5460 gcttaaaact cggtactact ggccgaagat ggcaattgac accaaacgct acgtgcaggc 5520 ttgcgatgtg tgtaagcagt gcaagccttc atcggtttca tccaacccac cgatgggaaa 5580 acagaggttg actgatcgtc cattctagat actggccttg gatttcattc agaatctccc 5640 gaggagtaaa aacggcaaaa cccacttatt agtcctaatg gacattttct ctaagtggac 5700 tatgttagtt cctattcgga agattgaagc caaggcagtc tgtcaagtgg ttgaagatca 5760 gtggtttcgg cgatatggaa cgccggaggt gattatcagc gataacgcta ccacgttcac 5820 cgggatggaa ttccaagcgt tactgcaaaa acgaggaatt cgtcattggc cgaactcaag 5880 acatcacagc caagcgaatc ctgtggagcg aactaaccgg acgatcaact cttgccttag 5940 aacctacatg cagcaggatc agcgggtatg ggataccaca atccctgagg ttgaagaaat 6000 gatcaacacc acagtgcatt cttcaacggg attctcgcca tatcgtatac tgtacgggca 6060 tgagaaagtg gtaaaaggtg aagaacatcg gctggaaagg gatgaagggg aacgttccgt 6120 agaagctcga gagaagtaca gacagaaagt tggaagaaac attagagaga tcgttaagaa 6180 aaatctggag aaaagcgatg aaaaatgtag aaaagtgtac aacttacgat tcaagaagtt 6240 tgctccaact ttcgagatcg gccagaaagt ctacaagcgg aatttccgcc aatcgtccgc 6300 agctgagcat tacaacgcga aatatgggcc attattcact ccatgcacca taattgccaa 6360 acgcggtagc agctcgtacg agttggcgga tgaaaacggc aaagcattgg gagtattttc 6420 ggctagcgat cttcgcgctg gtaatgatgc tccaaaacgc gactagaatc ctaaaataag 6480 aatactggga tttaggacca aggtcatggt ggactacggt catcatagaa ccaactcacc 6540 gtactaccac gtgggaagtc agaccacgtg cgcagatgta ggtttaagtc atgattgtca 6600 tcagattttg tgtcgatatc atgtagctgt ccagttagaa gtcattcatc gcaacatggt 6660 catcgtatcg ttgctgtatg tccgataata attgtccagt ttgttttgat tctcgtccgt 6720 gaagtcattg tcctcatacg tagtcagtct attctcgtga gtggtcttcg tcggagagaa 6780 tttgaaatac gtcatcgtct cgtcatcatt gtttcagcag tcggtgtaag tcttagctgg 6840 tgtagttgag agatgatgag atcctgtaaa acgagagaag agaatctaat tatagaaacc 6900 tagaaacaca aattaaatcc ggattgacga taaggcattg ttagtgcacg aacaatagcc 6960 gggtaacatt tatattgtac atcgcttctg atgaagcgat gacgatacga tagggtcgaa 7020 atagggaaac attcgcctac ggtaattaat cgccacttac ctgtgtgaac caatagaaat 7080 catccaatcg aaatgcgcta cttttcgggg ttgttccctc caaccaggtg gccaactggt 7140 tgaacaaacc tgaaaagaaa ctgtttcgtc agcctaacat aaagatagga acggaagtaa 7200 attcactcat cttcataaat cacgaatttc accaacaagt tgaacttcaa taacccggcc 7260 attaaagttc aactccagac ttttgcatac acggtacgcg cgcgatagaa gacgtaggta 7320 tactgaatcg tcttctgcta tagtagaccc gataaccaga taattcgaca aagcgatcgc 7380 ttgatacccg aatgaagccg tttaacggcg cttggaaaaa tgaagtagta gaatatgata 7440 agggtataca gactcacata gacactaaac agaatcactg gtggcatatt atggcaaatt 7500 ttaaatgtaa tttagctttt aattgctcaa gattttaatg tgttcatgtg acagtctaaa 7560 agaattgtat ttttaactgc tacgtgtctt tataatgaac gttgaatttg tgtttttaca 7620 ggaaaatgtc ttatcgtatg tccaaagttt tagaaaattt aaggagaaat aaatgttaaa 7680 cctcgttcaa catttatttc cccgaggtgc gggatag 7717 // ID Gypsy-71_CQ-I repbase; DNA; INV; 8027 BP. XX AC AAWU01040029; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-71_CQ_; KW Gypsy-71_CQ-LTR; Gypsy-71_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8027 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 521-521 (2011). XX DR Genome; AAWU01040029; Positions 10963 2937. XX CC Positions [4176-4679] - Reverse transcriptase CC Positions [5739-6218] - Integrase core CC 'AACT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 370..3348 FT /product="Gypsy-71_CQ-I_1p" FT /translation="MYDYHADDLVEEEIRYELDLREVRVEQGATIDSMKRS FT LRKWLKHDKENKYEYIARLTFDQEYDVIKHDLAQISTLITLRPEQKIRSRL FT VHLYYRILRSTAYQRINERQSFLDEINGLLRIYFSKQKPNSQVQTLGQVVQ FT TNPVPNPSSCQQQIIPQNVNQLSNQAVNHTTNSNSGQVSNQASNVNTNQNV FT EENLSHIWDQQIEQAVNQMMNPNPSTNPSTNTSTNMIRNEVNVSSSSSFAN FT MPASQFWTQIADQTESNHVQNVSNNTLDNRAAQSECQVIRRVKLRLGGYQV FT QFGPPQPINQNQNFSQNSMNRTSSPINLNPQVPNFNFPPPTIPSNTERNQS FT QYQVPCNPNFEFPPQSDFLRNHAQNQDQRNAHYQMPRNPNFEFPPQSNFFR FT NQAQNQDQRNTPFSGAQTNRNQNPQLGQNLTNKSFSQTNLNSETNLPFAKQ FT KAPTQHNYRTAVNTTSHLDPPNFSISSQQSNSTAINMHDLSRALNEIQFSQ FT NTTQNHSEQDQSQNQNTSQNSANANQNQPSNSQANDLVNDDFRSEMQSLIR FT NLVRDSIVDYFGDLEQARTPTMTPRSTFFQDRVEQRRNEILTSNPGNDTQN FT VNNSRSSFPFSSNNIPNNPNIPNAQNAPTSVNNPQSNFPFLSNNIPNNPNI FT PNAQNAPNSVNNPQSSFPLLSNNIPNNVNIPNNPNFPNTQNVPNNPRFNDQ FT NYNNPLKTKLERWQIHYSGEPGPRSLSVGDFVRQVSILAYSNQVSNEQLLQ FT QSHVFFTGEARRWYFTYWEKFSTWDHLIYYLKLDFEHPNKDEAVEDAIKSR FT KHRGNERFKTYLQDMERLFQELSYKIDERQKLKLIYQNTKISYKRRLLLIP FT IYTLEQLTDCCHSFDCLEPNLHPAINQSTIHNSHPINQISCSELDDEENEF FT EDDEQVNALFRDKRFKNKKFIRRENSNNSPEDNQDGIKRSTCWNCRKTGHW FT YTDCPEPRTTFCYVCGEPDCTLKTCPNKHTWRPSSPKN" FT CDS 4323..6593 FT /product="Gypsy-71_CQ-I_2p" FT /translation="MSQILHRIEKSKYFTTIDLKDAYHQVLLHPDSRNLTA FT FRTSKALYRYKTMPFGLLNSGATLTRLMTRVIGYDLSPKVWVYLDDIIITS FT DSLNEHFDLLQKVALRLHKANLTISVEKSKFCQKSVKYLGFVLSETGISVD FT AAKIQPILDYPTPRNVKDIRRLLGIAGFYQRFIENYSAIVTPISDLLKKSK FT AKFQWTEIAEEALLALKSALTSPPILANPNFSEPFVIETDSSDIAVGAVLT FT QVLEGKRRCIAYFSKKLSSTQRKYGATERECLGVLMAIDHFRPFIEGTHFT FT IMTDAMSLTFLKTMSINSRSPRLARWAMKIQDLDVDFVYKKGKDNITADAL FT SRSLNQIKAQVSDAQYEHLRDSIERFPEKYNDFKVIDRKIYKFVTNAGRPD FT DPTFRWKYVPTALEKPSIVSDTHNQAHLGFEKTLSTIQQKYFWPRMSADVL FT RFCKNCMTCKRSKVDNINPNPPCGKEKIFHRPWEMISIDFIGPYSRSKSGN FT AYALVVTDYFSKFTLIQVMRQATTAAVIKFLENNVFLLFGVPNVLISDNGP FT QFISRAFKQFLEKYNVNHWNLAVYRPNPNPTERVNRVIVSAIRSSLENKTQ FT KDWDQGIQFIASAICNTVHDSTGYTPYFVNFGRNIISSGKEYEHLTDTDQN FT ITQDPSKISNDMKNLYEAVRINLARAYKRYAGNYNLRSTRNIKFEPGEIVL FT KKNFYQSDKSANFNAKLAPKYSEAKVKRQTGTNTYELWDTNMHKRLGVFHS FT SVLKKI" XX SQ Sequence 8027 BP; 2686 A; 1707 C; 1383 G; 2251 T; 0 other; ttttggcgcc caactaccgg atttagtatt gattttggtt tattcattta cttttcctat 60 tactgcgttg tgataatttg cagtttactc agtttattat tataattttc tcttattaga 120 gagtttgaaa attcaatttt tattatcatt tactgaaaat tatactttat ttcagttatt 180 tttttttttt cattattttt aagcccaact gaatttcaat tttccccacc tttcctttaa 240 acctatcttt tcaaatcttt tttttttctc atttgctcat ttcaattttt tgaaccccgt 300 attattatat tgtttatatt gtttaattgg atttccaaat caaagtttaa cgttaaataa 360 ctgtacaaaa tgtacgacta tcatgcagat gacttggtcg aagaagagat caggtatgaa 420 ttagaccttc gggaagtccg ggttgaacaa ggcgctacca ttgatagtat gaaacggtct 480 ttgcgcaagt ggttgaaaca cgacaaagag aacaaatatg aatacatcgc gagactaaca 540 tttgaccagg agtacgatgt gatcaaacat gacctagcac aaatcagtac acttatcacg 600 ttgcgtccag aacagaaaat ccgatctagg ttagttcacc tttactatcg tattttacgc 660 tcgaccgctt accagcgcat taatgaaaga cagtctttcc ttgatgagat taacggactt 720 ttgcgaatct atttcagtaa acaaaaaccc aactctcagg ttcaaacctt aggacaagta 780 gttcagacta acccagttcc aaatcctagt tcttgccaac aacaaattat tcctcaaaac 840 gtaaatcaac tatcaaacca ggcagtaaat cacactacaa attcaaattc aggacaagtt 900 tcaaatcaag cgtcgaatgt taatacgaac cagaatgtag aagaaaactt aagccatatt 960 tgggatcagc agattgaaca ggctgttaat cagatgatga atccaaatcc ctcaacaaat 1020 ccctcaacga atacatcaac aaatatgatc agaaatgagg tgaatgttag tagttcaagc 1080 agttttgcaa acatgccagc ttcacaattt tggacccaaa tcgccgacca aactgagtca 1140 aatcatgttc agaacgtatc aaataatacc ttagataata gggcagctca gagcgaatgc 1200 caggtgatca ggcgtgtaaa attgagatta ggtgggtacc aggttcaatt cggaccaccc 1260 caacccatta atcaaaatca aaacttctca caaaattcta tgaacaggac atcttctccg 1320 atcaacctca acccacaagt gccaaatttt aatttcccac ccccaacaat tccatcaaat 1380 accgagcgca accaatcaca atatcaagtg ccgtgtaatc caaacttcga gtttccccca 1440 caatcagatt tccttcgaaa tcatgctcaa aatcaggatc agagaaacgc acactatcaa 1500 atgccgcgta atccaaactt cgagtttccc ccacaatcca atttttttcg gaatcaagct 1560 caaaaccagg atcagagaaa tacaccattt agcggagccc aaaccaatag aaatcaaaat 1620 cctcaactag gccaaaacct taccaacaaa tcgttttcac agaccaactt aaattcagaa 1680 acaaatctac cctttgcaaa acagaaagca ccaacccaac acaactatag aacagcagtg 1740 aacactactt ctcacctaga tcctcctaat ttttcaattt caagccaaca atcaaacagt 1800 accgcaataa acatgcatga tttgagcaga gctctcaatg aaatacaatt ctcccagaac 1860 acaacccaga atcattctga acaagatcaa tctcaaaacc aaaatacgtc ccaaaattct 1920 gctaacgcta atcagaatca accctcgaac agtcaggcaa acgatttagt taacgatgat 1980 tttcgttcag aaatgcaatc tctaattaga aacctggtca gggattcgat cgtagattat 2040 ttcggtgact tagagcaagc gagaacccca acaatgacac caagatcaac ttttttccaa 2100 gaccgtgtcg aacaaaggcg taatgaaatt cttacttcta atccaggaaa cgacactcaa 2160 aatgttaata actctcggag tagttttcca ttctcgagta ataatatccc taacaacccg 2220 aacatcccta acgcccagaa tgccccaacc agtgttaata acccacagag taattttcca 2280 ttcttgagta ataatatccc taacaaccca aacatcccta acgcccagaa tgccccaaac 2340 agtgtaaata accctcaaag tagttttccg ttattaagta ataacatccc aaacaacgtg 2400 aacatcccta acaaccccaa tttcccaaac acccaaaatg tcccaaataa tcccagattc 2460 aatgatcaaa actacaacaa cccgttgaaa acaaaactag aaaggtggca aatacattac 2520 agtggtgaac caggaccaag atcgctatcg gttggagatt ttgtaagaca agtctcaata 2580 ctggcatatt caaaccaagt ttctaacgaa caattgcttc aacagtctca tgtgtttttc 2640 acaggggaag ccagacgttg gtatttcacg tactgggaaa aattttcaac ttgggatcac 2700 cttatttact atttgaagtt ggactttgaa catcccaaca aggacgaggc ggtcgaagat 2760 gccattaaga gtcgcaaaca tcgtggaaac gagcgattta agacatactt gcaagatatg 2820 gaaaggttat tccaggagtt atcatacaaa atcgatgaaa ggcaaaaact taaattaatt 2880 taccaaaata caaaaatctc atataaaaga cgcttattgt tgatcccaat ttacactttg 2940 gaacaactaa ccgattgttg tcacagcttt gactgtttgg aaccaaatct tcaccctgcg 3000 attaaccaat cgacgatcca caattctcat ccaataaatc agatcagctg ctctgaattg 3060 gatgatgagg aaaatgaatt tgaagacgat gaacaagtga atgcactttt cagggataaa 3120 agatttaaga acaagaaatt cataagaaga gagaattcga acaactcacc agaagataat 3180 caggacggta tcaagcgtag cacgtgttgg aactgcagga agactggaca ctggtacacg 3240 gattgtccag aaccacgtac gaccttctgt tacgtgtgcg gtgagccgga ttgtacgttg 3300 aaaacttgtc cgaataagca tacgtggcgt ccaagtagtc caaaaaacta ggtggcggga 3360 cggcccagga cgtactgccg tcgtcccaag ttgatttcaa ttatttcaat cgagttttcc 3420 acgtcaatac gagcccttca cgatgtccgc acgcaaccgt tgaaatcctc aatgaaaggt 3480 taacaggact tctcgattca ggagcgaata ttacggtgac taacgcagta gatcttctcg 3540 agcgattaaa tctaaaaata gtccaaaatc agattcgtgt tcaagcagcc aatggagctc 3600 agcttacctg tatcggtttc gcgtacattc ccttcacgtt caataataag acgaaagtca 3660 ttccaaccgc gatcatcccg gaattgtcga aagacttgat cctcggaaca gatttttgga 3720 aagcgttcga tatccacttg tcaatcggcg gtaaacctgt cccggaaggc agcgcgatcg 3780 atttcgatgt taattccata tcgcacgtgg cagaatacga tcacgataac atctgtttca 3840 gcttagaagt agacccgtca ttccgcggcc cagtagaatc ccctccggtc gacgaaagtt 3900 tagatattcc ttcaatagaa atttccaatc acgatcttac ttcggttgat gagattaaaa 3960 cagaacacgc gctgtccgaa cctgaaaaac tgaaattatt tgaaatattg aaagcttttc 4020 cacgaactgt gaacggaaaa ataggacgaa cacaacttat tcaacataaa atcgtactaa 4080 atcaaagtct gcacgaaatc aaacacaaaa aagttccgat ttatccaatt tcacccaaaa 4140 ttgaaaaaga agtagacaag gaaattgaac gactcaagag tttagacctg attgaagaat 4200 gtgaaagtga tttcattaac cctttgcttc ctgtgcggaa aggagaaaat aaatggcgtt 4260 tgtgtcttga tgcgagacga ttaaattcat tgacaaaacg ggatgaatac ccattcccaa 4320 atatgtcaca aattttacac cgaatcgaaa aatctaaata tttcacaaca attgatttaa 4380 aagacgccta ccatcaggta ctcttacatc ccgattcacg taatctgacc gcttttcgca 4440 cctcaaaagc gctttatcgg tacaaaacga tgccatttgg gttgttaaac agtggtgcaa 4500 cattaacccg attaatgaca cgtgttatcg gttatgattt gagcccgaag gtgtgggtct 4560 accttgacga catcattata acgtccgatt ctttgaatga acattttgac ttgcttcaaa 4620 aagtcgcact tcgcctacac aaggccaatt tgacaatcag tgttgagaaa tccaaatttt 4680 gccagaaatc agtaaaatat ttaggattcg tattgagcga aactggaatc tcagtagatg 4740 cagcaaaaat tcaaccaatt cttgattatc ctaccccaag aaatgtcaaa gatattcgaa 4800 gattgttagg catcgctgga ttttatcagc gtttcattga aaattacagc gcaatcgtga 4860 ctccaatttc tgatctattg aagaaatcaa aagcaaaatt tcaatggaca gaaatagccg 4920 aagaagcact tcttgcccta aaatccgcgt taacttcgcc cccaattttg gcaaacccaa 4980 attttagtga acctttcgtg atagaaaccg acagttcaga tattgcagtc ggtgcagttt 5040 tgactcaagt gttggaggga aaaagacgtt gtattgcata tttttctaaa aagttatctt 5100 caacccaaag gaagtatgga gccactgagc gtgaatgcct tggtgttttg atggccattg 5160 accatttccg gccattcatc gaagggactc atttcaccat tatgaccgac gcgatgagcc 5220 ttactttttt gaaaacaatg tccattaata gccgttcacc tagactggcc cggtgggcga 5280 tgaaaattca agatttagat gtcgattttg tttataaaaa gggaaaggac aacataacag 5340 ctgacgcact ttcgcgctcg ctgaaccaaa tcaaagcaca ggtttcagat gcacaatacg 5400 agcacctgcg ggattcaatt gaacgatttc ccgagaaata taacgatttt aaagttattg 5460 ataggaaaat atataaattt gtcacaaacg caggaaggcc agacgatcct actttccgtt 5520 ggaaatatgt cccaacagcg ctggaaaaac cttcaattgt cagtgacacc cacaaccaag 5580 cgcatttggg ttttgagaaa actttaagta caattcaaca aaaatacttt tggcccagaa 5640 tgagcgctga tgtgctccga ttttgtaaaa attgcatgac ttgtaaacga tcgaaagtgg 5700 ataacatcaa ccctaatcca ccctgtggga aagagaaaat ttttcatcga ccttgggaga 5760 tgatttcaat tgattttatt ggtccttatt ctcgttcaaa atcgggaaac gcctacgcac 5820 ttgtagtaac ggattatttt tccaaattta ctttgattca agttatgcga caagcaacaa 5880 cagcagctgt gataaaattt ctggaaaata atgttttttt actttttggc gtaccgaatg 5940 tgttgatttc agacaacggt ccacaattta tttcacgagc ttttaaacaa tttcttgaaa 6000 aatacaatgt taatcattgg aatttggcag tttacaggcc caatccaaac cctaccgaac 6060 gagttaatcg agtgatagta tcagcaattc gtagttcgct agagaataaa acccaaaaag 6120 attgggatca gggaattcaa ttcatagcat cagccatatg caacacagtt cacgactcaa 6180 caggttacac accgtatttt gttaactttg gcagaaacat aattagctca ggaaaagaat 6240 atgaacacct aacggacaca gaccagaaca taacacaaga tccatcaaaa attagcaacg 6300 acatgaagaa cttgtacgaa gccgttcgca taaacttagc gcgagcatac aaacgttatg 6360 ctggaaatta caaccttaga tcgacaagaa acataaaatt tgagcctgga gagatcgttc 6420 tcaaaaagaa cttctaccag tcagacaaat ctgcaaactt taatgctaaa ttagcaccca 6480 aatatagcga agcaaaagtg aaaagacaaa ctggaacgaa tacatatgaa ttgtgggaca 6540 caaacatgca caaacgcctt ggagtctttc actcctcggt tctaaagaaa atataaatat 6600 ataaatacag tctagacttt aggaccgaga cttcacactt accaactctt ttgttacatt 6660 ttttccggct atgtcgatct cgccatcaga gtcgagccag tctcctcgat gtgctgagac 6720 caacaataga taaatatata acacacacaa acacttagtg gatttttttc ctgtaatttt 6780 aatatgtatg tcgtcgtgca ataattttct ctgttaagtt taccaataca ctattttgaa 6840 ataatatttt cgaacaaaga tggccccaaa aacattattt catcttacaa ttttctatta 6900 gattttcata actttgccac cacatttcaa tatttccacc agtagtagtc gcccatcaat 6960 ataatcacca ccagaatttt cctttttcct acgatcatcc catttcccaa cccattatta 7020 tctctctaaa actcagattt atcctcccac acattttttt tctcgagacg tgtgtttcct 7080 ttagtcaccc ataacacact cacatgatca ccttagattg gcaacataat ttccgctaaa 7140 atcagtattt tcatttaatt ttccccagac cgtcgcgtag aatttgattc gctgtaaagt 7200 accgacgaag actgattttg tttttcaaga ttggcactac ttactatcca caatattaca 7260 tacacaatag tacaaggcgt tggagcactg atagggattg catgcgtagt tgtattaact 7320 attatcaggg aatacagaga tgagtgagat cgagtgcacg taggttcgcg tgatagtgtg 7380 atagggacga atgcatcagt tgtgaccgcg gtttgatcga gagtagtcga gcgtgcgcga 7440 gtttagtttt taagtgcgtt cttcgcgata attaatcgaa ttttttcgaa aagaaatttt 7500 agtttaccgc atgatcgatc cagagtgacc atatatatat ggtaaatttg ttttaattag 7560 aagccaaaaa ggccaaaaac cagagatttc tgggagtttt ttatgctaat tttaaggtgc 7620 gaatgatttg catgtgcgac aggtgcaaaa ggcggtgcga tggtatgtgt aggcgctata 7680 agttcccgtc gttctccaat gcgcaataga ccggatattc tatgcagcgc tttaacaagt 7740 aagattaact tataaattta gttcaagccg atgaaaaaaa gatggccgtt atttgtattt 7800 ctctgttgag caacccttct tcttctatct acttattatc ctcttattat tatacgtgct 7860 ctcttttgtt atacttttta aaattgatct gaatcataaa atcaaacttt tatagatttg 7920 taagtgtaaa taaaactaat tttaatacaa ttttacgtaa aaattgcctc agtgaaaatt 7980 ccaatcacag tagcaattga aattttcttt tttttgttgg ggggtaa 8027 // ID Gypsy-7_DWil-LTR repbase; DNA; INV; 840 BP. XX AC scaffold_181088; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_DWil_; KW Gypsy-7_DWil-I; Gypsy-7_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-840 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181088; Positions 34533 35372. XX SQ Sequence 840 BP; 322 A; 120 C; 200 G; 198 T; 0 other; tgggaaagtt cagcaaatca tcaacgataa ggcacctaga agtactaaga tagctccttt 60 taagatccta accggtattg acatgcgtag atcagaggat gtaaacctta aagaactact 120 cgatgatctc taatcgaaga accagcagaa cacagaggga aaatacgaat ggaggcagtt 180 gaaaatataa aacaaattca ggaggagaac aagaagtcag ctgatctaaa gagaaaagag 240 ggaaaacgat tcaaaataaa tgatttggtt gctattaagc gtacccaata tggcgttggt 300 ctgaaactta aaggaaagta ttaaggaccc tataaagtaa cggcaattaa aaatcatgga 360 agatatgaag tagagaaagt tggacaggcg gaagggccta ataggacatc aactgtatcg 420 gaatatatga aattgtgggg accgtcattc gggtcgaatg tacagtcagg agggccgaat 480 gtgggaaatg gtattgagag agagcaatca aaagtaatag aagagaggag acaaacgcgt 540 agtggccgaa cattctagta gtagtagaga gagcagggca taggatcatt atataagttg 600 agagagagct tgttaaagag gccggggatt cgcaaagaga ctgtcaactg agtcgcacgt 660 caaggtccat aattagttta gtgttatttc tataagtatg tttaaattct aacaatatat 720 tgaactcgat tgtcatcaaa caattcaaat attaataaat aaataaatat agtaacgatc 780 gctaaatcaa acgctacatt tttgggggct caaccgggat attgtttaaa gaaattaaca 840 // ID Gypsy-8_TCa-LTR repbase; DNA; INV; 824 BP. XX AC chrUn_2; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_TCa_; KW Gypsy-8_TCa-I; Gypsy-8_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-824 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_2; Positions 25018 25841. XX SQ Sequence 824 BP; 221 A; 154 C; 179 G; 270 T; 0 other; tgtagcataa ttgcctattt taacaattaa atattttcta taagtaattt gcgttattta 60 tttcgaactt attctttcga taaaatgaac tttcgggcgt accgacgggg gtgcgcaact 120 ctcacttgtg ttctgtcaat tgtcgatact ggagaacaaa ggaatcgagt ttggtagcgt 180 gaggacgcag aaaaaattag gtgagttgat ctcacagagt atttatcgtt aataattaat 240 cttaagtaca attgatcaat ctagacttcc taattacgtg ataatatagg ttataacgct 300 tttcaatgtc ttggtcttgc cacgtggttg tgatgagagt gaggttaggc tgggctggct 360 ctccgccctt cctattgcac aaacctcgac tctgtaaata gaattgatgt gactggttgg 420 ctctccacca ggtcttagtg atatttggcc ttccagccga tttctccgtg gtactccggg 480 agtaatcata tggcttggcg cagcacctct tggctctcca gagtgctggg gattgagcat 540 tgaaagcgct tcagattagc acttttggcc ctccaaagtg cggagtttca atattcaagt 600 aaactgagta gtactcgaaa ttagagattg taaaatcgtt tggggccaaa cgtactgttt 660 gtttatctgt tgtcgcatcc ttgacgttaa cataaatact cagaccccat ttttctaatt 720 tgttacaggt aaagcggagc aatgatggtc tatcaaccgc tgattggtta attaattaag 780 taaatcaatg tagtaaagtc aatttttgca aagcccgtct taca 824 // ID Gypsy-104_AA-I repbase; DNA; INV; 5751 BP. XX AC supercont1.206; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-104_AA_; KW Gypsy-104_AA-LTR; Gypsy-104_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5751 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.206; Positions 124906 119156. XX CC Positions [3305-3817] - Reverse transcriptase CC Positions [5161-5580] - Integrase core CC 'AAAA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2702..4270 FT /product="Gypsy-104_AA-I_2p" FT /translation="MIQYKYKVDYELWMLFILNRNKELTVPIQDNLNRAII FT LPQRCEIIRRIPSLTQCQEDSVVCSEEIHPGIFMGNTIVNSQNPYVKLVNT FT LEVPVLIENITPTTLPLKNFYVHTSTHNTIPQSRAKILADTIKIDNANPNL FT KNKLQKLLNEYNDIFHLPGDRLSTNNFYEQKINLEDKQPVFIANYKQIHAH FT VPEIKNQVNNLLENGIIEPSVSHYNSPILLVPKRTQNGQKWRLVVDFRQLN FT KKLFPDKFPLPRIDAILDQLGRAKYFTTLDLMSGFHQIPLEEHSKKYTAFS FT TSTGNYHFKRLPFGLNISPNSFQRMMTIALAGLTPECAFVYIDDIVVIGCS FT ENHHLGNLRIVFERLRKYNLKLNPEKCCFFKQEVTYLGYKITDKGILPDDS FT KYDVIQKYPVPKNADEVRRYVAFCNYYRKFIQNFSEIAQPLNKLLRKNTLF FT EWSDECHQSFELLKTKLLQPPLLQYPDFQKEFILTTDASSTACGAVLSQKY FT NEVELPIAFASRTFTKSELNKAIIEKN" FT CDS 4372..5580 FT /product="Gypsy-104_AA-I_1p" FT /translation="MKKPSNKLQRIRLDLEEYDFEIEYVKGKQNVIADALS FT RINITSAELKTINVITRSMTKKKNPEEDKVSILQPDKLRMFNALSFEEVLN FT LPKLEFGIINELNSVICTPKIKNKTSKKDLLLVRGIHLNNKGNDALVHLFQ FT SIEKDAIKLNLNQLALSTDDEIFEFITAEALKKIANISMESIEILLYEKPK FT IIKDPRMAAEILKKTHNSPTGVHLGQTKMYIKLRKEFYWKKFKESIQNFVN FT NCKHCKMNKHQKPTHEKFIKTTTPCRPFEIISVDTIGPFLKTNYNNRYAVT FT IQCDITKYVTVISIINKEASTVAKAVVEKFMLTYGTNITAVRTDMGTEYRN FT ELFKSIGKLLKFEHKFSTPYHPQTIGALERNHKCLNEYLRIFTNEHINDWD FT EWVNYYTFA" XX SQ Sequence 5751 BP; 2200 A; 1072 C; 1007 G; 1472 T; 0 other; tggcgaccgg acgcatcgag cacaacagtt cctcaggcta gcacaattac agtgtgatct 60 attccgagac aaatttagga aagaacagta agtgaagtaa accctgcgca gaacgaaagt 120 gataatctgg actagtgatt aaaatgatga gtgcactatt tgccggaatt ttagcataag 180 cattgttact acacttgcta aaaatgacaa aagattttgt tgaagaactc aaggaaacta 240 tgaatttggc aaacgacgcc acccggtaaa cccgtcctag tgctacgcgt aataaaatgg 300 gaaagtccgc tagcaaaact gttagcacca caggagaccc ccaagtgaac attgttaatc 360 agttggaaga ccacagtgtg cgccacgaag gccacgaatt aaaattgtgg ctcttgctag 420 cactgaattt gcttcaaata ctgtgggctg tgttccaatg gaacacgaaa aggattaaga 480 ccaaagcctt ccgcaaaggg ttgaaatccc aagacaatct aaacagggtc tagagtgaaa 540 tttaggaaga ttaaaagaca tctagtggaa aaactatgaa aaacatttag taaacttaat 600 ctctgcgcgc aatgaagaaa gaacgattat caaacgtgct accgcagcat aaacagacct 660 tggaaaatct agcaatcaag aaatcgaact tcggtcaaaa agtgaaaagt gaaaccaaaa 720 gaacaaataa gcacggacaa caagtgaaaa aaacagatga gtgcatgata ttgcagccag 780 tgaaattcca aataatcgag caactaccgt ggctggaaga tcggttggaa gaaatgcgcc 840 aatggggaat acgcacgaaa atcgacgttg agaaggtcaa ggcactggcc gcaaagctac 900 aacaaccgaa gaagaaacca accacaacaa tccgcatgaa gccagctgag tcaacaaaat 960 cgagcacatc aacgaagcag taatacatcg atcgtcagta atagagcgct tcaagcaata 1020 tgtaagtaaa ttaagttata aaaaaaaaat agacaacaca tacccaatat agataaaaaa 1080 aaagtgtaat gaaccttgta caaaatcaat ttttcatcat aaactgccag cgaaaatttt 1140 cctttaattt cttttcatag tcgcattcac cttaaatgga caatgaagtg gatgtgctca 1200 taaaaaaata gtcaatatac aaaataactt caggaaatcg cctaatcgga agtatcttag 1260 acaaactttg attaagaaag cgattgaaag ccgggactta tatcatcaac ttttaggaaa 1320 acttgaattc catccgcagg gaaaattaat cttaggtggt gtacgaaaca cctatagcga 1380 aataaaaata ttcattgacg gtaggctagt tttagattct aatttaccaa gtttcaaaac 1440 tgtggcacaa acaattttaa gtgctataaa aattaaaaac atcttcgaag ccaccaaaat 1500 ggcaacaata cttgaagtta ttaagatagc ttccacactt atccctaact acagcggaaa 1560 cccggataaa ttagaatctg tagtttctgc tttacacgct ctggacacta ttgtaacaga 1620 tgcaaccaga acagctgcaa taaacgtagt tttatcaaaa ctggaaggta aaccacgctc 1680 cgccgtcgga aatgcacccc aaacgattaa cgttattgtg caaaatttgt gggataagtg 1740 taaaaacaca caaacatcag aacttctttt agcaaaatta tctgcaactc gacaaacagg 1800 gacactgtcc aattttacag atgaggtcgc tagaaaatat atatctagca gaaaatatcc 1860 ccgacaacac cgccagtaaa atggctgtga aagcgggtat aaaggcattg tgtggtggta 1920 taaaaaatac tcaaacacaa cttatattaa aggcaggcaa ttttgagacg cttaatgcag 1980 cgattattaa agccacagaa aacgatacaa acacctacga agaacagggt aatgacaacg 2040 tgaatgtctt tgctatgcaa aatagatttc acaacaactt gagaggtcga ggaagaggag 2100 ttagaaattt ctcgaataat gctcatttcc ctcgagaaaa tggctacccg caaaattttt 2160 atcagcaagg ccaatacaga ggacaaacta ggtacaggaa taattggtca tccagaggaa 2220 atttcaaccc cttcggtggt cgtggaagac ttaggccccc acagcattct atgtatgtag 2280 cacaacagga aaatatcgaa aacgatgaac aacagggtca agatgatcag cagcagcaac 2340 aacaactatc aactcccaac cagaatcacc ctttaggcgt tcagttcggg cagcatacac 2400 cataaatgta tctgcctgca actttgtgaa attaaaactt ggattatcgg aagtagaatg 2460 tacctttttg ttagattctg gttccgatat ttcaattatt aaagccagta aagttaaatc 2520 tgaccaaatt tattatccct cggaaaactg caatattaaa ggcgtcggtg aaggaacaat 2580 tacttcacta ggcagcactc atacatgcct tcatatagaa ggcgaaaaaa ttaatcaatc 2640 ttttcaaata gtgtcgaatg gcttcccgat accaactgat ggtatcctag gaagagattt 2700 catgattcag tataaatata aagttgacta cgaactgtgg atgcttttca ttttaaatag 2760 aaataaagag ttaacagtcc ctattcaaga caatttaaat agagccatta tcctacccca 2820 gcgttgcgaa ataattagaa gaattccttc tttgacacaa tgtcaagagg attcggtagt 2880 atgctcagag gaaatccacc ccgggatttt tatgggtaat acaattgtga atagtcaaaa 2940 cccttatgtg aagctagtga ataccttaga agtaccagtt cttatagaaa acattactcc 3000 tacaacctta cctttaaaaa acttttacgt acatacatcc actcataata ctattcctca 3060 atcaagagca aaaattttag ctgatactat caaaatagat aatgccaacc caaatttaaa 3120 aaataaattg cagaagttat taaatgaata taacgacata tttcatttac ctggagaccg 3180 tttaagtact aataattttt atgagcaaaa aattaattta gaagataaac aacctgtgtt 3240 catagctaat tataagcaaa tccacgcaca tgtaccagaa attaagaatc aggtgaataa 3300 tttgttggaa aatggaatta tagaaccttc agtatcgcat tacaattccc cgatactgtt 3360 ggtgccaaaa aggacacaaa atggacaaaa atggcgatta gtggttgact tccgccaatt 3420 aaataaaaaa ctttttcctg ataaatttcc attgccaaga attgatgcta ttttagatca 3480 acttggcagg gcgaaatatt ttaccacact cgatttaatg tcgggatttc atcagatccc 3540 attagaagaa cactcaaaaa aatatactgc gttttcaacc tcaaccggta attatcattt 3600 taaaagatta ccattcggac ttaacatatc acccaatagt ttccagagaa tgatgactat 3660 tgcattggca ggcctaacac cagagtgcgc gtttgtatat atagacgaca ttgtggtgat 3720 aggctgttct gaaaatcatc atctgggaaa ccttagaatc gtttttgaaa gattacgcaa 3780 atataacctg aaattaaacc ctgaaaaatg ttgcttcttt aaacaagagg taacctattt 3840 aggttacaaa attacagata aaggcatttt gccagacgac tcgaaatatg atgtcatcca 3900 aaaataccca gtacccaaaa acgccgatga agtaagacgc tatgtagcct tctgtaatta 3960 ctataggaaa tttatacaaa acttttcgga aatcgctcaa ccattaaata aattactaag 4020 aaaaaacaca ctatttgaat ggtccgatga atgccaccaa tcttttgaac ttttgaaaac 4080 aaagcttcta caaccaccac ttcttcaata tccggatttc cagaaggaat ttatactaac 4140 aactgatgca tcatcaacag cttgtggcgc tgttctctca caaaagtata acgaagtgga 4200 gctcccaata gcatttgcaa gtagaacctt tacaaaaagt gaactaaaca aagcaattat 4260 tgaaaagaac tagccgcaat acattgggct atagaatatt ttaaaccata tttgttggga 4320 cgaaaattcc gagtgaaaac agatcatagg cccttagtat atttatttgg aatgaagaaa 4380 ccttccaata aattacagag gattcgtctt gacttggagg aatatgactt cgaaatagag 4440 tatgtaaaag gtaagcagaa tgttatcgct gatgctcttt cacgcatcaa cattacttca 4500 gctgaactaa aaacaatcaa tgtaataacc agaagcatga ctaaaaagaa aaaccccgaa 4560 gaggataaag taagcatctt gcagccagat aaacttagga tgttcaacgc tctttcattt 4620 gaagaagttt taaatcttcc aaaattagag tttggtataa taaacgaact aaatagtgta 4680 atctgtacac caaaaataaa aaataaaact agcaaaaagg acctcttatt agtaagagga 4740 atccatttga ataataaagg aaatgatgca ttagtgcact tattccaaag tatagaaaaa 4800 gatgcaatta aattaaactt gaaccagctg gcactatcta ctgatgatga aatttttgag 4860 tttataaccg cagaagcatt aaaaaagata gcaaatatat ccatggaaag tattgaaata 4920 ttattatatg aaaaaccaaa aattataaaa gatccaagaa tggccgctga aattctcaag 4980 aaaacccaca acagtccaac cggagtacat ctgggtcaaa ctaaaatgta tattaagcta 5040 agaaaagagt tctattggaa aaagttcaag gaatcaatcc aaaattttgt aaacaattgc 5100 aaacattgta agatgaataa acaccaaaaa ccaactcatg aaaaatttat aaaaactaca 5160 actccttgca gaccgtttga aattatttcg gtagatacca ttggaccatt cctaaaaaca 5220 aactataaca ataggtacgc agttaccata caatgtgata ttacaaaata tgttacagtt 5280 atttcaatta taaacaaaga agcctcaaca gtagccaaag cagtggtgga aaaatttatg 5340 ctaacatacg gaacaaatat aacagccgtt aggactgaca tgggaacaga atataggaat 5400 gaattattca agagcatagg aaagttattg aaatttgaac ataaattctc cacgccatat 5460 cacccacaga ctatcggtgc gttggagagg aatcacaagt gtctaaacga gtacttgcga 5520 attttcacta acgaacatat taatgattgg gatgaatggg ttaactacta cacattcgca 5580 taaaacacct ctccaaatct agatcacgga tatacgcctt tcgaacttgt ctttggtaga 5640 acagaaaaaa ttcagaacca atcccttaca gaggtagtac aaccacttta taactatgac 5700 gattacgcaa aagagctgca atatagactt ttaggcctat ataaggattt t 5751 // ID Chapaev3-1_AA repbase; DNA; INV; 5071 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-1_AA is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5071 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 42-42 (2008). XX DR [1] (Consensus) XX CC Chapaev3-1_AA belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-1_AA is a young family of mosquito Chapaev3 transposons: CC genomic copies of Chapae3-1_AA elements are ~98.7% identical to CC their consensus sequence, which was derived from multiple CC alignment of four Chapaev3-1_AA elements. Chapaev3-1_AA contains CC imperfect 1300-bp terminal inverted repeats and encodes a 574-aa CC transposase. This family is an outgroup among identified CC Chapaev3 transposons: the Chapaev3 TPase is only 35-39% identical CC to other Chapaev3 transposases. XX FH Key Location/Qualifiers FT CDS 1525..3294 FT /product="Chapaev3-1_AAp" FT /note="transposase." FT /translation="MFLHSSEKQSVNCKVKMDNSKCDLVRSLLCNICGLYT FT PRHKQRNFTQLVIEKYKQYFKRDPQINENFSPNHLCVCCHSMLLKKEHRSI FT LAVPMIWSVPDVAHMYCYSCLMPSLVGRKWCQRSEIQYPKRSLCTLPEKVL FT PSDGMESEEEETCCPDNHEDDLPVINSTPTIATSSSYHPQPSTSAGHADNP FT DLVGQASFDDLTRDLNLPSNLSELLGSRLREKKLLKTGTRTANRSKFDDFR FT STFDEHNGITYLRNLHGLFELFEVDHDPTEWRLFIDGSVSSLKALLLHNGN FT VYPSIPVAYSNIHKEKYNVLQDILYLINYPAFKWKIIADFKLINILMGLKS FT GNPKFPCFICLWDRNNDGDKYSESAWPLRPSFDKYPGNKPDLYSAILEPLV FT APEDILHPPLHIKLGLVTQLFKKIVQQNSKVKEHLKDLFPKLSDMKVEMGI FT FDGPKIRDIFKSTDFDNILTIDEKSAYQNLKLVCAGFLGNNRANNYKELIS FT NMMTCYKKLDINVTLKMHSLICHLARFPSNLGSFSDEQGERAHQDFKDIEQ FT RFKGKNNVNAMGTYCWGLVRQKDPEEHKRQSPTKTKYFVVQYM" XX SQ Sequence 5071 BP; 1721 A; 882 C; 900 G; 1568 T; 0 other; cacagttcga caaaaaaagt cgatctgaat gcacagagac gagacacccc gggcttgaaa 60 gtatgaaaaa agacaaaaat aatggcgttt acagaatgtt cgtgtctgcc ccaggtcatt 120 tttaaaatat acttgaacat tttgcatttt ctattaaacc cctctaccga cagcttcatt 180 ttttaccgct aaaaaatatt caaatcactt tttgtttctt gatatttttg caccattctt 240 tcacaagttc tcaaaaaact cttctagttt ttgaatatgt gtcaatattg ataattagtc 300 atctagatcc agagatgttc caaaattcct tgggggaccg gcgcgtagcc ataacccacg 360 taaatatctc aggctacaga ttttttatcg tattcggata ttctttcctc gaaatgatac 420 attaaggtga acatgttgtg agaaaatgaa gcaatttggt gcagccgtct ttgagtaatg 480 agcatttatg tttctggtac tatcctggac aattaaagat cttgaaaact ctaaaaaacc 540 tcatatcgta attttcagat ttctccaaga aaactgaacc gatttgtatg attttttcag 600 agtagctcct tattacctga tattgtatag tacacatttt atttttttga taaattgacc 660 aaaaacaaaa tggccgccaa agacatttta tatggaaaat gtcggtcccc caaggaacat 720 cggaatatct tcaataccag atcaccaatg attaatatta gcacggtttc ttaaactaga 780 agagtttttt gagatcttgt gaaaaaatgg agcaaaaata tctagaaaca aaaaagttat 840 cgcgatttga atattttttt tgctgttaaa aatgaagctg ccggtagagg ggttcaatat 900 ctaatctaat caacaacccg acacagtgtt ttattttcag gacagcgaaa ttcatgtgcc 960 atataatgtt ttaaacttga aattcaatat gaatagttgt actaagattt gcatggactc 1020 ccccccccct ttcgtaccct ccccattttt aaagtatcga aaacgatgga atatttgata 1080 taaatggttg aaataagcaa tttagatgat actacaatcc accagctatc tctccagaaa 1140 tatagtggta tatgaacaac ccttcgacta taatgtggag ccgtccataa aatgcgtatt 1200 cattttggag gggagcgggt ctgttcaaag gttacggttc gtataaatga aaatagaagt 1260 ttaaagagtt gatcataaga gaagggggat aaaataccaa aggaaggata aacaatgctg 1320 aaatggaagc tattagaaac ctgaccaacg ttttacaggt aaaataaatt tgtaatgaac 1380 ataaaaaaca attgattatt tttttttgtt atcgaacttg ctttttcgag tgggtagcta 1440 aaactcatct tttcaaacat tacctcacat tcggccgttg agttaattta tacctatact 1500 tatataaata atgtacgacg tgatatgttt cttcattcaa gtgaaaaaca aagtgttaat 1560 tgcaaggtga aaatggataa ctctaagtgt gatttggtta gaagtctctt gtgtaacatt 1620 tgtggccttt acacgccaag acacaagcaa agaaatttca ctcaactggt aattgaaaag 1680 tacaagcagt actttaaaag ggatccacaa atcaacgaga atttttctcc aaatcatctt 1740 tgtgtttgct gccattcaat gctgctcaag aaagaacatc ggtcaatact tgcagtccct 1800 atgatatggt cagttcctga tgtagctcat atgtattgtt attcctgtct aatgccctca 1860 ttagtgggta gaaaatggtg ccaaaggagt gaaatccaat acccaaaaag gtcgttgtgc 1920 actcttccag aaaaagtctt accatctgac gggatggaaa gtgaggagga ggaaacttgt 1980 tgtcctgata accacgagga tgacttaccg gtaataaact cgacgcctac aatagcaaca 2040 agttcatcat atcatccaca accgtctacg tcagcgggtc atgctgataa tcccgacttg 2100 gttggtcaag ctagctttga tgatttaacg cgggatctaa atttgccttc aaacctttct 2160 gaattgttag gatccagatt gagggagaag aaactgttaa aaacaggtac aagaaccgca 2220 aataggtcga agttcgatga tttccgctca acgttcgatg aacacaacgg aataacatat 2280 ttaaggaatc ttcacgggct gtttgaactt ttcgaggtcg atcacgatcc aactgaatgg 2340 cgtcttttta ttgatgggag tgtatcgtct ttaaaagcct tgttactgca caatggaaat 2400 gtttacccca gtattccagt ggcttacagt aacatacata aagaaaaata taacgtgctg 2460 caagacattt tatatttgat taattaccct gcttttaaat ggaaaataat tgcagatttc 2520 aagctgatca atattctcat gggactgaaa agcggtaatc caaagttccc atgttttatt 2580 tgcctgtggg acaggaacaa tgatggtgat aaatattccg aatcagcatg gccactacgt 2640 ccttcttttg ataagtaccc aggcaataaa ccagatttat acagtgctat attagaacca 2700 ttagtcgccc cagaagacat ccttcatcca cctttgcaca ttaagcttgg acttgttaca 2760 cagctcttca aaaaaattgt tcaacaaaac tctaaagtga aggagcatct gaaggatctt 2820 ttcccaaaat taagtgatat gaaagttgag atgggcatat tcgatgggcc taaaattcga 2880 gatatattca aaagcactga ttttgataat atattgacaa ttgatgaaaa atctgcttac 2940 caaaatttga aacttgtttg tgctggtttt ctggggaaca atagagctaa caactacaaa 3000 gaacttataa gtaacatgat gacttgctat aaaaagctag acattaatgt tacattgaaa 3060 atgcattctt tgatttgtca ccttgcaaga tttccttcaa accttggaag tttttcagac 3120 gagcagggcg agcgagcaca tcaagatttt aaggacatcg aacaacggtt taaagggaaa 3180 aataacgtca atgcaatggg aacatattgt tggggattag ttcgacaaaa agatccagaa 3240 gaacataaaa gacaatcacc aaccaaaact aaatatttcg ttgttcaata tatgtagaaa 3300 aaaaggttaa taaaaatatc atgaaacata tattccattt tattattctt aatccgaaat 3360 ccgaaaaatg taaataattg taacaaaagt ataaaaatga gatgtacaaa aaataaagca 3420 gaagataatt tagctgcaaa taatctctac tgtactgtac ctcaaggaat agtagatctc 3480 cgtgaacaat aaattcgttt tgaaaatgta tatatttttg atacttgtaa acattctcaa 3540 attctcaata acaattgaaa ggcactcttt gtgaccaaat gaaatattgt cattgtaatg 3600 cattgtgctg ttcatatatg agaaatgaag caaagatgat atagccatca caatagcatc 3660 aaatttgata gaaataagta aaacgttcgt tagtctgtta gatcttaaat tcattatagt 3720 aaacaagaaa caaaataacc tgcaagggat cctccataac cacgtggaaa tatttttggt 3780 atttttcacc ccccttcccc ttgtgatcaa ctcttcaatt ttttattttt atttatatga 3840 gccgttactt ctgaacagac cccctcccct aaaataaaca cgcaatttat agacggctcc 3900 acgatatggt caaggggttg ttcataaact acgatatttc tgaagagata gctggtggat 3960 tgtagtatta ctaaaatcgc ttatttcaac tatttatatc atatattcca tcatttgcga 4020 tactttagaa atggggaagg tacgaaaaag gggaccccat gcaaatctta gtacaactat 4080 tcatattgaa tttcaagttt aaaacataat atggcacatg aattccgctg tcccgaaaat 4140 aaaacactgt gtcgggttgt tgattagatt agatattgaa cccctctacc ggcagcttca 4200 tttttaacag caaaaaaaaa atattcaaat cgcgataact tttttgtttc tagatatttt 4260 tgctccattt tttcacaaaa tctcaaaaag ctcttctagt ttaagaaacc gtgctaatat 4320 taatcattgg tgatctggta ttgaagatat tccgatgttc cttgggggac cgacattttc 4380 catataaaat gtctttggcg gccattttgt ttttggtcaa tttatcaaaa aaataaaatg 4440 tgtactatac aatatcaggt aataaggagc tactctgaaa aaatcataca aatcggttca 4500 gttttcttgg agaaatctga aaattacgat ataaggtttt aaggagtttt caagatcttt 4560 aattgtctag gatagtacca gaaacataaa tgctcattac tcaaagacgg ctgcaccaaa 4620 ttgcttcatt ttttcacagc atattcgcct taatgtattc cgaggagtga atatccgaat 4680 gcgataaaaa atctgtagcc tgagatattt acgtgggtta tggctacgcg ccggtcccca 4740 aggaattttg gaatatctcc ggatccagat gaccaattat caatattgac acatattcga 4800 aaaccagaag agttttttga gaacttgtga aagaatggtg caaaaatatc gagaaacaaa 4860 aaagatatcg tgatttgaat attttttagc ggtaaaaaat gaagctgtcg gtagaggggt 4920 tcaataaaaa atgcaaaatg ttcaagtata ttttaaaaat gacctggggc agacacgaac 4980 attctgtaaa cgccattatt tttgtctttt ttcatacttt cggggtgtct cgtctctgtg 5040 cattcagatc gacttttttt gtcgaactgt g 5071 // ID Gypsy-19_DPu-I repbase; DNA; INV; 5339 BP. XX AC scaffold_613; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_DPu_; KW Gypsy-19_DPu-LTR; Gypsy-19_DPu-I. XX NM Gypsy-19_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5339 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 753-753 (2010). XX DR Genome; scaffold_613; Positions 9954 4616. XX CC Positions [4184-4648] - Integrase core CC 'TTGTG' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 593..4135 FT /product="Gypsy-19_DPu-I_1p" FT /translation="MATVDDALDAATAAAAAASAAAGRIASVEQNQQAMTQ FT QLTAITQQLQLLLQGGGGAGGSGGGSASGGAGGSSGGLPGAGGAGGGGAGG FT GGAGGGGVGGGGAGGAQQRRRIDPSCLDKLHGDASLSQLRTWRNRWNDFCQ FT LSQLSTYPSNEQMAAFRMVLDPAMQQIVEVALGIPSATPLSPTDVLDQINT FT YIRSKRNIALDRVAFEDCRQSTSETFDDFYIRLRGLAEAADLCVTCSDTRL FT TTRIMAGIRDSETRRKLLALSPFPTLQHTVNICRSEEAAKANEKSLSNAPV FT ISHMQTRHQGSARQTDTSRCGSCGRLPHRAGETCPAMGKQCHLCGANNHFS FT PCCPNSKKPKSEPADSGGASSSGGSSAGGGAGSSGGRYHDGSRSRAHMKRI FT VVGNVQASRRRRPAPTISIQLGHSTGQAATTIEKVTPDGGAEATVGGMDVL FT HALGFSEADLSSSTFDLVMAEKSTPLLVIGEKEFTATYEGTAASITITFSP FT DISGLLLSWYDCISLGILHDGYPRPKKNSRHGMQINSLQTANRRKAPYVYD FT GIVPLDPSPDDIQRIGNDIARQFDDVFDQTGSLNCMEGPEMIIELTDDATP FT FYVNGSRPLPFADRPAVKKLLDDYVEKKIICPVTEPSDWAAPLVVTRKSDG FT SLRICVDHTRLNRFVRRPTHPTRAPRDAVAEITGDAKFFSTFDAANGYYQI FT PLSPSSQHLTVFMTPWGRYKYLRAPMGLCSSSDEYNRRADLAFENVSNTVR FT VVDDLLRFDNSFPEHVAGVCTVLSAARKSGITFSLKKFQFARTQVQWVGFQ FT IQPGGVSVDPEKLRAISDFPKPTNITELRSFMGLVEQLAGFSTEVAAAKTP FT LRPLLSSRTPFLWTADHDHAFEAVKAALVAPPILAPFDPELETSLQVDASL FT KNGMGYALLQRHEDIWKLVDANSRWCTDTESRYAIVELELAAVEWAMRKCR FT LYLLGLPSFQLVVDHQALVTILDKYTLDAVENPKLQRLKERLSPFVFSTIW FT RKGRSHSIPDALSRAPVNDPSPDDEIANDNVQSFARRVVIRQVSSIQQATD FT EMEEDGIDVESHLSDPMLDELRDVAESDSDYVELMAAISTGFRTPRNQTAL FT GVRQYWSVREELSVDDGLVLFGRRIVIPRPARRELIKKLHAAHQGIVRMKR FT RARQTVFRPGMSNDITLWVESC" XX SQ Sequence 5339 BP; 1208 A; 1536 C; 1496 G; 1099 T; 0 other; tggcgcagtt ggactaatct acgatccctc agtgtgttca acgagtttca tactcgtcac 60 ctgacataat tattttcgtg tgcggtggag aaacgttgca gcgagttggt gtccatgccc 120 cacgcggcgg ccatcttggc atctcgtgtc tgtgcacctc caccggctct gcccgtacgg 180 gctgtggtaa ctatttatcc tacagtgtat attctacgtt aggcctgtaa ttcatcacgt 240 gtgcatggca gcgggccatt tcgatataca ttttactttt cctggggcgg tgactttcgc 300 ctctacgtac tctacggttt attgaggcgg agcggccgcc attttcgttt tcttgccgtc 360 cccgtgtgtg tgtggccaag tttttatggc gcagcgagtt tcggcatcac gtgatacgat 420 ctaaaccgag tgccacgtcg atcgagcgac atcgatctat agtgacaatc cgtaggctgc 480 aagcaggctt ttttccacat ccggacacac ttataatcta aggtcttcat cccgtgtaca 540 tctgccaact tcagacctgc ccgggtcgtc acggccaaga acagccggca caatggcgac 600 ggtggacgac gcattggacg cggcaacagc cgccgcagca gcagcgtcgg cggcagccgg 660 tcggatcgcc tcggtcgaac agaatcagca ggcaatgacg caacaattga ctgccattac 720 tcaacaatta cagctgctgc tacagggcgg cggtggcgct ggtggcagtg gtggaggttc 780 ggctagcggc ggcgctggag gcagtagcgg agggttgccc ggcgctggtg gcgctggtgg 840 cggaggcgca ggtggcggtg gcgctggtgg cggcggcgta ggcggcggcg gcgcaggagg 900 agcgcaacaa cgacgtagaa tcgacccatc ttgtttagac aagcttcacg gcgacgcgtc 960 gctctcccaa ctacgtacgt ggaggaatcg ttggaacgat ttctgccaat tgagtcagct 1020 gtctacgtac ccttccaacg aacaaatggc cgccttccgg atggtcctcg acccagccat 1080 gcagcagatt gtggaggtgg cgctcggaat tccttcagcg acaccattat caccgaccga 1140 cgtcctggac caaatcaaca cgtacatccg atcaaagcga aatatcgcac ttgatcgcgt 1200 cgctttcgag gactgtcgtc agagcacctc agaaactttc gacgatttct atattcgttt 1260 gcgtggattg gcagaagcag ccgacttatg tgtaacctgt tccgacactc ggctgacgac 1320 ccgcatcatg gcgggtatac gcgattccga gaccagacgg aagttgctcg cgctgagccc 1380 gttcccgacg ctgcaacaca ccgtcaacat atgtcgaagc gaggaggcag ccaaggccaa 1440 cgagaagtca ttgagcaacg cgccggtcat ctctcacatg caaactaggc accaagggtc 1500 agcacggcag accgacacca gcaggtgcgg gtcgtgcgga cgcctgccac accgggcggg 1560 tgagacatgt ccagcaatgg gcaaacagtg ccatctgtgc ggtgccaaca accatttttc 1620 tccatgctgc ccaaattcca aaaaaccgaa atctgagccg gcggacagcg gcggcgctag 1680 cagtagcggt ggcagttcgg ctggcggtgg cgctggtagt agcggcggtc gataccacga 1740 cggaagccgt tcacgagcgc atatgaagcg tatagtggtg ggcaatgtgc aagccagcag 1800 gcgacgacgc ccggcgccca ccatttccat acagctgggc cattcaaccg gacaggcagc 1860 caccactatt gagaaggtga ctccggacgg cggggcagag gcgactgttg gcggaatgga 1920 cgtgttgcac gcgctaggat tttctgaagc agacttgtcg tcatccactt tcgacctcgt 1980 catggcggag aaatccaccc cgttgctagt aataggggaa aaggagttta cggccacgta 2040 cgaagggacg gcagcaagca tcactatcac gttcagcccc gatatctcgg gacttttatt 2100 gtcgtggtac gactgcatca gcctaggaat tctccacgac ggatacccga ggccaaagaa 2160 gaattcacgt cacggtatgc agattaattc ccttcagaca gcaaaccgcc gcaaggcacc 2220 gtacgtttat gacggaattg ttcctctaga tccgtcacca gacgacattc agcgtattgg 2280 gaacgatatt gccagacagt tcgacgacgt cttcgaccaa acaggctcac tcaactgcat 2340 ggaaggaccg gagatgatta tcgagctgac ggacgacgcc acccctttct acgtcaacgg 2400 atccaggccc cttcctttcg ccgaccgccc ggctgtaaag aaattgctgg acgactacgt 2460 cgaaaagaaa atcatatgcc ctgtgaccga accatcggac tgggcggctc cacttgtcgt 2520 aacacgcaag tctgacggct cattacgcat atgcgtcgat catacgcgtc tcaaccgatt 2580 cgtgcgacgg cctactcacc ccactcgagc accgcgggac gcggtggcgg aaatcacggg 2640 tgacgcaaaa ttcttctcga cgtttgatgc ggccaacgga tattaccaga tccccctatc 2700 accctcgtcg cagcatctca ccgttttcat gacaccatgg ggcaggtaca aatatttgcg 2760 agctccgatg gggttgtgca gttccagcga tgaatacaac cggcgtgccg acctagcctt 2820 cgagaacgtc agcaacaccg tccgggtagt ggacgacctg ctccgctttg acaattcctt 2880 cccggaacac gtagcgggag tatgcacggt actgtcagcg gccaggaaat ccggtatcac 2940 cttcagcctg aaaaaatttc aatttgctcg cacgcaggtc cagtgggtcg gtttccaaat 3000 ccaaccggga ggcgtatccg tcgatccaga aaagttacgg gccatctcgg attttccgaa 3060 accaaccaac atcacggagc tgcgctcctt catggggctc gtcgagcaac tagccggatt 3120 ttctacagag gtggcagccg caaaaacacc gctgcgcccc cttctgagct ccaggacacc 3180 atttttatgg acagccgacc acgaccacgc cttcgaagcg gtcaaggctg ctctcgtggc 3240 gccacctatc ctagcgccat ttgatccgga gctggagacg tcgctgcagg tcgacgcttc 3300 tctgaagaac ggcatgggct atgcgctgct tcagcgtcac gaagacatct ggaaactcgt 3360 cgacgccaac tctcggtggt gcaccgacac agaatcgcgt tatgccattg tggagctgga 3420 actggcagcc gtcgaatggg cgatgcgtaa gtgccgcctc tatctccttg gattaccatc 3480 gtttcagctg gtcgtggacc atcaggcgtt ggtcactatc ctggacaaat acaccctgga 3540 cgcggtagaa aatcccaaac tccagcgttt aaaggagcgg ttgtccccat ttgttttttc 3600 gaccatatgg cggaaaggac ggagccactc cattccggac gccctttcca gggctcctgt 3660 gaacgacccg agccccgacg acgagattgc caacgacaac gtccagtctt ttgccagacg 3720 tgtggtaatc cgccaggtca gcagtataca acaggccacc gacgagatgg aggaggacgg 3780 catcgacgtg gagtctcacc tgtccgaccc catgctggac gaattgagag atgtagctga 3840 atcggattcc gattacgtcg agctcatggc ggcgatatca acgggattcc ggacaccacg 3900 caatcaaaca gccctgggcg ttcgtcaata ttggtctgtc cgcgaagagc tatcggtaga 3960 cgacggcctc gtcctcttcg gccgccgcat cgtcattcca cgtccggccc gccgggagtt 4020 aatcaaaaag ctccacgcgg cccatcaggg catcgtacga atgaagcgcc gggcgcgtca 4080 aaccgtcttt cggccgggca tgtcgaacga cataaccttg tgggtggaaa gctgctaggc 4140 atgtcaagag cgactcccgc atcaacagaa ggagcctcta atgcgcgacc ccctgccaac 4200 acgggtattc gaagatgttt cagccgacct ttttcaggtg ggctcgctac acgtcctcgt 4260 ctacgcggac cgcctttccg gatggcccat cgttcaccaa tggcgccacg acccgtctgc 4320 gcgggaagtt acacaggcca tcattgaaaa tttcgtcgac ctgggcgttc ccgtgcggct 4380 gcggtccgac aacggcccac agttcgaagc ccacagcttc cagaccaaat taagccaatg 4440 gggcgtagcg tggggcagct ctacgccaca ctacccccaa agcaacggcc acgcggaagc 4500 agcagttgca gcgatgaagg acctagtgac aaaaatttca tcccacggcg atatcacatc 4560 ggacgagttc gcccgcggaa tgttggaatt ccgaaacacc ccgagagaaa acggcaagtc 4620 accggcggaa atggtttttg gacatccgtt acggtccatt attccagcgc accggacggc 4680 atacgccact cattggcggt cagccatgga aggccgtgac cggcaaacag cgatcgacgc 4740 ggaggtaaag ttccgctacg acgagcacgc tcgcccactc gccccgctat cattgggcac 4800 ccacgttcgg gtgcccacgt tcgagagcca aagacaaagc tctgggacaa ggtcggcgtc 4860 gtggtcagca ttggtcgtta tcgatcctac cggataaaat tcgcgagtgg gagcgtgttg 4920 tggcgtaacc gaagattcct ccgcccgatg gtagctgttc cggagacaaa cgagcagaca 4980 gcatttcacg aaggcgatgg cgcgggtgac agcacggacg ttcagcacgc ggcagccggg 5040 cagacggacg gcgaacaaca ggacaacttc ccgggcgcgt ctcttcccct gggggccgcc 5100 agcagttcgg tcgatacatc agcagacgct ggacggcagc ccgtacgtcg gagcgaacgt 5160 gtacgaaaga aaagagtgat atttaacgtg taatgtactg gctgaatttc cattcatgtg 5220 atgtgtagcg tactggctga attttcattc atgtagtttt ttttttgaag gggggagcat 5280 catattcttg tgttcagctg tttatctgcg cttgtgcgct ataacagctc gaggagggt 5339 // ID BEL-222_AA-LTR repbase; DNA; INV; 552 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-222_AA_; KW BEL-222_AA-I; BEL-222_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-552 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 902-902 (2011). XX DR [2] (Consensus) XX SQ Sequence 552 BP; 210 A; 110 C; 102 G; 128 T; 2 other; tgtgttacga ttaagcttaa ataactaaat tatcatctaa acatagaaca tcaattatca 60 actaaactgt tatacacaat cwttaatcaa atggaacaac cctaaaattg tacaacacat 120 acacattaag taactatact gataacacga acaaaaccga agatcgatta cctatgtaca 180 acacttaggc kacaagataa acattaccgg ccagtcagtc agcaatcgca tagcaagcaa 240 taaagaccgc aaccaccacg cgcgcgagtt tctaaaacga acatataaag tgtactcaca 300 tccatatcaa taaatcagtt tcttcaataa agtgtgaagt tttagagaag tttggagtgc 360 tttgaaggat taaggacaag gaattgggaa aggaaggaca aggactttgg aggatcaata 420 gttgtttgga caactttccg gtttgttgcg gaggaaattg ggctgcaaaa gaaactccca 480 caacagaggt caaagggtca acgttgggtc gaaccacaga aatcccaggt aattcgctca 540 acccacgaat ca 552 // ID R1D_NVi repbase; DNA; INV; 7035 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia DE vitripennis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1D_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7035 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 784..1146 FT /product="R1D_NVi_1p" FT /note="broken." FT /translation="RRNYLARSPRRLVAAGNCKRVSASGTRLLGVDGAIVG FT LSDLVFFSSFFSPPRVACSLHLGYCQGVGGIFTLGTFRDRKQCVWNQKAMR FT VESKSNACGIKKQCVWNLKAMRVESNQKAMRVS" FT CDS 2589..6122 FT /product="R1D_NVi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="KLKITMPGPQKKNVAEVQLEERSPTTPGNPLVAESGR FT LLTGAREDLEHPSVRTWPVRGGGGLRRLGEGEVGLRAMSVRLVRDERIEKM FT ARKTVDVEEPGSKNEIRFLQINAGGGQIVNAEICELIASKKIDIVLAQEPY FT SKVNKGSRYFTGLRRASRAICLKSAGTKTAPKAFVAVPNPDLHAFFVSALS FT TQHCVVAEVHTPSVTFFAVSMYFQFCDDIEVHLGQLEKVLENLRGQKVVIG FT IDANAESSLWSPRGTNEKGEKLERLIAAFGLHVVNDRTQPPTFEERGVSSY FT IDVTLVSGSMIAEVQSWKVKRDWTSSDHNAIVFKITTVAQTDRVDSSRFNI FT RRADWGLLDSTIKELSVSHLDHIVLDSAEEVEQMADALQKVLYEACETAIP FT RRRRIRKNNPWWTRELTDKKSELYSARRNMQQQWSLPGHSLRKAEYRALLR FT DYCRSVKGAKVGSWQEVVTVRGNEEPWGVVYKQLRGKLQNERTLSSVRCGD FT TESMSMLETANRLLEVHVPDDTPSNETPEQAQIRESINSPPETEDAAPFEQ FT WEIALILASLKNNKAPGFDLLEVRVLKAAIKAIPHHFLRLFNACLEHGVFP FT RAWKQASLIFLPKGGKDSSDSKSYRPISLLPVTGKLYERLVKRRLSDTALG FT PDMISDRQFGFRAGMSTEDAIIELRRLTAASPKKQVAALLFDVKGAFDCIW FT RPAILQSLKEKDCPKNIYKLLVSYFENRQAQVVWGTNQVSKQATRGCPQGS FT VLGPSGWNLGFDPLLRSLEQGVVTEGGGKLPINFVAYADDLAVLVEGDSRA FT EIEKVGKAVVKHIVERCSAIKLEVSESKTVGIFVKKPKVVGSKAVKINRKD FT CRKGGARNPKIELGGKSISFEQSVRYLGVHFDANLGISAHCKYLREKLVPL FT FSDLRKLAQCQWGLGHKALETIYKGVFVPTVCYASAAWYKEGAHTDRILED FT LHRQILIAITRCYRSTSYEAACVLAGTLPICIQLRVSVAKYHLRKGEDAEI FT GGVVIRHDPEGLKENYNRVLEVANEMWQARWEASEQGNATRELFFPDVVAR FT VKSDWIRPDHFTSQVLTGHGYFNEKLHQLSLAKAAACICCGEPDNNLHFLL FT ECPAFAEFRDELITPVSGGLEAPEATLMLVSSPEGFAALKEYSRVAFECKR FT QLENALTESDEGLSSESEE" XX SQ Sequence 7035 BP; 1819 A; 1741 C; 1867 G; 1608 T; 0 other; ccggttggtt gaagtactca aggcaacagt tcttgttgca gggtatggat ccttctggaa 60 gtgccagtga tacagcgtct acgtcgcgct aggtatcaac atctctccca caagttgggg 120 ggccctttct aaagtgaaga ttgggccgcg agatggccaa tagtatctcg gctaagttcg 180 aacgcgcaaa gatggggcag tggaatcacc ttatgagaac aggacaagat tcccctgtat 240 gtgtcagagc tgcaggctcg gtattgtata gcataaagga ttacttggtg gtattagcaa 300 atacgaattt ccgggagcgc tgagggttct cccggggaag cgctccctaa aagaagtgtg 360 gaaggaaaaa tttttcctgc agaagcactc tcgactcttg atgatcggta gatctgttgt 420 tatgccgtat cgagcttgta tgcctgggtc aagggcccga ctcaggttgc gagcccggtg 480 gcccaattgt acacgaaccc gcccgttcga acacctcaac cgttgatata cgtgagcctc 540 ctttgggtaa gtgctacgac aatcagcggt tttcgaaagc gaatctgcct ctggcagccc 600 aaactcccgt gcgcctcttg aggcgcacgt attcccttcc ttccttgcgc tcctaaacgc 660 ccgcgtaact ccagtgagta acgtgtggtt tcttctaggc cgggtaggaa tttattaggt 720 gataggtttt agttaggata gaattagctc tggcactctg tgaccactta cccccgtggg 780 taacgccgta attatctcgc gcggtcgcct cgtcggctcg tggcggctgg gaactgcaag 840 cgcgtaagcg cgagcggaac tcggctgctg ggggtcgatg gcgcgattgt aggcctctcc 900 gatctagtct tcttttcttc cttcttttct ccacctcgcg tggcttgctc tctgcacctt 960 ggctattgtc agggtgtggg gggcattttc acgttgggga cttttcgaga tagaaagcaa 1020 tgcgtgtgga atcaaaaagc aatgcgtgtg gaatcaaaaa gcaatgcgtg tggaataaaa 1080 aagcaatgcg tgtggaatct aaaagcaatg cgtgtggaat caaatcaaaa agcaatgcgt 1140 gtcagttgat acttagtctt agattcactt gcgtcagtaa gacgagccga ccattagggt 1200 cggcaagtca cgttaggaac ttttccaaat cccgaaaagc aatgcgtgtc gattgacaaa 1260 atagatttta gcttcactag catccataaa acgagccggc cattagggcc ggcaagtcac 1320 gttaaggact ttttcaaatt tttcatttga gaaagcaatg cgtatcaact gacaactaaa 1380 tcatagccgg ccattagggc cggcaagtca cgttagggac tttttcatat tttccacgcg 1440 ccccaagagt agaatcaccc cgcgaatata aactcaactt cctctctctc ttttctatac 1500 tctcacacac acatacgcac ccacaacaaa cacaaaaaca cgatgcaggc cgcgcatctg 1560 ggccagtttc gctgtcacag cgttagggtg agaccaggta actttgccgg catcaatcac 1620 gaacacacac acactaacac ttttcgatgc aggccgcgca cttgggccag tttcgctgtc 1680 acagcgttag ggtgagacca agtatctttg ccggcataaa ataactcctt ttcttaacta 1740 agacaacttc tcttctcttc tcctctaaac ttagctaaac caggcacgtg agagagcgct 1800 ctcctgccac ctgcgcactc taatattaat tcgaaaaggg attcgcgatc ctctagctca 1860 aggcggcctg agaaagccag gttcgacgcc tgtttccccg tgacaggcac ctagaactct 1920 tagcaactaa tgtgtcttcc gtaagtccgc gtgcgtgacc ccataggtac cgctatgttg 1980 cactgaaaat catctcgcga aataagagct cgtcgtgctc atgcgttgcg ttgtcttccc 2040 taagttatct agcttgggcc gatggagggc tagccttctc ggtcttacag tgcaactagc 2100 aaaggcttac atagcaatct caggtactct ttaagttgaa aactctagtc gcatctggcc 2160 tgtcacggga agaaatagaa tacatgaatt aaagccacgc gccgcgtatc gtgcagggat 2220 gggtaaacag tctgcgcgga ccatgtaggt tcacgacttc acagtcgcta ggaggcggag 2280 ccgtaagccc cgtgtaaaac cagaggcacc tcctgggccg cgtgatagtt tgggagcccg 2340 ggccgtcagt tagccaggcg gttaggctgc tccgtaacag cacgtgtaaa agcttcgtcg 2400 ccggagcact tggttggcta cctatgggga tgtgtaacgg cacggagtgc tgccgcggct 2460 gctgaaagtt ccgctctctc ttttgttcgc gctttttggc gcatggtaac cgtgcaggcg 2520 tccacccagt gggaaacaaa ctcgtcctaa attaaggctc taatacatat acttattcct 2580 gcaggtaaaa actcaaaatc acaatgccag gtccacaaaa gaaaaacgtt gctgaggtgc 2640 agctagagga gaggtcccca accactccag ggaatcccct cgtcgccgag tcgggccgtc 2700 ttctgaccgg tgcgagggaa gaccttgagc atccctcagt caggacctgg ccggttaggg 2760 gcggtggcgg gctacgacgg cttggggaag gtgaagtcgg cctccgagcc atgtctgtta 2820 gattagttcg tgacgagcgc atcgaaaaga tggcacggaa aacagtagat gtcgaggagc 2880 cgggctccaa aaacgagatt aggttcttgc aaataaacgc gggaggaggg cagatagtaa 2940 acgccgaaat ttgcgaatta atagcgtcaa aaaagataga catagttctg gctcaggagc 3000 cgtactcgaa agttaataaa gggtcgcgtt atttcacagg ccttagacgt gcaagtcggg 3060 caatatgtct aaagagcgca ggcacaaaga cggctcctaa agccttcgta gccgtgccaa 3120 accccgactt gcacgccttc ttcgtctcag cgttaagcac ccaacactgc gttgttgccg 3180 aggtgcatac gcctagcgtc acgtttttcg ccgtttcaat gtactttcaa ttctgcgacg 3240 atattgaagt acacctcggg caactagaga aagtattaga gaacctcaga ggtcaaaagg 3300 tagtaatagg cattgacgca aacgcggaat cctcgctttg gtcccctcgt gggacaaacg 3360 agaaaggaga gaagctcgag cgactaatcg cggctttcgg cctccacgta gtaaacgaca 3420 gaacccaacc tccgaccttc gaagagaggg gagtttcgtc ctacatcgac gtgactctcg 3480 tctcggggtc catgatcgcg gaggtacagt cctggaaagt gaaacgggac tggacttcca 3540 gcgaccataa cgcgatagtc tttaaaatca ctaccgtagc ccaaacggat cgagtggact 3600 ccagccgatt caacatcaga cgagctgact ggggcctgct cgactctacg atcaaggagt 3660 tgtctgtttc ccaccttgac cacattgtct tggatagtgc agaggaggtc gagcaaatgg 3720 ccgatgctct ccagaaagtc ctgtacgaag cgtgcgaaac cgccataccg cgtaggcgcc 3780 gtatccgaaa aaataacccc tggtggactc gagaacttac cgacaaaaag tccgagctct 3840 acagcgctag gcgtaatatg cagcaacagt ggagcctccc tggacacagt ttgcgaaaag 3900 cagaatatcg ggctctcttg cgcgattact gccgatcggt gaaaggggcc aaggtcggca 3960 gctggcaaga agtcgtcaca gtgcgcggaa atgaggagcc atggggggta gtctataagc 4020 agctcagagg caagctgcaa aacgaaagaa ccctcagttc cgttcggtgt ggggatacgg 4080 agtcaatgtc gatgttggag acggccaacc gtctgcttga ggtgcacgtt ccagacgata 4140 caccttccaa cgaaaccccc gagcaggcac agattagaga atcaataaac tcaccgcccg 4200 agaccgaaga tgccgcaccc ttcgagcaat gggagatagc tctcattctc gcatccctca 4260 aaaacaacaa agctcccggc ttcgaccttc ttgaagtcag agtcctaaag gctgccatca 4320 aagccattcc tcatcacttc ctgcggctct tcaacgcctg cctagagcac ggcgtcttcc 4380 cccgagcctg gaaacaggct tccctcattt tcctcccaaa aggaggcaaa gatagtagcg 4440 actcgaaatc gtaccgaccc atcagtctcc tcccggttac aggtaaactc tacgagcggt 4500 tagtaaaaag gagactatcc gatacagcgc taggaccaga catgatctcc gacaggcagt 4560 tcggcttcag ggctggcatg tctactgaag acgcgatcat cgagctgcgc agacttacag 4620 ccgcttcccc taaaaagcag gttgctgcgc ttcttttcga tgttaaaggc gcctttgact 4680 gcatttggcg cccggccatc ctccaaagcc tcaaagagaa agattgtccc aaaaatatat 4740 acaaacttct cgttagctac tttgaaaata ggcaggctca ggtagtttgg gggacaaatc 4800 aagtctccaa gcaggcaact aggggctgtc cgcagggttc ggttttagga ccctcgggct 4860 ggaaccttgg attcgatccg ctgctccgca gcctcgagca aggtgtagtg acagagggag 4920 gcggaaaact cccaataaac ttcgtcgcat atgcggatga cttggccgta ctagtcgaag 4980 gggactctag ggcggaaata gaaaaagtag gaaaggcggt cgtaaagcat atcgtcgaaa 5040 gatgctcggc tataaaattg gaggtttcgg aatctaagac ggtagggatt ttcgttaaaa 5100 aacctaaggt agtaggttca aaagcggtaa aaataaaccg gaaagactgc cgtaagggag 5160 gagcgcgaaa tccgaaaata gagttgggcg ggaaatcgat cagttttgaa cagtcggtac 5220 gttatcttgg cgtgcatttc gatgcaaact tgggcattag cgcccactgc aaatatctta 5280 gggaaaagtt agtaccgctc tttagcgact tgcgtaaact ggcacaatgc cagtggggtc 5340 tgggacacaa ggcgttggag acgatataca agggtgtatt cgtcccaacg gtatgttacg 5400 cgtccgcggc gtggtacaag gagggagcgc atacagatag gattctcgaa gatctgcaca 5460 ggcagatcct catagctatt acacgatgtt accgatcgac atcttacgag gccgcgtgcg 5520 tactagcggg aactctcccg atatgtatcc aacttagggt tagcgtggcg aagtatcacc 5580 tgagaaaagg tgaagacgcg gagataggcg gcgtcgtaat tagacacgac ccagagggtc 5640 taaaggaaaa ttataatagg gttcttgagg tcgcgaatga gatgtggcag gcgcgttggg 5700 aggcatcgga acagggcaat gctactcgcg aacttttctt tccagacgta gttgccagag 5760 ttaaaagcga ctggatccgt ccagatcact tcacctcgca ggttctcacg ggtcacgggt 5820 actttaatga gaaactccac cagctctctt tggcaaaggc agcggcttgt atctgttgcg 5880 gcgaacccga caacaactta cacttccttt tagaatgccc tgccttcgct gaattccgcg 5940 atgagctgat aactccggtt tcgggtgggc ttgaggcgcc ggaagccacg ctaatgttag 6000 tatcttcccc agaagggttc gcggctttga aagaatacag tagagtagcg ttcgaatgta 6060 agaggcaatt ggaaaatgcc cttacggaat ccgacgaagg attaagcagt gagagtgagg 6120 aatagagtga ctgaggtggt gggtgaaagg aagagctggt gaaaaatgtc taagttaggc 6180 aaacaaacgc gaagtcaaaa gcatgcttgg cttggccctc gcgaaagtcg ccttagggct 6240 tgactcgcga aataaactcg ttcgaaactt ggccatcgtc cgcgtctcac gcgtaaggcc 6300 ccgtggaagg tcgctatatg cttgacacgc gaatgcatgc tcgatcgtaa aaatttggcc 6360 gtcgaccgaa ttgaccatct tgggtctacc aaagataaca gtaaattcaa gaataaattt 6420 ggtgtgcttg tgcgttactg caactacaaa tcacgcctct cgaacgagga gtgtagttag 6480 ggcctttaga accttcgcct aaaacatcgt ggaggagatg ttgaaggagt cctgtcccca 6540 agcattgttc gctggcgaca agggctctgg ctgaaggatt agcacagagc cgctcgtttt 6600 gcagaggctc aaagcaggac ttttgtcctg tccccgagcg cacctttcga agaaggtgcg 6660 cccggcatta ttattttatt tactaacaac atgtatttct cctaacaggt acaaacaaaa 6720 ttagtggaga tccaggcggg atctcgcgaa tgcgccttcc cgtggttccc cgtggacggt 6780 ccggtggatg gtagagcttg ctcaccatcc cgctatgact gactaaagca ttcgtcccag 6840 ttgactgatt gtccccgcac ggccatcctc ggaagaccgg gcgggtacaa tctgttgatc 6900 gccaatgggc acttgaattt ttccaggaac gttctccttt cgggtggttc gatagatggt 6960 ggacggaaaa caaggtcgcg tatgcttatg gcgaggtagc gagtccaaat aacatcaggg 7020 ctaaccgaaa ctaaa 7035 // ID Academ-1_Lgigantea repbase; DNA; INV; 6437 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-1_Lgigantea. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6437 BP; 2159 A; 1136 C; 1182 G; 1960 T; 0 other; tagagcggga gctaggtgga gtttaatcat aaagttgagc ccaaaaggtt tgatacctcc 60 attgtgaaga cattactata ggacttcaaa atcaagaatt cttaaggtcc ccgctttgtg 120 cgttacctag gcacagctcc tcaaagatcc cacaaaaaca cgtttaatca aaaaattgag 180 ccatggatgt ttttttcata aaaagtaact atgaagtgtg ctgatcacaa aactacatca 240 taagaacact tcaggtggtg caaaaggtgc ttaagaacac ctttaattag ggctaattag 300 cgttttaatc aaacttttga tccaatgaaa acttcaatgc caacatctta attatccagt 360 ttttcaaaac acatgtagta aaaagtttta atttggttta attactgcac aaatagacca 420 cagataggcc ctactttatt tccgatgttt atgacagtga tctcccttga acatcaagcc 480 ttatgtttta gtttgtttac attcttaaag agaaacttaa aactaatcta atcaataatt 540 ttagaaatta gtaactctta aatttgtgca tgtactattt tcttgaaaga attgttctat 600 ggataattcg gagagcatgg aaacaccatc agtaaaacta aaagactgta tattacacct 660 ggatagtgga aaatctgaca atttggagga attagtaaaa ctctaatcac accattggga 720 agtcattcaa caatccgcaa aatcaagatc aagtaaaaca aatttcaacc aatctaaata 780 ttttgatgtt attgcctcat tgccagtgga atataactct caaaaaggat atcacaaaaa 840 ctgttataaa aatttcacgg caatttcaca caaatctgta gattcagttg tatccaccca 900 aacattgtta cgttcaacca cagacagagc tgaatcaagt gggagtacag gggggtttga 960 aaaaaactgc atcttttgta attctaaacg aaagaagaaa ggtgataaat ttgagtatgt 1020 cggagcatgt caaaccagag atgctgaaat actaataaag aggtcagcaa atatcctgaa 1080 ggacaataaa ttacttacca aaatttcatc cattgatttt gttgcaaaag aagtccatta 1140 ccaccattca tgtcgaacta attatgtgaa agcagcaaga cgtattgaag acagagacag 1200 taatactgac gaaaagaagc gtgaaagtga attactgata gctatatata aatatgtaaa 1260 tgatttcatt ataaaaaaga aaagatctga attactaacg tccgtatatt cccaatactt 1320 aacattttgc gaggaatgtg acgaaatacc agtcagtagt gcacaatatt tgatgaaaat 1380 tcttttaaag aaatttgaca aacaattaaa atcacactct gatgcaggca agaaacaggg 1440 ggctatattg tacaatgccg aaatatatca ccagaagcag ttaggaaagc ttacgatttt 1500 tgtgcaacat ctgaaaatat tattgtgaaa gcggcattgg ttcttcgaaa gctcgtgaaa 1560 agtatagaga aatcagaatt accggacgat tcgacaatag agacattgca gaaaggggag 1620 gcaaaccccc agaccagaat tagtcaatat aaattcttcc gtacactttt tggaggacct 1680 atctacgata accattccac taaaactatc agaagggcag ggtcatcaag ccaagacgct 1740 ttgttcattg tttcaaatgg aactataaat ccagctaaac atataagtct tggtggtgct 1800 atcaagtccc tgacagggac taagaagatg cttacactac taaacagatt cggacactgc 1860 tccaattata atttcataga agaattggaa actatggttg gtgaaaaaat ccagaatcag 1920 aacacttgtt tacctgctga ttgcaagaaa aatgtacctt tcggtgttgc ttttgacaat 1980 tttgatgaat tgtgcgagac gctatcagga tcaaatacgt tgcatgatac tatgggtatt 2040 atgtatcagg ctgagagtat ctcttctgag tgcatgaatg tacctgcatc aacaattggt 2100 aaaggctcaa acagtaggaa aagaaaactg gatgtcactg aatttaacct gcagccttac 2160 aggaaaaaac caagaatgac tatttttgaa taccaatgta gagatgtatg gaaatccatg 2220 gataccaaaa ggcgtaaagg acatctggat ttcacatgga tggtttgcta tggtcttatg 2280 acaacaaata ttcctttgtg ggttggtttc aattctcaat tctacgtaga cgagttgcag 2340 aaacaaaggg taatgtacat gaagaatttg caacaaccaa taactcagct tgatgtaatt 2400 caacatacac taaaaattac tcaacaatgt gcagaagaga ttggacagag ctatggcata 2460 gtaagctatg acctgaatgc tgcaaaaccg gccatgcaaa tccaggctac agaaacgcca 2520 ctatatgaca atattttcat catgcccgga gtgttccata ttgaaatggc tatgtttaag 2580 gctattggca aagttattga tggctcagga ggtcctgcta tgctcacaga atctttagta 2640 gtatcatcag gttctttgaa tggcttttta tctggtacac attttaatag atgtaaaaga 2700 cttcatccgc ttttagcagt atccctagaa actttgtttt ttcaaaagtg tttggatcaa 2760 tacccacagg ctgaggattt tcgagacttt atagcaagca ttgatggaat gaccggatca 2820 gatattgtgg ccaaatgtga aggttccagt gtttttaatt tggcaatggc attctttaat 2880 cagttcaaaa cagatacttt ggctggctaa catgtggcta cttctcaatt ttggataatg 2940 tatgtaactt tcatacaaaa tttccatttg ttggagagag ccattcgaac aaatgatgtg 3000 gaactttatt ctttccctct aactccaatt attgatctct tttttgccac aaaccatgtc 3060 aattacgcaa gatggttaac aaaatttcaa ttagatttat tgaacattga caacactcat 3120 ccaggcttat ctgatttact caactctggc gtgtttacag taagaagaac agatcatagt 3180 ttcagcagaa ttctaacttt ggaacagtca atcaatgcta atgccgcttc acgtcaaact 3240 ggattgtctc atatgacaaa caactttgga gctcgacaaa gatgggtggt aaccaggccc 3300 tttagggctt taatggtcag ttctatttta cagatggctg gaatgtctaa tcccgagaat 3360 gcttcaaatg aactaaaacc tactcgaata atgcgtgatc acgaggatta ttctaaaatt 3420 gttaaacaaa taaaaaattc ttgtgatcct tttatcttac cagaagatac agatgactct 3480 cttattaaca ttagtactgg taaaagtgtg aaattagaaa taaaaaacaa cctgttaaca 3540 ataacagaga gaggagcaga attgcataag caattcatcg aagattgtat tgctgatcca 3600 aacagatttg agaaacctat taaaagacag aaacttctta ctttcagtga tggctccatc 3660 aggaatacca aaagcacaag taacaaggta gctagtctga aatgcactcg ggatttgatg 3720 gggcaattgt tattgattgc tgtcaataga aaattagatt tggaatacgt attgacatat 3780 ccactgactc cagtaccatt atccatgtgc aactatgatg ggacactagc aaaaaccgac 3840 aaatctacac tcttcagaga attagagaaa atgattaatg gtgaaactct aactactcct 3900 tctcttgacg catatgtgat tgacggtaat ttcttgttgc atcttttacc caataaaata 3960 tcgcctacat acggaggtct ggcctctacc atcttgataa tggctacatc aaccacttca 4020 aaaagagttg atctcctctt tgacacctat aaagaaccat caataaaaga ttgtgagagg 4080 caaagaagag gggctgaaga acaggaattt gtaattacta gtccagatca acttcgtcct 4140 cgtaatttga gtgatgcact taaatctaca tcatttaaaa agaatcttcc aagctttttg 4200 attgaggaat ggaagaagtt acactatagt accatcatta aagattgcca tctatatatc 4260 ggtcatctta accaatgtca tcattttttc gttgaaaacc acgaagtccg tcatgagtgc 4320 attgctacaa tggagtcaaa tcatgaggaa gcagatacaa tgatatgttt tcatgccaag 4380 gttattgatg aaacggaaca cccaggacac attgtggtac gtgcatcaga cactgacata 4440 gcagttattt tgattcacca ttcccacaaa attgcagcca ctgtttggat ggacgttgga 4500 acttgctcga gaaatagtcg tcgatatata aatataacgc aaatcgccac taccctaggt 4560 acttctatgt gtgcggcttt actagctttt catgtcttta ctgggtctga tttcacttca 4620 gcctttgcca gaaaaggaaa agtcagacca ctatcttttg aaaaatgata aaaaggctca 4680 acactcattt tctcttttaa ccaaagaatg tcctatttct gaaagtgtca tttcagaatt 4740 agaaatgttc ttaagtaaga tttacggtgc aaaaaaatct gaaacatctt tgaacagcta 4800 caggtataaa atgcttcaaa gggggttttc tccgaaaaga gaggataaac catttgcaaa 4860 aatcaaagga ataaatgcca gtgacattcc accatgccaa tttgagttgt caccccatat 4920 cagacgtgta ggatttgtta ctaggatgtg gctgatactg cttttattac atgtgaacct 4980 tctgcagcag atggttggta tctaaaagac agtcattacc atataaattg gtttgatggt 5040 tgtcagatgc ccgaaagact tatcccagac gatatggatg aaactgtgga atcagatgat 5100 gaattaatat tagcagcagg atctagtgat gaaagtgatg cggatgaata actctggtgt 5160 tgtgaatgaa gtaaaaacat gaacatttgc actgttttat caatccaaag aatggatctt 5220 catatttatt ttctctacgg tttttgtata ataatttcta gttttacttt tggaaacttt 5280 catgtaacta actgatagaa ccatatattt aaggttccta aatcactttc agaatgaaaa 5340 ttcaaaattc aaaatccaaa agtgatatag gaagccaaaa tttggctctt aacagatgag 5400 aaactaaaaa aatgtagagg actatctcca ttaaacagcg atgatgtgat catattttaa 5460 tgttaaagag ttattatctc ttagagtgaa catgaaattg aaattcttaa acttggtcaa 5520 tacgagtata ttagtattat cagtgtggac ttacctgtgt cataacttta caacagaatc 5580 tctctataat cttatatatc cattaaagta tattagtgta tagttatgac accgggtaaa 5640 tccactgctg aatctatagt caagtttgga caagtttgac aattgtaatt ttctgttcat 5700 tcatccacct gtttgaactt tgacctgttc ttaagcacct tttgcaccga ccaaactgtt 5760 cttacgaatg tgttttgaaa tcctgacgac tgggctgata ctttggcatt gaaatttttc 5820 ttgggctcaa aagtttgatt aaaacgctaa ttaaaggtgt tcttaagcac cttttgcacc 5880 acctgaattg ttcttacgaa tgttttctga tcatgatatt tgctattcta cctaataaaa 5940 ttgaaatgta tgatgggctt tgaagtttga ttaacaggta attttggcca aattaccacc 6000 tgtttgaact ttgaactgtt cttaagcacc ttttgcaccg accaaactgt tcttacgatt 6060 gtgtttgaaa tcctgacgac tgggttgata ctttggcatt ggaatttttc ttgggctcaa 6120 aagtttgatt aaaacgctac ttagccctaa ttaaaggtgt tcttaagcac cttttgcacc 6180 acctgaagtg ttcttatgat gtagttttat gatcagcaca cttcatagtt actttttatg 6240 aaaaaaacat cgatgggctc gatattttga ttaaacgtgt ttttggggaa tctttgagga 6300 gctgtgccta ggtaacgcac caagcgggga ccttaagaat tcttgatttt gaagtcctgt 6360 agtaatgtct tcacaatgga ggtatcaaac cttttgggct caactttatg atttaactcc 6420 acctagctcc cggtcta 6437 // ID BEL-94_CQ-LTR repbase; DNA; INV; 655 BP. XX AC AAWU01007086; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-94_CQ_; KW BEL-94_CQ-I; BEL-94_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-655 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 316-316 (2011). XX DR GenBank; AAWU01007086; Positions 16543 15889. XX SQ Sequence 655 BP; 193 A; 134 C; 142 G; 186 T; 0 other; tgttgccgac cctgtagagc tgtgccaggc aacctcaaac tgtacaatct tggaaagtag 60 atcgagcccg gttgacaggt ctgacagcac tgtcaacgaa ccggaaggag tttttttttg 120 tctttttcgg ctaaagctgg agcaagcagc tcgcctcgtt ttttgcttag ccacaaaatt 180 agttaaagtg cgtattgcgc gaggaaaata attcagaaat ttgtaatagt tagcagaact 240 acacttagcg cgtaagtgga catattttta cttgttcaaa catgtgtaaa acgttgaact 300 ttctgtagga aaccacacag cccacccaag cctactttga ggttagaaaa cgccatcgaa 360 tccaacggtc gaagcaagaa ttgtttaatt cgattgtaag ttgttgatta ttagctaaat 420 aagcgcaagt aacgttaacc tttgttccca aggaccggac gttcggtgaa ggagatagtt 480 aggagtgttt cggatacacc gtttaaccga gggaaccata atcgtaagat tcgccacctt 540 gtttgaatta ccgttactta tttgaaatgc atttcagttt cgagctgaac caaacggaca 600 cgctacgaag acggtttctc gagtattttc tccgaaaata aacacttttc taaca 655 // ID Gypsy-208_AA-I repbase; DNA; INV; 6360 BP. XX AC AAGE02029057; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-208_AA_; KW Gypsy-208_AA-LTR; Gypsy-208_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6360 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029057; Positions 69345 75704. XX CC Positions [3610-4032] - Reverse transcriptase CC Positions [5167-5637] - Integrase core CC 'ATGAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 93..1058 FT /product="Gypsy-208_AA-I_2p" FT /translation="MDCVRKNTLVVDFSVLPVRPDVAKVEQFLENAIKLNL FT ADVKCIQLHNTRNCVYIQMKDHDTALQYENTHNIKRVFMCNDKPFKIPVYV FT DSEAVTVRVCDLPPSMLHATVGEHMMRYGDVISIQNERWKNYFPGIYNGVR FT VLQMRLKQSIPSFITINNEIATVRHPNQIMTCRWCSKPAHPGQKCLEENAS FT KNLSISLTPTTSSSNEHKFSDADFPPINNKQSTEPSTKSVPPRDKRAPVEN FT EPCTKIVPTDQEQQLNSINNDDDDDDDVNDDSSSSPYECGDSTYKRRLSTK FT RGKEKKKLCAIQGLQGDCDSNTSVNINKKK" FT CDS 1981..6066 FT /product="Gypsy-208_AA-I_1p" FT /translation="MAQSINSILEPYRKGSSFGDWVERLGFLFNMNKIADN FT EKRDHFITLSGPIIFKELKLLYPNSNLAEVPYEEMVTKLKARLDKTESDLV FT QRLKFNVRVQQPDESVEDFVLSVKLQAEFCNYNCNAETFKKMAIRDRIVAG FT IRDKALQQRLLNEENLTLETAEKFIATWEIAKNNAKSMECGSSVDQIAALK FT PLGFTGARLSNLTAIAARENYGTRGESNRGSVKSRLGYYPYQKDQRQQKQW FT RNRGQDKERNRQYSRPDYSQMVCDFCGVKGHIKRKCFKLKNMHRDAVNMIN FT PNTSGTNPDEFLSEMVNRLRADSDSESDADGETNVLQCMHVSSINRISEPC FT LLKVEIDGILFDMEADCGSSVTVMSRKQYFDNFSKPLTKTRRNLIVVNGAS FT LVIEGEVDVLVNFKGICSNLKLLILNCENNFTPLLGRPWLDAFFPNWRHFF FT VNSISSNEQSNQILVDDIRTKFKDIFVKDFSSPIHGFEADLVLKSEVPVFK FT KAYDVPYRLRDKVVEYLAKLECEKVITPIKTSEWASPVIVVMKKNNEIRLV FT IDCKVSINKFIIPNTYPLPIAQDVFAGLAGCKIFCSLDLEGAYTQLSLSER FT SKDFMVINTIKGLYRYNRLPQGASSSASIFQQVMDQVLNGIENVSVYLDDV FT LIAGKNLNDCKKKLFLVLDRLKNANIKVNWDKCKFFVTELVHLGHVISGKG FT LMPCQDKISTIENARVPKNETELKSFLGLINYYHKFIPYLSLKLFPLYNLL FT KTEVKFNWDCKCDKAFEESKKALIEAQILEFYDPNKQIVIVSDASGYGLGG FT VMAHLIDGVEKPIYFTSFSLNAAQQKYPMLHLEALALVCTVKKFHKFIYGK FT KFLVYTDHKPLVGIFGKEGRNSIYATRLQRFVLELSIYDFEIQYRPSKYLG FT NADFCSRFPLDHAIPVELDTELVNSINFGRELPVDFKVIADRTKNDSFLQN FT IISFMTNGWPAKINKQYIDVYANQQDLEFVDECLLYQNRVVIPATMKNAIL FT KLLHANHAGIVKMKRLARQCVYWFGINSDIERYVTTCDTCNSMMIVPKTKI FT NSKWIPTTRPFSRIHIDFFYFEHRSYLLVVDSYSKWVEIELMRNGTDCDKV FT LKKLVALFARFGLPDVLVSDGGPPFNAHAFVNFLKRQGINVLKSPPYNPSS FT NGQAERLVRTVKDVLKKFLLDPEFSHLDMEDQINLFLINYRNNCLSIEGEY FT PSKMIFSYKPKTILDLLNPKLRYKKFLMEQAVHEDLIKSKNSTPIDEDFNS FT KAYGGSRTDSPDPFDSLTAGDELWYKNHNPHCTAKWLKANFTKRLSRNTFQ FT VRIGSVLTMAHRSQLRVRKSGNFHEKPNIRLVRPLPELEAVSQEEFRGFT" XX SQ Sequence 6360 BP; 2045 A; 1090 C; 1334 G; 1891 T; 0 other; actggcgacg tcagttgaaa gcctttgctc ttagcctacg gacgtgtttg ctattcgctc 60 gtgtgcttcg tacgttttgc tccggcggta aaatggattg tgttcgaaaa aacactctag 120 tagtagattt tagcgttcta ccagtgcgac ctgatgtcgc aaaagtcgag caatttctgg 180 agaacgcaat caagctaaat ctcgcggatg ttaagtgtat tcaattacac aatacccgca 240 attgtgttta catccagatg aaggatcacg acactgcatt gcaatatgaa aacacacaca 300 acattaaacg tgtttttatg tgcaatgata agccgttcaa gatccccgtt tatgtggata 360 gtgaggcggt aacagttcgt gtgtgtgatt tacctccttc catgctccac gcaacagtcg 420 gggaacacat gatgaggtat ggcgatgtca tctccatcca gaacgagcga tggaagaact 480 actttcctgg catttacaac ggtgtgcgag tgctacagat gaggctgaag caatccattc 540 cgtcgttcat aacgatcaac aacgaaattg ctactgttcg ccatcctaac caaattatga 600 cgtgtaggtg gtgctccaaa cctgcgcatc ctggacaaaa atgtttagaa gaaaatgctt 660 ctaaaaacct aagcatttca ttaaccccaa ccacatcatc gtcaaacgaa cataaattca 720 gcgacgctga ttttccccca atcaacaaca aacaatcgac cgagccatct accaaatcag 780 taccacctag agacaaacga gcgcctgttg aaaacgaacc ttgcacaaaa atcgtaccta 840 cggaccaaga acagcagttg aactcgatca acaatgacga cgacgacgac gatgatgtga 900 acgacgacag ttcatcctct ccatacgagt gcggcgatag tacctacaag cggaggctgt 960 caactaagcg agggaaggag aagaaaaaac tttgtgctat tcagggattg cagggtgact 1020 gcgattccaa tacctctgta aatattaaca aaaaaaaata agctattcgg ccccgtaaag 1080 cctacgtgcg tttgggcctg ccaaataaat agaattaaaa aaaaaaaaaa actggcgacg 1140 agaaaaaaaa cgtggaagtg cttagtgcag cgtgcggacg acatcggctg aacaatttat 1200 tgtgtgacga agccatcaaa cccggcgctg gtgatacgga aaagccaacg ccatcacgta 1260 gtgcgtagac aatcattgca aatcgttggc gggtgcgatt ttcggtcgac gactaactgc 1320 aactggtgga gttcgttaag gtgaaggttc gttaaacgag attcaatttg gtttggtaag 1380 tttttcgttg gcgaaatcga gttaccattt ttggtgattt tggtgagctg cgggttgtag 1440 tggcttgatt atacgccatt ttgaaaattg ctttgtgtct ttgttaatta cacagaacaa 1500 ataataatta gagaattgaa aagaaacaat tgaaaacaaa agcatcatag aattatttta 1560 ttaagagagt ttttttcttt tattccatta gacagtggcc gcggaaacaa ttggttgaca 1620 atttctgcca tcgccaatct agctgatttt tatcggtgaa ttgactacat catctgactg 1680 gtttgctaaa ccgttgtttg ctgaaattct ccgctatcgg cacaggttca agctggtttc 1740 aatccccaag acgttaactg acggctcgca cggttgccta ggggcaaaga tcgtgagaag 1800 tgaacctctt cttttatccc tggtcggcaa aggctgaagg acttatctgg tgagtgattt 1860 tatttctttt accttgaagg agtcagcaaa tgcttctatt ttaagaggac attatttttt 1920 ttagaaataa ttgtttcatc tgattagtgg taaccaattt cagtagtttc aacacctaac 1980 atggcccaat ctattaatag catcttagaa ccttatcgaa agggcagttc ttttggagat 2040 tgggttgagc ggttaggctt tttattcaat atgaataaga ttgctgataa cgaaaaacgc 2100 gaccacttta ttaccctgag tggccctatt atttttaaag aactcaaatt gctatacccc 2160 aatagcaatt tagccgaggt tccttatgaa gaaatggtta ccaaattgaa agcacgtttg 2220 gataaaactg aatccgattt ggtgcaacgc ctaaaattta atgtacgagt gcagcaaccg 2280 gatgaatcag tagaggattt cgtgttgtct gtaaaattgc aagcagaatt ttgtaactat 2340 aattgtaatg ctgagacgtt taaaaaaatg gctattcgtg atcgcattgt tgccggtatt 2400 cgagataaag ctcttcaaca aaggttgtta aacgaagaga atttaacctt agaaactgct 2460 gaaaagttca ttgctacttg ggaaattgcc aagaacaatg ctaaaagtat ggaatgcggc 2520 agcagtgtgg atcaaatagc ggctttgaaa cctttggggt tcactggagc tagactgagt 2580 aatttaactg caattgcagc aagggaaaat tatggtactc ggggagaatc taatcgagga 2640 tcagttaaaa gtcgattagg ttattatcct tatcagaaag accagaggca gcaaaagcaa 2700 tggagaaatc gaggacaaga caaggagagg aatcgtcagt atagtcgtcc tgattattct 2760 caaatggtgt gcgatttctg tggtgtcaaa gggcacatta agagaaagtg tttcaaattg 2820 aaaaacatgc acagggatgc cgttaacatg attaatccga acacttctgg caccaacccg 2880 gacgaattcc tgagtgagat ggtcaacaga ttgcgtgcag attcagacag tgagagtgat 2940 gcggatggtg agacaaacgt tcttcaatgt atgcatgtgt cgtctattaa tagaattagt 3000 gagccttgtc ttttgaaagt tgaaattgat ggtattttat ttgatatgga agcggattgt 3060 ggttcgtctg taacagttat gagtaggaaa caatatttcg ataatttttc caaaccattg 3120 acaaaaaccc gaaggaattt gattgttgta aatggcgcga gtctggtcat tgaaggagaa 3180 gtggacgttt tagtcaattt taagggtatt tgctcaaatt tgaagctttt gatactgaat 3240 tgtgaaaata actttacccc tttgctagga aggccatggt tggatgcatt ttttccaaat 3300 tggagacatt tttttgtaaa ttctatttcc tccaatgaac aatcaaatca gatacttgtt 3360 gatgacatca gaacgaagtt caaagatatt tttgttaaag atttttcatc accaatccat 3420 ggtttcgagg ctgatttggt tttgaaatcc gaagttccag tttttaagaa agcgtatgat 3480 gtaccttatc gtttacgtga taaagttgta gaatatttag caaaactgga atgtgaaaaa 3540 gttatcacac cgattaagac gagtgaatgg gcttctcctg ttatagtagt aatgaaaaaa 3600 aataatgaaa tacgattagt gatagattgt aaagtgtcga tcaacaagtt tattattcca 3660 aatacctacc ctttacccat agctcaagac gtttttgctg gtttagctgg atgcaaaatt 3720 ttttgttcat tggaccttga gggtgcttat acccaattat cactttcgga aagatccaaa 3780 gatttcatgg ttataaacac gattaaagga ctctatagat acaatcgttt accacagggg 3840 gcctcgtcta gtgcttcgat tttccaacag gttatggatc aggttctaaa tggaattgaa 3900 aatgtttcgg tatatttaga cgatgttttg atcgctggga agaatttgaa tgattgtaag 3960 aaaaaattgt ttttagtact tgacagactt aaaaatgcta acataaaagt aaattgggat 4020 aaatgtaagt ttttcgttac cgaattggta catttaggtc atgtaattag tggtaaaggt 4080 ttgatgccat gtcaagacaa aatttctaca attgaaaacg ctagagtacc taaaaatgag 4140 acagagctta agtctttcct tggattgatc aattactatc ataaattcat accttatttg 4200 tctttaaaac ttttcccttt atacaattta ttgaaaactg aagttaaatt taactgggat 4260 tgcaaatgcg ataaagcctt tgaagagagc aaaaaagcat taattgaagc acaaatttta 4320 gagttttatg accccaataa acaaatagtc atagtttcag atgcttctgg ttatggtttg 4380 ggtggagtaa tggctcattt gatagacgga gttgaaaaac caatctattt tacatccttt 4440 tcgctgaatg cagcccaaca aaagtatcca atgcttcatt tagaagcctt ggctttggtt 4500 tgcacagtta aaaaatttca caaatttata tacggcaaga aatttttagt ttatacggat 4560 cataagccat tggttggaat ttttggaaag gaaggaagga attcaattta tgctactaga 4620 ctacagagat ttgttttgga gctatcaatt tatgatttcg aaattcaata taggccatca 4680 aaatatttag gaaatgctga tttttgttca cgtttccctc tcgaccatgc tattcctgtg 4740 gaattagata cagaattagt taacagcatc aattttggta gggaactccc tgtagatttt 4800 aaagtaatag ctgacagaac taaaaacgat tcatttttac aaaatattat ttcattcatg 4860 acaaatggtt ggccagctaa aataaacaag caatacattg atgtatacgc taatcaacag 4920 gacttggagt ttgtagacga atgtctactg tatcaaaaca gagttgtcat accagcaaca 4980 atgaaaaatg caattttgaa acttttacac gccaatcatg caggaatagt caaaatgaag 5040 cgattagcta ggcaatgtgt ctattggttt ggaattaatt ccgatattga acgatatgtg 5100 actacttgtg atacttgcaa cagcatgatg atagttccaa aaaccaaaat caattcaaaa 5160 tggatcccaa cgacgagacc attcagtaga atacacatag atttttttta ttttgagcac 5220 cgttcctact tgttagtagt tgacagttat tctaaatggg ttgaaataga gttgatgaga 5280 aatggcacag attgtgataa agttttgaag aaattagtag ctttgtttgc tagatttggt 5340 ttaccagatg tattagtatc agacgggggt cctccattta atgcgcatgc ttttgtaaat 5400 ttcttaaaac ggcagggaat aaatgttcta aaaagtccgc cttataaccc ttctagtaat 5460 ggtcaggctg aaaggttggt gagaaccgta aaagatgttt taaaaaagtt tttacttgat 5520 ccagaatttt cccatttgga catggaggac cagattaatt tgtttttaat taactataga 5580 aacaactgtc tttctattga aggggaatat ccttcaaaaa tgattttctc atacaagccg 5640 aaaacaatat tggacttact aaatcctaaa cttcgctaca aaaagttttt gatggagcaa 5700 gctgtgcatg aagatttaat taagagtaaa aactcaactc caattgatga ggattttaat 5760 tcaaaagctt atggtggttc tcgaacagat tcacctgatc cttttgacag tctgacagct 5820 ggggacgaat tgtggtacaa gaatcataat cctcattgca cagctaaatg gcttaaagca 5880 aactttacta aaaggctctc tcgcaacaca ttccaggtgc gaattggaag cgtgctaacc 5940 atggcgcatc gaagtcaact acgcgtacgt aagagtggca acttccatga aaagcctaat 6000 attcggctgg tccggccatt accggagcta gaagccgtaa gccaggagga gtttcgagga 6060 tttacctaag aagagatcag acgtggacag aagagaagga tggtcgaaag tgttgatagc 6120 ccatatcttg aatcagaaaa atcgccggat ggtccaaggc gttcaaaacg aaaacgaaga 6180 gtaaaccatg ataatgattt tgtttataag taaaaaggtt gtgaactcga attgatgttc 6240 aaaaaaattc tatactcgtt gaatgttatt attattcttt attgaatgct ctgaattatg 6300 aattgaatat aattgtcaaa actagatatt tgtattctga acttccaaag agggaaggac 6360 // ID Gypsy-2_DPu-I repbase; DNA; INV; 3737 BP. XX AC scaffold_44; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DPu_; KW Gypsy-2_DPu-LTR; Gypsy-2_DPu-I. XX NM Gypsy-2_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-3737 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 719-719 (2010). XX DR Genome; scaffold_44; Positions 589861 593597. XX CC 'GAAAC' target site duplication CC LTRs are 95% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 901..3498 FT /product="Gypsy-2_DPu-I_1p" FT /translation="MPPKPFKPYGTPPPLDVEEFKDSFEIWHQQWKIFLSL FT STINTALPQTERPQYIANILLSCLSNSTLKAVLTMGLSATDLEDADVIIKK FT LQERCNAGRNCHVWRQQFASRVQREAESIDSWLSDLRDLSRKCEFEKDCCA FT ACQNTRLLGQIVFGVLDDDVRRKLLELGAKLTLEKAITIIRTAEATRLQSS FT NMKQGTTAPVNQIKSATGKRPNDRQAGSQRSQPRGKPSVRWHPPGCQPYGC FT WSCGAASRHAKEECPAFGKECHTCHKSGHFQSVCSQGSSSSKTEIAGSITI FT QSILQDDMVRLGITPACSAAEYFIQMLPDSGASIDAIPVGLYQRHFKDIPL FT SSHDPKAVTVTGSSIISLGQFHASLVWASSSSGPVATSIHVLQDLQQPVVS FT KETQKKLGMLPAQYPHTCVLASIPSSESSVPESPVIQSTLIGLMSEVPSIF FT DGVCRPMRGAPCHFQLKEDAVPSSIRRQVRSQLSVDNDIILVGPRVVIPAS FT LRPEILRRLLLMHQGATKIRQRARQSVYWPSIDNDIVMAVKSCPTCSERLP FT SHPPEPLLSHEPASRPFEFIFADLGTFRGRDFLIIADQFSGWPQVYPFPDT FT NTSSRRIIDAFRSFFTCGAGAPIKLWSDGGPQFKSDEYLSFLKEWDISHGR FT SSPHHPKSNGHAEAAVKSMKKLIAGSWTSGSFDLDKFGKGLLLFRNASIAG FT GASPSQVVFNQPTHDVIPAHRRSFAPEWQKAAGILENRALRARELQKQHYN FT RSTHPLPALHIGDSVVIQHHKSKRCSTPGVIVEVGAFRDYLIKTPAGRLFR FT RNRRLLRLRAPAVVPHSPSRPPVACYPPVADSPSFLADPPVPAASTVPSSV FT GLRRSQWIASRKP" XX SQ Sequence 3737 BP; 740 A; 1101 C; 796 G; 1100 T; 0 other; tggcgcagtt gatttattct tcgtgaattc tcgaacttat tcccgatcac aacttcagtt 60 ttgttttttg tagtgtgtgt gaatgtgaac aggcaatttt caattcgatt ccagtattct 120 tgtgattgtt tcccccccgg ccgtttcaca gcgtggtttt ggtggaggcc gccattttgg 180 atttggtgca tttctccctt acgatcgtcc attccctttc ggtgctcatc atcccagtgt 240 ggttttgtgg aggccgccat tttgttttga gtgagtttcc tcatgacgcc ggcccgtttc 300 cgttcaattt tcatcacttg agtctattgc cccattcaca cccctatttc agttcattgg 360 cattattatt ccccttatct cttgttcgtt cgtgtccgtt ttctgtttca ttcttgaaga 420 gggcccacat gttgcgcgcc caccattttt tttccttcaa cgtcactctg gtgtgcccca 480 ctcgtctccc gttcattact tttgagtatt tcctcccgtt tcacgctagt ttcttccgat 540 atttcatttt ttgttttgtt ccctctctcc tgtttttttt tttttttttt tttttttttg 600 agagccacca ttttgctttt ctctttcccc cctgttcact tgcgtcccgc cattctcgtt 660 ccacgcttgc atttccgcat tcatggattt ccacattcat tctattctcc cacatttcag 720 tgtctatcat tccacccaaa gtccccttct catttgttct catcgctgta ctgaccactc 780 attttcattc aaatatttca ttcaactcca ttcaaatcca attcattctc gttccgtccc 840 tttttaatta ttattgttca tccctgttct gatcagtgat ccattccacg tcgtgtcgcg 900 atgcctccca aaccgttcaa accctatggc acgccacctc ctctcgatgt ggaagaattc 960 aaggattcct tcgagatatg gcatcaacag tggaagattt ttctctccct ttcaactatc 1020 aacacggcgc tgccccagac tgagcgccca cagtatatcg caaatattct gctgtcctgt 1080 ctttccaatt ccacactgaa ggcggtgtta acgatgggtt tatccgcgac tgacctggaa 1140 gatgctgatg tcatcatcaa gaaactgcag gagcggtgca acgctggccg caactgccat 1200 gtctggcgcc agcagtttgc ctctcgtgtt caacgtgagg ccgagtccat cgacagttgg 1260 ctcagcgacc ttcgagattt atcccgcaag tgcgagtttg aaaaggattg ctgcgctgcc 1320 tgccagaaca cacggctgtt ggggcaaatt gtcttcggtg ttctggacga cgatgtgcgt 1380 cgcaagttat tggagctcgg cgctaagttg acactggaaa aagctatcac cattattcgc 1440 acggcagagg ccacgcgtct gcagtcatcc aacatgaagc aaggcaccac ggcaccagtc 1500 aatcaaatta aatcagcgac agggaagcgg cccaacgaca gacaagcagg ttcacagagg 1560 agtcagcccc gcggtaaacc atctgtccga tggcaccctc ccggttgtca accgtatggt 1620 tgctggagct gtggagctgc gtcccgccat gcaaaggagg aatgccctgc gttcgggaag 1680 gagtgtcata cgtgccacaa atcgggtcat tttcagtcag tgtgctccca aggtagttca 1740 tcatctaaaa ctgaaatcgc cggtagcatc accattcaat ccatccttca agacgacatg 1800 gtccggctcg gtatcactcc tgcctgcagc gccgcggaat atttcattca aatgcttcca 1860 gattccggag catccatcga cgccatcccg gttggcttat atcaacgtca cttcaaggac 1920 attccgttat cttctcacga ccccaaagcc gtgactgtca caggttcctc tataatatca 1980 ctgggccagt ttcacgcatc gttggtttgg gccagcagtt ccagcggtcc ggtcgctaca 2040 tcgatccatg tccttcaaga tcttcagcag ccggtggttt caaaggaaac acaaaagaaa 2100 cttggaatgc ttcctgccca atacccccac acctgcgtgt tggcatccat tccctcttcc 2160 gaatcgagtg ttcctgagtc gccagtcatc caatcaacat tgattggatt gatgtcggaa 2220 gtaccttcca tcttcgatgg tgtctgccgg ccgatgcgtg gcgctccatg ccatttccag 2280 ctcaaggaag atgcagttcc ctcgtcaatc cggcggcagg tccgcagcca actttccgtc 2340 gacaatgaca tcatcctcgt cggtcctcgt gtcgtcattc cggcgtcttt acgcccggaa 2400 attctacgtc ggctcctcct gatgcaccag ggtgcaacca aaattcgcca acgcgcacgc 2460 cagtctgtat attggccttc aattgacaat gatattgtca tggccgtcaa atcctgtccg 2520 acttgctccg agcgtctccc ttcacaccca ccggaacccc tcctctctca tgagccggct 2580 tcccgtccat ttgaattcat attcgccgat ctaggcacct ttcgcggaag ggatttttta 2640 atcattgccg accagttcag tggatggcct caagtgtatc cattccccga caccaacacg 2700 tcttcacgcc ggatcatcga tgcattccgc tctttcttca cttgcggagc gggcgcgccc 2760 ataaagctgt ggtcggacgg cggcccgcag tttaagtcgg atgaatacct ttcttttttg 2820 aaggaatggg acatttctca tggccgttct tcacctcatc acccaaaatc taacggtcac 2880 gcggaggccg ccgtgaaatc tatgaaaaag ctcatcgcgg gttcctggac ctcgggttcc 2940 ttcgacctgg ataagttcgg aaagggcctt ctcctcttcc gtaatgcctc tatcgccggg 3000 ggtgcttccc cctcccaggt tgtcttcaac cagcccactc acgacgtcat tccggctcac 3060 cgacgttcat ttgcacctga gtggcaaaag gctgctggaa ttcttgaaaa ccgggcgctc 3120 cgcgccagag agctccagaa gcagcattac aaccgttcaa cgcaccccct tccagctctt 3180 cacatcggag atagtgtcgt catccagcat cacaagtcta aacgctgttc aaccccgggg 3240 gttatcgtgg aagtcggcgc cttcagggac tatctaatta aaaccccggc tggtcgtttg 3300 ttccgccgta accgacggct tcttcgcctg cgtgctcctg ctgtcgtccc tcatagccct 3360 tctcggccgc ctgtcgcttg ctatcctccg gttgcagact caccatcatt cctggcggat 3420 cctcccgttc ccgccgcatc cacggtccct tcatcggtcg gtcttcgtcg ctcacagtgg 3480 atcgcgtccc gaaaaccata atgcctcggt gttggttaac agaggcgatc tacgaaattt 3540 tgcgtgttca tttatgcttc gtttattggt cacctcgttt attcaactca taattctctc 3600 gttgttgacc ccacacccat ccagacgttt tctcctaatg tagtatctgt gaatcttttc 3660 cccttttgat acggcttttg atttctgttt catttactgc ggtctattgc gtgttttgta 3720 ccttggagaa agagaca 3737 // ID Slatif16 repbase; DNA; INV; 347 BP. XX AC GU229960; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mellifera subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Slatif16. XX OS Scaptodrosophila latifasciaeformis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Scaptodrosophila; OC latifasciaeformis group. XX RN [1] RP 1-347 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229960; Positions 1 347. XX CC Clone Slatif16. XX SQ Sequence 347 BP; 91 A; 83 C; 91 G; 82 T; 0 other; tttgggtgcc gcatgagttg acgcaaaaaa accttctgga ccgaatcaac gcctgcgata 60 tgctgctgaa acggaacgaa ctcgacccat tcttgaagcg gatggtgact tggcgacgaa 120 aaatggatca ctatttacga caaattatca agcgaaaacg gtcgtggtcg aaggcgcgcg 180 tgaatcgtcc caatacagtg gccaagccag ggagttgacc gccaggaagg ttttgcctgt 240 gtgtttcggt gcggattggt aagggaatct atcccactat gagctgctcc ctatatggcc 300 tagacgcttt aattcctacc tatcttactg cgatacaaac tggaccg 347 // ID BEL-9_DWil-I repbase; DNA; INV; 5443 BP. XX AC scaffold_181100; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_DWil_; KW BEL-9_DWil-LTR; BEL-9_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5443 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181100; Positions 213485 208043. XX CC Positions [4466-5047] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3110..5413 FT /product="BEL-9_DWil-I_1p" FT /translation="MFDLLGFISPVIVRCKILMQDLTLKGLDWDEPVDGRI FT KERWKELLQDLKDLSNLQIPRYVNTSKSYRCEIHGFADASERAYGCCLYVR FT SELPSGVKVTLRVAKSKIAPTKTQSLAKLELCGTLLLSRTWRKFERTLKLY FT IDKVYFWTDSKIVLQWLKLHASTLNCFVANRVSELQQHREEIIWKHVPSNK FT NPADIVSRGCRTSELQTSMWITGPTFLNLSKAEWPILRQPLPPPIEVRTKS FT ALLCSRADNANSQSILNMIINNSSSYIPIVRIVSYLYCSVNKVKPDRTLSS FT SSIADIRITETEMYQSFWKIVALIQQECYADKISLLKNKQTLTPCLQRLTP FT FMHTMTFGQTAVCVMRVGGRLENGDLPFNAKHQALLPQNHRFTKLYIEFLH FT NKNHHAGPKFILSLLREQIWVINARALIRKVVRSCIRCFHFKPRLMTQLMG FT NLAADRLRAERPFLVTGVDFCGPFFTSYRIRGKPPYKTYAAIFVCFASKAI FT HIELVSELSTNAFLLCIRRFVARRGVPLHMVCDNATNFVGAATRLAEFKSK FT LFNSDVIDHIKTYSSIKGFKFCFIPPRAPHFGGLWEASVKSAKTLALKNIS FT NAFLTFEELLTVFSDVEAILNSRPLAPTSDDPKDFSALTPGHLLIGTELTS FT VPESLPYHSTDPPGTELRHLNRWQRVTYLKQQFWTLWARDYVHNLQQRSKW FT IYEKANVALGQLVIVHEDNVPAQHWPLARVASVHAGPDDKIRVVELRTAKG FT IFKRPIHKIAPLSESFI" XX SQ Sequence 5443 BP; 1754 A; 1196 C; 1043 G; 1450 T; 0 other; ttttggcgac ggtgacagga ccataaactt gctgttaaat aagttttggt tcaattgcaa 60 ctaagttgga ctcgttgttt cataacttct taagctacca caagtttccg aaaagggatt 120 gcaagaggca atcaatcatc aaacacccgt tgagacatct aggaagaaaa gtcgacatca 180 ttaatgcata tgataccacg ttaaactctc cacatactgc agtaaagtca ttggaataaa 240 accgcaagta ttcaatcaaa caattactgt actcgcacat gtccaataag agaaaaccga 300 acagttatat tttcgtcgtt tgtcattcca ctcgttacat atcgaacaca aacaagcgtg 360 agtgcctgcg ccaactgtac atcaaaaaac agttaagttt gcgaacatac acgtataact 420 attcaacatt ctggttggtt ctgctacttg gaggtgttgc tggcgtttat tgaagcatta 480 tgggcaaaga gggttcaaac actcccgtaa gctctaaggg aacagacaaa ctaacaaatc 540 ttagatccca gcggactcag tttgaaaatc atattcgaac cttacatgaa agatcattgc 600 tactagcgac caagaagatg ccgatacact agactgcaga cttcaaatac ttgaatctca 660 ttataggcaa attactcaca ttcaaacaca aattgaaaat ttgtacgttt gataaagagc 720 gcagtaattt agatgaaaag ttcataaagg tcaaggtaca tttattgtct cttttacaaa 780 ggaaacgttc aatgggacca gtaaacacat cggatatgag ttttatgagt tcttcaatgt 840 ctcctacagt ggttcacagt caacgattgt ctaaactaaa attacccagt ttcgatggca 900 aatattcagc gtttaagaga ttcaaatcgt ctttcctaaa tcttattcac aacgatcacc 960 atttgccaac aattgataag ttaaattttc ttctcgaatg tttatcggga cctgcactgg 1020 aggtggtgca gtcgttccaa gtcaccgaac aaaactacag tcaagcttcc atgattttat 1080 atcgagtctg ttcaatttac cattagtcaa gcaaccagaa gcaagacttt tgcagaagct 1140 catcgataat gcgtccgcta tacgcggatc acttcttaca ctaggatcaa ctgaggaaat 1200 aatgaataac atcatgattc acatacttct tacgaaggtt gatgctgaga ccaaaggcgt 1260 ctacgacgaa aagcaaagct tcgacaagct tttaacgtgg gaggaattct gcacaattct 1320 tcatcgccgc tgccagttcc tagaaagtca tcagacaggt caactcggtc agcgcaatag 1380 ttcactagga ggaaaacgac catcaacgtc aaaggcattc ctcaatatta actctcaatg 1440 cttacactgc cattcgacgg cacactatgt tagtaaatgt aaatcatttg gagagttatc 1500 gctacaacaa aggttcgaaa cagtcaaacg gatcatcggg acatcgcgtc aaggattgtc 1560 cctcactctc ccggtgcaag atgtgccgat cggctcatca tacgttattg catcaattcg 1620 cccccacccc gataaaaaac ctatcatcat cgattcaatc tgttaatcca ataactaatt 1680 cacaaagcac ttcgttatta acgaacactg ataaaaaggg gatgcttcct acagcattgg 1740 ttcacataat ggataagtct ggccaattac atactgtatt attagattca tgttctgaac 1800 ttaattttat tacagaagag actgccaagc ggctaaatct ggatagaact cgtgtatctc 1860 aagaattcag cggcattacc ggaagcagtc aaggcattaa acacaatgcg tttgccacaa 1920 tacgttctag gcattcatca tttacgtggt ccgccaactt tgccgtcatt aagaaaactt 1980 gtgctcagca accacacgag catatcgata catcaaattg gaccatacct acggatatca 2040 atctagccga tccgtatttc aacgagcccc acaagataca cattttaata aacacgaatg 2100 ggctgtttgc agcaatcgtc gatggtcagc aagtcttggg cgaaaaaatg tcatatttaa 2160 ttaacacgga atatggatgg atagcgaaag ccatttttgt aaaaatgcaa ttgtaaaatc 2220 ggatggaaga ctccaagttc gacttccatt taaggaacac ccagagcaat taggattctc 2280 ctacgacaac gcagtacgca aatccttagg aaatcttctc gaccgcgacg atcaactaaa 2340 aaaagcatat gtagatttca tcaacgagta tgtaaaactt ggccacatgt ctaccaaaac 2400 tgtcgcaagc cacaacgaac cgcattatta catacctcat cattgcgtgt tgcgtccgaa 2460 aagcgttact acgaaacata gagttgtttg atgcttcggc taaaacatca tcgggaaaat 2520 cattaaatga gctgttaatg attggcccca caattcaaca agacctattt ataacatttc 2580 tatcatttcg tctgaacagg tatgcactta caggcgacat agccaagatg aatcgccaat 2640 ttgtcataga tccaagagac cgaagacttc agcgcattgt atggcgacca acaaaaacag 2700 aagaactaaa aacttacgag ctcaacaccg tgacatatgg catgtcagct gcacccttct 2760 tagctatcag aggcgtccac catattgccg atttgtactc acatgatcat cctataggag 2820 ctaaaacatt acgcaacgac ttatatgtgg atgacttgct aaccggagct gattgcgtaa 2880 ccgagcttca acacattaaa aatgatacag tagacatctt agcacaggct ggattgcatc 2940 ttacaaaatt tattagcaac tgcaaggaaa ttacgaaaac ttcagaggac gaagtatttt 3000 tcgccatgga agatcaagac actaccaaaa ctctgggaat gacgtggatg ccatttgtga 3060 ccaccaagaa ccatttacca aacgatcaat tctttcagta gtagcaagaa tgtttgactt 3120 gctgggattc ataagtccgg tcatagttcg gtgcaaaatt cttatgcaag acttgacact 3180 caagggccta gattgggatg agcctgtcga tggtcgaatt aaggaaagat ggaaagaact 3240 tttacaggat cttaaagact tatcaaatct acaaatacca cgatatgtga acacatcaaa 3300 atcgtatcga tgcgaaatac acggattcgc tgacgcatcc gaaagagcgt acggatgctg 3360 tctttatgtt cgatcagaac tcccgagtgg agtcaaagtc actttacgcg ttgcgaaatc 3420 aaaaatagca cctaccaaga cacagtcatt ggctaagtta gaactgtgcg gaacattgtt 3480 gctcagtcgg acttggcgca aattcgaaag gacattaaaa ttatacatcg acaaggtata 3540 cttttggacc gattcaaaaa ttgttctgca atggctgaaa cttcatgcat caacactaaa 3600 ttgttttgta gcaaatagag tatcagagtt acaacaacac agggaggaaa taatatggaa 3660 gcacgtccct tctaacaaga atccagcaga cattgtttct cgtggttgtc ggacatcaga 3720 gctgcaaaca agcatgtgga taacagggcc aacattctta aatttgtcta aggccgaatg 3780 gccaattctt aggcaaccac ttccgccgcc tatcgaagta agaaccaagt cagcacttct 3840 ttgtagcaga gcggacaacg ctaattcaca atctatttta aatatgatca taaataattc 3900 ttcatcgtat attccaatcg ttcgaattgt ttcatacctc tattgcagcg tgaacaaagt 3960 gaagcctgat cgcactctaa gttcatcatc aatagccgat attcgcatta ctgagacaga 4020 aatgtatcaa tcgttttgga aaattgttgc actaatacaa caagaatgtt atgctgataa 4080 aatatcttta ctaaaaaaca aacaaacctt aacaccgtgt ctacagcgtt tgacaccatt 4140 catgcacaca atgacatttg gacaaactgc agtatgtgta atgagagtcg gaggacgtct 4200 cgaaaacgga gatcttccat tcaacgccaa gcaccaggcc ctgctacctc aaaatcatcg 4260 ttttaccaag ctatacattg aattcttgca caacaaaaat catcatgctg gcccaaagtt 4320 tattttatct cttttacgag aacaaatttg ggtaatcaac gcacgagcgt tgattagaaa 4380 agttgtgcga agctgcatcc gctgctttca ctttaaacct cgcctcatga ctcaactgat 4440 gggcaatctt gctgctgatc gactacgagc tgaaagacca ttcctagtaa caggagtaga 4500 tttttgcggc ccatttttca cgtcttacag gattcgcgga aagccgccgt acaaaacata 4560 tgcggccatt tttgtatgtt tcgcttcaaa ggcgattcat attgaattag tctccgaatt 4620 gtcaacaaat gcatttctgc tttgtatacg tagatttgta gcgcgccgtg gtgtaccatt 4680 gcacatggtg tgcgacaacg ccacaaactt tgttggcgct gctacaaggc tagctgagtt 4740 caagtcaaaa ttgtttaact cagacgtcat agatcacata aaaacatata gttcaattaa 4800 aggatttaag ttttgcttta ttccccctag agctccacac tttggcggcc tctgggaagc 4860 gtcagtaaaa tcagcgaaga cactggctct gaagaatatt tcaaatgcat ttcttacttt 4920 cgaggaacta ctcactgtgt tctcggatgt agaagctatc cttaattcaa ggccattagc 4980 acccacttca gatgacccca aggatttcag tgctctcact cctgggcatt tgctgattgg 5040 aaccgaatta acgtcagtac cagaatcatt accttaccat tccacggacc ctcccggaac 5100 ggaacttcgg catctgaata gatggcagcg agtgacctat ttgaaacaac aattctggac 5160 actgtgggca cgggattatg ttcataactt acaacaacgc agcaaatgga tttacgaaaa 5220 agcaaatgtg gccttgggac aactagttat agtacatgaa gacaacgtgc cggctcaaca 5280 ttggccttta gcaagagtag cgtctgtaca tgccggacct gatgacaaaa ttcgagtagt 5340 cgaacttagg actgcaaaag gaatcttcaa aaggccaatt cacaagatcg ctccgctttc 5400 agaatcattt atttgaaagt tggttctttc aacaggggga gga 5443 // ID Slatif2cons repbase; DNA; INV; 502 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Slatif1cons; KW Slatif2cons. XX OS Scaptodrosophila latifasciaeformis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Scaptodrosophila; OC latifasciaeformis group. XX RN [1] RP 1-502 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC onsensus of clones with show less than eight percent divergence. CC Slatif2cons. XX SQ Sequence 502 BP; 172 A; 100 C; 103 G; 127 T; 0 other; tgggtgccgc atgaattgaa agaaattcat ttaacaaatc gtttaaacgt ttgtgatatg 60 catcttaaac gcaatgaaaa ttgtccattt ttgaagcaaa tcataactgg agatgaaaaa 120 tggattgttt acaaaacgtc aatcgaaaac gatcatggtg caaccatgat gaagcgccac 180 aaaccacttc caaaggctga tattcaccaa aagaagatta tgctgtcagt attggtggga 240 ttggaagagt gtggtatatt attgagctgc ttaaaaaagg aaccaaaacg attaattcgg 300 atgtcttact gtaagcaact ggacaaattg aatacagcca ttaaggagaa gcgaccagaa 360 tctggttaat cgtaaaagct gtccatattc caccaggaca acgctagact cgcactacat 420 ctttggtcac ccgccaaaat actgagtgaa cttggccggg aacttcttga tgcatccacc 480 atactgtcca gacctaggcc ca 502 // ID CR1-48_HM repbase; DNA; INV; 4668 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-48_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4668 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1876-1876 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(775..2331,2252..2836,2721..3563,3547..4179) FT /product="CR1-48_HM_1p" FT /translation="MALNCHLCKKLISQSCRSSQCNICLFWSHTKCDRLND FT TKYSLLQLSRSLRSCLNCIKSVFPFTDVQDNEFKLLFSSDIILKTDDLPLS FT TNLFPSFHLNKVSNQINNYVSSRVLDSEEEISSSLDIFCKYYDISELCTLN FT FNKISALSFFHLNISSLQKHFEDLNIILSLINFDFSIIGITETRLKKGTSS FT IHPILIDNYNIEQTPTESSCGGTLLYLSKKLIYKRRDDLMIYTPNCLESTF FT VEVIFSKKSNIIVGCIYRHPCMDINIFNNIMSVLLQKISAENKSVFLLGDY FT NIDLMQSNNDIATSDFLNLFSFYNIRPFITLPTRITDHSATIIDNIFSSLI FT ANEIVSGNLTTSVSDHLPQFCICPNFNKLFIPRRHNLYSRSFKNFNKENFK FT SDLSKIDWDDITKSNDTNTCFQTFITETTLILDRYAPLKKISIKNFKRRFK FT PWITKGIIKSIQVRNLFQKKMLRSKDLINKKKFESIFKRYKYIIVTLIKLS FT KKNHFKNFFLRKCKKSSRTLERYLNLVKKTILKIFFSENVKNLRELWKGIN FT TLINVKNLNKSSPNCLQENDTYITDPTLISESFNTFFTNVAHKLKSNIHSS FT YINYQKYLKHPNLHSLLLSPTNISEISSLISKLNPKTSLGPNSIPTNILID FT FNYDFSNILSKLFNTSFINGKFPDILKLSCVIPIFKNGSKLSCTNYRPISL FT LSNISKLLEKTYVFNYLALFQYLKMALNFHALTTDLFLFYLXLVSFLRKLM FT YSRVYSFLNSFNCLNDFQFGFRSKHSTCHALISITEKIRKALDTGHFVCGV FT FIDLQKAFDTVDHSILISKLKHYGIRGIVNDWFRSYLSDRKQFVSINGFNS FT SYKSVLCGVPQGSVLGPLLFLIYVNDLINSVHFSSVHLFADDTNLLHINKS FT YHSLCKNVNSDLKGIVHWLNANLICLNTEKTELIFFSSPRKNNFFKNVEIK FT IKINNKRLYPSRVIKYLGVLIDCNLSWNFHIDELSKKISSARKSRANGMLS FT IIRHYVNKVTLRSIYYALFSSHLSYCCQVWGQVGNYRINKLLSSQRQSIRL FT INFQPIKSDTSKSFKKLFIPTFPALVKYANLLFVFDSLNKNIPSPITNLFT FT ESRLLHSYSTRNIHNGKLNVPSFKTQKYGKRSIVYQCVCEWNRSLACILNE FT SKKIYPKIPLLHLKQYQFKKILKNIFILTFWHYYPYDLNIXLVFFS*" XX SQ Sequence 4668 BP; 1635 A; 669 C; 515 G; 1846 T; 3 other; tgttttttta accgttgaaa aggaaaaagg aaataaagga aggcaattaa aaactgaatt 60 aattaccaat tgtttcaaat aaagaattca aaaatatcaa aataaaagaa gagtaaaaca 120 aaaaaacata ttataaaaaa agaaactttt aacaaataaa cactcaaaaa caacatttaa 180 aaaaaaaaaa ataaaagaaa attcgaacat cgatttttac ttaacaaaaa aagaaaaaga 240 aaagaaaaga aaaatagtta aaaaaaaaga gataagttga taaattaata agaagatcag 300 aagagaagat cagtttctta aaagagtaga taatttttat agtttttatt tgtatctatt 360 attttttatc ttattttaaa tattacacta ttatttttat ttatattact acaatttatt 420 attagattac ttattttgat atataatatt taatatttat ttttattatt gttattatat 480 tttttaattt acatttctct taactttaat tagttttttt actcttttag tctgatttta 540 taattttctt caaaattttt gtttttgtta tagtttactg tttgtttggc tgttgcgtta 600 ttgtttaact accattaata ttatttatta taattgttag tttttcgata ctttaattgg 660 ttgttcttaa ttatctattc atacgcatta ttttatttta acaattatta ataacatagc 720 tatttacttc ctacttattg tttcttacat tctctctaat attgcggttc cactatggct 780 ttaaactgtc atttgtgtaa aaaattaatt agtcaaagtt gccgctcatc tcaatgtaat 840 atttgtttat tttggtctca tacaaaatgt gacaggctta atgatactaa gtacagttta 900 ctacaattaa gtcgctcgtt gcggagttgt ttaaactgta ttaaaagtgt ctttccattc 960 acagatgtac aagataatga atttaaattg ctattttctt ctgatataat tcttaaaact 1020 gatgatttac ctctgtctac aaaccttttc ccgtcttttc atttaaataa agtctctaat 1080 caaattaata attatgtttc atctcgtgtt ttagattctg aggaagaaat ttcttcttca 1140 ttagacatat tttgtaaata ttatgacata agcgaattat gtacacttaa ttttaataaa 1200 ataagcgctc tctccttttt tcatctaaac atatcatcgt tacaaaaaca ctttgaggat 1260 ttaaacataa tattatcatt aattaatttt gatttttcca ttattggtat aactgaaaca 1320 aggttaaaaa aaggaacatc ctcaattcat ccgattctca ttgataatta caacattgag 1380 cagacaccaa cggaatcctc ttgtggcggt acattattat atttatcaaa aaaactaata 1440 tacaaacgaa gagatgattt aatgatttat acacctaatt gtttagaatc cacatttgtt 1500 gaagttattt tttctaaaaa gtctaatatt attgttggtt gcatttatag acacccttgc 1560 atggatatta atatttttaa taatatcatg tctgtcttac ttcaaaaaat atctgctgaa 1620 aataaatctg ttttcttact tggtgattat aatattgatc tgatgcaatc aaataatgac 1680 attgcgacat ctgatttttt aaacttattt tctttttata atattcgccc ttttattact 1740 ttgccgacta gaattactga tcactctgct acaattattg ataatatttt ctctagttta 1800 atagctaatg aaattgtttc aggaaatttg acaacttctg tttctgatca cttacctcaa 1860 ttttgtattt gtcccaattt caataaatta tttataccac gcagacacaa tttatatagc 1920 aggagtttta aaaactttaa taaagaaaac tttaaatctg atttatcaaa aattgactgg 1980 gacgatatta caaagtccaa tgatactaac acatgtttcc aaacgtttat tactgaaaca 2040 acattgattt tagatcgtta cgccccactt aaaaaaatca gtataaaaaa cttcaaacgt 2100 cgattcaaac catggattac aaaaggaatt ataaaatcta ttcaagtgcg caatcttttc 2160 caaaaaaaaa tgttaagatc taaagatctt attaataaaa aaaaatttga aagtattttt 2220 aaaagatata aatatataat agttacatta attaaactta gtaaaaaaaa ccattttaaa 2280 aatttttttc tcagaaaatg taaaaaatct tcgcgaactt tggaaaggta ttaacactct 2340 aattaatgta aaaaatttga ataaatcatc tcccaactgt ttgcaagaaa atgatacata 2400 tataactgac ccgactttaa tctctgaatc atttaacact ttttttacaa atgttgctca 2460 taaactcaaa tccaatattc attcatctta cattaactat caaaaatatc ttaaacatcc 2520 aaatttacat agtttattat tatctccaac aaatatttct gaaatttcat ctcttatatc 2580 taaattgaat ccaaaaacat cattaggtcc aaacagcatt cccacaaata ttctwataga 2640 ttttaactac gatttttcaa atattctttc taaactattt aatacctcat ttataaatgg 2700 aaaatttcct gatattttga aattatcttg cgttattcca atatttaaaa atggctctaa 2760 actttcatgc actaactaca gacctatttc tcttttatct aayattagta agcttcttga 2820 gaaaacttat gtattctaga gtgtactctt ttctaaactc atttaattgt ttaaatgatt 2880 ttcaatttgg ttttcgatca aaacattcta cttgccatgc attaataagt ataacggaaa 2940 aaatcaggaa agctcttgat actggtcatt tcgtttgtgg tgtttttatc gatttacaaa 3000 aagctttcga taccgttgac catagtattt tgatttctaa attaaaacat tatggaattc 3060 gcggtattgt aaatgactgg tttcgatctt atctttcaga tcgaaaacaa tttgtaagta 3120 ttaatggttt caattctagt tataaatcag ttttgtgtgg tgttcctcaa ggttctgttc 3180 taggtcctct tttgttttta atctacgtta acgacctaat caactctgtt cacttctctt 3240 cagtccatct gtttgctgat gatacaaatc tcttgcacat caacaaatcg tatcattctt 3300 tatgtaaaaa tgttaattcg gatctaaaag gtatagttca ctggttaaat gctaacctaa 3360 tatgtctcaa caccgaaaaa actgaactaa tttttttttc atctccaaga aaaaacaatt 3420 tctttaaaaa tgttgaaatc aaaatcaaaa ttaataataa acggctatat ccttcaagag 3480 ttattaaata tcttggtgta ttaattgatt gtaatctttc atggaacttt catattgatg 3540 aactaagcaa gaaaatctcg agctaacggt atgctttcaa taattcgtca ttatgtcaat 3600 aaagttacac ttaggagtat ttactatgca ttattctcat ctcatcttag ctattgttgt 3660 caagtttggg gtcaagttgg aaactatcgc attaacaagt tgctatcgtc tcagcgtcaa 3720 tccatcagat taataaactt ccaaccaatt aaatcagata catcaaagtc cttcaaaaaa 3780 cttttcattc caacatttcc agcgctagta aagtatgcta atcttctttt tgtttttgac 3840 tccttaaata aaaatatacc atctcctatc acaaacttgt tcacagaaag tagactatta 3900 cattcctact cgactagaaa tatacacaat ggaaaactaa atgtaccatc ttttaaaacc 3960 cagaagtatg gtaaacgttc tattgtttat cagtgcgtct gtgaatggaa caggagcctt 4020 gcatgcattt tgaatgagtc caaaaaaata tatcctaaaa tacctttact ccatcttaaa 4080 caatatcaat ttaaaaaaat tctgaaaaac atttttattc taactttttg gcattattat 4140 ccttatgatt taaatattrt tttagttttc ttttcgtaat tactacaatt atcatcagca 4200 ctttttgttt atttttttta ttagtaaaac ctgcaaattg atttctacat gtttgttttg 4260 atgtatattt tatgtgcttt ggtttgtgtg tttatcggtc catgtgtttc agtgtgtgtc 4320 tttgcttatt attacttata ttatattaaa aattggtgtc actcctttga tcagatttct 4380 gtatgagtga caccatccta ataaatcatt aacttattat tatattaata ttagttatta 4440 ttattatttt ttcattgtaa ttattgtaaa catacttatt attactatta ctataatttg 4500 tataattatt ttttttatct tatgttatta ttatatattt ttgctattat tactattatt 4560 attattatta ttattattat tattattatt attattatca ttattatcaa aactattgta 4620 aaatgattta cggaaaataa atagcttgtt gttgttgttg ttgttgtt 4668 // ID R2D_NGi repbase; DNA; INV; 493 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Nasonia giraulti. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2D_NGi. XX OS Nasonia giraulti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-493 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 2..385 FT /product="R2D_NGi_1p" FT /note="possible carboxyl terminal end of ORF." FT /translation="EGLRKPDLIAKRGNMAILVDGQVVSEQADLSRAHRQK FT ATKYVGLKRQIEERYGVKEVHFTSVTLSARGIWSKTSAADLVRLGVLKIRD FT YKVISTRVLIGGVSIFKDFNRRTSRTAGQRAGVLHRQGIG" XX SQ Sequence 493 BP; 173 A; 84 C; 137 G; 96 T; 3 other; ggaaggactg agaaagccgg accttattgc caagagagga aacatggcga tccttgttga 60 cggacaagtg gtgagtgagc aggcagacct atcaagagca catcgtcaga aggcaacgaa 120 gtacgtagga ttgaagcggc agattgaaga aagatatgga gtaaaagaag tgcattttac 180 atctgtgaca ctatcagcaa gaggaatttg gagcaagaca tcggcagcgg acctagtacg 240 acttggagtg ttgaagataa gggactacaa agtgatatcc acccgggtat taatcggagg 300 agtttccatc ttcaaagact tcaatcggag aacatcacgg acagcaggac aacgggcagg 360 agtgcttcat cggcagggca tcggctgaag agnagaagaa gaaactttcg gttattctcg 420 gatgcctggg actcggatac aggcagaatg tntattacnt agtaattgat taaaaaaaaa 480 aaaaaaaaaa aaa 493 // ID BEL-75_AA-LTR repbase; DNA; INV; 519 BP. XX AC AAGE02021353; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-75_AA_; KW BEL-75_AA-I; BEL-75_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-519 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021353; Positions 4029 3511. XX SQ Sequence 519 BP; 165 A; 111 C; 111 G; 132 T; 0 other; tgttcgtgca ccacgaaaga actttgtctg ttccctccat ctgtgtaccg cgacgtgtgg 60 cgattccctc gtgcgagaac attagtccga caaggatgac agctaatgac aatagcccct 120 tcggaaatgc caagatggag aaaaacagac ttccgatccc tagtatataa aaggcctcga 180 gagacgtcat catcctcttc tttcatccct ctacgacgat acatggagcc atcatcgacg 240 ttcatccaca cggaaattta aaaatacagt gaattagaaa aaattaagtg aactctaatg 300 aaagtgaatt tgttgtagct tgcatgaaat aaaaattaag gtataagtga acatttaaat 360 taattcgagg aataaagtgg acttaaggtg aaacgaagtt gttgcgaatg ctgtagaaga 420 aaggactaaa taccaaagtg cgatcaactt ggccgcgata tccttggcct ggcagtggtt 480 ctcagtgtaa ccgtccaccg ttcatctgtg ctaccgaca 519 // ID DNA8-56_AP repbase; DNA; INV; 585 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-56_AP. XX NM DNA8-56_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-585 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1989-1989 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 585 BP; 239 A; 72 C; 60 G; 214 T; 0 other; cagtcctgta aagaatactt ttaaaaagta cttgagtaaa tactcaaata catttttttt 60 taagtattta agatactact caaatacttt aattgtaaag tatttcaaat actactcaaa 120 tactttgaaa aagtatttgg aatacttttc aaatactttt ggtttttaaa gtgggtttct 180 aattatcgta atataaacac ataggtagta ggtaatctaa tagttatcta atagttttaa 240 agggaatatt gttattcaat attcaataac tttttttaga aatctggttt tcacaaacct 300 ttgggcttcg gctaacacag aaatacagaa tattaaataa aactattatt ctattattgt 360 agttgacaat aatatttttc caaccaatta tttcgaataa aaataaaatg ttcgaaataa 420 aaaaccaaag tattcaatac caaaaagtat ttaatacaaa aaaaagtatt taaaatacca 480 ttcaaaatac ttcatgctga aagtatttaa aaagttattc aatacttaaa aaagtattcg 540 aatactttta ctcaaatact tttacttaaa tactttacag gactg 585 // ID Gypsy-70_AA-LTR repbase; DNA; INV; 191 BP. XX AC supercont1.22; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-70_AA_; KW Gypsy-70_AA-I; Gypsy-70_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-191 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.22; Positions 3992850 3992660. XX SQ Sequence 191 BP; 68 A; 51 C; 27 G; 45 T; 0 other; tgatgtatcc tgtcaaatga cagctctgaa aaagagacaa catcaactgc tacacatcaa 60 gctatctcat tataaagctt tcctgaaaat aaaagagcta tgagccggta gttctatttc 120 ggctatcaac cagacaacaa ctctctacaa accctcccta atccgaagtg cccctaaacc 180 aggacattac a 191 // ID Gypsy-17_IS-LTR repbase; DNA; INV; 156 BP. XX AC ABJB010032597; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_IS_; KW Gypsy-17_IS-I; Gypsy-17_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-156 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010032597; Positions 1333 1488. XX SQ Sequence 156 BP; 53 A; 29 C; 42 G; 32 T; 0 other; tgtaataata ttgttttgta agtccgaata aacggggcaa gtagggggac aaaagaggaa 60 gaagaggaca gagcgcccag gggatctcaa aacgttctac acgtagtgtt gcttgtaacc 120 cggtacgacc ggtacaatca ggtaaatact accaca 156 // ID Gypsy-202_AA-LTR repbase; DNA; INV; 240 BP. XX AC AAGE02024456; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-202_AA_; KW Gypsy-202_AA-I; Gypsy-202_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-240 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024456; Positions 4218 3979. XX SQ Sequence 240 BP; 79 A; 43 C; 43 G; 75 T; 0 other; tgttgtgtta gtgcatatac gaaagagagc ttatcgttta agagagaaaa aggaatataa 60 atgctgtgct atgaaacaga acagaaatac aagatcagaa gttatctgta cgcaagctga 120 accagacgtg tttcctgaaa ccttttaatt acagtatcac cgaaactcta tttaagtctt 180 cctcttaaaa cgtatacttt ttggttccac ctcgattcag tccgctttgt agttattaca 240 // ID hAT-3_BF repbase; DNA; INV; 4845 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-3_BF autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4845 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4845 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 923-923 (2008). XX DR [2] (Consensus) XX CC The transposase contains a zinc finger (pos. 186-262). XX FH Key Location/Qualifiers FT CDS join(401..556,1226..3694) FT /product="hAT-3_BFp" FT /note="transposase." FT /translation="MAPKRKAKDGNSSCKKKTTILSFLTRGRPEESATNNN FT QNPDDQSRTRELQDKVLDVPLTLPTTPESGRAQPTSSQKEPGKAQPTQSQE FT ELGQAQPTSSQKELGKAQPTSSQKEPGKAQPTSSQKEPGKAQPTPSQEEPG FT KAQPTSSVSSPSVSSQPAACDCIGCTITGPGAFQPKDPTVLSAFQNKGRKF FT QSDWYKDRKWITLCTKQRKVFCAPCRYAKQHRLTFSTNQESSFVTDGFDNY FT KKGIERLDIHAASDPHTEAVLKCSAISGQSIEAQLNSQMAETQRLHRDGLL FT KQLSALKFLLRQGLAIRGHHDKDGNLYHLLQTWADDSEVVRSWLHQGRFMS FT HDHINELINLMGNDVLRSVLARIKGSNPAYFAIIADEATDVSCNEQLNISV FT RYVDHDYEVHEDSLGLYKLSSTDAATITAAIKDTLLRTGLPLKLCRGQAYD FT GAANMKGHRTGVATRIKTEEPAAVPVHCWAHSLNLCLQDTCRQIVAVRDAM FT DLAREIDKLINYSPKRKTLFTQLASGADSSTSSGTIKPLCPTRWTVRTGAL FT ESILANYSTLMDTMQQVNETTRDEYGLKAGGVLAALEKFGTLFSIRLGQLL FT FSAAETTSTTLQKKDLSVQEAMDSVETLKKYYRRQRTEESFDAFYASTVEK FT ARELNIDEPVLPRYRRQPRRLDDGSDPHRHACPKDFHRLVYFQACDLLIGE FT LTERFNQEFLKPVVAMEKLLLNAANGDDFTEKMDEVTSSVFGNDLDPPKLR FT RHLSMLPDIIEQALPEVKKVTSIRTICSAMTTASHRSTFTQTHKLLRLYLT FT VPITSATSERAFSSLKRLLTYLRSTMTEQRLNNCMLMHVHKDIVDDMDLSD FT IAADFASLNGDRMRHFGQWKNK" XX SQ Sequence 4845 BP; 1389 A; 1186 C; 1125 G; 1145 T; 0 other; cagtggcggc ggcaccgtgg gggcagtggg ggcggtcgcc cccacgaaaa attggtcgtg 60 ggggcgtcgc ccccacgaaa aaataagctg aaaataagct aaaacctgaa acattcatct 120 atttttgtca ctaaaatctg ccgcaaatgt cggtagaaat gtacaccgac cgtgcggccc 180 tttcacggaa ggcccgaaac cttctgcacg tccgagtcgt gaactcctct gaaccttccc 240 gatatgcgga ggggtcatgt gataaggtaa cgcaccaaat atggagcttg attctgtcgg 300 cgtacgtcgg aggctgtcgg aaactctcgg aaaaagtaaa aaattcccag agatcagtgc 360 ggtccataaa agttttatcg acgcccggac ggacgccaac atggctccaa aacggaaagc 420 gaaagatgga aattcgtcct gcaagaagaa aacgacaatc ctgtcatttc tcacccgtgg 480 tcgaccggaa gagagtgcga ccaataacaa ccagaaccct gacgatcaat ccagaactcg 540 agagcttcaa gacaaggtat gcatctattt acaaagattg ttttggtata tctctgcgcc 600 agctgttttt tatatcaaat ttatgtttgt attttgtcat ttcttactct gtttttttac 660 ggttaagttt aaatgcgagt gtgatggctg atggcgtaca cagcgttccg tcagggttac 720 acaaaattac agcggcgctc tcatagtgat actagtgata tataatttgc tggactaata 780 taaaatattg ttgattcatc atcttgtatc aacaaatatt ttacatcgtg cagtccccgg 840 cggaggagag tccggctgtc acatatttgt ttgggacgga aatactcaac cgaaccaaac 900 gaacagtttg ctgagaaaaa aaaaaactat acaaactgct gcatggtttt gttacggcgt 960 ggggacgatg tggacactca ctttattcag gaataattgg ttacagacgc acgcactgtc 1020 agcggccggg gaatgatgcc tcggactatc ccgccggtct ggagcgaatt agctagggat 1080 gaccgtggca agtccgtgtg tggtcagatc aaaacataaa taaacatcat ttatcttaac 1140 gacttaattt cttgggatta tataatagac tcccttatca accaaccggg caaatcaaca 1200 agaaatgcaa tttgtatcat aacaggtctt ggatgtgcca ctgactttgc ccacaacacc 1260 cgagtcagga agagcgcagc ctacatcgtc tcagaaagag ccgggaaaag cgcagcctac 1320 acaatcccag gaagagctgg gacaagcgca gcctacatcg tctcagaaag agctgggaaa 1380 agcgcagcct acatcgtctc agaaagagcc gggaaaagcg cagcctacat cgtctcagaa 1440 agagccggga aaagcgcagc ctacaccgtc ccaggaagag ccgggaaaag cgcagcctac 1500 atcgtccgtg tcatctccaa gtgtttcatc ccagcctgct gcttgtgact gcataggatg 1560 taccattaca ggccctggtg catttcaacc caaagatccc acagttctta gcgcatttca 1620 gaacaaggga agaaagtttc agtcggactg gtacaaggat cgcaagtgga tcaccttatg 1680 cacaaaacag cgaaaagtgt tctgtgcccc ctgtcgctat gccaaacagc acaggctcac 1740 cttttccact aaccaagaat cttcctttgt gacagacggg tttgacaatt acaagaaagg 1800 gattgaaagg ttggacattc acgcggcttc agacccacac acagaagctg tgttgaagtg 1860 ctctgccatt tccggacaat ccattgaagc tcagctcaac tcacaaatgg cggaaacaca 1920 gcggctccat cgggatggcc tcctgaagca actgtcagcc ctgaagtttt tgttaagaca 1980 aggacttgcc atccgaggtc accacgacaa ggatggcaac ctttaccacc ttctacaaac 2040 ttgggctgac gacagtgaag ttgtccgttc gtggttacat caaggacgct ttatgtcgca 2100 tgaccatatc aacgaactga tcaatctcat gggcaacgat gtcttgaggt cggtgctggc 2160 acgaataaaa ggcagtaatc cagcgtattt tgctatcatt gcggacgaag ctacagatgt 2220 ctcttgcaat gaacagctga acatcagtgt ccggtatgta gatcatgatt acgaggttca 2280 tgaggacagt cttggcctct acaagctctc atctacagac gccgccacca tcacagcagc 2340 gatcaaagac acccttctcc ggacaggtct tccactgaaa ctgtgtcgcg gacaggcgta 2400 cgacggggca gccaacatga agggccatcg cactggggtc gcaacgcgaa taaaaacaga 2460 ggagcctgcc gccgtcccag tccattgctg ggcccactca ctaaatctgt gtcttcaaga 2520 tacctgtaga cagatcgtgg ccgttcggga cgcaatggat cttgccagag aaattgacaa 2580 actgatcaat tactcgccaa agaggaagac cttgttcaca caattggcct ccggagctga 2640 ttcatcaact tcaagtggaa cgatcaagcc cctctgtccc acacgctgga ctgtacgaac 2700 aggagccctt gagagtatcc tcgctaacta cagcactctc atggacacta tgcagcaagt 2760 caacgagacc acaagggacg aatatggatt gaaagctggc ggtgtcctcg ctgcgttgga 2820 aaagttcggc acgctctttt ctatccgctt aggccagctt ctgttctctg cggcggagac 2880 gacgtcgacg accttgcaaa agaaggactt gagtgtacaa gaagccatgg acagtgtgga 2940 gacacttaag aagtactatc ggcgccagag aacggaagag tcgttcgacg cattctacgc 3000 atcaacggtg gagaaggctc gagagctgaa catagacgag ccagtgcttc cacggtaccg 3060 acgtcagccg aggcgcctgg acgatggaag tgatccgcac agacacgcct gcccaaagga 3120 tttccatcgt ctggtttact tccaggcatg tgacctcctg ataggagaac ttacagagag 3180 gtttaaccag gaattcctga agcctgtcgt cgccatggag aagctcctcc tgaatgcggc 3240 gaacggagac gacttcacag agaagatgga tgaagtcacg tcgtcagtct ttgggaacga 3300 cctggacccc cccaagctga gacgccacct ctccatgctc cccgacataa tcgagcaggc 3360 acttcccgag gtgaagaaag tcacaagtat acgaaccatt tgttctgcaa tgaccacagc 3420 ctcacatcga tcgacattca cacaaaccca caaactgctc cggctctacc tgacagtacc 3480 catcacatcc gccacttctg agagagcatt ctcatcattg aagagactac tgacgtacct 3540 acgctccacc atgacagagc agcgactcaa caattgtatg ctaatgcacg tccacaagga 3600 cattgtggat gacatggacc tgtcagacat tgcggctgac tttgcgtcgc tgaatggaga 3660 tcgaatgcgg cactttggac agtggaaaaa caagtgaaca aacaagtaga ctattcaact 3720 tgtagaaact ttacttctat gttggatgca catttaatag tatgagtata gtactagcct 3780 tcggcgctct atttgttgac atgtactgca tgtatttttc gtattattag tatatagcca 3840 ttataatgta taggtgcgta tagattactt aacgtctatt gtacattcac tgtaacgtgt 3900 gacttaacta acgacctacc gacgctgtgc atgtcaggac catgttggca catagaccac 3960 gatcttgaag tcttgaccca atgtatttac ctaccgtggt tggctcatcg tttaagacat 4020 gtaattgtgg cctttggtgc tgcttgatat cctagacctg gtcccttaac ataatttgtt 4080 gctacttaac agtgttaaag taagaatgtg atgacttgta aaaggctaca atggcgattt 4140 tatataccga gggagcccct cggcgtaagt tacgaagtta tgaagttccc acatcgtttt 4200 acttatgtct aataaaggtt atttgcaagt ttggaaacgg tgttgtgagt cgattattat 4260 atagtcaaat gtagttagta acacaagaat aagttagtag aaagctttgt tattacataa 4320 cggaaaacac caaatgaaaa tcgagccgag aacataattt tgaacgacga aactgttaac 4380 aaactgaatg aaaacaggtg cctaacgaat tgtaacccac atcacgattt tacaagtcga 4440 atctgaaaac gaaaacaaga aaatagttaa aacacttgca aatacatgtt tgtaaatact 4500 atagtatata attcttgcga tcccgttggt aatgcggcat ttgctccgcc ccgccccctc 4560 aatgtcgcat gggtcagtct attttcgcga cagtcataca cccttctcca gaaaacgtaa 4620 gagtaatata cctaagtaca ccatcaaaat ttgctccaca aaatgcagga aacagcgttt 4680 cagagggtct agatttgaaa attttcccgg gggagcatgc ccccggaccc ccctagaaag 4740 gtcgcgcctt cggcgcgacc cctcgcgcct tcggcgctcg atttggtgat attcacaagc 4800 cagaagtttc gcccccacag tcaaaaaaac gtgccgccgc ctctg 4845 // ID Homo8 repbase; DNA; INV; 2801 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo8 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo8. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2801 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 534..2267 FT /product="Homo8_1p" FT /translation="MDKFLRDASGKGIISIQLYLYFCICICMCYFCVDRKS FT TDIDEGETQVKRTRYGQSEVWKFFTKSCNGSAKCLKCGKVYLTSGNTSNLS FT GHLKRMHPTLTISELPKETGSILSFIDKKYEPSSNRKKALDSALMYYISSD FT MRPFSVVENKGFRHLVKALDPRYELPSRSKLRDSCMTDLYKKMRENLKSVL FT DRVEHCAITTDCWTSRANESYLTVTCHFITENFMLRTAVLSTKKLLDETNH FT SADNISLTLREVLIEWQVIEKVTAVVTDNARSMIKACELLQKRHVPCFAHC FT INLVVQGCLSKEKIKNVIGKCKSIVSFFKNSSIAYAKFRDEQKIEKPYNLK FT QECPTRWNSAFYMVERILSTHAAIAKVLLNTPKAVLPLSADEILVLEDLKL FT LLAPFDHATKRASSSSSVTTSIIIPIVYGLIHNLEKINVRLVSDDGREAYN FT SLMEGIRQRLTQYEKRTVTRIATLLDPRFKKEGFLSIANASESQKILENEL FT ATLYCVLPASPTNDIQCLPAPNCSSTEPELFEFLEENISKKVRSGRVDSIL FT ALRQYFMSENLSSSSSPLDYWKVMFLRYKMI" XX SQ Sequence 2801 BP; 909 A; 497 C; 569 G; 826 T; 0 other; tgtttgagat tgagctccag ccacagcctg atggcagcca tgaggcggtg ctcaatctca 60 agcagatcgc tatgcaagca gtcagtgtgg agcgtatggc caagcccatc gatccggccc 120 agtaccatcg atgtcaggcg tttggcaatt cgaaaaacta ctgtaggcgg ccatttaggt 180 gtatgaagtg tgccggcgag catgcttcta ccgattgccc aaaagagacg aggcggcaac 240 atgcgcaaat tgtagtggtt cacacgttag ctgctttaaa ggctgtccag tagtgctggg 300 cgcaacatcg atgtaatatc gatacatcga tgtttgaaat aaaatgagaa acatcgatgt 360 ttgtttttcg atgatcgatg ttgtgtataa atgttcccac cactaaattt actggtttct 420 ttacattaag caaaaattgt gttgtgagct tggtaagaac aaatttgcat ttatttattt 480 ctttctttta ttttttattt attatttttg taggatattg gtgctggtag gcaatggata 540 agtttttgag agacgctagt ggcaaaggta ttattagcat tcaattatat ttatatttct 600 gcatatgtat atgtatgtgt tatttttgtg tagatcgcaa atcaacagac attgacgaag 660 gagaaacaca agtaaaaagg acaaggtatg gacagtcgga ggtgtggaaa ttttttacca 720 aatcgtgtaa tggttcagca aaatgcctaa aatgtggaaa agtatatctt acaagtggca 780 ataccagtaa tttatctggc cacttaaagc gaatgcatcc gacgctaact ataagcgagc 840 ttcctaagga gacagggtct attttatcgt ttatagataa aaaatatgaa ccgtcttcaa 900 ataggaagaa agctctcgac agtgcgctga tgtactacat aagttcagat atgcgaccat 960 tttcggttgt cgagaacaaa ggattccgac accttgtcaa agctcttgat ccaagatatg 1020 agttgccatc tagatcgaag ttgcgggatt cttgcatgac agatctctat aaaaaaatga 1080 gagaaaatct caagtctgtc cttgatcgcg tggaacactg cgctataaca actgattgct 1140 ggacctcacg tgctaacgaa agttatctta cagtcacgtg ccattttatt acagaaaact 1200 ttatgttacg cacagcagtt ttgtccacca aaaaactctt agacgagact aatcattcag 1260 ccgataacat atccctaacc ctgcgtgaag tcctgataga atggcaagta attgaaaaag 1320 tgactgcagt agtgacggat aacgccagaa gtatgatcaa agcttgcgaa ctgttacaaa 1380 aacggcacgt gccatgcttt gcgcactgca taaacttggt tgtccaaggc tgtttatcca 1440 aagaaaaaat taaaaacgta atagggaaat gcaagagtat tgtgtcgttt ttcaagaaca 1500 gctctatagc gtatgccaaa tttagagacg agcaaaaaat agagaagccc tataatttaa 1560 aacaggagtg tcccacaagg tggaatagtg ctttttatat ggtagagcgc attttaagta 1620 ctcatgccgc catcgcaaag gtactgctga acaccccaaa agcagtgttg ccactatctg 1680 cggacgagat tcttgtgttg gaggacttaa agcttctact cgctcctttt gaccacgcga 1740 caaaaagggc atcatcgagt agttcagtga caacatcaat aattattcca attgtatacg 1800 gacttataca taatttagaa aaaattaatg tccgactggt ctctgatgat ggtcgtgagg 1860 catacaattc attaatggaa ggaattcgac aaagacttac gcaatacgaa aaacgcacag 1920 tcacgcgaat agcaactctt ctggatccca gattcaaaaa ggagggcttt ctgtcgattg 1980 cgaatgccag cgaatcccaa aaaattttag aaaatgagct tgctaccctc tactgcgttt 2040 tgcccgcgtc tcccactaac gacatacaat gcttgccagc accgaactgc agctctacag 2100 aaccagagct ttttgagttc ttggaagaaa atatttcaaa aaaagtacgg agtggacgag 2160 tggactctat tttggcactt cgtcaatatt ttatgtccga aaatttatca tcatcatcca 2220 gtccattaga ctattggaag gtaatgtttc tcagatataa gatgatttaa aattatattg 2280 atattttatt tgcagatctc aaatgacaag ccagtcgtaa aatgcgctaa gaaatattta 2340 tgcgtgccag caacctctat ggagagtgag aggatgttca gcaaagctgg actacttgta 2400 agcgaaaaaa gaagctcgct gaagtcaaaa aatgttgaca tactggtgtt catcaacaaa 2460 aacgattggg tcttaaatga gacccattag aatatgagat ggtaagtttc ccatacttag 2520 attaatttta aataatgata ttaataacaa tgtaattgca gatcttaaaa tacgataatt 2580 taaaaaaaaa gactgatgga gtgatatatt ttttattttt tatttacttt ttttattgtt 2640 tcttttttat ttttaagttt tgaaatggat tacaaacgaa aatatataat ttaattttat 2700 taagaatgta aacattgaat tcatttataa taaaaaaaaa ttccgatttt tcatcgatac 2760 atcgatgttt tggattgaaa aacatcgaaa catcgatgtc a 2801 // ID Copia-134_AA-LTR repbase; DNA; INV; 129 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-134_AA_; KW Ty1_copia_Ele217; Copia-134_AA-I; Copia-134_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-129 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 129 BP; 40 A; 28 C; 22 G; 39 T; 0 other; tgttagcgaa tgcaaatttg caatcgaagc aacccatgac tttagtgtta agattgattt 60 cgaataaaaa ctatataagt tgttagaccg tcacctagac cacacgcgtc ttttattagc 120 tctgcccca 129 // ID R1-3_BM repbase; DNA; INV; 2864 BP. XX AC . XX DT 29-APR-2010 (Rel. 15.07, Created) DT 29-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-3_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-2864 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1049-1049 (2010). XX DR [1] (Consensus) XX CC >96% identical to consensus. XX FH Key Location/Qualifiers FT CDS 136..1713 FT /product="R1-3_BM_1p" FT /translation="MVVSSCCSSPKRTTRKRAKGSKNASSDSSTEXEASRS FT GSLVSRTRPSKRGRGRPPTTGQYVGLAAAKQAHLKAQREELELRAEQEVVE FT SVHNLREKRRAVHLISGNITADPATRLQETAQMALTIAAKARNPKGTYIKA FT LKEMATTIKEATEDLASRSANEETLRLQALSERQEAEILQLRKEVEDVRAE FT MVRLAQTTAQPTPAPTLPNEDDERRLQIIMRAVGSMLDARLAGLEARLLPE FT PRLRPPLAADARRRSRESEIDPDTAGLVSEATPPYAALPQLETNKKVKPKK FT QKGNNKKTTPSAPPEPPAPPEPRAFPPAPAALTVSWATVARRGARPRREKE FT PAPTTNAEGQRDRNPAPLKPKKKGRKKRLRAPRSQAVILRLHPEAIEKGLS FT YREVLAEARASIDAGALGIPIEKVRSAITGAKILIVSGEDQIAKADLLAEK FT LKEVLSSKRVTVTRPMVTAAIRISGLDDSLTEGDDMHRACAHRRGISCAAA FT KLARLSANSLRRWQSRLAPRASIKPADPEE" XX SQ Sequence 2864 BP; 671 A; 832 C; 949 G; 411 T; 1 other; ggggggggca caagctgttc cgggaaagag tccgagaagg acggggagaa aaggaagagg 60 tccacaaggg acgaaagaga aacggtgcaa aacgaagacc gcaggagaga aagcagcgtt 120 tgctcggagg acagcatggt ggtcagctcc tgctgttcat caccaaagag aacgacgcgc 180 aaaagggcaa aaggctcgaa gaacgcaagc tccgacagca gtaccgaanc ggaggcgtct 240 aggtcaggat cgttggtgtc caggaccaga ccatcgaaga gaggccgagg tcgtcctcca 300 acgaccggcc aatacgtggg attggcggcg gcgaagcagg cacacttaaa agctcaacgt 360 gaagagctgg agcttcgcgc tgaacaggag gtagtggaaa gcgtgcacaa cctgcgtgag 420 aagaggcgcg cagtacattt aatatccggg aacataacag cagatcccgc gactaggctc 480 caggagaccg cccagatggc ccttaccatc gcggcgaaag cgcgcaaccc gaaggggacg 540 tacatcaaag cccttaagga gatggccaca acaataaagg aggccacgga agatctagct 600 tccagatcgg ccaacgagga aaccctgagg ctgcaggcac tcagcgagag acaggaggcg 660 gagatcctgc agctcaggaa ggaagtagag gacgtaagag cggaaatggt gcgcctcgct 720 cagacgacag cgcagcccac tcccgcccca acactgccaa acgaagacga cgagcgacgc 780 ctgcagataa tcatgcgagc agtcgggtcc atgctggacg cccgactcgc gggtctggag 840 gcccgtctac tcccagagcc tcgcttgcga ccacctttgg ccgctgacgc aagaaggagg 900 agcagagaga gcgaaatcga tccagacact gcaggtcttg tgtcggaggc gacgccgcct 960 tatgcggctc tgccccagct tgagaccaac aagaaagtga agcccaagaa acagaagggc 1020 aataacaaga agacgacgcc gtcagctcca cccgagccgc cagctccacc cgagccgcgc 1080 gcattccctc cggcccccgc tgcccttacg gtcagttggg cgaccgttgc gcgtagagga 1140 gcacggccga gaagggaaaa ggagccagca ccgacgacca acgcggaagg ccaacgagac 1200 agaaaccctg ccccgctcaa gcctaagaag aaaggtagaa agaagagact tcgagctccg 1260 cgctcgcagg ccgttattct gaggctgcac ccggaggcca tcgaaaaggg gctctcctac 1320 cgggaggtgc tcgccgaggc gagggccagc atcgacgccg gagccttagg gatccccatc 1380 gagaaggtcc ggtcagcaat aaccggagcc aaaatactca tagtgagcgg cgaggaccag 1440 atcgcgaagg ccgacttgct ggccgagaag ctcaaggagg ttctctcttc caagagagta 1500 acggtgacaa gaccgatggt gaccgctgca atccgcataa gcggacttga cgattcgctg 1560 acggaggggg acgatatgca ccgagcttgc gcgcatcggc gggggatatc gtgcgcggcg 1620 gcgaagcttg cgcgcttatc ggcgaattcg ctccgtcggt ggcagtcaag gctggcacct 1680 agagcctcta tcaagcccgc ggatccggag gagtagaccg ccagggccac gcggggaggt 1740 tctccgcgtt ccaggttctc gcccgtaacc cgatagcagc ggtgaagacg atctagctcc 1800 atcgagatcg ttacagtgac gaagctgcgc atcgggtgga gcgaccttgt agaggaatca 1860 gctagtccac ctgctggaaa atggcccgct cgagactaca atgcttccgc tgccactctc 1920 cgctccaacc cccggcggcg cgccgctgcc acgcgttggg ccatgtgggt gcgcggtgcc 1980 cgtcgtcggt cgaccgcagc cacgactgct accggtgcgg acagaccggc cacgtggcat 2040 ccggatgcac actggcaccg cgatgcgccg tctgcgcctc cgccgggaaa ccagcggacc 2100 acacttcggg gggcaaggcc tgtgcaaggc ccccgagaaa accgaggaga gaggacgtcg 2160 ctgcacccgc ggccgcaccc gttgctgcag accgaagtca actcggcgag acgacgactc 2220 cgatggacgt ccagcctccg tcgggcacaa ggaagaccgg ggctttatga agaaccgggc 2280 atgccgatga accgaggagg aagtcgtgag gaggaggtcg tggacctcgg ttcgtaggca 2340 gtgaaggcgg agcgggtggt gcgggcagaa agcccccacc cggctcctgc aggggccgtc 2400 gattgctggg tgcgggggcg cccggtgctc ggcactaagg tcctgtgggg tggtggtgcg 2460 ctgtggctcc ggtgcgcccc tcacgagtcg tggagatgag agggggaaga agtccccggt 2520 ggcagatccc gcccaacgta acattggggt gggagccggc gaaggcgggc ctatggcgcg 2580 cacatggcgg tacctgggtc cccgcccagc cggagtggag gagctcgttg gggtttagtc 2640 ggtagtcgtt aagctgggcg ggtttggcga gagctgaact tagcccagcg cgcctttcta 2700 aaggcgtagt ctccgtggac caattggttg agggcgcgga cctcggttcg cgacttcttc 2760 ctctttcttc caccggaggc gcggagtccg acataacccg gtccgacccc cgtcggcagg 2820 gtatccgtaa agactaggat tccccatcga taccaaaaaa aaaa 2864 // ID Harbinger-N16_BF repbase; DNA; INV; 1062 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N16_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N16_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1062 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1062 RA Kapitonov V. and Jurka J.; RT "Harbinger-N16_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 808-808 (2008). XX DR [2] (Consensus) XX CC This family is characterized by 36-bp TIRs and TWA TSDs (verified CC by insertions into other transposons). XX SQ Sequence 1062 BP; 305 A; 254 C; 204 G; 299 T; 0 other; agccccggtc acaaagcgcg tacgattcct tgcgatggac tattccgcat atcgtacgat 60 gagcgcagta catcgcaaca aaatcgtaag catcgtaagc catcgcaacg catcggatgg 120 tgttcaaaat ttttccagcg tcgcacgatt tttcacttgt gtctcttctg tatagccgtc 180 tgaccgcttt tgtgcttctt taaccaacaa aatgtggaca tagctgttga ataccttcct 240 ataatatctt taactgtaaa aataaattaa aaaagaccat gacaacaaga gtattacaag 300 caaaagttca aatcaagcgt gagtactggt atggttatct cggcctgctt gccggctcta 360 cgtgtcaggc tctttcgaag atcgtatcat gatatgctat caacaaaaag atgccatcat 420 acaaaaatca tatcataagc attatatcat atatatttcc gactcataga gcctgccttt 480 atgttcgagt tgtccccatg gttattacga ttatggtaaa ctgacccaag accgacgaga 540 gctgagattt gatctaatta taaactcacg tgcatgaagc gatcaaacaa ttgtctgcat 600 cacctgtgcg gggcgcgctg ttctctggtt gcgactgtgg cagttgcgac tgatatattt 660 ccccttttcg tcaatatcga caatatcagg gaaaactgaa accatcacaa catgctaaaa 720 attcataaat ctataacctc atcagtagac aaaaattgcc gtaatgaagt ctgctatggg 780 ggcaaataaa atcttacgac gatagcgaaa ttttgaacat gttcaaaatc catttcagct 840 ctatttttcg ccatacgacg ctccccgatt tactatgatt cactgcgatt gacgcaggca 900 cgctcttatg acctcatcgt acgatgcact acgcgctttg tgaccacggc tttatatact 960 taaaccattg cttccgattc tcccccgact tacgacgcac tgcgcttatc ctgcgcatcg 1020 gaaggaatcg caaggaatcg tacgcgcttt gtgaccgggg ct 1062 // ID Gypsy-31_AA-I repbase; DNA; INV; 6109 BP. XX AC AAGE02023662; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_AA_; KW Gypsy-31_AA-LTR; Gypsy-31_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6109 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023662; Positions 28124 34232. XX CC Positions [1781-2320] - Reverse transcriptase CC Positions [3464-3835] - Integrase core CC 'ATAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1100..3292 FT /product="Gypsy-31_AA-I_1p" FT /translation="MVHEENNFIKMVGIGGMRLNALIDSGCKVSTIQWRFA FT GNAGRLEDTNTTLVGFGGRKVKVDKMVKTTVKLDEIEVSVKLNVVPNWTQS FT TALILGRDVLNHDGVVMVNKSGEVRFQRDGSVRTVEAQPESQMVVELQKRQ FT YETMFTIDDRSEFEEIRESDLNADGTKEEILKLVNEFRSCFAKNMKELGIA FT KDTEMKIELSDKDPVYVKPHRMEFAREAALREIVEEFVDANIVVESVSPYS FT SRVVMVPKKDGSYRMAVDYRLLNKKTVKDRYPMPDIEWCLNKLSGAELFIT FT VDLYSGYYQIPVAEESQACTAFSTRDGHYHFLRMPFGLVNGCSVFQRAMNN FT LTAKLRKEGIVVYIDDLVIGGKCVVELLEKFRRLLEVLEESGFTINLKKSH FT FFKSTIEFLGFEISKKDVRPGSAKTKAVKNFPVPQTVQQVQQFLGLAGFFR FT KFVPRFSLVASPLFALLKKEARFGWEREQEEAFVALKKVLSSRPLLVLFDP FT KREVELHTDASKDGLSGILLMQMDEGLQPISYFSKKTTEAESNYHSYELEV FT LAVVTSVERFRNYLIGKFFPIRTDCTAVRDTYEKREMNARIARWFLKLQEY FT DFKMVHRPGSSMRHVDALSRNPVESGSETLPVLAEMLQRQDPKLLLIIDTL FT RREPTTGEDRQVKLNYELVQNRLMRIVDGLKLWVVPSRVRWRLVNTYHDEM FT GHFGEEKVLELLRDKFWFPKMRKYVRSYIEA" XX SQ Sequence 6109 BP; 1833 A; 1065 C; 1592 G; 1619 T; 0 other; agctgtagac aggatgaaag ccttgtgtgg tacgggccag tgcgaacaat agatacggtg 60 cgcattgtcg gatggtgaaa tttgcaaatt gtgaacgtgt ggtttttgcg gagatttgtt 120 aagcacacgg ttcaggtgtt agcgaaggaa gttcgcggtt gataattcac gatacggtag 180 tcgtggatga gtacggacga cgaagatagg tcggacgttg agacggctgg ttatagagac 240 ggtgaattta gaacgccggt gtgtttcgta accgatcgaa aagctaagca aaagtcgcgc 300 gtgtttgttg ctgatggatg caaaattgaa agcagattta gaagccaaag aagaagagat 360 ccagaaggtc aaaatgcaat tgttgcaaat cggagctgga acgagtggtg ggacgagttt 420 cagtgtagat cgccggccag atttttggga gctctgcctt gtttgtctgc cgaagaatgg 480 atcgaggaga tcgatactac ggcggctcat tataagtgga aagaagacac aaagctccat 540 tgtgctcggt tgaatttgga gggctcggcg aagttgtggt gggctgaggt tcaaagcgtt 600 gcgactacat gggcaacctc ctcgcagaaa ttggttaatg cctacccgtc ggcgcgtgat 660 ccgattttct accacaatca gatgtcccaa cgaaagaagc aaaaagagga aacgttggat 720 gagtatgtct atacccaagt ggctatggga aagcgtgctg gttttcaaga atcagtgata 780 gtgaaatacg taatcaatgg gttgagggac ttcacggcga actgtaaagt gaccttggcc 840 gggaaaatag ctaccgtcga agcgttgatg gaaatggatg gagggtttca tggacgctcc 900 ttcggagaga gatgtagtga aaattgggga gaaaaggaaa gagaagcgtg ccttgtgcta 960 taggtgtaat aagcctggct acaaagcgat agtgtgtaaa gcgacagcag atcggagttg 1020 tttcagctgt ggagatagtg gccatcaggc gtagagttgc cctaaatcgc ctcgggccgg 1080 accatcgtcg cgagtgcaga tggtccacga agagaacaat ttcatcaaaa tggttggaat 1140 aggaggcatg aggttgaacg ccctaattga ctccggatgc aaggtgtcga ctatacagtg 1200 gcggttcgcg ggaaatgcag gcagattgga agacacaaac accactttag tgggtttcgg 1260 tggaagaaaa gtgaaagtgg ataaaatggt gaagaccacg gtgaagctag atgaaataga 1320 agtttcggta aaattgaacg ttgttccaaa ctggactcag agcacggcat tgatcttggg 1380 tcgagatgtg ttaaatcacg atggcgtagt gatggtgaac aaaagtggtg aagtgaggtt 1440 tcagcgtgac ggtagtgtgc gtaccgtgga ggcacaacct gaaagccaaa tggttgttga 1500 actccaaaag cgacaatatg agacgatgtt cacgatcgat gaccgttccg agttcgaaga 1560 aatcagggaa agtgatttga atgctgatgg cacaaaggaa gaaattttaa aactagtgaa 1620 tgagtttcga tcgtgctttg ccaagaacat gaaggaattg ggaatcgcga aagacactga 1680 gatgaagata gaactgagtg ataaagatcc cgtatatgtg aaaccacatc gcatggaatt 1740 cgcccgagag gcagcgttgc gggagattgt ggaggaattc gtggatgcca atattgtggt 1800 cgagtctgta tcgccgtata gcagtagagt ggttatggta ccgaaaaaag atggatcgta 1860 tcgtatggca gttgattacc gcttgctgaa taagaagaca gttaaggacc gttacccaat 1920 gcctgatata gagtggtgct tgaataagtt gagtggggcc gagttgttta ttaccgtgga 1980 tctttattcc gggtattacc agattcccgt cgcggaagaa agtcaagcat gtacagcttt 2040 ttcgacaagg gatggtcatt accatttcct ccgtatgccg tttgggctgg tgaatggctg 2100 ttcggtattt cagcgggcaa tgaacaattt gacggcaaaa ttgcggaaag aaggtatagt 2160 cgtctatatt gatgacctag tgattggggg aaaatgtgtg gtggaattat tggaaaaatt 2220 tagacgtttg ctggaagtgt tggaagaaag tggttttaca atcaacctga agaagtcgca 2280 tttctttaaa tcaaccatcg agtttttggg gttcgaaatc tcgaaaaagg atgtgcgtcc 2340 cggatcagcc aaaacaaaag cagtgaaaaa ctttccggtg ccgcaaacag tgcagcaagt 2400 acagcagttt ttgggtttag ccgggttttt ccggaaattc gtaccgcgat tcagtctggt 2460 ggcatcgccc ttgttcgcgt tactgaagaa agaagcacga tttggctggg aacgagaaca 2520 agaagaagca tttgtggcgc tgaagaaggt gttgagttca cgaccactgc tagtgctgtt 2580 tgatccaaag cgggaggtcg aattacacac cgacgcttca aaggatggtt tatcaggaat 2640 cctattgatg caaatggatg agggccttca accaataagt tatttcagta agaagactac 2700 cgaagcagaa tccaattatc acagctacga gttagaagtt ttggcggttg tgacgagtgt 2760 agaaagattt cgaaactatc tcattgggaa attcttcccg attcgtacgg attgtacggc 2820 ggtgcgcgac acttacgaga agcgcgagat gaatgctcgt attgctaggt ggtttttaaa 2880 gcttcaggag tacgacttca aaatggtgca ccgaccaggg agttcgatgc gacacgtaga 2940 cgccctcagt cggaatccag tggaatcagg aagtgaaacc ttaccagtgt tggcagagat 3000 gttacagagg caagatccca aattgctgtt gataatagac actttgcgac gcgagcctac 3060 tacgggcgaa gacagacagg ttaaactaaa ctacgaatta gtgcagaatc ggttgatgag 3120 gatcgttgat ggtctaaaat tgtgggtggt gccgagcaga gtgcggtgga gattggtcaa 3180 cacctaccac gatgaaatgg gacattttgg tgaagaaaaa gttctcgagt tgttacgtga 3240 caagttttgg tttccgaaaa tgaggaagta tgttcggtca tatattgaag cctgacctca 3300 gtgtgcatac tacaagtgca aagggaataa gccagagggg tttctgcacc cagtgccgaa 3360 agaaccagta cccttcagca cagttcattt ggaccacatg gggccgttcg ttcgctctgc 3420 tcgcggaaat tgttatgtgc tcctgttaac ctgcgggttt tagaaatttg tgatagtccg 3480 cgcagtgcga tcgacgcaga ctggcccggt actaagtttt ttgattgaga ttaccggagt 3540 gtttgggacc cctaaccgta tagtgaccga cagaggtacg gcgttcacgt cgaaacagtt 3600 cgtagaatat tgtcgtgtta atcaaattca gcatatcaag actgcagtgg ggagccccag 3660 agcaaatgga caggtggagc gatcgaacaa aactgtactc aatgctctgc ggtgcaccgt 3720 gggtgagaac aaaacgcgct gggatgaacg agtagcagca gttcagtggt caatcaattc 3780 ggtggtgaat tcgaccacac agttggcccc gaatgcgtta gcgttttcat tcaagccgag 3840 agacgtaaat caaaacgcga ttgtgatggc gttgcacgat gaagcggata acgtggtggg 3900 ggactcgaat gaacttaggc agcgagcgat gtcaagaatt acagccagac agcaggtgca 3960 gaaaaactac ttcgatgaat agcgccgaaa acctacagaa taccaggtag gtgacatggt 4020 taaagttgaa cgagatctga ctgctatggg acacagccga aagttggagt ctagattcaa 4080 ggggccgttc gtggtgatga aaatactgga taacgatcga tatgagcttc aagaaatacc 4140 aggaacgaaa ggctcgagga ggatgggcta ctacagtgta ttcgggggac agaatcaaac 4200 gctggtgtca actggcagat atagatggtt gtgaaacatc caacggagaa gaagagatgt 4260 gaattgtttg actagacgtt taacttgaaa ttagtaacat tttcattttc attttgcagc 4320 gtttggtggg gcaatgagag cgcaagtcag tccaaagccg ggggaaggga taggtaatgg 4380 ccgtaatagt ctatgcggac cacttgaaca ccatcggaaa ggataagaag ggttggggta 4440 gggtataggg aattggagaa gacttaacac agtaatgcga ttaatgctgc aaaatgtaaa 4500 tgtgctgaaa gcaatttaat ttgagttttg aagtttgatt tgacttatta aaattccaga 4560 tagatcggcc attgtacaag taagaaagag tatgctgatc gctttctaga aattttataa 4620 acaacaaaat aataacaata tgagaatatg agagtgatgc taagggctgc tgatggcacg 4680 atgcggcgct ttaggagatt ttgatactga ttgaacatta ctataaagag cgcaattcaa 4740 tcgatggtga taactttaca aactgtatct gttttgcact gattgacgat gctgatttat 4800 ataaaaatat ttgatattat tagttcattt gtttgtgtga tctatcagtt aataatggaa 4860 aaaatttaca tacccttgat gatgatactt ccctgaccag aaatattcta ttttgagcat 4920 ccttttcgaa gctattctat ccaggtatcc atgtactgct ttcaatcctg aaattattct 4980 tattgtgcat gctcatgcat tatattagtg atgctcataa tcatgcaaca gataagaaga 5040 agagaaaaag agaatatagg aacagtgatt agtaatgagt taattatgta gttttgttag 5100 aattataaac aattaaacag tagtaatgaa tagcttacaa aaattttgca gcatagctga 5160 gcctttcgga tcgtaagacc gatgtatgaa aataatttgt ttcgaaattt tccgcgtggt 5220 cttctagtgc ccctattaga taagaatcag tcatagacat acataccgaa tgctcgccat 5280 gaccctatga ccaacatttg tatatgtata gccttagctg atgtggcttt tctaataggt 5340 agttctttca ctcattgatt gtcttcaaag ccttgtcaag tatagaaaat tgattttatt 5400 ttatttatta ctacattgaa gctacattgt acttattaaa tattaggtat tattttgttt 5460 ttatcactat tgatatgatc aaggccagaa ttcccagtga gtgaagaatt ttatagtact 5520 agaacaccat agaagatcgc taaacattac ctaatcatgg aacggctttt tgctctatta 5580 ttttaagtag atgctattga tctccctttg ctcctatccg caacccggtg cacgcgccct 5640 gttggttttc gaaaacctcg tagaagatta caaaccgtca atggcattcc caattactac 5700 cccacattgc acattcaagg gtctgattgt gctcctctct gtaatcttct cgccagtttg 5760 aaattatatt ttcccgaagt gaaagtaatt ttgatgagtg atagcgtagt gcagcccaac 5820 tgcatagtac agtagtttaa ccagaatctt taggtcagaa gataaaattt aaatgttgta 5880 ataaagaggc cattgccatt gctttgaaat tcaccctaca tctaatcaaa actactaaca 5940 acagcgatca attatcaaaa ttcatcgctc cctggaaaat ataattccaa gcccagggtt 6000 taacttgaaa ttagtaacat tttgtagcaa aactactaag aaaagtgcga agtattcaaa 6060 aaatgaatta gaaacctaat ttcaaatcag gttagatagg gtgtccgaa 6109 // ID Copia-1-LTR_HM repbase; DNA; INV; 187 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-187 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 441-441 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 187 BP; 54 A; 28 C; 25 G; 80 T; 0 other; tgttaagttt ttatgacacc atctttaagt tgtagttata tcggcaccgt atttttatgt 60 gccgtatatt ctttatgttt ttagtttcag ccacgaggct atgtttttat gtatatttta 120 agaatattta taaacactaa tatcgacttt tatttcctcc atcatgagtg aacacaaaat 180 attaaca 187 // ID Gypsy-75_CQ-LTR repbase; DNA; INV; 194 BP. XX AC AAWU01021168; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-75_CQ_; KW Gypsy-75_CQ-I; Gypsy-75_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-194 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 530-530 (2011). XX DR GenBank; AAWU01021168; Positions 11693 11886. XX SQ Sequence 194 BP; 54 A; 41 C; 38 G; 61 T; 0 other; tgttatatta tgcgtcattt gtgtaagacc cccaaacgta gtttgtgaac atcccctgaa 60 acgtcattca tgacagccat tcatcgggtc gctgattcag tgcttgagtg tgcaagagtg 120 aagacgtaca ataaatcgct ctgttttatt tagtaaaccg gtctcttctt tggctaatca 180 cacgtaataa aaca 194 // ID BEL-6_DPu-LTR repbase; DNA; INV; 268 BP. XX AC scaffold_290; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_DPu_; KW BEL-6_DPu-LTR; BEL-6_DPu-I. XX NM BEL-6_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-268 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 660-660 (2010). XX DR Genome; scaffold_290; Positions 49642 49909. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 268 BP; 66 A; 61 C; 42 G; 99 T; 0 other; tgtcggaaat ttccgaccct ctattttctt agtactgtaa cagtcagatt ttcacgtgaa 60 aggcatgtac tagaaactag tgcagggcct aaatatttct aagtattgtt ttcatatttt 120 tattcgtagt atttcaccaa ggtcttccgc gtatcatcaa ggatgctgcg ccagtttttc 180 ccaatttctc cccaaaacgt ttcatctctt ttctcatgtt cgaagcaact agcttcagat 240 tcttttacat cgtcattgac tactgtca 268 // ID Gypsy-231_AA-LTR repbase; DNA; INV; 231 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-231_AA_; KW Gypsy-231_AA-I; Gypsy-231_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-231 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1066-1066 (2011). XX DR [2] (Consensus) XX SQ Sequence 231 BP; 76 A; 52 C; 36 G; 67 T; 0 other; tgtagcatac ttaattttat aaataagttt tccgtgtgtc atcaaatgac agctctgaaa 60 atgtagttca tataacttgt atatgcgtac tcacacacat gagcatagac aaccataaat 120 aggtgatctg aaaatagaga gcccagttca gtctggtttc aacttacaac gagtaaagtt 180 cgtttcgact tcaccctcac tccgaaagtg ccccattcct caaactcaac a 231 // ID I-15_AAe repbase; DNA; INV; 4983 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-15_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4983 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1370-1370 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 100..1239 FT /product="I-15_AAe_1p" FT /translation="MADDNLRRQRQYPISSPGPFLVIAESEDGSNLAPSTL FT SKNVIRLMGDQFVKSLPQSKRRIKILMASAEAANTLVAAKLPNVSFSIPQR FT LVETLGVGHVELDVDDEELQYAISFDKTKAEQDNNPEVLEIRRITKRAGEG FT RERLTTVIVTFAGQTLPTHIELNRALYPIKQYVFPQRQCIKCWRFGHGEKN FT CRSKIRCNRCTETNITVEPEHVCETEEPKCVNCNGGHLANEVKKCPYAIRR FT KEADLNRNAAYSQGPKDWFSNLAIPTAKVPTIVIPPTTTFIPIENEDTNND FT HLAPGPSKRRRPSTCLDGEAIPALELHVESSIRSVVEEAFKSPEIIGAVDD FT IIGAPVEYKERAEENLRAAISEVVSQRVQTFMSSLRL" FT CDS 1246..4908 FT /product="I-15_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="NTPGTPVQSIPLQILQWNCRGAISKKANILHLINSLQ FT SSVVCLSETHLDNMTSFDISGFRVFRKDHQRNSMGVLVAARNELHPKALTL FT PSIDGIEWIGLQLHSCVGVMSIVSLYIHPHASVSQNQLEILIRCVPRPCVL FT TGDWNAKHPMWGNVRHNSRGNYLHAVLEDSDLVILNDGSITRFDATRTASA FT LDLTLVSAEISLMFNWEVIEDPNSSDHLPISCGSLELIVETRFPAINYRRM FT DWEKFNSNVESAILCVESEITYEQFFNILWSALVDSSPPPRHVGSLKVPQP FT FWDDELSAARQDSRSKFKVWRRTLEYEKYCEYVEADNEFKRLVKVKKRQSW FT KQLCDGFDSQTSVSSLWTLARRFKGRASGGNRQLRDEQQINIFLDKLAPAS FT TMENVINFPNCDCNIHNHSSCFTTLELEAVIQKGKDTAPGVDGISYSVIQR FT LPFIAKKSLIKIYNTIFETGVIPEVWNTFQVVPILKPGKPPSEANSFRPIA FT LASCLRKSYETMVKEKIEWYVENHKLLPDGICGFRKGKGTLDALHLLIDEV FT QTAFQNRQHVIACSIDIQGAYDNVQIHSLVAQMRSLGVAECLSMSVYALFK FT ERHLVVSLGDTSKIERTTWKGLPQGSPLSPLCFNLLIHGLFKPSDGAVIRV FT GFADDITIAVRGSNMLDSVNKSQEAINEMVHEINNLGLQVSPSKCTAIVFA FT KRMVDDVPQLRVEDQILEYSPSLKLLGLHLTPTLHWGKHFFYLKHRAGLFI FT NFMKSVAGQAWGADPGALLTIYKSCVRPILEYGSIFFRGASEKDSITLDRI FT QWNCIRIALGSTKTTHTGSLEVMAGLTPLDIRREAATMKFIERRYSIKSWN FT DKFVKTTIDGSASTWIRRSILQYTAYTGRQVETFEVLPCFEFELGMRQKVV FT TVDKSVSFELSXNSSLKANEVVRNLLEHRYPNASNLATDGSKTIDGTAYAV FT VNAFGQPISQIKLPHQVSVFHAELLALRKVTQIIAAMPRADYVVLTDSLSS FT LESLSNIKITSYHPSAWYEIKALVQSIEEKGSSITFLWVPSHQNIPLNEAV FT DGAAKEACQSGGSDYYRFTCLDIAYPTRTRSINLWQSAWNAGIKGRFCHSI FT LPIVSTQAWFNDFGFSRREIVLLSKLISNHSRLPSHLKRNNIVEDDTCQCG FT EGVASPDHLLFDCNIYEDWRRNVWIEIVGERAFPDLQLILRSRDMNVLKTI FT ANFFINCNIDM" XX SQ Sequence 4983 BP; 1497 A; 1072 C; 1110 G; 1303 T; 1 other; cagtgctgct ttagccttgg ttagcaacag acgtgtcgca ccaaagcgtg tatagcaaaa 60 taaatttcaa tagaaactac aacttctgtt gaactgagta tggctgacga taatctccgt 120 cgtcaaagac agtatcctat ttcatcgccg gggccattcc tggtaatcgc tgaatctgag 180 gatggctcaa atttagcacc atcgacgctt tcgaaaaacg tgattcgcct gatgggagat 240 caatttgtga aatcattacc acaatcgaaa cgccggatca aaatattaat ggcatcagct 300 gaagctgcga ataccctggt tgcggcaaag ttaccaaacg tttcgttctc gattccgcag 360 cggctagtgg agacgttagg tgttggccat gtggaactcg acgttgatga tgaagaattg 420 caatatgcca tatcattcga taagacaaaa gccgagcaag acaataaccc tgaagtgttg 480 gaaattcgac gtatcacgaa gcgcgcgggt gaaggtagag agcggttgac tactgttatc 540 gtcacgtttg cgggccaaac actacctacc cacatcgagc tcaatagagc gctctatcca 600 attaaacagt atgtgtttcc ccaacgccaa tgcatcaaat gttggcgttt cggacacgga 660 gagaaaaact gtaggagcaa aattcgatgc aacaggtgca ccgagactaa catcacggtt 720 gagccagaac atgtatgtga gacggaggaa ccgaaatgcg ttaattgcaa cggcggtcat 780 ctagcaaatg aagtcaagaa atgcccttat gctattcgcc gcaaggaagc cgatcttaac 840 aggaacgcag cttattccca aggacctaag gattggttct ccaacttagc aattcccact 900 gcaaaagtac caactatcgt tattcctcca actacaacct tcattccaat tgagaatgaa 960 gacaccaata atgaccatct tgccccagga ccaagtaagc gtagacgtcc atctacttgt 1020 ctggatggcg aagcaatccc agctttggaa cttcacgtgg aatccagcat tcggtctgtc 1080 gtggaagaag ccttcaaaag tccggaaatc attggagccg tcgacgacat catcggagcg 1140 ccggtggaat acaaagaaag agctgaagaa aatttgcggg ctgctatcag tgaagtggtg 1200 agccaaaggg tgcaaacctt tatgagctct ttacgtttat aataaaacac gccgggcacc 1260 cccgttcaaa gcatacctct acaaattcta cagtggaatt gtaggggagc gatcagcaaa 1320 aaggcaaaca tcttgcatct cattaacagc cttcaatcat cggtagtgtg tttatccgaa 1380 acacacctag acaacatgac cagcttcgac atttccggtt ttcgagtttt tcggaaggat 1440 caccagcgaa actctatggg ggtactcgtt gctgctagaa acgagttgca ccctaaggct 1500 ttaacattac cctcgatcga tggaattgaa tggatcggat tacaactcca cagctgtgta 1560 ggtgttatgt ccatcgtttc gctctatatt catccacatg catcggtatc ccagaatcaa 1620 ttagaaatat taattaggtg tgtaccacgc ccgtgtgtgc tgactggaga ttggaatgct 1680 aaacatccta tgtgggggaa tgttcggcat aactctcgtg gcaattatct gcacgctgtt 1740 ttagaggact ccgatttagt aatacttaat gatggtagta tcaccagatt cgacgctaca 1800 cgaaccgcta gtgcattgga cctcacttta gtgtctgcgg agatcagctt gatgttcaat 1860 tgggaggtga ttgaggatcc taacagtagt gatcatctac ccatctcctg tggatcgcta 1920 gaactgattg tagaaacacg gttccctgca atcaattaca gaagaatgga ctgggaaaaa 1980 ttcaactcga atgtggaaag tgcaatactt tgtgtggaaa gtgaaataac atacgaacaa 2040 ttcttcaata ttctatggag tgctttggta gattcttcac ctccccctcg tcatgttgga 2100 agtcttaagg tacctcaacc tttctgggac gatgaattga gcgcagcccg tcaggactct 2160 agatcgaagt tcaaggtctg gcgtcgtacc ctagaatatg aaaagtattg tgaatatgtg 2220 gaagcagata acgaattcaa aagactggtg aaagtgaaaa aaaggcaatc ttggaagcag 2280 ctttgtgacg gtttcgattc ccaaacttca gtcagctcac tttggacgct tgcacggaga 2340 ttcaagggcc gtgcatccgg tggaaatcgg caattacgtg atgaacaaca gattaacata 2400 tttctggaca aacttgcacc tgcatctaca atggaaaatg tgataaactt cccaaattgt 2460 gactgtaata tccataacca tagttcttgc ttcacaactt tggagctaga ggcggttatt 2520 cagaagggaa aggacactgc tccaggtgtc gatggtatat cttactcagt gattcagagg 2580 ttgccattca tagccaagaa atcattgatt aaaatttaca acaccatttt tgaaacaggt 2640 gttatccccg aggtctggaa cacctttcaa gtggttccga ttctgaagcc tggaaaacct 2700 ccatcagaag ctaactcttt ccgccccata gccttggcgt catgtctccg taaaagttat 2760 gagactatgg ttaaagaaaa aattgaatgg tatgttgaaa atcacaagct tcttcctgat 2820 ggaatttgcg gatttcggaa aggtaagggc accctagacg cactccatct gctcatcgat 2880 gaagtgcaaa cggcgtttca gaatagacaa cacgtgatag cctgtagtat cgacatccaa 2940 ggagcatacg acaacgttca aatacattct ctggtggcac aaatgagatc attaggagtt 3000 gctgagtgcc tatccatgtc ggtttatgca ctgtttaaag aacgacatct tgttgtgtct 3060 ctgggagata cgagtaaaat tgaacgcaca acatggaaag ggctgccgca aggatcccca 3120 ctgagcccat tatgcttcaa tctactgatt cacggactat tcaagccgtc tgatggtgct 3180 gttattcggg tcggctttgc tgatgatata acaattgcgg tacgtggttc gaatatgctg 3240 gatagtgtaa ataaatccca agaagcaatc aatgaaatgg tgcacgaaat aaacaacctt 3300 ggcttacaag tatcaccttc taagtgtaca gcgattgttt ttgcgaaaag aatggtagac 3360 gatgtgccgc agctgcgagt ggaagatcaa attttagagt acagtccatc gttgaagctg 3420 ctaggacttc acttaacacc tacattacac tggggaaaac atttcttcta tcttaaacat 3480 cgcgcgggac tgttcatcaa cttcatgaag tccgttgctg gccaagcatg gggtgccgat 3540 cctggcgctc ttcttaccat ctataaatct tgcgttagac cgatcttgga gtacgggtct 3600 atatttttcc gtggcgcctc cgagaaagat tcgattactt tagaccggat ccaatggaac 3660 tgcatacgta ttgcacttgg atctacgaaa acaactcata cgggttctct agaggtgatg 3720 gcaggtctaa cacctttaga tatcagaaga gaagctgcaa ctatgaaatt cattgaacga 3780 agatattcca tcaaatcttg gaatgataaa tttgtgaaga cgacaatcga tggttctgct 3840 tcaacctgga tccgtcgtag tattttacag tacacggctt atacaggacg tcaagtggaa 3900 acattcgaag tactgccatg ctttgagttc gaattaggca tgagacaaaa agtggtaact 3960 gttgacaaat cggtatcgtt cgaattatca agmaactcat ccttgaaagc taacgaagtc 4020 gtccgaaatc ttctcgaaca tcgatatccg aatgcatcaa atctggctac agacggttcc 4080 aaaacaattg atggaactgc ctatgcggta gtaaatgcgt ttggtcagcc aatcagccaa 4140 attaaacttc cgcatcaagt ttctgttttt catgcggagc tgcttgcact acggaaagtc 4200 acacaaatta tcgcagctat gccgagggcc gactatgtag tactcacgga tagcttaagt 4260 agccttgaat ctctttccaa cataaaaatc acctcttatc acccgtcagc gtggtatgaa 4320 attaaggcat tggttcaatc catcgaagaa aagggatcaa gcattacatt cctatgggta 4380 ccatctcatc aaaatattcc gttaaacgaa gcagttgacg gagcagcaaa agaagcatgc 4440 caatcgggtg gatcagatta ctatcggttc acatgtttgg acatagctta tccaactcgg 4500 acaagatcta tcaacttgtg gcaatcagct tggaatgcgg gcattaaagg cagattttgt 4560 cacagtattc ttccaatcgt ttcaacgcaa gcatggttca atgattttgg tttttcgcgg 4620 agagaaatcg ttcttctatc caaactgata agcaaccact ccagactgcc gtcacacctc 4680 aagcggaata acatcgttga agacgacact tgccagtgtg gcgaaggcgt agcttcgcca 4740 gatcatctcc tattcgactg caacatctac gaggattgga gacggaatgt ttggatcgaa 4800 atcgtcggtg aaagggcatt tccagatcta cagttaatac tacgtagcag agatatgaat 4860 gtacttaaaa caatagcaaa tttctttatt aattgtaata ttgatatgtg aattatttgt 4920 tgacaaaaca tggccaaaca atggaaacaa gaggccaaat aaataaataa atagaaaaaa 4980 aaa 4983 // ID Gypsy-258_AA-I repbase; DNA; INV; 5252 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-258_AA_; KW Gypsy-258_AA-LTR; Gypsy-258_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5252 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1117-1117 (2011). XX DR [1] (Consensus) XX CC Positions [4069-4530] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2638..5058 FT /product="Gypsy-258_AA-I_1p" FT /translation="MSKIDLEAAYYHFELHPDSRDITTFVARSGVYRFCRL FT MFGIKSAPELFQREMESLFRGIKGLIVYMDDILIHGETEEAHDRTLEEVMK FT RIKEMNLKINSQKSMFGVTELTFLGYRLSQDGIRPTDEKIEAIRDLQPPTS FT VSDLRSLLGLINFLGRFVPNLADLTFHMRQLLVKSNTFEWKVSHDQELNKL FT KELLGKVESLAFFDPLDETFLVTDASPVGLGAILIQMKNGSPRPVSCVSKS FT LTAYEKKYCQTEKECYAIIWAMEKLFVYLYGLHFTLITDCKPLEYLFNKAQ FT SKPSARIERWILRLQSFDFTVKYEPGVSNLADSLSRLSQINDSTEESADVL FT SWLSEEIKPSAMSIEEIERATLQDDELQNVKEALGTDDWDGVLAEFRTATV FT RDELSSYGDLVLRGDRIVVPKPLRAKVIEIAHLGHQGATAMKAQLRAKVWF FT PQMDKAVESAVRKCKPCLMTSIPDKPNPLARRIPTEPWQDLAIDFKEGLPD FT GISLLVVVCYTSRFVQVEPMKPATTQRVIGALLRMISCFGIPRSITADNGP FT QFRATEFSKFCDSYGIHLNLSTPYWPEQNGAVERQMRNIGKRLKISAIQDT FT DWKTDLYEYITLYHSTPQETTGLSPGQMMFGREIRNRIPSIHQPPKLRLEA FT AKDKDMMIKEYHQRHANQERHAKEHKLKRGDVVLMRDLNPGAMQPNFRQDE FT FEVTQVDKGAITVKSMDTGKVYLRNSSHLKKLKDSETVLNDTTEQYCFEGP FT VETESRDGDTYAGASGESTEQVDAATDGVTDKRELRAKRASKLPNKFQDYV FT MVVNE" XX SQ Sequence 5252 BP; 1598 A; 993 C; 1360 G; 1297 T; 4 other; atggaccgat gcacgagttc actaatttga cgtttgagcg gtgccgaatt cactcgttgc 60 catggtcacg taaataacac ggcaccgctc aaacgtcaaa aagtgaaccc gtgcataggt 120 ccattaagta cgagagctag agaagagcag acgtaacgcg cgtaagctcc tgtgtgatcg 180 gagtatcaaa caaatttaca atggcttcga ggatgataat tggtaagaat tgcgtgcaaa 240 aattgaggaa aaaaacattg aacgctttgt acagcgacta aactatacat tgtgaaatat 300 ggattaaatg aaaaatggac atcaaagaat taaattggtt ggaaatggac atcgaaagaa 360 tcggcactga aaaagatggc ggaggtgttg gaaaagaaca gtaaagtgga cgcaagtctt 420 tttttatttt tccatgcggc tgggtatgcg cgtgatagat gagaggaaaa gctgattttt 480 cattcagaaa aaaaaacgaa aaaatgacga tggttcgtat tggaagaaaa gaaagttttt 540 tttttttgct gaaaaattgc ttcgtgtttg agaaaagctt cgtgctagaa aatagcttcg 600 tgcttggaaa gaataaaaag cttcatgatt gtaggagaga aagcttcgtg ctaagaatgg 660 cttcgtgctg gaaagcttcg tgctaagaat ggcttcgtgc tgagaatggc ttcgtgctga 720 gaatgacttc gtgttaagaa tgacttcatg ctgagaatgg ctttgtgctg agaatggctt 780 cgtgctaaga atggcttcgt gctaagaatg gcttcgtgct aagaatggct tcgtgcctag 840 aatggctttg tgtttaaaaa cggcttagtg ctagaaatga cttaacgatc aataacgact 900 tcgtgtctta taccaaaaat ttgctgtcat aacaaaattg agatgtttgc tcgtagaatt 960 actaaaatga agtttttctt tctctctatg ttgttataga tgcggcagca atgccaccgt 1020 ttaaagcagg agatgatcca aggaataatt ggataaaatg gagaaaggct ctggaacgtt 1080 tcctacgagt caataaagtt gaggctgacg aagataaata tgatttatta ctcgttttgg 1140 gggcgattga gttgcaatca ttctatgaca agattacaaa atgggagctc cgtcgtccag 1200 tatctgataa cggcgaagaa tacattgtgt tgaagtatga atcggcaatt gcatcactgg 1260 atgcttattt tgcaccacaa ctaaacaagc gatttgagcg tcaccttttc cgtgctatga 1320 agcaggaaag tcacgaacct ttcgaggaat tcgtattccg attgaaggat caggcaaatc 1380 gctgccagtt tgttgacgtg gatgatgcca tcgtggacca gattatagaa ggctgtattt 1440 ctccggagct aaggaaaaag ttgctgaccg aggatctcag tttgcatgaa acaaccgtcg 1500 tgggaaaaac tttggaggaa gtgcagaaac aagcaaaaga gttctcgaaa ccgtctacat 1560 ctggattgtc gagcttagat ggagcagttg tacagaaaat caacgagcag tcttccggtt 1620 ctggcaagca agtaggaaga caagtaggaa atactcgaaa atgctacaat tgcaataagg 1680 tcggccacct ggcaaaagat tcgggaaaat gtccagcaag gaatgtaact tgtcattcat 1740 gtggaaccgt cggacacttc gccgtttgtt gcaggaaacg aaaatacgat tcatcagcaa 1800 cgtcttccac gacgtcgaat aaaaagacaa gagtccatgc aatcgtatca ccgcaggaca 1860 aacgagatgg tgtatttttt gtgcggacag gagacsaatt agacgaggta ctgttgcttg 1920 atctgggagg ggtcagagcg aaaatgctgg tggactcggg atctccggcg aacatcatca 1980 acaacgagac atacgtgcta ctgaaagcag aacgagcgat gatgatgaac gatagaacac 2040 ctcgacaaga agagctaagg ctgaaatctt ttgcgtcgga taaagagata cgattcagcg 2100 gagtattcga gatcgagata aagattccgg aggatgaaag cggaatttgg tcccatgttc 2160 tagttgcccc ggaaggccaa gttaatctct tgagcaaagg aacggcgttt gcgttaggag 2220 tccttaaaat cggatataag gtgaatcagg tatcgaagat ctgtgaagac gaattcccaa 2280 agattccgaa tgcgatgttg aaaattcagg tggacgaaac agttcatccc gttgtgcaac 2340 cggtgcgtag gttcccagtc gccatggaag cagatgtgga ggacgcgata aacgatttgg 2400 tccaaaagaa gatcgtwgaa agagctgaag ggcctttaag ttgggtgtca ccattggtcc 2460 ccgtcaggaa gaccgatgga agaattaggc tctgcgtgga tatgcgggcg gcaaatcgag 2520 cggttatgcg cgagaattac cctatgccaa acatcgatag cgcgatggcg acgatcacta 2580 aggtaagttt tcaattgttg ctgtaaaaaa agtactttaa atggttaggt cgtgaagatg 2640 tcgaaaatag acctggaagc cgcgtactac cactttgaat tacatccaga tagcagggac 2700 attacaacgt ttgtagctag gagtggtgtg taccgtttct gtcgtctcat gtttgggatc 2760 aaatctgccc cggagttgtt ccaacgtgag atggaaagtt tattccgtgg catcaaggga 2820 ctaattgtgt atatggatga tatactcatt cacggcgaga cggaagaggc tcatgatcgt 2880 acattagaag aggttatgaa gcgaatcaag gagatgaact tgaaaatcaa ctcgcaaaag 2940 tcaatgtttg gagtgaccga attgactttt ttaggatacc gtctttctca ggatggtatt 3000 cgccctacag atgagaaaat cgaagccata cgcgatttgc agccaccaac atcagtttct 3060 gatttgagat cgctgctagg attgattaat tttcttggta gattcgttcc aaatttggca 3120 gatctaactt tccacatgcg tcagctgctt gtcaaaagta atacgtttga atggaaagta 3180 tcgcatgatc aggagcttaa taagcttaag gaactgctgg ggaaggtgga atcgttggct 3240 ttctttgacc cgctagatga aacgtttctg gttaccgatg ctagtcctgt tggactaggc 3300 gccatactga tccagatgaa aaatggttct ccgagaccag tatcgtgcgt atcgaagagt 3360 ttgacagctt atgaaaaaaa gtactgtcaa actgaaaaag aatgttacgc gatcatttgg 3420 gctatggaga aattgttcgt gtacctttac ggactgcact ttacgctcat cactgactgt 3480 aagccgctgg agtacctctt caataaagcg caatctaaac catcagctag aattgagcgc 3540 tggattctgc gactgcaaag cttcgatttt acggtcaaat acgagcctgg ggtaagtaac 3600 cttgcggatt ctttgtcgag gctttcacag atcaacgaca gtacagagga aagtgctgat 3660 gtcctatctt ggttatccga ggaaatcaaa ccatcagcga tgtccatcga ggaaattgaa 3720 agagcgaccc tacaagatga cgagctacag aacgttaagg aagctctagg tacagatgat 3780 tgggatggag tactagccga gttcagaact gcgaccgttc gtgatgagtt aagttcgtac 3840 ggtgacctag tacttcgagg tgataggatc gttgtgccta aaccattgag agccaaagtg 3900 atcgaaattg cacacctggg acatcaagga gccacggcaa tgaaagctca gctacgggcc 3960 aaagtttggt ttccacagat ggacaaagca gtagagtcag ctgttcggaa gtgtaagcca 4020 tgcttaatga cgtccattcc tgataagcca aatcctttgg ctcgtcggat tccaacagaa 4080 ccttggcaag atctcgccat cgatttcaaa gaagggctgc cggatggtat atcgttatta 4140 gtagtggtat gttacacttc tcgattcgtc caagtagaac ccatgaagcc ggcgacaaca 4200 caacgagtga tcggagcatt gttaagaatg attagttgtt ttgggattcc gcggtcgatc 4260 acagcggata atggaccgca attcagggcg acagagtttt cgaaattttg cgacagctat 4320 gggattcatt tgaatctttc aacaccgtac tggccagagc aaaacggcgc agtggaacgc 4380 cagatgagga acattggcaa acgtttgaag atcagtgcwa tacaggatac ggactggaag 4440 accgatttat acgaatacat cactctgtac cattcaacgc cacaggagac caccggttta 4500 tcacctggac agatgatgtt tggtagagaa attcgtaatc gcattccttc aattcaccaa 4560 cctccaaaac tgmgattaga agccgcgaaa gacaaggata tgatgatcaa agagtatcat 4620 caaagacacg ctaaccaaga acgtcatgcg aaagagcata aattgaagag gggagacgta 4680 gtattgatgc gcgatctaaa tccgggagct atgcagccta actttaggca agacgagttt 4740 gaggtaacac aggtggataa aggagcaatc acagtgaaat caatggatac tggaaaagtt 4800 tacctacgga acagttcaca ccttaagaag ctgaaagact cagaaacggt gttgaatgat 4860 acgacagaac aatattgctt tgaaggacca gttgagactg aatcacggga tggagacaca 4920 tatgcaggcg cctcaggtga gtctactgaa caggtagatg cagccacaga tggtgtgact 4980 gacaagagag agctacgagc caaacgcgca agcaaactgc cgaataagtt tcaggattat 5040 gtaatggttg ttaacgagta ggtaatctaa gacttgatgt ggatgattgg aattcagcaa 5100 taaaagataa ttgatggaat aagagaggta ttatggcccg gcgcgcgcgc ggccggcgcg 5160 cgcgggggcg cgcgggcccg cccgcggccc cgggccccgc gcgccccggc ccgggccctg 5220 ggatttgttt tactgtctaa aaataggagg ga 5252 // ID Copia-8_CQ-LTR repbase; DNA; INV; 102 BP. XX AC AAWU01044740; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_CQ_; KW Copia-8_CQ-I; Copia-8_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-102 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 332-332 (2011). XX DR Genome; AAWU01044740; Positions 154 53. XX SQ Sequence 102 BP; 18 A; 34 C; 26 G; 24 T; 0 other; tgagtgtgca accccttgca acccgtgtgc cccactttag tgcgccgcca gtgcacgcgc 60 tagctgtcaa acgctgtttg gaggaagctt cacttcgcct ca 102 // ID Dpauli6 repbase; DNA; INV; 493 BP. XX AC GU229937; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mauritiana subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dpauli6. XX OS Drosophila paulistorum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-493 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229937; Positions 1 493. XX CC Clone Dpauli6. XX SQ Sequence 493 BP; 157 A; 128 C; 115 G; 93 T; 0 other; ttgggtgccc cacgaactga aaccaagaga cattgaaagg cgattatgca ttgctgaaca 60 actgcttgca agacaacaaa gaaagggttt ttttgcaccg aattgtgaca gggatgaaaa 120 gtggatccat tacaacaacg aaacccggcg caaatcttgg ggtaagcccg gtcacaaagc 180 agtatccact ccgaaaccca atttccatgg aaccaaggtt atgctctgtg tttggtggga 240 ccagctaggc ccgatccact acgaattgct gaaacagggc cagactatca acggggagct 300 ctaccgacaa caattgagcc gtctgagccg ggcactcaaa gaaaaaaggc cacaattcga 360 agaaaggcac gacaaagtca ttcttcagca agacaatgca agaccacaca ccagcagagt 420 ggtcaaagat tacctcaacg agctgaaatg ggagattttg ccccaccccg ctatactccc 480 tgatctcgcg cct 493 // ID BEL-1_AA-I repbase; DNA; INV; 5465 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_AA_; KW BEL-1_AA-LTR; BEL-1_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5465 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 851-851 (2011). XX DR [2] (Consensus) XX CC Positions [4485-5066] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 183..5465 FT /product="BEL-1_AA-I_1p" FT /translation="MGLDHTPKKTTVDPEQLQMLIHQRGQIKGKVTKINNC FT LEKAEDDPSQISASLLKVYGKKLEIHYSEYTEAHREVIAMIPPAKMEEQDE FT KLDEFDALHTEALNRMERLSEYFAKPVPATAVASGAGQVIVQHHPLKAPIP FT SFDGRVENWPKFKAMFEDLVVKSGDSDAIKLHHLDKALVGDAAGLINAKMI FT RDNNFAQVWKQLSEQFENKRVIVDTHIDGLLQLKTVAKGNFKDLLELTKSC FT ERHVAGLEYQGLVVDDLSGIIITKLVTSRLDDHTLQLWERKQEHGELPSYE FT DTLQFLKGECQILERYHNSRHPVAAKEPPSKQSKPVNQKVHTVTSPNSECQ FT CKVCGENHRHFECPIFTKMPIQERIIKVKQLRICFNCLRPGHCAKDCSSKR FT SCATCHKRHHSLLHEDVPSNSTETKVLPDKNSPNLAAEAGAVPKIVPQPNP FT NSSCSCNHTQTTKTVMLLTAVVNLESANGQFVPCRAMLDSGSQVCFVSESI FT ANRLMISREPVNVPVTGIGGAKIYVREKLTVTMQSRCSDFSTDIDCLVVPK FT VTGIIPSVKINTSSWPIPAGVQLADPTFHTPDKIDMLIGASKFFTLLKSGH FT IHLADGLPELYETQLGWVFSGEFDNVAANTVVSHPVSVNSLAETMSRFWEI FT EDVSQSTKDEGESDACEENFRCTHRRTSNGRYMVSLPFREDVASLQDNRSV FT ALRRFLMLERRFKRDPTLKLLYSDFIAEYEALGHCREVNESNDDPTKRRYY FT LPHHAVLRPTSSTTKLRVVFDASAKCNPSDKSLNEVLQVGGVVQSDLLSIL FT LRFRVHPVVFTADIAKMYRQILVSEEHTRFQRIFWRSESNQRLRVLELLTV FT TYGTAAAPFLATRCLVQLCHDEGNKYPIASNIIIEDCYVDDVLSGADTVEE FT AIEAQRQLEDLLHLGGFPIHKWSSNCQELLENIPEENREKLVRLDQTSTDE FT VIKTLGLTWSPNSDEFLFITKSPMKASETYTKRKVFSEIGRLFDPLGLVSP FT VIVVAKILMQKLWMSGLSWDETLEGDLLLSWMKFCDALHQMSQIKIPRRVF FT LCNAVAIEIHGFSDASISAYGAVLYVRTVFSDGRAEMRLLCSKSRVAPISE FT LSIPRKELLAAQLLSRLVGKVAGSMKVKFDDVTLWSDSQIVLAWLRKPLTS FT LQVFVRNRVAEVVSSTKDYNWKYVPTKENPADIVSRGCLPRALASNELWWN FT GPLFLQSANYNSEPPGPLSDEELPELKLTSFVSAIVYNADNLPVFAKFSSF FT RKLQRVIAYVKRFISNCKIKDPKLRILHCYLTIPELRQSMDVIVKVVQHEV FT LGDEISRIENNEPCKRITSLNPVYDNGILRVGGRLKNSSILTMAKHPYILP FT RHPIVDLLIRAYHLENMHVGPSSLLVILRGRFWLTEGRSAVRKITRSCVTC FT FRARPTMTNQLMGNLPACRVTPAHPFEITGVDYAGPVFVKQGRRKPVVEKA FT YISVFVCMVTRAVHLELVSDMTTEAFIAALQRFVSRRGLPREMHSDNGSNF FT RGAKAELNELYKLFRSQSAVDKIEGFCLSKEIAWYFIPPEAPNFGGLWEAA FT VKSAKYHLKRTLKDTNLTFEEYVTVLTQVEAILNSRPLYATTPDPDDPEVI FT TPGHFLIGRPITAIAEPSYHDIAINRLGRWQFLQKLRENFWKKWRNDYLQT FT MQQRTKDHVKKHNLLPGMIVLLEEQNLPPLSWKLGKIVRTYPGSDELVRTV FT DVLVDGTVYKRPASRIAVLPIEDNKQLLNLCSENASQPGGE" XX SQ Sequence 5465 BP; 1497 A; 1220 C; 1348 G; 1400 T; 0 other; ttggtccatc cgaaccggat ttgagccacg tcgtgcgtga gatgctgaat caaggaggat 60 gttggaagaa actagtgtgt gagaacgaaa aaagtgaact cgactaaaag tgaattggac 120 tagtagtgaa gaaagagacc tagtgcaagt gaaaaagtgt tgtgaaaagt gatttcggca 180 aaatgggact cgatcatacg ccgaagaaga ccactgtgga tcccgagcag ctgcaaatgc 240 tcatccacca gcgaggtcaa ataaaaggaa aagtgacgaa gataaacaat tgtcttgaaa 300 aagccgaaga cgatccgtcg caaatcagtg catcattgtt gaaagtgtac ggtaagaagt 360 tggagattca ttattccgag tacactgaag cccaccgtga agtgattgcc atgattccac 420 cggcgaaaat ggaagagcag gatgaaaaac tcgacgaatt cgatgctttg catacggagg 480 cgctcaaccg tatggagcgt ctttcggagt attttgcgaa gcctgtgcct gcaacagctg 540 tagctagtgg agcaggtcaa gtgattgtgc agcatcatcc tctcaaggca ccgattccat 600 cgttcgacgg ccgtgtggaa aattggccca aattcaaagc catgttcgag gacctagttg 660 tgaagagtgg tgattcggat gcaataaagt tgcatcactt ggataaggcc ttagtaggag 720 acgcagccgg cctcatcaac gcgaaaatga tccgagataa caactttgcc caagtgtgga 780 agcagttgag tgagcagttt gaaaacaagc gagtcatcgt ggacacccat atcgatggtt 840 tgctgcagtt gaaaacggtg gcaaagggca acttcaagga tctgctggag ctgacgaagt 900 cgtgtgaacg tcatgtcgcc ggcctggaat accagggtct ggtggtggac gacctgtccg 960 gaatcatcat tacgaagctg gtgacctccc gcttggacga tcatactcta caactgtggg 1020 agcgaaagca agagcatggt gagttgccca gttacgagga taccttgcag tttttgaaag 1080 gtgagtgcca aattctagag cgttatcata attctcgcca tcctgttgcg gcgaaagagc 1140 ctccatcgaa acaatccaaa cctgtgaatc aaaaagtgca tactgttacg tcccccaatt 1200 cagaatgtca atgtaaggtt tgtggtgaaa accaccgtca ttttgagtgt cccattttca 1260 ctaaaatgcc tatccaagag cgaattatta aagtgaagca acttagaatt tgcttcaatt 1320 gccttcgacc aggccattgc gccaaagatt gttcttccaa gcgtagttgt gcgacatgtc 1380 acaagcgaca ccattccctc ctccatgaag atgtaccatc gaactccact gagaccaagg 1440 tcctgcccga taagaattcc cctaaccttg cagcagaagc tggggcagtt cccaaaattg 1500 ttccgcaacc caaccccaat tcttcgtgct cttgcaacca tacgcagacc acgaagactg 1560 tgatgctatt gactgctgtg gtgaatctag agagtgctaa tggtcagttc gttccttgtc 1620 gagcaatgct agatagtggt tcccaagtct gttttgtgtc tgagtcgatt gccaaccgtt 1680 tgatgatatc ccgagagccg gtgaatgttc ctgttaccgg aattggtgga gcgaagatct 1740 acgtaagaga gaagctgacg gtgacaatgc agtccaggtg ttctgatttc tccacggata 1800 ttgattgcct tgtggtgcct aaagtgactg gtatcattcc ttccgtcaaa atcaatacct 1860 cgtcttggcc aattcccgca ggcgtccagc tcgctgaccc caccttccat actcctgaca 1920 aaatcgacat gctgatcggt gcctcgaagt ttttcacctt gttgaagtct ggtcatattc 1980 atctagccga tggccttccc gagctctacg aaactcaact gggttgggtt ttctctggag 2040 aattcgataa cgttgctgcc aacaccgtgg tatcccatcc agtatcagtt aattccctcg 2100 ctgaaacaat gagtcggttt tgggaaattg aagacgtttc gcagtccacg aaagacgaag 2160 gagagtcgga tgcatgcgag gagaactttc gttgtacaca tcgtcgtact tcaaacgggc 2220 gatacatggt ttcactccct ttccgagagg atgttgcttc actccaggac aaccgatccg 2280 tcgctttgcg ccgtttcctc atgctcgaaa ggcggttcaa aagggatcca accttgaaac 2340 tattgtattc tgatttcatt gctgagtacg aggcgttggg ccattgccgg gaagtcaatg 2400 agtccaatga tgatcctaca aaaaggcgtt attatttacc acaccatgcg gtactccggc 2460 cgaccagctc gacgacaaaa ttgagagtgg tttttgacgc ttcagcgaag tgcaatccgt 2520 cggataaatc actcaacgag gtactacaag tcggaggtgt tgtccaaagc gacctgttga 2580 gcattctact tcgatttagg gtgcatcctg tggtgttcac cgctgatata gccaaaatgt 2640 atcggcaaat actggtttcc gaagaacaca cccggtttca acgaatcttt tggcgatcgg 2700 agtcaaatca acgactacga gttctggagc tgttaacagt gacatatgga acggcagcgg 2760 cgccattcct ggctacgaga tgtctggttc agctttgtca cgatgaaggt aacaaatatc 2820 ctattgcgtc caacattatc atcgaagatt gttacgtgga tgatgtactc tcgggtgcag 2880 atactgttga ggaagctatt gaagcacagc gtcaattgga agatttgctt catctaggag 2940 gttttcccat tcataagtgg agctcaaatt gtcaggaact gctggaaaat attcctgagg 3000 aaaatcgaga aaagctagtg cgtttggatc agacatcaac cgacgaggta atcaagacac 3060 ttggacttac ctggagccca aattcagatg aatttttatt cattacgaag tccccaatga 3120 aagcgtctga aacatacacg aagcggaaag ttttctcgga gattggaaga ctttttgatc 3180 cattgggttt ggtctctccc gtcattgttg tagccaaaat cctaatgcaa aaattgtgga 3240 tgtccggcct ttcctgggat gaaactcttg aaggagattt acttctttct tggatgaagt 3300 tttgcgatgc tctgcaccaa atgagccaaa taaaaattcc tcgacgtgtt tttctgtgca 3360 acgccgttgc catagagatc catgggtttt cagacgcctc gatttctgcg tacggtgcag 3420 ttttgtacgt gaggaccgtc ttttcagatg gtagagctga aatgcgtttg ctatgcagca 3480 aatcgagagt tgctccaata agcgaattga gcattccaag aaaagagttg ctagctgctc 3540 aacttttatc acgactggtt ggcaaggtag ctggatcaat gaaggtcaaa ttcgatgacg 3600 taactctttg gtctgacagc caaatagtgc ttgcctggct ccgtaaacct ttaacgtcac 3660 tccaggtgtt tgtaaggaac cgagttgcgg aggtcgtttc aagtaccaag gattacaatt 3720 ggaaatatgt ccccacaaag gagaatccag cagatattgt gtcgagagga tgcctgccta 3780 gagccttggc gtctaatgag ttgtggtgga acggccctct attcctgcaa tcagcaaatt 3840 acaattccga accaccagga ccgctatccg atgaagagct gcctgagcta aaattgacat 3900 cgtttgtttc tgcaatcgtg tacaatgcag ataaccttcc cgtattcgcc aagttcagct 3960 catttcggaa actccagaga gtgattgctt atgtaaaacg gttcatctcg aactgtaaaa 4020 ttaaggatcc gaagcttcgt attctgcatt gctacctcac cattcccgaa ttacggcaat 4080 caatggacgt gatcgtcaag gttgtgcaac atgaggtttt gggagatgaa atcagccgta 4140 ttgagaacaa cgagccgtgt aaaaggataa cttcattaaa ccctgtgtac gacaatggaa 4200 tattgagggt aggtggaagg ttgaaaaatt cgtccatatt gaccatggct aaacatccgt 4260 acatcttacc gaggcaccct attgtcgatc ttctgataag agcttatcat ctcgagaaca 4320 tgcacgttgg gccgtcgagt ttgttggtga tacttcgagg tcgattttgg ttgacagaag 4380 gaaggtctgc cgtacgaaaa atcacaagga gctgcgtaac ttgtttccgt gccagaccaa 4440 ctatgacaaa tcagctgatg ggaaacctgc ccgcctgccg tgtaactcct gctcatccct 4500 ttgaaataac aggagtcgac tatgcgggtc ctgtcttcgt gaaacaaggc cggcgaaaac 4560 cagtagtgga gaaggcatat atatccgtat ttgtgtgtat ggtgacacga gccgtacact 4620 tggagttagt ctcagacatg accacggaag cattcattgc tgcattgcaa cgttttgtca 4680 gcagaagagg actgccacga gaaatgcact cagataatgg atcgaacttt cgcggggcaa 4740 aggcggaact caatgagctg tacaaactat ttcggtctca atctgctgtt gacaaaatcg 4800 agggtttttg cttgtcgaag gaaatcgctt ggtattttat tccccctgag gcgccaaatt 4860 tcggaggcct ctgggaggca gcagttaaga gtgctaagta ccacttgaaa cgcactctga 4920 aggacacaaa tttgacattc gaagaatatg taacagtttt gacacaggtt gaggctatat 4980 taaactctag accgctctat gctaccacac ctgatcccga tgatcccgag gtgatcactc 5040 ccggtcattt tttgatcggg cgtccaataa ctgcaatcgc ggagcctagc taccacgaca 5100 tcgccataaa tcgacttgga cgttggcagt tcctgcaaaa actacgggag aacttctgga 5160 aaaagtggag gaatgactac ctgcaaacta tgcaacaacg taccaaggac cacgttaaga 5220 agcataatct gctgccaggg atgattgtgc ttctggaaga gcaaaatttg ccaccgttga 5280 gttggaaatt ggggaaaatc gtgcgaacct atccaggatc cgatgaactg gttcgtacag 5340 tcgacgtact ggttgatggc actgtgtaca agcgtccagc ttcacgaatc gcagtgcttc 5400 caattgagga caataaacag ctgctgaatc tttgttccga gaatgcttct cagcccgggg 5460 gagaa 5465 // ID Tx1-13_BF repbase; DNA; INV; 5705 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-13_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-13_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5705 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5705 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 850-850 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 195..2060 FT /product="Tx1-13_BF_1p" FT /note="ORF1p." FT /translation="MATPKSDGNDMFQKSFSIKFEEETDVRRILESFHEMA FT GIDVKSDLHNVWLKGRNTFEVTTNTMGKKNELMLSLKNWYPHVEVQGYGMG FT TVVSLFRVPSYITEGTVQAVLRQYGKVMSVKYPTYPPPYDNIQKGIRQYRM FT VIEKPIPNALRINNNHIWVKYEGQKQVCLKCNQEGHFASDCDQIRCFKCNE FT IGHTIKDCTNEIKCNVCGKSGHTNRECAASYARALAKPSNRWTRESVVNFE FT EMSGGVSVSDSDSEGEELTDEEDESDDLVNTGCNACKEKSGFPLHVKKVCH FT RCDKESPSEDVHLITCKKCSEQDTFYCGECFHKRRTVTSESEISIYLSTSQ FT DELDPEETTPSPETQTSSEETQLPSQEVVSEADSQRKQASSQKAQPPTRKG FT KVPSTSTPSQVKSSKTSSGQEKTVQSKTPLSQDKPTPSQDIVPPSQDPTTE FT PTKIARKTSLSQDKSGQDLASPSQDPVQGPSNTPETIWTTVVSKRNRTSDS FT LPQPAAKKATPKSSVSSSQKSSLERIIAQKRAYEGSRLMRNNGFPMACAHF FT GYKCPKEELLWSSLSLHLSNTHGLKSPTFACPVTGCAIRSPSPAKWLTHLI FT CEHVEYVEGTTPKALERWFSKKM*" FT CDS 2104..5616 FT /product="Tx1-13_BF_2p" FT /note="endonuclease and RT." FT /translation="MASLLLLLISQMALNIATFNVNGLRNEQKRTQAFNLF FT KEKRLDIICLQECHVGDKSEQLKWTKQWGGRALWNPGTQTSCGVAILFSPC FT SDWSVSDINKDIDGRIISCKISCRETAFKLCNIYAPVLNSARRKFLSDLLE FT YISTRNNGNNVILCGDFNFVEDLTWDKKGGNPTQGNIGNKEIKDICSRFDL FT EDKWRVLNPNTFRFTWRDKKCTIQCRLDRFYLPQRLAVSLFTYLPVPFSDH FT DGVSFSVKITKNIVRGEGLWRCNNSVIATQTFQLDFDKHFDFWVLLIPYFP FT DLGQWWDNVKGKIRQLIVSHAKRMALEKRQNRDRLMERIDFLTITIAQQPN FT PDLAKELARVKQELFDLKRQENEGAKVRSRVQWEEQGEKPTKFFLSREISR FT GKAKVIEEIRDEKGTIIKDQKGISKVFESFYENLYANEDLDEEAQKELLNQ FT LSNTIPEELQVQLEQPLTSEDYLLALKSFSNGKCPGEDGLTKEFYIMFWDK FT LKNYLPKILNEGLNRGVLSESQKGSVITLLNKAGDPLEVKNKRPISLLNVD FT YKILSKAITNRLKKVLKYVIHEDQTCAVPGRSITDNLLDFRNIIDFVNHKN FT LEGAIISLDQQKAFDRVNHQFLDKVLTKMGFGHVFRGHIQSLYKNAYCKIM FT VNGHLSNKIAFSRGVRQGCSLSPLLYVISLEPLACLIRNNKDIRGVQLPNG FT TEKKLSMFADDSSALITTDDSIGRLFEDIKTYELGAGAKLNKGKSEALWLG FT KWRGRGDGPIQLKQWTNSKLKMVGGFFGNGNLAKTNWKHRISGFKDKLVKW FT EDSTLSFEGRKVVINTVLIPTLWYIAPVFPLPKVCEQEIQSLIFKFLWGGK FT TEQVKRSVLYLPKMKGGLEIVNVAEKAKAMFALTFKHLVQKESAAWAVIGR FT YWLGLFLNRLGLYQWSNLVPHCVEWPKTQQHLKEVVEGYNSLDSTLDWSQT FT TLKCLYSLSEGTWAGDAAVVLRQPKKNWSKIWAVCHNEILPNVLKDLNWKI FT VHNIIKTKSTLRFWKLIKDNKCIWPKCNQSETLQHVFFECEKAEQCWSWLE FT GFIRRWIAHDFRIEQSFATLDNVDVLDKISKSKNDIVIYLSSSLRQRLWKE FT RCELLYDKRLSSPQVTAVNVKTLLKNKITVEFHKLPESEFHRNWVKGFKWL FT KISNGKLTFRF*" XX SQ Sequence 5705 BP; 1852 A; 1207 C; 1293 G; 1353 T; 0 other; ggtggaactc cctttgggat cactccaccc gccaatagca aagagcaaca cctcgacgca 60 cacacttgac gggaaaaatc cgggccggag tataacggaa acgttagcag gtccggagaa 120 gagtactatc aagacacttt ctacagggat tccctatttg ttggtgggaa ctaaattgcc 180 cccttgtaat aaccatggcg accccaaaga gcgacggaaa tgatatgttc cagaagagct 240 tcagtattaa attcgaagaa gaaacggatg taaggaggat tttggaaagc ttccacgaaa 300 tggcaggtat agacgtcaag tctgacctac ataacgtttg gctaaaagga agaaacacct 360 tcgaagtaac aacgaacacg atgggaaaaa agaatgagtt aatgttaagt ctaaagaatt 420 ggtatccaca tgtagaggtc caggggtacg gcatgggcac ggtggtgtcc ctgttccggg 480 tgccctcgta tatcacagaa ggaacagtgc aagcagtgtt aaggcagtac ggtaaggtga 540 tgtcagtaaa atacccgacc tacccaccac cctatgacaa cattcagaaa ggtatcagac 600 agtatcgcat ggtcattgaa aagccaattc ctaatgccct gagaatcaac aataaccata 660 tctgggtaaa atatgaaggt caaaaacaag tatgcctaaa gtgcaatcag gagggtcact 720 ttgcctccga ttgcgaccag attagatgtt ttaagtgtaa tgaaattggt cacaccatta 780 aagactgcac aaacgagatc aagtgtaatg tctgcggcaa gtccggacac accaacaggg 840 agtgtgcagc ttcctacgcg agagccctag ctaaacccag taataggtgg acgcgggaaa 900 gcgtagtgaa ctttgaagaa atgtctgggg gggtttcagt gtcagactca gactcagaag 960 gagaagaact taccgatgag gaagatgagt ccgatgactt agttaatacg ggatgtaacg 1020 cttgtaaaga aaagtctggc ttccccctac acgtgaagaa ggtatgccat aggtgtgata 1080 aggagtcccc ctcagaggat gtgcacctta tcacatgtaa aaagtgctca gaacaggaca 1140 ctttctactg tggagagtgt ttccataaaa gaagaacggt aacatcggaa agtgagattt 1200 caatctacct atccacgagt caggatgagt tagatccaga agaaacaacc cctagcccag 1260 aaactcaaac ctctagtgaa gaaacacagc tccctagtca ggaagttgtt tctgaggctg 1320 atagccaaag aaagcaggct tctagtcaga aggcacagcc tcctactagg aaggggaaag 1380 ttcccagtac atcaactcca agtcaagtca aatcaagcaa aacatcttct ggacaggaaa 1440 aaacagtcca aagtaaaaca ccccttagtc aggacaaacc cactcctagt caggacatag 1500 tgcctcctag tcaggatcca acaacagaac ctactaaaat agctaggaaa acatccctaa 1560 gccaagacaa atccggtcag gatctagcgt ctcctagcca ggacccagta cagggtccta 1620 gtaacacccc tgagacgatt tggacaactg tagtgtcaaa aagaaacaga acctcagact 1680 cgcttccgca accggcagca aaaaaagcga cgcctaaatc ttccgtatcg tcttcgcaaa 1740 agtcttccct agagagaata atagctcaga aaagagccta cgaagggtct cgactgatga 1800 gaaacaacgg ttttcccatg gcttgtgccc attttgggta caagtgcccc aaagaggaat 1860 tgttgtggag ttccctaagc ttacatctga gtaacaccca cggcttgaag tctccaacct 1920 tcgcgtgccc ggtgactggc tgcgcgatcc gatcaccctc gcccgccaaa tggctaacgc 1980 atctaatatg cgaacacgtc gagtacgtag aaggaacgac accgaaagcc ttagaacgtt 2040 ggttctccaa gaaaatgtag acccccaatt cgtaaatgtt cccccattga ataacaccca 2100 ctcatggctt ccctgttact tcttctcatt agccaaatgg ctctaaacat cgctaccttc 2160 aatgttaatg gtttaagaaa cgagcaaaag cgtacacaag cttttaatct tttcaaggaa 2220 aaacgtttgg atataatctg tcttcaagaa tgtcatgttg gtgacaaatc ggaacaatta 2280 aaatggacaa aacagtgggg agggagggcc ctatggaacc caggaactca gacttcctgt 2340 ggagtagcta tcttgttttc cccttgttct gattggtcag tgtcggatat caacaaagat 2400 atagatggca gaattatatc gtgcaaaatt tcttgccgag aaacggcctt taagttatgt 2460 aatatatatg ccccagtcct gaattctgcc aggagaaagt ttttgtctga cctattagaa 2520 tacatttcca caaggaacaa cggcaacaac gttatccttt gtggagactt taacttcgta 2580 gaagacttga cgtgggataa aaaggggggg aaccctaccc aaggaaacat cggtaataaa 2640 gaaattaaag acatctgttc acgttttgac cttgaagata aatggagggt tttgaatccc 2700 aacacctttc gttttacctg gagggataaa aagtgcacga tacaatgcag actggataga 2760 ttttacttac cccaaagatt ggccgttagc ctctttactt atctcccagt ccccttttcg 2820 gaccatgacg gtgtctcttt ctctgttaaa ataacaaaaa acatagtaag gggggagggt 2880 ctgtggaggt gcaacaactc tgttatagct acccaaactt tccaattgga ctttgacaag 2940 cattttgact tctgggtctt gctcatcccg tacttccctg acctaggtca gtggtgggac 3000 aacgtgaagg gcaagatcag acagctgatt gtaagtcatg ccaagaggat ggccctagaa 3060 aagagacaaa atagagaccg tttaatggaa cgtatcgatt tccttactat taccatagca 3120 cagcagccaa atcccgattt agcaaaggaa ttggcccggg taaaacaaga gctttttgac 3180 ctaaaaagac aggaaaacga gggggccaaa gtccgttcgc gagtgcaatg ggaggaacaa 3240 ggagaaaaac ctacaaaatt ctttctgtcc cgagaaatat ccaggggaaa agctaaggtt 3300 atagaagaaa taagagacga gaaaggaacg attataaaag atcaaaaagg cataagtaaa 3360 gttttcgaat ccttttacga aaacctatac gcaaacgaag atctagatga ggaagcccag 3420 aaggaacttc tgaatcagtt atctaatact atacctgaag aactccaagt ccaactagaa 3480 cagcctctaa catcagaaga ttatctattg gccctaaaga gttttagtaa cggaaaatgc 3540 cctggggagg atgggttgac caaagaattt tatattatgt tttgggataa attaaagaac 3600 tacttaccta agattctaaa cgaaggccta aacagggggg tcctatctga gtcccaaaaa 3660 ggcagcgtca tcaccctgct caataaggcg ggtgacccgc tagaagtaaa gaacaaaaga 3720 cccatttctc tacttaatgt cgactataaa atcctctcaa aagctatcac gaacaggcta 3780 aagaaagtcc tcaaatatgt aatccatgag gatcagacat gtgccgtacc tggtaggtcg 3840 atcaccgaca acttgttgga ctttagaaac atcatcgact ttgtaaatca caaaaatcta 3900 gaaggtgcca tcatatctct agatcaacaa aaagcctttg atagagtgaa ccaccaattt 3960 ttggataaag tgctcaccaa aatgggattt ggacatgtct ttagaggaca cattcagagt 4020 ctatataaaa acgcctattg taaaattatg gtaaacggac acttatcaaa taaaatcgcg 4080 tttagcaggg gagtaagaca ggggtgttcc ctgtctccct tgctatacgt tatctcccta 4140 gaaccgttag cttgtctaat aagaaataac aaagacataa ggggggtcca actaccaaat 4200 gggacagaga aaaaactcag tatgtttgcg gacgattcca gtgccttaat tactacggac 4260 gattccattg gacgtctttt tgaggatatc aaaacatatg aactgggtgc gggtgcaaaa 4320 ctcaacaaag gaaagtctga agccttgtgg cttggtaaat ggaggggtag gggggatggc 4380 ccaatacaat taaaacaatg gacaaactca aaactaaaga tggttggggg tttctttggg 4440 aatggcaact tagccaaaac caactggaaa cacagaattt cgggtttcaa agacaagttg 4500 gttaaatggg aggacagcac cctgtccttc gaaggaagga aagtggttat caacaccgtg 4560 ctaataccga cgctatggta catagccccg gtatttccct taccaaaagt gtgcgaacag 4620 gaaattcaat cactcatctt taagttcctg tgggggggta aaacggaaca agttaagaga 4680 tcggttttgt atcttccaaa gatgaaaggt ggattggaga tagttaatgt cgcggaaaag 4740 gctaaggcca tgttcgcctt gacctttaaa catctggtcc aaaaagaaag tgctgcttgg 4800 gcggttatag gcagatactg gctcggtctg ttcttgaata gactaggcct gtatcagtgg 4860 agtaacctgg ttccacactg tgtggaatgg cctaagacgc aacagcactt aaaggaggtg 4920 gtggaagggt ataatagctt agattccact ttagactggt cacaaacaac tttaaaatgc 4980 ttgtactcct tgagtgaggg cacttgggcc ggggatgccg ccgtcgtcct tcgccagccg 5040 aaaaaaaatt ggtcaaagat ctgggcggtc tgtcataatg aaattctccc taacgttttg 5100 aaagacctaa attggaaaat agttcataat ataatcaaga caaaaagcac tctgagattt 5160 tggaagttaa tcaaagacaa taagtgtatt tggcccaaat gtaaccaatc agaaacactt 5220 caacacgtat tcttcgaatg cgaaaaagcc gagcaatgct ggtcttggct agaaggtttc 5280 atacgaaggt ggatcgcgca cgattttcga atcgaacaat catttgcaac actggataac 5340 gtggatgtac tcgacaaaat ttctaagtca aaaaatgaca ttgtaatcta tctgtcgtct 5400 tccctgaggc agagattgtg gaaagagcgg tgtgaactct tatatgataa gagactctct 5460 tcgccacagg taacggcagt aaacgtcaaa acccttttga aaaacaagat aacggtagag 5520 ttccacaaac ttccagaaag cgagttccat cgtaattggg ttaagggatt caagtggtta 5580 aaaatttcta atggaaaact tacctttcgt ttctaatacc cctccaggtc ctggggtcgg 5640 tttgggatgt agtcctcccg tcgtccacca tggattccag caccgagtct tgacaaatct 5700 accct 5705 // ID Gypsy-11_AA-I repbase; DNA; INV; 4341 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_AA_; KW Gypsy-11_AA-LTR; Gypsy-11_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4341 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 991-991 (2011). XX DR [2] (Consensus) XX CC Positions [3276-3758] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 162..1910 FT /product="Gypsy-11_AA-I_1p" FT /translation="MATADAKMNPSISSAGVQAGTPKTLNFGVFENYVAGD FT DFEVYEERMTQHFLLHDVPEERKVAFLLTHLGMDTYAILKKLLQPVNPSTK FT RYEELVLTLKRHFRPEVNKVSERYRFHQADQKAGQSVTEYVVELKALVEKC FT EYGEFLKEALRDRFVFGIFDGRLRTHLLKQKDVSFDKAVEEALTWELAEKD FT NKVREGNFSAHVVRSNKPWRARSKSRSRFHDDKKMQPRRNKLCDKCGREHE FT PGKCPAKNWKCYSCGKQGHAASMCYAKVSKQNRSSSQEPRKIQTVGTVGSG FT DELALELANLRMQLNSLQDKSFLLERSGVMVETLFVEGQPVDFEVDSGACA FT TVISSSLYRKQFSHLQLFGVRNDFSTVTGEGLKIIGGISAQVSKDRSGPSK FT KLVLVVIESEKSFRPLLGRTWMDVLWPSWRSSLKQGNVSINSFDRSIFDSI FT RVKYPNTVSDVNSPIRDFEAEIVIESNVTPIFHAAYSPPFQQRPAIEAELN FT RLCDENILKRVQHSRWASPIVVVPKANGKLRLCIDCKVTINPFLRSEHYPL FT PRIDDLFAKLANCKVFCVIDLRGAYQQLRVSENSQQ" FT CDS 3126..4298 FT /product="Gypsy-11_AA-I_2p" FT /translation="MLHANHDGIVRGKMLGRSLFWWKNMQGDIENYFKNCE FT ICDQRRNVPTEKVTSKWPKTSCPMDRIHIDFFAFEGKTFLILVDSFSKFIE FT AKLMSSTNADQVNEKLEDFFKFFGLPKEIVSDNGPPFNSFDFVNHWETRNV FT KVTKSPVYHPQSNGTAERGVQTVKNFLKKRLLDQQLNSKNLSRKLNEILAW FT YNSTPTTVTSLSPSELILRYRPRTVLTGINPKANSSVEQSNRNTRQVRFDE FT SKNETIVYDAGNRKIATDKEIPEKQFKKGERVMYRNHLKEVVRWIPVVVID FT KIGKFLYKIKFIENGNIRIVHVNQLRHKNSFIESNLTKLPEKFVDQSRCKR FT RRSDSGNGIHAPTPKYLKRRSESMSDELMLRRSARESRPPVRFGFDEL" XX SQ Sequence 4341 BP; 1314 A; 737 C; 1115 G; 1172 T; 3 other; tagtaaattt tggcgacgag tttgtggaag cagtgagtgc tgtcgtgaag cgcaatagtc 60 gaacacgtgg tgtgacgcgt acggaagata gtagcggcgt tagaagaacg atcgttagtt 120 cggcgcgagt gaagtgaaca cgtgattggt ttgaggttag gatggctaca gctgatgcga 180 agatgaatcc gagtatttcg agtgccggag tgcaagcggg aacaccgaaa acgttaaatt 240 tcggagtgtt cgagaattac gtcgcgggtg atgatttcga agtgtacgaa gaacggatga 300 cccagcattt tctccttcat gatgtgccgg aagagagaaa ggtggcattt ttgctcacac 360 atttgggaat ggacacgtac gcaatcctga aaaagttgct tcaaccagtg aatccgagta 420 cgaaacggta tgaagagcta gtgttgacct tgaaacggca tttcaggcca gaagtgaaca 480 aagtgtccga acgttatcgc tttcaccaag ccgaccagaa agctggacaa tcggtgactg 540 aatatgtagt ggaattgaaa gcgttggtag aaaagtgcga gtatggtgaa tttttgaaag 600 aagcgttgcg ggacagattt gtgttcggga ttttcgacgg taggttacgt acacacctgc 660 tgaaacaaaa agatgtgtcc ttcgacaagg ctgttgaaga ggcattgacg tgggagttgg 720 ccgaaaaaga caacaaagtg cgcgaaggta actttagtgc ccacgtggtg agatcgaaca 780 aaccgtggcg agcaagaagc aaaagtcgat cgcggtttca tgatgacaag aagatgcagc 840 cgagaagaaa caaattgtgc gataagtgcg gtcgtgaaca cgagccagga aagtgtccgg 900 ccaagaactg gaagtgctat tcatgcggaa aacagggcca tgcggcgagc atgtgctatg 960 cgaaagtgtc gaagcagaat cgtagttcga gtcaggagcc ccggaagatc caaaccgtgg 1020 ggacggttgg ttccggagac gagttggcat tggagttagc caatttgcgg atgcagctga 1080 attctctcca ggataagtcg tttttgttag aacgtagtgg tgtgatggta gaaaccttgt 1140 ttgttgaggg ccagccagta gattttgagg tagatagtgg tgcttgtgca acagtaatca 1200 gtagtagctt gtatcggaag caattttctc atttgcaatt gtttggtgtg aggaatgatt 1260 tttcaaccgt aaccggtgaa gggttaaaaa ttatcggtgg aatcagtgca caggtgtcta 1320 aggatagaag tggtccgagc aaaaaattag tgcttgtggt gatcgagagc gaaaaatcct 1380 tcagaccgct tcttggacgg acctggatgg acgtactatg gccaagttgg agatcgtcgc 1440 ttaagcaagg taatgtgagt ataaactctt ttgatcgctc aatttttgat tcaatacgcg 1500 ttaagtaccc caataccgtc tcagatgtaa attctcccat ccgtgatttc gaagccgaga 1560 tcgttattga aagcaatgtg actcctattt ttcatgcggc atactctcct ccgttccaac 1620 agcgaccagc gatagaagct gagctgaatc gtctctgtga tgagaatatt ttgaagagag 1680 tgcaacacag tagatgggcc tctccgattg tagttgttcc aaaggcaaat ggcaaattga 1740 ggttatgcat cgattgtaag gtaacgatta atccgttttt acggtcagag cactatccgc 1800 taccgaggat agatgatctg ttcgctaagt tagcgaattg taaagtgttt tgcgtgattg 1860 accttcgggg tgcataccaa cagctcagag tctcagagaa ctctcagcaa twcttaacaa 1920 ttaacacaca tgttggattg tttcagtatt tgagattgcc ttttggagtg gcaagcgcgc 1980 catccatatt ccagagcata atggaccaga tattgggtga tattgaaggt tgtgggtgtt 2040 atttggatga tgcattgatc ggtgcagaaa gcttagagaa atgtaagcaa attttggatg 2100 tcgtcttagc acgtttgaac cattacaatg taaaaatcaa tctagataag agtcgatttt 2160 tcgcgagttc cgttgattat ttagggcata cggtttccgg agatggatta aagccgaata 2220 aaagtaaagt cgacgcgatc gtgaacgcgc cagcgcctaa aaacgtgact gaattgcagt 2280 cgtatcttgg tttactgaac tattatgcga agttcatacc gaatatttcg tcggaattgc 2340 gagttttgta tcgtctacta cggaaagatg aacgtttcgt ctggactcaa gaatgtgagg 2400 acagtttcgc gaagagtaag caactgattt tgaacaataa cgtgttacag ttgtacgatc 2460 cgcgaaaacc gatagtcgtc gcggcagatg ctagccccta cggtgtggga gcagttttat 2520 cgcatatcgt tgatggcgaa gaaaagccgg tattgtttgc gtcgtgtaca ttatctccag 2580 cggagaagaa ctattcacaa cttcatcgag aaggtttagc gattattttt gcggtgaaac 2640 ggtttcacaa atacatttac ggtcataagt tcaagcttat ttcggattgt gaagcgttaa 2700 aagaaattta ccatccacga aaaggtacgt cgattgtagc gacatcgcgg ttacagagat 2760 gggcggttat tttatcaatg tatgaatacg awttcgaata caggccaaat cggtgtttgg 2820 cgaacgcgga cgcgttatcg agattaccgg ttccgggaag tacggaaatt gaagaacttt 2880 cgatcaacag actggaaagt tgtccagatc tgccgttgaa aacgcgagac gttgcagatt 2940 ttactgctaa ggatcaagtt ctttctcaag tttacggatt cgtgatgcgc ggttggccgc 3000 gaagtgtacc ggaamatttg aaatattatt ataaccttcg aaattcgtta aattcgcagg 3060 acggatgttt gttttacggt gatcgtgtag taataccgaa attgctacag tcaatcgttt 3120 tgaagatgtt acatgcgaat catgacggta tcgtgcgagg caaaatgcta ggtagaagtt 3180 tgttctggtg gaaaaatatg caaggtgaca tcgaaaatta tttcaaaaat tgcgagattt 3240 gtgatcagag gcgaaatgta ccaactgaaa aggttacatc aaagtggccg aaaacgagtt 3300 gtccgatgga taggatccac attgatttct ttgcgttcga gggtaaaaca tttttgatac 3360 tagtagattc tttttctaag ttcatcgaag caaagttaat gagtagtact aatgcagatc 3420 aggttaatga aaaactagag gatttcttta agttttttgg actgccaaag gagatagttt 3480 ccgacaacgg acctccgttt aattcgtttg atttcgttaa tcactgggag acgcgaaacg 3540 tgaaagtgac taaatcgcca gtgtaccatc ctcagtccaa tggtacagct gagcgtggtg 3600 tacaaacggt taaaaatttc ttgaaaaaac gcctgttgga tcaacaattg aattctaaaa 3660 atttgtcgcg gaagttgaat gaaatactcg cttggtacaa tagtactcca acaaccgtca 3720 catccttatc acctagtgaa ttgattttgc gttacagacc ccgtacagtt ttaactggga 3780 ttaacccgaa agctaattct tcggttgaac aatcgaaccg aaacaccaga caggtaagat 3840 ttgatgagtc caaaaatgag acaattgtgt acgatgcagg taacagaaag atcgcaactg 3900 ataaagagat tcccgagaaa caatttaaaa aaggggaaag agtaatgtat aggaatcatt 3960 tgaaggaagt tgttagatgg attccagtag ttgttatcga taagataggg aaatttttgt 4020 ataaaattaa atttattgag aatgggaaca tcagaattgt acatgtaaat cagttgcggc 4080 acaagaattc ttttatcgaa tcaaatttga ctaaattgcc agaaaagttc gtagatcagt 4140 ctagatgtaa gcgtagaaga agtgattcag gaaacggaat tcatgcacca acaccaaaat 4200 atttgaaaag aagaagtgaa tcaatgagtg atgaactaat gcttagaaga tcagctaggg 4260 aaagtagacc tccagtaaga tttggatttg atgaattgta attagataag tatgtagttc 4320 agaatcaaaa aaggggaaat t 4341 // ID CR1-90_AAe repbase; DNA; INV; 5095 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-90_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5095 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1178-1178 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >93% CC identity. XX FH Key Location/Qualifiers FT CDS 407..1489 FT /product="CR1-90_AAe_1p" FT /translation="MDACHVCSGPLDSARAINCSGSCGRIFHFVCVGMTKS FT QFSSWLAKIGLFWFCDSCRLNFVPAIHDREKIIMKALRELIIRTDSMDTRL FT GNYGENLRKINKTLLESHQQAKSNNSPHTFVQRINEMNFDDSIDDPINRSR FT SCEDTSFFEVLDEVNSTIALLPEKFVVGSNKRVQIVTNPSSESSSRNIPRN FT DISSPATPDRQMNQATKRKSPASNLSHRTVNRTESSGDRIVFANNGANSTA FT LPSSLKRPNSISLKVANDMQVSNDLESFYVTPFTPDQNEEEVKQYVIEISN FT AQPSMVKVTKLVPRGKNAEDLSFVSFKVSVCKTVSSVVGDPWYWPEGITVR FT TFEPTTKNGSAARLPVSK" FT CDS 1444..4881 FT /product="CR1-90_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="THYKKRXSCTSSCLQVKSHAASGHITFPVLPGKPKNG FT ICPAEALIDAAPKLNEGSFLQYHTTNRPHTSTDLLIDDNDDHFANHLAALP FT VSSGNLSMDICLAEAEQDAAPFLNEGLLLQFDVVQSVPDSPTCHLAALPVS FT PGELKSGICLAEAVKDSTRKLNGGLHFQRDDISAAITSEPTSSCDDRVLLF FT YFQNVGGMNTTLAKYLLACSDAXYDVIALVETWLSDNTLSQQIFGPTYSVF FT RCDRSVSNSVKLSGGGVVLAINSRYRSRIICPPTGNIVEQVWVAVTFMDWT FT LYICVVYFPPDRINDPVLINAHLESISWVFNEMNINDNIMVLGDFNIPTVK FT WKRNGSGFFHPDSCQSSISNISCELLDGYSTAGLVQINGVVNSNGRLLDLA FT FASKELLTIVGIAEAPAPLVKYCQHHPPLQLTIQASIRSSNDIVEDFFYDY FT NNANYECMNRLLCNINWNALLPNRDVNDAAVVFSNIMLYSIDQYVPKKIVK FT GPKHPAWSNSKLRKLKSKKRRALRDYTRHRTLLNKRRYVTINNNYKRMNEK FT LYLIHQTEIQRKLKANPKSFWQHVNEQRKDSGIPSTMVLDDVEASSIEDIA FT ELFRVHFCSVFVDEQLSDEEVFMAASNISQRLAFDQMPEVSCQLVETIGNN FT LKISTNPGPDGIPSIVLRKCISSIAQPLAQIFNMSLKNGVFPCNWKESFVF FT PVFKKGNKRVVSNYRGIASQSAASKLFEKLVLEYMMHNCSGLISEDQHGFT FT PKRSTLTSLVLYTNSIIRQIESGHQTDAIYTDFSAAFDKINHQIIVAKLAR FT LGFSGTILKWLESYLLDRSMSVKIGIYNSSAFKVTSGVPQGSHLGPFIFLL FT YLNDVNLRLKCFKLSFADDFKLYSTVKRAADAAFLQDQLNEFSDWCDINRM FT ILNPEKCCVITFTRKRQPILHEYSLKRIVLKRESVVKDLGILLDSKMTFKD FT HISYVTAKASAQLGFIFRVAKSFRDVHCLKSLYCCLVRSILEYGSVVWSPF FT YQNGIQRVETVQRKFVRYALRFLPWNDPYNLPSYENRCKLIDLDLLEMRRN FT VSKATFISDLLSSRIDCPSLLSQLKINIRSRTMRSNDFIRLPFSRTNYCYH FT APLTSMCRIFNKCYSVFDFHLSRTTIKKYFMQILSLFT" XX SQ Sequence 5095 BP; 1463 A; 1072 C; 1045 G; 1510 T; 5 other; tggcatcact gacaatgtat gctgtttgaa gttcttcggt gctctagttt gattatgttt 60 tcgttggatt actcgttttg aaaaataccg tgaaccgttg gtttcgtcgg ctaagaatag 120 ttcgtacctg atttaagtgc atattctcgt cattcatcga tgtacttgtg gtttttgtgt 180 ctttatacga actttgttta cccactattg cacctctgat cgcgcgattt tttttttttc 240 tgaagcgcca tctgccagaa tacatcagat cactactaag caggaacaag tcggcaacat 300 aagttgaaag caaccgctca atcaattgta tacgatcggt cagtgcacgt agtctaccta 360 aaggagctat ttgtwttgcc caacaacgat ttggattttc gtgagaatgg atgcatgtca 420 tgtttgctct ggcccattgg attctgcgag ggcgattaat tgcagtggat catgcggccg 480 aatatttcat ttcgtttgcg tgggaatgac gaaatcccaa ttctcatcgt ggttagccaa 540 aattggttta ttttggttct gcgactcgtg tcgtttgaat ttcgtaccag caatccatga 600 tcgagaaaaa atcattatga aagctttacg tgagctcatt attagaaccg attcgatgga 660 cacacgacta gggaattatg gcgaaaatct caggaagatc aataaaacgc tacttgaatc 720 gcaccaacag gcgaaatcaa acaattcgcc tcatactttt gtgcaaagga ttaatgagat 780 gaatttcgac gattctatcg atgaccctat caaccgatca agatcgtgtg aagatacttc 840 gtttttcgaa gttctcgatg aagtaaatag cacgattgca ctccttccgg aaaaattcgt 900 tgtagggtct aacaagcgag tgcaaatagt aacgaatccg tcgtctgaat caagcagcag 960 aaatattcct cgaaacgata tctcctctcc tgccaccccc gatagacaaa tgaatcaggc 1020 aaccaaaagg aagtcaccag ctagtaattt atcccacaga accgtaaatc gcaccgagtc 1080 cagtggcgat agaattgtat tcgctaacaa cggagctaac tcaactgctc ttccaagtag 1140 cttgaagcgg ccgaattcaa tatcgctgaa agttgctaac gatatgcagg tttctaatga 1200 cttggagtca ttttacgtaa caccctttac acccgatcaa aacgaggaag aagtgaaaca 1260 atatgtgatc gagatttcca atgctcagcc ttcgatggta aaagtgacca agctagtgcc 1320 acgaggaaag aacgctgagg acctttcctt tgtatcgttc aaagtgtcgg tttgcaaaac 1380 tgtttcgagc gtagtaggtg acccatggta ttggccggaa ggaatcactg ttcgcacatt 1440 tgaacccact acaaaaaacg gwtcagctgc acgtcttcct gtctccaagt aaaaagtcac 1500 gcagcatcgg gacacattac gttccctgta ttaccaggta agcccaagaa cggtatatgt 1560 cccgccgaag ccttaataga tgctgcacca aaattgaatg aaggttcgtt tttgcagtac 1620 cacaccacca accgaccaca cacttcgacc gatttgttga tagatgacaa tgatgaccac 1680 tttgcaaacc acttggctgc acttcctgtt tcctcaggta atttaagcat ggatatatgt 1740 cttgccgaag ccgaacaaga tgctgcacca tttttgaatg aaggcctact tttgcagttt 1800 gatgttgttc agtctgttcc cgactctccg acatgccact tggccgcact tcctgtttct 1860 ccaggtgagt tgaagagtgg tatatgtctt gccgaagccg tgaaagattc tacacgtaaa 1920 ttgaatggag gtttacattt tcagcgcgat gacatctccg ctgcaatcac cagtgaaccc 1980 actagctcct gtgatgatcg agtgctgcta ttttatttcc aaaatgttgg aggaatgaat 2040 acgacactgg ccaagtacct gttagcctgc agtgacgcca sttatgatgt gatcgccttg 2100 gttgaaacct ggctcagcga caatactctg tctcagcaaa tcttcgggcc cacmtattca 2160 gtattccgct gcgaccgttc tgtatccaac agcgttaaac tttctggggg tggagtagtt 2220 ttggcgatta attcacgtta cagatctcgt attatttgtc ctccgactgg taacatcgtc 2280 gagcaagtat gggtggcggt cacattcatg gattggacgt tatatatatg tgtagtgtac 2340 tttccacctg atcgtatcaa cgatccagta cttattaacg cccatctaga atccatctca 2400 tgggtattca atgaaatgaa cattaacgac aacatcatgg tacttggcga tttcaatatt 2460 cctaccgtga aatggaaacg aaacggttct ggcttcttcc atccggattc ttgtcaatct 2520 tctatcagca atatttcctg cgagctgctg gatgggtaca gtaccgccgg attagttcaa 2580 attaatggag tggtcaacag taacggccga ctgttggatc tagcatttgc gagtaaagag 2640 ttactaacca ttgttggcat agctgaagct ccggcacctt tagtgaaata ctgccaacat 2700 catcctcctc tgcagctcac tattcaagcg tcgattcgaa gttcgaacga catagtcgag 2760 gatttttttt atgactacaa caatgcaaac tatgagtgta tgaatcgatt gttatgcaat 2820 atcaactgga atgctcttct acccaatcga gatgttaatg acgcagcggt tgtattctcc 2880 aatatcatgc tgtactctat tgatcagtac gttccaaaga agatcgtgaa aggtccgaag 2940 catccagcat ggtccaattc aaagttgaga aagctgaaaa gcaaaaaacg acgtgcccta 3000 agagattata cgagacaccg cactttgttg aacaaacgac gatacgttac gattaacaac 3060 aactataagc gtatgaatga aaaactctac ttaatccacc aaactgaaat tcaacgtaaa 3120 ttgaaagcta accctaagag cttctggcag catgtaaatg aacagcgtaa ggattcgggc 3180 attccttcta ccatggtgtt agacgatgtt gaggcatcat caattgaaga tattgcagaa 3240 ctttttcgcg tgcatttctg tagcgtattc gtcgatgaac agctatcgga tgaagaagtt 3300 ttcatggctg cttctaatat ctcgcagcga ttagcttttg accaaatgcc agaagtttct 3360 tgtcaacttg tcgaaacgat cggtaataat ttgaaaatct ctacgaatcc tggtcctgat 3420 ggtataccgt ctatcgtgct cagaaaatgc atatcctcaa tcgcacagcc attagcgcaa 3480 attttcaaca tgtctctaaa gaatggtgtt tttccatgta attggaagga atccttcgtt 3540 ttcccggttt tcaaaaaagg gaataaacgg gtcgtttcaa actatcgagg aattgcatct 3600 cagagcgcag cttcgaaact gtttgaaaag cttgtcctgg agtacatgat gcataattgt 3660 tcaggtttaa tttccgagga tcaacatgga tttaccccga aaagatcaac gttgaccagc 3720 ctggttcttt acaccaactc aattatccgc caaatagaaa gtggccacca gaccgacgct 3780 atttacaccg atttctcagc ggcgttcgac aaaataaatc accagatcat agttgcaaaa 3840 cttgcccgtc tcggatttag cggaactatt ctcaagtggc tggaatcata tctcctcgac 3900 cgttctatgt ctgtgaaaat cggcatttac aactcatctg cgtttaaagt aacttcgggt 3960 gttccgcaag ggagtcatct aggaccgttc atattcctgc tataccttaa cgatgtgaac 4020 cttcgtctta aatgctttaa actttcattt gccgatgatt tcaaactgta ttccaccgta 4080 aaaagagcag cagatgcagc cttccttcag gatcaactca acgaattctc cgactggtgc 4140 gatattaacc gaatgattct gaaccccgag aaatgttgtg taataacatt tacacgcaaa 4200 cgtcaaccaa ttctgcatga atacagccta aagaggatcg tcctgaagag ggaatcggtt 4260 gttaaggatt tgggaatatt attggattca aaaatgactt ttaaagacca catatcatat 4320 gttacagcaa aggcatcagc tcaattaggt tttattttcc gggtggctaa gtcgttccga 4380 gatgttcatt gtttgaagtc cctgtattgt tgtttagtgc gctcaattct tgaatatgga 4440 tcggttgttt ggtctccgtt ttaccagaat ggaattcaga gggttgagac cgtgcaaagg 4500 aaatttgtta gatatgcgtt aagattttta ccgtggaatg atccttacaa tctccctagc 4560 tatgaaaatc gatgtaaact aattgatctt gatttgctcg aaatgcgccg aaatgtatca 4620 aaagcaacct ttatctcgga tttacttagt tccagaattg actgccccag tttactgagt 4680 caacttaaaa ttaatatacg gagtcgaacc atgcgtagta atgatttcat acgtttaccg 4740 ttctcgagaa ctaactattg ttaccatgca ccattaacta gtatgtgtcg tatttttaat 4800 aagtgttaca gtgtgtttga ttttcatttg tctaggacta caattaagaa gtattttatg 4860 caaattttgt ctctcttcac gtgatagatt ttaagaattt tatttgataa stttatattt 4920 tttgttatat gtgtattgta aagctgattg tatcattggg aatgtaattc tgttgatgcg 4980 aaaagatgag gaggttttgc gcccgtttga gagagagcaa tagtaaaaac cgctcaactc 5040 aaacaggctt ttccctgctc caaataaata aataaataaa taaataaata aataa 5095 // ID Gypsy-19_RP-LTR repbase; DNA; INV; 333 BP. XX AC ACPB02043044; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_RP_; KW Gypsy-19_RP-I; Gypsy-19_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-333 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02043044; Positions 4 336. XX SQ Sequence 333 BP; 106 A; 37 C; 60 G; 130 T; 0 other; tgtaagagtg aactatcttg tttttctttt tatttattta aagaaggtat cttggcgcgc 60 tttttgtttt tgaggaaggt taagggaggt tagcacccca taaaaatatt gaatgtgcag 120 tgtgcagttt tctattatag tattgaagtg aggtttaatt aggtaaatga ctataaaaac 180 cgattttctt ttataggtga gctttttact tatttgttta taagataact ttgtattata 240 aggtaacttt gtattgttag actaaaataa aatcttcttt tataggaaaa atcggtatat 300 agtgaataac aaaacccgaa cccatttacc aca 333 // ID HAT2b_Cis repbase; DNA; INV; 789 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; HAT2b_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-789 RA Smit A.F.; RT "HAT2b_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000522 shares terminally inverted repeats with HAT2_Cis. No CC similarities with other DNA transposons yet. XX SQ Sequence 789 BP; 256 A; 159 C; 153 G; 221 T; 0 other; cccagtatgg aaagcatggc aagtgcaccc gcgatatttt ttaaactatc ataaatgcac 60 ccgaaaaaaa tcgtaaaatg cacccctgct aacctgcact tgctatataa aaacaaacca 120 ttcgtcaacg attaacgcat ttaacgcagc cctaatacga ctaatcgtac gcattcaatg 180 aaaagcacag taaataagct ttcccttaac ccaggaaacg tcgattaagc attcgatgcg 240 gtagcggtca gattgaatcg cacaaatagt ctcgaacact gatacgtacg gtactttacc 300 taaagcgaga aggaaagcga acatgtttat aggttagtta tcgaagaaaa atacatcagt 360 gttctgttta aaatgcaaac gatttattta ctgagctatt aagactagtt ttaatcctaa 420 acctgaaact ctaatttaat tttatccgtt aggactggat taaatcgatt gaatagtcat 480 gggtacgata tgggaattca catttcaaaa aaatatggaa cattaagtgc tttccagagt 540 tgattaattg atgcaaactt aaagtccaaa acagaaaaat attatttatt gtaccgtgta 600 tttctatttc ggccgtttct ataggtttac gaaagttcca acagagaatt ggcgatatgt 660 aacgtaattg cttggtatat ctgcaccatg cattaggatc gtgcacccgg ctgtcgggat 720 cgtgcacccg gcggtcgggg ttgtgcaccc gcgagatctc gtcgagatct gccatacttt 780 ccagactgg 789 // ID Crack-16_BF repbase; DNA; INV; 3386 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-16_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-16_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3386 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3386 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 821-821 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..2928 FT /product="Crack-16_BF_2p" FT /translation="DIQTNPGPDLHLPSKGLHVGHVNINSLRNKVQELSQL FT LDVNKLHILAISETHLDSSVLDGALNVGEYSVLRKDRNNRGCGVAMYIHNN FT IPHKRRSDLETSDTEIIWAEVHLPYVKPILVGCLYRPPSAKLEYLHKIVQI FT LDGVTEADREVLLLGDFNINWLSKYCPLKERLETELSTLNLKQMVKEVTRI FT SRTINCPVSSSCIDLIFSNRPDKFLSTKSTPLGCTDHNLVHTTRKTKLPKG FT QSTILLKRTFKRFDQEQFLQDIADTTWDTVYENDNVEGALDNFMTLFTTVA FT DKHAPVCKRTVRASSAQWVDEDLRVLMDERDRAKAEANATGTLSDFNRYKR FT LRNDVVRENRKKKKTFFKDQIETHSKDRNTKQLWTVINGMMGKTSGKSPSF FT IESKGQIITKPRDIAEHFNDYFIDKVNGLTLDKDINKQEAVAPIENIMSSK FT NCSFTFQAATVNQVKNILKSLPPGKATGHDHIDNTLLKLSAEYIAGPVCYI FT INSSLRQGVFPAQWKKAKVIPLIKNRSAPLNGPNSRPISLLPVISKVMEKV FT CQKQIQSYLISNDIISQKQHAYKQHHSTASALVQMTDHWLGEIDKGNLVGA FT VLLDFSAAFDTVNHEILLAKLKSYGFADSAVEWMESYLTNRSQAVYINGAF FT STYKETKSGVPQGSCLGPLLFCLYTNDLATITKKAELVMYADDTTPFASAP FT SARAVESILQSDLTRIWEWIQINKLILNVGKTKSILLGSHHKLKHNPTLDL FT TVGGKKIMQVSQAELLGYTVDQHLSWNLHANKITRKMTGVTATIRKYKDYF FT PEDTLKTVIQSLVLTHLDYCSGILSSTAQTQLDRLQIIQNRAARLAANTSH FT TTSSEVIHDRLCWPSVKERMFTTTICAFHKIVTTKEPKCMYDNLRNINTAH FT HHNTRLSLRGGYVQPKPRTNSLTRTFQYRCIKTYNTLPTSITTKNQEVFKT FT KLREWVKAQKDKNQPLTSWW*" XX SQ Sequence 3386 BP; 1185 A; 718 C; 628 G; 855 T; 0 other; gacatacaga ccaatcctgg tccagacctt cacctgccta gtaaagggct ccatgtggga 60 catgttaaca ttaacagtct tcgaaataaa gtacaagagt tatcccagtt attagatgtc 120 aacaaattac atattttagc catctcggaa acacatctag acagtagtgt tctagacgga 180 gcattaaacg ttggagaata ctctgtatta agaaaagacc gcaacaacag aggatgcggg 240 gtagctatgt acatacataa caacattcca cacaaaagac gttctgactt agaaaccagc 300 gatacggaaa taatatgggc tgaagttcac cttccgtatg ttaaacctat cctagttggg 360 tgcctgtaca ggcccccaag cgcaaaactt gagtacctac acaaaattgt ccaaatattg 420 gacggagtga cagaagcaga ccgagaagtt ttactactgg gtgactttaa catcaactgg 480 ctaagcaaat attgtccact gaaagaaaga ctagaaactg aactctcaac actaaactta 540 aagcaaatgg tgaaagaagt aacaagaata agtagaacta ttaattgccc tgtgtcgtcc 600 tcatgtattg atctaatctt ttcaaacaga ccggacaaat tcctaagcac taaatctact 660 cctctagggt gtactgacca taatctggtg cacacaacta gaaaaacaaa actgcctaaa 720 ggacagtcca caatacttct taaacgcaca ttcaaaagat ttgatcaaga acaattctta 780 caggatatag ctgacaccac atgggacaca gtatacgaaa atgacaacgt tgagggagct 840 cttgataact tcatgactct ctttacgacg gtggctgaca aacatgctcc agtctgtaaa 900 agaaccgtaa gagcttcatc cgcccagtgg gtagatgaag acctaagggt tcttatggac 960 gaacgagatc gtgctaaggc ggaggcaaat gccacaggaa cgttgtcaga cttcaacagg 1020 tacaagcgtc tccggaatga cgtggtaaga gagaatagga agaagaagaa gacctttttc 1080 aaagaccaaa tagagacaca cagcaaagac agaaatacca aacagttgtg gactgtgata 1140 aacggaatga tgggcaaaac gtccggcaaa tctcctagtt tcattgagtc aaagggacaa 1200 ataattacca aacctaggga tatagccgag cactttaacg attatttcat tgataaggta 1260 aatggtctaa cactagataa agacataaac aagcaagaag ccgtggcacc tatagagaat 1320 attatgtcat ccaagaattg ttctttcaca ttccaagctg ctacagtcaa ccaggtaaaa 1380 aacattctca aatcacttcc accaggtaag gccacagggc acgaccatat agacaacaca 1440 ctactgaagc tcagcgcaga gtacatagca ggccctgtct gttacataat aaacagctca 1500 ttaagacaag gggtcttccc agcacagtgg aagaaagcaa aagtgatacc actaatcaaa 1560 aacaggtctg ccccattaaa cggtccgaac agccgaccca tcagcttgct gccagtgatc 1620 agtaaagtta tggaaaaagt atgccaaaaa caaattcaaa gctatctcat ctctaacgat 1680 ataatatcac agaaacaaca tgcatacaag caacaccact ccacagccag cgctttagta 1740 caaatgactg atcactggct aggggaaata gacaagggaa atctggtcgg tgcagtcttg 1800 cttgatttca gtgccgcttt tgacactgtt aaccatgaaa tattgttggc caagttaaaa 1860 agctatggat ttgcagattc tgctgtagaa tggatggaaa gctatttaac aaacaggagt 1920 caggccgtct acattaacgg agccttttca acttataagg agaccaagtc cggagttcca 1980 caagggagct gcctgggtcc gttgttgttt tgtctctata ctaatgactt ggccaccata 2040 acaaagaaag ccgaactagt catgtatgcc gatgacacta ctccctttgc ttccgcaccc 2100 agtgctagag cagtagaaag tatcttacag tcagatctca cgcgcatctg ggaatggata 2160 caaataaata agcttattct taatgtcgga aagacaaagt ccatactcct gggcagtcac 2220 cataaactaa agcataatcc aacactggat ctcactgtag gggggaagaa aataatgcaa 2280 gttagccaag ctgaactcct gggttacaca gtagaccaac acctatcctg gaatttgcat 2340 gctaacaaga tcacaagaaa gatgacagga gtcacagcaa caatcaggaa gtataaagat 2400 tactttcctg aagacacact aaaaacagta atccaatcac ttgtacttac gcatcttgac 2460 tactgttctg ggatactttc ctcaacagca caaacacagc tagacagact acaaataatc 2520 caaaacagag ctgcaaggtt agccgcaaat acatcacaca caacaagctc agaagtgata 2580 catgatcgac tatgttggcc gtcagttaag gaaagaatgt ttacaacaac aatctgtgcc 2640 ttccataaga tagtaacgac taaagaacca aaatgtatgt atgacaactt aagaaatata 2700 aacaccgcac atcaccacaa cacaaggcta tcactgagag gaggctacgt acaacctaaa 2760 ccccgtacaa attcattgac cagaacattt caatacagat gtataaagac ctacaacaca 2820 ctccctacaa gtattacaac aaaaaaccag gaagtattta agactaagtt aagagagtgg 2880 gtaaaggcac agaaagacaa aaaccagcca ctcacttcat ggtggtaaaa atgaaaccca 2940 tctctcggtt ccaattatgt tttgttttat tcatgatgta tcaatgacta ttattattat 3000 tatttttgct atatgattta tatgctttca aatactgttt gtttcaatac ttcagttcaa 3060 attgtatttg atttactatg attttgctcg acctccgacc tctcccactc cgtttttgag 3120 ttgatattac caagtttctg ttaattcaat tggacatttt gttgtatata taagaactaa 3180 gttgatttat tttattttac ttcacgtatg caaatcactg tttttcagtt taccatatgt 3240 ttgctttctg ttatttcaaa gttataattt gtataatttg tacatttctg ttgcaattat 3300 ttctgtattg ttttaatgtg gactccagga agactagtgt atttcactaa tggagatcta 3360 aataaaccaa accaaaccaa accaaa 3386 // ID BEL-40_AA-I repbase; DNA; INV; 5957 BP. XX AC AAGE02018633; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-40_AA_; KW BEL-40_AA-LTR; BEL-40_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5957 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018633; Positions 10789 16745. XX CC Positions [4968-5549] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 534..5933 FT /product="BEL-40_AA-I_1p" FT /translation="MPPVTRSKQAEEKTANQQQQSQDSVSVADTSGSFYGF FT AEWEAEMDPETKKEFEKAVRQRNQVRMKLVRINRTLASNQEIGLAQLNVLS FT KGLSATYAEFSQLHTNIVGLVPDEAMEEQEQEYADYEEMHYAASNVVEALI FT LAAKPAAIPPASATAPQVVIQQQPLKAPIPTFDGGYAAWPKFKAIFQDLME FT NSGDSDAIKLYHLDKALIGEAAGVLDTKVLSEGDYDHAWEILTDRFENERV FT IVETHIRGLLSLKKMTSETHKELRALLNEATRHVESLRYLEQEMTGVSEHI FT IVYLIISALDKSTRKAWESTQKKGQLPKYSQTIDFLKSRCQILENCDAAFP FT TTNPTVKPRQFQQLSKQSQKSHAVSTTISQACEICGGDHRNIQCTALDKLS FT ASQKQEKVRAAGVCFNCLRKGHRSRECPSDKSCRKCQRRHHTLLHDDGAPI FT QVTKSSVSLPAESVVNLPPVPAAPVQPKSGQMNPPVDQPVSTTCSSNFAQS FT SKTVLLLTAVVQVFDKRNQPFPCRVLLDSGSQVNFVTEELANRLGLPKKPA FT NVPITGINALRTLARDKVTLRIRSRVSSFQASLECLVTPKVTGTIPSSKIN FT INHWDIPDGVVLADPEFHTPDKVDLLIGAELFFDILKPSQLNLADNLPMLR FT DTHFGWIVSGVIVEPQVTNVSVQQSNHATVEDVERMMQQFWQIEEVPDVPK FT LSCEELACEAHFLSTYQRDEDGRFIVKLPFKQNINQLDDCRALALKRFLML FT EKRLVRNPELQTQYVEFLREYEALGHCHEVRETDDPPNQLAYYMPHHAVLR FT PSSSSTKCRVVFDASAKSSASELSLNEVLQVGPVVQNDLHFIVLRFRKFKI FT AFSGDVSKMYRQVLHAKQDRRFLRIFWRPHPLQPLRVLELSTVTYGTASAP FT FLATRCLVQLVEEDGDAFPIASRIVKEETYMDDVLSGADSVEEAIEAQRQL FT KQLLELGGFPIHKWCSNSEEFLEHIPEEDREQKKPLEERGVNEAIKVLGLL FT WDPSADTLFIANHPKATTAADQQRVTKRMMYSEIAKFFDPLGLVSPVIVLA FT KLLAQRLWQLKIGWDDPVDEATAQEWQELQTSLSHLHQIDIPRCVTFNEVI FT AYELHGFSDASTVAYGACVYLRSLFADGSAKLRLLTSKSKLAPLHDLSIPR FT KELCAALLLTRLVQKVLPALDMTFREIVLWCDSTIVLAWIRKPLNQLQLFV FT RNRIAVIQENTGDYRWEYVRSLRNPADIVSRGQLPETLKNNQLWWNGPDFL FT QRVEYDIDLPQFVPDDQLPELKGVIASPAVSIEPFPFFSRFSSFRTIQRIM FT GYVLRFVNNCRNPRNQRVSSRHLTVGELRRSTEAILHVIQLVHLADEIKRV FT NANEPCKRLANLRPIYSDGLLRVGGRLDRSLLPFENRHPIILPDKDPVVRL FT LVQKMHIELLHVGQTGLMNALRQRYWLLNARSTIRSITRTCVRCFRVNPSN FT TSQLMGNLPAARVVPSPPFAVTGVDYAGPFIIKQGARRPALIKAYVSVYVC FT MTTKAVHLEAVSDLSTDAFLASLKRFIGRRGMVQQLHSDNATNFRGAHHEL FT NELHRQFQDQQSVSTIEDFCRSREIEWHFIPPDAPEFGGLWEAAVKSAKTH FT LKRIVGNVKLTFEELSTVLVEIEAVLNSRPLFTISNDPADPLVITPAHYLI FT GRPLTAMAEPSLENVNATRLTRWQHLQLMREHFWRAWSREYLNTLQPRKKN FT LRTVPNIRKGMVVLLHDRNQPPLYWKMGRVTAVYPGDDGLVRAVDVYSGGS FT TFRRPINKLSVLPIEDNQPGSDQPIRKDC" XX SQ Sequence 5957 BP; 1479 A; 1653 C; 1560 G; 1265 T; 0 other; tctggtcctt cgaaccggat cactagtgcc gtgtagtgat ttcttgtgaa gattctgccg 60 gatttgaccg ccggccgagt ggtaaatccg gagaagaaaa acccgtgaaa atccttcctg 120 cattgtgccg ccgcccgagt ggcgcaagcg aagaaaaagt gttgtgacga tccagccgga 180 tcgagccgtc gtccgcccgc gtcgtcggcg tgcgaaagtg aaaaagtttg cacacgctgt 240 gcatgtccaa acgtgtttgt gcgagacgag ctacgcgaaa ttggaacaaa gtggaaagtg 300 aaccaaaaca agattgctcc ctctgctgtg agtgcccgag caaaaacttt gaactgaaaa 360 actgcgtcgc aagtttcgac cgaaaagttt tcgcgcgttc gagaaaaagt gtgcacaaat 420 tggtgcgctg cgcaaaagtg ttttcgcgac gagttaaatt tcgaaacgcg tccgaaagtg 480 tttgaactgt gcgtgatcct gaataagaag agcgcgaaga aaaacagtgc taaatgccgc 540 ctgtgacgcg gtcgaagcaa gccgaagaaa agaccgctaa tcagcagcag cagagccaag 600 attcggtatc cgttgccgat acttccggat cgttttacgg attcgctgag tgggaagcag 660 aaatggatcc agaaaccaag aaggagttcg agaaggccgt aaggcagcgc aaccaagtga 720 ggatgaagct ggtaagaatt aaccggacgc tggccagcaa tcaagaaatt ggattggcac 780 aactgaatgt gctttccaaa ggactttctg ccacctacgc cgagttcagt caactccata 840 ccaacattgt ggggctggta ccggacgaag cgatggagga gcaagaacag gaatacgcgg 900 attacgagga gatgcactat gccgcgtcca acgtcgtcga agcgctgatt ttggcagcaa 960 agccagctgc tatccctcca gccagtgcaa cagcacctca agtggtcatc cagcagcaac 1020 cattgaaggc accgattccg acgttcgatg ggggctatgc cgcctggcct aagttcaagg 1080 ccatcttcca agacttgatg gagaactcgg gggatagtga cgcaattaaa ctctatcacc 1140 tcgacaaggc actcatcggc gaggcagcag gtgtcctgga tacgaaggtc ctcagtgaag 1200 gagattacga tcatgcctgg gaaattctga cggaccgctt cgagaatgaa cgcgtgattg 1260 tggaaactca cattcgtggg ttgttgtccc tcaagaagat gacctcggag acccacaaag 1320 agcttcgagc gctcctgaat gaagccaccc gtcatgtcga gagcctccgc tacctcgagc 1380 aggaaatgac cggagtatcg gaacacatca tcgtgtatct gatcatctca gcgctggaca 1440 agtcaacccg gaaggcctgg gaaagtaccc agaagaaagg acagcttccc aagtactcgc 1500 agaccatcga tttcctgaag tccaggtgtc agatcctgga aaactgcgat gcagcgtttc 1560 caacgacgaa ccccacagtg aagccgaggc aattccagca actttcgaag caatcccaga 1620 agagccatgc agtttccact actatttcgc aagcgtgcga gatttgtggt ggtgatcatc 1680 ggaacatcca gtgcaccgca ttggacaagc tgagtgcctc ccagaagcaa gagaaggttc 1740 gagctgcagg agtttgcttc aactgcctgc gtaaagggca ccgctcccga gagtgtccat 1800 ccgataagtc gtgccgcaag tgccaacgcc ggcaccacac gctgctccat gatgatgggg 1860 cacccatcca agtgacgaag tcgagcgttt cccttcccgc ggagtccgtg gtaaacctac 1920 caccggttcc ggctgctccg gttcaaccga agtcggggca gatgaatcct cctgtggacc 1980 agcctgtgtc cactacctgt tcgtcgaact tcgcccagtc gtccaagacg gtactgctgc 2040 ttaccgcagt ggtgcaggtg tttgacaagc gaaatcagcc attcccatgt cgtgtcctgc 2100 tggacagcgg ctctcaggta aattttgtga ccgaagaatt agcaaatcga ctaggtttac 2160 cgaagaagcc agctaacgtc ccgattactg gtatcaacgc cttgcgcacc cttgcccgcg 2220 acaaggtaac cctgagaatt cgatcccgag tatccagctt tcaagccagc ctggagtgcc 2280 tagtgacacc gaaagtgacg ggcacgattc catcgtccaa gatcaacatc aaccactggg 2340 acattcctga cggagtggtt ctggccgatc ctgagttcca cacacccgac aaggtggatc 2400 tactaatcgg cgcggaacta ttcttcgaca ttttgaaacc gagtcagctg aatctagcgg 2460 acaaccttcc gatgttgcga gacactcact ttgggtggat cgtatccggc gtcatcgtcg 2520 agcctcaagt cacaaacgtt tccgtccagc aatccaacca cgctaccgtc gaagatgtcg 2580 agcgaatgat gcagcagttc tggcaaatcg aggaagtgcc agacgttccc aagctttcat 2640 gcgaggagtt ggcctgcgaa gcccacttct tgtccaccta tcagcgagat gaagatggac 2700 gattcatcgt gaagctaccg ttcaagcaga atatcaacca gctagacgac tgccgcgctc 2760 tagcactcaa gaggttcctg atgctggaga agagactcgt ccgcaatcca gaactgcaga 2820 cgcagtacgt ggagttcctc cgggagtacg aagctcttgg acactgccac gaagtccggg 2880 aaaccgacga ccctccaaat cagctagcgt attacatgcc gcatcacgcg gtgttacgac 2940 cttccagctc gagcacgaaa tgtcgtgtcg tgttcgacgc cagcgccaag tcttcagcat 3000 ccgagctgtc cctgaacgaa gtgctgcagg taggacctgt agtacagaat gatctgcatt 3060 tcatcgtcct gcgcttccgg aagttcaaga tagcgttctc cggagacgta tccaaaatgt 3120 accgtcaggt actgcatgcg aaacaagatc gtcgtttcct gcgaatcttt tggagacccc 3180 acccgttgca accgctgcgg gtcctggagt tgtctaccgt tacctacggt accgcatcgg 3240 caccattcct agctactagg tgcttggtgc agttggtgga agaagacggc gatgcctttc 3300 caatcgcctc tcgtatcgtg aaagaggaaa cgtatatgga tgatgtgctc tccggcgcag 3360 actcggtgga agaggccatc gaagcccagc gacaactgaa gcaactcctg gagctaggag 3420 gattccccat ccacaagtgg tgctcgaact ccgaagaatt tctcgagcac atccccgaag 3480 aagatcgcga gcagaagaag ccattggaag aacgaggagt gaacgaagcc atcaaggtgc 3540 tcggtttgct ctgggacccg agtgctgata ccttgttcat cgccaaccat ccgaaggcca 3600 cgacagcagc cgaccagcaa cgcgtgacga agagaatgat gtattcggag atcgccaagt 3660 tcttcgaccc tttgggtttg gtgtcgccgg tcatagtatt ggcgaagctc ctggcacagc 3720 gactgtggca gctcaagatc ggctgggacg atccggtcga cgaagcaaca gcgcaagagt 3780 ggcaagaatt gcaaacatcg ttgtcacatt tgcaccaaat cgatatcccg agatgcgtaa 3840 ccttcaatga agtgattgcg tatgagctgc acgggttctc cgatgcctct accgtggcgt 3900 acggggcctg tgtctacctg cgaagcttgt ttgccgacgg ttcggcgaaa ctacgactcc 3960 tcaccagtaa gtccaagttg gctcctctgc atgacctttc cattcctcgg aaagagttgt 4020 gcgccgctct actgctcacc cggttggtgc aaaaggtgtt accagccctg gacatgacgt 4080 tccgggaaat tgtgctgtgg tgcgatagta cgattgtcct ggcctggata cgaaagcctc 4140 tcaaccaact acagttgttc gtacgaaatc gaatcgctgt aatccaagag aataccggag 4200 actacagatg ggagtacgtt cggtccctgc gaaaccccgc tgatatcgtt tcccgaggcc 4260 aactaccaga aacactgaag aacaaccagc tttggtggaa tggcccggac ttcctccaaa 4320 gggtggaata cgacatcgac ttgccgcagt tcgttccaga cgatcagttg cccgaactca 4380 aaggagtgat agccagtccg gctgtaagta tcgaaccgtt tccatttttc tccaggttca 4440 gtagcttccg aactattcaa cgtatcatgg ggtacgtgct gcgattcgtc aacaactgcc 4500 gaaaccctcg caatcagcgc gtatcaagcc gacatctgac cgttggtgaa cttcgtcggt 4560 caactgaagc catcctccac gtaatccaac tcgttcattt ggcggacgag attaagcgag 4620 tgaatgccaa cgaaccctgt aagagactcg ccaaccttcg tccgatttac tctgatggcc 4680 tattgagagt gggcggccgt ttggatcgct cgctgctacc cttcgaaaat cgtcatccca 4740 tcatcttgcc ggataaggat ccggttgtac gcctgttggt gcaaaagatg cacatcgaat 4800 tgcttcacgt cgggcagacc ggcctgatga acgccctgcg ccaacgttat tggctcctga 4860 atgctcgttc taccattcgt tcaatcaccc gtacatgcgt gagatgcttc cgagtcaacc 4920 caagcaacac cagccagcta atgggcaact tgccggctgc cagagttgta ccatcacctc 4980 cgttcgccgt caccggcgtg gattacgccg gcccgttcat catcaagcaa ggagcgcgcc 5040 gtccagcgtt gatcaaggca tacgtctccg tctacgtgtg catgactacc aaggccgtcc 5100 acctggaagc agtgtcggac ttgagcaccg atgctttcct ggcgtccctt aagcgtttta 5160 tcggacgacg aggcatggtt caacagctcc actcggataa cgccaccaat ttccgaggag 5220 cccatcacga gttgaacgag ctccaccgac agtttcaaga ccaacaatca gtatcgacca 5280 tcgaagactt ctgtcgctcc cgtgagattg agtggcattt catcccgccg gatgcaccgg 5340 agttcggcgg cctctgggag gccgcagtaa aatctgcgaa gacccatctc aagcgcatcg 5400 tcggcaacgt caagttaacg ttcgaggaac tgtctaccgt cttggttgag atcgaggcgg 5460 tattgaactc tcgaccactg ttcaccatct ccaacgaccc tgcggatccg ctggtgatta 5520 caccggccca ctaccttatc ggccgtccgc tcaccgccat ggccgagcct tccctagaga 5580 atgtcaacgc aacccgcttg actcgatggc agcatctcca gctcatgcga gaacatttct 5640 ggcgcgcctg gagccgggaa tacctaaaca cgctgcagcc aaggaagaag aatctccgaa 5700 ctgtgcccaa catcaggaag ggcatggtag ttctactcca cgatcgaaat caacctccgc 5760 tctactggaa gatgggccgt gttacagcgg tctaccctgg tgacgatggc ttagtccgag 5820 ctgtcgacgt ctacagcggt ggttccacct tccgccgccc aatcaacaag ctgtcagtgc 5880 tacctatcga ggacaaccag cctggttccg accaaccgat acgtaaggat tgttgagtac 5940 cctcaacggg gggtgta 5957 // ID Gypsy-2_AC-LTR repbase; DNA; INV; 180 BP. XX AC AASC02001731; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_AC_; KW Gypsy-2_AC-I; Gypsy-2_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-180 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02001731; Positions 5450 5629. XX SQ Sequence 180 BP; 47 A; 31 C; 28 G; 74 T; 0 other; tgtatggtta tttgttatat gagattagtt catgttctct gtacgttatg tggtgtaccc 60 catattattt catgttatgt tgtacctgta tcattcattg tccctgttaa tccgtgttca 120 cagatgaaat catcatctac cgtttataat aaatgaacta cacaaagaag tctttcttca 180 // ID SART1 repbase; DNA; INV; 6702 BP. XX AC D85594; XX DT 30-JUN-2010 (Rel. 15.06, Created) DT 30-JUN-2010 (Rel. 15.06, Last updated, Version 2) XX DE Complete sequence of retrotransposon SART1. XX KW R1; Non-LTR Retrotransposon; Transposable Element; SART1. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-6702 RA Takahashi H., Okazaki S. and Fujiwara H.; RT "A new family of site-specific retrotransposons, SART1, is RT inserted into telomeric repeats of the silkworm, Bombyx mori."; RL Nucleic Acids Res 25(8), 1578-1584 (1997). XX DR EMBL/GenBank/DDBJ; D85594; Positions 1 6702. XX CC This family of non-LTR retrotransposons is specifically inserted CC into insect-type telomeric repeats (TTAGG)n in the opposite CC direction to TRAS1. XX FH Key Location/Qualifiers FT CDS 880..3015 FT /product="SART1_1p" FT /translation="MSSYKEELPQEGTSRSAGGESLRAAERSAARCPSSSG FT GGNNCSKRSVKDGGRVSAKDGENKERRGGSVKDNEAANVLSDDSMATASSS FT RSSLTGSRRKRLKGSSTECSESSSGEETACSARRAASKTPSKRGRGRPPTT FT GQYVGLAAAKEAYVKAQREELALREREVTESVRKLRVREDAVAGSTGEPAL FT DPERRLDESAKLAVAIAAKSGNLKGTYVRALKEMAAAIREAKEDMAARTTD FT GETKRLQELCRRQEAEILHLKNAIADMRSEMARLAQAVGSPAAVPAPVPAP FT APIQRSDDEEERRLQRIMRAVGTMLDARLSGLEARLPPEPRMRPPLAADHR FT RSRGQEKTPTPPTAAESMPTPGPTAAVATAAPNGGGPQKKKAPKSRQQPAQ FT PEPRTFPPAPAALTEAWTTVARRGAKPRTATATGTVSAPPPLKEKKKRLRP FT PRSQAVIVKLQPEAVERGVTYRAVLAEARAKIDTAELGIPIQRIRSAVTGA FT KVLVVDGADQNEKADLLAEKLREVLPSDSIVVSRPTITAAVRLSGLDESIS FT REELVAEVARIGECPPDIVKVGEIKMGPGGTGQVLVRCPIAGVKKILAVNK FT LRIGWSVLRVQLLEARRLQCYRCHALGHVSARCPSSVDRSGECYRCGQTGH FT KSAGCALTPHCTICAGAGRPAAHVSGGKACAKPPKQRQQRSSAAEEKRRSQ FT PGTATATPMDDE" FT CDS 3018..6218 FT /product="SART1_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MTSSPYHILQGNLNRSARAQDLLIQSMAERLTHLAVV FT AEPYRVPSVPDWAGDIDGLVAVVQRRSAVGAPPEFDVVQRGRGFVAVFWAG FT LLVVGVYFSPNRPLAEFESFLDELGQVVGRSRSRRALVLGDLNAKSSAWGS FT PVTCPRGRETEEWLVGSGLVVLNRGAENTCVRRSGGSVVDVSFATPDVARR FT VCGWEVLVDVETLSDHRYIGFRVAAAPESSSVTTLPFGGGVEGPRWALKRL FT DTERLQEAAVVQAWRLDSLGEPADVREGAERLREAMSRVCDYAMPRVRAYA FT PRRQVYWWNEGIAGLRRRCAGSRRKYQHLRRRRRRDEEEEDRAYEEYREDV FT RALRVAIGEAKEAAWKELLETLDRDPWGRPYRLARSTMRSWAPPATSTLPP FT DIVRHVIGGLFPDAPGTPFVPPVMITPTTGAQIGAEEEEVSPAEFGAAIEK FT MKARRTAPGPDGLSNRAWALALQTEGGLGPVLRGLLSRCLREGRFPEEWKT FT GRLVLIPKEGRPRDQPSGYRPIVVLSEAGKLLERVVAGRLVRHLENVGPNL FT ADSQYGFRRGRSTLDAVQRVRDLSDQACSRGGVLLAVSIDISNAFNTVPWS FT TILESLRFHRVPTGLRNLIEDYLAGRAVVFPERRGWGHKAVSCGVPQGSVL FT GPLLWDVGFDWVLRGANLRGVQVICYADDTLVTARGDDYRSASILAAAAVA FT TVVARIRKLGLEVALHKSEAVCFHPARKGPPPGASITIGGTAIAVRSQLKY FT LGLVLDSRWSFDRHFDVLVPKLLGAAGALARLLPNIGGGGKSVRRLYLGVV FT RSMALYGAPVWSPTLSARNAALLQRVQRVLAVRVVRGYRTISTEVACALAG FT SLPWEYEAEVLAAVYRRRAQSLGRGSVPRLSVVARWRRAARRVATLKWKER FT LAAEEIHRGSRTRRRTLAALVPVLEAWIDRRHGVLDFRLTQVLSGHGCFGR FT YLWRVGREPHPGCHQCGHPDDDAQHALEACPRWEYPRQSLVAVLGADLSLP FT VVVSRMVEDERCWRAMADYADLVMTLRETEERERECDPNSVALRRKRRGGR FT RGRRAPAQLP" XX SQ Sequence 6702 BP; 1208 A; 1977 C; 2318 G; 1199 T; 0 other; cccggcccgg gacctgggcg ggccccccgg cgcgcactca gcgtggccgg gggctccgtc 60 tgtgggggag ttttgctgga cttccaccgg gcgcgtccgt aggtggcgga taacgggctt 120 ctgggagcag tcacactccc aggggtatcg ctccttcttt ctcgtcgtgt cacggcgggg 180 gagattgagc gacgcggacc aaaaccagca cggggggttc cgggctctcc acgcccttca 240 ctggctgaag ggtgtttctt ctggggatgc ccgggcctcc tcgttcccat tttccccctt 300 cgtttttatc ttccccttta tgttattttg ttttattttc tgtttgtttt gccttctttt 360 gttttatttg tcttattggt tttacttttt cttttttcat ctttttgttt tggggccacg 420 tccgaactat tttcggcgtg acccatcgga ggtggcgggc cgctcttggt ctgccttcgt 480 cgttgagaaa gcgagccggc gttgtttgcg cttcttccgg ccgctggtcg ccatgtcccc 540 ggctccacag ctacagggga acgccctcca gaggggggta ccggtatacc ggcgtggctt 600 cgtggattgg tgctcactcg agcgtagggc gggggtctgt cgtgctgctc gttgcactcg 660 atggactgcc ggccaacccg taatcctcac cgtgtggggg acgggggccg ctgccgcgag 720 gtacggggct gcgaggacgt aaacctctat aaaaaatacc ccaatcttaa gctctggcgc 780 ccggcgatgg gatgcatggc tggtgggaga ccagtcgcga gggcgtccca ggggaccgta 840 cccctggcta ataaaaacag gaaataacat aatgaaaaaa tgtccagtta taaagaagaa 900 ttaccccagg agggtacctc ccgctccgcg gggggagaat ccctccgtgc ggccgagcga 960 tcggccgcac gctgcccctc gtcttctggg gggggcaata actgttcgaa gaggtccgtg 1020 aaggacggtg gtagggtgtc cgcgaaggac ggagaaaata aagaacgtag aggagggtcc 1080 gtgaaggaca acgaagcagc aaacgtcttg tcggacgata gcatggcgac ggccagctcg 1140 agccgttcgt cgttgaccgg aagccgaaga aagaggctga agggctccag cacggagtgc 1200 tcagagagca gctccggtga agagacggcg tgctcagccc gaagagccgc ctccaagacc 1260 ccgtcaaaga ggggcagagg aagaccgccc acaaccgggc agtacgttgg cctcgccgcc 1320 gccaaggagg cgtatgtgaa ggcccaacga gaggagctgg cgctgcgcga gagagaggtc 1380 accgagagtg tccgaaagct gcgagtgagg gaagacgccg tcgcgggcag cacgggggaa 1440 ccagcactcg accccgaacg tcgactagac gagtccgcga agctggccgt agcaatcgcg 1500 gccaagtctg ggaacctgaa gggcacatac gtacgtgccc tgaaggagat ggctgccgcc 1560 attcgggagg ccaaggaaga tatggcagca aggactaccg acggtgagac gaagaggctg 1620 caggagctct gcaggaggca ggaggccgag atcctgcacc tcaagaatgc gatcgccgac 1680 atgaggtcgg agatggcgcg cctggcccag gccgtgggat cgccggcagc cgtccccgcc 1740 cccgtacccg cccccgcccc gatccaacgg agcgacgacg aggaggagcg gcgcctccag 1800 cgcatcatgc gtgcggtcgg caccatgctg gatgcgcgcc tctctgggct ggaagcgcga 1860 ctgcctccag agcccaggat gagaccaccc ctggccgcag accacaggag aagccgcggt 1920 caagaaaaga cgccgacacc accaacggca gcagaatcaa tgccgacgcc cggaccaaca 1980 gctgcagtcg cgacggcggc acccaacggt ggtggcccgc agaaaaagaa ggcgccgaaa 2040 agcaggcagc agccggctca accagaacca cggacattcc cccccgcccc agctgctctg 2100 acggaagcct ggacgacggt agcccgaaga ggcgccaagc cgaggacggc gacggcgacc 2160 ggcaccgtgt ccgctccacc gcccctgaag gaaaagaaga agagactccg gccaccccgc 2220 tcgcaggcag ttattgtcaa gctgcagccg gaggccgtcg agcgcggagt cacctatcgg 2280 gcagtcctcg ccgaggccag ggctaaaatc gatacggcag agctggggat ccctatccag 2340 cggatacgct cggcggtgac gggagcaaag gtattggtcg tagacggcgc cgaccagaac 2400 gaaaaagctg atttgctggc ggagaaatta cgggaggttt tgccctcgga cagcattgtc 2460 gtctcgaggc cgacaataac ggccgctgtc agactgagcg gcctcgacga atccatatcc 2520 cgggaggagc ttgttgccga ggtcgccagg attggagagt gcccccccga tattgtgaag 2580 gttggggaaa taaaaatggg ccccggaggc actggccagg tcttggtcag atgccccata 2640 gcgggtgtca aaaaaatatt ggccgtcaac aagctgcgga tcgggtggag cgtccttcgc 2700 gtgcaactcc tcgaggctag gcgcctgcag tgctaccgct gccacgcact gggccacgtg 2760 agtgcccgtt gcccatcgtc ggtggaccgc agtggtgagt gctaccgctg cggccagacc 2820 ggccacaagt ccgcgggctg cgcgctcacc ccgcattgta caatctgcgc cggcgccggt 2880 aggcctgcag cgcacgtctc cgggggcaag gcttgcgcca agcccccgaa acaaaggcaa 2940 cagaggagta gcgccgccga agagaaacgg cggagtcagc ccggtacggc tacggcgaca 3000 ccaatggacg acgaataatg accagcagcc cttatcatat actacagggc aacctcaatc 3060 gctccgccag agctcaggac ctgctgatcc agagcatggc ggagcggttg acccatctgg 3120 cggtcgtcgc cgagccctac cgggtccctt cggttcccga ctgggcggga gatattgatg 3180 gcctggtggc cgtggtccag cgtaggtcag cggtgggcgc tccgcccgaa ttcgacgtcg 3240 tacagagggg tcggggcttc gtcgcggtct tctgggcggg cttgctcgtc gtcggggtgt 3300 acttttcccc caaccggccg ctcgccgaat tcgagtcctt tctcgacgaa ctcggtcagg 3360 tcgtggggag gtcgcgctca aggcgtgcgc tcgttctcgg ggacctcaac gcgaagtcgt 3420 ccgcttgggg ttccccggtc acctgtccca ggggccggga gacggaggaa tggttggtcg 3480 ggagcggtct cgttgtcctc aaccgcggcg ccgagaacac ttgcgtccgt cgttcgggtg 3540 ggtccgtggt ggatgtgtcc tttgcgaccc ccgacgtcgc gcgccgcgtc tgcggttggg 3600 aggtgttggt cgacgtggag acgctctccg accaccgcta cattggtttc cgtgtggccg 3660 cggccccgga gtcttcttcg gtaacgactt tgccctttgg tggcggcgtg gagggcccgc 3720 gctgggccct gaagcgcctc gacacggagc ggctgcagga ggcggccgtc gtccaggcct 3780 ggcggttgga ctccctgggc gagccggcgg acgtgcgcga gggtgcggag cggctgcgcg 3840 aggcgatgtc gcgagtctgc gactacgcca tgccccgcgt ccgagcgtac gcgccgaggc 3900 gccaagtcta ctggtggaac gaggggatcg ccggcctgcg acgccgatgc gccggtagtc 3960 gccgaaaata tcaacacctt cgtcgtcgga ggcgccggga cgaggaagaa gaggaccggg 4020 cgtacgagga gtaccgggag gacgtgcggg ccctgcgcgt cgccattgga gaggcgaagg 4080 aggcggcgtg gaaggagctt ttggagacgc tggaccgcga cccgtggggg cgaccgtatc 4140 gcctcgcgcg gagcacaatg cgctcttggg ccccgcccgc gacgagcacc ctgccgcctg 4200 acatcgttcg gcacgttatt gggggactgt ttccggacgc acctggaact cccttcgttc 4260 cccccgtcat gataacaccg acaaccggag ctcagattgg tgcagaagag gaggaggtgt 4320 cgccggcgga attcggtgcg gccatcgaaa aaatgaaggc gaggaggacg gcgccaggtc 4380 ccgatggact gtcgaatcgg gcctgggcgc tggcattaca aaccgagggt ggtctgggac 4440 ctgtcctccg agggctgctc agcaggtgcc tccgagaggg caggttcccg gaagaatgga 4500 agacgggtcg gttggtcctc atccccaaag aggggcgccc cagagaccag ccatcgggat 4560 atcgtccaat agtcgtgctg agcgaggccg gcaagcttct cgagcgcgtc gttgccggtc 4620 gcctcgtgcg gcaccttgaa aatgtcgggc caaatctggc cgactctcag tacgggttcc 4680 ggaggggtcg ctcgaccttg gacgcggtcc agcgcgtccg cgacctctcc gaccaggcgt 4740 gctcccgggg aggcgtgttg ttggcggtgt ctatagacat atccaacgcc ttcaatacgg 4800 tcccctggag cacgatcctg gagtcgttaa ggtttcaccg cgtccccacc ggtctccgga 4860 acctgatcga ggattacctc gcagggcggg ctgtggtctt ccccgaacgg agggggtggg 4920 gacacaaagc ggtctcgtgt ggtgtcccgc aggggtcggt tctgggtccg ctcctgtggg 4980 acgtcgggtt cgactgggtc ctccgcgggg ctaacctgcg tggcgtgcag gtcatctgct 5040 acgccgacga cacgctggtg acggcccgcg gagacgacta ccgttcggcg tcgattctgg 5100 ctgcggctgc ggtggcaaca gtcgtagcca gaataaggaa gctcggcctg gaggtcgccc 5160 tgcacaagtc cgaggccgtg tgttttcacc cggctcggaa ggggcctcct cccggcgcga 5220 gtataaccat tggagggact gcgatcgctg tccggtccca acttaaatat ctggggctcg 5280 tgctggacag caggtggagt tttgaccggc attttgatgt actggtcccg aaattgctcg 5340 gggcggcggg tgccttagcc cgcttgctcc cgaacattgg aggaggagga aagtcggtcc 5400 ggcgtcttta cctgggagtg gtgcggagca tggccctgta cggcgctccc gtctggtcgc 5460 ccacgctctc cgctcgcaac gcggccctct tgcagagggt tcagagggtg ctggcggtga 5520 gggtcgtgcg gggctaccgc accatttcga cggaggtcgc ttgcgccctc gccggttccc 5580 ttccgtggga gtacgaggcc gaggtcctgg ccgcggtgta ccggcgcagg gcgcaatctt 5640 tgggtcgggg aagtgtgccc cgtctgtctg tcgtcgcccg atggaggcgt gctgcgcgtc 5700 gcgtggcaac gttgaagtgg aaagagaggc tggctgcaga ggagatccac cgtggttctc 5760 gaacccggcg gcgcaccctc gcagccctgg tgcccgtgtt ggaggcctgg attgacaggc 5820 gtcacggcgt gctcgatttc cggctcacgc aggtcctctc ggggcatggc tgcttcggga 5880 ggtacttgtg gcgggtcggg agagagccgc atcccggatg ccaccaatgc gggcatccag 5940 acgacgacgc gcagcacgcc ctcgaagcgt gcccgaggtg ggagtatccg cggcagtcac 6000 tcgtcgcggt gctcggggca gacttaagtt tgccggtcgt agtgtcccgc atggtcgagg 6060 acgagcgttg ctggagagcg atggccgact atgcggacct ggtcatgacg ctccgcgaaa 6120 cggaggagcg agaaagagag tgcgacccaa actcagttgc actcaggcga aagagacggg 6180 gtgggcggcg cgggcgaagg gcccccgctc agctcccgta gggaccgtcg ggcgtcgggc 6240 gcgggggcgc ctgatgctcg acacgatagt ccagtggtgg tggtgagggt atagggcgct 6300 gtggctccgg tgcctacctc acgaagaagt tgcggtcagc aatggccgac gttgatcccg 6360 ccgttcgtca ggctgggagc cggtgtgggg ggcctgcggg gcgcgtttcc ctgtggtatc 6420 gtaggttccc cctatgccgg aaagatagga gctcgttggg ttttagtcgg tagtcgttaa 6480 gctgggcggg ttcggcgcga gctgaactca gcccagcgcg cctttttcaa ggcgtagtct 6540 ccgtggacta attggtcgag ggcgcggacc tcggttcgcg acttcttcct gtttcttcca 6600 ccggaggcgc ggagtccgac ataacccggt ccgacccccg tcggccgggt atccgtaaag 6660 actgggattc cccatcgata ccaaaaaaaa aaaaaaaaaa aa 6702 // ID LIRP1 repbase; DNA; INV; 218 BP. XX AC L42476; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 2.02, Last updated, Version 3) XX DE Leishmania infantum DNA repeat. XX KW LIRP1; Repetitive element. XX NM LIRP1. XX OS Leishmania infantum OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania donovani species complex. XX RN [1] RP 1-218 RA Piarroux R., Fontes M., Perasso R., Gambarelli F., Joblet C., RA Dumon H. and Quilici M.; RT "Phylogenetic relationships between Old World Leishmania strains RT revealed by analysis of a repetitive DNA sequence."; RL Mol Biochem Parasitol 73(1-2), 249-252 (1995). XX DR GenBank; L42476; Positions 1 218. XX SQ Sequence 218 BP; 56 A; 70 C; 64 G; 28 T; 0 other; gcaagaatca agaggcggtg tcacagagat gggcgaaggg ggacggcggg agcggcaaag 60 agagcgcggg cacacagcga cgtccgtgga gagaaaaaaa gagagagaca cacgcgtatt 120 cccttctgct aatgtgtacc cgcctctctg ccacagatca cgaggtcagc tccactccac 180 cctaacgcct cccccgcgca gccctgtcac acgctccc 218 // ID BEL-2-I_HM repbase; DNA; INV; 5361 BP. XX AC . XX DT 02-JAN-2009 (Rel. 14.02, Created) DT 02-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5361 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 432-432 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 51..5330 FT /product="BEL-2-I_HM_1p" FT /translation="MDQERNYQREINMLNINIKCIERNTASSDKSSIILAL FT DKITNLLARFEMAKDELTRKMLDDGESIDTVEEWLSKPIAEVEAANAIKNE FT MVNKLDLINKNECVKRFEYEKQFMEHQLTIQKEAEQASLRKLQNEEEWYLK FT KLKTKETLRKSVGNNNEAVQQSVKLQKYTITKFDGDYKDWLRFWNQFTVEV FT DNSNISNISKFNYLMELVEGKPREDILGLPHSLEGYEEAKRILQDTYGKDI FT WIHKALIKDLEGMMAIHNTHKIKEVHDFYNKLARTVRTLKTMHKLQTAQSF FT VYSLMDKLGPVREILTQNDDGWEEWGLEQLVEKLQKYIERNPLNSDFNTEQ FT IKYDKKNPNVQSRSYENRISAFKQHNSGNYDKSMMTSVGKNECIYCGLRNH FT KSESCIKVLNVARRKEILKNKRLCYNCIGYGHSAANCQSRGCRKCNKKHHT FT SICQNVNATMDDKKETMGTVFNASTTVHPTVMAKVNGEKARIMLDTGAGSS FT YICTNLITKLKLKPTRREHKTIEQLYGTVNKRVEIYNVTLESITIPEFHIK FT IECINAEKEILTFLPNPMIKEVKKQFARLRRLKFSNDGEDDEIQPVHIILG FT AADYQRIKTTEPMVLGKNPDKDPGAELTMFGWTLSGKQMELSSGIEKGFLL FT NTGRDEFEQMCSLEVLGLSDTNNESMFHQDFSEKLHQTNEGYYETRMPWKK FT DVVKLPNNRDLAIKRLKSTTNRLEKLNKLEQYNEIMMEQISEGILEKVPEK FT PTGEIVHYIPHQAVIKENAESTKMRVVYDCSARKDAQSPSLNDCLEVGPSL FT QPLIFDILIRNRMNKLCVLADIKKAFLQIRIQDMDRDAQRLIWYEDLKKMK FT LMELRFTRVIFGSSSSPYILGATIKKHISKYKDIYPKTVVALEEDTYVDDL FT QAGGETEEELIRFKYESTQILNEAGFQLHKWHSNVRELENNVEDKSTKILG FT HPWNKETDQLSIEFTTCINNGNKENLTKRKILSAINSVFDVLGFSAPVLIT FT GKILYSRLCLSKIGWDHQLPKELLEEWQTWIKTISNKRTILIPRCVVSGRI FT IELELHGFSDASKLAVCACIYVVTSHQNSKTSNLLVAKARIAPKDLSIPRL FT ELVAAHTLSKLMNHVRKTLVNYNISKVFNWVDSTAVLYWLKERGSWSQFVR FT NRVQQILLNGEVKWLYVPTKENPSDLGTRGVSPEKLSSLWFNGPIWLSYKE FT NWPNQSEIFDNSYGTLCESIALKENVWMVTEEIKENRTFEELWNKYNYWKI FT LRISSFIKRFIYNCRNKEKIVGPIKTEEMIEAETVNIKHLQGSVVLKADVE FT LKQDEKNLWRCHGRVTGYSPIFIPKGFLLTLRIIEHHHTKTLHGGVGDTMG FT SIRERFWIPNLRVAVKKVIRSCNLCKRYRVKPLLPPTKAMLPHFRTDNVEP FT FAVSGVDFAGPLKYKVPKNSIKKCYVALFTCASTRAVYLKLCHDLSAVEFQ FT RVLKEFVARKGPPQMIISDNAKTFVATGKWLLTLKNDENIANYLAIQAIKW FT RFNLSRAPWWGGLFERLIGIMKKSLSKTIGKGMLTFNELEEVLLDVECSMN FT NRPLCYQGDQFDNQVLTPNVLMRGKPAILLEEDIALIAKEEYAIRRAKFIK FT NCKTHLRKRWMNEYVHAMEERQQVRNKGSNIKLPVVGSVVLIKEDVKNKAL FT LNIGRVESEIKGKDGVTRGLKIRLGNGYVIERPIQLVCDLEIDFKAESMIE FT SKKNVEIGKKVETLKRVPRRAKLDARQKISCIMEDEMAD*" XX SQ Sequence 5361 BP; 2067 A; 831 C; 1133 G; 1330 T; 0 other; ctaaaaggtg gcgaccctac caggaccaaa ggaacctaca aataatcgaa atggatcaag 60 agcgaaatta tcagagggag atcaacatgt tgaatataaa cattaaatgc atagaacgca 120 acaccgcaag cagcgacaaa tcgagcataa tcttggcttt agacaaaata acaaatttgt 180 tggcaagatt tgaaatggca aaagatgagc tgaccagaaa aatgctggat gacggcgaaa 240 gcattgatac ggttgaagaa tggttatcga aaccaatagc ggaagttgaa gcagcgaatg 300 caattaaaaa cgaaatggta aataagttgg atctaataaa taaaaacgaa tgcgtcaaga 360 gattcgagta tgaaaaacaa tttatggaac accagttaac aatccaaaaa gaagcggaac 420 aagcttcatt gcggaaactt caaaacgaag aagaatggta tctaaagaag ctaaaaacaa 480 aagaaacatt gcgtaaatct gttggaaaca ataacgaagc tgttcagcaa tccgtcaagc 540 ttcaaaagta tactattaca aaatttgatg gcgattataa agactggtta cgattttgga 600 atcaattcac ggtggaagtc gataactcta acatatcaaa cattagcaaa ttcaattact 660 taatggaact cgtcgaagga aaacccagag aggacatact tggtttacca cactcgttgg 720 aaggatacga agaagccaag cgtatattac aagacactta tggaaaagat atttggattc 780 acaaagcatt gattaaagat cttgaaggaa tgatggctat acacaatacg cacaaaatta 840 aagaagttca cgatttttat aataaacttg caagaacagt gcgaacatta aaaacgatgc 900 acaaactaca aactgctcaa tcatttgtgt attcattaat ggataaactt ggaccggttc 960 gtgaaatact cacacaaaac gatgatggat gggaagaatg gggactggag caactcgtcg 1020 aaaagttaca aaaatacata gaaagaaatc cattgaactc cgatttcaac actgaacaaa 1080 taaaatacga taaaaagaac ccaaacgtac agtcaaggtc ttatgaaaac cgaatcagtg 1140 cattcaagca acataatagt ggtaattatg ataaatcaat gatgacaagt gttggtaaaa 1200 atgaatgtat ctactgtggg ttgcgaaatc acaaaagcga aagttgtata aaagtgttaa 1260 acgttgctag acgcaaggag attttaaaaa acaaaagatt atgttacaac tgcattggtt 1320 atggacattc cgcggcaaat tgtcaatcaa gaggatgcag aaagtgtaac aaaaaacatc 1380 acacgtcaat ttgtcaaaat gtaaatgcaa caatggacga taaaaaagaa accatgggaa 1440 cggtgtttaa tgcgagtaca actgttcacc cgacagtaat ggcaaaggta aatggtgaaa 1500 aagcaaggat tatgttggat acgggtgctg gaagttctta tatatgtaca aacttgataa 1560 caaaactgaa gttgaaacct acacgaaggg aacataaaac catcgaacaa ttatatggaa 1620 cggtaaacaa acgtgtggaa atatataatg ttacgctaga atcaataaca ataccggaat 1680 ttcacatcaa gattgaatgc atcaacgcag aaaaggaaat actgaccttt ttacccaatc 1740 cgatgattaa ggaggtaaaa aagcaatttg ctagacttag gagactgaaa tttagtaatg 1800 atggagaaga cgatgaaata caacctgttc acataatact tggtgctgct gattatcaac 1860 gaataaaaac taccgagccg atggttctcg gaaaaaatcc ggataaagat cccggagctg 1920 aattgactat gtttggatgg actctctctg ggaaacaaat ggaattaagc tctggtattg 1980 aaaagggttt tttattaaac actggtcgtg atgagttcga gcaaatgtgt agtcttgaag 2040 ttcttggatt gtcagacaca aacaatgaat caatgttcca ccaagacttt agcgaaaagc 2100 tgcaccaaac aaacgaaggt tattatgaaa ctagaatgcc ttggaaaaaa gatgttgtta 2160 aactgccgaa caacagagat ctggcaatta aaagattaaa aagtacaaca aatcgattgg 2220 aaaaacttaa caaactagag cagtataatg aaataatgat ggaacagatc agtgagggaa 2280 tactggaaaa agtgccagaa aaacctacgg gagaaattgt tcattacatt cctcatcaag 2340 ctgtaatcaa ggaaaatgct gagtcaacaa aaatgcgagt tgtttatgat tgttcagcaa 2400 gaaaagatgc tcaatcacct tcattgaatg attgtctaga agtgggacct tcgctgcaac 2460 ctctaatatt tgacattcta atacgaaatc ggatgaacaa actctgcgtt ttagctgaca 2520 ttaagaaagc atttctacaa atccgaattc aagatatgga tagagacgca caacgattaa 2580 tttggtatga agacttaaag aaaatgaaat taatggagtt acgatttaca agagtaattt 2640 ttggttccag ttcaagtcct tacattctcg gcgctacgat taaaaaacat atatcaaaat 2700 acaaggatat ttatccaaag acagtagttg ccttggaaga agatacttat gtcgatgatc 2760 tgcaagcggg tggagaaacg gaagaagaac ttatcaggtt taaatatgaa tcaacacaaa 2820 tcttgaacga agctggtttt cagttgcaca aatggcatag caacgtgaga gaattggaaa 2880 ataatgttga agacaaatca acaaaaatac ttggacatcc ttggaacaag gaaacagatc 2940 aattgtcaat tgaattcact acttgcatca ataacggaaa taaagaaaat ttaacaaaaa 3000 gaaaaattct atctgcaatc aacagtgttt ttgacgtatt aggatttagt gcacctgtgt 3060 taatcactgg aaaaatactg tatagtcgac tatgcttatc aaaaattgga tgggatcatc 3120 aactaccgaa ggaacttttg gaagaatggc aaacttggat taagacaata agcaacaaaa 3180 gaactatatt gattcctcgg tgtgttgtgt ccggaagaat tattgaacta gagctccacg 3240 gattctcaga tgcaagtaaa ttagcggtat gtgcatgcat ctatgttgta acgagccacc 3300 aaaattcgaa aacgagcaac ttacttgtag caaaagcacg gattgcacca aaggatctaa 3360 gtattccaag actcgagcta gtagctgcac ataccttaag caagttgatg aaccatgtca 3420 ggaaaacatt agtcaattat aacattagta aagtttttaa ttgggttgac agcactgctg 3480 tactttattg gctgaaggaa agaggcagtt ggtcacaatt tgtaagaaac cgagttcagc 3540 aaatcttatt aaatggagaa gtaaaatggc tttatgtccc aacaaaagaa aaccctagcg 3600 atcttggaac aagaggagtg agtccggaaa agttgtcaag cttgtggttc aatggtccaa 3660 tatggttgag ttataaagaa aattggccaa accaatcgga aatatttgat aactcatatg 3720 gcacattatg tgaaagtatt gctctaaagg agaacgtctg gatggtaaca gaagaaataa 3780 aggaaaatcg cacttttgaa gaattgtgga ataaatataa ttattggaag attttaagga 3840 tatcgtcgtt tataaaaagg ttcatttaca attgtcgcaa taaagaaaaa atagtcggac 3900 caataaaaac ggaagagatg atagaagccg aaacggtcaa tattaaacat ctacaaggat 3960 ctgttgtgct aaaagccgat gtggaactta aacaggatga aaagaatctt tggaggtgtc 4020 atggaagagt tactgggtat agcccaatat ttataccaaa agggttctta ctaactctac 4080 gaattattga gcaccatcac accaaaacct tacacggagg agtcggagat accatgggta 4140 gcataagaga aagattctgg ataccgaatt tgagagtggc cgttaaaaaa gttattcgta 4200 gttgcaattt gtgtaaaagg tacagagtga agcctctgtt accaccaaca aaagcaatgt 4260 taccgcactt tagaactgac aatgttgaac catttgcagt tagtggtgtt gattttgccg 4320 gaccgttaaa gtataaagta ccaaaaaatt cgattaaaaa atgctacgtt gcacttttta 4380 catgcgcaag cactagagcc gtctatctta aactttgtca tgacctatca gcagtagaat 4440 ttcagagggt tttaaaagaa tttgttgcga ggaaaggacc acctcagatg ataatcagtg 4500 acaatgcgaa aacatttgtg gcaacaggga agtggttact tacacttaaa aacgacgaga 4560 atattgccaa ctatcttgct atacaagcta taaaatggag gtttaatcta tcaagagccc 4620 cttggtgggg aggtttgttt gaaagattga ttggaataat gaagaaaagc ttatctaaaa 4680 ctatcggtaa agggatgctt actttcaatg aattagagga agtattgttg gatgttgagt 4740 gctcaatgaa caatagacca ctttgttacc aaggtgacca gttcgacaat caagtactga 4800 caccaaacgt attaatgagg ggaaaacctg caatattgct tgaagaagat attgctttaa 4860 ttgcaaaaga ggaatatgca atacggagag ccaagtttat aaaaaactgt aaaacccatc 4920 tgagaaaaag atggatgaac gaatatgtac atgcaatgga agaacgacag caagtaagga 4980 ataagggaag taacataaaa ctgccagttg ttggaagcgt ggtacttatt aaagaggatg 5040 taaaaaacaa agcacttcta aatattggac gagttgaaag cgaaatcaaa ggaaaagatg 5100 gagttacgcg tggcttaaaa atacgattgg ggaacggtta tgttattgaa cgaccaattc 5160 aactggtgtg tgacctggaa atcgatttta aggcagagag tatgattgaa tctaaaaaga 5220 acgtggaaat tggtaaaaaa gtggaaaccc taaaaagagt accgagacgg gcgaaattag 5280 atgcaagaca aaaaatcagt tgcatcatgg aagacgagat ggctgattaa agccgaagtc 5340 tcagctttaa tcgggggagt g 5361 // ID Gypsy2-LTR_Dya repbase; DNA; INV; 274 BP. XX AC chr2L; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_Dya; KW Gypsy2-I_Dya; Gypsy2-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1037-1037 (2009). XX DR Genome; chr2L; Positions 21202609 21202336. XX SQ Sequence 274 BP; 75 A; 53 C; 51 G; 95 T; 0 other; tgtgggattg gagtaagttg gcattatcat gcagcctaga tgtttgggag tgcacacact 60 gtttcctata tcagccagat cctaacagca ttgccaacag tctctctctc ccatggtgtc 120 tctgaagtaa ttcagtcgcg gcagaatttt taacgaagtc agtcgatgcg tttattagtg 180 cgagagtcgt agttcttttc atttttataa ttattaattt caaactttgt gaacaatacc 240 ctaaataata aacttaattt gtattatccc taca 274 // ID BEL1-I_AP repbase; DNA; INV; 6061 BP. XX AC Contig29878; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1AP; BEL1-I_AP; KW BEL1-LTR_AP. XX NM BEL1-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-6061 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 429-429 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR Genome; Contig29878; Positions 1577 7637. XX CC Positions [4887-5480] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 577..1818 FT /product="BEL1-I_AP_1p" FT /translation="MEGLNTLTARRGQIRAIVTRFQSFIRSPECDLKQIPL FT RQVKIEEAWQNFELVQTAIEELEINNDTNTDHSQYRIDFENLYFETVAEAE FT QKINSTRSNQIDTTVQSSDRGESINSTSSIIKLAALNIPVFNGSYNDWVSF FT KDIFTALIHTNSNLTPIQKFFYLRSSLTSDAANCIKNFETTAINYEHAWKT FT LTTRYQNEKLLIQCHVKDVCELNAVKANSSDSLRLFSDTLRSHISALEALK FT QRPSEWGPLLIHIICTKLDANSLTEWEVKSPKTEIAKVEDLMIFLDERSQI FT LQAVESSKNLMNTTSEINNSTCKKNSYNKSRTASTVLTTTSDVTCFICNLK FT HTIYKCPTFLALSINDRIKKVNEIKLCKYYACVNMILKKNVCHGIVLNVAR FT HTILCYIFLKIKLTMRKKRK" FT CDS 2196..5840 FT /product="BEL1-I_AP_2p" FT /translation="MSNIGHIPQNVSLADPLFWSPQKVDLLIGVSHFYDLI FT NERQIKPAPDGPVFQETRLGWVVSGPTSTAPKYNRNTEPITNSSCHLSLHN FT DATIENMLPRFWCVEEFERKDAYTIEEKMCKNYFDKTVTRGVDGRFIVHLP FT FRENVIDLGSSYDIAKRRLLNLERRFTNNPKLKNEYTNFINEYIQLGHMEQ FT IHDDQAVSDNKKACYLPHHAVFKETSTSTRLQVVFDASCKTSNGVSLNDIL FT LKGPVLQDDLIYILARFRTHNFVLSADITKMYRQFWVADEHRKFQRILWRT FT EPHEAIKTFQLKTITYGTVPASFLATGCLHKLADTQNIDDSISTVIKRDFY FT MDDFLGGTSSFESAIKLRDGLIKTMQSAGLELRKWASNNDNLIKGISKNQD FT NINTTVSISDDNSITKVLGLFWNSATDTLQFKVQQNNHTSNDQFTKRKILS FT EIACLFDPLGLVGPAIIQAKIMLQQLWRLKIQWDEPLPNDVQKQWNAYRTS FT LCVLNGLIIPRQITCESDIVNLQIHGFADASINAYGCCLYLRCTIASGRHT FT SKLICAKSKVAPLKCISLPRLELCAALLLSRLASRIIPKLNLKISKSYFWS FT DSSIVLAWITSPSNKWKTFVAHRVGEIQERTSISDWSHVDTKENPADIISR FT GCCPSKLESMSLWWFGPVWLIKNELKYPILNKIVNPTELEIPEVREVTMSN FT VCTNEQNDMLSLINNYSSINKLIHVIAYCLRFHYNTLLYKRLPSRPKLTGS FT LSMEEIKHARIIIIKGIQKKSFYREIQDLNKLKNVNASSKLFRLCPFIDDD FT GLIRVGGRLKNAASIDVYQRHPIVLPAYNHFTSLLFKYEHEQCMHGGPQAT FT LSSIRLQYWPLNGRNIARSTVHKCIKCFRYKPVVAQPIMGQLPADRVEPAR FT AFLKCGVDFAGPFLIKSSLRRNASVTKGYVCIFVCFTTKATHMELVSDLST FT PAFIGALNRFFDRRGKSSVIYSDNGTNFVGANHKLRVWYDLFQAEQHKKSI FT DDFLIQKGVQWKFIPPRSPHFGGLWEASVKSMKNLIQKTLGDARLTYEEFI FT TVLTRAEACLNSRPLTPISTDPNDLSTLTPGHFLIGDSLLAIPEPDISDVK FT INRLTRWRRLTHYSQIIWKKWSREYLNQLQERKRWAGEKGPRIDIGTVVLV FT RDDNISPLGWKLGLVTNIQRGTDNVIRSAEVRVGKGCFTRSVRNLCPLPFD FT ENIPK" XX SQ Sequence 6061 BP; 2063 A; 1101 C; 1140 G; 1757 T; 0 other; cttaaatcga agctttaaat aaattgctag tgtatcacgg caacatttgg tccattcgtc 60 cggatttgaa tgagcaacgg agcacgtgat cgatcaacga aaaatcattc gaaaagtgcg 120 ttattaaagt aaaagtgcca ttccatcgat cgacgccatc gcttatgaca cagaatttac 180 ctactactac caaaatcgtg aaagtgctag ctgttgctgt acccaaagtc tacgcgtaag 240 gttagcaaac cagcccacca accaccatcg aaggattcaa gttgttaagg taagagagtc 300 actttttata tttttccttt tttcccgtat tatttaaaag ttttcccgtc tcgttactta 360 aataaaatgg taattcccag tcaggctagt gagtcgagta cagtcgcgat atcaaagtaa 420 taaatattac gatatctttg tcaaaacgtc aacccatgtt cggataacat cgtattagtc 480 aataggtaca atataataat ataaccttag ttatctacac gggaatatat cgaatagcac 540 agtataaata tttttatcgt cgtcgtataa agtaaaatgg aaggtcttaa taccttaaca 600 gcacgacgcg gtcaaataag ggcaatagta acacgatttc aaagttttat aaggtcaccc 660 gaatgtgatc taaaacaaat acctctgcga caggtcaaaa tagaagaagc ttggcaaaat 720 ttcgagttag tccaaacagc gattgaagaa ttagagataa acaacgatac aaatacagat 780 cattcacagt ataggattga tttcgaaaat ttatattttg agacagttgc tgaggctgaa 840 caaaaaatta attcaactag gtcaaaccag attgacacca cggtccaaag ttccgatcgt 900 ggagagagca ttaattcgac atcatcgatt ataaaattgg cagcgttaaa tataccggtg 960 tttaatggta gttataatga ttgggtatca tttaaagaca tttttacggc attgattcat 1020 acaaatagta atttaacacc aattcaaaaa tttttctatt tacggtcttc tctaaccagc 1080 gacgcggcaa attgtataaa aaattttgaa acaacggcaa taaattacga gcacgcttgg 1140 aagacgttga ccacgcgtta ccaaaatgaa aaattattaa ttcaatgtca tgtaaaggat 1200 gtatgtgaat taaacgcagt taaagcaaat tcgtcggata gtttacgcct gttctcagac 1260 acactacgta gtcatatttc ggccctcgaa gcattaaagc aacgaccaag cgaatggggc 1320 ccattactca tacacataat atgcacaaaa ttagacgcaa actcgctaac tgaatgggag 1380 gtaaaatcac caaaaaccga gattgctaag gtcgaagatt taatgatatt tttagacgaa 1440 cgttcgcaaa tcttacaagc cgtagaatca tctaaaaatc ttatgaatac cacatcggaa 1500 attaataaca gtacatgtaa aaagaatagt tataataaat cacgtacagc ttctacagtg 1560 ctcactacca cctcagatgt aacctgcttc atatgcaact taaaacacac catatacaaa 1620 tgtcccacgt ttttagcgtt atcaataaac gatcgcatta agaaagtaaa tgaaatcaaa 1680 ttatgtaaat attatgcctg cgtaaacatg attttaaaaa aaaatgtttg tcacggaatt 1740 gttttaaatg tagcaagaca cacaatactt tgttacatat tcctcaaaat aaaattaaca 1800 atgaggaaaa aacggaaata gaagagaaaa caaaccgttc atcgatccaa actgacgcta 1860 acgcgtcaat aagtgcgcat gtgcatggca ccaactatga acagatactt ttatcaaccg 1920 cgattgttcg agcatttgga gaaaatcaaa aatcatcatt atgtcgcgca ttattagact 1980 caggttcgca aagcaatttt attacagagg aattggttca atgtttaaaa ttacgaagga 2040 cgaaaactta tcaccaaatt ggtggtatcg gctcaacgac gcaacacgca tactcatacg 2100 taatcgcaca ttttaaatcg agatttaacg actatagctt cacattaaaa ttgttggtag 2160 tacctaagat tacgagcgag ataccgtcaa aacagatgag taatataggt catataccgc 2220 agaatgtaag cttggccgat ccgttatttt ggtcgccaca aaaggttgat ttattaatag 2280 gtgtgtcaca tttttatgat ttaataaacg aacgtcaaat caaacccgca cccgacggcc 2340 ctgtttttca agagacgaga cttgggtggg tagtgtcagg acctacgtcg acggcaccaa 2400 agtataatcg caatactgaa ccgataacta acagctcgtg tcacttatca ttacataacg 2460 acgcgacaat agaaaatatg ttacctcggt tttggtgcgt ggaagaattt gaaagaaagg 2520 atgcatatac aattgaagaa aaaatgtgca aaaattattt cgacaagacg gtgacaaggg 2580 gcgtagatgg ccgatttatc gtacatctac cttttcgtga gaacgtaata gatttaggta 2640 gttcctacga catagctaag cgccgtctct taaatcttga acgtcgtttt acgaataacc 2700 cgaagttaaa aaacgaatat actaatttta taaatgaata tatacagcta ggtcatatgg 2760 agcaaataca cgacgaccaa gctgttagcg ataataagaa agcatgttat ttaccgcatc 2820 acgcggtctt caaagagaca agtacgtcaa cacgtcttca agtagtgttc gacgcatcat 2880 gcaagactag taacggtgtt agcttaaatg atatactatt gaaggggccc gtcttgcagg 2940 acgatttgat atatatactt gctcgctttc gaacacacaa tttcgtatta tccgcggata 3000 ttacaaaaat gtataggcaa ttttgggtag ccgatgaaca tagaaaattt caacgtatat 3060 tatggcgcac agagccacat gaagcaataa aaacttttca acttaagaca atcacttacg 3120 gcaccgtacc cgcatctttt ttagccaccg gatgtctaca taaattggcc gacacacaaa 3180 atatcgatga ctccatatca accgtcataa aacgtgattt ttatatggat gattttttag 3240 gcggcactag ttcgtttgaa tcagctatta aactacggga cggcttgatt aaaacaatgc 3300 aatcagccgg tttagaatta cgaaagtggg cctcaaataa tgataattta attaaaggta 3360 tatcgaaaaa tcaagataat atcaatacga cggtttccat tagcgatgac aattcgataa 3420 cgaaggttct cgggttattt tggaactcag ccacggacac tttacaattt aaagtacagc 3480 aaaataatca tacatccaac gatcaattta caaaacgcaa aatattatcg gagattgcgt 3540 gtttatttga tccgttagga ttggtaggtc ctgctattat acaggccaaa attatgttac 3600 agcagttgtg gcgactcaaa atacaatggg acgaaccact tccgaacgat gttcaaaaac 3660 aatggaatgc ttatagaact tcgttgtgcg tattaaatgg tctaattata cctcgacaaa 3720 ttacgtgcga aagcgatatc gtaaatctac aaatccacgg atttgccgat gcgagtataa 3780 atgcatatgg ctgttgttta tatttacgat gcaccattgc cagtggtagg cacacttcga 3840 aattaatatg tgccaagtct aaggtcgcgc cattaaaatg tatatcttta cctagattgg 3900 aattatgtgc ggcactttta ttgtcaagat tggcaagtag aataattcca aaactaaatc 3960 tgaaaattag taaaagttat ttttggtccg attcgagtat agttttagca tggattacgt 4020 ctccctcaaa caagtggaaa acttttgtgg cccaccgagt cggagaaatc caagaaagaa 4080 catctatatc agactggtca catgtcgata ccaaagaaaa tccagcggat attatttcac 4140 gtggttgttg tccgtcaaaa ttagaatcaa tgtcattatg gtggtttggc ccggtctggt 4200 taataaaaaa cgaattaaaa tacccgatat taaataaaat tgtaaatcca acggagttag 4260 aaatacccga agtccgagag gtgacgatgt ctaatgtatg tacaaacgag caaaacgaca 4320 tgttgtcatt aataaacaat tactcgtcta taaacaaatt aattcacgta atagcttatt 4380 gtttacgttt tcattataac acattattat ataagagatt accgtcaagg ccaaaattaa 4440 cgggatcact tagtatggag gaaattaaac acgcacgtat tatcataata aagggtattc 4500 aaaaaaaatc attctatcgc gaaatacaag atttgaataa attgaaaaat gtaaatgcat 4560 ctagcaaatt gttccgtttg tgtccgttta tagacgacga cggtctgata agagtaggag 4620 gtcgtttgaa aaatgccgcg tcgatagacg tttatcaacg acatccaatc gtcctaccag 4680 catataatca ttttactagt ttattgttta aatacgaaca cgagcagtgt atgcacggcg 4740 gaccccaggc aaccttatct tcaataagat tacaatattg gcctttaaat gggcgaaata 4800 tagctcgaag tacagtacat aaatgcatta aatgttttcg ttataaaccg gttgttgcgc 4860 agccaattat gggtcaatta cccgcagacc gcgtagaacc ggcacgcgct tttttaaaat 4920 gtggagtcga cttcgctgga ccgtttttaa ttaaatcaag tttacgtcgt aacgcgtctg 4980 taactaaagg ctatgtatgt atttttgtat gttttacaac caaggcgacg catatggagt 5040 tggtcagtga tttgtctaca ccagcgttta tcggtgcttt aaatcgattt ttcgacagaa 5100 gaggcaagag tagcgtaatt tattccgata atggtaccaa cttcgtcggt gctaatcata 5160 aattgagagt ttggtatgat ttatttcaag cagaacaaca caagaaaagt attgatgatt 5220 ttttaataca aaagggcgtg caatggaaat ttatacctcc acgttctccg catttcggcg 5280 gactatggga agcctccgtc aagtcgatga aaaatttaat acaaaagaca ctaggtgacg 5340 ctcgactaac gtacgaagag tttattactg tactcactcg cgctgaagct tgtttgaatt 5400 cccgtcccct tactcctatc tccactgatc caaatgatct gtcaacttta acaccaggtc 5460 atttcctaat aggtgactct ctattagcca tccccgaacc tgatatatcg gacgtcaaaa 5520 taaatagatt gactagatgg cgaaggttga ctcattattc ccaaataatt tggaaaaaat 5580 ggagtcgtga atatttgaac caactgcagg agaggaaaag gtgggcaggt gagaaaggac 5640 ctaggataga catcggtacc gtggtgttag tgcgagacga taatatttct cccttgggtt 5700 ggaagttggg ccttgtaaca aatattcagc gaggtactga caacgtcatc cgttcagctg 5760 aggtcagagt gggaaaaggt tgctttacac ggtcagtgcg gaacttatgc cctctacctt 5820 ttgatgaaaa tattcctaaa tagtcatatt attatataaa ttaccattta tttatttact 5880 ttgtattctg atattttttg tattatgacg atattctata gtatttaatg catattattt 5940 aattgcatag tataattatt aataatttat aatttttgat tttttttatg gttttgtaat 6000 ttagaatacc atttataata ttttatgttt tgttgaaaat gtttcaaggc gggcggtatg 6060 t 6061 // ID DNA8-72_AP repbase; DNA; INV; 413 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-72_AP. XX NM DNA8-72_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-413 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2008-2008 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 413 BP; 153 A; 49 C; 71 G; 140 T; 0 other; caggggtctc caaccttttt ttacggcggg ccaaaattga gatacgcttt ggcatcgcgg 60 gccaacattt ttagaggaat agatcggtta agaaattgaa gaatagataa gatttattta 120 gaaaaattaa tgtttgttta tatacttttt ttattttata acgagttttt ctattattag 180 ttagtgttag taattagtat aatataggta taaattacta ttataagctg ttacgaataa 240 caccaaaaga aaaaaactaa taaattaatt ttatttaatt aattcaataa attaaaaaca 300 atgaaaacta aatgaaaata atataataaa tatattattt tcgaaacatt aattagcttg 360 gcgggccggt gaaaaaccct tggcgggccg gattcgtagg ttggagaccc ctg 413 // ID BEL-602_AA-I repbase; DNA; INV; 6248 BP. XX AC . XX DT 13-JAN-2011 (Rel. 15.1, Created) DT 13-JAN-2011 (Rel. 15.1, Last updated, Version 2) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-602_AA_; KW BEL-602_AA-LTR; Pao_Bel_Ele218; BEL-602_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6248 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [2] (Consensus) XX CC Positions [5744-6301] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 149..2782 FT /product="BEL-602_AA-I_1p" FT /translation="MCADVTAGIENLPWTCKECLNTGTADKSQHREATVVD FT VSDASGKEPADSLDPRVDKRSSNNEQDQDSDQEEAEMQHQIELRQMRAKFE FT RQLQREKEKMVLKIRLEREMLQRKKEAEAEFHKMRNEMYKEFEGVMDPFVQ FT EEGAVGGTYVSNHVQLETDLEKAWKEKFKFPEQQGKSADDIRGTFPKSSTP FT ENAPLQRDTINNRQSAIENVLEESTPLKNPQVPTVKIDKPVLSHPPHPELS FT ANIPRNSALYQAPISTANIEVPATFNPSANQQGIRNQHQGEPELTKAQIAA FT RKGPFAKLPVFTGRPEEWPLFISSFNNGNAACNWTDLENLGRLQESIRGPA FT LEAVRSRLLLPESVPRVIDTLRLLYGRPEQLLHSLMLKARKADPPRIDRLG FT TFIHYGMVVQQLCDHLVASGLIDHLVNPMLITELVEKLPPSTKMEWVRYKR FT QQHVVDLRTFSDFLSIIVSEATEATLYTDFHVDSRPNRDKKEKKSKARDHE FT GFLNAHIGVQSPSQRSTNQHNIEPQQAIRKPCRGCNSLEHRTRSCEDFQRL FT TYSDRLKITEKWKMCQLCLNEHGQIRCRFKGHCNVGDCKERHHPLLHPPNP FT PMALSTNCHVHNSEHHPIIFRMVPIKLHHEGRTLDVVAFLDEGSSYSLMES FT AIADQLKLKGAWEPILVKWTAGMSRLERDSRSVDVSISATGSKEKYLLRNV FT HTVKELQLPEQRIRFAEVAARFKHLCGLPVADCLSGSPKVLIGLKHLHVYA FT PLESRIGNPGEPIAVRTKLGWTIYGPQGGENMATGFAGHHTVGDITELDLQ FT ELLRRHFTLEETGLDVKVLPESAEDRRAKELLEQTTMRVGKRFETGLLWRS FT DDPQLPDSFPMALKRVKSLERRLEK" FT CDS 4478..5542 FT /product="BEL-602_AA-I_2p" FT /translation="MEIPRCYFADVKASDYQDLELHVFADASEEAYGCVAY FT FRVLVKGEPRVALVSAKSKVAPLQYMSIPRMELLAAVLGARLAAFVKSNHS FT VQVQRVFYHIDSATVLSWIRSDHRKYKQFVAYRIGEILTITNPKQWSWVAT FT KHNIADVLTKWGKNGPPLATESEWVRGPDILYRLDQEYSQRDLPPPGVVEE FT LRAYHLFHEVSFHKALIDTTRFSRWTILVRSMACVFRFISNCHKRMKKQPI FT ETLQASPKLVKLLKRTFPSIKSPLKREEYQMAENFLWKAAQQEGFPDEMKT FT LLKNNRLPQSKLHLIERSSPLYRLAPFLDANGVIRMEGRAAHADFIPFEQR FT FPIVLPKGHDVT" XX SQ Sequence 6248 BP; 1812 A; 1438 C; 1557 G; 1435 T; 6 other; attctcaaaa atcaatgcga tgccatcatc taaggaaaag gaacataacc ccgaaagcac 60 cgggtatgat tgcgcgcttt gcgaaagacc gaatcatgcg gatagcaata tggtcagctg 120 tgagamgtgt cagagttggt tccactttat gtgcgcggat gtgactgctg gcatcgaaaa 180 cctcccgtgg acgtgcaagg agtgtttaaa cacaggtact gcggacaaat ctcaacaccg 240 tgaagccaca gtcgtcgatg tctcagacgc gtccggcaag gagccggccg attcccttga 300 tccgagggtc gataaaagat cgtcaaacaa cgagcaagat caggattcag accaggaaga 360 agccgaaatg cagcaccaaa tagagctgcg tcaaatgaga gccaagttcg agcgccagct 420 tcagagggag aaagagaaga tggttctaaa aattcgccta gaaagggaaa tgcttcaaag 480 gaagaaagag gcggaagcag aatttcacaa gatgcggaac gagatgtaca aggagttcga 540 gggtgtgatg gacccattcg tccaggaaga gggcgctgta ggaggtacct atgtgtcgaa 600 tcacgtacag ttggaaacgg atttagaaaa ggcgtggaag gagaagttca aatttccgga 660 acagcagggc aaatcggccg atgatatccg tggaactttt ccgaaatctt ctacacccga 720 aaacgctcca ctgcaacgag atactatcaa caaccgccaa tcagcaatcg aaaatgttct 780 ggaggaatcg acgccactca agaatccaca agtaccaacc gttaagatag acaagcctgt 840 cttatcgcat ccaccgcatc cagaattgtc tgcaaatatt cctcgaaact cggccttata 900 tcaggccccc atctctacgg cgaatataga ggtgcccgcg acgttcaacc caagtgcgaa 960 tcaacagggg attcgaaatc agcatcaagg agagcctgaa ctgaccaaag ctcagatagc 1020 ggcaagaaaa ggtccgtttg caaagctacc cgtttttacc ggtcggccag aagagtggcc 1080 gctcttcata agcagcttca ataatggcaa tgctgcatgc aactggaccg atcttgaaaa 1140 tcttggtaga ttacaagaga gtatccgagg gccagctctg gaagccgtaa ggagcagact 1200 gctgttgcca gaatcggtcc cacgagtcat tgatacgctt cggcttctct atggccgccc 1260 ggagcaactg ctccattcac ttatgctgaa agcaaggaaa gcggatccac ctcgtattga 1320 tcgtttgggt acattcatcc actatggtat ggtcgttcag caattgtgcg accacctagt 1380 agcatcggga ctaatagacc atcttgtcaa ccccatgctc atcacggaac tagtggaaaa 1440 gctaccgccc agtacgaaga tggagtgggt tagatataaa cgtcaacaac acgttgtgga 1500 tttgagaact ttttcggatt tcctctcgat aatcgtttcg gaagctaccg aagcaacgct 1560 ctacacggat ttccacgtgg atagtcggcc taaccgggac aagaaagaaa agaaatcgaa 1620 agctagggat cacgaaggat ttctcaatgc tcacatcggt gtccaatctc catcacaacg 1680 ctctacaaat caacataata ttgagcccca acaagcgata cgtaaaccgt gccgtggatg 1740 taacagtctc gagcatcgaa cgcgttcctg tgaagatttc cagaggctga cgtatagcga 1800 taggctaaag ataactgaaa agtggaaaat gtgtcaattg tgcctcaacg agcacgggca 1860 aatacgttgc cgattcaaag gtcactgcaa tgtaggagat tgcaaagagc ggcaccatcc 1920 cctccttcat ccaccaaacc ctcctatggc tttgtcgaca aactgtcatg tgcacaactc 1980 ggaacatcat ccaatcattt ttaggatggt tccaattaag ttgcaccatg aagggcgtac 2040 ccttgacgtt gtcgcctttt tggatgaagg gtcttcttat tcgctgatgg aaagcgccat 2100 tgcagaccaa ctgaaactga aaggagcctg ggaacccatt cttgtcaaat ggaccgcggg 2160 aatgagtaga ctggaacggg actcaaggag cgttgacgta tcaatttcgg ccaccgggtc 2220 aaaggagaaa tatctgctac gaaatgtcca cacggttaaa gagcttcaac tccctgagca 2280 aaggattcgt tttgctgagg tggcagctcg attcaagcat ctttgtggac taccagtagc 2340 ggactgttta agtggatctc caaaggtgct tatcggcttg aaacatttgc atgtgtatgc 2400 accgctggag tcacggattg gaaatcctgg tgagccaata gcagttcgta ccaaactagg 2460 ctggaccatt tacggcccac aaggtggtga aaacatggca acagggtttg ctggccatca 2520 tacagtcggt gacataactg aactagattt gcaagaacta cttcgaaggc actttacatt 2580 ggaggaaacc ggtttggacg tcaaagtgct gcctgaatca gccgaagatc gcagagcaaa 2640 ggagctactg gaacaaacca ctatgcgggt cggcaagcgc ttcgaaacgg ggcttctgtg 2700 gcgaagcgat gacccacagc taccagatag cttccccatg gcgctaaaac gcgtgaagag 2760 cctggagcgg agattggaga aaaawttgga gctgaaacag aatgtagaga ggcaaattga 2820 agaataccaa cacaaaggct acgctcacat cgctacgaac actgaactga ttgaagctga 2880 acctggaaag gtctggtatc tccctttaaa tgtggtactc aacccaagaa aagccgggca 2940 aagtccggct tgtatgggac gcagccgcgt ccgttcaggg aaagtcgctc aactctgaat 3000 tgctcaaagg gcccgacctt ctatccagtt tgccgtctgt gttgtgtccg ttccgagaac 3060 gtcccatcgc tttcggtggc gatatagcgg aaatgtatca ccagctgcga atacggtcaa 3120 gtgacaagtc agcacaaagg tttctatatc gtccgaacgc atcaggtcct cccataacct 3180 acgtcatgga cgtcgccacc tttggcgcaa ccagttcacc atgctcagca cagwttatta 3240 aagatcgaaa cgctcaggac tatgcggagg aatatccgga cgctgtggat gcaattacac 3300 agcgtaacaa aaatgacatt tttgcgtgtc tcaagaatta aataatgtgt ctctagtaga 3360 tttgaggttg ctgaatctga tgccgttcac agaaatgctc cagcacgtca caatttttag 3420 ctacaggtcg ctaaagttgt aaaaaacaca ggtttcatca atgttcacaa aaatttaaac 3480 aatgacttat cgaaaaatat tatgatctta tcgataaata ataaacatta cgtggacgat 3540 tatttagatt ccacgttcac tgtgagcgag gctatcaaac gagcagagga ggttacatac 3600 atccactcca atgccggttt tcacatccga aattgggtgt ccaacagtga agaattccaa 3660 caacacttcg gtagaaaacc ggagaaccac atcgttcatt tcgctagcga caagtccagc 3720 agcaatagta tggaaagaat tctgggtatg tcttgggata cagtaaagga tgtctttgtg 3780 tgtactacgg ctctgcgcga cgatctgcaa gtatatttga ccgaaagtaa gctgcccacc 3840 aaacgagtat tgttgagcgt tgtcatgagt tttttttgac ccgctcggcc tgtgggcgcc 3900 tttcaccgtg cacggaaaaa tcatcgttca agatctttgg aggaatggtt gttcgtggga 3960 cgagataatt gatgagcact ccgcgaagaa gtggtaccgt tggattgcac tactgccaag 4020 tcttcaagcc atggaaattc cgcgatgtta cttcgccgac gtgaaggcat cagactacca 4080 ggacctcgaa ttacacgtgt ttgcagatgc aagcgaagaa gcttatggat gtgtcgctta 4140 ttttcgtgta ttggtgaaag gagaaccgag agtggcactg gtttcagcaa aatcgaaggt 4200 agctccgttg caatacatgt ccataccgag aatggaactc cttgctgcgg tacttggagc 4260 cagattagcc gcctttgtga agtcgaacca ctctgtacag gtgcaaagag tgttctacca 4320 tatcgattca gctacagtac tttcatggat ccgctccgac cataggaaat acaaacagtt 4380 tgtcgcatat cgcatcgggg aaattttgac catcaccaat ccaaagcaat ggagctgggt 4440 ggctacgaag cacaatatag ccgatgtttt aacgaaatgg ggaaaaaatg gaccaccgtt 4500 ggcgacggaa agcgaatggg tacgaggccc tgacattctt tatcgattag atcaagagta 4560 ctctcaacga gatttgccac cccctggcgt agtagaagaa ctgcgagcat atcatctgtt 4620 ccacgaagtt tcgtttcaca aggctctaat agacaccacg cgattctcgc gttggaccat 4680 attggtaaga agcatggcct gtgtgtttcg tttcatttct aattgtcaca aaaggatgaa 4740 gaaacagcca atagaaacac tgcaagcctc gcctaagttg gtgaagttgc taaaacgaac 4800 gtttccatca atcaaatcac ctctcaagcg cgaagaatac cagatggcgg aaaactttct 4860 atggaaagca gcacagcaag aaggattccc cgacgaaatg aaaacgctat tgaagaacaa 4920 cagactacct caatcgaaat tgcaccttat tgaacgctcc agcccgctct atagattggc 4980 tccatttcta gatgcgaatg gcgtgatccg gatggaaggc cgtgcagccc acgcggattt 5040 catccctttc gagcagcgtt tcccaattgt actaccgaag ggccacgacg ttacakcgaa 5100 actactacta cactatcatg caaaattcgg tcacgccaac cgtgagacgg tagtgaacga 5160 gctgcgacag cgtttctatg ttcaaaatat tcgcgcggct gttttgcagg tgattaagga 5220 ctgtamttgg tgcaaaatac ataaatgtat tcctgtggcg cctaggatgg ctcctttacc 5280 ggtgcagcgt ctaacccccc aawtacggcc atttagttat gtaggagttg actacttcgg 5340 cccagtagtc gtcactgtag gacgtcgctc agaaaaaaga tgggtgtgtc tattcacatg 5400 ccttgtgact cgggcgatcc atatggaaat agcctatagc ttgagcagcc aatcatgcgc 5460 catggcaatc agacgtttca tttgcagaag gggcactccg ttggagattt tttccgacaa 5520 cggaactaat tttcaagctg cgagtaagga gttcatccag gaaattcgtt gcatagaaat 5580 ggaatgtgcg gacatcttta ccgacgctag gactcgttgg aactttaacc ccccagcagc 5640 accacacatg ggtggtgtat gggaacggtt agtgaggtcg gcaaaggagg cactaaaggc 5700 tttacacaat ggcggcaaat taagtgacga gatattgttg acgatattat cagaagccga 5760 ggacatggtg aattctcgtc cgttaacata cgtgccacaa gaatctgcag actgcgaagc 5820 cctcacgccg aatcattttc ttcgtggttt gcctgccgga gagcgtgaag aggctaattt 5880 gccaacaagt tctgccgaag cgttgcggga cagctataaa cgatctcaag agctggcaga 5940 catgttatgg aaaagatggc tgaaggaata catcccaaca atcaaccatc gcaccaaatg 6000 gcttgctgaa caagaaccta tcgcagaagg agagctggtc tatattactg atgggaacaa 6060 ccgtaggacc tggatccggg gtatagtaga aaaggtgata cgagggactg atggaagaat 6120 tcgacgtgcg ctagttagaa cgtctaaggg cgtttttcgt cgggcggtcg ccaagctagc 6180 ggtgatggag ttgcggagta aatctggtca aatccctggc cctgaaccag agttacggga 6240 gggggaat 6248 // ID CR1_Ele38B_AAe repbase; DNA; INV; 5065 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW CR1_Ele38B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5065 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1216-1216 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. The consensus is ~78% identical to CR1_Ele38. XX FH Key Location/Qualifiers FT CDS 299..1159 FT /product="CR1_Ele38B_AAe_1p" FT /translation="MSDACDQCAKPVKSDDEFITCMAFCERMVHIRCSVTK FT LNKPFVKIIHESPNLIWMCDERAKLMKIARFKSAVSSFGEAFQSITEKQES FT VHAEIRKELAKQGQQIALLSKRMTPSSAFLRESGSFSRQPPSKRRRDEEIN FT FNPTASKPLLGGTKETTTASILTVSEPVELFWLYLSRVHPSVKPEDIEKLA FT KDCLESEDPVKAIPLVKRGIDASRLNFISYKIGIDHKLRQTALSPDTWPKG FT ILFREFEDLSAKNSWLPRLNTPTVMVSPELGASQFSTPSTGVNLAG" FT CDS 1006..4890 FT /product="CR1_Ele38B_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="HVAKRNFVPRVRRSKCKKLLVTPIEYADSYGVARIGS FT ITVFHSLNRSQSSRVISNNAHRSNVIERVNSSCNPERFACRLKGALDPLDT FT VAHAGSCHRSRSGPVVETGDRVSQPSFSGKYTSISSNSSPDQPPYSSTPSN FT AVQHNSNDIGVLTIRQSQLETXNVQLTPRPPRLQGKRIYPSTVLPGRTADC FT TLEVSGSSSSVVPFPASVHHSRPGPDARCGSEIFQTPLSGKLLHQVVASSN FT PDADPYFSTLDTLTGNPERTLESPMEVLNPSDAVVPPAAFVHHSRSGPAVG FT SGERVFQQPNNGEYFSIVNPQSLPDAQMPSRHIGVPAVQRNQDILMYYQNV FT GGMNSCVDDYRLAVSDTCYDIIVLTETWLDSRTLSSQVLGIDYEVFRCDRN FT PSNSRKSTGGGVLVAVRHGLKAKAVEKDLWSCLEQVWVSVELGDRTLFLCA FT LYIAPDRVRDNELIQAHCDSVFTIMETANPVDDIFILGDFNLAGISWKRSD FT NGFLYPDPVRSSLHACAANLLDNYSTATLAQINHIVNENNRSLDLCFVCLQ FT DKAPIIEAAPCELVKVVPHHPPLIITVDNNHFHDFNNRPATVSYDYAKADH FT RSIADVLATIHWESILDHNDIEVAAQTFAHVLSYIIDRHVPKRAHHNASRP FT PWQTSELRKLKSVKRAALRKFTKYRTPSSRCYYLRSNYEYQRVSRHCFQQY FT QRSIEQKLKRHPKSFWKYVNEQRKESGLPSSMEWNGRTATCLQEICQLFSS FT KFASVFSDVQISNEQVILAANNAPLNGQTLDNFDVDDNAISRAALQLKSSY FT NPGPDGIPSVVLKKHIDSLLTPLHSLFRLSLSTGIFPSCWKVANMFPVHKK FT GRKRDVDNYRGITSLSSVSKLFELVVIEPLLSHCKHQLDNDQHGFITGRST FT TTNLLCFTSYITDSMVDRVQTDAIYTDLSAAFDKLDHNIAIAKLDRFGICG FT NFLRWFRSYLTDRQLKVVIGDCSSDSFHATSGIPQGSHLGPVIFLLYFNDV FT HYVLKAPRLSYADDLKLFLRIRSTADCLYLQQQLDLFASWCSLNGMVVNPT FT KCSIITFSRKKRSIAFPYSLLGTSLERVDHIKDLGVFLDSQLSFKQHISYT FT VSKASTTLGFIFRIAKNFSDVYSLKSLYCSLVRSTLEYGAAVWSPAYNNGA FT ERIESVQRRFLRFALRKLPWRDPFRLPSYESRCQLIDLDLLRCRRDVIRAL FT TIADVLQGRIDCGTILEQINLNVRPRSLRNNIMLRLPLFRTNYGLHGALSG FT LQRVFNRVSSLFDFNITRETLRRRFSLFFAERDN" XX SQ Sequence 5065 BP; 1326 A; 1243 C; 1078 G; 1416 T; 2 other; tttgtttttt tatcacgcgc cgaaatttaa cagtttcccg tgaaatatat atcgtattta 60 cctgcgttta gtttaccgcc ataaaatact gtgccgtgct cttcgtcaag ttgataaatt 120 ccacatatcc tgtgttaaaa ccaaggaaag ttcgataata ttatcgcctc acgtcgcgcc 180 gttgatagtt gttttgcttt cgctttgttt tcgccactgg tctctctgtg cactttttgg 240 actaccgtac tttactagaa taataaagtc acagacaaac agacagtaac tccaaaccat 300 gtctgatgct tgtgatcaat gtgctaagcc ggtcaaatcc gatgacgagt ttattacttg 360 tatggctttt tgtgaacgaa tggtacatat taggtgctcg gttacgaagc tcaacaaacc 420 gtttgtgaaa atcattcatg aaagcccaaa tctgatttgg atgtgcgacg agcgcgcgaa 480 attgatgaaa atcgctagat ttaaatctgc cgtctcttcg ttcggtgagg ccttccaatc 540 cattaccgaa aaacaagaat ctgtacacgc agagattaga aaggaacttg caaaacaagg 600 ccaacaaatt gctctgttgt ctaagcgtat gactccatcg tccgccttct tgcgtgaatc 660 tggatcgttt tctcgacaac cgccctcaaa aagacgccgt gatgaagaga tcaactttaa 720 cccgactgct tcaaaaccgc tgctaggtgg cacaaaagaa acaactaccg ccagcatact 780 tactgtctcc gaaccggttg agttgttctg gctgtatctt tctcgtgtcc atccgagtgt 840 caaaccggaa gacatagaaa agttggctaa agactgtttg gagagtgaag atcctgtcaa 900 ggcgattcca ctcgtaaagc gtggaattga cgctagccgc ttaaacttca tttcctacaa 960 gattggcatt gatcataaac tccgtcagac tgcactgagc cctgacacgt ggccaaaagg 1020 aattttgttc cgcgagttcg aagatctaag tgcaaaaaac tcttggttac cccgattgaa 1080 tacgccgaca gttatggtgt cgcccgaatt gggagcatca cagttttcca ctccctcaac 1140 cggagtcaat ctagccgggt aatttcgaat aatgcgcatc ggagcaacgt gattgaacgt 1200 gtcaacagtt cctgtaatcc agaacgcttt gcctgccgcc ttaagggagc cctcgatcca 1260 ctcgacacag tcgcgcatgc tggttcctgt catcgaagtc gttctggacc tgttgtcgag 1320 actggtgaca gggtctccca accttctttc tcaggcaagt acacatccat tagttccaat 1380 tcttcgcctg atcagcctcc gtattccagc actccttcta atgctgtcca gcataacagc 1440 aacgacattg gagtactcac cattagacaa tctcagcttg aaacaakcaa tgttcaacta 1500 acaccgaggc ctccaagact ccaaggaaaa cgcatttatc catctaccgt actaccagga 1560 cgcactgcag attgcacttt ggaagtctct ggatcctcta gctcagtcgt gccttttcca 1620 gccagcgttc atcacagtcg tcctggtcct gatgctagat gtggttcaga gatcttccag 1680 actcctttgt caggcaagtt gttacatcaa gttgtcgctt cttcgaaccc tgatgccgat 1740 ccgtatttca gcactttgga tacgcttact ggaaacccgg aacgcacctt agaaagccct 1800 atggaagtcc tcaatccctc cgacgcagtc gtgcctcctg ctgccttcgt tcatcatagt 1860 cgttccggcc ctgctgtcgg aagtggagag agggtcttcc aacaacctaa caacggcgag 1920 tattttagta ttgtgaatcc tcagtctttg cctgatgcac aaatgccttc cagacatatc 1980 ggcgtcccag cagtgcagcg caaccaagat atactaatgt attaccagaa cgttggcggg 2040 atgaactcat gtgttgacga ctaccgcttg gctgtatcgg atacgtgcta cgatatcatc 2100 gttctaaccg aaacgtggct tgactctcga acgctgtcca gtcaggtgct tgggatcgat 2160 tatgaagtgt tccgttgcga tcgaaatcct agcaatagca gaaaatctac aggaggtgga 2220 gtacttgtgg ctgttcgtca cggtttaaaa gcaaaggcag ttgaaaagga tctgtggagt 2280 tgcctggaac aagtttgggt atccgtcgag ctaggtgatc gtaccttatt cttgtgtgcc 2340 ttgtacatcg cgcctgaccg agttcgcgat aacgagctaa tacaagcaca ttgcgactca 2400 gtttttacta taatggaaac tgccaatcct gttgacgaca tcttcattct gggtgatttc 2460 aatctagctg gcatttcgtg gaaacgctcg gacaacggct tcctctatcc ggatcctgta 2520 cgatcgtcac tgcatgcttg cgcagctaat ctcctcgata attacagcac tgccacgttg 2580 gcacaaatca accatatcgt caacgaaaac aatcgaagtt tggatctttg cttcgtttgc 2640 ttgcaagata aagctccaat aattgaggcg gcaccttgcg agctggtgaa agtagtccct 2700 catcatcctc cgttgatcat caccgttgat aataaccact ttcacgattt taacaaccgc 2760 cccgctaccg tgtcgtacga ttacgcgaaa gctgatcatc gtagcatcgc agatgtgtta 2820 gccaccatcc actgggagag cattcttgat cacaacgaca tcgaagtagc tgcgcaaact 2880 tttgcacatg tattgtcata catcattgac aggcacgttc cgaaaagagc acaccacaat 2940 gcctcccgac ctccttggca aaccagtgaa ctgcggaagt tgaagtcggt caagcgagcg 3000 gctctaagga agttcactaa gtaccgaaca ccttcatcgc gctgttacta tctaaggtcg 3060 aactacgagt atcagcgagt cagtcgtcac tgtttccagc agtatcagcg aagcattgag 3120 cagaaactta aacgccatcc aaagtctttc tggaaatacg tgaatgagca gcgtaaggag 3180 tctggtcttc catcctcgat ggaatggaat ggtagaaccg caacgtgcct ccaggagata 3240 tgccagttgt tttcttccaa atttgccagc gtattcagcg acgttcaaat aagtaatgaa 3300 caagtcatcc tagcagccaa caatgctcct ctaaacggac aaacgctgga caatttcgat 3360 gttgacgata atgccatttc cagggccgca ttgcaactca agtcgtcgta taaccctgga 3420 ccagatggaa ttccgtcagt agtcctcaag aagcatatcg acagtctgct tactccgctg 3480 cacagtttat ttcgtctatc actttccacc ggaatctttc cgtcatgctg gaaagtagca 3540 aacatgtttc cagtgcataa gaaagggaga aaacgtgacg tagataatta tcgtggcatc 3600 acatctctga gctcagtttc gaagcttttc gaactcgttg ttattgaacc attgctatca 3660 cactgcaaac atcagcttga caacgatcaa cacggcttca tcaccggtcg ctcgaccact 3720 actaatctac tatgcttcac atcgtacata actgatagta tggtcgatag agttcaaact 3780 gatgcaatct ataccgatct gtccgctgct ttcgacaagt tggaccacaa catcgccatt 3840 gcaaagctcg acaggttcgg catctgcggt aacttcctgc gctggttcag atcgtatctc 3900 accgatcgtc agttaaaagt cgtgataggg gactgcagct ccgacagctt tcatgctact 3960 tccggcatac cacaaggaag tcacctgggt ccagtgattt tcttgcttta tttcaatgac 4020 gttcattacg tattaaaagc ccctcggtta tcttacgcgg atgatctgaa attgttctta 4080 cgaatacgct caactgctga ttgtttgtac ctgcaacaac aacttgactt gtttgcaagc 4140 tggtgctctc taaatggtat ggtggtgaat ccaacaaagt gctccatcat aacgttttcc 4200 aggaaaaagc gatcaatagc gttcccttat agtttgctcg ggacgagtct cgaacgtgtg 4260 gatcacatca aggacctagg tgttttcctt gattcacagt tatcattcaa acaacatatt 4320 tcgtacacgg tcagcaaagc ttcgacaact ctcggtttca tcttcagaat cgccaagaat 4380 ttctcagatg twtacagcct gaaatcgctt tattgctctc ttgtgcgctc cacgctggaa 4440 tacggcgctg cggtttggag ccccgcttac aacaatggag cggaaaggat cgaatctgtc 4500 caacgaagat ttcttcgatt cgcacttcgc aagttaccgt ggagagatcc gtttcgcctg 4560 ccgagctacg aaagtcgttg tcagctaata gacttggacc ttcttcgctg tagaagggat 4620 gtcataagag ctttgactat tgcggatgtg ttgcagggac gcatcgactg tggaactatt 4680 ctggaacaaa ttaatttgaa cgtccgacca cgctcgcttc gcaacaacat catgctaaga 4740 ctgcctctat ttcgtacaaa ttatggactt cacggtgctc ttagtggact acagcgagtt 4800 ttcaacagag tatcttcatt gtttgatttc aacattaccc gagagacgct ccgtcgaaga 4860 ttttcattgt ttttcgctga acgagataat tagtctaagt gtttagttta agttttatcg 4920 cctgactatt taatatttgt ttttaaattc attcttattg tttttgacgt cactgtactt 4980 tgttgttaat gttagcatat acttatattt taagacatca ttggggctac tacttgcctg 5040 ttgatgtagt ataaacaaat aaaca 5065 // ID Gypsy17-I_Dya repbase; DNA; INV; 4207 BP. XX AC chrU; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy17_Dya; KW Gypsy17-LTR_Dya; Gypsy17-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1105-1105 (2009). XX DR Genome; chrU; Positions 6462571 6458365. XX CC Positions [3186-3656] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2640..4193 FT /product="Gypsy17-I_Dya_3p" FT /translation="MTSNRLQRWAIILMAYQYDIKYRSTTAHGNADALSRL FT PVNSDDKFDQEEACYNIVEVSCPINVDVVKRNIEHDYILKEVYKYVYNGWP FT EQVERELMPYFNRRFAFTFNSSLLCLHSEVNRVIIPSKLRKTILNFLHDGH FT WGIVRMKQLARQHVWWPGIDEDINQLAKDCNICKVANPAPAKEFFSWPNTT FT SAWERIHIDFAGPIFNSMWLICVDAYSQFPFITQMSSTTTENTISALTTIF FT AIEGYPKTLVSDNGPQLTAESFKVFCKNFGINHITTAPFHPASNGLAERFV FT QSFKTAVGKNIRDGLSVRAAVTKYLSSYRFTPNAEGKSPAELLHGRPVRTI FT LSQLLEKKSDSKPLASTKYFPSQKVFARNYARGEKWIEAVIDRPVGHMLFI FT LRTKTGVIKRHINQIKPRFGPDAIKDSKEENTTYWWAPDLPNQPTTQLEAQ FT SAATQQEDPISEPSQDVQPSRVSEESQPNVQKNPQPRRSGIPIRRSSRTRQ FT EVNRYQTTNFRKRHTNKKMPLN" FT CDS join(73..1083,1087..2580) FT /product="Gypsy17-I_Dya_1p" FT /translation="MENQSNLQELLAQQQQLLQQQQQWMQAFLERERNFPV FT AGEQIVVCPPFPSFSKDSQNWETYLQQLTQHFKAYSVLAEEKKKAYFLSWV FT GTHVFELVKNLFGSENPNTHTYVQITEKLTEHFKQKRHVVAARYEFFKRQM FT RDSQTHKEWVADLRGIARECQFVCQSEGCTLNYVNEMIRDQIIVHTPYDAI FT RTAALQKLQPSLEDVISIAETYEATMKTVAVIKESDKKSVDMNAVYVHKNN FT HKHRNSGDGKGGKIALKSCSGCGNSHMREKCKFRDAICHTCGRKGHIAVVC FT MSKQCKMKMENSDKKNQNKNTDTIQTVYTVEKLARFEINKTMIDVILNTIV FT PFQMDSGATVSVMNLQTYRQVGSPKLAKCIRILHAFGQSEIPVLGELHTEA FT KCGNKSANVTIIVANVENSNNLFGLDLFKTFNFEIQQISNVSEAQLSQMTD FT LCTKYKEVFEPAIGTIKNFKASIYLKPNSLPKFYKSRQIPFAQMDKFKEEA FT QRLTNAGIWKPIKFSNWALAPKPGGALRICGDFKQGVNSQLDIEQYPLPTR FT ETLFHAIRHGKHFSKIDLKDAYLQMELDDATKKIMVVNTPLGLFQYQRLPY FT GIASAPAIFQRYLEQLLNGIEGCGNYLDDIIISAPTTEQHLARLERILGIL FT QENGIKCKKEKCFFLKEEIEYLGRRVSSKGILPDTSGLEAVKELKPPTNLQ FT QLDAFMGKVNYYCNFIPNYSQVAAPLNQLRRKNTLFKFGTDQQRAFTALKT FT HILNATELVHFNEDLPLVLATDASSFGIGVVLSHLHPGNREKPIAFASKTL FT DVHQVRYSQIEKEALSIIFGVQKFHQYTVENLS" XX SQ Sequence 4207 BP; 1431 A; 899 C; 844 G; 1033 T; 0 other; attattggcg acgagtaaaa aaaacagaaa ttcgtgtcgg tagtctattg caagagatcg 60 aacgaaaaaa caatggagaa tcaaagtaat ttacaagagt tgttggctca acagcaacaa 120 ttacttcagc agcaacaaca atggatgcaa gcatttcttg aacgggaacg taactttccc 180 gtagcaggag aacaaatcgt tgtgtgtccg ccgttcccaa gttttagtaa ggatagccaa 240 aattgggaga cgtatttaca acagctaacg caacatttta aagcgtactc agttctagcc 300 gaagagaaga aaaaagccta ttttctttct tgggtaggta cacatgtttt cgaactagtg 360 aaaaatctat tcgggagtga aaatccgaac acacacacat atgtacaaat caccgaaaag 420 ttaacggagc acttcaagca aaaacggcat gttgtcgctg ctcgttatga atttttcaaa 480 cggcaaatgc gggactcaca aacacacaag gagtgggtag ctgatctacg tggcatagct 540 cgcgaatgcc aatttgtatg ccaatcagaa gggtgtacat taaactatgt aaatgagatg 600 atccgcgatc aaataatagt gcatacgcca tacgacgcaa ttcgcacagc agctttgcaa 660 aagcttcagc catcgctaga agatgtaatt tctattgccg aaacatacga ggcaacaatg 720 aaaacagtag ctgttatcaa ggaaagtgac aagaaatcag tagacatgaa cgcagtttac 780 gtacacaaaa acaatcacaa acaccgtaac agtggggatg gaaaaggagg caagatcgcg 840 ctcaaatctt gctccggctg tggaaactca cacatgcgcg aaaaatgtaa atttcgtgat 900 gccatttgcc atacatgtgg aagaaaggga cacatagctg ttgtatgcat gtcaaaacag 960 tgcaaaatga agatggaaaa tagtgacaaa aagaaccaaa acaaaaacac cgatactatc 1020 caaacggtat acactgtgga aaaattagct aggtttgaaa tcaataaaac catgattgat 1080 gtttaaattc taaatacaat cgttccgttt caaatggatt caggtgcaac tgtgtcggtg 1140 atgaatttgc aaacataccg ccaagttggc agccctaaat tagcaaaatg tattcgtatt 1200 ttgcatgcgt ttggccaaag cgaaattcca gttttgggtg aactgcacac tgaggcaaag 1260 tgtggcaaca aatcagcaaa cgtaacgatc atcgtggcaa acgtggaaaa ttcaaacaat 1320 ctttttggtt tggatttgtt caaaacattc aatttcgaaa ttcagcaaat ttcaaatgta 1380 agcgaagcac agctgtctca aatgactgac ctttgcacca aatataagga agtatttgaa 1440 cctgctatcg gaactataaa aaactttaag gcaagcatat acttaaagcc aaattcgctt 1500 ccaaaattct acaaaagccg acagattccg tttgcacaaa tggacaaatt taaagaggaa 1560 gcgcagcgtc tcacaaatgc aggtatttgg aaacccatta aattcagcaa ttgggcacta 1620 gcacccaagc cgggtggagc actacgtatt tgtggcgact ttaaacaagg tgtcaattcc 1680 caattggaca tcgagcaata cccattgcca acgcgagaaa ctctatttca tgccattcgc 1740 catggaaaac atttttccaa aatcgatctg aaggacgcat acctacaaat ggaactagat 1800 gatgcaacga agaaaattat ggttgtcaac acgccgttag gacttttcca gtaccaacgt 1860 ttaccgtatg gaatagccag cgctccagct atattccaac ggtatctcga gcaacttcta 1920 aatggcatag aaggttgtgg gaattacctc gacgacataa ttatctcagc gcctacgact 1980 gagcaacacc ttgctcgcct tgaaagaatt ttaggtattc tacaagagaa tggtataaaa 2040 tgtaagaaag aaaaatgttt cttcctcaaa gaagaaatcg aatatttggg gagaagagtg 2100 agcagtaagg gcatccttcc ggatacctca ggactagaag ctgtcaagga attgaagccg 2160 cccactaatc tacaacagtt agatgcgttt atgggtaaag taaactatta ttgcaatttt 2220 atcccaaatt attcacaagt agcagctccg ctgaatcagc tgcgcagaaa aaacacacta 2280 ttcaagtttg gaacagacca gcaacgggca ttcacagcac ttaaaacaca tattttaaac 2340 gctacagagc tagtacattt taacgaggac cttccgctcg ttttagctac cgacgcttct 2400 tcattcggta taggagtcgt gttgtcacac ctgcatccgg gtaacagaga aaagccaatc 2460 gcttttgcat ctaaaactct agatgtgcac caagtcaggt acagccaaat agaaaaggaa 2520 gcactttcta ttatatttgg agtacaaaag tttcatcagt atacggtaga aaatttatcc 2580 taatcactga tcataagcct ttagttacga tattttcacc aagcaagcat ctaccaacga 2640 tgacttcaaa ccgactccag cgctgggcta taattctgat ggcctatcag tacgatataa 2700 aatatcgttc gacaaccgca catggaaatg cagacgcact ctcacgtctt ccagtcaact 2760 ctgatgataa atttgatcag gaggaggcat gctataacat agtcgaagta tcttgcccga 2820 taaatgtcga tgtagtcaag cgaaacattg aacacgatta tattctaaag gaagtttata 2880 aatacgttta caatggctgg cctgaacagg tagaaagaga attaatgcca tattttaacc 2940 gaagatttgc cttcacgttc aacagcagcc ttctttgcct tcattctgaa gtaaaccgag 3000 ttattattcc aagcaagcta cgtaaaacga ttcttaattt cctacacgat ggtcattggg 3060 gaatcgtacg tatgaaacaa ctagcacgtc aacatgtgtg gtggccaggg atcgacgagg 3120 atatcaacca attagcaaaa gattgtaaca tctgcaaagt ggctaatcca gctcctgcaa 3180 aagagttttt tagctggcca aacactacat ccgcctggga aagaatacat atcgactttg 3240 cgggtccgat ttttaattca atgtggttaa tttgcgttga cgcttactct cagttcccat 3300 ttataacgca aatgtcgtcc accacaacag aaaatactat ttcagcactc acaacaatat 3360 ttgccatcga aggttatccg aaaaccttag ttagcgataa tggtccccag ctaacagcag 3420 aaagtttcaa ggtattttgc aagaactttg gaataaacca catcaccact gcaccatttc 3480 atccagcatc aaatggtctt gctgagcgat tcgtacaatc attcaaaaca gcagtgggca 3540 aaaatatcag agatgggtta tctgtcagag cagccgttac gaaatattta agttcgtatc 3600 gttttacacc aaatgcagaa ggtaaatctc cagcagaact tctacacggt cgcccagttc 3660 gcactatttt aagccaacta ttagagaaga aatctgatag taagccacta gcatctacga 3720 agtattttcc aagccagaaa gtatttgctc gaaactacgc aagaggggaa aagtggatag 3780 aagctgtcat cgaccggcct gtaggacata tgctattcat ccttcgtacc aaaacaggtg 3840 tcatcaagcg tcacatcaat caaatcaagc caagatttgg tccagatgcc atcaaggatt 3900 ctaaggaaga aaacactaca tattggtggg cacctgatct cccaaatcag cccacaactc 3960 agctggaagc tcagtcagcg gctacacaac aagaagatcc gatatccgaa ccgtcacagg 4020 atgttcagcc aagccgagtt tctgaagagt cacagcccaa cgttcagaag aatcctcagc 4080 ctcgccgttc aggtattccg attcgtagaa gctctcggac acgccaagaa gtcaaccgct 4140 atcagaccac gaacttcagg aagcgccata ctaacaaaaa aatgccatta aattaagggg 4200 agagatg 4207 // ID Copia-12_DPu-LTR repbase; DNA; INV; 368 BP. XX AC scaffold_242; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_DPu_; KW Copia-12_DPu-LTR; Copia-12_DPu-I. XX NM Copia-12_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 688-688 (2010). XX DR Genome; scaffold_242; Positions 37196 36829. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 368 BP; 108 A; 70 C; 78 G; 112 T; 0 other; tgttgtttga ctaattaaaa cacacaccac cagggggctc tcttactcgt atacgcgacg 60 tctgtacgcg tctctataga aacgtacgtc aacgtcgtag ttgaagctga atgcttttag 120 aagagtctct ctcagggact gacttcgttc ctaattaaga aaagtgttct taccactgtg 180 ttcgtgaatc taactaaagg tatgaaacaa gtgtatcgct tttttgtgtg tatttgtgta 240 tgtgtctatg tacaatcaag aaaggcccta cagttgaagg cagagaggag cccacatact 300 gtttgtgatg gctggtctaa ataaagaaag atactaatga aacgcccttg ttaattctat 360 aatcaaca 368 // ID Copia-13_AA-I repbase; DNA; INV; 4154 BP. XX AC AAGE02020441; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_AA_; KW Copia-13_AA-LTR; Copia-13_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4154 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020441; Positions 12009 7856. XX CC Positions [1445-1978] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(95..1324,1328..4141) FT /product="Copia-13_AA-I_1p" FT /translation="MSLNHLNNGSGENGRTAAEAISVRVGGGSVALPAIER FT LRGRENYTTWAFAMRMILIREGCWEAVSAETAEAQALVPEDQKQRALATIC FT LSLENYNYSLVLDATDARAAWRKLESAFQDSGLNRKIGLLRKFTSIRLVNS FT PSVEAYVDELMSTCHKLASVGFKVDDSWLAAMLLMGLPEHYEPMIMGLEAS FT GIALTADAVKSKILQDVKIERGPTSCSSEGALYSKQGSRRGKATPAKSKSD FT EKTCFNCRKPDHFAAKCPEKQQQSKMSKHNGLCSIFAMGDVNAGEWYFDSG FT ATCHMARSDQEFTNEAKVSHNVGTANNTSMQAVAKGSVAMDSSNGRIDVSG FT VLKIPDLATNLLSVSAICKKGKTVIFTKDKCEVLDQNGDLVVSGVEDGGLY FT RVNRPEKSFLAGGIQLHRRLGHLHEGGLKKLKRIANGVDFQDGSLAGCVAC FT LEGKQARRPFTSSASRSEELLELVHSDLCGPVEVSSLGGSRYFMTFLDDAS FT KRVTVYFLKNKSQALEAFKSFKAKAERQTGKKLKILRTDNGKEYVNADFRK FT VLEQDGIKHQTTCPYTPEQNGAAERMNRTLVEKARCMLNDAKLGKEFWAEA FT ISTAAHVVNRCPTRALEDKTPEEAWTGKKPNLGHLKIFGSSVTTLVPKAKR FT KKFDPKSEKGIFVGYCEDTKGYRVYDPVNDRFNVSRDVVVLQEGGALHVPE FT KRTEKVEFMELWNEYVEHSGAAALPEIDRPMETVVSDAAQPAGVIDQPIGN FT PGHSGTLGEAIADDSDKEEFEDACADIALPPQLIRPADEQGLRRSGRERRY FT PGKYDDFISYSSFSGDVHSSQLSSNAMRDDDPTSYDEVLRRPDRELWMAAM FT QEEIASLAENSTWVLADLPEGRKAIRNKWVFRTKRGPDGSIQRYKARLVVK FT GCSQRPGLDYNEVYSPVVRYATVRYLMALAVRYNLDIDQMDAVTAFLQGEL FT KDEDIYMLQPEGFVASNGKVCKLKKALYGLKQSSRVWNAQLDAVLQEFGLK FT RSSVDPCLYWSMRGNKMMFVTIYVDDLLIFTNDRILKKKLKAHLQKRFQMK FT DLGEAQHCLGIRITRKREDGKLWLDQQAYIEDIIDRFGVAEAHPIATPADP FT SVKLDKSMAPKTKSEIEEMKLVPFKEAVGCLSFVAQVTRPDIAFAVNVVSQ FT YGANPGRPHWEAVKRIIRYLKGTTTKKLEYSASAPAEIVGYSDADWGGDVD FT DRKSMTGYVFLMQGGAISWNVKKQPTVALSSCEAEYMAMSRTIQEAMWWRN FT LQSQFFEARSISVRCDNQSAISIANNGAYNPRTKHVSIRYHFVHDSLHQGI FT VKLNYISTTEQPADGFTKPMTVQKQQKFRKLTGVAD" XX SQ Sequence 4154 BP; 1141 A; 899 C; 1216 G; 898 T; 0 other; ataggttatc ggcccagcaa gaggaccata accggaagaa gtctggaaga tatttaaatt 60 caagaaactt caagcagttt aatttggatt caaaatgtcg ctgaatcacc tcaataacgg 120 aagtggagaa aacggaagga ccgcagccga agcaataagc gttcgagttg gtggaggttc 180 cgtggcgtta ccggccatcg agaggctgcg cggtagggag aactacacga cgtgggcgtt 240 tgcgatgcgg atgattctca tccgagaagg atgctgggaa gccgtgagtg ccgaaacggc 300 cgaagctcaa gcgttagtgc cggaggatca gaagcagagg gcgttggcaa ccatctgttt 360 gagcctggag aactataatt acagtttggt tctggatgcg accgatgcgc gagcggcctg 420 gaggaagctc gaatctgcgt ttcaagacag cggcctgaat cgaaaaattg ggttgctaag 480 gaagttcact tcgatacgtc tcgtcaacag tccaagtgtc gaagcctacg tggatgagct 540 gatgagtacc tgtcacaagt tggcctcggt aggtttcaaa gtggacgatt cctggctggc 600 tgccatgttg ctaatggggt taccggagca ttacgagcct atgataatgg gcctcgaggc 660 gtctggtatc gcgctgacgg cggatgcagt caagtcgaag attttgcaag atgtcaagat 720 cgagcgaggg ccgaccagct gcagcagtga aggagctcta tatagcaagc agggttccag 780 aagaggcaag gctacgccgg cgaaatcgaa gagtgacgag aaaacctgtt tcaactgtag 840 gaagcctgat cattttgctg ccaagtgccc ggagaagcaa cagcagtcga aaatgagcaa 900 gcataacggt ctttgttcca tttttgccat gggagacgtg aacgctggtg aatggtattt 960 cgattcgggt gccacttgtc acatggcacg gtcggatcaa gaattcacca atgaagcgaa 1020 ggtgagtcac aacgttggga cggcgaacaa taccagcatg caagccgttg ccaagggttc 1080 cgtcgcaatg gatagcagca atggtcgaat cgatgtcagc ggagttctga agatcccgga 1140 tctcgccaca aatttgctgt cggtgagtgc tatctgtaag aaagggaaaa ccgttatttt 1200 caccaaagat aaatgtgagg ttctggacca aaatggcgat ctggtcgtca gcggagtcga 1260 ggatggaggt ttgtacagag tcaaccgtcc ggagaaatcg ttcttggcgg gcggaattca 1320 gctctgacat cgacgtcttg gtcatctaca cgaaggtggt ctgaagaagc tgaagcggat 1380 cgcaaacgga gtcgattttc aagatggatc gttggctggt tgtgttgcct gtctcgaagg 1440 caaacaagcg aggcgtccgt tcacaagtag tgcgtccaga tcggaagaac tgttggaatt 1500 ggtacattcc gacctctgcg gaccagtgga ggtttcgtcg ctaggtggaa gtcgctattt 1560 tatgacgttt ttggacgatg cgagcaagcg tgtaacagtg tatttcttga agaacaagag 1620 tcaagccctt gaagcgttca agtcgttcaa ggcaaaagcg gagaggcaaa cgggtaagaa 1680 gctgaaaatc ctgcgtacag acaatggaaa agagtacgtg aatgcagatt tccgaaaggt 1740 gttggagcaa gacggtataa agcatcaaac aacgtgtcct tatacgcctg agcagaacgg 1800 ggcagcagag cgaatgaacc gcacgctggt tgaaaaagcg cggtgcatgc tcaacgatgc 1860 aaaactaggt aaggagtttt gggcagaggc tatttcgacc gcagcgcatg tggtgaaccg 1920 atgtcctaca cgagcactgg aagacaaaac gccggaagaa gcatggaccg gcaagaagcc 1980 gaatctgggg catctcaaga tttttggttc gagcgtaacg acgctagtac cgaaggcgaa 2040 aaggaagaag tttgacccca aatccgaaaa aggtattttt gtcggctact gtgaagatac 2100 gaagggttat cgagtgtatg atccggtgaa tgaccgtttc aacgtcagcc gagatgtcgt 2160 ggtactgcaa gaaggtggag ccctccatgt accagaaaaa cgaacggaaa aggttgagtt 2220 catggagctt tggaacgaat atgtcgaaca ttccggtgct gcagcacttc ctgaaatcga 2280 tcgaccgatg gaaaccgttg tgtcggatgc agcccagcct gccggtgtta tcgatcaacc 2340 gattggtaat cctggccata gcggaacact gggagaagct atcgcagacg acagtgataa 2400 agaagagttc gaggatgcct gtgccgacat tgcgctccca ccgcaactaa ttagaccagc 2460 tgatgagcaa gggttaaggc gcagcggtcg ggagcgccgc tacccaggca agtacgatga 2520 ttttataagt tatagttcat tttctggcga tgtccattcc tctcagttat caagcaatgc 2580 gatgcgcgat gatgatccga cgagttacga cgaagttctg cggcgacccg atcgagaact 2640 ttggatggcg gcgatgcaag aggaaatcgc ctccctcgcc gagaacagca catgggtgct 2700 agcggatttg ccagaaggaa gaaaagcgat tcggaacaaa tgggtattca gaacgaagag 2760 aggtcccgac ggaagtatcc agaggtacaa ggcccgcctc gtggtgaagg gctgttctca 2820 acgacccggt ctggattaca acgaagtgta ctcaccggtc gtgcggtacg cgaccgtccg 2880 atatttgatg gctctggcgg tgcgttacaa tttggatatc gatcaaatgg acgctgtaac 2940 agcgtttttg caaggcgagc tgaaagatga agacatctac atgctgcagc cggaagggtt 3000 cgttgcgtcg aacggaaaag tctgcaaatt gaagaaggcc ctctacggcc ttaagcagtc 3060 cagtcgcgtg tggaacgcgc agttggatgc ggtgttgcag gaatttggat tgaaacgatc 3120 gagcgtagat ccttgtctat actggtcgat gcgtgggaac aaaatgatgt ttgtgaccat 3180 ctacgtcgac gatctcctaa tattcacaaa cgaccggatt ctgaagaaga agttgaaggc 3240 tcatctgcag aaacggttcc aaatgaagga cctaggagaa gcacagcatt gtctcggcat 3300 ccggatcact aggaagcgtg aagatgggaa gctatggttg gaccagcaag cctacatcga 3360 agacatcatc gatcgttttg gtgtggccga agcccaccca attgcaacac cggcggatcc 3420 gagcgtcaag ttagataaat cgatggcacc gaagacaaaa tctgaaattg aagaaatgaa 3480 gttagttccg tttaaggaag cggtagggtg cttgtctttt gtagctcaag ttactcggcc 3540 ggatatcgct tttgccgtca acgttgtgag ccagtacggt gccaaccctg gccgtcctca 3600 ctgggaagcg gtgaagagga taatacggta cctgaaaggt actaccacga agaaactgga 3660 atattcggca agcgctccag cggaaattgt tggatacagc gatgccgatt ggggaggaga 3720 tgtggatgat cggaaatcca tgaccggcta cgtgttcctg atgcaaggcg gtgctatctc 3780 gtggaacgtc aagaagcaac ccaccgttgc tctctcatcg tgtgaagcgg agtatatggc 3840 catgtcgcgc actattcagg aagctatgtg gtggagaaac cttcagtcgc agtttttcga 3900 agcaagatcg atttccgtgc gatgcgataa tcaatcggcg atcagcattg ccaacaatgg 3960 agcgtacaac ccgaggacaa agcacgtcag catacggtat cacttcgttc acgacagtct 4020 acatcaaggc atcgtgaagc tcaattacat ttctacaacg gaacagccag ctgacggatt 4080 caccaaacca atgaccgtac agaagcagca gaagttccgg aagttaacag gcgtcgcgga 4140 ttagggagga gtgt 4154 // ID Gypsy-57_AA-I repbase; DNA; INV; 4874 BP. XX AC AAGE02020632; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-57_AA_; KW Gypsy-57_AA-LTR; Gypsy-57_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4874 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020632; Positions 21189 26062. XX CC Positions [3885-4361] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 775..2442 FT /product="Gypsy-57_AA-I_1p" FT /translation="MEDSRPLPQFRCKEIEKQKLFKEWKTWKGALECYFDA FT YGICDQKQRRAKLLHLGGPQLQRVFHNLPDRENFPLVSLERQWYDVAVNAL FT DAFFQPYRQDCLERYKLRQIKQKEGERFADFILRLRQQISDCGFDKYPEEV FT KEVLSEIFLIDTIIEGCQSEELRRRILQQDRCLAEIESLGVTLEGIDHQVK FT GFNRRNEQAGFGQQVLRIGNESGSLQKKFRYQSTGFKGKFSGKDGYTCYNC FT GRRDHISTSEICPAKGKVCHACKRVGHFESSCRSRKRKQPQLTKNVRAVEE FT SAPELAVQEAESSKTYYTFHSGNESNVVPCVVGGIQLDLLVDSGSDVNLVP FT ANVWENLKQACIVVHQCVKGSSKILKGYANDNPLPILGTFVADVKVGHKAV FT RAEFFVIKGGQRSILGDQTAKQLGILQVGLQVNSIAHQVEPFPKIKDVQVH FT IHTDPEVKPVFQPVRRVPIPLEAAVDKKLNQLLATDIIEVKHGPATWVSPL FT VVVGKSNGEPRICLDLRRVNKAVLRERHPMPVVDEYMARLGEGKIWSKLDI FT KDAFLQVGR" FT CDS 2610..4853 FT /product="Gypsy-57_AA-I_2p" FT /translation="MDQILAGCTGTYWYLDDVVVEGKDLQEHDERLNEVLK FT RFNDRSVQLNWEKCVFRAKKIDFVGYTISAEGIYPLSSKIETILSFRQPDS FT ESEVRSFLGLANYLNKFIPDLATLDEPLRQLTRKGCAFVWTEQHQQSFEQI FT KAAMANAGSLGFFNTEDRTAVMADASPTGLGALLLQTDRSGCSRVVSCASK FT SLTETENKYCQTEKEALAIVWSVERFYTYLYGNEFDILTDCKALEFLFTVR FT SRPCARIERWVLRLQAFDYRVIYIPGEKNVADVLSRLATLKPIPFDPSEEI FT AIRQVALYAATTAALDWKEIVKESREDPEICQILECLDQDKVDEIPLAYRI FT VANELCRCDDVLLRTDRIVVPAVLRHRVLCVAHEGHIGMRMMKAHLRCAVW FT WPKMDSAVEDFVKKCRSCALVAAPDPPVPMIRKEMPYGPWEEVAVDFLGPL FT PEGQTLFVVVDFYSRFIEICEMTQITANDTILQLSKIFCRYGVPITLRADN FT GPQLSTSCEEFSTFCEEMGIRLVNTIPYWPQQNGEVERQNRSILKRLKIAQ FT ELGQDWRRVLDQYILSYHSTPHPTTGRCPFDLMFGRRIRSKLPQIPREVSA FT DEGVSDRDREQKEKGKVYADQKRRACQSNIEVGDKVLVKRFRKENKLSTNF FT SPEEHSVVRRRGADVTIKSLETGKECRRNISHLKKLSDVTSTCESSNDAVP FT SNEEAEHNLVAGNEDIQVSTEARELIDRRLHLQRNRKEPARFLDYLPH" XX SQ Sequence 4874 BP; 1419 A; 887 C; 1260 G; 1308 T; 0 other; caaagtggcg acgaagatag aggattgaag gtaagaaatg gatttccgaa aaatagtgtt 60 ttgagattat cattatacca ttgtgttccg ctggcaaaga ggtggaagat ggcgtcagat 120 tttcttggct catactttca ttttactgac gggcaatagt atgtgtgtaa tgccgatttt 180 ctgaagtcga aggggtgtat gagaaaatgt tgacctgttg aaaatgttcg gatgaaataa 240 aataaacagg ggagcagttg atgatagtct ttgagatgga tgctaacaaa acattgactt 300 atatgctaca aatggggagt gataagaaga agcgttattt tctagagtgt acctgttggg 360 gcaacaggag gggatcaaaa aggaaatttg gtcccgggtc acagtgaagt tgtttgtacc 420 tgtaagtgta caggaggtga atgtctattc acgggtaccg ttgtgaaata tgaatgtacc 480 tgttggcata gcaggggggg agtgtgcgat atgctcccgg gtataattta caaattagta 540 tgtgtacctg gtcaatggac aggaggtgaa atataatttc acgggttcgg ttgcaaatgt 600 attctaaaag ttaattagtt gaagtatacg aatcaaataa aatatgattt tctcgtgttt 660 tgctttatta ttttatatga cgaattgaga taaacaaata tgacaataaa catattcaaa 720 tgcaagctcg tgaaacggtt ggtttagaat ttcgaaattg tgttataatt gcagatggaa 780 gactcccggc cactcccaca atttcgttgc aaagaaattg aaaagcaaaa gcttttcaag 840 gagtggaaga cctggaaagg agctcttgag tgctactttg atgcttatgg aatttgtgac 900 caaaagcaga gaagagcgaa gttactacat ctaggcggtc cacaactgca acgggtgttc 960 cacaatttgc cggacagaga aaacttccca ttggtttctc ttgagaggca gtggtatgat 1020 gtggcggtca atgcattaga tgcattcttc cagccgtatc gccaagactg tttggagagg 1080 tacaagttgc ggcagataaa gcagaaggaa ggagaacggt tcgccgattt cattctacgt 1140 ttgagacaac agatttccga ttgcggcttt gacaaatatc cggaagaagt taaagaggtg 1200 ttgagcgaaa tattccttat cgatactatc atcgaagggt gccagtctga agaactgcgg 1260 cgtaggatac tacagcaaga taggtgcttg gcggagattg agtcgttagg tgtgacgttg 1320 gaagggatag atcatcaggt gaaaggtttt aacaggagga acgaacaagc aggttttgga 1380 cagcaggttt tgagaattgg aaacgaatcg ggttcacttc agaagaagtt tcgctatcag 1440 tctacgggat tcaagggaaa gtttagtgga aaggacggat atacttgcta taactgtggt 1500 cgccgtgatc atatttctac ttccgaaatc tgtccagcaa aaggaaaagt ttgtcatgcc 1560 tgtaaacgag ttggtcactt cgagtcgagc tgtcgatccc gaaagaggaa gcagccacag 1620 ctgacgaaaa acgtgagagc tgtagaggag tccgcccctg agctagctgt ccaagaggct 1680 gaatcatcga aaacctacta tacattccat tccggcaacg aatccaacgt agtgccatgt 1740 gtggttggcg gaattcagtt ggatctacta gttgactctg gatctgatgt gaatcttgtt 1800 cctgccaatg tgtgggagaa cctcaagcag gcttgtattg tagtacacca gtgcgtgaaa 1860 ggcagcagta aaattctcaa aggctatgca aacgataatc ctctgccgat acttggaact 1920 ttcgtggctg atgttaaagt tgggcataag gctgttcggg cggagttttt cgtcatcaaa 1980 ggtggacaaa ggagcatttt gggagaccaa acggccaagc aactggggat ccttcaagta 2040 gggctgcaag tcaatagtat cgcacatcag gtcgaacctt ttccaaagat taaagacgtg 2100 caggtacaca tccatacaga tccggaagtt aagcctgttt ttcaaccggt tcgacgcgtc 2160 ccaattccac tcgaggctgc agtggacaag aagttgaatc aactgttagc aacggatatt 2220 attgaagtca agcacggccc tgccacttgg gtttcccctc tggtcgtagt gggaaaaagc 2280 aatggagagc cgagaatttg ccttgatctg cggcgagtga acaaagcggt tcttcgtgaa 2340 cgtcatccga tgcctgtggt ggatgagtac atggccagat taggtgaagg taagatttgg 2400 agcaagttgg acatcaagga cgcattcctt caggtaggtc gttagagatg actaaataaa 2460 gcaattgcaa ctatctatga tgtgtttgtc aggttgaatt agctcccgcg tcaagagatg 2520 tcaccacatt tatcacgaac aaggggcttt ttcggtttaa aagacttcct tttggattgg 2580 tctgtgctcc ggaactgttc caaaaggtca tggaccagat attagcgggt tgcactggaa 2640 catattggta tctagatgat gtagtagttg aaggaaagga tctacaagaa cacgacgaaa 2700 ggcttaatga agtattgaaa cgttttaacg atcggtcagt ccaactcaac tgggagaaat 2760 gcgtatttcg tgcaaagaaa attgattttg tgggttatac tatttcagct gaaggtatat 2820 atccattgag ttcaaagatt gaaacaatac tgtccttccg tcaacctgat tctgaatcgg 2880 aggtccgcag tttcttggga ttggcaaatt atttaaacaa attcattcct gatttggcta 2940 ctctcgatga acctctgcga cagttaacaa gaaagggctg tgcgtttgtt tggactgagc 3000 aacatcaaca atcttttgaa caaatcaaag ctgcaatggc caatgccgga tcattaggat 3060 tcttcaacac tgaagaccgc actgcagtca tggctgatgc tagtccaacg gggctagggg 3120 ccctgctcct tcaaacagat aggtcaggat gtagtcgagt cgtcagttgt gcctcaaaat 3180 ctttaacaga gacggagaac aagtactgcc aaaccgagaa ggaagccttg gctatcgtat 3240 ggagcgttga gcgtttttac acctatttgt acggaaatga attcgatatt ttgactgatt 3300 gcaaggcctt ggagttcttg ttcaccgtgc gctcgagacc ttgtgctaga atcgaacgtt 3360 gggtattgag gcttcaagca ttcgactaca gagttatcta catacctggc gagaaaaacg 3420 tagcagatgt tttatctcgg ctggcgacac tcaaaccgat cccatttgat ccatcagaag 3480 aaattgcgat cagacaagtt gctctgtatg cagcaacaac agctgcatta gactggaagg 3540 aaatagttaa agagtcacga gaagatccgg agatttgcca aattctagag tgtctggatc 3600 aagataaggt cgatgagatt ccgttggctt atcgtatagt tgccaacgaa ctatgtcgtt 3660 gtgatgatgt ccttttgaga actgacagaa ttgttgtacc tgctgtgctg cgtcaccgtg 3720 tgttgtgtgt tgcccatgaa ggacatatcg gtatgcgaat gatgaaggct catttgcgtt 3780 gtgctgtttg gtggcctaaa atggattcag cggtagagga ttttgtgaag aaatgccgct 3840 catgtgcttt ggttgctgct ccggatccac ctgtaccgat gattcgtaag gagatgcctt 3900 atgggccctg ggaagaagtt gcggtagatt ttttaggtcc attaccagag ggacagacac 3960 tgttcgttgt tgtggatttt tacagtcgct tcatagaaat ttgtgagatg acacagatta 4020 cagcaaacga cactatttta cagctgtcta agattttttg ccgttacggg gtgcccataa 4080 ctttacgcgc agataacggt cctcagttga gtacttcttg tgaagagttc agcacttttt 4140 gtgaagagat ggggattcgt ctggtgaata ccattccata ctggccgcag caaaatggcg 4200 aagtggagcg acagaatcga tccattttga aacgactaaa gattgctcag gagttaggtc 4260 aagactggag aagagttctg gatcaataca tcttgtccta ccattctact cctcatccta 4320 caacaggacg gtgtcctttt gacctaatgt ttggaaggcg aattcgcagc aaactaccgc 4380 aaataccacg tgaggtttca gccgatgaag gagtcagtga tcgcgatcga gaacaaaaag 4440 aaaaggggaa ggtttacgct gatcaaaaac gacgagcttg tcaaagcaac attgaggtgg 4500 gcgataaggt attggttaaa agatttcgca aagagaataa gcttagcaca aatttctcac 4560 cagaagagca ttctgttgtt cgaagacggg gagcggatgt gaccataaaa tcgttggaaa 4620 ctggtaagga atgtcgccgt aacatctcac atctcaaaaa gttgtcagat gtgacttcaa 4680 cttgcgagag cagcaatgat gcagtgcctt caaacgaaga agcagaacac aatctagttg 4740 ctggaaatga ggatattcaa gtttcaactg aggctaggga actaatcgat cgaagattac 4800 atttgcaacg aaataggaaa gaacctgcta ggtttttgga ctacctgcca cactaaaata 4860 agttaagggg gggt 4874 // ID P-1N_HM repbase; DNA; INV; 2534 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type non-autonomous DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-1N_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2534 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 347-347 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 2534 BP; 963 A; 325 C; 329 G; 915 T; 2 other; acggctcaag tataaaaact tcatttttgt tatggttaag agtcttgttt gcactatagt 60 tgttttatct ttattttatt caaatggttt aataatagtt agattttctt tgagaactta 120 cattattttg tgctagattt tttgttaact gattgcctgc ttcagtttat tccaataaca 180 aagaatactg aaataggctt tcagatagct atatcaacac ataccaacat gctactaaaa 240 ataatcttac atattgctaa aactactcac aattctccat tggaaataat tttaaaggtt 300 tgtaattttg tatttaatat aattgcaatt attaagtaga aaacataaac ttttctttaa 360 aatatttaca ctgcaaaaca aggttttcca atgtaaatac tttattatgc tgcatgtctc 420 aaacatacag taattaaaat gcaacaaatc aggccattta tttacagtag gcaagaaata 480 tcaggcatac caactgtttt ggagtcaata gctgttaata aatttctaat ttttttcatt 540 gccaaataaa tgcttatcat atttctttat ttaaatggtt tcgcagtggt gaaaattaca 600 aattgaaaaa ctataatata ttagaaaaat cattgacaaa aatttggtat caagagatta 660 tttgcttctt gtgggtaaga tgtacttcca aaaagcagcc cattatcaaa gtggtaaacg 720 atttgatgaa gattgagaaa ataatctttt taaagaagtt tttaaagaag ttttttgttt 780 ttatgattgt agggcttcaa aaatttattc ttatgttttg aaatcctcat ctaaagttta 840 cataactggt ttatgtttga aaaaagaaat tgatgagtat attacctcaa tgcaccaatc 900 tgaatttaag ataagagctg ttgttaccga taaccattct acaaatgtta atgcattttt 960 gtagtttaat aaagcatata ctggagatgg aaaagtatta atttatcatt tagtttagaa 1020 aaggagtgtt taaaacatac ttgttattct ctgtttttgt ttttttcttc ttgttgaagt 1080 tactgctgat tacattgcat ggaatatatt tcttaaagtt tattaaaaga tcaattatta 1140 aacaggcaag aaaaagcaat aaaaattaat tatcaagtca catattttgg aaacaacaaa 1200 caagttgtca acattgacaa tatttgacga aacaactgca gctgtaattt agagttattt 1260 ttctgaaaga aataacgcag cacaattttt atcattgttt tataagttat ttatcatact 1320 taattcaaag cagaaactta attcatccaa tcaaactcag cartgttgct ttgaaaggtg 1380 ataacaaacc cacactttat agagaaattg caaattgtac tgaaatttga tcaaaataaa 1440 atttgataaa tactttacat taaccaaaca aacttcttgt gctctaatta caacactaca 1500 agcaattgta tttcttatgc atgatttcct tgaagaaggc tttacatttg tttatagttc 1560 caatttacaa agtgatcctt taaagctagg ttttagtaag tatagttaaa tgaccagtag 1620 tagacttttt gtgagtcttt tagaaatatg cattagtaaa aagattttgg cagattttcc 1680 cagaaaatat ttatcagaat tttgataatc ctagttaaaa aatattatgc aagatattkc 1740 ttagttgtct acataaatac tactgtcagc tttctgaaga tagcacagat atatatacta 1800 aagcaagtaa taatggtaaa aattaacaaa aagattttaa tgtaaaactt atgatcaata 1860 aacgttactt tataaagaga agaacctata tattcaattc aacctctgtc taattttatt 1920 tgtcatcttt ttagtatttt atgttatttc tctaataaaa cattgcagtc attctttttc 1980 aattaaaaat ttgtgaaaca aattttaaag atttgatcta gcagttttaa ttttacaaat 2040 aaaaaccacc aaaaatgggg aatcaaatat agatttcgta ttgttaacaa tattttttat 2100 aataatttgc aaaaaaaaat tacatattca ttgagaaaag atcaagtaaa agaattccaa 2160 aaaagacaaa gacaaaataa acatttctaa caaaaatttc ttcttttaat ttcaattata 2220 aaaaatacaa aagtacaaac cttttttttg ttcaatagag agttttaggc aatcttagca 2280 acatgaaaga ttatttttag tagcatgttg gtatgcgttg atatagttat ctgaaagcct 2340 atttcagtat tctgtgttgt tggaataaac tgaagcaggc aatcagttaa cagaaaatcc 2400 agcacaaaat aatgtaagta ctcaaagaaa atctaactat tattaaacca ttttaataaa 2460 ataaaaataa aacaattata gtgcaaacaa gactcttaac cataacaaaa atgaagtttt 2520 aatacttgag ccgt 2534 // ID DNA4-5_AP repbase; DNA; INV; 158 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-5_AP. XX NM DNA4-5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-158 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1742-1742 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4 bp TSD (TATA). CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 158 BP; 38 A; 38 C; 47 G; 35 T; 0 other; tactccggcc cacgagagtc catatgtcag aacgcgtccg tcactgcgca tcggcacgtt 60 gccgaatagt gggaatggca tcatggtggg agaaaacaga cgtcacgtat tactatatga 120 atgcgcggtt tacgtatgga ctctcgtggg ccggagta 158 // ID BEL-55_AA-LTR repbase; DNA; INV; 597 BP. XX AC supercont1.317; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-55_AA_; KW BEL-55_AA-I; BEL-55_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-597 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.317; Positions 121981 121385. XX SQ Sequence 597 BP; 175 A; 122 C; 138 G; 162 T; 0 other; tgtttgcgta caaatgcgaa gtggtacggt attatactcg acgagcgatg caacatgtcg 60 atttagattg gatgtcgcac cacccctgag tcgtcggatc tacgagttgt tggatcttca 120 acgggagtcg aatcagttca tcttagtgct catagtctct gagtactagc tgcgagttac 180 tatgtacttt atcatccaga gcgggagaac gtacgatcct caacgagagc acgtagtaca 240 ttctcgatag atgagccagc gaatcgtcgg gaaaaccaaa ggacagacag tacgcattta 300 cacctcgccg ggatcggtag ttaactaatc agtgtgagca tttatgaaga tgtagagtgt 360 gatagcaata aaatttctca ttagtagtgt aagaaattgc gaataaatca aattaaacgt 420 aaagtagtgc caataaacga tacgtagtgt ttcctaaact gtgaatagtg gttgttatct 480 gcttcgatac gaaaaggtta ctccgaacca tctaattggc tagctaacca tcctctcgac 540 actggagtcg attgatgaca aagtgatgta agtcctcttc tcgccctgac cgctaca 597 // ID BEL-236_AA-I repbase; DNA; INV; 6741 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-236_AA_; KW BEL-236_AA-LTR; BEL-236_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6741 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 927-927 (2011). XX DR [1] (Consensus) XX CC Positions [5673-6278] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 1356..2519 FT /product="BEL-236_AA-I_1p" FT /translation="MPPTKKGSAALRQLKTQLEEVTASFDDITSFIQQFNG FT NTTASQVEVRLERIDELWDRFSEALVDLKSHEEFEDEEKYYNKLRTVISNR FT YYEGKSFLKDKAKELQEPNDSDTSIHDSSAVGVLEHVRLPQINLQKFSGEI FT DDWLSFRDLFASLIHWRTDLPEVEKFHYLKGCLQGEPKTLVDSLKITKANY FT QIAWDILCKRYNNNKLLKKKQVQSLFKLPTLSKESVSELRSLVEGFDRIVQ FT TLDQIIQPEDYKDLLLVNLLTLRLDPYTRRAWEECAASKEQDALKDLTEFL FT RKRIQVLESLPPKPADSKVFQQANPKPKPTAVKTSYNSVQSTGGRCVACKD FT NHFLYQCSAFHQLSVADRDSLLKSNGLCRNCFRSGHQARDCQSKF" FT CDS join(2934..4277,4281..6644) FT /product="BEL-236_AA-I_2p" FT /translation="MVLRSRVSDFSREMSFLVLPKVTANLPTTSINTEGWN FT IPDGIHLADPAFFQSNDVDIVLGIEAFFDFFETGRRISIGDQLPILTESVF FT GWVVSGGCSSSSVSQQINCHFSATTELEKLMEKFWSCEEVEFANSYSPEEQ FT LCEDLFQRGVQRGSDGRYTVPLPKNEGVLSALGESWDIAFRRLLGTERRLA FT RDENLRKQYVAFMDEYLNLEHMRKVEDVGEESVKRCYLPHHPVVKEASTTT FT KVRVVFDASCKSSTGVSLNDALLVGPVVQEDLRSIILRSRTRQILLVSDVE FT KMFRQILTTPEDRPLQSILYRFSPDEEVAVFELNTVTYGTKPAPFLATRTL FT QQLAADEKTNYPLAARAIGEDVYMDDVITGTDDLDTAISLRTQLSDIMECG FT GFKLRKWCSNVQRVLEGVPTENLAIPDSTGINLDSDKSVKTLGLTWMPATD FT TLRQFDIPLLDPPIISTKRRILSNIATLFDPLGLIGATTVSAKIFMQRLWT FT LKDAQGKRLDWDQPVPSTVGEEWRKFHEELPLLNEIRVDRCVIIPNAISVE FT LHCFSDASEKAYGGCVYIKSQDSNGTVQIRLLSSRSRVTPLRSQSIPRLEL FT CGALLLSELFEKVKNSTRLSVPTFFWTDSTCVLRWIAATPTTWTTFVANRV FT AKIQTITEGWNWRHVAGTDNPADIISRGLSPKEIVHNKLWWEGPDWMKADR FT ENWPCGGADDLEVGEEERRRTTVVCVASPIAQFNKYYLSKFGSFADLTRRT FT AYWLRLMKLLQTPKEHRSENSFLTTAELKRAEYTIVQSVQREIFVDEFKAL FT EKGESVSKRSPLRWFHPYIDADGLLRVGGRIKNSDELEGTKHPAVLPAHHE FT LTRMIMRHFHERLMHAGPQLLLGVVRLQFWPLGGRNVARQEVHRCVKCYRT FT KPAIVQQFMGELPSSRVTASRAFTKTGVDYFGPVYIRPAPRRTAVKAYVAV FT FICMCTKAVHLELVSDLSTERFLQALRRFVSRRGRPSDMFSDNGTNFVGAR FT NKLNELFRMLKDSGHHDRISTELAVDGIQWHFNPPSAPHFGGLWEAAVRSA FT KNHLLKVIGENVVSSEDFSTLLTQVEACLNSRPLVPMSDDPDDLRPLTPAH FT FLVGSSLQALPDNELSDIPSNRLNKFQLIQQRLQHFWNRWRREYLSQLQAR FT TKRWKPAIAVEEGKLIVIKDENVSPIRWKMGRIIKLHPGEDGVTRVVTVKT FT ATGELKRPVEKICILPIPTESEDNAS" XX SQ Sequence 6741 BP; 1818 A; 1584 C; 1610 G; 1710 T; 19 other; ttttggtcct tcgagccgga ttccgaaccc ggggaccagt tggaaggaag atcaccattg 60 ttgcttcgat cgaagcgcgw ggatgtcgcc atcgcgccga cggtaaatcg ccatctcaac 120 aaggattgaa twgaggwwcc gccatcgaag tcaagttttg ttaaggatcg ccattcacgg 180 agsataaagg gaaacawctt ctgcattgtg mwgcwacatt gcgccaaatt ccccattacg 240 mttgggctgt tgggagccka acctcattca acaaaggctg aaggattccg atgaagagcg 300 attggattgg aacgaatttg gactgcttga cgaattgagg acgctttaga ggatcctcgt 360 tgcgctggac caccgaaccg gccggagtat ctggtttctc aggtaaatac tctttagttc 420 ggtgtatgtc tagccgaagc attgcttgct cgttactaac tccaattttc gacgcatgtt 480 attggattta ttgcatgaat ggatgcaagc gacaaccgtc gttatcgaac cgtgatttat 540 tggaggctac ttcggatcga atttgtggaa ctagtggacc gacgagtacc ccctcgaggc 600 caggacgatt gcggtggaca gtttggaaga gggagcgctc tctacctccc ttcttggtga 660 gtgaacttag tatatgtact gcactgccga agctaggcat atagataata cacaattttg 720 atggaatwtt atgtgaatcc tcttcattta ctgccacgca agccctgttt gagaactgct 780 cgttagagca agtctgctgc atattaccca aagtgcattc tcgacttttg gatctggttt 840 tgcggagttg ttagtcctcc aggttagttc tacggtatat gtcctaccga agctacgcct 900 acgattacta catgccattc attcctcttc ataccactac acactattgg aacacgagtt 960 atttcccaat tgagctggaa ccccattgga acgttcgaca agcaaccatt ggagtttgtt 1020 gtatttggcc atatattacg gatttcggga tttgtaccgc tgttggtacc tccaggtagg 1080 ctgccacagt atatgcccgg ccgaagccaa ggtttctccg gatattacac ccaccttttc 1140 ctcttctttg aacccaaact ggaactgagt tacgaccctg ccaatattga tgggacgttt 1200 cggaattttg acgttttttt tggaaagttt ggattactag aacagttacc tgttctgtca 1260 ggtcagtgtt ttaggacagt atatgtccgc cgaagcaagt ccatacgcgg catactacat 1320 aattcttttc tttgccaacc aatcgacgtc acatcatgcc accaaccaaa aagggatccg 1380 ctgctttgag gcagctgaag actcagttag aggaagtaac ggcttctttc gatgacatca 1440 catctttcat ccaacaattc aacggaaata ctacggcttc tcaggttgaa gttcgtttgg 1500 agagaatcga tgaactgtgg gataggttca gtgaggcttt agttgatttg aagtcgcatg 1560 aagaatttga ggatgaggag aaatattaca acaagcttcg aactgttatc agtaaccgat 1620 attatgaagg gaagtccttc cttaaggaca aggccaagga acttcaagaa cccaacgatt 1680 cggatacatc cattcatgat tcgtcagcgg tgggtgttct agaacatgtt cgtttgccgc 1740 agattaatct ccaaaaattc agtggtgaaa tcgatgactg gttgagtttt agggacttgt 1800 tcgcctcgtt aattcactgg cggactgatc ttcccgaggt agaaaaattc cattacttaa 1860 agggatgtct tcaaggcgag ccaaaaactc tggtcgactc gttaaaaatc actaaggcca 1920 attaccagat tgcttgggac attctgtgta aacgatacaa taacaacaag ttactgaaga 1980 agaagcaagt ccagtccctg ttcaagcttc cgaccctctc caaagaatcc gtttctgagt 2040 tgcgaagtct cgtcgaaggt tttgatcgaa tcgttcagac actcgatcaa atcatacaac 2100 ctgaggatta taaggactta ctactggtga atctcctgac tctgcgactg gacccctata 2160 cccgccgtgc atgggaggaa tgtgcggcat ccaaggagca agatgcactg aaggatctaa 2220 ccgaatttct tcgtaagcgt atccaagttc tggaatcact cccaccaaag ccagcggatt 2280 ccaaggtttt tcaacaagct aatccaaaac ctaagccgac ggcagtgaaa accagttata 2340 attcggttca atcgactgga ggaagatgtg tggcttgtaa ggacaaccat tttctatacc 2400 aatgttccgc attccatcaa ctttccgtgg cagacaggga ttcactgctg aaatccaatg 2460 gwctttgccg aaattgcttt cgttctggtc atcaggcaag ggactgccaa tccaaattcw 2520 cctgtcggat ttgcaggaaa cgtcaccaca ccctggtttg cttcaaatcg gagaaggaaa 2580 atgctacaac ggtcgtagcg gttgctgggg gcaaccatcc ttcaaactcc aaggattccc 2640 atggttcaac tggttccaat ccaactcttg tggccaacat ggcggccacg gacgtttcga 2700 catgcaacat tgctcagaag gtgtcctcga gaattctact cgcgacggca gtggttatcg 2760 tcgaagatga gatcggtgca aggttcaacg ctcgagctct attggactct ggatcggaga 2820 gcaactttgt gtcgsaacga ttatcccaac ggatgaagst ctccagaaac aaggtcgacg 2880 ttkcggttak cggcatagcg cagggaacag ccaaggttaa acatacagtc caaatggtac 2940 ttcgctcgcg agtttcggat ttctcgcggg aaatgagctt cctggtacta cctaaggtaa 3000 ctgcgaatct acccaccact tctatcaaca cggaagggtg gaatatacca gacggaatcc 3060 atctggcgga tccggcgttc tttcaatcca acgatgtcga cattgtgcta ggaatagaag 3120 cctttttcga tttctttgaa actggaagga gaatttcgat cggcgatcaa ctcccaatcc 3180 tcacggaatc ggtattcgga tgggtagtca gtggtggctg ctcgtcttca agcgtatccc 3240 aacaaatcaa ctgccatttc tcagcaacca cggagctgga aaagttgatg gagaagttct 3300 ggtcctgcga agaggtagaa ttcgctaact cctattcccc ggaggagcaa ttatgtgaag 3360 atttgttcca acgtggagtc caacgaggtt cagacggccg gtacaccgtc cctttaccaa 3420 aaaacgaagg tgtcttgtca gcattgggcg agtcatggga catcgctttc cgacgccttc 3480 ttggaacgga gagaaggttg gcaagggatg agaatctacg gaaacagtac gtcgcattca 3540 tggacgagta cttaaacctg gaacacatgc ggaaggtcga agatgtcggc gaagaatcgg 3600 tcaaacgatg ctatctacca catcacccgg tggtcaagga ggccagtact accacaaagg 3660 ttagagttgt cttcgacgct tcttgcaagt cgtcaactgg agtgtccctg aatgatgcgc 3720 ttctggtagg gccagttgtg caggaagatt tgaggtccat catcctacgg agtcggacga 3780 gacagattct cctcgtatcg gacgttgaga aaatgttccg acagatcctt actacaccag 3840 aggatcgacc tttgcaatct atactctacc gattttctcc cgacgaggaa gttgcggtgt 3900 ttgagttgaa taccgtaacc tatggaacaa aacccgctcc tttcttggca actcggactc 3960 ttcagcagtt ggcagctgat gagaaaacaa attatccgct cgcggcacga gcgattggtg 4020 aagacgtcta tatggatgat gtcatcactg gtacagacga cctggatact gctatttccc 4080 tcagaaccca actgtctgat attatggaat gcggtggttt caagctgaga aagtggtgct 4140 ccaatgttca aagggttttg gaaggcgtac ctacagaaaa cttggctatt ccggacagta 4200 caggcattaa tttggactcc gacaaatccg taaagacgtt aggactgact tggatgcccg 4260 caaccgacac cctacgatwc cagtttgata ttcctctact agatccacca atcatcagca 4320 caaaacgtcg tattttgtcc aacatcgcta cattgttcga cccccttggt ttgattggtg 4380 ccacaacggt ttctgccaag atcttcatgc agcgactatg gacgttgaag gacgctcaag 4440 gcaaacgctt ggattgggat caacctgtac cttcaacggt gggtgaggaa tggcggaaat 4500 ttcatgaaga actccctcta ctaaacgaaa tccgcgtcga tcgctgtgtc atcattccaa 4560 acgctatttc ggtggagcta cattgttttt ctgacgcctc ggaaaaggcg tacggcggtt 4620 gcgtctacat aaaaagccaa gattcgaacg gaacggtcca gatacgacta ctatcatccc 4680 ggtctagggt aactccactc cgtagtcaat caattccaag acttgaacta tgcggtgcac 4740 ttctattgtc cgaactattt gagaaggtca aaaattccac aagactgtcg gttccaacgt 4800 tcttttggac ggattcaact tgcgtactcc gttggattgc agctacaccc acaacttgga 4860 ccacattcgt tgctaacagg gtggcgaaga ttcaaaccat aacggaagga tggaattgga 4920 gacacgtagc tggaacagat aatccagcag acattatatc ccgcggacta tcaccgaaag 4980 aaattgtcca caacaaactc tggtgggaag gcccagattg gatgaaggca gaccgagaaa 5040 attggccttg tggcggtgct gatgaccttg aagtaggaga ggaagagaga cgtcgcacaa 5100 cggttgtgtg cgtagcttca cccatagccc agttcaacaa atactacctt agtaaatttg 5160 gatcattcgc agacctcact cggcgaacag cgtactggct gaggctaatg aaacttcttc 5220 aaaccccaaa ggaacatcgt tccgaaaata gtttcttaac caccgcagaa ctgaaaaggg 5280 ccgaatatac tatcgttcaa agcgttcagc gagaaatctt cgtagatgaa ttcaaggctc 5340 tggaaaaagg ggaatccgta tccaaaaggt caccgctgag gtggtttcac ccatatatcg 5400 atgcagatgg attgctcaga gtcggaggga ggattaaaaa ctcggatgaa cttgaaggta 5460 ccaaacaccc agcagtctta ccagcacacc acgaattaac tcggatgatc atgagacatt 5520 tccatgaacg gcttatgcat gctggtccac agctattact aggagtagtt cgactgcaat 5580 tctggccact tggagggagg aatgtagcca gacaggaagt tcatcgttgc gtaaaatgtt 5640 atcgcacaaa accagcaata gtgcaacagt tcatgggtga actaccatcc tcacgagtta 5700 ccgcttcccg ggcgtttacc aaaactggcg tagattattt cggacccgtc tacatccgac 5760 cagctccgag gcgcactgcc gtgaaggcct acgtagctgt tttcatttgt atgtgtacta 5820 aagcggttca cttggagctc gtatccgacc tatcaacaga gcgattcttg caagcgcttc 5880 gaagattcgt atcaaggcgt ggaagacctt cggacatgtt ttctgataac ggtacgaatt 5940 tcgtcggagc caggaataaa ctgaatgaac tcttccgaat gctgaaagac tccggacatc 6000 atgaccgaat ctctactgaa cttgccgttg atggaataca gtggcatttt aaccccccga 6060 gtgcgccaca ttttggaggg ctttgggaag cagcagtccg gtcagccaaa aaccatttgt 6120 tgaaggtgat cggagaaaac gtagtttcat cagaagattt ctccacgctt ctaacgcagg 6180 tcgaagcctg cctcaattcc cgtccgttag tcccgatgtc agatgatccc gatgatctgc 6240 gacccttgac tccagcgcat tttcttgtgg gttcctcgct tcaggctctc cctgataacg 6300 agctgtcgga catcccatcg aaccgattaa ataaattcca actaatacag caacggctgc 6360 aacatttttg gaaccgatgg cggcgagaat atttgagcca actgcaagcc agaaccaaac 6420 gttggaaacc tgctatcgcc gtagaagaag gaaaacttat cgtcatcaag gatgaaaatg 6480 tctctccgat tcgctggaag atgggaagaa ttataaagct gcaccctggt gaagatggag 6540 ttaccagagt cgttacagtg aagaccgcca ccggtgaatt aaagcgtccg gtggagaaga 6600 tatgtatttt accgattcca accgaaagcg aagacaatgc gtcataaacc tgctccatat 6660 ccattcccga actcccatcc tgtcgaagag gattttcttg tttcttttca gaaattcgac 6720 gtatttctgg gtgggtgagg a 6741 // ID Copia-126_AA-LTR repbase; DNA; INV; 117 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-126_AA_; KW Ty1_copia_Ele160; Copia-126_AA-I; Copia-126_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-117 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 117 BP; 40 A; 25 C; 19 G; 33 T; 0 other; tgaagtagaa aacgaagcaa cctatgactt gtaaactaga ataagatttt gtaaaataaa 60 cttcacttta gttccgtcca cacgcgagac aagacgtgtt tctctattct ctgccca 117 // ID DNA2-2_SM repbase; DNA; INV; 288 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-288 RA Jurka J.; RT "DNA transposons from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1834-1834 (2009). XX DR [1] (Consensus) XX CC Preliminary classification: Mariner/Tc1 element. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 288 BP; 113 A; 46 C; 41 G; 88 T; 0 other; acctagccct cgatttacac ggtttcggtt taacacgatt gaagaaatgc ggaaaaattc 60 aatttatatt acactagttt tcagtttaac acggtcacaa aaatttttga aaaaaaattt 120 aaaacaaaat tagaattcaa aaaaccaaaa atatacacaa tctcagtagg ctaatatata 180 tctcatttga aaatttagaa tattttaatc taagtaagct aatacacggt ttcgatttac 240 acggcatttt tgaggaacca atctaccgtg taaatcgagg gctaagtg 288 // ID Transib1_NVi repbase; DNA; INV; 3506 BP. XX AC . XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE Transib-type element: consensus. XX KW Transib; DNA transposon; Transposable Element; Transib1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3506 RA Jurka J.; RT "Transib1_NVi: Transib-type element from Nasonia parasitic RT wasp."; RL Repbase Reports 7(11), 1181-1181 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1016..2452 FT /product="Transib1_NVi_1p" FT /translation="MNTFSIQNGILCDILLKLDNKSRSVNCLIDYIKAQYE FT LSDEESSIVLESSFMVTFKSKLSAVLYKEHIFRKKYVSWIADNFVVNFDNN FT LKRGRPVTDSFEGSSRSTKYRIIQSITSSFSQEEIKKAFYKNLRDSGKKHL FT IKQIEDLLSDPNCSMNNSVSEEIIPFTEDEAIALIKDAKLSKWQYDTIRKQ FT VKGKNVDIFIPYKQLFSAKENCYPISSSLSISEKGGSVDLQQLLNHTSKRL FT LQIPDILSNFKTSEAEYSLVLTSKWGCDGSSDHSQFKQIFSDGTSSDESIF FT MMCMVPLILEVNTSDGATVIWRNPQPGSTRHCRPISFVYAKEISEKTVADV FT NAMKTKIESLLPSIVNTDVNQFKIHHSLKLTMIDGKVCQAITGSPSCATCC FT ICGAKPSQMNDLYKLAERTENEENFEYGLSSLHAKIRFMENILHIAYRLDF FT CKWQATSDQQKTTNNAKKESLEFNTDSDKKLASS" XX SQ Sequence 3506 BP; 1175 A; 596 C; 597 G; 1136 T; 2 other; cgcacactgg gccctataga catttgttgt ttcaaaattc tctacagacc atggttttcg 60 accaatcaca ataaaattga agtttattaa aattctaaaa gacccataag gccagatttt 120 tgaaaaaatg ttataataat taattaaaaa tctaagttaa acaataaacc cataaacctt 180 aaggcattaa tggccataag caaatgggcg actactgaag gaagctatat atatttgtaa 240 gtggttattg gggtcgctga ttataaatct ggcattagtt ttgcaaaatt gatttgttta 300 aacaatttgt ttaaacaatt tattgcaaaa ttatcttttt attcaaaact actttagttt 360 agtctagttt tgcactgggt acttaattcc ccgtatatgt gcagcagcgt tcatcaccgc 420 accagtagta ctgcgcttgg ccgctaggcg gtgacgacac cagttttcgt atccgtcagc 480 gaagtagcgt cctgaaaggc atataatgat gcactaatcc atgaattttt accgtacttt 540 ttcgtgttaa aaatttgccg tgttagagat cacgttcgag tggcaggcta acggttaaat 600 attattataa ataagcattg aaattagact tcaaaactag aataaatatt aggagttttc 660 agcgattatc gttgcaaaca atttttcttc agccgttttc acggctacta tttgcataga 720 taattttcac gatcattatc agtcgttctc acgactactt ctccagcgat tatccttgca 780 aatagttttt tatcagccgt tttcacggct actgtttgca tagataattt ttacggtctt 840 tttcaaacag acattttcac gactactccg gcgattattg ttgcaaacag tttcttatca 900 gccgttttca cggctactgt tttcatatat agttttcagg ttaattttcg agcagtcatc 960 atcacgactg catcaaagtt acctttcatt catcccctct aagtgaaacc taacaatgaa 1020 tactttttcc attcaaaacg gtatattgtg tgatatactg ttgaaattag acaataaaag 1080 tagaagtgta aattgcttga ttgattatat taaagcgcaa tatgagctca gcgatgagga 1140 atcaagcatc gttcttgaga gttcttttat ggtcactttc aagtcaaaac tgtctgctgt 1200 actttataaa gaacacatat ttaggaaaaa atatgtatct tggatagcag ataactttgt 1260 tgtgaatttt gataacaacc tgaagcgtgg aagacctgtc actgatagtt ttgaaggcag 1320 ttcgagatct acgaaataca gaataataca gtctattact agtagctttt ctcaggaaga 1380 aataaaaaaa gctttttata aaaatttacg agattctggg aaaaagcatt taattaaaca 1440 aattgaagat cttttaagcg atcccaattg ttctatgaat aattccgttt cagaggaaat 1500 aattcctttc accgaagatg aagcaatagc tctgattaaa gatgcaaaat tgtcgaagtg 1560 gcagtatgat actattcgca aacaggtaaa aggaaaaaat gttgatattt tcatacctta 1620 taaacaactt ttttcagcaa aagaaaattg ttatcctata tcttctagtc tctccatatc 1680 tgaaaaaggt ggtagcgttg atttacaaca actgttgaat catacatcta aacgtctcct 1740 acagataccg gatattttat ctaatttcaa aactagtgag gctgaatact ctctagtttt 1800 gactagtaaa tggggatgtg acggatcatc agatcatagc cagtttaagc aaatattctc 1860 cgatggaact agctcagatg aaagtatttt catgatgtgt atggtaccac ttatcttaga 1920 agtcaatact tcagacggtg caacagtaat ttggcgtaac cctcaacctg gttcgactag 1980 gcattgccgg ccaatcagtt ttgtgtatgc caaggaaatc agtgagaaaa cagtggctga 2040 tgttaatgct atgaaaacta agatcgagtc actattgcct tcaattgtaa atacagatgt 2100 aaatcaattc aaaatacacc actcattaaa actaacaatg atagatggaa aagtatgcca 2160 agcaataaca ggttctcctt cttgtgcaac gtgttgcatt tgcggagcaa aaccttcaca 2220 aatgaacgat ttgtataaat tggctgaaag aaccgaaaat gaagaaaatt ttgagtatgg 2280 cctttcatct ttacatgcaa aaatacgatt tatggaaaac atactccaca tagcataccg 2340 tttggatttt tgtaagtggc aggcaacatc agatcaacaa aaracaacaa acaatgccaa 2400 gaaagaaagc ttagaattca acaccgattc agacaagaaa ctggcctcat cgtagattat 2460 ccacaacaag gatcgggtaa ttctaacgat ggtaatactg ctaggagatt tttccgtgac 2520 cctgcactga cagctgaaat tacaggtgtc aatgaaagtt taattacaag atacagtgta 2580 atactacaag ctcttgcttc tggatctaaa atagactccg acaaatttga cgattatgca 2640 aaagaaactg ctaagttata tgtatctttg tacgattggt tctacatgcc tgcatctgtg 2700 cataaaattc tcctccatgg tgcgaacatt ataaatcact ttctcattcc aatcggcatt 2760 ttatccgaag aagctcaaga atcacggaat aaagatttga aatactatcg tcaatttaat 2820 actcgaaaat gtggtagaat ctacacgaat atagatctat tgcataaatt attaatatct 2880 tctgatcctt atatatctag tttaaggcac aaggcaaaaa atataaagct tgatgttgat 2940 gatgctgtta agaacttatt aattttaact tgaaatgaag tgatacaatt ttaaaattgc 3000 tcagcttaag gtatttgggt ttgaggttaa aggtttacgg ttaacacnaa ttttctataa 3060 ttgcaaattg cccaccttgg agtagtgaat ttccttgttt atgctttgaa catgattggt 3120 tatagtaatt atgcccgtaa aattcaaaac taaagtagtt ttgaataaaa agataatttt 3180 gcaataaatt gtttaaacaa attgtttaaa caaatcaatt ttgcaaaact aatgccagat 3240 tcataatcag cgaccccaat aaccacttac aaatatatat agcttccttc agtagtcgcc 3300 catttgctta tggccattaa tgccttaagg tttatgggtt tattgtttaa cttagatttt 3360 taattaatta ttataacatt ttttcaaaaa tctggcctta tgggtctttt agaattttaa 3420 taaacttcaa ttttattgtg attggtcgaa aaccatggtc tgtagagaat tttgaaacaa 3480 caaatgtcta tagggcccag tgtgcg 3506 // ID Copia11-NVi_I repbase; DNA; INV; 4129 BP. XX AC AAZX01012276; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia11-NV; KW Copia11-NVi_LTR; internal portion; Copia11-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4129 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1165-1165 (2007). XX DR Genome; AAZX01012276; Positions 9148 5020. XX CC Positions [1572-2075] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 72..4097 FT /product="Copia11-NV_I_1p" FT /translation="MATDAIIRNVVKFDGKNFQQWKFLITAALQANDLLEV FT TNGESKRPGEAARGAAPEAVRETEQVIKAWIKNDAKARYLITAAMEPEQMV FT SLLTCETSMEMWDRLLTIHEQKSASHKLLVSQRFHEYRMNPKDTVVQHVSK FT VQNLASQLLDLGENIPDIVVISKIIASLPTKYRHFRSAWSSVAPERQTIEY FT LQERLVEEESYRDAEDEESSALAVTTRRADKGEISSKQNREKKPRKSKKNI FT KCYVCHEKGHYARECPSRSKPNNREQSENIALIAALESSTNREKGEIAKAN FT ADWEPSSRQKQEIMETDQQDVWFTDSGASAHISHRREWFVDYRLRRDGSTV FT VLGDDRECSVAGEGKVLVERMVDGTWQDAVIENVLHVPEMGRNLYSVGQAT FT SKNIRICFEKDTVKFSRDQKVVAVGVKQSNHIYRMFLRVKPRKLEVNAVSL FT NVWHQRLGHLHERALKHITSKNIVKGVQLNDNEIFFCDDCQFGKAHKLAFK FT QEVQRNLEPGEMFHSDVCGPMKETSLGGARYFLLFKDDATGYRRVFFLKHK FT ADVYDNFVIFEREVNNKFGRPMQILRTDNGKEFANKEMKKYLQSRGIRHEF FT TAPYTPEQNGKVERENRTVVECARTLMCAKNLPGFLWAEAVNTAVYLLNRI FT STSTQKRNKTSYEMWEKKIPDVSHCRIWGSTTFKYEPKQLTTKFDSRSTKM FT VLVGYDSESSNYRLYDPEKKKITVSRHVTFNENSDEGEATKMVADENIFMF FT NDEDGTPDPPQRDRAAREEDPEAADVDDAPENGEQAERREISPPRRPVQRD FT RRLCKRPNRYTADFCAVVVPNSFQEAVQGPNSASWREAIERELTAHRINGT FT WRLVPRTPGIKTIGSKWVFKAIPDGDAGHHKKKARLCALGYMQKEGIDYTE FT TFSPVVRYDSLRMLLSIIAHENLEMRSFDVSAAFLHGKLNEEIYMEVPQRI FT DHNELNTGGKDVVCKLDKALYGLKQAPRCWNNRFKQFLAKFNFRECDADHC FT IFVAMYDKYRVYLCLFVDDGLLACKSDAVLQMILTELKAEFSITTGDASYF FT VGLQISRNRARRSIFINQSVYVREIIERFRMSNAKAVSVPADPNVILQAAE FT EGEAMDGSVPYREAVGCLMFVAMVSRPDIMFAVGTLSKFLNRHNVAHWRAV FT KRVFAYLGGSVDLGIEYKYYLDRYEPVGYSDADYANDPDTRRSTTGYVFCI FT SGGAVTWSSQRQRLVTLSTTEAEYVAASSAAKELCWIRKMLHEIRYQCNSG FT SILWVDNQSAIKIAKNPEYHKRTKHIDIRYHHIREKCMSGEIVIKYIPSEN FT QKADILTKALPRERFAKMRESLGIIKDPDM" XX SQ Sequence 4129 BP; 1338 A; 818 C; 1097 G; 876 T; 0 other; ataataggtt atgggcccag gacaagaggt tgaaagataa aaaaaaaaaa aaaagaaaac 60 gaagagggga aatggcaact gacgcgataa tacgaaacgt cgttaagttc gatgggaaga 120 attttcagca gtggaagttc ctaatcactg ctgcactcca agcaaacgac ctgctggagg 180 tcaccaacgg cgaaagcaaa agaccaggcg aagcagctag gggtgccgct ccggaagcag 240 taagagaaac tgaacaagta ataaaagctt ggataaaaaa tgacgccaaa gccaggtatc 300 tgataactgc agccatggag ccggagcaaa tggtaagctt actgacttgc gaaacctcga 360 tggaaatgtg ggatagactt ttgactatcc acgagcaaaa gtcggcgtct cacaagctct 420 tagtgagcca gagatttcac gagtaccgga tgaacccgaa ggatacggtg gtacaacatg 480 tgtcgaaggt acaaaatctg gcgagccaac tgcttgactt gggagaaaac atcccggaca 540 tcgtcgtcat ttcgaagata atcgccagct tgccaaccaa gtatcgacac ttcaggtcgg 600 cgtggagtag cgttgctccg gaacgacaaa ctatcgagta cctgcaggag agattagtag 660 aagaggaaag ttatagagat gccgaggacg aagaatcgtc agctctagct gtaactacga 720 gaagggcaga caaaggtgaa atctcatcaa aacaaaacag agagaagaaa ccacgtaagt 780 cgaagaagaa tattaaatgt tatgtatgtc atgagaaggg acattacgcg cgtgaatgcc 840 caagcagatc gaagccgaac aacagagaac agagtgaaaa catagctctg atagcggcac 900 tcgaatcgtc aacaaaccgc gagaaaggtg aaatagctaa agccaacgca gattgggagc 960 cgtcgtcgcg ccaaaagcaa gagataatgg aaaccgatca acaagatgtc tggtttaccg 1020 acagcggcgc gtcggcccac atatcgcaca ggagagagtg gttcgtcgat tatcgtctga 1080 gaagagatgg tagcaccgtg gtgctcggcg acgatcgcga gtgttctgtg gcgggcgaag 1140 gcaaagtgct agtcgagcgg atggtagacg gaacttggca agatgcggta attgaaaatg 1200 ttcttcatgt tcctgaaatg ggcaggaact tgtactcagt agggcaggcc acctcgaaga 1260 acatcagaat ttgcttcgag aaagacacgg tgaagttttc gagagaccaa aaagtcgtcg 1320 ctgtcggagt taaacaaagc aatcacatat atcggatgtt tctccgagtg aaaccaagaa 1380 aacttgaggt gaacgccgta agcctgaatg tctggcatca gagactcgga cacttacatg 1440 agcgagcgct gaaacacatc acctcgaaaa atatagtgaa aggagttcaa ctcaacgata 1500 acgagatatt tttctgcgac gactgccaat ttggaaaagc acacaaactg gcgttcaagc 1560 aagaagtgca aagaaacttg gaaccaggtg agatgttcca ctcggacgtg tgtggcccca 1620 tgaaggagac gtcattagga ggagccagat atttcctgct cttcaaggac gatgcaactg 1680 ggtacagacg tgtgtttttc ttgaaacaca aggcagatgt gtacgacaac ttcgtgatct 1740 tcgagcgaga ggtgaacaat aaattcggca gaccaatgca gatactccga accgacaacg 1800 gcaaagagtt tgcaaacaag gagatgaaga agtatttgca gtctcgaggc atcaggcatg 1860 aattcaccgc accatatacc ccggagcaga acggaaaggt cgagagagaa aatcgcaccg 1920 ttgttgagtg tgcgcgaaca ctcatgtgcg ccaagaatct accagggttc ctgtgggcgg 1980 aagcggtgaa tacagcggtg tatttgctca acagaatctc aacgtcgaca cagaagagaa 2040 acaaaacgtc gtatgaaatg tgggaaaaga agatccctga cgtcagtcac tgccgaattt 2100 ggggctcaac aacattcaaa tacgagccaa agcagctcac gaccaagttt gattcacgat 2160 ctacgaagat ggtcctggtt ggatacgaca gcgagtcatc caactacagg ctgtacgatc 2220 cagagaagaa gaagatcact gtgtcacgac acgtgacatt caatgaaaat tctgatgaag 2280 gagaagcaac aaagatggtc gctgatgaga acatatttat gttcaatgat gaggacggta 2340 ctccagatcc accgcagaga gacagagcag cgagagaaga agatcccgaa gcagccgatg 2400 ttgacgacgc tcctgaaaat ggagaacaag cagaaaggag agagatatcg cccccacgtc 2460 gtccggttca gagagacaga agactgtgta aaaggccaaa caggtataca gctgatttct 2520 gtgctgtcgt tgtaccaaac tcttttcagg aagccgtaca ggggccaaac agtgccagtt 2580 ggagagaagc catagagaga gaactgactg ctcatcgcat caatggcact tggaggttgg 2640 taccgagaac accgggaatc aaaaccattg gttcgaaatg ggtgttcaag gccattcctg 2700 atggtgatgc ggggcatcac aagaagaagg ctcgactctg tgctttaggg tacatgcaaa 2760 aggaagggat tgattacacg gagacctttt cacctgtcgt caggtacgac tctctcagaa 2820 tgctgctgtc gatcatcgca cacgagaatc tggagatgcg ctcattcgac gtgagtgcgg 2880 catttcttca tggaaagctc aacgaggaaa tctacatgga ggtaccgcag cgaatcgacc 2940 acaatgaatt aaatactgga ggtaaagacg tagtgtgtaa gttagataaa gccttgtatg 3000 ggttaaagca agcacctcgg tgctggaaca acaggttcaa acaattttta gctaaattca 3060 attttcgcga atgcgatgcg gaccactgca tatttgttgc aatgtacgat aagtaccgag 3120 tatatttgtg tctctttgtc gatgatggat tgctagcatg caagtccgat gcggtgttgc 3180 aaatgatttt aactgaactg aaggcagaat tttctataac aacaggagac gccagttact 3240 ttgtcgggct tcagatcagt cgtaatcgtg cgagaaggag catatttata aatcagagtg 3300 tgtatgtgag agaaatcata gagagattta gaatgagtaa cgcgaaggca gtaagcgtgc 3360 cggccgaccc caatgtgatt ctgcaagcgg cagaagaggg tgaagctatg gatggaagcg 3420 tgccctatag agaagcggtc ggatgcttaa tgtttgtggc gatggtctca agaccagata 3480 taatgttcgc agtagggact ctgtcaaaat ttctcaatag acacaatgtc gctcactggc 3540 gcgcagtcaa gcgtgtattt gcatacttag gcggtagtgt agatttaggg atagaatata 3600 aatactattt agatcggtat gaaccagtgg ggtattctga cgcggactac gcgaatgatc 3660 cagacacgag aaggtcaacc actggctatg tgttttgtat atcaggtgga gcggttacat 3720 ggtcgtctca gagacagcgt ttagtcacgt taagtacaac ggaagctgaa tatgtcgccg 3780 cttcgtcagc agctaaagaa ctgtgctgga ttcgtaaaat gttacatgag atcagatatc 3840 aatgtaattc tgggtcaata ttgtgggtag acaaccaaag tgcgatcaaa atagcgaaaa 3900 atccagagta tcacaaacgc accaaacaca ttgatattcg atatcatcat attcgagaaa 3960 aatgtatgtc gggagaaatt gtaataaaat atataccgtc ggaaaatcaa aaagcagaca 4020 tactaaccaa agcgttgcct cgcgagagat ttgcgaaaat gagagagagt ttggggataa 4080 ttaaagatcc tgatatgtaa tagagcaggt atagaaacgg gaggagtat 4129 // ID DNA8-41_AP repbase; DNA; INV; 349 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-41_AP. XX NM DNA8-41_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-349 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1971-1971 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 349 BP; 120 A; 53 C; 67 G; 109 T; 0 other; cagaggttcc caaactgtgg gtcgcgaccc aaaagtgggt cgcgattata tttttggtgg 60 gtcgcgctat aatttgatat aatacatatt tttcgttgat gttcaaagtt caaatacagt 120 ttttcgttaa tcgttatctg cgagattgac gattacgagt ttacagcacc gtatcaaaca 180 gattaatagt tatacctatt gaaaagaaag cgatttatct gaaacaaatt agtgttagta 240 agtacctaaa tacctaaaaa aaaaaaaagg tcattcaatg tattgtgggt cgccaaattt 300 taaaaaattc tcaaaatggg tcgtacaata aaaagtttgg gaacctctg 349 // ID Gypsy-26_OD-I repbase; DNA; INV; 7746 BP. XX AC CABV01003036; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_OD_; KW Gypsy-26_OD-LTR; Gypsy-26_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-7746 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003036; Positions 27175 34920. XX CC Positions [3782-4249] - Integrase core CC 'AGGCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 87..1151 FT /product="Gypsy-26_OD-I_3p" FT /translation="MSKRKLEDIKSKFTQDTEDMTTILNTSVKNYNILSGS FT EEEKKIVARFKKTRLRILMSYLSKDFESRYEDELRGLTGSTDFFDLLCEII FT GTVSDADLVKEAKGKLADISRDTEEEETYTRFIKRVSNLAKIASKNVDILK FT NHYIEESWNRNLTPEIRRYLLDQGRNSDTPQKTAEFLDRMKKYKRKAEVSV FT VSARDIILQEQIDNLSNQFASFPDMLRDSLSSSISSIVQQQVQAMTSEIAD FT IHRITARKETIRKQENPGRNNNFTTNERQNFSHQSRDTQYNREPQKRPNYQ FT EGPSERRFPDHFELAPDGRPFRCTTCGVLGHQSRNCKGTNFSCRYCGEIGH FT LKFACPKKQSKN" FT CDS 5829..7721 FT /product="Gypsy-26_OD-I_4p" FT /translation="MKIEILLLFPLYRADDEFIFDRINGIVFRREDPVFTF FT DSEVPRIVDIMIASPRLQFREAFQADCADIDEGLDMMPRNFFNIDGSLPDL FT QENLQRLSKTCSSAFNIFDAMIFSLVNDDLIPTEIFNKTSSQPPLKKNRLR FT RDEPTAISRQRRALGAVIAGGAAAVGAVAIVATAGYAISVDVKSKERDELL FT ERKINEDRQRLASLTTVVELLDESIDEVAKMIRKSETPIITFAGIELGEDE FT KMREKLVDADPNTLNNFFASYSQRIGRETIRAIMLLSTQRMPLMAKFITAI FT RAQCLAIQDTENMDLARSFCLHFALHASRFDTSLRFAGLGFTKFNSTVDAS FT PNGFRIKEIIISLEIKIPRLRLQAERYSVANLGYFKNDGSRWKLKMPLQLI FT VMPSREVLKMNPSLCLKFNPSYACSITSLEPSTCGESLLTSNNTRLCETYQ FT ADSDKCGYMETHDRAFISMAKESKVNFFHHHPSKQLLKIDTFTKEQYGGAI FT DCGATVIKVNAGFEIERVTTRINYIAPINIKVTNIEEHRYLQLENRTHLAL FT KNNHNLKMTDEKLDIKISAVEAKVMDGIHRITGWISGLSSIVVALLVGLLL FT YKLRCCKREPTSRIMLANFQPPSTSSSRTDSSL" FT CDS join(1595..2518,2522..5401) FT /product="Gypsy-26_OD-I_1p" FT /translation="MAGIKLIHKTSTFSSTARLPQDTIIPPRKTISIKISL FT ESQPTSTICVTPEACLKNKDLQIYDQVVQLQCLADSATIQVANLSNLPLKL FT PKNIPLCRIDEVSIKQLDEPTSGKLKEIMEELNIGEIPTEIRQKLKTIISS FT FTDVFAIEGEQLGTTDAMSYNIDTGSAAPVASQRYKTPYYLRQELKRIINA FT NISSGLLKPCSSPWAAPVLLVKKSNGKWRLVCDYRKLNSVTISNQYPLPDI FT EGLIDQMSTSTVFSTADLFTGFHQIPCDEETQKKVAITTDFGQYTWTAMPM FT GGKNAPAVFQQMMDNKSIPSSELAIYLDDLCLHSKSYESNLTIIEKVLTTL FT RKNNLKIRASKTEFLKKRIKFSGAIIENGFRRPNPEKTKAVLELQHPRNAK FT EAQSIFGLLNYHRNFIPHFAEKAAPITLAYRKGFRWTSEAATALTTLKNEI FT SEKAQKLRIPTPNSGQYAIETDASNNGLGAVLLFRANGDDSYMPAAYLSHK FT FDEAQKNYNTYEKELLAGKKAMEKWSHYLLGRQFDWITDNSCVNWAHRIKA FT RKAKIAKWLAEIGDFDFRTVLKPSNQMVISDCLSRQFGDDATPVVNMISGK FT ELKNLQDNEHNLVEIGRYSALDRWPNLRSKKLDSYYKNRQKITKGPDGELG FT IKDGHFKVFPPEFLVEDILKEYHDKCGHPGISQTYEQIARKYFIPDLRKLV FT TDYLKTCDRCQRIKPCTNPLNAPLGHVKPPTQPFERFAIDLVGPLPISNHH FT NTYICVSTDLFSKKTYAQPMKNKRPDVVVGAAMSDWLRNPSLPQSLLMDNG FT CEFASLRRFCEEKGIKVFRSPAYHPQTNGECENRNRTIKSRLKLLSGFVNW FT DRHLPYVIHQMNSAKHSVTKLTPFEIEFGFDGESPNDPYRRLPKKQEIDLD FT QIRNKILDNHLQRRSENEEIHQFEVGQQVLLKNVDPRNKLEKVTGPYEIKS FT KSNQGLSFSLENLRTGNTVSRHISHLKQYTERITEQPEPEEDTVIEPNPVQ FT KRKLRSYSVITRNEVLASTTISSDSTNTQSERQEDISEDHDNTANNETVFE FT DALDLTINQNENANSDDSTACILTNQSDETINGDADPNEPETVSLDVVIDT FT QSSETESIASTRTVAPVIQRICDLPGKALDKFMSDMNIDTKTKTWLEMKTM FT KAKKQKKLEKIYAWIETNKPDWKKDEEGFYLVEHSAIILNSKCYLNQLTLP FT DLKVLTRHLGLEVDFDDRLKKDIVHEIKAKALQKYPAFKCTKSGNLIIDPS FT LLGILQN" XX SQ Sequence 7746 BP; 2649 A; 1814 C; 1519 G; 1764 T; 0 other; tggtgaacac gtggcgacat ccgattgaaa tcctccagaa ttttgaagaa aacttcagag 60 aatccgtacc aaaaataacg cttaaaatgt caaagagaaa acttgaagac ataaaatcaa 120 agtttaccca agacacagaa gatatgacga caattctgaa tacctccgtc aagaattaca 180 atattctttc gggctcggaa gaagaaaaaa agatcgtagc gcgattcaag aaaacgagat 240 tgcgcatttt gatgtcctat ctttcaaaag acttcgaaag cagatatgaa gatgagttaa 300 gagggcttac aggatcaacg gactttttcg acctcctttg cgaaataatt ggaacagttt 360 cagatgcaga ccttgtcaag gaagccaaag gcaaactcgc cgacatttca cgggatactg 420 aagaagaaga aacctatact cgattcataa aaagagtttc aaacctagct aaaatcgcta 480 gtaaaaatgt cgacatcctg aaaaaccact acatcgaaga gagctggaat cgaaatctga 540 ctcccgaaat cagacgatat ctacttgatc aaggaagaaa cagcgacaca ccgcaaaaga 600 cagccgagtt tcttgacaga atgaagaaat acaagcgtaa ggcagaagta agtgttgtaa 660 gcgcccgtga cattatcctg caagaacaaa tcgacaacct cagcaatcaa tttgcgagct 720 tcccggatat gcttcgtgac tctcttagct cgtcaatttc atctatcgtt caacaacaag 780 tacaagcaat gacgtctgaa atcgccgaca tccacagaat caccgcaaga aaggaaacaa 840 ttaggaaaca agaaaatccg ggaagaaaca ataattttac gacaaatgaa agacaaaatt 900 tctcacatca atcaagagac actcaataca atcgagaacc gcagaaaaga ccaaactacc 960 aggaaggtcc atctgaacgc cgtttcccgg atcacttcga gctagcccca gatggaagac 1020 ccttcaggtg cacgacttgt ggcgtccttg gacaccaatc gcgtaattgc aaaggaacaa 1080 acttctcttg ccgatactgt ggtgagattg gccatctgaa atttgcttgc cctaagaaac 1140 agtcaaaaaa ctaagaacgg ggacttttta tgtgtcaaaa aagacccctg ctcttatcgc 1200 aaaattatct gcttctattg gaccaaaact actcatcaac ggccggatct tcggaaaaac 1260 agtcaaattt gttttggaca caggtgctca agtttcctgt ctaccacgtc attacatacc 1320 tgccgacaaa ctcgcaagcc tcgctccagc cccgtttgaa ctccaaagtt ataatggtag 1380 ttccatcaat gtgtatggat cactgagctg ctccgttcaa tgcggaacaa ttttactgaa 1440 ggatgccctc tttcttgttg tcgacaatat ctgctctcca atcatcggaa ctccggaaat 1500 atcagaaaat aacagtgtca tcaacactcc agcattatct ctcgaagaaa atggaaataa 1560 cgcccgatta gaactctcct cagcatctca accgatggct ggcatcaagc tcatccacaa 1620 aacatcaaca ttctcttcaa cagcccgtct gccccaagat accatcattc caccgagaaa 1680 aacgatttcc attaagatct ctcttgaatc acaacccact tctactatct gtgttacacc 1740 agaagcatgt ctcaaaaata aagacctcca aatttacgat caggttgtcc aacttcaatg 1800 ccttgcagat tcagcaacaa tacaagtcgc aaatctctcc aatttgcccc ttaaacttcc 1860 aaagaatatc ccactatgca gaatcgacga agttagcata aaacaacttg atgagcccac 1920 ctcaggaaag ctcaaagaaa tcatggaaga acttaatatt ggggaaattc cgactgaaat 1980 tcgtcagaaa cttaaaacca taatctcaag cttcacagat gtctttgcaa tagaaggaga 2040 acaacttggc acaactgatg caatgagcta caatatcgac acaggatcag cagctcctgt 2100 tgcatctcaa agatacaaga cgccttacta tcttcgacaa gaactgaaac gaattataaa 2160 tgcgaatatc agtagcggtc ttcttaaacc atgctcgagt ccatgggcag caccagtgct 2220 tcttgtcaag aagtcgaatg gaaaatggcg tctcgtctgt gattacagga aattaaattc 2280 tgttacaata agcaatcaat accctcttcc agatatagag ggtctgattg atcagatgtc 2340 tacgtcaaca gtcttttcta ccgccgatct gttcacagga tttcaccaaa ttccctgtga 2400 cgaagaaact cagaaaaaag ttgcgattac aactgacttt gggcagtaca cttggaccgc 2460 aatgccaatg ggcgggaaaa atgccccagc tgtttttcaa caaatgatgg ataacaaatg 2520 aagtattcca agcagtgaac tggcaatata tctggatgat ctttgtctcc attcaaaatc 2580 atacgaaagc aatctgacaa tcattgaaaa ggttttgacg acactaagaa aaaacaacct 2640 gaaaattcga gcatcaaaga cggagttcct aaaaaagaga ataaaattca gtggcgcaat 2700 aatcgagaat ggctttcgaa ggccgaatcc tgagaaaaca aaggcagtct tggaactgca 2760 acatcctcga aatgccaaag aagcgcaaag tattttcgga ttactaaact atcacagaaa 2820 ctttatacct cacttcgccg aaaaagcagc tccgatcaca cttgcataca gaaaaggttt 2880 cagatggact tctgaggcag caacagcact taccactcta aagaatgaaa tttctgaaaa 2940 agcacaaaaa ctacgaatcc caacgccaaa cagtggacaa tatgcaatag aaactgatgc 3000 cagtaataat ggactcgggg ctgttcttct ttttcgggca aacggtgatg atagctacat 3060 gccagcagct tacttgtctc acaaatttga tgaagcgcaa aagaactaca atacttatga 3120 aaaggagctt ctggcgggga aaaaagctat ggaaaagtgg agccattatc ttctcgggcg 3180 tcaattcgac tggataaccg acaattcttg tgtcaactgg gcacatagaa tcaaggcaag 3240 aaaagctaag atcgccaaat ggttagccga aatcggagac tttgacttta gaacagtact 3300 taaaccgtcc aaccaaatgg tcatctcaga ttgcctctcc cgtcagtttg gagatgatgc 3360 aactcctgtg gtcaatatga tctccggcaa agaactaaag aaccttcaag ataacgagca 3420 caaccttgtg gaaatcgggc gatactcggc actcgataga tggccaaatc tacgttcgaa 3480 gaaattggac agctactaca agaacagaca aaaaatcacg aagggaccgg acggggagct 3540 tggaatcaaa gatgggcact tcaaggtttt cccaccagaa ttcttggtcg aagatattct 3600 caaagagtac cacgacaagt gtggtcaccc gggtatttca cagacatatg agcaaatcgc 3660 acgcaagtac ttcattccgg acctaaggaa gctcgttacc gactatctga aaacctgcga 3720 tagatgtcaa agaataaaac cctgtacaaa cccacttaac gcgcccttag gacatgtcaa 3780 gccgcccaca cagccttttg aacgatttgc tatagatctt gtgggcccac tcccaatcag 3840 caatcatcac aatacctata tctgcgtgag tactgattta ttcagcaaga agacttatgc 3900 acaacctatg aaaaacaaga gaccagacgt agtagttgga gcggctatga gcgattggct 3960 acgaaatcca agtcttccgc aatcgcttct tatggacaac ggctgtgaat tcgctagctt 4020 acgtagattc tgcgaagaaa aaggcataaa ggtttttcgg tcaccggcgt accaccctca 4080 gacaaacggg gaatgtgaga atagaaacag gacaatcaag tcaagactga aacttctctc 4140 tggatttgtt aactgggata gacatcttcc atacgttata caccaaatga actcagcaaa 4200 gcacagcgtc acaaagctca ctcccttcga gatagaattc ggattcgacg gagaaagccc 4260 gaatgatccc tacagaagac tgccgaagaa gcaagaaatt gatctggatc aaatcaggaa 4320 taaaattctg gataatcatt tacaaagaag atcagaaaac gaggagattc atcagtttga 4380 ggtcggccag caagttcttc tgaagaacgt tgatccgaga aacaagcttg aaaaagttac 4440 tggcccttac gaaatcaaat caaaaagtaa tcagggtttg agcttttcac tcgagaatct 4500 aagaaccgga aacaccgtca gccgacacat tagtcatcta aagcagtata cagaacgaat 4560 aacagagcaa ccggagccag aagaagacac agtcatagaa cccaaccctg ttcagaagag 4620 aaagttaaga tcatactcag tgatcacgcg taacgaagtt ctggcatcaa ccactatttc 4680 atctgactca acgaacacgc aatctgagag gcaagaagac ataagcgaag accatgataa 4740 cacagcgaat aacgaaaccg tatttgaaga cgcgcttgac ttaacaatca atcaaaatga 4800 aaatgctaat tcagatgatt ccacggcttg cattttaaca aaccaaagtg acgaaacaat 4860 caacggagac gccgacccca atgagcccga aacagtcagc ttagatgtcg taattgatac 4920 ccagtcatcc gagacggaaa gtattgcaag cacaagaacc gtcgctccgg ttatccaaag 4980 aatttgtgat ttgcctggaa aagcacttga caagtttatg tcggacatga acattgatac 5040 aaaaacgaag acatggctgg agatgaaaac catgaaggca aagaaacaga aaaagctcga 5100 gaagatttac gcctggatag aaacaaacaa acctgactgg aagaaagatg aagaaggctt 5160 ttacctcgtc gagcattccg caatcatttt gaactccaaa tgctacctga atcaactgac 5220 actgccggat ttaaaggttc taaccagaca tctgggtcta gaggttgact tcgatgaccg 5280 actaaaaaaa gacattgtcc acgagatcaa ggcaaaagcg cttcagaaat acccggcgtt 5340 caaatgcaca aaatcaggaa atcttatcat tgatccgtct ctgcttggta ttctgcaaaa 5400 ctgaattcga aaaaattttg aattcggccc gaccgcgaat tcgaaaaaat tttgaattcg 5460 atttgaaccg cccgcaaccg acaaaaatat tttggagaag tccgtaaaag tcaccatgtt 5520 ttccctatgg tcgctttttg atgatagaaa atctaaattt tctcgaaatt ttgattttct 5580 tcaaatccaa aaatccttga actccgcaac aacccaaatc gaaacacaaa aatggtctga 5640 gcaaaacggt tacgcggtta gcttgcccta atttccgtca cattgccatc aaacaatcaa 5700 ggcaatatga cacagttctg cagcacgtga cgcgacagct gcgcgtcagg catgacggag 5760 cacccgcggc gagacgacat ataaacggac cagaacagac agaagtaagt cagaagacaa 5820 aatacgaaat gaaaatcgaa attcttcttc tttttccgct ctacagagcc gacgacgagt 5880 ttattttcga cagaataaac ggaattgttt tccgaagaga agatccagta ttcaccttcg 5940 actcagaagt tccgagaatc gtcgacatca tgatcgccag tcctcgactc cagttccgcg 6000 aagcatttca agcggactgt gcggatattg acgaaggact tgatatgatg ccgaggaatt 6060 tctttaatat agacggaagt ctcccagacc ttcaagaaaa tctacagaga ctttcaaaga 6120 cgtgctcatc tgcattcaac attttcgacg ctatgatctt ttcccttgtt aatgacgacc 6180 tcattccaac cgaaattttc aacaaaacct cttcgcaacc cccactgaag aaaaatcgcc 6240 ttcgccgcga cgaaccaacg gcgatctcac gacaacgacg agccctcgga gcagtcatag 6300 ctggaggagc tgccgctgtt ggagctgttg ctattgtagc aactgcggga tatgcgatat 6360 ctgtcgatgt gaagtcaaaa gagcgtgacg aactcttaga gagaaagatt aatgaagacc 6420 gacaaagact tgcaagtttg acaacagttg tcgaactact cgacgaaagc attgatgaag 6480 tagcaaaaat gatccgaaag agcgaaactc caatcatcac ctttgctgga atcgaacttg 6540 gagaagacga gaagatgcga gaaaaattgg tcgacgcgga cccgaacaca ctgaacaatt 6600 tcttcgcaag ctactctcaa agaataggta gagaaacaat tcgtgcaatc atgctgctat 6660 caactcaacg aatgcctctt atggcaaaat ttatcacggc aattcgagcc caatgcttgg 6720 ctatccaaga tacggaaaac atggacctcg ccagatcttt ctgtcttcac ttcgcacttc 6780 atgcatccag attcgatact tcattgcgat tcgctggcct tggcttcacg aaattcaatt 6840 caacggtcga tgccagcccg aacggattcc ggataaaaga aataatcatt tcgcttgaaa 6900 taaaaattcc ccgactgcgt ctccaagccg aaagatactc agtggcaaac cttggatact 6960 tcaagaacga cggctctaga tggaaactca aaatgcccct ccagctcatc gtcatgcctt 7020 ctcgagaagt acttaagatg aatccaagcc tctgtttgaa gttcaaccca tcctacgctt 7080 gcagtatcac gtctctcgaa ccctccacct gtggagaatc tcttctgaca tcaaataaca 7140 caagactatg cgaaacttat caagcagatt ccgacaaatg tggctacatg gaaactcatg 7200 atcgcgcttt tatttcaatg gcaaaagaaa gcaaagtcaa cttcttccat caccatccaa 7260 gcaaacaact tctaaaaatc gacaccttta caaaagaaca atacggcgga gcaatcgatt 7320 gtggcgctac cgtaataaaa gtcaatgccg gattcgaaat agaaagagta acaacaagaa 7380 taaactacat cgcaccaatc aacatcaaag taacaaatat tgaagaacac cgctatcttc 7440 aacttgaaaa cagaactcat ctggccctca aaaacaacca caatctcaaa atgacggatg 7500 aaaagctaga tataaagatt tcagcggtgg aagccaaagt aatggacggc attcaccgta 7560 ttacaggatg gatctccggc cttagcagca tcgtagtggc tctcctggtt ggccttctcc 7620 tgtacaaact ccgctgctgc aaaagagaac caacgagccg gattatgctg gcaaattttc 7680 aaccaccgtc tacttcttca agcagaacag attcctctct ctgaaacccg ccgactgacg 7740 atggtc 7746 // ID P-5_HM repbase; DNA; INV; 3230 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3230 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 351-351 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 243..2789 FT /product="P-5_HM_1p" FT /translation="MPNKCCVYACHTNYQSEKNKISSEVNKISVYRFPSDN FT VEREKWIKAIPNSNLVVTKYTVICELHWPPNFDTVIVRGGKRRPKNPPSVW FT LGIPLSQIPTGYPTERSTKRSLSGIRSTIDDELDTFLKRDKATFCDIKNKF FT IDSSDHQYPVISFMVDGYVVIQSLQFFNGIPFFVLKIYENLTFETFHYGIK FT CHISSLTVNRILTVDSWSKFDEILRFLSSLKIDNKKEVIMQHVSVMGPKLV FT GDKLYTPEILIRSFVYFLTSRALYNRLRNDYQLPSISTLTRITSKISNINE FT NKFIYSIFNSLPDKQKLCIILHDEIYVKKSLLYHGGTIFGRSEDNPVELSK FT TLLGIMAVCLNGGPKVLIKMIPISKLRSDFLFEQINLTSQIIDSVSANVKA FT IICDGNRVNQAFFKMYTFIPGKPWLTVDGKYLLYDFVHLIKNIRNLWLTEK FT TQELIFDDNGVTRIAKWTHLKQLYNAESRSFLKLSDLNETSISPKPVERQC FT VLTVLRVFSEKTYAALLQHPDMIHIDVNDTAIFINKVIIWWKILNVKAIGA FT DTRHNDPLQAVINNPDDNRLNLILQFGDMALKMAGPKGKRIKQLSKDTATC FT IHQTCYGLVDLCRYLLNTTQNYVLLGQFTSDHLEKEYSILRQGSGGTYFLS FT VQQVIEKLRIKHASLLLKLNVDIDNFNVKSGHQCALCDYKLSEDCCEILDN FT LQDLESSIADDVKMSLIHIAGYVTRNDKERTDYELLDQTTFYYQKYGQFSK FT FLDRGGLKIPTDNSCQWTFFCFVVFQIVKDHVCRKSLSSIFMIISDYYLLD FT MEEHHCVILSNIFLNKFMYHCHSKIWQRTCTKIIKNLLKVYLVVYCL" XX SQ Sequence 3230 BP; 1115 A; 453 C; 500 G; 1162 T; 0 other; catagcgttc tctactacag gcctattcat cgcgcaaccg gatttttttg tggaggccta 60 cttttttacg gcgaaaaaat tatttctcca cgttcagacc agatacaatg tcatgtcgtt 120 tatgacaaat atttaaataa tggaatataa tacaatgtta ataaacttat ataatttaaa 180 catggagttg cagatataat ttaaagcaaa taatatattt ataaaattta gattaaatca 240 tcatgccaaa taaatgttgt gtgtatgctt gtcacaccaa ctatcaaagt gaaaaaaaca 300 aaatatctag tgaagttaat aaaatatcag tgtatcggtt tccaagtgat aacgttgagc 360 gtgaaaaatg gataaaagct attccaaact ctaatttggt tgttaccaag tatactgtta 420 tctgtgagtt acactggcct ccaaattttg atacagttat agtgcgtgga ggaaaacgtc 480 gaccaaaaaa tcctccttct gtttggctag gaataccttt atctcaaatt ccaactggtt 540 atccaactga aagatctacc aaaagatctt taagtggcat tagaagtaca atagatgatg 600 agctagatac atttcttaaa agggataaag ctactttttg tgacattaaa aacaagttta 660 ttgatagcag tgatcaccaa tatccagtta tatcgtttat ggttgatggc tatgttgtta 720 tacaatcttt gcaatttttt aatggaatac ctttttttgt acttaaaatt tatgaaaatc 780 ttacctttga aacatttcat tatgggatta agtgtcacat ttctagctta acagtaaatc 840 gaatcttaac tgttgactcc tggtctaagt ttgatgagat acttcgtttt ttaagctctt 900 tgaaaattga taataaaaaa gaagtaatta tgcaacatgt ttcagttatg ggaccaaaat 960 tagttggaga taaactttat acacctgaaa tacttatacg ctcttttgta tattttttaa 1020 catcaagggc attatataat cgcttaagaa atgattatca acttccatct atttcaacac 1080 tgactcgaat tacttcaaaa atttctaata ttaatgaaaa caaatttatt tattcaattt 1140 ttaactctct tccggataag cagaaattat gtatcatact acatgatgaa atatatgtta 1200 aaaaaagttt gctttaccat ggtggaacta tttttggtag atcggaggat aacccagtgg 1260 aattgtccaa aacattgtta gggattatgg ctgtttgttt aaatgggggt ccaaaggtgt 1320 taattaaaat gataccaata tcaaaattac gatctgattt tttgtttgaa cagattaatt 1380 taactagtca aataattgat tcagtcagtg caaatgtaaa agctattata tgtgatggga 1440 atcgtgtcaa tcaggctttt tttaaaatgt acacattcat tccgggaaaa ccatggttaa 1500 cagtagatgg aaagtattta ctttatgatt ttgtacacct cataaaaaat attcgtaatc 1560 tttggttaac tgaaaaaact caggaattaa tatttgacga taacggagta accaggattg 1620 caaaatggac tcatttaaag cagctttaca atgcagaatc cagaagcttt ctaaagttat 1680 ctgacctgaa tgaaacttct atttctccta aacctgttga acgacagtgt gtattaacag 1740 ttttaagagt attttcagaa aaaacttatg ctgctctttt gcaacatcct gacatgattc 1800 atattgatgt taacgacact gctatattta ttaataaagt tattatttgg tggaaaattt 1860 taaatgtcaa agccattggt gctgatacaa ggcataatga tcctttgcag gctgttatca 1920 acaatcctga tgacaaccgt ttaaatttaa tactacaatt tggtgatatg gcacttaaaa 1980 tggcaggtcc caaaggtaag cgtattaaac agctttctaa agatactgca acttgtatac 2040 atcaaacttg ttatggacta gtagatttat gtagatattt acttaacact acgcaaaact 2100 atgtacttct tggtcaattt acttcagatc acctggaaaa agaatatagt atacttcgtc 2160 aaggttctgg tggtacatat tttcttagtg ttcaacaagt tattgaaaaa ttacgtatca 2220 aacatgcatc tttacttttg aagttaaatg tagatattga caattttaat gtgaagtctg 2280 gtcatcaatg tgcgttatgt gattacaaat tgtctgaaga ctgctgcgaa attcttgata 2340 acttgcaaga tctagaatct tctattgctg atgatgtcaa aatgtcactt attcacattg 2400 ctggttatgt tacacgaaat gataaagaac gaactgatta tgaactttta gatcaaacaa 2460 cgttctatta tcagaaatat ggtcagtttt ctaaattttt ggatcgtggt ggattaaaaa 2520 ttccaactga taattcatgc caatggacat ttttttgttt tgtagttttt caaatagtta 2580 aagatcatgt atgccgaaaa tctttaagta gtatttttat gattatatca gactattatt 2640 tattggatat ggaagaacat cattgtgtaa tattgagcaa cattttttta aataaattta 2700 tgtaccactg tcactccaag atctggcaaa gaacctgcac taaaattatt aaaaatctct 2760 taaaagtcta cttagttgtc tactgtttat aatatattca gtatactttt ttataaaaga 2820 tttataaaag tttcatatat tagttgttgt ttttttcaga ttttgttaac aattttaatc 2880 aaaatttatc taatattttt tgtttggttt aataacattt ctactcaata taagaaaata 2940 tcaattttgt atttagctgt taaagccaaa actaaaggtg aaataatata attatgtaaa 3000 tgtgtgaaat ttgaaattag aaaaaatttc tttgcctttt atttacattg tttattactt 3060 aatattaatt ttcttttttc atcaaaaaaa attattaggc tatacagaga attaaaatta 3120 ctttttagtt ttaaaattac gaagttatta tcttttttgc cgtaaaaaag taggcctcca 3180 caaaaaaatc cggttgcgcg atgaataggc ctgtagtaga gaacgctatg 3230 // ID Gypsy2-LTR_DV repbase; DNA; INV; 701 BP. XX AC scaffold_13324; XX DT 15-OCT-2009 (Rel. 14.12, Created) DT 15-OCT-2009 (Rel. 14.12, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_DV; KW Gypsy2-I_DV; Gypsy2-LTR_DV. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-701 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(12), 3097-3097 (2009). XX DR Genome; scaffold_13324; Positions 21593 20893. XX SQ Sequence 701 BP; 191 A; 147 C; 187 G; 176 T; 0 other; tgcaagaagg aaaagcggtt acggctggtt ttcgttgcgt tgcaagaagc aaaagcggtt 60 acggctggtt ttcgttgcgt tgcaagaagc aaaagcggtt acggctgggt ttcgttactt 120 ttcgttgcgt tgcgaggcaa aggattgatg tggacagaca gaagtaaatg atctgagcga 180 acggacataa ttgtgcactg tgaagtttag aaaagagaag caaatgccac caccaaagat 240 gccgggactc gctgcccaaa gtgtaaaaaa ttctgaccct acggccgctt cgacatcaga 300 agtggaattg gatcctccta actctcagtt gtgtcaaatt ttggagcaac aaaatagaag 360 cgcagcgcct cgtgagtcaa agacggtgtt tttacccaaa tacaaccccg aagctaccgg 420 tgctgatgca tgtgcttgga gtattacggc cgatataatt tttattgaaa atccacttga 480 gggttgtgcg ctcgtgattg ccctcagtaa agctttggag ggcagtgcat cgcaatggct 540 ttcacaaatc tgttttcctg gcaagacttg ggttgagttc aagcaacttt ttacgcaaag 600 atttgtcggc gtagagacaa ccacagcaac atttcttaag ctgctagacg gacgcccagg 660 gactggagag tgcatgtcgt cgtatgcaag ccgactcgtc a 701 // ID BEL-78_CQ-I repbase; DNA; INV; 3014 BP. XX AC AAWU01022194; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-78_CQ_; KW BEL-78_CQ-LTR; BEL-78_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3014 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 297-297 (2011). XX DR GenBank; AAWU01022194; Positions 14368 17381. XX CC 'ACCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 465..2981 FT /product="BEL-78_CQ-I_1p" FT /translation="MWRSFLGFVVPGENVELKTESESDPEDRKPVVEQLLL FT TPKQRYAEKKRKMADEEVKALLNRRGHVKGKLTRIKTALAQPVISAAQFEV FT YKANIEKYYGEYDDLNNKILDSKLTEEQASENDARYLEYEHLYDEVLVKLR FT ELTAPLERNQTVVRAGQAGQQIIVQQQPLSVPLPTFDGQYENWPKFKAIFT FT DLMQKSLDSDAIKLYHLEKSLVDAASGSIDAQTIKDNNYQLAWQTLEERYE FT NQRLIIDLHISGILQLKQMTKKSSKELRNLIEECARHVENLRFHKQELLGV FT SELVVIHILSSALDGETRELWEASIQRKQLPKYQETMDFLKKRCDILERCE FT SAASATSAVQLVLPETSSKPTTRIASAAMTSNDVECELCGGSHANFRCGSF FT RGMSVAERWAKIRDERMCFNCLRKGHRVESCPSERTCKCGQKHNSLLHYER FT TQQTPRNNPGETPALSVPVEALPQVPTPRGEAWTSGTGPVVSDADDEQQTT FT SCFSSGAQKSSRRVLLQTAVINVVDASGRFHPCRTLLDSGSQAHILSEAMA FT QKLGLPFEKCNVTVIGANAAKTQVKKGVTLSFASRYVNFSDKISCLVTEKP FT TGRIPSEKIDISAWRVPRGLQLADPHFNEPHEIDLVMGSDYVWELLRSEEY FT KLANGTVTLRETDLGWIVTGSYDSSPQLSCQVVHVLNAATQRTERVPFHGA FT RRLVQQAESVKDRFSTATKIAKKEPSGTSSKEEAVDQLGEKTFLHEEDPVK FT KCCFNSKQVLKCKEEKVIEPKRSALRDNDAGLPRVGGRPVSFPHGVAYAGA FT VTANRNPPRKERAGARRPVGNPANRKQRPASGGLRFF" XX SQ Sequence 3014 BP; 771 A; 758 C; 909 G; 576 T; 0 other; tttggtcctt tcgacccgaa tagaccggcg gaagtcggaa cgtcgcgagt gtcttgtgga 60 gtgaagtgca cggcaagaaa agaagaccgt gaactgatcc ggaagacgtc attcccggag 120 ggcgtttcac gtggccttga acagcgtgaa gctacgcgcg tgtgggagaa ccagcgacgc 180 ggagtgctcc ggatgtggtc ccattggccg ccggaagcag aaagtgagcc gtaagtggtc 240 ccattggcca ccggcagtga aaagtgaacc ggagtggtcc cattggccac cggtagtgaa 300 aagtggtccc attggccacg gcagaagaag tgcgcgcaaa aggtggtccc attggccatc 360 cgagcagcag agaaaagtgc gtttgtgtgt gtgtgcgaca acaagaagcg tgcgtcgcca 420 tcttggcaca tgaataggag aaaaaagtga agtgaaatcg gcggatgtgg cgtagttttt 480 tgggattcgt ggtcccggga gaaaacgtcg agctgaaaac cgaatcggaa tccgatccag 540 aagatcgcaa gcccgtggtc gaacaacttc tgctcactcc gaagcaacgg tacgcggaga 600 aaaagcggaa aatggcagac gaagaggtta aagcgttgct gaaccggcga gggcacgtta 660 agggcaagct aacccggatc aagactgccc tagcccagcc agtgatatcg gcagcgcagt 720 tcgaggtcta caaggctaac atcgagaagt actacggtga gtacgatgac ctgaacaata 780 agattttgga ttccaaactg acggaggaac aggcgagcga gaatgatgct cggtatttgg 840 agtacgagca tttgtacgac gaggtgttgg taaagctcag ggagctcacg gcaccactgg 900 aacgcaacca gacggttgtc cgcgctggac aagccggcca gcagatcatc gttcaacagc 960 aaccactgag tgtacccctg ccgacgtttg acgggcagta cgagaactgg cccaagttta 1020 aagcgatttt caccgatctg atgcagaaat cgttagactc ggacgccata aagctgtacc 1080 acctggagaa gtcgctcgtt gatgcggctt cagggagcat cgatgcccaa acgataaagg 1140 acaacaacta ccagcttgcg tggcagactc ttgaggagcg ctacgaaaat caacggctga 1200 tcatagacct ccacatcagc ggaattctgc agctgaagca gatgacgaag aagtcgtcga 1260 aagaactgcg gaaccttatt gaggagtgcg ccagacacgt tgagaacttg cggttccaca 1320 agcaggaact acttggtgtc tcggagctgg tcgtgatcca cattctgtcc tctgccctgg 1380 acggagaaac gcgagagttg tgggaggctt ccattcagcg aaagcagctc cctaagtatc 1440 aggagaccat ggacttcctg aagaagcggt gcgacatact ggaacggtgt gagtctgccg 1500 cttccgcaac gtctgctgtc cagctggttc ttccagagac gtcatcgaaa ccaacgacga 1560 ggatcgcctc ggcagccatg acttccaacg acgtggaatg tgagctgtgt ggtggatcgc 1620 acgccaactt tagatgtggc tcgttccgtg gcatgagtgt cgcggagaga tgggccaaga 1680 tccgtgacga gcggatgtgc ttcaactgcc tacgcaaggg tcatcgagtg gaaagctgtc 1740 catcagagcg tacctgcaag tgcggccaga agcacaacag tttgctgcat tatgagcgga 1800 ctcagcagac cccacggaat aatcccggag aaaccccagc tctctccgtg ccggtcgagg 1860 ctcttccaca agtaccgacc ccccgtggtg aggcttggac aagtggaaca ggtcctgttg 1920 tctctgacgc cgacgacgag caacagacaa cgtcttgttt cagcagcgga gcacagaagt 1980 cgtcccggag agtacttctg cagactgctg ttatcaacgt tgtggacgca tccggtagat 2040 tccatccgtg tcgcacgttg ctggattccg gctcacaagc gcacattctg tcggaggcga 2100 tggcgcagaa actcggccta cccttcgaga agtgcaacgt tacggtcatc ggagctaacg 2160 ctgcaaagac gcaagtgaaa aagggtgtta ccttgagttt tgcgtccagg tacgttaact 2220 tcagcgacaa aatttcgtgt cttgtcaccg agaaaccgac gggacggatc ccgtctgaga 2280 agatcgacat ctctgcctgg cgcgtcccac ggggattgca gctggcagat ccacacttca 2340 acgagccgca cgagatcgat ctggtgatgg gatcagatta cgtgtgggaa ttgttgcgat 2400 cagaagagta caagcttgca aacggcacgg tgacattgcg agagaccgat ctgggctgga 2460 tcgtgaccgg ttcgtacgac tcatctccgc agttgagttg ccaagtcgtc cacgtgctga 2520 acgccgctac gcaaagaacg gaacgtgtcc cgttccatgg agcgcgacgt ctggtgcagc 2580 aagcagagtc ggtgaaagac cggttctcaa ctgccaccaa gatcgcgaag aaggaaccgt 2640 ccggaaccag ctccaaggaa gaagcagtgg atcaactggg agagaagacg ttcctgcatg 2700 aagaagatcc agtaaaaaag tgctgcttca attcgaagca agtgttgaag tgcaaggaag 2760 agaaagtgat tgaacccaaa agaagtgccc tgagagacaa cgacgccggc ctgccgagag 2820 tcggtggacg tccggtgagc tttccgcacg gcgtggctta cgctggtgct gtgacggcga 2880 accgcaaccc acccaggaag gagcgcgctg gtgcgcgtag gcctgttggc aaccctgcca 2940 accgaaaaca gcgacctgct tcaggaggac ttcggttttt ctgagttgcc ggagtttgca 3000 gccgcggggg agaa 3014 // ID TWIN repbase; DNA; INV; 222 BP. XX AC . XX DT 23-OCT-2000 (Rel. 10.06, Created) DT 08-JUL-2005 (Rel. 10.06, Last updated, Version 2) XX DE Twin SINE retroposons from the Culex pipiens mosquito - a DE consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; TWIN. XX NM TWIN. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-222 RA Feschotte C.; RT "TWIN."; RL Direct Submission to Repbase Update (18-OCT-2000). XX RN [2] RP 1-222 RA Feschotte C., Fourrier N., Desmons I. and Mouches C.; RT "Birth of a retroposon: the Twin SINE family from the vector RT mosquito Culex pipiens may have originated from a dimeric tRNA RT precursor."; RL Mol Biol Evol 18(1), 74-84 (2001). XX DR [1] (Consensus) XX CC Twins define a new type of tRNA-derived SINEs. These ~220-bp CC elements consist of two tRNA(Arg)-related regions separated by a CC 39-bp spacer. CC Other tRNA-unrelated sequences include a 5-bp 5' leader and a CC short 3' trailer which is followed by a polyA tract of variable CC length (0-14 bp). CC According to this structure, we propose that the Twin SINE family CC has originated by retroposition of a dimeric tRNA precursor. The CC estimated copy number of Twins in the C. pipiens genome is ~500. CC The consensus sequence is derived from a multiple alignment of 6 CC Twin copies. Pairwise identity between these copies ranges from CC 83% to 96%. CC There are no obvious target site duplications flanking these CC copies. XX SQ Sequence 222 BP; 48 A; 45 C; 70 G; 59 T; 0 other; gccgagcttc cgtggccgtg aggttacggg tttcgccttg taagcggaag gtgatgggtt 60 cgattcctgt ctggctcggc aaagtcagat cccttcaaag agtaaatatg ctcactggga 120 atactgaccg gtaggggatg ggtttcgact agcggcgtgc tggatttcca atccagaggt 180 cgtgagttcg attctcgtac cgggatgatg aagttttaaa aa 222 // ID Gypsy-77_AA-LTR repbase; DNA; INV; 603 BP. XX AC supercont1.255; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-77_AA_; KW Gypsy-77_AA-I; Gypsy-77_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-603 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.255; Positions 959433 958831. XX SQ Sequence 603 BP; 183 A; 128 C; 99 G; 193 T; 0 other; tgtagtagaa tatttcccat gttttatcct gagaaacagt catctcagat ttactaattt 60 acgagcagag aaacacagct ttctataaca tccataaact caaactcatg ttgtcatatg 120 attccgcatg acaacagaac attcagaact tctcataaat aaccgtgacc gtaataaccg 180 tcttccgtac atcaagtgcc ttccgtactg gttgtaccca aagacatcaa ttttggatat 240 ttttttttac cattcgtttt cctttacgca tgcataataa tgcaggccaa gaaacaatta 300 atacttttca cagcgttgac aaaaatgcca aatggcctag atctttcctt tacgttttcg 360 gctaccttcg tcttttacga cgactgtgtt atgcttagcg agttaccacc actcatgcta 420 tttgaacgtg attggttaac tctgcgtgga gactcttagc ccacccctag aatagttaaa 480 tcatgtgtag agttaagtta cgtagaagta gatttggaaa ataaagtcag atcgtttcgg 540 actgttacac gaacctgagt gtttcttgaa gatatcacag ttcatgatat caaaactatt 600 aca 603 // ID hATm-16_HM repbase; DNA; INV; 3691 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3691 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1910-1910 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(776..1474,1539..2411,2457..2837,2881..3270) FT /product="hATm-16_HM_1p" FT /translation="MPKVKVAITRMKSEKKYCGGPCELSTTDPSTFRQVIQ FT YYYYLIHINPNADILLVTQQISNNIKSIWATVNPRLPLITDKSINRKVKDL FT LVLVKDINRKHGKASAKRNLDANLDKLFDISACSCSLDVFPCSDRKVSCNK FT ENCIEEHIVCLCSQSNKVPLEDRAYLRDQRKKIGPKGAYQLSSVDRLSVKR FT IQRLDKERQKISHKLVNEDESLIAQVNLIEETSGLTNTEVNILDEGEWKPV FT NNPIGKYNLVKLPRFAMELVRNECSSNVGAALANALLHDIKHLLKNDVDIK FT DILIDKCKLDRAKSKVKIVSEEVDSEQKKQLICLGVDGKLDQITKTSIXIN FT LPQGEVLKKCIEPEHHLTFTYEDGKSSGNYLTHRTIPVTGATGLVLATETF FT SVLQEHNSLESIQAVLLDNTATNTGPISGLVVKLEEFLKRKLHLIGCALHQ FT NELPLRALFVKLDGDTTGPRSFSGLLGKRCAENIQDKNQVLFNRIENPIVD FT GYIPEEILVDLSCDQRLLFEYCKGIGFWPCKVVTRKVTLAIRLLCLYTRDN FT QPTVSLKKLVCFIVQVYAPAWFDIKKSSKFHESPRLIFSTITRINNLPFDD FT VKYIVKKNIKNNALCLKPENILYAMLKDNDSKIRNFGFQILLGLRQRYFFI FT VNLDLIYIYRVNNESLEVNLKKIPEINFNANHWYELVDISITKFTEPPTTQ FT HFSIEQIQYAIDNNVKPEIPDFPSHSQSVERAVKLVSDASQYVYGFENRHS FT CILTKLLSRKMRKPYISKGHYSHSYDDIF*" XX SQ Sequence 3691 BP; 1317 A; 548 C; 592 G; 1232 T; 2 other; ttagggtagg tcgattttag cttttttttt gaattttgag ttgcgtaatg ttgcagttac 60 tgtgtctagg ttagaaatga ggttttacca aatctcaagt atcctggttt aaaaatggct 120 tgcggatttg cattttaagt cttaattttg gactagtttt gttagattgt ttatttcaaa 180 cgttgcaaaa ttttatttta agtctagtaa tctcgtttta aaatatctcg ttttgagttt 240 ataaagtaac caataaacaa gcgcaattgt tttaagaagc caataaagtt tcgctaaatc 300 gtttacacgc atttaaactt agcaaatctt ttttgtattt awatttcttg tttgtatttt 360 aaagttagtt ttaaagttaa atttattaaa aagtaggttc aaaaataaaa ctgcaattta 420 aagaaatctt ttaatatcaa aggtaaatac ttatggttat tatatctaga gtaatgtttt 480 aaactgttta tattacataa cttttgattt gcaatttctg aagtagatta aaatataaaa 540 cttttcgatg gaactaatgg ttttactcaa ttcctgagtt tcctgaattc atggccccca 600 aatcttttca aaagagtttt atttctttat tttttatcaa agcaggtttt taatagtaag 660 ttttaatatt ttaggttttt ttgatgcttt ggcccccact aaaaactcac gctgttgccc 720 attactctaa gtgtaatact ttaatacagt ttacatatat aattaattta caaaaatgcc 780 gaaagtaaag gttgcaatta cacggatgaa aagtgaaaaa aaatattgtg gtggcccatg 840 tgaactatca acgacagatc caagcacttt tagacaggtt attcagtatt attactatct 900 tattcatata aatccaaatg cagatatctt attagtgact cagcaaatta gcaataatat 960 aaaatcaatt tgggctacag taaacccaag attacctctt ataactgaca aatctattaa 1020 taggaaagtt aaagatcttc tcgtacttgt taaggatatt aatcgcaagc acggaaaagc 1080 aagcgctaaa agaaatcttg atgcaaactt ggacaaactt tttgatattt ctgcgtgttc 1140 ctgttcatta gatgtgtttc cctgtagtga cagaaaagtt tcatgtaata aagaaaattg 1200 catcgaagaa catattgtat gcctttgttc tcaaagtaat aaagtacctc tggaagacag 1260 agcttatctt cgtgatcaga ggaaaaaaat tggtccaaaa ggtgcatatc aactttcttc 1320 cgtcgataga ttatctgtca aaaggattca aagacttgat aaagaacgac aaaaaatatc 1380 tcataagcta gttaatgaag atgaatccct catagcacaa gtaaacttaa ttgaagaaac 1440 aagtggtttg accaatacgg aggtaaatat tttataaata catgaaatat ttgtaacaga 1500 tgaaacagaa agcgttttat tttattaact ctttttagga tgaaggtgaa tggaaaccag 1560 taaataatcc tattggaaaa tacaacttag ttaaacttcc aagatttgca atggaattag 1620 tcagaaatga gtgttcgtca aacgtaggtg cagcccttgc aaatgcttta ctacatgaca 1680 taaaacatct cttgaaaaat gacgttgata ttaaagatat tttgattgac aaatgcaagt 1740 tagatagggc caagtcaaaa gtaaaaatcg ttagtgaaga ggtggactca gaacaaaaaa 1800 aacagttaat ttgtcttggg gttgacggca agctcgatca aataactaaa acaagtatar 1860 taataaactt acctcaagga gaagtactaa agaaatgcat tgaacctgaa catcacctca 1920 catttactta tgaagatgga aaatcgagtg gaaattattt gactcatagg acgattcctg 1980 tcaccggtgc tacaggttta gttcttgcta ccgaaacgtt tagtgtttta caagaacata 2040 acagtttgga aagtatacaa gctgttcttc ttgataatac tgctaccaat actggaccaa 2100 taagtggctt agtggtaaaa ctagaagagt ttttaaaaag aaaactacat ctaattggat 2160 gcgcattaca tcaaaatgaa cttccgttac gagctctttt tgtaaaactt gatggtgaca 2220 caactgggcc aagaagcttt agtggtctac ttggcaaaag atgtgctgaa aatattcagg 2280 ataaaaatca agttttgttt aatcgtattg aaaacccaat tgtagatgga tatattccag 2340 aagaaatact agtagacctt agttgcgatc aaagactttt atttgaatat tgcaaaggaa 2400 taggcttttg gtaaagtttc tgataaatgg gccgcctata aaattggacc gcttaaccat 2460 gcaaggtggt tacacgcaag gtaactctag ctattcgcct tctctgtttg tatactagag 2520 acaatcaacc aactgtttct ttaaaaaaac tcgtatgttt tattgttcag gtttacgctc 2580 cagcgtggtt cgatataaaa aaatcttcaa aatttcacga atccccgcgc cttatttttt 2640 ctactatcac tagaataaat aatctcccct ttgatgatgt taaatatatt gttaaaaaaa 2700 acattaaaaa caatgcattg tgcctaaaac ctgaaaatat tctttatgcc atgttaaaag 2760 ataatgacag caagatacga aactttggtt ttcaaatcct actgggtcta aggcaaaggt 2820 atttttttat agtaaactaa aataaagttt tgaaaaaaaa tgatacatta aatgtggtaa 2880 ttggatttga tttatattta tagagtaaac aacgaaagtt tggaagtaaa tttgaaaaaa 2940 attccagaaa ttaattttaa tgctaatcat tggtatgagt tggttgatat cagcattact 3000 aaatttacgg agcccccaac aacacaacat ttttcaattg agcaaatcca atatgcgatt 3060 gacaataatg ttaaacctga aatccccgac tttccatctc actcccagag tgttgagcga 3120 gcggtaaaac ttgtgtcgga cgcatcccaa tatgtttatg gattcgagaa tcgacacagc 3180 tgcattttaa ctaaactact tagcaggaag atgcgtaaac cgtatatttc aaaagggcat 3240 tattcccatt cttatgatga tattttttaa ttgaaaatgt tttcctaaaa tatatatatc 3300 catagccaaa atgtttttaa attgtgaaaa taatgtttat tttgatattc ttatgtaaat 3360 ataagaataa aagtttcact atttaaatct tcaaatagtt aacaagttgt tataaattta 3420 gcttaattgt tataaactgt tataaactta aatcatacag caaagcaaat agtattgaaa 3480 aacatttaaa tatgaactta gttaaaacgt caagttttgc atataacgct ttttaaggtg 3540 tttttttaca taatccaaca aatcttaaaa attaaatccg caagcgattc tttatccaaa 3600 atctctcaaa ttttgagagt aatcgtattt taactgtcta caactactgc aatattacgc 3660 aatatgcgga attaaaatcg acctacccta a 3691 // ID DNA3-3_AP repbase; DNA; INV; 169 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-3_AP. XX NM DNA3-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-169 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1944-1944 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 169 BP; 43 A; 36 C; 40 G; 50 T; 0 other; aaggcgggtt tccactatat acacgtacgc gtgtacgtgc atacaaatcc tttgctgtga 60 ttggttggtt aggttaagtt accacacgat taaccacgat accgtgttgt tggacatagc 120 tgaacatttt tcgtgtacac ggcaaacgtg tatagtggaa acccgcctt 169 // ID Gypsy11-NVi_LTR repbase; DNA; INV; 306 BP. XX AC AAZX01023279; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy11-NV; KW Gypsy11-NVi_I; Gypsy11-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-306 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1145-1145 (2007). XX DR Genome; AAZX01023279; Positions 19782 19477. XX SQ Sequence 306 BP; 48 A; 107 C; 87 G; 64 T; 0 other; tgggcctcta cccaggtcca gccggaccag gcataaccca cattggtgac cccgacttgg 60 agagggctgc gcgggacctc gccggatttt cgccggcctg atggtttgca cccgctagct 120 gctctctgga ctcggcgtct ctgcctcgcg gtttgcctgt tcaggcgacc gccccgcagc 180 attcatcggc atcatcgccg cggtcagcct attcaggcaa ccgctctgct gcggaaaagc 240 gacgtcatcg acgcggtcag cctatatagg cgaccgcctt tcaacttgcc atcgttgccg 300 cggtca 306 // ID I-58_AAe repbase; DNA; INV; 7571 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-58_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7571 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1329-1329 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. CC The 3' ~5000 bp is ~91% identical, but the 5' 2500 bp is less CC than 70% identical to I_Ele14. XX FH Key Location/Qualifiers FT CDS 591..2375 FT /product="I-58_AAe_1p" FT /translation="MSASSPSLNPGDPGGPGGGYQFNRINGEYSNRLLPEF FT MDRDGNAGQLQFLRMKATTGAIPNDPFLLRLSVEKHVGGQISGAFKENKGI FT SYVLKVRSQAQFDKLLKMKKLADGTEIEIEEHSTLNQVKCVVSNADTIGLE FT ESYLVDQLHNQKVKEVRRIKRRNSNTNALENTPTLILTISGTVIPEHIDFG FT WSRCRTRHYYPSPMQCYRCWTFGHTGKRCTAPSRVCGRCSGVHPEDQTPEP FT SNDQGATVTPTPRTRSSCSEPPCCKTCDNSTQHALSSRECPAYKKENAIQI FT IRVDEGISYPQARREFEAREALKSSEKRNTYAGVASSSKDAEITELKNTVK FT KLENDAVRREKRMADMELALTNRQVSGRLEAAKDHGPIQELIRQVAELTAT FT VKQLQMDLREKDLTIAELREKQLLTPRNSVIPTADPTQQPTPTPQEASPNT FT TISSNSPKLKNVDPAMTSQVADWIRNLDQEEKEAKKLTIDKGAIPKKQKKG FT KGRSTAAIPSNEGNCLTQPHSLLSRQGERGEFFAKKMDIEDSDNSLESHET FT AASSTVGSKRSHIPISSAESTENSGDETRFTRSKPKLLRETVVDPSTK" FT CDS 2128..7329 FT /product="I-58_AAe_2p" FT /note="endonuclease, reverse transcriptase, and FT ribonuclease H." FT /translation="PSRTPSLAAKVKEASFLPKKWISRIAITRLNHTRQQH FT RPRSAPRGATSPFPVQNPPKTLATKPDSRGANPNFSVKPWSTRRRNSLPTN FT YVALEKPEKKTEFGTRPTILFTTTHTIAGDPSPEQNERPRTSSNVEEVNDN FT STQGNENAAEETSGNRGPPSAEVTPQPELGNNPRHPSVLSAEGPEDGRERS FT GPVGAEAIGPPELADNPGLPIRSGRGTEKGGNAYPPLDNEGIKPSLTKHKQ FT QDHRRTSDNEGFKTPPRRNPSRKRNPPDRYSPGWHNQPRSASKPDPPDPHQ FT GLPGHLGSKTPSGKPHRTQQILRPPASVPVFPKPFSVLELTNAEAYDKTIA FT TRDKMIAPPAAGPSSQTASSDPYQDRPGHLGRKNPSGKAHAVPLRPQPVSP FT TPGGTQIPPSATLRRKPTSTLIQSQQHSLMRFPSSPVGNDCLAVAPEIPVV FT VNPMPGPTLGGAPLESATRTLCPPNCFSQXYFSTAAPSPLPVVGSTSGPTL FT GGALSGSVSSNFIGSNNSKENSLSSFPVPALPTSHNFILQWNLNGYRSRLC FT DLELIIQNNQPWILALQETNNISADEMGRTLGGQYSWTVKRLANQRHSAAI FT GIRKGIPHKILDLPSDLPIVGIQILGSPSVSVACAYLPCGKIPNLHGHIHD FT CLQALPEPRIFLGDFNSHHPVWGSPKADTRGTNLIDALEEEDMLILNDGSP FT TFFNGHYSSEIDVTAVSCSFARNIQWSVSSDLHGSDHYPIRIALTNGAAIP FT KNTRRPRWKYEKADWAEYDLLIRSALINSPPENMGDFISTIVESAKATIPL FT TSCTPGRKALRWWSDETKAAVKARRKALRQLRRLPLDHPDRQFTLENYRKL FT HLECRNIIGDAKKATWESFLESMNASQTTSELWNRVNALNGKRKTTPLTLQ FT VQGKAMSDPEAVASELGKYFASLSAIESYETHFLNSVKPSTSTIKNFIIPP FT NTADPEMEQPFSARELSFALSRSNGKSAGPDEVGYPLLKNLPSVGKLALLD FT LYNKIWVNNDFPSAWKESLVVPIPKANIPTRDPTKFRPISLTCCAGKVMER FT MVNRRLRYHLEANGLLDFRQHAFRQGFGTSTYFAGLSDVLKEAYDKGNHAE FT VISLDISKAFNRTWTPLVLEQLAAWGIGGRTLHFVRNFLXDRTFQVVIGNT FT KSSSFPEETGVPQGSVLAVTLFLIAMNGVFSRLPKNIYVFVYADDIVIVVC FT GSTPTMTRIRAQTAVKSVAKWASDNGFQLSASKSIRCHICPSGHRITGPDI FT TIDGQPIPLRKTVRILGVTVDRALSFRQHFDTVKSACRSRLNLIKSISRPH FT RSNNRKIRFRVAHAIVDSRLVYGLELTSIAIDRLVEVLSPVYNSYIRIISG FT LLPSTPSDSACVEAGLLPFRIFIRASICCKTVAFLSKTAGEDRVFLLDEGN FT RALSTAANLSLPPVTRVHWLGDRSWRSVPPKIDNKIKNSFSAGSNSAALRR FT SVAELLQISYSDFALRYSDGSLTSAGVGIGVAGDIPDVSMSLPSQCSVFSA FT EAAAAFIAATTPSDRSILVLTDSASVISALQSDSPSHPWIQAILKYALPDT FT VFTWIPGHCGVPGNEMADHLAKSGLSGQRYTSEVPFMDLKRWIKSIFRQHW FT EDSWYRTRTLFLRKIKNTTTTWSDLPILKDQRILSRLRTGHTLISHNMGGG FT PFHKECESCHIPASVDHVLCACPIYEHLRQIHGLSDNIGEVLRDDATTIAA FT LLSFLHDANLYSSI" XX SQ Sequence 7571 BP; 2040 A; 2079 C; 1755 G; 1695 T; 2 other; cagttgacag ctcgtgttcg gcgcgttcgg atcagttttt tcactgctcc cgactaattt 60 tgcgagtgag ttctcggagt gcattttctg tggtttttcg cttttccacg ttttcccaac 120 ggtattttcg ttttgttcgc gatcgatttg ccgttttccg gggggttttt cacgcgtttt 180 cgccgtattt tcggtggtga aagtgctccc acgtggagga aaaaaggaga acaaaagaag 240 tgaaagcgtt cgttttaacc cgccgcgtgt gctttcgggg attataaccc aaaaagcaga 300 gctagtgaga gtgagcgtgg gcattacgcc ttggggagat tgaagtgaaa aaatcttccg 360 gatcttttcc ggaagaattt agtgctgggc cacggggcct gaagaatctt ttctacagag 420 cttcgtgaaa ggtgtgctgt accgtagtgt ggccttcttt aggcctttct tggatcccta 480 cgaggaccca cgaccaccag ctctgcggac acctccgcta caaaagccag caacttaagg 540 ccggtcgact gtgtggtcca ccaccaccat agtcccggcc cgccggcgcc atgtcggcga 600 gcagcccttc gctcaacccc ggggacccag ggggccccgg cggaggctac cagttcaaca 660 ggataaacgg tgagtactcc aaccgattac tacctgagtt catggacagg gacgggaatg 720 caggacaact gcaattccta agaatgaaag cgacaaccgg agccatcccc aatgatccgt 780 tccttctgcg actttccgtg gagaaacacg ttggtgggca gattagcggc gccttcaagg 840 agaataaagg catctcctac gtactgaagg tccgcagtca ggcccagttc gacaagttat 900 tgaagatgaa gaagctggct gatggaaccg agattgagat tgaggagcac tcgacgctca 960 accaagtcaa gtgtgtcgtg tccaacgctg acacgatagg actggaggaa agctacttgg 1020 tggaccagct tcataaccag aaggtcaagg aagtgcgcag gatcaagagg cgaaacagca 1080 acaccaacgc cctggaaaac accccgaccc taatcttgac gataagcggc accgtgatcc 1140 cggagcatat cgacttcggg tggtcaagat gccggacgag gcactactac cccagcccga 1200 tgcagtgcta ccggtgctgg actttcggac ataccgggaa gcgctgcacg gccccctcca 1260 gggtctgcgg cagatgtagt ggagtacacc ctgaggatca aacccccgaa ccatcaaacg 1320 accaaggtgc aacagtaaca ccgacgccaa ggaccaggag ttcgtgcagc gagccccctt 1380 gctgcaagac ctgtgacaac agcacgcagc acgcactttc cagccgagaa tgtcccgctt 1440 ataagaagga aaacgccatt caaattattc gtgttgacga aggaatctcg tacccgcagg 1500 cgcgccgaga attcgaagca cgagaagctt tgaaaagcag tgagaagcgg aacacttatg 1560 caggtgtcgc tagtagcagc aaggacgcgg agatcaccga actcaagaac acggtaaaga 1620 agctggaaaa cgacgccgtc cgtcgagaaa agagaatggc cgatatggag ctggccctaa 1680 ccaaccgcca agttagcgga aggctagagg cggccaagga ccatggcccc attcaggaat 1740 tgatccggca ggtagcagaa ctaaccgcca ccgtgaagca gctccaaatg gatctgagag 1800 aaaaagacct gaccattgcg gaactaaggg agaagcaatt gctgacccca aggaactccg 1860 taatcccaac cgctgatccc acccaacagc caacacctac cccgcaagaa gcctcaccaa 1920 acacgaccat cagtagcaac agtcccaagt tgaaaaatgt cgacccagca atgacctccc 1980 aggtcgcgga ctggatccgt aacctcgatc aggaggagaa ggaagcgaag aaactgacta 2040 tcgacaaggg agccattccg aaaaagcaaa agaaaggaaa aggaagaagc accgctgcca 2100 ttccttcgaa cgaaggaaac tgtctgaccc agccgcactc cctccttagc cgccaaggtg 2160 aaagaggcga gttttttgcc aaaaaaatgg atatcgagga tagcgataac tcgcttgaat 2220 cacacgagac agcagcatcg tccacggtcg gctccaagag gagccacatc cccatttcca 2280 gtgcagaatc caccgaaaac tctggcgacg aaacccgatt cacgaggagc aaacccaaac 2340 ttctccgtga aaccgtggtc gacccgtcga cgaaatagcc tgcccactaa ctacgtcgct 2400 ctagaaaaac cagaaaaaaa aaccgagttt ggtacacgac ccaccatcct tttcacgacc 2460 actcacacca tcgctggtga cccatcaccg gagcaaaacg agcgaccaag aaccagctcc 2520 aatgtagaag aagtaaacga caactcaact caaggcaacg agaacgccgc agaagagacc 2580 tcgggcaacc gaggcccccc cagtgcggaa gttacacccc aaccggaact agggaacaac 2640 ccccggcacc cgagtgtcct ctctgccgaa ggcccagagg acggtagaga acggtcaggc 2700 cccgtcggtg cggaagctat tggtccaccg gaactggcgg acaaccctgg ccttcccatc 2760 cggtctggaa gagggacgga aaagggcgga aacgcctatc cccctttgga caacgaggga 2820 atcaagccat ctttaacaaa acacaagcaa caagatcacc gcagaacatc ggacaacgag 2880 ggattcaaga caccaccacg caggaacccc tcccgcaaga ggaacccgcc ggaccgctac 2940 agccccggct ggcacaacca acccaggtca gcgagcaagc cggatccacc cgacccccat 3000 cagggtctgc ctgggcacct cggaagtaaa actccgagtg gtaagcccca ccgcacccag 3060 caaatactcc gcccgccagc ttcagttcca gtttttccga aaccgttttc agtacttgaa 3120 cttacaaacg cggaagcata cgataaaacg atcgccaccc gcgacaaaat gatagctcca 3180 cccgctgctg gcccctcaag ccaaacagcc tcctcagatc cctatcagga tcgccctggg 3240 cacctcggac gtaaaaatcc gagcggtaag gcccacgcag tcccgctgcg accccaaccg 3300 gtaagcccga cgccaggagg aacacagata cccccttctg caacgctgcg aaggaaacca 3360 acctccacct taatccagtc acagcagcac tccttgatga ggtttcctag cagccctgtt 3420 ggaaacgact gtcttgccgt agctcccgaa attccggtag tagtaaatcc catgccgggt 3480 cccacgcttg gtggggctcc gttggaatca gctactagaa ccttatgtcc tcccaattgc 3540 ttttcccaaw cctacttttc taccgcagct ccttcacctc tgccagttgt gggttccaca 3600 tcgggaccta cgctcggtgg agctctgtcg gggtctgtat ccagcaattt catcggttct 3660 aataattcca aagaaaattc tctctcaagt tttccggttc ctgctcttcc tacgtcacat 3720 aatttcattc tgcaatggaa tttgaatggc tacaggtctc gtctttgcga tcttgaattg 3780 attatccaga ataatcaacc ttggatctta gcccttcaag aaactaataa tatatctgcc 3840 gatgaaatgg gtcgcacgct cggcggccaa tattcttgga cagtcaaacg attagcaaat 3900 caacgacact ccgcggcaat tggaattcgt aaaggaattc ctcacaaaat tcttgatcta 3960 ccttccgacc ttcctatagt tgggatacag atactaggtt ccccgtcggt ctcagtagcc 4020 tgcgcttatc taccgtgtgg caaaattcca aatcttcacg gtcacattca tgattgtctt 4080 caagccctcc ccgaacctcg aatctttctt ggtgatttca acagtcacca tccggtctgg 4140 ggctccccca aggccgatac gcgcggcacc aacttaattg acgccttaga agaggaagac 4200 atgttgattc taaacgatgg ttcaccaact tttttcaatg gtcattattc cagcgaaatt 4260 gacgtcaccg cagtatcgtg ttctttcgcc agaaacattc agtggagtgt aagttctgac 4320 cttcatggta gtgatcacta tccaatccga atagcgttga ctaatggtgc ggccattcca 4380 aagaacacga gaagacccag atggaagtac gagaaagccg attgggctga gtacgatctg 4440 cttatccgtt ccgcccttat caacagtcct cctgaaaaca tgggtgactt catttcaacc 4500 atagttgaat ccgctaaggc cactattccg ctcaccagct gcacccctgg ccggaaggcc 4560 cttcgatggt ggtctgatga aaccaaggct gctgtaaaag cccgcagaaa agcgctgcgt 4620 caacttcgaa ggcttccact tgaccacccg gacaggcagt ttaccttaga gaattatcgc 4680 aagctccacc tagaatgtcg gaacatcatc ggtgatgcta agaaggccac gtgggagagt 4740 tttttggaga gcatgaatgc atctcaaact accagtgaac tatggaatcg agtgaatgct 4800 ttgaatggaa agcgaaaaac tacaccgctc actctacaag tgcaagggaa ggccatgtct 4860 gatccagaag cagttgcaag tgaacttggt aaatattttg ccagcctatc tgccattgaa 4920 agctacgaga cccactttct taattctgta aaaccttcca cttccactat caagaatttt 4980 attattccac ctaacactgc cgatcctgaa atggaacaac cattttcagc cagagaactc 5040 tccttcgcgc ttagtcggag caatggtaag tcagcgggtc ccgatgaagt tggttatccg 5100 ctcctcaaaa accttcccag tgtcggaaaa cttgcgctgt tagatcttta taacaagatc 5160 tgggtcaata atgatttccc ttctgcttgg aaagaaagtc tagtagtccc tatacctaag 5220 gctaacatcc ccacccgcga tccaactaag tttcgaccaa tatccttgac ctgctgtgcc 5280 ggaaaagtaa tggaaagaat ggtaaatcga aggctaagat atcacttgga agccaacggg 5340 ctattggact tccgccagca tgcctttcgc caaggatttg gaacatctac ttacttcgca 5400 ggtctcagtg atgtcctcaa ggaggcttac gataaaggta atcatgccga agtaatttca 5460 cttgacatct ctaaggcgtt taacagaacg tggaccccac ttgttctgga acaactggct 5520 gcttggggca taggaggaag aaccctgcat tttgtgcgca atttccttkc cgataggact 5580 ttccaagtag ttattgggaa cacaaagtca tcatccttcc ctgaagaaac aggcgttccc 5640 cagggttctg tgctagctgt aactcttttt ttgatcgcta tgaacggcgt cttctcccgt 5700 ctgccgaaga acatatatgt attcgtttat gcggatgata tagttatcgt tgtttgtgga 5760 tcaactccta ctatgacgcg gatccgagct caaaccgcag tcaaatcggt agcgaaatgg 5820 gcatcagata atggtttcca actgtccgcc agtaagagca tccgttgcca catttgtcca 5880 tccggacata ggataactgg cccggatatc acgatagacg gtcaaccaat tccgcttcgt 5940 aagactgttc gtattcttgg agtaacagta gatcgggctc tctccttccg tcaacatttt 6000 gatacagtca agtcagcgtg tcgatctagg ttgaacctga ttaaatccat ttcccgcccg 6060 catcgatcaa acaatcgaaa aatccgcttc agagtcgccc atgccattgt cgacagtcgg 6120 cttgtatacg gattggagct tactagcata gcgatagata gattggtcga agttctcagc 6180 ccagtctata actcctacat cagaataatt tctggtctgc ttccatcaac accatcagat 6240 tccgcttgcg ttgaagctgg gcttcttcca tttcgaatct ttattcgcgc ctccatctgc 6300 tgcaaaacgg tagcattcct cagcaagaca gccggcgaag acagggtctt tctccttgat 6360 gaggggaata gagccctaag cacggcagcc aacttgagtc tcccgccggt taccagggtt 6420 cactggcttg gagacaggag ctggcgttcc gtcccaccaa aaatcgacaa taaaataaag 6480 aacagttttt ctgccggcag caactctgct gctttgcgac gatcagtcgc agagctgcta 6540 caaatttctt attctgattt tgctcttcgg tactcagatg gatcccttac aagcgccggt 6600 gtcggcatag gggtggccgg tgacatcccg gacgtaagta tgagcctgcc atcacagtgt 6660 tcagtatttt ccgccgaggc agcagcagcc tttatcgccg ctaccacccc ttcggaccgt 6720 tcgatcttag tcctaaccga ctcagccagc gtaatatctg cgctacagtc agattcgcct 6780 tcgcaccctt ggattcaagc aatattgaag tacgcactac ctgacacggt tttcacgtgg 6840 atccctgggc actgcggagt tccagggaac gaaatggctg atcatcttgc caaatcgggc 6900 ctttcaggtc aacggtatac ttcagaggtt cctttcatgg accttaaacg gtggatcaaa 6960 tctatcttcc gtcaacattg ggaagattca tggtaccgca caagaacgct tttccttcgg 7020 aagattaaaa acacaaccac tacttggtca gaccttccaa ttctcaagga ccaaaggata 7080 ctatctcgct tacgaaccgg acacacccta atatcgcaca atatgggagg cggtcccttc 7140 cacaaagaat gtgagtcttg tcatatccca gcctctgtgg accatgtcct ttgtgcctgc 7200 cccatatacg aacatctgcg acaaatacat ggcctttctg acaacatcgg agaggtacta 7260 cgcgacgatg caactactat tgctgcttta ctgagttttc tacacgatgc caacttgtac 7320 agcagtatat gatcctgccc aatatgacga cgacattgat gccagcccac ggatacgaaa 7380 ctgttttact aaattgtact atgtatagat gtaagcttga aatgttagat tgacagttat 7440 ctctttttta tcggttcagt cttctgctga ccgtatttct cccgggtctc agccttctgc 7500 tgaaccttct cccgtgttga actagcataa tgttaaaaaa cacgttaata aagatgaaaa 7560 aaaaaaaaaa a 7571 // ID Gypsy-9_PPc-I repbase; DNA; INV; 4363 BP. XX AC . XX DT 08-JUL-2010 (Rel. 15.07, Created) DT 08-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_PPc_; KW Gypsy-9_PPc-LTR; Gypsy-9_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-4363 RA Jurka J.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1010-1010 (2010). XX DR [1] (Consensus) XX CC Positions [2731-3189] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 76..3708 FT /product="Gypsy-9_PPc-I_1p" FT /translation="MPIEQFKAIVMLAGMDLPRHAATMFHVMNGMKGNDSP FT TIDSILSVVNTYKEVYSDSLAVSKQNDLQVNAITGNRSQSQRKYSYRKFPP FT KRPEEGGCIRCGRDHGGRECPHVNSTCFNCLGTGHIAPVCKNAKKSPERRE FT NRVNLVQGDLPEDLERFETDVIMNGKVIRMTVDTGSDITLISDSTWRRIGA FT PERNEQDAFPRCANGTPLSLSGKCHLTLELNGSVAQGSVYTSDESENLMGR FT DFITTFFRLIPKDAKVNAVKSDDSYANWIKGEYSEICKEGLGMCEKAKATF FT CLKEGANPVFCRRREVPLALLPKVDEEIDRLLAMEAIESVDYLEWAAPILV FT VPKPNGKPRVCVDFSTGLNASLEPNNHPLPLMAEIFTRLEGCTLFSQIDLS FT DAYLQIPVDEQCRDLLGVSTHRGLFRFKRLPFGVSVAPGIFQKAMDTMLAG FT CENSQAYLDDVVIGGKTKEIHDGNLKKVLERIKDYGFRIRPEKCSFGMSKI FT KYLGFIIDQAGRRPDPAKVKAVREMPQPQDQGSLRSFLGMVSYYGPFIDGM FT HKIRAPLDNLLKDDVEWIWSSKCERSFKEIRSILASDLNLIHYDPQKELVL FT AADASEKGIGAVIAHRVNGKLMPIAHASRSLKDAEIKYSQIEKEGLALIFG FT VTKFHRYLFGRRFVMQTDHKPLLSIFGSKEGIPVHTARRLFHWATILLGYN FT FSMEYISTDSFAYADALSRLISDSRDDRDEEKVLEEVETVVVRMISASIEN FT LPVTATDIREALSGDPLLRRVRDFHLTRWPDESKMRKDRDFEQIKSFFHMR FT KVISVVDDCLMVNDRVIIPHSIREKVLRMLHSGHPGIVRMKSIARQACYWY FT RMDQDIEKVVLSCDQCAQALKRPIKVPLEPWPKAAEPWERIHVDYCGPVEG FT KYLLVIVDSLSKWPEIVATPSMMAGATIRILNESIARNGLPRMIVTDNGTQ FT FDSDAFNRYCRGRGISHVNTPTYHPQSNGQAERFVDSVKRSLLKQKGERPL FT EEALQLFLYNYRKTPNPQCDGKTPSEIMFGRNIRSEIDLMAPIKVFDGKDD FT RIDRMKDQFDKKHGAKARSFKIGDRVLYDMQVGPNGRKWTMGTVDGRIGKT FT MYEVKLKDRTVRAHGNQLRKRMSSDREIEYLNEPVIEITGFDGRNSLVQEE FT PSLVDDVDDEMDATTAVETQSINDIANDDNDAPRRSTRVRAQPKRLNVNFQ FT E" XX SQ Sequence 4363 BP; 1247 A; 873 C; 1160 G; 1081 T; 2 other; agattgaagt gcgatgacat caaagtgttc tcagggatcg tgaatagagt ggtagaagag 60 gctaatatca atgagatgcc tatagaacag ttcaaggcga ttgtaatgct ggctggaatg 120 gatctgccca gacacgctgc aacaatgttc catgtcatga atggtatgaa aggaaacgat 180 tctcctacta ttgattcgat tctcagtgtg gtgaatacgt acaaagaagt gtactcggat 240 tcacttgctg tgtcaaagca gaatgatttg caagtaaatg ctatcactgg taatcggagt 300 cagagccaga ggaagtactc atacaggaag ttccctccga aaagaccaga agagggcgga 360 tgcatccgat gtggaaggga ccacgggggt agagaatgtc ctcatgtgaa cagcacttgt 420 ttcaattgcc ttggaacggg acatattgca cctgtgtgca agaatgcaaa gaaatctcct 480 gaaagaagag aaaatcgagt gaacctcgta cagggtgatc tacctgagga tcttgagaga 540 tttgaaacgg acgtgatcat gaatggtaaa gtgatcagaa tgaccgtaga caccggttcc 600 gatatcactc tgatctctga ttcaacatgg agaaggatcg gtgcacccga gagaaatgaa 660 caggatgcgt ttcctagatg tgcaaatggt acacccctga gcctatcggg taaatgccat 720 ctcaccctcg agttgaatgg aagcgttgct caaggcagtg tgtatacttc agatgagtct 780 gaaaatctaa tgggaagaga tttcataact actttctttc gactgattcc gaaagatgcg 840 aaggtgaacg cggtgaagag tgatgactct tatgcgaact ggatcaaggg agaatacagt 900 gagatctgta aagaaggact gggaatgtgt gagaaggcga aggcgacttt ctgtttgaaa 960 gagggagcaa atcccgtgtt ctgcaggaga agagaagtcc ctttggcact ccttcctaaa 1020 gtcgatgaag aaatagacag attactggct atggaagcta ttgaatcagt ggattatctt 1080 gagtgggcgg ctccgattct tgtggttccc aagccgaacg gaaagccgag agtctgtgtc 1140 gatttctcaa cgggattgaa tgcaagtctc gaaccgaata atcatccgtt gccactgatg 1200 gctgaaatat tcactcgatt ggaaggttgc actctattca gtcagatcga tctgagtgat 1260 gcttatctac agattccggt ggatgagcag tgtagagatt tgctcggagt gagcactcat 1320 cggggattat tccgattcaa gagactgcca ttcggtgtga gtgtcgctcc tggaatattc 1380 cagaaagcaa tggatacgat gctggccgga tgcgagaatt cccaggcata tctggatgac 1440 gtggtgattg gtgggaaaac gaaggagatt catgacggaa atctgaagaa agttctggaa 1500 agaatcaaag attacggatt cagaatccgt cctgagaagt gctcgttcgg aatgagcaag 1560 atcaagtatc tcggatttat aattgatcag gcgggaagaa gacccgatcc tgcgaaagtt 1620 aaggctgtga gagaaatgcc acagccacaa gatcaaggat cattgcggag tttcttggga 1680 atggtgtcct attatggtcc attcattgac ggaatgcaca agatacgagc tccccttgat 1740 aacctcttga aagatgatgt tgagtggata tggtcgagta aatgtgagag atcattcaag 1800 gaaatcagat caatccttgc atctgatctc aatctgattc attacgatcc tcagaaggaa 1860 ttggtgctcg cagctgatgc gagtgagaag ggcatcggag cagtaatcgc gcatcgagtt 1920 aatggaaaac tgatgccgat tgcccatgcc tcgagatcac tcaaggatgc agagataaag 1980 tattctcaga tcgagaagga agggctcgcc ctcatctttg gagtgaccaa attccacaga 2040 tacttgtttg gaaggagatt cgtgatgcaa actgatcaca aaccattgtt gtcgattttc 2100 ggatcgaaag agggaattcc tgtgcataca gcaagaaggc tgtttcactg ggctacaatt 2160 ttactcggat acaatttctc aatggagtac atcagtactg acagtttcgc atatgccgat 2220 gcactatctc gattgatttc tgattccaga gatgatcgag atgaggaaaa ggtgcttgaa 2280 gaagtggaaa ctgtagttgt tagaatgata agtgcaagca tcgagaactt accagtgacg 2340 gcaactgaca tcagagaggc gctgtcggga gatcctctgc tgcgtcgagt gagagacttc 2400 catctaacga gatggcctga tgagagcaaa atgcgaaagg atcgcgattt tgagcagatc 2460 aagtcgtttt tccacatgag aaaagtgatc tcagtcgtcg acgactgcct tatggtgaac 2520 gatagagtga ttattcccca ctcaatcaga gagaaagtac ttcggatgct ccattctgga 2580 cacccaggta tcgtgagaat gaagtcgata gccaggcagg cgtgctactg gtatcgcatg 2640 gatcaagaca tcgagaaagt ggtgctwtct tgtgatcagt gcgcacaggc gctcaagaga 2700 ccgatcaaag ttccactcga accatggccg aaagctgcgg aaccatggga aagaattcac 2760 gtggattatt gtgggccagt ggaagggaag tacctcctgg tgattgtgga ttcactatcg 2820 aagtggcctg agattgttgc gacaccgtcg atgatggcag gggcgaccat tcgaattttg 2880 aatgagtcaa ttgccagaaa tgggctgcca agaatgattg tcactgacaa tggcactcag 2940 ttcgattccg atgcattcaa tcggtactgt cggggaagag gaatttctca tgtgaacaca 3000 ccgacttacc atccccagag caatgggcaa gcagaaagat ttgtggactc agtgaagaga 3060 agtctgctga aacagaaggg agaacggcct ctcgaggagg cactgcaact gtttctgtac 3120 aattacagaa agactccgaa ccctcagtgc gatgggaaga caccgagtga gataatgttc 3180 ggaaggaata ttcgatctga gattgatcta atggctccga tcaaggtctt cgatggaaaa 3240 gatgatcgga ttgatagaat gaaagatcaa tttgataaga aacacggagc gaaggcaaga 3300 tcattcaaga tcggagatcg ggtcttgtac gacatgcaag tcggtcccaa cggtaggaag 3360 tggaccatgg gtactgtaga tggtcgaatc ggaaagacca tgtatgaagt gaagttgaaa 3420 gatcgcactg tgcgagctca tgggaatcag ttgagaaaga gaatgtcatc tgatcgggaa 3480 atcgaatatc tgaatgaacc agtgattgag atcacaggtt ttgatggcag aaattcgttg 3540 gttcaagagg agccaagtct ggtggacgac gtcgacgacg agatggatgc gactactgcc 3600 gtggagactc aatcgatcaa cgacatcgcc aatgatgaca acgacgcccc gagaagatca 3660 acgagagtgc gagcacaacc gaagcgactc aatgtcaatt ttcaagaata agacatacga 3720 cgaagaataa ttcccagacc aaagtgcttc gatcgtggcc ccatttgtta tttgactcac 3780 agtaattgac tgtattattg ctcagattct cccccaagtt cgaatctttg ggagggatgk 3840 tgttgtatcg agggacgctc cctctcctct ctagtcaatt accatatcga cgacgtgcga 3900 ttctggccgc ctcccctctc cctctgtcta gcaactgtct ggtaatttca ttagcatacg 3960 cactttacat aattaccatc tccctctccc ctctcccgtc ttgtttgcgc agtctcctga 4020 ctggttgctt gtagataaag ctctcttcct ttattccttg agtgaggaca gaccaataca 4080 taacattggc gttcagtact cacgttttgt gtctgtctct cctcgtgttt tcttttcgtg 4140 ttccatcgcg cgcttgtctg agtctgagga aaccgcggtt tgagaatcac gtcacaagtt 4200 ggtttgggaa tcagcttgtg cactggaaga cagcagtttg ggaaccgctg gggaagagtt 4260 gtttggcaat aacccttcat atccggttac tacggcagga gattggaaat tggtgaccga 4320 tctcaagtac acgtgtcatt acgatgggac agactggtgg aga 4363 // ID Gypsy-6_SI-I repbase; DNA; INV; 4219 BP. XX AC AEAQ01015719; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_SI_; KW Gypsy-6_SI-LTR; Gypsy-6_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4219 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01015719; Positions 247 4465. XX CC Positions [1760-2263] - Reverse transcriptase CC Positions [3320-3796] - Integrase core CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1277..2878,2882..4198) FT /product="Gypsy-6_SI-I_1p" FT /translation="MRGNRTIKFRGAGSGENTTLGEIQLKICIDMEIYDIV FT IHIVRDGIILHGMLIGSDFLSQVKVHMKNGVIKISKITHDYGELPEIYKID FT VVQDANKIDLSHITDECVKNEIEYLVKKYKPRKEKDVNIQMSIVVKDDIPV FT TQSARRLSAVEKAEVELQIREWLDKGIVRPLCLDYASPIVLVKKKNGATRI FT CVDYRKLNEKVVKNRYPLPLIEDQLDLLQGATLYSTLDLENGFFHVAIEES FT SRKYTAFVVSNGHYEFLKMPFGLCTSPSYFQKYVNAVFRDLTAKEIIAIYM FT DDLIIPVNVQEGLSRLKLVLETAAKHGLSFNWKKCRLLEPRVNYLGYVVED FT GKITPSEEKAIAVRHFPKPINIRAVQAFLGLTGYFRKFVPRYSHVARPLTD FT LLKKNVKFDFGKEQELAFNQLKQALSDKPVLHLYCPTAETELHTDASALRF FT GAILLQRDSEDGMFHPVYLASSKTTSGEAKYDSYKVEVLAIVRALQKFRVY FT LIGISFAIVTNCKAFTQTMKKKVISAQVAGWALFLEDFYSIVHRPGNNMRH FT VDALSRYPLPAAMIIEECDSSILVRLRRNQLEDEELKGIEKQLEENRIDGF FT TMRNGLLCKENNGVVPKLMQTALIRQVHKRGHFRSTKTEQLLKADYWFKNM FT HSKIEKVIQSCLACIMVTKKSGKQEGLLYPIKKEAPLDTYHIDHLGPMPST FT PKKYQYIFAVVNAFTKFVWLYLTRSTSAAKVLDHLMKQAAIFGNPRRIISD FT QGSAFKSGDFKAYCKDEGIEHTLIVTGIPRGNGQIERINSTLIPLLTKLSM FT PQSTRWHTFVARAQQYLNHVPSRSTGITPFHQLFGARMRLKDDPQIKEILD FT TENATIFQEEREHMREEAREAISKIQAENKRTYDKKRKKPNTYQENSLVAI FT QRTQGGPGLKLCTKYLGPYKVKRTLRNDHYIVEKIGEGEGPRRTSTAVDHM FT KPWTNLQEDPPDDGKDNI" XX SQ Sequence 4219 BP; 1494 A; 763 C; 992 G; 970 T; 0 other; gcgtagcgtg cgtgagcgtg cacgtgcacg tagagaggac agattcgtgt acgaaagttg 60 taagaataaa cgccattatt gtccacttct cgaaaccacg ccttgtgaac catatcttac 120 caacatcctt aaagtaaacc tacaattggg ggctcgtagc tgggatcaag gcagagaaga 180 aattgaggag aaagatgtcg gactacggtg aaggacgaga ggacgacgca aacgacgcaa 240 acgacgtgtt acaagtcgac acgatgacgg tggcccagat caagaaaaag ctcagggagc 300 tcaagctgaa ggtaacgggc aataagatcg atttggtggc acgattaaag gcggctttaa 360 ccctggacga tcaacacgaa aacgaaagca acgatgacga cgagtctagc ggcgagagca 420 acaacgaacg aggcgccggc ggtaatggcg gcgaggaaga cgaaagacgt aaccggcaaa 480 aatatatgcc gactttcaaa gacgtagaag aatcaatcga cacattcagc ggagacgacg 540 gtaaaaatat caagttgtgg attgaagaat tcgaggaact ggcgaaactg agtgagtggg 600 acacagtaca aaagacgatt tacgcaaaag gctattacgt ggctctgctc gtttgttcat 660 aaaagctgac ggtggaaaga cgtggagtgc gatcaagatg gccctaaagg cggaattcgc 720 attgaaggtg gacagtaggg cagtacacaa ggagctgcaa ctaagaaaga agaagactgg 780 cgaatcctat cacgaatatt gctacaagat gatggagata gcagcacgag cggatgtcga 840 gacgaaagcg gtgatatagt atattatcgg tattgacgac gaaggttatc gtaagtctgt 900 tctatatgga gataaaacta ttcgcgaatt aaaagacaaa ctcgatatat acgcagaatt 960 acatggtaaa acgaagttta agacgggtga aaacaaaaag aaattaccga acacaaacga 1020 taaagacgcg aagcgagatt cgaaacggtg ttataattgc ggcgataata gtcatttaag 1080 cgcggcgtgt ccttcgagag agcaaggcac aaaatgtttt gagtgcaata aatatgggca 1140 catagcagca aaatgtccgg aaaaggcgag tgtagaaaaa aagaagaact gtaaccttgt 1200 gcaatccgat tctggcaaat gccgtaaaaa tgttgctatt aacggtacga acagaaataa 1260 acagatcgga tcgccgatgc gcgggaatcg tacgataaag ttccgagggg cgggctcggg 1320 agaaaataca acgttaggtg aaatacagtt aaaaatatgt atcgacatgg aaatatacga 1380 tatcgttata catattgttc gcgacgggat aattctacat ggtatgctca ttggatctga 1440 tttcttaagt caggtgaaag tacacatgaa aaacggtgtt ataaaaattt caaagattac 1500 acacgattac ggcgaattac cagaaattta taaaattgac gtagtacaag atgcgaataa 1560 aatagacttg tcacatatta cagacgaatg tgttaagaac gaaatagaat atctagtaaa 1620 gaagtataaa ccccggaaag aaaaagatgt aaatattcaa atgagcatag tcgtgaaaga 1680 tgacatccca gtcacgcaga gcgcacgtag attatccgca gtagaaaaag ctgaagtaga 1740 attacaaatt cgggaatggc tcgacaaagg aatagtacga ccgttgtgtt tggactacgc 1800 aagtcctata gtattagtaa agaaaaaaaa tggcgcgacg agaatttgtg tagattaccg 1860 taagctaaac gaaaaagtag ttaagaatcg atatcctctg ccgctgattg aggatcaact 1920 agatttactt caaggtgcta cattgtacag tacattagat ttggaaaacg gattttttca 1980 cgtagcaatc gaagaaagca gtcgcaagta tacggctttt gtcgtctcta acggacatta 2040 tgaattttta aagatgccgt tcggactctg tacatctccg tcctattttc aaaaatatgt 2100 taacgcagtt ttccgtgact taacggcgaa agaaattata gcaatatata tggacgattt 2160 aatcatacca gtcaatgtgc aagaagggct ttcgcgactt aaactagtat tagaaacagc 2220 agcaaaacac ggattatctt ttaattggaa gaaatgtaga ttattagagc cgagagtaaa 2280 ttatttagga tacgtagtag aagacggtaa aattacacct tcagaagaaa aagctatcgc 2340 cgtaagacat ttcccaaagc cgattaacat acgcgcggtg caagcgttct taggtcttac 2400 tggatatttc cgaaagttcg taccaagata ttcacatgta gcacgaccgt tgacagactt 2460 gttaaaaaaa aatgtaaagt tcgattttgg aaaagaacaa gagcttgcat ttaatcaatt 2520 aaaacaggct cttagcgata agcctgtatt acatttatat tgtccaacgg cggaaacaga 2580 gttacataca gacgcatcag ctcttagatt tggcgcaata ctcctacagc gggatagtga 2640 agacggaatg ttccatccgg tgtatttggc gagtagcaag actacgtcag gcgaagcgaa 2700 atatgatagt tataaggtag aagtactagc gattgttcga gcgttgcaga aatttagagt 2760 atatttaatc ggaatctcat ttgcgatagt aaccaattgt aaagcattta cacagaccat 2820 gaagaaaaag gtcatcagtg cacaagtagc aggatgggca ctctttctag aagatttttg 2880 atattcgatt gtacatcgcc ccggcaacaa catgcgacac gtagatgcat taagccgata 2940 cccattgcct gccgccatga taatagaaga atgcgatagc agcattttag taagacttcg 3000 acgaaatcag ctggaagatg aagaattaaa aggtattgaa aaacaactag aagaaaatcg 3060 gatagatggc tttacgatgc gaaacggact attatgcaag gagaataatg gcgttgtacc 3120 aaaattaatg caaacagcgt tgatacgtca ggttcataaa cgtggtcatt ttagaagcac 3180 aaagacggag caactcctga aagctgacta ctggttcaaa aacatgcatt caaaaattga 3240 gaaggtgatt cagagttgcc ttgcctgcat catggttaca aagaaaagtg gcaaacaaga 3300 aggactactt tatccgatta agaaagaagc tccgttagat acttatcata tcgaccattt 3360 gggaccaatg ccatcgacgc cgaagaaata tcagtatata tttgcagtcg taaatgcttt 3420 tacgaagttc gtatggctat atctcacaag aagtactagt gcagcaaaag ttttagatca 3480 cttaatgaag caagcagcta tttttgggaa ccccagaaga atcatttccg accaagggtc 3540 agcgttcaaa tctggagatt ttaaagcata ctgtaaagac gaaggcattg aacatacttt 3600 gattgtcacg ggcataccaa gaggaaacgg ccagatcgag cggatcaata gcacattaat 3660 tccgttgtta accaagctgt cgatgccgca gtcaacccga tggcacacgt ttgttgcacg 3720 agcgcaacaa tatttaaatc acgtaccaag tcgaagcaca gggataactc cctttcatca 3780 gctgttcgga gcacgaatga gacttaaaga tgaccctcag attaaggaga ttttggatac 3840 tgaaaacgct actatttttc aagaagagcg cgaacacatg agagaagaag cacgtgaagc 3900 aatatccaaa atccaagctg agaacaaacg aacctatgat aaaaagcgga agaaaccaaa 3960 tacttatcaa gaaaatagtc tagtggctat tcaacggacg caaggtggac caggtttaaa 4020 gctctgtaca aagtatttag gaccatacaa ggtcaaacgt actctgcgca atgatcatta 4080 catcgtggaa aagataggcg aaggagaagg tccgcgtaga acctcaacgg ctgttgatca 4140 catgaagcca tggaccaatt tacaagaaga tccaccagac gatggaaaag acaacatctg 4200 aggtcagatg ttggtcgga 4219 // ID DNA9-2_AAe repbase; DNA; INV; 1443 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA9-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1443 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1283-1283 (2011). XX DR [2] (Consensus) XX CC ~94% identical to consensus. 9-bp TSDs. Subterminal inverted CC repeats are 101 bp long. XX SQ Sequence 1443 BP; 478 A; 256 C; 247 G; 462 T; 0 other; ggttgggtga caaaacgtcg aaagacaaaa tgtcgaatgc caaaacgtcg aatgccaaaa 60 tgtcgaaagg acaaaacgtc gaagggacaa aacgtcgaaa ggacaaaacg tcgaaagaca 120 aaacgtcgaa tgccaaaatg tcgaaagcta aaatgtcgga gggacaaaac gtcgaaagga 180 caaaacgttg aataaacgtt gaatgccaca acgtcgaaaa gaacaaaagt acccgaatgt 240 ctgacccgaa tgtcttcgaa attttcctaa atctcaggct acagattttt ttttccagtt 300 cggttattca ctttttgaaa ctataatttg atggaagtct gttatggaaa aatagtaaaa 360 agtaataatt ttttttaaga tttcttgaag acacctcaac cgatttgtat gatttattaa 420 gagaagctct tggtattgta tagcccacat ttcatttttt tttatattga ctaaaacaaa 480 atgtcgtcaa aagaaatttt atatagggaa tataaatccc ccggaaaaat cggatcagtg 540 caaatacatt gttttccaaa gatatctacg ggataccata atttatagct ggtcttagaa 600 aggaaagtaa aaaataaata gccaagtctt gtcagaattg ttttagtatt ttttttttat 660 ttttgaaacg ataaaacaat tattggtgtc tctgattaaa gtgctcattt tgagattctc 720 tatacaaaaa aaaaatcaat gcacagaagt catttcaaaa gttcatataa aaatgtgctg 780 gatatcctga tgaaatatat ctatttccga aaagcaaatc aataaatcga ttcaattgta 840 cttatccgac aaaagaatga gtattttttt gaagaataaa gaaatcacta aatgtcgaaa 900 atccagtcaa tttgatgaaa ttaatgttac tgatttgttg aaagaagagc gagtcataag 960 gtgcggcgaa agaaagctta tcatcttttt catcttcaat cggtcttatt ctttaacagg 1020 cttgttttgt cttgtgtaat tgccttctca ttacctcagc aattaagtat agccataaga 1080 atgtcacctt acctccttga acatatcttc attatcgaac atcatctcag gcctccattt 1140 cagtttcctt ctcaaagtgc ttcactgctt cttttctcgc cagaaatgac atttttctga 1200 aaaatttgtc ccacgattac tttgatgaaa attgtaaagt tttgctttct caggccgcgc 1260 cgatcgcatc gcaaccattt tttccttaga aacgcttgcc aataacaatc gctagaaaaa 1320 tgactgcttt cgacgttttg tcccttcgac gttttgtccc tttcgacgtt ttgggattcg 1380 acgttttggc attcgacgtt ttgggtttcg acgttttgtc tttcgacatt ttgtccctaa 1440 acc 1443 // ID FEILAI-1B_AAe repbase; DNA; INV; 289 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A SINE family from Aedes aegypti. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW FEILAI-1B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-289 RA Kojima K.K. and Jurka J.; RT "SINEs from the yellow fever mosquito."; RL Repbase Reports 11(4), 1445-1445 (2011). XX DR [2] (Consensus) XX CC ~99% identical to consensus. This family is ~86% identical to CC FEILAI_AA. XX SQ Sequence 289 BP; 86 A; 64 C; 79 G; 60 T; 0 other; ggggcccaga tagccgtagc ggtaaacgcg cagctattca gcaagaccaa gctgagggtc 60 gtgggttcga atcccaccgg tcgaggatct tttcgggttg gaaattttct cgacttccca 120 gggcatagag tatcttcgta cctgccacac gatatacgca tgcaaaaatg gtcattggca 180 tagtaagctc tcagttaata actgtggaag tgctcataag aacactaagc tgagaagcag 240 gctctgtccc agtggggacg taacgccaga aaagaagaaa gaagaagaa 289 // ID SMAR18 repbase; DNA; INV; 2677 BP. XX AC . XX DT 04-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR18. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2677 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1076-1076 (2007). XX DR [1] (Consensus) XX CC Youngest copies are 99% identical with consensus. XX FH Key Location/Qualifiers FT CDS 801..2153 FT /product="SMAR18_1p" FT /translation="MPISTNAIKTKALKIYAHLKESNPDAVLVTKASKQEF FT LASKGWFENFKSRFGLHNIKVQGETGSADVEAARVYPKTLAKIIEEGGYKG FT EQIFNADETGLNWKKMPHRTYISKNEKVAPGFKAAKDRITLLLCSNASGDY FT ITKPLFINRSLNPRALKNVDKSKLPVYWRANSKAWVTSSIFRDWFLNCFVP FT EVENYLKIKNIDFKVLLILDNAPGHPKDLNHPNVEIAFLPPNTTSIIQPLD FT QGIISTFKAFYIRQTFQLILDKMDSNPNMTVTELWKNFTILNCIKIVETSL FT KELKQSTLNGSWKKIWPEIVAKNNPVPPLRVEVSRILTLGQRFSGEGFDDM FT NEDDIYEIMNEGTELTETDLIQLTTESPSVSNLAQDDVTSVEDISESIPSF FT TLKCIREGLSLVEKMKSFFTTNDPSLERSSKLMREIDINLAPYYEIEKQLK FT KTPTKN" XX SQ Sequence 2677 BP; 970 A; 427 C; 448 G; 832 T; 0 other; tatcccttgt ataacgcggt cttatacaac gcggtttcga tataacgcga tttggaaaaa 60 aattctgttt cagtacaacg cgaaattttg acttatataa tgcgaagggg aaaatacata 120 attataaaat ctggtaacac taatttgcgt accacatatt tataataatg tatttttaat 180 gtcaactggg aattaccctt ttttgccttc acaacatgtg tgagtatacc aggcatgctc 240 acagcatata tatttgtact cttcttgcac atttatattt attatattat cagttcagtg 300 tgaacttttg tggatatcag gtatgtaatt aggaaaaaac atttatattc taaaattttt 360 ctttttagaa ttacaaattt aatattaaag tatcagcata agttatattt gtgatcgaag 420 ataggtatgt aatataaatt tttgttattg gtaattttct ttattattta aaatatacat 480 atgtaaattt ttgtacccat gaatagacaa atttcaactg ccgaaaaacc aacgctaaag 540 cgaaaattta ttaccttgga tgacaaaatt aagattctgg atagattaag cagtggtgaa 600 aaagcagcat taatagcaaa atctctcagt ttaaatgaat ccacaatacg caccattaaa 660 caaattgaaa ataaaataag aaattctgtt attgctgggt catcaataag cacaaaaaga 720 gttgcacgtg tgcgtgatct tttaatagaa aaaatggaaa aagcgttaat gctgtggata 780 gaagactgtg ctacaaaaaa atgccaatca gtacaaatgc cattaagacg aaagctctta 840 aaatttatgc acatttaaaa gaaagtaatc cagatgcagt cctagtaacc aaagcatcta 900 agcaggaatt cttagctagc aaaggctggt tcgaaaactt taaatctcgc tttggattac 960 acaatattaa agttcaaggt gaaactggat cggcagacgt agaagctgca agggtttacc 1020 ctaaaactct tgctaaaatt attgaagaag gtggttataa gggagaacaa atttttaatg 1080 cggacgaaac aggtctaaat tggaaaaaaa tgccgcatcg gacatacatt tcgaaaaatg 1140 aaaaagttgc accgggtttc aaggccgcaa aagacagaat tactctactt ctctgtagta 1200 atgcatcagg tgactacatt acaaaacctt tatttataaa ccgctcttta aatccacgag 1260 cgttgaaaaa tgtcgataaa tccaagttac ctgtttattg gcgggcaaat agcaaggctt 1320 gggttacaag cagtattttt cgagattggt ttttaaattg ctttgtacct gaggttgaaa 1380 attatttgaa gattaagaac atcgatttca aagtgctttt aatacttgat aatgcccctg 1440 gacatccaaa agacttaaat catcccaatg tagaaatagc gttcctacca cctaatacca 1500 cgtctatcat tcaaccgtta gaccaaggta taatatcaac gttcaaagca ttttatattc 1560 ggcaaacatt ccaacttatt cttgataaaa tggattcaaa tcctaacatg acagttacag 1620 aattatggaa aaattttaca attttaaact gcattaaaat tgtagaaaca tctcttaaag 1680 aattaaagca gtccacgcta aatggctcat ggaaaaagat atggccggag attgttgcaa 1740 aaaataatcc tgtgccgcca ttacgagtgg aagtgagccg tatattaacg ttaggtcaac 1800 gtttcagtgg agaaggcttt gacgatatga acgaagatga catttatgaa attatgaatg 1860 aaggtacaga gctcaccgaa acagacctaa ttcaattaac aactgaatct ccatccgttt 1920 caaatctcgc acaagatgac gtcacttcgg tagaagatat cagtgaatca attcccagtt 1980 tcactctgaa gtgcatacga gagggtttga gtttggttga gaaaatgaaa tctttcttta 2040 caactaatga tccttcactg gagagatcca gcaaactaat gagggaaata gatattaatc 2100 tagcacctta ttacgaaatt gaaaaacagc ttaaaaaaac acccaccaaa aactgataac 2160 agattttgta attataaaac cagtagaaga accacaaagt cccacaacta gtttcttatc 2220 caatagcgaa aactatgttg aagaaatatc ttctgatgaa gctattgctc ccccaaaacg 2280 cccacgcgca aagatctttg aagacagtga aacagattaa ttaccatatg aagtttgaag 2340 aataacaata aagtttttaa taaaccttta atttgtttta gtttgttttt aaatttttta 2400 atgtgcacat aattatttat attttgtttt ttgaattttg aaatatgtac atatatgtat 2460 tattattcat aacttgtgtt tttaattgta taagaaagtc tgaattttag tattgagttg 2520 aaatatatgt gcatctgtta tttagtatgt atttattttg aaagaaactt attggaacgt 2580 aacccccaaa tttatatgaa aactatgtct tagataacgc ggaatcccat aacgcgcact 2640 tattctggaa cgtaactacc gcgttataca agggata 2677 // ID BEL-113_AA-I repbase; DNA; INV; 5543 BP. XX AC supercont1.256; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-113_AA_; KW BEL-113_AA-LTR; BEL-113_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5543 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.256; Positions 868714 863172. XX CC 'TATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 178..5511 FT /product="BEL-113_AA-I_1p" FT /translation="MSTPSKTIAEQKSQQKAKQDNARMEEQLKTLSHQRGA FT VKGKLTRVRSAIEHSEDDPNPNIMNLHFLRLHQKTVEQSYREYNEFQNMIH FT ALPLSEERRAEQEAKYVEFETMYANLAIRLSMLIEAATNQMEKKEVVVAAT FT SSSMTPGSTLSYLPPLQAPLPTFDGSYERWFSFKSMFTTIMNRYTHEDPAI FT KLYHLRNSLVGPVAGIIDQDIVNNNDYNAAWQFLTDRFEDRRLIIDKHIEC FT LFNLPKITKDSSANLRKLLDVSSKNIEALKNLDLPLQGLGEQMVINIISSR FT MDKATRVAWEIRQKPGILPSYKDTMEFLQEQCKVTEKIETNMKVESVKPKA FT VTKAHTLINTSELKYDTKSELKCAVCKNAHELWKCESFKKMNVSEKYNTLK FT KTGCCFNCLQKGHRTNGCTSSHSCRECGKRHHTSLHANEVSSSRSPDSITA FT VNQNAVHEEARVSRKESEVSTQVDPPRSGTGTTLSVSVGSQTKQTLLSTAI FT VNVRASNSAVYPCRVLLDSASTEHFVTERFANLVCMKKEPVDYTVSGLNGT FT NTRIRRMVRITIESCVENFSAELQFLVAPKITGDLPERSFNASEWSIPADV FT KLADPEFNRRGRIDMLVGAEIFWNIVKTGQLKLGSNQPVLTDTKFGWVAGG FT VVSSDAPVIARSFCQTALEDLSELLRSFYKLEACDRIHLPNKVVDDICLEH FT FRDTYQRDQQGRYFVRHPFNDRKNELGDSREMATKRFLSLERRLDRQPDIK FT LQYASFMREYESLGHMRAIAVDENEEPGSAYYIPHHCVVKPSSTSTKLRVV FT FDGSATSSSGVAINDALMSGPNIQNDLFSILLNYRGYRYVFTVDVIKMFRQ FT VGVLPPDTAFQRIVWRYDRNEPLMVYELLTVTYGLASSAFQATMALRQVSE FT DHQHEYPQAARVIKKSTYMDDVIGGAHRIQDACALQKEVSELLQKGCFGTH FT KWCANRPEILQHVEKEFHGTDFELGDTSSNIIKTLGVVWNPREDWFSFSVV FT PGNPEATTKRKILSEVAKIYDLLGLVGPVITAAKLILREVSVLQVDWDDPV FT PQSIIHKWRCFREELTCLNNLRVPRWISTGDTVSVQLHGFSDASDEAYGAC FT IYTRVVQNDGSVTIRLVCSKSRILPKKMNKCKPITTPRAELLGAVLLSRLL FT EKTIAAIDIDFESVILWTDSQIVYSWIRKPPGMLQLYVSNRVSEIQRITGA FT YTWRYVPSHENPADVISRGEFPRKLIDNEMWWSGPPMLKAATIEEVQLEPL FT EENLPEMRTGVSLAVTSSLRRLMMFDRVNDYDKMLRSMAYFVRFARYLTSK FT MQTIISGPLTVPELRTALLVIVRCVQKETFHHEIRIIAEGGQSKHRLCGLK FT PFIDLQDGILRVGGRIKRALVPYDSRHQMLLPAGHPFSMAVVRSMHRSNLH FT IGQKSLLAMVRQRFWPLRVKSTIRKVIASCITCFRANPLKTSQLMGDLPSY FT RVQPAPTFAFTGVDFAGPFMIKSSTAGRRPLVTKAYVSLFVCMLTRAIHVE FT LVSDLTTNAFLAALRRFTSRRGLPCKMFSDNATNFVGAQNEVEELARLFED FT QQQAKKITDFCTTQGIEWTFIPPRSPHFGGIWEAGVKQVKHHLTRIVGGYK FT LSYEELYTTLTQIEAVLNSRPLVPSSDDPCDFTAVTPAHFLIGREMQAISE FT PSYLNLKMSTLSRWQLVQTIFQHFWRRWTAEYLPELQNRSKWTKTINITEG FT ALVLMIDQGAPPFQWPLGRVTELHPGKDGVTRVVTVRTSKGEYKRAITEVC FT LLPLDGQ" XX SQ Sequence 5543 BP; 1535 A; 1271 C; 1397 G; 1340 T; 0 other; ttggtccttc aggccgaata aatagaaaag tagtggaaaa acgtcgacgc gagtgttttt 60 tccgcgaaaa atcgttgatt tcgagcattc accagatttg ctggtcgggt gcagcgacgc 120 gccacgaaaa ttgcaaatgg ccgtcgttcg agaatagtga agtgatcgtg tcgtgtcatg 180 tcaacaccat cgaaaacgat cgccgaacag aaatcccagc aaaaggcgaa acaggataac 240 gccagaatgg aagaacagct gaagactctc tctcatcaga gaggggcagt gaagggcaaa 300 cttacacgtg tgagaagtgc cattgaacat agtgaagatg acccgaaccc gaacattatg 360 aatttgcatt tccttcggtt gcatcagaaa acagtcgaac agtcgtatcg cgaatacaat 420 gagttccaaa acatgattca tgccctgccc ctctctgagg agcgacgagc cgaacaggaa 480 gcgaaatacg tcgaattcga gacgatgtat gcgaatttgg cgatccggtt gagtatgctg 540 atagaagcgg caactaatca gatggagaag aaagaagttg tcgtagcagc aacatcgtcg 600 tcgatgacac ccggatcaac tctttcctac cttccgccat tgcaagcgcc gttgccgacc 660 ttcgacgggt cgtatgaacg atggttttcg tttaaatcca tgtttactac catcatgaat 720 cgatatacac acgaagaccc ggccattaaa ctttaccatc tgcgtaattc attagttggg 780 ccggtagcag gaatcattga ccaggacatt gtaaacaaca atgattacaa tgctgcctgg 840 caattcctga ctgatagatt cgaagacagg aggcttatca ttgacaagca catcgagtgt 900 ttgttcaatt tgccgaagat cactaaggac agttcggcaa atcttcgaaa gttgctcgac 960 gtgagcagca aaaacatcga agccttgaag aatcttgacc tcccgctgca aggcctcggc 1020 gagcaaatgg ttatcaacat catctcatcg cggatggaca aagcaacaag agtcgcgtgg 1080 gagatacggc agaaaccggg aatcctgccc agctacaagg atacgatgga gttcctgcaa 1140 gaacagtgta aagtgactga aaagattgaa acgaacatga aagtggaaag tgtgaaaccg 1200 aaagcggtga caaaagctca tacgctgata aatacaagtg aactgaaata tgatacgaaa 1260 agtgaactaa agtgtgctgt atgcaaaaat gctcatgaac tatggaaatg cgaaagcttt 1320 aagaagatga acgtgagtga aaaatacaac acgctgaaga aaacaggatg ctgctttaac 1380 tgcctgcaaa aaggccaccg cacgaacgga tgcacttcgt cgcatagttg ccgggagtgt 1440 ggcaagcgtc accatacatc acttcatgcc aacgaagttt ctagctccag atcacccgat 1500 tcgattaccg ctgtcaatca gaatgccgtc cacgaagaag caagagtctc tcgcaaggaa 1560 tctgaagtta gcacccaagt cgatcctcct cgatccggca ctggcacaac cctcagcgtt 1620 agtgttggaa gtcaaacgaa gcaaacattg ctctccactg caatcgtgaa cgtgcgcgct 1680 tccaactctg ctgtttaccc ctgccgggtg ctgttggatt ccgcgtcaac agaacacttc 1740 gttactgaac gtttcgctaa cttggtttgc atgaagaaag aacccgtaga ctacacggtc 1800 agtggtctca acgggacgaa cacgaggatc cgtcgcatgg ttcgcattac aatcgaatcc 1860 tgcgtcgaaa atttctccgc tgagctacaa tttcttgtag cgccgaagat tactggtgac 1920 cttccagaga gatcattcaa tgcatcagag tggtctattc cagctgacgt caagctagct 1980 gacccggagt tcaatcggcg aggacgcatc gacatgctcg ttggtgccga aatattctgg 2040 aacatcgtga aaactggcca attgaagctt ggatcgaacc agccagtgct tactgacacc 2100 aagtttggtt gggttgctgg tggtgtagtt tcgtccgatg cgccagtaat tgctcgatca 2160 ttctgccaaa ccgctcttga agacttgagc gaacttcttc gaagcttcta caaactagag 2220 gcctgtgatc ggatccacct tccgaacaag gtagtggatg atatttgctt ggagcatttt 2280 cgtgacactt accagagaga tcagcaagga aggtacttcg tacgccatcc gttcaacgac 2340 aggaaaaacg agcttggtga ttcccgagaa atggcgacga aacgattcct gtcattggag 2400 cgacggctcg ataggcaacc agatatcaaa ctacagtatg ccagctttat gcgagagtac 2460 gagtcccttg ggcatatgag agccattgcg gttgacgaga atgaagagcc aggttccgct 2520 tactacatac cacatcactg tgtagtaaaa ccttccagta cgtcgacaaa attgagggtc 2580 gttttcgacg gctctgcaac gtcctcatca ggagtggcaa tcaacgatgc gctgatgtca 2640 ggaccgaaca tacagaacga tttgttttcg attttgctga actaccgtgg ctacaggtac 2700 gtgttcacag tagacgtcat caaaatgttt cgacaagttg gtgtccttcc cccggacaca 2760 gcgtttcaaa gaatagtttg gcgctacgat cggaacgagc ctctaatggt gtatgagctt 2820 ttgacggtca cctatggtct ggcatcgtct gcattccaag ctaccatggc actccgccag 2880 gtttccgaag atcatcaaca cgaataccct caagcggcaa gggtaataaa aaagagcact 2940 tatatggacg acgttattgg aggtgcgcat agaatccagg atgcgtgcgc tctgcaaaaa 3000 gaagtaagcg aattattgca gaaaggttgc tttggaacgc acaaatggtg tgccaatcga 3060 ccggaaatat tgcaacatgt cgagaaggag ttccatggca ctgatttcga actgggcgat 3120 acaagctcga atatcatcaa aactttgggc gtcgtttgga atcctcgtga agattggttt 3180 tcgttctctg ttgttcctgg caacccggaa gctacaacga aaagaaaaat tctcagcgaa 3240 gttgcgaaaa tatacgattt gcttggtttg gttggtccag taataaccgc agccaaattg 3300 attctacggg aggtcagtgt tctccaagtg gattgggatg accccgtccc acagagtatc 3360 atccacaaat ggcggtgctt tcgtgaagaa ttgacgtgcc tgaacaacct tcgtgtgccc 3420 agatggattt cgacaggtga tacagtatca gtccagttgc atggcttttc cgacgcctca 3480 gacgaagctt atggggcatg tatctatact cgtgtcgtac aaaatgatgg ctctgttact 3540 atccgactag tctgcagcaa atccagaatc cttccgaaga aaatgaacaa gtgcaagccg 3600 atcactactc ctcgcgccga gttgttaggt gctgtccttc tgtcaaggct actcgaaaag 3660 acgattgctg cgatcgatat tgattttgaa tcagtgatcc tgtggaccga ctcgcagatc 3720 gtctacagtt ggattaggaa accacctgga atgctgcaac tgtatgtctc caaccgtgtg 3780 agcgaaatcc aacgaattac tggcgcgtat acttggcgat acgttcccag ccacgagaat 3840 ccagctgacg tgatttcacg tggtgaattt cctcggaagc tgatagacaa cgagatgtgg 3900 tggagcggtc cacccatgct taaggcggca acgattgaag aagtacagct cgaaccgttg 3960 gaagaaaact tgcccgagat gagaactggt gtttcgctgg cagttacttc ttcattacgc 4020 agattgatga tgtttgacag agtgaacgat tatgacaaaa tgcttcgatc aatggcctac 4080 ttcgttcggt ttgccaggta ccttaccagc aagatgcaga cgattatcag cgggccatta 4140 actgttccgg aattaagaac tgcgctgttg gtgatagtgc ggtgcgtgca gaaggagacg 4200 tttcatcatg agatacgaat tattgctgaa ggaggccaat cgaaacaccg tttgtgtggc 4260 ttgaaaccgt tcatcgattt gcaagatggt atcctgcgcg tgggcggtag aatcaaacgt 4320 gctttggtac cctacgatag tcgacatcag atgcttttgc ctgctggaca tccgttttcg 4380 atggcagttg tgagaagtat gcatcggtca aatttgcaca ttggacagaa gagtcttctt 4440 gcgatggttc gtcaaaggtt ttggccgttg agagtgaagt caactatacg caaggttata 4500 gctagctgca tcacctgctt tagagcgaat ccactgaaga catctcagct tatgggagac 4560 ttaccttcat accgagtcca accagcgccg acgtttgcct ttaccggtgt ggattttgcc 4620 ggtcccttta tgattaaatc gtcaacagca ggtcgaaggc cgttagtcac gaaggcctat 4680 gtcagtctgt ttgtctgcat gctgactcgt gcaattcatg tggaacttgt atctgacttg 4740 acaacgaatg ctttcttggc tgcgctgcga cggttcacaa gtcgtcgagg gctgccatgt 4800 aagatgtttt ccgacaatgc cacaaatttt gtcggggcac agaatgaggt tgaagagcta 4860 gcgcgtcttt tcgaagatca acagcaagcg aagaagataa cggatttttg cacgactcaa 4920 ggaatcgagt ggaccttcat accacctcgg agcccccatt tcggtggcat atgggaggcc 4980 ggggttaaac aagtgaaaca ccatctgaca aggattgttg gtggttataa gctgtcgtac 5040 gaggaactct acactacgtt gacccaaatt gaagcggtat taaactcccg accattggta 5100 ccgtcctctg atgatccctg tgactttact gcggttacac ctgcacactt cctgataggg 5160 cgggagatgc aggctatcag tgagccatcc tacttgaatc tgaagatgag cacactttcg 5220 cgatggcaat tagttcagac catcttccaa cacttctggc gaagatggac agccgagtac 5280 ttacctgaat tacagaacag gtcaaagtgg acgaagacca ttaacatcac tgaaggagct 5340 ttggtgctga tgatcgacca aggtgcgcct cctttccagt ggccgctcgg ccgggtaacc 5400 gaattacatc cggggaagga cggagtaaca cgtgtcgtca ccgtaaggac ttcgaagggt 5460 gaatataagc gggctataac ggaggtatgt ctgttgcctc tagatggcca gtaacattga 5520 aacaggtttc aaggccggga gga 5543 // ID LT1_DPu-LTR repbase; DNA; INV; 283 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE Non-autonomous LTR retrotransposon from Daphnia (LTR portion). XX KW LTR Retrotransposon; Transposable Element; nonautonomous; KW LT1_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-283 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC >94% identical to consensus. XX SQ Sequence 283 BP; 91 A; 55 C; 54 G; 83 T; 0 other; tgcgagcccg aagggcgaag ctaataatcg catacgcaga aaaaaaaaag ccatgtcact 60 ttgtctgtat gtatgtatgt atgtatgtcc cgctccatag ctcgggcgtt tcacgacaga 120 tttcggactt tttggtctta aaaattacac aaaaaatatg cggtgcgacc agaacaaaaa 180 taatttcatt tacttaatta gttctcccgt aattaatgaa aaactgctaa aataattgtt 240 taacgactgg tgacatctgg cggttgcgac tacaactttt tca 283 // ID Gypsy-244_AA-I repbase; DNA; INV; 4788 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-244_AA_; KW Gypsy-244_AA-LTR; Gypsy-244_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4788 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1089-1089 (2011). XX DR [1] (Consensus) XX CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 620..4756 FT /product="Gypsy-244_AA-I_1p" FT /translation="MKFVEHIFKCSYLQLKENNGNATKLNLIHFTFFSKLI FT EMDKWDLAPFSYKLMPQNEIRDEWQRYKRNFEYLALANGMTNKTRLKNIFL FT ARAGPDVQDVFKTLPGADVEERMGIDPFKIMLDKLDEYFAPKQHEAYQRFL FT FWSMTRNDKDEPLDKFILRATDAAGKCNFGQSKKEAQEISVIDKIIQLAPP FT DLREKLLQRENLVLDEVIKLVNSHEAIKFQANQMTSSNACQPSTSGGVNKI FT RNTTKSVFQGVPDGCSRCGRKGHYGNDKVCPARTKTCDRCGRLGHFAIRCH FT SSKPIQKNKREGKDDLYRKRFRFQPQQIRAIEDDVDEKGITSSFIFAVGEA FT DEYLWIAIGGVLVQILIDSGSQKNILDDVTWEKMKAQRVNVKNLRSNSDQT FT FKAYGRSAKPLVVRHVFESTIEILGGNQRICKDATFYVIEGGSQPLLGKVT FT ALELGVLSLGLPSTQSDEVYRLHTERKRPFPKMKGIKLNIPIDTTVTPVIQ FT HARRPPLALMDKIEEKLDYLLASDIIEPVYDFSQWASPLVTIIKENGDLRL FT CVDMRRANLAIKREAHLMPTFEDFLPRLKNAKMFSRLDIKDAFHQVELNEP FT SRYITTFITHRGMYRYKRLMFGISCAPEMFQKVLEQILSDCENVISFIDDV FT LVFGNTPEEHDRALQKVLDTLESKNILLNHAKCVFRVTELDFLGHHLSGKG FT VRPTEEKIAALKSFRAPQTAEELRSFLGLVTYVGKFLPDLATVCASLRELT FT HKGTTFKWLPYHEKAFQKLKEMIADIKMLRFFDNSLRTRVIADASPVGLGA FT VLVQFSDTSNDLDPRVISYASKSLSPAEKRYCQTEKEALALVWAVERFSIY FT LIGRKFELETDHKPLEAIFSTTSRPCARIERWVLRLQSFNFEVRYRKGVNN FT IADPFSRLATLSSEDAFDGDNQFLVLAILESAAIDTVELENASLEDEELSI FT VKSCLQRGTWDSDKVKSYEMIQSELGVVGDMLVRGNKLVVPKVLRKRMLTL FT AHEGHPGETVMKRRLRDRVWWPGMDREVTAHVSACQGCRLVSLPNKPEPMH FT RRELPVRPWLDVALDFLGPLPSGEYLLVIIDYYSRYKEVEIMKHITAEETI FT ARLRKIFTRLGLPVTITLDNARQFVSASFEDYCKQNGIMLNYSTPYWPQEN FT GLVERQNRSLLKRLQISHSMGRDWKSDLQDYLLMYYTSPHSITGKTPTELC FT YGRTIRSKIPSLIDFETVPSRAEITDRDKILKERSKESENAKRQAKLSDLE FT VGDTVLMKNLTPGNKLMPNFNPSECVVVNKEGSRVTVRNKESSKVYQRNAA FT HLKKVPSCTSVEEEVEDPPSCVDAEQTPTIQSETRNQMSSYNFSRVRRQTK FT QPSYLKDFVIHEVSSE" XX SQ Sequence 4788 BP; 1507 A; 894 C; 1109 G; 1278 T; 0 other; gtggcgacga ggatcggcgt tggaaacggt aagaagggtg gataattgtt atttatttac 60 agttgaagca tctggaagat tgggccgaga aatttgaaga ggttatggaa acggcaattg 120 atagcagtcc attgaaatga gtaaataatt ggaatggcgt attaaaatca tgaataaccg 180 attcggttat ggtcccggag accgttttca taaaaaagtg gccggtgtac tacacggcat 240 tgctcccgga gagcgtatgt aagaaaagtg gatgatgcat tgcatagtac tgctcccgga 300 gagcgttcca taagttttag tgaatagtgc gttgcacagt atagctccat gggagcgcga 360 aaaatgaacg gctggtgcat tgcacagcaa ggctcccgga gagcgtttca tataaacaac 420 cgaaagtggt acagggtaca gtgttgctcc tagagaacgg ttcagaaaat gttgaattgt 480 aaaacaattc taaatcgtgt acaccgttgt agtaatcaca gaactacatt gtcattgcag 540 gacatcaaat cgacagaatc gacgaaatcg agattgtttg aggagcgcct cgaagcatta 600 caggtaaaca aatgaacaga tgaaatttgt agagcatata ttcaaatgtt catatttgca 660 attaaaggaa aacaatggaa atgcaactaa attgaatcta atccatttta cgttttttag 720 caagttaatc gaaatggata agtgggacct ggcccctttc agttataaac tgatgcctca 780 aaacgaaatt cgagatgaat ggcaacgata caagcggaat ttcgaatacc tagcattggc 840 aaatggaatg acgaataaaa cacgtcttaa gaacatattc cttgcccgtg caggccccga 900 cgtccaagat gtgttcaaaa cacttcctgg tgctgatgtg gaagagcgga tgggcattga 960 tccgttcaag atcatgctgg acaaattgga cgaatatttt gctccgaaac aacacgaagc 1020 ataccagcga tttttgttct ggtcgatgac ccgcaatgat aaagatgagc cgttggataa 1080 gttcattttg cgagccacag atgccgcagg caagtgcaat ttcggacaga gtaagaaaga 1140 agcgcaagaa ataagcgtta ttgataagat cattcagttg gcacctcctg atctacggga 1200 gaaacttttg caaagggaaa atctggtttt ggacgaggtt atcaaattag ttaactccca 1260 tgaagccatc aagttccaag caaatcagat gacgtcttcg aatgcttgcc aaccctctac 1320 gtcaggtgga gtcaacaaaa tccggaatac cacaaaaagc gtctttcaag gagttcctga 1380 tggatgttcg cgctgcggcc gaaaaggaca ctatggaaat gacaaggtgt gcccggccag 1440 aacaaagaca tgtgaccgtt gtggaagact aggacatttt gctattcggt gccactcctc 1500 caaaccgatt cagaagaaca agcgagaagg aaaggatgat ctttatcgga aacgtttccg 1560 tttccaacca cagcagatca gagcgattga agatgatgta gatgagaaag gtattacttc 1620 aagttttatt tttgctgttg gtgaagcaga tgagtatttg tggattgcta ttggtggggt 1680 acttgttcag atcttgatcg attcaggaag tcagaaaaac attctcgacg atgtgacctg 1740 ggaaaaaatg aaggctcaaa gagtgaatgt taagaactta cggtcaaact ctgaccaaac 1800 tttcaaagca tatgggagga gtgcaaagcc tctagtggtt cgacacgtgt ttgaaagtac 1860 cattgaaatt ctcggaggaa accagcgtat ttgtaaagat gcaacctttt acgtgataga 1920 aggaggatcg caaccactcc tggggaaagt aacagctttg gaactaggcg ttttgtcatt 1980 gggtttgcct agtacccaat ctgatgaagt ttaccgtttg catactgaac ggaaaagacc 2040 ttttcctaaa atgaaaggta taaaactgaa tattccaatt gataccactg tcacacctgt 2100 aatacaacac gctcgacgtc cacctctagc tctgatggac aaaattgagg agaagttgga 2160 ttatttatta gcgtcagata taattgaacc agtctacgat tttagtcagt gggcctctcc 2220 tttggttaca attattaagg aaaacggtga cctgcgactt tgtgtagaca tgcggcgagc 2280 gaatttggct attaaacgag aagcacattt gatgccgact ttcgaggatt tccttcctcg 2340 actgaaaaat gctaaaatgt tcagtaggct ggacatcaaa gatgcatttc atcaagtcga 2400 attaaacgaa ccttctcgtt acatcacgac tttcataacg cacagaggaa tgtatcgtta 2460 taaacgactg atgtttggca tttcgtgtgc tccagaaatg ttccagaagg tcctggagca 2520 aatcctctca gattgtgaaa acgttattag ttttatcgat gacgtattgg ttttcggtaa 2580 tactccagaa gagcatgatc gagctttaca gaaggtgttg gacactctcg aaagcaaaaa 2640 cattctacta aatcatgcca agtgtgtttt ccgcgtcacc gaactagatt ttctaggaca 2700 ccatctatca ggaaaaggag tgagacctac tgaagaaaaa atagctgctt taaaatcttt 2760 tcgggcgcct caaacggcag aagaattgag gagttttctg gggctcgtta cgtacgttgg 2820 taaatttctt ccggatttag caactgtatg tgcatcttta cgagagctga cccacaaagg 2880 aacgacgttc aagtggctgc catatcatga gaaggcgttc caaaagctaa aagaaatgat 2940 tgcagatata aaaatgttaa gatttttcga caattcacta cgtaccagag tgattgcaga 3000 tgcatcgccg gtaggattag gtgctgtact ggtacagttt agtgatactt caaacgatct 3060 ggatccacgt gtcatttctt atgctagtaa aagtttgagc cccgcagaaa aacgctattg 3120 tcaaacagag aaggaagcgc tggcgttagt ctgggcagta gaaagatttt ctatttatct 3180 cataggtagg aaatttgaac tagagacgga ccataagcct ctcgaagcca tattttcaac 3240 aacctctaga ccgtgtgctc gtattgaacg gtgggtcctc cgtttacaat ccttcaactt 3300 tgaagtgcga tatcggaaag gtgtcaataa tatagctgat cctttctctc gtctagcaac 3360 gttaagttcg gaagatgcat ttgatggaga taatcaattt ttagtattag ctattttgga 3420 atcagcagca attgatacag tggaactaga aaatgcatcg ttagaagatg aagagctttc 3480 tatagttaaa agttgtttgc aacgtggaac atgggacagt gacaaagtca aatcatacga 3540 aatgattcag agcgagcttg gagttgtagg agatatgtta gttagaggaa ataaactagt 3600 tgttcctaaa gttttgagaa aaagaatgct aactttagct catgagggcc atcccggtga 3660 aactgttatg aaacgtcggc ttcgagatcg cgtttggtgg ccaggtatgg accgagaggt 3720 tacggcacac gtttcagcat gtcaaggttg tcgtctagta tctcttccca acaagcctga 3780 gccaatgcat cgtagagaac ttccggtgag accttggctt gatgttgctc tggattttct 3840 tggaccttta ccttcggggg aatacctgtt agttataatt gactactata gtcgatataa 3900 ggaggtcgag ataatgaaac acataaccgc tgaagaaaca attgctcgtc tacgtaaaat 3960 ttttacgaga ttgggtttgc cagtgacaat tactttggac aatgccagac aatttgtgag 4020 tgcatcattt gaggactatt gcaaacagaa cggaatcatg ctgaactatt ccactcctta 4080 ttggccccaa gaaaatggtc ttgtggaacg tcaaaatcgc tcgttattga agcgtcttca 4140 aataagccac tcaatgggtc gtgattggaa atcagatcta caagactatc ttctaatgta 4200 ctacacgtct cctcattcca taaccgggaa aacccccacg gagctttgct atgggcgaac 4260 cattcgatct aaaattcctt ccttaataga ttttgaaact gtgccatcta gagctgagat 4320 aacagataga gataaaatct tgaaagaaag gagtaaggag agtgaaaatg caaaacgaca 4380 agccaagcta tccgatcttg aagttggcga taccgtgtta atgaaaaatt taactcctgg 4440 aaataagctt atgccgaact tcaacccaag tgagtgtgtg gttgtgaaca aagaaggatc 4500 gagagtaact gtgaggaata aggaatcgag taaagtttat caacgtaatg cagcacatct 4560 aaagaaagtt cccagttgta catctgtcga agaagaggtc gaagaccctc cttcgtgtgt 4620 cgatgctgaa cagacaccca ccatacaatc tgaaacaaga aaccaaatgt catcttataa 4680 tttttctaga gtacgacgtc aaaccaaaca accctcttat ttgaaggatt ttgttatcca 4740 cgaagtgtct tcagagtgaa ttataattct atgagaaaaa aggagatg 4788 // ID Mariner-33_SM repbase; DNA; INV; 2140 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-33_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2140 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1882-1882 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 279..1919 FT /product="Mariner-33_SM_1p" FT /translation="MNSKRKYSRKALTLEIKLDILKRFDRGHKAVDIAKAL FT TLPVTTVKSIRNRDASKIKESAKNATEMLSQQCTRNRSAIMVKMESLLTLW FT IQEQNYRRMPLSKIIIKAKAKSIYDSLRLQEPSTSEDTSFEASNGWFVRFK FT IRANLHNLSLKGEAASGDTVAAAVFPERLKNIIRDGEYCAKQIFNVDETGL FT FWKRLPTKTYISKMEKNAQGFKAGKERLTLLLGGNAAGDYKFKPFLIYASE FT NPRALKGTQKRNLPVYYRWNKKAWMTANLFRDWVSSCAIPELRSYCDKEKL FT EFKILIVLDNATSHPNDLDELSENIKFVYLPPNTTALIQPMDQGAISNFKS FT YYLRRTFKELFAATEGIDNITMKIFWKNFTILDGIKFIALSWDEVTTKSMN FT GCWRKLWPECVENQLENDTNDTINVIQEVAGLAATLQVEGMGVEDIEELLN FT SYNEEMTNSELMEIEFHPAYSSNSDQEIEEIVPEQTLTAKRLAQAIQLIDQ FT ALQIFDQDDPNQERSSKVSRAVISSINCYKEIYEEKNNIKYKPLCKNFLKK FT " XX SQ Sequence 2140 BP; 771 A; 317 C; 380 G; 672 T; 0 other; caccagggcc ccgctatgcg tcttttcgct cttgcgtcgg ttctttaaaa acaaaatgcc 60 cgctttgcgt cgatttctcg ttgtgcgtcg ttatttgttt tttttatttt gtcttcaaaa 120 tcatattaaa aaaacggtat attctagatg ttttctttca ttttctgtga ataatcttat 180 gagagtaaac ttaaattaaa agatattgat cgattttaaa tttatttatt tattttcaat 240 taagaaatta aaatttttgt agtaattgta ccgttaatat gaacagtaaa cgtaaatatt 300 caagaaaggc tttaacacta gaaataaaac ttgacatact taagcgcttt gatcgtggac 360 ataaggcagt agatattgca aaggctctta ctttaccagt aacaaccgtt aaaagtattc 420 ggaaccggga tgcttcaaaa ataaaagaaa gtgctaaaaa tgctacagaa atgttatctc 480 aacaatgtac acgtaataga tccgccataa tggtcaaaat ggaatcactt ctaacattgt 540 ggatacaaga acaaaattac cgtaggatgc ctctaagtaa aataatcatc aaagcaaaag 600 caaaatctat ctatgacagt ctaaggttac aagaaccaag cacaagtgag gatacgagtt 660 ttgaagctag caatggttgg tttgttcgat tcaaaatcag agcaaatttg cataatttaa 720 gccttaaagg ggaagctgca agtggagata ctgttgcagc agcagttttt ccagaaagat 780 taaaaaatat aattcgagac ggagaatact gcgcgaaaca aatttttaat gtagatgaaa 840 caggcttatt ttggaaacgc ctgcctacca aaacatacat ttcaaaaatg gaaaaaaatg 900 cacagggatt taaagctggt aaagagcgac taacacttct attaggtgga aatgctgctg 960 gtgattataa atttaaacca tttttaattt atgcttctga aaatcccaga gctttaaaag 1020 gtacacaaaa gagaaatctt cctgtatatt atcggtggaa taaaaaagcc tggatgactg 1080 ctaatttgtt tcgtgattgg gtgtcctctt gtgccattcc agagttgaga tcatattgcg 1140 ataaagaaaa gttggagttt aaaattctaa ttgtattgga taatgcaaca agtcacccaa 1200 atgatttaga tgaactttca gagaatatta aatttgttta tttaccacca aacacaactg 1260 cattaataca acctatggat caaggagcta tatctaattt taaatcgtat tatcttcgcc 1320 gaacttttaa ggaactcttc gcagccactg aaggaattga taatattact atgaaaatct 1380 tttggaaaaa tttcactata ttagatggaa ttaaatttat tgcattatct tgggatgaag 1440 taaccactaa aagcatgaac ggatgttggc gtaaattgtg gccagaatgt gtagagaacc 1500 aattggaaaa tgacacgaat gatacaataa atgtaattca agaagtagct ggcttagcgg 1560 ccacattgca agttgaagga atgggtgtcg aagatataga agaattattg aactcataca 1620 atgaagagat gacaaattcc gaattaatgg aaattgaatt tcatccagca tacagtagta 1680 attctgacca agaaattgaa gaaattgttc cagaacaaac tcttacagcg aaacgtttag 1740 cgcaagctat acaattaatt gatcaggctt tgcagatttt tgatcaagat gatcctaatc 1800 aggaacgaag ttctaaggta agtagagcag taataagtag cataaattgt tataaagaaa 1860 tttatgaaga aaaaaacaac ataaagtaca aaccactttg caaaaatttt ttaaaaaagt 1920 aaataagatc ttataataat ttttttatac aatttgtatg ttttctgtaa tatataaatt 1980 gaattttttg tgtatttctt cttttatttt gaattttatt tagataaata tatcgtatat 2040 tttctcttgc aaaatattat aatttgttcg ctatgcgtcg aaatccactt tgcgtctcta 2100 cttgtggaac cgaatagcga cgcaaagcgg ggccctggtg 2140 // ID BEL-1_BMa-I repbase; DNA; INV; 5603 BP. XX AC AAQA01001578; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Brugia malayi genome: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_BMa_; KW BEL-1_BMa-LTR; BEL-1_BMa-I. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-5603 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Brugia malayi genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AAQA01001578; Positions 6799 1197. XX CC Positions [4286-4846] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(782..2014,2018..3481) FT /product="BEL-1_BMa-I_1p" FT /translation="MERVLRQLEAIGENVEQPTIETIIESKLPNWILNQVY FT QQKEEDSQWSVTKLRQLLRKTINRNDQVMEWQYLDNVSRKNLIKRHPNASI FT PENSALIAVKQSKQTRGTASAERNQLNQENPNWSTNKKRRPCIFCNNNHWD FT SKCHIYSTVEQRIDRLKEINVCSNCFKSGHNELNCIKRASCFYCKKSHNSA FT LCTTKTRDSGKIVAKNANISVNSTKKVEKKRVLLLCREINVFNPDKPEHQV FT QALVLFDVGADATFISQRLAHRLNLLETDEEEYKISSFGNRTPRVCRTTQT FT QIGVKTENSDQVTIQATIMDYLTNDLQVMEVVDKGENYHLKSYKKQPDIVI FT GADYFFNFIHMENAQQLKSGFTLLNSKVGPIIAGSGYTNELCHHQIFATKT FT SQENRAPDIDQFWKLDCGIQDRSDTQDDEQALEQFKKSITKQNGRYEVRWP FT WKPCKGKLSDNYGLCVSRLRMLIARLQSHKEELQEYDKAIQDQLQSGIIEE FT VQSQMNQDGIIHYLPHHDVRNPGKTTTKLRIVYDALAHIKGMKSLNEVLYR FT GSINLPDLVGVLLRFRMMKGVITADIEKAFLQIQLHPSERNCTRFLWLKEV FT EGSVSNENIKCYRFRRIPFGVISSPFLLSATLNYHLENHVSELAAEIKKNL FT YVDNVIVSSNGTQDALEKYAEMKSIFNEASMNLREFLSNDEEFYAELPQQD FT RAIRKNTKILGISWNPCQDVIQIKLNPWTDRELTKRTILQFVASQYDPLGF FT LVPIMVRFKIFLQNLWKKNNSWDQILDEQDHKQWKFLIAEWATVVKDLPRF FT VTTSTDLIGIHVFTDASSVAYSAAVYLVSQDMKETKSSLIFAKSRIAPIKG FT MTIPRLELMAILIGTRAAQFVMTQLDIVNTRIILWSDSKCALHWIK" FT CDS join(4286..4435,4439..5440) FT /product="BEL-1_BMa-I_3p" FT /translation="MASLPETRVNRSRAFARVGVDYFGPLSIKSNSVSTKR FT WVILFTCFTTRAMLELAEDQSAEIFLHAIRRFVARRGCPELILSDNATQFQ FT AFRSLMTQVKVSNFLAKGGMTWKNIIPKAPWQGGIYERLIGLTKNALRRAI FT GRKFLAERELVTLIAEVEGILNTRPLTYANFDDCVIIRPIDFILPNASLHL FT PINEVYSRQEEFTPYRLDTREKLVRHWESTLKTLNVFWEIWRTEYLTSLRE FT RTQREITSSKGAQKRTPRKGEIVLLNEAGIPRGMWKLLRIKDIKIDKDGKV FT RNVQVETPTGKLLDRPINVLYPLEVNDEEIHLKPNKKENMKIPETEQNTEE FT LQEPIAMRTRSNTKRQSRLKENQTTTTAPSSETCRDLHDQR" XX SQ Sequence 5603 BP; 2050 A; 1063 C; 1074 G; 1404 T; 12 other; aagaataatt ttggtgcccg aggtgaggta aggtatgtcg gccagtataa ttgttcaagc 60 tacaccagta aaagcgaatc tcgagggact cctcgatgaa atacaacaaa tggatctaac 120 tccgttagac caaaaagcga cagtagaagt attgtgccag caatacgaag caagggcaag 180 gatcatcaaa gaaaagttga tgcgcctcga aaaatacgtt ggcactcttg agaagatcaa 240 cgacaaatgg ttggaacaca ttcaactagc cccaatgtcg caaaagaaaa aagaagaaga 300 aaaatacgaa caaatggcaa acgacgatag aggtatttta aaattaatta acataggtac 360 ggataccatt ataaccttat ctatgtacaa ggatgataya gagttagccc ttaaacgtct 420 agcacaaatt aaagaaccta gcttaactga atgtcgtcca gtagtaaatt taccacaatt 480 gtcgttacca acrtttagtg gagaccctaa aacatggaga gaattctgga gtagtttcga 540 agcctccgta cacwctcaaa acataccaga tatccaaaaa ttaaattact tggtttcttg 600 cttaagagga aacgccctac agctagtaag aggttacgat agggcacctg aaaactatag 660 aattataaga gaattattag tggagaaatt tggtcgtgtt tccactataa gaaaattact 720 ttacaatgaa ctcatatcta caaaacgaaa caaccgagac tggaaaacaa taatcgaaga 780 aatggagaga gttctaaggc agttagaagc aattggcgaa aacgtagagc aacctaccat 840 cgaaacgata atagaatcca aattgccaaa ctggatatta aaccaggttt atcaacaaaa 900 agaagaagat agccagtggt ccgtaacaaa acttcgacag cttctcagga agacaattaa 960 tagaaacgac caggtaatgg aatggcaata cttggataac gtcagtcgta aaaatttaat 1020 taaacgtcat cctaacgcaa gcattccaga aaattcagca ttgattgcag taaaacaatc 1080 aaagcaaaca agaggaacgg cttcagccga aaggaaccag ctaaatcaag agaatccaaa 1140 ttggtcgaca aataagaaaa ggagaccctg tatcttctgc aataacaatc actgggatag 1200 caagtgtcac atatattcta cagtggagca gcgaattgat cgattgaagg aaatcaatgt 1260 ttgctctaat tgttttaaat ctggacacaa cgaactcaac tgcataaaaa gagcaagttg 1320 tttttattgc aaaaaatccc acaatagcgc cctttgtacc actaaractc gtgattctgg 1380 gaagattgtt gccaaaaatg caaacatatc tgttaattca ackaagaaag ttgaaaagaa 1440 acgagtactg cttttatgta gagaaattaa tgtctttaat cctgataaac ctgaacatca 1500 agtacaagct ttagtactgt ttgacgtcgg cgcagatgcg acctttatct cacaaaggct 1560 agctcatcga ctaaatctcc tagaaactga tgaagaagaa tataagatct cytcctttgg 1620 caataggact cctcgagtat gccgcacaac tcaaacacaa attggagtga aaacggaaaa 1680 ctccgatcaa gtgactatcc aagccactat aatggactat ttgacaaacg atcttcaagt 1740 gatggaagtg gtagacaaag gtgagaatta ccatcttaaa agttacaaga aacaacctga 1800 catagttata ggagctgact atttcttcaa ctttattcat atggaaaacg ctcagcaact 1860 aaagtcagga ttcactttac tgaacagcaa agttggacct atcatagcag gaagcggtta 1920 tacaaatgaa ttgtgccatc accagatctt tgcaacgaag acatctcaag aaaaccgtgc 1980 cccagatata gatcaatttt ggaagttrga ctgtaytggc atccaagatc gatctgatac 2040 acaagatgat gaacaagcac tggaacaatt taagaagagt atcactaaac aaaatggaag 2100 atacgaagta cgttggccat ggaaaccatg caaaggcaaa ttgagcgata attatggatt 2160 atgcgtcagc cgtttgcgaa tgctaatagc acgacttcag tcccataaag aggaattaca 2220 agaatatgat aaagcaatac aagatcaact acaatctggc ataatcgaag aagtacaatc 2280 acaaatgaat caagatggaa ttattcatta cctgccacat catgatgttc gtaatccagg 2340 taaaactacc actaagttaa ggattgttta tgatgcttta gcgcacataa aaggaatgaa 2400 aagtttaaat gaagtattat atcgtggatc aattaacctc ccagatcttg tgggagtatt 2460 attacgcttt agaatgatga aaggggtaat cacagctgac attgaaaagg cattcttaca 2520 aatacagtta cacccttcag agagaaattg tactcgcttc ctttggctaa aggaagtaga 2580 agggagtgta tccaatgaga atataaaatg ttaccgtttt agaagaatcc cgtttggagt 2640 aatatcatcc ccctttttac tatctgctac cttgaattat catttggaaa accacgttag 2700 tgaattagcc gcagaaataa agaaaaatct ttatgtggac aacgttattg tgtcatcaaa 2760 cggaacgcag gacgcattag aaaaatatgc agaaatgaaa tccatcttca atgaagcctc 2820 aatgaatcta agggagttct tatctaacga cgaagaattt tatgcagaac tgccgcagca 2880 agatcgagca ataagaaaaa atacgaaaat cctcggcatc tcatggaatc cctgtcaaga 2940 cgtcattcag ataaaattaa atccatggac tgaccgagaa ctaacaaaaa gaacaatctt 3000 acagtttgtt gcatcacaat atgacccact aggattttta gttccaataa tggtaagatt 3060 taaaatattt ttgcaaaatt tatggaaaaa gaataattcg tgggatcaga tattagatga 3120 gcaagaccat aagcaatgga aatttcttat tgctgaatgg gcaacggtcg taaaagactt 3180 accgcgattt gtaacaactt ctacagattt aattggaata catgtattta ctgatgcttc 3240 aagcgtagca tattcagcag cagtgtattt agtgagtcag gatatgaaag aaacaaagtc 3300 atcgctcatt tttgcaaaat ctcgcattgc tcctattaaa ggtatgacaa taccgcgtct 3360 agagctaatg gccatactaa tcggaacgcg agcagcgcaa tttgtcatga cacagctgga 3420 tatcgtcaat acaaggataa ttttatggtc ggattctaaa tgtgctttac attggataaa 3480 aamccwttct aacctattac caagatttgt tcaaaatcga gtcgaggaaa tacgtaaagc 3540 gaaatttatt tttcgttata ttccatcagc agataaccct gtagacgtgg caactagagg 3600 tcttaatcca aaacaactta gaagttttac accatggtgg catggaccgt cgtggctagt 3660 aaaaggagaa atcagttggc cacaatggga atacgaattt gccaacaatg acgaaccaga 3720 agaaatcact atatcagaag tctcgcgaat attcaaggat cataactttc aatttattga 3780 tagcaaacgt tttagcaaat ggctaagact gttacgaaca acagcgtgga tccttaaatt 3840 cattcgatta acaactaaag gcagattatc atggctacag tctgtttcaa tcgaaagaaa 3900 tcgatttaca gctgaagatt acaaagtgtc agaatggctg ttaattagac aagcacaatc 3960 tgaaggaatc aatgacgacg aaataaacaa gtggaattta ttccacaccg aagatgatag 4020 attatggaga tctactagtc gactcgtgaa ctcggaactg cctgaaacta gcaaatatcc 4080 tatctatctc cctcgacaca acccaataac agaactcctt atactacacc aacatgaaaa 4140 tctatgtcat tcaggtatcg cccatacact ttcagaatta aggagtagat tttggtttcc 4200 caaaggcagg actgaagtaa agcgaataat aaacagatgt cgaatctgta aacgttggaa 4260 ctccagatcg tttaaattac caccaatggc aagtctcccg gaaactcgtg tcaatcgatc 4320 tagagccttc gcacgtgttg gcgtggatta ttttggccct ctttcaatca agagcaatag 4380 cgtgtcaaca aaaaggtggg taatattatt tacatgcttt acaacaagag ccatgyatct 4440 cgaattagca gaagaccaat ccgcagaaat ctttttgcat gctatccgaa gatttgttgc 4500 gagacgaggg tgtcctgagc taattctgag tgayaatgcc actcaattcc aggctttcag 4560 atctcttatg acacaagtca aagtctccaa cttcttagca aagggaggaa tgacttggaa 4620 aaatattata ccaaaggctc cgtggcaagg aggaatctac gagagattaa ttgggttaac 4680 gaaaaatgcg ctaagaagag ccattggcag aaaattccta gcggaaaggg aattggtaac 4740 attaattgca gaagttgaag gtatacttaa tacccgacca ctaacttacg ccaattttga 4800 cgattgcgtc attatacgtc caattgattt tattttgcct aacgcttcgc tccatctacc 4860 tataaatgaa gtttatagca ggcaggaaga attcacccct tatagactag ataccagaga 4920 aaaactcgtg agacactggg aaagtaccct taaaactctc aatgtttttt gggaaatctg 4980 gagaaccgaa tatttaacta gcctcagaga acgcacgcaa agggaaatta cctcatccaa 5040 aggagcacaa aaaagaactc caagaaaggg agagattgta ctcttaaatg aagcgggaat 5100 tccaagagga atgtggaaat tattacggat caaggacatc aaaatcgata aagacggtaa 5160 agttaggaac gttcaggtgg aaactcccac tgggaaactt ctcgatcgtc caataaatgt 5220 cttgtatcct ctcgaagtta atgatgaaga aattcatctt aaacctaata agaaagaaaa 5280 tatgaaaatt ccggaaaccg aacagaacac tgaggagctc caggaaccta ttgcgatgcg 5340 aactaggagc aatacaaaac gacaatctcg attaaaggag aaccaaacaa ctaccaccgc 5400 accatcatca gaaacctgta gagatttaca cgatcaaaga tagatcaatg gaaataaaca 5460 agccccactt tcttatgaac cccacgcatt acctaccttt tctggtcggg agtgtcgtga 5520 aagacgaaat atttaaatta ttatatccaa aactaacaaa ttcctcttat ctaattcctt 5580 ttcctcgcac aacaaaaacc ctt 5603 // ID Copia2-LTR_Dmoj repbase; DNA; INV; 351 BP. XX AC scaffold_2198; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2_Dmoj; KW Copia2-I_Dmoj; Copia2-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-351 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1024-1024 (2009). XX DR Genome; scaffold_2198; Positions 676 326. XX SQ Sequence 351 BP; 112 A; 57 C; 78 G; 104 T; 0 other; tgagcatttg agctaacggt aaacacagtc gtcggctgat cgcacatttg tctctagaat 60 aagaaaccga atgttctgga ttatgaaata gacgcaaaca tagaagcata aggagacatc 120 cagaacattc gatatgcagt agaagtaggt atataagcgt gctgtaagtc gagccaaagt 180 tcattcagta agttcattgt gtaagatcaa tgtcagtgtg tcggctcggt tagttaagtt 240 caaattgtcg cggtacgtga agaagtgtac ttgttactga atattgtgtt aaataaactt 300 tttgtgttgc gcctacatat accgtgtttc atttaagaaa aaccttcaac a 351 // ID I-49_AAe repbase; DNA; INV; 6747 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-49_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6747 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1320-1320 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 419..1831 FT /product="I-49_AAe_1p" FT /translation="MNGKTPPTVPPDIDPGASTSGFNLPNKQNRNKRTLPI FT WMDRNDQYGDAQFLLLRGANDTPLARKPLVIEKSVELAAGGPIAGAKSESR FT CTKYTLEVRNKSQVENLLSLKRLIDDTPVEVVYHPTRNVCRCVVTCWDFAE FT TSTDDIIRRLQDQNVTDVRRITQRIGKNVVNTGTMILTIRGPAPPSYVRLG FT LLQVPTRPYYPSPLLCYKCLSYGHTKTACKETEKCQNCSTIHDEMENCNRT FT PYCKNCGENEKPTSRTCAVYRREQEVIKIKVDEGLTYAAAREAHRSRITSS FT KGSFASIVQQRLINVQAAGDETNELKQQLKLQEEQNRALIVENEKLRNEIT FT NLRKSVNELAETTKQLQLQLKQQPQCAAPSDIVASVPTVQLPGISKMQTRQ FT QHRNTSESRKSAQYSSSEAKQQIKRHVSSPPKKPAKLRKSNRKKKHVEISS FT NPASADETASEEGFSEARPDEEMSQN" FT CDS 1889..6658 FT /product="I-49_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNPKTNQKNGSQKLLYKNGKHVSNPHEPASSTSYVSN FT NITNHSAIPRVCTTPGLFPAPTPTREVPEFSDEPLAAVGVGGLYSQASTHS FT ILNYEQTTDALNILVIQSPSTSYTSYNSNIFVDNSAIPRVYTTPGLFPVPT FT LTGEVPASPDEPLAAVGTGGLFSQASTHTISNHEQTTDASHILVSRPSPKR FT PSSNNSNNCITYSAIPRVCTTPGLCPVPTPTGEVPASPDEPLAAVGTGGLC FT SQASTHTIPNHEQTTDASHISVSRPVSKRPTPNNSSNPVTHSAIPRVYTTP FT GLFPVPTPTREVPDITEEPLAAVGTGGLYLQASTNAISSYGQTSDPMHIPD FT HSKNHSHLLRTTTSQSTSQNTVHVESTRATRNTCSPAEKCVAIQWNVNGLQ FT ERYAELQLLVTEYQPIIIAVQETRLSRYEIMNRFSHAEYNWKFLSGPSSPS FT QNGVGLAVSKSVPHSFLHVNSSMQVIAARVNHPICATYVSIYIPCHTSPSQ FT LENELEQLITELPSPIVLLGDFNSHSLLWGGIKTDAKGAIIEQLSAKHNLT FT ILNTGEHTRLDPRTGNTSAIDLSITSSNLAGRLRWMVDEDNRTSDHFPLIL FT KTVANITSIHHRRRWLHDQANWVEFEDILLERIPRDSAPSIDELTQHIIYA FT AEQSIPRTTGKIGRKAVPWWNQEVNAAIRARRKKLRKLKRMGENDPRRLQA FT LSEFQQARNASKKIVREAKTNSWNKFVSGFNPQCPANEMWRKVNSFCGKRS FT FQRQVLVIDGETVEDPVQVAEHLAEFFESVTKSADPCLTIEYGVEPVTTPN FT EEINIPFSIDELMWALEKGGNTSVGIDNISYPMLRHLPFRIKMQMLNTINH FT LWNSGRIPDAWKEGLIVPLSKPGKDPKQIGNQRPITLLSCFNKTLERMVNR FT RLVDFIENHYKFNSHQFAFRRGRGVDTYFAALEKDLRKPFDEGEHCELLSI FT DLQKAYDRADRNQIVAKLTSWGVNGKMLNFIKSFLSNRSFRVLVGDVQSTR FT RTQETGVPQGSILSVSLFLVLMETVFEKIPKGIRIFVYADDVVLLAIDKDP FT KLARQKIQKAASTLGLWAEQTKMIISAEKSSCMHICRKRKHPAVPPIEISS FT TVVPEVAYTRILGVIIDKRLNFKRHIAAVRTNVQTVLNFLKVIGNRMNGGT FT RKTMLQVTKAMLLPKIFFCINFINSGSVTNMKRLLPLFNAGVRESSGVFRS FT SPIESIMAEAGQLPFEYTLTLTEITKAIRLLERDTKLFPENYSQSQSITTY FT PSVERAKNAYEQLTQQELPRIAILHRIGDRSWNEPRIRIDWEIKQKFKVGD FT PPAKAQQLFLYLTSTKYMLYQHVFTDGSVDRYKVGFGVATENESFSKKLPK FT ECSIFSAEAKAIWFAIAKTCSNESATVIFTDSASVLTALEKGHSRHSWVQE FT IETVAKGKQLTLCWIPGHTRISGNEKADALAKAGEDDEFYNCDIPAEDALR FT CCKHEIRLAWENQWRNSTTFLRNIKATTFQWKDRQNPTLRRAISRLRIGHT FT RLTHQHLFNRETNVCPTCGVPVTVTHLLLDCRAYEDQRKECELGLTIGDIL FT TNEEENEKKLENFLRKTELINKL" XX SQ Sequence 6747 BP; 2238 A; 1575 C; 1408 G; 1525 T; 1 other; agttttgtca gtgtatccag gtccgagtta cattgcgcga cttacctcga ttatttttca 60 ctattttttc catcaaatag ccagcgataa tagctggaac tacggctaac acgtcgccag 120 atggtgtagc ccgtatattc acaaaatttc agtccgaact cctccaaacg cgtcggtttt 180 tcgcgagttg aaaaaaatag tttcgcaata aacgtaccga gcagtggaca tcttgtgcat 240 agagacgaat caattattgt taagactgca tgtgttagtt agctataact attgtaccgc 300 tgcaaagaaa cgcagcatcc acagcaattg acgagtgact gagcgactgt ttgtgagcga 360 acttctatcc acatagaagt cgactctctg tttgcgatag tacttcacct ccaatgaaat 420 gaacggcaaa acaccaccaa ccgtgcctcc cgatatagat ccaggagcaa gcacgagtgg 480 tttcaatttg ccgaataaac aaaacagaaa caagagaaca ttgccaatat ggatggatag 540 gaacgatcaa tacggtgatg cccaattctt gttactaaga ggtgctaatg atacccctct 600 tgcgagaaaa ccgctggtca tagaaaagtc tgtcgagctg gctgccggtg gcccgattgc 660 tggggcaaaa tcagaatcac gttgtacgaa atataccctt gaagtgcgca acaaatctca 720 ggttgagaac ctcttgagtc tcaaacgatt gatcgatgat acacctgtag aagtagtata 780 tcatccaacg cgtaatgttt gccgctgtgt tgtcacatgt tgggattttg cagaaacatc 840 tacagatgat atcatcagaa gactccagga tcaaaatgta acggatgttc gaaggattac 900 acaacgaatt ggcaaaaatg ttgtcaacac agggacgatg atactcacta ttcgcggacc 960 tgccccacca agttacgttc gactcggact cctgcaagtg ccaacgcgcc cttactatcc 1020 cagccctttg ttgtgctaca agtgtctgtc ttatggtcat actaagacag cttgtaagga 1080 aacggaaaaa tgccagaact gctcgactat acacgatgaa atggaaaatt gtaatcgtac 1140 accctactgc aaaaattgcg gagaaaacga aaaaccaacc agtcgcacat gtgcagtata 1200 tcgacgggag caagaggtca tcaaaattaa ggtcgacgag ggtctgactt atgctgcagc 1260 gagagaagca catagatcca gaatcactag ctcgaaaggc agtttcgcca gcatcgttca 1320 gcaacgtttg attaacgttc aagctgccgg tgatgagact aacgagctta aacaacaatt 1380 aaaactacag gaagagcaga acagagctct tattgtagaa aacgaaaaac ttcgaaatga 1440 gatcaccaac ttgcggaagt cagtcaacga acttgctgaa acgactaaac aattacaact 1500 acaactcaaa cagcagcccc aatgtgcagc cccaagcgac atagtggcaa gtgttcctac 1560 agtgcaactg ccgggaatat caaaaatgca aacgaggcag caacaccgta acacgtctga 1620 aagcagaaaa agtgcacaat attcaagcag cgaagcaaaa cagcaaatca agcgacacgt 1680 ttccagtcct ccgaaaaaac cggccaaact acgaaaatca aacaggaaga aaaaacacgt 1740 tgaaatctcc tcgaatccag cctcagcaga tgaaaccgct tcagaagaag gcttttccga 1800 agcacgaccg gatgaagaaa tgtctcaaaa ttaaatccca tcaaaacgct ggcaatagca 1860 agctacacta ccaacacaac atcaaacaat gaatcctaaa actaatcaaa aaaatggaag 1920 tcaaaagtta ttgtacaaaa atggaaaaca cgtcagcaac cctcatgagc cagcaagctc 1980 cacatcgtac gtcagcaaca acatcacaaa tcattcagcg attccacggg tttgtacaac 2040 cccaggcctt ttccctgcac caaccccgac cagagaagtt ccagaatttt ctgatgagcc 2100 tttggcggca gttggtgtgg ggggcctgta ctcacaggca agtactcatt caattctaaa 2160 ttatgaacaa acaacagacg ctctgaacat tctagtcatc cagtctccat caacaagtta 2220 cacatcgtat aacagcaaca tattcgtgga taattcagcg attccaaggg tttacacaac 2280 cccaggcctt ttccctgtac cgaccctgac cggagaagtc ccagctagtc ctgatgagcc 2340 tttggcggca gtcggtacag ggggcctgtt ttcacaggca agtacccata caatttcaaa 2400 ccacgaacag acaacagacg cttcgcacat tttagtatcc cggccttcac caaaacgtcc 2460 ttcatcaaac aacagcaaca actgcataac ttactcagcg attccaaggg tttgtacaac 2520 cccaggcctt tgccctgtac cgaccccgac cggagaagtc ccagctagtc ctgatgagcc 2580 tttggcggca gttggtacag ggggcctgtg ttcacaggca agtacccata caattccaaa 2640 tcacgaacag acaacagacg cttcgcacat ttcagtgtcc cgacctgtat caaaacgccc 2700 cacaccgaat aacagcagca accccgtgac tcattcagcg attccaaggg tttatacaac 2760 cccaggcctt ttccctgtac cgaccccgac cagagaagtc ccagatatca ctgaagagcc 2820 tttggcggca gttggtacag ggggcctgta cttacaggca agtaccaatg caatatcttc 2880 ttacgggcag acatcagatc caatgcatat tccagatcat tcgaaaaacc actcacacct 2940 attgcgaacc actaccagcc agtcgacctc tcagaacact gtacatgtcg agtcaacacg 3000 tgctactcga aacacgtgca gcccagctga aaaatgtgta gctattcagt ggaacgtaaa 3060 tggtcttcaa gaacgatatg ctgaactcca actattggta acagagtatc aaccaatcat 3120 aatcgcggta caagaaactc gactgtcgcg gtacgaaata atgaatagat tctcwcatgc 3180 cgagtacaat tggaaatttt tatcggggcc atcatcacca tctcagaacg gagtgggcct 3240 tgctgttagc aaatcagtcc cgcatagttt tctgcatgtg aattcatcaa tgcaagtcat 3300 cgccgccaga gtcaaccatc ctatctgtgc tacttatgtc tcaatctaca taccttgcca 3360 tacttcacca agccagttag aaaatgagtt agagcaactg ataacggaac taccttctcc 3420 tatcgtgctg ctgggtgatt ttaattccca tagcttactc tggggaggca taaaaactga 3480 tgccaaagga gcaataatcg aacagctctc agcgaaacat aatctaacca tcctcaatac 3540 tggggaacac acgcgattgg accctcgtac tggaaacacg tcagcaattg atttatcaat 3600 aacatcatca aacctagcag ggcgactccg gtggatggtt gatgaggata accgcaccag 3660 tgatcatttt ccactgatcc tcaaaactgt agccaacatc acaagcattc atcatcgtag 3720 gcgttggttg catgatcagg caaactgggt agagtttgag gatatactgt tggaaagaat 3780 tccacgtgat agcgctccat cgattgatga actcacccag cacatcatct acgcagcgga 3840 gcaaagcata ccacgaacga ctggaaaaat cggccgcaaa gctgttccat ggtggaacca 3900 agaagtaaat gctgcaatac gagcaaggcg gaaaaaactt cggaaactta aaaggatggg 3960 cgaaaacgat ccacgcagac tacaggcttt aagtgagttt caacaggcac gaaacgcgtc 4020 aaaaaagatc gtcagagagg caaaaaccaa ttcgtggaac aagtttgtct ctggattcaa 4080 cccacaatgt ccagccaacg aaatgtggcg gaaagtaaat tccttttgcg gaaagagaag 4140 ttttcaacga caagtgctag ttatcgacgg tgaaactgta gaagacccag tacaggttgc 4200 cgaacatctg gcggagttct tcgagtcagt tacaaaatct gcagatcctt gcttgacgat 4260 tgaatacggt gtagagccgg ttacaacccc caacgaagaa atcaacattc cttttagcat 4320 cgacgagctc atgtgggctc tcgaaaaagg tggaaacaca tcggttggca tcgataacat 4380 cagttatccg atgctaagac atctcccgtt tcgaataaaa atgcaaatgt tgaacacaat 4440 caatcatctc tggaactccg ggcgcatacc ggatgcatgg aaggaaggat tgatcgttcc 4500 tttatcaaaa ccaggaaaag atcctaaaca aattggcaac caacgcccga taacactgct 4560 aagctgcttt aataagactc ttgaaaggat ggtgaataga cggctagttg atttcattga 4620 aaaccattac aaatttaatt cacatcaatt cgcctttcga agaggaagag gagtagacac 4680 gtactttgcg gcactggaga aggacttacg taaaccattc gacgaaggcg aacactgtga 4740 attactgtcc attgatctgc aaaaagccta tgacagagct gaccgaaatc aaatcgttgc 4800 aaaactcacg agctggggag ttaacggtaa aatgctcaac ttcatcaaaa gttttctgtc 4860 taacagaagt tttcgagtac tggttggcga cgtgcaatct acgcgtcgaa ctcaagaaac 4920 aggagtccca caaggttcga ttctctcagt atcgcttttt ctagttttaa tggagaccgt 4980 gtttgagaaa ataccgaaag ggatacgaat attcgtctac gctgatgatg ttgtattact 5040 ggccatcgat aaggacccta aattggcgcg ccaaaaaatt caaaaagcag cttcgacgtt 5100 agggttatgg gctgaacaaa ctaaaatgat tatatcggct gaaaaatcaa gttgtatgca 5160 catttgcaga aagcggaaac acccagcagt tccaccaatt gaaattagta gcactgtagt 5220 tccagaagta gcgtatacaa ggatactagg agttataata gataaacgac tcaacttcaa 5280 aagacacata gcagccgtgc gaaccaacgt tcagactgtg cttaacttcc tcaaagtgat 5340 tggaaataga atgaacggtg gaacgcgtaa aacaatgcta caagtcacaa aggcaatgct 5400 tctccccaaa attttcttct gtataaactt catcaacagt ggcagcgtaa caaatatgaa 5460 gagactacta cctttattca atgcaggtgt tcgcgaatca tcgggagtct tcagatcaag 5520 cccgatagag tctataatgg ctgaagctgg acaattgcca tttgaataca ctctaactct 5580 cacagaaata accaaagcca ttcgattatt ggagagagat actaagcttt ttcctgaaaa 5640 ctattcccaa tcacaatcga taacaaccta cccatcggtt gaaagagcca aaaacgctta 5700 tgaacaactc acccagcaag aattacccag gattgcaata ctgcaccgga tcggtgacag 5760 atcatggaat gaaccaagaa ttagaatcga ctgggagatc aagcagaaat tcaaagttgg 5820 tgacccacca gcaaaagctc aacagttgtt cctatactta acttcaacta aatacatgct 5880 gtaccaacac gtgtttacgg atggctcagt ggacagatac aaagtaggct ttggtgtagc 5940 aacagaaaac gaaagtttta gtaaaaagct acccaaagaa tgctctatat tctctgcgga 6000 agcaaaagcc atttggtttg caatcgccaa aacctgttcg aacgaatcag ctacagttat 6060 attcacagat tcggcgagtg ttctgacagc tctagaaaag ggccactctc gtcactcttg 6120 ggtacaagaa attgaaacag tcgcaaaggg aaaacaacta accttatgct ggattccggg 6180 acatacgagg atttcaggta acgaaaaagc agatgctctg gccaaagcag gcgaggatga 6240 cgaattttat aactgcgaca ttcctgctga ggacgcacta cgatgttgca aacatgaaat 6300 aagactagca tgggaaaacc agtggagaaa tagtacaact tttctgagaa acatcaaagc 6360 aacaacattc cagtggaagg accgacaaaa cccaacgctg aggagggcca tcagtcgtct 6420 acgaatcggt cacacaagat tgacccatca acatctgttc aaccgtgaaa ctaatgtgtg 6480 cccaacatgc ggagttcctg ttacagtaac tcacctgctt ttggactgta gggcttacga 6540 agatcaaagg aaggaatgtg aactagggtt aactataggt gatattttga caaacgaaga 6600 agaaaacgaa aaaaaattgg aaaatttctt gaggaaaact gaattaatca acaaattatg 6660 aactgtagct aagacattgt taagaaataa aactttgaag gatgaatggc cacttccgag 6720 gctaaaatcc aaaaaaaaaa aaaaaaa 6747 // ID Copia-117_AA-I repbase; DNA; INV; 4375 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-117_AA_; KW Copia-117_AA-LTR; Ty1_copia_Ele78; Copia-117_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4375 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1870-2373] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1297..4365 FT /product="Copia-117_AA-I_1p" FT /translation="MVCDESIMENSRTVDPITISTAKDGQVMVGRKKGVVK FT LKSVVVKPGKIKKYELYDVLFIPNLQYNLLSVSKLVSAGKMVTFNDNGVVI FT SEKSGEVIASGKKVGGLFVLDLVVDDDDNGNEIACVTMVGSPLDLWHKRLG FT HLGYKNLEKLVSQRMVEGIEFERLSHIQGKRSVCSPCVIGKQNRKPFKTNC FT VRSTRPLEVVHSDVCGPMSERTYDGYRYFVVFIDDFTHFVVVYLLRHKHEV FT LDKFKEYEAMASAHFNFQISRFISDNGGEYYGKKFRRFCTSKGIQMIPTCP FT YTPQQNGVSERMNRTLLDKTRAMLNENDVPKELWGEAVYVSAYLTNRSPTR FT KIKGLKTPFEMWFNIKPKVDGLRIFGCLAYGQIAKERRKKLDDRSKKLTFV FT GYATNGYRLWDNESKRIIVSRDVIFDENKSKFRTTQDPVKVEYQYHKIVTS FT SIPTMPTDASDVSSNDEDDSDEDEVKHDVNDGDVEGNVQNPEITVRRSKRI FT QEKQKQKEEDVQGAFSAVSFVEDIPQTIPQLKNRDDWPLWKAAIQEELEAL FT EANNTWSLVNKVPSGFKPINSMWIFGIKDGESRRYKARLVAKGCSQRYGLD FT YNETFAPVAKMVTIRTILSMAVVKNMLVHQMDVKTAFLNGNLEEEVYMKLP FT YDEDGLSPICRLNRSLYGLKQAGRSWNQCFDDFMRKLNFSRLSSDTCVYVF FT NDYELYVVLYVDDILVVADDITRVNWIKQELAKRFQMKDIGAVKTYIGLEI FT ERNYKTKEMKISQKNYIEKILNRFGMQDCNPVATPMDVNIKWEACEGIRTD FT EPYKALLGCLQYLASMSRPDICAAVSILSRYQNCANCAHWNSLKRILRYLR FT GTKDLYLLYSFHEQARALEGFADADFGNNFEDRRSNSGNLFLVYGNIVSWS FT TKRQPTVSLSSTEAELISLCNGTKEGIWLSNLLREIGIESCPFTIYEDNIP FT CIRIAEEPREHQRTKHIDIKYMFIREVIQSKKLRIQFIKSEDQLADILTKP FT LPRMRFEKMTARLQLQN" XX SQ Sequence 4375 BP; 1405 A; 749 C; 1070 G; 1150 T; 1 other; acaggttatg ggcccaggag attaatttaa tcactggtta aatctaggaa ggtagaagta 60 ccgctagatt gtttttagat tggtggagac ctttagagga ctctaccgga attcacagat 120 cagtcgaaga gtgatctgga actgtttgaa ttttcatttc agattatgaa aattatcggt 180 tcatctacaa caatgagcga agtcggacag cggcagcgaa tggtaatgat accaaccttc 240 cagggagcag tcggtaagga ctatcccttt tggaaggtga gagtaacgca ctaccttaag 300 caacatggtc ttctgcactg tttggaacgc ataccaccca tggaagagta tgctgacaac 360 ccagaaggca ccacggcaga ggaagcagcg ctacgaaatg cgcattacga gaagcggctg 420 aaggaagacg atgaagtggt aaacatcttc tggacatccc tggacaacga cgctatggct 480 cacgtgctgg agtgtgttta cgccaagcaa atcctggacc gcctggactc ggtgtatctg 540 cgtcatggga gactggcact attttctatt aggcgcaagc tgtacaatct tcgtacggct 600 ggctatagat cactaatgga gttgttcttg gcccacgaga ggctcatcca ggagttggaa 660 cgagcaggtg aagcggtatc ggctagcgaa aaattgaaca ccttgctagt tgccataccg 720 gatcgtcacc aaagcatttt ggatgcaatt gcggtgatga ggcagggtga cttggcaaca 780 atgtctctgt ctgaaatccg tagccttttc ctggacgccg aagagaagaa ggaggaacac 840 cacgagctgg aaccatccaa cgtggcaatg gctgccggca accgaaacat gcccaagaag 900 aataagaaga gaaagaagga tacgaggacg tgttacgagt gtggcaaacc aggtcacacg 960 aaacggtact gttwccgtct gttgaacaga gaagtgccgg ggaataggca tgcttctgag 1020 agggaatatg tcccagcaga acgcgccaac attgccttag ttagtgtttc tgagagggaa 1080 tatgtcccag cagaaaacaa acgtaaaaag gtcggaattg agtctattga ttctgagaag 1140 gaaattgttc cagcagaata cagacgggag aaattcgaat ccgagcgtgg tttgattgct 1200 ggttctgaga gggaacgtgt cccaacagaa cgaaggcata gcaaaccggc ggataggaag 1260 gtgcgcctca tcgtggattc tggtgctagc gaacacatgg tctgtgacga atcgatcatg 1320 gagaacagtc ggaccgtgga tcccattacc atttcgacag ccaaagatgg gcaagtgatg 1380 gtgggaagga agaaaggagt tgtgaaatta aaaagtgttg ttgtaaagcc aggaaaaata 1440 aagaaatacg aattgtatga tgtgcttttt attcctaatt tacaatataa tctcctatcc 1500 gtttccaaac ttgtttctgc tggcaaaatg gtgacgttta atgataacgg tgttgtaata 1560 tccgagaaat ctggtgaagt cattgcgtca gggaagaaag tcggtggttt gtttgttctc 1620 gacctggtag ttgatgatga cgataatggt aatgagattg cttgcgttac aatggttgga 1680 agtccactcg atctttggca caaaagacta gggcacctag ggtacaaaaa tctagagaaa 1740 ttagtttccc agcgaatggt tgaaggtata gaattcgaaa ggctttcaca tattcagggg 1800 aagcgtagcg tatgttctcc ttgtgttata ggtaagcaaa atagaaaacc tttcaaaacc 1860 aattgtgttc gatcgactag accgttggaa gtcgtgcatt ctgatgtgtg tggaccaatg 1920 agcgaaagaa catacgatgg ttatagatat tttgtagtgt tcattgatga ttttactcat 1980 tttgttgttg tttatctgtt acgtcataag catgaggtgc tggacaaatt taaggaatat 2040 gaagcaatgg cttcagctca ctttaatttc caaatctcta ggtttatttc agataatggt 2100 ggagagtatt atgggaagaa attccgaaga ttttgtacaa gtaaaggaat tcaaatgatt 2160 ccaacttgtc catatactcc gcagcaaaat ggggtgagtg aaagaatgaa tcgcactcta 2220 ttagataaaa caagagcaat gcttaatgaa aacgatgtac ctaaagaact atggggagag 2280 gcagtatacg tgtctgccta tttaaccaat cgttccccaa ctcgaaagat aaaaggtttg 2340 aaaactccat tcgaaatgtg gtttaacatc aaaccaaaag tcgatggatt acgtatcttt 2400 ggttgcttgg cctatgggca gatagcgaaa gaaagacgta agaaattaga tgatagaagt 2460 aagaagttaa cttttgtagg gtatgccacg aatggatatc gattgtggga taatgaatct 2520 aaaaggataa ttgtctcacg cgatgtaatt ttcgatgaaa ataaatcgaa atttcgcact 2580 acccaagatc ctgttaaggt ggagtaccaa tatcacaaga ttgtgacgtc atcaattcca 2640 acaatgccca ctgatgcttc agacgtgagt tcaaacgatg aagatgatag tgacgaagat 2700 gaagtcaagc atgatgtcaa cgacggcgat gtcgaaggca atgtgcaaaa tcccgaaatt 2760 actgtacgcc gcagcaagag aatccaggag aaacaaaagc agaaagaaga agacgtgcag 2820 ggtgcgtttt cagcagtaag tttcgttgaa gacattcctc agacaatacc gcaactcaaa 2880 aacagagatg attggcctct atggaaagct gcaattcagg aggaactaga ggcattagaa 2940 gcgaacaata cgtggagttt ggtaaacaag gtcccatctg gattcaagcc gatcaattcc 3000 atgtggatat tcggaatcaa agatggagaa tccagacgat ataaggcaag acttgttgct 3060 aaaggatgtt cccaaagata tggtttggat tacaatgaaa cctttgctcc tgtggctaag 3120 atggtaacta ttagaactat tctttctatg gctgtagtta aaaatatgct cgtacatcaa 3180 atggatgtaa agacagcctt tctcaatgga aatttggaag aagaagttta tatgaagtta 3240 ccatatgacg aagatggact aagtccaatt tgtcgtttga acagaagtct ttatgggctg 3300 aagcaagctg gaagaagctg gaatcagtgc ttcgacgatt tcatgagaaa gctgaacttt 3360 tctaggttat ccagcgatac ttgtgtttat gtattcaatg attatgaatt gtatgtcgtg 3420 ctttatgtgg atgatatcct agttgtagca gacgatatta caagagttaa ctggataaag 3480 caagaattag ctaaaagatt tcaaatgaag gatattggag cagttaagac ctatattggg 3540 ttagaaatag aaaggaatta taagacaaaa gaaatgaaaa tttctcagaa aaattatatt 3600 gagaaaattc ttaatcgttt cggtatgcaa gactgtaacc ctgttgctac accaatggat 3660 gtcaacatta aatgggaagc gtgtgaaggc attagaacgg atgaaccgta taaagcttta 3720 ttgggttgtt tacagtactt agcttcaatg tctcgtcctg atatttgcgc tgctgtaagt 3780 atcctcagta gataccaaaa ttgtgcaaat tgtgcacatt ggaatagttt gaaaagaatc 3840 ttacggtact tacgaggtac aaaggatcta tatcttttat atagttttca cgaacaagca 3900 agagctttag aaggatttgc agatgctgac ttcgggaata attttgaaga tagaagatca 3960 aattctggaa atctgtttct ggtctatgga aatatagttt cttggtctac aaagagacag 4020 cctacagtaa gcctttcgtc aacagaggct gaactgattt ctctttgtaa tgggacgaag 4080 gaaggaattt ggttatcaaa tttattgcgt gagattggaa ttgaatcatg tcctttcaca 4140 atttatgaag ataacatccc ttgcattcga attgctgaag aaccaagaga acaccagaga 4200 acgaagcaca tcgatattaa atatatgttc atccgagaag tcatccagtc caagaagttg 4260 aggatccagt ttatcaagag tgaagatcaa ctagcagata tcctcacgaa gccattacca 4320 aggatgaggt tcgagaagat gacagcgaga ctacaattgc aaaattgagg ggaag 4375 // ID Chapaev3-2_NVi repbase; DNA; INV; 2226 BP. XX AC . XX DT 12-MAY-2009 (Rel. 14.05, Created) DT 12-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE Chapaev3-2_NVi is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2226 RA Bao W. and Jurka J.; RT "Chapaev transposons from Nasonia vitripennis."; RL Repbase Reports 9(5), 932-932 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 622..1818 FT /product="Chapaev3-2_NVi_1p" FT /translation="LNDLVRDLGLPKDGAEYLAAALKRKHLLANGTTAYFY FT RNREKKFRKFFTNDTENSLVYCSDVKGLVHELKPNVYEDEEWRLFIDSSKR FT SLKAVLLHNGNTFAPIPIAHSTKLKETYENLKIVLDKIKYSEHQWQVCGDL FT KIATLLLGQQSGYTKNPCFLCLWDSRDRINHYVKKEWEKRTNFIPGSKNIL FT HKPLIEPSKYLLPPLHIKLGLMKQYVKALDKEKDCFRYLQEKFPAISDAKL FT KEGIFDGPQIRRLFEDANFITSMDNTEEAAWQSLKNVSQNFLGNTKSEDYE FT NIVDELLENYKNLGCLMNLKLHFLHSHLDYFPENLGDYSEEQGERFHQDIS FT EMENRYQGRWDVNMMADFCWMLQRDEKTNNKKRNRNPLHRSFEEKRVRYKR FT RRTV*" XX SQ Sequence 2226 BP; 776 A; 356 C; 416 G; 676 T; 2 other; acaccggcac acaaagaaaa aaaagtgggg agtacttttg actagaccca tatatcccta 60 tgtattttga cgcgctgaat ccgaattcgg tgtyygtttt gcccgtacac ccccaaaatt 120 ttgagtaaat agcgaaaaac ccgtgaaaaa ttgctgaaaa tcgcatttgt agacgttggt 180 aaatgttaga aagttttctt gtgagtgtat tatacatgat tttagttcat tttaatgtgc 240 ctaatacgag ttaggtgttc gttttgcttt taactacaga ttttgcataa actgtacata 300 tgcgactgtg attgcataaa gaaaatgata aatttttttc atttattttt cattgagtgc 360 ttatgataga gatggaaaaa tatgtaagaa atgctgctct tacactgttc ttcttctttt 420 gtaacgtact cttttttctt caaacgatct atgtaatgga tttctatttc tctttttgtt 480 gttcgttttt tcgtctctat gtaacatcag caaaaatcag taatggaagt agatgaatcc 540 tatgataaag aaagttctga ggatgatgaa ttcctgccag ctggtgagag gatagctcct 600 caaaccttta ctcaaaaata attgaatgat cttgtaagag atttaggttt acctaaagat 660 ggagctgaat atttagcggc agcacttaag aggaaacact tgttggcaaa tggaacaact 720 gcttactttt atcgtaatag agagaagaag tttaggaaat tttttacaaa tgatacagaa 780 aattcactgg tatattgttc agatgttaaa ggattggtgc atgaattaaa accaaatgta 840 tatgaagatg aagaatggag actctttatt gattcgtcaa aacgaagtct gaaagccgtt 900 cttctccata atggtaatac atttgcacca attccaatag cccactcaac aaagttaaaa 960 gagacttacg aaaacttaaa gatagtattg gataagataa agtactcaga acaccaatgg 1020 caagtttgtg gtgatctgaa aattgctact ctgcttctag gccaacaatc tggttatacc 1080 aaaaatccct gctttttgtg tttatgggat agtagagaca ggataaatca ctatgtcaag 1140 aaagaatggg aaaagagaac taactttatt ccaggatcga agaacattct acacaaacca 1200 ttgatagaac cttcaaaata tctgctacca ccactgcata ttaaactagg acttatgaag 1260 cagtatgtta aagctttgga taaagagaaa gactgttttc ggtacttgca agaaaagttt 1320 cccgcaataa gtgatgctaa gttgaaagag ggtatttttg atggtccaca aattcgccga 1380 ctttttgaag atgccaattt tattacaagc atggacaaca ctgaagaagc agcttggcaa 1440 agtttgaaaa atgtttctca aaacttttta ggaaatacga aaagtgagga ttacgagaat 1500 atagttgacg agttactaga aaattacaaa aacttgggtt gtttaatgaa cttaaaattg 1560 cacttcttac actctcacct cgactacttt cctgaaaatc taggagatta cagtgaagaa 1620 caaggtgaac gattccatca agacataagt gaaatggaga atcggtatca gggaagatgg 1680 gatgtcaaca tgatggcgga tttttgctgg atgttacaga gagacgaaaa aacgaacaac 1740 aaaaagagaa atagaaatcc attacataga tcgtttgaag aaaaaagagt acgttacaaa 1800 agaagaagaa cagtgtaaga gcagcatttc ttacatattt ttccatctct atcataagca 1860 ctcaatgaaa aataaatgaa aaaaatttat cattttcttt atgcaatcac agtcgcatat 1920 gtacagttta tgcaaaatct gtagttaaaa gcaaaacgaa cacctaactc gtattaggca 1980 cattaaaatg aactaaaatc atgtataata cactcacaag aaaactttct aacatttacc 2040 aacgtctaca aatgcgattt tcagcaattt ttcacggttt ttcacgattt ttcgctattt 2100 actcaaaatt ttgggggtgt acgggcaaaa cggacaccga attcggattc agcgcgtcaa 2160 aatacatagg gatatatggg tctagtcaaa agtactcccc actttttttt ctttgtgtgc 2220 cggtgt 2226 // ID Gypsy-12_CQ-LTR repbase; DNA; INV; 172 BP. XX AC AAWU01009192; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_CQ_; KW Gypsy-12_CQ-I; Gypsy-12_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-172 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 404-404 (2011). XX DR GenBank; AAWU01009192; Positions 62866 63037. XX SQ Sequence 172 BP; 42 A; 53 C; 30 G; 47 T; 0 other; tgtggtaacc ccatgttacc atcggcgtac ccctcgccag tgcgatgtta tttgatacga 60 attgtaaaac tttgaattac agatcactcc cacagtgacc gtacgaagca caagacacgc 120 gtctcctttt ctcctccatc cgaactcttc tctaacctca cgggttagct ca 172 // ID Gypsy-110_AA-LTR repbase; DNA; INV; 230 BP. XX AC AAGE02027584; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-110_AA_; KW Gypsy-110_AA-I; Gypsy-110_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-230 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027584; Positions 107124 106895. XX SQ Sequence 230 BP; 65 A; 34 C; 74 G; 57 T; 0 other; tgtagggtct ggcatcactg gagggctgac gtcgtaccaa cgagaagaag atgtgttact 60 ctttgggcgg ggaagtaaag aagaagtgat gacgggggag aatggttaag accgtgaagt 120 ggttagatca cgtatatcga gtggtggtga gtagaggaat aaactattga gatatattgt 180 taaatccgtg acctgcgtga ttgtacgagt attccgaagt gcccctcaca 230 // ID Copia-7_SI-I repbase; DNA; INV; 4127 BP. XX AC AEAQ01011831; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_SI_; KW Copia-7_SI-LTR; Copia-7_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4127 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01011831; Positions 4574 448. XX CC Positions [1541-2038] - Integrase core CC 'TAGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 80..2518 FT /product="Copia-7_SI-I_1p" FT /translation="MATNNQDDVIQIEKLKDSENFMIWKFQVTIVFKSLGL FT YDIVTGVLALAEGMTEQAKAEWIRKDARAQKIIITSIEKQPLTHVLVCKTS FT QEMFQRICAMYERDTEQQKCILLQEFFNFSYQKGTDIATHVSKLENLAYRL FT KVLNADIDDSMLMSKILVTLPPEKYKHFACAWESTAQDQTTLSNLTSRLLS FT EENRLSVTDARQDAVAFKATEKKCHKCNNTGHLARACKVNKGNIQDMRCFK FT CNGRGHIAKFCKKEDATGGTTKCNICKKFNHEEKDCYFRKNKTDDQTSTKD FT KVTFLASNCKTNKWILDSGSSSHITNDKNMFKNMRKIESEIGTAKKSQILK FT AEGIGTVETDKCVLNKVLYVKDLSKNLLSVNEITKNGGKVIFTKEKAEIWK FT DNIKLIGFRNGDGLWTVNLNQESMDSALLTRENKMAIQWHQKLGHLGKENM FT KKLQNISEGMKITDKSLDVLNNVCETCLKAKQTRIPFGEKRTRAKRPLEIV FT HTDVCGPIDPITWDKKKYMITFIDDFTHFVMVYLIEGKYEVAGTVKEYANQ FT VETKWNLKISKLRCDNGREYVNEELKIWCRRRGTILDVTTPYTPQLNGKAE FT RMNRTLIEKTRALLTDAQIDKSFWGEAVRTAAYLTNRSPTDAVKGTPLENW FT TSQKPDLTRLHIFGCNAYTKTLTPLKKLDDRCKSYIFVGYAPNGYRLYDEE FT RRKIVISRDVKFKEEITSKNKSAKINIHAEEEQTEKQEEIENNMENISGSE FT EEYQDADLEIEEEESQEETQEDTQEEQEELGRGKRKRKIPDRYKDYVYLTY FT REATTGPDKEN" FT CDS 2847..3968 FT /product="Copia-7_SI-I_2p" FT /translation="MELPDGYNCNNKICRLNKALYGLKQASLRWNQRFTTF FT LKTKGLKPTKAEPCIYRTDDNTLILAIYVDDGLIIGQNRQSIDKLLSNLGK FT EFHISVFKDIKSFLGIEIQQDKEGLKLLQLNYADHVVKMFNMKNSKPVNTP FT ILTNDDRSEEINEVFPYRESVGSILYLSSKTRPDLAYAAGYASRHLDKPYK FT QDVANIKRILRYINATKNLGISYKVNNDKNKEELIGYTDSDYAGDVETRKS FT TTGFVIMYCNGPISWCSKKQSVVALSSTEAEYIAATECCKEILYLKFLIEE FT LTGKSVKATLNIDNQSTIHLIKNGIGNKRSKHIDIRYRFINEKVAEGLISV FT NYCSTDTQLADIFTKALGQNKFEIHKKRLLN" XX SQ Sequence 4127 BP; 1654 A; 637 C; 874 G; 962 T; 0 other; ggttatgggc ccaggcgtca gaggcgtaaa atcaacgtaa aaagaaaggg taaaacgaga 60 gaggctagct gtacgaaaaa tggcgacgaa caatcaagac gacgtgatac agatagaaaa 120 actgaaggat agcgaaaact tcatgatatg gaagtttcaa gttacgatcg tattcaagtc 180 tttagggtta tatgatattg taacgggagt cttagcatta gcagaaggaa tgacagagca 240 agcaaaagca gaatggatcc ggaaagacgc acgtgctcag aaaataataa tcacgtctat 300 agaaaaacag ccgttgactc acgtactggt atgtaaaacg tcacaagaaa tgttccaaag 360 gatttgtgcc atgtacgaaa gggatacaga acagcagaaa tgtatcctac ttcaagaatt 420 tttcaatttt tcataccaga aaggaacaga tattgctaca cacgtgagta aacttgaaaa 480 tttagcatat aggttaaaag tcttaaatgc ggacattgat gactctatgt tgatgtcaaa 540 aatacttgtg actctgccgc ctgagaaata taaacatttt gcatgtgcct gggaatccac 600 cgctcaagac cagacgacgc tttcaaatct tacgtctcga ttgctctctg aggagaacag 660 actttcagtg accgatgcga ggcaggatgc ggtggcgttt aaggcgactg agaagaaatg 720 ccacaagtgc aataataccg ggcacctagc tagagcgtgc aaagtaaata aaggtaatat 780 tcaagatatg cgctgcttca aatgcaatgg tcgaggtcac atagcaaagt tttgtaagaa 840 ggaggatgct actggaggca caaccaagtg taacatatgt aaaaagttca atcatgaaga 900 gaaggactgt tactttcgca aaaacaaaac ggatgatcaa acaagtacaa aggacaaggt 960 aacgttcctg gccagcaact gtaaaacgaa taaatggatt ctggattcgg gctcgtcgtc 1020 gcatataaca aatgacaaga atatgtttaa aaatatgaga aaaatagaat cagaaatagg 1080 aacagcgaag aaatcacaaa ttttaaaggc tgaaggaatt ggcaccgtag aaacagataa 1140 atgtgtactc aacaaagtac tatatgttaa agatctaagt aaaaatttat tatcagtcaa 1200 cgaaattaca aagaacggag gaaaagtaat tttcactaaa gaaaaagcag agatatggaa 1260 agataacata aaactgattg gttttcgcaa cggagatggt ttatggaccg taaatttgaa 1320 tcaagaatct atggacagcg ctcttctgac acgagaaaac aaaatggcta ttcaatggca 1380 tcagaaattg ggacatctag gaaaggaaaa catgaagaaa ctacaaaata tatcagaagg 1440 tatgaaaata acagataaaa gcttggatgt cttaaataat gtatgtgaga cgtgtcttaa 1500 agctaaacaa acgagaatac cttttggaga gaagaggaca agagcaaaga gaccactgga 1560 gatcgttcac accgatgtgt gcggaccaat agatcctatt acttgggata agaagaaata 1620 tatgataacg ttcatagacg actttacaca ttttgttatg gtctatctca tcgagggaaa 1680 atatgaagtg gcaggtacag tgaaagaata cgcaaatcaa gtagaaacga aatggaattt 1740 gaaaatctcg aaactaagat gcgacaatgg acgcgagtac gtcaatgaag aactcaaaat 1800 ctggtgtaga cgaagaggaa caattctaga tgttacaacg ccttataccc cacaacttaa 1860 tggaaaggca gaacgtatga acagaacgtt aattgagaaa acgagagcct tacttacaga 1920 cgctcaaata gacaaaagtt tttggggaga agcagttcgc actgctgcat atttaacaaa 1980 tagaagcccc actgatgcag ttaaaggaac accgttagag aactggactt cacaaaagcc 2040 agatttaaca agattacaca tctttggttg taacgcctac acgaagacac taacaccatt 2100 aaaaaagctg gatgatagat gcaagagtta catcttcgtt ggatatgcac caaatgggta 2160 ccgactctac gatgaagaaa gaagaaaaat agtaatttca agagatgtaa aatttaaaga 2220 agagataaca tctaaaaaca aatcagctaa aataaatatc catgcagaag aagaacaaac 2280 tgaaaaacaa gaagaaattg aaaataacat ggaaaacata tctggaagcg aagaagagta 2340 tcaagatgca gatctagaga ttgaagaaga ggaatcacaa gaagaaacac aagaagatac 2400 acaagaagaa caagaggaac ttggtcgtgg aaagcggaag aggaagattc cagatagata 2460 caaagattat gtgtatttaa catacagaga agcgacaact ggaccagaca aggaaaatta 2520 ggaaaaaagc gatagaagaa gaaaagaagt cgctagaaga gaacaatacc tggaagacag 2580 tagatagaac tgaagcaaaa agtaaaaaaa ttttgagtaa caaatgggtc ttcaaaataa 2640 aggatgatgg aagatataag gcacggttgg tggtgagagg atgtgaacaa cgccacggaa 2700 tagactacga ggaaacgttt agtcccgtag taaatagcag ctcccttcga acactacttg 2760 cgattgccac aaaaagaaga gactacattg tcaaattcga cattaaaact gcattccttt 2820 atggaaattt agatgaagag atctttatgg aactgcctga tggatataat tgtaacaata 2880 aaatatgcag acttaacaag gctctctatg gactcaagca agcttcattg agatggaatc 2940 agcgcttcac taccttcctt aaaaccaagg gactaaaacc gaccaaagca gaaccgtgta 3000 tatataggac tgatgacaac accttaatat tggctattta tgtggacgat ggactcatca 3060 tagggcaaaa tagacagtcc atagataaac tactcagtaa tttaggtaag gaatttcata 3120 taagcgtatt taaagatata aaatcttttt taggtataga aatacaacag gataaagaag 3180 gtctcaaatt attgcaatta aactatgcgg atcatgttgt aaaaatgttt aacatgaaaa 3240 attcaaagcc tgtgaatact cccattctaa ccaatgatga cagaagcgag gaaatcaacg 3300 aagtttttcc atatcgagag agcgttggga gtatattgta tctttcgagt aaaacgagac 3360 ctgacctggc ctacgctgca ggatacgcaa gtagacattt agataaaccg tataaacaag 3420 acgtagcaaa tataaaaaga atattaagat atataaacgc gactaaaaat ctaggtatta 3480 gctataaagt aaataatgat aaaaataagg aagaattgat aggctataca gactccgatt 3540 acgcaggaga tgtagaaact cggaaaagca caaccggttt tgttattatg tattgtaatg 3600 gtcctataag ttggtgctcc aaaaaacagt ctgtagtagc gttatctagt accgaagcgg 3660 agtatatcgc ggcgacggaa tgttgcaaag aaattttata tcttaaattt ttaattgaag 3720 agttaacagg caaatccgtt aaagcgactt taaatatcga taaccaaagc actattcatt 3780 tgattaaaaa cggtatagga aataaaagaa gcaaacacat agacataaga tatagattta 3840 taaatgaaaa agttgctgaa ggactgataa gcgtaaacta ttgctctact gatacacaac 3900 tcgcagatat ttttaccaag gcattaggcc aaaataaatt tgagattcac aaaaagagat 3960 tgttaaacta agttaagaag aaaactataa atctagccta agtagttata agagtctata 4020 gtaatagtta caagagttta tagtaatagg tataagaatt tacagtaata gttataagag 4080 ttacgaaaag acttgaagtg caggacaatt taacaattaa ggggaag 4127 // ID Gypsy-27-LTR_NVi repbase; DNA; INV; 1201 BP. XX AC . XX DT 11-MAY-2009 (Rel. 14.05, Created) DT 11-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-27-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1201 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 991-991 (2009). XX DR [1] (Consensus) XX SQ Sequence 1201 BP; 343 A; 251 C; 305 G; 302 T; 0 other; tgcgacgggg tgaccgtcta gtattaagaa gttctctctc tctcttttct acgtcaccct 60 atcagaactg catcaccatg agttaactct catagatctg gtatgcattc taggtagagg 120 aaggtcggaa tcggccacat agccgtgcaa tagcagccgg gagaaggaga gagagaatag 180 ccgactaaca gccgagttag ccgcttactg tagagcgctc ccaccagccg cgcgactaca 240 cgcacacgct ggagctcacg cacgctcaca cctaagccgc tcgagtcaga agataagccc 300 gccgccggga gggggaaaac ctcgcgccag gccgcggcag cagatactat actgctactt 360 gtatgtgtga ggcgttgagt gtgtgtaaaa agaatattca cctatgtaac ctaaagaatt 420 atttatttgt aatgtatgac atgagaggtg gtatgataca gtgtgaaaac ccaacgatcg 480 cacgagaaaa catatgtaaa tcgccaaatg taacggcact agaccgataa ctatgtattg 540 cgcttgtgtg agtactctag ggaggtgtac cgtcgcgcct cggccagaga attcgcccac 600 ttgagttttc gtcgcagtcc agagaacatc cctacggttg accagacatt tgttaattta 660 gagtagagta aaatataaga gtgcgttatt ttgtcgttat tgttataaga acgagggaga 720 ggtacgagag agaacttcag ccggaaggta gtgtttaatg gtaatattta tgatatatat 780 tgagtgtgag aggaataagc cggacgttgc gaaagtcgga gcgttactgc ctcgtggcag 840 catttttgac gagccgagat ttgtagtacc ggataactgt gtgagcgttg tgtgcgaaaa 900 cggaaattgt agctagtaga aaatatcacc ttagagcctc gcgtttatta tattacagcc 960 ggttttgttt aagtaaaaat ttgtgaagtg ataatcagtc gagtggtcta agcttattat 1020 ttcctatttt tccgatcgaa ctccctcgga gaatagagtc tcgtaactag ccggagacgc 1080 gaccccaaaa cgttcgcaga taaatcgtcc gacctggcgc cttcgaaata aatacggccg 1140 ggcgttagta gctaatcatt gttgcaaagg aaaataacga gagaaaatct acctcgttac 1200 a 1201 // ID Copia-24_DPu-I repbase; DNA; INV; 3474 BP. XX AC scaffold_34; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: internal portion. XX KW LTR Retrotransposon; Transposable Element; Copia-24_DPu-I. XX NM Copia-24_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-3474 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 711-711 (2010). XX DR Genome; scaffold_34; Positions 704689 708162. XX CC Positions [867-1397] - Integrase core CC 'CAAGA' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 327..3464 FT /product="Copia-24_DPu-I_1p" FT /translation="MSDIRDLFNNFQQIPPGKRLIKGVGKNNEALHATGVG FT DIAIRCKVDDVWHNGTLRKVLFVPNLGVNLFSIGAATERDIVASFDNNGVT FT LSNNGKIVGTGSKIQKRLYKMHFLNYQPPTESAALAARAKPNSIQIWHERL FT GHVNFATLKKMNSANFVEGLFIDNSTDTPLFCEGCVFGKHHRLPFPTCGRT FT RATKRGGLIHSDLCGPMSVPSLNGSLYFLTFRDDFTGYGFIRFLKKKSEVS FT SNIQQLIALFETETNERIVTLRSDNGGEYMSKELMQWIANKGIVHQTSTAK FT TPEQNGVAERYNRTILESAKSMLHSSTLGTQFWAEASAAAVYLHNRVSCKA FT MQTMTPYQGWHGRKPNVSHLHIFGCDAYYHIPKDERSKLEPKGQKCQFVGY FT SETQKAFRLYDPSSGKVKISRDVIFNENLSEAPIMSPPCCDVADVDILTEH FT RVGGDASASNDRATVSNSIICSNDTQATIEVEDVIEPFHGFDAPEAAMDPS FT LKTSRIRRKPDRLIEDPNFLCATTEDSSSIIEPQSYEEAITSPDAKKWISA FT MEEEMSSLEENQTWRLEKLPSGRKTIKCKWVYKVKMDSFGHPVRYKARLVA FT KGFSQKEGVDYDETFSPVVRHESVRAILSTAAANDLEILQLDVRTAFLHGE FT LTEEIFMDQPTGFTSTAYPTNVCRLQKSLYGLKQASRLWNIKFDGLLVNLG FT FTRSPIDSCVYHHNGGDGIIILAIWVDDGLLCGRDKKRLLEIIYQLSDHLE FT ISTQEADLFIGIKIDRDRPNRTIYLSQEQYIIRILHRFKMAECNPKGLPAD FT PFSRLTLAGVDGGPCSPSCDQSIYREAVGSLMFLMVCTRPDISYAVGQVAQ FT FCHDPKQVHWSAVTRILSYLKGTSQFGVVYKVGEKPQVLTAYADSDYAGDT FT DSRKSTSGFLLIFNGGPIAWGSRRQSCVSLSTTEAEYVAMCEATKEIVWAR FT QLLDSIGCVQTQPTALFGDNQGAVKLTLNPEFHRRTKHIDIRYHYIREQQV FT SGNIAVVHIGTKDQLADLLTKALPGPAFPELRTRIGVVPV" XX SQ Sequence 3474 BP; 1050 A; 788 C; 747 G; 889 T; 0 other; ggttatgggc ccagcttcgc cataggtatt ataaattgta ataactatac ttaagcaaga 60 aagtgaactc acgctggttt ctgaaacaga gctaaaatgg ccgctggaac tttttctgca 120 aaagatgttt ctcacatttc aaagttcaaa ggagatcagt tcaacttcta caagtttcaa 180 ttaaaactgg ttctaatgaa tcatggtcta ctcaacgtag tagaaggagt ctatcagaag 240 ccaacagtag tagccccagt ttcaaatcca ctggtattca atttgattca gattgttggt 300 atgccgactc aggagcaagt gaacacatgt ccgatatcag agacttattc aacaatttcc 360 agcagatccc accaggcaag cgtcttatca aaggtgtcgg caaaaacaac gaagctcttc 420 atgctacagg agttggagac attgccatca gatgtaaagt tgatgatgtg tggcataatg 480 gcaccctacg caaagttttg tttgttccca atctaggtgt gaacttattc tccataggag 540 cagcaacaga acgcgatatt gtagcatcat tcgacaacaa cggtgtgaca cttagtaaca 600 acggaaaaat tgttggaact ggttctaaga ttcagaagag gctctacaag atgcatttct 660 taaactacca accgccaacg gaatctgcag cactcgctgc cagagcaaaa ccaaactcca 720 tccagatttg gcatgaacgt ctgggacacg taaattttgc tacgttaaag aaaatgaatt 780 ctgctaactt cgtcgaaggt ctcttcatag ataattcaac ggacaccccc cttttttgcg 840 aaggatgcgt cttcggtaag catcatcggc tcccgtttcc tacgtgtggt cgcactcgag 900 caacaaaaag aggtggtctt atccacagtg atttatgtgg accaatgtct gttccgtcgc 960 ttaatggatc tctctacttc cttacgttcc gtgatgactt caccggatat ggctttataa 1020 ggttcctgaa gaagaagtcc gaagtgagct cgaacattca acaactcatt gccttgtttg 1080 aaaccgaaac taatgaacgc attgtaaccc ttcgatctga caacggaggg gaatacatga 1140 gcaaggagtt aatgcaatgg attgccaaca aaggaatcgt ccatcaaaca agtacagcaa 1200 aaacaccgga acaaaatgga gtcgctgagc gatacaatcg tacaattctt gagtcagcaa 1260 aaagcatgct ccattcatca actcttggaa ctcaattttg ggcggaggct tcagcagcag 1320 cagtgtatct acacaataga gtgtcatgca aggctatgca aacaatgaca ccctaccagg 1380 gatggcacgg aagaaaaccg aatgtctccc atcttcacat atttggctgt gacgcctact 1440 atcacattcc taaagatgaa cgctcaaaac ttgaaccaaa aggacaaaaa tgccaatttg 1500 ttggatactc tgaaacgcag aaagcctttc gactctacga cccttcatcc ggcaaagtaa 1560 aaatatcgag agatgttatt ttcaacgaaa atctaagtga agccccgatc atgtcgccgc 1620 cctgttgcga tgtagctgat gttgacattt tgactgaaca tagagtgggt ggggatgcat 1680 ccgcatccaa tgatcgtgcc accgtgtcta attcaatcat atgttccaac gacacacaag 1740 caactattga agtggaagat gttattgaac cctttcatgg atttgacgca ccagaagccg 1800 caatggatcc ttctctcaaa acatctcgta tccgaagaaa acccgatcgt ctcattgaag 1860 accccaattt tctttgtgcc acaaccgagg attcttcaag catcattgaa ccacagtcgt 1920 acgaagaagc aatcacatca ccagatgcga aaaaatggat tagtgctatg gaagaagaaa 1980 tgagttctct agaagaaaac caaacttggc gtctagaaaa acttccaagt ggccgcaaga 2040 caattaagtg taagtgggtc tacaaagtga aaatggactc attcggtcac cccgttcgct 2100 acaaggctcg cttggtagcc aagggcttct cacaaaagga gggcgttgat tacgacgaaa 2160 cattttctcc agttgtgcgc cacgaatcag taagagcaat tctttcaact gcagcagcga 2220 acgatttgga aattctacaa ctcgatgtaa gaactgcatt cttacatggt gaactaactg 2280 aagaaatctt catggatcaa ccgacaggtt ttacttcaac tgcctaccca accaacgttt 2340 gtcgtttgca gaaaagtttg tatggcctta agcaggcatc taggttgtgg aatattaaat 2400 ttgatggcct acttgttaat cttggattca ctcgaagccc aattgactct tgtgtttatc 2460 atcataatgg tggtgacggc atcatcatcc tagcaatttg ggtcgatgat ggtcttctgt 2520 gtggacggga caagaaacgg ctgctggaaa taatttacca gctatcagat catctagaaa 2580 tttccactca agaagctgat cttttcatcg gcataaaaat agatagagat cgtcctaatc 2640 gaactatcta tctttcacaa gaacagtaca tcattcgcat tcttcaccgg tttaaaatgg 2700 cggagtgcaa cccaaaggga ttaccagcgg acccgttttc cagacttacc ttggcaggtg 2760 ttgatggcgg accttgttca ccatcttgtg accagtccat ataccgtgag gcagttggca 2820 gcttaatgtt cttgatggtc tgcacacgtc cagacatttc atatgctgtc ggacaagtag 2880 ctcaattctg ccacgatcca aagcaagttc actggtcagc tgtaacaaga attctgtcat 2940 acttaaaagg gacgtcacaa tttggagtgg tctacaaagt cggagaaaaa ccccaggtat 3000 taaccgcata cgcagattcg gactacgctg gcgacactga ttcccgcaag tcaacttctg 3060 gatttcttct catctttaat gggggaccaa ttgcgtgggg cagtagacgc caatcttgtg 3120 tatctctatc gaccaccgaa gcggagtacg tagccatgtg cgaggcgaca aaagaaatag 3180 tttgggcacg ccaattactt gacagtattg gctgcgtaca gacacaacca acagctttat 3240 ttggcgataa tcaaggcgca gtaaaactca cactgaatcc agaattccat cgtcggacta 3300 agcatattga tatacgttat cactatatcc gtgagcagca ggtgagcggc aacattgcag 3360 tcgtccatat cggaacaaaa gaccaactag ccgatctact gaccaaggcg cttccgggcc 3420 cagcttttcc agagctcaga acaagaattg gagtagttcc ggtttgagtg ggga 3474 // ID Tx1-5_BF repbase; DNA; INV; 5449 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-5_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-5_BF; KW Tx1-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5449 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5449 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 842-842 (2009). XX DR [2] (Consensus) XX CC Elements from this family are inserted preferentially into U2 CC snRNA. XX FH Key Location/Qualifiers FT CDS 19..1644 FT /product="Tx1-5_BF_1p" FT /note="ORF1." FT /translation="MSKLARYQSVKITFLELHHHQLKDVIDLLTKHGVKSM FT EIQSVQSKPNNDTTEVTFTTKAVLQRVVPSLSSDRSIDVETFGTGITVVTA FT RGIPFEFEDNYIRLRLKEYGMVLDTKYLTYANLGFPHIYTGTRQYRMKIQK FT HLPNTVRLGSDIVSFNYAGQPRSCHRCGSTAHFVADCPETKCGKCWELGHV FT AKDCNNQMKCSVCLEAGHSARTCHKSFASVVKPSSSWATQAKPAPGTSNAG FT PGVPKTKLVSSVDSESEESGQEDMLEEETTPHKLVGGPTAANPGSETKSPL FT KVVAADKARKSKGDEGKSDDKVEEEVAKSDDKTSPPQVVKTPVRAVTPPSE FT SPPKGEEGFWDLGSLPEDGANKMPEDEEEVEEMEIDKPTTTFKLALSASLA FT EELSKNLTSPGFTIGEGEHGPELTMNLDEPLIIDESDREENAPKRPHQSDP FT SDSDDSDTSAKNPKGKLTGKKVKKDSENTTKFGSQIDLFASSGETQSKEPP FT NSSKAKKSGLMAGAIKGAGNKGGGNKGAGNKGAGNKGGKTSNVKPR" FT CDS 1669..5139 FT /product="Tx1-5_BF_2p" FT /note="endonuclease and RT." FT /translation="MTDLNIVSLNVNGMKDRDKRQAVFDFCRTRKVDVACL FT QECHVSSLADKMYWSRQWGNKAIWSLGTNSAKGVGILFGPGISVVAHRSDT FT EGRVVSALVKLDDTNYNIVNIYAPSVPTQRVEFFAELHQYMFPNASLIVCG FT DFNCVLDPILDRCSGSSSSPRDNVRDVVELRGFCEDLGLVDVWREQHANQR FT EFTWSSKSCETRSRLDRFYVSDPVHCSSEIRTFPMSDHDAVFLSVTHTPRV FT ERGAGVWRCNTEVLKDGIFIEEFSERYEEWRARKGSHKCLRDWWEEVKNNT FT KDLLISHSKRRAKAATLIQRDLEKKIDMLRSCLNNGGTCPATVHDYQTAKE FT QLKKLLSDKLAGQRVRSRIQNFEKDEKPTRFFFSKERKRGEKKMVRELRTA FT DGEIVTSREEILETFHDFYANLYSADSINSEDKNYFLDKLVPTLPDTDRIR FT LDQPLSLEEMEQAVKEMQNGKTPGSDGLPKEFYTQFWAIVGQDLLQVLNEG FT LEENQLSPSQREGVITLLDKKGDPLNPANKRPISLLNVDYKILAKTLANRL FT KVAVGEVIHTDQSCGIPGRSLEDSLSLLRDIVAYTNSTNSTCVLLALDQEK FT AFDRVDHSYMAEVLEKLGFGPKFQSWIGTLYNSVSSRVLVNGDLSSPIHIE FT RGVRQGCPLSPLLYVLCIEPLAAAIRADPQIRGVKVPGGQEVKLVQYADDN FT TCVLSDQPSIDRAFHTIRRFESGTGSKLNFGKTEAVWLGRWRGRQDKPYPI FT GRWTSDSIIILGSPMGGERMAEEAWLQRFAKFKAKLDQWRNRKLTLIGKVV FT VCNSLAAATLWYTAPIFPLPKSIEKKLEKEMFAFIWDNKTELVARRTLYLP FT KEKGGLNLVCIPVKAQALLLKSVRKALTTPDAPAAKFTLYWLGFSLRRIDP FT ASWSNNAPHSVDPPPHYAQIAKILQNVQQSNIQVEWAAVAVSSLYSSLLEA FT EDFVPRCVRENPSLDWPEIWQAILNPLLTNWERMVCWNIAHDGLVTNKKLY FT SWRKFSKTSKCPRRGCDEVESISHVFLECAHVSEMWTWLEWLIARKICPNF FT VLSNKFVLWGLAPDGSSMKTRRVLGALSAISKNLIWRSRGDAKHNKKHHSS FT AELALLLKETLAERLVFEFTRLGPAGFYTNWAEGYSWADVDVAHLSLKF" XX SQ Sequence 5449 BP; 1502 A; 1232 C; 1491 G; 1224 T; 0 other; gggttcctgg tactcaggat gtcgaagttg gctagatatc agtctgtgaa aatcacattc 60 cttgagctcc accaccacca gctcaaggat gtgatagatt tactgaccaa acatggagta 120 aaatccatgg aaatccaaag tgttcagagc aagccgaaca acgatacgac tgaagtcacc 180 ttcacgacca aggcggttct gcagagggta gtgccgtctc tgtcgagtga ccgctcgata 240 gacgtggaga cgtttggtac ggggatcacg gttgtcacag ctcggggaat cccctttgag 300 tttgaagata actatatcag gctgagactg aaggagtatg gcatggtact tgatacaaag 360 taccttacat acgctaactt gggattcccc catatctaca cggggactcg ccagtatcgg 420 atgaaaatcc aaaagcacct gccaaacacg gtcaggctgg gtagtgacat cgtgtcgttt 480 aactatgcgg ggcagccgcg tagctgtcac cgctgcggca gtacagccca cttcgtggcg 540 gattgccccg aaacgaagtg tggaaaatgt tgggagctag ggcatgtggc gaaggattgt 600 aataaccaga tgaagtgctc ggtatgccta gaggcgggtc attccgcaag gacttgccac 660 aagtcctttg ccagcgtagt caagccatca agttcatggg ctacgcaagc gaagcccgct 720 cctggcactt ccaatgcagg tcctggcgtc cctaagacca aactggttag ctcagtcgat 780 agtgagagtg aggaaagcgg tcaggaagac atgctcgaag aggaaactac tcctcacaag 840 ttggtagggg gtcccactgc tgctaaccca ggtagtgaaa ccaagtcccc tcttaaggta 900 gtagccgcag acaaagccag gaagagtaaa ggggatgagg gcaagtctga cgacaaagtg 960 gaggaggagg ttgctaagtc ggatgacaaa acctctcccc cccaggtggt aaagaccccg 1020 gtcagggctg tcacaccacc gtccgaatcg ccaccgaaag gtgaggaggg gttctgggat 1080 ctgggctccc tccctgagga tggtgccaac aaaatgccag aagatgagga agaggtggaa 1140 gaaatggaga ttgacaagcc caccaccacc tttaagctgg ccctgtcggc atcgcttgcc 1200 gaagaattga gcaaaaactt gaccagtcct ggcttcacta taggcgaggg tgagcacggt 1260 cccgagctta ccatgaacct ggacgagccc ctcatcatag acgaaagtga tagagaggag 1320 aacgcgccta agcgtcctca tcagagtgac ccgagcgatt cggacgactc ggacacctcg 1380 gcaaagaacc ctaaagggaa actgacagga aagaaggtca aaaaggactc cgagaacact 1440 acaaaattcg ggtcccaaat tgacttgttt gcctcatccg gggagactca gtctaaggag 1500 cccccaaaca gctccaaagc gaaaaagtca ggcctgatgg caggtgccat caaaggtgct 1560 ggaaacaaag gtggcggcaa caaaggtgct ggcaacaaag gtgctggaaa caaaggtggt 1620 aagactagca atgtcaagcc acgctaagac cccttgaccg acctaaaaat gacggacctc 1680 aacatagtgt cactcaacgt aaatggaatg aaggacagag ataaacgcca ggctgttttt 1740 gatttttgtc ggacaaggaa agttgacgtt gcctgtcttc aggagtgtca cgtctcctcc 1800 ttggccgaca agatgtattg gtcacggcaa tgggggaaca aagccatttg gtctttgggt 1860 acgaactctg caaagggtgt agggatactc ttcggcccag gtatttcagt tgtggcccac 1920 cgctcagaca cggagggccg ggttgtgtcg gcgctagtta agttggacga tacaaactac 1980 aacattgtta atatttatgc cccaagtgtc cctacacaaa gggtggagtt ctttgcggaa 2040 cttcatcaat acatgttccc gaatgcatct ctgattgttt gtggagactt taattgtgta 2100 ctcgacccta tccttgacag atgctctggg tcgtcttctt ctcccagaga taacgttagg 2160 gatgtggtag agctcagggg tttctgtgag gatcttggtc tggtcgatgt ttggagagaa 2220 caacacgcaa atcagaggga gttcacatgg agctctaaat catgtgaaac tcgttccaga 2280 ctggacagat tttatgtgtc agaccctgtg cattgctcgt cggagataag aacattccca 2340 atgtctgatc acgacgcagt atttttgtcc gttacccaca ctcccagagt agaacgcgga 2400 gctggggtgt ggcgatgtaa cacggaggtt ttgaaggatg gcatttttat tgaagaattc 2460 tctgaaagat acgaagagtg gcgagcccgg aaaggcagcc acaagtgtct cagagattgg 2520 tgggaggagg tgaaaaataa caccaaagac ctcctgatta gccactcgaa acgcagagct 2580 aaggcagcaa cgttgataca gagagacctc gagaagaaaa tagatatgtt gaggtcgtgc 2640 ctgaacaatg gtgggacatg tcctgcaacg gttcacgatt atcagacagc aaaagaacaa 2700 ctgaaaaagt tgttgtcaga taagttagca ggacaacggg tccgtagcag aatacagaac 2760 tttgagaagg atgaaaagcc gaccaggttt ttcttcagca aagaaagaaa gaggggcgag 2820 aagaaaatgg tcagagagtt gaggacggct gatggcgaaa tagttacgtc aagggaagaa 2880 atcttagaaa ccttccatga cttctacgct aacctctata gcgcagattc cataaatagt 2940 gaggataaga attatttcct ggataaacta gttcccactc tccccgacac agaccggatt 3000 cggcttgacc agccgctttc gttagaagag atggaacagg ccgtcaagga aatgcagaat 3060 gggaaaaccc cggggtcaga tggtctcccc aaagagttct acactcagtt ctgggctatt 3120 gttgggcaag acttgctaca ggttctcaac gaaggtttag aagagaatca gttgtctcca 3180 tcccaaaggg agggtgttat cacactcctc gataagaaag gggacccttt gaacccggct 3240 aacaagcgcc cgatttcgct actcaatgtg gattacaaga tcttggccaa aactttggca 3300 aacaggctga aggttgccgt gggggaggtt atccatactg accaatcctg cggcataccg 3360 ggaagatctc tggaagacag tctgtctctg ctaagagaca tagtggcgta caccaacagc 3420 actaactcta cctgtgtgtt gttggcccta gatcaggaaa aagcctttga cagggttgac 3480 cacagctaca tggcagaggt cctcgagaag ttgggttttg gccccaagtt tcaaagctgg 3540 ataggcaccc tgtacaactc ggtttctagc cgagtgttag tcaacgggga cctgtcctcc 3600 ccgatccaca tcgaacgggg agttaggcag ggttgcccgc tatctccact gttgtatgtg 3660 ctctgtattg agccactggc agccgccatc agagcagatc cccaaattag aggtgttaaa 3720 gttccgggag gtcaggaagt caagcttgta caatacgctg acgataacac ctgcgtcctt 3780 tcagaccaac cgtcgattga tagagccttc cataccatcc gaaggttcga gtctggcaca 3840 ggttccaagc tgaactttgg aaagactgaa gccgtttggc taggcaggtg gagaggcagg 3900 caagacaaac cgtatccgat agggcggtgg acctcagaca gcatcatcat actggggtcc 3960 cccatgggcg gtgagcgaat ggctgaagag gcttggctac aacgttttgc caagttcaag 4020 gctaagctgg accagtggag aaataggaaa ctgaccctga ttgggaaagt ggtagtttgc 4080 aattctctag cagcggctac gctgtggtac acagcgccca tatttccttt gccgaagtcc 4140 attgagaaaa agttggagaa agagatgttc gcattcatat gggacaacaa gacagaacta 4200 gtggcaagaa ggactctgta tttgccgaag gagaaggggg gcctaaactt ggtttgcatc 4260 ccagtgaagg cgcaggcact tctcctcaaa tctgttcgaa aagctctgac taccccggat 4320 gccccggcgg caaagttcac cctttattgg ctagggttca gcttgagaag aatagacccg 4380 gcttcgtgga gcaacaatgc cccacactca gtggacccac caccgcacta tgcacaaatt 4440 gccaagatct tgcaaaatgt tcaacagagc aacattcagg tggaatgggc tgcagtcgct 4500 gtctcgtcac tgtactcctc ccttcttgag gcagaggact ttgtgccacg ttgcgttagg 4560 gagaacccaa gtcttgactg gccagagatt tggcaagcca tcctcaatcc ccttctcacc 4620 aactgggaga ggatggtttg ctggaatatc gctcacgatg gcttagtgac taacaagaag 4680 ctctattctt ggcggaagtt ttctaagacg tccaaatgcc cgagacgagg ttgtgacgaa 4740 gtggagagca tctcccatgt tttcttagaa tgtgcccacg tgtctgagat gtggacatgg 4800 ctggagtggc ttatagctag aaagatctgt ccgaactttg ttctttctaa caagtttgtg 4860 ctttggggct tagctccaga tgggtcatct atgaagacaa ggcgggtgtt aggagccttg 4920 tctgcaatct ctaagaacct gatttggcgg tcgagaggtg atgcaaaaca caacaagaag 4980 catcactcct ctgcggagtt ggctctcctc ctgaaggaga cgttagctga gagactggtt 5040 tttgagttca cgcgtcttgg cccggctggt ttctacacca actgggctga gggctattct 5100 tgggcagatg tagatgtcgc acatttgtca ttgaagttct agaggacttt atgaattccc 5160 tatattaggg acacagaaaa agagaaaatc caaaaaaaat ataaaataga aaacccaaaa 5220 acagtagatt agtggggggg atatgggaaa ggggatggac caagagtggt gggtgggagt 5280 gaggtgctgc gctatccgag tcgatctgtt tcagaattgg tttgatctgg gtccccggtg 5340 ggggatgtat caagagatat gtaagccggg gagtgtttat cctccctcgg tatagactgg 5400 taatgaacaa agtaatgctc gacaagtata aactactttt tgtcaaaag 5449 // ID Gypsy-196_AA-LTR repbase; DNA; INV; 1078 BP. XX AC supercont1.70; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-196_AA_; KW Gypsy-196_AA-I; Gypsy-196_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1078 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.70; Positions 434311 433234. XX SQ Sequence 1078 BP; 320 A; 250 C; 238 G; 270 T; 0 other; tgttaccgat ctaaaaaaaa tcggaacaaa aagcattcct ttgaccttaa atttgctacc 60 ttatcctatt taaatttaaa agattatttg aatgaagtaa ttgaaattaa aaccaaatat 120 attaaaaaaa tattatccaa aaaaatctgc gagtagttta ttacattcca actgttagaa 180 ttgttaaaat taatccgtga attgaaacac cgttgtacta aaattattta tggaggggag 240 acggtaaatt caccaccaga cgaggatttt ttggatatcg ccacaacact ggatccggga 300 aagaggagga tggaaatata gcggagaaaa agggagattt tttaggaggt tgaggtggat 360 gtcgaaagaa agccagggat cagtatcgct tgaaccagta agcaagtcac aagtcctcta 420 aaaccgtatc tttgctacga agtcattaga tcaccgtgtg gaaagcgcgt ggacgcgatc 480 gggtagtgcc accaaaaagt gaagtgcctc agactcgcaa gccaaagcca attagttgag 540 gtcgaggatt caatcggcta ccctaaggat cgtatagaga ccctcgtcac gctgtcaatc 600 ggattacgac cggtcacact gtagcagacc accagtgggt agaatccctc tagagacggc 660 gcacatcccc gtttcgcctg ttacgtcatt tgtcgtcacc gccccgtcgt cgcttgaaca 720 tcaccttgag catcacagaa cccacggcag ggttgctgca ggtacgttcc atcctcctct 780 ccccacctcc gcaccagggc cccaacgtca tcgaacagcg ctgcaccgaa taccgtaccc 840 tccagaaatt caaataaaag aagaccttgc aattcaattc tctcagctct ctttaatgct 900 tgttctagga aagagtcatt ggatgtagcc tgttccgagt cgttgatcag tccactttgt 960 actatcggta agtgcacgtt gttgtgagtc ctcccgaggt cggtggccga ccctgagatt 1020 taatagccaa gaggtaagct agccggtctg cgacgtgcct acgtcaagga tactttca 1078 // ID SAT-3_NVi repbase; DNA; INV; 152 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Nasonia vitripennis satellite repeat. XX KW SAT; Satellite; Simple Repeat; Nonautonomous; SAT-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-152 RA Bao W. and Jurka J.; RT "Satellite repeats from Nasonia vitripennis."; RL Repbase Reports 9(4), 801-801 (2009). XX DR [1] (Consensus) XX SQ Sequence 152 BP; 48 A; 27 C; 27 G; 50 T; 0 other; acgcggtgga tcgatctgat ccttgcatag ttgtactcaa cagctcgagt tgcgttagat 60 taccgtacgt ttatcatagg tttgatatta gtaaacaaag aaaccaactg attttgactc 120 aatttataat aaaacgtgaa tatctctaat ct 152 // ID BEL-97_AA-I repbase; DNA; INV; 6199 BP. XX AC AAGE02034155; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-97_AA_; KW BEL-97_AA-LTR; BEL-97_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02034155; Positions 6448 250. XX CC Positions [5240-5758] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 371..3670 FT /product="BEL-97_AA-I_1p" FT /translation="MATVDPPHASQDHDCGACNQPNDADPQMVQCDACQVW FT FHLKCVGETPGVENRSFNCRTCQPPTDINKKSKKTRSQKGAGENRLKVPMV FT ATSGVTPKKHPEVPNVATNKASTIKSISSISRSRALTLQRKIETEQLLAEM FT KLAEAEKRLEEDRIMQERQRALREEKNRLQEELLRKMQELDEIVDEEKGSD FT YTSTTGIRKAKEWLASQRQQDDNSVSRSVPPFLPSKKSERSSHHTEERKEL FT NSPDASDHEAFEDQEEQVPENAFRRLSIERFQAIQNPEGVGPTANQIAARQ FT VWPKKLPIFSGEPEEWPIFYSSYEGANATCGFSDVENIIRLRECLRGPARE FT AVVSKLMFPRSVPSIIETLRRLYGRPELMVKNLLGKVRRLEAPKPERLDSL FT INFGMTVQQLSDHLIAADLQSHLANPTLMEELVDKLPAAYKLEWVRYKRAY FT AMPTLKEFAKFMDVLVADASEVTVLTQPKIEKIKHEKGKHDQKGHVYAHND FT DEVDKFPSRIPCPICNGTDHRVRNCEKFQRMKLESRLAAVERCNLCEVCLF FT NHSAKCRSRVRCNIGNCRERHNPLLHQAEKRTTTASRVVDAECNAHGPVTG FT SVLFKVVPVTLYNGDRRIDTFAFVDEGSSRTLIESSLARSLGLSGETVPLK FT LIWTSNVTRTERTSKLVDVLISARGQNQRFSLKGAHTVGELNLPKQSLSME FT ELTRRFNHLRDLPALSYLDAEPKILIGLRNLEIFTPLETRIGQPGDPVAVR FT SALGWVIYGPCRSDKNENDFIGVHDCSCKTNEALNELIRQQFILEEAVVCV FT TPLPESADDKRARHILDNTTRLAGDRYETGLLWKTDEVRFPNNWPLAVKRL FT KSLEVKLARDPDLRANVHQQIRDYVEKQYAHKATDHELASADPNRVWYLPL FT NVVTNPRKPEKKRLVWDAAAQVDGVSLNSQLLKGPDLLNSLPSVICVFRER FT PIAFGGDVKEMFHQVRIIPKDRHSQRFVFRFDSSQPPEIYVMDVATFGATC FT SPCSVQHVMRKNALEYATVFPDAVTAIINKTYMDDYVDSANTTDEAIIRAK FT QVRDIHARGGFEMRNWVSNSHDVLQALGEGKNQEPVVLGGITQ" FT CDS 4130..5758 FT /product="BEL-97_AA-I_2p" FT /translation="MAKSKVAPLQHQSIPRLELQAAVMGARMLKCAIECHT FT LAIGRCFLWTDSSTVLSWIRSDNRKYKQYVAHRVGEILSHSQITDWRWVPT FT KENVADCLTKWGRHTDPENMGLWLNGPAFLYGPEETWPQQRKIALNVPDEL FT RPCYLLHHIAIPQELIKVENISKWKVLLRTMAMVCRFINNCRLRIKGLPVE FT TIKATNLQLKCLQRSIPVISVPLQKNEYERAESILWRIAQADTFPDEVKVL FT MKNRDSPPENLVSIERTSPLYKLSPFADEFGVLRVDGRTANAGYASFDSRF FT PIILPKDHVVTFRLINHYHCQYGHANKETIVNEVRQRFFIPSLRSVVNKVM FT RACQRCKIKKCQPQSPRMAPLPVQRLTPYVRPFCYVGVDYLGPLEVTVGRH FT KEKRYVVVFTCLVVRAVHLEMAYDLSSESCVMAIRRFVRKRGPPVQIFSDN FT GTNFVGAKCELAQQIRQINDKCANTFTDAKTKWTFNPPAGPHMEGVWERMV FT RSVKQSMRALDDGRKLNDEILLTVLAETEWFINSRPLTYMPQECGND" XX SQ Sequence 6199 BP; 1835 A; 1354 C; 1524 G; 1486 T; 0 other; ctgatatgaa tttaattaaa ttggtttatt tgcccagaaa ctgtgaatta tactgtgcaa 60 ttgtattcgg gttagctgta gtcctctgaa ataatagagc tcgataatag gtaacattaa 120 tctatatctt attgaattag cccttaatta acgtgctttt ctctaaaagg gacaatcaat 180 ctttgtacgg aattgttcag tataggccgt aaattggaac aatacgagaa gactatcgta 240 agttaatgac tatcaatcaa ctatatttaa ataataaaga ttatcgtttt agctttaagc 300 ctcgtaagat aaaaaggctg tcaaggttct tctacatccg aacaatctta aaggatatca 360 ctcgactctg atggctacag tagacccacc ccatgccagt caggaccacg attgcggggc 420 ttgcaatcaa cccaatgatg ctgacccgca gatggtccaa tgcgatgcat gccaagtttg 480 gtttcatcta aagtgtgtcg gtgagactcc cggcgttgag aacagatcgt tcaattgtcg 540 cacgtgtcaa ccgcccaccg atatcaacaa gaaatcgaag aaaaccagat cgcagaaggg 600 tgctggtgaa aatcgactta aggtccccat ggtggctacg tctggagtga ccccgaaaaa 660 gcacccggag gttcctaacg ttgctacgaa caaagctagt acgatcaaat ctatttcgtc 720 gatatcgcgc tcccgtgctc taactctaca aagaaagata gagacagagc aactcttggc 780 ggagatgaaa ctggctgaag ctgaaaagcg gctcgaggaa gatcgtatca tgcaagaaag 840 acagcgagct ctgcgagaag aaaaaaatcg attgcaagag gaattattac ggaaaatgca 900 agaacttgac gagatcgttg atgaggaaaa gggctccgac tacaccagta ccaccggaat 960 acgtaaggct aaagaatggc tagcgagcca acgacagcaa gacgataaca gtgtgagtcg 1020 ttcagtacca ccatttcttc cttctaagaa gtcggaacgc agctcccatc atacggagga 1080 gcgcaaagaa ttgaactcac cagatgcgtc agatcatgaa gcattcgagg atcaagaaga 1140 acaggtacca gaaaacgcat tccggaggct gtctatagag aggtttcaag cgattcagaa 1200 tccggaaggt gtaggaccta ctgctaatca aattgccgct cggcaggttt ggccaaagaa 1260 attgccgata ttttctggcg agcctgagga gtggccaatc ttctatagca gctacgaagg 1320 ggcaaacgcg acttgtggtt tctcagatgt agagaatatc attcgccttc gagagtgtct 1380 tcgaggacca gcaagagaag cagtcgtctc gaagctcatg tttcctagga gtgtgccatc 1440 gatcatcgag acattacgac gattgtatgg acgacctgag ctgatggtaa aaaacttgct 1500 aggcaaggtt cgtcgcttgg aagcacctaa acctgaaagg ctggattctc tgataaactt 1560 tggcatgacg gtgcaacagt tgagtgacca cttgatagca gctgaccttc agagccacct 1620 tgccaatcca acgctgatgg aagagctagt tgataaacta ccagcggcgt ataaattgga 1680 atgggttcga tataaacggg catacgcaat gccgaccttg aaggaatttg ccaaatttat 1740 ggatgttttg gtggccgatg cgagcgaagt aactgtcttg acacaaccga agatcgaaaa 1800 aatcaagcac gaaaaaggga agcacgatca gaaagggcac gtctatgctc acaacgacga 1860 tgaagtagac aaatttcctt caaggatacc gtgccctatc tgcaatggaa cggatcatag 1920 ggtgcgaaac tgcgaaaagt tccagcgaat gaagctggaa tctcgtcttg cggcagtaga 1980 gaggtgtaat ctctgcgagg tatgtttgtt caaccacagt gcaaagtgtc gatcaagagt 2040 acgttgcaac atcggaaatt gtcgagagcg tcataatccg ctactccatc aagctgaaaa 2100 aagaacaaca accgcttcga gagtcgttga tgcagaatgt aatgcccatg gtccagtgac 2160 tggatcggtc ctattcaaag tagttccagt cactctgtac aatggagatc gaagaattga 2220 cacgttcgcc tttgtagacg aagggtcttc tagaacgttg attgaatcga gtttggcaag 2280 gagtcttggg ctttcaggtg aaacagtacc gttaaagtta atttggacct cgaacgttac 2340 gaggacagag agaacttcaa aactagtgga tgttttgata tcagcgcggg gacagaatca 2400 acggtttagc ctaaaaggcg ctcacaccgt tggtgaattg aatctaccca agcagagtct 2460 gtcgatggaa gaattaacta gacggtttaa tcacctgcgt gacctacctg ctctctcata 2520 ccttgacgcc gaaccgaaaa ttctaattgg cttacgaaat ttggaaatat ttacaccgct 2580 tgaaactcga attggtcaac caggcgaccc agtggcggta agaagcgccc tcggctgggt 2640 gatttatggt ccatgcagga gcgataaaaa tgaaaatgat ttcataggcg ttcacgattg 2700 tagctgcaaa acgaatgagg cattgaatga attgatacgc caacagttca ttttggaaga 2760 agcggtagtt tgtgttacgc cattgccaga atctgcagac gacaaacgcg cccgtcacat 2820 tcttgataac accactcgac ttgccggtga tagatatgag acgggactcc tctggaaaac 2880 tgatgaggtg cgtttcccga acaattggcc tttggcagta aagagattga aaagtctaga 2940 agtaaagctt gccagagatc ctgatctacg cgccaatgtg catcaacaaa ttagagatta 3000 cgttgaaaag cagtacgccc acaaggccac cgaccacgag ttagcgagcg ctgatccgaa 3060 tcgtgtatgg tatttgcctc tcaacgtggt gacaaatcct agaaaacctg agaagaaaag 3120 acttgtgtgg gacgcggcgg cacaagtgga tggtgtgtct ctaaattctc agctcttaaa 3180 aggaccagac ctgttgaatt cactgccatc cgttatttgc gtcttcagag agagaccgat 3240 tgcctttgga ggcgacgtta aagaaatgtt ccatcaggta cggataatcc cgaaggacag 3300 acactctcag cgattcgtct tcagattcga ttctagtcag ccaccggaga tatatgttat 3360 ggacgttgcg acgtttgggg ccacatgttc gccctgctcc gtacaacatg tcatgcgtaa 3420 aaatgcactt gaatacgcaa cggtgtttcc cgatgctgta acagcaataa ttaacaaaac 3480 ctacatggac gattacgtcg atagtgccaa cactaccgac gaagctatca ttagagcaaa 3540 gcaggtacga gacatacatg cacgtggcgg atttgagatg agaaactggg tgagtaacag 3600 tcacgacgtg ttacaagcgc tgggagaagg caagaatcag gaaccggttg tgcttggcgg 3660 cattacgcaa tagaaatggg agcgtgttct aggattgatt tgggatcctc agggtgattt 3720 cttctccttc tcaactcgat tccatggaga cttggaacca tatgtatctg gaaaccgtcg 3780 tccaacaaaa cgaattgttg cccgctgcat aatgagcttg tttaatccaa tcggcttgct 3840 agctccattt tcgatacatg gaaaaatgct cattcagaat ctttggagaa gtggcactat 3900 gtgggatcag gatgttttag atgaagaatt tggaaaatgg agacgctgga tacaactact 3960 ccccggaatc ggacgtttga gaatccctcg tccttacttt ggagatgcaa gcccaaataa 4020 gttcaactcg ttccaattgc acatatttac cgatgcgagc gagttggcta tgggttgtgc 4080 ggcgtatttt tgagcagttg atgaacaagg tgtacgctgc gcattgataa tggcgaagag 4140 caaagtcgct cccttacagc accaatcaat accgcgactg gaattacagg ctgcggttat 4200 gggggcaaga atgctcaagt gtgcgataga atgtcacacc ctcgcaattg gccgctgttt 4260 tctatggacc gattcgagca ctgtgctatc gtggatacgt tcggacaatc gaaaatacaa 4320 acaatatgtt gcgcatagag taggagaaat cctttcgcat tcgcagataa ccgactggcg 4380 ttgggtgccc acaaaggaaa acgtagcgga ttgtctaacc aaatggggaa gacacactga 4440 ccccgaaaac atgggtttgt ggttaaatgg acctgcgttc ttatacggcc cagaagaaac 4500 atggcctcag caacgaaaga tagcactgaa tgtcccggat gaactacgac cttgctatct 4560 actccaccac atcgcaattc cgcaggaact aataaaagtg gaaaatattt caaaatggaa 4620 agtattgctg cgaactatgg ctatggtgtg ccgtttcatc aacaactgtc ggctgcgaat 4680 taagggccta ccagttgaaa ccatcaaagc aacgaatctt cagctcaagt gtctgcaaag 4740 atccattcca gttatcagcg tacccctcca gaaaaacgag tacgaaagag ctgaatccat 4800 tctgtggcgc atcgcgcaag cagatacctt cccggacgaa gtaaaggtac tcatgaagaa 4860 ccgtgattcc ccgcctgaga atttggtgag tattgaacgc acgagccctc tgtacaaatt 4920 atctccattt gcggatgaat tcggagtctt acgagttgac ggtaggactg ctaacgcagg 4980 ctacgcttcc ttcgactcta ggtttcctat aatccttccg aaagaccatg tggttacatt 5040 tcggcttatt aatcactacc actgccaata cgggcatgca aataaagaaa caatcgttaa 5100 tgaagttcgc caacgattct tcataccaag tttaagatcg gttgtgaaca aagtaatgag 5160 ggcctgtcag cgatgtaaaa ttaaaaaatg ccaaccgcaa tctccacgaa tggccccgtt 5220 accggtacag cgactgactc catacgtaag accattctgc tacgtaggtg tggactacct 5280 tggccctttg gaagtcaccg ttggacgtca caaggaaaag cgatacgtcg tcgtctttac 5340 ttgccttgtg gttagagccg tccatttaga aatggcctat gacttgtcca gtgagtcatg 5400 tgtaatggca attcgacgtt tcgtccgcaa gaggggacct ccagtacaaa ttttctccga 5460 caacggcacg aattttgtcg gggccaaatg tgaattggct cagcagatta gacagatcaa 5520 cgacaaatgc gctaacacgt tcacagatgc caagaccaag tggacgttca accctcccgc 5580 tggtccacat atggaagggg tgtgggagcg tatggtgcgc agcgtaaagc aatccatgag 5640 agctctcgat gatggccgta agctcaacga tgaaatcctg ttaacagttc tagcagaaac 5700 agagtggttt atcaattctc gcccgcttac atacatgcct caggaatgtg gaaatgatta 5760 ggctctgact ccgaaccact ttattcttgg gaactcatct ggttcgcacg aaccgattat 5820 aaatcccact agtcttgcag caactttacg cagcagttat cagagaagtc aatacttatc 5880 tgacgcactg tggaagagat ggataagaga atacttccct accatcaatc gtagatccaa 5940 atggttcgac gacgtacgtt ctgtaaaagt gggagatttg gtctacgttg cggatggaga 6000 tcgcagaaca tggatacgag acaaagtaga agaagttatc caggggcgtg atggaagagt 6060 aaggcaggcc atcgtgaaga cggcaaatgg aaatggaaga cttaaacgtc ccgcggtcaa 6120 actggcagtg atggaggttg gagatagtgt atccggcgac cttcctaatg agcagcatcc 6180 ggatccacgg ggcggggga 6199 // ID Galileo_DW repbase; DNA; INV; 4386 BP. XX AC BK006360; XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 28-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Drosophila willistoni transposon Galileo, complete. XX KW P; DNA transposon; Transposable Element; Galileo_DW. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4386 RA Marzo M., Puig M. and Ruiz A.; RT "The Foldback-like element Galileo belongs to the P superfamily RT of DNA transposons and is widespread within the Drosophila RT genus."; RL Proc Natl Acad Sci U S A 105(8), 2957-2962 (2008). XX DR EMBL/GenBank/DDBJ; BK006360; Positions 1 4386. XX FH Key Location/Qualifiers FT CDS 1047..3698 FT /product="Galileo_DW_1p" FT /translation="MRSYRDGSEHFPVRMFKFPKNEFVRRQWVSKCNLQYD FT VNIDRALICNLHFEKKFLGTKFLKAGAIPTLLLTDEPNLNLIATDAKIDLY FT DFCEETHEEISTFQNIRKHGAEPTNILENIDNNTEKVEDISDISTDCMQFC FT SNCLKKEQNEAYYRKKCFEMSENLKKEIQKVKLCNKKIRHLRNVLRNERAR FT KLKYLKEKKKIDIQGLIDKKCVSKNSNTVCKMLLKNKQSWEDEEKIIAQSI FT NFYSSKAYNFMRDDLELNLPCNKSLQRWAPVRNMVPGLNENLLKHLKGIFL FT KMHNKSKNSVLVFDEISIRKGLQYNSHRDEVEGFVDDGIEKTDALCKQICV FT FMVRGLYDNWKFVLSYVATSTGLSSSILTELVNTNIRSAKSLGLIIRAVVC FT DQGPNNRGAFNKFGIKNETPYFTVDGQKIFGIYDVPHLIKSLRNILMRNYI FT NTPDGRVSWQVIVKLFEIDTKNTSARMCPKLSRKHIYPNSFEKMKVKYATQ FT VFSQTVASALKTLIQNGTFHDCEDVAIATCKFIEKINKLFDCLNSNNLFDK FT NPFKSAIQKESDIEKYIIEMKNYLKKCQYPKKIFCMDGIILSINSILMLLQ FT DIWSQGEGVYFLLLSRLNQDALEHLFYLIRSRGGTNNNPMLFEFNAIISKM FT LSMKIITSRTTTGNCQADEDLLINVIEDTKHELAIDNINDQDRVCYEDVNI FT SFDLYDDDMEGTEADEIQDTTLSIASANALRYFTGFVIHKSQQKFNCENCK FT ELVKENIVLYDQSEFFIFNKNYKILNNNLKLKNPQDDFFSLMKMHYNIFHN FT FFQKFPHARNIRQQIFNECIMRAENDKKYEDWYCESSKCIEHRKYILNYFL FT VVLLKKNTIWLLEKLCGASEKSVRKIEILKS" XX SQ Sequence 4386 BP; 1541 A; 720 C; 790 G; 1335 T; 0 other; cactaaccat agaacacata gatgggacaa actcatgatt ttttgattgc aaacgctagc 60 ctagtcatgg aaccccagag aacttcggga aaacccgaac gctctcactc tcaatgagag 120 cgtaaaatgg ggagagggca aagcagctac gaaaaaattt cagtctagca cagactaccg 180 aacgacgaag gcaggcctac gtcgcttgtg atttcgttct gtctatgtat gtgtacactc 240 gtcggtacat gctcaggctt cgtagtgtgt gtgtgacgtg aagttgtaca acatcgcatt 300 tcaatagctc gctcgaccaa aatttttcat tttcgacaaa acagcggcaa tgcgaatagg 360 atatgcccct gtcgaaagaa tatcaagaaa tattgacatc agctacgtat gtatcccggt 420 ggctgtgccg atagagtgtt tgcttggcaa taggaatgtc gctggttcga atcccaactc 480 gattatcgtt aacgaagttt ttttttgtct atttttttta tttatataaa aataaaaaaa 540 taaaaataaa caaaaggaaa aaaaaacaaa atgaaaataa ataaaaaaaa aatcaaaaag 600 gaacaaaata aaacaaaaaa aaaaacgaca aaaccaacaa aaaaaaaaaa caaaacaaag 660 aaacattatt tatattatta ttaggattac cccctaagac ttcaaaattt ctattttccg 720 aacgctcttt tcagatgcac cgcataattt ttccaaaagc catattcaac actactaaaa 780 tgtagtttgg gtactctaaa aaaaataccg gctttactac tgcgttttgg acaaaaatat 840 ggggatgttc ttgctcctct gttcctcttg cattattttt cacagcgttg tcaacgtttt 900 gccgttttag ttcattctct gtgcgactgt tgaacttcga gcatcatcac ctttggcata 960 tactcaagtt taaacagaga aaaacgatta aaaaccaggt cgttgataat agaaaaagtg 1020 gctgcaagtg cattattgcc tcatgcatga gaagttatag agatgggtca gagcactttc 1080 ccgtaagaat gtttaagttc cccaaaaatg agtttgttcg tcggcagtgg gtttctaaat 1140 gtaatttgca atatgatgtg aatattgata gggccttgat ttgcaactta cactttgaaa 1200 aaaaattttt gggtacaaaa tttttaaagg caggtgctat accaacttta ttgttgaccg 1260 atgagccaaa tttaaattta attgcaactg atgcgaaaat tgatttatac gatttctgtg 1320 aggaaacaca cgaagaaatt tctactttcc aaaatatacg taaacacggg gcagaaccca 1380 caaatatttt ggaaaatatc gacaacaata ccgaaaaagt tgaagacata tcagacatat 1440 caactgattg tatgcaattt tgttccaatt gtcttaaaaa agaacaaaat gaagcatatt 1500 atagaaagaa gtgctttgaa atgtctgaaa atctaaagaa agaaatacaa aaggtcaaac 1560 tatgtaacaa aaagataaga catttgagaa atgtattgcg aaatgaaaga gcaagaaaac 1620 ttaaatattt aaaggaaaaa aagaaaatcg atatacaagg attaattgac aaaaagtgcg 1680 tttcaaaaaa ttcaaacact gtttgcaaaa tgttacttaa gaacaagcag tcgtgggaag 1740 atgaagaaaa aattattgct caaagcatta atttttattc ttctaaggcc tataatttta 1800 tgcgtgacga tctggagtta aatcttccat gcaacaagtc tctgcaaaga tgggctcccg 1860 taagaaacat ggttccgggc ttaaatgaaa atttattgaa acatttaaaa ggaatttttt 1920 tgaagatgca caataaaagt aaaaattccg tattagtttt tgatgaaatt tcaataagaa 1980 aaggcctgca atataactcc cacagagatg aagtggaagg ttttgtagac gacggtattg 2040 agaaaacgga tgctttgtgt aagcaaatat gtgtttttat ggtgagggga ctctatgaca 2100 attggaagtt tgttttaagc tatgtagcaa cttccactgg actctcttcg tctatattaa 2160 ccgagttagt taatactaat attagatccg caaaaagctt gggcttaatt attagagcag 2220 tcgtatgtga tcaaggccca aataaccgag gagcattcaa caagtttggt ataaaaaatg 2280 agacaccgta ctttactgtt gacggtcaga aaatatttgg catatacgat gtcccccacc 2340 tcataaaatc attaagaaat attttaatgc gaaactatat taatactccg gatggtagag 2400 tctcttggca agtaattgtc aaattgtttg aaatagacac aaaaaacact tccgcaagaa 2460 tgtgcccaaa attatcccga aaacacatat atcccaactc atttgagaag atgaaggtga 2520 aatatgcaac acaggtattt agccaaacag ttgcttctgc actcaagacg ctaatacaaa 2580 acggcacttt tcatgactgc gaagatgtgg caatcgcgac atgtaagttt attgagaaga 2640 ttaataagct gtttgattgt ttaaatagca acaatttatt tgacaagaac cccttcaaat 2700 cagccattca aaaagaaagt gacattgaaa agtacattat tgaaatgaaa aactacttga 2760 agaaatgtca atatccaaaa aaaatttttt gtatggatgg aattatttta tcaataaatt 2820 caatattaat gcttttgcaa gatatttgga gtcaaggaga aggtgtatat tttctattat 2880 tgtcacggct aaatcaagat gctctcgagc acctatttta cttaataaga agtagaggtg 2940 gaaccaacaa caatccaatg ctttttgaat tcaacgcgat aatttcaaaa atgttgtcaa 3000 tgaaaatcat aacatcaaga acaaccactg gaaattgcca ggcagacgag gacttgctaa 3060 taaatgtcat tgaagataca aagcatgaac tggcaattga caatataaat gaccaagatc 3120 gggtatgtta cgaagacgtc aacatcagtt ttgacctata cgacgatgat atggaaggta 3180 ccgaggctga tgaaattcag gacacaacct tgagtatagc ttctgcaaat gcattacgat 3240 attttacggg attcgtgata cataagtcac aacaaaagtt taattgcgag aattgcaaag 3300 aacttgttaa agaaaacatt gtcctctacg accaatccga gttttttata tttaataaaa 3360 actataaaat attaaataat aatttaaaac tcaaaaaccc tcaagacgat ttttttagtc 3420 taatgaagat gcactataat atttttcata atttcttcca aaaatttcca catgctcgaa 3480 atataagaca acaaattttc aatgaatgta ttatgcgtgc tgaaaacgac aaaaaatacg 3540 aagattggta ttgtgaaagt agtaaatgca tagagcatcg aaaatacatt ttgaactatt 3600 ttctggtagt gttgttgaaa aaaaatacta tatggctttt ggaaaaatta tgcggtgcat 3660 ctgaaaagag cgttcggaaa atagaaattt tgaagtctta gggggttatc ctaataataa 3720 tataaataat gtttctttgt tttgtttttt tttttgttgg ttttgtcgtt tttttttttt 3780 tgttttattt tgttcctttt ttattttttt tttatttatt ttcattttgt ttttttttcc 3840 ttttgtttat ttttattttt ttatttttat ataaatagac aaataaaaac ttcgttaacg 3900 ataatcgagt tgggattcga accagcgaca ttcctattgc caagcaaaca ctctatcggc 3960 acagccaccg ggatacatac gtagctgatg ccaatatttc ttgatattct ttcgacaggg 4020 gcatatccta ttcgcattgc cgctgttttg tcgaaaatga aaaattttgg tcgagcgagc 4080 tattgaaatg cgatgttgta caacttcacg tcacacacac actacgaagc ctgagcatgt 4140 accgacgagt gtacacatac atagacagaa cgaaatcaca agcgacgttg gcctgccttc 4200 gtcgttcggt agtctgtgtt agactgaaat tttttcgtag ctgctttgcc ctctccccat 4260 tttacgctct cattgagagt gagagcgttc gggttttccc gaagttctct ggggttccat 4320 gactaggcta gggtttgcaa tcaaaaaatc atgagtttgt cccatctatg tgttctatgg 4380 ttagtg 4386 // ID DNAREP1_DYak repbase; DNA; INV; 793 BP. XX AC . XX DT 31-MAR-2007 (Rel. 12.03, Created) DT 29-SEP-2007 (Rel. 12.1, Last updated, Version 3) XX DE Non-autonomous family of Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; KW Interspersed repeat; DNAREP1_DM; nonautonomous; Helitron-1_DYak; KW DNAREP1_DYak. XX NM DNAREP1_DYak. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 793-1 RA Kapitonov V.V. and Jurka J.; RT "DNAREP1_DM."; RL Direct Submission to Repbase Update (JUL-1999). XX RN [2] RP 793-1 RA Kapitonov V.V. and Jurka J.; RT "Molecular paleontology of transposable elements in the RT Drosophila melanogaster genome."; RL Proc Natl Acad Sci USA 100(11), 6569-6574 (2003). XX RN [3] RP 1-793 RA Kapitonov V.V. and Jurka J.; RT "Helitrons in fruit flies."; RL Repbase Reports 7(3), 127-127 (2007). XX DR [3] (Consensus) XX CC This is a consensus sequence of a family of non-autonomous CC Helitron transposons transposed in the Drosophila yakuba genome a CC few million years ago (numerous copies are less than 2% divergent CC from the consensus sequence). DNAREP1_DYak is a deletion CC derivative of the autonomous Helitron-1_DYak. These transposons CC are usually inserted in the ttw|TTT target sites without the CC target site duplications (the insertion site is marked by "|"). CC Different families of Helitrons constitute >3% of the D. yakuba CC genome. After a few unsuccessful attempts [1-2], this is a final CC classification of DNAREP1_DM as a Helitron transposon. While CC DNAREP1 elements have lost their mobility in D. melanogaster, CC they have been mobile in the D. yakuba genome until the last ~1-2 CC million years. XX SQ Sequence 793 BP; 217 A; 201 C; 136 G; 239 T; 0 other; ttatacccgt tactcgtaga gtaaaagggt atactagatt cgttgaaaag tatgtaacag 60 gcagaaggaa gcgtttccga ccatataaag tatatatatt cttgatcagg atcaatagcc 120 gagtcgattt ggccatgtcc gtctgtccgt ccgtctgtcc gtctgtccgt ctgtccgtct 180 gtccgtctgt ccgtccgtat gaacgtcgag atctcaggaa ctacaaaagc tagaaagttg 240 agattaagca tacagactcc agggacatag acgcagcgca agtttgtcga ttcatgttgc 300 cacgcccact ctaacgccca caaaccgccc aaaactgcca cgcccacact tttgaaaaat 360 gttttaatat tttttcattt ttgtattggt cttgtaaatt tctatcgatt tgcaaaaaaa 420 ctttttgcca cgcccactct aacgcccaca aaccgcccaa agctgccacg cccacacttt 480 tgaaaaatgt tttaatattt tttcattttt gtattggtct tgtaaatttc tatcgatttg 540 caaaaaaact ttttgccacg cccactctaa cgcccacaaa ccgcccaaag ctgccacgcc 600 cacacttttg aaaaatgttt tgatattttt tcatttttgt attagtcttg taaatttcta 660 tctatttgcc aaaaaacttt tggccacgcc cactctaacg cccacaaacc gccaaaaact 720 gtccttcgca cttacactag ctgagtaacg ggtatcagat agtcggggaa ctcgactata 780 gcgttctctc ttg 793 // ID hAT-2_SM repbase; DNA; INV; 3128 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 08-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3128 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1031-1031 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 925..2871 FT /product="hAT-2_SM_1p" FT /translation="MDKWLKIGTQCPENDKNDNTNNNDGPTPTKLRKSVDE FT DPNKNESSGTVATSKNIRQTMLSNKKLRKYQEEYIKYGFTFCVVNGEERPL FT CVNCSDKLANESLKPAKLKRHLETKHKEFANKSENFFKRRAESMKNQTVFL FT KTYTTIPEKALRASLEVSYLIGKNLKPHTIGESLILPSAIKMTSIMHGEKY FT GNDLKTIPISRDTVSRRISAISRNIESVLLNRIQNSPVFALQIDETTDITK FT MSQLIVYVKYVFEEDISEDFLCCKRLEGRTTGEKIFEIINRYFEENDLAWA FT HCVAICTDGAAALTGSNKGLKGLIIKIAPHIVFNHCMIHRQALVAKDIDEE FT LHKVLQDVVIVINYIKGNSLNSRLFSILCNEMGSEYETLLLHTEVRWLSRG FT KILRRIFDLRNEVYNFLVEKKHILAQSFINEDWLGKLSYMVDIFEKLNDLN FT LSLQGESTTILTLSSKIEAFKNKLILWKGELNKNNTDMFPCFSEFTNENNI FT DFMLFKNIISHHLIKLGENFTKRFEKFPDKELGWIRDPFSFDIINSNIPLN FT EKEQLIDISSDETLRIQFRSLTCQKFWLSTENKYADLKKKAINILIQFAST FT YSCESGFSKLVAIKTKYRSRLDPEDDLRIAISHIVPNMEAIMSSLQAQTSH FT " XX SQ Sequence 3128 BP; 1145 A; 479 C; 524 G; 980 T; 0 other; ctagagcagt ggtgcgcaac ccgtgggtcg cgagaaaaat tttcagggtc gccaaattaa 60 aagaacttta tatgacaata aaaaatatag cgcgctcgta aacctggtgt gatgtcaatc 120 aacgttttct aagtatcgta ctaatggtgt atctattgtg ttacttaagt cacagcacac 180 cgggttcact ctaaggaatt ttaatggctc atgttacctt cgcgcatgtc gcaggcgaca 240 gactctgccc cgctcccacc ccaaccagac gtcgagtgtg tttactggaa tattcaacta 300 ttcagtagta cgttggtttt gttcacgttc atacattcag tagttagtta gttttgttca 360 cgttcgtatg gacatcgagt gaggtacgta tttatttaac ttattttttt attatcgtac 420 agtgtagtag tgttttcgat gtgcaataat aacatttaat taaaaaatta atatacagtg 480 gctcacaggg aatcagatgc agtaacgatt gtttttttaa ttaaaataaa acgtactgta 540 aacaatatat tttttaaatt aactgcacta cattatatca aagtgcacat attattgtaa 600 ttatatcgaa taaaaatgaa aaagattttc aatcaacttt agcctattag gaacagattc 660 tattatttag gtaaataata ggatttgtaa tatttataat caaagtaaat aaactaataa 720 caattttttt tgtttaatat gaagattaca acaggcttaa taatttatgt aagtaaatca 780 ttcaaaaaat tatattaaaa tagtgattaa tcccgaaaaa atattttatt ctgtgctgca 840 tcattttcac tgtgagccac tgtagatatt aaaattataa taaacgaaaa cttatagttt 900 tgtttcagtt ttaataccct cataatggat aagtggctaa aaataggtac ccaatgccca 960 gaaaatgaca agaatgacaa tacaaataat aatgatggac caacacctac taaacttaga 1020 aagagtgtgg atgaagaccc aaataaaaat gaatccagtg gcactgttgc tacttcaaaa 1080 aacattcgac aaactatgtt atctaataaa aaattgagaa aatatcaaga agagtatatt 1140 aaatacggat ttacattttg cgttgttaat ggggaagaac gtccgttgtg tgttaattgc 1200 tcggataaat tagcaaatga aagtttgaaa cctgctaaat taaaaagaca tttggaaacc 1260 aaacataaag aatttgcaaa taaatctgaa aattttttta aaagacgagc tgaaagtatg 1320 aaaaatcaaa ctgtattttt gaaaacatat acaacaattc ctgaaaaggc tctacgggca 1380 tccttagaag tttcatattt aataggaaaa aatttgaaac cacataccat tggagaatcc 1440 ctcattcttc cttcagcaat taaaatgaca tcaattatgc atggagaaaa atatggtaac 1500 gatttaaaaa caataccaat atcaagagat accgtttctc ggcgaatttc cgccatatca 1560 cgtaatatag aatcagttct gttaaatcga atacaaaact ctccagtgtt tgctctacaa 1620 attgacgaaa ctacagacat tactaaaatg tcccaattaa ttgtttatgt gaaatacgtt 1680 tttgaagaag acatatcaga agatttttta tgttgtaaaa ggttagaagg aagaacaact 1740 ggagagaaaa tatttgaaat tattaacaga tattttgagg aaaatgacct tgcttgggcc 1800 cactgtgtcg ccatttgtac agatggtgct gcagcattaa caggtagtaa taaaggactt 1860 aaggggttaa taataaaaat tgctccccat atagtattca accattgcat gatacataga 1920 caagcactag ttgcaaaaga catcgatgaa gaattgcaca aagtattaca ggatgttgtt 1980 attgtaatta attacattaa aggcaacagc ctaaacagtc gtctcttttc aatactctgt 2040 aatgagatgg gttctgagta tgagacgtta ttactacata ctgaagtcag atggctatct 2100 cgcggcaaaa tattgcgacg tatttttgat ttgagaaacg aagtttacaa ttttttggta 2160 gaaaagaaac atatattggc acaatcattc attaatgaag attggctcgg aaaactcagt 2220 tacatggttg acattttcga aaaacttaac gatttgaatt tgagtttgca gggtgagagt 2280 acaaccattt tgacattgag cagtaaaatc gaagctttta agaataagct gatactttgg 2340 aaaggagaat taaacaaaaa caacacagac atgtttccgt gcttttcaga atttactaat 2400 gaaaataata tagatttcat gttattcaaa aacatcattt cccatcattt gataaaattg 2460 ggagaaaact ttacgaaaag gttcgaaaaa tttccagata aagaattagg ttggatccgt 2520 gatccatttt cgtttgatat tattaattcc aatatccctt taaatgaaaa agaacagtta 2580 attgatattt caagcgacga aactctgcgt attcaattta ggtctctgac ttgccaaaaa 2640 ttttggttat caactgaaaa caaatatgcc gatttgaaaa aaaaggcaat taatattttg 2700 attcaattcg catcgaccta ctcatgtgaa tccggctttt caaaattagt tgccattaaa 2760 acaaaatatc gatctcgatt agatccagag gatgacttac ggattgcgat ttcccatata 2820 gttccgaaca tggaggcgat catgagttct ttgcaagcac aaacctcaca ttgaattctt 2880 atgaataaat gtttaaatgt atttttttta taaaataaat atttgtactc atatgcataa 2940 ataaaaccat tgtttcacaa aattatgtgt tgtgtaaaaa cctctaataa aaggaaattc 3000 cagaaatagt ttcattttaa ctacgttgac gattaaaaaa aacctactgt ctactcaggg 3060 gtcgcgacaa attttttcaa aaaaaatggg ggtcgcaaat gaagataggt tgcgcaccac 3120 tgctctag 3128 // ID Dmedpu27 repbase; DNA; INV; 317 BP. XX AC GU229976; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of irritans subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dmedpu27. XX OS Drosophila mediopunctata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup II. XX RN [1] RP 1-317 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229976; Positions 1 317. XX CC Clone Dmedpu27. XX SQ Sequence 317 BP; 111 A; 68 C; 61 G; 77 T; 0 other; ccaaattttt gagtcaattt ataaccatgg atgaaacttg gattcattac tacacttcta 60 aatcaacgca acaggcaaaa cagtgtgttc cgccgggcca aagtgcttcg aagcgtccaa 120 aaacgcaaca atgggccgga aaggttatgc ctccgtattt tgggatgcac atggcataac 180 atttgtggac tatcttgaaa aaggtaaaac cataaccgga gcatactatt catcattatt 240 cgaccgattg acaatcgaaa ttgccgaaaa acgacagcat ttgaagaaga aaaacccgat 300 ttatcatcac gacaatg 317 // ID Gypsy-264_AA-LTR repbase; DNA; INV; 180 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-264_AA_; KW Gypsy-264_AA-I; Gypsy-264_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-180 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 180 BP; 49 A; 44 C; 36 G; 51 T; 0 other; tgtttaacgc aacacgagct taccagccct gcgaccaacg aactgtagat cggttcgttt 60 ggacgatcta ataaagagat gctcagtcta agtttgagcg tgcatagaag aacatatctc 120 ggtgtacttc tttccctcgg taccgaaacc tctttaatac taaatccctc ggttagttca 180 // ID POLINTN1_SM repbase; DNA; INV; 3153 BP. XX AC . XX DT 26-OCT-2007 (Rel. 12.1, Created) DT 26-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Putative non-autonomous Polinton-type DNA transposon: consensus. XX KW Polinton; DNA transposon; Transposable Element; Nonautonomous; KW POLINTN1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3153 RA Jurka J.; RT "POLINTN1_SM: Non-autonomous Polinton from Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1096-1096 (2007). XX DR [1] (Consensus) XX CC This element contains large TIRs, characteristic for CC non-autonomous Polintons. XX SQ Sequence 3153 BP; 1232 A; 326 C; 341 G; 1249 T; 5 other; aattaaaatt ttaggtttca ctaaaatatt tttaaaaatt aactgattta atcaactttt 60 taaatagaac ataaaattat aattaaaata tcatttacta ttcaaaatct atcagtcaaa 120 ttggattcta attggtaaat aattatctta aatagatgtt tcataaacat tttgttccta 180 aaataaactt ttatttaaaa taatcctaaa atcaagtcaa atctaataaa ttatcaagaa 240 attaagtgtt tctaaaatta tagatttacc ctcatttctt caataatttc wttaaaaaca 300 taatcaatcc ccaaatccat taaaattcaa cagaaaacca taaaattcac aaagatgatt 360 aaaatttact aaaataaata cctcgtaaaa ctagatatta ttaaaattat cataaaattt 420 ccctaaaagc tctaattgat ttacatagcc taaaattatt ttatttttgt cgagtacaac 480 ctaatctatt gaagggtgta attaaattta ttaaaagtta attaaatttt cattgaaaat 540 gacaagtcaa taacaagtca acaaatggaa ttttaattag tagagattat attctataaa 600 gaaaattttc agtgaagaag taatattggt gggagagtac attaagtgac taaaatcgtt 660 gaaattactg gaaaaagtgg gatagctaaa gatatctact aaatagatga aatatcaatt 720 gatatattaa aaacttacaa gggtaatata ctttgaatga agaaaatata catcgaagag 780 gaaaaatatt cattaaagtg agaaattatt cattggagag aaaaaatatt tattgaagag 840 ggaaaacaat cattgaataa cgtaaaatat taattgaagc taacaaatat tcagtgaatc 900 gagaattgca tataataaat tagacggtaa aatttctcaa tgatgagatg aaatatcaat 960 taaatattgg gaaatacttg aaataaatgg agtctgaatt attcactgat caacaaatat 1020 acttaaaaga tgaaaatatt cactgaaaat gaattgtact ttaagagtgg atgaattact 1080 attatattca attaaaaatt attaacaatt gagtagattt tgtcggttaa ttatatttaa 1140 atagggatat atattgttga tggatactta tgcgcggatg aatagttatt aaaaaaacgc 1200 gcgcgcgtat actaatccaa cgaatattca acaaatccgg catatttaat actataaata 1260 cgggtggatc tgatacaata taacaagtag gaagaatgta gtagaaagtc caacaaataa 1320 aggaatgtag taaacaatac attgaataga tgaataattt ggtattatga ttataagaaa 1380 catttagttt tttgtaatat taaataaatt cagaaaattg atgtatgttt tatattaatt 1440 attgcttttt ataaaatgtt ataaatttac ttttagaaat aataatcatt ttttgattta 1500 ttattatttt tgaatttaaa ttattatgtt gtcaatagtt acaattgaat aaattattta 1560 aattaatttt acatttggat aatatttatt tataaccata aatttataaa gatattattc 1620 aaaatatttt taatttatca aaaagtttta tttgagtata aaactaaatc aaatattcta 1680 tttactaatc aataaaaaaa aattatttta accaaatttc aaatataatt tcaaataata 1740 gaataatttt ataatttaaa ataatatatt gtcaatagtt acaactaaaa tagtatttaa 1800 atataagttt tttcaaaatt aattacattt agataataat tcttaatacc aataaaattc 1860 cataatgttc attcaaaata tatcattcaa tgtgttttga atgtaagaga cttagatttg 1920 atttctacta tctcagtatt tgatgtatat ttgtcggatt tgttgaatat tcgttggatt 1980 agtatacgcg tttttttaat aactattcat ccgcgcataa gtatccatca aaatatatat 2040 ccctatttaa atataattaa ccgacraaat ctactcaatt gttaataatt tttaattgaa 2100 tttttaattc atccactctt atagtacrtt cattatcagt gaatattttc atcttttaag 2160 tatatttgtt gatcagtgaa taattcagac tccatttaaa gtatttccca atatttaatt 2220 gatatttcat ctcatcattg agaaatttac cgtctaattt attataagca attctcgatt 2280 cactgaatat ttgttagctt caattcatat tttctcactt taatgattgt tttcctcttt 2340 aatgaatatt ttytctcttc aatgaatrat ttctcttatt caatgaatat tttctctctt 2400 caatgaatat ttttctcttc aatgaatatt ttcctcttcg atgtatattt tctccattca 2460 aagtatatta cccttgtaag tttttaatat atcaattgat atttcatcta tttagtagat 2520 atctttagct atcccacttt ttccagtaat ttcaacgatt ttagtcactt aatgtactct 2580 ctcaccaata ttacttcttc actgaaaatt ttctttatag aatataatct ctactaatta 2640 aaattccatt tgttgacttg ttattttcaa tgaaaattta attaaataaa ttacatcttt 2700 caatagatta ggttgtactc gacaaaaata aaataatttt aggctatgta aatcaattag 2760 agcttttagg gaaattttat gataatttaa gtaatatcta gttttacgag gtatttattt 2820 tagtaaattt taatcatctt tgtgaatttt atggttttct tttgaatttt aatggattta 2880 gggtttgatt atgtttttaa tgaaattatt gaagaaatta gggtaaatct ataattttag 2940 aaacacttaa tttcttgata atttattaga tttgacttga ttttaggatt attttaaata 3000 aaagttaaat ttaggaacaa aatgtttatg acatttattt attaattatt taccaatttg 3060 actgatagat tttgaatagt aaatgatatt ttaattataa ttttatgttc tatttaaaaa 3120 gttgattaaa tcagttgatt tttaaaaata ttt 3153 // ID BEL1-LTR_Dya repbase; DNA; INV; 214 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_Dya; KW BEL1-I_Dya; BEL1-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1007-1007 (2009). XX DR Genome; chr3R; Positions 168051 168264. XX SQ Sequence 214 BP; 59 A; 54 C; 47 G; 54 T; 0 other; tgttcatgcc gaacagctga gtccggtatc ccgccagcag agcatcgctg ttggcggaat 60 gtttcggaag tgggacacga agccgttatt ttatgtgaag aaattacttt aaaataaaac 120 cccaaacgag aaggacgaac tccaacacct tttgtttcat taaactctgg gaatatttgc 180 gacaccagct tttccggccc ccgacgtctc taca 214 // ID BEL-189_AA-I repbase; DNA; INV; 6009 BP. XX AC supercont1.85; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-189_AA_; KW BEL-189_AA-LTR; BEL-189_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6009 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.85; Positions 975534 981542. XX CC 'GTGAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(498..4085,4089..5984) FT /product="BEL-189_AA-I_1p" FT /translation="MQLRSGLKKNLFFTPTGSSSPSVVRIGTPVSSVREQL FT EKQKAEKERCRMAENVAALSRCCETTRAKVGRIREAIAVADLDHKKFSVHA FT LKLYLKTTDSAYEEYNGFQNRIYLADPSRKEEFEPIFIEFEELYKFTRIAL FT CEMLQAYEDEEKAILSVAAQSAAGNVSDSGLKGSNHQAGCSSSGVPVAFPP FT TLVLQQVALPTFDGRYENWFKFKQMFCDIADKCTADSAATKLHFLDKALIG FT KAQGAIDPQIIRDNDYQEAWRSLTQQFENLPALINGTTTRLLNVKQMVNES FT FAQLKGLLDEVEKCVSSLEYHNLKMDKLSEAIITSLIASKLDMATRKVWES FT SVTRGQLPEYKKMISVLRNQQAVLERCERAKPALKIRNSNSASKTQSSFAP FT MKTHTATVGKRNDTCTLCNGEHQIEKCDAFLKLNVNARYGKAKHFGLCFRC FT LKRGHRTAECKVEKKCSVCSRGHHLLMHPECKNESEKPMKDTAVDATGATN FT AVGKEESTSANCSLLCNANETKNQVLLATAIVNVIDACGVQHKCRALLDSG FT AMANFMSQRLADLLNLEKKAANVPVVGVNGMKTIMKFKVEAKVLSRVTEYG FT FSLDYLVVPKVTGTLPPAGVDVDRWPIPTNLTLADSSFFEPSRIDLLIGAE FT VFFELLRSGKITMAAELPILQESMLGWLVSGRVSECPSTGTVRVCHAQPIP FT SAKMELSMLLRQFWSIDEQDFPITNHQASFDCEQHFLKTHSRNESGRYVVR FT LPFRTNPSELGQSRQQAEKRFFALERRLDRNSDLKQQYKAFIEEYISLGHA FT REVHEADEEAFYLPHHCVLKPDNSTTKLRVVFDASARSSTNLSLNDIMMIG FT PTVQRSLFDIVLRFRCHRYVFTADVQKMYRQVMVHEDDQKYQRIVWRDNRE FT QELKTIQLSTVTYGTAAAPFLATRSLNQLTLDEQSDFPEASDAVLNGFYVD FT DALAGADGLDQTQKLQGDLIEMLDRGGFQLHKWCANDSALLQKIPVEARAK FT ELDFETCKEGESVKTLGLLWNPVNDVFMFRVSPFKMSSQLPTKRQIMSDVA FT RLFDPLEYLGPTVVIAKLVMQQLWKEKVGWDDPTTAEVMETWERFRSELCE FT ISGLKIPRRVTVNGTMSYEIHGFADASMKAYGCCIYLRCLKSDDTAEMSLL FT CSKSRVAPVKELNREWKGDEKPEEMTIPRLELCAAKLVEQAVKVLNAIQLD FT VRRTVLWTDAKIVLAWIRRLKPDAPIFVRNRVAVIQQLSGNFEWRHVPTTT FT NPADHISRGLYPVKLMTCELWWHGPGFLRASINAEPPQYDEDEADEYNVGD FT HEDTLAAIVDTNLNPPLEAINRCSDYRKLQRIFGYMSRFLYNCRSKLGDRR FT QGPLSVGDLRKAEMLMVRVVQLASYEEEIVCIQNNRSVKSKLRNLNPVFDE FT NERMLRGGGRIRHSNLPREQKHPLILPEGQHLTEILIDALHRENLHIGRNG FT LLSVIRKRFWPVNAKRVVYRVLKRCVRCFRVKPTDVQQFMGDLPSYRVTEA FT LPFSRTGVDYAGPLLLKQGRMKSPVKAYIALFVCMTTKALHLELVTSLSTE FT SFLAALHRFVGRRGNVSEMRSDQGTNFVGEDHQLKEFYDMLTSQLMQKKIA FT EFCQVRCIDWKFNPPKAPHQGGLWEAGVKCVKYHLHRVLKEAYLNYEEMNT FT LLVQIEAILNSRPLCQQSDDPCDYQALSPAHFLIGRELTAVAEPLYQRLRE FT SSLSRYQLVQKHKQTFWRRWSNDYVTEQQRRGKWNKSPSPIKNGMLVILKE FT ENMPPQTWRLGRIDDVYPGSDGVVRVVKIRTSNGSTFDRPTTQIAILPIED FT NEEPNI" XX SQ Sequence 6009 BP; 1723 A; 1247 C; 1548 G; 1491 T; 0 other; tttggtcact tcgatcctga taaagtaacc gcgagtggtt gagttgtatg gacgcgccgg 60 ataaaagaaa ttaagttttg tctgctgaag caccaaaatg gactcatccg gtgacatgtg 120 gtttcgagaa gagtagcaaa ttttggaagt ttgctagtga ttcaagaatt gagaagatat 180 tgtgaaacac ttagtttgtg tggattctga acattttgac ggacgttgtg gccacgttga 240 agagtgcccc gcaacgaacg attcctctga tcgcgatgaa agtgatgctg gatctgcatg 300 agagcagtcc tctgcaaatt gtgacagtga cggtgatcga ggacaatgaa ctgggaagct 360 tcaaagtgag ataagtccat aaactaagta gtttgaaact tgattcggtg agtttcgccg 420 aaaagtgacg ttggaggtac ataacctcta acgaagaaga aagacaaaca attgaaaaga 480 agaagcaata caaagtgatg caactgcgat ccggtttgaa aaagaatctg ttcttcactc 540 ccactggaag ctcaagtcct tctgtagtta gaattggcac gcccgtttca tctgtgcgag 600 agcagttgga gaaacagaag gcagaaaagg agcgttgtag aatggccgag aatgtggcgg 660 ctttatctcg ctgttgcgaa acgacaagag ccaaggtggg acgtattcgc gaagcgatag 720 cagtcgctga tttggaccac aagaaattca gtgtacacgc gttgaagtta tacctgaaga 780 ccactgattc agcgtacgaa gagtataatg ggttccagaa cagaatctac cttgccgacc 840 cttcgagaaa agaggagttc gagccgatct tcattgagtt tgaggaatta tacaagttta 900 cgcgtatcgc tttatgcgag atgctgcaag cttatgagga tgaagaaaaa gcaatactct 960 ccgtagctgc tcaatccgct gcaggaaacg tttctgactc tggattgaag ggaagtaatc 1020 accaagctgg ctgtagctct tctggtgtac ctgtggcctt tcccccgact ctcgtactgc 1080 aacaagtagc gctcccgact tttgatggtc gctacgagaa ttggtttaaa ttcaagcaga 1140 tgttttgcga cattgctgac aagtgtactg ccgattctgc tgcaactaag ctacacttcc 1200 tcgacaaggc tctaatagga aaggcacagg gagcaattga tccgcagata atcagggata 1260 atgattacca ggaagcttgg cgctcgttga cccagcaatt tgagaacctt cctgccttaa 1320 tcaatggcac cactacgagg ttgttgaatg taaagcagat ggtgaatgaa tcatttgctc 1380 agctgaaagg cctgttggat gaggttgaga agtgtgtgag ctctctcgag tatcacaatt 1440 tgaaaatgga caagttgtca gaagccatca tcaccagtct tatcgcttca aagctggata 1500 tggctactag gaaggtgtgg gaatcgagtg tcacacgcgg acaacttccg gaatacaaga 1560 agatgattag tgtcttgaga aatcaacaag cagtactcga gcgatgtgag agagcaaagc 1620 cagcattgaa aattcgaaat tcgaactcgg cgtctaaaac acagtcttcg tttgctccta 1680 tgaagaccca taccgcaaca gttggaaaaa ggaacgacac atgtacgttg tgcaatggtg 1740 aacaccagat tgaaaagtgc gatgctttct tgaaactgaa tgtgaacgct cgttacggga 1800 aggcgaaaca tttcgggcta tgcttccgtt gcctaaaacg gggccatcgc acggcagagt 1860 gcaaggtgga gaaaaagtgc tcagtatgtt ctcgtggaca ccacttgttg atgcatccag 1920 aatgtaagaa cgaatcggag aagccaatga aggatactgc agttgatgcg acgggtgcaa 1980 ccaatgctgt tggcaaagaa gaatcaacgt cggcaaattg ttctcttctg tgtaacgcca 2040 atgaaaccaa aaatcaggtt ttgttggcaa ctgcgattgt caacgtgatc gacgcttgtg 2100 gagtccagca caagtgccgt gctctactgg attcgggtgc aatggctaat tttatgtcgc 2160 aacgcctcgc tgatttgttg aatctggaga agaaagccgc aaacgttcct gtagttggcg 2220 tgaacggcat gaaaacgatt atgaagttca aagtggaagc taaagttctg tccagagtaa 2280 cggaatacgg tttcagtttg gattatcttg tggtaccgaa ggttaccgga acgctgcccc 2340 ctgctggagt ggatgtcgac cgttggccga taccaacaaa tttgacactg gccgactcgt 2400 ccttcttcga acccagccga atcgatttgt taattggagc ggaagtgttc tttgaattac 2460 tacgaagtgg caagataacg atggctgctg aactacccat tctgcaagag agtatgctgg 2520 ggtggctggt gtccggacgt gtatctgaat gtccatcaac gggtacagtc cgtgtatgcc 2580 atgctcagcc gatcccttcg gccaaaatgg agctttctat gctattacgg cagttttggt 2640 cgattgacga acaggatttt cctattacca accatcaagc aagcttcgac tgtgagcaac 2700 acttcttgaa gacgcactcg cgaaacgaat ccggacgcta cgtggtacga ttaccatttc 2760 gtacaaatcc aagtgaatta ggacagtcga gacagcaagc tgagaagaga ttttttgctt 2820 tggagcgcag acttgatagg aattctgacc tgaaacagca gtacaaggcg tttattgaag 2880 aatatatttc gcttggccat gctcgtgagg ttcatgaggc tgatgaagaa gcattctatc 2940 tacctcacca ctgtgtgctt aaaccggaca actcgacaac taagttaaga gttgtatttg 3000 atgcatcggc ccggagttcc accaatctat cattgaacga cattatgatg attgggccca 3060 ccgtacagcg ttccttgttc gatatagtgc tgcgttttcg atgtcacagg tacgttttca 3120 ccgccgacgt gcaaaaaatg taccgccagg taatggtaca cgaagatgat caaaaatacc 3180 aaaggatcgt gtggagagac aaccgtgagc aggagttgaa aacaatccaa ctttctacgg 3240 ttacgtatgg tactgctgca gcaccctttt tagcgacacg gtcgctgaat caattgactt 3300 tggatgaaca aagcgacttt ccggaggcga gcgacgcagt gctgaacggt ttctatgtgg 3360 atgacgcgct cgctggtgct gatggtctag accagacaca gaaactgcaa ggtgatttga 3420 tagagatgct ggatagagga ggatttcaat tgcacaaatg gtgtgcaaac gattctgcgc 3480 tcttgcagaa gattcccgtt gaagctcgtg cgaaggaact cgattttgaa acctgcaaag 3540 aaggcgagag tgttaaaacg cttggtttac tatggaatcc agtgaatgac gtgttcatgt 3600 tccgagtatc gccattcaaa atgtcgtcgc agttgcctac aaagcggcag attatgtccg 3660 acgtagcccg tttgttcgac ccgctcgagt acctcggacc aactgtagtt attgcaaaac 3720 tggtgatgca gcagctttgg aaggagaagg ttggttggga tgatccgacc acagcggaag 3780 tgatggagac gtgggaacga ttccgctccg agttgtgtga gataagtggt ctgaaaattc 3840 caagacgtgt gactgtgaat ggcacaatga gctatgaaat acatggcttt gcggatgcgt 3900 ccatgaaagc gtatggttgt tgcatatatt tgcggtgcct gaagtcagac gatactgcag 3960 agatgagcct attatgcagc aagtccagag ttgcacctgt aaaggagcta aaccgagaat 4020 ggaaaggtga tgagaaacca gaagaaatga ccatcccgcg tttagaactt tgcgcagcga 4080 agctttaggt ggaacaggca gtaaaggtgt tgaatgcaat tcaattggac gttcgtcgta 4140 cggtattatg gactgatgca aaaattgtat tggcttggat tagacgtttg aaaccagatg 4200 caccaatctt cgtgcgtaat agggttgctg tgatccaaca gctgagtgga aattttgaat 4260 ggagacatgt accgacaaca acgaacccag cagaccacat ctcacgcggt ttgtatcctg 4320 tcaaactaat gacgtgtgaa ctgtggtggc acggaccagg ctttcttcgt gccagcatta 4380 atgcggaacc gcctcagtac gatgaagacg aagcggatga gtataacgtt ggtgatcatg 4440 aggacacact tgcggctatc gtagacacga acctaaaccc acctttggag gctatcaata 4500 gatgcagtga ttataggaag ctgcagagaa tttttggata catgtcaaga ttcctataca 4560 actgtagatc taaattaggc gatcgacgac aaggaccact tagcgttggc gatctgcgta 4620 aagccgaaat gttgatggta agagtggttc aattggcatc atatgaagaa gaaattgttt 4680 gtattcaaaa caatcgttcg gtgaagagca agcttcgaaa cctcaaccca gtattcgacg 4740 aaaatgaacg gatgcttaga ggtggtggac gcattcgtca ttctaactta cctcgggaac 4800 agaagcaccc tctcattcta ccggaagggc aacatcttac ggaaatccta attgacgctc 4860 tacataggga gaatttgcac atcggacgta acggactact ttccgtgatc agaaagagat 4920 tctggccagt taatgccaaa cgagtcgtct atcgagtgct gaaaaggtgt gtgcgttgct 4980 tccgggtgaa accaactgat gtccagcaat tcatgggtga cttaccaagt tatcgcgtga 5040 ctgaagccct tcctttttcg agaactggtg tggactacgc cggtccatta ttgttgaagc 5100 aaggtcggat gaaatcaccg gtgaaggcgt atattgcatt atttgtatgc atgacgacaa 5160 aggcactgca cctcgagctg gttacgtcat tatcgacgga gtcgttcctg gctgcactgc 5220 acagatttgt tgggcgaaga ggcaacgttt cggaaatgag atcagatcaa ggcactaact 5280 tcgtaggtga agaccatcaa cttaaagaat tctacgatat gcttacgtcg cagttgatgc 5340 aaaagaagat tgcagaattc tgtcaagttc gatgcatcga ctggaaattt aacccaccca 5400 aggcgccaca ccaagggggt ctatgggaag caggggttaa atgtgttaaa tatcatttgc 5460 atcgtgtttt gaaggaggcc tacctaaact acgaagaaat gaatacattg ctggtacaaa 5520 ttgaggctat tctaaattcc cgccctttgt gtcagcaatc cgatgaccct tgtgattacc 5580 aagcattaag tccagctcat ttccttatcg gacgtgaact tactgctgta gcagagccac 5640 tttatcaaag actacgagag agttcactat caaggtacca actcgtgcag aagcataagc 5700 aaaccttttg gcgtcgttgg tccaacgact acgtaacaga gcaacaaaga cgaggaaagt 5760 ggaacaaatc accatctcct atcaagaatg gaatgttggt tatactgaag gaggagaata 5820 tgccacccca aacctggcgg ctcggcagaa ttgatgacgt ttatccagga agcgacggag 5880 tggtacgagt agttaagatt cgcaccagta acggctctac attcgaccgc ccaacaacgc 5940 aaatagctat tcttcccatc gaggacaacg aggagcccaa tatttgagcc cagctcaacg 6000 gggggtgga 6009 // ID HTE1 repbase; DNA; INV; 391 BP. XX AC M27873; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Holothuria tubulosa repetitive element, clone HTE-1. XX KW HTE1; Repetitive element. XX OS Holothuria tubulosa OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; OC Holothuroidea; Aspidochirotacea; Aspidochirotida; Holothuriidae; OC Holothuria. XX RN [1] RP 1-391 RA Sainz J., Azorin F. and Cornudella L.; RT "Detection and molecular cloning of highly repeated DNA in the RT sea cucumber sperm."; RL Gene 80(1), 57-64 (1989). XX DR GenBank; M27873; Positions 1 391. XX SQ Sequence 391 BP; 130 A; 76 C; 50 G; 135 T; 0 other; aattcaaaaa atcagctatt cggcaccata ggtcaacgcg ctgcttattt ttctactttc 60 ggttaaaata tcaagtcatt tcaataattc attcaacaga aacagttggt acattgtgtc 120 aatgtaattc cacccagtta atagtcacga accgaaaatt tagtaacttt ttaaagttcc 180 accatttcaa gtacctttgt tttctttata tgttcataaa aatccaaatc caattccata 240 ggtcaacttg tcgtcaccat ttttacccat gtagaagata taatgtcgaa atctgtattt 300 gatcaagata tataaagggt acatcaaagt tctttacagg attcttaaaa ctgaaaaaag 360 tcctaacatt tcctcgtatt cttgcttcat g 391 // ID Gypsy-25_AA-I repbase; DNA; INV; 4573 BP. XX AC supercont1.308; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_AA_; KW Gypsy-25_AA-LTR; Gypsy-25_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4573 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.308; Positions 25674 30246. XX CC 'GATTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 275..4534 FT /product="Gypsy-25_AA-I_1p" FT /translation="MEELRALLARQNELFAALADRVNNIQVNVPAMISPQP FT VPQPPLLCLEGDMSENFDFFERNWNNYASAMSMDQWPEDQNPKKVGFLLSV FT IGFEALRKYFNFELTDEQRQSPIAALAAIKAKVVRVRNMNLDLFEFLSAKQ FT ESESIDEFVTKLRVLAKPCQLGALEERFLTYKVVTANKWPHLRKRLITSSD FT VTLAKVVDECRLEEVSVRRFMALGAERESEVNYIQPRREKVEKKSCKFCGG FT RHIFEKGLCPALGQKCRKCKAKNHFEKMCPTNKGAVKKRRVKEVSIHSDDS FT DESESNVNSESEESSEERQIGKIYNNSSKGGHVLADLQLKVNGRWENVVCE FT LDTGANTSLIGYNWLKKLTGKRNPELFPSEFKLQSFGGNPIPVLGEVKLPC FT RHNEQKVSLILQVVDVDHRPLLSANVCDVLGLVKFCNTVSYVRPEKPISQS FT DVWKVHRIEANQIANDFSDVFNGYGRIEGEVLLEVDPAVPPLIQQPRRIPI FT ALRDKLKVELDNLEKSGIIVKEQQHTTWVNNILLVNKTASGESFRICLDPV FT PLNKALKRPNLQFLTLDEVFPELSNAKVFSTVDAKKGFWQVVLDEASSRLT FT TFWTPFARYRWVRLPFGISSAPEIFQMKMREVLDGLDGVECLADDVLIYGR FT GETLEEAMNDHNRCLKQLFERLRQHNVKLNRSKLNLCQPSVKFFGHLLTDK FT GLQADDSKISVIRNYPTPTDRKALQRFIGMLNYLSRFIPNLSSNLHALRKM FT LSKNDTWLWNTECQQEFDKAKSLVADIENLQYYNVNKPLWIECDASSYGLG FT VAVYQDSGVIGFASRVLTPTEQNYAQIEKELLAVLFACVRFDQLIIGNTQT FT VVKTDHKPLVTIMQKPLLKAPKRLQHMLLNLQRYNIQLQFVSGKENVLADA FT ISRAPEVPKENDSEFHKLHIYRVFGEVEDINLGSYLNISDDRINDILEQTS FT QDKVLQTVVCYTKDGWPKKISDVPDLVRPFFKHRHEIGYQDGLVFRGDRVV FT IPSSLRRKLVEKLHVGHPGIEASLRLARSNIFWPGMNDQIKNRIQECQTCA FT KFAASQWKPPMKSHPIPVYPFQLVSMDVFFQVYQGKQRIFLVTVDHYSDYF FT EIDILKNLSSSSVIQACKRNFACHGTPQMIVTDNGTNFVNEEMVKFSKSWD FT FKHSTSAPYHQQANGKAEAAVKIAKHLIMKSAEDGQDLWLALQLWRNTPNN FT IGSSPASRLFSRGTRCGIPMPATNLVPSVVKGVPEAIYENRQRIKYNYDKR FT SRRLPVLDIGSPVYVQLRPESSKLWTPGTVSNTLGDRSYLVEVDGSNYRRD FT AVNVKPRKESATPLTTTSDWMNMSSTSAALVAAPECTPSVTPIYSHERSQV FT TSNDTSPTELLGPEVAQPSLSATPATASVQSETLQNDRPKRQTKIPSKFKD FT FVVTIK" XX SQ Sequence 4573 BP; 1357 A; 946 C; 1070 G; 1200 T; 0 other; tggtgtcaga agtggtcgcg tagtgctttg ttccattctg cgtaatattg gagtatccgg 60 aaaagtattc cgttgtagtc gaaggaaatc ttccgttcta ggcctattga ttggggccag 120 tttcgttttt gcctcctcac atctcatgca ttgatgtgtg tgtgtccatt gcaacggtcg 180 gaaatatttg cccgacggcg tatgtggtag tgtgatcaga agtgcataaa acgtcggctt 240 gtgttcgtgt cagtataccg aataaacaac caacatggaa gaacttcgtg ccctcctcgc 300 aaggcagaat gaactgttcg ctgctttagc agaccgtgtg aataacatac aagtgaatgt 360 gccggctatg atttcacctc agccggtacc gcagcctccc ctgttgtgct tagaagggga 420 catgagcgag aactttgatt tttttgagcg taattggaac aattatgcca gtgcaatgag 480 tatggatcag tggcctgaag atcaaaatcc gaaaaaagtt gggtttttat tatcagtgat 540 tggatttgag gctcttcgaa aatatttcaa tttcgaattg acagacgaac aacgtcagtc 600 tccaattgcg gctttggctg ctattaaagc taaggtggtt cgtgtgcgta atatgaactt 660 ggatctattt gagttcctat ctgccaagca agagtctgag tcgatagatg agtttgtaac 720 gaagttaaga gtgcttgcta aaccgtgtca gttgggagca ttagaggaga gattccttac 780 atataaggtg gtcactgcca ataagtggcc acatctcagg aagcggttga ttacttctag 840 tgacgtcact ctcgctaagg ttgtggatga gtgcaggctt gaagaagttt cagtgcgtcg 900 attcatggct ctaggagccg agcgagagag tgaagtgaac tacatacaac cgagaagaga 960 aaaggttgag aagaagagtt gcaagttttg tggaggacga cacattttcg aaaaaggtct 1020 gtgtcctgca ttgggacaga agtgtcgaaa gtgtaaagca aaaaatcact ttgagaaaat 1080 gtgcccaaca aataaaggtg cagttaagaa gcgaagagtg aaagaggttt cgattcactc 1140 agacgatagt gatgagagcg aatcaaatgt gaacagtgaa tctgaagaat caagtgaaga 1200 acgccagatc ggaaaaatct acaacaattc gtcgaaaggt ggacacgtat tggctgattt 1260 acagttgaaa gtgaatggtc gctgggaaaa tgttgtgtgt gaacttgata cgggcgcaaa 1320 cactagcctg atcggatata actggttgaa aaaactaact ggaaaaagga atccggagct 1380 ctttccatct gaatttaaac tccagagctt tggtggcaat cccattccag ttcttggtga 1440 agtgaaacta ccgtgtcgtc acaacgaaca gaaggtctcg ttgatattac aagttgtcga 1500 cgttgatcat cggccgctac tctcagcgaa cgtttgtgat gttctaggct tggtgaaatt 1560 ctgcaatacg gtatcatatg tgcgacctga aaaaccaatc agtcagtcag atgtgtggaa 1620 agttcaccgt atcgaagcga accaaattgc gaatgacttc agtgatgttt ttaacgggta 1680 tggaaggatc gaaggagaag ttttgcttga ggttgatcca gcggtaccgc cgctgattca 1740 gcaaccacgg cgtatcccga tagcacttcg ggataaactg aaagtggagc tagacaattt 1800 ggagaaaagc ggaattatcg tcaaagagca gcaacacacc acatgggtga acaacatcct 1860 tctagtcaac aaaacagcat caggtgagtc gttcagaatc tgtctagatc cagtacccct 1920 gaacaaggcc ttaaaacgac cgaatctgca atttttaacc ctggatgaag tttttcctga 1980 gctctcgaat gcaaaggtct tctcaacagt tgatgctaaa aagggatttt ggcaagtggt 2040 tttggacgag gccagtagta gattaacaac attctggacc ccttttgcga gatatcgttg 2100 ggttcggcta ccatttggta tttcgtcagc gccggaaata tttcaaatga aaatgagaga 2160 ggttttagat ggcttagatg gagttgaatg tctggctgat gatgttttga tatacggtcg 2220 aggcgaaacg ttggaagagg caatgaatga ccataaccga tgccttaaac aactttttga 2280 acgtcttcgg cagcataatg taaaacttaa tcgctcgaag cttaatctgt gtcagccctc 2340 agtaaaattt tttggacact tgctgaccga taaggggctt caagcagatg actccaagat 2400 ctcagtaata agaaactacc cgacaccaac agatcgtaaa gccttgcaac gcttcatagg 2460 tatgctaaac tacttgagtc ggttcatacc aaatcttagt tcaaacttac acgctcttcg 2520 caaaatgttg tccaaaaatg atacttggtt gtggaacaca gaatgccaac aagagtttga 2580 taaggccaaa tcgctagtgg cagatattga aaaccttcaa tattacaacg taaacaagcc 2640 tctttggatc gaatgtgatg ccagctctta tggattagga gtggctgtgt atcaagactc 2700 cggggtgatt ggatttgcat ctcgagtact cacacccaca gaacaaaact acgcacaaat 2760 tgagaaagag ctgttagcag tgttgttcgc gtgtgtgcgt ttcgatcagc ttattattgg 2820 gaatacccaa actgttgtca agacggatca caagccgctg gttactatta tgcaaaagcc 2880 tctattgaaa gctcctaaac gactccagca catgcttcta aatctccaac gatacaacat 2940 tcaactacaa tttgtttctg gaaaagaaaa tgtgttggca gatgcgatat cacgtgcccc 3000 cgaagttcct aaagaaaatg attcggagtt ccataaactc catatctacc gtgtatttgg 3060 cgaagttgaa gacattaatt taggcagcta tctgaacatc tctgacgatc ggattaacga 3120 tattctggaa caaacttcac aagataaagt gctgcaaaca gttgtttgtt acaccaagga 3180 tggctggccg aagaaaatca gcgatgttcc agatcttgtg aggccatttt tcaagcatcg 3240 ccatgaaatt ggttatcaag atggtctagt atttcgtggg gatcgagtcg ttattccaag 3300 ttccctacga cgtaagctgg tcgaaaagct tcacgtcggt catcccggaa ttgaagcatc 3360 tctcagactg gctagatcca acattttctg gcccggaatg aacgaccaaa ttaagaatcg 3420 tatccaagaa tgtcaaactt gtgccaagtt cgctgcatcc cagtggaaac cacccatgaa 3480 atcccatccc atccctgttt atcctttcca attggtttca atggacgttt tcttccaagt 3540 gtatcaagga aaacaacgaa ttttcttggt aaccgtagac cattattccg actattttga 3600 aattgatatt ctcaaaaacc tgtcgtcttc ttcagtgatt caagcatgta aacggaactt 3660 cgcctgccat ggcacgcccc aaatgatcgt aacggacaac ggaacgaact ttgtgaatga 3720 agaaatggtc aaattcagca agtcatggga tttcaagcac tccacctcag ctccatatca 3780 ccaacaagca aacggcaaag ctgaggcagc agttaaaatt gcgaagcatc ttattatgaa 3840 gtcagcggaa gatggtcaag atctgtggtt ggcactacag ctttggagaa atacaccgaa 3900 caacataggc agcagtccgg cttcaagatt gttttcccgt ggaacacgat gcggtatacc 3960 gatgccagca acaaacttgg taccaagcgt ggtcaaaggg gtccctgaag ccatctatga 4020 aaatcgacaa aggataaaat acaactacga caaacggtct cgccgtctac ctgtactaga 4080 cataggttct ccggtctacg tccagctacg accagaatca tcaaagttgt ggacgccagg 4140 aacagtgagt aatacgctgg gagatcgatc gtatcttgtt gaggtggatg gttcaaacta 4200 tcgacgagat gctgtaaacg tcaaaccacg taaggaatca gctacgcccc taacaacgac 4260 ttcagactgg atgaatatgt catctacatc agcagctttg gtggcagctc ctgaatgcac 4320 tccctctgtg acaccaattt attcccatga gcggtcacaa gtgacgtcga acgacacttc 4380 tcctacggag ctactaggac cagaggttgc tcaaccatcg ttaagtgcta cgcctgcaac 4440 tgcttctgtg caaagtgaga cgcttcagaa cgatcgtcct aaacgtcaaa ctaagatacc 4500 atcaaagttt aaggattttg ttgtaacaat taagtagctt acccatcttg tttttcttgc 4560 aaaaagggga gga 4573 // ID Gypsy-43_CQ-I repbase; DNA; INV; 9099 BP. XX AC AAWU01034657; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-43_CQ_; KW Gypsy-43_CQ-LTR; Gypsy-43_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-9099 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 465-465 (2011). XX DR Genome; AAWU01034657; Positions 11387 20485. XX CC Positions [3810-4346] - Reverse transcriptase CC Positions [5367-5858] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 601..3000 FT /product="Gypsy-43_CQ-I_1p" FT /translation="MFKLVPFPRAEDLTIEEIEHELTVRKQPESIFLLDLP FT AKQRKVRSLFKRDQKESLTWPSYTNIREEEEYIHGRIDKLAKALEKKQETR FT FESRVLSYWHRTRRHSARTEAERKIKRELIQKIENLMKQFQFGVPQSPVKK FT QINNILESSRNSTPTTVSNISERSKLTLRTPVPCPITTVAITPTPTILITT FT PSPNLSPLVTSPITTPTLSISVASGSDTPNAVVTPTLTLESPNATTPTSRV FT STPLVQTPNVVPRNPRFEFPTPDRGGEGTVSNSGSSSPTKGKTTDMASGGS FT GQGNGGQAGDDKFMTEWKNMKALLVDLVDKNAELRTEIQEQKKALQAKGSQ FT GPVKMKTGATQKDVSSKGAIPKRGEGSTSTSSGIPQEEWEAMKEILRTIML FT QQSSASANNNNVPPRNNPQPGRGSGNPRQSPPPYPPSYEGDYSDSSSSRGG FT ARGGHRYRDQPGRVDGRIDKWKIRFSGDSNVSVENFLYKAKKLAEREEIPP FT WELLRNIHKLLEGTAEDWFFTYADDFDCWRTFECKFTHRFGNPNKDQGIRS FT TMKYRKQQRGEAFKAFVDEIVRLNRLLTNPLPQDRLYETIWDNMRDHYKTK FT IAVVGVGDLDQLLEINHRIDAADSSIKQQADGSGEFSNQRHVNHVEVDEYG FT SEEDGASVNAIRGQQSSYRPPCSSASQDQQRSTSAYRNPYRSRPQQQPREQ FT AHHQQQPQDQLQQQPQHLINQTRQADATQAANPLPVETSGATAQLLCWNCL FT VHGHSWRQCNKPKVIFCYGCGCLGRTITNCDRCSVTSTQRSSATPQGNE" FT CDS 3414..6260 FT /product="Gypsy-43_CQ-I_3p" FT /translation="MIDLGQGPESVEMVEQDDDALLCFTIEPSETTPTLED FT PEEDDTLDIPVYEGPTESVPDPDAIETEHVLTTEQRRQLSEVIRAFELTST FT GKLGRTHLIEHEIVLKEGAKPRNPPMYRCSPYVQQAINAEVERFKKLDAIE FT ECYSEWTNPLVPVPKKNGKVRVCLDSRKINKLTVKDSYPMRNMQDIFRRLG FT KATYFSVIDLKDAYFQIPLKEECRNLTAFRTSEGVFRFKVLPFGLMNAPFT FT MSRLMAKASGSDLEPNVFVYLDDIVIAASSFDEHLRLLRLVAERLRKAGLT FT ISLEKSRFFRKQVMYLGYLLNEHGVAIDKSRIQPILDYAQPRTQEDIRRLM FT GLAGFYQRFIKDYSRMTAPITDLLTKENKRFTWTKEAEDGFRELKAVLTSA FT PILGNPDFSKLFTIESDASDRAVGAALVQEQDGVTRVISYFSKKLNRTQRR FT YSAVEKECLGVLSAIQHFRHYVEGTKFRVITDARSLLWLFNVGAETGNAKL FT LRWALRIQAYDFDLEYRKGKANITADCLSRSIEVDAVRVTLPDDEYEEQIE FT NITGNPTKFSDYRVIDGRIFRYVKSPNRQTDPRFSWKLLPPRAERKEIITK FT VHDQAHFGFEKTLASLKERHDWPKMREEVRKHCRECLQCQVSKAGNQNVTP FT PMGPQKPVQYPWQFITLDYVGPLPASGKNRNTCLLVVTDVFSKFVLVQPFR FT EAKAHSLTEFVENNVFRLFGVPEIVLTDNGTQFLSKQFKTLLEAYHVQHWL FT TPAYHPQVNNTERVNRVITTAIRATLKKDHKHWADNLQDIADAIRNAVHDS FT THYSPYFVVFGRNKVSDGREYSWIRDNYEPSDDDKDVPEQKKKLFEEIKKN FT LTAAYQRHAKTYNLRTNANCPKYTVGEKVLKQTFDLSDKGKGFCKKLAPKY FT EPAVVRKVLGKNTYELEDLDGKRFGVYFTNRLKKWHTPQPGASGS" XX SQ Sequence 9099 BP; 2457 A; 2180 C; 2261 G; 2201 T; 0 other; gaactccctg tcccgaaacg ggattactga acccccctcg agaggtctct acggctgggc 60 aaaccgacaa acaggtggat ggtctccttc gggagtggcg cttaagccat ctcacttaac 120 cataggggag ttcgataggc tggcgggcta taactggcgc ccaacaatac aaggcatcta 180 ctgtactcta ttttttattt gatttttttt ttcttggatc gttttactaa attttctgaa 240 atttacggat tttgggaatt atttttttgg cttgattttt ttggtttgat ttgaattggt 300 tttttttgta gtattagatt cttttgttct gaaggaagag ttgagtttct cggagaaatt 360 gcaaacacta ttttgggttg tttttcattt gcatacattt attgtattta ctttctattt 420 tttcttttca tcatacatta tttatattat catatttttt tttcattgct acaatttttg 480 taattttggt actacgattt gtattattta aggagcctac tctgcagttt tgttgttata 540 gcaaactaat ttagcgagct ataaaattcg gaatttcgta attgaaccta attgcactag 600 atgttcaaat tggtaccgtt tcctcgtgcg gaggacctta cgatcgaaga gatcgagcac 660 gaactcacgg ttcgcaaaca accggaatca attttcctac tcgatctacc ggcaaaacaa 720 aggaaggtac gaagcttatt caaaagggat caaaaggaga gcttgacgtg gccgtcgtac 780 acaaacattc gggaagagga agagtacatc catggtagaa ttgacaaatt ggccaaagca 840 ctcgaaaaaa agcaagagac caggtttgaa tcgcgcgttt taagctattg gcaccggacc 900 agacgacatt cggcgcgtac ggaagcggag aggaagatca agcgcgaact gattcaaaag 960 attgagaatc ttatgaaaca gttccagttt ggggtacctc agtcgccagt caagaaacaa 1020 atcaacaata tcttggaatc atctcgtaac tcaactccca caactgtgtc caatatttcc 1080 gaacgttcga aactcacgtt gagaacccct gtaccttgcc ctatcacaac tgtagcgatt 1140 accccgacgc ccacaatttt gatcacgacg cctagtccaa atttatcacc attagtgact 1200 tctccaatca cgaccccaac tttgtcgatt tctgtcgcca gcggatcgga tacgccgaac 1260 gccgtcgtga cacccacact cacgcttgag agtccaaacg caactacgcc tacatcgcga 1320 gtatcaacgc cgttggtgca gaccccaaac gtagttccgc ggaacccgag gtttgaattt 1380 cccacaccag accggggtgg tgaaggtacg gtcagcaata gcggttcgag ttctccaacg 1440 aaaggaaaaa cgacagacat ggcttcagga ggttcaggtc agggcaacgg aggacaggct 1500 ggagatgaca agttcatgac ggagtggaaa aatatgaagg cgttgcttgt ggacttggtg 1560 gacaaaaacg cagagctaag aacggaaata caagaacaaa agaaggcgtt acaggcgaaa 1620 ggaagtcagg gaccagtcaa aatgaagacg ggcgcaaccc aaaaggacgt cagctcgaag 1680 ggagcgatac cgaagagagg agagggatca acgtcgactt catccggcat tccgcaagag 1740 gaatgggaag ccatgaagga gatactgcgg acgatcatgc tgcaacagag ctcagcatca 1800 gcaaacaaca acaacgtacc tcctaggaac aatcctcagc caggccgagg aagcggaaat 1860 ccacgccaat ctccaccgcc gtacccaccg agttacgagg gcgactacag cgactcatct 1920 agctcgagag gaggtgctcg aggaggccat cggtatcgag atcaaccagg acgagtcgac 1980 gggcgaatcg acaaatggaa gatccgattc tcaggcgatt ccaatgtatc cgtggagaat 2040 tttctttaca aggcgaagaa actagcggaa cgggaagaaa ttccgccgtg ggagctactc 2100 cggaacattc acaagctgct tgaaggaacc gccgaagact ggtttttcac ttacgcggac 2160 gatttcgact gttggcgaac gttcgagtgc aagttcacgc accggttcgg caatcccaac 2220 aaggaccagg gaatccgatc aaccatgaaa taccggaagc agcaacgggg agaggcgttc 2280 aaagcatttg tagatgagat cgtgaggttg aaccggctgc tgacaaaccc gttaccgcag 2340 gacaggctgt acgagaccat ctgggacaac atgagggatc actacaagac aaagattgcc 2400 gtagttgggg taggagactt ggaccaactg ttggaaatca accaccggat cgacgcagcg 2460 gacagctcga tcaagcagca agcagacggc tcaggcgagt tttccaatca gcgccacgtg 2520 aatcacgtgg aagttgatga gtatggaagc gaagaggacg gagcatcggt caacgccatc 2580 agaggtcaac agagcagcta caggccgccg tgcagttcgg cgagtcagga ccagcaacga 2640 tcgacaagtg cgtaccgtaa tccgtaccga tcgagaccgc agcagcaacc cagagaacaa 2700 gcacatcacc aacaacaacc tcaagatcaa ctacaacaac aaccgcagca tttgatcaat 2760 caaacgcgac aggcagacgc gactcaagcg gcgaacccac ttccggtaga gaccagcgga 2820 gccacagctc aacttttgtg ctggaactgc ttggttcacg ggcacagctg gcgccaatgc 2880 aacaaaccga aagtgatttt ctgctacggg tgcggctgtt tgggaagaac gatcacaaat 2940 tgtgatcggt gttcggtgac gtctacccaa cgttcctcgg ctacgccgca gggaaacgag 3000 tagaggggtg cgagttgggg aattcgacca ccgacctgag accaaacatt cccacttcaa 3060 atctcgttta cgaccctttt gcgcaagtat accaagtccg aatacaggcc aacaaatgtc 3120 cgcacatccg agtcaacatc ttcgacacgc aggtcgacgc gcttttagac tctggagcgg 3180 gaatcagcat tctgaactcg ctggacatca tcgatcagta ccgcttgaag attcaaccgg 3240 cggccatccg agtatcgacc gcggacggat cgaactacgg ctgcttggga tttgtaaatt 3300 tgccgtttac gttcaagaac acgacgcgag tgattccaac cattgtcgtc ccggagatct 3360 ctcggaaact aatactcgga gccgatttct gggaggcgtt cgggatcaag ccgatgatcg 3420 acctcggtca aggaccagaa agcgtcgaga tggtcgagca ggacgacgac gcacttctct 3480 gtttcacgat cgagccttcg gagacaacgc caacgttgga agaccccgag gaggacgaca 3540 cgctggacat tccggtgtac gaagggccga ccgagtccgt tccggatcca gatgcgatcg 3600 agaccgagca cgtcctgacg accgaacaac gccgacaact gtccgaagtc attcgtgcgt 3660 tcgagttgac gtcgaccggg aagttaggac gcacacacct catcgaacac gagattgtgc 3720 tgaaggaagg agcaaaaccg aggaacccgc ccatgtaccg ctgttcaccg tacgttcagc 3780 aagcgatcaa cgccgaggtg gaacggttca agaagctgga cgcgatcgag gaatgctaca 3840 gcgaatggac caacccgctt gttccggtgc cgaaaaagaa cgggaaagtt cgagtctgtc 3900 tcgattcgcg aaagatcaac aagttgacgg tcaaggatag ctacccaatg cggaacatgc 3960 aagatatatt ccgacggttg ggcaaggcta cgtacttttc cgtcatagac ttgaaagacg 4020 cctactttca aatacctttg aaagaggaat gtcgtaattt gaccgctttt cgaacttcag 4080 agggagtctt ccggttcaaa gtccttccgt tcgggctgat gaatgcgccg ttcacgatgt 4140 cccgactgat ggccaaggca agtggttcag acctagagcc taacgtcttc gtctatctgg 4200 acgacatcgt gatcgcggcg agttccttcg acgagcacct gcggttgctt cgactggtgg 4260 cggaacgact tcgaaaggcc gggttgacta tctcgctcga gaaatctcgt ttcttccgca 4320 agcaggtcat gtacttgggg tacctcctca acgagcacgg agtggccatc gacaaaagcc 4380 ggattcaacc gatcctcgac tacgcacagc cgcggacgca ggaggacatc cgaaggttaa 4440 tgggcctcgc cgggttctac caacggttca tcaaggacta cagtagaatg acggcgccga 4500 tcaccgactt gttgaccaaa gagaacaaac gcttcacgtg gactaaagag gcagaagacg 4560 ggtttcggga gctcaaagca gtcctgacct ccgcgccgat cttgggaaac cctgacttct 4620 cgaagctgtt caccatcgag tcggacgcgt cagaccgagc ggtaggagcg gcgctggtcc 4680 aagagcaaga cggcgtcacg cgcgtcatca gctacttcag caagaagctc aaccgaacac 4740 agcggcgata ttcagcagta gaaaaggagt gcctcggcgt attgtcggcc atacaacact 4800 tccgacatta cgtagaagga accaaattcc gggtcatcac cgatgccaga agcttgctct 4860 ggttgttcaa cgtcggcgca gagacgggca acgcgaagct attgagatgg gcgttgagga 4920 ttcaagccta cgacttcgac ctcgagtacc ggaaagggaa agcgaacatc acggccgact 4980 gcctgtcacg ctcgatcgaa gtggacgccg ttcgcgtcac gcttccggac gacgagtacg 5040 aagagcagat cgagaacatc actgggaacc caacgaagtt cagtgactat cgggtgatcg 5100 acggacggat cttccgctac gtgaagtcgc ctaaccgcca gaccgatcca cggtttagct 5160 ggaagctact gccgccgcga gccgaacgca aggagattat cacgaaggtt cacgaccaag 5220 ctcacttcgg tttcgaaaag acgctggcat cactgaagga gcgacacgac tggccgaaaa 5280 tgcgtgaaga agttcggaag cactgccgcg aatgcctaca gtgccaggtg agcaaagccg 5340 gcaaccagaa cgtcacgccg cccatgggac cacaaaaacc ggtccaatac ccgtggcagt 5400 tcatcacgct cgactacgtc ggtccgctac ccgcctcggg caagaacagg aacacatgcc 5460 tgctcgtcgt gaccgatgtg ttcagcaagt ttgttctcgt gcaaccgttc cgcgaggcga 5520 aagcacactc gctcacagag tttgtcgaaa acaacgtgtt tcgtctgttc ggagtcccgg 5580 agatagtcct gacggacaac ggaactcagt ttttatccaa gcagttcaaa acccttctcg 5640 aggcgtacca cgtccagcac tggctgaccc cggcgtacca ccctcaggtg aacaacaccg 5700 agcgggtaaa ccgggtgatc accacggcca tcagggcgac gttgaagaaa gaccacaaac 5760 attgggccga caaccttcag gacatcgcgg acgccatccg taacgctgta cacgattcga 5820 cacactacag tccatacttc gtggtcttcg gccggaacaa agtgtccgac ggccgggagt 5880 acagctggat cagagacaat tacgagccga gcgacgacga caaggacgtt ccagagcaga 5940 agaagaagct gttcgaagag atcaaaaaga acctgaccgc cgcctaccag agacacgcca 6000 agacgtacaa cctccgtacg aacgccaact gtcccaagta caccgtgggg gagaaagtac 6060 tgaagcagac ctttgacctg tccgacaagg ggaaaggctt ctgcaagaag ctcgcgccga 6120 agtacgaacc tgcggtcgtg cgaaaggtgc tgggaaagaa cacgtacgag ctggaggatc 6180 tggacggaaa acgctttggg gtctacttta ccaatcggct gaaaaagtgg cacacacctc 6240 agccaggggc aagcggatcg taatgagatc ttttcgaaga gcaaattgtt ttttttagct 6300 atgtccctcc taatcccacc aggagcaacc atttccgttt agggaacaaa acaccctcgg 6360 ggtaattacc tcattagggc actcgacgga ctcgtgttag ggaaaaccag aataaacaaa 6420 ccagcactga gggtaccgag ttgtaccacg cgcgatcgcg acttgggaga atcggttcga 6480 tctggggtga tttgtcagtt gcgttccaaa cttaagtacg ggaagatgaa gcaaagaagg 6540 agcaacttag ttgccatgac ttccaaaacg cccgatgact gtcgagagcc agtagtcact 6600 tgtactaaga ttgcacgaag ctatgaacga gcaagtgatg acctgtgtta gtagccaccg 6660 aaaggtgtga acgaagaaag cgagcctgcg atgacttcga cagcgatcga tgattgtcga 6720 aagctagtag tgcatctttg gacaacgccc acaaatcatc tcatgatcga ttccgacgat 6780 ccaattgtag acgatttaga acccaacgag ctatgaatag acttcaaccg aagaagtgac 6840 caagggtaga accggttcta ccacgcgaaa ataccaaatt ttgggccgag tagacaccac 6900 aaaccaagac tagcctgacc gagtcacagc tagcacgcgt aaaacgaaga agaggaagaa 6960 gatacgcaaa accgaacgcg acgcgaagaa gttagtggtt agtagggcct aacgcctaac 7020 ccttaaacta agttagtact aaactaggtt aatcttagat ctataaaaaa ctcacgatcc 7080 gctagtccag ttgggggtgg atcatccttc cttccgtcga gtgtgggtgg agtccaaact 7140 tccgtcagtc ggggcgagtc atccttccgg cgaagtgggg tgagggtggc cgaacatcgt 7200 cagctggccg agcaagatcg tcgtcacctg tttgcgtggg aggacgacat tgtaatatga 7260 tcactttcat tattagtttt cttcttatac ttacctgaat tgttgtttat ttttctacga 7320 tggagacgtt ttttcccgtt ttttttgcct cgttttccga tgattaacga tctttttcta 7380 atgtaaatag tagttatagt tttacccccc ttgtaaatag tagtttaagt tatagttagt 7440 aaacaatcga aacacagctg gcgtttcgct ggtttgttta cattcggttt ccaaatatag 7500 tagttttttt ttttttggca ccatagtttt tttttcccaa agtagatttt ccacaccaat 7560 ttttgcacag attttcccac ccagttttcc attttttgat aatttttccc aagtagtttc 7620 agtttgattg attagccaca atttccgcag tttttttccc ccgagagctt atcaatcgct 7680 aacgacaggt cgcgatttcc acgccgcagt agccatgccg ctgcgctgaa aataaacaaa 7740 caaaaaacca acttaaaacc gatagctgga acgatatctc agctcgactt tcctcgtttc 7800 cgttggggac ggttcttccg accatcccag tcgtcttttc tggccatttc actttttttt 7860 ttttcaccaa actttttcct ccaaaaactt tcactcgcgg ttcaaaactt gcgcctggca 7920 ttttttccac tttcgctttc tcacatgttt actccgttga cagttcgatg acgtttgttt 7980 tgtttgtgtg cgtagcaccg gtaaaatgcg cagggtgaat gaggtgacag atccagatct 8040 tggagatgga ttctgcatgg gtgatttttt cttgaatgcg aggggtgagc gaatgttttt 8100 tttgtagttt gtaattatgt gttgattctt caagaatttc gttatcggat tgatcctggt 8160 gttagaagtt cttgtgagga ggaatatttt gcgttccaaa tcataatgag tgttaaatcg 8220 cggaaatata tttccgtggc ggcgaattat gaggagaaac agaccccttt tctgtcagtg 8280 aagtttgttc ccacagactc gtcttagtat tgggccggga ttgcaaacta aactgactag 8340 ttcttttcgt gaacgttgtc ttcgcctatc ggctttgtcc tggtgtagaa gacacgtttc 8400 gtctactgtg ctagctcgcg tagggttatg cgcaaactct atgtcgatgc tatatttttt 8460 ttccgtagtt tgttttcatg ttttaatttt gtaattttga gtatgaatga gtcgtcgtca 8520 tcaatgaact gatcagtaat tttctgggtt acaaatttct ttattgcctc ttctccttat 8580 ttttttttct caatatttca ttagaattag ttgtacatat ttatacattc atcataaatt 8640 tctttattaa aggtcatttt ttttttcatg aatatttttt ttcaaaagcc tttgccgtat 8700 ttgagccggt tgattagagt taggaacatt tgtaaattga attttttttt ttggaattac 8760 tcgttttttt ttgcatttct cgaggcttta ctgttgttag tgatgatttg agtagtaatt 8820 cgctggagac cggagacttt agtcattgat ggttagtagc agacttacaa catcgtcgct 8880 gacactgtga tgttgtctga tctggaaagt gagacggagc tgactgttag caaggaccta 8940 acttgttagt gagttgacag ttcagtgatg aacgtatatg accccgaaga gaaacccaat 9000 catcagtagc acttctgttt ggcctacgaa aatttgtttg gtcgtttttc accgccgtgt 9060 taacaccaaa caaattttcg tacttttagt ggggaatga 9099 // ID BEL-4_AA-I repbase; DNA; INV; 5030 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_AA_; KW BEL-4_AA-LTR; BEL-4_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5030 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 857-857 (2011). XX DR [2] (Consensus) XX CC Positions [4019-4609] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 186..1631 FT /product="BEL-4_AA-I_2p" FT /translation="MSALFRVPCVRKASSSALSELADEFNRHVGILDKLEN FT VDAHWNSFLVERLSSLLDEKSLIEWETQCKDEETPQYERLLEFIRKRSRTL FT QKCTSSCNASSAIQVKSVKPKPTSSHVVSDNVMKCQSCKQAHPLVQCDAFI FT KLSPSQRLDFAKKHRLCINCLRGGHMARDCRSSLCRMCGKKHHSMLHLPTP FT ASNSAVIATPSEEQSTNTSQACTAICTTATTTSQSESSYSVANAQPVVLTR FT TLPQAIVPISQNAWSPLVAIGQPPSPSPVVSNGQTESVHLPCPPQPQQQAT FT SLTEANNTFDSIVFLSTAVVRVRDVNNVYHFARALLNSGSQSNFVSESLCQ FT KLDLKRTRINLPVSGIGQATVNVHYKVNIVLSSRFGSFEQQIDCLVLPKLT FT VSLPSRSIDISRWTIPRNLPLADPRFNISLGVDLIIGAELFFTLLEAQKLT FT LTDFQHYRKPFWDTWSLETPLPSRQRQKCVTLLQTKI" FT CDS 1760..4993 FT /product="BEL-4_AA-I_1p" FT /translation="MVRLPFRESQVPFLGDSYKSAVNRFSMMERRFSKDDE FT LRTEYTQFMEEYLRLGHMEECPSVDGPQFFLPHHAVRRPESTTTKTRVVFD FT ASSKSHGQLSLNDVLFTGPTVQPALLIMVVNFRLPRYVFSADAEKMFRQVW FT VHPDDRKFLSIVWRSDPSLPLNHYHLKTVTYGLACSPYQAARVLNKLAEDE FT GESYPLAAPVITKRFYVDDTLAGGDDLEEVVETCRQLQELLARGGFTLRKW FT CANDPTVLRHIPRELLGSTGPTEIGRSVITKALGLLWNPSTDRLSFQVPEL FT GDLQIVTKRAVVSEMSRLFDPLGLLGPVVINARIFVQGLWAKRLTWDEQLS FT DEESQWWQAFREDLANLKEITVPRHVVANCGHNYQLHCFCDASLKGYGSCV FT YVVGPNAAGEIESKLLIAKSRVAPLRGLSIPRLELCAALLGSQLIHNLRTT FT TDFIGTAVFWSDSTVVLHWILSPPNDWKVFVSNRIAEVQRLTRGLPWNYVP FT SELNPADRISRGINPSQIIDDSLWWHGPPFLTDTTATWLKCPSIWVINQET FT EQEKRAIVVLASVQDDNSIFDRYSELGKLLKVVAMCIRFGRNCRRLKAERV FT YGNITPQEVDTALKSMVRLAQVNCFTKEIKILKRHNVGPSGPADFESKSPL FT KNLNVCLDQSDMLRLDGRLKNEAGPHDSKFPLILPANHQISLLIARSLHIR FT TAHAGPSLLLATMRQRFWPIRGRELVRKVVRKCITCFRCRPTDYHQQMAPL FT PAVRVVPSRVFSKAGLDYCGPFNIRPLYGRGANVKMYVAIFVCLVVKAVHF FT EIVPSLTSAACINAIKRFVARRGRLIELHCDNATAFVGADRELKSLRRQYI FT EQFKADEWKDYCLDSGITFHFIPARSPHFGGLWEAGVKSFKYHFRRIFATN FT SYTLDEFTTAATHIESILNSRPLTPLTDHPDDLAVLTPGHFLVGEPMFSIP FT EPDMMNDQVPRLSRFQELRRSVQNFWKRWSRDYITQLHQRSKWRTSSPNIK FT KGALVLLKQEDLPPFLWNLGRVDETYTGPDGLVRVVLVRTNRGVYKRAVTE FT VRVLPIDTTEGDQNPINQDAA" XX SQ Sequence 5030 BP; 1328 A; 1228 C; 1201 G; 1273 T; 0 other; aatccgtaat acactcaaat ctgcaactgt cggctgtgca aaaactgcat tatttgcgca 60 gcagtttaaa gggtgatgcg tctcggttga tttcttccat tgcgatcacc gcggacaatt 120 atgcaatcgc atggaagacc atttgtgacc ggtacgagaa tactaactac ctagtgaaac 180 agcatatgtc agctctattc cgtgtgcctt gtgtgcgaaa agcaagttca tctgctctat 240 ccgaattggc agacgagttt aaccgccatg ttggaattct cgataaacta gaaaatgtgg 300 atgcccactg gaattcgttc ttagtggagc gtctcagcag cctactcgac gagaaatcgc 360 ttatcgagtg ggaaactcag tgtaaagatg aagaaactcc gcagtacgaa cgtttgcttg 420 agttcattcg caagagatca cgtactctac aaaagtgcac ttcgtcgtgt aacgcttctt 480 cggctataca agtgaaatcg gtaaaaccca aaccaacctc ctctcatgtg gtctccgaca 540 atgtgatgaa atgtcagagc tgcaagcaag ctcatccttt ggttcagtgt gacgcgttca 600 tcaagttgag cccaagccaa agactggact tcgcgaaaaa acatcgcttg tgtattaatt 660 gtttaagagg tggtcatatg gctagggatt gccggagtag tttgtgtaga atgtgtggga 720 agaaacatca tagtatgtta catctaccaa cgccagcgtc aaattcggca gtcatcgcta 780 caccgagcga agaacaatca acgaatactt ctcaagcttg cacagctatc tgtacaacag 840 caacaactac atctcaatct gagagttcgt attcggtcgc aaacgcacaa ccggtggtgc 900 tcacgcgtac tttaccccag gctatcgtac caatctcgca aaacgcttgg tcgcctctcg 960 tcgcgatcgg tcaacccccc tctccgtcgc cagttgtatc gaacggtcaa accgaatcag 1020 tgcaccttcc atgccctccc cagcctcagc aacaagccac gtcgttaaca gaagcaaaca 1080 atacgttcga cagcattgtg ttcctatcaa cagccgttgt gcgagttcga gatgtgaata 1140 atgtgtacca cttcgcacgc gctctattga acagtggttc acaatcgaat ttcgtatctg 1200 aatcgctatg ccagaaatta gacctaaaac gtacccgaat caaccttcct gtaagcggta 1260 tcggacaagc aaccgtgaat gttcactaca aggtgaacat cgtactatcg tcgcgattcg 1320 gtagttttga gcaacaaata gactgtttgg ttctaccaaa actaaccgtt agtcttccca 1380 gccgcagtat agacatctct cggtggacca ttccacgaaa cttaccgtta gcagatccga 1440 ggttcaatat ttctcttgga gtggacctca ttataggagc ggaattgttt ttcactctac 1500 tcgaagctca gaaactaacg ctgacggatt tccaacatta cagaaaaccg ttttgggata 1560 cgtggtctct ggaaacgccc cttccaagtc gccagagaca aaagtgtgtc acgttgctac 1620 agaccaagat ctaaatgctc agctggaacg aatgtgggag gtcgatgact tcgatgtagg 1680 acgtgcacta acacaggaag agtaacatgt ggaagatcat ttcatccgaa ccgtatttcg 1740 tgatgatact ggacgctata tggtacggct tccgtttaga gaatctcaag ttccattcct 1800 aggtgattcc tacaagtcgg ctgtgaacag attttcgatg atggagcgac gtttttccaa 1860 agacgacgag ctacgtacgg agtatacgca gttcatggaa gagtatttaa ggttggggca 1920 tatggaagaa tgtccatctg tcgatggccc gcagtttttt ctcccgcatc atgctgttcg 1980 tcgaccagaa agtacaacga caaaaaccag agtcgtcttc gacgctagca gcaaatccca 2040 tggccaattg tcgctaaatg atgtcctttt cactggcccc acggttcaac cagctctgtt 2100 gattatggtg gtgaatttcc gacttcccag gtacgtgttt tctgccgatg ccgagaaaat 2160 gtttcggcaa gtctgggtcc acccagatga ccgaaaattt ctgtcgatcg tttggcgttc 2220 ggatccgtct ttacctttga accactatca cttgaaaacc gtgacatacg gactagcctg 2280 ttccccgtat caagctgctc gcgtactgaa caagctcgcc gaggatgagg gagaaagtta 2340 tccactagcc gctccagtaa taacgaaacg tttctacgtg gatgacaccc ttgctggagg 2400 agatgatttg gaggaagttg tggagacttg ccgacagtta caagaactac ttgctcgggg 2460 aggattcacc cttcgtaagt ggtgtgccaa cgacccgaca gtgctacgtc acatacccag 2520 agaactcctc ggatctacag gtcccacgga aataggacgt agtgtaatca cgaaagccct 2580 tggattactg tggaacccaa gcaccgatcg gctaagtttc caggtacccg agctggggga 2640 tctgcagatt gtaacaaaac gagcggtagt gtcggaaatg tcaagactct ttgacccctt 2700 aggcctcctt gggccagtag taatcaatgc aaggatattt gtccaaggat tgtgggcaaa 2760 gcggctcact tgggatgaac agttatcaga tgaagaaagt caatggtggc aagcattccg 2820 cgaagatttg gctaacctga aggaaatcac cgtcccacgt cacgtagttg ccaactgtgg 2880 tcataattac cagctgcatt gtttttgtga cgcgtcatta aaaggttacg gaagctgcgt 2940 ctatgtggta ggcccaaatg cagctggaga gattgagagc aagttgctaa ttgcaaaatc 3000 acgggttgct cccctacggg gcttgtccat accaaggttg gaactttgcg cggctctcct 3060 tggcagccag ctgatacata atctgcgcac aactacagat tttattggaa ctgcggtatt 3120 ttggtcagac agcactgtgg ttttacattg gatcctgtcg ccaccaaatg actggaaggt 3180 gttcgtctcg aatcgcattg cggaggtgca acgtctaact cgagggttgc catggaatta 3240 tgtcccatct gagctcaatc cggcggaccg tatatcccga ggaataaatc caagccaaat 3300 catcgatgat tcactatggt ggcatgggcc gccattcctg acagatacga ctgcaacttg 3360 gctgaaatgt ccatcaattt gggtcatcaa ccaagaaact gagcaggaga aacgcgcaat 3420 cgtagtatta gctagcgtgc aagatgacaa ttctattttc gatagatatt ccgagctggg 3480 gaagttactg aaagtagtcg ccatgtgcat acgtttcggt cgcaactgtc ggcgattgaa 3540 agcggaaaga gtgtacggga atataactcc acaagaagtc gatacagctt tgaaatccat 3600 ggttcgcctc gctcaggtaa attgctttac taaagaaatt aaaatcctca aacgccataa 3660 cgttggtcca tcaggtcctg ccgatttcga gagcaaatct cctttaaaaa accttaacgt 3720 ttgtcttgat caatcagaca tgttgcgctt ggatggtcga ttgaaaaacg aagctggccc 3780 acacgattca aaatttcctt tgattttacc agccaatcat caaatcagcc ttttgattgc 3840 ccgttcccta cacataagga cggctcatgc aggaccttcc ttgttactag caaccatgcg 3900 tcaaaggttt tggccgatac gtggacggga attagttcgg aaagtggtcc gaaagtgtat 3960 aacatgtttc cgctgtcgac caacggatta ccaccaacaa atggctccac tgccagcagt 4020 cagagtggtc ccgtccaggg tgttttccaa agctggttta gactactgcg ggccattcaa 4080 cattcgtcct ttgtatggga ggggggctaa cgtgaaaatg tacgtcgcga ttttcgtttg 4140 cctggtggta aaagcggtac atttcgaaat cgttccgagt ctcacttccg ccgcgtgcat 4200 taatgcaata aaacggtttg tagctcgacg cggtcggtta atcgagctac attgcgataa 4260 cgcaacagct ttcgtcgggg ctgaccgcga gttgaaatcg ttgcgtcgcc aatatatcga 4320 gcagttcaag gcggacgagt ggaaggacta ctgtctcgac tccggaataa ccttccattt 4380 catacctgct cgctctccac atttcggggg tctgtgggag gccggagtga agtcgttcaa 4440 gtaccatttt cgccgaattt ttgctacaaa ttcctacact ttggacgagt ttactacggc 4500 agcaacccac atcgaaagca tactaaactc ccggcctctc actcctctca ccgatcaccc 4560 tgacgacctt gctgtgctta cgcctggcca ttttttggtc ggggaaccga tgttttccat 4620 cccggaaccg gatatgatga acgaccaggt gcctcgactc tctcgatttc aggagctacg 4680 ccgctctgtc cagaatttct ggaaacgatg gtccagggat tacattactc agctgcacca 4740 acgatcaaaa tggagaactt cctcaccgaa catcaaaaag ggagcgttag tccttctgaa 4800 gcaggaggat ctcccaccgt tcctgtggaa ccttggacgg gttgacgaaa catacacagg 4860 tcccgacggt ttagtgcgtg ttgtgctggt acgcaccaat cgcggggtct acaagcgagc 4920 agtcaccgaa gtccgagtgt tgcctattga tactacggag ggcgatcaaa atcctatcaa 4980 ccaggatgcc gcttgatcgt tgaaacggac tgtttcaacg gggcccggga 5030 // ID Gypsy-153_AA-I repbase; DNA; INV; 7109 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-153_AA_; KW Gypsy-153_AA-LTR; Gypsy-153_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7109 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1031-1031 (2011). XX DR [2] (Consensus) XX CC Positions [4732-5208] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 353..2128 FT /product="Gypsy-153_AA-I_2p" FT /translation="MEVQNFEKMDYYINPNDLLQDEIDYELKLRCLAIDGS FT VQEKRQKLRQAQLEEIRKQKIFRVKKSITQEYDTVKAKVQDIKHVMQFYPE FT QKLISRLRHCRLRIIRANAITREQVRLKNELVYEIENLLDRYGSLVDRRMS FT LPLNTHLNFSNVAELNVQKNEQVGNLNLTSPSVMGQAIVNTVQTQNRSSDA FT PFLDELRIENNLSDEAVGGNDFAESDGFEEQLDLLFNDFRDASQHQRVRTH FT EPLLPGRKEQDLINVLPRNQIDTSKQEINQYIQQALSEQMASLMEKISPML FT DNQRSSTPNLPPPPPPPTPPPPRTAAPPLQPPAPAQPPREQVRQTRNEYYP FT NSVASHVSLAGDINPIQLNRPSFESRNRWQVPISKWRINFSGDSRGPTVTQ FT FLNRVEVLATNNRVSETDLLSQANFFFKEGSEAEEWYFTFCNKFTNWVGFK FT HQLRLRFEQPNKDKVIERQILDRRQLPHETFNAFISAIERLAQQLTKPMSE FT ERKLDILTENMKDSYKPFLTIYRIEKIDDLVAVCHALDKSMYRSYNSYPKN FT RPHQINNIEETEQDSDSQYEQEEGELNAIGQALRRKKVDEKTREY" FT CDS 3178..5583 FT /product="Gypsy-153_AA-I_1p" FT /translation="MLELDIIEESTAEWCNPLLPVKKSSGEWRICLDCRRI FT NEITKNEAYPFPDMLGILGRIERSKYFTVIDLSKAYWQIPLEESSRDYTSF FT RAGKQLFRFKVMPFGLKGAPITQTKLMNKVLGFDLEPHVYVYLDDIIITSN FT SLEEHFRLLRIVAERLRRANLTISLSKSKFCQKEISYLGYTLSDKGLAIDS FT TKVQPILEYPIPKTPKDVRRFVGMVSFYKQFIDHFSDLTAPITDLLKKSKG FT KIVWTKDADEAFLRIKSELISPNVLANPDFNLPFTIESDASNVAVGAVLTQ FT NQDGIRKPIAFFSKKLSATQRKYAPTEKECLGVILAIQKFRHYVEGSRFTV FT VTDAQSLIWLRQISAEGGSAKLIRWALKLQQYDFELLYRKGSLNITADALS FT RAVDAVSVSDPEYEELKRKILSNNQKFKDFRVTKDKVYKLIPSKMIDPRFQ FT WKFVPPMQQRFKIIQETHDSMHFGRNKTYKKLQEQFYWPSMENDVRKYCQG FT CETCKKVKYPNANPKPLMGKQKLASMPWQTISVDFVGPFPRSKAGNSVLLV FT VTDLFSKFVIIQPLRDAKTTPLITFLENMVFLLFGVPEILISDNGVQFKAK FT EFEKFLNKYHVSHWRNANYHPENNPTERVNRVIGAAIRSYLKEDHKEWDRD FT IHKVALAIRTAVHESTHFTPYFINYGRNYISSGTEYKYIRDNGMDVVYEPL FT HMNDNLREIFDTVKSNLKKAYERYAKYYNLRSNKALPTFDVGETVLKKNFH FT QSNKSKQFSAKLADPFSLAKVVAKIGTACYDLEDLNGNRLGVYHATHLQKK FT " XX SQ Sequence 7109 BP; 2364 A; 1348 C; 1423 G; 1972 T; 2 other; tttggcgccc aactaacggt tttttgaatt tgtacagtat ttagttaaat tgagtaggat 60 tagttttgaa gtttaagcga atttgtagat atttaagaat tgggattgag ttagttacgt 120 tcattgaaga tatatttcgg attaatatta gaattggctt taaattgaat ttgttttgct 180 gcattataca gaatttgaat ttttattttt actgaattga atttttataa taatatagtt 240 cgattaacat ataattattt ctttttctat ttttcaatat ttttttttat ttattctcca 300 ttttgtgatt gcgaaggtga cctgaaaata taaagaaaaa cttgttaaaa tcatggaagt 360 tcagaatttt gaaaaaatgg attattatat taatcctaat gatttattac aagatgaaat 420 tgattatgaa ttaaaattga gatgcttagc aattgatgga tccgtccagg aaaaacgcca 480 aaagttgaga caagcccagt tagaagaaat tcggaaacag aaaattttca gagtaaaaaa 540 atcaatcacg caagaatatg atacggtaaa agcaaaggtt caagacatca aacatgtaat 600 gcagttttat cccgagcaaa aacttatttc ccgacttaga cattgtagac ttagaatcat 660 tcgagcaaat gcgatcacaa gagagcaagt tcggttgaaa aatgaactag tctacgaaat 720 tgagaattta ctcgatcgat atggcagttt agtagatcgt agaatgagct tgccgctaaa 780 cactcatttg aatttttcta atgtagcgga acttaacgtt cagaaaaatg aacaggttgg 840 aaacttgaac ttaacaagcc cttcagttat gggacaagct attgttaata cagtacagac 900 tcagaaccga agctcagatg ctcctttctt ggatgaattg cgaatcgaaa acaatttatc 960 ggatgaagca gtaggaggaa atgattttgc ggaaagtgac gggttcgaag aacaattaga 1020 tctgttgttt aacgattttc gagacgcatc tcagcatcaa agagtacgta ctcatgagcc 1080 tctattacct ggtagaaagg aacaagatct gatcaacgtt ctgccacgga atcaaataga 1140 tacttcgaag caagagatca atcaatatat acaacaggct ttatccgagc aaatggcgtc 1200 acttatggaa aaaatatcgc caatgttaga taatcaaagg tcttcgacac ctaatttacc 1260 tccaccccca cctccgccta cacctccacc accccgaacg gcagcaccac cgcttcaacc 1320 acctgcgcca gcacaacctc ctagagaaca agtgcggcag acgagaaatg agtattatcc 1380 taattcagta gcatcacatg tttcgctcgc aggagatatt aatcccatac agcttaacag 1440 acctagtttt gaatctcgca atcgttggca agttccgatc agcaaatgga gaatcaattt 1500 tagtggtgac tcgcgaggac caacggtcac tcaatttctc aacagagttg aagtgttggc 1560 cacaaataac agagtttcag aaactgattt gttaagtcaa gctaacttct tttttaagga 1620 aggatcggaa gcggaggagt ggtacttcac attctgcaat aaatttacga actgggtcgg 1680 tttcaagcac cagctaaggt tgcgctttga acagccgaat aaagataagg ttattgaacg 1740 gcaaattctt gaccggaggc aactcccaca tgaaacgttc aatgccttta tttcagctat 1800 tgaaaggctg gctcaacaat taactaaacc aatgtctgaa gagagaaagc tggatatcct 1860 tactgagaac atgaaggata gttataaacc cttcctaaca atctatagga ttgaaaaaat 1920 agacgatctt gtcgccgtct gccatgcatt agacaagtct atgtacagga gctacaatag 1980 ctatcctaaa aatagaccgc atcaaattaa taacattgag gaaacggaac aagattcgga 2040 ttcccaatat gaacaggaag agggtgaatt aaacgcaata gggcaagccc tgcggcgtaa 2100 gaaagtagac gaaaagacaa gggaatackc tggtgctatt ccaaaaatac cgaaggacga 2160 ccaaaataac gtcttgtgtt ggaactgtag gcaatatggc catttttggc gcaattgtga 2220 caaacagaaa aagattttct gtcatttgtg tggacaaatg aattttgtga ccgcaaattg 2280 tcccaataac catcgattcc catcacaagc tcaggaaaac gaaaatccag atcgatcgta 2340 gggagtgatc catctgaagt cacggaaaac tccccagata gaggttctca aagaactgaa 2400 aatgatcaat catattcaaa atatgccaca gcgttccaaa ttaatacaaa acctaatcgg 2460 tgcccatatg ttaaagtcga aattctcggg acaccaatta tggcgttact agactctggg 2520 gctagtgtaa gcattttgag ttccagagaa atagtagaga aacacaagtt caaagttcaa 2580 ccgatcaact tgatagttaa aactgcagat gagactcccc taaactgtgt aggagtgata 2640 cagatcccat tcacatttca aggaaagact aatgtcattc cgacactctt gattccagaa 2700 gtttcaaaac cccttatact ggggatcgat ttctggaatt catttggaat tgctccagta 2760 gtggtagatc ataatggaat gcaaccaatc agtactgttg gcgatcacgg actgaacatg 2820 attgaatcgt tttttggtga tccagatgag ttgattcttt ttacggttga accaaacgga 2880 ccttcagtgg tagaagagga ttcaaccgaa gatatttcct tagaacttcc gtttatagaa 2940 gactctgaaa atagaaacat aacaaatatt attacagagc atactctaac aatacaagak 3000 cgagaagaac tcaatgaaat aattgaacta tttagaacta ctaataatgg aaaactagga 3060 agaacccatc tgtgttccca caaaattgaa ttggtcgacg gtgcccaacc taaaaagccc 3120 cctcagtaca gatgctcacc ccatattcag aatgaagtgg ataaagaaat agcaagaatg 3180 ttggaactag acatcataga agagtcaacc gcagaatggt gtaatccgtt actgccagtc 3240 aagaaatcgt ctggtgaatg gcgaatctgc ttggattgcc gcagaattaa tgaaataaca 3300 aaaaatgaag catatccgtt tccagatatg cttgggatcc ttgggcgcat tgagaggtcg 3360 aaatacttca cggtaatcga cctatcgaaa gcctattggc aaatacccct ggaggaatcc 3420 agcagggatt acacttcctt cagagctgga aagcagttgt ttcgttttaa agtaatgcca 3480 tttgggctga aaggtgcgcc aattactcaa acgaaactga tgaacaaagt tcttgggttc 3540 gacttggaac cgcatgtcta tgtgtacctt gatgatatta ttatcacgtc taatagctta 3600 gaagaacatt ttcgccttct gcgcattgta gcagagcggc tacgacgagc aaacctcaca 3660 ataagcttaa gcaaatctaa attttgccaa aaagaaatca gttatttagg ctacactttg 3720 tcagataagg ggctagcaat agacagtaca aaagttcaac caattttaga gtacccgatc 3780 cccaaaactc ctaaagacgt aagaaggttc gtgggtatgg tcagttttta taaacagttt 3840 atagatcatt tcagtgacct aactgcaccc ataacagact tgttaaagaa gtcgaaaggg 3900 aaaattgtat ggaccaaaga tgcagacgaa gcttttttac gaataaaatc ggagttgata 3960 tccccaaacg tattagccaa cccagatttt aacctgccct tcactattga gtcggatgct 4020 tcaaacgttg cggttggagc cgttctgact caaaaccaag acggaattag aaagcccata 4080 gcattcttct ccaaaaagct ctctgcaact cagcggaaat atgccccgac tgaaaaagag 4140 tgcctgggag taatattggc catacagaag ttcagacact atgtagaagg gtctcgtttt 4200 actgtagtga cagacgctca aagtctgatc tggcttcgcc aaatcagtgc cgaaggaggc 4260 tcagcaaaac tgatcagatg ggcactgaaa ttgcaacagt acgatttcga actgttgtat 4320 cgaaagggct ccttaaacat cactgctgac gcgctgtcta gagcagtcga tgcggttagc 4380 gtaagtgacc ccgaatacga ggaactgaaa cgaaaaatat tgtcaaataa tcagaaattt 4440 aaagatttca gggtgactaa agacaaagtg tacaagctga ttccatccaa aatgattgac 4500 cctaggtttc aatggaaatt tgtcccacca atgcaacaac gatttaaaat cattcaggaa 4560 actcacgatt ccatgcattt tgggcgtaat aagacataca aaaaattgca ggagcaattc 4620 tattggccat caatggagaa tgatgtccga aagtactgcc agggatgtga aacttgtaaa 4680 aaggtcaaat atcccaatgc taacccaaag ccattgatgg gcaaacagaa gttggcatcg 4740 atgccatggc aaacaatatc agtcgacttt gtcggaccgt tccctaggtc caaagcaggc 4800 aattcggtcc ttttagtagt tacggacctc ttctcaaaat ttgtaatcat tcaaccccta 4860 cgggatgcaa aaacaacacc tttgatcaca tttctcgaaa atatggtatt cctccttttc 4920 ggagtaccag aaattctgat ctctgacaat ggggtgcaat tcaaagccaa agaatttgaa 4980 aagtttttga acaaatatca tgtgtctcat tggagaaatg ctaactacca ccctgaaaac 5040 aaccccacag aaagggttaa ccgggtgata ggagcggcca tccgttcgta tttaaaggag 5100 gaccataaag aatgggacag agatatacac aaggtagccc tagccattag gacagctgtc 5160 catgaatcga cacattttac accctatttt attaactacg ggcgtaacta catcagctca 5220 ggaactgaat ataagtatat aagggataat ggaatggatg ttgtctatga accattgcat 5280 atgaacgata atctgcgtga aattttcgat actgttaaat caaatttgaa gaaagcatat 5340 gaacggtatg ctaaatacta caaccttcgc tcaaataagg cactcccaac gttcgatgta 5400 ggggagactg tcttgaaaaa gaactttcat caatcaaata agagcaagca attctctgct 5460 aagcttgctg atccattttc gttggctaaa gtagtagcaa aaatagggac agcgtgttac 5520 gacctagaag atctcaacgg aaatcgtttg ggcgtgtatc acgctacgca tctacagaag 5580 aaataatcaa ttgaaccaaa tttacagcta tgtctgatgc acactaaagc acagttacaa 5640 tgagattagt ccaaaaaaac acaaacttac ggtttggcaa ctccgagttc agcgggaggg 5700 attggcttca ataacagttt tcatcctaca agcaatcccc tcttcgcgtc tcgatggcat 5760 caccctagga ctgcttctga cctaaaagaa gaagaagcat ttagttctag tggattgata 5820 aaaagctatg taatagaata tgcaagagaa aaatcggttg ccctagaacg atcagtcagc 5880 ttaaccaccc aagcccggag ttcaccgagg tgagctgaga ggggagagta cgctggaagt 5940 cggtgcgagt ctgtagggcc agacgatgac cagactgacg ttcaaaccta gattcttctc 6000 ttacatatcc ttggacaaag tttcacaaca caaaaacact tcgaacaagt cttaagttca 6060 caattttaca gctatgtaca aaaacaaacg ataataaaca ctgtgaacgt ttaccacagg 6120 cactaagcgt aaacattgta tatagtctgt ttttaagcta tttctcaact taatcaatga 6180 attatcactt attttcgcat tgttctgttg tacataatat gttttatagt tgttcagtgc 6240 gggggcgtaa tgttttcgtc cgatgcgcag tgagcgacat ttagtgcgta gatgattacc 6300 atgtttccat agatccacca atgtaaattt cctgttagag ttccatgtat cagcatccat 6360 agtccattcc gtgttttcca tcacaattta caatagtttg tttagttttt cgcgtagttt 6420 cagcagtttt tttttttttg ataattctca caatcctgaa ccgattccat gttttttctt 6480 gcttcacttt gttttgattg tctttctttt tcacattcac cacagagatt ttgacagtac 6540 gatatgtcaa agcttttcgt cgatttgttt gacatgtcga tcaatcagtt tacagtagag 6600 gctgttgcaa tagggatacc gtaggttgat gttattttgg gagctctgat agttcctgca 6660 acaaattaag tagtttggtt gatgtttgtt cggatgtttc cggatagttc aaaatcatga 6720 tttaatgttg gtttagaagt ggctgagaag tttctcggcg attcaagtaa ggatagggag 6780 atgcgttgtg agctacgcaa atggagtttt tcacctaggt atcccatatg aacggcacaa 6840 tataattggc gaacttaaaa aaatagggtt tagttcttag atacgcatta gatattagag 6900 ttaattgtaa atattcagaa taataattga aaaataccat gaattaaata taagtaaacg 6960 tatatatgca gcaacacaaa agtttttcca agtagtttaa aaaaaatacc agagcaattt 7020 tatcgtctac agatgactag ttattttttg aaccacaaaa aaaataaatt cagtcccttc 7080 attcattttt tttagttggt agggggaag 7109 // ID Copia-104_AA-I repbase; DNA; INV; 4152 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-104_AA_; KW Copia-104_AA-LTR; Ty1_copia_Ele172; Copia-104_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4152 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1660-2007] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2364..4139 FT /product="Copia-104_AA-I_1p" FT /translation="MCPLLSDADSSEDEFADAEPHVALPPQSTRLADEQQL FT SRHSGRERRLPGKYNDYICYSSFAGPDEFPQNESSDVTSDPLSYNEAINRP FT DRERWIAAMDNELKALDSNNTWELTDLPEGRKAIRNKWIFKTKRNADGEIS FT RYKARLVVKGCSQRPGIDYEEVYSPVVRYSTIRYMMALAVQYDLDVEQMDA FT ITAFLQSDLKEETIFMEQPEGYEKDPTKVCKLKKALYGLKQSSRVWNRQLD FT EAMRQFGLTRSRMDPCLYFKVHKNEMIFVTIYVDDLLIITNNAYLKKRLKE FT FLNNRFQMKDLGTAQLCLGLRINRDRKGGTLRLDQQHYIEEVLKRFNLNNC FT RPVATPADPGNKLEASTSEETAEEIQEMQNVPYKEAVGCLTYLAQGTRPDI FT LLAVNKVSQFSSNPTRRHWEAVKRIMRYLKGTIDYCLEYSREGDPQFTGYT FT DADWGGDPASRKSTTGYIFKMMGGAVSWNVKRQASVALSSCEAEFVALSHT FT TQEALWWKQLLQEIGSQRSISIFCDNQSAICIAQNEGFNPKTKHISIRFHF FT VKDTLERGDIKLYYIPTQQQPADGFTKVLPKQKHEVFRSYLGIVN" XX SQ Sequence 4152 BP; 1199 A; 882 C; 1143 G; 908 T; 20 other; agtggttatg ggcccaggac gtggccgtag cttgaagtaa agtttgtccc tgttctcaag 60 attagtttga tcgttgaaga agcgccatga ttccgaggga tgtagaccag cagcagaatt 120 caagcagtag gacgaatacg actagaggcg gtgttgcggc cggatctgga gacggcgtgc 180 gakccggtgg tgtwgtkcgt gcctacggga acccagtgaa ccttccagcc attgaaaaac 240 tgcaagggcg tcaaaactac gcatcatggg cgttcgcgat gaagatgacg ctgatacgag 300 aagggtcstg gcgagctgtk aaaccggcgg aaggcgaggc agttgatccg gaagctagtg 360 agcgagcgct ggcaacaatt tgcctaagkc tkgagaagaa caactacagc ctagtccaaa 420 mcgcgamcag tgctaaggac gcatgggaga aactccggaa ggcatttcaa gacgacgggc 480 tgattcgaag atttggattg ctggaccgct tgacgtcggt gaagctggag aactacggta 540 ccgtggaaga ctatgtcgat gasttggtgt ccacggctca cgatcttagc gagatcgggt 600 tcgaagtgaa tgaccagtgg ctggtttcgt tattgttgaa aggacttcca gaatactatc 660 atccgatggt aatgggattg caagcgtccg gcttggcgtt gacagctgat atggtcaagg 720 ctaaaatcct tcaagacgtc aagtggccta ttacatcttc cggaagagaa ggcgcacttt 780 atacgaagca caaccctcac aagtccagaa gaccaaaggg cgcggttgaa aggaaggata 840 aaacctgttt camttgtaac aagcttggac atttcgcagc ggagtgtcct cagcggcaga 900 agcaaggtca gaattcaaaa scgaagggca aggctttgtg cgcgatgatg gctgttggca 960 agggcagcgc agaagaatgg tacttcgact ctgcagctag ctcccatatg acgagaagcg 1020 aggaaaattt cacgaagcaa gaagtctttg cgcaccccat cgagacggcg aataaccagt 1080 gtatgatgtc ctcagcsaaa ggggtggtga acctkgatct tcaagaagga cccatcgaag 1140 tacaggasgt tctgcacgtc ccagatttgg caacaaacct tctttcagtc agcaaaatct 1200 gtmagaaggg tttgacggtg acgttcactt ccgatcttgt aaagtctgtg atgaagatgg 1260 agatgtgatt gcttctggga ccgagaagaa tggtttgtac cggctcaact gcaagaaacc 1320 ggaaacatcg ctatgacaac ggaaacgaag atctggcaca ggaggctagg acacctgaat 1380 cagcagagca tgaagaaact cgaaacsatg gcagatggaa ttgaactgmg gacggagaag 1440 acattcgatt gtgtggcctg tgctacagga aaacaagctc gagatccttt ccattcaagc 1500 acaacgagag cagaaggatt gctggatttg gtacacaccg atctttgtgg accaatagaa 1560 gagccctcgt tgggttggta gtcgttattt cctaacwttc gttgacgatg ccastcgcaa 1620 ggtgtttgtc tacttcttga tgtcgaagag cagcgttaag gatgtgttta tgaacttcaa 1680 atccatggca gagcggcaga ctggaagaaa cttgaaagtc ttgaggtcgg acaatggctc 1740 agaatatgtc aatcggcgat ggaagcgtgc atgggaagag acggtataat acatcaaaca 1800 acttgttcgt acacgcctca gcagaacgga gttgccgagc gtatgaatcg gacgctcgta 1860 gaaaaagcaa ggtgcatgct gaacgattcg aaactctcaa agaagttctg ggcagaggca 1920 gtctcgaccg cagcatatct cgtcaaccgg agtccctcac gttcattgga atctataacg 1980 ccagaagagg cgtggagtgg cagaaaacca gtgctgaaac atctcaagat ttttggttca 2040 aaggcgatga ttcatgtacc caagcagaag agacagaagt tcgacccgaa ggccgttgag 2100 ggaatctttg tcgggtatgc ggagaagtcg aaaggatacc gtattttcat gcccaacgac 2160 agcagcatcg taacaagtcg tgacgtgaag attatcaatg agggagaatc ttcggcgatt 2220 gatgaagagt acataggtga agtcgaattc atggagcttc attcgtggat cgagcagatt 2280 gaacaaaccg aagtggttcc catgtgcaac atgatgctgt ttctgttgtc gttcccgagg 2340 tcgttccaca tgatgacgat gagatgtgtc ccttgctctc ggatgccgat tccagtgaag 2400 acgagtttgc cgatgctgag ccgcacgttg cgctcccacc gcaatcaaca agactagctg 2460 atgagcagca attgtcgagg cacagcggtc gggagcgccg gttaccaggc aagtacaacg 2520 attatatttg ttatagttct tttgctggcc ctgatgaatt cccacagaac gagtcttccg 2580 atgttacgag tgacccgctg agctacaatg aggcgatcaa cagaccggat cgtgaacgat 2640 ggattgctgc aatggacaac gaactcaagg cgctagacag caacaacacg tgggaactga 2700 ccgacttgcc cgaaggaagg aaggccatac gcaacaagtg gatattcaag acgaaacgaa 2760 atgcagatgg tgaaatctcc cggtacaagg ccaggcttgt ggtgaaaggc tgctcacaac 2820 gtcctgggat agactatgaa gaagtgtatt ctccagtggt gagatattca accatacgat 2880 acatgatggc tcttgcagtg cagtatgacc tggacgtgga gcagatggac gctatcacag 2940 cgtttctgca gtcggaccta aaagaagaaa ccatctttat ggagcagcca gaaggttacg 3000 agaaggatcc aaccaaggtc tgcaaattga agaaagcctt gtatggattg aaacaatcga 3060 gcagagtgtg gaaccggcaa ctggacgaag cgatgcggca gttcggactt actcgttcca 3120 gaatggatcc atgtctgtac ttcaaggtgc acaagaatga gatgatcttc gtgacgattt 3180 atgtggatga cctgctgatc attaccaaca acgcgtactt gaagaaaagg ctgaaggagt 3240 tcttgaacaa tcgcttccaa atgaaggacc ttggcacggc gcagttatgt ttggggctac 3300 gaatcaatcg tgatcggaaa ggaggcacac taaggctcga tcagcaacat tacatcgaag 3360 aagtactgaa gcgtttcaat ttgaacaact gtagacctgt ggcaacccca gcggatcccg 3420 gtaacaagtt ggaagcttca acatctgaag aaacggctga ggagattcaa gaaatgcaga 3480 acgttcccta caaggaggca gtgggatgct tgacctacct ggctcaaggc acgaggccgg 3540 atattttgtt agcagtaaac aaggtcagtc agttcagtag taatccaacc cgtcgccact 3600 gggaagctgt gaaacgaatt atgcggtatc tcaagggaac catcgattat tgcttggagt 3660 acagcagaga aggcgatcca caattcactg gttatacaga cgcggattgg ggaggagatc 3720 cagccagcag gaagtcaacg acaggttaca tcttcaagat gatgggagga gcagtttcct 3780 ggaatgtaaa acgacaggcg tcagttgcgt tatcatcatg cgaggcggaa tttgtagcac 3840 tttcccacac aacccaagag gccctatggt ggaagcagct attacaggag atcggaagcc 3900 agcgttcgat ttctattttc tgcgataatc aatccgcgat atgtattgct caaaacgaag 3960 gattcaaccc caagaccaag catatttcga ttcgattcca ttttgtgaag gacactctgg 4020 agagaggaga tatcaagctg tattacatcc caacacaaca gcaacccgca gatggcttca 4080 ccaaggttct accgaagcag aagcatgaag tatttcgaag ttatttagga atagtaaact 4140 aaggaggagt gt 4152 // ID Gypsy-96_AA-I repbase; DNA; INV; 4381 BP. XX AC supercont1.312; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-96_AA_; KW Gypsy-96_AA-LTR; Gypsy-96_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4381 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.312; Positions 1194816 1190436. XX CC 'CCATC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 154..2094 FT /product="Gypsy-96_AA-I_1p" FT /translation="MATEHTDDGMEGAAGGHNSKSRGQDDSSTSPIVAAAM FT SAAAAKIFNNNIPFPEPLVMTGNVRQNVDDFIEGFEIYLVASGLENREERV FT KIATFKAALGVESRKIFNNWPLQAAEKDTVVACLASLRNYMMPKRNVKLAR FT YEFLQCKQQPADGVNEEESAVQFINRARALVKDCNWGDLEEEMLRDVIVAG FT LRDTRMKKAFIDKVELTAVDVINQCMSEEATKMELEKNKWLEEERHSVNKV FT YSKRNKIKQCAYCGKGYHRNLQECPARGSTCFYCHQRNHFEAMCRKKQEDR FT ELHRKNKTRSSKMVHKISDSESEESCAETELKEEIVDTVEYLYSIDQEHGG FT LLKADLSFGTENKSKKVKCILDTGASCNVIGLPSLLEILDGKSVELDKQQP FT VLKGFGGSSTRALGRATLDLRHNDKQYKSVFTVVDFRQIPILSMYSCLKLN FT LLKLCLSVSAEHEVTAKNIVARYPGVFQGIGKLAGDVHLEVDHSVKPVVQP FT ARRIPVTLREELKIQLDEMEKLQIISKVNEPTEWVSNLVLVKRNNKIRVCI FT DPVVLNTALKRPHYPIPTVNELLPELSKAKIFTTVDAKSGFWQVQLDEESS FT RLTTFWTPYGRYRWLRMPFGISPAPEIFQRKLHETTHGLKGVRVLADDG" FT CDS 2480..3424 FT /product="Gypsy-96_AA-I_2p" FT /translation="MVLRAPVLKYFDVDKEVTIQCDSSSVGLGAVLLQEGQ FT PVCYASKTLTPTERRYAQIEKETLAILYACRKFEMYIIGKDVTVQTDHQPL FT LRIFKKPLIEAPMRLQRILLGLQRYKVKLLFKPAKEVVIADMLSRAALTES FT DLTDRSIYDVYTMDMDFTLADYEAINSAEYIPISDFRLDQLRRASSEDPDI FT QTIIRFIVDGWPLTIGELPERMRIYWKYKNEFYTQNGLVYRNNRILVPVSL FT RSNILERLHTSHSGLEATMKLARDTVFWPGINDQIRQRIQSCYSCLKFAPN FT QQREPMQTHQIPCYPYQKISLDL" XX SQ Sequence 4381 BP; 1379 A; 882 C; 1050 G; 1070 T; 0 other; tggtgtcaga agtggctgag tatacgggaa aatggttgta gcaagtattt gtgattaccg 60 tttgttgaaa caaccatagc gatcgaagga agataattgc tccgtgatag ttttgtgaaa 120 ttcggaatca gtgcgaatta tcgggaaaaa aagatggcga ctgagcatac ggacgatgga 180 atggaaggag ccgccggcgg gcacaacagc aaaagcagag gtcaagacga cagctctaca 240 tcgcctatcg ttgcagccgc catgtcagca gccgccgcaa aaatattcaa caataacatc 300 ccgtttcccg agccgctagt tatgacaggc aatgtgcgtc aaaatgttga tgatttcatt 360 gagggttttg aaatctatct cgtcgcttcg ggtttagaaa accgcgagga gcgagtgaaa 420 atagcaacgt ttaaagcagc tttaggagtt gaatctcgga aaattttcaa taattggccg 480 ttacaagcag cggaaaaaga taccgtggtg gcctgcttgg cttcattgcg gaactacatg 540 atgccgaagc gaaacgttaa gctagcgcgt tacgagtttc tacagtgtaa acaacaaccg 600 gcagatggag tgaatgaaga agaatccgcg gtacagttta taaacagagc cagagcgctt 660 gtgaaggatt gcaactgggg cgatcttgaa gaagaaatgt tgcgagacgt gattgtagcc 720 ggccttcgtg atacccgtat gaagaaagcg ttcatcgata aagtggaact tacagcggtg 780 gatgtcatca atcaatgtat gtcagaagaa gccacgaaga tggaattgga aaagaacaag 840 tggctagaag aagaacgaca ttctgtgaat aaagtgtact caaaacgaaa caaaataaag 900 cagtgtgcct actgtggtaa aggataccat cgcaatctgc aagagtgtcc agcaagggga 960 agcacttgct tctattgtca tcagcgcaac cacttcgaag ctatgtgcag gaagaagcaa 1020 gaagataggg aattgcacag gaagaacaaa acacgttcca gtaaaatggt gcataagatc 1080 agcgactcgg aatctgaaga aagttgtgcg gaaacagagc ttaaggagga aattgtggat 1140 actgtcgagt acctgtatag tatcgatcaa gaacatggcg gtttgttgaa ggcagacttg 1200 agcttcggaa cagagaacaa atccaaaaaa gtgaaatgca tactggatac cggtgcatcg 1260 tgcaacgtga tcggcctccc ttcgttgtta gagattttgg atggaaaatc ggtagaactt 1320 gataaacagc aaccagtctt gaagggattt ggtggctcat caactagagc gttgggcaga 1380 gctacattgg atttgcgaca caacgataag cagtataaat ccgttttcac cgttgttgat 1440 tttcgacaga taccgatatt gtcgatgtac tcgtgtctaa agttaaatct attgaaactg 1500 tgcttgtcgg tgtcggctga acatgaagtt actgcgaaga atatcgttgc cagatatcca 1560 ggcgtgttcc aaggaattgg taaattagct ggagatgttc acttggaagt tgatcattca 1620 gtaaagccgg tggtacaacc agcacgcagg atcccagtta ccttgcgcga ggaactgaaa 1680 attcagttag atgaaatgga gaagctgcaa ataatttcga aagtgaacga accaacggaa 1740 tgggtgagta atcttgtcct tgtaaaacga aataacaaaa ttagagtctg catcgatcct 1800 gttgttttaa atactgctct taaacgtcct cattatccaa ttccaactgt caatgaactt 1860 ctacctgaac tatcaaaagc taaaattttc actacggttg acgcaaaatc tggattttgg 1920 caggtgcagc tggatgagga aagttcgcgg cttacgactt tctggactcc ctacggtcgg 1980 tatcgttggt tgaggatgcc gtttgggatt tcacccgctc cagaaatctt tcaacggaag 2040 ctacatgaaa caacacacgg attgaaagga gttcgagtcc tagcagatga cggataattt 2100 tcggatgtgg gagcagttca gaagaggcca tgatagatca caataataac ctcgatgcgt 2160 ttctcaagag aatgagtgaa aaaaacgtca aactcaacaa ggacaagata cgtcttttcc 2220 aaccgcaagt gaaatttttt ggacacattc tcacctcggc gggagtaaaa ccagaccctg 2280 aaaaagttcg ttccatcatc gatatgcaac caccgcagga tgctgcaggg cttctccgat 2340 ttttgggaat gatcacctac ctctccaatt atctgcctaa attgtcgacg atagctgaac 2400 ctctaagaag actaactggg agcaagaagc atggagatgg gaagaagaac acgatgccgc 2460 gttcgttaaa ttgaagacga tggttttacg agctcctgtg ctgaaatatt tcgatgtcga 2520 taaggaggtc actattcagt gtgacagtag cagtgtagga ttgggagcag tactcctgca 2580 agaaggtcag cctgtgtgtt acgcttcgaa aacgttgaca ccaacggaaa gaaggtatgc 2640 acaaattgaa aaggaaacat tagccatact gtatgcctgc cggaaattcg agatgtatat 2700 tatcgggaag gacgttacgg ttcagactga tcatcaacct ttgttgcgga tcttcaagaa 2760 gccattgatt gaagcaccga tgagacttca acgaatacta ttaggattgc aaaggtacaa 2820 ggttaagcta ctatttaagc ctgcaaagga agtcgttatc gccgacatgt tatcaagggc 2880 agcacttact gaaagtgacc ttactgacag aagtatttat gacgtctaca caatggacat 2940 ggacttcacg ctggcggact atgaagcgat caactcagcg gagtatatac ctatttccga 3000 ctttcgtctg gatcaacttc gccgagcatc gtctgaagat ccggatatcc aaacaatcat 3060 tcgattcatt gtggacgggt ggcctttaac gattggagag ttgccagaga gaatgaggat 3120 ttactggaag tacaaaaacg aattctacac ccagaacgga ttggtctaca ggaataatcg 3180 tatattggta ccggttagtc ttcgatcaaa cattttggaa cgtttgcaca cttctcactc 3240 tggtctcgaa gcaacaatga aactcgctcg tgacactgtt ttttggccag gaatcaatga 3300 ccaaatacgc caacgcattc agagttgtta ttcttgtctg aaatttgccc caaatcaaca 3360 aagagaacct atgcaaacac atcaaattcc atgttacccg taccaaaaga tctctctcga 3420 tttgtgagag ctccaactga aaggcagcaa acacgtgtat ttgataacgg ttgatcattt 3480 ttctgacttc atcgaagtcg atgagctcca aagaaatgca accgcttcga atgtggtcaa 3540 taagtgccgc caaaactttg cccggtacgg cactccgatg tatgtaacaa ctgatggtgg 3600 tccgcagttc aacagcgctg aattcaggaa gttggctacg gagtgggaat tccttcatac 3660 gatgtctgct cctcaccacc aacaagccaa cggcaaagca gaagcagcgg tgaaaataat 3720 taaacagttg ctaaagaaaa caggagaaac taattcggat ttttggaaag cattacaaca 3780 gtggcgtaac gttccgaata actgtggaag ttctccggca caaaggatgt tctgtcgtcg 3840 tataagattc aacgttccta tggctgaagc aaaatacaca tcacatattc aagcgggagt 3900 aaaggagcgt attaagaaga atcgtcaaac tgcaaagtac tactatgatc gcacagcgaa 3960 gagtcttcca ccgttggaaa taggacagcc agtatttgtt aagaaaaaac ctactgacaa 4020 tacgtggtta cctggagaag taaaggctac tgcaactgat cgatccacaa tcgttgacgt 4080 tcgtggtcag caattacggc gtgataacgt gatgatcaaa cgtgtaccgc aagcagctta 4140 tcatcattca ctactatctg gaaacgaggt cacagcaagg tctgataata gccggaaacc 4200 tgtaccacag ctggatgcaa atcccacgat tgattccagt aatcaaggaa caccatctag 4260 accatcctca gcaccggaag tgacatcgga gcgacccaaa agacgtattc ttgttcccta 4320 acgttttaaa gattttgaaa tgtattgata agctcaattc ttcaattgat gaaaagggag 4380 a 4381 // ID Gypsy-53_CQ-LTR repbase; DNA; INV; 161 BP. XX AC AAWU01017349; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-53_CQ_; KW Gypsy-53_CQ-I; Gypsy-53_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-161 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 486-486 (2011). XX DR GenBank; AAWU01017349; Positions 40842 40682. XX SQ Sequence 161 BP; 49 A; 46 C; 31 G; 35 T; 0 other; tgtggtggtg caatcaatcg cattggtgca ccccccgtta aatagcacac gcatacactc 60 acacccgcga agagagaaaa accacatgaa taaaacacac tgttttctaa cctcgaacga 120 gacggttcgt ttttattcac gagtcccgaa tcggtcccac a 161 // ID Tx1-5_AAe repbase; DNA; INV; 4664 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Aedes aegypti. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4664 RA Kojima K.K. and Jurka J.; RT "Tx1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1461-1461 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >96% CC identity. It is positioned at the deepest branch of the Tx1 CC clade, and does not show sequence specificity. XX FH Key Location/Qualifiers FT CDS 138..1199 FT /product="Tx1-5_AAe_1p" FT /translation="MEDGLVKNTLAFRFPPGAPAPTLVDIARFVKNLDADL FT DSMETSYKLAEERCVCVKFKSLDDMKEAQSQIPEIVFFRYSNGEKVEVKTT FT VAGCCTKYVRIFDLPPEVPDSEIAAVLGRYGVVKRMIREKFPADLGLNMFT FT GVRGVYLDIKKVVPASLYFLNRKGRIFYEGLKQKCFLCREEGHLKADCPQR FT ENSKGVKSVPANHVLGESSVSSPALPGESDCEKVSVASTPSYSGVLTGAKP FT KVAEEKLVPKMVTLVSERNKSPARETSVERCASVENIEMESETDADDVVKS FT GVKRQHSTGGSAGETDEDGATAFTKAHGNRKSSRTAKKSMTPLETIASVPL FT SQRKGNRSSSK" FT CDS 1279..4539 FT /product="Tx1-5_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MMNFVRKVATININAMVVQAKKMLLKDFLVQNDVDVA FT LLQEVQFEDFSFIQSHEAFVNIGDTNKGTAILVRSTLQYSDLLMDPGGRVI FT SVVVDGLNFVNVYGHSGSQYRAERDELFTDTIAVHFNKSRVISNVLAGDFN FT CVLEEKDCKGKMYNFSSGLKSAVELLDLRDVAIAIKGAKREYTFYRGDSAS FT RLDRIYVSKSFIGSVSRCVAKPIAFSDHHALVVDYRIEADQRAPISGGSYW FT KINDFLLNETSIQEEFRKEYEEIKKRKKYAESFSEWWSYDVKRKVKQFYKR FT KAFEFNDANRRKKDIWSKKLSDLCEKQSKGENVQNEISVAKSKILEVEVDR FT MKGYASRFQPSSLLENEKIGLYHVAKMIKRKENPANWTLEGENGPVTNIRE FT LKGMVSEHYRTLFSENTPSESRCAALDHINKVFSAVAQEDFVRPFNEDELR FT ATINYAKQKKSPGPDGITYEFYLTHFDLLKDDLLKLFNGFLDNSLVPIENF FT ADGVITLIPKNGRAKELSNFRPISLLNTDYKLLTKLIANRISSKIGEIIDE FT GQSAVVPGKSCIDNLDITRTLVLKAQQSKSMKFALLSVDLEKAFDVVDHKR FT LWEVLERFGLPRTIITMIQRLYADATSRVLVKGSLTNSFQIKRSVRQGCPL FT SMTLFVLYIEALIRQIYAGVRGILIDNRFFKIRAFADDITIIIRNDEEFDL FT VFKIIDDYSLSFGIKLNFKKSNFIRFNNCRIGPQKVSEKNEIKILGLLVST FT CWKEIVNSNFKKLINALKHTVHMFSSRRMNLIEKVTVLNTFVLAKLWYVSQ FT VVPPNNMHLAEIKKITGEFLWSHHKMFRVQRDQLYLDNSRGGLALIDPESQ FT CQSLFIRNLLFSNGRKIDHPLLKINNCKNSTINTKKLFEVAKTISSMDHIV FT TNKQIYNYLIAKKTIEVKITNKFPDMPWEIIWENISAKFVSSQARSVLFEV FT FNDVYPNRIKMFNHNFANVSNCLCEICDKHDSNIHRIKECKNSKIVYDWVT FT DIIRTKLKIRMKSAEEIIYKSINVKDSKAKSALWLVSEMIMYNMKYHKNSC FT LYSFKKLIRDARWNNKKLFDKQFGKFLNFC" XX SQ Sequence 4664 BP; 1553 A; 735 C; 1061 G; 1312 T; 3 other; cagttgacta tcagacttcg tggtgcgtgg atgtgtttat tctatctctc gaacgcattg 60 taccctattt caatttaacg ttgcgcagtc acgtggctgc tgtattcttt tttcgtttcg 120 tcggtcgwat attaacaatg gaagacgggt tagtgaaaaa caccctagcg ttccggttcc 180 cgccgggagc accagcaccc acgctggtgg atatagcgcg atttgttaag aatcttgacg 240 cggatctcga ttcgatggag acaagttaca agttagcgga agaacgatgc gtctgtgtga 300 agttcaagtc actcgatgac atgaaagaag cccaatccca aattccagaa atcgttttct 360 ttcggtactc gaatggtgaa aaagtggaag tgaaaacgac agtggccgga tgctgtacga 420 aatacgtacg tatcttcgat ttgccgccgg aggtgccgga ttcggaaatc gctgctgtgt 480 tgggcagata cggagttgtt aagagaatga taagagaaaa gtttccggcg gatttggggc 540 taaatatgtt tacgggagtg cgtggagtgt atcttgacat taagaaggtg gttccagcat 600 cgttgtactt cctgaacagg aagggccgta ttttttatga agggttaaag cagaaatgct 660 tcctctgcag ggaggagggc catttgaaag ccgattgccc tcaaagagag aacagcaaag 720 gagtaaaatc ggtaccagca aaccacgtgc ttggggaatc gagtgtatct tccccagcac 780 ttccaggtga aagtgattgt gaaaaagtgt cagtagcatc tactccaagc tattcaggag 840 ttttaactgg tgccaagcct aaagtggcgg aagaaaagtt ggtgccaaaa atggtcacgc 900 tggtgtctga gcgaaataaa agtccggcac gagaaaccag tgtcgaacgg tgtgcttctg 960 ttgagaacat tgagatggaa tcggaaacgg atgcggacga tgtcgttaaa tctggtgtga 1020 agaggcaaca ttccaccgga ggatccgccg gtgaaactga cgaggacggt gcaacagcat 1080 tcaccaaggc tcatggaaat cggaaatcat cacgtactgc gaagaagtcg atgactccac 1140 tcgaaacgat tgcgagtgtg ccgttgtcac agcgtaaagg caatcgtagt tccagcaaat 1200 aggtgtaaag gtgattgagg ttatgtttca ttgcaacacg tgtctctgat cgtgggtagt 1260 ttgagaaatt ggtcgtgtat gatgaatttt gtgcggaaag ttgccacaat caacataaat 1320 gcgatggtgg tacaagcgaa aaaaatgctg ttgaaagatt ttctagttca aaatgatgtt 1380 gatgtagcgc tcctacagga ggttcagttc gaggatttct cctttattca gagtcatgaa 1440 gcatttgtaa acatcggtga tacaaacaaa ggtacggcga ttttagttag aagtaccttg 1500 caatacagtg accttctaat ggaccccggc ggaagagtta tatcggtggt agtagacggt 1560 ttgaactttg tcaacgtgta cgggcactca ggttctcaat acagagccga acgagacgag 1620 ctgtttaccg ataccattgc ggtgcacttc aataagtcta gagttatcag caatgtatta 1680 gcgggtgatt ttaattgtgt gttagaagag aaagactgta aaggaaaaat gtacaatttt 1740 tcgagcggtc taaagtcggc agtggaactt cttgatctcc gtgatgtggc gatcgcgatc 1800 aaaggagcaa aaagagagta tacgttctac cgtggagatt cagcttcgag actggatcgg 1860 atatatgtgt caaaaagttt tattggaagt gttagtcgat gcgtagcgaa gcctattgcc 1920 ttctccgatc atcatgcgct agtagtggat tatcggattg aagcagatca gcgagcgcct 1980 atttcgggtg gcagttactg gaaaataaat gactttttac tgaatgagac ctcgattcaa 2040 gaagagtttc gaaaagaata tgaagaaatt aagaaaagga agaaatatgc agagagtttc 2100 agtgaatggt ggtcatatga tgttaaacgt aaggttaagc agttttataa acggaaagcc 2160 ttcgaattca atgatgcaaa tagaagaaag aaagatattt ggagtaaaaa gctgagtgat 2220 ttatgtgaaa aacagtcaaa aggggaaaac gtacaaaacg agatcagtgt ggctaaatca 2280 aaaattctgg aagtggaagt agacaggatg aaaggctatg ctagccgttt ccaaccgagc 2340 tccttactgg aaaatgaaaa gataggacta taccatgttg ctaaaatgat caaaaggaag 2400 gaaaatccag ccaactggac tctcgaggga gagaacggtc cggttactaa tatcagggaa 2460 ttaaaaggta tggtaagtga acactatcgc acacttttta gcgaaaatac gcctagtgaa 2520 tctcgatgtg cggcattaga ccacatcaat aaagtgttta gtgcagtagc gcaagaggat 2580 tttgtacgtc cgttcaatga ggacgaactt cgagcgacca ttaactatgc aaaacagaag 2640 aaatccccgg gacctgacgg aataacttac gaattctacc taacacattt cgacttgtta 2700 aaagatgatc ttctgaagtt gttcaacggt tttttggaca attcacttgt tccaatcgag 2760 aactttgctg atggtgtaat tacgttaatc ccgaagaatg gaagagctaa agagttatcc 2820 aactttagac caataagctt attaaacact gattataaat tgcttactaa gttaattgca 2880 aatcgtattt cgagtaaaat cggtgaaata attgatgaag gtcagtcagc tgttgtgcct 2940 ggtaaatcat gtatagataa tctggatatt acgagaacac ttgtccttaa ggctcagcag 3000 tcaaaaagta tgaaatttgc gttattgtct gtagatttag aaaaagcatt tgatgttgtc 3060 gaccataaac gtttatggga ggtacttgaa agatttggtt taccacgtac gattataacg 3120 atgatacaaa gattatatgc tgatgcaaca tctcgagttt tagtgaaagg cagtctaaca 3180 aatagttttc agattaaaag atctgttaga caagggtgtc cgctgtccat gacgttgttt 3240 gtactgtaca tcgaggcttt aataagacaa atttacgctg gagtaagagg aatactaata 3300 gataatagat tttttaaaat aagagcattt gcggatgaca ttacaattat tattagaaat 3360 gatgaagagt ttgatctagt ttttaaaatt atcgatgatt attcgttaag ctttggaatc 3420 aagttgaatt ttaaaaagtc aaatttcatc agattcaata actgtagaat aggccctcaa 3480 aaagtttccg aaaaaaatga gataaaaatt ttaggactac ttgtgtcgac atgttggaaa 3540 gaaatagtga atagtaattt caaaaaatta attaacgctt taaagcatac cgtacatatg 3600 ttctcatcac gaagaatgaa tcttatagaa aaagtgactg tgctgaacac atttgtatta 3660 gcaaagcttt ggtatgtttc tcaggtagtc cctccgaata acatgcactt ggcagaaata 3720 aaaaagataa ctggtgaatt cttgtggagt catcataaaa tgttcagagt tcaaagagat 3780 caactttact tagataatag tcgaggtggt ttggctctaa tcgatccwga atcacagtgt 3840 caatcgcttt ttatcaggaa tttgcttttt agtaatggtc gtaagatcga tcatcctttg 3900 ttaaaaatca ataattgtaa aaactccaca atcaatacaa aaaagttatt tgaggtagct 3960 aaaacgataa gcagtatgga tcatattgtt actaacaagc agatatacaa ttacttaatc 4020 gctaaaaaga ctatcgaagt taaaataacw aacaaatttc cagatatgcc atgggaaatt 4080 atttgggaaa acatttctgc taaattcgtg tcttcacaag ctagatctgt attgtttgaa 4140 gtttttaatg atgtgtaccc gaataggatt aaaatgttca accataattt tgctaatgta 4200 agtaactgtt tatgtgaaat ttgtgataaa catgattcaa atatccacag gataaaggag 4260 tgtaaaaatt cgaaaattgt ttatgattgg gtaacagata taattagaac aaaattaaaa 4320 atcagaatga aatcagctga agaaataatt tacaaaagca tcaacgttaa agattccaaa 4380 gcaaaaagtg cattatggtt agtttcagag atgataatgt ataacatgaa ataccacaag 4440 aatagttgtt tatatagctt caaaaagctt attagagatg ctcgatggaa caacaaaaag 4500 ttatttgaca aacaatttgg taaatttctt aacttttgtt aatgaagtgc tataaaggta 4560 attgtagtta aacttaacaa tatcaacatg attgtatgta aaatacaata cggtgtatgc 4620 atccttaggt taagtttgta aacatgttgt aaataaaaaa aaaa 4664 // ID Gypsy-200_AA-I repbase; DNA; INV; 2624 BP. XX AC supercont1.67; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-200_AA_; KW Gypsy-200_AA-LTR; Gypsy-200_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2624 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.67; Positions 210749 208126. XX CC 'CTCTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 787..2376 FT /product="Gypsy-200_AA-I_1p" FT /translation="MPSFGTLDHYVKCTSFTNYMERLEHLSSFNNWTEENK FT KSVLIALTGPVVYDELKLIFPTVELATLNYADMVKKLKERFDKVDPDMMQR FT FYFYNRFQKADESAESFILSVKLQAELCNFQQFKETAIRDRLIMGLHDREL FT QKSLLMSETLTLDAIEKKLLSSEVANRQTKAMCSDTRRDEVHSVKSRLGQK FT VGESSRDYGRDRLDRSYRGRNVNRYDRRVSFRGRSPSEARNHGAYANAICN FT YCKKRGHLRRDCYSLKNRNSVHHVEKGGKDTSYDFKRSGGHDNSDEETDGM FT ECLMISAINKLNEPCMIEAKVQNCWMKMEIDSGSAVSVISEADCRKFFKAI FT PMRNTNRQLIVVDGAWLRILGEISVQVELNGLISQLKLIVLKCNNSFVPLI FT GRSWLDCFFPGWRSGFTNTTLVNNLNEQKDFSEPIVGYEGELILKNEQPIF FT KKAYTVPYKLKDKLAEHLEMLEQQKDDDHGSQESIEADSYSGSRIQVATSG FT NESEARARGVRRVPRLCRIRCRGRTIPRVSGRIE" XX SQ Sequence 2624 BP; 803 A; 443 C; 665 G; 713 T; 0 other; aagagtgaat ctacaacccc aagtgaaaca taaaagctgg cgacgaggaa aaacagtgat 60 attgtaaaag tgatattgtg cctcatacgg aataactcct gtagcaagga aattccagag 120 tgagccgtgt aagtagaaag cagagttttt gtagagggtg gtaagctcta gaaaacttgt 180 gcgaagacag tgtgcttgtg tcagttttga agaggtcaaa acgccttgtt caagtcagag 240 ccattgttga ggaacaaaac atttagcgct tcgttcggcg tgtaaattca tgtgtctcaa 300 gcatttgtaa tagcgtattt atattgtgaa accaaaaatt tcattttgtg caacttttca 360 ccattttatt tctccattat tcacagtact ccaccccgct cccaccgatt tggtcatttt 420 tgttctaccg cggaaccatt gtttgcctca gcatcagcaa aaagaaagtc ggcgtgccac 480 ccattctacc gacccactca ctaacgacga gcaagagtgg tgagcgtgtg agcgatcgtg 540 tggtagagca ctttgcgaga gagagggagt actgtcgctt tgggagaact tgtcactgtg 600 ctgactggtg agagttctct ggttgtgagc ggttgtgagc ggttgtgagt agtgagaaga 660 ctcaactgac tgtgtctcat acagacgtaa acaacagtgt gatcaacagt tgcaccagtg 720 gtgagaaaaa gtgcatcaag taagtggatt tttttttcta tttctgctga ttgttgctat 780 tgcaccatgc cttcgtttgg aacgttagac cattatgtga agtgtacgtc cttcacaaat 840 tatatggaac gcttggagca tcttagttct ttcaataatt ggacggaaga aaacaaaaaa 900 tcggtgctga ttgctctaac cggcccggtt gtgtatgatg agctgaaact tatatttccc 960 actgttgaat tagcaacatt gaattacgct gatatggtga aaaagttgaa ggaaaggttc 1020 gacaaggtag atcccgacat gatgcaacgg ttctattttt ataatcgttt tcaaaaagca 1080 gatgaatcag ctgaaagttt cattttgtcg gtgaagcttc aggcagaatt gtgcaatttt 1140 cagcagttta aggaaactgc aatcagggat cgtttgataa tgggacttca tgatcgggag 1200 ctgcaaaagt ctttgctaat gtcggagaca ctgacactgg acgcaataga aaagaaattg 1260 cttagcagtg aggtagccaa cagacaaact aaagcgatgt gtagcgatac aagaagagat 1320 gaagtgcatt cagtgaagag tagattaggt caaaaggttg gtgaaagtag tcgagactat 1380 ggacgtgaca gattagacag gtcatacaga ggcaggaacg tgaatcggta cgataggcgt 1440 gttagctttc gtggtagatc accgtctgaa gctagaaatc atggtgcata tgctaatgcg 1500 atatgtaatt actgcaaaaa acgaggtcat ttgagaagag actgttatag cttgaaaaat 1560 cgaaactcag tgcatcatgt tgagaagggt ggaaaagaca cgagctacga tttcaagcgg 1620 tcaggtggac atgataattc agacgaagaa acagatggaa tggagtgttt aatgatttct 1680 gcgattaaca aactgaatga accatgcatg attgaagcaa aggtgcaaaa ctgttggatg 1740 aaaatggaaa ttgattcagg gtcagctgtt tctgtgatca gtgaggccga ttgtaggaaa 1800 tttttcaagg caattccgat gcgtaacacc aatagacagc ttattgttgt tgatggtgcg 1860 tggctgagga ttttaggcga aatttctgtg caagtagagt tgaatggctt gatttcacaa 1920 ctaaagttaa tagtgcttaa atgcaataac agttttgttc ctttgattgg aagatcatgg 1980 cttgactgtt tcttccctgg atggcgatct ggatttacga atactacttt ggtgaataat 2040 ctgaatgagc agaaagattt ttcagaacct atcgtaggct atgaggggga actgatcctg 2100 aaaaacgagc agcccatttt caagaaggcg tatacggtac cctacaagct aaaggataaa 2160 ttggcggaac atttggaaat gctggagcag cagaaggacg acgatcatgg ctcacaagaa 2220 tcaattgaag ctgactcata ctccggttcg aggatccaag ttgctacttc cggaaatgaa 2280 tcggaagcgc gggcgcgcgg agtcagacga gtaccaaggt tgtgccggat ccgatgtcga 2340 ggaagaacca ttcctagggt ttccggacgt attgagtgaa ccgattccta agagagcaag 2400 acgtcagaaa acgagcggta gcctgatgat gaccaggagt cgcgctcgag aaatgaaaga 2460 taaagtttga ccggtctggt tttgaggaaa aatttgcgca tagtgcattc gattttaatt 2520 aatgaatttt attgaaatac gatctgaatt gtactattta aacctaagtt caaattatat 2580 tttaagaaga attagatata gcaaggttct tgagaaggaa ggac 2624 // ID Gypsy-2_BM-LTR repbase; DNA; INV; 375 BP. XX AC nscaf2953; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_BM_; KW Gypsy-2_BM-I; Gypsy-2_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-375 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 980-980 (2010). XX DR Genome; nscaf2953; Positions 3434109 3433735. XX SQ Sequence 375 BP; 106 A; 81 C; 83 G; 105 T; 0 other; tgacggcgtt aacaatattt ctaataaaca tcacgtgcaa caggcgtgac tcacacaagc 60 caggtgcgcc ggccgtgtcg ttccactcgt aaacaacagc tgcaacagaa gtgactcata 120 agccagctgc gccggctagt ataaaaaggc ggaacgtgcc ttgcaatatt cattcgttcg 180 actgcgtttg cccggtgcac ttactatatc gttgttacgt tgtgtgattc agtgactgtt 240 gtacgtactg gttaaagttg gacaagtgtc aaagtgaatt gtcaagttaa catattgttg 300 attaaattag aaatcaaagg tgacttcgtt ttacttggtg ttcaacgaaa taatgtctac 360 cccgcaacct ataca 375 // ID Gypsy-8-I_HM repbase; DNA; INV; 4360 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-8-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4360 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1982-1982 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 87..4307 FT /product="Gypsy-8-I_HM_1p" FT /translation="MAIKNLNLPNFPPFSCTEEASKLFSSWKRYIKRFELL FT CDSLVITDDKQKLSLLLTLIGDNTYEIYENIIPQDTLITYQQAVEAFNEHF FT KPQVNLSYEIFIFRKMAQRIDETTQQFFVRLNEQAIKCEFADKDKEIKQQI FT ELCTNISKLRKYSFQNPNKTLQDLLTTAKTFELMDHQIEELEKETIKQKEV FT NTVKKTKFPQTSQAPHETETKSLKRTCYRCGNEFPHTNICLALGKSCHSCG FT KIGHFATVCKTRKQNEQNTRIHQEKYKRNEQGYTKPLNTIYKDINESENPQ FT SKLNEGELYSVNAKQISNDGLENFEVTINIENIPITVLIDTGASINILNFK FT TFEKINNFLKRPLRLQKSKTQIITYGDNNPKLKIEGQINVVIESQTKFRQS FT TFHVVKTHHKNLLSGNDAKLLGLIQLNTDVNKKLLNKGQREQNNHDDSKQQ FT VCHSLQNLSKLNNTLLLNAPKRLEPLINSYKNSVFSGKIGKLSNYQVKLKI FT NESILPVAQRERRIPFALREKVKVELAKLEAEGIIETVTDEATPWISPMVI FT VPKNDGNLRICVDMRAANKAIGRTRFPTPTLDDVIIKLKDSAVFSKLDLMK FT AFHQLELAPESRSITTFQSETQIKRFKRLNFGVNSAQEELQNALREVLKDI FT EGTLNIADDVLIFASNTKEHDLILHKVLERFKQRGLTLNFEKCLFGKEKVK FT FYGFIFSKEGMQPDPEKLYNIKNMPTPENTKALQSFLGLMNYFKRFILHYS FT TITYPLRNLLHKDSLWNWNLECQHAFDKLKNAMTLDSCVGYFDPNKETTVF FT TDASLVGISAVIVQNTPNKQDFKLISYNSKALSPVQQRYSTLERECLAIVY FT ACEHNKTYLFGHPFKMYSDHEAIVKILNNPNATVPLRIERMTLRLQGYSFD FT LHYVKGKNNISDYSSRHPVDPCEDEGLEKYVNFTAEYACPKALSLLDIQRE FT TKADPILQILTELITTNTWHKLLHPNCPPELKKFQKDLLAYRNIRQELTVN FT QTSDLILKANRIVLPHSLEITVIQLAHNSHMGLEKTKSLLREKVYFPGLDT FT KVEEYIKRCAICQALGKQNAPAELLITPTPEEVWDTVNIDYLGPLPNGFYL FT VVLIDQTSKFPVVDIIHNTSADLLIDFLQKTIAVYGIPKTIVSDNGPPFTS FT FKIKMFLNKLNIQHKRITPLWPQANSQAESFMKPLIKTIRAAHIERKDWKK FT QLYNFLFSYRTTPHCTTRIPPSTLMFNRVTGFTIPSFQTRVDTNINDQAKK FT RQENAKLYRKKYHDLKYHAKNPSLNIGDTVLIKQKKYNKITAPFEFKPYTI FT TAKNGTMITAKSPVDNSQKTRNASHMKKVPTDIQFNPIPIKEEKDEEYLQN FT ESILPEKEITATMPTQLRSYHQEPILKTYPNRERRPIELWRKY*" XX SQ Sequence 4360 BP; 1735 A; 860 C; 664 G; 1100 T; 1 other; ttggcgacga agataaaaga aaagaaattt aaaataaaat aaaawaaaaa aattaacgaa 60 gattttagta aaaagcgtat ttatccatgg ctataaaaaa ccttaattta ccaaactttc 120 caccgttttc atgtactgaa gaagcaagta agttattttc aagctggaaa cgatacatta 180 aacgatttga gctactttgc gattcacttg taattacaga cgataaacaa aaactttcgt 240 tacttctaac tttaattgga gataacactt acgaaattta tgaaaacatt atacctcaag 300 atacgctaat cacataccaa caagctgtcg aagcttttaa cgaacatttc aagccccaag 360 taaacctgag ctatgaaatc tttatttttc gcaagatggc tcaacgtata gacgaaacca 420 ctcagcagtt tttcgttcga ttaaacgaac aagctatcaa atgcgagttt gctgataagg 480 ataaagaaat taaacaacaa attgagcttt gcacaaatat ttcaaaacta cggaaataca 540 gtttccaaaa tcctaacaaa actctacagg acttgttaac tactgccaaa acatttgagc 600 tgatggacca tcaaatagaa gaacttgaga aagaaacgat aaaacaaaaa gaagtaaata 660 cagtaaagaa aacaaaattt ccacaaacca gtcaagcgcc acatgagaca gaaactaagt 720 cgctaaaaag aacttgctac cgatgtggaa acgaatttcc acacacgaat atatgtttag 780 ctttggggaa atcttgccac tcctgcggga aaatcggaca ttttgcaaca gtctgtaaaa 840 ctagaaaaca aaacgaacag aatacaagaa tacatcaaga gaaatacaaa agaaatgaac 900 aaggttatac gaaaccatta aatacaattt acaaagatat caatgaatcc gaaaatcctc 960 agtcaaagtt aaacgaagga gaattgtact ctgtcaatgc taaacagata tctaacgacg 1020 gacttgagaa cttcgaggta acgattaaca tagaaaatat accaataacc gttctaatag 1080 acaccggagc ttcgataaac attctaaact ttaaaacatt tgaaaagata aacaacttcc 1140 ttaaaagacc acttcgacta caaaaatcta aaacacagat tattacttac ggggataaca 1200 atccaaagct caaaatagaa ggacagatca atgtcgtgat agaaagtcaa accaaatttc 1260 gtcagtccac attccacgtg gttaaaacac accacaaaaa tctgttatct ggaaatgacg 1320 ctaagttact cggtttaata caactaaata ctgacgtcaa taagaaactt ttaaacaaag 1380 gccaaagaga acaaaataac catgacgaca gtaaacaaca agtttgccat tctttgcaga 1440 acttaagcaa attaaacaat actctccttc taaacgctcc aaaacgatta gaaccactta 1500 taaactcata caaaaattct gtttttagcg gtaaaatagg aaaactctcc aactaccaag 1560 taaaattgaa aattaatgaa agtatactac cagtcgcaca aagagaaaga agaattcctt 1620 ttgcacttag agaaaaggtt aaagttgaac ttgcaaaact tgaagcagaa ggaataattg 1680 aaacggtaac agacgaagcg acaccctgga taagtccaat ggtaattgtg ccaaaaaatg 1740 acggcaacct aagaatttgc gtagacatgc gagcagctaa caaagctatt ggaagaacgc 1800 gttttccaac tccaaccctg gacgatgtta ttataaaact aaaagactcc gccgttttct 1860 cgaaactaga tttaatgaag gcatttcacc agctcgaact tgcaccggaa tccagatcaa 1920 taactacttt tcaatccgaa actcaaatta aacgttttaa aagacttaat tttggagtaa 1980 actccgcgca ggaggaacta caaaacgctt tacgagaagt tttaaaagat atagaaggta 2040 ctttgaacat tgccgacgat gttctaattt tcgcttcgaa tacaaaagag cacgatttaa 2100 tattacataa agtattagaa agatttaaac agagaggttt aactctaaat tttgaaaaat 2160 gcttatttgg aaaagaaaaa gtcaaatttt atggatttat attttcaaaa gaaggaatgc 2220 aaccagatcc ggaaaaactt tataatatca agaacatgcc aacaccagaa aacactaaag 2280 ctctccaaag ttttctcggg cttatgaact attttaaacg tttcattcta cactacagta 2340 caattactta tccattgaga aaccttctgc ataaagattc cttatggaac tggaaccttg 2400 agtgtcagca tgcatttgac aaactaaaaa acgctatgac attggactca tgtgtcggat 2460 acttcgatcc caacaaagag actactgtct tcaccgatgc aagtcttgta ggcatatcgg 2520 cagtcattgt tcaaaacact ccaaataaac aagattttaa actaatatct tataattcga 2580 aagcgctatc gcctgtacaa caacgctact ctacgctcga aagagaatgt ttagctatag 2640 tatacgcctg cgagcataac aaaacatacc tatttggaca cccttttaaa atgtacagcg 2700 accatgaagc tattgtaaag attttaaaca acccaaacgc tacagtacca cttcgcattg 2760 agcgtatgac tttacgatta cagggatatt cttttgatct acactacgta aaaggaaaaa 2820 ataatatatc tgattattct agtcgacacc ctgtagaccc ttgtgaggac gaagggctag 2880 aaaaatatgt caattttaca gctgaatacg catgtcctaa agccctttca ctactggata 2940 ttcaaagaga aactaaagcc gatcctattc tacaaatact tacagagtta attactacta 3000 atacctggca taaactatta catccgaatt gtccgccaga actaaaaaaa tttcaaaaag 3060 acttacttgc ttaccgtaat atacgccaag aacttacagt aaatcagact tcagacctta 3120 ttctaaaagc aaatcgaatt gtacttccac attctctgga aattacggtt atacagcttg 3180 ctcataatag tcacatggga cttgagaaaa ccaaatcatt acttagagaa aaagtttact 3240 ttcctggtct agacactaaa gtagaagaat atataaaacg atgtgcaatt tgtcaagccc 3300 ttggtaaaca aaatgcacca gccgaactat taatcacacc aacaccagag gaagtttggg 3360 acaccgttaa tatagattac cttggtcctt tacctaatgg attttatttg gtagtcctta 3420 tagaccaaac atcaaaattt ccggttgtcg atattataca taatacatct gctgacctcc 3480 ttattgattt tctacagaaa acgattgcag tatatggaat accgaaaacg attgtcagcg 3540 ataatggacc accttttact tcattcaaaa taaagatgtt tttgaacaaa ctgaatattc 3600 aacataaacg aatcaccccg ctttggccgc aagcaaactc acaagccgaa tcctttatga 3660 aaccactgat taaaacaata cgagcagcac acattgaacg gaaagactgg aaaaagcaat 3720 tatacaactt cctgttttcc taccgtacca caccccattg cacaacccga ataccaccat 3780 ctacactcat gtttaatcgc gttacaggtt ttacaatacc ttcattccaa acaagggtcg 3840 atacaaatat caatgatcaa gccaaaaaga gacaagaaaa tgctaagtta tacagaaaaa 3900 aatatcacga tttaaaatac cacgcaaaaa atccaagttt aaatatcggg gatacagtac 3960 tgataaaaca aaaaaaatat aacaaaatta cggcaccatt cgagttcaaa ccctacacaa 4020 ttactgcaaa gaatggtaca atgatcactg caaaatctcc cgtcgacaat tctcaaaaaa 4080 ctcgaaatgc ttcacacatg aagaaagtac caacggatat acaatttaat ccgataccaa 4140 ttaaagaaga gaaagatgaa gaatacctcc agaacgaatc aattttgcca gaaaaggaaa 4200 taacagctac tatgccgaca caactaagaa gctatcatca agaaccgatt ctgaagacat 4260 atcctaacag agaaagacgt cctatagaat tatggaggaa gtattaaaac attataaaac 4320 acaaacagta aaacatttca tttacagaag aaggggaaga 4360 // ID hAT-70_HM repbase; DNA; INV; 3795 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-70_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3795 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 410-410 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(733..1035,1017..1559,1720..2541,2551..3354) FT /product="hAT-70_HM_1p" FT /translation="MPIKATILWNYFNKVTVSGVKCGECKKCQIQIKLTDG FT STSGIRYHLKTKHPTDYATMLRAQIELERQREEDIKEITEAAFELNDAYSK FT FLIIFHKIMFFLVNNVFPCIKIIFMFNFNFYFVFLDQRSQKGLSKCVNGGT FT IGKYSSQSGRQLETDMVIMSYIARSSSPFTLVDEPCFKEMVEHLNPKVIVK FT HSSTFSRYKLPLLYESVMDTVQNLIEKEVSTCQQVAITTDGWTSRSQDPYM FT TLTLHYINSQFELKKYVLNFDNFVGRHTGYHISKVIFLMLIIINFSSFLFK FT ALEEMIQKYPVLDAVPKKVIVHDAAANMKHAVSIMNKTKYESLLCADHLLN FT TSLLHATNEVQEVKECIDIATQLSSKVHRSTLACQLIEKECMLLRVNYVKI FT IAPVKTRWNSNCFMLESILKLKDVLISLREKDALNNNPLQNVIPSDNQFSL FT MASLMPVLKKMKIMSDCMSKDTEPVLHHMLTMLYKTDNFLHDHIDMENREI FT VNAFCKKLSEHLHLATRFNEHGRYNQHYALGNLLHPYFRGYKFYFLVKKTF FT FNILYLLKDQLLFMISGYMLRHYGIFEEFTRKVINDHPSSDAVSISEKNNH FT KATQVKMPVGVDECDDLQNIEIEQGNNNHTISQNVPEIEREFESFHAMPRV FT EEKHKVDVLKWWKKHASTLPLLAQLARNLLCIPAASSCSERVFSASGGIIN FT DKRHSLSTQTAKQLTLIKVNYDLVCPHMTLKIISDAEEALDPPALTPVRTP FT PKNPIPKAVFSNNQKRLLFKTKNVSTTPLCSSSLPSTPTSSGIKRKTMEIP FT EESETTSSEKTSESDTQQVLK*" XX SQ Sequence 3795 BP; 1308 A; 654 C; 607 G; 1226 T; 0 other; taagggtctg cgtctgctcg tattctctcg gaagttcggt atgaaacggt aaaaaatgtg 60 agtattcgtt tcgttccgtt tttacgaaac aaaaaatatc attttgtttt gtatcgcatt 120 tatacgaaac acgtaaatct tgtttcgttt cgtatcatgt tgttacgaaa cgtgaaatta 180 ttttttcgat ttgtaccgtt atttacgaaa cacaataaaa ttgtttcgtt tcttatcggt 240 ttttacggaa tatttcataa ctccccgccc aaacagatat ttcacgattc ccatgccctc 300 gagcaataca taaactttta cgtttaatac ttactaactc aatgtttcgt aaatcaaagt 360 ttaactaaaa aaatgtatta acaaagcaat gcaatagcat cattttcata aggcttagcg 420 catctaacga agttaaaaaa agatattgtc ttctgtaggt aggttttagt aaaactgatg 480 aacttaaagt taaaatttta ttacagtaaa accgttttac tgtaataaaa ttttaacttt 540 aagttcatca gacataaatt atgtatttgt ttataaactc gcatgaagag tttgaaatca 600 ttgtttgaat taaatacttc aaattatatt atataacatt ttaaatgcga tattgttgtg 660 ttaacttgaa atattttgca aaactaatta atttattgtt taaaaaccat agtaattctc 720 aatcatataa tcatgcctat aaaagctaca attttatgga attatttcaa taaagttact 780 gtgtcaggtg ttaagtgtgg ggaatgtaaa aagtgtcaaa tccaaatcaa gttaactgat 840 ggatccacta gtggtatcag gtaccatctg aaaacaaagc atcccactga ctacgctacg 900 atgttaagag cccaaattga acttgagcgc caaagagaag aagatattaa agaaattaca 960 gaagctgcct tcgaactgaa tgacgcatac agtaagtttc taattatttt tcataaaata 1020 atgtttttcc ttgtataaag ataattttca tgtttaactt taatttttat ttcgtttttt 1080 tagatcaacg atcacaaaaa ggcttatcta aatgtgttaa tgggggaact attgggaaat 1140 attcttccca atcaggcagg caattggaga cagacatggt aatcatgtca tacattgcca 1200 ggtcaagtag tccattcact ctagtagatg aaccatgctt caaagaaatg gtggaacatt 1260 taaatccaaa agtaattgtg aaacattcct cgactttttc aaggtacaag ctgccattgc 1320 tttatgaaag cgttatggat actgtgcaaa acctaattga aaaagaagtt tccacttgtc 1380 aacaggtggc tatcaccact gatggttgga cctctaggtc acaggatcca tacatgactc 1440 ttacgcttca ctatatcaat agtcagtttg aactgaaaaa atatgttctc aactttgaca 1500 atttcgtggg aagacacact ggttaccaca ttagcaaagt aatttttctt atgttgatat 1560 aatttgttct cgtaaatata aatttgtaag tgcaaaaata atttaaaaaa gtaaaactta 1620 tccttccgct agaatagcgt ttcttctagg attttgtcaa ttattgaaag aatttataat 1680 ttattgtaaa acctaaaagc tgcataagtt atcttataaa tcattaattt ttcttcattc 1740 ttgtttaagg ctctggaaga aatgatccaa aaatatccgg tattggacgc cgtgcccaag 1800 aaagtcattg tccacgatgc ggcggcaaac atgaaacatg ctgtgtcaat catgaacaaa 1860 acaaaatatg aatcactgtt gtgtgcagac cacctactaa acactagcct actacatgct 1920 accaatgaag tgcaagaagt caaagaatgc atagatattg ccacacaact atctagcaaa 1980 gtccacagat ccaccttagc ttgtcaactt attgaaaaag agtgtatgtt gctccgtgta 2040 aactatgtta aaatcattgc tccggtaaaa accaggtgga attctaactg ttttatgctg 2100 gagtcaatcc ttaaacttaa agatgtgttg atttctctta gagaaaaaga tgcactaaat 2160 aacaacccac tacagaacgt tattccaagt gacaaccaat tttcactgat ggcttcactg 2220 atgccagtct taaaaaaaat gaagattatg tcagattgca tgtccaagga cactgaacca 2280 gtgcttcatc atatgctaac tatgctttat aaaactgaca attttcttca tgaccacatt 2340 gatatggaaa acagagaaat agttaatgct ttttgtaaaa aactaagtga gcatttacac 2400 ctagccacgc gattcaatga acatggaaga tacaaccaac actatgcctt gggaaacctt 2460 cttcatccgt actttagagg ttataagttt tatttcttgg taaaaaaaac attctttaat 2520 attctttatt tgttaaaaga ctaaatttaa cagttattgt ttatgatttc aggatatatg 2580 cttagacatt acggtatctt cgaagagttc accagaaaag tcatcaacga tcatccatcg 2640 tcggacgctg tttcgatcag tgaaaaaaat aatcataaag ctacacaagt taaaatgcca 2700 gttggtgtgg atgagtgcga cgatttacaa aatatagaga ttgaacaggg taataataac 2760 cacaccatta gtcaaaatgt accggaaatt gaaagggagt ttgaatcttt ccacgctatg 2820 cctcgtgttg aagaaaagca taaagttgat gtgctgaaat ggtggaagaa acatgcctct 2880 actttgccat tgttagcaca attggcaaga aatttgctct gcattccagc tgcatcaagc 2940 tgtagtgaga gagtgttcag cgccagcggt ggaattataa atgataaaag acacagtctt 3000 tccactcaga cagcaaagca attgactttg attaaagtca attatgatct tgtctgtcct 3060 cacatgacct tgaaaattat aagcgacgct gaagaagccc tggacccccc agcattaact 3120 ccagttagaa cacctcccaa aaatcctatt cctaaagcag ttttttcaaa taaccaaaaa 3180 aggttattgt tcaaaacaaa aaacgtttca acaacgccat tatgttcttc ttctttacca 3240 tcaacaccaa catcttcagg aatcaaaaga aaaacaatgg aaattccaga agaatccgaa 3300 acaacatcat ccgaaaaaac atccgaaagt gacactcaac aagtattaaa ataaaaatgt 3360 tttacttttt attgtgaaat tttaaaatcg tgaaatataa ttgtctaaat aaaaactgat 3420 ttttcgtaaa aatcgcaaat gtgtgcttaa attggtacaa tataaaagta aattactttt 3480 ttattttatg ctagccctat ctgtcgtaaa aacattagta tttctcgtaa aactcagaaa 3540 tttcttgttt gtttgtttcg ttttttaccg ttttatttcg gaaaaagttt aatcttattt 3600 cgtttcgtac cggttttttc aaaacaaaaa atgtgtcccg tatttatcac gaaaccagaa 3660 aattgtattt cttttcgtac cgtattttgc gaaaaaagat aaaaattgtt tcgtttcgta 3720 acggttttat cgaaacgata tttttcatcg ttttgtttcg tttcgtcttg taccgttctg 3780 agacgcagac cctta 3795 // ID Gypsy2_MH-I repbase; DNA; INV; 4754 BP. XX AC ABLG01001375; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_MH; KW Gypsy2_MH-LTR; Gypsy2_MH-I. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-4754 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1520-1520 (2009). XX DR Genome; ABLG01001375; Positions 994 5747. XX CC Positions [3753-4271] - Integrase core CC 'AAGAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 528..1967 FT /product="Gypsy2_MH-I_1p" FT /translation="MSTLSPDQLKQLIDSFQVVAQQLKTSSGASINSATLA FT STLEARIGKYYFDAEGGNTFDNWYKRYGNFIEIDGKDLDESAKVRLLVGKL FT GDQEYSRLANSVLPKLPDQLKFKEVTEKLNTLFADSKSLFVRRFECFQFKQ FT QPGQDIGTFISSVNASCETANLSLSKETLKCMIMVVGIRDEYHDLRQKCLQ FT ILEEARKRNEEISLDKLGEECRSFPLLRESARSLSIVANGTPATNALSAKP FT QKWRENKPTVRNKINPKQSSHKQKNNSPSTPCLSCGNKGHWRSECQQRWSN FT CLKCGKKGHISAVCRSSNFQGNKKPPAIHHISFGPECLGTDFKNSDWWTVN FT PLINGIPCEMKIDTGSQISVFTSKTWKHLNSPKLEKVYFSAKNCNGAPFIL FT RGKFKCNVQHKTTQSTELVGYVSDHIVHDLLGLPWIKALDILQTESFCSNI FT NTAKPLKSTLKAIKDESSLGKSLQNNFKECLWPKD" FT CDS 2379..4733 FT /product="Gypsy2_MH-I_2p" FT /translation="MQCQRLPFGVKSAPAIFQQLMDQMLSGIPGAFAYLDD FT IVIASPSMEKHIETIFELFSQIQRFGLKIHLEKCSFLKKEIKFIGHIVNSS FT GIRPDPARTEAIRQMPAPTDITSLRSFLGAINYYGRFIKSMREIREPLDKL FT TRKDIAWEWSSEQETAFAKAKQILTSDLLLTHYDPKLPIVVAADASKNGIG FT STISHMFPDKSEKVIEHASVSFSSAQQNYSQIEKEALALVFAVQKFHKMLY FT GRKFTLQTDHKPLLAIFGSKNGIPIYTASRLQRWAITLSNYDFDIKYVNTE FT SFGKVDVLSRLIAEYPRPEEDKLIANICSETEHYVNAICENSTSVLPITAT FT EIANKTSQDPILLKVCNFIQTGWPQKCKDPDLTPFFPKRNQLSIVENCVLY FT GHRTIIPPTLQQSVLKAMHLAHPGISKMKAIAMEPGINLEIERLVRSCEDC FT QNAAKVSTKVPLCTWPVPKQVFERVHIDFAGPCSDGFTYLIIVDAYSKWPE FT VYRMQNTSSKETIRTLQLLANRLGLPKEIVSDNGPQFRSFEFATFCKQNGI FT KHTFTPPYHPQSNGQAERFVDTFKRAMKKCQREGKDWAEKALLAYRTTPHQ FT AIDGYSPDQLFLGRRLRTKLTLLHPDEKIVKEKVSATLPLKRQEYNSKMTK FT SFQQKHGAKQSEFCPGESIYLLNYRYGKTMWIPGKVITRIKNSPTYKISVP FT SIGHDVHRHANQLRRRLPIELDDDEVTDPEEPPITHEEDARDELPSPHKPI FT EKPPARQSPRAKRERRPPRRLELDPQQKTYTYVR" XX SQ Sequence 4754 BP; 1570 A; 1139 C; 842 G; 1203 T; 0 other; attggcgcgg tctaaaattg gactctaaaa ttggactcta ctttatttta ttaattcgac 60 aattcgaatt ctattttgtt cgacttttaa ttttattaaa ttttaacaac aaaaaaattt 120 attcaagtcg aatttgacta ttcgacaaat tctacatttc tttattgcct aaatttggaa 180 aactttctgt cgacaaactt cgacaaattc ttaatttaat ttttaaaatt cttcatttgt 240 cactaggcgt gactcaccac tcaaatcgat acaatttctt ttaaatcaaa ccatcaccct 300 tattagcttt gtgctaataa gtgaccaaca gctccaaaac cttttattag ctcggcacta 360 gtaaaggttc tactcatcaa aacaaaaccc tattagctag gtgctaataa acggttccta 420 ggcggaactg ggttaaattt ctacaaattt tctgcactat taacttggcg ttaatattga 480 ttcactttgt gtgaactgtg cacccttcaa aacaaatcga ttgaaaaatg agcacactct 540 cgcctgacca acttaagcaa ttaatcgact catttcaagt cgtcgctcaa caattaaaaa 600 catcgagcgg cgcatcgata aactctgcaa cactggcctc aacacttgaa gctagaatag 660 gcaaatatta tttcgacgca gagggaggca atacattcga taattggtat aaacgctatg 720 gaaattttat cgaaatcgac ggtaaagacc ttgacgaatc agcaaaagta cgacttttag 780 tcggaaaact tggagatcaa gaatattcgc gattagcgaa cagcgttctg ccaaaattgc 840 ctgaccaatt aaaatttaaa gaggtcaccg aaaagctcaa tacgcttttt gccgattcaa 900 aatccctttt tgtaaggcga tttgaatgct ttcaatttaa acagcagcca ggacaagata 960 ttggcacttt tatttcttct gtcaatgctt cctgcgaaac ggcaaacctc agcctcagca 1020 aagaaacact taaatgcatg ataatggtcg taggaatacg cgacgaatat catgacctta 1080 gacaaaaatg tctccaaatt cttgaagaag cgcgaaagag aaatgaggaa atcagcttag 1140 ataagcttgg agaagaatgt cgttctttcc cgcttcttcg tgaaagcgct cgatcacttt 1200 caattgtagc caacggtaca cctgccacaa acgcgttaag cgctaaacct caaaagtgga 1260 gagagaataa gccaacagtg cgcaacaaaa tcaatccaaa gcaaagcagc cataaacaga 1320 aaaacaattc cccatcaaca ccatgtcttt catgtggaaa taaaggccat tggcgaagtg 1380 aatgccagca acgctggtca aactgcttaa aatgcggcaa gaaaggacac atttctgctg 1440 tctgccgttc ttccaatttc caaggaaata aaaagccgcc agctattcac cacatttcct 1500 ttggaccaga atgtttggga actgatttta aaaattcgga ttggtggaca gtgaatccac 1560 taatcaatgg aataccctgc gaaatgaaaa tagatactgg atcgcaaatc tcagtattca 1620 cttcaaaaac ctggaaacat ctgaatagtc ctaagcttga aaaagtttac tttagcgcca 1680 agaactgcaa cggcgcacca tttatccttc gtggaaagtt taaatgcaat gttcaacaca 1740 aaaccacaca atcaactgaa cttgtgggtt acgtgtctga tcacatcgtg catgatcttc 1800 taggactgcc ttggattaaa gcactcgata ttctccaaac cgagtctttc tgcagtaaca 1860 tcaacacagc aaaacctctc aagtccacac taaaagccat caaggacgaa tcaagtctag 1920 gaaaatccct tcaaaataat ttcaaagaat gtctttggcc caaggactag gacactgcac 1980 aaaaatgaaa gctcacctcc acctaaaacc agaggctaaa ccaatcttct gcaaaggcta 2040 ggcccgtccc atacggagcc accgaagaag taaacgccga acttgaccga ctattggcca 2100 tcgggtcact gaaaaagatt gacttcagtg attgggcagc tccaatactc gccgttaaga 2160 aaaagaatgg aaatattcgg gtgtgtattg atttctccac aggtctaaac aatgcacttg 2220 agttaaatcg tcacccactg ccaagagtag aagatattta cgcatcgatt tctggcgcaa 2280 agttcttctc acaacttgat ctgcgtgatg catatcttca aattgaactc gacgatgagt 2340 ctaaaaagct ctgcggtgtt aacacacatc gcggaataat gcaatgccag cggctacctt 2400 tcggcgtaaa gtccgctcca gcaattttcc aacaattaat ggaccaaatg ctgtcaggaa 2460 taccaggagc attcgcctac ctggatgaca tcgttattgc tagcccctca atggaaaaac 2520 acatcgaaac aatctttgaa ttgttttcac aaatccaacg atttggcctc aaaattcatc 2580 tcgaaaaatg cagttttctg aaaaaggaaa ttaaattcat cggccacatt gtcaattcat 2640 caggaattcg acctgaccct gcaaggactg aagctatacg ccaaatgcct gcaccaactg 2700 acatcacctc actccgatca tttctcggcg caataaatta ttacggacgc ttcattaagt 2760 ccatgcgtga aattcgtgag ccacttgata aactcacacg caaagacatc gcctgggaat 2820 ggtcctcaga gcaagaaact gcttttgcta aagcaaaaca aattctcact tctgacttgc 2880 tcctcacaca ctacgaccca aaattaccaa ttgtcgtagc tgcagacgct tcgaaaaacg 2940 gaatcggatc caccatcagt cacatgttcc ctgacaaatc tgaaaaagtc atcgaacatg 3000 cttctgtttc tttttcatcc gcacaacaga attatagcca aattgagaag gaagcacttg 3060 ccctcgtctt cgccgttcaa aaattccaca aaatgcttta tggtcgtaaa ttcactctgc 3120 aaacagacca taagcccctc ctcgccatct tcggttcgaa aaacggaatt cccatatata 3180 cggcaagtcg tcttcaacgt tgggccataa ccctctcaaa ttatgatttt gacatcaaat 3240 atgtcaacac tgagtctttt ggaaaagtgg atgtactttc tcgactcatt gccgaatacc 3300 ctcgcccaga ggaagacaaa ttaattgcaa acatttgctc cgaaactgag cactatgtaa 3360 atgcaatctg cgaaaattct acttcggtat tacccatcac agccacagaa attgccaata 3420 aaacttcaca agaccccatc ttgctcaaag tctgcaattt catacaaaca ggttggccac 3480 aaaagtgtaa agaccctgat ctaacaccat tttttccgaa acgaaaccaa ctttcaatcg 3540 tcgaaaattg tgttttatac ggccacagaa caataatacc accaacattg caacaatcag 3600 ttttgaaagc gatgcacctc gcgcatccgg gaatttcaaa aatgaaagca atagccatgg 3660 aaccgggaat taacttagaa attgaacgat tggtaagatc gtgtgaagat tgccaaaacg 3720 ctgcaaaagt ttctacaaaa gttcctctct gcacatggcc agtaccaaaa caagtttttg 3780 agcgagtcca catcgatttc gccggaccct gctcggatgg cttcacctat ttaattattg 3840 tagatgctta ctccaaatgg ccagaggttt atcgaatgca aaatacctcc tcaaaagaaa 3900 caatcagaac tcttcaatta cttgcaaacc gactaggcct gcctaaagaa attgtttccg 3960 acaatggtcc tcaattccga tccttcgaat tcgccacatt ctgcaaacaa aacggaatta 4020 agcacacatt cactcctcct taccaccctc agagtaatgg ccaagcagaa cgttttgtcg 4080 acacgtttaa gcgtgccatg aagaaatgtc aaagggaagg gaaggattgg gctgaaaaag 4140 ctctactggc atatcgaacg actccacacc aagcgattga cggatactca ccggatcagc 4200 tatttctagg tcgcagactt agaactaaac ttactctgct tcaccctgac gagaaaatcg 4260 ttaaagagaa agtctcggcg actttacccc ttaaacgtca agaatacaat tccaaaatga 4320 caaaatcatt ccagcaaaaa cacggcgcaa aacaatccga gttttgcccc ggtgaatcaa 4380 tttacctgct gaactatagg tatggaaaaa ctatgtggat ccccggaaaa gtcatcacta 4440 gaatcaagaa ttcacccact tacaaaattt ctgttccatc aataggacac gatgtccata 4500 gacatgcgaa ccaacttcga cgtcgccttc ctatcgaatt ggatgatgat gaagtcactg 4560 atccagaaga accaccaatc actcatgagg aggacgctcg agacgagcta ccctcaccac 4620 acaagcctat tgaaaaacca ccggctaggc aaagccctcg tgcaaaacgt gaacgtcgcc 4680 ctccacgacg tcttgaactt gaccctcagc agaagacata cacctatgtg cgctgaacct 4740 tgcaagggga gaga 4754 // ID hAT-32_HM repbase; DNA; INV; 2365 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-32_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2365 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2021-2021 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 655..1728 FT /product="hAT-32_HM_1p" FT /translation="MKRSQPNSTSADKYLKIDKFFTRTSLKEKDLFDLQLG FT RFIYSTNSSFRTVEHKEFKKFINMLHPGYTLPNRTQIGGEILDRVYTEEIE FT KCNVLNGKIVAMSLDGWSNVHNEPIVCISVITDKINILVETINTSGHSHTG FT EYLTQLAREAIDFVRRKFGCTVKSFVTDNAANMKSMRQVLSSERDIIITYG FT CAAHMLNLLANDLNIENVSKHVTQIIKYIRNNHQAGAIYKLSGGTKLPLPT FT ETRWNSVCDSLEKYVNNWSIIFAMFEKNKDLDPVIGKKVSDIQIKRNTEDL FT LKILKPIAVALDKLQRDQTKISDTVEIFKDLMNSLSFLSNEKKKKKKLNLD FT TFRLSVMLISWQTH*" XX SQ Sequence 2365 BP; 944 A; 307 C; 341 G; 773 T; 0 other; cagggttgca tggttttaac caaatggttt aaaccatggt ttaaaccaat taaaaaaatg 60 ttggtttaaa ccaaactaat gtttttttga aaagatttga acttacaaca ggaaactaaa 120 acagttcaca aattaactaa tcttcagtat agatatattt ttttaaatcc atgattttat 180 ttttttcata gaatagagtc atcagtgttc ttacagaaca gagcagtaat aaattagtaa 240 taacacttat ctgttttctc ctacaggctg tttctccttt aataggttta tcaatatata 300 tatatatata tatatatata tatatatata tatatatata tatatatata tagagatata 360 tatatattta tatatatatt atatatataa atgaaaggtc aaattcaaag gatgcatgaa 420 catataaaaa agtacaagca atcacaaccc catgaaatta tggagtcaat tgaagatcaa 480 caactttttg aaggtaatta atttactaca ttgtcattta aaattttata ctttgtctaa 540 actgtaaata tatatatatt taaattacta tcatgtatag acattaattt ataataattc 600 aataaactgc aatttcattt taaaggtaac tcaccatcaa catcacaaca acaaatgaaa 660 aggtctcagc ccaattcaac atcagctgat aaatatttga aaattgataa gtttttcaca 720 cgaacaagcc tcaaagaaaa agatttattt gatctgcaac ttggtagatt tatatacagt 780 acaaactcta gcttcagaac agttgaacac aaagaattta aaaaatttat caatatgctt 840 catccaggat acactctacc taatagaact caaattggag gagaaatatt agatagggtt 900 tacacagaag aaattgaaaa atgtaatgta ttgaatggga aaattgtagc catgtcttta 960 gatggctgga gtaatgtaca taatgaacct atagtttgta tctctgtaat tacagataaa 1020 attaacattt tagtagaaac aatcaacact tctggtcatt cacatactgg tgaatatctt 1080 acacaattag caagagaagc aatagatttt gtaagacgta aatttggatg cacagtaaaa 1140 agctttgtga ctgacaatgc agctaatatg aaatcaatga gacaagtatt gtcatctgaa 1200 agagatatta tcataactta tgggtgtgct gctcatatgc tgaatctttt agcaaatgat 1260 cttaacattg aaaatgtgag caagcatgtt acacaaataa taaaatatat aagaaataat 1320 catcaagctg gagcaattta taaattaagc ggaggaacaa aattgccact tcctactgaa 1380 actcgctgga attctgtatg tgattcactt gaaaaatatg tcaacaactg gagcattatt 1440 ttcgcaatgt ttgaaaaaaa taaagacttg gatcccgtga ttggtaaaaa agtaagtgat 1500 atacagataa agcgaaacac agaagattta ttgaaaatat tgaaaccaat tgcagtagct 1560 ctagataagt tgcaaagaga tcaaacaaaa ataagtgaca ctgttgaaat ttttaaagat 1620 ttgatgaaca gtttatcatt tttatcaaac gaaaaaaaaa aaaaaaaaaa attgaatcta 1680 gataccttca gactgtcagt gatgctcatt tcctggcaaa cacactagat cctagatatt 1740 ttggatctaa tctttctgaa aatgaaaatg aagctgccat gaatttcgca gataaagagt 1800 tcaagaattt gttaccaata ttattcaaat taaaagccaa atctactcca ttcgataaaa 1860 gctatttgta tagtgaatct gttttaaaaa atgtatcacc aatagaatgg tggaaatctc 1920 taaaaattgt ggattataac ataatggata ttgaaggttt attaaatgca ccagcttcta 1980 gtgctggaat cgaaagaatt ttttcaactt ttggtcttgt tcattctaag ttaagaaacc 2040 aagaaaccga aattgggggt agaaaaagca gcaaagcttg tatttatata tagaacatta 2100 aacaatgata tattgaataa ttttgatgaa atataacatt ggtaaagata tttggtaata 2160 atattggtaa taatattgat aaagaatata gtttactcaa ataaaatacg ttatgctttt 2220 tttaatttaa aaggataact aaataccgca ttttttagta aaaacctaat ttaaaccaaa 2280 aaaaccatgg tttttttggt ttttttaaaa aaacctatgg tttttggttt ttttcaaaaa 2340 aaaaaacggt ttttttgcaa ccctg 2365 // ID BEL-14_DWil-I repbase; DNA; INV; 4812 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_DWil_; KW BEL-14_DWil-LTR; BEL-14_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4812 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 706856 702045. XX CC Positions [3886-4443] - Integrase core CC 'ATACT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 123..3062 FT /product="BEL-14_DWil-I_3p" FT /translation="MEELKALVGRRSRLKASITRILDWSERTQTTSTTEVA FT ARMDQLQSVWKEFNGIGDSIALLEDVDGYVDPEVDHVTYEERYLKAYSLLL FT GKKRLVQQQAPLSGDGDGAMGLHAYNDDIVHLLQQQQQLFEQLAANQSSSN FT LSMPARNEMAASGSTSGNGTTAFVHGGELPKIQIKRFAGLYTEWPAFQDIY FT ESTIHNKRELTNTQKFHHLKTLLVDDAVNLVRHLAITDTAYNTAWERLKER FT YNRPRHIVNSFLEKFMGLPTTNKIDVSILRKVSDGTNEIVRGLDAINHTGR FT DCWIQYLVLEKLDADTRRRWIERSMGNEAPTLEEFFKFLDDRCEELELSKR FT EIAFGGKTPAQITHTKRVTHSMVAVQAGSCTKCQATDHKLYGCQQFMDLTL FT AERRMFIREKALCYNCLKSGHMVKNCPSSFSCKYCKGKHHSLIHDPGNSMA FT LRNLQQGKRENQRSPNASHHDLSTTSLVAHNGSAPATVCSQSSAKLKTGKG FT LNRSILPTATVYVQKVNGDYITCRILLDTGSELSYVSERCIQALGLPRSAS FT SILVTGISSVQADTTRGSSTLQIKSRYSNNHLTVQAHVLGKITSTLERQNI FT DALALDVFSDLQLADSEFTTSAPIDVLLGGEHVWSTITGKKLYDNTGKLIA FT ISSIFGWVISSITMPQANNAFALTSYIDVNASLQRFWELEDISSTTKLEPD FT DEQVEKHFLATHTRDEKGKYIVELPFKVTNPEFGDTLQGALNRFQSVERRL FT QQDANLRAKYVNFMREYFDLGHMRELPPDEVNNAQRFYLPHHPVLGRKLRV FT VFDGSFCDTKGKSLNDYLFTGPSIQRDLFAVCLRFRMYKFVFSADIVKMFR FT QIWVNDKHRDFQRIVWRERPSDPIKHYQLCTVTYGTSCAPFLAVRVLEQLA FT TDHQQEFPNAAKILQEDFYVDDVHTFDKGGTSTSKGHGFCMVRFHNRSLLA FT IIRTLSTKDICWEPNFGNSGHHSKA" FT CDS join(2860..3882,3886..4812) FT /product="BEL-14_DWil-I_1p" FT /translation="MLQKSYKRIFMSTMFIRSIKAALRHQKVTVFAWCDST FT IVLYWLSYAPSRLKTFVGNRTSEILDTIPRHDWRHVDSKSNPADCASRGLM FT AADLKDFQLWWNGPSWIRDADQFLVRLNNSQVCLNISEKNIEKEVKSNCLT FT ALVEAAPDHPLDHLVQRVSSWLTLVHTVGYVLRFLRRTKGPFGDKGSNCLT FT FEEITAARIVCLRHAQTCFQDDYQLLLANKPLRSRSQLAKLSPMIDKDGLL FT RVGGRLHHSQLSTEAKHPVLLPKSHRITKLILEYEHRVNLHPGVSSLFVMV FT RQTFWIFGARNLIRKVTHDCLACFRQRHHTSQQRMADLPSIRVTQHFHLTL FT DVTMLVQSFLRMGKGRKPRIGKGYICLFVCLVTSAIHLELVTDLSTESFLA FT ALRRFVSLRGKCNKIYSDNGTNFIGAKRSLDEMQKLLASQPHIEKVRNALA FT NDGIQWVFIPPHAPHWGGKWESAVRCVKLHIRRVIGKSTLTYEQMRTLLAQ FT VSAVVNSRPLCYNSDTDINYLSPAHFLIGRPLTTIPEADLGHIPVGRLGYW FT QSIQSMMQGFWKQWHQEYLTTLQQRPKWTTTTPNIAVGDVVLVIESNTPPA FT HWHLALVLEAYAGKDQLVRAVRLKTSSGELTRPITKVAVLPRSETVFQGGP FT G" XX SQ Sequence 4812 BP; 1380 A; 1098 C; 1127 G; 1207 T; 0 other; taatttggtc attcgagccg aataagggaa tcggttttcg atcgaataaa gtgcttttcg 60 ttcgaatttc ttggatctat tggaaagttt tgaggattat tactctggaa tagtttttta 120 aaatggaaga actaaaggca ttggtgggtc gacggagccg actcaaggcc agcataacaa 180 gaattctgga ttggtcggaa cgaacccaaa ctacatcaac tactgaggta gccgctcgga 240 tggatcaact ccagtccgtg tggaaggaat ttaatggaat cggggattcc atagcccttc 300 tggaagatgt ggacggctat gtggatccag aggtcgatca cgtgacctac gaagagaggt 360 acttgaaggc atattcttta cttctgggaa aaaaacggtt agttcaacaa caagcaccat 420 tgtctggcga cggcgatggc gccatgggtc tgcacgcata taacgacgac atcgttcatc 480 tgctgcaaca acaacaacaa ctctttgaac agttagctgc aaatcagagc agttcaaact 540 tgtcgatgcc cgctcgcaat gagatggctg cttcgggttc gacttcggga aacggcacga 600 cggcattcgt gcatggcggc gaattgccaa aaattcaaat aaaacggttc gctggactct 660 acacagagtg gccagctttt caagatatct atgagagcac gatacacaac aaaagggagt 720 taaccaacac tcaaaagttc catcatttaa aaacactgct cgtcgatgat gctgtaaact 780 tggtgcggca cctggcgata actgacacgg cttacaacac tgcttgggaa cgcttaaagg 840 aaagatacaa taggccacgc catattgtga actcattttt ggagaaattt atgggtctac 900 caacgacaaa caaaattgat gtgtccatct tacgcaaggt atcagatggc acaaatgaga 960 ttgttcgcgg tctggatgcg atcaatcata caggacggga ctgctggata caatatctgg 1020 tattggaaaa gcttgacgct gatacacggc gcaggtggat tgagcgcagc atgggcaacg 1080 aggcgcccac actggaggaa ttcttcaagt tcttagatga tcgttgtgag gaattagagc 1140 tcagcaaacg ggagattgcg tttggaggca aaacgccggc gcaaatcaca cacacaaaac 1200 gtgtcacaca ttcgatggtt gcagttcaag ctggcagctg caccaaatgt caggcgacgg 1260 atcataagtt gtatggctgt caacaattca tggatttaac acttgctgaa cgtcgtatgt 1320 tcatcaggga aaaggcttta tgttataatt gcctgaaatc agggcatatg gtcaaaaact 1380 gtccatcatc attctcatgc aaatattgta aaggtaaaca ccattctctg atccatgatc 1440 caggtaactc aatggcttta aggaacttgc aacaaggaaa aagggagaat cagcgtagtc 1500 caaacgcttc tcatcacgat ttgtctacaa caagtctagt ggctcataat gggagtgcac 1560 cagcaacggt ttgttcacaa agctctgcga aattgaaaac aggaaaaggt ttaaatcgaa 1620 gcatattgcc tactgcaacg gtttatgttc aaaaggtaaa tggagactac attacatgtc 1680 gtatcttact agatactggg tcggaacttt catatgtatc agaacgatgc atacaagcac 1740 ttggattgcc acggtcggca tcatccattt tggtcacagg aatctcttcg gttcaggctg 1800 acaccacaag gggaagcagc acgcttcaga taaagtcaag gtattcaaac aatcacttaa 1860 cggtacaggc tcatgttctt ggcaaaatca catcaacact ggaaaggcaa aacattgacg 1920 cattggcact tgatgtcttt agcgatcttc aactcgctga ctcggaattc accacaagcg 1980 ctccaattga cgttcttttg ggcggtgaac acgtttggtc tacaatcaca gggaaaaagt 2040 tgtacgacaa tacgggcaaa ctcattgcaa tatcatcgat tttcggatgg gtcatcagct 2100 ctataacgat gccccaagct aacaacgctt tcgctttgac atcatacatt gatgttaacg 2160 cttcgctcca gaggttttgg gagctagagg acatcagttc cacaaccaaa ttggaacctg 2220 atgatgagca ggtggagaaa cactttctcg ctacgcacac tcgagatgaa aaggggaagt 2280 acatcgtgga acttccattt aaggtcacta atcctgaatt tggggatact ctacaaggag 2340 ctcttaatcg tttccaatcg gtggaacgac gcctacaaca agatgcaaat ctaagagcaa 2400 aatatgtcaa ctttatgagg gaatacttcg atttggggca tatgcgcgaa ctgccaccag 2460 atgaagtcaa taatgcacaa cggttttatc taccacatca cccggttttg ggtcggaagc 2520 tgagagtagt tttcgacgga tcattttgcg acaccaaagg caaatcacta aacgattatc 2580 tctttacagg gcctagcatt caacgcgatc tatttgctgt ctgcttgcgc tttcggatgt 2640 ataaatttgt attctcagcc gacatagtca agatgtttcg gcagatttgg gtgaacgaca 2700 aacaccgcga cttccagaga attgtctgga gagaaaggcc atcggatcca atcaaacact 2760 atcaactatg caccgtcact tatggcacct catgtgcacc attcctggct gttagggtat 2820 tagagcaact ggctactgat catcagcagg aattcccgaa tgctgcaaaa atcttacaag 2880 aggattttta tgtcgacgat gttcatacgt tcgataaagg cggcacttcg acatcaaaag 2940 gtcacggttt ttgcatggtg cgattccaca atcgttctct attggctatc atacgcaccc 3000 tctcgactaa agacatttgt tgggaaccga acttcggaaa ttctggacac cattccaagg 3060 catgattggc gtcatgtgga ctccaaatca aatccagctg attgcgcatc cagaggtctc 3120 atggctgcag acctaaagga ctttcagtta tggtggaatg gcccgtcatg gatacgtgac 3180 gcggatcagt ttctggtaag gttaaacaac tcacaagtct gtttgaatat ttcagaaaag 3240 aacatagaaa aggaagtcaa aagcaattgt ctgactgcat tagtagaggc agctcctgat 3300 catccacttg atcatcttgt tcaacgagta tcttcatggt tgacgctcgt tcacacggtt 3360 ggctatgtcc ttcgctttct acggcgcacg aagggtccat ttggggacaa gggctcaaac 3420 tgtcttacgt ttgaggaaat caccgcggca cgcattgtat gcttgcgcca cgcgcaaacc 3480 tgctttcagg atgactatca attgctactc gcaaataaac cattgcgaag tcgatctcag 3540 ctggctaaac tctcgccaat gatcgacaag gacggactac tcagggttgg aggacgcttg 3600 caccactcgc aattgtccac agaggcgaaa catccagttt tgctaccgaa atctcatcgc 3660 atcaccaagc tgatacttga atacgaacac agggtcaacc tgcaccctgg cgtctcatca 3720 ctctttgtta tggtacgtca aacattctgg atatttggcg caaggaactt gataaggaaa 3780 gtcacacacg actgtttggc ttgttttcga cagcggcatc atacatcaca acaaaggatg 3840 gctgatctac ccagcattcg tgtcactcag cacttccatt tgtaaacact ggatgtgact 3900 atgctggtcc aatccttctt aaggatggga aaggggcgca aaccgcgcat tggtaagggc 3960 tacatatgcc tatttgtttg tctggtcaca tcggcaatcc atctggaact tgtaacggac 4020 ttgagcaccg aatcattctt ggccgcgcta agacgtttcg tctctttgcg tggcaagtgc 4080 aacaaaatct atagcgacaa cggaacgaat ttcattggag ctaaacgctc tcttgatgaa 4140 atgcagaagc tgctggcatc tcagccacac atcgaaaagg ttaggaatgc tctggcgaat 4200 gatggcattc aatgggtatt cattccacca catgctcctc attggggagg aaaatgggaa 4260 tctgcagtca gatgcgtaaa gctgcatatt cgccgagtca ttgggaaatc tactctcacc 4320 tacgagcaga tgcgaactct acttgcacaa gtcagtgcgg tggtcaactc acgacccttg 4380 tgctacaact cggacacaga tatcaattat ttgtcgccag cacatttctt gatcggcagg 4440 cctctcacaa ctataccaga ggcagactta ggccacatcc ccgtgggccg acttgggtac 4500 tggcaaagta tccaatctat gatgcaaggt ttctggaagc aatggcatca ggagtatctt 4560 accacattgc aacagcgtcc aaagtggacc actacaacac caaatatagc agtgggagat 4620 gtggtgcttg taatagaatc gaacacccca ccagctcact ggcatctggc gctggttctc 4680 gaagcctacg caggcaagga tcaattggtt cgagcggtta gactcaagac ctcttcggga 4740 gaattaactc ggccaatcac caaggttgcg gtattgcccc gttcagaaac tgtgtttcag 4800 ggcgggccgg ga 4812 // ID Gypsy-24-LTR_NVi repbase; DNA; INV; 399 BP. XX AC . XX DT 22-APR-2009 (Rel. 14.04, Created) DT 22-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-24-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-399 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 786-786 (2009). XX DR [1] (Consensus) XX SQ Sequence 399 BP; 128 A; 87 C; 76 G; 108 T; 0 other; tgtgacgtcc tgcgacgtat tgcatttaag caatatcagc ggaatcgtta cgaagccgaa 60 aactgacgac tcagattaat aagtagatca gcgaggttaa tgagtatccc ctgtatacgc 120 cgagcgacca gttgtaatta tacactcgat ctggactctt gtacgacgct cctatccgtc 180 gacaagagtt atccgggctt aagtcgcgat cccgatacct atagaacttt atcttgaata 240 aatcgtgaaa gtcaatcaac tacaagtgtt tttctattga atatatccat cctgtataag 300 cctgactaca gcacagcagc agagtttcca gcccaagcag ttaaacccaa gtaagtaaaa 360 attcaataat taattataga tcggaattgt gacttaaca 399 // ID Sola2-2_NVi repbase; DNA; INV; 4375 BP. XX AC . XX DT 16-FEB-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Sola2 DNA transposons from Nasonia vitripennis. XX KW Sola; DNA transposon; Transposable Element; Sola2; Sola2-2_NVi. XX NM Sola2-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4375 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1488..3497 FT /product="Sola2-2_NVi_1p" FT /translation="MCRRYIIFFFTNHKICIFVMSSKKSQHTDRCCNPYAN FT LGNPHHKGKDLRMVSEYLRSKFPYLSDSDRICWNCRHEKYSCSNNSIEISL FT SNACDTDSVHCDSFDQRNVSLSAADEQNLSSQSNTSVVEVTDNDVRSEREM FT QLEDMLTKLKEKFSSLQITDPLRKRILTIAPEVWSVNKIAKEFNCSRRLAK FT QSKELRASGGILANTTAKAGKPLPDETVRKVKDFYMNDINSRVMSGKKDVV FT SVKSGEDRCLEQKRLLLLDLRELHKLYKESEPKFLVSFSKFAQLRPKQCVL FT AGSSGTHSVCVCTIHQNVKLMLDAINIKKLTEHSAKLLKDYKDCLKQITCE FT NSKTNCLLGECNLCSNTVDLSRFLHQLLDSKGIDKVQYSTWVTTDRSTLQT FT QLFSSHNFVEELCDRLTTLKPHSFISKQQSQFFEEKKNNLCIGEILAVFDF FT SENYKFIVQDASQAFHFNNKQCTLFPVVCYYKENQELKHKSIIFLSDSLQH FT DTAAVYTVQMKLIPHLKKELSAKKIIYFSDGAKQHFKNKFQMINLIHHERD FT FRIKAEWHMHATAHGKGASDGIGATFKREASRQSLLRKSTEPILSPLQLFQ FT WGNQSMKNMILFFYSKQEHTKSQRFLNARFKKAPAVPQISKHHCFIVEKGK FT ILLSKRYSNASDGLNLIYKT*" XX SQ Sequence 4375 BP; 1405 A; 770 C; 805 G; 1395 T; 0 other; gaggcattat ttgtcaaccg gacaccccct gctctccaga gttgtttaaa cttttaagag 60 tcaatcgagg tttcaaaatt tgtgtttttt cgtctagttt ttggagcaga gtggcgtttt 120 tcgattatca aaatggctga gacaatatgg cggcttaata caatggctta atgtaggact 180 ttagtttcat tacagaaatt attcaaaatg aatgagcgca tcaggctaaa aatgtttatg 240 caaaagtaat taatgtcata aaatatggat caaatctaaa aatggtgaaa ttcaagatgg 300 cggatccaat atgatagccg tataggctct tgcaattgaa aattgaaaat aaccgtttga 360 ctgtctccaa atttgatact caggggtttt tgggttcgat aattacaatt ctgatatcca 420 aattgctaaa ttcaagatgg cgggtccaat atggcatcct tatagtccct tgatattcaa 480 aatggagaat aagcgttcga ttggctcaaa atttgataca caggggtttt tagagttcat 540 aattacaatt ctgatgtcaa aatgtaaaat ttaaaatggc ggacggatat ggtggatcca 600 acatgatggc gtacaggtcg ttgccatact aaatgagcat attgcatagt cgcatataca 660 ataattgata ataatagtat aaatttttct taattcttga ataagtttat atctatcaga 720 actattagga tactactttt ccattaacgt tttcctccct tttcaaatga aaattgtttt 780 tctgacttga gttgcataat tggatctttt tttgaatctt ccattgaaac atctctacaa 840 cagtattgtt ttattggaat cgcggctacg gaaaaattgg tgctcgtgag gtttacaaat 900 tgccgaaatt ttgtcagtca tatccttcta ttgtttgcat tattcgtttt cagaataact 960 gagacaattt tccattcgtc gatgtcaact gtgagtgatc acctgccgac aattcttttc 1020 aggatagttg gtgagttctt tcattcgagg ccgtcaattg tgagtgaaca tctgccgaca 1080 gtccttttca ggatagttgg tgagttcttt cattcgaggc cgtcaatagt gagtgaacat 1140 ctgccgacag tccttttcag gatagttggt gagttctttc attcgaggcc gtcaatagtg 1200 agtgaacatc tgccgacagt ccttttcagg atagttggtg agttctttca ttcgaggccg 1260 tcaatagtga gtgaacatct gccgacagtc cttttcagga tagttgatga gttctttcat 1320 tcgaggtcgt caattgtgag tgaacatctg ccgagagtcc ttttcaggat agttggtgag 1380 ttctttcatt cgaggccgtc aatagtgagt gaacatctgc cgacagtcct tttcaggata 1440 gttgatgagt tctttcattc gaggtctcat ttcatttgtg attgtcaatg tgcaggcggt 1500 atattatttt ctttttcaca aatcataaga tttgcatctt cgtcatgtct tctaaaaaat 1560 ctcaacatac tgataggtgc tgtaatcctt atgcgaattt gggtaatcca catcacaagg 1620 ggaaagatct gagaatggtt tctgagtatc tgagaagcaa gtttccttat ttgtctgata 1680 gtgatcgaat atgttggaac tgcaggcacg aaaaatacag ctgttcaaac aatagtattg 1740 agatatctct cagtaatgcg tgtgatacag attctgttca ttgtgattca ttcgaccaaa 1800 ggaacgtttc attatctgca gctgatgaac agaatttatc atcgcaaagt aatacatcag 1860 tagtagaagt tactgataac gatgttcgat ccgaaagaga aatgcaattg gaagatatgt 1920 tgactaaatt gaaggaaaaa ttttcttcat tacaaataac ggacccttta aggaaacgta 1980 ttctcactat tgcaccagaa gtatggagtg tcaataaaat tgcaaaagaa ttcaattgta 2040 gtaggcgact agctaaacaa tcaaaggaac ttagagcttc aggaggtata ttggcaaata 2100 ctacagcaaa agctggaaag ccattgcctg atgagactgt acgaaaagtc aaagattttt 2160 atatgaatga tatcaatagt agagttatgt ctggcaaaaa agatgttgtt tctgttaaat 2220 caggagaaga tcgttgttta gaacaaaaac gtcttcttct tttagattta agagaacttc 2280 ataaattgta caaagaaagc gaacctaaat ttcttgttag tttcagtaaa tttgcacaat 2340 tgcgtccgaa acaatgcgtt cttgcaggtt cttctggcac tcattctgtc tgtgtgtgta 2400 caattcatca gaatgtaaag ttgatgctcg atgctataaa tatcaagaag cttacggaac 2460 attctgcgaa attactaaaa gattataaag actgcttgaa gcaaataacg tgtgaaaatt 2520 ctaaaacgaa ttgtttacta ggtgaatgta atctatgttc gaacacagta gacttgtctc 2580 gattcttaca tcaattatta gatagtaaag gcatcgataa agtccaatac agtacctggg 2640 ttacaactga caggtcaact ctacaaactc aactcttttc atcgcacaat ttcgttgaag 2700 agttatgcga tagattaaca actctaaagc ctcattcttt tatttctaag caacaatcac 2760 agtttttcga agaaaagaaa aataatttat gcatcggtga gatactagct gtcttcgact 2820 tttctgaaaa ctataaattt attgttcaag atgcatctca ggcatttcat tttaataata 2880 aacaatgcac cctttttcct gtagtatgct actataaaga aaaccaagaa ttgaagcata 2940 agagtatcat atttttatca gatagtttac agcatgatac agcagctgta tatactgtgc 3000 agatgaagtt aattccccat ttaaaaaaag aactcagtgc aaagaaaatc atttacttta 3060 gcgatggtgc aaaacagcac tttaagaaca aatttcaaat gattaatttg atacatcacg 3120 agagagactt tagaattaag gctgagtggc atatgcatgc aactgcgcac ggtaaaggcg 3180 cttcagacgg tattggagca acattcaaaa gagaagcttc gagacagagt cttttgcgta 3240 aatcaactga gcctatttta tcaccgcttc agctctttca atggggtaat caatccatga 3300 aaaatatgat attattcttt tatagtaaac aagagcatac aaaatctcag agatttttaa 3360 acgcaaggtt caaaaaagct cctgcagtac ctcaaatatc aaaacatcat tgttttatcg 3420 tggaaaaagg aaagattctg ttatcaaaga gatactctaa tgcttcagat ggattaaatc 3480 ttatttataa aacatgaaac tttacgaatg agggatgaac gcatcagctc ataatatttc 3540 tccaatataa aaatccgcga tgttccaatt ttcacactgc ttcacggact ttaagcgaac 3600 tcgcatcagt aagacccata taacctcgat attgctatct cgaatgaaat aactctaaca 3660 ccgaatttat gacttaaaaa aaacgtccct gataagtgat ttcaagacgt tttcagaatc 3720 gttatggaaa ttacattact attactgacc atgagtgcat acaccatgcc ccttttaatt 3780 tgcaagggcc taattttgta atattgacat taaaatagaa attgccgaac tcacaaaccc 3840 ctcagcatca atttttgagc cagtcaaacg attgttcacc attttgtatt gcaagggcct 3900 atgcggccgc catattggat ccaccatatc cgtccgccat tttaaatttt acattttgac 3960 atcagaattg taattatgaa ctccaaaaac ccctgagtat caaatttaga gccagtcaaa 4020 aggttatctt ctattttcaa ttgcaagagc ctatacggct gacatattgg atccgccatc 4080 ttgaatttca tcatttttag acttgatcca tattttatgt cattaattac ttttgcataa 4140 acatttttag cctgatgcgc tcattcattt tgaataattt ctgtaatgaa gctaaagtcc 4200 tacatcaagc cgttgaatta agccgccata tcgtgtcagc cattttgaaa atcgaaaaac 4260 gccactcggc tccaaaaact agacgaaaaa acacaaattt tgaatcctcg attgactctt 4320 aaaagtttaa acaattctgg agagcagggg gtgtccggtt gacaaataat gcctc 4375 // ID CR1-2_HM repbase; DNA; INV; 3742 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 29-OCT-2010 (Rel. 15.11, Last updated, Version 3) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-2_HM. XX NM CR1-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3742 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(3), 181-181 (2008). XX RN [2] RP 1-3742 RA Bao W. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (13-JAN-2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 763..3669 FT /product="CR1-2_HM_1p" FT /translation="MNKVTKTIKHKKPQSSPSSLNLSFTNIRGLRSNFSSV FT ESYLLQSSPDLLALCETNLSSAVSSCDLSVDGYLPLIRKDSNSHMLGLGIY FT IRKNSPICRETRFESTDYSFMCFRLAPLHSITFLFVLYRSPSSQDCTLLDV FT ISDYIDQALXLYPSANIVVVGDFNVHHTEWLGSSVSDSAGIKAHNFCLSQS FT LTQIVNFPTRFPDNPNHLPSLLDLCLVSDPSQCSVSPHSPLGASDHGLISL FT KLLSHSSLPSESPYHRTSYNYLKADWDSFRDFLRDGPWVEIFRLPVDKCAS FT YITSWIQAGMESFIPSRRFQVKPHSSPWFSSHCAAAIANRNHYFHIYQQNN FT SPENRRLFITARNHCKKVLSNAKARYSQIMKSRISSQKLGSRDFWRIFNSI FT NNKGKSVIPPLLYGSDFVTSPKDKAELFAKNFSSISSLDSTSCVLPDIAVK FT QVDPLLDIRITPASVSKVISCLDSSTACGPDNIPVIVLQKCSPELSSILSK FT LFNKCLSESCFPACWKAASVIPIFKNSGERSDSSNYRPISLLPIISKVFES FT LINKHLISHLESHNLLSDHQYGFRSSRSTADLLTVITDRFYRALDKGGEVK FT AIALDISKAFDKVWHAGLLHKLSSYGVSGNIFKIIESFLSNRSIKVVLDGQ FT HSSSYSVTSGVPQGSILGPILFLIYINDLPDILTSKVALFADDTTIYSCHD FT KKPTPSDCLQGASELEKDLTSATAWGSQWLVNFNSDKTQFFSANRYRNNLD FT LPIFMNGNVLDESPTLHLLGLTLTSDLSWKPYIKSVAKLASAKVASLYRAR FT HFLTLDSILYLYKSQIRPCMEYCCHIWGGSSNDALSLLDKVQKRIVNIVGP FT ALAANLQPLSHRRNVASLSLFYKYYNGHCSKELASLVPSTKIHSRVTRHSI FT KSHPFSVTVPKCSKNSYSSSFFPRTSVLWNSLPSSCFPDSYNLQSFKSSVN FT RYLALQSSSFLFQ*" XX SQ Sequence 3742 BP; 941 A; 862 C; 632 G; 1300 T; 7 other; tattaattaa gtcaggtcag gtttatcctg ctactggtaa catgtaaccc agtacaatct 60 gcttcatgtc ctgctgcctt gtaggatacg cctttttagg caaaggctag gagatgcaaa 120 ctccgacata aaacaccccc tgccttgggg ctcttggttg agtaaaggct agagatggtg 180 tctcgataaa aatactcatc ttgggcagat gttacatgca tccagctgct gtcttgtaga 240 aggcctccta ggcaaagact taaggggtaa acagattcta tctgttgacc agcctcgcac 300 cccttcctca tctattaggc tggcgcagat gtatttttaa tacattgttt ccagtttagg 360 atgttgaacg ctggatcttc ttgactcgat gcatgggttt tgcttgtgtc tctattttta 420 tgactaggca actcattcta ttatctccaa atgagggtac agctctaaaa ctcagtttta 480 tggttctgag gccggctggt agtcaggttt cccgaactct gtggtagctc tcagagaggc 540 tgattccatc aacagctgaa aaatatcaga gtactaacag tgccatgttg cgcatggatg 600 gtgtccctgt ttgtactttt ggtgtgcatt gccaaggcca catttggagc cctttgttac 660 ggcttagggt ttattartrg aaatgaggca attgcttgag ctataaaaca gtgtactgag 720 tactatctat gtttcgagtc aagttcttta acaaatttaa aaatgaataa agtaacaaaa 780 actataaaac acaaaaaacc acagtcatca ccaagttcgc taaacctatc attcaccaat 840 attcgtggtc ttcgaagtaa cttttcttct gttgagtctt atctcttgca aagttcacca 900 gacctacttg ctctttgtga gactaatttg agttcagctg tctcatcttg tgatcttagt 960 gttgatggtt atcttccttt aattcgtaaa gactccaata gtcacatgct tggcctgggc 1020 atttacattc gtaagaattc acccatttgt cgggaaacta ggtttgaatc yacagactac 1080 tcttttatgt gctttcgttt agcaccactt cactctatca cctttctctt tgttctatat 1140 cgctctcctt catcccaaga ctgcactctt cttgatgtta tttctgatta tattgaccaa 1200 gccctcwttc tttatccatc agccaatatt gttgttgtcg gtgactttaa tgttcatcac 1260 actgaatggc ttggctctag tgtcagtgac tctgcaggca ttaaggccca caacttttgc 1320 ctttctcaat ccctaactca aatagtcaac tttccaactc gctttccaga caacccgaat 1380 catttacctt ctctactcga cttatgtctt gtttctgatc ctagtcagtg ctcagtttct 1440 ccacactcac ccttaggtgc ttctgatcat ggtttgatct ctctaaaact attatctcat 1500 tcttctttac catcagaatc cccctatcat cgtacatctt acaactacct taaagctgac 1560 tgggactctt tccgtgattt tcttcgtgat ggcccctggg tagaaatctt tcgtcttcct 1620 gttgacaaat gtgcttctta cataacttcg tggattcagg ctggcatgga atcctttatt 1680 ccctctcgac gattccaggt caagcctcac tcttctccat ggttttcctc acattgtgct 1740 gctgcaattg cyaatcgaaa ccattacttc catatctatc agcaaaacaa ttctccagaa 1800 aacagacgtc tgtttattac tgctaggaac cattgtaaaa aggttttgtc taacgccaaa 1860 gcccgctatt ctcagatcat gaaatctcgt atttcatcac aaaaattagg ctctcgtgac 1920 ttctggagaa tctttaatag tatcaataat aagggcaagt ctgtaattcc acctcttttg 1980 tatggttcag actttgtcac ctcacctaaa gacaaagctg aattgtttgc caagaacttt 2040 tcatcaatat catctcttga ttccactagt tgcgttttac ctgatatagc cgtcaaacag 2100 gttgatccat tgctcgacat tcgtatcact ccagcttctg tatctaaagt gatttcctgc 2160 ttagactctt ctacagcttg cggcccggac aacatacctg ttatagtctt gcagaagtgt 2220 tctccggagc tgtcgtctat cctttcaaaa ctatttaaca agtgcttatc agagtcttgt 2280 tttccagcct gctggaaagc agcatctgtt atccctattt ttaaaaattc tggagagcga 2340 tctgattcat ctaactaccg tcccattagt cttcttccta tcataagcaa ggtttttgaa 2400 tctttaatta acaaacactt aatctctcat cttgaatctc ataacttact ttctgatcat 2460 caatatggat ttcgatcttc tcgttctacg gctgatttgc taacagtaat aaccgatagg 2520 ttttatcgtg cattagataa aggtggagag gttaaggcca ttgctcttga catttctaaa 2580 gcgtttgata aagtttggca tgctggtctt ctccataagc tttcttctta tggtgtatct 2640 ggaaacatct ttaagattat tgaatccttt ctttccaatc gtagtataaa agttgtcctt 2700 gatggacagc actcttcttc ttattctgta acctcagggg ttcctcaagg ttcaatcctt 2760 ggccctatac tctttttaat ttacattaac gatcttccag atattctcac atctaaggtg 2820 gcattgttcg ctgatgatac taccatttat tcttgtcayg ataagaagcc aactccctct 2880 gattgcttgc agggggcatc tgagcttgaa aaggatctca cttctgctac agcatggggc 2940 tcacagtggc tggtgaactt taattcagat aaaactcaat ttttttcggc caatcgttat 3000 cgcaataatt tagaccttcc tatatttatg aacggtaatg tactmgatga gtctcctacc 3060 cttcatcttc taggattaac tcttacttct gatctttctt ggaaacctta tatcaaatcc 3120 gttgcaaaat tagcatctgc taaagttgca tctctttatc gagctcgaca ctttcttact 3180 ctggattcta ttctctatct ctataaatct caaatccggc cttgtatgga atactgttgc 3240 catatctggg gcggatcttc taatgatgcc ctttctcttt tagacaaggt gcaaaaacgc 3300 attgtaaaca tagttggacc tgctcttgca gccaaccttc aacctctgtc acatcgtcgt 3360 aatgttgctt ctctttctct tttctacaaa tactataatg ggcactgctc taaagagcta 3420 gcgtctcttg tgccatctac taaaattcat tctcgtgtta ctcgtcattc aattaagtct 3480 catccttttt ctgttactgt tcctaagtgc tccaaaaact cttattcgtc tagttttttt 3540 cctcgaacat cagttctttg gaattcgctt ccttcatctt gctttcctga ttcatataat 3600 ttgcaatctt ttaagtcgtc tgtcaatcgt tatcttgctc tacaatcttc atcttttctc 3660 ttccagtaac ttccaacact aattagtggt tgcttgcagc cttgttggaa gcgaagatga 3720 ttaaaaaaaa aaaaaaaaaa aa 3742 // ID Gypsy12-LTR_Dpse repbase; DNA; INV; 579 BP. XX AC Unknown_group_699; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy12_Dpse; KW Gypsy12-I_Dpse; Gypsy12-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-579 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1086-1086 (2009). XX DR Genome; Unknown_group_699; Positions 3292 3870. XX SQ Sequence 579 BP; 206 A; 89 C; 121 G; 163 T; 0 other; tgtaaaggac agtgtatcaa gcagttaaca ctatcccatc taaattaaca tttgaactta 60 agataccatc gataagacct taccgatata accagactta accgatgttt aaaagaccta 120 gccgataagg ggttatcgat atgggagtcg gttgttggaa gtagaagcag aaagttgaac 180 aaaaagtcgg aagaacagaa tcattaattg cacatcttaa atcaaacact ttgtactctt 240 tttcgatcaa aacattatta ataaacgata ctttaatatt aatcttacat ttgggggctc 300 gtccgcgtcg aataaagagc tcagagtgtg tgaaaaagaa aaacagaaga aagcagaagc 360 tgatgcaaga agaggattgt tgaaaagttg ttaaagttgt tgtttgtagc tacataaagt 420 tgtaaagttg ttgttggtgg ctataaacat taagttgaac aatcaaaatt ctgtgagcat 480 aacagtagac acggatttta ataaaagatc agtcgtttaa ttcggttgaa aaattgtgaa 540 attcattgtc gcgtcgcagc gtcgccttgc ccgtgcgca 579 // ID BEL-25_CQ-I repbase; DNA; INV; 6045 BP. XX AC AAWU01010219; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-25_CQ_; KW BEL-25_CQ-LTR; BEL-25_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6045 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 203-203 (2011). XX DR GenBank; AAWU01010219; Positions 6770 726. XX CC Positions [5064-5642] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 621..6020 FT /product="BEL-25_CQ-I_1p" FT /translation="MLYSGLRKKLRYHTPREKFSTPGATPAKTVKQQKQEQ FT AAAREKSKVDESVEAYSQCCQAAQAKVQRVRVAIESAQLDHSKFNIHALNT FT YLKTVDTAYAEANEYLNKIYLAAPSRRGEFEPLFVDFEELYEFVRIALCQM FT IEEHTEAKAALQLAAAQNVKPLPVPLVQPGTSGIDSMTRFTPTIVLQQSAL FT PTFDGKYESWFKFKQMFRDIADKCAADSAATKLHFLDKALVGKAQGAIDPQ FT IIRDNDYEGAWRSLTEQFENLPALISETISRLLSLKAMTGDSFNQLKTLID FT EVEKCVSSLEFHKLKMDKLSEAIVITLVSSKLDADTRKVWESSVKRGQMPA FT YKQLISVLRNQQHVLERCENAKGIQKNRSSHPAARSAQTTAASKAHTAVIQ FT KAASSCPVCDEKHAVEKCDAFKRSEVKIRYEKAKQLGLCFACLKKGHRTSD FT CPEKVKCSKCAKRHNVLLHPEEKCAQEKSVPELKPAGSTESAPTTVAKCVI FT PCRATEPPTQVLLATAVVYVYDDNGDQHKCRVLLDSGAMANFVSQRMADLL FT QLKKRCVNIPVIGVSGMRTMVKFQVHVRAKSRASASEFCLDYLVVPRVTGA FT LPVQKVHIDGWPIPAGMDLADPTFFEPSRIDLLVGAEAFFEMLLSGKIKMS FT SDLPVLQESTLGWLVSGRVAGSAAVTTVRACQAISAPDVDAELTNLLKKFW FT TIDDQTVEPKPDDDDCERHFSETHVRAADGRYVVRLPFRKEIEELGASRQQ FT AEKRFNHLERRLDQNPAQKKQYSDFIKEYVDLGHGRVLAETECGTQDGYFL FT PHHCVLRPDSSTTKLRVVFDASARSTTGHSLNDTMMIGPPVQDTLFEILLR FT FRLHKYAFTADVPKMYRQVRVSPEDTKFQRIVWRDDRSKPLQVIELMTVTY FT GTAAAPFLATRALNQLAADEKEDYPEASKVVSSSFYVDDVLSGAATIEKAK FT QLQSDLVALLARGGFELHKWCANDAALLEDIPIAKQEKQLDFENHDAKDTV FT KTLGLLWNPVEDHFFFRVKPLDKDRDNWTKQQVLSEIAKLFDPLGLLGPTV FT ALAKMVMQETWRSGIGWNDVLPPPLMARWRKLRNALAELAEITIPRRVTTD FT DARSWELHGYADASAKAYGCSIYLRNVMVDGTEELFLLCGKSRVTPVKEAE FT RKQKDDAEPADMTMPRLEACAAELLAEQIVKVVKAIDIPIDRVVLWSDSQI FT VLSWLEYMKPGTPVFVRNRVNRIRELTSKYEWHYVSTKNNPADHISRGLLP FT KQLKKCGLWWKARHTENLGPIVMLCEKMVPDGNRVLEVISDCGSFRKLERI FT FGYVLRFVGNCKKKLEDRRCGRLGREDYCAALQAMVKAVQQEEFSEDLKRL FT EAGKPLNNKLNKLNPMVEKDSGLLRVGGRLSNSDLPYNQRHPMILPEKHHL FT TELIIETLHTENLHVGLNGLLASLRRRFWPVNAKRAIHRVLRRCVTCFRVK FT PTDTEQFMGDLPKCRVTVAEPFARTGVDYAGPVMLKQGRLKAPVKGYIAVF FT VCLCTKAIHLELVTSMSTEAFLAALHRFVSRRGNVSEMKSDNGTNFIGAAR FT ELTELAELLRSQMLERKLDEFCQARSIDWSFNPPKAPHQGGLWEAGVKSAK FT HHLYRVLNESHLTYEEMNTLLIQIEAVLNSRPLCQQTDDPLDYRALSPGHF FT LVGRELTALPEPLYDGLKENKLTRYQLVQKRKQDFYRRWCNEYLTELQQRG FT KWNKGASVVRKGMLVILKQDNVPPQQWRLGRIVDTHPGKDGVTRVVTVRTS FT SGEYRRPTTQVAVLPILDNETQELEETTA" XX SQ Sequence 6045 BP; 1500 A; 1550 C; 1874 G; 1121 T; 0 other; tttggtcgta ttcgatccgg atttgcgtga ccgcgagtga aaagtgattt ctcgattcgc 60 ggatcgggaa aagattcgat tccggctcgg ttgtccgcgt cggtgaagtg atttcggagc 120 gttcggagta aatccggctt gccgaaaagt gaactggcac gtggttgcgg ggcagaggac 180 gaaaagcgga gcgtccggga agattccggc ttgccgaaaa gtggttcctg ctgccgtaac 240 agtgctgggg aagtgtgaat cccgaaaaga ttcgactcgg gtgaaaaggt cgaattccgg 300 gaaaaaagtg attccgaacg ggccaggctc gtcggaaaaa agtgatttgg aacacgttgc 360 gacgtgtcga gaaaagtggc ttgctgcggc ttgctgcaaa gaagaaattt cggaatcttc 420 tcgtcgtttt cgtcgacgcc atcttgtgca agtgaaggaa cggccttttg cgcaagggaa 480 agaaaagcct tttcaaggcc agtggtgcca gtttgtggaa aaaaagtgag gaggaagatt 540 tgaaaagcga cgaaaaagag aagagcgaga aagtgggaga gttttgagtg gtcttaattg 600 agctgagaag aagaaagaaa atgctgtaca gtgggttgcg gaaaaagctg cggtatcaca 660 ccccgcggga gaagttttct accccgggtg cgacgccggc gaaaaccgtc aagcagcaga 720 agcaagagca ggcggcggca cgggagaaga gcaaggtgga tgaatccgtc gaggcctaca 780 gccagtgttg ccaggcagct caagcgaagg ttcagcgagt aagagttgca atcgaaagtg 840 cgcagctgga ccactcgaaa ttcaacatcc acgcactgaa cacttacctc aagacggtgg 900 acacagccta cgctgaagca aacgaatacc tgaacaagat ctacttggct gccccgtccc 960 gaagaggtga gtttgaaccg ttgttcgtag acttcgagga gttatacgaa tttgtgcgga 1020 ttgcgctatg ccagatgatc gaggagcaca ccgaggccaa ggctgcgctg cagctggcag 1080 ccgcacagaa cgtgaaaccg ctaccagttc cgctggtcca acccggaacg agtggcatag 1140 attcgatgac gagattcacc ccgacgatcg tcctgcagca gtcggcgttg cccacctttg 1200 acggcaaata cgaaagctgg ttcaagttta agcagatgtt ccgggacatt gccgacaagt 1260 gcgctgctga ctcggctgcg acgaagttgc acttcttgga caaggccctg gtcgggaagg 1320 cgcaaggagc gatcgatcct cagatcatcc gagacaacga ctatgaggga gcctggcgaa 1380 gtctgacgga gcaattcgag aacctgccgg ccctgattag cgagacgatc tcgaggctgc 1440 tgagcttgaa ggcgatgacg ggcgactcct tcaaccaact gaagacgctg attgacgaag 1500 tggagaagtg tgtgagttcc ctggaatttc acaagcttaa gatggataag ctgtcggaag 1560 cgattgtgat cacgttggtg tcttcgaagt tggacgccga cacacggaag gtgtgggaat 1620 cgtcagtgaa gcgtgggcag atgcccgcat acaagcagct gatctccgtt cttcgcaacc 1680 agcagcatgt cttggagcgc tgcgagaacg cgaaggggat ccagaagaac cgcagctccc 1740 acccggccgc ccgctctgct caaacgacgg cggctagcaa ggcgcacacg gcggtgatcc 1800 agaaagcggc tagttcctgt ccagtgtgtg acgagaagca tgccgtggaa aagtgtgacg 1860 cgttcaagcg tagtgaagtg aagatcagat acgaaaaggc gaaacagctg gggctgtgct 1920 tcgcgtgtct gaaaaaggga catcgcacaa gtgattgtcc agaaaaagtg aagtgctcga 1980 agtgtgccaa gcgacacaat gtgctgctgc accccgagga aaagtgcgcc caggaaaagt 2040 cagtgcctga attgaaacct gctggttcca cggagagtgc accgaccacc gtggcgaagt 2100 gtgtgatccc gtgcagagcc actgaaccgc caacgcaggt gctacttgcg accgcggtgg 2160 tgtacgtgta cgacgacaac ggcgaccagc acaagtgccg tgtgttgctg gactccggag 2220 ctatggcgaa cttcgtgtcc cagcggatgg cggacctgct gcagctgaag aagcgctgcg 2280 tgaacatccc cgtcatcgga gtcagtggaa tgcgtacgat ggtgaagttc caagtgcacg 2340 tgcgagcgaa gtccagagcc tcggcaagcg agttctgtct ggactacctg gttgttcccc 2400 gcgttactgg ggcgctgccg gtacagaaag tgcacattga cggatggcca atacctgccg 2460 gaatggatct ggcggacccg acgttcttcg aaccatcccg catcgacctg ctggttgggg 2520 ctgaggcctt cttcgagatg ctgctctcag gtaagatcaa aatgtcctct gatttacctg 2580 tgctacaaga gagcacgctg ggatggttgg tgtccggacg agtggcgggc tcagccgcag 2640 tgacaacggt tcgagcgtgc caagcgattt ctgctcccga cgtggacgca gagctgacga 2700 acctgctgaa gaagttttgg acgatcgacg atcaaacggt cgagccgaaa cctgatgacg 2760 acgattgcga acggcacttt tcggagaccc acgtgcgtgc cgccgacgga cggtacgttg 2820 tacgactgcc gtttcggaag gaaattgaag aactaggagc gtcacgccag caggcggaga 2880 agcgcttcaa ccacctcgaa cgacgacttg accagaaccc agcgcagaag aagcagtact 2940 ccgattttat caaagagtac gtcgacctcg gccatggcag agtgctggct gaaacggagt 3000 gtggcacaca agacggatac tttctgccac accactgcgt gctccgtccg gacagctcca 3060 cgacgaagct acgagttgtt tttgatgctt ccgcccgaag cactactggc cactccctga 3120 acgacacgat gatgattggg ccgcccgtgc aagacacgtt gttcgagatc ctactgcggt 3180 ttaggctgca caagtacgcg ttcaccgctg acgtgcccaa aatgtaccgg caggtacgag 3240 tgagcccgga ggacaccaag ttccagcgga ttgtgtggcg cgacgatcga tccaagcctt 3300 tgcaagtgat cgaattgatg acggtgactt acgggacagc tgcggccccg ttcctcgcca 3360 cacgagcttt gaaccagctg gcagcagacg agaaggagga ctaccccgaa gccagcaagg 3420 ttgtgtcgtc gagcttttac gttgacgatg tgctttcggg tgcagcgacg attgagaaag 3480 cgaagcagct gcagtctgac ctggtggcgc tactcgcaag aggaggtttc gagctgcaca 3540 agtggtgcgc aaacgacgcg gcgctgctgg aagacatccc cattgcgaag caagaaaagc 3600 agctcgactt cgagaaccac gacgccaagg acacagtcaa gacgctcggg cttctgtgga 3660 accccgtcga ggaccacttc ttcttccgcg tcaagccgct ggacaaggac cgtgacaact 3720 ggacgaaaca gcaagtacta tcggagattg ccaaactgtt tgatccgctt gggttgctgg 3780 gcccaacggt agcgttggca aagatggtca tgcaagaaac ctggagaagt ggcattggat 3840 ggaacgacgt gctgccacca cccctgatgg cacgatggag aaaactgcgg aacgcccttg 3900 ctgaacttgc cgagatcacg atcccgaggc gagtgaccac cgatgacgcg agaagctggg 3960 agcttcacgg ctatgcggat gcgtccgcca aggcctacgg ttgcagcatc tacttgcgga 4020 acgtcatggt ggatggcaca gaagaactgt tcttgctgtg cgggaagtcg cgagtcactc 4080 cggtcaagga agccgaacgc aagcagaagg atgacgctga accagctgac atgacgatgc 4140 caagactgga ggcgtgcgcg gccgagctgc tggccgagca aattgtgaag gtggtgaagg 4200 cgatcgacat ccccattgac cgagtcgtac tgtggtccga ctcgcagatt gtactcagct 4260 ggctggagta catgaaacct ggaaccccgg tcttcgtcag aaatcgggtc aaccgaatcc 4320 gagagctcac aagcaagtac gagtggcact acgtatcaac caagaacaac cccgccgacc 4380 acatctcccg tggtttgcta ccgaagcagt tgaagaagtg cggcttgtgg tggaaggctc 4440 gccacaccga gaatctgggc ccgatagtga tgctctgcga gaagatggtt ccggacggga 4500 accgagtgct ggaagtgatc tcggactgcg gcagctttcg gaaactggag cggatctttg 4560 gctacgtgct gaggttcgtt ggaaactgca agaagaagct ggaagaccga cgatgtggca 4620 gactgggtag agaagactac tgtgccgcac ttcaagcgat ggtgaaagcg gtccagcagg 4680 aggagttcag cgaagacctg aagcgcctgg aagccggcaa gccgctcaac aacaagctga 4740 acaagttgaa cccgatggtg gagaaggaca gtggactgct gagagtcggt ggccgcctga 4800 gcaactccga cttgccgtac aaccaacggc acccgatgat cctgccagaa aagcaccacc 4860 tcactgagct gatcatcgag actctacaca ccgaaaacct acacgtcgga ctgaacggtc 4920 tgctagcgtc gctgaggcgt cgcttctggc ctgtgaacgc gaagcgggcg atacaccgag 4980 tgctgcggcg ctgtgttacg tgcttccggg tcaaacccac cgacacggag cagttcatgg 5040 gcgatctccc gaagtgtaga gtgacagtcg ctgagccatt tgcgcgaacc ggcgtggact 5100 acgctggacc agtgatgctg aagcaaggac gtctcaaggc accagtcaag gggtacattg 5160 cagtgtttgt ctgcctgtgt accaaagcaa tacaccttga gctcgtcacc tcgatgtcaa 5220 cggaagcgtt tctggcggcg ctacaccgtt tcgtgagccg acgcgggaac gtgagcgaaa 5280 tgaagtctga caatgggacc aacttcattg gagcggcgcg agaactgacc gaactagcag 5340 agctgttgcg gtcccagatg ctggagcgga agctagacga gttttgccaa gcacgaagta 5400 ttgactggag tttcaacccg cccaaggccc cgcaccaggg aggactgtgg gaagcaggtg 5460 taaagagtgc caaacaccac ctgtaccggg tactgaacga gtcccatctc acctacgaag 5520 aaatgaatac gctgctgatc caaatcgagg cggtgctgaa ctcgcgcccg ctttgtcagc 5580 agactgacga cccgctcgac taccgtgctc tcagcccagg gcacttcctc gtgggccgcg 5640 agttgacggc gcttccggaa ccgctgtacg acgggctgaa ggagaacaaa ctgacgcggt 5700 accagctcgt ccagaagcgg aagcaggatt tctatcgccg gtggtgcaac gagtacctta 5760 ccgagctgca acagcgcggc aagtggaaca agggagcttc cgtggttcgg aagggtatgt 5820 tggtcatcct gaagcaggac aacgtaccac cgcagcagtg gaggcttggg cgaatcgtgg 5880 acacccaccc cgggaaggac ggcgtcacga gggtggtgac ggttcgcacc agctccggtg 5940 aataccggag accaacgacg caggtagcag tacttccgat tttggacaac gagacccagg 6000 aactggagga gacgacggct tgagccaagc tcaagggggg gagga 6045 // ID BR6_CP repbase; DNA; INV; 201 BP. XX AC K01695; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE C.pallidivittatus balbiani ring (BR6), secretory protein sp-Ic DE repetitive region. XX KW BR6_CP; CPBR6; Repetitive sequence; tandem repeat. XX OS Chironomus pallidivittatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RP 1-201 RA Galler R., Rydlander L., Riedel N., Kluding H. and Edstroem E.J.; RT "Balbiani ring induction in phosphate metabolism."; RL Proc. Natl. Acad. Sci. U.S.A 81(5), 1448-1452 (1984). XX DR GenBank; K01695; Positions 1 201. XX SQ Sequence 201 BP; 81 A; 38 C; 57 G; 25 T; 0 other; aaagagccta aatgcgatga tgaaatgaga gaaagggtca agagacgttg tgacaatgag 60 aatcgccgat ttgacgcaag aagatgtgaa tgcggtgaga agaaacgtcc agaagataac 120 gatgacgaag atcgcccaga acgtccagaa agacccgaaa gacctgaaag acctgaagaa 180 ccagaacgcg agcccgaaag a 201 // ID NOF_FB repbase; DNA; INV; 4347 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 20-MAR-2011 (Rel. 10.08, Last updated, Version 2) XX DE Putative MuDR DNA transposon from Drosophila. XX KW Rehavkus; DNA transposon; Transposable Element; MuDR; NOF_FB. XX NM NOF_FB. XX OS Drosophila OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae. XX RN [1] RP 1-4347 RA Smit A.F.; RT "NOF_FB - Putative MuDR DNA transposon from Drosophila."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC gb|X15469|FB|FBgn0002949|NOF FB 4347bp Derived from X51937 CC (g8297) (Rel. 44, Last updated, Version 6). XX SQ Sequence 4347 BP; 1445 A; 775 C; 885 G; 1212 T; 30 other; tatattctat tgcccaccat ataaacacgt gccactttcc tagttttagg atctgcctac 60 ataacacgtg cagacgcaca ggtgtttctg ggtttatata gaccaaaaat tggttccgat 120 tgccaatctt gtaatttaca gtttaccagg taattacata attttcaaac ctcactttat 180 gatagggtcc aattttttac ctgtgacaaa gtgttaaatt ttttaagaat gggtttttca 240 tggcaggtca gaatcctcta taaaatctaa aacacttgtc ggtatttgaa aatcgctctc 300 ctccttgatt ctcatattag gtgtaaaaga taaatccgga actcataatt aaaatatttt 360 ttatgtgaaa aagttgtgcg cgattttaac tacgcttacc cagtgctgga aaagttaaag 420 ttgttttgtt tttcaaagaa agtgaaagtt gctaagcacg aacttaagaa atctgagtga 480 ttgtgttaaa tttatttgaa tccttgtgaa ttttgttgac agtcttttta aagacttgca 540 aaattttcat attattcggt tcttgctttt atttttatac aacgcgtttt tcctttaggc 600 atacctttat acatttacag tgtaaacaac agtgtaaaac gtgtaaatca gtgcaaaata 660 gtttttttta tttactccat aaaaaataag tgttactgtc aggatgccgg ccaaaccgca 720 agtcgatggt cacaccttag tggatgcatt ttgctgcgcg aatattttta cggagactgg 780 agctcttaag ccaagaagcg ataaagtttg gatggatata agcaaccaat tgaaaggagc 840 gatcagcgcg aagacgctta atttctacgc cagaatcaat aggaataaca tgataactgt 900 ggttaaagaa cgatgtggaa ttcaacagct ggatactagt gccaatttaa ctttaaatag 960 cacatttcct gatgatgacc cggagttcca gatcaccgaa gcttcaaaaa atggaccatt 1020 gcctattttg tactttaacc tggagttgga cctggaattg tggagatcaa ttgcccccaa 1080 aaaggatcaa aaaactgaaa aactgcaacc taactggacg gatactatgg caaagttgat 1140 atacaaaaaa gttcctcttc cgtgtgcatt taattttaga aaagctaaac tttccgacaa 1200 agtggataat atttggctac gaattgaagg ctattgcaat gactgcagct caattttaaa 1260 gggacattgc cttgtgaaac ccgatgaaca atgcggcata atgatatctg tttcagtacc 1320 ggacacacga ggtatacctc ataataaaaa acgacggtgc actggatcga gacgacttga 1380 aattgggaac gagttgattt taaaaaaagc tgcattgtgg aggaaggaag ccaccgacaa 1440 catgaatgat gacgacccag aaccgagtta cataccaaat ttaccaaccc ttcggaaact 1500 tcgtgaagag gcaactaaca gacacctagg aattaccaag gatcgggatc cagtttcatc 1560 attatacctt aaaaagtatg agggtgaatt ggctggatgc attcttgaca ttggattgga 1620 tgaatttttc tgcatatact gcacaggaac ccaagtaaaa acatatgcat caaggataaa 1680 aactattaga aagatttcta ttgacgcaac tggaagcgtg gtgttaccca tccaaaaacc 1740 aaacggtgac tctagttatg tttttctgta ccaaattgta atggagggtg acgacagtat 1800 atttccagtt tttcagatgc tgtcggctaa acatgacaca gccagcatac agttttggtt 1860 aagcagattt atatcaaagt cggggcattt tccactggag gttgtatctg atttttcctt 1920 ggcattgcta aatggaataa gcttaagctt taatgagtgt aggattgcga cgtatataaa 1980 aaaatgtttc cacagccttt tgatggagga acggacggat ctgccaccct gctatattcg 2040 acttgacatc gcccacctaa ttaaaatgat atgccggaag aacgtcttca aaagtaaatt 2100 accgaacctc aaggattttt atactagatg tattggtctt gcaacaacgt gtgagacaaa 2160 ggacagtttt gcggaattaa ttaaatcagt actgattgtc gcactgagcc aatcctcagg 2220 ggaagatgaa aaaggagaca ttctttcaag ttacaggaat gaaaagtatc tgctcgccag 2280 aatagctaca tttactgccc cggatcacaa ggagaccatt gaggacaact gcataccaga 2340 ggaccaggag gaaattgacg aggatgttac ggactttatc tctaatatta aaatcgctgc 2400 cgaagaagaa gcgttaaatt gcaattcggt caactgtcgg ccaaatccgt atttcctacc 2460 tgagctaatg ccaccattaa ttaagttgtg caaatatttt gttttatgga caaacgtgat 2520 gaaggaaaag ttctgttcca aatatgatgt cggctcttcg gctcttgtgg aagcctattt 2580 caaggattta aaaaacacgg acatgagcat attccaccga ccagtgagag cggataaatt 2640 cgtggtgcaa catatccgat gcatcgaagc tgtttgcaag ctggaacgag ccgcgatgaa 2700 acgcaagacc gttaaaactc ccagctttat aaaagaaaac gctcctaaga aaatgtgcag 2760 taaggaaacc aagggatttc tggaggaaat acttgaagaa agcgaagtgg aatacctttt 2820 acaagaagaa aactggaagg tgaagaataa aacaataaag cccacggaag gaaatgatgc 2880 tgaagacaac gacactgatg atgaaaacaa ggaaatggat ttaagtgaac agcccaaaga 2940 aaaaccaagg ggaaaatatc tcaaaaaatg ccccaatgtg gagttattat acaatcgacc 3000 acatcgaagg aaacaggacg aaattttgca taatggtgga tcaatgggac ccgtctggat 3060 tggcaaacaa ttattgcaat tcaaaaatac ttgtccgttt gactctctag tggaaatatt 3120 gtcgaccgca tacatagaca atttttatta caaaagccta ttggatgatt tctacactga 3180 caacttgacg atagaattgg tgaaaaagta tgccgtcgag ggagtttcgt ccagtctcta 3240 ctgcgacaga ggtctggtcc taaaaagttt ttttgatgaa aaacaccaga ttataaaatg 3300 cgacgcaaat attgggtctt ttattgaaaa agcgctgaat ggagtaccca gtgcgtcaag 3360 tcatcggacc catataaaaa acaaccatga ttgcaggaac caaaaatata tccaccatcg 3420 gctggaggtt atagatgtcg aaaaagttgg ccacctcgac gtccaggagg tagtgatccc 3480 ctttattgat gagttttttg caagaactga tggagaatgt aaaatatgcg gtggacaaca 3540 gatccttgaa aggcagccag gaccgcatgt catacttgat atagaatttg caatggatgc 3600 ttttcatcaa attcatcata acggtttacc aggaacgacc actttacttc aagtgccgga 3660 ggaaatttta atacaggaaa agaaatatat tttaagtggt gccatcgaat atgttcctgc 3720 gatgggaggg gaaattggac attacattgc atattgccgc agagtcattg gatcttggga 3780 agtgcacaac gatatgtgca ggcaatggaa aaagttctca gctctaaata ccaaaatgac 3840 actccacatt ttgatataca cccggaaaaa ttaatgttta tttttaagcc ttgtttaaaa 3900 gtgtaaaaaa tatttgttgt taaaaattac aatcttaagt cctttgcaaa cgttgnnnnn 3960 nnnnnnnnnn nnnnnnnnnn nnnnncaaaa cttaaccctt tttcactttt atacctaata 4020 taaagaggtc cgtaaagtat caaggaggag agcgattttc aaataccgac aagtgtttta 4080 gattttatag aggattctga cctgccatga aaaacccatt cttaaaaaat ttaacacttt 4140 gtcacaggta aaaaattgga ccctatcata aagtgaggtt tgaaaattat gtaattacct 4200 ggtaaactgt aaattacaag attggcaatc ggaaccaatt tttggtctat ataaacccag 4260 aaacacctgt gcgtctgcac gtgttatgta ggcagatcct aaaactagga aagtggcacg 4320 tgtttatatg gtgggcaata gaattta 4347 // ID BEL-34_CQ-I repbase; DNA; INV; 6494 BP. XX AC AAWU01003986; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-34_CQ_; KW BEL-34_CQ-LTR; BEL-34_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6494 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 221-221 (2011). XX DR GenBank; AAWU01003986; Positions 16606 10113. XX CC Positions [5508-6086] - Integrase core CC 'ATTGG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1149..6494 FT /product="BEL-34_CQ-I_1p" FT /translation="MSRRSTVTRLRNLMTSFNGIYAFMESYDASKQAGELV FT MRLEKLEPLWDKIDDAITEAELADSEEGTGEEERGEDKGEEKSESGEKKGS FT KPERYANIRSAFQNKYFTVKAFLQSKIRELPDQAQALPMVSQSAHTTPSTQ FT HVKLPTISLPRFSGNYEDWLPFRDLFVSLIHSSDLPNIEKFHYLRNQLDGP FT AKTEIANVKFTADHYTVAWELLEKRFGNTKHMKKLEIKSLFDLPTLRKESV FT AELRELVEGFDKTVRVLDQVVEYAKYRDLLLVHLLCSRLDDKTLRSWEEHV FT STKTEETFQDLIDFLRRHICVLESLPTKHQESHQSKPRKAFATKVSNQNTV FT HSNPNFGCLACSDSHPLFKCPSFARMSVSEREKIVNQNALCRNCFRKGHIA FT RNCTSRFSCQRCKERHHTLVCYKPEGKPNLCPAIPARPTKEEAGTAATAES FT SSAAVAETRTANTSSAATVGKVLLATAVLLLVDDAGQEFPARALLDSGSEC FT NIVSTKLAQRMHVSRNKANVQISGVGQVPTKTSQKVRATVKSRLSKYCEVM FT EFYVLAKVTEDLPTSPVESTSWTVPEGVQLADPEFFRTNPIDVLLGGEFFF FT NFFPSKQRISLGAGLPSLVESVFGWIVTGRCGWNRGEAPIVCQHSTVTETL FT EEIMTKFWECEDGGFTSDYSVEESTCEDHYVRTVKRGGDGRYTVGMPKSAD FT LHIKLGESKSAADRRLLFLERRLARDDDLKKEYHAFMKDYLDRNHMCKIIE FT DPTSTATTYYLPHHPVIRSSSTTTRVRVVFDASSKTSTGTSLNDVLLNGPV FT IQDDLRTIISRSRLFPILLVADVEKMFRQIRMDAEDLPLQRIRWRFSEDDP FT IDTYELLTVTYGTKPAPFLATRTLKQLSVDDATKYPLAAVRIARDVYMDDV FT ITGAYHPAEAKQIREQLHTMTLGAGFPLRKWVSNCEEALEGVSEDNLALPR FT EKGIDFDEERTVKTLGLVWEPKTDTFRFKIESTLIPPNELTKAKILSIIAK FT IFDPLGLVGPVVAKAKIFMQGIWELKNAKGKPWDWNDPLPKSMLDEWMQFY FT EQLHYLNNLRIPRFAMIPNPVHIQLHFCSDASEKALGANLYIRSEDKEGRV FT KVSFFTSKSRVAPLKRQPIPRLELNGFWLAADMYRKFKECTTFRFETFFWT FT DSRTVLQWLAKPPRTWNAYVANRVSFIQHITQGCHLFHVPGVMNPADQLSR FT GLDPKEFIDGDWDPLWMYETSSWPAQATPEEETEDCAKERRKHVESAAAVS FT AKSFNESYFAKFNEYWKLIRTTAYWRRYLRNRRLPEAERTLSVPLTTKELQ FT EAEWCLARLVQQEAFSRELQDLSKGKPVHNTSKLRWFNPQLSKEGVIRVGG FT RLGNCDRNENFKHPIIIPGNHHFSKLLADCLHLRLFHAGTQLMLATMRQKF FT WPLRGRDLCRQTTHQCKDCFKAKPQLLQQFMGQLPTPRTIAARPFTNTGVD FT YFGPVYIRQGYRRGPVKAYVAVFVCFGTKAVHLELVSDLSTAKFLQALRRF FT VARRGKPADMYSDNGTNFVGAKNDLKDLLHNLRHEKHHEHIQHECSQEGIN FT WHFIPPGAPHFGGLWEAAVKSAKKHLLRVVGQSSISHEDYITLLAQVEACL FT NSRPLTPLTEDPSDLEPLTPAHFLIGTSLEAVPDKNYSEIPNNRLTHWQSI FT QQQLQHFWRRWHTEYLQQLQARVKNWQPAIEIRPGRLVIVVDENQPSMKRK FT MARIHEIHPGADGVVRVVTLRTATGYLKRPVTKICLLPIPAESIEEEKAET FT AEIGQFQGGSE" XX SQ Sequence 6494 BP; 1598 A; 1822 C; 1702 G; 1372 T; 0 other; tttggtcctt cgaaccggat cggttgaagg tacccgcctg gagtcggttt cccgcgtccg 60 gaacacgctg gaccaagccg atcggaccgc cattgtcgcg aggacgcaca ataggccacc 120 gcggacaatt ggatattgtc ggaggagccc tacaaatccc gagttctttg tcgaggaggt 180 acaaaggacg ttccgccatc gaggttccct gaggttggcg ccatttcccg agcgtcatct 240 cacccgatcc cagccaagga ttggcctgat tccaacggtt tggcactcat ccacggactg 300 tgccactaat tgacggcgat taccacgatc gccacttgta cgaggtttcc gagtttggtg 360 ccatttcccg agcgtcgttc cgaccgcgcc ggccagtgta cgagagttcc tgattttaac 420 tttacaaggc ccgtttaaat cgggagcagg ttagtaccgc cccaataccc taaaatcggt 480 ttagtcggtg ttacgctttg gcacttttcc ctcccgcgag tcaaatcgca tccaggtgtg 540 ccagcctcca gacgctgcaa cgttcccgaa cggtcatcgg aactgaccgt atccgtgctt 600 ggactgtccc ggtccacccc agcgaaccga ttcgcacatc ggtaccccgt tgtacgcgcc 660 cgacaaccga tcgttcgagc cgatttggat tcggcactgg attcgaacga gaccacgccc 720 accgaagctc gtaaggactc gagctacccg tcgaccggaa gtcggcattg gtctaccaca 780 ccctgtgtgg tcccactcat cgagagagag agagtgatcg tcgcctggaa tcgatcacta 840 cgtcgtgctg ccgattcggc gccaaccacc cacaatccca cccaccgaag tcgacagcgg 900 actgtcggcg agagcagcga tcgttcgtga accgtcccat cccacaaagc cctgaggttc 960 gtgcacaagg aggtcgctcc atttcccact ttgaggtgaa ggcccgtacc aggaaaacgg 1020 tgcataatta attcaaggca cgaattaatt ggcaatcggt tagtaccatt ccaatatttc 1080 cccgcgcatc ccaagggtgc tcctttcgtg cttgtttatt ctagctgccc atttgtacat 1140 ctgccacgat gtctcgtcgc tcaaccgtga cccgcctgcg caacctgatg acgtcgttca 1200 acgggatcta cgctttcatg gagagctacg acgcaagcaa gcaggccggc gagctggtta 1260 tgcggctgga gaagctcgag ccgctgtggg acaaaatcga cgatgccatc acggaggcgg 1320 agctagcgga cagtgaggag gggaccggcg aggaggagag aggtgaggat aagggtgagg 1380 aaaaaagtga gagtggtgag aaaaaggggt ccaaaccgga gaggtacgcc aacattcgga 1440 gtgccttcca gaataaatat tttacggtca aagcattcct gcagtctaaa attcgggagc 1500 ttcccgatca agcccaagcc ctgcccatgg tttcgcagtc cgcccacacc accccttcta 1560 cccaacacgt taaattgcct actatcagtc tgccaaggtt ctctgggaac tatgaggatt 1620 ggttgccttt tcgagatctc ttcgtgtcgc tgattcactc gtcggatttg ccaaatattg 1680 aaaagtttca ctatctgcga aatcaacttg atggtcctgc caaaacggag atcgcgaacg 1740 tgaaatttac tgcagaccac tacactgtgg cgtgggagtt actggaaaag cggttcggca 1800 acaccaagca catgaagaag ctggaaatca agagtttgtt tgacctacca actttgcgga 1860 aggagtcggt tgctgagcta cgcgaactgg tggaaggatt cgacaaaacc gttcgagttc 1920 tggatcaagt tgtggagtac gccaagtaca gggatcttct tcttgttcat ctgctgtgct 1980 ctcgtttgga cgacaaaacg ctgcgcagct gggaggagca cgtgtccacg aagacggaag 2040 agactttcca ggatctgatc gacttcctgc gtcggcacat ctgtgttctg gaatctctgc 2100 ctaccaaaca ccaagagtcc caccagtcca aaccccgtaa agctttcgcc accaaggtct 2160 ccaaccaaaa cacggtccat tccaatccca acttcggctg tttggcatgt tctgactcac 2220 accccctctt caaatgccca tcgtttgcga ggatgtcagt cagcgagagg gagaagatcg 2280 tcaatcagaa cgcgctgtgc agaaactgtt tccggaaggg ccacatcgct aggaattgta 2340 cttccaggtt ttcgtgtcag cgatgtaagg aacgccacca tacgctggtc tgctacaaac 2400 cggaaggaaa acccaaccta tgtccagcca tcccagcaag gccaaccaag gaggaggcag 2460 gaacggcggc gaccgctgag tcgtcctctg ctgctgtcgc ggagacacgt acggcgaaca 2520 ccagttctgc agctacggtc ggaaaggtgc ttttagcaac cgcagtactt ttgctcgtcg 2580 acgatgctgg acaggagttt ccagcaaggg cacttttgga ctctgggtca gagtgcaata 2640 tcgtctccac caaactcgct caacggatgc acgtgtcccg gaacaaggca aacgtgcaga 2700 tttccggtgt tggtcaagta cctacaaaaa cttcgcagaa ggttcgagcc acggtcaagt 2760 ccagattgtc gaagtactgc gaggtcatgg agttttacgt tttggccaag gtgactgaag 2820 atctaccaac ctccccggtg gaatcaacta gctggactgt ccccgaaggt gtccaactgg 2880 cggaccctga attcttcaga accaacccta tcgatgtttt gctcggcgga gagttcttct 2940 ttaacttctt cccatccaaa cagaggatct cgcttggagc aggactacca tccctggtcg 3000 aatcggtgtt cggctggatc gtgactggca ggtgcggctg gaaccgtgga gaagcaccga 3060 tcgtatgcca gcattcaacc gtaacggaaa ccttggagga aattatgacc aaattctggg 3120 aatgcgagga cggaggattt acttccgatt attccgtgga ggagagcacg tgcgaggacc 3180 actacgtgcg caccgtcaag cgaggtggag acggtcggta caccgttgga atgccaaagt 3240 cggctgattt gcacatcaag ctaggagaat cgaaaagcgc tgctgatcgt cgtcttcttt 3300 tccttgagag aaggttggct cgcgacgatg acctgaagaa ggagtaccat gcgttcatga 3360 aggattactt ggacaggaac cacatgtgca agatcatcga agacccgaca agcaccgcga 3420 ccacctacta cctcccacac caccccgtca tccgaagttc cagcacaaca acgcgcgtcc 3480 gtgtcgtatt tgatgcgtca agcaaaacct ccactgggac atccctaaac gacgttctgc 3540 tgaacggccc agtcatccag gacgacctgc gcaccatcat ctcccgaagc cgcctgtttc 3600 ccatccttct cgtcgcagac gtggagaaaa tgtttcggca gattcggatg gacgccgagg 3660 acctaccgct gcagagaatt cgctggcgct tctcggagga cgatcccatc gacacgtacg 3720 agctgctaac cgtgacgtac ggcaccaaac cggcgccgtt cctggccacc agaaccctga 3780 agcagctgtc cgtcgatgat gcaaccaagt acccactggc cgccgtgagg atcgcgcgag 3840 atgtttacat ggacgatgta atcactggtg cataccaccc tgccgaagcg aaacaaattc 3900 gtgagcaact gcacacgatg acgctgggag caggttttcc gcttcgtaag tgggtctcca 3960 actgtgagga agcgctcgag ggtgtgagtg aggacaactt ggcgctaccg agagagaagg 4020 ggatcgattt cgacgaagag aggaccgtga agactcttgg gttggtctgg gagccgaaaa 4080 ctgacacgtt ccggttcaag attgagtcga cgctgatccc accgaacgag ctgaccaagg 4140 ccaaaatact ctctatcatt gccaaaattt tcgatccttt ggggcttgtt ggaccggtgg 4200 tggctaaggc aaagatcttt atgcagggaa tctgggagct gaagaacgcg aagggcaaac 4260 cctgggactg gaacgatcct ctgccgaagt cgatgctgga cgagtggatg cagttctacg 4320 agcagctgca ctacctgaac aacctgcgga ttccaagatt cgctatgatc cccaatccag 4380 tccacattca actccacttt tgtagcgatg cttccgagaa ggcactgggt gcgaatcttt 4440 acatccgttc tgaggacaag gaggggagag tcaaggtgtc tttcttcacg tccaaatctc 4500 gggtcgctcc tctcaagcga caacccattc cccggctcga gctgaacgga ttttggctag 4560 ccgcagacat gtaccggaag ttcaaggagt gcacaacttt ccgtttcgaa accttcttct 4620 ggacggattc cagaacggtc ctgcagtggc ttgcgaaacc cccaagaacg tggaatgcct 4680 acgtggcaaa ccgtgtctcc tttatccagc atattaccca aggttgccac ttgttccatg 4740 tacccggagt catgaaccca gctgaccagc tgtccagagg actggacccc aaggagttca 4800 ttgacggaga ttgggacccg ttgtggatgt acgagacgag ttcctggcct gcccaagcaa 4860 cacccgaaga ggagaccgaa gattgcgcca aggagaggag aaaacacgtc gagtcagctg 4920 ctgcagtttc tgccaagtca ttcaacgaat cgtattttgc gaagttcaac gaatactgga 4980 agctgatccg cacgaccgcc tactggagac ggtacctgcg caatcgtcgt ttaccggaag 5040 ccgagcgtac cctgtcagtc ccgctaacca ccaaggaact acaagaagcg gaatggtgtc 5100 tcgcacggct ggtccagcaa gaagctttca gcagggaact ccaggacctg tccaaaggca 5160 aacctgtcca caacacctcg aagctacggt ggttcaaccc gcaactctcc aaggaaggtg 5220 tcatccgcgt gggaggacgg ttgggcaact gcgaccgaaa cgagaacttc aagcacccga 5280 ttatcatccc aggaaaccac cacttctcca aacttctggc cgactgcctg catctacgac 5340 tgtttcacgc cggaacccag ctgatgctag cgacgatgcg acagaagttt tggccgctga 5400 gaggacgaga cctatgtcgg caaaccacac accagtgcaa ggactgtttc aaggcgaaac 5460 ctcaacttct acagcaattt atgggacaac taccaactcc aagaacgatc gccgctcgac 5520 ctttcaccaa tacaggcgta gactactttg ggccagtcta cattaggcaa gggtatagaa 5580 gggggccggt taaggcttac gtagctgtgt tcgtttgctt tggaactaaa gcggtccatt 5640 tggagctggt ttcggacctg tccaccgcca agttccttca agcgctgcgt cggtttgtgg 5700 ctcgtcgggg aaagccagcc gacatgtatt ccgacaatgg cacaaatttt gtcggcgcga 5760 aaaatgacct caaagacctt ttgcacaacc tcaggcacga gaaacaccac gaacacatcc 5820 agcacgaatg ctcccaggag ggtataaatt ggcactttat ccctcctggc gctccccact 5880 ttggtgggct ctgggaagcc gctgttaagt ccgccaagaa acaccttctc cgcgttgttg 5940 gtcaatcctc catctcacac gaggactaca taaccctcct cgcccaggta gaggcttgcc 6000 tcaactcaag acccctcacg cccctcactg aagatccctc cgacctcgaa ccgctaaccc 6060 ccgctcactt cctgatcggg acatccctag aagctgtgcc ggacaagaac tattccgaga 6120 tcccgaacaa ccgcctcaca cactggcaga gcatccaaca gcagcttcaa catttctggc 6180 gtcggtggca tacggagtac ctgcaacaac tacaagctcg tgttaagaac tggcagcctg 6240 cgatagaaat tcgccctgga cgtcttgtca tcgtggtcga cgagaaccaa ccgtcgatga 6300 aacggaagat ggcacgaata cacgagattc atcccggcgc tgatggtgtg gtccgcgtag 6360 ttacgcttcg gactgctact ggatacctga agagaccagt tacgaagatt tgcctgttac 6420 caataccagc agaatcgatc gaggaagaaa aggcagagac tgctgaaatc gggcaatttc 6480 agggggggtc ggaa 6494 // ID BEL-24_AA-I repbase; DNA; INV; 5672 BP. XX AC supercont1.15; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-24_AA_; KW BEL-24_AA-LTR; BEL-24_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5672 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.15; Positions 2074885 2080556. XX CC 'CTAGA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 292..2130 FT /product="BEL-24_AA-I_1p" FT /translation="MSLVHSPKKTRSGVQHGDTSEEERENQNRLACAKESL FT IRKIGVLSRQRSKAADKIKRIGGILIDDDADVSVPKLKVYARNVDAIYSEF FT NDFHNQLAEILPDDAMDDHDQAYTRFEKDYHEVSTYIEEMLMDATKKETPD FT IKPQVIVQQQPLKAPIPTFDGRYENWPKFKAMFIDIMDRSNDTDAIKLYHL FT DKALVGAAAGILDARTMNENNYAQAWNILTERFENPRVIINTHIRGLLSLK FT KMTKESYRELRQLLDDCSGHIESLRYLKQDLLGVSELIVVHLLASALDAST FT RKQWEATIVRGTLPKYDDMMKFLKQQCNILENCETSAQVPPPRAERSNVKA FT ADQSKSSPRFNPKILTANSCGSASSEECDFCGEFHKNFQCEQLKNMSVPER FT VEKARSLGMCFNCLRKGHRLKDCPSDRKCLKCRQRHHTQLHDDTRFKSQQP FT ESKMHEPTASSESTPRPAVKEVQASTSSAPTSQGSHSVTSSCSIQATPTKK FT TVLLLTAVVHVYDSQGKPHCCRVLLDCGSQVNLITKKMAEAIGAKISSANV FT KISGVNNRTTTSMEKAIVKFRSRYSDYRATVECVITPAVTGRIPSTNIDVS FT DWRIQTSMCLKRLTC" FT CDS 2127..5648 FT /product="BEL-24_AA-I_2p" FT /translation="MLIGNELFFKIIKSGQYQLAEHLPELRDTHLGWVFTG FT ELEEVFHQGPSYSHTITVENVYEALQRFWSVEEVADQDPVTSEEVECEKHF FT RYTHERSDEGRFVVQLPLKENASQLSDCRSVALKRFHLLEQRLVRNPCLRE FT QYVQFIREYEQLGHCKQVSECKDSPNVQKCYLPHHAVIRPDSSSTKCRVVF FT DASAKPSQNSLSLNDVLKVGGTVQSDILDIVLRFRKHKYAFTADISKMYRQ FT IMVTEMHQPLQRFFWRESPEQPLKVYQLTTVTYGTASAPFLATRCLVQLAE FT DGKEKCPLGASTVREDFYVDDVLSGDDTLSAAIERQQQVKDLLASAGFPIH FT KWCSNSEQLLERIPVGDRETPKVLEEQGINTVIKVLGILWDPHSDDFLFSI FT KSPNLNSESKANTKRTILSEIAKLYDPLGFLSPIIVLAKLLLQQLWRSKVG FT WDEPVEDEISQQWMQLKKSLEATNQVHIPRRVLIDDAECNELHGFADASMS FT AYGACVFIRSIFNESAQLQLLCSKSKVAPLKAVTIPRLELCAALLLARLVN FT KVLQSIQMSFHRVVLWSDSQIVLAWLRKSPDQLQVFVRNRVAAIREETEDF FT EWMYVRSESNPADIVSRGQMPIELVKNELWWHGPDFLKKVNIENKPPEEII FT EDIPELRVAPTVLPVIEEEQLEVFSKYSSYRKLQRVIALVLRFANNCKKKD FT SASRVNKPIPTVAELRLAMNIIVKVIQREELSDEIARVESGEQCRKLGSLG FT PILVNGVLRVGGRLEHSKLPYGEKHPIILPCKSTIVRMLIRALHEEHLHLG FT PTGLVHVIRREFWLINAAATVRGVTRSCVKCFKVKPIDTSQYMGNLPSCRV FT THAPPFSVTGVDYAGPFHIKQGLRKVTTVKGYVAVFVCMVTRAVHLELVSD FT MTTSAFIAALQRFVSRRGIVHQLHSDNGTNFRGAHHELNQLYQAFKQEQDT FT HQIESFCVSKGIEWQFIPADAPEFGGIWEAAVKSMKSHLKRIIGNASLNFE FT QYVTVLAEIEAILNSRPLFVTSSRSDSPEVITPAHYLIGRPLTAIPEPSYE FT DIQANRLDKWQHLQMLREHFWKAWSREYLSSLQSRKKNQKLKSNVSPGMIV FT LVHNRNLPPLQWKLGVVTKTFPGVDGLIRAVDVYSDKSTFRRPINKLSVLP FT IEDNKAYFGQSQVLK" XX SQ Sequence 5672 BP; 1554 A; 1359 C; 1425 G; 1334 T; 0 other; ttggtccttc gagccggata tgaaccagcg acctatggat atacagtcca cattgctcgt 60 tcccccagac ggaacattgg tccttcgagc cggatctgaa ccagcgaccg tcccccgaaa 120 aagtgttaca gtccactttg ctcgttcccc cagacggaac attggtcctt cgagccggat 180 ccgtgatccg gtagtgattt agtgtgtgtg atcctccttg gactgagtta ctggtgaaaa 240 gagagacaga ctctgaaaaa gtgaagttga cgaaaccgga agtgttccaa aatgtctctc 300 gtgcatagcc ctaaaaagac acgttcggga gtgcagcacg gcgacacaag tgaagaagaa 360 agagaaaatc agaatcgttt ggcctgcgca aaggaatcgc tgatccggaa gattggtgtc 420 ctgagccgtc agcgttccaa ggcagccgat aaaataaagc ggatcggagg aattctgatc 480 gacgacgatg ccgacgtgtc cgtaccgaag ctgaaggtgt atgctagaaa cgtggatgcc 540 atctacagtg agtttaacga tttccacaac caacttgctg agatcctgcc ggatgatgca 600 atggatgacc acgatcaagc gtacaccaga ttcgagaaag attaccacga agtgtcgacg 660 tacatcgaag agatgctgat ggatgcaacg aaaaaggaaa cacctgacat caagccccaa 720 gtgatcgtcc agcagcagcc gctaaaagca ccgataccaa cgttcgatgg ccgatacgag 780 aattggccaa aattcaaggc gatgttcata gacattatgg acaggtccaa tgatactgat 840 gccatcaaat tgtatcactt ggacaaggct ctcgtcggag cagcagcagg aatcctcgat 900 gcccgaacaa tgaacgagaa caactacgct caggcgtgga acattctcac agagcgattc 960 gaaaatcccc gtgtcatcat caacacccac atccgcggcc tgctatcctt gaagaagatg 1020 acgaaggagt cctatcgtga attgcgccag ttattagacg attgttccgg gcatatcgaa 1080 agcctacgtt atctgaagca ggacctccta ggcgtgtcgg aactgatcgt ggtgcatttg 1140 cttgcatcag ccttggatgc ctctaccagg aagcagtggg aagcaaccat cgttcgagga 1200 acgcttccga aatacgacga catgatgaag ttcctaaagc agcagtgtaa catcctggag 1260 aattgcgaaa catcagctca agtccctcct cctcgtgccg aaagatcgaa cgtgaaagcc 1320 gcagaccagt cgaagtcaag tccgagattc aatccgaaaa tcctaaccgc taatagctgc 1380 ggcagtgctt caagtgaaga atgcgacttc tgcggcgaat tccacaagaa cttccagtgt 1440 gagcagctga aaaacatgag cgtacctgag cgagtcgaga aagcacgttc cttaggcatg 1500 tgcttcaatt gcttgaggaa gggtcatcgg ctgaaggatt gtccatccga tagaaagtgc 1560 ttgaagtgcc gccaacgtca tcatactcag cttcacgacg atacaagatt caagtcccaa 1620 cagcctgaaa gcaagatgca tgagccgaca gcttcatcgg agtccacacc gagaccagct 1680 gtgaaagaag ttcaagcaag cacatccagt gcacccacat cgcaaggctc acacagcgtg 1740 acgtcatcgt gttccatcca agcaactcca accaagaaaa ccgtcctcct tttaacagca 1800 gtggtgcacg tttacgattc ccaaggtaag cctcactgtt gtcgagtcct gctagactgc 1860 gggtcccagg tgaacctcat aacgaagaag atggccgaag ccatcggagc aaaaatttca 1920 tctgcaaatg tgaagatttc cggcgtcaac aatcgtacga caaccagtat ggagaaagcc 1980 atcgttaagt tccgatcccg ttatagtgat taccgtgcca cggtagagtg tgttatcacc 2040 ccagcagtga ctggtagaat tccgagtacc aacatcgatg tgtccgattg gcggatccag 2100 acttcaatgt gcctcaagag attgacatgc tgattggcaa tgaactcttc ttcaagatta 2160 tcaagtccgg tcagtaccag ttggcagagc atctaccaga gctgcgtgat actcatcttg 2220 gttgggtatt cacgggagag ctagaagaag tgttccatca aggtccatcg tattcccaca 2280 cgatcaccgt cgaaaacgtg tacgaagctc tgcaacgttt ttggagcgta gaagaagtag 2340 ctgatcaaga tcctgtgaca tccgaagaag tagagtgtga gaagcatttc cgctacacac 2400 atgaacgcag cgacgaaggg cgattcgttg tgcaacttcc tttgaaagaa aacgcctccc 2460 agctgagtga ctgcaggtcg gtcgctttaa agagattcca cctcctcgaa caacgcctcg 2520 ttcgaaatcc gtgtctccgt gagcagtacg tgcaattcat cagggagtat gaacagctcg 2580 gccactgcaa gcaagtaagt gaatgcaaag attctcccaa cgtccaaaag tgctatctac 2640 cgcaccacgc tgtcatccgt ccagatagtt cctcgacgaa gtgtcgagta gtgttcgatg 2700 cttctgcaaa gccatcgcag aacagccttt ccctcaacga tgtccttaag gtaggtggta 2760 ctgtccaatc cgatattctc gatattgtcc tgcgattccg caagcacaaa tatgctttca 2820 cggctgatat aagcaaaatg tatcgccaaa tcatggttac tgagatgcat cagccactcc 2880 aacgattttt ttggagagag agtcccgaac aaccgttgaa agtatatcag ctaactaccg 2940 ttacatacgg cacagcgagt gcaccctttc ttgcaactcg ctgcctggtt cagttggctg 3000 aagatggcaa agagaaatgt cccctgggag cgtcaactgt gagggaagac ttttacgttg 3060 acgatgtgtt gtctggcgac gatacgttga gtgcagcaat tgaacgacaa cagcaagtta 3120 aggatttgct ggccagcgct ggtttcccga tccataagtg gtgttccaat tccgagcagt 3180 tgctggaacg cattccagta ggtgatcgtg aaactcccaa ggtcctcgaa gagcaaggta 3240 tcaacaccgt catcaaagtg cttggtattt tgtgggatcc tcacagcgat gattttctgt 3300 tctcgatcaa atctccaaat ttgaactccg aatccaaagc caataccaag cgaacgatat 3360 tgtccgaaat tgccaagctg tatgacccac ttggattctt gtcgcccata attgttctcg 3420 ccaagttact attgcaacag ttgtggcgta gcaaggtggg ctgggatgag cctgtcgagg 3480 acgaaataag ccaacaatgg atgcagctga agaaatccct tgaagcaacg aaccaggtgc 3540 acattccaag gcgtgtgctg attgatgatg ctgaatgcaa cgagcttcat ggttttgctg 3600 acgcttcgat gtcagcctat ggagcgtgtg tgttcattcg aagcatattc aacgagtcgg 3660 cgcaactaca attactctgc agcaaatcca aagtagcccc tttgaaggca gtgacgattc 3720 cacggctgga attgtgcgcg gcactcctgc ttgcgcgttt ggtgaataag gttctgcagt 3780 caatccaaat gtcgtttcat cgtgtggtgc tctggtcgga cagtcagatt gttctggctt 3840 ggttgcgcaa atctcctgac cagcttcaag tgttcgtgcg caatcgtgtt gctgcaatcc 3900 gagaagaaac agaagatttc gagtggatgt acgtccggtc ggagtccaac ccggctgata 3960 tagtgtcgcg tgggcaaatg cccatcgaat tagtgaaaaa tgagttatgg tggcatggac 4020 cagacttttt gaaaaaagtg aacattgaaa ataagccacc agaagagatt attgaagaca 4080 ttccggagct acgtgttgct ccaacagtgt tacctgtaat tgaagaagaa cagttagaag 4140 tgttttcgaa gtacagttcg tacagaaagt tacaacgtgt aattgcgctg gtgttgcgtt 4200 tcgccaacaa ctgcaagaag aaggattcag cttcacgtgt gaataagcct atccctaccg 4260 tggctgaact tcgtcttgca atgaacatta ttgtgaaagt gatacaacga gaagagctca 4320 gcgatgagat tgctcgagtc gaatcgggtg aacagtgtcg aaagcttggt tcgttgggac 4380 cgatccttgt aaatggggtg cttagagtcg gaggacgttt agagcattcc aagctaccat 4440 atggtgaaaa gcatccgatc atcctgccgt gtaaaagtac gattgtgcga atgctgattc 4500 gtgccctaca tgaggagcat ctgcatctgg gaccaactgg tttggttcat gtgatccgaa 4560 gagaattctg gttaatcaat gctgctgcga ctgttcgtgg agtgaccaga tcttgcgtga 4620 agtgtttcaa ggtcaagccc atcgatacaa gccagtatat gggaaatctg ccgtcctgtc 4680 gcgtgacaca tgcgccccct ttttccgtca ctggcgtaga ttacgctgga ccgtttcata 4740 tcaagcaggg cttgcgtaag gtgaccacag tcaagggata tgtagcggtt tttgtatgta 4800 tggttactcg agccgtgcat ctagagctgg tgtcagacat gactaccagt gccttcatcg 4860 ctgcgctaca gaggttcgtg agtcggcgtg gaattgtgca tcagctgcat tccgacaacg 4920 gcaccaattt tcgtggtgca catcatgagc tgaatcaact ctaccaggcg ttcaaacagg 4980 agcaagatac ccaccaaatc gagtcgttct gcgtgtccaa gggaattgaa tggcaattta 5040 ttccggcgga tgcccccgaa tttggtggta tatgggaagc ggcagtcaag agtatgaaga 5100 gtcatctgaa gaggatcatt ggaaatgcat ccctgaattt cgagcagtat gtgactgtcc 5160 tggcagaaat cgaagcgatc ctgaattccc gtccgctatt cgtgacatcg tcaaggtcgg 5220 acagtccaga agtgattacg cctgcccatt accttatcgg gcgtcctctc accgcaatac 5280 cagagccgtc gtatgaagac atccaagcca accgattgga taagtggcag cacctgcaaa 5340 tgctccgaga gcatttctgg aaggcttggt cccgcgaata cctgagtagc ttgcagtcta 5400 gaaaaaagaa ccagaaactc aaatcaaatg ttagtcctgg aatgattgtt ctggtgcaca 5460 accggaattt gccccctctg caatggaagc tgggtgtcgt aaccaagacc tttcctggag 5520 tagatggctt gatacgagcc gtagatgtct acagcgacaa gtcaacgttt cgccgcccga 5580 tcaacaaatt gtcagtcctg ccgatagaag acaacaaggc ctacttcggc caatctcaag 5640 ttttgaaata gacatttcaa cgtggcggtg aa 5672 // ID ISL2EU-2_HM repbase; DNA; INV; 4359 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A family of autonomous ISL2EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4359 RA Jurka J.; RT "ISL2EU-type transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2059-2059 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 124..1710 FT /product="ISL2EU-2_HM_1p" FT /translation="MKVNCAVIGCTNSSYKLEKLKNKTCFQHEGKILSECG FT CELPFRMFCFPSLKRNSEKRKHWISQLRREGKKKGSAWEPGTADRVCSDHF FT VDKIPTVMNPNPTINMGFDQPVQKKARRTLLKHPITSLHCSNKMQPVEPLL FT TEKSIVTKFPSVSFNADVSSPILSDHTYSMSLPLPNKCTSCEYKSSLITSY FT VSKVNSLTNQLKKLKIKQTVKSKQKFSWRLVNNDKKMNFYTGISSIAIFNV FT IFGLLKPFLPSIRYWRGPKHSRSKVKQLKSVSKCKLLSHREELLMTLMRLR FT LGLLNEDMADRFGISKSLCSNTFTTFIRIIANILGQAIIVWLPSEVIKKNL FT PQSFVKAKHHKCRVILDCFEIFIERPKSLYNQAVTWSDYKHHNTVKVLIGI FT APNGYITFLSKCYGGRASDKFITSDSGFYDLLERDDEVMADRGFQIREELL FT FRYCSLSVPPGARVKSQMTASECKKTTDVANLRIHIERAINRIKTFRILKN FT VLPISMLHHMDDIILSCAALCNLKPALIKKIV*" XX SQ Sequence 4359 BP; 1457 A; 684 C; 683 G; 1534 T; 1 other; caccagttcg agcgaagttc tgccattatt aataccgcgc aaaaaaaagt cctggaacga 60 atttagaagg gtaaatattt attttctcag ttgcgaatta gtgataaaat atatagtgat 120 aagatgaaag ttaattgtgc tgttatagga tgtaccaata gttcctataa gctagaaaaa 180 ctgaaaaata aaacgtgttt tcaacacgaa gggaagatat tatccgagtg tggatgtgag 240 ttaccattcc gaatgttttg tttcccaagt ttaaagagaa atagcgaaaa aagaaaacac 300 tggattagtc agcttagacg agaaggaaag aagaaaggaa gtgcatggga acctggaact 360 gctgatagag tatgctctga tcattttgtt gataaaatac caactgttat gaaccctaat 420 ccaacaataa atatgggttt tgatcaacct gtacaaaaaa aagctagaag aacattatta 480 aagcacccaa tcacttcttt acattgttct aataaaatgc agcctgtaga acctttgttg 540 actgaaaaaa gtattgtcac caagtttcca agtgttagtt ttaatgcaga tgtttcgtca 600 cctatattat cagaccatac ctattcaatg agtttaccac tacctaataa atgtacttca 660 tgtgaataca aatcttcact tatcacctct tacgtgagta aagtaaatag ccttactaat 720 caattaaaaa aattaaaaat aaaacaaaca gtcaagtcta aacaaaaatt ctcttggaga 780 ttagttaaca atgataagaa aatgaacttt tatactggta tatcttccat agctattttt 840 aacgttattt ttggtttatt aaaaccattt ttaccctcaa ttcgttattg gagaggccca 900 aaacattcac gtagcaaagt taagcaactc aaatctgtat ctaaatgtaa actgttatca 960 cacagagaag aacttttaat gacacttatg cggttgcgct taggtttatt aaatgaagat 1020 atggctgatc gttttggtat ttcaaaatct ttgtgttcta atacttttac aacttttatt 1080 agaattattg ctaatattct cggacaagct attattgttt ggctgccaag tgaagtcata 1140 aaaaaaaact tgccacagtc ttttgtgaaa gcaaagcatc acaagtgtag agtgattctt 1200 gattgttttg aaatatttat tgaacgtcca aagtcccttt acaaccaagc agtgacttgg 1260 tctgattata agcatcacaa cactgtgaaa gtattaattg gtatagcacc aaatggatac 1320 ataacgtttc tatccaaatg ttacggtggc agagcatcag ataaatttat tacaagtgac 1380 agtggttttt atgatctgtt ggaaagagac gatgaggtga tggcagacag aggatttcaa 1440 ataagagaag aactattatt tcgttattgc agcttatcag tcccaccagg tgcaagagta 1500 aaaagtcaaa tgactgcaag tgaatgtaag aagactactg atgttgctaa cttaagaatt 1560 cacatagaaa gggctataaa cagaataaaa acttttcgaa tcttaaaaaa tgtattacct 1620 atttcaatgc ttcatcatat ggatgatatt attttatcgt gtgcagcttt gtgcaattta 1680 aagcctgctt taataaaaaa aattgtttaa atttgtttaa aatgtatata tattttttaa 1740 tttagtacta ctcttaaatg cttaatacaa ggaagggtgt aaattttatt tctgacctta 1800 taaataatgg aatcttaatt taagggttgg cttttttttt ttttttacag aacatgcata 1860 aatcaattta agttagtgtg gattgtggtc aactgcgcat ctttaacaac tgttttataa 1920 ttacaataat attttagagg agytttggcc accctgtgta taaagtgagg aaaagcttat 1980 tttattaaaa tcaaaacatt attttcaaca aagattaaag agattttttt aattttcggt 2040 gttaataaat gggaggggga gtggtcttaa taaacatttg tgggtgggaa aacttttgga 2100 aaattactaa acttcctccc cttcttgtat taaacacacg atagtactga ggttgtattt 2160 ttttatttaa ttactgaggt tgttttttta atttttaatt acttgtaagg gatgccggac 2220 accaccgtgc acttttaatg aatacatgtg tttcatttta ttacttcttt ttactataat 2280 aagaggcgtg gataaacttt ttgattttga gtatattctt tacctaaata ttgttacacc 2340 attatttgga tcttattgtc ttcttttata ggaaaaaaaa aatctataaa tgccaattta 2400 aatataagaa gggttttata atttaaaaca cagattttca ttttcataat tttattccaa 2460 tcagttaata gtaaaatcta caaagacagt atttaagtaa tgatcgttgt aatacttaaa 2520 aaaatcaact ttcatactca accaaccagc ttcatcaaat gcaattttat caattatata 2580 accatgagct gtccatacca taaaataaca attaatcatc ttagcagtcc ccatttgaac 2640 ctgacattga gtatagtatc tatgcttttt gttaatgaca aatttttcat ttaacttttt 2700 tgtaatataa ggaagagata tattaggatc ttgtggtgat gtatggttta tagaaaatgg 2760 acattttatt tctaaacaag ctggcgggca acacgaacat cgaacaattc tatcagggct 2820 tccacctata tagggcattt ctacatctaa aaataatcca cattcattta cagttaaatt 2880 ttgatgaaag gatttcataa cttcaaaaaa ctggtttgct gcttcaactt ccatcgctct 2940 cccgtacttc aatgcaggta tatctgggtt cacaaaagtt aaccctgaaa tactttggtg 3000 taatgaccac atgtcaactt taatgcccat gttaatcttt ttaatttttt ttagaacatc 3060 gtgagctttt gaagcagtta taactcgctt tcgaaaatta aaccaaagct tatttgtgtt 3120 ttgacccata gtcagacgct ctatatcagt aattattttt tgagtcatat ttgtttttat 3180 attttcacaa aaagaaaatt tattgtctga cataacaata atatcatcca tggtcaaaat 3240 tgtaggtttg gtaacatttt caaacacgat taattcttcc atataataaa tttcaggttt 3300 aggtacagca gaagttataa tgctatgagg agcaacagtt tcgattgctt ttgctatgtc 3360 tgttaaattt aaaagcttta tatcacagtt tactaatggg ctgtaattac gcttttcatt 3420 tgatactagt gaacgctttt ttcttccacg ctgaccaaat tcatcacgat caaattttaa 3480 gtcgcatatt tttgttggtg ctactatgct acgatttgga agccactctg attttgaact 3540 agtacaagct acatttgtca aaccatttct aacagcagcc tctattcgaa agaggcttgc 3600 agcaacatga ttacatgtct gcgatagacc agctagacaa gtacaatgtg ctgaaattat 3660 tttagcagtt tttttattaa taataatcca tagcttgtga aagggatcat taattttttc 3720 agatttccta cagttccctt ttaaatagca atattcactc tctattccta tttgatggta 3780 aaataaaggt tcaagccagc cacatttaaa ataactataa gctttacacg ttttgtagtc 3840 acttaagtct ttacttgcaa tttcagcagg gttaaataac aaataattgt aaatatcagg 3900 atataacaat gttggccaac aagttagacc ttcatcttct gataaccaac cattactaag 3960 cttaaaagga tcaggtagtt caacaccgtt tataaaaagt ttttttttgt actcagtaag 4020 tatatcatgc tccacctctt cagctgtttt aataggcgta acattgtttt ccattgcaca 4080 atagactcga gctgcaagaa tttctttctt tccagatatt ttcaagccac gaagacgtaa 4140 ataagacttc agttcttcaa ccttcatact acacactgta tcataatcca ttttgattta 4200 tatatgttat atatttgttt taatagtgat aggatttgtt ttagttgttt gtttatactt 4260 cgtttactca agaattaaat tttacgactt gatttgtttc ccttataaac tcgatccagt 4320 cttttctttt aattctccgt aacgcggctc gaactggtg 4359 // ID Hogri1 repbase; DNA; INV; 3437 BP. XX AC . XX DT 10-OCT-2009 (Rel. 14.1, Created) DT 10-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Hogri1 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hogri1. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-3437 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1281..2219 FT /product="Hogri1_1p" FT /translation="MLGNVIESAFKEVDALKDMLNQAKELVKYMENSGAFL FT NLETPLVNYSPNEWKSIYHMLKSIESHWTELINLCGDISGHNQQEISSVVN FT LLGHFEKCCRTLEDENLPTLHLVIPHLHRLRKQCSQPSQTTVESNMKLKLM FT DLLESLVQNHIVEFHKMAMFLFPPTNQLLQCTRTERNQTIEKCRMLMKRFC FT PQESSQNCDTLATVKQEEEDDLFSDFVTAHTPEDMILKEIRRYSDLQVPLC FT DGYNVLNWWYFHKDNFPLLYPLSCRILGTPASSAASEKVLAKAQSLLTHTN FT FSHVSHHCIANKIMFLNNNSY" XX SQ Sequence 3437 BP; 1072 A; 675 C; 707 G; 983 T; 0 other; actcatttat atatatatag gtatatggtg agcgcaatta tacaattcat ttaatcacac 60 atgtttaata aatatttaaa tatttcaaga atcaagcaca taatgaaaac acagtcgatt 120 tgctacagca cggcaacagt tcgatcaaaa atcgaaactg gcgaatttaa ggtgatttcc 180 aagcgaagtc gaagtcgagt gtggaatgta ttttcccggg ttgtggataa aaatggaacg 240 gaactcagga atgtggtttg ccgatcatgt ccgagtatct tcaagttcca cggcagcact 300 tcgaatttgg ttaggcacaa gtgctacaga aatgccattg attttggcca aacaggctct 360 gatgggaata agcgaatatc cagtgatctt gaaatgagca ctgaagatca ggattatggc 420 gttgatttag atccactgga tgaagtgaag agttccgacg cattgctgga tgatgaggac 480 acaaagggga gacttgcaca agccctcgcc gagtgggccg tcgagaattg ccgagccttg 540 gatattttcg aggataccgg attaaggaag ctagcagcat tatttattga actaggcgct 600 tattttggta cccaaatcaa agtggatgat ctcatgccgc agacagcggc cataactgcg 660 aatatataca actgctatca agacaaattg gagaaggtgc gccgggatgt gtgtgtggca 720 aggataaatg gattcagcat cacatgcaac acccgaacag ataattccat ggagaatacg 780 cagctttgcc tgaccatgca ctacatcaag gatggtaagc tattcagtcg tctgctcaca 840 gttgccagcc gggatgcgga gtcctgcaca ggtaattatc aaacttaaat aactcaaaca 900 gaattattgc tcattgattc gaacaaaaac gttgagcttt tacttaatga atcagatttt 960 taactcaata tttgaattgt gtgcactcaa atcggttttt aactatcatt tcaaattgaa 1020 tctatgactt acatttaaat taatttttaa caatgatttc aaattgaatc tataataatt 1080 cgattgaata tatttgaaga atatctcaca ttatatctaa tcaaaatttt atcgcaggtt 1140 ctctattgaa aacacaaatc aatacaattt tgagtgattt ccagtgcgtg tgggaggaag 1200 ataaaccaat ttttgtgaca tcgagtgagt ctatccgaga tgccatcatc ggagaagggc 1260 aagtggattg cattagcaaa atgttgggca atgtcattga atctgcattt aaggaagtcg 1320 acgctctgaa ggatatgtta aatcaagcca aggagctggt taaatatatg gaaaactctg 1380 gtgcattctt gaacttggag acgccattgg tcaactactc gccaaacgaa tggaaaagca 1440 tttaccacat gctaaagtct atcgagtccc attggaccga gcttatcaat ttatgcggtg 1500 atataagtgg tcataaccaa caggaaattt cttcggttgt caatcttctt ggacactttg 1560 agaaatgctg ccgaaccttg gaggatgaaa atttgccaac tttgcatttg gtcataccac 1620 atttgcaccg actgcgaaaa caatgcagcc aaccgagtca aactacagtt gagtccaata 1680 tgaagctcaa actgatggat ctcctggaat cactggtgca aaatcatatt gtcgaatttc 1740 ataagatggc catgtttctg tttccgccaa ctaatcaatt gcttcaatgt acacggaccg 1800 aaagaaatca gacaattgag aaatgcagga tgttaatgaa acgcttctgt ccccaagaga 1860 gcagccaaaa ttgtgatacg ttagcgaccg tcaagcagga ggaggaagat gacctcttct 1920 ctgattttgt aaccgcgcac acgcccgagg atatgatcct taaggaaata cgcagatatt 1980 ccgatcttca agtgccactt tgcgatggtt acaacgttct caattggtgg tatttccaca 2040 aagacaactt tccgctcttg tatccgctca gttgcagaat tttgggcaca cctgcgtcaa 2100 gtgcagcctc ggagaaagta ttagcaaaag cgcaaagttt attaacccac acgaatttta 2160 gtcacgtttc tcatcattgt atagcaaata aaataatgtt tttaaacaac aattcttact 2220 aaactacaca ataagcaaat taattttaaa atatgaataa ctaaaaaata ttaaagggaa 2280 gtgcaactgg ttcaaattta ttcaaatata tcgctgaact caaattgaaa cttgaatcaa 2340 atatacgatt gtgatttatt cacatgattc gatattttag ccatatagtt ctgatagttt 2400 gcggcggttg gccgatagcg aactgataaa tttcagtgct agcatacgct gcctctcgcc 2460 gttgttcaat actcgcgcac agagaaaatt ttggaagaga agaagctagt tgttattcga 2520 tctttcgtgc atttgaaaca aatagaattg tgtaaaatca actacaaagc gagcgcggag 2580 cgtgtaaaat gcatttgtat gaaatcaggc ggtaaagcaa atgcatagat taagaagaat 2640 gtgcgtgtgc accacgcaca cattgttaga tttttcggtc cccatttcgc tgctgctcac 2700 aaactgacgt caacgttggc gtcgctgcct acgtcgccgt catcgtcatc atacgccttg 2760 tcgtcaatga gagcgaagag cataacagtg tctaccaatt tgacggcata ttgtgcataa 2820 aagagagaga gacaatggtg agcgcaacag caacaacaac gaagtcaaaa gaaaaatgtt 2880 ttcatatttt atactgcctg tcatcgcgga aaagtggaaa attgcaaaaa accgcaattg 2940 ccacggcaat aattgtgcac taattagccg attgctgacg cgacctaacc aaaaattgca 3000 tttctctgtc aacagtgacc ccaaattgca acagctgatg ctgtgtggcc aggcaacaaa 3060 aaaagaaaaa aatcattgtc acctctctgc cgctggccct ccgttgctgc tgctgctgct 3120 gcactgcttc tgttattctc ttattgtgcc aagcacagtt ccaagctgtc ccccctcacc 3180 cctgttgccg cccactgcaa agttcgccca gtgaatgatt cacatttttc ttttgctagt 3240 ttttcttttt tcttctcttc tatgcttttt tatgtgaatt tttgtttttg ttgttgttgt 3300 tcgctgtgcg ctgcgcgagt tgttggcaaa tgcgtgcgcg caattaattt gcaaaaaata 3360 aaaaaaattc gaaaattgta acagtgctta taaatattta tgcattgact gctgtaaata 3420 tatatttata aatgtgt 3437 // ID Gypsy-21_DYa-LTR repbase; DNA; INV; 202 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_DYa_; KW Gypsy-21_DYa-I; Gypsy-21_DYa-LTR. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-202 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 1376778 1376979. XX SQ Sequence 202 BP; 68 A; 39 C; 38 G; 57 T; 0 other; tgtagtaggc tgaccctctg ggcacactca tatatcgata cttacactac acgatgacct 60 atgttaaacc gatgttaaac ctatgttaag ataatgttaa gctgatgaac attaagagtc 120 aatctgtttc tgcgagtcaa ataaagaaga agtatagaaa gaaataaaac tgtcgcgtgt 180 gtaatctttg ccgccttcca ca 202 // ID Copia-11_SI-LTR repbase; DNA; INV; 189 BP. XX AC AEAQ01016250; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_SI_; KW Copia-11_SI-I; Copia-11_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01016250; Positions 1094 906. XX SQ Sequence 189 BP; 48 A; 49 C; 30 G; 62 T; 0 other; tgttaaggat agcgcgatct aacgtcacat ttggtagtca cgtagtacgc cctcctttct 60 tttctttgtc tagcgcgcac gctctcttac tagagtcgtc tcccgccttt tgaaaagtat 120 cactgtactc attatacaat aaagtatttc tttatacaac agcagtcgat tcatgcataa 180 cctccaaca 189 // ID Gypsy-38_AA-I repbase; DNA; INV; 6707 BP. XX AC AAGE02019168; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_AA_; KW Gypsy-38_AA-LTR; Gypsy-38_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6707 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019168; Positions 153411 146705. XX CC Positions [4993-5469] - Integrase core CC 'CAGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 407..2674 FT /product="Gypsy-38_AA-I_2p" FT /translation="MDLQAEYRQMNVSHLAEDEVEHELLIRNMLFHFDDHD FT SVKRRKLKDRMKEERNVGINPTAFARTWRSAKEEIETIRSHVKVIGGIIEN FT PRTDARQKEKMRTRFVHYRVRIGQLARATDARRYSAEITEIENQMDQIIAT FT HLSKPLNPQSNPTKPKPIREKITQALDEVRTEIATLNDTVATLDSGDEGDE FT IDEAVGGTNSSLVDQKRKEMEASKKRTEEILEKLNEYEQGKEINMAQLLPA FT FRNFVIQTSEQQKSMREQEIRMAEERKKEMESQMKRKQNLEKLLTDLNESL FT RVQPMNVASATADKHDSQGSDDSDDSRQITHSQSKSSKARTETRSSPKNTI FT NSTDSSSEEIEQAGKRELKGNYVGKEKRKHNRKTEIRNKKKKNARSDSSEE FT TLSLDSSESSESSDFSRSSADSSSDSTEEERKRRKRHQKKKKHRKSRNTMK FT RIPVAEWKIRYDGKDDGRKIAEFLKEVKMRCRSEDVSDRELFRSAIHLFSG FT RAKDWYIDGVENGDFRNWSELKKELKREFLPPDIDFQLEVQATNRRQLRGE FT KFSDFFHEIQKIFHSMTKQLSEKRKFKIIWRNIRHDYKNALTGAGIKSLSK FT LRRYGRKVDENFSFLQKQPEAMNRQRSSQVSEITSNPTKSKTGSSGNNTRV FT FTNSRNQPKQKDSGEGKRDQKEIDRKEQGMREERSTPSVEKPTADKAVAVP FT TRYQRPPIGTCYNCRNHGHHYSECSEKRGMFCRVCGFPNVFTSAYPACPKN FT PEGSA" FT CDS 2575..5868 FT /product="Gypsy-38_AA-I_1p" FT /translation="MFREKRYVLSGVWISKCFHKRLSGLPKKPGGISLRRQ FT ADVNIEIPPNPNLIGSQLSSCGFEPLSTSDYDSQPSIEEIFVHIQGDGRPF FT AKVNVLGVEVLGLLDSGAQRSVLGIGSDKLIKSLKLKIHPTPTSVKTAEGK FT NVPVRGLVYLPITFRNQTRIIPTLVAPELRRRLILGFEDFWRMFNIRPTQQ FT TGDGFVRIDEFESVEMNPFGEPVSDLTDEQSAQLENVKQLFKAAIDGETLG FT VTPLISHKIEIKEEFQQSPPIRINPYPTSPEMQRKINREIDNLLTQKVIEK FT SHSDWSLSTVPVVKPSGDVRLCLDARRLNERTRRDAYPLPHQDRILSRLGA FT SKYMTTIDLTKAFLQIPLDDDSRKYTAFSVVGRGLFQFTRLPFGLVNSPAS FT LSRLMDEVLGYGELEPNVFVYLDDIVVVSDTFEAHLQSLREVAKRLKSANL FT SINLQKSKFCVTELPYLGYLLTSEGIRPNPERVDAILNYERPNSIRALRRF FT LGMANYYRRFIAAFSEISAPLSNLLRKKPKGIVWNDEAETAFVRLKESLIS FT APILSNPNFELPFQIQTDASDNAIAAVLTQVHDDGEKIIAYFSQKLTPAQQ FT SYAATEKKGLAVISAIAKFRPYIEGMHFVVMTDASALTHIMNGKWKTSSRL FT SRWSIDLQGYDFEIRHRRGRDNIIPDALSRAVEELEASEDDWYQDLVTKVE FT KSPEANLDYRVEAGKLYKFVPTKTEVFDYRYEWKLCVPEGLRKEVLQKEHD FT NAFHIGYEKLLDKVRQRYFWPNMAASIKKYVQSCHQCKEVKPTNVSQHPEM FT GKQRLTTKPFQILSMDFIQSLPRSKSGNTHLLVLMDLFSKWTMLFPVKKIA FT SDIVIRLVEQHWFRRYSVPEILITDNASCFLSKEFKAFLDRYHVQHWANAR FT HHSQANPVERLNRSITSCIRTYVKTNQRLWDTRISEIEYTINNTRHSSTGF FT TPYRVLYGHEIVADGDEHRLDVDTKEITEGDRIEQKLNIDKVIFDTVQKNL FT IRAHEKSAHNYNLRFKKPAPVYLVGQKVFRRNFSQSSAADAYNAKLGPTYI FT PCTIVARRGTSSYELIDSSEKNIGIFSAADLRPGGPENGSS" XX SQ Sequence 6707 BP; 2142 A; 1325 C; 1546 G; 1694 T; 0 other; attggcgccc aacttaaaga aagcttactg atcagagctc aaagtaggag tggatttaaa 60 tagattggag taggctttag ttttatgcat attgattacg taaatgtgct cggattggag 120 aaaggggaag cgaattagaa ttgtgtagtg cgaaacacat tctctgcaga agatattgcg 180 agatctgtgc ttttactagt gcgcgtacgt cgataagagc gaatacattt aacgaaagca 240 gtgctaaata ggagcgcgcg aacaaatttt aggttatttt tcaatttttt tcttgatcaa 300 tgggtcttga ctcgtgaaat atcatttaac tgcaagagga ttcgaacagt atctgttttt 360 cagagcacac aatttttatt ggtattgtga ttgtgagagg aatatcatgg atttgcaagc 420 ggaatacagg cagatgaacg tgtctcatct cgcagaagat gaagtcgaac acgagctctt 480 aattcggaac atgttgttcc atttcgatga tcatgatagt gtaaaacgta ggaagcttaa 540 ggataggatg aaggaagaga ggaatgtggg aataaacccc actgcttttg ccagaacttg 600 gaggtctgca aaagaagaga tcgaaaccat cagatctcac gtcaaagtta tcggtgggat 660 aattgagaac ccaagaactg atgctaggca gaaagagaaa atgagaaccc gtttcgtaca 720 ttaccgcgtg agaataggtc agttagcgag agccacagat gctcgtcgat attcagcaga 780 gattaccgaa atcgaaaacc aaatggatca gatcatagct actcacctat cgaagccttt 840 gaacccacag agtaatccaa ctaaacccaa accaataaga gagaaaatta cacaggcttt 900 agatgaggta cgcaccgaga tcgcaacctt aaatgatacg gttgcgactc tcgacagcgg 960 tgatgagggg gatgaaatag atgaggcagt tggaggaaca aactctagct tagttgatca 1020 gaaaaggaaa gagatggaag catcaaagaa aagaaccgag gaaattttgg aaaagctaaa 1080 tgaatatgaa caaggaaagg aaatcaatat ggctcaattg ttgccagctt tcagaaattt 1140 tgtcattcag acttcagaac agcagaaatc aatgcgtgag caggaaatca gaatggccga 1200 ggaaagaaag aaggaaatgg aatcacaaat gaagagaaaa caaaatttgg aaaaattatt 1260 aacagatttg aatgaaagtt tgagagtaca gcctatgaat gttgcttctg caactgcaga 1320 taaacatgat agtcagggat cagatgattc tgacgattcg cgacaaatta cgcattctca 1380 atcgaaatcg tcgaaagcga gaactgaaac tagaagctca ccgaaaaata ctattaactc 1440 gactgatagt tccagcgagg aaattgaaca agctgggaaa cgagaattga aaggaaatta 1500 tgtaggaaaa gaaaaacgga aacataatag gaaaactgaa atcagaaaca agaagaagaa 1560 aaatgctaga tccgattcgt ctgaggaaac tctctcgcta gattcgtcag agagctcaga 1620 gagctcagat ttttcacgtt cgagtgcgga tagttcatcc gactcgacag aggaagagag 1680 gaaaaggagg aaaaggcatc agaagaaaaa gaagcatagg aagagtagga ataccatgaa 1740 gagaattcct gtagcggaat ggaaaatcag gtatgacggt aaagatgatg gaagaaaaat 1800 agcagaattt ttgaaagagg ttaaaatgcg ttgccggtcg gaagatgttt cggatcgtga 1860 actatttcgt tctgccatac atttattttc gggtcgtgcc aaggattggt acatagacgg 1920 ggttgagaat ggtgatttcc gaaattggtc ggaactcaag aaggagttga agagagaatt 1980 tcttcctcct gacattgatt ttcagttaga agttcaggca actaaccgcc gccagcttcg 2040 aggggaaaag ttttcagact ttttccacga gattcaaaaa attttccatt cgatgactaa 2100 acagttatct gaaaagagaa agtttaaaat tatctggcgc aatatacgtc acgattataa 2160 gaacgcctta acaggagctg gaattaaaag tttgagtaaa ttgcgaaggt atggtagaaa 2220 agttgatgaa aactttagtt ttctacaaaa gcagccagag gctatgaatc gacagcgtag 2280 tagccaagta agtgaaatta cctcaaatcc aactaaaagt aaaactggca gctctggtaa 2340 taacacacga gtttttacca acagccgaaa ccagccaaag cagaaagatt caggggaggg 2400 gaaaagggat caaaaggaaa tcgataggaa ggaacagggt atgagagagg agagaagcac 2460 accatctgtg gagaagccca cagcggataa ggctgtagcg gtgcccacta gatatcagag 2520 gccaccgata gggacctgtt ataattgccg caatcatgga catcactatt ctgaatgttc 2580 cgagaaaaga ggtatgtttt gtcgggtgtg tggatttcca aatgttttca caagcgccta 2640 tccggcttgc ccaaaaaacc cggagggatc agcttgagga ggcaagctga tgtgaacatc 2700 gaaattcctc ccaatcccaa tctgattggt tctcaactga gttcctgtgg cttcgagcca 2760 ctctccacat ctgattacga ctctcagcct agtatcgaag aaatttttgt acatattcag 2820 ggggatggta gaccctttgc taaggttaac gttctgggtg ttgaagttct aggattgcta 2880 gacagtggtg ctcagcgatc ggtgctggga atcggatcag ataaactaat caagtcttta 2940 aaattgaaaa tacatccaac ccccacttca gtcaaaacag cagaagggaa aaatgtaccg 3000 gttagaggtc tcgtatattt accgatcact tttcgtaacc aaacccgaat catacccaca 3060 ctagtcgctc cagaactacg acgaagactc attttgggct tcgaggattt ctggagaatg 3120 tttaatattc gacccactca acagacaggg gacgggttcg tgagaattga cgagtttgag 3180 agtgttgaaa tgaatccctt tggagaacca gtatcggatc tcaccgatga gcagagtgcc 3240 caacttgaga acgttaaaca acttttcaag gcagcaatcg atggtgagac ccttggtgta 3300 actccgttga tctcacataa aattgagata aaagaggaat tccaacaatc tccgccaatt 3360 cggattaacc cctaccccac ctctccagag atgcagagga aaattaatcg cgaaattgac 3420 aatctcttaa ctcagaaagt tattgaaaag agtcacagtg actggtctct cagcactgtt 3480 cctgttgtga aaccttcagg ggacgtgaga ttatgtttag acgctcggcg tctcaatgag 3540 cgtactcgaa gggacgccta tcctctcccc caccaagacc gtatactgag tcgactaggg 3600 gcgagcaagt acatgaccac gatagattta actaaagcgt ttcttcaaat cccactcgac 3660 gatgactcaa gaaagtatac ggccttttct gtggtgggta gaggactgtt tcagttcacc 3720 agattgcctt ttggcctcgt caatagccca gctagtttgt ctcggttaat ggacgaggtg 3780 ttaggctatg gtgaactgga accaaatgtg ttcgtttacc tcgacgatat cgtcgtggta 3840 agcgacacat ttgaggccca ccttcagagt ctccgcgaag tggcaaagcg actcaaatcc 3900 gcaaacttgt caattaatct acaaaaatcg aagttttgcg taactgagtt accctatctc 3960 ggttatttat taacttctga gggtattcgt ccaaatcccg agagagtgga tgccattcta 4020 aactacgagc gacccaactc aattcgtgcc ctgcgccgct ttttgggcat ggcaaattat 4080 tacaggcgct tcattgccgc ctttagcgaa attagtgctc ctctctcaaa tctgctcagg 4140 aagaaaccga aaggaatagt gtggaacgat gaggccgaaa cagctttcgt tcgccttaag 4200 gagagcctga tatcggcccc gatattgagt aatccaaact ttgagttacc ctttcagata 4260 caaaccgatg ccagcgataa tgctatagcc gccgtgctaa cacaagtgca cgatgacggc 4320 gagaagatta tcgcatattt ttcacaaaaa ttaacccccg cacagcagtc ttatgctgca 4380 accgaaaaaa aaggattagc ggttatttca gctatcgcta agtttcgccc ctacattgaa 4440 gggatgcatt ttgtggtaat gaccgatgct tcagccctta cgcatatcat gaatggaaag 4500 tggaaaactt cgtctcgttt aagcagatgg agcattgacc tccagggata cgattttgag 4560 atccgacatc ggcgaggtag ggacaatatt atcccggatg cgctctctcg agctgttgaa 4620 gagctcgaag catcagaaga tgattggtat caggatctgg taaccaaagt tgagaaatct 4680 ccggaggcaa atttggacta ccgtgtggag gcaggaaaat tgtacaagtt cgtaccgaca 4740 aagacagaag tctttgatta ccggtacgaa tggaagttat gtgtgccaga aggtcttcga 4800 aaagaagtcc ttcagaagga acatgataac gcgtttcata taggctacga gaaattgctg 4860 gataaggtgc gacagcggta tttctggccc aatatggccg cttctataaa gaaatacgtt 4920 cagagctgcc accagtgtaa agaggtcaag ccaacaaacg tttctcagca cccagaaatg 4980 ggcaaacaac gtttgacgac taaaccgttt caaatacttt ccatggattt tatccaatcg 5040 ttgccacgca gtaaatcagg taacacacac ctcctcgttc taatggacct gttttcaaag 5100 tggacgatgt tgtttcccgt taagaaaatt gcctccgata ttgtaatccg tctggtagaa 5160 cagcactggt ttagaaggta ctctgtgcca gagatattaa ttactgacaa tgccagctgc 5220 ttcctcagca aggaattcaa agcctttctc gatcgatatc acgttcaaca ctgggctaat 5280 gcacgccacc acagtcaggc aaacccagtc gagaggctaa atcgcagtat tacctcgtgt 5340 attcgtacgt atgtgaagac gaatcagcgt ttgtgggata ctcggatctc agagatcgag 5400 tataccatca ataacacacg tcactcgtcg acggggttca ctccataccg agtgctctac 5460 ggtcacgaga ttgtggcaga cggtgatgaa cacaggctcg acgtggatac aaaggagatc 5520 actgaggggg acagaattga gcagaagctc aatattgata aggtcatatt cgatacggtg 5580 cagaaaaatc tcataagagc tcacgagaag agtgctcata attataattt gcgtttcaag 5640 aagcctgcac ctgtctacct agtcggccag aaggttttcc gtcgaaattt ttcccaatcc 5700 tcagcagcag atgcctacaa cgctaaattg ggacctacat acataccctg tacgattgta 5760 gcccgccgtg gcacaagctc gtacgagctc atagacagtt cggaaaaaaa tattggaatc 5820 ttctctgcgg ccgatctccg accaggaggc ccggagaatg gcagttcctg aatttataat 5880 caattaaatt gaaaattcca ttaattcact attttagcta atttagattc cccagatgga 5940 aatcgttggg atagttttaa ggtgcaccat acacttcgtc aattccgagc aataaattag 6000 agacttgaga gtgagataga gttcaagatc agatcatttg gtattcgatt gtatggtagg 6060 atttcataac aataatttcc atatttatat agttcctctt tagtcatttt gctgttaagt 6120 tagatgatca cgctcaacga tatcgtagtt tgcacaatag gaggatcacc ggtcaatgtc 6180 gtcaaaaatg taaaatgcca ccactccatt gggctacgaa aaatatctag cgctaatatt 6240 tttcctacgg ttgtccagta gcaatggtca aaaatccaac atagtcgaaa ttccaaaata 6300 gctcacaaac ttccatatct acacaaactc gtccaaccac aatcaccatg ccaataaatc 6360 gcttgagatc agctttatct cgtgtagctc ttgccagtat ataagtagtg ttcgataaag 6420 gttcatcctc tcaactttgt tgagatcttg ggatgagact aattgatcaa gagagtaagg 6480 gtaaatacgt ttgtaaatgg ttacatgcct attgtcggca aaatttctat taagtactca 6540 agacgttgaa atataagctc tttcggattc cgatgcttgc gaaaggcaaa ctcgaccctg 6600 ggagtatagt tctacaatag tcttccggaa ataaattgat agtagtccta aatatatttc 6660 aaaataaatt gatgaatcaa tttattttta cggggagtgt ggggtag 6707 // ID Mariner-14_HM repbase; DNA; INV; 2012 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2012 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 231-231 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(147..719,1027..1536) FT /product="Mariner-14_HM_1p" FT /translation="MLIYPRIRVKPEFLDKAPIGSISGGSKNGWITTELFE FT IWFDHFLQAVQPQSRNQPVLLILDGHSSHKKNLSVIKKARHSNVIILSLPS FT HCTHKLQPLDVSFFKSLKIFYDQEVSTWLRHHPGRPVTELEVGELFGKAYG FT KAATVQNCQSGFKKCGIYPFDRNVFTEEDFAAAKATDHSYVVSKISKPNNN FT LLNNFYLESISPNSSFTLQKSSELATSHDETSQPYGVTFGSLAGLDIKNAK FT RTGIKRRVNHAEKITGSPYKSQLEESLNLKYKNRQPRKKCIKKSISTHQKT FT CLKNLDAKGAINKQLCAKCQYSYGDPVDPYLKDNWDKCLKCCRWWHETCAA FT ICGVYTKKVFTCDDCIKH" XX SQ Sequence 2012 BP; 706 A; 312 C; 309 G; 685 T; 0 other; aatctggatt gtctatttgc cataaaccag gaaaaattgt tgctcttaag ggaaaacaca 60 gtgttggtgg cttaacaagc tccgaaagag gtaaaacaat cacaatagtt tgttgccaat 120 cagcaagcgg gttttttgtc cctccgatgt tgatctatcc acgtattcga gtgaaacctg 180 aatttttaga taaagcccca ataggttcta tttctggtgg cagcaaaaat ggctggataa 240 caactgagtt gtttgaaata tggtttgatc attttttaca agcagtccag cctcagtcaa 300 gaaaccaacc agttcttcta attttagatg gacattcaag tcataaaaaa aatcttagtg 360 ttattaaaaa agcgcgccac tcaaatgtta ttattttatc acttccatcc cactgcaccc 420 acaaactgca accacttgat gtttcgtttt ttaaaagcct caaaatattc tatgatcagg 480 aggttagtac ttggcttcgt caccatcctg gtcgtcccgt aactgaacta gaggttggtg 540 aattatttgg aaaagcctat gggaaagctg caactgttca aaattgtcag tcagggttca 600 aaaaatgtgg aatatatcca tttgatagaa atgtgtttac tgaagaagac tttgctgcag 660 ccaaggctac tgatcactcg tatgttgtat caaaaatttc aaaaccaaat aataacctct 720 gaaatgcacc ctactctcaa caatgggtta gactctgcta aagaattaga taatattgat 780 tttgttacag attgtgttat tgacactcaa aaaggtaaat cgtcataaat gttgtattat 840 ttttaacatc ttccgcattt tatttaaaat acaaatatga cttttttaaa atcaagcaaa 900 atattaactt atctgttgtt ttattaacct aaaaaaaggt ttcttaaaga aagttcaaat 960 tttttaaaaa cattttaact attttttata atttgcatgt tgcctatttt tattcaccct 1020 ctttagttaa ataattttta tttagaatct atttcaccaa attcatcatt tacacttcag 1080 aagtcatcag aattagccac ctcacacgat gaaacctcac aaccttatgg tgttacattt 1140 ggatcactag ctggacttga cattaaaaat gcaaagcgaa caggtataaa aaggagggta 1200 aatcatgctg aaaaaattac tggatcacca tataaatctc agctagagga atctttaaac 1260 ttgaaatata agaacagaca gccacgtaaa aagtgcatta aaaaaagtat atctactcat 1320 caaaagacgt gtttgaaaaa ccttgatgct aaaggtgcga ttaataaaca actttgtgca 1380 aagtgtcaat acagttatgg tgatcctgtc gatccttatt taaaagataa ttgggataaa 1440 tgccttaaat gctgtagatg gtggcatgaa acatgtgctg ctatatgtgg tgtttatact 1500 aaaaaagtat ttacttgtga tgactgtata aagcactgat tcttaaaaaa aaagttatat 1560 gtcatatatt gataaaatgt gttttaactt tcttaaataa atctcttgtt aaaaaagttt 1620 cctttgagtt ttgtttttta tttttttaag tttaactatt gaagttatgt aaaatattat 1680 tttatgaact acttatgagc ttataaacat aaattcttat ttaaacagaa tttagaaata 1740 atcatcgaaa tcgtagaaat tagtaatcgt ttcattaaca attatacttt ggaagcacat 1800 ttatatttaa taaggtttct gttttgggtg acattcatac tttaataaaa aaggtgacat 1860 tcatacttta aaaatacatg acattcatac tttaataaaa taaataggtg gttcaaaaat 1920 tattattttt tgcaactgaa acgcccttaa ttaagtgcca attaggtgta acaaattcta 1980 tttaagagac aattctttat acaatccaga tt 2012 // ID BEL-1_DPu-LTR repbase; DNA; INV; 193 BP. XX AC scaffold_30; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_DPu_; KW BEL-1_DPu-LTR; BEL-1_DPu-I. XX NM BEL-1_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 650-650 (2010). XX DR Genome; scaffold_30; Positions 1095487 1095295. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 193 BP; 60 A; 32 C; 58 G; 43 T; 0 other; tgtggcgcac cagcgccaga gttggcgatc tggcaaggca gcagagagat tggagagaag 60 aagaaaggag gggaagaaca aaaagtgaag aagacgacga cagaggtgca agaacagaag 120 tttcttttgg tgcccatttt gtcttgacgc tgaaatatac tctcttgaat ctgatttgtg 180 tcagtgtttc aca 193 // ID MARINER_Mi repbase; DNA; INV; 489 BP. XX AC AJ251413; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Meloidogyne incognita partial Mariner-like element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINER_Mi; KW mariner-like element; putative transposase partial domain. XX OS Meloidogyne incognita OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne; OC Meloidogyne incognita group. XX RN [1] RA Leroy H., Leroy F., Auge-Gouillou C., Castagnone-Sereno P., RA Vanlerberghe-Masutti F., Bigot Y. and Abad P.; RT "Identification of mariner-like elements from the root-knot RT nematode Meloidogyne spp."; RL Mol. Biochem. Parasitol 107(2), 181-190 (2000). XX DR Genbank; AJ251413; Positions 1 489. XX SQ Sequence 489 BP; 134 A; 119 C; 108 G; 128 T; 0 other; tgggtaccgc atgaactaag ccaaattcac cttcaacaaa gggtaaacat ttgtcagcaa 60 ttgcttaatc gtcaacacgc acattctctt cttggccgct tagtgactgt agatgaaact 120 tgggttcact acagtggtag aatgcgcaaa gtgaattggc tccgaccaaa tcaaccggct 180 gtagcagtcc caaaaccaaa tccaatggga aaaaaggtta tgttgattgt tttttgggct 240 agatttggaa ttgtacattg ggaattaatt cctaggggca ctggacttaa tggagaactc 300 taccgtacat tcctggaccg cgttcaagtt gccattgatg gtttcacagt tcagggatga 360 cgtcatggcc aggttgtttt ccatcaggac aacgcgcctc cacaccgtgc aaatgcaact 420 cgtgcccaca ttactgaaac actcggatgg gaaattttac cacatccgct tataccccga 480 cctggcccc 489 // ID MuDR4x_SM repbase; DNA; INV; 1730 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; KW Autonomous DNA transposon; MuDR4x_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1730 RA Jurka J.; RT "MuDR-type elements from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1903-1903 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 102..1598 FT /product="MuDR4x_SM_1p" FT /translation="MVVISKTRRGKPLLIEGEYQYHLNSKNSAGTKMYWLC FT TKREICKARLVTNADLENGITIFHSTSHAHIGDESAPEVRSVTEKIKARAR FT AEPNSHPSHIIREELRGVTDTEVILNVSETNSLVKMVNRVQNSARPNLPAS FT VSSCLITQPYNQTISGDLFLRFDSGVHDSNRVLIFYSDLGLRSLCRSTQIF FT ADGTFDTVPRIFFQLYTIHGEVFGYTFPFVYVLCLRKTRSVYESIINHLKE FT ASIRLNITFSPETIMVDFEMAAIMAFKNMLPNSTVKGCLFHFNQSLWRRIQ FT ENGLSVRYRNRQNTDLQGLVVALMALPFIPEEDVAGTFKDLVAFAPPELND FT FIEFYGRTYIGLIRNGDSWTSVTPRFENTMWSCYNSTLSGHRRTNNVVEGW FT HSRFHGEIQSHHATIWKFLEFIKKDEADNKILIEQLFGGHRKIRHPIKRMY FT LVNNSRILELVRSYGEFKNSGTIKIYLHSIGRLLKNNFTYIPEQDHNQPNM FT NEH" XX SQ Sequence 1730 BP; 577 A; 287 C; 326 G; 540 T; 0 other; agacgttaca cataaaattt attacacata aagcacattg attttgctgt tataaaatcc 60 gttgcttatg tctaaaaaaa ttattttttc gccctttaat tatggttgtt atatcgaaaa 120 caagaagagg gaagccttta ttaatagaag gggaatacca ataccaccta aactcaaaaa 180 attccgcagg aactaaaatg tactggcttt gcacgaagag agaaatctgt aaagccaggc 240 ttgtcacaaa tgctgacctt gaaaacggaa ttaccatatt ccattcaaca tcacacgctc 300 atataggtga tgaatcagca ccggaagtaa ggtcagttac agaaaaaata aaggcccgtg 360 caagggctga gccaaactca cacccttcgc acataatacg agaagagctt agaggagtca 420 ctgatacaga ggttattttg aatgtgagcg aaacaaactc gctagtaaaa atggtaaata 480 gagttcaaaa ctctgcaaga ccgaatctac cagcatctgt ttcgagttgt ttgatcaccc 540 aaccatacaa tcagacaatt tcaggcgatt tatttttaag gtttgattct ggcgtgcatg 600 actctaacag agttctcata ttctactctg atcttggctt acgatctcta tgtagaagca 660 ctcagatttt tgcagatgga acttttgata cagtaccaag aatctttttt caactgtata 720 cgatacatgg tgaggttttt ggttatactt tcccctttgt ttatgttcta tgcttaagaa 780 aaacaaggtc tgtttatgaa agtataatta atcatctaaa agaagcttct attagattaa 840 atattacttt ttcaccagaa acaataatgg tggactttga gatggcagca attatggcat 900 ttaagaacat gctacctaat tcaacagtaa aggggtgctt attccacttc aaccaatcat 960 tatggagaag gatacaagag aatgggttgt ctgtgagata ccgtaatagg caaaacactg 1020 atttgcaagg gttggtcgtt gcattgatgg ctttaccatt tattcctgaa gaggatgtag 1080 ctggaacctt taaagattta gtggcttttg ctccgcctga gcttaatgat tttatcgaat 1140 tctatggaag aacctatata ggtttaataa ggaatggaga tagctggaca tccgtaacac 1200 caaggtttga gaatactatg tggagttgtt ataactctac actttctgga caccgacgaa 1260 caaataacgt tgttgaaggg tggcattcta gattccatgg tgaaatccaa tctcatcatg 1320 caacaatttg gaaatttctt gagttcatca agaaagatga agctgataac aagattttaa 1380 ttgagcaatt atttggaggt cacaggaaga ttaggcaccc aataaagcga atgtatctcg 1440 ttaataactc acgaatcctc gagcttgttc ggtcgtatgg tgagtttaaa aactctggca 1500 cgataaagat atatttgcat agtataggga ggttattgaa gaataacttt acttatattc 1560 cagagcaaga tcacaatcag cctaatatga atgaacatta aagttcttat tttttttgtt 1620 ttttgaagaa agttctaaac caactcttcc aaatttaata atatgtgctt tatatataga 1680 aaatttttta ttataaattt tatgtgtaat aaattttatg tgtaacgtct 1730 // ID Gypsy-46_AA-I repbase; DNA; INV; 3887 BP. XX AC supercont1.286; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-46_AA_; KW Gypsy-46_AA-LTR; Gypsy-46_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3887 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.286; Positions 1284105 1287991. XX CC Positions [2009-2485] - Integrase core CC 'AAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 168..1922 FT /product="Gypsy-46_AA-I_2p" FT /translation="MSVSEGAYIGTEDAGFTLVEKPDPCMHPANTKESGRD FT VRGAGQSSASGTIITAHDQVTSGEQRPDTLKRPTTGSDAQLGSSSTMSSGE FT EVAPGGSARRQEVSTGSSDSGYGGSALALASVWNRPIPGVAPRGAGHWRSH FT GATRTYAFMRDMLSPQYIADPLYVMGYKWTLADVEHNVERIQVAGHIVSVS FT GVMRAGTNPPELSDRGAKLWSNAVLTPVDASMQIRNLIPTGGSVNLSVMTQ FT LVSKYHGDLHTLARDEAPLLRMLLMFLTFLPAGTPGYASNTIKMLRLGANQ FT GKCTKTLHPYNRRGVLDNTIQIPANSADCDVRIVPLSTYVNALCAKPSNND FT RLAGFNILEAEIVAIESHNLEQNWFMAFLIAHTTTLWWNCAATEEYDFSSD FT PKAAAGSKSKFVIKTMTRASCVYVPGEWKKIAIVVVDQDYSGLTKFSVPHL FT GDAIKFETRIADFTKKVFTYLGLTENGAAPNPHTMLMAIREMCKRTALSGD FT MGTVLSMVQELAFTVPLGGAVRTDGVFGASGVGRFDINAADYLLDKSRTTD FT LAKEWDDMSGWITGYGDWSVTPTGKFMDALFQCDNKWM" FT CDS 1916..2863 FT /product="Gypsy-46_AA-I_1p" FT /translation="MDVKKVLSKCSKCKESKYPTVPTIPPMGEQKSATRPF FT QMIALDYLSGFVRSKSGNTDLLVCLDIFSKYVRLFPVKKISVDSLTKIIES FT EWFFKLGVPQTLISDNAVTFLGNKFQELLKKYNVHHFKNARRHCQNNPVER FT VNRVILACIRTYCQSDHRLWDTQIAQIEFAINNTKHLSTGYTPFFLVHGYE FT SIVDGRDHLQDRQTSDPSVDQFTQRREVVVGPLYEEVVRNNKKQFEKYKKN FT YDSRHKALPPTFSIGQKVYKKHFKLSNAADHYAAKLGPVYVPCKIIARRGA FT TSYELEDENGRNLGVFAAQDLIPE" XX SQ Sequence 3887 BP; 1192 A; 759 C; 927 G; 1009 T; 0 other; tttaaaatac cagtaagtaa gatcaagcga caaatttaag agagaaataa aaacagaatt 60 ataaacagat acaaaattac ttcaaagcaa cacgttacct gatccttaca cttaacgtgt 120 cgttgtgaat ataatattgt ctctatcgga tttctttcga aggataaatg agtgtttctg 180 agggtgctta catcggaaca gaagacgcgg gcttcacact ggttgagaag cctgatcctt 240 gtatgcatcc cgccaacacc aaagaatcag gacgtgacgt aagaggagca gggcaaagtt 300 cagcatcagg aacaattata acagcacacg accaagtgac atctggggag caacgacctg 360 ataccctgaa aagaccgaca actggttcag acgcacagtt aggatcatca tccaccatgt 420 cctcgggcga agaagttgca ccgggaggtt cggcccgtcg tcaagaggta tcgacggggt 480 catctgatag tggctacgga gggagtgcct tggcactagc atccgtatgg aaccgaccaa 540 tacctggggt cgcaccaaga ggcgcgggac attggaggag tcatggggcc acgcgcactt 600 acgctttcat gcgcgatatg ctcagcccac agtacatagc tgaccctctg tatgttatgg 660 ggtacaaatg gacactcgca gatgtggagc acaatgttga gcgcatacag gtagctggac 720 acattgtgtc ggtgtcaggt gtgatgcggg cggggacgaa cccacccgag ttgtcggaca 780 gaggtgcgaa gctatggtca aacgctgtgc tcacgccagt cgacgcttcc atgcagatcc 840 gcaacttgat accgacaggt ggaagtgtca atttgagtgt gatgactcaa ctcgtgtcaa 900 agtatcatgg tgacctgcac acgctggcga gagacgaggc gcccttactt cggatgctgt 960 taatgttctt aactttttta cctgctggaa cacctggata tgcatcaaac accatcaaga 1020 tgcttagact tggagctaac caggggaagt gtacaaaaac gctccaccca tacaaccgta 1080 gaggcgtatt ggataacact atccaaattc ctgctaatag tgcagactgc gatgtgcgca 1140 tcgtaccgtt gagtacttac gtgaatgcgc tttgtgccaa accgagtaat aatgacagac 1200 tggctggttt taacatctta gaggctgaaa tcgtcgccat tgaaagccac aacctggaac 1260 aaaactggtt catggcattt ctgattgccc ataccaccac tctttggtgg aattgcgcag 1320 ctacagagga gtatgacttc tcttcggatc cgaaagctgc cgcggggtcg aaatcgaaat 1380 ttgtcataaa aaccatgacg cgcgccagct gcgtctatgt ccctggtgag tggaagaaga 1440 tcgccatagt tgtcgttgac caagactaca gtgggttgac taagttcagc gtaccacatc 1500 tcggtgacgc gatcaagttt gaaaccagga tcgcggactt cacgaagaaa gtgttcactt 1560 acttagggct cactgaaaac ggtgcggctc caaaccccca cacaatgctc atggccataa 1620 gggaaatgtg taaacggaca gctctatctg gcgacatggg caccgtactg tcaatggttc 1680 aagagctggc ctttaccgta ccgcttggcg gcgcggtgag gacagacggc gtgttcgggg 1740 cgtctggagt agggaggttc gacatcaacg cagcagacta tttacttgac aaatcacgaa 1800 caactgatct tgccaaagag tgggatgata tgtctggatg gataacgggg tatggtgatt 1860 ggagtgttac acctactggt aagttcatgg acgcactgtt tcagtgcgac aacaaatgga 1920 tgtaaaaaaa gttttatcca agtgcagtaa gtgtaaagaa tctaaatatc caactgtgcc 1980 aaccattcct ccgatgggag agcagaaaag tgcaacacgt ccttttcaaa tgatagcact 2040 tgactattta agtggatttg tcagaagtaa atctggaaac actgacttat tagtatgttt 2100 ggacattttt tccaagtacg ttcgcctgtt tcctgttaag aaaatcagcg tggacagctt 2160 gacaaaaatc attgagagtg aatggttttt caaactggga gttcctcaaa cattgatctc 2220 agacaatgcg gtaacatttt tggggaacaa gttccaagaa ttgttaaaga agtacaatgt 2280 ccatcatttt aagaatgctc ggcgtcactg tcagaacaat ccagtagaga gagtgaatag 2340 agtcattttg gcatgcatca ggacttattg tcagtctgat catcgtcttt gggatacgca 2400 aatagctcaa atcgaatttg ctatcaataa tacgaaacat ttgtccactg gatacactcc 2460 gtttttcctt gttcatgggt atgaatcgat cgtagatgga agggaccatc ttcaagatag 2520 gcaaacttct gatccatcag tagatcaatt tactcaacga agagaagtgg ttgtgggtcc 2580 tttgtacgaa gaggttgtta ggaataataa gaagcaattt gagaaatata agaaaaatta 2640 cgatagccga cacaaagcat tacctccaac gttttccata gggcagaaag tgtacaaaaa 2700 gcacttcaag ctgtccaatg cagcagatca ttatgctgcg aagctaggtc cagtgtacgt 2760 cccttgcaag ataattgcca ggagaggggc aacttcttat gaactggaag atgagaatgg 2820 caggaattta ggtgttttcg cagctcaaga tttgatcccc gagtgaatgg tatattaagt 2880 gttaattgtg tgagttagtg tgaatttgtg tgaacagcta attatttgta gatgagattt 2940 tgtatggtgt gaacaattgt atgcagtgaa tttgaatcaa tgaatgttag gataaagatg 3000 gccatctaaa gtatggatga gaaaaatgtt tgaattgagt gttttgtagt gtgaaactaa 3060 gtgtaaacga gtgaagtgtg tagtatgtta atacgtagtg tgttaatacg tcatattatg 3120 agcagtcgca atagccaaaa gatgccgtct gtactcttta ggcgaggtaa gaacaaagtc 3180 ttgccttctc agtcataaga ttacttttgt agtctcttga cgcgaaaata ttcaaagaga 3240 taggtattaa aatatgcagg aacaaaaaaa acgcttaaaa cgccggttaa gaaaaagtac 3300 gaatataaga ctatcattaa aatcctgtct agacagtcag aagcgatctg aaaattcttc 3360 tgacatggta agaaagaatc gagcataact aaatctaacc agactggtcg tgaggaacct 3420 tcagtattct catgactcga caggattcaa tcgaaaatat aaaccaaatc tacaaattgt 3480 gagccgttgg agattcttag catcccctcg atgcagaaat agtatttgtt tttgtatggc 3540 agatgctgaa ctgaacacaa ttggtataac atagttcaat acagattatg aaaagagttc 3600 taacttttca atatcaaaca gtcagatcaa aatagtttaa aacgataaca gaagtaaaat 3660 cacagtaaaa atatatcatg cctctgtatt agaactcaaa atctgacccg acaagaattt 3720 tctctcagat ggataacgga ttatacttga caaaactgaa cacttgttta tctggacggc 3780 aattggctgt cgtgtgagtg gatttgactg tggtaattaa gacatcagta atattgagaa 3840 ggtcatcata gatgctctcc ccatattact gtctggtgtg ggggtat 3887 // ID Gypsy-614_AA-I repbase; DNA; INV; 6062 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-614_AA_; KW Gypsy-614_AA-LTR; Ty3_gypsy_Ele126; Gypsy-614_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6062 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5011-5490] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 850..5841 FT /product="Gypsy-614_AA-I_1p" FT /translation="MEQIEFEFSILKNIRNNLLKKPKNRIYTRSTILNKLK FT ETKEAHDNISDILTILETDLDIETFNKINKKAKEISSAIYLQLHQMLVTST FT TSSKFKTYAILVLSVVRFRKTFTMANVEIIKTVSTLMPSYDGSMDKLDSTI FT DALEVIETLVTDANRATIINVILTKLSGRARHTFVVKPADLQELKTKLKAV FT VTVTPPETIISKMIATKQKTDLATFTTEIESLATQLENAYISKKIPNDVAK FT ELSTKEAIKHMASGLKNEKTALILQAGTFNQLAEAINKIHEVNPTCEANVM FT FARANRNANKPSQSNETWMSRNAPYNQNRFGNRYSNQRFNPNSQGRSSSRH FT QQYGHRYEGRNDDRQSFSRRQNFNHTQQGHYSHQPNGHWRSNNNNNNHNCN FT TDRNDRNVYSTENGRGPAATPQASCAVPMNQQGETRHQATNPIQQVQRTNN FT QGVNVYNCNINYSNFVTLKLEMCEKPVTLIVDTGADVSLIKENVLKHLTNI FT YIKQNCIINGVTDGKLETIGCTYTNILLNNVQIPHTFQAVTKDFPIFTDGI FT LGRDFLINYECNICLKTWLLTFEYNHEKFEIPIQDKYNDHFIIPPRCQIIK FT QINLINVTKESVVLSKQIKPGVFYSNGLIDKDTQYINIMNVNNCSEKISMN FT DVLSNIKILDSSEFEKVNNKITRKSNKNRVVKLKEELNLTQVSDDIKPQLV FT KLCTTYNDIFALENDSLSTNNFYKQKITMEDESPVYIKNYRIPEAHKIEVD FT KQIDKMLNEKIIQPSVSPYNSPILLVPKKSNSDEKKWRLVIDFRQLNKKII FT ADKFPLPRIDEILDNLGRAKYFTTLDLMSGFHQIELDENSKKYTAFSSSSG FT HYEFNRLPFGLNISPNSFQRMMTIALSGLPPECAFLYIDDIIVVGCSINHH FT LKNLTMVFEKLRQYNLKLNPSKCHFFSSDVTYLGHHITDKGISPDPSKYFT FT IEKYPTPSNTDDVRRFVAFCNYYRRFIQNFAEIAHPLNKLLRKNVKFNWST FT ECKTAFETLKSQLLSPQILQFPNFKKQFILITDASKAACGAILAQKYDDVD FT LPIAYASKAFTKGESNKSTIEQELTAIHWAITHFRPYLYGRKFLVKTDHRP FT LVYLFSMRNPSSKLTRMRLELEEYDFEVQYVQGKTNVGADALSRINIDSET FT LKNLQILRVETRAQKKQKVQQLSNEQQTNVILDDEPDQLKIFESLNINEIY FT ELPKLTIEKTSNIKNHGIMCKIMNKKFNKESARAPFIPLKINKMKETLGKL FT LTKIDEKAHKLKITRIALALSSTIFKMMSINEFKKACNQHLRFIQILLYEP FT QQEVDNKNEIAKIIQENHDSAIGGHTGINRLYKKLKNFYIWPNMKNTIKNY FT VNNCIKCKQNKHSSKTYEKFEITPTPQKPFSIIAMDTIGPFTKSNSGNRYA FT LTIQCDFSKYIIVKPIPDKQAETLAKAFIENCILIYGSPSIIRTDQGTEYK FT NEVFNNINQMLQITHNFSTPYHPETIGSLERNHRCLNEFVRQFVNESHNDW FT DDWLSYYAFCYNTTPHSDFPYTPFELIFGRTATLPNNLKNPKEIEPIYNYE FT QYYSELKFKMKTAAQRTQELINKAKLKRQQNQENIAKPSSIKKGDYVMIEN FT ENRSKLDKIYKGPYKVTEINHPNVTIFDDNKNQYYTIHKNKLVKFK" XX SQ Sequence 6062 BP; 2410 A; 1019 C; 978 G; 1655 T; 0 other; tggcgaaccg tcgacagtct tgatggtaaa ttagtagaat ctaagtttaa atccagtcac 60 gtcagatttc agaagatagt agtcacgtta gattgaaacc gaagtacgaa gttaaagcgc 120 gaagttaaag tgcgaagttt aagtgcgaag ttaaagtgca aagtataaac ttcaaaaata 180 cacttgcaaa atgtcgtggc ttttcgggga cacctacgtg caacaatcga tgaactcggt 240 agagatcaaa gttatcgctg gtgtagcggt aggtttattg gcattagccg tagtatacgg 300 agctttgcgt ctccacggca aatttatgaa agccaagata gagcgaacaa ctcgcaaaga 360 agtaagattg aacaattcgg tggtagtgta aaccaaaagt gatggaggtg gtttaagaaa 420 aacaattccg tgaacattga acattccagc ttaagctgaa cagtgataaa aaagggtaaa 480 atacccaaaa ttaaaaatat taatcgagca gcagtaccaa tccatgaata gcagcggagg 540 cggaatgcag tgtgcagcac cagaaatgaa cgacactgag tagcagcagc agcgacgaaa 600 aggtaaataa aaggaaaatt aaacccagcg caaagcgcga tgagaccaag aagcgcatgg 660 aaggagtgaa gaaaagtgaa actaaatatg attttattga tacatgagat atgaatgaac 720 atgatttttt tttgtttctg aactaagtgt ggattttcat ttaatgtaca aaatcattat 780 ttaaagtgtt taattcgttt tgcgaaaaat aggtcgatga atatgaacaa catgaataca 840 tacaaataaa tggaacagat agaatttgaa ttctcaatac taaagaatat tagaaataat 900 ttattaaaaa agccgaaaaa tagaatttac acaaggagca caattttgaa taaattaaaa 960 gaaactaagg aagcacatga taatataagt gatatattaa caattttgga gacagatttg 1020 gatatcgaaa cctttaataa aattaacaag aaagcaaagg aaatttctag cgcaatttat 1080 ttgcagttgc atcagatgtt agtaacttcc acaacatcta gcaaatttaa aacttatgcc 1140 attcttgtat taagtgtagt gagattcagg aaaactttta cgatggccaa tgttgaaata 1200 atcaaaacgg tgagtacatt gatgccttct tatgatggat caatggacaa actagatagt 1260 acaattgatg ctttagaagt catagaaact ttggtaaccg atgcaaaccg agctaccatt 1320 ataaacgtta ttttaaccaa attaagtggc agagcacgac acacatttgt tgtgaagcca 1380 gcagatttac aagagttaaa aaccaaactt aaagcagtgg taacggttac accacccgaa 1440 acaataattt caaaaatgat cgcgactaaa caaaaaactg atttagcaac attcacaacg 1500 gaaattgaat cacttgcgac tcaacttgaa aacgcctata tttcaaaaaa gattccaaat 1560 gacgtagcaa aagagttatc tacaaaagaa gctatcaaac acatggcttc tggcctgaaa 1620 aatgaaaaga cggctctcat tttacaagct ggaacattca accagctggc cgaagcaata 1680 aacaaaatac acgaagtgaa ccccacatgc gaagccaatg taatgtttgc aagagcaaac 1740 cgaaatgcaa acaaaccgag tcagtctaat gagacttgga tgagcagaaa tgcaccatac 1800 aaccaaaatc gttttggcaa caggtactca aaccaacgtt tcaatccaaa tagccaaggt 1860 aggtcaagtt cgcgacatca gcagtatggt catagatatg aaggtcgaaa tgatgatcga 1920 cagagttttt ctagaaggca gaattttaac catactcagc aaggtcacta cagccaccaa 1980 ccaaatggtc actggcgtag taacaacaat aataacaatc ataactgcaa cactgatcgc 2040 aacgatcgta atgtttactc tacggaaaac gggcgcgggc cagctgcaac tccccaggcc 2100 agctgtgccg tgccaatgaa ccaacaaggg gaaacccgac accaagcaac aaatccaata 2160 cagcaagtgc aacgcacaaa caaccaaggc gtgaatgtat ataattgcaa tattaactat 2220 tcaaattttg taacactaaa attagaaatg tgtgaaaaac ctgtcacatt aattgttgac 2280 accggagctg atgtttcatt aattaaggag aacgtattga aacatcttac caatatatat 2340 ataaaacaga attgtattat taatggagtg acggatggaa aacttgagac catcggatgt 2400 acatatacga atattctttt aaacaacgtt caaataccac atacatttca agcggtcact 2460 aaagactttc ctatttttac tgatggaatt cttggccgtg attttctaat taattatgaa 2520 tgtaatatat gtctcaaaac atggctttta acatttgaat ataatcatga aaaattcgag 2580 ataccaattc aagataaata caacgaccat ttcattatac cccctcgatg tcaaattatt 2640 aaacaaataa acttaataaa tgtcaccaaa gaaagtgtcg tactttcaaa gcaaattaaa 2700 ccaggagtat tttactctaa cggattaata gataaagata ctcaatatat aaacataatg 2760 aacgtcaata attgttctga aaaaatttca atgaatgatg tattatcaaa tatcaaaatt 2820 ttagattcga gtgaattcga aaaagttaat aataaaatca cccgaaaaag taataaaaac 2880 agagtcgtta aattgaaaga agaattgaat ctaacacaag tatcagatga tattaaacca 2940 caattagtga aactgtgcac aacttataac gatatttttg ctctagagaa tgattctcta 3000 agtacgaata atttttacaa acaaaaaata actatggaag atgaaagccc tgtttacata 3060 aaaaattatc gcataccaga ggctcataaa attgaagttg acaaacagat tgataaaatg 3120 cttaatgaaa aaattataca accttcggta tctccataca attcaccaat tctgttggta 3180 ccgaaaaaat ctaactctga cgaaaagaaa tggaggttgg tgatagattt tcgccaacta 3240 aacaagaaaa taattgcgga taaatttcca cttccgcgta tcgatgaaat tcttgacaat 3300 ttgggcagag cgaaatattt tacaacgctt gaccttatgt caggatttca tcaaattgag 3360 ctagatgaaa attcaaaaaa atataccgca ttttctagtt cttcaggaca ttacgaattt 3420 aatcgtttac cttttgggtt gaatatatca cccaatagct tccaacgaat gatgacaatc 3480 gctctgagtg gtttaccacc agagtgtgcg tttctatata tcgatgatat tattgtggtg 3540 ggatgttcaa taaatcatca tttgaaaaat ttaactatgg tttttgaaaa acttagacag 3600 tataatttga aattaaaccc atcgaaatgt cattttttta gtagtgatgt cacttattta 3660 ggacatcata ttacggataa aggtatttca cctgaccctt caaaatattt tactattgaa 3720 aaatatccaa caccgtcaaa tactgatgat gtaagacgtt tcgtagcgtt ttgcaattat 3780 tatagacgat ttattcaaaa ctttgctgaa atagcacatc ctttaaacaa attattaaga 3840 aaaaatgtaa aatttaactg gtcaactgaa tgtaaaacag catttgaaac acttaaatcc 3900 caactcttat caccacaaat tttacagttt ccaaatttca agaaacaatt tatcttaata 3960 actgatgcct ctaaagcggc atgcggtgca atactagcac aaaagtatga cgatgtcgat 4020 ttacctattg catacgccag caaagcgttt acaaaaggtg aatctaacaa gtctactatt 4080 gagcaggaat taactgcaat ccattgggca attacgcatt ttagaccata tttatatgga 4140 agaaaatttc tcgttaaaac ggatcacaga ccgttagtgt atctgttttc tatgaggaat 4200 ccatcttcaa aattaaccag gatgcgttta gaactcgaag aatatgattt cgaagttcag 4260 tatgtacaag gcaaaaccaa cgtaggtgcc gacgcgttat ctagaatcaa tatagattct 4320 gaaacattaa aaaatttaca aattttaaga gtcgaaacga gagcacagaa aaaacaaaag 4380 gttcaacaat taagtaatga acaacaaaca aatgtgattc tagatgatga gcctgatcaa 4440 ctcaaaattt ttgaatcact taacatcaac gagatatacg aattacccaa acttaccatc 4500 gaaaagacgt ctaatattaa aaaccatggt ataatgtgca aaattatgaa caaaaaattc 4560 aataaggagt cggctcgagc gccatttatt cctctgaaaa tcaacaaaat gaaggaaacc 4620 ctaggtaaac tcttaacgaa aattgatgaa aaagcgcaca agttaaaaat tacacggata 4680 gctctagcgt tatccagtac aatttttaaa atgatgagta taaacgaatt taaaaaagca 4740 tgcaaccaac atcttcgttt tatacaaatt ttactatatg aaccccagca agaggttgac 4800 aacaaaaatg aaatcgcgaa aattatacaa gaaaatcatg attccgcaat tggtggacat 4860 actggtataa acagactata caaaaaactg aaaaattttt atatttggcc caacatgaaa 4920 aatacaatta aaaattacgt taataattgc attaaatgta aacaaaataa acattcttcc 4980 aaaacttatg aaaaatttga aattactcca actccccaga aaccgttttc aataatagct 5040 atggacacta tcggtccatt tactaaatca aactcaggta atagatacgc attaactatt 5100 caatgcgact tttcaaagta tattatcgtc aaaccgattc cagataaaca ggctgaaaca 5160 ttagctaaag catttattga aaactgcatt ttaatttatg gttctccatc cattatacga 5220 acggatcaag gaactgagta taaaaacgaa gttttcaata atatcaatca aatgttacaa 5280 attacacata atttttcaac tccgtatcat ccagaaacaa ttggtagtct agagagaaat 5340 catcgttgtc taaatgagtt tgtaagacaa tttgtcaatg agtcacataa tgattgggat 5400 gattggttgt cttattatgc attttgttac aacacgactc ctcactctga ttttccatat 5460 acaccatttg aacttatatt tggaagaaca gcaacgctac caaataatct aaaaaatcct 5520 aaagaaattg aaccaatata taattatgaa caatactact cagaactgaa attcaaaatg 5580 aaaactgcag cacagagaac acaagaatta attaacaaag caaaacttaa aagacaacaa 5640 aaccaagaaa atattgcaaa accatcgtca atcaaaaaag gagattatgt tatgatagaa 5700 aatgagaata ggtcaaaact cgataaaatt tataaaggtc catataaagt aacagagata 5760 aatcatccaa atgtaacgat atttgacgat aacaaaaatc aatattacac aatccacaaa 5820 aataaattag ttaaattcaa ataaaacaga aaacaatttt tttccttaag taagcataga 5880 atagtatatt ttcaactact aaactcaatt ttaaatcaac atttatagtt attaaataca 5940 taccaatcaa ttactatatt cattataaaa cttttataaa tgcaaagatt tttggagctt 6000 gaattttaaa aaaaactaaa atgatctcgc gatactcaat catttttctc ttagggggaa 6060 gg 6062 // ID BEL-19_AA-LTR repbase; DNA; INV; 510 BP. XX AC supercont1.314; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-19_AA_; KW BEL-19_AA-I; BEL-19_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-510 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.314; Positions 757398 757907. XX SQ Sequence 510 BP; 198 A; 80 C; 101 G; 131 T; 0 other; tgtgacgacg aaatcctccc cgcgatggag atggtctaac ggccgacagc tccgtttgtt 60 tcaagtggtc gaaatctgtc agcggttgcg atccgatgat gaatggaaac gaaaacgagg 120 ataaacaaaa acagcgtagt ggtagcatag tgtgcatagc tttcaacgag cgaattaagc 180 gaagttttca aagtaaaaac gtttatttca atcataaata tctttataaa cataacttaa 240 tactactatt tgcaattaaa agtaagtaac aatgaattta ttagcgaata gaaacataaa 300 aaatggaaat ttgtagccta cttaacctaa acgttgccta agactaagag ggaagaattt 360 cgaggtgcaa aggttactga aaaattgtaa gtagagaaaa taagtaatgt agtaacgtaa 420 gaacataaac taaacaataa ataattacag ctaaagccta tcctaacaaa cactatagcg 480 tttggattgc tattggaatc ggtgacaaca 510 // ID Zator-1_AA repbase; DNA; INV; 3907 BP. XX AC AAGE02018736.1; XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Aedes aegypti. XX KW Zator; DNA transposon; Transposable Element; Zator-1_AA. XX NM Zator-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3907 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR EMBL/GenBank/DDBJ; AAGE02018736.1; Positions 664 4570. XX FH Key Location/Qualifiers FT CDS join(1238..2647,2676..3644) FT /product="Zator-1_AA_1p" FT /translation="MDRAALYKKLYSSYEKANHLKPKRQIQQEVNEIWCKA FT KSDADTFERNINEAIDAAALKLSRNKAGLAKFWTLTQKAAPQELSQCSSQS FT LSKPEVCIENVKPTLPSPEPSTSSGLAPKRPAPAQERMRTQLADINQELLL FT LVGRKERDLLTEDDRQRMHRLRENKKVLETDLKKKIGDQIRSQKRRDDLKQ FT SFKELSEKIPEAKSILNLQATAGRPRVEDKQPEVLKAIVEIALYGSAAHDR FT RQDESYRSVRTLSDLTEAVNRHGFEISRSAMYLRLIPKRANSSEGLRHVKT FT VPVKLIKAQNDKHTQHQDGRFCTAAIHMLEKLAAMLGPREVFFVSQDDKAR FT VPLGLTAATKQSPMVMHVDYKVSLPDHDFVVAKEHKLIPSVYAAVKIQPDA FT MGKAEAVGYSGPTYIAIRSAKHCSSTANSHGSDYTRLLDLHEFEDFAKNDG FT EVKPVHVIVSDGGPDENPRYQKVVACDLHFFQVIKVAIHHFIQHNLDAIFL FT ATNAPGRSAYNRVERRMAPLSKQLAGLILEHEFYGSHLDSQMRTVDLELEK FT RNFSHAGQTLASLWSELQLDDYPVYAEYVESERSELSDLIDKDAGWFANHV FT RSSQYFLQVVKCHDKNCCKPFRSSLLTVLQNRFIPPPIPLYQTDLGLKAPN FT PANIGSKNFASLFVINSLDLSKILPNAIQNENIAFDTYCPSVFKRIKERTC FT KTCKLYFASNVIQKAHCQTHKGTSVSKKIRPKRVVCRRAEEVLVVTVDDEK FT EWIEIDDVEDSPIPNEDLSEEACPILALDTRMEIIWTTETD*" XX SQ Sequence 3907 BP; 1228 A; 761 C; 854 G; 1064 T; 0 other; gggaccgtcc ataaatgacg tagcttttat ggggggaggg gggggtttgg caatttgtga 60 cgatgtgtga cgataggggg gtaggggtcc acggtaagct acgtagcttt cagagaaaca 120 atgaaaaata tgaatgcaac acatgcagat gcgaaagcaa tgcacatgca gataataata 180 tgcatctcaa cagtcaccta actccgccat gtatcgtgag ccttttcaat ttttaactac 240 atcatgcgat tactcgaatg ttcatatatt ggtttgccaa gttcttgcta acgagaacat 300 tttttaaagt ccaagcagaa gatattgaag ctggtcaaaa tagttaatca ttatatttac 360 cgatcgctcg catgtttggc aatacgatct acaatatact caacttttta atggacgtat 420 acgacgatat atagattcga aaaaatatga gttactaaga cacacaacaa ataatacctg 480 attattaata caaaatatcc ttctcattta ttactgttcc atatttacga atatttttca 540 agtacaatta actctccctc actcgatatt ccgtatatat atccatagag aaccatagta 600 aaagttggtt ttcatggcta actcgatggt cccttggaac gcagttgcac tgcttttttg 660 ttctgtaact cgatacctcc ttaactcgat ggccccttca atatcgagta agggagagat 720 gacagtacca aaatatgttt ctacaatatt cacacttaca aaatattgaa tgaaatttta 780 ttcaatagag tgaaattatc gtttcttgac aaaaaaaaaa atacgacaaa aataattgag 840 attaggagta tttcatatgt tgtgctaaga tcgatgtcat cagcagcacg gtgtactata 900 ttgagaaggt gatatattcc gaaaactgac agctacttga cgacgtcatt cctatccacc 960 tcgaacaaat ttgcgaaaaa tatgatctat tggtccggaa ggtatatggt acgataggag 1020 tgacgtcaca tgtttacaat caatcttgat atttacatca gtaaagagac tgccttgtta 1080 atttttatgc cgtgatcagc agcatttgtt ttggttgagt aagcgtagcc tgtgtttgtg 1140 ttttgtcgta aaaatcatcc gcaaaaagtc gagtaagtta atcattggca atctctaaca 1200 actttttatc tcaaaatcat atttttacta cagaaaaatg gatcgtgctg cgttgtacaa 1260 aaaattgtat tcatcgtatg agaaagcgaa ccatttaaaa ccgaagcgtc aaattcagca 1320 ggaggtgaat gagatatggt gcaaagctaa gtcagatgcg gatacgtttg agaggaatat 1380 caatgaagct attgatgctg cagccctcaa gctttccagg aataaagcag gattggcaaa 1440 attctggaca ctgacgcaaa aggctgcacc acaggagcta agtcaatgtt cgtcccaaag 1500 cctatcgaag cctgaagtgt gcattgagaa tgtaaaacca acattaccat cgcctgaacc 1560 atctacatca tccggtcttg cccctaaacg accagcacct gcacaggaac ggatgcgtac 1620 tcagctggct gatatcaacc aagagctttt gcttcttgtt ggaaggaaag aacgagattt 1680 gctgacggaa gacgatcgac aacggatgca ccgactgaga gaaaacaaaa aagttctgga 1740 gactgatctc aaaaagaaaa ttggcgatca aatccgatca caaaagcgtc gcgacgattt 1800 gaagcagtcc ttcaaagaac tgagcgagaa aatcccggaa gcgaaatcga tcctcaacct 1860 acaggcaact gctggacggc cacgcgtgga ggacaagcaa ccggaggtgt tgaaagccat 1920 cgtagaaatt gctctttatg gctcggctgc ccatgatcgt cgtcaggacg aaagttatag 1980 aagtgtgcga acgttgagtg atttaaccga agctgtcaat cgtcatggat ttgagatcag 2040 cagaagcgca atgtaccttc gtctcattcc caaacgtgcc aattcttctg agggattgcg 2100 acatgtgaaa accgtaccgg tgaagctaat aaaagcgcaa aacgacaaac atacgcaaca 2160 tcaagacgga cggttttgca ctgccgcaat ccatatgctg gagaaattag ctgcaatgtt 2220 gggcccacga gaagtgtttt tcgtcagcca agacgacaag gctcgtgttc cactaggctt 2280 gactgctgcc actaagcagt caccgatggt catgcatgtg gattataaag tgtctttacc 2340 ggatcatgat tttgtcgtgg ctaaggagca taagttaatt ccgtcagttt atgctgccgt 2400 aaaaatccaa ccagatgcca tgggaaaggc ggaagcagtt ggttatagtg gtccaaccta 2460 cattgctata aggtcagcca agcactgttc gtctacagca aattctcatg gttctgacta 2520 tacgcggctc ctcgatctac atgagttcga agattttgcg aagaatgatg gagaggttaa 2580 accagttcat gttatcgtat ctgatggggg tccagatgaa aatcctcgct atcagaaggt 2640 tgttgcttag aattgtgatt tgaaattgtt aatgatgcga tttacatttt tttcaggtaa 2700 tcaaagtagc tattcatcat ttcatacagc acaatctgga tgcaattttc cttgctacta 2760 atgctccagg aagaagtgca tacaatagag ttgaacgacg tatggctcct ttaagtaagc 2820 agctcgctgg tctcattttg gaacatgaat tctatgggtc tcatttggac agccagatgc 2880 gaactgtaga tcttgagctt gagaaaagga acttcagtca tgccggacaa acactcgcct 2940 cattgtggag cgagttacaa ttggatgact accctgtgta cgctgagtat gtggaatcag 3000 aacgatcgga attatcagat ctgattgata aggatgccgg atggtttgct aaccatgtaa 3060 gaagtagcca atacttccta caagtcgtta aatgtcacga caagaactgt tgcaaaccct 3120 ttcgcagtag ccttctaacc gttcttcaaa atcgcttcat tccgccaccg atcccattat 3180 atcaaacgga tctcggacta aaagctccaa atccagcgaa tattggcagc aaaaatttcg 3240 cgtcattgtt tgttatcaat tcactggatc tttctaagat tttgccgaat gcaattcaga 3300 atgaaaatat tgccttcgat acttactgtc catcagtatt caaacgcatc aaggagcgca 3360 cctgcaagac atgcaaattg tactttgcgt caaacgtcat acaaaaagca cattgccaaa 3420 cgcacaaagg aacctcagtt tcaaagaaga tccgccccaa gagagttgtt tgcagaagag 3480 ctgaggaagt actcgttgtc acggtggatg atgaaaaaga gtggatcgag attgacgatg 3540 ttgaagactc gccgataccg aacgaagact tgagtgaaga agcctgtccg atacttgcac 3600 tggatacacg aatggaaata atctggacta ccgaaacaga ttgaaactga acgcttaaaa 3660 gaaatgttaa tatttcgtta taatatttaa atttctctac tttgttttta aataaatgat 3720 gaactcgaaa taattttata tgcaaaaaaa agttgaattt ttaaaggggt ttggaggggg 3780 ggggggtcag aagaatgcta cgtccttttc ataggggggg ttgataagtt tgtgacgaat 3840 tgctacgagg ggggaggggg gtgttaaaaa tagtcaaaat aaagctacgt catttatgga 3900 cggtccc 3907 // ID Copia-128_AA-LTR repbase; DNA; INV; 177 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-128_AA_; KW Ty1_copia_Ele170; Copia-128_AA-I; Copia-128_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-177 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 177 BP; 59 A; 37 C; 25 G; 56 T; 0 other; tgttgaaatc aaagcaactc agcaatattt gaattccgta ttatgtacag ttagtttccc 60 tgttgctatg atcattatat ataacaactg catagcaacc atagtagtat aaatatcccc 120 ctgtcgccag aaccagaaat cagtttctaa taaaccacca gtttgttagt gttacca 177 // ID DNA8-4B_AP repbase; DNA; INV; 264 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-4B_AP. XX NM DNA8-4B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-264 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1961-1961 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 264 BP; 82 A; 44 C; 44 G; 94 T; 0 other; cacagatata tattataata actagatacc cgactccata tacaacaaca ttattgaatt 60 atgatgccca acgccatttg tcactttggc aatgtttaca aattctaatc gttttgattg 120 gcaacgggct tgtttgatat gaaatgttgt tgtttattat catattacct ttctaacgat 180 tagtaaacat tgcccaagtt gtaaatggct gccagcgtca ttgataagaa gaagtcgggt 240 atctagttat tatatatatc tgtg 264 // ID Jockey-16_AAe repbase; DNA; INV; 4385 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-16_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4385 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1382-1382 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 180..1454 FT /product="Jockey-16_AAe_1p" FT /translation="MIRRNYKVPARDITSDTQGAKRQKTDTSHRHPECFSQ FT PVANVPISNEFSLLDGTTDMESQELPTNSGNSSQSQGKRRRFPPISIVNKG FT TKEVRELLNIGNIPQHVYHMKAIKSGVQLSISGNQEGQIHHPKIISTLKES FT SVEFFTYTAAENQPVKIILSGLPDYTTQQLKEELEANNVYAKDIKTFWQKK FT IGPEVSTLYLLYFEKGKVRLSELQRVKTLFSIVVKWRYFTRKTSDAVQCFR FT CQRFGHGMQNCHVSPLCVKCGEKHLSASCSLPVKADLSKVDPAATRSKIRC FT ANCSGNHTANYLGCVARKNYLQRREATKNNQHQRKPSKQAFVPTSANWPSL FT NGESTSEPTRLLHQNPRPFRSYSEVLTSSENEASAAHTNDLFTLTEFMCLA FT RDLFARLKGCRNKEQQFMALSELMVKYVYHV" FT CDS 1441..4119 FT /product="Jockey-16_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MCIMSNLQTLRVVNWNSRSVANKKIEFFDFLDRFHID FT VGIVTETWLRPTSSFFHPRYNCIRLDRRSSENERGGGVLIAVRRDIKFVAL FT NITTKSVECTGISIPTENGINVRFIAAYFAGVRTSTDWNQYRIDLRNMMRS FT NEPLFVVGDLNARHRKWNCIKANKAGNILQSVVAQCNFHIHYPDTFTFHPT FT GRGRPSTLDLVVSNNMVNMTKPIAHNELSSDHLPVTFDIHLNTEPTESVNS FT IRCYKRADWSKFQRVVNSKLDLLDPSYAALNSVADIDRAIDQLTQTLLEAE FT NVAVPLVNIRAYDTPTVPESTRLLIALRNRRRRQWMRTRDPIYKQIVHSLN FT SRIAEECNQVRFRKFRETLQTMEHGSNQLWKITKALRNNCKYSPPLRDNST FT IVASPALKAKVLADSFSKFHSNSMESDPQTTEDVNRTLEFLDQADPAPDNS FT WLVRPKDVQRQIRNLKNKKAPGQDQIRNVMLKNLPRKGIVLLARLFSACLK FT LCYFPSRWKHAVIIAIPKPNKDATLPSNYRPISLLSCLSKLLERFILIRIE FT AHLDVSRIIPNEQFGFKKGHSTCHQLVRLVKQVRSGFTQGKSSGMILLDVE FT KAYDSVWQAAILHKMNRGNFPIATTKIVRSFLKDRSFHVAVDGEVSERKQI FT PYGVPQGAVLSPILYNIFTADMVMIADVQYYLFADDTGFVASHRDASVVVE FT KLQQAQQSLEAYQRQWKIKANAXKTQAIFFSRKRSPRNLPQREIRANGQSI FT PWSEVVQYLGASLDRKLTFAQHVSNSLKKCDKLTKALYSLVNRRSALDVRT FT KLLLYKTVFRPTLTYAFQAWHTCSQSHRTKIQRKQNRVLKMMMNLDYLHPT FT DDLHEQAGIELIDDWFQRLLPKFLIGCRTSENPLLEELVE" XX SQ Sequence 4385 BP; 1312 A; 1079 C; 896 G; 1097 T; 1 other; ataaattggt aaacaaacaa ctgtcaaacc gtacctacta gctatcggaa cgctcttaaa 60 ttttcaacaa ttatcgattg tttagtcaat cactctcgcg ttacgttgtg aaagtggcat 120 ccagaaaact ggacaatgcc ggggcaaatg aaggtggatc gtccggttct tcctacgtga 180 tgattcgccg taattacaaa gtacccgcgc gcgacataac ctcggacact caaggtgcaa 240 aacggcaaaa aacggatacc tcacaccgtc acccggaatg cttctcccaa ccagtagcaa 300 acgttccaat ttcaaacgag ttcagcctgt tagatggaac caccgacatg gaatcgcagg 360 agctgccaac aaattcggga aacagttctc aatcacaagg aaaaaggaga cgatttcctc 420 ctatttctat tgtcaacaag ggcaccaagg aagttcgcga actactcaac atcggcaata 480 ttccccagca tgtgtatcac atgaaggcga ttaagtccgg cgttcaacta tccatcagtg 540 gtaaccaaga gggccaaatc caccatccga aaatcatctc aacgctaaag gaatctagcg 600 tcgaattttt cacctacact gcagcagaaa accaaccagt gaagataatt ttgtccggcc 660 ttccggatta caccactcag cagctgaagg aagaactcga agcaaacaac gtgtatgcca 720 aagatatcaa aaccttctgg caaaagaaaa ttggtcccga agttagtacg ctttatctgc 780 tgtacttcga gaaaggcaag gtcaggctgt cggagctaca aagggtgaaa acactattta 840 gtatcgttgt caagtggaga tacttcactc gaaaaacttc cgatgcagtg caatgtttcc 900 gatgtcaacg tttcggacac ggaatgcaaa actgtcatgt ctctccactc tgtgtcaagt 960 gtggggaaaa acatctctca gcctcatgct cgcttcctgt aaaagcagat ttaagcaagg 1020 tagatccagc tgccactcgt tcgaaaatac gttgcgcaaa ttgcagtggc aaccacaccg 1080 ctaactatct cggttgtgtc gccagaaaaa actatcttca acgccgtgaa gcaacgaaga 1140 acaatcagca tcagcgcaaa cccagcaaac aagctttcgt acctacgtct gcgaactggc 1200 catcgctgaa tggagagtcg accagtgaac caacaagatt gctccatcaa aatcctcgac 1260 cgttccgttc ctactcggaa gtgctaacaa gctccgaaaa tgaagcgtca gcagcacaca 1320 ccaacgatct cttcactctc actgagttca tgtgcttagc tagagacttg ttcgctcgtc 1380 taaaaggatg tcgcaacaaa gaacagcaat ttatggctct ttcggagctg atggtcaagt 1440 atgtgtatca tgtctaattt acaaacctta cgagtggtca actggaacag caggtctgta 1500 gcaaacaaga aaatagagtt cttcgatttc ctcgatcggt tccatatcga tgttggtatt 1560 gtcaccgaga cctggctacg ccctacgtcg tcgtttttcc acccaaggta caactgcatt 1620 cgactcgatc gacgctccag cgaaaacgag cgagggggag gtgtactaat cgccgtccgg 1680 cgagatatca agtttgtcgc gctgaatata accacaaaat ctgtggaatg taccggcatc 1740 tccataccta cggaaaacgg aatcaacgtt cgcttcattg cggcctattt tgcgggagta 1800 agaacatcga ccgattggaa ccaataccgc atcgacttga ggaacatgat gaggagtaac 1860 gagccgttgt ttgtggttgg tgatttgaat gcaagacacc ggaaatggaa ttgcataaag 1920 gctaacaaag caggaaacat tcttcaaagc gtggtagcgc agtgcaactt tcatatacat 1980 tatccagaca cgttcacttt tcaccctacc ggtcgaggca gaccatcaac tctagatctt 2040 gtcgtgtcca ataatatggt taacatgacc aaaccaatag ctcacaacga gttgtcatct 2100 gaccatctgc cagtcacatt cgatattcac ctcaacaccg agccaactga atccgtgaac 2160 tcaatacggt gctacaaacg tgctgactgg tcaaagtttc aacgcgtagt taattcgaag 2220 ttggacctgc tagacccgtc ctatgctgct ctaaatagcg tagctgacat cgaccgagcg 2280 atagatcagc ttacacaaac actgttggaa gctgagaacg ttgcagttcc cctagttaac 2340 atccgcgctt atgacacacc gactgttcct gagagcactc gattgcttat cgcactacga 2400 aaccgccggc gtcgccagtg gatgcggaca agagatccca tctacaagca aattgtgcat 2460 tcgctcaact ctagaatcgc tgaggaatgt aaccaggttc gattccgcaa attcagagaa 2520 actctccaaa cgatggaaca tggaagcaat caactgtgga aaataactaa agcattgcgg 2580 aacaactgta agtacagccc gccactccgt gataattcaa caattgttgc atctccagca 2640 ctgaaggcta aagtgcttgc tgatagtttc tccaaattcc attcgaactc aatggaaagt 2700 gatccgcaaa ccacagaaga cgtcaatcga actctggaat tcctagacca agctgatcct 2760 gctcccgaca attcctggct tgttcgtcca aaggacgtac aaaggcaaat tcgcaaccta 2820 aaaaacaaga aagccccagg tcaagatcag ataaggaatg ttatgctgaa aaatcttcct 2880 cggaaaggca ttgtcctatt ggctagactg ttctcggcat gcctgaaatt atgttacttc 2940 ccatctcgtt ggaagcacgc cgttattatt gccatcccga agcctaacaa agatgcgaca 3000 cttccctcaa actatcgtcc gatcagtctc ctttcctgtc tcagcaaact tctggaacga 3060 tttattctga ttcgcatcga ggcacattta gatgtctcta ggattatacc aaacgagcaa 3120 tttggattta aaaaagggca ctctacgtgt catcaattgg tgcgactagt taagcaggtt 3180 cgatctggtt tcactcaagg taaatcatcc ggaatgatac tccttgacgt ggaaaaagca 3240 tacgactctg tgtggcaggc agctatcctc cacaaaatga atcgaggcaa tttccctatc 3300 gctactacca aaatagtacg ttcgtttctc aaagaccgat cgtttcatgt tgctgtcgat 3360 ggcgaggttt ctgaaaggaa acaaataccc tacggcgtgc ctcaaggagc agtacttagt 3420 ccgattctat ataacatatt cacagcggat atggttatga tagcggatgt ccagtattat 3480 ctgtttgccg acgacacagg ctttgtggca tctcatcgag acgcttcagt agtggtggaa 3540 aaactgcaac aagctcaaca gtcactggaa gcgtaccagc ggcagtggaa aattaaagcc 3600 aatgctasca aaactcaggc catatttttc tcgaggaaac gcagtcctcg aaaccttccg 3660 caacgcgaaa tcagagccaa tggacagtct attccctggt cagaagttgt tcagtacctc 3720 ggtgcttcgc tggaccgaaa attgaccttt gctcaacatg tctccaacag tctaaaaaag 3780 tgtgacaagc taacaaaagc cttgtattca ctggtcaacc gacgatctgc cctcgatgtt 3840 cgtactaagc tacttctgta caaaactgtg tttcgaccaa ctttaaccta cgctttccaa 3900 gcatggcaca cctgttccca atctcatcgc acaaagattc aacggaaaca gaatcgagtt 3960 ctcaaaatga tgatgaactt ggactatttg caccccactg atgatctgca tgagcaagct 4020 ggtattgaac taatcgacga ttggttccaa cggcttcttc ccaaattcct catcggttgc 4080 agaacctcgg aaaacccctt gctagaagaa ctagtagagt agagctgtga tattgttttt 4140 ttttactatt agattcatcc tctttttttt cctctaactg tcactcacct attgcaatta 4200 cgaaggtttt ttctttctaa attttccttc ctaaagattg actaccatta aggaatttaa 4260 atgtcagtat attttgctta gttctacgtc actgctgtta ggtcaaccca tctcagtcaa 4320 aaactaaatg taacatgaat tgtaaactgt ctaagatctc aataaaacta taagtaagta 4380 agtaa 4385 // ID Gypsy-614_AA-LTR repbase; DNA; INV; 536 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-614_AA_; KW Ty3_gypsy_Ele126; Gypsy-614_AA-I; Gypsy-614_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-536 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 536 BP; 173 A; 131 C; 76 G; 156 T; 0 other; tgtggcatct ctgcatcttc attcacccat tctgctatca atatatcaac acatgaatcc 60 tccagggcaa gcactctttg ctttgccaag aatacgaaac ctactttcta gcaaaacaat 120 aaattacaga acacacttga cattcttctg cttcgaccca catctggtaa aagtagaaca 180 atgctgaatg tgtcttttca cccttactca atttacaaac attaccatta cgtctatgtc 240 tcttccgcac catgtgcgta acagacgacg caacgtttaa tgtttatgac caattgaaat 300 ccgaagcggc gagttgttaa aacatcgcct agtacatgtt ttcaaaacct tttccttcgt 360 tccatgtaag ctccaaactt ctctgacaaa ttctagatat aagcaaactt ctatgtatag 420 gtgaagcaaa ctaaactatg taactttata aaaaccattt gaaatgaaat atacgagaga 480 ttcttacatc agacaccgtt gagactagtg ccttctccga aagtacacac gccaca 536 // ID Copia-5_DWil-LTR repbase; DNA; INV; 250 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_DWil_; KW Copia-5_DWil-I; Copia-5_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-250 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 598120 597871. XX SQ Sequence 250 BP; 93 A; 38 C; 33 G; 86 T; 0 other; tggaataccc ctatttcaat aataacatta tgcaacactt ctaacgtggt gatttaacat 60 agagattatt tatcaagtaa ctacatctgg tcccactaaa tcttttcttt tttggtgaaa 120 accgtttttg tgtaaacaag tggtgtggtg ttgcaacgaa aatataatca aagaaaataa 180 aagacatcat aatataaaac ataaaataaa ctacagtatt ctttattgaa tagctttatc 240 tagttttcca 250 // ID RTE-4_BF repbase; DNA; INV; 3158 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-4_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3158 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3158 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1702-1702 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 2..3079 FT /product="RTE-4_BF_1p" FT /translation="TKGKKQTLRIGCWNVRTMTPGLSDDLQEISDARKTAV FT INNELLRLNVDIATLQETRLADSGTLRESDYTFIWQGKASQETRQHGVGFA FT IKNSLLDSTEPREKGTERILTLRLLTENGPLNLVSVYAPTLCSQEETKDEF FT YCQLQTTIQGIPKQEKLLLLGDFNARVGADYDSWPNCLGKFGVGNINDNGQ FT RLLEFCTINDLCITNSFFNTKPQHKVSWRHPRSKHWHQLDLVIVRRCHLSD FT VLLTRSYHSADCDTDHSLVCCTIRLTPKKIHRSKPPGKIRIDARKTQIPEK FT VEEFAEALEAALLKKPASENAEQRWSYLRDTLHRTALGVFGRKQGKTQDWF FT EEYASELNVVIEEKRTALLEHKRSPSQLTLQVLRTARSKVQRMARQCANKF FT WLDLCKSIQSSAETGNIRGMYEGIKKAIGPTQNGSAPLKSLSGETITDRGK FT QMERWLEHYSELYARENTINETALDSIETLSVLWELDAVPTLEEMSKAVDS FT LRSGKAPGMDGIPPEVIKSAKGVLLPELHDILCKCWEEGEIPQDMKDSNIV FT TLYKNKGDRSDCNNYRGISLLSIVGKIFARVVLGRLQRLADRVYPESQCGF FT RSERSTIDMIFSLRQLQEKCREQRQPLYIAFIDLTKAFDLVSRDGLFKILA FT KIGCPPKLLRMIQSFHTGMKGVVQFDGSSSEAFDIRSGVKQGCVLAPTLFG FT IFFAVMLKHAFGSSAEGVYLHTRSDGKLFNLARLKAKTKIREVLIRDLLFA FT DDAALAAHSVNKLQTLLDKFSAACQDFSLTISIKKTQVMCQGVDHPPAVTI FT NNYELEVVPKFTYLGSTVTDNLSLEAEINCKIGRAATTFARLQKRVWGNQK FT LTIHTKIAVYRACVLSVLLYGSETWSLYTSQERKLNTFHMRCLRRILHIKW FT SDYVTNVDVLTRANIPSMYTLLTQRRLRWIGHVCRMPDGRIPKDLLYSELA FT TGTRTRGRPHLRFKDVIKRDLKALDMSINTWESLAANRTLWRQEVKEGLKR FT GEAKQRLASEQRRSKRKNKQ" XX SQ Sequence 3158 BP; 968 A; 727 C; 733 G; 730 T; 0 other; gaccaaagga aagaaacaaa ctctcagaat tggttgttgg aatgtgcgaa caatgactcc 60 aggactgtct gatgatctcc aggaaataag tgacgctaga aaaacagctg tgattaacaa 120 tgaactcctg agacttaatg tagacattgc cactctacaa gaaacccgtc tggcagactc 180 agggaccttg cgagaaagtg actacacatt catctggcag gggaaagctt cacaggaaac 240 cagacagcat ggtgtaggtt ttgccatcaa gaactctcta ttagactcaa ctgaaccgag 300 agagaagggt acagagcgaa ttctcacact ccggcttctc acagaaaatg ggcccttgaa 360 ccttgtgagt gtgtatgctc caacactgtg ttcacaagaa gaaaccaagg atgagttcta 420 ctgccaactc cagactacca ttcaaggtat acccaagcag gaaaagctcc tattactggg 480 cgacttcaat gcccgcgtgg gtgctgatta tgactcctgg ccgaattgcc tcggaaaatt 540 tggtgttggc aacataaatg acaatgggca acgtctgctc gagttctgca ctatcaatga 600 tctgtgcatt acaaactcct tcttcaacac aaaaccacag cacaaggtat cctggcgcca 660 cccgcgctcc aaacattggc accaactaga cctggttatt gtgagacgct gtcacttaag 720 cgatgtctta ctcacccgct cctatcacag tgctgactgt gacactgacc actccctggt 780 ttgctgtaca atcagactta caccaaaaaa gatccatcgc tccaaaccac ctggtaaaat 840 ccgtatagat gccaggaaaa cacagattcc agagaaagta gaggaatttg cagaggctct 900 agaagccgct ctcctcaaga aacctgcgag cgaaaacgct gagcagaggt ggagttacct 960 cagagacaca ctccaccgta cagcattagg tgtctttgga agaaagcagg gaaagaccca 1020 agattggttt gaggagtatg caagtgaact aaatgtggtc attgaagaaa agcgtactgc 1080 acttctggag cataaacgct cacccagtca gctaaccttg caagtcctga gaactgccag 1140 gagcaaagtc cagcgcatgg ctcgacagtg cgcaaacaaa ttctggcttg atctctgcaa 1200 aagcattcag tcatcagccg aaacaggcaa catccgtggt atgtacgagg ggattaagaa 1260 agccattgga ccgacacaaa atggctccgc acctctaaag tctctgtctg gtgaaacaat 1320 taccgacaga gggaagcaga tggaaaggtg gctagaacat tactcagagc tgtatgcaag 1380 ggaaaatacc ataaatgaga cagcactgga cagcatcgag actctgtctg ttttgtggga 1440 actggatgca gtaccaacat tggaggagat gagcaaagcc gtcgacagcc tgcgctcagg 1500 taaagcacct ggtatggatg gaatacctcc tgaggtcatc aagagtgcta aaggtgtact 1560 gctgcctgaa ctgcatgata tactgtgtaa atgctgggag gagggagaga taccacagga 1620 catgaaagac tcaaacattg tcacattata caaaaataaa ggagatagaa gcgattgtaa 1680 taattatcgc ggaatttcct tactcagtat tgtggggaaa atctttgcac gggtggtcct 1740 aggtcggctg cagcgacttg ctgacagagt gtatccggag tcccaatgcg gcttcaggtc 1800 ggagcgctcc accattgaca tgatattctc cctccgacag ttacaagaaa aatgcagaga 1860 acagagacaa cccctttata ttgcgttcat agatctaaca aaagcttttg atcttgtcag 1920 cagagatggc ctattcaaaa tacttgccaa gatcggatgc cctccaaaac ttctcagaat 1980 gatccagtcc tttcacacag gtatgaaagg agtcgtccag tttgatggat cgtcctctga 2040 agcttttgac atccgcagcg gggtaaagca gggttgcgta ttggccccta ccttgttcgg 2100 cattttcttt gcagtcatgc tgaaacatgc tttcggctct tcagccgaag gagtttacct 2160 ccataccaga tcagatggga aacttttcaa cttggccaga cttaaagcca aaaccaaaat 2220 ccgtgaagta cttatcagag atcttctatt tgcagatgat gctgcactgg cagctcactc 2280 tgtgaacaaa ctacaaacac tgttggacaa attctctgct gcctgtcagg acttcagctt 2340 gaccataagt ataaagaaaa cacaagtgat gtgccaggga gtggatcacc ccccagcagt 2400 caccatcaac aattatgagt tagaagtagt tcccaagttt acttacctag ggtccactgt 2460 gacagacaac ctctccctgg aagctgaaat aaattgtaaa ataggacgag cagcaactac 2520 ttttgccaga ctacaaaaga gagtctgggg aaatcagaaa ctcacaattc acacaaagat 2580 tgctgtttat agggcgtgtg ttctcagtgt tctgctttat ggcagcgaaa catggtccct 2640 atacacaagt caggaaagga aactcaacac atttcacatg cgttgcttgc gtagaatcct 2700 tcatatcaag tggagtgatt atgtcaccaa tgtagatgta ctcacacgtg ctaatattcc 2760 tagtatgtac acactgctga cgcagcgaag acttcgctgg attgggcatg tgtgccgtat 2820 gccagatggg aggattccaa aggatttatt gtacagtgaa ctggctactg ggacaagaac 2880 acgtggccgt ccccacctac gttttaagga tgttatcaaa cgggacttaa aggcccttga 2940 catgagcatc aacacatggg agtcacttgc agcaaacaga actctttgga gacaggaagt 3000 caaggaggga ctaaaacggg gggaagcaaa gcagagactt gcatcagaac agagacgctc 3060 aaaacgtaaa aacaaacaat aagtacattg tgtagaattt atatgtagta ctactcatgg 3120 tcctgcgaga ccgacgagtg cctttactta cttactta 3158 // ID hATm-3_AA repbase; DNA; INV; 4130 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2010 (Rel. 15.11, Last updated, Version 2) XX DE hATm-3_AA, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hAT superfamily; KW Autonomous DNA transposon; hATm group; hATm-3_AA. XX NM hATm-3_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4130 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1048-1048 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM, hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-3_AA is a young family of hATm autonomous DNA transposons CC identified in the mosquito genome. The consensus sequence was CC built based on multiple alignment of 4 copies that are 1-10% CC divergent from the consensus. TIRs are 140-bp long. CC The region 1436-2448 is an inserted CR1-82_AAe. XX FH Key Location/Qualifiers FT CDS join(400..649,1084..1325,1655..1771,2208..2355, FT 2431..2628,2685..3853) FT /product="hATm-3_AAp" FT /note="transposase." FT /translation="MARGRLIKFVGPQCEVIKGARLPTVRQVLAAFYYQLR FT WLKLSIRSSARKAVRQVLSYWKKNEIPITSEIQCILKVEALHAELSDNSVV FT EFESTSEDSEKEEVEHSESIRKSQRGTQNVMTLELVQALDAAKASSDAAAV FT ILAAAAKAFGVDLEKKLILTVLPFIKKARCNRKWCDEIDVGSQLCCESIWR FT RENRFGIATLECSSKMIGRSIRWKVIAEHFVMLIVSDFFGTNLQIDLRGLA FT SDDNPQLFELRQRTVRDQVERQAVLVTGKRTVQLLGVPFLEKGTGKAIGEA FT VIELIEKWNLNDRIKAMSFDTTSVNTGWKNGAASCIERHLETNILWLPCRH FT HIMELVLESVFHTIMGPTTGPDVTIFKKIQSNWNSFDKDAYEVPLISGFPD FT FLKNRVSQIVCFAENSILVRILSRILVLRCYFLHYQLFLQLGQPRDDYREF FT LELTIIFLGRVPPRGIQFRDPGPVHNARWMAKGIYCLKMYLFRAQFPQLEN FT TDDLFSVCVFIVGVYIEAWYSAPFAASAPRSDLKLIADLKSFGSFHSDAAG FT AAKKIANHLWYLSEELVGLSVFDTSLSFEEREKLVESIKNKQGNDDPPKKR FT NVTELSICLDELATTNTMEFFKILNIRTDFFDQPAEEWSTNVEYRRGEEII FT VNLKVVNDHAERAVKLMQDYNRNVSRKEDDFQNLLLTVEDLRKKLPNKQKA FT TIRNFINT" XX SQ Sequence 4130 BP; 1313 A; 771 C; 878 G; 1168 T; 0 other; taggccgtcc cttatttttc gaaatttcga aaagttaaaa gttcgttatt ctaaagtgct 60 cgttttggtg agaaaaaaat catgtaaaat tatgaagtcc gtattttgac gttaagtggt 120 cccccaacga ccctaaaggt attagatgta ggtgagcccc tcacgcagcg aaaagctcgc 180 tactctagtg cacgcagcat gagtagcgat atccctttcg atacgcactc aaccgcgcac 240 gttgagcgtt agcaaaaagc tttatttgtt tgtatttgtg aacgatttca caacgtaaac 300 aattacacag ttatgcattc cataaaacac gagattagtt tagactagca catcggtacg 360 tacgattgca ttttttaagc agtgaaatta atcgtaaaca tggcgcgtgg cagattaatc 420 aaatttgtgg gacctcaatg tgaagttata aagggagctc ggctcccaac agtccgtcaa 480 gtgttagctg cattctacta tcaacttcga tggctgaaac ttagcatcag atctagtgca 540 aggaaagctg ttagacaagt tttaagctac tggaaaaaaa acgaaatacc tataacgtct 600 gaaattcaat gcattttgaa agtcgaagca ctgcatgctg agttatctgg tatatgtaaa 660 catacactgc gtaaaaatag caatgctcaa aaggaacgcg agcttcggtt ctgcaataaa 720 cttgttggcc tcttcgatat atgtatcaaa aagtacggca atattggaca tgcaaaaaac 780 tgtcaataaa tcgaaaaatt tgacgttgaa gcgagcaact caagaaaaca tagatttctt 840 gagagaccag aggggaggaa atcgcataaa tacgcttgat gatgcacatg aggttctttc 900 tgcaaatatt cgtaaaaagc agcaaaaaat gaaagaaaaa aagatcctag atgcgcgtcg 960 gatgagggct gcagctcaat caattattgg taatatgttt gttaattttc accatggatt 1020 tttagttgaa acctttgata atatacaata ctttaaacat ttttttttta ttatttactt 1080 cagacaactc agtagtagag tttgaatcta ctagtgaaga ctctgagaag gaagaagtgg 1140 agcactctga atctattcga aagtcacaga gaggaacaca aaacgttatg actttggagt 1200 tagtacaggc tcttgatgca gcaaaagcca gctcagatgc agctgcagtc atcctagctg 1260 ctgctgctaa agcatttggc gtcgatttgg aaaaaaaact aatattaacc gtactaccat 1320 tcatcgtgag cgcgaaaagc taagagctga attggcgtct cacttgatgg aaaattttaa 1380 tccagatgat attctcacgg tccactggga tggaaaaata gtgacttctt tatctgttta 1440 tctgtttatc tgtttattat tgtaactcat cggacttatt agtctaaatg ataataaaag 1500 gagcaattaa gataactaac aatacattat cattaacatt caaaaagtga gaatcacact 1560 agtcgtaatc gcataatacg gttgacaaaa cgtttagcag gttcaccgaa ctcatattcc 1620 tcttccacaa ttgaaaaggt ccttaacatt gcagaaaaag gctcgttgta accgaaagtg 1680 gtgcgatgaa atcgatgttg gatcacagct gtgttgcgag tccatctgga ggcgcgaaaa 1740 tcgattcggg atagcaaccc tggagtgttc agttcgttac tgataatttt agctgcagtg 1800 gaaatctgtg caatattacg ccgacgttcc aacgtgtcca gattgagtaa ttggcaccta 1860 tctggatagg gaggtaggtt tacagggtca cgccatggta aacttctcag cgcaaaccgt 1920 atgaaccgtt tttgaaccct ctcaattctc aagttccaag tgagttggtg aggagtcaaa 1980 caaccgaggc agtctctaaa attgggcgga caagggcaca atacaatgac ttaagacagt 2040 agggatccgt gaagttttgc gcaattttag aaataaaccc aagctggcga tttgccttgg 2100 caatgatttc cgtcatgtgg ggtatgaagg tcagcttagt gtccaattga acaccaaggt 2160 cgcatacttg atcaactctg ttcagtagta taccatcgat tttgtagtca aagatgatag 2220 ggcgttcaat tcgatggaaa gtcattgcag agcattttgt gatgcttatc gtcagtgatt 2280 tcttcggcac caatttacaa atcgatctaa gagggcttgc aagtgatgac aatcctcaat 2340 tgttcgaact gagacgtaga tttttgcatc atccgcaaat atggatttac aatttggtcc 2400 aaggagcaac gcagcatcat taatatacag agagaacagt aagggaccag gtggaaagac 2460 aagcagtact tgtgacgggt aagagaactg ttcagttact tggagttcct ttccttgaaa 2520 agggtactgg taaagcaatt ggagaagctg taattgaatt gattgagaaa tggaacctta 2580 atgatcgtat caaagccatg tccttcgata ctaccagcgt gaacacaggt aatcgtcgaa 2640 tatttaacag ttatcaaacc ttaatattca ttatattttt caaggatgga aaaatggagc 2700 tgcttcgtgt atagagcgcc acttagagac aaacattttg tggctgccat gcagacacca 2760 tattatggaa cttgtcctag aaagtgtgtt ccatacaatt atgggaccaa caaccggacc 2820 tgacgtaacc attttcaaaa aaattcaaag taactggaat tcgttcgata aggatgcata 2880 tgaagtacca ctaatatccg gtttcccaga tttcttgaaa aatcgcgtgt ctcaaatcgt 2940 atgttttgct gaaaacagta tccttgtgag aatcttatcg agaatattgg tattgcgatg 3000 ttacttctta cattatcaat tatttttaca gcttgggcag ccaagagatg attatcgtga 3060 gtttctagaa cttaccatca tttttctggg ccgtgtcccc ccaagaggaa ttcagtttcg 3120 tgatcctggc cctgtacaca atgcacgctg gatggccaaa ggcatctact gcttgaaaat 3180 gtatctcttt agagcacagt ttccacagct tgagaatacc gatgatttgt ttagcgtatg 3240 tgtgtttatc gtgggagtat acattgaagc ttggtattcg gcaccgtttg ctgcgtctgc 3300 accacgtagt gacctcaaac tcatagctga tctaaagagt tttgggtctt tccattcgga 3360 cgctgctggc gctgctaaaa aaatcgcaaa ccatttgtgg tatttgagtg aggagctagt 3420 tggcctttca gtgtttgata ctagtttgtc gtttgaagaa cgagaaaagc ttgtcgaatc 3480 aattaaaaac aagcaaggaa atgatgatcc acctaaaaaa cggaatgtca ccgagctaag 3540 catttgcctt gacgaactgg ccacaacaaa caccatggag ttttttaaga tactgaatat 3600 tcgaactgat ttcttcgatc aaccagcgga ggaatggagt acaaatgttg aatatcgaag 3660 aggagaagaa attattgtta atctcaaagt agttaacgat cacgcggaaa gagcagtaaa 3720 actgatgcaa gattacaaca ggaatgtatc ccgaaaagaa gacgattttc aaaaccttct 3780 gcttactgta gaagatcttc gaaagaagct gccaaacaag caaaaagcaa caattcgaaa 3840 ctttattaac acatgaaata aatatacttt tcaaggaata tgatatgata catatattac 3900 tttcatccga ctagtaccct atcgatgcct aaggcacata cgttagagca gcgtgctcgt 3960 aggcgtgtcg gggggtggcc tataccagat acctttgggg ccgttggggg accacttaac 4020 gtcaaaatac ggacttcata attttacatg atttttttct caccaaaacg agcactttag 4080 aataacgaac ttttaacttt tcgaaatttc gaaaaataag ggacggccta 4130 // ID BEL-196_AA-LTR repbase; DNA; INV; 456 BP. XX AC supercont1.77; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-196_AA_; KW BEL-196_AA-I; BEL-196_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-456 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.77; Positions 1299332 1298877. XX SQ Sequence 456 BP; 139 A; 81 C; 106 G; 130 T; 0 other; tgatagaaac tagaatcagt tgctcaaaat atacatgcag tttgaccgcg gctcatggaa 60 taaccgattc agtttgagct attttgtgtt ttcgcccggt ttttctggca actctgaacc 120 cgatctgctt tgctcgtata cttctatata catgggatac ccactagccg acctctagca 180 aatcaactgt aggacagtag gattagattg attttgatag ttgaagagtg tgcaagtaaa 240 gtgacagaga agaaagtgaa agttgttttt tcttattgga ataaagtgga ctgtagaagt 300 agtgttccat gtgtattact gttgttgcct gctgaagaga atcaacttac ctgttgctgc 360 tgtgatctat ctgcacggag aaaaaaggga cctgaaaaga agcaaatcaa acacaacgta 420 gggaaacgaa aatcggcctg gcggtaagat tcacca 456 // ID Copia-19_AA-LTR repbase; DNA; INV; 279 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_AA_; KW Copia-19_AA-I; Copia-19_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-279 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 948-948 (2011). XX DR [2] (Consensus) XX SQ Sequence 279 BP; 74 A; 71 C; 55 G; 79 T; 0 other; tgttgcagta tagcaatcac caactcccac cgagtacaac cctgtgcagc gggcgagtga 60 gccgcatcag tgtaacactc tttcgcttgc tgtacacacc catacatacg agtcgggcta 120 accaccgacg gtaccatcag agtgttcttc attcaatatt cgactgtgta acagtaaaga 180 taataaagtt gtttctttcg ttcattcttt ataatagtaa aaccgcgttt gtgattcccg 240 agagaaaact gttcccactg ttttgctagt ctgccgaca 279 // ID ISL2EU-2_CS repbase; DNA; INV; 4178 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-2_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4178 BP; 1327 A; 817 C; 710 G; 1324 T; 0 other; ggggttaaac ggcttgaatt ccgcatggtg gcgcaacctg aatttgggtc cttcatagca 60 acagcataag tagtaatcta cggtaatagg cctatgctgt caacgttcag taatttatta 120 gcaagcaagt tcgcctttta aatatgcctc aatactgttg cgtgccacaa tgcaaaagtc 180 gaaagggtgg ccacacattt ccgaatgcag acaaaaaaaa agaattaaga atgcagtgga 240 gggtagcaat aaagcgaaat tttcccggaa ccaacaaatt atggaatgct ggaccatctg 300 atcgtgtttg tcaagatcat tttaaatcaa cagattactc ggaatttaca ggtactttac 360 aaattacaca ttttgaaatt acaaccattt ttgcattgcg tcgggatgtg aacctgaata 420 cctcgtcaag atctgccata ctttccaggc tggtccatag tgtgtaaaca tacacaaatt 480 atatagacct agctaacata atttaatgac tatgcttttt atttcctaga aaaacagaag 540 agacttctac cctcagcggt accttctata tttccattta aaaaaccagc atccaaatca 600 agctgcagaa gatcaaagcg tgcagatccc attacattat taccggctcc agtaattgtt 660 gaggaggtgc ctgagccgca acttgaaatt attgtgcaga atgttgcccc tgtatgtgaa 720 aaagcacagt cctgttcttc tgctgctact caatgtgacc tacctggact agcactttca 780 cctttaaagc tatataagaa aaatgatgca gctatgcatt actactgtgg tttcaacaat 840 tatagccatt ttaaattttt ttttaatgtt ttaggtcaag ctgcttacaa tttagaatat 900 cagtgtgtaa aattaaatcc taaagaacag ctttttttga cattaatgaa attaagacaa 960 tgtaaggaag attttgaatt gggacttatg tttaacattt cacaaactac agttagttct 1020 attgttgtaa cttggatcaa ttttatgtat tttcaactta aggaaataaa tatttggcca 1080 aattcccaaa ctgtacaaga gtatatgcca acagatttta aaatgaaatt tcccaaaacc 1140 agagtgattt tggatgctac agagtttcct atccacaaac ccacaaatgt aaactcacag 1200 agtgctacct tttcaagcta caaaaacaaa aacactttaa aggttatgat tggatgttca 1260 ccgaatgggt tggtgtcgtt tatttctaat gcttatggag gaagaacaac cgatcgccaa 1320 ataattgaaa ggtctgcact cctgaaacaa cctatgttta ctacagaaga tagcatcatg 1380 gcagaccgtg ggattatggt tcaggatttg tttgcaagta agggagttca agtgaacaca 1440 ccacatacat tgcaaggaaa gacccaactg tctgctgagg aagttgtctg tgatcgtcgt 1500 attgcatcaa agagaataca tgtggaaaga gtcatcggat taagcaaaac atttaaaata 1560 cttaaacagg atttgcaaca cactaaggtt aacctaggag gaagaatagt gtatctatgt 1620 ttcatgatat ctaattttag aaactctatt gtgagcaagc ttgcttgaaa tgatttgtac 1680 caactttgtt cttttgtttt tattttaata aaaaaaatca atcattcaaa acagctttat 1740 ttattattgt agtgattttg caaatgaagc tacatgcttt ctaaaacagt taaaataaaa 1800 agctttaagt ttaggaagca tactttgctc ccaaaaaaat ttatcaaagc cgattttttc 1860 aacaaacaaa ttttgctcat taacacatgg tgtaaacaca ataaaatagc accattttct 1920 atcacttata gctaattgac cctgtatttg attatagtac gaatgtgacc tattcaagca 1980 tactttgttt tcaatgagtt ttaaaaatga aaactctttc ccaggctgaa tatatttgtt 2040 ttctcgtcca atgtatgggc acttaacttc tacaatgtgg tctgcatcaa taactccatc 2100 tggtgtagca ccgagaaagg gccaagctgg atttacaaat aaccctgatg ggtgcactgt 2160 cagttgtgtt tttttgctaa aaagatcgac agcggcggat tcgaactgtc gaccatgtag 2220 tattggtaaa gtgtttaaca atggtggatt aaggatatct ccacaaaggc tttccatgtt 2280 tcttttctct gtggcgtgac atacagctcc aaacttactg gctgtaatcc tccattgacg 2340 ttctatgtgc cacacttcac tctgtgcctg gcgccgtgtt cttttctcta aattaatgga 2400 gtcttccatt gatacctaca gatagtaggt aaattggtaa tacttaaaat taacaattag 2460 tacggtaaca caataattaa taaaccggat tttgttaaaa ataataccca ttcattatta 2520 caatattatt ataaattata atagtaatct cttaccgcca caccttgatc cacccaatac 2580 tctgtaaatt tgttcttcag ataatcatga tctttaacag catcagatag gcttgctctt 2640 gaatacaaat agcgcatagg catatctgtg ttacttccgc cgcagaagtt aaacaccagg 2700 tttcgaacgt ggtcctcata cccagaagca tttctgtact gtgctggtct ggggtcatcc 2760 atcgaagcac ttctttttcc acctagttgc tcagctttta ttggtgaacc tataaaaaat 2820 aaaacaacta actagatttt ttttagatgc acataaatca gtttctatac aatatgttaa 2880 aaactatcat acatttcata aatgggaatt tcaaatattc atatgtataa taactatatg 2940 tatgtatgta taataagaaa acactcataa ataccattgt aatcacttaa tggtctatga 3000 aatgtctgga gattgtctgt acaactccta agcaatcgca gctctccagt atctttaaaa 3060 cactgcaaca tgagtaacaa agcagcgatg tgtttgcaag ttccatgtgg ccctgttccc 3120 gatgcacatt cacaatcaga gtttattatt cctatgtccg atactttcag tttggtgtag 3180 taacatacct gaaacacatt aactatacgg tactttgttg gtttttagaa tttaaactat 3240 atcaccattg tatgtacaac aagtctacac taacccgttt ttgcatcgat gccctgatca 3300 aggccgagaa gaaaacatgg ccattttcat gagttgtgtt gctgctgcag gtctctgtca 3360 tactgctttc aaacatcagt ttcccctttg tgatagcttt tatgtcattg gtacaaagac 3420 gatcgcaggc caatctatac cgaaagtatc catccatatg tgcatcggta attgttggta 3480 ctaactggcg atgctctgtt gttaattgtc taaaaccttc aagtggccaa ttcggaaaaa 3540 tagcttgggg aatatcgact ggcgttggac gaaagtcatt atgcttctga tatgattgaa 3600 gtctgagaaa caatgttata cattaaaacc cccaagccca ctgtatacac ttctctgtct 3660 gtttctgtat aaatctactt atctatcata aactctacct tcatttagtc caattacctt 3720 tcaataagct catgctttct tcctgaaatc tttgctcccc tttttctcaa ctctgcttgt 3780 aactgtctca cattcattat aaaaaaaact caaggctaag atgttgtaat ttaaccaaag 3840 gtctgcggaa taatatattc taactaggac atctgcccgt cgctgttctg ctatcaatac 3900 aatcaccaca tgctgcaatc tgcactaaac taagctggcc tatactaagt ttgtcaatag 3960 atagattcta gtttaaggat atatatcttc tttaagttca tatacggaac ttaattcaga 4020 actgcggtct aaaatcctct aatgttatgt aactatcata ccggtaacac acaaacccta 4080 caaaacgtgt ggagaatatc agctttagtt tttgccgcca gacttaggac ccaaattgct 4140 caattgcgcc ccaagtggag ctatcgacgg cgaacccc 4178 // ID Copia-29_AA-LTR repbase; DNA; INV; 209 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-29_AA_; KW Copia-29_AA-I; Copia-29_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-209 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 956-956 (2011). XX DR [2] (Consensus) XX SQ Sequence 209 BP; 53 A; 53 C; 36 G; 67 T; 0 other; tggtgaattt accatcacag ctactcagtg ggctatgttg cccccatcga aggtgacacc 60 actgctgaga gaaataacaa agacaacaac acttccgttt ttcattccat gtttcttttc 120 tcgtgaataa acacgctttt tatttgaact cgctaatttc gtcagtctcg ctagtttttc 180 cttgtgtggc catccgaaga accacttca 209 // ID Gypsy-32_DPu-I repbase; DNA; INV; 4401 BP. XX AC scaffold_53; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_DP_; KW Gypsy-32_DPu-LTR; Gypsy-32_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4401 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_53; Positions 725566 729966. XX CC Positions [2931-3293] - Integrase core CC 'CCAGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 627..4085 FT /product="Gypsy-32_DPu-I_1p" FT /translation="MSTRQAPSPAAKVHAVEEETHWIGDVDRGGTAMALPR FT SVEESYYVSHSLTSSGGESEWRQEVTVDGVKVNFKLDSGATCNILPYESFS FT RLPKTRRCLRPGPIVRSYRSQDGLLRVLGLHTAKVVHRGALFVIDFVVVDE FT PGQPPLLGLPSCDKLNLIRRVDAVQSPVEAPLPPIVVEFMDVFEGLGKLPV FT EHDIRLLSGANRVDPVVCAASRLPFRLEDRVFKKLDEMVNDKILTPVQEPT FT EWVSRMMVVGKPDGDVRICLDPFELNKAIQRQHFAVPTIEQLFSKLGKARY FT FCSLDAASGFYQIPLSDAASYLCTMATPKGRYRFLRLPFGLKSAPEIYLQT FT MNDLFGDLSGVLIYFDDFLVTGETEEELLANLRQVFVRCRLHDLKLQLKKC FT KFFLQELPWLGHVIGQGILKPDPLKISAVVEMPDPTCPADLVRLLGMVTYL FT DKFCQNLAGLTRPLRDLLKADAAWVWEEPQKMALGQLKSALSSLPVLRLFD FT HSLPLVVSVDASPVGIGAVLLQNNQPIAYSSTSLTDTQKRYFQIEKELLAV FT QFGLMRFRQYVYGQMVVVLTDHKPLVGLLEKPIASCSPRIQRMRLQLQRFD FT FQLVYKPGKELFIADTLSRAPSPRLFTDDVTQDCEEQVHAVLDLIIPHDST FT RVKFAAATMADPTLRLLKEVLRRGWPDHKAQCPVAVKPFWPVRHHLSEADG FT LLLSGSRLVVPISLRQEVLAGIHDGHFGEVKCILRAKSAVYWPGCDEQIRN FT MVASCSTCQTHRHRNPAQPLRPVPLPVHAFQWVSADIFLHGGVNYLLIVDA FT YSKWPACVPLRSLSSSSVVAEMDRIFSDFGVPEIVMSDNGSQFDCAEFREF FT CEGRAVRSVTSSPTYAQSNGLVERHIQTVKKTLLKMFTDGRSLWEALAAIR FT STPISSELPSPAVLLQGRHLRGSLPFLQDRLVPQFVSAKFVHGQLQRRQAT FT ACFTHGGRPDVRGSALIVGQRVRAFVSGLWLPGAVEAVCSEPDSYVVRLAD FT GRAFRRTRRDINLDNSPSAGLAVVQQSGGAAVIPSAVRGYRPAPANHLLPA FT LSWSPPAHAVSVPVAQPLPVALGQPSMMPQPPPVFVNPAPGPATPARPRAA FT PVSTDQRATVQLPAVQSTLTASSPLAVAPHRPGSTRSGRPYLKPS" XX SQ Sequence 4401 BP; 856 A; 1181 C; 1159 G; 1205 T; 0 other; tggtgtcaga agtgtgctct cacaactcga cggtgttgag tcgtaaccgc gtggtttgtg 60 agtattgaac tgcatccttc tttatcgtac gcatcctccg tgttgttgaa ctgcatccgt 120 ctttatctag tgcatccgtt tgtcttgcac cattttcttg aaatgtcgtc ttcgttgagg 180 actcctgaac ctttttcttt cggggccagt gatttggcgg cccagtgggg tatttggcgt 240 aaacagtttt cgtggtactt ggtggctaca aatagcggcc ttaacgtgga tgaagagcaa 300 atggtgggcg tgctcatcac tcttcttggc agtgagggcc tcaaaattta cgaaactttt 360 gtgttcaatc cagccaccga tgccattaaa attcagagtc gacctggtgc ccggaagttg 420 agagaaagcc ggattcatca ttcagagctc agcatgagca gcagccgtcc gctaatcaac 480 acccagccca gcagtatccc aagtgcaaag attgcggtcg ttcccataga aaaaatcagt 540 gtcgggtcac caatgttacg tgttttaatt gcaacgtggt cggccatgta tctagttgct 600 gtccgaatcc acccaagccc cgccagatgt cgacacgtca agcaccaagc cccgcggcca 660 aagttcatgc tgtagaggag gagacgcact ggattggtga tgtcgatcgt ggcggtacgg 720 cgatggccct cccgcggtca gttgaagaaa gttactacgt ctctcattcc ctaacgtcat 780 ctggtggtga gtctgagtgg cgccaagaag tgacagtaga tggcgttaaa gttaacttca 840 agcttgattc gggcgccaca tgcaacattc tgccgtatga gtcattttct cggctgccca 900 aaacacgccg ctgtcttcgc cctgggccca ttgttcgcag ttatcgttca caagatggcc 960 tcttacgcgt gctcggattg cacacagcca aggtggttca cagaggcgct ttatttgtca 1020 tcgactttgt cgtagttgac gagcctggcc agccacccct tctaggactc ccgtcttgcg 1080 acaagctgaa cttgattcgg cgagtcgatg ctgttcagtc gccagtagaa gctccgctgc 1140 cgccaatcgt cgtggaattt atggacgtct tcgaaggatt aggtaaatta cctgtcgaac 1200 acgacattag gctgctgtcg ggtgctaatc gcgtggatcc tgttgtgtgt gcagccagtc 1260 gacttccgtt ccggctggag gatcgcgttt tcaagaaatt ggacgaaatg gtcaacgaca 1320 aaatcctcac ccccgtgcaa gagccaactg agtgggtcag ccgaatgatg gtggtgggaa 1380 agccggacgg tgatgtccgt atttgtctgg acccgtttga gctcaacaag gctattcaac 1440 gtcaacattt tgccgtcccc accattgagc agctgttcag caagttgggt aaggctcgat 1500 atttttgcag ccttgatgct gcgtctggtt tctatcagat tcctttgtct gatgcagcgt 1560 cctatttgtg cacaatggcc actccgaaag gccgttatcg cttcctccgg ctgccgtttg 1620 gattgaagtc agcccctgaa atttatcttc agacgatgaa cgacttgttt ggagacctgt 1680 ccggcgtact catctacttt gatgatttcc ttgtaacggg tgaaacggag gaggaacttc 1740 tagccaacct tcgtcaagtg tttgtacgtt gccgtctcca tgacttgaag cttcagttaa 1800 aaaagtgcaa attcttcctc caggagctcc cgtggctagg ccatgtcatt ggccagggaa 1860 ttttaaagcc ggaccctctc aagatttcgg cagttgtgga aatgcctgat ccaacttgtc 1920 cggcagatct cgtccgtctc ctgggcatgg tgacatacct ggacaaattc tgccaaaatc 1980 tcgcgggtct gacacgaccc ttacgtgact tgctcaaagc agacgcagcg tgggtgtggg 2040 aagagcccca gaagatggcc cttggtcagc taaaaagtgc attgtcgtct cttccagtgc 2100 tacgactgtt tgaccactca ctgcccttgg ttgtgtccgt tgatgcgtct cccgtcggca 2160 tcggtgcagt gttgcttcaa aataaccaac cgatcgcgta ctcgtcaacg tcgttaaccg 2220 acactcaaaa acggtacttt cagatcgaaa aggagctgtt ggcggtccag tttggactca 2280 tgcgcttccg gcagtacgtc tatggccaaa tggtggtggt tctaaccgat cacaagccgt 2340 tggttggcct cctggagaag cccattgcgt cttgttcccc gagaatccag cggatgcgtc 2400 ttcaactcca gcgatttgat ttccagctcg tctacaagcc gggcaaggag ctgtttatcg 2460 ccgacacgct tagtcgggct ccgtcgcccc gcctctttac cgatgacgtc actcaagact 2520 gtgaggagca ggttcacgct gttcttgacc tgataattcc tcatgattca actcgtgtca 2580 agtttgctgc agcaacaatg gcagacccga ctctccgtct cctgaaagaa gtccttcggc 2640 gtggatggcc tgatcacaag gcccagtgtc cggtggccgt taagccgttc tggccagtac 2700 gccaccactt gtctgaagct gatggtctgc tactcagtgg cagccggttg gtggttccga 2760 tttcgcttcg gcaggaagtg ttggccggaa tacacgacgg ccattttgga gaggtgaagt 2820 gcatcttacg ggccaaatcg gccgtttatt ggcccggatg tgacgagcaa attcgcaaca 2880 tggtggccag ctgttcgacc tgccaaacgc atcgccatcg aaacccagcg cagcctcttc 2940 gtccagtgcc gttgccggtt catgcatttc agtgggtgtc ggctgacatt tttctacacg 3000 gtggcgtcaa ctacctcctg attgttgacg catatagcaa gtggccggca tgcgttccgt 3060 tgcgcagttt atcgtcgtcg tccgtcgtcg ccgaaatgga ccgcatattt agtgactttg 3120 gcgttccgga aattgtcatg tccgacaatg gctctcagtt cgattgcgcc gaatttcgag 3180 agttctgcga gggccgtgca gttcggtcag tgacgtccag cccaacctac gcacagtcca 3240 atggtctggt ggagcgccac atacagacag taaaaaagac gctcctaaaa atgtttacgg 3300 atggccggtc tctgtgggag gccttggcgg ccattcggtc aactccaata tcgtcggaac 3360 tgccgtctcc ggcagtgtta ttgcagggac gccaccttcg tggcagcttg ccgtttctgc 3420 aggatcgtct ggttccacaa tttgtatctg ccaaattcgt tcacggccag ctccagcgtc 3480 gtcaggctac ggcttgtttt actcacggcg gtcgtccgga tgttcgtggg tcggcgctca 3540 ttgtcggtca acgtgtccgg gcgttcgtct ctgggttgtg gttaccaggt gccgtcgaag 3600 ctgtttgcag cgaaccggac tcatatgtcg ttcggttggc ggatgggcga gcttttcgtc 3660 ggacgcgccg cgatataaat cttgacaact cgccgtcggc aggattggcc gtggttcaac 3720 aaagtggtgg tgctgctgta atcccgtctg ctgttcgcgg ttatcgcccc gcccctgcta 3780 atcatctgct gccggccctg tcttggtctc cgcctgccca cgctgtgagt gttccagttg 3840 cccagccgct tccggtagcc ctgggtcaac cgtcaatgat gccacagcct ccgccggtct 3900 ttgtcaaccc ggcgcccggc ccggcaaccc ctgcaaggcc acgagcagcg cccgtctcaa 3960 ccgaccagcg tgctaccgtc cagctgcccg ctgtccagtc aaccctgact gcatcaagtc 4020 cgctggccgt tgcgcctcat cgtccgggat cgactcgctc tggtcgccct tatctcaagc 4080 cgtcttaatt gtatgtgggc gattccgtgt tcgttgtcct ggctgttttg ctagtgacac 4140 ttttggctgt tttgctatac tttcactcat ggaattcatc gtctaattct caatttttgt 4200 tttcttcctg cgtctcgtcc cattgttcat gtcatgtcgt gtcttatctc attgttcatt 4260 tcatagtgat tgttctctgg ttttcgtccc attgttcatg tcatatcgtg ttttcctcat 4320 tattcatttc attgtgttta ttgtcttgtt cggcatgtta gttgtcgttc atttgtgtgt 4380 aatcagtgtg cgagggggag a 4401 // ID Copia-19_SI-LTR repbase; DNA; INV; 307 BP. XX AC AEAQ01024750; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_SI_; KW Copia-19_SI-I; Copia-19_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-307 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024750; Positions 52 358. XX SQ Sequence 307 BP; 92 A; 50 C; 34 G; 131 T; 0 other; tgttaaaatt attttattat taataatttt ttgttcaacg ttatgattgt atacatgcaa 60 ttttttttaa taggttctct aaagtcacaa ttttaatttt gtttatgcat ctactatact 120 tttgtctatg gttgtgccta actgatctta ctcatctctc atctctctct ctctatgatg 180 tctttgtgta acttgcttat agataagcga tttgtattcg tataagaaaa atatattcac 240 tcaaggaaat acaagtacat tcgtatattc aagattgatc catccatcaa atcctcaaat 300 ttcaaca 307 // ID BEL5_Cis_LTR repbase; DNA; INV; 391 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL5_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-391 RA Smit A.F.; RT "BEL5_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 391 BP; 103 A; 80 C; 85 G; 120 T; 3 other; tgttacggtc attttagagc ccaatcccaa ttacgtataa ttcccgattt ctgtgtgctt 60 aatgaactgt cacggcgtct ctgccgattt gaaaggtcat cgaaatttcg gccttttctg 120 ttctttacta cgtgaaagca ggtttattca ttgagctgtt ctatatcgca cgaaagctaa 180 tccagaaatt gtgtctatgt gtcttatntg tactgtattg cgaataagct acngttaatt 240 tgctatcgca attacaagac agcacaagag ggtggtttta cgtcaaatat agaagataga 300 ncaatcactg gtcaagtgcg tggagcagct caaattcgcc aacacagggt tggcaacgta 360 gggtagggct tcgcccctgt tgaatcctac a 391 // ID Copia-27_DPu-I repbase; DNA; INV; 4287 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-27_DP_; KW Copia-27_DPu-LTR; Copia-27_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4287 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [1632-2159] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(78..950,954..2192) FT /product="Copia-27_DPu-I_1p" FT /translation="MASSVVSFSSKDVSHVVKFDGTNFPFWKFQIFLAFEQ FT HDLLKIVTGEETKPTLRNAVDAQGVSHSNEDQIKSWCNRDNAARVSIVATI FT EQVWQRSLINCKTANEMWKRLTSQHEQAALENVHLLLQRFFEYQYQKGHDV FT MSHVTAIETFARQLEDLGSPLTEAQIITKITCTLPPSFRPLLSAWENLETN FT KKTLQLLTSRLIKEESMNKVFGDSSSSDAAFFSKQSGFKPKADSSKMSNKQ FT VRFKRKCNYCTEEGHIEERCWQKSFDLRMAQSLKVHGDETQAKFVTMDMPT FT RKEESHQDWSGDYAFRSYSFFSKIKRRNWIADSGASRHMSDQEWCFTNYVP FT VKAGTWPVSGIGEDSQPLQVAGYGDVPVMCCVDGIYNRGVLQKVLHIPKLG FT VNLFSIKAATSQGFKAVFTDCRVELIKNGAVRLAGMSNENYLYTLDVISCH FT GKSITLEPKMESFSALAKSTPKSVHLWHRRLGHVAVSTIKKMADNLMADGL FT VLSEDLKKDIVCEGCIYGKHHRLPFPTSGRTRGRKIGDLIHSDVCGPVSVP FT SPGGAKYFVTFRDDYSSYNAIHFIKQKSEVFELFKQFVKRVKVEIGNEVVC FT LRSDNGGEYVGKEFETWLLKNGIRHETTVPYTPEQNGVSERLNRTVLESAR FT SMLHSSSLPLELWAEASNCAVYLLNRVATRSVKEKPHMKYGWESSRIFPTS FT ESLD" XX SQ Sequence 4287 BP; 1239 A; 950 C; 991 G; 1077 T; 30 other; ggttatgggc ccagcttaaa agtttgtttg ctaccctatt gtaaactgag aattttgttt 60 tgttcccaat ttccaatatg gcatctagtg tggtttcttt ctcatccaag gatgtaagtc 120 atgtggttaa atttgatggc actaatttcc ctttttggaa gtttcaaatc tttctcgcct 180 ttgaacaaca tgacttactg aagatagtca ctggagaaga gaccaaaccc accctccgaa 240 acgcagttga tgcacaagga gtgtcacact cgaatgaaga tcaaattaag tcatggtgca 300 atcgagacaa tgctgcaaga gtttccattg tggccaccat tgaacaagta tggcaacgat 360 ccttaattaa ttgtaaaact gcaaatgaga tgtggaagag gctaaccagc caacacgagc 420 aagctgcact tgaaaatgtg catttgttac ttcaaagatt ttttgagtat cagtaccaga 480 agggtcatga cgtaatgtct cacgtcactg ccattgaaac tttcgcaaga caacttgaag 540 accttgggtc acctctcact gaagcacaga taattacaaa aattacttgt actcttcctc 600 ccagttttag gccacttttg tctgcatggg agaatctaga gaccaacaag aaaactctac 660 agcttcttac ttctcgtctc atcaaagaag aaagtatgaa taaagtattt ggagatagca 720 gctcgtcaga tgcagccttt ttctccaaac agtctggctt caagccaaaa gctgattcaa 780 gtaagatgtc taataagcaa gtaagattca agaggaagtg taactattgt accgaagaag 840 gccatatcga agaaagatgc tggcagaaat cttttgattt gagaatggcc cagtcactca 900 aagtacatgg tgatgaaacg caagctaaat tcgtcaccat ggatatgccc sctactcgca 960 aagaagaatc acatcaagac tggtctggtg actatgcttt tcgttcctat agtttctttt 1020 caaaaataaa aaggcggaac tggattgctg attctggagc gtcaaggcat atgtctgatc 1080 aagaatggtg ttttacgaac tatgtgcccg tgaaagcagg aacctggcct gtttccggaa 1140 ttggagagga ttcccaacct ctacaagttg ctggttatgg tgacgtcccg gtcatgtgtt 1200 gtgtagacgg aatctacaat cgaggagtcc ttcaaaaggt tttgcacatt cccaaactcg 1260 gtgtcaacct atttagcatc aaggcagcca ctagtcaagg atttaaagct gtctttacag 1320 actgtagagt tgaattaatc aagaacggtg cagttcgttt agctggaatg agtaatgaaa 1380 attatttata cactctagac gtcatcagct gtcatgggaa gtcaatcact ctcgaaccca 1440 agatggaatc tttctcagcc ctggcgaaat cgacacccaa atcagttcac ctctggcatc 1500 gacgtctcgg tcatgttgca gtttctacaa taaagaagat ggctgacaat cttatggctg 1560 atggacttgt tctcagtgaa gatttaaaaa aggacatcgt gtgtgaaggt tgcatttatg 1620 gaaaacatca ccgtctcccc ttcccaacca gtggtcgaac tagaggcagg aaaattggtg 1680 acctcattca ctcagatgtc tgtggaccgg tatcagtccc ttcccctggt ggtgccaagt 1740 actttgtaac tttcagagac gactacagta gctataatgc cattcacttc atcaaacaaa 1800 agtcagaagt ttttgaactc ttcaagcaat ttgtgaaaag agtcaaagtt gaaataggta 1860 atgaagtcgt ctgtctgaga agcgacaatg gcggtgaata tgttggcaaa gaatttgaaa 1920 catggcttct caaaaatggc atacgccatg agacaaccgt tccatacact ccagagcaaa 1980 atggcgtgtc cgagagactc aacaggacgg tactagaatc ggctagaagc atgcttcatt 2040 cctcttctct accnctagag ctctgggcgg aagcatctaa ctgtgccgtt tatctactca 2100 accgagtggc gaccagatcc gtgaaggaaa aaccccatat gaagtatgga tgggagtcaa 2160 gccgaatctt tcccacgtca gagtctttgg atcsacagtg tatgtgcatg ttccaaaaga 2220 aaaaagaagc aagtttgatc ccaaggcagt gaaatgtcat catgtgggct actgtgaaac 2280 tcaaaaggcc ttcagagcct gggaccctgt aagtcgaaaa gttttaattt ccagggatgt 2340 tatttttcaa gaattggatg atcgagcagt tgaaatcgag cagccaaaca gtatttttga 2400 gcttgtggca aattcctgtt ttgaagagga aaaaagactg cctcaaccct cacaactcga 2460 cgtctccatt caatcagaga acattcaaga tgatccggct ttttccatag agtctccagc 2520 agaagtggaa atttccaatc ccagccatga agaagctaac gcggaagagt tggccggcga 2580 tcctacagct gatcaaacgc aagaaccttc cctcggacga agagttcgaa gaccacctgt 2640 caggtggatt gacgaatcta caaacaagtt ctatgctaag ctcgctgcag tgggtgccat 2700 ggaagagccg acaaacttca agaacgctat ggagtcccct caagcagacc aatggaagac 2760 tgcgatggaa gaagagatgg attctttgat gaagaacgaa acgtggacgc tgaccaacct 2820 gccacccggt cgacaagcca tcgagaacag gtgggtgtac aagttcaagc tggatggaga 2880 aggagnagtc cggcgttwca aagcccgact ngtcgcaaag ggcttcacgc agcgccccgg 2940 aattgacttt gatgaaacct tttctccngt tgtgaagtac gactctttac gtgccgttct 3000 ttccatagct gccgcgnngg atctcgagat gctgcaacta gatgtnaaga ctgcctttct 3060 aaatggagac ttagacgaag agctntacat ggctcaacca gaaggatttg ttgtncccgg 3120 tcaagaagaa gaagtgtgca agctgaagcg nagcctgtat ggactaaaac aggcttcnag 3180 agcctggaat ctgaaattca acggctttct cacaattttg gctttgtccg cagcagtgca 3240 gaccmgtgtg tttacgtcct aaaagaagaa aagtgcctaa cagtnatngc aatctgggtn 3300 gacgatggct tggtntgcag cagtcatccc caaaacttgc cagtgtggtt gactncctaa 3360 caaaacaatt tgagatgacg tctggccngc aggncgtttc gtggggattc agatcgtccg 3420 agacagatga aaggaaacca ttcacatttc tcaagaagac tacatcaagc gcgtgcttaa 3480 gaaatttaas atggcagaat gcaacgcacg cgtcgtacct gcggatccat gcacgcgact 3540 ctcgaaagat ctggacgcga ctcgtccgac tggcaaagga agtggttgaa gcccttccgt 3600 gaagcngtcg gttctctaat ntatgctgtc acttgtaccc gacctgatat tgcnttcgct 3660 gtcagccagg ttcmcaattt tccagactcc agcanaggca cactgggacg ccgtaaaacg 3720 gatcttctcg tacctgaagg ggactctgaa ctatggaatc acctttggag atctgactct 3780 cgaaatgagc tcctggccta caccgatgca gacttcgctg sagatctaga tgacaggcgc 3840 tcgacaccgg agtcattctg ctgctaaatg aggagccgtg tcttggaaaa gtcaaaggca 3900 aaagtgtgcc tcactgtcaa ccactgagtc cgaatatgtg gcggctgccg cggcggccaa 3960 agaagccgtc tggatgagac ggcttctcga agatctggcg ccacaatctg caccgaccac 4020 cctgttttgt gacaaccaaa gtkcaatcag gctcgttcag aatccggagt ttcatcagcg 4080 aacgaaacac atcgacgtca aatttcattt cgtccgcgct ctgcaagagg agcaagttat 4140 tgacgtcacc tacgtcaata cagatgtgca gctggccgac attttgacga aacctcttaa 4200 tggaccaaga tttacaaaat tgagagaaga gattaatatt actgtccatg tttgttaaaa 4260 agtttgttgc ctatgcttga gtgggtg 4287 // ID I-7C_AAe repbase; DNA; INV; 6501 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-7C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6501 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1359-1359 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >96% CC identity. The consensus is ~87% identical to I-7B_AAe and CC ~82% identical to I-7_AAe. XX FH Key Location/Qualifiers FT CDS 1075..2448 FT /product="I-7C_AAe_1p" FT /translation="METSSANSSSAHQRQALPSTPGKANPRQKAYPPGSKG FT PFLVFFRPKGKPLNKLQIEKDLAKSFRGIESVDAPSRDKLRITVSDREQAN FT RIASYELFSMEYRVYIPSRDIEIAGVVTEPYLSCADIKAGAGGFKNRDVPP FT VTILDVRQMNQVAPDGTKQPSNSFCVTFSGSALPDYLVIGKLRLPVRLYKP FT TVMHCDKCQQIGHTTPFCCNKPRCAKCGEQHVEGACSTEPKCTCCGQAPHE FT LTTCPRYIEREKHQIRSLQQRSKRSYAEMLKKIAPAVAPPAQPIASNNIFT FT SLADDEQGSDSEEGEEYTIVQTGSKRKRAVARQLRHRQQISQNAQQELRPS FT LKKSRSGENAIKANPPGFRLVPGDFPSLPGSSKTPDIPVFRPESQHANIPS FT ARQESVETSDKLTLSGIVEIIFKILEVSPATRNLINMALPFVKPLLKRLFS FT QWPILDSFISLDG" FT CDS 2444..6118 FT /product="I-7C_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDNSTNEVGDMIEILQWNCRSILKNIDAFKFLVHSTR FT CDIFALSETWLTSDKDISFHDFNIIRQDRGDGYGGVLLGIRKLHSFYRVDF FT PPMTGTEVVALQVTIRGKSFSIASVYLPPSARVSRRDLAAICCAMPAPRLV FT VGDFNSHGTAWGSMYDDNRSSLIYDLCDDFNLTILNTGEATRVKPPAPPSM FT LDLSICSNSLSLDCTWKVIQDPHGSDHLPIKISITNDSCQARQIDLAYDLT FT KHIDWGKYAELIIDGVQSVELLPPLEEYQFLSELIVNSALQAQRRPIPGSK FT VRRRPPTPWWDEECTAVYREKSAAFKXYRKRGSRENYERYTSLERKFGSLV FT KAKKRGYWRNFVNGLSRETSMTALWSAGRRMRNASXVNEDKEHSSRWIFQF FT AKKVCPDSVPVYRTVRACLPERNEVDRPFSMLEFSLALLSCNNSAPGMDRI FT KFNLLKNLPDVAKRRLLDLYNQFLEGNIVPDDWRQVRVIAIQKPGKPASDY FT NSYRPIAMLQCLRKLLEKMILYRLDKWVEANGFLSDTQFGFRRGKGTNDCL FT ALLSTEIQLAYAQKEQMGSVFLDIKGAFDSVCIDVLSDKLHDCGLSLLLNN FT YLYNLLSEKRMSFSHGTSTTSRISYMGLPQGSCLSPLLYNFYVRDIDDCLV FT ENCSLRQLADDAVVSVTGPGADDLQRPLQDTLDNLSTWALKLGIEFSPEKT FT ELVVFSKKRDPAKVELQLMGKELAQGISHMYLGVWFDSKCTWGKHIKYLYQ FT KCQQRINFMRTLTGTWWGAHPEDLIKLYRTTILSVLEYGSFCFQSAAKTHI FT LKLERIQYRCLRIALGCMSSTHTMSLEVLAGVLPLSDRFAELSLRYLIRCE FT VLNPLVIGNFEKLIERNPQTKFMTLYYWYMSLEVNPSLNAPNRSCFPDFSS FT STVGFDLSMKQEIHGIPDHLRSEYIPLIFANKFGHVSSDRTFYTDGSKTND FT STGFGVYNEFHSAAHKLQNPCSVYVAELAAIHYALERIASLPSDRYFIFTD FT SLSSIEAIRSMRPVKHSSYFLREIRSSLSALSNHTITLVWVPSHCSIPGNE FT KADSLAKVGAMEGDIYDRQIAFNEFFAIARQQTLLSWQQKWDNGDLGRWLH FT SILPRVSRKPWYKGLDMSRDFIKVMCRLMSNHYSLGSHLYRIGLSDSNRCG FT CGAGYRDINHVVWYCSEYGIARSDLCESLRARGKPDKEDIRDVLGRLDFDY FT MMLIYKFLKQNDVIV" XX SQ Sequence 6501 BP; 1758 A; 1515 C; 1423 G; 1798 T; 7 other; cattcctcgt cgtgcgctct atcgagacgg tcgactttat tccgtgctct gaaccattcg 60 ttatttttcg gtagagtttt ttctccacac gcaagttttt agatagtgga gtttttatat 120 cattcgctca tcctaccagt gaacgaaaat cggtatatcg cgcaacacgt ggaagcagct 180 agtcctggca tagtcctacg gctagactgt gcgtcgaagc atcatcgaag ggcaaaccgg 240 gggtcccccc ttcccgtcat ctaccatcgt gcccatcgca gtcaaacatc agcgcgcttt 300 catcgtagta cccagctcca gcagatcgaa cgagccaagg tcatcacacg agaaaacgcc 360 tgtttgacta cctacagtaa gcagaggaac aagctgtagc acgcggtgtg tggttaggta 420 aattgttcct tttcttggtt gcattgtttg ccgtcggtcg gacaatttca tctacgttga 480 tcgaatagtt acacagtggt ctcgaagtgt tttttccctt gaaatttttt tttttttttt 540 gctgaattat tatttttttt ttaataattt gatttttttt tcgtgttgca ttaatattga 600 ttaactcaca tagcagtgca aaatcaatca aattttcact tttttttttg ggttattgcc 660 attgttattt tcctcgaagc gtaatacaac gctacattct gtgagctaaa ccaatagaat 720 agaataatat tctgtgagct aaattaatta gaagctcact tgtaaatatt agttaggttt 780 tagatttttt ccttgtgtct aatattacaa cacaatggat ccaaatgatg aaggcggggg 840 cgattcgcag tttctgatat cctccgaaga cgagtcaaac aataattttg ttgaaaatgt 900 ctcccctatg gagaccggcg tagttacgaa acagggtctt ctcccctaca aggagcgcct 960 gaattacaaa ctccagtgag gttgcctgaa atcgaaaast ccattaccca tctttctgag 1020 gatgaatcta ttgaatcacc atcgtacgaa acattggtca gtccacaagt accmatggaa 1080 acatcmtctg cgaattcttc ctctgcacat caacggcaag cgctcccctc cacccctggt 1140 aaagcgaacc cccgccagaa agcttacccg cccggatcta agggtccctt tctggttttt 1200 ttccggccca aaggtaaacc gttaaacaaa ttgcaaatcg aaaaagattt ggcaaagtcg 1260 tttcggggca tcgagtctgt agatgcaccc agccgtgaca agctccgaat cactgtcagt 1320 gatcgtgaac aggcaaatag aattgcttcc tacgagctct tctcgatgga gtacagagta 1380 tacataccga gtcgagacat tgagatcgca ggggttgtca ccgaaccgta tttgagttgc 1440 gctgacatca aagctggtgc tggagggttt aaaaaccgtg atgttccccc agtcactatc 1500 cttgatgtca gacaaatgaa tcaggtggct cctgatggca caaaacagcc ctcaaattcg 1560 ttttgcgtca cattctctgg atcggcgctt ccggactact tggtgatcgg gaaacttcgg 1620 ttacccgttc gcctctataa acctacggtc atgcactgtg ataagtgcca acagattggt 1680 cataccacac ctttctgttg taacaaacca cgatgcgcta aatgcggtga gcaacatgtt 1740 gagggtgcct gcagtacaga acctaagtgt acatgctgtg gtcaagctcc acatgagctc 1800 accacatgtc ctaggtatat agagagggag aaacatcaaa ttcgatctct acagcaacgg 1860 tcgaagcgtt cttacgcaga aatgctgaag aagatcgccc cagctgttgc tccgccagcc 1920 caaccaatag ctagcaacaa tatcttcacc tccctagctg acgatgagca aggttctgac 1980 tctgaggagg gtgaagagta tactattgtg caaacaggat caaagagaaa gcgagctgtt 2040 gctagacagc tgcgacatcg tcagcaaatt tctcaaaacg cacagcagga actccgaccc 2100 tccctgaaaa agtcaagaag tggcgaaaat gctattaaag caaatccacc aggtttcaga 2160 cttgtccctg gagattttcc gtcacttccg ggatcatcta aaaccccaga tatcccagtt 2220 tttcgcccag aaagccaaca cgcaaatatt ccttcggctc gacaggaaag tgtcgaaact 2280 tccgacaaac taacactttc tgggattgtg gaaatcatct tcaaaatatt ggaagtttct 2340 cccgcaacaa gaaacctcat taacatggct cttcctttcg tgaaacctct tctgaaacgt 2400 ctgttttcac aatggccgat ccttgattcg ttcatatctc tcgatggata attcaaccaa 2460 cgaggtcggg gatatgatcg aaatcctaca gtggaattgt agaagcattc ttaaaaatat 2520 tgatgcgttt aaatttttag tccacagcac gcgctgtgac atattcgccc ttagtgaaac 2580 atggctcact tctgataaag atatttcttt ccacgatttt aacattatcc gtcaggatcg 2640 aggggacggt tatggagggg tgttattggg gatcagaaaa ctccactcct tttatagagt 2700 tgacttcccc ccgatgactg gcaccgaagt agtcgcattg caggtcacga tacgaggcaa 2760 aagctttagc atagcaagtg tgtacctgcc gccgagcgcc agggtatctc gcagggacct 2820 tgcagccatc tgctgcgcta tgccggctcc acggttagtc gttggcgatt ttaattcgca 2880 cggtacagcc tgggggtcaa tgtatgacga taaccgttcg tccttgatat acgacttgtg 2940 cgacgacttc aacttgacaa ttttaaacac tggggaagca acacgagtaa aacctccagc 3000 tccaccaagc atgttagacc tctcaatctg ttcgaattcg ctatcattgg attgcacgtg 3060 gaaagtgatt caagatcccc atggtagtga tcacctgcct atcaaaattt ctatcaccaa 3120 tgattcgtgt caggcccgcc agatcgactt agcgtatgac ctcacgaaac atattgactg 3180 gggaaagtac gctgagttga tcatcgatgg cgtgcagtcg gtcgagttac ttcctccgtt 3240 ggaagagtat cagttcctat ccgaattaat cgtcaacagt gctcttcaag cacagcgtcg 3300 accaataccg ggatcgaaag ttcgacgtcg gccacccacc ccgtggtggg atgaggagtg 3360 caccgcwgta tatcgggaaa aatccgccgc gttcaaagwa taccggaaac gcggttctcg 3420 kgaaaactac gagcgctata cctctcttga acgcaagttt ggtagccttg tcaaagcgaa 3480 gaaacgagga tattggcgaa atttcgtaaa cggtctttcg agggaaacgt cgatgacagc 3540 attatggtcc gccgggagaa gaatgcgcaa cgcgtcgwcg gtaaatgaag ataaagaaca 3600 ctcttctcgg tggatcttcc agttcgctaa gaaagtgtgt cctgactccg ttcccgtgta 3660 caggacggtt cgcgcttgtt tgccagagag aaacgaagtt gatagaccct tttcgatgct 3720 tgagttctca cttgctctcc tttcatgtaa caactccgcc ccaggaatgg ataggattaa 3780 gttcaacctg ctcaaaaacc tcccagacgt cgcgaagagg cgcttgttgg acttatacaa 3840 tcagtttctt gaaggcaaca ttgttccgga tgattggaga caagtgagag taattgccat 3900 ccaaaagccc ggaaaacccg cgtcggatta caactcgtac cgccctatcg cgatgctgca 3960 gtgcttacgc aagctgttag agaagatgat tctttatcgg cttgacaaat gggttgaagc 4020 gaacggcttt ctctcagata cgcaatttgg tttccgcaga ggcaaaggaa cgaacgactg 4080 tcttgcgtta ctttcaacag aaatccaact agcctatgct caaaaggagc aaatgggctc 4140 agtatttttg gacattaagg gggcttttga ttcagtatgc atagatgtcc tttcagacaa 4200 actccacgat tgtggacttt cattattatt aaacaactac ttgtataatt tgttgtctga 4260 gaaacgtatg agcttttctc acggtacctc aacaacttca cgaataagtt acatgggtct 4320 cccccagggc tcatgtttaa gtccccttct ttacaatttc tatgtcagag acattgatga 4380 ttgtctcgtg gaaaattgct cgctaagaca gcttgcggat gatgccgttg tttctgtaac 4440 aggaccaggg gcggatgatt tgcaaagacc actgcaagat actctagaca atttgtctac 4500 ttgggccttg aagctgggta tcgaattctc tccggagaaa actgagttgg ttgtcttttc 4560 taagaagcgt gatcctgcaa aggtagagct tcaactcatg ggtaaggagc ttgctcaggg 4620 tatttcacac atgtatctag gggtctggtt cgactctaaa tgcacctggg gaaagcacat 4680 caaatatttg tatcagaaat gccagcaacg aatcaacttc atgcgtaccc tcactggaac 4740 atggtgggga gcccacccgg aagatctgat aaagctatat cgaacaacga ttctgtcggt 4800 cctcgaatac ggtagctttt gttttcaatc cgccgcgaaa acacacatcc tgaagctgga 4860 acgtatccag tatcgttgtc ttcggatcgc gctaggatgc atgtcctcaa ctcatacgat 4920 gagtttagaa gttttagcag gagtactgcc tttatcagat cgtttcgcgg aattatcgct 4980 acgctacctc atccgatgtg aggtgttaaa tccattggtg attggaaatt ttgaaaagct 5040 aatcgaacga aatcctcaaa caaaatttat gacactgtac tactggtaca tgagtctgga 5100 ggttaaccct tcattgaatg ctccaaatcg tagttgcttc ccagacttct ccagttctac 5160 tgtaggtttt gatctgtcca tgaagcaaga aatccatgga attccagatc atctccgatc 5220 ggagtatatt ccactaattt ttgcaaacaa gttcggtcat gtcagcagcg acagaacgtt 5280 ttacactgat gggtcaaaaa caaatgattc cactggattt ggtgtttata acgaatttca 5340 tagtgccgcc cataaacttc aaaacccatg ctcagtatac gtcgcagaat tagcggctat 5400 acattatgca ttagagcgaa ttgcctctct tccctctgat cgatatttca tttttacgga 5460 tagcctcagc tccattgagg ctatacgttc aatgaggccg gtaaagcact cctcgtattt 5520 ccttcgcgaa atacgatctt cattgagtgc tttatcgaat cacaccatca ccttggtatg 5580 ggtcccttca cattgttcga taccgggcaa tgagaaagca gactcactcg ccaaggtggg 5640 cgctatggaa ggcgatattt atgaccgtca aatcgccttc aatgaatttt ttgcaattgc 5700 tcgtcagcaa accctgctca gttggcaaca aaaatgggac aatggtgact tgggtaggtg 5760 gttgcattcc attctccctc gggtgtccag gaagccgtgg tataaagggt tagacatgag 5820 tcgagacttt atcaaggtaa tgtgtcggct gatgtccaac cactactctt tgggatcgca 5880 cctctatcga atagggctct cagacagtaa ccgttgcggt tgtggtgcag gatacagaga 5940 catcaatcac gtggtgtggt actgttccga atacggcatt gccagatccg atttatgtga 6000 atccctcagg gcccgaggga aaccagataa ggaagacatt agagatgttc tgggtaggct 6060 ggatttcgat tacatgatgc ttatttacaa atttttgaag caaaatgatg tcattgtttg 6120 atttcccatt ctcgctgtcc agtccaacta gtcctccttt tggtcttcgt ctgtccacct 6180 tgtcccctcg aaatacgttt gtttgtaccg ttacaggttg tcgtcatgtc cactctacgg 6240 ttgacctgca gcaacaacga tactcactac ataacactgc tgaacgtcaa catgtaaacc 6300 ttaaacccta tcctctcctt catatttttt gttatcctta acctcggcca aaccgcgagt 6360 tttacggttc cccaaaacta atatagatta ttaagaagca attatgaatt tgtaaaaaaa 6420 aaaatcaatt gaattcggct ccgttatgcc tagtggcgct tgagcctgtt aaataaacgt 6480 ttaagtaaaa aaaaaaaaaa a 6501 // ID DNA-1_CQ repbase; DNA; INV; 3260 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3260 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 42-42 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >77% CC identity. subterminal inverted repeats. Including (TTTCTAAAA)n CC microsatellites at the middle. XX SQ Sequence 3260 BP; 1064 A; 530 C; 449 G; 1216 T; 1 other; taagggtgga ttttcatgga tttttcgatg accgacatcg aaagcggctt tcgatgctcg 60 tggtcatatt ttgccgaaaa ctcattttta tcaatcaggt taaatcgcaa aatggtttgg 120 attgatattt tatggtagta taagcacaaa caaacacata cccatcgccg gaaagttgcg 180 aaaatgcgat tttccgactt ttcattttcg atgcacgagc atctttagac gaccgacatc 240 gaaagcatac tcacgaccat gctttgctaa aatgtccgtg cattgaaact cgatgctcgt 300 tcagtgcctt tttgctacct cagcggcacc ctagacacaa acctaaataa cgctttgaaa 360 cgcttggtac gggatgggaa tgaaacttga cgaaacaaat tctatctgtc aaaccagacc 420 ttgtcaacaa tcaaaacagc tgattttgca tggtgaaatg gtggacattg ttttgcagga 480 ggaatccgtc caccggtggt tggatttcgg agcattgttt taccggagga atccgtccac 540 cggtggtcgg atttcagggc attgttttgc aggaggaatc cgttcaccag tggacggatc 600 gcctggccaa gttgacaaga ggaacttgtc caccggtggg gatttcgggg cattgttttg 660 caggaggaat cggtccaccg gtggccggat ttctggacat tgttttgcag gaggaatccg 720 tccaccggtg gttggatttc ggggcattgt tttgcaggag gaatccgtcc accggtggac 780 ggatttcagg gttttgtttt gcaggaggac ggatttctgg gcattgtttt gcaggaagat 840 tctgtgcacc gatggacgga ttgcctggcc aagttgacaa gaggaactgg tccaccggtg 900 gttggatttc ggggcattgt tttgcaggag gaatccgtcc accggtggcc ggatttctgg 960 gcattgtttt gcaggaggat kccgtgcacc gttggacgga ttgcttgcct aaattttgct 1020 taaatttctt aattccaaat tccaaaattt ctaaattctt aaaactaaaa tttctaaact 1080 aaaaattcta aattttctta aatttcaagt attctaaaat ttcaaagttt ctaaaatttc 1140 aaagttttct aaaatttcaa agttttctaa aatttttata atttctaaaa tattaaaaat 1200 atataaaatg tctttactta ctttaatttc ttcaattcct ttaatttctt aatatttata 1260 atttcctgaa tatcttaaat ttcttaaatt tctaaaattt cttaaatttc taaaatttct 1320 aaaattctaa atttctaaat ttctaaaatt tctaaaattt ctaaaatttc taaaatttct 1380 aaaatttcta aaatttctaa aatttctaaa atttctaaat ttcttaaatt tctaaatttc 1440 ttaaatttct aaatttctaa aatttctaaa tttctaaatt tcttaaattt ctaaaatttc 1500 taaaatttct aaaatttcta aaatttctaa aatttctaaa atttctaaaa tttctaaaat 1560 ttctaaaatt tctaaaattt ctaaaatttc taaaatttct aaaatttcta aaatttctaa 1620 aatttctaaa atttctaaaa tttctaaaat ttctaaaatt tctaaaattt ctaaaatttc 1680 taaaatttct aaaatttcta aaatttctaa aatttctaaa atttctaaaa tttctaaaat 1740 ttctaaaatt tctaaaattt ctaaaatttc taaaatttct aaaatttcta aaatttctaa 1800 aatttctaaa atttctaaaa tttctaaaat ttctaaaatt tctaaaattt ctaaaatttc 1860 taaaatttct aaaatttcta aaatttctaa aatttctaaa atttctaaaa tttctaaaat 1920 ttctaaaatt tctaaaattt ctaaaatttc taaaatttct aaaatttcta aaatttctaa 1980 aatttctaaa tttctaaaat ttctaaaatt tctaaaattt ctaaaatttc taaaatttct 2040 aaaatttcta aaatttctaa aatttctaaa atttgatttt ttaaaaattt ctaaaatttc 2100 tttaattctt ttaattcctt aaatttctta attcttttat tttctttaat ttcttaaatt 2160 tcctcaattt cctaaattac ttaattattt ctttaatttc tcatatttct tgaattattt 2220 atttcttaaa tttctaaaaa attaaattac ttaatttttt tgaaattcct gaaatttttc 2280 caatttttag tccactctat agagaatttg cagctaagct aaaagacaca tgtcgccacc 2340 tatgaacaaa tagcgtaaac ctatgatttt tttttgaaaa aatgtgtttt ttccttcata 2400 atcaacattt ttattaaact tatgccggat tcggatgaga aaaaatattt ccaacaattt 2460 ggtgtacaac tttccaatat ttgtttgatt tttctatgag aaaactgtaa atttagaaaa 2520 agatagtaat ttggactatt tggtaagaaa gtctgtcaaa aatcaataaa aattgttgaa 2580 tatacgtaaa cacatctgtt atcaattata aaactgttta tttcattgat ataaacgggg 2640 aaactcattt ttagtgttcc aaaacatgtt tttttttaga aaaagtcaca tatttctgcg 2700 cgcccaaaaa atgatgttca ttattttaaa gcaaaaaaaa tttctcgtct tttgcaacca 2760 gtttcattaa gattgatgca gggaaccatg agaaataagc aattttgttc ttcaatccag 2820 cagaaagaaa tacgattttt ggcaagtaac ctcattttgg cgccacctac atttgaaaca 2880 cagtcgcgct gcaaaacctc aattatctct catctttttt gcatgatttt tcgcaacgaa 2940 catgaggcaa aaggttttat ttaattaaat tatgcgctag tctaatcatg cgcctttttg 3000 aaaaagattt attgaacaac atcagcttat ttcaaaagtt gttctggaat tccaatcgcc 3060 acgcaatgct gctttgttta cgtttgaatc tgtcattttc catccttcgt taacttgcat 3120 gcaagtgttc cttattccgg aaagtcgatt ttggggttgt gtctaagcta aatgaagtaa 3180 cgcaagccac gagcatcgag ttggttcgat gctcgtgctc tcgatgtcgt tgcacaatcc 3240 atgaaaatcc agcctaagag 3260 // ID MISAT1 repbase; DNA; INV; 296 BP. XX AC L07110; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Meloidogyne incognita satellite repeat. XX KW SAT; Satellite; Simple Repeat; MISAT1; satellite repeat; KW Repetitive sequence. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-296 RA Piotte C., Castagnone-Sereno P., Bongiovanni M., Dalmasso A. RA and Abad P.; RT "Cloning and Characterization of two satellite DNAs in the low RT C-value genome of the nematode Meloidogyne spp."; RL Unpublished (1992). XX DR GenBank; L07110; Positions 1 296. XX SQ Sequence 296 BP; 121 A; 34 C; 32 G; 109 T; 0 other; cttggtttaa ttacccaagt ttaaggtatg taaatcatta ctacttggga aaaattttgg 60 aattgtattt cgaaagaaat actcaaaatt aaatcaataa ttaaaagttt ttaaaaaaaa 120 cttgatacaa aaatttaaaa ataaactttt tactaaaagt attttataga ttatagaagg 180 agaaaagaag cagagaatgt acactatctc tcattaaatg aagaaaaatt aaattttatt 240 ccccatcaat attttcctaa ttttatggac attttgtcct tttatttttc caattc 296 // ID MarinerN-1B_AP repbase; DNA; INV; 244 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW MarinerN-1B_AP. XX NM MarinerN-1B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-244 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2066-2066 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 244 BP; 58 A; 71 C; 65 G; 50 T; 0 other; ctatactctg ttccatctaa gagtgaacca cttggctgac caaaacacgt tttttccgcg 60 cattcccacc acaatccgga catcggtgag agttttaaca cgggcgcccg acggccgacg 120 acggccggcg atgactgtcc gccaatcaca gaacgcataa ccatcgcatg gggggtcagg 180 cactgttcgg cggctcagtc tgcgtgcgcg aaagcggttc actcttagat ggaacagagt 240 atag 244 // ID Jockey-15_AAe repbase; DNA; INV; 4541 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-15_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4541 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1381-1381 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 283..1629 FT /product="Jockey-15_AAe_1p" FT /translation="MGNKKKKSVSQDATIDDTKGVKRLRHEEATPSGFNNR FT LLATNPFVTLSANNGHPPAAQQQQNRPASQRQGNHRNLPQQQANQPPIPTK FT ERVPPLFTTSKMVDGMKKDLAGDKIHPLFKHCLTGTKIICSSFADYNGVKH FT YLGLKNLPFYTHDVPGTKPLKVIIHGLSKYTPAEIMEELKAANLKPVQVFP FT INRAEGRQYRDLLYLVHLEKGSITMADLQKKRALFQTVVEWERYRPKKKDV FT TQCANCLMFGHGARNCHMAPRCGKCTGPHLTSMCQPMEEAEPKCANCGANH FT EANNRNCPKRAEFMEIRRKASSKNQRGREQRPSQSPQFNDEQFPPLRYQVP FT NLPPLQPHRQPPRPPPSQPAPTGQRPSVQNRFAAAAAAPLVAPGWANYERR FT PPTTDPLDDGTLFPTKMMSDFAVSLFGRLLSCRSKREQISAVCDTVQSFMD FT KYGP" FT CDS 1622..4294 FT /product="Jockey-15_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDPEAIRILNWNACSIRNKTRELAAFLEDRQIDIAII FT TETHLKPELSIYIPDFRTVRLDRTNSGGGGVAVVLRSNINCRLLPSFKTKI FT IEAVGVEVESPVGPISIIAVYCPKDTRANDGSATNLRNDIIKLTRRQGQYV FT IAGDLNAKHQVWGNSRRNNNGTIIQQDLEEGHYTIMGPDAPTRLSRTGVHA FT TIDVFLTNMTDKISQPVAYQELSSDHFPVVAEVGLSVNRHQITRRNYHRVN FT WDRFRRCVDDGIDYEERPVTKEDIDRQLCSIEEAISQAREQHVPAARQVSN FT TLPIDELTKDLIRLRNTTRRQYQRTGLPSLKTDFNRMSKIIKARMVDLRNN FT DFANKIRALPDCARPFWKMTKILKTKPRPIPPLIPLDNNGSMDRLITPDEK FT AEELGRHFVSSHNLGQNVISPHEAAVNEHAANLHLTPNDFSEELEVTADEL FT STYIRSFKNMKAPGFDKILNLELKHLSREFFEHLAEIFNQCLRLSYFPSAW FT KSAKVIPIQKPGKDPSSPKSYRPISLLSALSKLFEKTIHSRLLSFTDQNNI FT LLEEQFGFRRGRSTVNQLTRVTNILKRNKSVSKTSAMALLDIEKAFDNVWH FT DGLVYKLHRFNFPSYLVKIIKDYLSSRAFSVYLNGAVRLETQRISAGVPQG FT SILGPLLFNIFTSDMPPLPDNGTLSLFADDTAVVHKGRVIRALASKLQKSL FT DVLAEYLHSWKICINAAKTQVILFPHSQSPRLVPPEDCKIIMAGTAVEWSD FT TADYLGLTLDSKTNFRAQVDKTVTKCNILLKSLYPLINRKSTLSLKNKLAV FT YKQIVLPVIEYGMPVWECCARKHHLKLQRVQNKFLRMILNTPPRMRTTEVH FT RLADIKTLEDRFSEFKERYRARCQSSDQQVIRDLFPTL" XX SQ Sequence 4541 BP; 1189 A; 1362 C; 1055 G; 934 T; 1 other; tcattcttgc ttcgatatcg gagctgtgcg gacaagattt atggtgctct gccttgtaat 60 ccaaaagaag cgttttccga acaactctct tggacactac ctggcaacta agggttctcc 120 atctatcctc tgaattcgca agtgcgttga tagttttcgc tcgagattcc gagcccgcgc 180 cgcgtacttt cccccgcgct ctcccacctc ctccaggtgg caagagtaaa gtcaaaactg 240 actaagctcg ccgcgagtgc atcgttgccc attgctgacg acatgggcaa caagaagaaa 300 aagtcagtgt cccaggacgc gacgatcgac gacaccaagg gcgttaaacg cctacggcat 360 gaagaagcca ccccatccgg cttcaacaat cgcctgctgg cgaccaaccc gttcgtcacg 420 ctgagtgcga acaacggcca ccctccggct gcccagcaac aacaaaaccg ccctgcctcc 480 cagcgtcaag gcaatcatcg caacctcccc cagcaacaag caaaccagcc gccgatcccc 540 accaaggaga gagttccgcc gctgttcacg acatcgaaga tggtcgacgg catgaagaag 600 gacctggccg gcgataagat ccacccgcta ttcaagcatt gcttgacggg tacaaaaatc 660 atctgctcct cgttcgccga ctacaacggg gtcaaacact acctgggtct gaagaacctc 720 cccttctaca ctcacgacgt ccccggaacg aaaccgttga aggtcatcat ccatggcctc 780 tcaaagtaca cccccgctga gataatggag gagctcaagg cggcgaacct gaagccggtc 840 caggtttttc ccatcaaccg cgcggagggt agacagtacc gggacctcct ctacctggta 900 cacctggaga aaggctccat caccatggct gacctccaga agaaaagggc cctgttccag 960 acggtagtgg aatgggaacg ctatcggccc aagaagaagg acgtcacgca gtgcgcaaac 1020 tgcttgatgt tcggccacgg ggcaagaaat tgccacatgg ccccacgctg cgggaagtgc 1080 accggtcccc atctgacatc gatgtgccag cccatggagg aggctgagcc aaagtgtgcc 1140 aactgcggcg ccaatcacga ggccaacaat cgtaactgcc ccaagcgtgc ggagttcatg 1200 gagatccgca ggaaagcgtc ctccaaaaac cagcgggggc gtgaacagcg tccctcccag 1260 tcaccacagt tcaacgacga gcaattccca ccgctgcggt accaggtgcc gaacctcccg 1320 ccactccaac cgcaccgaca accacctcga ccaccaccga gccagccagc tcccactgga 1380 caacgccctt cggtccaaaa ccgtttcgcg gccgccgccg cagccccgct ggttgcccca 1440 gggtgggcca actatgagcg ccgccctcca accaccgacc cgctggacga cgggaccctc 1500 ttcccgacga agatgatgtc ggacttcgcc gtcagcctct tcggtcggct tctttcgtgc 1560 cgttccaagc gcgaacaaat cagcgccgtg tgtgacacgg tccaaagctt catggacaag 1620 tatggaccct gaagccatcc gcatcctgaa ctggaatgct tgttccatcc ggaacaaaac 1680 aagggagttg gctgccttcc ttgaagaccg ccagattgac atcgccatca tcaccgagac 1740 ccacctcaaa ccggaactca gcatttacat ccccgacttc cggaccgtga ggctcgaccg 1800 gactaactct gggggagggg gcgttgcggt tgtcctccgc tctaacatca actgtcgcct 1860 gctgccgagc ttcaagacga agatcatcga agctgttgga gtcgaggtgg aatctccagt 1920 cggaccaatc agcatcatag cagtctactg tcccaaggat acccgcgcca acgacggttc 1980 ggccacgaac ttacgaaatg atatcatcaa gctcactcgg cggcaggggc agtacgtcat 2040 cgctggtgat ctcaacgcca agcatcaagt ctgggggaac tctcggcgaa acaacaacgg 2100 gactatcatc cagcaagatc tcgaagaagg ccactacacc atcatgggcc cggatgctcc 2160 cacccggttg agccgaaccg gggtccacgc aaccatcgac gtcttcctca ccaacatgac 2220 ggacaaaatc tcccagccgg ttgcctacca ggagcttagc tcggaccact ttcccgtggt 2280 ggccgaagta gggctctcgg tcaatcggca tcaaatcacg cgacgcaact accaccgggt 2340 taactgggac cggttccggc ggtgcgtgga cgatggcatc gactacgagg agcgccctgt 2400 gacgaaagag gatattgacc gccagctgtg ctccatcgag gaggcgatct cccaggcccg 2460 ggagcaacat gtaccagcgg cccgtcaggt gagcaatacc cttcctatcg atgaactcac 2520 taaagatttg atccgtttac gtaacaccac tcgacggcag taccaacgca ctggcctgcc 2580 gtcgttgaaa accgacttca accgaatgtc taaaatcatc aaggccagaa tggtggacct 2640 caggaacaat gattttgcta ataaaatccg cgctctccca gactgtgcac ggccgttctg 2700 gaagatgacg aaaattttga aaaccaaacc cagacccatt ccaccgctga ttccattaga 2760 caacaatggc tctatggatc gcttgataac tcctgacgag aaggctgaag agttaggtcg 2820 gcacttcgtc agctcacaca atctgggaca aaacgtcatc agtccccacg aagctgctgt 2880 caacgagcat gctgccaatc tgcatctgac ccccaatgac ttttcggagg agttggaggt 2940 caccgctgac gaattgtcga cctatattag gtcmtttaaa aacatgaagg ccccaggctt 3000 tgacaaaatc cttaatttgg agcttaaaca cctgagccgc gagttctttg aacatctcgc 3060 ggaaatcttc aatcaatgtc tccggcttag ctacttcccc tccgcctgga agtcagcgaa 3120 agtcatccca attcagaagc ctgggaagga tccttcctcc cccaaaagct atcgtcccat 3180 cagccttctc tcagcgttat caaagttgtt tgagaaaacg atccacagtc gactcctatc 3240 ttttaccgac caaaacaaca tcttgctcga ggaacagttt ggattccgac gcggtcggtc 3300 caccgtgaac caactgactc gagttaccaa catcctcaag cggaacaagt ctgtctccaa 3360 aacatccgcc atggcactgc tagacattga aaaagccttt gacaatgtct ggcacgatgg 3420 cctggtgtac aagctacacc gatttaattt tcccagctac ctcgtgaaga tcatcaaaga 3480 ttatctctcg tcacgggcgt ttagcgtgta tctgaacggt gcggtacggc ttgagacgca 3540 aagaatctcc gctggcgtcc cccagggaag tatattaggt ccacttctgt tcaacatctt 3600 cacctcggac atgcccccgc tccctgataa cggaactctg tcgttgttcg ccgatgatac 3660 cgcagtcgtc cacaaaggta gggtcatacg tgcgctcgcc tccaaactgc agaaaagcct 3720 agacgtccta gcagagtacc ttcacagctg gaaaatttgc atcaacgcgg cgaagaccca 3780 ggtcatcctc ttcccccatt cccaatcccc gagacttgtt ccgcctgagg attgtaaaat 3840 catcatggcc ggtacagcgg tggagtggtc tgacactgct gactatcttg gcttgaccct 3900 agacagcaaa acgaacttca gagcacaggt cgacaagacg gtcaccaaat gcaacatcct 3960 gctaaaatca ctttaccctt tgatcaaccg aaagtcgacc ctgtctctga agaacaagct 4020 tgctgtctac aagcaaattg tccttcccgt cattgaatac ggcatgccag tctgggagtg 4080 ttgcgccaga aaacaccatc tgaagctcca acgagttcag aacaaatttc tcaggatgat 4140 cctgaacact cccccgcgga tgcgcaccac tgaggtccat cgtctggccg acataaaaac 4200 actcgaagat cgctttagtg agttcaaaga gaggtatagg gcacgttgcc agtcatctga 4260 ccagcaggtc attagggacc tgtttcccac actttaggtt atcaaatttt ttcttgttta 4320 gtcttgtata tagtagctag gttatcaaat ttttctttta aaactaccag agcctataag 4380 gccaaattaa ttaccagggt aaaaccgaac taataactaa gtaacaaaat aaatgtatgt 4440 atatcaaagc tgaaggacct cacgaggtcg acctcttaaa atgtaaaaca attttcttgt 4500 aaaacataat gtaaataaac acgaatttaa tttgaaaaaa a 4541 // ID Gypsy-249_AA-LTR repbase; DNA; INV; 1309 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-249_AA_; KW Gypsy-249_AA-I; Gypsy-249_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1309 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1100-1100 (2011). XX DR [1] (Consensus) XX SQ Sequence 1309 BP; 352 A; 323 C; 275 G; 352 T; 7 other; tgtaacctcc caaaaaggtt acatttgttt tttattgtaa atattccttg taaatagttt 60 tcttttgatt attaccattg ctattattat tattggtatt attattttca ttatttttat 120 tatcatttct actattacta ataaggtagc aagaataatc aagaaaatga atttaagaac 180 tttaaagaaa acaagaattt gaattaataa aaacaacacc gattttccaa ttcccatttc 240 ccttgaacat ccctattccc cacctcccgt aacggctgtc acatccagcc acacagccac 300 agttcagcaa aaccgtccgc gtcattgtgt tggtgaacgc tcgggacacg actacgatga 360 cgatgacgaa gacggctagg agataaaatt gtttgtcacc gatggatctg cctctagaag 420 gtgaacagtt ttcatcaccg aaagtggtgc gcgtttttaa ccgctctgat ttctccccat 480 agtgaaccaa aagtcgggcc taaatcggcc attgtagtgc cagttgtgcc gcgtgaaaag 540 ttgccgtgac tccgtcggtg gtgtttgccg cgataaagca ccggttccac cgttactcgc 600 cggaaaaagt gaaggccacg aagaagcacc ggtttcgccg taaccagccg gggaagtgaa 660 gtgccacaag tctccagtgg gccataagtg gtccgtgggt cccatcggtg tgagcgacgg 720 aagccatcag ggtgtgtgac gtccaagttg cccctgcaca acaagatccc tcccatcgga 780 gttctccgaa cgaagccggc gtgccagcag atccctcttc gccattccaa ggggtgccca 840 gttttgagga agtaccacga ttgaccggat gatgttcgtc ttcgtctcaa ggatagactg 900 gatggaagga ggagcccgat ccctgcgctg tccagcgccg acttcagcca ccaatcaaca 960 ataaacaagg tggcacagtg taatccgagt caaaatccct tcctatcact ttttgtgsaa 1020 ttwgaaataw tgaaattata aaacgaacgt ctaaamtgta atactakcta ccttctttta 1080 ttttcttatt gaatcgctaa gcccttattt ctcactcttc kctgttcggt ccaccttggc 1140 aacgtagtgt tttccgacgt aggactacgt ccaggtaatc ttttataagg gaagtcgttc 1200 cgcccatctc tcccaatttc ctcttggagc cagcataaac gaccctgagg cccaggacaa 1260 aagcccggtt ggggacaaca gaggttccca acctawagaa tctctaaca 1309 // ID Gypsy-49_CQ-LTR repbase; DNA; INV; 2181 BP. XX AC AAWU01035675; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-49_CQ_; KW Gypsy-49_CQ-I; Gypsy-49_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2181 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 478-478 (2011). XX DR Genome; AAWU01035675; Positions 31319 33499. XX SQ Sequence 2181 BP; 547 A; 455 C; 530 G; 649 T; 0 other; tgtaacgttc tgcgttgatt tgggaattta tagtttattt ttttatttaa ttggtagaaa 60 taatgcacat ctttacccta ctcatattct aaggttcgtt tccaataagc gaaagcttgt 120 ttcttggctg cagggttata gtattagtaa gtcttaaata ccatgggtac gttcttgagt 180 acgttggttg tatttttagt ttgctataaa gtattgcgag ttttgcgtgc ggggtcgtat 240 ctgtaaatac caagagtgtg catgtgggtg gttagaaaat ttgcagttca aaatataggt 300 ggcagtaaat acttagttta aaaaaaaatt gaattattct taactattta attccattgg 360 gtaacacaaa tgtcaaaaat cctctgagtc atgacgaaga gggttgctgg ctagaaaagt 420 tttccaactc gtccatttcg attacggagc ggctggaata aagaagtgcg cgtcgcgagc 480 aatctttata aaagttaatt ttttcctttt tacgttcaat ttttctaagg taaaagtgta 540 tatggtgcga gtgctttgaa gtgaattaac ttaaatattt tcctatccat tttttgtagt 600 tctttcgaat ttattgaaac ttaaataaat ttcgcgtcga acttctttag tgaaataatc 660 tttcgtaaaa taaaaggtaa agtaattgaa ttatattagg agttgtggga gctaattagt 720 tgcattggac agcagcgtcg tcccacggcg ctttttatta cgggctggat ttttcttcgc 780 ggggtctgcg gatttcgtgc gtcgaggcga tcagctacaa gcgtcgccat cttgtcgttt 840 ggaggttaaa gttcggattt tgggcaagct gcctggcagc gccgccatct tgtgctagga 900 gcgattaaag ggtcgcaagc tcgtgtgcca cgttacggcc cgcccattgg ggtagctaaa 960 cagaggcgag tggcaagttc gttaaattaa ccgtaacgaa attaacgcag tggtcgtcaa 1020 ttgccaccgt cgtttaaagt caccaaagag gtgacagccg agagccggta ttcgctgtgc 1080 gcgccaatta cgttcagtgt ttgccagtcc aaggggaggg ttcccgtggc ctttatcggt 1140 tgtgtagacg gaaacagtag tacgtgaagg ccggtattga taccccagcg tcccttcctg 1200 cggaaataat ccggattagc tcgattcgtc gtgcgtagtg ctctctggcc ccgaaaccgg 1260 aagtcttccg ccagccgcgc caccctctgc acctttaatt cggccaattt gcacacgagc 1320 gacccccgcg tgtcaccaca ccgaacacgc cggttatcgg agcaaaccca gacgccatct 1380 tgcggcccaa ggccgaaccg aacgagttcg tcgtttacca agcagcggac cactgggagg 1440 actactaaac gcgcctgttg cagcgcgctg aaggaacacc acgaagcgtc gcacacccct 1500 ggtcgtgcga catccacact gccagcagcg tgttcatcgg agcggcgccg gcatcggagc 1560 aacgtgttca tcggcggtgg attcgccgcg ctcacgcaca ccaccgtaag cccgaaaaac 1620 ctgcgcaggt atgtgggaaa attttatcat ggccatgagg tttttccccg gggggctgtc 1680 caatgctagg gagattgtag gggataggtt tgtgagggat taaagaaagt gaaatgaaaa 1740 atctttagtt tttctttggt ttcgacggag taaaaagttt gatggttctt ggaaccagaa 1800 ataaattcat gccttaaatc actcacactg ttttattatt tacgttcttt acgttcgttt 1860 ttggatatcc attccttatt ttccgattct ctttagcggt ttgaatggtt gaaaagtgtt 1920 tcgctgtgaa ccttctggat tcaaatagtc gattaaggct ttgtttgcca cactgtttat 1980 tatctacctg atttcgagca actcttggta ttgaacagat ctgacgtttc gcgcatttaa 2040 accgtattaa cgaccctgaa agaattaaac ctcatgctta gtcgggcaat taaaacatga 2100 caagtggtct ctcaacttga gagtggcgca taagccactg tgttacaggc tttccctagc 2160 tcggttgggt gaacggttac a 2181 // ID Gypsy-100_CQ-LTR repbase; DNA; INV; 1191 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-100_CQ_; KW Gypsy-100_CQ-I; Gypsy-100_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1191 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 580-580 (2011). XX DR [2] (Consensus) XX SQ Sequence 1191 BP; 320 A; 311 C; 317 G; 242 T; 1 other; tgtgaccaac caccttggtt tgcccacatc gtctcatgaa ccttccctga acactcgggt 60 catttcaaga ggtagaccac ttagaaaagc tagagttagg gtagattttc aaccaagaaa 120 cgcaaaataa gcgcaacatt tgcaaatata ataaatctgc accaattcgt caaacaacag 180 tcgttttgaa tttaaatagc tagaaaaagt aatagaatgg cacaagmgat ttgcatcaca 240 catcacatta ggcattgttt tcagtagcga caacttggct actcttatca acacccacga 300 aacgcagaga tgaacgcaac ctcgacttct gcgtgatctt cgagggctct tcggagtctg 360 gacgggatcg actcgttcac ttgcttgagc accaggaagg agtccacacg ctcacgaaga 420 tctaacacca gtgagtagaa ccgcgattta agaacctccc acgaggtcga ggaagtcccg 480 aaagccaccg tctacggctg agggattgcg cgatcgtacg gccatccgta acctggccaa 540 gtcccgcgag ttctagtgtt cgaccagcac gctaattacc ctggattgcc ccttggcaag 600 tgttcctgct agccacgagc taagcatcgg tgggctccgt tacacatagc caccgccacg 660 aaccgcaccg gagtgtgcct ggacgtgcac gacaaccacg cacggtcctc gaacgtcccg 720 gatcgtctac tacgggtacg gaagcgagta cgagagacgc cgtgtcacga ggttccggac 780 gggaacgtga cttcggcgaa tactacggtg ttccacgccg gaacgcggga agtcctgcaa 840 cacgaggagg agacggcagg caacgagaag tcccaggtcg aggacgggga cggagcgtgt 900 tcgagatccg accacccacg cgtcggggaa cgcaccagat cccgatgtcc acgagcgacg 960 ccgagagtag ctgatcagcg ggcccataca ctgacaccac acaagtaagt ccggttaggt 1020 agcgtaggga gtaggttgtt agggcagatt ctcaaataaa cgtgccatga aaacgtagaa 1080 gtggtctttt ccctctttgc tttcaaatga agattaactg agtgagccga ccctgggtgt 1140 tcttgggggg ttttgtgcga ctgagcgggt tgtaacaaag ggaaagttac a 1191 // ID Ginger1-9_HM repbase; DNA; INV; 3758 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3758 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 120-bp. Tpase gene contains 2 introns: 510-888, 1166-1667. XX FH Key Location/Qualifiers FT CDS join(160..509,889..1165,1668..3131) FT /product="Ginger1-9_HM_1p" FT /translation="MDVEEYNNLVDLIVRGIYPQSIDKINKDGLRRKSKRY FT LVKDGLLFYYDKKRNMDLLVVLTSQKTMILEGCHSAILGGGHFGRDKTLEK FT LSERYYWKGMVNDVXXFCKYCDKCQRANRAFEKHSAELHPIKVKDEVWSTV FT GIDLIGPLPLTEKGNKYIITATCLFSKWPEAASLSDKTATSAAEFLYTCFT FT RHGCCEVQISDQGREFVNEVNHELNKMMGTKCNVTSAYHPQSNGEDERFNQ FT TLQRQLLKYVDEKQNTWDLYIESILFSYRVSVQDSTKQTPFYLVYGRQARL FT PVDLQMKSIKNNFENKPTIDDSLKNRESLLENLIGMRKNALQNIEKAQERQ FT KKAYDAKHCSENQLFKVGTQVLIKNSKKLXRKGSKMEPNWTGPYKICQILK FT KNTFRLCNINDCNKKLKQVYNMTRLKIYYPKSENMVPNNIVEIVQSNSEEK FT FKSISEQLSIFTLTMKKVILDGKELTDEHINFAQLIIKKQFPDIIGLQDTL FT LSQTDGFKAVSTDRISVQIHFINSHWVTSCSINGSISLYDSMYSKLDSHLI FT NQLARCYKNFADYDCEFPPSIVVNXKSVQQQTGSLDCGLFAIANAVHLALG FT XQPENLSYNQKFMRQHLEKMFYAGQFKPFPLLNKRSKLNITRNNSSAVNVQ FT LHCTCLMPCTYDFMVACCKCLKWFHIKCVHYKEDNSVQEWNCEICKLS" XX SQ Sequence 3758 BP; 1379 A; 492 C; 565 G; 1309 T; 13 other; tgtcacagga aattttgctg ctaggaaatt tcgcggctgc ggcctaattt cctaggaaaa 60 tctgctgctc agctgcatta tatcctagga aaaattgccg ctcgcaggaa aatttgccgc 120 ttttcatctc tttacctttc ttgatatcat tttataaata tggacgttga agaatataac 180 aatcttgttg atttaattgt cagaggaata taccctcaaa gcatcgataa gattaacaaa 240 gatgggttaa gaagaaagtc caaaagatat ttggtaaaag atggcctact tttttactac 300 gataaaaaaa gaaacatgga tctgctagtw gttctaacaa gtcaaaaaac tatgattctc 360 gaaggctgtc attctgcaat tcttggtggt ggtcactttg gtcgagataa aacattagaa 420 aaactctcag agaggtacta ctggaaaggg atggtaaatg atgtgargar tttttgtaaa 480 tattgtgata aatgtcaaag agcaaacagg tatgtttgtt aaataactta ctttttatat 540 actaaaaaat taagtttaaa aaattattat ttaataatta ctaaaaacaa taacaacaat 600 tataattgta attaataaaa ctacaggtat tattaataat ccttataggt attattaata 660 ataacttaga gataatatat tatctaaaca taattgtaaa aaaatgtatt atctaagaaa 720 aatattatct aagtaataac aaaaatatta cctaagtaat aacaaaaatt aaacataatt 780 tttcaaataa tataaattga taaatactta tttaactaaa attataattt gtaactaacc 840 ttaaaatatg cataatttct attatttctt tcataatata ctttatagag catttgaaaa 900 acactcagct gaactacatc ctattaaagt taaagatgaa gtatggagta cagttggaat 960 cgatttaatt ggacctcttc ctttaacaga aaaaggaaac aaatacatca taacagcaac 1020 ctgtttatty tccaagtggc ctgaagctgc ttccttatct gataaaacag ctacaagtgc 1080 agcagagttc ttgtatacat gctttacrag acatggttgt tgtgaagttc agataagtga 1140 tcaaggaaga gagtttgtta atgaggtact gtgttggatt tttgtttatt taattgtctt 1200 tttttttttt ttttgttaat ttttagtttt cattactttt ttggaaaaag ctctattttt 1260 tgtataactt tcatgattct aaattttgat aagttgatat atcataattg gatgtgaaga 1320 gtgaataaag aataagtcag aatttttaca actttagaac ttgttcattt aaattgataa 1380 acaaaatgag agttaaaaaa acaacagtta atcagcctca tctgcaatgc ttatatagaa 1440 aaattttttg taaactttgt caatattttg cgcaacaaaa atgttttatg ctaaaacttg 1500 ctaaacttaa ttcttttaat aactaatttt tatttattgg gttatgttat aattgcttat 1560 tatttaatac aaaccctata caagcatttt ttacaggcca ttttataatg ggtctatttt 1620 cttatattat ttctcttcat ataaatattt ttttttattt tgtttaggta aaccatgaat 1680 taaataaaat gatgggcaca aagtgcaatg ttacaagtgc ctaccaccca cagagcaacg 1740 gtgaagatga aaggttcaat caaactcttc agcgtcaact tttgaaatat gttgatgaaa 1800 agcaaaacac gtgggatcta tacattgaaa gtattttgtt ttcatatcgt gtctctgttc 1860 aagactctac taaacaaacc ccattttatc tagtttacgg gagacaagct aggcttcctg 1920 ttgatttaca aatgaaatct ataaaaaata attttgaaaa taaacctaca atagacgaca 1980 gtttaaaaaa tagagaaagt ttattagaaa atttaattgg aatgcgaaaa aatgcactac 2040 aaaatattga aaaagctcag gaacgccaaa aaaaagcata tgatgccaaa cattgttcag 2100 aaaaccagtt atttaaagtt ggcactcaag tgttaataaa aaatagtaaa aaacttwcca 2160 gaaaaggktc aaagatggag ccaaattgga ctggcccata caaaatttgc caaattttaa 2220 aaaagaatac ttttcgttta tgtaatataa atgattgtaa taagaaatta aaacaagtat 2280 acaatatgac tcgtttaaag atttattacc caaaatctga aaatatggtt ccaaataata 2340 ttgttgaaat agtccaaagt aattctgagg aaaagtttaa atctatatct gagcaacttt 2400 ctatttttac actgacaatg aaaaaagtta tactagacgg aaaagagttg actgatgaac 2460 atataaattt tgctcagttg ataataaaga agcaatttcc tgatattatt ggtttgcagg 2520 ataccttgtt gtcacagact gatggattta aggcagtcag tacagataga atttcagttc 2580 agattcattt tattaattct cattgggtca cttcatgttc tataaatgga tcaatttcac 2640 tttatgatag catgtatagc aaattagatt cacatctaat taatcagtta gctagatgtt 2700 acaagaattt tgccgattat gattgtgaat ttccaccaag tattgttgta aacrttaaaa 2760 gtgtccaaca acagactgga agtttggatt gtggattatt tgctatagct aatgctgtcc 2820 acttggcttt gggymaacaa ccagaaaatt tatcatataa tcaaaagttt atgagacaac 2880 atcttgaaaa aatgttttat gctggacagt ttaaaccctt tccattactc aataaacgaa 2940 gtaaattgaa tattacgcgt aacaattcct cagctgtaaa tgttcaactg cattgcacct 3000 gcttaatgcc ttgcacttat gattttatgg ttgcttgctg caaatgctta aaatggttcc 3060 atattaaatg tgtccattat aaagaagaca attcagtgca ggagtggaat tgcgaaatat 3120 gcaaactgtc ataaatgttt tttgtaaaat attattaaaa aatatattat ttttgtctta 3180 cagttttatt ttaattattc aagtcagcct attatatagt gtagcaacaa gatactttaa 3240 aaaagttcat taaaagctag aaatatttac ataattaaat atataaaata acttgcaaaa 3300 ctaattaatt tatattttta gttaatgcta ataaaagtat aattatcaat tgaaawtttt 3360 tttttttctt atgttggaat atttattcca agggttaagg aggattactt tcattaagtc 3420 taaggttttt attatgtagc ttacttaaaa agtaaagtga tttaaaaagg tgattttatt 3480 aagatgaaat ggatacaaac tatgaaaaat actttactaa ttgtaagtaa attttaattt 3540 gaactttata gtatgmtcat ytaatttaac ctgtttgttt gggtaaaaaa atgaacaatt 3600 tttaggaaag attggctcta attgatttgc ttttttcggc agcaaatttt cctgcgagcg 3660 gcaatttttc ctaggatata atgcagctga gcagcagatt ttcctaggaa attatgccgc 3720 agccgcaaaa tttcctagca gcataatttc ctgtgaca 3758 // ID DNA8-102_AP repbase; DNA; INV; 155 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-102_AP. XX NM DNA8-102_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-155 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2040-2040 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 155 BP; 40 A; 34 C; 38 G; 43 T; 0 other; cattggcgca actagacatt tgaactaagg ggtgcttaag ctccaggttt ttttgtgact 60 caataggacc taacacctaa agatttttta gggggtgcta attttgactt agggggtgct 120 aaggacccaa agcacccccc ctagttgcgc caatg 155 // ID Gypsy-17_CQ-LTR repbase; DNA; INV; 164 BP. XX AC AAWU01030073; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_CQ_; KW Gypsy-17_CQ-I; Gypsy-17_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 414-414 (2011). XX DR Genome; AAWU01030073; Positions 3098 2935. XX SQ Sequence 164 BP; 37 A; 48 C; 29 G; 50 T; 0 other; tgttatagtt ccgctttaga gtgtaccttg ccaagctccc tttccctcta ctttgggtaa 60 taccatttag ttgtgtaaga acctcgctag aataaacacg cgtcgcatcg cgctctctcc 120 aacaacgtgc tttctttatt ggccacaacc tcgcggctac aaca 164 // ID Copia-31_CQ-LTR repbase; DNA; INV; 135 BP. XX AC AAWU01021680; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-31_CQ_; KW Copia-31_CQ-I; Copia-31_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-135 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 372-372 (2011). XX DR GenBank; AAWU01021680; Positions 40054 40188. XX SQ Sequence 135 BP; 37 A; 34 C; 20 G; 44 T; 0 other; tgaagatcga agtaatcaat gacaatcggt gatcattaac tttaagtttt ccgttcaata 60 aaacttcatt ccacatttaa ctttccccac aaccagtcgt gttttccctc gctctctcta 120 gaggttatgg gccca 135 // ID Gypsy-18_RP-LTR repbase; DNA; INV; 221 BP. XX AC ACPB02042122; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_RP_; KW Gypsy-18_RP-I; Gypsy-18_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02042122; Positions 12441 12221. XX SQ Sequence 221 BP; 65 A; 47 C; 36 G; 73 T; 0 other; tgcacactat ccacacagac acccacctgc acagataaga cgcttgccaa tgggatatat 60 acccctggaa acaggagatg gcagagatac tcgttacgta cttcattcat tagacattat 120 actggttgtt taaagccaaa gtgttgtaat ctttctcagt gtgtattcct tgttaataaa 180 gtttttgttt tttaaaaact ttacttcttc atcgattcac a 221 // ID Copia-5_DWil-I repbase; DNA; INV; 4673 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_DWil_; KW Copia-5_DWil-LTR; Copia-5_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4673 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 602793 598121. XX CC Positions [1626-1895] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1641..3572 FT /product="Copia-5_DWil-I_1p" FT /translation="MREFCVKNGISYHLTVPHTPQLNGVSERMIRTITEKA FT RTIVNGAKLNKSFWGEAALTATYLINRIPSRALGENKMTPYEMWHNRKPNL FT KSLKVFGSTVYVHNKVKKGKFDEKSFKAILVGYEPNGYKLWDVVKEKFFVA FT RDVVVDETNMVYSRASKPEEEVPNESRERNYEELLYESKECNHLNIRNDST FT ECVRTKFLNESKECDHLNIRNDSTECVRSELLYDSKECDELNFSNENKKRK FT HTDFLNENKEIYIPNESTDCEKDDQSKDGVQKNTDTKITQRKSERLNAKPK FT ISYSEDDESLNKYILNAHIFFDEIPSSFDEIKFRDDKSAWEDAIKTELKAH FT EINNTWTLTKMPENKNIVGSRWVFSIKHNELGIPIKYKARLVARGLTQKYQ FT VDYEETFAPVARIASFRCIMELAVQYDLIVHQMDVKTAFLNGVLNEEIYMQ FT PPQGISCDAGNACKLNKAIYGLKQAARCWFQVFEQALKECSFVNSPVDRCI FT YILDKGDVNDKIYVLLYVDDVVIATRDLERMNNFKIFLSTKFKMTYLNEIR FT HFIGIRIKMDQDIIYLSQAAYIKKILNKLNMDNCKSVSTPLPSKLNYDLLN FT SDEDCKAPCRNLIGCLMYIMLCTRPDLPTAVNILSRYSSKNNSELW" XX SQ Sequence 4673 BP; 1759 A; 665 C; 847 G; 1402 T; 0 other; acaggttatg ggcccagtcc atgcctattt ttgaagaagt tttattattt caatatattg 60 tgaaattcca actgtgcact taaaaatttt ctttctcttt ctttttcgtg atatttgtcc 120 gtaaagaaat ggaaaagaat aagtacaatt aaaccgttcg atggagaaaa atatggtgtg 180 tggaagttta gaattaggga gttgtttgcc gaattagacg ttctcaaagt tgttgatgag 240 ctaatgccaa atgaagtgga cgaaacttgg aataaggcag aaagaagtgc aaaaagcata 300 ttagttcaac atttgagtga ctctttttta cacttcgcgg cagacaatat ttccgcacgt 360 ggtcttatgc aaaaattaga tgccatatat gaactctttc cagtcatttc gttaaatttg 420 atgatattgt cagtgagata ttagcggcgg gtggcaagtt agccgaaatg gaaagagtta 480 tgtacttgct gcatacgttg ccttctactt acgatggtgt gattacagca atagaaacat 540 tgtccgatga aacactttca cttacctttg taaaaaataa attattagac tatgaaacta 600 aactaagaag tgatagcaat gacactagca aaaaggtcat gaaggcagtt gttaaatata 660 atcacaattc caatataaat cacgtgttta aaaatagagg cacaatacaa aagaaaaggg 720 tatttaaagg aaagcaaaaa ttcaacatta aatgccacca ttgtggtaga ctagggcaca 780 taaaaaagga ttgttttcat tttttaaaag aaattaaaaa taaagaaaat aaaactaata 840 aagaaaacaa taacaataac aataaacaag ttcagttgca acatccacaa ataaccacgg 900 ttttgctttt atgataaaaa atttaaacca agttgataaa ccaaaacaac ttggttttat 960 tattgattcg ggtgcaactg atcacctaat caatgatgaa tcgctgttta atgattgcgt 1020 ggaattggag caaccaataa aaattgctgt ggcgaaggaa ggccaataca tattcgctac 1080 aaaatgtgga atcgttcgat tatacaatgg ccacaacatc accttggagg atgttttata 1140 ctgcaaggag gcagcagaaa acttgatgtc agtgaagcgt ctgcaagaag ccggaatgtc 1200 aatccatttt gaccaagacg gtgtgtcaat taagaaaaat ggcatcactg ttgttgaaag 1260 ttcgggtaag ttcaacaaca tacaaatact aaaattccaa gcatacagta taaatgcggg 1320 aaataataat tatcgattat ggcatgagag attaggtcat gttagcaaag caaaactatt 1380 ggaaataaag aataataatt tgtttattga tagcagtctt ctaaataatt taaaaatttc 1440 ggtttgtttg ttgtccattc agacgtttgc ggcccaatta caccaacaac aatagacgat 1500 aaaaattact ttgtgatttt tgtagaaagt ctgacgtttt caatgctttc aaagattttg 1560 ctgcaaaaag tcacgctcac tttaatctta aaattgttaa cttgtatatt gacaacggta 1620 gataatacct gtcaaatgaa atgcgtgagt tttgtgttaa aaatggaata agctatcatt 1680 taaccgtgcc acatacacct cagttgaatg gtgtctcgga aagaatgatt agaaccatta 1740 cagaaaaagc tcgcaccata gtaaatggag caaaattaaa caaaagtttt tggggtgaag 1800 ctgcactcac tgcaacatat ttaataaaca gaattccaag cagagcactt ggtgaaaata 1860 aaatgacccc atatgaaatg tggcacaata ggaagccaaa tttaaaatct ttaaaagtat 1920 ttgggtcaac tgtctatgta cataataaag taaagaaagg caaatttgat gagaaatcat 1980 ttaaagccat tcttgtaggg tatgaaccaa atggctataa attatgggat gtcgtgaaag 2040 aaaaattttt tgttgctagg gacgttgttg tcgatgagac aaatatggtt tattctagag 2100 catctaagcc tgaagaagaa gtcccgaatg agagtaggga aagaaactat gaagaactct 2160 tatacgaaag taaggaatgt aatcatctga atatccggaa cgatagtacg gagtgtgtta 2220 gaacaaaatt cctgaatgaa agtaaggaat gtgatcatct aaatatccgg aacgatagta 2280 cggaatgcgt tagatcagaa ctcctatacg acagtaagga atgtgatgag cttaattttt 2340 cgaatgaaaa taaaaaaagg aaacatactg atttcttgaa tgaaaataaa gaaatatata 2400 tcccgaatga gagtacggat tgtgagaaag atgatcaatc aaaagatggc gtgcaaaaga 2460 atactgatac taaaataacg caaagaaaaa gtgaaagact taacgccaaa ccaaaaatat 2520 cttatagtga agatgatgag agtttgaaca aatatatttt aaatgcccat atatttttcg 2580 atgaaattcc aagctcattt gatgagataa aatttagaga tgataaatct gcttgggagg 2640 atgcaataaa gacggagtta aaggcccatg aaattaataa cacttggaca cttacaaaaa 2700 tgccagaaaa caagaatatc gttggtagta gatgggtatt ttctattaaa cataatgaat 2760 taggaatacc aataaaatat aaagctagac tagttgcaag aggacttact caaaaatatc 2820 aagttgatta tgaggagact tttgcacctg ttgctagaat tgctagtttt agatgcataa 2880 tggaactagc agttcaatat gatttaatag tgcatcagat ggatgttaaa acagcattcc 2940 ttaatggtgt tttaaatgaa gaaatatata tgcagcctcc tcaaggtata tcatgcgatg 3000 ctggaaatgc ttgtaagtta aataaagcga tttacggcct taaacaagca gctcgatgct 3060 ggtttcaagt gtttgagcaa gcattaaaag aatgtagttt tgtaaactct ccagtagatc 3120 gctgcattta tatcctggac aaaggtgacg taaatgataa aatatatgta ttattatatg 3180 ttgatgacgt agtaatagca acaagagact tagaaagaat gaataatttt aaaatttttt 3240 taagtacaaa attcaaaatg acttatttga atgagatacg tcattttatt ggaataagga 3300 taaaaatgga tcaagatata atatatttga gtcaagctgc atatataaag aaaattttaa 3360 ataaattgaa catggacaac tgtaaatcag ttagtacccc tttgccaagt aaacttaatt 3420 acgacctgct taactcagat gaggattgca aagccccatg tcgtaatctt attggatgtc 3480 taatgtacat aatgctttgt acgcgtccag atttacctac agctgttaat atcctgagta 3540 gatatagtag taaaaataat tcagaattat ggtagtgttt gaaaagagtt attagacatt 3600 gacatgaaac ttcattataa gaaaaataca tcatttgaaa acacactagt tgtatttgta 3660 gactcagatt ggggtggaaa tgaacttgat aggaaaagta caacgggcta tttatttaaa 3720 atgtttgatt ccaatctaat ttaataatct aatacaaaaa aacagaactc agtagcagcc 3780 tcgtcaactg aagctgagta catggctctt tttgaagccg taagagaggc tctatggttg 3840 aaatctcttc ttagtagtgt aaacatataa ctcaaaaggc ccattcaaat ttatgaagat 3900 aaccaaggct gtattagtat agcaaataat cccttatgtc ataaaagagc caaacatatt 3960 gatataaaat atcatttctc tagagaacag ataaagaata acgaaatttg tattgagtat 4020 atatcgacag ataatcaact ggcggacatc tttactaagc ctctacctgc tggacgattt 4080 gcagaattac gagatatgtt gggactacac atcgactaga atcaacaatt aatcttttta 4140 tgaattgcta aagaaatcta atttgactta atgttacttt aaaaaaaaaa cgcatactga 4200 attgctggat taataccaaa tttttgattt gacttaatgt aatttttaag aaacaaactg 4260 aattgttgaa tttaaaccaa atctaatttg acttaatgtt acttttaaaa aaaaaaaaaa 4320 cgcatactga attgctggat taataccaaa tttttgattt gacttaatgt aatttttaag 4380 aaacaaactg aattgttgaa tttaaaccaa atccaatttg acttaatgtt actttaaaaa 4440 cgcatactga attgctaaaa tggatactca tacactaaat ttatgttata accctatatt 4500 ttatgtatat tacaatgatc tgatcaggtt ttctctgggt ttccctgtat ccttgcagca 4560 aatgctggat cattcaatat tccccagaat gcacaccaac cacgtcagaa taagaattca 4620 atatttttat atttgtaaac ttaatgataa tgttattttt gaggggtgcc tat 4673 // ID R2_DSi repbase; DNA; INV; 3607 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE simulans. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_DSi. XX OS Drosophila simulans OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-3607 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 161..3349 FT /product="R2_DSi_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="TCLAANLSGKNFSDGLVTQRKFTHIGTTNTNNEPRIS FT LHNLMTTRPSVDIFPEDQYEPNAAATLSRVPCTVCGRSFNSKRGLGVHMRS FT RHPDELDEERRRVDIKARWSEEEKWMMARKEVELTANGHKHMNKQLAVYFA FT NRSVEAIKKLRQRGDYKEKIEQIRGQSALVPEVANLTIRRRPSRSEQNHQV FT TTSETTPITPFEQSNREILRTLRGYSPVECHSKWRAQELQTIIDRAELEGK FT ETTLQCLSLYLLGIFPAQGVRHTLTRPPRRPRNRRESRRQQYAVVQRNWDK FT HKGRCIKSLLNGTDESVMPSQEVMVPYWREVMTQPSPSSCSGEVIQMDHSL FT ERVWSAITEHDLRASRISLSSSPGPDGITPKSAREVPSGIMLRIMNLILWC FT GNLPHSIRLARTVFIPKTVTAKRPQDFRPISVPSVLVRQLNAILATRLNSS FT INWDPRQRGFLPTDGCADNATIVDLVLRHSHKHFRSCYIANLDVSKAFDSL FT SHASIYDTLRAYGAPKGFVDYVQNTYEGGGTSLNGDGWSSEEFVPARGVKQ FT GDPLSPILFNLVMDRLLRNLPSEIGAKVGNAITNAAAFADDLVLFAETRMG FT LQVLLDKTLDFLSLVGLKLNADKCFTVGIKGQPKQKCTVLEAQSFYVGSRE FT IPSLKRTDEWKYLGINFTATGRVRCNPAEDIGPKLQRLTKAPLKPQQRMFA FT LRTVLIPQLYHKLALGSVAIGVLRKTDKLIRYYVRRWLNLPLDVPIAFIHA FT PPKSGGLGIPSLRWVAPMLRLRRLSNIKWPHLTQNEVASSFLEAEKQRARD FT RLLAEQNELLSRPAIEKYWANKLYLSVDGSGLREAGHWGPQHGWVNQPTRL FT LTGKEYIDGIRLRINALPTKSRTTRGRHELERQCRAGCDAPETTNHIMQKC FT YRSHGRRVARHNCVVNRIKRGLEERGCVVIVEPSLQCESGLNKPDLVALRQ FT DHIDVIDIQIVTDGHSMDDAHQRKINRYDRPDIRTELRRRFEAAGDIEFHS FT ATLNWRGIWSGQSVKRLIAKGLLSKYDSHIISVQVMRGSLGCFKQFMYLSG FT FSRDWT" XX SQ Sequence 3607 BP; 1070 A; 807 C; 902 G; 828 T; 0 other; gggatctggg gtaattgcga gcagaggggg agtatttttc tgtaattcgt aagtcatatc 60 atatggtgtg cggaagggga attttactct gtaactcaca agtctctcct ttactcaagt 120 cgactcaaaa cctcctcgtg gtggtccccg gtaatgctaa acttgtttag cagctaattt 180 gagcggcaaa aacttttccg atgggctggt tacccagagg aaatttactc atattggaac 240 tacgaacaca aataacgagc ctcggatatc tttacacaat ctgatgacga cccgaccctc 300 cgtggatatc ttcccggagg accaatatga accaaacgca gcggctactc tatctagggt 360 tccctgcaca gtatgtggcc ggtcctttaa cagcaagaga ggactcggtg ttcacatgcg 420 atctcggcac ccagacgaac ttgatgaaga acgtcgacgt gtcgatataa aggcaaggtg 480 gagtgaggaa gagaagtgga tgatggcgag aaaggaggtt gagctcacag caaatggaca 540 taaacacatg aacaagcaac tagcggtgta ttttgcaaac cgcagcgtcg aagccatcaa 600 aaagctaaga cagaggggcg attataagga gaaaatagag cagataagag ggcaatctgc 660 tctcgtcccg gaagttgcaa atctaaccat aaggcgccgc cctagtagaa gtgagcaaaa 720 ccaccaagta acaacatcag aaacaactcc aatcactccc ttcgaacagt cgaacaggga 780 aattttgcgg acactgcgtg ggtatagccc cgtagaatgc cattccaaat ggagagccca 840 agagctacaa acgatcattg acagggcaga gctcgaggga aaggaaacca ctctccaatg 900 cttatcgcta tatctcctgg gaatttttcc ggcacagggt gtacgacaca cgctgacgag 960 acctcctcgg agacctcgga ataggagaga aagcagaagg cagcagtatg ctgtcgtcca 1020 gcgtaactgg gataagcata aaggaagatg catcaagtcc ttgctaaatg gaactgatga 1080 gtcggtaatg ccaagccaag aagtaatggt tccctactgg agagaagtaa tgactcagcc 1140 tagcccaagc tcttgcagtg gagaagtgat acaaatggat cactcgcttg agagggtttg 1200 gtctgctatt acggagcatg accttcgggc gtcaagaatc tcattatctt catctccggg 1260 gcctgacggg ataactccaa aatctgccag ggaggtgccg tcaggtatta tgttgcgaat 1320 aatgaaccta attctatggt gcggtaatct accacactct atccgactgg ccagaaccgt 1380 cttcatcccg aaaacggtga cggcgaagcg accgcaagac tttcgtccaa tatcggtgcc 1440 ttcagtcctg gtaagacagc taaatgccat attggcaacc cggttgaact catcaatcaa 1500 ttgggacccg cgccagcggg gcttcttacc taccgacgga tgtgccgata atgcgacgat 1560 agttgactta gtcttgaggc atagccataa gcactttaga tcttgctaca tagctaattt 1620 agatgtaagc aaggcattcg attctttatc gcatgcatct atatatgaca ccttacgtgc 1680 ttatggtgcg ccaaagggct tcgttgacta cgtacagaat acgtacgagg gtggcggtac 1740 cagtctcaat ggggacggtt ggagttcaga ggaattcgtc cctgctagag gagtgaagca 1800 gggtgaccct ttgtctccta ttctatttaa cttggtaatg gacaggttac ttagaaacct 1860 acccagcgaa attggtgcca aagtcggaaa tgccattact aacgcggccg cgtttgcaga 1920 tgatttggta ctatttgctg aaactcgaat gggacttcaa gtattgttgg acaaaacgtt 1980 ggattttcta tctctcgtcg gcctcaaact taatgccgac aaatgtttta ccgttggcat 2040 taagggccag ccgaaacaga agtgtaccgt gctagaggca cagagcttct acgtaggctc 2100 gagggagatt ccatcactga agcgaacgga cgagtggaag tacttaggca tcaacttcac 2160 tgcaactggg agggttcgat gcaatccggc cgaggacatt ggtccaaagc tacaaagatt 2220 gacaaaggcc cccctcaaac cacaacagag gatgttcgcc cttaggactg tccttatccc 2280 acagctctat cacaagttag cccttgggag tgtggcgata ggcgtcctac gaaaaactga 2340 caaattaata agatattatg tgcgaagatg gctaaatctt ccgctggatg tgccgatagc 2400 attcattcat gcacccccaa aaagtggagg tctcggaatt ccatcactaa gatgggtagc 2460 tccaatgtta aggctaagac gattgagtaa tattaaatgg cctcacctca cgcaaaacga 2520 ggtagccagc tctttcctcg aagccgaaaa acaacgggcc cgagatagat tattagcaga 2580 acaaaatgaa ttgttatcgc gtccggcaat agaaaaatat tgggcgaaca aattgtacct 2640 ctcagttgat ggcagcggac tccgtgaagc gggccattgg ggaccgcaac acgggtgggt 2700 taatcaaccc acgcgtttac taacaggaaa ggaatatata gacggtattc gtctgcggat 2760 aaatgcccta cccacgaagt ctcgtactac aaggggaagg cacgaattgg aacgacagtg 2820 tcgtgcagga tgtgacgctc ccgaaacaac aaaccacata atgcaaaaat gttaccgatc 2880 gcatgggagg cgcgtagcta gacacaactg cgtagtaaat cgaatcaagc ggggacttga 2940 ggagagaggc tgcgtggtca ttgttgaacc aagtctgcag tgcgaatccg gccttaataa 3000 accggacctg gtggcactac gacaagatca cattgatgtg atcgacatac aaattgtgac 3060 agacggacac tctatggatg atgcacacca gcgcaaaatc aatagatacg acagaccgga 3120 catacgaact gaattgcgtc gcagattcga agccgcaggt gacattgaat tccattctgc 3180 caccctgaac tggaggggga tctggagtgg tcaatccgtt aaaagattga tagcaaaggg 3240 tctcctcagc aaatatgata gtcatatcat tagcgtccag gttatgagag gcagtctcgg 3300 ttgttttaaa cagttcatgt acctgagcgg gttttcccga gattggactt agctaaaacg 3360 tttggttcaa aacatttgct tgctgtcttg gcataacatc aataaaggca taaacatcgc 3420 aaaataatgg ttatatataa atggctatga ggatggtttt agtacgtagg cgttgcggaa 3480 cttcggttca gatagagcaa tgaatcgtgc atgctaggaa aactgaccac acgcagtgtt 3540 ggcagcccta gtatctttcg atagatttcc atacctccgc gatcaaaaaa aaaaaaaaaa 3600 aaaaaaa 3607 // ID BEL-210_AA-LTR repbase; DNA; INV; 199 BP. XX AC AAGE02017420; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-210_AA_; KW BEL-210_AA-I; BEL-210_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017420; Positions 7580 7382. XX SQ Sequence 199 BP; 67 A; 37 C; 44 G; 51 T; 0 other; tgatgcgtcg cacggcgaca ctttgacgag tcccttggct gaagtattga gagagagaga 60 gaagaaggtg aacatgagaa aagttctttg tcttttttga atcagttaaa ataaacgctt 120 tatacctaaa gcacgcgatt ttattgaagc accgagaatc cgtctgaaac caaacccatc 180 tgagctaaat taatgaaca 199 // ID Gypsy2-NVi_I repbase; DNA; INV; 5001 BP. XX AC AAZX01004373; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2-NVi; KW Gypsy2-NVi_I; Gypsy2-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5001 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1118-1118 (2007). XX DR Genome; AAZX01004373; Positions 5002 10002. XX CC Positions [2117-2659] - Reverse transcriptase CC Positions [3746-4069] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 364..1389 FT /product="Gypsy2-NVi_I_1p" FT /translation="MAMQSRIVENLPLNGGQDQAQIFSASNAHHIIRDFFE FT NEGPEKSKAWINELENTKTLYNWGDALCVSIAKSKLKKGAWKWLLTKSSTV FT KTLGVFKEAFTGAFTYKRSRSEKLKVMAARSQNRNETLQHYFLDKIWLCEG FT LDLEVSKIRDEIAAGLWSRDLAHYALEHEYESTDEILQDLVRVEKIHSNRR FT ERLGEHRDRHNASGEAKQSGRGDHGSSSETSSRASERGAATTNSGAHSEKD FT KRAFNSGKCFNCSESGHLAQNCSKPKQVFTCFLCKRPGHVASRCMNNATQL FT EEKSTTNTPSHIALVDSSDSDASVKKFAREISIDNKKFVVYRYGSFSLYD" FT CDS 2018..4069 FT /product="Gypsy2-NVi_I_2p" FT /translation="MDIEVESNEIVPSQPYKLSSRDRDDLDTLVTDFKRAG FT IITETDSPHASPAFVVRKKDGSPRMVVHYGKVNKLSRKRNFPIPNFDDFLE FT KFHKARIFITGDLYMGYLQMPITERAKPLTAFIIETQTGQFETAMLGLSWA FT PIYFAKLMELVLGKARKLGIALNFFDDIFVYAENWDDLLKNFEIVLQLLKD FT AGLTLNIEKCRFGMRRVEFLGYMLGEGELRPGERKIAAIDQFPRPTDKHAI FT RRFLGLAEFFRRFVPNFARRKKPLSNLLRNDVVFVWAQAQEESFQNIKSAL FT VKKPVLKLYKPNAAKTELHTNASAAGLGAMLLQADAEHEPLRLVYAISRCT FT NTVEAKYHSSKLELMAISWALARSRPLLIGLKFVVVTDCQSLVYLNAWKTK FT NAQISRWMSEIAEYDLEVKHRRGENMQHVDALSRAPLAESQSDCFDVMTMT FT VDSDESEILVFQRSDSTILELINILKKREFERDKLEKQQVESFVLKDVLLY FT KRIVRDNQERELYVVPAAMRKALVIRYHDLGSHFGIEKTAKRIEEYYYFPR FT LRRYVRMHIRNCLECIMAKNKSGTGQGELHPIAPGRRPFDVVHVGPFPSTP FT RNNKYVFGLIDNLTKFVFIVPVKNVTAVTTVKHLTDFVNLYGAPGRIVSDR FT GTSFTSHVFQEFCVKQGIKHTLNSSRHPQANGLIN" XX SQ Sequence 5001 BP; 1585 A; 867 C; 1244 G; 1305 T; 0 other; attcagaagt gggatagacc ggtagtgtgt cagagagtgg acagtagact gaacgttttg 60 tgtactgtaa aagtgtggat ttatgtcaaa ggggttggat gagtttgaac aatttgtgat 120 tgagaacaat ttgatatccg atttgggaga aacgatgtct gagaagaatt tagaggcagc 180 attgcaagag atcgagagac aaaaggctcg acggcaagag agcgtctaca ggcagaaaaa 240 acagcgctcg acctcaagcg aaagattgag gagttggaaa gggagcatcg aggatcgacg 300 ccaggttcag cggattctca gaatgacctt ttggcgcagg caatcgcgtc gcttacctaa 360 gcgatggcaa tgcaatcgcg tattgttgag aatttgccat tgaatggagg acaggatcaa 420 gcccaaattt tctcggcttc aaatgcgcac catataatca gagacttctt cgaaaatgaa 480 ggcccggaaa agtcgaaagc gtggatcaac gagttggaga atacaaaaac attgtataat 540 tggggtgatg cactttgtgt gagcatcgca aaatcgaagt tgaagaaagg agcatggaaa 600 tggttattga caaaatcatc gacagtcaag acgttaggag tattcaagga agcgtttaca 660 ggagcattta cgtacaagag gtcacggagc gaaaagctga aggtgatggc agctcgttct 720 cagaatcgta acgagacatt acaacactac ttcttggata aaatctggct ttgtgaaggt 780 ttagacttag aagtaagtaa gattagagac gaaatagctg ccggcctatg gtcgcgtgat 840 ttggcgcatt acgcacttga acacgagtac gagtcgacgg acgagattct gcaagatttg 900 gtgagagtcg agaaaatcca cagcaaccga cgtgaacgtc taggagagca tcgagatcgt 960 cacaacgcat cgggagaagc caaacagagt ggaagggggg accacggctc aagcagcgaa 1020 acatcgtcaa gagcgtcaga gcgaggagca gcgactacca acagcggcgc ccattcggag 1080 aaggacaagc gtgcgtttaa ttcggggaag tgcttcaact gcagtgagtc ggggcattta 1140 gcgcaaaatt gttcgaagcc gaagcaagtg tttacatgct ttctttgtaa gcgtccggga 1200 catgtagctt ctcgatgtat gaataacgcg acgcagctcg aagaaaaatc aacgacaaat 1260 acgcctagtc atatcgctct agtagattca tccgattctg acgcatcagt gaaaaagttt 1320 gcgcgcgaaa ttagtattga taataagaaa tttgttgttt atcgatatgg gagcttcagt 1380 ttgtacgatt aagtcgatta ttgttttgcg tgaaggatac aaaatgttga atgcgccttc 1440 gattctaaga ggttttggcg gtaatactgt tgaatccccg ggagtagttg taggaatggt 1500 acggttagat gatttaaaac cgcgcgacgt aacttttaga gtcgtgcctg atacagcgca 1560 gaaatatgat gtaattttag gacgtccttt cacagaagct caagatataa gttattcgaa 1620 ggttggaggt aatttagtgt tttcggatgt tgacttagga gaaacaagga ggtcgacaaa 1680 aacgaatgct catgcgttcg aatgcgtcga tttgaaaccc ggtacaataa actttataaa 1740 tgtcgagatt gactcgtaca aagtgccagt aggtgtgata aatactagta atgagtgtag 1800 atctgttgat gcgttagaca ctgtaggtga atctctgttt ggaatagaag tcgcgaaaga 1860 attaacgcct aggtacgctc cgattacgct ggagtaaatt gtagtggata aacacattac 1920 ggaagagcaa aagggtgttt tacttgtact tttaaataaa tatcagcgat gttttgcgaa 1980 agaattgagt gagataggca gaacggacaa attaaccatg gatatagaag tcgaaagtaa 2040 tgaaatagtt ccttcacaac cgtacaagct tagtagccga gatcgagatg acctagacac 2100 gttagtcaca gactttaagc gcgcgggtat tataacagag actgattcgc ctcacgccag 2160 tccagccttc gttgtgcgaa agaaagatgg tagccctcga atggttgttc actacgggaa 2220 ggtaaataaa ctgtcgcgta agaggaattt tcccattccg aatttcgacg attttctgga 2280 aaagtttcat aaagctcgta tattcatcac cggagattta tatatgggtt atctgcagat 2340 gccaatcaca gaaagggcta agcctttaac cgcgttcatt atcgaaacac aaacgggaca 2400 gtttgaaacg gcgatgcttg gtctctcatg ggcaccgatt tatttcgcga aattgatgga 2460 gttagtatta ggaaaagccc gtaaattggg aatagctttg aatttttttg acgatatatt 2520 tgtttacgcg gaaaactggg atgatttatt gaaaaatttc gagattgtat tgcagttact 2580 gaaagacgca ggtttaactc taaatattga aaagtgtaga ttcggaatga gacgcgtcga 2640 atttttggga tatatgcttg gagaaggtga acttcgaccc ggtgagcgta agattgcggc 2700 aatcgaccaa tttccgcgtc cgacggataa acacgcgatt cgacgatttc ttgggctagc 2760 ggaatttttt aggcgtttcg tccctaattt cgctaggaga aaaaagcccc tgtcaaattt 2820 actgcgaaat gatgttgttt ttgtatgggc tcaggcgcaa gaggaatcgt ttcaaaacat 2880 aaaatctgcg ttagtaaaga aacctgtgtt aaaattatac aagccgaacg ctgcgaaaac 2940 cgaattgcat acaaatgctt cagctgcagg tttaggcgcg atgctactgc aagctgatgc 3000 ggaacatgag ccactgcggt tagtctatgc gataagtaga tgtacgaaca cggttgaagc 3060 gaaatatcat tcgagtaagc tggaattgat ggctatttcg tgggctttag ctaggtcgag 3120 accgttatta attgggttga aatttgtagt tgtaaccgat tgccaaagtc ttgtgtatct 3180 caatgcgtgg aagacgaaga acgcgcaaat ctcaaggtgg atgagcgaga ttgcagagta 3240 tgatctagaa gttaagcaca ggcgcggaga aaacatgcaa cacgtcgacg ctctatcacg 3300 agctccgtta gcagaaagtc agtcagattg ttttgatgtg atgacgatga ctgtagattc 3360 cgatgagagc gagattttag tgtttcaacg ttccgattcg acgatcttag aattaattaa 3420 tatactgaaa aagcgtgagt ttgaaagaga taaattggaa aagcaacaag tagaaagttt 3480 tgttttgaaa gacgtattac tttataaaag aattgtacgc gataatcaag agcgagaatt 3540 atatgtagta ccggcggcta tgcgaaaagc gctagttata agatatcatg atttggggag 3600 tcattttggc atcgaaaaga cagcaaaaag aatcgaagaa tattattatt ttccgaggtt 3660 gagaagatac gtacgcatgc acatccgcaa ttgtttggag tgcataatgg caaagaacaa 3720 gtcagggaca ggtcaagggg aactacaccc tattgctccg ggacgtcgtc cattcgatgt 3780 cgttcacgta ggaccgttcc cttctacgcc gcgaaataat aaatatgtgt ttggtctaat 3840 cgacaatcta actaagtttg tgtttattgt acccgttaag aacgttactg ctgttacgac 3900 tgttaaacat ctcactgatt ttgtaaattt gtacggagcg ccagggcgaa ttgtttcaga 3960 ccgaggtacg agtttcacct ctcacgtttt tcaagagttc tgcgtaaaac agggaataaa 4020 acacacgctt aattcgagtc gtcatccgca agcgaacgga ttaattaatt gagaggctta 4080 atcagacact cattccagca atgagaatgg ataatgacgg taaaagaacc gaatgattgg 4140 gatcgcaaca tctcgaaaat cgcgcgggac attaactgta ctgtcagcgc agccactggt 4200 gttgaaccat acaaagcgtt gttgggatat ttaccacgat tcaacgatgg taatttgcgt 4260 ttattaaccg aaaattgcga aacatatacc acacctagtg aacttcagga gcgcctccgc 4320 gagcgtattt tgttagagca agctgaatat aaagcgcgat acgataagaa tcgaaatttg 4380 aaaacagctt tcaaaattgg cgatatcgtg tatatgaagc aaaacactgt agctactggt 4440 acgtccaccc aaacttcaaa agcctttcgg taaaagaccg cttgtagttg taggtgttgg 4500 tcctagtgat acgtacagag tcaaacgttt aaatgaatcg caagatcgag gattcgtaac 4560 gacagctaat gtaagtcagt taaaaatttg gacaagtttc gaaaaagaca gcgttgacaa 4620 gtttgaagac gatagagacg gcgagcaaag tattagcgat gtaccggatg agacagatgt 4680 aaatattgat agtaatattc cgagttgttc aacgaatgta atcaaaattt caggcaatca 4740 agaagaaaag cttagttcag atcagcaaaa cgaaaacaca agcgcaaaat cattgccctt 4800 ggaaaaagcg agtcgtcaga ggcgcaaacc gaaacgattt gatgactatg taatgtaatt 4860 atttagttta gcaagcaatt attttattct atgtattatt aaacaaaaaa attgttatgc 4920 gatatgtttt ataatttgta tttttcaaaa gaaaagatta gctacagttt ggagaccaaa 4980 ctcctcgagg ttggccgaat g 5001 // ID Gypsy-18_SI-I repbase; DNA; INV; 4575 BP. XX AC AEAQ01023712; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_SI_; KW Gypsy-18_SI-LTR; Gypsy-18_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4575 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023712; Positions 296 4870. XX CC Positions [3271-3780] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 158..1117 FT /product="Gypsy-18_SI-I_2p" FT /translation="MPLMHDDEGQASIQTPRQPLVQPAILEPPGYFMPGPE FT VSRVGMRMPEFSPTDPELWFSIIDRSFQAAGITVDSTKFGYALTAIGPRYT FT AEVRDIVLNPPEERAYEILKAELIKRLSSSQEHKTRRLLEHEEIGDRKPSQ FT FLRHLRSLAGNVVGDKVLRTIWLSRLPAHIQPHLVTRTEDSLDQVADIADT FT IMEATRAPPLQVAETTPSSFSRSAGQDDATSLEAKINLQLAQMRLSMQQEM FT AEQLTAIRKSIEAIGELERGRSHERRYTRPRSRSRSRPRAHGHATNGLCYY FT HWRFGPDARRCEAPCSSQQQTGNATASR" FT CDS 1024..4257 FT /product="Gypsy-18_SI-I_1p" FT /translation="MLLSLAFRPRRQTMRGSMLLTTANGKRNGQSLMAASD FT SGQLTRRLFVTDHDTRVSFLIDTGADLCVYPRRMVRGPRQKSNYELSAANG FT TTIHTYGTESLALNFGLRRVFAWRFVVADVSRPIIGADFLSFYGLLVDLRN FT GRLVDGMTSLTVQGRCLRCETPSIKTVTGATPYHELLMQYPDVTRPEGRPR FT ETKHDTRHFIKTTPGPPVVGRPRRLAPERLAAAKREFQRMMELGIVRPSKS FT CWSSPLHMVPKKGEEWRPCGDYRALNARTIPDCYPVRHIQDYAQNLQGKKI FT FATLDLVRAYHQIPVAEEDIPKTAITTPFGMYEFPYMSFGLRNAAQTFQRF FT IDEVLQGLEFSFAYIDDILVASSTEEEHKEHLKILFERLQKYGIVINPAKC FT VFGQPEIQFLGYLVTGEGTRPLPERVQAIRQYQMPRTAKDLRRYLGMLNFY FT RRFLPRAAESQAPLNDLLQDNAKGKEPVPWTPQAQRAFELTKESLAQAALL FT AHPRMSAELALFTDASDHSVGAVLQQRGDNGWEPLAFFSKKLSPAESKYSA FT FDRELLAVYLAIKHFLHMLEARVFTVYTDHKPLTFAFRQKPDKCSPRQFRH FT LDFIGQFTTDIRYISGADNIVADTFSRIDEVRSPLDYAALAASQKSDEELK FT DFLSQESSLWLKEVEIPGAGVTVWCDTATATPRPFLTKIFRRAAFESIHNL FT AHPGVKATAKLISQRYVWPSMNTDCRNWARACVPCQRSKITRHVVAPVGKF FT SAPSSRFEHIHIDIILMPSSEGKRYCLTCIDRFTRWPEAFPLEDQEAETVA FT RAFYEGWICRFGTPLRVTTDQGRQFESHLFRQLSELTGTAHLRTTAYHPQA FT NGMVERFHRQLKAAVKCHANSRWTQVLPTVLLGIRAAWREDLQATAAELVY FT GETLRLPGQFLTQRPMENSDDGANFIKELRHRFDDLRPIDGTRHGERRPFV FT FKDLGTTDQVFVRHDGPKTMLQSPYDGPFVVVRRDDKNFIISMHGKNVTVS FT IDRVKPAYLLSDSLTDAGETPPEETQGHSEDSTRGIRREQDENAPTPREQG FT ETVTRAGRRVRFPDRFQAGLR" XX SQ Sequence 4575 BP; 1259 A; 1103 C; 1199 G; 1014 T; 0 other; tctccgtgcg cctataccat tcccacacca aagtggtgcg cccgacgtga tcattctttt 60 tttttacgac caagcgcaaa gtttcgcgga attgtgacaa ttaacgcgtg tttgtgagtg 120 aaattgtgag tgcacggcca gacagaagtg cagcaacatg cctctcatgc acgacgacga 180 gggacaggcg tcgatacaga cgcctaggca accgctggtg caacctgcga tattagagcc 240 gccaggatat tttatgcctg gacctgaggt gagccgcgtg ggcatgcgaa tgccggagtt 300 ttcgcccacg gatccggaat tatggtttag catcatagac cggagttttc aagccgcagg 360 aataacggtc gattcgacga agttcgggta cgcgttgacg gcaatcggtc cgcgttatac 420 cgccgaggtg cgagacatcg tactgaatcc gccggaggag cgcgcgtacg aaatattaaa 480 agcagaactt attaaacgcc ttagctcatc gcaggagcat aagacgcgtc gcttactcga 540 gcacgaggag atcggcgacc gcaagccatc acaatttttg cgtcatcttc gtagcctcgc 600 cggcaacgtt gtcggcgata aagtattgcg aacgatttgg ctgagccgcc tacccgctca 660 tattcagccg catctagtga cgcgaacgga agattcgctg gatcaggtgg cggacatcgc 720 ggatacgatt atggaagcga cgcgtgcccc gccgttgcag gtcgcggaaa cgacgccgtc 780 ctcattttcg cgcagcgcag gacaggacga cgcgacatcc ttagaagcaa aaattaactt 840 gcaattagcg cagatgcgct tatcgatgca acaagagatg gcagagcagc tgacggccat 900 tcgaaaatct attgaggcga tcggcgagct cgagcggggc cgtagccacg agcgacggta 960 cacacgcccg cgttcgcgtt cgcgttctcg cccgcgagct catggccatg caaccaatgg 1020 cttatgctac tatcattggc gtttcggccc agacgccaga cgatgcgagg ctccatgctc 1080 ctcacaacag caaacgggaa acgcaacggc cagtcgttaa tggcggcaag cgattccggc 1140 cagttgacac gtcgcctctt cgtaacagac cacgatacga gagtcagttt cttaattgac 1200 acgggagctg atctttgtgt ctatccgcgg aggatggtgc gtggtccgcg acaaaaatcg 1260 aattacgagc tgtcggccgc aaacggaaca actattcata catatggcac agaatctttg 1320 gcactgaact tcggattgcg acgagttttc gcgtggcgtt tcgtcgtagc agatgtatcc 1380 aggccgatta tcggggccga cttcctttcg ttctacggac tgctggtaga tctgagaaat 1440 ggtcgactcg tggatggcat gacgagtctg acggtgcaag gacgatgttt acgatgcgag 1500 actccgagca ttaaaacggt aaccggagca actccgtatc acgagttgct gatgcaatat 1560 cctgacgtga cgaggccaga aggacggccg agagagacca agcatgatac acggcacttt 1620 atcaagacga cgccgggtcc gccggtggtc ggtcggccac gaagactcgc gccggagcgg 1680 ttggcagccg cgaaaagaga attccagagg atgatggaac tcggaattgt acgaccatct 1740 aaaagctgtt ggtcctcgcc cctgcatatg gtgccgaaga aaggagaaga atggagacct 1800 tgcggagatt acagggcgct caacgcacga acgattcccg attgctaccc ggtgcgtcat 1860 atccaggatt atgcacagaa cctacaggga aagaagatct ttgcaacgtt ggatctcgtt 1920 cgagcatatc atcagatccc ggtagcggaa gaagatatac caaagactgc gataacaaca 1980 ccgtttggta tgtatgaatt tccgtacatg tcgttcgggc tgcgcaatgc cgcgcagacc 2040 tttcagcgct tcatcgacga agttcttcag ggtctggaat tcagttttgc atatatcgac 2100 gatatactgg tcgcatcgtc aacggaggaa gagcataagg agcatctcaa gatactattt 2160 gagcgactgc agaagtacgg aatcgtaatt aacccggcta aatgcgtatt tggacagcca 2220 gaaatacagt ttctcggata cctggtgacg ggagaaggaa ctcgaccctt gcccgagcga 2280 gtgcaagcta tccgacaata ccaaatgcca aggacagcca aagatcttcg gcgatatcta 2340 ggcatgttaa acttttatag aaggtttctg ccgagagctg ccgaatcaca agcaccgctc 2400 aatgaccttc ttcaagacaa cgcaaaagga aaggaaccgg taccgtggac accacaggca 2460 caacgagcgt tcgaattaac aaaagaaagc ttggcacaag ccgctttgtt ggcacatcca 2520 aggatgagcg cggagctggc tttgtttaca gatgcatctg atcatagcgt cggagccgtg 2580 ctgcaacaac gaggtgacaa tggttgggag ccattagcat tcttttcgaa gaaattaagc 2640 ccagccgaat cgaaatatag cgcatttgac agagagttat tagcagtgta cctcgcgatt 2700 aaacattttc tgcatatgtt agaggcgcga gtttttacag tgtatacaga ccataaacca 2760 ttaaccttcg cgtttcgaca aaaacctgac aagtgctcac cacgtcaatt tagacattta 2820 gactttatag gccaatttac aacggacatt cgatatatat ccggagcgga taatattgtt 2880 gccgatacat tttcgagaat cgacgaagtg cggtcgccgt tggattatgc agcattagcg 2940 gcgtcgcaga aatctgacga ggagctaaaa gattttttga gccaagaatc gagtttatgg 3000 ttgaaagagg tcgagatccc tggagcagga gtgacagtgt ggtgcgatac agcaactgca 3060 acgccgagac cgtttttaac gaagatcttt cgccgagcgg cattcgagag catccacaac 3120 ctggcccatc caggagttaa agcaacggct aagctcatat ctcagagata tgtttggccg 3180 tcaatgaata ccgactgccg aaattgggct cgtgcatgcg taccgtgcca gcgatccaag 3240 attacgcgac atgtggtagc gcccgtcgga aagttttctg caccatccag tagatttgaa 3300 catatccaca tagatataat tttgatgccg agctccgagg gaaagagata ctgcctcacg 3360 tgcatcgacc gtttcacacg atggccagag gcattcccgc tggaagacca agaagcggag 3420 acggttgcta gagcttttta cgaaggatgg atttgccgat tcggaacacc tctccgagtt 3480 actactgatc agggacgtca atttgagtcc catctctttc ggcagttgag cgagttgacc 3540 ggaacagcac acctaagaac aacagcatac cacccgcaag cgaatggtat ggtggagaga 3600 tttcatcgtc agcttaaagc ggcggtgaag tgtcatgcga acagccgctg gacccaagtg 3660 ctgccgaccg ttctgctagg gatacgagcg gcctggagag aggatctgca agcgactgca 3720 gcggaacttg tttacggaga gacacttcgt cttccgggac agttcctgac tcaacggccg 3780 atggagaatt cagacgatgg cgctaatttc ataaaggaat tgcgccaccg ttttgacgat 3840 ctgcgcccga tcgatggaac ccgccatggg gaacgacgtc cgttcgtctt caaggacctg 3900 gggacaaccg accaggtgtt cgtccgacac gacggaccaa agacaatgtt gcaatcacca 3960 tatgatggcc cgttcgtggt cgtgagacgt gatgacaaga atttcatcat aagcatgcac 4020 gggaaaaatg taacggtttc aatagaccgt gttaaaccag catatctgct ttcagattcg 4080 ttaacagacg ctggcgaaac accaccagaa gaaacacaag gacacagcga agatagcact 4140 cggggcattc gtcgagagca agacgagaac gcgccgactc ccagggagca aggagaaacg 4200 gtgacgagag ccggacgaag ggttcgcttc ccggatcgtt tccaggcggg cctcagataa 4260 caataagatt tatccaaaca tgcatctatt ttatatgaca aaacgttaaa tatttataga 4320 tgtaaaaaga tcggaaacac gtattattat tataattaat cattaacatg catttacgca 4380 atttgcttat tttagttatc ttaattcatt attttctttc cttgcattta ttgtaattaa 4440 gcttagaata tttatcgtac tgctggtgca ttactggcag gggggtcatg tggcaatcaa 4500 ttgccacaat cctcatacaa aagaaaaaga aaaggaaaaa gaaacaaaaa agaaaatttt 4560 caccaaccga acata 4575 // ID Gypsy-143_AA-LTR repbase; DNA; INV; 1684 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-143_AA_; KW Gypsy-143_AA-I; Gypsy-143_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1684 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1020-1020 (2011). XX DR [2] (Consensus) XX SQ Sequence 1684 BP; 478 A; 295 C; 364 G; 546 T; 1 other; tgtaacgtca tttagaaccc tattcgtagt tttcgttctc attgataaaa tttgtttgta 60 tttgttatgt atactagagt tcgtcgtgaa tgtcgcaaac tgacagcata gcttcaagtt 120 tagtccgaac accaaccaaa caagacgcca cctatgttca ttgaaagaac taaagaaata 180 tttcggaaat tcaaatctat tgcgtcttag agaattgacc agaatttctc tcgtcgctcc 240 tactaaattg cgccgatggg ctaatatgaa aattcccaaa ttagccaatt aaaagaaaac 300 aaaagatagt acaaccattg tacgcattcg ggtggaaatt gcgttcttct aaattagctc 360 aacccttttt taagtctgga tcgcgattgt tcatttatta tagtggagtg cgtcaaagcg 420 ataagatcgt tattagtcgg agtgattttg aaagtgtttg agtttaactt aagtctaatt 480 gtgactttgt attttggttt aaactaaact cgaagtaagt ttcgggagtt ttagatttac 540 ttagtgtatt aatttcagtt tatgtgtctt ttatgttcat cgtttagaat tttgaatttg 600 tcgagtcgaa ttgtaatatt ttgaaaaagt gcctgagata ttgtgagatt tggtgaataa 660 gaattcaggt aagcctaact tgattattta gtgagcaaat taagtgagaa tccgcgttga 720 ataggccctc cttgcccaat caacgtgggc ggctaatctg ggctagtacm aatcggaagc 780 cctccctcgg atacggctgg gactacaatc ggccgccacc aagaggctgt aagttcccga 840 gaacggcacc ggtccctgaa attgctgtgc aacatcggct accaacgctc gtgtgacatt 900 tcgctcgcca ataagtcgtc cggcgtgagc accgttccta taaaaagctc ttcactccac 960 tacaagggga cgcctttacc ccttatgcga cgccgacgcg cgaatcacat gttggcttta 1020 ccggtcgatg gagtaagttt gacccacagc aattggtgag tacgtacgtg aagttttgca 1080 atcggtaaac cgcatgggac aaccgacgag ggattgtcgc cgagaagaaa aggtcagatc 1140 tgacaataga aaataagaat ttgctcaaag ataaaatcaa aattgaatta aatcgtgacg 1200 tcacagtagg gattaggctt gtttgaattt aaataaataa ttactcgtta aatttaaggc 1260 aatcgatttg aactcgataa atttgaattt ttatcaaata aagtaacttc ggattgaatt 1320 tttgatttaa ataaacttac ttctagatta gttagattac tccactagtt ttattacgtc 1380 acgaattggt atttggctag atttaagatt ggtgtttggg ttcttttagt gaataagcat 1440 ttcccccatt ggttaatttc tgaatttatt tttcggtttc ggtatttttt atgattttgg 1500 aattggttct ggttagtttt tggagacgtt tgggttgttt tagtgcgaca tcggttctca 1560 tttactaggt gacctgccct gagagggtag tttcatgccg ggtaattaga aagagctaac 1620 ggtccttctt attggaaggt ggcgcgtaag ccgtttgctt agcgtaacag aacggaacgt 1680 taca 1684 // ID Ginger2-1_LS repbase; DNA; INV; 6081 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger2 DNA transposon from Littorina saxatilis. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Ginger; KW Ginger2; integrase; Ginger2-1_LS. XX OS Littorina saxatilis OC Eukaryota; Metazoa; Mollusca; Gastropoda; Caenogastropoda; OC Hypsogastropoda; Littorinimorpha; Littorinoidea; Littorinidae; OC Littorina. XX RN [1] RP 1-6081 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 180-bp long. XX FH Key Location/Qualifiers FT CDS join(408..704,1131..1634,1638..1874,3288..3461, FT 3702..4070,4302..4607,5103..5867) FT /product="Ginger2-1_LS_1p" FT /translation="MERFRHTLQVEADTSAMCPKSSLMSSEKFNTVVNHRR FT EPHTKVEPHFRHWVKKRTFSLMNMPGLGLNDVLVVPNPKHQSQVRRTTYIM FT FGQRFLFSIDLNDGAAKYLRVVHAGNIFDVVTDIHAQELKHSGYKKVLEYA FT QRHYHDIFRTFVQLFCTNCPTCQLSQPQVSRPPLRPIVEKDFLERVQVDLI FT DMRHNPDGEFNHICHFMDHFSKYHVLFPLKPKSAVEVAGLIEERVLAYFGP FT PKLFHSDNGREFVNQIIRALFSSWGGVTFVNGRPRHSQSQGLVERGNRTIE FT QKLAAMKNDNGCGADQTYPWSSWLPRVMVSLNSEVQATTNESPYKVVFGKC FT APSPVSRHAMVRKRAFRNTQTAAEHMENFYNKRKRVKIADFQEGDTVSVGI FT PKLDRTSTDFDIFGLKMDDSDSDSGSSFHGFDGGDVRNLGNRIELANSDQS FT DISVSTVHTADLSDFSSAISDIPSSSSDEDVDSDAEWTDTIDDWSEEEFSE FT SVQGRTRGGFQGFPETPPQPKIFFLNGVFLGPGVGPTFPLPLTAKPIDYFQ FT QLFPDDLFAQIRDETIRYARQNGDANFDTTTEEMKAYMGILFLMGLIPMPN FT YRCYWSSRPEMRQRVIYDVMPRNRYEVWSCKSIHKHRSYDFFNKQPLFSLI FT LSENNRTGHKKASKSYRTSRVKRLPAVIAKAHGTTKKSYLLTTAFGQLKGR FT YQGGDLELYSGTITPNETEELSLREAARKHNPENLFLKAHCNCHTGCMTKR FT CPCRAKKISCSTQCHGSSTCKTINSAPMPTSEMPVLSNADMLALASSTAFL FT NDQHINAANILIKQSFPAAQGLQDTLLQQNCSFQNPTGTFVQIFHTPNHWV FT TVTSDSKNEIKIYDSLRQKPSNDIQLLIAKY" XX SQ Sequence 6081 BP; 1682 A; 1521 C; 1300 G; 1578 T; 0 other; tgttaggcgt gtttgatcga tctgtgcgat cgcgcgatca cgctgcgcgt ttggtggatc 60 gatcaaacgc gcacacatcg atcgatgtgt gcgcgtttga tcgatgcaaa gaatacaaag 120 aatacaagaa tcgttaaaac gcgcagctct tatacttgcg catttggacg atctgtcaca 180 aagtaagtcc ctaccaccac acacacagat cgcgcgccgg aagtgaaaag tttcaaatac 240 aggaatgttc cgaaatatac cccaacaacg tttttgatta caattcagct ccttttttgt 300 tgtttttgat tactgctgag ctctattcac acaggctcaa gttagtttta ttgcttcatt 360 ctctcttgat tgcttcattc tctcttcatt cgtttaactg atacatgatg gagaggttta 420 ggcacacact gcaagtggag gcagatacat cagccatgtg tcccaaatca agcttgatgt 480 caagcgaaaa gttcaatacg gttgtgaacc accgaagaga accccatact aaggtggaac 540 cccattttcg acattgggtg aagaagagga cattttcctt gatgaacatg cccggattgg 600 gattaaacga tgtgttagtt gtccccaatc caaaacatca aagccaggta cgaagaacta 660 catatattat gtttgggcaa agatttttgt tctcgattga cttgttttaa tttctagcga 720 tctttcaaat tttgaagatt atttgttgta cttgtttgtt gtcaggcctc tcaatctttc 780 aaacagtctc tctctctctc tctctctctc tctctctctc cctctctctc tctcttagtc 840 tctcagtctc tctctcttct ctctctcagt ctctctctct ctctgtctct gtctctctct 900 cttagtctct cagtgtctct ctctctctta gtctctcagt ctctctctct tctctctctc 960 tctcacacac tctctctctc tctctctctc tctctctctc tctctctctc tcttagtctc 1020 tcagtctctc cccccaatcc ctccttcctc atctctgtcc atccctcaga catctcatga 1080 aaacagctga gctaattcga agtcgaggtg tgtgtgtgtg tttttaacag aacgatggag 1140 ctgctaaata tctgcgggtt gtccatgccg gcaacatatt cgacgtggtc actgacatcc 1200 atgcccaaga gttaaaacac agcggctaca agaaagtact cgaatatgcc cagcgtcact 1260 atcatgacat tttccgtacc tttgttcagt tgttttgtac aaactgtcca acatgtcaac 1320 tgagtcagcc tcaggtgtcc agaccacccc tgcgtccaat agttgagaag gatttcttag 1380 aacgtgtcca agttgatctc attgacatga gacacaatcc tgacggagag ttcaatcaca 1440 tttgccactt tatggatcat ttcagcaagt accatgttct atttccctta aaacctaaat 1500 cagcagtcga ggtggctggg ctcatagaag aaagggtctt agcttatttt ggacccccca 1560 aactattcca ttcagacaat ggccgagaat ttgtaaatca aattattcga gcattgtttt 1620 catcatgggg tggataggtc acattcgtga acgggcgtcc tcgacactca cagtcacagg 1680 gtctagtaga acgagggaat cggactattg aacaaaagct tgcagcgatg aaaaatgaca 1740 atggctgtgg agctgaccaa acataccctt ggtcatcatg gttgccaagg gtcatggttt 1800 cgttgaacag tgaggtacaa gcaaccacaa acgagtcccc atacaaagtg gtattcggaa 1860 aatgcgcacc atcagccatt tttccaaggg cagaaccaat ctgtgatgag tcagatctgg 1920 aaaataatga acagcctgaa tctcattctg agccttgtca gcccgagcct cctcaatcgt 1980 tgccccaagt gatcccgagg ccctcaacga cttgtcagcc tgagtcgatg ccacaagaga 2040 ggcccttaac gccttgtcag cctgagtcga tgccacaaga gaggccctca acgacttgtc 2100 agcctgagtc gatgccacaa gagaggccct caacgacttg tcagccagag tcgatgccac 2160 aagagaggcc ctcaacgact tgtcagtctg agtcgatgcc gcaagagagg ccctcaacga 2220 cttgtcagtc tgagtcgatg ccgcaagaga ggccctcaac gacttgtcag tctgagtcga 2280 tgccacaaga gaggccctca acgacttgtc agcctgagtc gatgccacaa gagaggccct 2340 caacgacttg tcagtctgag tcgatgccgc aagagaggcc ctcaacgact tgtcagcctg 2400 agtcgatgcc acaagagagg ccctcaacga cttgtcagcc tgagtcgatg ccacaagaga 2460 ggccctcaac gacttgtcag ccagagtcga tgccacaaga gaggccctta acgacttgtc 2520 agccagagtc gataccacaa gagaggccct caacgccttg tcagccagag tcgatgccac 2580 aagagaggcc cttaacgact tgtcagcctg agtcgatacc acaagagagg ccctcaacga 2640 cttgtcagcc tgagtcgatg ccacaagaga ggccctcaac gacttgtcag ccagagtcga 2700 tgccacaaga gaggccctta acgccttgtc agcctgagtc gatgccacaa gagaggccct 2760 caacgccttg tcagccagag tcgatgccac aagagaggcc ctcaacgcct tgtcagccag 2820 agtcgatgcc acaagagagg ccctcaacga cttgtcagtc tgagtcgata ccacaagaga 2880 ggccctcaac gacttgtcag ccagagtcga tgccacaaga gaggccctca acgccttgtc 2940 agccagagtc gatgccacaa gagaggccct caacgacttg tcagtctgag tcgatgccac 3000 aagagaggcc ctcaacgact tgtcagtctg agtcgatacc acaagagagg ccctcaacga 3060 cttgtcagcc agagtcgatg ccacaagaga ggccctcaac gacttgtcag ccagagtcga 3120 tgccgcaaga gaggccctca acgacttgtc agccagagtc gatgccacaa gaggggccct 3180 caacgacttg tcagccagag tcgatgccac aagagaggcc cttaacgcct tgtcagtctg 3240 agccaacaca accctcaacg ccttgtcagt ccgagccgct tacaccacct gtatctcgcc 3300 atgccatggt ccgaaaacgt gcgttccgaa acacgcagac agcagcggaa catatggaga 3360 atttctacaa taaacggaaa cgtgtgaaaa tagcagattt tcaagagggg gatactgtat 3420 cagtagggat tccaaagcta gacagaacgt ccacagattt taactcttga cgtcctgtac 3480 cgcgcggttc cgcttgacgt caataatcct gaaccgcgcg gtaccgctcg catgcagacg 3540 atattggcca aagagacgta acacggaaat acagctctcc tgaaggaaac cacgaatgct 3600 gtacttccga caataacacg tgcggaagtg ggaatttttt cggagttttc tgtcaacgat 3660 ttctattttt ttctggacga ttttcgtggt gatttgtgtt agacattttt gggctcaaaa 3720 tggacgactc tgactcggat agtgggtctt cttttcatgg gtttgatggc ggtgacgttc 3780 gtaatttggg caatagaatt gagttagcta attccgatca gagtgacatt tctgtttcga 3840 ccgtgcacac tgcggatttg tcggactttt catctgcgat ttcggatatt ccatcaagca 3900 gcagcgatga agatgttgac agtgatgccg aatggaccga tacgatcgac gattggagtg 3960 aagaggagtt ttctgagagt gttcaggggc ggacgagggg ggggtttcag gggtttccgg 4020 aaaccccccc ccagccaaaa attttttttt taaatggtgt ttttttgggg tattaatcag 4080 tgacaaaatc tgctgcctga aactggtaat gatcatcctc aaaatgcacc agattgcacc 4140 attttgcatc cttttttaca aaattttccg ggggggcatg cccccggacc cccctagcaa 4200 gctaggcgct ttgcgccgtc ggctcggcgc ttcgcgcctt cacacccata tcttcacaat 4260 atacttttgg aaaccccccc cataaaatga actgatccgc ccctggtgtt ggtccaacat 4320 ttcctcttcc cctcactgca aaaccaattg actacttcca gcaactcttc cctgatgatt 4380 tgtttgctca aattcgtgac gaaaccattc gatatgcccg acagaacggc gatgctaatt 4440 ttgacaccac aacggaggag atgaaagcct acatgggcat tctgttcctc atgggtttga 4500 ttccgatgcc caactacagg tgctactgga gcagcaggcc agagatgaga caacgtgtta 4560 tttatgatgt gatgcctcgg aacaggtatg aggtatggag ttgtaaataa cttgtatata 4620 ataagcatca atcatctgtg tgaattattg ctacttacct gtgacttacc tgtgacttac 4680 ctgaacaacc tttgaagaag atggtgatta attagtgata gtgtaaattt catgttgaga 4740 gagaaagaca atgcaagaca atgcaagata tctcactaaa atggacctat gtttcaaaaa 4800 ctcaacaatg tgtattcgat ggattaacac ctgagattgt acctgtgaat gacatgtgac 4860 aagtgagtac aattgcatgt ttatgtttat gcttgtatgt cctttttgtt tgtcatgaat 4920 gccatgaatg aaacatcttg aattagttga ggttaaatat taactttttt tttaattgtc 4980 acatctgtaa ttaacataat taacattttt tagataataa ggtacatcta aaaaatcttt 5040 aaacacacac gtaacataag catcattcat ttttgaacat ctacaaaaaa aattgaatag 5100 aatccattca caaacatagg agctatgatt tttttaataa acaacccctg tttagcctaa 5160 ttctctcaga aaataacagg actgggcaca agaaagcttc taagtcttat aggacgtcaa 5220 gagttaaaag gcttcctgct gtcattgcta aagcgcatgg gacaacaaaa aagtcctact 5280 tgttaacaac ggcattcgga caattaaaag gccgatatca aggcggtgat cttgaattgt 5340 acagtgggac aattacgccg aacgaaactg aagaactgag tctccgagaa gcggcaagga 5400 agcataatcc ggaaaacctc tttctgaaag cccactgtaa ttgtcacact ggttgcatga 5460 ccaaaagatg cccctgcaga gcaaagaaaa tcagctgctc gactcaatgc catggctcaa 5520 gcacatgcaa gaccattaat agtgccccaa tgccaacatc agaaatgcca gttctttcaa 5580 acgcagacat gctcgcattg gcatcctcaa ctgccttcct caatgatcaa cacatcaatg 5640 cagctaacat tttgataaaa caatcatttc cagcagcaca gggactccag gacacgctgt 5700 tgcaacagaa ttgttcattt caaaacccaa caggaacttt tgtgcaaata ttccacactc 5760 caaatcattg ggttactgta accagcgaca gcaaaaatga aataaagatc tatgactcac 5820 ttcggcagaa acccagcaat gacattcagc ttttgatagc aaagtacatt atataaatgt 5880 gaagaaccat cattcaacat caacatcatg aatttgtgag acggatcgtc caaatgcgca 5940 agtatataag agctgcgcgt tttaacgatt cttgtattct ttgtatcgat caaacgcgca 6000 cacatcgatc gatgtgtgcg cgtttgatcg atccaccaaa cgcgcagcgc gatcgcacag 6060 atcgatcaaa cgcgcctaac a 6081 // ID Vingi-3_BF repbase; DNA; INV; 2600 BP. XX AC . XX DT 01-FEB-2010 (Rel. 15.02, Created) DT 17-AUG-2010 (Rel. 15.09, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Ingi-3_BF; KW Vingi-3_BF. XX NM Ingi-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2600 RA Kojima K. and Jurka J.; RT "Ingi non-LTR retrotransposons from invertebrates."; RL Repbase Reports 10(2), 151-151 (2010). XX RN [2] RP 1-2600 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX CC Originally classified as Ingi [1] and re-classified as Vingi [2]. CC ~8-bp TSDs. This consensus is 5'-truncated. The 3' termini are CC composed by (TGA)n microsatellite. The protein coding region CC likely includes frameshift mutations. XX FH Key Location/Qualifiers FT CDS 177..2198 FT /product="Vingi-3_BF_1p" FT /note="includes a part of endonuclease domain and a FT reverse transcriptase domain." FT /translation="CRITTYHRLLHQDFQKSVQDPLPKSQHSPISVLLTPV FT VKPITNRKPLPRLNYHRARWEDFTSKLEEGIEDIDPTSEGYETFQRLVWDV FT AKKTIPRGCRKQYIPGLTDSSKEIYDRYTKAYDTDPFAEETIELGENLLAS FT LSEARKERWRETIGGLDMTHNSKKAWKTIKKLNADQEPDQRATAVTPNRVA FT KQLLDNGKPHNKERGYVKRMKEEMRRALEESDEIFFPFSREELLACLKHLK FT TGKASGLDGISAEMIRHFGDKALDWLLQLFNKGASSTQLPKLWRRAKVIAL FT LKPNKDPTLPKSYRPISLLCILFKLYERLIMTRIKPTVEEQLCRDQCGFRE FT GRSCCGQVLNLIQFIEDGFETGTITGAVFVDLTAAYDTVNHRALLTKVARM FT IKSTQIVCIIESLLTNRHFFVEMDGKRSRWRARAQKNGLPQGSVLAPMLFN FT IYTNDQPTFNNIRRFIYADDLCLATQARSFKTIEKRLTDALQVLSGYYKSW FT FLNANPGKTQVCAFHLNNHAASRKLKITWEGKELENTPYPVYLGVTLDRTL FT SFKEHIAIAKLRRKVSTRNSLLGNLANSTWGADPSTLKQTALALCSSTAEY FT CAAVWERSAHASKVDVELNRACRTITGTLKATPLPALYKLSGICPPSIRRE FT AQTRVERDKQLRDPRHPLHGHQEVP" XX SQ Sequence 2600 BP; 774 A; 712 C; 612 G; 501 T; 1 other; atagtgcaga attactacat accaccggct actccacaat agtctagtgc agaattacta 60 cataccaccg gctactccac aatagtatag tgcagaatta ctacatacca ccggctactc 120 cacaatagtc tagtgcagaa ttactacata ccaccggcta ctccacaata gtatagtgca 180 gaattactac ataccaccgg ctactccacc aggactttca gaagtcagta caagatccct 240 tacccaagtc tcaacatagc cctatcagtg ttctcctcac tcctgttgtc aaacccatca 300 ccaacagaaa gccactaccg aggctcaact accacagagc caggtgggaa gatttcactt 360 caaagttgga agaagggatt gaagacattg acccaacatc tgaaggatac gaaacctttc 420 aaagattagt ctgggatgtt gccaagaaaa ccatccctcg tggatgtagg aaacaatata 480 tccctggcct aactgattcc agcaaagaaa tctacgacag gtacactaaa gcctacgaca 540 ccgacccatt tgcggaagaa accattgaac tcggagaaaa cctactagca tcactcagtg 600 aagcgaggaa ggaacgctgg cgagaaacta ttggtgggct ggacatgaca cacaacagca 660 agaaggcctg gaagaccatc aagaaactaa atgctgacca agaaccagac caacgtgcta 720 cagcagtgac tccgaaccga gtggcaaaac aactgctaga caacggaaaa cctcacaaca 780 aggaacgtgg ctacgttaaa aggatgaaag aagaaatgag acgcgccctt gaagagagcg 840 atgaaatctt cttccccttt tcccgtgagg agttgctagc ctgtctgaag catctaaaaa 900 ctggcaaagc atctggactt gatgggatca gtgccgagat gatcagacat tttggtgaca 960 aagccctaga ctggctacta cagttgttca acaaaggtgc tagcagtacc caactaccca 1020 agctgtggcg acgtgcaaag gttatagctc tgctaaaacc taacaaagac ccaaccttgc 1080 caaaaagcta cagaccaata tcactcctct gcatactctt caagttgtat gaacgactca 1140 taatgacccg tatcaagccc actgtcgagg agcaactctg ccgcgaccag tgtggattca 1200 gagaagggcg gtcgtgctgc ggacaagtac tgaacctcat tcagttcatc gaagacggct 1260 ttgagaccgg aacgataact ggtgctgtat tcgtcgatct cacagctgcc tatgacacgg 1320 tgaaccaccg tgctctcttg actaaagtgg cccgtatgat caagagtaca cagatcgtct 1380 gcatcataga gtcactcctg acgaatcgcc acttctttgt tgagatggac ggtaagcgca 1440 gtcgatggag agcacgtgca cagaagaatg gtttgccaca gggctcagtt ttagccccaa 1500 tgcttttcaa tatttatacc aatgatcagc ctactttcaa caacattcga aggttcatct 1560 atgcagatga cctctgtcta gcaacccagg caagaagttt caagaccatc gagaaacgcc 1620 tcacagatgc acttcaggta ctatctggat actacaagtc atggttcctg aacgccaacc 1680 cwggcaagac ccaggtctgc gccttccacc tgaacaacca cgctgcatcc aggaaactca 1740 agatcacatg ggaagggaaa gaactagaga acacacctta cccagtatac ctaggcgtga 1800 ccctcgacag gaccctgtcc ttcaaagagc acatcgcgat cgctaagctc aggaggaagg 1860 tatccacccg gaacagcctc ctcggtaacc ttgccaactc gacctggggt gccgacccta 1920 gcaccctgaa gcagactgcg ctggcgttgt gctcctcgac agccgagtac tgtgctgcag 1980 tgtgggagag gtcagcccat gcctccaagg ttgacgtaga gctcaacaga gcctgccgta 2040 ccatcactgg taccctgaaa gcaactcccc taccggctct gtacaagctt tccggcatct 2100 gtcccccaag catacgccga gaagcccaga caagagtgga gcgggacaag caactgcgcg 2160 atcccaggca ccccttacac gggcatcagg aggttccctg aaggctgaga tcacgccgca 2220 gtttcatgac ggtcccaggc ctgggggaag taccaccaga caccttcagg tctgagaggt 2280 ggaaagagag tgacacaaat aacaacgagg cactgccacc ccccagtgag gcactccccc 2340 ctggagcaga cctaccgggg agggagtggg caactctaaa cagggcgaga gcaaaggtgg 2400 gaaaaactgg tgacaacatg ctgaaatggg gcaaagccac aagctcagct tgcccgtgtg 2460 gagacgaacc acaaacaacg caacatctga tgcaacactg tgagctcggc cccagctgtt 2520 cagacagtga tctgagggag gccaacgacg cggcccgctg ttggatcggg acttggagcg 2580 gcaagatatg atgatgatga 2600 // ID BEL-605_AA-LTR repbase; DNA; INV; 508 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-605_AA_; KW Pao_Bel_Ele189; BEL-605_AA-I; BEL-605_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-508 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 508 BP; 181 A; 79 C; 93 G; 154 T; 1 other; tgtgacatgc ccctcgggca gtgcgcttca agcttcgagc ccctatcgga tacacggcgc 60 tcatggaaag tggtgatata taatatctta taatctattg ttttctctac ctattgtgtg 120 aactagagac tgaattgaga tcaaaacacg aacatgtgaa gtcataactt agtgaattta 180 aactactgct tgctaaatta ccagattatt gtacttttgt gtagaaattt gcaacggtaa 240 ckgtaaatat tagaagcttc aaaaatgaat taaacttatt attaacaatt attgtttaaa 300 tagcttgcaa aactagagct ggatatttag agtgtgcgaa gagaagatta tgctggaatg 360 aatactaaac gtgagttatt ttgaaaacta taaatctata aactaactaa ttcatgaaaa 420 tatacaggaa tttgaaattc tcctctgaaa agccaggaca ggagtatttt atgatctcgg 480 aaagaaattc aacgcagcat aagcaaca 508 // ID TDD_DD repbase; DNA; INV; 527 BP. XX AC K02644; XX DT 11-AUG-1999 (Rel. 4.07, Created) DT 11-AUG-1999 (Rel. 4.07, Last updated, Version 3) XX DE Slime mold (D.discoideum strain Ax-3L) transposons Tdd-3/Tdd-2 3' DE junction. XX KW DDTDD; TDD_DD; transposon. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-527 RA Poole J.S. and Firtel A.R.; RT "Genomic instability and mobile genetic elements in regions RT surrounding two discoidin I genes of Dictyostelium discoideum."; RL Mol. Cell. Biol 4(4), 671-680 (1984). XX DR GenBank; K02644; Positions 1 527. XX SQ Sequence 527 BP; 276 A; 81 C; 35 G; 134 T; 1 other; tctagaatac atcaaaaaag caaaagatct gaaaanctta tcagcaaaag accataataa 60 cttcaaagat cctaagatcc ttctcacttc aacgatcaag ctaagaaaaa caacagctaa 120 ttactattgt attccagaat catctctccc aaatattata tcttttgatc aattcatatg 180 attaaaataa cccttcagct taaatgataa gcctttaaat aaatattaaa taaaactcgt 240 attaacacag atggacatat atcaatcttg ttaatccaat attaaaaaaa aaaaaaaaaa 300 agaaaaacat atccatcatc aatctaacat caactacatt caatctatct actctacaca 360 ctaattcctt caatccttgg attaggaata agaatagatc tgaataaacg agaaagaaaa 420 aaaaaaaaaa aataaaaata aaaataaaaa atatttttat ataaacacat aattaaaaaa 480 aaaataataa aaataatata aaaaaaaaaa aaaaaaaaaa aaaaaaa 527 // ID Gypsy-1-I_DD repbase; DNA; INV; 4290 BP. XX AC AAFI02000250; XX DT 26-FEB-2009 (Rel. 14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the Gypsy LTR retrotransposon from DE Dictyostelium discoideum. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_DD. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-4290 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Dictyostelium discoideum."; RL Repbase Reports 9(2), 628-628 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 837..1502 FT /product="Gypsy-1-I_DD_1p" FT /translation="MIKRYDMKKFFKPEDELSKLTDQKFDKFMKYITAFGS FT LNAQINPPLTSERQIELFIRGVSDIEIKKSIDEGAPKLLDEAIMIARTKAN FT SKFKYDSLEFYNAGFTDKFIDKENRINNNVQDQQPQIQRQQQVIRPTRIPF FT PPSGKHPDYNVKQEQNNYANVKKHQYKNKPKGKPLKCYKCGKLGHFANQCK FT EEVNTITEQVSFTTNQQDLKRVSPRMYSQIH*" FT CDS join(1475..2443,2447..2947,3001..3465,3362..4288) FT /product="Gypsy-1-I_DD_2p" FT /translation="SQDVFTNTLTEPGAISGVEHQIKLTQKDASFQAYPVK FT FTNEQKDFLDEHIKELLKYKIIRESNSQVSSSVVLRPKPDGSMRMCINYTK FT LNNITVKDRYPIPDINEIWNQIKGSHVYSKLDMISGYYQIRIREGDKYLTA FT FSVPQGLFEFNVMPFGLCNAPAVFQRTIHKIFKEENRLTLQSFYDDILSHS FT KSVMDHTPHLEGIFKKMRDNKLLAKLSKCQFFQEEVKFLGHIIESNGVQID FT YDRLDPLMKLLDPKNVKELQRLIGTLNFFRKFVDNFASKIKPIYQLLRQDT FT IFEWNDAYKSICINIIERLKNNKIILVYPDTQLIQKPFKLETDASDIGVGA FT TLCQDHGIISFYSRTIRDTEKNYSASEKECLSVVCALEEFHYIIGTQETLV FT VTDNSAVSFLRNDLGKIRNKRFINWNVKLASYNITFKYRSGKENTFADALS FT SCPVEEVMSVISMGESVKLDSILDLKIAEEQRKDKHLVPIFKYLDIVVPMS FT MQKEVLEKFHDCSLVGGHLGYQKTYSKLASRYYWNNMGMDVKEYINNCDIC FT QKMKTSPFSKFSPELGTIIVEKPWDLVAVDFVGPMKVPSMSGHKYIIVFSD FT YVTKWVEAAATVDCTAETTAIHYFKFNYIKTRLSKKITIGLWYIILEIALP FT KQQQYITSNLIISRHGCPKRLLSDCGTSFLNKVISSVNELFKVKKVNTSPY FT HPQTDGLVERFNKTIVIMLKSFTQEVHTMWDLYLDSCLFAYRISVHTSTGA FT SPFSMLYGREATIPSDLGTLNPFNNSATTNDNYVGKLEETINKSLEIAKTK FT IQESQTKQKQYYDGLKTRKNSISPGDYVLLKNQTVKEEELKKFRPSWIGPF FT KVVRILSNQTIQIQVPLGSNMSTTQSLRNCKRYFNKINKLNQLPSFINQPS FT QVSMNELSFSRLVNPSIVVSSPPPSTSTPQHGQAVTQLPVVLPKSNRAGV" XX SQ Sequence 4290 BP; 1868 A; 629 C; 632 G; 1161 T; 0 other; tttggcgaca tcggtcaaca aataataata atagtaataa taaaaacaac aatcacatta 60 ggagaggttt ataaataaga aaaaaaaaat aatagtttcg gttggggaaa aaaaaaaaaa 120 aaaaaaaaaa aaaaaaaaaa agaggaaaaa aaaaaaatat aaaaatatat tttttaaaaa 180 attaaaaatc cttaaaatta attaaaaaga aacgtttcaa aattgaaaaa acaatttttc 240 aattaaatta attaaaattt ttaaatattg gtcccaaaaa aggaaatttt caaataaatt 300 aaataaatta aataaattaa ataaattaaa taaattaaat aaatcaaatt aaaattaaat 360 aataataatt taaataaatt ccatttaatt tcaaataata caagtttcaa aattcaatta 420 aaattaattt aaatttcaaa cattacaaaa taaaaataaa atcaaatcaa ggtaaatcaa 480 atcaaagtaa accaaatcaa gatatatcaa attaaaataa attaatcaaa tcaaattaat 540 tcaagtaaat caagttcaaa acacttttca agaaaaaaaa aaaaaaaaaa aaaacctctc 600 aaaaagaaaa aaaaaaaaaa aaaaaaaagg caaaaaataa aatttatttt gctgataatg 660 acttagagtc attaaatatt tcaacactat ttgaatatat cgaaatcact ggtgaaaata 720 atggagaata taatgaagtc aaaagaaaaa taattaaata ttttttgggt tcagcattaa 780 ctacttttat tcaaaaatca aaaatagtag aaaattataa tgatttaaaa aaagaaatga 840 ttaaaagata tgatatgaag aaatttttta aaccagaaga tgaattatca aaactcaccg 900 atcaaaaatt tgataaattc atgaagtaca tcacagcatt tggatcactc aatgctcaaa 960 tcaatcctcc attaaccagt gaaagacaaa ttgaattatt tattagaggg gtcagtgata 1020 tagaaatcaa aaagtcaatt gacgaaggtg caccaaaatt attagatgag gctataatga 1080 tagctcgtac aaaagcaaat tcaaaattca agtatgatag cttagaattt tacaatgctg 1140 gatttacaga caaatttatt gataaagaaa atagaatcaa taataacgta caagaccaac 1200 aaccccaaat tcaacgacaa caacaagtta taagaccaac caggatacca tttccaccat 1260 caggtaaaca ccctgattat aatgttaaac aagaacagaa caattatgca aacgttaaaa 1320 aacaccaata taaaaacaaa ccaaaaggga aaccacttaa atgttataaa tgtggcaagc 1380 ttggtcactt tgcaaatcag tgcaaagaag aagtcaatac cattactgaa caagtatctt 1440 ttacaacaaa ccagcaagat ttaaagagag ttagtcccag gatgtattca caaatacatt 1500 gactgaacca ggtgcgatta gtggagttga acatcaaatc aagttaactc aaaaggatgc 1560 atcatttcaa gcatatccag tgaagttcac aaatgaacaa aaagacttct tagatgaaca 1620 tattaaagaa ttattaaagt acaagattat aagagaatca aatagtcaag tatcatcaag 1680 tgtagtatta cgtccaaaac cagacgggag tatgagaatg tgtattaatt atacaaaatt 1740 aaataatatc actgtaaagg atcgatatcc aataccagat attaatgaaa tatggaatca 1800 aattaaagga tctcatgtat actcaaaatt agacatgata agtggatact atcaaatacg 1860 aattagagaa ggtgataaat acttaacagc cttttcagta ccacaaggtt tattcgaatt 1920 caatgtaatg ccatttggat tgtgtaatgc accagcagtc ttccaaagga ccattcacaa 1980 gatattcaaa gaggagaata gacttacctt acaatcattt tatgatgata tattatctca 2040 ctcaaaatca gtaatggatc atacaccaca tctagaaggt atcttcaaga aaatgagaga 2100 taataaactg ttagccaaat tgagcaaatg tcaatttttc caagaagaag taaaattctt 2160 aggtcatatc atagaatcga atggtgttca aatagattat gatagactag atccattaat 2220 gaagttatta gatccaaaga atgtaaaaga attacaaaga ttgatcggca cattaaattt 2280 cttcaggaaa tttgtagata atttcgcatc aaaaatcaaa ccaatatacc aattattaag 2340 acaagacacc atattcgaat ggaatgacgc ttataagtca atctgtatca atataatcga 2400 aaggttgaag aacaataaga tcattttagt gtacccagat acataacaat tgattcagaa 2460 accattcaaa ctcgaaactg atgcaagtga tatcggtgtt ggagcaactc tttgtcaaga 2520 tcacggtatt atatcattct attcaagaac catcagagac actgaaaaga attactcagc 2580 ctctgaaaaa gaatgtttaa gcgtagtatg tgcattagaa gaatttcatt atatcattgg 2640 cacccaagaa acattagtag tcactgataa cagtgcagta tcattcctta gaaatgatct 2700 tgggaagatc agaaacaaga gatttatcaa ttggaatgtt aagttggcta gttacaatat 2760 cacatttaaa tatagaagtg gtaaggagaa tacttttgca gacgcactta gtagttgtcc 2820 agtagaagaa gtaatgtcag ttatatcaat gggtgaatca gtcaaattag attcaatctt 2880 agacttaaag attgcagagg aacaaagaaa agataaacac ttagtcccaa tctttaaata 2940 cttagattaa tagtaaatcc aacattattc tttaagagac ctcaaatagt agccatttga 3000 atagttgtac caatgtcaat gcaaaaagaa gtattagaga agttccacga ctgttcatta 3060 gttggcggac atcttggata tcaaaagacc tactcaaagt tagcatcaag atattactgg 3120 aataatatgg gtatggatgt taaagaatac atcaacaatt gtgatatatg tcaaaagatg 3180 aaaaccagtc cattcagtaa gttctcacca gagttgggaa ccattatagt agagaaacca 3240 tgggacttgg ttgcagtaga tttcgttgga ccaatgaaag taccatcaat gtcaggtcat 3300 aagtacataa ttgtgttttc agattatgtt acgaagtggg ttgaagcagc agcaaccgta 3360 gattgcactg ccgaaacaac agcaatacat tacttcaaat ttaattatat caagacacgg 3420 ttgtccaaaa agattactat cggattgtgg tacatcattc ttgaataaag ttataagtag 3480 tgtaaatgaa ttattcaaag taaagaaagt aaatacaagc ccataccacc cacagactga 3540 tggattagtt gaaaggttca ataagaccat cgtaattatg ttaaaatcat tcacacaaga 3600 agttcacact atgtgggatt tgtatcttga ttcatgttta ttcgcatatc gaatctcagt 3660 ccatacatca acaggtgcaa gcccattctc aatgctttat ggaagagaag caactatacc 3720 aagtgatctt ggtacattaa atccattcaa taattcagca acaaccaatg ataattatgt 3780 tggaaagttg gaagaaacaa tcaacaaaag tttagagatt gcaaaaacaa aaattcaaga 3840 atctcaaaca aagcagaagc aatattatga tggattaaaa acaagaaaga attcaatctc 3900 accgggagat tatgtactct tgaaaaatca aaccgtaaaa gaagaagaat taaagaagtt 3960 cagaccaagt tggattggac cgtttaaagt tgttaggata ctttcaaacc aaacaataca 4020 aatacaagta ccattgggtt caaacatgtc aacaacacag tcattaagaa attgcaagag 4080 atatttcaac aaaataaaca aattgaacca attaccttca ttcataaatc aaccatccca 4140 agtatcaatg aatgaattat cattcagcag attagttaat ccaagtattg tagtatcgag 4200 tccaccacca tcaactagca caccacaaca tggtcaagct gtaacacaac tccctgtagt 4260 actcccaaaa tcaaatagag cgggggtgag 4290 // ID Gypsy-5_AA-I repbase; DNA; INV; 4914 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_AA_; KW Gypsy-5_AA-LTR; Gypsy-5_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4914 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 979-979 (2011). XX DR [2] (Consensus) XX CC Positions [3905-4366] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 664..2043 FT /product="Gypsy-5_AA-I_1p" FT /translation="MEEQLFKLPPFECDSVPITELRQKWFDYKKQFEYIAA FT VMSKKKKRKLKSIFLAVSGRQLQRVYESLPEVNQECEDDEDEFDSMIRRLD FT HYFAPKQHDTFERYSFWTQKPDAGETLDKFLLRAKVLANKCRFGSSEKESR FT DAAVIDKIVMLAPPELRRKILEKPKINLDDLTTLVNTHLSVQHQVRELGQR FT TSGAGNLLEASRGNTFVNKITTESENRSRWAKSHPTADCSRCGLRLHKPGE FT KCPAKDVQCHHCNRVGHFAKKCYFGSNKGTWKRKPSAENPNPRGHKTQKVN FT AINNDETSTTRDMKPEAQVQDSFIYAISDNHDEMVWCKVGNILVEMMIDSG FT SKYNIIDQQTWVYLQSRNAAVDNVKPSTKRLSAYAQKDSLDIICTFDANIA FT IVEGNDLNFAAAFYVIKDGKQNLLGRDTAKRLGVLLIGLPSVNNSEFIQHV FT SENTVDKFPVIKGSY" FT CDS 2543..4894 FT /product="Gypsy-5_AA-I_2p" FT /translation="MFRYTRLMFGICSASEHFQRIIEQILSNCPNSFNYQD FT DIFVHGKTEAEHDAALESVLRTLEAHNVVLNTKKCKFKVTETEFLGHDISQ FT QGVRPTEDKITAVQQFRSPTSAEEVRSFLGLVCYVGRFIPDLATKTFDLRQ FT LTVSGQKFDWTTKHEIAFNNLKRAVCSAPTLGFFDNNRRTRVIADASPVGL FT GAVLIQFEDEYDDKPIVISYASKSLSSTERRYCQTEKEALAIVWCVEKFKL FT YLLGRVFELETDHRPLTAIFKPTSQPPGRIERWVLRLQPFKFRIVYRPGKQ FT NIADSLSRLSLTTINEDVDKCDDQLYISAITDSVAVDVSEIKNAIITDPEL FT LLVKDALLTSDWTDGAIRDGAKKYIPFQNDLALLEGFVIRGCRIVIPQSLR FT ARMLQLAHEGHPGETLMISRLRDRVWWPGMDEDARKTVRNCEGCRLVSRPS FT APEPMRRREMPKEPWVDVAMDFLGPLPSSEYLLVIVDYYSRYKEVCTMKKI FT TSEETIKRIEPIFVRLGYPRTITLDNGRQFISTEFEEYCFSRNITLNHTAP FT YWPQANGEVERQNSSLLKRLKISHSTNRDWKNDLLEYLMMYNTTAHSVTGK FT APSVLLQNRLIRSKIPSISDIETAPPVNSEVHDRDRILKHRGKEREDVRRH FT AKPSDIQKGDRVLLQNLISTGKLTTTFGKTQYEVMERKGNRVKILDPVSGS FT VLERNIAHLKKIYNPQAISNEETSTSVMGDTINNPMNDQDPPPSSTVDDTN FT DNPMYNQDQAPPSRPPRIMCRPSWQRDYFVNQL" XX SQ Sequence 4914 BP; 1538 A; 1044 C; 1100 G; 1232 T; 0 other; tattggcgac gagtagaacc tgcggtgatt caagcaaaca aaatcagaag gtaaaccaaa 60 tcagtttttt tacaaatgtt gggagattgt tccagaaaga aaagtcaagc taaacaagcg 120 gttgcccagc aacagattgc gatacgcggg taaaagaaaa aacaagttcc ctctacatgg 180 gaactgcctc aggaacggat ttcatccgtt gagttaggcg gaatgaaaaa aatcaagttt 240 gcttcaacag tgaactgcct cagtgtcagg cggtcaagtt cgttttatca gcgaactgcc 300 tcagtgtcag gcgaaagaaa gaccaagttc acctaagaag ggaactgcct caggagcggt 360 tttgatccgt tgagtcaggc gcaagttcgt tttaacagcg aactgcctca gtgttaggcg 420 gcaagttcac cttaaaagtg aactgcctca gaagcggttt tgatccgttg agtcaggcgg 480 aaaaatggct agatcattac ctgccaatac aacggattgg tccgttgtgt caggcggtcg 540 gcaataggtt gctaatggca accccgtgtt aactagacta aaatattcaa aacaattctc 600 caatatttgt tcttacacca tgcacatggt actattgaaa ttaatttcaa tcatttctca 660 gacatggaag aacaactgtt caaactaccg ccttttgaat gcgatagcgt tccaataacg 720 gaactcaggc aaaaatggtt tgactataaa aagcagttcg agtatatcgc ggccgtcatg 780 agcaagaaaa agaagagaaa attgaagagc attttccttg ctgtttctgg tcgtcaactc 840 caacgggttt acgaaagctt gcctgaggta aaccaagagt gcgaggatga cgaggacgaa 900 tttgatagta tgattcgtcg actcgaccat tactttgctc ccaagcagca tgacacattc 960 gagagatatt cattttggac tcaaaaacca gatgccggag aaacattgga caagttcttg 1020 ttacgggcca aagtcctggc caacaaatgt cgatttggat catcggaaaa agagagcaga 1080 gatgcagcag ttatagacaa gatcgtcatg cttgcgcctc ccgaactccg acggaaaatt 1140 ttagaaaagc ccaaaatcaa cttggatgac ctaactaccc tcgtcaatac ccatctatcc 1200 gttcagcatc aggttcgcga gcttggccag cgtacctccg gtgctggaaa tctgttggaa 1260 gcaagtcgtg gtaacacgtt tgtcaacaaa attaccactg agagtgaaaa cagatcacgt 1320 tgggcgaaaa gtcatcctac agctgattgc agtcggtgtg gattgaggct gcacaagcca 1380 ggagaaaagt gtccagccaa agacgttcaa tgccaccact gcaaccgtgt tggacatttt 1440 gccaagaagt gttattttgg ttccaacaaa ggtacttgga aacgcaaacc atctgctgaa 1500 aatccaaacc ctcgcggtca caaaacacaa aaagttaatg ctattaacaa tgatgaaacc 1560 tctacaacac gtgatatgaa gccggaagct caagtgcagg actccttcat atacgcaatc 1620 agcgataacc acgatgaaat ggtttggtgc aaggtgggca atatactcgt tgaaatgatg 1680 attgactcgg gcagcaaata caacatcatt gatcaacaga catgggtcta cttgcaaagc 1740 aggaatgctg cagtcgacaa tgtgaaaccg tccacaaaac gtctatcagc atatgcacaa 1800 aaggattcat tggacatcat ttgcacgttc gatgcaaaca tagcgattgt ggagggcaat 1860 gatctgaatt ttgctgccgc cttctacgta ataaaagatg gtaagcagaa tttgcttggt 1920 agagacactg caaaacgatt gggagtgctg ctgattggtt tgcctagtgt caacaactcg 1980 gagttcatcc aacatgtttc tgagaatacc gttgataaat ttcctgtaat taaaggtagc 2040 tattaatatg tcatcaataa agataatatc ctaatcattc ctgtgcccag gtgttaaaat 2100 tcgaatcgat atcgatgaaa ccgtgacgcc cgttgctcag catgtccgtc gtgtacccat 2160 tgctctccgt cgacaagttg aagatcaaat caaccgactt ctaagaatgg gtattattga 2220 aagagttaac ggtcccagcc cgtgggtttc ccccgtagta attgttatca aggataacgg 2280 tgatgttcga ctgtgcattg atatgcgtcg agctaatacc gcaatcagaa gagagtatca 2340 catgattcct acgcttgatg atcttctcgc aaggtattgt attagacaac ttaattgttc 2400 aaattaaatt atttagaaaa cttctgtgat caggttaaac ggttgcaagt ggttctctcg 2460 tcttgacatt aaggatgcgt accaccaagt ggagttgcac gagtcaagtc gacacataac 2520 gacatttatt actcatcttg gtatgtttcg atacaccagg ttgatgttcg gaatctgcag 2580 cgcgtcagaa catttccagc gtattatcga gcaaattctg agcaattgtc caaactcttt 2640 caactatcaa gatgacattt ttgtacatgg gaagacggag gctgaacatg acgctgcttt 2700 agaaagtgtt ctacgcacat tagaagcaca caatgtcgtg ttgaacacta agaaatgtaa 2760 attcaaagtt acggaaacag aatttttggg ccatgacatt tctcagcaag gcgttagacc 2820 aacggaagat aagataacag cagtgcagca gttcagatcg cctacatctg ccgaagaagt 2880 tcgaagtttt ttgggactcg tttgctatgt aggaagattc attcccgatt tagcaactaa 2940 aacattcgac ctacggcagt tgacagtcag tggacaaaaa ttcgactgga cgacgaagca 3000 cgagatagcg ttcaataatt tgaaacgggc tgtatgctca gcgccaacgc tcggtttttt 3060 cgataacaac cgtcgaacta gagtcattgc ggatgcatcg cccgttggtt taggagctgt 3120 tttgatacag ttcgaagacg aatatgatga taaacccatc gttatatcgt atgccagcaa 3180 gagcctttca tcaacagagc gtcgatattg ccaaacagaa aaagaagcgc tggcaatagt 3240 ttggtgcgtt gaaaaattca aattgtacct acttggcaga gtcttcgaac tcgaaacaga 3300 tcaccgccca ttaactgcaa tcttcaagcc aacatcacag cctcccggac gcattgaaag 3360 gtgggtgctt aggctgcaac cgttcaagtt taggattgta tataggcctg gtaaacaaaa 3420 tatagcagat tcattatctc ggctatctct aacgacaatt aacgaagacg tagacaaatg 3480 tgatgatcaa ctctacatta gcgccattac agattcagtg gctgtagacg tttcggaaat 3540 taagaatgca atcattactg atccagaact acttctagtc aaggatgctc tgttgacttc 3600 agattggact gatggtgcga ttagggatgg tgctaagaaa tacatacctt ttcagaacga 3660 tcttgccctc ctcgaaggtt tcgtaattcg cgggtgcaga atcgtcattc ctcaaagtct 3720 acgcgcaaga atgctgcaac ttgctcacga aggtcatcct ggtgagacat taatgatatc 3780 acgacttcgt gacagggttt ggtggccggg tatggatgaa gatgcaagga aaacggtcag 3840 aaactgcgaa ggttgtcgat tggttagcag accatcagca cctgaaccaa tgcgccgccg 3900 cgaaatgcca aaagaaccct gggtagacgt tgccatggat tttctggggc ctctaccatc 3960 gagcgaatac cttctagtga ttgtggacta ctatagtaga tacaaggaag tgtgcaccat 4020 gaagaagatt acgtcagagg aaactattaa gagaattgag cccatcttcg tccgtttagg 4080 ttatccaaga acaattactc tggataatgg tcgtcagttt ataagtactg agttcgaaga 4140 atactgtttt tcaagaaata taactctgaa tcacacagcg ccatactggc cgcaggcgaa 4200 cggtgaagtt gagaggcaga acagttctct tttgaagcgg ctaaaaataa gtcattcaac 4260 gaatcgcgat tggaaaaatg atttgcttga gtacctcatg atgtacaata caactgcaca 4320 ttccgttaca ggtaaagctc catctgttct gttgcagaat cgtctaatcc gttccaaaat 4380 accatcaata agcgatatcg agacggcgcc tcctgttaac tctgaagttc acgataggga 4440 tagaattctg aaacatagag gcaaagaaag agaagatgta aggcgccatg caaaaccctc 4500 agatattcaa aaaggcgatc gcgttttgtt acaaaacctt atatcaaccg gaaagcttac 4560 tacaacgttc ggcaagaccc aatacgaagt catggaaaga aaaggaaacc gcgttaaaat 4620 tcttgatcct gtttcaggga gcgtcctgga aaggaatatc gcccatctca agaagatata 4680 caatccacaa gcaatatcga atgaagaaac atcaacatcc gtaatgggcg atacaatcaa 4740 caacccgatg aacgatcagg atccaccacc atcatccacc gttgatgaca ctaacgacaa 4800 tccgatgtac aatcaagatc aagcaccacc aagtcgccct cctcgtataa tgtgtcgacc 4860 atcctggcag cgagattact ttgtaaatca actttgatct ggaagaaagg gaga 4914 // ID EnSpm-1_AP repbase; DNA; INV; 2614 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 15.12, Last updated, Version 2) XX DE EnSpm-type family from Acyrthosiphon pisum- consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-1_AP. XX NM EnSpm-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2614 RA Bao W. and Jurka J.; RT "EnSpm-type families from Acyrthosiphon pisum."; RL Repbase Reports 9(2), 465-465 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This element is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS 131..2011 FT /product="EnSpm-1_AP_1p" FT /translation="NLSDSSFGYSDRSPDGSLPDHESSQXLLNLGNDDXNN FT ILAEPSFREKLQNWAVKHRSNLTVEMIEDLLGILRAENIPDLPKSATTLLQ FT TKSNTNIKSMNSLKNTTGFYMYLGIEQGLKXIITDEYSENXIRLLFNIDGL FT PLFNGSNQQFWPILGLILHNDYXSQPFIVAVYSGDSKPQNIDXYLEDYVKE FT AKXLIQNGVTIGQRTFRVEIVGFSCDTPARSFIKKCKGHGGFXACERCETR FT GKTINKKRVYPSMTSXLRTKSSFIMKSQNEHHLGAGRSPLLDIPDFDPVKS FT IFLDSMHLLYLGVMKWIMQQLLGTKKVNRKCKLPVQDVNYLNLKLKIFVKY FT XIPKEFQRKKFDLEDFSHWKATQFRFFLHYCGSLVLHNILPKXMYKHFLLL FT VVACRILCDPELCIDNVGYARQLLRKFFELLPSFYGSDSQVMNXHNLIHLA FT DDVEXTKMHLSAISAFPFENYLGKIKRLIGGGSNPLAQLARRXSEQKACPE FT MVKKSAIKKKKFLIINSDMHEDEVNLEDIIFRGVTLSIKKPDNIVKMDSGH FT IXQITRIRKXQHSXFLHGYIFKXVTDVFXYPCXSXKXGIMKLGRLSETEKK FT FSXDNXLKKCVFFENXXXSXAXTXLHDS*" XX SQ Sequence 2614 BP; 965 A; 314 C; 387 G; 861 T; 87 other; tratttacat atggaaattg atgtctctag caatttacca acttatactc agtctataga 60 tattagtgaa catcctgata ttcatgaaca tgataattct agttcaaatt attttgacaa 120 tarttcttaa aatctatcag attcatcttt tgggtattca gacagatcac cagatggtag 180 tttaccagat catgaatctt ctcaaraatt actaaatctt ggtaatgatg atcasaataa 240 cattttrgca gaacccagtt ttcgtgaaaa attacaaaac tgggctgtaa aacatagaag 300 caatctgack gttgaaatga tagaagactt attaggaata ttaagggcag aaaatattcc 360 tgatttacca aaatctgcaa ctaccttatt acaaactaag tcaaatacaa atataaaatc 420 aatgaatagc ttaaaaaata ctactggatt ttatatgtat cttggtatag aacaaggttt 480 gaaagasata attactgatg aatattctga aaatwktatt cgcttgttat ttaatattga 540 tggtttacca ttgtttaatg gttctaacca acaattttgg cctatwttag gacttatttt 600 gcataatgat tatgawtcac aaccatttat tgtkgctgta tatagtggtg attctaaacc 660 acaaaatata gatrattatt tagaagatta tgtcaaggaa gcaaaawktt taattcaaaa 720 tggygtaact attggtcaaa gaacatttag agtagaaatt gttgggtttt cytgtgacac 780 gccagccaga tcattyatwa araaatgtaa agggcatggt gggttytwtg catgtgagcg 840 ctgygaaacc agaggcaaga caataaataa aaaaagagta tatcctagta tgacgtccam 900 gctacgtaca aaaagtagtt ttataatgaa aagtcaaaat gagcatcatt taggagcagg 960 tagatcacca ttattagata taccagactt tgatcctgtt aaatcaattt tcttagattc 1020 tatgcattta ttatacctag gagttatgaa atggataatg carcaattac ttggtactaa 1080 aaaagttaat cgwaaatgta agctacctgt acaggatgtt aattatttaa atttaaaatt 1140 aaagatattt gtgaaataym atataccaaa agagtttcar agaaaaaaat ttgatcttga 1200 rgatttttca cattggaaag ccacacaatt tcgttttttt ttacattact gtgggtcttt 1260 ggtgctacat aatattctac ctaaaraaat gtataaacat tttttacttt tagtkgtggc 1320 ttgtcgaata ctrtgtgatc ctgaattatg tattgataat gtkggttatg cwagacaatt 1380 attgaggaaa ttctttgagc tacttccatc attttatggt tcrgattcac aagtaatgaa 1440 takacacaac ttaatccact tagctgatga tgttgagyat accaagatgc atctgtctgc 1500 aatatctgcg tttccatttg aaaactatct tggtaaaatt aaacggttaa ttggaggagg 1560 aagtaatcct ctggcacagt tagccagaag awtatcagaa caraaagcat gtccagaaat 1620 ggtaaaaaaa agtgctataa aaaaaaaaaa gtttctcata attaactctg acatgcatga 1680 agacgaagtt aatttagaag acattatttt tcgyggagtt acattaagta ttaaaaaacc 1740 tgataatatt gtaaaaatgg attctggcca tattwttcaa attacgagaa tcagaaaara 1800 gcaacatagt rtktttttac atggatatat attcaagrrt gttactgatg tttttmaata 1860 tccwtgyama tcawcaaaar ttggaataat gaaaytgggw mgattrtcag aaacagaaaa 1920 aaaattttca wtagataatr ttttaaaaaa atgtgttttt tttgagaatg rcwgtaakag 1980 ttwtgcartt acwtwwctwc atgattcata aaataaaaaa taattttaac tataawtata 2040 acatattatt ratattttaa tttttaaaca atttttagta taattaaata attattatat 2100 aatatatgat atttaatttt aattcaacat atattttata trtgaytgtt ttttttatat 2160 aggtaaccaa satggacttr craaaaaata ctacncatgt tgttatcaaa ttycangaaa 2220 aaaaaaaaca cggaatgcaa agyattgatg ttataccaat ttcatggrtg tattgtaaaa 2280 aaggaaaatt gtactgtaaa tatccatmtg aaaawgaata caatagatta gatgaaatga 2340 gtaaracatc agtactacct gaagcccttt ggagaargtt tgaaataact ttaatyaaag 2400 aagctagtac gtaccatata catcattaaa ataacttwat aytttcaatt taatatgtgt 2460 taattaagta tcaattttta taaagttttt tttttacatt aaagaaaatt atgatcaagg 2520 attaagacgt atgaataggg catgctctga tactataatt attagcagta atttagaaga 2580 gcaaaatagt ccgaagaaga aataggtcct cttc 2614 // ID RTEX-5_BF repbase; DNA; INV; 6449 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-5_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-13_BF; KW RTEX-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6449 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6449 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1721-1721 (2009). XX DR [2] (Consensus) XX CC The complete RTEX-5_BF consensus sequence contains two ORFs. The CC RTEX-5_BF ORF1 protein contains the esterase domain (467-609 aa). XX FH Key Location/Qualifiers FT CDS 150..2384 FT /product="RTEX-5_BF_1p" FT /note="Esterase domain." FT /translation="MAGIRSTRASTRAKSSTKWWNVSASKIPADETIHTDY FT NDEAKTSPCNTKFVTTTDRVPVWVKVCKLRYNEEYGKQSGQKVEWRDGDQA FT DVLIVTETEQKDGKVQESCVLSIHFWRNGTICIQGSAFLEWTDKIFPSLKK FT KVEETSNGNCQNGGPSQSDADGNKDHDTGSNTFTTPKGKTFLSSETVDTSS FT ETIDHSDETPVKVPVGTPRKTNRFRKAVENFISPFRTPTSTESPLNSSKNV FT LDFENTPTPSDTVEKSTSHDHITRDIQKHYTDLVDLVNTLQVEVKALKDEQ FT GTQKNEFEKKLKAVSGELKKQLEDEQKRHANERKSVNSRHNEEMATLTAKC FT ESLQSLVEKQQQQIAKLQEVTDSIQLSQSISKMRTYAQTVALNPGAHQVSA FT KNCISKDNGTNNATDDKTMDPDTAHANHTTTQPSTEEADSDVELTQQVSSQ FT EGTQVPTEESKQVRVYTDSIWKAVNVSRMFPNLSTHKDKTSTISDATKKLE FT TIHDPGTSYAILHIGSNDLDNSKHDDSSVQGCLQKTEELISIAKASFPNAT FT IVLSQVLPRGNDLQSDLNKNIKDYNQSVLQRYKDEEKLLYVRHKKLSTSRH FT LYRRDGIHLDEVTGTSLLVADVKRTIRSVENEYNPGYSRENWRQPPHSGQD FT RPQQNRGQDRTHRNRGQDRAQENRGQDRSQQNRGQDRPQQNRGQDRPQQNR FT SRDGSWRADTPGERSRPNWYRSQVNTPHEDIINLAYKLKELLNNF" FT CDS 2593..6297 FT /product="RTEX-2_BF_2p" FT /note="AP endonuclease and RT domains." FT /translation="MAHDVALRLTSWNIQGSLSKKCVEVDFLDSICKYDVV FT CIQETWLKPEDNFHIDGYHYFRSDRKLRKQSRRNSGGVAILFKRNLAKGIS FT KLHSNNSDIIWCKLDKNFFGFDRDVILGCLYLPPENSPIFATKKNTLFEDL FT TADIAKFNSGGHILLLGDLNSRTSNVQEKLFDGLDENACEIQLKPRFNTDT FT VNNRYGRELIDLCSAANMILVNGRTIGDLPGKNTCIKYNGTSTVDYCITSL FT HFFTSVQYFKVHDPTWMSDHCQISVSLKTNHSNFRHKTDEKTNSFPKRYIW FT EETSCEKFQEALRSSQTKNAIQQFISKPYSVSQTKHVVEDFNNIIKITADL FT SLKSIRVLKRKKKPVNKPWYDQQCRDLKSRMRALAAEIRKFPWRQDKRQNY FT VYKVKEYKKLIKKRKKQHNEGNMTFLSKLAKKEPKSFWKEINNLRNLENGS FT KVETGDEISISEWLNHFKTINANITVDTKDKEHADKIINSLEHETDNPLDF FT EITEEEIQNALSSLKSNKSSGTDSIINEMLKYGAPQLIQPFKKMFNIFLQS FT GHFPQQWSQSTLVPIYKSGEASNPGNYRGIAISSCVGKLFTFILNKRLQHF FT LESSNLLSPFQSGFRKDFRTSDNVFVLKTLIDQQTSKPGGKLYACFVDFRK FT AFDSVWRNGLFYKLLSLGIGGNFFRLIKSMYHNIEYCVKTPHGMSPYFQSF FT CGVRQGCNISPLLFNLYINDFPALLNPISCDSLFLKDTPVNSIFWADDLVL FT VSKSESGLKECLRALDQFCSLWKLSINKNKTKVMIFSKNGRTVNNKQSPFT FT SQGQTIEITNFYTYLGVDITPSGSFSRAIKQLRLKALRASFKLKSFLSSNS FT NPSISLAIDLFNSLVKPILLYCSELTGINAQCRDLIFKGIPECQNETTRTS FT IARVLSPILGSEVDILRVSRLGNISEASTIYTRPILVRFKKYSDKLNVIYQ FT SDTLFRNQHITIEQPKISYEIPDLEYVLLNFCKILLHVPKSSVNAAVRGEL FT GVFPLFVDSQTHLIKYWLRLHTLPNDRIVKKAYDTSVEEGHDWATHVQDIL FT CRHGFQNVWLNPAVNAKHFGNIFKERLKDTFLSGWREQLKSTNKLKTYSKI FT KKHFGMEEYLKSIHNRSFRYSITKLRISAHCLEIEKGRHRNTPATQRLCPT FT CHENIEDEFHFVMKCPTYSKERETLFKNIRTKTHISLAGAESDIFHVLLSC FT STHISVYVGQFLYNVFQMRDQINHT" XX SQ Sequence 6449 BP; 2240 A; 1382 C; 1227 G; 1600 T; 0 other; aaagcataat caagatggcg gacggtacgg gtgcactatc accagggctc cgtaagtgaa 60 gatgttttag agcggattac gattgttttt atcatagatt ttgagttttt tgagtaccgg 120 atatacagtt tagtaaccat atagtaagga tggctggaat cagaagcacc agggcgtcaa 180 ctcgggcaaa aagttcgacg aaatggtgga atgtctcagc gtcgaagatt ccagcggacg 240 agaccataca caccgactac aatgacgagg ccaaaaccag cccctgcaat acaaagtttg 300 tgacaactac cgacagagtt cctgtgtggg tgaaggtgtg taagttgcgc tacaacgaag 360 agtacggaaa gcagagcgga cagaaggtag aatggagaga cggagaccaa gcggacgtct 420 tgatcgtcac agagacagaa cagaaagacg gaaaagttca agaatcgtgc gtgctgtcaa 480 tccacttctg gagaaacggt acaatctgca tccaaggatc agcattcctg gaatggactg 540 acaaaatctt tccgagtctg aagaaaaaag ttgaagaaac atcgaatggc aactgtcaaa 600 atggcggccc aagccaaagc gacgccgacg gaaacaaaga ccacgatacc ggaagtaaca 660 ccttcaccac tccgaaaggt aagacgtttc tatcttctga aactgtggat acatcatccg 720 aaaccattga tcactcagat gaaacccctg tcaaggtccc tgttggcaca cctagaaaga 780 caaacagatt tagaaaagct gtcgaaaact ttatcagccc cttcaggacc ccaacttcta 840 ctgaatctcc actgaactct agcaaaaacg tgttggactt tgagaacacc ccgactccat 900 cggacactgt ggaaaagtca acttcacacg accacatcac aagagatatc cagaaacact 960 acacggatct tgttgacctt gttaatactc tacaagtaga agtcaaggcg ttaaaagatg 1020 aacagggtac acaaaagaat gaatttgaaa agaaactgaa agctgtctcc ggtgagttga 1080 aaaaacagct ggaagatgaa cagaaaaggc atgcaaatga aaggaagagt gtgaacagcc 1140 gccacaatga agaaatggcc accctcacgg cgaagtgtga atctctccag tcccttgttg 1200 aaaaacagca acaacagatt gctaaactgc aggaggtgac ggattccatt caactttccc 1260 aatccatttc aaagatgaga acgtatgcac agactgtggc tttgaaccca ggtgcccatc 1320 aggttagtgc taaaaactgt atctccaagg acaatggcac taataacgcc actgatgaca 1380 aaacaatgga tccggataca gctcatgcaa atcatacaac aacacagcca agtactgaag 1440 aagccgactc agacgttgaa ctgacccaac aggtttcgtc acaggaagga acacaagtcc 1500 cgacagagga gtcgaaacaa gtcagggtgt acacggactc catatggaaa gccgtcaatg 1560 tctccagaat gtttcccaac ctttcgacac acaaagacaa gacatctaca atatccgatg 1620 ccacaaagaa gctagaaacc atccacgacc caggaacatc ctatgcgatt cttcacatag 1680 gatctaacga cttagacaac tctaaacatg acgactcttc tgtacaaggc tgtctgcaga 1740 aaacagaaga actcatcagt attgccaaag catcatttcc aaacgccacc attgtcttgt 1800 ctcaggttct cccaagaggc aatgacctgc aatctgacct gaacaagaac atcaaggact 1860 ataaccaatc tgtactacaa agatacaagg atgaagagaa gctgttgtac gtacgtcaca 1920 agaaactgtc aacatctaga catctctaca ggcgcgatgg tatccacttg gatgaagtca 1980 cagggacgag tctcctggtg gcggatgtaa agaggaccat ccgctctgtg gagaacgaat 2040 acaatcccgg ttacagtaga gaaaactgga gacaaccccc acattcaggg caggacagac 2100 cacagcagaa ccgcggccag gacagaacac atcggaaccg cggacaggac agagcacaag 2160 agaaccgcgg acaggacaga tcacagcaaa accgcggaca ggatagacct caacagaacc 2220 gcggacagga cagaccacag cagaaccgca gccgagacgg atcctggcgt gccgatacac 2280 ctggagagcg gagccgtccc aactggtacc ggtcccaggt caacacacca cacgaggaca 2340 ttataaacct ggcctacaag ttgaaagaac tgctcaacaa cttttagata ttcagtgtac 2400 cagcgctgac caaaaggaaa gtgaaccctt ctattactct actttacata gaacttttga 2460 ttatttcata taaccaccaa atatagacct aaatggtact cattcatacg taaaccttaa 2520 aacatatgat gaacaacata cgtaattatt gctcttaaca tttttctact ttctgacttt 2580 ctatcatttc ttatggcaca tgatgtggcc ttacgcctaa ctagctggaa cattcaaggt 2640 agtcttagta aaaagtgtgt agaagtagat tttttagata gtatttgtaa atatgatgta 2700 gtctgtatcc aagaaacgtg gctgaaacct gaagacaact ttcatataga cggttaccac 2760 tatttccgaa gcgacagaaa gcttagaaaa caatctcgta gaaattcagg cggggtagct 2820 attctcttta aacgaaatct ggcgaaagga atctcgaaat tacatagtaa caactccgac 2880 attatttggt gcaaattaga taagaatttc tttggctttg acagggatgt aatcttaggt 2940 tgcctttatc tacctcctga aaattctcca atatttgcta ctaaaaagaa tacactgttt 3000 gaagacttaa ctgctgacat agccaaattc aacagcgggg ggcatatcct acttctgggt 3060 gatttaaatt ctagaacatc taacgtacaa gaaaaactct ttgatggcct tgatgaaaat 3120 gcctgtgaaa ttcaactaaa acccagattt aataccgaca ctgtaaataa cagatacggc 3180 agagaactca ttgatttatg ctctgctgct aatatgattc ttgtgaatgg aaggactatt 3240 ggggatcttc caggaaaaaa cacatgtata aaatataatg gcacaagtac ggtagactac 3300 tgcattacta gtctacattt cttcacatct gtacagtatt tcaaagttca cgacccaacc 3360 tggatgtccg atcattgtca aatttcagtc tcccttaaaa cgaatcactc aaattttaga 3420 cacaaaacag atgaaaaaac taattcattc cctaaaagat atatctggga agaaacgtca 3480 tgtgaaaaat tccaggaggc ccttcgttcc tcacaaacaa aaaacgccat acaacagttt 3540 atatcaaaac cttactctgt cagccaaaca aaacatgttg ttgaagactt taataatatc 3600 atcaaaataa cagctgactt gtcactaaaa tctattcgag ttttgaagcg aaaaaagaaa 3660 cctgtaaata aaccgtggta cgaccaacag tgtcgggatt taaaatcaag aatgcgagcc 3720 cttgccgcgg aaattaggaa atttccatgg cgacaagaca aaagacaaaa ttatgtatac 3780 aaagtcaagg agtataagaa actgataaaa aagagaaaaa aacaacataa tgaagggaac 3840 atgacgttct tatccaaact agctaagaaa gaacctaaat cattttggaa agaaatcaat 3900 aatcttcgaa acttagaaaa tgggtccaag gttgagacag gggacgaaat ttcaattagt 3960 gaatggctta atcattttaa aacaattaat gcaaacatca cagtagatac aaaagataaa 4020 gaacacgccg ataaaataat caactctctt gaacatgaaa cggataaccc gctagacttc 4080 gagattactg aagaagaaat ccaaaacgct ctatctagtt taaaaagcaa caagtctagc 4140 ggaacagatt caataattaa tgaaatgctg aaatatggcg cacctcaact gatacagccc 4200 ttcaaaaaaa tgttcaacat atttttacag tcaggacatt ttccccaaca gtggagccaa 4260 agcactctag ttccaatcta taaaagtgga gaagcatcaa atcctggcaa ctacagagga 4320 atagctattt ctagctgtgt cggcaaattg tttactttta tcttgaacaa acgtttacaa 4380 catttccttg aaagctcaaa tttactatca ccttttcaat ccggttttag aaaagatttt 4440 agaacaagcg ataacgtctt tgtcctcaaa acgttaatcg accaacagac atccaaacct 4500 gggggtaagc tatatgcttg tttcgtggat ttccgaaaag cttttgattc ggtatggcga 4560 aacggactat tctataaact tttatcgtta gggataggag gtaacttctt tagactaata 4620 aaatcaatgt atcacaacat agaatattgc gttaaaaccc cacacggcat gtcaccatac 4680 ttccagtcat tctgcggggt gagacaaggt tgtaacataa gtcccctcct ttttaacctt 4740 tatattaacg atttcccggc cctattaaat ccaatatctt gtgactccct attcttaaaa 4800 gatactcctg taaacagcat attttgggct gatgaccttg tacttgtatc gaaatcagaa 4860 tcaggcctca aagaatgttt gagggcactg gaccaatttt gctctctgtg gaaattatca 4920 atcaataaga acaaaacaaa agtaatgatc ttctccaaaa atgggcggac agtcaataac 4980 aaacagtcac ctttcacatc tcagggtcaa actatagaga taaccaactt ttatacctac 5040 ctaggagtgg acataacacc ttcaggatct tttagtcgag ccataaaaca gttaagattg 5100 aaagcactac gggcctcgtt caaattgaaa tcattcttat catctaattc caacccctcg 5160 atttccctag ctattgatct gtttaacagc ctggttaaac ctatcctact ttattgtagt 5220 gagctcacag gaataaatgc acaatgcaga gatctaatat tcaaggggat tccggaatgt 5280 caaaatgaaa cgaccagaac atctatagcc agagtcctca gcccgatatt gggatcagaa 5340 gttgacatac ttagagtctc tcgtctggga aatatatcag aagcctcgac tatttatact 5400 cgtcccattc ttgttcgatt caagaaatac agtgacaaac tgaacgtgat ataccagtca 5460 gacactttgt ttaggaacca gcatataaca atagaacaac caaaaatctc atatgaaata 5520 cctgacttag aatacgtact cttaaatttc tgcaaaattt tacttcatgt accgaaaagc 5580 tctgtcaatg ccgccgtaag aggggaatta ggagtatttc ctctattcgt tgacagtcaa 5640 acacatctga ttaaatattg gctaagattg catactctac caaatgatag gattgttaaa 5700 aaggcttacg acacttcagt cgaagaagga catgactggg ctactcacgt tcaggacatt 5760 ttatgccgac atgggttcca aaatgtttgg cttaaccccg ctgttaatgc caaacatttt 5820 gggaacatat tcaaagagcg tctaaaagac acgtttcttt ctggctggag ggaacagctc 5880 aaaagcacca acaaacttaa aacatattcg aaaatcaaaa aacactttgg tatggaagaa 5940 tatctcaaat caattcataa tagatcattt agatacagca ttacaaaact gagaataagt 6000 gcacattgct tagaaattga aaaaggtcgt caccgtaaca caccagccac tcagagactg 6060 tgtcctacat gtcatgaaaa tatagaagat gaattccact ttgttatgaa atgtcccacc 6120 tacagtaaag agagagaaac attgtttaag aatattagaa ccaaaacaca catctctctc 6180 gctggtgcag aaagcgatat tttccatgta ttgctctcat gttcaactca tatatccgtc 6240 tacgtaggcc agtttcttta taatgtattc caaatgcgag atcagattaa tcatacataa 6300 ctgtactatt atcgataatg ttttgttgtt gttttgtatg tagcctctta gtactactgt 6360 ctgtatagct attagttgtt attaattgcc atacattgta ccctgtgcga ttgttgagca 6420 ataaattcat tcattcattc attcattca 6449 // ID BEL-229_AA-I repbase; DNA; INV; 5951 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-229_AA_; KW BEL-229_AA-LTR; BEL-229_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5951 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 913-913 (2011). XX DR [1] (Consensus) XX CC Positions [4992-5549] - Integrase core CC 'GAGTA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 27..5951 FT /product="BEL-229_AA-I_1p" FT /translation="MASGDTSLGGLKTGRDCGGCDQPNDADSQMVQCDACQ FT VWYHLKCAGETPGVENRPFKCRTCQPPVRKQTRKQPRSQKSTSENRLKVPV FT VTTTDEHTETTPKKNPEVLPSKDNAGFPSKAISTKTVSSACRSRAVLQKQI FT EAEQRLAELKLAEAEKRLEEDRLMQERERALRDEKIKLQEDLLRKMQELEG FT TVDDEKRSGYSSSSGISKTRNWLAKQREQDIISEHIGNSSHRSPKMSDHIS FT RQSGDDNNEDQMLPELADREAYGAIDQQGSGGMHRRMSSIRPLEERPLEGV FT SNVNRIATREIRSSTQSMTGGPTANQIAARQIWPKKLPIFSGEPEEWPIFY FT SSYDGANAACGFSDVENIIRLRECLRGPAREAVISKLMFPKSVPSIIETLR FT RLYGRPELMVKSLLAKVRRLEAPKPERLDTLINFGMTVQQLSDHLIAADLQ FT SHLANPTLMEELVDKLPAAYKLEWVRFKRVFETPTLKEFAGFMDLLVADAS FT EVTVLSQSRTEKNRHEKEKHGHKGHVYAHNDDDVDKIPSPREPCPICGETN FT HRVRNCEKFQRMNVEFRITATKKYKLCAVCLFNHESKCRSRIRCNIENCQE FT RHHPLLHRPGRRSAVASRVIDAECNTHGPRTRAVLFKVVPVTLHNEGRSFD FT TFAFVDEGSSRTLIDSSLARLLKLKGETVPLKLIWTSNVTRTERSSKLVDV FT MISARGQDQTFSLNGAHTVNGLNLPKQTLLLDGLADRYDHLRDLPTISYVD FT AEPKILIGLRNLELFSPLETRVGQPGEPVAVRSVLGWAIYGPCGIDNQRSD FT FVGLHDCNCKPDEDLNELIRQQFVLEEAVVCVAPLPESVDDKRARQLLEET FT TRRLGDRYETGLLWKSDEIRFPNNWNMAMKRLKGLEVKLSRDPVLRTNVHQ FT QIREYVDKQYAHIATEQELASADPNRVWYLPINVVTHPRKPGKKRLVWDAA FT AQVDGMSLNSQLLKGPDLLNSLPSVISVFRERPIAFGGDVREMFHQVRIIP FT KDKHSQRFVFRFDPSDPPDIYIMDVATFGASCSPCSVQHVMRKNALEHAAD FT FPDAAAAIIERTYMDDYVDSTDTVEEAATRAKQVREIHAHAGFEMRNWVSN FT SEDVLRALGEIKEQKPVPIGDVTQENWERVLGLTWDPQGDFFSFSTTFHGD FT LEPYVSGNRRPTKRIVARSIMSVFDPTGMLAPFSIHGKMLIQDLWRSGTQW FT DQDIPDEEYAKWMRWTHLLPGIGRLKIPRYYFGTLSPNRFKSLQLHVFTDA FT SEFAMGCAAYFRAADDQGVHCALIMSKSKVAPLQHQSIPRLELQAAVMGAR FT MLKCAMENHNLPISRSFLWTDSTTVLSWIRSDNRKYKQYVAHRIGQILSHT FT QLNDWRWVPTKENVADFLTKWGKHTEPVSSSSWFNGPAFLYETEEKWPQQR FT QVASNIQEELRPCYSLHHIVIPQDLIKVENISKWSVLLRTMILVGRFINNC FT RLRINGQPIETIVATDNQLKHLKRSVPAMIVPFRRAEFEQAELNLWRLAQA FT ESYPDEVRVLLKNRDASPDALTSVERSSPLYKLSPFADEFGVVRVEGRTAD FT AGYAPFDARFPIILSKDHPITFRLINHYHNQYGHANKETIVNEVRQRFFVP FT SLRSVVDKVARACQRCKVRKCKPQYPRMAPLPEQRLTPYIRPFCYVGVDYL FT GPLEVAVGRRKEKRYVVVFTCLVVRAVHLEIAYDLSTDSCVMAIRRFVRKR FT GSPEQIFSDNGTNFVGANRELTQQIKLINDNCANTFTDAKTKWSFNPPAGP FT HMGGVWERMVRSVKESMRALDDGRKLNDEILLTVLAETEWFINSRPLTYMP FT QEAGNDEALTPNHFIFGNSSGSHEPLRSSTDLGEALRSSYQRSQYLSDALW FT KRWIKEYFPTVNRRSKWFNDVRPIKVGDLVYVADGDRRTWIRGKVEEVITG FT RDGRIRQAIVQTVNGNGKLKRPVVKLAVIEIGDCESGKFPERPHLDPRGGG FT " XX SQ Sequence 5951 BP; 1722 A; 1375 C; 1529 G; 1323 T; 2 other; ttcttaaagg ataccactca gctcagatgg cttcaggcga cacttcactt ggagggctca 60 agacgggccg cgattgcgga ggctgtgacc aacctaacga cgcagactca cagatggtcc 120 aatgcgatgc gtgtcaagta tggtaccatt tgaagtgtgc cggagaaact ccgggcgttg 180 aaaacagacc gttcaaatgt cgaacgtgcc aaccaccagt caggaagcag acgagaaaac 240 aaccccggtc gcagaagagc actagtgaaa atcggctgaa agtacccgtg gttacgacta 300 cagatgaaca cactgaaaca acaccaaaga agaatccgga agtgctgccg agcaaggata 360 atgcgggttt tccgagcaag gcgatttcta cgaaaaccgt ttcatcagca tgtcgttctc 420 gcgccgtttt gcagaaacag atagaggcag agcaacgttt ggcggaattg aagctagccg 480 aagctgagaa gcgtctagaa gaagatcgat taatgcagga acgagaacgc gctcttcgcg 540 acgagaaaat aaaattgcaa gaagatctgt tgcgtaaaat gcaggagcta gaaggaacag 600 tcgatgacga aaaacgctct ggatattcta gttcaagtgg gatcagcaag acaagaaatt 660 ggttggcaaa acaacgggag caggacatta tttccgaaca tatagggaac agctctcatc 720 gttctcccaa aatgtcagac cacatctctc gtcagtcagg tgacgacaac aatgaggatc 780 agatgctacc cgaattagca gaccgtgagg catacggtgc tatagatcaa caaggttcag 840 gggggatgca cagaaggatg tcatcaatca gacctttgga ggagcgtcct ctagaaggag 900 tgtcaaacgt aaatcgaatc gccacgcgsg agattcgatc atcaacgcaa tcgatgactg 960 gaggaccaac cgcgaaccaa atagctgctc gacagatttg gccgaagaaa ttgcccattt 1020 tttctggtga gcctgaagaa tggccaatat tctatagcag ctatgacgga gcgaatgcag 1080 cctgcggttt ctcagatgtt gagaacatca tccgtcttcg agagtgtctt cgaggaccag 1140 cgagagaagc ggtcatctcg aagctgatgt ttccaaaaag cgtgccatcg atcatcgaga 1200 ctctacgaag attgtatggc cgcccagaat tgatggtcaa atcgttactg gcaaaagttc 1260 gccggctgga ggcgccaaaa ccggagcgtt tagacacttt gataaatttc ggcatgacag 1320 tgcaacaatt aagtgatcac ttaatagcag ctgacctcca gagtcacctg gctaacccaa 1380 cgctaatgga agagctagtg gataagctgc ctgcggcgta taaattggaa tgggtacgat 1440 ttaagcgggt attcgagaca ccgacgttga aggaattcgc tggattcatg gatcttctgg 1500 tagctgacgc gagtgaagtt accgtgctat cacagtcgag gacagagaag aacaggcacg 1560 aaaaggagaa gcacggtcat aaggggcacg tctatgctca caacgacgat gacgtagaca 1620 agattccctc gccgagagaa ccgtgcccca tctgcggaga aacaaatcac agggtgcgta 1680 attgtgaaaa gtttcaacga atgaatgtag aatttcgcat cactgccacg aagaaataca 1740 aactttgtgc agtatgtcta ttcaatcacg aatccaagtg ccggtctaga atccgttgca 1800 acatagaaaa ttgtcaggag cgacatcacc cgctgcttca tcgccctgga cgaagaagtg 1860 ctgtagcctc gagagtcatc gacgcggagt gtaacactca cggtccgaga acaagagcag 1920 tattgttcaa agtggtacct gttacacttc acaatgaagg acgcagcttc gatacctttg 1980 cctttgttga tgaagggtca tcaaggactc tgatagactc aagcttagca cggcttctta 2040 aactaaaagg tgaaacagta ccactgaagc tgatttggac atcgaacgtt acgcgaacgg 2100 aaagatcatc aaaactggta gatgtcatga tctcagcgcg aggtcaggac caaacattca 2160 gtctcaatgg agctcacacg gtaaacgggc tgaatctacc caaacaaact ctgttgctcg 2220 acggtttggc tgatcggtat gaccatctgc gcgatctccc caccatctcg tatgtcgacg 2280 cagaacctaa aatcttgata ggtttgcgaa acttggaatt gttctctcca ctcgaaaccc 2340 gggtagggca gcctggtgag cctgtagcgg taagaagtgt gcttggctgg gcaatatacg 2400 gtccatgtgg aattgacaac cagaggagtg acttcgtagg ccttcacgac tgcaactgca 2460 aacctgacga agatttgaat gaactaattc ggcaacagtt cgttctcgag gaggcggtgg 2520 tttgcgtcgc gccgttaccc gaatctgttg acgacaaacg cgcacggcaa cttctagagg 2580 aaacaactcg gcgcttgggt gatagatacg aaacggggct tctttggaaa agtgatgaaa 2640 ttcgttttcc aaataattgg aacatggcaa tgaagcggtt aaaaggtctt gaggtgaagt 2700 taagccggga tccagtacta cgcacaaacg tacatcaaca gattcgggaa tatgttgata 2760 aacagtacgc tcacatagcc acggaacaag agctggctag tgctgaccca aatcgggtat 2820 ggtacctgcc tatcaacgta gtaacacatc ctcggaagcc agggaagaag agactagtgt 2880 gggacgcggc ggcacaagtt gatggcatgt cgctaaactc tcagcttcta aagggtcctg 2940 atttgctgaa ctcacttcct tccgttatta gcgtcttcag agaacggccg attgcctttg 3000 gaggcgacgt tagggaaatg tttcatcagg ttcgaataat cccgaaggac aaacattccc 3060 agcgcttcgt cttcagattc gaccccagtg accccccgga catttatata atggacgtag 3120 cgacctttgg ggcgagctgc tcgccctgtt ctgttcaaca tgttatgcgc aaaaatgctc 3180 ttgaacacgc agcagacttt cctgacgcag ctgctgcgat aatcgagagg acttacatgg 3240 atgattacgt tgatagtacc gacacggtag aagaagcagc aacgagggcg aaacaggtac 3300 gagaaataca cgcacatgcc gggtttgaga tgaggaactg ggttagcaac agtgaagacg 3360 tcttacgtgc gctgggtgaa attaaggagc aaaagccggt gccgattgga gacgttacac 3420 aggagaactg ggagcgtgtt ctggggttga cctgggatcc gcaaggagat ttcttctctt 3480 tctcaactac cttccatggg gacctagaac catacgtctc tggaaaccgt cgtcccacaa 3540 aacgtattgt cgcccgaagc ataatgagtg tgttcgatcc aaccggcatg cttgcaccgt 3600 tttcgataca cgggaaaatg ttaatacaag atctgtggag atctggcacc caatgggatc 3660 aagacattcc ggatgaagaa tacgcaaaat ggatgcgttg gacacatctt cttccgggga 3720 ttggacgcct gaaaattcct cgatattact ttggaactct aagtccaaac cgattcaagt 3780 cgctgcaact gcatgtattc accgatgcca gtgagttcgc catgggatgt gcagcatatt 3840 tccgggcagc cgatgaccaa ggtgtacatt gtgccttaat aatgtctaag agcaaagttg 3900 ctcctctgca acaccagtct ataccaagat tggagttgca ggcagcagtt atgggggcaa 3960 gaatgttgaa atgtgccatg gaaaaccata accttccaat cagtcgcagc ttcttgtgga 4020 cagattccac cacagtgctg tcgtggatcc gatcggataa ccgcaaatat aagcaatatg 4080 ttgctcatcg aatcgggcaa atcttgtcgc acacacagct gaacgactgg cgttgggtgc 4140 ccaccaaaga aaacgttgct gattttctga cgaaatgggg aaagcacacc gaaccggtct 4200 catctagttc gtggttcaat ggaccagcct ttttgtatga aacagaagag aaatggcctc 4260 agcagcgaca agtggcgtcg aatatccagg aagaactacg gccatgctat tcactacacc 4320 atatcgtgat tccacaggac ctaataaaag tggaaaacat ttctaaatgg agtgtgctgc 4380 tacgaacaat gatcttagtg ggtcggttta tcaacaactg tcgtctgaga atcaacggac 4440 agccgataga gaccatcgta gcgactgata accagctcaa gcatttgaaa cgttcggtac 4500 cagctatgat tgtaccgttt cgccgwgcag agtttgagca agcggaattg aacctttggc 4560 gacttgcaca ggcagaaagt tacccagacg aggtaagagt gctgctaaag aatcgcgatg 4620 cgtcaccgga tgcattgacg agtgttgaac gttcaagccc tttatacaag ttatcccctt 4680 tcgctgacga atttggcgtg gttcgagtag aaggcagaac cgccgatgca gggtacgctc 4740 ctttcgacgc aagatttccc ataatcctct cgaaagacca tccaataacg ttccgtctaa 4800 taaatcatta ccataaccaa tacggacacg caaataagga aacaatcgtg aacgaagtcc 4860 gccaacggtt cttcgtccca agtcttcgat cagtcgtaga taaggtagcg agagcatgtc 4920 agcgatgcaa agtaagaaaa tgcaaacctc agtatccacg gatggcccct ctacctgaac 4980 aacgattaac cccatacata agaccatttt gttatgttgg agtcgactat ctaggtccct 5040 tggaagttgc cgttggccga cgcaaggaga agcggtatgt ggtagtcttc acttgcctgg 5100 tggtcagagc cgtccacctt gagatagcct acgatttatc tactgactcg tgcgttatgg 5160 ctatacgacg ttttgttcgc aagagaggtt cacctgaaca aatattttct gataatggca 5220 caaactttgt aggtgccaac cgtgagttga cgcaacaaat taagctgatc aacgacaatt 5280 gcgcgaacac attcacggat gccaagacta agtggtcctt caatccaccc gccggtcctc 5340 atatgggtgg cgtgtgggag cgtatggtac gcagtgtgaa agaatccatg agagccctcg 5400 acgacggccg caaactcaac gatgaaatcc tgttaactgt gctggccgaa acagagtggt 5460 tcatcaactc ccgcccactc acatacatgc ctcaggaggc aggtaacgat gaggccctca 5520 cgcctaatca ttttatattt gggaattcat ctggttcaca cgaaccgctt aggagctcca 5580 ccgaccttgg agaagcattg cgtagcagtt accaacgaag ccagtatttg tcagatgctc 5640 tatggaaacg ttggataaaa gaatacttcc ctacagtcaa ccgtagatca aaatggttca 5700 acgacgtacg ccctatcaag gttggagatc tggtatacgt agcagatggc gaccggagga 5760 catggatacg agggaaggtt gaagaggtca ttacgggacg agacgggaga atacggcagg 5820 ccatcgtaca aacggtcaac ggaaacggca agttaaaacg accagtggtc aaacttgcgg 5880 tcatcgagat tggagattgt gaatccggga agttcccaga gaggccacat ctggatccac 5940 ggggcggggg a 5951 // ID hATm-37_HM repbase; DNA; INV; 2445 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-37_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2445 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1931-1931 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 451..2226 FT /product="hATm-37_HM_1p" FT /translation="MSKLSRNLVNKSSNVWQQNIKSKVVRRSEIPLRVTEL FT NELCKMPLNRDIVGRFLGLHDKMKEKKIRIETIAKELNLLWDKFNFPVVSM FT QRVHPKIANLIDAYVRYRKKKGKRFLEELASLFDITKTDGNWLSSEDKQLY FT LQQIESHGTVGYTTAKLAPVSTIHPSKRRKVMQSSVEKQVMVTSDESVQET FT DAEISPGDDFIPEVETKRKKYQKTKAAANLVTKYSLSTRQSSRVCSSLAAE FT CSAKLPTPSQTGIWKRVVRDGIRKAAEIKRILQQENDFCLHFDGKKLSSKE FT YEVVCLQSPSRKLNLGIIICESGSAANIFSGLKKLLDDFDAWKSIKMIVCD FT TTAVNTGSRNGIIARLYKEFERKCIIRPQYIGCQHHILDLVLRHLLDFCVP FT TVSQKPDINYEFVDNICSQYENLQTKYTGEVEVPASENAGWRSDFKFLFEL FT CEAYKVYKKCAKFPKIKWRKLPSLHNARWNSRATYALLAYFLLEEHRQQLS FT SICNFISTVWASAWFSNQHFTEHSFEELHSAVSEFDCQKATKCFQTHWVNQ FT PSVIDIPRSNIVAERAVKIMEDIYTTCKSDKYLNAKFLNSNLQL*" XX SQ Sequence 2445 BP; 834 A; 407 C; 448 G; 756 T; 0 other; ggcatatgac ttttaaactt tttgaataaa aattgatttc ctggggtttc agtagatgaa 60 cttttatatg tagattgcat ttattattcc tttagtgaaa aattatgaaa aaaggtggcg 120 cacccataag catcttttga tgacgtcact attcaatata aaactttaaa caaatggcgt 180 tatataatta tattgcgcta aatgtaggca atgttttata ccttaaatgt tagcttctta 240 ttgctaatta aagctaaata tagttttaac aatctgcctt taaccaatag gcttacactt 300 tcaaaatttc atattaaaag tagagtttct agtagtttgg ctttataagc ttcgattttt 360 ctcatctgct tttgactaat tgctttgttg ttttattgta gaagctattc tgacagaaaa 420 agtttagatt tcatttagtt catcaaagac atgtctaaac tatcacgtaa cctagttaat 480 aaaagctcca atgtatggca gcagaatata aaaagcaaag ttgtcaggcg ttcagaaata 540 cctttgcgag ttactgaatt aaatgaactc tgcaaaatgc ctcttaacag ggacattgtt 600 gggcgttttt tgggattaca tgataaaatg aaggagaaga aaataagaat agaaactatt 660 gcaaaggaac tgaatttact ttgggataaa tttaattttc ctgtcgtatc aatgcaacgt 720 gtccacccaa aaattgcaaa tctaattgat gcatatgtga gatacagaaa gaaaaagggt 780 aagagatttc tggaggaatt agcaagcctc tttgatataa ccaaaactga tggaaattgg 840 ttgtcctctg aagacaaaca actttaccta cagcagattg aatctcatgg tactgttgga 900 tacacaactg cgaagcttgc accagtctca acaattcatc cgtctaaacg ccgaaaagtt 960 atgcaatctt cagttgagaa acaagtcatg gtaacttcag atgaatctgt gcaagaaact 1020 gatgctgaaa tatcacctgg tgatgatttt atacctgaag tggaaacaaa gaggaaaaaa 1080 tatcaaaaaa ccaaggctgc tgcaaatctg gttaccaaat attctctatc tacaagacaa 1140 tcatccagag tctgttctag tctagctgct gaatgttctg ctaagttacc cacaccatca 1200 caaactggaa tatggaaaag agtggtcaga gatggaatta gaaaggctgc tgaaataaaa 1260 cgcattcttc agcaggaaaa tgatttctgt ttgcatttcg atggaaagaa actgtcatct 1320 aaagaatatg aagttgtttg cttgcaaagt ccttcaagga aattgaattt aggtatcatc 1380 atatgtgaat ctggttcggc tgcaaacata ttttctggat tgaagaagct cttagatgac 1440 ttcgatgcgt ggaagagtat taaaatgatt gtgtgcgata caacagcagt caacactggt 1500 agcagaaatg gaataatcgc tcggctttat aaagaatttg aaagaaaatg tattattaga 1560 ccgcagtaca ttgggtgtca acaccacatc cttgacttgg ttcttcgaca tttacttgac 1620 ttctgtgttc caacagtctc acagaaacca gatataaact atgagtttgt tgataacatt 1680 tgcagtcagt atgagaacct tcagacgaaa tatacaggtg aagtcgaagt acctgcttct 1740 gaaaatgctg gatggaggag tgatttcaag tttctattcg aactttgcga ggcttacaaa 1800 gtgtacaaga aatgcgctaa atttccaaaa atcaagtggc gaaaacttcc ttcacttcat 1860 aatgcgagat ggaactctag agcaacatat gctctactag cttattttct tctggaagaa 1920 catcgtcagc agctgagcag tatctgcaac tttatatcaa ctgtatgggc atcagcgtgg 1980 ttctctaatc agcatttcac tgaacacagt tttgaagaat tgcattctgc tgtttcagaa 2040 tttgattgcc agaaagcaac aaagtgtttc caaacgcatt gggtaaatca gccttcagtc 2100 attgacattc cacgatcaaa cattgtggct gaaagagcag tcaaaataat ggaagacatt 2160 tacactacct gcaaaagtga caagtacctt aatgctaaat tcttaaactc aaatttacaa 2220 ttgtgaacac tcaacagtga ctaaagtaag catatattca tatggatgtt taaaatttct 2280 gttttacaat atataacttg tgtgaattaa gcattgtagc tatttcttta agggtgcgcc 2340 aggtttttta aaatttaaac aaaaattaag tcatatttac tatatttaga taaaagttca 2400 tcatttgcca ccccaggaat cttgattgaa aaaaagtcat atacc 2445 // ID BEL-13_DPu-LTR repbase; DNA; INV; 267 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-13_DP_; KW BEL-13_DPu-I; BEL-13_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-267 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 267 BP; 64 A; 72 C; 69 G; 62 T; 0 other; tgttccaaac gagccagatg gggcttcccc agccgcagcg gccgtgcgcc cacctggcgc 60 gtagtagcag agtcggagag cagtcagtcg agagtaaaga gtcgagtcaa gtcacgtcac 120 gtttttgttg agttttgaag ggcaaactag gtcgcgcctg ccagcaggca gccgtgacac 180 tcattctagt ttcccccttg tattgccagt aattcttatt tcgtgaaata caagtaactg 240 aatcccacac ccacgcgtta ttgaaca 267 // ID CR1-16_HM repbase; DNA; INV; 4088 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4088 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1844-1844 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 561..3644 FT /product="CR1-16_HM_1p" FT /translation="MALISDFETLVFNFFETKFFSLHENSDPDINYFHDTK FT IECSYYYPNELKGILFKNVNNDQIRILHVNIRSINCNFEKFQNLLEETKYF FT FNIICLTETWATSNDIKKNFNFYLSHFEMISLERQINKRGGGVLIYIHKDI FT KHCCRNDLSVSDGDGEILTIEIINERTKNILVSCCYRPPDGVSENLSMFLQ FT QNVIAKGNNEKKKNFLVGDFNMNCFVYNDDNKVKNFYDTIFESGAIPLINR FT PTRITKNSATLLDNIITTDLFNNDIKLGILKTDISDHFPIFLTINTDQLPK FT QKLNKIVKKRFYTSSNIEAFKNQLSLLHWKHLNFNENANNLYEKFYETFIS FT VYESNFPIVTITLKTKHFDNPWITKGFKKSSKTKQRLYIKYLKTKTPDSEK FT IYKDYKNLFEKIRKKLKKNYYSKLINKAKNDSKRTWQILKEITGKQKTCSS FT SLPQMLKVDNNSLHEPQIIAHEFNKYFTEIGSILSSKIQKTQTSFYDFLLP FT FDKNISSEELSAELSFDEFEKAFKSLKKNKAPGADEINGNIVIDCYEQLKN FT ILFKIFRASIHQGIFPERLKLARVTPLHKEGDRSNINNYRPISVLSIFSKI FT LERIIFNRVYNYFNYNNLLYNNQYGFRKGSSTEHAIIQFIHNISKSFEKSQ FT YTLGVFIDLSKAFDTVDHHILIQKIKYYGLNNKILKWFESYLTNRKQLVYS FT NDGYQSEPLSITCGVPQGSILGPLLFLIYVNDLNKASKLKSIMFADDTNLF FT LSHTDIYELFSTTNKELKHISNWFKANKLTLNINKTKWIIFHSCAKKRFLT FT NNMPQIYIDETIIKRDTVIKFLGVYLDENITWRKHIDHISTKVSKNIGILY FT KARNYLNKKNLTQLYYSFIHNYINYAIIAWGSTEKSKLQHLYRRQKHAIRV FT VNYADRFSRSKYFFDYMNVLDVYKLNIFNVLCFTFMWKNDLSLFVFNDLFS FT LKPINKYTLRSNSFLNEPFCKTKFNQFCIAYRAPHLWNKIVLPNFDPSTTL FT PVFKTKLKNFILCMDNILKFY*" XX SQ Sequence 4088 BP; 1605 A; 595 C; 524 G; 1364 T; 0 other; gaaagtctta acttccaaga aacaactatt atggaaaaaa taaataagat cacgaagcgt 60 tacgaaaagg agataaacaa cttaaataag aagacactag atttggagaa tcgatctcga 120 agaaacaacc taagattaga tggaatattt gaaaagccaa atgaaaattg gaatgagtgc 180 gaaaatgcag taaaggaaat gtttaaaaaa caactaaaaa ttagtaatga aattatcata 240 gaaagagctc atagaatagg ccaacctaaa gaagataaga aaccaagaac gattgtccta 300 aagttattaa attttcaaga caaaacaaaa atacttaacg ctacaaaaaa cttgcgagga 360 actgggatat atgttaatga ggatttcgct aaagaaacga tggaaagccg aaggatgttg 420 tgggaagagg tcaaaaagtt gcgacttgaa ggtaagtatg cagttataaa gtatgataaa 480 attgtttcta gagaatttcg aaagtaaatg cgctcattct taataagaac tcctttttaa 540 tttatttgtg ataattaatc atggctttaa taagtgactt tgaaactctt gtcttcaatt 600 tttttgaaac aaaatttttt tcactacacg aaaactccga cccagatatt aattatttcc 660 atgatacaaa aatagaatgt tcttactact atccaaatga attaaaagga attctattta 720 agaatgttaa caatgatcaa attagaattc tgcatgtaaa tatcagaagc attaattgta 780 attttgaaaa attccaaaat ttactcgagg aaactaagta cttttttaat attatttgct 840 taactgaaac ttgggcaacc tcaaacgaca taaaaaaaaa ttttaatttt tatctttctc 900 attttgaaat gatctctctc gaaagacaaa taaataagcg cggtggtgga gttctaattt 960 acattcataa agatataaag cattgttgta gaaatgatct tagtgtttct gacggcgatg 1020 gagaaatatt aacaattgag attattaatg aacgaacaaa aaatatttta gttagctgtt 1080 gttatcgacc gccagatggc gtgagcgaga acttgagcat gtttttacag caaaatgtta 1140 ttgcaaaagg taacaacgaa aagaaaaaaa actttcttgt tggtgatttt aatatgaatt 1200 gttttgtcta caatgacgat aacaaagtta aaaattttta tgatacaatt tttgaatctg 1260 gagcaatccc tttaataaat cgcccaacaa gaataacaaa aaattcagca actttgctgg 1320 ataatatcat taccacagac ttgttcaaca atgatataaa attgggtatc ttaaaaacgg 1380 atatttcgga ccactttcct atttttctaa ctataaacac cgaccaactg cctaaacaaa 1440 aattaaataa aattgtaaaa aaacgttttt atacatcatc aaacattgag gcctttaaaa 1500 atcagctctc attattacat tggaaacact taaattttaa tgaaaatgca aacaaccttt 1560 atgaaaaatt ctatgaaact tttatctctg tttacgagtc taattttcca attgttacaa 1620 ttacattaaa aacaaaacac tttgataatc cctggattac caaggggttc aaaaaatcat 1680 ccaaaactaa acaaaggcta tacataaaat atctaaaaac aaaaacacca gatagcgaaa 1740 aaatttacaa agattacaaa aatctattcg aaaaaatccg aaaaaaatta aaaaaaaatt 1800 actattcaaa attaattaat aaagctaaaa atgactcaaa acgcacttgg caaatattaa 1860 aagaaattac aggaaaacaa aaaacatgct caagctcttt gccacaaatg cttaaagtcg 1920 ataacaatag cttgcatgag ccacaaataa tagctcatga attcaataaa tattttactg 1980 aaattggatc aatcttatca agtaaaatac aaaaaactca gacctcattt tatgattttt 2040 tgttaccttt tgataaaaat atttcctctg aagaattatc tgctgaacta tcatttgatg 2100 agtttgagaa agcttttaaa tctttaaaaa agaataaagc acctggtgca gatgaaataa 2160 acgggaatat tgttatagat tgctatgaac aattaaaaaa tattcttttc aaaattttca 2220 gagcatctat tcatcaaggt atttttcctg aacgtttaaa acttgccaga gttacccctc 2280 tccataaaga aggcgacaga tccaatatca ataattatcg ccctatctct gttctttcta 2340 tattttctaa aattttagaa agaattattt tcaatagagt atacaattat tttaattaca 2400 acaatttatt atataacaat cagtatggtt tcagaaaagg gagctcaact gaacatgcca 2460 tcattcaatt tatacataac atctctaaat cttttgaaaa atctcaatac acactaggtg 2520 tctttattga cctatcaaaa gcttttgata cggtcgatca ccacatctta atccaaaaaa 2580 ttaaatatta tggtttaaac aataaaattt taaaatggtt tgaaagctat ttaacaaatc 2640 gaaagcaact tgtgtatagt aatgatggtt atcaaagtga acccctgagc ataacttgtg 2700 gtgttccaca aggctctatt ctcggaccac tactattttt aatctatgta aacgatttaa 2760 acaaagcttc caaattgaaa agtataatgt tcgctgatga tactaatctt ttcttatccc 2820 atactgatat ttatgaactc ttttcaacta caaacaaaga actcaaacac atttcgaact 2880 ggttcaaagc taataaatta actttaaata ttaacaaaac aaaatggatt atttttcact 2940 cctgcgcaaa aaaacgattt ttaacaaata atatgcctca aatttacatt gatgaaacta 3000 taataaaaag agatactgtt ataaaattct taggtgttta tcttgatgaa aatattactt 3060 ggagaaagca tattgatcat ataagcacca aagtttctaa aaatattggc attttataca 3120 aagctcgaaa ttatctaaac aaaaaaaact taacccagct ctattattca tttatacata 3180 attatataaa ttatgcaatc atagcctggg gaagtacgga gaaaagtaaa ttacaacatc 3240 tttatcgccg tcagaaacat gcaatccgtg tagttaatta tgcggatcgc ttttcacgtt 3300 cgaaatattt ttttgattat atgaatgtac tagatgttta taaacttaac atatttaatg 3360 ttttatgttt tacttttatg tggaaaaacg atttatcttt atttgttttc aatgaccttt 3420 tttctttaaa accaataaat aaatacacat taagaagtaa tagttttcta aatgaaccat 3480 tttgtaaaac gaagttcaac caattctgta ttgcatatcg tgcaccccat ctttggaata 3540 aaatagtatt accaaatttt gacccttcta ctactcttcc tgtttttaaa actaaattaa 3600 aaaacttcat tctctgtatg gacaacattc taaaattcta ctaaaaattt atgtaaaatg 3660 tatttttcaa atcttcagca tattgagtat taaaatgtta gtatattgta tatttgcatt 3720 atgttatatt gtattagtat attgcatatt tgcattatgt taaatctttg cattatgtct 3780 tttttaaggt tccggtgata agatccttct gatcttcttt cggaaaccta gtcttatatt 3840 gttagtacgc tgaattgtat atacatatat atatatatat atatatatat atatatatat 3900 atatatatat atatatactc attttttttt tcttttcttg tagaagtctt aaatattttt 3960 atgtatcaga cttatattat tatatatcac gacttatatt attatatatg tatcagactt 4020 atattattat atatcacgac ttatattatt ttgtatcacg actaacattg taaactaaaa 4080 aaaaaaaa 4088 // ID MSAT-5_CQ repbase; DNA; INV; 154 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A satellite repetitive sequence family from Culex DE quinquefasciatus - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-154 RA Kojima K.K. and Jurka J.; RT "Satellite sequences from the southern house mosquito."; RL Repbase Reports 11(1), 617-617 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 28 sequences with >93% CC identity. XX SQ Sequence 154 BP; 46 A; 41 C; 26 G; 41 T; 0 other; caatatgggt atcaaaattc ttcattttgt caacacatct caaaaatggt cttccaaaat 60 atttttctgg ccactggcca ctccggaacc ggttccggat atccccccgg ggaaatcttc 120 gaagtggcca atatcatatc agaacaaagc ccat 154 // ID E22_TC repbase; DNA; INV; 863 BP. XX AC X95485; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE T.cruzi transposon-like repetitive element. XX KW DNA transposon; Transposable Element; E22_TC; Interspersed repeat; KW transposon-like element. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Araya J., Cano I.M., Gomes B.H., Novak M.E., Requena M.J., RA Alonso C., Levin J.M., Guevara P., Ramirez L.J. et al.; RT "Characterization of an interspersed repetitive DNA element in RT the genome of Trypanosoma cruzi."; RL Parasitology 115 (Pt 6), 563-570 (1997). XX DR Genbank; X95485; Positions 1 863. XX SQ Sequence 863 BP; 175 A; 197 C; 217 G; 274 T; 0 other; gaattccact gcgatgcgtg cactcaagga cagggttccc gttccgacga gagaagcggc 60 cccgaccgca tctgagaaga ttcgagtagt tgacagcgtc agtgagcacg ttgctgggga 120 tattacggaa gataaatgaa tgggcacaac gcccctggac acgaatcccc gccacaaaca 180 gtcacctcgc agggtctgga gcggcatgag tcccgacgac cgcctcgcct ctttcaacga 240 cgtgaagttg acactgcgtg atttaggcgg taacagcacc gtgcgtgtgt gcgtgtcttg 300 tgtgccgctg cttctaatgg gattgtattt gggtggtggc agtcttctga gccacagagt 360 gtggattgct gcctcacgag aataccattc cgcatcgccc tctcatttta ttccaccctt 420 ttttttcttc aatgttttta cttttaaatg cctttttaat gcagttcagc tgtagtagta 480 gctactgttg ttttttcttt acttaatgtc tggttgctgc acactgtaat gagcgaagtg 540 cttcccgctc ttgtttgctt ttgttttcgc ccacacgcgg tgccggcccg cggctgtgcg 600 cccacggact gcacgaaagg aagattgagg agaacaggcc ggatgccgca tgaaatgtga 660 agaatgttgt gtgggatgct ggaataatgc ggatctttct ttttttttgc ttcaaatgtt 720 tttttttttt tttttttttg gtgcggacaa tatgtatata catacggtct ctttgtgctg 780 ctcggctgat gaaataaaag tcctgccgca tcaattgttc cattaatttt ttcttttttt 840 cttttgggag ttatgccgaa ttc 863 // ID BEL-191_AA-LTR repbase; DNA; INV; 362 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-191_AA_; KW BEL-191_AA-I; BEL-191_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-362 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 874-874 (2011). XX DR [2] (Consensus) XX SQ Sequence 362 BP; 105 A; 89 C; 70 G; 98 T; 0 other; tgttcacgca taacgtgaat tttgagccag tctaactgct cgaaaaccaa ttatagagtc 60 agcctgttct ttcccataga tcagcagcag tccgtcgcag catccgatga tgctgccgat 120 agtgatctgc aagataggta gatttttgcg aattatttgt ccactcatct tcccgcttta 180 cgagataaag ccgaaggatg gaccagaaaa tccctcagtt atcacccaaa cgcggaataa 240 actacttcac atcacgccga atatcgttgt cttataatcg acggagtaat aaaagttgtg 300 tcgttctaac tccaatcaag aacactgatt ccctagttcg aagttccctg atcgcggtaa 360 ca 362 // ID Sola1-7_AP repbase; DNA; INV; 4912 BP. XX AC ABLF01007348.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-7_AP. XX NM Sola1-7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4912 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(1945..2181,2139..2438,2374..2598,2522..3856, FT 3860..4351) FT /product="Sola1-7_AP_1p" FT /translation="MDSRQRSRSLKLVKAAQLNQPIPIINQRYQTRKKMPS FT TESNDQTIILNNLVFEEIMTQKDNVGVEKTIIAENFLLQIVKKNNNCRKLS FT SSNSKIELSYDSSEEWLPPGLKKKLDCLSEISSVSISSNNVLKSNNCSKEL FT FPFLVDADKNVENSDYNVSKSIDHSQETPPRFPGSYHRREANRLTILKKPP FT PDFPVHTIEESNDGIINQNTEVHNSSFDETIQIGKGGLRKSQKSNKRKIIW FT FRKNTSTKPLFVCGKVRNRTKEKLSGSGKTLQPNPCLSAKCQNKCVHKFSE FT DERKDIFTAFWGLNSIQRQRDFIISCAIEIPIKRVRAKHGLSRRSITYEYF FT LSYNSESKKVCLQFLLKTLNVSQIFVRYTLQNKTNIDISPNDRRGKGRPPN FT EVAIENMKNLDHFIQKLPAVPSHYCRASSIKKYLPAEFGNISRLYLVYMDF FT CKKNNHTVMSKSTFKTVFTKKYNIGFHLPKKDKCNLCSKFDNIKLSSDLNE FT EQKIKEEKHLEEKEECRQMFLFDQQLSKAGGDFVCSSFDLQKVLNTPMGPH FT MNLYYSRKYSYYNCSIYESGTCNAYAYLWGEIDGLRGCNEIVSCINQYLCD FT CNKNPKFKSISLYCDSCAGQNKNRAMLFMIMSCLKKQWADLTEVKIVFLLP FT GHSYMPVDSIHATIERFINDKTIWAPSEWPPLIRNSRVNPKSIEVKEQQFN FT FLNWKKNADKVLPKVLKDTEKNIVKTSKIKYINFYRNPLQNDDVILEAFYS FT YNADAKGFIINIPKPTSEALYFPEQAYTSNLSISKEKSKDLLKLCTDGIIP FT GKFHQEYINMTSNELVRDCIQETDEDDEVPPDEDAHKKTNKTKVVKKLKGK FT TKQLLLNTV*" XX SQ Sequence 4912 BP; 1870 A; 654 C; 719 G; 1669 T; 0 other; cgatggtgac aatgcaaaag ggcataagcg acacttttta cgaaatctac tttttgtaat 60 tagggcataa gcgacgtcat tctctatcga ttgctcttac tcatttatcg atacgattat 120 tttttacgat ataaaacgaa aatttagatt tagacataag cgccaaaaag agtaaattaa 180 ttaacagata tgtgaataaa tagggaataa gcgacattta gttttaaatg ttagcataag 240 cgccatagga ttttacttta tggggcctaa gcgacaacag cattattata gttaataata 300 gattttatat aaaaaaccat tttctgttat ctgttcggtt gacaataaat aataataata 360 aaaaaataaa aataaaaata atacgtcgta atcttaaaaa aatgcaatat cgtatgaagc 420 tattaacatt attttattcc aaagagaata tttatttacg tttctattaa aaaataaaat 480 gtcattattt ataattaatg attataaata ttataatatt attattatca ataatattga 540 actatgatag actctttgtt ttttgataat aataattatt tttgcgagtg catgttcaga 600 cgacagacac acgtctcaaa ttcactggag gtaaatatta catttattaa taataggtat 660 tatttaaaaa ttaagattca ttagggttgt ctctttatta tatctgcaga gtataacaaa 720 atgtcacgcg atgttgccgg tttcgtaact tgtaaaatag ctacacgtaa tagatacgta 780 cggatgtttt atactttaaa caacgctact agacattaag tttataaata taataatata 840 ttgttagggc taggatttct aggcgttcgc atgttttttc cttgagtcaa aattgattga 900 ttgtcagaat atatgtccac gatttggatt attctcgatt aaataaaaca attttttttt 960 tgcataatat tttgaacata atgcatataa ttgcataggt agttatctta tgaattataa 1020 atgaacctct ataatgaatg tgttatttgt cgaaaagccg tatatttact acctatctaa 1080 aatcttatct tggtcagatg accgtcggtt accgtatggt tgcattatta aacattattt 1140 ttgtttttca ttaatatggt aagttttagt taataaacat ccattcttcg ttccaaccac 1200 cggttattcg tgtttcaaaa tttgaaagaa cataataata tcatcacagc aatattaaat 1260 gtaacaacta ggtacttata attagttatt aattcttaaa aaaaaaaata ataataataa 1320 ataataaata attagtagat agttactgtt tcaataatta gtttctgatt tttatgtaaa 1380 aaatattaaa tgttattttg aaataaaaaa aatggttttt gcataatatt ttataatttg 1440 ttatttctat aaatcctagc cctatatatt atattaaatt tatatcaaat gtaatactat 1500 aatgatccca caatactatt ttctttctgt ctcgccaact ggcagtaggt atttcactat 1560 tattatttat gtatactgtc cagatatttt tcaagctagg ctttactttt tgttgaaatt 1620 atattcattt acgatgggca ttgatctaat aatgttatta taataaaatt ataaaaataa 1680 attacttgaa tcaattgaat attataatga tatttataat aacttataat ataaaaatca 1740 taagctaatt aatacatatt tttatgtgga ctggtacgaa aaaccgtgcg tatttacaaa 1800 gatgtgattt atacttgaag atacacgtca tgttttcatg caaataattc atgtattgga 1860 tcaatgtcta gcaacgtctt gttttatgaa gacaattagt catttacact tatttttata 1920 ataggtacta taaggtacat aaaaatggat tctcgtcaac gttctcgtag tttgaagttg 1980 gtaaaagcag ctcaattaaa ccagcctata cctattatta atcaaagata tcaaactcgg 2040 aaaaaaatgc catctacaga atctaatgat caaactatta ttttaaacaa tctcgtattt 2100 gaagaaatta tgacacaaaa agataatgtg ggggttgaaa aaacaataat tgccgaaaac 2160 tttcttcttc aaatagtaaa atagagttat cttatgactc ttctgaagaa tggttaccac 2220 caggattgaa aaaaaaatta gattgtcttt ctgaaatatc cagcgtttct attagtagta 2280 ataatgtttt aaaatctaat aactgctcta aagaattgtt ccctttcctt gttgacgcgg 2340 ataaaaatgt tgaaaattca gattacaacg taagcaaatc gattgaccat tctcaagaaa 2400 ccccccccag atttcccggt tcataccata gaagagagta atgatggaat tataaatcaa 2460 aatacagaag tgcacaacag tagttttgat gaaacaattc agattgggaa aggtggttta 2520 aggaaaagtc agaaatcgaa caaaagaaaa attatctggt tccggaaaaa cacttcaacc 2580 aaacccttgt ttgtctgcta agtgtcaaaa taaatgtgta cacaaatttt ctgaggacga 2640 gagaaaagac atttttacag cattttgggg attaaatagt attcaaagac agcgagattt 2700 tattatatca tgtgccattg aaatacctat aaaaagagta agagcgaaac atggacttag 2760 ccgacgtagt attacctatg aatattttct atcttataat tctgaatcga aaaaagtgtg 2820 cttacaattt ttattgaaaa cattaaatgt atcccaaata tttgtgagat atacgttaca 2880 aaataaaaca aatattgata tttctcctaa cgatagacgt ggtaagggcc gtccaccaaa 2940 tgaagtggca attgaaaata tgaaaaatct agatcatttt attcaaaaac tacctgcagt 3000 accatcgcac tattgtcggg catcaagtat aaaaaaatac ttaccagcag aatttggtaa 3060 tatatcaaga ttatacttgg tatacatgga tttttgtaag aaaaataacc atacagttat 3120 gtcgaaaagt actttcaaga cagtatttac gaaaaaatac aatattgggt ttcatcttcc 3180 taaaaaagat aaatgtaatc tttgttcaaa atttgataat ataaaattat ctagtgactt 3240 gaatgaagaa caaaaaatta aagaagaaaa acatctagag gaaaaagaag agtgtcggca 3300 aatgttttta tttgatcaac aattatctaa agcaggcgga gattttgtat gctcaagttt 3360 tgacctccag aaagtcttaa atacacccat gggtccacac atgaatttat attacagccg 3420 taaatacagt tattacaatt gttctatata cgagtctggt acatgcaatg cgtatgctta 3480 tttgtggggt gagattgacg gtttacgtgg ttgcaatgaa attgtgtcgt gtattaatca 3540 atatttgtgc gattgcaata aaaatccaaa gtttaagtca atttccctgt attgtgattc 3600 gtgtgcaggc caaaataaaa atcgagctat gttatttatg attatgagtt gtttgaaaaa 3660 acaatgggcc gatttaaccg aggtaaaaat cgttttctta ctcccgggac atagttatat 3720 gccagtagat tcaattcatg cgacgatcga acgctttatt aatgataaaa caatttgggc 3780 acctagtgaa tggcctccac ttataaggaa ctcgagggtt aacccaaagt caatagaggt 3840 gaaagagcaa caattttaaa acttcctaaa ttggaaaaaa aatgctgata aagtgctgcc 3900 aaaagtccta aaagatacgg aaaaaaatat tgtaaaaacg tccaaaatta aatatattaa 3960 tttttacaga aatccattac aaaacgatga cgtgatactt gaagcatttt attcatacaa 4020 tgccgatgca aaaggattta taataaatat accaaaacca acttcggaag cattatattt 4080 tccagaacaa gcatatactt caaatctttc gatttctaaa gaaaaatcta aagatttatt 4140 aaagctgtgt acagatggca ttataccagg taaatttcat caagaatata ttaatatgac 4200 ttcaaatgaa ttagtaaggg actgtattca agaaactgat gaagatgatg aagtaccacc 4260 agacgaagat gcccataaga aaacaaataa aacaaaagtt gtgaaaaaat taaaggggaa 4320 aactaaacag ttgctactaa acactgtatg agatgtacct atgtttttaa aatattacag 4380 tattgtcatg ttatttctat tttatataat aataatatta ttataaatgt ataatattta 4440 taattaatta tatgattatt tatacgtaat atttaactaa attcgagtca tgtttttgtt 4500 ttatttttaa tttttaactg attttatatg attttgtaaa actgcaatat tctggaataa 4560 ttaaaatttt aaaattcaaa aatatgttaa ttttctatta atatacaggg aagaaaatta 4620 atagggcata agcaccaaaa tactagaact aaaattacag gtaaggaaaa taatggggca 4680 taagcgccaa attatataat aatactcaat tcttactgat gtaagtgttt aactaaaaaa 4740 atgaaagaac aagtgtctta aatagagtac aactcatctt ttttattctt atcatatttt 4800 tctaaattaa aaatataacc tacaaaatgt aataattgtt aaaaaaaagt tttttggctc 4860 tcctgaataa tttcaaaaat gtcgcttatg cccttttgca ttgtcaccat cg 4912 // ID BEL1_Cis_LTR repbase; DNA; INV; 545 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-545 RA Smit A.F.; RT "BEL1_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000021. XX SQ Sequence 545 BP; 140 A; 144 C; 80 G; 181 T; 0 other; tgtaagatat ctcgacccat taatgtttta ttgatgtcta tgataaatca taaattgcat 60 cacaaccaaa ttacgtcaca atgggagttt gtgttacaat atctataccc gaccgcgtgt 120 tagaaaatgc ttcctttttg atttagtcac cgtacagtct gcagcacagt aatttatctg 180 ataccgtgta ttttccgttc cggttctaaa gccaaagttt ggtgagtgta attgtttact 240 atttcatacg tacacccgtc aattattgat tttggtccct ttatttttaa tttacagaaa 300 accgtatcat cctccgtatc atccctccgt atcatccctc cgtatcatcc ccactgcgta 360 acatctccgt ccatatcgtc cctgtatcac ccccatatcg tccctgcccc ctccgttccg 420 catctgtaac cgtttgaact ctgatcgcca attataattc gaggacacaa actgtctgtt 480 cttttattcg atacgagtta ttttcaaacc aaagcaaccc cgttactcca cccgcggaag 540 taaca 545 // ID BEL-2-LTR_HM repbase; DNA; INV; 281 BP. XX AC . XX DT 02-JAN-2009 (Rel. 14.02, Created) DT 02-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-281 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 433-433 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 281 BP; 109 A; 31 C; 44 G; 97 T; 0 other; tgttgtggat tacgtcttcc aagataagta gcaaaaatta gcggttgtat cttaatataa 60 acattaatat atttgtaatg atgcggtctt cttaaagata ctgggaaatt acgccaacgt 120 aatttttaag ataataatat aatattttgt ggttttgaat tttatataaa aaatggatga 180 tatttgtaaa taatagattc gcagtcatat gtaaataaaa caaaataaac aaacaagtga 240 attcattagt gtcactggga aacgtttaat aattccccac a 281 // ID Gypsy-5_DWil-LTR repbase; DNA; INV; 513 BP. XX AC scaffold_181075; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_DWil_; KW Gypsy-5_DWil-I; Gypsy-5_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-513 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181075; Positions 152583 153095. XX SQ Sequence 513 BP; 186 A; 98 C; 74 G; 155 T; 0 other; tgtagtatat tcatacaaat aatcattcag tacaatgagc tcaacttata acaccctgtt 60 gcaagtataa acaattaaac atatacgcat taaaatataa ttgtgcactt gaatatgttt 120 atatatatat acattgatgt atgtacacaa acatattcac ataaatctac agctaagcat 180 gatcatatat gtatttgcat ttgtatatgc acatacatat attcacataa gcaatgtatg 240 cacaagcgta acatgagcta gagccaggcc agcgtggaac gctgaccaag cattttgatg 300 catgcgctcc aacaatatgt atgtatgtac cgacactctc cctctctttt gtagctgtcg 360 cataagtcgc agcataagaa tatatatgta aacttagctt aagcaaaata ataattttga 420 ataaagaatc aatctgaaaa ctagaatcaa cggacactga cgtgtccgtc ctcaataata 480 tcatttcttc ttaaatggaa cccggtcatt aca 513 // ID Copia5-NVi_LTR repbase; DNA; INV; 253 BP. XX AC AAZX01004556; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia5-NVi; KW Copia5-NVi_I; Copia5-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-253 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1132-1132 (2007). XX DR Genome; AAZX01004556; Positions 1256 1004. XX SQ Sequence 253 BP; 61 A; 69 C; 55 G; 68 T; 0 other; tgttaaaata aagattcttg atgcgtcgca tattactcgc gtatactacc tatgtataaa 60 accacagagt gggggcaggt agcgacgaag cgatctggag cgccacctcg cagcgaacgc 120 gttgccgccg cccgctttgg ggcctcagac atttttgtac cgtcacgcag tgagcatctc 180 ctttcacgac acgaataaat ctattgactt tcaactgtgc ttgatccttc ttttttaacc 240 tgctccgcca aca 253 // ID MULE1_EI repbase; DNA; INV; 2883 BP. XX AC MULE1_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE MULE-Ei1, a new member of the Mutator DNA transposon superfamily DE from the single-celled eukaryotic reptilian parasite Entamoeba DE invadens. XX KW MuDR; DNA transposon; Transposable Element; Mutator superfamily; KW MULE-Ei1; MULE1_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-2883 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; MULE1_EI; Positions 1 2883. XX CC MULE-Ei1 (MULE1_EI) is a member of the Mutator superfamily. The CC TIRs are 187-bp long with 1 mismatch and are flanked by 9-bp TSD. CC The element contains a large ORF, which can potentially encode a CC 456-aa protein 39% similar to the Hop1 transposase from Fusarium CC oxysporum. There are several elements closely related to CC MULE-Ei1 in the E. invadens, E. moshkovskii, E. dispar and E. CC histolytica genomes as well as multiple other distantly related CC transposases. XX FH Key Location/Qualifiers FT CDS 196..1564 FT /product="MULE-Ei1_ORF" FT /translation="MNSIHIDYVINVDEKENYNYFKTPQVVVNQITEAFKE FT YSFETQARKMKEASVFTINGGAFSLFTTANRHTNSSNQFFKCVYRGCKCRL FT VMSLSKQKSHVVQITFHHLHSHTFDYLPDNVKPFFTQEEMKRMKDRFKLGI FT SPDNIKDEFGLPFTGKVMSSLLHKDRKDNQLCQAKQLRDYFNSLSDWSTII FT FTNRRGIFQGCVCINQTLCTSRDIGEVVVDDTCGTNEFGLPLVVAVNIDKE FT KRSHCVFFALMTNRTTQSFVLVFSYVKKFYDLTVVICDRCLSQTNALLQVF FT KDVNLVFCTRHIRRNLITEFGKYHPLVNLFNKVVFGKEGTEEQLIVAMKNQ FT IIQDETSKIEENFYDDKNTELDGNHIMDCVKETLETLIECTNKSNIKNSSP FT TKCRMFRELLDCAKCWLPSSMMTKTTNSTSNGIEGFFGWLKRKIEHKKCTL FT EQIAKSLVSCFX" XX SQ Sequence 2883 BP; 1049 A; 462 C; 434 G; 938 T; 0 other; ggggttagct atcttccacc gacaccagtg cacataagtt gcacaattaa ataaaatcaa 60 ataacatttt gtgaaaaatt gtgtttttct atttttgatt tttttaaacc aaaaaagctt 120 ttttaattgt agcaattctt caaactttta aagttttgta aaactatctt taaactcaaa 180 aaacaaacca ttcaaatgaa ttcgatacat atcgactacg tcatcaatgt tgatgaaaaa 240 gagaattata attactttaa aacgcctcaa gtggttgtaa accaaataac agaagcattc 300 aaagaatatt cattcgaaac ccaagcaaga aaaatgaaag aagcatcagt atttacgatc 360 aacggtggtg cattttcttt attcacaact gcaaacaggc atacaaacag ttcgaatcaa 420 tttttcaaat gtgtttacag aggttgtaaa tgtagattag taatgtcact ttcaaaacaa 480 aaatcccatg tggttcaaat tacttttcac catcttcatt ctcatacatt tgattatctc 540 ccagataatg tcaaaccatt tttcacacaa gaagaaatga aaaggatgaa agacagattt 600 aagcttggta tatctccaga caatattaaa gatgagtttg gactaccttt tacaggtaaa 660 gtgatgtcat cattattaca caaagataga aaagacaacc aattgtgtca agccaaacaa 720 ctgagagatt attttaattc cttgtcagac tggtctacaa ttatatttac aaatcgacgt 780 ggtatttttc aaggttgtgt ttgtatcaac caaacattat gtacatcaag agatatcgga 840 gaagttgttg ttgatgacac ttgtggaact aatgaatttg gattacctct agttgttgct 900 gtaaatattg acaaagaaaa aaggtctcat tgtgtgtttt tcgctttgat gactaacaga 960 acgacacaat cttttgtatt ggtcttttct tatgttaaaa aattttatga tttgacagta 1020 gtcatttgtg accgttgttt gagccaaaca aatgctctgc ttcaagtttt taaagacgtc 1080 aatttggtgt tttgtaccag acatataaga cgtaatttaa taacagagtt tggtaaatac 1140 cacccacttg tcaatttatt taataaagtg gtgtttggta aagaaggaac cgaagaacaa 1200 ttgattgtag caatgaaaaa ccaaatcata caagatgaaa catctaaaat agaagaaaac 1260 ttttatgatg ataaaaacac cgaacttgac ggtaaccata ttatggattg tgtaaaagaa 1320 actttagaaa cattgattga atgtaccaat aaatcaaata taaagaactc ttcgccaacc 1380 aagtgtcgaa tgtttcgtga acttctcgac tgtgcaaaat gctggcttcc aagttcgatg 1440 atgactaaaa ccaccaactc cacctcaaat ggaattgagg gcttttttgg atggctcaaa 1500 cgtaaaattg aacacaaaaa gtgcactctc gaacaaatag ccaaatcttt agtctcttgt 1560 ttctaagaca ttagaaacaa attccaatta taactataat aacactgaca atataaacct 1620 tctttctagt tattttgaag atgatgtact tatatgtata actcgatcaa catctgatgc 1680 tttggttgag tgtgtaaaaa cggtacgaga tcctcttctt atgtcttttt tttgactcaa 1740 acacgtgtat tacttcctta aattcgtgtt catgtctatc attttctaat aatggatttc 1800 cttgtcccca tttactaaga ctttatatct caagtttcaa aaaagttcct ttaagactgt 1860 ttgatttaaa aagactcaaa acgtcagcca aacttgttac gcatgagtct aaagtcctta 1920 aactcgaaga gttaaaaaat ttgtcattta acgattttca aatgctactc gagaagtatt 1980 acccaaacgt caaagccaat cctcgaatta ttccaacagt cattaatgtt tttgaaggga 2040 aaaaggttgt gtttgaagaa gtccaagaag agaaaatcga aaacaagaac tcattgggaa 2100 ataaattaat ctcttattcg aggactcgta aataacagaa acgacaattg gaacttgttt 2160 aaatgagact cttgttgttg ataccattaa agatcctaca aaactcacca aaataaaaca 2220 agtcaaagct tcaactgaag tagacttagc tgaaggaaag acaaaatcac aacacaagaa 2280 aaaagatcca tctcaatcta aaccgaagca ttgttttgtt tgtggcaatc taggccatta 2340 tgcaaaaact tgcaaaatga agcaaataac accagacttt aaacaataac taatctattt 2400 aattcaattt tttattacct ttttatacac tttgattatt tatacatttc atatattgcc 2460 cataatttga tagtaggcaa attgtttgtg tattgtactg cagcctcagc ttaaaaatat 2520 attttttgta tcattcacac ctaaacttat gtttcaaaat agacttttga cattaagaaa 2580 gtcgccagat atgacattag gcagtattta tttacaaatg aagacaactg tggtaaacct 2640 cgtcttattg tttacaaaga aacaatgtgt gatattgtca atttatttta atcatagttt 2700 gttttttaag tttaaagata gttttacaaa actttaaaag tttgaagaat tgctacaatt 2760 aaaaaagctt ttttggttta aaaaaatcaa aaatagaaaa acacaatttt tcacaaaatg 2820 ttatttgatt ttatttaatt gtgcaactta tgtgcactgg tgtcggtgga agatagctaa 2880 ccc 2883 // ID Mariner-25_SM repbase; DNA; INV; 1888 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-25_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1888 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1874-1874 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 247..825 FT /product="Mariner-25_SM_1p" FT /translation="MARKRTEITIHQKAKICAIKESKKMSLSELSQIIRRE FT LNLDLGISTLSEILKNKEKWENCESSNTYFRQSSKIHSLLENSLIEWISRM FT DAINGYVTDEIIITKAKEFGQKLNVDDLQFSKGWLYRFKNRFNLKLKVLRG FT EANSMNDVDVSAHRKKMVEKLAGIDPEVILNMDETGLFFKHFLQERFHQVK FT EKA" FT CDS 987..1754 FT /product="Mariner-25_SM_2p" FT /translation="MTTDLFNCFMQHLDDELRKYKKKFYLLLDNAPSHSLK FT KCFQNIEVIMLPKNSTAHLQPMDAGIIKNLKHYYRKKLVVDYMNSLDQTKA FT FVVNLKKAILILHEAWEDVKEETIKNCFKKVAIIPGLHGSYAEENLSSDPV FT LETHKNELDQIMVEEDMIQTEEVIDDDEIIALCQEEYTVEDVVLSAEDGQV FT DDSYNISPSNLFHAIEIVIKASSSSNSSTQKITNDTVKNLKEVKDDLQKFI FT QKKTLQKTLDDFLNK" XX SQ Sequence 1888 BP; 717 A; 286 C; 310 G; 575 T; 0 other; cagtcaaccc tcattaatcc gaacttcaat aatccgaaaa cttcatagtc cgaaaaaaat 60 ttatttattt cgtcaatttt cgaacgtttc cagacaaatt tgattcatta atccgaaact 120 tcatgatccg caatcggaaa gctctttttt gaattaaatt tctaataatt tttaaaagtt 180 tcggattaat gttgtttaaa tattaagctt gtttacaatt ttaatttatt taaattaaat 240 atccatatgg ctagaaaaag aacagaaatt acaattcatc aaaaggcaaa aatttgtgca 300 ataaaggagt cgaagaaaat gtcgctgagt gaattgtctc aaattatcag aagagaatta 360 aacctggacc taggaatttc tacgttatca gaaattttga aaaataaaga gaaatgggaa 420 aattgtgaaa gctcaaacac ttacttcaga caatcttcga aaattcattc tctgcttgag 480 aattcattga tcgaatggat atctcgaatg gatgcgataa atggttacgt gactgatgaa 540 attatcatta caaaagcaaa ggaatttgga caaaaactaa atgttgatga cctgcaattc 600 agcaaaggat ggctctatag attcaaaaat cgatttaatc tgaaattgaa agttctccgc 660 ggagaagcaa attcaatgaa tgatgttgat gtttcagccc atagaaaaaa aatggttgaa 720 aaattggcag gaattgaccc agaagttata ctcaatatgg atgaaactgg actttttttc 780 aagcatttcc tacaagaacg atttcatcaa gtgaaagaaa aggcataaaa caatctaaag 840 ttagaataac catagcttta tgttcaaatg ctagtggatc aattaaaatt acaccctttg 900 ttattggaca tagcaagaaa ccaagatgtt ttaaaggatt taatgttgca agatactgta 960 attaccacgc aaattcgaaa gcatggatga ccaccgattt gtttaattgt tttatgcagc 1020 atcttgatga tgaactgaga aaatacaaaa agaaatttta tctacttctt gataacgctc 1080 caagtcatag cttgaagaaa tgttttcaga acattgaagt tataatgctt ccgaagaact 1140 caactgctca tcttcaaccc atggatgcag gtataataaa aaatttaaag cattattaca 1200 gaaaaaagtt agtagttgat tacatgaact cgcttgatca aacaaaagcc tttgttgtta 1260 atcttaagaa agccatattg atactccatg aagcttggga agatgtgaaa gaagaaacca 1320 tcaaaaattg tttcaaaaag gttgcgatta ttccaggtct tcacggatct tatgctgaag 1380 aaaacttgtc atcagatccc gttcttgaga cacacaagaa tgagcttgac caaataatgg 1440 ttgaagagga tatgattcaa acagaagaag tcattgatga tgatgaaatc attgcactgt 1500 gtcaagaaga atacactgtt gaagatgttg tacttagtgc tgaagatggt caggtcgatg 1560 acagttacaa cataagtcca agcaatttat tccatgcaat tgaaattgtc atcaaagcaa 1620 gttcttcttc aaacagctct actcaaaaaa taaccaatga tacagtaaaa aatttaaaag 1680 aagtcaaaga tgatttacag aaattcattc aaaagaaaac tctacaaaaa actttagatg 1740 attttttaaa taaataattg tctttttctt aataatttca tcattaatct ggcgatcata 1800 tagattttaa attccttcat tatccgaatt aattcacaat ccgaaaggaa atcgatttct 1860 aatttttcgg attaatgagg gttgactg 1888 // ID Shinagawa-6_AAe repbase; DNA; INV; 2525 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2525 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 843-843 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. TIRs are 117 bp long and composed CC by degenerate repeats. Related non-autonomous elements, named CC Shinagawa, are found in Aedes aegypti and Culex CC quinquefasciatus. The insertion of DNA-TA-8_AAe-like transposon CC around 610 is excluded from the consensus. XX SQ Sequence 2525 BP; 815 A; 434 C; 471 G; 804 T; 1 other; gattgtccga catttccccg aaaaccattt ccccgaatga tttcttcccg aaggattttc 60 tcccgaaaac cattttcccg aatgaactgt ttccccgaat gtaccgtttc cccgaaagct 120 ttttccaaag acatttttca tttatttaaa aaaaatatcg gtgaaggtag catccacaaa 180 ttacgtaacg ctctagggag gagtaggctc aaacgttacg gctcacacaa aaatttgaaa 240 tttttcatac aaaaagcgtt acggatggag ggagggggtc acaaatttcc aatttaggcg 300 ttacttaata aatggacgct gcctaagact atattcagca agcttgataa tcgcttcgaa 360 cggatgattc tcttatgaat gatttgaatc gaaaaacgaa agatgagata tttggagtga 420 aaatcgatga cagtttttac tggcaggtct gcttttaaaa atatgttaaa aaagtaaagt 480 tcggttttca aattttatta tgctaagaat cgcaataaaa atacataaaa tatttcgaaa 540 gatcattcaa gagatgaaca aatctgacgc ttaatgttgt ttgttcgaac ataattcatc 600 ttaatgaaat ctagctgaac gggcaaaggc agaataatga aataaagttt gacctggttg 660 aaaacagttt gcaggattca aagacgtgac cctaatgaga tcttcactgc atgaaaactt 720 agaaagaatg aatgagtcac aaatgtcgtt ttcgttttta ggaactgggt cagtgatcaa 780 cacataaaca aaccaacgtc aagaaaattg tagctcggtc gaacctgaat cgggttggag 840 ttgtaactgt gagtcagagc gaccagttcc aagtttcgcc agttccaacc taggctcaaa 900 aacgaattcg acattagttg attggggacg catttgacag tccagcacac cgtgtccaga 960 ctatcattcg atcattattt ccgcgttagt agattgtgtt tcttaaaaca taagcagatg 1020 ttttttttaa ctacaactga gaacgtaaaa ggcttgtttt caattttata aaaatccctt 1080 ctttaggatg ttcttcttta tattactggt ccaataatca ttggcgtcaa aatagttgat 1140 aattttaaaa ctctcgcctt cttatcgtcg taggcagttc tttaaagctt tgctggtttc 1200 atattgttct gcttgcagct gctactgatt tcatcataaa tttgcccttc ttttatttga 1260 aggctgttct tcatagataa atatatcaat ataaattaac agagctactg aaactttaat 1320 cagatttttt ctggtaatta catgctacta gttcaaatgt cattgtgaga acgaatacta 1380 cagattctaa tattttgatc caacattttc aatgctttcg acctacttaa ttaattaaga 1440 cttaattact taacttaatt aggagttttt tttttcaaat aggcgatttt ttctgtttaa 1500 ttagtgatta ttagcatttt tagtattcat tactaacatg ttctctgtat ttaacgtttt 1560 cgactagatg acccatttag gaacgatttg gaaaaattga cgttcgggag aattgtttct 1620 atccaagaca taaaaaaaga tttttatctg gaaaactaat gcttattctg cgcggcgtgt 1680 gagtcgagac gagtcgttct cacctcgggt gacgtgagac aactaatgta aatcaaatgg 1740 gaacgagtcg taacctcatt cgacttgtct cacctcacac gccgcgcaga ataagcacaa 1800 ttgattgtca gggggagaag aaagactaga agggagactc aaggtttaat tatgtagtgc 1860 aagatataat agattctaag gatcaatggt ttcactgtag ctagtcacta agtgtaatga 1920 aattacgttt aacctagtgg acatatggca cgaagacgtt tggcatcttt gtgtaataat 1980 gcgaaacaat gtgaaagaac agcctattat taaaagaagg gaaatttaag attatcagct 2040 aagtgtcact gaatattgga acagtataaa tataaagaac agcctattat tctaagaagg 2100 caaaactaag ccgaatgttc attcgtccaa atgttcgtaa gccaaatatc tatactccaa 2160 agataccgct tcaaatgaaa ctgcctatca tgtgctcgct tttttcatca gggtatcaca 2220 aaaattaatc taagttgttt tgagccaaat gctcgtggcc attattttga acatctgtaa 2280 ctttaaacac atggattttt atcacaaatt aatcaatttt cgtctaatct tacatcagct 2340 aacacgttga atactttttg gcagcaggaa catggatagc tttcagtatg tcatttacat 2400 tacgattttt tcggggaaat ggtacattcg ggaaaataat tttcggggaa atagttcatt 2460 cgggaaaack tcattcgagg aaattgcatt cggggaaatg gttttcgggg aaatgtcgga 2520 caatc 2525 // ID Gypsy-613_AA-LTR repbase; DNA; INV; 2178 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-613_AA_; KW Ty3_gypsy_Ele177; Gypsy-613_AA-I; Gypsy-613_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2178 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 2178 BP; 554 A; 487 C; 533 G; 604 T; 0 other; tgtagcaagt acgctttatt atttattata aattctttgt taaatttaag actgtaaata 60 ttacatgtta ttttgttatt tgttaattaa agatttgtta atttattaaa ctttgcatgt 120 ccattcgttt ttctaaatta aatctccaat tcaatcaccg ccaattgctc tcatagagaa 180 tatacgacat agttactcac gcctaaaacg agtgtcttga gttccctatg gtcagctacc 240 aggaaagcgc atggttttgc aagcctgttg agcgtcatga cgccacccaa aatccgcttg 300 ggtcgcgggc actgacccat atgcctatgc tattcaggat gcgttgacct tttccttgtt 360 cggcattcca cgcttgtcac gaagcttagc gagcaagctg actagatggt ttgaatagtc 420 agttggaatg agatgcacga cttcgggaaa tattgcagat aaattccttt gatgctcggt 480 ttcgcaaaat ttaacttcgt tgaatttcgc gaagtctggg gacaatgttg aaaaactgcg 540 gcatctgagg attactccag cctcgaatcg ttcactttgt tccgtttcgt ggaagaggca 600 agaacggtct attttgttac cgtgccgcgg tgtgattgtg aaggtgaaag tgacatagga 660 aacttaagat atttagctaa tttatgttct tatatccctc tagttgagcg cagtagtcaa 720 ttatgcgcaa gcagatcatt atttgcagat cgtcagtgta gtgcgttgat cagtgtttcc 780 aaggtaatgg acgttattgg aaaggtgaaa caatatttta attccttccc cacgtgtgcg 840 catgaaaatt taccggaata ggtcgtggca gcataggcga agccgtcaca ctgcgctctc 900 cgtttgcgac cgtttttgac cccacgcaac ggcttcaaag cttcccgggg tacacgtggc 960 cggagaagga cgcgatcgcc aatttcgcgc ctttggcagg caagcctacc atttcgtcaa 1020 ccagcaagag tcgagaggag accaaccccc cctgcagcga gcatcgaagc accgaacaag 1080 ccagcacgcc accggagcaa attgccgacg gcgcctcaac gtttgtttgc gcgccactgc 1140 aggatcgagc gtttgtaccg cgcttacgcg taagcccgcg caccggtgag ggagacacca 1200 ccaaccgcat cgcccgttcg atcgccatcc cggcgagagt accaggctgg aggtagtgcg 1260 ctaatctgcc ccagtcaacg taagcgtctc cccacccaca tcaaaaccct aggtaaccgt 1320 gagtactatt gcatgcatgt ttgttagtgt cgtccatggc catgtgaccc aatagaggca 1380 caacaacatt attttgctag tccgtcatag cccataattc ggcggcgcta gcatagaact 1440 cacccgagag tgagaggtgg aggtgtgagc tccccggagc gaatgtcgga gttcgaagct 1500 tgaagagaag aggagcagca gaagatctcg cgcatcgagt gcgcgccgca ggagaagttg 1560 atccgcacac acagaaatag ctagagggaa gttagtggga aagagagtag actaggccta 1620 ggcaatgtag agagcgatag tggaaataaa gcatgtataa ctagttttgg agcaatccat 1680 gtatttttgt tcccccaata aaatttagtt tgctccatgc acggaggaaa aagctctatg 1740 tattttgaat gtatttagtt agcgtctttt aaggtctcta tgttgcttcg cgtccgtatg 1800 tagttttttg tagtgatttt gggttattta ctgggtggtc agcccgaatc tgactttatg 1860 ctattttggg actgttcttt agttttctgc ggattgggac gggtgaactt tttggactgg 1920 acctgtgagg aaaaactccg cctttataac ttgtcctcag ccgtttatcg gaggaattta 1980 tttgtaagct tttctgaagt tgatgtgtcc gctcggaaac gtgaagtacg acggactatt 2040 tgagtcagcc agtcagctta ctcactaata aagtgaccct tccccaagaa ggtctcaagc 2100 cgggcaacgc aatctttgtt gggtggtctc ctttcggagt ggcgcataag ctacccaatt 2160 tcaaaagtaa gctcgaca 2178 // ID VENSMAR1 repbase; DNA; INV; 1293 BP. XX AC AJ507234; XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 12-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Ventiella sulfuris mariner-like transposon Vensmar1.3. XX KW Mariner/Tc1; DNA transposon; Transposable Element; VENSMAR1. XX OS Ventiella sulfuris OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Peracarida; Amphipoda; Gammaridea; Lysianassoidea; OC Lysianassoidea incertae sedis; Ventiella. XX RN [1] RP 1-1293 RA Halaimia-Toumi N., Casse N., Demattei M.V., Renault S., RA Pradier E., Bigot Y. and Laulier M.; RT "The GC-rich transposon Bytmar1 from the deep-sea hydrothermal RT crab, Bythograea thermydron, may encode three transposase RT isoforms from a single ORF."; RL J Mol Evol 59(6), 747-760 (2004). XX DR EMBL/GenBank/DDBJ; AJ507234; Positions 1 1293. XX FH Key Location/Qualifiers FT CDS 182..1222 FT /product="VENSMAR1_1p" FT /translation="MDKIEYRAVINFLTKEGKNAKKIHDRLVAVYNDTAPS FT YATATRWHKKFHHGRESLEDDPRVGRPSEATSEDIVDRVEATITENRRVKV FT EEISLEIGISHGSVCTIIHHHLGMSKVSARWVPRNLSLPDRLQRHTSSEEL FT LTTQTQRAFESRVVTGDETWVHHWDPETKNESMAWKHKGSLVPLKFRTQPS FT AGKIMATIFWDAEGVLLVDFQPRGSTITGEYYDGVLGRLRDSIRQKRRGKL FT TRGVLLLHDNTPVRKARRAQAALRDCGFEQLNHPPYSPDLAPSDYFLFRQL FT KSSLRERRFDNDDEVKEALMMWLEKQSESFWLARYQSLRDKWFKCIQVKGN FT YFEK" XX SQ Sequence 1293 BP; 302 A; 342 C; 369 G; 280 T; 0 other; tacgaggggc ggtcagaaag ttatgtaact cggtatgtta ggtagcgcaa ccagtcgtga 60 cctagtgtgt ccgtttagga ctggttttga catgttcaca tgcagcgccg ctccgctccc 120 tcgctccctc ctcagtcttt agggagcaag cagtggacgt ggttggcagg agtcggcaac 180 aatggacaag atcgagtacc gtgcagtgat caatttcttg acaaaagagg ggaagaacgc 240 gaagaagatc cacgacaggc tggttgcggt gtacaacgac actgcccctt cgtatgccac 300 agccacccgg tggcacaaaa aatttcatca tggccgtgag tcccttgaag acgacccccg 360 tgtgggacgc ccctccgagg cgacctccga agacattgtt gatcgtgtgg aggcaacgat 420 cacggagaat cggcgagtga aggtggagga aatttcgttg gagattggaa tttctcatgg 480 aagcgtttgc accattattc atcatcacct gggcatgagc aaagtttctg ctcgttgggt 540 gcctcgaaat ctttctctgc ctgatcggct tcagcgccac acaagttcgg aggagctgct 600 gacaacgcag acccagcggg cattcgagtc gagggtcgtg acaggtgatg aaacgtgggt 660 tcaccactgg gacccagaga cgaagaacga gagcatggcc tggaaacaca agggatctct 720 ggtaccgctc aagtttcgga cccaaccatc ggctggcaag atcatggcca ccatcttctg 780 ggacgccgag ggagtgctgc tggtggactt ccagccacgt ggctctacga tcacagggga 840 gtactacgac ggagtactcg gtcgcttgag ggactccatc cgtcagaaga ggcggggcaa 900 gttgacccgt ggtgtcctcc tcctccacga caacaccccg gtccgcaagg cccgccgtgc 960 ccaggctgct ctgagggact gcggcttcga gcagctcaat cacccaccct acagtccgga 1020 cctggctccc agcgactact ttctgttccg ccagctcaag tcctcgttgc gggagcggag 1080 gttcgacaac gacgatgagg tcaaggaggc tctgatgatg tggttggaga agcagtcgga 1140 atcattctgg ctggcaagat accagagcct tcgcgacaag tggttcaagt gtattcaagt 1200 caagggtaat tactttgaaa aatgatgtgg ttatcatttt cattcctcca aaataaatac 1260 ctgaattgca taactttctg accgcccctc cta 1293 // ID BEL-2_CQ-LTR repbase; DNA; INV; 595 BP. XX AC AAWU01001625; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_CQ_; KW BEL-2_CQ-I; BEL-2_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-595 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 158-158 (2011). XX DR Genome; AAWU01001625; Positions 2385 2979. XX SQ Sequence 595 BP; 193 A; 140 C; 124 G; 138 T; 0 other; tgtaagatgc gccacgacga ccagcgcacc ccaagctgcc cttcgtcgac gcgcgacgat 60 tccccacgcc cgctgcacca cgaagggatt gttttgttcc aagggtcagc ttaaagtcac 120 gaaaagcaga tcaaccgcgc gcgctagaag aaaagattaa ttgtacccag aaagtgaacc 180 tagaagcgtc aagttagagc gtccagttgt gcacgaaact aaaattggat ataatctgga 240 ttgaattgag gtctacttac ggagaaatta ggaagtaact aagttgcgaa aagtagaact 300 cgagcagttc aaacgattag tacgctattc tacgactaca cacaagtaca gagcgcctgg 360 agattaaagc tcgcggaatg ttgctatctt taattccctg taactcaccc gcaaattgaa 420 cagaattgtt gtttggtagt aagctctacg agagcactaa acgtaagacc tagctgataa 480 acattgatcg caagccatta actcatatgc ttcatctcca ggaaatttgt aactcgcctt 540 ctaccgcaag aataaagtta acaaatttaa atcgctttcg ctacaaccaa caaca 595 // ID BEL-111_AA-I repbase; DNA; INV; 5685 BP. XX AC supercont1.78; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-111_AA_; KW BEL-111_AA-LTR; BEL-111_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5685 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.78; Positions 2573857 2579541. XX CC 'TTATT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(290..3430,3434..5653) FT /product="BEL-111_AA-I_1p" FT /translation="MWRKLLYPEEASYQSPVRNHPEKKMSDEELKLLVHKR FT GLVKSKVTRIRNSLRAAEENPETISAPQLRVFARSLENHYREFNEVHDALI FT SACAANQRDSQERKYEEFEVLYNETSLVIESLKDAKTPEPVAAVFQQGGVQ FT PQVVHQQTLKAPLPTFDGKYEHWPRFKAMFQDIMGRSPDCDAVKLYHLEKS FT LVDAAAGVIDSQTIQDNNYAQAWAILEQRYENKRLIVDLHIRGLLNLKHMT FT RKSSKELRQLLDECCRHVENLKFLGQELEGVSELFVVNVLTAALDKETREF FT WESKQDHGELPTYDETIECLKNRCLVLERCEAVHPPPLTTKFSSGFQKSSI FT QKVANAVTSSDSVICEFCSGQHPNFKCSAFRNMTVAQRLMKVKEEKVCFNC FT LRKGHRSNTCLSEKTCSKCSQKHNTLLHFDDPGLPEPELQSTPPVSVNVAN FT ASTDPEPVPTTSALVSHVSSTRKSRSSSKEVLLLTALVSLIDDSGRVHPCR FT ALLDCASQVCLISSDMVQKLGKRPRSTNTEVVGVTGRKCASQEVVVSVASK FT VGEYKVDIPCLVMPQVSGTIPTRKIEIKKWQLPANVELADPEFYKPKQVDL FT LIGNEFFFSLLKSGQMKISDDLPLLQETVFGWVIAGPVDVPQQNVVFSHVV FT TEENSSDLLQKFWALEEVIESSSSTTEEQQVESHFVNTHRRDETGRFVVRL FT PFRETVGELIDNRALALKRFFLLERRLQRYPEIKQQYQDFITEYEALGHCC FT EVSEAADPTGVQRYYLPHHAVLKLTSSSTKLRVVFDATARSSGPSLNDVLK FT VGPTIQNDLFTILLQFRRHRFAFSADITKMYRQVVVDQRDTRFQRIFWRKQ FT PTDPIRVLELNTVTYGTASAPFLATRSLVQLAKDEQADFPAAAEVVCEHFY FT IDDALTGANTITEAVCLREDLKKMLARGGFDIRKWCSNSDAVLENVPEEEK FT EKLVKIDDYENTVRTLGLLWDPKNDHFRFIVPSDEDTLQPVTKRFVLSRIS FT RLFDPLGFVSPVVLLAKLFMQSLWVQKLEWDEKIPGDLLAHLKFSNSLKFL FT NEIKIPRCVVPQDCEGFELHGFADASSSAYGACLYLRTVDKNGDINCNLLT FT SKSRVAPLSEMTIPRKELCAALLLSRLLDKVLSALKLPISKIILWSDSQIV FT LSWLKKAPTRLEVFVRNRVAEINRNTQQCLWQYVRSSENPADVVSRGQFPE FT VLDKNALWWFGPEFLRNMHYEPAVVEEIPDSDTPEMKSEVIVNLATIEQLP FT VFTSYGSFRKLQRIVALVLRFINNTRMKDVAKRHLSRSISVAEMRASLVTI FT VRVIQHTELPDEVISQHAAESSKKLARLCPVLDEVNGVLRVGGRLEKSALP FT FDAKHQMILPDHHPVTKLLIRSLHEENMHAGPSNLLAILRQKFWLLRARST FT IRGVIRSCVSCFRASPRRVEQLMGNLPDFRVNPAMPFEYTGVDYAGPIMVK FT EGKYRPKMIKGYIAVFVCLVTKNVHLELVSSLTTEAFLAALDRFVNHHGLV FT KRILSDNATNFTGASNELHALYLQFRDEAIVSKISDFLLPREVEWHFIPPR FT APNFAGLMEAGVKSVKTHLKRTLQNATLNFEEFATVLTHIQAILNSRPLYA FT LSDDPTEPMPITPAHLQLGRPLNSIPKPTYLNTKENRLSRWEYLTLLRDHF FT WKRWSREYLVTLQSRGKWIKPSNNIVPGMIVLVIEDNLPPLTWKYGKVVKT FT YPGEDSLVRVVDVKTVSGTFKRSISKLAPLPIKDNDDLQQGDSSTNGVDSA FT LDLNSVSLGGTSALDV" XX SQ Sequence 5685 BP; 1537 A; 1279 C; 1364 G; 1505 T; 0 other; ttttggtccc atcgaaccgg accgtcggcc attttggaac aaacgtgcta tcgaagtgaa 60 aaatacctat ccagtggaaa gaaaaacgca ttttaaaatg gatccaattg gaatgcaaaa 120 acattgaaaa gagacctttc cgaaagagtg gatacaagtg ccaagctgta ttgcgaataa 180 tgcaaagtga aagttcccgg tgaatttcgt cgcgaattta ttgtttccga aataattttc 240 cgtatggata tcgatcccgt cgttatcctg ttccagtttc gtacgggtaa tgtggcgtaa 300 gttgctgtac ccggaagaag ccagctacca aagtccagtg agaaatcatc ctgaaaagaa 360 aatgtcggac gaagaattga aactgcttgt tcataagcgt ggattagtga aatcaaaagt 420 gacccgcatc agaaattcat tgagagcagc cgaagaaaat cccgaaacaa taagtgcccc 480 ccagcttcga gtttttgcta gaagcctgga gaatcattat cgcgaattca atgaagttca 540 cgatgcgtta attagtgcct gtgctgcgaa ccagagggat tcgcaggaga gaaaatacga 600 agaattcgaa gtgttgtaca acgaaacgag cctggttatc gaatcactga aagatgccaa 660 gacaccagaa cctgttgccg ccgtctttca gcaaggaggt gtgcagcccc aagtcgtcca 720 ccaacaaaca ctcaaggctc cactcccgac attcgacgga aagtacgagc attggcctcg 780 tttcaaggca atgttccagg atattatggg ccggtcccca gactgtgacg ctgtaaagct 840 ttaccatttg gaaaagtctt tagtcgatgc tgctgctgga gtaattgact cccaaactat 900 ccaagacaac aattatgctc aggcgtgggc aatccttgaa caacgttacg agaacaaacg 960 tctcatagta gatttgcaca ttcggggtct actcaatctc aaacatatga cccgaaagtc 1020 atccaaagag ctacgccagt tattggatga gtgttgtcgg cacgtcgaga accttaaatt 1080 tctgggacag gagcttgaag gagtgtcaga gttgttcgtc gtcaacgttc ttactgccgc 1140 attggataag gaaacccggg agttttggga atcaaaacaa gaccatggag agctgccaac 1200 ttatgacgaa actattgagt gtttgaaaaa tcggtgtctt gttctggaga gatgcgaggc 1260 ggttcatcct cctccactga cgacgaagtt ttcatcagga tttcagaagt catctattca 1320 aaaggtggct aacgctgtca cgtcgtccga tagtgtgatc tgtgagtttt gcagtggaca 1380 gcatcccaat ttcaagtgtt ctgcattccg caacatgacg gttgctcagc gcctaatgaa 1440 agtgaaggag gaaaaggttt gttttaattg cctccgaaag gggcaccgaa gcaacacctg 1500 cttgtcggag aaaacgtgct caaaatgttc acaaaagcac aacaccttgt tgcattttga 1560 tgatcctggt ctgccagaac cagagttaca atccacacct ccagtaagcg tcaacgttgc 1620 gaacgcatca acggatcccg aaccggtgcc tactactagc gcattggtaa gtcacgtatc 1680 cagtactcga aaatcccgtt catcatcgaa ggaggttctg ctgctgaccg ctctggtgag 1740 cctcattgac gatagtggcc gtgtccatcc ttgccgtgca ttgctggact gtgcatcgca 1800 agtgtgtctg atctccagcg acatggtcca gaaacttgga aaacggccga ggtctactaa 1860 cacggaggta gtaggcgtaa caggtagaaa gtgtgccagc caagaagttg tcgtttctgt 1920 tgcctccaag gtcggtgagt acaaggtcga cattccgtgc cttgtcatgc ctcaggtgag 1980 tggtacgatt cctaccagaa agattgaaat caagaagtgg caactaccag ccaacgtgga 2040 gttggcagac ccggaattct acaaaccgaa acaagtagat ctccttatcg gcaacgagtt 2100 tttcttttcc cttcttaaat ccggtcaaat gaaaatttcc gatgatttgc cgttgctcca 2160 agaaactgtc tttgggtggg ttattgctgg gccagtagac gttccgcagc aaaatgttgt 2220 attttcccat gttgttaccg aggagaactc atccgatcta cttcaaaaat tttgggcttt 2280 ggaagaggtc atcgaatcgt catcatccac gaccgaagaa caacaagttg aaagtcactt 2340 tgtgaatacc caccgtcgag acgaaacagg ccgttttgtc gtccggctac cattccgtga 2400 gacagttggt gagcttattg ataaccgtgc cttggcgttg aagcggtttt ttctgcttga 2460 acgtagattg cagcgttatc cggaaataaa gcagcagtac caagatttca tcacggaata 2520 tgaagctctc ggccattgtt gtgaagtgtc agaggcagcc gatccaactg gagttcaacg 2580 gtactacctg ccacaccatg cggttttgaa attaactagc tccagcacca agcttcgagt 2640 ggtattcgat gccactgcgc gatcatctgg gccgtcattg aacgatgtcc tgaaagtcgg 2700 tcctactatt caaaacgatc tgtttactat cctcttgcaa tttcgtcggc atcgctttgc 2760 cttctccgcc gacatcacaa aaatgtatag gcaagttgtg gtagatcaac gtgatacccg 2820 tttccaacgc atattctgga ggaagcagcc taccgatccc atacgtgttc tggaactcaa 2880 caccgttacc tatggtactg catcggctcc attcttggct acaaggtcac tggtgcagct 2940 tgccaaagat gaacaagccg attttcctgc ggctgccgag gtcgtttgtg agcattttta 3000 cattgacgat gcgctcaccg gggcaaatac tatcacagag gctgtttgtt tgagagaaga 3060 cctgaagaaa atgctcgcca gagggggatt cgacatccga aagtggtgtt ctaattccga 3120 tgcggttttg gaaaacgttc ctgaggagga gaaggaaaag ctcgttaaaa tcgacgacta 3180 cgaaaacact gttcgcacgc tcggattact gtgggatcct aaaaatgatc atttccggtt 3240 tatcgttccg tccgacgaag acacactgca acctgtcacc aagagatttg tcttatctcg 3300 tatatcaaga ttgttcgatc cacttggttt tgtttcacca gttgttctgc tagcgaagtt 3360 gttcatgcag tcgctttggg ttcaaaaatt ggagtgggat gaaaaaatac caggtgatct 3420 gcttgcccat tgattgaaat tcagtaattc cttgaagttc cttaacgaaa tcaaaattcc 3480 gagatgtgtt gttcctcaag actgtgaggg gttcgaactc catggatttg cagacgcttc 3540 gtcatcggcg tacggggcat gtttgtattt aaggactgtg gacaaaaacg gcgatataaa 3600 ctgtaatctt ctgaccagta agtcgcgggt tgctccgctc tcagaaatga ccatacctcg 3660 taaagaattg tgtgctgcac tcctgttgtc acgtttattg gataaagttt tgtccgcctt 3720 gaagcttcct atttccaaga ttattctgtg gtcagacagc cagatcgtgc tttcgtggtt 3780 gaagaaggca ccaacgcgtt tggaagtttt tgtgcgcaac cgcgtagcgg aaatcaaccg 3840 aaacacacag cagtgtttgt ggcagtatgt gcgatcttcc gaaaacccgg ctgatgtcgt 3900 ttcacgcgga cagtttcctg aggttctgga caaaaatgca ttatggtggt ttggaccaga 3960 atttctgcgg aacatgcatt atgaaccagc agttgttgaa gagattcctg atagcgatac 4020 acctgagatg aaatctgaag tgattgtcaa cttggcaacg attgagcagc tacccgtttt 4080 tacatcctat ggttcatttc gaaaactcca acgaatcgtt gcattggttc tacgctttat 4140 taataatacc aggatgaagg atgtggcaaa acgccaccta tcacgttcaa tatcggttgc 4200 tgaaatgaga gcatcattag ttacgattgt tcgcgttatc cagcataccg agctacctga 4260 tgaggttatc agtcagcatg cggcagaatc ttccaaaaaa cttgcccgtc tatgtccagt 4320 attggatgaa gtgaatggcg tgctcagagt agggggacgt ctggaaaaat ctgctctacc 4380 attcgatgcg aagcaccaaa tgatccttcc agatcaccac ccggttacga agttgttaat 4440 tcgatcattg cacgaagaaa acatgcatgc cggtccatcc aacctgttgg ctatacttcg 4500 gcaaaagttt tggttgttga gggccaggtc taccattcga ggagtcatca gatcatgcgt 4560 ttcctgtttc agggctagcc ctcgaagggt ggaacagttg atgggtaacc ttcctgactt 4620 tcgggtgaat cctgccatgc catttgaata cactggagtt gactacgcag gacctataat 4680 ggtaaaggaa ggaaaatatc gaccgaaaat gattaagggt tatatagcag tcttcgtgtg 4740 tttggtcacc aagaacgtgc acttggagtt agtgtccagt ctcaccaccg aagcgttcct 4800 tgctgcgctt gaccgtttcg ttaatcatca tgggttggtg aagagaatcc tgtcagataa 4860 tgcaacaaat ttcaccgggg cttccaacga gttgcacgca ttgtatctgc agtttcgaga 4920 tgaagctatt gtgtccaaaa tcagcgactt cctgctccca cgagaagttg aatggcattt 4980 cataccacca cgagctccca atttcgctgg tttaatggaa gctggtgtaa aaagtgtgaa 5040 aacgcatctt aagcgtacgt tgcagaatgc tacactcaat tttgaggaat tcgccaccgt 5100 gctgacgcac atacaagcca ttttgaattc tcgtcctctg tatgctctct cagatgatcc 5160 taccgagcct atgccgataa caccagctca tcttcaacta ggcagaccac tgaattcgat 5220 tccgaagcca acgtacttga acaccaagga gaacagactg tcgcgatggg agtacttgac 5280 gcttctgcgg gaccatttct ggaagagatg gtccagagaa tatcttgtca cacttcaaag 5340 ccgtggaaaa tggatcaaac cttccaataa tatagtacct ggaatgatag ttctggtgat 5400 tgaggacaat ctaccgccac ttacctggaa gtatggaaag gttgtgaaga cgtatcccgg 5460 agaggactcg ttagtcagag tggtcgacgt aaaaacggtt tctggaactt tcaaacgatc 5520 catcagtaag ttggcccctc ttcctataaa ggataacgac gatcttcagc agggtgattc 5580 ttccacaaat ggtgttgatt ctgctttgga tttgaattca gtttcgctgg gcggtacgtc 5640 tgctcttgat gtttgaagtc ttctgttctt caacgcgggg gagaa 5685 // ID Perere2_Smed repbase; DNA; INV; 2035 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Perere_Smed is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; retrotransposon; Penelope-like element; KW GIY-YIG endonuclease; Perere2_Smed. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2035 RA Jurka J.; RT "Penelope-like elements in planaria."; RL Repbase Reports 9(8), 1910-1910 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. This sequence was derived from CC sequence data generated by Washington University School of CC Medicine: Genome Sequencing Center. XX FH Key Location/Qualifiers FT CDS 132..1583 FT /product="Perere2_Smed_1p" FT /translation="MPMRPVLSMVGTAQYKIAKFLDGLLKPLIIADFECKD FT SFEFSTFIAQLNKKNADEFMVSFDVCSLFTNIPLVETIELCCKMWKENVNE FT HERVDEIAFRKLLEFATSNVNFLFNNGWYKQVDGVAMGSPLAPTMASIFMS FT SLEKKISSYNLTKPTVYKRYVDDIFLVFENREHVKPFLDFMNSLHLNIKFT FT CEEEKMSSIAFLDLLIQRKEKAYQTEIYRKATDTGLYTTPESFCEFKYKRN FT MVRGLVFRAWSLSSTFVNSVKSVDSLVALLLKNGYSKSFLERMTKETVDKC FT VLNSHDCCTDLSGGNILIDSEISKLSSLKGEQNRSKHIEPKCMLVLPYSEG FT FREFKHKILKHVDKTQFRIVSTSCKVSSMFTNKSPTPVGLCSELVYQFSCN FT GCNATYIGETARHLCTRVLEHCRPNGLTHISDHFRKCTSKKIGMSDFKIIA FT KNLSNYWERITCEALLIKSLNPIINVQNVQSTCILNVFK" XX SQ Sequence 2035 BP; 658 A; 250 C; 372 G; 751 T; 4 other; attgttttat caaaaatgtt gaaggaacta gctcgtaaaa acatattgac ggagaactgg 60 ctaagtctat ccatccgaaa ggctcccagc cagcaaaact ttatggacta cctaaagtac 120 ataaagaggg tatgcccatg agacctgttt tgtcgatggt aggaactgct caatacaaaa 180 tcgccaaatt tttggatggg ttattaaaac ctttaattat tgctgatttt gaatgtaagg 240 atagttttga attttctacg tttattgcac agttgaacaa aaagaatgca gatgaattta 300 tggtctcatt tgacgtttgc agtcttttta cgaacattcc attagttgaa accattgagt 360 tatgttgtaa gatgtggaaa gaaaatgtga atgaacatga gagagttgat gaaattgcat 420 ttcgtaaatt actcgaattt gcaacgtcga atgtgaattt tctttttaat aacgggtggt 480 ataaacaggt ggatggggtt gctatgggtt ctccattggc tcccacaatg gcttctattt 540 tcatgtctag tttggaaaag aaaatttcat cttacaattt aactaaacca actgtgtaca 600 aaaggtatgt tgatgatata tttttggtnt ttgaaaatcg tgagcatgta aagcctttct 660 tggattttat gaacagtcta catttgaata ttaaatttac gtgtgaagag gaaaaaatgt 720 cgtcgattgc ttttttggat ttgcttattc aacggaagga aaaagcctat caaacggaaa 780 tatacagaaa agcgactgat actggtctgt atactacgcc tgagagcttt tgtgaattta 840 agtataagcg aaatatggtg agaggattag tatttcgcgc atggtctcta tcttccacgt 900 ttgttaactc agtaaaaagt gtggactcgt tggttgcatt gctacttaaa aatggatatt 960 ctaaatcttt tcttgagcgt atgacaaaag agactgtaga taagtgcgtg ttgaattctc 1020 atgattgttg tactgatttg tctggaggta atattcttat tgactctgaa attagtaagt 1080 tgtcaagttt gaaaggtgaa caaaatcgat caaaacacat tgaaccaaaa tgtatgttgg 1140 tattgcctta ttccgagggt ttcagagaat ttaagcataa gatcctcaaa catgtagata 1200 aaactcagtt taggattgtt tcaacgtctt gcaaagttag tagtatgttc actaataagt 1260 ctccaactcc agttggtctt tgttctgagc tcgtttatca attttcatgt aatgggtgta 1320 atgccacata cattggagag actgcccgtc atctctgtac tcgagttctt gagcattgtc 1380 gaccgaatgg attgactcac attagtgatc attttaggaa atgtactagt aaaaaaatag 1440 gtatgtctga cttcaagata attgcaaaaa atctgagtaa ttattgggaa agaataactt 1500 gcgaggcgtt gttgataaaa tctttaaatc ctataattaa tgttcaaaat gttcagtcta 1560 catgtatttt gaatgtattt aagtaattga atgtaaattg tgtatatgat gtatttgatg 1620 taattcgttt gtgatgtttt gtaaaaataa ttttaaaatt taanttttat tttgcgattt 1680 taaatttaat tatttattta aattttatta aaaattttat ttnaatgaat gtttgatttt 1740 tattttttaa attaaaatta aattttgtaa tacgagttga atacgattat tattgattaa 1800 tttgtttaga ttttcctatt ttcatttaga ttttgttaat gtgatttcta gcctgaagaa 1860 gactgataaa cggtcgaaat atatagtgaa atgttatttt atagaatata cagttggcgt 1920 ttttcaatta ttaaaacatt gccttctttc tttgattatt aaattataag atanattttt 1980 aaatttaata taaaatatat atttaaatac ttcaaattct aatttttaaa tataa 2035 // ID Gypsy-1_NG-LTR repbase; DNA; INV; 273 BP. XX AC ADAO01190636; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from parasitoid wasps: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_NG_; KW Gypsy-1_NG-I; Gypsy-1_NG-LTR. XX OS Nasonia giraulti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-273 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from parasitoid wasps."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ADAO01190636; Positions 17 289. XX SQ Sequence 273 BP; 59 A; 81 C; 77 G; 56 T; 0 other; tgtaacggag gagcactagg cttctccgat gacgaaagcc gacgatggtc ctgcatcgtc 60 agcggggaac gccgtcgcct gcgtttctca cgctcagaag tgcagtggat gcacttctgc 120 acgtgggatc gctagccgtt gcctagtcgc gacaacggcg tagcgagtag tctaccggcg 180 accacgacta agagaggagc tcaagcgtcg aataaagctc gaacagcatc tctttcttct 240 tccttctccg tgtccggaga gcaccctcga cca 273 // ID Gypsy-20-I_NVi repbase; DNA; INV; 10691 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-20-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-10691 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 777-777 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 9093..10262 FT /product="Gypsy-20-I_NVi_2p" FT /translation="MLNKRPIVNFDLFTYVNSKFVYVEKHFRGQFKNLYLN FT MLKEECELERKVIQNSLSLASVSPDEFAYNIMQKPGYIARLAGEAIHIIKC FT TPVEVKIQHTDYCYTQLTVSSGNKTMFLTPKTHILLKKGTQINCNSIVPQY FT FKIGDNWINLMPAPRKSNNIIKTLKPSTKIAWQPEKLNSLATSGIYSQEDL FT QKLQQHIMFPMERASLLNTIARGMQGEETVQGGLISNLLTENTLMNIAEST FT WNRFMDKLVRFGSISSAIIMIFIIIHISKLAIDIIIRGYTLHAMYGWTLHL FT LGAIFSSITHLLIVMRRNDIIVNDDNKQTPTTNPQDIQLQEIVVQNPISPK FT KEISPPIPIHPPNSKTYINQPIESKVESKVESKPESKISLFSLKPF*" FT CDS join(1385..3673,3637..4359,4363..5811,5780..7693) FT /product="Gypsy-20-I_NVi_1p" FT /translation="MSDNQKNLLGQKLTRSVVRNNPTLQNQIIIPNNQTPF FT ARSKKIIRSPPANNTTAQGRDEEAVTNTEKLNDTIAKNIEKIDQEINLEQT FT NKENSTGAIKKEISRLNKAHLTINIPTTPNKQESTEISKKEQKVAAVTPIT FT PLTAIRKAEDWKFDNSLKQDISLIDIEGFNEQPSLIGNLNNSQSKFNFNFN FT LDSILDNTMVVGGLGNKQQQQQQAPQQLRLSLKYVANLIPEFDGKCMSVNE FT YVEKLKHAKNMLSETDQANLIPILKMKLKGDVYRAMLNAQINNIQDFIKAV FT RQIYPSTDKMATLYGKIAESLQNPDETVLSFANRLQELVLQVKDSKEVEAY FT TAEEKQXFETKIDAETLLAFVEGLRQEIRLELGATRNLGDAIQKAIEVETR FT LNRRDLIFGKANKVFCSLSNQVAISGDTSVAIYSCQICQDYGHEALFCHKV FT ACVYCKNKDHLSDSCRIVKKKIQLICKFCNNHGHSIDACKMNRIKGNHCQY FT CQVMGHTVTQCPFIIEYELCWKCKESGHDPNNCKNQGKQSETCEFCNGTTH FT TIKDCPDVTCKRCNQRGHVLKYCPILMDRKFLMCAICNKNDHEAEQCKEAK FT ILVAQNRAKACQQDIQCQIGNEIGHLGKYYPKFMAQQQATNNANASYKNRN FT YNNYNNNYRKSNNFNNQNKNTSKPNWKQISCEYCKIRGHPIQECRKLKNLE FT NMAQERKNCSYCKEDGHTVENCQEIKALESRANKFCSYCKNTNHMTEECFK FT LQNQQMNRVGNDIAESANESCGKRLKPSRGEWTRSNKANALETNTDTANLI FT KTRDTYEKNVLNTEKSEKCENEANITDIEKLEVYHIKLDDRLEYIEITNPY FT TIKSCSLLVDSGAQINIIKLGQISPEAQLQDKKILLTGITSKQIKTLGIVK FT LPINNKYFDFHVAPEEFDIPYDGILGIKLLRESKLRLDEGYLEINDIKLKL FT QSGPIKKIYTYMEQVQNKDQNARVGINNDQKKFLNRILKEIIDENDIYIKN FT IIKIYIXMYIXIYIYEGVNENKWSESKAENECKENITHQSNKQISVNESME FT QEAENFDNTKDFEKNEIAIENTFYDDVKTVLNTITKEYEVSNDTQAEIINE FT NNKEKNKPENFREFLETDNKNIENFIYNINVGLQEEQNSQEISAKAKECDF FT KDILTTMKETGEKLSKAEEKLMFDGKEYQYISFDIDYIPEGAQLVDELGIE FT ESLILNIEAIDAFEKTKQGKLKKPNRQEVLRKKLQMKNGGKRNQERIERIT FT NKYPDVFWIEGDELGTCNVEQHEINLTTDKPIYVKQFPLEHKKKGIAISET FT KKLVINKAIRNTKSPFNAPALVVPKKELVTGEKRYRLVIDYRALNEVTIAD FT PYLLPNINEILDLIGNKKYFDVLDLKSGFHQVKMRPSDIEKTAFSVQPLGR FT FEFLVLPFGLKNSARTFQRVINTVLAEYINKICYNYIDDIIIFGDSTEELD FT ERFNLIAEALIAGRIKIRTRKMQAGLKLEPEKCEFGKTEVCFLGHIISEAG FT IRPDEKKIEAVKXFPIPKNVKGVRQFLGFANYYRRFIKNLADITKPLTILL FT QKDRPFIWKEEQQKAFDKLVKMLCSAPVLQYPQFDKPFIITSDASQYALGC FT VISQGEIGKDRPIAYASRVLQPAEMKYATYEKEALAIMYAIKTFKSYIYGN FT KFVIVTDHKPLVWLKSADNNERVQRWRLKLLDYEYEIIYKAGKQNTNADAL FT SRNPVAEAGICVITRAQKKREENKIRQELTNEDKIKIVNEQKTKINKKKLK FT IKNRKRRKIKGPKNRIACDVEKQESKQKNVVYSKNAIQFRKDNVIHLINNN FT GESLDENTANLLKNIYWDKSLDSNEIEIINKGNNKIFVLCLNTAESAITVK FT TNLINLFKLLRTICLQKNYAIFSISKDIKITNISWDEIEEILKDIFNETNI FT KVIICLNNIIYIDIDKRDKIFEMLHATKIGGHSGVNRTYNRIKDKYYWENL FT KVDIQKRIQRCEECQRNKLKRIKTKQPMIITDTPLKTFDKISMDIVGPVNV FT TKNNNKYILSIQDQLTKFIVIVSLPDQTAESVADALIKKFICIFGSPKLVL FT TDRGANFTSKLIKQIARRFKFKKVETTAFSPQSNGSLERAHLRYTNI*" XX SQ Sequence 10691 BP; 4515 A; 1523 C; 1854 G; 2787 T; 12 other; ktggttccgg tacgaatcga acgctagacg tttgtaaaaa aaaagtaaaa atcttgcaaa 60 cagataaaat atatcagtgt gttatactaa aaaaaagtga taataacaat ggttctaatt 120 agaacwtaaa acatatataa atgcaataaa aacaactaca aaagcaatct aaaaataaga 180 cakaaattct aaaagtgaca gtgttttaaa accgtataaa tcaagttttt tgcctagtgc 240 gagtgtaacc ctttcctcaa gtttgcttag tagcaacttt cccctagatc tgctaaaata 300 gcggtcagtc ttcttgatcc taaagctcgc tgaaaagcga catatcctgt gatctgctta 360 aatagcgatt actcatttcc taaaaagcac ataaaggcac gaaaaccctc tagccataaa 420 aatacggttt atgaacggat ttataaactt atatctaaat aaacggaaat aaataaaata 480 taaagagtac aaaagcaatc agaacaatag aaatataaaa tctaaagatt acaaaaacaa 540 tcagaacaat agaaatataa aatcgcgagt tcacaaaaaa atttaggtga gtgacctgct 600 tgcgctcatc ggcgatcgcg ctcaccttac gctgcagtgt cagcaaggtg atagaacgca 660 gcgcttcgga tctatacagc gcttgtgtca gtttaaaaac caggtcaatt caaaatattc 720 gaaacatatg taaagcaatt aatttcgtgc attttccaat ctgccgtgta attagtgaat 780 tcatttatcg atataaaagc acgttatttc attcaatgaa ggataatact agtgtagtaa 840 atacggtaaa aaagaataat cacaaaagag gcggtaaaaa gatattttta cggaaactat 900 taaatgaaaa atctaaatta atagaagaat taagacaatt aaacaagaaa atagattcat 960 ttttccgatc gcaagaagtt aagcagccaa aaatagtaaa caaccaagta gaacgatcgg 1020 agtttcaaat agaaatcggc aacaaatttg aagagttatt taattttgct atgactgaat 1080 tgaataaaaa taaataaatt ttagattgta gaatcacgaa aaatcagttt aaagcctaaa 1140 accgttcaaa gaacacttat atagcagagt cgagtcaaat tactgtatca cgcgtatagc 1200 cgcggtaaat gaactgacca aaacttgtgt agtttgcaaa aacgcgaaaa gtaggtcgcg 1260 tattgtaaaa taagtagaac agcaaaatta cattcgaatt atttgtaygc gatttaaaaa 1320 tcaaaaattc aggctgaata aaattttata ataaataaaa agataactaa aatgagagca 1380 ccaaatgagt gacaatcaga aaaatctact agggcaaaaa ttaacaagga gcgttgttcg 1440 taataatcct acattacaga atcaaataat aatccctaac aatcaaacgc catttgcgag 1500 aagtaagaaa ataattcgat caccgccagc aaataataca actgcacaag gaagagacga 1560 agaagcagta accaatactg aaaaattaaa tgatacaata gctaaaaata tagaaaaaat 1620 agaccaagaa ataaatttag aacagacaaa caaagagaat tccacggggg caataaaaaa 1680 ggagatttca aggcttaaca aagcccattt aaccattaat ataccaacta cacctaataa 1740 acaggaatca accgaaatat ctaaaaaaga acaaaaagta gcagcagtaa caccaataac 1800 accgttgaca gcgattagaa aagcagaaga ttggaaattc gataatagtt tgaaacagga 1860 catatcgttg atagacatcg aaggttttaa cgagcaacca agcctaatcg gcaacctaaa 1920 taattcacag agtaaattca actttaattt taatttagac agtatattag ataacaccat 1980 ggtggttggt gggctaggta ataaacaaca acaacaacaa caagcgccac agcagttgcg 2040 tttatcgcta aaatatgtag ctaatctaat accagaattt gacggaaaat gtatgtcggt 2100 taatgaatac gtagaaaaat taaaacatgc aaaaaacatg ttatcagaga ccgatcaagc 2160 aaatttaatt ccaatactaa aaatgaaact aaaaggcgac gtatataggg ctatgttaaa 2220 tgcacaaatt aataatattc aagattttat caaagctgtg cgacagattt atccatcaac 2280 tgataaaatg gcaactcttt atggaaaaat agcagaaagt ttgcaaaatc cagacgaaac 2340 ggtactcagt tttgctaata gattgcaaga attagtattg caagtaaagg acagcaagga 2400 agtagaagca tatacagcag aagaaaaaca aarttttgaa actaaaatag atgctgaaac 2460 tctattagca tttgtagaag gattacggca ggaaatacgt ctagaattag gagcaacaag 2520 gaatttaggt gatgctatac aaaaagcaat agaagtagag acgagattaa acagaagaga 2580 tttgatattt ggtaaagcaa ataaagtatt ttgcagttta agcaatcaag tggctatatc 2640 aggagataca agcgttgcca tatactcgtg tcaaatttgt caagattatg gacatgaggc 2700 attgttttgt cataaagtag cgtgtgttta ctgcaaaaat aaagaycatt tgtcagatag 2760 ctgcagaata gtaaaaaaga agattcaatt aatttgtaaa ttctgcaata atcacgggca 2820 ttcaattgat gcctgcaaaa tgaatagaat taaaggcaat cattgccaat attgccaagt 2880 watgggtcat actgtaactc aatgcccatt tataatagaa tacgaactgt gttggaaatg 2940 caaagaaagc ggacatgatc ctaataattg taaaaatcaa ggaaaacaaa gtgaaacgtg 3000 tgaattctgc aatggcacaa ctcatacaat aaaagattgc ccagatgtaa catgcaaaag 3060 atgtaatcaa cggggacatg tattgaaata ctgtccaata ttaatggata gaaaatttct 3120 aatgtgtgca atttgtaata aaaatgatca tgaagctgaa caatgtaaag aagctaaaat 3180 attagtagca cagaatagag cgaaagcatg tcaacaggat atccaatgcc agattggcaa 3240 cgaaataggt catttgggaa aatactaccc caaattcatg gctcagcaac aagctacgaa 3300 taatgccaac gcctcgtata aaaatcgaaa ttataacaac tataataata actatagaaa 3360 aagcaacaat ttcaataatc agaacaaaaa tacaagtaag cctaattgga aacaaataag 3420 ctgcgagtat tgtaaaataa gaggacatcc aatacaagag tgcagaaaat taaagaatct 3480 agagaatatg gctcaagagc ggaaaaattg cagttattgt aaagaagatg gacacacagt 3540 ggaaaattgt caggaaatca aggcattgga atcaagggca aataaatttt gtagctattg 3600 caaaaatacg aaccatatga cagaggaatg ctttaaattg cagaatcagc aaatgaatcg 3660 tgtgggaaac gactaaagcc ttctcggggc gagtggacga ggagtaataa ggctaatgca 3720 ctcgaaacaa atactgacac agcaaattta ataaaaacaa gagatacata tgaaaaaaat 3780 gttctaaaca cagagaagtc ggagaaatgc gagaacgaag cgaatattac ggatatcgag 3840 aaactagaag tatatcacat aaagttagat gatcgattag aatatattga gattacaaat 3900 ccatacacaa taaaatcatg tagcctactc gttgatagtg gtgcacaaat aaatataatt 3960 aaattaggtc aaatttctcc ggaggcgcaa ttacaagata aaaaaatatt gttgacagga 4020 ataacaagca agcagataaa aactctagga atagttaaac taccaataaa taataaatat 4080 tttgattttc atgtagcacc agaggaattt gacattccgt acgatggcat tttaggaatt 4140 aaattactca gagaaagtaa attacgctta gatgaaggat atttagagat aaatgacata 4200 aaattaaaac tacaaagtgg accaattaag aagatataca cgtatatgga gcaagtacaa 4260 aacaaggatc agaatgcaag agtgggaata aataacgatc aaaagaaatt tctcaataga 4320 attttaaaag aaatcattga tgaaaatgat atatatatat agaaaaatat aataaaaata 4380 tacatatawa tgtatatatw gatatatata tatgaaggcg taaatgaaaa taagtggagc 4440 gagagtaaag ctgaaaatga atgtaaagaa aatataactc atcagagtaa taaacaaatc 4500 agtgtaaatg agagcatgga gcaagaggca gaaaatttcg acaatacaaa agatttcgaa 4560 aaaaatgaaa tagcgataga aaatactttt tatgatgatg tgaaaacagt cctgaataca 4620 ataactaaag aatatgaagt cagtaatgat actcaggcgg aaataataaa cgaaaataac 4680 aaagagaaaa acaagccaga aaattttaga gaatttttgg aaaccgataa taagaatatt 4740 gaaaatttta tatataacat taacgtagga ttgcaagagg agcaaaactc gcaagaaatc 4800 agtgcaaaag caaaagaatg tgatttcaag gacatattaa caacaatgaa agaaactgga 4860 gaaaaattgt caaaagccga agaaaaattg atgtttgatg gaaaagaata tcaatatata 4920 tcattcgata ttgactatat tcctgaagga gctcaattag tggacgaact gggcatagag 4980 gagagtttaa ttttaaatat agaagcaatc gatgcatttg agaaaactaa acaaggcaaa 5040 ctcaagaagc ctaataggca ggaagtatta cgaaagaaat tacagatgaa aaatggtggt 5100 aaaaggaatc aagaaagaat tgaacgaatc acaaataaat atccagatgt gttctggatc 5160 gaaggagatg aattaggaac gtgtaacgta gaacaacatg aaataaattt aacaacagat 5220 aaacctattt atgtaaagca gtttccatta gagcacaaaa agaaagggat tgccatcagt 5280 gaaacaaaga aattagtcat aaataaagca atccgaaata caaaaagccc attcaatgca 5340 ccggcattag tcgtgccaaa gaaagaactt gtaacgggag agaagagata caggttagtg 5400 attgactaca gggcactaaa tgaggtcact atagcggacc cttatttact tccaaatatt 5460 aacgaaattt tagatctcat aggaaacaag aaatatttcg atgttttaga cttaaaatca 5520 ggatttcatc aagtaaaaat gagaccttca gatatagaaa aaactgcgtt tagtgtacag 5580 ccattgggaa gatttgaatt tcttgtttta ccgttcggtc tcaagaactc tgctaggaca 5640 tttcagcgag tgataaatac agtcctagcc gaatacatta ataaaatatg ctacaattat 5700 attgacgaca ttattatttt tggagattca acggaagaat tggatgaaag atttaattta 5760 atagcagaag cattaatagc aggcaggatt aaaattagaa ccagaaaaat gtgaatttgg 5820 taaaacagaa gtttgttttt tagggcatat aattagcgaa gctgggataa ggccggatga 5880 gaagaaaata gaggccgtca aaaaktttcc tatacctaaa aatgtaaaag gagtgagaca 5940 atttttagga tttgctaatt attatagaag atttataaaa aatcttgctg atattacaaa 6000 accattaaca atattattac aaaaagatag accatttatt tggaaagaag aacaacaaaa 6060 agcttttgat aaactggtaa agatgttatg ttctgcacca gtcttacaat acccgcaatt 6120 tgacaaacca tttataataa caagcgacgc aagtcaatat gctctaggat gtgtcatttc 6180 tcaaggcgaa ataggcaaag acagacctat agcttatgca tctagagtat tgcaaccagc 6240 tgaaatgaaa tatgcgacgt atgagaaaga ggctttggcg attatgtatg ccataaaaac 6300 tttcaaaagt tatatctatg gtaataaatt tgtgattgta acggatcaca agccattggt 6360 atggttaaaa tcagcagata ataatgaaag agtacaaaga tggagattaa aattactaga 6420 ctatgaatat gaaatcatat ataaggctgg aaagcaaaat acgaatgcag atgccttgtc 6480 acgaaatcca gtggcagaag ctggaatatg tgttataact agagcgcaga aaaagcgaga 6540 agagaataaa attcgccaag aattgaccaa cgaagataaa attaaaatcg taaatgaaca 6600 aaagacgaaa atcaataaga aaaagttgaa gataaaaaat cgtaaacgaa gaaaaattaa 6660 aggcccaaag aatagaattg cgtgtgacgt cgagaagcaa gagagcaaac agaaaaatgt 6720 agtatactcc aaaaacgcaa ttcaattccg aaaagataat gtaatacatt taattaataa 6780 taatggtgaa tcactagatg aaaatacagc gaatttatta aaaaatattt attgggataa 6840 aagtctagat agtaatgaaa ttgagataat aaacaaggga aataacaaaa tttttgtatt 6900 atgtttaaat acagcggaat cagcaataac ggttaaaaca aatctgatta atttatttaa 6960 attattaaga actatatgtt tgcaaaagaa ttatgcaata tttagtatat cgaaagatat 7020 aaaaataaca aatataagtt gggatgaaat cgaagaaatc ttaaaagata ttttcaacga 7080 gacaaatatt aaggttataa tttgtttaaa taatataatt tatatagaca tagataaaag 7140 agataaaata tttgaaatgt tacacgcgac taaaataggg ggacactctg gcgttaatag 7200 aacgtataat agaattaagg ataaatatta ttgggaaaat ttaaaagtag acattcagaa 7260 aaggatacaa agatgcgagg aatgtcaaag aaataagctg aaaagaataa aaactaaaca 7320 acctatgatt atcactgata cgccattaaa aacattcgat aaaatttcta tggacatagt 7380 tggaccggta aatgtaacta aaaataacaa taaatatatt ttatctatac aagatcaatt 7440 aacaaaattt atagtaattg tatctttgcc tgatcaaacg gcggaatcag tagcagatgc 7500 tctaataaaa aaattcatat gcattttcgg atcacctaaa cttgtactca cggatagagg 7560 agcaaatttt acaagcaaat taataaaaca aatagctcga agatttaaat ttaagaaagt 7620 agaaaccacg gcattttcac ctcagtctaa cggctcgtta gaaagagcac atctccgcta 7680 tacgaatatc taaagaactt ttcttcaaag aaaatggagt gggacgaatt acttgaattc 7740 gctcaattta attataatac tagtgtccat actagtcata aatttacacc gcatgaacta 7800 gtttttggtt atccagctaa acttccatct agtgaaccat taaagaaaaa cgaacaattg 7860 cttactttta atggttattt agaaaattta gtagcgaaat tagaagaaat tacagtgatt 7920 gctagagaaa atttgatcaa tatcaaaaat gaaatcaaaa gaatattatg atcggtacgt 7980 aaatccaata gaactaaaaa ttggcgaaaa agtatggcta atcaaagaac caaaaccagg 8040 taaattcgaa aagaatcata acctggggcc atatgaaata ataaaagtaa atgaaaatag 8100 taacgtaact ataaattaca atgggaaacc aaaaacagtt catgtaaata aattaagccg 8160 ttgccataca tgattctaaa atatttttct ttctaggaaa cttggacaca atgattcctg 8220 aaattatttt aatttttgca attcttaaaa caagtcaagc tatagtagga ttcgattgcg 8280 gggcatctga tccaatcatc accacgtact ctctacttga ttcggggagt gcgatttcca 8340 tcaagaagat gtaaacgcga ccaacgccac catacaacta ctacaactgg cagaatttag 8400 aaaagtaaga gtaatcaatg taaaatagaa attaagcgta cggtatatca ttgcggaatg 8460 tttagtcata taggtgcggt agaacaaggc gtacaagaat atatttatga tataagctat 8520 gaaaattgca aatcaatcca tgaaacagga atattcaaat atgacaattt tcacactata 8580 gctaatttga aagtaaactc aaccaaatca acaggaattg aattggcggg aagtattaaa 8640 gaaaagaaat gcacaggtgc agcatattcg gatagttttg gttcatggag tgatgtttta 8700 gtacagggac ttataaaaat aacactttta gaagaatagt gctgcagata ggtaagttta 8760 gataatgata aaataagatt aagttcaggg acagtatgta aattctcgga gcagcattgt 8820 attgatatac aagctagcca tacattttgg tcagtaataa atactgatag ttgttttaaa 8880 aataagtatg atatattata tggargagta tcgacaaaaa ataatctcac aaaatcacga 8940 gacagtttat acaataagta ctaatgaaat atcgttcgcg ttaactatta gagaaaagat 9000 aaatatttgc ggtagagatt ttatataaaa cagaacatcc gaaattrttc atttgcgaag 9060 aaattgataa tcaattatta tttcaagaag atatgcttaa caaaaggccg atagttaatt 9120 ttgatctttt tacatatgta aactcaaaat ttgtatacgt agaaaaacat tttaggggtc 9180 agtttaaaaa tttatactta aatatgctaa aagaagaatg tgaattagaa agaaaagtga 9240 ttcagaattc cctatctctc gcatccgttt cacccgatga atttgcctac aatataatgc 9300 aaaaacctgg gtatattgcc agactagcag gtgaggctat acacataatt aagtgtacgc 9360 cagtagaagt aaaaattcag catactgact attgctatac acaactgaca gtctcatcag 9420 gaaacaaaac aatgtttctc acgccaaaaa cccatatact gttaaagaaa ggtacacaga 9480 taaactgcaa ctctatcgta cctcaatatt ttaaaattgg cgataactgg ataaatctca 9540 tgcctgcgcc aagaaaaagt aataacatca tcaagacttt gaaaccgtca accaaaattg 9600 catggcagcc ggaaaaattg aattcactgg caaccagcgg aatatattca caagaagatc 9660 ttcagaaact acagcaacac ataatgtttc cgatggaaag agcatcctta ctcaacacga 9720 tagccagagg aatgcaaggt gaagaaacag ttcaaggagg cctaatatcc aacctattaa 9780 ccgaaaatac attgatgaac atcgctgaaa gcacatggaa tcgatttatg gacaaactgg 9840 tcagatttgg atcaatcagc tcagcaataa taatgatatt cattattatc catatctcca 9900 aattggcaat cgatattata atacgaggat atacattaca tgcaatgtac ggatggacgc 9960 ttcatctact tggagctata ttcagctcta ttacacattt actaatcgta atgagaagaa 10020 acgatataat agtaaacgac gataataaac agacaccaac gacaaatcct caagatatcc 10080 agctacaaga aatagtggtg caaaatccaa ttagtccgaa gaaagaaata tccccaccta 10140 ttcctataca tccacctaat tcaaagacct atataaatca accaatagaa tcaaaagttg 10200 aatcaaaagt agaatctaaa ccagaatcga aaatatcatt attctcgttg aagccatttt 10260 agaatgtgta gctgtaaata ttcaaaccct ttactgttat tctacgtaaa caatatgata 10320 ttaatgtttg atatatttat attgctaggg tcgtaggtaa taaattagtg tataatataa 10380 acttcattta taatttgaaa caaactaaaa ttaattttaa tttaggttag atgtaaccac 10440 taagagtagt agatttaata ataagaatgt aaataaaatt ctagagtatt catataagtt 10500 tacaataaat cagacaataa taataataat aatacaagta aaaacaacta cttcaaaatt 10560 tagttagaat attctagaat tttatgaaga ttaaagaatg ttgctgagaa cttaataatt 10620 caaattgttt gttgatctgt tatctaatta agttgctatt gtaccaatag gcaaccagct 10680 tgttgggggg a 10691 // ID BEL-98_AA-I repbase; DNA; INV; 5877 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-98_AA_; KW BEL-98_AA-LTR; BEL-98_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5877 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 871-871 (2011). XX DR [2] (Consensus) XX CC Positions [4560-5132] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 255..5546 FT /product="BEL-98_AA-I_1p" FT /translation="MSSAERRIRYLKLRQRSISASFTLIKGFVDRYDEEND FT AAEVPVRLESLVELWNDYTKVQAELESLDEASIEEQLKQRMEMESHYYRVK FT GFLLNKSPIAPPPSPTTSTNAQAHSPTTHVRLPDVKLPVFNGTLEHWLNFH FT DLYMSLVHSSQDLSSIQKFYYLRSSLSGDALKLIQTIPISATNYPVAWNLL FT VEHFQNTGRLKASYVDALFEFPTLKRESAGELHTLVERFEANVRILQQLGE FT QTNTWDILLVRMLSSRLDQTTRRDWEEHSSTLQDVSFKELTTFIQRRVNVL FT QSINHKPSDTCTPSSAKKPPTPRSFASHGASQLNARKCILCSEQHPLYMCP FT TFSKMTVEDKEKEIRRHQLCRNCLRKGHMSRECSSNSTCRRCKGRHHTQLC FT NHESSNESSQKTSDVTSKPTQFQSASEQPTTSASATHMHPKGYASTGRNHA FT KVLLATAVVHVVDDNGTFHVARALLDSGSECCFATEPFSQLLKVQRKRVSI FT PIAGIGHSTTQARTKFTSTIRSRISDFSASAEFLVLPKVTVNLPSTSLDIS FT SWEIPSGVKLADPSFCTSNTVDLVLGAEIFFDLFKVPGRIPLGDDLPMLIN FT STLGWVVSGRSNHCQPSTTITANVATMADLHQLMERFWTIEEDNSSPCHSV FT EEAACEAHFRRTVSRTPEGRYVVRLPLKDDVISNLGDNRRTAIRRFQMVES FT RLQRNLHLGQQYSDFMKEYYELGHMQRVQDTGEITTQTYHLPHHAVVREES FT TTTKVRVVFDASCKTSSGTSLNDAMMIGPIVQDDLRSIVMRARIHPVMLIA FT DIKQMYRQVLVDERDTPLQRIVWRASRDEELETYELKTVTYGTASAPFLAT FT RVLQQLADDEQHDFPEAATVLRKDFYVDDLFSGSNSIAETIVLRKQLHSLL FT ARGGFELRKWASNEPAVLDDISDDNKALQQSVDLDRDQLIKTLGLHWEPNA FT DVLRYNVKLPQSIPNATLTKRLALSYIAQLFDPLGLVGPVVTTAKLFMQAL FT WTLKNSDDNIWGWDEELPATARDYWQSYHEQLPLLNQLKIDRFVLCPDATT FT LQLHFFSDASESAYGACAYVRSENESRQVKVALLTSKSKVAPLKKQSIPRL FT ELCGALLAAQLFEKVTSSLSMRPETYFWVDSTTVLSWLNCSPSTWTTFVAN FT RVSKIQLSTTNCIWNHVAGEQNPADCLSRGTSAELLLSHDLWWHGPEWLHR FT DQTEWPSVHQTTSNPQSVCEMRKPPATIVSATIEDSFINCYVNKFSNYQRM FT LRVTAYCKRFLQNCRLSNQHRPASHVVTTEEKKEAELTLIRLVQEQAYPNE FT WKCLQQGKPVAVKSRLKWFHPILDSENLIRIGGRLRRSQQAYDSKHQIILP FT STSPLSALLVRSLHEQHLHAAPQLLLGILRLRYWITGARDLARKIFHKCTI FT CFRARPKRIEQFMSELPTARITASRPFSSTGIDYWGPILIQPAHRRASPRK FT AFVAVFVCFCTKAVHLELVADLTTAKFLQALRRFVSRRGLCSDIYSDNGRN FT FVGAANELRHLIRSKEHREQIAQECAHNNIRWHFNPPKASHFGGLWEAAIQ FT SAQKHFIRVLGAQTLAYDDMETLLSQIECCLNSRPLVPISDDPSDLEPLTP FT GHFLVGSALKAVPDVDVTSIPFNRLKKWQQTQKLYQQIWERWHRDYLVTLQ FT PRAKWCNPPVPLQRNQLVVLLDENLPPMRWPMARIQDLHPGPDGVVRVVTV FT QTSTGIFTRPVAKICLLPIAPIMPPAENTSQTTTNSAPKPASNP" XX SQ Sequence 5877 BP; 1579 A; 1659 C; 1284 G; 1355 T; 0 other; tggtccttcg agccggatga acgtcgaact ccgttcatca acgatccaac ccgccagagt 60 cgccatcttg cttctggaac cttctggaga gttcaaccac gctgtacatc cctttagggc 120 catcttggac acagtcatcc cgccattgca gcatcaataa aatacaaggc atttaattgc 180 ctataacagg taattagctt tcctatcatc tataccctac tcgttcgagc tacgcggtct 240 tttgcttgca gaccatgtct tcagccgaac gccgcatccg ctatctcaag ctgcgccagc 300 gtagcatttc tgcatcattc accctaatca aaggcttcgt ggatcgttat gacgaggaaa 360 acgatgccgc ggaggttcca gttcgcctgg agagccttgt ggaactatgg aacgactaca 420 ccaaggttca ggccgaattg gagtcgttag acgaagcttc gatcgaagag caactcaagc 480 agcggatgga gatggaatcc cattattata gggtcaaggg gttcttgctg aataaatcac 540 ccatcgctcc acccccttcg cctactacgt ctacaaacgc tcaagcccat tctcccacca 600 cgcacgtgcg attgccagat gtgaagctac ccgttttcaa tggaaccctg gagcattggc 660 tcaacttcca tgacctttac atgtcgctag ttcactcttc gcaagaccta tccagtatac 720 aaaagtttta ttatctccgt tcctccctct ccggagatgc tctgaaactg attcagacta 780 tacccataag cgctacaaac tatcctgttg catggaatct gctggttgag cattttcaga 840 acactggtcg tctgaaagct tcgtatgttg atgcactttt cgaatttccc accctcaaac 900 gagaatcagc tggggaatta cacactctcg tagaacggtt cgaggccaac gtccgaattc 960 ttcagcagtt gggagagcaa accaatacgt gggacattct tttggtccgt atgctaagca 1020 gtcgcctcga tcaaaccacc agaagggact gggaggaaca ttcgtccact cttcaagatg 1080 tttcgttcaa ggaacttacc acattcattc aacgaagggt taacgtgctg caaagcatta 1140 accacaagcc gtccgacacc tgtacgccat cgtctgcaaa gaaaccaccc accccacgtt 1200 cattcgctag ccatggagca tctcaactca atgctaggaa atgcattctt tgttccgagc 1260 aacatccact gtacatgtgc cctacctttt ccaaaatgac cgtcgaggac aaggaaaagg 1320 agatacgccg tcatcaactc tgcagaaatt gcctccgcaa gggacacatg agtagggaat 1380 gttcatccaa cagtacctgt cgtcgatgta aaggccgtca tcacactcaa ttatgcaacc 1440 acgaatcgtc gaatgaatca tcccagaaga caagtgacgt cacatcgaaa ccaacacaat 1500 ttcaatctgc tagcgaacag cctaccacct cagcatctgc aacacacatg cacccgaaag 1560 gctatgcttc aactggacga aaccatgcga aagtactgtt agccacagcg gtcgtacacg 1620 tagtcgacga caatggaacc tttcacgtcg ctagagcgct tctagactcg ggaagcgaat 1680 gctgttttgc caccgaacca ttctctcaac ttttgaaggt acaacgcaaa agggtatcca 1740 ttccgattgc tggcattggg cattctacta cgcaagctag gaccaaattc acttccacca 1800 ttcggtcccg aatcagcgac ttttctgcct ccgcagagtt ccttgtactc cccaaggtta 1860 cggtcaactt accgtcaacg tctttggaca tttcatcctg ggaaatcccg tccggggtta 1920 aactggctga cccatcgttc tgcacctcca acaccgtcga tctcgtcctt ggcgcagaaa 1980 tcttttttga tctctttaag gttcctggca gaattcctct cggagacgac ttacccatgc 2040 tcataaattc gactcttggt tgggtagtct caggtcgttc caaccattgt caaccatcaa 2100 ctacaatcac ggccaacgtt gcaaccatgg ccgatttaca ccagctgatg gaacgattct 2160 ggacaatcga agaagacaac tcatctcctt gtcattcggt ggaagaagcg gcatgtgaag 2220 ctcacttccg tcggacagtc tcgcgcacac cagaaggtag atatgtagtg cgcctaccac 2280 tgaaggatga cgtcatctca aacctcggcg acaaccgtcg caccgctatc cgtcgttttc 2340 aaatggttga atcacgcctg cagcgaaatc ttcatctggg ccaacagtac agcgacttca 2400 tgaaggagta ctacgagcta ggccacatgc aacgagtgca ggacaccgga gagattacca 2460 cccaaacgta tcacctccca caccatgccg tggtacgtga agagagcaca acgacaaaag 2520 tgcgcgtcgt cttcgacgcg tcttgtaaaa cgagcagtgg aacatctcta aacgacgcaa 2580 tgatgattgg gccaatcgta caagacgacc tgcgatccat cgttatgcgt gctagaattc 2640 atcccgtgat gctgattgcg gacatcaagc agatgtaccg ccaagtactg gtcgacgagc 2700 gtgacacgcc tcttcaacgc attgtgtggc gagcatcacg tgacgaagaa cttgaaacct 2760 acgaactcaa gacagttacc tacggaaccg caagtgcccc gtttctggca acgagagtct 2820 tgcagcaact agctgatgac gaacagcacg atttcccgga ggcagccaca gtattacgca 2880 aggactttta tgtggatgac ctcttctctg gcagcaacag catagccgaa acaatcgtcc 2940 ttcgaaagca attgcactca ttgctagcac gaggtggctt tgaattacgt aagtgggctt 3000 caaacgaacc cgctgtcctg gacgacatct ccgacgacaa caaggctcta cagcaatcag 3060 tagatttgga tcgcgaccag ctcatcaaaa ctctcggtct tcattgggag ccaaacgccg 3120 acgttttgcg gtacaacgtg aagctgccgc agtcaatccc aaatgcaaca ctcaccaaac 3180 gtctcgccct ctcatacatc gcccaattat tcgacccact tggtttggtg ggacccgtcg 3240 ttacaacagc aaaacttttt atgcaggcat tgtggaccct gaagaacagc gacgacaaca 3300 tatggggctg ggatgaggag cttcctgcaa cggcacggga ctattggcag tcctaccatg 3360 agcagcttcc actgctaaac cagctcaaaa tcgaccgttt tgtactgtgc cccgatgcaa 3420 caacacttca gttgcacttc ttttcggatg catcggagag tgcatacggt gcatgtgcat 3480 atgtgcgatc cgaaaatgaa tcgcgacagg tcaaagtcgc actactgaca tcgaagtcga 3540 aagttgcacc tttgaagaag caaagcatac cacgtttgga gctttgcggt gccctcctgg 3600 cggcacaact gtttgaaaag gtgacctcct ccctatctat gcgcccagaa acctacttct 3660 gggtcgactc gactactgtc cttagttggc taaactgttc accgtctact tggacgacct 3720 tcgtggcgaa cagggtgtca aaaatacagc tctcgactac aaactgcatc tggaaccacg 3780 tagcaggtga acaaaatcca gcagattgtc tctccagagg aacatcagct gaactacttc 3840 tctcccacga tctatggtgg cacggtcccg agtggctaca ccgagaccaa acggaatggc 3900 cgagtgtcca tcaaaccacc tcaaatccac agagcgtatg cgaaatgcgc aaaccaccag 3960 caacgatcgt atctgcaacg atagaagatt ctttcattaa ctgctacgtc aataaattct 4020 ccaattatca acgcatgctt agggtgaccg cctactgtaa acgttttctg caaaactgtc 4080 ggctcagtaa tcaacatcga ccagcgtcgc acgtcgttac cactgaggag aaaaaagaag 4140 ctgaattgac attgatccga ttagtgcagg aacaagccta tcctaacgaa tggaaatgtt 4200 tgcaacaagg aaaacctgta gctgtgaagt ctcgcctaaa atggttccac cctattctcg 4260 attcggaaaa tctcatccgc attggtggtc gtttgcgccg gtcgcagcaa gcctacgatt 4320 ccaagcacca aatcatcctt ccatcaactt ctccgttatc tgccctgctt gttcgaagcc 4380 ttcatgagca gcatctgcac gctgcacccc aattgctgct tggcatactc cgcctccgat 4440 actggatcac gggggccagg gacctggccc ggaaaatctt ccacaaatgt accatctgct 4500 tcagagcacg accgaaacgc atcgagcaat tcatgtcgga actgccaaca gcccgaatca 4560 ccgcatcgag accattttcc tcaactggaa tagactattg gggaccaatt ctgatacaac 4620 cagcacaccg gagagcatca ccacggaagg cctttgttgc agtcttcgtg tgcttctgta 4680 caaaggccgt ccacctcgaa ttagttgccg acctaacgac tgctaagttc ctccaagcgt 4740 tgcgtcgatt tgtctcccgg agaggactgt gttcggacat ttacagcgat aatgggcgga 4800 actttgtggg agctgcgaat gagttacgcc acctgatacg aagcaaagaa caccgtgaac 4860 aaatcgccca agaatgtgca cacaacaaca tccggtggca tttcaaccct cctaaggcgt 4920 cgcacttcgg aggactctgg gaggccgcca tacagtcagc acagaagcac ttcatccgag 4980 ttttgggggc gcaaacactc gcctatgacg acatggagac tttgctatcc caaatcgaat 5040 gctgtctcaa ctctaggcct cttgtgccaa ttagcgacga cccatcggac ctggaacctc 5100 taacgcctgg ccacttccta gtcggctcgg ccctcaaagc agtacccgat gttgacgtca 5160 cttcgattcc ctttaatcgt ctgaagaaat ggcagcaaac gcagaaactg taccagcaaa 5220 tttgggagcg atggcatcgg gactacctcg taaccttgca gccacgagca aaatggtgca 5280 atccacctgt accactacaa aggaaccagc tcgtagttct cctggacgaa aatcttcctc 5340 cgatgcgctg gccgatggcc agaatccagg atctgcatcc cgggccagac ggagtggttc 5400 gagtcgtcac cgtccaaaca tctaccggta tattcactcg accagtagcg aaaatatgtc 5460 tcctgcctat cgctccaatc atgcctccag cagaaaacac ttcccaaacg acaacaaaca 5520 gcgcacctaa accagcttcc aatccatgat tccgtcacct tcttcgagta tgccgtagta 5580 tcctgatgca cctgcaataa aatacaatgc cctaattgat tagggacaag gtaagcacct 5640 gtccaatatc cctatctcgc aaaaatccgg ctaccgggtc ttatttgttt gccaggttat 5700 aataaaatgg tgtacgatga catatcaacc gcatcaatgt aaacaatcat cgtgtcgtcg 5760 gtatcgtcat catcgatcaa ctccatgaga aggtgcaccg tctgcctggt cgacgattga 5820 gtcgacaatc atcatcatat cgacacacca gaaataggat ttctggaggg gccagtg 5877 // ID hAT-24_SM repbase; DNA; INV; 2974 BP. XX AC . XX DT 13-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea Mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-24_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2974 RA Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 73-73 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 388..1665 FT /product="hAT-24_SM_1p" FT /translation="MSDGYQRKHESGSEKRKKKQEKEEALKKMKGSILKYI FT NNDDKPNNDTTVTTNVSIPEILPNKSAKQSETDVTYEHPEQIYQPKDTQLQ FT FDDPDPELASSEDLKMKQSNECRVSDPAEWEVNAHLIGYYAENIPSQNLES FT DFLSMGRQFGDKTRFAHKEYFLRKLTNGEVVKRDWLIYSPSTGKVFCYVCK FT LFDCKYGGVDESCNQFQTGFDDWKNATARITLHEKSKDHIAALSMMLQRQK FT STRIDAELQKEKEKLFKYWTEVLKRIIAVIKFLAERGLAFRGETELHNCHN FT NGNYLGCIDLLAEFDPFLMEHIRKYGNTGKGNVSYMSSTICDEFIDILGSK FT VLTLIISEIKNAKYFGIKMDQLTLVIRYVILNGNVVERFLQFIAIEHHERK FT YLFDLVMNTLNLHGVDISNCQQHKCNLFLRTS" XX SQ Sequence 2974 BP; 1042 A; 438 C; 525 G; 969 T; 0 other; ggcctatgca ggaccgtttt aaggtcgtcg taggccctag gcacttttgc atatctgcat 60 ttgtagtaag cctcaaatcg attaaaattt aaattctcaa ctaataaaat atgtaaattg 120 taaacacatt tacagcattt ttggatgata ttgttttgaa acatttttga tttgaaccat 180 aatatttatg aacgcaccca ggttttgata ttactaatca caattacttt tagtacaatg 240 atttcataac taattatttt ccggttttcc aatatttcgt aaagttaaat atttaaaata 300 aaaatacttt ataattgtct taatgtttac caattagtaa tttttaatta gtattggtaa 360 ttacaaataa ttttaatttt agaaatcatg tctgatggat accagagaaa acacgagtct 420 ggatctgaaa aacgaaagaa aaagcaagag aaagaagaag cattaaaaaa gatgaagggc 480 tctatactga aatatataaa taatgatgat aagcccaaca atgacacaac tgtcacgacc 540 aatgtctcaa taccagaaat attgcctaac aaaagtgcca aacaaagtga aacagatgtt 600 acttatgaac acccagaaca gatctaccag cctaaagata cacagcttca atttgatgat 660 ccagatccag aactagcaag cagcgaagat ttaaaaatga agcaaagcaa tgaatgcaga 720 gtcagtgatc cagctgaatg ggaagtaaat gcacatctta taggctatta tgcagaaaat 780 atcccttcac agaatttaga aagtgatttt ctttcaatgg gtagacaatt tggagacaag 840 acaagatttg cacacaagga atattttctg agaaaattga caaatggaga agttgtcaag 900 agagattggc ttatttattc tccttcaaca ggcaaagtgt tttgttatgt gtgcaagcta 960 tttgattgca aatatggtgg tgtcgatgaa tcttgcaatc aatttcaaac tggttttgat 1020 gattggaaaa atgcaactgc aaggattact ttgcatgaga agtctaaaga ccatattgct 1080 gctctatcaa tgatgttgca acgtcagaag tccactagaa ttgatgcaga gctccaaaaa 1140 gaaaaagaga aattgtttaa atactggact gaagttttga aaagaataat tgcagttata 1200 aaatttcttg ctgaaagagg attagctttc cgtggtgaga ctgaattgca taactgtcat 1260 aacaatggaa actatcttgg ctgcattgac cttttagctg aatttgatcc atttctaatg 1320 gaacacattc gaaaatatgg aaacacggga aagggaaatg tatcatatat gtcttcaact 1380 atatgtgatg agttcattga tatattaggt tccaaagtat tgacgctgat aatttctgaa 1440 ataaagaatg caaagtattt tggaattaaa atggatcaac ttacactggt tattcgttac 1500 gtcatcctaa atggaaatgt tgtggaaaga tttcttcagt ttattgcaat tgaacatcac 1560 gaaagaaaat acttgtttga tcttgtaatg aatactctta atcttcatgg tgttgatatt 1620 tcaaattgcc agcaacataa gtgtaattta ttcttgcgta caagctagat ttagatagat 1680 caacagttta gcacagttta aatttggttg gttcatctgc ggtgaagtgt ttttcagcag 1740 cagtttacta ctttggtatt gttcagtcag tgtatacatt ctttccggct ttaccaaaga 1800 aatggtcaaa atttaaagac caaattaagg acaaatcagc ggctctggtc aacaaggtct 1860 gttagaacca agatggtcag cacgcagtga tgctacaaaa gctttttatg caaactattt 1920 tgagattcgc ctagcattgt ttgatatcgc cgaaagcgaa tgccaactac cagcagctgt 1980 ccttaaagca aaattgttaa taaaaattaa taccgttatg agactgcgct aatgtgttta 2040 atttggaaag atttacttca acaaattaat atagccaata aagctttgca gcagcctgga 2100 atagagttgt gcactatcaa aaaattgcat gacagtctca caaaacactt tcatgaaaca 2160 cgtaataaat ttgaagcgtt tgaaagtatg gcaaaagatc ttacaaatcc gattacaaag 2220 aatgtacaaa gcgcaaacat gtgcgaacaa gatttcatga cgtcgtgatg gatgagaata 2280 atattgatat gtttaaattt gagtgctagt ggcaatttct tatacagtcc taatatgtga 2340 tcattgacag acttctaatt gagatggaaa acggcgagta gcttacgctg gataaaatta 2400 caagttcagt tttcttcttg acaaatcttt ctctggtgat gaaatagtta ctaaagcaaa 2460 cgaattaatt tagatatatt catcagatct ggaaaaggca tttggtgatc agtttctttt 2520 gttttccaga attttcgccg acacaaaatc tgtaactgaa atgatgaagg cccaaattta 2580 aaacaagttt tccaaatgta aatattgctt ttcgaattta tctttcaatt tttgaatcaa 2640 gctgtgaatg tgaacgatct ttttcaaaat tgaaatttat caaaaattat ctgcggtcta 2700 cgatgggaaa agaatgattg tcttctcttg catttctttc aattgaaaac gatttaatgc 2760 tggaaatgtc atttgaagat gttattcgta attttgttca tgcaaagagt cacaaagttt 2820 atataatttg gtctataatt tagactactg tgtataaaaa cagtttctaa ttataaaatt 2880 gattgatttt gaattttgct tttttgtctt tatttcttta tttttaatcg ttctataaaa 2940 aaatgataaa aaatttaatt ttttttcgta ggcc 2974 // ID Gypsy-16-I_HM repbase; DNA; INV; 4007 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-16-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4007 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 404-404 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 34..3924 FT /product="Gypsy-16-I_HM_1p" FT /translation="MEQLRPLEAMNLNGNVSENWRKWKQRWNLYKIASGVN FT DKNEDIQCAIFLHMIGEDALRVYDTFVFTIEENDKLTPLVQKFECYFSPKK FT NITYERYLFNTCMQDSRLFTDFLIDLRNKAKTCEFGTLEESLIRDRVVCGI FT DSKAARERLLRDTELTLEKTISFMRAYETSKTQLKALENTIQADSINKHGN FT RPNFNSLKIDRPSQKSCYYCGSQHTTSFCPAYGLKCTACGKLNHFAKVCRS FT KTNRFNTNRVDQVDQKEVNLNVAELYIHHVAASMECPDDLTTNITVNNKNI FT TFKLDTGAQCNVIPYNVYNSIPGRPPLKKTITQLKSYGGNNIPVLGICNLT FT IKKNNISQVCQFFVVSLSDVKPLLGLSSCKSLEILNINDVEIDKTNILKHY FT SDVFTGLGLVKGNYHIALAKDAIPVIHAPRKVPMTVLPKLKSTLDRLVKAT FT VISKISKPTPWVHSMVIVEKKDGSLRLCIDPKELNKSVLRQYKTTPSTEEI FT SSKLCGNKVFTVIDMADCYWHIKLDEPSSELCTFNTPYGRYKFNRMPFGIS FT CASDAAQEMIELNFGDIPNVLAIHDDLIIAAKTDTEHDKIFQQILQRARER FT NIKFNLKKIQFRVSEVKYLGNIISHEGIRVDPEKTKAISAYPLPACQADLQ FT RLLGMVNYLRQFIPNMSEITAPLRQLLKKDIHWSWNHEHTSAMEKIKQVLS FT SSPVLSFFDTSKDVQIQVDASSHGLGACIMQEGHPISYTSRSLTIAEQRYA FT QIEKELLAIVFACERFNQYTYGRQVSIESDHEPLEYILHKPLSEAPPRIQR FT LLIRLQKYQVIVKYVPGKDLHIADSLSRAHLNIFDEDDSLNEDCSVMVHTL FT VQNLPISTDRLKQLQHDTKLDPVLSQITNYITYGWPRNRQMLCGEEKKYWT FT IRNNLHIAEGIILKENKIVIPTDMRPLILTQLHLSHLGTEKTKARARNVVY FT WPGLTSDIDKLILNCHKCLKYRNNNKKEKIIQHDIPDLPWSKVGSDIYELH FT GKIYVIVIDYFSKFIENSVIPDKTAFSVIKFMKTIFTRHGIPSSLIADNNP FT YNSAEFLNFSKEYGFNFTPSSPNYPQSNGLSEMGVKIMKKILKKCDNPELG FT LLEYLNMPLTGMDYSPSQLLMNRRTRTTLPVHHDLLIPAVPVNAYSQITKS FT RNRQKQYYDRNASELPNLVANDTARMRKDGKWKKVTVKEKLTLPRSYNVVD FT KYGRVYRRNRKHLIKTNEQPLVYQHEQLDLPYDLPAVITSPTSNEETANNE FT IEATCIPETQSTEPQPDLRRSLRTKNRPAYLRDFI*" XX SQ Sequence 4007 BP; 1474 A; 835 C; 647 G; 1050 T; 1 other; tggtaccaga agtaaacctg tgtttcaaat aaaatggaac aacttagacc attagaggcc 60 atgaatttaa atggcaatgt ttcagaaaac tggcgaaaat ggaagcagag atggaatctt 120 tataaaatag cttcaggagt caatgacaag aatgaagata tccaatgtgc cattttcctt 180 catatgattg gcgaagacgc tttaagagta tatgatacat ttgttttcac tatcgaagaa 240 aacgataaat taacaccttt agttcaaaag tttgaatgtt atttcagccc aaaaaaaaac 300 ataacttatg aacgttattt attcaataca tgtatgcaag acagcagact atttactgac 360 ttcttaattg atttacgaaa taaagctaaa acgtgcgagt ttggaaccct cgaagaaagc 420 ctaatacgtg acagagtagt ctgtggaatc gactctaaag ctgccagaga acgactactt 480 agagacactg agcttactct agagaaaaca atcagtttca tgagagcata tgaaacatca 540 aaaacacaat taaaagccct cgaaaataca atccaggcag attctataaa taaacatgga 600 aacagaccta atttcaactc cctaaagatt gatagacctt cacaaaaatc ttgctactac 660 tgtggatcac aacatactac ctcgttttgc ccagcatatg gcctaaagtg cacagcatgt 720 ggaaaactaa accattttgc taaagtatgc agatccaaaa ctaatcgctt taatactaat 780 cgtgtagacc aagttgatca aaaggaagtc aacctcaacg ttgcagagtt atatattcat 840 catgttgcag catcaatgga atgtcctgac gatttaacaa caaatattac agtcaacaac 900 aaaaatataa catttaaatt agatactgga gctcaatgta acgtcattcc atacaacgtc 960 tacaactcta ttcctggaag acctcctctc aaaaaaacaa taactcaact aaaatcatat 1020 ggtggaaaca atattcctgt actaggaatt tgcaacctaa ctattaaaaa gaataacatt 1080 tcacaagtgt gtcagttttt tgttgtatct ctgtctgacg taaaaccact tcttggtcta 1140 agttcctgca aatctcttga aatactaaac attaacgatg ttgaaattga taaaactaac 1200 attttaaaac attatagtga tgtttttacc ggccttggtt tagttaaagg aaactatcat 1260 atcgcacttg ctaaagatgc aatcccagtc attcatgcac caaggaaagt accaatgact 1320 gtcctaccta aacttaaaag cacacttgac cgtctagtaa aagccacagt tatctcaaaa 1380 atttctaaac cgacaccttg ggtacactca atggtcatcg tagagaaaaa agatggatcc 1440 ttaagattat gcattgatcc gaaagaacta aacaagtctg ttttaagaca atacaaaacc 1500 actccatcta cagaagaaat ctctagcaaa ctttgcggca acaaagtttt cactgttata 1560 gacatggctg attgctattg gcatataaaa ctagatgaac cttcctcaga gctatgcaca 1620 tttaacactc cttatggccg atacaaattc aaccgcatgc cgttcggcat atcatgcgca 1680 tcggatgctg cacaggagat gattgaactt aatttcggtg atatccctaa tgttctggct 1740 atccacgatg acttgattat tgcagctaaa acagatacag agcatgataa aattttccaa 1800 caaattctac aacgagctcg tgaaagaaac atcaaattca acctaaaaaa aatacaattt 1860 cgagtatctg aggtcaagta tcttggcaac ataatcagcc atgaaggcat aagagttgat 1920 cctgaaaaaa ctaaagccat atctgcatac ccattgcctg catgtcaagc agatttgcaa 1980 cggctcttag gaatggtcaa ctacctgcga caattcatcc caaatatgtc agagataaca 2040 gctcccctcc gccagctgct taaaaaagac atacactggt catggaacca tgaacatacc 2100 agtgctatgg aaaaaattaa acaggttctt tcatcttctc ctgtactctc atttttcgac 2160 acgtcaaaag atgtccaaat tcaggtagac gcatcttcac atggattagg agcctgtatt 2220 atgcaagaag gtcacccgat cagctatact tctcgtagtt taaccatagc agaacagaga 2280 tatgcccaga tagagaaaga acttttagca atagtttttg cttgtgaacg tttcaaccaa 2340 tatacctatg ggcgacaggt ctcaatagaa agtgatcacg aacctcttga atatatcttg 2400 cacaagccgc tgtcagaagc accacctcgc atacaaagac ttctgataag actccaaaaa 2460 tatcaagtta tagtaaaata tgttccagga aaagatctac acattgcaga ctctttatca 2520 cgtgctcacc tcaacatttt tgacgaggat gatagcctaa atgaggactg cagcgttatg 2580 gtccacactt tagttcaaaa tttgccaata tcaacagata gactaaaaca attgcagcac 2640 gacactaaac ttgaccctgt gctctctcaa ataacaaact atataactta tggttggccc 2700 cgaaaccgac aaatgctctg tggtgaagaa aagaaatatt ggaccatcag aaacaactta 2760 cacatagcag agggaataat tttaaaagaa aataaaatag tcataccaac tgatatgaga 2820 cccctcatat taacacagct tcacttatca caccttggca ccgaaaagac gaaagctaga 2880 gctcgaaacg ttgtttactg gccaggactt acaagcgaca ttgataaact aatactcaat 2940 tgccataaat gccttaaata tcgtaataac aacaaaaagg aaaaaatcat tcaacacgac 3000 attcctgact taccgtggag taaagtagga tcagacatat atgagttgca tggaaaaatc 3060 tatgtaattg ttatagacta cttttctaaa tttatagaaa acagtgtcat tcctgacaaa 3120 acggcatttt ctgtaatcaa atttatgaaa accattttta ctcgtcacgg aataccttca 3180 agtctcattg ccgataacaa cccctacaac agtgcagaat tccttaaytt ctcaaaagaa 3240 tatgggttta acttcactcc atcaagtcca aactatcccc aatcaaatgg tctcagtgaa 3300 atgggtgtaa aaattatgaa gaaaatacta aaaaaatgcg acaaccctga acttggtctt 3360 ctagaatacc tcaacatgcc tctcactggt atggactact ccccatccca acttctcatg 3420 aacagaagaa cgcgaacaac cctacctgtc caccatgact tacttatacc agcagttcct 3480 gtaaatgctt atagtcagat caccaagtcc agaaatcgtc aaaaacaata ctatgatcgg 3540 aacgcatctg aactcccaaa tctcgtagca aacgacactg cccgtatgag aaaagatgga 3600 aaatggaaga aagtgacagt taaagaaaaa ttaacattac ccagatcata caacgtggta 3660 gataagtacg gaagagtata tcgacgcaat cgtaaacatc taattaaaac caatgagcaa 3720 ccactagtat atcagcatga acaacttgat ttgccatacg atttaccggc tgttatcaca 3780 tctcctacat caaatgagga aacagcaaat aatgaaatag aagctacgtg tattcctgaa 3840 actcaatcga ctgaaccgca gcctgacttg cgaagatctt taaggactaa aaacagacct 3900 gcctacttga gggacttcat ttaacagtgt atattataaa gaacttgttt aactatatga 3960 taaatacaca tcttaaactt taaaaaaaaa aaatttaaga aggaaga 4007 // ID CR1-58_AAe repbase; DNA; INV; 5159 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-58_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5159 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1145-1145 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 337..1146 FT /product="CR1-58_AAe_1p" FT /translation="MVCNICQQAVSDSDRISCRGYCDKSFHMNCVNADYDL FT REVLATHERNVFWMCNDCADLFARDHFRQLTAGCSNNEIPDISAIKSIKDD FT IAGLKQAFGELSAKVDCNPCTPTLKNPWRAVGRIERTLPNTPKRARMEIQP FT VIERPIQCGTKTASTTVKTISPPEDLVWIYLSAFDPSTSDKDITNLARECL FT GMDSNVNPKVVKLVPKDKDVSTLSFITFKIGLSKNLREIALSNNTWPENVH FT FREFVSYSKNQRPVVKVATAAAPPNTGVQ" FT CDS 1191..4874 FT /product="CR1-58_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAPNPPDPVVLVSSCQQSRSGPEVEDGVGVFQTPFA FT GKYPLNDIHSPPERFSIPSTVADFNSTLLPVSTPGRTVSSLMEAPYPSDLV FT ELSTDSRHHSRPGPSVGCGDGVFQHPSAGEYLFHHDFWPSNSFPAFSFITD FT VFGSRDGVYQHSTSNAVSCPGNFINDVENTTSAFELSTEISGDPERTACRT FT EEASDPLNPVVLPDAVLPSRSGPVVEIGDEVFQASLSGKYSSIRNNALLDS FT CLTFSQPSGNHSPQPDICPGQNGSPAPEHVLTIFYQNVRGLRTKIEDFFLA FT VTDSQYDVIVLTETWLDDRITSAQLFNNQFTVFRNDRNHLNSNKARGGGVL FT IAVSRRLSCCLDPTPVSSSLEHLWIKIKTPYRTTSIGVMYLPPDRKSDVRI FT IESHIDSINSILSHLEVNDFALLFGDYNQSGLLWNTSPNEAPSIDVLRSHF FT PIGCSSLLDGFNLNGLTQINSIFNRNLRMLDLVLTNESVLPYCTVSEALEP FT LVSLDLDHPALETSVRLPLPTQFVEMADLPQLDFHRTDYESLKRILNNADW FT NAIEMCTSVNEAVDYFVMVLMQAISISTPLRRPAPKPQWSNSHLRHLRRQR FT SKALRKYCKSRNPYNKRVFSLASNEYRIYNRYRYMLYTRRTQENLRRNPKQ FT FWSFVNEKRNEGGLPTEIHLGQLVAQTPLEKSNLLAQHFKSVFNDYFATQA FT HAEEATRATPTDAFNFTVFDVSPALVVSAIKKLKSSNASGPDGVPSCIVKK FT CSTELAEPLSMLFNLSLRHGEFPRQWKTSFMFPVYKKGDKQNVENYRGITS FT LSACSKVFEIIVNDALFASCKNYISSDQHGFFPKRSVSTNLAPFISLCVRS FT MDAGTQVDVVYTDLKAAFDRVDHVILLAKLEKLGVSSQLVRWFKSYLTDRV FT LCVKLGSAQSDTFTNQSGVPQGSNLGPLLFSIFINEVGIILPRGCRSFYAD FT DVKLYLVVRCAEDCLRLQNMINCFEQWCSSNFVTLSTSKCNVITFHRKLKP FT ILHEYKIGSQLLQRVDNVRDLGLHLDTALTFNIHYSDIIAKANRQLGFIFK FT IASEFRDPLCLKALYCSLVRSILEFSDVVWCPYQITWITRIEAIQRRFIRY FT ALRSLPWDDPVNLPPYEDRCQLLGLETLQKRRAVNQAVFAAKSLLGELDAP FT ALLDEFEIYAPTRNLRSRAFLNLGHRSANYSLHDPIRFMSARFNEYYEHFD FT FNRSALAFRHSLQRNTVP" XX SQ Sequence 5159 BP; 1384 A; 1191 C; 1056 G; 1528 T; 0 other; tctggcatcg ctgtcgtaat agttgttgac aatttttaac ctgaagtatt tttccgattt 60 tttgtgtttt atttcgttta ttaaacttca catagtgatt gtttgattga ttaagatcct 120 cgaatttagt aatctctcga agtaactgct attcgtcatt agttattgtg tttttcaacg 180 gatttgtctt ctgtgcacag ttgagattca ttctctcagt cattattcaa caaacgttct 240 gtcgacttgc ctacgtcaca tcttctattc atattgaacg caagttgttg ctattaccgg 300 aagcctacct aaaggcgttt gtgcgaaaca gaaaatatgg tttgcaacat ttgtcaacaa 360 gccgtttcgg attcagaccg catttcgtgt cgcggctact gtgacaagtc attccatatg 420 aattgcgtca atgctgacta cgaccttcgt gaggttttgg ctacacacga gcggaatgta 480 ttctggatgt gtaatgattg tgctgattta ttcgcccgag accactttcg tcaactaact 540 gcaggttgca gcaataatga aataccagac atttccgcca tcaagtctat caaggatgac 600 atcgcaggcc tgaagcaagc tttcggtgaa ctttcggcta aagttgattg caacccctgt 660 actccgaccc tcaaaaatcc ttggcgagca gttggtcgaa ttgaacgtac tttgcccaat 720 acaccaaaac gtgcgagaat ggaaatacaa cctgttatcg aacgccccat ccaatgtggt 780 actaaaacag cctcaacaac agtgaaaact atttcaccac cagaggatct ggtttggata 840 tacctatcgg cttttgatcc tagcaccagc gataaagata ttacgaacct agcgagggaa 900 tgcctgggta tggattctaa tgtgaatcct aaagttgtca aactggttcc aaaagacaaa 960 gacgtgtcta cactaagctt catcacgttc aaaattggac taagcaaaaa ccttcgtgaa 1020 atcgctttat cgaataacac ttggccggaa aatgtgcact tccgtgagtt tgtaagctat 1080 tcaaaaaacc agcggccagt agtcaaggtt gcaaccgcgg cagccccgcc gaatactgga 1140 gtccagtaga atcaccttca atcaacccag agcgcgctgc ctgtagctct atggaagccc 1200 ctaatcctcc cgacccagtc gtgctcgtgt catcatgcca acaaagtcgc tctggccctg 1260 aggtcgagga tggtgtcggg gtcttccaaa ctccttttgc aggcaagtac ccgcttaatg 1320 acattcattc gccacctgaa aggttttcaa ttcctagtac tgtcgccgat ttcaactcaa 1380 ctttactccc cgtttcaaca ccgggccgca ctgtatccag cttaatggaa gccccttatc 1440 cctccgacct agtcgagctt tcaacagatt cccgtcatca cagtcgtccc ggtcctagtg 1500 tcggatgtgg agacggggtc ttccaacacc cttctgcagg cgagtatcta ttccatcatg 1560 atttttggcc gtccaatagt tttccggctt tcagcttcat cactgatgtg ttcggaagta 1620 gagatggggt ctaccaacat tcgacttcca acgctgtctc atgccctggt aatttcatca 1680 atgacgtgga aaatacaaca tctgctttcg agttgtcaac ggaaatctca ggtgatccag 1740 aacgcactgc ctgccgcact gaggaagcct ctgacccact caacccagtc gtgcttcctg 1800 atgcagttct tccaagtcgt tctggtcctg tggttgagat cggggacgag gtcttccaag 1860 cttccttatc aggcaagtat tcgtccatta gaaacaatgc tctgcttgat agttgtttga 1920 ctttcagcca accctctggc aatcattcac cacaacctga catttgtcct ggacaaaatg 1980 gatctcctgc cccagaacac gtgctgacga ttttctatca aaacgtacgt gggttacgta 2040 ctaagatcga ggattttttc ctagcagtaa ctgattcgca atacgatgtc atcgttctca 2100 ccgaaacctg gctcgacgat agaattactt cggcccagct attcaacaat caattcacgg 2160 tttttcgaaa cgacaggaac catctcaaca gtaataaagc tcgtggtggg ggggtcctga 2220 ttgctgtgtc caggcgacta agttgttgcc ttgatcctac accggttagc tcttctttgg 2280 aacatctgtg gatcaaaatc aagaccccgt acagaactac aagtattggt gttatgtatt 2340 taccccctga tcgcaaatcc gatgtgcgca tcattgagag ccacattgat tcgattaatt 2400 ctatactctc gcatttggaa gtgaatgatt tcgcgcttct gtttggcgat tacaaccagt 2460 ctggattatt gtggaacaca tcaccgaacg aagctccttc cattgatgtt ttgcggtcac 2520 actttccaat tggttgcagc agtctccttg acggattcaa tctcaacggt ttaacacaaa 2580 tcaattctat tttcaataga aatttgcgta tgcttgatct tgtgcttacg aatgaatctg 2640 ttctaccata ctgcacagtt tccgaagcgt tagaaccgtt ggtcagcctt gatctagatc 2700 atcctgcgtt agaaacgtct gtgaggttac ccttgcccac tcaattcgta gagatggctg 2760 atttgcctca actggacttt cacagaacag attacgaatc cctaaaacgt atcctcaaca 2820 atgctgactg gaatgcaatc gaaatgtgca cctccgttaa tgaagcagtg gattactttg 2880 ttatggtttt aatgcaagct atttcgattt ctacccctct gcgcagacct gctccgaaac 2940 ctcaatggag caatagtcat ctacgtcatt tgcggcgtca gcgttctaaa gcactacgaa 3000 aatactgtaa atctcgtaac ccttacaata agcgagtgtt ctctttggcc agcaatgaat 3060 atcggatata caacagatac cggtatatgc tttacacacg tcgcacacag gaaaatctcc 3120 gaagaaaccc caagcagttc tggtcgttcg tcaatgagaa aaggaatgag ggaggcctcc 3180 caactgaaat tcatctggga caattggtgg ctcagacgcc gctggagaaa agcaatcttc 3240 tggctcaaca cttcaaaagc gtattcaacg attactttgc aactcaggct catgctgagg 3300 aggctacaag agcaactccc actgatgcat tcaactttac agtctttgac gtttctcctg 3360 ctcttgttgt atctgctatc aaaaagctga aaagttctaa tgcttctggt cctgacggtg 3420 ttccatcctg tatagtgaaa aagtgttcaa cagagttagc cgaaccttta tcgatgttgt 3480 tcaacctctc cctacggcat ggcgaatttc cgcgacaatg gaagacatcc ttcatgttcc 3540 cagtatacaa aaaaggcgat aagcaaaacg tagaaaatta ccgaggcatc acatcactta 3600 gcgcctgctc aaaagtgttt gaaatcatcg ttaacgatgc tctctttgct agttgcaaga 3660 attacatatc cagtgatcaa catggatttt tcccaaaacg ctcggtatct acgaatcttg 3720 ctccgtttat ttcgttatgt gtgaggagta tggacgctgg aactcaagtt gatgttgttt 3780 atacagatct taaggcggcg tttgaccgcg ttgatcatgt gatactgctg gcaaagttag 3840 aaaaactagg cgtgtcctca cagttggttc gatggttcaa atcataccta actgatcgtg 3900 ttctctgtgt gaaacttggt tctgctcagt cggacacatt tactaatcag tcaggggtgc 3960 ctcagggcag taacctagga ccattgctgt ttagcatttt tattaacgaa gtcggaatta 4020 ttcttccacg agggtgtcgc tctttctatg ctgacgatgt caaactatat ttagtagttc 4080 gatgcgccga agactgcctt cggttgcaaa atatgataaa ctgctttgag caatggtgct 4140 cttctaactt cgttacattg agtacatcaa aatgcaatgt tatcaccttt catcgcaagc 4200 tgaagccgat tctacatgaa tacaagattg gcagtcaact attacaacga gttgataacg 4260 ttcgtgattt aggactccat ctcgatacag ctttgacttt caacattcac tactcggaca 4320 tcatcgctaa agcaaatagg cagctgggat tcatcttcaa aattgctagc gaatttcgtg 4380 atccattgtg tttgaaagct ctatattgct ccctcgtccg gtcgatatta gaatttagtg 4440 atgtagtttg gtgtccatat caaataacct ggattactcg aattgaggct atccagagaa 4500 gattcattcg ctatgctcta cgatcgctac cctgggacga tcctgttaat ttgccacctt 4560 atgaagaccg atgtcagctt ttgggccttg aaactctaca gaagagaaga gctgtcaacc 4620 aagcagtatt tgcagctaaa tccttgttag gagaattaga tgctcccgca ttattggacg 4680 aattcgagat atacgctcca accaggaatc tccgctccag agcctttctt aaccttggac 4740 accgatccgc caactatagc ttgcacgatc caatacggtt tatgtcagcc agattcaacg 4800 agtactacga gcattttgac ttcaaccgat cagctctcgc ctttcgacat agcttacaac 4860 gtaatacggt accgtagggc atgatctctt gatacagttg tttggcgatt ggcattatgt 4920 ttaattgatt ttaatatgtt tttttttttt gtgtgcttaa ttgttattgt tagatataag 4980 ttgtaattct tgtgtaatgt tttgtgaaga aaagatatgg ggtttttatg cttttttgag 5040 taaggtttat atgtcaacca actcaagggg gcttttcctc atcatatatt tcattaagac 5100 tgagagtcag atgaaattaa ataaataaaa taaataaata aataaataaa cgaatgaaa 5159 // ID BEL-244_AA-I repbase; DNA; INV; 5600 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-244_AA_; KW BEL-244_AA-LTR; BEL-244_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5600 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [4593-5201] - Integrase core CC 'GATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 323..1321 FT /product="BEL-244_AA-I_1p" FT /translation="MENLKQLIHQRGQVKARVTTIVQKLNEAIEHPETTSA FT SQLKAFEKKLELHYSEYSAKHDSIMAQCPNVSVAEQDEKLEEFDELHTDAL FT VKLNRLVDYFRAEPANNRVPQVIVTQQPLRTPIPTFDGKYEGWPKFKALFD FT DLVGKCGDSDATKLQYLDKALIGEASGILDAKIINNNNYEQAWQLLEDRFE FT NPRVIIDTHISGLLSMKPVAKQSYKELRDLVDACNRHVEGLRFLEQEVSGA FT AGLIVVKILSMCLDGETREQWELTMDHGELPDLDETMEFLRSCCQVLERCE FT TGKSVMPSSKPVVAKPPVTAKTSFSPRSSHPAASTSSGERV" FT CDS 3774..5600 FT /product="BEL-244_AA-I_2p" FT /translation="MKKSPGQLQVYVRNRVAEINKLTGDYEWKYVRSESNP FT ADLVSRGRYPRVLSCSEIWWNGPVFLQADDYEIQETPAIKDEELPEMKEVQ FT VCNPANEVEEEPVFDRCGAFLKLQRVLAQVVRFTRLVRVKKEDRINFRYVS FT VHDMRKAMQYIVRVLQKSELRQEIRCVQRGELPKRLANLRPFLDEEGFLRV FT GGRLQNSKLPFDAKHQLLLPKDHRVTETLIRQYHEDRLHEGPSGLLAAIRQ FT KYWPVNARSAIRKVTRSCVKCFRTKPRGVQPLMGNLPEERVNSAAAFEFTG FT VDYAGPVTVKEGRYRPKQIKAYIALFVCLATKSIHLELVSDLTTEAFLSAL FT DRFVNRRGPVRKMMSDNATNFVGASKELHQLFLMFRDETEKAKIDDFLLKR FT DIEWEFIPPRSPNFGGLWEAGVKVIKSHLHRTLGNAILTFEEFGTVLTHIE FT AVVNSRPLYSMSDDPNDPLPITPAHLMIGRPLEPIAKPSYTGIPVNRLSRR FT QYMNHLREQFWAKWSRDYLSSLQSRAKWTKSEVNVKTGTIVLLMEDNLPVQ FT SWRLGKIVALYPGKDKVIRVADVKTSAGVFRRSVRKLAPLPIVDNDEQRIS FT AFGIAFQPAGV" XX SQ Sequence 5600 BP; 1443 A; 1267 C; 1543 G; 1311 T; 36 other; tttggtccga atcgaaccgg ataataatca gtgatacgtc cattcgagac taagaaagtg 60 tttccctcca ttacgagact gtgttcctcc taccgtgata aattcgtgac cctttcgaac 120 tgaattaaga gagttgtgtc cttcgcgaac attcgtgagc cctcccattg agacctttgt 180 caagcgcgct cttgtcaata gtgaatcgta gtacagtggt acccaagtgt agtgaaaagt 240 aagagccctc tgtgaatgaa ctaaagtgaa gtgatagaag cgaaatagcc ctgtgaaagt 300 gtgaaacagt gatagtgtga ccatggaaaa cctgaagcag ttgatccacc agcgtggaca 360 agtgaaagcg cgtgtgacga cgatcgttca gaagctgaac gaagctatcg agcatcccga 420 aacgacgagt gcctcccagt tgaaagcttt tgagaagaag ttggagttgc attattcgga 480 gtattcggcg aagcatgatt cgatcatggc tcagtgtccg aatgtgagcg tagcagaaca 540 agatgagaag ctagaggagt ttgatgagct gcatactgac gcgctagtga agcttaatcg 600 gctagtggat tacttccgtg ccgaacccgc caacaatcga gttccccaag tgattgtgac 660 gcagcaacct ctcagaaccc cgatcccaac tttcgacggg aagtacgaag gctggccgaa 720 gttcaaagcc cttttcgacg acttggttgg aaaatgtggt gactccgacg caaccaagct 780 gcagtacctg gataaagcgt tgattggtga agcttccggc attcttgatg cgaagatcat 840 caacaacaac aattacgagc aagcatggca gttgctggaa gatcgtttcg agaacccacg 900 agtgattatc gacacccata tttccgggct gttgtcgatg aaaccggttg ccaagcagag 960 ctacaaggag ctgagagacc tcgtcgatgc ctgcaatcgt catgtggagg gattgcggtt 1020 tctggaacaa gaagtcagcg gagcggctgg cttgatagtg gtgaagattt tgtccatgtg 1080 tcttgacggt gaaacgcggg agcagtggga gctaaccatg gatcatggag aactgcctga 1140 ccttgacgag accatggagt tcctcagaag ttgctgccaa gtccttgaga ggtgtgaaac 1200 cggcaaatca gtgatgcctt cttccaagcc agttgtcgcc aaaccacctg tgacggcgaa 1260 gaccagcttt tcgcctagat cgtcccatcc tgcagcttcg acatccagtg gagaacgtgt 1320 gtgagatatg ccgaggccag cactgcaact ataagtgtcc gtccttcttg aagtatgagt 1380 gtcgatcagc gtgttgccaa agcgaagcaa tccgggctgt gcttcaactg tctccggaaa 1440 ggccatcgaa tcgaggcttg tccttcggag ctaagtcatg ctcgaagtgt tccgaaaggc 1500 atcactccat gctgcatctc gagcaagaat cgccccgacc agtgccgatt cctaagcagg 1560 aagataattc gaatgtatcg tgctcagcaa gccagaagtg aagacggcag tagcgtctgc 1620 tgttcccgag gaacctgtgt ctacagcgtg ttccagtgtt cagcgtcgag caaagcatgt 1680 gttcctgatg acggcgttgg tgaacgtagc ttcaaagagt ggaaaggtgt tcaagctgcg 1740 tgcccttcts gattccgggt cmcaggtcma cmtagtgtcc gaggccgckg ctaaastgct 1800 ggggctgcct aagtaccccg cgcaacgtaa cggttgttgg agcaggtggt gcgaaaactc 1860 aagtcagaaa gggggtgatt ctgaagcttt cgtccggata cgccaacttc gaaggcggct 1920 tggastgttt ggtstcaacg aaggtccgac tggaacgatc ccatccgtac cggtcaacgt 1980 gtcagagtgg aacatttccg actggaatac agttggcgga tcccaatttc tacgagccaa 2040 gagatgtgga tttgttgatt ggcgccgagc tacgtttggg atctgctgga gcgagcacag 2100 ctgaaactag cggacggtac taccttcact tcgagagacc gacttgggat ggatcatcac 2160 cggtacgttt gaggagtccc ggcgatgagg cgcgttaatc cgtgttgtst gccaacgtcg 2220 cgcgaggacc cgttactgga agctattgag aaatttttkg acggtcgaag aacttccaga 2280 ggtgaaagtt cgccaaccag tgaagaasag gaagttgaag catcatttcc ggaaacgtat 2340 cgccgagatg aagctggccg atttgtggtt cagatgcctt tccgggagac tgtcagtgag 2400 ctagaagaca atcgagsact tgcactgaag agatttctag cctgcgagaa gagtctagtc 2460 gcgcaatcca gagacgaagc ggatgtaccc agcattcatg aaagaatacg aaggtcttgg 2520 tcactgcaag agattcacga ggaagacgac gtacccttcg caacagaatt attacttacc 2580 gcaccatgcg gtactcaaac ctcgagttcc scacaaagtt gmgtgtkgtg ttcgatgcca 2640 gtgcgaagtc tggcgggcta tcccttaacg atgtgttgaa gatcggcccs acagtgcaga 2700 gtgatctatt ctccgtcgta ctgcgattcc gaaagcacct gtatgcattt tcgggagacg 2760 tagcgaaaat gtataggcag gtgaatatcg atcccggcca gacgcacttc ctgcgcatct 2820 tctggagaga saatccgcgg aaccgatacg agtactggag ctggcgacag tgacatacgg 2880 gacggcacca gcacccttcc ttgcagcgag gtgcttggtt caactggcga gagacgaaat 2940 gaccagtttt cctgaagctg cagaggccat actagaagat tgttatatgg atgacattct 3000 gagtggctct tcaacattag aatccacaag gcagctgcga tgcgatattg cggatcttct 3060 gacgaaaggc aaattcccca tccgaaagtg gtgttcaaac gatgatgaga gtgcttgaag 3120 gaattccgga tgaggatcga gagaagctag ttcgtattga ggagtctggt gccaacgaag 3180 ctattcgagc ccttggtgtt ttgtggaatc ccaaaagtga taagtttttg ttctggagaa 3240 gtccggaatc gatacaaccc gacgagcagg taacgaaaag gaatgtcctc tctcaaatag 3300 caaggctttt cgaccctctc ggcctgatat cgccggtaat tgtgttggcc aagtctatca 3360 tgcagcagct atgggcagat ggtttggact gggacgagaa gcttgagaac gagttgttaa 3420 ataggtggat gacctttcat cagtcacttg tgcagctcaa cgagattcaa gtgccgagat 3480 gtgttgtgat tcctggagcg catcgtattg agatccacgg wttttskgat gcctcaggca 3540 ctgcctacgg tgcatgtstt tatctgcgct gtgtccagga gaatggagaa gtctctgtta 3600 gattgttgtg cggcaaatcg agggttgctc cattgmgkcg tmtcaccatk cccagamtgg 3660 agctgcktgc cgctgtcgtk ctggccmgac tagtcagcgt agtgkcgmct atattgaagm 3720 tcgaagtckm cgatatcckg ttgtggtctg atagccagat tgttctggca tggatgaaga 3780 agagtcccgg acaacttcaa gtctacgtca gaaaccgagt ggcagagatc aacaaactga 3840 ccggcgacta tgagtggaag tatgtcaggt cggagagtaa tccggcagac ctggtgtcca 3900 gaggacgtta cccaagagtg ttgagttgtt ccgaaatttg gtggaacggc ccagtatttc 3960 tgcaagctga cgattacgag atccaggaga ctccagcgat caaggatgaa gaattaccgg 4020 aaatgaagga agtgcaagtg tgtaatccag caaacgaggt agaagaagag ccagtgttcg 4080 accgatgcgg cgcgtttctc aaactgcaac gtgtactggc gcaagtggtg cgatttactc 4140 gactggtacg cgtgaaaaag gaggaccgca tcaatttccg ttacgtttcc gtgcacgata 4200 tgcgaaaggc aatgcagtat atagtgaggg ttttgcagaa gagtgagtta aggcaagaga 4260 tccggtgtgt tcagcgcggg gaattgccga aacggcttgc gaatctgcga ccatttttgg 4320 atgaagaagg tttcttgcgc gttggcgggc gtctgcagaa ctccaagcta ccattcgacg 4380 ccaagcacca gttgttactg ccaaaagatc atcgagtgac cgagacgctg atccgtcagt 4440 accacgagga cagacttcac gaagggccat ccggtctact ggcagcaata aggcagaagt 4500 attggccggt caacgctcgg tccgccattc gaaaggtgac acgcagctgc gtaaaatgct 4560 tccggactaa gccacgaggt gttcaacccc taatgggaaa tctaccggaa gaacgtgtga 4620 attcagcagc agcgttcgag ttcaccggtg tagattacgc cggtccggta acagtgaagg 4680 aaggcagata caggccgaag caaatcaagg catacatcgc tctttttgtg tgccttgcga 4740 cgaaatctat ccacctcgag ttagtttcgg atttgacaac cgaagctttc ctttcwgcct 4800 tagatcgctt cgtcaaccga cgtggaccgg tgcgcaaaat gatgtcagat aacgcgacca 4860 attttgtcgg cgcatccaag gagctccacc agctgttctt gatgttccgc gatgaaaccg 4920 agaaggcaaa gatcgacgat ttcctgctca aacgtgacat agagtgggaa ttcattccgc 4980 caagatcccc aaactttggg ggtttatggg aagcaggtgt caaggttatc aagtcccacc 5040 tacatcgaac tttgggaaac gcgattttga cgtttgagga gtttggaacg gttctaactc 5100 acattgaagc cgttgtgaac tcgaggccgt tgtattcaat gtctgacgac ccaaacgatc 5160 cacttccaat aactcctgct catttgatga ttggaagacc attggagccg attgcgaagc 5220 catcgtatac cggaatacca gtgaaccgtt tatcccgtcg tcagtatatg aaccatttgc 5280 gtgaacagtt ttgggctaag tggtcaagag actatttgtc atcgcttcaa tcaagggcaa 5340 aatggacgaa aagtgaagtt aatgtgaaaa cgggtaccat tgtgctactg atggaggaca 5400 atttaccggt tcaatcttgg aggctgggaa aaatcgttgc actttacccg gggaaagaca 5460 aagtgatccg agtcgccgac gtgaagactt ccgcgggtgt ttttcgtcgt tccgtccgaa 5520 agttagcgcc cttacccatc gtcgataacg acgagcagag gatttccgct tttggaattg 5580 cattccaacc ggcgggagta 5600 // ID BEL-223_AA-I repbase; DNA; INV; 3095 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-223_AA_; KW BEL-223_AA-LTR; BEL-223_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3095 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 903-903 (2011). XX DR [2] (Consensus) XX CC 'ATTAG' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS join(294..1295,1299..2840) FT /product="BEL-223_AA-I_1p" FT /translation="MPTERRIRALKVRQKSLVASLNLITAFVEDFDEETQA FT NEVPVRLESLTQLWSDYNLNQNELETLDEAAIDVHLKERTLLESHYYKVKG FT FLLAHNKSPINQTLSSPTHSSLQIPPSASHVRLPDVKLPIFDGNLENWMNF FT HDLYMSLVHSSAELSNIQKFYYLRSSLSESALQLIQSIPISAINYPVAWNL FT LLEHFQNPARLKQTYVDAIFEFPSLRKESASELHSLVEKFEANVKVLQQLG FT ERIEHWDILLIRLLSTRLDPTTRRDWEEFATNKAAVTFKDLTGFLQRRVTV FT LQSLQPKTVIDTQSSIPLKKPAQRSASSYGANQVNLRKCVICDSHPLYLCE FT SFANLPLEEKESEVRRHQLCRNCLRKGHMSKDCSSSSNCRKCRGRHHTQLC FT SIDAALSSESTSSSYHSSPTTETPSTSGLPSLSASATRTESISCASSGCTQ FT KTVLLATALIDIIDDEGNKHTARALLDSGSECCFVSEQFAQRIKARRKRIN FT MPITGIGQATTYARTKFVSRILSRVGEYSTNIEFVVLPKVTVNLPATSIDT FT SYWNMPPGIKLADPTFDCTNPVDVVIGAEVFFEVFRSPGRIPLGNNLPELV FT NSVLGWVVCGKSDINRSTPIVANFAIIARSSIATLTEHLPHPKETRGTKSS FT IKSNFQGQQASPTSRRHHTEADTKSSRNQAQLRVSTPIATSALNEVTTLTQ FT EIVVKPSYRNASSDSADINRSIHANGSNRQLIHATSFFNQNVTLLQLTALN FT RIERDNSCQQHRSFTGARAVTNQQQLHRCLSSNSPPPVNKLKENVFTAKNY FT QLRSASSSSAKGANLSTVSQQHNLNETVMCPLIPLTRPIIGQKRIG" XX SQ Sequence 3095 BP; 860 A; 910 C; 617 G; 702 T; 6 other; ttggtccttc gagccggata cggtcgtcga cactccgtcc gttccgaaac ctctacgatt 60 cgtttgaaac cgccatcgct acagaggaca gcctgtttcg catccgccat catagttcac 120 tccgaaacat cgcgaaggat cgctgaagtc cgccatcatt acgakctctt magaatttaa 180 tacaaggcaa ttgcattgcc tctagaaggt aattagggtt cctattaacc tatcccctgt 240 tgtgtgcttg ctacgctgat cgtttgtcca cagacccacc attctccgtc gccatgccta 300 ccgaacgtcg catcagagcc ctgaaggtgc gacagaagag cctggttgca tcgctcaacc 360 tgatcactgc gttcgtcgaa gacttcgacg aagaaacaca agccaacgaa gtgcccgtcc 420 ggctagaaag tctcacccag ctgtggtcgg actataacct aaaccagaat gagctggaaa 480 cactcgatga agccgccatt gacgtccacc tgaaggagcg aacgctgcta gaatcccact 540 actacaaggt gaagggcttc ctactggccc ataataaatc tccaatcaac caaacactct 600 cgtctcccac tcattcatca ctgcaaatcc ctccgtcggc atcacacgtg cgattacccg 660 acgtaaaact ccccatcttc gacggaaatc tggagaactg gatgaacttc cacgatctat 720 acatgtcgct tgtccactcg tcggccgaac tatcaaacat ccagaagttt tattacctca 780 ggtcttcgtt gtcggaatcc gctctgcagc tcattcaaag catccctatc agcgctatca 840 actacccggt ggcgtggaac ctcttgctgg aacacttcca gaaccctgcg cgactgaagc 900 agacatacgt ggacgccatc ttcgaatttc cgtcgctaag gaaggaatcc gcatctgagc 960 ttcacagtct ggtggaaaaa tttgaagcaa acgtgaaggt actacaacag ctaggggagc 1020 gaatcgaaca ttgggacatc ctgctcattc gtctattgag tactcgactt gatccgacaa 1080 cccgacggga ctgggaggaa ttcgccacca acaaggcggc cgttacattc aaggacctga 1140 ccgggttctt acagcgtaga gtgaccgttc tccaatcgct tcaaccgaag actgtcatcg 1200 atacccaatc gtccattccg ctaaagaagc cagctcagcg ttccgcctcc agctacggag 1260 ctaaccaggt caacctccgc aagtgtgtca tctgcwctga ttcccatccg ttgtacttgt 1320 gtgagagctt tgccaaccta ccactcgaag agaaggaatc tgaggtgcgc cgtcatcaac 1380 tatgccgcaa ctgcctacga aagggtcata tgtcgaagga ttgttcatca tcgtcgaatt 1440 gccgcaaatg ccgaggccgg catcatacac agctttgttc mattgatgcc gcattgtcgt 1500 ccgaatcaac atcatcaagc tatcattcat caccaactac cgaaacacct tcgaccagtg 1560 gcctgccatc gctttctgcc tctgctacac gtactgaatc aatcagctgt gcttctagtg 1620 gctgcacaca gaaaaccgtt ctcctagcta ccgctctaat cgatatcatm gacgacgaag 1680 gcaacaaaca caccgctcga gcgctcctgg actcaggtag cgaatgttgc ttcgtttcgg 1740 aacagtttgc ccaacgcatc aaagctcgtc gaaaaaggat caacatgccc atcacaggaa 1800 tcggccaagc tactacctac gccagaacca aattcgtctc ccgcattcta tcccgagttg 1860 gcgagtactc caccaacatc gaattcgtcg ttcttcccaa ggtgaccgta aatttaccag 1920 caacaagcat cgatacttcg tactggaaca tgccacctgg tatcaaactc gctgatccaa 1980 cattcgactg cacaaacccc gtcgatgtcg tcatcggtgc cgaggtattc ttcgaagtgt 2040 tccgatctcc tggccgaatc ccactcggca acaacttacc tgagttggtc aactctgttc 2100 tkggctgggt tgtgtgcgga aagtcggaca tcaaccgatc cactcccatc gtcgccaact 2160 ttgcaatcat cgctcgttct tcaatagcta cattgacaga acaccttcct cacccgaaag 2220 aaaccagagg aaccaagtcg tcaatcaaat cgaacttcca aggtcagcaa gcatcgccaa 2280 cttctcgtcg ccaccacaca gaagcagaca ccaaaagcag ccgaaaccaa gcacaactaa 2340 gagtctcgac accaatcgcc acctctgcac tcaacgaagt aaccaccctc acccaagaaa 2400 tcgtcgtcaa accatcttat cgaaatgcat cgtctgattc tgccgacatc aatcgctcca 2460 ttcatgccaa cggttctaac cgtcaactca ttcatgcaac atcattcttc aatcaaaacg 2520 tcactctact gcagctaacg gctctgaatc gcatcgagag agacaacagt tgtcaacaac 2580 atcgttcatt cacaggggcc agagcagtca cgaatcaaca acaacttcac cggtgtttat 2640 cttctaattc accaccgccg gtaaacaagc taaaagagaa tgttttcacc gcaaagaact 2700 atcaactgcg ctctgcatcg tcatcatctg caaagggtgc aaatctatct accgtatccc 2760 agcagcacaa tctcaacgaa acagtaatgt gtccactaat tcctttaacg aggcctatca 2820 tcggacaaaa gcgcatcgga tagtcactct tcagtagcat aattgaaaca tcatcaatcc 2880 cattagaaaa catctttatt tcttggaaat tcatagtcca tcgagatcca tagagtttcc 2940 tgaaaataaa aatacaacgc ccttttgatt aagggacgag tcgaccgatt tgacgtgcca 3000 caactaacct gtcatcggag tgaacgacaa catcgagcga gaaagagagg caacatcatc 3060 gactgtcaga aatgtgattt ctgaaggggc cagta 3095 // ID ITmD37E_Ele11 repbase; DNA; INV; 1280 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37E DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37E_Ele11. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1280 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1280 RA Kojima K.K. and Jurka J.; RT "ITmD37E-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [1] (Consensus) XX CC [2] Consensus update. ~89% identical to consensus. TIRs are 30 bp CC long. TA TSDs. The consensus is ~90% identical to the original CC sequence in [1]. This family encodes a DD37E-type transposase and CC is similar to Tx_mos from Toxorhynchites amboinensis. XX FH Key Location/Qualifiers FT CDS 154..1095 FT /product="ITmD37E_Ele11_1p" FT /note="transposase." FT /translation="MPSKQEEQRIKILLAHRENPSYSHAKLAKSLKVAKST FT VTNVIKVFGERLSTARKPGSGGNRKPEAAATTKRVAGSFKRNPNLSLRDAA FT NKLGVSSTTVHRAKKRAGLSTPNRNDKQNTTAKARSRRLYTTMLTKFDCVV FT MDDETYVKADYKQLPGQEFYTAKGRGKVPDIFKHMKLSKFAKKYLVWQAIC FT TCGLKSSIFIATGTVNQEIYVKECLNKRLLPFLKKHGCSVLFWPDLASCHY FT GKKAMEWYAANNVQVVPKDKNPPNTPELRPIEKYWAIVKRNLKKTKKLLVT FT RSSLRQTGVLRRRRWTRWLYKI" XX SQ Sequence 1280 BP; 399 A; 276 C; 299 G; 306 T; 0 other; aagggtgata cggtcaaaat ttggtcaatt tcaacttgac gtatttcttt caattttgca 60 tttaaaaaac ctgaacaccc ctcattttaa aggtgtgtgt gtgtagaatg ttgctcctat 120 tttgattttg aaattcactc ttcagttgtc aaaatgccgt ccaagcaaga agagcagcgt 180 atcaaaattt tgctcgcgca tcgcgaaaat ccgagctact cgcacgcaaa gttagcaaaa 240 tcgctaaaag ttgccaaatc aaccgttaca aatgtaatta aagtgtttgg ggaacgtttg 300 tcgacagcca ggaagcctgg atcgggggga aatcgaaaac cggaagccgc tgcgacgaca 360 aagagagtag ccggtagttt caagcggaac cccaacctct ctctccgaga tgccgcaaat 420 aagctgggtg tgtcatctac aaccgtgcat cgagccaaaa aacgagccgg actgtcgact 480 ccaaatcgca atgataagca aaatacgacg gccaaagcgc gatctcggag gctgtacacg 540 acgatgctga cgaagtttga ctgcgtggta atggacgacg aaacctacgt caaagccgac 600 tacaagcagc ttccgggaca ggagttttat acagcaaaag gaagaggaaa ggtaccagat 660 attttcaagc atatgaaact gtcgaagttc gcaaagaaat atctggtttg gcaagccatc 720 tgtacctgtg gcttgaaaag cagcattttc atagcaaccg ggactgtcaa ccaagaaatt 780 tacgtgaaag agtgtttgaa taaacgtctg ctgcctttcc tgaagaaaca cggttgttcc 840 gtactgtttt ggccggattt ggcatcttgc cattacggta aaaaggccat ggagtggtac 900 gccgccaaca acgtgcaggt ggttcccaag gacaagaacc ctcccaacac gccagagctc 960 cgcccaattg agaaatattg ggctattgtc aagcggaacc taaagaagac caaaaaactg 1020 ctagtgacga gaagcagttt aaggcaaact ggcgttctgc ggcgaagaag gtggacaagg 1080 tggctgtaca aaatctgatg gcaggggtca agcgaaaggc ccggcaattc ggatttggaa 1140 aagcggaagc ctaactgaat atttttcctg aattttatac tatttgaact tgaaaaagaa 1200 atataatttg atttttaaat aaacgatttc accaatttac acgcattttc ccttgaccaa 1260 attttgaccg tatcaccctt 1280 // ID BEL-20_CQ-I repbase; DNA; INV; 6636 BP. XX AC AAWU01039777; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-20_CQ_; KW BEL-20_CQ-LTR; BEL-20_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6636 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 193-193 (2011). XX DR Genome; AAWU01039777; Positions 22799 16164. XX CC Positions [5706-6266] - Integrase core CC 'GGCAA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 24..6587 FT /product="BEL-20_CQ-I_1p" FT /translation="MASNPRKSFVRQTRSMTRAAQEAANLQIGDAQDDAAK FT PFEPSVVEPERGDEQDCAGCTRPNNAELYMVRCEKCELYFHFSCANVTTAT FT VNQPPFVCRTCVPFRPRSTRSSGSHVSSTRSARIALELQQMEEQFRIQEAL FT TKERLAQLERQYLYSSKKYALLREQEEDGGRSVRSRDSRASTSRVEKWINA FT QAETGNTGPGVNKETTESDDHVQPLSEDKADSLHVKFTSTPLASPNASFNE FT SPYVSLSISSLLPDSLEESSEKSIPEGPKDPGQTAAEIPPSEVPEPPTTME FT SMLKLLQISLGKPQNTGAIPKILKVTSTAFEEWRNSMQRKDLDPIPEELSE FT ETKKNEQNLLVLLQQLEDQRAEDQQLQRQREEQLKRKLQQQEREQAKQKQE FT LQRRLQQVEAEQKEEMQRLLQQQQRELAEQKRELKQRLKEQKQQQAKQQEE FT LDRLRELERLQKLAESRDKNHSSAAGRNGTVDKSRSTESTTGTTSVPRVDD FT DLPPPSNERSRASANFGSSSVRSSRSSSSRSSTLTTPSVVPSVESFPSLVP FT PLPAPPPYQGPTSQQIAARQVVNKELPVFTGNPVDWPIFISSYNHSTLTCG FT YTDNENLLRLQRAIQGRAREEVSSLLLNPSTIPQLLTSLKLLYGRPEQIVH FT TMIEKVRATPAPRADKLESLVSFGLVVHNLCGHLKAIGMEKHLSNPILLHE FT LVAKLPSNVMFNWALYQEQLPEVDLNVFGDYMSKIATATSGVTLFAAKAAK FT DDFRPKKEKAAYVNTHSTAEQGQRKGDDEIKEKPTDRPSSNIKVCPMCDNS FT GHLAANCSKFSKLCLDDRWKLVKEKRLCRRCLVAHSRWPCKSDPCGVQGCQ FT KKHHPLLHYEQAPTEAKRSEPATSGVVALHRQPTISTLFRMLPVTLYGKKG FT KVNTFAFLDDGSSVTLLERKLAATLGLEGKQASLCIHWTSGIKKNFSETRE FT VELEISGADRPQRFVASNVYTVDKLGLPEQTMDVAAMAEEFAYLQDLPLLS FT LQSAVPGILIGLNNVHLLATLKLREGRKGEPIATKTRLGWTVYGSMPAATQ FT SFAHRQFHISDEQAEVDLHEYVKSFFAVESLGVMAVPSEESVDDQRANKIL FT SETTKRVEGGRFETGLLWKQDYVEFPDSRPMAEKRMKCLERRLGRDPALYD FT QVRKQIADFQSKGFAHKATAAELDTFDPRRTWYLPLGVVLNPKKPGKVRVI FT WDAAAKVDGVSLNTMLLKGPDLLTPLLSVLFQFRERQVAICADIEAMFHQV FT KIREPDRSAQLFLWRDSPDKPLETMVTDVAIFGATCSPAHSQYVKNLNANE FT HEADHPKAAAAIRNKHYVDDYLDSVDTADEAVAMALEVAEVHAKAGFHIRN FT WISNDKTVLARIGAVNPTTVKNFVIEKENGFERLLGMVWLPDEDMFSFALS FT LREDNMKLLTGEVAPTKTQLLSIVMSIYDPNGLVAVFVIHGKILVQDVWRS FT GVGWKDKIPEKLIGRWKQWIALLRKIETVRVPRCYFKDYEPASLKTLQLHV FT FVDASEQAYSAMAYFRLEDRGQVRCSLVATKTKVAPLQLLPIPRLEVQSGV FT TGSRLRKTIEDGHSLPISKVVFWSDSKTALQWIRSTDLRRFRPYVAFRVNE FT ILSLSKAAEWRYCPSRMNVADEATKWGKKGPTFDPDAQVYVCHKFIYDQEE FT DWPEDCLERVEATEELRPAFMFSHFVVMPIIRLELFSRWERLLRMVAYVHR FT FLARRLKLKQETCPGALTREELQKAERSLWRLAQSDAYPDEVATLNQNLLV FT PAEKRECLETSSSILKLSPLLDDSGVLRAGSRLEAAEFAAFDAKFPVILPN FT KHRVTWLLVDSYHRRFRHANNETVVNEIRQKFNIPKLRVLIRQVASKCCFC FT RIKKAIAVAPMMAPLPRVRLSPFVRPFTFTGVDYFGPVLIKVGRSVAKRWV FT ALFTCLTIRAVHLELVASLSTDSCKKAIRRFIARRGAPQEIYSDNGTNFQG FT ASGELSKELAKVNHNELSSTFTDIHTQWRFNPPAAPHMGGCWERMVRSVKA FT ALGVIPVERKLDEESLVTLLAEAEHMVNSRPLTFVPLESADSESLTPNHFL FT MLSSSGVQQAVKDRVGVGAAIKNSWNSIQHALDEFWCHWVKEYLPTIARRT FT KWFEEVRPIKEGDIVLVVDEGHRNGWLRGRVARTYPGKDGRVRRADVQTSK FT GQVGSAGRW" XX SQ Sequence 6636 BP; 1603 A; 1848 C; 1954 G; 1231 T; 0 other; gaaactgaaa gaatttattt acgatggcct ctaatccccg taagtcgttc gttcgccaga 60 cccgctcaat gacgcgcgcg gctcaagaag cagctaacct ccaaattgga gacgctcaag 120 atgatgcagc gaaaccattc gagccgtctg tcgtcgaacc ggagcgaggg gatgagcagg 180 attgcgctgg atgtaccagg ccgaacaacg ccgagctgta catggtgcgg tgtgaaaagt 240 gcgagctgta cttccacttc tcgtgcgcga acgtcaccac cgcaaccgtc aatcagccgc 300 cgtttgtgtg ccgtacttgc gtgccctttc ggccaagatc tacccgttca tcgggctccc 360 atgtgagcag tactcgttcg gcacgaattg ctctagaact gcagcaaatg gaggagcagt 420 tccggatcca ggaagcgctt acgaaggaac gactcgcgca gttggagcga cagtacctgt 480 acagctcgaa gaagtacgcg cttcttcggg aacaagaaga agatggggga cgtagtgtcc 540 gtagcagaga cagccgcgct tcgacgagca gggttgagaa gtggatcaat gctcaggctg 600 agaccgggaa caccggtcca ggcgtcaaca aggagacgac cgagtctgac gatcacgtcc 660 agccactgag tgaggacaag gccgatagtc tgcacgtgaa gttcacgtct actccgctcg 720 cttcgccaaa tgcttcgttc aacgaatcgc cgtacgtttc cctctccatt tcctccctat 780 tgccggacag tctcgaggaa agcagcgaaa agtcgatccc tgagggaccc aaggaccccg 840 ggcagaccgc ggctgaaatt ccacccagcg aggttccaga accacctacc accatggaaa 900 gcatgctcaa gctgctgcaa atctcactgg ggaagccgca gaacaccggt gctatcccga 960 agatcctcaa agtcacctcg accgctttcg aggagtggcg aaacagcatg cagcggaagg 1020 atctcgaccc gattccagaa gagctgagcg aagagacgaa gaaaaacgag caaaacttgc 1080 ttgtgctcct tcagcagctc gaggatcagc gcgccgagga ccagcagttg cagcgccagc 1140 gggaagagca gctcaaacgc aagctgcagc agcaagaacg tgagcaagcg aagcagaagc 1200 aagaactgca gcgtcgactt cagcaggtgg aagcagagca gaaggaagaa atgcagcgtt 1260 tactgcagca gcaacaacgc gagctggccg agcagaagcg ggagctgaaa cagcgtctca 1320 aggagcagaa gcaacagcaa gccaagcagc aagaagagct tgaccgtttg cgagagctcg 1380 agaggcttca aaagctggca gaatctcggg acaagaacca ctcgtcggcg gcgggccgca 1440 acggcacggt cgacaaaagc cgatcaacag aaagcaccac aggaacaaca tctgtaccac 1500 gcgttgacga tgacctgcca cctccatcta acgagcgatc tcgcgcctct gcaaactttg 1560 gcagttcttc cgtaaggagc tctcgatctt catcttcaag gtcgtctact ttgacaacac 1620 cgtcggtcgt accttccgtt gagtcattcc cctctctcgt gccgccgctg ccggcaccgc 1680 ctccgtacca aggaccgact tcgcagcaga ttgctgcacg tcaggttgtc aacaaggagc 1740 tcccagtctt cacagggaat cctgttgact ggccaatctt tatcagcagt tataaccact 1800 ctacgctgac gtgcggatac accgacaacg agaacctgtt gcgactgcag cgtgcgatcc 1860 aggggagggc gagagaagaa gtgagcagcc tgctcctcaa tccgtcgacg ataccgcaat 1920 tgctgacctc tctgaagctg ctgtacggac gaccggagca gatcgttcac acgatgatcg 1980 agaaagtacg cgcgaccccc gcgccgagag cggacaagct ggagtctctc gtctcgttcg 2040 gtctggtggt ccacaatctc tgtgggcact tgaaggctat tggcatggag aagcacctgt 2100 cgaaccccat tttgctgcac gagctcgttg ccaagttgcc ttccaacgta atgttcaatt 2160 gggcactcta ccaggagcaa ctgccagaag tggacctcaa cgtgttcggc gactacatgt 2220 cgaaaatagc gaccgctacc agcggcgtga cattgttcgc tgcgaaggct gccaaggacg 2280 atttccggcc caagaaggag aaggcggcgt acgtcaacac gcactcgact gccgagcaag 2340 gtcagcgcaa gggtgacgac gagatcaagg agaagccgac ggaccgtccg tcttcgaaca 2400 tcaaggtgtg cccgatgtgc gacaacagcg gccatttggc ggccaactgc tcgaagttca 2460 gcaaactttg tctcgacgac cgttggaagc tggtcaagga gaaaagactt tgtcgtcgct 2520 gtctggttgc ccactcgcgc tggccgtgca aaagcgatcc ctgcggagtc caaggttgcc 2580 agaagaagca tcatccgctg ctccactacg aacaagcacc cacggaagcg aagcgcagcg 2640 agcccgcaac tagcggcgta gtcgcactgc accgtcaacc gacgatatca acattgttcc 2700 gtatgttgcc cgtcacgttg tacggaaaga aagggaaagt aaacacgttc gcctttctcg 2760 atgacggctc gtcggtgaca ctcctggagc ggaagttggc ggcaacccta ggcctggaag 2820 gaaagcaagc atctctgtgc atccactgga catccggcat caagaagaac ttctcggaga 2880 cgcgggaggt tgagctggag atttctggcg ctgatcgtcc gcagcggttc gtcgcgtcga 2940 acgtgtacac ggtggacaag ctcggacttc cggaacaaac gatggacgtc gcggccatgg 3000 cggaggagtt cgcgtatctg caggacctgc cgctattgag cttgcagtct gcggttcccg 3060 gaatactcat cggtctgaac aacgtgcacc tgctcgcaac actgaagctg cgggaaggac 3120 gcaaaggaga accgatcgct accaaaacgc gtctcggttg gacggtctac gggagtatgc 3180 cggcggctac acagtctttt gcccaccgcc agttccacat ctccgatgaa caagccgaag 3240 ttgacctgca cgaatacgtc aagagcttct ttgcggttga aagcctcgga gtcatggcag 3300 taccgagtga ggagagcgtc gacgaccagc gtgccaacaa gattctgtcc gaaactacca 3360 aacgagtcga aggtgggcgg tttgaaactg gccttctctg gaaacaggac tacgtggagt 3420 ttccggatag cagaccgatg gcggagaagc ggatgaagtg tttggagcga cgcctgggac 3480 gagatccggc actgtacgat caagtacgaa agcagatcgc ggacttccag tcgaaaggtt 3540 tcgcgcacaa ggcgaccgct gcggaactgg acaccttcga ccctcgccgg acctggtatt 3600 tacccctcgg agtagtactg aacccaaaaa agcccggcaa agtgcgcgtg atctgggacg 3660 cggctgccaa ggtcgatggc gtgtccctga ataccatgct tttgaagggg cccgacctgt 3720 tgacgccgct gctgtccgtg cttttccagt tcagggagag acaggtcgcc atctgcgcag 3780 acatcgaagc catgtttcac caggtgaaga ttcgggaacc cgatcgcagc gcgcagctgt 3840 tcctgtggag ggattcgccg gacaagccgc tggaaaccat ggtgaccgac gtcgctatct 3900 ttggggcgac gtgctcccca gcccactcgc agtacgtgaa aaatctcaac gctaacgaac 3960 acgaagcaga tcacccgaag gcggcagcag cgattcgcaa caagcactac gtcgacgatt 4020 acctcgacag tgtcgatacg gcggacgaag cagttgcgat ggcgctagag gtagccgagg 4080 ttcacgcgaa agccggattc cacatccgga actggatctc gaacgacaaa accgtgctgg 4140 cgcggatcgg agcggtcaac cctacgacgg tcaagaactt cgtgatcgag aaggagaacg 4200 gattcgagcg tctgctggga atggtgtggc tgcccgacga agacatgttc tcgttcgcgc 4260 tgagcctgcg agaggacaac atgaagctgc tgactggaga ggtggcgccg acgaagaccc 4320 agctgctgag catcgtgatg agcatctatg acccgaacgg gttagtcgcc gtcttcgtca 4380 tccacggaaa aatactcgtg caggacgtgt ggcgatccgg cgtcggctgg aaagacaaaa 4440 tcccggagaa gctcatcgga cgctggaagc agtggatcgc gctgctgcgc aaaatcgaaa 4500 cggtgagagt tccacgttgc tatttcaagg attacgaacc agccagcctg aaaacgttgc 4560 aactccacgt gtttgtcgac gccagcgagc aagcttactc cgcgatggca tactttcggc 4620 tagaggatcg cggccaagtc cggtgctcac tggtggcgac caaaacgaag gtcgcacctc 4680 tacagctgct tcccatccca cgtctcgaag ttcagtcggg cgtaactgga tcgcgtttgc 4740 ggaagaccat cgaggacgga cactcactcc ctatttcgaa ggtcgttttc tggtcggact 4800 ccaagacggc actgcagtgg atcaggtcaa ccgacctccg ccgatttcgc ccgtatgtgg 4860 cgttccgggt caacgagatc ctgtcactgt ccaaggccgc cgagtggaga tactgtccat 4920 cgcgcatgaa cgtcgccgac gaagctacga agtggggcaa aaagggacca actttcgacc 4980 ccgacgcgca agtttacgtc tgccacaagt tcatttacga tcaggaagaa gactggcccg 5040 aagactgcct cgagcgggtt gaggcgacag aagaactcag gccagcgttc atgttcagcc 5100 acttcgtggt catgcccatc attcgactgg agctgttttc gcgatgggaa cgcctgctga 5160 ggatggtcgc ctacgttcat cgctttctcg cgcgcagact aaagctgaag caggaaacct 5220 gtccgggcgc gctgacacgt gaggagctgc agaaagcaga acgaagcctg tggcgcttgg 5280 ctcaatccga cgcttatccg gacgaggtcg ccacgctgaa ccagaacctg ctagtgccag 5340 ccgagaaacg ggagtgtttg gagacgtcga gcagcatttt gaagttgtcc cctctgctgg 5400 acgacagcgg agtgcttcga gctggaagtc gcttggaggc cgccgagttc gccgcgttcg 5460 acgccaagtt tcccgtgatc ctcccaaaca aacatcgcgt gacctggctg ctcgtggact 5520 cctaccaccg aaggtttcgc catgcaaaca acgagacggt ggtcaacgag atacggcaga 5580 agttcaacat accgaagctg cgggtcctga tccggcaggt ggcgagtaaa tgctgtttct 5640 gccgcataaa aaaggccatc gcggtggcac ccatgatggc accccttcca cgtgtgaggc 5700 tgtccccttt cgtgcgcccg ttcacgttca cgggcgtgga ctacttcggc ccggtcctga 5760 tcaaggttgg acgcagtgtt gcgaaacgct gggtggcgct tttcacctgc ttgacgatcc 5820 gcgcagttca cctggagctc gtcgcaagct tgtcaacgga ctcgtgcaag aaagcaatcc 5880 gccggttcat cgcgcgacgt ggagcgccgc aggagatata ctcggacaac ggtacaaatt 5940 tccaaggagc cagcggagag ctgtcgaagg aattagccaa ggtcaaccac aatgaactca 6000 gcagtacatt caccgacatc cacacccagt ggcgattcaa cccaccggcc gcgccgcaca 6060 tgggtggctg ctgggaacga atggtaagat ccgtgaaggc tgcactgggc gtaatcccgg 6120 tggaacgcaa gctggatgag gagtcgctgg tcaccctact agcggaagcg gagcatatgg 6180 taaactcgcg ccccctgact ttcgtgccgc tggagagcgc agacagcgag tccttgaccc 6240 ccaaccattt tcttatgctg agttcgagcg gcgttcagca ggccgtcaag gatcgggtgg 6300 gcgtaggcgc ggccatcaag aacagctgga acagtattca gcacgcactg gacgagttct 6360 ggtgccactg ggtgaaggaa tacctgccga cgatcgcgag aaggactaaa tggttcgaag 6420 aggtgagacc aatcaaggag ggcgatatcg tgcttgttgt ggacgaagga catcggaacg 6480 ggtggttgag aggacgcgtt gcaaggacgt atcctggtaa ggacggcaga gtacgtcggg 6540 cggatgtaca gacatcgaag ggtcaagttg gctctgctgg acgttggtga cgctgaggat 6600 tctggtgcag atcatcatgc gacacgtggg ggagga 6636 // ID LINER1-2_NVi repbase; DNA; INV; 5260 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Non-LTR retrotransposon. XX KW I; Non-LTR Retrotransposon; Transposable Element; R1; LINER1; KW LINER1-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5260 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 787-787 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 215..1753 FT /product="LINER1-2_NVi_1p" FT /translation="MDSELIITLPQAGQALQPTPDVGSDRVKAPVADAGSM FT DVGGSQLTQTRQEVIEVGHSALKGRLASSDASGCDGVINRGMHEVWSKIQT FT ADRAEEELMDKIQNHLDKMDAVVGPAKNIHKSVKDDMRKVMSFWKRLISVR FT EASSKTKSSFGLMMSNSSRTSLTSEEKREVETPRKRKERSPTQNETNKKRK FT EEDKTPKHGQSAPLATTSSTTQCPPPTTGTLPPWQRVETRKKKKKKEKEDD FT KINNGVVDQKGTRKSKPPRARPTRPEALIIKATDGKSYADILRKMKADPKL FT KMLGDSVNRIRKTAVGDLLLELQRTSEGKATELRQAVQEVLEEGVTVRTLQ FT DVEVFEVKDLDVLTTKEDIVEALRREFQDSGSNAVEETAVKSVRKAYGDTQ FT TAVIQMPAKMAQQMIAKQKIRIGWVVCRIREIKRNARPLRCYKCLGFGHIG FT KNCTVTQDRSNHCYKCGVEGHNARDCKNKPSCVLCQERGATEKSDHAAGSY FT TCPVYRAAVEKLKKRR*" FT CDS 1705..4776 FT /product="LINER1-2_NVi_2p" FT /translation="MPSIPSGSREVEETKIMKIVQLNLNHCETAQDLLNQY FT VHETEVDVAIICEPYRALDETSWETDDTGRAAIWACGNVAFQEKMLTSEEG FT FVRAKIAGIHVYSCYASPNAPIEQFERQLDRLVQDIAGRKPVIIAGDFNAW FT AVEWGSQRTNQRGRVLLEASALLDLVLVNQGSTNTFRRGDAGSIVDLTFVS FT SCLIGSIDKWTVSEHYTNSDHQAIIMEVRKSEQRPSASTRTNRVGWKTKDY FT DKEMFLLALEELQLSGTANSKAEQVMGNITRACDAAMPRRVPNCRRPPVYW FT WDKEVESARSECHRTRRREQRARKKYYKTGRGQEIVDARGQEMKDAKRKLK FT KTIRENKRRCLKELQDEVEQDPWGRPYKIVMKKIKGSYVPPPKCPELLHRV FT VTTLFPRQLEEPSVIERGANEEAVPPITIKELLAACRRVGNNKAPGPDGIP FT NIALKHAIHAHPEVFVDLYNACLEEGTFPTNWKKQRLVLLPKGKKPPQDSS FT SYRPLCMLDTPGKILERIICVRMDHFIEGKGGLAEHQYGFRKNRSTLDAVS FT LVIDTAQTAIEGKRWKGGKKKYCAIVTLDVKNAFNSAKWSEIHEALRKQDV FT PLYIRRMMSDYLKDRILLYDTEDGTKTYKVTGGVPQGSVLGPPTWNIMYDG FT VLRLQLPEGATVVGFADDIAVVVVAKHKEEVTEIAEEATRIIHEWLTETGL FT ELASHKTEVILISSRKKMEEIKLTVDGHEIASQPTIKYLGITIDARLTFKQ FT HLEIVSDKAAKVGAALSRLMPNVGGPTQKRRLLLASVTTSIMLYGAPIWAD FT AMRVKSYARMLTTVYRRSALRVASAYRTVSDNAVCIIAGMPPIDLLAMESK FT EVFQTKRRTSDKTQKEIWDAARKKTMAEWQTRWDVSDKGRWTHRLIPKIED FT WTGREHGEVNYYLTQFLTGHGCFRAYLHRFKLDDSPNCPACLDANEDAEHV FT LCDCSRYQMEREELECYLQTRVTPESMMTTMLTSEDGWNAVNNYVRTILKK FT VRNDEEKRRQRQGETAE*" XX SQ Sequence 5260 BP; 1729 A; 1132 C; 1437 G; 962 T; 0 other; catacacaca cacacacaca cacacacaca caactccgct acgccggtcc caagcccggg 60 gccccaccac acaagaggtg gggaaggagg agggggagga gtagcaacct cgttaaaata 120 accaaggctc atctggcccg gatgacagca ccttgttaaa gggctccttc ctaggataca 180 ggatagggga agaaaccccg aatataaatc aatcatggac agtgagttaa ttataacact 240 gccccaagcg ggtcaagctt tacagcccac tcccgacgtt ggttccgatc gggtgaaagc 300 cccggtggcc gacgcaggct ccatggatgt tggggggagc caattaactc aaacaagaca 360 ggaggttata gaggtgggtc actcggccct caaaggaaga ttagcctcaa gtgacgcatc 420 aggatgtgat ggagtcataa acaggggaat gcacgaggta tggtctaaaa ttcagacagc 480 agatcgtgcg gaagaagagt tgatggacaa gattcagaac catctggaca agatggatgc 540 tgtagtaggt ccagccaaaa acattcacaa gagtgttaag gatgacatga ggaaggtcat 600 gtctttctgg aaacggctca tcagcgtaag agaagcgtcg agcaagacca agtctagctt 660 tggactgatg atgtcaaata gctcacggac ctcgctgacc tcagaagaaa aacgggaggt 720 agaaacgcca aggaagagga aggaacggtc tcctacacag aacgagacca ataagaaaag 780 aaaagaagag gacaagacgc caaagcatgg ccagagtgcg ccactagcga cgacttccag 840 cactactcag tgcccaccgc ccacaacagg gacattacca ccctggcaga gggtggaaac 900 aagaaagaaa aagaagaaaa aggaaaaaga agacgataag attaacaatg gtgttgtaga 960 tcagaaaggc actagaaaat ctaagccgcc aagggcaaga ccaacccgac cggaggctct 1020 aatcataaaa gctaccgacg ggaagtccta cgctgatatc ttgcgaaaga tgaaagcaga 1080 cccaaaactg aagatgctag gggatagcgt aaataggata cgcaaaacgg cggttggtga 1140 cctacttttg gaactccaac gcacatcgga agggaaggca acggaacttc gacaagcggt 1200 gcaagaagtg ctagaagagg gtgtcaccgt aagaacccta caagatgtag aggtctttga 1260 agtaaaggat cttgatgttc taaccactaa ggaggacatc gtggaagcgc tacggaggga 1320 gtttcaagat agcggctcaa atgcagtgga ggaaacggca gtaaagtctg tgcgcaaggc 1380 atatggcgat acgcagacgg cggtcattca aatgccagca aaaatggctc agcagatgat 1440 cgcaaagcag aaaatcagga taggctgggt agtatgcaga atacgagaga taaagcggaa 1500 tgcacgtcca cttcgctgct ataaatgtct aggatttgga catatcggca aaaactgcac 1560 agtaacacag gatcgaagta accactgtta caagtgtgga gtcgaaggac ataatgcaag 1620 agattgtaaa aacaaaccga gctgcgttct ctgccaagaa agaggagcaa cagagaaaag 1680 tgaccatgct gctggtagct acacatgccc agtataccga gcggcagtag agaagttgaa 1740 gaaacgaaga taatgaagat agtgcagttg aacttgaatc actgcgagac agcacaggat 1800 ttacttaacc agtacgttca tgagacggag gtcgacgtag ccattatatg tgaaccatac 1860 agggctctgg acgagacatc gtgggaaaca gacgacactg gcagagcagc gatctgggca 1920 tgtgggaacg tggcattcca ggaaaaaatg ttgaccagcg aggaaggatt tgttcgtgca 1980 aagatagcag gaatacacgt atacagctgc tatgcttcgc ctaacgcacc gatcgaacag 2040 ttcgagcggc agttagatcg actcgtgcag gatattgcgg gaagaaagcc agtaattatc 2100 gcaggcgatt ttaacgcatg ggccgtagag tggggcagtc aaaggactaa ccaaagagga 2160 cgagtactgc tagaagcatc tgctctcctg gacttggtgt tagttaatca gggtagcact 2220 aacaccttta gaagaggaga cgcaggatct attgtagacc tgacctttgt tagtagttgc 2280 cttattggct caattgacaa gtggaccgtg agtgaacact acacaaatag tgatcaccag 2340 gccataataa tggaggtaag aaaatcagag cagagaccca gcgcgagtac aaggaccaac 2400 agagtcggtt ggaaaacgaa ggattatgat aaggagatgt tcctactggc actggaagaa 2460 ttgcaactct cggggacggc gaacagcaag gcggaacagg tgatgggtaa tatcacacga 2520 gcctgtgacg cggccatgcc gagaagagtg cccaattgta gacgaccacc agtatattgg 2580 tgggacaaag aggttgagag tgcgcgcagc gagtgtcacc ggaccagaag aagagaacaa 2640 cgggcaagaa agaaatacta taaaaccgga agaggccagg aaatagtaga tgcccgcgga 2700 caagaaatga aggacgctaa gaggaagctt aagaagacaa tccgcgaaaa caagcgacgg 2760 tgccttaaag aactacagga tgaggtcgaa caggaccctt ggggtcgacc ttacaaaata 2820 gtaatgaaga agattaaagg aagctacgta ccccctccaa aatgcccgga actattacac 2880 cgggtagtga ccacactgtt ccccaggcaa ctggaagagc caagtgtcat cgaaagaggt 2940 gcgaatgaag aggctgttcc accgatcacg attaaggaat tgttagccgc ctgcagaagg 3000 gtaggaaaca acaaggcccc agggccggat ggcattccca atattgcatt gaagcatgcc 3060 atccatgctc atccagaggt cttcgtagac ttgtacaatg catgtctaga agagggaaca 3120 ttccccacaa actggaagaa acagcgtctc gtgcttctgc caaaaggaaa gaagccaccc 3180 caggattcct catcatacag accgctctgc atgttggaca ccccgggtaa gatcctggaa 3240 cgcataatct gcgtcagaat ggatcatttt atagaaggaa aaggtggatt agcggaacac 3300 caatatggct ttaggaagaa tcgttcaacc ctggacgcag ttagcctagt gatcgacact 3360 gcgcaaacag cgatagaggg gaaaagatgg aagggaggaa aaaagaagta ctgcgccata 3420 gttacattgg atgtcaaaaa tgcattcaat tcggccaaat ggagtgaaat acacgaagca 3480 ctccggaaac aggatgtgcc cttatatata agaagaatga tgtccgacta tttgaaggac 3540 agaatcctat tatatgacac ggaggacggt accaagacct ataaagttac aggaggcgta 3600 ccccaagggt cagtgcttgg cccaccgacg tggaacatca tgtacgatgg ggtactaaga 3660 ctacaattac ctgaaggagc aacagtcgtg ggcttcgctg acgacatagc agtagtggta 3720 gtagcgaaac ataaagaaga ggtgacagaa atagcggaag aggcgacccg tataatacac 3780 gaatggctaa cggagacggg tctggaacta gccagccaca aaacggaagt tattcttatc 3840 tctagtagga agaagatgga ggaaattaaa ctgacggtag atggacatga gatagcttct 3900 caaccaacca tcaagtactt gggcatcacc atcgatgcaa gactaacttt caaacaacat 3960 ctggaaatag tcagcgataa agcagcaaaa gtcggcgctg ctctatcacg attgatgcca 4020 aatgtaggag gcccaactca gaagcgaagg ttactcttag ccagtgtaac cacatcgatt 4080 atgctatatg gagccccaat atgggcagat gcgatgcggg tgaaatcgta tgcccgaatg 4140 ctaaccactg tgtacagacg aagtgcacta agagtggcgt ccgcttaccg tacagtatcg 4200 gacaatgcgg tgtgcattat tgctggtatg ccgccgattg acctattggc gatggaaagt 4260 aaggaagtat tccaaactaa aagaagaaca agtgacaaaa ctcagaagga gatctgggat 4320 gcagcaagaa agaagacaat ggccgaatgg caaacaagat gggatgttag tgacaaaggg 4380 cgatggacac atcgccttat accaaaaata gaggattgga ctggaagaga acatggggag 4440 gtcaactact acctcacaca gtttttaacc gggcacggat gcttccgggc ttatctgcac 4500 cgatttaaat tagacgacag tccgaactgc ccagcttgtt tggacgctaa cgaagatgcg 4560 gagcatgttc tctgtgactg ttcgcgatac caaatggagc gagaagaact cgagtgttat 4620 ctccagacca gagtgacacc tgagtcaatg atgacaacta tgttgacgtc ggaggatggt 4680 tggaacgcag taaacaacta cgtaaggact atcctcaaga aggttagaaa tgacgaagag 4740 aaaagaaggc aaagacaagg cgaaacagcg gaataaaaat gtagctagac gaaggtcact 4800 gagccgtgtc ggggttgagg ctgtccagag gatggccgga cgatgccgca cgaagcagac 4860 gaccggacga tgcggaaaga agcagacgcc cggacgacgc agaaagaaga aacgtgatga 4920 cgaccacatc aattagaaga gggcgtaaaa gcctttcccc cccccccccc cccccgcgaa 4980 gcactactcg acggtcgtac cgcggggaac aagggcacag aagatggagc taggtgcgcc 5040 tttaccgagg gtcgataaag gcaaagaggg cgttccttgg tacggttggg tgtttttcat 5100 ctggccgtaa aaaggatggg tttagtcggt gtgaatccga cacacggctg acattgcgga 5160 tggggctgtt tcgacgaccc cataaacaaa gtcacgctat ggtctttcta agattttccc 5220 taacacctct tctccgagag aaaaaaaaaa cacacacaca 5260 // ID Sola1-N5_AAe repbase; DNA; INV; 1831 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1831 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1295-1295 (2011). XX DR [2] (Consensus) XX CC ~96% identical to consensus. 4-bp TSDs. TIRs are 29 bp long. CC Both termini are 78-90% identical to those of Sola1-2_AA. XX SQ Sequence 1831 BP; 606 A; 314 C; 338 G; 573 T; 0 other; ctgcccctct ttgcacaaca gtcccatgtt ggaaaacgac aaactgagaa aaagacgatt 60 gaaatgtgga aacaatttcg ctccattttg agtctttatt tggcctaaaa tgtgttaaat 120 tgctacatat cactgtttgt tgagtacacg aacaaaccta ataaatttct tttgaattca 180 agttgaaaat ttgccttaaa atgcacaaaa atcgtagcga gactgttatg cgattatata 240 caatataata taccaatgta attgcatacc agtcccatca tgttcatcgg ctcgttcgac 300 agctactaaa acactcattg ttattaatgc actggttgtt tcatttgcta tttatgttct 360 ttggccgtaa aaatttagtt attatagtac agtcaactct ccgtaactcg atattaaggc 420 gactatcgag ttagggaaat atcgtgttaa agaacacaaa atcagggtaa ctttggttaa 480 agggaccatc gaggtatcca tgaaaaccca catttactac acattctata actcgatatc 540 gagatacgga atagcgagtt acggatagta tactgtatgt ccatggatga atcaccagat 600 aaaaaaaaac atcggattga atgatttctc aaaacacacg ttatgatgtg gaatgcgctt 660 acagtcatga gtcccagtca tgagagaccc agatgatcat cttttggata atgcggaaat 720 ggaatccgat ggaaatgatt cgagatgttc cagaacaacc tagctaatct gttatatggg 780 agatccagaa tgacgaaaca aatttttcag gcaacttctg taagccttga ccgcagatct 840 aaaatacagt ggctcgtggg atgtgccgag ccaaagtctt tctgtatttt cccaataagc 900 agatgaagaa gacgatgtgg ttgctgattg tttttatatg gaaatcttgc tttcgcattg 960 aaaagttttt attttgcaca cttacaagca agtactcctt aagtcaattt taccagttat 1020 cgaaaaacgc aaaaataagt tttagttcga tattcctgag caccagcaaa gaatcattac 1080 aaaatttaaa taaaaaatgt agaatgaact ttattttatt ttattttttg ttgctcattt 1140 tttttattcc cctcttacca cccaaatggt tacctacgga cataaaagaa gattaatatt 1200 tgtataagcc ttattaaaga aaatttgtaa tgcatcgtaa ttgaaaatac ataattatgt 1260 gtgatatcca ctaatttaaa aatattttcg aaacaacgtg aatacatgac atttacaatg 1320 tcgtaagcaa gtgtgtatta aatacagtca aacctccatg agtcgatatc tgaaggaatc 1380 atagactcat ggaaacatcg agtcatggaa cagaaatgct ttggaaagct gtttgagcga 1440 atcatcatag ttaccatgaa attttgttta agtatggttc catgagtcga tattaagaca 1500 tagaacatcg actcatggag gtttcactgt ataattgatt tccctttact tcgctacaca 1560 tcagtgaata actagtagta tcgagagtcc gaatgtagta ctagaagtgg tacccattct 1620 gagcccgatc actatacgga ctttgatgtg actgttatgc gaatattttc aagatgggac 1680 aacacaaatg gggtttgttt ttgtacaagt taggaacaaa ggctactttt ttagcagttt 1740 ttatcgatct tctgacgatg ttgagttgat ttatgcaaaa aacgcaaatc ggtcaaaagt 1800 cgacatggga ctgttatgcg attatgggca g 1831 // ID CR1-74_AAe repbase; DNA; INV; 5135 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-74_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5135 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1162-1162 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 304..2037 FT /product="CR1-74_AAe_1p" FT /translation="MSKKCSASHCITGSSRSSKITCANCRKGFHLKCVGLS FT SNQFKAIRDFPGAEWFCPACRNSPLDLQQSSINPVLNLILDRLASVLRLVG FT VQIDVTRSLCRALSENIRRTSINNRNSVPAQASQYRNFEDALNNLQLEFSN FT VFGSFIDADDNSAAKRDRTSSLSSSHLVSSNSGKRMRVDVPISTSDIANII FT VPSIADATAVNIASSSEPVEACSSDINCITSTENASAATVTSITTSATPTA FT STTASTTPFTRLAPPPANSSHYATSTVAALRPATACSATLAATQAVFTTAT FT QTSPSSATAYKTSVTTEQSTQTNFSALPKPPRTTQLILQSPMISAPASSAT FT CNTTNTVCKNVLPPEDRHLPQSRPSESVDLTVPPTMVSSSGPTIPCYQSSC FT SLASIQTKNNCNSYTRPSLAIAQSAPIISVNSNDNWYYLTKFLPHETESNI FT INYIAHHTNCNPSHIVCQKLVRANSDTRPLSFLSFKIKFPVGIENVVLANG FT FWPQGVTITPFLDRRSNSRKPVGSQTQLRSILRNPRLLNQRSPSLQSNSLH FT NLQPVKYLAPVQRKQIPSPGFKAPNYRVTLV" FT CDS 1770..4916 FT /product="CR1-74_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KCRACQWILATGSYYNPFFRQAIQQPQTCRKSDTTTL FT NSSKSTFTQSTFTLASVEFIAQSSTGEIPSSSATETNTFARVQSSELQSNF FT SLNGLLCYYQNLGGINTSLTEYLLACSDAAYDIYAFTETWLKNTTPSSTIF FT GNVYSVFRTDRSSLNSSKSTGGGVLLAVRSTLKCRLLQPPNCHAVEQLWVA FT LPVSHHTLYICIIYIPPDRINDPLLTSQHSDSLDWITSQMNLTDKILILGD FT FNMSSISWTTGSSNYLYPNYQQSKINTSQCQLLNDYSTAGLVQISSVYNSN FT HRLLDLCFTSLEHIQDITVVKAAAPLVKDCRHHPPIQLYLKNRVPYEFKPI FT SSGIHYNFRKTDFIKMNQFLSSIDWDDMLRGLSVDTTVEVVSFAVTHAIGL FT FTPKTSHKDPSFPPWSTKALKRLKTTKKSALKKFSKKRNALTKTQYLTCNS FT EYKRLNDRLYHSYIRRTQRNLKLNPKKFWSYIDDQKKESGLPSTMTLREKK FT ASSLSSITALFRQHFSAVFTSTTLTPHHVQEAAAMVPQRSAIGPHPRISSL FT TVELACDGLKCSTNPGPDGIPAVVLKSCSSSLSKPLSTIFNRSLSSGCFPS FT AWKNSYIFPVFKKGAKGEVSNYRGIASLCATSKLFELIVLDFLKHHCLNYV FT SETQHGFMPKRSTTTNLVSYTSFIIKTIEARKQVDAIYTDFSAAFDKINHD FT VLLAKLERLGLAEPLLSWMRSYLVGRTMSVKIGNIISEHFAVPSGVPQGSH FT IGPFLFLLYLNDINLVLKCFKLSYADDYKLYYVITCNEDARFLQLELDTFT FT NWCKTNNMSLNADKCSVITFTRKHSIISYEYSINGIPIKRESTIKDLGVLL FT DSKLCFKDHVAYITSKALKCLGLIFRSAKHFDDIYCMKTLYCSLVRSVLEY FT GVVVWAPYYDSGIQLIEAVQHKFVRYALRRLPWTDPHNLPSYVDRCKLIDL FT DLLEHRRKVCKASFISDVLQSHIDCPTILEMLNVDTRRRDLRSHQFFRLPI FT FRTNYGYNEPISGMCRIFNQCFHAFDFHLSRVSNKNQFRKILL" XX SQ Sequence 5135 BP; 1472 A; 1232 C; 893 G; 1536 T; 2 other; cagackwgga acgatcacaa ccacacagtg ttatacctat ttgaagctgc tattaaattc 60 gttttttcta ctgttaatta tctcgcttaa ataagagcag tgcgttaata ttgatctaaa 120 cgttttgttc tttataatcc gttgttggtt cagtaaaatt gtgccaaatt aagctgtagt 180 gatataattt atcacatcat ccaagctact cacccgtcat cgatacagcc attgtttgct 240 gtttgcagct cagacaacca caaagcgcat tgttttcact tgcccatcga ttcgacgtag 300 aagatgtcga aaaaatgttc tgcttcccac tgtattactg gttcatctag aagcagtaaa 360 ataacttgtg caaactgcag aaaaggattt catttgaaat gcgttggttt atcatccaac 420 cagttcaaag cgatacgtga cttccccggc gcagaatggt tctgtccagc atgtcgaaat 480 tcgccactcg atcttcagca gtcatcgatc aatccggtgc tcaatttgat actggatcgc 540 cttgcgtcgg ttttacgtct ggttggagta caaatcgacg tgactcgctc gttatgtcgg 600 gctctgagcg aaaatatccg tcgcacaagt ataaacaaca gaaactcggt acctgctcaa 660 gcctcccagt ataggaattt tgaggatgcc ctaaacaacc ttcaactcga attttcgaac 720 gtcttcggct ctttcatcga tgctgatgat aactctgctg ccaaacggga tcgtacaagc 780 agcctatcgt cttcgcacct tgtatcatct aattctggaa agcgaatgag agttgatgtt 840 ccgataagta cgtccgatat agcaaatatt atagttcctt caattgcaga cgccactgct 900 gtgaatattg cctcctcatc ggaacccgtc gaagcttgca gttctgacat caattgcatt 960 acatcaaccg aaaatgcctc cgccgccacc gtcacatcta tcaccacttc tgccacccct 1020 accgcctcca ctaccgcctc aactacacca tttaccagac tagctccacc accagccaat 1080 tcaagccact atgccacttc cactgttgct gcactaagac ccgccaccgc ctgctctgct 1140 accctcgctg ctacgcaagc tgtttttaca actgccacac aaacttcgcc ttcgtcggct 1200 accgcttaca aaacctccgt caccaccgaa caaagcacgc aaaccaactt ctctgcatta 1260 ccaaagccac cacgcaccac tcaacttatt ctccaatcac caatgatctc tgctccagct 1320 tcatcagcaa cttgtaatac aaccaacacg gtctgcaaaa atgttcttcc accagaggat 1380 cgtcatctac cgcagtccag gccatcagag tctgtcgact tgactgtccc tcctaccatg 1440 gtgagttcat ccggaccgac aatcccttgc tatcaatcat catgttctct agctagtatt 1500 caaactaaga acaattgtaa ttcttacact agaccgtccc tagcaatagc tcaatctgca 1560 ccaataatct cagtcaatag caatgataac tggtattacc tcacaaaatt tttgccacat 1620 gaaacagaaa gcaacataat taattacatt gctcatcata cgaattgtaa cccttcacat 1680 atagtgtgcc agaaactggt tcgagcaaat agtgacacga gaccgctatc gtttttgtcg 1740 tttaaaatta agtttcctgt tggaattgaa aatgtcgtgc ttgccaatgg attttggcca 1800 cagggagtta ctataacccc ttttttagac aggcgatcca acagccgcaa acctgtagga 1860 agtcagacac aactacgctc aattcttcga aatccacgtt tactcaatca acgttcaccc 1920 tcgcttcagt cgaattcatt gcacaatctt caaccggtga aatacctagc tccagtgcaa 1980 cggaaacaaa taccttcgcc agggttcaaa gctccgaatt acagagtaac tttagtttga 2040 atggcctact ttgctactat caaaacctcg gtggaattaa cacttcactt actgaatatc 2100 ttcttgcctg ctctgacgct gcttatgaca tttacgcgtt tactgaaaca tggctcaaga 2160 atactacacc ttcctctaca atttttggta atgtatattc agtcttccgt accgaccgct 2220 catcattgaa tagttccaaa agtaccggtg gcggcgtgct tcttgctgtt cgttcaacgc 2280 tcaaatgtcg tttactgcaa ccacccaact gccatgccgt agaacagttg tgggtggcac 2340 ttcctgtttc tcatcatact ttgtacatct gtataatcta cattcctcct gatagaatca 2400 acgatcctct actgacctct caacattcag attctctaga ttggattaca tcgcagatga 2460 acttgactga taagattttg atcttaggag atttcaatat gtcttccata agctggacca 2520 ctggttcctc aaactatctc tatccaaact accagcagtc caaaataaac accagtcagt 2580 gtcagttact aaacgattac agtacggcag gtttagtcca aataagctca gtttataatt 2640 caaaccatcg acttctagac ctttgtttta ccagcttaga acatattcag gatataactg 2700 ttgtaaaagc ggccgctcct ctcgttaagg attgtcgaca tcatccacct atacaacttt 2760 atttgaaaaa cagagtcccg tatgaattca aacccatttc tagcggtatc cactacaatt 2820 tcagaaaaac tgattttatc aaaatgaatc agtttttgag ttcaattgat tgggatgaca 2880 tgctacgagg tctaagcgtg gatacaaccg tcgaagtagt aagtttcgct gtaactcacg 2940 cgataggttt gttcacgcct aaaacttctc ataaagatcc ttctttcccg ccttggtcca 3000 ccaaagctct taaaagatta aaaacaacta aaaagtcagc tctgaaaaaa ttctcaaaaa 3060 aacgtaatgc tcttaccaaa acacaatatc tgacctgcaa tagtgaatac aaacgtctga 3120 atgatagact atatcacagt tacatccgcc gcacccagcg caacctcaaa ttgaacccta 3180 agaagttttg gtcgtatatc gatgatcaaa aaaaggagtc aggattaccg tcaactatga 3240 cgcttcgaga aaagaaggct tcaagtctct cttcgattac tgctttattc agacaacatt 3300 tctcagcagt tttcacttct acaacattga caccccatca tgttcaagaa gccgcagcta 3360 tggtacctca acgttccgct attggccctc acccaaggat ttcgtcgctc acagttgagt 3420 tagcttgtga cgggttgaag tgctcaacaa accctggtcc tgatggaatt ccagctgttg 3480 ttcttaaaag ttgcagctca tccctttcga agccattgtc aactatattt aaccgttcgc 3540 tctcttctgg ctgttttccg agtgcatgga aaaattcgta tattttccca gtcttcaaaa 3600 aaggcgctaa aggggaagtt tcgaattaca gaggaattgc ttccttgtgt gcaacgtcca 3660 aactgtttga gttgattgta ctggattttc tcaaacatca ctgcctgaat tatgtatctg 3720 aaacccagca tggctttatg ccaaaacgat ccactaccac taatttagtc agctacacat 3780 ccttcattat caaaaccatc gaagctcgta aacaagttga tgcaatttat actgactttt 3840 cggcggcctt tgacaagata aatcacgatg tcttgttagc aaaactcgag cgcctaggat 3900 tagccgaacc tcttttaagt tggatgagat cctacttagt aggacggacg atgtcagtta 3960 aaataggaaa tataatttct gaacactttg ctgtaccatc gggagttcct caaggaagtc 4020 atattggccc gtttttgttc ctactctact taaatgatat taaccttgtc ctcaaatgtt 4080 tcaaactttc ttacgccgat gattataagc tttattatgt cataacatgc aatgaagatg 4140 ccaggtttct tcagttagaa cttgatacgt ttacgaactg gtgtaaaaca aacaacatga 4200 gtctgaatgc agacaaatgt tcagtgatta ccttcactcg aaaacactca atcataagct 4260 acgaatattc cattaatgga atacctatta aacgagaatc tactatcaaa gacttagggg 4320 ttctgctaga ttccaagctt tgcttcaaag atcatgtagc gtatattaca tcaaaagcgc 4380 taaaatgctt gggtttgatc tttcgatccg ccaagcactt cgatgacata tactgcatga 4440 aaactctcta ctgttctctt gtccgctctg ttctggagta cggtgttgta gtgtgggctc 4500 catactacga ttctggcatt caactaatcg aagcagttca gcataaattt gttcgttatg 4560 ctcttcgtcg tcttccttgg accgatccgc ataacttgcc cagttatgtg gatcgttgta 4620 aattgatcga tctcgatctt ttagaacatc gtcgtaaagt ttgcaaggct tcgtttattt 4680 ccgatgttct tcaatctcat atagattgtc caactatttt agaaatgtta aatgttgata 4740 ctcgtcgtcg tgatcttcgt tctcatcagt tcttccgtct tccaatattt cgtacaaact 4800 atggctataa tgagcctata agtggtatgt gtcgcatttt taatcaatgt tttcatgcgt 4860 tcgattttca tttgtcccgt gtctccaaca aaaaccagtt ccgtaaaatt ttgttgtaac 4920 attatgatta agttagtaga tgtaaacgat tagctttaag atctgtattg ttgtatagtg 4980 tactttagcg ttaagaattg tatcattggg aatttgtatt ctgttgatat gaaaagaaga 5040 ggaggtttta tgcccatttg agaaagagct tttaagaaag ctctactcaa acgggctttt 5100 ccctgctcca ataaagataa agataaagat aaaga 5135 // ID DNA8-96_AP repbase; DNA; INV; 662 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-96_AP. XX NM DNA8-96_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-662 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2033-2033 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 662 BP; 185 A; 92 C; 97 G; 287 T; 1 other; actagggccc ggaacttgat gcatttgcat ctttttttct aatgcttaca ttcgaggctc 60 gtctttacag aaatttgatt catggaatat agaatttaat ggtacaattt ttattttgca 120 ttttttgcat ttttagggaa tttattgctt tacggtcatt ttttagtatt tttggtgcat 180 tttattcatt tttgtacttt ttttttacat ttatttgcat ttttcttgtt gaaagttaca 240 ctttatcgta acacattcag tttccattag cttgtcgatt atcgacgttc attatcttta 300 tcgtttgttg tcgcaatcta cgattattta ataaactgta ggccgatact attttaggcc 360 cggtcaatca gaaaaaattt cccaaaagtc gatcgantta tttcaaactt caaaaaaata 420 tttttgaagg ctccttcgag agtaaatttt taaagagcta gaacctataa taaatacgat 480 ttatgaccca tgaattgtat aaaacgtaat ttttgtaaaa cttaaatttt taaaaatgtc 540 aagtcattat aaagtaattt ttggtatttc tgtggtcatt ttttgatgtt tataagtcat 600 tttaacattt ttttttgtgc attttttaaa gatttttagg tcatcaagtt ccgggcccta 660 gt 662 // ID Saci-1_LTR repbase; DNA; INV; 668 BP. XX AC BK004068; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 17-JUL-2008 (Rel. 13.07, Last updated, Version 2) XX DE Schistosoma mansoni Saci-1 long terminal repeat. XX KW LTR Retrotransposon; Transposable Element; LTR; PAO; Saci-1_LTR. XX NM Saci-1_LTR. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-668 RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RP 1-668 RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX RN [3] RP 1-668 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (12-JAN-2005). XX DR Genbank; BK004068; Positions 5428 5980. XX CC CC [3]. XX SQ Sequence 668 BP; 168 A; 104 C; 119 G; 277 T; 0 other; tgttggatcc attttgcaag tgaaaatccc tccatgcata cacttgtgat ttcttcctta 60 cgttttcttt atttgtatat gctgttaaca tttattatgc taattaagac tgtaacgttc 120 aattttcttg gttttactat cgttttttat tgagtctcca tgttctttac tgagctgtat 180 atagtttgtt ttaattgcta ttattctggt atctgccata ttggctgctc gctttttgat 240 tgtgcggttt gaacttgact gggatcgtgc atttttggtt gtatttggag cacttttcca 300 gaaattgaaa cacttggtgt tataaaataa ccgctcaaac ggaatctccc ttgatctatc 360 cagaaaggat agaataacgc ttattctttg gtgtttttgg taatttttca tatgtttctt 420 atcaccttgc tatttatcgt aatttagatt gaatattttg gtatcaattg ttggttgaat 480 aactgggtga taatattcgt ttaggtaaat ttgtacattt gtatatataa cgaagtacag 540 aaccgtgtct aatcatcgtg tcaaatatat accttgtttt gatttgaact gagacacgag 600 tgtctaaagc tatctaacgg ttattcgacc acatccagac ctattggaat ttaggtttgg 660 atcttaca 668 // ID KBOC_DB repbase; DNA; INV; 3300 BP. XX AC AY116624; XX DT 09-DEC-2004 (Rel. 9.11, Created) DT 29-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Drosophila bocqueti transposon P-element k-boc putative DE transposase gene, complete cds. XX KW P; DNA transposon; Transposable Element; KBOC_DB; P-element; KW putative transposase. XX NM KBOC_DB. XX OS Drosophila bocqueti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; montium subgroup. XX RN [1] RA Nouaud D., Quesneville H. and Anxolabehere D.; RT "Recurrent exon shuffling between distant P-element families."; RL Mol. Biol. Evol 20(2), 190-199 (2003). XX DR Genbank; AY116624; Positions 1 3300. XX FH Key Location/Qualifiers FT CDS join(151..450,716..1372,1429..2157,2328..2402, FT 2524..2922) FT /product="KBOC_DB_1p" FT /note="transposase" FT /translation="MSFCEFCCAVVKTEGVKFIRVPKEDRKRKLWEESLGC FT SLAHNARICDTHFKGSDFYGETKTKEERKRRRLMPNALPRQPTPEPESIPT FT VKPGYSNAYTQTIDLENFKLKQKISELEKEIHHLRQQLSESDALRQGLTKI FT FTQNQIKMLPNCGKRIRYNSSDMSEAICLHAAGPRAYNHLYRKGYPLPSRA FT TLYRWLSEVEIKTGTLDIVMDLMKNEDMDEADKVCVLAFDEMKVSAAYEYD FT SAADAVYKPASYVQLAMVRGLKKSWKQPVFFNYNTAMDACTLKAITTKLYK FT SGYIVVAIVCDLGPGNQKLWREFGISEENTWFSHPVDPALKIFAFSDVPHL FT IKLVRNHYVGSGLLISGTKLTKNTVQQAMNCCSSSDLSVLFKLTENHINVR FT SLQKQKVKMATQLFSNTTASAIRRCYELGYEIENACETADFFKMINDWFDT FT FNSKLSTANSLKYSQPYGLQHDLQKDILDKTSLTMSGKIIEKSQRRLPFQH FT GIIVSNKSLDGLYIYLKEKYNMEYILTSRLNQDILEQFFGAMRSKGGLYDH FT PTPLQLKYRLRKYITAKNTELLTGKGNVEDGEEEEWLNLGDIDEMTEDAIE FT YVAGYMIKKLKLRDMSNKDATYTYVDEVSHGGLKKPSSQFVEQLKKLEAIF FT QLYAKEEFDLQINVKRTLLNAAEKLNVPLDIKQLFFKCRIYFRIKHLNKKL FT AIKNQKQRIVANSKLLKIKL" XX SQ Sequence 3300 BP; 1133 A; 569 C; 659 G; 939 T; 0 other; cataatggaa taactataag gtggtctcgt ttgaaaaagc tcgagtgttt ctcatgctta 60 cggggtctgt tctcactctg attttgacag ttgacaggtt gtgcgaacga atttttattt 120 tatttgttaa cccttgggag tgcgtaaaaa atgtcgtttt gtgaattttg ttgtgcggtc 180 gtaaaaactg aaggagtcaa attcatccgt gttcccaaag aagatcggaa aagaaaattg 240 tgggaagaat ccttaggatg cagtttggct cataatgcca ggatttgcga cacacacttc 300 aaaggatcgg atttttacgg agaaacaaag accaaagagg aacgaaagag aaggcgtttg 360 atgccaaacg ccttgccaag acagcctacc ccggagccgg aaagtatccc gaccgtcaaa 420 cctggatatt caaatgcata cacacaaaca gagtaagtcc gaaatgcagt ttctaaataa 480 attttttgga aattgaaaaa aagttaggtt aggttgagtt aggaaaaaaa gtgatacaaa 540 gcaaaaaaaa gtgaattgaa tatatttatg tatgtaaaaa cacacaaaat tgttgcatcc 600 atatgtgcat acatacatat taaagtgtaa acagaatcgc aaagagttta tttggcacat 660 cttatatata ttttcgaatg atttttaaaa tacatttgtt cttttgatat tcagcatcga 720 cttggaaaat tttaaactga agcaaaaaat ttcggagctg gaaaaggaaa ttcaccatct 780 gcgccaacag ctgtcggagt cggacgcatt gcggcagggt ctgactaaaa tcttcaccca 840 aaaccagata aaaatgttgc ccaattgcgg caaaagaatt aggtacaact cgtcggacat 900 gtcagaagca atttgccttc atgctgctgg accacgggct tacaaccacc tgtatagaaa 960 aggatatcca ctacctagcc gtgcgactct atacaggtgg ttgtcagaag tcgaaataaa 1020 aacggggact ctcgacatag tcatggactt gatgaagaac gaggacatgg atgaggctga 1080 caaggtttgt gtcttggcct tcgacgagat gaaggtttct gctgcatacg aatatgacag 1140 cgctgcggac gcagtgtaca agcccgcaag ctatgtccaa ttggccatgg ttcgaggatt 1200 gaaaaaatcg tggaagcagc cggttttttt taactacaac actgccatgg atgcctgtac 1260 cttgaaagca ataacaacca agctctacaa gtcaggatac attgttgttg ctattgtgtg 1320 tgatttgggg cccggaaatc aaaagttgtg gagggagttt ggaatatccg aaggtaaata 1380 tgaaaaaaat catttcagaa atttctaaaa tatttttttt ttattttaga aaatacctgg 1440 tttagtcatc cagtggatcc agctctcaaa atttttgcat tttcggatgt gccacacttg 1500 atcaaattgg ttcgaaacca ttatgttggg tcagggcttt taatcagcgg gactaaattg 1560 acaaaaaaca cagtccaaca ggcaatgaac tgctgttcca gctcagacct gtctgtcctt 1620 ttcaagctaa ctgagaacca catcaatgtt cgatctcttc aaaaacaaaa ggttaaaatg 1680 gcaacgcagc tattttcaaa cacaacagca agtgccatca gacgctgcta tgaattgggc 1740 tatgaaatag aaaacgcatg tgaaacggct gattttttca aaatgattaa tgattggttt 1800 gacacgttta attcaaaatt atctacagca aattcattaa agtatagtca accgtatgga 1860 ttgcagcacg acttgcaaaa agatattttg gataaaacat ctctaacaat gtctggaaaa 1920 ataattgaaa agtcgcaaag gcgtttacca tttcagcatg gaattatagt gagcaacaaa 1980 tcactggacg ggctatatat ttatttaaag gaaaaatata atatggaata cattctgaca 2040 agccgattga atcaagacat tcttgaacaa ttctttggtg ccatgaggtc aaaaggcggc 2100 ctgtacgacc atccgacgcc actacagctt aagtatagac taagaaaata tattacaggt 2160 atatttgagc aacaaaaaca gcaacgaatt aacttgtgat atgacaaatt agttaatgtt 2220 tatattgtag caaaccccgt gattgttggt agttatgtct tgtcctttgt tctataaatg 2280 tttataaatg ccattataga tttttaataa cattcacatt tttcgcagcc aagaatacag 2340 agctgctgac aggcaaagga aatgtcgaag atggtgaaga agaggagtgg ttaaatttgg 2400 ggttcaaaaa agaaaaagac tgtgatgact cccaatgtga cgatgccttt cagacggaag 2460 gaaataaaga aaatgagact gaatactgac actgtgatcg aaatacctga ccatctgaca 2520 agtgatattg acgaaatgac tgaggatgcc atcgagtacg ttgctggata tatgataaaa 2580 aaactgaaat tgcgtgacat gtcaaataaa gacgcaacgt acacatacgt ggatgaagta 2640 tcgcacggcg gtcttaaaaa acccagctcc cagtttgttg aacagctaaa aaagctagag 2700 gcaatttttc aactttacgc caaagaagaa tttgacctac aaataaatgt gaagagaacc 2760 ctgttaaacg ctgccgaaaa gttaaatgtc ccattagata taaaacaatt attttttaag 2820 tgtaggatat attttagaat taagcattta aataagaaac tggccataaa aaatcaaaag 2880 cagcgcattg tggcgaattc aaaattgtta aaaataaaac tttgaaaagg attataacga 2940 aaactacaaa tcaactgttc ttaatttgct tattttttta ttcagtatca tttcttgggt 3000 tgcgcggtct tcctgagcat tcaaaatttg tttttccgac ttaaggtcaa aaattattaa 3060 aaatctaaat atatttcaaa aattattaaa tttttcattt ttctttatag cattttagct 3120 tatgtttcag cagtgggttg tgcatataca cacagccaga ctaagggtta aactcacttg 3180 actcaaatcc atgctcaaaa caaactgaca gcaaaagctc ttttgagtgt ttttgaaaag 3240 cttttttccc ataggctaga gctttttcta acgagaccac cttatagtta ttccattatg 3300 // ID Gypsy-14-I_HM repbase; DNA; INV; 4140 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-14-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4140 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 400-400 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 112..4119 FT /product="Gypsy-14-I_HM_1p" FT /translation="MASHFGNLSEFNSNEDWDNYVERLDFFFIANDIVNEE FT QQRAILLSSCGHATYKLFKSLVAPGKTSDRSYTELCRLMSTHKSPTPNPIA FT ERFKFNSRNRDTSESVATFIADLRALTTHCAYENTLDEMMRDRIVCGINDS FT RIQRRLLSEGSNLSLQKTLDIVLTMEAAANQAAIIQSHQNVTSPAAINKVE FT IKSERHYERECFRCGGKHNPETCFFKDQECFFCHNKGHTTKKCRKKKKVLK FT NTRTQHKIEEVNGNQNEEELYRIQSKSNKRKPIMIKVEIEGISTKMELDTG FT ASVSVMSLETFTSLKKKGLTRKEIIPTKSFLRTFTGEVVLPVGEVXLSVRI FT NGQTNLLALIITPGKGPSLIGRNWLRKLKINWEKIFHASSGIPKSPNASMI FT NTLVKKHSAVFQNVLGLFKDTVVHIPLLPNVKPKFCRARPIPYALLEKVNA FT ELDRLISSGIYRPVSHSRWAAPIVPVLKKDGTIRLCGDYKQTVNQAAXCDS FT YPLPRTEDLFATLAGGQKFTKLDLAFAYQQLLLDKSSCELLTVNTHRGLFE FT PTRLQFGVHSASGIFQREIEKLIGNLPFTKVRVDDILVSGRSDEEHLKNVE FT TVLHILEKAGLKLKENKCVFMSPEVEYLGFKLTKDGVVPLEDKLNAIRNAP FT EPKDTTQVKSFLGMINYYHRHLPNLATIVEPLHRLLRKGIXWLWGMKEQTA FT FDRAKAELCSPKLLMHYDPKKELILSCDASPYGIGAVLAHVTSEGSERPIT FT YISRTLSAAERNYSQIEKEALSIVFAVKKLHQYLYGRYFTLVTDHKPLLGL FT LAEGKPIPAMTAARIQRWALTLSAYNYCLKYRSGATHGNADCMSRLPIKNT FT ESSVAENVILMMELSSTPVSAKDVKIQSARDPIISRVLDGVLSGLQFPNET FT GFKPFIARKYELGVEGGILMWGNRVVIPASLQEMILKELHSAHPGVVRMKA FT LARSYVWWPNMDKSIEETVRRCRLCELHQRSPESAPIHHWEYPSKPWSRIH FT LDYAGPFLGHMFLIVCDAYSKWIEAIVMKNVKSENLIEQLRSVFAIHGIPE FT VIVSDNGTSFSSAAFAEFVKRNSIRHIFTAPYHPSSNGQAERMVQTFKEAM FT KKLTAQQGNSIETTVNRFLFSYRITPHSTTGISPAELLMKRKLRSAFQGLK FT PDLNNSVKEKQERAERLSNRKAHLRKFDCGDQVMAKNFGSGPKWIPGRIIK FT QKGPVNFEILTDDVVIHRHIDQIRLRFSDLPEYDSNPVITFPEPTNPQPPV FT ENLSLNPFSLQPNTSIAKELPSTEESTASNAVQSSTEKDGASKLPSXMESS FT XAPVETRKSGRLRQRPIRFXDXE*" XX SQ Sequence 4140 BP; 1390 A; 762 C; 896 G; 1078 T; 14 other; yattttggcg acgaagataa atttgttaca ttatttagtg gtttaattat tggtgatttt 60 gtttaatggt tattattatg agtagtacaa attaattagt tcgttattgt gatggcttcg 120 catttcggaa atttatcaga attcaactca aatgaggatt gggataatta tgtcgagaga 180 ctagattttt tctttattgc taacgacatt gttaatgagg agcaacagag agccatcctt 240 ttgagttcat gtgggcatgc aacatacaag ttgtttaagt ctttggtagc accgggaaaa 300 acaagcgata gatcatacac tgagttatgc aggttgatga gcacccacaa gagcccaacc 360 ccaaatccaa tagcagaacg tttcaarttt aattcaagaa atagagacac tagtgagtca 420 gtagcaacat ttattgctga tctcagagct ttaaccaccc attgtgccta tgaaaataca 480 ctggatgaaa tgatgcgaga cagaatagtg tgtgggatta acgactctcg aatacaaagg 540 agactactaa gtgaaggatc taatctttct ttacaaaaaa cattagatat tgtacttact 600 atggaagcag cagcaaatca agcagcaata attcaaagcc atcaaaatgt taccagtcca 660 gcagcaatca ataaagttga aatcaagtcg gaaaggcatt atgaacgtga atgttttcgt 720 tgcggaggta aacataatcc agagacatgt tttttcaaag atcaggaatg ttttttttgc 780 cacaacaagg ggcacacaac aaagaaatgt cgtaaaaaga aaaaagtgtt gaaaaataca 840 agaacccaac acaaaataga agaggttaat ggaaatcaaa atgaagaaga gctatatcgc 900 attcaaagca agtcgaataa aaggaaacca attatgatta aagtggaaat cgaaggtatt 960 agtacaaaaa tggagcttga cactggagcg tcggtgtcag taatgagtct agaaacattt 1020 acaagcttga aaaaaaaggg actcacaagg aaagaaatta ttccaacaaa atcttttttg 1080 cgaacattca ctggggaagt agtgttacca gtaggggaag ttragctttc tgtcagaatc 1140 aatgggcaga ctaatttatt agctttaata atcactccag gtaaaggacc ttcattgatt 1200 ggcagaaatt ggctgagaaa acttaaaatt aattgggaaa aaatatttca tgcaagctca 1260 ggaatcccaa aaagtcccaa tgcatcaatg ataaatacwt tagttaaaaa gcattcygca 1320 gtgttccaaa atgtgctggg gctatttaag gatacagtgg tacacatacc tttgctgcct 1380 aatgtaaaac cgaagttttg cagagcacga cctataccat atgccttatt ggaaaaagta 1440 aatgcagaac ttgatcggtt gatatcttcg ggcatctaca gaccggtaag tcattcccgt 1500 tgggcagccc caattgtgcc agtcctaaaa aaagatggca ctattcgatt atgtggagac 1560 tataaacaga ccgtaaatca ggcagctatr tgcgatagtt atcccctacc gaggacagaa 1620 gatttatttg caactttggc cggtggacaa aaattcacta aactggactt agcatttgcc 1680 tatcagcaat tacttcttga caaaagttcc tgtgaattat tgacagtaaa cacccatcga 1740 ggtctttttg aacctacacg attacagttt ggagtacact cagcatcagg aatttttcaa 1800 cgagaaattg agaagctgat tggcaattta ccatttacca aagttcgagt tgatgacatt 1860 ttagtctcag gacggtcgga tgaggagcat ttaaaaaacg ttgaaacggt tttacatata 1920 ctggaaaaag caggactaaa rttgaaagaa aacaaatgtg tattcatgtc acccgaagtt 1980 gaatatctgg gttttaagtt aacaaaggat ggagtggtac cgcttgagga caaacttaac 2040 gctatcagga atgctcctga acctaaagac actacccagg taaaatcgtt ccttggcatg 2100 atcaactatt accacagaca tttaccaaac ctagcaacga tcgtagagcc actccataga 2160 cttctgagaa aaggcattcy ttggttgtgg ggcatgaaag aacagacagc ctttgacaga 2220 gccaaagcgg aattatgttc acctaaattg ttgatgcatt atgatccaaa aaaggagcta 2280 atcttgtctt gcgacgcatc accttatgga atcggagcag tgctggcaca tgtcacatct 2340 gagggaagtg aaagaccgat tacttacata tctagaacgc tatccgcagc tgaacgaaat 2400 tattctcaaa ttgagaaaga agcattatct attgtatttg ccgtaaagaa gctgcatcag 2460 tatttatatg ggagatattt tactctagtg acggatcaca agcctttgtt gggtttattg 2520 gctgagggaa aacctattcc agccatgact gctgcaagga ttcagcgttg ggcgttaacc 2580 ctttctgctt acaactattg tttgaaatac cgtagtgggg ctacacatgg aaatgctgac 2640 tgcatgagca gattgccgat taaaaacacc gagtcatcag tcgcagagaa tgtaatatta 2700 atgatggaac tatccagcac accagtctca gccaaagatg taaaaatcca atcagcacga 2760 gacccaatca taagtcgggt gttagatggt gttcttagtg gattgcagtt tcccaatgaa 2820 acaggattta aaccttttat agcgagaaaa tatgagttag gagtcgaagg aggaattcta 2880 atgtggggaa accgggttgt aatacccgca tcgttacagg agatgatatt gaaagagttg 2940 cacagtgccc atccgggtgt ggtacgcatg aaagctttag caaggagtta cgtgtggtgg 3000 cctaatatgg acaaatcgat tgaggaaacg gtgcgccgat gccgtttgtg tgaattgcat 3060 caaagaagcc ctgaaagtgc gccaattcat cattgggaat atccgagtaa accctggagt 3120 agaatacacc ttgattatgc aggtccattt ttaggacaca tgtttttaat cgtatgcgat 3180 gcttattcaa aatggattga ggcaatagtg atgaaaaatg ttaaatccga aaaccttatc 3240 gagcaattga ggagtgtatt tgcgattcat gggattcctg aggtaattgt tagtgataac 3300 ggaacttcgt ttagcagtgc tgcttttgct gaatttgtca aaagaaattc aataagacat 3360 attttcacag ccccgtatca tccttcttcg aacgggcaag ccgaacgtat ggtgcaaaca 3420 ttcaaggaag cgatgaaaaa gttgaccgct caacaaggca attctattga aactacggtg 3480 aatcgattct tattttcata ccgaatcacc cctcattcga ccaccggaat atcaccggca 3540 gagcttttaa tgaaaagaaa attacgtagt gcgtttcaag gcctaaaacc ggatctaaat 3600 aatagtgtga aagaaaagca agaaagagct gaaaggctca gtaacagaaa agcacacttg 3660 aggaaattcg actgtggaga ccaggtgatg gcaaagaatt tcggaagtgg tccgaagtgg 3720 ataccgggac gaattataaa gcagaaggga ccagtcaatt ttgaaatcct aaccgacgac 3780 gtagtcattc atcgacacat tgatcaaata cgactgcgat tttccgattt accggaatac 3840 gattcgaatc ccgtaatcac atttccggaa cccactaacc ctcaaccacc agttgaaaat 3900 ttgtcactaa acccgttttc gctgcaaccg aatacgtcta tcgcgaagga actaccatcc 3960 acggaggaat cgacagccag caatgcagtg caaagttcca cagaaaaaga cggggctagc 4020 aaacttccct ccgwgatgga atcgagtwtt gcaccagtgg aaacacgcaa rtcagggcga 4080 ctccgacaga ggccaatacg rttcygggat ratgaataat aataaactta ttggtgaagg 4140 // ID CR1-6B_CQ repbase; DNA; INV; 1832 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-6_CQ; KW CR1-6B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1832 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 7-7 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >98% CC identity. CC The consensus is ~80% identical to that of CR1-6_CQ. Both CC termini are truncated. XX FH Key Location/Qualifiers FT CDS 2..1831 FT /product="CR1-6B_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="NGNIDDYFHASSGECYDFIALTETWLKDHTLSSQIFG FT PEYEVFRCNRGPDNSRKTEGGGVLIAARRCFKPRRILKDAWKGVEQVWVSV FT KLADRVLYLCVVYFPPDRTFDKDLYRIHLDSVASVAERARPADDIIVVGDF FT NLPKLTWTPSHDGFLYPDPDRSQFHSCASDLLEGYNMATLQQVNHVENEDG FT RRLDLCFVSAQDFAPTLTTAPIPLAKVVRRHPALIITVAGTHSCTVRDRQD FT TVRYNFRGADYEGMSLALGRINWESVLDSVDVDAAVDTFSSIMRDHIDQFV FT PKVRKRVTYHLPWQTPELRNLKTQKRAAFKVSSRCGTLSLRDYYLSINSRY FT QRLSRSCMASYQRRKQRELKSNPKKFWKFVDENRKESGLPSSLHLADEEAE FT STEDICKLFAKKFASVFSSEPVTEDEARSAADNVPRCDRSLASIEIDEDAI FT SAAVTKLKHSSSPGPDGIPSTLLKRCSSVLQVPLLHLFRLSLASGKFPCAW FT KQAFMFPVHKKGDRRNIENYRGISALCAASKLFELVVIDPIFAHCRQELSS FT DQHGFIPKRSTATNLLCFTEFVIDSFENRSQTDAVYTDLSAAFDKINHNIA FT IAKLEKLGFRGS" XX SQ Sequence 1832 BP; 448 A; 499 C; 445 G; 440 T; 0 other; gaacggaaac atcgacgact actttcacgc cagctcggga gaatgttacg atttcatcgc 60 attaacagag acttggctca aggaccacac gctatcctct cagatcttcg gacctgagta 120 tgaagttttc cgatgcaatc gaggtccgga caatagcagg aagacggaag gaggaggtgt 180 cctaatcgcc gcacgtcgct gcttcaagcc gcgccgaatt ctcaaagacg cttggaaggg 240 cgtcgagcag gtttgggttt ctgtgaaact ggctgatcgt gtgctttacc tatgcgtggt 300 ctacttcccg ccggatcgga cctttgacaa ggacctgtac cggatccatc ttgattctgt 360 ggcatccgta gctgaacgag cgcgcccggc tgatgacatc atcgttgtgg gtgacttcaa 420 tttgccgaag ctgacctgga caccttccca cgacggcttt ttatatccag atcctgatcg 480 ttctcaattt cactcttgcg caagtgatct cctggaagga tataacatgg cgaccttgca 540 acaagttaat catgtggaga acgaagatgg gcgtcggctc gatctctgct tcgtgagcgc 600 acaggatttc gccccaacgc taaccaccgc tccgatcccg ctcgccaagg ttgttcgtcg 660 tcatcccgcc cttatcatca ctgtggccgg cacccacagc tgcacggttc gtgatcgaca 720 agacactgta cgctacaatt ttcgcggtgc tgattatgaa ggaatgtcgc tagcactggg 780 tcgcatcaac tgggagagtg tgttggattc tgtcgacgtc gacgctgccg ttgatacgtt 840 ctcatctata atgcgagacc acatcgacca gttcgtaccc aaggtcagga agcgtgtcac 900 ctatcacctt ccgtggcaaa ctccagagct tcgcaacctc aagacacaga agagagctgc 960 ctttaaagtc tcctccaggt gcggaacact ctctcttcga gattattact tgagcatcaa 1020 cagcagatac cagcgactga gccgcagttg tatggcaagc taccaacgcc ggaagcagcg 1080 tgaattaaag tccaatccca agaaattttg gaaattcgtt gacgagaatc gtaaggaatc 1140 tggactgccg tcgtccttgc acctggccga cgaggaagct gagagcaccg aagatatctg 1200 taagctgttt gccaaaaagt ttgcaagcgt tttttcaagc gaacctgtaa ccgaagacga 1260 ggcccgatct gctgccgaca acgttcctcg ttgcgaccgc tctcttgctt cgatagaaat 1320 tgacgaagac gccatctcag ctgcggtgac taagctaaag cactccagct caccaggacc 1380 ggacgggatc ccctcgacac tgctgaaacg ttgttcatcg gtattacaag tacccctctt 1440 acacctattt cgcctatccc tcgcatctgg aaagttcccg tgcgcatgga agcaagcctt 1500 catgttccct gtccacaaaa aaggcgatcg caggaacatc gagaattaca gaggtatatc 1560 ggctctgtgc gctgcatcga agctgttcga gctggtggtg attgatccca ttttcgctca 1620 ctgccgccag gaactatcaa gcgaccaaca tggattcatt ccgaaacgtt ccaccgcaac 1680 aaatctactc tgttttactg agtttgttat tgacagtttc gaaaatcgct cgcagactga 1740 cgcggtgtac acagatctgt cagctgcctt tgataagatc aatcacaaca tagcaatcgc 1800 caagctcgag aaacttggat tccgcggtag tt 1832 // ID Harbinger-N2_BF repbase; DNA; INV; 515 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N2_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N2_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-515 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-515 RA Kapitonov V. and Jurka J.; RT "Harbinger-N2_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 815-815 (2008). XX DR [2] (Consensus) XX CC This transposon is characterized by TWA TSDs and 31-bp TIRs, CC copies are 93% identical to the consensus sequence. XX SQ Sequence 515 BP; 143 A; 94 C; 100 G; 178 T; 0 other; ggccacagca agtaaatttt atggatgaca tcagcgcgct cattaatttt cgcctgattt 60 cagaaaaaaa aaaagaattt ttttcctctt cggggatggt cagatagacc aaaataggca 120 aacatgttgt gttgtagcct aataatcata cttgtagaag tgacactagg gaggtagcca 180 tgactatagt tgcagaagtg aaacttgttg gatagttgta aaagtgccac caatatggaa 240 gccatgaatg tagttgccag cttggcaagt gataccagtg taccacacta gtatcacttt 300 tactacacta gttcacttct ttaatacagg aatgaatgcc ttgttagttg tgatgccacg 360 ttttgtcact tcaaatcttg ttacgacgtt tatggtgtct attttgaaaa tttagccatc 420 ttgttttttt tacccatctt gttttttttt cgcgctatct cattattttt gcactctgca 480 taggatgtca tccataaaat ttacttgctg tggcc 515 // ID Copia11-NVi_LTR repbase; DNA; INV; 320 BP. XX AC AAZX01012276; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia11-NV; KW Copia11-NVi_I; Copia11-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-320 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1166-1166 (2007). XX DR Genome; AAZX01012276; Positions 5019 4700. XX SQ Sequence 320 BP; 89 A; 78 C; 73 G; 80 T; 0 other; tgttgtaatg aaaagaagtt tcaatactgc tcatgcgtgc acagtaggcg cgaaagtatt 60 taagagtccg tgcgagagca agagagaccg cgatcgaact gtatatccgt ctctctttca 120 cagcgcgtct attgctgtct ctttcttatc gcgcgtatat gctcggggac cgccaaagcg 180 atggggagat agtcacgagc ctttgccgag gtaggacagc taacgaagca tttcaagaag 240 agaatgaacc ttatactact gcacacaact agacttgatt ttctttgtcc acaccacaac 300 gtcctcacca gaattccaca 320 // ID BEL-219_AA-I repbase; DNA; INV; 6348 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-219_AA_; KW BEL-219_AA-LTR; BEL-219_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6348 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 895-895 (2011). XX DR [2] (Consensus) XX CC Positions [5383-5670] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 35..3058 FT /product="BEL-219_AA-I_2p" FT /translation="MINRTPIKTRSAAAARVTQSTELDGATNTPIRCKACA FT EPNTTRAIRCFECEDEYHVECVEVTERGVQDWRCSACLGKQGDKLKKTTSV FT PISEPFVSSTNPSSQTMQTTVTSLDTYSNPVQNMSAPSTLLISPGSLFPPL FT PPHSLLPPHSVIPPNSFAPPCYSIPSYQPAQYQYAMPIPPSRQSNMHVSFA FT SGKNYEQCNTVQSSFGQLIPSVFNAVPSTSYNSMGIITSLPEGRPEETPAS FT AFTKYPHQGLEDNHSQYSGTSSQRSTKQRQLKLKLQILEEQRQLQEKEEAT FT KREYLRKRHELMTEIANEEESVIDMEQELPEARVSNWLQGKDPGPVTGAQN FT HQHFLVRPEQSSTIRENTNQQFRTSVEGLPRTHRQGETRLNRSAPRSSIQQ FT VTSAFGQMQMGNNDQQSAVNQGPHSSIGMAPPQISTSISTPRLMDFDGEDE FT VFLSRSHIAARQAVSRDLPTFNGTPEEWPLFYSTFTTTTRMCGYTQEENLI FT RLQKCLKGKAHEAVKCRLMHPSNVPGIMSTLKMLYGNPEVIVQNLIAKIQS FT TPQPKAEKLDTIIEFSLAIQNLCATIEACKLDEYSYNIALLNEFVNKLPCG FT FKVEWAKHRRNLPRANIREFAYWLNELAETVCPIASLQSSGVKNAQGNKNS FT AYLHAHSEDTFEDSTENPSKSFKYTRSDQTLECVACKSSCPTLEKCQRFVE FT LGYNSKWDVVKEFGLCRKCLRKHKGPCRSQQVCGKNGCTYKHHQLLHNYQR FT DQTNATNSKEEDSKAVVQNSSGECNMHRQHKSTVLFRVLPITLHAQNKSIK FT TYAFLDDGSSLTLIDASLAQELNLTGRSEPLCLKWTGNNSRREDDSKNINV FT EISGTGKASKTYTAVAHTVASLNLFRQTIDAQKLKAQYSHLQGIPLESYRN FT IQPKILIGSDNANLIFQLKGREGKFQEPIATKTRLGWSVYGGTTGVDALVG FT HHSVQICPCNTQSDEILQQAICEYFSFDALGIYKPEKVLESHENQRAREIL FT QR" FT CDS 3175..5655 FT /product="BEL-219_AA-I_1p" FT /translation="MKKQPELAAVLRDKIEDYKRKGYIRKLTNEEIHARHE FT RVWYLPLFPVFNPNKPGKVRVVWDAAAKTNGVSLNSLLLKGPDLLTPLDYV FT LYRFREFRVGLSGDVREMYLQMLMAEKDQHCLRVLWNDDSAGDPNTYVTQV FT MPFGTNCSPTCAQYVKNLNAKKYEEQYPEAANAIIKQHYVDDMLVSVETEE FT EAIKLAKEVKFVHSQAGFDMRNWISNSSAVVEAMMEDTPQDKNLNIGAELG FT TEKVLGMWWCTSTDTFTYRLSSKHDPALLEGLRKPTKREILRTLMAVFDPL FT GLISNVLIYLKVLFQEVWRSGIGWDDVIPEHLHEKWEQWLEILPAVQNIRI FT PRCYRLTTQLSAQTNIQLHTFVDASLSGYAAVVYLRFEQGNTVECAIVGAK FT TRVAPLKFVSIPRLELQAAVIGVRLADTITRALSIKVHQRCFWSDSRDVLC FT WIRSDHRRYSQFVAFRVSEILESTTMAQWGHKGSKENVADEATKWQGLPDL FT SSNSRWFRGPSFLWETSNTWPSNPFETDTTSEELRAQICYHHEHVEGLIKP FT QDFSSWTRMQRIVATVIRSIHNLRCKANKAQRRIGPLSAYELQQGAIHLYR FT LAQADAFGDEISMLSSGTDRVTLPKASAIYSFNPFIDANKLLRMQGRISAC FT EYASMDCKNPIILPKDHHVTWLVVQHYHERYHHQNHTTVINELRQSFKIPK FT IRRLFQRVRTTCQKCKIHAASPRPPIMGDLPPDRLAAFARPFSFVGIDYFG FT PMTVAVGRRTEKRWGVLITCLTMRAVHIEVAHSLNAASCVMALRNFMARRG FT VPIRIHSDRGTNFTAAKKELAAAND" XX SQ Sequence 6348 BP; 1928 A; 1437 C; 1470 G; 1497 T; 16 other; atttaaaatc ttcgtttact gatcaagtgg agctatgatc aatcgcacac ctatcaagac 60 gaggagtgcc gctgcagcca gagtgactca atcaacggag ctggatggtg cgaccaatac 120 tcccatcaga tgcaaagcat gcgccgaacc gaacaccaca cgagcgatcc gttgtttcga 180 gtgtgaggat gagtaccatg tagaatgcgt tgaagtcacc gaaagaggag tccaagattg 240 gcggtgttcc gcctgtttag gaaaacaagg agataagtta aagaagacaa cgagtgttcc 300 tatcagtgag ccatttgtaa gtagcaccaa tccaagctcg caaacgatgc aaacgacggt 360 aacatcgtta gatacttatt cgaacccagt acagaacatg tcggccccgt ccacgttatt 420 gatatcacca ggttcacttt ttccgccttt gcccccacat tcgctactgc cacctcattc 480 agtaattcca ccaaattcgt ttgcaccacc ttgctattca atcccgtcat atcaaccagc 540 acaatatcag tacgccatgc caatcccacc aagtcgtcag tcaaatatgc acgtttcgtt 600 cgcgagtgga aaaaattacg agcagtgcaa cacggtgcaa tctagtttcg gtcagctaat 660 tccatccgta ttcaacgcag ttccatcaac ttcctacaat tcaatgggca tcatcaccag 720 tctacccgaa ggtagacccg aggaaactcc ggcttcagcg tttacaaagt atcctcatca 780 aggcttagag gacaaccaca gccagtattc cggaaccagt tcacaaaggt cgaccaagca 840 gagacaattg aaattgaagt tgcagatttt agaagaacaa agacagctgc aggaaaagga 900 agaagcaact aaacgcgagt atctacgcaa acgacacgaa ttgatgacgg aaatcgccaa 960 tgaggaagag tcagttatcg atatggaaca agaattacca gaggctcgag tatccaactg 1020 gctacaggga aaggatccgg gccctgttac aggggcgcag aatcatcagc attttttagt 1080 cagacctgaa caatcatcaa ccatacgcga aaacaccaac caacaatttc gaacttcagt 1140 tgaaggactt ccaagaactc atcgtcaagg cgaaactcgt ttgaatcgtt cagctccacg 1200 aagttccatc caacaagtga catcggcgtt cgggcagatg caaatgggaa acaacgatca 1260 acaatctgca gtcaaccaag gtccacattc aagcatcggt atggcacctc ctcagatttc 1320 aacctcgatt tcaactccta gattaatgga ctttgatgga gaagatgaag ttttcttatc 1380 ccgtagtcat atcgcggctc gtcaagctgt ttctcgtgac cttcctacat tcaatggcac 1440 cccggaagaa tggccgttgt tttactcgac atttactaca accactagaa tgtgcgggta 1500 tacacaggaa gagaatctca ttcgactaca gaagtgtttg aaagggaaag cacacgaagc 1560 cgttaaatgc agactaatgc acccatcaaa cgtacccggc ataatgtcca cgttaaaaat 1620 gctatatgga aacccggaag ttattgtgca aaatctgatc gccaagattc aatcgacacc 1680 gcaaccgaag gcagaaaaat tggatacaat tatcgagttt tctctggcaa ttcaaaatct 1740 ttgtgcaact attgaggcct gcaagttgga cgagtattcc tacaacatag cgcttttgaa 1800 cgagtttgtg aataaacttc catgcgggtt taaagtagag tgggctaaac atcgtcgtaa 1860 cttaccgaga gcaaacatac gggagtttgc ttattggtta aacgagttgg cggaaacggt 1920 ctgtcctata gcaagcctgc agagcagcgg agtgaaaaat gcacaaggaa acaaaaattc 1980 agcgtatcta catgcacaca gcgaagatac atttgaagat tccacagaaa atccttcaaa 2040 atcgttcaaa tatacccggt cagatcaaac tctagaatgt gtagcatgca aaagtagttg 2100 tccaacattg gagaaatgtc agcgcttcgt cgaacttggc tacaactcaa aatgggatgt 2160 tgtgaaagaa tttggtctat gccgaaaatg tctacgcaag cataaaggac catgcagatc 2220 acaacaggtg tgcgggaaga atggatgcac atacaaacac catcaactac tacacaatta 2280 tcaacgggac caaacgaatg caaccaattc taaagaagaa gattccaagg cggttgtaca 2340 aaattcaagc ggcgaatgta acatgcaccg acagcataaa tccactgtgt tgtttcgagt 2400 gcttccgatc acacttcatg ctcaaaataa gtcaattaaa acttatgcat ttttagacga 2460 cggttcttcg ctcactctta tcgatgcttc gctggcgcag gagttaaatt taacaggaag 2520 atccgagcca ttatgcttga agtggaccgg caataatagt cggagagagg atgattcaaa 2580 aaacataaat gttgagattt ctggaactgg gaaagcgtcg aaaacgtata ccgcagtagc 2640 ccatacagtt gcaagtttga atctgttccg tcaaactata gatgcgcaga aattgaaggc 2700 acagtattct catcttcaag gaataccgtt ggaatcgtac cgtaatatcc aacccaaaat 2760 attgattggc agtgacaatg cgaaccttat cttccagctt aaaggacgtg aagggaagtt 2820 ccaggagcca attgcgacga aaacccgact tggttggagt gtgtacggag gcactacagg 2880 ggtcgatgct cttgttggtc atcatagtgt gcagatatgc ccttgcaaca ctcaatctga 2940 tgaaatactc caacaagcga tatgcgaata cttcagtttt gatgcgcttg ggatttacaa 3000 acccgagaag gttctggaat cgcatgaaaa tcaaagagct cgtgaaatac tacagcggta 3060 acccaaacaa caagcggtcg ttacgaagta agtctgctgt ggaaattcga ctctatacat 3120 ctaccgaata gtaaaactac ggcgttgcag cgctttcgct gcttagaagc gaggatgaaa 3180 aaacagccgg aactagctgc agtattacga gataagattg aggactataa acggaaaggt 3240 tatattcgaa aactgaccaa cgaagaaatc catgccagac atgagcgtgt atggtacctt 3300 cccctatttc cagtatttaa ccccaacaaa cctggtaaag ttagagtagt ttgggacgcc 3360 gctgccaaaa ccaatggagt gtcgctaaat tctcttttac ttaaagggcc ggacttacta 3420 acaccgttag actacgtgtt atatcgtttc cgagaattcc gtgtaggtct cagtggtgat 3480 gtacgagaaa tgtacttaca gatgctcatg gctgagaaag accaacattg cctgagagta 3540 ttgtggaacg atgactcagc tggcgatcca aatacttatg taacgcaggt tatgccgttt 3600 ggtacaaact gttcacctac ctgtgcccaa tacgtcaaaa atctcaacgc aaagaagtat 3660 gaagagcagt atccggaagc cgcgaatgca attataaaac agcattacgt cgatgacatg 3720 ctggtcagcg ttgagacgga agaagaagcg atcaaacttg ccaaagaggt gaaatttgta 3780 cactcacaag cgggctttga catgcggaac tggatttcga attcgtcagc agtagtagaa 3840 gctatgatgg aagatactcc tcaagacaaa aaccttaata ttggagcaga gctgggaacc 3900 gaaaaggtat tgggtatgtg gtggtgcaca tctaccgata cgttcaccta cagactttct 3960 tcaaagcatg acccagcttt gttagaaggt cttcgaaagc cgactaaacg agaaattttg 4020 agaacgttaa tggcagtttt cgaccctctc ggattgatat ccaacgtcct catctatctg 4080 aaagtattat ttcaagaagt ttggagatcg gggattggat gggatgatgt cataccagaa 4140 catcttcatg aaaaatggga acaatggttg gagattcttc ctgcagtgca aaacattcgc 4200 ataccgcgat gctaccgtct gactacacag ttgagtgcac aaaccaacat tcagttgcac 4260 acctttgtcg atgcaagtct atctgggtat gccgcagttg tttatctacg ctttgaacaa 4320 ggtaatactg tggagtgtgc aattgttgga gctaaaacaa gggtggctcc actcaagttt 4380 gtatccattc cacgactgga actccaagcc gccgtcattg gagtgagatt ggcagatacc 4440 atcacgaggg cattgtcaat caaagtgcac cagcgttgct tctggagcga ctcacgagat 4500 gttttgtgtt ggattcgatc agaccatcgt cgctactcac aattcgttgc gtttcgtgtc 4560 agcgaaatct tagaatcaac aacgatggca cagtggggtc ataaaggatc gaaagaaaat 4620 gtcgccgacg aagccactaa atggcaagga ttgccagatt tatcatcaaa cagcagatgg 4680 tttaggggtc caagtttttt atgggaaaca agtaacacct ggccgtcgaa tccattcgaa 4740 accgacacaa cgagtgaaga acttcgagcc caaatttgtt atcaccacga gcatgtggaa 4800 ggcttaataa aacctcagga tttttccagc tggacgcgca tgcagagaat tgtagctaca 4860 gtgataagat ctatacataa cttacgttgt aaggcaaaca aggcacagcg tcgcattgga 4920 ccactttctg cttatgaact tcaacaaggc gctatccatc tataccgatt ggctcaagca 4980 gatgcattcg gcgatgaaat aagcatgcta agttcaggta cagatcgagt tactctgccg 5040 aaggccagcg caatctacag tttcaacccg tttatcgacg caaacaaatt actcagaatg 5100 caaggtagaa tcagtgcatg tgagtatgct tcaatggatt gcaagaaccc gattatactt 5160 cctaaggacc atcacgtaac ttggttggtc gtgcagcatt atcacgagcg ataccatcat 5220 cagaaccata ctacagtcat taacgaactg cggcaaagtt tcaagattcc aaaaattaga 5280 cgactgtttc aacgcgttag aactacctgt cagaaatgta aaattcacgc tgcatctcca 5340 cggccaccta ttatgggcga tcttccgccg gatagacttg ctgcattcgc cagaccgttt 5400 tcctttgttg ggatagatta tttcggccct atgactgtgg cggttggccg ccgtaccgaa 5460 aagcgatggg gtgtactgat aacgtgcctg acgatgaggg cggtgcacat tgaagttgcg 5520 cactctctga atgcggcttc atgtgtcatg gcactgcgga atttcatggc gcggcgtggt 5580 gtaccgatta gaattcatag tgaccgtgga acaaatttta cggcggcgaa aaaagaattg 5640 gcggcagcga acgatgasct kakmgcgcgc tgaaggagat ggaccaagat aaagtcgtcg 5700 cmgaaatcgt cagttcagac accgaatgga cattcctgcc ccccgcgtcc ccgcatatgg 5760 gcggagcmtg ggaacggctc atacagacgg ttaagagkaa cctacgggca atgcaaccag 5820 gmcgaaaccc atctgacgaa gtcctccgca ataccttagc ggaagttgag aacgtcgtca 5880 attcacatcc actgacgtac gtaccggtmg aagattcgga ggcgccagtc cttactccga 5940 accacttcct cttgggttca tccagtggcc tgaagccgct gaccgtattc gacgataggg 6000 cggtggttct kmgacgagct gttgtatgtc gcaaattgag gcaaatgttt tctggcaaag 6060 atggttacgw gattatctgc cggacataac aaggcgaaca aagtggttct acgaggtgaa 6120 gccgattgaa ktsgacgaca tcgtggtggt agtggacccg gaactaccac gtaattgttg 6180 gccaaaaggc agagttattt ccgtcaacac aagcaaggat ggacaggtgc gttccgcggc 6240 agtccaaacc aaaacaggga tatacgagcg tccggctack aaattagcgg ttctagatgt 6300 tcgacgcgaw gaatcggtaa gccaggaacc tggcctaccc cgggggac 6348 // ID Gypsy-32-I_NVi repbase; DNA; INV; 8549 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-32-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8549 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 1000-1000 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 3220..5361 FT /product="Gypsy-32-I_NVi_1p" FT /translation="EAGKLKHPAVKETDSRLPYTARLHTRDILSTDLFLID FT TGADISVIPKPTNWQGKPIDLKLYAVNGSTINVYGTVLRELDVGLPYRLTW FT NFTIADVPHSIIGADLLTHYHLLPDLKNKRLVDGNLFTHAPAFIKSVRPMQ FT ISFLAPSHKYANIINNFPKVFGPDQFRSTKKRGVFHHILTTGPPSAQRARR FT LKPEKLKIAKAEFRRMVEQGICRPSNSNWASPLQMVPKKNNKVRPCGDYRK FT LNIDTVPDRYPVPHMHDCMAFLHGKNIFSALDLRQAYHQIPVAPEDIPKTA FT VITPFGLYEFPVMTFGLRNASQTFQRYINSALGDLDFVFVYLDDILIASTS FT EEEHKKHLETVLSRLNEHELQVNLEKCSLGVSELIFLGHLITPNGFKPNPE FT KVKAINDFPLPKTIEELRRFLGLVNSYRRLLAHSAETQRHLNDFLKGAKKK FT DKRPVPWTKEAEAAFQKCKDDLVNLSFTAFPSENAELRLIPDASDTAMGAA FT LEQRSGNSWQPLAFFSQKFSPAQMKYATYDRELTAIYEAIRYFHHYLEGVE FT FKTYTDHKPLIYALQQNHDKMPAIRSRRLSYIAQYNTEIYYLPGEENDVAD FT ALSRINAFTSPTLFNWSDPELFTDPQVKKVLANISAFKLPTMFDAKQLSQE FT QGNDDQLKGILNDASHPLKLRKLTWGSDHAEFYCDIHEDVIRPYIPKKLRK FT AGAKRAQLVNNLK*" FT CDS join(5178..6605,6517..7227,7200..8423) FT /product="Gypsy-32-I_NVi_2p" FT /translation="RPAQGHSQRREPSAKASQAHMGLRSRRILLRYSRRRH FT SSIHSKKVEKGWCKTCTACQQSKVTKHNKFLPKHFIAPDARFDHVHLDLVG FT PFACSHGYTHLLTIIDRYTRWPEAIPIADITAATVARTFYDNWICRYGAPT FT TITTDQGAQFESRLFNELLSILGINRIRTTAYHPASNGMVERLHRDIKTAL FT MCHGDNQEWVRLLPTVMLGLRSRIRLDTDASPADLVFGKTMRIPGDFSPFT FT NEEPNVRTFYNEFRDVMQQLRPVPVSHKTATKPFIHQDMKTCTHAWMQEKP FT IKPALTRPYTGPHKVISRNMENQTFNIDINGTQKTVSLQRLKPAFILQEDP FT DGAKENARPEPIKIPKSTKDNSAQRAPKTSIRTQLPADDIQPAPPSPPIVD FT PPDQPLPVPQPPAPKRVRFVPDTSKCSKNQPSEPVPSAEPGPKRTKFVPNI FT LKRKTNANSIQSRKHTPRKLNSFAGGGGRSANEKRTRIQSSHVNTHRENST FT PLRGGGGAVPNQRYISHRIVSNHTIFLLSMDKLGELRAQLNRFRNNSSDER FT VRADCFNADLELESFENFLRRKLEHRDVRLRMARHHSLADHPVMYEPPIPQ FT LFSNNVPITQLFLRNLNYRLNKNKLREVLSAQGFEDVEITSLKNTKSGSVA FT TLVCKSLASSCKIQVKSCKKELIFTDPKGKVSYIRIRPDQFSSQNALLPTQ FT ASSKKISVLPQALKNISIAAGSLSLLDCPQAVNHLLEYVSANDIYNFYIAS FT FENPKFLAVDFTFDSAADRYNSENGLNRMLDDARKLSFRSILLNAHRITIE FT TPFVNKLLNICVNRRLDPDRRLWLRQVDLSAVAASEQTIRTLSESCTIINL FT TLGSTPGGCDRSLDINKLNTLTLVNNAQLSLKFLTKDCFSLITSLKIIGCN FT EVPMDKLFNFVTINTNLKTLVFKDSQSRNKNSVDLLIAVFSSRNCIEEFQF FT DYSSIREFRTQLIPNMVLNQTTSLRALNLENNDFFKNWIPDPIIDNASNLT FT ELNILGLEFEYSLPLHRLPFLASLKLNYNQAIETTLANLVNKSKLDVLVNN FT TRGKKRTRAINLASLRDVTFGCAKLSLKFINCKFVALASTQRALESDGVIV FT QRSENSIIICKR*" XX SQ Sequence 8549 BP; 2584 A; 2291 C; 1758 G; 1915 T; 1 other; ttggtgaccc cgacgtgatt tttggcaacc agccacagca ccttcgacga gaagaatagt 60 gtcagtgaca gtgcacgcga ctctctctct agcaacggcc tctttgtctc gcacgctcgc 120 ggacaaagac atgctgcgat tgcgtcagtc aacgcgcatc gcagcccttc cgtgatacgc 180 tggcacgccg gtacacacgt gaagagccac acaaaaataa gtcgacatca ccgacttcct 240 cgacacacgt ctccacggtc gagcgcggca gcaacaggtc ggcaagaccc gccgccattt 300 tctccaagct atacgcgcgc aggcatgccg gcgcgtatat acttgtcgca actcgacgtt 360 gcgcgagaaa gagtgtgcgt gctggcacgt cgaccaactc gcaccaacgg cttcctggac 420 accgcagcat agcacagcat ccagaggcca cttcggcgcg gggcagctgc cgccgagttt 480 tccatcgcac atcagcgcag cgggttcgtc gttggatacg cgtgaagcca ccgacaccgg 540 cgccttcaac gggggaattg catcactcct gctttgcgtg aagccaccga caccggcgcc 600 ttcatcgaga agtccagcat cccctcgaca acgcagcggg aaattcttct catcattttt 660 gcgcggcgat tcacgcggga atcgccattt tttattttat aaggtaaata agtgcgcgcg 720 cgcgagattt cacacaaagc cgccatcttg cttcggcaag caatcaattc taagaaagcc 780 agtgcgaacg gctttagttc acagaatcga cacgcacgcg cggcgtagcc actatacaaa 840 gcgaattcga taaatctaat tcgattcaag cgagtaacaa agttttcttt ttttattttt 900 acacagacaa tagcgctata cgaatataca aatacttgcg cctctcgcaa gcgctattca 960 gtcaaaattt atttattaat tcggcattct cacattttat tttatcagga gttccagcaa 1020 ttaccacttc cggactgtgt tcctcgcaag cggtactacc tgggcttcat caacgcggct 1080 gatccgcaaa taagcggcaa gcaagcgcta acgcgttaag ttcggcaacg gctcctttcg 1140 ttgttactgc ctctcggacc gagctacaca tgggcagtac agcaaagcaa ccacgtcaga 1200 ctctagcgta tccgctgtgc ttgaaccagc tgcaacagca gataaggtaa agaaagctct 1260 caaaaggtga caatcttaga ttaggcgtga ggcagacaga cattgctttg ttcatgtatg 1320 tacctgcgag tacattttag tcattttaat aattttgtac gtctgatctg cactcgaggt 1380 tagattagac actgcgcctc accgcacatg agcgaacgtg acttgcattt acgtctcaga 1440 tcgggtaggc aggttcctaa attatccttc cccacaagtc acgccaacaa gaggcttaca 1500 aaatcaagaa atatgacgtt tccaagaaca ccaatcaagt cacaaaatac aaatgaagac 1560 aacgagaccc aggtcaccac atcgagcgaa agcctacgca ttcagactcc tccagtcaca 1620 gcgatggcaa acacagatca acctgcaaac acatcagggt taacaacctt cgctcccata 1680 gactctaccg actcgtcagc agggtccagc caagattata atggccttct aaacgccttc 1740 gctgttggca caggccagca acgaggtgct tcagggccca acttcaaatt cgggtttcct 1800 aaccagcaac acgtgtccgc aaacacagcg tcgtctacgt cgtctccaaa aaacacgcaa 1860 acttgcttcg gagcgaaaca atggtcacag acgatagacc actcactgtc gcaaaacacg 1920 catatgtcta gctcagagat aggtgcctac aaaaagccaa gttggcataa tcaagacaaa 1980 actgccaatg agcattgaca gttagcagag aagaagtcca cagaaataaa tgccgagcag 2040 atcatggaat ctgtcgagca caaaataaag agtggcatag aagaaatgaa agattacatg 2100 aataaatgct tcaccacctt gttcgcgcac taacaagcct catccggtga agacaagagt 2160 gtacccacga cctcgagttc acctcaggtc acgacgacaa cgaatctgtc aaacttcacc 2220 acagcaaaca ctacttgctg gacacgagga gcatgcggaa gcgctccatc gaccgtattt 2280 cctcctcagc agcctcacat accgccgtac gcaccagttc cagtcccagg cacgcaacca 2340 aacatgggag taggtttccc aacacaggga aatacaggat ttggtcaaca gcacaacgca 2400 tctgcggcag caagcaccag gcacttgcta ttttggtcgt tctataaaca cgatccacaa 2460 ggctggttta ggcaactaga ggatctattc gcaacaaata acgtcacgga agacgaagta 2520 agattcaact ttgtcggcag gtacctcagc ccagacattc accacgagat ccagtacaaa 2580 atgaatactc tcatcagagg acaaaagtac ctgcaactga agaaaattct caccgacaaa 2640 tacgctgaga ctccagagca acagctggat agacttttca aaggccttga aatcggtgca 2700 aaacgaccat cagatttttt cgcagagatg attagcttgc gtgcaaagcc gagtaccaag 2760 agacacagtc atccagctct ggaaaaatag gtggtcacct cagattcgcc tgatggtatg 2820 gaactataag gaagaatcag agctcatcat gaacgcagat atggcttatg aggtgcttca 2880 acaatctcac aacatcagta ccatgaaatt cggcaaccag ccggattcgt caatcagaga 2940 caccatggaa aggctggacc aacgcattgc ggcaatcgag aaaacgccga acacacaacc 3000 agatgatcgc tcaagaccca aggggcgaaa caactctcgc aataacgcga acaacaatcg 3060 cggctctaca ccgaatagat ccaagtccag gtcacgtacc aacggcttat gctttttcca 3120 caacaagcat ggcaaagaag ctcgcaactg ccagcctccc tgtacctgga aacgtggccc 3180 tgggcaagac tacagggtga gagaaccggc ggactctaag aagccggaaa actaaagcat 3240 cctgctgtta aggagactga cagccggtta ccatacaccg cacgtctcca cacgagagac 3300 attctctcaa ccgacctgtt cttaatagac acaggtgcgg atatctcggt gatacccaag 3360 ccaaccaact ggcaaggtaa gccaattgac ctcaagctct acgcagtcaa tggctccacc 3420 atcaacgtct acggcaccgt acttcgtgaa cttgacgtcg gattgcccta cagacttaca 3480 tggaacttca ctatcgcaga tgttccgcat tcaatcatcg gagctgatct cctcacacat 3540 tatcatttgc taccagacct caaaaacaaa aggctggtag acggtaatct attcacgcat 3600 gcgccggcgt ttattaagtc agtgcgcccc atgcaaattt cgtttctggc accaagccac 3660 aaatacgcca atataatcaa caatttccca aaagtttttg gtcctgatca atttagatcg 3720 accaagaaac gtggcgtatt tcaccatatc ttaacgacgg gacctcccag cgcacaacga 3780 gcacggcgtc tcaagccaga aaagctcaag atagcgaaag cagaatttcg caggatggtc 3840 gaacagggca tctgccgccc atcaaacagc aactgggcaa gcccactgca aatggtaccg 3900 aagaaaaaca acaaggtacg gccgtgcggc gactaccgca aacttaacat tgatacggta 3960 cccgatcgct acccagtgcc acacatgcac gactgcatgg catttttaca cggcaaaaac 4020 attttttcgg ctctggatct gcgtcaggca tatcatcaga tacccgtcgc accagaggac 4080 atcccaaaga cggcagtcat tacgcccttt ggactctacg agttcccagt catgacgttt 4140 ggccttcgca acgcgtcaca gacgttccag cgctacatca actcagcgct aggagatctc 4200 gatttcgtct tcgtctactt agacgacatt ttaatcgctt caacatcgga agaagagcac 4260 aagaagcacc tggaaacagt gcttagtcgt ctcaacgagc acgagctgca ggtcaattta 4320 gagaaatgca gtctcggcgt ctcagagctc atcttcctag gccatctcat cacaccaaat 4380 gggttcaaac caaacccaga gaaggtcaaa gcaatcaacg actttcctct gccgaagacg 4440 atagaagaac ttcgcagatt tttaggccta gtcaatagtt acaggcgact tcttgcacac 4500 tcagcagaaa ctcaacgtca cttgaacgac tttctcaaag gtgccaagaa aaaggacaaa 4560 cgtcctgtcc catggaccaa agaagccgag gcagcattcc aaaagtgcaa agacgacctc 4620 gtcaaccttt cgtttactgc gtttcccagt gaaaatgcag agctgcgtct cattcccgac 4680 gcatctgaca ctgccatggg ggcagccctc gagcagcgct cgggtaactc ctggcagcct 4740 ctagcattct tctcgcagaa attttcacct gcccagatga agtatgccac atacgacaga 4800 gagctcacgg cgatctacga agccatcagg tacttccacc attaccttga aggtgtcgag 4860 ttcaagactt atacagatca caagccattg atctatgctc tacaacaaaa tcacgacaaa 4920 atgcctgcca ttcgctcacg caggctgtcg tacatcgccc agtacaacac ggagatctac 4980 tacctcccag gagaggagaa tgacgttgcg gacgcactgt cccgaatcaa cgctttcacc 5040 tcccctacac tcttcaactg gagcgatccg gaacttttta ccgacccaca ggtaaaaaaa 5100 gttctcgcga acatcagcgc attcaagcta ccaacaatgt tcgacgcaaa gcaactgagt 5160 caagaacaag ggaatgacga ccagctcaag ggcattctca acgacgcgag ccatccgcta 5220 aagcttcgca agctcacatg gggctcagat cacgcagaat tctactgcga tattcacgaa 5280 gacgtcattc gtccatacat tccaaaaaag ttgagaaagg ctggtgcaaa acgtgcacag 5340 cttgtcaaca atctaaagtg acaaaacaca acaagttttt gccaaaacac tttattgctc 5400 cagatgctag attcgatcac gtccacttgg acctcgtagg accttttgcc tgcagtcacg 5460 gctacacaca tctcctcacc atcatcgatc gctatacgag gtggcctgag gcgatcccaa 5520 ttgcagacat aactgcagca accgtagctc gtacatttta cgacaactgg atttgtcgct 5580 acggcgcacc aacaacgatc accactgatc aaggtgcaca gttcgaatcc aggctcttca 5640 atgagctact gtcaattctc gggataaaca ggattcgcac cactgcgtat catcccgctt 5700 ccaacggcat ggtggagcgc ctccaccggg acatcaagac agccctcatg tgccacggag 5760 ataatcaaga atgggttcgt ctactgccaa cagtcatgct tggacttcgc tcacgcattc 5820 gactagacac ggatgccagt ccagctgacc tggtttttgg caagacaatg agaattcccg 5880 gagacttctc tccgttcacc aacgaagaac ccaacgttcg cacgttctac aatgaattcc 5940 gcgacgtcat gcagcaactg agaccagtcc cagttagcca caaaacggca accaaaccat 6000 tcatacatca agatatgaaa acttgcactc acgcctggat gcaagaaaag ccgatcaaac 6060 cggcattgac acggccatac actggccctc acaaggtcat ctctcgcaac atggagaacc 6120 agaccttcaa catagacata aatggaacgc agaagactgt atctttacaa cgcctaaaac 6180 ctgcgttcat tcttcaagag gatcctgacg gcgctaagga aaacgctaga cctgaaccga 6240 tcaagattcc caaatccaca aaggacaaca gtgctcaacg agcaccaaag acatccatac 6300 gcacgcagct tccagcagac gacattcaac ctgcaccccc ttctcctcct atcgttgatc 6360 ctccggatca accgcttccg gttcctcaac cacctgcacc caaacgcgtt agatttgttc 6420 cggatacctc aaaatgtagc aagaatcaac ctagtgaacc tgttccgtct gcggaaccag 6480 ggccaaaacg cacaaagttc gtcccaaaca tattgaaacg aaaaacgaac gcgaattcaa 6540 tccagtcacg taaacacaca ccgagaaaac tcaactcctt tgcggggggg ggggggcgca 6600 gtgcctaacc aacgttacat ctcacataga atagttagta atcatacaat ttttctttta 6660 agtatggaca agttaggaga gttgagagct cagctcaaca gattcagaaa caactcgagc 6720 gacgagagag ttcgtgcaga ctgcttcaac gcggatttgg agctggagtc ttttgaaaac 6780 tttctacgac gaaagttaga acatcgagac gttcgcctgc gcatggcgag acatcactcg 6840 ctagcggatc accccgtcat gtacgaacct cccattcctc agctcttctc aaacaatgtt 6900 ccgatcacac agttgttctt gcgtaattta aattaccgct taaacaagaa taagttgcgt 6960 gaagttttat cagctcaggg tttcgaggac gtagaaatca cgtccctcaa aaacaccaag 7020 tcgggctctg tagctacttt agtttgtaag tctctggcat catcctgcaa gattcaagta 7080 aaatcatgca aaaaagagct aattttcaca gaccccaaag ggaaagtttc atacattagg 7140 atcagacctg atcaattttc cagtcaaaac gcactgctac ccacgcaagc gtcttctaaa 7200 aaaatatcag tattgccgca ggctctttaa gtctgttgga ctgccctcaa gcagtaaatc 7260 acttgttaga atacgtctcg gcaaacgata tttacaactt ctacatagct tccttcgaga 7320 atccaaaatt tctcgctgtt gactttacct ttgactccgc agcagacagg tacaacagcg 7380 agaacggatt aaatcgcatg ttggatgatg ccagaaaatt gagcttcaga agtattttgt 7440 tgaacgcaca tagaatcact atagaaactc cttttgttaa caaattgttg aacatttgcg 7500 taaatcgtcg gcttgatcca gacagacgcc tttggctacg ccaagtcgat ctatcagcgg 7560 tcgcagcttc agaacaaacg attagaacgc tgtctgaatc atgcacgatt atcaacctaa 7620 ctttaggaag caccccaggc ggttgcgata gatccttaga cataaataag ttaaatactt 7680 tgacactagt taacaacgcg caattatcgt taaaattctt gaccaaagac tgtttctcgt 7740 tgattacttc gctaaaaatc attggttgca atgaagtgcc aatggacaag ctgttcaact 7800 tcgttaccat taacactaac ttaaaaacgc tagtatttaa ggacagtcag agcagaaaca 7860 aaaactcagt tgacctgctc atagcagtat tttctagtag gaattgtata gaggaattcc 7920 aattcgatta ttcaagcata agagaattcc gcactcaact aattcccaac atggttctga 7980 atcaaactac ctcactaaga gctttaaacc tagaaaataa tgattttttt aagaattgga 8040 ttcctgatcc aattattgat aacgcgtcca atctaaccga gttgaacatc ctcggattag 8100 aattcgagta ttcgcttccg ttacataggt taccattttt agcttctctt aaacttaact 8160 acaatcaggc tatcgaaact acgttagcca acctagtcaa taaaagcaaa cttgatgtat 8220 tagtcaataa cacacgaggt aaaaagagaa cccgtgcaat caatctagct tcattaagag 8280 acgtaacctt tggttgtgct aaattaagtt taaaattcat taactgtaag ttcgtcgctt 8340 tagcatcaac gcaaagagct ttggaatcag atggagttat tgtacaacgc tcagaaaact 8400 caattattat ttgtaagcgc taagcgatgc taatgtacta tcaagaaatc tctgtattca 8460 tmaaatgtaa attactctct agccgctgac agaataaaaa aaaaaaaaaa taaagctact 8520 catttttttt tccttttcgg ggggagtac 8549 // ID Copia-37_AA-LTR repbase; DNA; INV; 279 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-37_AA_; KW Copia-37_AA-I; Copia-37_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-279 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 961-961 (2011). XX DR [2] (Consensus) XX SQ Sequence 279 BP; 67 A; 71 C; 61 G; 80 T; 0 other; tgttgtggtt ttacaatgga ttgcccctga tgccgaccct gttaagtgtt cacccctggt 60 ggtcatcgtc atcgaccgac caacggcgcg ccaaaagtgc gcgcgaacga agagagaaga 120 aagcgaagaa aacatttttt tcattccacc attgttaacc gaccgaagta gaagtaaatt 180 acaagttcag ttttgtttcg taaagtgtcc gtgtgttaat tcatttcttc ccaattccct 240 acgtgatttc cactgcgcgt gtgtcctctg ccgccctca 279 // ID Copia15-NVi_LTR repbase; DNA; INV; 381 BP. XX AC AAZX01023481; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia15-NV; KW Copia15-NVi_I; Copia15-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-381 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1158-1158 (2007). XX DR Genome; AAZX01023481; Positions 671 291. XX SQ Sequence 381 BP; 99 A; 66 C; 93 G; 123 T; 0 other; tgaaagcgta ttttccgtat tttctagtgt gtaagtacga atgggatatt tatgtatcta 60 ggtgagtata ggtccaacgc tttcttatcc ccactcgcgt aactctcttt tcagcacgta 120 agtgcgtatg tatgtatcaa aggtccgtga gtctgtgtgt gtatgacgcg tatataagcg 180 gcgtaaatat cggtccgtcc cgcaatattg tgtcgcatct gctgtgaaca ggtctttagg 240 ttgcgctaag agtggtggat aatttatacc gtgtattgta ccgagtatag aagactcgag 300 gtgaaggtgt gagtggttaa ataaacggaa gcaattatca acgaacttga gtcatttatt 360 tgagcctatt ctacctaaac a 381 // ID L1-1b_Cis repbase; DNA; INV; 6231 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1b_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6231 RA Smit A.F.; RT "L1-1b_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000973, Ci000161 1-2% div. XX SQ Sequence 6231 BP; 2541 A; 1117 C; 988 G; 1583 T; 2 other; tattctattt gaataaggca agattcacct tgcccaaaag caaatagaac ccgcggatta 60 tcccatttgg attatttctg cgtttttcaa ctgtggatat atccccctct ctcgccagca 120 atacctgcaa acgaaaggta aaatatttag ttgccttgtt tttttgtatt tgtaggtata 180 aagccccgat ttgtttcaat ttttcagtta ccagcctcta agcctttttt tggcttgagg 240 cgaatagctt cctttggttc aatcctcgta tagttctcct tttttcttgt tgtgtaagat 300 agcatctctg tgccacgtag ctccaaacgt gggcaacact ggtgctatct taactagtgt 360 gctctaacca cttcctttac aaaagcaatt actagtgttt ccgcgctcga ctttagcgca 420 tttcctggaa acactgataa ttgtttttgc gtgagaagtg taaattactt cgttttttcg 480 ctcttatact gctctttaga gcggatttgc tctgagttac aanaatagtg ttatctttaa 540 gtcaagttaa cacgacaagt aagtgactta atacgtgagt ttccttacac caaaacacac 600 tttacacaaa aaacatacaa ttaaaaccac tcacctttaa acaacacgat gaatgacggc 660 ataccggaaa tcgaagcgaa attggagaga tcgattgtat ttcaacttac gggtacggtc 720 gaacatatga gtgttagctc atttatggaa ctatttgcgg atggagcgcc gatggcggat 780 atgtcgcaga tcgtagaagg agtaatcatg gacaatttca aagattgcac cttccttgtt 840 accctaaagg aacaaagcgg agaagtagta attccggaaa aacgggagat aattcactac 900 ttcaacacaa acgacatcaa tttcactacc gataagggat cgacaatgtc cctaaaagct 960 gagcttcccc aaggcgaatc tgaagtcgtc tctctacacc cggtcactgc agacacctgc 1020 aaggagaaat tagaagcaat gataaataac caaaactggg gcaaaatcaa aaacataaat 1080 tacggaacac accgtaattt caacaaaata aaaaatggat gggtaaacat aacccttact 1140 gaaacaaaca ttaaaaacat tcctcccttg ataaaaatcg gaggaagaac catcaccgtg 1200 accaggcctg gagaggagca catggctctc tgcaggtact gtaagcaaag aggacacacg 1260 cagaacaaat gtcccaaaaa ggggttctgc gtggaatgca aagcacatgg tcatacctcc 1320 agaaactgca gaacctcata ccaggaaccg cgcccaagaa cagtgttcca ctctacctgt 1380 aacacagcgc agttagtgag gactccaaat aggaatgcca atcaaagcca gtggcaaact 1440 gctaaaaagt tgcctgaaaa acaaatgcat aatacgagaa atgcacaaat aaaactcacc 1500 aacagattcg gcattcttca ggaatcgcag cactcggaga tcgaagatga cctggaatgt 1560 ttaaggaagt tagttgggtc aatggatgaa agtatcttca tgcctgaaga ctttccagat 1620 attaccacgg ccaacagcac accaaaaaat gacaacacaa acttaccaaa acgaagaaga 1680 cgacgcacca aaaagagggg atcaacatca aacccctcaa gccaagaaaa aatggaaaaa 1740 aagattgaat acaataatcc caatcaatta accagcaacc aaatcacccg ggaaatcaag 1800 caaccgataa cactaaactt aagcgacagc tccactgaca acgacaccag cacaataaac 1860 gaaataacgg aagccgccac ccttggacca cgattaccat ctgctgtgga atcagtatat 1920 agaacgcccg aaaacacagc accctttcca ccaataccga atcggacatt aacggacatt 1980 accaaatttt acaccgctga aggattacaa tctcccaaga gaccaagatc cggcacataa 2040 acagaactcg cgtaaaacgc aaaaaactta tacgaacaac caaaatacac ataaaataat 2100 aatgattcca tgtactatat gatctaatgg agcctgacaa agtattaggc aacaaattaa 2160 aaattggatc actgaacaca aatggattac tgaacaaaat taaaaagata atatactata 2220 tggaatcaaa taaaatcgac atattgctaa tacaggaaac tcacgtgttc acccaagata 2280 tgatctccaa atttaaagcg cagtctaaca tcgaaatctt cgttaatgcg cccgaacacc 2340 catttagatc attccggcaa ggaaccgcaa tccttgttca aaagcacctc ctgccaatgt 2400 acaaaattca ccataatata atatttgaaa accgtgtaca aaaattaaat cttcaatata 2460 aacacaatga aatcaacctt tataatttat atctagaagc aggtcaatcg cataaaaatc 2520 ttattgccag agaacaaatg atatatgacc ttaaagataa attaggggac gtaaatgaaa 2580 caatcgacct gctgattggc gattttaata tggtttctaa tgaaatagac gtgaaagcaa 2640 actatgataa aagaaaaaaa cgagatagaa tcgccttaca gcggctacaa aatggaaata 2700 actttcagga cgcattccga ttattacata aacaaaatat tgaatttact aggataacaa 2760 aaacgagtgc tacaagaata gacagaatat atgttaatag actagcgaaa aataaaacat 2820 gggcgctaaa tcatatacgt aattactttt cggaccataa caattgcccg gtaataacgc 2880 taaatataaa tagtaatcgt aaatggggtc tatcgtttta caaaataaat aactctatat 2940 tacaacacaa cgacataatt gaaaacttga atacaatgtg gattaaatgg caaagacaaa 3000 aaacaaaata catgaactct gctacatggt gggaaaacgg taaaaaacta attgcaaatg 3060 aagtacggta tttttctcaa aacataaatt atgcggaacg gaggcgatat ttaaccagag 3120 tgctggaaat taaagaacta gaaaagcaat accagtccga aaatatagta acacaaataa 3180 cacgattgaa agaaaatgta aataagtacg aacgcaaaat taatgaaggt gcgataatta 3240 gatcaaaaat aaatgtaata gaagatgaag aaaaaccaac aaaagaattt ttcaaatacg 3300 aagaaacaaa agctaataga gatactattt acaccatata taatcaaagt ggagaaataa 3360 cggaaaacca aaaccaaact ctgaacgaga cgcatagctt ttaccaagat ctatggacaa 3420 atgctgaaat aaatncagaa tatattgata attacctgct tttcttagaa cccatagaat 3480 atgaccagtt ggaacttaaa aaaatgacgc aaccaatcaa ccattcggaa atttacgact 3540 gcatattaga aatgaaggat aacagcacgc caggatgtga cggtctaaca gtaaaaatat 3600 ataaacaact atgggaattt ataaaatatg acatggaaga attgtacaat aatatatata 3660 taaatggaat aatgccagaa accatgcgca ccgcggtagt aaaattatta tataagaaag 3720 gtgataaaaa ggacattaaa aattggagac ccatttcact gctaaacaca gactataaaa 3780 ttcttagcaa gattatagca aaacggttaa atataataat caataaaata attagtccaa 3840 atcaaaaatg ctccatacct ggaagatcaa ttaataatag tttagaaaat ataaacgcat 3900 gcatagaggc ggctaaatat tttaacaaaa ctttaacaat tctagcaatt gacttcgaaa 3960 aagcatttga ccgcgtaaat tacacctact tatttaaagt acttacgaaa ttaaacatac 4020 ctatttatgt aatcaaatgg ataaagataa tatataataa aatacaaagt aaagttgaaa 4080 taaacggagc cttcacggat aacataaata taacaagggg aatacgacaa ggttgccctt 4140 gtagcatgat tcttttccta attggcgtag aagttctaac tcgaaaaatt aataacaata 4200 aaaatataaa cggatttaaa ctaaatcata tagagttaaa gacagaacag tacgcagacg 4260 atttatctat actaatatca gacaacatgt cactgaagga aaccattaat gaaataaaaa 4320 tatttgaaaa agcatcaggc caaaaaatga atacaagtaa aacccaaata ataactaatg 4380 atactttaat aaataacgta atatacgaac attttccacg tgaatgcata aaggaaaaaa 4440 taaaaatact aggcgtttat tttagtctta acggagattg catgacagaa aacaccgcaa 4500 aagctcgacg cgtaataaat gctatatatt ggaaaaatct taagagaaaa ctaacattaa 4560 aaggcagaat tattataata aactcgctct ttatgccaca attgttaacg atcggaagaa 4620 atatgatact gcctaaacaa tttattaatg aaattaataa ctatatatat aaatttatat 4680 ggttcccaca aaaaattgac agaatagcac ggaaaaaact aatagcccag cctaatgatg 4740 gaggattgtg tgccccagat ataacactaa aattaaaagc tgtaagagct actcgattat 4800 atgaaattag tatattaaaa aaagttgaaa cgatctcaca agaatggaca cgtttcaatt 4860 tagcgtcaac tatgaaatta ataaacgaaa agttatatac taactcggcc ttaaatgcaa 4920 ccgaaccaaa cgacttttac aaggaaattc gacaaacgat ttataaatta cgtccgaaag 4980 atttccaatg ggaatcgaat aaattaaaac ccatttacct agaactatta aaagataagg 5040 cgcaaccaac tataatacgt gaaaataatg agataataaa atggtcacaa attacgctaa 5100 acgataaact aactaaacat cactttaata acgtagaacg agataggaat tacaaaatag 5160 cgcacaacgc atatcatttc ggagattggt atagaaataa aattggaacg caatatcaaa 5220 tgggtaaatt attaattaga aattgcaaat tttgcggaag tagatcagat aatataaagc 5280 atattcttac ggaatgtcag ctaacaaaaa aagtaattaa ggaaatagaa gtactaacca 5340 acaatgcgtg caaacaaaaa acgcaaataa caaaatccat aatattatat aatcaaacga 5400 gtaataatgc aactccgaat ttacttgtaa tcaaagcaat caatattttt aaggctgaaa 5460 taatcaggaa aaaacatcaa ctcgattttg ggaataaata catagatagc aatgatgaat 5520 ttaccacacg attactttgg ataataaata cgaaaatcaa aaatagttta ttacgtgaaa 5580 gtacattgag aggaaaattc gaaacttatg aattatacga gctaaacgat acttacgtaa 5640 tttgaaccaa ttggatacct tgtcaaaata tatattaaag caaaatcgtc acttacgtat 5700 tttactttat gttatgcaaa tgtgcctatt ataatgaact acataataag aaacaaggca 5760 actaaatatt ttacctttac attagataaa taattggaaa atattgaaaa acaaaataac 5820 tcacgaaaag aaaagctttg caggaattca ttgctgacga aacctccgat ccaaatataa 5880 agacctttct ccaaacaaat ttttcatcca tgtcgacctg taatgcagta atagtattca 5940 ttatttttgt aaatatctca aaaatctgta aaaaaaacaa acttgtgacc cgattaaatc 6000 tatttaatta taattatcat gcatattaaa tatgtaaata tgtaaatatg taactatgta 6060 actatgtaaa tatgtaatta tgtaaattta taaatgactg tataaccgtg tttttttact 6120 gatccccgct tattttgttg ttgtatattc gcaatattat gtacggccgg aattccggtt 6180 taacgtgtat tagtcgccca ataaaaaaac gtgcgtcaaa aaaaaaaaaa a 6231 // ID R1_DMo repbase; DNA; INV; 5931 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE mojavensis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DMo. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-5931 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 319..1836 FT /product="R1_DMo_1p" FT /translation="RPGMDSEASSGSSTMSKKKKRGRGAKSHLRHAGDAAA FT MPPPTDSPPRKLRVHVDSSADESDASLVATAATVASAASAAPTAAPAANAA FT ANTAAPAAIPAMEAILAAAAAAAAPAAAAAPAAAAPAAAATHAARNPAPLE FT APSTSAAAAAASAHSVSSVPAGAGTASSNSMVQRMLAIERELRRAVVADNV FT PGAVALSVLDNAAKFQELILEMYGRMKELETIVRTRPQATAPQTSAAPAAS FT YAAVTATAVAAPVALPRARKIAETWSAIVTSNNPEETPQQVAERVRKEVAP FT ALGVRVHEVRELKRGGAVIRTPSAGEIRRVLANPKFKEVGLDVKENAAPRP FT RINVINVDSSISPNKFMEGLYKNHFFGHCSNAAFEKAVKIASKPWSSEDGP FT TVNIILEMERKALDILEDSERIYVEWFSFRWHPLTPTYACFRCFSFDHKVA FT MCRMKEQTCKRCGQSGHRVSGCRNPVHCRNCSFKGLPAGHHMLSETCPIYG FT RVLARVAAKH" FT CDS 1833..4943 FT /product="R1_DMo_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TLTMFSIIQANCGRSRAAVVDLGVRMRNSGAMFALLQ FT EPYVDRGGRITGLPAGMRVFSDRRNKATIVVDDQEVVCMPVSSLITEFGVC FT VSVSGNFGSIFLTSVYCQFNAELEPYLLYMDAVLLLASRTPVIYGLDANAV FT SPLWFSKLPERSRGYLNRQRGELLADWVQGSRAGVLNVRSRVYTFDNRRAR FT SDIDVTIVSDSASTWAAYDWSVSEWDLSDHNIITVVVTLDPESTVESFAPV FT PSWQLQNADWRRFGDELRTASMDIPLEDFRLLSSDAQVTALRSLVHQVSDT FT LFGRRQPRARRRVGWWNAALTDARRTLRRARRRLQHARRTQSESASALASY FT FRITRKEYERMMLKAKEEDWRRYVGEHQDDPWGSVYKICHGRKTRTDFGCL FT RWNNEQYVTWHDCANVLLRNFFPAAERPVDIVVPREVPPALETFEVEMCIA FT RVRSRRSPGLDGITGGMVKAAWRAIPEHMTALYSRCLADGYFPLEWKRPRV FT VALLKGLDKDRSDPASYRGICLLPVFGKVLEGIMVNRVKEMLTDESRWQFG FT FRPGRCVEDAWRHVLSSVEASSARYVLGVFIDFKGAFDHVEWDAALRRLSD FT LGCREIGIWRSFFSDRKASIVSSFGEANVNVSRGCPQGSISGPFIWNILMD FT VLLRRLEAHCTFSAYADDLLLLVEGNSRSQLELTGAQLMEIVGGWGIEVGV FT SVSATKTVTMLLKGKLSAGRNPAVRFAGANLRYVTQYRYLGITAGERLSFL FT PHIASLRDRLAGVVGALTRVLRVDWGLSPRARRRIYAGLMVPCALFGASVW FT YTVVMRLVGARRSLKSCHRIILIGCLPTCRTVSTDALEVLAGAPPLDLVAT FT RNAMQFKLKRSYPMVEGDWLYDQDVSTLDRTMRRALLDERLLREWQIRWDD FT SEHGRVTHRFIPDVSFVYSRPDFSFTMPTSFLITGHGSLNAFLHARGLSET FT AGCLCGHPLEDWLHVLCACPLYADVRDLQGLRIQQSESGDWTMERTLMEAE FT SMQLLEDYARTVFSRRRMLMDGMGPGRPVPD" XX SQ Sequence 5931 BP; 1263 A; 1551 C; 1772 G; 1345 T; 0 other; cagtttgctt ttgattgtcg ccgtgaacgg acgtgttcag tttcgtcgcg catcattgtg 60 taaatttgag ttttattgcc ggcaataaaa tacgttgaac gcgcgttaaa attgcttatt 120 tcgagcatct cgcgataagc aatctctcgc cgcgtttgta ttagtgcgtg tgtgagcgtt 180 ttaccgctac gtgttgctga ttggctagcc aatcagaagc gacgaactgt tgcgcttcaa 240 cagaaagtgt tgcatacttt cgggaacaat ttaaaatcac tgcatacttt gtggtatcag 300 tgattttgtg ttgcctagcg tccagggatg gactctgagg cgagtagcgg gagttcgacc 360 atgtcgaaga agaagaagcg tggccgcgga gcgaagagcc atcttcgtca cgcgggcgac 420 gccgcagcca tgccgccgcc aacggactcg ccgccgcgaa agttgcgagt ccacgtggac 480 agctctgcgg atgagagcga tgcgtcgctg gttgccactg cagccaccgt tgcctctgct 540 gcttctgctg cccccaccgc cgcccccgct gccaacgccg ctgccaacac tgctgccccc 600 gctgccatcc ctgccatgga ggctatttta gctgctgccg cagctgctgc tgcccccgcc 660 gctgctgccg cccccgccgc tgctgccccc gccgctgctg ctacccatgc tgcccggaat 720 ccggccccac ttgaggcgcc gtcgacgtcc gctgccgctg ctgctgctag cgcccactct 780 gtgagctcgg tgccggctgg agctggtacc gcgagctcaa actcgatggt acaacgtatg 840 ttggccatcg aaagagagct ccggagagcc gttgttgccg acaacgtgcc tggcgctgtt 900 gcgctcagcg tgctggacaa cgctgccaag ttccaggagt taatcctgga gatgtatggt 960 aggatgaagg agctggagac cattgtcagg actcgccctc aggccactgc tccccaaaca 1020 agtgctgccc cagccgcttc ctacgccgct gttactgcaa ctgcagtcgc tgctccggtc 1080 gctcttcctc gggcccgcaa aattgcggaa acgtggtcag cgatcgtgac gtccaacaac 1140 ccggaggaga cgccccagca agtcgctgag cgtgtccgaa aggaggttgc gcccgctctc 1200 ggtgttcgtg ttcacgaggt gcgtgagctg aagcgtggcg gtgcggtcat tcgcacgcca 1260 tctgctggtg agatacggag ggttcttgcg aacccgaagt tcaaggaagt cggtctcgat 1320 gtcaaagaga atgcagcacc caggcccagg attaacgtga tcaacgtgga cagcagcatc 1380 tcgccgaaca aatttatgga agggctatac aagaatcact tcttcggaca ctgctccaat 1440 gcagcgttcg agaaggcggt caagatagcc tcgaagccct ggagcagtga agatggaccc 1500 acggtcaaca tcatcctgga aatggaacgc aaggcgctgg acatcttgga ggattcggag 1560 agaatctacg tggagtggtt ctctttccga tggcatccac tgaccccgac atatgcctgc 1620 tttagatgtt tcagctttga ccataaagta gccatgtgtc ggatgaagga gcaaacttgt 1680 aagcgatgcg gacaatcggg gcatcgtgtc tcgggttgca ggaatcctgt gcattgcagg 1740 aactgtagct ttaagggttt acccgcgggg catcatatgc tctcagagac ctgcccgatt 1800 tacggacgtg tgctcgccag ggtggctgct aaacattaac aatgtttagc atcattcagg 1860 ctaactgcgg tcgcagcaga gctgccgttg tcgacctagg ggtccggatg aggaactctg 1920 gggcgatgtt cgcattgctg caagaaccat atgttgaccg tggaggaagg atcaccggat 1980 tgccggcagg catgcgagtt ttctccgacc gtcgcaacaa agctacaatt gtcgtagacg 2040 accaggaggt cgtctgcatg cctgtctcgt cactcatcac ggagtttggc gtatgcgtga 2100 gtgtgtcggg taacttcggc tcgattttcc tcacctccgt atactgccaa tttaacgcag 2160 aactggagcc gtacctgctg tacatggatg cggtgctgct gctagccagc cgcacgcctg 2220 tcatctatgg ccttgacgcg aacgcagtat cccccctgtg gttcagtaag ctgcccgagc 2280 gctctcgggg ctacttgaac aggcagcggg gtgaactgct agctgactgg gttcagggca 2340 gtcgagccgg cgtgctgaat gtccgcagca gagtgtacac gttcgataat cgcagggcga 2400 ggagcgatat tgacgtgact atcgtcagtg attcagcgtc tacgtgggcc gcgtatgact 2460 ggagtgtaag cgagtgggat ttaagcgacc acaacatcat cactgttgtg gtgacgcttg 2520 atccggaaag cacagttgag agctttgctc ctgtgccctc gtggcaactc cagaatgctg 2580 actggcggcg ttttggtgat gagttgagga ctgcatcgat ggatatcccc cttgaggatt 2640 ttcgcctttt gtcatcggat gcacaggtga ccgcactccg ctccctggtt catcaggtga 2700 gcgacacttt gtttgggcgt cgacaaccgc gagctagacg ccgtgtgggc tggtggaatg 2760 ccgccctcac ggacgcacgc cgtacgctca ggagagcacg gcgaaggctc cagcatgcgc 2820 gccgcacgca aagtgagagt gccagtgccc ttgcctcgta tttcaggatc acccgaaagg 2880 aatacgagag gatgatgctc aaggcgaaag aggaggattg gaggagatat gtcggcgagc 2940 atcaggatga cccatgggga tctgtttaca agatctgcca cggccgcaag acgcgcaccg 3000 atttcggttg ccttcgctgg aacaatgagc agtacgtaac ctggcacgac tgtgcgaatg 3060 tcctgctccg caactttttt ccagctgcgg agaggccagt ggacattgtc gttcctcgcg 3120 aagtaccccc agccctcgaa actttcgagg tggagatgtg catcgccaga gttcgcagca 3180 ggcgctcacc tggcttggat ggcatcactg ggggtatggt caaagcggcc tggcgcgcca 3240 tcccggagca catgacagcg ttgtattccc gctgcctggc agatggatat ttcccacttg 3300 agtggaagcg cccacgtgtg gttgcgctcc tcaaaggcct cgacaaggac aggagtgatc 3360 cagcgtccta tcgaggcatc tgcctgctgc ctgtctttgg caaagtgctg gaagggatca 3420 tggtgaaccg tgtgaaggag atgctcacgg acgagagtcg atggcaattc ggctttcgcc 3480 ccggacgctg tgtggaggat gcgtggaggc atgttctgag cagtgttgag gccagctcgg 3540 cccgatatgt gctcggagtc ttcatcgact tcaaaggagc attcgaccat gtcgaatggg 3600 atgcagcgtt acgccgacta tccgatctag gatgcaggga aattgggatc tggcgcagct 3660 tcttttcgga ccgaaaagct agcatcgtca gcagctttgg cgaagctaat gtgaatgttt 3720 cacgtggctg cccgcagggg tccatcagtg gtccattcat atggaacatt ttgatggatg 3780 tgctcctgcg ccgccttgaa gcgcattgca ctttcagtgc atacgctgac gacttgctgc 3840 tgctcgtcga aggaaattcc agatcccaac tggagctcac aggcgcccag ctaatggaga 3900 tagttggagg atggggcatt gaggtaggcg tctcagtctc ggcaacgaag actgtgacga 3960 tgctccttaa gggaaagctg tcagctggca ggaatcccgc cgtcagattt gcaggagcaa 4020 atctacggta tgtgacgcaa tatcgttacc ttggcatcac ggctggcgag cggttgagtt 4080 tcctcccgca tatcgcatcc ttacgcgatc ggctggccgg agtcgtcgga gccctgaccc 4140 gtgtgctacg cgttgactgg ggactcagtc cgcgcgcacg gcgaaggata tatgccggac 4200 tcatggtgcc ttgtgcatta tttggtgctt cggtgtggta cacggtggtg atgagactag 4260 tcggcgccag gaggtcgctc aagtcctgcc atcgcatcat cctgattgga tgtttgccta 4320 cgtgtcgaac ggtatctact gatgcccttg aagtgctggc tggagccccg ccgctggacc 4380 ttgttgcgac gcgcaacgct atgcagttca agctgaagag gagctacccg atggtagagg 4440 gcgattggct ctacgatcaa gacgtttcga cccttgatcg taccatgagg agagccttgc 4500 tggacgaacg cttgttgcgt gaatggcaga tccggtggga tgacagcgaa cacggccgtg 4560 ttacgcatag gttcatcccg gatgttagct tcgtgtacag ccgccccgac ttcagtttca 4620 cgatgccgac cagtttcctc ataactggtc atggctcgtt gaatgctttt ctgcatgcgc 4680 gcggtcttag cgagactgct ggatgcctat gtggccatcc cctggaggac tggctacacg 4740 tattgtgtgc ctgccccctc tatgcggatg tgcgggatct gcaaggactt aggattcagc 4800 agtcggaatc cggtgactgg accatggaga ggaccttgat ggaggcggag agcatgcaac 4860 tgcttgaaga ctatgctcgc acagtcttca gcaggcgacg catgctgatg gatggcatgg 4920 gacctggcag gccagtgccg gactgacgtt agtcaaaact ggtagtagaa ttatctgcga 4980 gggtcggacc aaagtcctta aatttgggag acgaagtcct tcgggatcag tcgaaaggcc 5040 gaccagagcc tcaatctggc agactctctt ctcggagaga gtcgaagggc tgaccagagc 5100 cttaatctgg aagacaaatc caatctggag gcgtcgaagg gctgaccaga gccttaatct 5160 ggaagacaaa tccaattctg gacgcgtcga agggctgacc agagccttaa tctggaagac 5220 aaatccaatt tggaggcgtc gaagggctga ccagagcctc aatctggaag acaaatccaa 5280 ttctggaggc gtcgaagggc tgaccagagc ctcaatctgg aagacaaatc caatttggag 5340 gcgtcgaagg gctgaccaga gccttaatct ggaagacaaa tccaattttg gaggcgtcga 5400 agggcggacc agagccttaa tctggtaggt tatctcttcg gaggtgccgc cggtgttaac 5460 ccagtttcca ggttgtcgga aaccaaaatg ggcagtagtc gcggactact ttcagccgat 5520 caaagctgca aattgtaaca gatttgaaat ttgaaccacg ggatgggcta gtgtccgagg 5580 ctagcccagt gaggttggcc ccctttagtg ggagtatcgt ggtggctgtg gtgttacccg 5640 ctgtgtctcc actatagttg gagcacagct gctgggtcca atgtgtattt ggaccacagc 5700 cggaagctgt acccgtagtc agcagaggtt atgataggcc tcgcctcgac ccaagaggga 5760 attgtgtccg acaacacaat tcgagttggt agctaaaagc gttcttttag gggcgttggt 5820 ggacgcattg ttttcacccg ctgtactatg tgtcacatat gtggcacatg cgagtaccgt 5880 ggttgtaaac ccccatcggg gtacacgtca cgttaaacaa atcaaggatt c 5931 // ID Sola2-4_NVi repbase; DNA; INV; 4860 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Sola2 DNA transposons from Nasonia vitripennis. XX KW Sola; DNA transposon; Transposable Element; Sola2; Sola2-4_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4860 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-93 (2009). XX DR [1] (Consensus) XX CC The consensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS 3360..4334 FT /product="Sola2-4_NVi_1p" FT /translation="LKLTKMVKLSMFQKDCYYVILKYCIELLKKNVHNFLS FT ALVNSPNYVQNGVFWPVDDFMSELSLSLKKLIPYHFIAKNQSKYIKDQKNN FT LQENEGLLEMDYSENFAYVVQDAAQQFHFNNDQCTINVAVLYYREDNETRH FT CSFIALSECTTHDTAAVYILQEKIISEIKKRFPKMNKIIYVTDGAKQHYKN FT RYQMMNLVNHESDFQIMADWHFHATAHGKGSCDGVGAVLKREATRTSLQAK FT ATEAILNSKQLYDWARQKFQTITFFHYSKKDYEKTSRSLKKRFSAAPSVTK FT ISEGHAFLVSPDKKITVFRYSNAPSPLSIVQY*" XX SQ Sequence 4860 BP; 1798 A; 657 C; 661 G; 1732 T; 12 other; ttttcacgca gttgagaggt ttatcctgtg atcgacggga ggaaaattat tgacagagas 60 tatttttatg tattttgtgc tgctgaacac gaaaccggcc gtctttttta ccgatacctc 120 aagcgctcgt ccctaacctc aaaaaaccac ccaaaatgtc tgttttacgc ggttaagaaa 180 tttacgtcta gaggaaaatt attgacgaag gctattttta tgaattttga gccactgaat 240 ttgaacacga agcttatttc ttttggtttc tattagttaa gtacattcga ttaaaaccaa 300 atgcctttaa aacctaaata attagatgga cgtaccttta ctttatacct gttaaaaaca 360 tttttggtcg tgtatactga ttcatggatt caaagaaata aatagatgaa atggactgat 420 tccttttgga aaatgattga aatacctgtc attttagaaa atttattgat ttaattaact 480 cgattacgtc aattccatat ggcgttcata atgaccaaat attactaaac ttcataagta 540 atcagaacaa agcccgcgtt gtaattgaat ttattctgaa aagccttagt tgggatattc 600 taaaaattac caaaataaaa aagaaattta gctgtttagc tataaaatta tactaaaaat 660 agtagagaac cgtaaagaag catgctttac aagattttac agatttctac ccttttatat 720 tttatatttt tttttacgta tttcacattt tgtgcaatca aacatctttt ctaatttttt 780 atttttctgc agtttataac atttttaaag tttaaaattt ttaaaattta tcaaatttag 840 aaatgttatt aactgcaaaa caatgtaaaa atataataat aaaaataaat ataaaaaaaa 900 tgtaaaaagt gtatgattct gtaaaaaact gtaaagcata gtctaaatac aaacaatctt 960 ttatagtgat taactatttt tattagaatt ttataacaat ttacagtgta caactttata 1020 acattaaata aacgttttat caaattaata aaatccaaaa agtgtggaca ttcgatcgtt 1080 tatttatcgt tttttccaat atttcttaaa ctttcattcc agcgtttcaa agtgtggtat 1140 taaaagtttt taaaatgttt ttagaaccgt ttcatgttag ggctacatta aaacattcac 1200 taaacatatt tattatttat tttatcatta ttattgccaa taagatatta tgtttgcagt 1260 atacttaata ctatcaaaat ttggtaatat tttgatactt gctaatgaga atttttattg 1320 taaaaaatat caaaatattt aattatattt taattttaaa tattatgaca tatagtaaag 1380 ttagtttact ttacttaatt tttttttcaa gagataaata tcattatcat ttttgttaaa 1440 aaaacgaata caagttaaaa gataatttca tagcacgtgc acgttcggtc tygcttcgct 1500 tgcggtaaac atatttttct aatcagaatt tttgaaatat aagttacaaa accttctcat 1560 cgaaacaact tcatyaacca acaaagattc taaactttaa aaattattct attgttaaag 1620 tttataaaat ttatcaaatt tagaaataac tgcaaaaaat ataataataa aaaaaaataa 1680 aaaaaaaatg taaaaagtgt atattctgta aaaaactgta aagcatagtc taaatacaaa 1740 caatctttta tagtgattaa ctatttttat tagaatttta ttacaattta tagtgtacaa 1800 ctttataaca ttaartawac gttttatcaa attaataaaa tccaaaaagt gtggacattc 1860 gatcgttwwt wwatcrtaat atatttgtcc ataatttaat aaatcatcgt tatttccaat 1920 atttcttaaa ctttcattcc agcgtttcaa agtgtggtat taaaagtttt taaaatgttt 1980 ttagaaccgt ttcatgttag ggctacatta aaacattcac taaacatatt tattatttat 2040 tttatcatta ttattgccaa taagatatta tgtttgcagt atacttaata ctatcaaaat 2100 ttggtaatat tttgatactt gctaatgaga atttttattg taaaaaatat caaaatattt 2160 aattatattt taattttaaa tattatgaca tatagtaaag ttagtttact tnacttaatt 2220 ttttttcaag agataaatag cattatcatt tttgttaaaa aaacgaatac aagttaaaag 2280 ataatttcat agcacctgca cgttcggtct cgcttcgctt gcggtaaaca tatttttcta 2340 atcagaattt ttataataaa aaatataaaa atatttctaa attatgttac ctattgtagg 2400 aattgcttaa ctattatatt taaaatatgt ttcgcataaa acgcgtttta tcaaactatg 2460 ataatatttt gatacttcgt tgctatgtat gtttatttgg aaatataagt tacaaaacct 2520 tctcatcgaa acaacttcat ctgctgtcca atgcaaccaa caaagattct aaactttaaa 2580 aattattcta aaattattct aagtttataa ttttttaatg atattaattg agttgtactt 2640 gctatcactg tgtaaacgtt gcagacgcta ctcttgtctt tgatatattt tctgacggta 2700 aagttttagt aaaaattgta tcacacgttg ttttacctag aattctttta aaatatttaa 2760 atcaaagttt agattattat tctcatttat taaattttga tttcaagtat atttaaaata 2820 atattatttt gtatactgtt gtagtccaga atctttctct atagtttagt tttgtttcaa 2880 attctacaaa atttattagc tgttactttc cacatggtat ttttaatgac agaaaaatta 2940 gtttgaagca gacacaatta acttttaagc tgatatagct tttgtatact tgtccatgat 3000 taattgaagc atctaaaata tttttaatgt ctgagtctac tgcaaagcat tcttttccaa 3060 cagttcatgt tgacagatgt attaatccat ttaatcttct ttacctccaa atgatccatt 3120 gatagtttct attttgacaa tagtacctga atgctggaca ttacgcgata ttgaatcaga 3180 gtttaatgtc tctttaagaa ttgcaaaaaa ggcgagagat ttgaaagatg aattcggtat 3240 cttagcgatt cctgatccga aaaaagcgaa aagtttacct gaaattacaa ttaaagaggt 3300 aaacaaattt tatgaatctg atattaacag tcgagttatg ccgaataaga aagatgtagt 3360 taaaattaac gaaaatggtg aaactcagta tgttccaaaa agattgctac tatgtgatat 3420 taaagtattg tatcgaactt ttaaagaaga atgtccacaa tttcctatca gctttagtaa 3480 attcgccgaa ctacgtccaa aatggtgtgt tttggccggt ggatgatttt atgtctgaat 3540 tgtctttgag tttaaaaaaa cttataccat atcatttcat cgcaaaaaat caatctaaat 3600 atataaaaga tcagaagaac aatttacaag agaacgaagg gcttctggag atggattatt 3660 ctgaaaattt tgcttacgtc gttcaggatg ctgctcagca atttcatttt aacaatgatc 3720 agtgtacaat aaatgttgca gtactatatt acagagaaga taatgaaaca aggcattgta 3780 gttttatagc actttctgaa tgtactactc atgatacggc cgcagtttat attttacaag 3840 aaaagattat ttctgaaata aagaaaagat tcccaaaaat gaataagatt atatacgtca 3900 cagatggtgc aaagcagcac tataaaaata gatatcaaat gatgaacttg gttaatcatg 3960 agtcagattt ccaaataatg gcggattggc atttccatgc tactgctcat ggaaagggtt 4020 catgtgacgg agtaggagcc gtgttaaaaa gagaagctac tcgtaccagc ttgcaggcca 4080 aagctacgga agcgatttta aattcaaaac aactttatga ctgggcgaga caaaagttcc 4140 aaacaataac attctttcac tattccaaaa aagattacga aaaaacttca agatcactaa 4200 agaaaagatt ctcggctgca ccgtctgtta caaaaatttc agaaggacat gcgtttttag 4260 tttctcccga caaaaaaatt actgtgttta gatactctaa tgcaccaagt ccattatcaa 4320 tagtacaata ttaataattg aatttaaaat atgttcaaca aaataataca aatatacaaa 4380 aataaaagta ataattaaaa aacttaaaaa aataaaaaaa aaaaagtaat gcgggctttg 4440 taataaagac agtctcaacc ctgtaaaacg cagagagctt tcataggaat caataaaaaa 4500 attgcacagt ataaaccata ttatcagtaa tcagttacat tatttaagtg cactgaagta 4560 aaagtaaaaa aaataaaatt tagcttcgtg atcaaattca gtggaccaaa attcataaaa 4620 atagccttcg tcaatgattt tcctctagat gtaaatttct taaccgcgta aaaaagacat 4680 tttggatggt tttttgaggt tagggacgag cgcttgaggt atcggtaaaa aakacggtcg 4740 gtttcgtgtt cagcagcaca aaatacatga aaatagtcgt tgcttagaat ttttctgagc 4800 ccctgtgatt tctgacagct tgatcatttc gtaaagtagt aattttttaa ggttatagaa 4860 // ID hAT-36_SM repbase; DNA; INV; 2605 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-36_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2605 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1839-1839 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 905..2455 FT /product="hAT-36_SM_1p" FT /translation="MNVSMNSVQEINNHFAKYVEIPESWRSKNYAFEFVEC FT INHVIQQEIFNEIRMAQFHCLIVDESTDISVHKMLIIYIKYRFKFNYKTVF FT AGILQLNACDGKTIVESIKCFYKNINLDMQKIVMLTSDGAAVMLGAYNGVT FT ALLTQEIRHLTAQHCVAHREDLGIDDAWKGIPLIAQVETLLRTVYTMFHRS FT SVKKHNFEEMASLMDCEVLMFRPLNEIRWLSRHFAIIPFIRNYDVLIEYCK FT EKLDSSNSADPIAKYCLTALENKVNRITLIALNDVFTELTKLSKYLQKSVL FT SPMEAHQYCKSIITKLRSQYLGDIIYWSEDVKKILEDESNNNKEILSGIIL FT FIEKLCYHLDKRFPDADMDLWNAFDQNAIVNTSDFNYGNENFKKIILKYGN FT LIPNMDNKIIMDEYKDLKYLFTEKFKNNSRDFNDLLQLVLREDSFQNIRIL FT FDICGTFQASSADCERGFSLMNLIKTKFRNRLDVNHLGNLIRIKSHLMSEN FT TIDLDKVYHYWKHNKDRREKK" XX SQ Sequence 2605 BP; 970 A; 336 C; 437 G; 861 T; 1 other; acgagcaacg ttccctctaa gatgcgcacg gaaatatttt gaattgcgca taagaaattt 60 gcttggaaat attttaacaa aatttnaaaa aaattgtttt aatttaaaaa caaacatttt 120 tcaatttacc gtactaaaac gttaaaaaag agtttgtttc atgttccaga aaaaacaaat 180 taagtataat ccattatttt cttctagtta taatataaaa aattcaaaaa aataattcct 240 tagttggcca tgcaaattgt tatggattta aatattaaat catttaaata ttaaagccaa 300 attaatctca ttcatttcat atgatctgcc cacaacagtt gttttttatc atagtgctta 360 gtttgttgca gttctaattt gtaagtttta tcatttttaa tttgtagtac taatattttg 420 taaaaaataa aaatatcaat attggacagg tattcaataa aataatgtca aaaagaaaaa 480 atttagctac tggtgaattg ggcaaaaaag tgaagcagtt aaaatcggag tggctttctc 540 tctatgttga aacaactata ccaagctctt gtggtaatca gaatataaaa ttgggagaga 600 tatttgtact tcttgaaact ggtgctgtta tatgtaaatt ttgcgctgag tctaaagtcg 660 gtggggattt tgcagtgggg aaaaatggaa tgactggaaa ctcgactatt taaagcgaca 720 tattagtcat aaaacccaca ttgaagcggt caatatacta cgtacacgat tatcaggtgg 780 agttttgcag ctacttacag aaacgcagca agatagagaa aatcgtacag aagccatttc 840 tagaacaaaa gctggtggtg ataaaatcaa aattttaata gataatgtta ttcttgcaat 900 taaaatgaat gtttcaatga actcagtcca agaaataaac aatcattttg cgaaatatgt 960 agagattcca gaaagttggc ggagtaaaaa ttatgctttt gaattcgttg agtgtatcaa 1020 ccacgttatc caacaagaaa ttttcaatga aataagaatg gcacaatttc attgcttaat 1080 agtggatgaa agcactgata tttcagttca caaaatgtta attatttata taaaataccg 1140 atttaaattt aattacaaaa ctgtttttgc tggtatatta caattaaatg catgtgacgg 1200 aaaaacaatt gtggagtcca taaaatgttt ttataaaaat attaatctgg atatgcaaaa 1260 gattgtaatg ttaacatctg atggagctgc agtaatgttg ggagcctata atggagttac 1320 agctctatta acgcaagaaa ttcgtcactt aacagctcag cattgtgtgg cacatagaga 1380 agatttggga atcgatgatg cctggaaggg tatacctctt attgcccaag tagaaacgtt 1440 attgcgaaca gtttatacca tgttccatcg ttcatctgtt aagaaacata attttgaaga 1500 aatggcaagt ctaatggatt gtgaagtttt gatgttccga cctcttaatg aaatacgctg 1560 gctttctcgt cattttgcga ttattccgtt tattcggaat tatgatgtct taatagaata 1620 ttgcaaagaa aagcttgata gttccaattc agctgatcca atcgcgaaat attgcttaac 1680 agctttagaa aataaggtga atcgtataac attgattgca ttaaatgatg tttttacaga 1740 attaacaaaa ctatcaaaat atttacaaaa aagtgttttg tcacctatgg aagcgcatca 1800 atattgtaaa tcgatcatca caaaattaag aagtcagtat cttggtgata taatttattg 1860 gagtgaagat gtgaaaaaaa ttcttgaaga tgaaagtaac aataataagg aaatattatc 1920 tgggataatt ctatttattg agaagttatg ctaccatttg gataaaagat ttcctgatgc 1980 tgacatggac ctttggaatg cttttgatca aaacgccatt gtcaacacat ctgattttaa 2040 ttatggcaat gaaaacttta aaaaaattat tttgaaatat ggaaatctca tcccaaacat 2100 ggataataaa ataataatgg atgagtataa agatttaaaa tacttattta cagaaaaatt 2160 caaaaataat agtcgtgatt tcaatgatct acttcaattg gttcttcgtg aagatagttt 2220 tcaaaatata agaattttgt ttgatatttg tggaaccttt caagcatcga gcgctgattg 2280 tgaaagaggc tttagtctga tgaatttgat taaaacaaaa tttagaaata gattggacgt 2340 aaatcatttg ggtaatttaa taagaatcaa gtcacactta atgtcggaaa atactattga 2400 cctggataaa gtataccatt actggaagca caacaaagac aggagagaaa aaaaataatc 2460 atgtaattaa tatttaatat tgcaagaaaa aacattttta ctaataaaat acctaattta 2520 cttaattttt gtttaatttt ttaatcgtag cgcaatttat tttttttctg ctcagctaaa 2580 aaaaattaga gggaacgttg atcgt 2605 // ID Gypsy-12_OD-LTR repbase; DNA; INV; 394 BP. XX AC CABV01000024; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_OD_; KW Gypsy-12_OD-I; Gypsy-12_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-394 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000024; Positions 3806 4199. XX SQ Sequence 394 BP; 124 A; 111 C; 79 G; 80 T; 0 other; tgtcgtaatt aaaggacttc tgctcagaac taaaacagat atgagcagtc atccatgact 60 cccaaggtca agattacaat cgcgcagcga cgtggccaaa ggccaagcgc cgctctgaca 120 cctaggcaag tccggaaaca cggccagtcc acatcgaatc attcggtgag atatcggacc 180 tgcggtaagt tgagagacag aacattagaa acccttcaca cgttaggata gagatccaac 240 cgacattagg gaatcaacaa cgacttctaa ctcacctatt cagactatcg ctccagccct 300 cacctgcaga agacttcgac acctggttct aacctgtcct tcctctgcaa ccgagatctg 360 aataaacgta gactaaaaca cttgtcacaa gcca 394 // ID BEL-4_DWil-I repbase; DNA; INV; 5548 BP. XX AC scaffold_180701; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_DWil_; KW BEL-4_DWil-LTR; BEL-4_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5548 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180701; Positions 1220922 1226469. XX CC Positions [4487-5077] - Integrase core CC 'AGCTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 125..5455 FT /product="BEL-4_DWil-I_1p" FT /translation="MAEKLLPLATEQITLLTSLEKSLARYKSEESGHTESH FT LEARLDSIAHLHADFQRNHHQLIQERDNAEIKGRYFDKDIRDCFEETYIMG FT VAEIRNELQRHKDRRIPASTAFDESCLHQTVHEQMGPCLEEVKLPTVKLPT FT FSGKFIEWPAFKEVFLTRVHHCDKLTDLHRFHYLKDSLVGDAAKDIEHLTL FT IAANYEVAWRMLINLYDNKRVLFLHYMDVFDQQLPIKYGDAESLRRFVQTS FT RSCVNSLEKTGVDVRQQSEILVYYMMKRLPNQLRVDWERSISSTQNLPSFE FT ELSQYLETQYRTMITASAGNIEGTLSRDRHRNAKPGVHGLRANQNYTKASF FT VTRETDLCIVCQGDAHLIGVCRIFQEMRLSERKACVVRHRLCFRCLRPNHR FT IDACDSVVNCEKCQGRHHTLLHVDKSGCERPLGTLRKGVIPTASANALIGK FT STTKTAVRHKDEVLLPTAMVWVESIEGYPLQFRAFLDQGSQVSFISEIAAQ FT QLRLRRKQAHITITGLADTKITDAAAEVELVIRSIYNPHAKYKLLASVIPL FT VSKRMPIKRLPFEEWTHLKGLPLADPNFFSPQGVDILLGNDVYDELMLGEV FT HRGIRGMPLAQNTHLGYIVSGKTNQEAGRRSSYTITVRRKEADEILEQLLC FT KFWELEQFDEEPRGVSVEDNWCEDFFVKTHSRMPSGKYVVRLPFLTYLDES FT MVIGESYKSALKRLHSLDRRLRNDASLKNAYVGTINEYLELGQMERVTSRA FT SKTGNPALGGVDHCYLPHHPVIKESSSTTKVRVVFDGSSKTSNGKSLNEIL FT AIGPKLHVHLQGIILNWRGLKWVFMADVEKMYRCINIPPDDAQYQRILWNP FT ETSDYVEEYACTTVMFGTSSAPYLAMRVMKQLAMDECEKYPLAVDVINHQM FT YVDDILSGGDSIAETEDVKNQVIGMLRSGTFELRKWASNCTKLLENIPVEH FT RESSGLLHMEGNDTIRTLGLYWSPKEDEFRFALHVVPPMSSPTKRTILSAI FT ARLFDPMGWLSPIIITAKILMQQLWKEKIGWDSTLPDTIKREWEKFVKELP FT NIVTIRIPRHVTWGLPGGVAELHLFCDASSRAYGAAAYLRFPIGEDKYHTE FT LLLSKGRVSPTKQQLTIPRAELCAALEAAKMYKYLKENLRININYSNTIFW FT SDSMITISWIRGGAANWKVFVSNRINKILEISDERQWRHVASKDNPADYLS FT RGTSATKLASCELWWHGPSWLQGSDIEWPCNAIDQDSLSVVDTEVKKVTCH FT KISVEEMPSVAFISKYSSYTRLVWIISYIKRFVYNCRTSVAERRGPGLSIV FT ELEEGLRELIRMVQKDCFNSEWKALSEGRVIAKDSILLSLNPYFDSNERVL FT RLGGRLRDSLLPINEKYPYILPYKHPFTDLVISYSHICTLHGGTSLTLNFI FT RKKFWIVNGRNAVRFGIHKCITCFKMKPQFAGQKMGMLPAVRVRPSRAFSS FT VGVDYAGPVNISPCKGRGRVSQKGYIAVFVCLVTKAIHLEPVGDLTSDAFI FT GAFNRFIGRRGLCTDVYSDCGTNFIGANKKLQVDKLAYCSYITKHIAPTLL FT RKGINWHFNPPSAPHFGGLWEAGVKSMKYHLKRTLGEQVLTFEEMATVLAQ FT IESCLNSRPLCPLSNDPDDLMVLTPGHFLMGEAPLALPAPELAKVTLIDRW FT KNCQYYAQQFWRRWNSEYLSRLQRRPKWLQTKENLTVGCIVLIRDERFPSH FT QWPLGRVMETHAGPDGLVRVVTLKTVKGLMKRPVAKLCPLPIQENWIENDT FT SPTPKR" XX SQ Sequence 5548 BP; 1681 A; 1040 C; 1386 G; 1441 T; 0 other; tatttttggt gccgaaaccc gggaggaatg atcttggcga ttggatttag agtgtgattg 60 ccagtggata gactgattcg ggtgagtaca atttaacgtt gatgtgtttg ggtaggtgtt 120 cattatggct gaaaaactgt taccgttggc tacggagcaa atcacattgc tcacttcgct 180 ggagaagtca ttggcgcggt ataaatcgga ggagtcggga cataccgaga gtcatctgga 240 agccagattg gactcgattg cccaccttca cgcagatttt cagcgcaatc atcaccaact 300 gattcaagaa cgagataatg cggaaataaa gggcagatat tttgacaaag acatcagaga 360 ctgtttcgaa gaaacctaca taatgggagt agctgagatt cgaaatgagt tgcagagaca 420 taaggatcgg agaatcccgg cgtcgactgc atttgatgag tcttgtttgc atcagacggt 480 gcatgaacaa atgggaccat gtttggagga agtgaaactc ccaacagtta aactgcctac 540 cttcagcggg aaatttattg agtggccggc gtttaaagag gtgtttttaa caagagtaca 600 ccactgcgac aagctgacgg acttacatcg ttttcattat ttgaaggatt cgttggttgg 660 agacgcagct aaggatattg aacatctaac gctaatagcg gcgaactatg aagtggcttg 720 gagaatgttg ataaatttgt atgataacaa acgagtgtta tttttacatt acatggatgt 780 gttcgatcag caattaccaa taaaatatgg agatgcggaa agtttgagaa gatttgtgca 840 aacttcccgc tcatgtgtga attcattgga gaagactggt gtggatgtgc gacaacagtc 900 tgaaatttta gtgtactaca tgatgaagag attaccgaat caattaagag tggattggga 960 gcggtcaata tcttcaacgc agaatctgcc gagctttgag gagctcagtc agtatcttga 1020 aacacagtac agaacaatga ttacggcgtc agcaggtaat atcgagggaa ctctatcacg 1080 ggatagacat cggaatgcaa agccaggtgt acatgggtta cgagctaacc agaactacac 1140 aaaggcttca tttgttactc gagagacaga cctatgcatt gtgtgtcaag gggatgcaca 1200 ccttatcgga gtatgcagga tattccagga gatgaggcta tcggaacgaa aggcgtgtgt 1260 agtaaggcat cgactgtgtt ttcgatgcct tagaccaaat caccgaatag atgcatgtga 1320 cagtgtggta aattgtgaaa aatgtcaagg acggcatcat acactactcc atgtagacaa 1380 atcaggatgc gagaggccgc ttggcacact aaggaaggga gttatcccaa cagcatctgc 1440 gaatgcttta attggaaaga gtacaacaaa aactgcggta aggcacaaag acgaggtttt 1500 attaccaaca gcaatggtgt gggtagagtc gatagaaggc tatcccttac agtttagagc 1560 gtttttggat caaggttcac aagtttcgtt catttcagag attgcagctc agcagctacg 1620 tttacggcgc aaacaggctc acatcacgat tactggatta gcagatacaa aaattacaga 1680 tgcggcagca gaggtggagc tagtcataag atcaatctac aacccacatg cgaaatacaa 1740 gctgctcgcg tcagtaatac ctctagtgag taagcgcatg ccaatcaaac gactgccgtt 1800 tgaagagtgg actcatctga agggcttacc gttggccgac cctaacttct tcagtccaca 1860 aggagttgat attttgttgg gaaatgacgt ctacgatgag ctgatgttag gagaagtgca 1920 tcgaggaata cgtggtatgc ctctggcgca aaatacgcat ttgggataca ttgtatcagg 1980 aaaaacgaat caagaagctg gtaggagatc gtcctacacc ataacggtac gaaggaagga 2040 agctgatgag atacttgaac aattgttatg taagttttgg gagctagaac aattcgacga 2100 ggagccacgt ggggtttcag tagaggataa ctggtgcgaa gatttcttcg ttaagacaca 2160 tagccgaatg ccgtctggga aatatgttgt tcgtcttccc tttcttacat acttggacga 2220 atcaatggta attggagaat catataagag cgcgttaaaa cgattacatt ctctggacag 2280 acggttgcgt aacgatgcga gtttgaagaa cgcttatgtt ggcacgataa atgagtatct 2340 ggagctgggt caaatggaac gtgtaacgtc acgggctagt aagacgggca atcctgcatt 2400 ggggggagtc gatcattgct acctaccgca tcacccagtc ataaaggaat cgtcatcaac 2460 tacgaaggta cgagttgtct tcgatggatc gagcaagacg agtaatggaa aatcgcttaa 2520 tgagattcta gcaataggtc caaaattgca cgtacatctt caaggaatca ttctcaattg 2580 gcgagggcta aaatgggtat ttatggcgga tgttgaaaaa atgtacagat gtattaatat 2640 tccacccgac gacgcgcagt accagcgtat tctatggaat ccggagacaa gtgattatgt 2700 ggaagaatat gcctgcacaa cggtgatgtt tggaacgagc tccgcacctt atttagccat 2760 gagagttatg aaacaattag ctatggatga atgcgagaag tatccactgg cggtggatgt 2820 tataaatcat caaatgtatg tagatgacat actgtctggt ggcgacagta ttgcagaaac 2880 ggaggatgtc aagaaccaag tgattggaat gcttcgtagt ggtacattcg aactacggaa 2940 gtgggcgagt aattgcacaa agctattaga gaacatacca gttgagcatc gagaatcaag 3000 cggattgtta cacatggaag gtaatgatac tattcgcacg ttaggtctat actggagtcc 3060 caaagaagat gagttccgtt ttgctttgca tgtcgttcca ccaatgagct caccaacaaa 3120 gaggacaata ctatcagcaa ttgcacgact gtttgatccc atgggatggc ttagtccgat 3180 aataattact gccaaaattc tcatgcagca actgtggaag gagaaaattg gctgggatag 3240 cacactaccg gacactatta agagagaatg ggagaaattc gtaaaagagt taccaaacat 3300 agtaactata cgaataccac ggcatgttac atgggggcta ccgggtggtg tggctgaatt 3360 gcacctgttt tgcgatgcct cgtcacgagc atatggagct gcggcttact tgcgcttccc 3420 aattggagag gataagtatc acacagagtt gctattgtca aaggggagag tatcgccaac 3480 taaacaacaa ctaaccatac ctagggcaga gttatgtgct gcgcttgaag cagccaaaat 3540 gtataagtat ctaaaagaga atctaagaat taacattaat tattcgaaca ctattttttg 3600 gtcagattct atgataacca tcagttggat ccgtggtgga gctgccaact ggaaagtatt 3660 tgtgtccaat cgaataaaca aaatcctgga aatctcagac gaaaggcaat ggagacacgt 3720 cgcctcgaaa gataaccctg cagattatct aagccgtggc acgagtgcaa ccaagttggc 3780 atcgtgtgaa ttatggtggc atgggccatc atggttacaa ggatcggaca tcgaatggcc 3840 ttgtaatgct attgatcagg attcgctgtc ggttgttgac actgaagtta aaaaggtgac 3900 gtgccataag atatcagtgg aagaaatgcc aagtgtagca tttatcagca agtattcatc 3960 gtatacacga ctcgtctgga ttatatcata tataaaacgt ttcgtatata attgtcgaac 4020 atctgtggca gaacgaagag gacctggact ctcaattgtg gaattagaag agggcctacg 4080 tgaattgatt aggatggtgc agaaggattg ttttaactct gaatggaagg cgctatcgga 4140 aggcagggta atagcaaagg acagcatact actgtctctt aatccgtatt ttgattcgaa 4200 cgaacgtgtt ttacgtctcg gaggcaggct ccgagattcc ctgttgccca taaatgaaaa 4260 atatccttat atattaccgt ataaacatcc tttcacagat ctggtcatat cttacagtca 4320 tatttgcacc ttgcatggtg gaacctcact aacgctgaat tttataagga agaaattctg 4380 gattgttaat ggacgcaatg cagttcggtt tggaattcat aaatgtatca catgttttaa 4440 gatgaagcct caatttgcag gtcagaaaat gggaatgtta ccagcagtac gggtacgccc 4500 ttctagagca ttctcatccg ttggagtgga ctacgctgga cccgtaaata tttcgccctg 4560 caaaggaaga ggacgtgtat cccagaaggg atatattgct gtgttcgtat gtctagtgac 4620 aaaggccatt catttggaac cagttggaga tctgacctca gatgcgttta taggagcatt 4680 taaccgtttc attggcagac gtggcctatg cacagacgtt tatagtgact gcggtactaa 4740 ttttattgga gccaacaaaa aattgcaagt ggacaaactg gcatattgtt cgtacattac 4800 gaaacatatt gcgccaacat tattgagaaa aggaatcaac tggcatttca acccaccatc 4860 agcgccacac tttggtggat tgtgggaggc tggagtgaag tccatgaagt atcatctgaa 4920 gagaactctc ggagaacaag ttttaacatt cgaggaaatg gcaactgtgt tggcacagat 4980 tgagtcttgc ttaaattcaa gaccgttatg tcccctatca aatgatcctg acgatttaat 5040 ggtattaaca cccggtcact ttctgatggg tgaagcgcct ttagcacttc ctgctccgga 5100 attggctaaa gttactctga ttgatcgatg gaagaattgt caatattatg cacaacagtt 5160 ttggagaaga tggaactcgg aatatttatc acgattacaa aggagaccaa aatggctgca 5220 gaccaaggaa aatctaacag ttggatgcat agtgctcatc agggacgagc gtttcccttc 5280 tcatcaatgg cctctaggtc gcgttatgga gacacatgca ggacccgatg gactagtgag 5340 agtggtgact ttgaagacag tcaaaggact catgaaacga ccggtagcca aactctgtcc 5400 attgcctatt caggagaatt ggatagagaa tgacaccagc cctacaccta agcgttaaat 5460 tgttactttc tcgtttcagt tttctttcgc aggagtcttc cagacgtgaa tttgatgatg 5520 aagatcgtca tcatcaaagg ggggagaa 5548 // ID Rehavkus-1_HR repbase; DNA; INV; 10679 BP. XX AC . XX DT 31-MAR-2008 (Rel. 13.03, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed Rehavkus-1_HR DNA transposon - a DE consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Rehavkus-1N1_HR; KW Rehavkus-1_HR; Rehavkus group. XX NM Rehavkus-1_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-10679 RA Kapitonov V.V. and Jurka J.; RT "Rehavkus DNA transposons from the leech genome."; RL Repbase Reports 8(3), 375-375 (2008). XX DR [1] (Consensus) XX CC Rehavkus-1_HR belongs to the Rehavkus group of the MuDR CC superfamily of "cut and paste" DNA transposons. Transposons from CC this group are widespread in different metazoa, including CC insects, sea squirts, sea urchin and fish. The genome harbors CC several copies of this transposon that are less that 3% divergent CC from the consensus sequence. Rahavkus-1_HR contains 26-bp CC terminal inverted repeats. The 73-bp 5' terminal portion is CC tandemly duplicated (pos. 1-73, 73-146). The 3' terminal portion CC contains a 184-bp minisatellite (pos. 9743-10442). Rehavkus-1_HR CC elements are flanked by 9-bp target site duplications. The CC transposon encodes a 915-aa Rehavkus-1_HR transposase, which is CC composed of the transposase core, C48 cysteine protease, and PHD CC finger. CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX FH Key Location/Qualifiers FT CDS 4678..7422 FT /product="Rehavkus-1_HRp" FT /note="Rehavkus transposase." FT /translation="MHYNLYVLRQFICNKHCLKFNIPFLHICNFLSSIPNF FT QGVHELEEEGKEVSCRTYANCTVHLLSSAVEEEVYQGLREKDATSTERICF FT AVENAISPCIAITNSKRENRSRYIIYLKCKFEMCRLFKVNISHNKIDIYTS FT STNYNHNGKLVRQLRGQRRKNLYEQLILQHPSVLREKFIRSSNAALIKQGN FT LQNIHSLGVFKNVRCEAMAAGDKAKDDVIDLIERQKFQKTDEIYIQAVSAV FT PFYATFFSDDQIEVILRLGQTDLPTIHLDSTGSVVRRLHSKQTIYLYSAVV FT GLDAGYAQIFDFISDRHDTASVAYHLINFKRYVIKKVNAWPIFMNVVTDHS FT MAMINAVLTGWIGVDLRGYLAMAFKWCISECVPKNQIMIKLCCAHFFKNIV FT RFVKKNFSTKEVVKFYINAFSVILQAKSYRRMINLITSFFHITCSLKKTPS FT LNSHFDEFHESTINEVSHYDLIEHVYDDDDNEVNELETIYESSPFYRKGLQ FT ILEHVNDQLRLVNDEQTEPNEYCNKTFAKLVLKKYISILPLWTVLLNKNGE FT RTSNSKVENCFRFIKHQLLNGSTNLKIGRFAELLKNHVDARTKEILLQIPA FT SRKRKIKTVVKECEKNYWGKRKPFKSYFNSIVTVKIPTIICSNDQSANGHS FT NICITNSNIPSNKCYYEQIVSRNCYVAKKGNDRLLLSDFQTVIGEKWVSSV FT LLDYLSHWFTNSIYIPCDLSTTVMFKNFTCSSPIFQYSFNANYIFFPLVVN FT NNHYCLITVDNNLNKIFFYNPLGSSILQIKQFEKRFKRMSQRINKQSWENF FT DFEAHNELPKQMDGHNCGIFIYMVMDTLDKSKPIGNEFDPIRCRLKIFDDV FT IANSDNMINECLFCCNESADELWIQCQNCFRWVHRKCTLITCSDWTKVLYN FT CNICRSLLGQI" XX SQ Sequence 10679 BP; 3603 A; 1603 C; 1792 G; 3681 T; 0 other; acgaaggtaa tgttcgcgac ggttttggtg tatctaacag gcggaaccaa tattgtcaaa 60 ttggctgaac agtacaaagg taatgttcgc gacggttttg gtgcatctaa ctggcggaac 120 caatattgtc aaattggctg aacagtttga actaatatat tttcgtttaa taatattaca 180 cggaataatt aaaaatttaa ttttattata taagcatctg cgtctcgtta atttttggaa 240 acgatacgat atcggctacg atgaagcgta tctaacaggc ggactcaatc atgtcaaatc 300 ggctgaacgg tttgatctaa taaattttcg tttaatgaaa atacacggaa ttattaaaaa 360 ttaattttgt tacataagaa tttgcgtctc gttattattt tgaaaacgat acataacgat 420 acgataacaa tacgataacg aggtattgaa gactatatat tgatataata aaatcatgtg 480 cacaagtatg tgcacaaatt gtatgatgat ttttgagttt ttgaaagata gtatttgaac 540 atcgcgcaat aacaatgacg cttggcacat ttacagtctg cccttagttg ttttcccgcc 600 aactccgctt tacaaaatgt actgtattaa aacagaaaaa agttaaaggt caattacact 660 cttccaaagt tcagaggatt caacagagtg caaaacatcg taaatgaaca tcgtgctaaa 720 cgacgcatta cacacgaagc atatgatacg cggggatatt tttgtcacat tcgaacccgg 780 ttactgcgaa acagcaccag gcgaccttaa ccattagact atcacgccgc ccagtcattt 840 cgcccagaaa acagccattc aattgaaaaa ttaaagaaaa gtttgttttg ttcgtatttt 900 ttatttgata cgagactttt tggagaatgt ctcgttttaa taacgatacg atatttttag 960 agtctttatt aaaccaaagt cattcaattg gaaaaaatta aataagtttt aatattttat 1020 aacactttat ttttttacaa tgtgcacttt taccatttcc acacattgaa ataatggtac 1080 gagacgaaaa ttgtgtcttt tattttttat ttcgatatga tagactttcc taaaaatgtc 1140 ttgtttaaat aacgatagga gatctctggt attgttatta aaggatacga gatttttcgt 1200 atcgttttat aacgatacga gatatttttt accgttatta tttttaaatc tctgtcgatt 1260 ggtaacgaaa ataacggttc gagacgcagg tgcatagtat aagattctat atattatttt 1320 gttgacatca taatacaatt taaaaattgt acagtatatt atcgaatatt ttaaaattgt 1380 gttacacttt gatttataaa ttaattttaa tgaactcggt gaattaaaca ggaactgcaa 1440 agaagtattg gacttaatta tggaagtttt tggtccttgt tattatctcg caaacaacct 1500 aagtgcacca cagttttgaa gtttcctgta tctgttgtgt tacaacgtca ttcctgcttt 1560 tgcttgtagt atagtttata atttgattaa cacaaaatgg caataccaga atagaatacc 1620 gtgattgtac tttttggact tttttaaatt ttagatgttt ttcagattat gttttgaaac 1680 ggaataaaat taaaaaaaat acttttaaaa ttattctgca tgtttaaact ctagtgttac 1740 tttcgaaaga aatgttcatg ttttaattct cgtgttttaa ttcgcggcat tcttttcttt 1800 ttttaaacga gttatttgct acaaattttt gtaccatatc cccgtttttt gttggcttct 1860 ctaaaaagaa ttacatttat actaatataa attgttttaa tttcacacgt agtggtagtt 1920 gatgactcaa cttccttcat tgttttttgg cccaaggtga cataaaaatt acatctcaaa 1980 acggaccgga ttagcgtcga tccagatgtt tatgtttcgg cattaaatac cgtcagtgaa 2040 acagcacgtt caatccagga cgttgctcgt caatagtttt atttttctga tgctgcgtca 2100 catcagaatt tgtcaaatgg aggcatttgt ccattgttaa cagctcaacg taatcgacaa 2160 atttttaatg ttattttgga acaggaaata gcgaaagagt agtcctgtta tttcaaatca 2220 gttaaacctg aaaacgagaa aacgacgaaa ccaagtattc atttgtcata actatggcgt 2280 gatggctcaa tggttaaagg cgcctggtct tgctattgca ttgactgggt tcgaatacaa 2340 agagatcccc gcctatcata tgcaatgtgt atgatacgtg gtttgcacgc gtattcatat 2400 acgctgtgtg tgtttgttgt tgaaacctct gaagcaaacg caaaagagag agaaaccttt 2460 aacctttatg gtgaatattt aataactttc tttttcgttg taatgatatt tgatgctaat 2520 aacattcaac cccaacctgt gtatcaatca acttcgatat tggggtttgt ttatgacatt 2580 tgtttgttac attttgaaac cgcaattttt tttaacgtta aataaaaaaa gttataaatg 2640 ttttagtaag tgagtaatta agtcatacca cacacacaaa cacatttata taaaaaaaat 2700 ggatatatag gtacatatat aatatatata tacaaagata tattaaacaa caatgcatac 2760 ttttttataa actttttatt acaaataaaa acgttaaaaa tttatttcgt gcactttaga 2820 gaataaatga tgtttctctt aaagttcatc attccacgtt catttaggtg gatgccgtca 2880 ggtcctatat tattacctcc tcccactctg tcaagcttcc agaaatgaac aatgttagac 2940 tggaattgtt tcaatgaaac atttagttta tcaactttcc atttctgaac gtcttttcta 3000 taaaccaggc ttaatatgat tattcggcgc ggtctgattt tgttctctaa cagtctgagt 3060 aaggattcat atctcgcggc cacgagttcc ggggtttcag tcgtcaaatc attgcctggt 3120 atttaatttt aatgactaaa aattaaacaa aactgttatt atattaaaag aataatgaac 3180 caacattaaa ctatttatta aacaaatgaa tcaacctccg atgtaaagga ttattgtttc 3240 aataggaatg gtatgcttcc ggaagaagtt aatcaatcct ccgatcctcc caccaggctt 3300 tccttttaat atgaaacgtt tattaccgtg ggtaataacg tcatttttca tttttaaaaa 3360 aattccattc tacgcacata actgtcgcca aacaacgaaa tagacattgc tcgaacagtt 3420 ttataaattt aaccttgaca gcaagtaaca aaatggccaa catagaaatg aaaaacgtgg 3480 tttcttaaca tggaaataga aatgtaatta aacgtgtata attgaactga aacgtatata 3540 atgtcaataa ttgttataca cacaaattgt ttaatacaca taattacggc aaggaaaaac 3600 aattatatat aataagaaca aaaatagatc taaattcata atttcataat ttggtaacac 3660 aaaaattatc aataagtgaa tttaaaattt gttgcgtaga agttcgcttt tatacagagt 3720 tctttttcag cgcttgtttg ctaggcggct ttttcaaaac aatttttgaa actagttttc 3780 ttacaattgc gagttttgcg atatggattc ttttattgaa aaattcgtga gttacgtcca 3840 gaaaagtggc atagacagag aaaagttact aaaggtttac ttttttataa ctaaaaaata 3900 tacgaagcca caaaaataaa tgttaaacga taactttata agttagcgat attgatctgt 3960 acttttcaca attttagtca tcacgatctt ttttgcaatt taaaatgttt aaatttcact 4020 ttttgctgcg attgcattag tatttgttaa tttttagcta aaataaaaaa catattttgt 4080 ttttttttgt aaatttaaat gttcacgcgt taaagaacta tagttcaatt actgtgcaat 4140 gcatattatt cattaaatta aacgaatcaa catttttatt aaaatacagt ctagtttgga 4200 tgaacagggt tggtttggcc aagcttgcaa gagcctaaag cagaaaaata acttcagtaa 4260 tcgaaagcgg taaatgcact attgaaaatg taacaaaact agtgataaca ttaaaaattt 4320 gaaattcaca gattgcatga cttattgcgg aggaatagat gctttttagg tttaactgaa 4380 aatggtcccc aaggcaacga tgaatgtttg cagaacctac agagcatagt aacatgtaac 4440 aacaatgcct tagccaagtg gttgcatgat aaccaaacaa ttgtccggag caacagagat 4500 gatatcttaa agatactgca aggtaatata attaatatac aatgtttttt gcagcaggta 4560 gagaacactg atgtcatcga tgaatcacaa cggcaagcca ctcaggcatc acatgatgaa 4620 agtgttgcaa ttgttgaaaa caattcgccg gcacgaggta ttgatatatc atttataatg 4680 cattataatt tgtatgtact acgtcaattt atttgtaata aacattgcct caaatttaat 4740 attccctttc tgcatatttg taatttttta tcatctattc ctaatttcca aggggtacat 4800 gaacttgaag aagaaggaaa agaggttagt tgcaggacgt acgccaactg caccgtgcat 4860 cttttaagta gtgcagttga agaagaggtg taccaaggcc tccgcgaaaa agatgcaaca 4920 tcaactgagc gtatctgctt cgcggtagaa aatgccattt cgccctgcat tgctatcaca 4980 aatagtaaac gcgagaatcg ttcgcgttac attatttatc tcaaatgtaa atttgagatg 5040 tgcagattat ttaaggtaaa tataagtcat aacaaaattg atatttatac atcgtctacc 5100 aattataacc ataatggaaa attagtaaga caactaagag gccaacgtag aaaaaacttg 5160 tatgaacagt taatattgca acatccgtct gttttacgtg aaaaatttat aagatcctcc 5220 aatgctgcat taataaaaca ggggaatcta caaaatattc attcgttggg ggtatttaaa 5280 aatgtgagat gtgaggctat ggctgcaggc gataaggcaa aagatgacgt tattgatcta 5340 atagaacgtc aaaaatttca aaaaacggat gaaatttata tacaagcagt ttcagccgtt 5400 ccattttatg ccactttttt ttccgacgac caaattgaag ttattttgag attaggtcag 5460 acagacttac caacaataca tttagattca acggggtctg tggttcgtcg tcttcatagc 5520 aaacaaacta tatatctata cagcgccgta gtaggtcttg atgcgggtta cgctcaaatt 5580 tttgatttta taagcgatcg ccatgacaca gcttcagttg cttatcattt gataaatttt 5640 aaaagatacg tcataaaaaa ggtaaacgcc tggccaattt ttatgaacgt ggttacagac 5700 cactcaatgg caatgataaa tgccgtttta actggttgga ttggagttga tttgagaggt 5760 tacctggcaa tggcatttaa atggtgtatt agcgaatgtg tcccaaaaaa tcaaataatg 5820 ataaagcttt gctgcgcgca cttttttaaa aacatagttc gatttgtcaa aaaaaatttt 5880 agtacaaaag aggttgttaa attttatata aatgccttta gcgtaatact gcaagccaaa 5940 agttatagac gcatgataaa tttaattact tcattctttc atattacatg ttcgttaaaa 6000 aaaactccat ctttaaatag tcattttgat gagtttcatg aatcaacaat caatgaagta 6060 agtcattacg atctaattga acacgtttac gatgatgacg ataatgaagt taatgaattg 6120 gagacaattt acgaaagtag cccattttat cgaaaaggct tacaaattct tgaacatgtt 6180 aacgaccaat tgcgtttggt gaacgatgaa cagacggagc caaacgaata ttgtaataaa 6240 actttcgcga aacttgtttt aaaaaaatat atttcgattt tgcctctttg gacagttctt 6300 ttgaataaaa acggggaacg aacaagcaat agcaaagttg aaaattgttt taggtttatt 6360 aaacaccaat tgctaaatgg atcgacaaat ttaaaaatag gtcgttttgc ggagttgtta 6420 aaaaaccatg ttgatgcgcg tacgaaagaa attctactgc aaatacccgc ttctcgtaaa 6480 agaaaaataa aaacagtggt aaaagaatgt gagaaaaatt attgggggaa acgtaaacct 6540 ttcaaaagtt atttcaacag catagttacc gttaaaatac ccacaataat ttgcagcaat 6600 gatcaaagtg caaatggcca cagcaacatc tgtataacaa attccaacat tccgagcaac 6660 aaatgctatt acgagcaaat agtcagtaga aattgttacg ttgccaaaaa aggaaatgat 6720 aggttgttgc tttcggattt tcagactgtt attggagaaa aatgggtaag tagcgttctt 6780 ttagattatt tgtctcactg gtttacaaat tcaatctata taccttgtga cttgtccact 6840 actgtaatgt ttaaaaattt tacttgttct agccccatct tccaatattc ttttaatgct 6900 aattacatat ttttcccact cgtcgtcaac aacaatcatt actgtttaat aactgtagat 6960 aataatctaa acaaaatatt tttctataac cctcttggat catctatttt gcaaatcaaa 7020 caatttgaaa aaagatttaa acgtatgtcc caaagaatta acaaacaatc atgggaaaat 7080 ttcgactttg aagcgcataa tgaactgcca aagcagatgg atggccacaa ctgtggaatt 7140 ttcatctaca tggtcatgga cacgttagat aaaagcaaac caattggaaa cgaatttgat 7200 ccaataagat gtagattaaa aatatttgat gatgtgattg ctaattccga taacatgata 7260 aatgaatgtt tattttgttg taatgaaagc gctgatgagt tgtggattca gtgtcaaaac 7320 tgtttccgct gggttcatag aaaatgtacc ctcatcacat gcagtgattg gactaaagtg 7380 ttgtataatt gtaatatttg tcggtctcta cttggtcaaa tataattgac aattgaatga 7440 gcctgcttca ggcagctctc taggtaggtt ccttttttta attgaatgag cctgcttcag 7500 gcagctctct aggtaggttc ctttttttaa ttgaatgagc ctgcttcagg cctctctcta 7560 gctaggttgc gtttttttta attgaatgag cctttttaca ttcaaaaatt agtttaattt 7620 agctgcttta gtttcattta actaagccag taaaaataac gcagctctat aggtaggtct 7680 cctgcaaaaa ccgactcgca cttcatcgta gtgcaggagt gaacatcttt tgtcgaagac 7740 tgaagatggc taacgcgagc ggtgttttta ttgaatacca ttttacgtta ccgatgtcgc 7800 aaatttgaaa atgtatgggt aacgcgttca atcgaattat tgagaatgtg tacgaataaa 7860 attgtctatt ggaagatgca agtaaaacag gacgttcaat ccaggacgtt actcgtcgat 7920 agttttattt tttttttgtt atgttgttgt tcatatttaa cggcatttat gctcccaaag 7980 tggtgcaatt gtttgccaaa ttaaaaaaaa aaattttttt ttctccatgt ggaaaaatag 8040 aagttcagat aatatgaaaa atgtttattg cactgaacgt ggggaaagag aaatactatg 8100 ggaaaaacat ttaaatcata tcgtggtagt tacaatcctt cagaaattgt attgcattga 8160 taatattatt gttattctta aaattttctg aaagctcttt atatattaaa tgtttatgtc 8220 tatgtgtaat tgggtttaat ttgtgtccgc aatattgaca taatgtttta tctgacttat 8280 ttaatatata tccatgtgtt aattttgtgt ggccaatcat tagtctgttt attttagagg 8340 ataccaatag ggtgttaaat attctgcatc tcttttgttc gatcaatggc cttatttgtc 8400 ttaattgtgt tatgcaatca ttccattcat tttgccataa tgtgtatata tatttattta 8460 ttattgattt aatgtctgag taaatcattg ctgtatttag tatgggttca ttgtgggctg 8520 ttaaagctaa ttgatcggcc ttttcattgc caaaaataca tgtgaaatag gctgtacaat 8580 ttagaatgtt aataattgat agttttattt ttatgattct gcgtcaaatc aaaattagtc 8640 aaatggagac attagtccat tgttaaaagc tcagcttaat cgacaaattt ttactgttaa 8700 tttggaacaa gaaatagcga aaggtatagg cagagaagag ttactcaagg tttttttata 8760 actaaaaatt gtacgaagcc gcaaaaataa atgttacgat aactatatct gttagcaata 8820 ttgatacctg tagcgtgcaa ttaccaattc atacacataa cattatgatt ttcagcacac 8880 accacacaca cacacatata tagatatcat aatccgtggg caagtaatac ttggtaaata 8940 tatatatata tatttattca ttcgcttggg tttgcacttt tttggtgcta tacgtttttt 9000 tttattatta taacgtatta ttatattttt ttgtgttttc atacatttaa attaaagcta 9060 aaacttatat tttataaata tgctatatta atttgtcggt cgcaaaaaca ctacagttta 9120 ttatatttta cctcgtatat atagttaaaa tttatacgct tgtcaatctg tactattcac 9180 attatagtca tcttcatggt catcacaaaa tttttttcaa tttaaaatgt ttaaatttca 9240 cttttagctg caactgcatt agtatttgtt aatttttagc taaaataaaa aacatatttg 9300 tttttttttt taaatttaaa tgttcacgcg ttaaagaact atagttcaat tactgtgcaa 9360 tgcatattat tctttatacg aaataacatt tttattaaat tacagtctaa tttgaatgaa 9420 cagggttggt ttggccaagc ttgcaggagc ctaaaacaga gaaataattt caccaatcga 9480 aaatgtaaaa aactagtgat aacaataaaa aattgaaatt cacagattga cttattctgg 9540 aagaataaat gctttttagg gttaactgaa aatggtccca aaggcaacga tgaatcttta 9600 caaaacctac agagcataat aacaagtaac tacaatgcct tagaaagaaa agaggttagt 9660 tgcatgatgt aaaccaactg cacggtgcat ctattaagta gtgcagttga agaagaggtg 9720 taccaaggtt accggtgacg caaatgtatg gataacgcgt tcagaccaat atttggggat 9780 gtgcacaaaa cgggagaaag ttttgttata tatggtattt ttgggaataa aattgtctat 9840 tagaagatgc aaatattttc cagatgcaca cataaatgga tttttggcga ttttttacct 9900 ttctcgttac cgatggcata ttaatgtagg gttaacgcgt tctatcaaac ttttgagaat 9960 gtgcacaaaa cgggagatag ttttgttata tattattttt ggaaataaaa ttgtctatta 10020 gaatatgtaa atattttcca gatgctcaca taaatggatt tttggcgatt ttttaacttt 10080 ctcgttaccg atgacgcaaa tttggaaatg tagagttaac gcgttcaatc gaatttttga 10140 gaatgtgcag gaaaagggag atagctttgt tatatgttat ttttgggaat aaaattgtct 10200 attagaatat gcaaatattt tccagatgca cacataaatg gatttttggc gattttttta 10260 ccgatgacgc aaatttggaa atgtagggtt aacgcgctca atcgaatttt tgagaatgtg 10320 cacaaaacgg gagaatgttt tgttatatat ggtatttata taattaaaat tgtctattaa 10380 aaaattcata tatttttcag atgcatacat aaataatttt ttggcgattt tttaactttc 10440 tccttatcgt tggtatctta atatatagcg attttatgat gaagtttttt tttgttttat 10500 gcaattttta atctgaataa agttaaaatg ttataaataa caaataaaaa acaataataa 10560 atcttttttc atttcattaa acctttaaat atttattatt taatcggttt tttcataatc 10620 cgccttttac atatgcttaa tcgtcgctga gttaaaaccg tcgcgaacat taccttcgt 10679 // ID Mariner-2N1_BF repbase; DNA; INV; 578 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-2N1_BF DNA transposon DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner/Pogo; non-autonomous; Pogo-1N1_BF; Mariner-2N1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-578 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-578 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-578 RA Kapitonov V. and Jurka J.; RT "A family of Mariner-2N1_BF non-autonomous DNA transposons from RT the amphioxus genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC It is similar to the autonomous Mariner-2_NV. XX SQ Sequence 578 BP; 158 A; 115 C; 147 G; 158 T; 0 other; cgaggggcgt gcaataagta atggccctga cccacttcca gttgtctgat ctaaatgaaa 60 ttttgcatgt gtaatgattc atatctctat gggttatgtt gcaaaaaaca gctctgaact 120 aattgtggtt tctgatttac tggtgtttga actgagtcag gtgtgaaatg gaccaggtgt 180 gaaatggagc cagttgagtg tcgcgcagtg atccggtttt tgtatttgaa aggacgcaca 240 ccaaaggaga cttttgatga aatgaaagaa acttatggtg atgatgcccc atcatatgac 300 cttgtaaaac gctggcatcc tgaattcaaa catggccgga agtctgtgga aacagctccc 360 agacctggtc gtccctcttc tgccattgat gaggcatctg ttggaggacc aacctggggt 420 gtcctacaaa aacggtgtcc agagctgcat caaacgatgg gagaaatgca taactctggg 480 tggttcctat gaagagaaag actaataact gtgccaagtt tcattaatct cctgctatgg 540 gaaatgggtc agggccatta cttattgaac gcccctcg 578 // ID BEL-41_CQ-LTR repbase; DNA; INV; 493 BP. XX AC AAWU01003534; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-41_CQ_; KW BEL-41_CQ-I; BEL-41_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-493 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 236-236 (2011). XX DR Genome; AAWU01003534; Positions 69556 70048. XX SQ Sequence 493 BP; 163 A; 78 C; 111 G; 141 T; 0 other; tgttgccgct gaaattgccg atgtgcttac gaactcgaag atcttaacgt agggttaggc 60 gatggtgagc aacggtctat atctaggtga gcgatagcga ataagaagag ttaacgcaca 120 agaatagtag gcgagtaatt tgatattctg tttaatctca gttaaagccg acttattctg 180 agatgtccaa tcaggatagt gcatttgaat tgtagaaaag tcagtgaaaa gaattcatga 240 gtaagttggt gggcgtaaac tgcaggagga aaaattaaga attatttgtt acagtttcgc 300 aacgcaccaa attgctgctc taagttggga atgaagttgg gttaggaaac ctattttgtg 360 agtaaaatta tatctagttt aaccaaatta ctaatattta ataaacagct tttagcaatt 420 aagatcacta aaatgaacgg tgtttcttgt tcgctgaaaa gacggcaaat ccagccctcc 480 cccaacgtca aca 493 // ID BEL-7_AA-LTR repbase; DNA; INV; 584 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_AA_; KW BEL-7_AA-I; BEL-7_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-584 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 864-864 (2011). XX DR [2] (Consensus) XX SQ Sequence 584 BP; 201 A; 88 C; 104 G; 188 T; 3 other; tgagcgcacc tcctactcag cgtaccggtt tagattgacg tattacagac agatcgagat 60 tagctgtcat attctgtgga caaaagaagg aagawagcca acaatacgga atttatccac 120 gccgatacga gtgatgaaag ctaaatttat tatttaaagg gctaatagct attgaatttg 180 ttaattactt aatactaaag ttgttattmt tgctaaggta tgaatttata tatccgtgta 240 attgaacatg cagctaaatt gacctattta cagaaacttc ctctaggtta gattgaattg 300 tagctgakaa aatgaattag tttctgcatt gaattcgatt tattacggta atttgagatg 360 ataattgttt gtacttcata tgataacgtg aatttgataa ccctagatat tcctgaaagt 420 cgcctttgat caaccaattg ccggattaga agctagaaga atttacaaaa cgtaagttaa 480 cctttgagca gctaaattcg taaatattga aatttaatca gaaatcatga ttttaggaaa 540 ctttaacgct ggtaataata aaacctgttc aaaacttcgg acca 584 // ID hAT-N7_BF repbase; DNA; INV; 408 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N7_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N7_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-408 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-408 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 917-917 (2008). XX DR [2] (Consensus) XX SQ Sequence 408 BP; 140 A; 81 C; 68 G; 119 T; 0 other; tagggctgtg tacctaaata cccgtacctg tatcgtacct aaaaaattgt ttaggtacag 60 gtccggacct aaacatctgt gtacctgtac ctgtacctaa acaaactaaa tcactattaa 120 acactacttt ttaagactga tggacaagta aatcccgaat caacaatgac aatggtgata 180 atcttgcagt tgaaacttaa gaaaacttgc agaagaaatt gttttgagac tcaaatatgt 240 aattccaaat acaaagatga caatttatca ctagaaaaga cagttagcta catgtttaac 300 ttacatatac ttggttaaac ttttttaggt acaggtgcag gtccggacct gtacctaaac 360 ccgtgtacct gtacctgtac ctgaaatttg tttaggtaca cagcccta 408 // ID Gypsy-28_DWil-LTR repbase; DNA; INV; 289 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_DWil_; KW Gypsy-28_DWil-I; Gypsy-28_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 1781779 1781491. XX SQ Sequence 289 BP; 91 A; 43 C; 64 G; 91 T; 0 other; tgtaagatgt gtgctgggtt gctggtcgtt ttttgtagcc ctggcgaata agagaagagg 60 tagaaaagtt ggagagagtc gagaagtagc atcgtcggtg gaaagttgtc cgcgagtgaa 120 gccaaaagtc gtcggcaaaa taagtagaag aatcaagtcg ctttaaaacg ttgtatttaa 180 ttaagcctaa tttctgaacc ctattgaact aatttgtttt tgtaaactga tatagcatta 240 aaactctcta tttgttttta taccaataaa tccattacat tccattaca 289 // ID Crack-7_CQ repbase; DNA; INV; 4035 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4035 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 38-38 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 7..972 FT /product="Crack-7_CQ_1p" FT /translation="MATETAKVCVKCKKNINARHGKPLQCDGKCRQVWHKA FT CTSVSDEXYPVPDKFWFCAPCKETRNRNRRSTINMADRTPSTPSTAKAGAA FT SSTTEAAIVRIESKLDQLIGWQKDVITEIRGIQNTLEELRTTTETLMDEQQ FT QLRSDNFELRRQLELNEIEIDQLRQEKLXKTLEISNVPVAEDEDLFDKVTQ FT ICHGIGVVLDISEVKEIFRAPTYQSQKSQIPPPIQLKLSSKKKRDELLAKK FT REKKELTTAILDKGTDKPVNIYISEVLTKRNRYLFKLARDLRRSDRIKFAW FT FRDGRLLVRKTEKAKVKGIRSVAALDEFTK" FT CDS 959..3829 FT /product="Crack-7_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MSSQNNPNLQLPFQPSTKTINTIKQLKFNSENSLKLF FT YFNARSLNDKLTELEFLLDEANCRMDALMITETWSRPETEQSMCLSNYQCF FT FASRSTRRGGGSAIFIHNDIHCKPIHNYCDEHNSFIAVEVGDREKTVLVCV FT YRPPEPLSSSLDCFFQHLDTFLAAQACRTTIVTGDFNLDLLTTTTAVQRYT FT NIVLSNGFIFCGHRPTRFEACLDHILTNNTELQVTVQQLQYNLFDHDAMFI FT EATRTIDCISSGRVSFTKTDVNKLRLYLQQHPPNTSMQFSVEENYNVLLSQ FT LRSGIEASSTQVRSRGTKQTHSKPWFDNELKQCIRTKNYWYGKHRQNIADH FT VTRDTYHRWANTVTSLKRTKRKHYYGAKFERQQSSITGTWDVVREVLGSRR FT KSGKGISSRFSSTDEKQRCVEDANTYFAAAGKHLAENIPYTAMQPINPQTD FT QLLVLEPVPRTIVLQTIASLPASKSTGYDGCQPNFFKACGEILADSVADVV FT NTSIRQSLVPNTLKLSKVVPIPKTPAARNVSEFRPINIPSVTDKVLQKIVN FT KQLTEHLERNHLLSPRQYGFRPKSNTQSALFDAVVEIQRNCDRKLKVAAVF FT LDLSKAFDTCEKRILLRSLSELGVNGQSLRWFESFLGERQQYVCDNGLNSE FT PLFVDYGVVQGSIVGPTLFNCYVNNLKDLPLHGTLFMYADDIVLVYAASTY FT EELQSRMNEDLCHLCRWMNQHKLTVNIPKTKYMLFNAPSWTRLDVVYNGEC FT IDYVDAFKYLGVWLDSGLKWTVHIEKLNKTLAQVAGVFKRVSSVLPTQTKR FT MLYFSLFYSHLIYGIAVWGTAGSTALNLLQTTQNKAIKNLFGYHYRTSTAL FT IHSNNRFLSVSSTYTAVACCHVHKIQRNCIHTNTVLSRGEDRHQHYTRRRG FT DFTASRINTTTFGQNSALHRATLMYNALQDDMKTLHPARFKKVLQNQLLEE FT QF" XX SQ Sequence 4035 BP; 1210 A; 1037 C; 909 G; 874 T; 5 other; ttagcgatgg cgaccgaaac cgctaaagtc tgcgttaagt gcaagaaaaa tatcaacgca 60 cggcatggaa agccgctgca gtgcgacgga aagtgcagac aggtgtggca caaggcgtgt 120 acaagtgtct ccgatgaasa atatcccgtt cctgacaaat tctggttttg cgctccgtgc 180 aaggagaccc gaaaccgcaa cagaagaagc acgataaaca tggccgaccg aacaccgtct 240 acaccatcca cggcaaaggc tggagctgca tcatccacca ctgaagctgc gatcgtccgg 300 atcgaatcca aactggacca gctcatcggc tggcagaagg atgtcataac ggagatccgc 360 ggcatacaaa acacgctaga agaattgcgg acaaccacgg aaacgctgat ggatgagcaa 420 cagcagcttc gttccgacaa cttcgagttg cgaagacagc tggagttgaa cgaaatcgaa 480 attgaccaac tgaggcagga aaagctgsgc aagacgttgg agatttcgaa cgtgcctgtt 540 gccgaagatg aagacctctt tgacaaagtc actcagattt gtcatgggat cggagtcgta 600 ctggatatca gcgaggtgaa ggaaatcttc agggctccca cgtaccagtc ccagaagtcg 660 caaattcckc cacccatcca gctgaagctt agcagtaaga agaaaagaga cgaactactg 720 gcgaaaaaga gagaaaagaa ggagttaact acggcaatcc tggacaaggg aaccgataaa 780 ccggtcaaca tttacatcag cgaagtgctg accaagcgga atcggtacct gttcaagctg 840 gcgagagatt tgagacgaag cgacagaatt aagtttgctt ggttcagaga tggcaggttg 900 ttggttagaa aaaccgagaa agccaaggtg aaggggatca gatccgtagc cgcactagat 960 gagttcacaa aataatccta acctccaact gcccttccaa ccaagcacaa aaacaatcaa 1020 caccatcaaa caactcaagt tcaacagcga aaacagtcta aaactgttct atttcaacgc 1080 gagaagtctc aacgacaaac tcaccgagtt ggagttcctg ctcgacgaag ccaactgtcg 1140 gatggatgcc cttatgatca ccgaaacctg gtcacgaccc gagacggagc aatctatgtg 1200 tctcagcaac tatcaatgct tcttcgcttc gagatcaacg agacgtggcg gtggttcagc 1260 aatcttcatt cacaacgata tacactgcaa accgatacac aactactgcg acgaacacaa 1320 cagtttcatt gctgttgaag ttggtgaccg tgaaaaaact gtcctggttt gtgtgtatcg 1380 cccacctgaa ccgctgtcct cctcgctgga ttgctttttt caacatctgg acacctttct 1440 cgcggcgcaa gcgtgccgaa caacaatcgt caccggggac ttcaacctgg atctgctcac 1500 aacaacaact gctgtccaaa ggtacaccaa catcgtgctg tcaaatgggt ttattttctg 1560 cggtcaccgt ccaactcgtt tcgaagcctg tctggaccac attctaacga acaacaccga 1620 gctacaagta actgttcaac agttgcagta caacttgttt gaccacgatg cgatgtttat 1680 tgaagcaacc agaacgatcg actgtatatc ttcgggtagg gtaagcttca ccaagaccga 1740 tgtcaacaaa cttcgtctgt acctacagca gcaccctccc aacacaagca tgcagttctc 1800 ggtagaagaa aactacaacg ttctactgtc gcagctacgc tcggggatag aagcttcctc 1860 aacacaagtc cgttcacggg gaaccaaaca aacacactca aagccctggt ttgacaacga 1920 gttgaaacaa tgcattcgaa ccaagaacta ctggtatggg aaacatcgcc agaacatcgc 1980 tgaccacgtc acccgagaca cataccaccg ctgggccaac acagtcacga gtttgaaacg 2040 tacgaaaaga aagcattatt atggtgctaa attcgaacgt caacaaagca gcataactgg 2100 cacctgggac gtcgtccgag aagttttggg ctccagacgg aaaagcggca aggggatctc 2160 cagccggttt agcagtacgg acgaaaagca gcgttgcgtg gaagatgcaa acacctattt 2220 cgctgcagcg ggtaagcatc tcgctgaaaa cattccatac acagccatgc aacccattaa 2280 cccccagaca gaccaattgt tggtcctgga accagttcca cgcacgatcg ttctgcaaac 2340 gatagccagc ctaccagcct caaaatcgac cgggtatgac ggctgccaac cgaacttctt 2400 taaagcttgc ggggaaatcc tcgctgacag tgttgctgac gtcgtgaaca cctcgattcg 2460 gcagtcgctc gtccccaaca ctcttaagct gtccaaggta gttccgatcc cgaaaacacc 2520 agctgcaaga aatgtctctg aattccgccc tatcaacatt ccaagcgtga ccgacaaagt 2580 tttgcagaaa atcgtgaaca agcaactcac cgagcacctt gaaagaaacc acctgctatc 2640 tcctcgtcag tacggcttta ggccgaaatc caacacacaa tcggctttat ttgatgcggt 2700 ggtcgaaata cagaggaact gcgatcgaaa actcaaggtg gctgcggtgt tcctcgatct 2760 gtccaaagct ttcgacacct gtgagaagcg gattcttctg cgaagtctca gcgaacttgg 2820 tgtgaatgga cagtccctgc gatggtttga aagtttcctg ggggaacgtc aacaatacgt 2880 ttgcgacaac ggtctaaaca gcgaaccact atttgttgac tacggtgtag tccaggggag 2940 cattgtcgga ccgacgctct tcaattgtta tgtcaacaac ctcaaggatc ttccgctaca 3000 tggaactctt ttcatgtatg ctgacgacat cgtgctcgtg tatgctgcct caacatacga 3060 agagctgcaa agcagaatga acgaggacct ctgtcatttg tgtcgctgga tgaatcaaca 3120 taagttgact gtgaacattc caaaaaccaa gtacatgctg ttcaacgccc caagctggac 3180 cagactcgat gttgtgtaca acggcgagtg catcgactat gtcgacgcgt tcaagtacct 3240 aggagtatgg ctggacagcg gtttgaagtg gaccgtgcac atcgagaagc tgaataaaac 3300 actggcacag gtagcgggtg tcttcaagag ggtctcctcc gttctaccaa cccagacgaa 3360 acggatgctc tacttttcct tgttctatag tcatcttatc tacggcatag cagtttgggg 3420 aacagctggw agcactgcac tcaacctgct gcaaaccacg cagaacaagg ccatcaagaa 3480 cctgtttggc taccactacc gcacatcgac agctctaata cactccaaca accggttctt 3540 aagcgtgagt agcacttaca ccgctgttgc ttgctgccac gtgcacaaaa tccagcgaaa 3600 ttgcatccac accaacactg tcctgagcag aggagaggac cgccatcaac actacaccag 3660 aagaagggga gacttcacag ctagcagaat caacaccacg acgtttggcc aaaattctgc 3720 gcttcacagg gcgaccttga tgtacaacgc actccaagac gacatgaaaa cgctacatcc 3780 tgctcgtttc aagaaagttc tacaaaacca actccttgag gaacagtttt agattgttaa 3840 cacwttaaat cttttttttc gtctctattt tcgttttgaa attgtataaa ttagttgttc 3900 ttttttttgc actctatttg taactgtaaa tcttactaaa atgtactgta cggtaaaact 3960 agtctgccac taagagccaa aggctcactg gcaaataaaa cgattaaact actaaaaaaa 4020 aaaaaaaaaa aaaaa 4035 // ID CR1-3_NVi repbase; DNA; INV; 4278 BP. XX AC . XX DT 16-APR-2009 (Rel. 14.04, Created) DT 16-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4278 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(4), 750-750 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 165..1358 FT /product="CR1-3_NVi_1p" FT /translation="MSLNGSFLSAGSLDGRVMPIDDSLVTKRMPRLSVCVG FT CEYENCDCFGQGDINCVKCNRRFHEKCAGLSGCVKCSDDLDKVVCNLCLGP FT STSVSSAHLSIGGSRVPASVPVSVAVGTSESAAAFPAPAQTIVSDDKFEEL FT KSLLQGNYASIRESLTGIDDRFARLDSQLEQDRTERRAIRAEVAKLDARLR FT ALEERPAAVLAGPVVVSGDGSAPLPDGRXVQALASLDSASEVQDQLYRSRN FT LLLYDVPAGGSDSDLATVKEILGKISSLDLDKISVRRFARPTSRGTFPPLV FT VRFSTSFEVIRMITYRSLLPSGISVAADLTPAQRARRRQLLEEASKHNRDH FT PDKPKTVKFVRGSLALIDAKQKTGVFSCQQKPKLIVVLRIKPSSPSTTKML FT THW*" FT CDS 1368..4031 FT /product="CR1-3_NVi_2p" FT /translation="PIFAPIFFNLLIYRTDIIVFTETWLQPALLSAELGLV FT GYRIYRRDRDLQAVGVSRGGGVLVAVRDHINSSELFVESSLEQIFIRISLP FT DTTMILGAVYLQPNSDTTRYQEYVDGLETVAEANADCQLVMVGDYNLPSVT FT WRSNPLQFAQTGYVDPEHRMNAELICGCFSSLGLSQLIPPHPSKGYSLDLL FT FASPGFIDLFELNEQLVPTDGHHVPSVFNANICCDSDFVPVSKRNFFAADY FT ESIVNHLGEVDWDSILSCDCLEDILCGFYNVVDTCISKQVPLSRSNPSTYP FT VWYDRELICTIRDKKLAHKAWKLTGETSDEIEFKRLRAVCIRMSRSRYRDY FT VRSVETGLRSNINAFWSFVKGLKGEAGIPVTTYLDDLKAESESETVNLFSS FT HFSSVYDCNPASPLNHLLESRELISNVRVLAEDLRPFVRELKCTANPGPDN FT VPNVFVKRCWPALERPIVDIFNKMLANGYFPKAWKRSYIFPVFKHGDRHDV FT KNYRPISLISCIPKLFDAFLTDVLSQRLLCKITDLQHGFLPGRSTLTNLLI FT FNDFVSDALENSAQVDSIYIDFSKAFDRVDHGRLLAKLWNIGVRGSIFNLL FT SSYLSNRYQAVRINNSVSPAVLVSSGVPQGSHLGPLLFCIYIDDLVKQFRF FT ANALLYANDVKIFMRIHSEEDVERLRADLNALSGWACENGLVINAAKSQVV FT SFYRGRTQLQFQYELDGSAIARTEVVRDLGVLFDSRLTFAPHVDYVVSRCR FT GLIGFVKRTTQDFSNISAIIYLYNTLVMPTFIYCCQIWSPYTQIAINHLDS FT VQHKFIRYLAYKSGRPMAPIDHDYSLIASSLGIPSIHSVHRYHDGLLTFKL FT LRKFMICPSICVAIDLYTSIWRIRTLGSFRPYLD*" XX SQ Sequence 4278 BP; 945 A; 948 C; 1037 G; 1347 T; 1 other; ctctcttacg cctcctgtgt cttcccaggg tgtttttagt ggcgttactg cgcactgaag 60 agctctggct gcctgcttgt tgttttgtga agcgtcgtgg cgtgtagtgc tctgtctaag 120 acttgtgctt atttgtctgg tgtatatttc agtgttagac agctatgtcc ctgaacggta 180 gctttttatc cgcgggctcc ttagacggca gagtcatgcc gatagacgat tctttggtta 240 caaagcgtat gcctagattg agcgtatgcg tcggttgcga atatgaaaat tgtgattgtt 300 ttggacaagg tgacattaac tgtgtaaaat gtaaccgtcg cttccatgag aaatgtgctg 360 gcttgtcggg ttgtgttaag tgtagcgatg accttgacaa ggtggtctgt aacctctgtc 420 tcggaccttc gacctccgtt tcctctgccc atctgagcat cgggggatca cgtgtgcctg 480 cttcggtccc agtgtcggtt gccgtcggaa catctgagtc agcagcggcg ttccctgccc 540 ccgcccaaac catagtttcg gatgataaat ttgaggagct gaagagcttg ctgcagggta 600 actacgccag catcagggag tccctgactg gtattgatga caggtttgcg cgccttgact 660 ctcagctgga gcaggacagg acggagaggc gtgctattcg tgcggaagtc gctaagctgg 720 atgcccgttt gagggccctt gaggagcgcc ctgctgctgt tctggctggt ccggttgtcg 780 tgtctggaga tggttctgca ccactgcccg atggccgagy tgtccaggct cttgcctctc 840 ttgactctgc gtcggaagtc caggaccagc tgtaccgctc gcgcaacctc cttctgtacg 900 acgttccagc tggcgggtca gactctgatt tggcgactgt caaggagatc ctgggcaaga 960 tttccagtct ggatcttgat aaaatcagcg tcaggaggtt tgccaggccg acttctcgag 1020 gaactttccc tccattagtt gtaagattca gcaccagttt tgaggtgatc aggatgatca 1080 catacaggag tcttcttcca tctgggattt ctgttgccgc agaccttaca ccagcccagc 1140 gcgcacgtcg tcgtcagctg cttgaggagg cgagcaagca taatcgtgac catcctgaca 1200 aacccaagac tgtcaaattt gttcggggca gtttggctct catcgacgcc aagcaaaaga 1260 caggggtttt ttcttgccaa cagaaaccaa agctaattgt tgtgctgcgg ataaagccaa 1320 gctcaccatc tactaccaaa atgctcactc actggtaaat aagctaacct atcttcgctc 1380 caatcttctt caacttgctt atttaccgga cggatatcat cgttttcaca gagacttggc 1440 tgcaacccgc tttattatct gccgaactcg gactggttgg ctatcggatt tatagaaggg 1500 atcgtgacct tcaggctgtt ggggtctcaa gaggtggtgg cgtacttgtt gcagtaaggg 1560 atcatatcaa ctcctctgaa ttatttgtcg aatcctcact cgaacaaatt ttcatcagaa 1620 tttcccttcc agacaccacc atgatcctgg gtgcggtcta tcttcagcca aattctgata 1680 cgacacgata ccaggaatat gttgatgggt tggagacggt ggctgaggcc aatgctgatt 1740 gtcagcttgt tatggtggga gattacaatc ttccttccgt tacttggcgc tctaatccac 1800 tacagtttgc tcagactggc tatgtggatc cggagcatcg tatgaacgct gaactcatct 1860 gcggctgctt ttcatctttg ggactgtccc aacttattcc accacatcca tcgaagggct 1920 actcgttgga cctgttgttt gcatctcctg gattcatcga tctctttgaa ctgaatgaac 1980 agcttgtgcc taccgatggt catcacgttc cgagtgtttt taatgcgaat atttgttgtg 2040 atagtgattt tgtacctgtg tcaaagcgta atttttttgc tgccgattat gaatcgattg 2100 taaaccatct tggtgaagtg gactgggatt cgatcctttc ttgcgactgt ctcgaggaca 2160 ttttgtgtgg cttttataat gttgtggaca cctgtattag caaacaggtt cctttgtcga 2220 gatcgaatcc tagtacttat cctgtatggt atgatcggga gcttatatgt actattagag 2280 ataaaaagtt ggctcataaa gcgtggaaac tcactggcga aaccagtgat gaaattgagt 2340 tcaagaggct ccgagccgtt tgtatccgga tgtcgagatc tcggtaccgg gattatgttc 2400 gatcggtgga aaccggcctt aggtccaaca ttaacgcgtt ttggtctttt gttaagggtt 2460 tgaagggcga ggctggtatc ccagttacta cgtatcttga cgatttaaag gctgaatcgg 2520 agtctgaaac ggttaacttg ttttcttccc atttcagttc agtatatgac tgtaatccag 2580 ctagcccttt gaatcatctt ctagaatcgc gtgaacttat ttccaatgta agagttttgg 2640 cggaggatct gcgtcctttt gtccgtgagt tgaaatgcac agcgaaccct ggtccggaca 2700 atgtacctaa tgtctttgtt aagcgctgct ggccagcttt ggagaggcct atagttgata 2760 tttttaacaa gatgttggct aacggttatt ttcctaaggc gtggaagcgt tcttatattt 2820 ttccggtgtt taaacacggc gatagacatg atgtcaaaaa ttataggcct atttctctta 2880 tcagctgcat tccgaagctt ttcgacgcgt ttttgaccga cgtattgtcc cagcgactgc 2940 tttgtaagat cactgatctg cagcacggct ttctgcctgg tcgatcgact ttaactaacc 3000 tcttgatttt taatgacttt gtctctgatg cccttgagaa ctcggctcaa gtggactcca 3060 tctatattga tttctctaag gctttcgata gagttgatca tggcaggttg ctggctaagc 3120 tgtggaacat tggtgttcgt ggctccatat ttaatctatt gtcgtcatac ctttcgaatc 3180 gctatcaggc cgttagaata aataatagcg tgtctccagc tgtcttggtg tcgtctggcg 3240 tacctcaggg ctctcacctg ggtcctttat tattttgcat ctatattgat gatctcgtga 3300 agcaatttcg ctttgcgaat gccctgttgt acgcgaatga tgtcaagatt tttatgcgta 3360 ttcactctga ggaggatgtg gagaggctta gggctgatct taatgctttg agcggttggg 3420 cgtgcgaaaa tggcctggtc ataaacgcgg ccaagagcca ggtagtgtca ttttataggg 3480 gcaggacaca actacagttc cagtatgagc ttgacggatc ggctattgct cgcacggagg 3540 tggttaggga cttgggggtt ttatttgact cgcgcttgac cttcgcacct catgtggatt 3600 acgtggtctc taggtgtcgt gggctgatcg gatttgtgaa gcgaactact caagactttt 3660 ccaacatatc tgctattatt tatctttaca atactctggt catgcctact ttcatttatt 3720 gttgtcaaat ttggtcaccc tatactcaaa ttgccatcaa ccatttagat tccgttcagc 3780 ataaatttat tcgttacctg gcttacaagt ccggccggcc catggcgcct attgaccacg 3840 attattcact catagcgtct tcgttgggta ttccttcgat acactcagtg catcggtatc 3900 atgatggtct gcttacgttt aaacttttgc gtaaatttat gatttgtccg tcgatctgcg 3960 tagccataga cctatacacg agcatctggc gcattcgaac tttgggttct tttcgacctt 4020 acctagatta aagcgtgctt ggaattctct ccctcaacgt atcactgtca tcgagcagtt 4080 gtctgaattc aaagtgaacc tcaaatcatt tgtgtcggca tttggttagg aaacttggtt 4140 cataatattt ttgttgtttt gtatattgtt tgttattttt acaattgtca tgtacctgaa 4200 ttttatattt ttgtaagggc acttatgccc gttaatttat ataaataaat aaataaaaat 4260 ggctaaacaa gcaaaaat 4278 // ID Jockey-8_CQ repbase; DNA; INV; 4305 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4305 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 119-119 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 44..1384 FT /product="Jockey-8_CQ_1p" FT /translation="MGKKKPKPPPDDVPGVKKSTQHTFDRRDYGPKLKTNP FT TLDTEVKKLAAAGHGYSLRRRASYSDLTSTRNRNTWAGASTQPIPLANGFD FT TLQSMDDDDFSSISGDSIPGVVDIRVKKTTKARCPPITVRNMSAIEINKLI FT SRLGGGGGNYSIRNFEKAVQIKVKCADLFQKITAELVQLNAEFFTHAKPED FT ALVKIVLSGLPVFEVDDLIEELEKNDIFPREVKVLSKSEGGNRALYLLNFL FT KGSVKLTQLREVKTIYNVVVWWRFYTRNKTDVMQCFRCQGFGHGSRYCNMT FT PRCVKCGQKHGSNECQLPTKAELEKAPDDVRQKIRCANCNLNHTANFKECT FT ARKDYLKHQEKKKPRKASPQAEKASPSSRKFTSNVVAPGLSFANIAAGPGT FT GSPPGAAPNDDLFSITEFLGLAREMYARLSKCTTREEQFFALHELMVKYIY FT VH" FT CDS 1368..4052 FT /product="Jockey-8_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="SIFTFTSLTKLKVLNWNGRSVPKKALELSEFIVSQKI FT DVAVLTETWLQGNVSFFIDGYSVVRLDRPTDAAERGGGVALLVKRGIGFKP FT MDNLRTKVIEAVGVRVQTEGDPINIIAAYFPGGRSKEDRVNFKRDIGTLTR FT QSGAYFIVGDLNSRHRMWNCTRANQTGNILMNLFRTSDFFIHAPTTPTYVP FT RGRARPSVIDLVLSNNRVNMSVPKVHQELSSDHLPVTFEIDCIIQAETQAK FT RVRCYDRADWVRFQRDVGGKLDXTVESLNNIQTTAEIDASVRRFTSAVLEA FT EDVAVPWAEYNPQKVVLPDGLRLLITLRNTRRRQFIRSRDPVLGLIVDTLN FT QRIQKECSKLKYKNFGDTVRDIANGHKKFWKISKLVRNKVKHNPPFRIDDG FT LVISPAEKARVLAESFAKAHDNTLPGDPSVNDEVARSMSVVAEANESNDDW FT STYTKPSEIKAIFRKLKNRKAPGQDGLRNITLKHLPRKGLIYLTKIYNACL FT KLSYFPAAWKHALVVAVPKPNKDITQPGNYRPISLLSTLSKVLERIVLARM FT NXHLDNNRIVPDEQFGFRRAHSTNHQLVRIAQAVKKGFDSKKSTGMLMLDV FT EKAYDTVWQEAIVHKLVVANFPLYLVKMLHSFLKGRSFQVNVNGALSAIHG FT IPCGVPQGSVLSPTLYNIFTADVLMIDGVSYAFFADDTGFFATDNDPKIVT FT IKLQAAQNRLEEFQQKWRIKINPSKTQAIFFTRRRAPKYLPSSKVKVCGLE FT VDWSKEAKYLGFVLDTGLRYDKHIDQTLKKCKNLTKALYSLVNRRSRLQLH FT NKLLLYKCVFRAVLTYGSPTWKTCAATHRKKLQRMQNKLLKMIYNLDPWHP FT TDDLHQLAELETIDQFIDRLFQKFRTSCQMSDNPLIEAILPL" XX SQ Sequence 4305 BP; 1171 A; 1125 C; 1074 G; 933 T; 2 other; ccaccgtaaa tcgcgggtcg aaatctcgcg cggtttccct gccatgggaa aaaagaagcc 60 aaagccccca ccggatgatg tcccgggtgt gaagaagtca actcagcata cgttcgatcg 120 gcgggactac ggtccgaaac tgaaaactaa ccccactttg gacacagaag taaaaaaact 180 cgcagcagca ggtcacggct attcactccg gcggcgagca tcttattcgg atctcaccag 240 cacgagaaat cgaaacacct gggcgggtgc ttccacccag ccaatcccac ttgccaacgg 300 gtttgacact ttgcaatcga tggacgacga cgatttcagc agcatcagcg gcgacagcat 360 cccgggtgtt gttgacatcc gcgtcaagaa aacgacgaag gctcgctgcc ccccaataac 420 agtgcgaaac atgtcggcga ttgaaatcaa caaattgatt tcacgtctcg gaggtggcgg 480 cggcaactac agcatacgaa actttgagaa ggcggtccag atcaaggtga agtgcgcgga 540 ccttttccag aaaattaccg ctgagctggt gcagttgaac gcggaattct tcacccatgc 600 caagccggag gacgccctag tgaagatcgt cctttcgggt ctccccgtat tcgaggttga 660 tgatctgatc gaagaactgg aaaagaacga catcttcccg cgggaagtca aagtgctgtc 720 gaaatcggaa ggtggaaatc gtgcacttta ccttctcaac ttcttgaagg gttcggtgaa 780 gctgacgcag cttcgcgagg tgaaaaccat ctacaacgtc gtggtctggt ggcgttttta 840 cactcggaac aaaactgacg tgatgcaatg tttccgctgc caaggatttg gtcatggcag 900 ccggtattgc aacatgacgc ctcgctgtgt caagtgtgga cagaagcacg gatcgaacga 960 atgtcagctt ccaacgaagg ccgagctgga gaaagccccg gatgacgtcc gccagaagat 1020 aagatgtgcc aactgcaacc tcaaccacac ggccaacttc aaggaatgca ctgctcgcaa 1080 ggactacctc aagcaccagg agaagaaaaa acctcgcaaa gcctcaccgc aggccgagaa 1140 agcgtcgccc tcgtcacgga agttcacctc aaacgtggtg gcacccggcc tatccttcgc 1200 aaacattgct gctggtccgg gcactggctc cccacccggt gcagctccca atgacgatct 1260 gttttcaatt accgagttct tgggcttggc gagagaaatg tacgctcgac tcagtaagtg 1320 tacgactagg gaggaacagt tcttcgctct tcacgagctc atggtgaagt atatttacgt 1380 tcactagttt aactaaatta aaagtactaa actggaacgg acgatccgtt ccgaagaagg 1440 cacttgaact ctccgagttt attgtgtcgc agaaaattga cgtggccgtt ttaactgaga 1500 cgtggctcca gggaaacgtg tcatttttca tcgatggtta ttcagttgtt cgactggatc 1560 gaccaacaga tgctgctgaa agagggggcg gggttgctct cctggtcaaa cgaggaatcg 1620 gtttcaagcc gatggacaat ctacggacga aggtgattga agcggttggc gttcgtgttc 1680 aaacggaggg cgatccaatc aacataattg ctgcttactt ccctggagga cgcagcaaag 1740 aagatcgagt caacttcaag cgtgacatcg gaaccctgac gaggcaaagt ggagcatact 1800 tcatcgtggg agatctcaac tctcgccaca ggatgtggaa ttgtacgcgg gcaaaccaga 1860 ctggtaacat tttgatgaat ctatttcgta cttctgattt cttcattcat gctcccacta 1920 cgccgacgta cgtcccgcgt ggtcgcgcac gaccctcggt cattgatctg gttttgtcta 1980 acaaccgggt aaacatgtca gttcctaaag tacatcaaga gctatcatca gatcacctgc 2040 ccgtgacgtt tgagatcgac tgcatcattc aggcggagac acaagctaaa cgagttcgct 2100 gctacgaccg agcggactgg gtccggttcc agcgggacgt tggcggaaag ctggacgwaa 2160 ccgtagagtc gctgaacaac atccagacaa cagccgagat cgacgcttct gttcgccgat 2220 tcacgtctgc agtactggaa gctgaggacg ttgcagtccc ctgggcggag tacaatccgc 2280 agaaagtggt tctaccagat ggattgcggc tgctgattac actgcgaaac acccgtcgac 2340 ggcagttcat ccgttcgcga gatcctgtgc tgggcctgat cgtcgacacg ctgaaccaac 2400 gaatccaaaa agagtgcagc aaactgaagt acaaaaactt cggcgatacg gtccgcgata 2460 tcgccaatgg ccacaaaaag ttctggaaaa tttccaagct cgtgcggaat aaagttaaac 2520 acaatcctcc tttccgtatc gacgatggcc tggtcatctc ccccgcggag aaagccagag 2580 tgctcgctga aagtttcgcg aaggcacacg acaacacctt gccgggtgac ccttcagtga 2640 acgacgaggt agcgcgatcg atgtcggtgg tggcagaggc gaacgaaagt aacgacgact 2700 ggtccacata cacgaagcca agcgagatca aagctatctt ccggaagctg aaaaacagga 2760 aagccccggg tcaagacggt ctgcggaaca ttacgctgaa gcaccttccg cgaaaaggat 2820 taatctacct cacaaaaatc tacaacgcct gcctgaagct gtcctacttt cctgctgcgt 2880 ggaaacacgc gttggtggta gcggttccaa agccgaataa ggacatcacg caacccggaa 2940 actatcgccc cataagcctg ctgagtacgc tcagtaaggt cctggaacgg atagtactcg 3000 cacgaatgaa cckacacttg gacaacaacc gcatcgtccc cgacgagcaa ttcggattca 3060 ggcgagcgca ttcgacgaat caccagctcg tgaggattgc gcaagcagtc aagaaaggat 3120 ttgactcgaa gaaatcgacg ggcatgctga tgctggacgt agagaaagcc tatgacacgg 3180 tgtggcagga agcaattgtc cacaagctgg tagtagccaa cttcccactg taccttgtca 3240 aaatgttgca ctcctttctg aagggcagaa gttttcaagt gaacgtcaac ggtgcactgt 3300 cggccatcca cggaataccg tgtggcgtgc cccaaggttc ggttctaagc ccaacgctgt 3360 acaacatctt cactgctgat gtactgatga ttgacggagt gtcgtatgcg ttctttgctg 3420 acgatactgg cttctttgca acagacaacg atccgaaaat cgtcacaatc aaacttcaag 3480 ctgctcagaa ccgtctggag gagtttcaac aaaagtggag gatcaaaatc aacccttcca 3540 agacgcaagc catcttcttt acgcgccgaa gagcgcccaa gtaccttccg agttccaagg 3600 tcaaagtctg cggcttggag gtggactggt ccaaagaggc caagtatcta ggctttgtct 3660 tagacaccgg gcttcgctac gacaagcaca tcgaccagac gctcaaaaag tgcaaaaacc 3720 tgacgaaggc tctctactcg ctggtgaacc gacgctcccg attacaactc cacaacaagc 3780 tgttgctgta caagtgcgtt ttccgtgcag tcttgacgta cggatctcca acgtggaaaa 3840 cctgcgcagc tacccaccgc aaaaagcttc aaaggatgca gaacaaattg ctgaagatga 3900 tctacaacct cgatccgtgg caccccaccg atgatctcca ccagcttgcc gaactggaga 3960 cgatcgacca gtttatcgat cgcttgttcc agaagttccg gaccagctgc cagatgtcgg 4020 acaacccact gattgaagcc atccttcccc tttaatgaag cggaactcaa ccccaccact 4080 cgtctgtgat attagtttta agaaacccca agtttcgatc tcactaggtt ttttggccaa 4140 attttcccta gtttttgttt aattgcattt gaagaacctt ctgttaccaa acggttattg 4200 aaaaggaaat agttgcaagg aacaccaaat ctctatgaac cagttattca cacaaattgt 4260 aaaattgtta tttcggtaaa taaaatgaat tgaattgaat tgaat 4305 // ID TTAA28B_AP repbase; DNA; INV; 318 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA28B_AP. XX NM TTAA28B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-318 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2097-2097 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 318 BP; 95 A; 54 C; 52 G; 116 T; 1 other; ccctttgacg agtatggacg tatatatacg ttttgttgaa acgcgcctcg gggagtatgg 60 acgtatattt acgttttaca caaaaacctt attttatcta taatgttatg tccgtattaa 120 attgacgata atttatgaat cgtgtgctgc tatctactgg ctaggatagg acgtatattt 180 acgtctttac taaaataccg atatcgtatt taagacgtaa atatacgtct ttactttaaa 240 taattttatc caagttngtc attcctaatt cactctctgg gtgtaaacta tataaatcac 300 aattcactct tcaaaggg 318 // ID Dmedtr1cons repbase; DNA; INV; 491 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dmedtr1cons. XX OS Drosophila mediostriata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup III. XX RN [1] RP 1-491 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with show less than eight percent divergence. CC Dmedtr1cons. XX SQ Sequence 491 BP; 121 A; 121 C; 139 G; 110 T; 0 other; tttgggtgcc gcacgagttg acggaaaaaa acatttttgc ccgtatggat gcatgcgaat 60 cgcttctgaa tcgcaacaaa atcgacccgt ttttgaagcg gatggtgact ggcgatgaaa 120 agtggatcac ttacgacaac gtgaagcgca aacggtcgtg gtcgaaaagg ggtgaagctg 180 cctagacggt ggccaagcct ggattgacgg ccaggaaggt tcttctgtgt gtttgctggg 240 attggcaggg aatcatccac tatgagctgc tgccctatgg ccagaccctt aattcggacc 300 tgtactgcca acaactggac cgcttgaatg cagcactcat gcagaagagg ccatctttga 360 tcaacagagg acgaattgtc tttcatcagg acaacgccag gccgcacaca tcttttgtga 420 cgcgccagaa gctcggggag ctcggatggg aggttctttt gcatccaacg tactggcccg 480 acctagcccc a 491 // ID BEL-154_AA-LTR repbase; DNA; INV; 314 BP. XX AC supercont1.336; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-154_AA_; KW BEL-154_AA-I; BEL-154_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-314 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.336; Positions 984605 984918. XX SQ Sequence 314 BP; 92 A; 75 C; 64 G; 83 T; 0 other; tgtctacgac caacaaaacc tacttatccc tcattactct actggtgcaa tggctagcgc 60 aagtcggcag atattatata attaccacaa caataagttt attaatctcg ctaccgtggt 120 cggaaataat aggcttccag cgccagagat attgccatcg tctggatgta ggaccacagc 180 actatcacta tcactcgata gatcgatagt cgatcgatta acggcgataa gagagtgaga 240 tcctcactct tttagcaggc agatctaccc atctattttg agtgaagaaa ccagtcgatt 300 ttgggtcgcc ggca 314 // ID Gypsy-193_AA-LTR repbase; DNA; INV; 277 BP. XX AC supercont1.84; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-193_AA_; KW Gypsy-193_AA-I; Gypsy-193_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-277 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.84; Positions 83598 83874. XX SQ Sequence 277 BP; 97 A; 42 C; 67 G; 71 T; 0 other; tgtagagttg gcaattgaaa gctatattat gttactttac cctattacgc ctgacgactt 60 gacttctcag agagagagat tattagattc gaagagaggg agaaaatgta acgagaatga 120 actgaggaag ggggatacat gaaattcagt acagctagcc agaagttgat caatcattgt 180 cgcgactaga ggcaaataaa ttgtctacgg aaaatataaa agtgttttat aagtgttccg 240 aaaagtgagc ggaaatttcc accggcgaga ctctaca 277 // ID Copia-22_SI-I repbase; DNA; INV; 4195 BP. XX AC AEAQ01023538; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_SI_; KW Copia-22_SI-LTR; Copia-22_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023538; Positions 181 4375. XX CC Positions [1665-2198] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 432..4172 FT /product="Copia-22_SI-I_1p" FT /translation="MCYISSSMEHKQLETLITCETAAEMWAKLSAIHEQKS FT AANKLTLMSRFHELKMAPNDTVVQHVAKIENMASQLRDIGEELSKVTIMSK FT ILSTLPQKFGPLVTAWDSVSEEEQTQSKLIERLIKEEGRLAAMDDATSTTA FT AMTVQQRRGRGTQRGKVDPKSNVDKDSLETKKTFVCHFCKKRGHISRYCYA FT RKNASKDNKNSEKTNVENSADVSAFIVSEAKVIADLLDRDAKRIWLLDSGA FT SKHMTYQREWLHDFHETTNETVSLGDNTLCEVRGYGTVAIKTLINGEWKLG FT KLENVLYVPTLRKNLFSTGACTNKGYSVIFNEKTVEIKDSNILKIRGVQQS FT NNLYRLLIETTTINNANVVEENAFEIWHKRLGHINDKCLRKMIEEKLVTGI FT KPCSYDKCFCENCVLGKQHRLPFKNASKRKSKIGDLIHADVCGPMSERSIG FT GSRYFLILKDDCSGFRKVYFMSHKSDTYDRFKEFIASFKNAFGHDIKSLRA FT DNGTEFTNHMMRDLMKKRGIVFETSAPYTHEQNGTAERDIRTIVECARTML FT IASGLPKCLWAEATNAAVYIINRCIQSQSRDVVPYELWFKRKPDLSHVKIF FT GCVAYAHVPKELNKKWEPKSNKMYLVGYHDESKNYRLFNPLTNKVIVSRDV FT IFNEKARYNKNKEEHDISFDIRLNDSSEKGNDNYEIEHIGDNDEPESIVVG FT TNKSYNLRNRDDLKRPDRYEACIALFDEPTTYKDAINGKNSKEWTIAINEE FT LQAHERNNTWTIVDFPKKEKNIIDYKWVFKQKPIVCEGRETVRFKARLCAK FT GFSQKPGVDFNEIYSPVVRYDSIRLLLAIAAIENLEMRKFDIKTAFINGEV FT EEELYMKIPEGINVSCDKVCRLNKSIYGLKQAARCWNDRFNNFMRKFDFKQ FT SESDRCVYFGKFEKKRIYLALYVDDGLVLADCTVVIDHFFTELKNTFEITE FT SEVNSFVGIEITRNRDKRTIFIHQSSYINRLLKRFNMCEAKSVSVPSDPHV FT ILKIPDCEWEKMDKTPYREAVGGLLFLAMISRPDVSYAVGLVSRFCGKFDL FT NHWNAIKRIFRYLISTENYGILYHGDDTSDIKGFSDADFAGDIETRRSTTG FT YIFRLAGGAITWASKRQATVSLSTTEAEYIAASTAIKECIWVGKILSEIGF FT RCNAKPITLHVDNQSAIRLIKNPEYHSRTKHIDVKFHFIREKYDKRIIDIV FT YVSSQDQFADICTKAIAKEKFEFLRSKIGVVRPLDNK" XX SQ Sequence 4195 BP; 1392 A; 749 C; 968 G; 1086 T; 0 other; gttcgcctcg agcgctgtac cagtccacac gttctaataa atacttgata cacattgtta 60 cttgtgattt ctgcgcctat cccaacaggt tatgggccca ggtcggtgtt tcaagtcgca 120 aggattagtt ttaacgagca acactgaaac agaaaagtct gtcgcgtaaa tttgcgagtg 180 aagtgctcga ctctgtgaat ttttttttct ttgtgtcgta tcaagatggc gaactcaagt 240 acagattcga ttgatcgaat tgatctgaag acggttgtgc gatttgatgg atccaacttt 300 caagtgtgga agttccaaat gcgagccatt tttacggcca ctggtctttt aaaattagtt 360 aatggaacgg agacgacacc aggaccgaca gaggcaggtt tgacgcatgg aacgcacgca 420 acgctagagc gatgtgttac atatcatcgt caatggaaca caagcaactc gagactctga 480 taacatgtga gacagccgct gaaatgtggg caaagctaag tgccatacat gagcagaaaa 540 gcgcggcaaa taagctaacg ttgatgtctc gtttccacga gctcaaaatg gcgccgaatg 600 ataccgtcgt acaacatgtg gcgaagatcg aaaatatggc gagtcaacta cgcgacattg 660 gcgaggaact atccaaggtg accataatgt cgaaaatctt gagcactctc ccccagaaat 720 ttggcccttt ggtgacggca tgggatagcg tcagcgagga ggaacagacg cagtcaaagc 780 taatcgagcg tctcatcaag gaggaaggac gacttgccgc catggatgac gcaacaagca 840 ccacggctgc catgacagtg cagcagagac gtggcagggg aacgcagcgg ggaaaggtgg 900 accctaaatc gaacgtcgac aaggacagct tggagacaaa gaagactttc gtatgtcatt 960 tctgtaaaaa acgcggtcac atttcgagat attgttatgc gagaaagaat gcgtctaaag 1020 ataataaaaa ttctgaaaaa acgaacgtag aaaattcggc agatgtaagc gcgttcatcg 1080 tctcggaagc gaaggtcatc gcagatttat tggatcgcga cgctaaaagg atttggcttc 1140 tcgactcggg tgcttcaaag cacatgacgt atcaacgcga gtggttacac gattttcacg 1200 agactacgaa cgaaaccgtt agtttaggcg acaacacgct ttgcgaagtt aggggttacg 1260 gaacagtagc aattaaaacg ttgattaatg gcgagtggaa actcgggaaa ctggaaaatg 1320 ttctctacgt tccaacctta cgcaaaaatt tgttttccac aggcgcgtgc acgaataaag 1380 gttattccgt tatatttaat gaaaagaccg tcgagataaa ggactcaaat atcttaaaaa 1440 ttcgcggggt tcaacaaagt aacaatttat accgtttatt gatagagaca acaacaatta 1500 ataacgcgaa tgttgtcgag gaaaacgcgt tcgaaatctg gcacaaacgt ctcgggcaca 1560 ttaatgataa atgtttgcga aaaatgattg aagaaaagct cgtaaccgga ataaagccat 1620 gcagttacga taaatgtttt tgcgaaaact gtgtgcttgg taagcaacat agactacctt 1680 tcaaaaacgc gagtaaaaga aaaagtaaaa tcggtgattt gatccacgca gacgtttgtg 1740 gacctatgtc ggagagatca atcggaggat ctcgttattt tttgattttg aaagacgatt 1800 gttctggttt ccgtaaagtt tatttcatgt ctcacaaaag cgatacttat gacagattta 1860 aagaatttat cgcaagtttt aaaaatgcat ttggtcatga tataaaatcc ttgagagctg 1920 ataacggaac ggaatttaca aatcacatga tgcgtgatct tatgaaaaaa cgcggcatcg 1980 tctttgaaac ttcagcgccg tatacgcatg aacaaaacgg caccgctgag cgtgacatta 2040 gaacaattgt tgaatgcgca agaacaatgc taatcgcgag cggattaccg aaatgtttat 2100 gggctgaggc tacaaacgca gcagtctata taataaaccg atgtattcaa tctcaatcgc 2160 gcgatgtggt tccttacgaa ctatggttta aaaggaagcc tgatctttct catgtaaaga 2220 tatttggatg cgtcgcatac gctcatgttc ctaaagaact taacaaaaag tgggagccga 2280 aatcaaataa aatgtattta gtcggatatc acgatgaatc caagaattat agattgttta 2340 atccacttac caataaagta atagtgtcac gtgatgtaat ttttaatgaa aaggctcgat 2400 ataataaaaa taaagaggaa catgatattt ccttcgatat tcgtttaaac gattcttctg 2460 aaaaagggaa tgataattat gagattgaac acattggaga taatgatgaa ccggaatcga 2520 tcgttgtagg aacgaataaa agttataatt tgcgtaatcg cgatgattta aagcgaccgg 2580 atcgatatga agcatgtatt gcgttgtttg atgaaccaac gacatataaa gacgcgataa 2640 atggtaaaaa ttcaaaagaa tggactattg cgattaacga ggagcttcag gctcacgagc 2700 gaaacaacac gtggaccatt gttgattttc caaaaaagga aaagaacatt atagactata 2760 aatgggtgtt caaacaaaaa cccatagtgt gcgaaggtcg tgaaacggtc cgattcaaag 2820 cacgactgtg tgccaaaggt ttctcacaaa aaccgggcgt tgattttaat gaaatatatt 2880 caccggtagt aagatatgac tctatacgtt tgttgcttgc tattgctgct atcgaaaact 2940 tagaaatgcg aaaatttgac ataaagacgg cgttcatcaa cggtgaagtc gaagaggaat 3000 tatatatgaa aatccctgaa ggcataaatg taagttgcga taaagtgtgt cgtttaaata 3060 aatcaatata cgggttaaag caagctgcaa ggtgttggaa tgatcgattt aacaatttca 3120 tgagaaaatt cgatttcaaa caaagcgagt ctgatagatg cgtttatttc ggtaaatttg 3180 aaaagaaaag gatttattta gcgctatatg tcgatgacgg actcgtgtta gcagattgta 3240 ctgtcgtgat cgatcatttc ttcactgaat taaagaacac attcgaaatt accgaaagtg 3300 aggtaaatag ttttgtcggt atcgaaataa cgcgaaatcg cgataagcgg acgattttta 3360 tacatcaaag ctcatacata aatagattgc tgaaacgttt caacatgtgt gaagcgaaga 3420 gtgtgtcagt accgtccgac ccacatgtga tcctaaagat ccctgactgt gagtgggaga 3480 aaatggacaa gaccccatac cgagaggctg taggaggttt attgttcctg gcgatgattt 3540 cgcgccctga tgtttcgtat gcggtaggac tggtaagccg tttttgtggc aaattcgatt 3600 taaatcactg gaatgcgata aagcgaatat ttagatactt aataagtacc gagaattatg 3660 gaatattata tcacggtgac gacacttctg atattaaagg tttctcagac gcagattttg 3720 cgggcgatat agagacgcgg cgttcaacca caggttatat ttttcgactt gcaggcggag 3780 caattacttg ggcgtcaaaa cgccaagcga cagtaagcct gagtacgact gaggcagaat 3840 acatagctgc gagcaccgcc ataaaagaat gtatctgggt aggaaagatt ctttctgaaa 3900 ttggattcag atgtaatgcg aaaccaatta ccctgcatgt tgataatcag agtgcaatta 3960 ggctaataaa gaacccggaa tatcacagcc ggacgaaaca cattgatgta aaatttcatt 4020 tcatacgcga aaaatacgat aaacgaataa tagacattgt atatgtatct tcgcaggatc 4080 aattcgcgga catatgtacg aaagccattg cgaaagaaaa gtttgaattt ttaaggtcaa 4140 aaattggcgt tgtaagacca ttagacaata agtaataaca cttggaaagt gggag 4195 // ID CR1-61_HM repbase; DNA; INV; 4139 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-61_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4139 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1888-1888 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 70..774 FT /product="CR1-61_HM_1p" FT /translation="MAKNFTPAQLKEILENHENTMMKIFNDRIEKMEIKFN FT NLKDENFTLKKELSEVQKAVEFISKSYDKIIIELTELKKTESKKSPDESNI FT ENGTNNIKEKIAELEDRSRRNNLRFIGIKDKANETWEESEQKIKDFLNTKL FT NIQEKILIDRAHRVGKENGKNRSIIVRFQNYKDKALVLNKYTTMKLWNERL FT YVNEDFSDFTMDLRRKLFKEAKELRAKGKFAKVVYNKLITRDLL*" FT CDS 817..3894 FT /product="CR1-61_HM_2p" FT /translation="MDDYESITFNFHDKYLKSNFNSDPDVNFFNDVNTCCS FT YFYPNELKEFLNKNGTENQKIRILHINIRSLNTNFEKFCDFLDETENIFNI FT ICLTETWISSNDFKNNSIFHLPGFETILLERQTNKRGGGVLIYVKEHIAHN FT IRNDMCVSDGDKEIVTIEILNNKKNNILLSCCYRPPDGASENLSFFLQQNI FT IHKGSKENKITFIIGDFNMNCLIYNKDKKIKSFYDEILMAGAVPLINRPTR FT ITKTSTTLIDNIFTTDIYNRKLKKGIIKTDISDHFPIFLTIHTVSSNNPER FT QNIIKKRVFNESNIKSFQYQLSLLHWKHINFNEDANTIYNKFFDTFFSIYD FT ANFPLCEKITKIKTLNSPWITKGLRKSSKIKQKLYIKYLKTKSNENQQKYK FT NYQHLFEKIRKKLKKNYYSNLISKFKNDSKNTWRTLKEITGKQKTNSNSLP FT KVIKINKTNIYEPNEIANEFNKYFIDIGTKLANKIPITEKNFSDFLEPINN FT SLFIDNLHSDLSFDEFERAYKSLKRNKATGADEINGNIVIDCFENLKCILY FT KVFRASIQQGVFPDQLKIAKVTPIYKNGKKSNISNYRPISVLSTFSKILER FT IIYNRTYNYLNLNKLLYKNQFGFKKNCSTEQAITQFIREISDSFGRSQYTL FT GIFIDLSKAFDTVDHKILLKKLKFYGINGKLIKWYQSYLENRKQFVFYGDY FT FQNEKLIDIKCGVPQGSILGPLLFLVYINDLNKASNLMSINFADDTNLFLS FT NSSIYELFSNMNNELNQISNWFKCNKLTLNIDKTKWILFHSLAKKRFLPNK FT LPKLFIDKIEIKKESVTRFLGIYIDENITWKYHIDYISTKLAKSIGILYKV FT RYYLSKQNLIQLYYSFIHSYINYANIVWGSTSKSKLRSLYHRQKHAMRLIC FT FENRFSHSNILFKNVKALNVYELNVYNILCFMFRCKSERPLSNFKDLFTYK FT AINKYALRNNNLLIEPFCQTKFNQFCINYRAPYLWNKIVVPNFDLSISFSV FT FKTKLKNYILSTGNIYKYF*" XX SQ Sequence 4139 BP; 1693 A; 577 C; 530 G; 1339 T; 0 other; ttatattttt caccgcaact ttcaccgcga acggacgtgt tttaagagct tattaaaaaa 60 cttataataa tggcgaaaaa ctttacacca gcacaactta aagaaatcct tgaaaatcac 120 gagaatacta tgatgaaaat ttttaatgat cggatagaaa aaatggaaat taaatttaat 180 aacttaaaag atgaaaactt tacgttaaaa aaagaattat ctgaggtaca gaaagctgtc 240 gaatttataa gtaaaagcta tgataaaata ataattgaac ttacagaatt aaaaaaaaca 300 gaatctaaaa aatcaccaga tgaaagtaat attgaaaatg gaactaataa cataaaagag 360 aaaatagcag aactagaaga cagaagtcgt agaaacaact tgagatttat tgggattaaa 420 gataaagcta atgaaacttg ggaagaaagt gagcaaaaaa tcaaagactt cctcaataca 480 aagcttaata tacaagaaaa aattttaata gatagagccc atcgagttgg aaaagaaaac 540 ggaaaaaaca gatcaataat tgttcggttt caaaactaca aagacaaagc cctagtgtta 600 aacaaatata cgacgatgaa gctatggaac gaacggttgt acgttaacga agactttagc 660 gattttacaa tggatttacg cagaaagctt ttcaaagaag cgaaagaatt gagagcgaaa 720 ggtaagtttg ctaaagtagt ttacaataaa ttaattacgc gcgacctatt ataaaggaat 780 tcttttaata ttctataatt aaatttaaaa acaaaaatgg atgactacga atccattact 840 tttaattttc atgataagta cttaaaatca aattttaatt cagatccaga tgtcaacttt 900 tttaatgatg tgaacacatg ttgttcttat ttttatccca acgagttaaa agaatttctt 960 aataaaaatg gcactgaaaa tcaaaaaatc agaatccttc acattaatat tagaagtcta 1020 aatacaaatt ttgaaaagtt ttgcgatttt ttagatgaaa ctgaaaatat ttttaacata 1080 atttgcttaa cagaaacgtg gatttcttcg aatgatttta aaaataattc tattttccat 1140 cttccaggtt ttgaaacaat tttattagaa agacaaacaa ataaacgcgg tggaggagtt 1200 ctgatttacg ttaaggaaca cattgcgcat aacattagga atgatatgtg cgtttctgac 1260 ggtgataaag aaatcgtaac tattgaaatt ttaaacaata aaaaaaataa tattctatta 1320 agctgctgtt atagaccacc tgatggtgcg tctgaaaatc ttagtttctt tttacagcaa 1380 aatatcattc acaaaggtag caaagaaaac aaaattactt ttataattgg agatttcaac 1440 atgaactgtt taatttataa taaagataaa aaaataaaaa gtttttatga tgaaatactt 1500 atggcaggag ctgttccttt aattaatcgc cctactagaa taaccaaaac ttcaactacg 1560 ttgattgata atatttttac aacagatatt tataatagaa aattaaaaaa aggaattata 1620 aaaaccgaca tatccgatca tttccccatt tttctaacaa tccatactgt atcctctaat 1680 aaccccgaaa gacaaaatat aataaaaaaa cgtgtcttca atgagagcaa tataaaatcc 1740 ttccaatatc aattatcact tctgcattgg aaacatatta actttaatga agacgcaaat 1800 acaatatata ataaattctt tgatacgttc ttctcaatat acgatgcaaa ctttccactt 1860 tgtgaaaaaa taacaaaaat aaaaacctta aatagtccat ggattaccaa aggacttaga 1920 aaatcctcta aaataaaaca gaaactatat ataaaatatc taaaaacaaa atcaaatgaa 1980 aaccaacaaa aatataaaaa ttaccaacat ttatttgaaa aaattcgtaa aaaattaaaa 2040 aaaaattatt attcaaattt aattagtaaa tttaaaaacg actcaaaaaa tacttggaga 2100 acgttgaaag aaataacagg aaaacaaaaa acaaattcaa actcccttcc taaagtaatt 2160 aaaataaata aaacaaatat atatgaacca aacgaaattg caaacgagtt taataaatat 2220 ttcattgaca taggaaccaa attggcaaac aagatcccta ttacagaaaa aaattttagt 2280 gatttcctag agcctataaa taattcgctt tttatagata atttacattc tgacttatct 2340 tttgatgagt ttgagagagc ttacaaatct ctcaaaagaa acaaagctac gggtgcggac 2400 gaaataaatg gcaacatagt catagattgc tttgaaaact taaagtgtat tctttataaa 2460 gtctttaggg catctataca gcaaggtgta ttccccgacc aattaaaaat agctaaagta 2520 actcctatat acaaaaacgg aaaaaaatca aatattagta attatcgccc tatctcagtt 2580 ctttcaactt tctcaaaaat tttagaaaga attatataca atagaacata caattacctt 2640 aatctaaata aacttctata taaaaatcaa ttcggtttta aaaaaaattg ttcaactgaa 2700 caagccatta cacagttcat tcgtgaaatc tccgattcat ttggtagatc acaatataca 2760 ttaggtattt tcattgacct atcaaaagca ttcgacacgg tcgatcacaa aattctactt 2820 aaaaaactca aattttacgg aattaatggc aaattaataa aatggtatca aagttacttg 2880 gaaaatagaa aacaatttgt tttctatgga gattattttc aaaatgaaaa attaattgac 2940 ataaaatgtg gtgttccaca gggttctatt ctaggcccac ttttgttttt agtttatata 3000 aatgatttaa ataaagcttc taaccttatg agtataaatt ttgcggacga tactaattta 3060 tttctgtcaa acagtagcat ttatgagctt ttttcaaata tgaataatga actcaatcag 3120 atctccaact ggtttaagtg caacaagtta actttaaaca ttgacaaaac aaagtggatc 3180 cttttccact ctcttgctaa aaagcgtttt ttaccaaata aattgcctaa actttttatt 3240 gataaaattg aaataaaaaa ggagtcggtc acaagatttt taggcattta cattgatgaa 3300 aatattacat ggaagtatca tatcgactat atttctacta aattggctaa aagtattgga 3360 attctatata aggtcagata ctatctaagt aaacaaaatt taattcaact ctactactcg 3420 tttattcata gctatataaa ttatgcaaat attgtttggg gaagtacctc taaaagtaag 3480 ttacgaagtc tttaccatcg tcagaaacac gcaatgcgtt taatatgttt cgaaaatcgt 3540 ttttctcatt caaatatttt atttaaaaac gttaaagcac ttaacgttta cgaactcaat 3600 gtctataata ttttatgttt tatgtttcgt tgtaaaagtg aacgaccatt gtccaatttc 3660 aaagatcttt ttacttataa agctataaac aaatatgctt tacgaaataa taatttatta 3720 attgaaccct tttgtcaaac aaagtttaat caattttgca taaactatag agcaccgtat 3780 ttatggaaca aaattgttgt tccaaatttt gatttgtcaa tttcgttttc cgtttttaaa 3840 actaaattaa aaaactatat tctttctact ggcaatatat ataaatattt ttgaaatgta 3900 aaattctttt tttttactct ggataatttg taaaattatt ttcagttttt gtttcttcta 3960 cattgttatt agttatttga tcaattgtat tttattttat tatatataca cgtattatat 4020 cttttatatg gttctggcga caagatcatt tggatcttct ttcagatacc aagtttatat 4080 attgttatat tgttatatta cgattaataa ttgtaaactg aaaaaaaaaa aaaaaaaaa 4139 // ID BEL-45_CQ-I repbase; DNA; INV; 5822 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-45_CQ_; KW BEL-45_CQ-LTR; BEL-45_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5822 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 243-243 (2011). XX DR [1] (Consensus) XX CC Positions [4805-5386] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 341..5821 FT /product="BEL-45_CQ-I_1p" FT /translation="MWRSLLKLYPEPGSENEKEEEKLAVIVPVPSPVKPLT FT PKKLWEKKKAKMADGEVKALSKKRSQVKGKLTRILHAVQPSTQPGALPGAQ FT AREADVLPPQLRVHQRSVEKYYGEFCELHNKIMDLVEDDEGTKQQEAKWVE FT FEELYNRTLVVLETLLTAHDRPTQAVLPAGQAAQQVIIHQQALRAPLPTFD FT GRYENWAKFKAMFQDLMRSSPDSDAVKLYHLDKALVGDAEGKIDLRTIQDN FT NYQEAWNQLEEQYENTRLIIDLHIQGILQLKKMDKRSSKELRDVVERCSRH FT VEGLRFHKQELLGVSELIVVNILAAALDRETRELWEATIEKGELPTYQATV FT DFLKKRCHILERCEQSDPVASAVEPTISKKLPSKPSAKMSAAAMTTSEVKC FT ELCGGNHPNYKCSTFCSMSVPQRLAKLREVRACFNCLRTGHHIKTCPSEKT FT CKCGEKHHNLLHMDKPKEQASSKSEASPIPQPTTEQAAGGSGSVTAAVADA FT NPKQTTSCCSSGVLQTSRPVLLQTAIIDVMDKHGRLHPCRTLLDSGSQAHI FT LSAAMARTLDLPLMKCNVTVIGANAVKTPARRGVNLNFSSRYSDFQDTISC FT LISEKPTGIIPSGRIDASGWGFPSGLQLADPHFCEPNDIDLVMASNYVWDL FT LRTEKVKLHNGTVTLRETDLGWIVTGTYDPFDQVSQSFVHSNVVRLDALKE FT AVEKFWTVEELAETAPPSAENNDVEKHFVETHHRDESGRYVVQLPFRDTVS FT ELDDNRTLALRRFSQLERRLSRNPDLKLQYSAFIDEYEALGHCKEVFEEQD FT PPDTKKFYLPHHAVLKPSSSSTKLRVVFDASAKSSRYSLNDVLKVGPTVQS FT DLFSILLHFRRYLIAFSADAVKMYRQVKIVPEHRPFQRIFWRKEPSDRLRV FT LELNTVTYGTASAPFLATRCLTQLAESAKDRFPKTAKKIKEDVYVDDMLSG FT ADTAEEAQTCVKEVKELLAEGGFQIKKWCSNSEAVLEGVPEEDREILVKID FT ECSVSEGIKALGILWDPKQDEFRFQNVLSESDILPITKAKVLSDIAKLFDP FT LGLISPVIVCAKVIMQKLWADKLGWTDPLQGELLKEWLDFRSSLDGLKDIR FT IPRCAVVPGAVRYEVHGYSDASTVAYGACVYLKCFMPDSKTVVSRLLCSKS FT RVAPLKQLTIPKLELEAALLLSLLVVKVLATLDLEIAEVVLKSDSQIVLAW FT LRRPLDTLEVFQRNRVAKINALTKGFTWAYVRTNENPADFVSRGQLPAPLS FT RNNLWWEDTPSTTTTAVDPAPILEQELPGLVASLSNTVESELLPVFTRFSD FT FRHLQRVVAYVCRFIQNLKSKRAGQPTCRTNYLTIPETRKALKCIVRCLQV FT HALLEDYNSVKQSGAAHRLARLAPILDSDGCLRVGGRIQKSSLPEEAKHQL FT ILPRHPVTESLIRAMHIENLHIGPSGLLAATRQKFWLLAGRSITRKITTKC FT MCCFRAKPRGIEQFMGQLPDERVNVAAPFEFTGVDYAGPMLVKQGKYRPKV FT VKAYIAVFVCMATKNVHLELVSELTAEAFIAALERFINRRGMVRKLFSDNG FT TNFVGASGMLRDFYRLLSSDLLLRELHELLLPKEIEWSFIPPRAPNFGGLW FT EAGVKSVKTHLKRTLVNATLTFEEMATMLTHIEAILNSRPLYSASDCPGDA FT LPITPAHLQIGRPLQSVPKPSCSSVPDNRLPRWRYLDKLREHFWDRWSREY FT LTSLQVRGKWHKKTANVRPGMVVLLIEDNLPPQSWKLAIVIKTYPGPDQLV FT RVVDVKVGNKILQRSISKLAPLPTEDNDQLRQVPEPSGLDEIPSLEEASAS FT QSGRR" XX SQ Sequence 5822 BP; 1456 A; 1580 C; 1609 G; 1177 T; 0 other; tttggtccat tcgtaccgga ctttggccat tcgaccgtcg gatccgcgag tgaaaagtgt 60 ccagtggatt ccggaaatcg tgcgagcgga agaacgcatc ggtaagaaga agttaccgat 120 cgtgtgatcc gggagtagga aaagttctaa agactcccgg agggcgaccc acgcggccct 180 gaacagcgga aacagtggcg gcaagagtcc gcaataaaag gaaaaaagtt ccgtgcctgt 240 gcagtgaaga tcagcgaagc gacgccattt tgttttccag ttcccaagag aggaacaatc 300 cgtggaatgc agacagcgca agagttgctg gtctgatttc atgtggcggt cgctcctgaa 360 gctgtacccg gaaccaggaa gcgaaaacga aaaagaagag gaaaaactcg ccgtcatagt 420 gccagttccg tctccagtga aaccgctaac gccaaagaag ctgtgggaga agaagaaggc 480 aaagatggcg gacggagagg tcaaggccct ttccaagaag cgaagccagg tgaagggaaa 540 actcacacga atcctgcacg cagtccagcc gagcacgcag cccggtgccc tacctggtgc 600 ccaagctcgt gaggccgacg tactcccgcc gcagctacga gtccaccagc ggagtgtgga 660 gaagtactac ggtgagtttt gcgagctcca caacaagatt atggacctgg tagaggatga 720 cgagggaacc aagcagcagg aggcaaagtg ggtggagttc gaggagctct acaaccgaac 780 gttggtcgtg ttggaaacgc tgctgaccgc tcatgacagg ccgacccagg ctgtgcttcc 840 ggctggccaa gccgcccagc aggtcatcat ccatcagcaa gcgttgcgtg ccccactccc 900 gacgttcgat ggacgttacg agaactgggc aaagttcaag gcgatgttcc aagacttgat 960 gcgcagctcg cccgactctg atgctgtcaa gttataccac ctggacaagg cgctggtagg 1020 cgacgcggaa gggaagatcg accttcgaac tatccaggac aacaattatc aggaagcttg 1080 gaatcagctg gaagaacagt acgagaatac gcggctgatt atcgatctcc acatccaagg 1140 catcttacag ctgaagaaga tggacaagcg atcgtcgaag gagttgcgag atgtcgtcga 1200 gcggtgttcc agacacgtcg aaggtctacg cttccacaag caggagctgt tgggagtgtc 1260 tgagcttatc gtggttaaca tcctcgcagc tgctctggac cgcgagacga gagagctctg 1320 ggaagcgacg atcgagaaag gtgagcttcc aacgtaccag gcgaccgtcg atttcctaaa 1380 gaagcggtgt cacattctcg agcgatgtga gcagtccgac ccggtggcat ctgctgtcga 1440 accgacaatc tccaagaagc tcccttcgaa accttcggcg aagatgtccg ctgcagccat 1500 gacgacaagc gaggtcaagt gtgagctgtg cggcggcaac catccgaact acaagtgtag 1560 caccttctgc agcatgtctg tcccacagag gcttgcgaag ttgagagagg tacgggcctg 1620 cttcaactgc ctcagaaccg gccaccatat caagacttgt ccctccgaga agacctgtaa 1680 gtgtggagag aagcaccaca atctgctcca catggacaaa ccgaaggaac aagcttcttc 1740 aaaatctgag gctagtccca ttcctcagcc aaccaccgag caagcagctg gtggttccgg 1800 atcggtgacc gcagcagtgg cagacgcaaa cccgaagcag acaacatcgt gctgtagcag 1860 cggagtgcta caaacctccc gacctgtgct gttgcagaca gcaattatcg atgtgatgga 1920 caagcacgga cggctgcatc cgtgccgcac gctcctggac tctggatccc aagcacacat 1980 cctgtcagcg gcgatggcgc gaacgctcga ccttcctttg atgaagtgta acgtgaccgt 2040 gattggagcg aacgctgtga aaaccccggc gagacgaggt gtcaacctga atttctcctc 2100 gcggtacagt gacttccagg ataccatttc ctgcctcatc tcggagaagc caaccggcat 2160 cattccgtcc gggagaatcg acgcgtccgg atgggggttt ccgagtggat tgcagcttgc 2220 agatccacat ttctgtgaac ccaacgacat tgacctagtg atggcatcca actacgtgtg 2280 ggatctgctg cgcacggaga aagtgaagtt gcacaacggc accgtcacgc tccgagagac 2340 ggaccttggt tggattgtta cgggcaccta cgacccgttc gaccaggtga gtcaatcttt 2400 cgtccactca aatgttgtgc gtctggatgc tctgaaagaa gccgttgaga agttttggac 2460 agtcgaagag ctggctgaaa cggcaccacc ctccgccgag aacaacgacg tcgagaagca 2520 tttcgtggaa acccatcacc gcgacgagag cggccgttac gtagttcaac tcccgttcag 2580 agacaccgtt tcagagttgg acgacaaccg aaccctggcc ttgagaagat tctcccagct 2640 tgaacgccgc ctttcacgaa acccagactt gaagctgcag tactcggcct tcatcgacga 2700 atacgaggca cttgggcact gcaaggaagt ttttgaggaa caagatccac cagataccaa 2760 gaagttctac ctaccacacc acgcagtgtt gaagccctcc agttcatcta ccaagttgcg 2820 agtggtgttt gatgccagtg cgaagtcttc gaggtactcc ttgaacgacg tcttgaaagt 2880 gggccccacc gtgcaaagcg atctgttttc catcttgctt cacttccgcc ggtatttgat 2940 tgctttctcg gctgacgccg tcaaaatgta tcggcaagta aagatcgttc ctgagcaccg 3000 accatttcaa cgaatcttct ggagaaagga gccctcggat cgcctcagag tgctcgaatt 3060 gaacaccgtc acgtacggca cggctagtgc cccgtttcta gccacccgtt gcctgaccca 3120 gctggccgag tcagccaagg accgcttccc gaagactgct aagaagatta aggaagatgt 3180 gtatgtcgac gacatgctgt ctggagcgga cacagcagag gaggcacaaa cgtgcgtgaa 3240 ggaagtaaag gagctgttgg cggaaggcgg tttccagatc aagaagtggt gctcaaactc 3300 ggaagctgtt ctggaaggag taccagagga agatcgtgaa atactagtga agatagacga 3360 gtgtagtgtg agtgaaggaa tcaaagcatt gggaatcctg tgggacccaa aacaggacga 3420 attccgattc caaaatgtgc tcagtgaaag tgatatcttg cccatcacca aagccaaagt 3480 actgtcggac attgcaaaac ttttcgatcc gctgggcttg atatcaccag tgatagtgtg 3540 tgcaaaggtg ataatgcaga agctgtgggc cgacaagctc ggctggactg acccgctgca 3600 aggagagttg ctgaaggagt ggctcgactt ccggagttca ttggatggac tcaaggacat 3660 cagaatcccg cgatgcgctg tcgtaccggg cgccgttcgc tacgaagtac acggatactc 3720 ggacgcctcg accgttgctt acggtgcttg cgtctacctc aagtgcttta tgcccgacag 3780 caaaaccgtc gtgtcccgcc tgctgtgttc caagtcgcga gttgcaccct tgaagcagct 3840 tacgatcccg aagctggagc tcgaagctgc gctgctattg tccctgctcg tcgtgaaggt 3900 gctagctaca ctcgacctgg aaattgctga agtcgtgctg aagtcggaca gccaaatcgt 3960 cctagcgtgg ctaagacggc cacttgacac cttggaagta ttccagcgaa accgagtcgc 4020 caagatcaac gctctgacca agggtttcac gtgggcatac gtgagaacga atgagaatcc 4080 agctgatttt gtgtctagag gccagctccc ggcgcctttg agccgcaaca acctgtggtg 4140 ggaggatacg ccgtccacaa cgaccaccgc agtcgatcct gcaccgatcc tcgaacaaga 4200 gcttcctggg ttggtggcat ccctgtccaa caccgtcgag tctgagctgc taccagtgtt 4260 cacgcgcttc agcgacttcc gccacctcca gcgagttgtc gcttacgtct gccgattcat 4320 ccagaatttg aagagcaaga gagccgggca gcctacctgc agaaccaatt acttgacgat 4380 ccctgaaacg cgaaaagcac taaagtgcat tgtgagatgt ctgcaagtgc acgccttact 4440 tgaggactac aattccgtga agcagagtgg agccgcacac cggctggctc ggctagcgcc 4500 aatactcgac agcgatggct gcttgagagt cgggggccgt atccagaagt caagcctacc 4560 agaagaagct aaacaccaac tgattctgcc tcgccatccc gttaccgaat cgctgatccg 4620 ggcgatgcac atcgagaacc tgcatatcgg cccatctgga ctgctggctg caacccgtca 4680 gaaattctgg ctgttggctg gccgatcgat caccaggaaa atcaccacca agtgcatgtg 4740 ctgcttccgt gctaaacccc gtggaatcga gcagttcatg ggccaattac cggacgagag 4800 agtgaacgtg gctgcgccgt tcgaattcac cggcgtggac tatgccggac ctatgttggt 4860 gaaacaaggg aaatatcgtc cgaaggttgt caaggcatac atcgcggtat ttgtgtgcat 4920 ggcgaccaag aatgtacacc tggagttggt ctccgagctt accgcagagg ccttcatcgc 4980 agctcttgaa cgttttatca accgccgtgg catggtccgg aaactgttct ccgataacgg 5040 aaccaacttc gtcggagctt cagggatgct tcgagacttc taccgcttgc tgtccagcga 5100 tcttctgctt cgagaacttc atgagcttct gttgcccaag gagattgaat ggagcttcat 5160 tccaccgcgg gcccccaact ttggtggctt gtgggaggcg ggcgtcaaaa gtgtgaagac 5220 acacctgaaa aggacattgg tcaacgcaac actaaccttt gaagagatgg ccaccatgct 5280 gacccacatc gaagccatct tgaactcgcg accgctgtat agcgcctcgg actgccctgg 5340 agacgccttg ccaatcacgc cagcacactt gcagatcggt agacctcttc aatcggtgcc 5400 caaaccgtcc tgcagcagcg tgccagataa tcgccttccc cgctggcgct atctggacaa 5460 actccgcgaa catttctggg accgctggtc tcgggagtac cttaccagct tgcaagtccg 5520 cggcaagtgg cacaagaaga ctgcaaacgt acggcctggc atggtcgtgc ttctcatcga 5580 agacaactta ccaccacagt cctggaagct ggccatcgtt atcaagacct atccgggacc 5640 agaccagcta gttcgagtcg tcgacgtgaa ggtgggcaac aagatcttgc aacgttccat 5700 ctcaaagctg gcgccactgc ccacggaaga taatgaccag ctgagacaag ttccggaacc 5760 ttcagggctt gatgaaattc cgagtcttga agaagcttca gcttcgcagt cggggcggag 5820 aa 5822 // ID Gypsy-10-I_HM repbase; DNA; INV; 3639 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-10-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3639 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1986-1986 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(28..885,695..3607) FT /product="Gypsy-10-I_HM_1p" FT /translation="NKRKMDRFMKPDRLGIDPNSADAAKTYNHWLRTFYNF FT VSTVDDNSIKLNLLINHIEPNVYEFISECDDFEQATSILESIYIKPKNEIF FT ARHILSTRKQKPSETIDQFLNELKNLSKDCKFTAVTAEQYKSEMIRDAFIN FT GLLSNSIRQRLLENKILDLSSAFDQARSLDAAQQNSNLYSQLTIPTSASIQ FT EIKSPLQKQSTTDNHLAATAKSKQLCWFCGNIRHPRTKCPAREAICHKCKK FT IGHFEKLCRSSNVSAAIPKIDDDYFCAKTTQHHISFSKLSIQSLSDIHVLN FT VLLVKLSVTNVKRLVILKNFADHLMFLLLFLKLMMITSVPRLLNTISASAN FT SLFRVCHNIIVNDKYKAEALIDSGSTNKSFINNKLVALLNLNIIYEQSVIG FT MASASLSAKSDGYCYVSITLQNEIYNKVKLHVLDNLCVDVILGTDFQELHE FT SITIKYGGKRPPITFAALTTMKTEPPELFANLTXDCQPIATRSRNYSQRDK FT EFIKSEIHRLIQAEIIEPSNSPWRAQVLVVNEKTKRRMVVDYSETINKYTQ FT LDAYPLPKIETIVNIISKYKIYSTLDLRSAYYQIPLSKKDRPYTAFEADDK FT LWQYTRMPFGVTNGSACFQRKIDDFVRKYKLTDCFPFIDNVTICGNSQKEH FT DENLLKFRLAAIDAGITFNEEKCVFSTKTLDFLGYRISYGSLKPDPQRLAP FT LIKLPPPDNAKSLQRIIGMFSYYAKWIRKFSDKIRPLNSVTQFPLNQQRIS FT AFETLKNELVQSSVQTIDENIPFTVETDASDFAISATLNQDGXPVAFHSRT FT LQGSEQYYSSVEKEARAIVEAINHWKHFLLGRHFILITDQQSVAYMYDYKS FT SSKIKNDKIMRWRIALSPYSYDIHYRPGQCNSGPDTFTRVKCAAISSESLY FT DLHAALCHPGVTRFHHFIQSRNLPYSVEDVKQICRDCRICKEIKPKYYRPN FT DVHLIKATQPFERISIDFKGPLPSSTPEQYMLTIVDEYSRFPFAYPVKDMT FT TQTIINCLADLFSMFGMPSYVHSDRGSSLMSSELKHWFLSKGIATSRTTPY FT NPTGNGQVERYNGIIWKSILLALKSRNLPISSWKRVLPDALHATRSLLCTS FT TNCSPHERLFNFQRRSSSGSSIPSWLVNPGPVLIKRNAMQSKYEDSVNEVQ FT LIEANPHYANVRFPNGHETTVSTKQLAPIGTTPIVQTSNDNLEITAIDNYP FT LTPNGSNETPLVSEGELESRVNPIEQSTLPPRRSSRISKAPEKLNL*" XX SQ Sequence 3639 BP; 1257 A; 707 C; 586 G; 1086 T; 3 other; atcaaaagag ctaaaatatt attatgaaat aaaagaaaga tggatcgatt catgaaacct 60 gacagacttg gtatagatcc aaactcagca gatgcagcaa aaacatacaa ccattggtta 120 agaacatttt acaattttgt aagtacagtt gacgataact ccattaagct taatctccta 180 ataaaccata ttgaacctaa tgtatatgaa ttcatatcag agtgtgatga ttttgaacaa 240 gcaacaagta ttctcgagtc tatctacata aaaccaaaga atgaaatatt tgccagacac 300 attttgtcaa cacggaagca aaaaccaagt gaaacaatag atcaattcct aaatgaactt 360 aaaaatttat ctaaggactg taagttcact gctgttactg ctgagcaata taaatctgaa 420 atgattcgcg atgcgtttat aaatggctta ctctccaaca gtatacgcca acgtttacta 480 gaaaacaaga tattagacct ttcatcagcc tttgaccaag ctagatcttt agacgcggcc 540 caacaaaact caaacttata ctcacaattg actataccta cctctgcctc tattcaggaa 600 atcaaatcac ctttacaaaa acagtccacg actgataatc acttagctgc tacagcaaaa 660 tcaaaacaat tgtgctggtt ttgtggaaac ataagacatc cacgtactaa atgtcctgct 720 cgtgaagcta tctgtcacaa atgtaaaaag attggtcatt ttgaaaaact ttgcagatca 780 tctaatgttt ctgctgctat tcctaaaatt gatgatgatt acttctgtgc caagactact 840 caacaccata tcagcttcag caaactctct attcagagtc tgtcataaca tcattgttaa 900 tgataaatat aaagcagaag ctttaattga tagtgggagt acaaacaaaa gttttattaa 960 caacaaacta gtggctctac tcaatttaaa tattatttat gaacagagcg taatcggaat 1020 ggcatcagct tctttatctg caaaatcaga tggatattgt tatgtttcaa taactcttca 1080 aaacgaaatc tataacaaag tcaagttgca cgtgctagat aacttatgcg tagatgttat 1140 tcttggcaca gattttcaag agttgcacga aagtattacc attaagtatg gtgggaaaag 1200 accacccata acatttgctg ctttaacaac aatgaaaact gaacctcctg aattatttgc 1260 taatctcacg maagattgtc aaccgattgc cactagatct agaaattatt ctcaaagaga 1320 taaagaattt attaaatctg aaatccaccg cctgattcaa gcagaaataa tagagcctag 1380 taattcacct tggcgtgccc aggtcttagt agtaaatgaa aagactaaac gtcgcatggt 1440 tgttgattac tctgaaacta ttaataagta tactcaatta gatgcatatc ctctacctaa 1500 gattgaaacg attgtaaaca taatttctaa atacaaaatt tatagtactc ttgaccttcg 1560 ctcagcatac taccaaatac ctttatcaaa aaaagatcga ccatatactg cttttgaagc 1620 tgacgataag ttgtggcaat atactagaat gccttttgga gttaccaatg gatctgcatg 1680 ttttcaacgc aaaatagatg actttgttag aaaatataaa ctaactgact gctttccttt 1740 cattgataac gttacaatat gtggtaatag tcagaaagaa catgatgaaa atctacttaa 1800 gttcagatta gctgccattg atgctggaat tacatttaat gaagagaagt gtgtattttc 1860 aactaagact ttagattttt taggttaccg tatatcatat ggctctttaa aaccagatcc 1920 tcagagactt gctcctctca ttaagttacc cccacctgat aatgcaaagt ctttacagcg 1980 tattattgga atgttctcgt actatgcaaa atggataaga aagttttcag ataaaattag 2040 acctttaaac tctgttactc aatttccact caatcagcag cgaatatcag catttgaaac 2100 attaaaaaat gagcttgttc aatcttccgt gcaaaccatt gatgagaata taccttttac 2160 tgtggagact gatgcttctg attttgccat atccgccaca cttaatcaag atggaagmcc 2220 tgttgctttc cattcacgaa cccttcaagg gagtgaacaa tactattcat cagtggagaa 2280 agaagctcga gcaatagtwg aagccataaa tcattggaaa cattttcttt tgggtcgaca 2340 tttcattctc atcactgacc aacaatctgt ggcatatatg tatgattata aatcaagcag 2400 taaaataaag aatgataaaa taatgcgttg gagaatagct ttgtctccat attcttatga 2460 tatccattat cgtcctggcc aatgtaatag tggtccagat acatttactc gagttaaatg 2520 tgcagcaata agttcagaat ctttatacga tttgcatgct gcactttgcc atcctggtgt 2580 tactcgcttc catcatttta ttcaatcaag aaacctccct tattcagtag aagatgtaaa 2640 acaaatatgt agggattgta gaatatgtaa agaaataaaa cccaagtact accgcccaaa 2700 tgatgtgcac ctgattaagg cgactcagcc attcgaacgt atatctattg acttcaaagg 2760 accattacct tcttcaaccc ctgaacagta tatgctaaca attgtagacg aatactcacg 2820 ttttccattt gcttatcctg tgaaagatat gacaactcag acaataataa actgtttagc 2880 tgatcttttt tccatgtttg gtatgcctag ctacgttcat tcggatcgtg gatcctcatt 2940 aatgtcttct gaattaaagc actggttctt atcaaaaggg atcgcaacta gtagaacaac 3000 cccttacaat ccgactggaa atgggcaagt tgaaagatac aatggaatta tttggaagtc 3060 gatattactc gctctgaaat ctcgaaatct gcctatatca tcatggaaaa gagttttacc 3120 tgatgcactt catgctacaa gatctctatt atgtacttct actaattgta gtccacatga 3180 acgtcttttt aattttcaaa gaagatcatc atctgggtcc tcaattcctt catggcttgt 3240 taaccctgga cctgtactta taaaaagaaa tgcaatgcaa agtaaatatg aagattcggt 3300 aaatgaggtt cagctcattg aagcaaatcc tcattatgcc aacgtgagat ttcctaatgg 3360 tcatgagact acagtgtcta caaaacagct ggctcctatc ggaacaactc ctatagtaca 3420 aacatcaaat gacaatttag aaataacagc tattgataac taccccttaa cccctaatgg 3480 tagcaatgaa actccgttag tctccgaagg ggaactagaa agcagagtga atcctataga 3540 gcaaagtact ctgcctcctc gtagatcttc acgaatttct aaagcaccag aaaaactcaa 3600 cttataaata cacctgaaca cttttaagag gagggtgaa 3639 // ID DNA8-13_AP repbase; DNA; INV; 788 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-13_AP. XX NM DNA8-13_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-788 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1755-1755 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 788 BP; 230 A; 167 C; 124 G; 266 T; 1 other; caggggtgga taggcccata gggaaaaagg gattttccct gtgggccccc gtctatgttt 60 gtggcctggg ggcccactcg ggtctacaca ttgttttttt gcagaataaa atattcacca 120 aataggaata gtttaaattt aaaactattc atttttttat ataaattatt antacttagg 180 tacctactgt tatttttaac aatataataa taataccacg actagtcaag aatttgctta 240 cggatcttta acatagtatt ttaaattttt atcgttaatc gaagaccgaa atatcctcgt 300 atgtttccct aacccccccc cccccatatt gtgtttttgc aatttttttt tggccgatat 360 ttagtacgga atttgtacga atttgtacga taagaccctc aatttgtaca aatttttttt 420 cgaagttata cgagaatttg tgtgaggggg ctatagaaac gcttctccac cccccccccc 480 cttatttcca aaactactta tttatgtgaa acttctatgc agatatttag tacggaattt 540 gtacgaattt gtacgacctt acgcgaaatt tttcaaaaaa ccccttacaa ccgaaaaaag 600 cattttcctc atatgccgtt tccccagccc ccaccaaact aaaaatacgt tttcgcacga 660 aacattttat ttataggtaa aggtgttaga ttgaatatct aatgcgagcg aagcgaccaa 720 attttttttc agtttggggg ccccttcaaa tatttccctg taacataaat actacctatc 780 cacccctg 788 // ID I_Ele43C_AAe repbase; DNA; INV; 6593 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele43C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6593 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1354-1354 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. CC The consensus is ~80% identical to I_Ele43 and I_Ele43B_AAe. XX FH Key Location/Qualifiers FT CDS 577..1866 FT /product="I_Ele43C_AAe_1p" FT /translation="MAAASSGDPGGSVKRRLPEYMDPTNQFGELTFLQLSG FT KNGIPLPINPFITGKSVEACAGGSIESAKSEAQGSKYTLRVRDPAQVAKLL FT KLTKLIDGTEVEVIPHPNLNVSRCVISCIDLIQMEEKDILQEMISEKVIRV FT QRITRNEGGRRVNTPALILTFCKTTYPEYLKVGLLRVPTRPYFPNPMLCYG FT CFSYGHTRVRCPGPQRCANCSQNFHGEECSEAPSCRNCKGDHRSTSRQCSV FT YKKEVEVIKLKVSENLSFPEARKRVEQQGGSYAQVAAQQNVFEKKLKELEV FT AMLAKDKEIAKLHEEGKRKDERIEQMMAFIKQVKQQSSQERTHHVSETVVV FT EKPRHSREQRVVQSTAGPMTRSRNNSPAVQETKRGRPSKFVYPKSATSPDT FT SPPPKKTAPTTHDLTQMEYSGEESEVSETPPNQRLR" FT CDS 1802..6481 FT /product="I_Ele43C_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="LRWSIPAKSLRYRRHPLTSVFDNPTLEQRSEYNDDPR FT TKTDNETFRFGSRESVVPLPTQVRQAEGVIRDVLPQPVDDEFTSVADGIPH FT DTTILQNHRGTIHSVPTSSNSIETNNENTDDTTQSLSPVPAIAGVSGSSRV FT VVSAMAGTVGQDVPQVSLINSQPQTGTAAPAIFSDTHQTTSTAANPNRYQP FT ASTYHLVFNNHLSDDTTQSLSPVPAVAGVSGSNRVVVSAMAGTVGQDVPQV FT SLSKFPSQTVAAAPDYLSGYTASPPSPGEGPSSVHRRNITQSASPVPAVVG FT IGRTSVMVSTAAGTVGQSVPQASDSLVLSSSEVTALPLSPTTDPPIRRTST FT SSSTTRSTTDSRSGYCFALQWNIRGLRANISELKLLISDLEPCVVALQETK FT VNQTVVPPDFVGRNYTLLLQSRNANYWQHGVGLAIREGIPFERLQIDTTLH FT LVAARIFAPIQMTVVSVYIPPSAVHCQTALGDLLEYLEGPVLLLGDFNAHH FT LAWGSHQSNALGRFVAETTLDKQLVILNDGSSTRIDPATGKTSAIDVSICS FT ESLARKFTWRTLSDTHNSDHFPITVCIPGWSNRRTTRQKWLYDQADWPLYE FT RLIAETIRPDVEWDVESFTEKIIAAAKKSIPRSSGRVGPKAVPWWCPEVKA FT AIRQRRKCLRSLRRLQQRDPNQPEALASFQEAKAEAKKAVTQAKQQSWEDF FT VAKISPSSTTTELWNTVNTLRGNRRQRPVVLKKSNGFTDNPEEAAEELAKY FT YSERSATSSYPPSFQMAKSAAERESMDFSPNTGDVYNIDITLAELLWALDK FT GRGSSTGDDSIGYPLLQRLPLSVKMTLLEVFNGIWRSGEFPASWRNAIVVP FT IPKPNCQDSGPAAFRPISLTSCMAKLFERIINRRLITELESSGRLDKRQHA FT FRTGRGTDTYFAELERSLPNIDEHCLIASLDLSKAYDTTWRHGIVRTLKSW FT RIRGRMINILKSFLSERTFQVSVGGHLSPQHQLENGVPQGSVLSVTLFLVA FT MQPIFRIIPAGVEVLLYADDILLVVRGAKTEGLHRKLQAAVKAVDKWAKSV FT GFGISATKSNCFYCSPNARREPAKEITIDRTAVPRTNRLRILGITLDRTLT FT FKPHCTTVKKACESRLRILQMIGAKLPRGNRSSLLQVGSALVTSKLIYGIG FT LVSRGGPASLQILAPAYNKMVRYASGAFVTSPINSVMAEVGTLPFDLLAAQ FT STARTAIRIAAKSSDNNNLPLIRRISDRLVELTGTALPNVSHLVRQSAREW FT HARKPTIVWDVKRSIRAGDPTEKVRPVVQHLLSTRFHQSTVVYTDGSKSQN FT TVGAAWHTTGLAGLYSLPEQCSVFSAEAYALKMAISIPNVPRELVILTDSA FT SCLQALEAGKSKHPWIQEIERTARTKSIRFCWIPGHAGISGNSEADRLANE FT ARRQPVIDTPLPGEDALRAVKQLIRCKWDDQWFNTRDSKLREVKQDTLRWT FT ELGSAADQRVLTRLRIGHTRLTHTFLLKREPPPTCECCGTTVDVRHLILQC FT RKYEREREQNNVSTTSLRDALANEEEATKNLLKFLHETGLYKKL" XX SQ Sequence 6593 BP; 1781 A; 1705 C; 1667 G; 1438 T; 2 other; caggttcgtg gacacaccgt gttgaccgca tctcttcgtg tgcgctctct agttaactcg 60 gtgaattttc caagttttcc gcggtttccg cgaacaaaac gtgtgtgaat cggttgttcc 120 ggtgataaag acaaattgtg cagtgaaggt ttggtggaaa tcggtggtga aacggccgaa 180 aacggcgaaa acaagtgcaa tccacgtgcc tcgaacggca aaacaaatcg ccgtcagccg 240 gctccccggc tgatttcatt caacgtctta gtagtgtggg acgtggaaca cggtgtgcca 300 gaaggcagtc gtttcgaaaa ataaaaaaaa gtgcggaaaa gttttggggg ttttttgaaa 360 ttattgaaag tggttaattg gtagcttgag cgaattacag tgcgtatcgt gtgagtgtgc 420 ttacagtgcg gcacataatt atctgaacgc ccgattattt tcccacgttg aaggagtaaa 480 gaaggtagga tctctcctct ttttatcttt tttgctcatt tggggtaggt aacgggtgac 540 cgttgccagt gggagtaagt tccgacgtga ggaaacatgg cggccgctag tagcggcgac 600 ccaggtgggt ccgttaaacg cagacttccg gaatatatgg acccaacgaa ccaatttggt 660 gagttgacgt tcctccagct gtccggaaag aacggaattc cacttccgat caacccgttc 720 attaccggaa aatcggtaga ggcatgcgca ggtggttcaa ttgagagcgc gaagtccgaa 780 gcgcagggtt cgaaatacac tctgcgagtt cgagacccag ctcaagtcgc caagctgctt 840 aagctaacaa agctaattga cggaaccgag gtggaagtta tccctcatcc gaatttaaat 900 gtcagtaggt gtgtcatctc ctgcatcgat ttgatccaaa tggaggaaaa ggacatccta 960 caggaaatga ttagcgagaa ggtcatccgt gtgcaacgga tcacccgtaa cgaaggaggc 1020 agaagggtga atactccggc gttgatcctt actttctgta agacaacgta tccggaatat 1080 ctgaaagtcg gcttactacg cgttcccacc cgtccctatt ttcctaaccc aatgttgtgc 1140 tacggctgct tcagctacgg tcacactcgc gttcgttgtc ctggcccaca acgctgtgcg 1200 aattgctcgc agaacttcca tggggaagaa tgcagtgaag ctccgtcgtg ccgaaactgc 1260 aaaggtgacc atcgttcaac aagtcgtcaa tgttcggtgt ataagaaaga agtggaagtg 1320 atcaaactaa aagtgagcga aaacttgagt ttcccggagg ccagaaaacg cgtggaacag 1380 caaggaggta gttacgccca ggtagctgca caacagaacg tctttgaaaa aaagctgaag 1440 gagttggaag tggctatgtt ggcgaaggat aaggagatag cgaagctgca cgaggaaggt 1500 aaacggaagg atgaaaggat cgagcagatg atggctttca tcaagcaagt caaacaacaa 1560 tcgagccaag agagaaccca ccacgtgagt gaaaccgtcg tcgttgagaa gccccgccac 1620 agccgagagc aacgagtggt ccagtcgacg gctggtccaa tgacacggtc aagaaacaac 1680 tccccggccg ttcaggagac aaagcgcgga agaccttcaa aattcgtcta ccccaaatca 1740 gccacctcgc cagacaccag cccgcctccg aagaagaccg cacccactac tcatgaccta 1800 actcagatgg agtattccgg cgaagagtct gaggtatcgg agacaccccc taaccagcgt 1860 cttcgataac cccactctcg aacaacgctc ggaatacaac gacgaccctc gcacaaaaac 1920 ggacaacgag acttttcggt tcggaagtag ggaaagtgta gtacctcttc cgacgcaagt 1980 acggcaggct gaaggagtca tccgggacgt cttaccccaa cccgttgatg atgagttcac 2040 ctccgtagcc gatggaatcc cacacgacac cacgatattg cagaaccacc gaggtaccat 2100 tcactctgta ccaacatcaa gcaacagcat cgaaaccaac aacgagaaca ccgacgatac 2160 tacacaaagt ctttccccag tgccggctat tgccggtgtt tctggaagca gtcgtgtagt 2220 agtttcggca atggctggca ctgtggggca agacgtccca caggtcagtc ttataaattc 2280 tcaaccacaa acaggtactg ccgcgcctgc aatcttttca gacacccacc aaaccacatc 2340 aaccgctgca aatcccaatc gataccaacc agcatcaacc taccacctcg ttttcaacaa 2400 ccacctctcc gacgatacta cacaaagtct ttccccagtg ccggctgttg ccggtgtttc 2460 tggaagcaat cgtgtagtag tttcggcaat ggctggcact gtgggacaag acgtcccaca 2520 ggtcagtcta tctaaattcc catcgcaaac ggttgctgct gcgcctgatt atctttcagg 2580 gtacaccgca tcaccaccat ctcccggaga aggcccttcg agtgttcatc gcagaaacat 2640 tacacaaagc gcttccccag tgccggctgt tgtcggtatt ggcagaacta gtgtaatggt 2700 ttcgacagca gctggtactg tgggacaaag cgtcccacag gcaagtgaca gtttggttct 2760 ttcgtcgtca gaggttaccg cactccctct ttctcccaca acagacccac cgatccgaag 2820 aacatcaacc tcgtcatcca ccacacgatc aaccacggat agccgttccg ggtactgctt 2880 cgccctccag tggaatatcc gtggtctacg sgccaacatc agcgagctaa agctkctcat 2940 ctccgacctc gaaccgtgtg ttgtagcttt gcaggagacc aaggtgaacc agactgttgt 3000 tccgccagac ttcgtcggta gaaactatac gttgctgcta cagtcgagga acgctaacta 3060 ctggcaacat ggtgtaggcc ttgctatccg ggaaggcata cctttcgagc gtttacaaat 3120 cgacaccacc ttgcacctcg tcgctgcacg catcttcgca cctatccaaa tgaccgtggt 3180 ctcggtgtac atcccaccga gtgcagtcca ttgtcagacc gcattgggag atctgttaga 3240 gtatctagag ggtccggttc ttcttctcgg ggatttcaac gcgcaccatc tcgcgtgggg 3300 ctctcaccag tcaaacgcgc ttggccgatt cgttgccgaa acaacgttgg acaaacaact 3360 ggtgatcctg aatgacggct cctccactcg tatcgacccg gcaacgggta aaacctcagc 3420 aattgacgtg tccatctgtt ctgagagtct ggcgcgaaag ttcacgtggc ggaccttgtc 3480 cgacacacac aacagcgacc actttccgat aacggtgtgc atccccggat ggtccaatcg 3540 tcgaacaaca cgacagaagt ggctgtacga ccaagctgat tggccgttat acgaacgcct 3600 catagcggaa accattcgcc cagatgtcga gtgggacgta gaaagcttca ctgagaagat 3660 aattgcagca gcgaaaaagt ctatccctcg ttccagtggc cgtgttggac cgaaagcggt 3720 accttggtgg tgccctgagg taaaagcggc aattcgccag cggagaaaat gtttgcgatc 3780 ccttcgacgc cttcaacagc gcgatccaaa tcaacccgaa gcattggcga gcttccagga 3840 agctaaagca gaggcgaaaa aagcggtcac ccaagcaaag cagcaatcgt gggaagattt 3900 cgtggcaaaa atatcaccca gcagcacgac gaccgaactg tggaatacag ttaatacatt 3960 gcgtggcaac cgacgacagc gaccggtggt acttaagaag tcgaacggat ttacagacaa 4020 tcctgaagaa gcagcggaag aactggcgaa gtattacagc gagagatcgg cgacttcaag 4080 ctatcctcca tcgttccaga tggcgaaatc ggcagctgaa cgggagtcca tggatttttc 4140 gcccaacacc ggcgatgtgt ataacatcga catcacccta gccgaacttc tgtgggctct 4200 cgacaaaggg cgaggctcct caacagggga tgattctata gggtaccctc ttcttcaacg 4260 tctccctctg tctgtgaaga tgacgcttct tgaggtcttc aatggaatct ggcgaagtgg 4320 cgagttccct gccagctggc ggaatgctat cgtcgttccc atcccgaaac ctaactgcca 4380 ggattccgga ccggctgcct tccggccaat atcgctcacc agctgcatgg cgaagctttt 4440 cgagcgtata atcaatcgtc gtttaattac agaattagag tcgagcggtc gacttgacaa 4500 gcgccagcac gcctttcgta caggacgtgg caccgacacc tactttgccg agctggagag 4560 atcgcttcca aatatcgacg aacactgcct catagcctcc ttagatctgt cgaaggcgta 4620 tgatacgacc tggcggcacg ggattgttcg cactttaaag tcatggcgaa tacgtggtcg 4680 aatgataaac atccttaaaa gttttctctc tgagcggacg tttcaggtgt ctgtaggagg 4740 acatttgtcc ccccaacacc agcttgagaa tggggtacca cagggttccg tgttatccgt 4800 aacactgttc ctcgtcgcga tgcaacccat cttccgaatc ataccggcgg gtgtagaagt 4860 tcttttgtat gctgacgaca ttctcctcgt tgtacgcgga gcgaaaaccg aaggattgca 4920 ccgaaaacta caggccgctg tgaaagcggt tgacaagtgg gcaaagagcg taggtttcgg 4980 tatatctgcg acaaagtcga attgctttta ttgcagtccg aatgcgcgtc gagaaccggc 5040 aaaagagata actatcgatc gaacagccgt tccgaggacc aatcgtctga gaatcttagg 5100 cattaccctg gatcggacac ttacatttaa gccccactgc acaacggtaa agaaagcgtg 5160 cgagtcgcga ttacgtatac tgcaaatgat cggtgccaaa ttgccccgtg gaaatcgttc 5220 ttccctgctg caggttggat cggcactggt cacctctaag ctgatctacg ggatcggact 5280 ggtaagtcga ggaggaccag catctctaca gatactcgcc ccggcgtaca acaaaatggt 5340 ccggtatgct tccggagcgt tcgtgaccag tccaatcaac tcagtcatgg ctgaagtggg 5400 tactttaccg ttcgaccttc tggcggcaca gtctacagcg cggacggcca ttcgaatcgc 5460 agcaaaaagt tctgataaca acaaccttcc tcttatccgc cgtatatcag accgtttggt 5520 tgaacttacg ggaacagcgc ttcctaatgt cagccacctt gtacgacaga gcgcccgcga 5580 gtggcatgcg cgaaaaccga cgatagtttg ggatgttaag aggagcatta gagccggtga 5640 cccaacggaa aaagtccgcc cggttgtgca acatctgctt tcgacgcgct tccatcaatc 5700 gaccgtggta tacaccgatg gttccaagtc ccagaacacg gtaggtgctg cttggcatac 5760 tacaggtctg gctggtttgt acagtttacc ggagcagtgt agtgtctttt ccgcagaagc 5820 ctacgcctta aaaatggcaa tctcaattcc gaacgtacct agagagttgg tgatccttac 5880 ggattcggca agttgcctcc aagcgctgga ggcgggtaaa tccaagcacc cctggattca 5940 ggaaattgag cgaacagcaa ggaccaagtc gatcagattc tgctggatcc ctggacatgc 6000 aggtatcagc ggcaacagcg aagccgaccg actagccaat gaggctagga ggcaaccggt 6060 catcgatacc cccctaccag gagaggacgc attgagagct gtcaagcagc taatacggtg 6120 caaatgggat gaccagtggt tcaacactcg tgattcgaag cttcgggaag tgaagcaaga 6180 tacgcttcgt tggacggaac tcggaagcgc agccgaccag cgtgtgctta cacggctaag 6240 gatcggtcat acgcggctta cccatacatt tttgcttaaa agagaacccc caccaacctg 6300 cgaatgttgt ggaacaacgg tggacgtgcg acatttaatt ctgcaatgca gaaaatatga 6360 aagggagagg gagcagaaca atgtgagcac gacaagcttg agagacgccc tagcgaatga 6420 agaagaagca acgaaaaatt tattaaagtt tctacatgaa acagggcttt acaaaaaatt 6480 gtaatgaaac aagaattgtt ttatgaattg taaatttaaa accacttcag ttcgacacga 6540 atgcaccctg gtgtaaagtg tcgttaatac acaaaaaaaa aaaaaaaaaa aaa 6593 // ID BEL-69_AA-I repbase; DNA; INV; 6198 BP. XX AC supercont1.21; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-69_AA_; KW BEL-69_AA-LTR; BEL-69_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6198 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.21; Positions 3460765 3466962. XX CC 'TAAAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..5178 FT /product="BEL-69_AA-I_1p" FT /translation="MDGHHSRSGYTCKACHRPDSSSAHMIICDQCRLWEHF FT SCTGESESVPSRPFICRQCRGENAAGSTISSRLRSQAKRAPSGNVSGGKPA FT SNIGSKISSVASSSRSSIVKARLELIEEEERMKQKELEEEEAFKKLEHEEA FT QRQLEEKKKRMEEQRKLAEEESLLRQSKLVAEKARLLKQQSIRRESLEKKN FT EVILQISERGSVVESATSSREKVASWLTANMPNGKSDRNTLGNHGNLQYDS FT VPAAPHSVASLIINVPSRVTEIPSELPPAPQVAETHNSSPRQRFPGPTGIP FT HVSGRIGYHVRSTTIREPFQQFVTSSRPLQTNPLPVHPYPPSIHSSRGHSD FT GGEHTPQIFRRQEPSWTTPGLCDAIHDPPLTRRSVEPNLFSHESRPEQFRH FT LGGPTVNTAMRANNHEPIGQAGVLSTQQIAARQVVGKDLPHFGGNPADWPM FT FISSFEQSTVACGYSNAENLARLQRCLTGHAREAVRSRLLLPANVPQVMNT FT LRTLYGRPELLIRSLHEKIRRTPGPRHDRPESILEFGLAVQNFVDHLQAAE FT QEEHLANPMLMQELVEKLPGPMRMDWATFKNLQPRATVITFGEFMGKLVSA FT ASEVSFELPGFQKATSSEKLQRPREKARIQAHSTADIPAPKPAAENTRKSP FT KSCRICDREGHRVAECYKFKQLNLEERLKEVQDKGLCRTCLNNHGKWPCKS FT WQGCGVSGCRLKHHTLLHSSSSSTPTTHSVNVSSRKFPMDDDQTLFRVLPV FT VLYANGKSVTVFAFIDEGSQITMLEEKVANELGLSGPRKPLTLQWTGNVKR FT NELRSEEVSLEISGKRNDKRYDLHQARTVSCLLLPAQRFNYREMTTRFPHL FT KGLPIEDYDLAQPKLLIGLDNLRLGVPLRLREGGPFDPIAAKCRLGWGVYG FT RTYAKPIPRAIVNFHTAAAASSDALLNEQLRDFFTLENNGIVGPNVMLESD FT EDKRARKLLEDTTRRTATGFETGLLWKFDDPEFPETYPMAIRRMKALEKKL FT GCNSTMMQRVGEQISDYERKGYIRKVSESEVTAIDPQKTWFLPLGVVINPK FT KPGKIRLIWDAAAKVAGVSFNSFLLKGPDLLTPLPRVLSGFRLFPVAVSGD FT IREMFLQIKLQASDRNSQMFVFRRTPEEPVQVYAIEVTMFGSTCSPSSAQF FT VKNANAEQYAQHYPRAAAAIKEHHYVDDYLDSFRTIDEAVRVVSDVKYVHS FT MGGFEIRNFLSNEDEVLRRTGEIEPDSTKDFALVRADAAESVLGMRWKPAD FT DVFTYTFSMREDLKLILDENHIPTKREVLKVIMSLFDPLGFVAFFLVHGKV FT LMQDVWASGIDWDQKINEQLFTRWRQWVAFFPQLDSLQIPRCYFNSPFPDN FT FDRLELHVFVDASDSAFACVAYYRIETENGIHVALVCAKTKVAPLKVLSIP FT RLELNAAVLGVRILEAVEGFHTYSISRRVLWSDSTTVLAWIRSEHRRYNKF FT VAVRIGEILTSTDVREWRWVPSRSNAADLATKWKNGPDLTSMNPWFAGPPF FT LHRPEEFWPQQTVVTTTCEELRPIHIHSTSPPLIDTTRFSNWTRLQRTMAF FT VIRFVDNLRRKRSGASMQQDVLVQDELRRAETALWKIAQAESFPEEIEILS FT ETEGSPEVRHRYAAKSSPIYKNCPFLDDHGVLRQRVNVAESMPQLLYQWNS FT STQQYFRGNIRSRSCLWIGIIGVSAMPITRPSPTKFVNASKFPSYVH" XX SQ Sequence 6198 BP; 1660 A; 1518 C; 1610 G; 1410 T; 0 other; ttctaaaaga atttttgtac gtgggtgaag atggacggac atcacagccg atcgggatat 60 acatgcaagg catgccaccg cccagactcg tcaagtgctc atatgatcat ctgcgatcaa 120 tgccgcctgt gggagcactt cagctgtacc ggcgagagcg agtccgtgcc gagtagaccg 180 tttatctgca ggcagtgccg gggagagaat gcagcaggat cgactatctc ttctcgactg 240 cgatcacaag cgaaacgagc gccgtcaggt aacgtgtcag ggggcaaacc tgcttcaaat 300 attggcagca agatttcgtc ggtagcatcg agtagccggt cgtcgatcgt gaaggcgcgg 360 ctggagctaa tcgaggaaga agagaggatg aaacagaaag agcttgaaga agaggaagcg 420 tttaagaaac tggaacatga agaggcccag agacaactgg aagaaaagaa gaagcgaatg 480 gaagagcaaa ggaagttggc tgaggaagaa tcgcttctaa ggcagagcaa acttgtggca 540 gagaaagcac gtctgctcaa acagcagtcg atacgccgtg aatcactgga gaagaaaaat 600 gaagtcattc tgcagatttc cgaacgtggc agtgtggtgg aatcggcaac aagttcccga 660 gaaaaggtcg cgagttggtt gacggctaat atgccgaacg ggaagtccga caggaataca 720 ttgggaaatc atggcaacct ccaatatgat tcggttcctg cggccccgca tagtgtagcg 780 tcgttgatca tcaatgtacc ttcaagagta acggagatac cgtcagaatt accacctgca 840 ccccaagtgg cagaaactca taattcttct ccacgtcagc gtttccctgg gcccactggc 900 attcctcatg tgagtggtag gatcgggtat cacgttcgat ctactacaat ccgtgaaccg 960 ttccagcagt tcgtaactag ttcaaggcct ttgcaaacaa atcctttgcc agttcaccca 1020 tatccaccgt cgattcattc atctcgtgga cacagtgatg ggggtgaaca cactccacag 1080 atatttcgcc gacaagagcc ttcttggacc acacctggtc tgtgcgacgc aattcacgat 1140 ccacctctaa cacgtcgatc cgtagagcca aatcttttct ctcacgaatc tcgtccggag 1200 caatttcgtc acttgggtgg accaacggtg aatacagcga tgagagcgaa caatcacgaa 1260 ccaatcgggc aagcaggggt tctcagcacg cagcaaatag ccgcgagaca agttgtcggc 1320 aaagatttac ctcacttcgg cggcaatcct gctgattggc cgatgttcat cagcagtttt 1380 gagcagtcaa ccgttgcgtg tgggtattca aatgccgaaa accttgcgcg actacaaaga 1440 tgccttaccg gacacgctcg agaagcggtg cgaagcagac tattgcttcc ggccaacgtt 1500 ccacaagtga tgaacacgct ccgcacgctt tacggtcgtc ccgaacttct tattcggtca 1560 ttacatgaaa agattcggcg gactccagga cctaggcatg accgcccgga atccattctg 1620 gaatttggat tggcggtgca aaatttcgtc gaccatcttc aggcggctga acaagaagaa 1680 catctggcca acccaatgct catgcaggag ctggtcgaga aactgccagg ccctatgagg 1740 atggactggg ccaccttcaa aaatcttcag cccagggcta ccgtcataac gttcggtgaa 1800 ttcatgggca agctggtcag cgcggccagt gaagtgagtt tcgaacttcc agggtttcag 1860 aaagcgacga gcagcgaaaa attacaacgt cctcgtgaga aagctagaat tcaagctcat 1920 tcaactgcag atattccggc tccaaaacca gcggcagaaa atactcgcaa atcaccaaaa 1980 tcgtgtcgaa tatgtgatcg tgaaggccac cgagttgctg aatgctacaa gttcaaacag 2040 ctaaacctag aggaacgatt gaaggaggta caggataaag gtctctgccg aacgtgcctg 2100 aacaaccacg gcaaatggcc gtgtaagtct tggcagggtt gcggagtctc gggatgccga 2160 ctcaaacatc acacgctttt gcactcgtcg tcttcctcta cgccaacaac ccactccgtg 2220 aatgtttcgt cgaggaaatt tccaatggac gacgatcaaa cgctgttcag agtactgcct 2280 gtagtattgt acgcaaacgg caagagtgta acggtattcg cttttatcga cgagggttcg 2340 cagattacaa tgctagagga gaaagtggca aacgaacttg gtctctccgg gcctaggaaa 2400 ccgttgactc tacaatggac tgggaacgtg aagcgcaatg aactgcgatc ggaagaggtg 2460 agcctggaaa tatcgggtaa acgaaacgac aaacgttatg atcttcacca agcacgcacc 2520 gtgagttgtt tgcttcttcc agcgcagcgt ttcaactata gagaaatgac aacacgtttc 2580 ccacatctca aggggcttcc gatcgaagac tatgatctag cacaacccaa attacttatc 2640 gggctagaca atcttcgcct cggcgttcct ttgaggctac gagagggagg gccatttgac 2700 cctatagccg ccaaatgtcg actaggttgg ggagtctatg gccgcacata tgcgaaaccc 2760 attcctaggg cgattgtgaa cttccacaca gcggccgcgg cgagttcaga tgctctgcta 2820 aatgaacagc tacgcgattt tttcacgttg gagaacaatg ggatcgtggg cccgaacgtg 2880 atgctcgaat cggacgagga caagagagcg agaaaattgt tggaagatac cacaaggcga 2940 acagcaaccg gattcgaaac cggactcctc tggaagttcg atgatccgga atttccagaa 3000 acctatccga tggccattcg acggatgaag gctctcgaaa agaagttggg ttgtaactca 3060 acaatgatgc agcgagttgg tgagcaaatc agcgactatg agcgaaaggg ttatattcgc 3120 aaagtcagcg aatcagaggt gacagctatc gatccacaga aaacttggtt tctgccactt 3180 ggagtggtca tcaaccccaa gaaaccgggt aaaattcggc tgatctggga tgcagcagca 3240 aaagtcgctg gggtatcgtt caattcgttc ctgttgaaag gtcccgacct cctcactccg 3300 ctcccgagag tacttagcgg attccgtcta tttccggtgg ccgtttctgg agacattagg 3360 gagatgtttc tccaaatcaa gctacaagct agcgatagga attcacagat gtttgtgttt 3420 cgtcgcactc ccgaagaacc tgtacaggtg tatgcgatcg aagtcacaat gttcggttcc 3480 acctgctctc cttcttccgc ccagttcgta aagaatgcca atgccgaaca gtacgcgcag 3540 cattatccac gagcagcagc cgcaataaag gagcaccatt atgtggatga ctacctggac 3600 agtttcagga cgattgacga agcagttcgg gtggtgagcg atgtcaagta cgtccactct 3660 atgggtggat tcgagattcg taacttttta tcgaacgaag acgaagttct gcggcgaact 3720 ggggagatcg aaccagattc taccaaagac ttcgctcttg tacgtgcaga tgcagccgaa 3780 tcggttctcg gaatgagatg gaaaccagcg gacgatgttt tcacgtacac cttttcgatg 3840 cgtgaggatt tgaagttaat tctcgatgag aatcacattc caacgaaacg cgaagtcttg 3900 aaggtgatta tgagtctctt tgatcctttg ggattcgtgg cgttcttctt agtccacggg 3960 aaagtactta tgcaggacgt ctgggcctcg gggattgatt gggatcaaaa aatcaacgag 4020 cagctcttca ctcgttggcg acaatgggta gccttcttcc cgcaactaga ctcattgcaa 4080 atcccccgct gctacttcaa ctccccgttt ccagataatt tcgatcggtt ggagttacac 4140 gttttcgtcg atgccagcga ttctgccttc gcctgcgtag cttattaccg aatcgaaact 4200 gaaaacggca ttcatgttgc gctggtttgc gcaaaaacta aggtggcacc tttaaaggtc 4260 ttatcaatcc ctcgccttga attgaatgcg gccgtactag gagtccgaat tttggaggct 4320 gtcgaaggat tccacactta ctctatcagc cgccgagttc tatggagcga ctctaccacc 4380 gttttagcct ggattcgatc agaacatcgt cggtataaca aatttgtggc agtccgtatc 4440 ggtgagatcc ttacatcaac tgacgtacgg gagtggagat gggttccatc tagatcgaat 4500 gcggcagacc tcgctacgaa atggaaaaat gggcctgatc tcacctctat gaatccatgg 4560 ttcgcgggtc caccattcct gcatcggccg gaagaattct ggcctcaaca aacggtagtc 4620 acaacaacgt gcgaagaact gcgaccaatc cacatccatt ctaccagccc accattaatc 4680 gacaccacac ggtttagcaa ctggactcgt ctacagagaa ccatggcgtt tgtgattcga 4740 ttcgtggata acctacgccg taaacgcagc ggtgcttcga tgcaacagga tgttctcgtc 4800 caagatgaat tgaggcgtgc tgagacagcc ctttggaaga ttgcacaggc tgaatcgttt 4860 ccagaggaaa tcgaaatctt gtcagaaacg gaaggttcac cagaggtccg tcatcgatat 4920 gctgctaaaa gcagtccaat ttacaaaaac tgtccatttc tcgatgatca cggtgtgctt 4980 cgtcaacgtg tcaacgtggc agaatcgatg cctcaacttt tgtaccaatg gaattcaagt 5040 acccagcaat acttccgagg caacatcaga tcacgttcct gcttgtggat tggtatcatc 5100 ggcgtttccg ccatgccaat cacgagaccg tcaccaacga aattcgtcaa cgcttcgaaa 5160 tttccaagct acgtgcacta gtcgagaagg tcacaaaggg ctgcgtggtg tgccggatta 5220 agaaagccct accgagtcct ccagcgatcc gtcgacaacc gttcatccga ccatttacct 5280 acgtaggagt agattatttc ggtccaatac tggtcaaagt tggaagaagt caagtgaaaa 5340 ggtgggtggc tttattcacc tgccttacag ttcgcgcaat acacttggaa gtagtgtaca 5400 gcctctccac tgagagttgc tggatctcca gccgaattcc attcgggata atggaacctg 5460 ctttcaagga gcgagcaacg aactgcaacg agagacggaa aagcggaatg ccgctttagc 5520 taccacgttt acaacaacca agacccggtg gtgtttcata ccaccagcgg ctccccatat 5580 gggaggcgca tgggaacgtc tagttcgttc cgtcaaggta gcaataggat ctatagcaga 5640 cagcccacgt agaccagatg acgaagttct ggagaccatc tgctcgaagc cgaggctctg 5700 ataaacgcgc gtccgctcac ctatattcca ctcgagtcgg cggatgagga agctcttaca 5760 cccaaccact ttttgctggg ttcctcaaat ggtgacaagt tccttccaaa tgaacccgtt 5820 gagtcaccgg ccgtgctacg gagcagttgg agacttgcgc aggccataac gcaacgcttt 5880 tggacaaggt ggctgaagga atatcttcca gtaatcacgc ggagatcgaa gtggtttgga 5940 gaggtccgag atatagctgt tggagatttg gtattggtgg tgaatgggac agccaaggat 6000 caatggacga gaggacgtgt tgaagctgtg atgcctggac gtgatggacg tgtccgccaa 6060 gcccttgtcc ggacggcttc cggagttcat cgacgaccag cagtaaagtt ggcggttctt 6120 gacgtggttg aacgaagtga acccagattg gagactcccg aagagtcggg tcgtaactag 6180 tgttcacggg agggggga 6198 // ID Gypsy-5_OD-LTR repbase; DNA; INV; 212 BP. XX AC CABV01000629; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_OD_; KW Gypsy-5_OD-I; Gypsy-5_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000629; Positions 20780 20569. XX SQ Sequence 212 BP; 54 A; 52 C; 40 G; 66 T; 0 other; tgcaaagact cgcccacgta agatttatct gatcgggcgc aaccccggtt gggctcgagc 60 aggcatcttc attcgctttt tatccttcct ccgagattca ttatttatta ttttattctc 120 tcagaataca ttcttcaagt caactacgaa tccaatagag tatttgttag agagcaccgc 180 aggcattctt ctgagtggaa gcaaaccttg ca 212 // ID Transib5_AA repbase; DNA; INV; 1033 BP. XX AC . XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 29-JUL-2005 (Rel. 10.05, Last updated, Version 2) XX DE Transib5_AA is a DNA transposon, a partial consensus sequence. XX KW Transib; DNA transposon; Transposable Element; KW Interspersed repeat; DDE-class; TRANSIB superfamily; KW Transib5_AAp transposase; TRANSIB4; Transib5_AA. XX NM Transib5_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1033 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC Transib5_AA belongs to the Transib superfamily of DNA CC transposons. CC The consensus sequence is not complete; termini are not known. CC Transib5_AA encodes remnants of the Transib5_AAp transposase. CC The transposase is not perfectly recovered due to available CC sequence data. XX FH Key Location/Qualifiers FT CDS 2..1030 FT /product="Transib5_AAp" FT /note="transposase (conceptual translation)" FT /translation="KQFSSKKPFVLPNSIELHLPSTSGGSKGRKKIPFQDS FT SDCVKRRKTAMLRATHNADELSYAATMKLREEGKRAQASVLQMATQSSPSH FT VAHLLNTSKISAKFDIVPYTPDEALALICNAKLTLSQYKELRINAKDRNAD FT LYPSYKKVLEAKKRCYPEDAAITINDTEMDIKLQALLDHTAHRIVLANSDN FT LINFSDEELKYITLITKYGFDGNSGVSQFKQTFTGDDGTKSDGSIFITSVV FT PIQATGHSKTLFKNPRPSSTRYCRPLRIQYTKENTELILAEKAYLDDQISA FT LKPSTVEHMGRTIIINHKLLITMVDGKVCNALTSNKSSQRCLYMWCNAFAD FT " XX SQ Sequence 1033 BP; 346 A; 215 C; 192 G; 280 T; 0 other; caagcagttt tcttcaaaaa aaccatttgt cttgccaaat tccattgaat tgcacctgcc 60 ctccacttca ggaggctcta aggggaggaa aaagatacca tttcaagata gctctgattg 120 tgttaaaagg cgaaaaactg caatgttaag agctacgcat aacgcagatg agctttctta 180 tgcagcaact atgaaacttc gcgaagaagg aaaacgtgca caggcttcag tattgcagat 240 ggctacccaa tcaagcccaa gccatgttgc acatttactt aataccagta aaatttccgc 300 caaatttgat atcgtaccat atacacctga tgaagcatta gcacttattt gtaatgctaa 360 gcttacacta tcgcaatata aagagttgag aataaatgct aaagatcgta atgccgattt 420 gtatccttca tataagaaag ttcttgaggc gaaaaaacga tgctaccccg aagatgcagc 480 tatcaccatc aatgatactg aaatggatat caaattacag gccttgcttg atcacacggc 540 acaccgcatt gttctggcca attcagacaa tttgattaac ttttcggatg aagaattaaa 600 atacattacc ttgataacca agtatggttt cgacggaaat agtggtgtat ctcagtttaa 660 gcaaacattc accggtgatg atggaacgaa gtctgatgga agtattttca ttacttccgt 720 agtaccaatc caagcaacag gacattctaa aactttattt aagaacccac ggccgtcgtc 780 tactcgctat tgcagacctc tacgtattca atacacgaaa gagaacacgg aactgatttt 840 agctgagaaa gcatatctcg atgaccaaat atctgcttta aaaccatcaa cagttgaaca 900 catgggacgc acaatcatca tcaatcataa gcttttgata actatggttg atggaaaagt 960 atgtaacgct ttgacatcga ataagtcttc gcaacgctgt ttatatatgt ggtgcaacgc 1020 cttcgcagat gaa 1033 // ID Gypsy13-LTR_Dya repbase; DNA; INV; 2454 BP. XX AC chr2h; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13_Dya; KW Gypsy13-I_Dya; Gypsy13-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2454 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1091-1091 (2009). XX DR Genome; chr2h; Positions 787618 790071. XX SQ Sequence 2454 BP; 965 A; 419 C; 399 G; 671 T; 0 other; tgtaatgtcc ccatatatca agattttttt ttaaatcccc aaaatttgcg ccttagttag 60 tagcgataat aattatcgga accttatctg gcaacgtcat ctaaatggca attgaccagc 120 tggtgaatat tgattcggag aaaagttctt cggatcccca gcatctagct tcgatctttt 180 cttacgacga tcgtcaagct tataagttca gtcccagcag agataagacc tagcaaaaag 240 gcaaatcaag aaaaaagaac tggagagtac aatcacatgt gctcccaact ttaatccagc 300 gtttatttcg gcatcaagat taaaaggcca caaagaaaag gcaattaata aatgcccgtg 360 gattgccgtg tcggcaaggt aaatatttca atgaaaattt caaaaaaatt accaaattat 420 aataacagtt gtgcacgttt aggtcacgcg tgttattttt tatcgtcgac tacacagcga 480 gaaatcgcca ggtcgatcga ttccactaaa gaaaaaaatc actgggtaaa tatccaagta 540 attctgggga aaaattgccc gaaatgaatg agttatattc ccgcagttac ccgaggaaac 600 gaagctgaat tttaaactct tccacgccga cccaaaagtt cggctggagt cagctacctc 660 tgctgggaga caaattcttc ggaggtcgca ggaaaaaaat ctacagaaaa acctacatag 720 aaataaaaac ccttacaaaa ctagtgtaaa acaaaaacag attaaaatac catcagtgcc 780 taaaaaccca aaaattaatt gaagttaaac atattacatt gtacatatta cataaaacaa 840 aagcatttga tagtgatgtg agtgtgtgca ggtgataagt aaaataagag aaaaaaagaa 900 atacaagctg gccaaaccaa tgtaaaatcg aacactgaaa gaaaaacaaa gtttcgttgc 960 caatgccaaa aataaaatac ttcaaatcaa aaataaaaat aattcatcat gacaatataa 1020 acaaaaaatg aaatgaaaaa aattcataaa aaaaaaaatt gaaataaata atcctatatt 1080 ttaattatca cttatgaaaa aaactaaata gtgcatttat agtttgaagt tttcttacta 1140 ataaccccca acaccgcatc gacaagaagg ccttcttcga gaaccaacag tgggtgagta 1200 aaattaaaaa aataaaatat tatatatata accaaaacaa aactaaaact tcaagttact 1260 aaattgcaat tatgtagttc agtgtagagc ctggctacct gagttaaggc ttatcaaccg 1320 ctcgcgccga ttaatttgtc tattttgtag attactcttt tgattcttat ccaaattcta 1380 acatggtaat ttaaagcact ataatgtaac gtaaacattg acgactacac ggaaagaaat 1440 taatattgac atgattggca cctacatatt ttctcccttt tagcaataga aagctttttc 1500 tttcaatgat gcgattaaat aaataaactc catagaaaaa aaaaaacact taatagttgt 1560 tcaggattaa taaatactta acaattacaa cataaaaaaa aaaaaaaaaa acccacacca 1620 cttcaatgtt cttttgttgg taacaaacca gaaaaagaaa agcttggcac taaaaaaaat 1680 aaaacaaaaa ataaatcaat aaaaaaaaat aaaaatgaat agagattttt gcgaaaaata 1740 aataaagata aataatgtaa aataaggagc aagatacgga atcagactta gggaattaag 1800 agtatggaat tataagatta aaactagtaa ggctaattag ttagaaatat gtacactgca 1860 agaaatgcac tcaacaagaa agaagaaatt gtttattgaa aattaaaatt gttgtaaaga 1920 tgaaactcta agttcttatt catggcaggg gagcataatt agatgatgca tatttaacta 1980 catttggggt cttctaaaaa aaggttacac tgaagactat ggttgggagg gtaatgatag 2040 tgggagtgga aattaacatc taaaaccacc ccgccaggct tacatccaaa ttcgtgcttc 2100 cttctctcac attgtagttt aattaagttt cgcttgctgt tcattttcta aaagatgaaa 2160 ctgagacagt ggagttgttt gttttttttt ttttccgctg ttcgtttata tgaggattgt 2220 ctcattgttt aaattcagga tggactctat aataaaacct tttggtgtag caatcattat 2280 ccacagcaaa aaaaaccccc cccacatctc ccgctagagc taggaatcaa tgggtcccgc 2340 tcagctgatg gttaatggtc gacaagcctg aaaagggtga acattttgga atcgaatcgc 2400 caacaaggat acccagtaaa tgtttctctt cttcaacggc gctcttacat caca 2454 // ID Gypsy-65_CQ-LTR repbase; DNA; INV; 1516 BP. XX AC AAWU01038188; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-65_CQ_; KW Gypsy-65_CQ-I; Gypsy-65_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1516 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 510-510 (2011). XX DR Genome; AAWU01038188; Positions 2084 569. XX SQ Sequence 1516 BP; 394 A; 393 C; 427 G; 302 T; 0 other; tgtggcaatt atgtagatta gtcccaaatg tgtagtaatt agtaagttat tgtaatcaaa 60 acaagcaaac gccgcgcagc gtgcatgtgc aaaaacagaa aaaacccctc gctcgaagcc 120 aaattcacca cacgccaatc gggcaacact acacgcacac aagcgtccgc aaccatttcc 180 gccgccagcg gcaaagtaat gctgttcgcg tctcgaaagg agaaggtcgt ggaaaaagac 240 ctccgcgtgt gtcgcacgat tttggaccgc acgtgcggat tcctcgcggg agattgttgg 300 gccttcgttc ggcggctgaa accgcgcaat cgtcgtgagt tgcgcatcgg tgggagctcg 360 gtggccactc gaactggacg gatttgtccc tgaagacgcc gcaaagtaag ccacgttttg 420 acgatcagac tcgcccggag aataagtcat cggtcaaagg taattcgcaa aatctgccta 480 aagtccaccg actcacaagg attaaccctc ttcgcagcgc gccaactccg aaccaatctc 540 tgcaatccac ccgaagggag gtcccgccgg aagaggaatc aagcaggtaa aagaatgaga 600 gcgttctgct cagaatagtc gcgctaaccg tagaaaatca tccacagtga tcgctgtgcc 660 aatctcgctg tgggaatctg tgtcgcgcaa gacccatctc cgaagaacgc cgccaaaagc 720 aaggtaaaca gttgcactaa ggtcccgcag cacagttctg acactttgcc ttcctcagac 780 ctgaaaacat ccacacggtg gcagccagtc cgtgtgcaac aaaaggagga agcgcaacac 840 gaacacgcac acaccatcaa ccagaacgca aaaggccccg gaaaagggtg agaggaaaaa 900 agggaccgtt gtagcaaccg gccactttcg ttgtttcatt tagggccttt tcggtgggcg 960 ctcgacttcg gctgctggag gctggtgcac gattgctggc gcgcgacgga acatcccgtc 1020 ggtaggcgga cggtgcggta cgagtcgccg ttccaacacg gcgtacgtgc gtcattgagg 1080 gcgctgggcg agacaacgct aggcgttgtt cgcgagggcg cagatgagga gcagcctgcc 1140 aagctgtggt cgcacgtggg catgctgtaa cgcgtggcgg tcaagggtga actcacgagc 1200 cggcgggcac gccaacggcg gaatgcgggt tgcggcaaga gtgagtgcta gaaattaagc 1260 gggtgcgcac ccagaagaat aagagcccaa agggcgagaa gaagaagcac acgcgcgcga 1320 agcgccatct ttgcgcatgt ttcgctaggc ttaggcacac acacgcacgc acatttgtac 1380 cgtttttttg aggaaataac atgttttgag cacatttttg ttgggttttg ttgattagtt 1440 atggtgcagc ctgaatgtcg cttctttacc tgccaatttt gagatttacc gctttagttg 1500 tgaaaatagt ggaaca 1516 // ID REP2-2_TT repbase; DNA; INV; 6282 BP. XX AC AY371729; XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 03-JUN-2011 (Rel. 16.05, Last updated, Version 2) XX DE Tetrahymena thermophila micronuclear non-LTR retrotransposon DE REP2-2, complete sequence. XX KW Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease domain; REP2-2_TT. XX NM REP2-2_TT. XX OS Tetrahymena thermophila OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Oligohymenophorea; Hymenostomatida; Tetrahymenina; Tetrahymenidae; OC Tetrahymena. XX RN [1] RA Fillingham S.J., Thing A.T., Vythilingum N., Keuroghlian A., RA Bruno D., Golding B.G. and Pearlman E.R.; RT "A non-long terminal repeat retrotransposon family is restricted RT to the germ line micronucleus of the ciliated protozoan RT Tetrahymena thermophila."; RL Eukaryotic Cell 3(1), 157-169 (2004). XX DR Genbank; AY371729; Positions 980 7261. XX CC This sequence encodes two proteins with alternative nuclear CC genetic codes (UAA and UAG encode glutamines). CC ORF1(245-1480): gi|34761709|gb|AAQ82025.1| CC ORF2(1854-5294): gi|34761710|gb|AAQ82026.1|, apurinic-like CC endonuclease and reverse transcriptase. XX SQ Sequence 6282 BP; 2467 A; 1296 C; 1004 G; 1515 T; 0 other; aaaaaacttt ccaaaaactt taattaagat tcgaaaaaaa cgaagcttaa aagtaaaaca 60 gagaagtaca aattaattaa aaaatcagag gaaaaacaag aaaaagtcaa acaaactatt 120 aaaagatatc taaataacga gcaaaaccta ctgaaaatgt ctcaaagaag ataaagcaat 180 aagaagcaag agagcttagc aaacaagcaa aaacaggacc aaacaatcac catatctcct 240 aggaatgcat cagcggacac ttaaaacata gaactcaacc gaaacatctt agaaaaccac 300 acaaccaagc accaatcaaa aacctaatcc agaaatttct gaaaagaagg tagatggcag 360 gaagagaggt ttcagatcgg cctacaagaa taaggtagaa aaacaatatg aagaacagca 420 accagtgtca actccactcc catctgaaga gaattgttta aatctgctgt aggagcccca 480 aatggtcata aataagtagc caagacccga agccaaggag gttccaatca gcagcaatac 540 atcaatctcg taaagatcaa atagcaatga tagtagcagt ctaaaacaga accacaacca 600 ctagcatcga cactcaagca tctctagaca agaaagtaaa ccagccttgc aagaaccgaa 660 accggaactt ccaacctcca tcgatatcgg caataaaatc tggaaattgt gtagatttga 720 cgcaggcatg aatggatgca gaaacagttc ctaggattgc aagtttatcc atttggactt 780 gatagaacag gtagctctaa gcgacaaaaa gaagctagcc aatgtagcaa tctggaactg 840 ggactaagct aagctagaca gcaagtatca accttctgac aaggaccttc tcaactcctg 900 ggtaaactca aattcaaacg agggaaatcc cctcttcatc aaatacaaaa caatgattga 960 ggagaaggat cctgaattcg tctaactcac taaagatctc acagctgagt aaatggaact 1020 tgaagaactc taaaacgcat tccaagtcaa gaagcagcgc tacatgtcgg agctttcttc 1080 aagactaaga cagaaaattc tattcaaaag caggcaaaag gcctttgata aattgctttc 1140 agaatcagcc cagaagagaa cagcagcatc tgaagcagca ttatctcaag aaagtagcag 1200 catagaatat ctcccaccat cccagtagaa gtactcaaat cgatatcaga tgataccaca 1260 tcacaaggaa caatatctac cctttgaaaa ttcacagtat tcctcactcc agtagtagcc 1320 aaaatctcca atacgaactt ctcagcagca acattattac tagcaaaaca atactcaaag 1380 ctaccagcca cagcactaag cccataactg gaatccaact tattagcact agcaatctaa 1440 caattagaga tttgaaccca gctacaagcc gtcctactac tgaagaaata catatgaaaa 1500 ctccttgaaa tgaaatatta acatctcatg catcaataag gaagtgtttt acctctacta 1560 gagatgagtt ttgagtaaac tcttatttaa agaatttgga aagttttaaa aagaacgtat 1620 caaacaaaaa ccctgtgtct tcttcatatt tacatattgt agtccctcaa cacgcttaaa 1680 tatagctcat tctctatccc caataattag caccagcaaa ttacaatagg ctttgctctt 1740 attataataa tatatcacag atataaaatt tattattact attttttagg tatattctaa 1800 aaggaattaa gaaggccata aaataaagca aacaactaaa gaacatctat ataatgatta 1860 gaccaaatcc tatttataga tcaagcaaga agtcacaaac acaaacctca cagacaataa 1920 agctagattt aagactcatc tagtaaatta actcacacct caactaacat gattctaacc 1980 taaaaagtag ctcctgctca aatcaatcaa tgtcctccaa acgacgtatt aacaaagact 2040 cgatgaacaa agtaaatcaa taccttggct ctaacaaaac ttctccccct caatattcac 2100 cacaacaaca tttaaaaagc aaaattattg gctcttagct cacaattaca ggcttcaata 2160 ctcaaggagt agcaaaagct aacaaaagaa ataagcaata ctctacgctt tacctccaaa 2220 agcttttcgc tgaaagcgat gctaccatac tcttggagac aaattgcaaa gaaaatatga 2280 gagtggatat ccataatccg gactatattc aaatgaacaa tcactgcaaa gtacgctaaa 2340 tgggtgcagg aacagcctct atccaccaca aggaaatcaa acttgcacca tacgtcgcaa 2400 gcctcaataa tgaaaaagtc cttgcccaag tattaattct tgagaatcaa tagagagttc 2460 taatcatggg actacatctc caactctagg ccaaccctat tgaagaataa gctctactcg 2520 aagaaatcat ctcgtaagtg ctgcaagaca acaggctcaa ccatatactt atctacgggg 2580 acttcaacct cgatatctca acctccaact cctcacacgc agatgtaagg aaagccggac 2640 aaatcagaag gctcaaggac ttcttgaaat caaagaactt atacatacac agcacaaata 2700 ggcacactag agaagcattc aagcagaaca aaatcataaa aactcaagtg gacttcttca 2760 taagctcgtt tgctcctaat tgcattctca agattgaaac aattacacag gacgaaagca 2820 acagaggcag tgatcattat cccatccgaa ttcaaattga cttgcaagtt gcaaaggcta 2880 gagaaatcaa aaaagtcatc aacttaaagt agctagatct cctatcctag aagcttcaaa 2940 aattcattcg cagcagtcca tagaatgaca cagaaatcag agataaaatt aacagcatta 3000 aaggcaaact tgatctcaaa taacaaaact cttaacagct aaggcgaaac ataaaccaaa 3060 gaataaatgt agttatcagg gcaatccaca aacatgaaaa ggtgctaatg attgattaaa 3120 actcagcaaa aactagagca gaacatatga aaacgcgtag aaaaatcata cagagcaaag 3180 acaccataat ccagcataag gtcagtgaaa tactcaaaga atactatgcc gaatcaacga 3240 aaaaagtcca atcactcttt aaaacaaaca ttaagcgata ttattagaac atcaagagag 3300 tatgcagctt aggcaacaag tcaaatgttc taaacacctc aagcggacca gttgaaaact 3360 cgaacaaaga actcttattt gatagtaaag atatctaaaa cgaaataagc aaatacttca 3420 aagaacacta taaatgtgaa acacacgtca attactatca caccattggg caacttagca 3480 gcagcgaaat agacactctc attcagtctt cagaaactct ctacagcagc aggaacaagg 3540 ccttctccaa cgattatatc aaagataagt gctttttccc accaaattac agcgaaatct 3600 ctaacgccat gcaaagcatt ccttgctcca aaactaccaa gaagcaaatg atttctgaga 3660 aaatgctctc tcaatagtag catagagaca caaacctcag aacactcctc aaagacatcc 3720 tcactaaccc agactccttc agacagacct tcaatgcgag gtaaatctac ctcctaaaga 3780 gcgaaagcag ataaattatc aactgtagac ccatcacgat tcagtcctca gctattaaaa 3840 tcctggaaaa tgcaatgctc cagcatgtcc aaaagctcaa agatgaaggg aaggttaaag 3900 attttcacat cagccaatgt ggcttccaaa agaatcgaag cacaatcatc aacatttgca 3960 gaactattgc tctcatccaa agtacagtag ccaagaaaga aaacagagtt gccatcttcg 4020 tagatgtcaa gagtgctttc gatagcgtca accacgaaca gctgttctaa gcactgcgaa 4080 atcaaggctt tgacgatatt ttcatcaaat ctgtcgcctt tctctactag cactgtcgca 4140 tcaatgggta ccaaatcggg agaggagtca tccagggagg aaaattatcg cctatcctgt 4200 tcaattacct ctatgaagaa gtcagagtga aaatccttga aatctggaag agtaagaagt 4260 tggattgtca agacctccac tttgagctct tcgccgatga catgctaatc attctcaaaa 4320 agtacaagct cacagcaacc ctccttgaag tcctcaagca agcgtataga gacataaatc 4380 tccaaatcaa tgaatccaag acgaaaatca tgctaatcgg caagcaggaa acctacataa 4440 aggactcact caaactaagg caagcgaagg gcatagatct tgtcatggag ttcaagtacc 4500 ttggcttagt catcaacaat gtgggcaaaa tccacaaaga tgtagctaag aaagtagaaa 4560 aggcaaagaa tctcacacaa atgctcaaga ggtggcgcta caacaagcta ggtgttatcc 4620 cttgcctcct tctctggtag ctctttataa aatcgcagct ataatacgct tcagcaatca 4680 tgatcacatc agcctaaagc gaaaaatgca tgagaagtct tcgaaacatg tacaactctt 4740 cccttaggga catacttgga ctcaacagga acacttcaat ctcaactatc tgtgcagttc 4800 ttggcatcga aaacattgag tctatgattc tcaactgcta caacaaccta atgagccaaa 4860 ttaacgctga taaacgaatc taaatgtagt tcaaagtctt cactaaaaac attaaattag 4920 acatggctag caggcatcaa cagctgctac aagcagctac tcataaggac agcagagtca 4980 aacaaagatt ttggtggtag ttcaacaaca agtacgatga tctcattaaa ctcaacatat 5040 atctctagaa ccctgctgga caattctgct ggaaatgtta aaggttctac aagggcggca 5100 gttggctcta ctgtcagcac aaaaaagaag cttggatgga actccaaaag atattaaata 5160 ctacctccaa ggcagaaata gtcagtgtgc tcaatcttag aaacacggat tggcaaaagt 5220 tttctataga cataaaaaaa agtattcaag agtgggctag actcctggtc tattaagaca 5280 tttctaaggt cagatgaaag aaaaaatcag ctcccaaaat ttaaaattct cactcaaaaa 5340 aaaaaaaaac ttaaagaatc attcaagaat ttgcaggccc ttttcaatgt gagccaagta 5400 ttgaacaaac gcccttcatg tttgttcaaa tggaagaaat agtgaggagt ttctttaata 5460 accatgaagt tactgtctta tacaatgtga gtcatgtata gtatatgtac atccataact 5520 attaaaaaaa aactgaaaga atcacccatg aagtttaagc cctatacaat gtgagtcatg 5580 tatagaacaa gcccttcatg tttgttcaaa gtggaggcaa tagtgaggag tttctttaat 5640 aaccatgaag ttactgtcct atacaatgtg agtcatgtat agtatatgta catccatact 5700 attaaataaa tttaaagaat cacccatgaa gtttaagccc tatacaatgt gagtcatgta 5760 tagaacaagc ccttcatgtt tgttcaaagt ggaggcaata gtgaggagtt tctttaataa 5820 ccatgaagtt actgtcctat acaatgtgag tcatgtatag tatatgtaca tccataacta 5880 ttaaaaaaaa ctgaaagaat cacccatgaa gtttaagccc tatacatgta tagaacaagc 5940 ccttcatgtt tgttcaaaag taagaataaa ttatttttca tgttaagcgc aaataaataa 6000 ttaaaaataa taaacaataa attaaataaa ttaattgact cggaacaata aatagaaaaa 6060 ggttattaaa ttaccaaatt gttctggaat taatgagaaa aagaggaaac tgggtgtata 6120 ttaaatattc aaaaatcaga tattgattga attattaaaa attaataatt tggaaaatac 6180 ctaaaaataa ataaaaatta tctatataaa tataataaaa ataaatttta agatgtattc 6240 tgagtgcatt tatatgtgcg agtttttcta ttctattcta tc 6282 // ID Zator-1_Ngru repbase; DNA; INV; 1408 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Zator; DNA transposon; Transposable Element; Zator-1_Ngru. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-1408 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC incomplete. XX SQ Sequence 1408 BP; 501 A; 263 C; 254 G; 390 T; 0 other; tggctatcaa gcacaaaaca ataggcaggc ctagtaaggt tggcaatatc gcagaattgt 60 taaaaactac cctgccagca gattacagcg aagcacatcc tcgccgaaga gatggccata 120 tctacacctt ggccaatggt gttgaggtta tcaaagaagg aattgctctt attaagaaca 180 aaactgggat aacgtactca ccttctactt taaaacgttt ttttgtgggg aaaagaaaaa 240 caaaagcatt caagttgaag tctaaggatg cttttattcg cgtaatgaaa atcagaagca 300 acagaatgat tgaacacaag gacaaacata ttgcgtgtgc gttgatgaaa tatattcgtg 360 aacttgcttt caaattagat gatgatgtta cagtggtatc cgttgaccaa aaggcaaaga 420 tttatactaa gggtcctgct gtgagtaaat ctactgctgt tgctgttgac tcatcattaa 480 caagatcaaa actaccagcc cttccagacc acaacttcgg caacactaag gaaaatagtg 540 ttacaccgaa tgtaatgtca gtattaagag tttctaagga agggaataac attacttgga 600 aaagacaatg cgtgcttgta aggttgcgca gtagttgtaa tgatgataga ttttcgcatc 660 tagatgacct tgaatggttt tttgaagaac taaaactccg taatttatta cacggaaacg 720 ttattatgat cacagataat gggcctaaag tgcaaccaag aaacaggaaa gttcaatctg 780 aagcatatta cattttcaaa aaatacaatc ttgattcatt aattcagtgc tcctatagtc 840 cacatgcatc tgctttgaat ccagttgaga aagaccattt cccaatctca aagaagcttg 900 ctggtttatt cgatatagat aaggagatat atggatcgtc caaagagccc caaaaaagac 960 aaaagaatca caatgttgtg ttgaagcaat tgtgtttact tattaatgaa ataccatcat 1020 accactgcac atacgtcccc accaaccaaa attcaattcc aaacactctg ccttcttggc 1080 caacttggga aaatattttt tcctttttgg agggaaagga aacagaagag gattgcttga 1140 tcaagaaatt aattaaggat attcattacc ttagtagaga tggtaatgga ttgtacactt 1200 ttgtaattag aaaggaagat tcatcaacaa cattttacca acagctagca caatacaatg 1260 gcttcatacc tcaccccatc cctgacaaaa ataaccaaag aaaaaacaat gaagactatc 1320 ccaacgacga taatcttcat tattgtttat tcgaagatag attgaaagat aattccatag 1380 ccttaaattt acatttccct tcatacca 1408 // ID Copia-13_CQ-I repbase; DNA; INV; 4159 BP. XX AC AAWU01014418; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_CQ_; KW Copia-13_CQ-LTR; Copia-13_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4159 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 341-341 (2011). XX DR GenBank; AAWU01014418; Positions 39320 43478. XX CC Positions [1470-1997] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 120..2981 FT /product="Copia-13_CQ-I_2p" FT /translation="MADAKVQVEKLNDENYVMWKFKMELLLFKEGLTSTVT FT QNKPANPDAAWLKKDGDAKVAIGLAVENDQLVHICRKTTSKEMWDALQEIH FT GESNYSNTLHVLRKLCSLRLREGGNLPEHLKEMTAMANHLEMNGEGLKETT FT FVGLIISSLPPSYGSLINVIENLPKAEIKLDNIKGKLRDEWRRRRDCSEDE FT QLHEKALKSSVGGSSKATDKKKTKKAGACHNCGAVGHWKRDCPKLSKATSG FT SPKANLAVERPAGMEMCLAVSGECSPNRWYLDSGATSHMTSNTSLLNNVDA FT SKQPDICLADGTRIKSSGAGTGKLISVTGSGIRMSVTLADVYHVPSLAGNL FT LSVSRMCDLGYAVCFDKVGCKVMRGEEIVLVGERSGGLYRLKEFPERALVS FT KTEHPALCEHMWHRRLGHRDPDAVSKIVREDLGFGLKMEKCAVKCVCEVCL FT KGKMSRDSFPKQSQSKSSAVGDLIHTDVGGPMEEATPSGNRFYVIFVDDYS FT CYTVLCLLKKKSEVEAKIREYCSLMKNQFGYYPKVIRSDGGGEYSSGTLKK FT YLAENGVVLQQTAPYSPQQNGKAERKNRYVVEMARCLLVESNLGKKYWGEA FT ISTANYLQNRLPTSTVERTPYELWHGKKPSYTHLRVFGSEAYVHVPKEKRR FT KLDAKAVKMTFVGYAEGRKAYRFLHPDADWIVISRDAKFLENTGEEAVQDS FT GDLFADDAEPDAAKEPETFDVPISGGEQRDREPAVAGPSAETTEDDVEQLE FT PDVEQIELDEDTTDETGSSELETSVYENASEGDLSFHGFPLDEIARRSLRS FT TKGVPPVRFDEAFVVAAAPTNDEAEPTSVREALRCDQSVEWKSAMEEELQS FT HAENGTWELVPLPPGRQPVGCRWVYKVKRNAAGEAIKYKARLVAQGYCQKF FT GQDYDEVFAPVIKQTTLRMLLAVASKHNLQLRHFDVKTAYLNGVLQRNST" FT CDS 2981..4141 FT /product="Copia-13_CQ-I_1p" FT /translation="MRQPAGFEEQGKEDLVCRLRRSIYGLKQSARCWNQRL FT HTVLEQLEFEQSTADPCLYVKVVEGKRIYLLVYVDDILVGCKSGGEIEKIY FT SLLKKEFKMTDLGPANFFLGLEVKCNDGKYGISLEGYIDRIAERFELSDAK FT GAKSPMDEGFTKTVEESRPLENSTEYRSLVGALLYVAVCARPDIAVSASIL FT GRSVTAPTLADMAAAKRVVRYLKATKSWQLRYDDPDGELVGYSDADWAGDL FT KTRKSTTGSVFLYSGGAVSWASRLQQCVTLSSMESEFVALCDTSQEAVWLL FT TLIEDFGEPEQKPLTIKEDNQSCIKFVAAERTTRRSKHVDTKHCYVKELCE FT RKVLQLEYCPTEDMIADVLTKPVGAVKHRKLSSLLGLAAPGSGR" XX SQ Sequence 4159 BP; 1019 A; 1036 C; 1327 G; 777 T; 0 other; acccgaatag gttgtgggcc cacagtggtg aagatttgag tttttccgcg agtgaaagtc 60 gggaaaagta gttttttctg atcggacttt gtgcgtcgcg tgcatcagtg aaaaacaaga 120 tggcggacgc aaaggtgcag gtggaaaagc tgaacgacga gaactacgtg atgtggaagt 180 tcaagatgga actgctgctc ttcaaggaag gcctgacctc aacggtgacg cagaacaaac 240 cggccaatcc ggatgcagcc tggctcaaaa aggacggaga tgccaaggtg gcgatcgggc 300 tggccgtgga gaacgatcaa ctcgtccaca tttgtcggaa gacgacgtcg aaggagatgt 360 gggacgcgtt gcaggaaatc cacggggagt cgaactacag caacacactg cacgttttgc 420 ggaagttgtg ttcgttgcgg ctcagagaag gcggcaacct cccggagcac ctcaaggaga 480 tgacggcgat ggcaaaccat ctggagatga acggtgaagg cttgaaggag acaacgttcg 540 tggggctcat catctcgagc ctgccgccgt cgtacgggag cttgatcaac gtgatcgaaa 600 atctcccgaa ggcggagatc aagttggaca acatcaaggg caagctccgg gatgaatggc 660 gacgccgtcg ggattgctcg gaagacgagc agctgcacga gaaggcgctc aagagctcag 720 ttggtggcag ttccaaggcg accgacaaga agaaaacgaa gaaggccggt gcgtgccaca 780 actgtggagc tgtcggtcac tggaaacgcg attgcccgaa gctgtccaag gcaacgtcgg 840 gttcaccgaa ggcaaatctg gcagtggagc gaccagctgg gatggagatg tgtctggccg 900 taagtggaga gtgtagccca aaccggtggt acttggactc cggtgctact tcccacatga 960 cgtccaacac atcgctcttg aacaacgtgg acgcctcgaa gcagccggac atttgtctgg 1020 cggatggaac caggatcaaa tccagcggcg cgggcactgg gaagctgatc tccgtgacgg 1080 gcagtggaat ccggatgtcg gtcaccctgg ccgatgtgta ccacgtgccg tctctcgccg 1140 ggaatctcct ctcggtcagc cggatgtgtg atttgggcta cgccgtgtgt ttcgacaaag 1200 ttggctgtaa agtgatgcgg ggcgaggaaa ttgtgctagt gggggagaga agcggcggct 1260 tgtaccggct gaaggaattc cccgagcgcg ccctggtctc gaagacggag catccagcgt 1320 tgtgtgagca catgtggcat cggcgcctgg gtcaccgaga tcccgatgcg gtgtcgaaga 1380 ttgtgcgaga agatctgggc ttcggcctga agatggaaaa gtgcgccgta aagtgtgttt 1440 gtgaagtgtg cctgaagggg aaaatgagcc gagacagttt tccgaagcag tcgcagagca 1500 aatccagtgc cgtgggagat ctgattcaca ccgatgtggg tggccccatg gaggaggcca 1560 cgccaagtgg caacaggttc tacgtgattt ttgtggacga ctacagctgc tacaccgtgt 1620 tgtgcctgtt gaagaagaag tcggaagtgg aagcgaagat ccgggagtac tgcagcctga 1680 tgaaaaatca gttcgggtac tacccaaaag tgatccgttc ggacggcgga ggggaatatt 1740 caagtggaac gctgaagaag tacctggccg agaacggtgt tgtgctgcaa cagacggcgc 1800 cgtattcgcc gcaacagaac ggcaaggccg agcgaaagaa ccgctacgtg gtggagatgg 1860 cgcgctgtct gctggtggag tcgaacttgg ggaagaagta ctggggcgag gcgatcagca 1920 ctgccaacta cctgcagaat cgcttgccta cttcgaccgt ggaaagaaca ccttacgaac 1980 tgtggcacgg aaagaagcca tcctacaccc atctgcgtgt gttcggttcg gaagcctacg 2040 tccacgtacc caaggagaaa cgtcgtaaac tggacgccaa ggccgtgaag atgactttcg 2100 tcggctacgc ggaggggcgc aaagcctacc gtttcctgca tccggatgcc gactggatcg 2160 tcattagccg cgatgccaag tttctggaga acaccggtga ggaggcagta caagacagcg 2220 gcgacctgtt cgcggacgat gcagagcctg atgcggccaa ggaaccggaa acgtttgacg 2280 tgccgatcag cggtggagag cagcgagatc gcgagcccgc ggttgctggc ccgagtgcag 2340 aaacgaccga agacgacgtg gagcaactag agccagacgt ggagcagata gaactggacg 2400 aggacacgac cgacgaaaca ggaagttcgg aactggagac gtccgtgtac gaaaacgcgt 2460 cggagggaga tttgtctttc cacgggtttc ctctcgacga gattgcgcgg cgctcgctgc 2520 gatcaaccaa gggtgtgcca ccagtccgtt ttgacgaagc tttcgtggtt gctgcggctc 2580 ccacgaacga cgaagcggaa ccaacaagtg tgcgggaggc gctgcggtgt gaccaaagtg 2640 ttgagtggaa aagtgccatg gaagaagagc tgcagtcaca cgccgaaaac ggaacctggg 2700 agctggtgcc actgccacct ggccgccaac ccgtcggctg tcgctgggtg tacaaggtca 2760 agaggaacgc cgcgggcgag gccatcaagt acaaagctcg tctggtagcc cagggctact 2820 gccaaaagtt cggacaggac tacgatgaag tgtttgcgcc agttatcaag caaacaacgc 2880 tgcggatgct actggccgtc gcgagcaaac acaacttgca gctgcgacac ttcgacgtca 2940 agacggcgta cttgaacgga gtgctgcaga ggaactctac atgaggcagc ccgccggatt 3000 cgaagagcaa ggcaaggagg acctggtgtg ccgtttgagg aggagcattt acgggcttaa 3060 acagtcggcc cgttgctgga accaacgtct gcacacggtc ctcgagcagc tggagttcga 3120 gcagagcacc gcagatccct gtctctacgt gaaggtcgtc gaaggtaagc gcatctacct 3180 gttagtctac gttgacgaca tcttggtcgg ctgcaagtct gggggcgaaa tcgagaagat 3240 ctattcgttg ctcaagaagg agttcaagat gactgacctt ggcccagcta actttttcct 3300 ggggctggag gtcaagtgca acgacgggaa gtacggcatt tccctcgaag gctatatcga 3360 ccgtatagct gaacgctttg aactctcgga tgccaaaggt gccaaatctc caatggacga 3420 aggattcacc aagaccgtag aagaaagtcg tccacttgag aacagcacgg agtatcggag 3480 tctcgtcggt gcgctcctct atgtggcggt ttgtgccaga ccggacatag cagtcagtgc 3540 ctcgatactc ggacggagtg tcacggcgcc cacgctggcg gacatggcgg cggcgaagcg 3600 agtggtccgc tacctcaaag ccacgaaatc ctggcagctc cgctacgacg atccagacgg 3660 agaattggtc ggctactccg acgcggattg ggccggcgat ttgaaaacca gaaaatctac 3720 gaccggatcc gtcttcctct attccggagg agctgtttcc tgggcgagcc gtctccagca 3780 gtgcgtgact ctgtcgtcca tggaatcgga gttcgtcgct ctttgcgaca catcccaaga 3840 agcagtctgg ctgctgaccc tgatagaaga cttcggcgaa ccggagcaga aaccgttgac 3900 catcaaggag gacaaccaaa gttgcatcaa atttgttgca gcagaaagaa caacgcggcg 3960 ttccaagcac gtagatacca agcactgcta cgtgaaggaa ctctgtgaac ggaaggtgct 4020 acagctggag tactgtccga ccgaggacat gattgcggac gtgctcacca aacccgtagg 4080 agccgtgaag caccggaagc tgtcttcgct gcttggactt gcagcgcccg gcagtggtcg 4140 ttgaggagga gtattagag 4159 // ID Gypsy-211_AA-I repbase; DNA; INV; 4023 BP. XX AC AAGE02027032; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-211_AA_; KW Gypsy-211_AA-LTR; Gypsy-211_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4023 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027032; Positions 1104 5126. XX CC Positions [1518-1967] - Reverse transcriptase CC Positions [3051-3530] - Integrase core CC 'GACAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..4001 FT /product="Gypsy-211_AA-I_1p" FT /translation="MMMDQKAMLVQGASIKPPKPLVIEENMAAKWKLWWRQ FT FHWYSVATELSLKPTQTQAATFLSCIGEDCVRVLDTFGLSDAQESDIDVLK FT AKFDAYFVPKSCLTYERYVYGKIVQHEGEQFDAFLTRVRKQAKKCSFSVLH FT DSLVKDRLISGMVHAKLVPQLLDDTFDLQKTIDVIRNYELSLKQSQEMRKP FT SVVEVDSVFKQSGSKANRIMKSEMIQCNRCGREHKRGACPAYGKKCFKCNQ FT MGHFAERCFGAAASGGTKNKAIKSVSCEEESELSVEELFIGSVCDDDDSVE FT EAWYEEVKIKNKKLLLKLDSGAACNVIPLKIFRGLNEELLPSKTKRLVSYS FT NHKVSVAGEAKLPVVVRGKTFEAVFKVVDGEVAPILGRKTSVRLNLIARVD FT QLSMDQSLFHGLGCVKGFVYDIDLVENPTFATYPPRRIPHSLRDAVKEELD FT SMEKMGVIQPISEPTPVTNALVIVRQKGKLRICIDPSQVNRNLLRRTHPLS FT TIEEISARICKSKFFTILDMKKGFWQIPVSERSMKYLAFSTPWGRYTCKRL FT PFGLASAPEVFQKLMNTLLAGLEGVESSMDDILIHAETEEKLRELTNIVLK FT RIESAGLKLNKEKCLLNQTSVKFLGHIVTQQGLQADPEKLKAIQQLKRPTN FT RLELQRVLGTVNYLGKFIEHLSALTEPLRKLLVKDVEWFWDREQEDAFSKI FT KCLMTSPPVLSFYDVNEAVTLSVDASSKAFGAVLMQKGKPVAYASKSLTPA FT QENYPQIEKKAAAIRFACNKFHEYVYGKDLTIETDHKPLESIYKKSLDRAP FT PRLKRILLDVVQYGPKVQNKKGKDIPLADILSRDVANNDDPEKADELEVHI FT VLQMSKPAKLELENECARDPEIQLLMGTVMSGWPDDRRKLPLELRPYWNYR FT DELSCYEGLVFKSHQVVVPKTLRQKMLSIIHSGHTGIQGCINRAKQQLFWI FT GMATEIKIMVESCAICQKHQKSNQKHTIINNEVPTLPFEIVGSDLFHFHGQ FT DYILIADSYSGFFDFEPLKDTSSRSVVAILKRWFACHGIPRILYTDNGPQY FT ASREFANFSKQWSFDHVTSSPHFPRSNGLSERFVQTAKTILKKCSEDDSDI FT QLALLLSRNTPRDNELASSSHRLMGRRLRTPLPITQKSLKPELVGNTTDAL FT AAKRIKQKEYADKGGREATEFTEMQQVMVQNPKSKTWEEGEINKKLEPPRS FT YLVKLADGQIVRRNARDLKNNRWSSCSNQASNDDTIIYMDTVPVQRSSEDV FT RDIPTSIDQHERFNGSHVGGDFETCTRSGRVIRTRRDDDFEYY" XX SQ Sequence 4023 BP; 1255 A; 789 C; 975 G; 1004 T; 0 other; tggtgtcaga ataagtggaa acaaacgcgg aaatcgcaaa cggtgtcgtg atagtttacg 60 ggaatatttg agaaagtgat tcgatcgtta tgatgatgga tcaaaaagct atgttggtgc 120 aaggagcatc aattaagccc ccaaagccgc tcgtgattga agaaaatatg gcggcaaagt 180 ggaagctttg gtggcgtcag ttccactggt attcggtggc aacggagctt tcgctgaaac 240 caacacaaac acaagcagct acctttctaa gctgcatcgg tgaagactgt gtgcgagtgt 300 tggatacatt cgggctgtcc gatgcccagg agagtgatat agatgtgtta aaagccaaat 360 ttgatgccta cttcgttccc aagtcgtgcc ttacgtatga acgatacgtg tacggaaaaa 420 ttgtgcaaca cgaaggagaa caattcgatg cctttttgac gcgagtgcga aagcaagcaa 480 aaaaatgctc gttcagtgtt cttcatgatt cgttggttaa agaccgcttg atttctggaa 540 tggtacacgc gaaattagtt ccgcagttgc tcgacgacac attcgatttg caaaaaacca 600 ttgatgtgat acggaattat gaactctcgc tgaagcagtc tcaagaaatg agaaaaccat 660 ccgtcgtaga agtggattca gtgtttaagc agagtggttc aaaagctaat cgcattatga 720 aaagtgagat gattcagtgc aaccggtgtg gcagagagca caaaagaggt gcctgcccag 780 catacggcaa aaagtgcttc aagtgcaatc aaatggggca tttcgccgaa cgctgtttcg 840 gagcagccgc cagtggtggc acgaagaata aggcgatcaa atcagtgagc tgtgaggaag 900 agagtgaact atcggtcgag gagctgttca ttggaagtgt ttgtgatgac gatgacagtg 960 tcgaagaagc atggtacgaa gaagtgaaaa taaaaaacaa aaagcttttg ttgaaacttg 1020 atagtggggc ggcttgcaac gtgataccac tcaagatttt ccgtggactg aacgaagagt 1080 tactgccatc gaaaaccaag cggctggtct cctacagtaa ccacaaagtg agcgtggcgg 1140 gcgaagccaa attaccagtt gtcgtgagag ggaaaacatt cgaagcagtt ttcaaagtag 1200 tggatgggga agtcgcacct attctcggtc gaaagaccag tgttcgtctg aatctcattg 1260 ctagagtcga tcagttaagt atggatcagt cattattcca tggtcttggc tgtgtgaaag 1320 ggttcgtgta tgatattgat ctggttgaaa atccaacgtt tgctacatat ccaccacgac 1380 gaatccctca ttcactacga gatgctgtta aggaagagct cgattcgatg gaaaaaatgg 1440 gtgtcattca accgattagc gagccaacgc cggtcacaaa tgcattggtc atcgtaagac 1500 aaaaaggtaa gctgcgtata tgtatcgatc cgtctcaagt gaaccgaaac ctcctccgtc 1560 gtactcatcc cctgtcaaca atagaagaaa tatctgcacg gatttgtaaa tcaaaatttt 1620 ttacaatact tgacatgaaa aaaggtttct ggcaaattcc ggtatcggag cgatcaatga 1680 aatatcttgc cttttccact ccttggggcc gatatacttg caaaagactt ccttttggac 1740 ttgcctcggc tccggaagtt ttccaaaagt taatgaacac gttattggcg ggactagaag 1800 gtgtggaaag ctcgatggac gatatactaa tacatgctga aacggaggaa aaattaaggg 1860 aattgacaaa catcgttttg aaaagaatcg aatccgctgg tctcaagctt aataaggaga 1920 aatgtctctt gaatcaaact tcggtcaaat ttttggggca catagtgaca cagcagggct 1980 tacaggcaga cccagaaaaa ctgaaggcaa ttcagcaatt aaagcggccg accaatagac 2040 tagaattgca gcgtgtactt ggaacagtaa actacttggg gaaattcatt gaacatttgt 2100 cagctttaac agaaccgttg agaaaactgt tagtcaagga tgtcgaatgg ttttgggacc 2160 gggaacagga ggatgctttt tcgaaaataa agtgcttgat gacttcgccc cctgttttga 2220 gcttctacga tgtcaacgaa gcggtaactc tctcagtgga tgctagttcg aaagcttttg 2280 gtgcagttct gatgcaaaag ggaaaaccag tagcgtatgc atctaaatct ttaacgccgg 2340 ctcaagagaa ttatccccaa atcgagaaga aagccgctgc aatccgcttc gcgtgtaata 2400 agttccacga gtatgtctat ggcaaggacc tcacaataga aacggatcat aaaccgcttg 2460 agtcgattta caaaaagtcg ttagataggg ctcctcctcg tcttaaacga attttgctag 2520 atgttgttca atatggcccg aaagtccaga acaagaaagg gaaggatatt ccgttggccg 2580 atattctcag tcgggatgtt gccaacaatg atgatccaga gaaagcagac gaattagagg 2640 ttcatatcgt gctccaaatg tcgaaacctg ctaaactgga attggagaac gaatgtgcta 2700 gggatccaga aattcaattg ctaatgggaa cagttatgtc tggttggcca gatgatagac 2760 gtaagctacc acttgagtta cgtccttatt ggaactaccg agatgaacta tcatgttatg 2820 aaggtttggt tttcaaatct catcaggttg ttgtcccgaa gacgctgaga caaaagatgc 2880 tgagtatcat tcattcggga catactggaa ttcaaggctg tatcaataga gcaaagcaac 2940 aattattttg gatagggatg gcgacagaaa taaagataat ggtggaatca tgcgctattt 3000 gccaaaagca tcagaaatcg aaccaaaagc atacaatcat caataatgaa gtacccacct 3060 tgcccttcga gatagttgga tctgatttgt tccatttcca tggacaggac tacatattga 3120 ttgcagacag ctattcagga ttcttcgatt tcgaacccct taaggacaca tctagcaggt 3180 cggtagtggc gattctaaaa cgatggtttg cttgtcatgg aattccccga atactgtata 3240 cagataacgg tccccaatat gcatcaagag agtttgctaa ttttagtaaa cagtggtcgt 3300 ttgatcacgt aacttctagt ccccactttc caagaagtaa tggactttcg gaacgattcg 3360 tccaaactgc gaaaacgata ttgaaaaagt gttcagaaga tgactccgat atacaattgg 3420 cgctgttgtt atctagaaac acacctagag acaatgagct agcatcatca agtcatcgtc 3480 taatgggtcg ccgattgcga acacctttgc cgattactca gaaatctttg aagccagaac 3540 ttgtcggtaa tactacagat gcattagcag ccaagcgaat aaaacagaaa gagtacgctg 3600 ataaaggagg tagagaggct acggagttca ctgagatgca gcaagttatg gtgcagaatc 3660 cgaagtcgaa gacttgggaa gaaggcgaaa tcaacaagaa gctggaacca ccaagatctt 3720 atctcgtaaa acttgcagat ggtcagattg tacgcaggaa tgcccgtgat ctgaagaaca 3780 atcgctggtc atcatgttcc aaccaagcat cgaacgacga tacaattatc tacatggata 3840 ctgttcctgt acaacgttca tcagaagacg ttcgagatat cccaacatca attgatcaac 3900 acgaaaggtt caacgggagc cacgtcggag gtgacttcga aacatgcaca cgaagtggtc 3960 gtgtgattcg cacaagaaga gatgatgact tcgaatacta ttgatgttct tcgaaagggg 4020 aga 4023 // ID BEL-628_AA-I repbase; DNA; INV; 5926 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-628_AA_; KW BEL-628_AA-LTR; Pao_Bel_Ele79; BEL-628_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5926 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4976-5536] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 36..4394 FT /product="BEL-628_AA-I_2p" FT /translation="MSKKGDGQSKRGTSPSNKDPLPVVSCDICRKPDDSRM FT VACDSCGQWYHFMCVSVDSSVQDEDWSCKKCSEAGKQLLANTSTPNNAGTI FT PKSGTSKPADLNVEEYIQKQLMAMQKKFEKMMKEKDDQRLKELRDQRKQYE FT QLLKATEQRVQQLEQQHRGITSSGSGTAGSLSDKQPKQTLPRGGSTTNSND FT LRMMLGGGLPESFGRYQHPDVASGVNNPLQWTLGTGPTASSTSNPLPQIHV FT QDVSQGDDALAQELLLLEEKQALERKHLEERSRLLQQRYAIGGSATTGLNP FT QATVFQPNVSSAFGVPLLSQSQVSARQAVNKELPPFSGDPEEWPLFIASYE FT HSTRICGYSEEENMLRLQRSLRGKALEAVRCRLLLPANLPGVLATLKTLFG FT RPEIIVHSLVNKIREMPPPKAEKLQSLIDFGVAVQNVCATIAASGLDEYMC FT NVALLQELTERLPPAIRLNWAYHRQGLNRVTLSEFGDWLGKLVEAASIVTI FT PSISAPKTERRGQKGENYINVHSESSSIVDAAAKERPSTPKGCLVCQNECS FT GLEKCPRFLDMDVGTRWTIVKEQKLCRKCLRKHFGSCHIKASCGRHGCTFV FT HNSLLHDDKRYSKPATTEVAKTPAENSSESCNTHSKTARKILFRYVPVTIY FT GKGKQVTTYAFLDDGSSATLMEHSLLKELGLKGRSHPLCLDWTGGHQREEN FT ESVMLALKISGANDASEVFELPEVHTVRDLSLPKQSVLIPQLETKYCYIEG FT LPLESYDNVSPRILIGMDNCRLGHALRSIEGGENEPVVSKTRLGWMIYGPC FT AIESGTINSNHSSHHSFHICPCVKEDEKDLNTAVKEYFSIESLGISGSPTS FT LPSKDEERALKILSTETRLVGNRYETGLLWRFDQVQLPDSKGMALKRLGCL FT QKRLRREPELAVAMHVKMAEYEEKGYIRRLSTREKAEKHSNDWYLPIFPVT FT NPNKPGKLRIVFDAAAKVNGVSLNSFLLTGPDQLVSLLAVLYKFREFRVAV FT VGDIREMFFQVRMKKQDQRSQMILWNNGKTEDEPDAYVVAVMTFGAACSPS FT SAHYVKNRNADRFEKEYPRAVECIKYEHYVDDMLASVETEEEAVKLATEVR FT TIHSEGGFEIRNWLSNSSDVVTSLHENSTTEKNMSFSTEMSTEKVLGMWWD FT TTTDTFTFRLSPKHDQELLSGTRMPTKREVLRTLMAIYDPLGLIANFLIYL FT KILLQEIWRSGCGWDDEISGKIADRWSTWIEALPNVRQVRIPRCYRYETSA FT EPTNNVELHIFCDASENGIAAVAYLRFEEKQIVECALVGSKTRVAPLKFLS FT IPRLELQAAVIGARLADCIVKSHRLKITQRVFWTDSRDVVCWLRSDHRRYS FT QFVAFRVSELLDTTQVSEWRWLPTKQNVADEGTKWQRPPDFQPSSRWFRGP FT DFLWDPKKEWPGDSGDQGATIEEIRPSVLHHTSVLRQQW" FT CDS 4439..5926 FT /product="BEL-628_AA-I_1p" FT /translation="MGYVHRFIATLRKGTKKLVGPLTQDELKLAENTIYRL FT VQQLAYPDEIQLIRDKDPGAYPWERVLPKTSPLHKLSPFIDEHGVLRMRGR FT IDACEWADKAAKNPILLPRNNHVTNLVMADYHAACRHQNHQTAINQIRLKY FT NVPRLRSEFGRVRKSCQRCKIRQARPQPPEMGNLPPARLAAFQRPFSYTGI FT DYFGPMTVLVGRRAEKRWGVLLTCMTTRGVHIEVAHSLTTDSCILALRNFI FT ARRGAPLEIISDRGTNFIGASRELCEALKHVNEEELMVEFVSPDTKWTFNP FT PAAPHFGGCWERLIQSVKKTLHDFEPPRLPSDEILRTMLLEIEMILNSRPL FT TDIPLDNDTEPPLTPNHFLLGSANGSKPPIAFDDKPSVLKRSWTMAQLYAD FT HFWKKWVAEYLPTLTRRTKWFQPAKPIQEGDLVIVVDNSLPRNCWPRGRVV FT KAVHAKDGQVRRVTVQTASGLLERPATKVAVLDVGAKGGKPLGQQWRTEGE FT " XX SQ Sequence 5926 BP; 1614 A; 1419 C; 1631 G; 1262 T; 0 other; tatttataaa atccgtttac gatcgggggc tcggtatgtc aaagaaaggt gacggtcaaa 60 gcaagcgggg aacttctcca tcgaacaaag atcctttgcc ggtagtgagc tgcgatatat 120 gtcggaagcc ggatgatagc cgtatggtag cgtgtgattc gtgtgggcag tggtatcatt 180 tcatgtgcgt tagtgtggac tcgagtgtac aggacgaaga ttggagctgc aagaaatgtt 240 ctgaagcagg aaaacaactt ctagccaaca catctacacc gaacaacgcc ggaacaatac 300 cgaagtctgg aacaagcaaa ccagcggatc tgaacgtgga ggaatatatt cagaagcagc 360 taatggcgat gcagaagaag ttcgagaaga tgatgaagga gaaggatgac cagcggctga 420 aagagctcag agatcaacgg aagcagtacg aacagctact gaaagccacg gagcaacgtg 480 tgcagcaact ggaacagcaa catcggggaa tcacatcgag tggaagcggg acagcaggga 540 gtttgtctga taagcagccc aagcagactt tgcctagagg cggatcgacg acaaattcga 600 atgatctgcg aatgatgctc ggtggtggat tacctgagag tttcggtcgt taccagcatc 660 cggatgtggc tagtggtgta aataatccgc tgcaatggac tcttggaaca gggccgacag 720 ctagttcgac cagcaatcca ttaccgcaaa tacatgttca ggacgttagc caaggcgacg 780 acgctctagc gcaggagctg ctgttactgg aggagaagca agcgttagag aggaagcatt 840 tagaggaaag aagtcgactt ctacagcaac gatacgcgat cggcggaagt gctactaccg 900 ggctgaaccc tcaagcgaca gtattccagc cgaatgtaag cagtgctttc ggtgttcccc 960 tgcttagcca aagtcaagtt tctgcgcggc aagcggtcaa caaagaactc ccgccattct 1020 ccggagaccc cgaagaatgg ccgctcttca tagcaagtta tgagcactct actagaatct 1080 gcggatacag cgaagaagaa aacatgctgc gtcttcaacg aagcctgaga ggaaaggctc 1140 tggaagcggt acgttgccga ctattgctgc ctgcgaatct tcccggagtg ctagcgacgc 1200 tgaagaccct ttttggaaga ccggagataa tcgttcattc gctggtgaac aaaattcgtg 1260 aaatgccacc accgaaggcg gagaaactgc aatcgctgat tgacttcgga gtagcggttc 1320 aaaacgtttg tgccactatt gctgcttcag gcctggacga gtacatgtgt aatgtggctc 1380 tccttcaaga gttgacggaa agattgccgc cggcgatcag gttgaactgg gcgtatcatc 1440 gacagggtct gaatagggtg acactttcgg aatttggaga ttggttgggt aagctagtcg 1500 aagcggccag cattgttacg ataccctcca tcagtgctcc gaaaacagag cgacgcggac 1560 agaaagggga gaattacatc aacgttcatt cggaaagcag ttcaattgtc gatgctgctg 1620 cgaaggagcg tccttctacg ccgaaaggtt gtctcgtatg tcagaacgaa tgcagtggtt 1680 tagaaaagtg cccgagattc ctcgacatgg acgtcgggac ccgctggacg attgtgaagg 1740 agcaaaaact gtgcagaaag tgtctgcgga agcacttcgg atcctgtcac atcaaagcgt 1800 catgcggaag gcatggatgt acgttcgtgc ataatagtct gctgcacgat gacaagcgat 1860 acagcaaacc tgctaccacg gaagtagcga aaacgcctgc agaaaactca tcggaaagct 1920 gcaacacgca ttcgaaaacg gcgaggaaaa tactctttcg gtacgtgcca gttacgatct 1980 acggtaaggg gaagcaagta actacctatg cctttctcga tgatggatcg tcagcaactc 2040 tgatggaaca cagtctgttg aaagagctgg ggcttaaagg aaggtcccat ccattgtgcc 2100 tggactggac cggtggacac cagcgagagg aaaacgagtc ggtcatgctg gccctgaaga 2160 tttccggtgc taatgatgcc agcgaagtat tcgagcttcc agaggtccac acagtccgag 2220 acctttctct gccgaagcaa tcggtgttaa taccgcagtt agaaacaaaa tattgttaca 2280 tcgaaggatt gccattagaa tcgtacgaca atgtctctcc gcgaatcctt attggaatgg 2340 acaactgtcg gctgggccat gctcttagaa gcatagaggg tggtgagaat gaaccagtgg 2400 tgtccaaaac acgtcttgga tggatgattt acggcccctg tgcgatcgaa tccggtacga 2460 tcaattctaa tcatagcagc catcatagtt tccatatctg cccgtgcgtg aaggaagacg 2520 aaaaggatct gaatacggct gtgaaggagt acttttccat cgagtctctg gggatctccg 2580 ggtcgcccac atcacttcca tcgaaagatg aagaacgagc gttgaagatc ctgtctaccg 2640 aaacgcgatt ggttggaaac cgttatgaga ccggcctact gtggcgcttt gatcaagtcc 2700 agttgccaga cagcaaggga atggctttga aacgtctggg ctgtttgcaa aagcgactga 2760 gacgggaacc tgaattagcc gttgcgatgc atgtgaaaat ggccgaatac gaggagaagg 2820 gttacatccg gcggctttcg acaagggaga aggcagagaa gcactccaat gactggtatc 2880 tgccgatttt ccctgttacg aacccgaaca aacctgggaa gctacgaata gtcttcgatg 2940 cggcggcgaa agttaacggc gtttcactaa attcgttcct tctgacgggg cctgatcaac 3000 tagtatcgct gctcgccgtg ctctacaagt tccgcgagtt tcgagttgcc gtcgtaggag 3060 acatcagaga aatgttcttt caagttcgga tgaagaaaca agatcaacga agccagatga 3120 ttctgtggaa caatggaaag accgaagatg agccggatgc gtacgtggtg gcagtcatga 3180 cctttggtgc ggcgtgttct ccgagcagtg cacactacgt gaaaaaccgg aatgctgaca 3240 ggtttgagaa agagtatccg agagcggtcg agtgtatcaa gtacgagcac tacgttgacg 3300 acatgttggc gagtgtcgaa accgaggaag aagcggtgaa gctagccact gaagtgcgaa 3360 ccatccattc tgaaggaggc ttcgagatac gaaattggtt atcgaactcc agcgacgttg 3420 tcacgagtct gcatgagaac agcaccacag aaaagaatat gagtttcagt acggagatgt 3480 cgacggaaaa ggtgcttgga atgtggtggg ataccacgac agacacattc accttcaggt 3540 tatcgccgaa gcacgatcag gagctgctct ctggtactag gatgcccaca aaacgcgaag 3600 tcctgaggac gttgatggca atatacgacc cattgggact tatcgccaat ttcctgatct 3660 acctcaagat cctgttgcag gaaatctggc gctctggatg cggctgggac gacgagatta 3720 gtggaaagat agctgataga tggtcgacat ggattgaagc gctaccgaac gttcgccaag 3780 tcaggattcc acgctgctat cgatacgaaa cgtccgctga accgacgaac aacgtggagt 3840 tgcatatctt ctgcgatgct agcgagaacg gaatagctgc tgtagcctat ctcaggttcg 3900 aagaaaaaca aatagtggag tgcgcgctgg tcgggtcgaa aactcgtgtg gcacccctaa 3960 agtttctatc catccctcgc ctcgagcttc aggctgctgt catcggagct cgtctcgccg 4020 attgtatcgt aaaatcgcat cgtttgaaga tcacgcaacg tgtattctgg acagattcac 4080 gtgacgtagt ttgctggctg cgatcggacc atcgacgata cagccagttc gttgcgttca 4140 gggtgagcga gctactcgac accacccaag tgagcgagtg gagatggctg ccgaccaaac 4200 agaatgtagc agacgaaggc acgaagtggc aaaggcctcc agatttccaa ccgagtagtc 4260 gctggttccg cgggcctgat tttctgtggg atcccaagaa ggaatggccc ggagatagcg 4320 gagatcaagg tgcgaccata gaggagatca gacccagtgt cctgcatcac acatccgttc 4380 tccgtcaaca gtggtgaact tcgagcgatt ctcaaaatgg aaacgactgt taagatcgat 4440 gggatacgtt caccgattta tcgctaccct tcgcaaaggc actaagaagc tcgtcggccc 4500 tctaacgcag gatgagctga aactggccga aaataccatc tatcggttgg ttcagcagct 4560 agcctaccca gatgaaattc aactgatccg tgacaaagat ccaggagctt atccgtggga 4620 gcgcgtccta ccgaaaacga gtccgttgca caagctgagt ccgttcatcg acgagcacgg 4680 tgtattacga atgcgtggac gtattgacgc ctgcgagtgg gcagacaagg ctgcgaagaa 4740 tccaatctta ttgccgagaa ataatcacgt aaccaacctc gtgatggcag attaccacgc 4800 agcgtgccgt catcagaacc accagacggc aattaatcag attcgtttaa aatacaacgt 4860 tccccgtctt cgttcggaat tcggccgcgt tcgcaaaagc tgtcagcgct gtaaaatccg 4920 acaagcacgt ccgcaacctc cagaaatggg gaaccttccg cccgctcgcc tggcagcctt 4980 ccagcgaccg ttttcataca ctggaatcga ctatttcggg ccgatgaccg tactggttgg 5040 cagacgcgct gaaaaaaggt ggggtgtcct gctgacctgc atgactacca gaggtgtgca 5100 catcgaagtt gcacattcgc tgacaacgga ctcatgcatt ctcgcgttac gcaacttcat 5160 tgctagaaga ggcgcaccgt tagaaattat aagcgaccga ggaacgaact tcatcggggc 5220 ctcccgagag ctttgcgaag ccctgaagca cgtcaacgaa gaggagctga tggtagaatt 5280 cgtcagcccc gataccaagt ggactttcaa tccaccagcc gctccgcatt ttggtggctg 5340 ctgggagcgc ttgatccaat ccgtgaagaa gaccctacac gatttcgagc caccccgctt 5400 gccatcggac gaaatccttc gaaccatgct attagagata gaaatgatcc taaactctag 5460 accgctcact gacataccgc tcgataacga taccgaaccc ccactaactc cgaaccactt 5520 cctgttaggt tctgctaatg gtagcaagcc cccgatcgcc tttgatgata aacctagtgt 5580 gctcaagaga tcgtggacga tggcccaact gtacgctgac catttctgga agaagtgggt 5640 ggccgaatac ttgcctacct tgacccgccg gacaaagtgg tttcaaccag ccaagccaat 5700 tcaagaaggc gatttggtga tagtcgtcga caacagcctt ccgcggaact gctggccaag 5760 aggacgagtc gtcaaggcgg ttcacgccaa ggacggacag gtacgacgcg taaccgtgca 5820 gacagcgagt gggcttcttg agagaccggc gaccaaagtt gcagtactag atgtcggtgc 5880 aaaaggaggt aagccacttg gccagcagtg gcgtaccgag ggggag 5926 // ID CR1-18_BF repbase; DNA; INV; 3832 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-18_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-18_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3832 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3832 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1589-1589 (2009). XX DR [2] (Consensus) XX SQ Sequence 3832 BP; 1130 A; 949 C; 786 G; 967 T; 0 other; cggtgaagga ggctactttt cacgtgagtt attcaatctc tccagcccac tgtactcccg 60 ccgtcatgtc gctgatgatt tccaggtcag tggtctttct cttgctattt ctcaacacga 120 ttcaggcctc aactatcgga catggcccag gtaaagtacg caggacgaag ccgccatgtt 180 acatgtttcc cggtggtagc gctcagccgc gaccgccatt accatcccta tggtaccccg 240 tccttagttt aaatgacaaa gaaattgctt cttggcaaaa tctgaatttc agttctgctg 300 ggatcctctt gaaacaagct agatatttga gaacatcaga cagactctgc ctactctaca 360 tttgttccgt actgatggca caggccgttg acttggaaac caaccctggt cctcggcctc 420 caaagtatcc gtgcggttct tgtggaaagg cggtgacttt taagcacaaa gcggtttgtt 480 gcgacagatg cgatttctgg tttcaccacg attgtcaagg cctcggttct ttcatgtatc 540 cctatctaag caattcgaat gtgtcatgga tatgtttaaa ttgtggactt ccaaacttat 600 caactacatt cttttctgac tcccgcgacc tatcctcccc taatccattt agccccctaa 660 gtaatgacag cccaggaatg ccgcaagctg cgtcttcccc gaaatcgcct gttcaaaaca 720 aacacaggtc caatgctagg ccattacggc tgatcaatgc aaatttccaa tcccttagaa 780 ataaaaaagt cgaactagaa accttagtag atcagacaaa accagacatc ttagtgatta 840 cagagacttg gttggatagc tcctgcaata tttcagaata cttcccaagc cacctcaaca 900 tgcaagtgtt ctataggaat agaccggagg acagccatgg tggtgtcttg atagctgtct 960 ccaatgaatt catatgtaca caggaacctc accttgagac caactgtgaa atggtttggg 1020 tgaagattaa tctggtagga tccaagtgct tgaatatatg tgcttactat agacctcagg 1080 taggagacag cgtaagtttg gactgtttag aagaatctat ggaaagaata tgtaacaaac 1140 gtaataacca tgtatggatt atcggtgatt tcaatttccc agggtgggat tggtcagacc 1200 cccagcagcc tgtacttaag ccagactgcc cctacccagg tctgcaccgc cgattcatgg 1260 aactgcttag tgatcaaaac atgtcacaag ttgtggataa accaaccagg tacgataaca 1320 cgttggacct tgtcctcatg tcaaatgata attgtgtcaa cagtgtacgt actttacccc 1380 ccattggcga tcatgaccta gtattcattg aggcagactt acggccccaa aagcaaaaag 1440 ccaaacctcg taagttgtac ttgtacaaac gttccaactg ggacaaattc cgtgatgaaa 1500 tggaagacta caaaactcat ttcctagaat tggtagaaga cgacgtagat gtgaacgagc 1560 tctacaacaa tttcactcaa aaactatcca tgtgtgttga taaatatgta cctaccaaaa 1620 tgtctagcgg caagaaacat ctcccttaca taacccctga acttaagcga atgatgagaa 1680 aaagagaccg tttttacaga aaaacaattg gtcaaaactc aactgaaagg aaagacaaac 1740 taaaccagtt caagaaagac attaataaga agatgaaaga gtgttactgg aaatacattg 1800 aggatgtagt gcttgacatg gacgtaactg accccgaaca aagccatgct acttccacaa 1860 gtaagcaagg taccaaaaag ttctggagtt tcctgaagag catgaagtca gaaagagctg 1920 gagtctctaa tcttagaaac gaaggcacac tgatttcgga caacaaggaa aaagccgatc 1980 tattaaacca acagttccaa tctgtgttct ccactgaacc tcctgatgac gctatgcctg 2040 acatggggcc tagcccgcat cccccaatga aggatatcaa catcgacaca aatgggataa 2100 atgaactact ggctaatcta aaccctcaca aagcatgtgg ccctgaccat gtccatgctt 2160 gtgtgctgaa agaactaagc tccacactaa gcccaatcct ccaagccatc ttccagaaaa 2220 gcctcgacac tggctctgtg ccagaggcct ggaaggaggc taacatagct cccgtctaca 2280 aaaaagggaa ccgcctagac ccggcaaatt accgaccaat ttcgttgaca tgtatatgct 2340 ctaaaatcat ggaacacgtt atagctagta ctatgatgaa tcactttgac tccaacaaca 2400 ttctctatga tctccaacac gggtttagac atagtaggtc atgtgaatct caactcttgt 2460 cacttacgga tgacctagca cataacaggg aaaatgggat tcagactgat ctcataatca 2520 tggactttgc aaaagccttt gacaaggtcc cccacttacg gttaatacac aaactccagt 2580 tttatggtat tacaggtaaa accttaacct ggatccagaa cttccttcag ggccgttcac 2640 agacagtagt actagacggt gagcgctctg accctgtacc tgtaacctct ggcgttcctc 2700 agggaacagt cttagggcca atattgtttc tagcttacat taacgacctg ccatcacatg 2760 ctgcacatgc aaaagtccga ctatttgccg atgattgtat tttgcagatg agtgtgaaga 2820 caaagaatga ttgtgaaaaa ctccagcacg acataaatag catttgttca tgggagaaga 2880 cgtggcttat ggcctttaat ccgtccaaat gcgaagtaat gtcggtccca gcctccagaa 2940 acccgataac attcccctac tcactccacc aacacccctt gaccaaggta agttccacaa 3000 aatatcttgg ccttcatatg tcttccaact tgacctgggg caaacacgtt gaaaaaacta 3060 catcaaaagc taacaggaca ttgggaatgt tacgacgctg cctgcggata tccagcactg 3120 cggccaaaga gagagcctac atggctctgg ttcgaccatg tctcgagtac ggctgtagtg 3180 tttgggaccc tcacaccaag gcccaggtga gtagcctaga gatggtacaa cgaagagctg 3240 cccgatatgt atttaacgat tacagacgca caagctcagt tacctccatg cttcacaatc 3300 ttggctggca gtccctcgaa cttagaagga agattgcccg ccaggtcacc ttgtacaaaa 3360 tcatcaacaa catcatgtac gtacctcaca cgaccatgct tgtcccggtg gctcgatgct 3420 cccgccgcac caatcactca caaacattgc aggccatcgc gtgcaggaac aactactaca 3480 ggctctcgtt cttccctcgt accatcagag agtggaatgg acttgagcct ggtgtggcgg 3540 aggcggggtc tctctcccag ttcaaaactg agctggggag gaccctgctg cattgagcca 3600 ccccccacac gtcttgtata tatgtaaata cgtgtcctat taacccactt aattcacctg 3660 tcctgtcttt ttgcacacag aaaatcagtt tgtttttatt tccctcagca ttttccttca 3720 tttttcccat gtctatttgt tctggtcact tgtattgtaa ttaaaccatg cgtagtctgc 3780 caataatctc aaaaaattga ggctaggcag taggaaagaa gaagaagaag aa 3832 // ID PERERE-8 repbase; DNA; INV; 1409 BP. XX AC BN000799; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 02-JUN-2010 (Rel. 15.07, Last updated, Version 2) XX DE Schistosoma mansoni Perere-8 non-LTR retrotransposon (EST). XX KW Rex1; Non-LTR Retrotransposon; Transposable Element; PERERE-8. XX NM PERERE-8. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-1409 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000799; Positions 1 1409. XX FH Key Location/Qualifiers FT CDS 2..1339 FT /product="PERERE-8_1p" FT /translation="NGYIFLNRYSNVFPKMCRKIKVSSIPKKASGNKNAKL FT RPIAITSPFLKIMEKLLIHPLQPAIKEHCDPFQFAYKCKRSTLDAVAVLHH FT NIVFGLEKGKKYVRCAFLDFTSAFDSIPRHLLLNKLISIDADSWISNWLCS FT YFSGREQYTVFEGKCSTSLLSTVGVPQGAVLPPLLFSFFLHDLPSSTENTF FT VKYADDLTVCMPISTSLHPIEMNEFLSRIERWSVGNGLLLNPSKCQAVNFS FT LRHGQNLRSILGSHNACAIGDSLINTVSKVKYLGVLFSSDLSWSSHVLLLS FT KKVYRLTYYIKRLHAFGITRRLLLQFVNSCILPIILYCSPLFFPGLLRKDF FT AILRRVLKAVCKVCGESFEVIVNMLVDRHLKSCKLFAGVILSDTNHPLHSY FT LSPCISSGRTRRKYIKIHARKQIYKSSVIPYLANLLCDEQAVRVDLVNNLS FT S" XX SQ Sequence 1409 BP; 395 A; 283 C; 238 G; 493 T; 0 other; caacggttac atatttctca atcgttatag taatgtcttc ccgaaaatgt gcagaaagat 60 aaaggtatcc tctataccta agaaagcttc tggtaataaa aatgcgaaac ttagacccat 120 agcaataacc tctcctttcc tcaaaataat ggaaaaatta ctgatacacc cacttcaacc 180 tgcaataaaa gagcactgtg atccatttca gtttgcttac aaatgcaaaa gaagcacatt 240 agatgccgtt gctgttctgc atcacaatat agtgttcggg ttggaaaagg gtaagaagta 300 tgttagatgc gctttcctgg actttacttc tgcttttgat tctattccaa gacacctctt 360 acttaacaag ctgatcagca tcgacgctga cagctggata agcaattggc tatgttccta 420 cttttctgga agagaacagt acactgtatt tgaaggaaag tgttcaacat ctctactgtc 480 tactgtaggt gtgccacaag gagctgttct tccacctctg ctcttctctt tctttttgca 540 tgatctgcca tcttccacag aaaacacttt tgtaaaatat gcggacgacc tcaccgtatg 600 tatgccaatt tctacctcct tacatcctat agaaatgaat gagtttttat ctcgtattga 660 aaggtggtct gttggaaatg gtctcttact caatccatct aaatgtcaag ctgttaattt 720 tagcttgaga catggacaga acctacgctc tattttggga tcccataatg cttgtgccat 780 tggagactcc ttaataaaca cagtgtcgaa ggtcaaatat cttggtgtcc ttttttcctc 840 tgatctttct tggtcctctc atgttttgct gttatcgaaa aaagtttacc gtttgacata 900 ctacataaag aggctgcatg cttttgggat tactcgccgt ttactcttac aatttgtaaa 960 ttcctgcata ttacctatta ttttatattg ttctccatta ttctttcctg ggcttttgag 1020 aaaagacttt gctattttgc gtagagtgct gaaggcagtt tgcaaggtat gtggtgaatc 1080 ttttgaagtc attgttaata tgcttgtgga tagacatctt aaatcttgca aactctttgc 1140 aggtgttatc ttatcagata ctaaccatcc gcttcattca tatctttctc cttgtatatc 1200 ttctggtaga acgagacgta aatacattaa aattcatgca cgcaagcaaa tctataaaag 1260 ttctgtaata ccttacttag ctaatttact ttgtgacgaa caggctgtta gagttgacct 1320 agtaaataac ctgtcttctt aaacatgtct catattcgat aatttgtata aactcagatt 1380 ttcttaaacc ttttttatat tttatttct 1409 // ID Mariner-40_HM repbase; DNA; INV; 2962 BP. XX AC . XX DT 23-JUN-2010 (Rel. 15.06, Created) DT 23-JUN-2010 (Rel. 15.06, Last updated, Version 2) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-40_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2962 RA Jurka J. and Kojima K.K.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 10(6), 839-839 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. TA TSDs. 9-bp TIRs. The protein is CC distantly related to Mariner/Tc1-type transposase. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 578..2572 FT /product="Mariner-40_HM_1p" FT /translation="MASDRKFSRTDRDKYSLEEKNALGKLCKSYKKEYDLS FT IAENFQKVNFSEKRKKHVKTLPREGYLARAVREFYSDLKDIKHDNPTLTKA FT LKLGKRCLNQVEADEDAVTAPPSKSRYRQAGGGRKLTIPDVRQLLFEWFVD FT IRGTLKARLPRKMFTAQCKLFYEQWLAQQPEEVPENKKIVFSNRWLNNWMS FT EHGVSLRHPNKRFQIMQADREERIFEYLKNIWTVRKYFIENFGVDPPVLNG FT DQMPLHRNESSTQKTLNFTGLDTYVKENYSLSRERITVYTQVCSDPKVSIK FT PEFVFKGKGVRVKLNPPQDVKFQWAPKGSYRLEHMLSTIANLPNRHNIFTH FT QNYGIYVLDDYSVHIMPEIKAALLKKGYIYVGIGGGVTGDVQINDTDFHRP FT LKAKYRELEQNLMIEQLRLDPKKIPQPSRDHMMQMLLKSWNSLEIDVTSRF FT KALWVTNALDGSEDYLVSERIFALVGEKLQAFRKKLMLTPSPRNLKDLQGL FT ITPPKGVKRKQGQNTCETEPLDEGYELFDCEGETLFINQLDEEHQTDEEND FT TNLETSALENQDENNRVNLNTDSSSNILLGDLCQGNDDLRKDAVFIDELGK FT LLCNAETSTQFIPHLTRLKRSYITSRRAIKSRIENNFEAFDNEQLENEVEH FT RDENDSIRNAFEDLFK" XX SQ Sequence 2962 BP; 1045 A; 553 C; 508 G; 856 T; 0 other; ctctcgagtt cctaataaac gtcccccccc ccccccgttt attaattttt ggatattttt 60 cccacccacc ctaagcttat taagaccccc cccccccccg tttattaatt tttacttgtt 120 atcaacaaaa aaattatata tctcaaaagt aaaaaaaaaa attcagtgaa aatcggcctt 180 aaaaatttga aaataatttt cagtcaccga taaaaacaaa aactttcagt gtaaacaggt 240 tttacaaatt gtcgacctat taaatctcga tttgctaaaa atgaactttt agtttatttc 300 tcaacaaagc tttggtttta attataggat ttaatatgct gaatataagt tttatacttt 360 gcaaagtata aaacttatat cagcatatta aatcctattg aaattatgat aaaacagcct 420 aaccctaccc taacagctgt tagaaaatgc gcttttgacc gcaagtcaaa tgcgcatttg 480 acttgaacat gcataaatag ttttatgcat gctttgtaaa aaatattggt ttggatttat 540 aaattaaaat tttttacaaa aatatttata ttgtaacatg gcgtcagatc ggaaattttc 600 aagaactgac cgagataaat acagtttgga agaaaaaaat gctttaggta aactttgtaa 660 aagttacaaa aaagagtacg atttatcgat tgctgaaaac tttcaaaaag tcaactttag 720 tgaaaaaaga aagaaacacg ttaaaacatt gccacgcgaa ggatacctcg caagagctgt 780 aagagagttt tattcggacc ttaaagacat taagcatgat aatccaacct taacaaaagc 840 cctaaaactt gggaaaagat gtctaaacca agttgaggca gatgaagatg ctgtaactgc 900 tcctccaagc aaatcaaggt atcgtcaagc aggaggagga cgaaaattaa ccatcccaga 960 cgtcagacaa cttttattcg agtggtttgt agacatccgt ggaaccctta aagcacgttt 1020 acccaggaaa atgtttacag cacaatgcaa acttttttat gaacagtggc ttgcccaaca 1080 accagaagag gtgccagaga acaagaaaat tgttttctca aaccgttggt taaataactg 1140 gatgagtgag catggtgtca gtctgcgcca tcctaacaaa cgctttcaaa taatgcaggc 1200 tgaccgtgaa gaaagaatat ttgagtacct gaaaaacatt tggactgtac gtaaatactt 1260 tattgaaaat tttggagttg atccacctgt gctgaatggt gaccaaatgc cactccaccg 1320 aaatgaaagt tctacccaaa agaccttgaa ctttactggt ttggacacat acgtcaaaga 1380 aaattattct ctttcaagag aacgaattac tgtttacact caagtttgca gtgaccctaa 1440 agtctcaatt aaaccagaat ttgtgtttaa aggcaaaggt gttagggtta aacttaatcc 1500 tcctcaagat gtgaagtttc aatgggcacc taaaggttcc tatcgattgg aacatatgct 1560 gagtacaata gctaatttac caaacaggca caatattttt acacatcaaa attacgggat 1620 ttatgttcta gatgattata gtgttcatat aatgccagaa atcaaagctg ccttactaaa 1680 gaaaggctat atttacgttg gaataggtgg tggtgttaca ggggacgttc aaattaacga 1740 tacagatttc cacaggcctt taaaagctaa ataccgagag ttggagcaaa accttatgat 1800 agagcaatta aggcttgacc ccaaaaaaat tccacagcca tccagggatc acatgatgca 1860 aatgctacta aaaagttgga attcgttgga gattgatgtt actagtcgat ttaaggcact 1920 gtgggtaacc aatgctttag acggaagcga ggattatcta gtttcggaaa gaatatttgc 1980 cttagtggga gaaaagcttc aagcatttcg gaaaaaactt atgcttaccc caagcccaag 2040 aaacttgaaa gatcttcagg ggctaattac acctccaaaa ggagtgaaac gaaagcaagg 2100 acaaaacact tgtgaaacag agccgctaga tgaaggatat gaattatttg attgtgaagg 2160 tgaaacatta tttataaatc aacttgatga agagcatcag acagacgagg aaaacgatac 2220 aaatttggaa acaagtgctt tagaaaatca agatgaaaac aatcgagtta atttaaatac 2280 cgatagttca tccaacattt tattaggtga cttatgtcaa ggcaatgatg atctaagaaa 2340 agatgctgtt ttcattgatg aattaggaaa acttttatgt aatgcagaaa catctaccca 2400 atttattcca catcttacaa gattgaaacg gagctatata acatctcgcc gtgccataaa 2460 aagccgtatt gaaaacaact ttgaagcctt tgacaacgaa cagttagaaa atgaagtgga 2520 acaccgcgat gaaaatgatt caatcaggaa cgcttttgag gatctattta aataaaaaga 2580 tcatttttat atattttatt taaagttaat aaatattgat aattaaacct tttttaaagt 2640 cgtacatttt cttttggtaa attatttttg tacgttacta cgggggttgt aagaaggggt 2700 acggggatta cattctctct ctgtttacta gaactagaag atgttccttt tttttaaaca 2760 aaaaaacccg cacatttatt ttgtagataa aatcctcctc ctactataaa tccccccccc 2820 ccccgtttat taaatttcag aaaaaatctc tcattccccc ccccccccgt ttattaaccc 2880 cccccccccg tttattagat tttgactaaa attcccaccc acccgtttat tcccaccccc 2940 cgttgtattc ggcactcgag ag 2962 // ID Gypsy-590_AA-I repbase; DNA; INV; 6716 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-590_AA_; KW Gypsy-590_AA-LTR; Ty3_gypsy_Ele50; Gypsy-590_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6716 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2626-3087] - Reverse transcriptase CC Positions [4378-4848] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 491..1687 FT /product="Gypsy-590_AA-I_2p" FT /translation="MPKEHKVRNSLKQTHHLLSKIRNSACVSSDSDSSEST FT DSGDLAHIPIKNIPSLEILSLNDNNNMEEVNAKLQTMMQMMEALAKQQTEH FT ATYFSQIQTQTNQTQENVIAHQPISNIDQLFKIPDPIKMIPIFDGNRKQLS FT SWLETAEETLNVLKPHVSAQQFRMYFTAVSNKVQGKAKDILCLAGNPDKFE FT DLKEILTNALGDRHELSTYKSQLWHCKMTEDMSIHIYYKKSKEIIQNIKTL FT AKQKDTYKNHWDAINEFIEEDALAAFISGLSEPYFGYAQAARPKDIEDAYA FT FLCKFKSKQVVARNLNQDQKFNKNKIFENKSQGSSNFNQNKKPFIKTEHKD FT TTEPMDTKTTRSRLTINNHVTEEEETQSHLNSETESDEEVDLDLNFHLVNP FT TQKTT" FT CDS 1915..5133 FT /product="Gypsy-590_AA-I_1p" FT /translation="MKFHYFFDGILGSESLAKLKAQINYENETITLRDKQF FT SYSKFYPAKKLYNHFITIDTMNNGDWLVPTYQKLHKKVFIQPGLYKAHNNK FT STIHVISATKNIENLPKLQLTVNNFETILPEKIDSQHISKNDIESIIRTNH FT LSKLEKQELLCTLFKHQEVIQKKGEKLSCTSATKHKIITTNDSPIYTKNYR FT YPHHFKGDIQQQIEEMLDNGIIRPSKSPYSSPIWVVPKKLDASGKRKVRVV FT IDYRKLNELTIDDRYPMPQIEDILDSLGKSSYFTTIDLKSGFHQIPMDPAH FT IEKTAFSTDKGHFEFTRMPFGLKNAPATFQRAMNNILGDYIGTKCYVYLDD FT IIIIGYNLKDHLENLSLILKRLSDFNLKIQLDKCEFLKRETEFLGHIISNE FT GVKPNPEKIDKILKWPLPKTQKEIKQFLGLVGYYRRFIKDFSKITRPMTKY FT LKKDSSININDSSYHTAFDTLKGIIATDQVLVYPQFDKPFIVTTDASGYAL FT GAVLSQIQDNIERPIAFASRTLNDAETRYATNEKEALAIIWAVNKFRPYLY FT GTKFTLVTDHKPLTFIKSSEKNQKILRWRLDLEDYDYEVKYREGKSNVVAD FT ALSRMPVEVNINETNNDEIPYFSGSEDLEDDADSIANNSPESHDSLDSQTV FT HSADDSANNYIHFTERPLNYFKNQIIFRISRIDTIIQETLFQNYHRTIVSQ FT KTYTKENITHLLKTFHNGRQTALMAPENLIQTVQESFKENFNQKGHFVFTN FT NMVEDVQNEERQNQIIINEHNRAHRGITEVEAKIKRSYFFPNIYSKIRTFI FT NSCEICNTHKYERRPFNIKISPRPITEKPLDRVHMDIFIMDKCSFLSLIDS FT FSKHLQLIVLKSKNLVHVQKALGKYFSSFGVPREIVTDHETTFQSIQLRNF FT LSQLGAQIHYAASSESNGQIERTHSTIIEIFNTNKHKFRGMGTKSIVKLSV FT ALYNNSVHSSTKYTPNEILFNQNNIVNPEEIIRDAQELFLKAKLNMGKAQK FT QMISQNSNKEDPPIINDGQEVYVIPNIRTKTQPRANKTNANEVTDRTFKNN FT RHIKRHKNKIKRPKKM" FT CDS 5169..6716 FT /product="Gypsy-590_AA-I_3p" FT /translation="MLHYFITFTLILTNISCQDLTIRNLHNDLIMMQKNSL FT CRIQTGNIRVVHPINLTDIEITINQLTNLVYRKQNNNALFEISKYKIRELY FT SNFMQIKPISHKRSKRWDIIGTAWKWIAGSPDAQDLRIIDRSLNELVNENN FT HQVKINHQIGKRISELTTTINELIEKQQINQIILDELDVITTILNIDTLNK FT ILTSVQEAILFSKMHVTNSRMLSGKEIHLIKDILRNQGVQLDIPEEALNFV FT TPKMATSENTLLYILHVPELETEESTIMRIHPLSQNDSIIKTYPQFLIKQA FT KQLFTTAKPDDYVQRHSFIKEFKDLCIYPLIMGTEPQCLMEINAETTTKLI FT TNNKILISNAKNQELRSNCGPSDRYLSGNFIVSFSNCTIEFMDQNFTSTEK FT ISESESIQGALHNLQINSEILQNHDVAKIEKATLLNRRKLEKVSLQHETDQ FT IWKWSLLGGTALMSTLLFFTLYLIFIHLRFTHKRTSRKTHQRKPETKSPGH FT GNQELSIEDDTSSPPGGVTT" XX SQ Sequence 6716 BP; 2595 A; 1407 C; 1050 G; 1664 T; 0 other; ggattggcgc agccgtgcag tgctcgaaaa acaatttcgc gaaaaacagt gtagtgactc 60 tggtctacca gccagaatcg aagtgaaccc gcgaaagcat tcgctaacat cgagggtgca 120 gcggagttga ggttccaccg cgtgaagttg gacgtcgaga gaacccacgc acaacgattg 180 caaggtgagt gttttttttc tttatcctct attccacacc gaaatacgag tgaaagtgaa 240 aacatcgcct aggttggcgt tttatttttc cgtgagtttg aagcccgttc taagagaaag 300 gctcgaactc attaagaaat taaaatttgc taaacccctg gagtgagaag agttctaaga 360 gagcttcccc tccaaccgaa attgtgaagt tctaagagaa ttacatttcg ggagtcatac 420 gagttccgag agatcgtacg gctcaaaaca aacaaaacta tagcttgctt ttatcttcta 480 agtgcatgtc atgccaaagg agcataaagt aagaaacagt ctgaagcaaa ctcaccactt 540 gttatctaaa attagaaaca gcgcgtgcgt gtcttcagat tcagacagct ctgagtctac 600 cgattcggga gacttagcac atattccaat aaagaacatc ccttcactag aaatcctttc 660 tctaaacgac aacaacaata tggaagaagt caatgccaaa ctgcaaacga tgatgcagat 720 gatggaggca ctggcaaaac aacaaacaga acatgccacg tatttcagtc aaattcaaac 780 gcaaacaaat caaactcaag aaaacgttat cgcacatcaa cctataagta acatcgacca 840 gctattcaaa attccagacc cgatcaaaat gatccccata tttgacggca atagaaaaca 900 attaagttca tggctggaaa cagcagaaga aacccttaat gtattaaaac cacatgtatc 960 agctcagcag tttagaatgt actttactgc ggtatctaat aaagtccaag gcaaagcaaa 1020 agacattctt tgcttagccg gaaatccgga caaatttgaa gatttaaaag aaatattgac 1080 gaacgcgttg ggagatagac acgaattgtc aacatacaag agccaattat ggcattgtaa 1140 aatgaccgaa gatatgtcta tccatattta ctacaaaaaa tccaaagaaa taattcaaaa 1200 tattaagact ctagcaaaac aaaaagatac atacaaaaat cattgggacg ccattaacga 1260 gttcattgaa gaggatgcgc tagcagcctt tatatctgga ctctcagaac cgtactttgg 1320 ttatgcacaa gctgcacgac caaaggatat tgaggatgca tacgcattct tatgcaaatt 1380 caaatctaaa caagttgtag ctcgaaacct gaatcaagac caaaaattca acaaaaacaa 1440 aatatttgaa aataaatccc aaggatcatc aaatttcaat caaaataaga aaccttttat 1500 aaaaaccgaa cataaagata caactgaacc aatggacaca aaaactactc ggagtcggtt 1560 aaccataaat aatcatgtaa cggaagaaga ggaaactcaa tcacatttaa attcagaaac 1620 tgaatctgat gaagaagtag atctagatct aaattttcac ttggtcaacc ccacacaaaa 1680 aacaacctaa attacctccc atatttgaaa ctgaatcatc ctaaaacaaa aaagcagtta 1740 aaaatattaa tcgatactgg ggcaaacaaa aatattattt caccaaacat aatagaatcc 1800 gttaaaaccg taccgaatac acaaatcagt aatgtttgcg gagtcaacaa cgtcacttcg 1860 aaaggagaac tcgatctttt cggaaatacg tttaaacctt tgcaatttta cgtaatgaaa 1920 tttcattatt tctttgatgg aattctgggg tcagaatccc ttgcaaaact taaagcacaa 1980 atcaactatg agaatgaaac cattacgtta agggataaac aattttcata ctcaaaattc 2040 taccctgcaa aaaaacttta caatcatttc ataactatcg atacaatgaa caacggagac 2100 tggttggtac ccacatacca aaaacttcat aaaaaagttt tcattcaacc aggactatat 2160 aaagcacata ataataaatc gaccatccat gttatttctg caaccaaaaa catagaaaat 2220 cttccaaaac ttcaactgac agtaaacaat tttgaaacaa ttctcccaga aaagattgac 2280 tctcaacata tttccaaaaa tgatatagag tctataatta gaacaaatca tctttcgaaa 2340 ctagagaaac aagaactgct ttgcactctc ttcaaacacc aagaagtgat ccaaaagaaa 2400 ggagaaaaac tttcatgtac gtcagctaca aaacataaaa ttataacaac aaacgattct 2460 cccatttata caaaaaacta tcgctaccca caccacttta aaggcgatat ccaacagcaa 2520 attgaagaaa tgctagacaa tggcatcatt aggccttcga aaagcccata ctcttcaccg 2580 atatgggtag taccgaaaaa acttgacgcg tcaggcaaaa gaaaggtacg cgtcgtaata 2640 gattatagga agctgaacga gctcactatt gacgacagat accctatgcc tcaaatagag 2700 gatattctgg atagcctcgg aaagtcatcg tacttcacaa cgatcgactt aaagtctggt 2760 ttccatcaaa tcccgatgga tcctgcccac atagaaaaaa cagctttttc aacggacaaa 2820 ggacattttg aattcacgag aatgcccttt ggcctaaaga atgctcctgc tacattccag 2880 agagcaatga acaatatcct aggagattat atcggcacca aatgttatgt ctatcttgat 2940 gacataatta ttatcggata taatctaaaa gaccatttag aaaatcttag ccttatcctg 3000 aaacgcttat cggatttcaa cttaaaaatc caactcgata agtgtgaatt tctcaagcga 3060 gagaccgaat ttctagggca tataatttca aatgaaggtg tcaaacctaa ccccgaaaaa 3120 atagataaaa ttctgaagtg gcctctacca aaaacgcaga aagagattaa acagtttctg 3180 ggtttagtag gttattacag aagatttatt aaggattttt caaaaataac tagacctatg 3240 accaaatatc tcaagaaaga ttcatcaatc aatataaatg attcatcgta tcacacagca 3300 tttgatacgc tcaaagggat aattgccaca gatcaagttt tagtctatcc ccagtttgat 3360 aagcctttca tcgtaacaac agatgcttca ggatatgcat taggtgcagt tctatctcag 3420 atacaagaca atatagaacg cccaatcgcc tttgcatcca gaacactaaa cgatgcagaa 3480 accagatacg ctaccaacga aaaggaagcg ctggcaatca tatgggccgt gaataaattc 3540 agaccatatc tttacgggac taaatttact ttagtaactg accataaacc tctgactttt 3600 attaaaagtt cagagaaaaa tcaaaaaatt ttgcgatggc gcctagatct ggaagattat 3660 gactacgaag tcaagtacag agaaggcaaa tcaaatgtag tagcagatgc actaagcaga 3720 atgcctgttg aagtaaatat taatgaaacc aataatgatg aaattcctta tttttcaggt 3780 tccgaagatc ttgaagatga tgcagattcc attgccaata actcccctga atcccatgat 3840 tcccttgatt cccaaactgt tcactcagct gatgattcag ctaataacta tattcatttc 3900 acggaacgcc ctttaaatta tttcaaaaat cagataatat tccgtatttc ccgtatagat 3960 actattatcc aagaaaccct tttccagaac taccatagaa caatagtatc tcagaaaacc 4020 tacacaaaag aaaatatcac acatcttttg aaaaccttcc acaatggtag acaaacagcc 4080 cttatggcac cagaaaatct tatccagacg gtacaggagt cctttaaaga aaacttcaat 4140 caaaaaggtc atttcgtttt tactaacaat atggtcgaag acgtgcaaaa tgaagaaaga 4200 cagaatcaaa taataataaa tgaacacaat agagctcacc gaggtataac tgaagtcgaa 4260 gccaaaatta aaagatcata ttttttccca aatatttact caaaaatcag aacctttata 4320 aactcatgtg aaatatgcaa tactcacaaa tatgaacgca gaccatttaa cataaaaatt 4380 tcacccaggc caataacaga aaaaccatta gatagagtcc acatggacat attcatcatg 4440 gataaatgca gcttcttatc tctaatagat tcattctcta aacatttaca gttaatagtt 4500 ttgaaatcga aaaatcttgt acatgtgcag aaagcactag gaaaatattt cagctccttt 4560 ggagtaccca gagaaattgt caccgaccat gaaaccactt ttcaatcaat tcaactaaga 4620 aatttcctta gccagctagg ggcacaaatt cattacgccg cttcgtcaga atcgaatgga 4680 caaatagagc gtacacattc tactataata gaaattttta ataccaacaa gcacaaattc 4740 agaggaatgg gtacaaaatc aattgtaaaa ttatccgttg cattatataa caactctgta 4800 cactcttcaa ctaaatatac tccaaatgaa atattgttca accagaataa tattgttaac 4860 ccggaagaaa ttataagaga cgcccaagag ctattcctta aagcaaaatt gaacatgggt 4920 aaagcacaaa aacaaatgat ttctcaaaat tctaataagg aagaccctcc aattataaat 4980 gatgggcaag aagtttatgt tattccaaat attcgcacaa agactcaacc cagagccaat 5040 aaaacaaatg caaatgaagt cacagacaga acattcaaaa acaaccgtca cattaaaaga 5100 cataaaaaca aaatcaaaag accgaaaaaa atgtaacacc atttactttt ataacttttt 5160 gcttccagat gcttcactac tttatcacat tcaccctcat tctcacaaac atctcatgtc 5220 aagacctaac tatcaggaat cttcataatg atttgataat gatgcaaaaa aacagcctat 5280 gtagaattca gacaggaaat atccgagttg tacacccgat caacctaaca gacatagaaa 5340 ttactatcaa tcaactcaca aatttggtgt atcgtaaaca aaataataac gctttatttg 5400 aaattagtaa atacaagata cgagaattgt actctaattt tatgcaaatc aaaccaatat 5460 cgcacaaacg gtccaaaaga tgggacatca tcggcacagc gtggaagtgg atagcgggca 5520 gcccagatgc tcaagaccta cggataatcg acagaagcct caatgaactt gtcaacgaaa 5580 ataatcatca agtaaaaatt aatcaccaaa tcggcaaaag aatttcggaa cttacaacca 5640 ccatcaacga actaatcgaa aaacaacaaa tcaatcaaat cattctcgat gaacttgacg 5700 tcatcaccac catactgaac atagatacac tcaacaaaat tctaacaagc gttcaagaag 5760 cgatactctt ttctaaaatg cacgtcacaa acagcagaat gttatctggc aaagaaattc 5820 atctgatcaa ggatatactc agaaatcaag gagtccagct agacatcccg gaagaagcac 5880 tcaattttgt tacaccaaaa atggcaacta gcgaaaatac tttactctac atattacatg 5940 ttccagaatt ggagacagaa gaatcgacaa tcatgcgaat ccacccattg agtcaaaatg 6000 attcgattat taagacttac cctcaattcc ttatcaaaca agcaaaacaa ctcttcacca 6060 ctgcaaaacc agacgactac gtccaaagac acagttttat caaagaattc aaggatctgt 6120 gtatttatcc actcatcatg ggaactgagc cacaatgtct catggaaatc aacgcagaaa 6180 ctacaaccaa actaataacc aataacaaaa ttcttatctc aaatgcaaaa aaccaagaat 6240 tacgctcaaa ttgcggtcct agcgacagat acttatctgg aaattttata gtatcgttct 6300 caaattgtac aatcgaattt atggaccaaa atttcacatc taccgaaaaa ataagtgaaa 6360 gcgagtctat tcaaggagca ttacacaatc tgcagataaa cagcgagatt ttgcaaaatc 6420 atgacgttgc aaaaatagaa aaagccaccc ttctgaacag acgcaaactg gaaaaagtat 6480 ctcttcaaca cgaaaccgac cagatttgga aatggagtct attaggagga accgctctga 6540 tgtcgactct actattcttc accctttacc ttatcttcat tcatttaagg tttacccaca 6600 aaaggacatc gaggaaaacc caccaaagga agcctgaaac caaatcacca ggacatggta 6660 accaggagct gagtatcgag gacgacactt cttcaccccc cggaggagtc acgaca 6716 // ID Gypsy-99_CQ-I repbase; DNA; INV; 4177 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-99_CQ_; KW Gypsy-99_CQ-LTR; Gypsy-99_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4177 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 577-577 (2011). XX DR [2] (Consensus) XX CC Positions [1556-2059] - Reverse transcriptase CC Positions [3194-3661] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2816..4153 FT /product="Gypsy-99_CQ-I_3p" FT /translation="MIDLEVIGRETQRDALLSTLFQLVQSGWDESNVPADL FT KFYFANQSCLSLFNDCVLYSEKVIVPKSCQKRVLELMHGCHLGVIRMKQEA FT RRYVYWPGLDKDIEEFVQRCEVCSKTGRMPKKVYSKWPEASRPFERVHLDF FT FHFAGKTFLIVVDAYSKWIDVRLMSRTDSDSLINALNSVFRIFGKSDLIVS FT DNGPPFNSQAFVDYARRMKIELKKSPAYSPESNGLAERGVQTAKSGLKKLM FT ADPKYGSCKIPELVEVFLFSYRNSYCQALGCSPASKIFSFVPKTDLDSNLK FT PKQEKVKKRVRFDLRVKEKVIDKRESVNEKPRVRKNDDYAVGDLVWYRCEY FT KTMRSWLEAVIVGVNSKNTYYIEVSGNTKLASRNQLKRRVVEKNDYIYPKV FT EPNVIVTPKKRYKRRREKSSPLKTPERPAKLRRSKRVRRRPERYEVFRFHK FT M" FT CDS join(140..1018,1022..2362) FT /product="Gypsy-99_CQ-I_1p" FT /translation="MTKPEEEEKNAGAAEHAMKLIGTLENFVPSADFDDYL FT ERAENFFELNCITDDEFKRKLIVHFIGLPALKKLQQLLYPKTHKQSTYKEV FT TDKLKSYFSPQKNRIAQSVEFFKRSQHEYEKVADFAVELQALSKHCVFEQF FT LDAALRDKFIAGLRNAKIQAELMNSPDNTKFDEAVTKAKNLEQIEEDQQKM FT KAKQQYANRVNGGNFNRRNRSQSRKPEQSERGQRSSSRGRPSSRGGPRGRK FT DFRCYACGKKGHMARSCWSNKANVVNRSDDDSVVDSDDEVNHVVNVPLFKR FT LLDGKWLEFEIDSGASYTIISVNDYRENFARLKLQKCEVRLRVISGTMLNI FT AGSIRVRVKCDGRSYTMTLIVIDGKSTFHPLLGRDWLDVLYPEWRQFFENE FT VAEVDQSEVKLDSTRTTLLSNLNARYHDLFTKNLEEPIETFEASVVLKEDA FT RPIFYRPYEVPFALKEKVSSELDRLVREKILVPVKHSEWASPIVVVPKADG FT SVRICMDCKVTVNKAICTEHYPLPNISDVFANLSGYRYFAKIDLQGAYMQV FT RVSEDSQKYLVINTHKGLFAYQRLPFGISNAASAFQFIMTEGILKGVEGVQ FT CYLDDILLGSETVEGLVDKIHEVLGRLKEFKVKVNLEKSEFLVKKIAYLGH FT QVSEKGLSPSEEKVKAIVDAPRPKDVSQLKSFLGMINYYSKFVPNLSVKLC FT PLYALLKKNVQFKWSAECESAFDKCKKLLLSNRLLELYNP" XX SQ Sequence 4177 BP; 1104 A; 888 C; 1241 G; 942 T; 2 other; ttggcgacga aggaaaatag caacgaggtt gctggtaacg cgtggagtag tcgagtaagt 60 agagtgatcg gaagggagaa gtttgccgtg gacgccaaaa accacgtgct gtgtgatcgt 120 agtgctcgag gtcagtgcca tgacgaagcc ggaagaagaa gagaagaacg ccggcgcggc 180 ggaacacgcg atgaagctga tcggaacgct ggagaatttc gttcccagtg cggatttcga 240 cgactaccts gagagagccg aaaacttctt cgagctgaac tgcatcacgg atgacgagtt 300 caagcggaag ctgatcgtcc attttattgg cttgcccgcg ctgaagaagc tgcagcagtt 360 gctgtacccg aaaacccaca agcagtcgac gtacaaggag gttaccgaca agctgaagtc 420 gtacttcagt ccccagaaga accggatcgc ccagtccgtg gagttcttca agcgaagcca 480 gcacgagtac gagaaggtcg cggatttcgc ggtcgagctg caagccctgt cgaaacactg 540 tgtcttcgaa cagttcctcg acgcagccct gcgcgacaaa ttcatcgccg gcctacggaa 600 cgccaaaatc caggccgagc tgatgaacag cccggacaac acgaagttcg acgaagccgt 660 cacgaaagcc aagaatctgg agcagatcga ggaggaccag cagaagatga aggccaagca 720 gcagtacgcc aaccgagtca atggcggaaa cttcaaccga cgaaaccgtt cgcagtcccg 780 gaagccggag cagtccgaga gaggccagcg cagcagttcg cgaggcagac ccagctcgcg 840 aggaggacca cgtggccgca aggatttccg ttgctacgct tgcggaaaga agggccacat 900 ggcgcggtcg tgctggagca acaaggcgaa cgtggtgaac cgcagcgacg acgacagcgt 960 ggtcgactct gacgacgagg tcaaccacgt ggtgaacgtt ccgctgttca agaggttgtg 1020 actagacggt aagtggctgg aattcgagat cgactctggg gcatcctata ctattatcag 1080 tgtgaacgac taccgagaaa attttgcgcg cttgaagctg cagaagtgtg aagtgaggtt 1140 aagggttatt tcgggcacga tgttgaacat tgcggggtca atacgggtga gagtgaagtg 1200 cgatggccga tcgtacacga tgaccttgat cgtgatcgac gggaaaagca cgttccatcc 1260 gttgctcggg cgagactggt tagatgtgct gtacccggag tggaggcaat tttttgagaa 1320 cgaggtcgca gaggtcgatc aaagtgaggt taagttggac agtactcgca ccacgttgct 1380 ttctaacctc aacgcacgct atcacgactt gttcaccaag aacttggaag aaccaatcga 1440 gacttttgaa gctagtgtgg ttctgaagga ggacgccagg ccgatcttct accggccgta 1500 cgaagttccg tttgccctga aggagaaagt gtccagtgag ctggataggt tagtgcgtga 1560 gaagattttg gttccggtca agcacagtga gtgggcatcg ccgatcgtgg ttgtgccgaa 1620 agcagacggt agtgtacgga tctgtatgga ctgcaaagtg accgtgaaca aggcgatctg 1680 taccgaacat tatcctttgc ctaacataag tgatgttttt gcgaacctaa gtggttaccg 1740 gtactttgcg aagatcgatc tgcaaggggc gtacatgcag gttagagtgt cggaggattc 1800 ccagaagtat ctggtgatca acacgcacaa gggacttttt gcctaccaga gactaccctt 1860 cgggatctca aacgcggcct ccgcgttcca gtttatcatg actgagggca ttttgaaggg 1920 ggtagaagga gtccagtgtt atctagacga cattttgttg ggaagtgaga ccgttgaggg 1980 gttagtggac aaaattcacg aggttctggg acgactgaaa gagttcaagg tcaaggtgaa 2040 ccttgagaag agcgagttct tggtcaagaa aatcgcttat cttgggcatc aggtttctga 2100 gaagggtctc agtccgtcgg aggagaaggt caaagcgatc gtagatgcac caaggccgaa 2160 agatgtgtcc cagctgaaat cgttcctcgg aatgatcaac tactactcca agtttgtccc 2220 caatctgtcc gtgaagttgt gtcccctgta cgcgctgttg aagaaaaacg ttcagtttaa 2280 gtggtctgct gagtgtgaaa gtgcgttcga caagtgcaag aagctcctgt tgagtaaccg 2340 actcctagag ctgtacaacc ctgakctacc gatcgtggtc gtgtgcgacg cgagcttgaa 2400 tggcgtggga gccgttctgt gccacagggt tggtgacgtg gaaaagcccg ttttctacgc 2460 gtcgagcact ttgtccgctg ctgaacgaaa ctacccgaat ctgcatcgag aagctcttgc 2520 cgtggtcttt gccctgacga agttcttcaa gtacatctac ggtaagcagt ttacggttgt 2580 caccgacaac aagccgttag cagccatttt ggaccagaag agaggtctgc cgccgttggc 2640 cgcggcgcgg ttgcagaaat acgtgtattt gctttcgatt tttgaatttg ggatcgtgta 2700 tcgcaagggg tcgaagattc caaatgcgga cgctttgagt agactaccgg tttctggtac 2760 caccggggtc gactcggaga ttgctgagct gttgagcgtg accgaggagt cagccatgat 2820 cgatctggag gtcataggcc gggaaactca gagagatgct ctgctgagta cgcttttcca 2880 gctggtgcaa agtgggtggg acgagagtaa cgtacccgcc gacctgaagt tttacttcgc 2940 aaaccagagt tgtctgtcgt tgtttaacga ttgcgtactt tattcggaaa aagttattgt 3000 tccgaagtct tgccaaaaac gtgttctaga gctgatgcat ggttgtcatc ttggagtgat 3060 ccggatgaaa caagaagctc gacggtacgt ttactggccg gggttggaca aggacatcga 3120 agaatttgta cagcggtgtg aggtttgcag taagaccgga agaatgccga agaaggttta 3180 ctcgaagtgg ccggaggcaa gtagaccgtt cgagagagtc catctggatt tctttcactt 3240 cgctggaaaa acgtttttga ttgtcgttga cgcctactcg aagtggatcg acgtgaggtt 3300 gatgagtaga accgattcag attcgctgat taacgctttg aactctgtgt tccgtatttt 3360 tgggaaaagc gatctgatcg tcagcgataa cggaccgccg tttaacagtc aagcatttgt 3420 ggactacgcg agacggatga agattgagct gaagaagagt ccggcgtatt cgcccgaaag 3480 taatggtctt gcggaaaggg gggtccagac agccaaaagc ggtttgaaga aactgatggc 3540 tgatccgaag tacggtagct gtaagattcc ggaactggtt gaggttttcc tgtttagtta 3600 ccggaacagt tactgccagg cgctagggtg ttcgcctgcg agcaaaatct tttcttttgt 3660 gccaaaaacc gatttggaca gcaatttgaa gccgaagcaa gaaaaggtta agaaaagggt 3720 tagatttgat ctgcgggtca aagagaaggt cattgataag cgggaaagtg tcaatgaaaa 3780 gccaagggtc aggaaaaacg acgattacgc tgtaggagat cttgtgtggt accgttgtga 3840 gtacaagacg atgagatctt ggttggaagc cgtgatcgtg ggtgttaaca gcaagaatac 3900 gtactacatt gaggtgtcag gtaataccaa gttggctagc aggaaccaac tgaagagaag 3960 agttgtggag aaaaacgatt atatttaccc aaaagttgag cccaatgtaa ttgtaacacc 4020 taagaagaga tacaagcgaa ggagagagaa gagttcacct ctcaagacac ctgaacgacc 4080 tgcgaagttg cgtaggagta agcgtgtcag gagaagacca gagagatacg aagtttttag 4140 attccacaaa atgtaatttt gaatttaagc ggggaag 4177 // ID Transib-6_AAe repbase; DNA; INV; 4107 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Transib DNA transposon family from Aedes aegypti. XX KW Transib; DNA transposon; Transposable Element; Transib-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4107 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1307-1307 (2011). XX DR [2] (Consensus) XX CC >99% identical to consensus. 5-bp TSDs. TIRs are ~620 bp long. XX FH Key Location/Qualifiers FT CDS join(1047..1068,1164..2770,2839..3053,3139..3307) FT /product="Transib-6_AAe_1p" FT /note="transposase." FT /translation="MAGTTNTVELDKRYIFNLWVDTKGSFEEKKRNVLTYV FT LEACAIDADVSPGVASKIEYKVELVCSYFKQRWCKHGRSKRNVLNRYFKYF FT EKPFKLTENILLTSRNIPSGRKSILFEESSNITKRKKTADIRNRFRSKELA FT YATQMKLREERLTDAAAIIREATQTTPSRATRIRDSWKQTENNPVYAYTDM FT EALALMVDSDLSVSQYNSIQSGAKKRNANIYPPYNRVLAAKKTCYPADEFI FT TISDTEIEINLQQLLNLTSERIVEVVEEKLNNFTDAELLDIEMTIKYGMDG FT STGNSEYNQVFENDDGSKTDSSIFMSALVPIKAAWKEKVLFQNPRPSSTRL FT CRPIHIQMATETAELAIRERDYLARQIEMLEPTKLVKNGRKICVRYNMLLT FT MVDGKICSALTETKAAARCYVCKATPKQMNNVDKISNLHVDESAFQFGLSI FT LHAWIRFLEYFLKVSYRLEVKKWAPRGANKEAMLKRKAMIQQRFKVELGLH FT VDKPRAGGSGTSNTGNVARRFFENAVAVAEITGLNLPAIQRCGTILQVMAS FT GRKINTDAFSSYCEETARLLIRLYEWYYLPASIHKILFHGSAIMEHFLVPI FT GQLSEEAQEAKNKEWKRKISRKSTNEDLIHWLLVSSDPVLAEARGWPKSKI FT KSLSDDAKLLLINLQDEEEEE*" XX SQ Sequence 4107 BP; 1350 A; 714 C; 788 G; 1253 T; 2 other; cacagtgatt ttttttgccc atttagtgat cgaaatagca tagcgcctaa accattcatt 60 ttagaaaaaa agtttgttca gaggagtttc ttgatttgta aaagcgcttc ttttggcact 120 agcggacagg tgattaaatc acctaaaagt gagataaaaa aawttttttt tacctgattg 180 agatagacga ttggtgtctt cagcaaagtt gtagtacttg taaattcaag acactttgtc 240 gaagacacca aatttctacc tcttatagat tacaagatac agcatgtttt ttcaaaatgg 300 cccccaaaaa tcaaaatttt aatataactt tttttcaata tttttcacat ttttcatgtc 360 ttctacaaag ttgtttgcct accaaatata cacgtttttg ccgaacattt caacttagta 420 tctctcatgc tgaaaaagtt atttcaaaat aatcgatatt ttttgggctt ttttctactg 480 acatcatgcg tttggacgca gaatgacgca cctaaccgtt tgcgattctt aaaggactat 540 cttttggctc tcaaatgata ctaaaaactt ttggccggga gtgatctttt gattgctacg 600 cacaacctag tgaaactata aaatkgcata tatttgcgat ttaaaaaccc ttcttctcgt 660 tttaatcaca tttttttcat atgaggcata tttaaacggt ttcatttgtt tgcaacacaa 720 ccattattcg tgtcactatc gtataggacg ttttcttcct tacaaaatca cgcaatatac 780 gacatagctt ataaaaaacc ctttcacatt atacaaattc gtgtgcggtg ttggaacaaa 840 atgaatcgtt aataatatgt gtgctgacgc ttctacagta aatgcacatg tcactactac 900 taatgcagaa aaagttgttg ttgatttgac attaaactgt gacagcgagc gccagataga 960 tataaatcag tggttcagtg ctattatcaa attctgttac tgctctgtta gtgttacaat 1020 cacaaactct gcgagaactg ttaacaatgg ctggtacaac aaatacaggt aggtcgttag 1080 gaagttagtt gtgcggttcc ttattaccat atgatttgtc attttaattt gtttggttgc 1140 ttcaattttc cggatccttt cagtggagct cgataaacga tatattttca atttgtgggt 1200 tgatactaaa ggcagttttg aggaaaaaaa gcggaatgtt ttgacatacg tactagaagc 1260 ttgtgctatt gatgctgatg tctctcctgg tgttgcttct aaaattgagt ataaagtcga 1320 acttgtttgc tcctacttca aacaacgttg gtgcaagcat gggcgctcaa agaggaatgt 1380 tttgaaccgt tatttcaaat actttgaaaa gccattcaaa cttactgaaa acattttgtt 1440 gacgtcgaga aatataccat ctggacgtaa atcgattctt tttgaggaaa gcagcaacat 1500 aacgaagaga aagaaaacag ctgacatcag gaatcgcttc cgatcgaagg agctagcata 1560 cgctacgcaa atgaagcttc gggaagaaag gttgacagat gcagcagcaa ttataagaga 1620 agctactcaa actacacctt ccagagcaac caggataagg gattcgtgga agcaaactga 1680 gaacaatcca gtttacgctt atacggatat ggaagctctt gctctgatgg tagacagtga 1740 tttatccgtc tcgcaatata actccattca atcaggagcg aagaaaagga acgcaaacat 1800 ttatcctccg tataaccgag tactggctgc taaaaagacc tgttatcctg cagatgaatt 1860 cattacgatt tcagacacag agattgaaat caatttgcag caactactca atctaacttc 1920 cgaacgaatt gttgaagtgg tagaagagaa attgaacaac ttcactgatg ccgaactatt 1980 ggacattgag atgacaatca aatatggaat ggatggcagc actggtaatt cggagtataa 2040 tcaggttttt gaaaatgatg atggttcgaa gaccgattca agcattttta tgtcggcact 2100 tgtaccgatc aaagctgcat ggaaagaaaa ggtccttttt caaaatcctc gtccgtcgtc 2160 cacacgtctt tgtagaccaa ttcacattca gatggcaact gaaacagctg aacttgctat 2220 tcgcgagaga gattatctag ctcgtcagat tgagatgctt gaaccgacga agctggtgaa 2280 gaatggccgt aagatatgtg tgaggtacaa catgttgttg accatggtag atggtaaaat 2340 ttgttctgca ttaacagaaa ctaaagcagc agcgagatgt tatgtctgca aagcaacgcc 2400 aaagcagatg aataatgttg acaaaatttc aaatcttcat gttgacgaaa gcgcatttca 2460 atttggccta agtatcctcc acgcttggat aagatttttg gagtactttt taaaagtatc 2520 ctatcggcta gaagtgaaga agtgggcacc tcgaggtgca aataaggaag caatgcttaa 2580 gcggaaggct atgatccaac agcgtttcaa ggtggaatta ggactgcatg tggataaacc 2640 acgcgcagga ggaagcggaa catccaatac aggaaacgtg gccagacgtt tttttgaaaa 2700 tgccgtagct gtggctgaaa tcactggcct taatcttcct gccattcaac gatgcggtac 2760 catattacag gtaattatgg ttaatatcat taatgagtaa tatttacaat ttttgtgata 2820 ttcttgctcg tatgataggt aatggcatca ggacgaaaaa ttaataccga tgcattcagt 2880 agttactgtg aagaaacagc ccggttattg attcggttgt atgagtggta ctaccttcca 2940 gcaagtattc ataaaatact ttttcatgga tctgcaatca tggaacactt cttagttccg 3000 attggacagc tttcggaaga agcgcaggaa gctaagaaca aagagtggaa gaggtatata 3060 ttaaaaaaat attgattttt tcagttttaa ttctacaaaa aaaacatctt gaaatagata 3120 tcgggaattc cataccagaa aaatatcgcg gaaaagtaca aatgaggacc ttattcattg 3180 gttgctagtt tcatcggacc cggttctagc agaagctcga ggatggccaa aaagcaaaat 3240 taaatcttta tctgacgatg ctaaactact acttatcaac ttgcaagacg aagaagaaga 3300 ggaataatac aatctggttg agcataatta caaaataaat ctgttttgtt gatattttac 3360 taattttatg tatcaaatgc atagtttgaa atttataact caatttttac atcattatcc 3420 attaaaatca ttgtttatat atcactactt cttgcaaaga cttaaaaaac cacaaataaa 3480 tgcattttta tagtttcact aggttgtgca tagcaatcaa aaggtcactc ccggccaaaa 3540 gtttttagta tcatttgaga gccaaaagat agtccattca gaatcgcaaa cggttaggtg 3600 cgtcattctg cgtccaaacg tatgatgtca gtagaaaaaa gcccaaaaaa tatcgattat 3660 tttgaaataa ctttttcagc atgagagata ctaagttgaa atgttcggca aaaacgtgta 3720 tttttggtag gcaaacaact ttgtagaaga catgaaaaat gtgaaaaata ttggaaaaaa 3780 gttatattaa aattttgatt tttgggggcc attttgaaaa aacatgctgt atcttgtaat 3840 ctataagagg tagaaatttg gtgtcttcga caaagtgtct tgaatttaca agtacaacaa 3900 ctttgctgaa gacaccaatc gtctatctca atcaggtaaa aaaaattttt ttatctcact 3960 tttaggtgat ttaatcacct gtccgctagt gccaaaagaa gcgcttttac aaatcaagaa 4020 actcctctga acaaactttt tttctaaaat gaatggttta ggcgctatgc tatttcgatc 4080 actaaatcgg caaaaaaaat cactgtg 4107 // ID Gypsy-24_CQ-LTR repbase; DNA; INV; 155 BP. XX AC AAWU01011196; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_CQ_; KW Gypsy-24_CQ-I; Gypsy-24_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-155 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 428-428 (2011). XX DR GenBank; AAWU01011196; Positions 15775 15929. XX SQ Sequence 155 BP; 40 A; 43 C; 25 G; 47 T; 0 other; tgtggtgtac agccctgctc gaacgttacg tcaaacagtt gacgtaacgt cttagagttt 60 tcatcagaac attcttgaat aaaatcggtc tctttcgact gcgactacca gcacaacaac 120 acctcttttg ctctcactcc gatctcatct ttaca 155 // ID Gypsy-25_IS-LTR repbase; DNA; INV; 127 BP. XX AC ABJB011009743; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_IS_; KW Gypsy-25_IS-I; Gypsy-25_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-127 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB011009743; Positions 6384 6510. XX SQ Sequence 127 BP; 29 A; 26 C; 38 G; 34 T; 0 other; tgtgtgagat agttcatcgc tcgtgtcggc tttgccgaca cgagggaaga ggaagaagtg 60 tattaaagag cgggcagttg cgtgcctggt tccatgttga aaggtcgccc ttttcatact 120 tccaaca 127 // ID BEL-35_AA-I repbase; DNA; INV; 6251 BP. XX AC supercont1.126; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-35_AA_; KW BEL-35_AA-LTR; BEL-35_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6251 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.126; Positions 1389829 1383579. XX CC 'TCCTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 45..6251 FT /product="BEL-35_AA-I_1p" FT /translation="MAEKGNDSQAQRNCAACTQPDEVCDMVSCDKCKLWYH FT FTCVQVDATVKHRRWYCVTCEPLFRMNPNCHDEQQQSKDNSNPIDAVESEV FT KEPRVKATKTITGKQTGATAKASSQRDLALRPGTRITRSKAGSKGTKVSKK FT TAADRSVSSSIRAQLALELEVLEEKQRVEEEELANDREMMEQQLQQEQVMR FT QKELDMEARKLAEEKAFMQRKLEEEREFRNRQLALKRHSIEEKAKLIRQMS FT ECSSRTSTDVSESASKEKVSEWLLRVDQQTEGVLQSSTSVNKTGTVLQHKS FT YDNTAKTQLGAKQIVDKQLGFSLADISETNPVPYFDRLNVGENKKSYANIS FT PGNAMQNVTGHVPGSKEGIRNQTNDRLESASVIINHPQTRGVFGLQSREGY FT TDQKVRPSGQEEFNSGGQRVPEFVHGNDEGPTNRQIAARQVMGKDLPLFSG FT NPEEWPIWISNFQRSTATCGFSLDENLIRLQRSLRSAALDAVRSRLLCPSS FT VPHVIKTLEMRYGRPETLIRVMTERIKHLPPPRMNDLDSIIEFGLTVDNLV FT EHLRNAGQQAHLSNPSLLHDLVIKLPVDYRLKWSAYKSSTYNADLGVFGRF FT MSSLVELAYEVADEFPTQRNDKVQKPKAKERMFVQTHAEYNDSKSKPVPAE FT KASRKPCPICKLDGHKTPDCIRFKGMSIDNRWKVVDQQGLCRTCLNQHGKW FT PCRTWKGCGISECTSRHHTLLHSFSPTVRAAVSTSHLSGSQENHGPMFRIL FT PVTLFGEKCQVNIYAFVDEGSEITLLEDSIAEQLGLSGQNEALNLQWTGDI FT KRNESNSRRVNAEVAGFGLCKKFKLINARTVSSLSLPAQTMKYQQLANIYP FT HLQGLPLQDYEQASPKLLIGLDNLKLTVPLKIREGGWKEPIAAKCRLGWSL FT YGGGSSTSKAVVCGFHVGGWTNSEQELTQLVRDYITLDNVVVSRPFTPLES FT DDEKRARMLLESTTQRVEGGFETGLLWRRNNVRFPESFGMAYNRLRSLERK FT FDKTPGLFERVRQQIQEYELNGYAYKATAEELSGTSSGRYWYLPLGIVINP FT KKPNKLRMIWDAAATVDGISLNSELLKGPDFLTSLPTVIGKFRLYQYALAG FT DIKEMFHRFSIRPEDRQYLRFLFRDHPSQEPVVYVMDVAIFGATCSPSCAQ FT YIKNLNAAEFESDFPRAVNAIIYRHYVDDYLDSFGTQEEAIKVGSEVKQIH FT AAGGFEIRNFLSNDPVIAMKVGDKSPAEERDIRTEKGDGDAGRAESVLGMR FT WIPNNDVFTYTVSLRDNLKHVLKESHIPTKREVLRTVMSFFDPMGLITFFL FT IHGRVLMQDIWAAGIDWDNEINERMRCQWRRWIELVPQLTDLQVPRCYFPN FT AIEQTYSSLQVHVFVDASKSAYACTVYFRVETAEGPLVSLVAARAKVAPLK FT MLTIPRLELQAAVLGARLLDSVITMHNLPVQKRVLWSDSSTVLAWIRSDQR FT RYHQYVGFRVGEILTTTDVNEWRKIDSALNVTDDATKWGSGTVIASDSRWF FT CGPDFLRQPEECWPGNEMTTHTTEEELVSCNIHCLTPASLVDVTRFSRWER FT LLRTIAYVLRFLDNCRCLQRSRTTHHVILHQRELERAETLLWKQAQEEYFG FT SEVAILEASKGGPENRHKLLQKSSVLYKLWPFMDEWGVVRKRNRLENAECI FT QHGTKYPVILPRQHRITFLIVDSYHRRFRHGNRETIVNEIRQVYEIPKLRS FT LVAKVAKDCTWCKVFYASPCTPPMAPLPKVRVTPYVRPFTYVGLDYFGPIL FT IRCGRSVVKRWVALFTCLTIRAVHLEVAHSLSTESCVMAVRRFVARRGAPL FT EIFSDNGTNFHGANNQLRQELAERSKHLATVFTNTQTRWTFNPPSAPHMGG FT AWERMVRSVKAAIGTILEDKRRPTDEILETVIVEAESMINTRPLTYVPLES FT ADQEALTPNHFIFGNSNGAKQPPIEPVDYRTTLRSGWKQAQHLSDAIWSRW FT IKEYLPVISRRSKWFEEVKELEEGDLVLVINGSVRNQWTRGRIEKVIPGRD FT GRVRQALVRTRTGIMRRPSVKLALLDVLDERKPTVREGLRAGE" XX SQ Sequence 6251 BP; 1884 A; 1285 C; 1577 G; 1505 T; 0 other; aatttcttta gaaatttttc acgtggcatt tggaagatcc aaacatggcg gagaaaggta 60 atgattcaca agctcagagg aattgcgcag cttgtacgca acccgacgaa gtgtgcgaca 120 tggtatcttg cgacaagtgc aaactttggt accacttcac ttgcgtacaa gtcgatgcta 180 ccgtcaaaca tcggagatgg tattgtgtga cgtgtgaacc cctttttcga atgaacccga 240 actgtcacga tgaacaacaa caatcgaaag acaattccaa tccgatcgat gcagttgaaa 300 gtgaagttaa agaacctcga gtaaaagcta ccaaaacaat aaccggaaag caaaccggag 360 ccacggccaa agcttctagt caacgagatt tggccctcag gccaggcacg agaatcacca 420 ggtctaaggc gggaagtaaa ggtaccaagg tgtcgaagaa gactgcagct gatcgtagcg 480 ttagttccag cataagagcc cagttagcac tagagctgga ggtattagag gaaaaacaac 540 gagtcgaaga agaggaatta gcaaacgacc gagaaatgat ggaacaacaa ttacaacagg 600 agcaggtaat gcgacagaag gagcttgata tggaagcgcg caagcttgcg gaggagaagg 660 cgtttatgca acgtaaactt gaagaggaga gagagtttcg gaatcggcaa ctggcgctaa 720 aacgacattc catagaagag aaggcaaaac ttattcgaca gatgtcggag tgtagcagca 780 ggacaagtac ggacgtaagt gaatccgcat ccaaggaaaa agtatcggaa tggttactac 840 gagttgatca gcagacagag ggagttctac aatcatccac gtcagtgaac aaaacaggta 900 ccgttctaca acataaatct tacgataaca ctgcaaaaac acagttaggt gcgaagcaaa 960 tagtagataa gcagttaggt ttctcgcttg ctgacatatc cgagacaaat ccggtaccat 1020 atttcgatag gttgaatgtt ggcgaaaata agaaaagcta tgcaaacatt tctccaggca 1080 atgctatgca aaacgttact gggcatgtcc cggggtctaa agaggggatc agaaatcaaa 1140 ctaacgatcg attagagtca gcctctgtaa taataaatca tccacaaaca cggggggtat 1200 ttgggctcca atcacgagag gggtataccg accaaaaggt tagaccgtct ggacaagaag 1260 agttcaacag cggtggtcaa cgagttcctg agtttgttca tggcaacgat gaaggtccaa 1320 cgaatcggca aatagcagcg cgacaagtga tgggaaaaga tctcccattg ttttccggca 1380 acccagaaga gtggccgatc tggatcagta attttcaacg gtcgacagca acgtgtggct 1440 tctctctgga tgaaaatttg attcgcttac agcgtagttt gagaagcgca gctttagacg 1500 cagtccgcag tcgattgctt tgcccgtcaa gcgttcctca tgttattaaa acgttggaga 1560 tgcgttatgg tcgtccggag acattgatcc gtgttatgac ggagcgcatt aagcatttac 1620 ctcctccgag gatgaacgat ctggacagca tcatcgagtt tggattaacg gtagataacc 1680 tggtagaaca tttgagaaat gcagggcagc aagcgcattt atcaaatcca tcccttttac 1740 acgatttggt catcaagtta cctgtagact atcgtctgaa atggtcagca tataagagtt 1800 caacatacaa cgcggatcta ggagttttcg gtcgcttcat gtcgagcttg gttgaattag 1860 cgtatgaggt ggcggacgag tttccaactc aacgtaacga taaagtgcag aagccaaaag 1920 cgaaagaacg tatgtttgtg cagactcacg cagagtacaa cgattcaaaa tcgaaacctg 1980 tacccgctga aaaagcaagt aggaagcctt gtcctatttg caaattggat ggacacaaaa 2040 caccagattg catcaggttc aaagggatga gcatcgacaa tcgatggaag gtggtagatc 2100 agcagggatt atgcaggacg tgccttaacc aacacgggaa gtggccttgc agaacctgga 2160 aaggatgtgg aatcagtgaa tgcacttcga gacatcacac tttgttgcac tccttcagtc 2220 ccacagtacg tgctgctgtt tccacgagtc atttaagtgg atcgcaagag aaccatggcc 2280 caatgtttag gatcttacca gtgacgttgt ttggtgaaaa gtgccaagtg aatatctacg 2340 ctttcgtcga cgaaggatcc gagataacgc ttctcgaaga ttcaattgcg gagcagttag 2400 gactatcggg tcagaatgaa gctttaaatc tacaatggac aggagatata aagcgtaacg 2460 agtcgaattc acgccgagtc aatgcagaag ttgcgggctt tgggttgtgt aagaaattca 2520 agttgatcaa tgcccgcaca gtgagcagtt tatcgctacc agcgcaaacg atgaaatacc 2580 agcaattggc caatatctac cctcatttgc aagggttacc tctacaagac tacgagcaag 2640 cttctccaaa gctcttgata ggtcttgaca acctgaaact tactgttccc ctgaaaatcc 2700 gcgaaggagg ctggaaagaa ccaatcgctg caaaatgccg ccttgggtgg agcctctacg 2760 gaggtggttc ttcgacatcg aaggcggtag tatgtgggtt tcacgttgga ggatggacca 2820 attcagagca ggagctaact cagttagtac gcgactatat cacgctagat aatgtagtcg 2880 tatcgcgccc gttcacaccg ctggaatcgg acgacgagaa acgagctaga atgttgctgg 2940 agtctactac tcaaagagtc gaaggaggct tcgaaacggg attgctgtgg agaaggaata 3000 acgttagatt tccggagagc ttcggaatgg catataaccg tttgcgttcc ctggagagaa 3060 aatttgataa aacaccaggt cttttcgaaa gagttcgtca gcagattcag gaatacgaac 3120 tgaatggata tgcgtataag gcaaccgcgg aagagttgtc tggtaccagt tctggtcggt 3180 attggtactt acccttgggg attgttataa atccgaagaa gcccaacaaa ttaagaatga 3240 tttgggacgc agctgctact gttgatggaa tatctctgaa ttcagaattg ttgaaaggac 3300 cagatttttt aacaagcctt ccgacagtta ttggtaaatt taggctatat caatacgctc 3360 tagccggaga cataaaagag atgtttcatc gcttttccat tcgtcctgaa gaccggcagt 3420 atctcaggtt tctattcaga gaccatccca gccaagaacc agtggtttat gttatggatg 3480 tagccatttt cggggcaact tgttctccca gttgtgctca gtacataaag aatctgaatg 3540 ctgcagaatt cgagtcggac tttccaagag cagtgaacgc cattatctac cgccattatg 3600 tagacgacta tttggacagt tttggaacac aagaagaagc catcaaggtc gggagtgaag 3660 tgaagcagat tcatgcagcc ggaggttttg aaatccgaaa ttttctatcc aacgatccgg 3720 tcattgcgat gaaggtcggt gataagtcgc cagctgaaga gagagatatt cgaactgaaa 3780 aaggtgatgg agatgctgga cgtgctgagt cagtattagg tatgcggtgg atacctaaca 3840 acgatgtctt cacctacacc gtgtccttgc gagataattt gaagcacgtt ttgaaggagt 3900 cacacatccc cacgaagaga gaagtgctac ggacggttat gagtttcttc gatccaatgg 3960 gacttataac cttctttttg atacacggga gagtcttgat gcaagatatt tgggccgcag 4020 gcatcgattg ggataatgaa atcaacgaaa gaatgaggtg tcagtggaga aggtggatcg 4080 aactcgttcc acagttgacc gatctacaag tgccacggtg ctattttccg aatgcgattg 4140 agcaaacgta ctcctcctta caagtccatg tatttgtaga tgccagcaaa tctgcatatg 4200 cgtgtactgt gtacttccgg gttgaaacgg cagaaggtcc gttagtgagt ttggtagctg 4260 cccgagcgaa agtagcaccc ctcaaaatgt taactatacc aagattagaa ctccaagcgg 4320 ctgtattagg cgctcggctt ctcgacagtg taatcacaat gcacaatctt ccagttcaaa 4380 aacgagtact ctggtcagat tcaagcactg tacttgcatg gatacggtcg gatcaacgta 4440 gatatcacca atatgttgga tttcgagttg gggaaatttt aacaacgact gacgtcaacg 4500 aatggaggaa aattgattct gccttaaatg taacagacga tgctaccaaa tggggctccg 4560 gaacagtcat cgcttcagac agtcgctggt tctgcggacc ggatttcttg agacaaccag 4620 aagaatgttg gcctggtaac gaaatgacaa cgcatactac agaagaagaa cttgtttctt 4680 gtaatataca ctgtttaacg cctgcttctt tggtcgatgt cacgagattt agtcgatggg 4740 aaagactcct tagaactata gcatacgttc tgagatttct ggacaattgt cgctgtttgc 4800 aacgaagcag aacaacgcat catgttattc tgcatcagag agagctggaa cgtgcagaga 4860 cattgttatg gaaacaagcg caagaagagt atttcggttc agaagttgcg atactggaag 4920 cttccaaagg tggacctgaa aatcgtcaca aactgcttca aaaatctagt gtattataca 4980 agctttggcc ttttatggat gaatggggag tagtacggaa acgcaatcgg ctggagaatg 5040 ctgaatgcat tcagcatggc accaaatacc cagtcatctt accacgtcaa catcgtatca 5100 cctttctcat tgttgattca taccatcgcc gttttcgaca cggtaataga gaaaccatcg 5160 tgaacgaaat aaggcaagtc tacgaaatac cgaagttaag atccttggtg gcaaaagttg 5220 ctaaagattg tacttggtgt aaagtattct acgcttctcc ttgcactccg cctatggcac 5280 ctttacctaa agtacgagta acaccttatg tacgaccgtt cacgtacgta ggtctcgact 5340 actttgggcc aatcctcata agatgcggca gaagcgttgt taaaaggtgg gtagcattat 5400 tcacatgctt aaccattcgt gcggttcatt tagaggtcgc gcattcgctt tcaacggaat 5460 cctgtgttat ggcggtgaga aggttcgttg ctcgccgtgg agcaccactg gaaatattca 5520 gtgataacgg aactaatttc catggggcaa acaatcaact gcgacaagaa ttggccgagc 5580 ggagcaagca tctggctaca gtattcacaa atactcaaac gagatggacc ttcaatcccc 5640 cgagcgctcc tcacatggga ggagcttggg aaaggatggt tcgctcagtt aaagcggcta 5700 ttggaactat attggaagac aagcgcagac caaccgatga gattttggaa acagttattg 5760 tggaagctga gtcaatgatt aacacccgcc cactgacgta cgttccgttg gagtctgccg 5820 accaggaagc attgacacct aaccatttca tttttggcaa ttccaacgga gcgaagcaac 5880 ccccaataga accagtggac tatcgtacaa cattgcgcag cggatggaaa caagcacagc 5940 atctatcaga tgcaatatgg agccgatgga ttaaggaata tttgccagta atttcaagga 6000 gatccaaatg gttcgaggaa gtgaaggagt tagaagaagg agatttggtt ctggtgataa 6060 acggttcagt gcggaatcaa tggaccagag gaagaattga aaaggtaata ccaggacggg 6120 atggtcgagt tcgacaagct ctggtacgta ccaggaccgg tataatgagg agaccatcag 6180 tgaaactggc attgcttgat gttttggatg aacgtaaacc aactgttagg gaaggtttac 6240 gggcggggga g 6251 // ID R2-1_DWi repbase; DNA; INV; 3550 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE willistoni. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2-1_DWi. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3550 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. D. willistoni contains two subfamilies of R2. XX FH Key Location/Qualifiers FT CDS 150..3314 FT /product="R2-1_DWi_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="FERRSNSWGYRPLEPRSVGTESNNNSPRSNITITSAT FT SRPGDQPREAIAVVNLAGEIPCAVCGRLFNTRRGLGVHMSHQHKDELDTQR FT QREDVKLRWSEEEAWMMARKEVELEASGNLRFPNKKLAEVFTHRSSEAIKC FT FRKRGEYKAKLEQIRGQSTPTPEALDSITSQPRPSLLERNHQVSSSEAQPI FT NPSEEQSNWEIMRILQGYRPVECSPRWRAQVLQTIVDRAQAVGKETTLQCL FT SNYLLEVFPLPNEPHTIGRSNLRRPRTRRQLRQQEYAQVQRRWDKNTGRCI FT KSLLDGTDESVMPNQEIMEPYWKQVMTNPSTCSCDNTRFRMEHSLETVWSA FT ITPRDLRENKLKLSSAPGPDGITPRTARSVPLGIMLRIMNLILWCGKIPFS FT TRLARTIFIPKTVTANRPQDFRPITVPSVLVRQLNAVLASRLASKVNWDPR FT QRGFLPTDGCADNATLVDLILREHHKRWKSCYLATVDVSKAFDSVSHQAII FT KTLQAYGAPTNFVSFIEEQYKGGGTSLNGAGWSSEVFIPARGVKQGDPLSP FT LLFNLIIDRLLRSYPREIGAKVGNTMTSAAAFADDLVLFAETPMGLQTLLD FT TTVGFLASVGLSLNADKCFTVSIKGQAKQKCTVVERRSFCVGERECPSLKR FT TEEWKYLGIRFTADGRARYSPADDLGPKLLRLTRAPLKPQQKLFALRTVLI FT PQLYHQLTLGSVMIGVLRKCDRLVRQFVRRWLDLPLDVPVAYFHAPHTCGG FT LGIPSIRWIAPMLRLKRLSNIKWPHLEQSEVASSFIDDELQRARDRLKAEN FT VQLCSRPEIDSYFANRLYMSVDGCGLREAGHYGPQHGWVSQPTRLLTGKEY FT LHGVKLRINALPSKSRTTRGRHELERRCRAGCDAPETTNHILQKCYRTHGR FT RVARHNSVVNAVKRGLERKGCVVHVEPSLQCDSGLNKPDLVGIRQNHIYVI FT DVQVVTDGHSLDQAHQRKVERYDRADIRSQMRRFFGATGEIEFHSVTLNWR FT GIWSGQSVKRLIAKDLLIAEDTKLISVRAVNGGVTSFKYFMYCAGYTRS" XX SQ Sequence 3550 BP; 987 A; 800 C; 917 G; 846 T; 0 other; gaagctgggt cggatgagcg cagaaggggt gttctttgga acactgtaat tcataagtcg 60 taagtctgat caagtcgact cgaaacctcc tcgtggtgtt tcctgggtgc tgttgagttc 120 ctagtctcta ggttcttttc agtagctaat tcgagcggcg aagcaactct tggggttacc 180 ggccccttga gccaagaagc gttggtacag aatcaaataa taatagtcct cggagcaata 240 tcactatcac ttcagcgact tcacgtcctg gagaccaacc gagagaggct atagcagtgg 300 taaatctcgc gggagagatt ccctgtgcag tatgcgggcg cctcttcaat actagaaggg 360 ggctcggtgt acacatgtca catcaacaca aagacgaact agatacgcaa cgtcagcgtg 420 aagatgtaaa actccgatgg agcgaggaag aagcgtggat gatggcgaga aaggaggtgg 480 agctcgaagc aagtggtaat ttgagatttc ctaataagaa gctagcggaa gtatttactc 540 accgtagctc cgaagcaatt aaatgttttc ggaagagggg tgaatataag gcaaaactgg 600 agcagatcag agggcaatct actcccaccc cagaagcgtt ggactctatt acctcacagc 660 ctcgccctag tttactcgag cgaaaccacc aagtatcatc gtcggaagcg caaccaatca 720 atccatcaga agaacagtcg aactgggaaa tcatgcggat actacagggc tatcgccccg 780 tagaatgtag tccccggtgg agagcccagg tcttgcaaac tatcgtagat agggcgcagg 840 ccgtagggaa ggaaaccact ctccaatgct tatccaacta tctcctggaa gtatttccat 900 taccaaacga accacacacc atcggtcgga gcaatttgcg aagacctcga actaggagac 960 agttaagaca acaagagtac gcacaggttc agcgtcgttg ggataagaat actgggagat 1020 gcattaaatc cttgcttgat ggaacagatg agtcggttat gccaaaccaa gagataatgg 1080 aaccctattg gaaacaagta atgacgaatc ccagcacatg ctcttgcgat aacacaagat 1140 tccgtatgga acattcgctt gagacggttt ggtcagcgat aacgccacgc gacctgaggg 1200 aaaataagtt aaagttgtca agtgctccgg gtcctgacgg tatcactcca agaacagcca 1260 ggagtgtacc cttaggcatt atgctacgca taatgaacct gattctctgg tgcggcaaaa 1320 taccattctc tacccgactg gccagaacta tcttcattcc gaagactgtg acggcaaatc 1380 gaccgcaaga ctttcgtcca ataacagtcc cctcggtttt ggtcaggcaa ttaaacgctg 1440 ttctggcttc tcgattggct tctaaagtca actgggatcc aaggcagcgc ggtttcctac 1500 ctaccgatgg gtgtgctgat aatgcgacgt tggttgatct cattttgcgg gagcaccata 1560 aacggtggaa gtcatgttac cttgcgacgg tggatgtcag caaggctttt gactcagtat 1620 cacaccaggc cattatcaag actttacagg cctatggtgc tccaacaaac tttgtcagct 1680 tcatagaaga acagtataag ggcggcggaa cctccctcaa tggggcagga tggagttcag 1740 aggtgtttat acccgcgcgg ggcgttaagc aaggtgaccc tctgtctcca ctattattta 1800 atcttatcat tgatagatta cttaggtcct accccagaga gattggtgcc aaagtcggaa 1860 ataccatgac aagcgcggca gcgttcgcgg atgatctggt gctatttgcg gaaactccga 1920 tggggcttca aacattgttg gataccacgg taggcttcct agcctccgtg ggactctccc 1980 ttaatgctga taagtgcttc actgtcagta taaaggggca agccaagcag aagtgtactg 2040 tcgtagaacg acggagcttt tgtgtaggtg agcgcgagtg tccttcattg aagcgtactg 2100 aagagtggaa gtatttaggt atccggttca ctgcggatgg gcgggctcgg tatagtccag 2160 cagacgacct cggtccgaag ctgttaagat taacaagagc ccctctgaaa ccacaacaga 2220 agttatttgc acttaggact gtccttatcc cacaactcta tcaccaacta acacttggga 2280 gtgtgatgat aggcgtccta agaaaatgtg acagattggt acggcaattc gtaaggagat 2340 ggttagatct cccactggat gtaccagttg cttactttca cgccccccac acttgtgggg 2400 gtctcgggat tccgtcaatt agatggatag caccgatgct gcgtctgaag cgattgagca 2460 atattaaatg gccccacctc gaacaatccg aggtagctag ctctttcatt gacgacgaat 2520 tgcaaagggc tcgagataga ttaaaggcgg aaaatgtgca gctgtgttcg cgtccagaga 2580 ttgactcgta tttcgcaaat agattgtaca tgtctgttga tggttgcggt ctccgtgaag 2640 caggtcatta tggcccgcaa catggatggg tgagtcagcc cacgcgcttg ctaacaggaa 2700 aggaatattt gcacggtgtc aaattgcgga taaatgccct accctcgaag tctcgtacga 2760 cgaggggaag gcacgaattg gagagacggt gtcgtgcagg atgtgatgct cccgagacaa 2820 caaaccacat cttgcaaaaa tgctatcgta cgcatgggag gcgggtagct agacacaaca 2880 gcgtagtaaa tgccgtcaag cggggacttg aacggaaagg ctgcgttgtc catgtcgaac 2940 caagtctgca atgcgactcg ggcttaaata aaccggacct ggtgggaatc cgacagaatc 3000 acatttatgt gatagacgtt caggttgtga cagacggaca ttccttagac caagcgcacc 3060 agcgcaaggt cgaaaggtac gacagagctg acataagatc acaaatgcgg cgatttttcg 3120 gagcgacagg tgaaatcgag tttcattccg ttacactcaa ctggagagga atctggagtg 3180 gtcagtcggt aaaacgattg attgcaaagg atctcctcat cgctgaagat accaaactca 3240 tcagcgtcag agcagtaaat ggcggagtga cgtccttcaa atatttcatg tattgtgctg 3300 ggtatactcg aagctagatg tactaacctc tagcttttct tatacttttg cctgctacct 3360 tggcattaca tctaaaaagg tacaaacatc gcattgtcat aaagaggtgg ttttagtacg 3420 taggcgctgt gggacttcat tgtcccggtg atgcagtgaa tcgtgcatac gagattgtcc 3480 agtagttggt tgctcgtatc tttagaagat ttccttcctc ggcgatcaaa aaaaaaaaaa 3540 aaaaaaaaaa 3550 // ID Copia-4_DGri-LTR repbase; DNA; INV; 168 BP. XX AC scaffold_5173; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_DGri_; KW Copia-4_DGri-I; Copia-4_DGri-LTR. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-168 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_5173; Positions 1311 1144. XX SQ Sequence 168 BP; 62 A; 34 C; 19 G; 53 T; 0 other; tgttgaaaat atgtaaaaga tcaatacaac attccgatta ctaatgttaa taacagaaca 60 tatcgatact gcgctcgata ccttaattta agattacatt gaataaattc ttcttagtct 120 aaccgccaaa cctagtgcat cactttttgc cgctttaaaa atccaaca 168 // ID DNA8-12_CQ repbase; DNA; INV; 1692 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-12_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1692 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 89-89 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% identity. CC 8-bp TSD. ~110-bp TIRs. XX SQ Sequence 1692 BP; 609 A; 270 C; 269 G; 544 T; 0 other; caaggggttt ccagcatcaa ggattactga ccagtctaaa tggaattacc aggattactg 60 gcaatatgtc atgaattact ttgatttccc agaattactt gggatttcca gaaaccacgc 120 agaattgctc aaatttataa aaaataactt ggaattactg cctagcttcc agcatattga 180 gaaaagtctt tggaattgga tcgaatgaca gctgtcaaac gcacaaaacc acgtgcttgg 240 caatagtacg tccatcccgg gattcccggc acaaaattcc cgggattttt atatgttgta 300 cggggaattc ccggaattaa acaaaaatct taatttggcc tggaaattac attagtatta 360 caattaataa gtttaaactt ttctaaaatt ttacataaat tggtaccatt ttatttgttg 420 tcatttttgt tttttagaca tcttaaacca atttttaagc tgttttagtc atgtcatata 480 gaaataaata aaaaataaat atttataaga tacttattta atttgaatta gaaatgtggt 540 gcatataaaa ttcaaagcac aacagagaac agcaaaaatt tgtttaagct atcttaaaac 600 tgctatcctt gtttagattg atattattag tattgagcta attctataaa tttaaagctt 660 gaaaaacata acaaatataa agaaaacata gattttatga ctttttcaaa aacttcccgg 720 gaataaataa atatttttcc cgttttccgg gaaactcaaa acctgggaaa attggacgtc 780 ctgcttggaa atcatcgaac aattgggtaa atatcaacat cagtttgaca actgattttg 840 gcgtttgagt gcatttaaaa ttcaaagcac aacaaagaac aaaaaaatct ttaaaactat 900 ctaaaagtgc tatcctatta atagtattga gctaaaagta tgattttaaa aacataacaa 960 atataaagaa aacatagatt ttatgacgtt ttcaaaaatt tcccgggaat aaaaaatata 1020 ttttttccgt tttccgggaa acttggacgc cctacttgga aatcgtcgaa taatggggta 1080 aatatcaaca tcagtttgac aactgatttt gacgtttgag ttgtttttca tccgaataca 1140 tttgaattag agagcatcca aatttgcgga cattccgaat ccgctctgta cttcttacaa 1200 ccataaaagg catcaaggaa aagcaaaatg cattctgccg atttacttga attgatggga 1260 aatacacgaa ttactgagta actatggaat tactcgaatt actacggaat tactaggtaa 1320 ttccagaatt acaggaatta ctgcagaatt gcggagtaat tctggaatta ctgaattact 1380 tactcaaaat ccgaatgatc ccgtactagc tataactttg gattcttaca tgaaaattaa 1440 taggcaataa atagaatcat aaaaggttta tcgaaattat ctttttaaca gttatttttc 1500 aatattttat tattttctga agtatgtttt cacaaacaag aatcatggaa ttaccaggat 1560 tactgggatt attaggaatt accaggatta ctctggaatt actcagtaaa tccggaatta 1620 cagaattact tgctcaaatt ccaagtaatt ccggattagt gaccagtctg acacgatgct 1680 ggaaacccct tg 1692 // ID BEL-144_AA-LTR repbase; DNA; INV; 276 BP. XX AC AAGE02017327; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-144_AA_; KW BEL-144_AA-I; BEL-144_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-276 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017327; Positions 31218 31493. XX SQ Sequence 276 BP; 90 A; 69 C; 41 G; 76 T; 0 other; tgttaagaat aagcaaaaat tatgaaccac cctaatgtaa aacagtatct attgtctctc 60 ttatcaccta tccatctgat agcgtgagcg aaaccatacc aataggatct ctctcgtgtc 120 ctccatctgg tcctgagcgt attgtataac tagtatagaa tagccagtcc agtcagtcga 180 ataaaaaccg tcaaacgaag aacagtgttt ctcatcgcta accaaaccct aattctcata 240 tccgataccg agccaatcta gttcacattt cgtaca 276 // ID I-1_CQ repbase; DNA; INV; 3415 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE I-type non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3415 RA Jurka J.; RT "I non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 106-106 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 171..3299 FT /product="I-1_CQ_1p" FT /translation="MNDGSHTYHRGTTNSCIDISLCSQSLINTLTWCIHTD FT THGSDHNPIEIHACGLSPSTTRRRRWRYEDANWNDYELEIERRLEADREYT FT ISEITQSILKAAEASIPRTSGKPGRRSVYWWNKEVEDAIKTRRKALRKLKK FT LSNESPLRDDALNMFRIARNHARKVMEESKKKSWEEFLDGISNQTSCTEMW FT RRVNALSGKRRTRGLSMEVGGSLTDNPALIAQEIGQYFQKLSATEGYSSKF FT QRLKSTAEENPVSFQIDASNENEDYNKCFSMDELLRALDNAHSKSSGPDDV FT GYPMLKHLPYLAKKSLLNAINKLWLSGSFPEEWRKSIVIPIPKKGQNASSP FT TGYRPISLTSCIAKVAERMVNRRLTTVLEENNLLIDQQHAFRKGRGSGTYF FT ARLGDILDDARTQGHHIDLAALDISKAYNQTXRTGVLRQLQDWGFNGSLAM FT FLKXXLSDRTFQVAIGNTLSDLFVEENGVPQGSVLAVTAFLIGMNNVSLDL FT PKNTHILLFADDILLITHGPTFARNRIKMQAAVRAVGKWADSVGFKIASEK FT CAILHSCSTRHHHWKRPIVLNGVNFPNREKIKILGVTLDRNLNYNQHFREV FT KESCRSRIQLIRTISSRHKTSNRKTIIQVGNSIITSKLVYGIELTCAAFSA FT LVNTLTPVYNEIVRRASGHLKSSPILSIMIESGNLPFPLKAALILAKRTTS FT VMEKIPEESSTLLRKTNEILREKIGSSLPEVAKLPRVQERRWDDVKPKIDW FT KIHHKIKPTDTAAQKRAVFKQRVEQKYQRHQHIYTDGSVSRHGVGMGAYSV FT HWSISKKIDNKCSIFTAETAALIIAVDMTQEDRIYETVIFTDSASALLNVE FT SGKSKNALIQTLESKLENRTDITFCWVPSHCGILGNEKADKLAGEGYRAEA FT WITPLPKQDVNRWINTQLELVWENQWHITRDVFTRKIKGTTKSWDDRGHRT FT EQRIMSRLRIGHTRLTHRHLMGLGPREMCFTCGVPITVEHILVDCPSYTAQ FT RQENLLSNSIREILGHEPESEQRLLKFLRETHLYETI" XX SQ Sequence 3415 BP; 1136 A; 782 C; 711 G; 783 T; 3 other; acggccaatc taatttcgca actgaactac aaaagctcat tgatcaactt gaatctcctt 60 ttttgatttt gggagatttt aacgcccacg acccgctgtg gggcagtctt cgcgaggacg 120 gtagggggaa ggcaatccgt caagcagccg aaaccaacaa cattattgta atgaacgacg 180 gtagtcacac ctatcaccgg ggaacgacaa actcgtgtat cgacatcagt ctctgctctc 240 aaagccttat caacacctta acatggtgca tacacacgga tacgcacggc agtgatcata 300 atcccattga gattcacgca tgtgggcttt ccccaagcac aacccgaaga cgacgatggc 360 gctacgaaga tgcaaattgg aatgattacg aattggagat tgaaaggaga ttagaagcag 420 atcgtgaata cacaatttcc gagataacac aatctatcct caaggcagcc gaagcaagca 480 taccacgaac cagtggcaaa cccggccgcc gttctgttta ttggtggaac aaagaagtag 540 aagatgctat taagaccagg agaaaagcgc tcagaaaact caaaaagtta tcaaacgaaa 600 gcccccttcg agatgatgct ctgaacatgt tccgaattgc caggaatcat gcacgcaaag 660 taatggaaga aagtaaaaag aaaagctggg aagagtttct cgacggcatc agcaaccaaa 720 catcgtgtac tgaaatgtgg cgtcgggtaa acgctcttag tgggaaacgc cgtacgcgtg 780 gactctccat ggaagttggc ggctctttaa ctgataatcc tgctttaatt gctcaggaaa 840 ttgggcagta ttttcagaaa ttgtcagcta cagaaggtta cagctcaaaa tttcagaggt 900 tgaaatcaac agcagaggaa aatcccgttt cgtttcaaat tgatgcatcg aacgaaaacg 960 aagactacaa caagtgtttc agtatggacg aactattacg agccttagat aacgcgcaca 1020 gtaaatcatc aggaccagat gacgtaggct accccatgct caaacactta ccgtacttag 1080 cgaagaaatc tctcctcaac gcgatcaaca aattatggct ttccggatca tttcccgaag 1140 aatggcgaaa aagtattgtt atcccaatcc caaaaaaagg ccaaaatgcg tcttctccta 1200 ctgggtaccg accaattagt ctcacaagct gcatagctaa agttgctgaa agaatggtca 1260 accgccgact aaccacagtt ctggaagaaa acaatctcct catcgatcaa caacatgctt 1320 ttcgcaaagg ccggggatcg ggtacctatt tcgcccgtct aggggatatc ctagatgacg 1380 ctagaactca agggcatcac atcgacctgg cagctcttga tatttccaaa gcgtataatc 1440 aaacatngag aacgggtgta ctaaggcagt tgcaggactg gggcttcaac ggaagtctag 1500 ccatgttcct gaaaaanntt ctatctgata gaacatttca ggtcgccatc gggaacacac 1560 tgtcagatct attcgtcgaa gaaaatggcg taccacaagg ctcagtccta gcagtaaccg 1620 ctttcctgat tggcatgaac aacgtctctc ttgatcttcc caaaaacaca cacatattac 1680 tatttgctga tgatatcttg ttgataactc acggaccaac atttgctagg aaccgaatca 1740 aaatgcaggc agctgtccga gctgtaggaa agtgggcgga tagtgtggga ttcaaaattg 1800 ccagcgagaa gtgtgccatc ctgcatagct gctcaactcg tcaccatcac tggaaacgac 1860 ctatcgtact caatggagta aactttccca atagggaaaa aatcaaaatc cttggtgtaa 1920 cattggatcg caacctcaat tataaccaac actttagaga agttaaagaa agttgtagaa 1980 gccgtattca gctcattaga acaatcagca gtcgacacaa aacaagcaac cggaagacaa 2040 tcattcaagt tgggaacagc ataatcacct caaagctagt atatggaatt gaactaacat 2100 gtgctgcatt ttctgcacta gtcaacactc taacacctgt ctacaacgaa attgtacggc 2160 gtgcctcagg ccatctcaaa agctcaccga ttctatcaat tatgatagaa tctggcaatc 2220 taccttttcc cctgaaagca gcgctgatat tagcaaaaag gacaaccagt gttatggaga 2280 aaattcccga agaatcaagc accttactaa ggaaaacgaa cgaaatctta agggaaaaga 2340 taggatcttc actaccggaa gtggccaagc ttccccgtgt tcaagaacgg agatgggacg 2400 atgttaaacc aaaaattgac tggaaaattc atcacaaaat caaacctacg gacaccgctg 2460 ctcaaaaaag agcagttttc aaacaaagag tggaacaaaa atatcaacga catcaacaca 2520 tttacacaga cggatcggta tcccgacacg gagtcggaat gggagcttac agtgttcatt 2580 ggtcaatatc taagaaaatt gataacaaat gttctatatt tacagctgag actgctgccc 2640 tcattatagc tgtcgacatg acccaggagg atcgcattta cgaaacagtt atttttacag 2700 actctgcaag cgctcttttg aatgttgaat ctggaaaatc caaaaatgcc ctgattcaaa 2760 cattagaatc gaaattggaa aaccgtacgg acattacctt ctgctgggtc ccaagccact 2820 gcggaatact cggaaatgag aaagctgaca aacttgctgg tgaagggtac cgtgctgagg 2880 catggataac gccactcccc aaacaagacg ttaatagatg gatcaacaca caactcgaac 2940 tagtttggga aaaccagtgg cacataacca gagacgtctt cactaggaaa ataaaaggaa 3000 ccaccaaaag ttgggatgat cgaggacacc gaacagaaca acgcattatg tccagactta 3060 ggataggaca taccagacta actcatcgac acctcatggg actgggccct agggaaatgt 3120 gtttcacgtg tggagtacct attactgtgg aacatattct cgttgattgc ccttcctata 3180 ccgctcaacg ccaggaaaat ttgctttcga actcaatccg agaaatatta ggacacgaac 3240 ccgaatcaga acaacgcttg ctgaaatttt tacgagaaac tcacctgtac gaaaccatat 3300 aatcaaatgt atatacaaat tttgtgaaat cttcctctaa ttctatccca tctttctttt 3360 gaaaaagagg cgaatgcacg tggagtgtaa aacctctata ataaaacaac aacaa 3415 // ID L1-1_CQ repbase; DNA; INV; 4894 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4894 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 131-131 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 162..1610 FT /product="L1-1_CQ_1p" FT /translation="MSSVRIRENTFKLCLKNFPKRPTYEEIHAFVHDKVGL FT KPTQVTRLQMNHSQNCVHIKCVDLKTAQDAVINHNDRHELVVDKKRIKVRL FT MMDDAGVEIKIHDLSENIRNEEVAAYLKQYGDVLSVRDVVWSESFKYKGVN FT TGVRVAKVVLRRHIKSFVTILGETTMISYRTQPQTCRHCHNQLHPGSSCVE FT NKKLLGQKQDLNKRLDLARNQPSAGSSYADVLGRMEEAATPLMPQFTPTNL FT TQLHEQLRGVAVEGVEASSSLSATVPSPPPRAEGDVSSTVVPMEQSAIEQA FT VIEQSVFEQSAIEQAVIQQAAIQQAAIQQAAIQQAAIEQSTIEQAKIEQAA FT IEQSAIQLAANEQASNEQPTNEQAANEQAAIEQSELDQSAMELDAQSDEVK FT KDGISSVPNDQSTQQAEGEMSVEADACSSSVSPFVFTPPLPAPSLPNDPLM FT QISDSESTTDSSQEVDGAPFQKVNRKKRGRPKKPKLEA" FT CDS 1635..4838 FT /product="L1-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MSNNPLSYNIASININAISNENKLQALHTFLRLQDLD FT IVLLQEVENPNMCIPGFIVITNVDSARRGTAVALKSHIPYSNVQRSLDSRI FT ITVKLGSSVTICNVYAPSGTQNQSAREFLFRHSLPFYLQNSAPNLVVGGDF FT NCVISTKDATGCNNFSNSLKRFIDSQNLVDSWEHLNRNAVAFSFVRPNSAS FT RLDRIYLSHSLAPHLRSTDFFATCFSDHKAFKVRCCLPDLVGNGHGRGYWS FT IRSHVLTEENLAEFEEKWTRWLRERRNYTSWLGWWIECAKPKIRSFFRWKT FT NQAFREFHATNELLYARLRRAYDELLLNPGSTAEVNKIKGEMLLLQNRFSK FT AFERINDKFVAGEKISSFQLADRNTRKKKSAIDSIRHRNRLLTDPAEVQNH FT ILDYYKTLYSLENAAPNTNFPTNRAIPPGSDSNERMMEEITTEEIYFAIKS FT SASRKSPGSDGVPKEFYLRAFDIIHRQLNLILNEALCGNIPSNFVEGVVVL FT AKKKTDEDTIKAYRPISLLNFDYKLLARILKQRLEKVMEENHILNAAQKCS FT NQHRNIFEALNAVKDRVAEFNCRKRSGRLISFDLDHAFDRVSRSFLLVVMR FT SMNFHPGFVSLLDKIMSMSSSRLLINGNLSSPFPIQRSVRQGDPLSMHLFV FT LYLHPLLEKILQICDNPHELVVAYADDISVIVADDNKLDQIKQAFADFGLC FT SGAVLNTNKTVSINIGRQQSGRRTPSWLNVSDSVKILGISFFNSLKQTINF FT NWGEVIRKTSQLMWIFRPRVLTVFQKVILVNTYITSKLWFMASILSIPNAA FT VARMTSQIGSFIWERYPTRVAMEQLGLPIDQGGLNLHLPMQKCKALLINRF FT VHCQTNMPFAESFGSQLKNPPNPAGVPALYPCLKMLARELAYLPPRLAENP FT STTSLLSFYREKANTPKIIEENQAISWKRVWRNIRCRELTSAEKSSYYLLI FT NQKIPHAALLFRQNRVSSNFCEMCPLXVEDLEHKFSQCVRTVHLWDFLKSK FT LESILNRRVTFKTLVLPEMKNVSPAAKNKALKMFITYVNYVLEATSSLTVG FT ALEFVLQCNCL" XX SQ Sequence 4894 BP; 1390 A; 1258 C; 1123 G; 1121 T; 2 other; gttaggtcca agcactcgcc gcgatcagac gtatttctat aacagctccg cgttaaattt 60 ttatcgttta tcgcatcgca cacttcggtg ttgctttgcc ggctcgtcag ccgcctgacg 120 acagtgcaat cttttcgcat atcgtgccag attttgcgaa catgtccagt gttcgaatcc 180 gcgaaaacac tttcaagctg tgcttgaaga actttcccaa gcgaccaaca tacgaagaaa 240 ttcatgcatt cgtgcatgat aaagtggggc tcaagcctac ccaggtaacc cggctacaga 300 tgaaccactc tcaaaactgc gtacacatca agtgcgtaga cctgaagact gcccaagatg 360 ctgtgatcaa ccacaatgac cgccacgagt tggttgtcga caagaagcgg atcaaggtgc 420 gacttatgat ggacgatgcg ggggttgaga tcaaaatcca tgatctgtcc gagaacattc 480 ggaacgaaga ggttgctgcc tacctgaaac agtatggcga cgtgctgtct gtcagggatg 540 tagtttggag tgaaagcttc aagtacaaag gggtcaacac cggtgtacgg gtcgcaaaag 600 tggtgctacg aagacacatc aagtctttcg taacaattct tggtgagacc actatgatta 660 gctatcgcac acagccacaa acgtgtcgcc attgccacaa tcagttacat cccggcagtt 720 cttgcgttga aaacaagaaa ttgctaggac aaaagcaaga cctgaacaaa agacttgacc 780 ttgctcgcaa ccagcccagc gctggatcaa gctacgcaga tgtgttgggt cgtatggaag 840 aggccgcgac tccactgatg ccccagttca caccaacaaa cctgacacag ctacatgagc 900 agttacgcgg cgtggcagta gagggagtcg aggcaagctc ctccctctca gcaaccgtcc 960 cgagtccccc acctcgcgcc gagggagatg tgagcagtac agttgtacca atggagcagt 1020 ccgcgatcga gcaggccgtg atcgagcagt ccgtcttcga gcagtccgcg atcgagcagg 1080 ccgtgatcca gcaggccgcg atccagcagg ccgcgatcca gcaggccgcg atccagcagg 1140 ccgcgatcga gcagtccacg atagagcagg ccaagatcga gcaggccgcg atcgagcagt 1200 ccgcaatcca gctggccgcg aacgagcagg cctcgaacga gcagcccacg aacgagcagg 1260 ccgcgaacga gcaggccgcg atcgagcagt cggagttgga tcagtcagcg atggagcttg 1320 atgcacaaag cgatgaggtg aagaaagatg gaatctcgag cgtgccaaac gaccaatcca 1380 cccaacaggc agaaggtgaa atgtcggtag aagccgatgc ttgttcttcc tcggtgagtc 1440 cgtttgtctt cactcctccc ctcccagctc cttcccttcc caatgatccc ctaatgcaga 1500 tttctgactc agaatcaacc actgactcct ctcaagaggt ggacggtgct ccgttccaga 1560 aggtgaatmg gaaaaaacgg ggccgcccaa aaaagcccaa acttgaagcg tagacccact 1620 cctccaaccc cacaatgtct aataatccct tgagctacaa catcgctagc ataaacataa 1680 atgctatatc taatgaaaac aaactccaag ccctacacac tttcctccgt ctccaagatc 1740 tagatattgt tctcttacaa gaggtagaaa acccaaacat gtgtattccc ggattcattg 1800 tgataacaaa tgtcgacagt gcaagaagag gtactgctgt tgcccttaaa tcacacattc 1860 cttactctaa tgttcagcgc agtttagata gccggattat cacagtcaaa cttggtagtt 1920 cggttactat ctgtaatgtc tacgcaccat ccggcacaca gaaccagtcc gcccgtgaat 1980 ttcttttccg acactctcta cccttctatc tacagaactc tgccccaaat cttgtcgttg 2040 gtggtgattt caattgtgtg atatccacca aagacgcaac aggttgcaac aactttagta 2100 attctctaaa aagattcatt gatagtcaaa acttagttga ttcatgggaa catttgaata 2160 gaaatgctgt tgcattcagt tttgtccgtc caaattcagc ttcgcgcttg gaccgaatat 2220 atttatccca ttccctcgct ccgcatcttc gctcgactga tttctttgca acctgtttct 2280 ctgatcataa agcgtttaaa gtgagatgtt gtctgcctga tctcgtcgga aatggccacg 2340 gtcgagggta ctggtcaatt cgctctcatg tcctcacgga ggaaaacttg gctgagtttg 2400 aggaaaagtg gacgaggtgg ctacgcgaac gacgaaacta cacgagttgg ctgggatggt 2460 ggattgaatg cgccaagccc aagattcgca gtttcttcag atggaagaca aatcaagcat 2520 tccgagaatt ccacgcaacg aacgagctcc tgtacgccag gctgcggagg gcatacgacg 2580 agctgctgtt aaacccagga tcaacagctg aagtgaacaa gatcaaagga gaaatgttgt 2640 tgctgcagaa tcgtttttcc aaggcgttcg aaaggatcaa cgacaagttc gtcgctggag 2700 agaagatctc gtcattccag ctcgcagatc gaaacacacg taaaaagaaa agcgcaatcg 2760 actcgatccg acatcgaaat cgtttgctca cagacccagc ggaagtccaa aaccacatcc 2820 tggactacta caaaacactc tactcgttgg aaaatgctgc cccgaacacc aacttcccta 2880 caaatcgtgc cattccgccg ggatcggaca gtaacgagcg gatgatggag gaaattacca 2940 ctgaggagat ctacttcgca atcaagtcca gcgcgtcacg gaagtcacca ggaagcgacg 3000 gtgtccctaa ggagttttac ctgagagcat tcgacataat ccataggcaa ctgaatctaa 3060 tactaaacga agcgctttgt ggcaacattc cgagcaactt tgttgaaggg gttgttgtac 3120 tggcgaaaaa gaaaaccgac gaggacacga tcaaagctta caggccgata tccttgctaa 3180 actttgatta caagctactg gcgcgtatcc tcaagcaacg tctggagaaa gtcatggaag 3240 aaaaccacat cttgaatgca gcacagaaat gctcaaacca acatcggaac atcttcgaag 3300 ccctaaatgc cgtaaaagac cgtgtagcag aattcaactg tcgaaaaaga tcgggcagac 3360 tcatctcctt tgacctagat cacgccttcg accgtgtgag taggagtttc ttgctggttg 3420 tgatgcggag catgaatttt caccccgggt ttgtaagtct tcttgataag atcatgagta 3480 tgtcgtcgtc tcggctcctc ataaacggaa atctgtcctc cccattcccc atccaacgtt 3540 ccgtgcggca aggtgatccg ctaagcatgc acctttttgt cctgtaccta catcctctcc 3600 ttgaaaaaat cctccaaatc tgcgacaacc ctcacgagtt ggtggttgcc tacgcggacg 3660 acatctccgt tatcgttgcg gatgataaca agctggacca gatcaagcaa gcgtttgcag 3720 atttcggact gtgttcggga gctgtgctga acaccaacaa aaccgtctct atcaacatcg 3780 gtaggcagca aagcggacga cgaacaccca gctggctgaa cgtcagcgac tcggtgaaga 3840 ttctcggaat aagtttcttc aactccctca agcaaacaat caacttcaac tggggggaag 3900 tcatacggaa gacctctcag ctgatgtgga tcttcagacc acgggttctc acggtcttcc 3960 aaaaagtgat tctagtgaat acttatatca cctccaagct ttggtttatg gcctcgatac 4020 tcagcattcc caacgcagcc gttgcaagga tgacgtcgca aattggaagc ttcatttggg 4080 agcgctaccc gacacgtgtg gcaatggagc agcttggtct tccaatcgac caaggaggcc 4140 tcaacctgca tcttccgatg caaaaatgca aagctttgct gatcaaccgc tttgtgcact 4200 gccaaaccaa catgccgttt gctgagtcct ttggaagcca gctgaaaaat cctcccaacc 4260 ccgctggcgt tcccgcactg tacccttgcc tgaaaatgct ggcacgggag ctagcctacc 4320 tcccacctcg gcttgccgaa aacccttcaa ccacatcgtt gctatctttc tacagagaaa 4380 aggcaaacac accgaagatc attgaggaaa accaagccat ctcatggaaa agagtttggc 4440 gcaacatcag atgtagggaa cttacgtctg cagagaaatc ttcttactac ctgctcatta 4500 atcagaagat accacatgca gcacttttgt ttaggcaaaa ccgagttagt agcaactttt 4560 gcgaaatgtg ccccctagma gttgaagact tggagcacaa attttcacag tgtgtcagga 4620 ctgttcactt gtgggacttt ctcaagtcaa aattagagtc gattttgaac aggagagtaa 4680 catttaaaac cctagtgcta ccagagatga aaaatgttag tccagcagct aaaaacaagg 4740 ctctaaaaat gtttataaca tatgtaaact atgtgctaga agccactagc tcgttaacag 4800 taggcgcact agaatttgtt ttacagtgta attgtctcta aaatgtaaag tgaaaatgtt 4860 ctaataaacg tgtttaaaaa aaaaaaaaaa aaaa 4894 // ID Kolobok-1_AA repbase; DNA; INV; 6114 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6114 BP; 2069 A; 1083 C; 1185 G; 1777 T; 0 other; gaggaaacac acctccatcc tcctaaacag tcatcatcat caatcaatga cgggtatcaa 60 aatggtagaa acgcagtaat cgccaagatt gaaagcattt tacgacagga aaatcaaaaa 120 atatttcatt tcgaattata tcaagcaata attacaacac tattttatca ccacttacct 180 tttaaagaca gaaatatcat aaaacataag aaaatactgg attttaaaat gaaacaacct 240 tctcaccatt catcactact actgatgtac cttgctcggc ttgcgaaaac tcccccaagc 300 aagtcaaact ttggatagct caattcctca ctcgcgcaca aaaaatcatc gtgtggcaca 360 tttcactgtc aaggtatact ttcacaaaat tcaaaaatag cacggttatc actaattttt 420 gattaatttt gaaataaact gaactttttt atatgcactc aacgattcta ccagctgtaa 480 tcaaaacata actgctgcta gaccaacaca ttgctttgtt gtgtaattat atccatgcgt 540 gctagtatgc gcatagcaat cattgaccag tattgccaaa tctactcttt ataaattcga 600 tataaaattt attgaatgtt gtattattta agtcaatcgt atgtaaagca ttcattttta 660 agaaccattt gcagatagat aaagatttcg acgaatattc aatgctaaat cgatttcggt 720 acaaagtttc tgtaactttc gttttgcaag attatttgaa ccagtatgta tggcagcact 780 gacaacgaaa acaccatatg ttcgatcata tgttttttat ttacgttttg aagcatcgct 840 ggttctacat ctcgcatgac ttgttgatag agtgttttcg tgactaaaac cgatcggttg 900 tacctaaaga tgccgcgtaa aactaaggct caaatagctg ccatcaaacg tgaagcgcag 960 aaaaagcaag caaactgtag tgctaatgat gattgtaata gtgaggaagt tacagaagaa 1020 gattatcagc gtgaagactg ggattcagaa gacacgagtg atggatgtgg acatcgtgtt 1080 gagttgttgc cggcaacaga aagtgaattt gtgagtgatt tctgggagta ctgttcagta 1140 tgctccaccg tatcagattc cggatcggaa ctcagtgctt atgcatctga tgaagatagt 1200 gatattgcag aaattaattc tatgcgtaaa ccatgtgagc tgtctcactg tgaagatgta 1260 tggccatcat ttaatgaaga tacggcatct tttcttcgtg atttatctat cccgagtaca 1320 gcgtttgacg atagtgtaaa ttggtcgtat gatgatgcgt ttgctccttg ttctccgtct 1380 gcagaattgc acgatagtag acaagccgac agcctaggag aaaccggctc tgttccaaat 1440 acaatacctc gtacaccgaa cttgagcaat ggtacaggta agccaataga tatacactgt 1500 taaaatatgt tataccgaat tgcatgtaac aatgctttat gcaagaaatt gtttcgttta 1560 aacttaaatg aaatataata ttaaattaca cgttcctatc ccgtgtaacc ccgttgcttt 1620 gactgctttt aatctgacaa ctgtatatta agccacgcta atgtgtacat ctttcaaatt 1680 aaaaatggtt ggttcatacg tcatttagat cgtggaacac agtgaagtga aactggacgc 1740 agagtcaaaa caaaatagtc aatgaggtga cgagtaagct gtacctaatg ttgtggaata 1800 aaagtcgcta tcatcaatcc ggctgctact atgcaatatg tttggttatg ctgctcaata 1860 tataaagaag aagcgctttg ttgtttgtta tacagcgcac ttacagtaga aagctatata 1920 ataacaatac gccactacac agtctttttg atgtaaaaac tgcgtttgac gtttcgcgca 1980 cttctacagc ctgatgtagt ggtataagca tgattataac atctaataac ataattttga 2040 aatattacat gactcgtttt cgatgtagaa gaaaacgatg tttttcagca aggaccgcgt 2100 caagacttgt ggatgcagag atcgtgagtt cgatcccaag tggtggcagg tacttaattg 2160 tatgtagctt tgggaatatt aaattcagat acttatgaat tgaacttcgg aagctgtgaa 2220 caacttgaaa atcatattaa atcgataata caggcaagct tataaacgtg cagttgcgaa 2280 atgattgaaa aagtgccttc cgcagatgct atgcaactac ttgtcctata acggtcgaaa 2340 caaagcaatt atcagtatac ataagagcta ccaacactgc ttttgaaata gctgagtaga 2400 ttcgccagat tgatggtaaa aagttcgcat agaagctttt gttcgtatgt acagcgcatt 2460 tactcagcat aaagctgtgt aaagaatgct tcgagaatgt ttgcaacatt tgcactaaat 2520 gttagttgag tagcatttaa actttagcta attgttgtgt atctggtgcg caatgtgcag 2580 gttttcgaac aaaacttcaa tgtataatat tattttattt ttctttcatt caagatacac 2640 ctgagagtaa gtcacactct cagatagaag ctcaatcgaa tgaaccaaaa acatgtgaaa 2700 ccgaaaacaa agcatacttc acggaatcaa tacttgaatt aaaacatcga gaactggaaa 2760 agtacccaat acaactcatt ccgtctgttg tcgagcatag atctggtatt atttatattg 2820 atgaatgtta tgtattttct ataagattta cgtttctaga acgtgtgttt acacaagcag 2880 aagcagaaaa catgtctcat tttactgaag attgtatcgc tgataaacgc acagtatgtg 2940 tatacagtat ttatctgttc aaagattcat aggcgttttt tattcaattt ttaaggtaat 3000 cccacaagaa agtaacattc aatcaatgca tctagctaaa aaacgtaaga ccgataaagc 3060 gtttgatgtg aattcatcgt tatcaagctt accaacgaat gacgaaatct tgaaaacaaa 3120 agtaatggca gttgatactc atcaatactg cgaagaaaag aatgacaatt ggaatgatat 3180 tgaaggacgt agaattgttg agttcaacta tgttatgaaa tgtattgtgc gagctcaagc 3240 ccatcattcc aaaatatgta atggagttct ttgccttgat caggagtact atcaatccat 3300 gatttcaact atgtggttca agtgtgacag ctgtgagaca atgtttaaga taagaaccga 3360 ggagccctcg cagcgaccga aattgcggaa aagtattgta tggggaacgt tatgttcagg 3420 aggcacgtat acgcaaacca agcaattact aagcttttgc gatgtaccat ttatgccagt 3480 gaaaacattt tcgctggatg aaatggaaat ggataccaca ctccaagaag cagtcgatga 3540 atcaaccgat aaagccatcg aagttgaaaa agctacctat tttgaagaac ataaagtcaa 3600 cgaccagaat tcgtcacccg atcaaattgc aaaaatcaaa gcttcgttgg atggcagctg 3660 ggccactcga agctatggag ccagatatag ttctgcatcc ggttgtgggg ttatcgtagg 3720 agagaaaaca ggaaaaataa tttacgtggg atgtcgcaac aagagatgtt ctgtatgcac 3780 cagaaattcc aggctaggta acgtcaaagg tccccataag tgttatcgca attatgtcgg 3840 ctcatctggg ggcatggagc cggatataat gattaaagga tttgaagaat tggaaagaaa 3900 aggagtatgg ataaccacac ttataaccga tggagactct acaactgtag cgcggatcaa 3960 aaactctatt aaatacggtc catctataga acatcaatta tgctgcaacc atatttgtaa 4020 aaatatggga aagaaactac gggatgtaag ttttgtttaa tatcatttca tataaaagaa 4080 aaattacaat ttttctatta gatcaaatgc cccaaagatg tagctgcttt ggtgtctgca 4140 aacatagcac gcattacgcg tggagttcga gctgctatag ttcattgcgc agaagcaact 4200 ggaggttctg acgtagaagg gctccggcat gatataagaa atgtaccagc acatgtcttc 4260 aatgatcatc gaaaatgtcg tgattatttc tgttcgcagt tgacaaatga aaatgaatgg 4320 cacatcgact tgctccaaaa atatggagtg ttgtccaaaa ttcaaggggt ggtagaaacg 4380 gttgcggcta aggcagagtt tctgaatcac aacaagacga gtaacacgtg agcaaacgta 4440 ttattcattg caaactgatt ttgactattg ttttattaca gctctgagaa cgtcttctgc 4500 cgtttgtgta aatacaatgc tgggaaaaga ttgaacttca ctagtcgtgg ttcttatcaa 4560 cggagagtta taataacaac tctacaacat aacgaaggat acacgtggca ctcaacatcg 4620 atgaaaaagt ttattggatc tgaccctcca aagcttattt tcaaatttgc cgaagagcaa 4680 gaacgtctga aagcagccac attggtatcg aaaaagcgac ctgattttaa caggaagcct 4740 atcaacatca gcgagattga gcgagactat ggcgaaaccg ctgtcgctgt ggcttctcgt 4800 gaaattgacg ttgaagaaga aattccagca gctaaacaaa aatatatagc acgtatttct 4860 ggtttatatt ttacccaagt ttaacttttt ctaacttttt aaataaggca gcgtccattt 4920 ataacgtaac gctaaaattt gaaattttgg accccctccc ccccttcgta acgctttttg 4980 tatgaaaaat ttcaattttt agtatgaacc gtaacgcttg agcctactcc ccccctcccc 5040 ctagagcgtt acgtaatttg tggatgccgc ctaatattta ttgatattga ttgatatata 5100 tcctacattt tagatgaccg cgcaatctaa cgaaaagctt cggaacaatg tgagtttgcc 5160 agaaatggaa gaatatgcga aaggcagaat tttaccgaaa tttgttgctg aggtagttgg 5220 atcgacgaaa aaaacattgc aacaaaaatg tgacaatata ttgttaacaa ccaatacatg 5280 gaatttcatc aaacaaaaca gatctattat cggtccaatc actgaagcta tcaatggaaa 5340 attcgggtac aaaatcagtg gcagtaaaat gttcgttgat aactgccata attatatttg 5400 ctgtattcca gacggtattc aacaacaatt tgatattgga tatattgtat accaaatgaa 5460 agatctattt ttcaggagtt tcggatgatg gagaaacaat aatgattgtg aagaaaaagg 5520 cgatcaaggg acctctaaaa gaaaacgctg ctaaactagg atgcaaatta atcgaaaatg 5580 acatcaaact tgatcgtaaa acatattgtg aagtacaagt ttcgctgcac gtgtgtgagg 5640 cagcgaaagc tgtagtgtgc ttcgtcaata gcaacaatat tgatgattac ttggtgaagg 5700 agtgcgccct ggatgcaaca tttttcgaag aaaaggtgga agagcatatt aaagatttct 5760 tcgaaaacat tctcatccaa aaaatcattg aaaatgaagt atttgttaaa agttgaattt 5820 ttaaagcgta ttttttggaa gattgcaaat gcaaataacg gtaccccccc cacttccatt 5880 aataaatgaa atagaataaa acagatatga attgttttat atttaatttt actaaaaagt 5940 tgattacaat acttcccacc attggcataa tcgtgaggca agtgtatttg tggttttgca 6000 taatatgttt acataagatc gtaccaccac gaaggcataa aataaagata aaactaacac 6060 tataataatg cttcacaaac acatggaagt catagaatga cggaaatttg aacc 6114 // ID Crack-1_AAe repbase; DNA; INV; 4787 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4787 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1217-1217 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 376..1425 FT /product="Crack-1_AAe_1p" FT /translation="MSDDTIVCVTCKKEEKDPKKVIECAQCHKCEHFKCRN FT TFGNAIRKLKGKDFFCTLECQEFFQRASGNSSTDPDILKELHTVLMEVRAT FT RSEVQDMKSTICEMEKFQNFLSSKLDTLLNEVKALKVEHAAMKAEADQIRK FT KHHQLSYTVDELEMDVERLSRMTLSRNMIVLGIPPRKDENVKEIIGKVSSA FT IGYEIPDGAILEAKRLTSNGEKLRNTEPAPIKVVFVSEQYKEELFAKKRSF FT GPLLSAAVDLPLPDGSRKIVLRDELTPRGMELYKQVREIQDCLNYKFVWPG FT RNGVILAKKSENSKTEFIRSRSDVAGLQRTGAKRNLDTSTTNSSTQSSPVS FT EPAAKRR" FT CDS 1467..4352 FT /product="Crack-1_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFSNYFYDCSESWLKNKYCAVSNTSLKIIQLNIRGMN FT ELSKFDNVRELLDRYGERVDIIVLGETWLKDDKAVLYSIKGYKGYFSCRDS FT SHGGLALFVRQDMHYDLCSNRNVDGFHHIHLQLQTKGRHLQIHAVYRPPSF FT DARRFFSEIETMLSSGSSNHDRLILGDMNVPVNIVTNNIALEYIRLLESYH FT AFLTNTMVTRKASGNILDHVVCSGQIVDNIINETVYNDLSDHCFIVTNVEL FT RHEVSKQIFTKNVIDHARLNELFAQNSTQIPPDCDANGKLEHVIRTYNSLL FT TQCTKKVTLKTKLKGHCPWMTYDLIRLIRIKENALGRYHRNPDDVAIKETL FT QHASKLVQRKKAQCKSDYYYKQLERADARTAWRFVNSNLGKQPNINKISAI FT SVNDRLVSDNSHICSAFNDFFCSVGSSLAAKINSDRNISKFGTLVPQRNSL FT YINPTTISEITIMIHNLDSKKSAGPDNIPANFIKTHHSFFARLLTDVFNEI FT ISTGQYPDCLKIARVVPIFKSGNNKDLNNYRPVSCLSIMDKILEKLLATRI FT ISFAQRFNLIFDHQYGFRSGSSTLTACSDLVDNIYDSLDRRRLSAALFIDL FT KKAFDTIDHHLLIEKLEIMGFRGVTKSLLSSYLTNRRQFVAIAGYKSPPGL FT MKTGVPQGSNLGPILFLLFINDLSKLPLNGKLRLFADDTSLLYEANSIENL FT QRQIKEDIILLQDFFSANLLSLNLSKTKYMIFHSPRKRLPARNILEVRGCV FT IEEVAEYSFLGLLLDSTMKWNAHVNMLKNKLSSICGIFRRIAECIPKKWLM FT KLYHALFHSRLQYLVTCWGSASKSTLKEVQVLQNRCLKIIHGFPVLYPTIA FT LYTRNTDSILPVKALQSFQMLLHVRKILSDPSTHHNTHLRRIARNRESRYE FT GNLSLGRPYTEFGRRRFEYAGSKLFNELPTVCKTSATLNKFKFSLRNHLKE FT NVNSYLR" XX SQ Sequence 4787 BP; 1554 A; 1005 C; 963 G; 1265 T; 0 other; gtctggcaac accgaatcga ctagaacggc atcctattag ggctcggaat aaattacaaa 60 aagtgaagtg aaatcgaaaa attctgaagc tacgacatcg atgaagcagg aatcttaaca 120 tagataaatc tgctgaaacg ttatacaatc tggtttttcg acgattacgt tctaaaacac 180 gaatcagttt atcccataat tatcgcttta agtgattcat ggtggtcttt gcacagtttt 240 gtctttattc ttaacacaac acacacaagc gtgatcgtga aatatttcgt caatctgtct 300 atcacattct gtcggagttt gcggttggta gcgataaacc agtcagtctg ttatcaccat 360 cgtaaaatat aaacaatgtc tgacgatacc atcgtctgtg ttacgtgcaa gaaggaagaa 420 aaagacccta aaaaggtcat agagtgtgca cagtgccaca aatgcgagca tttcaaatgc 480 cgaaacacat ttggaaatgc aatacgaaag cttaaaggaa aggacttctt ttgtacccta 540 gaatgccagg agttcttcca aagggcttca gggaattctt cgactgaccc ggatatactc 600 aaggagctgc atactgttct tatggaggtc agagcaaccc gctcggaggt gcaggacatg 660 aagagcacga tttgcgaaat ggagaaattt cagaatttcc tctccagcaa gctcgacact 720 cttctgaacg aggtcaaggc acttaaagtg gaacatgcag caatgaaggc agaagctgat 780 caaatacgga aaaaacatca tcaattaagc tacacggtgg atgaactgga gatggatgtg 840 gaacggttga gcaggatgac cttgtcgcga aatatgatag tccttggaat acccccaaga 900 aaggatgaaa acgtcaagga aatcatcgga aaagtgtcgt ccgcgatcgg ctatgaaatc 960 ccggatggag caatactgga ggccaaacgt ctgacatcga atggagaaaa actacggaac 1020 accgaacccg caccgataaa agtagtgttt gtgagtgagc agtacaaaga ggagttattt 1080 gccaaaaagc gttccttcgg tccgctgctg tctgctgctg ttgatctacc gttgcctgat 1140 ggttcacgta aaatcgtttt acgcgacgaa ctcacaccac gtggaatgga actctacaaa 1200 caagtacgtg agatacaaga ctgtttaaac tacaagtttg tgtggcccgg cagaaacggt 1260 gtaatattgg caaaaaaatc cgaaaactca aaaacagagt tcattcgctc tcgctcagac 1320 gttgcgggat tacagcggac tggtgcgaaa cgcaatttgg atacatccac gaccaactca 1380 tctacacagt catcgcctgt gtcggaaccg gcggcgaagc gacgttaagc aggcccactg 1440 cggacacact atgaagattt atttcaatgt tttcaaatta tttttatgat tgttctgaat 1500 cgtggttgaa aaataaatac tgtgcagtat caaatacgtc tttgaaaatc attcagttaa 1560 acatcagagg gatgaatgaa ttgtccaagt ttgacaatgt aagagaattg ttagatcgtt 1620 atggggaacg ggtggatata atcgttcttg gcgaaacatg gttgaaagat gacaaggctg 1680 tattatactc aataaagggt tataaaggct atttttcatg cagagatagt tcgcatgggg 1740 gattggcgtt atttgttcga caagatatgc actatgatct ttgttcgaac agaaatgtgg 1800 atggctttca tcacatccat ctgcaacttc aaaccaaggg aagacatctt caaatacacg 1860 cagtatacag accaccaagt tttgatgcaa ggcgcttctt ctcggaaata gaaactatgt 1920 tatcgtcagg tagtagtaat catgaccgtt tgatacttgg tgatatgaac gttcctgtaa 1980 acattgtcac aaataacatt gcgttagaat acatccggct tttggaatca taccatgcat 2040 ttttgacgaa caccatggtt accagaaagg caagtggcaa cattctggac catgttgtat 2100 gttcaggcca gattgtagat aacataataa acgagacagt ttacaatgat ttgagcgatc 2160 attgcttcat tgtaacaaac gtagaactcc ggcatgaagt cagtaaacaa attttcacga 2220 aaaatgttat agaccacgca cgtttgaatg agttgtttgc tcaaaacagt acacaaattc 2280 cgcccgactg tgatgcaaac ggtaaactag agcacgtgat tagaacatat aattctctgt 2340 taactcaatg cactaagaaa gtgactttga agacaaaact aaaaggccac tgtccttgga 2400 tgacttacga tctgataaga ctgattcgta taaaagaaaa tgcgttaggc agatatcacc 2460 gtaaccctga tgatgttgca ataaaagaaa cgcttcagca tgcatcgaaa ctagttcagc 2520 ggaagaaagc tcaatgtaaa agtgactact actacaagca actggaacgt gctgacgcca 2580 ggactgcgtg gagattcgtc aactcaaatt taggaaagca gccaaatatc aacaaaatat 2640 ctgcgatatc tgtgaacgat cgattagttt ccgataacag ccacatctgc agtgctttca 2700 acgatttttt ctgtagcgta ggatcctcct tggcagccaa aatcaatagt gatcgaaaca 2760 tcagcaagtt tggtacgctc gttccccaaa gaaattctct atacataaat cccacgacga 2820 tcagtgaaat tacaatcatg attcataact tggactcaaa aaaatcagct ggccctgaca 2880 atataccagc caactttatc aaaacccacc attcgttctt cgctcgtttg ctaacggatg 2940 tgtttaacga aataatatct actggtcagt atcctgattg tcttaaaatt gctcgggtgg 3000 tgcccatatt caaatcaggc aataacaaag atcttaacaa ctatcgccca gtatcttgtt 3060 tatccataat ggataaaata ttggagaaac tgctggctac aaggatcatc agctttgcac 3120 aacgcttcaa tctaatcttc gatcatcaat atggtttccg gagtggatcc agcacactta 3180 cagcctgtag tgatctcgtt gataacattt acgattcact cgatcgcaga cgactctctg 3240 ccgcgctttt tattgatctt aaaaaagcat ttgatacaat cgatcatcac ttactaattg 3300 aaaaattaga aattatggga ttcagaggag ttacaaaatc attattgagc agctacctaa 3360 caaatcgtcg tcagtttgtc gctatagcag gatataaaag tcctcctgga ttaatgaaaa 3420 cgggggtgcc acagggtagc aatttaggcc ccattctttt cttgctattt atcaatgatc 3480 tttctaaact tccactgaat ggtaaactga ggctttttgc cgatgatacg tctttgctgt 3540 atgaagccaa ctccatcgag aatctgcaaa ggcagatcaa ggaagacatc attctactgc 3600 aagatttttt cagcgcaaat ctgctgtctc tgaacctcag caaaaccaag tatatgatct 3660 ttcactctcc tcgtaaacgg cttccagctc gcaatatctt ggaagttcgg ggatgtgtga 3720 tagaagaagt agcagaatat tcgtttcttg gtcttttgtt ggatagtaca atgaaatgga 3780 atgcccatgt aaacatgctg aaaaacaaac taagctctat ttgtggaata ttccgaagaa 3840 tcgccgagtg tatcccaaaa aaatggttga tgaagttgta tcacgcattg ttccactcca 3900 gactgcaata tctggtgacc tgctgggggt cggccagcaa atctactttg aaagaggttc 3960 aagtacttca aaatcgatgc ttgaaaataa tacacggatt tccagtactg tacccgacta 4020 tcgcactcta cacacggaac acggactcga ttcttccagt gaaagctctt caatcgttcc 4080 aaatgttgtt gcatgtacgg aaaatactca gcgacccctc aacgcatcat aacacgcatc 4140 tcaggagaat tgctcgtaac agagagtcaa gatacgaagg aaacctatca ttgggaagac 4200 cgtatacaga attcggccgt agaagatttg aatatgcagg aagcaaactg ttcaacgaac 4260 taccaaccgt ttgtaaaact tcagcaactc tcaataagtt caaattttca ttgagaaatc 4320 atttgaaaga aaacgttaat tcttacctcc gttaacgccg cctccaccct ccttccctta 4380 tatgtgttta tatataagta ttcacctgcc ataatttttc ttatatctcc gccagccacc 4440 gccagccacc gccagcctcg ccaccaaccg ccattgccct ccatcagcaa ccgcccacat 4500 cagcccaact ccaccatcgc caacatttaa tgtcttgcca agaaaatcag aaatttgtaa 4560 acctaggtta acattaacca tcaaatttca cttccttaaa agagcgacta tgctcactgg 4620 aaagtgtatc atattgtatg aaaaatgaaa atgtgaacta tataaacgaa aagatgagga 4680 ggttttatgc ctattggagg aagatgctta aaagaagctc acctccaatg ggcttttccc 4740 tgctccaaaa aaagaagaaa taaaataaat aaaaaaaata aaaaaaa 4787 // ID Poseidon-10_HM repbase; DNA; INV; 2885 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Poseidon-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2885 RA Bao W. and Jurka J.; RT "Penelope-like elements from Hydra magnipapillata (Poseidon RT group)."; RL Repbase Reports 8(12), 2090-2090 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 458..2710 FT /product="Poseidon-10_HM_1p" FT /translation="MLKNGFLCLLMKLKNLQKDLKVILDEIDYKKVEQVTE FT KAREIMFLKSKERLFKKFKILQNEFENNHSNKICKLTTHTKNAVLNLCNXD FT IPKNQNNLLNLGPHFVPSLKYIPYMDIIATTETSALKLEYSNKIEDAQDLR FT KNVLKELKMGKKINQNLTREQRKALMEIKKNKIIDIYPFDKGNGFVRIEHD FT KALEKIREQIGPTRILSEDPTSSYAIKIKTYLSQLNKKQRFSKTEYDSIYP FT SDPIPPRMYGLIKAHKPEKSYPMRIVVSTIGTPNYGISNYLVKKIQPVLNK FT NQTRLKNSFDFISKANSWEIDKNEVQVSFDVINLYPSIPLKEATLILIDQL FT NKDDSYKYSTKLTISETKQLIELCLHRCYFLWNNEIHELENSGPIGLSFMV FT VLAESFLQHHEQNAFKIAMAVNPPLDLKSYLRYVDDSHARFSNTQEAEQFL FT IILNKQHPAIQYTIETESENRTLNFLDLTIINNNKGKYEFKVYRKEAITNI FT QIKXHSNHDPKILNAIFKGYVHRAYSLCSNLYLQDEINFLIQMFNENGYNI FT CQLKRIANLIGNKRSLKINKIQSDSLNIPTVSLPWIPSLSPKLRKIFRKAG FT YRVVFKSNPNLKTLLTSKNKTKLPQNSQPGTYLIECKCSKRYVGETKLQIR FT TRTQQHLKSLNEGKHYQSAIATHNKFCSQEIKWENVKTIKVETKKFDRKVR FT EALEIQRFQCSPLHGGINLDNGQFVKTKFWTPFFTFLRKQERSHSTADVS* FT " XX SQ Sequence 2885 BP; 1152 A; 467 C; 396 G; 866 T; 4 other; tgtatataga atgagctctt akttaatttt aaactgtttt taacgacgtc agttaatttt 60 aaactgttaa aaaattttgt attaattttt acgagctgat gatgctggta accataatcc 120 agcgaaaatt tctaataata aattatcatt tgtattaaga gaattcgtat tatttgatat 180 tttcttacta atatacatat actcttatac acgatgtaaa cactaaagtt atatattaca 240 gctaaatacg gagctgatgt ttacaaacaa actgaacagt tacagttatt gaagaagtcg 300 gtagcaaaat ctaaaaacca gtatatattt ttacaaaaat gtgaaaaaca caaacttatt 360 ccgaaatcat tacgaattaa cagtcctgta aatacaaaga gagcagtaag tattgttatt 420 cgatatagat ttgaaatttt gatttgtatt aaaaacgatg ctaaaaaacg gttttttatg 480 cttgttaatg aaactaaaaa atctccagaa agatcttaaa gttattcttg atgaaataga 540 ctacaaaaaa gttgaacaag ttactgaaaa agctcgcgaa ataatgtttc tgaaatcgaa 600 agagagatta tttaaaaagt ttaaaattct tcaaaacgaa tttgaaaata atcacagtaa 660 taaaatctgt aagttaacaa cccatacaaa gaacgctgtt ttaaacttat gcaatgwtga 720 tattccaaaa aaccaaaaca atcttcttaa tcttggtcca cattttgtac cttcgttgaa 780 atatatacct tatatggata ttattgcaac aacagagacc tcagctttaa aattggaata 840 tagcaacaag attgaagatg cgcaggattt gagaaaaaat gttttaaaag aacttaaaat 900 gggtaaaaaa ataaatcaaa atttaacaag agaacaaaga aaagctctta tggaaattaa 960 gaaaaacaaa attattgata tatatccgtt tgataagggt aatggttttg taagaataga 1020 acacgataaa gctttggaaa aaattcgcga acaaattggc ccaacaagaa ttttaagtga 1080 agatcccaca tctagttatg ctattaaaat taagacttat ctctcacaac ttaataaaaa 1140 acagcgattt tcaaaaacgg aatatgatag tatatatcca agtgatccca tcccgcctcg 1200 tatgtatggt ctaattaaag ctcacaaacc tgaaaaatcc tatcctatga gaatagttgt 1260 atcaactatc ggcactccta attatggaat atcgaactat ttagtaaaga aaatacaacc 1320 tgtcttaaac aaaaaccaaa cacgtctgaa aaattctttt gattttatta gtaaagcaaa 1380 ttcatgggag attgacaaaa acgaggttca agtttcattt gatgtaataa acttgtatcc 1440 atcaatacca ctaaaggaag ccactttaat tcttatagac caattaaata aagatgattc 1500 ctacaaatat tctaccaaac ttactatatc tgaaacaaaa caactcatag agctttgttt 1560 acaccgttgt tatttccttt ggaacaacga aatccatgaa cttgagaatt ctggtcctat 1620 aggtctttca ttcatggtcg tacttgcaga atcattttta caacatcatg aacaaaacgc 1680 tttcaaaatt gcaatggcag taaatcctcc tcttgaccta aaatcctatc taagatatgt 1740 cgacgatagt cacgctagat tttccaacac tcaagaagca gaacaattcc taattatctt 1800 aaacaaacaa caccctgcaa tacaatatac gatcgaaact gaaagcgaaa atagaactct 1860 aaacttcctc gatttaacca taataaataa caacaaagga aaatacgagt ttaaagttta 1920 cagaaaggaa gccattacta atatccaaat taaacmtcac tcaaatcatg accctaagat 1980 tctaaacgca atatttaaag gttacgttca cagggcttat tctttatgca gtaatttata 2040 tttacaagat gagattaatt tccttattca aatgtttaac gaaaatgggt acaacatatg 2100 ccaacttaaa cggattgcaa acttaattgg aaataaacgg tctctaaaaa tcaayaaaat 2160 tcaatcagac tctcttaaca tccctactgt atcgttaccg tggataccat cactatcccc 2220 aaaactaagg aaaatatttc ggaaagctgg ctatagagta gtcttcaaat caaatccaaa 2280 tttaaaaaca ttactaacgt caaaaaacaa aactaaatta ccacaaaata gtcaacccgg 2340 aacttatttg attgaatgca aatgctcaaa gagatacgta ggcgaaacta aacttcaaat 2400 taggactcga actcaacaac atctaaaaag tttaaacgag gggaaacatt atcaatctgc 2460 aatagccacc cataataagt tttgctcaca agaaataaag tgggaaaacg taaaaacaat 2520 taaagttgaa acaaaaaaat ttgatcgaaa agtgcgcgaa gctcttgaaa tacagaggtt 2580 ccaatgttca cctttacatg gcggaattaa tttagataac gggcagtttg tgaaaactaa 2640 attttggact ccgtttttta catttttacg taagcaagaa agaagccatt caactgctga 2700 cgtcagttaa ttttaaactg tttctaacgt taaaaaattt tgtattaatt tttacgagct 2760 gatgatgctg gtaaccataa tccagcgaaa atttctaata ataaattatt aattgtatta 2820 agagaattcg tattaattga tgttttctta ctaatataca tatactctta tacacatgtc 2880 aatat 2885 // ID TTAA15_AP repbase; DNA; INV; 595 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA15_AP. XX NM TTAA15_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-595 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2081-2081 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 595 BP; 195 A; 106 C; 119 G; 172 T; 3 other; gggggttcca gtcggtcatt ttctaaggag taagaattag gcaaattatg tggcaagtan 60 aaccgngcca tgtcaatgtg tgcgagtgta aatgaacatt ggtgtgtgtg ggcacacacc 120 gcagtgtatg tcaacgcgta gcgacccgca tncacattct aagaaccggc gttctcgcac 180 tcgcaggcac atgtcaatat attaaattgg gaaaaaacta tacggttaat tttcgagtaa 240 aaacatacct cggctctgac aaaattgtga gacttaggtt ggtccgattt cgattctgta 300 acaatcctaa caactattaa cacgtcctct aaagaacggt cgaagaattt tttagaaatg 360 ttaatttctt caatacatat ggttgaacaa aaatgttgta aaaactgaaa tttgtataat 420 attataatat tgtcaaaagt tcaaacttta aaaaaaagct ccgaccgttc tctagaggag 480 atgttaagaa atgttattat ttttacagca ttaaaatcgg accatctaaa gtctcacaat 540 cttgtcagag cggagggtct aaaaacggtt atttttttga ccgactggaa ccacc 595 // ID Howilli3 repbase; DNA; INV; 2559 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Howilli3 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Howilli3. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2559 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 51..1550 FT /product="Howilli3_1p" FT /translation="MFSDVFRCFVTYRCLSMHRCLTEHRVLNFTTLQNICF FT SATRKNSRICLIYLPCVCNAKVSKLWKIYINIVICNYKCIYSRITTRTSFC FT WRHFTKIDGNSAKCNICDKVLKTAGNTTNMKDHLRSLHKIDQNVEVPAVPN FT ISGVAQISIAESFQNMAEYGTDGNKTKIINDAIIYMISKDVQPFSIVENKG FT FIHLMNTVAPRYKIPSRYLITKWVDEKFLEMKEMWKRLLNGKTLTLTMDVW FT SDQMSMRSYLGITAHFQLELEMTSLTIGVLELSERHTSVYLTEMLERCCKD FT WHIEKELVTAIVTDNGANVVKAVDLGFGRKKHVPCFAHTLNLIARSAMQRD FT SISACIEKVKGIVTFFKQSCVASDNLRKVTDKVLIQDVVTRWNSTYYMIER FT YLELKNFVNDIVLSVRNDVEMLNGNELQLLNNIMPMLRPLEEATKIIGADT FT YCTASMVIPMVNILKTKLLNVDDITPEANDIKNFLLFEIDKRMGSIEQVCF FT KCHH" XX SQ Sequence 2559 BP; 886 A; 417 C; 494 G; 762 T; 0 other; tagtgatgta aaaaaacatc gatgcattga acatcgatga tttttgggca atgttttccg 60 atgtttttcg atgttttgtg acataccgat gtttatcgat gcatcgatgt ttgacagaac 120 atcgggtgtt aaatttcaca acattgcaga acatatgttt cagtgccaca cggaaaaatt 180 cgcgaatttg tttaatttat ttaccctgtg tttgcaatgc caaagtaagt aaattgtgga 240 aaatatatat aaatatagtt atttgcaatt acaaatgtat ttattctaga ataacaacaa 300 ggacaagctt ttgttggagg cactttacaa aaattgatgg aaatagcgca aaatgcaaca 360 tttgcgacaa agttttgaag actgcgggta acacgaccaa catgaaggac cacctgcggt 420 cccttcataa aatagatcaa aatgttgaag tgcctgcagt acctaatatt agtggtgtgg 480 cacaaatttc tattgctgaa agctttcaga atatggcgga atatggcacc gacgggaata 540 aaacaaaaat aatcaacgat gccattattt atatgatatc aaaagatgtc caaccatttt 600 cgatagtcga aaacaaaggc ttcatacatt taatgaatac ggttgcaccg cgttacaaga 660 taccgtcgcg atatttaatt acaaaatggg tggatgagaa gtttttggaa atgaaggaaa 720 tgtggaagag attgctaaat gggaaaacgc taacattaac aatggatgta tggagcgacc 780 agatgtcaat gcggagttac ttgggcatta ctgctcattt tcagcttgag cttgaaatga 840 cttctttgac tatcggagtt ttggagctaa gcgagcgtca cacatcagta tacctcactg 900 aaatgctgga gcgttgctgc aaagattggc acattgaaaa ggaattagtg acagccatcg 960 ttaccgacaa tggggcaaat gtcgtcaaag ccgttgacct aggttttggg cgaaaaaagc 1020 acgtaccatg ttttgcacat accctaaatc taattgcaag atctgctatg caaagagatt 1080 caataagtgc atgcattgaa aaggtaaaag gaattgtaac tttttttaag cagagttgcg 1140 ttgccagcga caacttaaga aaagtaacgg acaaggtatt gatccaagat gtggttacta 1200 gatggaacag cacatactac atgattgagc gttacttgga gcttaaaaat ttcgtaaacg 1260 atattgtctt aagtgtccgc aacgatgttg agatgctaaa tggaaatgaa ttacagttgc 1320 ttaataatat catgccaatg ctgcgccctc ttgaggaagc cacaaaaata ataggcgcag 1380 atacgtattg cacagccagt atggttatcc caatggtgaa tatcttaaaa acaaaattgc 1440 ttaacgttga tgacatcacg cccgaggcaa atgacattaa gaacttcttg cttttcgaaa 1500 tcgataagcg tatgggttca atagagcagg tatgcttcaa atgtcatcat taatttattc 1560 cttaaaaaat tttattttaa atgtattgtt tcaggtctca attctggcaa tggctacagt 1620 acttgatcca agatttaaga aactgcattt taaagaccct caggcatgct caaactctat 1680 tacaaaaatt aagtctatga taaaaccgat ggtccaagac tacaatagta atgataacgt 1740 gacaagcaca actaaatgca attctttttg ggaccaccat caccagctgg cacagtctca 1800 tgagcctgat ggggatattg atgtggagat gatagcgtac ctgcgaatgc cattagcgtc 1860 atttgaaagc aaccctctac aagtttggga aggtatgaga aacacatatc ccaatttaca 1920 taaaatggct ctgcaatttt taccggtagt tggatcgtcg gttccttctg aacgggtttt 1980 ttcagcggca tcatatattt taagccaaag gagaaacagg ttagagccaa accgactaag 2040 ccgcatttta tttttgcaaa gtatcgacaa aaaatacttt tttggaaaaa agtaattaca 2100 ataattaata ttgtttggaa tcataaaaaa ggactgctaa ggaagctttt attgttatat 2160 tattatttta ttcattttga ttctcaatgt aatgaaaaag actgctaaga aagtatagtt 2220 atattattat atttgattct caatgtaatg aaaaagactg ctaagaaagc atttgttata 2280 ttattattta atcacaagac tacgtaaaat tcgtgaattg atatagtttt aagttctatt 2340 aattatattg ttaagtaaac cactcaataa agaaataaaa aaaaaacgta gcatttattt 2400 aaaaggcttt taggaattga attaacgaaa aataacggaa ttggtagaaa cattgataca 2460 tcgatgttta tagccgatgt ttcacacaac atcgatttat cattttgcaa acatcgatga 2520 acatcgacat cgaaatttcg acccaaattt acatcacta 2559 // ID Jockey-3_CQ repbase; DNA; INV; 4427 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4427 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 114-114 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 146..1540 FT /product="Jockey-3_CQ_1p" FT /translation="MRQNKGKRKSSEDLVVTSVKRLNAKPANGNRKRKQPL FT LRSDSDSECEVNPPIPLTNSFGVLSETDDKEPSPRTEPSAVEKRVKAPPIV FT VTSVSDLASFRTQLKNCKETCNLKVSFQLGRRGECRLLTESLQDHQTFVGY FT LKNHKHNFYTYETKNARPFKAVLKGLSNDLSVDEIKNELKVLLGFAPSQVI FT PMKKKSNGNISRFGLTSQFYLIHFNRNEINNLKLLDKVQFLFHVRVKWEHF FT KKHGGNGQNLTQCRRCQAFGHGTDHCAMVPKCMVCGDSSHDKDNCPVKEVT FT QFKCANCGGNHKSNFWDCPIRKKVLDSRAKHQPKSKPKFSQSQVVPASLNQ FT TFVLSHSNNSRNTPTVEKLGNNNGISYANVVSGSSTNFKSSTNLSEIGQVP FT QISFENFSAGNALGSSDLGDVTFEKMTFLQNSLFGLIQTMSNATSMMEAIQ FT IGLKFANDVVLTLKFNHGSK" FT CDS 1530..4187 FT /product="Jockey-3_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MDLSNSINIMNFNARSLKAKENEFFNFLRVHNVHVAV FT ITETFLKTGTYLKSDPDYKVITNNRMNRNGGGVAIVIHRSMTYSTLRDFKL FT KVIESLGIELETSFGKIMIAAAYLPFQCTGENKNYFKGDLNKLTRHRSRFL FT IIGDFNAKHQSWNNSKVNSNGKILFRDCTSGLYSVLYPNGPTCFSSVRNPS FT TIDLVLTNQSQYCGPLVTHADFDSDHLPVTFSLSHEAVTRPNSSVFNYHKA FT NWDRYQHHIENNLNHDFVLETKADIDSALESLTNAILDARNIAIPKVQVKF FT DSPIIDDDLQLLIRLKNVRRRQYQRSRDPALKRIQKDLQKVIDHRFTLLRN FT EKFARDVEQIKPYSKPFWKLSKVLKKPQKPIPSLKDGDNILLTNGEKAQKL FT AQQFESAHNFNLNVLSPIENQISIEFQNIVEQEFSSDEVFNTDLNEIKSII FT KKFKNMKAPGEDGIFYILIKKLPEATLSSLVKIFNKCFDLAYFPSSWKNAK FT VIPILKPDKNPAEASSYRPISLLSSISKLFERIILNRMMTHINENSIFADE FT QFGFRLGHSTTHQLLRVSNLIRSNKSEGYSTGAALLDIEKAFDSVWHKGLI FT AKLKRFNFPIYIVKIIQNYLTDRTLQVCYQNSKSDQLPVRAGVPQGSILGP FT ILYNIFTSDLPDLPPGCQKSLFADDTSISAKGRSLRVITRRLQKSLDIFNS FT YLKEWKITPNAAKTQLIIFPHKPRADFLKPKSHHIIKMNEVNLKWEDQVKY FT LGLGFDKNLTYKDHIESIQVKCNKYIKCLYPLINRNSRLCLKNKLLIYKQI FT FRPAMLYAVPIWTSCCLTRKKKLQRIQNKILKMILKLPPWFSTSELHQLAE FT VDTLDVMSNKIIDAFRQKSLQSSAALIRSLYSL" XX SQ Sequence 4427 BP; 1448 A; 797 C; 787 G; 1393 T; 2 other; tttcaaaaca aaattcacag tctttttaaa gactacctga gttattttcc aaaattctaa 60 acagtctttt caagactaac cccacctgaa attttgttgt attttacaaa cgaagccatc 120 tttttaagag aactttgctt gcaaaatgcg tcaaaacaaa ggtaaacgca aatcttctga 180 agatttggtc gttacgtcgg tgaagcgttt gaacgctaaa ccggctaacg gaaacagaaa 240 aagaaaacag cctcttctga ggtctgattc tgattctgaa tgtgaggtca atcctccaat 300 tccattgaca aacagtttcg gtgttttatc cgaaactgat gacaaggaac cttctcctcg 360 tactgagcct tctgccgtcg agaaacgagt aaaggctccg ccaattgtwg tgacttccgt 420 ctccgatttg gccagctttc gaacgcaact gaagaattgc aaggaaactt gcaatttgaa 480 agtttcgttc cagcttggtc gaagaggaga atgtcgcttg ttgacggaat ctttacaaga 540 tcaccaaact tttgttggtt atttgaaaaa ccacaaacac aatttctaca cgtatgagac 600 caagaatgct cggccattca aggcggtctt gaaaggtctc tccaacgact tgtcggtgga 660 tgagatcaaa aacgaactta aggtgttgct tggctttgcc ccatcccaag taataccaat 720 gaagaaaaaa tcaaacggga atatttctcg ctttggtttg acttcacaat tttatctgat 780 tcatttcaac agaaatgaaa tcaacaattt gaaacttttg gacaaagtwc agtttttgtt 840 ccatgtacgg gtaaagtggg agcattttaa gaaacatggc ggtaatggcc agaatctgac 900 ccagtgccgg cgttgccagg cattcggtca cggtactgat cattgcgcca tggttccaaa 960 atgcatggtt tgcggggatt cttctcacga caaggacaat tgtcccgtga aagaagtcac 1020 ccaatttaaa tgtgcaaatt gtggtggaaa tcacaaatca aatttctggg attgccccat 1080 cagaaaaaag gttttggatt ctcgtgctaa gcatcagccg aaatccaaac cgaaattttc 1140 tcaaagtcag gttgtacctg catctttaaa tcaaacgttc gtgctgtctc actcgaacaa 1200 ttctagaaat acccctaccg tggaaaagtt aggtaacaac aatggcattt cttatgctaa 1260 cgtcgtttcg ggttcatcca cgaattttaa atcctctacc aatctttctg aaattgggca 1320 ggtacctcaa atctcatttg aaaatttttc tgctggcaac gctttgggat cttctgatct 1380 cggcgatgtt acgtttgaaa aaatgacttt tttgcaaaac tcactgtttg gtttgattca 1440 aacaatgagt aatgctacat ccatgatgga agcaatccag attggattaa aatttgcgaa 1500 tgatgttgtt cttaccctga agtttaatca tggatctaag taattccatc aatattatga 1560 attttaatgc tcgctcttta aaagcgaaag aaaatgaatt tttcaacttt ttacgagttc 1620 ataacgtgca tgttgctgtt ataaccgaaa catttttaaa aactggcact tatttgaaaa 1680 gtgatccaga ttataaagtt ataactaata accgaatgaa tcgaaatggc ggtggagttg 1740 caatagttat ccaccgtagt atgacttata gcacgttacg tgactttaag ttaaaagtta 1800 ttgaaagttt gggcattgaa cttgaaactt cttttgggaa aattatgatt gcagctgcat 1860 atttgccatt ccaatgcact ggggaaaata aaaattattt caaaggggat ttgaataaac 1920 ttactcggca tagatctcga tttttgatca tcggtgattt taatgccaaa caccaatctt 1980 ggaataattc aaaagtaaat tccaatggta aaattctgtt cagagattgc acttctggtc 2040 tttattcggt tttatacccg aatgggccaa cttgcttttc ttctgttaga aatccatcaa 2100 caattgattt ggttttgaca aatcaaagtc agtattgtgg tcctttagtg actcatgctg 2160 attttgattc tgatcacctt ccagtaactt tttcactttc tcatgaagca gttaccagac 2220 ccaatagttc tgtgtttaat taccacaaag ctaattggga caggtatcag catcatattg 2280 agaataattt aaatcatgat tttgttttag aaaccaaagc tgatattgat tcagccttgg 2340 aatctttaac taatgcaatt ttggatgcta ggaatattgc tattcctaaa gtccaagtca 2400 aatttgattc tcccattatt gatgacgatc ttcagcttct gattcgtctg aaaaatgttc 2460 gccgaagaca gtatcaacgt tctcgtgatc ctgcactgaa gcgaattcaa aaagatttgc 2520 aaaaggttat tgaccacaga ttcactctcc tgcgaaatga aaagttcgca agagatgtcg 2580 aacaaattaa accttattcc aaaccttttt ggaaactttc aaaggttctt aagaaacctc 2640 aaaaacccat cccttcttta aaagatggtg ataatattct attaactaat ggggaaaaag 2700 ctcaaaaact tgctcagcag tttgagagtg ctcataattt caacttgaat gttttgagtc 2760 ctattgaaaa tcaaatttca atagaatttc agaatattgt tgaacaagaa ttttcatcag 2820 atgaagtttt taatacggat ctgaatgaaa taaaatctat tatcaaaaaa tttaaaaata 2880 tgaaagcccc tggtgaggat ggcatttttt acattttaat taaaaaatta ccagaagcaa 2940 ctttaagtag cttggtcaaa attttcaaca aatgttttga tttggcatat tttcccagta 3000 gttggaaaaa tgccaaagta attccgattt tgaaaccgga taaaaatcct gctgaagcct 3060 caagctatcg gcccattagt ttgctttcat ctattagtaa attattcgaa agaataattc 3120 ttaatagaat gatgacgcac attaatgaaa attcaatttt cgctgatgag cagtttggat 3180 ttcgccttgg gcattcaact actcatcagt tgttgagagt ttcaaattta attcgaagca 3240 acaaatctga gggctattct actggcgctg ctcttctaga catagaaaaa gcatttgaca 3300 gtgtttggca taaaggtttg attgcgaaat tgaaaaggtt taattttccg atttatatcg 3360 tgaaaattat tcaaaattat ttgacggatc gtactctgca ggtatgttat cagaatagca 3420 aatctgatca actacctgta cgtgctggcg tccctcaagg aagcattttg ggtccaattt 3480 tatacaatat ttttacttct gacttgcctg atttgccccc aggatgtcag aaatcacttt 3540 ttgctgatga cacaagcatc tccgccaaag gcagaagcct tcgtgtcatc acaagaagat 3600 tacaaaaaag cttggatatt ttcaattctt atttgaaaga atggaaaatt actccaaatg 3660 ctgcaaaaac tcaacttatt attttccctc acaaaccaag ggctgatttt cttaaaccaa 3720 aaagtcatca cattataaag atgaatgagg taaatttaaa gtgggaggat caagtgaaat 3780 atcttggact tggttttgac aaaaacctta cttacaagga tcacattgaa agtatccagg 3840 ttaaatgtaa caaatatatt aaatgtttgt atccacttat aaacaggaat tctagacttt 3900 gtctcaagaa taaactgtta atttataaac aaattttcag acctgccatg ctttatgctg 3960 tgccgatctg gacaagctgt tgcttaacca ggaagaaaaa acttcagagg attcagaaca 4020 aaattctgaa aatgattctg aaacttcctc cctggttcag caccagtgaa cttcatcaat 4080 tagccgaagt tgacactttg gatgttatgt ccaataagat aattgatgca tttcgacaaa 4140 aatcattgca gtcttcagct gcattgatcc gctctttata tagtttataa gttagtttta 4200 aggtatccct tttccctttt gtacatgtag gacctcctac atttgaaatc actgaatagc 4260 gaaagctaca atatttcatg aataaatgaa agttgctagt atttaaaatt gaggtgaaaa 4320 gtcatcgttt gtgattggac actcaataat attttaactg aatgaatgta catggaaaag 4380 aaatttgaat aaatataaat taaaaaaaaa aaaaaaaaaa aaaaaaa 4427 // ID BEL-16_CQ-LTR repbase; DNA; INV; 502 BP. XX AC AAWU01032461; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_CQ_; KW BEL-16_CQ-I; BEL-16_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-502 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 186-186 (2011). XX DR Genome; AAWU01032461; Positions 819 1320. XX SQ Sequence 502 BP; 151 A; 112 C; 92 G; 147 T; 0 other; tgttacgcgc cgaagcgctt aacgcacctc ctgatcagtc aatgcagttg tcacctgcat 60 tcgactgtca atctgacttg agcgctggac caaaagtcat ccgaattggt taagaagcag 120 aagtgaaatc catgtgtgct gcttcacagc gatattatct ttaaaattta ctaatttctt 180 taattttctg ttctcagaac ggtaatttat tcagtgcaac cacttggggt taaaatctaa 240 tgaattttgc acctttttcc tcaattagct ttatgtgaaa ttacataacc taaaatccgt 300 tgaaccatga gagatgatca tgagttcagc caaaccaagc tcgaccggat ttgccgattg 360 gccactgaat tgcattaggg agcaaaaacg aatttattaa tctaccaaat ataccgacta 420 agacatgcaa acagctttgt tttaatgtcc ccggactgtg attaataaac cggatgaatc 480 ctcccgttgg ccacagccaa ca 502 // ID BEL-50_CQ-I repbase; DNA; INV; 5932 BP. XX AC AAWU01014601; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-50_CQ_; KW BEL-50_CQ-LTR; BEL-50_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5932 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 253-253 (2011). XX DR GenBank; AAWU01014601; Positions 28225 22294. XX CC Positions [4940-5521] - Integrase core CC 'CTATT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 464..5905 FT /product="BEL-50_CQ-I_1p" FT /translation="MWRSFLGLNTEPKVEPEEGEESEQEASVQKKPEPEDR FT KPVVVQLPLTPKQRHTESKRKMADEEVQALLNRRGHVKGKLTRVRTVLGQP FT GIPAARIVVCKANAEKYYGEYNSLNNNIMDARLSEEQRTENAERFLDFEQL FT YDEVLEKIVELTAPPNRAQAVVPAGQAGQQVIVQQTPLRAPIPTFDGQYEN FT WPKFKSMFLELMKNSPDSDAIKLHYLDKSLVGAATGMIDTKTLQDNNYAHA FT WEILEDRYENRRLIIDIHINGILQLKRMAKKSSKELRELYEECSRHVENLR FT YHKQELLGVSELFVVNILSSCLDRETREQWEATVKKGELPKYAQTIEFLKQ FT RCAILERCELAAPASYAVRSAVPKAVQSSKPSAKITSAAATSNEAECDLCD FT GSHANYKCGSFRGMSVAERWSKVRDARLCYNCLRKGHRVGQCPSDRSCKCG FT EKHHSLLHQEKNQQQTKPKPEAAPVSAKAPSAVSSASVKAGPSDTGAPEVN FT DDNEQQATSCFSSGALKSPQQVLLQTAIINVADKAGRFHPCRTLLDSGSQA FT HILSEAMARTLGLTFEKCNVTVVGANAVKTQAKKGVNLTFSSRYCEFQDNI FT SCLISDKPTGRIPSARIETAAWHIPSDVFLADPKFSEPHEIDLVMASNYVW FT DLLRTDRVKLANGTVTLRETDLGWIVTGVYDPFQQLSCQVVHSNITLKDQL FT NNAIEKFWLVEELADKQLQTNEEQEVEEHFRGTHGRDGSGRFVVQLPFKDE FT VLELGDNRVQALRRFNLLERRLSRNPELREQYTAFVEEYEALGHCKEISEH FT QDSPEVRKYYLPHHAVLKPSSSSTKLRVVFDASAKAGKYSLNDVLKVGPTV FT QNDLFSIVLRFRRHPVAFTADVVKMYRQVRVHPSDTPYQRIFWRKDPNERL FT RVLELNTVTYGTASAPFLATRCLVEQAESAKGRFPAAAKIVIEDMYVDDML FT SGASNEKEAVKLVLEVKKILEEGGFPVKKWCSNSEQVLKCVPEEDREKPKP FT IEEYSANEAIKALGVLWDPREDEFRFVNCLEECDDKPVSKAKVFSDILKVF FT DPLGFVAPVIVLAKVFMQKLWASKMDWATELNEELLCEWFEIRESLTALND FT IRVPRCVVVPNTVLHELHGFADASTVAYGANVYLRCIRGNGSAEANLLCAK FT SRVAPLKKLSVPNLELCAALLLARLVVKVIETLDLKLSAIKLYSDSQIVLA FT WLKKDPNLLETFVRNRVIQIFNLTGKFRWNYVRSADNPADIVSRGMSPKLL FT SRCPKWWKSLLPLTSAAYVPEEPPELEDDELPGLVAIVYKVTVVEEMPVFK FT RFGEFRKLQRVICYVRRFVQNCRNKKSGQPKTTCAYVTVPETRAALKSIIW FT SIQMTVLPDEFYLAKTEQTSSQLASLAPIVDEDGLLRVGGRLEHSSLPYEA FT KHPVILPRHHVTTLLIRALHVENLHVGPSGLLAIVRQQYWPLKARNVVRDV FT TRKCLQCFKANPRRIEQFMGQLPAERVNVAAPFEYTGVDYAGPVTVKQGKY FT RPKLTKAYIAVFVCLVTKNIHLELVSDLTTEAFLAALDRFVNRRGMVRKIF FT SDNATNFVGASRSLREMHELFKQDLLKRGIQDLLVPKAIEWSFIPPRAPNF FT GGLWEAGVKSVKTHLKRTLQNAVLTFEEFATILTHVEAVLNSRPLFSLSDS FT PGDPLPITPAHLQLGRPLRPVAKPSRSGVADNRLNRWEYLDKLREDFWGRW FT SREYLTSLQSRAKWTKKSHNLQPGMLVLLVEDNLPSQTWKVGIIVETYPGM FT DALVRVVDVKTGSGAVFKRSVSKLAPLPTVDNDRLLERFGASD" XX SQ Sequence 5932 BP; 1393 A; 1582 C; 1826 G; 1131 T; 0 other; tttggtcctt acgaaccgga tagacagtgc ctatcggtaa gtccgtcgga atcggagaat 60 tccggagtgt cgcgaggagt gaagtgtccg gaagtcgctt atcccggagg gcgtttcacg 120 tggcccagac agcgtgaaag tgacgcggtt gtgtggaggt tccagcaacg cgcaaacagt 180 gcaccggaag aggtcggatt ggccgccggt agtgcgaagt gtggacagtg gtcgcgattg 240 gccaccgaag cagaacattg cgtggacggt agcgttgcta ctcgagcaga aaagtgcgtg 300 gaaggtggtc cgaaaaaacc actggagcaa aaagtgaaaa gtgttcggtg aaaagagcgt 360 gggccgccat cttgaaaacg ccagcaagag agtgttcggc gagaaaagag tgtgcgccgc 420 catcttggaa cccaaaaaag tgaagtgagc tgcatcccgg cggatgtggc gtagtttcct 480 ggggttgaac accgagccga aagtagaacc ggaagaaggc gaggaatctg agcaagaagc 540 cagtgttcag aagaaaccgg aaccggaaga ccgtaagccc gtggtggttc aacttccgct 600 gactccgaag caacggcaca cggagagcaa gcggaaaatg gcggacgaag aagtgcaggc 660 gttgctgaac aggcgcggac atgtgaaggg caagttgacg cgggtcagga ctgtcctcgg 720 ccagcctgga atcccagcag cgcggatcgt agtctgcaag gcgaacgctg aaaagtacta 780 cggtgagtac aacagcctta acaacaatat catggacgca aggttgtcgg aagagcagag 840 gaccgagaac gccgaacggt tcttggactt cgagcagctc tacgacgagg tgctggagaa 900 gatcgttgag ctcacagcac caccgaatcg tgcccaagcg gtcgtccctg ccggacaggc 960 tggtcagcag gtgattgtcc agcagacgcc cttgcgtgca ccgattccaa cgtttgacgg 1020 ccagtacgag aactggccga agttcaagtc gatgttcctg gagctgatga agaactcgcc 1080 agactcggac gccatcaagc tgcactactt ggacaagtct ctggtgggtg cagcgacggg 1140 catgatcgac acgaagacgc ttcaggacaa caactatgcg cacgcgtggg agatcctcga 1200 ggaccggtac gagaaccgac gcctgatcat cgacatccac atcaacggga tcttgcagct 1260 aaaaaggatg gcgaagaagt catcaaagga gctgcgagag ctgtacgagg aatgttcgag 1320 acacgttgag aacttacggt accacaagca ggagctgctg ggtgtatcgg agttgttcgt 1380 cgtcaacatt ctgtcgtctt gcttggatcg tgagacacgc gagcagtggg aagctaccgt 1440 caagaagggt gaactcccga agtatgcgca gacgatcgaa ttcttgaagc agcggtgcgc 1500 catcctggaa cggtgcgagt tagctgcccc tgcgtcgtat gcggtccggt cggcagttcc 1560 caaggcggtg cagtcctcga agccatcggc gaagatcacc tcggcagccg cgacatccaa 1620 cgaggccgag tgcgatctgt gtgacgggtc tcacgcgaac tacaagtgtg gctcgtttcg 1680 cggcatgagt gttgcggaga gatggtcgaa ggtgcgtgac gcgaggctgt gctacaactg 1740 cttgcgcaag ggtcaccgag ttggacagtg tccgtcggat cgatcctgta agtgcggtga 1800 gaagcaccac agcttgttgc accaggagaa gaaccagcag caaaccaagc cgaagccaga 1860 agcggctccg gtatcagcga aggcgccgtc ggcagtgtca tccgccagcg tgaaggctgg 1920 tcccagcgac acgggtgcgc cggaagtgaa cgacgacaac gagcagcagg caacctcgtg 1980 cttcagcagc ggagcgctga agtcaccgca gcaggttctg ctgcagacag cgatcatcaa 2040 cgtggcggac aaggccggca ggttccatcc gtgtcgaacg ttgttggatt ccggttccca 2100 ggcgcacatt ttgtcggagg cgatggcacg aaccctcggt ctaaccttcg aaaagtgcaa 2160 cgtgacggtc gttggagcca acgcagtgaa gacgcaagca aagaagggag tcaaccttac 2220 cttctcgtcg aggtactgtg agttccaaga caacatctcg tgcctgattt cggacaagcc 2280 aacgggacgg atcccgtcgg caagaatcga gacagctgct tggcacatcc cgagtgacgt 2340 gttcctggca gacccgaagt tcagcgagcc gcacgagatt gacctggtca tggcgtcgaa 2400 ctacgtgtgg gacttgctgc gaacggatcg agttaaactg gcgaacggca ccgtgactct 2460 gcgtgaaacc gatctgggct ggatcgtgac cggcgtgtac gacccgttcc agcagctgag 2520 ttgccaagtc gtccactcga acattacgct gaaggatcag ctgaacaacg ccatcgagaa 2580 gttctggctg gttgaagaac tagcagacaa gcaacttcaa accaacgaag aacaggaagt 2640 ggaagagcac tttcgtggca cccacggccg tgacggaagt ggtcgcttcg tcgtccagtt 2700 gccgttcaag gacgaggttc ttgagctggg cgacaaccga gtccaggcgt tgaggagatt 2760 caacctgctg gagcgccgtc tgtcgcgcaa cccggaactc cgagagcagt acacggcgtt 2820 cgtcgaggag tacgaggcgc tgggtcactg caaggagatt tctgagcatc aagattcacc 2880 agaagtgcgt aagtactacc tgccgcacca cgctgtgctc aaaccgtcca gttcgtctac 2940 gaagctgcgg gtggttttcg acgccagcgc caaggccggc aagtactccc tgaacgatgt 3000 gttgaaggtg ggaccgaccg tccagaatga cctgttctcc atcgttctgc ggttccgtcg 3060 ccatccggtc gcattcactg ccgacgtggt gaagatgtat cggcaggtgc gtgtgcatcc 3120 gagcgatacg ccgtaccagc gaatcttctg gaggaaggac ccgaacgagc gcttgcgagt 3180 gcttgagctg aacaccgtga cctacggaac ggcgagcgcc ccgttcctcg caacgcggtg 3240 tctggtggag caggcggagt cggcgaaagg ccggttccca gcagccgcca agatcgtgat 3300 cgaggacatg tatgtcgacg acatgctgtc cggagccagc aacgagaagg aagcagtgaa 3360 acttgtgctg gaagtgaaga agatcctgga agagggaggt ttcccagtga aaaagtggtg 3420 ctctaactca gagcaagtgc tgaagtgtgt gcctgaagaa gatcgggaga agccgaaacc 3480 gatcgaagag tacagtgcga acgaagccat caaggctctg ggtgtcctgt gggacccacg 3540 cgaggatgag tttcgttttg tgaactgttt ggaagagtgc gacgacaaac cagtttcgaa 3600 ggcgaaagtg ttttcggaca ttctgaaagt gtttgacccg ctgggcttcg tagccccagt 3660 catcgtgctg gccaaagtgt ttatgcagaa gttgtgggcc agcaagatgg actgggctac 3720 cgaactaaac gaagagcttt tgtgtgagtg gttcgagatt cgagagtctc taacagcgct 3780 gaacgacatt cgtgtgccac gatgcgtcgt ggtgcccaac acggttctgc acgagctgca 3840 cggattcgct gacgcgtcga ccgttgccta cggggctaac gtctacctgc gctgcatccg 3900 cgggaatggg tcggccgagg ccaacctgtt gtgcgcgaaa tcccgagtgg ctccgctaaa 3960 gaagctgtcc gtgccgaacc tggagctgtg tgcagcgctg ttgcttgcac ggctcgtcgt 4020 gaaggtcatc gaaacgctgg atctgaagct gtccgctatc aagctgtatt cggacagcca 4080 gattgtgctc gcatggctga agaaggatcc gaatctcctg gagaccttcg tgcgtaaccg 4140 cgtcatccag atcttcaacc tcacaggtaa gttccgatgg aattacgtga ggtcagccga 4200 caatccggca gacatcgtct ctcgagggat gagtcccaaa ctcttgagtc ggtgtccgaa 4260 gtggtggaag agtcttctac cgctgacgag cgctgcctac gttcctgaag aaccccccga 4320 actcgaggac gacgagctgc cagggctggt agccatcgtc tacaaagtca ccgtcgtcga 4380 ggagatgccg gtgtttaaac ggttcggcga gttccggaaa ctccagcgcg tgatctgcta 4440 cgtgcgcaga ttcgtccaga actgcaggaa caagaagtct ggacaaccaa agacgacgtg 4500 cgcctacgtg acagttcctg aaacgagagc ggcgttaaag tccatcatct ggtcgatcca 4560 aatgacagtc ttaccagatg agttttatct tgcgaagacc gaacaaacat ccagccagct 4620 cgcgagcctg gcaccaatcg tcgacgagga cggtctgctg agagtcggtg gacgtctgga 4680 acattcgagc ctcccgtacg aggccaagca tcccgtgatt ctgccccgtc atcacgtgac 4740 tacgttgctg atccgagcgc tgcatgtgga gaacctgcat gtcggcccct ccggactcct 4800 ggccatcgtt cgacagcagt actggccgct gaaggcgcgc aacgtagtgc gcgacgtaac 4860 tcgcaagtgc ctgcagtgct tcaaggccaa cccacgcagg atcgagcagt tcatgggaca 4920 gcttccggcg gaacgagtga atgttgcagc ccccttcgag tacaccggcg tggactacgc 4980 tggacctgtg acggtgaaac aaggcaagta cagacccaaa cttaccaaag cctacatcgc 5040 cgtgttcgtc tgtctggtca ccaagaacat ccacctggaa ttggtgtcag acttgacaac 5100 cgaggccttc ttggccgccc tggatcgctt cgtcaaccga cgaggcatgg tgcgcaagat 5160 cttctctgac aacgcgacga atttcgtcgg ggcgtcacga tctttgcgcg aaatgcacga 5220 gctcttcaag caggaccttc tgaagcgcgg aatccaagat ttgctcgtgc ccaaggccat 5280 cgagtggagc ttcatcccgc ccagggctcc caacttcggc ggattgtggg aggcgggtgt 5340 gaagagcgtc aagacacacc tgaaacgaac gctgcagaac gcggtgctta ccttcgaaga 5400 atttgctacc atcttgacgc acgttgaagc ggtcttgaac tctcggccgc tcttcagctt 5460 gtcggacagc cccggagatc cccttccgat aactccggcg catcttcaac tcggcaggcc 5520 gctgcgtcct gtcgccaagc cctcccgtag cggtgtcgcg gacaaccgtc tgaaccgctg 5580 ggagtacctg gacaagcttc gtgaggactt ctggggccgc tggtccagag agtacctgac 5640 aagtctgcag agccgtgcga agtggaccaa gaaatcgcac aatctgcagc ccggaatgct 5700 ggtcctcctc gtcgaggaca accttccgtc gcagacctgg aaggtcggca tcatcgtcga 5760 gacttacccc ggaatggacg cgctggtgcg tgtcgtggac gtgaaaaccg gttctggagc 5820 agtcttcaag cggtcggtat ccaaactggc acccttgccg accgtggaca acgaccggct 5880 tctggagcgc ttcggtgctt cagattagct ggaatgcagc cgcgggggag aa 5932 // ID CR1-67_AAe repbase; DNA; INV; 4996 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-67_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4996 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1155-1155 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 25 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 392..1183 FT /product="CR1-67_AAe_1p" FT /translation="MACNKCDKVVNDSDLITCRGYCGNSFHMICMKLDYSV FT REVLNANKKNMFWLCDSCSELFSNDQFRKISSRHINDMPEDTPLKSIKDDI FT AGLKQIVSALSSKIDAKPLTPVFNSWRKMDGIQTIPNTPKRIREDVQSTIK FT PSIIRGSKAASEMVKTVRPPEDLFWLYLSAFEPSTSDSEIVAFVKECMSST FT EVDPKVVRLVAKNKDPSTLSFVTFKVGVPKTLKDVALSSETWPDNIYFREF FT DNNSKNQRRLVRITAERSPIVVT" FT CDS 1123..4929 FT /product="CR1-67_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="FKKPTSFSQDYCREEPDCGDITSCWIPGRTLALRFME FT ASDPLSTVEPFLPATISRPGPVSEYGDEVFQNPVSGKYSVYSSNSLPDMTT FT ASSTTTSLAISSLRTPGRKLSISPKEASHSPSTVEPFLPSTCSRPGPVFGI FT DEEVFRIPPSGKYDTSATVPLSEKFIASSQASSESAPVLLATTTTTSPAVS FT SFRTPGRTFSTSPKEASHSPSTVEPFLPSTCSRPGPVFGIDEEVFRIPPSG FT KYDTSATVPLSEKFIASSRASSEAVPEHSSSISGITLATTDTTTSLPSSSG FT SLLTGIEEHVNTRSKWGGITVYYQNVGGLNSSIQDYHLACSDCCYDVIVLT FT ETWLNDNTRSSQIFGASYEVFRCDRNPENSRKATGGGVLIAVRKEFEASAV FT DDQHWRNAEQVWVRIKLGNHNLYMAAVYIPPDRIRDDSVYDALIRSATSIS FT SNATVCDEFVILGDFNLPGVKWKHTSNGFLSIDTEQSSYPLSANALIDSYS FT FANMRQINDCVNENGRTLDLCFASLQDVAPDISIAPNVLVKFVPHHPPLLL FT MLNNNELVTLTETVESVYYDFNKADYESISNVLHSIDWYSILDHENVEGAT FT QTFSNVLSYVIDRHVPKKAAKTSEHAPWQSRELRRLKREKRAALRNYSKYK FT TLQLKEYYLQINGLYKNMSKACYSAYLRNLQASMKANPKVFWNYVRCQKKE FT NGLPSVMKLNDIEGSNELEHCQLFATKFSGVFSNEHLETEQIIAAASNVPL FT SGLSLNAVNVSNAMIIEAASRLKSSSSCGPDGIPSVLVKKSIENLVTPLQH FT IFQLSLNTGTFPSLWKNAYMFPVHKKGDKSNVDNYRGISALCAVSKLFELV FT ILEPVFFECKHFISNDQHGFMPKRSTTTNLLCFTTYVIDGMAHGLQTDAIY FT TDLSSAFDKINHDIAIAKLSRLGFNGAVLNWFRSYLTGRLLNVKIGDHLSD FT SFAASSGVPQGSHLGPLLFILYLNDVNKLLEGPRLSYADDFKIFLKIAKPN FT DASFLQKQLLIFANWCDLNRMVLNAKKCAVVSFSRSQNPVFYDYSLSGERL FT SRESSMKDLGVILDSKMTFRQHVSYITEKASRNLGLIFRIGKEFRDVYCLK FT SLYCALVRSVLEYGSAVWNPCYLNGSDRIERVQRRFIRWALRNLPWQNRFE FT LPPYEDRCQLIGLDTLSCRRSVTCALCVSDVLSARIDCPAILRYLPFQARI FT RNLRNNSLFRIQFRRTNYSLNCGITGLLRIFNRVASVFDFHYSRATLKQLF FT VTTFKRFM" XX SQ Sequence 4996 BP; 1375 A; 1134 C; 1053 G; 1434 T; 0 other; aaaatttggc agcactgcga tcttgtatag tacaactttg tgtgcgcatt ttgttagcaa 60 atttcaccga tctgttaagt gtctaaattc atgatttttt gttgagtcga ttgattgtgt 120 tctcctttcg tgttaacaac ggtattgtgc ttgtatcttg tgaattgagc tgattattac 180 ggcttatcat ctgatcaact ggacgacaac gtagtttctt ctaacaccga cgcaaacatt 240 tttcgaacag gacgacatct tgcgtgaaaa attggaaaca gaacttgtgt gcaatattga 300 acggcagccg aagctcttta ctgtaaactc gctatctcgt tagtataaca atcaggcgtt 360 tcttctgatc gtcgcatttt caccaccaag gatggcttgc aacaagtgtg ataaggtagt 420 aaacgattct gatctcatta cctgccgtgg atattgcgga aattccttcc atatgatctg 480 catgaagctt gactattcgg ttcgtgaggt actcaatgct aataagaaaa atatgttctg 540 gctgtgtgat agctgttcag aattgttttc caacgaccaa ttcagaaaaa tatcttctcg 600 tcacatcaac gacatgcccg aagatactcc gcttaagtct atcaaggacg atatagctgg 660 attgaagcaa atcgtaagcg ctctctcttc caagattgac gcaaagccgc taacccctgt 720 gttcaactct tggcgtaaga tggatggaat tcaaactatt cccaatacac cgaagcggat 780 acgtgaggat gttcagtcaa ctatcaagcc atctatcatt cgtggttcca aagcagcgtc 840 cgaaatggtt aaaacagtgc ggcctcccga ggatctgttc tggctttatt tgtctgcatt 900 tgagcccagt acctcagaca gcgaaatagt agcctttgtc aaggaatgca tgtcatctac 960 tgaagtggat ccgaaagtcg taaggttggt tgccaagaac aaagatccat cgactcttag 1020 ctttgttact ttcaaagttg gtgttcccaa aacactaaag gacgtggctt tatcgagtga 1080 aacgtggccg gataacatct acttccggga gttcgacaat aattcaaaaa accaacgtcg 1140 tttagtcagg attactgcag agaggagccc gattgtggtg acataacttc atgttggata 1200 ccgggacgca cacttgccct acgctttatg gaagcctctg atccactcag tacagtcgag 1260 ccattcctgc cagcgaccat cagccgtccc ggtcctgttt ctgagtatgg tgacgaggtc 1320 ttccaaaatc cagtttcagg caagtattct gtttattcga gcaattccct tcctgatatg 1380 actaccgctt ctagtacaac gacatcactc gcaatatcat cccttcggac accaggacgc 1440 aaactttcca tcagccctaa ggaagcctct cattctccaa gcacagtcga gcccttcctg 1500 ccatcgacct gcagtcgtcc tggtcctgtg tttggaatcg atgaggaggt cttccgaatc 1560 ccgccttcag gcaagtacga cacatccgcg acagttccac tctctgaaaa gtttatcgct 1620 tccagtcaag catcttctga atctgcacca gtgcttttgg caactactac aacgacatca 1680 cccgcagtat catcttttcg gacaccagga cgcacatttt ccaccagccc taaggaagcc 1740 tctcattctc caagcacagt cgagcccttc ctgccatcga cctgcagtcg tcctggtcct 1800 gtgtttggaa tcgatgagga ggtcttccga atcccgcctt caggcaagta cgacacatcc 1860 gcgacagttc cactctctga aaagtttatc gcttccagcc gagcatcttc tgaagctgta 1920 ccagagcatt cgtcaagtat ttctggaatt acactagcaa caaccgacac aacgacatct 1980 cttccgtcat ctagtggttc tctactgacg ggtatagagg aacatgttaa tactcgtagc 2040 aaatggggtg gtattaccgt ctattaccaa aacgtcggtg gattgaactc ttcaatccag 2100 gattatcatc ttgcgtgctc ggattgctgt tatgacgtca tcgttttaac tgaaacctgg 2160 ttgaacgata acaccagatc gagtcaaatt tttggtgcta gctacgaagt ttttcgatgt 2220 gacagaaacc ccgaaaatag tagaaaggca acaggaggtg gtgttttgat agctgtgcga 2280 aaagaatttg aagcttcagc tgtcgatgac cagcattgga gaaacgcaga acaggtatgg 2340 gtccgaatta agctgggcaa tcataattta tatatggctg ctgtttatat tcctcccgat 2400 cgtatccgcg atgactctgt ttacgatgct ttaattcgat cagcgacgtc catctcgtcg 2460 aatgctactg tgtgtgacga attcgtcatt cttggcgatt tcaatcttcc tggcgttaaa 2520 tggaaacaca caagtaatgg gtttttatca atcgacactg aacaatcctc atatccactc 2580 agtgctaatg ccctgataga tagctatagt tttgcgaata tgcggcagat caacgattgt 2640 gttaacgaaa acgggcgaac tttggacctg tgtttcgcta gtcttcagga tgttgctcca 2700 gatatctcga ttgccccaaa tgttctagta aaatttgttc cgcatcaccc tccactcctt 2760 ttaatgttga acaataatga attggttaca ttgacagaga ccgttgagtc tgtgtactat 2820 gacttcaata aggcagatta cgagagtatc agcaatgtac ttcactcaat tgattggtac 2880 agtatacttg accacgaaaa cgttgaagga gctacgcaaa ctttttcaaa cgttttatct 2940 tacgttattg acagacatgt tcccaaaaag gccgcaaaaa catctgaaca tgcaccctgg 3000 caatctcgtg aattgcgcag gttaaaaagg gaaaaaagag cagccctacg aaattattct 3060 aaatacaaaa cgcttcaatt gaaagagtac tatttgcaaa ttaatggtct gtataagaac 3120 atgagtaagg cctgttattc tgcttatctg cggaatttgc aagcgtcaat gaaagcgaac 3180 ccaaaagtat tttggaatta cgtaagatgt cagaagaaag agaatggatt accatcagtg 3240 atgaagttga atgatatcga aggatcgaac gaattggaac attgtcagct ttttgccaca 3300 aagttttctg gggtcttctc caatgaacat ctggagacgg agcaaataat cgctgccgct 3360 tccaatgtac cactcagtgg cctttcgctg aatgcagtaa atgtaagcaa cgctatgatc 3420 atcgaagcag cctcccgcct aaaatcgtcc tcatcgtgtg gtcctgatgg tataccctcc 3480 gttctggtga agaaatccat cgagaacctg gtcactcctc ttcagcatat attccagctc 3540 tctttaaata ccggtacttt cccatcgcta tggaaaaatg catacatgtt tcctgtacac 3600 aagaagggcg acaagagcaa cgtcgacaac taccgtggca tttctgccct ctgcgctgtt 3660 tcgaagctgt tcgaattggt gatacttgaa cctgttttct ttgaatgcaa gcacttcatc 3720 tccaacgatc aacacggatt catgccgaag cggtcaacaa caacaaatct gctttgtttt 3780 actacttacg taatcgatgg aatggcgcac ggattgcaga cggacgccat ttatacggac 3840 ttatcctccg cgtttgacaa aatcaaccat gatatcgcta tcgccaaatt gtcgagattg 3900 ggtttcaacg gagccgttct aaactggttt cgctcttatt tgaccggccg tctactcaac 3960 gtaaagatcg gagaccatct ctcggacagc tttgctgcat cgtctggagt tccccaagga 4020 agccatttgg gaccgttact attcatcctc tatctgaacg atgtaaataa gcttttggaa 4080 ggacctcgcc tatcatacgc agacgacttc aaaattttct tgaaaattgc caaaccaaac 4140 gatgcgagtt ttttacaaaa gcagttactc atatttgcaa attggtgtga tctgaatagg 4200 atggttttga atgctaagaa atgcgctgta gtttccttct ctagaagtca gaacccagtt 4260 ttctacgatt acagtttgtc tggcgaaagg ttaagcagag aaagcagtat gaaagacctt 4320 ggtgtgatac tggactccaa aatgactttc aggcaacatg tttcctacat aactgagaag 4380 gcgtcaagga atcttggact catctttcgg atagggaagg aatttagaga cgtgtattgc 4440 ctgaaatccc tgtattgtgc cctagtccgt tcagtgctag aatacggatc agctgtttgg 4500 aacccttgct atctgaacgg atctgatagg attgaaagag ttcaacgcag atttattcgt 4560 tgggctttgc ggaatctacc gtggcaaaac cgttttgaac tcccacccta cgaagatcgt 4620 tgccagttaa ttggattgga cactttaagc tgccgaagaa gcgttacgtg tgctctgtgt 4680 gtttctgatg tattgtcggc gcgtatagat tgtcctgcta ttctgagata cctaccgttt 4740 caagcgagaa tccgaaattt gcgaaacaat tcacttttta ggattcagtt ccgccgcaca 4800 aactacagtt taaattgtgg aatcaccggt ctgctgagaa ttttcaaccg cgtagcttcc 4860 gttttcgatt tccactattc tagagctact ttaaagcaat tatttgtcac gacgtttaaa 4920 cgatttatgt aaacctagat taagttttat cattgggact accataaggt ctgttgataa 4980 taaataaata aataaa 4996 // ID hATm-25_HM repbase; DNA; INV; 4340 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-25_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4340 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1919-1919 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1344..3092 FT /product="hATm-25_HM_1p" FT /translation="MTKRKMTLGSKDLAYCRKLKKIEERRLRYLTFIEQQE FT NQTEKIEFLSSEQSEVSSCQDSDYSADKIKSGTSASTVLPKKFAKEVAITG FT VSKNISPRNMMHLLSDIVVTSGGSLTDFSLSESTIRRAKKKAISTSAAKYR FT EDIIKAASQSKFPIIAHFDGKILQDITNGTKQKRDRFAVLVNIDGDLRLLG FT IPPMENCTGKSHHDTLINILDEYKLRTYIKGLCFDTTSTNTGKYSGTNIRF FT CQTQGSIILQLACRRHVYELHCKHFWEKVHIGKTYAPENLMFKVFQANWNA FT TKETMDKSKFKKFDKAAGKGTYLEKQIEKTVQFCKYALANLVFPRGDYKEL FT VELTLTYLCPDVSFNVYAPGSVSHARFMAKSIYYLKIQVLSNQLPYQLTAC FT LQKEISKMAEFISIFYTVWFVRSSLVSSAPYQDLNAYWQMVQYKRWVESYD FT PGANDVLEGIEEVLKSMERHTWYLDQTLIPLCLADKELEDDEKEKVAKVLF FT SQPVPTHYEKFENPILPKDLDFTQEKPPSLSQLVGRNSWFIFSLLNLSEDD FT DKPWLNCPSSFWGFIDQFKTFIEFVCRHICCQRLL*" XX SQ Sequence 4340 BP; 1584 A; 657 C; 704 G; 1395 T; 0 other; gggtgcgtca tttttcaaca attttttcaa actaaatccc aaccacctgg attagttgac 60 cggttggata aaaatgattt tacgctaaaa aaaaattttt gaaacccatt tttaaggtcc 120 ccgcttttac aatttgtgcc ccaaaacccc aactttttta accaaaaagc ggggacctta 180 aacattgaca taaaacacca attttttttc tatgaactct aaaaggccta aatattaatt 240 tggaattggt tgggaagttt acaataaaaa aatatatact aatttaggtc aagaaaaata 300 gcttaaaacg cactaactta gtataatttt taagtatatt taaatagaat gacgcttgcg 360 atcgaattta actcacgtgt gtcacaaaca aaaacaatct ttctttttcg aactatttat 420 tctaaaaaaa ttagaaaaga ttaaaaaacg aatttgtgcc atacttttac ttaagctaag 480 aatattttta ttattataaa ctgcaattta taacttttac tttaaagatc aaaaataaaa 540 cgaagattta aaaaaaatta aagatgctac caaaaattac gagatcactt tcaaagacat 600 ttttatttaa aaaatgttag aaaaactttg tgaaacgcag tttgcacaaa tgaggaaatt 660 attcttcatt ttcattacag attaaaacat taaactgatt ataaattatc attcaatata 720 ttaaccaagt aattaaattt cagtttgaaa aacgaaatat ataaaatcta aatttaatct 780 taataactca ggcgtctgtt agacaaaaat actcccacca aaactttgat tggttgtgct 840 tcccaagata aaatgatgtc agcatgctct gaactagcaa tatgtcctca gaactgcatc 900 ctttttgcaa taaagaaacc gtggatcgat ggaggatatg acggcggcat atttcatgac 960 aaacacatca ggtttgctaa atgatctata attagagttt ttttattttt accgaatttt 1020 acttttgctt atattaattt gttgactatt ttcactctct gatatgaggc tacattaaaa 1080 agttttaaag caattagaaa aaagttaaat aaatattata tgttctagga aactattaaa 1140 tgatctagtt gaaaaatggg caacgttgaa aaaaaaataa aataaaaaaa ggcaacgcag 1200 cagaattagc tagaaacatc ttttaaaaaa atgctaagaa agttttttgg attggaaaac 1260 ctgatatttg ctccattttg aagaagaaac acggaaacaa cctagattca tacaattcag 1320 ataagcaatt tttaaatgat caaatgacga aacgtaaaat gacacttggt tcaaaggatt 1380 tagcatattg ccgaaagctg aaaaaaatag aagaaagaag attgcgatat ttaacattta 1440 ttgagcaaca agagaatcaa actgagaaaa ttgagttttt atcatcagaa caaagtgagg 1500 tctcctcttg tcaggattcc gactattcag cagacaaaat taaatcaggt actagtgctt 1560 caacagtatt acctaaaaaa tttgcaaaag aagttgcaat tacaggtgtt tctaaaaata 1620 tttcacctag aaatatgatg cacttgctct ctgacatagt agttacttct gggggcagtt 1680 tgacagattt ctcgctttct gaatctacta tcagacgtgc aaaaaagaaa gcaataagta 1740 caagtgcagc taaatatcga gaagatataa taaaagctgc aagtcaaagt aaatttccaa 1800 taattgcaca ctttgatgga aaaatcttgc aggatattac taacggtaca aaacaaaaaa 1860 gagatcgttt tgctgttctt gtaaatatcg atggtgattt aagacttctt ggcattccgc 1920 caatggaaaa ttgtactgga aaatctcatc atgatacttt aataaacatt ttggatgaat 1980 ataagctaag aacttatata aagggtctgt gctttgacac aacttcaact aatacaggaa 2040 aatattccgg cacaaacatt aggttttgtc aaacacaggg ttcaattata ttacagcttg 2100 catgtcgaag acatgtttat gagcttcact gtaaacattt ttgggaaaaa gttcacattg 2160 gtaaaactta tgctcctgaa aatctgatgt tcaaagtttt ccaagctaat tggaacgcaa 2220 caaaggaaac tatggataaa tcaaagttca aaaagtttga caaagctgca ggtaaaggaa 2280 cgtatctaga aaaacagatc gaaaaaactg ttcaattttg caaatatgca cttgccaatt 2340 tagtatttcc aagaggtgat tataaagaac tggtagaact tacactcacc tacttatgtc 2400 cagatgtaag ctttaatgtt tatgctcccg gatcagtttc ccatgctcgt tttatggcaa 2460 agtctatata ttatcttaaa attcaagttt taagtaatca gttgccctac caattgacag 2520 cttgtttaca aaaagaaatc agtaaaatgg cagaatttat ttctatattt tatactgtgt 2580 ggtttgtgcg ttcctcttta gtttcctctg caccttatca agatttaaat gcgtactggc 2640 agatggtcca gtataaacgt tgggtggaat cttatgatcc aggagctaat gatgtactag 2700 agggaattga agaagtattg aaatcaatgg agagacatac gtggtatttg gatcaaacct 2760 tgattccatt atgtctcgca gataaagaac tcgaagatga tgagaaagaa aaagttgcaa 2820 aagttctttt ctcgcaacct gttcctactc actatgagaa gtttgaaaat ccaatccttc 2880 cgaaagatct agacttcacc caggagaaac cacctagctt gtcacagttg gttggtagga 2940 actcgtggtt tattttcagt ctcttaaatt tgagcgaaga cgacgataaa ccttggctga 3000 actgtccttc atctttttgg ggctttatcg accaatttaa aactttcatt gaatttgttt 3060 gccgacatat atgttgtcaa cgactgctct gaacgagcaa ttaagcttgt gtcagagttt 3120 gttaattctg ttcacaatga agaggacagg caagatttaa tgttggccgt tcagcatagt 3180 agggataaat tcagggagtt cgaaaaagta caaactacgg agagtggcaa aaaaaaaagg 3240 cgctgacaaa gcatagtttg gagaatattt attcgatttt tcaccaaaaa cacgaagaaa 3300 agattgttga gttatagttt tacaaactgt ttttgaagaa aattttttct tgattattta 3360 ttactaattt atcactgttg accttaacaa tatcgttata tatatatttt ttagcagaaa 3420 atttattatt ttaccttaac tatctcagct acatgatatt cagcaaaata aaattgtttc 3480 attaaaacgc ttattttgca gcgcctatat gaaaatctaa tcaaactata tgtaaaaata 3540 aaactttaag gcagatggta cataaatatt ttgtaaaaaa aagtaaaata gatggaggag 3600 ggggagtgaa agaaacgaac tctccgccgc ccaccactaa ccctcctatg ttagaaatta 3660 ttttcctata ggggtcccta tattagaaat taattttaaa attgaatttt ggatcaaatt 3720 aatctttgta aacaagaatt tgataaaatt tgtcacggaa agtttgagag ataaaaaaat 3780 tggtcaatag atatatccat aaaattacct aaaattagaa cgtttcgccc atttttccag 3840 ggtcaaacca atacactttt taaaaacttg taagtaaagt atggcacgaa ttcgttttta 3900 aatcttttct aattttttta gaataaatag ttcgaaaaag aaagattgtt tttgtttgtg 3960 acacacgtga gttaaattcg atcgcaagcg tcattctatt taaatatact taaaaattat 4020 actaagttag tgcgttttaa actatttttc ttgacctaaa ttagtatata tttttttatt 4080 gtaaacttcc caaccaattc caaattaata tttaggcctt ttagagttca tagaaaaaaa 4140 attggtgttt tatgtcaatg tttaaggtcc ccgctttttg gttaaaaaag ttggggtttt 4200 gggcacaaat tgtaaaagcg gggaccttaa aaatgggttt caaaaatttt ttttagcgta 4260 aaatcatttt tatccaaccg gtcaactaat ccaggtggtt gggatttagt ttgaaaaaat 4320 tgttgaaaaa tgacgcaccc 4340 // ID Gyp3_Cis_LTR repbase; DNA; INV; 504 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gyp3_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-504 RA Smit A.F.; RT "Gyp3_Cis_LTR - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000031, Ci000391 3-4% div. XX SQ Sequence 504 BP; 120 A; 95 C; 79 G; 207 T; 3 other; tgtaacgcgc acgggggctc gtttgttttg ttatgatttt atacactctt atatatgtct 60 tttacacatt tttgtacacc ttttatacac ttttaacaac accattttat acaattttaa 120 tacactgttt tacgcctttt gtttgtcccc ttttatttta cacgtanttt acaccctttg 180 caccttttaa tcatttaatc atttttatct tgcttgccta tttttacacg cacgttttta 240 tctctcgggg tattcgagta atcaacgttc ggtgctgtgc taaattaacg tgatgtcttc 300 gaatttatgt ttctaaaaat cagttctctc ttggctttta aaccgtgcag acttgttttt 360 aatagctgat cgtttttatt agtgttatat ttgtatctta actatgagtg tttttataaa 420 gcgaaaacgt tgcaagccct tgtaattggt cngacggtag gccgttaaga cagcgtcctg 480 cgaaaccata ggaawtctgt aaca 504 // ID Waldo-1_AAe repbase; DNA; INV; 6341 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Waldo non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele5; KW Waldo-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6341 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6341 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (05-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as R1_Ele5. CC [2] Consensus update and characterization of target sequences. CC This consensus is generated from 30 sequences with >98% identity, CC and ~100% identical to the original sequence in [1]. 3' flanking CC sequences are all (AC)n microsatellites, and some are in tandem CC sandwiching (AC)n. Thus renamed as Waldo. XX FH Key Location/Qualifiers FT CDS 757..2163 FT /product="Waldo-1_AAe_1p" FT /translation="MEVSVIEVEEEVTNPFVKSGLVRTPPPQTQQVQKQPT FT NEQQAQQQGQQQYSVWTKPPQPPRVETAKKLVDELHEYVDKRSNVHKDIKA FT LVIKLQGALGSAVKEWKNVVQRAEAGEKELLAVKTALEACRAAEMRRAEAD FT TATREVNAARSIPPGVQSTPFFTPKRPRASPGDVRPGGPKRHKDVRVTGAK FT PPPEATDVVNSHTEWQVVGKKNRKKEKASKKKPPEKRKVVRTRNKSEALIV FT KASDDSYAEVLRAMRMNPDLKELGEDVQKVRRTRNGEMLFELKRDPKAGSI FT SYKELTEKALGNKVEVRALCPEATLLCKDLDEITTEEEVKLALKEQCELGE FT VQMTIRIRNGPDGTKVASIKLPVDAANKALRVEKVCVGWSVCPLSVSQQPD FT VCFKCLGFGHFARNCQGPDRSKLCRRCGEEGHKAKDCSKPPKCLICAATGD FT NEHPTGGSRCPASKRARATKSQWR" FT CDS 2118..5120 FT /product="Waldo-1_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPSLQTSESNEVPVEVVQINLNHCDTAQHLLRQSVAE FT HRCDVAIISEPYQIPPGDGNWISDGTKTAAIWTVGKYPIQEVVHCADEGFV FT IAKINGVFICSCYAPPRWTIEQFNRMLDKLTEELTDRRPVVVAGDFNAWAV FT EWGSRLTNPRGSSLLEALAKLNVDLANDGTTTTYRKDGRESIIDVTFCSPG FT MIRDMNWRVCEDYTHSDHQAIRYRFGRCLQMESSGAQIYERRWKTEIFNKD FT VFVEAMRRENNLVNLSAEELTAALSRACDATMPRMGKPRNCRRPAYWWNST FT IGDLRAHCFQARRRMQRASNNVEREERRLPYRAARAALNKAIKLSKKACLE FT ELYRNANENPWGNAYKVAMAKMKGPAIPPDRCPEKMKVIIETLFPTHEPTV FT WPPTPYDEQDVHDEETRVTNEELVVVSKALPVKKAPGPDGIPNLALKTAIQ FT ENPDMFRTTLQKCMEDGNFPDIWKRQKLVLLPKPGKPPGDPSAYRPICLLD FT TVGKLLERVILNRLTKYTENENGLSNMQYGFRKGRSTIDAIRMVVETMETA FT QKQQRRGNRYCAVVTLDVKNAFNSASWVAIADSLHRLRVPKYLCQILKSYF FT QNRTLIYETDAGIRNLLVTAGVPQGSILGPTLWNVMYDEVLKLNLPRGVKI FT VGFADDVVLVVIGESREEVEVLATEAIDAVENWMREKKLALAHQKTELVMI FT SNRKAVQNVSIMVGECIINSKREVKHLGVMLDDRLNFNSHVDYVCNKATKV FT ISALSRIMPNNSAITSSKRRLLASVSTSIIRYAAPAWSAALKTGRNRAQLN FT RTFRLMAMRVASAYRTISSHAVHVIAGMIPICLLLEEDSECYRDRATGRGR FT NRARANTLSKWQQQWNNAEKGRWTYRLIPNVSIWTTRAHGEINFQLTQFLS FT GHGCFKQYLHRFGHARSPLCPECRDIEETPEHVVFTCPRFAQQRSEMTAIV FT GDDVNVENIVLKMCSNEDKWNAVNRSVVQIMSTLQRTWREEQRQMT" XX SQ Sequence 6341 BP; 1789 A; 1457 C; 1876 G; 1219 T; 0 other; gcagcaggga caggagtcca ggggcttgac gaaccccccc tctgcgagtc gggtgggagc 60 ttaggaagca tttcttaagc gaaaaaactc catgcatccg tatggattac caccttttgg 120 cacttctctc gagaagtgac cgagctcctc ccagttagct caagcagagg atgcctagga 180 tgtggtgggg gttcaacagt gggctctgtt ggatctctgc aaaaagccac atatccgcaa 240 gcaactccgt acaagcggcc tggcaccgct ttcaaagtgc cttagcccaa ccattggagt 300 gccaaccggc acatcaggat cgatgccagt aacatcctga ttatggtata ctggtgaagg 360 cgttcaacaa cggacaagct acggatcacg gattggatgg cgttattggg ctgacagtcg 420 ggccattctg ctccctgatt aaggaagggt gacaccgacc tgaaacggcg agtgggctca 480 tggttcgatt gcctccgtcc cgttaaaacc ttggcaggcc tccggatacg ttcgaaatct 540 gtccatcagt tctgatgagt acgtaggttc ggatggaaca ttcccgatct acgccgatct 600 ggcactgtac ccggccccca taagggtccg tgtcaaccct cgcatggcct caatgcttgc 660 aggtatagtt gcttgcaggt acagataacc atgggatcgt gaaggcgact acgtacggcg 720 gatggagata gcggctcgga ggggggaccc tgaagaatgg aagtaagcgt aattgaagtt 780 gaggaagaag tgacaaatcc gtttgtcaag agtggattgg tgaggacacc gccgccgcaa 840 acccagcagg tgcaaaagca gcccacgaac gagcagcagg ctcagcagca gggtcagcag 900 cagtatagcg tgtggacgaa accaccacaa ccaccgaggg tagagacagc gaaaaagttg 960 gtggacgagc tgcacgagta tgtggataag agaagtaacg tgcacaaaga catcaaggcg 1020 ttggtgatta agctccaagg cgctcttggt tcagctgtaa aggagtggaa aaacgtagtg 1080 cagagagcag aagcagggga gaaggagttg ttagcggtta aaactgcctt agaggcatgt 1140 cgcgcagcgg aaatgcgaag ggcagaagcc gacacagcta cgagggaggt caacgcggcg 1200 agaagtatcc ccccaggggt gcagtccacg cccttcttta caccgaagag gccaagggca 1260 tcgcctggag atgtcagacc aggtggtccc aagagacaca aggatgtccg tgtcactgga 1320 gctaaaccac caccagaagc aaccgacgtg gtaaacagcc acactgaatg gcaggtcgtt 1380 ggcaaaaaga atagaaagaa ggagaaagcg agcaagaaaa aaccgcccga aaaacgtaag 1440 gtcgtcagga cgaggaataa gagcgaggct ctcatcgtca aagcgagtga cgactcgtac 1500 gccgaagttc tacgcgctat gcggatgaac ccggatctta aggagctagg ggaagacgtg 1560 caaaaggtca ggcgcactcg taatggagag atgctctttg agctgaaaag agatcccaag 1620 gcaggcagca tctcgtacaa ggagcttacc gagaaagctc tcggaaacaa ggtggaagta 1680 agagccttgt gcccggaagc gactctcctg tgtaaagatc tggacgagat taccacggag 1740 gaggaagtaa aattagccct gaaggagcaa tgcgagctag gagaggtcca gatgaccatc 1800 cgtatcagga atgggcctga tggcacgaag gtagcatcaa ttaagctgcc agtagatgca 1860 gctaataaag cgttgagagt ggagaaagta tgtgtgggct ggtctgtgtg cccgctgagc 1920 gtttcccagc aaccggacgt gtgcttcaag tgtctgggct ttggtcactt cgcacggaat 1980 tgccaagggc cggataggag caaactatgc agaaggtgtg gcgaggaagg tcacaaggca 2040 aaggattgct cgaagcctcc gaaatgccta atttgcgctg ctacggggga taacgaacac 2100 cctacaggcg gtagcagatg cccagcctcc aaacgagcga gagcaacgaa gtcccagtgg 2160 aggtagtgca aatcaacctc aaccactgtg atacggcaca acatcttctg cggcaatctg 2220 ttgcggaaca tagatgcgat gttgcgataa tttcggagcc ataccaaatt ccacccggag 2280 atggaaactg gatatcagac ggaaccaaaa ctgcagcgat atggacagtg ggaaaatacc 2340 ccattcaaga agtggtgcac tgcgcagatg aaggattcgt tatagccaaa atcaacggtg 2400 tcttcatctg tagttgttat gcacctccac gatggacgat cgaacagttc aaccggatgt 2460 tggacaaact gacggaagag ctaaccgacc gaagaccagt ggtagtagct ggagacttca 2520 atgcttgggc ggtcgaatgg ggcagccgcc tcacgaaccc aagggggagc agccttctgg 2580 aggccctagc taagttgaac gtcgatctcg ccaatgacgg taccaccacc acctaccgta 2640 aggatggtcg agagtctatc atcgatgtca ctttctgcag cccaggaatg atacgtgata 2700 tgaactggag agtatgcgag gattacaccc acagcgacca ccaagcgatt cggtaccgct 2760 ttgggcgctg tttgcagatg gaatcgagtg gagcccagat atacgagcgg aggtggaaaa 2820 cagagatttt taacaaggac gtgttcgtgg aagcgatgag gcgtgagaat aacttagtaa 2880 acctaagcgc agaggagttg actgcagccc tatcgcgggc gtgcgatgca actatgccga 2940 ggatggggaa acctagaaac tgtcgacgac cagcctactg gtggaattcg acgatcggtg 3000 acctacgcgc acattgcttc caagctagga ggaggatgca gagggctagc aacaacgtcg 3060 aaagagaaga aagaagattg ccctataggg cggcaagggc cgcactcaac aaggcaatca 3120 agctcagcaa gaaagcgtgc ctggaagagc tctaccgcaa tgccaacgaa aacccgtggg 3180 ggaatgccta caaagtggcc atggcaaaga tgaaaggtcc agcaatacca cccgacagat 3240 gtccggagaa aatgaaggtt attatcgaaa ctctttttcc gacgcacgag cctacggtct 3300 ggccacctac accgtacgat gaacaggatg tacacgacga agagacccgt gtaacgaacg 3360 aggaactggt cgtggtatcg aaagccttac cagtgaagaa ggcaccgggt ccggacggga 3420 ttccaaactt ggcccttaaa acggcaatcc aggagaatcc agacatgttc aggactacac 3480 tgcaaaaatg tatggaggac ggaaactttc ccgacatttg gaagcgacaa aagctggtgc 3540 tgctaccaaa gccaggtaag ccgcccggcg atccttctgc atataggccg atatgcttgt 3600 tggatactgt cggaaaactg ttggagagag tgattcttaa caggcttacg aagtatacgg 3660 agaacgagaa cggtctatcg aacatgcagt atggattccg gaaaggtaga tctacgatag 3720 atgccatccg aatggtagtg gaaaccatgg agacggcaca gaagcagcag aggagaggga 3780 accgatattg tgcggttgtc actcttgacg ttaaaaacgc tttcaacagc gccagctggg 3840 tagcaattgc cgactcgttg cacaggttga gggtgcctaa gtatctatgt cagattctga 3900 aaagctattt tcagaaccgg acactgatat atgaaacaga tgccgggatc agaaatctgt 3960 tggttacggc gggcgttcca cagggatcca tcttgggtcc caccctttgg aatgtcatgt 4020 acgatgaagt gctgaagctg aacctgccca gaggagtcaa gattgtcggt tttgcggacg 4080 atgtagtgct tgtagtgatc ggcgaatcac gggaagaggt ggaggtactg gcaacggagg 4140 cgatagatgc cgtggaaaat tggatgcgag agaagaagct agcgttggcc catcaaaaga 4200 ctgaattggt tatgatcagt aaccggaagg cagtacaaaa tgtgagcatc atggtcggtg 4260 agtgcatcat aaactcgaag cgagaagtga aacacctggg cgtgatgttg gacgaccgtt 4320 tgaatttcaa cagtcacgtc gactacgtct gcaataaggc gacaaaggtg atatcggcct 4380 tatcccgaat tatgcctaac aattctgcga ttaccagtag caagaggagg ctactggcga 4440 gcgtgtcaac gtcgatcatc cggtatgcag ctccagcgtg gtcggcggca ctgaagacag 4500 gacgaaatcg tgcccagttg aaccgtacgt ttaggctgat ggcaatgcgt gtagcgagtg 4560 cgtatcgaac gatatcatcg cacgccgtgc acgtaatagc cggaatgatt cctatctgcc 4620 ttctactgga ggaggatagc gagtgctaca gggatcgagc cacagggaga ggccggaaca 4680 gagcgagagc caacacgctt agtaaatggc agcagcaatg gaacaacgca gagaagggca 4740 ggtggactta ccgactgatt ccaaacgtgt cgatatggac cactagagcg cacggcgaga 4800 ttaattttca actgacgcaa ttcctgtctg gtcatggctg ctttaagcag tacttgcaca 4860 ggttcggcca cgcaaggtca ccgttgtgcc ctgaatgccg agacatagaa gaaacaccag 4920 agcacgtagt ctttacttgc ccacggtttg cacagcagcg aagcgaaatg actgccatcg 4980 tcggagacga cgtgaacgtg gaaaatatcg tcctaaagat gtgtagtaat gaagataagt 5040 ggaacgcggt gaaccgatct gttgttcaga ttatgtcaac actacagcgg acatggcgag 5100 aggagcagcg tcagatgacg taggcagccc tgccgaacga agaagggcgc cggactgttt 5160 tcgacactct aaaagagcgg acgaagggaa agtgatactg ctcggttccg ggagatattc 5220 ctttgccggg gaactctccg ccggagtagg ctagatccac cgccggggac tagctgagta 5280 gacgcgacgt agcaccggtc ccgggtcgta ggagcaccag tgaaccggaa gttagcctcc 5340 accggaatcg ctggactgac tccggcaccc taccggtcgg cccgtaaaaa attgaagagc 5400 agcagtagca gcagcagaag aagaatcggt atcggaagaa cttccatcgc cggggaattc 5460 ttcgtcggtg taggctaggt tcaccgccgg ggactagttg agtagacgtg tcatagcgac 5520 ggttgagggt cgtcggggca ccagtgaacc ggaagctagg ctccaccgga atcgctggac 5580 tgacttcggc acccctccgg tcggctcgta tcgtagaagt agaagaatcg gtgtcgggag 5640 aaattccacc gccggggaac tctccgtcgg tgtagattag gtccaccgtc ggggactggt 5700 cgagtagaat acgtcggaac gctggtattg aatagtcggg gcgcctacga accggaagtt 5760 atgctcaacc ggaatcgttg gaccgacctc ggcatcttac cagctgaact atggaaagga 5820 gatggtcgcc gtaaccgtac gagtgcgggc tttgtatgag ctcagagtgg tgcacaaact 5880 ggagccgaag ggctcagaat gttgcatgta ggtactaaca aattgacggg tcgccaaagg 5940 gcgaaaatag gtattaacaa attactaacg agtggagccg aagggctcag catgaggcac 6000 gtcgttgaac ggttttaaaa ctgctaacag gagtcgaagg gctcaggagc caaagggctc 6060 aggagccgaa gggctcagga gccgaagggc tcagcatgta gaataacaaa ttgaagtggt 6120 gcgctcagca cgttgtcctc cccttcgaag taataccgga aggtagttcc ggagggtgat 6180 ggtgatggta ctaaacctag gagagtgttt ttagtgggga aggcactaag tggatcccac 6240 accgcgccaa aaattacact ggcatgagca tgaacatata caggccagtc tatgaagatt 6300 tttaaactcc tagttgcatg aaaaaaaaaa aaaaaaaaaa a 6341 // ID Gypsy-4_BM-LTR repbase; DNA; INV; 281 BP. XX AC nscaf3031; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_BM_; KW Gypsy-4_BM-I; Gypsy-4_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-281 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 984-984 (2010). XX DR Genome; nscaf3031; Positions 1896470 1896750. XX SQ Sequence 281 BP; 75 A; 86 C; 69 G; 51 T; 0 other; tggcggcaca tcactagccg tgtgtgcggc caccactagc acaaaaaccc gtccaggcga 60 cagcggtgtg cggggacgag agcgggtaac gcgcagagga acctgctccc agtcggccgc 120 cgagagcgca gtgcgctacg aaacaactcc aagatcgaac ctacgcagct ctgctcgcac 180 gcgcatacaa atattatatt tctcgtctcg aatcgaataa aacgcattaa agtcagttta 240 gttcttattg caacgacgcc ttcccgctca cacaagtacc a 281 // ID Gypsy-113_AA-LTR repbase; DNA; INV; 225 BP. XX AC AAGE02027370; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-113_AA_; KW Gypsy-113_AA-I; Gypsy-113_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027370; Positions 93788 93564. XX SQ Sequence 225 BP; 60 A; 51 C; 47 G; 67 T; 0 other; tgtagtgtcc gctctacaga gccttcactg ttttgagaat atactgtggt cgaataccaa 60 cacttgagta catgtagaac agtatagctg agatcggaaa taaactacat tcgatcgttc 120 actttcgttt gtacagagct agaagataga cgtgttctcc tttttgcggc tgacaccgaa 180 aacctcttac caatctaccc ttgcttgcgt tgtaagtgcc ggaca 225 // ID DNA-8-3_HM repbase; DNA; INV; 2698 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 24-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE non-autonomous DNA transposon from Hydra magnipapillata- DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-8-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2698 RA Bao W. and Jurka J.; RT "nonautonomous DNA transposon from Hydra magnipapillata."; RL Repbase Reports 8(12), 2078-2078 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 2698 BP; 824 A; 516 C; 317 G; 1035 T; 6 other; tagtaartga ccggggttga tatttgaccg gggccgggtt tgataaaata tagccrgggc 60 cgggtttgtg accggggctt gcataaacat ttatacatcc taaagtatat tttttttatt 120 caaaattcat gttatatttt gtatatatat gcatatataa acatcattat gatcaacatg 180 attatcataa tcatcattat tatcatgatg atgatcatca tcatgatcat catcatcatc 240 atcatcatca tcaacagcat aaaataggaa tgctaaagcc tgatgatgat gatgatcatc 300 atcatcatca tcatcatgat catcatcatc atgatcatga tcatcatgat catcatcatc 360 agcttccaaa cctttagcct tcctctttta tgctgtttct ctaatataaa caatctagag 420 catttttttt tcttaactaa agctcattca tacaatatgt aaggcactct cttcaagttt 480 tcttacttct accactacca ttttctactg aagctgctac ctctctacat gctgatacct 540 aaattacttt aatcttttct aacaaacatt gttcgataca atgttttttc taatctgtct 600 aagattatgc cttctttttt tgtcttgctt gagtcacacc acacatccat ctgatcatct 660 tctcttttgg caaatactac tccatataaa tactaaagtt tatataacaa taaaaggata 720 ttgtatgcca gcatctaaat aaatagcaaa cagatatgtt tatatccata atatgtatgc 780 ataaagtgca gacatctcat ggaagctttt attatccttc tctggaataa aatgtttgga 840 agcttttatt acttctcctt tttcaacttt taagtcaagc ctcattctat tttactcctc 900 taacattttt caccatcttt gagcagttgc atattaatac ttttttttgc aaaacattac 960 aataatcatt tttaaaattt tcatcttttt tacaagaaca tcattactct cttataagaa 1020 caaatgtcca tttttattat tgcaaaaacc agccattatg tatgtaaaaa gtgcatcctg 1080 tttatttctg aatgtttgaa gctcttgtta cttctcaagt ttagtctcat tttactacat 1140 ttttcatcty agtttcttta ctaaatcttg tatcttatct acagatgtta ggttctagag 1200 acttttggtt agtcttcaac agtgtcatta actaaagtaa gtctaacatt taatctctaa 1260 ttcatgtgtc tgatcttttt acttctcccc agaataaagc agagttattt gcaaagaaat 1320 cttactctaa tttgactctt gaatataatg gccatactct tcctgacagt taaacagatt 1380 aacccattgt taaacatcca aatcactctg gcttccaatg ctaaagttaa ttcttaatta 1440 aacacttcta cagtttgtgc tccagaaaag atttccatca tagtcttaaa aaagtgttct 1500 ccagaactct cttcaatttt tgctaaactt ttaaaaagtg ctaaataagt ggttccaatt 1560 tttcaaaact ctagaaaaca ctttgacctc tcyaactatc ctacaattag tttttactct 1620 gttattagtt tctttatata aaggttttga ggttattaaa ttattttttt ctaactgcag 1680 tagtaaagtt attcttgaag gcctaccctc ttctttattt caagtaactt taagggtacc 1740 acaaggttct atctttgctc ctgtattgtt tcttatctac aataacgatc ttcatgaaat 1800 ctaaagtggc tctatttgct gacgactcaa cttaatactc ttgtcttgac aaaaaatctt 1860 cactttttaa ttgcttagaa caagcagccg attttaaatc tgatctctct tctgtaacag 1920 cctaaggctt gcagtggctt gtggatttta attctggatt taaaatccac aatttttgtt 1980 agacaacaaa actcatttat ttactgcaaa aaaatattgc aatattgttg gcattcctat 2040 attgataaat aacaaccctc ttactctcag acaaagtcta aaagcacatt gtaaatgtaa 2100 ttagacctgc tttatctscc aagcttgagc cactctccca ttgttgtaag gttgcatttc 2160 tttctttttt tctacaaata caatcatggt caatgctcaa acaagctatc atctctatta 2220 cgataaacca aaactcattc ttgcttggyt ttttattcaa caagttgcat ccttttactg 2280 tatctgtccc ttcatgctac aaaaactttt attaatccag tttttttcct tgcacatcaa 2340 aaaaacctta taactattac ccatctttca tgtttttctt tttttttata tacaacttac 2400 aactttttaa gtcttcagtt aaccatttcc tagtattttt acttattctc tattttaaag 2460 taactcataa cttaatagtg gttgcttgca atcttgttgg aagtaaatta gagtttaaaa 2520 atatataaat ttatgccagt atttaataaa tagtaaatgt aagcataaaa ttataatata 2580 taaagtatta taaatttttt tctaaccccg gctatcaacc cctgctaaat cttatagttt 2640 agccggggct gggtttgaaa aaagtctcat ttttagccgg ggccccggtc acttacta 2698 // ID BEL-11_DPu-LTR repbase; DNA; INV; 721 BP. XX AC scaffold_26; XX DT 16-DEC-2010 (Rel. 16.02, Created) DT 16-DEC-2010 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_DP_; KW BEL-11_DPu-I; BEL-11_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-721 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to Repbase Update (16-DEC-2010). XX DR Genome; scaffold_26; Positions 894832 895552. XX SQ Sequence 721 BP; 158 A; 168 C; 126 G; 269 T; 0 other; tgttgcggcg agaaaagcgc ctttcaattt ctccaatttg gcaacgccta actttaagtc 60 aagcgcgatt ctttcccctt ccctcccgag tctttgtgtt aatccgtctg ctagccttcg 120 tacgacgtta agaccacgct cgtttcactg tcgaaagagc tagttactgt cacgattatt 180 atacttttct gtgtttcgtg aagccattta ttcagcttgt aagtacatta tgtttgtaat 240 tatcaattgc ccgacccgtc taacctgtga atatgtttga tgtttagttt cagctcgttt 300 tctattctga gctgtcagtt caagccttat tctgagtgct tttgcttgcc tagcattggt 360 aattactatt attatgctta tattctattg tgtatgctaa ttgctgtcta ttttaggctt 420 catatctata ttccctgtct gtgtgcatat tggtctgggt gactcctcac attttctcgt 480 gtcatcccag gtacaattct tccttgtttc ttttccgttc cctaactaac tcttttctct 540 tttgtttaga catcacacgt atgtcagttt actgtatgtc aacacagttt gagtcctaca 600 catctttcat tttgtgtcgt gcttgaccca gcatcaactt gcgaataaac tacaggggct 660 gcttacaaca gaccccagaa gaaactctga aggctctcac taaattgata gcgtcgaaac 720 a 721 // ID Gypsy-27_CQ-LTR repbase; DNA; INV; 230 BP. XX AC AAWU01011305; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_CQ_; KW Gypsy-27_CQ-I; Gypsy-27_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-230 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 434-434 (2011). XX DR GenBank; AAWU01011305; Positions 102744 102973. XX SQ Sequence 230 BP; 63 A; 58 C; 58 G; 51 T; 0 other; tgttagggcc acgcgatgcg cccctaccga atgttggtgc cgtctagcac ggtacacgtc 60 aacatgacag ctgccacaca gtgacagctg accggaagat tgtaagcgag agagagagaa 120 gattgttctg taccattaat tgaccgagcc cgaactagcc ggacgtgcaa taaaaatcta 180 attgtaaaac gtagtcgcgg tacttttctt ttgctcccgc gatcacagca 230 // ID Gypsy-5_RP-I repbase; DNA; INV; 4887 BP. XX AC ACPB02004727; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_RP_; KW Gypsy-5_RP-LTR; Gypsy-5_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4887 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02004727; Positions 6258 1372. XX CC Positions [3987-4463] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1689..4754 FT /product="Gypsy-5_RP-I_1p" FT /translation="MRRGIVKPQVDSIFLLSPAENRYFVDLRLGEVTLPSL FT IDTGATHSYLGDEGLSLLESLQLSVAKINSYQVTFTNGVEETVQFTVAFVA FT RVHGRDECVTFSLLPNLSVPCLLGLDALKKWGMVLDTSNATWWFKDCPGNI FT YSFSSPTEGRSGKSTHPVGQVGLKRINADEYRKLEQFLNQEKVVDSSVPGI FT TTLTEHVIDVGNNKPIKQRAYPVSPALQKVINQEVDRMLEEDIIEPSNSPW FT SSPVVLVKKTNGDYRFCVDFRKVNSLTKKDVYPLPHMSGLLDQLRSCRYLS FT KIDLAQAYHQIPLAPQSREITAFIVLGRGLFQFKRLPYGLCGAPATFQRLL FT DRIITPELAPACFAYLDDIIIATDTFGDHVHYLKLVFARLREAGLRMNFDK FT CTFGCSEMRYLGFVVNEQGLQVDPEKIEPLLKFPTPQNVTELRRFIGLASW FT YRRFISDFSCRIRPLTQLLRKGQRWIWTAEQEAALQEIKTLLTSAPILARP FT DFSRTFLLQTDASAEGLGAVLTQDTDNLERVIAYASRALQGAERNYSVTEK FT ECLAVLWAIKKFRQYLEGYHFKVITDHASLKWLHNLKNPTGRLARWALELQ FT GYDFEVIYRRGSAHAVPDFFSRLPSRSLEGDQEDNPVSLASCMNVSSDLWY FT QNKLQSVQQQPGNHPDWKVVDGQLFRHVLEDIELTLEDVSRAWKLVVPEED FT QLKVIQENHDLPEAGHLGFEKTLEKVKRHYNWPGMRRDIARYIRKCRVCQT FT TKPEQRPSVGKMNLKKYTQPWDTVAVDIQGPFPRSSNGFSYLLVFLDTFSK FT WTEIIPIRLANAKTVAKNFRQFIIFRWGTPRILHSDNGTPFINKIMQGLAD FT TFGITLTQTPPYWPQANPVERTNRVIKTMISSYVEGNHKQWDCYIPEFMFA FT INTSWHSSLGCTPAYLNFGRELEAPNTHFRKSSPSRLSPLTTSIWTGHLEK FT LKELRLLAEKNLSEASTKQGHYYNLRRRAPQFKEGDLVLRRAHHLSSGAEG FT FAAKLALKLKALQDCH" XX SQ Sequence 4887 BP; 1417 A; 1053 C; 1117 G; 1300 T; 0 other; taaatggcgc ccgaacaggg acctcgaagc agttacttat aactataagt aattatagtt 60 atagttactc tgtaactaca gagagcttaa ggaattctgg cttgcactat acatatttca 120 gagtagcttg ccatttcgat gcatgaagag taattgccat ctttaacttt ctggtgcatg 180 atcaaagctg tagagagaga aaagaaggta aattttatat gtattctatg taaatcctta 240 ctgatttaaa gctttctgag atcatagaaa attctattct ggtggttaca aaccagaaat 300 agcaaatatt ggagtttaaa gataaaacaa acaatatata cacttgtaag tgcaagtaag 360 tatatttttt ttgtagagaa caggcacatt gctattaagg ttaggttgtc tagttcaagg 420 tgaggttcgt tacctaccta tttatttgct ttgtcatttc ctgtgctaga ggggggtatt 480 aatggtaaac ccagaagacc agttcctcaa acctaaactg ggttggatct atgaggccaa 540 taaggagacc ttaatagaaa tcttaaacaa tttcagtcta agtacagaag gaaatgttgc 600 ttccttaaga cagaccctaa gagactactt tattaaatat cccacttatt ggacagtacc 660 ttctaacatt gaaatgggag gtccagaagt gaaatcagct agcactagtg ggggaaaaga 720 gcaaccaaga tattcaccat ttcaatatcc cggtttcaat tatcatagta ccttcactga 780 aaaatttaac aatgtctcca attcgttaag cccggaggac ttccttttaa aaataaacga 840 aaagcggaag gcttggaaga taccagatga ggaccttcta gcatacctcc cggagctgtt 900 agtaggcacc cctctaaact ggtaccgctt gcaaaaggac aattggagtt cctgggaaga 960 gtttgaaatc tcgtttcgga aatctttctt tccttttgat taccaggaaa aattgagaaa 1020 ggagattgac cggcgtaccc agggtgagga tgagagaatt tttgacttcg ttatctccct 1080 tcagacttta tttagacgtt tacagaaacc attaccggaa gcggaacagg tttctattgc 1140 ctttaacaat ttattaccta ggtatcaatt atatatacgt ccgacggaaa ttacctcatt 1200 tactcaatta atcgataatg gccaacaatt cgaaaggatt gaagaccgca caaagacttt 1260 tcgaccacca cccactaaag agaaatgttt agtgcctgag ctgcttatgt tcgtaaagaa 1320 aaaagttcaa taccccagac taagcccaaa cctttaatgg catccctaga gggtgcctcc 1380 agcgatagcc accattcaga acaatcctcg gggaaaataa agtcagctcc ttctccaaaa 1440 aaggctaaac ctaagccaac ggaaaggaga ttgctttcta agccagacaa tctgatatgt 1500 tggaattgtt tggatcaggg tcaccgctat ccgcagtgta agaaacccag agataaagta 1560 ttctgttacc attgtgggac aagaggctat actaggagga actgcccgac ttgtgagggt 1620 tcgggaaact cctaaggggt caggttgctg gagtcctatc tgatcccgtt cctaacgact 1680 ccgacactat gaggagggga attgttaaac cccaagtcga ttctatcttc ctgctgagcc 1740 cggctgaaaa cagatatttt gtggatctgc gtttaggcga ggtcacccta ccgtcactaa 1800 tagacaccgg cgccacgcac tcgtatttgg gagatgaggg attatctctt ttggaatctc 1860 tccagctatc agtagctaag atcaattcct atcaagtaac gtttaccaat ggagtggagg 1920 aaaccgtcca gttcactgtt gcctttgtag ccagggtaca tggacgtgat gagtgcgtta 1980 ctttctctct cttgcctaat ttgtccgtcc catgtctttt ggggttagat gccttgaaga 2040 agtggggtat ggttttggat acctctaacg caacctggtg gttcaaggat tgccctggaa 2100 atatctattc cttctcttcc cccacggagg gaagatctgg aaaatcaacc catccagtgg 2160 gtcaggtagg ccttaaaaga attaatgccg atgagtaccg taaattggag cagttcctta 2220 accaagagaa agtagttgat agttcggtac ctgggatcac gactctaaca gagcatgtga 2280 ttgatgtggg gaataacaag ccgataaagc aacgggctta tcccgtatca ccagcgttgc 2340 aaaaagtcat taaccaggag gttgacagaa tgttagagga agacataatc gaaccatcga 2400 acagtccttg gtctagcccc gttgtgttgg tgaagaagac caacggagat tataggtttt 2460 gtgtagattt ccgcaaagta aattctttaa caaaaaagga tgtgtatccg ttaccacaca 2520 tgtccggtct cttagatcaa ctgcgatcgt gccgctacct atctaagata gatttggctc 2580 aggcctacca ccaaatccct ctagctccac agagtcgaga gattacagct tttatagtac 2640 tggggagagg attatttcag ttcaaaagac ttccttatgg actatgtggg gccccagcta 2700 cctttcaacg cctgttggat cgcatcatca cacccgaact agctccggct tgtttcgcgt 2760 atttagatga tattattatt gcgacggata ctttcgggga ccatgtgcac tacctgaagc 2820 tagttttcgc aagacttcgt gaagcggggt tgaggatgaa tttcgataaa tgcacattcg 2880 gttgttccga aatgaggtac cttgggttcg tggttaacga gcaggggttg caagtagacc 2940 cagagaagat agaaccttta ctgaaatttc ccaccccgca aaacgttaca gaacttcgtc 3000 ggtttatcgg gttggcctcc tggtatcggc gcttcatttc tgatttttcg tgccgaatac 3060 ggcctttaac ccaacttttg aggaagggtc agaggtggat ctggaccgcg gagcaagagg 3120 cagccttaca agaaatcaag acccttttaa cttccgcgcc catcctcgct cgccccgatt 3180 tctcccgtac cttcttactc caaactgatg cgagtgctga gggcctaggg gcggtactga 3240 ctcaagatac cgataatcta gagagagtta tcgcatacgc tagtagagct ctgcaaggag 3300 ccgaacgtaa ttatagtgtg acagagaaag agtgtctagc cgtcttgtgg gctataaaga 3360 aatttcgcca atatttggaa gggtaccatt ttaaagttat taccgaccat gcaagtctta 3420 aatggcttca caatcttaaa aaccctacag ggcgattggc caggtgggca ttggaactac 3480 aaggctatga ttttgaggtg atataccgtc gcggttcagc tcacgccgtg cccgacttct 3540 tttcccggtt gccgtcccga agtctggagg gtgatcagga agacaatccc gtttccttgg 3600 cttcgtgtat gaacgttagc tccgatctct ggtaccaaaa caaacttcag agcgtacaac 3660 aacagcctgg taatcatccg gattggaagg tggttgacgg ccagctcttt agacatgtgc 3720 tagaggatat tgaattgacc ttagaggatg tctcgagagc ctggaaacta gttgtacccg 3780 aagaggatca acttaaagtg attcaggaaa atcatgacct tcccgaggct ggtcatttgg 3840 gattcgagaa aaccctggag aaagtaaagc gccattataa ttggccaggc atgcgccggg 3900 atatagctag atatatccgg aagtgtagag tatgccaaac tacaaaacca gagcagagac 3960 caagtgtagg gaaaatgaat ctaaagaaat acactcaacc ttgggatacc gtcgcagtag 4020 atattcaagg tccttttccc cgatcatcga atggattctc ttatctttta gttttcttag 4080 ataccttcag taagtggact gaaataatac ctattaggtt ggctaatgcc aaaaccgtgg 4140 ctaaaaattt tcggcagttt attatcttca gatgggggac accccgaata ctacatagcg 4200 ataacggtac acctttcata aacaaaatca tgcaagggtt agctgatacc tttggcatca 4260 cactaacgca gacgcctcct tattggcccc aagccaaccc agtggaaagg acgaatcgcg 4320 tcatcaaaac aatgatttca tcttatgtgg aagggaatca taaacaatgg gattgttaca 4380 tacccgaatt tatgtttgcc attaatactt cctggcactc ttccttaggg tgcacgcccg 4440 cctatttaaa ctttgggcga gaattagagg cccctaacac acattttagg aaatcaagcc 4500 ctagtaggtt aagcccttta actacctcta tctggactgg tcacctggaa aaattaaagg 4560 aacttcgatt actggcagag aagaatctga gtgaagcgag taccaagcag ggccactatt 4620 acaacttgcg tcggagagcc cctcaattca aggaaggcga cttagttctc aggcgtgcac 4680 atcatctctc ttcaggagca gaaggcttcg cggctaagtt ggcgctgaaa ttgaaggccc 4740 ttcaggattg ccactaaaga gtctgcaacg ttttccagct gcaggacgaa gatggcaacg 4800 acgttggccc ttgccatgca gggcagatga agctgtacca ccgctaagat cagcgtgaag 4860 ggtctcacgc ttccaaagga ggggaag 4887 // ID hATm-4_HM repbase; DNA; INV; 3880 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3880 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 208-208 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1049..1384,1353..1862,1823..3412) FT /product="hATm-4_HM_1p" FT /translation="MEILKNFYYIKSTLETNAKKLSILYCMPNKMKESQCK FT NGKCKCILSLIKTSWNKAGFPIVSDKTLRKHLSNLEKEYYNLKKHEKRGSP FT TDLKKQNNFIQKMKKVFLDWYTKSRKCSWIGTPNLRHTIMIDKKRLDKDKA FT EDLKFLDDQESXRKFFLGTEDKKYSQMSRESIRKKTFKRKHNTLEVKANLD FT QELIFDENDTEMLNDLDFEPGNWTXRKEKINEKPITCTIPKDIYSGSVALA FT ASINDISPAVLQKVVTSVVHKAGVDVSKLHCSISTASSHMKITLQHFNXXI FT TYEMKKVNLIISKQARNDIKVALKASKYPPIIHFDGKTLFELKKGKRFKND FT RLAVLLNIDGEVYLLGVPSLSSSSGDDQYKGIMNLLKEYETEGNIGGLCFD FT TTSSNTGTKKGSLFRIANNLDIFPLFLACRHHISELKIVHFCNVVTQRKTS FT GPDNLLFKKLKDIFETPDFHYDTNDVTQFFWDKTKGTVIECAAVESLNFCK FT EYIKNEKILRDDRKELAELVVAYLSPGGVKIRKPGAVHHARFLGKAIYYLK FT LQFLSNQIKFVQEDKLLLEEINIMAEFIACFYAKWYLQSDKAIKAPYLDIL FT SIHKMHLYKDVCAKPDAVEAVLKSFYKHSWYLDSTIVPLSLLDKDVSNEEK FT SKIASAMLQYDMPNSDYFKIDNKSLIDVENKIKVETRVYNDPPSLSLLVDQ FT FSYLMFDRIGLDKQRIRDWLTLPPQYWYTQSSFRIFKAFAKSLVVVNDPAE FT RAVGMMQQFVHRYNEEEEIQNRLLTVDKTRLATKKPGXKSSNLSKKRLIDS FT LSIMEKMIEK" XX SQ Sequence 3880 BP; 1448 A; 538 C; 608 G; 1276 T; 10 other; tagggtactt gcttttgagg aaatttttga atttcaataa acgacccctt caaatgtggg 60 acattggtga taaaaataac tttggaaaaa aattagcctg attggaccac tcttgagcct 120 cccccaaagc ccatttatta aaaaaaaaag gtcaaaaaaa cgccaaaaac gactatgttt 180 gaagggctag ggggaggtga taatataaat aattttaact gatttttttt tcataataca 240 ggtattgatc tatattttca aataaaaagg gtagttaagg cagattatga ctattttttc 300 tgttttattg tgttaaaaag tgttaaaaaa gcgcaaaaaa aaagaattat tttgccgcga 360 attttaagaa tgaataaacg ctttgtttta cactttaatt ttgtttttta tcatttattt 420 tattttctct attatatttg trggaccata acaattttac ttgtttagat tataataaat 480 aaaataaaat acttgttrcc ctatcgatgt tttattgcag atattgagag aaacttatga 540 atttgyagtt ttcttgaatt tacaggtatg tttgaactag ttgtttttct tactaatttt 600 ttttcagtat ttatttttga aagtactatt aattaactaa taatatttta tgacttatgg 660 tgttttaggt tttataatga acattaaaga aawtaaaaaa agagatgaag aaatagagct 720 taggtcaaga aaaaaagttt tttttcttat tggagaagct aatccagcta tatcaggtga 780 tattatataa ttaatcaggc gatattaagt aattatgtcg attgatgatt atatactctt 840 tattttgcat tcaaattata ctttttatga ataatttaat gcacacatgt gtactttata 900 aaaaaagata aaattattgt ttatattaat atcttttcta aaaaaaataa gtacttttaa 960 aataagtaat caagtattga agtatgtata ttattaacat aaataaatgc tttttttata 1020 ttattaaggt aaacaacttc caactggaat ggaaatacta aaaaactttt actacataaa 1080 atcaacattg gaaacaaatg cgaaaaagtt atcaatacta tattgcatgc ctaataaaat 1140 gaaagagagt cagtgcaaaa atggaaaatg caagtgcata ttatctttga ttaaaacttc 1200 ttggaataaa gctggatttc caatagtttc tgacaaaacc cttagaaagc atctaagtaa 1260 tctagaaaaa gaatactaca atttaaaaaa gcatgaaaag agaggatcgc ctacagattt 1320 aaaaaaacaa aataatttta tacaaaagat gaagaaagtg ttcttggatt ggtacaccaa 1380 atcttagaca taccataatg atcgacaaga aaagattaga taaagacaaa gctgaagatt 1440 tgaagttttt agatgatcaa gaaagcmata gaaagttttt tctcggaacc gaggataaaa 1500 aatatagtca aatgtctaga gagagtattc gaaagaaaac atttaaaaga aaacacaata 1560 ctctagaagt taaagcaaat ttggatcagg aactaatttt tgatgaaaat gatacagaaa 1620 tgttgaatga tttagacttt gaacctggaa actggacaar gagaaaagaa aaaataaatg 1680 aaaaaccgat tacgtgcaca attcctaaag atatttacag tggtagtgtt gctcttgcag 1740 catcaatcaa tgatatatct cctgcagttc tccagaaagt tgtcacttct gttgttcata 1800 aagctggtgt ggatgtaagt aaattacact gcagcatttc aacwgcytca tcacatatga 1860 aatgaaaaag gtaaacctaa taataagtaa acaagccaga aatgacatta aagttgcttt 1920 aaaagcctcc aaatatccgc ctatcattca ttttgatggt aaaaccttat ttgagttaaa 1980 aaaaggcaag cgattcaaaa atgacagatt agcagttctt ttaaacattg atggagaggt 2040 ttatcttctc ggagtacctt ctctatcttc ttcctcaggg gatgatcagt ataaaggcat 2100 aatgaatctc cttaaagaat atgaaaccga aggaaatatt ggagggcttt gctttgatac 2160 aacttccagt aatactggaa ctaaaaaggg ttcccttttt agaatagcaa acaatctaga 2220 tatctttcca ttatttcttg cttgcagaca ccacatatca gagctgaaga ttgttcactt 2280 ttgtaatgtt gtgacacaaa gaaagacatc tggaccagat aatcttttat ttaaaaaact 2340 gaaagacata tttgaaactc ctgattttca ttatgataca aatgatgtta cacaattttt 2400 ttgggataag acaaaaggaa ctgtyatcga atgtgctgca gttgagtctc taaatttttg 2460 caaagaatat attaaaaatg aaaaaatttt aagagacgat agaaaagaac tagctgagct 2520 agttgttgct tatttgtctc ctggtggtgt aaaaattaga aaaccaggag ctgtccacca 2580 tgcaaggttc cttggaaaag ccatatatta tctaaaattg caatttcttt ctaatcaaat 2640 taaatttgtt caagaagata aactactttt agaagaaatt aatatcatgg cggaattcat 2700 agcatgcttt tatgcaaaat ggtatttgca atctgacaaa gctatcaaag caccatatct 2760 agatatttta tcaatacaca aaatgcatct atacaaagat gtttgtgcaa aaccagacgc 2820 agttgaagca gtattgaaat cattttacaa acattcctgg tatctagatt ctacaattgt 2880 acctctttct ttgttggata aagacgtttc taatgaagaa aaaagtaaga ttgcttcagc 2940 tatgcttcaa tatgacatgc cgaactctga ttattttaaa atcgacaaca aatcgctcat 3000 tgacgtggaa aacaagatta aggttgaaac gagagtttac aatgatcccc caagtttatc 3060 attattagtt gatcagttct cctacttgat gtttgataga attgggttgg ataaacaaag 3120 gattagagat tggcttaccc tcccgcctca gtattggtat acacaatcaa gctttagaat 3180 cttcaaagct tttgctaaat ctctagttgt tgtaaatgat ccggctgaaa gagcagtagg 3240 aatgatgcaa caatttgtgc acagatacaa tgaagaagaa gaaattcaga acagactgct 3300 tactgttgat aaaactagat tggctacaaa aaagccgggg rgaaaatctt caaacttgtc 3360 taaaaagagg ttgatcgact cactttctat aatggaaaaa atgattgaaa aataaaaatt 3420 gacaaccaaa ttttttaaac aaactttaat attgttttaa ataattatta attaattaaa 3480 gatttataac acacttttaa ttgtattatt tgattatacc tattcgcggc aaaataattc 3540 tttttttttg cgctttttta acacttttta acacaataaa acagaaaaaa tagtcataat 3600 ctgccttaac tacccttttt atttgaaaat atagatcaat acctgtatta tgaaaaaaaa 3660 atcagttaaa attatttata ttatcacctc cccctagccc ttcaaacata gtcgtttttg 3720 gcgtttttga cctttttttt taataaatgg gctttggggg aggctcaaga gtggtccaat 3780 caggctaatt tttttccaaa gttattttta tcaccaatgt cccacatttg aaggggtcgt 3840 ttattgaaat tcaaaaattt cctcaaaagc aagtacccta 3880 // ID Copia-1_DPu-I repbase; DNA; INV; 4144 BP. XX AC scaffold_154; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DPu_; KW Copia-1_DPu-LTR; Copia-1_DPu-I. XX NM Copia-1_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 665-665 (2010). XX DR Genome; scaffold_154; Positions 222293 226436. XX CC Positions [1625-2158] - Integrase core CC 'CCAGC' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 533..2407 FT /product="Copia-1_DPu-I_2p" FT /translation="MTKIVSTLPVAFRHFTTVWDNLPENEKNDTAARYLMN FT EEHKNSSATPTSNPVVQPAEAFTARSGRPVYKPYRPSENADRKRKREECEY FT CGKFNHSESTCTRRINAEAGHPDIKCEYCRNYDHSAIDCNKRKRDERNKSK FT ISPRVKFAKSATWDKSNEDNKGQNGVAFGASSISMEKETWYADSGASHHMC FT DDHSCMTNYSTISSHRTVKGIGGVKLTVRGKGDIRVVMEINGIKQNATIHD FT VFHVSGLGTNLLSIASTTDRGIDVNFTKQMVSFTKNGILVMTGNRSGKDLY FT RLNMRTVPVIRNETIACTARVGKLPLSVWHQRLSHTNYKTIVKMVSGDMVH FT GIRLNDYSIPTEVCSGCALGKMHRLPFKKGRERAKQIGELVHTDVCGPMQQ FT PSPNGSRYYVIFKDDFSGYRAIYFLKLKSDVFDHFKLFVCKMKSETDHNIR FT TLRSDGGGEFLSTEFVNWLPKKCIRHETTVAHTPQQNGVIERDHRTIGEAE FT RSAMLHMKNIPQELWAESYNCAVYTLNRTLSSSISATPYELWFGRKPKLGH FT LRIFGCEAYMHISDCNRSKLDPKRSSVHSLDIARPRKRTGYGTQRAAESKL FT VAMFCSTKLSSRLIQTPLIHKNSTKPL" FT CDS 2857..3885 FT /product="Copia-1_DPu-I_1p" FT /translation="MMQLDVQTAFLHGEVTEDLYVNQPEGFIAGGNESLVC FT RLHKGLYGLKQSSRLWNITFDSFITKFGFISSSADPCVYYRETDSEFTILA FT FWVDDGLMCSTKSSTNEEILSYLESHFSMTSGPADFFIGLQISRDRPKKKL FT LLSQPQYILRVLKRFHMEKCHAITGPADPHARLDSSMSPSTSEDIQMMYST FT SYDEAIGCLTYAAVCTRPDISFAVSQAARCCKNPGKAHWAAVKRILSYLAG FT TTTHGIIFSGEGRTNLVGYTDSDYAGDMDTRRPHPGTFSYTSVAPYPGEVR FT DNPALPSPQQRLNTWQPATPPRKPFGFNAFLVRSDNYHLALYVFYAIIRAP FT " XX SQ Sequence 4144 BP; 1260 A; 1034 C; 854 G; 996 T; 0 other; ggttatgggc ccaggttaaa caaatttccg acaatggcac gaattttcaa ttgtggaaat 60 acaattgctg gttaatcatc gaacaacatg agttactcga aatcgtagag gtaacttaca 120 ctgaaaatga atcatatttc atcatttttc agatcgtact aaaccaaacc tttttcctta 180 cagggaagat ctaaggaacc agaacctcac atggtttccg gaagcatctc aaactcaaaa 240 gagatcaaga aatggaaaaa aacaagacac tggcgctaga gtgatcctta tgtccatcat 300 cactcccaaa gaacaacaag cattcgtcaa ctgcaaaact gcatttgcca tttggacaaa 360 actcgcagct cagtatctaa aaaatgcctc agcaaacact cacgtgttgc aggcccgttt 420 ctttcactac caatatgtga aaggaaacaa tatgatgtat cacatcacag ccattgaagg 480 catcgcccaa caacttgacg atcttggaag ttcaatgtca gaatcctaga tcatgactaa 540 aatcgtgtca acacttccag tcgcattcag acacttcaca acagtatggg ataatctccc 600 tgaaaatgaa aaaaacgata ccgcagctcg gtatctgatg aatgaagaac acaagaacag 660 cagcgccact ccaacatcaa atcctgtcgt acaaccagca gaagctttta cagctaggtc 720 aggaagacct gtttacaaac cgtatagacc atcagaaaat gcagacagaa aacgaaagcg 780 agaagagtgt gaatattgcg gaaaattcaa ccactcagaa tcaacttgca ccagacgcat 840 taatgcagaa gccggccatc cagacattaa atgcgaatac tgcagaaatt atgatcactc 900 tgcaatcgac tgcaacaaac gcaaacgtga cgagaggaac aaatccaaga tatcgcctag 960 agttaagttt gctaaatcag cgacctggga caagtcaaat gaagacaaca aaggtcaaaa 1020 tggtgtggca tttggcgcaa gctcaatctc tatggaaaaa gaaacatggt atgcagattc 1080 cggagcaagt caccacatgt gtgacgacca ttcctgtatg accaactact ccacgatctc 1140 atcacaccgg actgttaaag gaatcggtgg agtcaaacta actgtacgag gaaaaggaga 1200 cattcgtgta gttatggaaa tcaacggtat taagcaaaac gctacaattc acgacgtttt 1260 tcacgtctca ggccttggta cgaatttatt atccatagca tcaaccaccg accgtggaat 1320 cgacgttaat ttcacgaagc agatggtgtc atttacgaaa aatggaattc ttgtaatgac 1380 aggaaatcgc agtgggaagg acctatatcg tctcaacatg cgaacagttc ccgtcatcag 1440 aaacgagacc atcgcatgca ccgccagagt cggcaagtta cccctctcgg tgtggcacca 1500 acgtttatcg cacaccaact acaagactat tgtcaagatg gtatccggcg acatggtgca 1560 cggcataaga ctgaatgatt actcaatccc aaccgaagtg tgttcggggt gcgccttggg 1620 aaagatgcac cggctaccat tcaagaaagg tcgcgagaga gccaagcaga tcggagaact 1680 tgtgcatact gatgtatgtg gacctatgca acaaccttca cccaacggat ccagatatta 1740 tgttatattc aaagatgatt tttcaggata tcgcgcaatc tactttctga agttgaaatc 1800 agatgtcttc gaccatttca agctatttgt ttgcaaaatg aaaagcgaga cggatcacaa 1860 cattcgcaca ctccgttcag atgggggcgg cgaattcctc agcactgagt tcgtcaattg 1920 gctccccaaa aaatgcattc gtcacgaaac aaccgtagct cacaccccgc aacaaaacgg 1980 cgtaatagag cgtgatcacc gcacaatagg cgaagcagaa cggagtgcaa tgctacacat 2040 gaagaatatt ccccaggagc tgtgggcgga atcttataac tgtgcagtct acactctcaa 2100 ccggacttta tccagcagta tttcggccac tccctacgaa ttatggttcg gacgtaaacc 2160 taaacttggt catctacgaa tcttcggatg tgaagcctat atgcacattt cagactgcaa 2220 tcggagcaaa ctagacccaa agagatcaag tgtacattcg ttggatattg cgagaccacg 2280 aaagcgtacc ggctatggaa cgcaacgagc cgcagaatca aaattagtcg cgatgttctg 2340 ttcaacgaaa ctatccagcc gactcatcca gactcctttg attcacaaga actcgacaaa 2400 acctctttag tggtccctcc tatcgcaatc ccaccacgtc attccagcag aaaaccacaa 2460 cctaaacgac tgtgggccga gttggcgacc gaagagtcct cggtaccgaa cttttcaccc 2520 gaagagataa aggaaccaac caattttcag tcagccatct ccggtcccga ttctgtgaaa 2580 tggaaagctg ccatggacaa agaatatcaa tccctgatgt ttaacaaaac gtggtctcta 2640 gttccccttc caaccggccg gtcagtgatt gggtgtcgat ggacatacac actgaaacga 2700 ggaccagatg gattaatcaa acgttacaag gcgcgttttg tggccaaagg ctaacgccaa 2760 cgaccaggtg tcgactattt ggagacgtac tcgccagtgg ttaaactcga ttctctccgc 2820 tgtatactct ccattgcttc acatcgtgat cttgacatga tgcaactaga tgtgcagacg 2880 gcattcctac atggcgaagt aacggaagat ctctatgtca atcaacctga aggtttcatc 2940 gctggtggaa atgagtctct agtgtgtcgc ctacacaaag gactctacgg tttaaagcaa 3000 tcatcccgat tgtggaacat cacatttgat tcgtttatta ctaaatttgg attcattagc 3060 agctcagcag acccttgcgt ctattaccgt gaaacggact cagaattcac tatccttgca 3120 ttctgggtag acgacggtct catgtgtagc acgaagtcct caacaaacga agagatattg 3180 tcatacctag aatctcactt ctctatgact tcaggtccag ctgatttctt cattggactt 3240 caaatatcac gtgaccgacc caagaaaaaa ctgttgttat ctcaaccgca atacatcctt 3300 cgagtactaa agcgcttcca tatggaaaag tgtcatgcca tcaccggtcc ggcagatcca 3360 cacgcaaggc ttgattcctc catgtctccc tctacttctg aagacatcca gatgatgtac 3420 agcacatcat atgacgaagc gattggatgc ctcacttacg ctgctgtctg cactcgcccc 3480 gatatttcgt tcgcagtgag ccaggccgcc cgttgctgca agaatcctgg caaagctcat 3540 tgggctgccg tcaagcgaat tctgtcatat ctagcaggga ctacaacaca cggaattatt 3600 ttctcgggag aaggacgcac caatctggtc ggatacactg attctgacta cgccggcgac 3660 atggacaccc gtcggccaca tccgggtaca ttttcctaca cctcggtggc gccatatcct 3720 ggggaagtaa gagacaatcc tgcactgcca tctccacaac agaggctgaa tacgtggcag 3780 ccagcaacgc cacccaggaa gccatttgga ttcaacgcct tcttagtcag atcggacaac 3840 taccacctgg ccctatacgt attttatgcg ataatcagag cgccataagt ctagtccata 3900 atccggctca tcatcagcga acgaaacaca tcgatgtcaa gtttcatttt atcagagaga 3960 agcaggcttc ccatttcatt gaaataatgt atatcgacac tcaacatcaa cttgccgata 4020 tttttggcct gccgactccc caatttaact tcatcagggc tcgaatcggc gttgttccct 4080 ttctctaaat cttttttctt ttttattctt atcttattta cctatcgctt ggtttgagga 4140 ggtg 4144 // ID Mariner-6_HM repbase; DNA; INV; 2559 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2559 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 223-223 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 502..2337 FT /product="Mariner-6_HM_1p" FT /translation="MSKKIKRKQPTLAAFGFTKNVMHRGEMSAIRLPLEAV FT ETSVECEHCKKLFKSQQGLAFHVKVMHGISKQDCKHSQSQVLQSNKELVTL FT AVKDVLDNIVNKVVSAIKKSEEASKLAGKKRHQYTSAFKAEAINAYGYEAN FT QDTIAESFGVTQSQISRWLKKKETIIKDAESSYRKLFLKGRRSTKYLELYE FT ILYKEFLQARSKGHIVNFSWLWSKARNIQLNIDPNVVIKHHVIVRFLQKKD FT LKMRSKQRNKRKHKKEMEPLLQKWHATYREKCIRTGSKDPSYDKKWGRYQP FT GQRLNVDQSPLPFVVHGKKTYEYVPKGQGATHNTWISQPGSGLEKRQCSLQ FT IMFRPEGEQPKLALIFRGQGKRISKDEKLAWHKDIHVYFQQNAWLDQNVCK FT HWCDKTLLPFVREQKLDKFVLLLDNLKGQMQEDFKDAVAAAKGLLWYGLPS FT ATDLWQPVDAGYAATLKRLIAIEHQKWLDRDNHSDRWFSNEMPYTAKERRI FT LITHWAGEAWKALNTSKYDKQRKKCWTMTGCLMTSDGSEDSLVKPEGLDSY FT KVPPPSIIDPGSDQPKGNQSDIQPVEVDDAINEDANEILPDDTFMGPDEKE FT CEVHIFDFIDNICI" XX SQ Sequence 2559 BP; 890 A; 433 C; 458 G; 778 T; 0 other; catgtttttt tttttataag cataacgttt ataagcatat ggaggctcag atttggaaaa 60 aaaataagca tatgtaagca tattcctgag cctcgataaa ccatttaaaa gatttaaatt 120 ttatttaatt aaaaatttta aatatatata tttctaaatt taagcctaca aaaacaacaa 180 aaatcgaagt ttatgttttg cagtgcaacg actttatgaa gtaaagaagt cattgtaatg 240 caaaatctaa acctcaatac ttgaccagaa aaacataaag tttaagagtc gctctgcagt 300 gttttatttc tctctctctt ttttttttct ttctctctct tttttctctc tctttttttt 360 ctttctctct ctcttttttt ctttctctct ctcttttttt tctttctcgc tcttaatgtg 420 tagtttattt aataccatat tcaaaatgtt tattttattt ctaagtctta ttttcagata 480 ctgttaaaaa tagtaataaa aatgtcaaag aagataaaga gaaagcagcc tacattggca 540 gcgtttggtt tcacaaaaaa tgttatgcac agaggagaaa tgagtgcaat ccgattgcca 600 ttagaagctg tcgaaactag tgttgaatgc gaacattgca aaaagttatt caaaagccaa 660 caaggactag cttttcatgt taaagttatg cacggaatta gcaaacaaga ttgtaaacat 720 tcacaatcac aagttcttca aagtaacaaa gaacttgtca ctctagcagt taaagatgta 780 cttgataaca ttgtcaacaa agttgttagt gccataaaaa aaagtgaaga ggctagcaaa 840 ttagctggta agaaacgcca ccaatatact tctgccttta aagcagaggc aattaacgcc 900 tatggttatg aggcaaatca agatactata gcagagtcgt ttggtgtaac gcaaagtcag 960 atttcgcgat ggcttaaaaa aaaagaaact attattaaag atgccgaatc atcctatcgc 1020 aaattgtttc tgaagggaag acgttccaca aaatatcttg aactatacga aattctttac 1080 aaagaatttt tacaagccag atcgaaaggt catattgtta acttttcctg gctttggagt 1140 aaagctagaa atatacaatt aaacattgat ccaaacgtgg taataaaaca tcatgttata 1200 gttcgatttt tacaaaaaaa ggatttaaaa atgagatcga aacaaagaaa caaaaggaaa 1260 cataagaaag aaatggaacc attattacag aaatggcatg cgacatacag agaaaagtgt 1320 atcagaacag gttctaaaga tcccagctat gacaaaaaat ggggacggta ccaacctgga 1380 caaaggctca acgttgacca gagcccactt ccctttgttg ttcatggtaa gaaaacttac 1440 gagtatgttc ccaagggtca aggtgcaaca cataatacat ggatctcaca accgggctca 1500 ggattggaaa aaaggcaatg ttcccttcaa attatgtttc ggccagaagg agagcaacca 1560 aagctagccc tcatttttag aggtcaagga aaacgcattt caaaagacga aaagttagca 1620 tggcacaaag atattcacgt ttacttccaa caaaatgctt ggttagacca aaacgtttgt 1680 aagcactggt gtgataaaac actccttcca tttgtcagag aacaaaagct tgataaattt 1740 gttcttttgc tggacaattt gaagggacag atgcaggagg attttaaaga tgctgtggct 1800 gctgctaaag gacttctctg gtacggtcta ccaagtgcca ctgacctttg gcaaccagtt 1860 gatgccggat atgctgcaac tctgaaaagg cttattgcca ttgaacatca aaaatggctt 1920 gatagagata atcattctga tagatggttc agtaatgaaa tgccctacac tgcaaaagag 1980 agacgaattt tgattacaca ttgggcaggg gaagcatgga aagctttaaa cacttctaag 2040 tatgacaagc aaaggaagaa atgttggacg atgactgggt gtttaatgac ttctgatggc 2100 tcagaagatt cgcttgttaa accagagggc cttgatagtt acaaagtgcc cccaccatcc 2160 atcattgatc ctggcagtga tcaacccaag ggaaatcaaa gtgatattca accggtggag 2220 gtagatgatg ctataaatga agatgccaac gaaattcttc cagatgacac ctttatgggt 2280 ccggacgaga aggaatgcga agttcatatc tttgatttca ttgataacat ttgtatatag 2340 tatatatttt ttatttttta ttatactgta caaaagtcaa ctttttattt ttcttataac 2400 ctttagtaac gcgcgcactt ttagaaagta ttgaaataag catattttat tagaaatatg 2460 cttaagttta agcatatttt tgagcctctt ttaacttttt gcttaagcat acccgagcct 2520 gtttcaaaaa tttcatatgc ttataaaaaa aaaaacatg 2559 // ID GIZMO2_EI repbase; DNA; INV; 2168 BP. XX AC GIZMO2_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Gizmo-Ei2, a new member of the Tc1/mariner DNA transposon DE superfamily from the single-celled eukaryotic reptilian parasite DE Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; Gizmo-Ei2; KW GIZMO2_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-2168 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; GIZMO2_EI; Positions 1 2168. XX CC The TIRs of Gizmo-Ei2 (GIZMO2_EI) are 352-bp long and are flanked CC by TA putative TSD. The element can potentially encode a 432-aa CC protein similar to other Gizmo putative transposases and to CC various IS630-like prokaryotic transposases and harboring a DD32E CC motif. There are several elements closely related to Gizmo-Ei2 in CC the E. invadens and E. moshkovskii genome. It seems that Gizmo CC elements belong to a distinct clade of Tc1/mariner elements most CC closely related to the IS630 group of bacterial insertion CC sequences than to established eukaryotic clades of the CC superfamily (e.g. mariner, Tc1, pogo?). XX SQ Sequence 2168 BP; 827 A; 301 C; 340 G; 700 T; 0 other; cagacgaatt aggtcaactt gtgatagtat tttttatgga agtgatatga tttttataaa 60 atgtacaaca tttttgataa aaatgatgaa ttaattgaga agtgatagta atttgtttga 120 aaaattatga gtatttttta taaaagtgat gaataatcgt aaaaatgatg atttatggta 180 aaaatgatgg gaaattaaat caaaagtgat ggatttttct cagaaatgat gcaatctata 240 agaaagtgac caataatatt ttgaaaaagt ttatgatttt taacacttgt gacagattat 300 tttataaata tataaatgta gttaattatg gaaacttccc attattttaa acaaaaaaat 360 gaaataaaca cgaataaaca taaaataata caaaagatgt gtttacattg ttttttcttt 420 gttttatttg tactataaaa acaactttaa tatgaacaat aatcaaatgt cccattcact 480 cacatatctt gatgcaagag atccaattgt tcgaagcgat tatatagaaa aacctccaca 540 acttctacaa ggtgagcccg tggaatcgcg acgtattatt ccttttatac caccacaatc 600 cgaaagaaga aaaagaggac gaccaaaaaa gaatgtaaca aatattacac aaatcaaaag 660 atcatacaaa agagtattgc tttctgataa gtttgaatta attaaattat ttcgaaaata 720 tggagatact gttgaaccag aattctatag taccaaagtt ggtattaagg ttaacactta 780 aaataatttg ttaataaaat tgagaaaggg ggaatcaatt ttcctaaaaa tcactacaga 840 agaaagagcc gtgtcatacc gtttcaaaac cttgttcaaa agatattaga aaaagacagt 900 actgaatctg tctctaatat tagaagacat attaaatata ttacagcaaa atcaaatgac 960 agaaatttag aagacgtttc tcttgttaca aaacaagagt ttgaagctgg atatcacaga 1020 agatttaatg aacctcttac tccagaatca cttaatctgg cggattcttt aataccatcc 1080 gtaagttcaa ttacaaattt catgcgtggt aaaacgggaa atgaaatggg gaggacattt 1140 cccatattaa gtttcaagaa agaaacaaca cgaggagccg tagctaatac tccggataac 1200 aaagaaaaaa gagttgaagc catcgaaaaa ttagtcggaa tgatgcatca aggatacacc 1260 tgggtgtgcg ttgatgagtc ttcttggcga atagtctgta cttctagtta tggatggagt 1320 ccaattggag aaaaaatgat tgtcaccaag gcaaagaatg gtcaacggct ttctgcattg 1380 actgcaattg accacatggg catgagtttt agccttatag ttgctgggga aagtgacgct 1440 gaaatattca atcgttatat ccaagatgtc atgaaacatt atgatgataa taatataaac 1500 gctgtttttt ggtgcgacaa ttgttccatt cataacgatt tagaacaact tgttgaacat 1560 acacatcata ctgttatttt caatgcggct tatagcccag aattgaaccc catagaaaac 1620 attttcggaa tttggaaacg aaaagttgaa aacgaaatca gagtatggtc ggggttggaa 1680 gatctgcttt gtaaaataaa aaatggtttt gtaaatatcc aacccgctga tgtaatagct 1740 tcaatggaaa aatgtagaaa tgaagtttgg tctcttgttt atactagaag tgatttatga 1800 attattttat tttttgttta aaataacggg aagtttccgt aattaactac atttatatat 1860 ttataaaata atctgtcaca agtgttaaaa atcataaact ttttcaaaat attattggtc 1920 actttcttat agattgcatc atttttgaga aaaatccatc acttttgata taatttccca 1980 tcatttttac cataaatcat cattttttac gattattcat cacttttata aaaaatactc 2040 ataatttttc aaacaaatta ctatcacttc tcaattaatt catcattttt atcaaaaatg 2100 ttgtacattt tataaaaatc atatcacttt cataaaaaat actatcacaa gtagacctaa 2160 ttcaactg 2168 // ID CR1-27_HM repbase; DNA; INV; 4420 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-27_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4420 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1855-1855 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 130..888 FT /product="CR1-27_HM_1p" FT /translation="MAVTLPEIKKLLKEMFKEFKKETEEMFQKQEKAVLDI FT IGANIKIINDRFEKVEKNVDGNANCIKKLTKELEDIKVSLNFNEGLIDEKI FT VTNNKYLDKKMEHTKIDNVLKEKQRNLEDRSRRNNLRIEGIYENDKESWGD FT TEKKVQTFFTEKLGLKDVEIERAHRTGRKNDGRPRTIILNLQKYKDKIRIL FT KELYRLKGTNTFVNEDFSRETVAIRKKLFAEVKERRLNGESVTVRYDKIVY FT IKTSFKNNFNE*" FT CDS join(936..1469,1414..4026) FT /product="CR1-27_HM_2p" FT /translation="MATNENFESRVFKIQKTNYSTLNENLDSDYNIYHNKF FT SDCSYYFPSQLEGFLFKNENERNNNQMLILHLNIRSLNSNFEKLLDLLEET FT KHVFNIICLTETWISLEDLNNFHLPHFNTFLMERKTNKRGGGVLIYVHENF FT EHCFRHDLSVSDCDKEIATIEITNCNKKKIFTKLLLSATKLLTATKKKFLL FT SCCYRPPDGVSENFSMFLQQIIKKGTVEKKQNFIIGDLNMNCFLYNDNLKV FT KNFYDEIFETGSVSLINKPTRVTTNSATLIDNIITSDIFNNDLKVGILRTD FT ISDHFPIFLKLDNTNSEKTLTAQRVIRKRIYNEANLNLFKNQLSLLHWNNI FT NFNDNANNIYESFFKTFFSVYDANFPIIENKITTKSLNTPWITKGFKKSSK FT IKQKLYINFLKTKTPKNEKIYKDYKNLFEKIRKNLKKNYYSQLLNTFKNDT FT KRTWQIMNEIIGKQKSCSGFLPQMVRVDNKSLYEPRTIAQEFNKFFIDIGP FT TLSKKIPKTKSLFTDFLVPIDNCIGSVELSSELLFEEFERAFKSLKKNKAI FT GADQINGNVIIDCFEQLKVVLFKVFKASIHQGVFPELLKIAKIIPIFKEGD FT KSIISNYRPISVLSTFSKILERIIYNRVYKYFHVNKLLNINQFGFKKDSST FT EHAIIQFVREISKSFEKSQYTLGLFIDLSKAFDTVDHQILIHKLKHYGIKN FT KVLKWFESYLSNRKQFVPSNDGYHTNCLSITCGVPQGSILGPLLFLIYIND FT LNKASKLTSIMFADDTNLFLSNSDIFELFKTMNKELIHVSSWFKCNKLTLN FT SNKTKWTLFHSLSKKRYLPLNLPKIFIDQNEIKRDSVTKFLGVYLDENITW FT NHHIDYISTKISKNIGVLYKARIYLNKKILIKLYYSFIHSYLNYANIAWGS FT TEKSKLQRLYRRQKHAIRLICFANRFSHAKHYFIEMRILNIYELNIYNVLC FT FVYMWKNDLSQSVFKDIFTPKPINKYNLRNTDFLNEPFCQTNFNQFCIVYR FT APHLWNKIVLPNFDFELPITFRFFKSKLKTLIFSLEDVLCYY*" XX SQ Sequence 4420 BP; 1771 A; 619 C; 572 G; 1458 T; 0 other; attttttcaa ctcaagcttt caacaagaac ggacgcactt tttacctcgt ggcgaaaaaa 60 aaaaaaaaaa aaaaatttat ttattataaa taaagacgta tctttaataa aaaattttaa 120 cttaataata tggctgtaac acttccagaa ataaagaaac ttttaaaaga aatgttcaaa 180 gaatttaaaa aggaaacgga agaaatgttt caaaagcaag aaaaagcagt tctcgatatt 240 attggcgcta acatcaaaat aataaacgat agatttgaaa aagtagagaa aaatgttgat 300 ggaaatgcaa attgtataaa aaaattaaca aaagaattag aagatataaa ggtaagcctt 360 aattttaatg aaggacttat tgacgaaaaa atagttacta acaacaaata tctagataaa 420 aaaatggaac atacaaaaat agataatgta ttaaaagaaa aacaacgaaa cttagaagat 480 cgatcgcgcc gaaacaatct taggattgaa ggaatttacg agaatgataa agaatcatgg 540 ggggatacag aaaaaaaagt ccaaacattc ttcaccgaaa aacttggact aaaagatgtt 600 gaaattgaaa gagctcaccg cactggacga aaaaacgatg gacgaccaag gactataata 660 ttaaatcttc agaaatataa agacaaaata agaatattga aggaattgta tcgactcaaa 720 ggcacaaata cgttcgtgaa cgaagatttt tctcgagaaa ccgtcgctat tcgaaaaaaa 780 ttgttcgctg aagtgaaaga aaggcgattg aatggtgaaa gtgttacggt aaggtatgac 840 aaaattgttt atattaaaac ttcttttaaa aacaatttta acgaataatt ttatgggcat 900 tcctaaaaag aaatcttaaa taaaatatat taatcatggc tacaaatgaa aactttgaat 960 ctcgtgtttt taaaattcaa aaaacaaatt actctacgtt aaatgaaaat ttagactctg 1020 attacaatat ttatcacaat aaattttcag attgttcata ttactttcct agccaactag 1080 aagggtttct ttttaagaat gagaatgaaa gaaataataa tcaaatgtta attctccacc 1140 tcaatataag aagcttaaat agcaattttg aaaaactttt agacttatta gaagaaacta 1200 aacatgtttt taatataatt tgtttaacag aaacttggat ttctttagaa gacctaaata 1260 attttcatct acctcatttt aatacatttt taatggaaag gaaaacaaat aagcgcggtg 1320 gtggagtttt aatttatgtt cacgaaaatt ttgagcattg ttttaggcat gatttaagcg 1380 tttctgactg cgataaagaa attgcaacta tagaaattac taactgcaac aaaaaaaaaa 1440 tttttactaa gctgttgtta tcggccacct gatggcgtaa gcgaaaactt tagcatgttt 1500 ttacaacaaa ttattaaaaa aggtaccgtt gaaaaaaaac aaaattttat aattggagat 1560 ttaaacatga attgttttct ttataatgat aaccttaaag ttaaaaattt ttatgatgaa 1620 atttttgaaa cgggttcagt atctttaata aacaaaccta caagagtaac gacaaattca 1680 gcaactctaa tagataatat aataacttca gatattttca acaatgattt aaaagtaggt 1740 attttgcgaa ccgatatatc agatcatttc cctatattcc taaaattaga taatacaaac 1800 tctgaaaaaa ctttaaccgc acaaagagta attagaaaac gcatttataa tgaggctaac 1860 cttaatttat tcaaaaatca actttcatta ctacactgga ataacataaa ttttaatgat 1920 aatgctaaca atatctatga gtcttttttc aaaacatttt tttctgtata cgatgctaac 1980 ttcccaatca ttgaaaataa aataactact aagagcttaa atacaccctg gatcactaag 2040 ggctttaaaa aatcgtccaa aattaagcaa aaactatata taaatttttt gaaaacaaaa 2100 acacctaaaa acgaaaaaat ttacaaggac tacaaaaatc tatttgaaaa aattcgcaaa 2160 aatctaaaaa aaaattatta ctcccaatta ctcaatacat ttaaaaatga cacaaaacgc 2220 acatggcaaa taatgaatga aattattgga aaacaaaaat catgctcagg ttttctgcca 2280 caaatggtta gagtcgataa caaaagttta tacgaaccaa gaactatagc tcaagaattt 2340 aacaaatttt ttatcgatat aggtccaaca ctttctaaaa aaatcccaaa gaccaaatct 2400 ttatttactg attttctagt acccattgat aattgcattg gttcagttga gttgtcttct 2460 gaattattgt ttgaagagtt tgaaagagcc ttcaaatctt taaaaaaaaa taaagcaatt 2520 ggagcagatc aaataaatgg aaacgttatt atagattgtt ttgaacagct aaaagttgtt 2580 ctttttaaag tttttaaagc ttctatccat caaggtgttt tccccgaact gttaaaaatt 2640 gccaaaatta ttcctatttt taaagaaggg gacaaatcaa ttataagtaa ctatcgccct 2700 atttctgttc tttccacctt ttcaaaaatt ttagaaagaa ttatatataa cagagtatac 2760 aaatattttc atgttaataa attacttaac attaatcaat tcggttttaa aaaggatagc 2820 tcaactgaac atgcaattat ccaatttgta cgggaaatct caaaatcatt tgaaaaatca 2880 caatatactc taggactttt tattgatcta tcaaaagctt ttgatacggt tgaccaccaa 2940 attctaatcc ataaacttaa acactatgga ataaaaaata aagttttaaa atggtttgaa 3000 agctatttgt caaatcgtaa acaatttgtt ccgagtaatg acggttatca caccaattgt 3060 ctttctataa catgtggagt cccacaaggc tcaattcttg gtccactcct tttcctaatt 3120 tatattaatg atttaaataa agcctctaaa ctaacaagta ttatgtttgc agatgatact 3180 aacttatttc tttctaacag tgatatcttt gaacttttta aaacaatgaa caaagagctt 3240 atacatgtat cgagttggtt taaatgtaat aagttaactc taaattctaa taaaacaaag 3300 tggaccttat tccattcgct ttccaaaaaa cgatatttgc cattaaactt acctaaaatt 3360 tttattgatc aaaatgaaat aaaaagggat tccgtaacaa aatttttagg tgtttacctt 3420 gatgaaaaca tcacttggaa tcatcatatc gactacataa gcactaaaat ttcaaaaaac 3480 ataggagttt tatataaagc acgaatctac ctcaataaga aaattttaat caaactttat 3540 tattcattta tacacagtta tttaaattat gctaatatcg cctggggtag tacagaaaaa 3600 agtaagttgc aacgtcttta tcgccgtcag aaacatgcta tccgtttaat ctgctttgca 3660 aatcgctttt ctcacgcaaa acattatttt attgaaatga gaattttaaa tatatacgag 3720 ctaaatatat ataatgtttt atgctttgta tacatgtgga aaaatgactt gtctcaatct 3780 gtcttcaaag atatttttac tccgaaaccc atcaacaaat acaatttgag aaatactgat 3840 tttttaaatg aacctttctg tcaaacgaac tttaatcaat tttgtattgt ctatcgggca 3900 ccacaccttt ggaataaaat tgttttgccc aattttgatt tcgaactacc cattactttt 3960 cgttttttta aaagtaaatt aaaaacccta attttttcat tagaagacgt attatgttat 4020 tattaataat ataaattaat aactttgcct tttcttattt atatttgtaa aaagtttatt 4080 cttacttatt ttgttgattt tatattcttt acttcacgta tacgtttata taaaatattg 4140 gtgagtagat ccgcgtgatc tctctttgga ttataaaata tatatttgga ttatatatat 4200 atgaaatttg taaaaatttg atactgttaa tgcttgtttg aacttattta aggttctgac 4260 gacaagatcc tttgatcttc tttcagatac cttattttat atttgtgaaa gtttattatt 4320 attatcatta tttttatttt attttttttg gcaatatata gttgtaatac gaccaacaaa 4380 tgtaaaatta aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4420 // ID hATm-51_HM repbase; DNA; INV; 3778 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-51_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3778 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1945-1945 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(808..1362,1605..2021,2005..3174) FT /product="hATm-51_HM_1p" FT /translation="MDILQHLAYKQSFLPKTSKKATIIACPLQKKFQRCAI FT ESKCECILAEVKEPWIKAGFSILNDQSLVKNLSNLDXXYSQLKKNEKRGSA FT ADLAKQXEFKIYLKKVFWAGIPDLKNIIRNDKKRSDXDKLEDLXFLEDQEG FT ERKFILGSEDKKYGKKVCIPLKPKKKKSFQGFLNNLLTICKPLNKIKKTFS FT TKVVTAEIPIDIFAGNVALMATISDISPNVLHKVTGAILTQSGVDLTDFKC FT SQSTAXRKMKYANXNICMIXKEDVXQAMNESPYPCIVHFDGKTLFELNEGK FT TIKTDRLAVLVNIEGETHLLGVPPPLASVILWLLSSSGEDQCNGVIDILKE FT YNLESKIGGICFDTTASDTGLKKGSLIRISNNIDKYLLLIACRHHVCELRM FT VHFCNSVSNEHXTGPDNPMFKKLKTVFERPNFNYNQTELIKFDWKAVKGTF FT LEKAAQESLNFCQEYITKGKENKDFIREDRKELAELVVSYLSQSSVKLRKP FT GAVHHARFLXKALYYLKLQLLSSQIEFVNQNNKLKTEIRLISEFIVCFYAK FT WYLQANDAIKAPFLDIKAIHQMQQYRKLCAKPDAVDAVLNSLYKHSWYLDS FT TMIPLALLDDDLSFEEKANIAKVILSFKMPISTWYNTKNKSKIDIEDKTKM FT GNKISENPSSLALLVDEFSYLMFDIIGLEEQRIKDWLSLPPEYWYTQSVFK FT NFQNYANIFNCXK*" XX SQ Sequence 3778 BP; 1435 A; 545 C; 583 G; 1190 T; 25 other; ttaggggtag tcaatttgaa gaaattttca aatctaaaat accatacact ctagatttgg 60 ggcaaatatt aaaaaaacaa ctcccaaaaa atttgagctc aacaggacca ctttatcctt 120 accccctaag cactttttta gaaaaaaaat tctcaaaaac cttcaaaaat tgccgatttt 180 acaggtccag gggggaagtt taattacaaa aatttgaatt tttttttttt ttgggttgct 240 catttttacc tatatatttg agtaaaaata tatggtatca tagtattgtt caaaaaatat 300 ttttaatttt tgtattttga attgaaaaaa ttacaatata caattatata ctaaccgcgt 360 gtatgctact aacaaaagat tataactaat tgtcaagaat atatactttt aattttttta 420 attaataatg taaatataca ataaacttta gtaatttcta cgaggtggga atttgacggg 480 acccagctat agtagctcaa agaacaattc aatggttcta tttgttgcat tagtaaattg 540 ttttacagca atcgttttaa acttgatttt ttagattagt aaaaattaaa ttaattaatt 600 taaatatgta taaaaaagaa gagcacggtc agaaaacaga aagtgaaaga acactaaagt 660 ctaaaaaaaa cctatttatt attaaagaag caaaaccagc aataamaggt aatatttagt 720 tttataatga atcccttaaa ttaaaaactt tagctaaact ttttatttat tcttttttat 780 aattaagrct ttcagcttcc aacaggcatg gatattctgc arcacttagc ttacaaacaa 840 tcatttctgc ctaaaacatc aaaaaaagcc actattattg cttgtcctct tcaaaaaaaa 900 tttcaaagat gtgcaataga aagtaagtgt gaatgtattt tagccgaggt aaaagagcca 960 tggattaaag ctggattttc tatactaaat gatcaaagtt tagtgaaaaa tctttcgaat 1020 ctggacaakg watattcaca acttaaaaaa aatgaaaaaa gaggatctgc tgctgatttg 1080 gcaaagcaam aagagtttaa aatttacctg aaaaaagttt tttgggctgg cattcctgat 1140 cttaaaaaca taattagaaa tgataaaaaa agatctgacw magacaagct tgaagatttg 1200 ragtttctgg aagatcaaga aggggaaaga aagtttattc ttggatcaga agataaaaag 1260 tacggaaaaa aggtatgcat tcctttgaaa ccgaaaaaaa aaaaaagttt tcaaggcttt 1320 ctaaataact tacttacaat ttgtaagcca cttaataaga tttaaaataa aaacagctaa 1380 atagcaatgt tcactcacaa tttcaatttc tcatatattt aaattaaatt attaattatt 1440 aaattctttt tatattttta ggcaacagag agtagcagaa agagaaagca gcgtgtaaaa 1500 caacaatata aagaagtaga gggtacaact aatgttgaga ttgatacaga tactagtgag 1560 gattcctcgc gtgattctga ttttgaccct ggagagtggt atgaaagaaa accttttcaa 1620 caaaagtagt aactgctgaa attccmatag atatttttgc tggtaatgta gctcttatgg 1680 ctacaatcag tgatatttct ccaaatgttt tgcataaagt tacaggtgca attctaactc 1740 aatcaggagt tgatcttaca gatttcaaat gcagccagtc tacagcakct agaaaaatga 1800 aatatgccaa tcrtaatata tgcatgatak caaaagaaga tgttmaacaa gcaatgaatg 1860 aatctccata cccatgcata gtacattttg atggcaaaac attatttgaa ctaaacgaag 1920 gaaaaacaat taaractgat agactggctg twcttgttaa tattgaaggt gagacacacc 1980 tacttggagt acctccacct ttagcttctg tcatcctctg gtgaagatca gtgcaatggt 2040 gtaatagaca ttcttaagga gtacaacctt gaatcaaaaa taggaggaat atgttttgat 2100 acaactgcaa gcgacactgg gttaaagaaa ggttctttaa taagaatatc caacaatata 2160 gacaaatatc ttttacttat tgcatgtaga catcacgtat gtgaattaag aatggttcat 2220 ttttgtaatt cagtttcaaa tgaacatwcc acaggaccag ataatcctat gtttaaaaaa 2280 ttaaaaactg tttttgaacg tcctaatttt aattacaatc aaactgaact tataaagttt 2340 gattggaaag cagtaaaggg aacctttctt gagaaagctg cacaagagtc tttgaacttt 2400 tgtcaagaat acattacaaa aggaaaagaa aataaagatt ttatcagaga agatagaaag 2460 gagcttgcag aacttgttgt gagttacctt tctcagtctt cagtaaaact tagaaaaccc 2520 ggagcagttc atcatgctag gttycttkcc aaagcacttt actayctaaa actacagctt 2580 ctatccagcc aaattgagtt tgtaaatcag aacaataaac tcaagacgga aatcaggcta 2640 ataagtgaat ttatagtttg cttttatgcc aaatggtatt tacargctaa cgatgctatt 2700 aaagctcctt ttctagatat aaaagcaata catcaaatgc aacagtacmg aaaactttgt 2760 gccaaaccag atgctgttga tgcagtttta aactctctat acaaacatag ttggtattta 2820 gattcaacaa tgattcccct agctctgttg gatgatgatt tgtcatttga ggagaaagca 2880 aacattgcaa aagtaattct atcttttaaa atgccaattt caacttggta caatactaaa 2940 aacaaatcaa aaatagatat agaagataaa acaaaaatgg gaaacaaaat tagtgaaaat 3000 ccttcttccc ttgctctgtt ggttgatgaa ttttcatatc ttatgtttga tataattgga 3060 ttagaggaac aaagaataaa agaytggctt tcgcttcccc ctgagtattg gtatacccaa 3120 tctgttttta aaaactttca raattatgcc aatatcttta attgtwgtaa atgatcattc 3180 agaaagatct gttggaatga tgcagcaatt tattcacaga tacaacaatg aggaagataa 3240 acagaataga ttgctaactg ttgacaaagt tcggtctgct ttcagaacac ctggaaaaag 3300 ttcaaacaaa ctttctaaaa gtaaatttat ctgaaagctt atcttctcta aataaaagga 3360 agaaaagtgg tgaaaatgaa catgaacaca actgaaatta tttgctcttg aaatttaatc 3420 caataacatg aaactatcat caaatattgt aattttttca attcaaaata caaaaataaa 3480 aatatttttt gaacaatact atgataccat atatttttac tcaaatatat aggtaaaaat 3540 gagcaaccca aaaaaaaaat ttcaaatttt tgtaattaaa cttcccccct ggacctgtga 3600 aatcggcaat ttttgaaggt ttttgagaat tttttttcta aaaaagtgct tagggggtaa 3660 ggataaagtg gtcctgttga gctcaaattt tttgggagtt gtttttttaa tatttgcccc 3720 aaatctagag tgtatggtat tttagatttg aaaatttcct caaattgact acccctaa 3778 // ID Proto1-2_NG repbase; DNA; INV; 5393 BP. XX AC . XX DT 21-MAY-2009 (Rel. 14.06, Created) DT 21-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Proto1-2_NG is a non-LTR retrotranspsoson from the Naegleria DE gruberi amoeboflagellate genome - a consensus sequence. XX KW Proto1; Non-LTR Retrotransposon; Transposable Element; KW Proto1-2_NG. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-5393 RA Kapitonov V.V. and Jurka J.; RT "Proto1 non-LTR retrotransposons from the Naegleria gruberi RT amoeboflagellate genome."; RL Repbase Reports 9(6), 1145-1145 (2009). XX DR [1] (Consensus) XX CC Proto1-2_NG is a very young familiy of non-LTR retrotransposons CC that belongs to the Proto1 clade of non-LTR retrotransposons. CC This clade includes also the Proto1-1_NG, Proto1-3_NG, CC Proto1-4_NG and Proto1-5_NG families from the the Naegleria CC gruberi amoeboflagellate genome. The Proto1 elements code for two CC ORFs. The ORF2-encoded proteins are composed of the apurinic CC endonuclease, reverse transcriptase and ribonuclease H domains. CC It is likely that the Proto1 clade is a sister clade of the L1 CC clade. Proto1 retrotransposons are characterized by 15-18 bp long CC target site duplications and by a weak target site preference: CC 5'-CATTTTTTTNNNNNNNN-retrotransposon-ATTTTTTTNNNNNNNN-3'. XX FH Key Location/Qualifiers FT CDS 16..1656 FT /product="Proto1-2_NG_1p" FT /note="Proto1-specific protein of unknown FT function." FT /translation="MPQHDHPPDTEGSTLLANRIFVLERALETSQKENTCL FT RESIQNFVNIISDHILNNNNNGPITLVNNLMDEANKFTSITQKSGNPSNDG FT NLSNNRNLSNCGKTTESEIIPDDSGKTKNRPATYSEITRSTPIKSQPNIKK FT VIAKQNTATNKMKRNMATNYKSNKRRLTTNPNLIINVICYKDATNELVEQI FT SDLLDNFPPVRINRYCKNVINVICTNDSDYKQYFNILNECVVDRKFVLECF FT PSPAIFTPKPLALCHSTSSKNIHYRLLEKIIEYYIIEKGINVEGLYTNYSH FT KQYITRISFDNEKTCSELASALDLQHYKTVEQIRSETKETTSQCTFPIQFT FT DNEITNGIARVYSRMDIQLVLNNLKISDATNHSGTSKYKLVKLSHTDKDEM FT DEFCKKNISLIDKHKNHFNFYFKPSIEILDLKIEQKNRLFPYLPTINSTNN FT IATTKSVAKTTNNNNNNNNNNNNNNNNNINNNNNNENNLLILLTKAFSSLV FT VGLANNNVIPLDNNLMEIVNITNNINNVNNTNSVMIDDDLISTSSNESTQP FT " FT CDS 1755..5339 FT /product="Proto1-2_NG_2p" FT /note="contains the APE endonuclease, reverse FT transcriptase and ribonuclease H domains." FT /translation="MCLTHGVHMMSITETHVNTTELEIKVRNSGSGYLLFN FT SLMGRSKGCKGTAVMHFLNNHRKRNITNNNLVPGILQWTRFESRGIPINLF FT TIYLSGRSDDYENDQEAINSFINAVLTCKDEHIILSGDLNIDTQNPTTTRE FT KDWIWILEMLKLTEFKTPNYTWIRRSQGTLIRSRPDHIFVSKNIRVVKEVI FT MTPLTTNDHVPFYLDLEFKSNLTWRTYFSKNKRKRMYEAINKCKIDNFQEL FT NQAIEAQIIKYGSKVDRLGVGRISTAFKEEIQDLEDEILSKINNPNIPQSE FT IQELQTLLKETHINNAKEIKEKLIDEFNDNSSSRMYQFDKMIHSNQIMWKD FT LNFEEEEIVDYFTNKFTSTNQQPTFTPSNSTIKEGPDWSNSIHIKELQDAL FT KRMKSNTSGPDLISLDIIKHLTEANQGILLEEYNRCLRDGDIPQNWKQGWV FT KLIPKREINTLSDIRPITILPIFYRILFNIIAFRLRSWASTHINVRQQAFI FT TDRNTLNHGVLLSALAMKTRKRTFILVNLDIEGAYDAVELPVIKMALDHCK FT YPSELTQFILNAYANHELQLEIDNHLSDKFKKTRGIPQGCPLAPLIYDCIT FT QLIIDKAIDKWKIPIKPGKLCANDIALCCFADDMNVVCDRYAKYNERLDNI FT HDWLHQLLFKLNAKKSVATLLPKKSTVSPKIKGTPVPKQKNLRVLGHYPWD FT DTLVTQDIQSKIDKFTKSLRFLPLFKLQPTNLKIIMHAKAISLFTHLSKIN FT IIPLKKAQEIDRAIRGAIRRKLLMDRATPTSFFHLPLEEGGLGLPSIEEFS FT ERMNLRTMNLIANNKNKLIRKAFKYGIKHCEETKDNIYAKWLSVLKEYNLT FT FVQNDTKFRIAKIKTGNSFDIHTDGSRINQKTGMGINIYDTSNRSVPCKTL FT SLRINDHYSNNIAEICSIITSINVIPRNSKVTIHTDSEVAIEVLKDRYKGD FT FLTLKEAFFRTIKNRKIEYEIKKVEAHKDSENIKVDLIAKNATTREDIFDI FT CNLLKNQHLLVKDGVVIFNHKKMTITRHLSVRHNKMNENPNFILNQSWDSL FT NVQYLKTRLTPKTKYFIWRNSINAHINFGKGKSFCYDCGTESDLFHYIHEC FT PGLDSARYFCQSKISDILEGRECLLSHFQFKPEKHHHALHFHHNGITSFED FT GYMQFHKDIAKKWPEIQGAISNFVGRSYSNYYIDSK" XX SQ Sequence 5393 BP; 2129 A; 855 C; 779 G; 1630 T; 0 other; cgtccttcta aaaaaatgcc tcaacatgac caccctccgg atacggaagg gagtactctg 60 ctcgctaaca gaatctttgt gttagagaga gcattagaaa cctcccaaaa ggaaaacaca 120 tgtctccggg aatctattca aaattttgta aatattataa gcgaccacat actaaataat 180 aataataacg gtccaatcac acttgtaaat aatctaatgg atgaggcaaa taagtttaca 240 tctattacac aaaaaagtgg aaatccatca aatgatggaa atctatcaaa taatagaaat 300 ctatcaaatt gtggaaaaac tactgaatcg gaaattattc cagatgacag tggtaaaact 360 aaaaatagac ctgctacata ttcagaaata acaaggtcaa ctcctattaa aagtcaacca 420 aatattaaaa aagtaatagc taaacaaaac acagcaacta ataagatgaa aagaaatatg 480 gctactaatt ataaatctaa taagcgtaga ttaactacaa atcctaatct aattattaat 540 gttatctgct ataaagatgc aactaatgaa ttagttgaac aaatttcaga cctccttgat 600 aactttcccc cagtcaggat taatcgttat tgtaaaaatg ttatcaatgt catttgtact 660 aatgactctg actataagca atattttaat attttaaatg aatgtgttgt tgataggaaa 720 tttgtattag aatgtttccc atctccagca atctttactc caaaacctct tgctttatgt 780 cattctacta gttccaaaaa tattcattat agactattgg aaaaaattat tgaatattat 840 attattgaaa agggaattaa tgttgaagga ttatatacca actattccca taaacaatat 900 attactagaa tatcatttga taatgagaaa acgtgctctg aacttgcttc agctcttgat 960 ttacaacact ataaaacagt agagcaaatc agaagtgaaa caaaagagac aacatctcaa 1020 tgcacattcc ctattcaatt tactgataac gagattacaa atggaattgc tcgtgtttac 1080 tctagaatgg atattcaact tgtattgaac aatttaaaaa tatcagatgc aactaatcac 1140 tctggtactt caaaatataa attagttaaa ttatcacaca cagataagga tgaaatggat 1200 gaattctgta agaaaaatat ttctcttatt gataagcaca aaaaccattt caatttctat 1260 ttcaaacctt ctattgagat tttagatctt aaaatagaac agaaaaatag attatttcct 1320 tatttaccta ctattaattc tactaataat attgcaacaa ctaaatcagt tgctaaaaca 1380 acaaataata ataataataa taataataat aataataata ataataataa taatattaat 1440 aataataata ataatgaaaa taatctgtta atattactca ccaaagcttt ttcaagcctt 1500 gttgtaggat tggcaaataa taatgtaatt ccattagata ataacttaat ggaaattgtc 1560 aatattacta ataatataaa taatgtaaat aatactaact ctgtaatgat agatgatgat 1620 ttgatttcaa catcatctaa tgaatctact caaccttgag tggaactacc actagtttaa 1680 gacttgcatg ctacaatgtt aacggtctat acaagcatgc agtccataaa tctgtaactc 1740 aagaattgaa atctatgtgt ttaacacatg gagttcatat gatgagtatt acagaaacac 1800 acgttaacac tactgaatta gaaattaaag ttagaaattc tggttcaggt tatttattat 1860 ttaactccct tatgggtcga tctaaaggct gtaaaggaac agcagttatg cactttttaa 1920 ataatcatag aaagcgtaac attaccaata ataatttagt acctggaatc ttacaatgga 1980 caagatttga aagcagggga attccaatta atttatttac tatttattta agtggcagat 2040 cagatgatta tgaaaatgat caagaagcca ttaacagttt tatcaatgct gttttaacat 2100 gtaaagatga acatattatt ttaagtgggg atcttaatat tgatactcaa aaccccacta 2160 ctacaagaga aaaggattgg atttggattc ttgaaatgtt aaaattgaca gaattcaaaa 2220 caccaaatta tacatggatt agacgaagtc agggaacttt aattagatct agacctgacc 2280 atatctttgt ctcaaagaat attagagtgg taaaggaagt aattatgaca cctttgacca 2340 ctaatgatca tgtccctttt tatttagatt tagaatttaa atctaatctt acttggagaa 2400 cttactttag taagaataag aggaaaagaa tgtacgaagc aattaataag tgtaaaattg 2460 acaattttca agagctgaat caagcaattg aggctcaaat aattaaatat ggatccaaag 2520 tagatagatt aggagttgga agaatttcta cagcatttaa agaagaaatt caagatttag 2580 aggatgaaat attatctaaa attaataacc ctaatattcc tcaatcagaa atacaagaac 2640 tacaaacact cctaaaagaa acacatatta ataatgcaaa agagattaag gaaaaactta 2700 tagatgaatt taatgataac tcttcatcta gaatgtatca atttgataaa atgatacact 2760 ctaaccaaat tatgtggaaa gaccttaact ttgaagaaga agaaatagtg gattacttta 2820 caaacaaatt tacctccact aatcaacaac caacattcac accaagtaat tcaactatca 2880 aagaaggtcc agattggagc aattcgattc atattaaaga gctacaagat gctctcaaac 2940 gaatgaaatc aaacacttct gggcctgatc taatctcact tgatataatt aaacatctta 3000 cagaagctaa tcaaggaatc ctcttagaag aatataatag atgtttaaga gatggtgata 3060 ttccacaaaa ttggaaacaa ggatgggtaa aactcattcc taaaagagaa attaatacct 3120 taagtgatat tcgcccaatt actattttac caatatttta cagaatatta tttaatatta 3180 tagcatttag attgagatct tgggcatcaa ctcatattaa tgttagacaa caagcattta 3240 ttacagacag aaatactttg aatcatggtg tattactctc tgccttagca atgaaaacaa 3300 ggaaaaggac ttttatttta gtgaatttag atatagaagg ggcgtatgat gcagtcgaat 3360 tacctgtcat caaaatggcc ctagaccact gcaaataccc ttcagaactt actcaattca 3420 ttttaaatgc ttatgctaac catgaacttc aattagaaat tgataatcat ctctctgata 3480 aatttaagaa aacaagagga ataccacaag gctgtccact ggcccctctt atttacgatt 3540 gtattaccca attaataatt gacaaagcaa ttgataagtg gaagatacca atcaaaccag 3600 gcaaactatg tgcaaatgat attgcactct gctgttttgc tgatgatatg aatgttgtat 3660 gtgacagata tgccaaatat aatgagagac tagacaatat tcatgattgg ttacatcaat 3720 tattatttaa attgaatgct aagaaatcag tagcaactct attacctaaa aagagcacag 3780 tctcacctaa aatcaaaggt actccagtac ctaaacagaa aaaccttaga gtacttggcc 3840 attacccttg ggatgatact ttggtaacac aagacatcca atccaagata gacaaattca 3900 caaaatcatt aagattttta ccattattca aattacagcc aacaaacctt aaaataatta 3960 tgcatgctaa agcaatcagc ctattcactc acctctcaaa aatcaatata atccctttaa 4020 agaaagctca ggaaatagat cgtgcaatta gaggtgcaat cagacgtaaa ttattaatgg 4080 acagagcaac tccaacaagt ttctttcacc tgccattaga agaaggtgga ttaggattac 4140 catcaattga ggaattctct gaaagaatga atttaagaac tatgaattta atcgccaata 4200 ataagaataa gttaataaga aaagctttca aatatggtat caaacattgt gaggaaacta 4260 aagataatat ttatgctaaa tggttatctg tacttaaaga atataatctc acatttgtgc 4320 agaatgatac gaaatttaga atagccaaaa tcaaaacagg aaatagtttc gatatccata 4380 cagatggatc tagaattaat caaaagacag gaatgggtat taatatttat gatacctcaa 4440 atagatcagt tccttgtaaa actttatccc tcagaatcaa tgatcattat tcaaataata 4500 tagccgaaat atgctccata ataacttcta ttaatgtaat cccacgaaac tctaaagtta 4560 ccattcatac agatagtgaa gttgcaatag aagttctcaa agatagatac aaaggagatt 4620 ttctaacatt aaaagaagcc ttctttagaa caatcaaaaa tagaaaaata gaatatgaaa 4680 ttaagaaagt tgaagcacac aaagatagcg agaatatcaa ggttgattta atagcaaaga 4740 atgctacaac cagagaggat atttttgata tttgtaattt actcaagaat caacacctcc 4800 ttgttaagga tggtgttgtc atcttcaatc ataagaagat gaccataaca agacatttaa 4860 gtgtaagaca taataagatg aatgaaaacc ctaactttat tctaaaccaa agttgggatt 4920 ctcttaatgt gcaatacctt aaaactcgtc ttactcctaa aactaaatac tttatatgga 4980 gaaattccat caatgcccat atcaattttg gtaagggtaa aagtttctgt tacgattgtg 5040 gtacagaatc tgatttattc cattacattc atgagtgccc tggcttggat agtgccaggt 5100 atttttgtca gtcaaagata tctgatattc ttgaagggag agagtgctta ctttctcatt 5160 ttcaattcaa accagaaaag catcaccatg cattacattt ccatcataac ggaatcacct 5220 ctttcgaaga tggatacatg caattccata aagacatcgc taagaagtgg ccagagatcc 5280 aaggtgccat ttctaacttt gtcggccgtt cctattctaa ctattatatt gactccaaat 5340 aatttatata acctatgaga atctgaatag attcgttatt gtcgtaataa aag 5393 // ID Kiri-9_AAe repbase; DNA; INV; 3351 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-9_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3351 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 704-704 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 468..3206 FT /product="Kiri-9_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="NICHINVQSLIARNFTKFHELKLTFIDSKVDIICFTE FT TWLNHTISNTMIGIEGYKLIRNDRNRHGGGICIYLRNSLLYKVVSMSQSSS FT NDFYGTEYLNIEIKVGFDKVLLGVIYNPPNIDCIDVLHDILENCTINYDCS FT FFVGDFNTDILKFSSRSRRFGEMLDAMSYECMNSEPTYFHQSGCSLLDLFI FT TDSPDFVAKLDQISMPGVSNHDLVFCSLRLSLNRVEDIVQYRDYVNFDSSA FT LLDGFYNIEWNNLFCMEDPNEILNFINYHILKLHDDFIPLRRKKKNRTPWF FT NSDISHAFINRNLAYDQWKRSRTEIDRNTYKRLRNFANDLVRQAKINHDKQ FT NFNVDLPSKQLWKNIKKLGISNDSSFLDQDCLNANQINDYFISNYINDSSS FT VPIPFSSDGFKFNVFHEEDIVNSIFSIKSNAVGLDNIPIRFIKILLPIALP FT VYKFLFDSIIKTSVFPRAWKNSKVIPIKKKGNSASLSNLRPISILSSLSKV FT FEKLLKIQITKFITDNDLLHPLQSGFRQNHSTNSALIKVHDDIARVIDRRG FT IAVLLLIDFAKAFDRVSHSKLVKKLSSKFSFSHNAVKLIESYLNERKQAVF FT YNGVLSDFSFIESGVPQGSVLGPLLFSLFLNDLPASLEFCSIHMFADDVQV FT YLCASDDVDMNDFARKINHDLNNILRWSQNNLLAVNPSKTKAMLICKLKNR FT PLPPSIYFDGKRVQFYDKLENLGVIFTSNLCWDAFINGQCGKIYGTLKKLN FT LTTKHLDTAMKIKLFKSLILPHFIYCDFLFSNATASALNRLRVALNACVRY FT VFNLTRYSHVSHLQDVLLGCQFSKFYSFRACLQVYIIIKRKTPRYLYDKLQ FT FMRNTRTMSLVIPNHSSAYYGQSLFVRGINNWNRLSPTLKSTANIYGFKGG FT LLRELERLQ" XX SQ Sequence 3351 BP; 1024 A; 579 C; 584 G; 1163 T; 1 other; atgaactgtt ccactcctaa aagttatatt gggaagctac tgtttcccct tctgctgctg 60 ttgttgctgt gtgctgttga tgctactgct gttgctgctg ggaggcgttg ctgtcgaata 120 agatttttct gctgctctcg attgtggtga attagatcaa attagacttt ttcgttttcg 180 gcatattagc agacattatt tagctgtttt ttctttgtaa tgtgcttttc tctaatgtca 240 aatgaacttc agactaaaag ttaatcacaa tgaacatatt agtagttagt agttttgctt 300 ttcctgattt ctacactgca tgttgcttgg gttcgatggg tagactgctt ttgaagccta 360 tactgttcaa catttagata tggatgactt ttcccttgat aatgcttctc cgaacaatgt 420 aattccacgt gttgttctta atgctgcatt aaaacatgac tgtttgaaat atctgccata 480 tcaatgtcca aagtctaatt gctagaaact tcacgaaatt ccatgaattg aagctaactt 540 ttatagatag taaagtcgat atcatttgtt ttactgaaac atggctgaac cacactatta 600 gcaatactat gattggtatt gagggttata aactcataag aaatgatagg aataggcatg 660 gtggtgggat ttgtatatat ttgcgaaaca gtttgttata taaagttgtg tcgatgtccc 720 agagttcatc caacgatttt tatggtaccg agtatcttaa tattgaaatt aaagttggtt 780 ttgacaaagt tctgcttggt gttatttata atcccccaaa cattgattgc attgacgttt 840 tgcacgatat cttagagaat tgtactatca actatgactg ctccttcttc gttggcgact 900 tcaatacaga catactcaaa ttctctagtc gatcacgtag gttcggtgaa atgttagatg 960 caatgtcgta cgagtgtatg aactctgaac caacttattt ccatcaatcc ggatgttccc 1020 tgcttgacct ttttataaca gattcacctg actttgttgc aaagctagat caaatttcta 1080 tgcctggagt atctaatcac gatttagttt tctgttcttt gcgattatca cttaacagag 1140 ttgaagatat tgtccaatac cgcgactacg tgaatttcga ttcaagtgca ttactggatg 1200 ggttttataa tattgaatgg aataatttgt tttgcatgga ggacccgaac gaaatactta 1260 acttcataaa ttaccacatt ctcaaacttc acgacgattt catacctcta cgtcgcaaaa 1320 agaagaatag aaccccttgg ttcaacagtg atatttctca tgcatttatt aatagaaatc 1380 tggcctacga tcaatggaag cgttctagaa ctgagattga cagaaacact tacaagcgcc 1440 ttcgaaattt tgcgaatgac ttggtcaggc aagcaaaaat caatcacgat aagcaaaact 1500 tcaacgttga cttgcctagc aagcaactat ggaaaaacat taaaaaatta ggaatatcta 1560 atgattcaag ttttcttgat caagattgtt taaatgcaaa ccaaataaat gactatttta 1620 tctccaatta tattaatgac tcgtcgtctg ttccaatacc attttcttct gatggtttca 1680 aattcaatgt ttttcatgag gaagatattg tcaacagtat tttctcaatt aagtcgaatg 1740 ctgttggctt ggacaatatt ccgattaggt ttattaaaat attgttacct attgctttgc 1800 cagtatataa gtttctgttt gattccatta ttaaaacatc tgttttcccg agagcttgga 1860 agaactctaa agtaataccc ataaaaaaga aaggaaacag tgcttctctt tcaaatctac 1920 ggcctataag tatactgagt tctttgtcta aagtttttga aaaactactc aaaatccaaa 1980 ttactaaatt tattactgat aatgatcttt tgcatccgct tcaatctgga ttccgtcaga 2040 atcacagtac aaattcagct ttaatcaaag ttcatgatga cattgctcgt gttatagatc 2100 gcagagggat tgctgtttta cttctaatcg atttcgctaa agcttttgat agagtttcgc 2160 actctaaact ggttaagaag ttgtcatcaa aatttagttt ttctcacaat gctgttaaat 2220 taatcgaatc ttacttgaac gaacggaaac aagctgtttt ctacaatggt gttttatctg 2280 atttcagttt tattgaatct ggagtacccc aaggctccgt gctggggcca ctactgtttt 2340 cgttattttt aaatgaccta ccggcaagtt tagaattttg ttcaattcat atgtttgctg 2400 acgatgttca agtatacctc tgtgcgtccg atgatgttga catgaatgat tttgctagaa 2460 aaattaatca tgatttaaat aatatattga gatggtcaca aaataatctt ttggctgtga 2520 acccatcaaa aaccaaagca atgcttatat gtaagcttaa aaaccgccct ctaccgcctt 2580 caatttattt cgatggtaaa agagtacagt tttatgataa attagaaaat ttgggagtaa 2640 tttttacttc aaatttatgt tgggatgcat ttattaatgg gcaatgcggt aaaatatacg 2700 gtactttaaa gaagctgaat ctaactacaa aacacttaga cacagctatg aaaattaaat 2760 tgttcaaatc cttaattctg cctcatttta tatattgtga tttcttgttt agtaatgcga 2820 cagcttctgc actmaacaga ctacgggttg cgcttaatgc atgtgttaga tatgtgttca 2880 atcttactag gtattctcat gtatcccact tgcaagatgt tttgttagga tgccagtttt 2940 ctaaatttta tagttttaga gcctgtttac aagtgtacat catcatcaaa cgtaaaacgc 3000 ctcgctatct ctacgacaag cttcaattca tgagaaatac gagaactatg agcctagtaa 3060 tacccaatca ctcatctgca tattacggac aatcattgtt tgtgagaggc attaacaatt 3120 ggaatcgttt gtctcctacc cttaagtcaa ctgctaatat ttatggtttc aaagggggtt 3180 tgcttagaga acttgagcgc ttgcagtaga acattcagga aaatcgaatt caggaacata 3240 attcgagtta gttaaatagt tttaatggaa tcaccacagt gtaacatttc aagaaggagt 3300 ttccttacgt tacataaatt caaataaaga aataaataaa ataaataaat a 3351 // ID hATm-31_HM repbase; DNA; INV; 3837 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-31_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3837 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1925-1925 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(938..1474,1567..2268,2208..2783,2656..3381) FT /product="hATm-31_HM_1p" FT /translation="MRTLEMSKETMSQHTQEFLDKANVLFPIHKEDIFKLI FT SQDPRRSKEAVKEDSIFLKRCLKGDMVKMSYKNDKTFAEIVSEGEERMKNL FT LMKMVEEEQLEERLRANVEQKKESQLLEKAAEEEALRNFEQEFSSGCSGTD FT YDSDFESPTAKKKKLYDSTKICLQLSLNDVLDKWIPFMTRMLHNLKNDNAL FT FIFRYQVSVRAETAMLSSLFQAGGVDIDMLNISKSQMARKNHEIIENEALI FT VREQNLDKIRGLKLVLHFDTKLVKQYRTEVKISETVERLAISVSSPESGSL FT DFLLGVLEVPSSKGKDQALAIQSLVEYYDLTDQIIACCTDTTRSNTGKHNG FT AIRNIAVYSLPCPILWLMCRYDXLLYKLQAFFFFFFFLRVMTENENFKFRH FT HIMEQHVHHVMVELHVIMLAPYNGTTCSSCYGGTTCHHVIMERHVHHVMME FT LQGETKGPSRSLYKKHQDLWPKIEKGVNTLANIEKFDWNRPEFALGTLFYN FT LANETMQFCTTALKTNIFDRGDYKYLCQLVAFYLGADLLNFRFLQPGAHHE FT ARFMADSLYLLAMQMTKNYNKQLSINEIKNDTGCNRLYSDLSLFILPEEPN FT GSSGSIQPRITTSNSVSMRLKMIQDATDYIVIFHCLFFLKSPMAAQAPSND FT LRAFKISSQIMIDPLYSKKYGKIGKALNLSILRHTWYFTPQCVIFALADQR FT LEEKERMDILVELLKYDVPEMNSFDKEQPEPFTQVCPLSKLSDFVSQESYL FT FFLHLGITKDDILSWKEEGMSNLEKNLNNRSFKTFVSSVRQLAVVNDRAER FT HIKLVQDFIGRTHSEDRLQDTMMVVLDNRRKVSKRATKQDLKNI*" XX SQ Sequence 3837 BP; 1347 A; 572 C; 710 G; 1200 T; 8 other; gggtgtccca gaaaaaaaat awttttgaat tcgacgttgt gcactatgtc aatcctcctt 60 atttggtccc ggtagaccac cataaaaata tctggcctca aagtcaytgt twagctgaag 120 aaggtgtttt agaaaaaaac tgcatataaa ttttaatgag atttttttgc atttttttat 180 tgaaactcta cgaatatatt ttaaaataca ataaattttt aataaaatat cagtkattag 240 gatgttgaaa aacagtagca gtatatattg tatactgcta cttaattcat aatttgaata 300 rtatacattt aattacttaa tgacaatttt tacatatatg taaaatacgt ataaaaaaat 360 tgtatatttt gtgatcaatt ctgttgtatt ttatagtata gaaaataata ttaaatatgg 420 ctagtgcagg acatgttgga aaagctagtg gaagaagaak tctttatctg ttggggtacc 480 ctctccaaat attttctcca aatagacttc caaccggtgg ggaagtttgt cgcagaattt 540 actggattaa tcggagtcat gcaactgcaa tcaaaacatt gtcatccaaa attggatgtc 600 ctctagggag tacaggcaag cttaaatgca gtggtggtat gtgttatgct gatagcagaa 660 acgaaatatc aaatgtgtgc atcataagag agctggttaa catttggaac aaggcaggct 720 ttgatgaaaa ttttatcatc agtgaacggt cgattaagta agcgactaaa tataaaaatc 780 tcttgtcata aaaattaagt attctgtgaa acatatttat ctgtgaaata cgtttacata 840 taagttttta cggctgacag gtttatttaa tgttgtaatt ttttaattta ggactagaat 900 cgttgctctg tatacacagt atattaaagt tttaaagatg agaaccctag agatgagcaa 960 agaaaccatg agccaacaca ctcaagagtt cttagacaag gccaatgttt tgtttcctat 1020 acacaaggaa gacattttca agttaattag ccaggatcct agaagaagta aagaggctgt 1080 aaaagaggat tctatttttt tgaagagatg tttgaaaggg gacatggtca agatgtccta 1140 caaaaatgac aaaacatttg ctgaaatagt ttctgaaggt gaggaaagga tgaaaaacct 1200 gctgatgaag atggtagaag aagagcaact ggaggagaga ctgagagcaa atgttgagca 1260 aaaaaaagaa agccaattac tggaaaaagc agcagaagaa gaagctttga gaaattttga 1320 gcaggaattc tccagtgggt gtagtggaac agattatgat tcagactttg agtctcctac 1380 tgcaaagaaa aaaaagttat atgattcaac aaagatttgc cttcaactca gtctgaatga 1440 tgttttagat aagtggattc catttatgac aaggtaaata agtttatgta taaaaaagtg 1500 aagtatatta caagtatata tatatatata tatatatata taaaacttca agttaaaaca 1560 acttaaatgt tacataattt aaaaaatgac aatgctctgt ttatttttag rtatcaagtt 1620 agcgtgagag ctgaaacagc aatgttgtct tcactattcc aagctggggg agtagatatt 1680 gacatgttaa acatttctaa gtcgcagatg gcaagaaaaa accatgaaat tatagagaat 1740 gaagcactta tagtaagaga gcagaacctg gataagataa ggggtttaaa acttgttctt 1800 cactttgaca caaagttggt caaacagtat aggacggagg tcaagatttc ggaaacagta 1860 gaaaggttgg caattagtgt ttcctctcca gagtcgggtt cacttgactt tcttctggga 1920 gtacttgaag taccctcctc caaagggaaa gatcaagcac tggctatcca aagtcttgtt 1980 gaatattatg acttaacaga ccagatcata gcttgttgta ctgacacaac tcgctcgaac 2040 actggaaaac ataatggtgc aattagaaat attgcagttt attctctacc ttgtccaatt 2100 ctttggctaa tgtgcaggta tgatgyttta ctttacaaat tacaggcatt ttttttcttt 2160 ttcttttttc tgagagtaat gacagaaaat gagaatttta aatttaggca ccatataatg 2220 gaacaacatg ttcatcatgt tatggtggaa ctacatgtca tcatgttata atggaacgac 2280 atgttcatca tgttatgatg gaactacagg gagaaacaaa aggacccagt cgatccttat 2340 acaaaaaaca tcaagatctt tggcctaaga ttgaaaaagg tgtgaataca ttggccaaca 2400 ttgagaaatt tgattggaac agacctgagt ttgctcttgg aaccctcttt tacaatcttg 2460 caaatgaaac aatgcagttc tgcacaacag ctcttaaaac aaatattttt gacagaggtg 2520 attacaagta tttatgtcaa ctggtagctt tttacctcgg agcagatttg ttgaacttca 2580 ggtttctaca accaggtgca catcacgaag caagatttat ggctgattca ttgtatctgt 2640 tagcaatgca aatgaccaag aattacaaca agcaactcag tatcaatgag attaaaaatg 2700 atacaggatg caacagatta tatagtgatc tttcactgtt tattcttcct gaagagccca 2760 atggcagctc aggctccatc caatgatctt agagcattca agatttcatc gcagattatg 2820 atagatccac tttactccaa gaagtacgga aagattggca aagctcttaa tcttagtatt 2880 ctcagacaca cttggtattt tacaccacag tgtgttatct ttgctttagc tgaccagcgc 2940 ttagaagaga aagagaggat ggatattctt gtggaattgt taaaatatga tgttccagag 3000 atgaacagct ttgacaaaga acaaccagaa ccttttacac aagtttgtcc cctctcaaaa 3060 ctatcagact ttgtttctca ggagtcttat ctcttttttc tccatcttgg tataaccaaa 3120 gatgacattc tctcatggaa agaagaaggc atgtcaaatt tggagaaaaa tctcaacaat 3180 agatctttta agacatttgt gtccagtgta cgtcagttag cagttgttaa tgacagggct 3240 gaaagacaca tcaaattggt gcaggatttc attggaagga cacattccga agacaggctc 3300 caagacacaa tgatggtggt tttagacaac cggcggaagg tgtctaaacg tgccactaaa 3360 caagacttaa agaatattta aataaaaaaa gcactttatg gctatatttt tgtttttgaa 3420 atatattttt tctaacaaaa gtttaataat tcttgcacca cctaattttt ttttgtattc 3480 cgtagtattt tattttaaaa aacattttaa aacaatgcaa ttataataaa gaataaaata 3540 attatgaaat tttttcaaag ttgcagtaag tatagcagtc agaattaatg cattggatga 3600 ttaatataaa gttaatttga ttattttata aaaatattga atactaccaa agtttcaatc 3660 aataaaatgg aaaaaaaact cattaaaatt tatatgcagt ttttttctaa aacatcttct 3720 tcagctaaac agtgactttg agcccagata tttttatggt ggtctactgg gaccaaataa 3780 ggaggattta catagtgcac aatgttgaat tcaaaatttt ttttttctgg gacaccc 3837 // ID Ci000428 repbase; DNA; INV; 500 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Mariner DNA transposon from Ciona savignyi. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Ci000428. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-500 RA Smit A.F.; RT "Ci000428 - Mariner DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000428,Ci001033 TA TSDs. Matches Tc2 "Mariners" in CC Caenorhabditis briggsae. Has 28 bp terminal inverted repeats. XX SQ Sequence 500 BP; 136 A; 88 C; 101 G; 174 T; 1 other; ccgtaaaacc tctatttcac cgccatggcg ctctattttt caactcttcc ttaaaagtga 60 cgttctatta gaggtgacgt tcaattagag gttggcgctc tatttttcga ctagctggtc 120 agaattttga aatgaaattt attaatgaaa aaaaatatgc gtcacacggg cgtgagttcg 180 tttaaatata ccggtaaaag aaaaattctt tcttctgttt ggtcatagcc ggtttagaag 240 gcacaaggga gactgctatt gattaatctt tcctttattc aattcaaatt gtttgcttac 300 cgcattactg ctgcttatcg tgacgtgata ctgcattgtt actaccggta tgttggttgg 360 tatcttttaa atggctgctt taactgctag attgtgaaaa cttaatcaat tttataatta 420 ccggtagaga cctgctaata caattccgcc gctaaaagtg gtgttctatt agaggtggcg 480 ytcaaataga ggttttacgg 500 // ID CR1_Ele22 repbase; DNA; INV; 4365 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele22. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4365 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4365 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 19 CC sequences with >97% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 566..877 FT /product="CR1_Ele22_1p" FT /translation="MSFIYVASRDGLSCESAKAIPLIKQGTDVNSLNFISF FT KVGVDPKYRSAALDPSSWPKGILFREFEDNKAKSFWMPKPSTPSINTTTNF FT DETPQITAASMETTEC" FT CDS 881..4294 FT /product="CR1_Ele22_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="PQGAHCKVFMGASGPSNTVESSTRNSNTSIMLSCHRS FT RPGPVFGNRLEVSQPTNEGKYESLRICSCSDVVSASSSFNCFQIVTSHDER FT NNAHRNTKLNSSYHRSTGCMQQSFMEAPRPFAAVEPATNLVTDTTSSSLHH FT SHPVPASESGLGVFQTSLPGEHSSCSASTFPDESLTPSNITTTTSSQPLSI FT YYQXVRGLRTKTNHLRSQLSSCDYDVLLFTETWLRPDIENAEFASHYNVFR FT CDRSASTSQFSRGGGVLIAVHNRPQCESVEWTNGENLEQAAVRIKLSDRLL FT YILAIYLPPNSSIEVYNAHALSIQHIVGLLSDFDIFLALGDYNLPNLRWLF FT DEDINGYLPSNASTEHELAMVEAMFTSGLQQVNHFVNVNGRLLDLAFANLP FT EHLDLIEPSVPFLPVDSHHQPFIVLLDACCDILSTCPNIGESLYFDFQKCN FT FELLDNAFSTIDWELLNGPTVEELLSDFNRRVYEVIQEIVPRKRLPSTSVY FT SKPWWSPELRNLRNILRKARNRYFKSKSESDKIQLRELENTYKNRLSVTYE FT NYISAVQTNVKQNPSLFWNYVKRQRSSDRIPSNVRYAGSMAATDSETADLF FT AAFFENVFSRASPVPRADRFEDVPSYNISIPECHFSNEDVLPVLEDLDDKK FT GPGVDGFAPILFKKCAHVLAKPIAAIFNRSIQERKFPSAWKTAIIVPIHKA FT GSQHLVENYRGISVLCCLSKVFEKLLHKVLYNAASSIMCEYQHGFVRNRST FT TSNLMCYVTAISRAMESKQQVDAIYIDFAKAFDTVPHLVVVEKMEHLGFPR FT WITSWLFSYLSDRNAFVKVNFARSKLFEISSGVPQGSVLGPLIFIIFVNDL FT ILKLSSYKLSFADDLKIFRIISTAADCLALQNDIDALLVWCDDNGMRVNSG FT KCKIISFTRNSTLRLHQYTIGTTLLERVQSICDLGVTIDSKLRFNKHISTI FT TAKAWSVLGFIRRHASEFTDIYALKTLFCALVRSILEYAAPVWTPYHLSYS FT LQLERVQKCFLRFALRGLPWRDPENLPSYPERCQLINLETLSERRAKSQQV FT FIFDIITGNLDCPMLLSEVPFYVPPRSLRNAPFLANPVHRTSYGQNNPFSA FT GVRAFNNVSVLFDFDMSKAVFKNRLRNVIS" XX SQ Sequence 4365 BP; 1191 A; 1032 C; 904 G; 1236 T; 2 other; aattggcatc actgctaatc gttgttatgc aagtcgtatt cggcctcgaa atttatctaa 60 attagtagtg ttctttacct gataaattat cggtttttgt ctccgtattt gtcgaaaagt 120 ctaatttatc aaatccgtgg ttgtttaagt gcactggttg taccgttgtg acattatttg 180 tttgtttttc attgctttgc ttacttgcaa ttgtactcgg tcactatcgt cttatcaact 240 aacgcgacgc cttacagtgg gtatcgcaag ctccacaagc tctactatcg ttgtgctttc 300 atctgttcga acaagcaacc acagctgtgt tggattatac ttccacataa cgaattcaca 360 tctgcaatag gcagtaggtc gcagataaga gtgttgaagt gcagaaagct tgcggtttta 420 ggctgatcca gaaactcaaa aatcgaggag atttctctca ggataaattc gatatttaac 480 atcgagatgg ccgctgcttg tgaccgttgc gccaaggcga tcaagcgggt cgatgacgtg 540 atcacctgca tggggttctg cgaacatgtc gttcatttac gttgcgtcga gagatggatt 600 gagttgcgaa tcagcaaaag cgataccgtt gatcaagcag ggaacagatg taaattcatt 660 gaatttcatc tcgttcaaag tcggtgttga tcccaaatac cgatcagcag ctcttgatcc 720 ttcttcgtgg ccaaaaggta tcctgttccg agagttcgag gataataaag ccaaaagctt 780 ttggatgcct aaaccaagta ctccatcaat caacaccacg acgaattttg atgaaacacc 840 tcagataact gccgcttcca tggaaaccac agaatgctga ccccagggtg cacactgtaa 900 agtatttatg ggagcctctg gtccttccaa cacagtcgag tcatcaacac gtaattcaaa 960 cacttcaatc atgttatcct gccatcgaag tcgccctggt cctgtgtttg gaaacagatt 1020 agaggtctcc caacccacga atgaaggcaa gtacgaatca cttagaatct gttcctgctc 1080 tgatgtagtt tccgcttcca gctcgttcaa ttgctttcaa attgttacgt cacatgatga 1140 aagaaacaac gcccaccgta acacaaaact caactcttcc taccatcgst caacggggtg 1200 catgcagcaa agcttcatgg aagcccctag gcccttcgcc gcagtcgagc cagctacgaa 1260 tttggtcacc gatacaacgt ctagctcttt gcatcatagt caccccgttc ctgcgagtga 1320 atccggtcta ggggtcttcc aaacttctct cccaggtgag cattctagct gcagtgcatc 1380 tacctttcct gatgagtctc tcactcctag caacatcacc acgacgacat ctagccaacc 1440 gctttccatc tactaccaaa wcgtgcgagg attacgcaca aaaacgaatc accttcgctc 1500 acaactgtcg agctgtgact atgacgtttt attgttcacg gaaacgtggc tccgaccgga 1560 catcgaaaat gcagagtttg cctcccacta taatgtgttt cgttgcgacc gcagtgcatc 1620 gacaagccaa ttttcgcgtg gtggaggagt actgatcgca gtccacaatc gtccacaatg 1680 cgaatcagtc gaatggacca acggtgagaa ccttgaacaa gctgctgttc gcatcaagct 1740 gtcggatcgc ctgctttaca tactggcgat ataccttcct ccgaactcaa gtatcgaggt 1800 ttacaatgca cacgcgttgt ccatccagca tatagtgggt cttctttcgg acttcgacat 1860 cttcttggcg cttggtgact ataatctccc aaatctacgc tggctcttcg acgaggatat 1920 caacggttac cttccatcta acgcatctac tgagcacgaa ctggcaatgg ttgaagccat 1980 gtttacttca gggctgcagc aagttaatca cttcgtcaat gtaaatggta gacttctgga 2040 tttggcattc gccaatctac ccgagcacct tgacttgatc gagccatctg taccttttct 2100 accagttgat tcgcatcatc agccgtttat tgtactactt gacgcttgtt gtgatatttt 2160 atcaacatgt cccaacatag gagaaagttt gtatttcgat ttccagaaat gcaatttcga 2220 gcttttggac aatgctttct ctaccattga ctgggaactt ctaaatggac caacagtaga 2280 agaactgctt tcggatttca atcgaagagt gtatgaagtg attcaagaaa ttgtaccacg 2340 caaacgactt ccatccactt ctgtgtacag caaaccatgg tggtctccgg agttgagaaa 2400 tctacgaaac attctacgta aagctcgaaa tcgttacttc aagtctaaat ctgagagtga 2460 taaaatccaa cttcgagagc tcgagaatac gtacaaaaac cgactcagtg ttacgtacga 2520 aaattatatc tcagcagttc agacgaacgt gaaacagaac ccatcccttt tttggaacta 2580 tgtgaaacgt caacgatcaa gcgatcgaat tcccagtaac gttcgctatg ctggttctat 2640 ggcggctacc gattcggaga ctgctgatct tttcgcagca ttcttcgaaa atgtgttctc 2700 tagagcttca cctgtgccac gggccgacag gtttgaagat gttccatcgt acaacatttc 2760 aattcctgag tgccactttt caaatgagga tgtattgcct gttctcgaag acctggacga 2820 caagaaagga ccaggcgttg atggttttgc tcctatcttg tttaaaaaat gtgcgcatgt 2880 gcttgcgaaa ccaattgctg caatcttcaa tcgctctatc caggaacgga aatttccgtc 2940 agcatggaag actgcgataa tcgtgccgat tcacaaggca ggtagtcaac accttgtaga 3000 aaattatcgc gggatatccg tactatgctg cctcagcaag gtgtttgaaa aactcctaca 3060 taaagtgcta tataatgctg catcatcgat aatgtgtgaa taccaacatg gatttgtaag 3120 aaaccgctca actacatcca acttgatgtg ctatgttacc gcaatttccc gagcaatgga 3180 gtcaaaacag caagtggacg caatttacat cgatttcgcc aaagcgttcg atacggtccc 3240 acatctagtg gtcgtggaaa aaatggagca tcttggattt ccaagatgga ttacatcgtg 3300 gctgttttcc tatctgtcag atcgaaatgc ttttgtaaag gtcaattttg cgcgatctaa 3360 actcttcgaa atctcttcag gggtaccgca gggcagtgta ctcggacctt taatcttcat 3420 tattttcgtg aatgatttaa ttttgaagct ttcgtcgtac aaactgtcgt tcgccgatga 3480 tcttaaaatc tttcgcatca tttctaccgc ggcagactgc cttgcacttc aaaacgacat 3540 cgacgcactg ctggtttggt gcgacgacaa cggtatgcga gtcaacagcg ggaagtgtaa 3600 aataatatca tttacacgca atagtacact tcggttacat cagtacacca ttggaacgac 3660 actattagaa cgagtccaat ccatttgcga tcttggagta actatcgact ctaagctccg 3720 attcaacaag cacatttcta ccataaccgc aaaagcgtgg tcggttctag ggttcattcg 3780 tcgacatgca tcggagttca cagacatcta cgctctgaag acgctgttct gcgccctggt 3840 gcgaagtatt ttagagtatg cagctccggt ctggacaccg tatcatttgt cgtactcttt 3900 acaactggag cgtgtacaaa agtgttttct gcgatttgca ctgagaggcc ttccttggag 3960 agacccggaa aacttgccaa gctaccctga acgatgccag ctaattaact tagagactct 4020 ctcggaaaga agagcaaaat cgcaacaagt tttcattttc gacatcatta ccggaaacct 4080 ggattgtcca atgcttcttt ctgaggtacc tttctatgtt ccaccacgaa gtcttcgcaa 4140 tgcacccttc ctcgccaatc ctgtccacag aaccagttat ggtcaaaaca atccgttttc 4200 cgctggcgta agagcattca ataatgttag tgtgttgttt gattttgata tgtcgaaagc 4260 tgttttcaaa aataggttaa gaaatgtgat atcgtaattt tatattttac catctgtacg 4320 accaagtcga agatatgaat aataaataaa taaataaata aatat 4365 // ID Kiri-6_AAe repbase; DNA; INV; 4747 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4747 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 701-701 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >96% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 310..1086 FT /product="Kiri-6_AAe_1p" FT /translation="MGKTKTTLVPLTNETKSNPCKRSKEDLDSSDDHVETL FT DDLLSRMQQMFNETNSKIEQSKCDLRSEIADLREEVQQFKQDCSNEVLRLT FT ESVNTIRSNVRVNEERILASMRTNDLLLSGVPYKPNEDLSIYINKVSTALG FT YVEHVMPLIHTKRLARLPIEAGASPPIVLQFAFKNVRDDFYQRYLSSRSLS FT LNHLGFNVNKRIYVNENLTILARQVKAHAIKLKKSGKLHSVFTKDGFVFIK FT RKPEDQAQLVLSVDSLEK" FT CDS 1754..4597 FT /product="Kiri-6_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLDDTVTSSSNVGVVCIPRIVMNCALLSDKLNICHIN FT IQSICARQMSKFNELKLCISDSKLDIVCLTESWLSDSVDDELIAIDGYKLI FT RNDRQYSRGGGICVYYRNGITCALLSKSELYVGMGHINITEFLFLEIRFKH FT EKFLLGVVYNPPRNDCSEIMFRKLSDFALHYSKTIIVGDFNTNMLRPNERT FT IRLGGVIENLGFQCVNEEPTHFQSNSSTLIDLMITNDPDFVLNFNQVSAPA FT FSKHDIIFSSLNIARCSKDTSMRTFRDYYNINMTALSDATLNINWNLLYSI FT TDPDLALNFFNNCLTELFNSFVPLRTAKPKKNPWFNDEICSAMIDRDISYA FT HWKRTKNIIDFNNYKRLRNRVTHLINTAKSNYVSNHLASSNSNKDLWHKLK FT QINVRSGSDHLVEVTNSANEINDYFGSNFTACDQNLSIIPSQYNNFSFSTV FT VESDIVFAIKSIKSNAVGMDLIPLRFIKLLLPLIVSEIRYIFNLILTSSKY FT PQAWKVAKIIPIKKKPRINCLENLRPISILCGLSKVFEKILKNQIFQFVES FT FNLLSECQSGFRPAHSTTSAILKLNDDILKTIDRRGVAFLLMIDFSKAFDR FT VSHVKLVNKLTTQFYFSRDASNLVRSYLSGRTQIVNINNTASQPIPILSGV FT PQGSILGPLLFSIFINDLPSVLQYCNVLMFADDVQIYLSSAELSVASMSEL FT INLDLKRLITWSQANFLPINAEKTKIMLISRSRSEHVLPNIYLGEDVIDYV FT TEANLLGFVVQQNLEWDSHVNFQCKKIYMGLRQLRLSSSMLKFETKMQLFK FT SLILPHFMYGAELLLNASARSLDRLRVALNCCVRYVFNLNGYSRVSHLQSQ FT LLGCPFYEFAKLRSCLLLNKLVSRSSPAYLFEKLNRFQSARTRNFLIPNYN FT TSHYGNTIFVRGIAYWNQLPNELKNVNSVDIFRRDCIAWFNRRI" XX SQ Sequence 4747 BP; 1371 A; 871 C; 897 G; 1607 T; 1 other; gtttctgaag ggatgagctg gaatcgtgag tgagataatt ggtagtggct caatttgttt 60 ttaacgcctg ttcagtgatt tttccgacca ccaacttstg tctgcccttt catcagatgt 120 tgctgattta gtgttttatg gaatctaccc gtctaattac cagcctgtgt taaaatccaa 180 ttttatattt ggctaagctt ctactggact cgccattgtt tgcgtagtcg tatatcaaca 240 aaaggcactc acgaaagttc attccgctca tctggctcgg taagcaagag acggatccag 300 tgtttcacta tgggcaaaac aaaaacaaca cttgttccac tgacgaacga gactaaatcc 360 aatccgtgta agcggtcgaa ggaagatctc gattcctcgg atgatcacgt agaaacgcta 420 gatgacctgc tttcccggat gcagcagatg ttcaacgaga caaactccaa gatcgagcaa 480 agtaagtgtg acttgcggag tgaaattgca gatcttcgtg aggaagtaca gcagttcaag 540 caagattgtt cgaacgaagt gttacgatta acggaatcag tcaatactat ccgctccaac 600 gtccgagtca acgaagaacg aattttggca tcaatgagga ccaatgactt gctattatca 660 ggagttccct acaaaccgaa tgaggatttg agcatctaca ttaacaaagt ctcaacggcc 720 cttggctatg ttgagcatgt catgccactt attcacacca aacgcttggc acgtcttcca 780 atagaagctg gagcttcccc accaatcgtg cttcaattcg ccttcaagaa cgttcgtgat 840 gatttctacc agagatacct gtcttcgcgt agtttgtcgc tcaaccatct cggatttaac 900 gtcaataagc gcatctacgt aaacgaaaat ctcaccatac ttgctcgtca ggtcaaggcg 960 catgctatca aattgaagaa aagtggaaaa ctgcacagcg tgttcaccaa ggatggattc 1020 gtcttcatca aaaggaaacc ggaagatcaa gcgcaattgg tgctgtcggt ggatagcctg 1080 gaaaaataag aatacccttc catttacatg ctctctttcc ttcctaatag agtccatggt 1140 tcctatccta gggttccaat gtcttcttcc tctcctgaaa gtcaagaagc cgaaatcata 1200 cctatcctta ttagtttttg aatattccct tctcattatt cctatgtatc cgatccattt 1260 acttccttgt tttcttccgt cctaaaagtt aatcgtcgat ggattaccat tgatcatcgg 1320 atcaactcaa ctcaagactg acgttatact gcaacttcga ttggttctgc tgtctgttgt 1380 gattacaccg aatgcttttg ctgctgctgt tattgctgtt gttgctgttg ttgctgggat 1440 gatgatgctg taaaaccagg agatgatttc aaggctgcta cgtgcgtact ggaccctatg 1500 ggcgtcgctt tggatgcagt gttggtgaat gtaattgtaa attattaatg aagactgtat 1560 atttgaactt tatcaagttg agccattgct cttttgcgat tgttgttatt tttttcaatt 1620 attttgagat attgatgagt agctaaattt agtgtaatgg agtttcactc acttttcttt 1680 tttatttcta tttcctcctt cgtggccacg gaactaccgt cttgcattgt ttcgtatgta 1740 actttttatt tcgatgttgg atgataccgt aacttcctct tcaaatgtag gtgttgtatg 1800 tattcctcgt attgtgatga actgtgcttt gctttcggat aaactcaata tttgtcatat 1860 caatattcaa agcatttgtg ctcgacaaat gagcaaattt aacgaattga aattgtgcat 1920 tagtgatagt aaactagata tagtgtgctt aacagaatcg tggcttagcg acagtgttga 1980 tgatgagtta atagctattg atggatataa actaattcgt aatgaccgac aatacagtcg 2040 tggaggtgga atctgcgtgt attataggaa tggaattact tgtgctttgc tttctaaatc 2100 tgagctttat gttggaatgg gccatattaa cattacagag tttctgtttt tagaaattcg 2160 ttttaaacat gagaaatttt tgcttggtgt tgtatataat ccacctcgaa acgattgttc 2220 cgaaattatg tttcgtaagt tgtcggattt tgcacttcat tactccaaaa cgatcatagt 2280 tggcgatttt aacacgaaca tgttgagacc aaatgaacgg accattaggt taggaggtgt 2340 tatagaaaat cttggttttc aatgtgtgaa tgaagaacct actcattttc aatcaaattc 2400 gtctaccctt atcgatctta tgattacgaa tgatcctgat tttgtgctta attttaatca 2460 ggtctcggca ccagcctttt ccaagcatga tatcattttt tcgtctctga atattgcacg 2520 atgctcaaaa gatactagta tgcgaacctt tagagattac tataacatta atatgactgc 2580 tctgagtgat gctaccttga atattaactg gaatctactt tatagtataa ctgatcctga 2640 tcttgccttg aactttttca acaactgttt aacggaactt ttcaattctt ttgttccgtt 2700 aagaactgct aagcccaaga aaaatccttg gtttaatgat gaaatttgta gtgcaatgat 2760 agacagagat atttcttatg ctcattggaa acgtacgaag aatatcattg acttcaataa 2820 ttataagcga ttgcgtaacc gagtgactca tttgattaat actgctaaat caaattatgt 2880 atcaaatcac ctagcatctt ctaactcgaa taaagacttg tggcataaac taaagcagat 2940 taatgtaaga agtggttctg accatttggt tgaagtgacg aatagcgcta atgaaataaa 3000 tgactatttt ggttcgaatt ttactgcatg tgatcagaat ctgtctatca ttccttctca 3060 gtataacaat tttagttttt caactgttgt ggagagtgat attgtatttg caatcaagtc 3120 cattaagtcg aatgcagttg ggatggatct gattccacta cgttttataa aactgttgct 3180 gcccttaata gtttctgaga ttcgttatat tttcaattta attttaacgt cctcaaagta 3240 tccacaagcc tggaaagttg caaaaataat tcctattaag aaaaaaccta gaataaattg 3300 tttggaaaat ttaagaccaa taagtattct ttgcggttta tcgaaagtgt ttgaaaaaat 3360 tctgaaaaat caaattttcc aatttgtcga atcatttaat ttactcagtg aatgtcaatc 3420 tggttttaga ccagcgcata gcactacatc tgcgattcta aaattaaatg acgatattct 3480 taagacaatt gataggagag gtgtagcgtt tctattgatg atcgattttt caaaagcctt 3540 tgatcgcgtc tcgcatgtta aacttgtcaa taagctaacc actcagtttt acttctcacg 3600 agatgcatca aatttagtgc gatcttattt atctggtcga actcaaattg taaatattaa 3660 taatacagct tcacaaccaa ttcctattct ttccggagtg cctcaagggt caatattagg 3720 acctcttctt ttttcaatat ttattaacga tctcccatcc gtgttacaat attgcaacgt 3780 cctcatgttt gccgatgacg ttcagatata tttaagttct gcagaacttt ctgttgcctc 3840 gatgtctgaa ctaataaatt tggatctgaa acgtctgatt acttggtcac aggcaaattt 3900 tcttcctata aatgcagaaa aaaccaaaat tatgcttatt tcaagatcta gatcggagca 3960 tgtcttacct aacatctatt tgggagaaga tgtcattgat tatgtgacgg aagccaatct 4020 acttggtttt gtcgtacaac agaatcttga atgggattct catgtaaatt tccaatgtaa 4080 gaaaatctac atgggccttc gccagcttag gttatcttcg agcatgctta aatttgaaac 4140 aaaaatgcag ttgttcaaga gccttatttt gcctcatttt atgtatgggg cggaacttct 4200 gttaaatgct tcggcgaggt ccttggatcg tttgagagta gctctcaatt gctgtgtacg 4260 ctatgtattc aacctgaatg gttactctag agttagccat ctacagtcac agctattagg 4320 ttgtcctttc tacgagtttg ctaagcttcg ttcctgtttg cttttaaata aattagttag 4380 tagatcttct ccagcatatt tgtttgaaaa gttgaatcgc tttcaaagtg caagaactcg 4440 taactttctt attcctaatt ataacacatc tcattacggt aacaccatat ttgtaagagg 4500 aatcgcttac tggaatcagc taccaaatga attaaagaat gtaaattccg tggatatttt 4560 ccggcgagac tgtattgcgt ggtttaatag gaggatttag aatttttgtt agtgtaatgt 4620 ttaagttttt tttttataat taagtagttg atgaattctg atggactcaa cttatgtagt 4680 aatttaaaag gggcagccct tactctacag atttatgaat caataaataa ataaataaat 4740 aaataaa 4747 // ID Hopers3 repbase; DNA; INV; 3063 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Hopers3 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hopers3. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3063 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 186..2645 FT /product="Hopers3_1p" FT /translation="MTPKKREKTNVWIYYVNNTESGLATCRMCNGTLKNNR FT VSNLKTHLWNRHNIKVLKITPIETKVDESSDLDSEISEKIKRKQKTTAVHK FT KQTLRQKRNALWKYFENNIDSGQAKCKICKAPLRNRVSNLKGHLFQQHDIN FT LYYAKRQPMKKIRVNVNSLLLRNNKNDVWNYYENIIDSGIARCKTCNGTLK FT NNRVSNLKTHLLKMHNLNLSVKKIQPIESASSEADSVHSVKIIPKEILKIN FT VNRKQLLRSFIGLVTEDCIPLKVLDSPNMRNIIGPICDGLEASAGKPMSLK FT ASSCIKHLQLVASNIRLDIKTELKNKLLSFRIDSASRLCRNIVGISAQFLS FT DAQIKSRFLGMIELRKEPPKNVAVEVINVLKRYDIKVAQMVSATSDNGEPI FT EESGDDCSNEEYLKKIELVEKSPNILLGHVRVSRCAAHSAQLCVLDVTKSS FT EIINYIFTCRNLTKYIRKPSNRYHETFEQKQLKMPQLDCPTRWGSTYAMLE FT HLKLAKDVIPKNESVRSNIQDGSYVIDSFWEFIESYCVVFGPLQKTILKFQ FT EEQLHYGNFYAQWLKCKICTEKIVKDASQTLTTTIGNIILNSIDKRTKKFM FT NSRRLVSCLYLDPRFHHTLTAEQKVKARDYLKQVWDRITEVNPEVTCSASM FT VSIPDQSVGFDEEDELLNQYLTQGLLEANVGSKSDVYTKIETLQLPFHRID FT VDVLSFWQAKENSDQELYAISKVCFAIPPTQVNSIHFYYKFKVLLNIFHSS FT QVAMERQFSTLRLMLTDDWNQHGQETLENILLVKLNPNFLESAIDQLPIFQ FT NDHDPPPDRIYKRRRFCS" XX SQ Sequence 3063 BP; 1065 A; 582 C; 594 G; 822 T; 0 other; cgaatatcag aagtcgcggt caaagtacaa aaactaagca aaaactgtgc aagtagtgag 60 cttttcgacg ttcgtcagaa tcacggctgg ctgcgaagat tgaagaagtc gcagtcgtga 120 cggcagtcgt cgtcgatatt gtcaagtgtt ttccgaggct caataagtaa ctcaatcgat 180 tgacaatgac gccaaaaaaa cgcgaaaaaa ctaatgtttg gatttactac gtaaataata 240 cagaaagcgg gctggctacg tgcagaatgt gtaatggcac cttaaaaaac aatagggtgt 300 cgaatctaaa aacacattta tggaatcgtc acaatataaa agttttaaaa ataaccccga 360 tcgaaaccaa agttgacgag tcatccgatc tggattccga gatttctgaa aaaataaaac 420 gcaagcaaaa aacaacagct gtgcataaga aacaaacact tagacaaaaa cgtaatgctc 480 tttggaaata tttcgaaaat aatatagaca gcggtcaggc caagtgcaaa atatgcaaag 540 ccccgcttag aaacagagtt tcaaatctga aaggtcattt attccaacag cacgatataa 600 acttatatta tgccaaaaga caacccatga aaaaaattag agttaatgtg aatagcctgt 660 tacttcgaaa caataaaaat gatgtttgga attactatga aaatattata gacagcggta 720 tagccaggtg caagacatgc aatggcacac tgaaaaacaa tagagtttca aacttaaaaa 780 cacacttatt gaagatgcat aatctaaacc tatcagttaa aaaaattcaa ccgatcgaat 840 cagcctcatc cgaggcggat tcagtgcatt ctgtcaaaat aataccaaag gaaatattaa 900 aaattaatgt gaatagaaaa cagttactta gatcttttat tggtcttgtg accgaagact 960 gcataccctt aaaggtgttg gattcaccga acatgaggaa tatcattggc ccaatttgcg 1020 acgggctgga agcttcagct ggaaagccaa tgagcttaaa ggcctcaagc tgcattaaac 1080 atttgcaatt ggttgcatct aacataagac ttgacatcaa aactgaattg aaaaacaagc 1140 tgttgtcatt cagaatagac agtgcctcgc ggctatgcag aaacatagtg ggaatcagtg 1200 cacaattcct aagtgacgct caaataaaat cccgcttttt aggaatgatt gaacttagaa 1260 aagagccacc caaaaatgtt gccgtcgaag tgatcaatgt tttaaaaagg tacgacatta 1320 aagtagccca aatggtgtca gctacatccg ataacggaga gcccatagaa gaaagcggag 1380 acgactgttc caacgaagag tatctcaaaa aaatcgagtt ggtggaaaaa tcacccaaca 1440 tcctattagg acatgttcga gtctctcgct gtgctgctca cagtgcccaa ctttgtgtcc 1500 ttgatgttac aaaatcctcg gaaataatta attacatatt cacttgtcgt aatttgacga 1560 aatatatcag aaaaccatcg aaccgatatc acgaaacttt tgagcaaaaa caactaaaga 1620 tgccccagct agactgcccc accagatggg gctcaaccta tgcgatgttg gaacatttga 1680 aacttgctaa agacgtgata cccaaaaacg aatcggtgag aagtaacatt caagacggaa 1740 gctatgtaat agactcgttt tgggaattta tcgaaagcta ttgcgtcgta ttcggtccct 1800 tgcaaaaaac aatactcaaa ttccaagagg agcaactgca ctacggcaat ttttatgctc 1860 aatggttaaa atgcaagata tgcactgaaa aaatcgtaaa ggatgccagt caaactttga 1920 cgacaacaat tggaaatata attttgaact ccatcgataa gcgaacaaaa aaatttatga 1980 atagtagacg tttagtttcg tgtttgtatt tggatcctcg attccatcat accttgacag 2040 cagagcaaaa agtgaaagct agagattatt taaaacaagt ttgggataga atcacggaag 2100 ttaatcccga ggtcacttgt tcagcatcga tggtttcgat tccagatcaa tccgttggct 2160 ttgacgaaga agatgagtta ctaaaccaat acttgactca aggactactt gaagcgaacg 2220 ttggaagtaa aagcgatgtg tatacaaaga ttgagactct tcagcttcca tttcacagaa 2280 ttgatgtaga cgttctatcg ttttggcaag cgaaagagaa ctctgatcag gagctgtatg 2340 caataagcaa ggtgtgcttt gcaataccgc ccactcaggt aaatagtatt catttttatt 2400 ataaatttaa agttttatta aatatttttc attcatcaca ggtggcgatg gagcgacaat 2460 tttctacgct acgactgatg cttactgacg attggaatca acatggtcag gaaacactag 2520 aaaacatatt gttggtgaaa ttaaatccaa attttcttga atcagccatt gaccagttgc 2580 cgatttttca aaacgaccac gatccccccc ccgatagaat ctataaaaga agaaggttct 2640 gcagctgacg cagctggtcg ggattgtcga agaccgacat tgctgtgaga cccgcactcg 2700 agtaaaagta ttgaagtatt tcaaattctt tttttctata aatttttttt tttttttttt 2760 tttttctcaa cacttgtcac tttatctttt cgcacttgta aaataaaact cgaaatattt 2820 atacagtttt tgtttttaag ttttcggctc ggcttgtcga cttcgacaaa agtgtgatgg 2880 acattaaaaa aaccaatagt attaagtgca aaaattactg caaaacaaag cgtaaaactt 2940 cgtcggcgac tgccgaaggt aaaaatcaac tggcactttt tcgctttgac gacttgcaca 3000 ccagccgtgg gccgactgac gtcagttttc gttactcagc tttgactgcg aattcttatt 3060 tcg 3063 // ID hAT-N14_AP repbase; DNA; INV; 302 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N14_AP. XX NM hAT-N14_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-302 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2114-2114 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 302 BP; 86 A; 57 C; 61 G; 98 T; 0 other; cagggatggc caacccgcgg cccgccttgg aaagttttgc ggcccgcgac atttatcaaa 60 tatttacaaa aaaaaattat ttttgtgccg atagaatatt ctggaatgcc cgcgcaacgc 120 ggtatcttat aattttatat gtggcctaat tttgaataaa aaaatataaa aatatgtgat 180 tataataata ttttggctga tgaccttttt tttttttttt ggcaacccta tacactgcgg 240 cccgcgggat tgtaatgtat ttaaaaagtg gcccgcaaga tcatttgggt tggccatccc 300 tg 302 // ID Gypsy-13_DPu-I repbase; DNA; INV; 4562 BP. XX AC scaffold_35; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_DPu_; KW Gypsy-13_DPu-LTR; Gypsy-13_DPu-I. XX NM Gypsy-13_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4562 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 741-741 (2010). XX DR Genome; scaffold_35; Positions 995982 991421. XX CC Positions [3379-3795] - Integrase core CC 'AACGT' target site duplication CC LTRs are 97% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 444..1916 FT /product="Gypsy-13_DPu-I_2p" FT /translation="MAQLYKIIVARVYILAFPQTECVTFAGPTATSHTMAT FT GLGAAAAANAAVVAANVIAAPAQPRAFKPYGSPPPFDLEAEKDSFDTWVRR FT WEIFLALSTIDEVLDAGLRPAYKTNLLLSCFSTQTLKTVLSAGLSQAQLAD FT HDQIIGMLRTRCNAGKNRHIWRHQMALCTQRPNQQADNWLCELRDVSRKCD FT FIGDCCARCEATRILGQIVTGVADNEVRIKLLEQGDTLTLDGAIAILRTAE FT TSQLQAANLREESSIYAIRRSTYQAKKAAEGNPGAAEHPEEKKDGVRFEKD FT SSTSCRFCGGPSRHIRRDCPAWGKKCHNCGRDNHFAIACEDGSRNKVGSIF FT ERSLVEGTNEPLTMEFTAKGGTEKASISTLADTGSDIDAIPDKIYYAFFPD FT IPLRPNSQAQSATGSPIMCLGTFQATIDWPADLNESSPATATVHVLILPGL FT STAEQVAGWSQIRQPDSISSSPYNPQRHALLHSSGRAEGVPPMRAG" FT CDS 1861..3795 FT /product="Gypsy-13_DPu-I_1p" FT /translation="MLFFTVVDALKGYHQCALDEASMALTTFATPEGLHQY FT TRLPMGICHAGDDYGRRFHDIFGHLRNTARCMEDLIIYSVTYEEHLELVRA FT VFKTASDHNVGFNRAKTTFAEPTGKFAGYIVSEDGFRPSPDLTRAIREFPQ FT PRNVTNLRSFYGLCQQVGNFSSKIAAALAPLSPLLKKNVEWSWNAQEETAF FT QAARKELAQVQELAFYNPDRTTSLHVDASRLHGLGFILKQQDPATRKWQLV FT QAGSRFLSSAESRYAMIELECLAAAWAMKKFRPFLEGLPAFDLITDHRPLT FT PILNEYSLDKLDNPRLLRLRLQMTRFIFTAKWVPGKINIEADALSRAPVER FT ASEADELAEGPSTFRAKGALIRLIAGSPNQSPDLKLKAVQLAADADPVMTS FT LRETIIAGFLNDKCNLPEQLRPFWNARHQLAIDEDDGTIVMGPRVVIPRSM FT VRQTIQVLLGMHQGASKMRQRARLSVFWPGMDVDIANAAATCDSCNSRLAS FT QPKETIRHHEPATRPFEFLHADLGEDNGRHFLVIVDQFSGWPHFTMFPNKN FT TTARRLIDEFRSFFVSIGGAPIKIWSDNGLFPAAEFQDFLRDWKVGWESSS FT PHYPQSNGRAEAGIKAIKALVAGSRTGGTFDQNKMAKALLLYRNAHP" FT CDS 2397..3623 FT /product="Gypsy-13_DPu-I_3p" FT /translation="MELERARRDSFPGRKKRTGAGTRARVLQPRPDHVPPR FT RCIPPSRPRFHPKATRPSDQKVATSPSRIPVPFVGRVQVCDDRTGMPGGRV FT GHEEIQTIPRRTTGVRPHHRPPAADPNSQRVFTGQTRQPATASPPPPNDPL FT HIHSKVGPGQDQHRSRRTLKSASRTRIRSRRASRRTQHLPSKRRVNQTHSR FT ITQPVTRPETESSPVSSRRRSRHDVPPRDNHRRIPERQVQLAGAAATILER FT PPPAGHRRGRRNNCNGPPRSNTSLHGPPNNSSPARYAPRSQQDAPEGPFIR FT VLARDGCRHSQRGSHLRLMQQPASVPTQRNHPPPRTSHAPLRILTRRSGRR FT QRAAFFSHRGSIQWVAPLHHVPKQEHHGTTSHRRIPQLLRLNWRRPNQNLV FT RQRTLPRGRVPRLPA" XX SQ Sequence 4562 BP; 1315 A; 1425 C; 1082 G; 740 T; 0 other; tggcgcagtt ggtttaacga aacccccctt tacctctctg tcacccagat gttccagcga 60 aaacagccga agacacaagg cccccagcag cccccaaacg tctagaaaac ctcgaggaca 120 tcgacagcaa gaaaccactc aacaagatac agtgagacaa accctacgca atcttttaat 180 cagtgagaca caatcatcaa aatccacatc tttacccaca ggtcgcgccc aatttttcaa 240 gccaatagtt ccgaaatcgc ccacggggcg cgcaagacca aagtcgtaaa accagaatcg 300 aagggcctat cgaccgtaac atcctacacc atcatccgac gccatcacag ggcattaaca 360 cccgaatcga gccccaacgg caacaaagaa acaaaaccat cgcgagttca gttaagccaa 420 aacccggtaa gaaaccacca accatggcac aactttacaa aatcatcgtc gcgcgtgtct 480 acattctcgc tttcccccaa acagagtgcg tcactttcgc cgggccaaca gcgacgagcc 540 acacgatggc cacaggacta ggagcagccg cagccgcaaa cgcagcggtc gtagcagcaa 600 acgtgatagc ggcgccagcc caaccaagag cattcaaacc atatgggagc cctccaccat 660 tcgaccttga agcagaaaag gactctttcg acacctgggt caggagatgg gagatattcc 720 tcgccttgtc aaccatcgac gaggtgctgg atgccgggct ccgcccagcc tacaaaacca 780 atctgttatt gtcctgcttc tccactcaaa cactcaagac agtgctatca gcaggccttt 840 cacaagccca attagccgat cacgaccaaa taatcgggat gctgcgaaca cgctgcaacg 900 ccgggaagaa ccgccatata tggcgccacc agatggcgct gtgcacacaa cgacccaacc 960 aacaagcaga caactggctg tgcgagctgc gggacgtcag caggaagtgc gacttcatcg 1020 gcgattgctg cgccagatgc gaagcaacaa ggatccttgg acaaatagtc accggcgtag 1080 cggacaacga agttcgcata aagctgctag aacaaggaga caccctgacg ctcgacggtg 1140 cgatcgcgat cctgcgtacg gccgaaacct cacagctcca agcagccaac ctgagggaag 1200 aatcttcaat atacgccatc cggcggtcaa cgtatcaagc caagaaagca gccgaaggca 1260 acccgggcgc agccgaacat cccgaagaga agaaagacgg tgtaagattc gagaaagaca 1320 gctccacatc atgccgattt tgcggcggac catcccgtca catcagacgc gattgccccg 1380 cctgggggaa gaaatgccac aactgtggac gagacaacca cttcgccatt gcttgcgaag 1440 acggctccag gaacaaagtc gggtcaatat tcgaaaggag tctagttgaa ggaacaaacg 1500 agccactcac gatggaattc acagccaaag gaggaacgga aaaggcgtcc atatcaaccc 1560 tcgcagacac cggatcagac atcgacgcca tccccgacaa gatctactac gccttcttcc 1620 cggatatacc gctaaggccc aatagccaag cccagtcggc gaccggcagc ccaatcatgt 1680 gtctaggcac ctttcaagca acaatcgact ggcccgccga cctcaacgag tccagcccag 1740 caacggccac agtccacgta ctgattctgc ccggattatc gaccgctgaa caagtggctg 1800 gttggagcca aattcgacaa cccgactcca tttcaagcag tccgtacaat ccccagaggc 1860 atgctcttct tcacagtagt ggacgcgctg aaggggtacc accaatgcgc gctggatgaa 1920 gcctccatgg ctttgacaac ctttgccacc ccggaaggcc tacaccagta cacccgactg 1980 ccaatgggaa tctgccacgc cggagacgat tatggacgac gattccacga catcttcggt 2040 cacctccgca acacagccag atgcatggag gacctcatca tctactccgt cacgtacgaa 2100 gagcacttgg agttagtccg cgcagttttc aagacggcca gcgaccacaa cgtcggcttc 2160 aaccgcgcca agacgacctt tgcagagccg accggcaagt tcgccggcta catagtgtcg 2220 gaagatggct ttcgcccaag cccagatttg acgcgtgcca tcagagaatt cccacagcca 2280 cggaacgtca ccaaccttcg gtcattctat gggctgtgcc agcaggtcgg taacttcagc 2340 agcaaaatag cagcagcgct agcccccctc tccccactgc taaagaaaaa cgtggaatgg 2400 agctggaacg cgcaagaaga gacagctttc caggccgcaa gaaaagaact ggcgcaggta 2460 caagagctcg cgttctacaa cccagaccgg accacgtccc tccacgtcga tgcatcccgc 2520 cttcacggcc taggtttcat cctaaagcaa caagacccag cgaccagaaa gtggcaacta 2580 gtccaagccg gatcccggtt cctttcgtcg gccgagtcca ggtatgcgat gatagaactg 2640 gaatgcctgg cggccgcgtg ggccatgaag aaattcagac cattcctcga aggactaccg 2700 gcgttcgacc tcatcacaga ccaccggccg ctgaccccaa ttctcaacga gtattcactg 2760 gacaaactag acaacccgcg actgcttcgc ctccgcctcc aaatgacccg cttcatattc 2820 acagcaaagt gggtcccggg caagatcaac atcgaagcag acgcactctc aagagcgcca 2880 gtcgaacgcg catcagaagc agacgagcta gcagaaggac ccagcacctt ccgagcaaaa 2940 ggcgcgttaa tcagactcat agccggatca cccaaccagt cacccgacct gaaactgaaa 3000 gcagtccagt tagcagcaga cgccgatccc gtcatgacgt ccctccgcga gacaatcatc 3060 gccggattcc tgaacgacaa gtgcaacttg ccggagcagc tgcgaccatt ctggaacgcc 3120 cgccaccagc tggccatcga cgaggacgac ggaacaattg taatgggccc ccgcgtagta 3180 atacctcgct ccatggtccg ccaaacaatt caagtcctgc taggtatgca ccaaggagcc 3240 agcaagatgc gccagagggc ccgtttatcc gtgttctggc cagggatgga tgtcgacata 3300 gccaacgcgg cagccacctg cgactcatgc aacagccggc tagcgtccca acccaaagaa 3360 accatccgcc accacgaacc agccacgcgc cccttcgaat tcttacacgc agatctggga 3420 gaagacaacg ggcggcattt tttagtcatc gtggatcaat tcagtgggtg gccccacttc 3480 accatgttcc caaacaagaa caccacggca cgacgtctca tcgacgaatt ccgcagcttc 3540 ttcgtctcaa ttggaggcgc cccaatcaaa atctggtccg acaacggact cttccccgcg 3600 gccgagttcc aagacttcct gcgtgattgg aaggtcggat gggagtcatc atcaccgcat 3660 tacccgcaat ccaacggcag agccgaagca ggaatcaagg ccatcaaagc cctcgtggcc 3720 ggatcacgaa cgggcggcac gttcgaccaa aacaaaatgg ccaaagcact cctcctctac 3780 cgaaacgcac acccgtaggg gcctccagtc gcccgcacaa ctaatcttca accgacccat 3840 ccgcgacggt ttaccagcgc acaggagatc cttcgcgcca gaatggcaaa gagaagccag 3900 gaaaatcgag cagaggcaac gaaaagcgct ggagaagacg acaaactact acatccgcga 3960 cgcaaaggat ttgccggaat tgaccattgg cgaccacgta ctcatccaac atccagattc 4020 caagcgatgg actacaccag gagtggtggt ggatacgggc ccaaaccggg attacatggt 4080 caaaacacca tccggccgaa tttttcggcg aaatcgccgt atgctccgca aaagaacggt 4140 ggtaatgccg ggaacgattc caccagccga aaacgaacca gtagcagcag tggaaccaat 4200 tcgaccaatc cctaccatcg aaagagcagc gccgatcaca acacagccgg agaagagaac 4260 ggacgggcag ccaggccagt aagttggagt cgggcgcggc cgaggacgcg gccggattcc 4320 acccatacgc cagtcaaccc gcatcagcgt gccaagtaac cgatacccgg cagaagagtg 4380 gaccaaataa caaaacaaga atccaatcca accaaaccag aatcaccata ccatgtttat 4440 gttccaccat tcagtatatt gttgtgtttt tttttttatt aagcgccaat ccctcccaag 4500 aagtaccaga agggacgcgc cttcaaaaaa aaagaaaaaa aaaaagagaa aagaaaaaga 4560 ca 4562 // ID Chap2a_Cis repbase; DNA; INV; 360 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; Chap2a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-360 RA Smit A.F.; RT "Chap2a_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000096; NACATGTN duplications; small region derived from coding CC region, matches Charlie1 best. XX SQ Sequence 360 BP; 106 A; 66 C; 94 G; 94 T; 0 other; caggggttct taaactgggg ggcgcgcccc ccttgggggg cgtgaaagct tctcaagggg 60 ggcgcgagca tgacagtgac atatttatgt gaaagtggat tttcggcact tgtgagtata 120 aagacaaaat gtcgtaacaa actggaatgc gaagcagatc ttcgatgttc gttgtcttca 180 acaaagcctc gaattaaacg actagtttcc caaaaacagt tgcatccttc acactaagca 240 gggtaatggc aaataaatta ttgttctatt tttcgtaaat ttgatttttg agtaggtgag 300 gcgtgagcat gaatcacaaa tctcaagggg ggcgcgactg aaaaagttta agaaccactg 360 // ID BEL-598_AA-LTR repbase; DNA; INV; 419 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-598_AA_; KW Pao_Bel_Ele216; BEL-598_AA-I; BEL-598_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-419 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 419 BP; 141 A; 60 C; 80 G; 136 T; 2 other; tgttaggaaa ggatgttgcc cacgataccc ctcgatccat gcaaatgacc caacgccaat 60 tggcmgttgg atatgtcata ggcttgtcaa agcttaaagg aggataggta gattattgtt 120 katgaaatac agaaagagaa ttgttgattt tgttatcacg gtaaatttgt tattaaaaaa 180 agttgaattc atttaaaatc taatgtttaa atttatatac tttagatata gaaattctgc 240 aattttagtt ggattgatat tctgtggcta ttgaacacta tagaagacaa attgtaagac 300 aggtaatttc cctaattgaa atggttttct aagaaatata tatattgcag cttcgagctt 360 actctaccca aacgaacgag ttttgctacg cggattgtcc gaaaaatagt gtgaccaca 419 // ID Shinagawa-11_AAe repbase; DNA; INV; 2258 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-11_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2258 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 848-848 (2011). XX DR [2] (Consensus) XX CC >92% identical to consensus. 9-bp TSDs. Subterminal inverted CC repeats are 137 bp long and composed by degenerate repeats. CC Related non-autonomous elements, named Shinagawa, are found in CC Aedes aegypti and Culex quinquefasciatus. XX SQ Sequence 2258 BP; 651 A; 423 C; 372 G; 811 T; 1 other; ggtggtaggt catttggcat aaagtcgttt ggcataaagc cgtttggcat aaagtcgttt 60 ggcataatgg tcatttggca taaagtcgtt tggcataacg gtcgtttggc ataatggtcg 120 tttggcataa tagtcgtttg gcataatgag tctgaaacca agaatttctt aagatgacat 180 tcgtttttac gtttctattg aatcttatct gatkacatca ggcttgtttt ggagtcaatt 240 gacataaaat gacactttgt tcaataatca tatgctcttg aataaactaa agcatattag 300 gaaagttatt gccaatagta ttttattatt tatccagaaa ttacccttct ttcaaatatt 360 aattcttctt ttgactttcg ctattggaat aaatttttgc actgaagatt ttgatatatt 420 tagcacaaat ttacccttct ttcaaacgtt ctattttttt tttctaaata tgatgtaatc 480 ataattatag gtatagaaac tgttgattca aaatttaaat aaatttcacc ttcttttata 540 cataggctgt tctttggagc tatattttta aatattttat tgttcccaac tgacaaaagg 600 gcggcattcg ataacattga tctgctcaag ttatcagcga aaagagacat cgattactgc 660 agttacttcg atgggttgtc ctacttttca ccgaatgtcg gttccccgaa tgtcgtttcc 720 ccgaacgcca gttccgaatg ccagttcccc gaatatccca tttccccgat tagccaattt 780 ccccgaaaag tttttagcac tcataactgt cgtaacttta tacatttcag ggtggtgaac 840 gaactggcca tctaatatgc acccttcttt atttaattgg cggttctttc gagttttacc 900 gtcctcagct tttttgccaa catgtgtata gccgagaatg acagaataac tccttctttt 960 gattgcttat cgttctttct cgttcactat ccagccactg aagattatta tagcacctat 1020 ttaaactgca ccacttttcg gggaaaccgg tcattcgagg actggcattc ggggaaccgt 1080 cattcggaga aacgacattc ggggaaaagt atcacaatca cttcgatgat tctgaacgca 1140 attgttaatg tcttttcatt ttattatgcc gcaatagttt tcggcgtcca atcttctatt 1200 ggtttaacca taaattttgc cttcttttaa aaataggctg ttctttatat ttatactgtt 1260 ttaaatttta atatttagta aagctaaatg tcaaccattt ttatattttg ctgttttatc 1320 aattcttaca agacgtgctg ttataatgat aattacttta gaacctgcca atattcaact 1380 tttttttttg caggcgcatg atctcctttc gagatatgct ggcaaaaata ttatctcaat 1440 cacaaatttt cccttctttc gataatagac ggtgtttttg atgtacactt ccaaaattaa 1500 aatttatgtt gaatcgttga aaagttttgc cacaaatatt ttcttttaaa aatgtgttgt 1560 tcctttgagt tatgctgtga gaagattttt tgcgttagag ccttaaacat tttaaacgaa 1620 aaataatctg ctctcgtata taggctgttt ttgaatatga tgcagtaata gattttaaga 1680 ttagagcttt taatattttg caaataaatt tcccttcttt tatcaagtga ttttcgtcaa 1740 gtgttacgtc caaactggga cagagcttga tctgcagcga aatattctat gagcgttccc 1800 tttattaata actgcgaact ttctttggca atttactatt ctcaaatatg tatatcgcaa 1860 caagctcaaa gatactctat gttctgggat gtagagaaaa ttattattcc gaaaagatct 1920 tccaccagac ccgtcacgag ttttatttag taatgctctc taatgtcaca acaattttga 1980 cattgatagc tgaaaaatct ctgaacgaac accacgcttg ttgagaacct accaaaaatt 2040 tttgctatgg gctcggtggt gttgatctcg gtaaaagctt caaaaatgcc gatattcatt 2100 ctaagtcaat cattatgcca aacggccatt atgccaaatg accattatgc caaacggcca 2160 ttatgccaaa cgaccattat gccaaacgac tttatgccaa acggctttat gccaaacgac 2220 cttatgccaa acgactttat gccaaacggg gtacaatc 2258 // ID I_Ele16 repbase; DNA; INV; 6907 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele16. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6907 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6907 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 380..1708 FT /product="I_Ele16_1p" FT /translation="MSGGSSGPYGGAVGLQPRRNKPDWMLSPNDLGQVMVL FT VLRRKVNDTETSGQRTQDTPLPDTFIVGTSIELAVGTKEARALNATREGRG FT SRYLLRASSANVIEKLSKMTELTDGTKIEIVPHPTLNTVQGIVFDADSINR FT DEKSILEFLESQGVQAVRRIKKRVNGALRNTPLLVLSFRGTILPDHVYFGL FT MRIKVRVYYPSPLLCFNCAAYGHSKKVCQQTTICLRCSTPHDVPEGEQCVN FT PPNCLHCKTDHQVTSRDCPKFKEEEKIIRLKTDQGISFAEARRICAEETKK FT QTFAGIIQNQMHQELAAKDQVIAALQKQVAVLTKELANLKKLLKPSAQNHS FT PAPQEPRPSASSERSVSQTTQSVSNTDRLSRKDQSFISPPARRREIRKSNK FT LDYDVQTRSRSGKRPIETSPTEVINNRGKRISAQPETGNKPTNIETNYG" FT CDS 1704..6794 FT /product="I_Ele16_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANNHQREFFTNFQNDDPAMEKDTQMEERISHLCRNN FT VVSSLTSEEYDRINNHISKPSTSRTYFDSPSARPTTSTLIRDVPVQSEEPL FT AAVDVVRPSPRASISSNYVCIKQSKFGFNSSKRGIDTITRPTVKNTMSPTG FT KPRQGEGECIVTSPIPSMEEHIRWISNGDSHKRITLRVSSWNQRSNSTKTT FT RPSIPYDSPSARPTTSTLTREVPEHSEEPLAAVDVVRPLSRASTSTNYIPK FT KQFNLVTPQTKPDSGATIRPAVYSSISSTSVSHSTTNQGSTTTTVTTTKRL FT RSPTFSDSQCSNPTKTTRPSIPYDSPSARPTTSTLTREVPEHSEEPLAAVD FT VVCPLSRASIHSNSLQTKQFNPVIPPTNPGQEIHYHHRNNNNASPTSVARR FT EGKSSVAYSSRTFLGTEEQGSMGEFRSCSWNQTEASHDVTCKSNAPPSTPR FT NTRFSIDADSPRSSSQMNGINSPRSPSRASDDSEASSLTAQRTESSFAIQW FT NICGLRTHLSELQILIKKYQPLVVCLQETNADYNRIAPSCLGKEYELLLSQ FT SSTHGRQGAGMAIKNGTPFQRIWLQSNIQAVAVQLFAPTTITVVSVYLSPS FT EKDAVKLFGDLLEELPKPILVLGDLNAHHAAWGSKITQPSSRAKTRGEGIL FT DLVVRHNMVVLNNGSHTRIDPVTGASQALDVSICSTSHAAKFSWKTLLDYS FT GSDHLPILLETYCNQFTSKCRSRWIFEKANWQLYEQLTTDSLRPGYTLSVD FT EFTDRIITAAEASIPKTSGNIGQKSVVWWNPEVALAIKSRRKRLRALRRLG FT DDDPQKIVALKQFQEARSHCRKAIHEAKQSSWDAFVESINPDTPASQVWNK FT INKLQGKRQRNTISLNLETGHTNNGPTVANALADEYQQKSSDANYPERFRK FT KHKKDKRTRCVTQRPNLHKRYNTDLTVEELMWALDRRVGSSTGPDNVSYQL FT LQRLPFSGKTALLELFNRIWASGCFPAQWKIGTVIPIPKPGADRSKPEGYR FT PITLLSCLGKLFERIINRRLMTELESTGKLDSRQHAFRAGKGVDSHFAKLE FT SIINLQNDEHVEIISLDISKAYDTTYRPEILHTLTKWRVTGRLMNIISSFL FT TDRFFRVAANGSLSSLRRAENGVPQGSILSVTLFLVAMQPIFNAIPGDTEI FT LLYADDVILIVKGKNHVTIRSKLRKAVAAAVEWSSNIGFTIAPTKSKLLHI FT CHLKHRKRGKAIKIGSNPIPHTRYMKVLGVLLDCKLNFLKHLTSVKQSCRK FT RLNIIRILGYRLKRSSRSSLLKVGSALILSKLYFGIGLTSINFDAMQCTLE FT PIYNDVARQASGAFQTSPITSIMAETGWTDFYSALIQRLCVLAVRLAEKNE FT EANDYPVVQRARRLVLEKTGWSFPKIHETLRSSDREWYVPPPNIDNELKRT FT IRAGTNHSVAVQKYKELVNVRYRTHEQIFTDGSKYGDQTGAGIVMNDSHFS FT YRLPDTCSVFSAEAFALMMAVSKLNGQAKNIILTDSASCLEALAGGRSKHP FT WIQAIERKIVGQNVTFSWIPGHSSIAGNEAADQLAKQGRHQPLQDIPLPAQ FT DVVREIKRRIWTQWESQWHQQRCVLRSIKPTPGKYPDRKNPSEQRVLTRIR FT IGHTRLTHAYLMDHNDPPNCRHCGVQLTVEHILVECRGLQSNRKNCGINGS FT LAEILAYNTESEKAVIKFLKECNLFTKI" XX SQ Sequence 6907 BP; 2214 A; 1728 C; 1427 G; 1537 T; 1 other; cacatcgtac tcgggatacg aaagtacaac aataagtgca aatcgctccg attttatcac 60 caagttgtat ctaagttacc gctcgaaaat accaccatta gtggtcacgt gaaagttcta 120 cttacagtgt acctctgtga agtgaagttc tggtgaatcg agccgaaaag tcgattaata 180 aacagtgaac tgattgtgaa aatatcacgc ggtgtgaccc ttcacccata catacgttgt 240 tgcgatagtg agaacagaac acattgtccc gatcggctag ttgtcccaaa acggaaactt 300 ttttgatagt tcgatcgctt tcatcttcat ttcgaaaact gaacccgata cctaaccagt 360 gagtgattga cccggacaaa tgtccggagg ctcctctggc ccctacgggg gcgcagtagg 420 attacagcct agaagaaaca agcccgactg gatgcttagt cctaatgatt tgggccaagt 480 gatggtgtta gttcttcggc gaaaagtgaa tgacacggaa accagcggtc aaagaactca 540 ggatacacct ctgccggaca ctttcattgt gggaacctca attgaattag ctgtgggtac 600 aaaagaagct agagcactca atgcaacccg cgaaggtcgt gggtcgcgtt acctactgcg 660 tgccagttct gccaatgtca tagaaaagct gtccaaaatg actgagttga ctgacggcac 720 gaaaattgag attgtgcccc atccaacact taatacggta caaggaatcg tgtttgatgc 780 agattccata aacagagatg agaaatcaat cctggagttt ttggaatcac aaggtgtgca 840 agctgtgcga agaataaaaa aacgagtgaa tggagcattg agaaacactc cattgcttgt 900 tctttcgttc cgtggtacaa ttcttccgga tcatgtgtac ttcggcctca tgcgaatcaa 960 agtgcgtgtt tactaccctt ctcccctgct ttgcttcaat tgcgctgcat atggtcactc 1020 gaaaaaagtt tgccaacaaa ccacaatctg cttgcgatgc tcaaccccac atgacgtacc 1080 agaaggagag cagtgtgtta atccacccaa ctgccttcac tgtaaaacgg atcaccaagt 1140 cacgtcacgt gactgcccga aattcaaaga ggaagagaag atcattcgac tcaaaactga 1200 tcaaggaatt tccttcgctg aagccagacg catttgtgcc gaagaaacca aaaagcaaac 1260 attcgctgga attatccaaa atcaaatgca tcaagaactg gccgcgaaag accaagtgat 1320 agccgcactt caaaaacaag ttgccgtgct gaccaaggaa ctcgccaact tgaagaaatt 1380 gttaaaacca tccgcacaaa accactcgcc agcaccccaa gagccacgac cttcagcttc 1440 cagtgagaga tctgtatcac aaacaaccca atctgtttcc aataccgacc ggctatctcg 1500 aaaggaccaa tccttcatct cgcctccggc ccgccggcga gagattcgca aaagcaacaa 1560 actggattac gacgtccaaa ctcgtagtag aagcggaaaa aggcccatcg aaacgtcacc 1620 caccgaagtc atcaacaacc ggggcaaacg tatatctgca caaccggaaa caggcaataa 1680 acccactaac atcgagacga attatggcta ataatcacca aagagagttt ttcacgaatt 1740 ttcaaaacga cgaccccgct atggaaaagg acacgcaaat ggaagaacga atttcacatc 1800 tctgcagaaa caacgttgtt tcttcgctta cttctgaaga atacgataga attaacaatc 1860 atattagcaa accatcaact agcagaacct attttgattc gccaagcgcg aggcctacca 1920 cgtcgaccct gatcagagat gtcccggtac aatccgaaga gcctctggcg gcagtcgacg 1980 tggtccgccc atcccctcgg gcaagtatct cttcaaatta tgtatgtata aaacagtcca 2040 aattcggttt taattcctcc aaacgaggca tagacacaat caccagaccg acagtaaaga 2100 acactatgtc gccaaccggc aaacctcgac aaggagaggg agagtgtatt gttacctccc 2160 ccattccaag catggaagag catatccggt ggatatcgaa tggggacagc cacaaacgca 2220 ttacactgcg tgtgtcctcc tggaatcaac gcagcaactc aactaagaca acaagaccat 2280 caataccata tgattcgcca agcgcgaggc ctaccacgtc gaccctgacc agagaagtcc 2340 cggaacattc cgaagagcct ctggcggcag tcgacgtggt ccgcccgcta tcccgggcaa 2400 gtacctccac taattatata cctaaaaaac agttcaacct tgttacccct caaaccaaac 2460 cagactcagg agcaaccatc agaccggcag tgtacagcag catatcgtcc actagcgtat 2520 cccacagcac aaccaaccaa ggcagcacaa caacaaccgt aactacaaca aaaagactaa 2580 gatccccaac gttttccgat tcgcaatgca gcaacccaac taagacaaca agaccatcaa 2640 taccatatga ttcgccaagc gcgaggccta ccacgtcgac cctgaccaga gaagttccgg 2700 aacattccga agagcctctg gcggcagtcg acgtggtctg cccgctatcc cgggcaagta 2760 ttcattcaaa ttccttacaa acaaaacagt tcaacccagt tatacctccc accaacccag 2820 gtcaagaaat ccactaccat catcgaaaca acaacaacgc atcgcctacc agcgtggctc 2880 gacgagaagg aaaaagttct gtagcatact ccagccgtac atttctgggg acggaagaac 2940 aaggaagtat gggtgagttc cgatcatgct cctggaatca aaccgaggca agccatgacg 3000 tcacatgcaa atccaatgca cctccatcaa ccccaagaaa caccaggttc tcaatagacg 3060 ctgactcacc ccgttcctcg agtcaaatga acggcatcaa ctcgccacga tcgccgtcac 3120 gagcttcgga tgactcagaa gccagtagtc taacagcaca aagaaccgaa tcgtcatttg 3180 ccattcagtg gaatatctgt gggctccgca ctcatctgtc tgaactgcag attctaataa 3240 aaaaatatca gcctttggta gtgtgcctac aggaaactaa tgctgactac aatagaattg 3300 cacctagctg cctaggaaag gaatatgaac tgcttctgag ccaaagctct acacatggaa 3360 gacaaggcgc cggaatggcc ataaaaaatg gcaccccttt ccagagaata tggctccaat 3420 ccaacattca agccgtagct gttcaactat tcgcgccaac aacaatcaca gtggtttcgg 3480 tatacctttc accttcggag aaagatgcag taaagttgtt tggagatctt ttggaagaac 3540 ttccaaaacc cattcttgta ttgggggatc tgaatgctca tcatgccgca tggggcagca 3600 aaataactca accatcatcg agagctaaaa ctagaggcga aggtatttta gacctagttg 3660 tacgtcacaa catggtggta ctcaacaatg gatcgcacac tcgtatcgac ccagtcacag 3720 gagcatccca ggccttagac gtctccattt gcagcacttc acatgcagcc aaattcagct 3780 ggaaaacact actagactat tcaggaagcg accatctacc tattctcctc gaaacgtact 3840 gtaatcaatt tacctctaaa tgccggagca gatggatctt cgaaaaagct aactggcaac 3900 tatacgaaca gctaacaact gattcactac gccctggata tactctatcg gtagacgaat 3960 ttactgatag gattataact gctgctgaag caagcattcc taaaacttca ggaaacatcg 4020 gacaaaaatc tgttgtttgg tggaatccag aagtggccct agcaataaaa tccaggagaa 4080 aacgcctacg ggccctccgt cgtctgggtg atgatgatcc acaaaaaatc gtcgctctta 4140 aacaattcca ggaagctcgg tcacattgcc gaaaagctat ccatgaagcc aaacaaagta 4200 gttgggacgc cttcgtcgaa agcataaacc cagatactcc agctagtcag gtttggaata 4260 aaattaacaa gctacagggc aaacggcaac gcaatacaat atcgttgaat ctggaaactg 4320 gacacaccaa caacggtcca acggtagcta atgctcttgc tgatgaatat cagcagaagt 4380 cgtctgatgc aaactaccct gaaagattcc gaaaaaagca caaaaaagat aagcgcacga 4440 ggtgcgtaac tcaacgtccc aatcttcaca aaagatataa cacggatttg actgtggaag 4500 agctgatgtg ggcactcgac cgacgcgttg gatcttctac tggccctgac aatgtaagct 4560 accaattact ccaaaggcta cctttttcgg gaaaaactgc ccttctagaa ctatttaacc 4620 gtatctgggc cagcgggtgc ttccctgcgc agtggaaaat tggtactgtw atccccattc 4680 cgaaacctgg tgcggatcgc agcaagcctg agggttacag gccgataaca ctcttaagtt 4740 gcctcggaaa attgttcgaa aggattataa accgccgtct tatgacagag ctagaatcca 4800 ctggcaaact agactctcga caacacgctt ttcgtgctgg gaaaggagtc gattcccact 4860 ttgccaaact ggagtctata atcaatctcc aaaacgacga acacgtcgag atcatatctc 4920 ttgacatatc aaaagcatat gatacaacgt atagaccaga aatccttcac actctgacaa 4980 aatggagggt tacagggaga ctgatgaata tcatttctag cttccttact gacagattct 5040 tccgggtagc tgctaacgga tcactgtcaa gccttagaag agcggaaaac ggagttccac 5100 aagggtcaat tttgtctgtc acactattct tagtagctat gcaaccaatt ttcaacgcta 5160 tacctggtga cactgaaatt ctactctacg cagatgacgt cattctgatt gtcaagggta 5220 aaaaccatgt cacaattcgc agtaaattga gaaaagcggt tgcagctgct gtcgaatggt 5280 cttcgaatat aggttttact attgctccta caaaatcaaa gctactacat atctgccacc 5340 taaagcatcg aaaacgtggt aaggcgatca agattggttc taacccaatc cctcacacaa 5400 ggtacatgaa agttctcgga gttctactgg actgcaagct caactttctg aaacacctta 5460 catccgttaa acaaagctgc cgaaaaaggc tgaacatcat acgcattctt ggataccgat 5520 taaaaagaag tagccgatcc agcttactga aagtaggatc agccttgatc ttgtccaaac 5580 tttattttgg aataggactt acgagcatta acttcgatgc tatgcaatgc actttagaac 5640 caatatataa cgatgtggct cgacaggcat ctggagcatt ccaaactagc cctataacat 5700 ctattatggc tgagactgga tggaccgact tctactcagc cttgattcaa cgcctatgcg 5760 ttttggctgt tcgactagct gaaaaaaatg aagaagcgaa cgactacccc gtagtgcaaa 5820 gggcaagaag actcgttctg gaaaaaacgg gatggtcttt tccaaaaatt catgaaactc 5880 ttagatcctc cgatagagaa tggtacgttc caccaccaaa cattgataac gagctcaagc 5940 gcaccataag ggctggaaca aaccacagcg ttgccgttca aaagtacaaa gaacttgtca 6000 acgtccgata tcgtactcac gaacaaatat tcacggatgg atcaaaatat ggtgatcaaa 6060 caggcgcagg aatagttatg aacgacagtc actttagcta ccggttacct gacacatgta 6120 gtgtgttttc cgctgaagcg tttgcactaa tgatggcagt atcgaagctg aatggccaag 6180 cgaaaaacat catactgacc gactcagcca gctgcctgga ggcactcgcc ggtggtcgat 6240 ctaagcatcc ctggatacaa gctattgaga gaaaaatagt aggacaaaat gtaacattca 6300 gctggatccc tggtcactcc agtattgctg ggaatgaggc agccgatcaa ctagccaaac 6360 aaggtagaca tcaaccactt caggacatcc cgttacctgc acaagacgtt gtacgagaaa 6420 tcaaaagaag aatttggaca caatgggaat cacaatggca tcaacaacgt tgcgtactta 6480 ggtcaatcaa accgacacct ggaaaatatc cggataggaa gaacccctcc gagcaacgag 6540 tcttgactcg catccgaata ggtcacacaa ggctcacaca tgcgtaccta atggatcata 6600 acgaccctcc caactgcagg cactgtggag tgcaactcac ggtggagcac atattggtcg 6660 aatgtagagg actccaatca aacaggaaaa actgcggtat caacggttca ctggccgaaa 6720 tactggcgta caatacagaa tccgaaaaag cagtgataaa atttcttaag gaatgtaatc 6780 tctttacaaa aatttaacta actttgtgat tatatatatg tgtaatagta ataataatta 6840 atcagacacg aatgccatga caaatggtaa agtgtcttaa ataaaacata ataataataa 6900 taataat 6907 // ID BEL-43_CQ-LTR repbase; DNA; INV; 290 BP. XX AC AAWU01000931; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-43_CQ_; KW BEL-43_CQ-I; BEL-43_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-290 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 240-240 (2011). XX DR GenBank; AAWU01000931; Positions 38394 38683. XX SQ Sequence 290 BP; 90 A; 77 C; 75 G; 48 T; 0 other; tgttcggtac gaacgaagcc agaaacccca ccgtgacgac catctgcgtt gccgacgacg 60 agggagaaac gtcatttgca acacaccaag agcgcgcggg cagcgcgcca agtaagtccg 120 catacaacct ccgatcgaga aggacgaagc atagtttttt agagtagtag caggaggaag 180 aaggaaaagt gggagaaata aaagtagaaa acgtaatctc cggtgttttc cgttccccgc 240 aactaacaac catacagtcc accgtttccc atgtcggcca ggcccgaaca 290 // ID Copia-2_ACA-I repbase; DNA; INV; 4370 BP. XX AC AEYA01000638; XX DT 23-MAR-2011 (Rel. 16.03, Created) DT 23-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Acanthamoeba castellanii genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_ACA_; KW Copia-2_ACA-LTR; Copia-2_ACA-I. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-4370 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Acanthamoeba castellanii genome."; RL Direct Submission to RU (23-MAR-2011). XX DR Genome; AEYA01000638; Positions 123442 127811. XX CC Positions [1588-2082] - Integrase core CC 'AGTTG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 670..2913 FT /product="Copia-2_ACA-I_1p" FT /translation="MWCDKPDCAHGKCGGEGCGLLKQDKGGERHGHGWKSS FT KGPKQGPQEGDETTCHYCHKPGHWRNKCLKLNWGDGGDGGGGGGDSHKVNA FT AQAQHESDEDEEIILSIANQVNAVADETDQWYLDSGATCHVTCRCELLHNY FT QPSRSTINLVLGNDFKCCVKGTGTIHATIVVDSTMKTIVLMDVYYAPELAK FT NLVSMAQIAKLGCTILIEAAGCHVVDGRGRAVLQGTAQGNMLVLPLNPLLP FT EQAYATCIDLSLNTLHWRFGHACKRRLRTMLKAKGIMPTTKSLSPCTICIK FT NKTTHKPIHKGPAARSTTPMEQLHTDVCRPFPITTKTSKHYFISIINNTTH FT FTVIITIHKKSNIKAMLHSHLAATPADLKCQHLHSDQGGKYTSEQVQELLC FT TTSIIHKATSSHTPEHNGIAKQFNRTIVEMVQCMLHNSGLAQCYWGEALHT FT TTAIYNHLPTNANNGASPLDQWDANHAGALKDMHQFRAKVEVLVLPSKRTK FT LSVHTHTGVYLRPADSGTSNHQVLVQGHILMTCKVVFPYNTHDMLMGDDMS FT IAVAPAIVNPLAKEANVGSALQWASIDQSTTLAPSVLGASGSAETSPPHSP FT SQVELSIIEWTPQGELPMQVKPPDPLTPPDMPVMDDDPTLKESEEEEHAAP FT PSLEANDIVVAVKKKWPMPKKPPAHKRKKNTKFYNDDFCAFTAEVLQALDV FT SDSACTPNSYNEAVALPEAAEWIKAMKAKYGTLDCNGMFELALLPVGL" FT CDS 3037..4371 FT /product="Copia-2_ACA-I_2p" FT /translation="MFLLVVMMEHLQQLLTLAVILNLEVHQMDVENTFLNA FT TLSVHIYVEQPQGFIDLEWLDHVCLLHKSLYGLHQAPLKWNRMMDRHLRSH FT HFLPTHTNPCIYMLQESSALVIITVYIDNCVIVTPLKHIERAKMVLHDSFR FT MKDLGQARSILGMEVMRDREEGALYLRQAGKIMEILHDFGMADAKSIGTPM FT DPGLILHKLKVTAPGHLGKPYRSVVGRLSYLSQATRPDIAFAVNVLSRHVN FT GYDQSHWGAVKHLLQYLRATKDLTIKYTTSGSHSQSGGLLPVGYADADWGR FT DVKTRRSMSGILFTLGGSPIQWGARTQKCVATSTMEAELNAIAETIKEAAH FT LNRITRELFPHIDATLQLYCNNQSAIVIAKSKPGEHTQRTKHYALKLAFLH FT ESVSKLNVDFKYLPTEVMPADVFTKALGRARVVELRSLINLVEPKIESKGR FT " XX SQ Sequence 4370 BP; 1094 A; 1241 C; 1152 G; 883 T; 0 other; ggttatgggc ccacactcaa ctggaagatg aacccatttg aagctcaatt gaaacttctc 60 aactctaaac tggtactcaa atccacaagt ggttatgtga catggcgtga tgatttgtct 120 gtgactctac agtcttgagg actgttcgag tacgcctttg ggtccatgga ggaactgaag 180 gggaggaagg atgagagtga agtgagtttg gaactttgaa aggacgattt cagacagaag 240 gtgcaacaga ccatgggcta catctgcatc tgtctggatt ggccattctg agccatgatt 300 aaggggcttg agaccaacct gtggaagatg atggctaccc ttgatgtgaa tcttgtgctg 360 aaggctaatg tgagcaagtt gacactgttg acacagctca tcaacatcaa gtgcaagact 420 ggagaagtgc ttgactccta ctttggctgg attgtcaaca tcaacatgga acttgtcaac 480 aatgacttgg cgctccctga agtgttcatc cttatggtga tcctgaatgg gctgctggtt 540 gagtacgata ccgtgtggcc agtaattgag gcgcagacca agatcaacct gcatgatgtg 600 atgagttgac tccacaactg tgaggcaaac ttgaacacat gttctgaaca gaatgaggct 660 gccaacacca tgtggtgtga caagcctgat tgtgcgcatg gcaagtgtgg tggcgaaggt 720 tgtggcctgt tgaagcagga caagggtggc gagcgccatg gccatggttg gaagtcaagc 780 aagggcccca agcaggggcc acaggagggt gatgagacca cctgccacta ctgccacaag 840 cctggccact ggaggaacaa gtgtctgaag ctcaattggg gtgacggtgg tgatggtggc 900 ggcggcggtg gtgacagcca caaagtgaat gccgctcagg cgcagcatga gagtgatgag 960 gatgaggaaa tcatcctgtc catcgccaac caggtaaatg ctgtggctga tgagactgac 1020 cagtggtatc tggattcagg tgcaacctgc catgtcacct gtcgttgtga actgctccac 1080 aactaccagc ccagccgcag caccatcaac ttggtgctgg gcaatgactt caagtgctgt 1140 gtcaagggga caggcaccat ccacgccacc attgttgtag acagtacaat gaagaccatt 1200 gtgctcatgg atgtgtacta cgcccctgag ctggcgaaaa atctggtctc tatggcgcaa 1260 atcgccaagt tgggttgcac catcctcatt gaagccgcag ggtgccatgt ggtggatggg 1320 cgtggacgcg cagtccttca agggacagcc caaggcaata tgctggtgct gccactcaac 1380 ccattactcc cagagcaggc atatgccaca tgcattgacc tctcactcaa cacactacac 1440 tggcgttttg ggcatgcatg caagcgtcgc ctgcgcacaa tgctcaaggc gaagggcatc 1500 atgcccacca ccaagtcact ctcgccatgc accatttgca tcaagaacaa gacgacacac 1560 aagcccattc ataaggggcc agcagcaagg agcactacac caatggagca gctccacact 1620 gacgtgtgca ggccatttcc catcaccacc aagaccagca agcactactt catctcaata 1680 atcaacaaca ccactcattt caccgtcatc atcaccatac acaagaagtc caacatcaaa 1740 gccatgctgc acagccatct ggctgcaaca cctgcagacc tcaagtgtca acacctccac 1800 agtgatcagg gtggcaagta caccagtgag caagtgcagg agcttctgtg caccaccagc 1860 atcatccaca aggccacatc atcacacact cctgaacaca atggcatcgc caaacagttt 1920 aacaggacca ttgttgaaat ggtccagtgc atgttgcaca acagtggtct tgcccagtgc 1980 tactggggcg aggcactgca caccaccact gccatctaca atcacttgcc caccaacgcc 2040 aacaacggcg catcaccact ggatcagtgg gacgccaacc atgcaggggc actcaaagac 2100 atgcaccagt tcagggccaa ggtggaggtg ctggtactgc ccagcaagcg gaccaagcta 2160 tctgtgcaca cacacactgg agtttacctc aggccagcgg acagtgggac cagcaaccat 2220 caggtgctgg tgcaagggca catcctcatg acatgcaagg ttgtcttccc ctacaacaca 2280 catgacatgc tcatgggcga tgacatgtcc atcgcagtgg cgcctgccat tgtgaaccca 2340 ctggccaagg aggccaatgt gggctctgct ctgcagtggg cctccattga ccagtccacc 2400 acactggcgc catctgtgct gggcgccagt ggatctgctg agacttcacc cccacactca 2460 ccaagtcagg tggagctgtc catcattgaa tggacgccgc agggcgagtt gcccatgcag 2520 gtcaagccac cagaccccct cacaccgcct gacatgccag tgatggatga tgaccccact 2580 ctcaaggaga gtgaggagga ggagcatgct gctccaccca gcctggaggc caatgacatt 2640 gttgtagcgg tcaagaagaa gtggcccatg ccaaagaagc caccagccca caagcgcaag 2700 aagaacacca agttctacaa tgatgatttc tgtgcattca ctgctgaggt gttgcaggcg 2760 cttgatgtca gtgacagcgc ctgcactccc aacagctaca acgaagctgt tgccttgcct 2820 gaggccgccg agtggatcaa ggccatgaag gccaagtatg gcacacttga ctgcaatggc 2880 atgtttgaac tagctctgct gcctgtgggg ctgtgagcca ttgggttgag atggctattc 2940 aaaatcaagt gcaaggccag tggcgtcatt gattgtctca aggtgagatg agttgggaag 3000 ggctacactc agtgactggg cattgactat gatgagatgt tcttgctggt tgtcatgatg 3060 gagcatcttc aacagttgct caccctcgcc gtgatcctca atcttgaggt ccaccagatg 3120 gatgttgaga acaccttcct caatgctact ctctctgtcc acatctatgt tgagcagccg 3180 caaggattca ttgacctgga gtggctggac catgtctgcc tgctgcacaa gagtctatat 3240 ggactccatc aagcgccact caagtggaac aggatgatgg accgccatct gcgcagccac 3300 cacttccttc ccactcacac caacccctgc atctacatgt tgcaggagtc aagcgccctc 3360 gtcatcatca ctgtctacat tgacaactgt gtgatagtca cccctctcaa gcacatcgag 3420 cgcgccaaga tggtgctgca cgacagcttc aggatgaagg acctgggcca agccaggtcc 3480 atcctgggca tggaggtcat gcgtgaccgc gaggagggcg ccctctacct gcgtcaggcc 3540 ggcaagatca tggagattct ccatgacttt ggcatggctg acgcaaaatc catcggcacg 3600 cccatggacc ctggcctcat cctccacaag ctcaaggtca cggcaccggg gcacctcgga 3660 aaaccatacc ggtcggtagt ggggcggctg agctatctgt ctcaggccac acgtccggat 3720 atcgcgttcg ccgtcaacgt gctgagccgt cacgtcaacg gctacgacca gtcccattgg 3780 ggcgccgtca agcaccttct tcagtacctg cgcgccacca aggacctcac aatcaagtac 3840 accaccagtg ggtctcattc ccagtctggc ggcctgctac cggtaggcta cgcggacgca 3900 gactggggca gggacgtcaa gacacggcgc tccatgtcag gcattttgtt cacacttggt 3960 ggcagtccca ttcaatgggg cgcgcgcact cagaagtgcg tagccacctc cacgatggag 4020 gcagagctga acgccatcgc cgagacaatc aaggaggcag ctcacctcaa caggatcacg 4080 cgagagcttt tcccacacat cgatgccaca ctacaactgt actgcaacaa tcagtccgcc 4140 atcgtcatcg ccaagtcgaa gcccggcgaa cacacacagc gcaccaagca ctacgccctc 4200 aagctcgcgt ttctacacga aagtgtgagt aagctgaacg tcgacttcaa gtacctcccc 4260 acagaggtca tgcccgctga cgtgttcacc aaggcgctgg ggcgcgcacg cgtagtcgag 4320 ctacgctctc tcatcaacct cgtcgaaccg aagatcgaga gcaaggggcg 4370 // ID Gypsy-30-I_NVi repbase; DNA; INV; 11685 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-30-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-11685 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 996-996 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 857..3286 FT /product="Gypsy-30-I_NVi_1p" FT /translation="MSSSDESEKTTINREANITKTRGNRTLDLDDSLLNPD FT LQIINQYSDSQKRNDPIDSALERKASQSYKRLEKSLSKYHVEAPFAKPKAR FT KSIHDRVQSFNKNLFSAYRENKVTDDEEDSIIKQVNTSVPVKQVPIKPIRL FT RESLLKPSSLNQENFKISAVAQANASLEATPPFSNPTLEQRDPTRTEPSDA FT AHELKLAESEKNLSTELLTNSNLSESLNLENIQNLSSSSINSTVENATIIN FT TTETNTQSQEKVKSDVIPENLTKFEVIQANKQIELLSRQSLAESAEEVARE FT AEELRNKFANHLKIQLSPCVKKEVLDDYFQSPENVDELGEKLRKGEKFSSV FT YDYVCRRISNFWGITAFKDSPNNTRDYTENNEQFSTPTGNIINPLQIIAPK FT TRLKFDETINEENSESEIESNKKKMAPQIKGISLKDTLDLIPRFNGSNISV FT TQFVDGCTEARDILPEGNEEDCARLIRMRLFGDALACARGQKFKTIDEIIQ FT FFESNFGSSKTYHEVSGELAKIKQRPTESVVVYSNRLREIEKEIKKAAVRE FT GRATDKNNFNEELEKDCIRFFIRGLRWEIQTRMGDMQTLEKAHKKAIEIER FT DFAYTEENDENLRREEARLKPKETRRVNVVEATPEITCSFCNQVGHAAMNC FT LKFCLQFLSNNRPSQSNGNPNNSRFSSFNGNNNRNNNQQSYNRNNNWNGFE FT RNFRYNNQQNNYHPNNEAISNNNIRDNNNQNLNNSTSNLPTNFFCHYCLKP FT GHYKRNCYTLMRDMQEGNLPTGNGRGNPRAGGEAAASXARPIXETQGASTS FT STRQQST*" FT CDS 9690..10286 FT /product="Gypsy-30-I_NVi_3p" FT /translation="MAICDTSARDQALMRTLLSISMNNSSGPKHPEYIAYK FT DRLYTFILWLEGSSQKPEILAEMGLCYNYYENQVYCFYCDCKMSQLQHGED FT LWIRHAILAPDCNYLRTRKGEEFIEKVYTRVNFNREPPPESLHCHDQDSNS FT DKSDIVVMKPLDCKICFRKEVKILFLPCSHLIACAECAIKLEECYICKQTI FT EHMIHVNS*" FT CDS join(3127..6087,6044..6421,6393..7973,8004..8666) FT /product="Gypsy-30-I_NVi_2p" FT /translation="LLYIDEGYARRKLADGKRSRQSSRGRRGCRFECSPDX FT RDSGSEYILYETTVNLNQKNCAPNVEIDIKELTRPAIFMLDTGSSPNLLKI FT SHVPIYLPINSQEIILLRGITSASIKTLGTITVNYDGLNIYFHIVGDEFPI FT REAGILGSDFFQQTSAKICYERNCLELANIEYPFINDIQNFQNDEYNDERW FT NLKNNASKRVVSERLSKGDSTIDFRRERAKSLENILFDNATLNLTTENQAI FT FENYLSTSSFVQEDEATAPQGGEIVNCAYVQEVPSASAKTLPSLESRRENI FT VEESLLERETDYFDDFEKFNDDLFVNSLGVNETKGIMDNIEMIQSDKYDAE FT KVYVCQNKNDDAYPTERLTDKIIGKIRRFEDLYEYTPDCQESSARELSANE FT NYEDFNTSDYYENDEQYKNKNLEFSAPISLCHRGQAQIAKDDYENYENEMI FT KSCKRIEQDTEIEENCLNEDVECVYFCAQQADDENCVKLKSKIKEMEDCDP FT REYNEQIQEEKQPYSDEHGALTSQISLRESASKREDFERKIRDEAHEKFEP FT MYASQIAEISPTGRPPDEESARATVNVPREVTLKKNVRFRDENLEQYYDFK FT SNLQNSENNSNNIIRGISNDDEVSKINIRKKRVISKGAIPKSIKLRKKIKK FT MLPSDNNYEDFTDIFSKTRPSKFHNSIKKDENPEVFKFNAISEALENIITT FT ERKENKNQNYSISLEKDNSQETEKYLHECGRIEELLNDCQNPYGKQEHEVH FT GLCVISSTPAYENAEGAYCSVMLIDSAAAEVLRCEKVRALLRLDHLKPRER FT EAIEQIVDTYHEQFHLPGEKLTTTHIIQHRIPTIDNRPVNVKTYLPPRAHR FT GPINEKIKEFLDGGLISPSNSDYNSPLWCLRKKDDSHGNPRYRIVLDYRKL FT NEVTITDNYPLPNIQDIFDQLGGSTYFTVLDLASGYYQIPLHPEDRHKTAF FT TVMNSGFYEWNVCPQGLTSMPATFMRCINRIIHLCLRHLCDALIELYNNTP FT MKEGEKPESALTVSPEDAKELDSGPLDVCVYLDDIIIYAKTVEEHNEKFAK FT LLKRLKAANFKLSPDKCDFLRTEVSFLGHLISNQGICPDPKKVEAVRKKIS FT ATENCERKFPRPKTVKNIRQFLGLAGYYRRFIDGFSKIAKPLSNLLKQNVP FT FEWNEKAQEAFDILREKLCKEPVLIFPDFSKPFILTTDASLTCIGAVLSQG FT TIGQDRPIAYASRVLNDAETRYDTYSREALAMVFGIRHFRHYLLGTKFVVC FT TDHLPLCWWKDSRDPDSRATRWRLKLSDYNFEVVYKPGRINLNADALSRNP FT IEENTESVMLIRESTESHAETRVENGDQSPLESAATEVATGRMTRTRAGKL FT SRPDYRENRKKLPNANKGTKELRGKERSEENDSVSANKTVEKHVEIRTNKK FT SYKLPVSNQSTSETINLDDIGTHNVSDQDQETMINHEDNDSVINPNTQKRS FT KSYGKMFAETREQLSMRKDNYLYFLSSMHQPCDEGARLLESRKEVPRLHEL FT TAGELRIMPKSNKRHFIIILRGLERESISKITENLVSGLKQLKSFILKEQI FT KSISLAKSVSIENVRWDDVIDKIRSTLRGSQVKIIICKGTITYPSLSERAK FT IIEESHCSAIGGHKGVNKTISAKSTYKCIFGSVYLVNKLVFMKRLAKRFRI FT TQFKTTAIHPQSNGSLERSHHVLSEYLKQFISQDEQWDQWLEMAAFSYNTS FT VHEATQHTPYELVFGRLARSPSSDPLETEDRLPTYADYIIELTTRLNNIQE FT IAREKLIAAKWRAKYYYDRSANPQDFKVGDYIWLLKGGKIHKLKDQYEGPY FT LVVDILRNGNIKIQLKTNKYKVVHMNRVRISYIEPEKT*" XX SQ Sequence 11685 BP; 4477 A; 2018 C; 2193 G; 2993 T; 4 other; ggttgtttac ttataatttt ttggtgcatc gcaggccggg aaatattttg aaacgcaaaa 60 aattcagaca tttttgagta attattaaac aaaaactcgt gaaagtggaa attaaaatcg 120 tgtagaaact aagaaagtga tcatttaaca caatatttta ttttgcgcaa gccttttcct 180 attcctccgc aagcaaagaa tcgctgagta gcgacttgct ttcgttttca caattcctct 240 gcaagtaaat aatcactgag tagcgacttg ttttcgtttt cacaattcct ccgcaagcaa 300 agaattgctg agtagcgact ttctttcatt tctacaatcc tttcgcaagc aaagaatcgc 360 tgagcagcga cttgctttcg ttctcacaat ccttccgcga gtaaagaatc gctgagtagc 420 gacttactcg ttatcaacaa tccttcttaa aacagagaat cgcctctttg tgcgacctgt 480 tcgtcagtcc ctaactccgt gatcgattaa taaccttatt ttccttacta gtacgcaaaa 540 taaaataaat gaatactaca aataatccga ccgaaattcc gcgaaaaaat agaagggctg 600 gcagaaaagt aagagcaaca attaagaccg aaagcaccga agaatcgggg gaaatagtta 660 attctttagc cgaaaatcat caaattaccg agaaaacaag cgaatcgaaa ccgaaaagtg 720 acgtaaaatt atcagaattt caaatcgaga taggaaaaaa gttcgacgcg ctattcgcat 780 tcgctgaaaa agaaattaaa ttaaacaagt taaaagcaca aaaattaaaa aatgaaagtt 840 gtaataaaaa ttaattatgt cgagtagtga cgagagcgaa aaaaccacaa taaatcggga 900 agcgaatata acaaaaactc gtggaaatcg aacacttgat ttagacgaca gtcttttaaa 960 cccagatttg caaataataa atcaatattc ggattcgcaa aagagaaatg atcccataga 1020 ttcagcgtta gaaagaaaag caagccaaag ttataaaaga ttagaaaaga gtttatcgaa 1080 gtatcacgta gaagcccctt ttgctaaacc aaaagctaga aagtcaattc acgatagagt 1140 tcagagtttc aacaaaaatt tatttagcgc ttatagagaa aataaagtta cggacgacga 1200 agaagacagt attattaaac aagttaacac cagcgttcca gttaagcaag tacctataaa 1260 accgattcga ttaagagaat cattattaaa accgagtagc ctaaatcaag aaaattttaa 1320 aatttccgca gtggcccaag cgaacgcatc tctagaagcc acgccccctt tctcaaaccc 1380 cactctcgag cagcgagatc ctactcgtac agagccgagc gacgccgctc acgaattaaa 1440 attagccgaa tcagagaaaa atctatcgac cgaattgcta actaactcga atttgtctga 1500 atcccttaac ttagagaata tacagaattt atccagctcc tcgataaact caaccgtcga 1560 aaacgcgact attattaata cgacagaaac taatacccaa tcgcaagaaa aagtaaaaag 1620 tgatgtaatt cccgaaaatt taacgaaatt tgaagtaata caagcgaaca aacaaataga 1680 gttactatcg agacagtccc tggcagaatc ggcagaagaa gtcgcgcgag aagcagaaga 1740 attacgcaat aaattcgcga atcacttgaa aattcaattg agtccctgtg tgaaaaagga 1800 agttttagac gactattttc aaagtccgga gaacgttgac gaattaggtg aaaaacttag 1860 aaagggagaa aaattttcta gcgtatacga ctacgtctgc cgacggatct cgaatttttg 1920 gggaatcact gcatttaaag atagtcctaa caacactcgc gactacacgg agaataacga 1980 acagttctct acgccaactg gaaatataat aaatccttta caaataattg ctccgaagac 2040 gcgtctaaaa tttgacgaaa ctataaacga agaaaatagt gaaagtgaaa ttgagagtaa 2100 taagaaaaaa atggcgccac aaataaaagg gatatctttg aaggacacgc ttgatttaat 2160 tccccgtttc aatggatcga acatatcagt aacacagttt gtcgacggtt gtactgaagc 2220 gcgagatatt ctacccgaag gaaacgaaga agactgtgcg cgtcttattc gaatgcgtct 2280 ttttggcgat gcattagcgt gcgcaagggg acaaaaattc aaaactattg atgagatcat 2340 tcaattcttt gagagtaatt tcggatcatc aaagacgtac cacgaagttt cgggagaact 2400 agctaaaatc aaacagcgtc cgacggaaag tgtagtcgtt tactcgaatc gccttcgaga 2460 aatcgaaaaa gaaatcaaga aagccgcggt gcgcgaaggg agagcaacgg ataaaaataa 2520 ttttaacgag gaattagaaa aagattgtat tagatttttt attcgaggac ttagatggga 2580 aatacaaact agaatgggtg atatgcaaac actagaaaaa gcccataaga aggcgataga 2640 aatcgagagg gatttcgcgt atacagaaga aaatgatgaa aatttaaggc gagaagaagc 2700 gcgattaaaa ccaaaagaga ctcgtagagt aaatgttgta gaagctactc cggagataac 2760 gtgcagcttt tgtaaccaag ttggacacgc cgcgatgaat tgtttgaaat tctgtttaca 2820 atttttaagt aataaccgtc cctctcagtc aaatgggaat ccaaataata gtcgtttttc 2880 atcatttaat ggaaataata atcgaaataa taaccaacaa tcgtataatc gtaataataa 2940 ctggaatgga ttcgaaagaa atttccggta taataatcag caaaataatt atcatccaaa 3000 taacgaagcg atttcgaaca ataatataag agataacaat aatcaaaatc ttaataattc 3060 cacttcaaat ttgccaacga actttttctg tcattactgt ctaaaaccgg ggcattataa 3120 gcgtaattgt tatacattga tgagggatat gcaagaagga aacttgccga cgggaaacgg 3180 tcgcggcaat cctcgcgcgg gcggagaggc tgccgcttcg artgctcgcc cgatcmcaga 3240 gactcaggga gcgagtacat cctctacgag acaacagtca acttaaacca aaagaactgt 3300 gcgcccaacg tcgagattga tattaaagag ttgacacgtc ctgcaatttt tatgttagac 3360 accgggtctt cgccaaatct attaaaaatt tcacatgtcc cgatttattt accaataaac 3420 tcgcaagaaa taattttgtt aagaggaata acttcagcat caataaaaac cctagggact 3480 ataacagtta attatgacgg tttaaacatt tattttcaca tagtcggtga cgaatttccg 3540 attagagaag caggaatatt gggcagcgat ttctttcaac aaacaagtgc aaaaatttgt 3600 tatgagagaa attgtcttga actagcaaat atagaatatc cgtttattaa cgatatacag 3660 aattttcaaa atgatgaata taatgatgag cgatggaatt tgaagaataa tgcaagcaag 3720 agagtcgtat cggagaggct ctctaaggga gatagtacaa tcgattttcg aagagaacgt 3780 gccaagtctc tagaaaatat tctttttgat aatgcaactc tcaatttaac aacagaaaat 3840 caagccattt tcgaaaatta tctgagtacg agtagttttg tgcaagagga cgaagcaacc 3900 gctccacagg gaggcgagat cgtgaactgc gcatatgtac aagaggtgcc gtcggcgagc 3960 gctaagactc taccgagcct agagagccgg agagagaata tagttgagga gtctctattg 4020 gaaagagaga ctgattattt tgatgatttc gagaaattta atgatgacct ttttgtaaat 4080 tccctaggag tgaatgagac aaaaggtata atggataata tagaaatgat tcaaagcgat 4140 aaatatgacg ccgaaaaagt gtatgtgtgc cagaataaaa acgatgacgc ttatccgacg 4200 gagagattaa cagataaaat tataggaaaa atcagaagat ttgaggattt atatgaatat 4260 acgcccgact gccaggagag cagcgcacgc gagttaagtg ctaatgagaa ttatgaagat 4320 tttaatacaa gtgattatta tgaaaatgat gaacaatata aaaataaaaa tttggaattt 4380 agtgcgccaa tatctctatg ccacagaggt caagcacaga ttgccaaaga tgattatgaa 4440 aattatgaaa atgaaatgat aaaaagttgt aaacgaattg aacaagatac agaaattgaa 4500 gaaaattgtt taaatgaaga tgttgaatgt gtatattttt gtgctcaaca agcagatgat 4560 gaaaattgtg taaagttgaa gagtaaaatt aaagaaatgg aagattgtga tccgagagaa 4620 tataatgaac aaattcaaga agaaaaacag ccctacagcg atgagcatgg agccttgacg 4680 agtcaaataa gtctaagaga aagcgcgagt aaaagagaag atttcgagag aaaaattcgg 4740 gacgaagcac acgagaaatt cgaaccgatg tacgcctccc aaatagccga gataagccca 4800 acaggacgtc caccggacga ggaaagtgca agggccactg taaacgttcc cagagaagtc 4860 acacttaaga aaaatgtgcg ttttcgagat gaaaatctag agcaatatta cgatttcaaa 4920 tccaatttac agaattctga aaataatagt aacaatataa tcagaggaat aagtaatgac 4980 gatgaagttt caaaaataaa tattcgtaaa aagcgtgtaa ttagtaaagg tgcgattccg 5040 aaatctataa aattgagaaa gaaaattaaa aaaatgttac catcagataa caattacgag 5100 gatttcacag atattttctc gaagactcga ccgagtaaat ttcacaattc aatcaaaaag 5160 gacgaaaatc ccgaagtgtt taagttcaat gcaataagcg aggcgttaga aaatataatt 5220 acaactgaaa ggaaggaaaa taagaatcaa aattattcca tatcacttga aaaagataat 5280 tcgcaagaaa cagagaaata tttacatgag tgtgggagaa tcgaggaatt gttaaatgac 5340 tgtcagaatc cctatggaaa acaagaacac gaagtgcatg gactttgcgt aatatccagc 5400 acgccagcct acgaaaatgc agaaggagcc tactgtagtg ttatgctyat agactctgca 5460 gcggccgaag tcctccgctg tgaaaaggtc agggctctat tgcgcctcga tcaccttaaa 5520 ccaagagagc gagaagccat tgagcaaatt gttgacactt atcacgaaca atttcattta 5580 cccggggaaa aacttactac tacgcacata atccagcatc gcattccgac tatagataat 5640 cgacccgtaa atgtcaaaac gtacctccct cctagagcac acaggggtcc aatcaatgaa 5700 aaaattaaag aatttttgga cggcggactt atttcaccgt cgaattctga ttataatagt 5760 ccgctttggt gtttaagaaa aaaagacgat agtcatggta atccaagata cagaatagtt 5820 ttagattatc gaaaattgaa cgaagttact ataactgaca actatcctct tccgaacatt 5880 caagatattt tcgatcaact cggwggttcg acatatttca cggttctaga tctcgcgtcg 5940 ggctattatc aaatcccatt acacccggaa gatcgacata agactgcatt tactgtaatg 6000 aattctggat tttatgagtg gaatgtttgt ccacaaggtt tgacatctat gcctgcgaca 6060 tttatgcgat gcattaatag aattatataa taacacacct atgaaagaag gagagaagcc 6120 tgaaagtgct ctcactgtct cgcccgaaga cgcgaaagag ttagattctg gaccattaga 6180 cgtttgcgtg tacctcgacg acatcattat ttacgcaaaa accgtagaag agcataacga 6240 aaaattcgcg aaattattaa aaagattaaa agccgccaat tttaaattat cgcctgataa 6300 atgtgatttc ctccgcacag aagtaagttt cttaggacat ttgataagca atcaaggaat 6360 ctgccccgat cctaagaaag tagaagccgt aagaaagaaa atttccgcga ccgaaaactg 6420 ttaaaaacat acgccaattt ttgggactag ctggttatta tcgaagattt atagacggat 6480 tttcaaaaat agctaaacca ttaagtaatc tactcaaaca aaacgtaccc tttgaatgga 6540 acgagaaagc acaagaagct ttcgatatac tgcgagagaa attatgtaaa gaacctgtac 6600 taatttttcc cgacttctca aaacccttta tactcaccac cgacgctagt ctgacttgca 6660 tcggagcagt gctttctcaa ggtacaatag gccaagatcg cccaattgca tatgcttctc 6720 gggtgttgaa tgacgcagaa actcgttacg atacgtattc gagagaagcc ttggctatgg 6780 tttttggcat tcgccatttc agacactatt tattaggcac gaaattcgtc gtttgcacag 6840 atcatttgcc actatgttgg tggaaagact ctcgagatcc cgattcacgt gcaacacgat 6900 ggaggcttaa attgtcggat tataattttg aagtcgtcta caaaccaggc cgaatcaatc 6960 ttaatgctga cgcactctct cgaaatccaa tcgaagagaa tacggaatca gtaatgctta 7020 ttcgcgaaag cacagaaagt catgcagaaa ctcgagtcga aaacggggac caatcacccc 7080 tagagtcagc agccacggaa gttgcaacag gacgaatgac gcgtacgcgc gcaggtaaat 7140 taagtcggcc tgattacaga gaaaaccgaa agaaacttcc caatgcaaat aaggggacta 7200 aggaacttag ggggaaggag agatcagaag aaaatgatag cgtatctgct aataagactg 7260 ttgagaaaca cgtcgagatc cgaacgaaca aaaaatctta taaattacct gtcagcaatc 7320 aaagtacgtc agaaacaatt aatctcgacg atattggcac acataatgtc agtgatcaag 7380 atcaagaaac tatgattaat catgaagaca acgattctgt tataaatccg aatacacaaa 7440 agcgttctaa aagttatggc aaaatgtttg ctgagacccg agagcagctg tcgatgcgca 7500 aggataatta tttatatttt ttatcctcta tgcatcagcc ctgcgatgaa ggagcacgac 7560 ttctagagag tcgtaaggaa gtaccgcgat tacacgaatt gactgcggga gagttaagga 7620 taatgccgaa aagcaacaaa cgacacttta tcataatatt acggggactt gaacgcgaaa 7680 gtatctcgaa aattacagaa aatttagttt ccggtttaaa gcaattaaaa tcatttatct 7740 tgaaagaaca aatcaaatca attagtcttg caaaaagtgt ctcgattgaa aatgtacgat 7800 gggacgatgt gattgataaa attcgaagca cactaagggg atcgcaagtt aaaataataa 7860 tttgtaaagg tactattact tatccgagtt taagcgagcg cgcaaagatt atagaggagt 7920 ctcattgttc tgctatcggc ggacataagg gcgtgaataa aactataagc gcgtaaagca 7980 aaatttttat tgggaaaata tgaaaatcga catacaaatg tatattcgga agtgtttacc 8040 ttgtaaataa attagttttt atgaaacgat tagcaaaaag attcagaatc acacagttca 8100 aaacgacagc cattcatcca caatcgaacg gctctcttga gagaagtcac catgtactga 8160 gcgaatattt gaaacaattt atctcgcaag acgaacaatg ggaccagtgg ctagagatgg 8220 ctgctttttc gtataatacg tccgtacacg aagcgacaca acacacgccc tacgagttag 8280 ttttcggaag attggctcgc tcaccgtcga gcgacccact cgaaacagag gacagacttc 8340 cgacgtatgc cgattatatt atcgaattaa caacaaggtt gaataacatt caagaaattg 8400 cgcgagaaaa actaatagca gccaagtggc gtgcaaaata ttattatgac agaagtgcga 8460 atccgcaaga ttttaaggtt ggcgattata tttggttact taaaggagga aaaattcata 8520 agctaaaaga ccagtacgaa ggtccctatt tagtcgtaga tatactgcgt aatggtaata 8580 ttaagataca actaaaaact aataagtata aagtagtgca tatgaatcgc gtacgtatat 8640 catacattga accagaaaag acttaagtct aaattaatta gatgatacat gtcaaaaata 8700 gtttaagtat aaagtacata atgcataatt acagtattag cgttaatcga ttaaacaagt 8760 gagataaaaa ttaaaaaaaa agtagtagaa aaataattgt tgaaatctac atttaaagaa 8820 actaagctta agaaaaaaat ataatcaaaa gataaactcg cctcaagata cactgtgatc 8880 tgtaagagtg taccaagaaa ctcggtagca ccttcgtagc aatactagac atgaacgcta 8940 gcgccctgat ggcgattata tagagatatg cgcttgcgcc ccgatggcaa tgatatgtac 9000 atcatacata tgtcgattat aggcataggc aagcgagaat cagtattaag caaaatcttg 9060 tctatacaac gaaaatcgtg tgatataact actgtaattc gagaaaatct aaaaattcta 9120 cctcacagaa aaaaagaaaa aaattcaaat aattctcaaa tatgcgggaa aatctttgtt 9180 gggtttcaca tatatttata tactactatt ttgaatgata agcagagcca ttttagtaat 9240 ttcattacaa ttaaaataac aataatcatg acatcaattc tttcattgta caattaaact 9300 tattctttct tttcttttac ttcttcacca caaattaaca acgatcattt taaattatcg 9360 ccatgcaaaa aattaacaag gatcatcaaa aaatttaatt tgatacaaaa attaatgaga 9420 aaatacaaat atatgctcaa ctttcactca caaaactcac aaagatatca cttttacacg 9480 gacacaagaa caattatcac tgagttaaga agaatcacgt agttaaccat ttcaacactc 9540 gtaaaacatt aatacaagat tttacacgac tcaaagaaca caaaagagga gcaattctgt 9600 tttattgttt tagattccgc ctaagccgaa gaatcgaatt ctgggacaag ttagcgttgg 9660 gttaaactca gaatttcgct gaattgaaaa tggctatttg tgataccagt gcaagggatc 9720 aagctctaat gagaacctta ctgagcatat ccatgaacaa ctcgtcaggt cctaagcatc 9780 ctgaatacat agcttataaa gacagactat acactttcat attatggtta gaaggatcaa 9840 gccagaagcc agaaatctta gcagagatgg gcctctgtta taattattac gaaaatcaag 9900 tctattgttt ttactgcgat tgcaaaatgt ctcaacttca acacggagag gatctatgga 9960 ttagacacgc catccttgca ccagactgta attatctaag aacccggaaa ggagaagaat 10020 tcatagaaaa agtatataca agagtaaact ttaatagaga accaccacca gagagtctac 10080 attgccatga tcaagattct aattcggata aatcagatat tgtagtaatg aaaccactgg 10140 attgtaaaat atgtttcaga aaggaagtca agatcctttt tcttccttgt tctcacttaa 10200 ttgcttgcgc agagtgtgcc attaaattag aagaatgtta catatgtaaa caaacaatcg 10260 aacatatgat tcatgtaaac tcttaaagag gaaacaactt gacgattcca tcgaacgtat 10320 aagaagagga aaaaaaacat aaaattatat ctaacccgac cactcagaag aattaagaac 10380 cattatatga cgatttcaac gaaatttaga taaaatatta tccgaacaat tactgcaaga 10440 agagataaaa taaatgaata tatagaagta cgattaaaag tagatgataa tattattgat 10500 ataagaagaa atagtcatcg cgtgcagaac tatcgtataa ttaggtataa aaagaagaga 10560 aaaaaaaaat aattataaaa agtacaaacc gtgcctaaca ttccgaaact gaacaacgaa 10620 cgagaagttg actatactga acattcaaca agaactaatt catcaaggaa gcagccacga 10680 gggtattctt gagccgactt catgcaccac tgtaacgttc cagatcaaga cttcaaaaca 10740 ataaacaagt aaatagagat taagaacaag atatatttta agaggggatc gggatccttt 10800 tttctatcat gttacgttcg agttaattaa caaaatattt gattgcaaaa ggaatattaa 10860 atttctagta ggaccaaaga cgtaacattg aaaaagggaa catgatcctt tgcatgcatt 10920 agttatagaa gagaattaat aaattaaatt gacaactatt actaaatatt aagaattaca 10980 taatcaatga ggctacatag tgataccaaa agaatcatgt actaacatga aggaaaacaa 11040 aaaattacaa aaaaaagaag tttttgaaga gtaaatacta ttaatattat taaaaataac 11100 gatatcattg ttaaatgcac attcccgata aagtacaggg accattataa aggagtatat 11160 aaaatcaaaa ctattattaa atttaaatta aaaagagtta aagttgctta cattatagcc 11220 atattatatg catttgcatg acaattatta agaattatat ctagaaaaac caaaacagaa 11280 taaaatcaag aaaaaaataa agtattaatt aagcattgta atcgaaacca aatgtattag 11340 caattttttt ttactattaa gcgatagaat attaagtaac atcagtatct atacgagtat 11400 tgtcgaaaca aatgataatt acctcattta caaacaaata ctgattacta aaattaagga 11460 taccagatgt aaaatgtcaa tcaaaatata tatgcaaaga gaatatcaga aaatatgtta 11520 ataaagaaaa aaatacaatt ccataactac aaagcatgtt atatagatat taagaaataa 11580 tgtaaacgct cgcaagatag ggaaataaga ttaagaaacg aaattcacaa aaattaattg 11640 tgagcgtcag aggctcacct cagaacgctc atcctagggg gaagg 11685 // ID Penelope-1_HM repbase; DNA; INV; 1398 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-1398 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2091-2091 (2008). XX DR [1] (Consensus) XX CC It is flanked at both ends by (TA)n. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(180..1217,1221..1397) FT /product="Penelope-1_HM_1p" FT /translation="MDTIKKHIIQIFKNNNLSISIQCNMKVVNFLDVTLNL FT NDQSFRPFCKPNNELNYIHVDSNHPPSIIKQLPRSIELRLSANSSNETSFR FT NSAHLYQDALKKSGYKFKLNYIPKLIQPLPKNRKRNIIWFNPPYSKNVSTK FT IGKIFLDLIDKHFPVGHKYHKIFNRNTLKVSYSCMPNVESIVNSHNRHIMK FT SIPPESDKSCNCINKSKCPLNQQCLVNNIVYQATLYPGTPDGIEKVYFGVS FT ETAFKLRYANHTKSFNIEKYKNDTELSKEVWKLKENGVLNPTIKWKILKRC FT RSYNPTIKRCNLCLYEKYFIISYTYDNLLNKRNELVSKCRHKSKFLLSNFD FT TGDAAVFSFLFLFCFDDXAHSTVTLYFNGFFCFLFCITWLMIAFRHETIST FT IKSCFLYFYKL" XX SQ Sequence 1398 BP; 489 A; 228 C; 208 G; 472 T; 1 other; taataaaaaa acagtggttt atttgatgtt acaaatgggt gcttttgacg gcgcagaagt 60 atgtgagctt gtcggaattt ttcttctttt tcagctatca cagcatttat aacaagacag 120 attttggctt ataccgagat gatggtttag cggtttttaa aaatgctagt ggccacctta 180 tggacacgat aaaaaaacat attattcaaa tttttaaaaa taacaatctt tctatctcta 240 tacaatgtaa catgaaagtt gtgaacttcc ttgatgtaac acttaatctc aatgatcaat 300 cttttagacc attttgtaaa ccaaataatg aactaaatta tatacatgtt gattccaatc 360 accctcctag cataataaag caacttcctc gttctataga attaaggttg tcagctaatt 420 catctaatga aactagcttt cgaaactctg ctcatttgta tcaggatgca ttaaaaaagt 480 ctggttacaa atttaaactt aattatattc cgaaactcat tcaacctcta ccaaaaaacc 540 gtaagcgcaa tattatctgg tttaaccccc catatagtaa aaatgtttct actaaaattg 600 gtaaaatctt tttggattta attgacaagc actttccagt tggacataaa taccacaaga 660 tttttaatag aaatactcta aaggtgagtt acagttgtat gccaaacgtc gaatcaatcg 720 ttaactctca taaccgtcat attatgaaaa gcatcccccc agaatcagat aaaagttgca 780 attgtattaa taaaagcaag tgccctttaa atcaacagtg cctcgttaat aacatagtgt 840 atcaagccac tctatatcca ggcactccag acgggattga aaaagtgtat tttggagtaa 900 gtgaaaccgc ttttaaactt agatacgcta atcatacaaa atcttttaat atcgaaaaat 960 ataaaaacga tactgagttg tctaaggaag tatggaagtt aaaagaaaat ggagtattaa 1020 acccgacaat taaatggaaa attttaaaac gttgtagatc ttacaatcct acaataaaac 1080 gatgcaactt atgtctatac gaaaagtatt tcataatatc gtacacctat gataacttat 1140 taaataagag gaatgaatta gtatctaaat gtagacataa aagcaaattc ctcctttcga 1200 atttcgatac gggtgattag gcagccgttt ttagtttttt gtttttgttt tgttttgacg 1260 acrgagcaca ttccactgta actttatatt ttaacggttt tttttgtttc ttgttttgta 1320 ttacatggct gatgattgcc tttaggcatg aaactattag taccataaaa agttgttttc 1380 tttattttta taaattaa 1398 // ID BEL-67_AA-I repbase; DNA; INV; 6741 BP. XX AC AAGE02020328; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-67_AA_; KW BEL-67_AA-LTR; BEL-67_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6741 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020328; Positions 18925 25665. XX CC Positions [5781-6350] - Integrase core CC 'GGGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 762..6740 FT /product="BEL-67_AA-I_1p" FT /translation="MNQLPSELMNMISPFSSTNLGPSTSAAPLLPTCQSTP FT FVAPLAASQRNQPDDVDVTHENPQNRTYSVEIDPTNSTGNPLEVEQGLNVR FT SGSAPVSTPNRHVRIVTDTNEGSRAYQGVVTSTLNSNANDQLYHLSSNVVA FT SGSQTVDRVAPTQYRASNGTITLPSITKANQMPVRLLHTNSAPMLSVTSNQ FT GPIFSLPMGSSAPPFPPLSNVQSFVGDPIGPTFSHTTGIPTTLYAGVSERT FT VNAAPLIGNRYYTGTISSAPVVNSRLENLWQPTHLHPVAGSESVNPMGMNS FT ASGVYPAAINATSVNPMQGFNNPPFQHAERTGAGHSGPSSSFDVQSFNNPT FT AQQLAARHVVPKELPSFSGNPAEWPLFWSSYEMSTRICGYSESENLMRLQR FT CLKGEARKAVNCFLLHPLNVPEILRTLSTLYGRPEAIIGTLLAEVRSTPAP FT RSEKLETVINFGLAVRNLCAHLVATGQELHLVNPMLMNELVDKLPANIKLD FT WALYTQRVARADLRAFSDYMNVIVDAASRVTPAMDKSEKPKSKAHVNAHTS FT EGYGSKSQGERKPMSTKAYVDKSTGVSDKPCMVCKISGHKPKDCTSFKSKT FT LENRWKIAQELHLCKRCLYPHGKWPCKASLCGAEGCQQRHHKLLHPRDPAQ FT EQESTASASAVTGVVSVHQHCHNKVLFRIIPVMLHANGKSVETFAFLDSGS FT DSTLVEKSLVEQLGVEGTVSPLCMQWTNGVKRTEENSRKVQFHISGVGQSK FT QFPLRGVQTVDSLDLPRQSIHFKELECRFPHLRGLPVMSYRDGVPGILIGL FT DNTKVKTSLKQREGKTNEPVATKTRLGWVVFGRSGSVDHAQPNRVLHVCSR FT SYDDNLNDLVKQFFSTESVGVSALNPTESVDDRRAREILQTTTVRTPSGRY FT ETGLLWRYDEINFPNSRPMAERRLKCLEHRLAKLPSLYEKMRQQMAEYQAK FT GYAHQATSQELEESDPTRVWYLPLGIVVNPRKPDKLRVVWDAAATVQGVSF FT NSVLLKGPDLLVPLQTVLCRYRQKEIAISADIMEMFHQVQIRQEDRQAQRF FT LWRDNPAEPAKVFVMDVAIFGSTCSPCSTQFAKNINAEEHAEEFPKAAVAI FT KENHYVDDYLDSVDSVDEAVQLALDVKTVHERAGFQIRHWMSNSPEVLRQI FT GDQNTQAVKSFVMDKGSQQERILGMVWLPNEDMFSYKTSFHSDLERLFGAA FT VVPTKREVLRLVMSLFDPLGLIASCVVEGKVIIQEIWRANVGWDEHIPSEF FT FPRWQQWLEVLSSLDRVKIPRCYFPGYSKEAYETLELHVFVDASESAYATA FT AYFRIVDKGEVRCSLVGAKTKVAPLKSLSIPRLELQAATIGARLMKTIVDN FT HTIPIRRRVMWSDSKTVLAWLRSDQRRYRQFVAFRVTEILEETNVADWRWV FT PTKQNVADEATKWGKGPHYFESCRWFKGPSFLYEDESQWPVEGVVECTTEE FT LRNAQIHVQTSAGPIVQFQRFSKWERLIRTVAYVRRFYSNCQRKRSGSTII FT VGCIECEELKSAEYTVWKLVQSEMFPEEVLVLEKNQNVSESEQKTVEKSSR FT IYRLLPFIGEGEVIRKNGRIAASPYVPYDTKFPVVIPKEHHVTQLLINWYH FT RKYGHANGETVVNEIRQKYHVPELRVVVRKATKRCNWCLVYRSVPQVPRMG FT PLPLARLTPYVRAFTFVGLDYLGPLTVRVGRTNVKRWVALFTCLTTRAVHL FT EVAASLSTESCKQAIRRFIARRGAPQEIYSDQGTNFQGVSGELARQIQSIN FT ETLASTITNHRTQWKFNPPYAPHMGGVWERLVRSVKKGLSCLLTDRHPDDE FT TLSTVLAEVESLVNSRPLTYLPLESEEHEALTPNHFLLLSSSGVVQPVQRP FT TDEAAALRSSWHAIQVMLDNFWRRWMKEYLPVISPQSKWFGELRNVQVGDL FT VMIANEKERNSWTRGRVLRTYPGKDRRVRRVDVQTTMGVHQRPVSKLAILN FT VAGVPGIAEGSDSNTGGG" XX SQ Sequence 6741 BP; 1896 A; 1591 C; 1715 G; 1539 T; 0 other; aacttcaaga gtttttatta caatcaaccg attgcctccg gtaacctcaa tagatgtcga 60 ccgagtcggc cgcatatcgg cattgtactc tctgtgggca gaccaatagt cgagagccga 120 tgattgagtg tgccagatgt cacaatcggt accataattc atgcaccggt gtccaggatg 180 gtacgtcagc tacatctagc ccttattaca gcgagttgtg tatgccaagg gaccctgcac 240 catccgtctc catgtcctcg tccgcctcga cctcagctag tgcacgggaa gcaagattgc 300 aattgcaaat gcaaaagctc gctgaggaga aacgtctcca ggaaaggttg ttggccgaac 360 gggaaaaggc agataaagaa ctgcaggaga aggcgctgcg tctcgaaaag gaaaggagtg 420 agaaagcaat cgtcgacaag attgagctag agaaggtgta tattacgcgc aagttcgatt 480 tgctacttgc gcaagtggac gaggaggagg aaggcagaag cgtgcgcagt cggcgaagca 540 gtagcaaaag cgtcgaaaaa gtacaggctt ggattgacaa ccagcaagtg accctgaatt 600 ctaaatctgg gggacgcatc cccactggaa tcacgtcaca gcagagaggt tcatcaatcc 660 agcgagtaga cacagcggca cccatccaga atccagagcc acaacatcac cgaaccgaag 720 ggaatgatat cagcacaatc ggagattaat ctgccgccgc gatgaatcag ctaccaagcg 780 agttgatgaa tatgatttca ccgttttcta gtaccaacct aggtccatcg acgagtgccg 840 cgccattact accaacatgt caatccactc ccttcgtggc accgttagct gcgagtcaac 900 ggaaccaacc agatgatgtc gatgtcactc acgagaatcc ccaaaatcgt acctattcgg 960 tggaaatcga tccgacgaac agcacaggca acccactgga agtcgaacaa ggtttgaatg 1020 taaggtctgg ttcggctccg gtaagtacac caaatagaca cgttagaatc gtcaccgata 1080 caaacgaagg tagtagggca tatcagggtg tagttacatc gacattgaat tcaaatgcaa 1140 acgatcaatt gtaccatctt agttctaacg tcgtagctag tggttctcaa acagtagaca 1200 gagttgcgcc aactcaatat agggcatcga acggcaccat aacactccca tccattacga 1260 aagcaaatca aatgcctgta agactgctac acactaacag tgctccaatg ctgagcgtaa 1320 cgagcaatca gggcccaata ttctctcttc caatgggttc ctctgctcct cctttccccc 1380 ctctgtcaaa cgtccaatcg ttcgtgggtg atccgatcgg tcctaccttc agtcatacta 1440 ccggtattcc aactacgttg tacgcgggtg ttagtgaaag aacggtgaat gcagctccat 1500 tgatcggcaa ccgttactac actggaacga tatcctcggc tccagtggtc aattcaagat 1560 tggaaaactt atggcaacca acacatctcc atccagtggc cggttcagaa tcagtgaacc 1620 ctatggggat gaattcagca tcaggagtct acccagcggc gatcaacgct acttcggtga 1680 atccaatgca aggcttcaac aatccaccgt ttcaacacgc agaaagaact ggcgcagggc 1740 attctggacc ttcttcaagt ttcgatgttc aatcgttcaa caatccgacc gcacagcagc 1800 tggcagcccg ccacgtcgtc ccgaaggagc ttccctcttt ttcggggaat ccggctgagt 1860 ggccattatt ttggagcagc tacgaaatgt caacgagaat ctgcgggtac agtgagtcgg 1920 agaatctcat gaggcttcaa cgctgtctca agggagaagc aaggaaagcg gtcaactgtt 1980 tcctgctcca tccgttgaat gttccggaga tcttacgcac gctgagcact ctctatggac 2040 gaccagaggc catcattgga accttgttgg cagaagtacg atctacacca gcaccacgat 2100 ccgaaaaatt agaaactgtc atcaatttcg gattagccgt gcggaatctg tgcgcccact 2160 tggtagcaac cgggcaagag ttgcatttag taaacccaat gttgatgaat gaattggtgg 2220 ataaacttcc agcaaatatt aagctagact gggcgttgta cacccagcgc gttgcaagag 2280 ccgatttgcg agcattttcg gattatatga acgtcattgt ggatgcagcg agtcgagtga 2340 cgccggcgat ggataaatcg gagaaaccga agagtaaagc tcatgtaaat gcccatacat 2400 cggaaggcta cggcagcaag agtcaagggg agcgcaagcc tatgtcgacg aaagcgtacg 2460 ttgacaagtc aacaggagtc agtgacaagc catgcatggt ttgcaagata agtggtcaca 2520 aaccgaagga ctgtacgtct ttcaaatcca aaacactgga aaaccgatgg aagatcgcgc 2580 aagaacttca cctttgcaaa aggtgtctct acccacacgg caagtggcct tgcaaagcat 2640 cgttgtgtgg tgccgaagga tgtcaacaac gtcatcataa acttctgcat cccagagacc 2700 cagcgcagga gcaagagtct actgcatctg ctagcgctgt aacaggtgtg gtttctgttc 2760 atcaacactg tcacaacaaa gttttgttca gaatcattcc ggtcatgcta cacgccaacg 2820 gcaaatccgt cgaaacattt gcgttccttg acagtggttc agactcaacg ttggtggaga 2880 agtcactggt tgagcaactc ggcgtggagg gtactgtatc ccctttgtgc atgcaatgga 2940 ccaatggtgt caagcgaacc gaagaaaatt ccaggaaagt tcaattccac atttctggag 3000 tagggcagag caagcagttt ccactgagag gcgtacagac agtggatagc ctggaccttc 3060 ctcgacaatc gattcacttc aaagaactcg aatgccgttt tcctcatctg cgtggattac 3120 ctgtcatgag ttaccgtgac ggtgttccag gtatccttat cggcctcgat aataccaaag 3180 tgaagacttc gctgaaacaa cgtgaaggaa agaccaatga acctgtagcc accaaaacca 3240 ggctgggttg ggttgtgttt ggccgttccg gttccgtaga tcacgctcaa cctaatcgag 3300 tactccacgt ttgctcccgt tcttatgacg acaacctcaa tgatttagtc aagcagtttt 3360 tctcgacaga aagcgttgga gtgtctgctt taaatccaac tgagtcggta gatgatagac 3420 gggcacgaga aatactccag acgacgacgg ttcgtacgcc ttccggaaga tacgaaacag 3480 gcctgctgtg gagatacgat gaaatcaatt ttccaaacag ccgaccaatg gctgaacgtc 3540 gtctcaaatg cctcgaacat cgtcttgcca agttacccag cctttacgag aaaatgcgac 3600 agcagatggc cgaatatcag gcaaaaggtt acgcccatca agctacgtca caggagttgg 3660 aagaatcgga tcccacacgg gtctggtacc tgcctttagg tatcgtcgta aaccctcgca 3720 aacctgataa actgcgcgtc gtctgggatg cggccgccac tgttcaagga gtctcgttca 3780 attcagttct tttgaaagga ccggatttac tagtgcctct tcagactgtg ctttgccgct 3840 atcgacaaaa agaaatagcg attagcgccg acattatgga aatgttccat caggtgcaga 3900 tcagacaaga agatcgacaa gcccaaaggt tcttgtggcg agacaatcca gcagagccag 3960 caaaagtttt tgtgatggat gtggcaatat ttggatcgac gtgttcgcca tgttccacac 4020 agtttgcgaa gaacatcaac gccgaagagc atgcagaaga gttcccgaaa gccgctgttg 4080 cgataaaaga gaaccactac gtcgacgact atctggatag cgtcgacagc gtcgacgaag 4140 cggttcaact tgcgttggac gtgaaaactg tccacgaaag agctggtttc cagatccggc 4200 attggatgtc gaactctccg gaagtattaa ggcaaattgg ggaccagaac actcaagcgg 4260 tgaaaagttt cgtgatggac aaaggtagtc agcaagaacg cattttggga atggtctggt 4320 tacccaacga agacatgttt tcgtataaga caagtttcca tagcgatctg gaacgccttt 4380 ttggtgcagc ggttgtccct actaaacggg aggttctgag gctagtgatg agcctgtttg 4440 atccgcttgg tctcatcgca tcttgtgtcg tcgaaggaaa ggtgatcatc caagaaatct 4500 ggagagctaa tgtcggttgg gacgagcata taccctctga attttttcca cgttggcagc 4560 agtggctgga agtgttaagt tcgctcgata gagtgaagat cccccgatgc tacttcccag 4620 gatatagcaa ggaggcttac gaaacactcg aactacatgt cttcgtcgac gcaagtgaaa 4680 gtgcgtacgc taccgccgcc tacttcagaa ttgtcgacaa aggagaggta cgatgcagtt 4740 tagttggagc gaaaactaag gtggctccgt tgaaatcgct ttctataccc cgcttagaac 4800 ttcaggcggc tacaattgga gcgcggttga tgaagaccat cgtagacaac catactatcc 4860 cgatcagacg cagagtcatg tggagcgatt caaaaacggt gctggcgtgg cttcggtcag 4920 accaaaggcg atatcgacag tttgtggcgt ttcgcgtcac agagatactt gaagaaacga 4980 acgttgcaga ctggaggtgg gtaccaacca agcagaatgt ggcggacgag gccaccaaat 5040 ggggcaaagg tcctcactac ttcgagagct gccgatggtt caaaggacct tcatttttgt 5100 acgaggacga aagtcagtgg ccagtggaag gcgtagtaga gtgtaccacc gaagaactga 5160 gaaacgctca gattcacgtc caaacttcgg cagggccgat agtacaattt caacgattct 5220 ccaaatggga gaggcttatc agaaccgttg cttatgtacg ccgtttctac agcaattgcc 5280 aacgcaaaag gtctggctct acaatcatcg tgggctgtat agaatgcgaa gagctaaaat 5340 cggctgaata tacggtctgg aagttggtac aatcggaaat gttcccagaa gaagtccttg 5400 tattagaaaa aaatcaaaac gtctcagaaa gtgagcagaa aactgtggag aaaagtagcc 5460 gtatctatag actgttgccg tttattggtg aaggagaagt tatacggaag aatgggcgaa 5520 ttgcagcatc accatacgtt ccgtacgaca ccaaatttcc ggttgtcata cctaaagaac 5580 accacgttac acagctactg ataaactggt atcaccgaaa atatggccac gcgaacgggg 5640 aaacggtggt caatgaaata cgacaaaaat accacgttcc ggagcttaga gttgttgtaa 5700 gaaaggccac aaaacgctgc aattggtgct tggtctatag atctgtacct caggttccga 5760 gaatgggtcc tttacctttg gcaagactta cgccctatgt gagagccttc actttcgtag 5820 ggctagatta tctcggacct ttgactgttc gtgttgggag gacaaatgtc aagcgatggg 5880 tagccctgtt tacttgcctg acaactaggg cagttcattt agaagttgcg gcgtcgttgt 5940 ccacagaatc atgtaaacag gctattcgaa gattcattgc acggagagga gctcctcagg 6000 agatttattc ggaccaagga acgaatttcc aaggagttag cggtgaattg gcaaggcaaa 6060 tacagtccat caacgaaacc ctcgcttcta ccatcaccaa ccacaggacc cagtggaagt 6120 tcaaccctcc gtacgctcca cacatgggag gcgtgtggga gagacttgtg cgatccgtta 6180 agaagggtct cagctgttta ctgacagata gacaccctga tgatgaaact ttgtcgacag 6240 tgcttgctga ggtagaatcg cttgtgaact cacgccccct gacctaccta cccttggaga 6300 gcgaagaaca cgaggcttta acgcccaatc attttttgct cttgagctcc agcggggtgg 6360 ttcaaccagt tcaaaggcct acagatgaag ctgctgctct tcgttccagt tggcatgcga 6420 ttcaggtaat gctggataac ttctggaggc gttggatgaa ggagtacctt ccagttatat 6480 cgcctcaatc caaatggttt ggagagctgc gaaacgttca agttggagat ttggtgatga 6540 tagctaatga gaaggagaga aacagctgga cacgagggag agtactccga acttatccag 6600 gaaaggatcg aagagttaga agagtcgacg tgcagactac gatgggagtt catcaacgac 6660 ctgtttccaa gttggctata ctgaacgtcg cgggagttcc aggtatagct gaaggttcgg 6720 acagcaatac gggtgggggt a 6741 // ID SAT_DG repbase; DNA; INV; 368 BP. XX AC X83736; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Dociostaurus genei tandemly repetitive DNA (DGT3). XX KW SAT; Satellite; Simple Repeat; SAT_DG; Repetitive DNA; KW tandem repeat. XX OS Dociostaurus genei OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; OC Acridoidea; Acrididae; Gomphocerinae; Dociostaurus. XX RN [1] RA Garcia De La Vega C.; RT "Direct submission."; RL Unpublished. XX DR Genbank; X83736; Positions 1 368. XX SQ Sequence 368 BP; 121 A; 77 C; 66 G; 104 T; 0 other; tcgaccgatc agcgccaaac attacataac gtaactaacc tggcctatca gcgacttagg 60 acctcccctg ttacaaatgt ttccgaccag aaatacgtta gttatactaa aaataggaag 120 tttgcgttgc gatccgcctt ttgtacattt cagaacttgg gtaatttccc aagggaaggc 180 caagtaattt taattaataa ttacgatata gagtgatatt agtaagagga acagatcacc 240 tggaccgtca cagaggatgc caaaatttaa aaatcatact ctcaaacgca cacaattgtt 300 gaaatacgaa tatttcgata tgtctgtctg tctgtaaagt gttcccactt ttaggtcaaa 360 ccattcga 368 // ID SACI-5 repbase; DNA; INV; 4257 BP. XX AC BN000803; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni Saci-5 LTR retrotransposon (EST). XX KW Gypsy; LTR Retrotransposon; Transposable Element; SACI-5. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4257 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000803; Positions 1 4257. XX FH Key Location/Qualifiers FT CDS 164..3625 FT /product="SACI-5_1p" FT /translation="MKIPWPKEFEDGDVMSFLKEFEAVAELVGVKEPKAKT FT VVLGTLLRGRAKAVYDSLDGAGGKTTWEAVTERLIVEFDSPVDREEALQKF FT RVAKLPIDGDPLVLAVELTNLLRRPLPNLDEESEAQLLASQFIESVPVAVS FT QQLRLVHAAQPMDISELAKVTRQLMTRTVAPVACESQEISSIEKKIEELQH FT EIAALRVIRKRNDKCFACGGTGHWKINCPTRRKRRYFRRYVFPSYSRDLGV FT VALVDRGAVYARVNVNELDLVCLIDTGAAVSLIGKEQCKRFKPCKVAVHTI FT GGHMLEVLGVSNSIVKVDGSAISFPFVVTANLSRPILGADFLREVKAVVDL FT RSGKVVTKYGSFPIHESSEVAEVRVATAVPTKKPSITELCKKYAKLFTGEG FT EPYGFCDKVKHEIPIKNDGGSIFTTRRVPVHLEAEVNRQVQEMLKEGIIEE FT ADSPYSSPVLLVKKPNGKYRFCVDFRELNNITELKPCAMPTVVETLDRLQN FT ATVFTVLDLRSGYWQLPIKESDRSKTAFTIRDKQYQFRRMPFGLAGAPFTF FT RRLMSLLLRDLDNVEVYGDDVVVYSQTETDHAKHVEAVLKRIEEFGLRINK FT DKSQMAKSSITLLGHKVGNGEIKPLPEKILTIKNVAVPNSRRKLRQFLGRA FT AFYSRFIKNFNEIAAPLYKLLSNTKFSWTETAQQTFNQIKNVLDDRQMTLR FT LPELEKPFTVTTDASDHGIGAVLSQSNRVVEYASRVLTPAEQKYSTIEKEC FT LAIVWAVDKWRPYLLGRRFHIETDHKPLQWLQTARDPRGKLARWMIRLQEY FT DFSIGHVPGKENVMADYLSRPDMEAELPLTAYAVNSIEGDPLELVRQQKSD FT PKLREVIRVIREEADVDKRAMDKEVIMLLRQKERLKVNTYGALVWQDDDKN FT WVAVIPKDWRRKMIHECHQVAHTGIARTTDLLRQSAYWPGMRDDVAQYVLT FT CQQCQLMKSDRYMQPPLQSVPVTAVGDLWSVDVMGPFPQTDSGNRYLLVMT FT EHATRWVDAVPIADQRAKTVTEVIIRHIVAGHGVPKMILTDQGPCFESDEF FT KARLKQFGIKRIRTTPYHPQTNGLTERNNRTLKEWLASKGGNWEKELPLIL FT LAHRASIQGTTKKSPFLLMYGRQPRLPMHSKTWPQQQKWTATRLRK" XX SQ Sequence 4257 BP; 1348 A; 804 C; 1192 G; 913 T; 0 other; aggaacaatg aagcctgagg caattgaaga acattttgca aatagtgtat catagcatgg 60 tttttgtatt ttaccaagta acgttgtaat ttgtgctaaa tacatagttg gttgtcctca 120 cctgtgttct cattcacaac atttggtgtc gctgcctcca aggatgaaga tcccatggcc 180 gaaggaattt gaggacggag atgtgatgag cttcctgaag gagttcgaag ccgtggcgga 240 gctggtggga gtgaaggagc cgaaggcgaa gacggtggtg ctggggacgc tgctgagagg 300 cagagcgaaa gctgtatatg acagcctgga cggtgcgggc ggtaagacga catgggaggc 360 agtcacggag aggttgatcg tggagttcga tagcccagtg gacagagaag aagcactgca 420 gaagttccga gtggcgaagt tgccgatcga tggtgatccg ctggtgctgg ccgtcgagct 480 taccaacttg ctgcgtcgcc cgctgccaaa tctggatgag gagtcggagg cacagttgtt 540 ggcgtcgcag tttatcgaga gcgtgccagt tgccgtttcc cagcagctga gactggtaca 600 cgccgcgcaa cctatggata tcagtgagct tgcaaaggtg acgcggcagt tgatgacacg 660 aacagtagct ccggtcgcgt gcgaaagtca ggagataagc agcattgaga agaagattga 720 agaactgcag cacgagattg cggcattacg agtgattcgt aaaaggaatg ataagtgttt 780 cgcatgtgga gggacaggac actggaagat aaactgtcca acacgacgaa aacggagata 840 ttttagacgt tacgtgtttc cttcttactc tagggatctg ggggttgttg cattagtaga 900 tagaggggca gtctatgcaa gagtgaacgt aaatgaacta gatttagtgt gcctaatcga 960 tacaggggca gcagtttcgt taataggtaa agagcaatgt aagagattca agccgtgtaa 1020 agtcgcagtt cacactatag gagggcatat gttagaggta ttaggcgtgt ctaacagtat 1080 cgtaaaagtg gatggttcgg cgataagttt cccgttcgtg gtaaccgcaa acctgtcgag 1140 accaatatta ggagcagatt tcctaagaga agtaaaagcc gtagttgacc taagaagcgg 1200 taaggtggtc acgaagtacg gttcattccc aatacatgaa agtagcgagg tggctgaggt 1260 tagagtggca acagccgttc cgacgaagaa accaagcatc accgagttgt gcaaaaagta 1320 tgcgaagctg ttcactggag aaggagaacc gtacggattt tgcgacaaag taaagcacga 1380 aatcccgata aagaacgatg gaggaagtat attcacaaca cgtcgtgtcc ctgtgcactt 1440 agaggctgaa gtaaatcgac aggtccaaga gatgctgaaa gaaggaataa ttgaggaggc 1500 ggacagccca tatagctcac cggtactctt agtaaagaaa ccgaatggga aatatagatt 1560 ttgtgttgat tttagagaac tgaataatat aacagaacta aagccgtgtg caatgccaac 1620 agtagtggaa acattagatc ggctgcagaa tgccacggtg tttacagtgt tggatttgcg 1680 gtcaggttat tggcagctgc ctataaaaga gagcgatcga agcaaaacag catttaccat 1740 acgtgataaa caatatcaat ttagacgaat gccgtttgga ttggcgggtg cgccatttac 1800 gttcagaaga ttaatgtcgc tactgctcag ggatctagat aatgtcgaag tgtatggcga 1860 cgatgtagtt gtatacagcc agacagaaac agatcatgcc aagcatgtgg aagcggtctt 1920 aaagcgaatt gaagaattcg gacttcgaat taataaggac aagtcacaga tggcgaaaag 1980 tagcatcacg ttgttggggc ataaagtggg aaatggagaa ataaagccac tgccggagaa 2040 aattctgacc ataaagaacg ttgcagtacc gaattcgaga aggaaattga gacagttcct 2100 gggaagggcg gcgttctaca gtcggttcat caagaatttc aacgaaatag cagcacctct 2160 ttataagttg ttgagcaaca ctaaattttc gtggactgaa accgcccaac aaacgtttaa 2220 tcaaatcaaa aacgtgctgg acgaccgcca gatgacgcta agactgcccg aactggagaa 2280 accgtttaca gtgacaacag acgcaagtga ccatggtata ggtgccgttt taagtcagtc 2340 taatagagtg gtggaatacg caagtcgcgt tctcacgcct gcggaacaga agtattcaac 2400 gattgaaaag gagtgtttag cgattgtgtg ggcagtagac aagtggagac catatctgtt 2460 aggtagacga tttcacatcg agactgacca taaacctctg cagtggctac aaacagcacg 2520 agacccacgt gggaagttgg cgcgatggat gatacgtctg caggagtatg acttcagtat 2580 cgggcatgtt ccaggcaagg aaaatgtaat ggcggactac ctgtcaagac cagacatgga 2640 agccgagctt ccattgacag catacgcagt gaacagtata gagggagacc cactagaact 2700 cgtccggcaa cagaaatctg atcctaaact tcgagaggta atcagagtga taagagagga 2760 ggcagatgtt gacaaacgag caatggacaa ggaggtaata atgttgcttc gtcaaaagga 2820 aagactgaaa gtgaacacat atggagctct ggtctggcag gatgatgaca aaaattgggt 2880 ggctgtgatc ccgaaagact ggcggcgtaa gatgatacac gaatgccatc aggtagcgca 2940 cacaggtatc gcaagaacga cagatttact acggcaaagt gcatactggc caggcatgag 3000 agacgatgtg gcccaatatg tgttgacgtg ccagcagtgc caattaatga aaagcgatag 3060 atacatgcaa ccgccattgc agtcggtccc agtaaccgca gtcggtgatc tatggtcggt 3120 agatgtgatg ggaccatttc cacagacgga tagcggcaac cgatacttac tagtaatgac 3180 ggaacatgct acacgatggg tcgacgctgt accgattgcc gaccagcggg cgaaaacagt 3240 gaccgaagta ataatccggc acattgtagc gggtcatgga gtaccgaaga tgatactaac 3300 cgaccagggt ccctgttttg aaagtgacga gtttaaggcc cgtttaaagc agttcggcat 3360 caagaggata cgaactacac cgtatcaccc tcagacgaac gggcttacgg aaagaaacaa 3420 ccggacatta aaggaatggt tagcatctaa aggagggaat tgggagaaag agttaccact 3480 aattttgctg gcgcatcgtg cctccatcca aggaacaacc aagaaatcac ctttcctact 3540 tatgtatggt agacagccac gacttccaat gcatagtaaa acgtggccac agcagcagaa 3600 gtggacagca acaagattaa ggaaatagag gacggaggca ataaagaacg tgaaagagaa 3660 acaaatatca gacatcaaga aggcggaaca gcagagaaga gcgtggaagc cttttagagt 3720 gggggacctg gtaaagtgta gagaacggag agctaacact gaaggaggac cagggtcagg 3780 aaagctgatc ccgaaatggg agggaccgta cattatcacg gagagacgcg gcccagttta 3840 cacgatccgg agagaagaca agcagaagcg tgtaaatgcc agtcaactgc agagatggtt 3900 tcaagaacat catgagcacg agtcacgaac agcaccaaag gagacaatgg tacgccgatc 3960 agaaagattg cgagaacagc ttattaaaag gggggacgag tgtggtgtga gctactgata 4020 tcaacataag tagtatatat ggtgagtaag gaatagaata tctgacagca gaagattgag 4080 aagatcaaga agagaagaac agaaataaaa agcaatttgt gtgaaaatcc aggaacaatg 4140 aagcctgagg caattgaaga acattttgca aatagtgtat catagcatgg tttttgtatt 4200 ttaccaagta acgttgtaat ttgtgctaaa tacatagttg gttgtcctca cctgtgt 4257 // ID Gypsy-622_AA-I repbase; DNA; INV; 8370 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-622_AA_; KW Gypsy-622_AA-LTR; Ty3_gypsy_Ele157; Gypsy-622_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-8370 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4552-5031] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 489..2411 FT /product="Gypsy-622_AA-I_1p" FT /translation="MNRSLPRAEDLNLEEIDYELSIRNQPEEVFKLDVQGK FT QRHLRLLFKSDQKEGRNYRSNTNIQDEAGHISARIDNLEKALSKKVEAKYE FT SRVIHYYYRVKRCIADTEEGKELRRELSRRIEKLMQYYQFGPSLSPFKEPS FT IPIIQETEGAVGLSGPVSPSRTMGEPASDSMLETGPKGRCEMSNSQGTIPK FT VNRSDFRLQERTDQNGIKDTIEVSRTEWEELKRILADLSNKANADNAANMV FT SRPRIPENRQTFPQNNPRTQQRQTGLRSTCYADEFVDTDSDEEEFATVQQN FT RLDRRVDRNQFERDSYEDSLYEYDSGSTRRYRHRRGIRDGRHRREQGYGRV FT EKWKLRFSGDSRGISVENFLYKASKLAEREGVSEHTLLRDIHMLLEGAASD FT WFFTYVDDLITWEDFKNGITYRFGNPNKDQGIRSKIQERKQQRGESFIAFV FT SEIEKLNRMLSKPLSRRRKFEVIWDNMRQHYRSKISIVRVRDLQHLIDLNY FT RIDAADHQLHQPGVDHFVRRPINQIEAAEYVSEDEEEQEATVNHVRGQHQR FT SRPPASRPFSGPTQQVQGNRQAENGEPGLARLSCWNCQEEGHGWRQCTKPR FT NIFCYGCGNLGRTIRSCERCVRQEVVPESGIQGNLRRGASQGN" XX SQ Sequence 8370 BP; 2385 A; 1752 C; 2011 G; 2219 T; 3 other; tttggcgccc aacaatacca ggcattttct gattttggtg taactgtatc tgataattat 60 cttttcgtat taatatagta tttgtgtgtt tagttgtcta agggaacaag acttgtttaa 120 tttatttttc ttcgaggaac tttctcttat attgtacata taaataatat tgagtattta 180 tttctgtttc gtttcgttta tttcataaaa cgtttgattt gcacaattac gtatcataca 240 aaaaaaatct cgttcttatc attaagctta tcttatatgt gatttaatta ttttctagtg 300 ttattaaaat tgttttcttc aatatttgtt ttgctaaatg ttatttatga ttttatagct 360 gtttaaacaa aaatagcttc tctgggaaca ttacatcaaa actcaaatat tctgaatcac 420 gattttgcgg ttgaataggt gacctttttt ttcttctggt ttaagcaaac atacatagat 480 cgatcaatat gaatcgttcc ttaccaagag ctgaagatct caaccttgaa gagatcgact 540 acgagttgtc gattcgcaac caaccagaag aagttttcaa actcgatgtc cagggaaaac 600 aaagacattt gagacttctg ttcaaatctg accaaaagga agggaggaac tataggtcaa 660 ataccaacat tcaagacgaa gcgggtcata ttagtgctag aattgacaat ttggagaagg 720 ccctgtctaa aaaggtagag gccaaatatg agtctagggt gatacattat tattatcggg 780 tgaagagatg cattgcagac acggaagagg gtaaagaact gcgaagagaa cttagccgta 840 gaatcgaaaa acttatgcag tactaccagt tcgggccatc cttgtctcca ttcaaagaac 900 catccattcc tattattcaa gaaacagagg gggcagtagg tctatcaggt ccggtcagtc 960 caagtcgaac aatgggggaa ccggcgtccg acagtatgtt agaaactggt cctaaaggca 1020 ggtgtgaaat gagcaactcc caagggacaa ttccaaaagt caatcggtca gatttcagac 1080 tgcaggaacg gacggaccaa aacggaatca aagataccat cgaagtatct cgtacggaat 1140 gggaggagct gaagcgaatt ttagccgatt tgtcaaataa ggctaacgcg gataatgcgg 1200 cgaatatggt gagtagaccg cgaattcctg aaaatagaca gacctttcca cagaacaacc 1260 caaggacaca acagcgccag acaggacttc ggtcaacgtg ctatgctgat gagttcgtcg 1320 atacggattc agatgaggag gaatttgcca cagtgcagca aaacagattg gatcggaggg 1380 tcgatcgcaa ccagttcgag cgtgatagct acgaggattc gctgtacgaa tatgatagtg 1440 gaagtacgag acggtatcgg catcgacgag gaatcagaga cggccgacat agacgtgagc 1500 aaggctatgg gcgagtagag aagtggaagc tgcggttctc gggagactct cggggaatat 1560 ccgtcgaaaa cttcctttat aaagcatcaa aactagctga gcgcgaaggc gtatccgagc 1620 atacattgct aagggacatc cacatgctgt tagaaggagc cgcgtcagac tggttcttca 1680 cctacgtgga tgacttgatc acgtgggaag acttcaagaa cggaatcacc tatcgcttcg 1740 gtaaccctaa caaggaccaa gggatacggt cgaagattca ggagcggaaa caacagcgag 1800 gagagtcgtt tatagctttt gtgtcggaga ttgagaagct aaaccggatg ttgtccaagc 1860 cattgtctcg aagacggaaa tttgaggtca tatgggacaa catgcggcaa cattataggt 1920 ccaagatctc gatcgttcgc gtaagagact tgcagcatct aattgacttg aactaccgca 1980 tcgatgcggc cgatcatcaa ctacaccaac cgggggtaga tcatttcgtg cgtagaccaa 2040 tcaatcagat tgaagccgca gaatatgtca gcgaagacga ggaggagcag gaagcgacag 2100 tgaaccacgt taggggacaa caccagagaa gcaggccacc ggcgtcccgg ccgtttagtg 2160 gtccaacgca acaggtgcaa gggaatcgtc aagcagagaa tggtgaacct ggtttggcga 2220 gattaagctg ttggaactgc caagaggaag gacacggttg gagacaatgc acaaaaccgc 2280 ggaacatctt ctgttacgga tgtggcaatc ttggaaggac gattcgctct tgtgaacggt 2340 gcgtcagaca agaagtcgta ccagagagtg ggatacaggg aaacctgaga aggggtgcga 2400 gtcaggggaa ttgagcatcc ctgcatcaca cgttcttccc caaacatttg acccgttttt 2460 gcatgttcac catatacgaa ttagaataag caagtgtcct cacattagag tgagaatttt 2520 cgattttgag acagaagctt tgttggactc cggggcaggg attagtattt taaattcaat 2580 ggaattgatt gagagatttg ggttgaagtt gcagccggca gccattcggg tggccactgc 2640 tgatggtgca aattacgcct gtcggggctc cgtaaacatt ccgtttacct acaaacggtc 2700 acaaggtgat tccgacaatc gtagttccag aaatttcgcg ggaaattatt ttgggagcgg 2760 acttttggga cgcgtttgga atcaagccaa tgatcgacct gggtagaggc ccagagtccc 2820 tggaaacggt agcaagtcag caaaacgacg ctatttgttt caccgttgaa ccggtagacc 2880 aattgccgga ggcggccaag gaagatgagg atgacacgtt ggacattcca gtgtatgagg 2940 ggccaacaga agcaatcccg gatccagaat ccatagaaac cgaacacgaa ttgagtacgg 3000 aggagaggca acaactgatg gaagtaatca gaggtttgag ttgacttcca cggggaagtt 3060 aggtagaacc gatctaatcg agcacgaaat cgtgctgaag gaaggtgcaa agccacggaa 3120 cccggcgatg tacaaatgct caccgtacat gcaagaggcg ataaagaaag aggtggagag 3180 gttcaagaca ctggatgcaa tcgaagaatg ctacagtgag tggaccaccc actagtgcca 3240 gtaccgaaga agaacgggaa ggttcggtct gcctgattcc aggaagatca acaaattgac 3300 ggtaaaagac tcctacccta tgagaaacat gcaggacatt tttcgtcgtc tgggtagagc 3360 aaagtacttc tcggtcatag acctgaagga tgcctacttc cagattccgt tgaaggaaga 3420 gagccgaact acacggcatt cagaacatcg gaaggcgtct tccgcttcaa agtgttgcca 3480 ttcggggtca tcaacgcgcc gttcaccatg tcacggttga tggacagagc acttggattc 3540 gacctcgaac cgcaggtgtt cgtctacctc gatgacatag tcatcgctac cgaaacccta 3600 gaggagcacc ttcgactgct aaaatagtag gggagcggct acgcaaagcg ggactacgat 3660 ttcgttagag aaatcacgct tctgtcggaa gcaagtgatg tacttgggct atctctgaac 3720 gagcatggga ttgcgattga cagtagtcga atccaaccaa ttttagatta tgcgagaccg 3780 aagacgcagg aggacatccg cgcttgatgg gcttggcggg gttctaccag cgctttatca 3840 aggactacag tagggtgact gctccaataa ccgatcttct cacgaaagaa aataagaaat 3900 tcacttggag caaagaagca gaagaggcgt ttcgagagct caaatcaatt ttgacctcag 3960 cgccaattct agggaacccg gacttcagca aggtttttac gattgaatcc gatgcttcgg 4020 atagagcagt aggtgccgct ttggtgcagg agcaggacgg agttactagg gtgatcttca 4080 gcaagaaact gaatcgtact caaaggcggt actccgctgt ggagaaagaa tgtctggggg 4140 ttctctcagc gattcaacac ttcaggcatt acatcgaagg cacgaaattc cgagtcgtta 4200 cagacgcgag aagtctctat ggcgggatta atacctacga tcgagatcaa ggaagcgatt 4260 ggaggtcact tgcgaccagt ttaggaagat gtttcctgaa atcaaccaac cgattaagcg 4320 accctcgatt cgcgtggaag caatatcctc cgtagcggaa cgagagcaga tcttgaaaag 4380 aacccacgag aaagctcact tcgggtacga caagacgttg aacgaggtaa agcagagata 4440 ctttggccga aaatgaatag tgacgtccgg aaacattgcc gtgaatgttt gaaatgtcag 4500 gtcagtaagt cgggtaacac gaatgtgaca ccgccgatgg gatcaaagaa gcctgtcgaa 4560 tacccatggc agttcgtgac gttagattat gtcggaccat tgccaccgtc cggcagaaat 4620 agacatacgt gcctactcgt agccaccgac gtgttcagca aattcgtact agtgcagcca 4680 ttccgagaag caaaagctag ctcattggtc gaatttgtcg aaaacatgat ttttcgcctg 4740 tttggagttc cggaggttgt tctgacggat aatgggtctc agtttatttc gaaacaattt 4800 agagagctgt tggaagcgta ccatgtttcg cattggctaa caccggcgta ccaccctcag 4860 gtgaacaaca ccgagcgagt gaaccgcgta atcacgacag ccattcgcgc cacgctgaag 4920 aaagagcaca agcactgggc ggatgacatt caggagatcg caaacgctat ccgaaacgcg 4980 attcacgaat ccacaaagta cagtccctat tttatcgtat ttggtcgtaa catggtatct 5040 gatgggagag agtacggtcg gataagggac aactatgagt ctaccggcga taatgaggat 5100 ttgacagtcg aacgacggaa gaaactgttc gaggacatta agaccaacct gactacagcc 5160 taccagcgac atgcaaagac ctacaacctt cgatccaatk caaattgtcc cacctatgcg 5220 gtaggggaaa aagtattaaa gcagaccttt gacctgtctg ataaggggaa gggtttttgc 5280 aagaaacttg cacccaaata cgaactggcg gtagtacgga aaaggttggg aactaacacc 5340 tacgagttgg aagatttgac ggggaagcga ttaggggtct atttcgcgac cagtttgaag 5400 aagatgcttc ctcatctaag ccaatcgtag gcgcactatt gtagagctat gaaactcctt 5460 tcgagtgacc ataaatcgtt agatgaaaaa acaccttcgg gttcaaacat cgtacggatg 5520 atgctcgatg gactcgctct tggaacgttt tcgaaccgac cagtgctgag ggtaccgaca 5580 ttaagcctaa aaattctaaa ccctactttg tagagctatg aaactccttc gagtgaccac 5640 aaatctactc agatgtaaaa acaccttcgg gttcaaatat catacggaca atgctcgacg 5700 gactcgccta cggaacgttt tgaactgacc agggctgagg gtaccgacat tgggcctaac 5760 acattccaac cagtttgcaa gctatgaatc gaaagtgacc acaacgaaca aatctatcaa 5820 aacaccatga agggcaaagc aatcaagtcg ttaaattcca attaagacgc gaaaaccgat 5880 tgggacaaga ttccaattgg aatacattcg cctatgcgaa ctgtatatat gtaaatatta 5940 gtagattagt ttttcctagt tccctagccc ttaatctagt attagtactt agcttagatt 6000 agtaaattaa cttaaataac tcacgattat attccttatt ccattgatta gcttattttt 6060 tcatgttgaa ggcattcgtt atctcaagtt gttccattgc attcgttttc gtcttaccgg 6120 cgatgtaaat aatgtaaata cctaaaagtc gaaatawttg tttagtttgc gttcatacag 6180 tagtaaagtt taattagctt agatgaataa ttaccttatt aaagttcgtt gtagtcattt 6240 ccgttcattg tagatttggt cactttctca ttcgtttcct tttcccacgt ccatcattcc 6300 aattagaatt gctccttgtt tacgttcacc tttgtccacc ttattttttc ctagtttcga 6360 tgttcctctt cccgttgtcc agtccaatcc tttccagtca cgatcctccc gtcgcaaacg 6420 ttgaaatctt ccactcgtta gttcccagct tttccatttt ccgattcggg cagttccgca 6480 gttctccgtt agcagccggg ggcaatcgca aatgcacctg tggaaaaaaa caaacaaaca 6540 aaccaactta cacccgataa tttggaacaa tattgtgttc cactctcctt atacttatcg 6600 gaacgtcccg caggacgtcc accgatgcac tccaagccat ttcagcacta attttcacga 6660 tttttatcac taattttccc gattttgcac tgtatcgccg tgccacgcgt gcgcgatcca 6720 ctttcacttt ccgtcttgtt gattttgaca gttctttgtt gttgacatcg cgtgtatgat 6780 agaccggtca tgtttcgagt gtgagtgtgt cggagttcga caggagtcgg attctggtac 6840 acattattgt atgcaaggta tgaatgattg gatggtttta tagtgagtat gtgttattct 6900 tgaagagctt tcgcagtcgg tttgacctgg cgtttgaaat ctcttttaag gagctttttc 6960 gtttcgtgtt tcagtaagcg catatcgcgg atgcatttcc gtggctgcaa actgaaaacg 7020 aaacagttta ttttaagata aggaagttgg cgtctcgcgt ttctcgctac gtgctggggt 7080 agagatggtc aatggactta tcggtttcgt ccacttaggt ggaaattttc gtgaatgtcg 7140 tcttcgatta tcggcaatgt cctggtgtat gaagacaatg tttcgttatt tgtgtgcgct 7200 tgctcagagt attgtcgaca actctcgagt ttaagccgtt tatatgtttg tgttttcgaa 7260 tttaatgtga atgaatgagg tcatcaacaa acttccggtc aattgaacag tttcctttct 7320 ctcaataaag tcaaggtttt ctgggacatc gagtatttct ggccacccct gttagtttta 7380 ttcttcaata aacaagcttt ttggttacat tatgtttaga ttttcgctgg agaccggaga 7440 cttacgtcgc tgatggtcag tggcagacag gttctgtgat ggatcctgat ctgtaaatag 7500 agatggagtt gacctcggtc gacgatgatc atatatgacc ccgaaaatcg atagtagtgt 7560 tttctgagac gacttgtttg gcctacgaaa atttgattgg ctattttcat ctccattctc 7620 aatgccaatc aaattttcgt aattttagtg gggaatgatg taacaagtac gctttctaat 7680 ttaagattta tccctgtaat atattgatta aatgtttcca aattaaatag ctctgcatgt 7740 ccgttcgttt tttctcaata aatcagaaat aaattcaccg cattagatcg tcccttctgt 7800 agagttacca ttggcatacg cttgcataaa tgagttccct acagcagtta cacgcgccta 7860 agaaaagaac ccaagccagc ccaaattgtc aacgcgcgca tatttgcgat gtctctcggc 7920 ggacatcgac ctgccaaaaa tcctttcagg atccaatgac ctttcccgtt cggcttggca 7980 cactcctacc gaacttgggg agagaatttg ccttaacgtg ctaatgcggc cgttggaaag 8040 agaggtcgca atgccattga agtttcgggt ccaaattttg agaattttgc actcgcgaaa 8100 ttggacttcg tcccatttcg cgacctcgaa gccgaaggtc amaatttagc gggtatctga 8160 ggacaattcc cagccttgat cgtccacttc gtttcgaact gtggaagagg caagagtggt 8220 tgattacttg tcccgcgtcg tggtgtgttt gataaggtgt gaaattcgag aattctagta 8280 ggccccagta gctagttcaa ttgctcaagg ctagtaggat ttcctcgttt gggaatagtg 8340 cgtgtgtagt tatcaagaat atgggattcg 8370 // ID Gypsy-205_AA-LTR repbase; DNA; INV; 284 BP. XX AC supercont1.58; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-205_AA_; KW Gypsy-205_AA-I; Gypsy-205_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-284 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.58; Positions 2884304 2884021. XX SQ Sequence 284 BP; 81 A; 58 C; 64 G; 81 T; 0 other; tgtagtatcc gtaccagaat ttccagaaga acggccacac tgttcgaacg caaagtctat 60 tctcgaaatc aaatctgctt ccgttgcgtt gatgtggttc ctgttccttt tccgctattg 120 aatcgatgct acatgtgtgt gtgtgtgagg gtgtatgaga acgaatgaac ggatatattg 180 cacgaaaatg tgccactcga cgcagtttaa cgttaaccgg cgcagcaaac agtcgattta 240 gaaaataaaa gccgaaaatc agtccattgt gaagtttttc taca 284 // ID Copia-5_Cfl-LTR repbase; DNA; INV; 182 BP. XX AC AEAB01013273; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_Cfl_; KW Copia-5_Cfl-I; Copia-5_Cfl-LTR. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-182 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01013273; Positions 10073 9892. XX SQ Sequence 182 BP; 51 A; 47 C; 25 G; 59 T; 0 other; tgacgccatc taacagttaa cttgtaactc acgtgacaca ctcactagcc ttccggaccg 60 actcttagac acgttgtaac attcgttctc gttctcagat tttatacaca ataaagtatt 120 attatatctt aatctaatcc tgtgtacggt gtcttaccac gccctaaaga ttactgtcta 180 ca 182 // ID PNL2_SM repbase; DNA; INV; 2216 BP. XX AC . XX DT 01-MAR-2008 (Rel. 13.03, Created) DT 31-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Penelope-type retrotransposon from Schmidtea mediterranea. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; PNL2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2216 RA Jurka J.; RT "PNL_SM: Penelope-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 8(3), 372-372 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(84..329,333..1949) FT /product="PNL2_SM_1p" FT /translation="MPDLPNDLDDKIKSLENDLKNIIKKQTNLMKKHLSTQ FT KSLALNIKKLKNFKDFTILPSDKTNRLIALNTNHSSTNIWLTIHQKMILPA FT SSQITFNKKLSVLAKKQTNLEIRDLLLKCNCSEPLPSNMTMLPKDHKDPLK FT GRPLVSAVDTPSTTLSKILAEILKNLLNFVPCHLKNTNQFVSEISNLSLST FT KSEQQYFWGSLDVSNLYGSIPLTGENNVFTVAAFFFETLKAETDFADLSEK FT DFIEIIKLAINSDTVLIRGKPFKQIQGLAMGNNLSPILAIIYMNHIESKIR FT SFFGDKILFWKRYIDDIFIVSTIPLNLILSHANSTNTNIQFTLELPNSEGK FT IPFLDTLISSSQVSDTLTFNTDLFTKNLHSGHILPWESHVPLQRKIALLIG FT ERKRAMRNCSDITKKQETLENFKTRFLQNGYPKEFVEKYFLNEYSSKKNKK FT SHNTQKKHDKPIIYLRFPFLNNKTCNVIRSFIHNTDLPVHIRPTFTTAAPL FT KSQLRKSQKAQTSDLYHQDCLCRSPTKCFCFSKNIVYQITCMKCDAIYIGE FT THRTFRSRLNEHIKSKASNYHEHWEKVHKSAPLIHETKCQILQDAFKNTLQ FT RQAAETFFYQKTQTNHQYSTNVI" XX SQ Sequence 2216 BP; 872 A; 487 C; 255 G; 602 T; 0 other; aaaagcaaaa acgcatctag atcttcaaac attgctgaaa aatacaatca tcttctcaat 60 cctaaccgta gacatatggc aacatgcctg atcttccaaa tgatctcgac gacaagatta 120 aaagtttaga aaatgacctc aagaacatta tcaagaaaca aactaacctc atgaaaaaac 180 atctttcaac acaaaaatct ctcgctctta atataaaaaa acttaaaaac ttcaaagact 240 tcacaatcct cccttccgac aaaacaaacc gacttatcgc gctcaacacg aatcactctt 300 caacgaacat ctggctaact atacaccaat aaaaaatgat cttaccagct tcttctcaaa 360 tcacattcaa caagaaactt tctgtccttg ctaaaaaaca aactaacctc gaaatcagag 420 atctactttt aaaatgtaac tgtagtgaac ctttaccaag caacatgacc atgttgccta 480 aagatcacaa agatccatta aagggtagac ctcttgtttc tgctgttgac actccttcca 540 caactctttc aaaaattctt gctgaaattc tcaaaaatct tctaaacttt gtcccttgtc 600 atctcaaaaa cacaaaccaa tttgtttcag aaatctcaaa cctatctctt tctacaaaat 660 cagaacaaca atacttctgg ggatctcttg acgtttcaaa tctctacgga tccatacctc 720 tcactggaga aaacaatgtc ttcactgtcg cagcattctt ttttgaaact ctcaaagcgg 780 aaacagattt cgcagactta tctgaaaaag atttcatcga aatcatcaaa ctagccataa 840 actcggacac tgtcttaata aggggaaaac cgttcaaaca aattcagggt ttagccatgg 900 gtaacaacct ttcaccgatt ttagccatta tctatatgaa tcatatagaa tccaagattc 960 ggtctttttt tggggataaa atcctattct ggaaaagata cattgacgac attttcatag 1020 tctccacgat acctctcaac ctcatcttat ctcatgcaaa ctccactaac acaaatatac 1080 aattcactct tgaacttcct aactctgaag gcaaaattcc ttttttagat accttgataa 1140 gttcctctca agtttctgac acactcactt tcaacacaga tctattcact aaaaatctac 1200 acagtggaca cattctccct tgggaaagtc acgttccact ccagcgaaaa atagcattat 1260 taataggaga gagaaaaaga gcaatgagaa actgttctga cataacaaaa aaacaagaaa 1320 ccctcgagaa tttcaaaaca cgttttttgc aaaatggata tccaaaagaa ttcgtagaga 1380 aatatttttt aaatgaatac agctccaaaa aaaataaaaa aagtcacaac acacaaaaga 1440 aacacgacaa accaataata tacctacgct tccctttctt gaacaataag acatgtaatg 1500 taatcagatc tttcattcac aacacagatt tacctgtaca tattcgtcca accttcacca 1560 ctgcagcacc tttgaaatca caacttcgaa aatctcaaaa agcccaaaca tcagaccttt 1620 atcatcaaga ttgtctgtgc cgttcgccta ccaaatgttt ttgcttttct aaaaacatag 1680 tataccaaat cacttgcatg aaatgcgacg caatatacat aggagaaaca catagaacat 1740 tccgatcacg actcaatgaa catattaaaa gcaaagccag taattatcac gaacactggg 1800 aaaaagtaca caaatcagca ccactaatac acgaaacaaa atgccaaata ctacaagacg 1860 cgttcaaaaa cacattacaa agacaagccg cagaaacttt tttttatcaa aaaacacaaa 1920 ccaaccatca atattcaact aatgtaattt gaaaatttct ttgtttttgg gggataattt 1980 aaaaaattta aaactcaata ttaattaaaa ttaaatttaa ccaataaaat caatataaaa 2040 tacaaaatat ataaattttt ctcctatttt tttaaataaa taagtaatga tcgaaatcac 2100 ttgaaaaagg ctgtgccgaa aattcgtgaa taccttaaat aaatctgatt catagtcaaa 2160 ttttgtagaa aatattatgt cacttataaa gaaatcagag gtgaaaaaca aaaaaa 2216 // ID DNA4-4_CQ repbase; DNA; INV; 147 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-147 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 74-74 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >87% CC identity. 62 bp TIRs. 4-bp TSDs. XX SQ Sequence 147 BP; 44 A; 27 C; 31 G; 45 T; 0 other; tcaagcaaaa caatgtaaat gggccaatgt gagtttgtaa acaaagagtc tatcctgctc 60 acgtcagtgg atgtttacat taggagtgag caggatagac tctttgttta caaacccaca 120 ttggcccatt tacattgttt tgcttga 147 // ID P-1_TV repbase; DNA; INV; 9015 BP. XX AC . XX DT 26-OCT-2009 (Rel. 14.1, Created) DT 26-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE P DNA transposon from Trichomonas vaginalis - a consensus. XX KW P; DNA transposon; Transposable Element; P-1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RA O'Hare K. and Rubin G.M.; RT "Structures of P transposable elements and their sites of RT insertion and excision in the Drosophila melanogaster genome."; RL Cell 34(1), 25-35 (1983). XX RN [2] RA Hammer S.E., Strehl S. and Hagemann S.; RT "Homologs of Drosophila P transposons were mobile in zebrafish RT but have been domesticated in a common ancestor of chicken and RT human."; RL Mol Biol Evol 22(4), 833-844 (2005). XX RN [3] RA Smit A.F.; RT "P1_Cis - P DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX RN [4] RA Kapitonov V.V. and Jurka J.; RT "P-1_CR, a family of P autonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 162-162 (2006). XX RN [5] RA Kapitonov V.V. and Jurka J.; RT "P-1_NV - a family of autonomous DNA transposons from the starlet RT sea anemone genome."; RL Repbase Reports 7(7), 627-627 (2007). XX RN [6] RP 1-9015 RA Kapitonov V.V. and Jurka J.; RT "First examples of protozoan P DNA transposons."; RL Repbase Reports 9(10), 2162-2162 (2009). XX DR [6] (Consensus) XX CC This is a young family of P DNA transposons identified in the CC Trichomonas vaginalis genome. The consensus was derived from CC multiple alignment of ~20 copies of P-1_TV, which are less than CC ~2% divergent from each other. P transposase is encoded by a CC singe ORF. TIRs are 20-bp long (4 mismatches). The genome CC contains over 300 copies of a non-autonomous deletion derivate of CC P-1_TV, called P-1N1_TV. It is very likely that P-1_TV is still CC active. CC The T. vaginalis genome harbors several families of autonomous P CC transposons. In addition to P-1_TV, we derived also a complete CC consensus sequence of P-2_TV. All these transposons are CC characterized by 8-bp TSDs and unusual 5'-AAAGG and CCTTT-3' CC termini (5'-CA and TG-3' in reported P transposons from other CC species). CC P transposase is the main feature of P transposons that differs CC them from other superfamilies of DNA transposons. The P CC transposase core conserved in all known P transposons is ~400-aa CC long. It contains three D and one E residues, which are CC universally conserved and form a putative catalytic center CC (usually DDE or DDD in transposases from other superfamilies). CC The P transposase core profile derived [6] from highly diverse 40 CC transposases encoded by P transposons from major eukaryotic CC kingdoms (including protozoa [6], viridiplantae [4], and CC metazoans [1,2,3,5]): CC X(74-89)-h-h-h-D-E/A-h-X(67-116)-D-X4-N/Y-X(27-50)-D-X2-P-h-X-K-X2-K/R-N/T/S-X(166-297)-L/F-N/T/S-Q/T/S-X3-E-N/H/T/Q-h-F/N-X3-R-X. CC P elements are also present in numerous fungi (P transposase in CC fungi). XX FH Key Location/Qualifiers FT CDS 4165..6195 FT /product="P-1_TV_1p" FT /note="P transposase." FT /translation="MNVGFYIPLSDINELKKKIEELEIEIKGDRETFLVKF FT NQDIRKQIYKRSFETDVQVFPYTGTGKMHMYKAYLEDLDAIKAEMRRSLLN FT KNIGRKTKEIEEDNDDHQDRTYWNTEFPEHVPEFTRNIIHTIMENICRKDK FT RGYRYGGESELVDLAFLIHSKSPAAYDVLLEFCLPFPTPVTLYNRFHEDIM FT NMKQFVKHAELIPRMLESMNYHVENTPEYEWAIAIDAVALSTWSSKDQKGP FT VDENGEEIKYLFIFLGMPLNFDAPNVILHVIEHANGFASGIIHEIDAAIEE FT IRKHLRVRVIITDGDPGYDKHQDEFINEILQGNDPEEIFKRATAILKKGDQ FT VVWINDLIHMSKLERTRLLDATLKLLVHPSDLNTIVDVNKIRDAIELGDAL FT NDTSPLGRIKDKYPITIFSIRAVKALLEKNYYNEALFIAPLAFWYESIRNE FT AFNADTRCQMMWFAYLLLVHHCIQMRDRHIKTNLGVVKIDGNIYPVYYMGT FT WEVVKHLTATIICFQYFFSINSTFFNFSHLTTMTEEHLNGLLRVMAHFKYD FT LSTTETSIAECNLTLKIHKKYDIDWKTHSRDYQAGITFEDIYTMRQDSRQY FT EMCKEYAYALLSKCGFRYNSNSSVDVLLSWVNEINQFHSKILMSTHISNPI FT SGYSIFSRLSVKKDVFDNPTKKNHARKD" XX SQ Sequence 9015 BP; 3423 A; 1224 C; 1229 G; 3139 T; 0 other; aaaggtaact aaactccttt cgccgatccg atttgcccca tatttaaaat atagcttgca 60 tatgtagaag aaaagctaat tttgaagtac gcccgaaatc caccaacagg atttcgagct 120 actaaaaagg tacttttaaa tgtgattttt tttataaaaa attaaattta ttttaagatc 180 agactaaaaa ctaattttat ggaaaatcta tatcgacaaa tttattactt attgggtata 240 tagatttttt gattatggcg attatgtctg atatgcttcc aataagccaa aacagtgttt 300 tgggagtgat taaatacaaa aaaatctggc gatgctttac gtggcaaatg gtcgaatggt 360 tattacccct tactattggg taccggggac gctagttcga aactgatgca ctatattttt 420 ttacaaaatg cgtttattag gacagggtaa agaaaggttg ccaatataat atatttaaat 480 aaatacaact aatctagcat aaaaaaacta ttgaatattt gaattcaata aaaatttata 540 taaaatacat gtaaagcata aattactata taaatcaagc tgtattttgt tttaaacacc 600 gattgtttca tataatatat attcaactca agtttctaat taaggatttc atataaaatt 660 tagcatatga tcagcaacaa ttaaaattag tattacaatt aataaattta gtataaaaac 720 atatagattt agtatagatt ttatagctgt aataaaagaa ctattataaa agagtaattt 780 actgaaatgc accttatatt tataaacaaa acttataacg tgttagttac atatagagtt 840 gaaattcata gatagcatag agaagttggg atcttggctt aatggtcata atataaccaa 900 attaattcaa aatttgaaat tttaatatta ctgtaactat aatatatgga tatgttttat 960 gagatcataa taaatgacat gttaatgtaa aatttagaat aaaaaattat atttttttta 1020 aaaaaacatt gttcatgagt gttaattgga gaagtagtaa attagttcag tttgcaagtg 1080 atgatggtta atctaactat aattacaatc aatttttaaa atctttgatt tattttggtg 1140 ataattaaga gtgcagttgt tctatttata aactatttac tatagacggc actggtgata 1200 ataaaaatca agcaaaagcg catctgctgt tgctcgactg cataaaaatt gtagttaaaa 1260 tttgaatact ttacataact catttgtaag gtaaattaat attattaatt tatattaaga 1320 acaaataaat atgaaatgta tttttttcca aaaaaaaatt aagaaaaatg ccatttttat 1380 cattgttgtt aatgccccct aattcttgaa tgattttcgg gaaaaataaa tacgaaaaga 1440 gaaatcatag ctgccagaca accattatat gaaattcatt cagtttacaa agtttaattt 1500 ctttccacaa tagtacataa tatagtagta tattcggaaa tatgcatgct cgatatctcc 1560 aattcacatc tcatgcatct tatcggagat taagtaactt tatcttgaaa ctagaaaagt 1620 aattagcaag tagaatatat aaaaaatata atgttgtttt atattgataa aaccataaga 1680 atggctattc tagaatgaaa ttttagtacc tatattcagt gaaaggttta tttgtagttg 1740 aactttcaaa tgaaaactca aggatccaag aattgaattg cattaacaaa ccaataatga 1800 aatatgatat atacatatat attttaccca taaaaataga attaaatcga tatttaactc 1860 aaattgtcaa aatatcaaca aaataattta aaaatagata tatatataag agaaatatgt 1920 gattttggtg tttaaaaaag aaaaatattg cccaaacaag tgttctagct aaaaaggaaa 1980 aatgcattat ttcatatttc aatatcaatg ttaaatatta aaataatctg ccattattca 2040 gacagttaaa aaataaatac actataagtt taagctttag ctgtaaaata aacagtaata 2100 ttaaataaac attatattat atagtcggag cgattctata tatgtactct tatgccaact 2160 agttacaacc tagtaaaata catggaaata ttgtaatatc ctagcattgc tatcgagaaa 2220 tcaaaaaata taagcaaaag tcatcttata tttaaagatt tttcaagagc tttgtttgga 2280 aattagttca ttggaagaag tatatagata tgaaaaccca aagtattaag ttcgataata 2340 tatagttttt ttttatttta aaattgaatg catacaacat tgctcctctg gagtgaatta 2400 attagttcca actttttcaa gattctacag tgtaatatgc tccttggaat gtctcttgca 2460 tgaatgtttt tgtttaccat ctaaagaatt aagatttgag atacaataat ctttggaaaa 2520 gtagaggata caacaataaa gtatgttctt tcgatacatg atatcatttt gttacaaaaa 2580 tattcttact tactcatgat tcaaaaaaaa aggttgcaga catactaaaa tgttatttat 2640 gtggcttgta tttactagca tatgtaattt ggtgctaacg tataccatca atttcattat 2700 aacacagaca tatatattac tattagaatt ttattttaat tcaatgataa aatcaaacat 2760 tcctttttga aaatcgttct atcttgttat tatctttgaa tgctgcatca agaactaaca 2820 atttacaatt tttattcaaa ttataatata aaattatagc atattggttt tcaaagtatg 2880 tttgctaaaa gtaattgcgc ttatcgttta cccccgcatt aacacatata aatttgtatt 2940 aaaaaatgat atcaaatcaa aaatatggct tccttgctaa gaaaagcgtc aaatttatta 3000 tgtttctcct tagccgaaag tgtataaatt ggcagttaga tttatagtat cctaactgtt 3060 ttttgtaaag ttcgtcatat ctttaatggt actatttcag cgtgtacaat gaatacaaaa 3120 attctaagag tgccacataa aatagtactt tagacttgag ttaatgctac caagcaaatg 3180 ttaaaaagat gactttagtg ttttgataat caaagattaa aatcattatt gtctaaatca 3240 ataatcataa cgacagaagt taatgttttt tcatcagtgc tttaaatcca cttttaaaag 3300 tatacctcaa aaatcaaaat atcaaaaaaa aatctagttt ttatgaatct aaagctttct 3360 atagttcaca tgcaaaaacc tttagtttta tactttttta tggttttcat gagacaaaaa 3420 gttatgattt aataattatt tttaattttc cgctgttatc aacatagatg acaacttcat 3480 acacttagtt atctaatatc ttgactatta cactccttaa cccccaacat tcaaacaaaa 3540 acctttacgg atgcctaata tgaaattttc ttttttaaaa gacatttcta tcaataaatt 3600 atttctggta gttagacaaa gaccattttc taagttgttc agaactgttt tcgtgcaaat 3660 tatctatgaa tgggcttaac aagtttaatc tgaaagttat ttccaaaaca gaaatgcata 3720 tacaatcaaa agaatatata tttatgccaa taattgatgc tgatgtggtt ttgcaatttt 3780 acagttattc acatatcatt atgcattaaa ttttcatatt tgcttatttt tctatctttt 3840 atcttatgta tgatgaattt cataactatt ggatatgatt tcatcattca tattgcatat 3900 ttccattttg aaattttagg aatcttcaat tttggatgga tgtttataaa gtagctgttg 3960 taaatggaat ttaaattagt agtgacaaca tttggtcata acaaaattgt atcttgtcga 4020 ttacaaaaaa ttaatttcaa aataatttct acaagtgaag caaaaagata ttaaatatca 4080 ttaccaattc tattctagta tgaaattata gaaaatatta agtttatttt attcagattg 4140 gtctgacatt ctctcaaaaa gttaatgaat gttggattct atattcctct ctcagacatc 4200 aacgaactta aaaagaagat tgaagaactt gaaattgaaa taaaaggaga ccgagaaacg 4260 tttttagtta aatttaatca agatattcga aaacaaattt acaaaagatc atttgaaact 4320 gatgttcagg tattcccgta tacaggaacc ggaaaaatgc atatgtataa agcctaccta 4380 gaagacttag acgcaataaa ggctgaaatg cgcagaagtc tgttgaataa gaatattgga 4440 agaaaaacaa aggaaataga agaagacaac gatgatcacc aagacagaac ttactggaat 4500 acagaatttc cggaacatgt tcctgaattt acaagaaata ttatacatac tattatggaa 4560 aacatttgta gaaaagataa aagaggttat agatatggag gtgaaagtga attggtagat 4620 ttggcattct taatacatag caaatcacca gcagcatatg atgttctctt ggaattttgt 4680 ttaccttttc ctacgcctgt aacattgtac aataggtttc atgaagacat aatgaatatg 4740 aaacagtttg ttaaacatgc tgagctcatt ccgcgaatgt tagaaagcat gaactatcac 4800 gtagaaaata cacctgaata tgaatgggca atagcaatag atgcagttgc actctctacc 4860 tggtcctcta aagaccaaaa aggtcctgtt gatgaaaatg gagaagaaat caaatactta 4920 ttcatatttc ttggtatgcc cctaaatttt gatgcgccaa atgttatatt acatgttata 4980 gaacatgcca atgggtttgc aagtggaatt atccacgaaa ttgatgcagc aatcgaagaa 5040 atacgaaagc acttaagagt tagagtcatc attactgatg gagatcccgg atatgataag 5100 catcaagatg aatttattaa tgaaattttg caaggtaatg atcccgaaga aatatttaaa 5160 agagctactg ctatcttaaa gaaaggagat caagttgtat ggattaatga tctcatacat 5220 atgtcaaaac ttgaacgtac cagattactc gatgcaacac ttaagctttt agttcatcca 5280 tctgatctga atacaattgt tgatgttaac aagatcagag atgcaatcga attaggtgat 5340 gctcttaatg atacttcacc tctgggaagg ataaaagata aatatccgat cacaatattt 5400 tcaattagag ctgtaaaggc gcttctagaa aaaaactatt ataatgaagc attattcata 5460 gctccacttg cattctggta tgaaagcata cgaaatgagg catttaatgc tgacacaaga 5520 tgtcaaatga tgtggtttgc atatctgctc ttggtacatc attgcattca aatgagagac 5580 cgtcacatta agacaaatct tggtgttgtc aaaatagacg gaaacattta tccggtctat 5640 tatatgggaa cttgggaagt tgtaaagcat cttacagcaa caatcatttg tttccaatat 5700 tttttcagta ttaattcaac atttttcaat ttttctcatt tgacaaccat gacggaagaa 5760 catttgaatg gtttacttcg cgtcatggct cattttaagt atgatttgtc aacaaccgaa 5820 acatctattg cagaatgtaa tttaacactc aaaattcata aaaaatacga tattgactgg 5880 aaaacccatt cacgtgacta tcaagctggt attacatttg aagacattta cacaatgcgt 5940 caagattcac gtcaatatga aatgtgcaaa gaatatgcat atgccttgtt atccaaatgt 6000 ggatttagat ataatagtaa ttccagcgtt gatgtattgc ttagttgggt gaatgaaatt 6060 aatcaatttc actcaaaaat tttaatgtca acacatatat caaatccaat ttcaggttat 6120 agtatttttt cacgtttatc tgtcaaaaaa gatgtttttg acaacccaac caaaaaaaat 6180 cacgcaagaa aggattaaac gatttagatc ctcataatga caaatctgat cctcttgaaa 6240 atgatccatt atatgaagaa agccatattg ctcaattatt cccagaaaat gtcattttac 6300 catcaaattt tgccaagaag gcaaacaaat tcagcgattt tgaaacatac attcataata 6360 cattaagtag atttcataac gggatcacat atgagaatct agagtttgct attctaaatg 6420 cgaaacgtca tccaaaacac attgttgatc tcataaataa aggcaatttt tcacaaatct 6480 taaggagtac acttcgcaca atgacaattg aaaacaaagt gaattatagt catgtaacca 6540 cactctactt cgctgcatag ctctcaaaat agagctcctt tattataact atcataaatt 6600 tcaaatctca gattgaaata ttcgttaaat gatagttctt ttgtatattt gtatactatt 6660 ttgtttgtat taaacttgtt tgttattttt agtcacaaat tatatttttt aaatttaatt 6720 aaagtaaaaa ttcctttaat ggtaatgttc gaacattgat tacaagataa ttaaaatata 6780 tgacctcatt ataaatgaat tttaactcta gataattgag aaagctacaa attgtttcta 6840 aatcttgaaa ttattcgatg tcaatctaat ttttcttgtt atagcctact tctatggtgt 6900 acaggttgtc ataattacta ttttaaagtg taaaacatgc ttatatcaat caggacgtta 6960 aaatggaaaa gtacatacca agcaaatgtt tggatttata tgcgacagcc tttttaatgg 7020 atgtgacaaa gcgtttcaat atagattttg attaaagcat aactgcaaat tataatttga 7080 ctttatatgt accaataata atggcaatta tgagtttttt tatataaatt atatcacacc 7140 gcttctagta atttttagct taagcatatc atcggctaat tttcaaaaaa aaagaatttc 7200 aaattttaat ccaaattctt ttcttaaaat tcacaatcaa attacagctt atctttaata 7260 gtcttattca ttctatgctt ttttttcggc tccataaaag tcataaattt ttccatcata 7320 aatttgttag aaattgggtt accattgcca aacatttgga tagaaaaagc ccaataaatt 7380 atacccaaag catccaaaca tgaagatatc atataagaaa tatagtttaa ttcagtaatt 7440 ccctaaaatc taaaatgttt aagaacgttt gttcaatatc ttacaatcta attcacagat 7500 ttagcatatg tactacgtat caaatataca atgtgatctc tcaataagct tagaaggtta 7560 attattaaaa ttaaataaaa ttctacaatt agcgaaaaat aatacggcag ataatatgct 7620 tataggtaaa gaaattcaaa gagcattcct taaatttttt cgaaatttat caaaactcga 7680 aggtaaatca tcgagtgata tgcgaaaaaa acttttttat tctattaatg ataataaata 7740 tattagccat attaaaactt ttcatttttt aaaactcata gaaatacgtt tgatattctt 7800 cattaagaac atataaaaac gagtataatt atcataatct gttaatttac aatataaatg 7860 cttttctgct tgtagtaaat atcaaataag aaaataataa cattataaat ttttcaaata 7920 agaatatatt atctagcatc taagcatgaa gatatttaga atttggatat ctaaattctg 7980 cttagattac tttcaaaata taagtcaata atctttgatt ttattaaacg ctaatattgg 8040 atagtatatt ttcacaattt aattgtagtt agataaatga taattcatgt aactcattga 8100 tttcgatcaa aagttatatt caccttgtta aaaaaaaaat ataaaaaaac cgggtgctta 8160 ctgatttgaa catattgcct ttgaacccca cgacataacc agtaggctaa acgtacagtg 8220 atttttttta taaattctag tataaaatac ttcccgcagg gcatttgttt aattatttca 8280 cattctgatg acataatttt taatatcata gataatgaaa gcgaaaaata tataaaaatc 8340 gacaaacatg ttataatgaa ttgagagctt tacgtgtcaa aaaacacatt tcatactgat 8400 ttctataaaa ggtatcttta ttatagaagt gtaagctata tctgtttgtt tttttattat 8460 aacaaattca aaattctctt ataaggatta gatgaaatct gtattactaa atcattagct 8520 aaatgaaggt tcaaatagaa agtaaatatc aatttatatg atactgaaac caaatagttc 8580 ttttaataaa agataacacg ctgaaaatta ttgagtattt gcaaatctca aaaatctatt 8640 ggtgaatttg gctcaacttt taaaattata acatccatat caaatcataa aagcagacaa 8700 aaattaaagc caaacctatc ggccaatttt tttttgatat tttgattttt gaggtatact 8760 tttaaaagtg gatttaaagc acttatgaaa aaacattaac ttctgtcgtt atgattattg 8820 atttagacaa taatgatttt aatctttgat tatcaaaaca ctaaagtcat ctttttaaca 8880 tttgcttggt agcattaact caagtctaaa gtactatttt atgtggcact cttagaattt 8940 ttgtattcat tgtacacgct gaaatagtac cattaaagtt atggcggact ttcaaaaaat 9000 agtttggtca ccttt 9015 // ID BEL-30_CQ-LTR repbase; DNA; INV; 533 BP. XX AC AAWU01042468; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-30_CQ_; KW BEL-30_CQ-I; BEL-30_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-533 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 214-214 (2011). XX DR Genome; AAWU01042468; Positions 917 1449. XX SQ Sequence 533 BP; 167 A; 117 C; 103 G; 146 T; 0 other; tgttgacgcc gctgctaggc aacaccgcta aacgcaccgc gtcgagtcct acagagaagt 60 gaggaagtaa acaatacccg agaaacgacc gctccggttg acagtggtga cagtgacgga 120 ccagcggaag tgatttgaac acacacacac acatacacac acacacacac acatcatatc 180 ttttgaacca agcagcagtg aaaccgcgag tagttcacga ttctttaatt ttgtaatttc 240 cctacatttt cgggccgatt gcctttgatt agcttatttc cagtttcgca aacgtgagtt 300 gattaattag acttaaattg ccacttgaat ttatttgtat ttaatgatac aggtacacgg 360 agattgtagg agataaatga gatttgtttg gccgatctgg ttaggaagaa aaaccaacta 420 atgtaagtcg ctactactaa aatgaagttc tttcctaaca tttatccaat aaatttcagt 480 tttgagcacg actcaacctc tgctacaaaa agtttttcta ttcggccaca aca 533 // ID Penelope-8_HM repbase; DNA; INV; 2084 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2084 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2098-2098 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 160..1920 FT /product="Penelope-8_HM_1p" FT /translation="MYKISKDDYNHLLKNAITSIYKKADPKLKNVVNKEGK FT QILKNHDIFNRIEINGTSDCFITLKDHKDNFLNNPTVRLLNPAKNEVGRLS FT KLILSNINSELRXKLCLNQWQNTQNVINWFQKIDDKHLRKFLVFDVNDFYP FT SIDEKILNNAITFAEQHITIDEQSKSIIHHARKSFIFNDGVSWIKKKAGLF FT DVTMGAYDGAEVCELVGIFILDQLSQFYNKNDFGLYRDDGLAVFKNKNGHQ FT MEQIKKHVVQVFKRNNLNISINCNLKIVNYLDLTFNLTQNSFQPYCKPDNK FT LSYVHADSNHPPNIINNLPKNIELRLSANSSSEVIFNKSTHLYEDALKQSG FT YDYKLAYKPSITNVPKNHHKRSIIWYNPPFSKNVTTKIGQRFLTLIDKHFP FT KDHNLHKIFNRNTIKISYSCMPNIKSIINSHNKNILYGDVKLSEKTCNCIK FT KSLCPLNNNCLSNNIVYQATVSSNKPQYKDKVYIGISETSFKLRYANHLKS FT FNIKKYKNDTELSKEVWDLKDNNFTSSIKWNIIKRCKSYNPASKICKLCLN FT EKFEILFYKGDNLLNKRGEVVSKCRHKNKFLLSLFDSGD*" XX SQ Sequence 2084 BP; 818 A; 359 C; 272 G; 631 T; 4 other; aattaaagaa atgtcactat ttgaaaaaga tataataaat cttgttaaac tatcaaattt 60 cgaaatctca acaacacatt tcaagaaaag gtaaataatg acattaaatc tttacgtaaa 120 tctaataaaa ctttaactcc agctgacaaa acwtccaata tgtacaaaat atctaaagat 180 gattacaacc accttctaaa aaacgcaatc acttctattt ataaaaaagc cgatccaaaa 240 ctaaagaacg tagtaaataa agaaggtaag caaattttaa aaaatcatga tatttttaat 300 agaatcgaaa taaacggaac ttctgattgt tttataactt taaaagacca taaagacaac 360 ttcctaaaca accctactgt gcgtctttta aatcctgcaa araatgaggt aggaagactg 420 tcaaagctta ttttatctaa tatcaactck gaactaagaa mcaaactctg tttaaaccaa 480 tggcaaaata cgcaaaatgt tataaattgg tttcaaaaaa tcgacgataa acaccttcgt 540 aaatttttag tatttgacgt aaatgatttt tatccgtcaa tcgatgaaaa aatacttaat 600 aacgctatta cttttgccga acaacatata acaatagacg agcaaagcaa atccattata 660 catcatgcaa gaaaatcttt catttttaac gatggcgtat catggattaa gaaaaaagca 720 ggattgtttg atgttaccat gggagcctat gatggcgcgg aagtttgtga actagttgga 780 atttttattt tagatcaact ttcgcagttt tacaacaaaa atgatttcgg tctatatcgg 840 gatgacgggt tagctgtttt caagaacaaa aatggccatc aaatggaaca aataaaaaaa 900 catgtcgttc aagtttttaa acgtaacaat ctcaacatct ctatcaattg taatcttaaa 960 attgttaact accttgattt aacttttaat cttactcaaa attcattcca accatactgc 1020 aagcctgata ataagctaag ttacgtacat gctgattcta accatccacc taacataata 1080 aacaatcttc caaaaaatat cgagctaaga ttgtccgcca actcatcaag tgaagttatt 1140 tttaataagt cgacacatct ttatgaagac gcactaaagc aatcgggata cgattataag 1200 ttggcatata aaccaagtat aacaaatgtt ccaaaaaatc accacaaacg aagtataatc 1260 tggtacaacc ctccatttag taaaaatgtt acaaccaaaa ttggtcaacg ctttttaacc 1320 ctaattgaca agcattttcc taaagaccac aacctacaca aaatatttaa cagaaacact 1380 attaaaataa gctactcctg catgccaaac attaaatcta tcatcaactc acacaacaaa 1440 aatattttat acggcgatgt aaagttatca gaaaaaacct gcaattgcat taaaaaatct 1500 ctttgtccac tgaacaacaa ttgcttgtct aacaacattg tttaccaagc caccgtaagc 1560 tcaaataaac ctcaatacaa agataaggta tacattggga taagcgaaac ctcttttaaa 1620 ctacgttatg caaaccatct aaaatccttc aacataaaaa agtacaagaa cgacactgag 1680 ttatctaagg aagtatggga tttaaaagat aacaatttta catcttcaat caaatggaat 1740 ataattaaac gctgtaaatc atacaatcct gcgtcaaaaa tttgtaagct ttgtttaaat 1800 gagaaatttg aaatattatt ttacaaagga gataatctgc taaataaaag aggcgaagta 1860 gtttcgaaat gtagacataa aaataaattt cttctttcat tatttgatag cggagattag 1920 ttttccatta cgtcttcttg tactaagacg tcagaactca ttctactgta gtttgtaatt 1980 tttaaacggt tttttatatt tttgattttg tacttcatgg ctgatgattg ccgataggca 2040 tgaaacttta agttccatta aaagttgttt tttctatatt taag 2084 // ID Tc1_Ele12 repbase; DNA; INV; 1717 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous Mariner/Tc1 DNA transposon family from Aedes DE aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Tc1_Ele12. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1717 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1717 RA Kojima K.K. and Jurka J.; RT "Mariner/Tc1-type DNA transposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~96% identical to consensus. TA TSDs. This CC consensus is ~97% identical to the original sequence in [1]. XX SQ Sequence 1717 BP; 617 A; 286 C; 318 G; 496 T; 0 other; tacagtgagt cacagtgaaa atcgtccacc ttgaacatta ccagaaaagt aattcgtact 60 attcatgtat agaacaagaa ttcaaattaa atggctcgtt acgttagttc ggcaagggtc 120 ttgataattg tatgcaattc aagtcattga atattaaaag aataaattaa gctttaaatt 180 tgtaaaaata atcaataata aggtcacagt gaaaatcgtc caccttgccg ttatgcatga 240 tttttggggt aatattataa agttttcatt catatcattt ctttaactcg cgcacacaca 300 tacattcaag cacgatctgc tatgcaagaa taaaaaaggt ttagtttttt agagtatttc 360 cattaaaatg ggacgaaata aacagacatc gtcagatata agaaaacttg tgataaaact 420 acacaaaaat ggcaaaaatc aactgaaaat caccaaattt gttggaaaag ggtgtactac 480 cgtatagagt ataataaatc gctacaaaat taaccattga agtaaaatca aacccaaaac 540 atcgttgaag aaaattatca atggagcggt tgagcgttag cttcttcgcc aggttgatgc 600 tgaccccaag ataagtgcac cgaaactcgc aattgaagcg aaaaagtatc tcaggaagca 660 ggtatctcag aaaaagtatc tcataaagcc agaaactatg cgtagagtgc ttcgaaaaca 720 taacgtttca tggccgagtt gcaagtcgga agacgtttgt atcaccgaag tacgtgaaat 780 gtcgtatgga gtttacagaa acacacgttg gaaaggattt caacttctgg aagcacttac 840 tattcacaga tgagagcaag tttaatattt tcggatctga cggcagggtt atagtttgcc 900 ggaaaccgaa tgaagacctg aaaatgaaaa atctttgtga aactgctaag catggtagta 960 gttccgtgat agtttgaggc ttcatgccag catctgatgt tagtaactgg cactttatcg 1020 acgggatcat ggctcaatac gtctatcttg attcattgaa gaagaatcga aagcctagcg 1080 taaaaaatgg cttttaattt acctatctac cttatctcaa aaaaatgggc attacatcta 1140 ccaggataaa catctgcaac atacagcggc aaaaccgaaa gcttggttaa aaaatgccca 1200 tcaatcatca atctgccgag acaatctaca aatctcgaca ttatcgagaa cttgtggcat 1260 ctgttggatg tttaaatcag gaaacatcaa ataacgagca gatatacttt gaaaagtgcg 1320 cttactattg agtggattaa gatcgctaaa acttaaactg agaaacttgt gcgatctatg 1380 ccaaatcgtc tcaggaaagt tttaaagagc aaaggcttac ctactcgata ctagtttatg 1440 ggttgacacc tgaaaaaatg acaattcttt tagatcaact ataaattgat atcggtggac 1500 gattttcact gtgacttata tttgaggttt aaatttactt tattattttt cgattccatt 1560 ttgtttttaa tatttataaa aacagtgcca caatgaagta atggaaatat gcaaataatt 1620 gtatgaatta gaaaaccaca gcttttttat atataataat aacagaactt tgaaaggaaa 1680 atatgagggt ggacgatttt cactgtgact cactgta 1717 // ID BEL-28_AA-I repbase; DNA; INV; 6058 BP. XX AC supercont1.281; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-28_AA_; KW BEL-28_AA-LTR; BEL-28_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6058 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.281; Positions 495561 501618. XX CC 'CTAAG' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..6058 FT /product="BEL-28_AA-I_1p" FT /translation="MRAESDAHDCGACDKPNSADVGMVACDGCSVWYHYTC FT AKVSPGVQQRSWRCCKCPPEMHPEVTGAKKKGGKKQPASLTVLGAASSENP FT KSNNASQKKTSEKSKNSEKSRTLVVPNLAASDNTTPKKHPEVLAVEKSTHG FT EMRSSKSSTSTARARAQLALQRLEAERRLEEQKLKEERERLEEERIRLEKE FT RQLKEQEHAIKAKELAMQEKYLREKFELEEQIADDDSSRKSSVLSRKDRTS FT AWLKSQHELSKQEDKSVSQYSEWSNLANHLAGPNHHQRPLELEVDRRQQEV FT LGNPEIVPNLRAHRDAMAVGHNQSDGNRFRRPYDAGRSNASQSPSLHELES FT AAIGLQPVSHGAGPNSEQIAARQIWPKKLPVFSGDPEEWPIFVHSFETANV FT ACGFSDVENIIRLRECLRGPARDAVVTKLMFPQSVNAIMETLRRLYGRPEL FT LVKNLLDKVRRAEAPKPERLESLINFGLTVQQLCDHLEAANLKGHLSNPTL FT LGELVEKLPASIKLEWARFKRAYAEPTLKHFGAFMEELVYDASEVTSPIQQ FT KTPVTKTEKDKPKEKGHVFSHEDAAEVQNHREERQPCPICGKTDHRVRNCE FT RFQQLDLQARLKAVDRYKLCEICLFDHGQWRCRSRIRCNVGNCRDRHHPLL FT HSSGRGPVQEQRQRQFRASECNAHERSQRSVLFRIIPVTLFNGNRKCETFA FT FLDEGSSLTLIESSLARQLGATGVPEPLELRWTSSVKRNEESSKRVDFEIS FT ARGQLQRYVLKNAHTVGELNLPSQSLAIDDLSERFPHLRNLPVSSYTEAVP FT RILLGLENLSLFAPLDSCIGQPGEPIAVKSLLGWSVYGPEANAQPRKGFVN FT LHECNCGADKELNDLVRQQFMMEDKMIAAIPFLESDDEKRARELLENTTKF FT VDGRYETALLWKADDIDLPDSLPMAMKRLKSFEAQLAKDSDLRENVNQQIV FT DYIQKGYVHKATEEELREINRRQVWYLPLGLVTHPKKQKKRLVWDGKAQVN FT GISLNSQLLKGPDLLVSLPSVICKFREKRIGFGGDIREMFLQLRMRTADKY FT FQCFLFRFDPRQPPEVYIADVAMFGATCSPCVAQHVLRVNADKWADEFPLA FT ATAIKNKTYMDDYYDSADTPEEAATLAVQVKTIHARGGFEMRNWVSNCEEV FT LKKLGESATVEPRPLQSTTEARWERVLGMLWHPKSDTLTFSTDLGEQLLPY FT TLGELRPTKRIALKIIMSLFDPLGLLAPYLIHGRALIQDLWRSGVQWDEKM FT RDEEFEKWTRWVELLPAISKLSIPRYYFIEANRLPHSMLQCHVFTDASEIC FT YGAAVYFRTVDGSGRVQCSLVMAKSKVAPLKHLSIPRLELEAAVLGAKLLH FT TVQTNHSLQPHEVYLWTDSSTVLSWIRSDHRRYKQFVAHRIGEILSLTETE FT CWRWVPSKDNVADCLTKWVRDTEPDCNSRWFRGPAFLYNSEEMWPRQRVKT FT NTAEELRSSYLLAHIFLPGRMLDVRRFSKWSVLLRTVAYVYRFIGNCRLRI FT SRSPIETIRATKKQEKLLKCSLNASIVPLKQEEFLRAEQFLWRMVQGEHYP FT DEVRTLLKNRNQPIEKWIAVERNSPLYRFSPFADELGIIRMEGRTVDAAYA FT DFDTRCPIILPKDSDITRLLLDEYHRRYGHANKETVVNEVRQRFQISHLRT FT AVDSTSRNCQFCKVNKCKPRPPRMAPLPEQRLTPNVRPFSYVGIDYMGPLE FT VSIGRRKEKRYVAVFTCLVIRAVHLEVSYDLSSESCIMAIRRFTRRRGSPV FT QIFTDNGTNFVGASRELQMQIDLECAGTFTDARTKWSFNPPSAPHMGGVWE FT RMVRSVKEAMTMLDDGRKLTDEILWTTLVEVEGLINSRPLTYMPQDLDNPE FT ALTPNHFIFGCSSGAHEPLEPPVDLGQTLRSSFLRSQQLAEVAWKRWSKEY FT FPAINRRSKWLDEMRSLKVGDMVYVAEGKRRSWIRGIVDAVICGNDGRVRQ FT AIVRTASGMLKRPVVKLAVMELGGSTEDPPLDPRGGG" XX SQ Sequence 6058 BP; 1687 A; 1402 C; 1622 G; 1347 T; 0 other; aactcaaaga tttgacgccg tgatgcgagc agagagcgat gcccacgatt gcggtgcttg 60 tgataagccg aattccgccg acgtaggcat ggtggcatgc gacgggtgca gtgtatggta 120 ccactacact tgtgctaaag tgtcaccggg agtccaacaa cgatcgtgga gatgttgtaa 180 gtgcccaccc gaaatgcatc cagaagttac cggagcgaaa aagaaaggag gtaagaaaca 240 gccggcgagc ttgactgtcc tcggtgctgc atcgagtgag aatccgaagt caaataatgc 300 cagccagaaa aagacctcgg agaaatcgaa gaactccgaa aaatcaagaa ccttggtcgt 360 tcccaatctt gcggctagcg acaacactac tccaaagaaa catccagaag ttcttgctgt 420 tgagaaatcg acccatgggg agatgcgatc ttccaaatcc agcacttcta ccgccagagc 480 tcgagcgcaa ttagcattgc agcgattgga agccgagcga cgtttggagg aacagaagtt 540 aaaggaagaa cgggagcgat tggaagaaga gcggattcgt ctggagaaag agaggcagtt 600 gaaggaacag gagcacgcta tcaaggccaa ggagcttgct atgcaggaga aatatttgcg 660 ggagaaattc gagctggagg aacagatagc cgacgatgat agcagtcgca aatcgagcgt 720 gctcagtagg aaagatagga caagtgcctg gttgaaaagc cagcatgaac tgagcaagca 780 ggaagacaaa agtgtgtcgc agtattcgga gtggtctaat ttggcgaacc atttggcagg 840 gccgaatcac catcagcgtc cgttggaatt ggaagtggat cggagacagc aagaagttct 900 gggcaatcca gaaatagttc cgaatctccg ggcccaccgt gacgctatgg ctgtaggcca 960 taatcagtcc gacggtaatc gtttcagacg tccctatgat gcaggtagat caaatgctag 1020 ccagtcgccg tcgttgcatg agctggaaag tgcggcgata ggattacaac ctgtttcaca 1080 cggggcgggg ccgaatagtg aacagatagc tgcgagacag atctggccta agaagctacc 1140 cgtattctcg ggcgaccctg aagagtggcc gattttcgtc cacagctttg agacagccaa 1200 cgtcgcgtgt ggcttttcag acgttgaaaa catcattcgt ttgcgagagt gtctgagagg 1260 cccagcgcga gatgccgtcg ttacaaaact gatgtttccc cagagcgtga atgcgataat 1320 ggaaacgttg cggcgattgt acggaagacc tgagctccta gtgaagaatt tgctagataa 1380 agtgcggcgt gccgaagcgc cgaaaccaga acgtttggaa tcgttgatca actttgggtt 1440 aacggtacaa cagctgtgcg accatttaga ggccgcgaac ctcaaaggcc acctgtccaa 1500 tcccacgcta cttggtgagc tagtggaaaa gctacctgcg tcaataaagc tcgagtgggc 1560 gagattcaag cgagcgtacg cggaacctac tctcaaacac ttcggagcat tcatggaaga 1620 actagtgtat gacgctagcg aagtcacgtc gccaatccag cagaagaccc cggttacaaa 1680 aaccgaaaag gataagccaa aagagaaggg acacgtattt tcccacgaag atgctgccga 1740 agtacaaaac cacagggagg aaagacaacc ttgtccgata tgcggtaaaa ccgatcatcg 1800 cgtacggaac tgtgaacgct tccagcaact ggatctacaa gctcgcctta aggccgtcga 1860 tcgttataaa ttgtgtgaga tttgtttatt tgaccacggt caatggagat gccgttcaag 1920 aattcgatgc aacgtaggaa attgccgcga tcgtcatcac cccctactcc atagttcagg 1980 gcgtggtcca gtgcaagagc agcgtcagcg acaatttcga gcctcggaat gcaacgccca 2040 tgagcgatcg caaagatcgg tcctgtttag gatcattccg gttacgctgt tcaatgggaa 2100 ccgcaaatgc gaaactttcg ccttcttaga cgaaggctcc tccttgacgc taatcgaatc 2160 aagtttagcg cggcagcttg gagcaactgg cgttccagaa ccgttggaac taagatggac 2220 gtcaagcgtg aaaaggaacg aggaaagttc taaaagagtg gattttgaaa tttccgcaag 2280 aggacagctg cagcgttacg ttttgaaaaa tgcgcacact gtgggagaac taaatcttcc 2340 gagtcagagc ttggccatcg acgatctgtc cgagagattc ccacaccttc gaaacctccc 2400 tgtttcgtca tataccgagg ctgtccccag gattctccta ggattggaaa atcttagtct 2460 gtttgcacca ctggatagtt gcatcggcca accaggagaa cccattgcgg tgaaatcgct 2520 ccttggctgg tcagtttatg gtccggaagc aaacgcacaa ccgaggaaag gattcgtgaa 2580 cctacacgag tgcaactgcg gcgcggacaa ggaactgaac gatctagttc ggcagcagtt 2640 catgatggag gacaagatga ttgcagcaat tccctttctc gaatcagacg acgaaaaacg 2700 cgcccgcgaa ttgctagaaa acaccaccaa atttgtcgac gggagatacg agaccgctct 2760 gctttggaaa gccgacgaca tcgatcttcc cgatagcctt cccatggcaa tgaagcggtt 2820 aaaaagtttt gaggctcaat tagctaagga ttccgatctg cgagaaaatg taaatcaaca 2880 aattgttgac tacatacaga agggctacgt tcacaaagcc accgaagaag agttgagaga 2940 gatcaacaga aggcaagtgt ggtacctacc gttgggtttg gtgacgcatc caaagaaaca 3000 gaagaagagg ctcgtgtggg acggaaaggc gcaagtcaat ggaatttccc ttaactctca 3060 gttattaaag ggccccgacc tactagtgtc ccttccatcg gtgatctgca aattccgtga 3120 aaagcgcatc ggatttgggg gtgacatccg agaaatgttt ttacagcttc ggatgagaac 3180 tgcggacaaa tatttccagt gcttcttgtt tcgcttcgat ccacgacaac ccccagaagt 3240 gtacatcgca gatgtagcga tgttcggcgc cacatgctca ccgtgcgtcg ctcaacatgt 3300 gttgcgagtg aacgccgata agtgggcaga tgaatttcca ttggcagcga cagcaattaa 3360 aaataagacc tatatggacg attattacga cagcgccgat actcccgaag aagcggctac 3420 gctggccgta caagtaaaaa ccatccacgc ccgtggaggg tttgaaatga gaaattgggt 3480 aagcaactgt gaagaagtgt tgaagaagct cggagagagt gccactgtcg aaccgcgtcc 3540 tctgcagtct acaactgaag cgaggtggga acgagttttg ggaatgttat ggcacccgaa 3600 atcggacaca ctgacatttt caacggattt gggggaacag ttgctaccct acacattagg 3660 agaactacgc ccaacgaagc gcattgctct aaaaatcata atgagcctgt ttgatcctct 3720 tggattatta gcaccgtatc tgatccacgg acgtgctttg atccaagatc tatggagaag 3780 tggggtccaa tgggacgaga aaatgagaga tgaagagttt gagaaatgga ccagatgggt 3840 ggagttgctg cccgctataa gcaagctgag cattccacgc tactatttca tcgaagccaa 3900 tcgtcttccc cactccatgc tgcaatgtca cgtttttacc gatgccagcg aaatttgcta 3960 tggtgccgcg gtgtatttcc gcactgtaga cggttccgga cgagtgcagt gctcactagt 4020 tatggcaaag agtaaagttg ctcctcttaa acatttgtcg atcccgcggt tggaattaga 4080 ggcagctgtg ttaggtgcga aattgcttca caccgttcaa acaaatcact ccttacaacc 4140 tcacgaagtt tatttgtgga ccgattcttc gaccgttctt tcatggattc gctcggacca 4200 caggcgttat aagcagttcg tcgctcatcg tatcggggaa attctgtctt tgaccgaaac 4260 ggaatgttgg cgatgggtac cgtcaaaaga caacgtagct gattgcttga cgaaatgggt 4320 gcgcgacacc gagcctgact gtaacagtag atggttcaga ggtccagcgt ttttgtacaa 4380 ctccgaagag atgtggccgc gtcagcgtgt gaagacgaac actgcggaag aactgcgttc 4440 aagctacctt ctcgctcata tctttctacc cggtagaatg ctagatgttc gtagattttc 4500 aaagtggagt gtgctattgc gaactgtggc atacgtttac cgatttattg gcaactgtcg 4560 tttacgtatc tcaagaagcc cgatagaaac cattcgggcg acgaagaaac aggagaagtt 4620 gctgaagtgt tcgctgaacg cctctattgt accattgaaa caagaggagt ttctacgggc 4680 agagcagttt ctctggagaa tggtgcaggg cgagcactat cccgacgaag tacgaactct 4740 gctgaagaac cggaatcaac cgatcgaaaa atggatagct gtggagagaa acagccctct 4800 gtatagattt tcgccattcg ccgatgaact cggaatcata cgaatggaag ggcgaaccgt 4860 cgatgccgct tatgctgatt tcgacacaag atgcccgata attctcccca aagatagtga 4920 catcacacgc ctcctcctag acgagtatca tcgtcgttac ggtcatgcga acaaagaaac 4980 cgtagtaaat gaagttcgtc aacggtttca gatttcacat ctacgaacag ccgtagacag 5040 tacctcgcgc aattgccagt tttgcaaggt aaataagtgt aaacctcgtc ctcctagaat 5100 ggctcccctg cccgagcaac gattgacacc gaacgtgcga cccttcagtt acgtcggtat 5160 agactacatg ggaccgctgg aagtgagcat cggtcgacgc aaggagaaac ggtacgtggc 5220 cgtattcacg tgtcttgtca tacgagctgt gcacttagag gtttcgtacg atctctctag 5280 cgaatcgtgc ataatggcga tccgaagatt cacacgcaga cgcggatctc ccgttcaaat 5340 cttcacagac aacgggacta attttgtcgg tgccagtcgt gagttgcaga tgcagataga 5400 tctagagtgc gccggtacat tcactgatgc caggaccaaa tggtcgttca atcccccctc 5460 cgctccccat atgggcggag tgtgggagcg catggtgcgg agtgtcaaag aagccatgac 5520 aatgttggac gatggacgaa aactcactga cgaaatcttg tggactacgt tggtcgaagt 5580 ggaaggattg atcaattcac ggcctcttac ttatatgcca caagacttgg ataatccaga 5640 ggctttaacg cctaaccatt ttatctttgg ttgttcgtct ggcgctcatg aaccgctaga 5700 accaccagta gatctgggac agacacttcg cagcagtttt ttacgctcac aacaactagc 5760 ggaagtcgcc tggaagcggt ggtcaaagga atatttccca gcaatcaata gaagaagcaa 5820 gtggttagat gaaatgagat ctctgaaggt tggtgacatg gtttacgtgg cagaaggcaa 5880 acgaagatcc tggatcagag gcatcgtgga tgcagtcatc tgcggtaacg atggaagagt 5940 ccgacaagcg attgtgcgaa cggcgtctgg tatgctgaaa cggcctgtcg tgaagttggc 6000 ggtgatggag ttgggtggat caacggaaga ccctcccctc gatccacggg gcggggga 6058 // ID Gypsy-91_AA-I repbase; DNA; INV; 6204 BP. XX AC supercont1.249; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-91_AA_; KW Gypsy-91_AA-LTR; Gypsy-91_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6204 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.249; Positions 129693 135896. XX CC 'AGTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 488..2035 FT /product="Gypsy-91_AA-I_2p" FT /translation="MAHSIGNEDIADNLVDFDSETSPVMNINKMSQRQVRR FT LTISPTFRDRLNEVYENKAKQTADSVVENQRDKREREKNPDNERQIVPPPP FT PPAMQREKVTRGENLDSIREPIVGEKYTGTKKKISSVVQEQNGAPDLDLNF FT NPEVILVEDQKDSVEFLPPGFNITSQFSNPALTNNPLFFNLPNRSSQNRKS FT LPVSQWPIKYAGNDNGIGLNLFLRRVEFFAQSEKMSKEELFESAHFLLVGP FT AQDWFVAKWPILRNKNWDYFIQALRHQFLPSNIDHYIKVRSFSMFQAKAEL FT FSNFLVRMEQFFLCRTTPMSEEDKFDIMWHTMRPIYRDRLALVDVKDIHTL FT EQLCTRIDNNNESIMNKFVLSFENQKINEINFNSNPRTNLHSIVGPQSNNS FT NQTNQQRVQQHNQNNSRNNQSNSNQNFNPQGNRPRNNSNYNNNQIIQQEYQ FT PDCGWRELSLDNILRHYKVPDRRICFNCRKFGHHFTKCFSRRNVFCCICGL FT PEFHYEECPFCEAKNQRRGD" FT CDS 2404..5235 FT /product="Gypsy-91_AA-I_1p" FT /translation="MQIKGFVHVPFTIKQTTRILPCLVVPELGARCILGMD FT FFKLFNIKLTFEDVEVFDTLANYNVELNNEIQPASHLLDAEETELLESVKS FT TFKISEPGTLEACNVMKHSIELTDKNAIHLNPHPWSPTIQKKVNEEINRLL FT DLDIIEPSNSNWALQVVPVTKENGDMRLCLDARKLNAKTVRDAYPLASSMR FT ILNNLGKNRYFSVLDLKESFLQVKLAEDSKKYTSFKIIGRGLFQYKRLPFG FT LINSSATMSRILDRVLKEGLYEPFIFSYLDDIIIATRTFKDHIKYLKIVAD FT CLREANLSVNLAKCKFCLKRIKYLGFILSENGYQPNPERVTAISKFARPQT FT PKEIRRFLGMAGYYRNFIPNFSGISAPISDLLKGKPKKIMWNDKAEQAFIK FT LKECLISEPVLVNPDWAKEFTIQTDASDVAIAGILTQELDGKEHIIAYFSR FT KLSSCEKKYGPTDKEGLAALEAIEHFRSYVEGAHFTLVTDCSAVTFICNSK FT WRPSSRLSRWSVKLQEFDMTIKHRKGRDNVVCDALSRAVCAIFDDSTSKWY FT NELKSKVSTTPEKYQNFKIEKGDLYKFVTSRSSLDMGRLEWKVVVPPDKMT FT QLVDEEHAKLMHLGADKTLERIKLKYYWPKMKSDVKRILSKCGICKQSKHS FT TIATVPPMGEQKDATRPFQMIAMDYISGFVRSKSGNSDLLVCLDVFTKYVR FT LFPVKKISVESLTRLIKCDWFLKFGAPQVLISDNAVTFLANKFQDLLSKFH FT VHHFKNSRRHCQNNPVERVNRVILACIRTYCQNDHRIWDSRIPEIEFAINN FT TKHSSTGFTPFFLVHGYESIVDGRDHLQDRYTSDPSSDQFIQRRTEVMGTV FT YEEVVKNNKKQFEKYKKNYDSKHKGLPPTFTIGQKVYKKNFKISSAVDHYS FT AKLGPVYVPCKIIARRGATSYELEEGRNLGVFAAQDLIPE" XX SQ Sequence 6204 BP; 2020 A; 1053 C; 1268 G; 1863 T; 0 other; aatggcgccc aacgtaaatg taactgtaca gtggattacc ctcttgacga agtttttcaa 60 ttatggagtc attcgcttat aacaaagcag gttttgaatt ttcatgtgca atggtggatg 120 gcttttgtaa atacctttag aataattttc tttgaattaa accttttgaa cttgtgttga 180 gtttcttaaa aggaattaat aaataaattt gaattttgcc ttgcattttt ggagttttgg 240 caaggagttg taaatatgta aatagataat tagtattaat gaatagaatt agtatattga 300 attgaattta gtggagaatt tataagtact ttttcccatt tttatttacg tttcattgaa 360 tttccataca tttttccata ttttaacaca gtttcagttt ttaggataat gaagagattg 420 gccgatctca aaagtaatcg ttttgtcgtt aattgaatta aagtgaaatt aaagtgaaag 480 gacactaatg gcgcatagca taggaaatga agatatcgcg gataatttag tcgattttga 540 tagtgaaact agtccggtga tgaatataaa taaaatgagt cagcggcaag ttcgtagact 600 cacaatctct cccacatttc gggacaggtt aaacgaggtt tatgagaata aagcgaagca 660 aactgcggat tcggttgtgg aaaatcaacg ggataaacgc gagcgcgaga aaaatccgga 720 taatgaaagg caaatcgttc ctcccccgcc gccaccggca atgcagcgtg aaaaagttac 780 gcgcggtgaa aatttagata gcattcgcga accaattgta ggtgaaaagt acacgggtac 840 taagaagaag atttcgtcgg tagttcagga acaaaatggt gctcctgatt tagatttgaa 900 tttcaatcct gaagtgattt tagttgaaga ccagaaggat tcggttgaat ttcttccacc 960 gggtttcaac attacttctc agttttcaaa tccagcattg acgaacaatc ccctattttt 1020 caaccttcca aatagaagct cacaaaacag aaaatcgctt ccggtttctc aatggccgat 1080 caagtacgca ggtaatgaca atggaatcgg tttgaatctt ttccttcgaa gagttgaatt 1140 ctttgcacaa tctgaaaaga tgtcaaaaga agaattgttt gaatcagccc attttctttt 1200 agtagggcct gctcaagact ggtttgttgc gaaatggccg attttacgca acaagaactg 1260 ggattatttt atccaagcct tgagacacca gtttttgcca agcaacattg accactacat 1320 caaagtgcgt tcattttcaa tgtttcaggc gaaagcagaa ttattttcga actttttggt 1380 tcgcatggaa cagttctttt tgtgcagaac cactccaatg tctgaagaag acaaattcga 1440 cataatgtgg cacacgatga ggcccatcta ccgagaccga ttggccttag tggacgtgaa 1500 agatatacac actcttgaac aactttgcac tagaattgat aacaataatg aatcaatcat 1560 gaataagttt gttctctcat tcgaaaatca aaaaattaat gaaatcaatt ttaatagcaa 1620 tcctagaact aatcttcatt ccattgtcgg accacaatcg aacaatagta atcaaaccaa 1680 tcaacaacga gttcaacaac ataatcaaaa taatagtaga aataaccaaa gcaactcaaa 1740 tcaaaatttc aacccgcaag gcaatagacc aaggaacaat tctaattata acaataatca 1800 aattattcag caagaatacc aaccagattg tggatggagg gagttgtcgt tggataatat 1860 tctgaggcac tataaagttc cagatcgcag aatttgtttt aattgtagaa aatttggtca 1920 tcatttcact aaatgtttct caaggcgcaa tgtattttgc tgcatatgtg gactcccgga 1980 atttcattac gaggaatgtc cattctgtga ggcaaaaaac cagagaaggg gagattaaag 2040 gaggtggttt ccccaacctc agaaattcct ccagatttga gtcgtaaccc tcaaatcata 2100 aatgcatttc aacaaatcgt ttcagaaatc caggatacag attgtaccat tgagaaacac 2160 gaaactgatg atataacaaa ttatgttggt tattttcatc aaattgaaac agtttttgtt 2220 aatttaaata acgatgcaag atattttttg aaatttagtg ttttaggatt gcagctgcat 2280 ggattattgg atagtgggag taatgtgtca ttagttggag aacactttgg aaatttaacc 2340 aactttttac aaattcaaga tctgaacaaa aacgttaaaa tctcaactgc cagtggtgaa 2400 ccaatgcaga tcaaaggatt tgttcacgtt ccatttacaa tcaagcaaac cactagaatt 2460 ttaccttgtt tagttgtgcc tgaacttggt gctcgatgca ttctagggat ggactttttc 2520 aaattattta atattaagct taccttcgaa gatgttgaag tgtttgatac gttggctaac 2580 tataatgtgg aattaaacaa tgaaattcag ccagcctctc atctccttga tgcagaagag 2640 actgaattac tagaatccgt taaaagtact ttcaaaatat cagagcctgg cactctagaa 2700 gcttgcaatg ttatgaagca tagcattgaa cttacagata aaaatgccat tcatttgaat 2760 cctcatccat ggtcgcccac gattcaaaag aaagtcaatg aagaaataaa tcgattattg 2820 gacttagaca taattgaacc ctcaaactca aattgggcct tgcaagttgt accagttacc 2880 aaagaaaatg gtgatatgag gctttgttta gatgctcgca agttaaatgc taaaactgtt 2940 agagacgcat atcctttggc aagttccatg agaatcttga acaacttagg caaaaatcgc 3000 tacttttcgg tgctggactt gaaagaatcg tttcttcagg taaagcttgc cgaagactcc 3060 aagaaatata caagctttaa aataattggc aggggtcttt ttcaatacaa aagacttccc 3120 tttggtttaa tcaacagcag tgctaccatg tctcgtatcc ttgacagagt tttgaaagaa 3180 gggctctacg aacctttcat attttcctat cttgatgata ttataattgc aaccagaaca 3240 ttcaaggatc acataaaata tctaaaaatt gtcgctgatt gcttacgaga agctaacctc 3300 tctgtgaatt tggctaagtg taaattctgc ctgaaaagaa taaaatattt aggttttatt 3360 ctttcagaga atggctacca gccgaaccct gaaagagtta ctgcgatctc caagtttgct 3420 cgaccgcaaa ctccaaagga gattcgcaga ttcttaggga tggcgggata ctaccgcaac 3480 ttcatcccga attttagcgg tatatcagcc cctatctctg acttgttaaa aggcaagccc 3540 aagaagatta tgtggaatga taaagcagaa caggcattta tcaaactaaa agaatgtttg 3600 atatcggaac cggtcttagt caacccagac tgggcaaaag aatttactat tcaaaccgat 3660 gccagtgatg tggcaatagc tggcatactg acgcaggagt tggatggtaa agagcacatt 3720 attgcctatt tttcacggaa gctgagctct tgtgaaaaga agtatgggcc caccgacaag 3780 gaaggtttag ctgcactcga agcaatcgag catttcagaa gttatgtgga gggtgcacac 3840 tttacgctgg taacggactg ctctgcagtg acgttcatct gcaattcgaa gtggcgtcct 3900 agttcacgat tgtcacgatg gtcagttaaa ttacaagaat tcgacatgac catcaagcat 3960 agaaagggac gggacaatgt agtttgcgat gcccttagtc gtgctgtttg tgccatcttt 4020 gatgactcga cttcaaaatg gtacaatgag ttgaaaagta aagtgagcac tacaccagaa 4080 aaataccaga attttaaaat tgaaaagggc gacttgtaca agtttgtcac ttccagatct 4140 tctcttgaca tgggcagact ggaatggaaa gtggttgtgc cacctgacaa aatgacacaa 4200 ctagtggatg aggaacatgc caaactaatg catcttggag cagataaaac ccttgagcga 4260 ataaaattaa aatattattg gcctaagatg aaatctgatg tcaaaaggat tttatctaag 4320 tgtggcattt gtaagcaatc taagcattcg acgattgcta ctgttcctcc aatgggggag 4380 caaaaggatg ctacgcgtcc atttcaaatg attgctatgg attatattag tggttttgtg 4440 cgtagtaagt ctggaaatag tgatttgtta gtttgtctgg atgtgttcac taaatatgtt 4500 cgtctgttcc ctgttaaaaa gattagtgtt gaaagtttga ctcggttgat taagtgtgat 4560 tggtttttga aatttggggc tccacaagtg ttaatttcag ataacgctgt aacgttttta 4620 gctaataagt ttcaagattt gttgtctaag tttcatgttc atcattttaa aaatagcagg 4680 agacattgtc aaaacaatcc tgttgagcga gtaaacagag tcattctggc gtgtattcga 4740 acatattgcc aaaatgatca ccgtatttgg gactctcgga tccctgaaat agaattcgcg 4800 atcaataata ccaaacattc atcgacggga ttcacacctt ttttcctggt gcatggctat 4860 gagtcgatag ttgatggaag ggaccatctc caggataggt acacttctga tccatcatca 4920 gatcaattca tccaacgcag gactgaagtt atgggcaccg tgtatgaaga ggtagttaaa 4980 aacaataaaa aacagtttga aaagtataaa aagaactatg acagcaagca caaaggtctg 5040 ccacccactt ttaccattgg tcaaaaagtt tacaagaaga atttcaaaat ttcaagtgct 5100 gtagatcact attcagccaa acttggcccc gtttatgtcc cttgtaaaat tatagccaga 5160 agaggagcaa catcatatga gctagaagaa ggaaggaatt taggtgtgtt tgcggcacaa 5220 gatttgattc ctgaatagat tgtatgagag tatgtcttac tgaaattagt atgttagatc 5280 gtgtaatgtc agagaaatcg ttgaatagtt tgtacaaatg aaatcgcgtg cgtttgaatg 5340 attgagaaat attggtaaag gatgctcgag gaattggtga tcaccgagag tttgcgtagt 5400 ttaagcataa aaagttcgtg tatgtgtaga agtttgaatt gaaagaatat ttgcgtagaa 5460 taaaccgaga gttcgcgtag cctaagcata gaaagttcgt atttgagagt ttgagtcaat 5520 aagatatttg cgtaagcgta aacgagattt cgcgtagatt taagatagtg tttgaggaat 5580 gttaatttga gttgatctgt tatttgcgta aatgttactg agagtatatg tttgagtgaa 5640 actcaattag attgtatgac cgtgtttgaa tttgttgtgt tataatgtat tgatagggaa 5700 aatgtgtgga tagttataat caatgaatgt tagagttgct tgcacgattt gtattcttaa 5760 taattttctt tgcccaatgt ttgaagctcc tcagctaagc cagaaccacc ttcagttatc 5820 tgtgcaaaac aaaattaatt ggaatgtaat caaactctat gtacacacag cagttcatcg 5880 cgaatgatcg ttgatctgtt cttggaccaa gtaagtccct gttcttaaag gacaagttga 5940 gctgaaatcg agccagtaaa aataagcata tctatataca gcatatatat ctagtaaaca 6000 agcatccttt cacatatgag agttgaaaat tttgcccaga caagaatgat cttcgtgttg 6060 gaaaacagat aaaacttgac gacaccttac actactttac tatgtcgata tttgtcgatc 6120 gtatgagtgg atttgctcat ggtaagaaaa gaccactgct atacaatgat tcaatccttg 6180 tattgcagtc tggtgtgggg gtat 6204 // ID Perere_Smed repbase; DNA; INV; 2764 BP. XX AC . XX DT 07-DEC-2006 (Rel. 11.11, Created) DT 07-DEC-2006 (Rel. 11.11, Last updated, Version 1) XX DE Perere_Smed is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW retrotransposon; Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Perere_Smed. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2764 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Poseidon_Hyd is a Penelope-like element (PLE) from the planarian, CC Schmidtea mediterranea. It belongs to the Poseidon group of PLEs, CC and is likely a member of the Perere clade, which has Perere10 CC from Schistosoma mansoni (BN000801) as a founding member. Its a CC single ORF contains regions homologous to reverse transcriptases CC and to GIY-YIG endonucleases. Consensus sequence was assembled CC from GenBank trace archives. The element is likely to have active CC copies, as most sequences are 99% identical. Many copies appear CC to be present in a tandem arrangement. XX FH Key Location/Qualifiers FT CDS 1..2523 FT /product="Perere_Smed_1p" FT /translation="MIAVLSKYIVRKVFYWRYRLIKSLPNLFNYLFINHAA FT AVPIFKRFMKVYNSLKRKELDISFLETCLEEKMIPSFIYKMKLSDCLSSKD FT VQLIHRRTLRRMLESEKIRRTSIDKKLTNIGLMLNNATNEFCLQVCVEWCI FT KNSIVQIKFKENAHIKKLINLRKIFGISKFSVENTVINLSKIVLNDDEERV FT LKFGPNHCIKNRLDETKLCAKIESLYFHIARSMKGINGNYVKSSLSASTME FT YIDKNKKQNIRDFMTLKNLAKKNINICKFDKGNGIVILDHDVYVNKLNDLL FT NSPQFVKLPKGRKNGKLPPLKDEEFVQKMLNGFIEKKLIDEEVGKSIRPIG FT SQPAKLYGLPKIHKDGVPMRPVLSMVGTAQYKIAKFLDGLLKPLMRSEFEC FT KDSFEFVSCIAKLQKRSMNDVMVSFDVCSLFTNVPLVETIDLCCALWNEND FT SEHHILDRRAFRKLLEFATSNVNFLFNDEWYQQIDGVAMGSPLAPTMASIF FT LASLEKKIASFQFEKPFVYKRYVDDIFLIFENQKHVEPFLQFMNSLHKNIV FT FTCETERKSSIAFLDLLIQRKDKQYETEIYRKPTDTGLYTSPESFCEFKYK FT RNMVKGLIYRSWALSSTFANSVKSVDKLVELLIKNGYSKSFLLLMIKETVD FT KLIKNPKDPCCGKSECNLSDGCELSVDLGDCKNGYKKIEPKYVLVLPYSEG FT FTNYKRKLGKLIGNLEYKIVSNSCKVRNMFINKSKTPVGLCSDLVYQFTCN FT GCNATYIGETSRHLCTRVLEHCRLKGLTNISEHNRGCKSDIGMSDFRILLR FT SFNSYWERVICEALLIRSLDPKINVQSAVTTNVLNVFK*" XX SQ Sequence 2764 BP; 970 A; 287 C; 542 G; 965 T; 0 other; atgatagctg tgctatcgaa atatatcgta agaaaggttt tttattggcg ttatcgttta 60 attaaatcgt tgccaaattt atttaattat ttatttataa atcatgctgc agctgttcca 120 atattcaaaa ggtttatgaa ggtttataac tccctcaaaa gaaaggaact agacattagt 180 tttcttgaga cttgtttaga agaaaagatg attccttcgt tcatttataa aatgaaacta 240 agtgattgtc tttctagtaa agatgtccaa ttaatacata gaagaacgtt aagaagaatg 300 ttggaatcag agaaaataag gagaacttcc attgataaaa aactaacaaa tattggatta 360 atgttaaata atgctacgaa tgagttttgt ttacaggtat gtgtagaatg gtgtatcaaa 420 aattcgattg tgcaaataaa atttaaagag aatgctcata ttaaaaaatt gattaatttg 480 agaaaaatat ttgggatttc taaatttagt gtggagaata ctgtgataaa tttgagtaaa 540 atagtgttga atgatgatga agagcgtgtt ttaaaatttg gtccaaatca ctgtattaag 600 aatcgcttgg atgaaacaaa attgtgtgca aaaattgaaa gtttatattt tcatatagcc 660 cgatcgatga aaggcattaa tggtaattat gtgaagagct cgttaagtgc aagtacgatg 720 gagtatatcg ataaaaacaa aaagcagaat attagggatt ttatgacttt aaaaaactta 780 gctaaaaaaa atataaatat ctgtaagttt gacaaaggaa atgggatagt cattcttgac 840 catgatgtgt atgtaaataa gttgaatgat ttgttaaatt ctcctcagtt tgtgaaattg 900 ccaaagggtc ggaagaatgg aaaattacct cctcttaagg atgaagagtt tgtgcaaaaa 960 atgttaaatg gttttattga gaagaagctt attgatgaag aggtaggtaa gtctattcgt 1020 ccaataggat cgcagccagc taaactatat gggctaccaa aaatccataa agatggtgta 1080 cctatgagac cagttttgtc gatggttggt actgcccagt ataaaatagc aaagttttta 1140 gatggattat taaaaccatt aatgcgatca gagtttgaat gtaaggatag ctttgagttt 1200 gtttcttgta ttgcaaagct acaaaaacga agtatgaatg atgttatggt ttcgtttgat 1260 gtttgtagtt tgtttacgaa tgtgccactg gttgagacaa ttgacttatg ctgtgcgttg 1320 tggaatgaaa atgattcgga acatcacatt ttggatagaa gggcatttcg aaaacttctt 1380 gaatttgcta cttcaaatgt caattttctt ttcaatgatg aatggtatca acaaatagat 1440 ggagtcgcta tgggttcccc tttagcacca acgatggcat caatattttt ggctagtttg 1500 gaaaagaaaa ttgcttcttt ccaatttgaa aaaccttttg tttataaaag gtatgtggat 1560 gatatttttc ttatatttga aaaccagaaa catgtagaac cgtttttgca gtttatgaac 1620 agtttacata aaaacattgt attcacatgt gaaacggaga gaaagtcatc aattgcgttt 1680 ttggatttgt taattcaacg caaagataaa cagtatgaga cagagattta tagaaaacca 1740 actgatactg gtctttatac ttctccagaa agcttttgtg aatttaagta taagagaaat 1800 atggtgaaag gattgattta tcgttcatgg gctttatctt ctacttttgc taattcggta 1860 aaaagtgttg ataaattggt tgagttgttg ataaagaatg gatattctaa atcattttta 1920 ttgttgatga taaaagaaac tgttgataaa ttgattaaaa atcctaagga tccatgttgt 1980 ggaaagtctg aatgtaattt gtcggatggg tgtgaattga gtgttgatct gggtgattgt 2040 aaaaatggat ataaaaaaat tgagcctaaa tatgtgttag tattgccgta ctctgaaggc 2100 tttacaaatt ataagaggaa gttgggtaaa ctgataggaa atttagagta taagatcgtc 2160 tctaattcgt gtaaggtacg caatatgttt attaataaat caaaaactcc ggttggattg 2220 tgttctgatc tcgtttatca atttacatgt aatgggtgta atgccacata cattggagag 2280 acttcccgtc atctctgtac tcgagttcta gagcattgtc gattgaaggg tttgaccaat 2340 attagtgaac acaatagagg ctgtaaaagt gatataggaa tgtctgattt tagaattttg 2400 ttgagaagtt ttaatagtta ttgggaacga gtgatatgtg aagcattgct gataaggtca 2460 ttggatccaa agattaatgt gcagagtgct gtgactacga atgttttgaa tgtttttaaa 2520 tagtttacgt atgttaaatc atgtttgttt ttttaatatt attaaaataa ttgaaaattt 2580 taaattacat taatttttta aaaattagaa tataagtgaa aattttaaaa ttttaaatta 2640 attaaaagtg aattgaatat attttataaa tatttattta ttaaaaaatt aaattaataa 2700 aaaataattt gtttaaatta aatgttaatt cataatattt ttagttggtt ttgagtacct 2760 gaag 2764 // ID BEL-125_AA-LTR repbase; DNA; INV; 229 BP. XX AC AAGE02023612; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-125_AA_; KW BEL-125_AA-I; BEL-125_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-229 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023612; Positions 260595 260367. XX SQ Sequence 229 BP; 64 A; 59 C; 38 G; 68 T; 0 other; tgttccggtg cgacttccca atgctacaca tcacacacga cagcgcaaat accttgcgca 60 aacgtcacca ccggacggtt attttttgtt tattctctct ccctacaaat cacaacctaa 120 caacaagaga agaaaaaaag ttattaataa aagtttgatt ttttcggagt aaacacgcgg 180 ttttttcctc ttctccgggt tccgtcggga tttatcgtct atccgtaca 229 // ID BEL-205_AA-I repbase; DNA; INV; 2392 BP. XX AC supercont1.2; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-205_AA_; KW BEL-205_AA-LTR; BEL-205_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2392 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2; Positions 2164069 2161678. XX CC 'GAGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 24..2063 FT /product="BEL-205_AA-I_1p" FT /translation="MGIKSQKVNQVGSCVEISRPNTNDNEWLINHSCTVVK FT KATSKNNSAHIALRLKQLDEELAIQQREWDAERRAFEHDKRVLQIKYDLLQ FT KHVAERTKASKGKKNSGVGEQNIAYNNTEGTLQSTNRIPPSEAFVNSSRKQ FT QPVWKQTDLTRLNLPDDQLDEIQQRDISAFLYTAPNNPNLIANRSEEMNEH FT NFTPIAHSTTTIGMPVRGKQTTNNVCNDQWSKKRSSVPANIESLYKLTLLA FT DDVAALAKSERNELSLAPSVQHSTSKDCRSPSTFVAPIVPEPAPPPAQGDA FT AFPFIDGFLDRSMFRSENITSISEELVKSLLLSSALQSAPTPKYSKAITKI FT VNAGVQYFLGHVKAVEHLIQSQNMLMVMEFVTKLSIIVKMDLLSNKKTSNA FT VHYRVLYFTELTKPRTAVWLNSTPNGNVMSLTLSQKKCAHVSCCLQAEMCL FT VRCRQKKPVLPFDCDGGETSLMAVQWCENRGREQMKGQDDYWFTGIFVRRE FT KSLGVCYSCPCWLEGSMKENGITNAVRMPQSTKIIAKMRTGMKTTTSITVP FT RICVDGGILRVLMRGLSVVPGLCIEILSPDILVACRISIKPPISIGATYVG FT IAVPLPAPVVSTNTADRDNGIDVEMLNKKDECPTPKLSSIILMIVSVQSVS FT TIIPCKLVIPIQNEKAVRKRSRSGSRSFERL" XX SQ Sequence 2392 BP; 765 A; 474 C; 564 G; 589 T; 0 other; taacttaaag ataatttgat gcgatgggaa tcaaatcgca gaaggtaaac caagtgggaa 60 gctgcgtcga aatctcgcgc cccaatacga acgacaacga gtggttaatc aaccacagtt 120 gcacagtagt caaaaaagca acatccaaga ataactctgc ccatattgcc ctgcgattaa 180 agcagctaga tgaagaactc gctatccagc agcgcgaatg ggatgcagaa agaagagctt 240 ttgagcatga taagagagtt ttgcagataa aatacgattt gctccaaaaa cacgtcgcgg 300 agcgaacaaa agcttctaaa ggtaagaaga actctggcgt tggagagcag aatatcgcat 360 acaataacac cgaaggcacc ttacagtcaa caaataggat ccctcctagt gaagcatttg 420 taaacagttc gcggaaacaa cagccagtat ggaaacaaac agatctaacg agattgaact 480 tgccagatga tcaactggat gagattcagc agcgagacat cagcgcgttt ctctatacag 540 caccgaataa tccaaatttg atcgctaatc gaagtgaaga gatgaacgaa cacaacttta 600 ctcctattgc gcattccact acgacaatag gtatgcccgt gcgaggtaaa caaacaacca 660 acaatgtatg caacgaccaa tggtcgaaaa aacgatccag cgtgccagct aatattgaga 720 gcctatacaa gcttactctt ctagcagatg acgttgctgc tttggctaaa tcagaacgga 780 acgaactaag tttagcccca agcgtacagc acagtacaag taaagattgt cgttcgcctt 840 ccacctttgt cgctcctata gtgcctgaac ctgcgccgcc ccctgcacag ggtgatgcag 900 catttccctt catcgatgga tttttggatc ggtcaatgtt tcgcagtgag aatatcacca 960 gtatatcgga agagctcgta aagtcgttat tattgtcgag tgcactacag tcagccccaa 1020 cacccaaata cagcaaggcg attaccaaga ttgtgaatgc tggcgttcaa tatttcctcg 1080 gacacgttaa agcggtggag catctgatcc agtcacaaaa catgttaatg gtgatggagt 1140 ttgtgaccaa attgtcgatc atagtgaaaa tggatttatt atcaaacaag aaaacgtcca 1200 acgcggtcca ttatcgtgta ctgtacttca cggagttaac gaaaccgagg acagcagtgt 1260 ggttgaattc tacccctaac ggcaatgtga tgtctttgac gttgagtcag aagaagtgtg 1320 ctcacgtgag ctgttgtctt caagctgaaa tgtgtctcgt acgatgtaga caaaagaagc 1380 cagtattacc atttgattgt gatggaggtg aaaccagcct gatggctgtg cagtggtgtg 1440 agaaccgagg cagagaacag atgaagggac aagacgatta ttggtttacc gggatattcg 1500 taaggagaga aaagagtctt ggcgtttgtt attcgtgccc gtgctggctg gaaggtagta 1560 tgaaggagaa tggaattacg aatgccgtga ggatgccaca atcgactaaa ataattgcca 1620 agatgaggac cggaatgaaa acaacaacaa gtattacggt gccccgaata tgtgtagatg 1680 gtggcattct acgagtgctc atgagaggac tttcagtcgt acccggtttg tgcattgaga 1740 ttttgtcacc ggacatcttg gtagcctgtc gcataagtat taaacctcca atttcaatcg 1800 gtgcaacata tgttggtata gcggtgccat tgcccgcccc agtggtaagt acaaatactg 1860 ccgatagaga taacggaatt gatgtagaga tgttaaacaa aaaagacgaa tgtccaaccc 1920 ctaaattatc atcgattata ttgatgatcg tttccgttca atctgtctcg acgataatcc 1980 catgcaaatt agtgattcca attcaaaatg agaaagctgt taggaaacga agtcggtcag 2040 gttctcgatc atttgaaaga ttatgaagac gttcaacagg agaaaaatat tggtttaaat 2100 cacagagttg tggaacagta atatccggag caatataagc aacctttgag gtactgatcg 2160 gttacgattt tcatcgcgtc ggcaatactg gtactccacg tttaagcaac aagattgcat 2220 tagtcattca ggttctgact gttataacag ttgaaaaggt tgagttagaa ctagataaaa 2280 aagtagaagg attctatcgg tcgctataag agattttttt tgcattagtt cagtagaagg 2340 atagcaatac ggataatagt aaaactggtg gccagtgtta cgggggggag aa 2392 // ID Copia-23_DPu-I repbase; DNA; INV; 4272 BP. XX AC scaffold_34; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: internal portion. XX KW LTR Retrotransposon; Transposable Element; Copia-23_DPu-I. XX NM Copia-23_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 709-709 (2010). XX DR Genome; scaffold_34; Positions 1007468 1011739. XX CC Positions [1635-2165] - Integrase core CC 'GTTCA' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 765..4262 FT /product="Copia-23_DPu-I_1p" FT /translation="MEHSLQAKKKNKVVFSESKSRDFSSHGRPKPICGHCR FT NLNRRASHREEDCWIKEAYLKGRQDAGSEEAMLAQPRKEKTPAPVFDDDYA FT FKSADLQLNNECWYADSGASEHMSDQKSIFQSITPINRGDRAIKGVGKNNE FT ALYATGVGTVIIKTKVNGEWHDGLLRNVLFVPDLGANLFSIGATTERGATA FT TFDKNGMELWKNGKIVATGSRIQKKLYLMHFTNCSSPSVTAALVANRVPNS FT LQLWHERLGHANHSVVHHMHSSNLVDGIKMEKSCSPPPFCEGCVFGKHHRL FT PFPTSGRTRATAIGGLIHSDLCGPMPTPSPNGSLYFLTFRDDYSGYGFIRF FT LKRKSEANEHIQDLIARFETETGRSIVIFRSDNGGEFMSKDLIQWLAKRGI FT IHQTSTPKTPEQNGVAERYNRTILESMKSMLHSSTLPVKLWAEASATAVYL FT LNRVPCKAVPTMTPYQAWHGRKPNVSHLHVFGCDAYAHIPKDERTKLDPKG FT VKCNFVGYSETQKAFRLWDSSSGKIKISRDVIFNESIQAPSLLPVAVPRTD FT LETVSVIGVGGDACEASNGRATVSNDQRGQPASDSDIIAEPFHGFEQPEKP FT ILGDHHLEPSVDSPQRPSRIRKKPNRLIEDPNFLSHTDMNVAPDEIFEPQT FT YQEAISCPESAKWISAMEEEMESLRSNETWQLQPLPEGRRQIRNKWVYTVK FT YDSANRPTRFKARLVAKGFSQKEGIDFNETFAPVVRHESVRVILSTAAAHD FT LEIIQLDVKTAFLHGDLHEEIFMDQPSGFISSDSPNHACRLLKSLYGLKQA FT YRSWNIKFDGYLCALGFVRSLADPCVYRRSEEDGIVILAIWVDDGLLCGPD FT KTKLLELISQLSTHLDITARDADFFIGLKIDRDRTRRVIHLSQEQYTNRIL FT RRFEMLECNPKMQPADPYVRLTINGIEGGPNSPTMDDSIYREAVGSLMFLM FT VCTRPDIAFSVGQVAQFSHNPKQAHWTAVQRVLAYLKGTSKCGITYGETSH FT PHVLTAFSDSDYAGDSDTRRSTTGYLLIYNGGPVAWGSRRQSCVSLSTTEA FT EYIAMCEAAKDIVWTRRLLSGIGCDQKQPTELFCDNQGALKLVSNPEFHRR FT TKHIDVRYHYVREQQMDGSIVISHVGTKEQLADLLTKALAGPAFQDLRRKI FT SVHPILV" XX SQ Sequence 4272 BP; 1251 A; 987 C; 951 G; 1083 T; 0 other; ggttatgggc ccaggttcgc aattgctacc tacttactgt aataactgca aaataggcta 60 aaagtactca cgttgtttct agaacagagc caagatggcc actggaactt tctctgcaaa 120 agatgtgtcg cacattacga aattcaaggg agaccaattt agcttctata agtttcaact 180 taaacttgta ctgaagaatc atgccctact caagattgtt gaaggtgatg aacaaaaacc 240 cgcggccatt gttctccttg ctgataattc aaacaacgct gcagtcatcg ccagaaatac 300 tcagattgat gaatgggaca agaaagacac tgctgcacaa aactacattg ttgctactct 360 ggaagagaag gtaatgagaa ccattatgaa ttgtaagact tcaaattcga tgtggattcg 420 tctgttaaat caatatgagc tggcttctgt tgaaaacaaa catctactaa tgggcaagtt 480 tatggcatat caatatgacc ctacccacga cataatgtct catgtttcag cagttgaatc 540 tcttgcatca caactaagtg atgtgaattc accaatctca aacgatcaaa tcatagccaa 600 gatcacatct actctgccac taaatggcaa tcgcaactat aggtccttca tgtctgcatg 660 gaacagtact gatgatgccg tcaagacact ttctctacta acgtccagac ttcaagttga 720 agaaaacatg cttaagctaa ctgaaatgag tgtcgactct tcagatggag cattctttgc 780 aggcaaaaaa gaagaataaa gtcgttttct ccgagtcaaa atcaagagat ttttcttcac 840 atggtcgtcc taaacctatc tgtggtcact gccgaaatct caaccgtaga gcatctcatc 900 gtgaagaaga ttgttggata aaagaggcct atctcaaagg aagacaagat gctggaagtg 960 aagaagcaat gctggctcaa cctagaaaag agaaaacacc tgcacctgtt tttgatgatg 1020 attatgcctt caagtcagca gatctacaac tcaacaacga atgctggtat gcagattcgg 1080 gtgccagtga gcatatgagt gatcagaagt ccatctttca gtcaatcacc cccatcaatc 1140 gtggtgaccg agcaatcaaa ggcgtgggaa agaacaacga agcattatat gcaactggag 1200 ttggaactgt catcatcaag accaaagtaa atggcgaatg gcatgatggt ttgttacgta 1260 acgtcttgtt tgtcccagat ctaggagcca acctgttctc aattggagcc acaacggagc 1320 gtggagccac agctacgttc gacaaaaacg gcatggagct ctggaaaaat ggaaaaatag 1380 tcgctactgg ctcgaggatt cagaaaaagc tctatttgat gcatttcacc aactgttcgt 1440 cgccaagcgt cactgcagca ctagtagcca atcgtgttcc taactccttg caactttggc 1500 acgaacgctt gggccatgcg aaccactccg ttgtgcatca tatgcattcg tctaacttag 1560 ttgatggcat caaaatggaa aaatcatgta gtccaccacc tttctgtgag ggttgcgtgt 1620 ttgggaaaca ccaccgactc cccttcccta ccagtggacg tactcgagcg acagcaattg 1680 gcggactcat acacagtgat ctgtgtggac ctatgcctac accatctcca aatggatcgc 1740 tctatttctt gactttccgg gacgactatt ctggttatgg cttcattcgt tttctgaaac 1800 gaaaatcgga agccaacgag cacattcaag atctaatagc acgttttgaa actgaaacag 1860 gacgatcgat tgtcattttt cgttctgaca acggtggtga gtttatgagc aaggatttga 1920 tccaatggct ggctaaacgt ggaatcattc atcaaaccag cacaccaaaa actcctgaac 1980 agaatggagt agctgaacgt tacaaccgta ccatcttgga atctatgaag agtatgctgc 2040 actcgtcgac actgcccgta aaactctggg ctgaagcttc agcaaccgct gtgtacttgc 2100 tgaatcgagt tccctgcaaa gccgtcccga cgatgacacc ttaccaagcg tggcacggaa 2160 ggaaaccgaa tgtctcacat cttcatgttt ttggctgtga tgcatatgcc cacatcccaa 2220 aggatgagcg cacaaagcta gatccaaaag gtgttaaatg taatttcgtt gggtactcag 2280 agacacaaaa ggctttccga ctgtgggatt catcatctgg aaaaattaaa atctcaagag 2340 atgtcatctt caatgaaagt attcaagcac cttcactact tccggtagct gtccctcgca 2400 ctgacctaga aactgtctca gtaattggag tgggtgggga tgcctgcgag gcatccaacg 2460 gacgtgccac cgtgtccaac gatcagcgtg gtcaacctgc aagtgactct gacatcattg 2520 cggaaccctt ccatggattt gaacagccag aaaagccgat tcttggagat caccatctcg 2580 aacccagtgt ggactcacct caaagaccgt ctcgcattcg gaagaagcca aatcggctta 2640 ttgaagatcc aaactttctg agccacactg atatgaacgt tgcccctgac gaaatttttg 2700 aaccacagac gtaccaggag gcgatctcat gcccagaatc tgcaaaatgg atttcagcga 2760 tggaggagga gatggaatcg ctacgatcaa acgagacatg gcagcttcaa ccactacctg 2820 aaggtcgccg ccaaataagg aacaagtggg tttacacagt aaagtatgat tctgctaatc 2880 ggccaactag attcaaagcc cgcctcgtag caaaaggttt ttcacaaaag gaaggaattg 2940 attttaatga aacatttgcc ccagttgtgc gtcatgagtc ggtcagagtt atactatcaa 3000 ctgcagcagc ccatgacctc gagataattc aattggacgt caaaactgct ttcttgcatg 3060 gagacctcca tgaagaaatt ttcatggatc agccatcagg cttcatctct agtgactctc 3120 ccaatcatgc atgccgtctc ctaaaaagtc tctatggcct taaacaagcc tatcggtcgt 3180 ggaatatcaa gtttgatggc tatctgtgtg ctcttggatt tgttcgcagt ttggctgatc 3240 cgtgtgttta tcgtcgaagt gaggaggatg gcattgtcat cttagcaatc tgggtggatg 3300 acgggcttct ctgcggacca gacaaaacca aattattgga acttatcagt caactatcaa 3360 cacacctaga catcacagca cgagacgcag acttctttat tggcctaaaa attgacaggg 3420 atcgtacccg ccgtgtcatt catctttctc aagagcaata caccaatcgg attcttcgcc 3480 gtttcgagat gttggagtgc aacccgaaaa tgcagcctgc ggatccatat gtgagattaa 3540 ctatcaatgg aattgaaggt ggacccaatt caccaaccat ggatgactcc atataccggg 3600 aagctgtcgg cagtttaatg tttctcatgg tgtgtacgcg cccggacatc gctttctcag 3660 tcggacaggt agcacaattc agccacaacc cgaaacaggc tcattggact gcagttcaga 3720 gagtcctagc atatttaaag ggaacatcta agtgtggaat aacttacggg gaaacaagcc 3780 acccccatgt tctcactgct ttttcagatt ccgactatgc tggggactcc gacactcgac 3840 gatcaactac tggatatttg ttaatctaca atggaggtcc ggtagcttgg ggaagcaggc 3900 gtcagtcttg tgtgtcactc tcgactacag aagctgagta catagctatg tgcgaagccg 3960 caaaagacat cgtttggact cgccgactgt tgagtggaat tggatgcgac cagaagcagc 4020 caacggaatt gttttgcgac aatcaaggag cactcaaatt ggtgtccaat cctgaatttc 4080 atcgccgcac gaaacatatt gacgtacgtt atcattatgt tcgtgagcaa caaatggatg 4140 gaagtatcgt gattagtcac gtgggaacca aagaacaact tgcggatctc ttaacgaagg 4200 cgcttgctgg tccagctttc caggacttac gaaggaagat atcagttcac ccaattttgg 4260 tttgagtggg ga 4272 // ID Kolobok-14_HM repbase; DNA; INV; 2755 BP. XX AC . XX DT 14-JAN-2009 (Rel. 14.02, Created) DT 14-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2755 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 423-423 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 373..2214 FT /product="Kolobok-14_HM_1p" FT /translation="MAKNKSFKNRKVKRVFQGNRYTKKTVEVFLQQPDKDT FT NNDGINILPCASSTKLNISIDKADQPLTKHDDFFFFFHFPMFKDLICSIGS FT CGECNSKDIALEYLDEKAKGFSIQFCIKCATCEWTYFFNSSRAFSIPDRDT FT RGVKSQEINVRTVMAFREIGQGHEAMKIFASILNMPPPMSLFSYNEINSDL FT LKFYEAASCDSMKDAVTEIRKRDFPNAAENDIVDIQIGIDGSWQKRGHSSL FT NGVCTAVAKANRKVVDYQVFSKFCRGCALWQSKRNKSGYGKFLKTHVCDLN FT HFKSSGAMESAGALSFFTESLAKYNVRYSHYIGDGDTESYTNVVKAKPYGE FT DLVPVKIECVGHVQKRLGTRLRQMRCDLKGKKLEDGKIISGKGRLTDKIIN FT KMQNFYGMSIRQNTLEAWSGDRTIALYNMKKSVLAVLWHCSNISNSEERHQ FT FCPRTKESWCKYWQNDKNYKPSINLPLAIKSILKDTFLALRSDDLLSSCLD FT GATQNPNEAFNQIIWKKCPKNIFVTKKVLDLGVASAVINYNDGLRGFTRIF FT DYLQLLPGSFMIQGSWKKDVSRVKNTIKNSTPKQKRARRKRRCIRKGYIDK FT EKEKEGVDSYITGNF*" XX SQ Sequence 2755 BP; 965 A; 362 C; 502 G; 926 T; 0 other; ggtggttatc aggtaaaaaa atagcgattt ttgtgaaaaa atgtgttttt tagtttatca 60 gccaactttg tttaaaattt aattggcttt ttaaccatat ataacatgat atgggttaat 120 taataaaaaa atgtaaaaaa attgttttag ttggtaagag taaaaaaaat ttacttaaca 180 accgcatagc aaccaataat tttttataga tatattaccc aaagacctag atttaaacaa 240 ttatttcttt tcatgggtct gtataattaa ctaaactttt gaaccagagt tatttttagc 300 aaagccgtat gttttgatgt tatttgttaa gcagaaagaa aaaagtttaa agttgttaaa 360 aaaagtttta ttatggcaaa gaataaaagt tttaaaaata gaaaagttaa acgagttttt 420 caaggtaata gatatactaa gaaaacagtt gaggtttttt tgcaacagcc tgacaaagat 480 acaaacaatg atggaataaa tatattgcca tgtgctagtt caacaaagtt gaacatttcc 540 atagataaag ctgatcaacc tttaacaaaa cacgacgact ttttcttttt ttttcatttc 600 ccaatgttta aagatttaat ttgtagcatt ggatcttgtg gtgaatgcaa ttcaaaggat 660 attgctcttg aatatctcga tgaaaaagcc aagggatttt caatacaatt ttgtattaaa 720 tgtgcaactt gtgaatggac gtacttcttt aattcatctc gagcatttag tatacctgat 780 agagatacca gaggagtaaa atctcaagaa attaatgtta gaactgttat ggcctttaga 840 gaaattgggc aaggtcacga ggcaatgaaa atatttgcat ctattttgaa tatgccccca 900 ccaatgagtt tgttctcgta taatgaaata aactctgacc tattaaaatt ttatgaagct 960 gcttcttgtg atagcatgaa ggatgctgtg acagagataa gaaaaagaga ttttccaaat 1020 gctgctgaaa atgatattgt tgatatacaa attggaattg acggttcatg gcagaaacgt 1080 ggtcattcat ctctaaatgg tgtatgcact gctgttgcaa aagctaaccg taaagttgtc 1140 gattaccaag tattttcaaa gttttgtcgt ggctgtgctt tgtggcaaag taagagaaat 1200 aagagcggtt atggaaagtt tttgaagaca catgtttgcg accttaacca ctttaaatca 1260 tcaggtgcca tggaatcagc aggtgcttta tcatttttta cggaatcatt ggcaaagtac 1320 aatgttcgct attctcatta tataggagat ggagatactg aatcttatac aaatgttgta 1380 aaggcaaagc catatggaga agatcttgtc ccagtaaaaa tagagtgcgt cggtcatgtc 1440 caaaaacgcc taggcacccg gttacgtcaa atgcgatgtg acttaaaggg taaaaagtta 1500 gaagatggga aaattatatc agggaaagga aggttgacag ataagatcat taataaaatg 1560 caaaattttt atggcatgag cattaggcaa aataccttag aagcatggag tggagaccgt 1620 actatagcct tgtacaatat gaaaaaatct gttttggctg tattgtggca ttgctcgaat 1680 ataagcaaca gcgaagagag acatcaattt tgtccacgta caaaggaaag ctggtgcaag 1740 tattggcaga atgacaaaaa ttataaacca tcaattaatt tgcctcttgc tattaaaagt 1800 attcttaaag acacattctt ggcactacgg tcagatgatt tgttatcaag ttgcctggat 1860 ggcgcaacac aaaatccaaa tgaggctttc aaccagataa tctggaaaaa gtgtccaaaa 1920 aatatttttg ttactaaaaa ggtactagat cttggtgttg cttcagctgt aataaattac 1980 aatgatggtt tgagaggttt cacaagaatt tttgattatt tgcagctctt gcctggaagt 2040 tttatgatcc aaggatcgtg gaagaaggac gtatcaagag taaagaacac gattaaaaat 2100 tcaactccaa agcaaaaaag ggctagaagg aaaaggagat gtatcaggaa gggttatatt 2160 gataaggaga aagaaaaaga gggagtagac tcttatatta caggaaattt ttaattaata 2220 aattgttttt aaactttgaa gcttgatttc tcaattgttt gtttttgcct gactgcaata 2280 agcgaatcat tcatatcttt ttaaccagat atgcaattga gttcaaattt tcagggttgt 2340 ttctttatag atagaactgc aatagcttac tagaactaat cttaagcttg tttccatttg 2400 aatatattca tattttacct tttttttttg tatcaataat ttccgttaaa attgaaaaag 2460 ctgccatttt aaaaataata attattttta gtttttctag taagcttttg tttaatacac 2520 tataatctat cagtgttgta agttttggct tataatgtca taaagaggct gagaaaatgt 2580 tgttttcttt tttcattgat ttttttgcat tttatgctga ttcagcaaaa aaaaaaagga 2640 aattattttt tttttatcat ttttaattat attaataagt atgtttcttt taaactaact 2700 ttggtatatg attaagttgg caatcaaaaa ataatttttt tacctgataa ccacc 2755 // ID BEL-26_AA-I repbase; DNA; INV; 5875 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-26_AA_; KW BEL-26_AA-LTR; BEL-26_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5875 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 1555527 1561401. XX CC Positions [4905-5489] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 624..5861 FT /product="BEL-26_AA-I_1p" FT /translation="MMQSLEILGDRQALLTDKLLRMRETLREDISIHLLKL FT HVETLRRIADDFEKVYSEMAALLTKEQREAMHEEYAKFEKMHNEVYVILQT FT RIEQAQQQLKLEEFRNIPAPQTSSQAPVYVQAPAPHLQAPFPTFNGTLENW FT YSFKSLFQSIMARYTNETDAMKILHLRNSLIGEAKDKIDQEVVNNEDYASA FT WKILEDAYEDKRLIMDTHIDAILDCPKITKDNRGKSLSKLVEVCVKHTDAL FT NGHGFPVEGLAELIIVNVLYKKLDKETQELWETSLGSGELPDFEEFVDFLR FT ERGRVLQRTNRTQQPPAHQATAIPTKRQPGSQKPHAQVPSNSFVQVSKEMC FT PCCKEEHLIYRCSKFQGMSLQERKGFVAKSKLCFNCLKAKHRVINCPSEQG FT CKVQGCGRKHHSLLHCNDAPTVECPDKRQESEESSTASEVQPQQAPVQRSS FT ATALYTNISGAKHHVVLSTAQVLVTGKGNSLVKCRALLDTGSDSNIITEKL FT ADKLQLSMVPVDVPISGLNNNETRVKFLLATKFQSCTSSFASPKLDFLVVP FT QITSNLPAVRIDARSWTIPSGLRFAAPTFHSPSGIDMIIGNEVFFDLIKGG FT RVKLGTIGVTLAETQLGWIVAGSVPIKEEPSRRVCQLNRNEEMLNQTMSKF FT WELECVYSDKSSTVTEELVEEHFKATHYREDSGRYVVRLPFNGIKSQLGDS FT YDMARKRLNKLMLQLAKNPSKRGEYYKFLSEYLALGHMTEVAGSTCDGGYY FT IPHHAVYKASSSTTKTRVVFDASAKTTTDLSLNDTVLVGPTVQNDIVSIII FT RFCIHSVVLTADISKMYRQVRLHKDDCKYQRILWWNDDGQLKTYELQTVTY FT GVASSPYHATSVLVQLAIDEGEDFALAKEIITEDSYIDDFLTGGSSSEEVI FT QVYSELSELLRRGGFEVHKFCSNSNDVLGAIPEELQEQQVSFDDGGINNSV FT KTLGLIWNPQADYFMFRATVADWKEVPTKRKVLSEIGQLFDPLGFLGPVIV FT FAKLIMQDIWRLGLAWDEELPDDLQEKWHQFRRQLPALNQVQKPRCVIQSE FT AVTLELHGFSDASKRAYGAVVYMRSITADGKIYVNLVASKSRVAPLKPTTI FT PRLELCGAKLLAELVKKVITSMKIRLDAVRLWCDSQIVLCWLKKSPLALNQ FT FVANRVAAIVDLTQSYQWGYVRSEENPADAVSRGELPEDLLQKNQWWSGAN FT MLWEPQPILSEPELFDESMIPEVKSTVMMAAVRSDPPIVLNRLSSYKRIQR FT AWVYVHRYIDIVMNKKKEFGEITADEIRMAEKSILLLVQRESYGDVLKALE FT SKSVLRQPYRNLALFVDEDGLIRVGGRLKYSAIPYDGKHQVLLPQKHHITE FT TIVRGLHQEHFHVGQNGLLAIVREHYWPVHAKQIIKKVVSSCQVCARQRPV FT PGMQFMGNLPEVRVNPSPPFFKVGIDYAGPFMLKLGGRSTKLYKGYVVVFV FT CLIVKAIHFELVSSLSTDDFIAALQRFSSRRGVPSDIHSDNATTFVGANHE FT LAALKQLFEDQQHQLKVKEFCSAKGIRWHFIPPRSPHFGGIWEAGVKSMKY FT HLKRVVGETRLTFEEMSTFLAQTEAILNSRPLCPMSDDPSDYSVLTPSHFL FT IGRSGVALPVPSYGDEKLGRLDRYQHIQQMHQHFWSKWSREYLHHLQGRQK FT WNTNANSSFKVGALVLLVEENLPPQQWKRGRISAVHPGDDDVVRVVTVRTT FT NGDYKRAITKIAVMPSVETELSTGGV" XX SQ Sequence 5875 BP; 1617 A; 1323 C; 1514 G; 1421 T; 0 other; ttttttggtc cgttctcccg gatcatcgga ggaattttga atggtttccg gcgttcgcga 60 gtgattgatt agcaccgggt gctaattatt ttggcgcaag gccaaattac gcgttggtgg 120 tgcttcgcca ccaaaagtga gtgcccgtgg cacaaggaat tggacggtcg ccatcttgac 180 gaaaagctat tgccgaagca gattcggtag agtgcccgtg gcacgagaca tagtggtagc 240 catcttggcg aaaagttgtc gaggtggctt cgacaacagt gctcgtagca gaaaatatag 300 atcggtagcc attttggcga aaagcaaatg tcaaacggtt tgacattgcg cgtgtggatt 360 cgacaccgtc gtgttgatta atagtgcgaa aacgctaaca ttgaaagtgc gcgagactcg 420 tgtgtttgtg tagccaaaga aaaggctgaa gatcattcat gagtgaaagc gacacacccc 480 gtggaccgtc gagcaaagtt aagcgaagga atatattcga cgagctcgcc gggtcgcaga 540 atagtgaaga ggacgaaggg gatcgtatcc cgactattgc ccagcagaag gttgcacagt 600 aggagttgaa cacaagacgc aggatgatgc agagtttgga gattttaggc gatcggcagg 660 cgctactgac cgataagctg ctacgaatgc gcgaaacatt gcgagaggac atcagcattc 720 atctactcaa gctacacgtt gagacgttac gccgaatagc tgacgatttc gagaaagtct 780 attcggaaat ggcagccctt ttgacaaagg aacagcggga agcaatgcac gaggagtacg 840 caaaattcga gaaaatgcac aacgaagtgt acgtgatact acagacgcga atcgaacaag 900 cccagcagca gctgaaattg gaagagttcc ggaacatccc tgcaccccaa acatcctctc 960 aggctccggt gtatgtacag gcaccagctc cacatctgca ggctccgttt cctaccttta 1020 acggaaccct cgaaaactgg tacagcttca aaagcttgtt tcagagtatc atggctaggt 1080 acacaaacga aaccgatgcc atgaaaatcc ttcatctccg aaattcactt atcggtgagg 1140 ctaaagataa gatcgaccag gaagtggtta ataacgaaga ttatgcttcg gcgtggaaga 1200 tcctagaaga tgcttatgag gacaagcgtc tgattatgga cactcatatc gacgcaatct 1260 tggactgccc gaaaatcact aaggataacc gtggtaagtc cctatccaaa ctagttgagg 1320 tttgcgtgaa acacacggat gccttgaacg gccatggatt tcccgtcgaa ggattggcgg 1380 agctcataat agtgaatgta ctgtacaaaa aactggacaa ggaaacacag gagctatggg 1440 agacaagttt gggtagtgga gagctacccg attttgaaga attcgtcgat ttcctgcgtg 1500 aacgagggcg tgtactccag cgaacgaatc gaactcaaca gccaccagca caccaagcaa 1560 cggcgatccc aactaagcgg caaccgggta gtcagaagcc acatgcgcaa gtaccttcca 1620 attccttcgt gcaggtgtca aaggagatgt gcccatgctg caaagaggaa cacttgatct 1680 atcgttgttc gaagttccaa ggaatgtctc tgcaggaacg aaagggtttt gtagcaaaat 1740 caaagttgtg cttcaactgc ctgaaagcca agcaccgcgt catcaactgt ccatcggagc 1800 agggctgtaa ggtacaagga tgtggtcgca agcaccacag tctactgcac tgtaacgatg 1860 cacccactgt agaatgtcca gacaaacgtc aagagagcga agaatcatca acagcttccg 1920 aagtgcaacc gcagcaggct cctgttcaac gcagcagtgc aacagctctc tacactaata 1980 ttagcggagc taaacatcat gttgttcttt cgacagcaca agtgttggtt actggaaaag 2040 gcaactctct cgttaagtgc cgtgcgttgt tggacactgg ttcggacagc aacatcataa 2100 ctgaaaaact ggcggataaa cttcagttaa gcatggttcc agtggacgtt ccaatcagcg 2160 gtttgaacaa caatgaaact cgtgtcaagt tcctgctagc aacgaagttc caatcctgta 2220 ccagttcgtt tgcttcgccc aagctggatt tccttgtcgt cccacagatc acttcaaact 2280 tgccagctgt gagaattgat gcccgttcat ggacgattcc ttccggatta cgttttgctg 2340 ctcctacttt ccattcacca agtggaattg acatgatcat cggaaatgag gtgtttttcg 2400 atctaatcaa gggcggtcga gtgaagctcg gtaccattgg cgttacccta gccgaaacac 2460 aactagggtg gattgtagct ggctcggtac caatcaagga agaaccgtct cgccgagtat 2520 gtcagctcaa tcgaaatgaa gagatgctaa accaaacgat gtccaaattc tgggaattgg 2580 aatgtgtgta ctccgataaa tcatcgactg tgactgaaga attagttgag gaacacttca 2640 aggctaccca ttatcgtgaa gattcgggac gttatgttgt gcgactgccg tttaacggca 2700 taaaaagtca actcggcgac tcttatgaca tggctcgaaa acgtctcaat aagctgatgc 2760 tccagctggc caaaaatcct tccaagcgtg gtgagtacta caagttcttg tctgagtact 2820 tagctctagg tcatatgacg gaggttgctg gatcaacgtg cgatgggggg tattacattc 2880 cccaccatgc tgtttataag gcttcgagct caacgacgaa aacaagggtg gtgttcgatg 2940 cgtcggctaa aaccaccaca gacctctcgc tcaatgacac ggtattagtt ggacccacgg 3000 ttcaaaatga catcgtatcg atcataattc gattttgcat ccactcggtg gtattgactg 3060 ccgacatttc aaaaatgtac cggcaagtac ggctgcataa ggatgattgc aagtaccagc 3120 gaattctctg gtggaatgac gacggacagc tgaagaccta tgaacttcaa accgtaacat 3180 atggtgttgc tagttcgcca taccatgcca caagcgtctt ggtccaacta gcaatcgacg 3240 aaggtgaaga ttttgctctc gcgaaggaga tcattacgga ggacagctac atcgacgatt 3300 ttttgaccgg cggctcgtct agcgaagaag tcattcaagt ctattcagag ctatcagaac 3360 tgctacgtcg tggtggcttt gaagtgcata agttttgttc gaacagcaac gatgttttgg 3420 gcgccattcc tgaagagctt caagagcaac aagtcagttt cgatgatgga ggcatcaaca 3480 acagcgtcaa gacgctgggt ttgatctgga accctcaggc agattacttt atgtttcgag 3540 ctactgtggc tgattggaag gaggtgccaa ctaaacgtaa agttctgtcg gagatcgggc 3600 agcttttcga cccactcggt ttcttgggtc ctgtaattgt atttgccaag ttgataatgc 3660 aggacatatg gcgtctgggt cttgcttggg atgaagaact accggatgac ctgcaagaaa 3720 aatggcacca atttcgccgg cagctcccag ctttaaatca ggtgcagaag ccccgttgtg 3780 tcattcagag tgaggctgtt actctggagc ttcacgggtt ctcggatgcg tccaaacgcg 3840 catatggagc tgttgtgtac atgcgaagca tcactgcaga tggcaagatc tacgtcaact 3900 tggtggccag caaatctcgt gtggcaccgc tcaaaccaac aacgattcca cgactcgagt 3960 tgtgcggtgc gaagcttttg gcagaactgg tgaagaaagt gattacgtcc atgaagattc 4020 gtctcgatgc tgtaaggttg tggtgcgatt cccaaatagt tttgtgttgg ttgaagaaat 4080 ctcctctcgc tctaaaccaa tttgttgcca accgtgttgc tgcaattgtt gatctaacac 4140 aatcatatca gtggggttac gttcggtccg aggaaaatcc ggccgatgcg gtatcgcgag 4200 gagaattacc agaagatctg ttgcagaaga accagtggtg gagtggtgca aatatgctat 4260 gggaacctca acccatcctg agtgaacctg agttgtttga tgaatccatg attccggagg 4320 tgaaatcgac tgtaatgatg gcagcagtac ggtcggaccc accgattgta cttaacagac 4380 tgagtagcta caagaggatc cagagagcat gggtttatgt tcatcgttac atcgatatag 4440 taatgaataa gaagaaggag tttggtgaaa ttacagccga cgaaatcagg atggctgaga 4500 agtcgatttt attgctcgtt caacgtgaat catatggtga tgtcttgaaa gcgctggaat 4560 caaaatccgt tctgcgacaa ccttacagaa atctggcgct attcgttgat gaagatggtc 4620 ttattcgagt tggcggtcgc cttaaatatt cggccattcc gtatgacggc aagcatcagg 4680 tactgttacc acagaagcac cacatcacag agaccatcgt aagaggacta catcaggagc 4740 acttccacgt tggccagaat ggacttttgg ctatcgttcg agagcattat tggccagtgc 4800 atgcgaagca gatcatcaaa aaggttgtat cttcgtgtca agtctgtgcg cgacagcgtc 4860 ctgtcccagg catgcagttt atgggcaacc tccccgaagt ccgagtcaac ccatcaccac 4920 cgttcttcaa agtgggaatc gactacgcag gaccgttcat gctcaaactg ggaggccgaa 4980 gtacaaaatt gtacaaggga tacgtggtcg tatttgtttg cttgatcgtg aaggctatcc 5040 acttcgagtt ggtttcaagc ttgtcgactg acgacttcat agcggcgtta caacgtttct 5100 ccagccggcg tggtgttccg agcgatatcc acagtgataa tgccaccaca ttcgtaggag 5160 caaatcacga gttagcagcc ctaaaacagc tgttcgaaga ccagcagcac cagctgaaag 5220 taaaggagtt ttgtagtgca aagggtatcc gctggcattt cattcccccc agaagcccac 5280 actttggtgg gatctgggaa gcaggggtga aatctatgaa atatcaccta aagagagttg 5340 tgggtgagac acgacttacc ttcgaagaga tgagcacttt tctcgcccag acggaagcca 5400 ttctcaactc gcgaccgtta tgcccgatgt cagatgaccc aagtgactat tccgttctga 5460 cgccatcaca ttttttgatt ggccgttctg gcgtagctct tcctgtgcca tcatatggtg 5520 atgagaaact ggggcgactc gacagatatc agcacatcca gcagatgcac caacattttt 5580 ggagcaagtg gtctcgcgaa tatcttcacc atttgcaagg tcgccagaag tggaacacaa 5640 atgcgaactc gagtttcaag gttggtgcgt tagtgctctt ggttgaggaa aatctgccac 5700 cacaacaatg gaagcgtggg cggatttcgg ctgtgcatcc cggagacgat gatgttgtgc 5760 gagtggtgac agtcagaacg acgaacggtg attacaagcg agcgataacg aagattgctg 5820 tgatgccttc tgttgaaact gaattatcaa cggggggtgt atgaacagta aaaaa 5875 // ID Copia-125_AA-LTR repbase; DNA; INV; 129 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-125_AA_; KW Ty1_copia_Ele14; Copia-125_AA-I; Copia-125_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-129 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 129 BP; 23 A; 33 C; 33 G; 40 T; 0 other; tgcaaaacgt gtttgctcgt tccatcgtta aacgcggaat ccgaaaagtc cccggtcgat 60 ctggttcttg tgccgtcgga gtgttgtcca ccttgtaacc ggtccgggat tgctgtttta 120 cgttctaca 129 // ID hAT-N1_AP repbase; DNA; INV; 561 BP. XX AC . XX DT 19-MAR-2009 (Rel. 14.03, Created) DT 19-MAR-2009 (Rel. 15.12, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N1_AP. XX NM hAT-N1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-561 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(3), 660-660 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 561 BP; 176 A; 83 C; 92 G; 210 T; 0 other; cagcggttct taacctgtgg tcctcggccc cctgggggtc cgcgatactc atatcggggg 60 tccgcggcag ccttaacaca taacatacgt caataagtga catgaatgca ttattacaat 120 attgttggat tattattatt ttttttcttc atatcatata taatatgtgg attctatacc 180 aaataattgt attattaata tttattaaat aatcagtttt tcttgttcat acgaaatgta 240 cacgcataca gcactgtact ccacagtctt atatttaggt actgccacta agtgcagttg 300 atcgatcgta ctccaattat agttttattt aacattttta taaaaaatta attgtaatat 360 ttaataatta taaatgtgta tgttttttta agtctactta tgtaatttgt tttataaaat 420 aattattata attttggtta tgataacatg agatgttttg taaaaaaaaa aggtattcga 480 aaaaaatttg gtaagggggt ccgcgatttc tagatatctt tcatcggggg tccgtgatct 540 acaaaaggtt aagaaccact g 561 // ID DNA-13_AAe repbase; DNA; INV; 767 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-13_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-767 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1268-1268 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. 9 bp TSD. >10000 copies. XX SQ Sequence 767 BP; 265 A; 133 C; 119 G; 249 T; 1 other; ttcgggtgaa attgatcatt tttcacggtt tttcttgtcc gttttctata atgttgacaa 60 tgccaaacaa ttgaatgcag gaaaacaagt acgaaggtga gtctcattga ctcatgtacc 120 gaaattttct aacaaatgtc attttagtgc taagaaatat ctctaaaata aaaaatcggg 180 gaatctcatt tcggggtgaa attgatcact tgtcaatgcg attattgtta gttttcaaac 240 caactctata taataaattt gaggtccctg aatccgaata tgcttgctaa attcttaaca 300 atacaataat tatagaaata atgaatagtt aaatttcaag aattacgcga aaaacgccta 360 aatgtatgca atttcctaag gaatatctcg atatctaaca gaaattcaat tatttccacc 420 gaatgaccaa attggtacga gttttggtta tgaaatctgg tttggggata tatctcaagc 480 atcctcacaa actttcaggt ttataccatg ttcaaatcct tgtttagaac tagaagttta 540 tggatgtttg tcaatatcgc accgcacatg attttggaat tttacattat tttcatagaa 600 ataaacattc tttaacgcaa aamacttaac aacacacaat tcgacaatag ttacgacaaa 660 caacgttcat tcactgcagg ttacattgat atttgtgctc cattaggaag aaataaaatc 720 gagtgacacg gtgatcaatt tcaccctact gatcaatttc acccgaa 767 // ID Chapaev-N6_AAe repbase; DNA; INV; 743 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 23-DEC-2010 (Rel. 16.03, Last updated, Version -1) XX DE A non-autonomous Chapaev-type DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-N6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-743 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 836-836 (2011). XX DR [2] (Consensus) XX CC >96% identical to consensus. 4-bp TSDs (TTAA). XX SQ Sequence 743 BP; 248 A; 136 C; 122 G; 237 T; 0 other; cccctctacc ggcagcttca ttttttaccg caaaattcaa attcaaatcg ttataacttt 60 tttgtttttc aatatttttg caccattttt tcacaagctc tcaaaaaact cttctagttt 120 taggatctgt gtcgatattg atcattggtc atctggtttt gaagatattc caaaattcct 180 tgggggaccg acccgtaact agaacccccc gttaatttct caggccgcag aattttaatc 240 gtattcggat attcactctt cggaacaata cattaatgag agtatgttgt gaaaaaatga 300 agcaatttgg tgcagccgtc tttaagtaat gaccatttag gtttctggaa ccacgctggc 360 caaataaaga tcttaaaaac ttcaaaaaac atcatatcgt aattttcaga tatctccaag 420 aatactgaac cgatttgtat gattttttca gagaagctcc ttattacctg gcattatata 480 gcccacattt tatttttttg ataaattgat caaaaacaaa atggccgcca aagacatttt 540 atatggaaaa tgtcggtccc ccaaggaaca tcggaatatc tttaaaacca gatcaccaat 600 aattaatatc gacacggatt cctaaactag aagagttgtt tgagaacttg tgaaaaaatg 660 gtgcaaaaat attgaaaaac aaaaaagtta taacgatttg aatatttatt ttgcggtaaa 720 aaatgaagct gccggtagag ggg 743 // ID Gypsy-17-I_NVi repbase; DNA; INV; 8802 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-17-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8802 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 771-771 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 666..2768 FT /product="Gypsy-17-I_NVi_1p" FT /translation="YNRGERTKRRLFTVRTTNRHIDCSNSFEGTSILFRIS FT ETVRGHLSEMASEASVNDWVMALSNDEIKREAESRGLNTAVSLAELQRRLA FT HEVYIERGIVQPDEELPPSGAAREHDDPPPPYQLSTNVPLLTSSVSGPIAT FT AREQPFHASRTSSPIETERGRDREEQTSTGTVPRVPRRSVLFGDTTQFCTN FT NQQNNSQRDLGYLTLTSDGHVYNSTNFQNVQDRPIINHPRANSTTRLAGNN FT LISSTAMFNTQNQATFNRPSTLAQEVNPNNLISLQTQNLSRNTRHTGNGYV FT STTPREIPARINNFMPPEGQNGYLNNGIARPSHLQLPINNNNNDNRQQYRG FT AYNNNEQPINTERRLIAPTHINAFDFICKWDLKYSGRRDEDSEEFLKRLRE FT GRLIMHISDDDLFGILPFFLQGVALQWYRINSDNWVTFEQFENAWRARFSD FT ADFQFALMENARRRTQGEREPVADYLTYIKAMFNRMIPPLGPVREINLALR FT NMLPRLQTRIDKSEIFTFQELEAIAVRKEKGLMIARTYKPPPSAEDSLLPD FT LAYREQRPQRNIRQQLSNLELDEEQNRARVEHTEPDFTEDALPDEDELNNL FT GYGRDKRFNNNQRQPTNQSPNKNLHNNNRPMTQIQICYNCGKSGHFSRECE FT GPKRAFCYKCGKIGTRFINCPKCNPTEIFCNDCGLIGYLRVNCPECTGNAD FT *" FT CDS 2684..6358 FT /product="Gypsy-17-I_NVi_2p" FT /translation="SNGNILQRLRINRLPQSKLSRMYGKRRLKSQNKADAA FT WKTEKASLVELNKNLDIYPTKINAVCDEKSCTQKNDEADLKAISEKSGYLT FT NKDKIEPKAICDILNKAPDILNNQRENAEIRDNNFEYLKFTPSQWDILTIE FT EKEQLAFSDYLQDIETVVSEAPEFLIKNSSYKLIRDDLIHEDIYTTDKLYS FT LPLKSPNLLYSTVSIADNKFKALIDSGASKTFIGPKIHKIATKNALLVNKN FT VTGRVTTPLGYTEEIREIIHTPIKFQDRIRVIETRVLDCLDQDCILGIDAL FT HSFGVIADFTNFTYTFGSKPFDRYTFEKRFAETSETAEACYGLRELSPSEE FT KILEEFLNNEIPKSSGKLGKTSLIEAVIDVNNHPAIKQRPYPISPTVETIL FT FNEVDKMLDEDIIERSNSDWSSPVHMARKANGTRRFCLDLRKINAIIKKDA FT YPLPLMGSILDKLRVARYISTIDLSQAFLQIPLEKKSREITAFAVPGKGLF FT HFKRLPYGLSNSPGIFQRLVDSLIGPELQPHVYSYLDDLIIVTETFDEHLK FT WLKVVFDKLKHANLTINKEKSKFCLSEVKYLGYLVNKNGLQVDSEKVAPIR FT EYPVPKTLKQIRRFIGMTSWYRRFIENFASLIQPISSLLKKDKKLEWGPEQ FT QTAFETIKERLMTAPILMRPDFSKPFTIQVDASTVGLGSVLTQIIDDKERV FT IAYASRTITDAEKNYTTTELECLGVIWAIEKFRGYIEGTHFKVITDHSSLQ FT WLKSLKKPVGRLARWSMYLTAYDVDIEHRKGAMHHVPDALSRMYENQLDEV FT NTLNDDNDDWYKGRIDEVRKNPKKYHNWHIAGDILYHKTEDTFLDTIKRDQ FT SIWKLVVKSKDRDRVLFEAHDDTQSGHLGIDKTHARIAKDYFWPSMYKDIV FT NYVKRCDVCQKVKPEQFAPPGMLGQRIIEHPWSVVAADIVGPLPRSKKGKR FT YLLVIQDLFTRWIEIQPLREATGKSIKDALHDLIITRWGTPRVLITDNGTE FT FINKTLIEYTKQLNIKHSTTPPYHPQANPVERVNRVLKTMITAFIEQDHRE FT WDNHILDFRFAFNTSKHTSTQVTPAFLNFGREPLPIHSLRKEIENSDIIER FT GNVDQWAERLYRLRAIRDWVVENSENAHNRQANYYDKSHRNLTYKVGDSVL FT MRNRVLSDMSKNFAAKLAKPYKGPYTVTKVLSPLVYELVSADNQIVSKIHI FT CDLKPYRQNTCIPX*" FT CDS 6613..8085 FT /product="Gypsy-17-I_NVi_3p" FT /translation="MERQQEPDTTTEEEERLLLEVDSDDAITIDSDIEVLS FT LPPPSPADVGEPGPGSPPRRGESPEDDFDFIPRENADNGAAPQQQAEDADE FT VNSTATTPRARSPDNLSDRMEGDFDQQLPASPEPEPIEEEETNERPEDIQR FT RKEIERLRRMVKEQTPDNPLLPLLGHEEEVAEFFYENPPILSAFNKVIKNI FT VEKVRQRPCILRGSSLCIGKPEVLPRAPRRSTMIYELNPAVDQDTQTIAVP FT QVSSGTQTTLREPAQPLRTSIATQTPQLNFARPSLVSTATQTRPENLAGSL FT QASTSVQIQAEVHVEAVAPEHPPELSTEIQTPPTTPNETASERPFPPRNEQ FT ATENDIENAGANAIVDTPMEVEPTTPPAQAISGRRAGDPTLTGWMRINGRD FT VNVLPSLPGCFNCSRRHHYRNCPHPATVFCYRCGRRNVTVRDCPHCAPGWR FT EEGPYIRRLRTHVPRDQPLPPREGLDELQQFSQRPRHTFTRMEPY*" XX SQ Sequence 8802 BP; 2926 A; 2084 C; 1825 G; 1962 T; 5 other; aactagcgcc ctataacgtg tggattaata gaaaagaata tcgtgatcgg taaaatcgct 60 gtctcgagac aaaggaaaca accagtgtga gtgaaatttg agagcgcgtg tgtgaaaatt 120 aattgcgtaa agttaaaaat cgtagagtag ttgactggat atctaaaata attttgtagc 180 gacacaaatt tctggctttc taacttaaaa atatatctac tagttacctc taaaaaacta 240 atatttctcc cggatatcta ttgctagcta cgaaaaatct attctcgtat agattcctat 300 aattatacaa gtagcgagct taaaaagcac gtaaataact tacgtagtag aacggaaaag 360 gttaaaatct cacgaagcag gcaatctcga cacaacggga aaattcctac gaaacaaagt 420 cgagaaaggt cgcaacgtgt cgaaaacaaa ttcttagagc gcgagctcaa acgtacctcg 480 ggtcataaag ataaaaaaaa acaaagggac actgttgccg agcggctcac aaacagccgg 540 cgacacgtgt gtccgcgccg gtcgaagatc aacgcgaacg cgcgggtcga gtaacgtaaa 600 cactctcgaa acgaacttcg gcaatacgtg gaactgcaca caccgagatt agtacgtaag 660 agtagtacaa cagaggtgaa agaacaaagc ggcgactgtt tacagtacgc acgacaaacc 720 gacacattga ttgctcgaat tccttcgagg gaacttcgat actatttagg atatctgaga 780 ctgtgcgagg tcacctttcc gaaatggcta gcgaagcctc agtcaacgat tgggttatgg 840 cattgtcgaa cgatgaaatt aaacgcgaag cggagagccg cggactaaat acagcagtca 900 gcttagccga actgcagagg cgtttggcac acgaggtgta cattgagagg ggcatagtgc 960 aacccgacga ggaacttcca ccgtcaggcg ctgcgcgcga gcacgacgat ccgccgcccc 1020 cctaccagct ctcaacgaac gtcccactgc taacatctag cgtatcgggc cctatagcga 1080 cggcgcgaga gcaaccgttt cacgcgtcgc gaacatcgag ccctatagag accgaacgcg 1140 ggcgagatag agaagagcag acctctacgg gtacagtacc acgtgtaccg agacgcagcg 1200 tattattcgg cgacactaca caattctgca caaataatca gcaaaataat agccaaaggg 1260 acttaggtta tctaacatta acttcagatg gccatgttta caattcgaca aattttcaaa 1320 acgtgcagga taggccaata ataaaccatc cgcgagcgaa cagcacaacg aggttagctg 1380 gaaataattt aatttcatcc acggctatgt ttaacacaca aaaccaggct accttcaacc 1440 gaccttctac attagcacaa gaagtcaatc ctaataattt aatatcatta caaacacaaa 1500 atttatcgag aaatacgaga catacaggga atggttatgt tagcacgaca ccacgtgaaa 1560 ttccggccag aattaataat ttcatgccac cggaggggca aaatggctac ctaaataacg 1620 gaatagctag accgtctcac ctacaattac caattaacaa taataataac gataatcgac 1680 aacaatatag aggcgcgtac aataataacg aacagcccat taacacagag cgtaggttaa 1740 tcgcacctac tcacattaac gcttttgact ttatttgcaa atgggactta aaatactctg 1800 gaagacgcga cgaagatagt gaagaatttt taaaacgact aagggaaggg cgcttaatca 1860 tgcacatttc ggatgatgat ttatttggaa tattrccgtt tttccttcaa ggggttgcct 1920 tacagtggta cagaataaat tcggataact gggtcacgtt cgagcagttt gaaaacgcat 1980 ggcgtgcacg tttttcagac gcagattttc agttcgcgct catggaaaac gctcgacgcc 2040 gaacgcaagg cgagcgcgag ccagtcgccg actacctcac gtatatcaaa gcaatgttta 2100 atcgcatgat acctccgcta ggaccagtcc gtgaaataaa tctagctctc cgaaatatgc 2160 taccgagatt acaaacccga atcgataagt ccgaaatctt cacgttccaa gagctggaag 2220 ctattgcagt acgcaaagag aaaggcctaa tgattgcgag aacatacaaa ccacctccga 2280 gcgccgaaga ttctctccta ccggacttag cttatagaga acaacgcccg cagcgtaata 2340 ttagacaaca attatctaac ctcgagctag acgaagaaca aaatcgagca cgcgtagaac 2400 atacagaacc ggactttacg gaggatgcct tgcctgacga ggacgaatta aataatttag 2460 gttacggacg cgacaaacgc tttaacaata accagagaca accaacaaac caatcaccta 2520 ataaaaacct gcataacaac aacagaccaa tgacacagat acagatctgt tataactgcg 2580 gaaaatccgg acatttctcg cgagaatgcg aaggcccaaa acgcgcgttt tgctataaat 2640 gcggaaaaat cggaacacgg tttataaatt gcccgaaatg taatccaacg gaaatattct 2700 gcaacgattg cggattaata ggctacctca gagtaaattg tccagaatgt acgggaaacg 2760 cagactaaag tcgcaaaata aggccgatgc ggcttggaaa acagaaaagg cctcactggt 2820 cgaattaaac aaaaatttgg atatctaccc gacaaaaata aacgcagtct gcgacgaaaa 2880 aagttgcaca caaaaaaatg acgaggctga cttaaaagct atctccgaaa aatcaggtta 2940 tctaactaat aaagacaaaa ttgagccaaa ggctatctgt gatatcttaa ataaagcacc 3000 ggatatctta aataatcaac gagaaaacgc ggaaattcga gataataact ttgaatattt 3060 aaaattcaca ccgtcgcaat gggatatctt aactatcgaa gaaaaagaac aacttgcgtt 3120 ttcggattac ctacaagaca tcgagacagt cgtaagcgaa gcacctgaat ttctaataaa 3180 aaattcaagc tataaattaa ttagagacga tttaattcac gaggatatct acacgacgga 3240 taaattatac agcttacctt taaaatcccc gaacctactt tactctaccg tcagcatagc 3300 agacaataaa tttaaagctc tgatagacag cggagcgagt aaaacgttca taggaccgaa 3360 aatacataaa atagcaacga aaaacgcact attagtgaac aagaacgtaa ccgggagagt 3420 cacgacgccg ctaggttata cagaagagat acgtgaaatt atacacacac caattaaatt 3480 ccaagataga atacgagtaa tcgaaacacg agttcttgac tgcctagatc aagactgtat 3540 tttagggata gacgcgttac acagtttcgg cgtaatcgcc gatttcacta atttcacgta 3600 caccttcggt tcaaaaccgt tcgatagata taccttcgaa aaacgtttcg cagaaacgtc 3660 ggaaacggcg gaagcatgtt acggcttacg tgaactyagt ccctcggaag aaaaaattct 3720 cgaagaattt ctaaataacg aaatacctaa atcatccgga aaactcggga aaacttcgct 3780 aatagaagcc gttatcgacg tgaataatca ccctgcaatt aaacaaagac cttatccaat 3840 atcaccgacc gttgaaacaa ttttatttaa cgaagttgac aaaatgcttg acgaggacat 3900 catcgaacgt tcgaacagtg attggtcatc accggtacac atggcgcgca aagccaacgg 3960 cacgcgcaga ttctgcctcg atttgcgaaa aattaatgcg attattaaaa aggatgccta 4020 tccattaccg ctaatgggat caatattaga taaactccga gttgcgagat acatttctac 4080 aatagatctc agccaggctt tcttacaaat tccgctagaa aagaaaagca gagaaattac 4140 agctttcgca gtaccgggaa aaggactatt ccactttaaa cgtttgccat atgggttatc 4200 taattcacca ggcatctttc aacgcctagt agattcctta atcggacccg aattacaacc 4260 tcacgtttat tcatatcttg acgatttaat catcgtcaca gaaacgttcg acgaacacct 4320 aaaatggcta aaggttgtct tcgacaaact taaacacgct aacttaacaa ttaataaaga 4380 aaaaagcaaa ttttgcctat ccgaagtaaa atatcttggc tatctagtaa acaaaaacgg 4440 acttcaagtc gacagcgaaa aagtcgcgcc tatacgcgag taccccgtac ctaaaacgtt 4500 aaaacaaata cggagattta tcggaatgac atcatggtac cgcagattta ttgaaaattt 4560 cgcgtcatta atacagccaa tctccagtct actaaaaaag gataaaaagt tggaatgggg 4620 accagaacag caaacggcat tcgaaaccat aaaagaacgt ttaatgaccg ctcctatatt 4680 aatgcggccc gatttttcaa aaccttttac catacaagtc gacgcaagca ccgtagggct 4740 aggtagcgtc ttaacacaaa tcatagacga caaagaacga gttatcgcgt acgcaagccg 4800 aaccataaca gacgccgaga aaaattacac gaccacagag ttggaatgtc tgggggttat 4860 ctgggcgatc gaaaaatttc gaggttatat agagggcaca cacttcaaag tcattacgga 4920 ccatagcagt ctgcagtggc taaagtcact taaaaaaccc gtgggtaggc tagctcgctg 4980 gtcgatgtat ctaacggcat acgacgtcga tattgaacat agaaaagggg ccatgcacca 5040 cgtacccgac gcgttatcgc gcatgtacga aaaccaactc gatgaggtta acactcttaa 5100 cgacgataac gatgactggt ataaaggaag aatcgacgaa gtacggaaaa atcctaagaa 5160 atatcataat tggcatattg ccggggatat cttgtatcac aaaaccgaag atacgttcct 5220 tgatacgatc aaacgcgatc aatctatctg gaagctcgta gtaaaatcaa aagaccgaga 5280 ccgagtactg ttcgaagcac acgatgacac gcagagcggc cacttaggca ttgacaaaac 5340 acacgcacgc atagcaaaag attatttttg gccaagcatg tataaggata tagtaaatta 5400 cgttaagcga tgcgacgtgt gtcaaaaagt caaacccgag caattcgcac caccgggaat 5460 gctaggccaa cgcattatcg aacacccttg gagcgttgta gcagccgaca ttgtgggccc 5520 gctgccacgt agcaaaaagg gaaaacgcta tctgttagta atacaggact tattcacccg 5580 ctggatagaa atccaaccgt taagagaagc caccggaaaa tcaattaaag acgctttaca 5640 cgatttaata ataactcgct ggggaacacc acgagtgctc ataacggaca acggcacaga 5700 attcattaat aaaactctaa tagaatacac aaagcaactt aatatcaaac attctaccac 5760 accaccctat cacccacaag ctaacccagt agaaagagta aatcgcgtac taaaaaccat 5820 gatcacggca tttatcgaac aagatcaccg agagtgggac aaccacattt tagacttccg 5880 tttcgcattt aatacctcya aacatacgtc tacgcaggtc actccagcct tcctaaattt 5940 tggacgcgag ccgttaccga tacactccct gcgcaaagaa atagaaaact cagatatcat 6000 cgaacggggt aacgtagatc agtgggcaga aagactctat agattaagag cgattagaga 6060 ttgggtcgta gaaaattcgg aaaacgcgca caatcgccaa gcaaattact atgacaaaag 6120 tcaccgcaat cttacttaca aagtaggcga ttccgtctta atgcgtaacc gagtcttgtc 6180 agacatgtca aaaaacttcg cagcaaaact ggcgaaaccg tataagggac catacacggt 6240 cacaaaagtt ctttcaccac tggtgtacga gttagtttcg gcagacaacc agatagtctc 6300 aaaaatacac atctgtgacc taaaacctta caggcaaaat acctgcatcc ctgmatagac 6360 acctcaggct acctatcatc atcatcayag ataacattta ctatcacaac ataaggttat 6420 ctacaacgat cattaataaa ctgtatgcgg acactggcta ccttatttgg atttattgat 6480 tatcttgtta tgtaagtaca cgaaaattta aacacacgat aattgtcata tcgcggtata 6540 ggcgaagctg gttatctaat gtaaaatcta gatcaacgag ggttatctta ctgctgatta 6600 caaacttaca ccatggaaag gcagcaagag ccagacacaa cgacagaaga ggaagaaagg 6660 cttctgctag aagtcgattc tgacgacgcc attacgatag actcggacat cgaggtactg 6720 tcgctacctc caccgtcgcc cgccgatgta ggcgagcctg ggccaggctc tccgccacgg 6780 aggggggagt ctcccgagga tgacttcgat ttcattcctc gtgagaacgc tgataacggc 6840 gcagcaccac aacagcaagc tgaagacgcc gatgaggtga actcgacggc aaccactccc 6900 agagcccgct cacctgacaa cctgagcgac cgcatggagg gcgatttcga ccaacaatta 6960 cccgcaagcc cagagccaga acctatagaa gaagaggaga ccaacgagcg cccagaagat 7020 atccagaggc gcaaggagat cgaacgctta cgtcgtatgg tcaaggagca aacaccagac 7080 aaccctctac ttcccttgct gggacacgag gaggaagtgg ccgaattttt ctacgaaaac 7140 ccgcccatcc tgtcggcttt caacaaggtc ataaaaaaca tcgtcgaaaa agtcagacaa 7200 agaccgtgca tactccgggg atcttcccta tgcattggaa aaccagaagt actacctcgc 7260 gcaccgagaa gaagcacgat gatctacgag ctcaacccag cagtggacca agacacgcaa 7320 accatcgcgg ttccgcaagt ttcctcggga acgcaaacca cgctcagaga acctgcccag 7380 cctctgcgaa cctccatcgc aacgcagact ccgcagctga atttcgctcg accgtcgcta 7440 gtttccaccg caacgcagac tcgacccgaa aatctagctg gatccctgca ggcctccacc 7500 tcggtgcaga tccaagccga ggtacacgtc gaagccgtag ctcccgaaca cccgccagag 7560 ctatccacag agatccagac acctccaact acacccaacg agacagcatc ggagagaccg 7620 ttcccgccaa ggaacgaaca agccaccgag aatgacattg agaacgctgg ggctaacgcc 7680 atcgttgaca ccccgatgga agtcgagccg acaacaccac ctgcgcaagc aatatccgga 7740 agacgcgcgg gagatcccac actcactggg tggatgcgca ttaatggccg tgacgtcaac 7800 gtcctgcctt cgctacctgg atgtttcaat tgcagtagac gccatcatta ccgcaattgc 7860 ccacatccag ccaccgtctt ctgctaccga tgtggccgca ggaacgtcac agtacgcgac 7920 tgtccacact gcgcaccagg atggcgagag gaagggcctt acatccgtcg cctcaggact 7980 cacgttcccc gtgaccaacc gctccctccc agagaaggcc tcgacgagtt gcagcagttc 8040 tctcaacgac caagacatac cttcacgcgt atggagccat actgaggaag aaccgtggtt 8100 atctaatgta gaaagttatc ttatttttgc taccttttat attcttgttt accttttcta 8160 gattattaat tttgttcaaa agtattattt agtttgaagt tcctccgtct gattaaccgt 8220 taatcagacg catcgcgaaa gttagcgttc aatagtttgt aacgcgcata aagtggttgc 8280 cttattatta tcaaaatagt cgactgacta ggttatctaa taattaagaa gagtgattat 8340 aacgcaccaa agtgtcctta ggtttgccaa agctatcttt attataattc attgtagttc 8400 ctcttagata tatcgattat ttaaaataag gttgtctctg tgttaccata gtacttaagt 8460 aatattatta aataaataat cttttctact tttctatcaa atcaagcgtc atctgattac 8520 taaaataaac ggcctgtgtc ctcatcacct gaacctactc cgttctatgg ggcttccacc 8580 cgccccacta ggttaagtta gaattaagtt agtcttaaga attagcttta agaaaaccac 8640 ttgcttttcc caccccttga tgagttgtgt gtgagttaaa gagagacggt tgatctgcca 8700 cgcgtgagat tagccttgag actcgctttc acccgcgggg tcggccgggg agtttgacgg 8760 agcgattcgc tggctcgctc ctttcctcac ggaggggaga gt 8802 // ID Gypsy-100_AA-I repbase; DNA; INV; 5316 BP. XX AC supercont1.273; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-100_AA_; KW Gypsy-100_AA-LTR; Gypsy-100_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5316 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.273; Positions 1459800 1465115. XX CC Positions [2516-3016] - Reverse transcriptase CC Positions [4151-4621] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 905..5179 FT /product="Gypsy-100_AA-I_1p" FT /translation="MASTSMAFSLEPYRKGTSFNDWYTRMKYFFRVNKIKD FT EDKMAYFITMSGPVIFAEIKLLYPAGNFEDAELDDIVSKLKSRLDKTDPDL FT VQRYKFSTRVQNPDESTEDFVLSLKLQAEFCGFANFKEVAILDRIIAGIKD FT KNLRQRLLSEEKLSLSNAEKIIATWEVARANAGTAEPNNRDFPNLVAMVEG FT GSRDQEGTAMRRLSKLYDLARQNQTSGSNDMAGNSRGPVKSRLGFRPYERT FT QHTFRGGRMAYGSRAGSSRQDDQVFGQRQWQRPDYSQMICNYCGVKGHIKK FT KCFKLKNLNRDAVNLVESYKPGPSADRHITELLERMRTQDSEDEEIGSDSG FT ELHCMLVTSINKISNPCLVHVNIEGKELEMEVDCGASVSVISKKRYLSKFN FT NPLRNYSEQLIVVNGAKLKIEGEATVFVKYNGKEALMQLLVLDCENDFYPL FT LGRTWLDVFYQNWRQYFTNSLKINNLNDDNGKIALDDIQMKYSNVFTKNFS FT RPINGFKAELVIKDETPIFKKAYDVPYRLRDKVGSYLDKLEKEKVITPIDT FT SEWASPIIIVMKKNDEIRLVIDCKVSINKSIVPNSYPLPSAQDIFANLSGC FT NIFCALDLEGAYTQLELTERSKKYVVINTMKGLYKYNRLPQGASSSASIFQ FT QVMDKVLEGIENVSCYLDDVLIAGKTVDECKTKLLAVLERLANANIKVNLE FT KCKFFVKELTHLGHVISGKGLKPCPDKILTIEKAKAPKNESELKSFLGLIN FT YYHKFIPNLSAKLYFLYNLLKSNVRYVWDDNCQKAFVESKNLLIQANFLEF FT YDPKKQIVVISDASSYGLGGLIAHVVDEVEKPISFTSFSLNSAQKKYPILH FT LEALALVCTIKKFHKYLYGQHFTVYTDHKPLVGIFGKEGKNSIYVTRLQRL FT ILDLSIYDFDIIYRPSHKLGNADFCSRFPLPQDVPRELDTEVIKSINFSKE FT IPIDSKMIATATKDDDFLQQVMSYMQNGWPDRVDKPFADVYSNCFELELID FT DCLLYMERVIIPQIYQKQILSLLHANHAGVIKMKQLARRMVYWFGINADID FT KYVAECCACNSMATSHEQTEKSKWTPTTRPFSRIHIDFFFFEHHTFLLMVD FT SFSKWLEIEWMKNGTDSNKVLKKLVAYFARFGLPDVIVSDNGPPFNSYSFV FT NFLEKQGIRVFKSPPYNPSSNGQAERLVRTVKEVLKRFLMDPDMKELDLEN FT QINYFLFNFRNYNSTKDGHFPSERIFSYKPKTVLDLVNPKKHYKKQLDIQQ FT PDDETVSENQSGKIPDRSYDAIDHLMAGDEVWYKNNNPHHHARWVKATFIK FT WYSHNILQISIGSVSAMAHRKQIRICGDEATRQRPNVVITRSGGSENPKGN FT MDMARSEGPPNSNEEVRLPEAPNGVRKRKRLNSDSRVGTPGELDLRRSKRV FT RKANRSEDFYY" XX SQ Sequence 5316 BP; 1761 A; 873 C; 1149 G; 1533 T; 0 other; aaagatttaa aagtgacgac gaggcaaagt ggtgtttttc tagcgagtga gcagaaaaag 60 tgcgccggcg ttcagcacca tcaacctgaa cgggtcgcag gcaggacaaa gcagcggaaa 120 attcccatct cttcagtgga gttatcgcta cagacctgta gaggcgaaat tgctgccgtg 180 agaatcggag tttgggattt gtgagtataa actgacgcta caaaagtgaa atatataaag 240 aaaacgtttt cttagtgttt cggccatctt atcgggtggc gtcagttgga cgctcacaaa 300 ggcgaaaagg cctttcattt ctatacccag ttaatgaaaa gggtggtcca aaaatcaatc 360 ttttgttcta ttgaaaaaaa caaaacggat tattattcct tttaattgtt ttctcataca 420 ttgtattaag agtggctatt ttcttttctt aagtttaagc aggttgaaat ttagcgcaac 480 aaaggtgcga tacatatttt agctatccaa tatcttctat atagcgcaaa ggtgcttgga 540 aacagctgga ttgatacggt ttcgttgacg ttcgttcccg aaggaaatcg gacactccct 600 gtgcgaggaa ctggatccta cctttcattc gcagtcacaa ggagtggcat tggaagtatc 660 aacctaaacg tgagtaatct caggagtttt acttaaagtg gagttacaac ccttatttgg 720 ttcacaatta gaaatcttac tgtatttttt tctttcgttc tcattccata actattgatt 780 tgttgaaact ttatgtgact acacaacacg atttttgcct tgtgaattgt gggatattta 840 ccttgatttt gattcaatta attctacgac tgttataatt ttttaataag tgattttttt 900 agtaatggct tccacaagta tggctttttc cctggaacca taccgtaaag gaacatcgtt 960 taacgattgg tacacacgga tgaaatattt cttccgtgta aacaagatta aagatgagga 1020 caaaatggca tattttataa ccatgagtgg gccggtgatt tttgcggaaa ttaagctttt 1080 gtatccggcg ggtaactttg aagatgctga gttggatgac attgtgtcca aactcaaaag 1140 ccgcctagat aagacagacc ctgatctagt acagcgatat aaattcagca caagggtgca 1200 gaatccagac gaatcaacgg aagattttgt gctaagcctc aaattgcaag ccgagttttg 1260 cggttttgca aattttaagg aagttgctat actagatcgc attattgcag gtattaaaga 1320 taaaaatctc aggcaaaggc tgttgagtga ggaaaaattg tcgttgtcaa acgcagagaa 1380 aattattgcg acatgggagg tagctcgggc aaatgccgga actgccgaac caaataacag 1440 agattttcca aatttagtgg caatggtgga aggtggttca agagaccaag agggaacggc 1500 catgagaaga ttatccaagc tttacgattt agctagacag aatcaaactt caggaagtaa 1560 cgatatggcg ggaaatagta gaggtccagt taagagccgt ttaggtttta ggccatatga 1620 gagaacacag catacattcc gtggtggacg catggcctat ggctctagag cgggcagctc 1680 ccgtcaagat gatcaagttt ttggacaacg gcaatggcaa cggccagatt actcgcaaat 1740 gatttgtaac tactgcggag taaaaggcca tataaaaaag aagtgtttca agcttaaaaa 1800 cctgaacagg gatgcagtta atctggtgga atcatacaaa ccgggtccat cagcagatag 1860 acatatcacg gaactattgg agcgcatgcg gacgcaggat tcagaggacg aagaaatcgg 1920 aagcgattca ggtgaattac actgcatgtt agttacgtct ataaacaaaa taagtaatcc 1980 ttgtctggta catgtcaata ttgaaggtaa agaattagaa atggaggttg attgtggtgc 2040 atcagtttcg gtgattagta aaaagcgata tttgtcgaag tttaacaatc ctttgcgtaa 2100 ttatagtgaa caactaattg ttgtaaacgg agcaaaattg aaaattgagg gagaggcaac 2160 ggtctttgtt aaatataatg gtaaagaggc tttaatgcaa ttattggttc tcgattgtga 2220 aaatgatttt tatcctttat taggcagaac atggctggac gttttttatc agaattggag 2280 acaatatttt acaaattcat tgaaaataaa caacttaaat gatgataatg gcaaaattgc 2340 tcttgatgac attcaaatga agtattccaa tgtttttact aagaattttt cacgccctat 2400 taacggattc aaggcggaat tggttataaa ggacgaaact ccaattttca agaaagctta 2460 tgacgttcct tatagattaa gggacaaagt tggtagttat ttggataaat tggagaaaga 2520 aaaggttatc acacctattg atacgagcga atgggcctca ccaataataa ttgtaatgaa 2580 gaaaaatgac gagattagat tagtaattga ttgtaaagtg tctattaaca aatccattgt 2640 cccaaattca tatcctctgc catcagcaca agatattttt gcaaatttat ctggttgtaa 2700 cattttctgt gctctggatc tggaaggagc atatacacag ttggaattga cggaaaggtc 2760 caaaaagtat gtagtcatca ataccatgaa aggcctctat aaatataacc gtttaccaca 2820 gggagcatct tcaagtgcat ctattttcca acaagttatg gataaagttt tggagggaat 2880 tgagaacgtt tcttgctact tagatgacgt tttaattgcc ggaaagaccg ttgatgaatg 2940 taagaccaaa ctactagctg tgttggagcg tttagctaat gctaatatta aagtcaattt 3000 agaaaagtgc aaattttttg taaaagagct cacacatttg gggcatgtta taagtggaaa 3060 aggactcaag ccctgcccag acaagatatt gaccatagag aaagcaaagg cgccgaaaaa 3120 tgaatcggaa ttgaagtcgt ttcttggact gataaattat taccataaat ttattccaaa 3180 cctgtccgct aagttatatt ttctatacaa tttacttaag agtaatgttc gatacgtgtg 3240 ggatgacaat tgtcagaagg cctttgtgga aagcaaaaat ctgctaatac aagccaattt 3300 tctagagttt tatgacccaa aaaaacagat agtagttatt tcagatgcct caagctacgg 3360 gttagggggt ttaatagcac atgtggtaga cgaagttgaa aaacctataa gctttacgtc 3420 attttctcta aattcagcgc aaaaaaagta ccctatctta catttggagg cactagcctt 3480 agtgtgtaca attaaaaagt ttcacaaata tttatatggg caacacttta ctgtttacac 3540 agaccataag cccttagtag gaatttttgg aaaagaaggg aaaaattcga tttatgtgac 3600 gagactacaa cggttaatac ttgatctatc tatatacgat ttcgatataa tatacaggcc 3660 ttcacataaa ttgggaaatg cggatttctg ttcgagattt cctttgccac aggatgttcc 3720 tagagagtta gacacagagg taataaaaag cataaacttt agcaaagaaa taccaatcga 3780 ttcaaaaatg atagctactg caactaaaga tgacgatttt ttacagcagg ttatgagtta 3840 catgcagaat ggctggcctg atagagttga taaacctttc gcagatgtct attctaattg 3900 ttttgagctt gaattgattg acgattgtct tctctacatg gaaagggtca ttatacctca 3960 gatttatcaa aagcaaattt tgtcactttt acatgcaaat catgctggag tcatcaaaat 4020 gaaacaacta gcacggcgaa tggtttattg gtttggaata aatgccgata ttgataaata 4080 tgtagcagaa tgttgtgcat gtaacagtat ggcaacatcc catgagcaaa cagagaaatc 4140 caaatggaca ccaacgacaa gacctttcag tagaattcat atagatttct tcttttttga 4200 gcatcacacg tttttactaa tggttgatag cttttccaaa tggttggaaa tagaatggat 4260 gaaaaatggt acagatagca acaaggtttt gaagaaattg gttgcatatt ttgcaagatt 4320 tggattgcca gatgtaatag tgtcggacaa tggtcctcct ttcaactcct acagttttgt 4380 aaacttccta gaaaagcaag gtattagagt gttcaaaagc ccaccataca acccatcaag 4440 taatggacaa gccgaaaggc tcgttagaac ggttaaagag gttcttaaga gattcctaat 4500 ggatcctgat atgaaggagt tggatctgga aaaccagata aactatttcc tatttaattt 4560 tagaaattat aattcgacga aagacggaca ttttccgtct gaacgaatat tttcgtataa 4620 accaaagact gtattagact tagtgaatcc aaagaaacac tacaaaaagc aactagatat 4680 acaacaacca gatgatgaaa cggtttcaga aaatcagagc gggaaaattc ctgatcgttc 4740 ttatgatgct attgaccatc tgatggctgg ggatgaagtg tggtataaga ataataatcc 4800 ccaccatcat gcccgttggg taaaggccac atttattaaa tggtactctc ataatatttt 4860 acagatatcc attggaagcg tcagcgcaat ggcgcaccgg aagcaaatta ggatctgtgg 4920 agatgaggca accaggcaaa ggccgaatgt ggtgattacg agaagcggtg gtagcgaaaa 4980 cccgaagggg aatatggata tggcaaggag cgaaggccca ccgaattcta acgaggaagt 5040 gaggctaccc gaggctccca atggagttag aaagcgaaag agactgaatt cagacagcag 5100 ggtcggaaca ccaggtgaac tcgatctcag acggtctaaa cgagtgagaa aagcaaatcg 5160 ttcggaagat ttttattact agtttcggat ggtcattcat aattacagtt aaaaagattt 5220 catgtgaatt tgaaacaaat ttattgtata agttttttat ttttgattat ttatcagttc 5280 tgattatata gcataagctc tctaagggag aagaat 5316 // ID BEL-120_AA-LTR repbase; DNA; INV; 946 BP. XX AC AAGE02025227; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-120_AA_; KW BEL-120_AA-I; BEL-120_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-946 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025227; Positions 34229 35174. XX SQ Sequence 946 BP; 233 A; 251 C; 181 G; 281 T; 0 other; tgttcggttc aaaaccgaag tttagtttta agtaattttt gtttgatttg ttttccacca 60 ttttgtttga attcagccct aaaatccatt acctttatat aaacccaaaa tcattgcctc 120 tcttcccaac tccccgtgtc aattagtcat ttgcgccttt tccccaatat ccacaaaagt 180 caataccttt cccccattcc cgaaacaaaa ggtaataaat gttagtttgt aagtcaacca 240 gtccaaaagt cagccagaat gacagtattt cccatcgcag caaccctctc ccatccagct 300 ttgtagtttt tgttcttttt gttgtttttg tttcgccctt tgacgtttcc atggagatgg 360 gaggagctag ttagaatttt tcaattgtca cgatgccaag attataggta taaaagtaaa 420 caattttgcg caaagcgcat cagtattttt gtaaccgttg gagagtgaac accaagaata 480 aatcgccgat tgaattagaa accagcagtt tttcctttca ccgaagagca tccagtcccg 540 tcctccactg ccagtcgagc cgatcgctgc tgctgccaaa accgaagacc agatacagtc 600 caccgttcgt cgctggcaat tgatttgtcc gttcgttcct gtccacaaat tgatcccacc 660 cgaggtagtc gattgtggac acggttcgtt tgcacgcgga aaaccaattg tctagcagca 720 ttcgagttcc gccgcttctg gaacagcaac aagggttcca tagtagcgtt tgtcccctcg 780 attcgatcca cagtcagacc tttccggtaa tctgctgtgg acaccgtttc gactgcacac 840 ggacaaccta ctagccctgg cctgttgttg ttagccagtc gctggaatcg attgcgcacc 900 cgccaggatc ccgttccctc cagtttgcca ctttgtgcat cgcaca 946 // ID Gypsy-29_AA-I repbase; DNA; INV; 6322 BP. XX AC supercont1.18; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_AA_; KW Gypsy-29_AA-LTR; Gypsy-29_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6322 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.18; Positions 759078 752757. XX CC Positions [4397-4873] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 99% similar to each other. CC A non-autonomous DNA transposon inserted at positions 5360-5596 CC is masked. XX FH Key Location/Qualifiers FT CDS 782..2122 FT /product="Gypsy-29_AA-I_1p" FT /translation="MVVYATKYFINSFQSILFRYLIVMSANGMYVRAPDAY FT MDEEESELYVVESQYSDDLNSCQLGANEVVGITEPIDTDGEGEPQSAEEVM FT WSTEDDPNATIMEKNQKDACSTNAHIDERLDRLERMMLELASSHTTSARNV FT PVEQNEIGWGFAKETGSQGAGYSSIRMETIPPFPKDVPANKLWEAWQEFLE FT NFEIAVSLSHPLDPVRRAKLLFVSMGRELQGIVRAAKLRPNLNEATCYSTF FT VENIDRHLKAMTDTSAEHEAFTSMKQEKGESAVSFHSRLMEKVRLCGYSPS FT DQERFVRAQLLKGLANRELAKLSRTFGYETNFVVQSATRDEAYDRETRHEA FT PDVYAITQTKSEPGANPGPAWKRPRTNESNRRAVHARDNFRKGRRFRCSRC FT NRLAHNGGPCPALGRKCRTCGMEDHFAAACRDRRKTAGALRRDERTSNDAE FT QV" FT CDS 2195..5314 FT /product="Gypsy-29_AA-I_2p" FT /translation="MSFILRSTYPLFQEINAITLDDVTVNCRVGRSTPISF FT LIDSGADVNVIGGSDWSVLYDQYQKGLVDLDIMNTQPNQELRAYATSNPMT FT VKCSFRGTVEIVGSSKPVVKALFLVVSEGRRSLLGRTTAGEMKLLEVGLAV FT NNCDTVQPEIFPKMPGVIVKFSVDKTVPPVKNAYFNVPAAYREGARLRLCD FT MEKQGIIERVAGAPGWISGMSAVPKGQNDFRLVVNMRSANKAIKREYYRLP FT LLDEMKIKLHGAKHFTKLDLSNAYYHLELSEESRDLTTFMTENGMYRFTRL FT MFGVNCAPEIFQREMCRILEGEDNIIVYIDDVLIFANSPEQLRKITSRVLE FT ILRKNNLTLNKSKCEFEKTQIKFLGHELDEDGFHVDKAKIKDIENFRPPTT FT GSELRSFLGLATFISPYIDNFANITHPLWAVSSSRTWTWGPQQQKAFEEIK FT NKITHCTTTLGYFSEEDRTIIYTDASPDALGAVLVQQQEGHPPRIISFASK FT SLTPTEKKYAQNQREALGAVWAIEHFSFFLLGRQFTLRTDAEGITFILNRT FT REDSKRALTRADGWALRLSPYNYDVEFVKGRDNIADPSSRLYVGVDEPFDE FT QKSPWEVACLDSNKGGFLTVDEIRKATLNDPILQKVRASVESGDWPADLNK FT FKAVTRELSLRDGLLIKNGCAVIPESLRERTLELAHDGHPMTAKLKSILRE FT RVWWPGITSDAEKWVQSCKTCAVNGKPEKPTPMKRIFAPQSVWETIALDFN FT GPYVKHGGISILVIVDYRSRYLIARPVKSTSFDHTKRVLEDVFAREGFPAT FT MRSDNGPPFNSTEYKQYCAQRGIQTVFSTPLFPQQNGMVENYMKLINRAMA FT SATENNTKFTDELQAAVNAHNAAAHSITGIPPEEVLMGRKIKRRLPLVDYK FT RAEYNDETLNAKDKKAKLLGKEREDARRGARECRVKPGDTVIVERQNKAKG FT EPRFDNNKFTVLEEVNGNLTLSSNGLTLKRHVTQTKKVGDWRNETDDETRN FT DSAVDESGKESHRPGCQRKQPAYLQNYVRTVSDD" XX SQ Sequence 6322 BP; 1917 A; 1254 C; 1446 G; 1468 T; 237 other; atggcgatcc agccaggtta gctgttcttt cgaagcagct aataaagaaa ggatcagtcg 60 atgacctcgg aattccctta cattttcaaa ttttcctgca aagcaatgct accatcataa 120 cctctgatac aaatgcgaca cgatgagtag agattaccat ctcaagcgag gggtagtaaa 180 attgttttcc caagaaaata tatctaactt tgcttgcatg tgctgagaca gccatcgata 240 atgaaggtta tgagtttgtt agaacaaaat gatcaatgag ctgagaccgc cattaaaagc 300 gtttaattac tgtaaagatg agtgttgaaa aaccatagag accagtagta gatatcgagt 360 aaaccaaatg aaaaagagac aggtttcccc cactgagcgt tgacgagagt tcgcataggg 420 gcatgtcatg aacatccgta aaagtttgca gaataaagcg ttgacgagag tccgtcattt 480 aagacactag caatagcgca cgtgatggaa tagggaaatt gtaatgcgtt tagataacca 540 tttatgattt cggtgcgagc attcccacat gttgatcacc cgcgatataa ctaattacaa 600 tccaggagtt catacactct ctgggaagag cttgtcacaa gaaacaagaa agttctagag 660 gttaataaca aacaataaat ggttgtatgt atggccaaag gatgaactgg gaaagtcatt 720 tatcactgaa gaggctcaag taggcagatc agtatgtcta ctaaaatata gtgacaattg 780 gatggttgta tacgcaacaa agtattttat caattcgttt caatcgatac ttttcagata 840 cctaatcgtt atgtcggcaa acgggatgta tgttcgcgca ccagatgcgt atatggatga 900 ggaggagtcc gagttatatg tggtcgagtc ccaatattcg gacgatctga actcctgtca 960 acttggtgcg aatgaggtag tagggataac tgagcccatc gacaccgatg gtgaaggcga 1020 gcctcaatcg gccgaggagg tcatgtggtc aacagaagac gatccgaatg caaccataat 1080 ggaaaaaaat caaaaagacg cttgctcaac gaatgcacac attgatgaga gacttgaccg 1140 tctggagaga atgatgctcg aattagcaag ttcccataca acttcagcaa ggaacgtacc 1200 ggttgagcaa aacgagatcg gttggggatt tgctaaagaa acgggatctc aaggtgctgg 1260 ttattcgtcg attcgaatgg aaacgattcc cccatttccg aaagacgttc cggcgaataa 1320 actttgggaa gcttggcaag agtttctcga aaattttgag attgctgtgt cattatcgca 1380 tcctctcgat cccgtacgcc gtgcaaagct tttgtttgta tcaatggggc gagaactaca 1440 aggcattgtg cgcgctgcga aacttcgccc gaaccttaat gaggcaacat gctattcgac 1500 ttttgtcgag aacatcgaca gacatttgaa ggcaatgacc gacacatcgg ccgagcacga 1560 ggcatttact tccatgaaac aagagaaagg cgagtcagcc gtatcgttcc attcgagact 1620 aatggaaaaa gttcgactgt gtgggtattc accatcggac caggagcgat tcgtacgagc 1680 ccaactgttg aaaggactgg ccaatcgaga gctggccaaa ttgtcgcgaa cgtttggata 1740 cgagactaat ttcgttgtgc aatcggccac tcgcgatgaa gcttatgata gagaaaccag 1800 acatgaggct ccagatgtgt atgccatcac acagacgaaa tcagaaccag gagcaaatcc 1860 aggaccggct tggaaacgtc cacgcacgaa cgaatcaaac cgacgagctg tacatgcccg 1920 agacaatttt cggaaagggc gacgattccg ctgttccagg tgcaacagat tggctcataa 1980 tggaggtcca tgcccagccc taggccggaa gtgccgcacc tgtgggatgg aagaccattt 2040 tgcagcagct tgtcgagaca ggaggaagac tgccggtgct ctaagaagag atgaaagaac 2100 gtccaatgat gccgaacagg tatgatgagt tcaataagga ataaatagag aaataaaaat 2160 aaattttctg aattggttta tctcaagacc atgaatgtca tttattttaa ggagcaccta 2220 tccccttttt caggaaatta acgcgattac tctcgacgat gtaacggtca actgccgggt 2280 gggtagatct acaccgatta gttttttgat tgactcggga gccgatgtta atgtcatcgg 2340 tggctcagat tggtccgttt tgtatgatca gtaccaaaag ggattggtag acttggacat 2400 catgaacacc cagcctaacc aggaactgcg agcttacgcg acctcaaacc ctatgacagt 2460 gaaatgttct ttccgtggga cagtagaaat cgttggttcg tcaaaaccag tagtaaaggc 2520 acttttcttg gtagtcagtg aaggtcggcg atccttatta ggaagaacaa ctgctggaga 2580 aatgaaacta ctagaggtag gcctagcagt taacaactgt gacacagttc agccagagat 2640 ttttccaaag atgccaggtg tgattgtaaa attcagcgtt gacaagacgg taccgccagt 2700 taaaaatgcc tactttaatg taccggcagc atatcgcgag ggagcaagac ttcgtttatg 2760 tgatatggag aagcaaggta tcattgaacg agtagctggc gcccccgggt ggattagcgg 2820 aatgtccgcg gtcccaaagg gtcaaaatga cttcaggttg gtcgtcaaca tgcggtcggc 2880 gaataaggct ataaaaaggg agtattatcg actccctctg ttggatgaaa tgaaaatcaa 2940 gctacacgga gccaagcatt tcacgaaact tgacttgagt aatgcttatt atcacttgga 3000 gttgagtgaa gagtcgcgcg acctcacaac gtttatgacc gagaacggca tgtatcgttt 3060 tacccgatta atgttcggtg taaactgtgc tccggaaatt ttccagagag aaatgtgtcg 3120 tattcttgag ggcgaagaca atataatcgt ttacatcgac gatgtactta ttttcgccaa 3180 ttctcccgag caactccgaa aaatcaccag ccgtgttttg gagatcctgc gtaaaaataa 3240 cctaacgctc aataaatcga aatgtgagtt cgagaaaact caaatcaagt tcctaggtca 3300 cgagcttgac gaggacggat ttcacgtcga taaggcaaaa ataaaggaca tagaaaactt 3360 tcgaccaccc actactggat ccgagcttcg aagctttttg ggattagcta cgttcattag 3420 cccgtacata gacaacttcg ctaacatcac acacccactc tgggcagtat cttcaagtag 3480 gacttggacc tggggcccac agcaacaaaa agcgttcgaa gaaattaaga acaaaatcac 3540 gcattgcact acaacgctgg gatatttttc agaagaagac aggaccataa tttacacgga 3600 cgcttccccc gatgctctcg gagcggtgct ggtgcagcaa caggaaggac atcctccgag 3660 aattataagt tttgcatcga aatcgttgac tccaacagaa aagaagtatg cccaaaacca 3720 gcgtgaagca ctcggagcag tttgggctat tgagcatttc tccttctttc tactcgggag 3780 acaatttacg ctgcgcaccg atgcagaagg cataaccttc atcttaaatc ggacacgcga 3840 ggattcaaaa cgagcgttaa ccagggctga tggttgggcc cttcgcctca gtccttacaa 3900 ttatgatgta gagtttgtca aaggtcggga taacatagct gacccctcat cccgattgta 3960 tgttggagtt gatgagccat tcgatgaaca gaagagtcca tgggaggtag cctgccttga 4020 ttcaaacaag ggaggtttct tgactgtaga tgaaataagg aaagccaccc taaacgaccc 4080 aatattgcaa aaagtacgtg cgtcagtaga gtcgggcgat tggccggcgg atctgaataa 4140 atttaaagcc gttacgcgtg aactgtccct acgtgatggg ctgctaatca aaaacggctg 4200 cgcagtaatc ccagagagct taagggaaag gactttggaa ctggctcacg atggccaccc 4260 tatgacagct aaattaaaaa gcatcctaag ggaacgcgta tggtggcccg gaataacttc 4320 tgatgctgaa aaatgggtac aatcttgcaa aacgtgtgct gttaatggca aacccgagaa 4380 accgacgcct atgaaacgga tatttgcacc tcaatccgta tgggaaacga tcgccctaga 4440 ttttaatggg ccatacgtca aacatggtgg aatatccata ctggtgattg tagattatcg 4500 atctcggtat ttgattgcac gcccggtcaa gtccactagc tttgaccata ctaaacgagt 4560 attggaagat gtcttcgcta gagaaggttt cccggcgacc atgcgttccg ataacggccc 4620 accgttcaat agcacggaat ataagcaata ttgtgcacag agaggaatac aaaccgtatt 4680 ttccactccc ctatttcccc agcaaaatgg gatggtagaa aattatatga aactcataaa 4740 ccgtgcgatg gcatcagcta ccgaaaacaa cactaaattc acggatgagc ttcaagcagc 4800 tgtgaatgca cacaacgcag cagcccattc cataacagga ataccgccgg aagaagttct 4860 gatgggccgg aagatcaagc gacgtttacc actcgtggat tataaaagag ccgaatataa 4920 cgacgaaact ctgaacgcga aggataagaa agcaaaacta ctcgggaagg agagagaaga 4980 tgcacgtcgc ggggctcgag aatgtcgagt taagcctggt gatacggtca ttgtcgagcg 5040 ccagaacaaa gctaaaggag aacccagatt tgataataac aagttcactg tccttgaaga 5100 agttaacggt aatctcacct tgagtagcaa tggactgaca ctcaaaagac acgtcactca 5160 aacgaagaag gttggagatt ggcgtaacga aaccgatgat gaaacgcgaa atgattctgc 5220 tgttgacgag tcagggaaag agtcccatcg accagggtgc caaaggaaac agccagcata 5280 tttgcaaaat tacgtgagga ctgtgagtga tgactgatgc aggcttgtat cacaaacagg 5340 gtcagaaaag ttaaattaax xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 5400 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 5460 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 5520 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 5580 xxxxxxxxxx xxxxxxttaa atgtttgaac cggtacgatt ttttggaaac gattattgcc 5640 cctattcttg aatagacaaa aacaccgaaa cgactgcaac actagaagcc ccaccttaca 5700 cacctgaaaa gaaaagaaag aagaaattta gcaacaaact tacattttca cttacctgtc 5760 gatttgagtg agaaactcga gactcagctc atgatcaatg ctttgatcga agccgttcct 5820 cattcaacta agggattttt ttatcaccag gaacattgat tggaacggga aaatagtttt 5880 aatgtccact ttctttgaat agtttcaact tctgtttcaa cagttgcaac agcaggacgg 5940 tgaaaatttt tgacgttttg accaggtaaa caaaacaaac ggagagctaa taggcctgtt 6000 catggaagaa aagcaaatcc tgcttagagc caatagacct gttcatgatc atttgagatg 6060 aaaggcctgt tcatggttta gttttttttg gactacacgg aaaagtattt tctactaaga 6120 atataagttt aaaacaaccg aagtttatgt aaattgggta aaatcgaatt tgggtaaacg 6180 catcgatttg agggtcagct tacgtgaaga aaagctaatt ttaagctggg ggttttaaat 6240 gcatagctca aaagacctgt ccgtaagatt tacgttctga aagtttcaaa ttacaaaaaa 6300 aaattaggaa caaggaggga ga 6322 // ID BEL-133_AA-LTR repbase; DNA; INV; 378 BP. XX AC AAGE02019812; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-133_AA_; KW BEL-133_AA-I; BEL-133_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-378 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019812; Positions 42958 43335. XX SQ Sequence 378 BP; 144 A; 67 C; 67 G; 100 T; 0 other; tgttctgaac gacgagaaat cagcaccccg gtgcgacagg gtcacacgat ccacacgcag 60 gagaaatccg tagtgtatga tatcgacagc acacatgtat gcgcaccaat ccgtcatgca 120 taataaatag aattaaattt gtttctacct agtacatgca gacacaagaa gtaaaatttg 180 cttacggcta gaacacaaaa cattgaattg cttaaatata tccttaaatt agtaaaagtt 240 gaacttaaaa ctagtgtaaa attgttggac agtggacaag tttgctatta aggactagaa 300 atgtaagtaa aactgatttt ccatgcaaaa taagataata attcatctct aataaattgc 360 agctaaaaag ctgattca 378 // ID BEL3-I_Dpse repbase; DNA; INV; 5868 BP. XX AC Unknown_group_180; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 16-JUN-2009 (Rel. 14.05, Last updated, Version 2) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL3_Dpse; KW BEL3-LTR_Dpse; BEL3-I_Dpse. XX NM BEL3-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5868 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1016-1016 (2009). XX DR [1] (Consensus) XX CC Positions [4929-5510] - Integrase core CC LTRs are 95% similar to each other. The original virus is a CC duplicate or fusion of highly similar viruses. The duplicate copy CC is eliminated in RU. XX FH Key Location/Qualifiers FT CDS 708..3731 FT /product="BEL3-I_Dpse_1p" FT /translation="MAPRKGKDAEVLTSRTALMNSFVRNKVYIEKNSDSMM FT AAELEARLSLIETNFNQFCNIQAKIEHDSEDASEYESRYEAEEVYCELKAK FT LLSCLGHRGRRQSNVGDLLNSTQVSRSSRLPKLKLPEFAGKFTEWPSWYNT FT FATLIESDSELDELSKFIHLRSALGAGPLSAIEGLELTGPNYRKALRLLKD FT RYENKAIILQLHVQELFNLRRLKRPDSEGLRVLVDNVNAQLVALKSLSDDK FT EILDAVIFHLIRTKLDEDTMDRWEVEWDCKKLPSWSLLSQFLINRGVNMAN FT REVRRGNGARPSGKGEMKNATLAATIGSNGCYVCPGTHGLKGCQRFNELSP FT VQRYHEAKKHSLGLICFSKRHTTRNCNGPRCRICSKPHHELLHREIPIVAK FT EPGLKGQKLQEPKSLHSSLLSRSSTNFLATAVVQIKAIDGGYRNCRCVLDS FT GSQLNFLTARIVSALGLEMRDSVIQLSGIGNATSQVMGEVQATFKSRVTRY FT MATEDFCVLKEITQYNPTSSGICSEWVPPKGVTLADPTFQSGEDIDMLVGV FT TLFFKLIVAGQIKLGAHLPRLQKSLLGWVVVGEFTKSSEVALLATDIARKG FT SKNDQLQELVQRFWKCEEVPETVDKFTEEEKQCEDCFVTHVKQEASGHLVV FT ALPFKNPKVELGETYQIALTRFLNLEKRLNRSQALKEQYVQFIEEYRQLGH FT LTAIEPAGQEKVEYFMPHHPVIRPESLTTKLRVVFDASCRSSNGRPLNEAN FT LVGPTLQPDLFETLTRFRFYRYALTADIKKMYRQILVAEPHRNYQCILWRC FT NVSDPIQVLRLNTVTYGTNSAPFLAVRCLYYLVDRYAEKFPLAREAVKRSF FT YMDDMLCGAESKEELTKLKEQVTELLGLGKFELHKWRSNYQGMDNNTSVEP FT LMLKTEDAAKTLGIYWSSLEDKFQFCFNIKVSAIATKRSVLSELAQVFDPL FT GFLSPLLILGKIFVQELWLLKQDWDEELPASYATQWLRYREELKLIDKISI FT P*" FT CDS 4473..5297 FT /product="BEL3-I_Dpse_2p" FT /translation="MEGGLRYIVYNVQQLSLHEEFIKLKEGKVIKSARIQS FT LSTFLQEEEGIAIIRVGGKLSNAELPFDTRHPILMPSNHKVVEALVELTHR FT KNLHAGAQSLCAFLRQRYWVINCRKLARRVIRGCIPCFRRRPLAATQLMGA FT LPANRVRGNIYPFERAGLDFAGPIWMHFHMRGKRPVKVYLCIFVCFATKAC FT HIEIVSDLSSNAFIAALKRFFARRGLSSDLYCDNATNFVGHESSTELSTRQ FT VRSSLTTNAASEESGFTSSPHAPLTLEDYGRAP*" XX SQ Sequence 5868 BP; 1684 A; 1198 C; 1487 G; 1499 T; 0 other; tttggcgccc gagcagggac ctttggagaa ggttcaatta ccggatggag ttcacgactg 60 gaactcaata aatatttcac cctgagtctg gctgacggcg agtgcgacag cggcttcggt 120 atcgcattgt gagtgtgaac tgctttgagc atctaatatt tcttgcaggc atatttagtt 180 ccaattcgat gtgtgtgaat aatcttaaaa caatacaatt gtgttagtgt caagtggtcg 240 cggctttgag acttctgcgc tttaattgtt ctgtaagcgg ggtctcgtgt agaaagcaga 300 ttacaattac agttagcttg agcgcatagg tatctgcata cacacataca gcctattgcg 360 tgcactgctg tacgttgaag caaggcacat acatacttgc atatatatgc agttacaagc 420 cggcacttaa agaacagtaa ttgctatttg acattgattt aactttgtag tcgtttcttc 480 tttgcctgcc ggttcgttgg ggcgcaagta cacattttgg tacagtggcg cacagtggtt 540 aattgttgat caattattga attgggcagt gtagtgagaa tttaaagtta gccgattatt 600 gaatagagtg gctaaacttg gtaccgggtt acttactcat gccaatattg agtatttttt 660 tattgttaaa ttgtcacggg taaaaccggt gtaaatagtc agtgcaaatg gcacctagaa 720 agggtaaaga tgcggaggtg ctcacttctc gaacggcgtt aatgaattcc tttgtccgta 780 acaaggtcta tatcgaaaaa aatagcgact ccatgatggc tgcggagctg gaagccaggt 840 taagtctaat tgagacaaac tttaatcagt tctgcaacat tcaggctaaa attgaacacg 900 atagcgagga cgccagcgaa tatgaaagcc gctatgaagc tgaggaggtt tattgcgaat 960 taaaggccaa gttactaagc tgtctcggac ataggggacg gcggcagtcc aatgtaggtg 1020 atcttctaaa tagcacacag gtgtctcgtt catcgagact tcccaagcta aagctgcctg 1080 agttcgctgg aaagtttacg gaatggccca gctggtacaa cacctttgct actctaatag 1140 aatcagactc tgaattggat gagttgtcta aatttattca tttgagatca gcgttaggag 1200 ctggaccgtt aagtgcgatt gagggtttgg aattaacagg gcctaattat cgcaaggcac 1260 tacgattatt gaaagatcgc tatgaaaata aggcaattat tctgcaatta catgtacagg 1320 agctattcaa cttaaggcgg ctaaagagac ccgactcaga gggactgaga gtgctggtag 1380 acaatgtaaa tgcccaatta gtagcactca agtcgttaag cgacgataag gagattctgg 1440 atgcagtaat tttccatctg attcgcacaa agcttgatga agatacaatg gataggtggg 1500 aagttgagtg ggattgcaaa aagttaccgt cgtggtcatt gttatcgcag tttctgataa 1560 acagaggagt caatatggca aatagggaag ttcggcgagg caatggagcc cgacccagtg 1620 gtaaaggtga gatgaaaaat gcgaccttag ctgcaacaat aggcagcaat ggctgctacg 1680 tttgtcctgg gacccatgga ttgaaagggt gtcaaaggtt taacgaacta tcaccagtac 1740 aacgttacca tgaggcgaag aagcattcgc tgggcctgat ttgttttagc aagcgacata 1800 caactagaaa ctgtaatggt ccgcgttgta ggatttgcag caagccgcat catgaacttc 1860 tgcatcgaga gatacctatt gttgccaagg agccggggct caaagggcag aagttgcaag 1920 agccaaagtc actgcatagc tccctattgt ctagatcatc caccaacttt cttgcgaccg 1980 ctgtagttca aatcaaagct atagacggag gatatcgcaa ttgcaggtgc gtgttagatt 2040 ccggtagtca attaaatttt ctgacggcta ggatagtctc ggccctgggc ctagaaatgc 2100 gcgactcagt catacaactc agtggtattg gaaatgcaac atctcaagtt atgggagagg 2160 tgcaagcgac ttttaaatct agagtcacca gatatatggc taccgaggac ttctgtgtgc 2220 tgaaggaaat aactcagtat aacccaacaa gctcaggaat ttgttcggag tgggttccac 2280 ccaagggagt tacgctggcc gaccctacat ttcagtccgg ggaagatatt gacatgctag 2340 ttggagtcac attgtttttc aagctaattg tagcagggca aattaaacta ggagcacacc 2400 ttccaaggct tcaaaagtct ctgttaggct gggtcgtcgt aggtgaattc acgaaatctt 2460 ctgaagttgc actgttagca actgacattg cacgaaaggg gtcgaaaaat gatcagctgc 2520 aagagctcgt acaaaggttt tggaagtgtg aagaagtccc tgaaacagta gataagttca 2580 ctgaggaaga aaaacagtgc gaagattgtt ttgtgacgca tgttaagcaa gaggcctcag 2640 gacatttagt ggtagctctg ccctttaaga atcctaaagt tgagttaggc gaaacttatc 2700 aaatcgcact taccaggttc ttgaatttag agaagaggtt aaatcgtagc caagcgctga 2760 aggaacaata tgttcaattt attgaagaat atcgccagtt agggcattta acggcaatag 2820 agcccgccgg acaagaaaag gtcgaatact tcatgccaca tcatccggtt attcgcccag 2880 agagtttgac gaccaagtta cgagtcgtat ttgatgcctc atgtagatct agcaacggca 2940 ggccactgaa cgaagcaaac ttggttggtc caacgttgca gcctgatctg ttcgagactt 3000 tgaccaggtt ccgtttctat agatacgccc ttacggccga cataaagaaa atgtaccgtc 3060 aaatattggt tgcggaacca catagaaact atcaatgcat tctatggcga tgcaacgttt 3120 cggatcctat ccaggtgcta cgcctaaaca cggtcactta tggaacaaat tcggcgccgt 3180 tcctggcagt acgttgcctc tattacttgg tcgataggta cgcggaaaaa tttccgttag 3240 ctagagaggc agtaaagcga agtttctata tggatgatat gctgtgtgga gccgaatcaa 3300 aagaggaact gacaaagttg aaagaacaag tgaccgaatt attagggcta ggaaagttcg 3360 agctgcacaa atggcgcagc aattatcaag gaatggacaa caacaccagc gtagagcccc 3420 tgatgctaaa aacagaggac gcagctaaaa ctttggggat atattggtca agtctagagg 3480 acaaattcca attttgcttt aacataaagg tctccgcaat cgccacgaag cgatcagtgc 3540 tgtccgaatt ggctcaggtg tttgatccgt tagggttctt gagtcccttg ctgatcttgg 3600 gcaagatctt tgttcaggaa ctttggcttc taaagcagga ttgggacgag gagttaccag 3660 caagctacgc aactcaatgg cttcgatata gagaagaact gaaactaatt gacaaaattt 3720 caattccgtg atcctcaatt ccaattacgg ccgacccatg cactttgcaa ctgtttggtt 3780 tctctgatgc atccaacaga gcttacggcg gcgtaattta tgctcgtgtt agggatgaag 3840 ggggaggaat ttcggtgcga ctggtaacgg ccaagtccaa agtagcccca gttagggtaa 3900 catcattgcc acgcttacag ttacaagggg cattgttagt ggccaagcta atggcaaagg 3960 tatgagcctg tcttcacata gcagtagaga aggttaacta tttcacggac tccacaattg 4020 tgttaaattg gctgtcgtca catgctagtc gatggaccac cttcgttgcc aatagggtgg 4080 ctcaaataca agagctcaca catgtacagg attggttcaa ggtcgacacc aaatcaaacc 4140 ctgcagatat tgtatctagg ggattattgg taaccgaatt gagcaagtca acgatgtggt 4200 ggaacggtcc agaattttta atggacgcag cagatgactg gaccaaattt aattggagtg 4260 aggaaataca aacagttcca gaggagagga aatgtaagtt ctcgctgaca gttggaaagg 4320 tagaaagtaa tgagtagtta gatgtcgtgg ggagatgcaa atttgcaaat gatttcctca 4380 aattgcagcg agtttttagc tatatttttc gctggcgtag aatatcgttg agatcaaact 4440 caaatgaaac aggcaacctt actgcagctg aaatggaggg cggattgcgc tatattgtgt 4500 ataatgtgca gcagttgagc ttgcatgagg aattcattaa gcttaaagag ggaaaggtga 4560 tcaaatcagc gagaattcag agtttaagca cgtttttaca agaagaagag ggaatagcta 4620 ttattcgagt cggcggaaag ttgtccaatg ccgaattgcc atttgacaca aggcatccca 4680 tcttaatgcc gagcaaccat aaagtggtag aggcgcttgt tgaactcaca catcgaaaga 4740 atttacatgc cggggcacaa tcgctatgtg cctttttgcg acagagatat tgggtaatca 4800 attgcaggaa acttgcccgc agggtgatcc gtggttgcat accatgcttt cgtcgtcggc 4860 cgctcgctgc aacacagctg atgggagcgc tgccagctaa ccgagtgcga ggcaatatct 4920 accccttcga gcgcgctgga ctggactttg ctggaccaat atggatgcac tttcacatgc 4980 gaggaaaacg cccagtaaag gtgtatctgt gcatatttgt atgctttgca acaaaggcct 5040 gccatataga gatcgtgtcg gacttaagct caaatgcgtt catcgcggcg ttaaaacggt 5100 tctttgcaag acgtggattg agttctgacc tatattgcga caacgccacc aattttgtag 5160 gtcacgagag ctcaacagag ctttcgactc gccaagtcag aagctcattg acgacgaatg 5220 cagccagcga ggagtcaggt ttcacttcat ccccccacgc tcccctcact ttggaggatt 5280 atgggagagc gccgtgaagg tggccaaaca gctgcttgtg aaatgcacca acagcacagc 5340 actaaactac gaagatcttg tgactgccat cacacaagta gaagcggtga tgaattctag 5400 acctctgcat cccttgtcat cggacccgaa tgacttcgag gcgctaactc ccggtcattt 5460 cttggttggc cgacccctga acgcgttagt agaactagta gacgacgcac tgttgaaact 5520 gtccatgagc aaccattgga aacgcattct aatggtacac cacatgttct ggcaccgctg 5580 gtcggcggaa tatttgacgt tgctgcagaa gcggacaaag tggagcactg tagccaacaa 5640 cattcaactt ggtacactgg tccttattgc ggaagacaac gctccgcccg gacaatggcc 5700 gctgggaaga gttgcagagt tacacccagg agctgatggt gcagttcggg ttgtaacact 5760 caggaccaag actggattat ttaagcgcaa cgtacataag ctctgcccgc tgcctacaga 5820 tgccgatgaa cttaatgttg gaaggagctt ccaaggtggg gagaatgt 5868 // ID Gypsy-31_DPu-I repbase; DNA; INV; 5325 BP. XX AC scaffold_55; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_DP_; KW Gypsy-31_DPu-LTR; Gypsy-31_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5325 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_55; Positions 727870 733194. XX CC Positions [4180-4512] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1890..4274 FT /product="Gypsy-31_DPu-I_1p" FT /translation="MAGSITIQSIIPDDMVELGVVPASSNSSIVIRVLPDS FT GASIDAIPAAMYRSHFNHVQLIEGGPNAVTATGAAISSLGHFPATLSWSSG FT SSLPVATTIHVLRDLQQPILSKASQKQFGMLPAQYPHSCLLAAVMPPPLPA FT SAPSDDAMATLGVLMAECLSIFDGVCRPMRGPPCHFQLKDGAVPSAIRGSR FT PVAVLLMPRVKQELDSLEDQGIISKVPEPTLWVHPIVIVPKDDGGIRICGD FT LTSLNHCIIRPTFDSPTPFQAVRTIPTGMKFFTVIDALKGYHQVELDEESS FT AMTTFSTPFGRYKYNRLPFGVSLEGDDYGRRLANIFDDFPNCRRVVEDILV FT FSATWEEHVWLFRLAADHQIAINVPKVVFAQPSVLFGGYVVGESGFRPSPN FT LLRAIREFPAPTCVSEMRAFHGLCQQVGNFSDDLAPVLRHLAPLLRKDFVW FT EWTMQHEKDFKASRESLSSSSELSFYDPASPTSLHVDASRLRGFGFILRQQ FT KADGSWNVVQPGSRFLSDAESRYAMIELECLTAAWAMRKCRQFLEGLHTFQ FT LVTDHRPLIPILNDYHLDKLDNPRILRLRLSMQRYSYVAIWVPGKNNLMAD FT ALSRSPVDHPSASDEIAEGPQSFSARISLLDTMEGSNDANPDILLSSVAAA FT TSADPVMLSLRQTVLEGFPNEKCNLPLELRPFWQVRSQLSVDDAEGLILVG FT PRVVIPTSLRQEIISRLLQMHQGATKLRQRARLSFYWPSMDNDIVVAAKSC FT PSCTERLPSHPAEPLLPHAPASRPFEFVFVDLGAYRGRDFFYCG" XX SQ Sequence 5325 BP; 980 A; 1613 C; 1262 G; 1470 T; 0 other; ttctatggcg cagttgattt tgttttcgag cctattcgtt ctccctgctc tcgtcactcc 60 atccctcagt gttttcgttc agttttattt tgttttgtgt tgatttcccc ggcccttggg 120 tgcgcccatt gctgctaacc acccccctcc cagcaaccag cgtggtttgt gggggccgcc 180 atcttgtttc ctttgtctcg tgtgctgcat attgtgttct tttgtggtcc tgcgagttat 240 ccatgccgtc aaacatagtt ggaatattgc acacattcac ccactgtttt tcatatttct 300 ttttttgttg ttgtgtgatt cattgtcgtg tctatattga ggggcccaca tgttgcgtgc 360 ccgccatctt ttctctggtt tcgattctct cttggccatc tctcttgcta ggaatccgtt 420 ttttttgtgt gttttctttg ttggccgtat ttattccgcg tcgttacgag ttccacgcgg 480 atttcatccc gcggtttagt ttctcccacc attcaacccc ataccgtcat atattcggtt 540 tttttgtgtg ttgaccgtca ttattccgcg tccacccgtt ccacgcggat ttcatcccgc 600 ggtttagttc tcccaccttt cagcccccta tcatcataca tttccacatc accgttcatt 660 ctgttttata agtgtgctgt cttcttgttt cacgccctcg tttcgaccac tcattgccat 720 ttcaacctca cacgctctct ccattcaatt catttttgtg tgtgtgtgtt tcccgctcct 780 cggttcatta acgttcatct gcgtgtttca atttcccatt gccgtttcaa cctcacacgc 840 tctctccatt caattcattt tttttgggtt tcctgctcct cggttcatta acgttcatct 900 gcgtgtttca aatttccccc attcgttccg ctgtcgtgtt catcctacgc cggtgccacc 960 cttcgtgcag tgttcagtgg ttccctcgtg tcgcccaggt cttgggttaa tccgtgcccc 1020 gccatgcctc ccaagccatt caagccgtac ggcgtgccac ctccactaga cattaaggaa 1080 ttcaaagact ccttcgagat ctggcaccag cagtggaata tattccttgc tctgtcaacc 1140 atcaacactg cgctgccgca aggggatcga ccggaatata ttgccaacat cttactatcc 1200 tgcctctcca atgccacgct gaaagccgtg ctcaccatgg gcttgacggc caccgagttg 1260 aaagatgccg acgtcatcat tggaaagctg cgggagaggt tcaatgcggg ttggaattgc 1320 catgtatggc gccagaaatt ctcatcccgt gtccagcgtg acactgaatc gtccgattcc 1380 tggttcagcg atttgcgtga tctcgcgcgc aaatgtgagt tcgagaaaga ttgctgtgcc 1440 gcctgccaga acacgcgcat tttgggccaa gttgtctttg gcgttttcga cgacgaggta 1500 cgccgtaagt tactggagcg gggcgccaat ttaacccttg atcgggcgct aacaacactc 1560 cgtacagcgg aggccacccg gcttcaggcg tccaccatct aacaaggtgg tgcagccccc 1620 gtccaccagc taaagactcc ggcggccaaa ccacctatgg gcaagccggc cgttcaacac 1680 cgggatcagc agcgcggacg tccagcagct cgttggcatc cgcctggaac taagccgtac 1740 gggtgttgga attgcgggtc agcctctcgt cacgctaaag aagattggat tgcaagattg 1800 ccttcggcaa ggaatgtctt ggttgccaca aaacgggcca cttccaggcc gtatgtaccc 1860 aaggcggcag ttcaaccccg aaacctgaga tggctggcag catcacgatc cagtccatca 1920 tccctgatga catggtagag cttggtgtcg tcccggcttc cagcaactct tctattgtca 1980 tccgtgtgct ccctgactct ggggcgtcga ttgatgcgat cccggccgcc atgtatagga 2040 gccacttcaa ccacgttcaa ctaattgagg gcggaccaaa cgcagtcacg gccaccgggg 2100 ctgctatttc ttcgcttggt cacttcccgg ccacgttgtc atggtccagt ggatcatccc 2160 tccccgtggc caccaccatc catgtcctac gtgaccttca acagcctata ctctccaagg 2220 cgtctcaaaa acagttcggc atgctaccgg ctcagtatcc gcattcatgt ttgttagctg 2280 cggttatgcc tccgccgttg ccagcttcag ccccaagtga tgacgccatg gctacattgg 2340 gcgttcttat ggcggaatgc ctttcgattt tcgacggagt gtgccggccc atgcgtggtc 2400 ctccgtgcca ctttcaacta aaggatggcg ctgtcccttc tgccatccgc gggtcccgtc 2460 ccgttgcagt gcttctcatg ccgagggtta aacaggagct ggactcgctg gaagaccagg 2520 gcatcattag caaggtgcct gagccgacgt tgtgggtcca cccaatcgtt attgtcccaa 2580 aggacgatgg cggaatccgc atttgcggcg acttaacgtc gctcaaccac tgcatcatcc 2640 ggccaacctt tgattccccg actccgttcc aggcggtccg gaccattcca acggggatga 2700 agtttttcac ggtcattgat gccctgaagg gctatcacca ggtggagctg gatgaggagt 2760 caagtgccat gacgacgttc tccacaccgt ttggccgtta caagtacaac cgtctcccgt 2820 ttggcgtttc gttagaaggc gatgactacg gtaggcgtct cgcaaacatt ttcgacgatt 2880 tcccgaactg tcgccgcgta gtcgaggata tcctggtttt ctccgccact tgggaagaac 2940 atgtttggct tttccgcctc gccgctgatc accagatcgc cattaacgtt cccaaggttg 3000 tctttgctca accttcagtg ctgttcggtg gctacgtggt aggtgagagc gggttccgcc 3060 ccagcccaaa tttactgagg gccatccgtg aattcccggc acccacttgt gtttcagaga 3120 tgcgagcctt ccacgggctc tgccagcagg tgggaaactt ttcggacgac ttggcccctg 3180 ttcttcgcca ccttgctcct ctgcttcgga aagattttgt gtgggagtgg acaatgcaac 3240 acgagaagga tttcaaagca tcccgtgagt ccctatcttc atcttccgaa ttatcgtttt 3300 acgacccggc ttccccaacc tcacttcacg tcgatgcatc ccgcctccgg ggcttcggtt 3360 tcatcctccg acaacaaaaa gccgatggga gctggaacgt cgtgcagccg ggatcccgtt 3420 tcctatctga tgccgagtcc aggtacgcga tgatcgagct ggaatgctta accgcggctt 3480 gggcaatgcg caaatgccgc cagtttctcg aaggcctaca caccttccag ctcgttaccg 3540 atcatcgccc tttaattccg atcctcaacg attatcatct ggataagctg gataatcccc 3600 gaattttacg ccttcgcctt tccatgcagc ggtactccta cgtcgctatt tgggtgccgg 3660 ggaagaacaa tctgatggct gatgccctgt cgcgctcccc ggttgatcat ccttccgctt 3720 ccgacgagat cgcggagggc ccgcaatcgt tttcagcccg catcagttta ttggatacca 3780 tggagggttc caacgacgcc aatcccgaca ttttgctgtc ctccgtggca gcggctacct 3840 cggccgatcc ggttatgctg tccctccgcc aaacagtcct tgagggattc ccaaacgaga 3900 agtgtaatct tccccttgag cttcgtccat tctggcaggt gcggagccag ctgtccgtcg 3960 acgacgcgga gggtctcatc ctggttggcc cccgagtagt cattccgact tccttgcgtc 4020 aggagatcat cagtcgctta cttcaaatgc accaaggagc cactaaactc cgtcagcgtg 4080 cccgtctctc cttctactgg ccctcaatgg acaacgatat cgtcgtggcc gcaaagtcct 4140 gtccatcttg cacagagcgt ctcccatctc atccggcgga gcctcttctt ccccatgctc 4200 cggcttccag accgttcgag tttgtattcg tcgatctcgg cgcataccgc ggtcgtgatt 4260 ttttttattg tggctgatca gttcagcggt tggcctcaag ttttcccgtt tccagacacc 4320 aacacgtcaa cgcgacgtgt catcgacgcc ctccgttcat ttttcacatg cggtgccgga 4380 gccccggtga aactgtggtc ggatggaggc ccgcagttca agtccgacga atttctcgca 4440 ttccttcgcg actgggatat tagtaacggc cgttcatcgc cccaccatcc ccagtccaat 4500 ggctacgtgg aataagccgt caaaccgatg aagaagctca tcgcgggctc ttggtcgtcg 4560 gggtcgttcg acccggacaa gtttggcaag gctttgcttc tcttccgcaa cgcgcctatg 4620 tcgggtggag cttcaccatc acaaattgtt ttcagccgcc ctacgcgcga tctcctcccc 4680 gcgcacaggc gttcattcgc gccggaatgg cagcgaaacg cggagttatt ggagaagcgt 4740 gcgcaacatg caaaggagct tcaaattcag cacttcaacc gttccgcgca cccccttcct 4800 cccctggtga ttggcgacag tgtgatcatc caagaccata aaacaaacgt tggtccacta 4860 ctgggatcgt cgtcgaggtg gggccattcc gggattacct cgtaaaaacg ccagctggtc 4920 ggttattccg ccgcaatcgg cgtgttttct tcgtgtcatc tccccccgga ttccgcagcc 4980 gcaacctcca gcaagcgctt caactacgac tacccccttc cagcggcttt gtctggcccc 5040 tccgacagtg ccttcaccca ctctccgtcg ttcgcgtcgc ctggccgcca agatcgcttc 5100 ttaaattaat tatttgtctg tttgtttttc tgtctctctc gttcgttctc ctttgagtcc 5160 atcttctctt acgtcggctt tgagacccgt ttttgtttct taattcagca atttcccctt 5220 atctatctgc tccccgtgtt tcgtccctgg gagtgttttc tcgccgtcgg tgttccacat 5280 tacaattcag aaattctttt ccgtttatgt acggggaaaa agaca 5325 // ID NAVIRTE1 repbase; DNA; INV; 5663 BP. XX AC . XX DT 07-NOV-2007 (Rel. 12.11, Created) DT 07-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE RTE-type element: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; NAVIRTE1. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5663 RA Jurka J.; RT "NAVIRTE1: RTE-type non-LTR retrotransposon."; RL Repbase Reports 7(11), 1174-1174 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 2727..4166 FT /product="NAVIRTE1_1p" FT /translation="MRAAFDKVNRREMWEMMEKLGLGVRIRNRVKELYRGT FT RSEIRIGEDKIGGFELHKGVRQGCPLSPTLFNVAMADTEKELSKVQEGGIV FT LGKKKFWSISYADDVVMVANNATGLKQMLKKFKKIIERKGLELNTEKSKIM FT VFKNGGRRKKEDSFFWEKKEIEVVKNFEYLGYMLKENGKEEAQIKKLKEKA FT STVMSSMWGLGEELFRDNWELRMRLFDTMVKSVMEYGAEVWGWKGKEELEK FT IQRKYMKWVMKLDRTTPEHVLHLETKRYKLETRTRRRAIKYEVKLSKAKEG FT TLWRECWRTLEKEEEKERRRGXRRXNREEELNKRGWALSEYHRRLRAGEQV FT WSELERIEKDIQWQKWESEMRDSRQARDVKELIKPQGETPEYLRRTGNTGK FT KGLAMTARFRMGNEARGYKYWMKEEERICRLCREEEETFEYIFSRCNRTGS FT RENNWKNILNGERKNLAKLYRVIWVRKEKEKEEEX" XX SQ Sequence 5663 BP; 2530 A; 559 C; 1739 G; 800 T; 35 other; agaaaagtaa gaaggaaaga actggagaag aggcagaaga aagaaacgca tttgcaaaag 60 gagggaatgt gaaaagatca ccaattgaga aagagaagat ggaagaggag gcaatcgggt 120 cgttaaagtg cactatgaat ataatcctac aaaaagtgga gaagatgagt atgaaaatgg 180 ataaggaata ggtagaaagg gataagatga tcgagaactg ggaacagaaa tatattggac 240 tggaggaaaa aatgaacaga aaaatagagg aagggctaaa aggtatggaa ggaagactgt 300 gagaagaagg agaataaagg aaaagaatta gaagagaatg caaggaggcg agagaaaagg 360 aggaaagaga gatggaaaga aagaatggga atgagaatgc gaaagaaaag aagaaaaaaa 420 gaaattgagg agatattagg gaagatagaa agtctggaaa aaggggaaga gatagacaag 480 ggtagagaga gtcaaccagg ggagagagag gataagggaa aggtaacgga aaaggagagg 540 gacagaaagc agatagaaga aatggaatgg aggatcgagg aaggagagag gataggaaaa 600 agaataatat tataatagca gggtggaaag aagaaaaatg ggatagaaag acggtagaag 660 actggatgaa aaaactcgaa gtataggtta cgttgaggaa aacggtcatt gaacgggaag 720 atctaaagaa taggggcgga atgtagaaac tgggaagaaa aaggagatat catgagagag 780 aaggctaggt tacaagggtc tgatatattt atagacaacg atctaacatg gaaagagaga 840 agaaataaag aaaaactgag agaatacgcg acaaaggaaa gaaaagaagg aaaagaggtc 900 aagataggat acaacaaact atggataaaa ggcgtagagt atttatggaa ggaaagagaa 960 caggaatttt ttcggagaaa gaaaaggaca ggagaggaag aaaagagggg agcaaaggaa 1020 cttaaggata attaattgga atgtagcggg aataaagtga gtagatgaga ggacctgggg 1080 ctatctaaaa gggtttgatg taatttgtct gcaggaaaca tggctggaag ataaagaagg 1140 aaatttctgg gaaaagaaat taggaggtta tgaagtcaga attagaaatg caaagaaagg 1200 gggagaaaga gggagagtga aagggggaat agtgatggcg gtaagacagg gaacgaaaac 1260 agagagaata gagtggatag aagaaaagac aaacgaattc ataggagcaa gaatattagg 1320 aaggagagat atatggtgga taggtacggc atacatgaga gaggaaaaac aagaaaactt 1380 taaggagata gaagggatga cggaaaacgc cagaggtgag aaattgatat ggtgcggaga 1440 catgaatgca agaacgggat aggaaggagg aggagtcgac gaggaagaaa acgaagagaa 1500 cagggaatcg aaagatgaaa agataaacag agaaggggaa gaacttctgg aaaaaacaag 1560 agaaatggga cacagtatat tgaatggaaa taccgaagga gacaaaacag gagaatacac 1620 gtacatagga ggagcaggat gctcagtgat agattatgtg ataacaaatg agcaaggaaa 1680 aagagaggta caacgcaata gagatagagc tgggaaaaac actcgggaaa acagatgaag 1740 agaaagacaa aatcacagaa agagctatat ggactgaaaa agcaataggt caattccgag 1800 aagaactaaa taaaggggaa gaggaaacag aatggacaga actaaaaaga aaactagaaa 1860 aagcaatcag aaggaaaaag ggaagcggca gaaaaagaaa gaaaaacacg tggtgggatg 1920 aaaaatgcag actcagaaag cagaggtaag gaagttaggc aaggcgatta agaagggagc 1980 gggaaggaaa gaatacgtca aagcaaagag agactgggga atattagtag aaagaaaaaa 2040 gcaggaagaa atggaaaaaa gaataaaaga ggctggagat agtaaaaccg ggagaaaatt 2100 ctgggaggtg gtaaaaagta ggagaagaaa aaagaggaca agatgcagca gtaaaataaa 2160 gaaagaggaa tggttaacty acttcaaagg gcaactagga gaagagacag aggaagattt 2220 gggaacaagc ayacaaggag aggatgaaga aggagakaga gaggataaag aagraagaat 2280 aaccgaaaac gaggtgagga aggcaattag aaaaatgaag aagggaaaag cgccaggagc 2340 agacggacta caaaatgagg tatggataca cggggaggaa caggtaatag gagaaataac 2400 taaaatactg aataacatat ggaaaggggg aaaagtacca gaggaatgga aaacgggact 2460 aataaccccg cttttcaaga aagggaagaa ggaggaagca aagaactaca gagggataac 2520 tctcatggac acagggtata aattatacgc agaaataata agggaaaaga tggaaaggga 2580 gttagaggga agagagatgc ttgacgatac acaaatgggt ttcagaaaag ggagaaggta 2640 cggcagacgc aatttatatt ttgagtaaag caatagaaac tgaactggat aagaagggag 2700 gtaaagtcta cgcgtttttc gcggatatga gagcggcktt tgacaaggtg aatagaagag 2760 agatgtggga aatgatggag aaactaggrc taggagtcag aataaggaat agagtcaaag 2820 agttatatag agggacaagg agtgaaatca ggataggrga ggataagata ggagggtttg 2880 agctacacaa aggggttagg caagggtgcc cgctaagccc gacacttttc aatgtagcaa 2940 tggcagacac agaaaaagaa ttgagtaagg tacaggaagg agggatagtt ctaggaaaaa 3000 agaaattctg gtcaatttca tacgcggacg acgtagtgat ggtggcgaac aacgcgacag 3060 ggctaaagca gatgctaaag aaatttaaga agataataga aaggaaaggg ctagaattga 3120 acacggaaaa atcaaagata atggtattta aaaatggggg aaggagaaaa aaggaggaya 3180 gttttttctg ggaaaagaag gagatagagg tagtaaaaaa tttcgaatac ttaggataca 3240 tgctaaagga aaatgggaaa gaagaggcac aaattaagaa gttaaaggaa aaggcaagca 3300 cggtaatgag ctcgatgtgg ggattaggag aagaactctt cagagataac tgggaactaa 3360 ggatgaggtt atttgacacg atggtgaaaa gtgtaatgga atatggtgca gaggtgtggg 3420 gatggaaagg taaggaagaa ctggagaaga tacagagaaa atatatgaaa tgggtaatga 3480 aactagacag aaccacgccg gagcacgtac tacatttgga gacaaagagg tacaagctcg 3540 aaacaaggac gaggagaaga gcaataaaat atgaggtaaa actaagcaag gcaaaggaag 3600 gcaccctgtg gagggaatgc tggagaacac tagaaaaaga agaggaaaaa gaaaggagaa 3660 gaggaargag gcgamgtaat agagaagagg agctaaacaa aagaggctgg gcactatcag 3720 aatatcacag aagactgaga gcaggggagc aggtgtggtc ggagctggaa agaatagaga 3780 aggacataca atggcagaaa tgggaaagcg aaatgagaga ttcaagacag gcgagagacg 3840 taaaagaact aataaaacca cagggygaga caccagaata tctgagaaga acaggaaaca 3900 cagggaaaaa agggctggct atgacagcaa gatttagaat gggcaacgaa gcaagaggct 3960 acaaatattg gatgaaagag gaagaaagaa tatgcaggtt atgcagagaa gaagaagaga 4020 ctttcgagta catattttcg agatgcaaca gaacaggtag cagagagaac aactggaaaa 4080 acattttaaa cggggaaagg aagaatctgg cgaagctata cagagtaata tgggtaagga 4140 aggaaaagga gaaggaagaa gaaakttaaa agaaaatata atacaaatag agaaaagagc 4200 aaacgaaaga acgwaaagag caatcgaaag gaaacnagag caatagagaa gaggaaagga 4260 aaaaacaaay aaggaaagaw agaaagaacc agacgaagga ackaaataga gcgcagggac 4320 atttagagag caggggagac gacggaacag gaggaacagt ggatgaaaca aaagtcacta 4380 ggwawggaca agaaaacgat acaggtggar agaggaacgt ggacgagaca gaaaagaaaa 4440 cacgtggtga gccacagata gaaaagaggt tagtaaagat aaaacaagta gtgaaaggag 4500 aagataaaaa agtgaaaagg agaagatgta cggagacaag aagaaagaaa gagagatgag 4560 ggaactgaga gaggaggtrg aaggatggaa gataaaggcg aggtttgaga gagaaaaaag 4620 agaagccctg gaaaagaaaa tgcaggatgt gagagaggac gtggagtttt taaaggggaa 4680 gatgaaaaac atgcagaaag agatagatga gatgagagaa gggcttgcag gtaagaaaaa 4740 tagagtggag acggtaaaga aaaggcagga gatgagaaga gggagaagag taataggaga 4800 agagagaatt ttgagagaga gaaaggctga tgcggccagt acagatatag gaaaggggac 4860 agcaggtgca aggtgtggga aagaggcctg atagtacacc taaacaattg gacggaggag 4920 gattctataa aagactggct ggaggatata atggaaggag tggaatggga gatggaagat 4980 acaaggacgg aggacacgaa aaaggtaatt tttaagagaa argaagacat ggaattgata 5040 tgggagaaga gggcggaaat aagagaagga ggcaaaatca ggctggagca gtggctatcm 5100 ttcgatgaga gaagggccaa agcgctgatt atgagagaga aaaggagaag agaagaggat 5160 gagatcgagg agggcgaggt ctcgcacaaa aaagaatggg tagctgaaat agaaggcgaa 5220 ctgtacaagt gggacgaaaa gaaggagaga attgtaaaaa aaggaaaagg gatagaackg 5280 ggagaagagg rcagatataa gaaatggagg tcgggggagt cagaggagga agaggaagaw 5340 tatcagacaa agatggagag aaaggaagag tagagtttac tcacaaacac acataaacac 5400 acacaaacaa tcaaacacac acacacacgt acgacttata cacacgcaca cacacacaca 5460 tgcacaaacr gacacgaaaa cargaaaagm aggagarcrc acayacacay acacacatrt 5520 acacasgaac attacctaca gagacacaca taacacacca gattaggtta aagtaggatt 5580 aaagcaagag gtatgtaaat atacgaatga ataaatattg taaacacgaa tagtgaaata 5640 caataaagtt attattatta tta 5663 // ID hAT-N9_AP repbase; DNA; INV; 589 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N9_AP. XX NM hAT-N9_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-589 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2109-2109 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 589 BP; 212 A; 60 C; 82 G; 235 T; 0 other; gggctaggga cttctaggag tttgcatgtt tttaaataat cattaaaaac atagctaagt 60 aatatagaaa tttgtttcat agcaatgcga gtttatattt aaaatttata gttgcatatt 120 ttgcatattt gaccattttt catttataag tgcatatttg acaatttttt tataatatat 180 aggtgtaagt ttataattta ataaaaaaaa cctaatttta aatgtagata ttagtatgat 240 ggttttattt atagaaaata aatattttaa aattagtgtt tgttttcctc gccgatcact 300 cgtattgtta cttatcggta caccacacgc gtgattcact aaaactcgat gtaagaaaaa 360 aaaatcggaa aagaaaatta tcaattattt gttaatatac aagattttta ttataaatta 420 caatttaaaa ttttgaaaat attatctaga tattaaaata aaaaggtttt attcaatagg 480 tactatgatt ttttttttta atgcatattt taatagcata tttagtgatt ctctagggca 540 taagtgcatg catattttaa ggtttttaag ggcatgaagt ccctagccc 589 // ID L2B-2_CQ repbase; DNA; INV; 5265 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5265 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 143-143 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >84% CC identity. XX FH Key Location/Qualifiers FT CDS 322..1047 FT /product="L2B-2_CQ_1p" FT /translation="MSLCGKCEQKICFEDAVPCSGKCGAFFHRSCTTLTKA FT AAKMINEQPNVQFKCDPCLDSGVDAEIAVLLADVKGIKEELKKYSDISDSI FT PGINESIATRIDEAIKKGIEDVMKACSEIFQKXIKTVVESSVATXIEKAIK FT SNSXKCXNERIVKVTRKRGTHSESDGLPGSKRRAVDTDDXEPMXVDDEEXD FT NVFEXPTFAEILKGETQKLKMVFKIKNYVFIVQKLLLNLTKQIKVMIKQGL FT F" FT CDS 1722..4466 FT /product="L2B-2_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="QSNAGRKVNATDIIYLNIAGLTTNYVALRQLVESSQP FT LLVLLSETHITQPEAFDQYSIPGYKVVFCLSHSRHTGGVAIYFKESIHLKV FT CLNEFSENNWFLGITVEKGMKTGNYGVLYHSPSSSDQRFIQILENWLENFV FT DMSKFNLITGDFNINWLDEQNSAHLKSLTEYLGLTQKVTAFTRISRNSRTL FT IDCVFSNIDCVKAVVKSDWKITDHETLVITISEFNLESSDDRIKIKSWKNY FT TPSAFCDLLERHVNFRTLTGELDFKSRILTNTLKNCTSELVTTEYINRKDS FT NSWYNGELMRLKQKRDKLYKKFCRTNNPRHWNKYTVARNKYTKSLKTTRST FT HIQNNIERNKNNSKELWKILKKLMKSKDSLPRTILFGDIEECSDDVIASKF FT NNYFIESVVEINESIERTDEPVEIRNMMNRNCRLEIFNPITYVQLKKICFD FT LGNSAGIDNVNAKVLKDCFHVIGHTLLEIINESLLSGQVPTVWKESLVIPI FT QKVAGTKKAEEFRPINMLHTLEKILETVVKSQLLEYLNINKLLIPEQSGYR FT EGHSCETALNLVLAKWKEFFENRNSTLAVFLDLKRAFETISRPLLLQTLQH FT FGIGGAAFNWFKNYLNFRTQRTIFNEAISESRENNLGVPQGSVLGPILFIM FT YINDMKLVLKHCDINLFADDTVIFVAAKNPIDAVLHLNEDLDALSKWLKFK FT QLKLNINKTKYMIISSRHQNIQDVQISIDGEQIDRVNSIKYLGVTIDENLK FT FNIHIDNTIKKIAKKYGIMCRLRNDLNFQSKIQLYKSIISPHLDFCSSILF FT LANETQLSRLQRLQNKIMRLILKCNRFTSSTWMLDALQWLSVKQRIIFSTM FT IFVFKLINGLLPRYLCDRILRGSDLHDHFTRNADEPRTPNFLFGAAQNSLF FT YKGVNIFNSMP" XX SQ Sequence 5265 BP; 1812 A; 812 C; 1039 G; 1589 T; 13 other; tttgttggtg tcaaacagag tagttcactt gtactgtgtt gtgataaaaa caataaattt 60 tgcgtgttaa agtgcgagta gttggtgcta aaatattgtc aatgtgtgaa ctaagcagct 120 gtcaactcgg ataagtacaa gttttgaaac gaacgttact atmgagttaa ttatcatcga 180 ttttattttt caccggcaac aacggtttgc tatcggattc ggtggcggct gctgttagct 240 gtttgttgct tttttttttt gggtcttttg tgctctgcta ttgtgttgct tggctggctc 300 gaccgtgctc gacagcgcga gatgagtctg tgtggcaagt gcgagcaaaa aatttgtttt 360 gaggatgccg taccttgttc ggggaagtgt ggagcgtttt ttcatcgttc ctgcactacc 420 ctgactaaag cggcggcaaa aatgattaat gagcagccaa atgtgcagtt caagtgcgac 480 ccatgtcttg atagtggagt tgatgctgag atcgcagtgc ttctcgctga tgtcaaaggg 540 ataaaagaag agttgaagaa atattctgac atctctgatt ctatccctgg catcaatgaa 600 agtattgcta ctcgaatcga cgaagctatc aagaaaggga tagaagatgt catgaaggca 660 tgcagtgaaa tatttcagaa aaawatcaaa acggttgtag agagcagtgt ggccactsaa 720 attgaaaaag cgataaaatc gaattcwsaa aaatgtmawa atgaaagaat agtaaaagta 780 actagaaaaa gagggaccca ttccgaatcc gacgggctcc ccggtagtaa aagacgagcc 840 gttgataccg atgacttwga accgatgktg gttgacgatg aagaaaakga caacgttttt 900 gaawcgccaa cttttgcgga gattttgaaa ggggaaactc agaaattaaa aatggtcttc 960 aaaataaaaa attacgtgtt catcgtccaa aaattgttat taaacctaac gaaacaaatc 1020 aaagtaatga tcaaacaagg acttttttaa aatcgaatct tgaccctaaa ttgcataaaa 1080 ttaataattt aaaaaatggc agggatgggt ccattattgc ccaatgcgcg ccaggacaaa 1140 acctttctaa agtgawaagt gacattgaaa acagacttgg ctaaaattat actgcaatcg 1200 tctcgactgg ccttccaaag ctgaaaatcg ttgggatgag tgaaaagtat gctcctgaag 1260 atttcgttga gttattgacc acacagaatg acgaaatttt tattgaacat gtaaaggtta 1320 tatcttctta cgaaaaccct cgcatgaaat acaacaaatt tagtgcaata attgaagtta 1380 ataatgacac ttttgacgca ttgataacgg ccggaaaggt caacatagga tttgatagat 1440 gttcagtttt tgaggctatt aaggtcctga gatgttttaa gtgtggagaa tttggacaca 1500 tgagcacaac atgtacaaac agcgaaactt gttcaaagtg tagcgagtcc cataagacct 1560 ctgagtgcac ttcaacggta atgaaatgtg ttaactgctt gaaaaagaat aaggagcaaa 1620 agatgaatct tgatgtgaat catgctgcat ttagttcaaa atgtccagtt tttcgacact 1680 tagctgcaat taaaaaggat cgtatgtatg agaatgaata gcaatcaaat gctggcagga 1740 aagtaaatgc tacagatata atctatttga atattgcagg actgaccaca aactacgtag 1800 cattacgaca gttagtggaa agttcgcaac ctcttttagt attactttct gaaacgcata 1860 ttactcaacc ggaagcattt gatcaatatt ccattccagg gtataaagtt gtattttgtt 1920 tgtcacactc taggcatact ggaggagttg ctatttattt taaagaatct attcacctta 1980 aagtgtgttt aaacgagttt tctgaaaata actggttttt gggcattaca gttgaaaaag 2040 gcatgaagac gggtaattat ggagttttgt atcattcgcc aagctctagt gaccagcgat 2100 tcattcaaat tttagaaaat tggttggaaa atttcgtaga tatgagtaaa tttaatttaa 2160 taactggtga ttttaatatt aattggcttg atgagcagaa ttctgcacat ctaaagagtc 2220 ttacagaata tttaggttta acacaaaaag taactgcttt cacaagaatt tcgagaaata 2280 gtagaacatt gattgactgt gttttctcta acatcgattg tgttaaagct gtcgttaaaa 2340 gtgattggaa aataacagac cacgaaacac tggttattac aatttcagaa tttaatttgg 2400 aatcttcgga tgaccgcatt aaaattaaat cttggaaaaa ctatacacct tcagcttttt 2460 gcgatctttt ggaaagacac gtaaatttta gaacattaac tggagaactc gacttcaaat 2520 cacgaatttt gacgaatact ttgaaaaatt gcacaagtga attagtaact acggagtaca 2580 taaacaggaa agattcgaac agttggtata acggtgaact tatgagattg aaacaaaaaa 2640 gagataaact ttacaagaaa ttctgccgaa caaataatcc tagacattgg aataaataca 2700 cagttgcaag aaacaagtat acaaaaagtt taaaaacaac cagaagtact cacattcaaa 2760 acaatattga acgaaataaa aataacagta aagaactatg gaaaatattg aaaaaattaa 2820 tgaaatctaa agatagtctt ccaagaacta ttctttttgg agatattgaa gaatgttccg 2880 atgatgttat tgccagtaaa tttaataatt attttattga gagtgttgtt gaaataaatg 2940 aaagcataga gagaactgac gaacctgttg aaatcaggaa tatgatgaat agaaactgca 3000 gactagaaat cttcaatcct ataacctatg ttcaacttaa aaagatttgt tttgatttag 3060 gaaactcagc agggattgat aacgtaaatg caaaagtttt gaaagattgt tttcatgtca 3120 ttggtcacac tttgcttgaa attataaatg aatcactttt gtcgggtcaa gttccaacag 3180 tctggaaaga atctttagtt attccgattc aaaaagttgc tggtacaaaa aaagctgaag 3240 agtttcgccc tatcaacatg ctacatacgt tagaaaaaat tttagaaacg gtcgtgaaat 3300 ctcaattgct tgaatacttg aacataaata aattattaat accagaacag tcgggatatc 3360 gagaaggcca ctcttgtgaa actgcattga atttggtttt agcaaaatgg aaagaatttt 3420 ttgaaaacag aaactccact ctggcagtat ttttggactt aaaacgggca tttgagacaa 3480 tttccaggcc cttattatta cagacactcc aacattttgg tatcgggggt gcagctttca 3540 attggtttaa aaactatttg aattttagaa ctcagcgtac gatttttaat gaagctattt 3600 cagaatctag ggaaaacaac ctcggtgttc ctcagggaag tgtattaggg cccatattgt 3660 ttataatgta cattaacgat atgaaactag ttttgaaaca ttgtgacatt aatctttttg 3720 cggatgacac tgttattttt gtagcagcga aaaatccaat tgacgctgta ttgcacttga 3780 atgaagattt agatgctctt tccaaatggc tgaaattcaa acagttgaaa ttaaatataa 3840 ataaaaccaa atacatgatt atttcatcaa gacatcaaaa catacaagat gtacaaattt 3900 caattgatgg ggagcaaatt gatcgggtta attcaataaa atatttaggg gttactattg 3960 acgaaaactt gaaatttaat attcatattg ataacacaat taaaaaaatt gcaaaaaagt 4020 atggaattat gtgtcgtcta agaaatgacc ttaactttca gagcaaaatt caactttata 4080 aatctataat ctcaccacac ctagactttt gttcatcgat attatttttg gcaaatgaaa 4140 cacaattatc taggcttcaa cgcttgcaaa ataaaataat gcgtttaatt ttaaaatgta 4200 acagattcac ttcwtcgacg tggatgttag atgctctgca atggctatct gtgaagcaaa 4260 gaataatttt ttccactatg atttttgtgt ttaaattaat caatggtttg ttgcctagat 4320 atttatgtga tcgaatttta agaggtagtg atcttcacga tcattttaca agaaatgcag 4380 atgaacctcg cactccgaat ttcttatttg gagctgccca gaattccctg ttttataaag 4440 gtgtgaatat ctttaactcg atgccctgac aaatcaaaca gtcaacgaca atagcagagt 4500 tcaaaagacg gtgttgctcg cacattaaat tagtatttta agtttgactt cgtatagcta 4560 tatattgacg gaattctgaa acgaacgcac aagttaattg ctcagtatga ccacgtgatg 4620 atgatggatt tttttttgtt ttcgaaatgt tggtttaaat tagagttaaa aaaagtaaat 4680 gagaagtcta caaaggtttg agaatcgcgc gcggtatgca aatataatgt gcctcttaag 4740 ttgacgatga tgaaagtaaa tgtttctttc tatgtatgat gaaaattggt agttgctctg 4800 aagacgggag agctcggaag gctacaggga aagctagtct agtaggtttt ggcggtcgaa 4860 agatagggtc tcgaaaagta ctaagtcttc aggcaaagct cgttacgtac actctgtcag 4920 gtaaaacagt agggtgcgat ctcgggcaat gccatgtaag cttgtcactt agtagagacc 4980 cagttcagat agatacaagt aaaactatat gtgctccgaa cggtagttag taccagatgc 5040 gattccttga agctgataca gttacaggcc tatcaactag taatctacga aagtaaactg 5100 taatgacgga gttaactgaa aggctaattt atatctgcta ttaaaaatca gttcgttttt 5160 ctttgcataa ctgattcaaa tgttcaatca aaattatcat aacgataact cgtccagctc 5220 aaacctttgt aggggtaaga ggtgggacca tcatcatcat catca 5265 // ID Gypsy-14_CQ-LTR repbase; DNA; INV; 218 BP. XX AC AAWU01010358; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_CQ_; KW Gypsy-14_CQ-I; Gypsy-14_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 408-408 (2011). XX DR GenBank; AAWU01010358; Positions 11117 10900. XX SQ Sequence 218 BP; 49 A; 56 C; 60 G; 53 T; 0 other; tgtgtgggtt gatgcgtatc ttcgtccgcc agactatatg ttgagtcgac atagcattgg 60 ccagactaca acgctatctc tatgtcgacc aacggtctcc cgacgacgac gatcggatgc 120 cgaagggcat tcgtattcag acagactcac aggtaagatg tgcaacagcg tcggtgctcc 180 gtggtgggtg atttcccaag tggatttccg gccacaca 218 // ID Gypsy-20_OD-LTR repbase; DNA; INV; 112 BP. XX AC CABV01004250; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_OD_; KW Gypsy-20_OD-I; Gypsy-20_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004250; Positions 7 118. XX SQ Sequence 112 BP; 34 A; 25 C; 18 G; 35 T; 0 other; tgttcacttt tacgagaacg actcgaatct gtccaccttg cgaacctgcg aataaatcag 60 ttgtatattt ttataagcat tgtctactca agcaaaagca tccgaagttt ca 112 // ID Sola2-1_HM repbase; DNA; INV; 4423 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola2 DNA transposons from Hydra magnipapillata. XX KW Sola; DNA transposon; Transposable Element; Sola2-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4423 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1555..3807 FT /product="Sola2-1_HM_1p" FT /translation="MMRSSLNIAPGKKLCKPCKQKIAIKEDLKEKKQSQEQ FT DEDFLISSFKSKRQEINEELKNFNISPLKSHSKSSKHILSEGKRKIARINE FT MVNNVTRKTESQVNVPSKLCYETNNMKKKAKNFDEIMSLIKEKILCSDKRT FT IVQLLTLAPPSWSILEVQNNFAVTEYQAKMARQIFNKKGLLAISPLYKGKV FT LHKEIEDSVKLFYDSSDLCRTMSGKKDYVSIQKNIHKQKKLLLCNLKELYV FT LYKENNPEIQISYSKFASLRPKWCILPGANGTHSVCVCCYHQNAILLVDAL FT NIGLKYKDLLSKTVCSVENKECMLAQCDNCPGKETLTKYMYEIFGEYEDDF FT EIHYKQWQTTDRATLLSLTADVSTFVELLVSCFEKLQAHSFIAHSQSQYLN FT QLKQNMDQSNIIIIGDFAENYTFVVQDEIQSYHWNTQQCSLHPLVIYYKDD FT KGVLKHISYCFISDDITHDVTYVYKIFQLIIPILKTKFSNLSKLHLFTDGC FT AGQYKNCKSFYNLCQLESEFSLKVEWNFFATSHGKSPCDGIGGTVKRLTAQ FT ESLKRPYRNQILTSEAMYEFCIEKIKVVNFIYIKGIDLQLQREEQKERYTG FT VTTLPGTRSFHQFIHLGDNRVGAKRCSTDTNYTIIHNLKKKQDIFQLDTVT FT LGGYVAVVYDDKWWIGIVTETNLEEVDVKVKFLHPKGPSICFNWPEREDYC FT FIPYTNILKKLSVPQACSSSGRNYVFENAEIEEVQVKWEQYCELLQTQSK* FT " XX SQ Sequence 4423 BP; 1720 A; 590 C; 622 G; 1491 T; 0 other; gggtaattcc acatcaaatc acccaaaatt taggaaattt tgaaccatgg ttcctcaaat 60 tttttaaaaa cttctttggc tgttgcatct ttaataaaaa agttaaaaca taaaatttta 120 gctaaaaatg ttaagcggtt gacaagatac tgcaatttta acattgacct ggtttcccaa 180 aatgacccgg ttttcataac attctaaaaa aacatttttt ttaaaaaaaa taagaaataa 240 aaatggcaaa aatatgaaaa taaatatata taatgttttt ttgatgctga attcaataaa 300 tgtacttaaa atgctgaaat ataaaaggaa catgcttaaa agttaataaa acaactagtt 360 tctataatcc aactttgggg cccaatatct caaaacaccc ccgtcaaaat tttttttttt 420 ttttgctgtt aatattttca gctgcttata atcttactag gcacaaaata tctaactgga 480 aaaacaacta aaacatatgt aactttaatt ttaaagatat atcaaatttt caataaatta 540 gtttaaaaaa cttgcaaata gaattttgta aaatattaaa tttagttata tttaaggaag 600 tcttaattat ataataaatt tgttatttga tcttaacaac tgaaaaaaac atcctgtttt 660 gtttttaaaa tctgacaagg tgtgatattg aagtgcattt gtttttaaaa ttttgacaac 720 tatagtttgt aaaagtttat aaaaactagt ttataaaaag ttcagttata acaacattta 780 tcggtaaaat tttatttaaa ttttaaaaat ttgatatata tagattgtat atatatatat 840 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 900 atataagtat ttataaagaa aattaacaat atctatcaat gtaaaaaatt atacaacaaa 960 aggattctga aaaggtttga attatttcta tattcatttt tttttattac tcaaaattat 1020 gtagaaaaga acaatttttt cctttttttt taggaaatac attcatacta agtaacttat 1080 tataagatgt gttctattgg tttattgaca tcagatgaat gcaatggcca acaagatgtt 1140 attgaaactt taagcaatga agaaaaaata actatatcat tacgctgtaa cattgagtta 1200 tcaaacttaa cgatattatg tgaaagacat actacaaaat atttgaaatt gtatcctata 1260 tggcagaaag catgttgtga tccattcaat aaacatacaa agaaaatcac aagtaaatag 1320 atgtagaatt taaatattat attatatata tatatatata tatatatata tatatatata 1380 tatatatata tatatatata tatatatata tatatatata tatatatata tactttttta 1440 atttgtattt tctaaaacat aaacaatgtt ttgaacaatt gtgcagataa gctttataca 1500 attcaattac ttttagaaaa tctcatagta gttacaataa atgagtcaca agtcatgatg 1560 agaagtagtt tgaatattgc tcctgggaag aaactttgca aaccttgtaa gcaaaagatt 1620 gccataaaag aagatcttaa agaaaagaaa caaagccagg aacaagatga ggattttcta 1680 atatcatcat tcaaatcaaa aagacaggag atcaatgaag aactcaaaaa ttttaatata 1740 tctcctctaa aatctcattc aaagtcatca aaacatattc tctcagaagg aaagcgaaaa 1800 attgcgcgca tcaacgaaat ggtaaataac gttactagaa aaactgaaag tcaagtcaat 1860 gttccatcaa aactgtgtta tgaaacaaat aacatgaaaa agaaggctaa aaactttgat 1920 gaaattatga gcttgataaa agaaaaaatt ctatgttctg acaaaaggac aattgttcaa 1980 cttctaacac tggcaccccc aagttggtcg atattagaag tacaaaataa ctttgcagtt 2040 acagaatacc aagcaaaaat ggctagacaa atttttaata aaaaaggatt gttagctatc 2100 tctccattgt ataaaggtaa agtcttacac aaagaaatag aagattctgt taaattgttt 2160 tatgactcaa gtgatttatg tagaactatg tctggaaaaa aagattatgt tagcattcaa 2220 aagaatatcc ataaacaaaa aaaactgctt ttgtgtaatt taaaagaact atatgtatta 2280 tacaaagaaa acaatccaga aatacaaata agttactcaa aatttgcttc acttagacca 2340 aagtggtgca ttttgccagg tgcaaatggt acacattctg tttgtgtttg ttgttatcac 2400 caaaatgcca tattgctagt agatgcctta aatattggac taaagtataa ggatttactt 2460 tcgaaaaccg tttgctcagt agaaaacaaa gaatgcatgc ttgcacagtg tgataattgt 2520 cctggtaaag aaaccctcac caaatatatg tatgagattt tcggagaata cgaagatgat 2580 tttgagatac actacaagca atggcaaact actgatcgtg caacactatt gagtttaaca 2640 gcagatgttt caacatttgt tgaactatta gtatcttgct ttgaaaagct acaagctcat 2700 tcttttattg ctcactctca atcacagtac cttaaccaac taaaacaaaa tatggatcaa 2760 tcaaacatta tcattattgg cgactttgct gaaaattata cttttgtagt ccaagatgaa 2820 atacaaagct atcattggaa cacccaacaa tgttctttac atccattagt gatatactat 2880 aaagatgata aaggtgtact gaaacatatt tcttattgtt ttatatcaga cgacattaca 2940 catgatgtaa cttatgtgta caaaatattt caactaatca taccaatatt aaaaacaaaa 3000 ttttccaatc tgtccaaatt acatctattt actgatggtt gcgcaggaca gtacaaaaat 3060 tgtaaaagct tttacaatct atgccaacta gagagcgaat tttcccttaa agttgaatgg 3120 aatttttttg ccacgtcaca cggaaagtca ccatgcgatg ggattggcgg cactgtaaaa 3180 aggttaacag cacaagaaag tttaaaacga ccatacagaa accaaattct cacatctgaa 3240 gctatgtatg aattttgtat tgagaaaatc aaggttgtta acttcattta cataaagggg 3300 atagacttac aattgcaacg tgaggagcaa aaagaaaggt acactggtgt aacaactcta 3360 cctggtactc gtagctttca tcaatttata catcttggag ataacagggt tggggcaaag 3420 aggtgtagta cagacactaa ttatacaata attcacaacc ttaaaaagaa acaagatata 3480 tttcaacttg atactgtcac tttaggaggt tatgtagctg tagtctatga tgataagtgg 3540 tggatcggca ttgtaactga aactaaccta gaagaagttg atgttaaagt taagtttctt 3600 cacccaaaag gtccatcaat ctgttttaac tggcctgaaa gagaggacta ctgttttatt 3660 ccttacacca acatattgaa aaaactatct gttccacaag cttgctctag ttccggtaga 3720 aactatgtgt ttgaaaatgc tgaaatagaa gaagtacaag taaaatggga acaatattgt 3780 gaactattac aaacacagtc aaaataactt gcatcttgaa aaatcttgta tatttaaata 3840 catttttaaa ttacgattaa cttattttta ttttatttac aaacattttt tgggaatttc 3900 ttgaaaaagt aatacatctt agatattaaa gttttatatg tttcagttgt ttttccagtt 3960 agatattttg tgcctagtaa gattataagc agctgaatat attaacagca aaaaaaaaaa 4020 aaaaaatttg acgggggtgt tttgagatat tgggccccaa agttggttta tagaaactag 4080 ttgttttatt aacttttaag cttgttcctt ttatatttca gcattttaag tacatttatt 4140 gaattcagca tcaaaaaaat attatatatg tttattttta tatttttgcc atttttattt 4200 cttatttttt taagaaaaat atttttttag aatgttatgt aaaccgggtc attttgggaa 4260 accaggtcaa tattaaaatc gcagtatctt gtcaaccgct caacattttt ggctgaaatt 4320 ttgcatgcaa ttttttttta taattaggaa taatgtgtca aaatataaaa caaaaagaag 4380 acacatggtt caatggactg ggtgatttga tgtggaatta ccc 4423 // ID PiggyBac-2_HM repbase; DNA; INV; 2703 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE PiggyBac-type family: consensus. XX KW piggyBac; DNA transposon; Transposable Element; PiggyBac-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2703 RA Bao W. and Jurka J.; RT "PiggyBac families from Hydra magnipapillata."; RL Repbase Reports 9(2), 451-451 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(464..1852,1779..2453) FT /product="PiggyBac-2_HM_1p" FT /translation="MERHKFYDPELMSQDDVFNLLDSIDAGELDTDIEDNF FT LSDDDPDFVPNAHHFNMENILDNDADETNEILTAMDETNIEPGTSNSEIQL FT VESEEIEIGPSQSKKPQLEQVENDNFVNTFIGSSTEIDPKNIAFKNILWRK FT KNIQLEKKQIQFRGNEELPARFYELKTPYECFKYFLEDKLYQEIANTTNLY FT ARQMNISTKFVTTPAEIQKYVGILFYMSIYRYPNTREYWGENSFEPVRKTM FT TKNRFEEIRRYLHFNDNTKMPAQGDSNFDPIFKMRPIIEYFNIRFQSVPMS FT QRLCVDEQMCSTKMVSHIRQYMPAKPHKWGMKLFVLCDTYGFSYGFELYSG FT ASDNKIPNGAPDLGAAANVVSRLSQIIPDHMNHIVYFDNYYSTLPLMIYLY FT SRGIYSLGTVRANRIANCKLPTDKEVAKKPRGFSTEYVGSCYGVDLSTTLW FT KDNKGVLFGLNIRRSITITCRLHCGKTTKVFCLASTYVGVLPFKNESNNTL FT KASRYDRAQKKRIEIDCPNIIREYNAHMGGVDLMDGLLGRYHIRMKTVKWT FT SRFFYHVLDLAMINAYLLHKRINRQNKACNIQLPQFRKEVAAMLCRFQTDE FT PQKRSVGRPRSTNAEEDPAPKRLGKRTYLPAADIRFDGKEHFPQWLDRSGK FT RQCKLPGCKSETQCTCSKCNINLCCTAAKNCFAIFHKQ*" XX SQ Sequence 2703 BP; 938 A; 467 C; 507 G; 790 T; 1 other; ccctctagtt cccaaaaccg cctcgaggcg ggtattataa aaatccatgt aaaaattaca 60 tcgctagaaa ttttagcgta atgctcgtgt ccgcctatag acaatagcat acgaaccaag 120 ttacgtgcgc cggcgcaccc gcaaagcacg tggttttgtt tgtataatcc rtgcagtgtt 180 atttcgcagc gattactacg attagttttg tgtcagtgat tggaaaattg ttaaaaaata 240 tttgtagtgc ccccaattga ataaaaatat tattggtaag tttgtacttt atttagcatt 300 gcaacaaaat atatttgaat gaataattgc tcatacaaat attatacttg ccctattcat 360 gatacccgtc tccaggcgga cttgggcagc aagttatatt tttttgatat aacttgtaat 420 tgagaaacat tttttataca ttgctaaata tttcatttca gagatggaac gacataaatt 480 ttacgatccc gagttgatgt cacaggacga tgtttttaat ttactagatt caatagatgc 540 aggtgaatta gatactgata ttgaagataa tttcctttcg gatgatgacc ccgattttgt 600 tcctaacgct catcatttca atatggaaaa tatcttagat aatgacgccg atgaaacgaa 660 tgaaatacta accgcaatgg acgaaaccaa tattgagcct ggtacctcaa attccgaaat 720 acaacttgtt gaatctgaag aaattgaaat aggaccatct caatccaaaa aacctcagtt 780 agagcaagtt gaaaatgata attttgtgaa cacttttatt ggatcctcta ccgagataga 840 cccaaaaaat atagctttca aaaatatttt gtggcgcaag aaaaatattc agctagagaa 900 aaaacaaata caattcagag gaaatgaaga attaccggcg cgtttttatg aattgaaaac 960 tccttatgaa tgttttaaat atttcttgga agataaatta tatcaggaaa ttgcaaatac 1020 aactaatctt tatgctcgcc aaatgaatat ttcaacaaaa tttgtaacta cgccggcaga 1080 aatacaaaaa tatgtcggaa ttttgtttta tatgtccata tacagatacc caaatactcg 1140 agaatattgg ggagaaaatt cgtttgaacc agttcggaaa actatgacca aaaatcgatt 1200 tgaagaaata cgtcgttatt tgcatttcaa cgacaatacc aaaatgccgg cgcaagggga 1260 ttccaatttt gatccgatat tcaagatgcg gccaattatt gaatacttca atatacgttt 1320 tcagtcggta ccaatgtctc aacgcctgtg tgtagacgaa cagatgtgtt ccacaaaaat 1380 ggtttcgcat attcggcaat acatgcccgc aaaaccacat aaatggggca tgaaactgtt 1440 cgtgttgtgc gacacctatg ggttttcata cggatttgag ttatattctg gagcaagtga 1500 caataaaatt ccaaatggag caccggattt aggtgcagca gccaatgtgg tttcaagatt 1560 atcgcaaata atccctgatc acatgaatca cattgtatat tttgataatt attattcgac 1620 tttgccattg atgatatatc tgtatagcag aggaatatat tctttgggaa ccgttcgggc 1680 aaatcgaatt gcaaattgca aattgccaac agataaagaa gttgcgaaga aacctcgagg 1740 attttcaact gaatatgtag ggtcttgtta tggagtagac ttgtcgacta cactgtggaa 1800 agacaacaaa ggtgttctgt ttggcctcaa catacgtagg agtattacca tttaaaaatg 1860 aaagtaataa caccctgaag gcatcaagat atgatcgtgc ccaaaagaag agaatagaaa 1920 ttgattgccc aaatatcatc cgcgaataca atgcacacat gggcggtgtc gacctaatgg 1980 acggtctttt gggtcgatat catattcgca tgaaaacagt aaagtggact agtcgtttct 2040 tttatcatgt gttggacttg gccatgataa atgcatattt attacataag aggataaaca 2100 gacaaaataa agcatgcaat atacaactcc cacaatttag aaaagaagtg gctgcaatgc 2160 tttgcagatt tcaaacagat gaaccgcaaa agagatctgt tggaagaccg cgatcaacaa 2220 atgcagaaga agatcccgcc cctaaacgac tgggcaagag gacatattta ccagctgcag 2280 acatcagatt tgatggcaaa gaacactttc ctcagtggct ggacagatcc ggaaagcgcc 2340 aatgtaaatt gcctgggtgc aaatcagaaa cgcaatgcac ttgctctaaa tgtaatataa 2400 atttatgttg cactgcagcg aaaaattgtt ttgcaatatt tcataaacaa tgatataaaa 2460 ttttttgtaa ttattcaata ataaataata aaaaagaaaa tatttataat tatttgatcc 2520 atcataaaac tgaactttta gtattaactt cccaggtccg cctcaaggcg gagatcatat 2580 aatcgggact ggaggtcata ttttttactt tagaaaaaaa ttagcttgaa attgagtttt 2640 aatagcctca atttagccta gcataatttt ttggtgaaga aaaaaaattg gggcactaga 2700 ggg 2703 // ID Gypsy1-LTR_AP repbase; DNA; INV; 321 BP. XX AC Contig19874; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1AP; KW Gypsy1-I_AP; Gypsy1-LTR_AP. XX NM Gypsy1-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-321 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 438-438 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 321 BP; 70 A; 84 C; 54 G; 113 T; 0 other; tgtggtgacc cacagcgaca atattgttac cgcggtatca gcgcgtcagc cgccgcaagc 60 tgcgcgcccc tcctttaatt ttatataaaa cactacctcc atgacgcgac cgttgccgac 120 cgaccctaac ttgtttttcg ttcgaacgta tccgtcgcaa cgctcgtact gcgtcgtttc 180 taattgtttc gcgtctaatt gtttcgtccc gctacctaaa accacgtgta ttgtttcgcc 240 ttttaatatc aaattttact gtttattttg actacaatta tactctactt gaattattta 300 tttggttgta atttatttcc a 321 // ID Proto2-4_CS1 repbase; DNA; INV; 4472 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-4_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-4_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4472 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1559-1559 (2009). XX DR [1] (Consensus) XX CC Proto2-4_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1_SK) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in Proto2 CC elements from all species mentioned above. ORF2 codes for a CC protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 216..1289 FT /product="Proto2-4_CS1_1p" FT /note="ORF1." FT /translation="MRSLSNHPTPDLQLVRSELLCYAFTHVNSNTKHSLTE FT CISEFYNAEQIHTARELLWKEYEQFLITVMKKTRRPQVPYDKETARPFADD FT ISQWVFMIANAPNDSSFCQFYALDLTQVPPCRPEEVNIFSLVSRISALEKN FT DRDRQSMENIKPQRLQPSHPISEYQPHTDQQTNVKIPVPVTASSAPPEETW FT AKVVNRNKKRIQKATARKEVRAAAKDLHVVVGTAGAKDVKACPPLKHIFVY FT KVSRDCTADSIRTLMKSNDVDPINIRITSKPSWLSSSFCISIAKDDFSKTF FT SEGFWPSGIRCREWISHVHKDRNNSNFDATDRAANKDLDDDVFTTPAGGAE FT SLGGFQKDNTNNHHG" FT CDS 1072..4323 FT /product="Proto2-4_CS1_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MTSVKRSPKVSGPRVYDAESGYRMYTRIETIPTSMQQ FT IALQIRTSTTTYSRPQQEELRAWVAFRRTIPTTIMGSTLNVCSWNSRGHAA FT DRLSYLQTLLDGNDFVFVQEHWLFDDDLHRLCSNEDVNVIGVSGMDETDLL FT WGRPYGGCAIVYHKKLNCSMNMIALDSRRVCACICKGPSGEKMLFFNVYMP FT CDTSHDLNTQSKFTDVLDELSSVIYTHLDVDFIVIGGDLNTDLGRHRSMHV FT EPLKEFCSRHSLAFCIEQSVSSVKFTYYNDFNAAQSTVDHFIVSENLVPLI FT VDYSTNVDGDNLSDHVPLSLKLNMSFHYSNEPRVHPVCKISWRRATDHNIV FT SYKERLRVKLNSIVLPRKVLLCPDLDCSSHSTEIDTYYGRLIDAMKSAASE FT CIPRQRKKALAGWSEVVTPYRNKSIFWNKIWVENGMPEVGLIRDIRQSTKK FT DYKHAAKWVIRNQDNITADRMATKLHSDMNREFWDEVKRVRGNAKDCAGVI FT DDAVGEDAICDVFASKYEELYSSVSFNADDMVELRRDVDIDIRSKCCEELC FT YSNHAVTVQDIKEALGKLKLSKSDADPDLSSDHFKRACPELYVHLSFLLTS FT MLRHTTAPSHVLCSIIRPIPKNRKKSLSVSSNYRSVAISSVLLKVLDHVIL FT KKHASVLSTSDLQFGFKSGLSTTQCTYVLNEVVDYYTRSNSSVFVCLLDAS FT RAFDRVHYIKLFNLLRSRDLCPLLIKLMLNMYSSQSLAVKWQNKVSSSFRC FT SNGIKQGAVLSPVLFCVYMDSLLIRLKESRLGCYVGNVFVGAVSYADDITL FT IAPTLKSAQSLLDICEQFASEFHVLFNSTKSNVIVLNNPFRKNYDLHNLVL FT NNAAIPYTDRALHLGSFIGKDANKFNIKKAMQDLNARVNMLVCNYSFSYFD FT TLCFLFKSYCTSYYGSPLWKLDVNSLEDFCICWRKAVRKLLGLNSRTRSKF FT LPHVLNVIEVRSELISRFSSFYLKLIKSENSLVSTCCLLMNETPSTSIVAS FT NLQSLSQYVNKNISEINSHDFKTLRNLIVDTWLSEIDHSTFVYSKLLIELL FT QMKSHSSPFMSFHEILIMLNFICLMPD" XX SQ Sequence 4472 BP; 1292 A; 862 C; 889 G; 1429 T; 0 other; gtagatcaac tcgcttgtgc actgtatttt ttgctccgga ttatatcttc aattccgtcg 60 gatatctttg tgtctcatcc ctgggaatta catcagtgaa atcagtaatc gctgctgtac 120 acagtgatac gttatttgtg tgttatttcc atgtgtttgg ggaggattgt gggtttttat 180 tcaccccttt ccatgtgttg acagagctaa tcgaaatgag aagcctgtca aatcacccca 240 cccccgattt gcaactcgtg cgcagcgagc tactttgcta cgcattcacg cacgtgaata 300 gcaacactaa gcatagctta acggaatgca tctccgaatt ctacaacgca gaacagattc 360 acacagcacg tgaactatta tggaaagagt acgagcagtt tctaatcact gtgatgaaaa 420 agacacgcag gccccaagtg ccttatgaca aggaaaccgc acgtccattt gcagacgaca 480 tttcgcaatg ggtctttatg attgctaatg ctcctaatga ttcaagcttc tgccaatttt 540 atgctcttga tttaactcag gtgccgccct gtcgacctga agaggtcaat atcttctcct 600 tagtgagcag aatctctgcc ctagagaaaa acgaccgtga tcgccaaagc atggaaaaca 660 ttaaacctca gagactacaa ccctcccacc ctatcagtga atatcagcca catactgatc 720 aacaaacgaa tgttaaaatc cctgtccccg taacagcatc ttctgctcct ccagaagaga 780 cgtgggctaa agttgtgaac agaaataaga aacgcattca gaaagcaaca gctcggaaag 840 aggtgcgagc tgcagccaag gatctgcacg ttgttgtggg tactgcaggc gccaaggacg 900 tgaaggcctg cccacctctt aaacacatct ttgtctataa ggtgtccaga gattgtacgg 960 ctgactccat tagaacgcta atgaaaagca atgatgtcga cccaattaac atcaggataa 1020 catcaaaacc ctcatggctt agttcttcgt tttgtatctc tattgctaag gatgacttca 1080 gtaaaacgtt ctccgaaggt ttctggccct cgggtatacg atgccgagag tggatatcgc 1140 atgtacacaa ggatcgaaac aattccaact tcgatgcaac agatcgcgct gcaaataagg 1200 acctcgacga cgacgtattc acgaccccag caggaggagc tgagagcctg ggtggctttc 1260 agaaggacaa taccaacaac catcatgggt agtacactaa acgtatgctc atggaacagt 1320 agaggccacg cagccgatag gctctcttat ttacaaacgc tattggacgg taacgatttc 1380 gtatttgtac aagaacattg gttgttcgac gatgatttac atcgtttgtg ttctaatgaa 1440 gatgttaatg taatcggtgt ctctggtatg gatgagactg accttctgtg gggtaggcct 1500 tatggaggtt gcgcaatagt atatcataag aaattaaatt gttcaatgaa tatgattgcc 1560 ttagacagca ggagagtatg tgcttgtatt tgtaaagggc caagtggaga gaaaatgcta 1620 tttttcaatg tctacatgcc ttgtgatacc tcgcatgatc ttaatactca atcaaagttc 1680 acagatgtcc tagatgaact tagtagtgtt atttataccc acttagatgt tgatttcatt 1740 gttattggag gtgacctaaa taccgatcta ggtcgacata gatcaatgca tgttgagcct 1800 ttaaaagaat tctgtagccg acattcgcta gcgttctgta ttgagcagtc agtgagcagt 1860 gtaaagttca cgtattataa tgattttaat gcggcccaat ccacagtaga tcattttatt 1920 gttagtgaga acttagtacc tttgattgtt gattactcta caaatgtcga tggggataat 1980 ctatcagacc atgtcccttt gtctcttaag ctgaacatga gtttccatta ttcaaatgag 2040 ccgagagtgc acccagtttg taaaatatca tggcgtagag ctacggatca taatattgtt 2100 tcctataagg agcgattgcg tgttaaacta aatagtattg ttctaccccg aaaggtattg 2160 ttgtgtcctg acctagattg ctccagtcat tcaactgaaa ttgacactta ctatgggcgt 2220 ctcattgatg caatgaaatc agcggcctct gagtgtattc caagacagag aaagaaagcg 2280 ttagccggtt ggtcagaagt agtaactccc tatcggaata aatctatttt ctggaacaag 2340 atttgggtag agaacggaat gcccgaagtc ggtttgatta gagatataag acagtcaact 2400 aagaaagatt ataaacatgc agcaaaatgg gttattcgca atcaggataa tattacagca 2460 gatagaatgg ccacaaaatt gcatagtgat atgaacagag aattctggga tgaagtaaag 2520 agagtacgtg gaaatgctaa ggattgtgca ggggtcattg atgacgcagt tggtgaagac 2580 gcaatctgtg atgtttttgc ttcaaaatat gaggagctat atagtagtgt ttcgttcaat 2640 gcagatgata tggttgagtt gcgtcgcgat gtcgatattg acatccgttc taaatgttgt 2700 gaggaattat gttattctaa tcatgctgtc actgtccagg atattaagga agctcttggt 2760 aaattaaaac ttagtaagtc ggatgctgat ccagatctct cgtctgacca cttcaaacga 2820 gcatgtcccg aattgtatgt tcatctctct tttcttctca catccatgtt acgacataca 2880 acggctccca gtcatgtact ttgttcaata atcaggccca ttcccaaaaa caggaagaaa 2940 tcgttaagtg tgtcatctaa ttaccgatca gtggcgataa gtagcgtttt attgaaagtc 3000 ctagatcatg ttattttaaa aaagcatgcg tctgttttaa gcacaagtga tttgcaattt 3060 ggtttcaaat ctggtctcag tacaacacag tgtacatatg tcttgaatga agtggtggat 3120 tactatactc gtagtaatag ttcagttttt gtttgtctcc ttgatgcctc tcgtgcattt 3180 gatcgtgtgc attacataaa attgttcaat ctattgagga gtcgtgatct gtgtccatta 3240 ctaattaagc tcatgttaaa tatgtactct tcacaatcgc tcgcagtcaa gtggcagaat 3300 aaagtgtcat catcttttcg ttgtagtaat ggaattaagc aaggggctgt tttatcgcca 3360 gtattgttct gcgtctacat ggactccctc ttaatccgtt taaaagaaag cagactcgga 3420 tgttatgttg ggaatgtttt tgttggagct gttagttatg ccgacgacat aacgttgatc 3480 gctccgactc tcaaatcagc tcaatctcta ctcgacattt gcgagcagtt cgccagtgaa 3540 tttcatgttc tctttaatag caccaaaagc aatgttatag tcttaaacaa tccttttaga 3600 aaaaactacg atttacacaa ccttgtcttg aataacgcgg ccattccata tactgatcgt 3660 gctctacacc tgggttcttt tattgggaag gatgcaaata aatttaacat aaaaaaggca 3720 atgcaggatc taaatgctcg agtcaatatg ctagtttgta attattcttt ttcatatttc 3780 gacactttat gttttctatt taaatcttat tgcacctctt actatggctc tcctctatgg 3840 aaactcgatg taaatagttt agaagatttt tgtatttgct ggaggaaagc ggtgagaaaa 3900 cttttaggcc ttaattcaag aacaagatca aaatttctac cacacgtgtt aaatgttatt 3960 gaagttagat cagaattaat atcacgtttt tcttcttttt acctcaagtt aattaagagt 4020 gagaacagtc tggtttctac atgttgttta ctcatgaatg aaacccctag cacctcaata 4080 gttgcgtcta atctccaatc tttatctcaa tatgtcaata aaaatatttc tgaaattaat 4140 tcacatgact ttaaaacatt acgtaattta atagttgata cttggctctc tgaaattgat 4200 cactccactt ttgtttattc taaattatta atagaacttc tgcaaatgaa atctcattcg 4260 tcacctttca tgtcttttca cgaaattcta ataatgttaa atttcatttg tctgatgcct 4320 gattaattgg tctttcaatt tttctttatt tttttttttt tctatttttc tattttgtta 4380 ttttttcgtt attttttttt tctctcctct tttttctgtt aactcattaa ttaccattat 4440 tgttatggtg aataaatgat attattatta tt 4472 // ID MAR1_TV repbase; DNA; INV; 1304 BP. XX AC AY282463; XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 10-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Trichomonas vaginalis transposon mariner mar1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MAR1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-1304 RA Silva J.C., Bastida F., Bidwell S.L., Johnson P.J. RA and Carlton J.M.; RT "A potentially functional mariner transposable element in the RT protist Trichomonas vaginalis."; RL Mol Biol Evol 22(1), 126-134 (2005). XX DR EMBL/GenBank/DDBJ; AY282463; Positions 1 1304. XX FH Key Location/Qualifiers FT CDS 97..1221 FT /product="MAR1_TV_1p" FT /translation="MYMMSMLYFGAMPGYKMNFETRGGFAGNFGGIFFIFL FT MNHKENILALAKKCKDCKKIYETLVKCFGMDAPSYSTVTYHVRMYHFMNKK FT APIIKIDKKSPDQRKIKAILQALDEDPRASLRRIEEMTKIPRTTVSYYLHN FT YLNYKLAYTRWVPHNLNSVQKKSRVQSSKELLSILGAYQSKKFRFLVTGDE FT SWFQYATEAKIMWIPKDENPQTFPKKKIDTPMMMLSVFWGVNGIIAIDILQ FT KPNTMNAQYLIDNVLTQIINSDEFEKSKQQKQKFAIHFDNSRVHKSHKVMN FT YLVENNVKVVPNPIYSPDIAPSDFYLFGTLKKRAEGREFASPDDLENFVRE FT QFEQFSHDDLKRVFQAWIDRCERVIESNGDYI" XX SQ Sequence 1304 BP; 451 A; 227 C; 246 G; 380 T; 0 other; tagggtgtcc aaaggtgcgt ttagcacagc gctctatatg agactattaa aggagatata 60 tgctgattat aaatatataa taaaaaatat gaatatatgt atatgatgag tatgttgtat 120 tttggcgcca tgccgggata taaaatgaat tttgagacaa gaggcggttt cgcgggaaat 180 tttggcggga tttttttcat ttttttaatg aatcataaag aaaacatcct cgctcttgca 240 aaaaaatgca aagactgtaa aaaaatctac gagacattgg taaaatgctt cggtatggat 300 gctcctagct attccactgt aacgtatcat gttaggatgt atcacttcat gaacaaaaag 360 gcgcccatca taaaaattga caaaaaatcg ccggatcaaa gaaaaatcaa agcaattctt 420 caagcgctag atgaagaccc aagagcatca ctcagaagga tagaggagat gacaaagata 480 ccacgtacaa cagtcagtta ttatctgcac aattatttga actacaaact agcgtataca 540 cgttgggttc cgcacaacct taattcagtt caaaaaaaat ctagagtcca atcttcgaaa 600 gagctcctct ctatattggg tgcgtatcaa tccaagaagt ttcgcttttt agtaacaggg 660 gacgagtctt ggtttcaata tgcaacagaa gccaaaatca tgtggatccc gaaggatgaa 720 aatccgcaaa cattcccgaa gaaaaaaatt gacacaccaa tgatgatgtt gtcggttttt 780 tggggcgtga atggcatcat cgcaattgat attcttcaaa aacctaatac gatgaatgct 840 cagtatctaa ttgacaatgt tcttactcaa attatcaatt ctgatgagtt tgagaaatcc 900 aaacaacaaa aacaaaagtt tgcaattcat tttgataatt caagagttca caaaagtcat 960 aaggtaatga attacttggt ggaaaacaat gtcaaagttg ttccaaaccc aatttattca 1020 cctgatattg caccctctga cttttattta ttcggcacgc taaaaaagag agctgaagga 1080 cgcgaattcg cgagtccaga tgatcttgaa aattttgtaa gagagcaatt tgagcaattt 1140 tctcatgatg atttaaagcg tgtctttcaa gcctggatcg atcgctgcga acgcgtgatt 1200 gaatccaatg gtgattatat ttaaataaac tttatatata tgtaacttgt ttatttctat 1260 tcgcagtatt caggtttgtg ctaaacacac ctttggacac ccta 1304 // ID Gypsy-47_CQ-LTR repbase; DNA; INV; 220 BP. XX AC AAWU01016425; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-47_CQ_; KW Gypsy-47_CQ-I; Gypsy-47_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-220 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 474-474 (2011). XX DR GenBank; AAWU01016425; Positions 85984 85765. XX SQ Sequence 220 BP; 87 A; 27 C; 71 G; 35 T; 0 other; tgtagtggat gtggtgagag agagagagag agagagagag agagagagag aaagagaata 60 agcgaggaaa gatagagaga gaaaaagtga gagagagaga gagagcaaga aataaaagag 120 attcttaata aacggcagtc gattgcttaa tctcaaccag taaagtcgtg cgttttattc 180 cgtccggaaa tggacgaggc tggccaccag atccacgaca 220 // ID I-2_BF repbase; DNA; INV; 5738 BP. XX AC . XX DT 22-MAY-2009 (Rel. 14.05, Created) DT 16-JUN-2009 (Rel. 14.05, Last updated, Version 2) XX DE Amphioxus I-2_BF autonomous Non-LTR Retrotransposon - consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-2_BF. XX NM I-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5738 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5738 RA Kapitonov V.V. and Jurka J.; RT "Young families of I non-LTR retrotransposons from the amphioxus RT genome."; RL Repbase Reports 9(5), 1141-1141 (2009). XX DR [2] (Consensus) XX CC I-2_BF is a consensus sequence of the young I-2_BF family of CC non-LTR retrotransposons that belong to the I clade. The I-2_BF CC consensus sequence has two ORFs. ORF1 codes for a protein that CC contains the PHD and Zinc knucle domains at its N and C terminal CC parts. ORF2 codes for a protein composed of the apurinic CC endonuclease, reverse transcriptase and ribonuclease H. XX FH Key Location/Qualifiers FT CDS 175..1518 FT /product="I-2_BF_1p" FT /note="ORF1 protein." FT /translation="MPSTRAQSKQQSTTKQPAEKQEDKSNKELNQENNKSE FT KQHKRKPQETENTQLQAKSVLQQSPSETKKRKQKTTNKKDNPTNNRKQSQQ FT RNKSPSKRQTLSEEEELCICGNDHPEGGCWICCDQCNTWWHGTCAKLSPST FT VAYFVKSGEEYYCAYCTIQNINKSTKHIRKKDPSHEDVNTPKQTIRSREGP FT ATDTKHKPNKEETNEDKKEEIIIIDNISNPSSYKNSAKILAEISRHKPGIT FT HSIDLAYSLPRGGIAIHCRESKAASEILKPWPEGAFGNAHEQLKVHKVHNI FT YGRRAVLKNVPTNTNEAAIEQNIYEQTRVKVKAHRFHYHDTGKPLKVVRIE FT AEEGQLSELFTQNIRIENALVKVEAYKTKRHTPIRCYQCHKLGHIAKECKE FT SATCAKCGGKKDHTKSCRPNCVNCKGNHPANDSRCKAFISLQQKLVDRDRR FT QHQ" FT CDS 1715..3604 FT /product="I-2_BF_2p" FT /note="ORF2 protein composed of the APE, RT, and FT RNAse H domains." FT /translation="MKIAQVNIRSINTSATLVEKMCEKKNIDVLCLSEVWH FT KAKQPTCLKTWTWTHKNRTDKRGGGTAIVTRENIKVMEEPIQQTTHADIVA FT LNIYTKALNFVLVSAYVPPEDRLALQELLSIVNELNQKKKHLVLCGDLNTK FT HIAWGNKKNNKLGQMLYEAMETGDFQIMNNNTPTRDDSIIDLTIVNKNTTK FT HMQSWRVEPEVQLRTDHNLITMSIGSKEETQKQQKWNLRNVDWKEWQDKTD FT TVFQQWVEETDWATEKANSRSEEVYNSFKENLLTCAEEVITKKTITQHSKG FT YWNKALDVQMQITKTALRKFKRRRDKTNLTKYLHEKEILTNMEEEARNTYW FT NEQLKDMDPRKPQKFWKAIKNQLGRSSRPTIQPIRQKDGTIATTDQQIAHA FT VTTEYAPGGVEITGELATWKQNITQAVEISIKHEKQRLQDDHTYEEDTHED FT TLNLDLTLEEVKAAIRKMDASSSPSPTEGILPVMIKYGGDGLTIALQHMLN FT LVWETGQIPKAMKQDNKILIKKPGKDDYNKVRSYRPITLSSVVGKLMERIV FT DNRLTWWAEANHILSPYQEAYRKHRTATHGVLRLVQHIEEAWKNNETTVAV FT FADYEGCFDRIWQEGLLYKLITKGVKGRNSAT" XX SQ Sequence 5738 BP; 2311 A; 1332 C; 1205 G; 890 T; 0 other; caagacacat taagaacgtg agtatcctag aataataaca gggaaaacac ctgacaactg 60 aaaacacccg agaataagaa acaacagaca acagaaaaca caagagtaca agaaacaaca 120 aggctcacca acagtggccg aaaccggtgg cagtttcaaa cacagaaaaa cggaatgcca 180 agcaccagag cacaaagcaa acagcaatcg acaacaaaac aaccagcaga aaaacaggag 240 gataaaagta acaaagaact aaaccaagaa aacaacaaaa gcgaaaagca acacaagaga 300 aaacctcaag aaacagaaaa cacacaactc caagcaaaat cagtactgca acaaagtcct 360 tccgagacga agaaaagaaa acagaaaaca actaacaaaa aagacaatcc cacaaacaac 420 aggaaacaat cacaacaaag aaacaagtca ccaagcaaac gacaaacatt gtcagaagaa 480 gaagagttgt gcatctgtgg taacgaccac ccggaaggcg gctgctggat ctgctgtgac 540 cagtgcaaca catggtggca cggcacatgt gccaaactct caccatcaac cgttgcatat 600 tttgtaaaaa gtggagaaga atattactgt gcatattgca ccatacaaaa catcaacaag 660 tcgacaaagc acataagaaa gaaagatccc tcacatgaag acgtaaacac tccaaaacag 720 acaatcagaa gcagagaagg gccagctact gataccaagc acaaaccaaa caaagaagaa 780 acaaacgaag ataagaagga agaaataatc atcattgaca atatatctaa tccatcttca 840 tacaagaaca gtgcaaaaat attggcggaa atctccagac acaaacctgg aatcacacac 900 agcattgact tagcctactc actgccgaga ggaggcatcg ccatacactg cagagaaagc 960 aaagcagcct cggagatact gaaaccctgg ccggagggcg cttttggtaa cgcacatgaa 1020 caactaaaag tacataaggt acacaacata tacggaagaa gagcagtact caagaacgtt 1080 ccaactaaca ccaacgaagc agcaatcgag cagaacattt acgaacaaac aagagtaaaa 1140 gtaaaggcac acagattcca ctaccacgac acaggaaagc ccttgaaagt ggtacgaata 1200 gaagctgagg aggggcagct gtctgagcta ttcacacaga acatcagaat cgaaaacgca 1260 ttggtgaaag tcgaagcata caaaacaaaa aggcatacac ccattagatg ctatcagtgc 1320 cacaaactag gccacattgc taaagagtgt aaggaaagcg ctacatgcgc aaagtgtggg 1380 gggaaaaaag accacacaaa aagctgcaga cctaactgtg tcaactgcaa aggcaaccac 1440 ccagcaaacg acagcaggtg caaggcattc atcagcctac aacaaaagtt ggttgacaga 1500 gaccgtagac aacaccaata agagacacca gaaatccagc agttagcact ctccacagaa 1560 gagacaaaac atacaccacg gtacaactac caaccacaac cgaagaaaga agaagcatgc 1620 ttaagtgcct tataacgctc atagtgtgcc tagtaacact caccgagcaa acagtacaca 1680 gacttctaca acacgagcca aacttgccac aaggatgaag atagcacagg tcaacattag 1740 atcaatcaat acctccgcca cattggtaga aaagatgtgc gagaaaaaga acatagatgt 1800 attgtgcttg tcggaagtgt ggcacaaagc caaacaacca acatgcctca aaacatggac 1860 atggactcat aagaacagaa cagacaaaag aggtggtgga acagccattg tcacaagaga 1920 gaacataaaa gtaatggaag agcccatcca acaaacaaca cacgccgaca tagtagcact 1980 aaacatctac acaaaagcgc ttaacttcgt gctagtgtca gcgtacgtcc caccagaaga 2040 cagattagca ctgcaggaac tcttgagtat cgtcaacgag ctcaaccaaa agaagaagca 2100 cctagtcctc tgcggtgacc tcaacacaaa acatatagcc tggggaaata aaaagaacaa 2160 caaactagga caaatgctgt atgaagctat ggaaactggt gacttccaaa tcatgaataa 2220 caacacacca acccgagatg acagtattat agacttgaca attgtaaaca aaaacactac 2280 gaagcatatg cagagctgga gagttgaacc agaggtgcag ttacgaacag accacaatct 2340 catcacaatg agcataggtt ctaaagaaga aacacaaaaa caacagaagt ggaacctgag 2400 aaacgtagac tggaaagaat ggcaggacaa aaccgacacg gtcttccaac agtgggtaga 2460 ggaaacagat tgggcaacag agaaagcaaa cagcagaagt gaagaagtct acaacagctt 2520 caaggaaaac ttgctgacat gtgcggaaga agtgataacg aagaaaacta taacacaaca 2580 tagcaagggc tactggaaca aagccttgga tgtacaaatg caaatcacaa agactgcact 2640 aaggaagttc aagcgaagaa gagacaaaac caatctgacg aaatacctac atgagaaaga 2700 gatcttaaca aacatggaag aagaagcacg aaacacatac tggaacgaac agcttaaaga 2760 tatggacccc agaaaaccac agaaattctg gaaagctatc aaaaaccagc taggcagaag 2820 ttccagaccc accatccaac ctataagaca gaaagatggc acaatagcca caactgatca 2880 acaaatagca catgcagtca caacagagta tgctccaggg ggtgtggaaa tcacaggaga 2940 actggcaacc tggaaacaaa acatcaccca agcggtggag atctccataa aacatgagaa 3000 acaaagacta caagatgatc acacatatga agaagacaca catgaagaca ccctcaacct 3060 agaccttacc ctagaagagg tgaaagccgc catcagaaaa atggacgcaa gcagctctcc 3120 tagccccaca gaaggaatac taccagttat gatcaagtat gggggagatg gcctgaccat 3180 agccctacaa cacatgctga acctagtatg ggaaactgga cagatcccta aggcaatgaa 3240 acaagacaat aaaatactca ttaagaaacc tggaaaagac gactacaaca aagtaagaag 3300 ctacagacct ataacactct cgagtgtggt tggaaagcta atggaaagaa tagttgacaa 3360 cagactaacg tggtgggcag aagccaacca catattatca ccataccaag aggcatacag 3420 aaaacatcgt acagctacgc acggagtgct aaggctagtg caacacatcg aagaagcctg 3480 gaagaataat gaaaccacgg ttgcggtgtt tgccgactat gaaggatgtt tcgatcgcat 3540 atggcaagaa ggactattgt acaaactaat aacaaaaggg gtcaagggta ggaactctgc 3600 tacctagaga gttcctgaga ggtcggaaac cagatttaaa gtcaacacca tcacaacaga 3660 accggagatc agtaaagtag gcatcccaca gggagctgta ctttctacca ccctctgcaa 3720 tatctacact gcggatgcct accagcaaac aggactagac aacttccaat atgctgatga 3780 tggtgcagcc tggtgcagtg gcgcagacgt aaaagaggta tcaaaacaag tagagacaag 3840 tatcgatcac gtaatatcga cttggtgccc tttgtggaac atgcgtatcg aagaaagcaa 3900 aaccaaagcc atggttttct ctccacctca catagaccag ccaactgctg acagccttga 3960 agtaaacaac aagaacatcg acatcatccc agaagtcaga ctggtaggga ttacactaga 4020 cgagaagctg aacttccaaa gtcacatctg caacacccag actaaagcat acaaagcctt 4080 aaaagcaatt agcaaagtaa ccaatgctaa gaaaaatccc aaccaggagg ctcacctcca 4140 gctgtacaga gcaatgatca gacctattct cgaatacgga acagaatgta cactcagagc 4200 aggacagaag cacgatgtag cgtacgcccc catacagaga aaagctctct tggctgcaac 4260 cggctgcaag atcagaacga gtacagacgc actggaagtg ctgactggca ttatgcctat 4320 tgacatacac ctaacatgcc ggcaagcaca agcctacttg agaatggcaa caaaacacac 4380 ggggaacccc atatatgata aaatagcaca agaaagatcc gaagagaaga tcggcacaac 4440 cctacatctt ctcgaaacca ggtttaaaga aatgaagggg gagatagagg tcagcacagt 4500 agacaaggaa tgctactacg actccaggct accccctttc tcagtgggca gaataacagg 4560 atctttcctt cccaccacaa caaaccctgc caacaaggag gaagccaaag agaaagtaaa 4620 gcaaatacta cacgagatga agaaaacacc aaccacggta gttttcacag atggttcatc 4680 cttaggaaac cccggtccaa cagggtgtgc tgcggtcatc tacgagcagt ggggagtaac 4740 tgaaccttac acagttagaa aaccagtagc tgcaaagtcc aacaactacg aaggagagtt 4800 gcaggggatc tacctagcac tgaacaccct acacagaagt caaagcaaaa acagaagaat 4860 cctcatactc tgtgattgta aagcggccct tgagaacgtc agctcgttac aacaggcaga 4920 agcttacaac gatctagtga atgcagcaag acagaggctc tttgaaatcc aacagaaagg 4980 acataacata caaattgaat ggtgcccagg ccacatgggg gttgaaggaa acgagctagc 5040 agacatgcaa gctaagctag ctgcagaaga agctaaacac acagagaaca acacgacatg 5100 gactaagcaa caagcaatga agcacataga agaacaagca gtaaaaagat ggcaaagaag 5160 aagagaaaac caaacaacta gcagccacat gcagaaagcc aacacgagtt tgaagaagaa 5220 atgcagcaca tgggagtcaa gaacgactca aatctctata aaccaactcg tgagtggaca 5280 cacggagcta aacgcctaca aaaactggat tgaccccacg gagacaccaa actgtggcac 5340 atgcggtaca agggaaaaca tagaccatta catgtatgaa tgcccaaaat atgagaacac 5400 cagacaacat ctaatgaaag aaattgacaa catatatgaa gactacggaa tccctcaaca 5460 agaaagaacg atggatgttg tatccctagc aggaatgcga agcgacctga cgaacgaagc 5520 aaacaaaagg atgtatttgg cgttcaccaa atacattgaa gacacgagaa ggttcgccga 5580 gcaggtctag aacaccccaa acacgcatat gcaaagcacc agtgtacaag caccagcgag 5640 gtcaacaaat ctaagagcac ccagcagaag aagcaccaag agaccatgtc tagcggagaa 5700 gtagacgtta aacaaggaca tcaacaacaa caacaaca 5738 // ID Gypsy-603_AA-LTR repbase; DNA; INV; 1331 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-603_AA_; KW Gypsy-603_AA-I; Gypsy-603_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1331 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 1331 BP; 387 A; 310 C; 360 G; 272 T; 2 other; tgtaacgttc atttcgaatt tagttaattt cgagcataaa ttacactgta cacaaaagcg 60 attaactcga acacgccctt tgagaatacg atccacatct gtcatcatca gtactaccac 120 aacacaggaa aaagtttgtt tccgctgtgg cagacaaaaa gtgcactggc agcagcaaga 180 gaagaggtgc tagtgctcga agctgccgag atagctgcct cctcgtggca gactaagttg 240 atcgaagcaa atccttgccg acacggtgcc aggacgcgcg gaccaaagga aagttccaga 300 tttcgacggg acagcagcta acgggattcg gaccgcctag cccggatcaa ggaaaaaacg 360 ccagcacacg cgctgcggga ccggtgagaa gagcacggct gtgttcaggg actgtgccaa 420 ggtaatttag atcgcaatgg ccgtataaca tggawtgtta accaccgact ctctgccaaa 480 ataggaatcc sttccgagca tcatccggca aaaaccaacg gcaaaaagta gccaggacaa 540 ggaagttctg aaggaactag cgtgaggtaa ttttgaccga cattaacatg cgcgtaacgt 600 tttgtgacga tttcccgttc cacacagaag tcgaccctaa tctccaaccg tccgtttggg 660 aacgaggaaa gcgctgagaa atactccgaa aaagcgtccc aaatccacgg gtgagtcact 720 ctagtcggaa gcgtccgcat gttactaatg aaggccgttt agggcgcttt ttttctgtta 780 cgtttcaatc cgaggcagca aagcatccaa tccggacaag cacttggccg tgggagatcc 840 tggtgaatcg ctggatcgag gacctaacag atctcggttg tcggcaagag atccaaaccc 900 gtcggccaga gcaaatacct gtctggtggc attcacagag tggtcggtgt ctgctaacgg 960 aggttcatcg aacgcgaacg tagccaagct ccacagtgag gggcatttag aggaacggag 1020 gcggtgcaaa aaggtccgaa cagcgagact gtcgaccatt gtcggcgcga agaagacggc 1080 gagctagacg aggtgacgaa ctctagggag aagaagccgt gagtacacta tagcatgcgc 1140 acaagtgatg atcgcccgca gggcaagaag gagaccataa tctggccgaa gaccatagca 1200 tagggcatta gcgtaggagt tagcattatc gaaggcgtgt gatagatgtg tgtgggagcg 1260 ctaaaataca gaatgcttcg aaattaattt tgtggctcta tattcttttg gtttgaggga 1320 aaaacccaac a 1331 // ID Gypsy-10_DWil-LTR repbase; DNA; INV; 268 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_DWil_; KW Gypsy-10_DWil-I; Gypsy-10_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-268 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 8960112 8960379. XX SQ Sequence 268 BP; 86 A; 42 C; 58 G; 82 T; 0 other; tgtaagagag agagttacca gacagctggc aacgtaccat tccaggctgt ttgctgggta 60 atacgtttgg gaagcaagtg ttcgctggga aaaagggtcg tcagttggaa gttgctctcg 120 ttggtcgtca gccgaagtga aaagttgtat cgaagttgtg aaaaagtcga gttaaatcta 180 ttacttgcat catatatatt tactctgtat tatcatgaac ctattaaata actattataa 240 taaacattaa attaatctaa accttaca 268 // ID hAT-13_HM repbase; DNA; INV; 3169 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3169 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2002-2002 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 632..2863 FT /product="hAT-13_HM_1p" FT /translation="MHRQNCAMIKSNEAIPSSFFTNLKKTDMCILPYGTPD FT NVHIGSTCTKRVKPTIKPIVSFRDRTSNAEAMTLAFICEHNLPISLTSHMI FT EYAKEMSKDHRVLKNLKMFRTTASYKLREGLGEAFHAELVFDMKIKMFSLN FT IDECFSAKNEKVLSILVAYFCDKTNKVVLKHYASLSLVTVNAESLFNAVIH FT LFNKDAIPLSNLVSNLSDSTNYMRGKISGFETRLRKAVPHLLDIDGDICHH FT VHNVVKKFTSFFGKVVENLLDDIYIDFKYSPDLRTFLKKICQITGVTYNTP FT KQRVCHRWLSVLDCTLPMLEMMPAFTLIYSSWLSKQDISLYKDESDKILKR FT LSNESKKLIEKIKCKMQQKKLTKQGAERKARIVKSLFFRRNETLLYINLYV FT EILPIFKSFILTFEQKEPLVHRIYDEQIELVKTILTCFIKPKMIKNITGEK FT LKFLNLKDKKMYRHSNYIYLGSNAEKYLSRILIRNPDFKSSFLKTLLKAYL FT ESSVYLLNKLPFNNPLLKCLSAIDPMSQGHEITERMLKKLSTFFPSVDINK FT VIFNREVMQLQIDIALPPVFDDGKPVRLDYWWAEVFKCKKYLELSKIIKAC FT LSIFSSPHVEQSFSMMNTIVNKKTNRLDTLTYSAIASVKYQLLANETTAVG FT YFKRRDILHDPVNKNICGHLQTASKRYNKKMKFHRKNRKVEERFLNINSTW FT KKKIALHEQCKKVKESIIDHAKKCRKRKADECFTTENYKKKKY*" XX SQ Sequence 3169 BP; 1154 A; 486 C; 496 G; 1033 T; 0 other; catagatgag ttttttccgg gaattcccgg aattccggaa attcacatca agtttccggg 60 aaattccgga tgctttttgc tattgatttg taaaaagtta tataaaattt taaatacgac 120 atggtttgca tttaagcaat tatgttttct ataaatgaaa ttttcacgag atcaattttt 180 tcaaaaaatt tttttctttt taattcattc aaattaactt tttcgaatat tagaaattct 240 ttttcttaag aagttaattg atgtaaattc cgttttaaac gtttgcaaaa gatcttgtcg 300 ttaaaaatta ctatcaagaa gaaagaaaag aaaatgcaag ttaagagttt atctgacatt 360 tctgagatag acaaaaatat taaaaacaag tttcgctgga attggcttca agagaaagac 420 tataatgatg agcattattc agcctatatt acaaagataa ataaagccgg agcggtaagt 480 ttattctcta actgaacact gaaatatata acttcatcat agttatatat tatttgattt 540 gatgttttag gtacgatgta caatatgtaa taaagacttt tattatggaa gtgaaggtaa 600 aaaatcttta ctcagccact caaagtctaa aatgcatcgt caaaactgcg caatgattaa 660 gtccaacgaa gccataccgt cgtcattttt tactaattta aaaaaaacag atatgtgtat 720 attgccatat ggcactccgg ataacgtgca tattggcagt acttgcacaa agagagtaaa 780 gcctaccatt aaacctattg taagttttag agacaggaca tcaaatgcag aagctatgac 840 actcgcattt atttgtgaac acaatcttcc aatatcttta acatcccata tgattgaata 900 cgcaaaggaa atgtctaaag atcatcgtgt tttaaaaaat ttgaaaatgt tcagaacaac 960 tgcttcgtat aagcttcgag aaggattagg tgaagctttt catgctgagc ttgtttttga 1020 tatgaaaata aaaatgttct cccttaacat agatgagtgc ttctcggcaa aaaatgagaa 1080 ggttctcagt atactcgtag catatttttg cgacaaaact aataaagttg tattaaagca 1140 ctacgcgtca ttatcgcttg ttacggtaaa tgccgaaagt ctttttaatg ctgttattca 1200 tttatttaac aaagacgcaa ttccattaag taatcttgtt tccaatttat cagattctac 1260 caattatatg agaggaaaaa tcagtgggtt tgaaacaagg ttaagaaaag cggttcctca 1320 cttacttgat attgatggtg atatttgtca tcatgtgcat aatgttgtca aaaaatttac 1380 ttcttttttc ggcaaggtag ttgaaaactt actggatgat atctacattg attttaaata 1440 tagcccagat cttcgtacat ttctgaaaaa aatctgtcaa atcacaggtg tcacatacaa 1500 cactccaaag caaagggttt gtcatcgatg gttgtctgtt ttagattgca ctcttcctat 1560 gcttgagatg atgcctgctt tcaccttaat ttattcttct tggctatcaa aacaagacat 1620 ctctctgtat aaagatgaaa gtgacaaaat tttaaagagg ctatcaaatg aaagtaagaa 1680 actaatagaa aaaataaaat gtaaaatgca gcagaaaaaa ctaacaaaac aaggtgcaga 1740 aagaaaagct agaattgtta aaagtttatt tttccgacgt aatgaaacat tactttatat 1800 taatttgtac gttgaaattc ttcctatttt taaatcattc attttaactt ttgaacaaaa 1860 agaaccgtta gtccatcgaa tttatgatga gcaaattgaa ctagtgaaaa caattttaac 1920 ttgttttatt aagccaaaga tgataaaaaa tataacagga gaaaagttga agtttttaaa 1980 cctgaaagat aaaaaaatgt atcgccattc gaactacata taccttggaa gtaatgctga 2040 aaaatatttg tcaagaatac ttattcgaaa tcctgatttc aaatcatcgt ttttaaagac 2100 cttgctaaaa gcttaccttg aatcgtcagt ttacttactt aataaattac cctttaacaa 2160 ccctttatta aaatgcctct ctgcgatcga cccaatgtct caaggtcacg aaattacaga 2220 gagaatgcta aaaaaactct caactttctt tccctctgta gatataaata aagtaatatt 2280 taatcgagaa gttatgcagc ttcagataga tatagcgctt ccacccgtat ttgatgatgg 2340 gaagcccgta cgtcttgact attggtgggc tgaagttttt aagtgtaaaa aatatttaga 2400 gcttagtaag attatcaagg catgcttaag tatttttagt agtcctcatg ttgagcaatc 2460 attcagcatg atgaacacca tagttaacaa aaagacaaac cgtcttgata ctttaacgta 2520 ctctgctata gcttcagtta agtatcaatt attagcaaac gagaccacag cagtagggta 2580 ctttaaaagg cgagatatat tgcatgaccc agttaacaaa aatatctgtg gtcatttgca 2640 aactgcaagc aaacgctata ataaaaaaat gaaattccat cggaaaaata gaaaagttga 2700 agaaagattt ttaaatataa attcaacgtg gaaaaaaaaa attgccttac atgaacagtg 2760 caaaaaagtt aaagaatcta ttattgatca tgcaaaaaag tgcagaaaaa gaaaagctga 2820 cgaatgcttc acaacagaaa attataagaa aaaaaagtat tgattcctga catttattcc 2880 attgtcttac attttatatt ttaaatataa agaaattatt gcgctttgca ggttgcttat 2940 gttaaaactg agatattata tttcatttta gagaaacata attaatgagt agtttttagt 3000 ttctttattt atccctttac caacgttcca gaatttaccc acataccttc gtcaatttat 3060 tgaccctaaa tttttccggg attttccgga attttcatga aaatgacttt ccgggatttt 3120 caaagaaaaa aatttccggg attttcataa agtcaaaaac tcatctatg 3169 // ID BEL-8_DPu-I repbase; DNA; INV; 7595 BP. XX AC scaffold_140; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_DPu_; KW BEL-8_DPu-LTR; BEL-8_DPu-I. XX NM BEL-8_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-7595 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 663-663 (2010). XX DR Genome; scaffold_140; Positions 163162 170756. XX CC 'TAATT' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS join(1282..4842,4846..7593) FT /product="BEL-8_DPu-I_1p" FT /translation="MSNANSSPNNDTHSPRPPSRAQEMQTEYTDRYPPLEN FT LMQPDQTDHNPPLFDLRPPDRTDEYPLFEDTRALKQARSTEKSRATRQGNK FT IISHVRNKKSRTQLKFMRTEFSTTIADCLSIHQQYCDSKEEEDERDDEWTK FT ELGEDAKTIFEIIDRYLARKSRPPSAVSIGHASQQSLGIQSNATVYPAQPD FT PPNRSADHHSVSPSSSVSNQDAKALQEERRQREEERRQREEERRQREEEKR FT QREQLEKQIQQMKIDEKEKIKKAVEEERERQSSKYTTANQEAVNEAEKRAN FT EIAEENKKIRTALEVQKQKQKELFEAAQEKDRLLRLEKKERLESEQKLNQK FT IQEQQSKIPISSLLKQPYTSFQPTQFEPTRGANPFDNLFQTASNPHRQTEE FT EPINFKQWTAHNDNLLSSTKDSTESNHNTRSVFRLPKLDLKPYDGDPKKWP FT DFIAIFRDLVHSNNSLSSTEKMALLKRSLSEDIRNGLGDSLSSSALYSEAL FT TELENTYGHPQIVSRAYIQSLIELPKVNNNDYKTLLKFSQTLNGAVSSLKN FT GGYEHELKASGILELILAKLPAELQSRWGKKIVKSHPVCLTLQDFSSWIHL FT IVKGEMMAKHCLISPASSPPSSKSNQKPGRQQTDRKPKHPPSINIIGHKPA FT ALPSTAAKQNDGRKTLTCLLCRGDHRLSSCPKFAALSLEERMKIIKEFNCC FT LRCLTKGHWTKECFNKTKCNFEECTAHHHPLLHGAPSLNVSFKDKAKTPKE FT EKSETQENKKTTKKGVGTHTVEGDSITTLLLTVPVIIEAHGIQVNTVGILD FT QGSQASLILDKISKKLKLDGPTQSSPLATFHGNDPKNKVKCVSFNILTADS FT SRSFEVKSAYTVPRLQIQTTSLNWPAVKHQWNHLCDIEPINADTKEIGVLL FT GRDVLRVHDVLDSRYPADGVEAPDGIKTHFGWCVTGPVATAILHPPLHINA FT LSITQQLSDQALHDVVNQFWLTETFGVRPSTSLPSSLDDKAALKILEQTTR FT HTGERYEVGLMLRDPKINIPNNREVAVRHFNSLEKRFARNPAFAERYSRVM FT NEYISLGHAVLLDVNNSIRKGFTWYLPHHGVTNPNKPEKVRVVFNPSARYK FT GTSLIEQLFKGPDLLTCLIGVLLRFRQFPVPISGDIEKMYHQVLVPKQQQS FT LFRFLWKNPGDVGEPKEYQMTVHVFGAVSPTSCIYALRKTAEDFGSRFPDV FT ADSVSKNIYVDNYLDSTETEEEAIARLRDVSALLKLGGFNMVQWLSSSRSV FT LATVDHSDLSRSLDLDADKLPIERTLGLLWNCQQDSFNFKSSIKIQAKTKR FT EVLQEVASVFDPLGFLSPVIMTAKILLQDIWRSGADWDDPLPPTLLEIWMA FT WAKELSAIASIKIPRCFRLQEKPISYELHVCSDASEVGFGACVYLRAEYPN FT GHFRLNLLLAKARVAPLRQLSIPRLELQGAVLGVRLCDSAIKELGPIAAQV FT IYWCDSQTVLQWIHSKSCKYHAFVAHRITEIIESSAASQWRHIPGELNPAD FT DCSRGIPATHLTTQHRWFRGPDFLALPQSSWPSTGVIAEPSSDDPEVSPAK FT WVGFVQVTNDHPVFNLIQQSSNLHKLKRIVAWLLRFVNNRHINPKNRQLAP FT YVKAPELREALRFIIRVDQRHFFDDEFRCLAKGRPVPTASSLANLTPFLDP FT FGIIRVGGRLQHASLPEDTKHPIVLSSDSQLSTMVITDTHKLLIHASTEPT FT LHALRAKYHVLHPRASINRVIRKCFTCKLRNSQPAPPLMGPLPASRLQTHL FT PAFTNVGIDLFGPFSVVILRRSVKRYGVMFTCLDTRAVHLEVADSLDMDSF FT INAFSRFADRRGVPQLCYSDNGTNLVAGEQEINRALSRWNEAELVQKIEKL FT KNQPIEWRFSPPVAPHFGGSWERLIKSAKTALRGILNNRSVTEDVLVTAIV FT GAEALLNSRPLTHVSVNPNDLEAITPNHFLLLRAHPGCNLDSPPDAKISSR FT RRYEQAQQLITHFWNRWLREYVPNGIERRKWLRSRRNLAVNDLVLVVTPNS FT PRGSWPIGRVVSVQQGPDGFVRSADVRVVRAIPSTSKRRRAASDVTCTTHL FT YTRSVHKLCLLEEDEQDVSEDGNRAG" XX SQ Sequence 7595 BP; 2252 A; 2105 C; 1556 G; 1682 T; 0 other; tttggtgcat cgaccttgcg ttttgaaccc tacagaaaac taagatccaa tcgtacagca 60 agtcgtgcac atttcgtgca tcgaccctgc gtttcgaatc caaaaaaaaa aaactaaact 120 cactcgtaga gcaagcattg cacactttgt gtagtaacct tgcgtttcga atcaaaaaat 180 ttactcgctt taatgtaaag tcccgtacag tccgtgcaaa gacattgcgt tagatcccgt 240 aaaaaattct ccccccgatc cattgaaatt ggttgacgac caagcgcgcg gcgtcaccca 300 ccgtcagggt aatccatatt tatctccgac atctcaacag gtagacaatt tccctttgag 360 aacttaattt ctgggtctaa aaacagaagg gaaaaaacct cattggtaaa aaaaaaaaaa 420 aaactctcga tcgcccctta aactcagttt ctaattttga ttcattgcaa aattaaaacc 480 acttggcgat tttgtctagc tagcatattg cctaatattt tttgcgaagt aatttcatta 540 gaatcggcag actccattta aatttatctg gcaacgctgc cgagaggaag ggaatcggga 600 acccaggggg agccagttag tagcagacga cgttttctct ctctcgctgc cgacaaccgc 660 atcgtcaagt gctctcccca cattttctct cgtaaatttt tcgcccaccc tccctttttt 720 ctctcgaaca ggccacgtgg ccgttacgtt tcaaaactgc ctcgtcgccg caacagccca 780 atttcccgcg ccatccaggc cgctgttcac gaagaaagaa gagtggctcg acgtcaacaa 840 caagtgcagc agctacggca aagagagatt gacgatcaac ccaatcagca agcaagacaa 900 gccgcaatca acgcggccat tcaagaagca gaaagaatat tgcaagttat tctggccgaa 960 cgagaataag aagaagcggc aaaacaaaga aaacaagaag actcagccgc tgaacctgaa 1020 gaagattcca aggccagtac atctcaagat caagatttgc atcctgtttc gtgtaacaca 1080 cggtcctagt gatcggtcct aagtcttttt caaggctcat tgggctatca ttcgttgggc 1140 ttgctgttct cacttttata ttcagttaaa aaaaaaaacg gtcctaagag ccaagtccta 1200 gatccctcgt ggtccattgg gctagcttct cattggtcct actgaatttt ctaaaaaaaa 1260 aaaaaaaact taatttccaa gatgtcaaac gccaattcat cgcccaataa cgacactcac 1320 agtccgcgcc caccaagtcg agcgcaagaa atgcagacgg agtacaccga ccgatatcca 1380 ccattggaaa acctaatgca gccggatcag acggatcata atccaccatt gttcgaccta 1440 aggccgccgg atcgcaccga tgaataccca ctatttgaag acacaagggc gctgaagcaa 1500 gcaagatcaa ctgaaaaatc tcgtgcgacc agacaaggaa acaaaatcat cagtcacgtg 1560 cgtaataaaa aaagtcgcac ccaactcaag ttcatgagga cagaattttc cacgactatc 1620 gccgactgtc tttcaattca tcagcaatat tgtgattcga aagaagaaga agatgaacga 1680 gacgacgaat ggacgaaaga acttggcgaa gatgcaaaaa cgatttttga aatcatcgac 1740 cgttatttag caagaaaatc ccgtccgcca tccgcagtat ccatcggcca tgcgtcacaa 1800 cagtcactcg gcatccagtc aaacgccacc gtctatccag cgcaaccgga tccacccaat 1860 cgatcagcag atcaccactc cgtatcacca tccagctcag tctcaaacca agacgccaaa 1920 gctttacaag aggaaaggcg tcaacgcgaa gaagaaaggc gtcaacgcga agaagaaagg 1980 cgtcaacgcg aagaagaaaa gcgtcaacgc gaacaactag aaaagcaaat ccaacaaatg 2040 aagatagacg aaaaagagaa gatcaagaaa gccgttgaag aagaaagaga gcgacaatcc 2100 agcaagtaca ccacagctaa tcaagaagca gtaaatgaag cggaaaaacg tgccaacgag 2160 atagctgaag aaaacaaaaa aatcagaacc gcgctggagg tccaaaaaca aaagcaaaag 2220 gagctgtttg aagccgccca agaaaaggat cggttgctga ggctagaaaa aaaagaaagg 2280 cttgagagtg aacaaaaact gaatcagaag atccaagaac agcaaagcaa aattcccatc 2340 agttctctct taaaacaacc gtacacctcc ttccagccca cgcaattcga accaaccaga 2400 ggagccaatc catttgacaa tttgttccag accgcgtcaa atccacatcg tcaaaccgaa 2460 gaggaaccaa tcaacttcaa gcaatggacg gcccacaacg acaatttatt gtcgtcaacg 2520 aaagattcca ccgagtcaaa ccacaacaca agatcagttt ttcgtcttcc caaattggac 2580 ttaaaaccct acgacggcga tcccaagaag tggcccgact ttattgcaat cttcagagat 2640 ttggtccact ccaacaacag tttatcatca acagaaaaaa tggcgttgct caaacgatct 2700 ttgtcagaag acatccgaaa tggacttggc gactctctca gcagctcggc tttgtacagt 2760 gaagctttga ccgagctcga aaacacctat ggacatcctc agatcgtctc aagagcctac 2820 atccaatctc tgatcgagct accaaaagtg aacaacaacg actataaaac tttgctcaaa 2880 ttctcgcaaa ctctcaatgg agccgtttca tccttgaaga atggaggtta cgaacacgag 2940 ctgaaagcat ccggcatcct cgagctaatt ttggccaagc ttccagcgga gttacaaagc 3000 aggtggggga aaaagatagt caagagtcac ccagtctgcc taacccttca agacttctca 3060 agctggatcc acctaatcgt caaaggagaa atgatggcca agcattgttt gatttctcca 3120 gcatcatcgc ccccgtccag caaaagcaac caaaagccgg gccggcagca aacggaccga 3180 aagccgaagc atccgccttc aatcaacatc atcggtcaca aaccagccgc tttgccctca 3240 acagcagcca aacaaaacga tggaagaaaa accttgacgt gcctcctgtg cagaggagac 3300 caccgtcttt catcgtgtcc aaagttcgca gccttatctt tagaagaaag gatgaagatc 3360 atcaaggagt ttaactgctg tctacggtgc ctaactaaag gacattggac caaagaatgc 3420 ttcaataaaa cgaaatgcaa cttcgaagaa tgcaccgccc atcaccaccc acttcttcac 3480 ggcgctccaa gcctcaacgt gtcctttaaa gataaagcaa agacaccaaa agaagaaaaa 3540 tccgagacac aagaaaacaa gaaaacaacc aaaaagggcg ttggaacgca caccgtcgaa 3600 ggagattcaa tcaccaccct gctgctaacg gttccagtca ttattgaagc tcacggaatc 3660 caggtcaaca cagtaggcat cttggatcaa ggaagccaag catcattgat cctcgacaaa 3720 atatcaaaaa agctgaaact cgacggtcct acgcagtcat cacctttagc gactttccac 3780 ggaaacgatc cgaagaataa agtgaaatgt gtatctttca acattctcac agccgattcc 3840 agccgctctt tcgaagtcaa gtctgcgtac acagtcccgc gtctacaaat ccaaacaacc 3900 agccttaatt ggccggcagt caaacatcaa tggaatcatc tctgcgacat cgagccaatc 3960 aatgcggaca ccaaagaaat aggcgttctt ttaggacgtg atgtgctccg tgtccacgat 4020 gtgctagatt cccgctatcc agccgacgga gttgaagccc cggatggaat caaaactcat 4080 tttggttggt gtgtgaccgg accagtggca accgccattt tgcatccacc tcttcacatc 4140 aacgcgcttt ccatcaccca gcaactctcc gatcaagctc tacacgacgt cgtcaatcaa 4200 ttctggctca cagaaacttt cggtgttcgt ccgtcaacat ctttgccttc atcactggac 4260 gacaaagcag cattgaaaat actggagcag acgacgcgtc acactggaga aaggtacgaa 4320 gtcggactaa tgcttcgtga tcccaaaatc aacattccga ataatcgtga agtcgccgtt 4380 cgtcatttca actcgctgga gaagcgcttc gcccgtaatc cagcattcgc cgaaagatat 4440 tcccgcgtga tgaacgaata catttccctc ggacacgcag tactattgga cgtcaacaac 4500 tctatccgca aaggcttcac ctggtacctt ccgcatcatg gcgtcacaaa tccaaacaaa 4560 ccggaaaaag ttcgtgtggt attcaatcca tccgctcgtt acaaaggaac gtcgctgatt 4620 gaacaattgt tcaaaggtcc cgatctgcta acctgtctaa ttggcgtcct tcttcgcttc 4680 cggcagtttc ccgttccaat ctccggcgac atagagaaaa tgtaccacca ggtgctcgtt 4740 cccaaacagc aacaatctct tttccgcttt ctttggaaaa accctggcga cgtaggagaa 4800 ccaaaagaat accaaatgac agttcacgtc tttggtgccg tatagtctcc aaccagctgc 4860 atctacgcgc tgagaaaaac ggcagaagat tttggcagcc gcttccccga cgtcgccgat 4920 tcagtatcaa aaaatatcta cgtcgacaat tatctagatt ccacggagac agaagaagaa 4980 gctatcgcaa gattacgtga cgtttcagca ttgttaaagc taggcggctt caacatggtc 5040 caatggcttt catcttcccg atccgtctta gccaccgtcg atcattccga tctttcacgg 5100 tcactcgatc ttgacgccga caaacttcca atcgaacgga ctcttggcct gctctggaat 5160 tgtcagcaag attcgttcaa ctttaaatcg tcgatcaaaa tccaagccaa aaccaaacgg 5220 gaagttctcc aagaagtagc ttccgtcttt gacccactcg gatttctctc tccagtcata 5280 atgacagcca aaatcctgct tcaagacatt tggcgatccg gcgcggattg ggacgatcca 5340 ctgcctccta cactgcttga aatttggatg gcatgggcaa aggaactttc agcaatcgcc 5400 tccatcaaga ttccgcgttg cttccggcta caagaaaagc ccatctccta cgagcttcat 5460 gtttgttccg acgcatccga ggttggtttc ggcgcctgtg tctacctgcg agccgaatat 5520 ccaaacggcc atttccgcct caaccttctt ctcgcaaaag caagagttgc gccgttacgc 5580 cagctttcta ttccgcgttt ggagctacaa ggcgcagtgc tgggcgtccg attatgtgat 5640 tccgccatca aggagcttgg acccatcgca gcacaagtta tctactggtg tgattctcaa 5700 accgtgcttc aatggattca ctcaaaatcg tgcaagtatc acgcgttcgt cgcacaccgg 5760 attaccgaaa ttatcgaaag cagtgccgct tctcagtggc gccacatccc aggcgaattg 5820 aatccggccg atgattgttc ccgcggcatt ccagcgactc atctcacaac tcaacacaga 5880 tggttccgtg gcccagattt tctcgccttg cctcaatctt cttggccctc aacaggcgtg 5940 atcgccgaac catcatcaga cgatcccgaa gtgtcgccgg caaaatgggt cggcttcgtt 6000 caagtgacca acgaccatcc cgtcttcaac ttaatccaac aatcatcaaa tttacacaag 6060 ttgaaacgca tcgtcgcttg gctactccgg ttcgtcaaca atcgtcacat caatccgaaa 6120 aatcgacagt tggctcctta cgtcaaagct cctgaacttc gcgaagcact ccgcttcatt 6180 attcgtgtcg atcaacggca tttcttcgac gacgagttcc ggtgtttggc gaaaggacga 6240 ccagtgccaa cagcttcatc cctggccaat cttaccccat tcttagatcc gttcggaatc 6300 atcagagtcg gtggtcggct acaacacgcg tctctgccgg aagatacgaa acacccaatt 6360 gtgctatctt ccgacagtca actgtcaact atggtcatca ccgacacaca caaacttctc 6420 atccatgcat caacagagcc tacgcttcac gcacttcggg caaaatacca cgtccttcat 6480 cctcgagctt caatcaatcg tgtcattcga aagtgcttta cgtgcaagct tcgtaatagt 6540 caaccagcgc caccactcat gggcccactg cctgctagtc gcctgcaaac tcatcttccc 6600 gcttttacca acgtcggaat cgaccttttc ggcccatttt ccgtcgtcat cttgagaaga 6660 tcggtcaagc gctatggcgt gatgtttaca tgcctagaca cccgagctgt acatcttgaa 6720 gtcgccgatt ccctcgatat ggactctttc ataaacgctt tttcccgttt tgctgatcgt 6780 cgcggcgtcc ctcaactttg ctacagcgac aacggaacaa atctcgtggc cggtgagcaa 6840 gaaatcaacc gcgccctttc ccgctggaac gaagccgagt tggtgcagaa gattgaaaaa 6900 ctcaaaaacc aaccaattga atggagattc agtccacccg tagctcctca ctttggaggt 6960 tcatgggagc gattgattaa atccgcaaaa accgccctgc gaggtattct aaacaatcgg 7020 tctgtcaccg aagacgtcct cgtcacggct atcgtcggag ccgaagcgct cctaaattct 7080 cgcccgttga ctcacgtcag cgtcaacccg aacgacttag aagccatcac accaaaccat 7140 tttctgctgc tgcgggctca tccaggatgt aaccttgatt ctcctccaga cgccaaaatt 7200 tccagtcgaa gacgttacga acaagcccag cagctgataa cgcatttctg gaaccggtgg 7260 ctccgagaat atgtccccaa cggcatcgaa agaagaaagt ggcttcgttc gcgacggaat 7320 ctggccgtca acgatttagt tctcgtcgtc acgcccaatt cccctcgcgg atcatggccg 7380 atcggtcgcg tcgtcagtgt ccagcaaggt cccgacggat tcgtccgatc cgccgacgtc 7440 agagtcgttc gagccattcc atccacctcc aaacgtcgcc gtgcagcatc tgacgtcacc 7500 tgcaccactc atctgtacac ccggtcggtc cacaagcttt gcctcttaga agaagacgaa 7560 caagatgttt ccgaagacgg aaacagggcc ggcaa 7595 // ID Copia-113_AA-I repbase; DNA; INV; 4146 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-113_AA_; KW Copia-113_AA-LTR; Ty1_copia_Ele104; Copia-113_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4146 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1470-1994] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..4136 FT /product="Copia-113_AA-I_1p" FT /translation="MTNGQQVNGSGAGSQGTGSGGLPAAGGGAGVPAAGNG FT RINAVALPIIEKLKGRENYTTWAFAMKMTLIREGSWYTVSPAQGQAIDDEV FT SLRALATICLSLETTNYSLVQDAKDAKEAWDKLRNAFQDNGLTRRIGLLRK FT LTSIRLDECGSVESYVDELMSTAHKLVGIGFRVDDTWLAALLLMGLPEHYE FT PMIMGLEASGTALTSDAVKAKILQDVKLPSGPKTGGGDGALYSNPKSRRRG FT GNNGPTKKDDTCHNCKKRGHFAAKCPQKQNSPRTAGKALCSVFAMGEVSDK FT EWYFDSGATCHMSRSGDGFVEQHRMAHPVATANNGSMMSVAKGLVKLDLSE FT GPIEVKEVLQIPELATNLLSISKICQKGLKVVFDADKCEVREHDGIVIASG FT TQSGGLYKLNRKHEQAMLTPSTGIWHRRLGHLNRQSLRKLMAMADGIELAN FT DVIPECIACIEGKHARNPFPSSESRAEGLLDLVHSDLVGPIEVPSVGGSRY FT VMTFIDDASRKVFLYFLERKNEAFGAYENFKAMTERQTGRKLKVLRTDNGT FT EYVNKTFRTSMAKDGVRHEKTCPYTPEQNGVAERMNRTLIEKARSMLNDSR FT LPKEFWAEAVSTAAYLVNRSPTRSLQTTPEEAWTGKKPDLQHLRIFGSLAL FT VHVPKQRRKKFDAKSQKAVFVGYADGTKGYRVYDPVKRSVQISRDVIVVQE FT GEPQNLVGIGEEQQQIQFMELYTVDEPNASGGQHPVVEPSVGDDDVDEDDS FT EESAASDSGSSDLEFMDVLDSENEPALPSQLGRPAVEQQGLRRSDRERRTP FT GKYSDYVTYSSFSGEVVSPQSTPMCPKTSFDNPQSYEEALNGPDREKWIAA FT MREEIAALEENETWELTSLPSDRKAIRNKWVFKTKRGPTGDIERYKARLVV FT KGCSQRPGIDFDEVYSPVVRYSTIRYLLALAVKHDLDVEQMDAVTAFLQGE FT LSDEVIYMELPRGFAGTSKQVCRLRKALYGLKQSSRVWNNQLDQALKKFGL FT EQSKVDPCLYFRIDGENMIFVTIYVDDFMIFTNDAKLKQKLKSFLHNCFKM FT KDLDEANFCLGLRITRDRKNGKLWVDQEHYVDSILQRFNMANCHPVSTPAE FT ASVKLDKSMSPTTPEETREMKEVPYQEAVGCLTYLAQTTRPDISFAVNQVS FT QFNANPGRTHWNAVKRIMRYLRGTRSSRLTYSKERCADLVGYTDADWGGEP FT DSRKSTTGYVFTFMGGAVSWNVKRQPTVALSSCEAEYVALSRTVQEALWWH FT HFQSQIFGVRAIPIRCDNQSAICIARNQGYNPRTKHMDIKHHFVRDVLDQG FT VVELGYVNTKQQPADGFTKALVNQKLEENKKLIGFVA" XX SQ Sequence 4146 BP; 1050 A; 1035 C; 1268 G; 793 T; 0 other; agaggttatg ggcccaggat tggccatagc caagaagaaa aatttttttt tctcaagaag 60 tgaagccacg tggaagaagt tagttcaaga tgacgaacgg tcagcaggtg aacggaagcg 120 gagctggatc gcagggcact ggaagcggag gactgccggc cgctggaggc ggagccggag 180 tgccggccgc aggaaacggt cggatcaatg cggtcgctct accgatcatc gagaagctga 240 aggggcgtga aaactacacc acgtgggcgt tcgccatgaa gatgacgctc attcgagaag 300 gttcctggta cacggtgtcg ccagcccaag gccaggctat cgatgatgag gtcagtttgc 360 gtgcgttggc cacaatttgc ttgagcctgg agacgacgaa ctacagtctc gtccaggacg 420 caaaggatgc caaggaggcc tgggacaagc tgcggaacgc tttccaggac aacggtctga 480 cgaggagaat cgggttgctg aggaagctca catccattcg gctggacgag tgcggcagcg 540 tggaatcgta cgtcgacgag ctgatgtcca cagcgcataa acttgtggga atcggcttca 600 gggtcgacga cacatggctt gctgcattgt tgctgatggg acttccggag cactacgaac 660 ccatgattat gggacttgag gcgtcgggga ctgcgttaac gtcggacgcg gtgaaggcaa 720 aaattttgca ggacgtgaag ctgccgagtg gacccaagac cggcggcggt gacggagcgt 780 tgtactcgaa cccaaaatcg aggcggcgtg gcggcaacaa tggtcctacg aagaaggatg 840 acacgtgcca caactgcaag aagcgggggc attttgcagc caagtgtcca cagaagcaaa 900 actcaccaag aacggctggc aaagctttgt gcagcgtttt tgcgatgggc gaggtaagcg 960 ataaagagtg gtatttcgat tccggcgcca cgtgccacat gtctcggagt ggcgatggtt 1020 tcgtcgagca acaccgcatg gcgcacccgg tggcaacggc gaacaacggc agcatgatgt 1080 cagtcgcgaa gggtctggtc aagttggatc tctcggaagg cccgatcgaa gtgaaggaag 1140 tgctgcagat tccggaactg gccaccaatc tcctatccat cagcaagatt tgccaaaagg 1200 ggctgaaggt ggtcttcgac gcggacaagt gtgaggttcg tgagcacgat gggatcgtga 1260 tcgcatccgg tacgcagtca ggcggcctgt acaagctgaa ccggaagcac gagcaggcaa 1320 tgcttacacc gagcaccggt atctggcacc gacggcttgg tcacctcaat cgtcagagct 1380 tgcggaagct gatggcgatg gcagacggca tcgagttggc aaacgacgtc atccccgagt 1440 gcatcgcttg tatcgagggt aagcatgcac gtaatccttt tccttccagt gagtcgcgag 1500 ccgaagggct attggattta gtccactcgg acctggtcgg acccatcgaa gttccatctg 1560 tcggtggcag tcgttatgtg atgacgttca tcgacgatgc gagtcgaaag gtgttcctct 1620 atttcctgga acgaaagaac gaagcgtttg gggcctacga aaatttcaag gccatgaccg 1680 agcgccagac agggaggaag ttgaaggttc tgaggacgga caacgggacc gaatacgtca 1740 acaagacctt caggaccagc atggcgaaag atggagtgcg gcacgagaag acttgtccgt 1800 acacccctga gcagaacggg gtagcggaga gaatgaaccg gacactcatc gagaaggcca 1860 gaagcatgct aaacgattcg cgcctgccaa aagaattttg ggccgaggcg gtttcaacgg 1920 cggcgtacct cgtcaaccga agcccgacga ggtcactcca gaccactcct gaagaagctt 1980 ggaccggtaa gaagccggat ttgcaacatt tgcgcatttt cgggtcgcta gcactagtcc 2040 acgtcccgaa gcaacgaagg aagaagttcg acgccaaatc ccagaaagcc gttttcgtag 2100 gatatgccga tggtaccaag gggtaccgtg tgtacgatcc ggtcaagcgt tcagtccaga 2160 tcagccgtga cgtgattgtg gttcaagagg gagaaccgca aaatttggtc ggaatcggcg 2220 aggaacagca gcagatacag ttcatggagc tgtacacagt cgacgagccg aacgcaagtg 2280 gcggacaaca cccggttgtg gaaccgagtg tcggcgacga cgatgtcgac gaagacgatt 2340 ctgaggagtc ggctgcatcc gatagcggct caagtgatct tgagttcatg gacgtcctgg 2400 attccgagaa tgagcctgcg ctcccgtcgc aacttggtag gccagctgtt gagcagcaag 2460 ggcttaggcg cagcgaccgg gagcgccgaa ccccaggcaa gtattctgat tacgttacgt 2520 atagttcgtt ttctggcgaa gttgtttccc cccagtccac cccgatgtgc cccaagacgt 2580 cgttcgataa cccgcaaagc tacgaagaag ccctcaacgg accggatcgc gaaaagtgga 2640 tcgcggccat gcgtgaggag atcgcagccc tggaggagaa cgaaacttgg gagctgacca 2700 gtcttcctag tgacaggaag gccatccgaa acaagtgggt cttcaagacg aagcgaggac 2760 caaccggtga catcgagcgg tacaaggcgc gcttggtggt gaaagggtgc tcacaacggc 2820 caggcatcga cttcgacgaa gtgtattccc cggtggtacg gtattccacc attcgatacc 2880 ttctggcgct ggcagtgaag cacgaccttg acgtggagca gatggatgcc gtcactgcgt 2940 tccttcaagg ggaactatcg gacgaggtca tctacatgga gctaccgaga gggtttgctg 3000 gcacctcgaa gcaagtgtgt cggttgcgaa aggcacttta cgggctcaaa cagtcgagcc 3060 gtgtctggaa caaccagctg gaccaggcct tgaagaaatt cggactggag caatcgaagg 3120 tcgacccgtg cctgtatttc cggatcgacg gtgagaatat gattttcgtc acaatctacg 3180 tggacgattt catgatcttc accaacgatg ccaagctgaa gcagaagttg aaatcgtttc 3240 tgcacaactg cttcaagatg aaggacctcg acgaagcgaa tttctgcctg ggactgagga 3300 tcacgcggga caggaagaat ggcaagctgt gggtagatca ggagcactac gtcgacagca 3360 ttttgcagcg gttcaacatg gcgaactgcc acccagtctc cacgccagcg gaggcgagtg 3420 tcaagctgga caagtccatg tccccgacga caccggaaga gacgcgcgaa atgaaggagg 3480 taccctacca ggaggcagtg ggctgtctca cgtatctggc gcaaacgacg cggccggata 3540 tcagtttcgc ggtcaaccaa gttagccaat tcaacgcgaa tcccggacgt acccactgga 3600 atgctgtcaa gcgcattatg cggtacctgc gaggcacccg atcgagccga ctgacgtact 3660 ccaaggaacg ttgcgcggat ttggttggct acacagatgc cgattgggga ggagagccgg 3720 attccaggaa gtccaccacc ggatacgtct tcacgttcat gggaggagct gtgtcgtgga 3780 acgtaaaacg gcaaccaaca gttgcattgt cctcgtgcga agcggaatac gttgcgcttt 3840 ctcgcacggt ccaggaagca ctttggtggc accacttcca atcgcagatt ttcggcgtgc 3900 gagcgattcc catccggtgc gacaatcaat cagccatctg tatcgcacgg aaccaaggat 3960 acaatccacg aacgaagcac atggacatta agcaccattt tgtccgagat gtgctggatc 4020 aaggagtagt cgaactgggc tacgtgaaca caaagcagca acctgccgac ggattcacca 4080 aggcccttgt aaatcagaag ctggaagaga ataagaagtt aattggattc gttgcttaag 4140 gaggag 4146 // ID DNA4-2_AP repbase; DNA; INV; 769 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-2_AP. XX NM DNA4-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-769 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1739-1739 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4 bp TSD (TATA). CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 769 BP; 273 A; 117 C; 106 G; 272 T; 1 other; cagggtgatt caccaagcat gctcaccccc attttttcct ttaataatga atttattcaa 60 attctgattt tagaattttt aaatatactc atagaccata ttttcaaatt cttgaatttt 120 tttgtactaa ctacttaagg agtgtcctgt ggcgatacaa acttctgttt ttcaaatgag 180 aacccccttt ttactgtaaa ttatttagtg gataattttt ttgaaaatgt tgatgtaggt 240 acctaattca aaattcgaat gagtagtttc ttagttatta aaatgtttgt attaaggata 300 atagtcctta aaaatggttt tacataaata taaaatagac gtaaaactgg atctagtatt 360 tattatactt gggccatttt tacaaattat taatattgat agattaatat taatgactac 420 aattttcaaa tcaccatagc ccccatatct tataaaccca ttaccatgga ccttttatcc 480 ttagaacaca aagttgaata actcaaaaac tacycgtacg aattttgatt ctgatacgtc 540 aacaatttca gaaaaattat ccactgtata atttacagtt gaaaaggggt gttctcattt 600 gaaaaacaga agtttgtatc tccacaggac actccttaag tagtacaaaa aaatctcaag 660 aatttgaaaa tatggtctta aagtatattt aaaaattcca aaaatcagat tttgaataac 720 tgcattattg aagaaaaaag ggggtgagca tgcttggtga atcaccctg 769 // ID L1-36_AAe repbase; DNA; INV; 4386 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-36_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4386 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1389-1389 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 139..1140 FT /product="L1-36_AAe_1p" FT /translation="MTRRENTFRIDLANIPKKPSYEEIHNFVATKLGLQRH FT HVQRIQCSRSASCVFVKVADLELAQKVCEEHDAKHEIIVDNDKFVLRIRME FT DGAVEVKLFDLPEDISEKCITDFLSAYGEVLSVREQLWTEQYTFGGASTGV FT WIARMIVKRNIPSYVIIDGETTFLSYYGQLQSCKHCGEYVHNGASCVQNKK FT LLIQKMSADAPAKQSYANVAKKSAVTRAMIAPKPTKPSPAVQQQQEPITEA FT NATHTSSTIKPPAVFQTPALPATLKKSSGVRHHQQQHQPKEKQQQPPKQNR FT NNDGGETDESTSSNTSRRSRRHIEKKPRYDDVDISPDEAIGN" FT CDS 1143..4334 FT /product="L1-36_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEFTSYNFVTININNITSETKLNALRTFLRTMDSDIA FT FIQEIESDKLTLPGYNVICNVDHERRGTAIALKDYIRFSHVERSLDGRITS FT LRVHDTTLVCLYAHSGTSLRPQRERFFNHTLAYYLRHGTPHTVLAGDFNCV FT LRQCDSTGRNDSPSLRATVQQLQLSDVWIKLRPQTPGHTYISVNSSSRLDR FT FYISSSLCNQLRSVQTHVCSFSDHKAVTARLCLPSLGRAPGRGFWSIRPHL FT LTQENIEEFRYKWQFWTRQRRNYPSWILWWISFCKPKIKKFFRWKSKLAFD FT DFHREHQRLYTLLCEAYDNLPGRPHLRTTINQTKAKMLALQRKFTHMFIRI FT NETYVAGEAMSTFHLGERRRKRTTITQLRNEDGETTDQSDEIERSLLRYFA FT ELYTEPQIEETIDDSFNCERIIPEGDEINEACASEITTAEIWTAIRTSAAK FT KSPGTDGIPKEFYHRTFDIIHREVNLVLNEALSGRIPSEFVDGVIVLIKKK FT NGDGTARSFRPISLLNFDFKILSRVLKARIELVMRTHHLLSPAQKCANSPR FT NIFQATLSLKDRIARLIRNKQKGKMITWDLDHAFDRVRQSYLYNTMRSLGF FT HHGLVDLLARIDNLSSSRLLINGHLSEPFAIQRSVRQGSPLSMHLFVLYLH FT PLISKLENVSDGDLIVAYADDITVLSTSIDRIERMRELFSRFERASGAKVN FT WEKTLSLDVGFIAGNPLYLPWLRTENTVRVLGVIFANSIRLMVKLNWDAML FT VKFSRLVWLNSMRSLTIFQKVILLNTFVTSKVWYLASIISPYCVHTGKITA FT TIGRFIWSNTVTRIPLCQLAREREQGGLKLHLPTFKCKALLINRHIQELDS FT IPFYKSHILPDNPDQNIIPADLPDLKIIYSQYQQLPLQIQQSPSSDAIHRY FT FVEQTEVPRVERLHPEFDWKRIWASLRWRAFTSTEKSILYMTINEKGAHRE FT LLHTIGRADSNTCFHCNITIETMQHRFSDCPRVRPAWEFLQSKIRVVLGVW FT RTVSFSDLLRPVLNRICKVKRILILKLFAHYIMFVNNNENQIDINALEFYL FT NCELNNS" XX SQ Sequence 4386 BP; 1316 A; 1035 C; 917 G; 1118 T; 0 other; cagtttgtgt tcaacttcca tagcaggcgg acctttcgtt atcgctactg gaagccccgg 60 gcattttgat atcacgagtg ttcttccgtt tcaattagtg tcgcgaagtt ataccgtgtt 120 tcgcggttag ctttcggtat gacacgtcgc gaaaatactt ttcgcatcga tctcgcgaac 180 atcccgaaaa aaccaagcta cgaagagatc cacaatttcg tggcaactaa gcttggactg 240 caacgccacc atgtacaacg aattcaatgt agtcgtagtg cgagctgtgt attcgttaaa 300 gtcgctgatc tcgaattggc tcaaaaagtg tgcgaagaac atgacgccaa acacgaaatc 360 atcgtggata acgacaaatt cgtgctgaga atccgaatgg aggatggagc cgtggaagtt 420 aaactttttg atcttcccga agacatctct gagaaatgca tcaccgactt tctgagtgcc 480 tacggcgagg tactctctgt acgagagcaa ttatggactg agcaatacac tttcggaggt 540 gcttcgaccg gtgtgtggat agcacgaatg attgtaaagc gaaacatacc ttcctatgtg 600 attattgatg gagaaacaac cttcctgtct tattatggac agctccaatc ctgtaaacac 660 tgcggggagt atgtccataa tggagcgtca tgtgttcaaa acaaaaaact tttaattcag 720 aagatgtccg ctgatgcacc agcaaagcaa tcctacgcaa acgttgccaa aaagtcggct 780 gtgacaagag caatgattgc gccaaaacca acaaaaccat cccctgcagt tcagcagcaa 840 caagaaccta tcaccgaagc taatgcaact cacacttcct ccacgatcaa gccaccggca 900 gtgtttcaaa caccggctct ccctgcgacg ttgaaaaaat cgagcggcgt ccggcatcat 960 caacaacagc accaaccgaa ggagaaacaa caacagcctc cgaagcaaaa tcgtaacaac 1020 gatgggggtg aaaccgatga gtccacatca tccaacacca gcagacgatc tcgacgacac 1080 atcgaaaaga aaccacgcta cgatgacgtt gatatttctc cggatgaagc aattgggaac 1140 taatggaatt cacgagctat aattttgtaa cgatcaacat taacaacatc accagtgaga 1200 ctaaactaaa cgccttgcga acattccttc gcactatgga ctcggatatc gcattcatcc 1260 aagagattga gagcgataag ttgacgttgc ctggatacaa tgtaatctgc aatgttgacc 1320 atgagaggag aggaacggcg attgccctaa aagattatat aaggttttcg catgtagaga 1380 gaagtctcga tggccgcatc acatccttac gagtccacga tacaactctt gtttgcctat 1440 acgcacacag tggaacctct ttgcgaccac aacgagaacg gttttttaac cacacactag 1500 cctactatct acgtcacggt accccgcaca ctgtactggc tggagatttt aattgtgtac 1560 tccgacagtg tgactctact ggacgaaatg acagcccatc tcttcgagca acagttcaac 1620 agctacaact atcagatgtt tggatcaagc ttcgacccca aactcccggt catacttata 1680 tttcggtgaa ctccagctcc agattagacc gtttttacat tagctccagt ctctgtaatc 1740 agcttcgctc tgtccaaact catgtatgtt ccttctcgga ccataaggca gttacagcac 1800 gattatgtct cccttctctc ggcagagcac ctggtcgagg gttttggtcc atacgcccac 1860 acctcctcac tcaggagaat atagaggaat ttcggtataa gtggcagttc tggacaagac 1920 aacggcgaaa ttatccctcc tggatacttt ggtggatttc cttttgcaag cctaaaatca 1980 aaaaattctt tcgctggaaa tctaaactcg cctttgacga ttttcaccga gaacatcaga 2040 gattgtacac attactatgt gaagcgtatg acaatcttcc tgggagaccc cacttgcgca 2100 ctacgattaa ccagacgaaa gcaaagatgc tcgctttgca acgaaaattt acacacatgt 2160 tcatcagaat caacgagacg tacgttgcag gagaggcgat gtcaacattc cacctcggag 2220 aaagaaggag aaaacgaacg accataactc agctccgaaa tgaagatggt gagaccacgg 2280 atcaatctga tgaaattgag cgtagtctgc tacgttattt tgccgaactc tacaccgaac 2340 cgcaaatcga agagactatc gatgacagct tcaactgcga gaggataatt cctgaaggtg 2400 acgaaataaa tgaagcctgc gccagcgaaa tcactactgc cgaaatatgg actgccatcc 2460 gaacaagcgc agcaaaaaaa tcccccggca ccgatggaat cccgaaagaa ttttaccacc 2520 gtactttcga tataatccat cgtgaagtta accttgtatt gaacgaggct ttgtctggac 2580 gtattccttc cgagttcgtg gatggtgtga tcgtattgat caaaaaaaag aacggcgatg 2640 gaacagcacg ctcgtttcgg ccgatttctc tgctgaattt cgacttcaag atactctccc 2700 gagttctgaa agctcggata gaactcgtca tgcgaactca ccatttgcta agtccagccc 2760 aaaaatgcgc gaactctccc cgtaatatat ttcaggcgac tctctcgcta aaagatcgta 2820 tagctcgtct gatacgaaat aaacaaaaag gaaaaatgat cacgtgggac ctcgaccatg 2880 cattcgaccg agtgcgacaa tcgtatttgt ataataccat gcggtcactc ggcttccatc 2940 atgggctggt agatctactg gctcgaatcg acaacctttc ttcttctcgt ctcctcatca 3000 acgggcatct ttcggaacca ttcgctatac agcgatctgt tcggcaggga tcaccactaa 3060 gcatgcatct atttgtgtta tatttgcacc cactgataag taaattggag aatgtatctg 3120 atggagacct catcgttgca tacgcagatg atatcaccgt actttctaca tcaattgatc 3180 gtatagagcg aatgagagaa ctcttttctc gcttcgagcg cgcctctggg gcgaaggtga 3240 actgggagaa aacattatca ttggacgtag gcttcattgc ggggaatccc ctatacctcc 3300 catggctgcg aactgaaaat acagtgaggg tccttggtgt aattttcgct aactcaattc 3360 gtctcatggt gaaacttaac tgggatgcaa tgctcgtaaa gttttctcga ttagtctggc 3420 taaattcgat gcgttcatta accatatttc aaaaggtaat actattaaat actttcgtaa 3480 cttcgaaagt atggtatctt gcatcaataa tttcaccata ttgcgtgcac acgggcaaaa 3540 tcacggccac tattggccga tttatttgga gcaatactgt cacgagaata ccattgtgtc 3600 aactagcacg tgaacgtgaa caaggtggtt tgaagctaca cctgccgact ttcaaatgta 3660 aggcattact gatcaatcgg catatccaag agctagactc gattcctttc tacaaatctc 3720 acatactgcc cgacaatcca gatcagaaca tcattcctgc ggatctgccg gatttgaaga 3780 tcatatacag tcaatatcag cagctcccac tgcaaattca acaaagtccg tcatccgatg 3840 ctatacaccg ttattttgta gagcaaactg aagtaccgag agtagagagg ttgcatcctg 3900 aatttgactg gaaaagaatc tgggcgtcgc tgcgctggcg tgctttcaca tcaacggaga 3960 agagtatcct gtacatgaca ataaatgaaa aaggggcaca cagagaatta ctgcatacga 4020 tcgggcgggc tgacagcaat acatgctttc attgtaatat taccatcgaa actatgcaac 4080 acagattcag cgattgtccg agagtccgtc ctgcgtggga attcctacag agcaaaataa 4140 gagtagtatt aggcgtttgg agaacggtat cgttttcaga ccttttgaga cctgtcttga 4200 acagaatttg caaagttaaa cgtatcttaa ttctcaaact gtttgcacac tacataatgt 4260 ttgtcaacaa taatgaaaat caaattgata ttaacgctct tgaattctac ttgaactgtg 4320 aattgaataa tagttaagtt tattttatat acttttgacc gcaataaaaa tctttaaaaa 4380 aaaaaa 4386 // ID BEL-239_AA-LTR repbase; DNA; INV; 509 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-239_AA_; KW BEL-239_AA-I; BEL-239_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-509 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 934-934 (2011). XX DR [1] (Consensus) XX SQ Sequence 509 BP; 183 A; 70 C; 92 G; 148 T; 16 other; tggtcgatca gcagagcccg atgacaktgt acagtataga agaacacaaa amaggaagaa 60 ckatatgtca gatgataatg atttgcaaaa gtaaaggttk aagtgaattg agaagaattg 120 atgctagaak tgatttatta taattattga agtgcctatt gattacatwt tattaacagw 180 taaatacgat tcgtattcgg taagataatt twattgaact tttccttgtt gcataattaa 240 cgtggmcatg aactatattt agagttagca agttgcaaac atttgcccac attggattta 300 kcttttkcga agattactag gmgatataag cccaagagta agtcagatag ttagtattcc 360 taaaattaat aataattgaa actawttaac taaactgaat tcaatwtaac taggcaaaat 420 ctaattgcca taaggatcgt gccgtacgca acggagaagt gtatcaccta acctgttttt 480 cgmggwagaa caccaaaatt gtaagcaca 509 // ID R5-2_SM repbase; DNA; INV; 7404 BP. XX AC . XX DT 15-OCT-2007 (Rel. 14.07, Created) DT 16-OCT-2007 (Rel. 14.07, Last updated, Version 1) XX DE A family of planarian NeSL non-LTR retrotransposons - consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN1_SM; KW R5-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 8-6645 RA Jurka J.; RT "Non-LTR retrotransposon from Schmidtea Mediterranea."; RL Repbase Reports 7(10), 1092-1092 (2007). XX RN [2] RP 1-7404 RA Kapitonov V.V. and Jurka J.; RT "NeSL/R5 retrotransposons from Schmidtea Mediterranea."; RL Direct Submission to Repbase Update (04-AUG-2009). XX DR [2] (Consensus) XX CC ORF1 shows no significant similarity to any known protein [1]. CC Originaly this family was classified as a R2 non-LTR CC retrotransposon [1]. Later, based on phylogenetic studies, it was CC classified as a member of the NeSL clade [2]. Copies of R5-2_SM CC are inserted at the same target site CC (|AATAGTACTAGTGAGTGATTTTGCTAAGTCTACG) in copies of Polintons DNA CC transposons [2]. Two other families of NeSL retrotransposons, CC LIN2_SM and LIN3_SM, are also inserted at the same target site in CC Polintons [2]. XX FH Key Location/Qualifiers FT CDS 20..2362 FT /product="R5-2_SM_1p" FT /note="unknown." FT /translation="MTQSLPIVQDQFASNQLNNSSSELVIFKTILQNQIIF FT SQQLLDFISQRSNGQNQLPLSAKPIHPDLSLDLVSKKSTRDNQLPPSTNAM FT DPEKSFDECDKEALSNKILPLSAKLMELEEHILRMNSKFDIVLNKLSNSDF FT NDNNCNLNILNKINVDNNDDLNISSNYNLVSENAENSPLNPINTQLKNFKV FT TKTKDALNMKRFNCNYLSRILNKNERRNIINNYPCFISLLKKKNSKTKFDL FT INYELSLINKSKLNFKIKDCDSLFYRNGNLCISIKLSEIKKTDATKKIITK FT STNFQNKNKTIISNLINTDNYCSKLIKAPLAIKKFLFVFSHKLAETVSIKN FT ELHKRAVINKTLDNLKFQKQLPYFLVKSDLKYADGDFIMKDLLTSEDADNE FT SFNIGSSNLTRSQLDKLYFDFPEIKEMFFTEGMHKDSDKISSLINHLKVNK FT EFPKDISFSGGKFIINQKSNKIEPDLSNASDVMFNFGLFDLFKISKDDIEL FT ERLISLPASSNKIVEIIFHLNYLNLKGNIKYFSNITYGTYVLNYFRNVLLL FT HSIIDSEKCNSLIIDTLEIWGSELTDQIPSTSYFKFSCQEKLIKYKKWLLN FT GEIDKLKNIISPLTIKRINNLFNLFNKTISEKTNRLSMIDSLLTYINDFRP FT ERDDNVTVTAKRLLYFVIRDKKKFLLKNRDDFQISMKCRKNCTKEMSLIIF FT DLIESLENLITYEEHVNSVELTDKIDNFNKGSSSNVTNNEKKVVLPVDIDE FT EAVSSLELKITENKKMMTRQNLKVSKAMSK" FT CDS 2439..6965 FT /product="R5-2_SM_2p" FT /note="Ulp1 protease, RT and restriction FT enzyme-like nuclease." FT /translation="KILRDPKVWFTDDDIDQYLERHISNPTFAHLPCFIIS FT ILSSDIKENIISIPDAVFKAEVILCPLNINNAHWILFVYSKSLLESYFIDP FT IFANRNLFKNKQATLKINIALNKIFKLQVASSCHPFQNLIYQENSFDCGPF FT ICAYAILISQGLDTFPNNFIDGIRREVHDFQIFGSNKVSQVIGGGLPKNNL FT MGINFKKAIALTLEKQKRNINNSINIKPKFQKSCVSFLLSKWFKSNTNILV FT LSPDLTINLIAQNNNYIIENVNFRQFKDVQHVISIIPNENYWCLLLYSTRR FT MTCNIFDFRKTDISDRLTEIGGNFTDYLNSFFLSMKIKFQINHGILHRCFI FT SDDPTFHATAFIVFLEKLILDRDFQHDKIYKLLDDINTKECVEISLNLNLI FT NGKITDSILIKYFNNLSLSTDFVLLGFTMCTAILDDCTHYLNEHLNYKCLQ FT NAKVVFAIFAPPKSRETLMVIDYNTDEHYFLDPTTLDVSLNYTFICKVLVT FT KINEIRNSRGCAIRAGKCPHDVRGSGLLSKILICAFINNYAHDMSLENINL FT REIARIINSVLPIVSEINNKDKEKILTKAKETIKFDLQQRKNKVFELIISL FT ENADVNQIVESIIRQIPHLNNFTEIKSHEPYLGSKQNSNIISKYKSRSEFL FT INMKLTFYKIINDLPVTVRPEIQDILNQFSHEDPVSNSWNKILKDCRSTTN FT VLNLVDILPFEVIYELKRADNTSPGIDGIQYKDLALLDPEGILLSFLFNKI FT ISSKIIPTSWKTFKTILIPKPDKTDNYDKVSSWRPIALLSVIYKVFASILT FT TRLTSWVICNDILHIGQKGGSVHEGCVEHNSILSSALEHSKYSKNSPLAIA FT WLDIKDAFGSVPHDYLWSVLKTIGVSEEFITIVKLLYTDTQSFYSCGPIVT FT PNLSIKKGVKQGCPLSMILFSIAINPVLEAISRSCIEPFMIGDSPVQVLAY FT ADDIALVANNVENLQKIVDVAVEAATEIGFEYRPEKCGYMQLPRVNINGEI FT LINEKEIKKLLSKEFYQYLGVPVGEDNDQSPYAILDKVVSDTKKIADSGLF FT GWQKLKAYKIFIHSRLTFAFRTREIKTMALSASQGNTNSCGNNSSKLRGHL FT RRILNLPHNSETVYLYNSTENGGASCVDLLDEYHTQTIVHFFRLFTSNCDY FT SRKVNIDSLKFVTGPRLGIKEPTLQQSFDWINGAETKLNHGGRKTRFQRAR FT TSIIYFKREHSISVSFHIVKEQVFLYLITKSHGTFILTTKFRKAISKIFHS FT ALYDSYMQKWQLSNKSDMIAAAIKLSPQINKAIFKSKLGEFAWNFIHRART FT NTLNINAKPNRKGDSRLCRRCEKEDETMSHALQSCKIHQILTLERHNDCIK FT IITNNLKNPNFIVVVDHSCSLVLNSKQRVDLIITDTERKRIFMIDIKCPMD FT SVSHFELTDKANIEKYSLLQSELQVAKPDYQVELYTCIIGALGSIPPSTYD FT LICKLGIEPLQTEGLLKECAMSNILHSAKIWYYHVHGILPDFCN" XX SQ Sequence 7404 BP; 2643 A; 1160 C; 1135 G; 2466 T; 0 other; tctcctctca cacaacacga tgacgcaatc tcttccgata gtgcaagatc aatttgcatc 60 aaatcaattg aataatagtt catcagaact tgttattttc aagacaattc tacaaaatca 120 aataattttt tcccagcaat tattggattt tatttctcaa agatcaaatg gacaaaatca 180 attgccactt tcggcgaagc caattcatcc agatctatcg ttggatttag tatcaaagaa 240 atcgactcgt gataatcaat tgccaccttc aacgaatgca atggatcctg aaaaatcgtt 300 cgacgaatgt gataaggaag cactttcaaa caaaatattg ccactttcgg cgaagctaat 360 ggaacttgaa gagcacattt tgcgaatgaa tagtaagttt gatattgttt tgaataaatt 420 aagtaattct gattttaatg ataataattg taatttaaat atactaaata aaattaatgt 480 cgataataat gatgacttaa atatctcatc taattataat cttgtttcag aaaatgcaga 540 aaatagtcct ttaaatccta ttaatacaca gcttaaaaat tttaaagtta ctaaaactaa 600 ggatgcttta aatatgaaaa gatttaactg caattacctt tcacgtattt taaataaaaa 660 tgagcgtcgt aatattatta ataattatcc atgcttcata agtttattga aaaagaaaaa 720 ttctaagacc aagtttgatc ttattaatta tgaactttcc ttgattaata aatcgaaact 780 taattttaaa ataaaagatt gtgatagttt attctataga aatggcaatc tctgtatttc 840 aatcaagtta agtgaaatta aaaaaacaga tgctactaaa aagataatta caaaatcaac 900 aaattttcaa aataaaaata aaaccattat ttcgaacctt attaatactg ataattattg 960 ttccaagctg attaaagctc cattagcaat aaagaaattt ttatttgtat tttcacataa 1020 attagccgaa acagtttcta ttaaaaacga actgcataaa agagctgtca ttaataaaac 1080 tttggataat ttaaaatttc aaaaacaatt accatacttt cttgtaaaat ctgacttaaa 1140 atatgctgat ggtgacttta ttatgaagga tcttttaact tctgaagatg ctgataatga 1200 atcttttaat attgggtcat ctaatttaac aagatcgcag cttgataaac tttattttga 1260 ttttccagag attaaagaga tgttctttac tgaaggcatg cacaaagatt ctgataagat 1320 aagtagttta attaatcact taaaggtaaa taaagaattt ccaaaagata tttcattttc 1380 tggtggtaaa ttcattataa atcaaaaatc aaataaaata gaacccgatt taagcaatgc 1440 ttctgacgtt atgtttaatt ttggattatt cgacttattt aagatatcta aagacgatat 1500 agaacttgag aggcttattt cattacctgc ttcttcaaat aaaattgttg aaataatttt 1560 ccacctaaat tatttaaact taaagggtaa tataaaatat tttagtaaca taacatatgg 1620 tacttatgtt ttaaattatt tccgtaatgt tttactatta cattctatta ttgatagtga 1680 aaaatgcaat tcgttgataa ttgatactct tgaaatatgg gggtctgaat taactgatca 1740 aattccttct acttcctatt ttaaatttag ttgtcaagaa aaactaatta aatataaaaa 1800 gtggttactt aatggcgaaa ttgataagtt aaaaaatata atttcaccac ttacaattaa 1860 aagaataaat aatttattta atttatttaa taaaacaatt tcggaaaaaa ctaaccgatt 1920 gagtatgata gactcgctcc ttacttacat caatgatttt cgtcctgaaa gagacgataa 1980 tgtcacagtt acggcaaaaa gattattgta ttttgttatt agagataaga aaaaatttct 2040 acttaaaaat agagacgact ttcagatatc aatgaaatgt cggaaaaact gtactaagga 2100 aatgagcttg attatttttg atctcattga atctcttgaa aatttgataa cttatgaaga 2160 acatgttaac tctgtcgaac tcaccgataa aattgataac tttaataaag gaagtagttc 2220 taacgtaaca aataacgaaa aaaaagtagt tttacctgtt gatattgacg aggaagctgt 2280 atcttctttg gaattaaaaa ttacagaaaa taaaaaaatg atgactagac aaaatctaaa 2340 agtgtcaaag gcaatgtcca agtgaaggta cgtgccagca tcagtcttag aatctgccac 2400 aatctctgaa gagactgatg cagaatgcac taaattaaaa aatacttcgt gatccaaaag 2460 tgtggtttac cgacgatgac attgaccaat accttgaacg gcatatcagt aatcctacat 2520 ttgcacattt accgtgcttt attattagta ttctatcttc agacattaaa gaaaatatta 2580 tctcaattcc agatgcagta tttaaagctg aggttattct ttgtccgcta aatataaaca 2640 atgcccattg gattttattt gtgtacagca aatcgttgct tgagtcatat tttattgacc 2700 caatatttgc aaatagaaat ctgtttaaaa ataaacaagc tactcttaag attaacattg 2760 ctttaaataa gattttcaaa cttcaagtgg cttcatcttg tcaccctttc caaaatctta 2820 tatatcaaga gaacagtttt gattgcggtc cttttatttg tgcttatgca attcttataa 2880 gtcaaggttt ggacactttt cctaacaatt ttattgatgg gataagacga gaggtgcatg 2940 attttcaaat ttttggctca aacaaagtat cccaagttat cggtgggggc cttcctaaaa 3000 ataatttgat gggaatcaat tttaaaaaag caattgcgct tactcttgaa aaacaaaaac 3060 gaaacatcaa taacagtatt aatattaaac caaaatttca aaagtcatgt gtgtcattcc 3120 tcttaagtaa atggtttaaa agcaatacta atatacttgt tctttccccg gatctaacta 3180 tcaatttaat agcccaaaat aataattata tcattgagaa cgtcaacttt cggcagttca 3240 aagacgttca acatgtcatt agcataattc ccaatgaaaa ttattggtgt ttacttctct 3300 attcgacccg tcgtatgact tgtaatattt tcgattttcg aaagactgat atttctgata 3360 gactaaccga aataggagga aactttacgg attatttaaa tagttttttc ttgagtatga 3420 aaataaaatt tcaaataaat catggtattc tccaccgctg cttcatttca gatgatccaa 3480 cttttcatgc cactgctttc attgtgtttt tggaaaaatt gattttagat cgggattttc 3540 aacatgataa aatatataag ctacttgatg atatcaacac taaagagtgt gttgaaattt 3600 cattaaattt aaatctaata aatggaaaga ttaccgacag cattttaata aaatatttta 3660 ataatttaag tttatcaaca gacttcgttc ttttgggttt cacaatgtgt actgctattc 3720 ttgacgattg tactcattat ttgaatgaac atttgaatta taagtgcctc caaaacgcga 3780 aagttgtatt tgccattttt gctccaccaa agtctcggga aacactaatg gtaattgatt 3840 ataacactga cgagcattac tttctggatc caacaacact tgatgtcagc ctaaactata 3900 catttatttg taaagtatta gttactaaaa taaatgaaat aagaaactca cgtggttgtg 3960 ccattagagc aggcaaatgt ccacatgatg ttcgtggttc aggtttgctc agtaaaattc 4020 taatatgtgc tttcatcaac aactatgcgc atgatatgtc ccttgaaaac attaatttga 4080 gagaaatagc aagaattatc aactctgttc ttccaattgt ttctgagatc aataacaaag 4140 ataaagagaa aattttaaca aaagcaaaag aaacaattaa atttgactta caacagcgta 4200 aaaataaagt atttgaatta attattagtc ttgaaaatgc tgatgttaac caaatagttg 4260 aaagcattat tcgacaaatt cctcacttaa ataatttcac tgaaattaaa tcccacgaac 4320 catacctagg aagtaaacaa aactcaaata taataagtaa atataaaagt agatcagaat 4380 ttttaattaa tatgaaatta actttttata aaattattaa tgacttacca gtcactgtca 4440 ggcctgaaat tcaggatatt ctgaaccaat ttagccacga ggacccagta tcgaattcat 4500 ggaataaaat tcttaaagat tgtcgaagca cgacaaatgt tctcaatctt gtagatattt 4560 taccgtttga agttatttac gaattaaaga gagctgataa cacttcaccg ggtattgatg 4620 gtattcaata caaagatctt gctcttcttg accctgaagg tattttgtta agtttcctgt 4680 ttaataaaat catcagtagt aaaatcattc cgactagctg gaaaactttc aagactatac 4740 taattccgaa acctgataag acagataatt atgataaagt ttcatcttgg cgaccaatag 4800 ctctgctgtc tgttatttat aaagtctttg cgtctatttt aactacacgc ctaacatcat 4860 gggtgatttg taacgatatt ttgcatattg ggcaaaaggg tggctctgtc catgaaggtt 4920 gtgtcgagca caattcaata ctttcttctg cacttgagca ttcgaaatat agcaagaatt 4980 ccccattagc tattgcttgg ttagatatta aggacgcatt tggaagtgtt cctcatgact 5040 acttgtggag tgttctgaaa actattggcg ttagcgaaga atttattact attgtcaaat 5100 tactttacac cgacactcaa tctttctaca gctgtggccc aattgtaaca cctaatcttt 5160 caatcaaaaa aggagttaaa caaggatgtc ctctttcaat gattcttttc tcgattgcca 5220 ttaatcctgt tcttgaagca ataagtagat catgtattga accatttatg attggtgatt 5280 cgcccgttca agttctagct tatgcagatg acattgcact tgttgcaaac aatgtcgaga 5340 accttcaaaa aattgttgat gttgctgttg aagctgctac agaaatagga tttgaatacc 5400 gaccagagaa gtgtggctac atgcagcttc ccagagttaa catcaatggt gagatcttga 5460 taaatgaaaa ggagatcaaa aagttgttgt caaaagaatt ttatcaatat cttggtgttc 5520 cagtaggcga ggacaatgat caaagtccct atgcaattct tgataaagtt gttagtgaca 5580 caaaaaaaat tgcagattcc ggtttatttg gttggcaaaa actcaaggcc tacaagattt 5640 ttattcattc tcgactaaca tttgcattcc gaacacgtga aattaaaact atggctttat 5700 cagcttctca aggaaatact aattcctgtg gcaataactc aagtaaattg cgtggccatc 5760 tgagaagaat tttgaacttg ccgcacaatt ctgagacagt ttatttatac aactcaactg 5820 aaaatggtgg agcttcgtgt gtcgatttgc ttgatgaata tcatactcaa acaattgtgc 5880 actttttccg actctttact tccaattgcg attactcgag aaaagttaat attgactctc 5940 taaagtttgt aacgggtcct cgtcttggaa tcaaagagcc tacattacaa caaagttttg 6000 attggattaa tggtgctgaa actaaattga atcatggtgg aaggaaaacg agatttcaac 6060 gtgcacgtac ttcaataatt tactttaaaa gagaacactc aatttccgta tcttttcata 6120 ttgttaaaga gcaagttttt ttatatctga taacaaagag tcatggaact ttcatactta 6180 cgactaagtt tcgaaaagca atttctaaga tatttcactc tgcattgtac gactcatata 6240 tgcaaaagtg gcaactgagt aataaaagcg acatgattgc agcggctata aaattatctc 6300 ctcaaataaa taaagcaatt tttaaaagta aacttggcga atttgcatgg aactttatac 6360 atcgagcgcg aactaataca cttaacatca atgcaaagcc taatagaaaa ggggattcta 6420 gactttgtag acgttgtgag aaagaggatg aaactatgtc tcatgcctta cagtcatgta 6480 agattcacca aatactgaca ttagagagac acaatgactg tataaagata ataacgaata 6540 atctaaaaaa tcctaatttt attgtcgtgg ttgatcattc ttgttctctt gtcctgaatt 6600 ctaaacaacg cgtcgatctt ataataactg atactgaaag aaaaagaata ttcatgattg 6660 atattaagtg tccaatggat tctgtttcac attttgagtt gactgataag gcaaatattg 6720 agaaatatag cttgcttcag agtgaattac aggtagccaa acctgattat caggtagagc 6780 tttacacttg tattattggg gcattagggt cgattcctcc atcaacttat gatctgattt 6840 gtaaacttgg cattgaacca cttcaaacag aaggtctgct aaaagaatgt gcaatgagca 6900 atatcctgca ctcagctaaa atatggtatt accatgtgca cggcattcta ccagacttct 6960 gcaattaatt taaaataatg gatatttaaa ctaacttaaa ctttttcatt gagatcagaa 7020 ttattttttt tgtttgaaat gattcagact acattcgtaa tttatcttta aattcaatat 7080 tattaacctg acttttgcct ttgtaaaagt cgtaattgaa tttagaaaac ttagaaaatc 7140 aaatcagagc ttagttcgta gtatttatta ctgcataatt tatatttgca atgccggatg 7200 acgacgccga tgaatcctca tatcagatca agcaaaagaa aattcagaag acgccaattg 7260 cagaagagac tgattgatca aatacttgat aacttacatt tgaattcatc cttctctact 7320 acgattgcag tatgaattgc atgcttcatg ttgagcatat tcttttcact gtgaaacaaa 7380 cttgttatat ttaataaaat tttg 7404 // ID JAM1B_AAe repbase; DNA; INV; 3399 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An RTE non-LTR retrotransposon from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; JAM1B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3399 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1440-1440 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >85% CC identity. The consensus is ~89% identical to JAM1. XX FH Key Location/Qualifiers FT CDS 430..3375 FT /product="JAM1B_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="KNQATNNQRENTNRDNRRRPQRRKGTSDWKLGTWNCK FT SLNFIGSTRILADVLKDRGFGIVALQEVCWKGSMVRTFRGNHTIYQSCGNT FT HELGTAFIVMGDMQRRVIGWWPINERMCKLRIKGRFFNFSIINVHSPHSGS FT TDDDKDAFYAQLEREYDSCPSHDVKIIIGDLNAQVGQEEEFRPTIGKFSAH FT RLTNENGLRLIDFAASKNMAIRSTYFQHSLPYRYTWRSPQQTESQIDHVLI FT DGRHFSDIIDVRTYRGANIDSDHYLVMVKLRPKLSVVNNVRYRRPPRYDLE FT RLKQPDVAAAYAQHLEAALPEEGELDEAPLEDCWRTVKAAINDAAESNVGY FT VGRSRRNDWFDEECQEVLEEKNAARAVMLQQGTRQNVERYRRKRQQQTRLF FT REKKRRLEETECEEMEQLCRSQETRRFYQKLNASRNGFVPRAEMCRDKDGS FT ILTDEREVIERWKQHFDEHLNGAESTGNEGRDNGGNAFVSTAEDGNQPAPT FT LREVKDAIHQLKNNKAAGKDGIGAELIKMGPERLAICLHRLIGTIWETEQL FT PEEWKEGVICPIYKKGDKLDCENFRAITILNAAYKVLSQIIFRRLSPVVNE FT FVGSYQAGFVDGRSTTDQIFTVRQILQKCREYQVPTPLGIMDENSFPGKLT FT RLIRATMEGVQNCVKVSGEHSSSFGSHRGLRQGDGLSCLLFNIALEGVMRR FT AGLNSRGTIFTRSSQFVCFADDMDIVGRTFEKVADLYTRLKREAAKVGLVV FT NAAKTKYMLAGGAERDRARLGSSVTIDGDTFEVVDEFVYLGSLLTADNNVS FT REIRRRIISGSRAYYGLHKKLRSKKIHARTKCTMYKTLIRPVVLYGHETWT FT MLEEDLQALGVFERRVLRTIFGGVQENGVWRRRMNHELAQLYGEPSIQKVA FT KAGRIRWAGHVARMPDSNPAKMVFASDPVGTRRRGAQRARWADQVRIDLAS FT VGQNRGWRDAATNRVLWREIVDSVLSV" XX SQ Sequence 3399 BP; 899 A; 813 C; 1014 G; 671 T; 2 other; ggtaaatggc tgggcatggc gtaccattgg tacctcgcgt acctgcagga ataaaataga 60 cccctttgtg tggtccttag cctcttgccc agcaactcct atccctacct cctcgtggta 120 ctggccgggg tacgagtaac cttagggaag atcgggtaac caacccccgg tgggaactct 180 ggtcgtatgc tgacagggaa gggggggttt gcttttgctt ttgcttctgc aaacctggag 240 cgtctgtact ccatgttagg agcggctcac aacagcgtct gttccccatg tcaggggcgg 300 ctgatcatcg tccgagtgcc agagaaggac tctaagctaa actgcgcact atggtcctcc 360 gaacatttag ggggaatggt cctccggaaa tctagggggt tggtgtcagg ccctgcaagc 420 cagccgtaaa aaaatcaagc aacgaataat caacgagaga atacgaaccg ggacaatcgg 480 cgaagaccac agcgacgtaa agggactagc gattggaagc tcggtacgtg gaactgtaaa 540 tctctcaact tcatcgggag cacacgcata ctcgccgacg tgctgaagga ccgcggattc 600 ggcatcgtag cgttgcagga agtgtgttgg aagggatcaa tggtgcgaac gtttagaggt 660 aaccatacca tctaccagag ctgcggcaac acacacgagc tgggaacagc ttttatagtg 720 atgggtgata tgcagaggcg cgtgatcggg tggtggccga tcaatgagag aatgtgcaag 780 ttgaggatca aaggccggtt cttcaacttc agcataataa acgtgcacag ccctcactcc 840 ggaagcactg atgatgataa agacgctttt tacgcgcagc tcgaacgcga gtacgacagc 900 tgcccaagcc acgacgtcaa aatcatcata ggagatctaa acgctcaggt tggccaggag 960 gaggaattca gaccgactat tggaaagttc agcgcccacc ggctgacgaa cgaaaacggc 1020 ctacgactaa ttgatttcgc cgcctccaag aatatggcca ttcgtagcac ctacttccag 1080 cacagccttc cataccgata cacctggaga tcaccacagc agacagaatc acaaatcgac 1140 cacgttctga ttgatggacg gcacttctcc gacattatcg acgtcaggac ctatcgtggc 1200 gctaacatcg actctgacca ctatctggtg atggttaaac tgcgcccaaa actctccgtc 1260 gttaacaacg tacggtaccg acggccgccc cggtatgacc tagagcggct caagcaaccg 1320 gatgtcgcag ckgcgtacgc gcagcacctc gaggctgcat taccggaaga gggtgagctg 1380 gatgaagccc ctcttgagga ctgctggaga acagtgaaag cagccatcaa cgatgcagct 1440 gagagcaacg tcgggtacgt gggacggagt cgacggaacg attggttcga cgaggagtgc 1500 caggaggttt tggaggagaa gaatgcagcg cgggcggtca tgctgcagca agggacccgg 1560 cagaacgtgg aacgctatag acggaaacgg caacagcaga cccgcctctt tcgggagaaa 1620 aaacgccgcc tggaggagac ggagtgcgag gagatggaac agctgtgccg gtctcargaa 1680 acgcgtaggt tctatcagaa gctcaacgca tcccgcaacg gcttcgtgcc gcgagccgag 1740 atgtgcaggg ataaggatgg gagcattctg acggacgagc gtgaggtgat cgaaaggtgg 1800 aagcagcact tcgacgagca cctgaatggt gctgagagca caggcaatga aggacgggac 1860 aacggaggaa atgccttcgt cagtactgcg gaagatggaa accaaccagc ccccactttg 1920 agggaggtta aggatgccat tcaccagctc aagaacaata aagctgctgg taaggatggt 1980 atcggagctg aactcataaa gatgggcccg gagaggctgg ccatttgtct gcaccggctg 2040 ataggcacaa tctgggaaac agaacagcta ccggaggagt ggaaggaagg ggtaatatgc 2100 cccatctaca agaaaggcga caagttagat tgtgagaact ttcgagcgat caccattcta 2160 aatgcggcct acaaagtatt atcccagatc atcttccgtc gtctgtcacc tgtagtaaac 2220 gagttcgtgg gaagttatca agccggcttc gttgacggcc gatcgacaac ggaccagatc 2280 tttactgtac ggcaaatcct ccaaaaatgt cgtgaatacc aggtcccaac gcctttagga 2340 atcatggacg agaacagctt tcccgggaag ctcacgagac tgataagagc gacgatggaa 2400 ggtgtgcaaa attgtgtgaa ggtttcaggc gaacactcca gttcgtttgg atcccaccgg 2460 ggactacgac aaggtgatgg actttcgtgc ctgttgttca atattgcgct agaaggtgtt 2520 atgcggagag ccgggcttaa cagccggggt acgattttta cgagatccag tcaatttgtt 2580 tgcttcgcgg atgatatgga catcgtcggc cgaacatttg aaaaggtggc agacctgtac 2640 acccgcctga aacgcgaggc agcaaaagtt ggactggtgg tgaatgcggc caagacaaag 2700 tacatgctag ctggtggggc cgagcgcgac agggctcgcc taggtagcag tgttacgata 2760 gacggggata cgttcgaggt ggtcgacgag ttcgtctacc ttggatcctt gctgacggct 2820 gacaataacg ttagccgtga aatacggagg cgcatcatca gtggaagtcg ggcctactat 2880 ggcctccaca agaagctgcg gtcaaaaaag attcacgccc gcaccaaatg taccatgtac 2940 aaaacgctca taaggccggt agtcctctac gggcatgaaa cgtggacgat gctcgaggag 3000 gacctgcaag cacttggagt cttcgaacgt cgggtgctta ggacgatctt cggcggtgtg 3060 caggagaacg gtgtgtggcg gcgaaggatg aaccacgagc tcgcccaact ctacggcgaa 3120 cccagtatcc agaaggtggc caaagctgga aggatacgat gggcagggca tgttgcaaga 3180 atgccggaca gcaaccctgc aaagatggtg ttcgcttcgg atccggttgg tacaagaagg 3240 cgtggagcgc agcgagctag gtgggcggat caagtgcgta tcgatttggc gagcgtgggg 3300 cagaaccgag gatggagaga tgcggccacg aaccgagtat tgtggcgtga aattgttgat 3360 tcagtgttat ctgtgtagat gttaactaaa taaatgaat 3399 // ID Gypsy-9-LTR_HM repbase; DNA; INV; 206 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-206 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1985-1985 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 206 BP; 63 A; 21 C; 45 G; 77 T; 0 other; tgatgtaatt gcgtaattcc ggttcacggt tatgttgaat aaagaaacgt gcaagtgagt 60 ttccattttg gtcgggacat tatattgaga gacacgaagc aagaacttta cttgtagtgt 120 ggtgttataa ataaattaca tattgagtta ttactaagag ttgttattat attcagtggt 180 gttattgtta ctattaggtt ataaca 206 // ID Gypsy-9_AA-I repbase; DNA; INV; 4376 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_AA_; KW Gypsy-9_AA-LTR; Gypsy-9_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4376 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 987-987 (2011). XX DR [2] (Consensus) XX CC Positions [1624-2064] - Reverse transcriptase CC Positions [3401-3865] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 733..2895 FT /product="Gypsy-9_AA-I_1p" FT /translation="MNRNVSVILGTSFSTGRTQQSFQGKFRSIPVCYACGK FT RGHVRGSDFCPARSAACLKCKGTGHFAKQCLKRVNDSNRNPIPTKRIRVVQ FT DDNPDKKDSEYIFYAMGENIFMFNVGGIEIPMVIDSGVAANIISQKTWEEM FT KCMKVSVWDMSTDVDRNFTCYASNEPMHISGSFMTMIQAGERKCDAKFYVA FT RSGQQNLLGDETAKALHVLKVGFDIGSVSKSPQAEFPKFKGVTVEIPIDET FT IQPVQQAFRRAPYALEDKVNEKLKTLQEQGIIERVTGPSSWVSPMVPVLIT FT SGDIRLCIDMRRANQAVIRETHLLPLVDEILGSVTGAKRFSKIDVKDAYHQ FT LEISERSRPITTFITKNGLFRHFVQNISFRYKRLMFGISCAPEIFQKTMKS FT ILAGLEGVIIYLDDVVVFGSTEEEHYNRLQALLKRLQEYNVLLNKDKCIFG FT TNEIEFLGHVLSEKGVSPTESRIEAIRKFRELKTVAELRSFLGLITYVGRF FT IPNLAAKTEPLRMLLRKSAAFSWKAEQSKAFNMIKEAVCDIRCLGFFDRRD FT RTKLIVDASPEGLGAILLQENKDGQNRIISFASKALTDLERKYFQTEREAL FT AIVWGVERFSLYLLGTKFVLITDCKALKYLFSPRSRPCPRIERWVLRIQSY FT NYEVKYEPGVTNLADALSRLSIDNKKPFDTTAECYIKRKITIIVLQNFKTS FT FSAQNRSNVHENLSLHCPDCL" FT CDS 3134..4117 FT /product="Gypsy-9_AA-I_2p" FT /translation="MFKCFKSEIHSAKGVLLRGDRLIIPEALRLQVLKCAH FT DGHPGMTVMKRRLRQKVWWPKLDDSVDKFVKQCGSCTLVSAIGPPEPMLRT FT RMPDKPWGEVAVDFLGPLPSGHSLLVLIDYFSRFTEVIVMKQTTTDLTIQA FT LFETFSRFGVPEALKSDNGPQFVSDSFKAFCREFGIEHQKTTPYWPQTNGE FT VERMNNTILKRLKISQAENPAKWKWDLRSFLLMYNSTPHSTTGEAPSALMF FT GRILRDKIPSILNSNRPWVEDIRDRDWERKVSTAESADHDRNALPNQLEEG FT DIVVAKRITKENKLSTNYNKERFKKVNRHGSEAEIR" XX SQ Sequence 4376 BP; 1444 A; 810 C; 964 G; 1158 T; 0 other; ttggcgacga gtataaatgt ttagtttatt attccttctt tctttttggt tttataacca 60 aacttgatat attatatcag gagcatcatg gatactcgca tcaaaccatt ccaaactacg 120 ttggaatctt cgcaattgcc attggcatgg gccaaatgga aacgtgacat tgaatgctat 180 tttgaatcag aaaagatcga agggcagtat gaaaagcgtt caaaactgct ctacctggga 240 ggttcagatt tgagagatat tttcgataat cttccggaag tcgaaacagt gtcaatggtt 300 ttgatcgatc ctccttatta cgatgcagca atcgcaaaat tagatgcgca tttcgagcca 360 tatcgtagaa ggacctacga acgtcatcta tttcgtcaga ttactcagaa atcgtccgaa 420 cgttttgcgg atttcgtgct gaggttgaga acacaagtaa agagatgtga atacgacaaa 480 ccagaggaga tgattcttga tcagatagta gagaagtgct attctgacaa gctccgacag 540 aaattgttga aacgcgacat gttgttgact gaagttgaag gcctaggtac aagtttggag 600 gaaagtgagc ggaagatgaa ggaattcgga aaacattgtg agcctagcga cccgatcagt 660 aaagtaacta aatggaaacc tgaacataca aaacgtgact cagcgggtaa attattgtaa 720 ttttgtttta aaatgaatcg taatgtttca gtcattttag gtacaagctt cagcacaggc 780 agaacacaac aatcatttca aggaaaattt cgatccattc cagtatgtta cgcatgtgga 840 aaacgtggtc atgtaagagg atctgacttt tgtcccgcta gatcagcagc gtgtcttaaa 900 tgtaagggca ccggtcattt tgccaaacag tgtctaaagc gagtaaatga tagcaaccga 960 aatccgattc cgacaaaacg tataagagta gttcaagacg ataatcctga taagaaagac 1020 agcgagtaca tattctacgc catgggagag aatattttca tgtttaacgt aggcgggatc 1080 gaaataccga tggttatcga ttctggcgtg gctgcaaaca tcatcagtca gaaaacgtgg 1140 gaggagatga aatgcatgaa ggtatctgtg tgggacatgt ctactgatgt tgatagaaac 1200 tttacatgtt acgcttcgaa cgaaccgatg cacatctccg gtagcttcat gacaatgatc 1260 caagctggag agagaaaatg tgatgcaaag ttctatgttg caagaagtgg acaacaaaac 1320 ttgttgggcg acgaaactgc aaaggcattg catgttttaa aagttggttt cgacattggt 1380 agtgtttcca aatccccgca agccgagttt ccgaaattca aaggcgttac ggtagaaata 1440 ccaattgatg aaacgattca acccgtgcaa caagctttcc gtcgtgctcc atatgctctt 1500 gaagacaagg tgaacgagaa gctaaagacg ttgcaagaac aaggaataat cgaacgagta 1560 actggtccat cgtcctgggt ttcaccaatg gttccagtgt tgataacatc aggagacatc 1620 cggctatgta tcgacatgcg tcgagcaaat caggcggtca taagagaaac acaccttcta 1680 cctttagtag atgagatcct tggatcagta accggggcga aacgtttctc caaaattgac 1740 gttaaagacg cgtaccatca gttagaaata tccgaaaggt ctcgccctat tacaactttt 1800 atcacgaaaa acggtctttt caggcatttt gttcaaaata tttcctttag atataaaaga 1860 ttgatgtttg gtatcagttg tgctccagaa atcttccaaa agacgatgaa atcgatttta 1920 gcaggtttgg aaggcgttat catatatctt gatgacgtag tagtgtttgg atcgacagaa 1980 gaagaacatt ataatcgtct tcaagcactt ttgaaacgtt tacaagagta taacgtcctt 2040 ttgaataagg ataaatgcat attcggaacg aacgaaatcg aattcctagg tcatgtgctg 2100 agtgaaaagg gtgttagtcc gacggagagc cgcattgaag caattaggaa attcagagaa 2160 cttaagaccg tagcagagct gagaagtttt ctaggtttaa tcacgtatgt tggacgtttc 2220 attccaaacc tagcagctaa aacagagcct ttgagaatgc tcctgcgaaa aagtgcagca 2280 ttttcctgga aagccgaaca gagcaaggcg tttaacatga ttaaagaggc agtctgtgat 2340 attagatgtc tgggattctt tgaccgaaga gatcgaacga aactgatagt tgacgcaagt 2400 ccggaggggt taggagcaat actactccaa gaaaataagg atggccaaaa tcggattatt 2460 tcttttgcca gcaaagccct tactgattta gaacggaaat acttccaaac tgaaagggaa 2520 gcactagcaa ttgtttgggg agttgaaagg ttcagtctat atctgcttgg caccaaattt 2580 gttcttatca cggattgtaa agctctaaag tatttgttta gtcctcgatc ccgaccatgc 2640 ccaagaattg agcgctgggt actacggatc caatcgtaca attatgaggt taagtacgaa 2700 cctggagtaa ctaatttggc cgatgcttta tctagattgt caatcgataa caaaaaacca 2760 ttcgatacaa cagccgagtg ttacattaag aggaaaataa ccatcatcgt tttacaaaat 2820 ttcaaaacca gtttttctgc tcaaaatcga agcaatgttc atgaaaatct atcattgcat 2880 tgccctgact gtctgtaatg caatgataga tttttataac gtttccttaa aatttgaacg 2940 aaaaaactgg tttttagttt tcgatgattg ctgatgattg aatgatggat tcttgagcct 3000 taattatctt ttggacaact cggttccgga agcagtcact cttacacagg tggctgatgc 3060 gacaagaaat gataaaactc ttcaagagct attgaaagca ctacgatcag gatcttggtc 3120 ggatgaaata caaatgttca agtgtttcaa atctgaaata cattcggcaa aaggagtact 3180 attgagaggg gatagactga taattccaga agcacttcgt cttcaggtac tgaaatgtgc 3240 acatgacggt catcctggaa tgaccgtaat gaagcgtcgt ttgaggcaaa aagtatggtg 3300 gccaaaatta gacgattctg tcgataaatt tgtaaaacaa tgcggatcat gcacacttgt 3360 ttcagcgatc ggacctccag aaccaatgct tcgtactaga atgcctgaca aaccttgggg 3420 tgaagtggcc gttgattttc taggtccttt gccgtcagga cattcgttac ttgtactaat 3480 cgactatttc agcagattca cagaagttat cgtgatgaag caaacaacga cagaccttac 3540 aattcaagct ttattcgaaa cattcagccg ttttggagtt ccagaagcat taaaatcaga 3600 caatgggcca cagtttgtca gcgattcatt caaagctttt tgtcgtgaat tcggtataga 3660 acatcaaaaa actacgccat actggcccca aacaaatggg gaagtagaaa gaatgaataa 3720 cacgattctt aagcgattga aaataagtca ggcggagaat ccagcaaagt ggaaatggga 3780 tttacgaagt tttcttctta tgtataattc caccccacac tctacgacag gcgaagctcc 3840 atcagcttta atgtttggta gaatacttcg ggataaaata ccaagcattc ttaattcaaa 3900 taggccatgg gttgaagaca ttagggatcg agattgggaa cgaaaagtaa gtactgcaga 3960 atcagctgac cacgacagaa acgcattacc aaatcaactt gaggaaggtg acatagttgt 4020 ggcgaaaaga ataacaaagg agaacaaatt atctacaaac tacaacaagg agagattcaa 4080 aaaagttaat agacatggat cggaagctga gatccgataa ctggacactg gtaaaatata 4140 ccgccgaaat gtaacacatc ttaaaaagat acatgagcca acatctgaat caaatacgac 4200 accaattgga agcccaacat ctacgatgac aatagaaaag gaacagtcaa tttcggaaga 4260 tcctgctgaa gcacccatga acactcgtcg tagtactcga ccatcgaagc gcccagaaca 4320 tctaagaaat tatgtacaag cattagtaaa ctaaaattat ttgtaaggga ggagtg 4376 // ID BEL-91_AA-I repbase; DNA; INV; 5082 BP. XX AC supercont1.289; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-91_AA_; KW BEL-91_AA-LTR; BEL-91_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5082 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.289; Positions 1207023 1201942. XX CC 'CCCAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2184..4292 FT /product="BEL-91_AA-I_2p" FT /translation="MFRQIWVHPSDRRFQQVLWRTSQFQELQRYQLTTVTY FT GTSCAPYLATRVLNQLAEDKGHRNPLRAKVVLKGCYMDDALSGEDDLETAI FT ESSKQLTELLKLGGFNLRKWSTNDPRILAHVPDEVKELAPETEIDDSGTIK FT TLGLLWSHVTDEFEFKIPSLPSLNKVPNVSKRVVASEMAQLFDPLGLVGSV FT VMSAKMFIQKLWAIGILWDDELPENLREWWLRFRNEIPELSRLKVPRQVLA FT NDHTSYALHCFCDASNHGYGACVYVVSTTGTGTYYSQLLIAKSRVAPLRGL FT TTPKLELCAAVLGCQLLEQVRNTTNFAGSAMFWSDSSIVLHWIRTPPTVWK FT VFVSNRRAEIQKLSMNDCWCHVPSKLNPADRLSRGISPSEILNDALWWHGP FT DYLVQTVENWPEDIVSLSRDEQEVRDAEARPVVTFAVAQIDHSIIQRYSDL FT GKLLRVVGYCFQFCRNSRSLPSCRKVGPLEPSEVDYSLKSLIRSVQRTEFP FT EEIRYLSANPQRRLTNKDKKLQFKSFKSLNPFLDDVGLLRIGGRLTKLSAS FT LDTRAPILLPAKHYLSWLIARSLHLRTLHGGPTLLLATIRQRFWPLRGRDL FT ARKIVRQCVTCFRCVPKPTEQFMGPLPLVRITPARVFSNSGMDYYGPFSVR FT PLVGRGANVTMYVAVFVCMVVKAVHLEVVSDLTSVACINAVKRFIARRGRV FT VNL" XX SQ Sequence 5082 BP; 1302 A; 1206 C; 1250 G; 1324 T; 0 other; attttttggt gccaaaaccc gggaccacgt tgcagaaagg tctcgaaggc agctattgtc 60 gtcgttggct ggaccacgtg actgcattgt ttcgcctatt gttcctgagc atcgcagtga 120 aatcggcctc ggagcggatc aatacatcgt ccctagcaac acacatgtta taattgtgta 180 ttatcaactc atatatggcc aaaactagtc atatataagt ttataacatt caattataac 240 atgtgtgttg ctcggggtcg atcccgcttt tatcatctcg gcactgcgat ctgcctggac 300 tttggaccat ttacggtgtg tgcaagaacg acacggagtg atagagagcg gtaaagtaac 360 cctaagtaat cgataccagg tgggtgaatc tttttcattc ttctacacgc aaaatgaaaa 420 cgttgaaaac gcttttgcgg caccgagcca acattatgga ctctgccaag ttgattgaga 480 aattcaacaa cgaatgtcag gaagcttaac atcatgtaga aaatgccgtt tctggtgacg 540 attccgatga agaaacggaa ccgaccttgt ccgcgaagcg tttcagttcc aaaatctgta 600 ttttggcctc aaagcctctc tcgtgagtaa gctgccagtc gacccgccat ccgggaactc 660 agctgctaat cgtcctccgg cgcaaccaat gcattctgtg cggttgccgg aaatcaacat 720 tccaaagttt tgtggaaatc cgtcgaaatg gatcgaattt cgggacattt ttcgttccat 780 gatccattca agcaccaatc tctcgtcagt gccaaaaaaa tcactatctt aaggctgctc 840 tgaaaggcga ggccgcccgc ataattgcga atttggaagt caattccgac aactatgcca 900 ccgtatggaa atcggtttgt gagcgatatg aaaattccaa tctacttcgg aaacaccatt 960 tcgcagctct ttccgcaatc agttctgtga aaagagcgac ttcgtcgtcg ttgtacgagc 1020 ttgtcgatga gtttgatcat cacgtggcca ttttgaggaa gctggatacg gcagctgatt 1080 tatgggacac tgtgctcgtc gaaacccacg ctgcaccgca aggcggttct gttaatgcac 1140 cctgcgagga agtctttctg tccacagtgg ctaccaggat gaaggacatc aacggtaacg 1200 cacactacgc ctgcggcgtt ctcgatagct gttctcaagc taatttcatt tctgaagctc 1260 tggcgcgcaa gttggagttg aagcgtgatc gcatcagtgt cgatgtaagt ggtattgggc 1320 aaggaattgt ccacattcgc tcaaaggtag tgattcgatt accttctcgg atgtctcgtt 1380 ataccgcaaa tcacagtcac gcttccaagc aagcatatcg atatatccga ttggaatctc 1440 cccaaaaacg tccctctcgc agatccaaga tttaacatca gttctggggc cgatattctc 1500 gtcggtggag aacttttcta ttcgttcatg caatcacata ggatcagttt gcgtaccgga 1560 ttcccgatac tccagaagac agttttcggg tacgttgttg ctggtcggct atccaatgta 1620 ttacgcaagc cttcgatttg ggtagtaagc gctacaacta gtctcgatag taaattgcag 1680 cgcttttggg aagtcgaaaa tttcgaagat cgtagagcga tgactccttt ggaagaagag 1740 tgtgagaaac acttccgaag caccgtaacc agaacaagtg cttcatggtc cgtcttccga 1800 tacgtgaaga aatgctggct atgttgggag aatccttcat ggtggcacaa cgtaggtttt 1860 tggccattga gaggaagttt gctggaaaca gtgagttcaa gagtgaatat gtgaagatca 1920 tggaggatta cgcagcacta ggccatatgg agttgagccc tcgtgtcgaa aatccccagt 1980 tcatcctgcc acatcacgcc atatttcgcc ctgacagctc caccacaaag actcgcgtcg 2040 tttttgacgc cacctgcaag ggaagttccc agttgtcgtt aaatgatgtg ttactcgtcg 2100 gtcccgtcgt acaaacaccc cttctagcta tcgtgctcaa ttggcggatt cctcgttatg 2160 ttttctaggc ggatgtagag aagatgttcc gccaaatttg ggtccatcct tctgaccgca 2220 ggtttcagca agttctgtgg cggacaagcc aatttcagga actccagcga taccaactca 2280 ccacggtaac atacggaacg tcctgcgcgc cctacctagc tactcgagtg ctaaaccaac 2340 tggccgaaga caaagggcac cgaaatccgc ttagggcaaa ggttgttttg aaaggatgct 2400 acatggacga cgctctatcc ggggaagacg acctggaaac cgctatagaa tcaagcaagc 2460 agctgacaga gctcttgaag ttagggggct ttaatttgag aaaatggagt accaacgatc 2520 ccaggattct ggctcacgtg ccagatgaag tgaaggagtt agcgccggaa acagaaattg 2580 atgattccgg aaccattaaa acgttgggtt tgttgtggtc acatgttacc gatgaatttg 2640 aattcaaaat tccatcgctg ccttcgttaa acaaggtacc aaatgtatca aagagagtgg 2700 tagcatccga gatggcacag ttgtttgacc ccctcggatt agttggatcg gtagtgatga 2760 gtgcaaaaat gtttattcag aaactgtggg cgattgggat tctttgggat gacgagcttc 2820 ctgagaactt gcgagaatgg tggttgcgtt tccgtaatga aattccggaa ctttcaaggt 2880 tgaaggttcc acgacaagtg cttgcaaacg atcacaccag ttacgcatta cattgcttct 2940 gtgatgcatc caaccacggc tacggcgcgt gtgtgtacgt ggtgtcgacc actggaactg 3000 gaacatacta cagccaatta ctaatcgcca aatctagagt tgcaccattg cgtggcctta 3060 ctacacctaa actggaactt tgcgctgcgg tactagggtg tcaactgctg gagcaagtac 3120 gaaacacaac aaatttcgcc ggttcggcga tgttctggtc ggattccagt atagttctgc 3180 attggatccg tacgccgcca actgtgtgga aagtattcgt gtcgaacaga agagcggaaa 3240 ttcaaaaact gtcgatgaat gactgctggt gtcatgttcc ctctaagttg aatccagcag 3300 atcggttgtc gcgtggaatt tcgccttccg aaatactcaa tgatgcgtta tggtggcacg 3360 gacctgatta cctcgtacag accgtggaga attggccaga agacatcgtt tctctctctc 3420 gagatgaaca agaagttcgt gatgcagaag cacgtccagt agtcaccttc gcggtggccc 3480 aaattgatca ctcaattatc caacgatatt ccgatctcgg taagctactt cgggtagtag 3540 gttactgctt ccaattctgt cgaaattctc gaagtttgcc tagttgtcgc aaggttggtc 3600 ctttggagcc ctcagaggta gattactcgc tgaaatcgct aattcgatcc gtccaacgaa 3660 cagaatttcc tgaagaaatt cgttatctct ctgcaaatcc tcaacgacga ctcaccaaca 3720 aggataagaa gctccagttt aagtcgttta agtcgctcaa tccctttctg gacgatgttg 3780 gcctccttcg gataggagga cgcttaacaa agctatcagc ttctctggac actcgagcgc 3840 ctatcctact tccggccaaa cattatcttt cttggctgat cgctcggtcg cttcacctaa 3900 gaactctaca tgggggacca acactgcttc ttgcgactat tcgacaaaga ttttggccat 3960 tgcgcgggcg cgatctagcc cggaaaatcg ttcgacaatg tgtgacctgc ttccgttgcg 4020 tcccgaaacc aacagaacaa ttcatgggtc ctcttccatt ggtgcgcatt acgcccgcac 4080 gagtgttttc gaatagtggc atggattatt acgggccatt cagtgttcgt ccgttagttg 4140 ggagaggagc aaacgtaaca atgtacgtcg ccgtgttcgt ttgtatggtg gtgaaggcgg 4200 tacacctcga agtcgtgagc gatctgacgt ccgtcgcgtg tatcaatgca gtgaaacggt 4260 tcattgctcg tcgcggtcgc gtagtgaatt tgtagcgttc gtcggtgctg atcgagaact 4320 gaaacagctt cgccgacaat acgttgaaca gttctctacc gaacaatgga atgggtattg 4380 cctggagagt gggatagcgt tctattttat ccctcctcgg tctccccccc acggtggcct 4440 gtgggaagcg ggggtaaaat cctttaaata ccatctgcgt agagtcctag gcagtcgatc 4500 gtttacgttg gaggaattaa ctactactgt ggctcaaatt gaaaatatac taaattctcg 4560 ccccctttct tcgttatcaa accatccaca agatctatcc agccttactc ccgaccattt 4620 cctagtgggt gagcctttgt attcaatacc agaacccgac tacactggac agtccgttag 4680 tcgtctaaac cgttaccaag agatgaagcc cagtatccag gacttctgga aacgctgggc 4740 aaaagagtat gtcagcgagc ttcatcagcg ttccaagtgg caacgtgtac gtgccgaggt 4800 gaaggtagga tcgctagtcc tgctgaaaca agagggtcta ccaccgttgg aatggaactt 4860 aggccgcatt gtggcggtgt tgccaggatc tgacggtcat atacgggtcg tcgaagtccg 4920 cacagccaag ggtacgtaca agcgagcaat aacggaggtt tatgtgctgc ctattgatga 4980 gtcagccgtt gaaactcaag cattgaacaa aggtggagaa cccgtcggtt cgtcaaccta 5040 ggacgtggat agttgaaaca accgtttcaa cggaggccgg cg 5082 // ID Mariner-11_HM repbase; DNA; INV; 2124 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2124 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 228-228 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 487..1440 FT /product="Mariner-11_HM_1p" FT /translation="MVRNYKRKTKRGSYGAEGLQRALDSVKNGMSLRKAEN FT SFGVSRKTLRRHLQGKVRQVGSNKLGNHVNTFSIEIEQDLVKHIQMMERAL FT LARNETSEPTNTNEEAGNAVDDEASVNVMETDRIATGDSNESDENQLDKTA FT DHPGLSEALQIISELSPIPRIKNVRSRKRKTEAATVITSSPYKAMLIEANA FT KKTKQNKQVVVENNNEENDEDEIDVIADKSKRTKSGDDKETGKRKQYARTK FT SKRTIKSNASNANVIRKGSKERRCVGSVTNSDDTPCCMCGKRFNEPPIDDW FT QQCPECLNWYHDGCGPDDTAVCFHCQ" XX SQ Sequence 2124 BP; 733 A; 295 C; 431 G; 665 T; 0 other; gggtagggtg gggcaagagg cccaggtggg gtaaaaggcc catcctgcag ttataaagaa 60 cctaggtaaa atattataat tattgttttg tagtatgaat gataaatgtg taacagtgtt 120 gcgagcgaat tttggtgaac ttttagtaga aatataattt ttaacagaat attgaagact 180 tgacaggtaa taagaatagg acattttgat tttttgcgta attttaaaaa ttttttgtag 240 attattgtta aacttagttg attatttaaa tgagtttatg tatttgtaag cgtacaaatt 300 atgtattgat tatttacata acgttcattg taccatagaa taatatgttt ttatcaattt 360 ttgcaccgac cattttgggg caaaaagccc agtttgttaa tggggcaaaa ggcccatcct 420 gagcttttta ccccttgcat tatttaagta gaatgttcaa attttaactt tattttattt 480 atagatatgg ttagaaatta caaaaggaag acaaaaagag gaagttatgg agcagaaggt 540 ttacaacggg ctttagattc agtaaaaaat ggtatgtctt tgcgtaaagc tgaaaacagt 600 tttggagttt caaggaagac attgagaagg catttgcagg gcaaagtaag acaagttggg 660 agcaataaat taggaaatca tgtgaatact ttttctattg aaattgaaca agatcttgtt 720 aaacacatcc aaatgatgga aagagcatta ttagctagaa acgaaacaag tgaacctacg 780 aatacaaatg aagaagccgg caatgcggtg gatgatgaag catcggttaa tgtgatggag 840 actgatagga ttgcaacagg agatagcaat gagtccgatg aaaatcaact agataaaaca 900 gctgatcatc ctggtttatc cgaagcattg cagattatat cagaactaag ccctattcca 960 aggataaaga atgtacgcag tcgcaaacga aaaaccgaag cagcaactgt aataacttct 1020 agtccttata aagcaatgtt aattgaagca aatgcaaaga agacaaaaca gaacaagcaa 1080 gtagtagttg aaaataataa tgaggaaaat gatgaagacg aaatcgatgt aatcgcagac 1140 aaatccaaga gaacgaaaag tggagatgac aaagaaaccg ggaaaaggaa gcaatatgcg 1200 agaacaaaat caaaaaggac aatcaaaagt aacgcatcga atgctaacgt gataagaaaa 1260 gggtcaaaag aacgaagatg tgtaggttct gttactaaca gtgatgatac gccatgttgc 1320 atgtgtggta aaagatttaa cgaaccaccc attgatgact ggcagcaatg cccagaatgt 1380 ctcaactggt accacgacgg ctgtggtcct gatgacactg ctgtctgctt tcactgccaa 1440 taggcataat caattcaact agtcttactg tagtttttta gctgtctttt tagctgtatg 1500 aattagtttt ttagctgtct ttttgggtta tgaattattt ttttagatat ctttttagct 1560 ctatgaatta gtttttttta gcaactttta agctctatga attagctttt agctgtgttt 1620 ttcagctcta taaattagtt tttatagcta tctttttagt ctataaattt ttagctcttt 1680 ttagctctat taattttacg gtgctgtttg ccattgtcat atgatagcat atatggtgct 1740 acttgctaat attatggtgc taagtgttct actaaaactt ttgttggaca tttttacaca 1800 caaatatatt gtgactagta aaaagtgtat tgttgcaata aatttttatt gtaaaaaagg 1860 aatatttcaa gctattaacg tttggggtaa aaggcccagg gtaggggcgt tttgccccag 1920 caatggggca aaaggcccac tctatagcag tttataaaaa gcatatttcc attaaaactt 1980 gagtagccat taagaaaata actttagata cttaagttta ttatgtatct gaaataatgg 2040 taacataaat ttctagatac gtttattgga tcagtaaaaa aaagacgaaa ttgcttatgg 2100 tgggcttctt gccccacctt accc 2124 // ID OVRP1 repbase; DNA; INV; 297 BP. XX AC M18644; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE O.volvulus DNA repeat. XX KW OVRP1; Repetitive sequence; tandem repeat. XX OS Onchocerca volvulus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Onchocerca. XX RN [1] RP 1-297 RA Shah S.J., Karam M., Piessens F.W. and Wirth F.D.; RT "Characterization of an onchocerca-specific DNA clone from RT Onchocerca volvulus."; RL Am. J. Trop. Med. Hyg 37(2), 376-384 (1987). XX DR GenBank; M18644; Positions 1 297. XX SQ Sequence 297 BP; 103 A; 52 C; 49 G; 93 T; 0 other; gatctctaat atcaagaaac gggtacatac cctcaaattg agtgctaaaa aaaatgctcg 60 actatatttt ggtgaatttc aaatttatat cgcgattttt ccggcgaaca acgcagtttg 120 caaaattaat taataactgg tgacatatga cccctagttt cacgaaaggg gtacgtgctt 180 tcaaattgag tcctaaaaaa aatattcgac tatattttgg tgaattttca catttatatc 240 gcgatatatc cggcgaacaa cgcagtttgc aaaattaatt aataactggt accatat 297 // ID RTE-2_CQ repbase; DNA; INV; 3642 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An RTE non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3642 RA Kojima K.K. and Jurka J.; RT "RTE non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 612-612 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with ~94% CC identity. ~80% identical to RTE-1_CPi. XX FH Key Location/Qualifiers FT CDS 616..3621 FT /product="RTE-2_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MARRLEIRNKRQKPIEKQTTDWRLGSWNCRSLNFLGF FT EFALAXELQPRNFDVVALQEVCLEEEQVLDWPGRQTNSRFXLSGGKDKKLG FT TGFIVRGKMQDRVFGFTAXSERMCKLRIRGRFFNYSIINVHCPHEEKTDDE FT KEAFYATLEEVYDGCPRQDVKIIIGDMNARFGREEMFRPTIGPESLHAVTN FT DNGQRCIDFAASRGMVVRSTYFPRKDIHKATWTSPDQRTKTQIDHVLIDGR FT FFSDVTHVRTFRGANIDSDHYLVGVDMRSKLSTVFNQRRSRRAPPFNTACL FT QNGEVAHSYAQQLEANLPGEEELGAASLEDGWSRIRSAIGSAAEATLGSAI FT RVSRNDWYDDECQRITAEKKAAYDRKLHKATRGNVERYRQARNRQVAVFKL FT KKRQQEDRDCAEMEQLFRANETRKFYEKVNRSRKGFVPRADVCRDNGGNLI FT VNKSEVLDRWKQYFNEHLNGDEADGDGVGVNLGAPAADEQFPAPDLETVKR FT EIRKLKNNRAAGKDRLPGELFKYGGEKLARALHCVISKIWEEEKLPEEWMD FT GVVCPIYKKGDKLDCGNYRGITLINAAYKILSQILCRQLSPHARRFVGPYQ FT AGFTGARATTDQIFCLRQILEKCREYNVPTHHIFIDFKAAYDTVDREQLWQ FT IMHENGFPDKLTRLIKATLDGVMCHVRVSGXLAEPFGSRRGLRQGDGLSCX FT LFNIVLEGIIRRAGIETSGTIMNKSYQLLAFADDIDIVARNLETVMDVYTR FT LKVQARRVGLGMNTTKTKYMRGRGSKDVGPPSLTPLTVDGDELEEVSEFVY FT LGSLVTADNDTSKEIRTRIFAGNRAYFGLRKTLTSDRVQRRTKLTMYRTLI FT RPVVLYGHETWTMRQEDERALGVFERKVLRTIYGGVQVSDGVWRRRMNHEL FT HAFLEEPTIATQAKIGRLRWAGHVARMDESQPVRLLFDRAEPAGGTGRRAG FT KPRARWGDQVNGDIRKICNLGNWRVAAQDRERWKRLLATARVPGALC" XX SQ Sequence 3642 BP; 928 A; 896 C; 1112 G; 695 T; 11 other; gtgtgtgtgc gtcgcgcgtt attatcgcgc ctcggtgcgg ctccgcgaaa gtgccctcag 60 aaccgcagaa ttaccgttcc gtcgtggccg aaagtgtgat caatcaacgc cgaagtgcgt 120 cgcgaaccgg cgtcgtgtgt tataattgac ccgcaatcga cggctgggca ataaaagtga 180 agttattttt gccgtccatc gtcgtcgtcg cgggaaaaca ggcagaacaa agcaagaagt 240 gtggtgtagc aacaagacca gtggcagtgc cgttgtgtcg gttgcgtata cgcacactaa 300 tcgaaaccgc gcgcggtgtg ccgagagtgc gaatttttga aaaaagaaaa cgggcaaaaa 360 gttttgaact tttttgcaaa aaataaaata aattttatta ttgcgaaatt tggcgctagt 420 agtgaagcgc atttacgcac gcagcggagg acgtcctctt ccggatatga agaccgttag 480 gcttcgagat aaaagccccg ttacggtakc gtgagaaaac cggggctagg tctcggtagc 540 ccgtaggtgg ttctccaccc cgagactagg tagcgcagcc ctgataaggc tgcctaccga 600 aaaagaggat cgcgaatggc aagaaggcta gaaatccgga acaaacggca gaagcccata 660 gaaaagcaga ccaccgattg gcgactcggc tcttggaact gcaggtcttt gaatttcttg 720 ggtttcgaat tcgccttagc gaakgagttg caacctcgca acttcgatgt tgtggcactg 780 caggaggtgt gcttggagga ggagcaggtg ctcgactggc caggtcggca gaccaattcc 840 cggttttwtc tgagcggcgg caaggacaag aagctgggta ccggctttat cgtgcggggc 900 aagatgcagg atcgcgtgtt cggcttcacg gcgwtcagcg agcggatgtg taagttgagg 960 ataagaggcc gtttcttcaa ctacagcatc atcaacgtgc actgccccca cgaagaaaag 1020 accgatgatg agaaggaagc attctacgcg acgctggagg aggtgtacga cggctgtccg 1080 cggcaggatg taaaaatcat cattggggac atgaacgctc ggttcggaag ggaggaaatg 1140 tttcgaccga caataggccc agaaagtcta catgcggtca cgaacgacaa cggccaacgc 1200 tgcatcgact tcgcagcctc ccgtggaatg gtggtcagga gcacctactt tcctcgcaag 1260 gacatccaca aagccacctg gacatcacct gaccaacgga cgaaaacgca aatcgaccac 1320 gtcctgatcg acggccgatt cttctcggac gtaacgcacg tgcgcacctt tcgcggtgcg 1380 aatattgact cggaccacta cctagttgga gttgacatgc gctcaaagct gtcgacggtg 1440 ttcaaccaac gccgaagccg gcgggcccct ccgttcaaca ccgcgtgtct ccagaacggg 1500 gaagtggccc acagttacgc gcagcagctg gaagcgaatc tgccaggtga ggaggaactt 1560 ggcgcagcct cgctcgaaga tggttggagc cgtattcgct cagccatcgg cagtgcagcg 1620 gaagccacac tgggtagtgc gatccgagtc agccgaaacg attggtacga cgacgagtgc 1680 caacggatta ccgccgagaa gaaggcagct tacgacagga agctgcacaa ggcgacgaga 1740 gggaacgtgg aacgatacag gcaggctcgg aatcggcagg tcgcggtctt caagcttaag 1800 aagcgccagc aggaagaccg ggattgcgca gaaatggagc agctattccg agctaacgaa 1860 acgcggaagt tttacgagaa ggtgaaccgg tcccgcaaag gcttcgtgcc gcgagccgac 1920 gtgtgcaggg acaacggkgg aaacctgatc gtaaacaaga gcgaggtgtt ggacaggtgg 1980 aagcagtact tcaacgagca cctcaacggc gatgaagcgg acggagacgg cgttggagtc 2040 aaccttggag cgcccgcagc tgatgaacag ttcccagcac ctgatctgga gacggtgaag 2100 agggagatca ggaagctgaa gaacaacaga gctgccggma aggatcggtt acccggtgag 2160 ctcttcaaat atggaggaga gaaactggcg agggcgcttc actgcgtgat ctccaagatc 2220 tgggaggagg agaagctacc ggaggaatgg atggatggtg tcgtgtgccc catctacaaa 2280 aagggcgata agctggactg cggcaactac cgcggcatta cgcttatcaa cgcggcctac 2340 aaaatcctct cccagatcct ctgccgtcag ctgtcacccc atgcaaggag gttcgtgggg 2400 ccctaccaag cgggcttcac tggcgcgcgc gccaccacgg accaaatatt ttgtctccga 2460 cagatcctcg agaaatgtcg tgagtacaac gtgcccacac atcacatctt catcgacttc 2520 aaggcggcct atgatacagt cgaccgcgag cagctatggc agatcatgca cgagaacgga 2580 tttccggata aactgactcg gctgatcaag gctaccttgg atggtgtgat gtgtcacgtg 2640 cgtgtttcgg gggawttagc ggaacccttt ggatcacgcc gagggctgcg gcaaggtgat 2700 ggcttatcct gtkcgctgtt caacatcgtc cttgagggca ttattcgaag ggcgggcatc 2760 gaaacgagtg gcacgattat gaacaagtcc taccagttgc ttgcttttgc cgatgacatc 2820 gacattgtgg caagaaacct ggagacggtg atggacgtct acacccgact gaaggtacaa 2880 gcaaggcgtg taggacttgg catgaatacg acgaagacga agtacatgag aggaaggggt 2940 tcgaaggatg tcggcccccc aagtctcacc cctctaactg tggatggtga tgagttggag 3000 gaggtgagcg agttcgtgta cttgggatcg ctggtaaccg ccgacaacga caccagcaaa 3060 gagatccgga ctcgtatttt tgctgggaat cgtgcstact tcggattacg gaagactctc 3120 acwtcggatc gagtgcaacg ccgcacgaag ctgacgatgt acagaacact gatcagaccg 3180 gtagtcctct atggccacga gacctggacg atgcggcagg aggacgaacg tgcccttgga 3240 gttttcgaac ggaaggtgct acgaacgatc tacggtggag tgcaggtgtc tgacggagtg 3300 tggcgamgac gcatgaacca cgaactgcac gcgtttcttg aagaaccgac catcgccacc 3360 caggcgaaga tcgggaggct caggtgggcc ggccatgtcg cccgaatgga cgaaagccag 3420 cctgtcagac tgctttttga ccgcgctgaa ccagctggcg gaacaggcag aagagcaggg 3480 aaaccgcgcg cacggtgggg agatcaagtg aatggtgata tccggaagat ctgtaacctg 3540 ggaaattgga gagtagcagc acaagaccga gaaagatgga agcgtcttct tgctacagca 3600 cgcgtacctg gtgcgctatg ctgattggta tggtatggta tg 3642 // ID HERO-3_BF repbase; DNA; INV; 3487 BP. XX AC . XX DT 26-MAY-2009 (Rel. 14.06, Created) DT 26-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE HERO-3_BF is a family of HERO non-LTR retrotransposons - a DE consensus. XX KW Hero; Non-LTR Retrotransposon; Transposable Element; HERO-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RA Bouneau L., Fischer C., Ozouf-Costaz C., Froschauer A., RA Jaillon O., Coutanceau J.P., Korting C., Weissenbach J. et al.; RT "An active non-LTR retrotransposon with tandem structure in the RT compact genome of the pufferfish Tetraodon nigroviridis."; RL Genome Res 13(7), 1686-1695 (2003). XX RN [2] RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol Biol Evol 21(2), 207-217 (2004). XX RN [3] RP 1-3487 RA Kapitonov V.V. and Jurka J.; RT "HERO non-LTR retrotransposons from the lancelet and zebrafish RT genomes."; RL Repbase Reports 9(6), 1165-1165 (2009). XX DR [3] (Consensus) XX CC This is a young family of HERO non-LTR retrotransposons. The CC consensus sequence was built based on multiple alignment of CC several copies less than 2% divergent from each other. XX FH Key Location/Qualifiers FT CDS 215..3217 FT /product="HERO-3_BF_1p" FT /note="Contains the RT and REL domains." FT /translation="MALPAVRSGPASTWTLLITLVIVAAKGTDGFMSFKLP FT LLSTDTWSGYNNDVKTLLGPLHHELATNEMSPKLAGEGFSDIMCDFMASKP FT EFSHTTEESHSEGYISHEPQSLAQVKRLKNKLRKKAFRADATPEDRKAFRD FT AIKTYSFMKRQQKRKETTKSAAHQEKEYHKNFWKFAGKCAKGQLDIPPVKP FT AFSVYYANEYYKNKYSHPTRVDFNKLLWFPHLPVEEQLPANSFDMSPVRPK FT DIKAVLSKRCATSAPGPDGIMYGHLKHLPACHLFLSTLFSKLLESGDPPTS FT WSSGNVSLIHKDGSPEAAENFRMICLTSCVSKIFHQILSERWAKYMTCNDL FT IDPETQKAFLTGINGCVEHVQVMREILAHAKKNRRTVHITWFDLADAFGSV FT EHELIYYQMERNGFPPIITTYIKNLYSRLKGKVKGPGWESDPFPFGRGVFQ FT GDNLSPIIFLTVFQPILQHLKGVEQQHGYNLNDKHYVTLPFADDFCLITTN FT KRQHQKLITQISSNTKSMNLKLKPRKCKSMSIVSGKPSDISFTIDGDPVKT FT TKDAPEKFLGGYITFLSKTKETYDILAKTIETTVENINKSAIRNEYKLRVY FT MEYAFPSWRYMLMVHDLTDTQLQKLDSIHTKAIKTWLRMQPSATNAILYNT FT RGLNFKSISDLYLEAHALAYSRSVLKADEKVKHALQAKLDRESQWTRKMQK FT WGIGKCHTIHQQAIHVAKDSEWTSVRKHVKQQVTDMRHDVWTKHQENLLQQ FT GQMLQLLEEEKCDLTWRSAMYNLPRGILSFAVRASIDALPTLCNLTTWGKR FT NTDKCKLCGNRETLHHVLNHCGVALQQGRYTFRHNSVLKHITDTIIESIDT FT SRINATIYADIQGYTTNGGTIPVHTIPTTQKPDLIIYLPEQKTLHIHELTV FT PFEKNIKTSHDRKVNKYSTLAADLETAGISATLTCFEVGSRGLVTPENKTR FT LRTLFKIVKAKPPKTLFTDISRIAMLSSYAIWNSRHEPYWESETLL" XX SQ Sequence 3487 BP; 1121 A; 897 C; 705 G; 764 T; 0 other; ctgaccagca gacgggaagc ccgcgaccaa ctagtctccg caaatattgc acacagggcg 60 accctatgga gctgattcag tcaaatttcc tctgagatat accgataact atctacagaa 120 actgcacagt tagtttggaa agagcttttc tactgaaaga cagcaaaatc cgccacttta 180 gacgagcgtc aagactgccc tccccataac caatatggcg ctacctgctg tacgttctgg 240 accagccagc acctggacac tgttaatcac gctggtcatc gtcgctgcta aaggtacaga 300 tggttttatg tcttttaaac tgccactgct gtctactgat acctggtctg ggtataacaa 360 tgatgtgaaa accctgctag gcccgctcca ccacgaactg gccacaaatg aaatgtcccc 420 caaactagct ggggagggat tcagtgacat catgtgcgac tttatggcca gtaaaccaga 480 gttcagccac actaccgaag aaagtcactc agaaggctat ataagccacg aaccacagtc 540 tctcgcacaa gtaaaacgcc tgaaaaacaa gctacgtaag aaggcattca gagctgacgc 600 aacacctgag gatcgaaagg ctttcagaga tgcaattaaa acatactcct tcatgaagcg 660 acaacagaaa cgaaaggaaa ctacaaaatc ggcagcacac caagagaaag aatatcataa 720 gaacttttgg aagtttgccg gaaaatgtgc aaaaggacag ctcgatatcc ccccagtaaa 780 accggcattc tctgtttatt atgcaaatga gtactacaaa aacaaatact cacacccaac 840 ccgtgttgac ttcaacaaac tgctctggtt tcctcatttg ccggtggagg aacaactacc 900 tgcgaactct tttgacatgt cacctgtcag gccgaaagac attaaggcag tcttatccaa 960 acgatgcgct acatctgcac ctggcccgga cgggatcatg tatggccacc tcaagcacct 1020 gccagcttgt cacctgttcc ttagtacact gttctccaaa ctgcttgagt ccggagaccc 1080 accgacatca tggtcatctg gcaacgtgtc acttatacac aaggatggta gtccagaagc 1140 tgccgaaaac tttcgaatga tctgccttac ttcctgcgtc tccaagattt tccaccaaat 1200 actctcggaa cgatgggcaa agtacatgac ttgcaatgat ctgatagacc cagaaacaca 1260 aaaggcattc ctgaccggaa tcaacggctg tgtggagcat gtccaagtta tgcgggagat 1320 cttagcacat gccaagaaaa accgccgaac agtccacatt acatggtttg acctcgcgga 1380 tgcctttggt tctgtagaac acgaactgat ctactaccag atggagagaa acggcttccc 1440 gccaattatc accacgtaca ttaaaaacct gtattctcgc ctgaaaggga aagtgaaggg 1500 tccaggctgg gaaagtgatc cgttcccgtt cggaagagga gtgttccaag gagacaactt 1560 gtcacccatc atcttcctaa cggtgttcca gcctattcta cagcatctca agggagtaga 1620 gcagcaacat ggctacaacc tcaatgacaa gcattatgtt acactgcctt tcgcagacga 1680 cttttgtctc ataaccacaa acaaacgaca gcatcagaaa ctaattactc aaatttcttc 1740 caacacaaag tcaatgaacc taaagctaaa accacgcaag tgtaagtcta tgtctatagt 1800 gagcggaaag ccatcggaca tcagcttcac aatagatggg gaccctgtca aaacgaccaa 1860 agatgcaccg gagaaattcc taggtggcta catcaccttc ctgagtaaaa caaaagagac 1920 ctatgacatc ctagcaaaga caatagaaac gactgttgaa aacataaaca aatcagcgat 1980 aaggaacgaa tacaaactca gggtttacat ggagtacgcc ttcccatctt ggaggtacat 2040 gctgatggta cacgacctga cagacaccca gctacaaaaa ctcgattcca tccacacaaa 2100 ggcgatcaaa acatggctca gaatgcaacc tagtgcaaca aatgcaattc tgtacaacac 2160 aaggggtctc aacttcaaaa gcatctcaga cttgtaccta gaagcccacg ctctggccta 2220 cagtaggtca gtcctcaaag cagatgagaa ggtaaaacac gctttacaag ccaaactgga 2280 ccgcgaatcg caatggacta ggaaaatgca gaaatggggt attggaaagt gtcacaccat 2340 ccaccagcaa gccatccatg tagcaaagga ctcagaatgg acatcagtac gcaaacatgt 2400 caaacaacaa gtcacagata tgcgtcatga cgtctggact aaacatcagg aaaaccttct 2460 acagcaaggg cagatgctac aactgcttga ggaagaaaaa tgcgacctga catggcggtc 2520 cgctatgtac aacctgccga ggggcatcct cagtttcgct gtgcgtgcct ccatcgacgc 2580 cctccccaca ctctgtaacc tgaccacctg gggaaaacgt aacactgaca aatgtaaact 2640 gtgtggcaac cgggaaacac tccaccacgt tctgaaccac tgcggtgtcg ctctccaaca 2700 aggacggtac acattccgac acaactcggt attgaagcac ataacggaca ccatcataga 2760 gtccattgac acctctcgga tcaacgccac catctatgcg gacatacaag gttacacaac 2820 taacggaggt accatcccgg tccatacaat acccactacc cagaaaccag acctgatcat 2880 atatttacca gaacagaaga ccctccacat ccatgaactg actgtaccct ttgaaaagaa 2940 catcaaaaca agtcatgacc gaaaggtcaa caaatacagc accctagcgg cagatttaga 3000 aactgctggc atttccgcta cactaacctg ctttgaagtc ggatcaaggg gactcgtcac 3060 gccagagaac aagaccaggc ttagaacact gttcaaaata gttaaagcca aaccaccgaa 3120 gactctgttt actgatataa gccgcattgc gatgttatcg tcatatgcta tttggaactc 3180 acgccacgaa ccgtattggg agtcagaaac gctattgtag aaacccacaa ggctgagaaa 3240 tgtagagcat ctgtatggac aatattgatg attgaaatgt tgtgatttta gatcaaattt 3300 agaaatatga aaaccgaact aaactaaata taatgttttt tttaaagtaa tgataagcaa 3360 tacccacatt gtgcaatact atctatgtta tgtcctttgt cccccctgca tgtttggtca 3420 ataatgacca tcgtgtcctg ggctccgtgt acctttcttt actatgaata aagaatgatt 3480 ttactac 3487 // ID L2B-1_AAe repbase; DNA; INV; 4842 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon family from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4842 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1413-1413 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 1925..4681 FT /product="L2B-1_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRSHFDELKLLIEEVKPKIVILSETHVTPYHCVEEFN FT IPNYSNEVCLSRSAHTGGIWIYVDRQLEYSVISNTVMGDNWFLAVEVKSGA FT LSGLYGGVYHSPSTSDVQFVSYFETWLNSVLIDEKTNVIVGDFNIKWNESG FT CAAELKNVTDAAGLVQKVEEATRTGPSSSTLIDLVFTNSKSCVATVVGEMK FT ISDHETICISVGESIRISVPEKRTARISWRRYSKHRLQNILRNDPTLRETS FT GSIDEAAERLSTALVSAVNQLIDVCNKSQTRTQSWYGPCLRNLKYDRDRAY FT RKFEESRSGDDWERYKLLRNCYVHELRMAKNNSVAREIRDCQGDSKRMWKC FT LKSLISPSGRSSTQIKINGSRSEEETARSLNEYFVASVQEIHNSIAAPANE FT LVEENFDSDARFCSFRPVTTAKLQETVIALKDCAGVENVTKRVLLDAFDVI FT DIHLLEVINRSLQSGVFPKTWKRSLVIPIPKVPKSTSPEDYRPINMLPLYE FT KILETLAKEQVMEYVNDNGIILEEQSGFRKHHSCETALNLLLLKWKQSIER FT GKIVMAVFVDLKRAFETIDRSKLKKVLKRYGIQGIALKWFGSYLENRSQVT FT RYNDLVSPETVVELGVPQGSVLGPLLFILYINDIKRVLRKTEVNLFADDTV FT LFVEGSNYNECFQIMNEELKYFSEWLKWKKLKLNITKTKYMVITTRSQINC FT AGEVRIDGEIVERVKSMKYLGVILDEKLNFCDHINYSIRKAAQKFGILCRV FT SRYLPMDSKVTLYKSIIAPHFDYCASILFLASKTQLKRMQILQNKVLRLIL FT KCNRLTPRAAMLNCLQRMSVRQRIEFNTLVFIFRVVKGMAPQYLTCTVRYG FT TDVHRYETRHAGDLILLNCRKTCTQNSLFYKGYSLFNQLPEDAKRTNNLRD FT FKSLCSIFVKQRSIE" FT CDS join(256..558,537..1376,1346..1864) FT /product="L2B-1_AAe_1p" FT /translation="MPAEMDGEEREHSFYSQQPCAQCALPIEDRNTCVECF FT GRCKLAIHVHYLPGATNAEIELLQKIKNAVFVCDACLSLAEFDDGHSEKRL FT DEISNKLNDLAILTKRSCDLVTEMLKNFDQVVRRVVCEELARANKRNVVSK FT CEKNELPPRRMLTRSSAKRRKTDESENCVEVDSTPKSTFAEVLKKRVEKSV FT VDNDPKKDEPIQKPNPVVIVRPKTGVQVEDVRSELRKKVDARNLNVARVSS FT GKNGEVVIALKDEESVELLKEKVKENMGGRYEVSVRENLKPTIKLIGMSEA FT MDEEDLKETLVDQNEAXSNLGHFKLCKTYCNRKLRYNNVSAIVELDAETFS FT KVMQEEKLNCGWDRCRVVDGLQVTRCYRCCAFNHKRSAVLCFQSQKIGMSE FT AMDEEDLKETLVDQNEAFSNLGHFKLCKTYCNRKLRYNNVSAIVELDAETF FT SKVMQEEKLNCGWDRCRVVDGLQVTRCXRCCAFNHKSKDCKAETPTCPVCS FT ENHQVQECKSSSKQCVNCKKMNAERNLKHDMNHAAWSEICPVYQRHFEQRK FT SLVDYSK" XX SQ Sequence 4842 BP; 1551 A; 813 C; 1207 G; 1269 T; 2 other; tttttttttt actgtgctga cgctgtgata agtttaatga ttttatgacg gaacactaaa 60 tatatgtttt atgtgtgtaa actttccgcc gttttaatca aaccttgtcg atgagtcctt 120 tgaacattgt gcgtaactgg taaaagttaa gttcagttag tacgtgccga gtaactgtgg 180 tattccagtg ataatttgtt agcattactg taccgaagtc ctcttatatt gatcgtttgc 240 aatcacccgg gcgaaatgcc cgcggagatg gatggtgaag agagagagca cagcttctat 300 agccaacaac catgtgctca atgtgctctg cctattgaag atcgaaacac ttgtgttgaa 360 tgcttcggtc gatgcaagtt ggctatccat gtccactatt tacctggtgc gacaaatgcg 420 gagatagaac tgttgcagaa gataaaaaat gctgtgtttg tatgcgatgc gtgcttgagt 480 ctggctgagt tcgacgatgg acacagtgaa aaaagattag atgaaatttc gaataaacta 540 aacgatcttg cgatcttgta acggaaatgc ttaaaaattt cgatcaggtt gtgcgaagag 600 ttgtatgtga agaattagcg cgggcaaata agcggaatgt tgtttccaaa tgtgagaaaa 660 acgaacttcc tcctcgacga atgcttactc ggtcttcggc caaacgcagg aaaactgatg 720 agtcggaaaa ttgtgttgaa gttgacagta cgccaaagtc gacttttgct gaggtgctga 780 aaaagcgtgt tgagaaaagt gtggtagata atgatccaaa gaaagatgag ccaattcaga 840 aacctaaccc agtagtcatc gttagaccta agacaggtgt acaggttgaa gacgtccgat 900 ccgaactacg aaaaaaagtg gatgcccgaa atttgaatgt agcacgggtc tccagcggta 960 aaaatggtga agtagtgatt gcactgaaag atgaggaaag tgtcgagttg ttgaaagaga 1020 aagttaaaga gaacatgggt ggtcggtatg aagtgagcgt tcgggaaaat cttaaaccga 1080 caattaaact gatcggcatg agcgaagcaa tggatgagga agatcttaag gaaacattgg 1140 tcgatcaaaa cgaagctttm agtaatcttg ggcatttcaa attgtgcaag acttactgca 1200 atcggaaact gcggtacaat aatgtcagtg caattgttga actagatgct gaaaccttca 1260 gtaaagtgat gcaggaagaa aagctaaatt gtgggtggga tcgctgtcga gtggttgatg 1320 gtctgcaagt cacgcggtgt tataggtgtt gtgctttcaa tcacaaaaga tcggcatgag 1380 cgaagcaatg gatgaggaag atcttaagga aacattggtc gatcaaaacg aagctttcag 1440 taatcttggg catttcaaat tgtgcaagac ttactgcaat cggaaactgc ggtacaataa 1500 tgtcagtgca attgttgaac tagatgctga aaccttcagt aaagtgatgc aggaagaaaa 1560 gctaaattgt gggtgggatc gctgtcgagt ggttgatggt ctgcaagtca cgcggtgtwa 1620 taggtgttgt gctttcaatc acaaaagcaa agattgcaag gctgaaacac cgacgtgtcc 1680 tgtgtgtagt gagaatcatc aggtgcaaga atgcaaatcc agttcgaaac agtgtgtaaa 1740 ctgtaaaaaa atgaacgctg aacggaattt gaagcatgac atgaatcatg ctgcatggag 1800 tgaaatatgc ccagtgtatc agagacattt cgagcaacgg aaaagcttag tcgactattc 1860 gaagtagcaa tcacggccta atcatgaaga acagtgtgat gtagtgtatt tgaatatagc 1920 tggaatgcga tctcattttg acgaattgaa actgctaata gaagaagtga aaccgaaaat 1980 tgtaattttg agtgaaactc acgtgacgcc ctatcattgt gtggaagaat ttaatattcc 2040 gaattactca aatgaagtat gcctatcgcg atcggcccat actggcggaa tttggatata 2100 cgtagatcga caattagagt actctgttat ttcgaacact gtaatgggag acaactggtt 2160 cttggctgtc gaggtgaaaa gtggagcatt atctggactt tatggaggcg tgtatcattc 2220 tcccagtacc agtgacgttc agttcgttag ttattttgag acctggctga atagtgtgtt 2280 aatagatgaa aagacaaatg taatagtggg cgactttaac attaagtgga atgaaagtgg 2340 ttgtgcagcg gagttaaaga atgtgactga cgcagctgga ttagttcaga aagtcgaaga 2400 agctacgcgt actggcccta gtagtagtac gttgatcgat ttggtgttta ctaactcaaa 2460 gtcttgtgta gcgaccgttg taggggagat gaaaatttcg gaccatgaaa ctatttgcat 2520 tagcgttggg gagtcaattc gaatctctgt gccagaaaaa agaacggcaa gaatctcatg 2580 gagacgatat tctaaacatc ggttgcaaaa catcctgaga aatgatccca cgctgagaga 2640 aaccagcgga tccattgacg aagccgccga acgccttagc acggcgcttg tgtctgcggt 2700 gaaccagttg atagacgttt gtaataagag ccaaactaga acccaatcct ggtatggtcc 2760 gtgcttaaga aatcttaaat atgacagaga tcgtgcttac cgaaaattcg aagaaagcag 2820 aagcggagat gactgggaga ggtacaaatt gctgagaaac tgctatgtgc atgagctgcg 2880 gatggcgaaa aataattcgg tagcccgaga aataagagat tgtcaaggag attcaaagag 2940 aatgtggaaa tgtttgaaat ctctcatctc gccaagtgga cgatcgtcga ctcaaataaa 3000 aataaatgga tcacgatcag aagaagagac cgcaaggagc cttaacgagt actttgtggc 3060 aagtgttcaa gaaatacata atagtatcgc agctccagca aacgaattag ttgaagagaa 3120 ttttgactct gatgcaagat tttgttcgtt taggccggtt acaacggcta aactgcagga 3180 aacagtgatc gcactgaagg attgtgcagg agtggaaaac gtgacaaaac gtgtattgct 3240 ggatgctttt gacgtgattg atatccattt gctagaagtt attaatagat ctctacaaag 3300 tggtgttttt cctaagacgt ggaaaaggtc attggtgatt ccaattccaa aagtacccaa 3360 atcaacgagt cctgaagact acagaccaat aaatatgcta ccattatatg agaagatatt 3420 ggaaacattg gcgaaggagc aggtaatgga gtatgtaaat gataatggga taatacttga 3480 agagcaatcg ggattcagga agcaccattc atgcgaaact gcgctgaact tgctgttgtt 3540 gaaatggaag caatcaattg aacgtgggaa gatagtcatg gcagtatttg ttgatctgaa 3600 gcgagcgttc gaaaccatcg accggtcgaa gttgaaaaaa gtgttgaagc gatacgggat 3660 ccaagggatt gcattgaagt ggtttggtag ttatttggag aaccgcagtc aagtgactag 3720 gtacaacgac ttggtatcac cagaaacagt cgtagaactt ggtgtgccac aaggcagtgt 3780 tctagggcct cttttgttta ttctgtacat aaacgatata aaacgagtgc ttcgaaaaac 3840 ggaggtgaac cttttcgctg atgacacagt tttgtttgtc gaagggagta actacaacga 3900 gtgttttcaa ataatgaacg aagaattaaa gtacttctcg gaatggttga aatggaagaa 3960 gttaaaacta aatattacca aaaccaaata tatggtgatt acgacgcggt ctcaaataaa 4020 ttgtgctggc gaagtgcgca ttgatggcga aattgtggaa cgagtgaaat caatgaaata 4080 tctaggagta atactggatg aaaaattgaa tttttgtgat cacatcaact actcgattag 4140 gaaggctgct caaaaatttg gaattttgtg tagagtgagc cgttatcttc cgatggattc 4200 aaaggtcaca ttgtacaaat cgataatcgc gccccatttc gactactgtg catcgatact 4260 cttcctggct tcgaaaacac aattgaagcg aatgcagatt ctgcaaaaca aagttctgcg 4320 tttaattctg aaatgcaatc gtctgacacc ccgagctgca atgttgaact gccttcaacg 4380 gatgtccgtt cggcaacgaa tagaatttaa caccttagtt ttcatattcc gtgtagtcaa 4440 ggggatggct cctcaatatt tgacgtgtac tgtaagatat ggaacagatg tgcatcgata 4500 tgaaacaagg catgcaggag acctcatatt gttgaactgc aggaaaacat gcacgcaaaa 4560 ttcgctgttc tataagggct atagcttgtt taaccaacta ccagaagatg ctaagcgaac 4620 caacaatctg cgggacttca aaagcctctg ctcaatattt gtaaaacaga gatcaataga 4680 ataaacgaca gagagaaatg gatgggcgta cacggaagtg gagtcagatt cagggatctc 4740 tcataagctg gtggaaaaca ttatcgaaag atatctgctc gtaaaccttc catactacaa 4800 aagatgtgta tgggtatgtg gtgggccatc cgaagaaaaa aa 4842 // ID BEL-59_AA-LTR repbase; DNA; INV; 559 BP. XX AC supercont1.11; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-59_AA_; KW BEL-59_AA-I; BEL-59_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-559 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.11; Positions 2168169 2168727. XX SQ Sequence 559 BP; 204 A; 113 C; 86 G; 156 T; 0 other; tgttgcgacg aaactcctcg cctcggcaac actgtcgaat ctgtatgtgt agtggaccgt 60 aacctgaaca gccaaggaac gtcaacaaca gtgagcatgc agaaaacaga aaatacacaa 120 caagtcgtcg aagaaagtgt tctttaatta ccaacttact attatcgatc tacttaaagc 180 tataagggtt agtgattata attgtcaaac tatgttcaag gtactcattc tactatttac 240 agctatttac agtttctaaa tctaatccta caaactactg tgcaaaattg taaacctaaa 300 ctataagata ttactaccgt aagtacgaca caactgaaat tgaatattca caaacaaatt 360 atgtaacgtt cactagatcc acattattcg gagtcggaag tcgctgaaat ccattaccac 420 ttccagattt aaaacaccaa atttgtaagc ccttgaatta gagtcactta aatgaattta 480 ataaaaatct tattcgtagc ttaaagcaca ctaacataaa aaggtgtttg ctcaacggag 540 ttggtgacat aacccaaca 559 // ID BEL-1_DPu-I repbase; DNA; INV; 7186 BP. XX AC scaffold_30; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 2) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_DPu_; KW BEL-1_DPu-LTR; BEL-1_DPu-I. XX NM BEL-1_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-7186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 649-649 (2010). XX DR Genome; scaffold_30; Positions 1102673 1095488. XX CC Positions [6252-6812] - Integrase core CC 'TTGTC' target site duplication CC LTRs are 99% similar to each other. CC Includes insertion of a Mariner element masked by strings of "n". CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 604..1680 FT /product="BEL-1_DPu-I_1p" FT /translation="MENARNFNDLTGGQLFQLRTNSKRAHTRLVTRINELI FT VRRASKLMLEREATALSDALAAIRNINDHYVIAARLDAQEQHDAEEYVLQI FT ANLNAATHAAIQAYLETIPTPRRASWNISDNVINQPLRPIGWSITQSRNEQ FT QINPPVDQNASVHNGLENVGDNLQPRGGTPDQQNSRGLVQQPDPGIEPIDA FT KRRKMMLEYQLTQKDIQMARQLEDFQRQGQREREDLLSKIEHERKLIDSEN FT LTSTQGPLLLSTPVIRTSPSAFPPAEPDASHIPIQQKTTTGSKIRWPKITV FT ERFGGDPRKWRKFDHGVNATIRDTSMPDSLKLLSLQDVLVDEIRKKMAHVF FT NNGFTFEAAWAELSSR" FT CDS 2552..3607 FT /product="BEL-1_DPu-I_2p" FT /translation="MQAHNAHLLQIQPFRAGDFNSLFALAADVRDAVSSVS FT EDHLAVFTFSTVVSSLSTKLPTQLQIDWGQHAYKLRPNLPSLKDFDNWIDI FT AVGAEEYRGNRFSTAQKIVMTPALRQQSSYGADKSSAYRSPTILSNTIKED FT TNRVPIPQCAACNENPGHRLENCSTFKKMLINTRAAFVADNNHCFKCLSKG FT HYGRNCQKTTIKCSLCGEAHHTLLHGADRQFPKKIGNLKILLVRAPSKSLR FT PVLLAIVPVMAQEGAISVKSFALLDPGSEATIMSRALANKLMLSGPKLKIR FT FGNFNSSVVLDSEVVKFTIKSSSNVIIQASNVFIVPNINLSPRKINWPSLK FT TPKDAPIRP" FT CDS 3519..6995 FT /product="BEL-1_DPu-I_3p" FT /translation="MFLSYLTSTCPLVKSTGPLSKPQRTHLSDLELEKTDS FT SLVEILIGMDQLSAHSIKNIREPIDGEIGPTAIQTIFGWTIVGKIPSFLTN FT GPSTKKNVNTHSIAEEVSLTSLADSFHSTESFGIDVVAPNVMSFDDSQCLK FT VLQTSIKFIRCGWQVDLPLRYSNLNLPDNRTQAVSRYYGMERKMKKPEFQN FT VAIQYNAIIKKLIDSGTAVKVDKSELNGPMGMVWYLPTHYVTYPNKNKIRV FT VMDWAVKFKDRCLNEELFRGPKLIPSLIGVLLRTRQFRVAISADIESFYHR FT IGVPQHHQTLQRFVFRPFGSQDPLSTFQMTTLVFGAVHASTAAIWVLRHAI FT RQDQQCSTISATIHEDYYSDNLSKSFETEEEAIKFSHDSKTFLGNHGFNLT FT GFASSSSSLLATFPPDDRAAPLRDLNFDALPTEYVFGLGWDCKNDCYRLRI FT KTMPSVTTKRTLLSALARSFDPLGICLPIITFAKLLFQSSCSLRTAVPPYK FT KPTGWDEPLPCNIVAKWNNWAVQLQLLSEISIERCFRPSDFPLDKCVFDLI FT VFSDSSSAAFCAVAYLKVTCCERIHWSFVMAKGRIAPVGIHSLSIPRLELQ FT AAVAAVRLARTIKDELRITISSTEFRTDSQIVLHQIQSECRDYPTFVRSRV FT NEILQHSTPESWAFISGEANPADDGTRGQTPSEFKKSCRWLNGPQGVQEYT FT PSIAFLQLQDPENLPSCVIGQLNVSPLQCSYPAIAKAINDCHTNLADLKRE FT VAFQLIDVPASKTELTNSNLEDALRVCLITAAEESFQREMKALRQGAPIPR FT DSDLRKVNPYIDPADGILKVNGRLEHAPLAESARHPVIISPDHRLAGLIIN FT QAHVDAHHAGVEHTLATIRTKYYLLRGRRAVRKIIARCASCRFNNSMPSQP FT MMANLPKERLQPYVTPFSSCGLDLFGPLYTVIGRRTEKRWVMLANCFSTRA FT VHLELLYSLSRDSCLMGVRRLIADRGRPVNIYSDNGTNFLAADKELQEGVK FT NLNSRLVADEVINQGINWSTSPPTGAHFNGVTERLVASAKTSLCAVLTDGR FT AINDEVLLTVLKEVASLLNTRPITHVSTDPFEPEPLTPNHFILGCHHPHLP FT PNVEDAFHGASRKHWEQAQFIVNQFWRMRCGNTFRSSSRERSGIKKREKCG FT " XX SQ Sequence 7186 BP; 1871 A; 1545 C; 1268 G; 1650 T; 852 other; tggtgccgaa acccggttcg aacatcaaca ttaactatct ggtaagttat atatataaat 60 ttttcgaaca caacacgaac ttttacgaac atttcacgaa cacctcgaaa aaacctacga 120 accacgttac caccacgaac acgtcacgat attcccacga acacctaacg aacatgccac 180 gaacacctaa cacacaagtt actaaaaatt acataaattg ttttaaaatc ttataaatgt 240 ttggacttat caaattgaga atttgagaaa tttcactaaa ctgttgcgta tattttgtag 300 aacagaactt tgaaaatact gagtaccact caccgacgaa ccacttacaa acttcatttc 360 cttaacttcc aagtttctac gtaaatatca atgttacata ctctaatgat taattgtaac 420 gtcgaacatt ctcttacaga ttactcgatt ctatatcaaa tcgttcgtga actcatctga 480 cgacatccat ttctctagct aattcactgc gttcatgtaa tattttatat tatacacccc 540 tcacgaacat tataatcacg ctttaccttt aagcacaggc tgacattaca acaactgctc 600 aacatggaaa acgcacgcaa cttcaatgat ctgactggtg gacaattatt tcaattgcga 660 actaattcga aacgtgctca tacgagactc gtaacaagaa ttaatgagct tatcgttcgg 720 cgtgcgtcta aattgatgtt agaacgagaa gcaacggcat tatctgatgc tttagctgcc 780 ataagaaaca tcaacgatca ttatgtcatc gctgccagat tagatgctca agaacaacat 840 gacgcagaag aatacgtcct acaaattgcc aatctgaacg cagcaacgca cgcggccatt 900 caagcttact tagaaacaat tcccactcca cgccgtgcca gttggaacat ctctgacaac 960 gtaatcaatc aaccattaag accaattgga tggagcatta cccaatcacg taacgaacaa 1020 caaatcaacc cacccgtcga tcagaacgct agtgtccata atggactcga aaatgtcggt 1080 gataatcttc agccacgcgg aggaaccccg gatcagcaaa acagtagagg ccttgttcaa 1140 caaccggatc ccggcataga accaattgac gccaaaaggc gaaaaatgat gctcgaatat 1200 cagttgacac aaaaagacat tcaaatggca cgtcagttag aagatttcca acgtcaagga 1260 cagagggagc gcgaagacct actaagcaag attgaacatg aaagaaagct aattgattca 1320 gaaaatctga cttcaactca aggaccgctt cttttatcaa cacctgtaat acgaaccagt 1380 ccgtcagcat tccctcctgc agaacctgat gcctctcaca tcccaattca gcaaaaaact 1440 acaactgggt cgaagatcag atggccaaaa attaccgttg agcgtttcgg tggagatcca 1500 agaaaatgga gaaaattcga tcacggagta aacgcaacaa ttcgggacac cagtatgcca 1560 gattcactca aacttctcag cctacaagat gttctcgtag atgaaataag aaaaaaaatg 1620 gctcatgtat ttaacaacgg tttcaccttt gaagcggctt gggcggagct gtccagtaga 1680 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1740 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1800 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1860 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1920 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1980 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2040 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2280 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2340 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2400 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2460 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2520 nnnnnnnnnn nncggcacac caggtttaat aatgcaagct cacaacgcac acctactcca 2580 aatccagcct tttagagcag gtgatttcaa ctcgctcttt gctctcgcag ccgacgtcag 2640 agacgctgtg tccagcgttt cggaggatca tctagccgta ttcacgtttt caacggtggt 2700 ctcatctctt tcaacaaaac tccctacgca gctacaaatt gattggggcc agcacgccta 2760 taaactgcga ccaaatttac cttctctcaa agacttcgat aactggattg acatcgctgt 2820 cggcgctgaa gaatatcgcg ggaatcgttt ctcaactgca cagaaaattg tcatgactcc 2880 tgcactgcgg caacagagca gttacggtgc agataaaagt agtgcgtata gaagtcccac 2940 catattatct aatacaataa aggaggatac aaatcgagtt cccatacctc aatgcgctgc 3000 gtgcaacgaa aaccctggtc atcgtctcga aaactgcagc actttcaaga agatgttaat 3060 caacacacgt gctgcctttg tagctgacaa caatcattgt tttaaatgct tgtcgaaagg 3120 ccattacgga cgcaattgcc agaagactac catcaaatgc agtttatgtg gagaggccca 3180 ccacacacta cttcatggag cagatcgaca attcccgaaa aaaataggta acctcaaaat 3240 tctgcttgtt cgtgcccctt cgaaatctct acgcccagta ttgttagcaa ttgttcccgt 3300 tatggctcaa gaaggagcca tttcggtaaa atcgtttgct cttttagacc ccggcagtga 3360 agccacaata atgtctcgtg cgctcgcaaa caaattaatg ttgtctggcc caaagctaaa 3420 aatacgtttc ggaaacttca acagctcagt ggtattagat tctgaagttg tgaaatttac 3480 gattaagtca tcttccaatg taattatcca agcatcgaat gtttttatcg tacctaacat 3540 caacttgtcc cctcgtaaaa tcaactggcc ctctctcaaa accccaaagg acgcacctat 3600 ccgaccttga actagaaaaa actgattcgt cactcgtcga gattttaatc ggaatggatc 3660 aattgtcagc acactcaatt aagaacatca gagaacctat cgatggtgaa atcggcccaa 3720 cggcaattca aactattttt ggatggacga tcgtgggaaa aattccatca tttctcacta 3780 atggcccaag cactaagaag aatgtcaata cgcattctat tgcggaagaa gtttcgctga 3840 ccagcctagc agatagtttc cattccactg agtcctttgg catcgacgta gttgcaccga 3900 atgtcatgtc gtttgacgat tcacagtgtt taaaggtgct acaaacttca atcaaattca 3960 ttcgttgcgg atggcaagtc gatttaccac ttcgttattc caacttaaat ttaccagaca 4020 acagaacaca agcggtttct cgctattacg gaatggaacg aaaaatgaag aaaccggaat 4080 tccaaaacgt tgcaattcag tacaacgcca taatcaaaaa gcttattgat tccggtacag 4140 cagtaaaagt tgacaaatca gaactcaacg gtccaatggg aatggtatgg tacttgccta 4200 ctcactatgt aacgtatccg aacaagaata aaataagagt tgtaatggat tgggccgtca 4260 aattcaagga tcgctgttta aacgaagaat tatttcgtgg tcctaagctg attccaagtc 4320 taatcggcgt tcttctacgc accagacagt tccgcgtagc aatttcagcc gatatcgagt 4380 cgttttacca tcgtattggc gtaccccagc atcatcagac gcttcaacgc ttcgttttta 4440 gaccctttgg cagtcaggat cctctttcca catttcagat gacaacactc gttttcgggg 4500 ctgtccacgc ctccactgca gccatatggg tattgagaca cgcaatcaga caagaccaac 4560 aatgttcaac aatttcagca acgatacacg aagactacta ctctgacaac ttatcaaagt 4620 catttgaaac tgaagaagag gccatcaaat tttcccacga ttcaaaaacc tttcttggca 4680 atcacggttt taatttaact ggatttgcgt catcttcaag tagcctgctg gctacatttc 4740 ctccggatga cagagcagct cctctccgag atttaaattt cgacgccttg cccacggaat 4800 acgtattcgg tttaggatgg gactgcaaaa atgactgtta ccgactcaga ataaaaacaa 4860 tgccatcagt gacaacgaaa cgaacgctcc tatctgcctt ggctcgatct ttcgatccac 4920 tcggaatttg tttacctatt ataacttttg caaaacttct tttccagtct tcctgcagtc 4980 ttcggacagc agtgcctcct tacaagaaac cgaccggatg ggatgagcct ttgccctgca 5040 acatcgtggc aaagtggaat aactgggcag tccagcttca gttactttca gaaatttcga 5100 ttgaacgttg ttttcgaccg agtgattttc ctttagacaa atgtgttttc gatttaatcg 5160 ttttctctga ttcgtcttcc gcagcattct gtgccgtcgc ctatttaaag gtaacatgct 5220 gcgaacgcat tcactggagt tttgtaatgg caaagggccg tatcgcgccc gttggaattc 5280 attctctcag cattcctcgt ctagaactac aggctgctgt tgctgctgtg aggctagcgc 5340 ggaccatcaa ggacgagctg cgaatcacga tatccagcac cgagtttcga acggattctc 5400 aaatcgtcct tcatcaaatt caatctgagt gccgtgatta tccaacattc gtccgcagta 5460 gagtaaatga gatcctccaa cactctacac cggagagctg ggcgttcatt tcaggagaag 5520 ctaacccagc tgatgatggc acgagaggcc agacccccag tgaattcaag aaaagctgtc 5580 gttggcttaa tggaccacag ggcgtgcaag aatacactcc atccatagcg tttcttcaac 5640 tacaagatcc agaaaatcta ccctcatgcg tcatcggaca gttaaatgtg tcaccattgc 5700 aatgctcata tcctgccata gcaaaggcca tcaacgattg ccacacgaac cttgccgatc 5760 ttaaacgtga agtcgctttc caattgattg acgtcccggc ctctaaaacc gaattaacta 5820 attcgaatct ggaagatgca ttacgtgtct gtctcatcac ggcagcagaa gagtctttcc 5880 aacgagagat gaaggcatta cgacaaggag cgcccattcc gcgtgattcc gatttaagaa 5940 aagttaatcc ttatattgat ccagcggatg gcattctcaa ggtaaatggc cggctggagc 6000 atgctcctct tgcagaaagc gcccgtcatc ctgtcataat ttctccagat catcgtctcg 6060 ctggtcttat catcaatcag gctcacgttg atgcacatca tgccggcgtg gagcataccc 6120 ttgcaacaat ccgaaccaaa tactaccttt tgagaggtcg tcgtgcagtc cggaagataa 6180 tcgctcgctg cgcctcctgt cgcttcaaca attcgatgcc cagtcaaccg atgatggcga 6240 atcttccaaa ggagcgactc caaccctacg taacgccgtt ttcctcctgt ggtctggatc 6300 tttttgggcc gttatacacc gtcatcggca gacggactga aaagagatgg gttatgctgg 6360 cgaattgctt ttcgactcgc gctgtccatc tcgagcttct atattcattg tccagagact 6420 cttgtctgat gggagtaagg cgtctaattg ctgatcgtgg tcgtcctgtc aacatttatt 6480 cagataacgg aactaatttc cttgctgccg acaaagaatt gcaagaagga gtcaagaatc 6540 tcaactcgag actagttgct gacgaagtta ttaatcaagg aatcaactgg tcaacgtctc 6600 ctcctaccgg tgcgcatttt aacggtgtca ccgaacgcct ggtggcatct gccaaaacgt 6660 cgttgtgcgc tgtgttgacg gacgggcgcg ccataaatga tgaggtgctg ttgactgttt 6720 tgaaggaagt ggcctcgctt ctaaataccc gcccaatcac acatgtctca actgatcctt 6780 tcgaacctga accactcacc ccaaatcatt tcatcctggg atgccatcat cctcatcttc 6840 ctccgaatgt tgaagatgca tttcatggcg cgtccaggaa gcattgggag caagctcaat 6900 tcattgtaaa tcaattttgg aggatgcgat gcgggaatac gttccggagc tcatcgagag 6960 aaagaagtgg aataaaaaaa cgagagaaat gcggttaggc gatcgggtct tagtattaga 7020 tgataatact cgccgtggtt tatgggtcgt ggccacggtc accaaattgt tccctgggga 7080 tggcggtgtc gtgagaaaag ttttggttaa aacctccaaa tctgaatttg ttcgccctat 7140 tgtaaaacta tgcttgattt cagatgctta atcgtcgggt gtggag 7186 // ID L1-35_AAe repbase; DNA; INV; 5102 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-35_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5102 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1388-1388 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 1851..5036 FT /product="L1-35_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MVKTKMDFLSYHLASVNINNITNQTKLDALSSFLRLN FT DVDIAFLQEVENSSIQLTGYTLIFNIDSRKRGVAFALRPNIEYTSVERSLD FT GRLITLRLKNGVTLCNIYAPTGSQGGSSREDFFNMTCAYYLRNCNGTIILG FT GDFNSVVNQRDATGTTPRSNMTSRLMTSLNVIDVWRFLRPNDIDYTFLRAG FT SCSRLDRFLLSRSSTDWLRTINHVVNCFSDHKAVIMRVVLPITGPPVGKGL FT WRLNPHVLDDEETLSLLQLKWRSLVRQQTNYRSWSSWWLNLAKPKISSFLR FT WRTSVQLQDNRNAMNLLYGELARTYQLYVNDSSQLSRINRIKSIMLLKQRE FT STSNWRQMNETFLQGEPTALYHLSEKNYKRSRTLIRELEIENSNVVNDGNQ FT IEQHVVSYFENLFQENNQTQTTNNFVPSLRIPNNNDHNGKLMDLIEADEIY FT QCIKNSCSRKSPGEDGLPKEFYMKCWNIIHNEFTLVLNDFMQNPNVESKLM FT NGIVVLVKKKGNSKTIKGYRPITLLNFDYKILTRILKRRMTPLMDLVLSKH FT QKCSKPQSNIFQATGKILDNIASLKHSRQTSMLVSFDLDHAFDRVKHSFLF FT QTMQRMNFNEQIITFLRRIMSNTCSRVLINGKVSRQIQISRSVRQGDPLSM FT FLFVIYMQPLIDKIVDSGYAGDAFNVYADDISIFVPDYHTLQQVVRIIDDF FT REVSGAVLNLQKTFALKIGNVPSPGPLQWLNFTNSLKVLGIYYSDKVKETM FT DNTWTNVIKNLKWRLWMSKVRSLNLIQKVILLNTFISSKLWYTASTLPLPK FT KFELQILKEYRNFLWIGGKNHIALETLFLPKIRGGLNLHSPGLKSTSLIVN FT RILQNLYNLPFLRSWLNAENILHIPAAYPQVKVFALEVATLSQETIIQHSS FT SSIYKELLQDIRDPAVFVRNRPWRVIFKNVLDHRIVSGNRSMWYTAVHGRI FT ITNEMLFNQNRRTSPNCLRCPGITETAEHKFFECQLSKPVWMFALSKMVAI FT RPLLRRKKPEYFLFPEMRGVPSREALEIKHIFAKYISFICNTPEANINVDN FT FRFEL" FT CDS join(143..1081,1036..1875) FT /product="L1-35_AAe_1p" FT /translation="MSGYRKNSFVIDFSVMPVRPKLNVVQDFIFSKMSLDM FT SVVKNLQTSITKSQVIIEVDSVATADSIIAQHNSKHSLEHDQQLYPIPVHS FT ADSAIEVKVYDLPPQMPNHLIASHFSIYGKVLSVRNDVWKEYFPGIPNGVR FT ILRMEIQKPIPSYVPVSGEMAYVLHINQVKTCKHCTQKVHIGKSCSAARKE FT TSNQSNPGQTLAEIVSNARNTEDENEATVDNMDSAESADESDVEEPSPKPE FT TVLENESNKTNSKTTPASANASRATSSKPTVAVHDSEHDQLGGRKRPISPK FT LPEQDNPRTLRRSKSQRQHSKSKNSTQIKVSASTLQVMFVFFFSMVSIYDS FT VCPAEATNDTAVLRIISSLLEKVIPNAPHQPVPRNPFSRRNHTTSLEHRXS FT PKGKSFSICLVEASLDTAILKKMSFSSVSSQPNMKIRSYLRYMIPALPLVP FT SEVIPGAQYSSEGKKHSICLAGVSSGTAAPDKIFPTPTAKSVLLTTSKSII FT NQCRQKMQETTLSTTLQYRMVQRSVIGNPNIYNGRIHKHEVSVSLNNHACM FT ANTVKRGNQSRNQIHRTLERMSSVSTRQISGFDINASPWLKRKWIS" XX SQ Sequence 5102 BP; 1602 A; 1107 C; 1007 G; 1384 T; 2 other; agttgacatt caggcttcat acacaacaga cgtgtatcgg atcgagtgaa aacatcccca 60 ttgtttttct gctaatcgga cagggtccga aaggctttcc tagttagttt ttcgtggaag 120 cgacaagatt attccctcca gaatgtctgg ataccgtaaa aactcgttcg tcatcgactt 180 cagcgttatg ccagttcgac ccaagctgaa cgtagtccag gatttcatct tctcaaagat 240 gtctcttgac atgagtgttg tcaaaaacct ccagacgagt atcaccaaat ctcaagtcat 300 aattgaagtt gactctgtgg ctacagcaga cagcatcata gcccagcata actcgaaaca 360 ttcgctcgag catgaccaac agctgtatcc aatcccggta cactctgctg atagcgccat 420 tgaggttaag gtgtacgact tgccccccca aatgccgaac catctcatag catcgcactt 480 ctcgatctat ggaaaagtcc tgtcagtgag gaacgatgtt tggaaggaat acttcccagg 540 aattccaaat ggggtaagga tccttcgtat ggagattcag aagccaatac cttcctacgt 600 tcctgtgtcc ggtgaaatgg cgtatgttct gcacataaat caagtgaaaa cctgtaagca 660 ctgtacacaa aaggtacaca tcggtaagtc gtgctctgct gccaggaagg aaacttcaaa 720 ccagagcaat ccgggtcaaa ctctcgctga aatcgtaagc aatgctcgta acacagaaga 780 cgaaaatgaa gctaccgtcg ataatatgga ttcagctgaa tcagcggatg aatcggacgt 840 agaggagccc tcaccaaaac ctgaaaccgt gctggagaac gaatcgaata aaaccaattc 900 caaaacaacg ccagcatcag cgaatgcatc cagagccacc agttccaaac cgaccgttgc 960 agtgcacgat tcggaacacg atcaattagg aggacgtaag cgaccgattt caccaaagct 1020 tccagaacag gataatccaa gaactctacg cagatcaaag tctcagcgtc aacactccaa 1080 gtgatgttcg tctttttttt ctcaatggta agcatatacg acagtgtatg tcctgccgaa 1140 gcaacaaacg atactgccgt cctaagaatt atatcatctt tattagaaaa agtcatccca 1200 aatgcaccac atcagccagt tccacgcaac ccgttcagtc gacgaaacca taccacwtcg 1260 ttggagcacc gttwttctcc caaaggtaaa agcttcagta tatgtcttgt cgaagcgtct 1320 ctcgatactg ccatcctgaa aaaaatgtcc ttttcatcag tgtcctccca gccaaatatg 1380 aaaatacgat cctacctacg atatatgata ccagcattgc cgctcgtgcc ttctgaagtc 1440 atccctggag cacagtattc ttctgaaggt aaaaagcata gtatatgcct tgccggagtg 1500 tcgtcaggta ctgccgcccc tgataaaatc tttccaacac ccacagcaaa atcggttttg 1560 ctaacgacgt ccaaatccat catcaaccag tgccgccaaa agatgcagga aacaacgtta 1620 tcgacgactc ttcagtatcg aatggttcaa cggtctgtta ttggcaatcc caatatttat 1680 aacggtagga tacacaaaca tgaggtgagc gtatcgttaa acaatcatgc ctgtatggca 1740 aatactgtca agcgtggtaa ccagagtaga aatcagatac atagaacgtt agagagaatg 1800 agctccgttt cgactaggca aatcagtggt ttcgatatta atgctagtcc atggttaaaa 1860 cgaaaatgga tttcttaagt tatcatttag cgagtgttaa catcaacaac attactaatc 1920 aaacaaagtt agatgctcta agttcctttc tacgtttgaa tgacgtagat atcgcatttc 1980 tccaggaagt tgaaaattcc tccattcaac tcactggtta tacactcatc ttcaatatcg 2040 attctcgcaa aaggggtgtt gcttttgcgc tcagaccaaa tattgagtat acctcagtag 2100 aacgctctct ggatggtaga ctaatcactt tgcgactgaa gaatggtgtc actttgtgca 2160 acatttatgc tcctactgga agtcaaggcg ggagtagtcg ggaagacttt ttcaacatga 2220 catgtgctta ttatctgaga aattgcaacg gaactataat tttgggcggt gacttcaact 2280 ctgttgtgaa tcaaagagat gctactggaa caactccacg gagtaatatg acatctaggc 2340 ttatgacttc attgaatgtt atcgatgtgt ggcgattcct gcggccgaat gatattgatt 2400 ataccttctt gcgtgctggt tcttgctcaa ggctcgacag attccttcta tccagatcat 2460 ctactgattg gctgcgtaca attaaccatg tagttaattg tttcagtgat cacaaagcag 2520 ttatcatgcg agtggtgcta ccaataacag gtcccccggt tggaaaaggt ctttggcgtt 2580 tgaacccaca tgtgttggat gatgaggaga ctctatctct tcttcagctc aagtggcgct 2640 cccttgtaag gcagcaaaca aattatcggt cttggtcatc ttggtggttg aatctagcaa 2700 aacccaaaat ttcctccttt cttcgatggc ggacatccgt acaacttcag gataatcgta 2760 atgcaatgaa tttattgtat ggtgaactag caagaacata tcagttgtat gtcaacgatt 2820 cttcgcagtt atctcgaatc aatcgcatca aaagtatcat gctgttaaag caacgcgaat 2880 ccaccagtaa ttggcggcaa atgaatgaaa catttttaca aggtgaacca actgcgcttt 2940 atcatctgtc tgagaagaac tacaaacgat caagaacatt gatacgcgaa ttggagatag 3000 agaacagtaa tgttgtcaat gatggaaatc aaattgaaca gcatgttgtc tcatactttg 3060 aaaacctatt tcaagaaaat aatcaaacgc agacaaccaa caacttcgta ccttcgcttc 3120 gtattccaaa taacaacgat cataatggaa aattgatgga cctcattgaa gccgacgaaa 3180 tataccaatg tattaaaaac agctgctcac gcaaatcgcc aggggaagac ggactgccca 3240 aagagttcta tatgaaatgt tggaacataa ttcacaatga atttactcta gttttgaacg 3300 atttcatgca aaatccaaac gtcgaatcca agcttatgaa cgggatcgtt gttttggtga 3360 aaaaaaaggg gaacagcaaa acgatcaaag ggtatcgtcc aataacactt ttgaactttg 3420 attacaaaat cctcacaaga attctaaagc gtagaatgac accattgatg gatttggtcc 3480 tgtcaaagca tcaaaaatgc tctaaaccac agagcaacat tttccaggct acagggaaaa 3540 tattagacaa cattgcttcc cttaaacact caagacagac ctctatgctg gtatctttcg 3600 accttgacca tgcttttgat agggttaagc acagtttcct gtttcaaacc atgcagcgta 3660 tgaatttcaa tgagcaaatc ataacatttc ttcgtcgtat catgtccaat acctgctcaa 3720 gggtacttat aaacggaaag gtgtcacgtc aaattcagat cagtcgttcg gtgagacaag 3780 gagaccctct gagtatgttc ctgttcgtta tatacatgca gcccttgatt gataagatcg 3840 ttgattctgg atatgccgga gatgctttca atgtgtatgc agatgacata agtatctttg 3900 tgcctgacta tcatacgcta caacaagttg tgcgcatcat tgatgatttt cgagaagtct 3960 caggagcagt tttaaatctg caaaaaacgt tcgcattgaa aatagggaac gtaccatcac 4020 caggccctct ccagtggctg aatttcacaa attcccttaa agttcttgga atatattatt 4080 ccgacaaggt caaagagact atggacaata cttggacaaa tgtaattaaa aatttgaaat 4140 ggcggttatg gatgagcaaa gtccggtcac tcaacctgat tcaaaaagtt attctgttga 4200 acaccttcat atcatcgaaa ttgtggtaca ctgcttcaac gcttccgctg ccaaaaaaat 4260 tcgaacttca aattctaaaa gagtatcgga acttcctttg gatcggtggg aaaaatcata 4320 ttgccttaga aactctattt ctgccaaaga ttcgtggtgg attgaatctt cacagcccag 4380 gattaaaatc gacgtctttg atagtaaaca ggattcttca aaacttgtac aaccttcctt 4440 ttctacgaag ctggctgaat gcggaaaata tcctgcatat tccggctgct tatccacaag 4500 ttaaggtttt tgctttggaa gtggccacac tttcacaaga aaccattata cagcacagtt 4560 catcatctat ctacaaagaa ctacttcagg atataaggga ccccgcagtg ttcgtcagaa 4620 atcgtccatg gagagtaata ttcaagaacg ttttggatca tcggattgtt tctggaaatc 4680 gatctatgtg gtacacagcg gttcatggga gaattattac caacgaaatg ctgtttaacc 4740 aaaatcgtcg aacttctcca aattgcctaa gatgtccagg aatcactgaa acagctgagc 4800 ataagttttt cgagtgtcaa ctatctaagc ctgtatggat gttcgcgttg tctaaaatgg 4860 tagccattcg acctctgctt cgcagaaaga aaccagaata cttcctattc cctgaaatgc 4920 gaggtgtacc atctagagaa gcgctagaaa ttaaacatat ctttgcaaag tacattagtt 4980 ttatatgcaa tacacctgaa gcaaacatta atgtagataa ttttagattt gaattgtaaa 5040 ttttgataga cttaatgttg tactctaata aagacatggt aaaaaaaaaa aaaaaaaaaa 5100 aa 5102 // ID Sola2-1_AAe repbase; DNA; INV; 3912 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola2-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3912 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1299-1299 (2011). XX DR [2] (Consensus) XX CC ~96% identical to consensus. ~920 bp TIRs. XX SQ Sequence 3912 BP; 1428 A; 561 C; 617 G; 1306 T; 0 other; gagcggttcc acgccgtttc gcaaacccga ttatttttgt attttttgaa tcgcctgaaa 60 atttgcatac agattcttta tgaccaaaaa tgccattatg cactttcaga ccgccatttt 120 gaacctcgcc ttatttttga gaagggcgta tcggaaaatg catggcaaat ctttaaaaaa 180 ctgtaactcg aaaacggttt gtccgatcga tttgagatct tctacaaagt tgtaggtatt 240 gcttaggact atatggagaa aaatatgcac ggtaaaaaaa agttacagat tattttttat 300 ttcaaaaaca aaattttaaa atcgattttc tccagaaacg cagttttgat tttttttatt 360 tttggatatg ttttagggga caacttatgt gatttattgc acaatgtttc aaaatggaag 420 aattatggac aaaaaagtta tgatttttta aaaaattaca gattttgaaa aaaaaaatcg 480 aaatagagga aaacaataat tttttatgtg attatttgaa agtacagaaa aattgcaatc 540 gaaaagtact taagtaaatt ttttctaggt tgcatcaatt tcgagatata ctcatattta 600 tataaaattt tcaaataaaa atagaaaaat aggccctttt caagcatatt tcgtgtttct 660 ccattccaga aaaaaaatat tttgattgag cgaatcgtag ctaaatcgtt tatcgtttga 720 tcaaaacatt tatgcttaag tattgaaaaa agagtgcacg gtaaaaaata taaacgatat 780 tttcaaatga aaatttgaag ctaatagttc ttaaaaaccg caacattgat gtttttaatt 840 tttggatata ttttgcggtt gttgtaggta ttgtttagga ctatttggaa aaaaatatgc 900 acggtaaaaa ataatgacag atttttttaa taaaaaaaat taaaatcgat tttctatgaa 960 aatgcatctg tgatttttat tatttttgga catgttttag gggacaactt atgtgattta 1020 ttgcacaatg tttcataatg gaagaattat ggacaaaaaa gttataattt tgaaaaaaaa 1080 tacaaatttt gaaaaaatcg aaataaagga aaacaatatc caactacttt caatattgcc 1140 gaggtcatgg acggttcaga aaataatgga tacattcgat acatccaggt atttggcaac 1200 gcaagctaaa cagtcgttag aaggaacaag taaatctcgt acatctttcg ggctaagtaa 1260 tgtagcaaaa gaaattgtag taaactttta cgaagatgat gaaactagcc gagcgatgcc 1320 tggccaacgt gattatgttt ccgtaatgag agacggaaag cggctggcaa ttcaaaaaag 1380 gttaatgatg actacattaa gagagtcgtt taatcgattt actgaattac atagtggtgt 1440 tcaaatagga ttttcatctt ttgcaaaatt gagaccgaag aattgtaaat tgttaactag 1500 ctctggaaca cataatgtgt gcgtatgtac tattcacgaa aatgttaatt tgataatcca 1560 tagtttaaag aaatataatt tactgcatga tttaattttt tttactgata gtttattatg 1620 cgaaacaaaa acagtggatt gcaccttacg ttgttgcgaa aaatgttcag attctactac 1680 ctttgaacga agcttgttat cagaaatcga agagaatggc attgacgaat tacaatttga 1740 acagtgggtc acaaccgaca ggtgtgatat tgaaactttt ttaaagcagc ctgatgaatt 1800 tgtatcatat tttactcgta aattggagaa actaatacct catgatttca tcaaaaaaga 1860 gcaagctaca tttttaaata acacaaaaaa acaatttgat agaaggtgaa tttgtggtaa 1920 tttgtgattt ttcggaaaac tacacattta ttcttcaaga tgaagttcaa agccatcatt 1980 ggaacgctca gcaagctact ttgcatccgt ttgtaatata ttatcgtcaa gatggcaaag 2040 tcaatcattt aagttttgtt gtaatttcag aagatttgcg ccacgattct gtatctgtca 2100 acttattcat atctaaaatg atagattttt tgcgattgga tcacaaatta actgtcaaca 2160 agatatattt tatgtctgac ggagctgcat cacagtataa aaataggaag aacttttcaa 2220 ctctttggca atttaaaatc aaatatgata tagaagttga atggcatttt tttgcaacat 2280 cacatggaaa agggccatgc gatgccatag gagggacact gaaacgtatg gctaccagag 2340 ccagtcttgc caaaaacgtg aacatccgat aaaaaatgct aaggaactat atgattgggc 2400 tcaaaatcgg aaagaagagc agcttacaca aatcttcttt tgttattcaa ctactgagga 2460 atatgatatc attgcacaag aacttaacca actattttca agtgcaaaaa cagttcaagg 2520 cactcagaaa tatcactcgt ttatccctat ctctgaaaca caaattgaag ttagacaatt 2580 ttcgagctgt gacgataaca agaaagtagt tgatataatt aagaaataaa attgcagtat 2640 ataatataaa aaaaaaaaaa aaaaacaaaa aaaaacaaaa aaaaaacaaa ctccttctcc 2700 atggttgtac gatcttgccg cgatgagagg gctttaccca ttagccgggc ttagccctgg 2760 ggaccaaagg tacaccgttg tgatagaatc acatttttaa tgccacgttt agatttaccg 2820 tgtatatctc ggaagtgatg catctcagct aaattttact tgagtatttt tcgatagaaa 2880 tttttcatat ttaaaaatat atagttattt ttctgtactc aaaaaaacgc accaaaaata 2940 tcgttttaca taaattcgtt ttattatttt tttaaataat ttatttttta tccaaaaatt 3000 ttccgttttg aggcattgtg caagaaatta caaacaatat gctgcaaaac atatccaaaa 3060 attaaaaaca tcaattttgc ggtttttaag aacaatgaac ttcaaatttt catttgaaaa 3120 tttcgtttat atattttttt taccgtgcat acttttttaa atactttagt atataggttt 3180 tgatcaaaca ataaacaatt cagctacgat tcactcaatc aaaataaaat atttctggaa 3240 tggagaaaca cgaaatatgt ttgaaaaggg cctattttac tttttttatt tgaaaatttt 3300 atataaatat gagtatatct cgaaattgat gcaacctaga aaaaatttac ttaagtactt 3360 ttcgattgca atttttctgt actttcaaat aatcacataa aaaattattg ttttcctcta 3420 tttcgatttt ttttttcaaa atctgtaatt ttttaaaaaa tcataacttt tttgtccata 3480 attcttccat tttgaaacat tgtgcaataa atcacataag ttgtccccta aaacatatcc 3540 aaaaataaaa aaaatcaaaa ctgcgtttct ggagaaaatc gattttaaaa ttttgttttt 3600 gaaataaaaa ataatctgta actttttttt accgtgcata tttttctcca tatagtccta 3660 agcaatacct acaactttgt agaagatctc aaatcgatcg gacaaaccgt tttcgagtta 3720 cagtttttta aagatttgct atgcattttc cgatacgccc ttctcaaaaa taaggcgagg 3780 ttcaaaatgg cggtctgaag gtgcaaaatg gcatttttgg tcataaagaa tctgtatgca 3840 aattttcagg cgattcaaaa aatacaaaaa ttaaatttaa aaaaaaattc gtcatgttcg 3900 gtggaattgc tc 3912 // ID Gypsy-12-LTR_HM repbase; DNA; INV; 120 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-12-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-120 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 397-397 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 120 BP; 36 A; 4 C; 18 G; 62 T; 0 other; tgttgtgttt tgattatttt agattatgtt tttatgtttt gattattatg attatgttta 60 ttttagagta gaaaaactta taagaattaa aatattgtag ttctttatac aagtgtttca 120 // ID Baggins-2_NVi repbase; DNA; INV; 3705 BP. XX AC . XX DT 15-FEB-2009 (Rel. 14.02, Created) DT 23-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE Non-ltr retrotransposon: consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW Baggins-2_NVi. XX NM Baggins-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3705 RA Jurka J.; RT "LINE retrotransposons from the parasitic wasp Nasonia RT vitripennis."; RL Repbase Reports 9(2), 484-484 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 78..3614 FT /product="Baggins-2_NVi_1p" FT /translation="MPNTRTLASRGEIRGLGGAGRIFRDPTCSVPRACILV FT KGFDALLLPARCSRDLTAIKIRLPVGEGSEREVVVASGYFPYDSQEEPPPR FT EVQDLVEYCRQRSIPLILGCDANAHHIVWGSSDTNGRGDALLQYLVTTSLC FT IMNRGREPTFYNSVRSEVIDLTLCTVGMEGWVGSWRVSNEPSLSDHRYIQF FT EWKERCSETRAFRNPRKTDWAFFRENLRNELQSFKPSFGTTDELDHWAFEL FT GEIVNISFQRSCPWTIPRGTWGTPWWNRELEDLRRETRRTLNRAKNTRNSI FT DWRIHREAQRLYKNRINAVRIKGWRDYCEDIERYPDAARLLRILAKNPEVW FT LEAIRLPTGEYATSEEECLKLLLEANFPGFRLSHEMGDESSGRNRQQRAAW FT DLAAKVVTPEKVKWAIRNFQPFKAPGIDGIYPAFLQEGLEELVGPLVKLFR FT ASVALAHVPEIWKTAKVVFIPKTGKPSHTTVKDYRPISLNSFVFKTLERLV FT DRFIQDDILTVFPLHPNQHAYRAGFSTETALHSAVWRIEEQLERGEVMVGV FT FLDIEGAFNCTSIEAVVKEAGLHGMPGPLIKWLQGMLTRRTVISSLGTVTV FT SGEVSKGCAQGGVVSPTIWCLVANGPLEALNGWGCYAQAYADDFLILIKGS FT EIGAAMDAMQLALRKVERWCCSTGLSVNPDKVEMVIFSRKYKLDAYRAPKL FT SGVALQVKDSAKYLGIVLDKKSTWEKHLTAQCDKFLVALWTCRRAFGSTWG FT LSPRIILWLYRAVLVPRLAYAALVWWPRAELAGARAAPEKLRGQVLRGATG FT AYRTTPTKALGILVKVEPLHLTIIGMAAKAAHRLNAYGQWTQGTRHTRLPG FT GVGLLPEFSIKTDMMIPRYFFGRRYGVVIPTREEWKARRDSLPGTGDVWYT FT DGSRAETGTGSGYYCRRDGRGTFFSLGRYATVFQTEIYAILTCAQRNIELG FT ARDRIITICSDSQAALRALMAHRTTSRLVWECKVVVNQLTAHNNKVRLLWV FT PGHTGIRGNEIADRLAALGAKHPPIGPEPYTGAARCLLAGEIRDWVEREHT FT KEWQGTQGCRQAKAVMGQDTNVGWTKCIAGGSRNNSRLLTQIVTGHIRLRY FT HGLKMGKEATGVCRWCEGAEETPTHVLTVCPKFAQLRHQCLGELFPTYEEI FT RALDAGAILTFWRRAGLPA*" XX SQ Sequence 3705 BP; 957 A; 870 C; 1122 G; 754 T; 2 other; acacagataa acttgcatca tagcaaaggc gcctcggccg cactggtcag gcgcatggct 60 aagatgcaca caggcatatg cctaatacaa gaaccttggc tagtcgcggg gagatcaggg 120 gtctgggggg agctggccgg atctttagag accccacgtg cagcgttccc agggcgtgca 180 tacttgttaa aggttttgat gccctgctgc tgccggctcg gtgcagcagg gaccttacgg 240 caataaagat cagacttcct gtgggagagg gctctgagag ggaggtggtg gtggcctctg 300 gctacttccc atacgactct caggaagagc ccccaccaag ggaggtacaa gatcttgtcg 360 aatactgcag acaacgaagc atccctctta tactgggatg cgacgctaat gctcaccaca 420 tagtctgggg cagttcggat actaacggca ggggcgacgc actgctgcag tatctggtaa 480 cgacgagtct gtgtattatg aacaggggca gagaacctac cttctacaat tcggttagaa 540 gcgaggtgat cgacctaacc ctgtgtacag ttggaatgga ggggtgggtg ggcagctggc 600 gggtatctaa cgaaccttca ctgtccgacc acagatacat ccaattcgaa tggaaggaac 660 gatgctcgga aacaagggcc tttagaaatc ccaggaaaac tgactgggcg ttctttaggg 720 aaaaccttag raacgagctt caatccttca aaccaagctt tggaacaacg gatgagcttg 780 atcactgggc atttgagcta ggtgagatag tgaacatctc tttccagaga agctgtccct 840 ggaccatccc aaggggaaca tggggtaccc catggtggaa ccgggagcta gaggatctgc 900 gacgwgaaac gaggaggacc cttaacaggg ccaagaatac tcgcaatagt atcgattggc 960 ggattcacag agaagcccag aggctatata aaaaccgcat caacgcggtg agaatcaaag 1020 gatggaggga ctactgcgag gatatcgaga gatatcctga cgcggccaga ctcctccgga 1080 tactcgcgaa aaatcctgag gtatggcttg aagcaatcag actgcctaca ggagaatacg 1140 caacctcgga agaggagtgt ttaaaactac ttctggaagc caactttccg ggctttcggc 1200 tctcccatga gatgggggat gagagctcgg gtaggaatag gcagcagagg gcggcatggg 1260 atctggcggc gaaggtcgtc acaccggaaa aggtcaagtg ggccataagg aactttcaac 1320 ccttcaaggc gccgggcatc gacggaatct acccggcatt ccttcaagag gggctggagg 1380 aactggttgg ccccttggta aagctcttca gagcaagtgt agctttggct catgtacctg 1440 aaatatggaa aacggccaag gttgtgttta ttcctaagac agggaagcca agtcacacca 1500 ccgtcaagga ttacagacct ataagtctta actcctttgt gtttaagaca ctagaaagat 1560 tggtggatag atttattcaa gacgatatcc taaccgtgtt tcctcttcac cctaaccagc 1620 atgcgtacag ggcgggattc tctacagaga ctgccttaca ctcggcggtt tggcgcattg 1680 aggagcagct ggaaaggggg gaggtaatgg tgggagtatt tctagacata gagggagcgt 1740 ttaactgcac ctctattgaa gcagttgtca aagaggcagg actgcatggg atgcccggac 1800 cattaataaa atggcttcag ggcatgctta cccgcaggac tgtgatctcg agtctcggaa 1860 ccgtaacggt ctccggcgag gtcagtaagg gatgcgcgca gggcggagtg gtgtcgccca 1920 ccatttggtg cctggtggca aatgggccgc tggaagcgct gaatgggtgg ggctgctatg 1980 ctcaagccta tgcggacgat ttcctgatac tgataaaagg aagcgaaatt ggcgcagcca 2040 tggacgccat gcagcttgca ttaaggaaag tggagagatg gtgctgctcg acgggacttt 2100 cggttaaccc ggataaagtc gagatggtga tcttctcccg caagtacaag ttagatgcct 2160 acagagcgcc caaactctct ggagtcgccc tacaggtaaa ggactcagct aagtacctcg 2220 gcattgttct ggataaaaag tcgacctggg agaagcatct cacggcccag tgcgataaat 2280 tcctggtggc tctctggaca tgtcgtaggg catttggcag tacatgggga ctaagtcccc 2340 ggattatact gtggctatac agggctgttc tggtacctcg tctggcctac gccgcgttgg 2400 tctggtggcc aagggcggag ctggcgggtg ctagggcagc accggagaaa ctcaggggac 2460 aggtgctgag gggggcgaca ggggcctata ggaccacgcc taccaaggca cttggaatcc 2520 ttgtgaaggt ggaaccactc catcttacta tcattgggat ggctgctaag gcagcgcaca 2580 gactcaatgc gtacggacaa tggacacaag gcacgaggca cacccggctc ccgggtgggg 2640 tgggtctctt gccggaattt tccattaaaa ctgatatgat gatcccgcgc tacttctttg 2700 gaagaagata cggggtcgtc ataccgacaa gagaagaatg gaaggcccgc agggacagtc 2760 taccaggaac gggggacgtc tggtacacag atggctctag ggcagagact ggcactggtt 2820 cggggtacta ctgccgaagg gatggaaggg gaaccttctt ttccctgggg cggtatgcta 2880 cggtcttcca gaccgagata tacgctatac tgacctgcgc tcagaggaac attgaactcg 2940 gcgcaaggga tagaattatc accatttgtt cagacagtca ggcggcactt agagcactta 3000 tggctcatag gacgacctcc agactggtct gggaatgcaa agtggttgta aatcagctga 3060 ccgctcacaa caacaaggtc aggctgcttt gggtcccagg gcacaccggg atcaggggta 3120 acgaaatagc agatagactt gcagctttgg gagctaaaca ccccccaatc gggccggagc 3180 cctacacagg tgccgcaagg tgcctgctag cgggcgaaat ccgggactgg gttgagaggg 3240 agcatactaa ggaatggcaa ggaactcaag gatgcagaca agccaaagcg gtaatgggac 3300 aggacaccaa tgtcggctgg accaagtgca tcgcaggagg cagcagaaat aactcccgac 3360 tcctgacaca gatagtgacc gggcacatcc gcttaaggta ccacggtctg aagatgggga 3420 aggaggctac gggtgtatgc aggtggtgcg agggggcgga agaaacgcca acccatgtgc 3480 tgactgtttg cccaaaattt gcgcagctca gacatcagtg cttgggggag cttttcccca 3540 cgtacgaaga gattagggca ctcgacgcgg gagccatact gaccttctgg agaagggcag 3600 gcttacctgc gtaggggatg tcatgacgaa gggtggtaca atgggtttac agcctaagtg 3660 ccagggatct cggaggtccc cccttcatag taataataat aataa 3705 // ID Gypsy-3_DPer-LTR repbase; DNA; INV; 2254 BP. XX AC super_615; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_DPer_; KW Gypsy-3_DPer-I; Gypsy-3_DPer-LTR. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-2254 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_615; Positions 5 2258. XX SQ Sequence 2254 BP; 624 A; 531 C; 458 G; 641 T; 0 other; tgtaatattc gacgcttgtc gtgccgttta tttgaattat gaacggctaa ctcccaaaat 60 tacagcctaa gactaggcag aaaaaaaaaa aaattcgagt tgcagagctt gtgagcttta 120 accatacata gagtatcagc tgttttgcga ccaagtggca acactaagat atgcaatttt 180 tctgctttgc actatatttc attaaatatt gctgtggctg gattctgtat aacagaaaaa 240 gaccatagaa ggtgtggaaa aaacgtgggt tgctgtacct agtagtccca gcctctgtct 300 tttttacctc gtggccgtca gagtgaccgg cagtgaaaaa agaagtctgc ttgagctgac 360 ccagttctct aaacaaggtc aagcccaaag tcaaagccga gtgcaagtgc caagtgcggc 420 tcaaagtgat cattgaattt gagtggacgc gccagttata tatcgccaat actggggcag 480 ccgttcgacc atttggaaac gagcgccccc agtgtatcaa catttttcga aacagtggtg 540 caattctgtt gaggtaatta tccacccagc tgcagcgacc cgttgaaaaa tgttctgttc 600 tttccgtgta ggattagcac cactagtatc tacgtattga aacgctggtc tagaagatac 660 ggattggtta ttcccggaga cctcttggcc acgtgcccgt accaccgttc ggtagccgag 720 ggcatggact tgcgcctcaa gtaaacccga tcggtgtgct ctgcaatctt cgcctggaca 780 agttcgaccc atggggatga gctgagtgac tcttcaaata aaccagtccc ccccacgagc 840 aatccagtgt gcgagtgtac ctaacctcac gccgcagatg gaccgcggtc attgacccct 900 ggcatcggaa gcagctgatg ttacctgcgc ccgttacaat cgtgtgccac cttcagacct 960 cgagtaggaa ctcggtctcc agccgggaag catcgaaccc gcatcggatt tagttatatt 1020 ttctatcttt ctttcgaaat tagacccgat atgcgaaacc ctgttttgtt atccgctttc 1080 tgtttaacaa tgttcttttt atacggcaaa aatgtgcaaa agccaagtaa atactgtata 1140 aaccaaaagg gaattcctgt ccatgggcta agtcgaactt gaacaaatga gtagtgtatt 1200 aataccgatt atttttacag tttttgattt tggttgccac gcattcatct catcatctat 1260 gcagatactc atcacaatca cacacatcat catactcatc cacccatcaa atcacctcat 1320 ccaacacaca acgaaagatc tttctgcgtg ggagtcaaag ttattaatcg tttcgtaagt 1380 gattataaga aagaaaaaaa aaaacaagta agccatctga aaagaatgaa tgtgcggtaa 1440 ctattcgttg gtcttcaacc aggatcacgg ttttatacac gtattactca tgcctttaaa 1500 cgactttcca tatacccgat ttttgtctgg ttttggcact gaccaagtta ccgtaatgac 1560 atatgacgtt tccttgttcc tatgatgaac acgtacctat ttcttgcctt tatctatacc 1620 tactactaat catttttata gatttaagaa ccttatcccc tacctcgccc ctatgatact 1680 cttaatacta gtaataccct ataaatgtaa gacgataccg actagcaggc ttatattagg 1740 gactagatcc tattagcgtt aggtttgata ggagtttaaa tgaagaatta tggattgttt 1800 aaataaagta gttttggaaa tgaaccgtgg ctaaatagat tatattggaa ccatgcaacc 1860 tcgaaaaacg gagatgtgac ccaccgcagt ctttgttgga gggatcctga cgattaatag 1920 gtggccctga aggatcatgg tagggtgggc gttctcgcca atgggcgata ccgcagccga 1980 caaagatctg accggtgatc ttcttcgttc acgtgtacat gctgatctgt ccctttgacc 2040 cccctgtatc ccgtttccta gtatcctttc cctactcgta cgcgattttt tttataaatt 2100 cttatctcga ccatttgcca gagctataaa tgggcagtaa aatttcctcc atgcccacac 2160 cccgccgacc agagacaaaa gccgaagcca gcgcgtactt cctactcccg tttacttatc 2220 atcagctcct ctaacctgta cgttgtatgc taca 2254 // ID Gypsy-174_AA-LTR repbase; DNA; INV; 1119 BP. XX AC supercont1.141; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-174_AA_; KW Gypsy-174_AA-I; Gypsy-174_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1119 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.141; Positions 151063 149945. XX SQ Sequence 1119 BP; 314 A; 264 C; 257 G; 284 T; 0 other; tgtaaccgtt cagttacaga atttctagat ttattttaat tttgtgcatt ttgtagggtg 60 aactgtagtg tttttatggt ttgtgtttgg taaaattcgt aattgtttgt gacaagcata 120 aattgtaatt taaataaact gtggataaaa tgggcgttgg gtggaaatca actgtggtag 180 gaaatgtagg caacatgcaa ggtaatggac gatgtttatg ctaccaagga tggaaatatg 240 gagtaaggat aaggaacaca aggaaattat ggactggagg gaatcataag gcttgagcta 300 gatcgccatg atcacttttt gatcggcgtt cgtgtgagac acaagtgagt gagagttagg 360 tggcaaaatt tgtgaagttc ggtgaaccgt aggagaggac attgaacccg ttcattcagg 420 accttttatt agtgcctttt ggtactgagc gataatacgc catcctcaca gtccacacga 480 ttcggcctag atcgggtatc cgatctacgc ctaaccgcaa aagactactc tcaggtctca 540 ttcttacgac cgtatgagag atcatcctcg ggccgttgag ccctaaacat aaaggaggac 600 ccttctcggc gtggccaata cggtctcgtc cgtggtccgt cgattccgac aaagaaggaa 660 gacgccgtta gccaagccta ggtcacgtaa gtggccgatt cacagtggaa acgccattcc 720 acttgtccac cgtcaccaga cccgatcctc agcaaccgac gccatcaacc gccattgcca 780 tcagcagtca tcagcgagcg ctcgccatcg caacggtacg tacccaacga tcctgaaccg 840 ccttaaaacc acaatagccc aataaacaca atagacactc cacagtagtt ttttcaataa 900 attagatctt tttcctccaa acacacacac ttctcaatag ccaaagaaag accatagttc 960 ccgcacactt agatccgcgt agcccctccg aattacccag tagcatgccg gccctgcgac 1020 acagttcagg ggtctcatgc tactgagcta gatatccggg agtcttggtt ctgccgtgag 1080 cctatagagc tgcatcagtg taagtctcaa acagtctca 1119 // ID CR1-48_AAe repbase; DNA; INV; 4539 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-48_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4539 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1135-1135 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 23 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 1657..4476 FT /product="CR1-48_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRTKTNEFRTKMETCDYDIIVLTETWLRPDVVNAELA FT SNYCIFRCDRNDNTSTLQRGGGVLIAVKSTLRCSAVELTDYIHLEQVVVAV FT RLPDTTIYVCGIYIRPNSHFDVYDSHASAVQEICNLSSTRDSIVVVGDYNL FT PHLTWHFDDDCNSYLPRNASSESEIMFTQSMIGSGLHQINSLRNMNGRILD FT LAFVSEXSFVELIDPPSSLLKVDSHHKPFVLRIEIETRTRAPVEEADVDDF FT DFKRCDFAALNDMLVAVDWNETFGDAPLDRITVLFYDKVFDIIRHHVPRRR FT RCSAVHSKLPWWTPELRHLRNIVRKARKRYFRNKSHDNRKNLKSLETRYSV FT CQENAFRSYTNRMESNLKRDPKSFWTYIKKLKNINRIPEELSYNGTTAETA FT EASANLFADFFSSVHNNHSPTFSSSARQNIQSFNVSLPMITISQHEVSSAL FT KKLDVSKGAGTDCLPPTFLKECADSLKTPISVIFNRSLREQVFPEIWKVAS FT ITPIHKSGSVHSVENYRGISILCCIAKVFEEIIHRSLYSAVRPIISDVQHG FT FVKKRSTVSNLMSFTNVLSNSVEKRRQVDAIYFDFAKAFDKVPHELVISKL FT KHIGLPDWITEWLRSYLTDRKAHVKVGRARSHSYDISSGVPQGSVLGPLIF FT ILFINDLAVRLKSGKLMYADDLKIYREISSTLDCCALQSDVNELISWCTEN FT GMELNIDKCKTITFTRRQTYVTHDYAIGNNIIERVFSIRDLGVIIDSKLKF FT TEHISTVTAKGFAVLGFIRRNSQSFRDVYTLKALYCSLVRSVMEYAVCVWS FT PYHATHMLRIEKVQRCFIHYCLRQLPWNDPTNLPDYVSRCRLIDLETLTSR FT RTKIQRLFVYDLLSNNIDCSELTNELRFYAPTRQLRERQLLVIRPHRTTYG FT QNCPMSSCLRAFNDVGCYFDFNVSKCTFKHRIKYLN" XX SQ Sequence 4539 BP; 1268 A; 1062 C; 973 G; 1232 T; 4 other; aattctggca ccactgcgtg ttttcgtttg tgttgtttat aacgtcgccc taaaatcctg 60 tgaatattcg tgcaatcaac tctataaatc agggtttatt gctaaactgc ctcgttccgc 120 ttctgttagc cgcaagcatc aggttttcgc atcacttttg gcaaggtagg acagtgattt 180 gtgatctaaa tccgagcagg ctagagagtg gaagcatccg ccatcgcccg ataacatatt 240 tcccacctcg ccgctctcca ccttccaacc gatcgaaagc tgcaacaaga tcaaaaatca 300 acccgccccc atttgcccca ncgccccacc aacgccctac taytgtttgc tgcaccgaac 360 gaaatcaaaa caagaaatcg tcctgtaata ctccgcttca tattggaaat aaactactta 420 cgctgcctcg tctttcattc caacgctccg aaagggacca acgaacgatc atcatacagc 480 ctcaatcgcc gtccaaagcc aacccgttgc taccaaggat taatcgccaa actctgcatc 540 cgtagtatca tccattcacg taaggaaatt ctgttttggt tctgcacttt gtttatgtat 600 tgaactgtga agctcaccgt tgacaagcca ccgtttgtga cagagaacac acgcgcagct 660 cgtggtcatt ccgagaatca atagttcaca gtatatcacc ccttcgtaac tgggagcttg 720 tttctcgcag ctaaaatgca gggtgtttgt gaaaaatgcg cttccgaact atccgctgat 780 ttttggagct gtggtggatt ttgttctggt cgcttttgct tcaggtgtac cggaatgagt 840 cctgatatcc aagctgccgt tgcgaataac gggtgccttc actggatgtg taacgcctgt 900 agcggaataa tgattaaagc tcgctttact aaggcaatca catcagtgaa tgctgcctat 960 gaaggcataa tcgaatcaat gaaaatccga aattcgagac aacattttgt ccgaaatccg 1020 ctcggaattg cagaataacc ttaaacgaat tccggagatt gttgcaaaca caccgattcg 1080 aaatattcaa cgtcgcacat ttgattcggc tgtccgtaat cctgcaaaac gtcttcgtgc 1140 aaatgatgat gtgcctctcg acccgcctaa aaaactggtc tgtggcactg atggccacag 1200 tactagttct tcagacgtca tcgcagttcc tgaacgaacg gatgaaaaca cgttttggct 1260 ttacttatca ggagtgtccc cgaaagcgcc tgatgagaag gttatcgaaa tggtcaagaa 1320 taggctcgma acagacgact taactgtcgt taaacttgtt cctcgtggaa aggatacaaa 1380 caatctcacg tttgtatcct tcaaaatagg aatgaagctt gatcttaggg caaaagcaat 1440 gacttcatca acctggccag ttggaattcg ttttcgcgaa ttcgagaacc acagttcctc 1500 aagggcgggg ttctggaatc cctcggagaa ccagcccgaa ccacagatcg taacagtatt 1560 gaactagtat ccccatcgtg tcccatagat cttcattctc ctcgtgatga gaccgtttta 1620 agccatttga caatctacta ccaaaacgtg cggggtatgc gaacgaaaac taatgagttt 1680 cgtacgaaaa tggaaacttg tgattatgac ataattgttc ttactgaaac ctggctgcgc 1740 cctgatgtgg ttaacgcaga gttagcatcg aactactgca tctttcgctg tgaccgcaat 1800 gacaatacca gcacgttaca gcgaggcgga ggcgttttaa tcgctgtgaa atcgactctt 1860 cgttgcagtg ctgttgagct aacggactac atccacctcg aacaggttgt ggttgctgtt 1920 agactgcctg atacaacgat ttacgtatgt ggaatttaca ttcggcctaa ttcccatttc 1980 gatgtatatg attcgcacgc atctgcagta caggagattt gcaacttatc atcaactcgt 2040 gactccattg tcgtcgttgg cgattacaat ctaccgcatc tgacatggca tttcgatgat 2100 gattgcaaca gttacttgcc gcgaaatgct tcttcagaat cggaaataat gttcacccaa 2160 tccatgatcg gttctggact acatcaaatc aactcacttc gtaatatgaa tggacgtatt 2220 cttgacctag cattcgtgag tgaakctagc ttcgtggagc tcatcgatcc tccatcttct 2280 ctgttgaaag ttgactctca tcacaagccg tttgttctcc gaatcgaaat tgaaaccaga 2340 actcgcgccc ctgttgaaga ggctgatgta gacgacttcg attttaagcg ttgtgatttc 2400 gcagcactga atgacatgct agtggcagtt gactggaatg aaacgtttgg cgatgcacct 2460 ctcgaccgaa tcactgtatt gttttacgac aaggttttcg acatcattcg ccatcacgta 2520 ccacggagac ggaggtgctc tgccgtgcat tcgaagcttc catggtggac accagagtta 2580 cgacacctcc ggaacatcgt tcgcaaagca cgaaagcgtt acttccggaa taaatctcac 2640 gacaacagaa agaatctcaa gtccctcgaa acgcgataca gtgtatgcca agaaaacgct 2700 tttcggagct atacaaaccg tatggaatcc aacttaaaac gggacccgaa atctttttgg 2760 acgtacatca agaagctgaa aaacatcaac cgaattcctg aagaattgtc ttacaacgga 2820 acgacagccg aaactgcaga agcttccgct aatttatttg cagacttctt cagcagcgtg 2880 cataacaacc actcccctac gttctcatca tcggcgaggc agaatattca atcttttaac 2940 gtaagtctac caatgataac gatctctcaa cacgaagtct catctgcgtt gaaaaagctt 3000 gacgtctcta aaggtgcagg cacagactgc ttacctccga cattcctcaa agagtgtgca 3060 gactcattga agacaccaat tagtgtaatt ttcaatcgct ctctacgtga acaagttttt 3120 cctgagattt ggaaagtggc ttcaatcact ccgattcaca agtccggtag tgttcactct 3180 gtcgaaaatt atcgaggcat ctcaatactg tgctgtatcg caaaggtatt cgaagagatt 3240 atacacagga gtctctacag cgcagttcgt ccaataatct ccgatgttca gcatggtttc 3300 gtaaagaaaa gatcgacagt ttcgaatctg atgagtttca caaatgttct gtcgaatagc 3360 gtcgagaagc ggcgtcaagt ggatgcaatt tatttcgact ttgcgaaagc cttcgataaa 3420 gttcctcacg aacttgtaat cagcaagctt aagcatattg ggcttcctga ctggataacc 3480 gaatggctta gatcctacct gaccgatcga aaagctcatg tgaaagtcgg ccgtgcacgt 3540 tctcattcat atgatatatc atccggtgtt ccgcagggaa gtgttctcgg cccgctaata 3600 ttcatcttat tcataaacga tctcgccgtc cgtctaaaat ctggaaagct gatgtatgcg 3660 gatgatctta agatctatag agaaatatct tcgactctcg attgctgtgc attacaatct 3720 gacgtaaatg agctgatttc atggtgtaca gagaatggta tggaactgaa catagacaag 3780 tgcaaaacta ttaccttcac gcggcgccaa acgtacgtga ctcacgacta cgcaatcggt 3840 aacaacatca tcgagcgcgt gttttccatt cgtgatcttg gcgtaataat agatagtaag 3900 ttgaagttca ctgaacacat aagcaccgtg acggcgaaag gatttgctgt gttgggattt 3960 atccgacgaa actcgcaatc gtttcgcgat gtgtacacgc tgaaggctct gtattgctcg 4020 ctggtcagga gcgtcatgga atatgcagtg tgcgtctggt ctccatacca tgcaacgcat 4080 atgctcagaa ttgagaaggt gcagcgctgt ttcatccact attgtttaag gcagttaccg 4140 tggaacgatc cgacgaatct tcccgactat gtaagtcgtt gtaggctgat agatttggaa 4200 acattgacgt ccagacgaac taagattcag cgattatttg tgtatgactt gttgtccaat 4260 aacatcgact gctcggaact gaccaacgaa ctgaggtttt acgcaccgac aaggcagctc 4320 cgcgagaggc agttgctggt tatacgaccg catagaacga catatggaca gaactgccca 4380 atgtctagtt gtttgcgggc attcaatgat gtaggatgtt attttgattt taatgtgtcc 4440 aaatgtactt ttaaacatag aattaagtat ttaaattaac agtctggcgg attaaggtat 4500 tgatccaaga cgtagcaatt aaataaataa ataaataaa 4539 // ID BEL-198_AA-LTR repbase; DNA; INV; 353 BP. XX AC supercont1.1432; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-198_AA_; KW BEL-198_AA-I; BEL-198_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-353 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1432; Positions 69327 68975. XX SQ Sequence 353 BP; 86 A; 96 C; 66 G; 105 T; 0 other; tgttggcgca agaatgggga ccgctaccct actcgtagca ccagcaagcg atgcagataa 60 ctttccccgc acgtgcggct gctaccacca agcaattgaa agcagcatcc aaaatttctg 120 tatctctccc tttttcttaa attctccttt gtcatccttt tttgagcctt ctgcctattg 180 aattgccgaa ttatataaac ccccaacatc agaagcaaaa tatagtagtt acgaattgaa 240 ttgtaaaagc cacacgcgtt gtttgattcc gtcccgaatt cccgaaagtt ttgccggtag 300 tttgtccgcc ttgtgacccg ttttcgcgcg cttttttgga cctcacgcta aca 353 // ID BEL1-NVi_I repbase; DNA; INV; 6018 BP. XX AC AAZX01006515; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1-NV; KW BEL1-NVi_LTR; internal portion; BEL1-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6018 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1167-1167 (2007). XX DR Genome; AAZX01006515; Positions 21758 15741. XX CC Positions [5076-5636] - Integrase core CC 'GTGTC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 88..2100 FT /product="BEL9-NV_I_1p" FT /translation="MAASGRRHRPLDNVPENVAMGPGDPGVVPAAIENDAA FT QRDASPPHQDRCAATGARNRVPAAPQQHVAAVGDAAAPPPRRRAPAAARNR FT APLAPLAAAAVGDAVAPPPRRRAPAGGRNRVPPAPLAAAAVNNADAAPLHR FT RTPVGARNRAPPAPPPVAVAEGAAVPAAPPSHRRAPAGTRNHVLPAPQRAD FT VLRNKQKNRPCLRSQLQRSAAAISERYRAKRRQRNKSVGDRLAKRPRFNCD FT FSDESSSSLDVDNSLHRPFFNRNQFEPRVGNFNLLNNEINDNYFRVANGMN FT NRAHSTSYNNPLTNKNCNHSSEYVIERFAKALENTCLQSNQSNLNSRLLHR FT MSTSKVLPTFSGDPLEWTRFKKAFEVSTSLGKYSDSENLMRLGEALKGNAR FT EAARSLFVAGNNTEDIMKTLEMRFGNSRLILNNIILEIKNLPSINSKKISI FT IEFATRLRNAVLAIKSFDNHSGYLSSPDLSNELIRKLPDAMISNYVRFVKA FT QGYERSDLEKISDFIFEEAELNIAAGIFPTSSSETTNEASDSSTKKPRNPQ FT KIVCALNVNEGSNNALEQPSGNCEHCGRKNHKIYTCRDFMKAPVSQRWKVV FT KSNRLCFRCLEKGHSRDECVKESCKHCARKHHELLHFHGQKNSNSRVENSQ FT KNDLVPVSTATEPVLSTLNMSL" FT CDS 3513..5975 FT /product="BEL9-NV_I_2p" FT /translation="MHSWSSNYKSVLHNNNINCDSDQLIKTGTEIQSGEKV FT LGLKWLDVSDELSFNINLSRIPIDLHKGVRKPTKREFLSVIMSVFDSLGLI FT TPFTIRSRILMQNVWSSGITWDAELHDTEFVQWKQWLCELKEVVECKVDRC FT YQLKNNQANSAELHVFSDASSKACTAVAYWRFKLSEDRYHVSIIMAKSRVV FT PLKGETIPRLELQAAVIAVRLAKLIAEEHRFVITKRVFWCDSKIVLYWIKR FT DLKDFKVFVANRLSEIREKSSASEWRWIHSAENPADDGTRLTGQTLKSTSR FT WFLRPNFLREKETFWPADEFHSENLDQNVNLEHKKSVVEVYTVVNKTRIID FT LSRFSLWTRLISTTTRIYEAVDIWKNQARTPLQRYLLAEETCIKISQLNSF FT EKEINALKNKDKLPKDSRILTLNVYLDECGLLRSNSRIIKLKAVEINTKPL FT VLDASESGTQLLIKYYHGKYYHASHKTAENELRQRYWVVGLRNKLRSLVAK FT CAICRMQRGRPSNPIMSALPECRLAYYERPFRYCGLDYFGPMTVKIGRRRE FT KRWGAIFSWMSTRAVHLELAHSLSAGSAIMALRRFTARRGTPSKMYCDNGT FT NFRGMSLEPAREIREINREKLQEYANRNKIDWIFNPPTASHMGGAWERLIR FT SVKTALAVILKEQAPKEETLLTILAEVEHAINSRTLTQLSVDPRDQEALTP FT NHFLLGFSSEQIRINRSDVQSTCPRKQWQIAQKFADAFWKRWLKEYVPTLI FT PRSKWHENDESIKVNDIVLILDDNVERNQWRKSVVTRVLPGSDGQIRIAEE FT RTANNILLRPTRKLIKFAA" XX SQ Sequence 6018 BP; 1963 A; 1122 C; 1344 G; 1589 T; 0 other; ttttggtgcc gaaacccggg acagttttgt gttccattta ttttttttct tttttgcgcc 60 gagaagttta tccagaagaa ggttagaatg gcggcttcag gtcgccgcca ccgaccttta 120 gacaacgttc cagagaacgt agcaatgggc ccaggagatc caggagttgt gcctgctgcg 180 atcgaaaacg atgctgctca aagagatgca tcgcctccac atcaggatcg ctgtgctgca 240 acgggtgcgc gtaatcgtgt gccagccgca cctcaacaac atgttgcggc agtaggcgat 300 gctgccgcgc cgcctcctcg tcgacgagca ccagcggctg cgcgtaatcg cgcgccgctc 360 gcgcctcttg cagctgcggc agtaggcgat gctgttgcgc cgcctccacg tcgacgagca 420 ccagcgggtg ggcgcaaccg tgtgccgccc gcgcctctcg cagctgcagc agtgaacaat 480 gctgatgcag cgcctctaca tcgacgaacg ccggtgggtg cacgaaatcg cgcgccacca 540 gcacctcctc cagttgcagt cgcagaaggg gctgctgtac ctgctgcacc gccttctcat 600 cgtcgggctc cagcaggtac acgtaatcac gttctgcctg cgcctcagcg tgcagacgta 660 cttcgcaaca aacagaagaa tcgtccatgt ttgcggagcc aactacaacg atcagctgca 720 gctattagtg agcgatatcg tgctaagcgt cgtcaaagaa acaagtcagt aggtgatcga 780 ttagctaaac gaccgcgttt taattgcgat ttttctgatg aaagttcttc gagtcttgat 840 gttgacaata gtttacatag acccttcttt aacagaaatc aattcgagcc tcgtgttgga 900 aatttcaatt tgttaaataa tgaaatcaat gataattatt ttagagttgc caatggtatg 960 aataatcgcg cgcatagtac ctcttataat aatcccttaa cgaataaaaa ctgtaaccac 1020 tcgagcgagt acgtaataga gagatttgct aaagcattgg aaaacacatg tttacagtca 1080 aatcaaagta atcttaattc taggttattg cacagaatgt ctacgagcaa agttttgcct 1140 actttctcag gagacccgct cgagtggacg agatttaaaa aagccttcga agtttccaca 1200 tccttaggaa aatactcaga tagcgaaaat ttgatgcgtt taggtgaagc gttaaaaggt 1260 aacgctcgag aagcagctag atcattattc gtagcaggaa acaatacgga ggatattatg 1320 aaaactttag aaatgcgttt tggcaattcg aggctaatat tgaataatat tattttagaa 1380 attaaaaatc tgccgagtat taattccaaa aagataagca tcatcgaatt tgcgacgcgt 1440 ttgagaaatg cagtgctagc gattaagtcg tttgacaatc actcgggtta tttgagtagt 1500 ccggatttat caaacgagtt gattagaaag ttgcctgacg cgatgatttc gaattacgta 1560 agattcgtca aagcgcaagg atatgaaaga tctgatttag aaaagatttc tgattttatt 1620 tttgaagaag cagaattaaa cattgctgct ggtatatttc cgacttcatc gagtgagaca 1680 acgaatgaag cgtccgattc gtccacaaag aaacctcgta atccacagaa aattgtatgt 1740 gcgctaaatg taaacgaggg ttctaataat gcattggaac aaccatctgg taattgcgag 1800 cactgtggtc gcaaaaatca taaaatttat acgtgccgtg attttatgaa agctcctgtt 1860 tcgcagcgtt ggaaagtagt aaaatcaaat cgattgtgtt ttagatgtct cgagaagggt 1920 cattccaggg atgaatgtgt taaggagtca tgcaaacatt gcgcgcgcaa gcatcacgaa 1980 ttattacatt ttcatggaca aaagaattct aacagtagag tagaaaattc acaaaaaaat 2040 gatttagtac cagttagtac tgcaacagag cccgttttat ctactctaaa tatgtcatta 2100 tgactagagc aacgtgtttt actcaaaatg attgaaataa cagtgatagg tacacattcg 2160 agcaaaatga tttacgcact attagatgaa ggttctacgg taacgataat caactcaata 2220 attattaacg agataggagg agaaaggtcg cgctacgtat ctctcgcagg cgtaggatct 2280 gaaaaatcta ttgcagttat aaaccaaaag gttaatatta aaattcaaaa tgattacgac 2340 acgtattcct tagcgagtgt ttagtaattg acgaccttgc acttcccatg caacatgtat 2400 cttcagagat cgcggagtta tgtagtaaac gcgcaaaggt tccgatatgc gattataacg 2460 cagtaccagt tttgttgatc ggacaggata attgtaaatt aatattaacc tcagaatttc 2520 gagaagttat cgggaaaaat caagtcgttt cgcgatgtga attaggatgg gttgtacacg 2580 gacattacaa aacttccgaa ttttcaaggc gcgtcaattg tgtaaataaa ctaaatcaat 2640 gcgatgaaat tatatcacga gacaatacgt tagatagact tgttaaagcg tattttgata 2700 tcgatgcagt cggattaata aacaattcaa ttcagtcaac agcagatgaa caggctttaa 2760 aaattttaga agaaacaagc cgctatatcg agaacgcgtg ggaagttggt ctattgtgga 2820 aaggagaaag cgtaaatctt ccgaatagcc gtgtgacagt attacgtcgt ttgcagttat 2880 tagagcgaaa attagacaga gacaaagact atggggaaaa atattatcga gagatggata 2940 gactatttga aaacggattt gcaaaaaaac taacggacga gatagacaat gtagaatttg 3000 gtacttaccg catttcggag tattgaatat taataaacca ggaaaagtac gcctcgtttt 3060 cgacacagca gctacaacag ctaacatcag tctaaacggg cttttactcg cgggtccaaa 3120 ttttttaaac gtgctaccag gagtattaat gcgatttaga caatttgcag ttgctatcaa 3180 aggagatctt aaagatatgt tcttaaagat taaaatcaat aaaatagatc aaaatgctca 3240 gcggtttctt tggcgaggca ggagtagaac cgaaaaaccg caggaatacg ttatgacgac 3300 tgtcttgttc ggagcgaaat cttcaccatg cacagcttta catattaaaa aataaaaacg 3360 ccgcgttgta tgcttcaaaa tacccgtcca cagtaaggag tttaatcgaa aactgctata 3420 tggacgacta cttggatagt tgcgagaccg tcgatgaagc agaatcacgg gttaaaaaag 3480 ccatagaaat aaatgcgaat gcaggataga ttatgcatag ctggtcgagt aactataaat 3540 cggtgctgca taacaataat attaattgtg attccgatca attaattaaa acaggcactg 3600 agattcaaag tggggagaaa gttttagggt taaaatggct agacgtatct gacgagttat 3660 cgtttaacat caatttatcg agaataccta tcgatctaca taaaggagta agaaagccaa 3720 caaaacgcga atttttatcc gtgataatgt cggttttcga ttcccttgga ttaataacac 3780 ctttcactat aagatcgcgt attctaatgc aaaacgtttg gagcagtgga ataacttggg 3840 acgcggaatt acatgacacc gaatttgtac agtggaaaca atggctatgc gaattgaagg 3900 aagttgttga atgtaaagtt gaccgatgtt atcagttaaa aaataatcaa gctaattcag 3960 ctgaattaca cgtgtttagt gacgcaagtt ctaaagcctg tactgctgta gcttattggc 4020 gcttcaaatt aagcgaagat cggtatcatg tttccataat catggcaaaa agtcgcgtcg 4080 tgcccttaaa aggagagaca atccctcgat tagagctaca agctgccgta atagccgttc 4140 gtcttgccaa attaattgcc gaagaacatc ggtttgtaat aacgaaaaga gtattttggt 4200 gcgattcaaa aatagtattg tattggatta aacgcgatct gaaagatttc aaagtttttg 4260 tcgcaaatcg tctgtcagaa atacgcgaaa agtctagcgc ttccgagtgg agatggattc 4320 attcagcgga aaatccagcc gatgatggta ctaggttaac aggtcaaact ttaaaaagta 4380 caagtagatg gtttcttaga cctaattttt tacgcgagaa agaaacgttt tggccggcag 4440 atgaatttca ttcggagaat ctagatcaaa atgtaaactt agagcataaa aaatccgttg 4500 tagaagtata cactgtagta aacaaaactc gaataattga cttatcaaga ttttctttat 4560 ggacgcgatt gatatctaca acaacgcgga tttatgaagc agtagatatt tggaaaaatc 4620 aagcacgaac accattacaa cgctatttgt tagcagagga aacgtgcata aaaataagtc 4680 agttgaattc gttcgaaaaa gagataaacg cgttgaaaaa caaagataaa ttgccaaaag 4740 atagtcgcat tttaacgtta aacgtgtatc tagacgagtg tgggttgcta cgttctaaca 4800 gtcgaattat aaaattaaaa gctgtcgaga ttaatacaaa accccttgtt ttagacgcta 4860 gcgaatcagg cacacaattg ttgatcaaat attaccatgg aaaatattac cacgcgagcc 4920 ataaaacggc cgagaacgaa ctgcgtcaaa ggtattgggt agtagggttg cgaaacaaat 4980 taagaagttt agtagcaaaa tgtgcaattt gcagaatgca gcggggacgt ccgagcaatc 5040 cgataatgtc tgcgcttccc gaatgtagac ttgcttatta tgagcgacca ttcagatact 5100 gtggcctaga ctatttcgga cctatgactg tgaaaatcgg tagaagaagg gaaaaaaggt 5160 ggggcgcgat ttttagttgg atgtccactc gcgcagttca tctcgagctt gcgcattccc 5220 ttagtgcggg ttccgcaatt atggcattaa gacgattcac agcgcgtaga ggaacaccgt 5280 ctaaaatgta ttgcgacaac gggacgaatt tcagaggaat gagtctagaa ccagctcgtg 5340 agattcgtga aatcaaccgt gaaaagttac aggaatacgc aaatagaaac aaaattgatt 5400 ggatttttaa tccaccgacc gcgtcgcaca tgggaggtgc atgggaaaga cttataagat 5460 cagttaaaac tgcgttagct gtaatattaa aagagcaagc tcctaaagaa gagacattgc 5520 taacgatact cgctgaggta gaacatgcta taaattcgcg aaccttgact caattatcag 5580 tagatccgcg cgaccaagaa gccctgactc cgaatcactt ccttctcgga ttctcctcgg 5640 aacaaatacg cataaataga tctgatgttc aaagtacttg ccctagaaag cagtggcaaa 5700 tcgcgcaaaa gtttgctgat gctttttgga aaaggtggct aaaagagtat gttcctacat 5760 tgattccaag gagtaagtgg catgaaaatg acgaatcgat aaaggtaaat gatattgttt 5820 taattttaga cgataacgtc gaacggaatc aatggagaaa aagcgtagtc acgcgagttt 5880 tacctggatc tgacggacaa attagaatag cggaagaacg tacagccaat aacattttac 5940 tcagaccgac tagaaaatta attaaatttg cggcgtagtt aagagtacaa cgttctagac 6000 ctttttacgg ggggagga 6018 // ID Tx1-1_BF repbase; DNA; INV; 5614 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-1_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-1_BF; KW Tx1-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5614 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5614 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 838-838 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 241..1449 FT /product="Tx1-1_BF_1p" FT /note="ORF1p." FT /translation="MAARDDISRLQDTNLWIKIDAVGMPKLSEFDIVEALY FT NTVQNELDDSIKPEEILGAQYLSQKRTWVINFTCLDAKAKAISVGRVVVGS FT KSYGILDFQKTGLRKTEIRISIHGIPHSVSDDEVAQWVDSRAERTTEVLRH FT QKKNRNADSPFQHLYSGHRFCYASKITNPFQRYGTYSIPDPTDTGSLIDIE FT VTVFHDGQHINCKSCKSEDHAFHECPHRITCHNCGKKGHTKKWCRAERSNP FT SRERRDVEQKVQQGLRAAREASNSRSTSASRAVVNNKNSGEDQDWTDWASG FT DPDETAESLVTGDSDGAENTRCETESIANPPHQEQAGLSGGTNQRRLTRST FT RPAPTSTEDTRDQGQRRQPGTSRSTRKRKNVLTPPDKSKPDKLYKDDGRHN FT DQRRNKDNG" FT CDS 1805..5536 FT /product="Tx1-1_BF_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MVRFLSINVRGLRKRSVRRTVFQYLKTKQIDIVCLQE FT TYITNDVKDAWTKEWGGKLYAHPCTSHSKGQIILIAENFDCNNVEFINLSD FT RSIGVKFIHHGEKFVCFNVYAPNNVVSKKLFYKQVLTQLQEHCCDGEYFML FT AGDFNTVLNRDLDNVAGKEHDKAEVKCFNNLTDNLELLDIWRVFHSEDKDY FT TWSKRNPFTARRLDYLLINDFLFDKTTACEIISFPCSDHRAVVLDMSVTQF FT DKGPSYWKFNNSFLKDKVFVDRVNEIIGNIQEGLSDSVIEDKQLIWDICKS FT KIKTFSISYGKQKALEKRSKHNIIDEKLERLEKQLASDPKNLTLQQQVHNT FT KAEFEALALHEAHGAQIRSKVKWIEEGEKNTAYFLRLEKSRAIHNVITSLR FT TPDGHIITDQNKLLEQQVDYYRKLYSKNDDFDIESFDDFITDVNFPTLSDM FT QSKSCEGLFTMTECTSALRNMTNCSPGLDGLTTDFYKVFWSRIGSMVLNSL FT NEGYTKNQLSSTQRKGVISLLFKGGEHQDLSRDQLSNWRPISITNTDYKIG FT AKILANRLQNIISHIVGPDQVGYIKGRSILNNIRAIDDIICFTRDSNVPGA FT ILFLDYSKAFDTISKDLIIKVLEKFNFGSDFVQWVKVYTNNARSCINHCGH FT ISEWFDLGRGVRQGCPLSSLLFILATELFALKVRQSSEIKGITLSGLGDSD FT VESRIFQYADDTELILGDKISIDRSLDIIRHFSVFSGLVLNKEKTVAMWLG FT PWRHCSEQASGLKWCTTAKCLGIVFSSACCASEVEVNWSKKYCKMKNILLS FT WASRNLSLIGKITVLKNLVASQLSYSFGALIPPDSFLKDVNLMFYRFIWKR FT DRVKRNVMVQDYSKGGLKMIDIRIMHKCCLLRSIKKLLSGSETPTPTWKKI FT QLYYFKKLGPDLCILRHNCSAVHVSSKDSLLPLPLYYRKLILTWYEMKPVQ FT NVNDIEVNQVNWQLLWNNACISYKGNMLYFKKWIRRHILYVNDIVDDQGNF FT ISFDDVKAIVGNDAHVLLQYHALINSIPKQWKGVSRLDNEESPSIKLCGKD FT IRLLDSIFFKNFCIMQYQAVPVSQNFWEKRFPMYDFRSISWHVLWKSPFLT FT TKEPKLISLQWKILHNIYPTKILLHKMRIVENNKCIFCDIIEYIEHFFFDC FT KIVKPLWDYVESLFSYSISLTVHDIIFGYNPHMLNKFRYINELLLVAKLSI FT VKYKSTVSSPNSLCQDAEIILKIFKSECILRNMI" XX SQ Sequence 5614 BP; 1828 A; 1039 C; 1156 G; 1591 T; 0 other; caatatcaaa gacagaaagg tcaaaggtca ggtgttgtca tctgacgtgt tttgtgtgag 60 ctctccagaa ttcgcgttct ggaagctctc gagcggtgct aaccgttctt tttgagttag 120 tagtgataca taagtgtgtg ctgactttcg tcgctgcctt ggaattactt tctttcaatc 180 tttggaagac tttgtgcccg tcttttgaag accagtggga gggctactcc gacaggaaag 240 atggcggcac gtgacgacat ttcaaggctc caagacacca atctttggat caaaatcgac 300 gcggtcggta tgcccaagtt gtctgaattt gatatagttg aagcactgta caatactgtt 360 cagaatgagc tggatgattc cataaaacca gaagaaatcc taggcgcaca gtacctgagc 420 cagaagcgca cctgggtaat taattttaca tgcctggatg ctaaggcgaa agcaatcagc 480 gtaggtcgag tagttgtcgg ttcaaaatcc tacggaatcc tggatttcca gaaaacgggt 540 ttaaggaaaa ccgaaatcag gatttctatc catggtattc cgcatagtgt atcggatgat 600 gaggttgcac aatgggtgga ctcgagagct gaaaggacga cagaagtcct gagacatcag 660 aagaagaaca ggaatgcgga ctcccccttt cagcacctgt actcgggcca cagattctgt 720 tatgcgtcga agatcaccaa cccattccag cggtacggta cctacagcat acctgaccct 780 acggatacag gttctctgat tgacattgaa gtgacagttt tccacgatgg ccagcatatc 840 aactgcaaat cttgcaagtc agaggaccat gcattccacg aatgccctca tcgcatcact 900 tgccacaact gcggaaaaaa aggtcacaca aaaaagtggt gcagggccga aagaagtaat 960 ccatcaagag aaagaagaga tgttgagcag aaagttcagc aaggactcag agctgcccgg 1020 gaggcgtcca actctcgatc gactagtgct tcaagagcag tcgttaataa caaaaatagt 1080 ggagaagatc aagactggac agactgggct tctggtgatc ccgatgagac agccgagagt 1140 ctagttacag gggacagtga tggcgcagag aacactcggt gtgagacaga gtcaattgca 1200 aaccccccac atcaggaaca agccggtctc tctggtggaa ccaatcagag aagacttaca 1260 cgttcaacgc gcccagcgcc caccagcact gaggacacta gagatcaagg tcaacggcgt 1320 caacctggga catcaaggtc aactaggaag agaaagaatg ttctgactcc gcctgacaag 1380 tcgaagccgg acaaactgta caaagacgac ggtcgccaca acgatcagag gagaaacaaa 1440 gacaatgggt aggacttagg tctatcttcc atcaacaccc atggttcgac tactcaccat 1500 caacacgcga ggcttaagga aacgaacggt acggcgaact gtttttcaat accttaaaga 1560 taagaagata gacattgctt ttatgcaaga aacttacatc agcagtgctg atatagagac 1620 atggaagaca gaatggggag ggatgcttta cgcacaccct ggtacctgtc atagcaaagg 1680 acaagttatt ttgatatcta aaggttccaa cattaaaaat gtggaattta tagatatatc 1740 tgagagggtc attggtataa aattcataag tcaagatgaa acattctgat tgacatttgt 1800 tgtcatggta cgttttctta gcatcaatgt cagaggacta cggaaaagat ccgtccgtcg 1860 aactgttttt caatatttga aaacaaaaca aattgatatt gtttgcttgc aggaaactta 1920 tataaccaat gatgtcaaag atgcatggac aaaagagtgg ggtggaaaac tttacgctca 1980 tccttgcacc tcacacagta aaggacaaat tatattaata gctgagaatt ttgactgtaa 2040 taatgttgaa tttataaatc tttctgatcg ttcaatcggt gtgaaattta tacaccatgg 2100 tgagaaattt gtttgtttca atgtatatgc tcctaacaat gtagtgagta agaaactgtt 2160 ctacaaacag gttctaactc aactgcaaga acattgttgt gacggtgaat attttatgtt 2220 agctggtgat ttcaatacag tactcaatcg tgatctcgac aatgtagcag ggaaagagca 2280 tgataaagca gaggttaaat gttttaataa tttgacagac aaccttgaac tacttgacat 2340 ctggagagtc ttccatagtg aagacaagga ttacacatgg tctaagagaa acccttttac 2400 agccaggaga ctagattatc tcctgattaa tgactttttg tttgataaaa ctactgcatg 2460 tgaaatcatc tctttcccct gcagtgatca tagagctgtt gtattggata tgtctgtaac 2520 acagtttgac aaaggacctt catattggaa attcaataat tcatttctaa aagataaggt 2580 gtttgttgat cgtgttaatg aaatcatagg aaacatccaa gaaggcctta gtgactctgt 2640 gatcgaagat aaacagttaa tatgggacat ctgtaagagt aagataaaaa ctttctcaat 2700 tagttacggg aaacaaaagg ctcttgaaaa acgatccaaa cacaacataa ttgatgaaaa 2760 attagaacgg ctggaaaaac aattagcttc cgatccaaag aatttaactt tacagcaaca 2820 ggtgcacaac acaaaggctg agtttgaagc attagccctt cacgaggcgc atggtgcaca 2880 aatccgctca aaagttaaat ggattgagga aggagaaaaa aacacagcct actttcttcg 2940 acttgagaag tcacgtgcca tacacaatgt tattacaagc ctcagaacac ctgacggtca 3000 cataataaca gatcaaaata aattattgga acaacaagtc gattactata gaaaactata 3060 cagtaaaaac gacgatttcg acatagaaag ctttgatgac tttattacag acgtaaattt 3120 tccgacgttg tctgacatgc aaagtaaatc ttgtgaaggg ttgtttacta tgacggaatg 3180 tacatctgct ctacgtaaca tgacaaattg cagtcccggc ctagatggcc ttactactga 3240 cttttataaa gtcttttgga gcagaatagg atctatggtt ttgaactcac taaatgaggg 3300 ctacacgaag aaccagcttt cttccactca acgtaaaggg gttatatcct tactctttaa 3360 agggggtgaa caccaagatt taagtagaga ccagctgtct aactggcgac ctatctcaat 3420 cacaaatacg gactacaaaa taggtgcaaa gatattagca aatagattac agaacattat 3480 ctctcacatt gttggaccag accaagttgg atatatcaag ggacgtagta ttttgaataa 3540 catacgagcc atagatgaca taatttgctt tactagagat tctaatgttc caggtgccat 3600 tctcttctta gattacagca aagcctttga tactatttcc aaagacctca ttataaaagt 3660 tcttgaaaag ttcaatttcg gttcagattt tgttcagtgg gtaaaggtat acactaataa 3720 tgcccgtagt tgtatcaatc attgtggtca tatctctgaa tggtttgatc tgggaagagg 3780 tgtacgtcaa ggatgtccct tgtcttctct tctgtttatt ctagccacag agctctttgc 3840 tcttaaagtc cgccagtctt ccgaaatcaa aggtattaca ttatctgggc ttggcgactc 3900 cgatgttgaa tctcgaattt ttcaatatgc agatgatact gaactgatac ttggtgacaa 3960 aatttctatt gatagaagtt tagatattat tagacatttt tccgtttttt ctggtctagt 4020 tttgaataaa gaaaaaacag tagccatgtg gctaggccct tggagacact gctctgaaca 4080 agctagtggg ctaaaatggt gcaccacagc aaaatgttta gggattgttt tctcatctgc 4140 atgttgtgcc tctgaagtag aagtaaattg gagcaaaaaa tactgtaaga tgaaaaacat 4200 tttattatct tgggcttcac gcaaccttag tcttatagga aaaataacag tgctcaaaaa 4260 ccttgtagct tcacaactgt cttacagttt tggtgcactt ataccaccgg atagcttctt 4320 aaaggatgtt aatttgatgt tttacagatt tatctggaaa agagatagag taaagcgtaa 4380 tgtgatggtt caagactatt caaagggagg attgaaaatg atagatataa gaataatgca 4440 caaatgttgt ttattacgat cgataaaaaa attgttatcc ggtagtgaaa ctcctactcc 4500 tacatggaag aagatacaac tgtattattt taagaagtta ggacctgatc tttgtatatt 4560 gagacataac tgctcggcag tccatgtgtc atctaaagat tcactgttgc ccttacctct 4620 ctattaccgt aaactcatac ttacttggta tgaaatgaaa cctgtacaga atgtcaatga 4680 tattgaggta aatcaagtaa attggcaatt gttatggaac aatgcttgta taagttacaa 4740 gggcaacatg ctttatttta agaaatggat tagacgacat attctttatg taaatgatat 4800 tgtggatgat cagggtaact ttatatcttt tgatgatgta aaagctattg ttggtaatga 4860 tgcacatgta ttactacagt atcatgcttt aatcaattca attcccaaac aatggaaagg 4920 tgtttctcga ttagacaacg aagaatctcc tagtatcaaa ctttgtggta aagacatccg 4980 cttactagac tctatcttct ttaagaattt ctgtatcatg caataccaag cagtcccagt 5040 ctcacagaat ttctgggaga aacgatttcc tatgtacgat tttcgttcca tttcttggca 5100 tgtactatgg aaatctccct tcttgaccac taaagaaccc aaacttatta gtttgcaatg 5160 gaagatcttg cataatattt atccgaccaa aatattgttg cataagatga gaattgtaga 5220 aaataataag tgtatttttt gcgatattat agaatatata gagcattttt tctttgactg 5280 taaaattgtt aaaccattat gggattatgt ggaaagtttg tttagttatt ccatatctct 5340 cactgtacat gacattattt ttggatacaa ccctcatatg ttaaacaagt ttcggtacat 5400 caatgaactg ctgctagttg cgaaactatc cattgtaaaa tataagtcta cagtgtcatc 5460 cccgaattca ctatgtcaag atgcagaaat cattctgaaa attttcaaat cagaatgtat 5520 tcttagaaat atgatataag ctacacattg tacagctttc tgtgacatgt aataaagtgc 5580 cgtgacgaac acggaatcag gaagaaaaaa aaaa 5614 // ID Harbinger-N17_BF repbase; DNA; INV; 553 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N17_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N17_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-553 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-553 RA Kapitonov V. and Jurka J.; RT "Harbinger-N17_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 810-810 (2008). XX DR [2] (Consensus) XX CC It contains 33-bp, its copies are flanked by TWA TSDs. XX SQ Sequence 553 BP; 135 A; 133 C; 101 G; 183 T; 1 other; ggccacactg acttgatttt atggatgaca tccgcgcgcg cattactttt cgcctgttct 60 caaaaaaaat cacctcctcc gaggctccac gcaagtccgg aacttgccga ctcatattgc 120 cgcaaaaggc cagcatgtgc cggcttgtgt aaaactcgtt atgtactacc gcaaaacgtg 180 ctaattttgt caaacccttg gtcaaactgc ccttatgagt attctcatca cagtcatata 240 tcatccacag tccttggagc catatacctt ctaatagtgt acttctttga tcgttcttgt 300 caatatatta aayatcgatt acttcagctg agtctgctaa gatgaacaag acaacctaca 360 tgtgcctcct acaaataaag acttgttgtt tcttttgata cactgtgttc tgtggttcaa 420 tattcgtaat accagctttg tagtcctgtg gtttagcagc attttttttg ctggattttt 480 ttctttttcg ccctcccgca ttagttttgg ggtctccaga ggatgtcatc cataaaatta 540 agtcagtgtg gcc 553 // ID Gypsy-41_CQ-I repbase; DNA; INV; 7537 BP. XX AC AAWU01014757; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_CQ_; KW Gypsy-41_CQ-LTR; Gypsy-41_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7537 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 461-461 (2011). XX DR GenBank; AAWU01014757; Positions 12934 5398. XX CC Positions [3579-4100] - Reverse transcriptase CC Positions [5064-5540] - Integrase core CC 'CCTG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 640..2757 FT /product="Gypsy-41_CQ-I_1p" FT /translation="MDMAHLLSLYRGMNTGHLTVDELEHELKIRDIPFDSS FT RSGAERALRNRLKEERELKHVEYELMNGSVLDELEACELKVSEIKSHLENR FT KSKQAPEQSFKTRILHCVFRLERLRTYTTKDDELNSLAETARACMKLLNTF FT FSISSHLPEVREAELALINQSLTDIMKKQDAEKGNGKGAEEDNESWKSTEE FT NNGEEIAGIVGETGSVSGNGKGNGSGVGSGVGSGDGKSVGKPGDVERAEFE FT QWKEQKVELINVVNKLLEHIQVLEAKQVEKEVEKEKPEVQAKPANSTQLGV FT DTNKEKKDEGKSQQNPGDFLAWLNQRNSSLGSFGNSENEHNKPEEVHSGKD FT EPKEKKTGFGRSLPVHKWTVRYDGMDNGRKLNEFLKEVEFNARSENISESE FT LFQCAHHLFTRKARSWFMEVNGNNELGSWKKLVDELKSEFLPIDIDYVYER FT QANNRKQMSREKFQDYYLDMVRIFRCMTNPWDEKRKFDVLFRNTREDCQIA FT MLAAEIKTIPSMKEFGKRFDSVNWKLYQKGERYTPRSAHVEEVQTQRSSYQ FT NTGNRYQNSNRFQGGNRSQGGYQQQNQNQQNRPYYKKNNSNGNYNQKSSWK FT PEQQRQQSNGNQKENKPRVDERAQPKPNTQNRENNTAASSSGTSALQRIVK FT AYIPVKRDICFNCHVAGHGHEECRQERSVFCVRCGFPGFSTKDCPYCEAKN FT MQKTAQ" FT CDS 2889..5912 FT /product="Gypsy-41_CQ-I_2p" FT /translation="MLVTVSNGDRRPFARVEMLGISVVGLLDSGAQRTVLG FT VGGRKLVKLLSLKLKPTNIDLKTAGGESLEVIGCSDIPITFNGVTKILSVM FT IAPKLNRRCILGYDFWQKFGIRPTMESVEVVDEQVDEGLEDAEEELEEEQK FT KQLEDVQKMFLVASEGVVGKTDLLEHKIELKEEFIQSEPVRKNPYPWGPEI FT QKKIHIAVEKMLRDGIIEPSESDWALPVVPVKKRDSEEIRLCLDARKLNER FT TKRDAYPLPHQNRILSHLGPFKYLSTIDLSQAFLQVPLSEECRKYTAFSIP FT GMGLFQFKRLPFGLINSPATLSKLMDKVLGFGALEPSIFVYLDDIVVASHT FT FDDHIAKLTELAERLKNANLHINLEKSKFCCYELPYLGYILSREGMRPNPD FT RVQAILGYEAPNSVRSLRRFLGMVNYYRRFIDKFSELTAPLTDLLKNKPKR FT VVWSKAADLAFRAIKERLISAPVMANPDFKLPFCVQTDASDTAIAGVLTQV FT HDGMERVIAYHSEKLKGAELNYHAAEKEGLAALRCIEKFRCYIEGTHFTLV FT TDSSALTFIMRAKWRSSSRLSRWSITFQQYDMEVKHRKGKENIVPDALSRS FT IETLDVGDDDNWYRNLYNAVEDDPEKYLDFRIENGKLFKFVSSKSDVLDYR FT FEWKECVPESAREQVMRQEHDGCLHIGFEKCIEKLKRRFYWPRMSADLKKY FT INKCDTCKETKHSTVSTEPMMGQQRVAVRPFQIVCMDYIQSLPRSKKGNAH FT LLVVLDVFSKYCLLAPVRKISSASLCSILEEQWLRKLSVPQYIITDNATTF FT LSKEFQDLLKRYGIQHWANARHRSQANPTERLNRTINAMIRTYVRQDQRLW FT DTKISEIEFILNNTVHATTKFTPHRVVFGHEAVTRGMDHQLEDDEELTEQE FT RMERMHGVNKKTYELVKENLQKAHESTKRQYDLRHKRYSPVFDVGQRVFKR FT SFQQSAANKSFNAKLGPTYVPCIIVAKKGTSSYEVSDMNGRSLGVFSAADL FT KA" XX SQ Sequence 7537 BP; 2254 A; 1449 C; 1943 G; 1891 T; 0 other; cttggcgccc aactagaaat tagtttaact gtttcgtgtt tgggttcact tgaggacttt 60 agacgatttg cccacttctc tgggaaaacg cccattattc aggtgcaaaa gttgtctaat 120 cgtcaaaata caaacagaag tgacgctaat atttgaattt ttgagttccg aggaggggtt 180 ttggaagtct acgctaggat gaacgaccac cagacattgg caaagtaatt ggttggagat 240 ttcgtaaaat tttgtttgaa ttccggaata gaattcgtat cggtggagtg tcgtttcctt 300 cggactagtt gagtgaaaac accaggcaaa gaagttattt attcaccgta caaaagtacg 360 attacaaaag tttcgactta cactaggttt ttacaattat tcgctatttg ctcattcatt 420 caagagcgta attcaattct aggttttttt ttcaattcat tgttgatccg ttggtgttca 480 tcaatgaaaa gtttcattca ttaggttatc aaattatctc agttagcgaa tcaattttag 540 gtatttttta aatttgtagt tgtgattgct catttctagt ttttttttaa atgaattttc 600 tttttaatta caagatcttt gattttcctt gcactcaaga tggatatggc tcatttgttg 660 tcactgtatc gaggaatgaa tacgggtcat ttgaccgttg atgaactcga acatgagctg 720 aagatccgcg acatcccatt tgattcatcg cgcagcggag ccgaacgtgc gttgcgtaat 780 cgcctaaaag aagagagaga attgaaacat gttgagtatg agttaatgaa tggttcagtg 840 ttggatgaat tggaagcgtg tgagttgaaa gtgagtgaga ttaagtccca cttggaaaac 900 cggaaatcca agcaagcacc agaacagtcg ttcaagacta ggattctgca ttgcgttttc 960 cgcttggaac gtttgagaac atacaccacc aaggatgatg aactgaattc attggcggaa 1020 acagctcgag cgtgtatgaa gctgctaaat actttctttt ctatttcgtc acatttgccc 1080 gaagtacgtg aagcagagtt agcactcatt aaccagagtc taacggacat aatgaagaaa 1140 caggatgcgg aaaaaggaaa tggaaaagga gcggaagaag ataatgaatc ttggaaaagt 1200 actgaagaga ataatggaga ggaaattgct ggaattgttg gtgaaactgg atctgttagt 1260 ggaaatggta aaggaaatgg tagtggagtt ggtagtggag ttggtagtgg tgatggaaaa 1320 agtgttggaa aacctggtga cgtggaaaga gctgaatttg aacagtggaa agaacaaaaa 1380 gtggaactaa taaacgtggt caacaagttg ttggaacaca ttcaagttct agaagcgaaa 1440 caggttgaga aagaggtgga gaaagaaaaa ccggaagtac aggccaagcc agccaacagt 1500 acgcaacttg gagtagacac gaacaaggaa aaaaaggacg agggtaaaag ccagcagaat 1560 ccaggtgatt ttctcgcgtg gctcaaccaa agaaatagtt cattgggaag cttcggtaac 1620 tcagagaatg aacataacaa accagaggaa gttcattccg gcaaggatga accgaaggaa 1680 aagaaaaccg gatttggtag gagtttaccg gtgcataagt ggacggttcg ttatgacggg 1740 atggacaacg ggaggaaact gaacgagttt ttgaaggagg tggagttcaa cgctcgttca 1800 gagaacattt ctgagtccga gttgttccag tgtgcgcatc atttgttcac gaggaaagca 1860 cgttcttggt tcatggaagt gaacgggaat aatgaactgg gttcatggaa gaagttggtg 1920 gatgaattga agagcgaatt cttaccgatc gacatcgatt acgtctatga gcggcaggcc 1980 aacaatcgca agcaaatgtc cagggagaag ttccaggact attacttgga catggtccgg 2040 atcttccgct gtatgacgaa tccgtgggat gagaaacgga agtttgacgt tctcttccgc 2100 aatactcgcg aggactgcca aatagcgatg ctagccgcag aaatcaagac gattccatcg 2160 atgaaggaat tcggtaagcg attcgactct gtcaattgga agctgtatca gaagggagaa 2220 aggtacacgc cgcgatcagc gcacgtggag gaggtgcaaa cacagcggtc gtcgtaccag 2280 aacaccggaa ataggtacca gaactcgaat cgtttccagg gtggaaaccg atcgcagggt 2340 ggttaccaac aacaaaatca gaaccaacaa aaccgtccgt actacaagaa gaacaacagc 2400 aacggaaact acaatcagaa aagctcgtgg aagccagaac agcagagaca gcagagcaac 2460 ggaaatcaga aggagaacaa accaagggtt gacgagaggg ctcaaccgaa accgaacacc 2520 cagaatcgcg agaacaacac tgctgcaagt tcgagtggaa ccagtgcctt acagcggatc 2580 gtgaaggcgt atattccggt caaacgggat atttgcttca attgtcacgt tgcaggtcat 2640 ggtcacgagg aatgccggca ggagaggagc gtgttctgcg tgcggtgtgg ttttccgggg 2700 ttttctacga aagattgccc gtattgtgaa gcaaaaaaca tgcaaaagac tgctcagtga 2760 ggcgagtcag tctcacggga agctccagaa gacctcatga agataccgaa gcgtgggaag 2820 cactccaaca gctaggatat tcgagggttg atggagaaaa gaaagaagaa gatgctgaag 2880 tggcaacgat gcttgttaca gtgagcaacg gcgatcgtag gccgtttgca agagtggaaa 2940 tgttgggaat atcggttgtt ggacttcttg atagtggtgc acagcgcaca gttttaggtg 3000 ttggtggaag aaagttagtg aagttgttaa gtttaaagtt gaaacccacg aatattgatt 3060 tgaagacagc cggaggtgag agtttagaag tgattgggtg tagtgacata ccgattacgt 3120 tcaatggtgt tacgaagatt ctgtccgtga tgatcgcgcc gaagctgaac aggaggtgca 3180 tcctcggcta cgacttctgg cagaaattcg gcatcaggcc aacgatggaa tccgtcgagg 3240 tggttgatga gcaggtcgat gaaggtctag aagacgcaga agaagaatta gaagaagagc 3300 agaaaaaaca gttggaagat gtacagaaga tgtttctcgt tgccagcgaa ggagtcgtcg 3360 gaaagacgga cctgctggaa cacaaaatcg agctgaagga ggagttcata caatccgaac 3420 cagtccgaaa gaatccgtat ccgtggggcc cagaaataca aaagaagatt catatagctg 3480 tggaaaagat gctccgggac ggaatcatcg aaccgtcgga gtcggattgg gcgttgccgg 3540 tcgttccggt caagaagaga gacagcgagg agatcaggct gtgtctcgat gctcgtaagc 3600 taaacgagcg caccaagagg gatgcgtacc ctctgcccca tcagaacaga atattgagtc 3660 atctggggcc gttcaagtac ctgtccacca tagacttgag ccaggcgttc ttgcaggtgc 3720 cgttgagtga ggagtgcagg aagtacaccg ctttctccat tcccggaatg ggattattcc 3780 agttcaagcg acttccgttt ggactgatca acagcccggc aaccttaagc aagctcatgg 3840 acaaagtctt aggattcggg gcacttgagc cgtccatttt tgtctattta gatgacattg 3900 ttgtggccag tcacacgttt gacgaccaca ttgcgaaact gacagaactt gcggagcggt 3960 tgaaaaacgc gaatttgcac atcaacctcg aaaagtccaa gttctgctgc tacgaattgc 4020 cgtacctcgg ttacatcctg tcgcgagaag gaatgcggcc taatccagat agagtccaag 4080 ctatcttagg atatgaggcc ccaaactcgg tcaggtcgct gagacgattc ctcggtatgg 4140 tcaattacta ccgtaggttc atcgataagt tcagcgaact tactgcgccg ctcacggact 4200 tgctaaagaa caagccgaag cgagtggtgt ggtccaaagc agcagactta gcttttcggg 4260 ccatcaagga gaggctgatt tcagctccag tgatggccaa tcccgacttt aagctaccgt 4320 tctgtgtcca aacggacgca agcgacaccg cgatagccgg agttctgaca caggttcacg 4380 atggtatgga gagggtgatc gcgtatcact cagagaagct gaagggtgct gagttgaact 4440 accatgctgc ggagaaagag ggtttagctg ctctccgttg catcgagaag tttcgctgct 4500 atattgaggg cactcatttc acgctggtga ccgattcatc cgcgctcacg ttcattatgc 4560 gggcgaagtg gaggtcgtcg tcgcgtctga gtcgttggag cataaccttc cagcagtacg 4620 acatggaggt gaagcacaga aagggaaagg agaacatcgt acccgacgcg ctgtcgcgtt 4680 cgattgagac gctcgacgtg ggcgatgacg acaactggta ccggaatctg tacaatgctg 4740 ttgaggatga tccggagaag tatctggact tccgcatcga gaacgggaag ctgttcaagt 4800 ttgtgtcgtc caagtccgac gtgctggact acagatttga atggaaggag tgtgtaccag 4860 aatcagcacg agagcaagtc atgcggcaag aacacgacgg ctgcttgcac ataggattcg 4920 aaaagtgcat tgagaagctc aagaggaggt tttactggcc tcgaatgagc gcagacttga 4980 agaagtacat caacaagtgc gacacttgta aggagacgaa acattcgacc gtgtctactg 5040 agccgatgat gggacaacaa cgagtagcgg tcaggccatt ccagattgtc tgcatggact 5100 acatacagtc gcttccgcgt agcaagaaag ggaacgcaca cttattggtc gtcttagacg 5160 tcttttctaa gtattgcctt cttgctccgg tccggaagat ttcgtctgct agtctgtgta 5220 gcattctgga agaacagtgg ctgcgaaagt tgtctgtgcc ccagtacatc atcaccgaca 5280 acgcaactac gttcctgtcg aaggagttcc aggacctgtt gaaacggtat ggaatccagc 5340 attgggcaaa cgctcgtcac aggagccaag caaatcccac cgagcgcttg aaccgtacga 5400 tcaacgcgat gatccgcacg tacgtccggc aagatcaacg gctgtgggat actaaaatct 5460 cagagattga gttcatcctg aacaacacgg tccacgcaac aacgaagttt acgccacaca 5520 gagtagtgtt cgggcacgaa gcggtgaccc gaggaatgga ccatcagctc gaggacgatg 5580 aggaactcac cgagcaagag cgtatggaac gcatgcatgg agtgaacaag aagacctacg 5640 agctggtcaa agagaacctg cagaaggcgc acgagagtac gaaaaggcag tatgatctga 5700 ggcacaaaag gtactcaccg gtgttcgatg ttggacagcg cgtctttaag agatccttcc 5760 agcagtcggc ggcgaacaag tcctttaatg ccaagcttgg gcctacgtac gtcccgtgta 5820 tcatcgtcgc gaagaaggga acgagttcct acgaggtgtc cgacatgaat ggtcgctcac 5880 tgggagtctt ctcagcggcg gatttgaaag catgatctct gaaagaggga catggcttag 5940 ttcgaggaca gaggttcatc tgagaggagg tcaaggtcga tatagatgtg attccgcaaa 6000 ttcattcatc aggagttctc atctaagtcg ccttgcgcaa tccacagcgc atgaatgaac 6060 cgagtgtcac tcatcaatgg gttcattgac acatgaacgc ccacgcatcg caagcaccag 6120 tatggaaaag tgcatcctag gaaatcattg atcaaatttt aagtgaaagt tttaaagtgt 6180 cgtaattgag tgttattttg tgtttgtgaa tcaattttat gttttatcgg tgcacaatgt 6240 tccattcatc tctcaggcaa cattatcaag aatatttcta aagttgcact acgagaaggg 6300 aatgtggaag gaaaattgca tcggagttca aagtgtgtgt gaaagaacaa gttaatgagt 6360 cgtgaagagt gagcaaatga gttggctaac catgaagttg caagaagatt tttgtcatat 6420 accatgcatc actatttagc tttttcaatt tataactagg attttttcgc ttaatttttc 6480 acctagtgga cgcacttccg tgaaatttga gcacttcata cttaccaaaa gcatagcatt 6540 caaaattcat atcagagcat attgcatctc tgatgagctt gtttgtgaga tatgggattt 6600 ttggggtaga attggggaat aaaacttcaa atgattatat aggtaccttt ttaaactgat 6660 acgtatcagg aaaataataa ataaattccc atttctacac aaatcgccga ttttctcaca 6720 aacaaccgca tcgcgatctt gcacaggcca catttttatc accaatagta gtttgacagg 6780 ttcaggtcac tcagcaaacc cgaagaagca gcacacacac tgatacgaat gataacgata 6840 aggattatag cacgaattgg ttcgataaag gagatctccg acactcacgt agtgaattat 6900 ttcaaccgtc gcttgatgcg aataggtttt tgatcgcgat atttgtttga atttgtttat 6960 tccggaatat aattaattta ttaccatggt attatttgta atttgcatgc agcataaaat 7020 ttgaaaatca aaaaaaaaat ttaaaaaaaa tcaaaattta taaaaatttg catgcttgag 7080 tataatttta atttgtaaat attttaattt gttatttatt tatttattta tttgacttgt 7140 aaatattgta atttaaatgt tgtcaatgtc ggttaatata tcaagaatgg agatcgctgg 7200 agaccggagt tgaaatatac gttgatggtc agtaacaggt ttcggccttg aaactgagag 7260 gagtcgactt aggttgacga tgataactta tgaccccgcc cattaccctt gatataagtg 7320 ttgttaactt gtgtattctg acttgactta gtacaacgga ctcaattttc aaatatttct 7380 ggttactttg agttaaaaca acgatgtgca gtcaattctt agttcttgtc tcgtggttca 7440 gaggtaagcg gttcgtatat agtgctcagt gcaaaacgaa aaatgttaga aaaaaaatat 7500 tagttctaat atttttttta cccgagcgag gggagag 7537 // ID L2-1_BM repbase; DNA; INV; 3514 BP. XX AC . XX DT 30-APR-2010 (Rel. 15.07, Created) DT 30-APR-2010 (Rel. 15.07, Last updated, Version 1) XX DE L2 non-LTR retrotransposons from silkworm. XX KW L2; Non-LTR Retrotransposon; Transposable Element; L2-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3514 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1047-1047 (2010). XX DR [1] (Consensus) XX CC L2 type non-LTR retrotransposons, 98% identical to the consensus. XX FH Key Location/Qualifiers FT CDS 257..3202 FT /product="L2-1_BM_1p" FT /translation="MTDMLNCSDSSDYLSITSSSSGSEDSFLSIPSLAETL FT DSNFSDVPRNLNVVHINAQSIPAHFSDMLVSFDIKNIHAVLVSESWLKPSL FT PSISYSLPGFNLIRNDRIGKGGGGVAIYLRSHIPYTIVSASDQSPLPNVSE FT HLFIEVGLTHAKILLGVFYSPSLHINYFSSFEKVLEDLLPCYEHTIIMGDF FT NTCLLKRDHRSSSLESIVTSSNMHILPLSATHHFPNSSPSLLDLILVSSLE FT HVARHGQCSADSFSYHHLIFLSYKIRPPKRKPTVILQRNFGRMDMDAFVAD FT AKTVDWEPVMAASSIDDQVAIFNSLMIXLYDIHAPLRAIKIKHQPAPWITD FT HIRDLKRRKSAAKTKYKLRPTEFNRTRYVKIRNRCNMVCRDAQRRHIHDSV FT KVEDPAKVWKFLRSLGVGKSRQECNSTNINIDQMNIFFTSSSLLDDVVKSS FT TINSLSNITAPDHSSFKFRQFTACDVRRSILSIASNAIGTDSISRNMVIPL FT VDILLPILCHILNXSISNCVFPSVWKEAQIIPLPKKPNPKSFSDYRPISIL FT PFLSKVLERLIHNQLSSFLNSHDLLNPFQSGFRPGHSTVTALVKITDDIRL FT GMENGQLTVLTLLDFSNAFNTVDFDILLAVLRSFNISPTVIEWFRNYLYGR FT RQCVRLGESISSWCNTSAGVPQGGVLSPLLFSLFINSISCNLTSSYHLYAD FT DLQLYCQAPLPQLHDAIRTMNSDLRVLSEWSNAYGLKLNPTKTQCIIVGSQ FT RTCSKIEWTHLPQLSLDGINIPFSDTVKNLGVIIDRSLSWGPQLVSVSRKL FT FSSAGSLRRLRNFLPTSTKIALAQSLLLPIIDYADASYLDLTEAQLNKLER FT LQNFIIRFIFGLRKYDHVSEYRVKLKWLPIRYRRNVHILSLLYSVLFNSSS FT PRYLQERFDFLHTASSHSLRSSENLMLKFPAHGTSFYDHSFAVEAVRLWNA FT LPLSVKQSSSPEIFKSRLKSHYFASMI" XX SQ Sequence 3514 BP; 900 A; 762 C; 617 G; 1232 T; 3 other; ttattctaca tcagttcgta atttacaatt gcacttcatg ttgtatttgg caaatgcaat 60 gcacatggta aaatttgttt ctgtcttgtt taagttttaa tttttaagtt ttctcttcgt 120 atcagacgcc ggcaggttga gtgaatgtat ttattttttg tttattatta ttgtttatat 180 atatatatat atatattatt gttagtgtta tatatatttt tgttacttat ttgtgtttat 240 aataggtatt atctttatga ctgatatgtt gaattgtagt gatagtagtg actatttatc 300 tattacttca tccagtagcg gaagtgaaga tagcttcttg agtatccctt ccctcgcaga 360 aacgttggac tctaatttct ccgatgtgcc taggaatctg aatgtagttc acataaatgc 420 gcagagcatt ccggctcatt tctctgatat gctggtatct tttgatataa aaaacatcca 480 tgctgtatta gtctccgagt cttggcttaa accgtctctt ccttctatct cgtattcttt 540 acccggcttc aacctaattc gcaatgatag aatcggcaaa ggcggcggtg gtgttgccat 600 atatctacgc tcccatatac catatactat cgttagcgcc tccgaccaat cacccttgcc 660 gaatgtcagc gagcacttgt tcattgaagt cggtctaaca catgctaaaa tcttacttgg 720 agtcttttat agcccatctc ttcacattaa ttatttttct tcttttgaaa aggtcttgga 780 agacctactt ccttgctatg agcacaccat aattatgggt gacttcaata cctgccttct 840 taaacgtgat catcgttctt cttccttaga aagcattgtc acgtcgagca acatgcacat 900 tctcccactt agcgctaccc accattttcc aaattcttcc ccatccctgc ttgatctcat 960 cttagtttct tccctagagc atgttgccag gcacggacaa tgctcagctg actctttttc 1020 ctaccatcat ctgattttct tatcctacaa aattcggcct cctaaacgta aacctaccgt 1080 tattctgcaa cgcaattttg gtagaatgga catggatgca tttgttgcgg atgccaaaac 1140 tgtagattgg gagcctgtga tggcggctag ctcaatcgat gaccaagtgg caatatttaa 1200 ttcactaatg atcmgtttat atgacatcca tgctccctta agggcgataa aaattaaaca 1260 tcagccggcg ccttggatca ctgaccatat tagagatctg aagaggagga agtctgcggc 1320 taagactaag tataagttgc gcccaaccga gtttaaccga acacgttatg ttaagatcag 1380 aaatcgttgc aatatggtgt gtagagacgc tcaaagacgt cacattcatg attccgttaa 1440 agttgaagat ccagcaaaag tctggaaatt tctaaggtca ttaggagttg gtaaatctcg 1500 tcaagagtgt aattctacaa acattaacat tgatcagatg aacattttct ttacttcttc 1560 gtctttatta gatgacgttg ttaagtcatc taccattaat tctctttcta acattaccgc 1620 tcctgaccat tcttctttta aatttcgtca attcacagct tgtgatgttc gaaggagtat 1680 cctttctata gcgtccaatg ctatcggaac agatagtata agtcgtaata tggtcatccc 1740 attagttgat atcctgcttc ctattctttg ccacatmctt aattwttcaa tttccaactg 1800 tgtgtttcct tctgtttgga aagaagcaca gataatacct ctgccaaaaa aacctaaccc 1860 taaatccttt tccgactatc gacctatatc cattcttcct ttcctatcta aagtcctcga 1920 acgccttatc cataaccagt tgagttcgtt ccttaatagt catgatctcc tgaacccctt 1980 tcagtccggt tttcgtcctg gccatagtac agtcactgct ctggtaaaaa tcaccgacga 2040 catacgcctg ggtatggaaa acggtcagtt aactgtgttg acactgctgg acttcagcaa 2100 tgcattcaat acggttgatt ttgacatact acttgccgtt cttcgctcct tcaacatatc 2160 tccaactgtg attgaatggt ttcgtaatta cctgtacgga cgccgacagt gtgttcgcct 2220 tggtgaatcg atttcatctt ggtgtaatac atctgccgga gtgcctcagg gtggcgtttt 2280 atctcctctt ttgttttctc ttttcattaa ttcaatatct tgtaatctca cttcgtccta 2340 ccacctttat gcagacgatc ttcagctgta ttgtcaagct ccacttcctc agttacatga 2400 tgctattcga actatgaata gcgacttacg cgtcttatct gagtggagta acgcgtacgg 2460 cctcaaactg aatccaacaa aaacgcagtg tataattgtt ggcagccaaa ggacatgcag 2520 taagatagag tggacacatc tgccccaatt gtctcttgac gggataaaca tcccgtttag 2580 tgacaccgtt aaaaatcttg gtgttatcat tgaccgcagt ttgtcatggg gaccacaact 2640 ggtgtcagtc agtcgcaaac tattttcctc tgcgggttcc ttacgtaggt tgcgtaattt 2700 tcttccaaca tctactaaaa ttgcacttgc acagtctctg ttactcccca ttattgatta 2760 tgccgacgcc agctatcttg atctaaccga ggcccagcta aataaacttg agcggcttca 2820 aaattttatc ataaggttta tatttggttt acgcaaatat gaccacgtct ctgagtatcg 2880 agttaagctc aagtggcttc ctattcgtta tcgccggaat gttcatatat tgtctcttct 2940 ttacagtgtc ctgtttaatt cctcgtcccc gcgctacctt caggaacgtt ttgatttcct 3000 acatactgca agctctcatt cactcaggtc atcggagaat ttaatgctga agtttccggc 3060 tcacgggacg tcattttatg accattcctt tgcggttgaa gctgtccgtt tatggaatgc 3120 gttaccctta agcgttaaac aatcgtcctc gccggaaatt ttcaagtcca gattgaaatc 3180 gcactacttt gcttctatga tataactgta taagttccta cttcttttgt attttatctt 3240 gtatattttt agtatttatt tgtatttaat cgtatgtaag tttatcttat atggtattta 3300 tttttcacat gtgtatttaa gtatatattt ttatgtaagt ttattaagat atagcaccgt 3360 ccagttaact tattcctctc cctgcaaggt tgcctggaag agatcgcttt cagcgataag 3420 gccgccgatt tatataccga cttttgtttt attactttac tgtgtacata ttaaggtgtt 3480 atataaataa agagttatta ttattattat tata 3514 // ID Gypsy-39_OD-LTR repbase; DNA; INV; 149 BP. XX AC CABV01002763; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_OD_; KW Gypsy-39_OD-I; Gypsy-39_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-149 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002763; Positions 1487 1635. XX SQ Sequence 149 BP; 51 A; 22 C; 51 G; 25 T; 0 other; tgtaagagga gcccccgggc gttaagcctg gggctaggtt taaggaagag aggagccggg 60 ggcgaggcag agtgtttctt gtaataaaat ataaaatata aaacccgaga gatttgagaa 120 ggaaagcagg gcagagaagc aggcataca 149 // ID PiggyBac-N1_SM repbase; DNA; INV; 292 BP. XX AC . XX DT 27-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of putative PiggyBac-type family of DE non-autonomous repeats. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW PiggyBac-N1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-292 RA Jurka J.; RT "PiggyB-N1_SM: PiggyBac-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 7(9), 983-983 (2007). XX DR [1] (Consensus) XX CC This element contains numerous identical or nearly identical CC copies in the genome, indicating recent transposition. TSDs: CC TTAA. XX SQ Sequence 292 BP; 76 A; 59 C; 70 G; 87 T; 0 other; cacgttgacg gacaattgcg gctatagccg tcaaagcgaa gttcacaatg tggcatgact 60 gctatagccg tccaaaacga aacggtcgtt tttagacact acggctatag cagcacaaat 120 gtgcacttac tggcgtatac gtagaagcat agtgtcgtgt tttgaaaaca tctttttagg 180 ttgaagttca tatttcacct gttatacgag ttgcagattc agtatgcacg tggttgtgaa 240 gggagtagtg tcaattttcg aacttcgctt gttcctactg tccgtcaacg tg 292 // ID Hoana2 repbase; DNA; INV; 2427 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Drosophila; Hoana2. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-2427 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 255..1121 FT /product="Hoana2_1p" FT /translation="MYVCISILMVKWRNLGTRRRSVTPIATSAENQECDYN FT SSGSEKDVLPNKPSNKVKRSKISHVWKYFKKSDHQKLAKCLDCGKEYKTSG FT NTSNLRDHLKRCHPSLEINKSDCSTPAEVTDRNETASTSSSCRSSMNSVAS FT YFKRAVLYESNSKRKTDIDKALTEMVAKDVQPYNIVENEGFIKYTHVLDPR FT YKLPSKTHLRDVLMSNLYKETSAKLSLIFEEVSDVSITCDLWTSSANASFL FT TVTGHFVHNFELKTASLATKKTVKRYKSLFAKHCRYFAKYFNRLEHFN" XX SQ Sequence 2427 BP; 847 A; 450 C; 462 G; 668 T; 0 other; tagaggtgga gaactatcga tggcgctatc gacactatcg atggttggcc gctatcgagc 60 cgcaatcgat agtgtcagac acgttcgata gtgacccaaa cactatcgat agtgccttcg 120 atagctagcg cctcacatca cgaaaataat ttcaacattc acttgtgaaa atattgaaaa 180 atttcgtata aaattgacgg caagtcaaaa gtaactgata aaaatggatc gctttttaaa 240 aaaaggtatg tgtaatgtat gtatgtataa gtattttgat ggtaaagtgg cgaaatttag 300 gtacacgtcg gaggtccgtt acgcctattg cgacaagtgc agaaaatcaa gagtgcgatt 360 acaacagctc aggctcagaa aaggatgttt tacccaataa accgagtaac aaggtaaaaa 420 gaagtaagat ttcccatgtt tggaaatatt tcaaaaagtc cgaccaccaa aaacttgcca 480 aatgccttga ctgtggcaag gaatataaaa caagcggcaa tacgtcgaac ttgcgcgacc 540 atttaaaaag gtgtcatcct agcttggaaa taaataaatc ggactgtagc acaccagctg 600 aagttacaga taggaacgag actgcgtcta caagcagcag ttgcagatca agtatgaact 660 ctgtggcttc ctacttcaaa agagctgtct tgtacgagtc gaattccaag cgaaagaccg 720 acatcgacaa agcattaaca gaaatggttg ctaaggatgt acagccgtac aacattgtcg 780 aaaacgaagg ttttattaaa tacactcatg tattggatcc tcgctataaa ttgccaagca 840 agacccactt gcgtgatgtg cttatgtcca atctgtacaa ggaaacctct gccaaattgt 900 cattgatatt tgaagaagtt tcggatgtat caataacatg tgatttgtgg acatcgagtg 960 ccaatgccag ttttttgaca gtgacgggtc attttgttca taactttgaa ctaaaaacag 1020 cgtctttggc tacaaaaaaa actgttaaac gttacaaatc actgttcgca aaacattgca 1080 gatactttgc gaaatatttt aatcgattgg aacattttaa ctaagacggt gtgcattgta 1140 actgataatg ccagctccat gcttaaagca tgtgaaatgt tgcaaattca aaatttgcca 1200 tgctttgcgc acacaatcaa cttagttgtg gaagatgcat taaaagttga tgacacagtc 1260 ataagggatt tgtttactaa gtgcaaatca atagtgagat ttttcaaaca aagcacgatt 1320 gctaacgaaa aatttaaatt agcacaagaa ggcacgactt atactttgct gcaagaaata 1380 ccaacaagat ggaacagttt ctttttcatg attgaaagaa ttttaaaaac aaacgatgcc 1440 attgccaaag tcctattagg aacaacaaat gcacctcagc catttactgc tgaggaaata 1500 cttgtgctaa aggatataga aaaattgttg tcttttttcc agcaagcaag cgaaaaaatt 1560 tctggtggga agtatgtaac tatatcacta atcataccca tggcatatgg acttttccgc 1620 aaaatcgaaa gtttttcacc tatgctaaac actttgcaag gaaaaattat ccaaaatata 1680 ttaatggaat caataaagaa acgtctttcc atttatgagc aaaggacatt atgtaggatg 1740 gcaacattgt tggacccacg ttttaaaaaa aatggatttc tacacgcttc aaacgctgaa 1800 caaccagcag tcttttttga aaacgaactg gcaaattctt caatcaaata gacctcaaat 1860 aatctgtcca tttcaccgga cacaagtgcc caggagacat tatttgattt tttgtgtgag 1920 cgatcaaata gcaaagtcag aaacgcacaa ctggatgcta ttcttattaa aaggcaatat 1980 ctggagaggg caattgccag tcaagatgtg gatccactat tatggattaa ggtataattc 2040 gaaaacaaat tataagttaa caatatttac atacattact tttattttag gcgaatcaaa 2100 ctgattttcc ctgcataaaa cgactgtttt gcaaatacct ttgcattcct gctacatccg 2160 tagaatcgga aagagcattc agcaaggcag ggcaaattgt ttccgagaga cgcacccggc 2220 tcaaagaaga aaatattaac atacttttat ttttaaatca aaacttctgg ttaaaataaa 2280 acaattttct taaaaactta aaaaaaaact tgtttgtcgg acgatagtat cggtgactat 2340 cgatggcact atcgacacta tcgagggttt gagaaaaaca atcgccggct atcgatggcg 2400 ccgactatcg atagttctcc acctcta 2427 // ID Gypsy-246_AA-I repbase; DNA; INV; 4416 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-246_AA_; KW Gypsy-246_AA-LTR; Gypsy-246_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4416 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1093-1093 (2011). XX DR [1] (Consensus) XX CC Positions [1841-2332] - Reverse transcriptase CC Positions [3453-3929] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 116..2221 FT /product="Gypsy-246_AA-I_1p" FT /translation="MSETERVLEEISKSVAESVVGVPNSQEEAGGSTSGQT FT SLPRSGAKEESDSESELFRSPTGSRRLSFEELLHKDPRYKKRTMDDAAMKQ FT LIAALTGIAQGSAQRRFDVRDVKDLVVQFDPDIPTTPTAEQWVDSIVKAAA FT LYQGNDEWKLQCGILNLNGAAKLWFTGVAVNTWDEFKTALIRDFPTSVDAV FT GIHQAMINRKKLPHESLETYFYSHVALGRKGKLADEAIIKYIVLGLEGRFG FT TITQVSTLPELLKQLKWLAEVRDLKPTEGHVRPSILKSAKSSGALSANIKC FT YRCNGEGHVAANCSVKSAGKSHANFECYRCNQKGHIAKNCSKTTPKPFSRP FT MQEIRQPSNFVKTVIVGSSEIDALYDCGSAVTTIKESCAGILNCVEPCNIE FT LVGFGGNKVQVRGKSAEVINLDGIVVECDVCVVPNKVQANSMIIGRDILDR FT EDIRFVKEKGSVRIEKIAVQHNDERQSGPSSDKSGTQNNVYSIRAYEPIVA FT EEINVDSVGEEREKIFELIKCYRPCFAKNYLEMGTARDCEMVIELIGAEKP FT IHTKQYPLEYSREKVVETIVDDLLAANIIRPSNSPYNSPTVLVRKKNGDWR FT MVVDYRAVNARTVKDSWPMPVIEDCLNRLVGDRLFTAVDLFRGYHQIPIAE FT NSRKFTAFSTPFGHFEYVKMPFGLSNGSAVFQRKDDRYGDRTPSFDRDYSV FT SG" FT CDS 2166..4364 FT /product="Gypsy-246_AA-I_2p" FT /translation="MIDTVIAPLRSIGIIAYLDDAILGGRDVDDVLRKFEA FT LLKRLMEFGLTVNLEKTQFLKVSLDFLGHEVSEGEIRPGKEKIRAISDFPQ FT PVNVRNIREFLGLANYFRRFVKEFSIIAEPLTRLTKKDEPFTWEEEQETAF FT GKLKEALVNQPVVVMYDPARNLEVHTDACSHGLSGILLQQMEDGLHPISYF FT SRKTSPVECLKYSYELEALAVVESVERYRKYLLGRHFVVVTDCEAVKKTIA FT KKVMLPVVGKWLLKLQEYDFDLVHRKGEKMQHADCLSRNPVLEPESEEPEP FT VMAHVMEVNIKEDEWLKLLQREDPKLFEMMKILSKPPVENREKQIHKEYVL FT KDEGVMRKCIDGLKWVVPTRARWRIARNYHDDLGHKGVEKVLEAVKRCFWF FT RRMRSYLKRYIGSCVHCAYVKAKGGAKEGLLHPIEKIAVPFDTIHLDHIGP FT FIRSSALNEHIIVLVDGFTKYVVLKAVRNTKTSPVIQMLNDLFAIFGKPRR FT IVTDRGTAYTSKDFEKFCVDLGIVHVKVAVGSPRANGQVERENRNVLDAVR FT CMVKENDKSWDKHLRMIQWGLNTMINDSTKVSPHTLLFNYNPRDIMQNQLL FT MMFAAEVPAEANADNLKREVIGRIEKEQRKQKHYFDKSRRPARQYLEGELV FT LVEKDPQATGLSKKLEPKYKGPYLVERALGNDRYLIVDVPGVQLKQKKTST FT IFAADRMKPWSVNASLEDSDDDNMSSEDSDETM" XX SQ Sequence 4416 BP; 1270 A; 861 C; 1215 G; 1070 T; 0 other; attctgtaga caggatcgat tttgttacgg gatcgcggtt agcatttgag catgcgtgac 60 taagttaatc gtgggcgcga aaagcaattt tagaatcggt gagactttcg gtgtgatgag 120 cgagaccgaa agggtgttag aagaaatttc taagagcgtg gccgaaagtg tggttggcgt 180 gcctaatagc caggaggagg ctggaggcag tacatcaggc cagacaagtt tgccgcgaag 240 tggtgctaaa gaagaatccg attctgaaag tgagctattt cgatccccaa ccggttcacg 300 acgtctcagt tttgaagaac tgttgcataa ggatccgcgt tacaaaaagc gaacaatgga 360 tgacgccgcg atgaaacaat tgatcgctgc ccttactggt attgctcagg gaagcgctca 420 gcgtcggttt gatgtgcgtg atgtgaagga tctagtagtg caatttgatc ccgatatacc 480 cactacaccg acagctgaac aatgggttga ttctattgtg aaggcggcag ctctatacca 540 gggtaacgac gaatggaagc tccaatgtgg aatcttgaac ctaaatggcg cggccaagct 600 atggttcact ggagtggctg tgaatacgtg ggatgaattt aaaacagctt tgattcgcga 660 ttttcctacg tccgttgacg ctgtcgggat tcatcaagcg atgattaacc gaaagaaatt 720 gccacacgaa agcctggaga cttacttcta tagtcatgtc gcattaggta gaaaggggaa 780 gttagctgat gaggccataa tcaaatacat tgttctgggc ttggaaggga gattcggaac 840 cataacccaa gtgagtacac tgccagaatt gctcaagcag ctaaaatggt tggcagaggt 900 tcgagatcta aaacctaccg aggggcatgt gcgtccttcc atcttgaaat ctgccaagtc 960 ctctggagcg ctgagtgcca atatcaagtg ctatcgttgc aacggcgaag gtcatgtggc 1020 tgccaattgt agtgtgaaga gtgccgggaa atctcacgca aactttgagt gctatcggtg 1080 taatcagaag ggtcatatcg cgaaaaactg cagtaaaact actccgaaac cgttttctcg 1140 acccatgcaa gaaatccgtc agccaagcaa tttcgtgaaa acagtgatcg tcggaagttc 1200 tgaaatagat gcactttacg attgtggttc tgcggtaact acgatcaaag agagctgcgc 1260 tggtatactg aactgtgttg agccgtgcaa tatcgaacta gtagggttcg gtggtaataa 1320 agtgcaagtt cggggaaaga gtgccgaagt gatcaatctt gatgggattg tagtagagtg 1380 cgacgtttgt gtagttccca acaaggttca agcgaactcg atgataatcg gaagggacat 1440 tctcgatcgt gaagatattc gtttcgtgaa ggagaagggt agtgtgcgaa ttgagaagat 1500 tgcggtgcaa cacaacgatg agaggcagag tggaccatcg tcggataaga gtggtacaca 1560 aaataatgtg tacagcatcc gagcctatga acctatcgtt gctgaagaga taaacgtgga 1620 tagtgttggt gaagagcgag agaaaatttt tgaactgatc aagtgctatc gaccatgttt 1680 cgcaaaaaac tacctagaaa tgggtaccgc aagagattgt gagatggtga ttgaacttat 1740 cggagccgaa aagccgatac atacaaaaca gtatccattg gagtactcgc gagaaaaagt 1800 ggtggagact attgtagatg atttgttagc agcgaatata attcgtccgt ctaactcccc 1860 atacaacagc ccaacagtgt tagtgcgtaa gaagaacggc gactggcgca tggtagtcga 1920 ttatcgggct gttaacgcta gaacggtgaa ggattcttgg ccgatgccag ttatcgagga 1980 ttgcctcaac cgattagttg gcgatcgact gttcaccgct gtggacttat ttcgaggcta 2040 tcatcaaatt ccaatagcgg agaatagcag aaagtttact gccttctcga cgccttttgg 2100 acatttcgag tacgtgaaaa tgccctttgg actaagtaac ggtagtgcag tgttccagcg 2160 aaaggatgat cgatacggtg atcgcacccc ttcgttcgat cgggattata gcgtatctgg 2220 atgatgcaat tctcggtggg cgtgatgtag atgatgtgtt gcgcaaattt gaagctctgt 2280 tgaagcggtt gatggagttt ggtctaacag tcaatttgga gaagactcag tttctgaaag 2340 tcagcctcga ctttctgggc catgaagtga gcgaaggtga aatacgaccg ggaaaagaaa 2400 agattcgtgc tatcagtgat tttccgcaac ctgtgaatgt gcgcaatatt cgcgagtttc 2460 ttggcttagc caattacttt cgtcggttcg tgaaggaatt cagtattatc gccgaaccat 2520 taacacgact tacgaagaaa gatgaaccgt tcacgtggga agaagaacaa gaaacagctt 2580 tcgggaagtt aaaagaagct cttgtgaatc agccggtcgt agtcatgtac gaccctgctc 2640 gaaatctaga agtgcataca gacgcttgtt cgcacgggct gtccggaata ctgttacagc 2700 aaatggaaga tggtctgcat cccataagct actttagccg aaaaacttcg ccagttgaat 2760 gcctgaagta tagttatgaa ctagaggcac tcgcggttgt tgaatcagtg gaaagatacc 2820 gtaaatactt gctgggaaga cactttgtgg ttgtgacgga ctgcgaagcc gtgaagaaaa 2880 cgatagcgaa gaaagtgatg ttaccggtag ttggcaagtg gttgctgaaa ctccaagagt 2940 acgattttga tttagtgcat cgaaaggggg aaaagatgca acatgccgat tgcctcagtc 3000 gaaaccccgt tcttgaaccg gagtccgaag aaccggagcc ggtgatggct catgtgatgg 3060 aagtgaacat caaagaagac gaatggttga agttactgca aagagaagat cctaaactgt 3120 tcgagatgat gaagattttg tcaaaaccac cggtcgagaa ccgggaaaag cagatccata 3180 aggaatacgt tctgaaggac gagggtgtga tgagaaaatg tatcgatggg ctgaagtggg 3240 tggtgccaac cagggcgcgc tggcgaatcg cccgcaatta tcacgacgat cttggacata 3300 aaggagtgga gaaagtgcta gaagctgtga agaggtgttt ttggttccga aggatgcgta 3360 gctacctgaa gcgctacatc ggatcgtgtg ttcactgtgc ctacgtgaaa gcgaagggag 3420 gcgcaaagga gggcttgttg caccctatag aaaaaatcgc agtgccattt gatacgattc 3480 atttggacca tatcggtccg tttattcgtt cgtcagcctt gaacgaacac attatcgtgc 3540 tcgttgacgg tttcaccaag tatgtggtac taaaagcggt gcgcaacacg aaaacatcac 3600 cagtgattca gatgctgaac gatttgtttg cgatttttgg taagccgagg cgtattgtca 3660 ctgaccgagg aactgcgtac acctccaagg acttcgaaaa attctgtgtt gatctcggaa 3720 tagtacatgt gaaggtggct gttggcagtc cacgcgcaaa cggacaagtt gaacgcgaga 3780 atcgcaacgt actcgatgct gtgcgctgta tggtgaaaga aaatgacaaa tcatgggata 3840 agcatcttcg tatgattcag tggggtctga acaccatgat caatgattcg accaaagttt 3900 ctccacatac gctgctattc aactataacc cacgggacat tatgcagaac cagctgctga 3960 tgatgttcgc tgcagaagtc ccagcagagg cgaacgcaga taatttgaaa cgtgaagtga 4020 ttggtcggat cgagaaggag caacggaaac agaagcatta tttcgacaag tcacggcgtc 4080 cagcaaggca atatctggaa ggcgaactag ttttggtgga gaaggatccg caagcgacag 4140 gtttgagcaa gaaactagaa ccgaagtaca aggggccgta cctcgttgag agggccctag 4200 gcaatgaccg ttatttgatc gttgatgttc ctggagtgca gttgaagcag aagaagacat 4260 ccactatttt cgctgccgat cgaatgaagc cttggtccgt caatgcatca ttggaggaca 4320 gcgatgacga caatatgagc tccgaagact ccgacgaaac aatgtaatat cgccggggcg 4380 ggatataacc tgtggtgtcc gttataacag atatta 4416 // ID Copia8-NVi_I repbase; DNA; INV; 6024 BP. XX AC AAZX01003863; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia8-NVi; KW Copia8-NVi_I; Copia8-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6024 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1114-1114 (2007). XX DR Genome; AAZX01003863; Positions 15271 9248. XX CC Positions [3359-3874] - Integrase core CC 'GTAGG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 670..3387 FT /product="Copia8-NVi_I_1p" FT /translation="MLINSTGVDQEEFRRIILREWARLEANEAYGDKYHRD FT TVTLLRKKDGSSYFLRSELQLRRFDEVLSETERAVIDRVWPAVKKQLVFQL FT RYSVDSITRSEFMWMMLEQNKMFLTGIEMLVKGLTMPGFGQQLPYFDQSPP FT AKPLSVEPRGPEPPKSTVTVQPKKPAALVEETEPVEPRNAFLDIRKKPRIK FT GIENFPKAASTPFGRPQTVRCTELIDMSINMREPVKRSAERRREPSEGGEL FT SNNSEDEAQETRESNKSDTPVEISMFDPYRASGSQQMQPPPDQEHRSKFPR FT VSVRQNNCCQFVEQSNNVPLNVVNELVENKIKVLLLERDKEIEKSSNDRSF FT LNLSEQLKNEFIGRENLGSRTYKLTVKTNFEMFEDYLKSELRNKKLHYILD FT ESVSMNSTVNESKRLDDEFKVREIIINRIDAIYYNKIVNLTDPKLILRKLK FT DVKRYETRTTRITARRDLMTIKYLPSKETASDFYDIFQEKVRIFENIPDAG FT KLPENELKDYFLQAVAEAVPEITTADQLFKEATSTEMTCERLKNYLLQVGS FT TKSRKMPSQPPVRAFVHSSQGTGRLICRGCGMEGHTKTNCPNEGLRQCYYC FT LKLGTHLAHECNKRKADAEKGQATATSYGPPSKRGRWTGQGSRSTASRGRT FT SKMAASGRASVAPANRGARGARGASTRGRGRGRGGPQRGVGYSRLQQGTSL FT QAIAEEYQGNKANCGNKELLEFIADSDATEHLSRSKLYFDKFDSRAITKVD FT CANKDSTFRTDDSGSTEVVTEDNNTFTLNNILYSKNLSYNLLSLRRFVDRG FT LEIYLNNKIIDIYNPELDKSILTREYKKPFWIIKLVLSKDTATKTNPKIMY FT NTRSKKNSVKINDSDVNKSVSGRSSVDKTREPNDVETDFLQTSSNRKHTQN FT RD" FT CDS 4442..5362 FT /product="Copia8-NVi_I_2p" FT /translation="MINLGPNIIDSRWVFKRKTNEQGETKFKGRLVIRGFK FT DKNVYDLRETYAPVSRMSLIRAFFSIANKLNYTIRQLDVETAFLYGDLSED FT IYMEVLEGVQITSDTRKRFLWKLNKSLYGLKISPKKWNDKFSSVMTKLGFT FT SNDIDPYLYIKHSGVDTILVVLYVDDILLASSNNEVLNKLSQNLIAEFKIK FT DLGTPKEFLGIKIERDQTKRVIKLSLYETIFMLSRFGFESCRPLSTPMRQL FT EVVSHDRKEREETENTLAESGKEQIPNKLYREAVGSLLYLANATRPDISYA FT VNVLSRHQVNPTIQE" XX SQ Sequence 6024 BP; 2074 A; 1117 C; 1369 G; 1464 T; 0 other; ggttatgggc cgggtgagcc acgcttccac tacgcaaaaa taaagtgcgt gacacgcgtc 60 tagagacgct ggagtgtttt tagctcacga gagcaagtga aatgtcaggt ggcaaaaggc 120 tgtgtatacg ctctttacaa cgataatcgg aaatagaaaa tgcttcgaag caagagattt 180 taaatcctca tatcagtaac cgttgagtag catgtttaat agtgttcttt aaaaacctgt 240 gttgcaaaat aaatacggac tgcagttcat gaaaaatcgc attttgtgag ggccccacat 300 aatgcgtatc cggctttcgc ttaaattcaa gcaagctgta ataaatcaat tcaaaagtca 360 acacagtagc tagaacaaga aagatagaca agagtcaggt gaaagtaaga ttcagaaaag 420 aacatcaacc aaagaaacag gttaatctta caatatgcca agaccgcgcg attagaatcg 480 cctcacttga ttgggagagc aagagcatgc gacaagagtc gctcctcaag agaggaaagt 540 caaaagatat ttgctgatag aaactctgag tcaaagtcgt aattgaacaa ttatttttat 600 tgttttaact atctaataca tggatatctg gaataatatt ttgtatatac atacaaacac 660 aacacttata tgctaattaa ctcaacaggt gttgatcaag aggaattccg tcgcatcatc 720 ctccgggaat gggcgaggtt ggaagccaac gaggcgtatg gtgacaaata ccaccgagac 780 accgtcaccc tgctaaggaa gaaggacggc tcctcgtatt ttttaaggag cgaactccag 840 ctgcgcaggt tcgacgaggt cctgtcggaa accgagagag cggtcattga ccgcgtgtgg 900 ccggctgtaa agaagcaact cgtgttccag ctgaggtact cggtggactc catcaccagg 960 agtgagttca tgtggatgat gctggaacaa aacaagatgt tcttaaccgg gattgagatg 1020 ctggtgaagg gtctcaccat gccaggcttt ggacagcagc tgccctactt cgatcaaagc 1080 cctcctgcga aaccactgtc agttgagccc cgtggtccgg agccacccaa atctacggtg 1140 actgtccagc caaagaagcc tgcagctctg gtcgaagaaa ccgagccagt ggagcccagg 1200 aatgcctttc tagacatcag gaaaaagccc cgaatcaagg gcatcgaaaa cttcccgaag 1260 gcagcctcga ccccgttcgg caggccgcag acggtgcggt gcacagaact gattgatatg 1320 agtatcaata tgcgcgagcc tgtcaaaagg tcggcagaaa gaaggcggga gccttccgag 1380 ggcggagaac tttccaacaa ctcggaggac gaggctcagg agactcggga gtcgaacaag 1440 agtgacacgc cggtggaaat ttccatgttc gacccctaca gggcttctgg gtcgcagcag 1500 atgcagccgc cacctgatca agaacacagg tcaaaatttc cccgggtatc ggtaagacaa 1560 aataattgtt gtcagtttgt agaacagtcc aataacgttc ccttgaacgt agttaacgaa 1620 ctagtagaaa acaagataaa ggtcttgctt ctcgaaagag ataaagagat tgagaaatca 1680 tcgaatgata gaagtttttt gaatttgtcg gaacaattga aaaatgaatt tataggcaga 1740 gaaaatctag gttctagaac ttataaactc acggtaaaaa ctaacttcga aatgttcgaa 1800 gactacttga agtccgaact aagaaacaag aaactgcact acattctgga tgagagtgtg 1860 agtatgaact ctactgtaaa tgagtcaaaa agattggatg atgagtttaa agttagggaa 1920 attataatca atagaataga cgcaatttat tacaacaaga ttgttaattt gactgatccg 1980 aaactgattc ttagaaagct gaaggatgta aaaagatacg aaactcgaac aactagaatc 2040 acagctagac gagacttgat gacaatcaaa tatttacctt cgaaagagac tgcgtctgat 2100 ttctacgata tattccagga gaaagtcaga atatttgaaa atattccaga cgcgggaaaa 2160 cttccggaaa acgagttaaa agattatttc cttcaagcag ttgctgaagc agttccggaa 2220 attaccactg cagatcaatt gtttaaagaa gctacgtcta cggaaatgac atgcgagaga 2280 ctaaaaaatt atttattgca ggttggatcc accaagagtc ggaaaatgcc ctcccaaccg 2340 ccagttagag cgttcgtgca ctcaagtcaa ggcactggcc ggctgatctg cagaggttgt 2400 ggaatggagg gtcacaccaa gaccaactgt cccaacgaag gcttgaggca gtgttattac 2460 tgcctcaaac tcggtactca tcttgcacac gagtgcaaca agcgtaaagc agatgcggag 2520 aagggacagg caactgcgac cagctacggc ccaccctcaa agagagggag gtggacaggc 2580 caagggagcc ggtctactgc gagcagaggc cggacatcta agatggcagc gtctggccga 2640 gcctcagtgg ctcctgccaa ccgtggagcg agaggcgcca gaggcgcatc aacccgaggc 2700 agaggccgtg gacgtggcgg accacagagg ggtgtaggct actctaggct tcagcagggg 2760 acatcgctcc aagccattgc cgaggaatac caaggtaaca aagcgaactg cggtaataaa 2820 gaattgctag aatttatagc ggactcagat gcgaccgagc atttgagcag atcaaagttg 2880 tacttcgaca aatttgactc tcgagcgatt actaaggtag attgtgcgaa caaagatagc 2940 acttttagga ctgacgatag tgggtcaact gaagttgtta cagaggataa taataccttt 3000 accttaaaca acattttgta ttcaaaaaat ctgtcataca atctgttatc gttgagacgt 3060 ttcgtagatc ggggacttga gatttacttg aataacaaaa tcattgatat ttataatcca 3120 gaactagata aaagcatttt gacaagagag tacaagaaac cattttggat tattaaattg 3180 gttttaagta aagatacagc aactaaaact aatcctaaaa tcatgtataa cacaagaagt 3240 aaaaagaact ctgtcaaaat aaacgactcc gatgtaaaca aatcagtatc gggcaggtcc 3300 agtgtagaca agactaggga acctaatgat gtggaaacag attttttaca aacatctagt 3360 aacagaaaac acacacaaaa cagagactag tcaggtcgat aaaatagatg tgaataaaaa 3420 tcacaataaa cctagagagg aaatttcaga agccatgtta tggcatatca gactaggaca 3480 tgtatctaag aaatatttac ttgtgttagc caagcaaaat gataaattaa taaatattga 3540 aaatataaac aaagataaga ccatacaaga tgaaaaattc tattatctaa gatgcgacag 3600 ggggacggag tttgtttgtc aggctactag agaggtctta ggaaaatacg gtgcagaatt 3660 gcaactggct tgtccagata caccacaata caacggtgtt gctgaaaggt ttaatcgaac 3720 gctagagaat aaagtaaaag tcatgatgct ggactctggc ttaccgcatt cgtattggga 3780 tctagctatt aaaacagctg tctatattta taataggaca tcacacaagt cagtagacat 3840 ggtggcacct ttgagtaaaa tcttatcaaa tcagtctgaa tgcgtgagtc aggtcaagcg 3900 ttttatgcgc agcatatttt aagatagccc gcagcacaga aaccaagttt tcgcctcaag 3960 caaagcaggg atttttaatt ggatatctga gtacaggaaa catgatactt gtaccgagcg 4020 aaaataaatt atatgattgt aaacatgtta ggtttgtaga aaatttaacg tataaagact 4080 ttcaaagtaa aagtatagaa aacgatcaat cagaattaaa atttgaaaag gaatcagact 4140 ccgagactga gaaagagaga cagtagttga aataaaatcg aaaagaggta gaccgaaaaa 4200 gaatcccgta gcaatgttct ttttaactga agatcctgag atacaagaat ttgattttga 4260 tagagaattt agcgatctta aatatcatgc tcttctagct aaaatcttag gtgacccata 4320 aacgtttaaa caggctgtaa actctccaga gagagaacac tggttgggag caattaaatc 4380 agacctagat tctattaaag aaaaacaagt atacactgta gtagaaagat ctaaagtatc 4440 tatgattaac ttaggaccaa atataatcga ctccagatgg gtctttaagc gaaaaacaaa 4500 tgaacagggc gaaactaaat tcaaaggtcg attagttatt cgaggtttta aagataaaaa 4560 tgtgtatgac ttacgtgaaa catacgctcc agtatccagg atgtcactga taagagcttt 4620 cttctcgata gcgaataaat taaattacac gattagacaa ctcgatgttg aaactgcgtt 4680 cttgtacggt gacctttcag aggacattta tatggaagtt ctggagggag tacagattac 4740 cagtgatacc agaaaaagat ttctatggaa actaaacaag tcactttatg gattaaagat 4800 aagtcccaag aagtggaatg ataaattttc aagcgtaatg actaagctag gctttacatc 4860 aaatgatata gatccgtact tatacattaa acactcaggt gtagatacaa tccttgttgt 4920 attgtatgtt gacgacatac tgttagctag ttcaaacaac gaagtcttga acaagttgag 4980 ccaaaattta attgctgaat ttaaaatcaa agatctcggt acccccaaag aatttttagg 5040 gatcaaaata gaaagagatc aaaccaagcg agttatcaag ttgagcctat atgagactat 5100 attcatgttg agtagatttg gctttgaaag ctgcaggccg cttagtacac caatgaggca 5160 acttgaggtt gttagccatg atagaaaaga gagggaggaa actgagaata cactggctga 5220 atcaggaaaa gagcaaattc ctaataaact gtacagagaa gctgtcggat ctctacttta 5280 tttggcaaac gcaacgaggc cagacatttc gtatgcggta aatgtactta gtaggcatca 5340 ggtgaaccct actatacagg aatagaatat ggttaagaga gtatttcaat acttgtcagg 5400 aacacgaaac tatgagttaa cattcaatcg aaccgatgag gggctgataa cttactctga 5460 cgctagctta gcagactgta aaagttctct gaccacgtgt ggctacgtta ttcgactgtt 5520 cggtaacaca gtggcgtgga gaacgcacaa gcagcagagc gttgctttgt ccacttgtca 5580 atcagagtat gtcgccatga gcgaaacctg tcaggagtgt atgtccttac ataactcagt 5640 agcaattatg ctagagcgaa acttatatcc actgacattg tgctgcgaca acatggcagc 5700 catttcttgc gctaaagtaa atggtggtaa tcggttaaga cacatggtcg aaagaaggga 5760 acattatgta aaagaatgtg taaataagaa ttacattaag atagaatgga tcaaatcaaa 5820 gagtcaatta gctgacatat ttacaaaggc tttacccaag caattacatt atgaactatt 5880 gaaaatcatt tttaacatat aattaataga ttttgtgttc attttagagt ttcctgaaga 5940 gaccgatggc gaggcagaag ttgcagagcc atccataaca gaggagccct ggaactgagt 6000 gggaggtcct caagtgagag ggag 6024 // ID Gypsy-599_AA-I repbase; DNA; INV; 4376 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-599_AA_; KW Gypsy-599_AA-LTR; Ty3_gypsy_Ele67; Gypsy-599_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4376 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3329-3793] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 83..4363 FT /product="Gypsy-599_AA-I_1p" FT /translation="MSQPQAPSSLAGSGEQPFKDTILQILSNQQALMTKLS FT QHVAAIEGNIQNANRNELILDSLATNITEFAYDLEKGYSFDAWFSRYADLF FT EKDAAQLEDDAKVRLLLRKLSPSAHERYTSFILPKLSKEFSFEETVAKLRT FT IFGTPVSTFHRRYQCLQTTKDEDEDFISYSCKVNKACVDFKLQELKEDQFK FT CLIFVCGLKSPKDSDIRMRLLSRMNETSDMTLEKIVEECKSLINLKQDTVL FT IGGKPPTPAVAATNAVRTHSQPNRKEKSNRSKDQAPKTPCWSCGGMHFANH FT CNFKDHKCRDCGRTGHKEGYCSCFSSKSRPKSGKGKPANRNKPSSKIVVVK FT NVTRSRRYVETTINGVPVNLQLDSGSDITIISRQNWLKIGAPKTSPPDCEV FT QTASGDKLGIAAMFRACISISGDQREGNCYVCRTNLSLNVLGSDLLDKFGL FT WDVPFSSFCKLVDAKQEDHQVAALKAKYPTVFTDQLGLCSKMQVHLSLKKD FT VQPVFKPKRPVSYNMEAVVEDELKRLQDKGIITPITYADWAAPIVVVRKPD FT RTVRICADFSTGLNNALEANNYPLPLPEDIFNRMANCTVFSHIDLSDAYLQ FT VQVDDESRKLININTHKGLYQFNRLSPGIKSAPGAFQQIMDAMLAGLDCTC FT PYLDDVLVGGRTEEEHRKNLYKVLERLQEYGFTVKIDKCRFFMRQVNYLGQ FT LLDKEGIRPDPEKVKAIVNMPPPSDVSTLRSYLGAINYYGKYIREMRKLRH FT PLDELLKQGTSFEWTDECQRSFDRFKEILQSPLMLTHYNPRLDIVVSADAS FT NVGIGARIAHRFPDGSEKAVYHASRSLTPAESRYSQIEKEALGLVYAVTKF FT HRMIYGRRFTLQTDHKPLLAIFGSKQGIPPYTANRLQRWALTMLLYDFQIE FT YISTDHFGHADILSRLINSHIKPDEDFVIASIEIEKVICNVVGQSIEHLPV FT SYKMIAEETAQDEALRKVTSFIRKGWPTEATSEHSSVQQFFARRDSLYEAQ FT KVLMYGDRVVIPKKLQQKVLQQLHKGHPGIERMRSLARNFVYWPNIDDHVT FT ALVRNCHECAAAAKTDTKTKLQSWPVPEKPWQRVHADFAGPVDDTYYLLLV FT DALSKWPEVVPTKRITTAATIAILRKIFSRFGMPEVLVSDNGAQLTSDAFE FT RFCESNGIMHLKIAPFHPQSNGLAERFVDTFKRTIKKIQAGGEDIDQAIDT FT FLLCYRSTPSRNAPGGKSPAEILFGKPLRTSFELMKPPSKFFKDVNSKQDS FT QYNEKHGAKAKYFDVKEQVYAQVHHGNEWSWIPGEVVERIGQVMYNIWLPD FT RKRLIRCHSNQLRNRYGSKSNPEVTNPDVPLDILLDTWSLNPTNSEAAVEA FT VEPPTEPDGLNELQLEFLRNLMQPDQQQRRQRTPVRQPELDEVIQRRSSRQ FT RRAPIRYEPYQLY" XX SQ Sequence 4376 BP; 1280 A; 1061 C; 1050 G; 985 T; 0 other; aagtggcgac gagtctgcgg aagttcgcgg agtaccgaat ttcgggtcga aagtatcgcg 60 ccgccgaaaa ccgcgtgtga gaatgtcgca gccacaggcg ccatcttcgt tggccggcag 120 tggtgaacaa ccattcaaag acacaatttt gcaaattctc agcaatcagc aggcgctgat 180 gacaaagctg tcgcagcatg tggcggcaat cgaaggaaat atccaaaacg caaaccgcaa 240 cgagttgata ttggattcct tagcgacgaa tatcacggaa tttgcatacg accttgaaaa 300 aggttactcg ttcgatgctt ggttttcacg gtatgcggat ctttttgaga aggatgccgc 360 ccagctggag gacgatgcca aggtgcgctt gttactgcga aaactgagtc cttctgcgca 420 cgaacggtac acgtcattca tcctccccaa actgtcgaag gagttttcgt ttgaggaaac 480 tgtagccaaa ctgaggacca tcttcggtac accggtatct acgttccatc gacgttatca 540 gtgccttcag acgacgaagg acgaagatga ggattttatc agctactcgt gtaaagtcaa 600 caaagcatgt gttgatttta agctccaaga gctaaaggaa gatcaattca aatgtcttat 660 cttcgtatgt gggctcaaat cacctaagga ttcggacata cgaatgcgcc ttctttcgag 720 aatgaacgag accagcgata tgaccctgga gaaaatcgtc gaagagtgca agagtctgat 780 caatttgaag caggatacgg tgctcattgg aggaaaaccc cccacgccag ccgtggctgc 840 aacaaatgcc gttcgaacgc actctcagcc aaatcgaaag gagaaatcaa accgttcaaa 900 agaccaggca ccgaagacac cgtgttggtc ttgtggtggc atgcactttg ccaaccactg 960 caacttcaag gatcataagt gtcgagattg cggtagaacg ggccacaagg aaggctattg 1020 cagctgtttt tcgtcaaaat cacgtccaaa gtcaggaaaa ggaaagccag cgaatcggaa 1080 caaaccttca agcaagattg tcgtcgtgaa aaacgtcaca cggagcagaa gatacgtcga 1140 gacaaccatc aacggagttc cggttaacct tcaactagac tccggttctg atatcaccat 1200 tatatccaga caaaactggc tcaaaatcgg agcaccgaaa acatcgccac cggactgtga 1260 agttcagacc gcatctggcg ataaattggg aatcgctgct atgttccgtg cctgtatttc 1320 catcagcggc gatcaaagag aaggtaactg ttacgtatgt agaactaatc tatctcttaa 1380 cgttctaggc tcagacctac tcgataagtt tgggctgtgg gacgtaccat tctcgtcgtt 1440 ctgcaagttg gttgacgcca agcaagaaga ccaccaagtc gctgcattga aagccaagta 1500 cccgaccgtt ttcaccgatc agttggggtt gtgctcaaag atgcaggtgc atttgtcgct 1560 caaaaaagac gttcaaccgg tattcaagcc aaagcgaccg gtttcgtaca acatggaggc 1620 cgtcgttgaa gatgaattga aacgcctgca agataaaggt atcatcacgc caataacgta 1680 cgcagattgg gccgcaccaa ttgtggtggt gcgtaaacca gatcgcaccg ttcgaatttg 1740 cgccgatttt tccaccggat tgaataatgc acttgaggca aacaattacc cattgcctct 1800 cccggaggat attttcaatc gaatggctaa ctgtacagtg ttcagccata tcgatttatc 1860 agatgcgtac cttcaggtcc aggtggatga cgaaagcagg aagttgatca acatcaatac 1920 gcataagggc ctttaccagt tcaaccggtt atcacccggt atcaaaagcg ccccgggtgc 1980 attccaacaa attatggatg cgatgttagc tgggctggac tgcacttgcc catacctcga 2040 cgacgttctc gttgggggtc gcacagagga ggaacatagg aagaacctat ataaagtact 2100 cgaacgcctt caagaatatg ggttcaccgt taagatcgat aaatgtcgat tcttcatgcg 2160 ccaagtgaac tatctggggc aacttctaga caaggagggc atccgccctg atcctgaaaa 2220 ggtgaaagcg atagtgaaca tgccacctcc tagcgatgtc tcgacgctgc ggtcgtactt 2280 aggtgcgata aattattacg ggaagtatat tcgagaaatg cgaaagcttc gccaccctct 2340 ggacgaactt ttgaagcaag gaaccagttt cgaatggaca gatgagtgcc aacggtcgtt 2400 tgatcgattt aaggaaatct tgcaatctcc tctcatgctg acgcactata atcctcgttt 2460 ggatatagta gtatcagccg acgcatcaaa tgtgggcatt ggcgcccgca tagctcatcg 2520 atttccagac ggctccgaaa aggctgtgta tcacgcatca agaagcctca caccagcaga 2580 atcccgatac agccaaattg aaaaggaagc gttgggattg gtttacgcgg tcaccaaatt 2640 tcatcgaatg atctatggaa gacgcttcac tctccagacc gaccataaac cgctgctggc 2700 catttttgga tcgaaacaag gtattccacc atacacagcc aaccggctac agagatgggc 2760 gctcaccatg ttattatacg actttcaaat cgagtatatt tctacggacc attttggtca 2820 cgcagatatt ctgtcacgcc tgataaattc tcacatcaag ccagatgagg atttcgtaat 2880 tgcgtccatc gagatcgaaa aggttatctg caatgttgta ggccagtcca tcgaacatct 2940 tcccgtttca tacaaaatga tagcggaaga gacagcacag gacgaagcac tgcgtaaagt 3000 tacgagcttc atcaggaaag gctggcctac cgaagcaacc agcgagcatt caagcgtaca 3060 acagtttttc gcacgtcgag attcgctgta tgaagcgcag aaggtcctga tgtacggcga 3120 ccgagtggtt atccccaaga agctgcaaca aaaggtgctt cagcagctgc acaagggaca 3180 tccggggatc gaacgaatgc gctccttagc tcgaaatttc gtttattggc caaatatcga 3240 cgaccacgtc accgctctgg ttcgaaattg tcatgaatgc gcagcggcgg ccaaaacaga 3300 caccaaaact aaattgcaat catggccagt gcctgaaaag ccatggcaga gggttcatgc 3360 ggacttcgca ggtcccgtcg acgatactta ctatcttctg ctggttgatg cactgtccaa 3420 atggcccgag gtagttccaa caaagcgaat cacgacagca gccaccatag ctattcttcg 3480 taaaatcttc agccggtttg gtatgccgga agtcttagtt tccgacaacg gagcccaact 3540 aaccagcgat gcatttgaga gattttgcga gtccaacggt atcatgcatc tgaaaattgc 3600 tccattccat ccacagagca atggattagc cgaaaggttt gtcgatacct tcaaaaggac 3660 aattaagaaa attcaagcag gaggggaaga tatagaccaa gccatagaca ctttcttgct 3720 ctgctatcgc tctacaccca gtcgaaatgc gccaggagga aaatcacctg ccgagattct 3780 tttcggcaaa cctcttcgaa catcgtttga gctcatgaaa ccaccaagca aattcttcaa 3840 ggatgtgaac tccaaacaag acagccagta caacgaaaaa catggagcaa aagccaagta 3900 ttttgatgtc aaggaacaag tgtatgcgca agtacaccac ggaaacgagt ggagctggat 3960 accaggagaa gtagtggaac gtattgggca agtcatgtac aacatttggc tgcctgatcg 4020 taaacggctc atccgctgtc acagcaacca gttacgaaac cgctacggca gcaaaagcaa 4080 cccagaagta accaatccag atgttccact cgatattctg ctcgacacgt ggagtttaaa 4140 tccaacaaac tccgaagctg ctgtagaagc agtagaaccg ccaactgagc ctgatggatt 4200 gaacgagtta cagttggaat tcctgcggaa cctgatgcaa ccggatcagc agcaacgtcg 4260 acagagaact ccggttaggc aacctgagtt agatgaagtt attcagcgac gatcatcgag 4320 gcaacgacgt gcaccgatcc gctacgagcc gtatcagctt tattaaacag gggagg 4376 // ID Gypsy-106_AA-I repbase; DNA; INV; 4793 BP. XX AC supercont1.333; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-106_AA_; KW Gypsy-106_AA-LTR; Gypsy-106_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4793 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.333; Positions 567062 562270. XX CC 'GTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 26..1222 FT /product="Gypsy-106_AA-I_1p" FT /translation="MSTTPIPTDSGSQSKSGQKDITALFAELTSSIAVKFE FT QLNDKISNNLDEMRSEVKEFSVELERVKNDVSHLKNVRDFSAAINALQFSD FT NPVYQSSPNASKTDAPTPFPNNEVFGGPSSRVASPQGLTSLADRNQLNQGN FT VNSGETRAMNDLSIVNTGLKVKLTPEVFDGTKRWADYVLTFELTADINHWD FT EALKAKYLAAMLRGPALEVLRSLTPVQRASYVFLKAALEKRFGETFHRALH FT HAQLRSRQQHKGEDFITFADEIRRLVASAYHDCSAVAQDKIAMNHFLDGLG FT DPVVQDLVRFGLPDTLDKAVQLALQFHLSRSATRSNHKPVRLVRDTDAQNS FT SSDSETTSLCEENVYGLRSAGTQTYRSPAHKPTVDDETSIAAKGIRETIGR FT QPKGA" FT CDS 1758..4772 FT /product="Gypsy-106_AA-I_2p" FT /translation="MPMLLEGTLDHPDLFIAKILVCVKNEKVPIRIMNTSQ FT FPVNISLGTVIANCSSVQVIDKTKHVVDLQNDSLHLQSVITKSTKHLDAKQ FT KERFVKLITENHDVFSKGSHDMGRTSLTQHRIDTGNHSPIKIPPRKNPLAK FT RECATQLVHQMQKEGIIEPSQSPWCSPIVLVKKKDGSQRFCVDYRKLNDIT FT KKDSFPLPRIDLTLEALGGSSWFSTIDLQSGFWQVEMDQHDKEKTAFSTGN FT DLWQFKVMPFGLCNSPATFQRLIELVLHGLSWHFCLIYMDDIIVHSKSFEE FT HVVHLNEVFSRLRAANLKLNPAKCAFLQREVAYLGHVVSADGVHTDPEKIK FT SVLEWERPLNKTQVRSFIGLCAYYRKFIRDFSTIARPMHKLMEHHTPFIWD FT EECEQAFERLKQKLLEAPVLAYPWDDTEFVLDTDASYCGIGAVLSQVQEGN FT ERVVAYYSKSLSKAERNYCVTRKELVAVNKAVQHFHHYLYGRHFKLRTDHT FT ALTWLRSFRNPEGQVARWIERLSQYDFTSEHRPGTRHGNADGLSRRPCFDL FT GCKHCSRLERDVTDKAKGDAQPCNRIGAVDDAFKLGQQSDLVLQQVIQWVQ FT SGKRPTFEDTSQLDSTTRSYWTMFDSLLVHNGVLCRNFVGPHNEFLQVIVP FT KKMVQYVMEQIHGGLTGGHYGVAKTLIKVKERFYWVGLSRDVKLWCMNCTV FT CGARKGPSTRSRGELKPILVGEPFRMIGVDILGPFPITERGNRWILVITDY FT FTKWPEAVALPSQTAEVVADALLSTVVSRFGVPQQIHSDQGKNFESEVFGC FT LMDLLGVNKTRTTPLHPQSDGQTERFNRTLLDYLSKFIDKDQKIWDTLLPL FT LSYRASTHESTKFSPAILNLGRKLVLPIDLWRSSSPDALHSGPSYVQKIRE FT NMQRIHEEVRNNLSVAAESMKRRYDYKSNATTFNVGDLVWLFNPQRRIGKS FT PKLQCDWEGPYKILRRISDLVYEIKKDAIGSRKKIIHVDRLASFKGSRDG" XX SQ Sequence 4793 BP; 1440 A; 996 C; 1141 G; 1216 T; 0 other; ctggtgtcag aagtggggtt acgtcatgtc taccacaccg attcctacag actccggctc 60 acagtccaaa tccggtcaga aggacattac tgctttgttc gctgagctca cttcatcgat 120 tgccgtgaaa ttcgaacagc tgaatgataa aatcagcaat aatttggatg aaatgcggtc 180 cgaagtaaag gagttcagtg tagaactgga acgcgttaaa aatgatgtta gccacttgaa 240 gaatgtgcga gattttagtg cagcgataaa cgccttgcag ttttcggata atcctgtcta 300 tcagagctca ccgaacgcct cgaagacaga tgcaccgaca ccttttccaa acaacgaagt 360 atttggaggg ccaagttcga gagttgcatc gcctcagggg ttaacgtcgc ttgcagatag 420 gaatcagtta aaccaaggga acgtaaacag tggtgaaact agagcgatga atgacttatc 480 gattgtcaat accggactca aggtaaagtt aactccggaa gtgtttgatg gtacgaagcg 540 gtgggccgat tacgtattga ctttcgaact aacagccgac atcaaccact gggacgaagc 600 actgaaagca aaatatctag cagctatgct tcgtggccct gctttggagg tgttgcggtc 660 actgacgccg gtccaaagag cgagttatgt ctttttaaag gcagctctag aaaaacggtt 720 cggcgagacg tttcatcgag ctctgcatca tgctcaactg cgatctcgtc agcaacataa 780 aggagaagat tttatcacat tcgcggatga aattcgtcga cttgtagctt ccgcttacca 840 cgattgctcg gcagtcgctc aagacaaaat tgcaatgaat cacttcctcg acggcctcgg 900 tgatccagtt gtgcaagatc tcgtcagatt tgggctgcca gacacattgg ataaagcagt 960 gcagcttgcg ctacaattcc acctttcacg tagtgccact aggtccaacc ataagcctgt 1020 tcgactcgta cgagacacag atgcacaaaa tagtagttcg gattcagaaa ccacgtcttt 1080 gtgcgaagaa aatgtttacg ggttgcgatc ggccggcaca caaacctaca gatcgccggc 1140 acacaaacct acagtagacg acgaaacaag tatcgcagca aaaggaattc gggaaacgat 1200 cggtcgccag cctaaggggg catgagctgg ccgagaacag tggtccccca aacatcttgt 1260 ttgtgcattc actgaatcgt cgacgtggta gttatttagc cgatttacgt attgatggtg 1320 ttccttgtga agctgtcgtg gattgtggag ctgctgtgac aatcataagc cagaagtttt 1380 ggaagctaat tgagaagaat attacccagc aacaatcatc aaaattcatc aagacggtgt 1440 ccggagaact cgttccagtg aatctggaaa ccacagttgt gtttaacttc ggaaggtacg 1500 cagtaacaca tccggtatgg attgttgata tatcggagga ctgcatcatc ggaaacgatt 1560 tcctgagaga acatcaatgt atcatcgatt ttgtgaggaa caagctagag atctctaaaa 1620 atacatccat ccaattgcgt ccagcgtcag gttcaagtaa agaaaccaag gttttccatg 1680 tgacgaccga aacagaatta gaaattcctg cttggagtga aaccattgtc caaggaaaat 1740 gcgaacatcg ttcgagaatg cctatgctct tggaaggtac tttggatcat ccagatttat 1800 tcattgccaa aattctagtc tgcgtgaaga atgaaaaagt gccaataaga ataatgaaca 1860 cctcacaatt tcctgtgaat atatcgctcg gaaccgttat cgccaactgc tcaagtgtcc 1920 aggtaattga taaaactaaa catgttgtgg atttacaaaa tgacagttta catttacagt 1980 ccgtgataac gaaaagtaca aagcatcttg acgcaaaaca aaaggagcga ttcgttaagc 2040 tgattacgga aaaccatgac gttttcagca aaggatctca tgatatggga cgcacgagtt 2100 tgacgcaaca tcgtatagac acggggaatc attcgccgat aaaaatacca cctcgaaaaa 2160 atccgctggc taaaagagaa tgtgcaacac agttggtgca ccaaatgcag aaagagggca 2220 tcattgaacc gtcgcaaagc ccatggtgtt cacctattgt gctggttaag aaaaaggatg 2280 gaagtcaaag gttttgtgtt gactacagga aactgaacga cattactaaa aaagattcct 2340 tccctctccc cagaatcgat ttgacacttg aagctcttgg cggttcgagc tggttttcca 2400 ccattgatct acaaagcggt ttctggcagg tggagatgga ccaacacgac aaagagaaaa 2460 cagctttttc gaccggtaac gatctttggc aatttaaagt tatgccgttt ggtctgtgta 2520 acagccctgc aacattccag agattgatag agctcgttct gcatgggtta tcctggcact 2580 tctgtttaat ttacatggac gacattattg tgcattccaa gagcttcgaa gagcatgttg 2640 tccacctgaa tgaagttttt agtcgcttgc gtgctgctaa tttgaagctt aacccagcta 2700 aatgtgcgtt tttgcaacgg gaagtagcct accttggtca cgtagtatca gcggacggag 2760 tgcacaccga tccagagaaa ataaagtcag tattagaatg ggaaagacca ctcaacaaga 2820 cgcaagtacg aagcttcatt ggtctgtgcg cttattaccg gaaattcata agagactttt 2880 caactatagc ccgaccgatg cataagctga tggagcatca tacaccattt atatgggacg 2940 aagaatgtga acaagcattc gagagattaa agcaaaaact gttggaggcg ccagtcttag 3000 catatccttg ggatgatact gagttcgtgt tagatacgga tgccagctat tgcggcattg 3060 gggctgtact ttcgcaagtc caggaaggca acgagcgggt tgtagcatac tacagtaagt 3120 cactatccaa ggcagagcgt aattactgtg tcacgagaaa agaattggtc gctgtaaata 3180 aagccgtgca acatttccac cactatctgt atggaagaca ttttaaattg cgaacggatc 3240 atacagctct cacatggctt cgcagtttcc gtaatcctga aggacaagtt gctcgatgga 3300 ttgaaagatt gtctcagtat gattttacta gtgaacatcg ccctggaact cgacatggaa 3360 atgcagatgg gctttcaagg agaccttgtt ttgaccttgg atgtaaacat tgtagtagat 3420 tggaacgcga tgttaccgac aaagccaaag gtgacgcaca gccttgcaat cgaataggtg 3480 cggtcgatga tgcgtttaaa ttaggccagc aaagtgactt agtgttgcag caagttatcc 3540 agtgggtaca gagcgggaaa cgtccaacgt ttgaagacac atctcaactt gattcaacaa 3600 ctcgaagtta ttggacaatg ttcgacagtt tgttggtgca taacggtgtt ctttgtcgta 3660 attttgtagg cccacataac gagttcttgc aagtaattgt ccctaagaag atggtgcaat 3720 atgtgatgga acaaattcac ggaggtttaa ctggtggtca ctatggtgtt gcaaaaactc 3780 ttataaaggt caaggaacgg ttctactggg tcggactctc aagagatgtc aagctatggt 3840 gtatgaattg tacggtgtgt ggtgctcgaa aaggaccaag tactagatct cgaggagagt 3900 taaagcctat acttgtcggg gaaccatttc ggatgattgg agtcgatatc ttgggaccat 3960 ttccaataac cgagagagga aatcgatgga tactcgtgat tactgattac ttcacaaaat 4020 ggccagaggc cgttgcccta ccatcacaaa cggcagaagt ggtagcagat gctctgcttt 4080 caactgttgt atcaagattt ggagttcctc aacagattca ttctgatcaa ggcaagaatt 4140 tcgaaagtga agtatttgga tgtttaatgg atttacttgg ggtcaataaa actcgcacca 4200 cgccgttgca tcctcaatct gatggacaga ccgaaagatt caaccgtact cttcttgatt 4260 acctgtccaa atttatcgat aaggatcaaa agatatggga cacattactg ccactgttga 4320 gttaccgagc atcaacacac gaatcaacga agttttcacc agctatcttg aatttgggcc 4380 gtaaactggt actgcctata gacttatgga gaagctcatc tcctgacgca cttcattcag 4440 gtccatccta tgttcagaag attagagaaa acatgcagcg tattcatgaa gaagtaagga 4500 acaacttgag cgtagccgca gagtcaatga agaggcgcta tgattacaaa tccaatgcta 4560 caacattcaa cgtaggggac cttgtgtggt tattcaatcc gcaacgtcgt ataggaaaat 4620 caccaaaact ccaatgcgat tgggagggcc catacaaaat actgcgacga atcagtgatt 4680 tagtttacga aatcaagaag gatgccattg gtagtagaaa gaagataatc catgtggatc 4740 gattggcttc ctttaaaggt agccgggacg gctaggaccc aagaaggagg cag 4793 // ID BEL-184_AA-I repbase; DNA; INV; 6265 BP. XX AC supercont1.151; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-184_AA_; KW BEL-184_AA-LTR; BEL-184_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.151; Positions 1274015 1280279. XX CC 'TAGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1817..5635 FT /product="BEL-184_AA-I_1p" FT /translation="MLENLECLHKSCAAASSNRSAKPISSASGSRTPSSNR FT PFVATAVRDRSKISCFVCRQNHYINQCEEFLKLSPQQRIEKAKSLNLCFNC FT LSSSHTVAQCTRSLCRTCQKKHHTLLHLESKQSLSVVPSEKVPLAQSKPQR FT VTPPTEPSVSNTTVSQAYRSFPIRRQVLLNTAIVYVEDSHKEYHKVRVLLD FT SGSMVNFITEDCIQRLGLKKFKGNTEVTGVCGNTSSIQYKVRTVIHSTTSN FT FETEIECLVTKKITSDLPIEGFDTTSLMIPNNLTLADPSFNIPNKIDVLIG FT ADVYMNILEGNRVPLHNGSPVMQETLFGWVIGGKVEAKRPVVSALVSNENL FT DLLLRSFWETEGGNIESRFTTEEEQAEQHFVDTHRRQEDGRYIVELPFKNG FT CPDLGDSKQMALKRLQALDRRLSRNWTLATDYEDFINEYISLGHMVELGPL FT EHYKAELGQDYFLPHHAVEKPDSSTTKCRVVFDGSAASSNGRSLNDNLLVG FT PVIQPKLNEIALRFRVPKIAITTDISKMYRQVQVASNHQQFQQILWTPTKT FT SDPRVYRLTTVTYGLASAPFQAVRAVKQLCIDEAARFPEAARVIGEDSYID FT DVLTGAETLEQAIQLKDEIIGLFESGKLELHKWCSNSSEFLQTLPQDKIEQ FT KLLVGDKQSVKALGIIWKPDEDVFMLNVDPIILGKAEGTKREILSSISRFF FT DPIGLAGPVIIVVKLIMQGLWRKEKGWDEKADQEDIDRWMEFKTQLASLQQ FT IKISRCILPFTSMSIQIHGFSDASNLAFGACVYLRSMNKNGQVYMSLVCSR FT SRVAPIKKSNKSKGFEPDLTIPRLELCGALLLAQLVTDTINALNIRIDKMV FT LWCDSTIALAWIARQPEQLKQFVANRVAKIQALTSTMEWRHVGTNDNPADI FT ISRGLMPSELQHATLWWNGPEFLRGEESTWPAEYQESGEVYETINVCAEQP FT TETFPIFENISSYRRMVRVMVWVQRFISIMRKQPNVSKASKMSNAEKTSAF FT QQLVKLVQGEAYPDLVRQLQHGKMIDTRHKLISLSPFIDDQGVLRVGGRLR FT NKNCNFDMKHQYILPPYHKFTEAIIREYHRENLHSGNQLTLSMIRETFWIE FT RGKSAIKRFNCLRCFRQKPKPLQQYMGELPTDRVELVYPFYNTGVDFCGPV FT YLKPTIRSTTRAKSYICVFVCQATKAIHLELVGSLTSVAFIGALQRFVSRR FT GKCAKLISDNATNFVGANNELRELHELFNSQPFLKKRHHLEDCGKLEFGPS FT NIICAEPLVKPS" XX SQ Sequence 6265 BP; 1790 A; 1452 C; 1489 G; 1534 T; 0 other; ttttggtcct tcgagccgga tagttgaagg gccgtggtgt gaagtgaaac caacggtgaa 60 tttgaagcga tcgatcgaca atgatgccat gcgggcaaac gatgaccctg ctttggttgt 120 gtgaacaaac gaaagagacg aacaaaatga cgttcagtgt gacgctcgtg tgagccgtgc 180 aagtgcgaac gaagacgaca gaaaaattct ggtcagccat tgtattccga accgtcgagt 240 gtagcacgtg cgtaatcgaa tcgtcgcaat tttgtgactt tgtgctctaa ttactaataa 300 aagtccggac aatttacgta ccgtcgtaac gagtggtgaa attcgcgaag caaaagtgct 360 ttaatttcct tgctagcggg aatcgagaat aaattcttgg cgaattgtct cgagtgcggt 420 gtttttccgc gagatccatc gcgagtggaa tcggtgaaag gagtcaaagg gctccaaaaa 480 cgtctcgaat tcgagggaga aatcgtccca agtgaaatgg aatcgaattg cttaagcaat 540 aattccgaaa aagtgcaaat cgtgatcgtt tgaaaccgtc gcgagtgcaa aaactgtgga 600 cggagcagcc aagctgccgt atcgaaaaag gtgaacagtg aacccaaaaa tcgagaaagg 660 gaccagatta tacggcccaa aaacatactg aaaattcgga aaagtgaaca ataagtgcta 720 aaaagggcgt tagagtaccc caagagaaga ataaaagtgc tacaagtgtc ctaagaggca 780 ccaaactgcg ttccagaaag tgcctactga cgccgtcggc agccattgcg tgtcgctggg 840 tgagcgagac aacctgcgcc aaccgccaaa ctcctgagaa caaaggtagg tgatgagcca 900 ttttgtatta cattgagcaa gcccacttaa agcatactag cgtgacctca aaatgaccga 960 gaaaaagctg caaattgttt gcctgtgtcg aattttcgat gccattcgag atcgctcgaa 1020 taagctcttg tcgttcgtag aagccttttc tgctgaaacg ggtgacattg acgagctgga 1080 agaaaaatta tctgtcctcg atgatttacg tggccaattt cttgataccc gttcgaaact 1140 ctatgggttg gtaaaggacg atgacttggg tgatatccag gaacgtgggg aagaaataga 1200 ggacatcctt gatagcagtc gtttcaaaat ccgtcgtcat ttgatacgat gcaaaccgca 1260 aacggaccaa gacatcaaac ttgagttcga agctggtact tccaaaacca agcttcccga 1320 tattcccctt ccaaaatttg acggacacta cgaaaattgg atctttttcc gggatcaatt 1380 caagtcaatt atttgtcgtc gtgaaaatct ggatgatttc gagaaattgc actatctgcg 1440 aatgtgcctg tgtggggagg caaaacattt gcaatgtaac gaggagacgt tttcttcact 1500 ttgggatgca ttgaaccgtc gctacgaaaa caagagatgg ttggtcgaga aacatttggg 1560 ggatttgttc caaattccgc atttgactgc cgaaaatgct gctggcttgc gttcgttgct 1620 cgacaatttt ttaaagcaca tccgtgcttt gaacgccctc gatatcccac tggataaaat 1680 gtctgaattg atttttgctc aaatggtcat gatacgtcta catccaagaa cccgccgaca 1740 gtattaagct gagttagagg aatccaaact gccggattgg aaggatttgg tgagcttctt 1800 ggagaaccac tgccgcatgt tggaaaattt ggagtgtcta cacaaaagtt gtgctgctgc 1860 ctcttcgaat cgctctgcca aacccatttc gtctgcttct ggttctcgta ccccatcttc 1920 aaaccgtccc tttgttgcga ccgctgtgcg tgatcgtagc aaaattagct gctttgtgtg 1980 ccgacaaaat cattatatca accagtgtga agagtttcta aaactgagtc cccaacagcg 2040 tatcgaaaag gcgaagtctc tcaacctgtg cttcaactgt ctctccagca gtcacactgt 2100 tgcgcaatgt accagaagct tgtgccgcac gtgccagaaa aagcatcata ctctgcttca 2160 tctggaatcg aagcaatcgc tttccgtggt cccgagtgaa aaagttccgc ttgcgcaatc 2220 caaacctcaa cgagtaactc caccaactga accttccgtg tcgaacacca ccgtctccca 2280 agcttatcga tcgtttccca ttcgacgcca ggttcttctc aatactgcta tcgtttatgt 2340 tgaagattct cacaaggaat accacaaggt ccgcgtacta ttggattccg gatctatggt 2400 taacttcatc actgaagatt gtatccaacg gcttggtctg aagaagttca agggcaatac 2460 cgaagttacg ggagtctgtg gcaatacgtc gtccatccaa tacaaggtac gcacggtgat 2520 tcattcgacc acttcaaact ttgaaacaga gatcgagtgc ctcgttacca agaagattac 2580 atccgatttg ccaatcgaag gattcgatac aacctctctg atgattccga acaatctcac 2640 tttggcggat ccttctttta atattccaaa taaaattgat gtccttatcg gcgctgacgt 2700 ctacatgaac atcctggaag gcaatcgagt tcctttgcac aatggcagcc cagtaatgca 2760 ggaaactctt tttggatggg taatcggtgg aaaggttgaa gctaaaaggc cagttgtttc 2820 tgcacttgtc tcgaacgaaa atctggacct tctgttgagg agtttttggg aaactgaagg 2880 tggcaacatt gaatcaaggt tcacaaccga ggaagagcaa gcggaacagc acttcgttga 2940 tactcaccgt cgtcaagagg atggtcgcta tattgtggaa ttgccgttca aaaatggttg 3000 tccggatctt ggcgattcca aacaaatggc tttgaaacga ctgcaggcat tggacagacg 3060 cttgtcgcgt aactggacct tggccacgga ctacgaagac ttcatcaacg aatacataag 3120 tctaggccat atggtggaac tgggtccttt ggaacactat aaggctgaac ttggtcagga 3180 ctacttcctt ccccatcatg ctgttgaaaa gcctgacagc agcactacca agtgccgcgt 3240 agtttttgat ggatctgcgg catcaagcaa cggaagatca ctgaatgaca atctactcgt 3300 tggtcctgtg atacagccga agttgaatga aattgcattg aggttccgtg ttccgaagat 3360 agcgatcacc acagacatca gtaaaatgta tcgacaagtc caagttgctt caaaccatca 3420 acaatttcag caaatattgt ggacacctac caaaacaagc gatcctcgtg tgtatcgatt 3480 aacaacggtg acctacggac tcgcaagcgc accattccaa gcggttcgag cagtgaagca 3540 attatgtatc gacgaagctg cacgctttcc cgaggcggcg agggttattg gtgaagattc 3600 gtacatcgat gacgttttga ccggtgcaga aacactagag caagcgattc agctgaagga 3660 tgaaatcatt gggctatttg aaagtggaaa actcgaactt cacaagtggt gttccaacag 3720 tagcgaattc ctacaaaccc tccctcagga taaaatcgaa caaaagttgt tggtcggcga 3780 taaacaatca gtgaaggccc tgggaatcat ctggaagcct gacgaagatg ttttcatgct 3840 caacgttgac ccaatcatcc tcggaaaggc agaaggtacg aagcgagaaa ttctgtcaag 3900 catctcaaga tttttcgatc ccattggatt ggccggaccc gtgatcatcg tggtgaagct 3960 catcatgcaa ggcttgtgga ggaaggagaa aggctgggac gaaaaggcgg atcaagagga 4020 catcgataga tggatggagt tcaaaacaca gttggctagt ctccagcaga tcaagattag 4080 ccgatgcatt ctgccgttca cctccatgtc tatccaaatt cacggctttt ctgacgcctc 4140 aaatcttgcc tttggggcat gcgtctatct caggtctatg aacaaaaatg gtcaggtcta 4200 tatgtcgcta gtatgctcaa ggtcacgtgt agccccaatc aaaaagtcga ataaatcaaa 4260 agggtttgag cctgatctca caatacccag gcttgagtta tgtggtgctt tgctcttggc 4320 tcaacttgtc actgatacca taaatgcctt aaatatccgt atcgataaaa tggttctttg 4380 gtgcgattca acgatcgctc ttgcttggat cgcacgacag cctgaacaac tcaagcaatt 4440 tgtggcaaat cgagtcgcaa agattcaggc gttaacatcg acaatggagt ggcgtcatgt 4500 aggtaccaac gacaacccag cggacatcat atcaagaggc ttgatgccga gcgagcttca 4560 acatgcaact ttgtggtgga atggacccga atttctgcga ggtgaagaat caacgtggcc 4620 agcggagtac caagaatctg gagaagtcta cgaaacaatc aacgtttgtg ccgagcaacc 4680 aaccgaaacg ttccctattt tcgagaacat cagttcctat cgtcgaatgg ttcgagtcat 4740 ggtctgggtg caacgattca ttagcattat gaggaagcaa cccaacgttt cgaaagcatc 4800 caagatgagt aatgccgaga aaacatcagc ctttcaacag ttggtcaagc tagtgcaagg 4860 ggaggcctac ccggatctag ttcgtcagct gcagcatggg aagatgatcg acaccaggca 4920 taaattgatt tccctatctc cgttcatcga cgatcaaggc gttctgcgag tgggaggacg 4980 gctacgtaat aagaactgca atttcgacat gaaacaccag tacatcttgc ctccgtacca 5040 caagttcacg gaagctatca tccgagagta ccatcgggag aatctgcatt ctggaaatca 5100 gctcaccttg tccatgatca gggagacatt ctggattgaa cgcgggaagt ctgccatcaa 5160 acgcttcaac tgcctacggt gcttccgtca aaaaccaaaa ccgttgcaac aatacatggg 5220 cgagcttcca acggaccgtg tggagcttgt ctatcccttc tacaacactg gggtagattt 5280 ttgtggaccg gtgtacctca aaccgacaat tcgatctacg acccgtgcca agtcctacat 5340 ctgtgtgttt gtgtgccaag caacaaaagc gatacacctg gagttagttg gatccctgac 5400 ttcagttgca ttcatcggag cccttcaaag gtttgtgtct cgacgaggaa aatgtgccaa 5460 gctaatctcc gacaacgcta ccaactttgt cggcgctaac aacgagctcc gggaactgca 5520 cgagctcttc aactcccaac cttttttgaa gaaacgccat catttggagg attgtgggaa 5580 gctggagttc ggtccctcaa acatcatttg tgcagaaccc ttggtgaagc catcctgacc 5640 tatgaggaga tgattactgt gttgacgagt gaaggagatc gaagctaccc taaactccag 5700 accgctgtgt gctctctccg acgacccaac cgactacaat gccttaacgc cgggtcattt 5760 tttgatcttc cggccactga acgcaatagc tgaacccgac ttgacaagca caaacatcca 5820 cactttgtct cgatgggaca aaattaagca gtatgtgcag catttctggc gtaggtggaa 5880 cgtcgactac atccaccaac tccagcagcg ttctaaatgg cactccaagg ttccagtggc 5940 agtaggtcag ttggtaattg tacgcgaaga caacgttcct cctcaacagt ggctacaagg 6000 acgaatcgag caacttttcc ctggtaacga taacatcgtg agagtcgtaa cagtacgcac 6060 ggcgaaaggc gtatacaagc gatcagtttc acgacttagt ttgctaccaa tcaaggacaa 6120 cgacattgaa gcaaacaaca acattggata gagatccgat gaaatgtgaa gtcccgagag 6180 tgcatcgtaa cgacgtcccg aaaaatcaag atctacgaag tcccgacggc aatatgttga 6240 aagctccctt tcaacggggg gagaa 6265 // ID Gypsy-588_AA-I repbase; DNA; INV; 5804 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-588_AA_; KW Gypsy-588_AA-LTR; Ty3_gypsy_Ele61; Gypsy-588_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5804 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4858-5337] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 812..2071 FT /product="Gypsy-588_AA-I_2p" FT /translation="MNNGESEIELLIGQIHKIKTNLKKAPFRKYLKFTLDE FT KLRNVKLIYKNITDLLLENEARIGSSHFNFLIKAARQEFDETLILLNTKLK FT HYKKSVSCKSVVYAVIFNNLLKKCIARNVTTMPFDIKTATSIVQVYDGTSE FT NLDAYVDSANLLKDYVAADQVATAVRFLKTRLTGKARLGLAENLNTIDALI FT DDVKQRCAEQITPANIIAKLKTTKQKGDTNNFCDEIDKLTNKLKSIYVGQG FT IPDGVAKTMATKAGVDALINGTTSYETKIILKAGTFSNINEAVQKVQENAT FT PTAANQQNSSQIMTYGTHKHNNHTRNNRGRRERNQNAQRGRHTYHRYQGHF FT LNRTPQNYGFNRGGRRDQRPRQMFVTNAEQIYPMPQNMPRMQNLPQQIPIP FT TAGQPQNVNFLGQANQGPSPQQQFIR" FT CDS 2113..5688 FT /product="Gypsy-588_AA-I_1p" FT /translation="MTNSKCTFILDSGADVSLFKVTKVPAMQIVDFNKKLR FT ITGVTEGVTETIAEAQTSVTFDNNLKLVHSFQLVSENFPIPTDGILGRDFL FT VKFRCTIDYDNWLLNFNFQNHSICIPIEDSINNSIILPPRCEVIRRLPNFS FT VTEDSIVLSQEILPGVFCGNTIVSSNSPYIKFVNTTQSQVAISNFHPICEP FT LHDYVRVNKPTENLAMKKERIQNIFSKINFNEIPSYVHSSLKKLIEKYSDI FT FSLENEKLSHNNFYSQNILLNDQVPVYIPNYKTIHSQGDEMAKQIQKLLDD FT DIIEPSVSPYNSPILLVPKKAGNNSKKWRLVVDFRQLNKKVLADKFPLPRI FT DIILDQLGRAKYFSTLDLMSGFHQIPLDQESRKYTAFSAPNGHFHFKRLPF FT GLNISPNSFQRMMTIAMAGLNPECAFIYIDDIIVIGCSENHHFQNLTTVFE FT RLRHYNLKLNPAKCKFFLNEVTFLGHKVTDRGILPDDSKFEALKNYPKPNN FT VDEVRRFVAFCNYYRKFVPNFADIARPLNALLKKGVQFQWTDSQEQSFQQL FT KQTLMEPYILRYPDFSREFILTTDASNYACGAVLSQRYTEGDFPIAFASRS FT FTQGERNKPTIEKELAAIHWAVNYFKSYLYGRRFTIKTDHRPLVYLFNMKK FT PTSKLTLMRLDLEEYNFNIEFLAGKSNVCADALSRIPLNSDDLKSLSVLVV FT NTRAMNRKKLNDTTKFDDSQLADSETDHLTAWQTENPSEVRKYLKVGCEVH FT HNQLKVKLFNNNYNKMLQTITIQLYDENTNGSQALELALLELGKLLNHYKR FT TMVAISLQDELFKSFASETVKEIANRAISNYQLVLYNPPTFIQNVDKINEI FT LVNNHMTPTGGHIGQHKLYLKLREKFTWKNMKLEISNFVKNCEKCKINKVN FT RHTKEPVIVTSTPFKPFETISIDTAGPFGITNNRNRYILTIQCNLTKYIVL FT APVPTKEASVMARALVENFILIYGNFLQLNSDQGTEFNNDVFEQICKLLEI FT KQTFATAYHPQTIGALERNHRCLNEYLRSFTNTHNSDWDDWTKFFAFSYNT FT TPNIEHGYTPYELVFGRKATLPHDLQDNQSMINPVYNPDQYYQELRYKLQT FT SNSNARNKLIFQKEKRKNVCNQNINKIDLNIGDLVYLTNENRKKLDPFYIG FT PFTVTKIADPNCTVKHNNTHKETIVHKNRTIKI" XX SQ Sequence 5804 BP; 2181 A; 1059 C; 990 G; 1574 T; 0 other; tggctgaccg cgaacaggca accgagcctg aagacttatt caatagtcaa ttaaaactgt 60 aaaaagtgcc aagtggaagc gaaaaatctt gaaaagtgga aaaagtttaa attataaaat 120 aattgtgaag tgagtaccca caatggggaa taataaatcc aaaactgctc agaattcagg 180 ggacccgcaa gtccaaattc tgaatgaact tgcaattcat gaagagtatc acgccgacca 240 cgagtttaaa ctcaacatca ttttagcatt agtcggtcta cagttagcag tgatgtttta 300 ccaactgtac aaattacata ccaaaaggca agcaatgaag gctgctagat caatagccaa 360 catgcaaaac cctgcttagt gcctgaaaaa gaactcgagt aagctgcaag tgaaatacga 420 acaattatga aaaaccgtcg aagtggaata agaacagttt cgtgacccgt tagtatacac 480 cgttggcgtt tatagtgatg aagtgaaagc acaaataagt gcaaaaatca aacaaacaaa 540 tcaactgcag gcgagaccac ggtcgacagg agtcgaccgg cgggattaga aagcgcatgg 600 gctcagcgcc aacacaaaca acaacaacaa cgatgtgatg caacgagacg ccgaggatcg 660 tatatcgagc gacaaccggg aagctacgat ataaccgtat acaattttat aaggtaacgt 720 aaacaaatca acggatgtgt ttttttttta ccagtggaaa ataatactta gaagtaatgc 780 tttgatcgtc gaacaaatat taatatttct aatgaataat ggtgaaagtg aaatcgagct 840 tctaatagga caaattcaca aaattaaaac aaacttgaaa aaggctccat ttcgaaaata 900 tttgaaattc acactcgatg aaaaattacg aaatgtaaag ttaatatata aaaatattac 960 ggaccttctc ttagaaaatg aggcgcgtat cggaagctca cattttaatt ttctgattaa 1020 agcagctcgt caagaatttg acgaaaccct tattttgtta aatacaaagt taaagcatta 1080 taaaaaatcg gtatcttgca agtcggtggt ctacgccgta atatttaaca atttactgaa 1140 aaagtgtata gctagaaacg taacaacaat gccattcgac ataaaaacgg caacttccat 1200 cgtacaagtg tacgatggaa cgtctgaaaa tctagacgcc tacgtggatt cggcgaattt 1260 attgaaggat tatgttgctg ctgatcaagt ggcaacagca gtaagatttt taaaaaccag 1320 actgactgga aaagccagat taggcttggc ggaaaatctt aacacaatcg atgctcttat 1380 cgatgatgtc aaacaaagat gcgcagaaca aattacacct gcaaacataa tcgcaaaatt 1440 aaaaactaca aaacaaaaag gggacacaaa taatttttgt gacgaaatag acaaactaac 1500 caacaagttg aaaagcattt acgttggtca aggcatcccg gatggggtgg ctaagaccat 1560 ggccacgaaa gctggagtag atgccttaat taatggcacc accagctatg aaacaaaaat 1620 tatcttaaaa gccggcacct tctccaatat caatgaagcg gtgcagaaag tacaagaaaa 1680 cgctactcca acagcagcca atcaacagaa ctcttcacaa attatgactt atggtacaca 1740 taagcataat aatcatacca gaaataatcg gggccgacgt gaaaggaacc aaaatgcaca 1800 aagaggaagg catacttatc acaggtatca agggcatttt ttaaatcgaa cccctcaaaa 1860 ttacgggttt aatcgaggcg gtcggcgaga ccaacgacct cgacaaatgt ttgtaacaaa 1920 tgctgaacaa atttatccta tgccacaaaa catgccacgt atgcaaaacc tacctcaaca 1980 aattccgatt ccgacagcag ggcaacctca aaatgtaaat tttttaggcc aggcgaacca 2040 gggtccatcc cctcaacaac aattcattcg ataaatttaa atgctgtaaa tttcgtaacc 2100 ttatatgttg aaatgacaaa cagtaagtgt acatttattt tagatagcgg agctgatgta 2160 tctctgttta aagttacaaa agttcccgca atgcaaatag tagattttaa caagaaactt 2220 agaataacag gtgttacgga aggagtgacg gaaactattg ccgaagctca aacttctgta 2280 acattcgaca ataaccttaa attagttcac tcttttcagt tagttagtga aaatttccct 2340 atacccacag atggtatact tgggcgtgac ttcttagtca agttcaggtg caccatcgac 2400 tatgataatt ggcttttaaa ttttaatttt caaaatcatt caatatgcat acctattgaa 2460 gatagtatta ataatagtat aattttacca ccaagatgcg aggtgattag aagattacca 2520 aattttagtg tgaccgagga ttcaatcgtg ctctcacaag aaatacttcc aggagttttt 2580 tgtggaaaca ctattgtatc atcaaactca ccgtatataa aatttgttaa tacaacgcag 2640 tcacaggtcg caatctcaaa ttttcatcct atatgtgaac ctttacacga ttatgttaga 2700 gtcaataaac ctacggaaaa tttagctatg aaaaaggaaa ggatacaaaa cattttttcg 2760 aaaatcaatt ttaacgaaat tccttcatat gttcactcct cgttaaaaaa gttgattgaa 2820 aaatattctg atattttcag tttagagaac gaaaagttat ctcacaataa tttttattct 2880 caaaatattc ttctaaatga ccaggtacca gtctatattc caaattacaa aaccattcac 2940 tcgcaaggag atgaaatggc aaaacaaatt caaaaacttt tggacgatga tataatcgaa 3000 ccatcagttt ctccgtacaa ttcgccaatt ctcttggtac caaagaaagc tggcaataat 3060 tcaaaaaaat ggagattagt tgtcgacttc cgacaattaa ataaaaaagt tttggcagac 3120 aaatttcctt tgccacgtat tgacataatt ttagatcaat tagggagagc aaagtacttc 3180 agtacgcttg atctaatgtc tggttttcac cagataccat tagaccaaga gtctagaaag 3240 tacacagcct tttcagcccc taatggtcat tttcatttta aaagattgcc cttcggttta 3300 aacatatcac ccaatagttt tcagcgaatg atgactattg ctatggcagg attaaatcct 3360 gaatgtgcat tcatttatat cgatgatata atcgtcatcg gctgttcgga aaatcatcat 3420 tttcaaaatt taactacggt gtttgagcgt ttgaggcatt ataacctcaa attaaatcct 3480 gcaaaatgca aatttttctt aaatgaagtg acatttcttg gtcacaaggt aacggataga 3540 ggcatccttc cagacgattc taagtttgaa gccttgaaaa actatcccaa acctaacaat 3600 gtcgacgaag tacgtagatt tgttgcattt tgcaactact accgaaaatt cgtaccaaat 3660 tttgcggata tagctcgacc attaaatgca ttgcttaaaa aaggagtgca gtttcaatgg 3720 acagatagtc aggaacaaag ttttcaacaa ttaaaacaaa ccttgatgga accatatatt 3780 ctgagatacc ctgatttttc acgggaattt attttaacca cagatgcatc aaactatgca 3840 tgtggagcag ttctttctca gcgttatact gaaggagatt tcccaatagc ttttgcgagt 3900 agaagcttta cgcaaggcga acgcaataag cctaccattg aaaaagaatt agcagcaatt 3960 cattgggctg ttaattattt taaatcgtat ttatatggcc gaagattcac aattaaaacg 4020 gatcacagac cattagtata tttatttaac atgaaaaagc caacatcgaa attaacgttg 4080 atgaggcttg atttggaaga atataatttt aacattgaat ttcttgctgg aaaatctaat 4140 gtttgcgcag atgcattgtc gcgtatccca ttgaactcgg atgaccttaa atctttatct 4200 gttctagtgg ttaacactag agcaatgaat agaaaaaaat taaacgatac aactaaattc 4260 gatgattcgc agttagcaga cagtgagact gatcacctca cggcttggca aactgagaat 4320 ccatcggaag ttagaaaata tctgaaagtt gggtgtgaag tgcaccacaa ccaactgaaa 4380 gttaaattat ttaataataa ctacaataag atgcttcaaa cgattacgat tcaactatat 4440 gacgaaaaca caaatggaag tcaagcatta gagcttgcac ttttagaact tggaaaatta 4500 ctgaatcatt ataaaagaac catggtcgcc atctctttgc aagatgagct attcaaatca 4560 ttcgctagcg aaactgttaa ggaaatcgca aatagagcca tttccaatta ccagctggta 4620 ctgtataatc caccaacatt tatacagaat gttgataaaa ttaatgaaat tttagttaat 4680 aaccatatga cacctacggg aggtcatatt ggtcaacata aactttattt aaaattaaga 4740 gaaaaattta catggaaaaa catgaaatta gaaataagca attttgttaa aaattgcgaa 4800 aagtgtaaaa ttaacaaagt gaacagacat actaaagagc ctgttatcgt aacttcaacc 4860 ccttttaaac cattcgaaac aatttcaatt gataccgcag gaccattcgg cattacaaac 4920 aatagaaata gatacattct aactatacaa tgtaatttaa cgaaatacat tgtacttgca 4980 ccagtaccaa caaaagaagc ttctgtaatg gcaagagcat tagtggaaaa ttttatcttg 5040 atttatggaa attttcttca attaaattcc gatcaaggca ctgaattcaa taatgatgtt 5100 tttgaacaga tttgtaaatt attagaaata aagcaaacat ttgcaacagc ctatcatcca 5160 cagacaatcg gagctttgga aaggaaccac agatgtctca atgagtatct gagatccttc 5220 accaatactc acaattcgga ctgggatgat tggacaaaat tttttgcctt ctcttacaat 5280 accacaccaa atattgaaca tggctataca ccatacgaac ttgtttttgg acgaaaagct 5340 acactaccgc atgacttgca agataaccaa tcaatgatta atccagttta taatccagat 5400 caatattatc aagaattacg ctataaacta caaacatcaa actcaaatgc tagaaacaaa 5460 ttaattttcc aaaaagaaaa acgtaaaaat gtgtgcaatc aaaacatcaa taaaatcgat 5520 ttaaacatag gagatttagt atatctgaca aacgaaaatc gaaaaaaact tgacccgttt 5580 tacataggac cattcacagt aaccaaaata gcagatccaa attgtacagt aaaacataac 5640 aacactcata aagaaacaat agtccataaa aatagaacaa taaaaatcta aataaatgat 5700 agcaattaaa gaaaacataa ttaacattct agatgaggta cttggaggat aggtcataac 5760 gaatgattcc atttttatta catcattctc caaaaggggg aaga 5804 // ID Copia-1_DVir-I repbase; DNA; INV; 4151 BP. XX AC scaffold_5826; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DVir_; KW Copia-1_DVir-LTR; Copia-1_DVir-I. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-4151 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (07-MAR-2011). XX DR Genome; scaffold_5826; Positions 4574 424. XX CC Positions [1484-1846] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 134..1846 FT /product="Copia-1_DVir-I_1p" FT /translation="MNMSHQIEKLGEDNYDVWHMAMKSVLVVADLWNVVCG FT KYVKPEGNGEDIEKWVSVDQKAMAYIILNLKPTQLMHIKSCPTSAAVWKKL FT KEIHVSVGPIRKVQLYQKLIRFNMQQGDDVVAYVNSFVETIEKLAELDIKL FT NDELQVIMFLSSLPDTWENFVIAIETRDELPKFEVVKVKMLEEATRKQERA FT SRDGIGTPEAVYTHKHNGPRDNNSVESKGRKNSSKQPFRGKCFNCEKEGHR FT ASDCRNKKKDDKKSQKDEKSLCLMHTCVKEKQKSFWCVDSGATSHMCCDRS FT MFLTYADKKTSIMLAANKFIDSPGSGSVELHCNRIKLKLQNVLYVPDLSMN FT FLSVSAAAKFENSTIFEGKIALIKDKEGNEIIRARHENGLYVYKHSLDQVH FT MLNSSDSSSIKWHNRFGHLNFKDLKLLNDKQLVRGMKVENVQLEINCDTCN FT KAKICALPFPQKANRVSKQVLDLVHTDVCGPMNVKSLAGNRYFVTFIDDYS FT RKIFVYLMHSKSQVFEKFKMFKTFVECQTGNKIKAVRSDNGTEYVNKQFTE FT FLIQCGINRQLTSSLHSRNQMGVS" FT CDS 2628..4106 FT /product="Copia-1_DVir-I_2p" FT /translation="MQIEYDALQANETWSVCELPPGQKAIGSKWVYRIKRD FT KDGNIEKFKSRLVAQGCGQKFGVNYWETFSPVIRYETIRMLFAIAAEKELY FT MHQVDISNAYLNSKLNETVYMKQPPNFIDKQHPNKVLKLEKALYGLKQSGR FT VWNNTLDEVLRGMDFKRCKNEACLYVKQKQQQFSYIAVYVDDLIIVCPSEE FT DIAVIKKKIASNFKMHDGGPISYFLGMEIQRDGDRGSVSLCQKTHIQGLLD FT KYGMSNCRPVSTPLDPGFQVDCNDDTCVKVNITQYQSLIGSLIYLAVLSRP FT DILHAVNKLSQRNANPHGEHQSAAKHILRYLAGTKNMKLVYRKNGKPLKGY FT ADADWGNDRLDRRSYSGYAFFLAGSAFSWTSSKQSVVALSSTEAEYISLSM FT AAKEAVYLRRLLSEMGWSKDTEPLTICGDNLSAQHIAQNPVHHKRTKHIDI FT RYHFIREKVQCNDIALEYVPTNNNVADILTKCLSKQKHVGFVKTLGLN" XX SQ Sequence 4151 BP; 1427 A; 745 C; 937 G; 1042 T; 0 other; ttggttatgg gcccaggtct cgttgttttc ggttaaataa gtacacaaag tgtctatggt 60 tttgaaagct tccttttgtt aaatttttaa aaagtttatt tcgtgcgaag aataacgtgg 120 tgaaattgtc aaaatgaaca tgtcgcatca aatagaaaaa cttggtgagg acaattatga 180 cgtatggcat atggccatga aaagtgtctt ggtggtggcg gacctatgga atgttgtgtg 240 tggaaaatat gttaagccag aaggaaatgg tgaggatatc gaaaaatggg ttagtgtcga 300 ccaaaaagcg atggcttaca taattttaaa tctgaagcca acacagttga tgcatatcaa 360 gtcatgccca acatctgcag ctgtatggaa aaaactaaag gaaattcatg tctcagttgg 420 accgattagg aaggtccaat tatatcaaaa attaatacgt ttcaatatgc agcagggtga 480 cgatgtagtc gcatatgtca attcatttgt tgagacaatt gaaaagttgg cagaattaga 540 catcaaactc aatgacgagt tgcaggtcat catgtttttg agcagtctac cagacacatg 600 ggaaaatttt gtaatagcga ttgagacacg cgatgagctg cccaaattcg aagtcgtcaa 660 ggttaaaatg ttagaagagg ccacgcgaaa gcaagaaaga gctagcagag acggcatagg 720 gacaccagaa gcagtgtata cacacaaaca taatggaccg cgggacaaca acagtgtcga 780 aagcaaaggc agaaagaata gcagtaagca gccattccga ggcaaatgtt ttaattgcga 840 aaaagaaggg catcgagcga gcgattgccg aaataaaaag aaggatgata aaaagagcca 900 aaaggacgaa aagtctctat gcttaatgca cacgtgtgtc aaggagaagc aaaagagttt 960 ttggtgcgtt gacagtggtg caacatcgca catgtgctgc gaccgcagca tgttccttac 1020 gtatgctgat aaaaagacgt cgataatgtt ggccgcaaat aagtttatcg attcacccgg 1080 cagtggcagt gttgagttgc actgcaatag aataaaatta aagctgcaaa atgttttata 1140 tgtccctgac ttaagcatga attttttatc agtaagtgct gcagcaaagt ttgagaattc 1200 taccatattt gaagggaaaa ttgcgctgat aaaagacaaa gaaggcaacg agataataag 1260 ggcaagacac gaaaatggtt tgtatgttta caaacatagt cttgatcagg ttcatatgct 1320 aaatagttct gattcatctt ccataaaatg gcacaataga tttggccacc tgaattttaa 1380 ggatttgaaa ttgcttaatg ataagcaact ggtgcgtggt atgaaagttg aaaatgtaca 1440 attagaaata aattgcgata catgtaataa agcaaaaatt tgtgcattgc cattcccaca 1500 gaaagcgaat cgtgtctcaa agcaagtact ggacttagta catacagacg tatgtggtcc 1560 gatgaatgtt aagtcgcttg caggcaatcg atatttcgta acttttattg atgactactc 1620 acgcaagata tttgtttacc ttatgcactc aaaaagccaa gtttttgaaa agtttaaaat 1680 gtttaaaact tttgtagaat gccaaacagg aaataaaata aaggctgtgc gcagcgacaa 1740 tggaacagag tatgtgaata agcaattcac tgaatttttg atccagtgcg gaataaacag 1800 gcaattaaca agttccttac actcccgcaa ccaaatgggt gtgtcctaac gtgccaatcg 1860 gaaccatcgt tgaaatgggc caaaaacctc ctatttcatc aaaatcttaa taatttttgg 1920 gggcagaggc tgtgcaaaca gcagtttatt tgcgcaacag atgtcccaca aaagcataga 1980 tgggaagcac accatttgag gtatgaaagg atcgcaagcc atcagtacag cacttgagag 2040 tatttggctc gcgggcattt gcactcgaca aagtcagaaa aagcaagttc caagcaaagg 2100 gcaaggagta catttttgtt ggttactcct ttacagcaaa ggcatatcgc ctatatgatc 2160 gcgagaagcg aacagttatt gaaagaaggg atgtcaaatt cattgaaggt gagtttgaca 2220 tagcaacaac tgatgcatgc agtatttcag acagaaacac cttcacgacg tacattatcc 2280 cgtttgagtc aaatcatgag gttgcaaaaa accacccaga gcaacctgcc gaggtgcaag 2340 agtacaacag tgactcaaat gaagaagaag aggaggattt tgtgagcgca agcgatccag 2400 gagaagatgg caacgatgac gacaatgacc aagtagcgct gccacaagtt gaagctgtag 2460 taaaacatgg gcgtggcagg ccaaaaatta tacgtacagg tcagccaggc aggcccagaa 2520 agcagtacca attgatcaat agtcttaatc agattgaaga cattgaaact ccgcagtcag 2580 ttgcagaagc attacatggc caaaatgcgc aggattggga aaagtctatg caaatagagt 2640 acgacgcttt acaagcaaat gagacatggt cagtgtgtga gctacctcct ggacaaaagg 2700 ccattggttc aaaatgggtt taccgaatca agagagataa agatggtaat attgaaaaat 2760 ttaagtctcg actagttgca caaggttgtg gtcaaaaatt tggtgtcaat tactgggaaa 2820 cgttttctcc cgtaattcga tatgaaacca ttcgcatgtt attcgcaata gctgccgaaa 2880 aagagctgta tatgcatcaa gtcgacattt caaacgcata cttaaatagt aagcttaatg 2940 aaactgtgta catgaaacag ccgccgaatt ttattgacaa gcagcatcca aataaagtcc 3000 tcaaactaga aaaggcactt tacgggctta aacaatccgg acgagtttgg aacaatacgc 3060 tcgatgaggt attgagaggc atggatttta aaagatgcaa aaatgaagca tgcttatacg 3120 tcaagcaaaa gcaacaacaa ttcagctaca tcgctgtcta cgtcgacgat ttaatcatag 3180 tgtgtcccag cgaagaagac atagccgtga ttaagaagaa gattgcatca aattttaaaa 3240 tgcacgatgg aggccccatt agctatttcc taggcatgga aatccaacgc gacggcgatc 3300 gaggttccgt ttcgctgtgc cagaagacgc acattcaagg cctattggac aagtatggca 3360 tgagcaattg ccgtccagta agcacacctc ttgacccagg ttttcaagtg gactgcaacg 3420 acgatacttg tgtaaaggta aatataacgc aatatcagtc attaattggt tcgcttatat 3480 acttagcagt tttgagccga ccagacattt tgcatgctgt aaacaaatta tcacaaagaa 3540 atgctaatcc tcatggagaa caccagtctg cagcaaagca cattctaaga tatcttgcgg 3600 gtacaaagaa tatgaaatta gtataccgta aaaatggcaa accactaaaa ggctatgcag 3660 atgcagattg gggcaacgat cggctggaca ggagatccta tagtggctac gcatttttcc 3720 tggcaggcag tgcgttctct tggacttcat caaagcaaag cgttgtcgca ttgagcagca 3780 ctgaagcaga atatatttct ttgtccatgg ctgccaaaga agcagtttat ctacgcaggc 3840 tacttagtga gatgggatgg tcaaaggata cagagccgtt gactatatgt ggcgacaacc 3900 taagcgcgca acatattgcg caaaatccag tccatcataa gcgcactaag cacatagaca 3960 taaggtatca ttttatacgt gaaaaggtac aatgtaatga tattgcatta gaatacgtac 4020 ctacaaataa taatgttgca gatatactaa caaaatgttt aagtaagcaa aagcacgtag 4080 gctttgtaaa aacacttgga ttgaattaaa tttgttattt tgcaactgat tataattagc 4140 ttaagaagaa g 4151 // ID INE_WB repbase; DNA; INV; 969 BP. XX AC L19892; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Wuchereria bancrofti repetitive DNA sequence. XX KW Transposable Element; INE_WB; Interspersed repeat. XX OS Wuchereria bancrofti OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Wuchereria. XX RN [1] RA Siridewa K., Karunanayake H.E., Chandrasekharan V.N., RA Abeyewickreme W., Franzen L., Aslund L. and Pettersson U.; RT "Cloning and characterization of a repetitive DNA sequence RT specific for Wuchereria bancrofti."; RL Am. J. Trop. Med. Hyg 51(4), 495-500 (1994). XX DR Genbank; L19892; Positions 1 969. XX SQ Sequence 969 BP; 321 A; 135 C; 190 G; 323 T; 0 other; gatcagacgg aaattctgag ctaaatagtg cgcctcaaaa ttgttgtcgt aatcgaggat 60 atcaaagctg tctgagatgc cgggatgagt aatatttgtg acaatttcat caccggtatc 120 gagattaatt agagaaaaag tttcattttt atcagaagcg atgcaaagca gaagattgag 180 ccaaatgatg ggtgtcatag gtatagtgga gaacatggtg aaatatagaa ttttatgaaa 240 tgacagaaat gctcaaatga aagtttttta tttcagattg aaattaagga ccaaagattc 300 aaacaattct taatccgtcc attgtttgtc atatttttac atcctcgctt tcttctccct 360 caactgttag tttatcaatg ctcttttgaa agcattaagg gaagcattaa agtagaatgt 420 acgtagttac tgatagtatc actggtggaa cttcatcgga tggacgcatc acctgaaatt 480 tggtaattgt tgctttttga catacatcca ttgctattat cagaaagaat ttcaacacaa 540 gctgaaacga tgtttgatgt ttgaaaggtt gggacaaatt ttaaaaattg gtgttgcgac 600 aatggatgaa ttaatatttc gataaatgga actgatgaat gtttgaactg aatgtcactg 660 gatgatgaac tctttaattt gtacagtgtt gaataatcct ttctgctttg tttgataata 720 tactgatact gaattatttg catattatag aaattttaga tttcaaatgt ttgtgacatc 780 agttgattaa ctgttgcaac ctttcgcagt taattaacac aaaatttaat ttgataaatc 840 taacatgcaa ctttttgcag tgaatatccc ggtctcaaat agaggaacaa caaatatagt 900 tgacagcatg atataacgat tagtaagtgc tagtttgacc ttcgagaaat caggcatatt 960 gtaggatgg 969 // ID CR1-91_AAe repbase; DNA; INV; 4894 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-91_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4894 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1179-1179 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 80..916 FT /product="CR1-91_AAe_1p" FT /translation="MSLVCYSCAIEIGDVQVECQGFCNAIFHPRCCGIRAD FT VYEEVMRNHQVFWFCPSCTTLMKKVRFRNTARAAYEAGQSQAINSHSDTMQ FT NLKSEIMEAEIRTTFAKLINSSSCTPXSSKRVGIDARFTRSRRLFSTVRDX FT KSNQQPPLLHGTGSTLSPSNEIATVPPVQPKFWLYLSRIAKDVSTDQICAF FT AKKRLGSEDIQVTRLVAKGRXISTLSFVSFKIGMDIELKPKALSSSSWPKG FT ILYREFTDKSSENFWRPTLTTPSDDPLNLPTEEVVIME" FT CDS 1061..4744 FT /product="CR1-91_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MGSSAPLPRASTIMFPNTSPPEPYAAFSRPISSRPHA FT TSDIYIPGRTLATSMKEPPIPLIAVEPIPPASYSRPGPALELENGVFRTPT FT TGKYVVFTNFPSPESITIPSDYSSIRLPTRTLGRKTVISSKIAPTVLNAVK FT PLLCPAINSHAETEDAVLSSSTKKRHESNLSNRPRSTHVDHSPGRILAASI FT EETPISLSTVEPFLPATSSRPGPVPESGDGAFRKPLLGKYSEFVEFSAPES FT NSISSESTLAQRQYNVSQQEQRSKADKLLLYYQNVGGMNSTIEDYRLAVAD FT NCYDVIALTETWLDSRTLSNQVFGHDYEVFRCDRNPNNSKKSVGGGVAIAV FT NKKLKCRPIESETWKCVEQTWVAINLGECVVYLCVVYLPPDRIRDISMIDA FT HTQSIWHVMSEMSATDEIIVLGDFNLPGIAWKASHSGFLYPDPERSVIHPN FT AMNLLDSYSTATLKQICDVINENSRTLDLCFVSVQDAAPIVSIAPAPLVKF FT VPHHRPIILTLNQRLSSSVKIHKSQVYYDFKNTDDEGMANFLLTIDWDVCL FT DRNDIQSAAQTFSNIMLYAIDRYVPKKVIRCEKQPWQSNELKRLKTAKRAA FT LKKYTKRRNQSSKTNYVRLNNKYKRVSKRCYLNYQNKVQRNLKHRPKSFWK FT YVNEQRKESDLPSSLELNGETACKVDDICRLFAVKFSSVFADEQLSTERIS FT LAARNVPMQEQSLAVFTVDDTMIIRAARQLKSTCKPGPDGIPAILLKNNIS FT CLLSPLRHLFQTSLANGVFPYCWKFAHMYPVYKKGRRYDINNYRGITSLCA FT VSKLFELVVMEPLFSHCKHYLSEEQHGFMPGRSTTTNLLCLSSHITTSLAE FT RNQTDVIYTDLSAAFDKINHAIMIAKFQKLGLHGSFLRWIDSYLTDRRMLV FT SIGDSQSDVFVGTSGIPQGSHLGPLFFSIYFNDVNHVLKGPKLSFADDLKI FT FQQIRTTADAITLQNELDRFAWWCDLNRMLVNPNKCSVISFSRKKEPVRFK FT YTFSDVEICRVNHVKDLGVIMDSQFTFKQHVAYVVDKASRTLGFIMRLCKN FT FNDVHCLKSLYCSLVRPILEFSSPVWSPIYLNGAERIESVQRRFIRFALRR FT LPWRDPYRLPSYESRCLLIDIETLRTRRDISKAMFAADVLQNRIDCPAILH FT EIDLNVQPRALRNNLMLRPPLRRTNYGMNTAIDGIQRTFNRVSSKFDYHMP FT RTTLRRLFSDVLTEVREI" XX SQ Sequence 4894 BP; 1394 A; 1218 C; 1013 G; 1266 T; 3 other; tgctgtttga tcatcgcgac tgtgggctgg ttttgcaaat cattttcgag cgtaacttgt 60 gttaagagcc ggagaaaaaa tgtctctcgt atgttactca tgtgctatcg aaattggtga 120 tgtacaagtc gagtgtcaag gcttctgtaa tgcaattttt catccccgtt gctgtggaat 180 ccgtgcggat gtgtatgaag aagtgatgag aaatcatcaa gtgttttggt tttgcccgtc 240 atgtaccacg ctaatgaaaa aagtgcgttt cagaaataca gcccgcgccg cttacgaagc 300 gggtcaaagt caagctatca attctcatag tgacactatg cagaatctaa aatctgagat 360 tatggaagct gaaatccgaa ccactttcgc taaactgatc aactcaagct cgtgcacacc 420 aawatcctcg aaacgtgttg gcattgatgc aaggttcaca agaagtcgaa gattattcag 480 cacagtaagg gacamtaaat caaaccaaca gccaccactc ttacatggaa ccggtagcac 540 actctcgccg tcgaatgaaa ttgcaactgt tccaccggtt caaccgaaat tctggttata 600 cttgtcgcgg atcgcaaaag acgtttcaac cgatcaaatc tgtgctttcg ccaaaaaacg 660 cctcggatca gaagatatac aagtcacgag actagttgcc aaaggaaggg wcataagtac 720 attgtctttt gtttccttca aaattggtat ggatattgaa ttgaaaccca aagcactctc 780 atcctcttca tggccgaaag gcattctcta ccgagagttc actgacaaat ccagtgaaaa 840 tttttggcga ccaacactaa caaccccatc cgacgatccg ttaaacctgc ctacggaaga 900 agttgtaata atggagtaac caatcatcta atccatcttc gctgcacatc gggacgcaaa 960 ttcttcgcgg gcactaagga agtccctatc cctctcatcg cagtcgagcc cctcctgcca 1020 gcgaccatca gtcgtcccgg tcctgcgttt gagttgggag atgggatctt ccgcacccct 1080 accacgggca agtacaatta tgtttccgaa cacttcacct cctgaaccat atgccgcttt 1140 cagtcgaccc atatccagcc ggccacatgc aacctctgac atttacatcc cgggacgcac 1200 gcttgccaca agtatgaagg aaccccctat tcctctcatc gcagtcgagc ccatcccgcc 1260 agcgtcctac agtcgtcccg gtcctgcgct tgagttggaa aacggggtgt tccgaacccc 1320 aactacaggc aagtacgttg ttttcacgaa ctttccgtcc cctgaatcga ttactattcc 1380 cagcgattat tcatcgataa ggctgccgac tcggactctt ggtcgcaaaa ccgtcatcag 1440 ctccaagata gcccctactg ttctcaacgc agtcaagcct ctcctatgcc ccgcgatcaa 1500 cagccatgct gaaacggaag acgcggttct cagttcttct acaaaaaaac gacatgagtc 1560 aaacctgtca aatcggcctc gctcaaccca cgtagaccac tcaccgggac gcattcttgc 1620 cgccagtatt gaggaaaccc ctatctcact cagcacagtc gagccattcc tgccagcgac 1680 cagcagtcgt cccggtcctg tgcctgagtc gggagacggg gctttccgaa aaccgctcct 1740 aggcaagtac agtgaatttg tggaattttc tgcacctgaa agtaactcca tttccagcga 1800 atccacgtta gcccaacgac agtataacgt ttcgcaacag gaacaaagga gcaaagccga 1860 taaactccta ctctactacc agaacgtcgg cggcatgaac tcgactattg aagactacag 1920 attagctgtt gcagacaact gctacgacgt tatcgctctt accgaaacat ggctcgactc 1980 acggacatta tctaaccagg tatttgggca cgattacgaa gtttttcgct gtgatcgtaa 2040 tcccaacaat agtaagaaat ccgtcggtgg aggcgtagcc atcgctgtaa acaagaagct 2100 gaaatgcaga cctattgaga gtgaaacatg gaaatgcgtc gagcagacat gggttgccat 2160 caacctcggt gaatgcgtcg tttatctgtg cgtcgtttac ctccctcctg atcgtatacg 2220 cgatatctct atgatcgatg ctcatactca atcaatctgg catgttatgt ctgaaatgtc 2280 tgcaactgat gagattattg tgctgggaga cttcaactta ccaggaatag cgtggaaggc 2340 gtcccatagt ggtttcttgt atcctgaccc ggagcgatca gtgattcatc caaatgccat 2400 gaatttactt gacagctaca gtacagcgac cctgaagcaa atttgtgatg taatcaacga 2460 aaacagccgc actttagatc tctgctttgt gagcgtccaa gacgcagctc caatagtgtc 2520 cattgctcct gctcctctcg ttaaatttgt tcctcatcat agacctataa ttttaacgct 2580 taatcaacgg ctcagcagtt cagtaaaaat ccacaaatca caagtatact acgattttaa 2640 aaacaccgat gacgaaggaa tggccaactt tctcctgacc atagattggg acgtttgtct 2700 cgaccgtaac gacattcaat ctgcggccca aacgttttcc aatataatgc tgtacgctat 2760 tgacagatac gtcccaaaaa aagtgatacg ttgcgagaaa caaccatggc agtcaaacga 2820 acttaaacga ctgaaaacag cgaaaagagc agcattgaag aagtacacga aacgtcgcaa 2880 ccaatcttcc aaaactaatt acgtgcggct caacaataaa tacaagagag tcagtaaacg 2940 ttgttatcta aactatcaga ataaggtaca aaggaatctg aaacatcgac caaaatcgtt 3000 ctggaagtat gtgaacgagc aacgcaaaga atctgacctt ccatcgtcgc tagaactgaa 3060 tggtgaaaca gcatgcaaag ttgacgatat ctgtcgactg tttgccgtta aattctcaag 3120 cgttttcgcc gacgaacaac tatcaacaga acgaatttcg ctcgccgcaa gaaacgttcc 3180 aatgcaagaa cagtcactcg cagttttcac cgtggacgac accatgatca tccgagcagc 3240 acgtcaactg aagtctacat gtaaacctgg gcctgatgga ataccggcga ttcttctgaa 3300 aaataacatc agctgtttgt tgtctcctct tcgccacttg ttccagacat cgctagcgaa 3360 cggcgtcttt ccatattgct ggaaattcgc tcatatgtac ccagtgtaca agaaaggtcg 3420 ccgatatgat atcaacaact accgtggtat aacatcatta tgcgcagtat caaaattgtt 3480 cgaacttgtc gttatggagc cacttttttc tcattgtaaa cactatttga gcgaagaaca 3540 acacggtttt atgcctggcc gatcgacaac cacaaatttg ttgtgtcttt cgtcccatat 3600 cactacaagt ttggcagaac gaaatcaaac agacgttatc tacaccgacc tatccgccgc 3660 tttcgacaag ataaaccacg ctataatgat cgctaagttt cagaagcttg gattacacgg 3720 ttcgtttctg agatggattg actcctatct caccgatcga cggatgctcg tatccatagg 3780 ggattctcag tcggatgttt tcgtaggaac ttctggaatt ccacaaggaa gccatctcgg 3840 acctctattt ttctcgatat acttcaatga cgtgaaccat gtgttgaaag ggcctaagct 3900 ctcgttcgca gacgacctga aaatatttca gcaaatccgc acgaccgccg atgcaattac 3960 gctgcaaaat gagttagatc gtttcgcatg gtggtgcgat ctaaaccgga tgctagtgaa 4020 cccaaacaag tgctcggtga tttctttctc caggaaaaaa gagcctgttc gtttcaagta 4080 caccttttct gatgtggaga tatgccgggt taaccatgtg aaagatttag gagtaatcat 4140 ggattcccag tttaccttca aacagcatgt agcgtacgtc gtcgataagg catcgagaac 4200 attaggcttc attatgcgat tatgtaaaaa ttttaacgat gttcactgtt tgaaatcgct 4260 gtactgttcg ttagtgcgtc ctattcttga attcagttca cctgtatgga gtccaattta 4320 ccttaatggt gcggaacgca tcgaatcggt tcaacggagg ttcattcgat tcgcactacg 4380 cagattacca tggcgagatc cttatcgttt gccgagctat gaaagccgtt gcttgttgat 4440 cgacatcgag acccttcgaa cacgaagaga catctcgaaa gcgatgttcg ccgccgacgt 4500 tctacaaaac agaatagact gtccagcgat tctccacgaa attgacctca acgttcaacc 4560 tcgagcactt cggaataatt taatgctacg acctccattg cgcagaacga actacgggat 4620 gaacaccgct atcgatggaa ttcaaaggac ttttaaccgt gtctcatcta agttcgatta 4680 ccacatgccc cggactacac tacgccgcct attctcagat gttctgactg aagtcagaga 4740 aatttagtaa atcatgctct ctgtttatcg ttacgttcat tttaattgtt gacaagcaat 4800 gtatgtttct tttaagtatt ttaatattag ggcagaacca ttggggccac tgtcggcctg 4860 ttggtatatc taaataaata aataaataaa taaa 4894 // ID Mariner-10_BM repbase; DNA; INV; 1978 BP. XX AC . XX DT 28-APR-2010 (Rel. 15.07, Created) DT 28-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-10_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1978 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 945-945 (2010). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 420..1481 FT /product="Mariner-10_BM_1p" FT /translation="MEEYSSQHIADIHFVYGLCNGNANRAVTEYARRFPTR FT RTPNAARTFIRIHLRLAENGIRRRANERTRVLSPQDEQEILRLITQDPSLS FT IRLVACRLNLTKWTVWRVLKREGLHPYHFRRVQEILEPDYTRREVFCSWIL FT RKTRSNPNFLKRIMWTDEAXFTRSGYTNHRNEHLWLQENPHAIRPSSFQHQ FT FSVNVWAGLINDVLVGPIILPETMNGPRFLNFLEXDFFDALLELPLAYRNR FT MILQLDGAPAHFALLVRNHLNAQYSPWVGRGGTIAWPPRSPYLTPLDFXLW FT GTMKQKVYVNVPNTREELIANIIRTGNELKEDRAMIQRSTQHVAIRATACL FT QRRGGHFEQFL" XX SQ Sequence 1978 BP; 626 A; 350 C; 361 G; 629 T; 12 other; taatggcagc cattttgttt ttttttactt cgcagctcag gcgcctgcac tgatttaatc 60 attgctatcg atgtagacgt cttaggaaga cgaaagaggt gtcatttgaa agctaacaat 120 gtagacaatc aatttatgat acattttatt caatatcttt ttcacaaaat aacaaaaaaa 180 gtattaataa aataaaaatt ataatttttt attgcatttt tttagttttt agtttttttc 240 atgaaaacta accctgtcaa gctatttttt ttattgtttt ctcataatac aggatatttt 300 cattaaaata atctttacat caaagccatg cctaagtaag atagttcaaa agttgtcatg 360 gtttaaatat gacacttaaa ttttgttttt ggactaactg ttgattgtcg tgcattanaa 420 tggaggagta ctcaagccag cacatcgctg acatccattt tgtgtatgga ttgtgtaacg 480 ggaatgcaaa tagagctgtt acagaatatg cacgcagatt tcccactagg cgaacaccga 540 atgctgctcg cactttcatt agaattcatt tgcgacttgc cgaaaatgga ataagaagac 600 gcgctaacga acgtactcgt gttttgtcgc cccaagatga gcaagaaata ttgcgactaa 660 ttactcaaga ccctagcctg agcatcaggc tcgtagcctg tcggctaaat ttaactaaat 720 ggacggtgtg gagagtttta aaaagagaag gactgcatcc ttaccatttt cgtagggtac 780 aagaaatact ggaacctgac tatacaagaa gagaagtttt ttgttcgtgg attttaagaa 840 aaacgcgatc taatccaaat tttttaaaaa gaattatgtg gaccgatgag gccnctttca 900 caagatcagg gtatacgaac caccgcaatg aacatctgtg gcttcaagaa aatccacatg 960 caatccgacc aagttctttt cagcaccaat tttcggtcaa tgtttgggca ggactgatta 1020 acgatgtnct tgttggacca atcattttac cagaaacaat gaatggtcca aggttcctga 1080 atttcctgga ancggacttt tttgatgctc tattggagtt accattggcc tacagaaaca 1140 gaatgattct gcaactggat ggtgctcccg cacattttgc actacttgtg cgaaatcatt 1200 taaacgcgca gtattcgccg tgggttgggc gcggtggaac aatcgcatgg ccgccacgct 1260 cgccttattt aacgcccctc gatttttntc tntggggcac aatgaaacaa aaagtatacg 1320 taaacgtgcc taatacaaga gaagagttaa tcgcaaacat cattcgaaca gggaatgaat 1380 taaaagaaga cagggcgatg atccagcgct ctacgcaaca tgtcgcaata agagcaacag 1440 catgtctgca gaggcgagga ggtcacttcg aacagtttct gtaacttttt tgccaaatta 1500 aatgcaatcc tatcaataat tattttttta tttaatttct caggaaaaac atgctttcga 1560 agtggctatg tttatactgt catagctcct aaactatcta gcttagagnt aagattttaa 1620 tgtgcatttt tggcaaatat cctgtattat tcaaaataaa taaaaaacgg ataggttagc 1680 acattgtttt tttttatagt taacaatctg aaataaaaaa caatanatga aaaaaatcaa 1740 tattttcgta aatttntttt tnatctttta ctcaagtntt ttgtcttgtt taataataag 1800 aatagctaac aaagaagcca aaaaaaatag atttaataat ctattttttt tcggcttctt 1860 tgttagctaa catatgggac ctcttttgtg tttctaacac ctntacttgg gtaggtattg 1920 ttaaattaac acatgcacct gtgcttcggc tacaaaagtt tttaaagggc tgccatta 1978 // ID Gypsy-44_AA-I repbase; DNA; INV; 5799 BP. XX AC supercont1.385; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-44_AA_; KW Gypsy-44_AA-LTR; Gypsy-44_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5799 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.385; Positions 222280 216482. XX CC Positions [4621-5127] - Integrase core CC 'ATAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 816..1958 FT /product="Gypsy-44_AA-I_2p" FT /translation="MDITYLEQQIAYLRVNLNNLKKCPNREYLSATLTKKK FT QFCQETYDRAIVLLEDLNQKISKEDSEYYKNIIVRIYSDICNLIEEKLKKL FT EMPPKLDLGLALKIVKPFDGSASNLQQYVESVNLLKDYATDVSEADILKFI FT KITLTGAAHGSIDSAQSINDAFTELKNKFAVKLTPRAVENEMLSKRQHNAS FT ITDYGAEIEKLATKLAAAHVSQGTFTSEAAASNIVESVAVRAFVDGLQNPT FT TQFLLRARNPTSLNKAVSDALECNPGTSKPKNEMALWCNSQYYPRRGSNNN FT WRGRASGYRRGRGFSRGRGNFHRGNFQQYNNNGNRGNNHNNNNNHNNETNN FT NRNRHSNAHANVAEQNNNRRQPNEEEAEVNVVAELFRE" FT CDS 2644..5568 FT /product="Gypsy-44_AA-I_1p" FT /translation="MTKLQTKEKQTISKICLKYADIFCLKDDKLTTTEIFT FT PTISLKKDAKPIYSKQYRLLQSQKQEIDKQIGKMLSDGIIEKTRSEWNSPI FT LLVPKKSADDSKKWRLVVDYRKLNNVLEDDKFPLPNIEEVIDSLAGAKYFT FT HLDLSQGYYQCQLKAEERPVTAFSTSTGQYQMTRLPMGLKTSPSTFSRLMT FT IAMSGLNGAQCLVYLDDLIIFGRTFYEHNRNLIKIFERLREVNLKLNPAKC FT NFLQEELVYLGHYISADGIKPDPAKVEAIKNWPRPINADEVKRFVAFANYY FT RKHINNFAFLCAPLNFLTRKNIQFVWTNECETSFKELKERFMNPPVLDFPN FT LSNDNTFTLHTDASGYAIGAVLSNGNGRPVAYASKMLNQAEKNYPTIEKEL FT LAMVWGIRHFRPYLWGKKFLVYTDHRPLVYLFSLTDPSSRLTKFRLALEEY FT NFEVIYKRGTENVIADALSRISIQELKSLSEKTCLITTRSMERNKKENSSV FT KTSEDRTDHPSSPALDSIELEIDKKSKDIKFSKTKIVIPLKTTSIHLRGMM FT NKIIEFTKKENIKALIIKDNVATTREVIKNINAQELDGMPKIYIIGKHIKE FT VQDEKEKLLIMNDFHILPTAGHAGIKRTLNTIKLRYYWESMSKDIEEFIKH FT CKQCQINKLSNTKIPMKITTTAESAFEKVYLDIVGPLIKSDGFEYILTTQC FT ELSKFITATPIPDKTTLTVANAFVTSVILKYGVPKTIVSDRGTEFMSELFT FT SVANILGIEKLNSTAYHHQTIGALENSHKVLGTYLRIQCDGKLFSWSEWLP FT FYEFSYNNTVHSSTGHTPFYLVYGKLSNVPSNLTEKSEEQNYNLDEYTTIL FT KIKLRHAHEAARKQLLNTKIQRKTNYDENKKLVNFKAGELVLIKNETSKKL FT QAKYKGPFKVIDDLGENIQVKINKKLDTVHKDRIKRYLGHVVDKKNKNDKN FT GKEEILEAENKSEEEEHVKN" XX SQ Sequence 5799 BP; 2203 A; 1019 C; 1053 G; 1524 T; 0 other; tggcgatcct gccaggagca gtgaaattga tattgcatgg agactaagtg ccaggagcag 60 cgaaagtaaa agtgtaaaag ttcagtgcca acattttccc cccgtgtaaa gcattatttc 120 tgcaccatgg gctggttctc gtcagatgag aacgtggcca taagcaacac ggacaatact 180 ccgttaacta tttcaatttg tgtgatggct atagccatta ttggatacat gttggtgaaa 240 ggactagtgc tgcttcataa gcgaagcacc gaacgattgg cggaacgaat cgcgcggcgt 300 accgtcgccg gagtgtaaat tattttcaat ttgcaacatt gaacaatcaa aacggaaaat 360 ttagcaacag taacgaagaa aattttatgc ccacaactaa gtgacaaagt gacaaaatgt 420 gcgaacgaaa agtacaagaa aaactatact cgctgtgctt tatccagtta gtggaaatac 480 ggtcaacact ggaaaggaaa gtgcaggaag ggttctcgaa ggaagaaatt cgagctcaac 540 tcgaaatcgt aaacgctctt tttgggctgg ccgccggagc agctgtaaga ctggaaatgg 600 aaccaaggga cagtgacatc atcgtacgat tggtcgacgg catcgcagat atgctgaact 660 ggaatgcatc atcccagcat tagcagacaa caaggcgtta attcctttgc cttattagtg 720 aggtacatgc gctatcctgc taacatgccg taaggaaata ataataataa aaaaaaaaaa 780 cccaatgtga gtaacaaaaa ataatatctt tattaatgga tataacctat ctagagcaac 840 aaatagctta ccttagagtt aacctaaata atttaaagaa atgccccaat agagagtatt 900 taagtgcaac gttaaccaag aagaaacaat tttgtcaaga aacgtacgat agggctattg 960 tgttattaga agacctgaat caaaagatat ccaaggaaga ttcagaatat tacaaaaata 1020 ttattgtaag gatttattcc gacatttgta atttgattga ggaaaaatta aagaagttag 1080 aaatgcctcc gaaactagat ttgggtcttg cgctaaaaat tgtcaagcca tttgatggct 1140 cggcgagcaa tttacagcaa tacgtggaaa gcgttaatct cctaaaagat tacgcaaccg 1200 atgtttccga agcagatatc ttaaaattta taaaaataac tttgacagga gctgcgcatg 1260 gctccatcga ctcagcacag tcgataaatg atgcttttac tgaactcaag aataagtttg 1320 ctgtaaagct aactccccga gcagtagaaa atgagatgtt atcaaagcgt caacataatg 1380 cttctatcac tgattacgga gcagaaatcg aaaagttagc aactaagctt gctgctgctc 1440 atgtatcgca aggtacattt acttcggaag ctgctgcttc aaatattgta gagtccgtag 1500 cagtacgagc tttcgtggat ggtcttcaaa atccaaccac acaatttcta ttgagagctc 1560 gaaaccctac aagcctaaac aaagccgtat ctgacgcctt agagtgcaat ccaggtacgt 1620 ctaaaccaaa aaatgaaatg gcactatggt gcaattctca atattatccc aggagaggga 1680 gtaacaataa ctggagaggc cgtgcgtcag gatatagacg tggccgcggt ttctccagag 1740 gacgcggaaa tttccaccga ggaaatttcc aacagtataa caataatgga aacagaggca 1800 acaaccataa taataacaac aaccataaca acgaaaccaa taacaatcgc aacagacata 1860 gcaatgctca cgcgaatgta gctgaacaaa acaataatag gagacaaccg aatgaagagg 1920 aagcagaagt taatgtagtt gccgagcttt ttcgtgagta attcgaacaa tggactacta 1980 cgaataccat ttaatctatt gaataactct ataaatttta ttatcgatag tggagcaagt 2040 tgttcgataa tatcaaaaca acttgtacca gataacatac acatttatag aactaatata 2100 ttggaagtta aaggcataaa tggaacgtcc cgttccattg gaacgattaa tgttaatata 2160 aaatacagac aattcaaatt tccaattaca ttacatgtta tcgaagaatt gccaaaaaac 2220 attccagcgt taattggaac agatttctta cgacgaacta acgcggtcat caattttaac 2280 tctttaactt tagaattgaa taacaatgcc gaaacaatag tgattccctt tatatcgaac 2340 aatcgcgcgt ttgtaacagt acctgcgcgt tcagagatag ttacacatgt aataagtaaa 2400 tataaggatg actgtgtggt tttgaaccaa cagatcacat cttcagtttt tgtagctaac 2460 tcattggcca caccaaacca aggtaaaata ccagtgagat tagtaaatat aggagacaac 2520 ccaatatgta tcgaatcgtt aaaagttgat atcaaaccat tgaaaaatta taaccagatc 2580 ggaaaaatca gttttcccaa gtatgacaca aaaagagcaa ctcaattgct caaagaactt 2640 aacatgacaa agttgcaaac taaagaaaag caaacgataa gtaaaatttg tttgaaatat 2700 gctgacatct tttgtttaaa agatgataag cttacaacaa cagagatatt tacgccaaca 2760 attagtttga aaaaggatgc gaagccaata tattcaaaac aatacaggct actacaatcc 2820 caaaaacagg aaattgacaa acagattgga aaaatgttat cagacggaat tattgaaaaa 2880 acccgttctg aatggaatag tccgatttta ctcgtgccta aaaaatcggc tgatgattct 2940 aaaaaatgga gactagtcgt tgattaccga aaactgaata acgtactaga ggatgacaag 3000 tttccacttc caaacattga agaagtgatt gactctttag ccggagcaaa atatttcaca 3060 catctagatt tatcgcaggg atattatcaa tgtcaattaa aagctgaaga aagacctgtg 3120 acagctttct cgacgtcaac tggacaatac cagatgacta gattaccgat gggattaaaa 3180 acaagcccat caaccttttc aagactaatg acaatagcaa tgtcgggtct caacggagca 3240 caatgtttgg tatacctaga tgaccttatc atttttggta ggacatttta tgaacataat 3300 agaaatttga taaaaatctt tgaaagatta cgggaagtta acttaaagct aaacccagcc 3360 aagtgtaact ttctacagga agaattggta tatttgggtc attacatttc agcagatggt 3420 atcaaaccgg atcctgctaa agttgaagca ataaaaaatt ggccaagacc tatcaatgca 3480 gacgaggtaa aaagattcgt tgcatttgca aattactata gaaagcacat taacaatttt 3540 gctttcttgt gtgcaccttt aaattttctt accaggaaaa atattcaatt tgtctggact 3600 aatgagtgcg agacatcatt caaggaactt aaagaaagat ttatgaaccc acctgtttta 3660 gattttccaa atttgtctaa cgataacaca ttcactctgc acacagatgc atctggatat 3720 gctataggag ctgttctttc caatggaaat ggcagaccag tggcatacgc cagtaaaatg 3780 cttaaccaag cagagaaaaa ctatccaact atcgagaaag aactattggc aatggtatgg 3840 ggtatacgtc attttcggcc ttacctttgg ggtaaaaagt ttttggtgta tactgaccat 3900 aggcctttag tttatttatt ctctcttact gacccttcga gccgactaac aaaatttagg 3960 ttggctttag aagagtataa ttttgaagtg atttataaac gaggaacgga aaatgtgata 4020 gcggacgcgt tgtcgcgaat ttcgatccaa gaacttaaat cactcagtga aaaaacttgc 4080 ttaataacga cacgttccat ggaacgaaat aaaaaggaaa attcatctgt aaagactagt 4140 gaagatagga ctgatcaccc ttcatcacct gccttagaca gtatagaact agaaattgac 4200 aagaaatcga aggacataaa gttttccaaa acaaaaatag taattccatt gaaaacaaca 4260 tcaattcacc tacggggaat gatgaacaaa atcattgaat ttactaaaaa agaaaacata 4320 aaagcattga ttataaaaga caacgttgca acgacaagag aagtaattaa gaatatcaat 4380 gcacaagaat tggatggcat gccaaaaata tatattattg gaaaacacat caaagaagtc 4440 caggacgaaa aagagaaatt acttataatg aatgatttcc acatattgcc aactgcaggg 4500 catgctggaa tcaaaagaac actgaatacc atcaaattaa gatattattg ggaaagtatg 4560 agtaaagata ttgaggaatt cataaagcat tgcaagcaat gtcaaataaa taaacttagc 4620 aatacaaaaa ttcctatgaa aattacaaca acagcagaat cggcgtttga aaaagtatat 4680 ctggacattg ttggtccatt aataaaatca gatggatttg agtacatctt aacaacacaa 4740 tgcgaactat caaaatttat aacagcgaca ccaatccctg ataagaccac actgaccgtt 4800 gcaaatgcct ttgtaaccag tgttatttta aagtatggag tgccaaaaac aattgtctct 4860 gacagaggaa cggagttcat gtcagaactg tttacatccg tagcaaatat tttgggaata 4920 gaaaaactaa attcaaccgc ctaccaccac caaaccatag gagccttgga aaactcacat 4980 aaagtactgg gaacatattt gagaattcaa tgcgatggta aactattcag ttggtctgaa 5040 tggttgccat tctatgaatt ctcctataat aatacagtgc attcatcaac tggacataca 5100 cctttttatt tagtctatgg gaaactttct aatgtaccgt ccaatttaac agaaaaatca 5160 gaagaacaaa attataattt agacgaatac accacaattc ttaaaattaa acttagacac 5220 gctcacgaag ccgctaggaa acaattgtta aacacaaaaa tacaaagaaa aactaattat 5280 gacgaaaaca agaaacttgt aaattttaaa gcaggagaac ttgtcctcat caagaacgag 5340 acatccaaaa aacttcaagc gaaatataaa ggtccattca aagtcataga tgatttagga 5400 gaaaatatcc aagtaaaaat aaataaaaag ttggacactg ttcataaaga tagaataaaa 5460 cgatacttag gacatgtggt ggataagaaa aataaaaacg ataaaaatgg taaagaagaa 5520 atattggaag cagaaaacaa aagtgaggaa gaagaacacg tgaaaaatta ataattcatg 5580 atctaaacat actacttact tttataaata aattgtacat attgacataa ccagcataat 5640 ttgtaagaat acttcaaaca tttatttgca tacactatac attttatatt gtactactgt 5700 tctattacat acatattact tttatattac ttctactatc ttatcttata aaaaattgta 5760 agttttgaac ttaaattttt tatgtaatta attagggcg 5799 // ID P1_Cis repbase; DNA; INV; 4062 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE P DNA transposon from Ciona savignyi. XX KW P; DNA transposon; Transposable Element; P1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-4062 RA Smit A.F.; RT "P1_Cis - P DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000947, Ci000957, Ci000859. 8 bp target site duplications. CC ORF1 at pos. 598-1122 encodes a protein closest related to human CC PP238 (AF258556_1). It also matches the CC Zaphod-transposon-derived pP52rIPK, but similarities are outside CC the Zaphod-like region. ORF2, at pos 1076-3487 encodes a classic CC P element transposase. This is the first coding P element found CC outside insects (outside Diptera even). XX SQ Sequence 4062 BP; 1454 A; 745 C; 688 G; 1170 T; 5 other; catagagata ctaatataca ctagagtggc ctttgttaaa taattgataa tgctgcattg 60 ttgtaaaatg aatgcatttg ccgtgcaatg gctgtgttgc catacaaata atggttttat 120 ttttttataa ttggtattta actgaacgta cttttgagcg gagttgtaag aaaaataaat 180 attatttact accaaacttt atttccnata tatttaagga gcaatattta ttaaactttg 240 gcattttaat gcatcatatt tattaaataa aattgcatgc gcaacactgg atcagacgac 300 attcgagtcg tcaaccattt cttcgtgcga aaaagtccaa agaaacgttt aaagttgtag 360 cctactttaa tacctacgta ccacggggtc agacactctc tttgaaatat acaccggata 420 tatatatcca atgttactgt atttgacgca aaanggcact ttacgttgcc acgttacatt 480 atataactgt accaacagca gcgcgtcgga ccgtcagcag gaatggtgca agatcgaaac 540 ctgccgatgc taattgcgtg taacccaaca atgggcgcgt gggtccatta aataagagta 600 tcctatcccc gttaaactta agctcacact tttactttta gaaaataata ccagtacggt 660 tagtggatat ggttatttct tgttctgcgt ggggctgcac aaaacgtcaa atcaagactg 720 caacgtggag ttttcacagg tacattgttt aacgcataat aatatacaaa tataccacat 780 acacgaagtt aaactacatc actttaaatt cagatttcca aaaaataaag agcaactgga 840 acaatggctg gtagcactta aaagagagaa attcttccca acaaagtatt catatatatg 900 ctcggaacat tttgttgctg aagatttcaa tggtagagtg cgaaaacgac ttaagcctgg 960 ggcagtgcca tcagtattca gtttcccgga gcatcttcag tcagtaagtc attttttacc 1020 agtactgcag cctttgggca agagattagt gacgttagtt gtagtaagct taaatatttt 1080 ttccccagaa aaaagggaaa aaaagaaggg agctgcttga cagaaatcgg ctagctagtc 1140 gagctgtggt acctgaaaaa aaattcaaat ttatacagag tcataatgtt ttaaaagatc 1200 acaactactg taatactggt agtccaaaac aaaagataga aaacataaaa cgaaagttaa 1260 tcaaaataag aaagcaaaaa aaagtgttac ttgtgcagaa atgcagggct acaaaacaag 1320 ttaaaggttt aaaagagctg ttagtcaaag ccaaaaataa aaaatactta gaagacaagc 1380 catttgatat attgacctca aactttggag ataccgcgat agaactattt tgtacaaata 1440 tctcaaaaaa tagtggcaaa aaagtgaaat atacagaaaa aattaaaaaa tttgctatga 1500 cccttcatac atactcccca aaggcctacc aattcattcg aaaacaccta acattaccac 1560 acccaagaac tttgtcaaag tggttatctg tctttaaagg taatgctgga attactgcag 1620 aatccattga agccattaaa aacaagcaag ccaagataga ttacaaatta tattattcac 1680 tcatattgga tgaaatggca atcaaaaagc aagtggaatg gaatggaaag gagtacacag 1740 gatttataaa tttaggagaa aatttagatg acgatacact acctgttgca gcaaacgccc 1800 ttgtgtttat gctggttcca attaatgcaa actggaaaat tccaatagct tatttcttca 1860 ctgatggctt gtctgggaaa actcttgcag aattagtgcg gaaggtgttg tcctgcctgc 1920 atgacaataa catactagtt tgttcgttaa cctgtgatgg ctgtgggagc aaccagagta 1980 tgttaaatga attaggagta cctgtgagat atccaatcaa gcaaagttat tttcttcacc 2040 cagaaaatcc caatcaaaaa atctctgtct ttttagataa ttgtcatatg tttaagttga 2100 tgcggaatct tcttgctcaa aaaaagctgt tgcaaaattc cactacaggc caacaaataa 2160 aatgggatta tataaccaac ttgcacaata tacaaacaaa ggaaggtcta aagctggcta 2220 ataaattaaa acggaatcat attgagtacg gaacacaaaa aatgaaagtc agtcttgcgg 2280 tacaaacctt gagtgcttcc actggtaaag ctatccagtg tctaagacag cttggctatg 2340 cacagtttca aggtagtgca gaaactgaga acttcattat gcttactgac cgcctgtttg 2400 acgtaatgaa ctctaaaaac ccgctgggaa aaggcaacaa agcacctctg aaaaaatcga 2460 ataaacaaat ttggttacca tttttaagca cagcaaaaga atatttaagt aacctcatgt 2520 gtgacggcaa ttatctgtac aaaagccagc gcaaaacagc cattgttggc tacctgttaa 2580 atattgaatc ccttaatcaa ctatttgata cacttgttga aagcgggaaa ttacaatatc 2640 tcttaactta caagttcagc caagactttt tggaattatt ttttgggggc attagagggg 2700 ccaatggttg gaacaacaac cctacatgcc agcagtttaa atccgcctac agaaagctgc 2760 taagtttaac tgatgacatg ttagtaacag ccggaaattg ttcccaatca gcagagttta 2820 ccacaagcct ttctatttta gcaccaattc aaatggcatc gcctaatatc gatattgaac 2880 ggaaatacaa ccttgattca gacatatcag accaaaatgc aactgatcat acctatctgc 2940 aaacatgtta cttctcgact ggtacatatt tgtcaccgat gatagaaact atagtgatgt 3000 atatatcagg atttgttgca aaaaaattaa tacaaaagtt actttgcaac atttgcatca 3060 actcactgta caccacagta cctccagaac atgaccaaag gttcatactc atctctatca 3120 aagacaacaa tggtttaact tacccatcat cagatttgac taaagtaaca atgacaactg 3180 aaaaagtttt tcggagatat attaaaacaa ctggacatcc cacaaatatc aagggtttag 3240 agggtgcaat ttgctccagt gttctnacac acctgcagag tgaaaagttt ttatttaatt 3300 ccttggaaac ccacatgttc gacacagatc caacggataa ccacaggata acactcatca 3360 aaaaaataat taaactatac cttacaataa gatttcatca cgaagcaaag gtgtacacat 3420 ccaaaatcag aggacgcaat caacgacaac aattccataa agccattcta tttaaacacc 3480 aataatatta aactggaaca tcaattttac tgaaataata tcaactaaat taccatagtt 3540 tatttttaga atttcgtttt agtttttatt ttgtctatat catacttatt acggtactct 3600 aataccctat attatattta taaatgcatt ttacagtaaa tatatacttt ttattttaat 3660 atttaatgta gtttttaacc agttttagtt tataattgtt gtgcatttga aataaactaa 3720 aataactgat ctaaaaacaa gttgtcggtg aagtacgaaa gcctatgcaa aagtgcgtac 3780 cggttctccg gtctagctta ctctaaggtg aattaaaaaa gctttttttt atattaaata 3840 ggtttaaagc tgaattaaca actgatatat atcagcaaag catgtatatt aattagaaaa 3900 caaattcact gcacatttgt taactattta acatacctgg ncaaatccgc accaaacacg 3960 tgtcggaact cttgccgtgc aaagcttcaa aatggcggcc accatggnga caacaattac 4020 gcaacagtaa tgccgcctct agtgtatatt agtgtctcta tg 4062 // ID Gypsy-149_AA-LTR repbase; DNA; INV; 239 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-149_AA_; KW Gypsy-149_AA-I; Gypsy-149_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-239 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1024-1024 (2011). XX DR [2] (Consensus) XX SQ Sequence 239 BP; 73 A; 51 C; 46 G; 69 T; 0 other; tgttacacct accattatta gaactacctt gctctttact tttgacccaa atacccccga 60 tattgagtag ttgcgcttca gtgtgtagcc agaagaaaaa ggaaaccgtt agcgaaaaca 120 cgcgtaaaat catgtccgcc gtcgcggttg taaaacagat taatgtaatt aaagtggtga 180 aataaataga attcgtcgat tagctgttgt gttcatcatt tctctcggcc gtcacaaca 239 // ID Gypsy-122_AA-LTR repbase; DNA; INV; 232 BP. XX AC supercont1.327; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-122_AA_; KW Gypsy-122_AA-I; Gypsy-122_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-232 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.327; Positions 36504 36735. XX SQ Sequence 232 BP; 90 A; 42 C; 36 G; 64 T; 0 other; tgttaagtat tcccggttcg gtaaatccgc agatacctca acggattctg attgaactac 60 cacgagcaaa ctgttcagta ctgtaaatgt atgtgacacc aatgtgtata aaaaggaagc 120 ccaaataaac caaatagtca gttcttaaca gaaatcctaa aagttaaatt aattcttata 180 ataagaaacg tctcaaaata tttaataaag ttcttgttca cagaacggaa ca 232 // ID Penelope1_Dw repbase; DNA; INV; 2789 BP. XX AC . XX DT 25-JUL-2007 (Rel. 12.07, Created) DT 25-JUL-2007 (Rel. 12.07, Last updated, Version 1) XX DE Penelope1 retrotransposon from D. willistoni - a consensus DE sequence. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW retrotransposon; Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Penelope1_Dw. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2789 RA Arkhipova I.R.; RT "Penelope retrotransposons from Drosophila willistoni."; RL Repbase Reports 7(7), 628-628 (2007). XX DR [1] (Consensus) XX CC Penelope1_Dw from Drosophila willistoni is 68% identical to CC Penelope from D. virilis (U49102), and 70% identical to CC Penelope2_Dw, another D. willistoni Penelope element. There are CC about 60 copies of Penelope1_Dw in the sequenced part of the D. CC willistoni genome, most of which differ from the consensus by CC 1-5% but can differ as much as 10-15%. Most of the copies are CC 5'-truncated, and only one appears to be fully intact CC (AAQB01006297) and is arranged in a characteristic CC partial-tandem, forming "pseudo-LTRs". XX FH Key Location/Qualifiers FT CDS 32..2530 FT /product="Penelope1_Dw_1p" FT /translation="MTTTDGPPIATQTIGFAEIKLNYSNTKTTINRHQTTS FT KKLVRLKSSLKFLLKCRKSKLIPNFIKNTTRCDNIFKFDNNTFPDINKTLD FT RYTYFFHTKILNLLIKHKHNTLIRINNKLAETTKTLGKQLDEQSMSAFLER FT EQNAVKRLTKTLREHHEMKHEKLNNKRNFALLNNNRTDDWFVNETELEFPP FT DIKSLLSKGPKFALPIEHEKLPLIKYIADGEEIVQTIKEKEKQEAARTAFS FT LMVKEHRAMKYINATDRAILNITQRTYKFLKQHDDILILTSDKGNKTVAMK FT KIDYDHRMMDILSDLNTYRILRKDPTTRLQTKNNNLVDKLHKMGVISKIEK FT NRMTTTTAISPRIYGLPKIHKEGTPLRPICSSIGSPSYGLCKYIIQILRNI FT TLDSKYNIKNATEFKQRINNTYICDDEQLISFDVVSLFPSIPTDLALDTIR FT NKWTKIQEYTKIPKALFMEIVKFCIQENRYFTYKDQTFIQLKGMPMGSPAS FT PVIADIVMENLLDTTMDKLTRPPRLLTKYVDDIFSIIHQDDIDKTLDTLNS FT FDRNIKFTMELEDNGKLAYLDSMVIRRGNELKLKWYRKPTASGRIINFNSK FT HPKTMIINTARSCIQRMLNISDKIYHEETKMEIINILKDNDFPDNVIRTLF FT KKHKSTIDKTKNEKTYISVTYVPKLSERLSHSDCYDKDGIKIAHKPDNTLK FT HVFNKTKSKIEMTNRSNVIYKIPCNGQGNIPCNKIYIGTTKSKLKTRISQH FT KSDYKMRHYTDTPKTALMTHCATSGHAPNFEETTILDEERHYNKRFTMEML FT HIINTPSATRLNFKTDTDNCAHIYRHLLDKN" XX SQ Sequence 2789 BP; 1161 A; 522 C; 447 G; 659 T; 0 other; taacaacaca aaggtcgcaa aaactaccac aatgacaaca acggatggcc cgccgatcgc 60 aacccaaacc attggatttg cggaaataaa acttaactac agcaacacaa aaacaacaat 120 taatagacat cagactacct ccaagaaatt agtaagactg aaaagtagtt taaaatttct 180 gttaaaatgt agaaaatcta aattaattcc caattttatc aaaaatacca cacgatgtga 240 taatattttt aagtttgaca ataacacatt tccagacatt aacaaaacat tggacagata 300 tacatatttc tttcacacaa aaatattaaa ccttcttata aaacataaac ataacactct 360 cataagaatc aacaacaaat tagctgagac aacaaaaact cttggaaaac agctggacga 420 acagagcatg agcgcatttt tggagagaga acaaaacgca gtaaaaagac taacaaaaac 480 attgcgggaa catcatgaaa tgaaacatga gaaactaaac aacaaacgga actttgctct 540 cttgaacaac aacagaactg acgattggtt tgtaaatgag acagaattgg aattcccgcc 600 ggatataaaa tcattacttt caaagggtcc aaagtttgct cttcccatag aacacgagaa 660 gctccctctc attaaataca ttgccgatgg agaagaaata gttcagacga ttaaggaaaa 720 agagaaacaa gaggcagcgc gtactgcatt ctctttaatg gttaaagagc atagagcgat 780 gaaatacata aatgcaacag atcgtgcaat attaaatata actcaacgga catacaaatt 840 tcttaaacaa catgacgata ttttgatatt aacatctgac aaaggaaata aaacggtagc 900 aatgaaaaag attgattatg accatagaat gatggacatt ttgtcggatc taaacaccta 960 tagaatcctc aggaaagacc caacaacacg attacagaca aagaacaaca atctggtgga 1020 caaactacat aagatggggg ttatctcaaa gattgaaaag aacagaatga cgacaactac 1080 ggcgatctct ccaagaatat atggactacc aaaaatacac aaagaaggga caccattgag 1140 accaatttgt tcgtcaattg gctcaccatc ctatggactt tgtaaataca ttatacaaat 1200 tttaaggaat ataacattgg actcgaaata taacataaaa aacgcaactg aattcaaaca 1260 acgcataaat aacacataca tttgtgacga tgaacaatta atttcgtttg acgtggtatc 1320 attatttcca agcataccaa cggatttagc attagacaca atcaggaaca aatggactaa 1380 aatacaggaa tacacaaaga taccaaaagc tttatttatg gaaatagtaa aattctgcat 1440 ccaagagaat agatatttta cctacaagga ccaaacattt atacaattaa aaggaatgcc 1500 gatgggatct ccggcctcac ctgtgattgc ggacatcgtt atggaaaatc ttctggacac 1560 tacaatggat aaactaacca gaccaccaag attattgacg aaatatgtag atgatatttt 1620 ttcaattata caccaagatg acatcgataa aacactggac accctaaatt catttgacag 1680 gaatataaaa ttcactatgg agctggaaga caatggaaaa ctggcgtatt tggactctat 1740 ggtcatcaga cgaggaaacg aattaaaact aaaatggtac aggaagccaa cagcatcagg 1800 acgaatcatc aatttcaact caaaacaccc taagacgatg ataataaata cagcaaggag 1860 ctgtatccaa cggatgctga acatttcgga taaaatatac catgaagaaa cgaagatgga 1920 aataataaat attttaaagg acaacgattt tccggataac gtaatcagga cattatttaa 1980 aaaacataaa tccaccattg acaaaactaa gaatgaaaaa acatatatat cggtcactta 2040 tgtacccaaa ttatcggaaa gactatcaca ttcggattgt tatgacaaag atggaataaa 2100 aatagcacat aaaccggaca ataccctcaa acatgtattc aacaaaacaa aatccaagat 2160 agagatgact aatagaagca atgtgatata caaaatccca tgcaatggac aaggaaacat 2220 accgtgcaat aagatataca tagggacgac aaaatcgaaa ttaaaaacta ggatttccca 2280 acacaaatct gactataaaa tgaggcatta cacagacaca ccaaaaacag cactaatgac 2340 acactgtgca actagtggcc atgcaccgaa tttcgaagag acgacaatat tagatgaaga 2400 gcggcactac aacaaaaggt tcaccatgga gatgctacat atcatcaata caccgagcgc 2460 cacacgactt aactttaaga ctgatactga caactgcgct cacatataca gacatttact 2520 tgacaagaat tagttcgtaa ctccacgtca aacggtgctg acgtgttaaa taatgtatgt 2580 tacaaaaaat tttgttaaat gtcacataat tgttaaatgt tcttgtattt tattgtagtt 2640 gccctgaaga cggttgccgc tgcgcaaccg aaatatatcg gaaaataaat aaacttaaaa 2700 acaatgtttt atttttattt ggaaaaatga cctaaagcct gatcaaaatt tttatataac 2760 aacacaaagg tcgcaaaaac taccacaat 2789 // ID BEL-88_AA-LTR repbase; DNA; INV; 420 BP. XX AC supercont1.2; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-88_AA_; KW BEL-88_AA-I; BEL-88_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-420 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2; Positions 4359093 4358674. XX SQ Sequence 420 BP; 153 A; 73 C; 69 G; 125 T; 0 other; tgtaaccaac aagcccctcg gttagagagc caaaactgag acccaaaaat caccaatacg 60 gcgaacgtca gaggagagag atgaattagt ttaacgtcac tttctaactc aatacaatac 120 tacctaaaac taaacgaagt acaattttat gaactagcca aattgaatta gttatcggct 180 gtaattgtaa gtttatttaa tttcttaact acttaaatct aaatttattg aattgtagat 240 tataaccagg aaattgattt aaccgttttt gttcgctgat aaatctgttc ggacactaaa 300 ttttaagtga tgcaatatgc taagagagcg tgaattaatt aattaataaa atttgtagct 360 tgagctgtcc agaattcaaa acgaacgact ttgcttcgag actgccgaac atcacgaaca 420 // ID Gypsy5-I_Dmoj repbase; DNA; INV; 4593 BP. XX AC scaffold_6390; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dmoj; KW Gypsy5-LTR_Dmoj; Gypsy5-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-4593 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1050-1050 (2009). XX DR Genome; scaffold_6390; Positions 61890 57298. XX CC Positions [1645-2070] - Reverse transcriptase CC Positions [3437-3736] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 367..3498 FT /product="Gypsy5-I_Dmoj_1p" FT /translation="MALSKSLDGDAAYWLSQISFAGMTWMQFKELFLQNFE FT GHETTAATIFNILNSGPNVDEGLASYGSRTVTSLVSKWKDVSIEEIAVSVV FT LAHAAKIEPKLQRTIFTTNIKSRNEMRNELSAFDFGKRKSNSVPENSASKN FT MRLQPISKCHFCGKQGHKVADCRARRIPGNSQQNSTQRDTRVLEKQRKSTI FT ICFKCGEAGHIQSACPRGSSALVKNRVEEKRVNICHVKEPLGNLIQSGESY FT PFYFDSGAECSLVREAVSRKFSGKRINNVVHLKGIGNNSISTMQILSNVII FT DEYSLEVVFHVVLNEYLNYDIILGREILCNGFSVTISNEKVELLKSKSICA FT VAVSDNLVDLTTVDSGVSGLDKVRLLAILKKYSNSFINGIPKSRVITGQLE FT IRLVNPNKTVQRRPYRLGLEEKQIVRNMIEELLKANIIRPSCSPFASPIIL FT VKKKNGSDRLCVDFRELNSNTVDDKYPLPLINDQIARLCGAKYFSCLDMAS FT GFYQIPIHPNSIDRTAFVTPDGQYEFLAMPFGLKNAPSVFQRAVMKALGEL FT AYSYVIVYMDDIMIVAETIDDAFNRLEYVLKILSEAGFSFNVAKCSFLKSS FT VEYLGFEVREGEIRPNPRKKRSLSDLPPPQSVTQVRQFIGLASYFRQFIPQ FT FSQIMKPLYGLTCKDKVFEWKLEHEEIRQKIIKALTDEPVLTIFNPDYPIE FT LHTDASMDGYGAILLHRIDGKPRVVEYYSRRTSSAESRYHSYELETLALYN FT SMKHFRHYLQGKDFVVYTDCNSLKASRLKAELTPRVHRWWAYMQSFNFDIQ FT YRPGSKMAHVDFFSRNLLPNKMNTQVPIKRVQLAEISNNWLLAEQQRDPET FT SLIISKLQNDELEESIAKTYELRSKILYRKVQRNGKTRCLPVIPRPFRWSV FT VNHVHEAIVHLGWEKTLDKMYEFYWFDNMRKYVRKFVEKCITCKLTKPSSG FT KPQVELHPIPKADIPWHTVHIDITGKLSGKSDQKEYVIVQTDAFTKFVYLY FT HTVNINAESCISAIKSLISFFWCANKDNCRSGQMLYGQ" XX SQ Sequence 4593 BP; 1499 A; 834 C; 991 G; 1269 T; 0 other; gagagatggc acagcgaagg tgggcagttt cgagaagaga ttgaaagagc gaacatgtcg 60 cggcccagaa agcgtccggt tatggatttt aatgatattg aaagtgctaa cccgacatca 120 gaagtgggat cagatccacc ccaaacaaat cccgatgacc gcttgtgcgc gatcttggaa 180 tcgcaacacc gaaacttgct ggaagttata aacgcagtga aaagtggtta aaatgaacaa 240 agtacaaaaa aaataattat attgccaaaa ttcaatccca atgttacggg atcgagtgca 300 gcagcttggt gtgctatagc cgactttata ctgaccgaaa accctttgga aagctctacc 360 cttctcatgg cacttagtaa atcgttggac ggcgacgcag cttattggct gtcccaaata 420 tcgtttgctg ggatgacctg gatgcagttc aaggaattgt ttctacagaa ctttgaaggc 480 cacgaaacaa cagctgcaac tatcttcaac atcttaaata gtggtccaaa tgttgatgag 540 ggtctggctt catatggaag ccgaacagtt acgtcacttg tctccaaatg gaaggatgtg 600 agcatcgaag aaatcgctgt atcggttgtg ttggcgcatg cagctaaaat tgagccgaaa 660 ttacaacgaa cgatatttac aaccaatatt aaatctcgta atgagatgcg aaacgagctt 720 agtgcatttg actttggcaa gaggaaaagc aactcggttc ccgaaaactc tgcgagtaag 780 aatatgcgac ttcaacccat ctctaagtgt cacttttgtg ggaagcaagg acataaagtg 840 gccgattgtc gtgcaagacg aatcccagga aattcacaac agaactcaac tcagcgcgat 900 acacgtgtct tggagaagca gagaaaatct acaataatct gttttaaatg tggagaagct 960 ggacatattc aatcggcctg ccctagagga tcatcagcac tagttaaaaa tagagtagaa 1020 gaaaaacgtg taaacatttg tcatgtaaaa gagcctcttg gtaatttaat acagtccggt 1080 gagtcgtatc cattttattt tgattctgga gctgagtgtt ctcttgtaag agaagctgtg 1140 tcccgaaagt tttctggcaa aagaattaat aacgttgttc accttaaagg tatcggaaat 1200 aattctataa gcacgatgca aatattatca aatgtcatta tagacgaata ttcacttgaa 1260 gttgtatttc atgttgtgtt aaatgagtac ctaaattacg atataatatt aggtcgagaa 1320 atcctgtgta atggttttag cgttacaata tcaaacgaaa aagtcgaact tttaaaatca 1380 aaatctatat gtgcggtagc agtttcagat aacctcgttg atttgaccac tgtagatagt 1440 ggagtaagcg gcctagacaa agttcgtttg ttagcaattt taaaaaaata ttctaattct 1500 tttattaatg ggataccaaa aagccgagtt ataacaggtc aactggaaat acgcttagtg 1560 aaccccaaca agacagtaca acgaagaccg tatagactag gattggaaga gaaacaaata 1620 gtgaggaata tgattgagga attattaaaa gcaaatatta tccggcctag ttgttcgcca 1680 tttgccagtc ccatcatact agttaaaaag aagaacgggt cagaccgact ctgcgtagat 1740 tttcgagaac taaattctaa cacagtggat gataagtatc cgttgccgtt gataaacgat 1800 caaatagcta ggctttgcgg cgcaaagtat ttttcttgtc tagatatggc gagcgggttc 1860 tatcagattc ctatccaccc aaattcgata gatcgcacag ctttcgtcac tccagacggg 1920 caatatgagt tcttagcaat gcccttcggg ctaaaaaatg ccccgtcagt gtttcaacga 1980 gcagtgatga aagctttagg agaacttgca tattcatacg taatcgttta catggacgac 2040 attatgatag ttgctgaaac gatagacgac gcatttaacc ggttggaata tgtgttgaaa 2100 atattgtcgg aggctggatt ttcttttaac gttgctaagt gctcgttttt aaaatccagc 2160 gttgaatatc tgggctttga agtcagagag ggtgaaatcc gcccaaatcc ccgcaagaaa 2220 cgttcactgt ctgacttgcc ccctccacaa tctgtaactc aagttagaca atttattgga 2280 ctggcgtcat attttcgaca atttatccca cagttttcac aaataatgaa gccactgtat 2340 ggtctcactt gtaaagacaa agtatttgaa tggaaattag agcatgaaga gattcgtcaa 2400 aaaattatta aagcattgac agatgaaccg gttcttacca tattcaatcc ggattatccg 2460 atcgaactgc atactgatgc cagtatggat ggatacggag cgattttgct gcataggatt 2520 gatggtaagc ctcgcgtagt cgaatattat agtagacgaa cttcttccgc tgagtctcga 2580 tatcattcgt acgaattaga aactttagct ctatacaact caatgaagca ctttcgccac 2640 tatttacagg gcaaagattt tgtagtttat acagactgta attcgttgaa agcctcacgt 2700 ttaaaggctg agctgacgcc aagagtccat cgttggtggg cttatatgca atctttcaac 2760 tttgacatac aatatagacc tggtagtaaa atggcgcatg tagacttctt ttctagaaat 2820 cttttaccca ataaaatgaa tacccaagta cctatcaagc gggtacagct agctgaaatt 2880 tcgaataact ggcttttggc ggagcaacaa agagatccag agacttcttt gataatttcg 2940 aaacttcaaa atgatgagct tgaggaaagc atagcaaaga cttatgagct tcgatcaaaa 3000 attctttatc gaaaagttca aaggaatgga aaaacacgat gccttccagt tatcccccga 3060 ccgtttagat ggtctgttgt gaaccatgtc cacgaggcaa ttgtacacct tggttgggaa 3120 aagaccttag acaaaatgta tgagttttac tggtttgata acatgcgtaa atatgtgcgc 3180 aaatttgtag agaaatgcat tacatgcaaa ttgacaaaac catcgtcagg caaaccacaa 3240 gtagagctcc atcccatacc aaaagcagat attccgtggc acacggttca cattgacatc 3300 actggtaaat taagcggaaa aagtgatcaa aaggaatatg taatagttca aaccgatgcc 3360 ttcaccaaat tcgtttactt atatcacaca gtcaatatta acgcagaaag ttgcattagt 3420 gcaatcaaat ccttgatttc gtttttttgg tgtgccaaca aggataattg cagatcaggg 3480 caaatgcttt acgggcaatg actttaacaa attttgcacc atgcaaaaaa ttgacttaca 3540 tttgatagca acgggggcta gtcgagcaaa tggccaagtt gaacgcgtta tgagagtact 3600 gaaacatatg ctaaccgcca ttgagagtag tgagcgttct tggcaagacg ctttaggtga 3660 aattcaattg gctataaact gtacaacaaa tcgaactaca aaagccagcc cattagaatt 3720 actaattgga aaagaagtaa gacctttaaa tatgttacat gtttgtgaaa atgaagataa 3780 gacagatata agtaatgtaa gaaagattgc caaagaaaat attgaacaga acgctcgtta 3840 cgaaaaagag agattcgaca aaaataagtc aaaaattgtt agatttaagg ccggcgatta 3900 tgcactgctc aagaatgagg aaagacacca aactaaatta gatcccaaat ttaaaggtcc 3960 gttcttagtg acagaggttc tcgatggaga ccgttatgtt ttgaagtcct tagtcaacaa 4020 gcgaacttat aagtattcgc atgagtcgtt gaggcgtatg ccggcggagg ggattcccat 4080 tgagttggaa atgtgtagag acggcgttga aagagacgtc aatgaaggcg ttgaaagaga 4140 cgtcaatgaa ggcgttgaaa gagacgtcag tgacatgtta gtggagtcta acatgtgaga 4200 tatcagacac tatgccgggg aatcttgctg atatgtgact taactactaa agctattagg 4260 gacacgagtt aatgtgttat cctattgagt cgctggtagg cacgtatagt aagattttgc 4320 gtgggctgtt gtcctgatga tctaagtaaa cctcacataa ctagtggaaa tccttttgcc 4380 accggataca tcacaaaaat aattgttcta tcttggaaac tctgtcaatt gaatgaccta 4440 ctcgagaaaa tataaattga ctgtattttt atttaaagta ataagtttgg actgatttgt 4500 cttttaaatt gagctgtaca tactgctgtt tagtttattt tattagtttt ttagattgta 4560 acacacgagg acgtgtgaac gtcagaatgg ccg 4593 // ID Gypsy-595_AA-LTR repbase; DNA; INV; 205 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-595_AA_; KW Ty3_gypsy_Ele22; Gypsy-595_AA-I; Gypsy-595_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-205 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 205 BP; 62 A; 32 C; 49 G; 61 T; 1 other; tgtgaggatg cgagagaccg atgaagaaga gtagtaaatc gttttcgaag ccaagcgcag 60 cagttgagtt gaggtcttcc gcgagawtag gagtgtttca gaccgctccg tgataatttg 120 gtgaagtgat ttatttccat attttaacac tgaattggca atcattaaat aaagaaaaat 180 cgtttaaacg cgtctttttc ttaca 205 // ID Gypsy-171_AA-I repbase; DNA; INV; 5847 BP. XX AC supercont1.268; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-171_AA_; KW Gypsy-171_AA-LTR; Gypsy-171_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5847 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.268; Positions 1206427 1212273. XX CC Positions [4617-5093] - Integrase core CC 'GGTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1063..2271 FT /product="Gypsy-171_AA-I_1p" FT /translation="MSEAGQYVRVHQCDDEHSSNDHAEEHSKESEVPEQHN FT HEDEQLDTQQTEKPGTTMDERVLRLENAVVRLSNVVIEKHFPEVAKTAAKS FT WTDTQKETSVTQEARQCANVRWDHVQAFPKNVPASKLWETWRHYMESFEIA FT ASFSNATDPALKAKLLYLVMGEEIQTMVRAANLRPSLDDPECYRKLVSNIE FT EYFRSLVDPAAEHDEFFGMHQQQGESVVRFHARLMEKAQACEFHTDNQNQF FT VHSQLLKGMRNQDIATSARTYSHDTACVVQAAARQETYGSEFQRGPPSCTD FT TSTVQAVEQNLHKRKNYIEPRSMDHQSRFFKKRKFDSKPKKSTTCSSCGRS FT EHSDPRKCPAKTRSCKSCGKRGHFEAVCRSKRINNTDGEISEDHRISMKTR FT EPSVENDKV" FT CDS 4533..5537 FT /product="Gypsy-171_AA-I_3p" FT /translation="MPGDAENWVKACRVCAINGKPEKPTPMERIFAPKAVW FT ETIAIDFNGPYARYGGMYILVLVDYRSRYLIARPTKSTGFEQTKSVLEEVF FT DRQGFPSVIKSDNGPPFNGDDYKKYCAERGIKTIFSTPLFPQQNGLVENYM FT KIINKAMSASVTCGTNYNDELQAAVRAHNAAAHSVTKVPPEEIMLGRKIKR FT GLPLLDREKVHHDEDLLNQRDREAKLDAKHREDKRRSARLCRVKPGDVVIV FT EKQTRSKGEARFGDKRFTVLEEKNGNLLLADDEGSTLRRHVSQTKVVGEWM FT DAPPKASENHVTIPAAAPIRPIRERSAPSYLKDFVQIISAEVR" FT CDS 2669..4264 FT /product="Gypsy-171_AA-I_2p" FT /translation="MEIEQSFKAKISTVDIFKPSTWAEFLVVPDGRRSLLG FT RKTAGEMKLLIVGAAVNACVDSGKVETFPKMPGVRVKFSVDKSVPPVRNAY FT YNIPAAYREAARNRLNDMEMKGIIEKVSTAPEWISGMSAVAKGKGDFRLVV FT NMRAANRAIKREYFRLPLLEEMKVKLHGAKFFSKLDLSNAYYHLELDKDSR FT DLTTFLSENGMFRFTRLMFGVNCAPEIFQREMTKILKQFPNVIVYIDDILV FT FAKSLKELRETVAAVMKVLRANNLTLNAQKCELDKTSIKFLGHNLDEHGFH FT IDEAKVRHIQQFRQPQSVSELRSFLGLASFVSPHIQHFADLTSPLWAVSGK FT NSWKWGPSQKQAFENVKHKISKCATTLGYFCETDRTVLYTDASPTALGAVL FT VQVSGTNTPRIISFASKALTETEKRYPQNQREALGAVWAVEHFAYYLLGRH FT FTLRVDAQGIAFLLNRTREDSKRALTRADGWALRLSPYDYNIECIRGRDNI FT ADTLSRLYEGIDGPFNDDVSPWEIAKLEVKKCYLFN" XX SQ Sequence 5847 BP; 1749 A; 1199 C; 1350 G; 1549 T; 0 other; atggcgattt cctgccagaa ttttctttct caacaaagaa aggaaatgat cagtttcttg 60 gagtagttta tttctttatt caattcactg tagaagaatc gccccattga accaaaaact 120 tgtttagttt ttacaaaatc ggcaatttcc tttaccgacg cgagtgtacg acgaaaaatg 180 cgtgaatcaa aatggcgtgt gaaaaaaagt gaaagtgaag atttttttta ttccctcctg 240 ctgatgtgca tgttggtttg ttgctaaatt gtttgaacca cttttttccg caaatggcgc 300 actgacgaaa gcacaaatcc gtttcgatgt cgttggtttg ccttcatctt tagtgcaaat 360 gtcacatgtc ggatagtgta tcacagtgca gaatgttttg cggtgaaatg gttcatgcca 420 tttagagcaa atgtcgtgct gacgattacc cattgcgaag atgttttgcc gtcgtgtggt 480 tcttggaaac cactccgcaa ttgtcgcata aaaagggagg cgtatcacag tgctcgttga 540 tttgcagttg catggtctgt aacattcatc gcaagtggcg cattgacgaa attacaagac 600 ctttctgatg ttgttttgtc gtcgtgtagc tttcaacagc tgcagcacaa atgtcatatc 660 aagaggaaat tgtatcacag tgctgagtgt tttgcgtttg attgttgcat gccttacagc 720 gcaaatgtca cactgatgga tttgcattgg ctcttatttc catcgcaacg ttgttttacc 780 gtcgtgtggc tctcgggagt cgtaacgtaa atgtcgcgtt gggaagagta tgatggtagt 840 ttgcctatca aattgcgaaa aacattcaat ccaaaacatg ttctagtatt ttttgctatc 900 ggttaacatc ttttcacaat aaattaaatc tattaaagta tacaagcact cgaatcggat 960 ctatgtttca aagatttgtt taagcctttt taataaaagc aatttgacga tgctagagat 1020 tgacatgtgt ttttgcatat tttggtaggt tcaacagtca aaatgtcgga agcaggacag 1080 tacgttcgtg tccaccagtg tgacgacgaa cattcttcaa atgaccacgc tgaagaacat 1140 tccaaagaat cagaagttcc agaacagcat aaccacgaag atgaacaact cgatactcaa 1200 caaacggaga aacctggcac caccatggat gaaagagtgc ttcgcttgga aaatgccgta 1260 gtgcgccttt ccaacgtcgt gattgagaag cattttccgg aggtggcaaa aaccgccgct 1320 aaatcttgga ccgatacaca aaaggagacc agtgtgacgc aggaggcacg ccagtgtgct 1380 aacgttcgct gggatcacgt tcaggcgttc ccaaaaaacg ttccagcatc gaagctttgg 1440 gaaacttggc ggcactacat ggaaagcttc gaaatcgccg cttcattcag caacgctaca 1500 gatcctgcgt tgaaggcaaa gctactgtat ctggtcatgg gtgaggaaat tcaaacgatg 1560 gtaagggccg caaatctgcg accgagtctg gacgatccgg aatgctaccg caaactggtt 1620 tcaaacatcg aagaatactt cagatcccta gtcgatccag cagcagaaca tgatgaattc 1680 ttcggcatgc atcaacagca gggcgagtca gttgttcgtt tccacgctcg tttgatggaa 1740 aaggcgcaag cctgcgagtt tcataccgat aaccaaaacc agttcgtcca ctctcaactc 1800 ttgaagggca tgaggaatca agacatcgct acctctgcac ggacttacag tcatgacacg 1860 gcatgtgtcg ttcaagctgc cgcaagacaa gaaacctacg gatcagagtt tcaaagagga 1920 ccacccagct gtacggacac ctcgacggta caagccgtcg aacagaatct tcacaagcgg 1980 aagaactata tcgagccaag atccatggat catcaatcca gattcttcaa gaaacgtaag 2040 ttcgatagca agcctaagaa atcgacgact tgttcttcgt gtggccgttc ggaacacagt 2100 gacccaagga agtgcccagc caaaacgcgc tcctgtaaat cgtgcggcaa gcggggacac 2160 ttcgaagcag tatgccgaag caagcgcatc aacaacacag atggtgagat ttccgaggat 2220 catcgcatct ccatgaaaac gcgtgagccg agcgtcgaaa atgacaaggt atgagttgaa 2280 aatatttgtt atactttttc ttttgaatat tccaataagc acatcggcga ggaacgcctt 2340 ttttttaatt ttacattaaa aacgtggttt taagttttta tacttgtttt gtttaattta 2400 tgcggaaact aagtgtcctg tccatttttg tgtcttattt tatctgcagt ccgctaacct 2460 catctcttat gaagacgtcc tagtgagctg tagtgtagga tcttcaacac ctatcaaatt 2520 cttgattgat tcaggatcag atgttaatgt gatcggagga tgcgattggg aacttttaaa 2580 acaagaattt gaaaatggtc tacttcaaat tgatttcgtc gcgaaacaca accatttatc 2640 ggttcgtgct tatgctgccg ataagccaat ggaaattgag caatctttca aagcaaaaat 2700 ttcaaccgtt gatattttta agccgtcaac atgggcagaa tttttggttg tccctgacgg 2760 gcgtcgatct ctgctcggga gaaaaaccgc aggtgaaatg aagcttttga tcgttggagc 2820 ggccgttaac gcttgtgtag actcaggaaa agttgaaacg tttccaaaaa tgcccggcgt 2880 aagagtcaaa tttagcgtcg ataagtccgt tcctcctgta aggaatgctt attataacat 2940 cccagccgca taccgcgaag cagcgcgaaa cagattgaac gatatggaaa tgaaaggtat 3000 aattgagaaa gtttcaactg cccccgagtg gattagcggc atgtcggcgg tcgctaaggg 3060 aaaaggagat tttagactcg tcgtcaatat gagggcagca aacagagcaa ttaaaaggga 3120 gtactttcga ctgccccttt tagaggaaat gaaggtcaag ctccatggag cgaaattctt 3180 ttccaaactt gacctctcta atgcttatta tcatctagag cttgacaaag actccagaga 3240 tttaacaaca ttcctatcgg aaaacggaat gttccgtttt acaagattga tgttcggggt 3300 caattgtgcc ccggaaatct ttcagagaga aatgaccaag atcttaaagc aatttccgaa 3360 cgttatcgtg tacatcgatg atatattggt gtttgcaaag tccttgaaag agctgagaga 3420 aaccgtggct gcagtcatga aagttttgag ggcaaataat ctcaccctga acgcgcaaaa 3480 atgtgagctg gacaagacat ccattaaatt tctggggcat aatttggacg agcatgggtt 3540 tcacatagat gaagcgaaag taagacatat tcaacagttc cgacagccac aaagcgtctc 3600 cgagctaaga agctttctcg gcttggcttc attcgtaagt ccgcatatcc aacacttcgc 3660 tgacctaaca agccctttat gggccgtttc ggggaaaaat agctggaaat ggggaccttc 3720 ccagaaacaa gcatttgaga acgtaaagca caagatttca aaatgtgcta ctactcttgg 3780 atacttttgt gaaactgatc gaacagttct ttacacagac gcgtctccta cggcattggg 3840 cgccgtctta gttcaagtta gtggaactaa taccccccgc atcataagtt ttgcgtccaa 3900 ggctcttact gaaacagaga aaagatatcc acagaatcag cgtgaggctc tcggagctgt 3960 atgggccgta gaacattttg cctattactt attaggacgg catttcaccc ttcgagtgga 4020 cgcccaaggc atagcttttt tgttgaacag gacaagggaa gattcaaagc gtgcgctaac 4080 cagagctgac gggtgggccc tgaggttaag tccgtacgat tataacatag aatgtatccg 4140 cggacgggac aatattgcgg atactctttc tcgtttatac gaaggaatcg acggcccgtt 4200 caatgatgat gtcagccctt gggaaattgc aaagctggag gtaaaaaaat gctacctttt 4260 taactgatga cgaagtgaga gaggcaacat cgcttgacga ggttctatgc aaggtagttc 4320 aagctttagg atctggcgaa tggcccgtgc agttgaaacg atttaagcaa gtggcggaag 4380 atttgtatgt taaggatggc cttgtgatta agaatggttg cgtagtaatc ccagacaaat 4440 tacgacataa aacgttggag attgctcatg aaggtcatcc tctagccgca aagctcaaaa 4500 gcatccttag ggagcgcgta tggtggcctg gtatgccggg agacgcagaa aactgggtga 4560 aggcttgtcg ggtttgtgca attaacggaa aaccagaaaa acccaccccg atggaacgca 4620 tttttgcccc taaagcggta tgggaaacca tagctattga tttcaatggc ccctatgcta 4680 gatatggagg gatgtatatc ctggtgcttg tggattacag gtctagatat ttgatcgctc 4740 ggcctacaaa gtcaactggt tttgaacaaa ccaaatccgt attggaggag gtattcgata 4800 gacaaggttt tcctagtgtc atcaaaagcg ataatggccc cccattcaat ggggatgatt 4860 acaaaaagta ttgtgcagag agaggcataa aaaccatctt ttctacgccg ttgtttccgc 4920 aacaaaacgg actggtagaa aactatatga aaattatcaa caaggctatg tcagcatccg 4980 ttacctgtgg aacaaactac aatgatgaac tacaagccgc agttcgagca cataatgccg 5040 ccgctcactc agttaccaaa gttcctccgg aagagataat gctcggccgc aagattaaaa 5100 gaggactccc tctgctggat cgtgaaaaag tgcaccatga cgaagattta ctgaatcaaa 5160 gagatcgcga agctaagctg gatgcaaaac atcgcgagga taagcgtaga tcggctcgtc 5220 tctgtcgagt caaaccggga gacgttgtca ttgttgaaaa acaaactcga tcgaaaggag 5280 aagctagatt tggagataaa cggttcacag tattggagga gaaaaatgga aacttgttgc 5340 tggctgatga tgaagggagt accttaaggc gacatgtttc acagacaaag gtggttggtg 5400 aatggatgga tgcacctcca aaagcatctg aaaatcatgt aacgatccct gcagcagcac 5460 caatccgacc aatcagagaa agaagcgctc cgtcgtacct gaaagatttc gtgcaaatca 5520 tctcagccga ggtaagataa taaagccgta tatatataaa tgtgtgtgtg ttgttgtcac 5580 atgaatatta acctgctgtt tgcaatgtca aatggctaaa aaaatatacg attttctttt 5640 atattttaca gaggaaccaa ctgtagtcat cttcaaataa ccagtcattc aatttgtatt 5700 atcattatgt aggttagaag attgctatgt agtagtaacc ccgattaaat tgtccacttg 5760 aaaacaattt ttcttatatt gaaacatgcc gaaaaaacaa atttcttcaa ttcgtaacaa 5820 actttgatct acggaggaaa gggaaga 5847 // ID STREPE_PF repbase; DNA; INV; 545 BP. XX AC L22446; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Plasmodium falciparum subtelomeric repeat. XX KW STREPE_PF; Repeat region; subtelomeric repeat. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RA de Bruin D., Lanzer M. and Ravetch V.J.; RT "The polymorphic subtelomeric regions of Plasmodium falciparum RT chromosomes contain arrays of repetitive sequence elements."; RL Proc. Natl. Acad. Sci. U.S.A 91(2), 619-623 (1994). XX DR Genbank; L22446; Positions 1 545. XX SQ Sequence 545 BP; 273 A; 21 C; 42 G; 209 T; 0 other; gaattcattg tattagtagt atttctaata aaaaataatt aaaataataa aatatattat 60 gaaataaata taaataacat tatatataat atgataggtt ttttaaggaa tataattctg 120 aattgaaata atatacttat tggttttaat tttaattata taaatataga tataatattt 180 tgaataatat gtttcttaaa atattaataa taatagtaat aacaataatc ataattataa 240 taataagttt tattgttttt tttttgatat atattccttt gaaaaacata tgaggtattt 300 aaaaaatata cgttacatta tgttttaaaa aaataatata ttataatacg tataagtatt 360 ttatataaat taaaaattac attgtttatt aattaaaaga gttttagaaa aaataaataa 420 ataaataaat aaataaataa ataaataaat aaataaaatt atagaaaaat aaattatata 480 taaaagcgca atacatataa aaatatatag ttatttataa tggagaaaaa aaaaaaacac 540 ggtac 545 // ID hAT-29_SM repbase; DNA; INV; 3024 BP. XX AC . XX DT 14-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-29_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3024 RA Bao W. and Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 78-78 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 293..1750 FT /product="hAT-29_SM_1p" FT /translation="MSSKKPSGFQNRKRKTLRAQEREKESGRLLKFFKSSL FT DVSATSDLDRCVGQSDVQQNMSDHADEGISKKEENKREEGVSEFDIMMQSE FT CSHEIKSSTSATINYVDYNDPGNWPDYCNDNFCQILLQNPPHQIITYNFPK FT DSKNRRFSPIHYKRKLANGEEVYRSWLIYSTIKDAVFCFCCKLFNKNSSSI FT LEKSGSKDWKNIGAILSSHERNTFHLDNFQTWKELDVRISKEKTIDNINQQ FT NIKEEEQYWRQILERLIALIRVLATQNLAFRGTNEKLYNNNNGNFLKFVEY FT LALFDPVMNEHLRRVKNQDIMVHYLGKDIQNELMQILAGAIKNKILSLVKS FT AKYYSIILDCTPDVSHIEQMTIIIRFVDIIKPLDSEIFEPAVIIREHFLGF FT VPLEETTGAFITETLIEKLEQMELQIENLRGQGYDNGSNMKGKEKGVQNRI FT LNINPRAFFIPCNAHSLNLVVNDASKCCLEATNFFSSVFSV" XX SQ Sequence 3024 BP; 1086 A; 470 C; 552 G; 915 T; 1 other; cagagccggc gctaccatta aggcgaacta ggcggtcgcc tagggcgcta tccagagagg 60 ggcgacaaaa tgtgtaatcg ccattattaa aaaataacta ataattaata actttgatta 120 tattatataa actatagaca cagggtaata taataagata attattgtat ttaatataaa 180 tcttttaatt tcgaacattg cacttctttg ctggtaagtt taatagcaaa tattactttg 240 ttgtattaaa ataattaaat attatttata gtggtacatt agtactttta aaatgtcatc 300 aaaaaagcca tcgggatttc aaaatcgtaa gcgcaaaact ctcagagctc aagagcgtga 360 gaaggaaagt ggtcgactgt taaagttttt caaatcaagt cttgatgttt ccgccaccag 420 cgacttggat agatgtgtag gacaatcaga tgtacaacaa aatatgtcag atcacgcgga 480 tgaaggtatt tcaaaaaaag aagaaaataa aagagaagaa ggtgtttcag agtttgatat 540 tatgatgcaa agcgaatgct cacatgaaat taaatccagt accagtgcca cgattaatta 600 tgtagattac aatgacccag gaaactggcc tgactattgt aatgataatt tttgtcagat 660 tttattacaa aatccaccac atcaaattat tacatataat tttcccaaag attccaaaaa 720 tcgaagattt tctccaattc actataagag aaaactagcc aatggtgaag aagtataccg 780 ttcatggttg atatattcaa ctataaaaga tgcggttttt tgtttttgct gcaagttatt 840 taacaaaaac agctcgtcaa tcctagaaaa atcaggatca aaagactgga aaaatattgg 900 agcaattcta tcttcacatg aaaggaatac tttccattta gacaactttc agacgtggaa 960 agaacttgat gtgcgtatat ccaaagaaaa aacaattgat aatattaacc agcagaatat 1020 caaagaagaa gaacaatatt ggcggcaaat tttggaacgt ttgatcgctt tgatcagagt 1080 acttgccact caaaacttag catttcgtgg aacaaatgaa aagttgtaca ataacaacaa 1140 cggtaatttt ttgaagtttg tagaatatct agccctcttt gaccctgtaa tgaacgaaca 1200 tttgcgcaga gtcaaaaatc aagacattat ggtacattat ctgggaaaag atattcaaaa 1260 tgaacttatg cagatcctag caggagctat taaaaataaa attttgtcac ttgtgaaatc 1320 ggcaaaatat tactcaatta ttttggattg cacaccagac gttagtcata ttgaacagat 1380 gactattatt atacgttttg ttgatattat caaaccgtta gatagtgaga tattcgagcc 1440 tgcagtaata ataagagaac atttcctagg atttgttcca ttagaagaaa ctactggggc 1500 attcataaca gaaactttga ttgaaaagct tgagcaaatg gaattacaaa ttgaaaatct 1560 ccgtgggcaa gggtatgata atggaagtaa tatgaaagga aaagaaaaag gtgtacaaaa 1620 cagaattcta aacataaacc cgcgagcctt ttttattccc tgcaatgcac attctttaaa 1680 cttagttgtc aatgatgcct ctaaatgttg cctagaagcg actaatttct tcagttcagt 1740 tttttcagtt tagttcaaca gatttataat tacttctctg catcgactca acgttggcat 1800 gtgtttacta gccatatcgt gaatttaaca gttaaaccat tgagcgaaac aagatgggaa 1860 agtaggattg atgctttgaa gcctataagg tatcagcttg gtgcaatata tgatgcaytt 1920 taatggagat ttttgatgac ccaagactaa ataattcatc tggtaatact tcccgaacgg 1980 acgcaaaggc tcttgctgat gcaatttgta aattcaagtt tatggtatcc cttgtcacat 2040 ggtacaacat tcttaatgaa gtgaatgtta caagcaaaat actacaaaaa gaggattctg 2100 ctataaattc agctacaaaa caacttcaag tgactaagaa ttacctctta aaatgtagga 2160 gtgataatgg ttttgaacaa gttttaattg atgctgccga gattgcaaag gaattagaaa 2220 cagaagcgaa atttgagccc gatgaatgta gaagaaggag acataaacgg cagtttgaat 2280 atgaagctca agacgaggca ccacaagatc ctaaacataa attcaaagta gaattctact 2340 acactatctt ggacatggct atacagtcta ttgaagagag gttccaacaa ttagaacaac 2400 atgatatttt atttggcttc ctttatgata tatccaatat caacaaaaag acatctgcgg 2460 atatacttac tgactgcaaa catttagaaa cgtctttgac gcataaagat aataaagata 2520 ttgatgccta tgatctttgt aatgagcttc aagtagtagc tcaaaggctt ccaaagccta 2580 tgtcaccatc agatgtgctc ctctatatag tacaacaaaa attggaagat tgtgtaccaa 2640 atgttgttgt ttccttgaga atattattaa cacttccagt gtccgtggct agtggagaac 2700 gcagtttttc aaaattgaaa ttaattaaaa gttacttgcg ttcaactatg tcgcaaagca 2760 gacttgtgga cttagcaaca atttcaatag aatgtgatta cgcttcgaca ctggaattga 2820 aggaattagt agaaactttt gcgaggaaga aggcaagaaa aattaaattt taattttgta 2880 acgtttatgc aattgataat aaattaatat gtcactttaa atttattgtt ttcttatata 2940 tatttatttt tttattaatt tgtgtgaaag ggggcgtttt aataagtttt gcctagggcg 3000 ataaataagc tggcgccggc cctg 3024 // ID BEL-189_AA-LTR repbase; DNA; INV; 250 BP. XX AC supercont1.85; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-189_AA_; KW BEL-189_AA-I; BEL-189_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-250 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.85; Positions 975284 975533. XX SQ Sequence 250 BP; 92 A; 40 C; 40 G; 78 T; 0 other; tgttgagtgt aattaaaaaa aaaaacgtat ttttttatta ctgtgcacac cgctcaagct 60 ttcttgacat ctgaaatttt acttgaattt gtttttgttt ccgcttgaac gtgcgcaaga 120 ttctaagaca gaatataaaa gccagtcaaa caaaaatggc gaaaaaactg tgtgctcaaa 180 taaatactaa ttaatacagt ccattataaa gtcggaattg gattttatac gattgacgaa 240 accacgaaca 250 // ID BEL-640_AA-LTR repbase; DNA; INV; 478 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-640_AA_; KW Pao_Bel_Ele3; BEL-640_AA-I; BEL-640_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-478 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 478 BP; 154 A; 83 C; 87 G; 154 T; 0 other; tgttgccgca agtctatcgc accagtaccg aaagggtgaa gaccaatagc ggatggtatc 60 atgacacctg tcatcgagaa atccattgtt cggctgccac tcatcagcgc ttgtggtcaa 120 cgggattaaa gtgtgatcga gaaggtggca tgcaagtgtt cccctattgt gagcaggaaa 180 ttaattaaat aatattcgtt tatttatact ttcatcaaac taaaagtatt aaaattgaat 240 tcaatcatct cttcactaat aattagattt tatatttaac tgttgatagg tttgaagaga 300 tatatgcgga aatttgttat accctaaccc tattcgtgaa cttaaaagca ctaaattgta 360 agttaccctt tgaaatctat aactgctatg atccttaata ttatgattat ttatagcttt 420 gaagctgaat aaaatatagc cgtcgaacct tagtgcaagg gttttaatta ccgcaaca 478 // ID MuDR-3_TV repbase; DNA; INV; 3197 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE MuDR DNA transposon from Trichomonas vaginalis. XX KW MuDR; DNA transposon; Transposable Element; MuDR-3_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-3197 RA Kapitonov V.V. and Jurka J.; RT "MuDR DNA transposons from protozoans."; RL Repbase Reports 8(12), 1813-1813 (2008). XX DR [1] (Consensus) XX CC The MuDR-3_TV consensus sequence was derived from multiple CC alignment of 12 copies that are <1% divergent from it. MuDR-3_TV CC copies are usually flanked by 9-10-bp TSDs (several copies are CC not flanked by TSDs). MuDR-3_TV contains imperfect 30-bp TIRs (8 CC mismatches in the 10-bp terminus) and codes for a 557-aa MuDR CC transposase. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 781..2451 FT /product="MuDR-3_TVp" FT /note="MuDR transposase." FT /translation="MHIDAPCELKERLSYIDAMKAVIDNECHAGCPYQLKK FT NPNLGGTKLKFRCSADKCEACISFLKDDEEYILTEYNGTHNHNQIKDETKK FT YHTLSQCYRRVLINFIKQGGDPKMAQIECDKKLDLDDPLNRPSFLNASSDQ FT MKTIQEKGNKKIKGIQKYDDNQFKSTQLFKGIKEQESPKDLIYFDVDNYNK FT PKKLIFYYASFKFLDIIKIVKEHFIDATHSLLEFKMLFYMISAKFPNTHAF FT PIFQFVVYPNTSENIAFCLKAFFNWAKVKPKYFMSDCAQEIENAIINSFPE FT VILHWCAVHVMRAFRKNLKDYAFESSDKLILDTKMNYLAYGRNGKKEWIEP FT TFKKILEIAEKFPEFSKYIQNQWISNQERWTAAERDENLALTNNISESINK FT KIKYYYFGGTIFMRFDRFVMKLIDFIVPSFYYRISQDIRLRDKIPNPLPSE FT KKPKKSTIRLDTEKCIMLTDKIKSLIVNSSANLNTLQLGLDDLLDKVVKAA FT KVQKRMTQYFSKLSIPMEIKLSILTQITEFGCNTPQFIVENAKSFQDKIIT FT DFHLQIYTTL" XX SQ Sequence 3197 BP; 1181 A; 453 C; 486 G; 1077 T; 0 other; gagtaggggt aattttagct aaaagtgaca cgcctgtaaa aagtgacaca cccgtgaagg 60 gtgtgattac cgatcggtaa tttcgacagt ttagaattta aaatttttta tggaactgtt 120 gcgagtttca taaagaattt aaggaatata agcaaagatt tgaatcaaaa tcttagacct 180 aagccgctcg attcgctcgc ggcttccgcg aattacagat cagcaaagat ttgaatcaaa 240 atcttcgaac taagccgatc gcttcgctcg cggctttctt tattatttta aatttgcata 300 tttccatcaa aagttcataa ggtaaaaaat ttcgtttata ttattgcaaa ataaaatctt 360 cattatcaaa ataatattaa atgataatat aatataattt ataaacaact attatttaat 420 aaagtttaat tgccaaggtt catctaattt tttgattttt aatgttttat gaattttatt 480 tatcatgtat atatctaatg aagattatat atgtagaaga taagtacctt aatatgtacg 540 taaaaattaa tgaaatttat tttgaatgta gaaccgagca attttcaaat tttgtgcttt 600 ttttgaattt tctcgatacc agataagttg tgttggctaa ataatatatg gatacaatgt 660 gtcttataga tgaatatatg cacaaagttt aagaggccat tttctttcag aaaccgagat 720 ttgtgaattt tttcaaattt tgtgcttttt ttgatattga aatttcattt catttttttg 780 atgcatatcg atgcaccatg cgaacttaaa gaaaggctat cttatataga tgctatgaaa 840 gctgtcattg ataacgaatg ccatgcaggt tgtccatatc aattaaaaaa aaatccaaat 900 ttagggggaa caaaattaaa attcaggtgc agtgcagata aatgtgaagc atgtatctca 960 tttctaaagg atgatgaaga atatatccta acagaatata atggaacaca taatcacaat 1020 caaatcaaag atgagacaaa gaaataccac acattatcac aatgctatcg tcgggtactt 1080 attaatttta taaaacaggg gggtgatccg aaaatggctc aaattgaatg cgacaaaaaa 1140 cttgatctgg atgatccact taacaggcca tcttttttga atgcatcttc agatcaaatg 1200 aaaacaattc aggaaaaagg aaataaaaag atcaaaggta ttcaaaagta cgatgataat 1260 caattcaagt caactcaatt atttaagggt atcaaagaac aagagtcacc aaaagattta 1320 atttatttcg atgtcgataa ttacaacaaa ccaaagaaat taatctttta ttatgcttct 1380 ttcaaatttc tggacataat caaaattgtg aaagagcatt ttattgatgc aacccattca 1440 ttacttgaat ttaaaatgtt gttttacatg atatcagcca aatttccaaa tacacatgca 1500 ttcccaattt ttcaatttgt cgtttatcca aacacttcag aaaatattgc attctgtctc 1560 aaggcatttt ttaactgggc aaaagtaaaa ccaaaatatt ttatgtcaga ttgtgcacaa 1620 gaaatcgaaa atgcgattat caactctttt ccagaagtta tccttcattg gtgtgcagtt 1680 catgtaatga gggcgttcag aaaaaatttg aaagattatg catttgagag ttcagacaaa 1740 ttgatattag atacaaaaat gaactatctt gcatacggca gaaatggcaa gaaagaatgg 1800 attgaaccaa cgttcaaaaa aattctggaa atagccgaaa aattcccaga attttcaaaa 1860 tacatacaaa atcaatggat atcaaaccaa gaaagatgga cagccgctga aagagatgaa 1920 aatttagctc tcacaaataa catttcagaa tcgattaaca aaaaaattaa atattactac 1980 tttggcggaa caattttcat gagatttgat cgatttgtga tgaaattaat cgattttatc 2040 gttccttcat tttattacag aatttctcaa gatataaggc tcagagacaa aattccaaat 2100 ccgttgccat ctgaaaaaaa gccaaaaaag tctacaatca gactggatac agaaaaatgt 2160 ataatgctca ctgataaaat taaatcgtta atagttaact catcagcaaa cttaaatacg 2220 cttcaattgg gtttagatga cctacttgat aaagttgtca aggcagcaaa ggttcaaaaa 2280 cgaatgaccc agtatttttc gaaattaagt atcccaatgg aaatcaagct tagtattctc 2340 acgcaaatta cagaatttgg atgcaatacc cctcaattca ttgttgagaa tgccaagtct 2400 tttcaagata aaattattac agatttccac cttcaaattt atacaacatt ataaaaattt 2460 atgaataaaa acttgatata aaataacatt tttattttta ctaagaaaaa aaaggttttt 2520 gataataaat ttattttatt tcgaaaaagt agcaataatc gtaaaagggt attgataggc 2580 ttgctaaagt aactaatttt agctgtcaaa tacttttgta ttttgaagta aattagtata 2640 aatttgtttt tacttttatt tcgaaaaagt agcaataatc gtaaaagggt attgataggc 2700 ttgctaaagt aactaatttt agctgtcaaa tacttttgta ttttgaagta aattagtata 2760 aatttgtttt tacttttatt tcgaaaaagt agcaataatc gtaaaagggt attgataggc 2820 ttgctaaagt aactaatttt agctgtcaaa tacttttgta ttttgaagta aattagtata 2880 aatttgtttt tacttttatt tcgaaaaagt agcaataatc gtaaaagggt attgataggc 2940 ttgctaaagt aactaatttt agctgtcaaa tacttttgta ttttgaagta aattagtata 3000 aatttgatat tattttaact ccctttgaaa acctcaattt tccatacaga gggaacggga 3060 atagcatcca atattccctg agaatggttg cgcattctca gttccctcta tatttcgaac 3120 gaagctctca tcggtctttt tttttaatac taaacgtttt ttttccctgt cacttttagc 3180 taaaatttgg gtttgcc 3197 // ID Gypsy-598_AA-I repbase; DNA; INV; 7444 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-598_AA_; KW Gypsy-598_AA-LTR; Ty3_gypsy_Ele194; Gypsy-598_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7444 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2877-3380] - Reverse transcriptase CC Positions [4446-4925] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 220..2067 FT /product="Gypsy-598_AA-I_1p" FT /translation="MSRVFPRAEDLNQEELDFELQIRNEPEEVYKLDLTAK FT QRHLRNLFKNDQKDGRLYQSPFSINDEYAHVEGRILNLEKALEKKVESKYE FT SRVLHYWYRVKWIHVEGEEEKKKKRELVQAIERIMMKYQFGPPQSPTVERQ FT SQTELGRMESVLGGNISNPFEFPKVGVPARNGTGTIPKITLSEIKDQNTFC FT ETGAEGLRHGTDDSTRISVTRKEWEDMQAAITELAAKLAQSEALRQQNDFS FT NTERISQYRNVPQRNPVPTRTVIDRSRRTADSYQASIEDEDSEEDVRRPQP FT VWRSRHWRSSHDSESGYNNGWNNDAYQSGWRHPGGRYAGQGRVEKWKLRFT FT GELRSTMSVENFLYKAKKLAEREGVAKHILLRDIHMLLEGAASDWFFTFVD FT DLNTWEEFETAISYRFGNPNKDQGIRTKIQERKQQRGEPFIAFVTEIEKMN FT RLLSKPLSNRRKFEVIWDNMRQHYRSKISIVEIKDLQQLIKLNYRIDAADP FT QLQSQNMEGTSGYQRRPINQLEAEGSDYDSDQSINAINTRTNNNFARANRP FT SMEQNARPTQNATTPASCWNCQGQGHTWRQCTRPRVIFCYGCGNLGRTIRS FT CERCSRPSNAQNSSQGNE" FT CDS 2481..5318 FT /product="Gypsy-598_AA-I_2p" FT /translation="MIDVGQGFEEVETVGQTNSLPTCFTIEPSGEPLSADE FT LEEDETLDIPVYEGPISSVPDPDAIETEHELSEEERKQLTGVIRNFELTSD FT GKLGRTHLITHEIVLQEGAKPRNPPMYKCSPYVQQAINDEVERFKRLDAIE FT ECYSEWTNPLVPVLKKNGKVRVCLDSRKINKLTVKDTYPMRNMQDIFRRLG FT KAKFFSVIDLKDAYFQIPLKEECRNFTAFRTTEGVFRFKVLPFGLTNAPFT FT MSRLMDKALGFDLEPFVFVYLDDIVIASNTFKEHLRLLRIVAERLKKAGLT FT ISLEKSRFCRKQVMYLGYLLNENGVAIDTARIQPILDYAQPKTQEDIRRLM FT GLAGFYQRFIKNYSRITAPITDLLTKENKKFVWTKEAEIAFRELKSVLTSA FT PILGNPDFTKPFIIESDASDRAIGAALVQEQDGETRVISYFSKKLNRTQRR FT YAAVEKECLGVLSAIQHFRHYVEGTKFRVITDARSLLWLFNVGTETGNAKL FT LRWALRIQAYDFDLEYRKGKANITADCLSRSIEVDTVVISQPDAEQEELME FT RIQSNPNKFKDFRIVDGRVFRFVKTGNRMSDPRFAWKCYPPRTERCDIIRQ FT EHEKAHFGFEKTLAAIKNRYYWPRMNIEIRKFCRECLKCQVSKAGNVNVTP FT PMGSQKPVEYPWQFVTLDYVGPLPPSGKNRNTCLLVATDVFSKFVLIQPFR FT EAKAHSLVDFVENMIFRLFGVPEIILTDNGSQFVSREFKKLLEDNHVSHWL FT TPAYHPQVNNTERVNRVIVTAIRATLKKSHNHWADDIQQIADAIRTSVHES FT TKFTPYFVVFGRQKVSDGREYSKIRDNHVPPEEDEQQIKNKRKELFDEIKT FT NLKAAYAKHARSYNLRSNVQCPTYIVGEKVLKQTFDQSDKGKGFCKKLAPK FT YEPAIVRKVLGANTYELEDLTGKRIGVFFANKLKRMLPQSGT" XX SQ Sequence 7444 BP; 2240 A; 1534 C; 1708 G; 1962 T; 0 other; tttggcgccc aacaatacaa ggctttttct tcgctgttta ttaaaactct ttttttgttg 60 tttgttggat tttcttgaat tctgtaaata tttatttttt tcgtatcgtt tttttttcat 120 ataaatacat tttcttagtt tcttattacc ttatttgaat tctatatatt ttgttcgcat 180 tttcattaaa tttaagctaa atttttaaaa taattcacga tgagtcgtgt attccctcga 240 gcggaagatt tgaatcagga ggagctggat tttgagctcc aaatccgcaa tgaacctgaa 300 gaagtctata aacttgatct aactgcaaag caaagacatt tacgaaatct gttcaaaaat 360 gatcaaaaag atggacgttt gtaccaatcg ccattcagca ttaatgacga gtatgcacac 420 gttgagggaa gaattttgaa tttagaaaaa gcgttggaaa agaaagtgga atctaaatac 480 gagtcaagag tactgcacta ttggtacagg gtaaaatgga ttcatgttga gggagaggaa 540 gagaaaaaga agaagaggga gttggttcag gctattgaga gaataatgat gaaatatcag 600 tttgggccac cgcaatcacc gacagtcgaa aggcagagtc aaacagaact aggtcgtatg 660 gaatctgtac ttggagggaa tataagcaac ccattcgagt tccctaaggt tggcgttccc 720 gcgaggaatg ggactggtac tattcctaaa atcacattga gcgagatcaa ggatcagaat 780 accttctgcg agaccggggc cgaaggactg cgacatggca cagacgattc aacacgtatc 840 tcggtcacac gtaaggagtg ggaggatatg caggcagcga taacggaatt ggcggccaaa 900 ctagcgcaga gtgaggcact gagacagcaa aacgattttt cgaatacgga acgaataagc 960 cagtatcgaa acgtaccaca gaggaatccg gtacctacta gaaccgtaat cgaccgtagt 1020 agacgtacag cagacagcta ccaggccagt atcgaggacg aagacagtga ggaggacgta 1080 agaagaccac aaccggtttg gaggagtagg cattggcgga gcagccatga ttcggaaagc 1140 gggtacaata acggttggaa caacgacgct taccaatcgg gttggaggca tccaggtgga 1200 agatacgcag gccaaggacg agtggagaaa tggaaattac gattcactgg cgaattgaga 1260 tcgacaatgt cagtggagaa ttttctgtat aaggcgaaaa aattggccga gagggaaggg 1320 gtggccaaac acatcctact gcgtgacatc cacatgcttt tggaaggggc cgcctctgac 1380 tggtttttca cctttgtcga tgaccttaac acgtgggaag agtttgagac agccatttcc 1440 tacagatttg gaaacccaaa taaggatcag ggcatccgaa ctaaaataca agagcggaaa 1500 caacaacgtg gcgaaccgtt catcgcgttt gtaacggaaa tcgaaaagat gaatcgttta 1560 ttgtccaaac cactctcaaa tcgaaggaag tttgaggtga tttgggacaa tatgaggcaa 1620 cactataggt ccaaaatatc gatcgtggaa attaaggacc tacagcaact gataaagctt 1680 aactaccgaa tagatgcagc cgacccacaa ttgcagtcgc agaatatgga aggcacatct 1740 ggttatcaac gacgtccaat aaaccagcta gaggcagaag gaagcgacta cgatagtgac 1800 cagtcaatca acgcaatcaa cactcgaacc aataacaact tcgcaagagc aaatagacca 1860 tcgatggagc agaatgcaag acctacacag aatgcgacaa caccagcgag ctgctggaac 1920 tgccaaggac aaggtcatac atggagacag tgtacacggc caagagttat tttctgttac 1980 ggttgtggaa acttgggcag aacgattcga agctgtgagc gttgctccag accctcgaat 2040 gcccaaaact caagtcaggg aaacgagtaa aggggggcga tggggggaat tcgtcaaacc 2100 cagcaaaagt aataattccc aaacaacaac caaactacga cccattttct caaatttacc 2160 acattaaatt taagaggtca aaatgccctc atattaaagt cagggttttc gagttcgaac 2220 tcgatgcctt actggactcc ggagcgggga ttagcatcct aaactctcta gaagtggtag 2280 aacgttataa tcttaaaatt caaccaacgg ccatcaaggt ttcaacagct gatggatcag 2340 tatatagttg cttggggtac gtgaatattc cattcacgta caaaaagttg accaaagtgg 2400 tacccactct cgtagtacca gagatttcaa ggcatttgat ccttggagta gacttttggg 2460 aatgttttgg aataaaacct atgatagatg tgggtcaggg gtttgaagaa gttgagacag 2520 tcggacaaac caacagtttg ccaacgtgct tcaccataga accttccggt gagccgttaa 2580 gtgctgacga attagaggaa gatgagacac tggatattcc agtatacgaa ggaccgataa 2640 gttctgtacc agacccagac gctatcgaga ccgagcatga gttatcagag gaggagagga 2700 aacagttgac gggtgtcata aggaatttcg aactgacatc ggacgggaag ctaggaagaa 2760 cacatctaat aacccacgag attgtgctcc aggagggagc taaaccgcgt aatccgccaa 2820 tgtataagtg ttctccctac gttcagcaag ccatcaatga tgaggttgag aggttcaaac 2880 ggttggatgc aatagaggaa tgctatagtg agtggaccaa cccgcttgta ccggtgctca 2940 agaaaaacgg gaaggtgagg gtctgcttgg actcacgaaa aattaacaag ttaacggtca 3000 aggacaccta cccaatgcgt aatatgcaag acatattccg tcgattgggt aaggcaaagt 3060 tcttttcagt aattgacctt aaggacgcct actttcagat ccctctgaaa gaggaatgtc 3120 gaaacttcac tgcattccgg acaaccgaag gagttttccg ttttaaggtg cttccgttcg 3180 gccttacgaa tgcgccgttc acaatgtcac gtctgatgga caaagccttg gggttcgacc 3240 tcgaaccatt cgtctttgtg tatctggatg acattgtcat cgcgtccaat acgttcaagg 3300 aacacctccg gctattgcgt atcgtagcag aaaggcttaa gaaagccgga ctcacgatct 3360 cgctggaaaa gtcaagattc tgccggaaac aggtaatgta cctcggctat cttctcaacg 3420 aaaatggagt ggcaattgac acggctcgaa ttcagccaat tctagattac gcccaaccaa 3480 aaactcaaga agacattcgc cgcctcatgg gattggcagg cttttatcaa agatttataa 3540 agaattatag tagaattacg gcaccaatca cagatctcct cacgaaggag aacaaaaagt 3600 ttgtctggac taaagaagcg gaaattgcat tccgagaact gaaatctgtc ctaacttcag 3660 caccgattct agggaacccc gatttcacaa aaccgttcat catcgaatcg gacgcttcag 3720 atcgggccat tggagcggcc cttgtccagg agcaggacgg agaaacgcga gtaattagct 3780 actttagcaa aaaactcaac cgtacacaac gaagatacgc ggcggttgaa aaggaatgtt 3840 taggggtact gtccgcaatc caacatttcc gacattatgt ggagggaacg aagtttcgag 3900 taatcacaga cgcgagaagc ctcctgtggc tttttaacgt cgggacagag accggaaatg 3960 caaagctgtt gagatgggca ctacgaatcc aagcctatga ttttgatctg gaataccgga 4020 aaggtaaggc aaatattacc gcggactgtc tatcgaggtc gattgaagtg gacacggtgg 4080 taatctccca gccagacgct gaacaagagg aactgatgga acgaattcag tccaatccta 4140 acaagttcaa agactttcgc atagtggatg gcagagtttt tcgattcgtc aaaacaggaa 4200 accggatgag tgacccacga ttcgcatgga aatgctatcc tccgagaacc gaacgttgcg 4260 atataatccg ccaggaacac gagaaggcac attttggttt cgaaaaaaca ctggctgcaa 4320 ttaaaaatcg ttactactgg ccacgaatga acattgaaat tcggaaattt tgtcgagaat 4380 gtctcaaatg ccaagttagt aaagccggaa atgtcaacgt aacgccacct atgggttctc 4440 aaaaaccggt ggaatacccg tggcaattcg ttacactaga ctatgtagga ccgttacctc 4500 catcgggtaa gaatagaaac acctgcttac tagtagccac agacgtcttc agtaagttcg 4560 tcttgattca accgtttcgc gaagccaaag ctcactcttt ggtagacttc gtggaaaata 4620 tgattttccg tctatttggg gtgcccgaaa ttatattaac cgacaatgga tctcagtttg 4680 tctcgcggga attcaagaaa ctgctcgaag acaatcatgt gtcccattgg ctcacacctg 4740 cgtatcaccc acaagtgaat aatacggaaa gagtgaaccg ggtaatcgtt acggccatac 4800 gcgcgaccct aaagaagtcc cacaaccatt gggcagacga cattcaacaa atagcggacg 4860 caatcagaac atcggttcac gagtcaacca aattcacgcc ctacttcgta gtttttggtc 4920 gacaaaaggt ctcagacggt agagagtatt ctaagatccg cgacaatcac gtaccacctg 4980 aagaagacga acagcagatt aaaaacaagc ggaaagagtt gttcgatgaa attaaaacga 5040 acctaaaagc cgcctacgct aagcatgcta gaagctataa tcttcgttct aatgttcaat 5100 gccctactta cattgtcggt gaaaaagtcc taaagcagac gttcgaccag tcagataaag 5160 ggaagggatt ttgtaagaag ctggccccga aatatgagcc agctatagtt cgaaaagtct 5220 tgggagccaa tacctatgag ctagaagacc tgactgggaa acggatagga gtctttttcg 5280 cgaataagct gaagcgaatg cttccgcagt ctgggacata gtcagagact gggttttcaa 5340 gctatgcaac tcttaatgag taactataag taccaaaaaa cacctatatg gtcattacgt 5400 atacggacag tttcgacgga ctcgcctagg gaaaatctac ggatggaatc tacgctgagg 5460 gtaccgaaac tgagcctaaa caatatctaa acaacacctc aagctatgaa caactttgga 5520 aagtgactaa aaaaatcaat aatcgattgc aaaaacacct aaaagggcag aaactaattg 5580 aggaggactc aaataccgag aaagatacga ccaaacgaga cgctcaatca ctgagcttgt 5640 aaatacctgt aaatattagt tgttagtttc ctaatgccct atcccttaat cactagaatt 5700 agtacttaaa ctagattagt agataaacat taattactca cggttcaatt cgtaattgat 5760 tagtcctttt gttgaattgg tcgtcaagtt gtaaatattc gcttttttct ccttatcgtc 5820 ctttttgtaa atatcctcgt tgttcatcgt agccgaatcg cgttaatata ttgaattagt 5880 tgaatttagt cgccaagttg ttcatatcct ggttttcctc agcattatcc gatgaattcc 5940 aatccgtttc tcagattagt ttcctttccc gattagtcca ccttgttgtt tttccttgct 6000 tctactatat cgatccaaat cgcgttgcaa tcccgtatcc atgcacagtt ctgctttacc 6060 gatccgtcct tgtttatttt tagttcctct ccaattctcc tttaccgatc caacgtagtt 6120 aatcaacatt cctccatttc cttcctcatc catagttttc gtgtccggtt actttctcct 6180 tattcccgtc atccgtcctt tgttatttca atcgcacagc tccggtattt ctccttaatc 6240 cgcatctgac tggtccggtc gtctgctttc cgatggcggc tggtccgaac tttcacctgc 6300 gaagaaaaga aacaaaacaa accaacttac atcgatagct cggaacgatg attaatttta 6360 ctctcctcaa gtagtccata gcccggagtc ttcaaatttc cactcgtaca attcaatttt 6420 ccagcgcaac actttttttt tcacgtttgt gcaaaagttt tcaccgcact tcccactcgc 6480 gggttgtttt ttcttctcgt ccatcacttt gacagttcgt ttgagggtga tgacattgga 6540 tgtttggtag accggttatg cgggtgaacg tcacgtatgt agttggattc tgcacttcac 6600 agtgtcggag ttcatttggt atgattgcga tgggtgtatg gatgtgtagt agcttgtgtt 6660 tatttaattt ctgcgagatt tttgttagtc agacgatgta ttggcgctta aaaaaatctc 6720 gtaagaggct tatttaagat ttgcgtcgca atgagtgtaa atcgcggttt ctgttccgtg 6780 gcaacgaatt gcgattcgga tcggcttatt ttgtgaggaa ggttaaaatc tcgttttctc 6840 gcttcgggct ggggtagaga ttgttgactg gactcacggt tacgtccact taggtggaga 6900 ctttcgtaag tgcgtcttca tatatcggca aaatgtcctg gttttagaag atgtacttcg 6960 gtttatgtta ttttaagctt gctcagagtt ttagtcgact actctgagtt gaagctgagt 7020 ttttcgtttg tgaatttatg tttgaatgaa tggagggggg aaagtatgag tgtggtcatc 7080 atcaacaatt attcttttct ttgatcaaga taggttttca gtaaaaaaaa ttagttcttt 7140 gcgatcggat ctctttatat acttctttag taaggttata aattgtaatg gcatggcagt 7200 aatgattttt cgctggagac cggagaccgt aggtcgctga tggtcagtgg cagatgtggt 7260 ttgttatgag ctgcatactg taaatagaga tggagttgaa ttcgttcaat gatgaaacta 7320 tatgaccccg aaaattgaag catgtcatcg ccatgactat ttagcaagcc tacgaaaatt 7380 tgattggctt ttgttagctt gatagcatcg ccaatcaaat tttcgtaaat ttaggaggga 7440 gtga 7444 // ID Gypsy-1_PPP-LTR repbase; DNA; INV; 226 BP. XX AC ADBJ01000004; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_PPP_; KW Gypsy-1_PPP-I; Gypsy-1_PPP-LTR. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2159-2159 (2010). XX DR GenBank; ADBJ01000004; Positions 1175512 1175737. XX SQ Sequence 226 BP; 89 A; 36 C; 19 G; 82 T; 0 other; tgtaacgaca ataccacttt ttcaaaattc aaattatttt tcatttttaa aaactcaacg 60 actattcaaa aaaacgaacg atcaacttta ccattcaagt tcttacacta actctgaata 120 ggtatgatca atttatattt aataaacatg attcaagttc ttacactaac tctgaatagt 180 ctttaaatta tttgagttga ttattaataa ataaatagac tgatca 226 // ID BEL-1-I_HM repbase; DNA; INV; 5507 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5507 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2073-2073 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(39..2873,2827..5469) FT /product="BEL-1-I_HM_1p" FT /translation="MVKRKNQKSNNQMEDQLTVQNEVSALNRKKIVLKSSF FT TRTKHQIIQQIGDDEPTCKNELRELSNQLTKTQNEALEIMVELSKKYELLE FT SKENVGKLNTEIEKLNDEFAEIMEHVNEWILRNGSLKSSLKQQRMEKELKM FT ETKKQCYQWLSTQKSNDQYEDNKEENGNSEMEPIKSTHIMRKMENEIIDLD FT QKTAKIGHDLWKQLKRVSIPIFNGDKKNYENWKASFYACIDQAPATSEYKL FT LQLRQYLCGDALKAIEGLGHSSFAYEAAKERLERKYGGTRRKILMYLDELN FT NFKPMRENNPKDVEKFADLLDVAVTNLKEANRTEELGDGTLYHQLQKKVPE FT IMLIRYKRWVYENKLNENVKTLKKFIVEEADFQIAATETLHGLKKTIGSKA FT HEGNSYFGNSDQKKKINWIQKCSVCKNDHKIWECPTFQNQNIKERWETAKK FT NKLCFRCLNSNHSSKECRNTRICGIDGCTDTHNRLLHKKSHQKETISEDYK FT EKDTILENSHTSNIQFHSNHVMNYISLRTVPVILRNGYKKVVVNALLDDGS FT TKTYLNSDVAAELSLQGKTEKVTVNMINGKIDSFETMPVEFQLESLNGETK FT ITVEAFTVNNVTGNLKAVNWANMSKSWKHLRKIEFPNIGPKPKIDILIGVD FT YAELHCAIKEVKGGFNEPVARLTPLGWTCVGGTGGSLQTHFTKITSSTKEI FT EDANNTIKKLWEIEGDEEMQFRKTMTSPEDSTALNIVTNSLKTENGRYELK FT LPWKGNRHLDNNYTMALNRLQNTEKRLMKNKSLGEEYNNIIKQYQEKGYLE FT KINKRDLGDGWYLPHFPILRPDKSTTKVRIVFDGSAKYNGKSINDVIYQGP FT KLQQDLVTVLLRFRKYPVALACDIAEMYLKIGIHKDDQRYQRILWRNLDST FT AEPEVFQFNRLVFGINSAPFMAQFVTQEHARKYAQKISFGFRKNTQESMHK FT KFPLASEAVLEATYMDDTITSVVDEKVGIQLYKELTLLWGSAGMFARKWLS FT NSVEVLKITPENDRAEHINLDSGELPAMKTLGVVWKAKPDLFSFHSVTTEE FT NTVYTKRILLKKMATLFDPLGFLAPYIIRIKIIMQELWIDGIEWDDAXPDR FT IANNVDQWFQELNDLPKINIPRCLQTTSTVTNRSIHVFTDASCKAYGAVAY FT QQCLYDTGEVTCVIIMSKARVSPLQSISIPRLELLGAVLGLRLAEKIVKAL FT KLETKDVTIWCDSLNVLWWIKNQSRKLKPFVANRVGFIQSKTELKQWRYIP FT TKTNVADLLTRGTTVKELESNYVWWHGPSFLNSLEEKWPQNHIEVTQEASI FT EVKKNPVLMNFALSTEVTKQLLDINKFSCWKKLVRINGWIHRFIGNCRFET FT DFRKKGDIAADEYHESEKEIIAKAQKESFKEEYSNIEKGKPISISSKIISL FT NPQIDEDGLLRSCSRLQNAHYLPYDVKYPIILPRGHTITKLIVKHYHDEAN FT HVMGTNQLLTKLSERYWVIRGREEIRDAEAKCNKCKLRRCKPAQQLMAPLP FT AMRFKEPLRAFARIGIDFAGPFLTMQGRGKIRQKRYLCLFTCLLSRAVHLE FT MAYNLDTNSFLNAFYRMVNRRGLPLEVLTDNGTNFVGGCRELNEIVNDLDK FT DKITNSTADKGIKWHFNAPLGPHLGGVFEIMVKAVKRAMTGILEKADITDE FT ELCTAFTGAESLLNSRPLTYQSANIKDDIPLTPNHFLIGQVGGQFAPDSVQ FT SKGYHPKKRWRRVQELLSHFWKRWMREWLPSLSKRYKWNQKKREIKTDDIV FT LVISPDTSRGHWPLGRITGVFEGKDGRVRVVKVKIGEKEFIRPITRLCLLE FT FDN*" XX SQ Sequence 5507 BP; 2115 A; 853 C; 1094 G; 1444 T; 1 other; taattggtag cagagggtag tttcttcaac acggaagaat ggtaaaaaga aaaaatcaaa 60 aatcaaataa ccaaatggaa gatcagttaa ccgttcaaaa tgaagtatct gcattgaaca 120 ggaagaaaat agttctaaag tcttcgttta cgcgaactaa acatcaaatt attcagcaaa 180 ttggagacga tgaacccact tgtaaaaacg agttacgcga attgagtaat caattaacaa 240 agactcaaaa cgaagctctc gagataatgg ttgaattatc aaagaaatac gaacttctgg 300 aatcaaaaga aaatgtagga aaactcaata cagaaattga aaaactaaac gacgaatttg 360 cggaaattat ggaacatgta aatgaatgga tattgcgcaa tggaagttta aaatcaagtt 420 taaaacaaca aaggatggaa aaagaactta aaatggaaac aaaaaagcaa tgttatcaat 480 ggttaagtac tcaaaaaagt aatgaccagt atgaagataa taaagaagaa aatgggaata 540 gcgaaatgga accaatcaaa tcaacgcata tcatgcgtaa aatggaaaat gaaataattg 600 atttggatca aaagactgca aaaattggac atgatttatg gaaacaactc aaacgcgtct 660 ctataccaat atttaatggt gataaaaaga actatgaaaa ttggaaagct tcattctacg 720 catgcattga tcaagcccca gcgacatctg agtataaatt acttcagtta cgtcaatatt 780 tatgtggaga cgcattaaaa gcaattgaag gacttgggca ttcgtcattc gcttatgaag 840 ctgcaaaaga aagacttgaa agaaaatatg gcggaacaag gagaaaaatc cttatgtatt 900 tggatgaatt gaacaacttt aaacctatga gggaaaataa tccaaaagac gttgaaaagt 960 ttgcggattt gttggatgtg gcagtaacaa acttaaaaga agcaaatcga acagaagaac 1020 ttggcgatgg gacattgtac catcaattgc aaaaaaaagt tcctgagata atgttaatta 1080 gatacaaacg atgggtgtat gaaaacaaat taaacgaaaa tgttaaaact ctaaaaaagt 1140 ttattgttga agaagcagat tttcaaattg cagcaacgga aactctacat ggtttaaaaa 1200 aaactattgg tagcaaagca cacgaaggaa attcttactt tggtaacagt gatcaaaaaa 1260 agaaaataaa ttggatacag aaatgttctg tttgcaaaaa cgatcataaa atatgggagt 1320 gtccaacgtt tcaaaatcaa aatattaaag aaagatggga gacagctaag aaaaataaac 1380 tctgcttcag atgcttgaat tctaaccatt cgtcaaaaga atgcaggaac acaagaattt 1440 gtgggattga tggctgtaca gatacccata atcgtttgct tcacaagaaa agtcatcaaa 1500 aagaaacaat ttctgaagac tacaaagaaa aagacacgat attagaaaat tctcatacgt 1560 cgaacatcca atttcattca aatcacgtca tgaactacat atctcttaga acggtgccag 1620 ttattcttcg taatggatat aaaaaagttg tggtaaacgc attgcttgac gatggaagta 1680 ccaagacata tttgaactct gatgtagcgg ctgagttgtc actgcaaggt aaaaccgaaa 1740 aggtaactgt aaacatgata aatggaaaaa ttgattcttt cgaaacgatg ccggtagaat 1800 ttcagttgga atcgttgaat ggagaaacaa aaattacagt ggaagctttt acagttaata 1860 atgtaactgg aaacttaaaa gcagttaatt gggcaaatat gtcaaaaagt tggaaacatc 1920 taagaaaaat cgaattccca aatattggac caaaaccaaa aattgacatt ctaatcggtg 1980 tggattacgc tgagctgcac tgtgcaatta aggaagtaaa aggaggattt aatgaaccag 2040 tggctcgact cacgccgctc ggatggacat gtgttggtgg tactggtgga tctcttcaaa 2100 cccattttac gaaaataaca tctagtacaa aggaaataga agatgcaaac aacaccatta 2160 aaaaattatg ggaaattgaa ggcgacgaag agatgcaatt tagaaaaaca atgacaagtc 2220 cagaagactc taccgccctt aatatagtaa ctaattcgct aaaaactgag aacggaagat 2280 atgagctaaa attaccatgg aaaggtaata gacatttgga caataattat acaatggcgt 2340 taaacagatt acaaaatacg gaaaaacggc taatgaaaaa caaaagtcta ggagaagagt 2400 acaataatat tattaagcag tatcaagaaa agggatactt ggagaaaatc aataaaaggg 2460 atcttggaga tggttggtac ttaccacatt ttccaattct tcgtcctgat aaatcaacta 2520 caaaagtcag aattgtgttt gatggatctg caaaatataa tggtaaatca attaatgatg 2580 ttatttatca aggtccaaaa cttcaacaag accttgtgac tgttttgctg agatttcgaa 2640 aatacccagt ggctttagct tgtgatatag cagaaatgta tttaaaaatt ggaatacata 2700 aagatgacca aaggtaccaa cgaattttat ggagaaacct agattcaact gccgagcctg 2760 aggtatttca gtttaatcga ctcgtatttg gaatcaactc agcgcctttt atggctcaat 2820 ttgtaacgca agaacacgca agaaagtatg cacaaaaaat ttcctttggc ttctgaagct 2880 gttcttgaag caacttacat ggacgacaca ataacttctg tagttgatga aaaagtagga 2940 attcaattat acaaggaact cacgctacta tggggttcag caggaatgtt tgcacgaaag 3000 tggctttcaa actcggtcga agttcttaaa ataacaccag aaaatgatcg tgcagaacat 3060 ataaatttag attctggaga attgcctgca atgaaaacac ttggagttgt ctggaaagca 3120 aaacctgatt tatttagttt tcattctgta acaaccgaag aaaatacggt atatacaaaa 3180 cgaattttgc ttaaaaaaat ggcaacgttg ttcgatccat tgggtttttt ggcaccatat 3240 atcattcgaa tcaaaataat tatgcaagaa ctgtggattg atggaattga gtgggatgat 3300 gcarttccgg ataggattgc aaataatgtt gatcaatggt ttcaagaact aaacgatctc 3360 ccaaaaataa acattcccag atgtttacaa acaacctcaa cagtaacaaa taggtcaatc 3420 catgttttta cggatgcgtc ttgtaaagct tacggtgctg ttgcttacca acaatgttta 3480 tatgacacag gagaagtgac ctgtgtgatc ataatgtcaa aagcccgagt aagtccattg 3540 caatctatta gtataccaag acttgagtta cttggagctg tcctcggttt acgacttgca 3600 gagaagattg tcaaggcatt gaagctggaa actaaggacg taactatatg gtgtgatagt 3660 cttaatgttt tatggtggat aaaaaatcaa agtcgaaagt tgaaaccatt tgttgccaac 3720 cgtgttggat tcattcaatc aaagactgaa ctaaaacaat ggagatacat accaacaaaa 3780 actaatgtag ctgatttact aacaagagga acaacagtta aagaattaga aagcaattat 3840 gtatggtggc acggaccaag tttccttaat tctttagaag aaaaatggcc acagaatcac 3900 attgaagtta cacaagaggc ttcaattgaa gtaaagaaaa atccagtatt aatgaatttt 3960 gcattatcca ctgaagtaac aaagcaattg cttgatataa acaagttctc atgttggaaa 4020 aaacttgtca ggataaacgg ctggatccat cggtttatag gaaattgtcg atttgaaaca 4080 gactttcgta aaaagggtga tatagcagct gatgaatatc atgaaagcga aaaagaaatc 4140 atagctaaag ctcaaaaaga aagctttaaa gaagaatata gcaacattga aaaaggaaaa 4200 ccgatatcga tatcaagcaa aattatttct ttaaatccgc aaattgacga agacggatta 4260 ttgcgatctt gcagtcgatt gcaaaatgcc cattacttac cctacgatgt caagtatcca 4320 atcattttgc cgagaggaca tacaataaca aaattgattg ttaagcatta ccatgacgaa 4380 gccaatcatg tgatgggaac aaatcaatta ttaacaaaac tatcagaacg atattgggta 4440 atacgaggac gagaagaaat ccgagatgca gaagcaaaat gcaacaaatg taaattaaga 4500 agatgtaaac ctgctcaaca gctgatggcc cctttaccag caatgcgctt taaagaacca 4560 ctacgtgcgt ttgctagaat tggtatcgat tttgctggtc catttcttac tatgcaaggt 4620 agagggaaga tacgacaaaa gagatacttg tgcttgttta cctgtttact atcacgagca 4680 gtacatcttg agatggctta taatctagat acaaattcat tcctcaatgc cttttatcga 4740 atggtgaata gaagaggatt accattagaa gttttaacgg acaatggaac aaattttgtt 4800 ggaggatgtc gagaactaaa cgaaattgtc aacgacttag ataaggataa aattactaac 4860 tctacagctg ataaaggaat caagtggcac tttaatgctc cgctaggacc tcatcttggc 4920 ggtgtatttg aaataatggt caaagctgtc aaacgtgcaa tgactgggat actcgaaaaa 4980 gccgatatta cagatgaaga actctgtacg gcatttacag gtgctgagag tttattgaat 5040 tctcgcccac taacatatca gagtgccaat ataaaagatg atattccatt aactccaaat 5100 cattttttga ttgggcaagt tggtggacaa tttgcacctg actcagttca atctaaaggc 5160 tatcatccta aaaagcgctg gcgtcgagtg caagaactac taagtcattt ttggaaacgg 5220 tggatgcgag aatggcttcc atctttatct aaaagatata aatggaatca gaaaaaacgc 5280 gaaataaaaa cggatgacat tgtgttagta atttctccag atacatctag aggacattgg 5340 ccacttggta gaattacggg agtctttgaa ggtaaggatg gacgtgtaag agttgtgaaa 5400 gtcaaaattg gcgaaaaaga atttatacga ccaattacga gattgtgtct tcttgagttt 5460 gataattaaa tgtttgatag actggagtct atcaaagagg gggagaa 5507 // ID BEL1_MH-LTR repbase; DNA; INV; 456 BP. XX AC ABLG01000037; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_MH; KW BEL1_MH-I; BEL1_MH-LTR. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-456 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1517-1517 (2009). XX DR Genome; ABLG01000037; Positions 53809 53354. XX SQ Sequence 456 BP; 155 A; 75 C; 54 G; 172 T; 0 other; tgtcgacctg ctcgacttcc ggtagaaaga aatttaaaag gaaagatgaa tgttaaagat 60 ggacaaaaaa gataagaaaa aactatattt cattccatta aaaaattaaa ctctttattt 120 taaatttatt ttctttaaca ttccaatccc cttttttcca cttgtccatg tcaagccaag 180 ccactaacga acgattgcat atattttttc ctttctctaa aaaaccatta tatgcgatcg 240 ttcatcccat tttcttttta tttataagcc attgcctttt accacaattg tcttcagttg 300 tcttcagttt ttaatatccc ctacttttct ttggaaataa attgtatttt ttaaatatag 360 aataaaattg tttttaataa attggggaga aattataaaa tttttgaata tattcacaaa 420 ggaagtgaat ataagctttg cagattccag tctgca 456 // ID BEL-47_CQ-I repbase; DNA; INV; 2911 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-47_CQ_; KW BEL-47_CQ-LTR; BEL-47_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2911 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 247-247 (2011). XX DR [2] (Consensus) XX CC 'ATTGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 423..2909 FT /product="BEL-47_CQ-I_1p" FT /translation="MWKAMFTMKLEDAFQQRVHAMQKIANLRDVVHDLQHP FT ALAQLTKLRQTLWSLYDEYGRAHSCIVGNVPDDDLAQQEVEYDAFDQLFDA FT VSVPLEEKFIALTNANPKTSAMDYVAHQQKLWTIYESNYWDTPMMDCESAQ FT PELSGQTANSNLRSRPENLETEVILRSHSPSTSYPSSPPKTRSRSQTLEAD FT TALRSRMPPKIDPIRPSKPTSASDVLDTEVVMRDQLQPATNADPTDLPVSA FT ISSEVTETSVQCVPLEPAQGNPNESGTGRVLPDDGTHMTGAPTPDSVLNHE FT ASLIATDNGIEPQVEPDERWELAMPTVTTLLNTDRCSEDSGSRTEAEMMST FT VLPIFPEPNLFNSKRSLTDGDARTNEREPLPAVILQNEKVNSLRVEHGSSL FT TTIQRDAVTLQTRAELNPPLPTDAKAPDKRRLMIACPNTPLNDVHGNRKKH FT AVNKTATPNLQKTNGVSPKIPDPLHAVLCRGTPHEQRPTSSTSGDLRDTEA FT PEPNRVTHQHHEEKPPQKRKSETEQLTKTCVRSERKGGSKHALKRLQHRAA FT PKKSTPTSEALNHRPLLNEPPGLPQRRRTITAVFWTCLKLNRTNGSPCCSR FT CCSKLTSNLKATIVESRTKPKTKRSRSLWTTSSTTAAALVSMLQILLAFCH FT TIHAKYYENDFVGCETSFPNNGRGPCEQIVAPQSVKVNRRVVSSQKVRLLR FT LLAGWQRSKIPSNPPYSYSTVRMSRSCNRRYNGDSTYSRGCHPRKPIIQDV FT SGRSLMNDSEPAHRKYSVGERTLGTKSGQPLNTMTTPSCSTETIVTGSNKP FT ARSTLRFDIIHANLISKNVESMLQRGE" XX SQ Sequence 2911 BP; 808 A; 922 C; 698 G; 483 T; 0 other; tatggtccta ccgacccgga ttggactttc gtgtgcgcgc cgccccgggg acatccggaa 60 ggaaaattcc gactaagtgc gcaagaagaa aagctctggg attgagctcc ctgagccgta 120 ccatcaccaa aacgttctgg aacattcggc tgcttgccga gtggggtcct tgtaagaatt 180 tccggtgaaa aatgcataag aagtgaaccc agaaaaagaa ccttcctgct gcggcagtac 240 agtgcaaaac agaagaacaa gaagatcctt cctgctgcgg cagaacagtg cgaaaagtga 300 aacaaacccg tcgtctgtga agccatcatc cctctcgcgg aaaccacatt ccgctgccaa 360 cgctgtaggc gagctgcagc gttccgcatc gattctgtag ccctttgatt ccccatttca 420 ccatgtggaa ggcaatgttt actatgaagc tggaggatgc gttccaacag cgagtacatg 480 ccatgcagaa gatcgcgaat ctccgggatg tcgttcacga tctgcagcat ccggcgttag 540 cgcagctgac aaaactgcgt caaacccttt ggtcgttgta cgacgagtac ggccgagccc 600 acagctgcat cgtcggcaat gttccggatg acgacctcgc ccagcaagaa gtggaatacg 660 acgccttcga ccaactcttc gatgcagtca gcgtgccatt ggaggagaag ttcattgcgc 720 tgacaaacgc caacccgaaa acgagtgcaa tggactacgt tgcacaccag cagaagctct 780 ggacgatcta cgaaagcaac tactgggaca ccccgatgat ggactgtgag tccgcacaac 840 cggagctctc aggacaaacc gccaactcaa atctacgatc caggcccgaa aacctggaaa 900 cggaagtgat tttgcgaagt cactctccgt cgacgagcta cccgagctca ccacccaaaa 960 cccggtccag gtcccaaacc ctggaagcgg atacggcttt gcgaagtcgt atgccgccga 1020 agatcgatcc gatccgccca tccaagccca catccgcgag cgatgtcttg gacaccgaag 1080 tggtcatgcg tgaccagctt caaccggcaa ccaatgccga tccgacggac ctcccagtgt 1140 ccgcaatctc cagcgaagta accgaaacca gtgttcaatg cgtgcctctc gaaccggcac 1200 aaggaaaccc caacgaatcc ggaaccggcc gtgtcctccc ggacgacggt acccacatga 1260 ctggtgcgcc aactcccgac agtgtcctga accacgaagc aagtctgatc gcaacagaca 1320 acggcatcga accacaagtc gaacccgacg aaagatggga actcgcaatg ccgacggtga 1380 caacgctcct caacactgac cgctgctcag aggacagcgg atcccgaacc gaagccgaga 1440 tgatgtcaac agtgctgcca atctttcccg aacccaacct gttcaactcc aaacgttcgc 1500 tcacggacgg tgacgcacga acgaatgaac gcgagccgct accagcggtg atcctccaga 1560 acgaaaaggt gaattctctc cgtgtcgagc acggcagctc cctgacgacc atccagcggg 1620 acgctgtaac cctccagacc agagcagagt taaatccgcc gctgccaacc gatgcgaaag 1680 cccccgataa acgacgccta atgatagcct gtcccaacac gccgctcaac gacgtccacg 1740 gaaaccggaa gaagcacgct gtcaacaaaa ccgcaacacc caatctacaa aagacgaacg 1800 gcgtttctcc aaagatcccg gacccgttac acgcagtgct ttgccgaggc acaccacacg 1860 aacaacgacc aaccagctcg acttctggcg atctgcgaga cacagaagct ccggaaccca 1920 atcgcgtgac acaccaacac cacgaagaga aacctccaca gaaacgaaag tccgagaccg 1980 agcaactgac gaagacgtgc gtgcggtctg aacgcaaagg tggctccaag cacgccctca 2040 aacggcttca gcatcgagca gcaccgaaga agagcacccc aacaagcgaa gcgctcaacc 2100 atcgaccact cctgaacgaa ccacccggcc tgccgcagag acggcgcact atcaccgcag 2160 ttttctggac ctgcctgaag cttaaccgaa caaacggcag tccctgctgc tcacgctgct 2220 gctccaagct aacatccaac ttgaaagcca ccatcgtcga gtccagaacg aagcccaaaa 2280 ccaaacgaag cagatcactt tggacaactt caagcaccac cgctgccgca cttgtgagta 2340 tgctccaaat tttgcttgct ttttgccaca ccatacacgc caagtactac gagaacgact 2400 tcgtcggttg tgaaaccagt ttcccgaata acggccgtgg gccgtgcgaa cagatcgtcg 2460 ctccgcagtc cgtcaaggtc aaccgacgcg ttgtgagcag ccaaaaagtt cgcctactac 2520 gattgcttgc aggctggcag cggagcaaga ttccgagcaa cccgccgtac agttactcca 2580 cagtgcggat gtcccgaagc tgtaatcggc gctacaatgg cgattccacc tactcacgag 2640 gatgccaccc ccgcaaaccg attatccaag acgtcagtgg acgaagcctg atgaacgatt 2700 ccgaaccagc ccatcgaaag tactccgttg gcgaacgcac gctgggaaca aaaagcggcc 2760 aaccgctcaa cacgatgacc acaccaagct gttccaccga gaccatcgtc actggatcaa 2820 acaagcctgc acgctcaaca ctccgtttcg acatcatcca cgccaacctg atcagcaaaa 2880 atgttgaatc gatgcttcaa cggggggagt a 2911 // ID TSRP1 repbase; DNA; INV; 1613 BP. XX AC X06625; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Trichinella spiralis repetitive DNA. XX KW Repetitive sequence; TSRP1. XX OS Trichinella spiralis OC Eukaryota; Metazoa; Nematoda; Enoplea; Trichocephalida; OC Trichinellidae; Trichinella. XX RN [1] RP 1-1613 RA Dick A.T.; RT "TSRP1."; RL Direct Submission to Genbank (16-JAN-1988)Dick T.A., University RL of Manitoba, Z320 Duff Roblin Building, University of Manitoba, RL Wpg. Mb.,, Canada, R3T 2N2.. XX RN [2] RP 1-1613 RA deVos T., Klassen R.G. and Dick A.T.; RT "Sequence Analysis of a 1.6kb repetitive element from a porcine RT isolate of Trichinella spiralis."; RL Nucleic Acids Res 16, 3114-3114 (1988). XX DR GenBank; X06625; Positions 1 1613. XX SQ Sequence 1613 BP; 459 A; 311 C; 277 G; 566 T; 0 other; gaattctgaa ttgcaggttc gatgtctgac ctttaaaaca acctcctcga tggattattc 60 ttcatgatat tgcaacttct atgggttagc taatgtcttc tattctgcta ctgctaacac 120 ttcgacgaat atttctttag gattcttctt gcagaacagt gcgaatcatt tgattatatc 180 gattttaata aatgaaaagc ttgtaaagcg gtggtgcgta ttccattctt gagccacacc 240 attaatgtga ttggctattt attttgcaac ctattttatc gaatttcatc tctcaatttt 300 acacattcat acctatgctt ttccaccggg aatggtatta atcgcttcac cgccatgtct 360 gtaacaccag taaatactcg tgaagctttg ctaatcattg catttcacag cgccctaacg 420 aatatgagcg gtttgtcgat gtgtaaacta taataagggt tcttcttacc aattttccgg 480 atttgtacct gtgtttctcc gcacacatgt ttggtaatgg ttcctctcta tggaatgtaa 540 tactctctaa tatagtcgag cattgttaat cattccattt cagagcgctc caaggaatat 600 acacggtcaa tcggtacaca aactctggta agcagttact cttaccaata ttgcggattt 660 ctaccagtgt ttctccccac agatgtcagg taatgttgcc tctctatggc ctgctatact 720 ttctaagata ggtaaacatg ttaatctttg catttcattt cagacagcac caaggaatat 780 gagcggttgt tcggtatgca aactctggag agtaattcct gactcttcct gtgtttctac 840 gctcagaaat cagctaatgt tcctttgtat tgtctgtaac agcagctgac atgtcaaagc 900 catgttaatc aatttcattt tacttcacca ccaaaagcat gatcaatcat tcgttattca 960 ccatcttgaa tgctttttcc tgcagggatc ttcggcacgt atgacttgtt attccacaga 1020 aaataattat gatcgtatga ctgcagtgac gagaatttca gcttcactgc acagcagttg 1080 taatacagcc tgttacattt taaaatcgac agttactacc cacttgcagt tcgttttatc 1140 attggtaata ttacagaact tgaaaatcat taatgtctac tgtaagatta ctgtatttta 1200 aaaatagaat tatttatgaa aaataatacc attttgtgta tttgattaaa ttgcttatag 1260 ttgatatatt gcaattgtta cagtgtctag agtaaaaatt ccatgtaacc aaacgacagt 1320 tttcgtttcg tttaaaatca actgtttgcg tcagatactt cgtattggat gaagattgtg 1380 cagtttttga ttcagtcaga ttttacgctt aaatagtcac cttaattgat ttttctatag 1440 tgtgcagttg atgcggttta aaaaaggaac cagtataata ccaaaaaaca tgatttgatc 1500 agtatgatct gcaagaaaaa cgttgattcg tgatatgatc gttcatgtag tttgagttga 1560 agctacttca attatcacgc agtgtatgaa cgacattccc cgcacctgaa ttc 1613 // ID R1-4_BM repbase; DNA; INV; 3196 BP. XX AC . XX DT 25-APR-2010 (Rel. 15.07, Created) DT 25-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-4_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3196 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1050-1050 (2010). XX DR [1] (Consensus) XX CC ~90% identical to consensus. XX FH Key Location/Qualifiers FT CDS 759..3194 FT /product="R1-4_BM_1p" FT /translation="MSRVCDAAMPRVRALAPKRRVHWWTEEIASQRRLCDV FT SRRAYQRYRRRRTRRDPDEEDRLYEVYRTAIRALRVAIGEAKEAAWNDLLA FT SLDRDPWGRPYRLARNALRPWAPPATSTLPPETLRRVVGGLFPDXXGTAFV FT PPVMTTARVADGGEPEDVSQAEFDSAVQRMRAKRTAPGPDGISSRAWALAL FT TGDGLGPALRGLFSRCLREGRFPEPWRTGRLVLIPKEGRPRDEPSGYRPIV FT VLDEAGKLLERVVAGRLVQHLESVGPDLAPNQYGFRRGRSTVDAVLRVRHL FT SDRACSGGGVLLAVSIDIANAFNTIPWSTIVESLRFHRVPTGLRTLIEDYL FT SGRXVVFPERRGWGRKAVSRGVPQGSVLGPLLWDIGFDWVLRGASLRGVDV FT VCYADDTLVTARGDDYRXAAILAAAAVATVVSRIXRLGLEVALHKSEAVCF FT HPARRGPPPGAXLVVGGVSIAVQPKLKYLGLVLDSRWSFDHHFGELVPRLL FT GTAGALARLLPNVGGCSAGVRRLYLGVVRSMALYGAPVWSPALSARNAALL FT LRAQRALAVRVIRGYRTISREVACALAGSLPWDLEAEVLAAVYRRRTQSPS FT RGRTPGPSAVGRWRRAARRLAYAKWRERLLEELGRTSATRRRTLEALVPVL FT EAWSDRRHGVLSFRLTQVLSGHGCFGRYLWRVCGREPHPGCHQCGHPDDDA FT QHALEACPRWEHPRRDLVAVLGXDLSLRAVVARMVEDGSSWGAMAGXADLV FT MASARLRSAPCATVVSWNRRXRXXXXXSXRSSXLEAPESDIARSVAPLDRV FT SVNGFPSXKK" XX SQ Sequence 3196 BP; 436 A; 994 C; 1157 G; 577 T; 32 other; gaccgcagcc gngattgcgc taccggtgcg gccagaccgg ccacgtggcg gccggctgct 60 ctctcgcgcc acactgcgct gtatgtgccg ctgccggcag accggcggac cacgtctccg 120 ggagcaaggc ttgtgccaag tccccgagac cgagacttag aagaggagct tccgctgcac 180 ttgaaaatgc acggaggcag cccggtacgg ctatggagac atcgaatgac gccctgttat 240 gacaggtgtc cttcgattcc tgcaggggaa tctcaatcac tccgccagag ctcaggacct 300 tttgttccag agcatggcgg aggggttgac ccatcttgcg gtggtcgccg agccgtatcg 360 ggtcccttcg agccccgatt gggcggccga tttggagggc cgagtggcna tcatccggcg 420 ttgttgtgtg ggtgctccgc ccaggttcga cgttgtcgag agaggtcgcg gcttcgttgc 480 cgtcctctgg gccgaggtat tcgtgttggg agtgtacttt tccccaaaca ggacgctcgc 540 cgagttcgag gttttcctca gcgagctcag ccgcgtcgtt gggaggtcgc actcccgacg 600 gatnctcgtt ctcggggacc tcaacgcgaa gtcattggct tggggttcct cgaggacgtg 660 ccccagaggt agggcggtgg aggagtggct ggtcggaagc ggtctcctcg tcctcaatcg 720 cggacgtgtg cgagggggtg gagcgtctgc gcgaggcgat gtcgcgggtg tgcgacgccg 780 ctatgcctcg cgttagagct ctcgctccta agcgccgagt ccactggtgg accgaggaga 840 tcgccagcca gcgccggctg tgcgacgtca gtcgtcgcgc ataccagcgg tatcgacgnc 900 gaagaacgcg ccgggacccc gacgaggagg accgcctgta cgaggtgtac aggacggcca 960 taagggccct gcgcgtggct atcggggagg cgaaggaggc cgcctggaac gacctactgg 1020 cctcgctgga ccgtgacccg tgggggcggc cctacaggct ggcgcgcaat gcgctccgcc 1080 cttgggcccc ccccgcgacc agcaccctgc cgccggagac attgcggcgg gtagtcgggg 1140 gtctgtttcc ggatttnncc gggacggcct tcgtcccccc tgtgatgacg acggcgcggg 1200 ttgctgatgg cggggagcct gaggacgtct cgcaggcgga gttcgattcg gccgtgcaga 1260 ggatgcgggc gaagcgcacg gcgcccggtc ccgatgggat ctcgtcccga gcgtgggcgc 1320 tcgccttgac gggcgacggc ttggggcctg ccctccgagg gctgttcagc aggtgcctcc 1380 gtgagggcag gttcccggag ccatggagga ctggtcggct cgtccttatt cctaaggagg 1440 gtcggccacg tgacgagccg agcgggtacc gcccaatcgt cgtgctggac gaggccggta 1500 agctcctcga gcgcgtcgtc gccggtcgcc tcgtccagca cctggaaagc gtcgggcctg 1560 acctggctcc taaccaatac ggtttccgga ggggtcgctc gaccgtggac gcggtcttgc 1620 gcgtccgcca cctctccgac cgtgcgtgct ccgggggggg cgtgttgttg gctgtgtcga 1680 tcgacatcgc caacgccttc aacacgatcc cctggagcac gatcgtggag tcgctncggt 1740 ttcaccgcgt ccccaccggt ctccgcaccc tgatagagga ttacctctca gggcgggncg 1800 tggtcttccc cgagcggagg gggtggggac ggaaagcggt gtcgcgcggg gtcccgcagg 1860 ggtcggtact gggnccgctc ctgtgggaca tcggtttcga ctgggtcctc cgcggggcta 1920 gcctgcgtgg cgtcgacgtg gtntgctacg ccgacgacac gctggtgacg gcccgcggag 1980 acgactaccg gnccgcggcg atcctggctg cggcggcggt ngcnaccgtc gttagccgaa 2040 tanggagact cggcctcgag gtggccctcc acaagtccga ggcngtgtgc tttcacccgg 2100 ctcggagggg gccccctccg ggcgcgantc tcgtagtcgg nggagtctcg atcgctgtcc 2160 agccgaagct caaatatttg ggcctcgtgc tggacagcag atggagcttc gaccaccatt 2220 tcggtgagct ggtcccgagg ctgctgggga cggcgggtgc gctagcccgt cttctcccca 2280 acgtcggtgg ttgcagcgcc ggtgtccggc gnctgtacct gggggtcgtg cgcagtatgg 2340 ccttgtacgg cgctcccgtg tggtcgcccg cactctccgc gcgcaacgcg gctctgctgc 2400 tccgagctca gcgggcgctc gcggtgaggg tcatcagggg gtaccgtacg atctcccggg 2460 aggtcgcctg cgccctcgcc ggttcccttc cttgggatct cgaggccgag gtattggctg 2520 cggtataccg gcgcaggacg cagtccccga gtcggggacg gacgcccggc ccgtcggctg 2580 tcggtcggtg gaggcgtgct gcgcgtcgtc tggcgtacgc caagtggagg gagcggttgc 2640 tggaggagct cggccgaact tcggccaccc gccggcgcac cctcgaggcc ctggtgcccg 2700 tgctggaggc atggtcggac cggcgacacg gcgtgctctc cttccgcctc acgcaggtcc 2760 tctcggggca tggctgcttc gggaggtacc tgtggagggt ctgcgggaga gagccgcatc 2820 cgggctgcca ccagtgcggg catccggacg acgacgctca gcacgcgctc gaagcgtgcc 2880 cgcggtggga gcacccgcgg cgagacctcg tcgcggtgct gggggnggac ctctccttgc 2940 gggccgtcgt cgcccgcatg gtagaggacg ggagttcgtg gggggccatg gccgggtntg 3000 ctgacctggt natggcgtcc gcgaggctga ggagcgcgcc gtgcgcaacg gtagtctcgt 3060 ggaatcgcag ggngagggnn cntnggnctt ngtccngtcg gtcttcnnct ctggaggcgc 3120 cggagtccga catagcccgg tctgttgccc ccctcgaccg ggtatccgta aacggattcc 3180 ccagcnttaa aaaaaa 3196 // ID Gypsy-40_AA-LTR repbase; DNA; INV; 417 BP. XX AC AAGE02031144; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_AA_; KW Gypsy-40_AA-I; Gypsy-40_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-417 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02031144; Positions 65362 64946. XX SQ Sequence 417 BP; 156 A; 51 C; 83 G; 127 T; 0 other; tgtagtaatc taggcgataa ttaaaaataa taataataat aaatgtttat gctagacaaa 60 agtttctttg tttaatattg caaatattgt tctgataaat ttcataaagt caaagtgtct 120 tgacctagaa atataaataa tttattattg ttcatcttgc acattgaaca tttatttttt 180 taaagtgtat tccacgagaa ctttctgaag gggaaaacag aaagagagaa aacgcaaaga 240 tgaaacgcca agttcgaatc ggatgccggt tgggagaggt aaaaggagaa aggaatgaag 300 cgcggcattc tgtgaacgat tttaaacgcg aaggttcggg tattttgcag ctccgaaaaa 360 atattgaaaa atacataaat tcgtgtaatt ttctcgatta cggttgagaa tacgaca 417 // ID Gypsy-82_AA-LTR repbase; DNA; INV; 151 BP. XX AC supercont1.148; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-82_AA_; KW Gypsy-82_AA-I; Gypsy-82_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-151 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.148; Positions 1978704 1978854. XX SQ Sequence 151 BP; 43 A; 26 C; 29 G; 53 T; 0 other; tgttgtaatc ttgtaatcga tgtaatttta cttcggtaag ccccttaact acatgtaagc 60 aaagcgttat tattggaccc cttgaataaa gttagttgaa gatttctagc gtaacagagc 120 acgcgtgttt tatttctggt ataccgaatc a 151 // ID BEL-217_AA-I repbase; DNA; INV; 5924 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-217_AA_; KW BEL-217_AA-LTR; BEL-217_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5924 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 891-891 (2011). XX DR [1] (Consensus) XX CC Positions [4581-5180] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1074..3341 FT /product="BEL-217_AA-I_2p" FT /translation="MQLQEEIGRSTLRPMENVKFKDLTEFIQRRVTVLQSI FT QPKVVDTPPSVQGKKPAQRPVSSHGANQLSPRKCLVCSDHHPLYLCGSFAK FT LSAEDKEKEVRRHQLCRNCLRKGHQSRECSSSTSCRKCRGRHHTLLCVNGL FT SSQSTSKPLSAPQSKLPTSSSEDSDAPITSASATRMESSSCPSAGHQQKTV FT LLATALITLVDDSGVEHIGRALLDSGSECCFITERLSQRMKAQRQKIYLPI FT SGIGQSGTHAKQKFSSIIRSRSGEYSTTIDLFVLPKVTIDLPATSVDTSTW FT NMPPGIQLADPSFDGTKPVDIILGAEIFFELFRVPGRIPLGEHLPVLVNSV FT FGWVVSGRSTERSFNPTVVANVATIADIHQLMERFWVIEEDSSPTTHSVEE FT QACEKHFRRTVSRTPEGRYVVRLPFKENVLDRLNDNRRTAVRRFHLLQGRL FT DRNPELQQQYKAFIDEYVALGHMHRIHEYDDPTVKRFYLPHHAVLREDSAT FT TKLRVVFDASCKTPAGPSLNDALMVGPTVQEDIRSITMRSRKHQVMVVADV FT KMMYRQVLVDPRDTPVQLIVWKPSPDQPLETYELDTVTYGTASAPFLATRV FT LIQLADDECSNFPLAAPVVKKDFYVDDLFSGGKHAEEVIELRNQLEALLAK FT GGFQLRKWASNNEAVLDGIPPENRALSTSIELDPDQTIKTLGLHWEPFADC FT LRYKVELPSDSSTHPLTKRLASLSNRATIRPAWPCGASCDDRQGVHAKSVD FT LEGRR" FT CDS 4581..5555 FT /product="BEL-217_AA-I_3p" FT /translation="MAELPSSRITASRAFSITGVDYWGPILLKPVHRRAAP FT GKAYVAVFICFSTKAVHLELVGDLSTAKFIQSLRRFVSRRGLCKHLYSDNG FT RNFVGAANELKRLVNNKDYQNAVAQECTAHQIRWHFNPPKASNFGGLWEAA FT IQSAQKHFVRVLGKNTLPYDDMETLLCQIESCLNSRPLTPLSDDPSDYESL FT TPGHFLIGSALQSIPDVDYTEIPSNRLTKWHHVQKLFQQLWERWHLEYLSS FT LQQRTKWLSPPVPIQKNQLVLLCEENIPPMHWPTARIEQVHPGSDGIVRVV FT TLQTPAGSFVRPVNKICILPIASSLDQQSATSN" XX SQ Sequence 5924 BP; 1533 A; 1687 C; 1299 G; 1405 T; 0 other; actggtcctt cgaaccggat cacgtcgaat ctccgtaagt ccggcagtcc tgtgctagcg 60 agtcgctatc atctacccgc catcgtgccc gccggagggt tccagaacaa tcgacactcg 120 tctccattcc tacaagcccg agatactgtt tgctttgacg caactcggtt ggattctgat 180 catacaaggc aatttaattg cctctagcag gtaattaaca ctccaactac tctactttct 240 gctctgtgcg tgctacgctt gtccatttgt ccacagaacc ccttctgttg ccgtaatgtc 300 caccgaacgt cgcattaaga cactcaaagg tcggataaag agtctgacgg catctctcac 360 ctccatcaag accttcgtag atgggttcga cgaggacacg caatccgaag agattccagt 420 ccggctggag agtctctgca gtttatgggc ggagtacaac acgacgcaaa gcgaactgga 480 agggctcgac gaaaccgctc tcgattccca cataaagcaa cggacagcgc tagaagcaac 540 ctactatcga atcaagggct tcttactggc ccagaataaa actcctttaa atcgaagcct 600 ttcatctacc cctaattctg acatacaagc ccctttgtcc acatcgcagg tgcgattacc 660 ggatatcaag ttgcccatat tcgatggaag ggtggagaat tggcttgtct tccacgattt 720 gtatctttcg ctggtccact catcgactgc actctcgaat atccagaagt tttattatct 780 acggtcatcg ctctcgaact cggctcttca aaaaatccaa tcaatcccaa taagtgccga 840 taactatgcg gtagcgtgga atatgcttct gaaagagtat caaaaccaag cgcgtttgaa 900 gcaggcctat gtcgatgcac tcttcgactt ttcagtgctc aagcgtgaat cggcatttga 960 actccgcagt ttagtggagc ggtttgaagc caatgtcaaa gttctgcacc agctgaacga 1020 gagaacggag cactgggaca tcttgctcgt ccgtatgctc agcacccgcc tcgatgcaac 1080 tacaagaaga gattgggagg agcactcttc gtcccatgga gaacgtcaag ttcaaggact 1140 tgactgagtt cattcagcga cgcgtcaccg tacttcaatc catccaacca aaggtcgtcg 1200 atactccgcc atctgtccaa gggaagaagc cagcccagcg tcctgtttct agccacgggg 1260 ccaatcagct cagtcctcgg aaatgcctcg tttgttcgga tcaccaccct ctatacttgt 1320 gcggaagttt tgcaaaactg tcagctgagg acaaggaaaa ggaggtacgt cgacaccagc 1380 tgtgtcgaaa ctgtttgcgc aaaggtcatc agtcccgaga atgctcctcg tccaccagct 1440 gtcgcaagtg ccgcggacgg catcatactc tactttgtgt caacggttta tcttcacaat 1500 ctacgtcgaa acctctaagc gccccacaat caaaactacc aacatcctcg tctgaagact 1560 ctgatgctcc aattacctct gcatcagcta cgcgcatgga gtcctcgagt tgtccatccg 1620 ccggccatca acagaaaacc gtccttctgg ctacagcgtt gatcactctc gtcgacgaca 1680 gcggcgtcga acacatcgga cgcgctctgt tggactcggg cagcgagtgc tgctttatca 1740 cggaacggtt gtcgcagcgc atgaaggccc agcggcagaa aatctacttg ccgatcagtg 1800 gaatcgggca gtcaggtaca cacgccaagc aaaaattctc ctccatcatc cgctcacgaa 1860 gtggcgagta ctccaccact atcgatctgt tcgttcttcc caaagtaact attgacctac 1920 cagctacatc cgtcgatact tcgacatgga atatgccacc tggcattcaa ctagcggacc 1980 cctcattcga cggcaccaaa ccagtagaca tcatacttgg cgctgaaatc tttttcgaat 2040 tgttccgcgt cccaggtcga attcctctag gcgaacatct cccagtactc gtgaattcgg 2100 tttttggatg ggttgtgtct ggaaggtcca ccgagagatc cttcaatcca acagttgtcg 2160 ctaacgtagc caccatcgcc gatattcacc agctcatgga aagattctgg gttattgagg 2220 aagatagttc acccacaacg cattctgtcg aagagcaagc ctgcgagaag catttccgcc 2280 gtactgtctc acgtactcca gagggtaggt acgttgttcg cttgccgttt aaggaaaacg 2340 tcctcgatcg actgaacgat aaccgaagaa ctgctgttcg tcgctttcat ctcctacaag 2400 gtcgtctgga ccgaaatccc gagctacagc aacaatataa agctttcatt gacgagtacg 2460 tggctctagg tcatatgcac cgcatacatg aatacgacga tcctaccgtc aaacgcttct 2520 atctgccgca tcacgcggtg ctccgtgagg acagcgccac cacgaagcta cgggtagtct 2580 tcgatgcttc gtgtaagact ccagccggtc catccttgaa tgatgcgctc atggtcggac 2640 caacggttca agaagacatc cgatcgataa ctatgcgatc ccggaagcat caagtcatgg 2700 tagtcgccga cgtaaaaatg atgtaccgtc aggtactcgt agatccacga gatacacccg 2760 tccagcttat tgtatggaag ccgtcaccag atcaaccgtt ggagacatac gaattggaca 2820 ccgtaacgta cggtactgcg agcgcgccat tccttgccac tcgcgtactg atccaactgg 2880 ccgacgacga atgttccaat tttccactag ccgcacctgt tgtaaaaaag gatttctatg 2940 tagatgatct gttttcaggt ggaaaacacg cagaagaagt aattgagctt cggaatcagc 3000 tagaagctct tctagcaaag ggtggatttc aactacgcaa atgggcatct aacaacgaag 3060 cagtactcga cggaattccc cctgaaaacc gggccctgag cacctcaata gaacttgatc 3120 ccgaccagac tataaagacg cttggactcc actgggaacc tttcgcagat tgcctccgat 3180 acaaggtcga actgccatcg gattcatcta cccatcctct aacaaaacgt cttgcctctc 3240 tctctaatcg cgcgactata cgacccgctt ggccttgtgg ggccagttgt gacgatcgcc 3300 aaggtgttca tgcaaagtct gtggaccttg aaggacgacg atgacaaacc gtggggctgg 3360 gatagagagc tgccgaagga gtaccaaaca cgttggatga agtatcagtc gttgctccca 3420 aaccttaaca aacttcgcat cgatcgttgc atcctccttc cagaaccaga aatcatccag 3480 attcacattt tctccgatgc gtcgaaactt gcatacggtg cctgcgccta cgttagatct 3540 atcaccgctg atgggctagt gaaggttgct ctactctccg ctaaatcccg cgtagcccca 3600 ctacagagtc gtagcattcc ccgtcttgag ttgtgcggcg cactattggc cgccgaactt 3660 tacaagaaag tccaatcgtc attacaactc gatgcagaat cctttttctg ggtcgatagc 3720 actgtcgttt tatgctggct caacgcttcg ccttcgacat ggaacacgtt tgtggccaat 3780 cgtacttcca aaattcagct agccacccca aacagtgcat ggaatcacgt tgctggtggg 3840 agaaccctgc tgactgcctt tccgcggtct cactgccgat gccatcatta acttcgacct 3900 gtggtggcac ggaccccaat ggctacaaca accccaacaa tctggtcgat aacccatccc 3960 acctgaacaa tcatcagaag cattgaaaga gatacgtcga tcgtccgtag ccgtcccatc 4020 aaccccaacg gatccatcct tcatcgacat catcgttgga aaattctcca acttccatcg 4080 cctcattcga gtagtcgctt actgtcaacg attccttcgc aactgtcgaa gttccctcgg 4140 gcgagagatc tatcgcaaca atcctcaacg tagatgagct actggaagct gaacactgca 4200 tcatccgttt agttcaactt caatccttca gccaagaatg gagtcagctg aaaaataatc 4260 tcccagtatc gtcaaaatcc cgtctgaaat ggtttcatcc gtttgtatcg tcggagaacg 4320 taatccggat tggtggccgg ttgggaactg ctgttcagcc atacgatgcc aggcaccaaa 4380 ttcttctgcc acgatcacac ccattttcgc ttctcctagt ccgcaactac cacgaacgtc 4440 accttcatgc agccccccag ttgctgctca acctgcttcg acaacgatac tgggtcatag 4500 gggccaggag tctggccaag aacgttgtcc acaactgcat cgtttgctgt cgagcacgcc 4560 cacgcctact gcaacaattt atggctgagt tgccatcatc acgaatcact gccagtcgag 4620 cattttccat aacgggagta gactactggg ggcccattct actcaaacca gttcatcgcc 4680 gtgctgcacc aggaaaagct tatgttgccg tcttcatatg cttcagcacc aaagcggtgc 4740 atttggagct ggtgggggat ctaagcactg ccaagttcat ccaatccctc cgccgattcg 4800 tatcacgccg aggattgtgc aaacacctgt acagtgataa cgggagaaac tttgtgggag 4860 ccgccaacga gctaaagcgg ctcgtgaaca acaaagacta ccagaatgcc gtcgcacagg 4920 aatgtaccgc gcatcaaatt cgctggcatt tcaacccgcc gaaggcgtca aactttggtg 4980 gcctctggga ggctgccata cagtctgccc aaaagcattt cgtccgcgtc ctcggaaaaa 5040 acactctacc gtacgatgac atggagactc ttctgtgtca aattgaatcc tgtttgaatt 5100 cgcgtccact tacgccattg agtgacgacc catcggacta cgaatcgtta acgccaggtc 5160 attttcttat tggatcagca ctccaatcga ttcctgacgt cgattacacc gaaattccat 5220 caaatcggct taccaaatgg caccacgtcc aaaaactgtt ccagcagcta tgggaaaggt 5280 ggcacttaga gtatctttcg tcactacaac aacgaaccaa gtggctttct cctcctgttc 5340 cgatccagaa aaatcaactc gtcctactct gcgaggaaaa catcccaccg atgcattggc 5400 caaccgcaag gatcgagcaa gttcatcccg gctcagacgg catcgtcagg gtagtaaccc 5460 ttcaaacacc tgctggaagt ttcgtccgac cggtaaacaa aatttgcatc ctgccgatcg 5520 catcatcttt ggaccaacaa tcggcgactt caaactgaac gcactcacaa caaaacagca 5580 tcacaacaga agcctcatcc agaaaatttt cgcatccaat actgtcgcaa tccattttca 5640 accagcaccg catcatctgt cctgcaagag tacctaaaag caaagaatgc aaggcatatt 5700 attagcctgg atcaggtaag cacctgtcca aattatttaa atttcgcgaa acccgctacc 5760 gggtcttttg ctttccagat gtcatcctcg attgcatcat gccacttgat ttgacgtttt 5820 ccaatcaaaa caacgaagtt catcgatcgt gccgcaagcc ccgagagctg tcatccacat 5880 cgtcgtgaaa ggtgtcagaa acaacgttcc tgaaggggcc agca 5924 // ID RTEX-1_SK repbase; DNA; INV; 6177 BP. XX AC . XX DT 26-FEB-2010 (Rel. 15.02, Created) DT 26-FEB-2010 (Rel. 15.02, Last updated, Version 1) XX DE Hemichordate RTEX-1_SK autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; RTEX-1_SK. XX OS Saccoglossus kowalevskii OC Eukaryota; Metazoa; Hemichordata; Enteropneusta; Harrimaniidae; OC Saccoglossus. XX RN [1] RP 1-6177 RA Kapitonov V.V. and Jurka J.; RT "Young family of RTEX non-LTR retrotransposons from the acorn RT worm."; RL Repbase Reports 10(2), 242-242 (2010). XX DR [1] (Consensus) XX CC The consensus sequence was derived from multiple alignment of CC four copies less than 1% diverged from each other. The 3' CC terminus is composed of the (TTA)n microsatellite. XX FH Key Location/Qualifiers FT CDS 478..2502 FT /product="RTEX-1_SK_1p" FT /note="esterase domain." FT /translation="MGDSRNSISLRSKENVQNPGEMRKFIERYFGKKVEIK FT FNHNSVTIRSNTNMKEVWMKAIKAVFTEDKYTYIKKGEVKSEQLRIADGDN FT TVTIKMYDNGTLLIQGNNYNQWMETVGREMFSKIQELLGLNVNFESAEVTE FT DTVSKNTRARRDPKKTIRFNVEEYELEASTIKPQKTHYDESNTKNDSQSKG FT PTSTKQRPTTPTRSTKPNIANTPSRIPCRISKTPIVNNGGSLDNTELAATL FT SQMFKSIQQIDKNYCNIMRSGSITQELTDKLMKVEDNLAKQSRDIQKINKL FT LENKLRKDEATKPGTGTTIQNGETIATLMSKIDRIERSITSIEKSFINNDK FT AISKITALQESTKSCVEKEKEILTKLQQDRHYLQQQLEKRGEEIDILQSEN FT DKLRKDIRELRHDTQDNYTRNQYSVLAPEDNSDDSQTTGNTSTNSEFTTVT FT KKIRSSRKVQTLIIGDSMLKDINARGVYRNTQVKTLRGRKIKDITDHIQKA FT SIVKEVENIVIHVGTNNLADKHKVDECMEEYTTLIDTIREKNPTVKISISS FT VIHRKNDVATNKIIDNFNSRLKKINGNNVHFINNNNITSETHLIDAVHLNK FT RGTAELVKNIKDTIGPLIGVNRQEKTNHRYNNARHRHNRSYDTPKEVHRPR FT NNKEQYRPRDNEEQYRPRDNDNTTDQ" FT CDS 2620..6054 FT /product="RTEX-1_SK_2p" FT /note="APE and RT domains; CHCH zink finger." FT /translation="MCCQKTRKNINISQRRSLSLSYWNIQSLARKTQENDF FT IKDINCHDIVGLGETWLDKHSASSFNIPGYYNYSITRNNHTGGGLTILIKQ FT DIKQGVNIIENTSSEDKFWCKQKKNFFGQLKDTYLCFVYNPPLNSKHCNPD FT FFEHLEKDISKFSQFGNIVILGDFNSRTGMLKDYADFDDTFLPLPFSPDNN FT DTITVHNNRDTTVNAYGKALIDLCKNCDMRILNGRMPGDSSGNITCYHYNG FT VSTVDYALSSCDIIDYFDYFKIHDFTYLSDHCQISCQLKLNKIIKLHKTNN FT NDLHNLPKSWVWDKLSVFNYQDNLASTHSQHKINSFLTNKYTLDSKGINDA FT VNNFNSLITESADRTFNKKSNKSKKQKPKKKKWFDRDCMVLKKDLVKTAQL FT VNKFPFDTRLRDLFYRRKKKFKKLIKLKKKQEENLIVNKLINLNNTNPNQF FT WKVINQLRESTHEQASHISPEEWNRYYSKLNACETNNNYHNNIISEITSLQ FT NREPLATKLNSTITDEEIKLAIKKLKSKKAAGYDGYCPELLKQSSDFLIKP FT LQKLYNLILDSGIYPDIWNISIISHIFKKGDRAEPGNYRGISVSSCVGKTF FT CVIMNNRIQQFMDKNNILHSNQAGFRKGFRTTDHIFTLKTIVHKYLKQNGR FT LYTCFIDFQKAFDTIWRNGLFYKLLNSGIDGKIYNIIKYMYLNTKACIRFG FT NKTSSVFETTKGVKQGCVISPMLFNIYINDLNDVLQDSSIDHVSLNDTKIS FT SLFYADDLVLLSTTNYGLQKALDILSDYCSNWKLSVNLTKTKIVIFRKRGL FT KTHDKFLYLGNHINCVNSYNYLGVTMQSNGKFTEARQNMADKANRALYAML FT RHIPPECNCKTFIYLFRTCILPVLIYGSEIWGIEVLDENKWDKTTSEKLHL FT KFCRNILGLNKFGANLACRAELGCFPLLIDTKKNLIKYWLRIQGLPDKCLT FT KQAYYEQMKFNLPWCSSVQNLVIDSRIQTSKTDLNIKNIDKTLKENYITFW FT KQNVNQENTKLRTYAKFKNTYNFESYLNCLPKKHQVCMSRFRTSNHCLAIE FT KGRHTQPKTPIEKRICNLCSQNKIEDEVHFLLECDRYKYIRHELLTEITLT FT NAGDKLSNFIFLMSNKDSKAIKCMGTYLLKAMEYRKL" XX SQ Sequence 6177 BP; 2440 A; 1040 C; 1021 G; 1676 T; 0 other; atatggccga cttcctgtca cagcactgct attgagctcc agaatttcac aaatttttaa 60 cctaccatag ttacatctac cactacaaga acttgtgaag aagtctatga atgtatgtta 120 catcacctta tatgttatct gatgttagtg atcacttcaa aaagcaaaga tctggtgttg 180 aagaacttca tatattggta ggtcaagtcc aaacaataac agtgctatca aaaagtgtgt 240 aacatctcag aggtgactcc tggaaatcca tgttattcaa ccaacatagt tgattgacac 300 taaagaagtc acaaagtttg attatacagt cctctggttt tagttagaca ctggttcttg 360 tgttgaatat ccaagtcaag gctacagtgc atgagttaac attgtttaca tgaagaggct 420 gatattgttt acactaacaa gctgataatc acattacaaa ggaaggtcaa tctcaaaatg 480 ggtgattcta ggaacagcat aagtttaagg agtaaagaaa atgttcaaaa tccaggagag 540 atgagaaagt ttattgaaag atactttgga aagaaagttg aaattaaatt caaccacaac 600 agtgtcacca tacggtcaaa cacaaatatg aaagaagtgt ggatgaaagc gatcaaggct 660 gtattcactg aagacaagta cacttacatc aagaaaggtg aagtaaaaag tgaacaactc 720 cgtattgctg atggtgacaa cacagttact attaagatgt atgataatgg cactttacta 780 atacaaggaa acaattacaa ccagtggatg gaaacagtgg gaagagagat gttctccaaa 840 atacaagaac tgttgggttt aaatgtgaat tttgaatcag cagaagttac tgaagatact 900 gtatcaaaaa atacaagagc aaggcgtgat cctaaaaaaa caattagatt caacgttgag 960 gaatacgaac tagaagccag cacaattaaa ccacaaaaga cacactatga tgaaagcaat 1020 acaaaaaacg attcacaaag taaaggacca acatcaacca agcaaagacc tacaactcca 1080 acaagatcta ccaagccaaa tattgctaac actccatcca gaatcccatg ccgcatctcg 1140 aaaacaccta ttgtaaataa tggaggatcc ctggacaaca cagagctagc agcaactttg 1200 tcacaaatgt tcaagtccat tcaacaaatt gacaaaaatt actgtaatat tatgagatcg 1260 ggctcaatca cacaggaatt aacggacaaa ctaatgaaag tggaagataa tctagccaaa 1320 caatcaagag acatacagaa gataaataaa cttctggaaa ataaattgag aaaggatgaa 1380 gcaactaaac caggtactgg cacaactata cagaatgggg aaaccatagc cacactgatg 1440 agcaaaattg acagaataga gagaagcata acaagtattg agaaaagttt tataaacaac 1500 gataaagcca tatcaaaaat aacagctcta caagaatcca ccaagtcatg tgttgaaaag 1560 gaaaaagaaa tcctaaccaa gctgcaacaa gacagacatt acttacaaca acagcttgaa 1620 aagagggggg aagaaattga catattgcaa agtgagaatg acaaactacg gaaagatatc 1680 agagaactga ggcatgatac acaagacaac tacacaagaa accaatacag tgtacttgca 1740 ccagaagaca acagtgatga tagtcaaact accggtaaca cctctaccaa cagtgaattc 1800 acaactgtca ctaagaagat cagatcatct agaaaggtac aaacattaat aattggtgat 1860 tcaatgctaa aggacattaa tgctagagga gtttatagaa atacacaggt taagacatta 1920 cgtggacgga agataaagga cataacagat cacatccaaa aagcatctat tgtaaaagag 1980 gttgaaaata ttgtgattca tgttggcacc aacaacttgg ccgataaaca caaagttgat 2040 gagtgtatgg aagaatacac caccttgatt gatactatca gagaaaagaa ccccacagtc 2100 aagataagca tatcatcggt tattcacaga aaaaatgatg tagctactaa caaaataata 2160 gacaatttca acagtagatt gaagaagatt aatggaaata atgtacattt catcaacaac 2220 aacaatatta cttctgaaac tcacttgatt gatgcagtac acttaaacaa acgtggaaca 2280 gctgagctgg tgaaaaacat caaagacact attgggccac tgattggagt caacagacaa 2340 gaaaaaacaa atcaccggta caacaatgca agacatcgtc acaacagaag ttatgacaca 2400 cccaaggaag tgcacagacc aagaaacaac aaggaacaat acagaccaag agacaatgag 2460 gaacagtaca gaccaagaga caatgacaac actacagacc aatagatcca gttccatttc 2520 atatgcagtt tcctttaaca tcgttccttg gctactcacc accttcacat tggccaggaa 2580 tgtacggtca tagaccacac ttccccaatt agatttaata tgtgctgtca gaaaactagg 2640 aagaatatta acatttcaca acggagatca ttatcattgt cttattggaa catacaaagt 2700 ttagccagga aaactcagga aaatgacttc attaaagaca ttaattgtca cgacattgtt 2760 ggtcttggtg aaacctggct tgacaaacac tctgcatctt cattcaatat acctggatat 2820 tacaattact caataacaag gaacaatcac acaggtggag gactaacaat cttaatcaag 2880 caggatatca aacaaggagt gaacatcatt gaaaatacat cttctgaaga taaattctgg 2940 tgtaaacaaa agaagaactt ttttggtcaa cttaaggata cttatctctg ttttgtttat 3000 aatcccccac tgaactcaaa acactgcaat cctgactttt ttgaacactt agagaaggac 3060 atatctaaat tctcacagtt tggtaacatt gtcattctag gagactttaa cagcagaact 3120 ggtatgttaa aggattatgc tgattttgat gatacatttt taccattacc attttcacca 3180 gataataatg acacaataac agttcataac aatcgagata ctacagtcaa tgcatacggt 3240 aaagctttaa ttgacctttg caaaaattgt gatatgagga tactgaatgg cagaatgcca 3300 ggagattcta gtggtaacat aacatgttac cattacaatg gagtaagcac agttgattac 3360 gccctgtcgt cttgtgatat tattgattat tttgattatt ttaaaatcca tgattttact 3420 tatctttcag accattgcca gatttcttgc caacttaaat taaataagat aattaaatta 3480 cataaaacca ataataatga tttacataat ctacctaaaa gctgggtctg ggataaactt 3540 tcagttttta attaccagga taaccttgct tcaacacata gtcaacataa aataaattcc 3600 ttcttaacta acaaatatac attagatagt aagggaataa atgatgcagt aaataatttt 3660 aattcattaa ttactgagtc tgcagacaga acatttaaca agaagtctaa taaaagtaag 3720 aagcagaaac ctaaaaagaa aaaatggttt gacagagact gcatggtact caaaaaagat 3780 cttgttaaaa cagcccaatt agttaataaa tttcctttcg atactaggtt gagagactta 3840 ttttatagaa gaaagaagaa atttaaaaaa ctaattaaat taaagaaaaa acaagaagaa 3900 aacttaattg tgaataaact tattaactta aataatacta accctaatca attttggaaa 3960 gttattaatc aattaagaga atcaactcat gaacaggcaa gccatatttc accagaggaa 4020 tggaataggt attacagtaa attaaatgcc tgtgagacca ataacaatta tcataataac 4080 ataatttctg aaattacatc acttcaaaat agggaacctt tagcaactaa attgaatagc 4140 acaattacag atgaagaaat taaattagct attaaaaaac ttaagtcaaa aaaggcagca 4200 gggtatgatg gatactgtcc agaattattg aaacaaagtt cagacttcct gattaaacca 4260 ctacaaaaac tttacaacct catcttggat tcaggtattt acccagatat ctggaacatc 4320 agtatcatat cacatatttt taagaaggga gatagagcag agcctggtaa ttatagggga 4380 atatctgttt ccagttgtgt gggtaagaca ttttgtgtta ttatgaataa tagaattcaa 4440 cagtttatgg acaaaaacaa tattttacat tccaaccaag ctggctttcg aaaaggtttc 4500 cgcaccactg atcatatttt cactctaaaa acaattgtac ataagtatct aaagcaaaat 4560 ggaagactat acacttgttt catagacttc cagaaagctt tcgatactat ttggagaaat 4620 ggtttatttt acaaacttct gaacagtggt atagacggga aaatttataa tattattaaa 4680 tatatgtact taaatacaaa ggcctgcata cgatttggaa acaagacttc ttctgttttt 4740 gagacaacta agggagtaaa acaaggctgt gtaatcagcc ctatgctttt taacatctat 4800 attaatgact taaatgatgt attacaggat agttcaatag accatgtttc actaaatgac 4860 actaaaatta gctcactctt ctacgcagat gacctagttt tactttccac aacaaactat 4920 ggtcttcaaa aagcattaga tattttatca gactattgtt ctaattggaa acttagtgtc 4980 aatttaacaa aaacaaaaat tgttattttc agaaaaagag gtttaaaaac acatgataaa 5040 ttcctatatc ttggaaatca tatcaactgt gttaactcct ataattactt aggtgttaca 5100 atgcagtcca atgggaaatt tactgaagcc agacaaaaca tggcagataa agccaacaga 5160 gccttatatg caatgttaag acacattcca cctgaatgca actgtaaaac atttatctat 5220 ctgttcagaa catgtattct tccagtactc atttatggtt cagaaatttg gggaattgaa 5280 gttttggatg aaaacaaatg ggacaaaact acaagtgaaa aactacattt gaaattctgt 5340 agaaatattc taggcctaaa taaatttggt gcaaatttag cttgtagggc agagttgggc 5400 tgtttcccat tactaataga caccaagaaa aaccttatca agtattggtt acgcatacaa 5460 gggcttccag ataaatgtct tacaaaacaa gcatattatg aacaaatgaa attcaattta 5520 ccatggtgtt catcggttca gaatttagtt attgactcaa gaattcaaac tagtaaaaca 5580 gatttgaata taaagaacat tgataagact ttaaaagaaa actatattac attttggaaa 5640 cagaatgtca atcaagaaaa cacaaaacta agaacatatg caaaattcaa aaacacttat 5700 aattttgagt catacttaaa ttgtcttccg aaaaaacatc aagtctgcat gtcacgcttc 5760 cgaacaagta accattgttt ggctattgaa aaaggaaggc acacacagcc aaaaactccc 5820 attgaaaaaa gaatttgtaa tttatgttcc caaaataaaa tagaagatga agttcacttc 5880 cttttagaat gtgatcggta caaatatatt agacatgaat tgctcacaga aattacatta 5940 actaatgcag gagataaatt gagtaatttt atatttttga tgagcaacaa ggactccaag 6000 gcaattaaat gtatggggac ctatctactt aaagccatgg agtacagaaa actttaacgt 6060 atctatttat attgtattgt aattgtgttg tgtatggata tatttattgc tttttaccaa 6120 aagaaatatt gtaatttcta aggatgcaat aaaacttgat tattattatt attatta 6177 // ID BEL-644_AA-I repbase; DNA; INV; 5940 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-644_AA_; KW BEL-644_AA-LTR; ao_Bel_Ele11; BEL-644_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5940 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4985-5542] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..5938 FT /product="BEL-644_AA-I_1p" FT /translation="MATGGTPSPAESSCANCDRPNTVDDLVQCDRCKAWWH FT MSCAGVTHSVEGRAWNCSKCVSSTGSTGSVLSVPSSSRSRSSTKSKRAKLV FT LQMLQEEQDMRMKQLKEQQDMEKSFLERKYALIQAALEEEEQEDGVSVRSR FT SSHRSNVRKTQQWVNELGEANEDTAAALPKIADPAPDPQKKGAIPKHSRRL FT DFTESTPQNICKDKEEVAIEQIGHNDLPESIPLRAQHGEQQVQAAAPPPWP FT NSFALDLQKEKDASRRQNLRRVEYPETASQWFDVRPSNSGKLDPKPPHPGK FT SDLQTSTLPTHANDETQLSSIEHLTKQFGLVSVENMNLPDNINAFTPTPSQ FT LAARQVMPRDLPVFSGSPADWPVYISNFMNSTLACGYSPAENLIRLQRSLK FT GPALEAVRSRLLLPESVPHIIDTLRLLYGRPELLINALIEKVHSVLTPKAE FT KLESLIDFGMAVQSLSDHLEAANQKAHLSNPSLLIELVDKLPVHMKMEWVK FT FVENKGEVNLKVFGEFMSGIVKSASQVTLYTGMASRVKKSEVEDKWKQKRV FT AVHSHQLESPPASSGDGSDKACPACRQKNHRLKDCGVFRSYSIEGRSRFVQ FT QHGICRNCLNAHGRRACRNSGVCGVNACPYRHHQLLHFPRSNVSAVENHVH FT REIHPSLYFRIIPVTICGGGRSIEIFAFLDDGSQLTLIEEGLVRQLGVEGM FT ELPLCLKWTGNMTRTENGSRAVSLSITTDDKRLFELKEVRTVKSLTLPSQT FT VDYDYLATKYSHLRGIPMKSYSAAVPRLLLGVDNLRLTVPLKTREGKINEP FT VAVKTRLGWCIYGGQAQSVTASLNYHYCECSADQALHGLVKEFFAAEDVGT FT RSPPPIVSEDDRRASELLETTTKRTGSRFETGLLWRFDNFKLPESYGMALK FT RLECLEKRMEKQPELRANIAKQINEYVAKGYVHRATEEELRFADSKRVWYL FT PLGVVFNPKKPNKVRLIWDAAAKSHGVSLNDMLLKGPDQLTSLPAVLFRFR FT QFAIAVSADIAEMFHQIGIVEGDRHAQRFLWREDPFTAPEVFLMNVATFGS FT TCSPASAQFVKNKNALEFRQQYPRAVQSIVKNHYVDDLLESFETEEEAAKV FT IQEVRMIHAHGGFNIRNFQSNSPVVLDSVGEKSASNPCSLDLHQAERSERV FT LGLLWTPHEDMLGFSTIMPDEIQKVISTGVAPTKRQVLKCLMSFYDPLGLL FT ALFTVQGKIILQAIWRSNTQWDQQIGAVIRNRWQQWIQILVDLQHLHFPRC FT YFDQATLKYYQHVQLHIFVDASEEAYAAVAFFRIVDPQGRVCCSLVAAKTK FT VAPIKPISIPRLELQAAVLGVRLMQWIVEGHSIRIEKKVFWTDSSTVLSWI FT KSDKRRYKQFVAVRVGEILTETNVEEWRYVPSKKNIADAATKWGKGPDLNP FT DGVWFKGPTFLKDIESKWPVTPKLKEDDVEVRPIFVHQAVPVVKLVIDLER FT FSNWNRVLRTTAYLLRFISRSRKRTVPQSTYLSQEELAAAWRYLICAVQWD FT SYPDEMVTLTNNLRNPSGQMRALDKTSSLYQLSPMLDDCGMLRVDSRIQAA FT RNVLNDTKLPIILPRKHKLTHLIIESFHRKYLHGSPETVVNEIRQLYYIPK FT LRTMVRLFTKECQLCKIKRAQPQIPRMAPLPSARLASFIRPFSYVGLDLFG FT PLLVKLGRSNAKRWIALFTCLTIRAVHVEVVYSLSTDSCVMSVRRFVSRRG FT APVEIHSDNGTNFRGADRLLREQIQNINEAMAETFTNTTTKWVFIPPSAPH FT MGGSWERMVRSVKVAMEAANGGRRLDDEGLITLAVEAEGIVNSRPLTYLPL FT DSEEAEALTPNHFILGSSSGVRQPAMKPADEAVALRNSWGQIQVQTDVFWK FT RWTREYLPTLTRRVKWFGEVKPIAVGDLVLVVEETRRNGWTRGRVCEVYKG FT QDGRIRQAMVQTSGGLIRRPVSKLAILDVAQVGKTGPGTSGDQCYGEE" XX SQ Sequence 5940 BP; 1715 A; 1308 C; 1512 G; 1405 T; 0 other; aatcttcaag aaattcgcta gtatggcaac aggaggaaca ccttcgcctg ctgaaagcag 60 ttgtgcgaac tgcgatcgac cgaatactgt ggacgatctg gtccaatgcg atcggtgtaa 120 ggcttggtgg cacatgtcat gtgcaggagt tacacactcg gtggaaggtc gagcatggaa 180 ttgcagtaaa tgtgtttcat caactggttc tactggatct gttttgtcag tacccagcag 240 ttccagatcc cgttcttcga cgaaatcaaa acgtgcgaag ttggttctac agatgcttca 300 agaggagcag gatatgagaa tgaagcagct aaaggagcaa caagatatgg agaagagttt 360 tctggagcgg aagtatgcac tcatccaagc cgcattggag gaagaggaac aagaggatgg 420 cgtcagtgtt agaagccgtt caagccaccg aagtaacgtg agaaaaacgc agcagtgggt 480 gaacgagctt ggcgaagcaa atgaggatac tgcagcagca ttgccgaaaa tagcagatcc 540 agctcctgat ccgcagaaga aaggtgcaat cccgaagcat tcacgacgac tagattttac 600 agaatcaaca ccccagaata tctgtaaaga taaggaagaa gtggccatcg agcagattgg 660 acacaacgat ttacccgaaa gtatcccact gagagcccag cacggtgagc agcaggttca 720 agcagccgcc cctcctccgt ggccaaattc gttcgcttta gatctgcaga aggagaagga 780 tgcaagcagg aggcagaatc tacgtcgtgt ggaatatcct gagacagcgt cacaatggtt 840 tgatgtaaga ccgtccaact caggtaaact agaccctaag ccgccacatc caggtaagtc 900 cgatcttcaa acatctacat taccaacgca tgctaacgat gagacccaac tgagttcaat 960 cgagcacctt accaaacaat ttgggctcgt ttctgttgaa aatatgaacc tgccggataa 1020 tattaatgca tttactccta cgccttcgca attggcggcg aggcaagtaa tgccccgaga 1080 cctccccgtt ttttctggca gtccagctga ctggccagta tatatcagta actttatgaa 1140 ttctacgctg gcgtgtggct acagtcccgc cgaaaattta atcagactac aacgctctct 1200 gaaaggtcca gccctggaag cggttcgcag tcgtctcctt ctccccgaat cagtacctca 1260 tataattgat acgctccgct tactgtacgg gcgcccagag cttttgataa atgcgctcat 1320 cgaaaaggtc cactccgtcc tcacgccgaa agcagagaaa ttagaatctc tcatcgactt 1380 cgggatggca gttcagagct tgagcgacca ccttgaagct gcaaatcaaa aggctcatct 1440 gtcaaacccg tcgttgttaa tcgaactagt cgacaaattg ccagtgcaca tgaagatgga 1500 gtgggtgaag tttgttgaaa ataaaggtga agtgaacttg aaagtatttg gtgaatttat 1560 gagtggaata gtgaagtcag cgagtcaagt tacgctctac acgggaatgg caagcagagt 1620 gaagaaaagc gaagttgagg acaagtggaa gcagaagcgt gtagcagttc actcacatca 1680 actggagtca ccaccagcga gtagtgggga tggaagtgat aaggcctgtc cagcgtgccg 1740 ccaaaagaat catcggctga aggattgcgg tgtgttcagg tcctattcta ttgaagggag 1800 aagtcgtttc gtacagcagc atggaatctg tcgtaattgt ttgaatgctc atggtagacg 1860 tgcttgccga aattctggag tttgtggagt aaacgcatgt ccgtaccgtc atcatcaact 1920 tttgcatttt ccacgatcaa atgtatccgc cgtggaaaat cacgttcacc gtgaaattca 1980 tccttcgctc tacttcagaa taatcccagt tacaatttgt ggaggaggca gaagcattga 2040 gatcttcgca ttcttagacg atggttcgca actaacgctg attgaggaag ggctggtgcg 2100 acagctaggt gtcgaaggta tggagcttcc gctttgtttg aagtggacag gaaatatgac 2160 acgaaccgaa aatggatccc gagcagtaag cttgtcaata accacggatg acaaacgcct 2220 cttcgagctc aaggaagttc gtacagtgaa gtctcttacg cttcccagtc aaaccgtaga 2280 ctatgattat ttggcaacga agtactccca tctgcgtgga atcccaatga aaagctattc 2340 cgcagccgtt ccaagattac tgttgggagt tgataatcta cggttgaccg ttccattgaa 2400 aaccagagaa ggcaaaatca acgaacccgt agcggtaaaa actcgtttgg gctggtgcat 2460 ctacggcgga caggcgcagt cagtcacagc atctctaaac tatcattact gcgagtgttc 2520 ggctgaccaa gctctgcacg gtctcgtaaa ggagtttttt gctgcagagg atgtgggaac 2580 gcgatcccca ccaccaatag tatcggagga cgacagaaga gcaagtgagc tgctggagac 2640 tacaacgaaa cggacaggaa gcagatttga aaccggatta ctgtggcgat ttgacaattt 2700 caagctacct gagagttacg gaatggcgtt gaaacgactg gaatgtttgg aaaaacgaat 2760 ggaaaagcaa ccggaattga gggctaacat agcaaagcaa atcaacgaat acgtagctaa 2820 aggctacgtt caccgtgcaa cggaagagga attgaggttc gctgattcaa aacgagtgtg 2880 gtatcttccg ttgggtgtgg tgttcaaccc gaagaagcca aataaagtcc ggctaatttg 2940 ggacgccgct gccaaatcac atggagtgtc tcttaacgac atgctactta aaggacccga 3000 tcaattgacg tcattgccag cagtgttatt tcgtttccgg caatttgcga tagctgtttc 3060 agctgacata gccgagatgt tccaccaaat tggaatcgtg gaaggcgatc gtcatgcaca 3120 gcgttttttg tggagagaag atccattcac agccccagaa gtttttctaa tgaacgttgc 3180 gacatttggc tcaacgtgct ctccagcatc agcccaattt gttaaaaaca aaaatgcttt 3240 agagttcaga cagcaatacc caagggcagt tcagagcatt gtgaaaaacc actacgtgga 3300 cgatcttctg gaaagcttcg aaaccgaaga agaagcagcg aaggtaatac aggaggttcg 3360 gatgatccat gcccatggtg gattcaatat ccgtaatttc caatcgaaca gtcctgttgt 3420 tttggacagt gtgggcgaaa agagtgcgtc gaatccttgc agtctggatt tgcatcaagc 3480 cgaaaggtct gaacgcgttt tgggactgtt gtggacccct catgaagata tgcttggatt 3540 ttcaacaata atgccagacg aaatacaaaa ggtgatctcc acaggagttg ctcccaccaa 3600 acgccaagtt ttgaaatgct tgatgagctt ttacgatcct ctgggattgt tagccttatt 3660 tacggttcag ggaaagatca ttctccaagc aatctggcgt agcaataccc agtgggatca 3720 acaaatcggt gcagttattc gaaatcgatg gcagcagtgg atacagatac tagttgactt 3780 acaacatctg catttcccac ggtgttactt tgaccaagct actttaaaat actaccaaca 3840 tgttcaactc cacatattcg ttgatgcgag cgaggaagct tatgcggccg tagcattctt 3900 tcgcatagtc gatccacaag gaagggtgtg ctgttcttta gtggctgcca aaacaaaagt 3960 agctcccatc aagccaatat ccatacctcg cctcgagctg caggcagcag ttcttggtgt 4020 tcgactgatg caatggatcg ttgaaggtca ttctattcga atagaaaaaa aggtgttctg 4080 gacggattcc agcacggtct tgtcgtggat caaatctgat aagagacgat acaaacagtt 4140 tgtggcagtt cgagttggcg agatcttgac agaaaccaac gtggaggagt ggaggtacgt 4200 accgtcgaag aaaaatattg ctgacgccgc aactaaatgg ggaaaaggtc cagatcttaa 4260 cccggatggt gtttggttca agggaccgac atttttgaaa gacatcgaat ctaaatggcc 4320 tgtgactccc aaactaaagg aagacgacgt tgaagttcgt cctatcttcg tccatcaagc 4380 ggtaccagtt gtaaaacttg ttatcgattt ggaaagattc tcgaactgga atcgagtatt 4440 acgcactaca gcatatctac ttcggttcat aagcaggagt cgaaagagaa ctgttccgca 4500 gtcaacctat ctttctcagg aagaattggc cgcagcctgg agatacctaa tctgtgccgt 4560 acaatgggat tcttacccgg acgaaatggt gaccttgaca aacaacctac gaaatcccag 4620 tggtcaaatg agagcgttgg acaaaacaag tagcctatac cagttatcac caatgctcga 4680 tgattgcgga atgctacgtg tggacagtag aatacaggct gcaaggaatg tactaaacga 4740 caccaagctg ccaattatct tgccaagaaa acacaagtta acacacctta tcatcgagtc 4800 cttccatcga aagtatcttc atggaagtcc agaaaccgtg gtcaacgaga taagacaact 4860 gtattacatc ccgaaacttc gcaccatggt caggctgttc acaaaagaat gtcagttgtg 4920 caagattaaa cgagctcaac cacaaatacc aagaatggct ccactaccat ctgcgcgttt 4980 ggcctcattc attagaccgt ttagctacgt tgggttagat ctatttggcc cactcctggt 5040 taagctcggt agaagtaatg ctaaacggtg gattgcttta ttcacatgct tgacaatccg 5100 ggctgtccat gtggaggtcg tgtacagcct ctcgacggat tcttgtgtga tgagcgtacg 5160 ccgattcgtt agccgtcgtg gagcaccagt agaaatccat tcggataatg gaacaaactt 5220 cagaggtgcc gatcggttgc taagagaaca aattcagaac attaatgaag caatggcaga 5280 aacattcacc aataccacta caaagtgggt cttcatccct cccagtgcac cgcacatggg 5340 cggatcatgg gaacggatgg tgcgttccgt aaaggtggcg atggaagcag ctaatggggg 5400 acggagactg gacgacgaag gactgataac actggcagtt gaagctgaag ggattgtcaa 5460 ttcgagacca ttgacctacc ttccgttgga ctccgaagaa gcagaggccc ttactcccaa 5520 tcattttatc ctggggagct ctagtggcgt acgacagcca gcgatgaaac cggcggacga 5580 agcagttgct ttacggaact cttggggaca aattcaagta cagacggatg ttttctggaa 5640 acggtggacg cgtgaatatc tgcctacact cacgcgacgt gtcaaatggt tcggggaggt 5700 aaaaccgata gctgtcggag acttagtatt agtcgtagaa gaaacaagga gaaacgggtg 5760 gactagaggt cgcgtttgtg aggtttacaa ggggcaggat ggtcgtatcc ggcaggcaat 5820 ggttcagacg agtggaggtt tgattcgtag accagtctca aagttagcta ttctggatgt 5880 ggcgcaggtc ggtaaaactg gacccggaac atccggtgac cagtgttacg gggaggagga 5940 // ID RTE-2_NVi repbase; DNA; INV; 4937 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Non-LTR retrotransposon: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4937 RA Bao W. and Jurka J.; RT "RTE-like non-LTR retrotransposons from the parasitic wasp RT Nasonia vitripennis."; RL Repbase Reports 9(4), 798-798 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(463..687,614..1027,984..1751) FT /product="RTE-2_NVi_1p" FT /translation="MEAKNKLKGTDLFIDHDTTWIERRNREKLNGLAKEWR FT SEGKPVKVGRNKLTVGDEEYIWNERKDQLFRKRGRRQERWETRSIYGTRER FT TSFFEKEEEGKNKGKKKGRGKEKEKESLKVLSWNVAGLKGVDEEGWGFLRE FT FDVICLQETWTGGGEEKSIRKRLQGYEVEVREAKKGGSRGRLKGGMIMAVR FT KRSKEDKVEWMEGEIRRIYRREDSGWKGRSGEFIGGKIRIKGEEWWIGSTY FT MRDEKKKNYEEIEEMLERAGGGGRILWCGDLNARTGTEGGSLDEEGNEERR FT ESKDRTVNKEGEELLDRLKEMGLSILNGNIEGDEEGEITYVGGMGCSVIDY FT GIVNEEGRRKIKRMKVKDRIESDHGALEIEIDTERKVEVVKEETSRALWTE FT KAIREYRSWLEKGGEAKSWREMKVKIRNAIQTKRGVRKKNEKNRWWDEECR FT KKRAEMKRAKKGLEEMGSTGTHR*" FT CDS join(1794..2264,2194..2556,2445..3677,3958..4734) FT /product="RTE-2_NVi_2p" FT /translation="TEVREIIRKLKEKKAPGDDGIQNEAWKYGEENLVEEL FT KEILGKIWKGEGLPEEWRKGTVKPIFKKGDKGECKNYRGITLMDTGYKIYA FT ELIRGRMEKKLEEENRLSDTQMGFRKGRGTIDAIYVVSKAVEQELRKKGGK FT VYACFMLRRHESSFRQNKGRKVGRFTPALCFADMKAAFDKINREEIWRMMK FT RLGVSRKIRERVEEIYEKTECEVNIGERKVGSFVTQKGVRQGCPLSPLLFN FT VAMADLEVEMGKIQEGGIILGRKKNYGRFHTRMMWCSCGNGRPGSRDGEDT FT GRRNHSRKKKKLWSISYADDVVLLATNASGLKQMLKRFKGFIRRKGLELNT FT EKTKIMIFRNGGRRKREEKFEWNGEEIEVVKKFEYLGYTMKENGKEXEQIK FT KVKGKAIAVLGTVWGIGEAIFKESWKMRMKLFDALVQSVMEYGAEIWGWKG FT QEELEKIQRRYMKWVLKLERTTPAHIIHWETKRFKLETRARKRAVRYEGKL FT RKAKENSLLRECWSLLKKEGEKEIRRGRGKDVRRSELEKCGISVEEYNRRV FT EEGEGVEEEIERINKDTQWQKWREEIRGSKYAEEVRELLMEQGEIPEYLGE FT TGEKRRGALEKIARFRLGSETRGNHYWRKEEDRKCRMCGREEETLRHVMEE FT CEETKISGKKWKEICGGKRSQIGNLNSIIWKRKNKEKVTEKSQIVEMTHFE FT RRMEREMREIKEEVKEWKVKMQLERGKREAAERKLMDLEGDVEILKGKVRE FT ERREREKMQRAVEEMRAKIKKVEEKGRREKERKGSEEVSXVKVKSEVRLVS FT ASKRKRMEEGRSKVWERGVIIHKGANWDENKVLQDWLDEFLEGTNWELEET FT SNKDTKKIIFESREEMEEVWKRREEINEEDKVRMEQWMNREERRAKREAIL FT ERERRMEEAKRRGELMRRGNIRVRVEGETYEWIEEEGRIKKRR*" XX SQ Sequence 4937 BP; 2120 A; 461 C; 1647 G; 698 T; 11 other; aatgaaaagg tggaagaggt aaggcggcta atagaagaag agaaaaggga aagaatggaa 60 ctagaggaga gaatgaagag ggatgcggag gaggagagaa ggaaaagaaa agaactcgaa 120 agaaaaatgg aagaaagggt aaaagtgttg gaggataagg cagaaggaga gaagatagaa 180 aaggcggaat caggaagtaa agggaaagag gaaaaggaaa tggaagataa ataactgaag 240 gagctagaat ggaggatcga ggaaggagaa agagaaagga aacggaacaa cgtggtatta 300 acagggctaa aagaggagag gtgggacaag gtgaagattg aaggctggtt taaagaaaag 360 ctagaaatca cggtgaaggt gaagaggaca tggctgatta gaggcaaggc cacaaacaga 420 attagggtgg aatgcgtaga cagagaagag aaggaaaaga taatggaggc taagaataaa 480 ctaaagggca cggacttgtt catagatcac gatacaacct ggatagagag gagaaataga 540 gagaagctga acgggctggc taaagagtgg agaagcgaag ggaaaccagt gaaagtagga 600 aggaacaaac tgacggtggg agacgaggag tatatatgga acgagagaaa ggaccagctt 660 tttcgaaaaa gaggaagaag gcaagaataa gggaaagaag aaaggccgag gaaaagagaa 720 agaaaaagaa agtctgaagg tgttgagctg gaacgtggca ggacttaaag gagtggacga 780 ggaaggatgg ggatttctac gggaattcga cgtcatttgc ctgcaggaga cgtggacagg 840 aggcggagaa gagaaaagca taaggaaaag gctacaggga tacgaggtag aggtaagaga 900 ggcaaagaag gggggaagca gaggtaggct aaagggagga atgataatgg cagtcaggaa 960 aagaagtaaa gaggataaag tagagtggat ggaaggggag atcaggagaa tttataggag 1020 ggaagattag gataaaagga gaggaatggt ggataggctc gacatatatg agggatgaga 1080 aaaagaagaa ctatgaagag atagaggaaa tgttagaaag ggcaggtgga ggaggaagga 1140 tactttggtg cggagaccta aatgcaagga caggaacaga gggtggaagt ctggatgagg 1200 agggaaatga ggaaaggaga gaatctaagg atagaacagt aaataaagaa ggggaagagt 1260 tgctagacag attaaaagag atggggctgt ccattttaaa cggtaacata gaaggggatg 1320 aggaagggga aataacatac gtgggaggaa tgggatgttc agtaatagat tacgggatag 1380 taaatgagga agggaggaga aagataaaaa ggatgaaggt aaaagataga atagaatcgg 1440 accatggggc gcttgagata gagatagaca cagagagaaa agtggaggtg gtaaaagagg 1500 aaactagtag agcgttatgg acagagaagg ccattagaga atacaggagt tggctagaaa 1560 agggaggaga agcgaaatca tggagagaaa tgaaagtgaa aataaggaac gcgatacaaa 1620 ccaaaagagg ggtgagaaag aagaatgaaa agaatagatg gtgggatgaa gaatgcagaa 1680 aaaagagggc agaaatgaag agggcaaaaa agggattaga agagatggga tcaacaggta 1740 cacacaggta aagagagaat taaagaacat gtagtgtrga acatgtagag taaacagaag 1800 tgagggaaat aatcagaaag ttgaaggaga aaaaggcacc aggagatgat ggaatacaga 1860 acgaagcatg gaaatatgga gaagagaatc tagtagagga actgaaggag atactgggaa 1920 agatatggaa aggagaggga ctaccagaag aatggagaaa agggacggtg aaaccaatct 1980 ttaaaaaagg agataaggga gaatgtaaaa actacagagg gataacccta atggatacag 2040 gatataaaat atatgcagaa ctgataagag ggagaatgga gaagaagcta gaagaagaga 2100 acagactgag cgacacgcag atggggttta gaaaagggag agggacgata gatgcaatat 2160 acgtggtgag taaggcagta gagcaagagc tgaggaagaa aggtgggaag gtttacgcct 2220 gctttatgct tcgcagacat gaaagcagct ttcgacaaaa taaataggga ggaaatatgg 2280 agaatgatga agagactggg agtgagcagg aagattagag aaagggttga agagatttat 2340 gaaaagacgg aatgcgaagt gaacatagga gaaagaaaag taggcagctt cgtcacgcaa 2400 aaaggggtaa gacagggatg cccacttagt ccactacttt ttaatgtggc aatggcagac 2460 ctggaagtag agatggggaa gatacaggaa ggcggaatca ttctaggaag aaaaaaaaat 2520 tatggtcgat ttcatacgcg gatgatgtgg tgctcttagc aacgaacgcg agtggactaa 2580 agcaaatgct gaaaagattt aagggattca taagaagaaa gggtctagaa ctgaatacag 2640 agaaaacaaa gattatgata ttcagaaacg gaggaagaag gaagagagag gagaagttcg 2700 agtggaatgg agaagagata gaggtggtga agaaatttga atacctaggc tacactatga 2760 aagagaacgg raaggaagak gaacaaatca agaaagtgaa aggaaaggca atcgccgtgt 2820 taggcacagt atgggggata ggggaggcaa tcttcaaaga gagctggaag atgagaatga 2880 aactcttcga cgcactagtt cagagcgtaa tggagtacgg agcggagatt tggggttgga 2940 aagggcagga agaattggag aaaatacaga gaagatacat gaaatgggta ctgaagctgg 3000 agagaacaac tccagcacac attatacatt gggagacgaa acgcttcaaa ctggaaacaa 3060 gagcaagaaa aagggcagtt agatacgagg gaaaattaag gaaggctaaa gaaaactcct 3120 tgctaagaga atgctggagc ctactgaaaa aagagggaga aaaagagata cggagaggaa 3180 gagggaagga tgtaagaagg agcgaattag aaaaatgcgg tatatcggtg gaagaataca 3240 acaggagagt agaagaggga gaaggggtag aagaggagat tgaaaggata aacaaggaca 3300 cacaatggca aaagtggaga gaggaaataa gagggtcgaa atatgcagaa gaggtaagag 3360 aactactgat ggaacaagga gaaataccgg aatatctagg agagacagga gaaaagagga 3420 gaggggcact ggaaaaaata gcaagattta gactaggaag cgaaacgaga ggcaaccact 3480 actggaggaa ggaagaagac aggaaatgca gaatgtgcgg gagagaagaa gagacactga 3540 gacacgtaat ggaagaatgt gaggagacga agatatcagg gaaaaaatgg aaagaaatat 3600 gcggaggcaa aagaagtcaa atagggaacc ttaatagtat tatatggaaa cggaaaaaca 3660 aagaaaaagt aacagaataa aggagaaaga agaataagta aagggaagga agacaaagtg 3720 tgccgggaga agtttcaaaa agcgcgcgsa aaccacctag ggaactccga ggaaaagtcc 3780 ctaggagkgt agagagaaag aaactaggaa cgtcggcaag gaagaaagga gtagaaagaa 3840 ggaacggacg aaaaagaaaa gggaacgaat agaacagaag cgaaagcacg tgtagaaaac 3900 agcagaaaga ggttagaaag aagagatcag tcaactagat aagagataaa aaagtgaaaa 3960 agccagatag tggagatgac tcacttcgag agaagaatgg agagggagat gagagaaata 4020 aaagaggagg tgaaggagtg gaaggtgaag atgcaactgg agaggggaaa aagagaagcg 4080 gcggagagga aattgatgga cctagaagga gacgtggaga tcctgaaggg aaaagtgaga 4140 gaagagagaa gagaaaggga gaagatgcaa agggcagttg aagagatgag ggctaagata 4200 aagaaggtgg aggaaaaggg gagacgcgag aaagagagaa agggatcrga agaggtaagc 4260 aawgtgaaag tgaaaagtga ggttaggttg gtgtcagcaa gcaaaagaaa gagaatggaa 4320 gagggaagaa gcaaggtgtg ggagagagga gtaataatac acaagggagc aaactgggac 4380 gagaacaagg tactacagga ctggctggac gaatttttag aagggacaaa ctgggaactg 4440 gaggagacat cmaacaaaga cacaaagaaa attattttcg aatcaagaga agagatggag 4500 gaagtttgga aaaggagaga ggagattaac gaagaggaca aagtgagaat ggagcagtgg 4560 atgaaccggg aagagagaag ggcaaaaaga gaggcaattt tagaaaggga gagaagaatg 4620 gaggaggcaa agagaagggg tgaactaatg agaagaggga acatcagggt gagagtagaa 4680 ggagagacat acgaatggat agaagaagag ggcaggataa agaagcggag atgagggata 4740 aggaggagga gagaagggca agaaggaaga aggaagacta grgagaagga aactgcagtg 4800 tactatgtat aagaaaaaaa aagagaaaaa aaagaraaaa aaagatgtac ggggakaata 4860 tttttgtata ctattgtaaa tacgagatgt ataaaaacaa taaaactaat actattacta 4920 caatcatatt attatat 4937 // ID BEL-100_AA-I repbase; DNA; INV; 5875 BP. XX AC supercont1.40; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-100_AA_; KW BEL-100_AA-LTR; BEL-100_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5875 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.40; Positions 1565389 1559515. XX CC Positions [4571-5143] - Integrase core CC 'ATTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 263..5569 FT /product="BEL-100_AA-I_1p" FT /translation="MSTERRIKSLKMRLKSLTASLSLITNFVDEFDEDTQS FT DEVPVRLESLIKLWTDYSTAQSELESIDENSLDAHIKERTTMESSYYRIKG FT FLLAHNKSPLNQSLSTPTHSEAQVPMSTSQVRLPDVKLPIFDGKLDNWLVF FT HDLYISLVHSSTCLSNIQKFYYLRSSLSHSALQLIQSIPISADNYSVAWNL FT LLKHFQNPARLKQAYVDALFDFTALKRESAAELHNLVERFEANVKVLHQLG FT ERTEHWDILLTRMLSTRLDATTRRDWEEYSSSQQTVKFKDLTEFIQRRVTV FT LQSIQSKVVDTPPFAQVKKPIQRSVSSHGATQVSPRKCLVCSDHHPLYLCG FT NFSKLSAEDKEKEIRRHQLCRNCLRKGHQSRDCSSSTNCRKCRGRHHTLLC FT SSDSSSSSTPKPFATQQSKPIVSSEVTETPISSASATVVDSVSCASTGQQQ FT RTVLLATAMIILVDDNGVEHIGRALLDSGSECCFITERFSQRMKVQRKKIH FT LPISGIGQSSTHAKQQISCIIRSRVGAYSTAVDLLVLPRVTIDLPATSVNT FT STWSFPPGIQLADPSFDSTNPVDIILGAEIFFELFRIPGRIYLGEHLPVLV FT NSVFGWVVSGKSNVGTSGPPVVANIATVADIHRLMERFWKIEEDISPTTYS FT VEEQACEEYFSRTVSRTSEGRYIVRLPFKENVLERLSDNRSTAVRRLHLLQ FT ARLIRNPDLHVQYKAFIDEYNNLGHMQRTHEYEESTVKRFYLPHHAVLRED FT SPTTKLRVVFDASCKTSAGPSLNDALMVGPTVQEDIRSIIMRARKHQVMIV FT ADVKMMYRQVLVDPRDTSVQLIVWKSSPDQPMETYELKTVTYGTASAPFLA FT TRVLIQLADDEGRNYPLAAPVIKKDFYVDDLFSGGENTAEVIELRNQLEAL FT LAKGGFELRKWASNNEAVLDGIPTENRAIKESVELNRDQIIKTLGLHWEPA FT TDCLRYNLETLNSSTQPLTKRLVLSLIARLYDPLGLVGPVVTTAKVFMQDL FT WTLKDDTGEPWSWDRVLPINYQTRWTNYQTLLPNLNNLRISRCILLPEPET FT IQIHIFADASQLAYGACAYIRSTNTAGLVKVSLLSSKSRVAPLKRLSIPRL FT ELCGALLAAELYEKIKLSLQLNAKCYFWLDSTVVLCWLNASPSTWNTFVAN FT RTSKIQLATSNCSWHHIAGLENPADCLSRGLTPEIIVDFDLWWNGPQWLRQ FT HQHLWPTNHATLEQPSEAVEEARRPYLASPSSKPNPSFIDEIVGKYSNVQR FT LIRVVAYCHRFLQNCRSPRSERATSSPLNVEELLQAEHIIIRLVQQQAFEE FT EWKQLKLNQPVSSKSRLKWFYPFVSSDNLIRVGGRLGNAIQPYDAKHQILL FT PRAHQFSLLLVRNYHERHLHAAPQLLLSLLRQRYWVIGARSLAKHVVHNCV FT VCFRARPRMLEQFMADLPASRTIASRPFSITGVDYWGPIHLKPIHRRAAPG FT KAYVAVFICFSTKAVHLELVSDLSTAKFIQSLRRFVARRGLCTELHSDNGR FT NFVGAANELKQLVNSKDYQNAVAQECNKNQIRWHFNPPKASNFGGLWEAAI FT HSAQKHFVRVLGKNTLSHDDMETLLCQIESCLNSRPLVALSDDPSDFEPLT FT PGHFLTGSALKAVPDADYTEIAVNRLSRWHHVQKLYQQLWKRWHLEYLSTL FT QPRSKWLSPPVQIKENQLVIVRDENSPPMHWPTARIVQTHPGSDGIVRVVT FT LQTPTGHFVRPVNKLCFLPISSSVDQQVQTQIQESPSASAQDYQKKHPN" XX SQ Sequence 5875 BP; 1564 A; 1554 C; 1265 G; 1492 T; 0 other; tctggtcctt cgaaccggat tacgtcgaaa ctccgtaatt ccggcagtcc tgtggtcgtg 60 agtcgccatc agccatccgc catcattccc gctggaaggt tccagaacaa tcgataacca 120 tcattggtgc tgctctactg cattcataat acaaggcaat tgtattgcct atagcaggta 180 attatctgtc caacttccct ctttcctgct ttgtgcgtgc tacgcttgtt cgtttgtccc 240 caggacttca tctggtgctg tgatgtccac cgaacgtcgc atcaagtcgc tcaaaatgcg 300 actgaagagc ctgacagcat ctctcagcct catcaccaac tttgttgacg aattcgacga 360 agacacgcaa tccgatgaag tacctgttcg cctggaaagc cttatcaagc tatggacaga 420 ctatagcact gcacaaagtg aactagagtc gatcgatgaa aattccctgg atgcccacat 480 caaagagcgg acgactatgg aatcgtccta ctatcggatc aagggcttcc tactagccca 540 caataaatca cccttgaatc aaagcctttc aactccaact cattcagagg cgcaagtccc 600 tatgtccaca tcgcaggtgc gattgccgga cgtcaagctt cccatatttg atggaaaatt 660 ggataattgg ctcgtgtttc acgatttgta catatcattg gttcattcat cgacttgtct 720 ctccaatatc caaaagtttt attacttacg gtcgtcgctt tcgcattcag cgcttcagct 780 catccaatcg attccgataa gtgccgataa ctactcagtg gcgtggaatt tgctcttaaa 840 gcactttcaa aacccagcta gactaaaaca agcgtacgtc gatgcactct tcgactttac 900 ggccctgaag cgcgaatcag cagctgaact acacaacttg gtggaacgtt tcgaggcgaa 960 tgttaaggtt ctgcaccagc ttggtgagcg tacggaacat tgggatattt tgctcactcg 1020 tatgctcagc actcgcctag atgcaactac acgaagagat tgggaggagt attcgtcttc 1080 ccaacaaact gttaaattca aggacttgac tgaattcatt cagcgtcgag tcacagtgct 1140 tcaatccatc caatcaaagg tcgtcgatac accgccattt gcccaggtaa agaagccaat 1200 ccagcgttcc gtttcgagcc acggagccac acaggttagc cctcgaaaat gcctcgtctg 1260 ctcggatcat catcctctgt atttatgtgg gaatttctcg aagctatccg ccgaggacaa 1320 agagaaagag attcgtcgac atcaactgtg tagaaactgc ttgaggaaag gtcatcaatc 1380 aagagattgc tcatcttcta ccaactgccg taaatgtcga ggccggcatc atacgttact 1440 ctgctccagc gactcatcat cttcatccac accgaaacct ttcgccaccc aacagtcaaa 1500 gcccatcgtc tcttctgaag tcactgaaac tccgattagt tctgcttccg ctacagtagt 1560 tgactctgtt agttgtgctt ccactggtca gcaacagaga accgttcttc tggctacagc 1620 aatgatcatc cttgttgacg acaatggtgt cgaacacatt gggcgagctc tccttgattc 1680 gggaagcgag tgctgtttta ttacagaacg cttctcgcag cgcatgaagg ttcagcggaa 1740 gaaaatccac ctgccgatca gcggcatcgg tcaatcgtct acgcacgcta agcagcaaat 1800 atcctgcatc atacgttctc gcgtcggtgc gtactccacc gcagttgatt tacttgttct 1860 tcccagagtc accatcgacc tacctgctac gtccgtcaac acttcaacgt ggagtttccc 1920 acctgggatt cagttggcgg atccatcttt cgacagcacc aacccagttg acatcattct 1980 tggcgctgaa atctttttcg aactgtttcg tataccgggt cgaatctatc ttggtgagca 2040 ccttccagta ttggtgaatt cagtttttgg atgggtggtc tctgggaagt caaacgtcgg 2100 cacctccgga cctccagtag ttgccaatat cgccaccgtt gctgacatcc atcgacttat 2160 ggaaaggttt tggaagatcg aagaagatat ttcacccacc acgtactccg tagaagagca 2220 agcttgtgaa gaatatttca gccgtaccgt ttcgcgtacc tcagaaggaa ggtacatagt 2280 ccgcctgcca tttaaggaaa atgtcctcga gcgacttagc gacaaccgta gcactgctgt 2340 tcgtcgacta catcttttac aagcacgttt gatacgcaat ccagatcttc acgttcaata 2400 caaggcgttc atcgacgagt acaacaacct tggacacatg cagcgcactc acgaatatga 2460 agaatcaaca gttaagcgtt tttaccttcc gcatcatgca gtgctccgcg aggatagccc 2520 tactacgaag ctaagggttg tcttcgatgc gtcgtgtaaa acttctgcgg ggccatcctt 2580 gaatgacgcg ctcatggtag gaccaaccgt acaagaggac atccgatcaa taatcatgcg 2640 tgctcggaag catcaagtta tgatcgtggc cgacgttaaa atgatgtacc gccaagtact 2700 cgttgaccca cgcgacactt cagttcaact tattgtatgg aaatcatcgc cagaccagcc 2760 tatggagaca tacgaattga aaacggtaac ttacggtacc gctagtgcac catttcttgc 2820 aactagagta ctgatccagc ttgccgatga cgaaggccgt aactatcccc tagctgctcc 2880 tgtgattaag aaggattttt atgtggacga tctgttttct ggcggagaaa acacagctga 2940 agtgatcgaa cttcgaaacc agttagaggc tcttttagca aagggtggat ttgaactacg 3000 aaagtgggca tcgaataacg aagctgtact cgatgggatt cccactgaaa atcgtgccat 3060 taaggaatct gtggaattga accgtgacca gattatcaaa acgcttggtc tccattggga 3120 acccgctacg gattgccttc gctacaacct agagacactg aactcatcca cccaaccact 3180 cacaaaacgg cttgtcctct ctctaattgc ccgtctatat gacccgctag gcttggtagg 3240 acccgtcgta acaacagcca aagtgtttat gcaagactta tggaccttga aagatgacac 3300 tggggaacca tggagctggg atcgagttct accgattaac tatcagacgc gatggaccaa 3360 ctaccaaact ctactaccaa atctcaacaa cctccgcatt agtcgttgca ttcttctacc 3420 ggaacctgaa actattcaga ttcacatctt tgccgacgct tcacagctcg cctatggtgc 3480 ctgcgcctac atcaggtcca ccaacactgc tggattagtc aaggtttcgc tactttcttc 3540 caagtcgcgg gttgcacctc tcaagcgtct aagcatccca cgcctcgagc tgtgtggggc 3600 acttttggct gctgaactgt acgagaagat caaattgtcc ctgcaactca acgcaaaatg 3660 ttacttctgg ctcgatagca ccgtcgtact ctgctggctc aacgcatcgc catctacatg 3720 gaacacattt gtagcgaatc gaacatccaa gattcagctt gccacctcaa actgctcgtg 3780 gcaccatatt gctggtttgg aaaaccctgc tgactgtctg tcacgtggtc ttactccgga 3840 aatcatcgtc gactttgacc tctggtggaa cggacctcaa tggctacgac aacatcaaca 3900 tctttggcca acaaatcatg caacactcga gcaaccgtcg gaagccgttg aagaagctcg 3960 ccggccatat ttggcgtcac catcgtcgaa acccaatcct tcgtttattg acgaaattgt 4020 cggaaaatat tcaaatgtcc agcggctaat tcgagtggtc gcctattgtc atcgttttct 4080 tcaaaactgc cgaagcccca gaagtgaaag agccacatct tcacccctga atgtcgaaga 4140 actattgcaa gcagaacata tcatcatccg attggtgcaa caacaagcct tcgaagaaga 4200 atggaagcag ttaaaactca atcaaccagt gtcgtcaaaa tcacgattga aatggtttta 4260 cccatttgta tcatccgata acttgattcg agtcggtggc agattgggaa acgctatcca 4320 accatatgat gccaagcacc aaatcctgtt accacgagca catcagtttt ctctcctctt 4380 agtacgaaat tatcacgaac gtcatctgca tgcagcccca cagttactcc taagcctact 4440 tcgacaacgc tactgggtca taggggccag aagtttggct aaacacgttg tccacaattg 4500 cgtcgtctgc tttagggccc gtcctcgaat gcttgaacag tttatggctg atctaccggc 4560 atcacgtacc attgcgagtc gaccgttttc cataacaggg gtagattact gggggcctat 4620 ccatttgaaa cccatacatc gccgtgcagc acctgggaag gcttacgtcg ccgtctttat 4680 atgctttagc actaaagcgg tgcatttaga gttggtatcc gatctaagca cagccaaatt 4740 tatccaatct ttacgtcgct tcgtggctcg ccgaggactt tgcaccgaac tgcacagtga 4800 taacggaaga aactttgtcg gagctgctaa tgagttgaaa caactagtga acagcaaaga 4860 ttaccaaaac gctgtagctc aggagtgcaa caaaaatcaa attcgatggc attttaatcc 4920 tccaaaggcg tctaacttcg gcggtctgtg ggaggccgct atccattccg cccaaaagca 4980 tttcgtccgt gttcttggaa aaaacactct ttctcacgac gatatggaga ccctcttgtg 5040 tcaaattgag tcctgtttga attcgcggcc gctcgttgcc ttgagtgacg acccatcgga 5100 tttcgaaccg ttgacacccg ggcatttcct gactggatcg gcactgaagg cagttcctga 5160 tgctgattat actgagattg ccgtcaatcg actctcgagg tggcatcacg ttcaaaaact 5220 gtaccagcag ctgtggaaaa ggtggcactt ggagtatctt tctaccctgc aaccaagatc 5280 caaatggctt tctcctcctg tccaaatcaa ggaaaaccag ctcgtaatcg tccgtgatga 5340 gaacagtcca ccaatgcatt ggccaacagc acggattgtt caaacacatc ctggatccga 5400 cggcatcgta agggtagtga cgctccaaac accaactggc cacttcgttc gtccggtaaa 5460 caaattatgc tttctgccaa tctcatcatc ggtcgaccaa caagtacaaa ctcaaataca 5520 ggaatcacca tccgcttcag cacaggatta ccagaagaag catccaaatt gaattttccg 5580 ggcctgttgt aaacagggat caggtgagca cctgtccttt ccatttaatt ttacgtgaac 5640 ccgctaccgg gtctaatgtt ttgcagaaaa tcatccgttt gacgtctcaa atcaaaacaa 5700 cgaagttcat cgatcacgca gcaagagtga caacccatca tcgagaatcg tcatcatcat 5760 catcgacaga gctgtaagaa accaaggtgt ataaactagt aggtgagcta atctactcaa 5820 gttgtggaga tatgcagcga cctagtcagc aaccgtgttc ctgaaggggc cagca 5875 // ID BEL-106_AA-I repbase; DNA; INV; 6118 BP. XX AC supercont1.254; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-106_AA_; KW BEL-106_AA-LTR; BEL-106_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6118 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.254; Positions 391405 385288. XX CC Positions [5009-5611] - Integrase core CC 'ACAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(638..1924,1928..6118) FT /product="BEL-106_AA-I_1p" FT /translation="MTREEDLAEVVETTKNQLCSLESSLKRMASEAKKLKD FT PSLIHVKTTISLLDSVYEQASNVLLKLEGFEGPQQQQRREPLMAIYLNAKV FT TLEELAQALRPAPTAGSPNGLLDQTIQQSASRADHLPRIDLPRFNGSPTEW FT LAFKSRFEKRIATIGEDADKFAFLTKCLEHFEPAKNSIEALENSGATFVEA FT WAKLETRFYKKRIAYEGYFSKLLKLKKSTTPNAKAIMALIDAVDTTVHAAK FT QIQNDRNPNLDCVANGLLVSLVKARLDEGTLSRMEDKMDLQTVYTWTEFKG FT ELEKHANQLACLNMSEVKSASTKVMGALTQTKNYPENKKSHPCDICAKDGH FT KIWSCPEFATIKPRKRQEVAKQKRLCFNCLQKGHGVSNCRSTYTCRTCQGK FT HHSMLHEEKDSTATPPLQQVNAVSNSAPSSSNSQNLAEGYVFLATAMVKIT FT SPNGTFQPVRCLLDSGSQLEAIAESTFKRLGVPHFNEKTIVSGIGGMLQLN FT SKTKVVLIAKSTNYTMEVELVILPKLLMDQPTQCINLQDVDIPANILLADP FT HFNVKGPIELLLGARVFYQVMEGEQKRIGRGPTFQNSKFGWLACGLVATGG FT TNACSGPIFQESAMGWIAQEPSNHVSPASIALSINREPSSSAYPEDELAKL FT IEKFWKLEEAADLVHQPAALCNNEAEAHFRNTVKTNADGRYVVRIPLRGEP FT SSLGESYSQAYRRFTSLERRLERSPQTYHEYRKFMEEYKSLGHMKPISNED FT FNKVKFFIPHSCVIKPDSTSTKLRVVFDASAKTSSGVSLNDIQVIGPTIQK FT ELFDQLIEFKTHDKVLMGDITKMYRQVQVAEEDTWLQCILWRNQSCEPIEA FT FRLTTVTYGEAASSFLACRALHAAGEDYREIDPEIADLIQKSFYVDNLMIG FT AASTEELQRMKKGIQDALLRHGFPLRKWASNDASTLEGTPPEDLEPLIQMG FT DQEVIKTLGVAWNPLHDTFQFISNGKETGSLKTLTKRQMVSKILRLYDPIG FT LVQPVIVTAKIQMQQLWKHRLNWDDEIPSDVIKEWKEFESSLAELQRIEFP FT RMTIPSKTIVLDLHGFSDASGKAYGCVIYLHAIDLQGNESTNLLCSKSRVA FT PLKYEEPPRSATSSTKATIPRLELQAALLLSELSARIKGILGKRITSEHYY FT SDSQVTLSWIKSPPSRWDTFIRNRVMKIHQLTNPNHWEYVPTKENPADMVS FT RGISVKKLIQSELNFWLYGPEYIKKRETYPRNKYRYDSTAEIEQRRAPTAL FT LIAAKGKDCADLIANYPHHNSYFKTVRHFAWLSRAINNLKGINANTGMNHK FT RSGPLTKIELDQGLQLIIKTMQTTIYPNEMGELNRNGTIPAKGAFEHIKVK FT VERGILHIAGRLQNANMPNSEKNPMLVPKSHPFARVIIKNIHEMRFHAGTN FT IVMSEFRLQFWMRDLRRTVQGVIRHCVSCARARPRRLQQEMGQLPAPRVNQ FT SAAFSHTGVDLCGPFEVLLNQRSKCKIIAYACIFVCFTTKAVHLEVVEDQS FT ASAFISALLRFTSMRGVPEVIYSDNGRNFVGASRELNRLRKAYNNQNFQNK FT IIDTAAKSGIHFSFIPPRSPNFGGLWESNIKVAKRLFSAAARGAQLNIMEL FT QTLFYQVAAIMNSRPLTAVYAGTEAPTPLTPGHFLIARPMQAIPMPTQPEN FT TVNLTTRWKRVQHQTEQFWRRWHTEYLHQLRNYAKWTKRHENVKVGQIVLI FT GDDNIPVARWPMGIITQVHPGEDGIVRVAVVRTSTGTYKRNVRTLAPLPIE FT EEHESTHNQESHSVIDKDVTITQRETSEFEDEPPPQAPQMIWDGRLRARPK FT GGRK" XX SQ Sequence 6118 BP; 1959 A; 1469 C; 1420 G; 1270 T; 0 other; tttttggtga ccccgacgtg atcctcgact cgcgcgcgga tcgaggatcg ccgcataagt 60 gtaataaaaa ctaatcaaat caccgcaccc gagcggtggt tagtgcatag tgacgaatcg 120 aaacaccccc cgtgcgggat ggaaagtgca aaaaattgaa cgaacacccg ccagcggcgg 180 caagtggcaa ctctaagccg ccacctgtgt gccgcgaaaa aaaaaaaaaa aaattgaacc 240 accgtctgca aacggtgcct agtgtacaaa ttgaatcacc cgggagttgg ggcgagtgaa 300 taaatctaac cagttagtgt gggaaaagtg ctaaagttga acaccgattg acggtgacgg 360 aacggtctgt gcaaatgacc cgaaatagtg aaatttaaac accgttcgat acggtcaaag 420 tgattcgcgt gttgctctga gtgatgtcgt ggaaattcaa tcaaggccac ccgctcgtgg 480 ccaccgcgta gtgccccctg agtggaaaac gaaaatagtg atcgcaggtt cctcccgcgt 540 ggagaaccac gaacaaaaag gtgatcgccg cccgcgtgcg gtaactgcgg tttgctccat 600 tagtgagagg caatcataaa ttgtcatacc atccaaaatg acaagagaag aagaccttgc 660 tgaggtggtt gagaccacca agaatcagct gtgttccctg gaaagcagtt tgaaaagaat 720 ggccagcgag gccaaaaaat tgaaggatcc ttcattgatt cacgtaaaaa caactatctc 780 attactggat agtgtgtacg agcaggcgtc aaacgttctg ctaaagttgg agggctttga 840 aggccctcaa cagcaacagc gtagggagcc actaatggca atctacctca acgcgaaggt 900 aacgctggag gagctggcgc aagcactgag gccagctccc acagctggtt caccaaatgg 960 actactggat cagacgatcc agcagtctgc aagccgagct gatcatctgc cccgaataga 1020 cctgccacgg ttcaacggtt cgccaacaga atggttggca ttcaagagca gattcgagaa 1080 gcgaatcgca acgattggag aggatgcgga caaattcgca ttcctcacca aatgcctgga 1140 gcatttcgag cctgcaaaga actcgatcga agcgctcgag aactccggcg ccaccttcgt 1200 cgaagcgtgg gcgaagctgg aaaccagatt ttataagaag agaatagctt atgagggtta 1260 tttctcaaaa ttactcaaac tcaaaaagag cacgacgcca aatgccaaag ccatcatggc 1320 gttaatagac gcggtggaca ccacggttca cgccgccaaa caaatccaaa acgaccgaaa 1380 tccaaatttg gattgtgtgg ctaacggatt actcgtaagc ctcgttaaag cccgactaga 1440 tgaaggcacg ctctctagaa tggaagacaa gatggacctg caaacagtct acacctggac 1500 cgaattcaaa ggagaattgg aaaaacacgc aaaccaattg gcatgtctaa acatgagcga 1560 ggtcaaatca gcctccacaa aggtaatggg tgcactaacc caaacaaaaa attatccaga 1620 aaataaaaaa tcgcacccat gcgatatttg cgccaaggat ggacacaaaa tctggtcatg 1680 tcctgaattc gccacgatca aaccccgaaa acgtcaagaa gtagcaaagc agaaacgact 1740 ctgcttcaac tgccttcaga aaggacacgg agtgtccaac tgtagatcga cttacacatg 1800 tcgaacttgc cagggaaaac atcactcgat gcttcatgag gaaaaagatt caaccgccac 1860 tccccctctc cagcaagtga atgcagtctc aaactcagca ccatcctcat ctaacagtca 1920 aaattgacta gcagagggct atgtgttctt ggccactgca atggtcaaga ttacttcgcc 1980 caacggaacc ttccaacccg tccgatgcct attggactct ggaagtcaac tagaagctat 2040 agcagagagt actttcaaac gactgggagt gcctcatttc aacgaaaaaa ctattgtcag 2100 cggaatagga ggtatgttgc aacttaattc taaaacaaag gtggtactaa tagcaaaaag 2160 taccaattac actatggagg tagagttggt catccttcct aaattgttaa tggaccaacc 2220 tacacaatgc attaatcttc aggatgtgga tattccagcc aacattttgt tggctgaccc 2280 acacttcaat gtgaaaggcc ccatcgagct gctattaggt gcacgggtat tttaccaagt 2340 gatggaggga gaacaaaagc gaattggaag gggcccaacc ttccaaaact ctaaatttgg 2400 gtggctggca tgcggactag ttgcaacagg aggcacgaac gcctgcagtg gtccaatatt 2460 ccaagaatca gcgatggggt ggatcgcaca agaaccatcc aaccatgtta gtccagccag 2520 catagcattg tcaataaatc gcgaaccaag tagttcagcg tatccggagg atgaattggc 2580 aaaattaatc gaaaaatttt ggaagctgga ggaggccgca gacctggtgc accaaccagc 2640 ggccctgtgc aacaatgaag cggaagctca tttccgcaac acagtgaaaa cgaacgccga 2700 cggcaggtac gtcgttcgta tcccgctacg gggggagcca tcctctctag gggaatccta 2760 ctctcaagct tatcgccgtt ttacgtctct cgagcggaga ctagaacgca gcccgcaaac 2820 gtaccatgaa taccggaaat ttatggaaga atacaagagt ttggggcata tgaaacccat 2880 cagcaatgaa gatttcaaca aggtgaagtt ctttattcct cactcctgtg taattaagcc 2940 tgactcaaca tccaccaaat taagggtggt gtttgatgcc agcgctaaaa cgtcaagtgg 3000 agtatcccta aacgacatcc aagtcatagg acccactatt cagaaggaac tgttcgacca 3060 gctgatcgaa ttcaaaacgc acgacaaagt gctgatggga gatataacaa aaatgtaccg 3120 tcaagtacaa gtagctgagg aggacacttg gctacaatgt attctctggc gaaaccaatc 3180 atgtgaacca attgaagcgt tcagactgac aacggtcaca tatggggaag ctgcttcctc 3240 gtttctggct tgccgagccc tacacgcagc tggtgaagat tatagggaaa ttgatcccga 3300 aatcgctgat ctcatacaaa aatcgtttta cgtcgacaat ttaatgatcg gagcagcatc 3360 aactgaggaa ctgcaacgaa tgaaaaaagg gatacaagat gcccttcttc gacacggatt 3420 cccattgagg aaatgggcat ccaacgacgc ttccaccctg gaaggaacgc cacctgagga 3480 cctcgaaccc ttgatccaaa tgggagatca agaagtgata aagacactag gggtagcatg 3540 gaaccccttg catgacactt tccaattcat tagcaacgga aaagaaacgg gttcacttaa 3600 aaccctgacg aaacgacaga tggtgtcaaa aattctgcgt ctttacgacc caataggctt 3660 agtgcagcca gtaatcgtga cggccaaaat acaaatgcag caactctgga aacatcgctt 3720 aaattgggac gatgaaatcc cgagtgacgt aattaaagaa tggaaggagt tcgaatcgtc 3780 gttggcagaa ctacaaagaa tcgaattccc gagaatgaca atacccagca aaactatcgt 3840 attagatttg cacggtttca gcgacgcctc aggcaaagca tatggatgcg taatatattt 3900 gcatgccatc gatctgcaag ggaatgaaag cacaaattta ttgtgctcta aatcgcgagt 3960 agctccacta aaatacgaag aaccacccag aagtgcgact tcgtcaacaa aagccacgat 4020 accgcgcttg gaattgcaag cggccttact gctaagcgaa ttgagcgcta gaataaaagg 4080 cattctagga aagaggataa cctctgaaca ctactacagt gactcccaag ttactctcag 4140 ctggataaag tcgcctccgt cccgctggga cacctttatc cgaaatcgag tcatgaagat 4200 tcaccagtta acaaacccaa atcactggga atacgtgccc acaaaggaaa atcccgctga 4260 tatggtgtct aggggaattt ccgtaaagaa gctgatccaa tcagaactca acttctggct 4320 atacggtcca gagtacatca agaaacgaga gacttatcca agaaataagt acagatatga 4380 ttcaacagca gaaatagagc aacgtagagc gccaacagca ttactgatag cagctaaagg 4440 caaagattgc gctgacctca tagctaacta cccgcaccac aattcgtatt tcaaaacagt 4500 gagacatttc gcatggttga gtcgagccat caacaatcta aaaggaataa atgccaatac 4560 gggcatgaat cacaagaggt caggccctct gactaaaatc gaactcgatc aaggcttgca 4620 actcataatc aaaacgatgc aaaccactat ttacccgaat gaaatgggag aattaaacag 4680 gaatggaacg atacctgcta aaggagcatt cgagcatatc aaagtcaagg tggaacgtgg 4740 catactgcac attgcaggca ggctccaaaa tgctaacatg ccaaattcag agaaaaaccc 4800 aatgttggtg ccaaagtcac acccattcgc acgagtaatt atcaagaata ttcacgaaat 4860 gcgattccac gccggaacaa atatcgttat gagtgagttt cgattacaat tttggatgag 4920 agatcttcgg cgcacagtgc aaggcgtcat acggcattgc gtctcatgtg cgcgagcgcg 4980 gccaagaagg ctacaacaag aaatggggca actgccagcc ccaagagtca atcaatcagc 5040 agcgttctct cacaccggag tggacttatg tggtccgttc gaagtattgc taaaccaacg 5100 atcaaaatgc aagatcatcg catacgcttg catttttgtc tgcttcacta ctaaggcagt 5160 gcacctagaa gtagtcgagg atcagtctgc atcagccttt atctcggctt tactcagatt 5220 cacttcaatg cgaggagttc cagaggtcat atactccgat aatggacgga acttcgtagg 5280 tgccagtagg gagctcaatc gcctacgaaa ggcttacaac aaccaaaact ttcaaaacaa 5340 aattatcgac actgcagcga agagtgggat tcacttctca ttcattcccc cacgaagccc 5400 aaactttgga gggctgtggg aatccaacat aaaggtagca aagagattgt tcagtgcagc 5460 agctagagga gctcaactga atataatgga gcttcaaaca ctattttatc aggtggccgc 5520 gattatgaat tcacgaccat tgaccgcagt atacgcagga acagaagctc caacaccatt 5580 gacgccaggg cactttttga tcgcaaggcc catgcaggcc atacccatgc ccactcaacc 5640 tgaaaatact gtcaacctca caacgaggtg gaaacgtgtg cagcatcaga cagaacaatt 5700 ttggagaagg tggcacacag aatatcttca tcaacttcgc aattatgcaa aatggacgaa 5760 acgacacgag aacgttaaag ttggccagat agtgctgatt ggcgatgaca acattcccgt 5820 ggcgcgctgg cccatgggaa ttatcacaca agttcatccc ggcgaggatg gaatagttcg 5880 cgtggcagtg gttcgaacct ctactggcac gtacaaacgc aacgttcgca cattggcgcc 5940 tctaccaata gaagaggagc acgaatccac acacaaccag gagagtcaca gcgtcatcga 6000 taaagacgtt acaattactc agcgagaaac ttcagagttc gaagatgaac ccccaccgca 6060 agcaccacaa atgatttggg acggtcgcct acgcgctcgt cccaaagggg ggagaaaa 6118 // ID Gypsy-600_AA-I repbase; DNA; INV; 7755 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-600_AA_; KW Gypsy-600_AA-LTR; Ty3_gypsy_Ele178; Gypsy-600_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7755 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3273-3776] - Reverse transcriptase CC Positions [4845-5324] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 892..2229 FT /product="Gypsy-600_AA-I_1p" FT /translation="MLTEPEGALGGLRSLSPVSHGGNYTEKASSNNNDEKT FT TGTKPKQVLVPVDNNPSVVTVSRVEWEEMQRTLAELTAKLAEVNVSKSTAN FT TINCGIQTGMIERPIPPTNSVQRHSLPNKYGHVSGPPQPPNSRRTPTEHLD FT RDRRQLPMQQRPSNDFRCVEQETSSDDEPDRQPWAPRQHYADFRVPRETAG FT FDQQAGPYDMYSRNRNLGFRGSEVDDWTCSDYDDRYRNRRYTRNFNNSYQS FT RIEKWKLRFSGDQRSVSIENFLYKAKKLAEREGIPRDILLRDIHMLLEGSA FT SDWFFTFVDELQTWENFETSITYRFGNPNMDQGIRSKIHERKQLRGESFIA FT FVSEIEKLDRMLSRPLSKRRKFEVVWDNMRQHYRSKISLVEVKDLQQLIRL FT NHRIDAADPQLQQQAGEFPNRRPIHQIEADYSEYESDQSASINAIGSRNGR FT N" FT CDS 2412..5717 FT /product="Gypsy-600_AA-I_2p" FT /translation="MLQTSGRRTPTGKRMKGCDVGNSFIPRLTDIPKPLPS FT NETYNPFSQIYHIRIQHTNCPHIRVRIFDSEFEALLDSGAGISILNSLDIV FT DKYRLKIQPAAIRVTTADGSAYGCLGYVNVPFCYKGLTKVVPTLVVPEIAR FT HLILGADFWKSFGIKPMIDLGQGLEELETVEHRNDRPQCFTIEPTGELPGL FT ETVEEDETLEIPVYEGPTESDPNVDTIETEHDLTEDQRKQLTDVVKQFELT FT CVGKLGRTSLIEHEIELKEGAKPRNTPMYKCSPYVQQAINDEVERFKSLDA FT IEECYSEWTNPLVPVLKKNGKVRVCLDSRRINKMTVKDTYPMRNMQDIFRQ FT LGKAKYFTVIDLKDAYFQIPLKQESRNLTAFRTTKGVFRFKVLPFGLTNAP FT FTMARLMDKAIGFDLEPYVFVYLDDIVIASNTFDEHLRLLRIVADRLRKAG FT LTISLEKSRFCRKQVVYLGYLLNENGVAIDTSRIQPILDYAQPKSQEDIRR FT LMGLAGFYQRFIQNYSKITAPITDLLTKENKKKFKWTEEAEEAFRELKSVL FT TSAPILGNPDFSKMFTIESDASDRAVGAALVQEQDGVTRVISYFSKKLNRT FT QRRYSAVEKECLGVLSAIQHFRHYVEGAKFRVVTDARSLLWLFNVGTETGN FT AKLLRWALRIQAYDFDLEYRKGKANITADCLSRSIQVDAIVLTEPDFEYED FT LIDKIQSDPGKFTDFRVVDGQVYRYVKSQSRQSDPRFEWKLIPNASQRKII FT VRKEHEKAHLGYEKTLATVKERYFWPRMNEEVRRYCRSCLKCQVSKSGNSN FT VTPPMGAQKPVEYPWQFVTLDYVGPLPPSGRGRHTCLLVATDVFSKFILVQ FT PFREAKAQSLVEFVENMIFRLFGVPEIILTDNGSQFVSKQFKDLLKNYNVS FT HWLTPAYHPQVNNTERVNRVVTTAIRATLKGEHNHWTDNLQEIADAIRNAV FT HDSTKHSPYFVVFGRNKISDGSEYSRIRDNQPTTSDNNQTTTERRQKLFDQ FT IKQNLTTAYAKHSRTYNLRSNPNCPTYSVGEKVLKQTFDLSNKGKGFCRKL FT APKYEPAIVRKVLGTNTYELEDQDGKRLGVYFANRLKKMQAHLNH" XX SQ Sequence 7755 BP; 2292 A; 1550 C; 1731 G; 2159 T; 23 other; tttggcgccc aacgaaaaaa aaaaggcttc tattagagta gttttcgata tttatatatt 60 ttcagatttt ttcatataca attggcttat ttatttattt atatatgtgt ttatttattt 120 atttatttat ttatttattt atttatttat ttatttattt atttattcgt tttttattta 180 tttatttatt tatttattta tttatttatt tatttgttta ttcatttatt tatttattta 240 tttattttat ttttattata gttatttatt tacacatcct catatttatt cgttatatcc 300 tatatttatt tacatattta tttatttatt catatacata tacatatttg gattgagaat 360 ctcatctgtt tgtgctaact ttacttgttc attgtaattc attggaaaaa ttgttgaata 420 attgaatttt gtctaattgg atatttcaaa tctaacgact tgctcaaaat ggtgcgtaca 480 ttccctcgcg ctgaagatct caaccaagaa gagatcgaat atgaattctt gatacgcaat 540 caaccacaag atatatacac actcgatttg gcaggaaaac agagacattt gaggagtttg 600 ttcaaaaatg atgcaaaaga aggtgtgtcg tatcgatcac ctttatcaat atcagaggaa 660 gctcaacata ttcagggccg aatcgaaaac ctcgagaaag ctctagagaa aaaggtggaa 720 tcgaaatatg aatccagggt tttgcactat tggtatcgag tgaaatggag tagggctaat 780 ggggaagaag agaagagaat tcgcagagag ttaagtcaga ggattgagtc gatcatgaac 840 cggtttcaat ttggaccacc ccaaamccct ttagtaaatc aaggcaatcg catgctwacg 900 gaaccagagg gagcgttagg tggtctaaga agtttgtcac cagtatccca cggtgggaat 960 tataccgaga aagcgtcttc gaataacaac gacgagaaga caactggaac aaaacccaag 1020 caggtattgg tgccagtwga taacaaccct tcagtagtca ctgttagtcg tgtggaatgg 1080 gaggaaatgc aaagaacttt ggcggagtta acagctaagt tagcagaagt taatgtgtcg 1140 aaatcaacag ctaacacgat aaattgtggt attcaaactg gaatgataga acgcccaatt 1200 ccaccaacta attcagtgca acgacactcg ctgccaaaca aatatgggca tgtctcaggg 1260 ccacctcaac caccgaattc taggagaact ccaacggaac atttagaccg tgatagaaga 1320 cagttaccaa tgcaacaacg accgtcgaat gattttcggt gtgtcgaaca ggagacgagt 1380 agtgatgacg aacctgacag acaaccctgg gcaccacgac agcattatgc tgattttaga 1440 gtgccacgtg agacggcagg gttcgatcag caggcgggac catatgacat gtatagtagg 1500 aataggaacc ttggatttag ggggagcgag gtagatgatt ggacctgctc cgattatgat 1560 gacaggtatc gaaaccgacg ttacacacgt aactttaaca attcctatca aagccgtatt 1620 gagaagtgga aattgagatt ctcaggggat cagaggtcgg tctccattga aaacttcttg 1680 tataaggcta agaagctggc tgaacgtgaa gggattccaa gagacatatt gctgagagac 1740 atacatatgc tactcgaagg atcagcatca gattggtttt ttacgtttgt agatgagcta 1800 caaacgtggg aaaattttga aacgtcgatc acctataggt ttggaaaccc gaacatggat 1860 cagggtattc gatccaaaat ccatgaaagg aagcaattgc ggggagagtc ttttattgcc 1920 tttgtctccg agattgagaa gttggatcga atgttgtctc gaccgctttc aaaacgacgg 1980 aagtttgaag tagtgtggga taacatgcgc caacactaca ggtccaagat ttccctggtc 2040 gaggtgaaag accttcaaca gttgatcagg ttgaatcacc gaatcgacgc agctgaccca 2100 cagcttcagc agcaagctgg tgaatttcct aatcggcgtc caatccacca gattgaagct 2160 gactacagcg aatacgagag cgaccagtct gcttcgatca acgcgattgg aagtcggaat 2220 ggtagaaatc mmaggcaagg aaaccaacag acgaatgcta gagwaccacg gcaggttcca 2280 gcaaacgcga accagttagg ttgttggaac tgtcaggaac caggacatag ttggcgtcaa 2340 tgtcagaaac caaaggttgt cttctgctat gggtgcggaa atttaggacg aaccattcgc 2400 aactgtgagc gatgctccag accagcggaa ggaggacacc aacagggaaa cgaatgaagg 2460 ggtgcgacgt tgggaattcg ttcatccctc gcctaaccga cattcccaaa ccactaccat 2520 cgaacgaaac ctataatcca ttttcacaaa tctaccatat aagaatacaa catacaaatt 2580 gtccccatat cagagtaagg atctttgatt cagaatttga ggctctttta gactcggggg 2640 ctggtataag catattaaac tcccttgata tagtcgataa atacagatta aaaattcaac 2700 cagcagcaat tcgggtaaca acggccgatg ggtctgcgta cggttgtctg ggttacgtta 2760 acgtaccttt ctgctacaaa ggactaacca aagtagtacc gacgttagtg gtcccagaga 2820 tcgctcgtca cctgatcctt ggtgccgact tttggaagtc ttttgggatt aaacccatga 2880 tagacctagg ccaaggatta gaggagctag aaacagtaga acaccggaat gatcgaccac 2940 aatgtttcac gattgaaccg actggtgaac ttccaggtct ggaaacagtc gaggaagacg 3000 aaacccttga aatacccgta tacgagggac caactgaatc agatcccaat gtagacacga 3060 tagaaaccga gcacgatctc acagaggatc agcggaaaca attgacagac gtcgtgaagc 3120 agtttgagct gacatgtgtg gggaagttag ggagaacgag cttaattgaa cacgagatag 3180 agctcaaaga gggtgccaaa cctcggaaca cacccatgta caaatgctcg ccgtatgtac 3240 agcaagcaat taacgacgaa gtagagcgtt tcaaaagtct tgatgctata gaggaatgct 3300 atagcgaatg gacaaacccc cttgttcccg tccttaaaaa gaatggaaaa gtaagggttt 3360 gtctggattc gcgaagaata aacaagatga ccgtaaagga cacgtatcca atgcggaata 3420 tgcaggatat attccgacaa ttgggcaagg cgaagtactt cacggtcatt gatctcaaag 3480 acgcctattt tcaaattcct ttgaaacagg aaagtcgaaa tttaacagcg tttcgtacga 3540 caaagggcgt attccgtttc aaggtactcc catttggact aacaaacgca ccgttcacta 3600 tggccagact aatggacaag gcgataggtt tcgacctgga accatatgtg ttcgtatatc 3660 tggacgatat agtgatcgcg tcaaacacat ttgacgaaca tttacggctg ttgagaattg 3720 tagcggacag acttcgaaag gcagggttga caatctcctt ggagaaatcc agattctgcc 3780 gaaaacaagt tgtctactta ggatacctgt taaatgagaa cggagttgct atagacacgt 3840 ccagaatcca gccgattctc gattatgcgc agcctaaatc gcaagaagac attaggagat 3900 taatgggtct cgcaggcttt taccaacgat tcattcagaa ttacagtaaa ataacggctc 3960 cgattackga tctgttgacg aaggagaata agaaaaagtt caagtggact gaggaagccg 4020 aagaagcatt tcgggaattg aaatccgtgc tgacatcggc accaattctc ggtaacccag 4080 acttttcaaa aatgttcacg attgagtcgg atgcctcaga ccgagcggta ggggcagccc 4140 tcgtccaaga acaagacggc gttacacgag tgataagcta ttttagcaag aaattaaatc 4200 gcacccaacg ccggtactca gccgtggaga aggagtgcct tggcgtcctc tcggcaatac 4260 aacattttcg ccactacgta gaaggcgcca agttccgtgt ggtcactgac gccagaagtc 4320 tattgtggct tttcaatgtg ggtacggaaa cagggaatgc gaaactcctt cgatgggccc 4380 tccgtattca ggcatacgac ttcgaccttg aatatcggaa agggaaggcc aatattacag 4440 cagattgtct gtcacgttcc attcaggtcg acgcgatcgt attgaccgaa ccggactttg 4500 aatacgaaga cttgatagac aaaattcaat cggaccctgg gaaattcacc gattttaggg 4560 tggtcgatgg acaggtgtac cgatacgtaa aaagtcagag tcgccaatca gaccctcgtt 4620 tcgagtggaa actaatccct aatgcatccc aacgaaagat tattgtgcgc aaagaacacg 4680 agaaagctca tctgggatat gagaagacgt tagcgacagt gaaagagcgt tatttttggc 4740 ctcgtatgaa tgaagaggtg cgtaggtact gtagaagttg tctgaagtgc caggttagca 4800 aatccgggaa ttcaaatgtt acgccaccaa tgggagccca gaaaccagtc gagtacccgt 4860 ggcaattcgt caccctcgat tatgtcggac cgttgccgcc gtctggtcgt ggaagacaca 4920 cgtgtcttct agttgcaacg gacgtcttca gcaagtttat cttagtgcag cctttcagag 4980 aggcgaaggc acaatcactg gtggagttcg tagaaaatat gatttttcgg ttgttcggtg 5040 tccccgagat aatactgacg gataacggat cccagtttgt gtcgaaacag tttaaagact 5100 tgttgaaaaa ctacaacgtt tctcactggc ttactcctgc ataccatccc caggttaata 5160 acaccgagcg tgttaaccgg gttgtcacga ctgctattag agcaacgctc aaaggggagc 5220 ataaccattg gaccgataac ctgcaggaaa tcgctgatgc tatcmggaat gcggtacacg 5280 actctacaaa acatagtccg tatttcgtgg tgttcggtag gaacaaaatc tctgatggtt 5340 ccgaatactc acgcattcga gacaatcaac caacgactag cgacaacaac cagactacga 5400 cggaaagaag acagaaacta ttcgaccaaa tcaaacagaa tttgacgacc gcctacgcaa 5460 aacactcaag gacctacaat ctcagatcaa atccaaattg tccaacttac tctgtgggtg 5520 aaaaagtgct gaagcaaacg tttgatctgt ccaataaggg taaagggttc tgtaggaagt 5580 tagctccaaa atacgagcct gcaattgtaa gaaaagtttt ggggactaac acttacgaac 5640 tggaggacca ggacggaaaa cgtttgggag tctactttgc caacaggctg aaaaagatgc 5700 aggctcatct aaaccattga gtggtaacct gtttttcaag ctatgaacct cctagtaagc 5760 ttttcacacc gtgaatctac tagcgagcga ccactaacta cgatgagaag aaaatgccta 5820 cgggttaatt gcatctagga caatttcgac ggactcgtgg gaggaaagtt ttcgaacgaa 5880 ccgagcactg agggtaccga aattgatccg aaaaccagct atgaacaact ttccgaagtg 5940 actacaagac tacgcgtcga ttcaaatacc aaatagggcg aaacttagta gacttacgac 6000 ccactgaggc gtacgcgact gtaaatatgt aaatagttag ttgttagtta tcctagttcc 6060 ctaaccctta agattattgt tagtaattag cctaattgtt aaagtaaatt agttcaatca 6120 ctcacgattt tctttcgttt tggtttgtca tccttgtaaa tatagattta gacagtttgt 6180 tctgaatttt ccttcattcg tcatatattt cctttccggt tacatagttt gwtatttatt 6240 taccagtttt cgttgatttt tgttcctaaa agtgaaatat agcataaggw twtcgttttt 6300 tgtatacact ttgcatatcg gwacgtttct ttacctgttt tgtagtcgtt tcgtagttgt 6360 agtgcagtag tcctctcgat tctcaatcct tagccttcct tttcgtcttc cagccatcct 6420 tcttgttccg ttttctgtcs caccagattc gtacmtccgt wattccgtcc tccaatcctg 6480 tccattatcc tgattgaatc gtccatccag tttccgtacc gttatagtcc gttctcgtcc 6540 tccagtgcag tgtactgttg gggtcggtcc acctgtaaaa aagtaaacaa acgcaaacaa 6600 cttacatccg atgttttgga acgataatga tcgtatactc tcctcaaatc tctgccagcc 6660 gtagcgaaas ccttcctgga tactccgaaa tagcattttc acttgctgca aacccggaaa 6720 macttwwatt ttcaccaaaa atacgcattc cggacgmwcw cttttctcgc cgcgagcttt 6780 ttcactgccg tggtgattca ctttgacgtt tcgttttgtt gatgacctgg agtgcatgat 6840 agactggtat gagtgcgaga tgaactgagc gataaaattc atcgcgttsa tgttggtttg 6900 aatgcgaggg gtgcttgaat ggtattttgt gtcaatgttt tataatttct tgaagaattt 6960 tgttgtcggg taaatcccgg tgggtaaaaa ttcttttaag agcacaattt tcgtttctat 7020 gtcacaatga gcgtttatcg cggatcatta tccgtggctg cgaattgtga gaagaaacag 7080 ttattttttg atgagaaaga gttgcatctc gcgcgctcgc ttcgtgttgg gctagagatt 7140 gtaattctaa ctcatcggtc tgcgtccact tggtggaaat tttcgtgtac gttgtcttcg 7200 tttatcggca ttgtgtcctg gtgtagaaga tagcgtatcg ttctgtgtca gtttaccgcg 7260 tgagctcact caatgatgtc aaccacattg agtagatgcc ttttaaatgt tgtcaagttt 7320 tcgtgttgaa tgtttttttt ctcaattttg cgtgtttgag ggtcatcagc aacaacaaca 7380 actcttcttc aaaaatgctt tcctttttct tgtcaaatcg agatttagta gtaaatgtaa 7440 atacaaatta attgtttagt tataaatgtt caagttgatt ttcgctggag actggagact 7500 taatgtcgct gatagtcagt agcaggcagg ttccgtgata gaatctgtgc tgtaaacaga 7560 gatggagtca actgcggttg acgatgaatc tatatgaccc cgaaaattta aactgaatgt 7620 ttttgtagtt attagtagtt tttgtatata tgttaatata acttgtattg attgtttggc 7680 ctacgaaaat ttgattggtg attgtaatct ccattccaat gccaatcaaa ttttcgtact 7740 tttagtgcgg aatga 7755 // ID Gypsy7-LTR_Dya repbase; DNA; INV; 205 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7_Dya; KW Gypsy7-I_Dya; Gypsy7-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1067-1067 (2009). XX DR Genome; chrU; Positions 1431488 1431284. XX SQ Sequence 205 BP; 65 A; 40 C; 60 G; 40 T; 0 other; tgtgggaacc tgtaccaaac agcgacgcta ctggaacaag cagcgacgtc acagacgcgg 60 gagcgggatg agtgggcagc gagagaggga gagtcagtca attgtgaacc accggattga 120 taagtcgtat taccgaagaa cgaataaaac agatcaataa cagaagctca gcgtgtatca 180 attgtgtatt ttcggggtct ttaca 205 // ID Gypsy-147_AA-I repbase; DNA; INV; 7038 BP. XX AC AAGE02030242; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-147_AA_; KW Gypsy-147_AA-LTR; Gypsy-147_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7038 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02030242; Positions 32930 25893. XX CC Positions [3724-4185] - Reverse transcriptase CC Positions [5212-5688] - Integrase core CC 'ATTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 449..2938 FT /product="Gypsy-147_AA-I_2p" FT /translation="MNMDREYQVQMSVTYLTPDEVGFELEIRNLTKERERS FT LSVSRRKLKNALREEAEGLIRIFSYKRDPKIELAICRERLERVYDELNDSS FT LDRQMSKTALLHLYHRLKLFKRTFGVGSYGNDATWLFSKVIQTYVSFFNEN FT ALYSPFSVTVQNTVPLELQGAASLAAQVAEQSHDLINLEQPILTSSVGETD FT PTISVPPDAQPVSNPSLIAAENDPHLSFHNLSISATSVSMPNETPLITASI FT VTVASGSDLNMTWATPRYQVASRIQSRQVRFDTARELPTELREPSMSITTV FT QTSAVNTVSSIHASIVNSVSGVRAHSIGNHPESQPDFNQYAASQQLPQNTA FT SPVQWSNPYYHPRTQPQVSFSQPNTGYSNIQQHQYPDFDNPLHSVPINHRD FT SMYQPTFPQAGNPIREHSPVQLSDTIPLRYTPTSVDLVGLYPRRSNPTAPN FT QTPNPYASSRNNFPQSNVRAHTSAPTPSLNPFQDLDFVPLANRSKTIPVVK FT WPMRYAGEDRGAGLNTFLWEVNDWTQSELISEPELLRSFGNLLTGKAKMWF FT ISNKHRFTTYSDLIDSLKATFRHPDLDHFMLLEIYQRRQQKNESFLEFFLD FT VERKFKSLTKPVSEEEIAQTVRRNLRADYKRALIGREFPDLYSLQMAGQEI FT DATNTYLFTKPQGNAQTNAIQATGKPAPFNTASREHGQGSQRGGNKGSNQQ FT QASTQQFRPKQGANPGKPGGKPFLKKDQTNPKTEDKAKRDSRPEKGDSSEG FT EAPESQKGNNGLEGQLRKHVPLNEKFVCFNCRSQQHFTGQCTQPYKVHCQV FT CGFLGFPTHRCPFCSKNSQRRKDRSPSK" FT CDS 3328..6063 FT /product="Gypsy-147_AA-I_1p" FT /translation="MDFFTAFDLHITRGEEILFLVNEEECPNSNPPRIVEL FT SSEQVAALEKVKRIFKPAMSDQLEVTSLIEHSIELKEEFKESPPIRVSPYP FT YSPTISKALNVEIDRLLQLGVIEESYSDWALNAVPIKKPNGTVRLCLDARK FT INARTKRDSYPLAHVGRILGRLGKTRYLSTIDLKDAFLQIPLSEDSKPLTA FT FCIQGRGMFQYTRLPFGLTNSPATLSRLMDKILGAGELEPWVFVYLDDIIV FT ASDCFEEHIRLLEEVAHRLRKANLSINIDKSKFCRSEVPFLGYLLSSDGLR FT PDPSKVQGILDFEPPKTIRQVRRFLGMVNYYRRFIPDFSAITTPISDILAG FT KPKIVRWSKEADEAFRLIKERLITAPILSNPDFEEEFIIQTDASDRAVAGV FT LTQIQDGSEKVISFVSQKLNSAQQNYSATEKEALAVLMAVEKFRGYVEGSH FT FTLVTDASALQYIRNNKWRPSSRLSRWSLDLQHLDMSIIHRKGTENIVPDA FT LSRSICAIKASRAGTSSFEELVRKVEKEPADFPDFRLEDGQLWKYIPVDDE FT PFDVRFEWKMVPPPENRPRIIQEEHEQSFHLGIEKTVSRLQLRYYWPHMAT FT ETRRWIQRCTVCKESKPATVPTIPVMGKQKLADHPWQIVAMDYIGPLPKSK FT AGYIHILVVQDLFSKWCQLHPVRRIEAGNLCKTIRECWFLKNSIPEIVLTD FT NATTFLSKEFKALLQQYNVKHWTTSRHHSQGNPVERLNRSINAAIRTYCQK FT DQRGWDTKIADIEHVFNNTVHSATGFTPFFVTRNHEIAISGEDHLRMRRKE FT EYEPEERTNQQKKISGEIYDLVSKNLLKAYTSSAQRYNLRKRARPEDLKVG FT QNVYRRNFKQSNAGEYYNAKLAPMYLPCRVVAKHGSSSYELEDSDGKNLGV FT WPAEHIKP" XX SQ Sequence 7038 BP; 2026 A; 1609 C; 1592 G; 1811 T; 0 other; tattgtcgct ccaacaaaaa ataggctttc gatttttcga gaattaattt tttcgttgga 60 gaaggggaat atttttggat ggttgatcac ctaaccttta gagggaaaaa attaatactc 120 tcaaaataac ctataaattt tgaacccgtc taattttatt ctggcggggt actcatactc 180 acgatatatt cgttgctaga gaaagagtag gtagcacgtg ctggaggtag gatatatcgg 240 atgatttgta gccgcacgct gctaagtaga gtagaagaca ccagcggtca accgagagac 300 aatagttgtt tgttcatatt ttctatctag ttatatttcc atacgtttcg tgcatgcttg 360 tttttttgct ttgatcgcat tgataggccc aaagtaaacg ggcagtgcca gttttttttt 420 tagatttgga ttttgcgctc atccaatcat gaatatggac agggaatatc aggtgcagat 480 gagtgttact tatcttacac ctgatgaagt cggttttgag ctcgaaatca ggaatttaac 540 gaaggagaga gaacgtagtt tgagtgtaag ccgtaggaag ctcaaaaatg cactgagaga 600 agaagcggag ggtttgattc ggatcttttc gtacaaacga gatcccaaaa ttgaacttgc 660 tatatgtcga gagaggttag agcgtgtgta tgacgaattg aatgatagtt cgctcgatag 720 acagatgtcc aagactgcgc tcttgcattt gtatcatcgt ttgaaacttt tcaaacgaac 780 ttttggtgtt ggcagttatg gtaacgatgc aacttggttg ttttcgaagg tgatacaaac 840 ctatgtatca ttcttcaacg agaatgctct ttattcccca ttttctgtca ctgtgcagaa 900 cacagttcct ttggagctgc agggagcagc tagtttggcg gctcaagtgg cagaacagtc 960 gcatgattta attaatttgg agcaaccgat tttgacttcg tcggtaggtg aaactgaccc 1020 aacaatatca gtgccacctg atgcacagcc cgtctctaac ccgagtctta ttgcagcgga 1080 aaatgatcca catttgagtt tccataattt gtctatttct gccacctcgg tttcaatgcc 1140 taacgaaaca cccttgatta ccgcgtcaat cgtcaccgtt gcgtcaggaa gtgatttaaa 1200 catgacgtgg gcaacaccac gataccaggt cgcgtcgcgt attcaatcta ggcaagttag 1260 attcgatacg gctcgcgaat tgcctaccga gttacgagaa ccatctatga gtattacaac 1320 ggtgcaaacg agtgcagtga acacggtaag ctcgattcat gcgtcgattg ttaacagtgt 1380 atcaggtgtg agagcccatt caattggtaa ccatccggaa tcacaaccag atttcaatca 1440 gtacgcagcg tcacaacagc ttccacagaa caccgcctcc ccagtacaat ggtcaaatcc 1500 atactaccat cctcgaaccc aaccccaagt tagtttttca caaccaaaca cagggtattc 1560 gaatatccaa caacatcaat acccagactt cgataaccct ttgcactcag tcccgattaa 1620 tcatcgagac tcgatgtacc aaccaacgtt cccacaagct ggcaatccaa ttagggaaca 1680 cagccctgta cagttatccg atacaatccc actgagatat actcccacat cagttgattt 1740 ggtaggtttg tatccgcgta ggagtaaccc gactgcgccg aaccaaacgc caaatcctta 1800 cgcttcttca aggaacaatt tcccccaatc aaatgttagg gctcacacct cagcaccaac 1860 tccttctctc aaccctttcc aagatttgga ttttgtacca ttggcaaacc gatcgaaaac 1920 cattccggtt gtcaaatggc cgatgcgtta cgcaggagag gatcgcggtg ccggactcaa 1980 tacgttcctg tgggaagtta atgactggac acaatcggaa ttgatttctg aaccggagct 2040 actgagatct tttggcaatc tacttaccgg aaaagccaag atgtggttca tcagtaataa 2100 acaccggttc actacctact ccgatctgat tgacagcctt aaggccacgt tccgtcatcc 2160 ggacctggat catttcatgt tgctagaaat ctaccaacgt aggcaacaga aaaacgaaag 2220 ctttctggag tttttccttg atgtcgagag gaagttcaaa agtttgacca agccagtgtc 2280 tgaggaggag attgctcaaa cggtgcggcg aaacttgcgt gccgactaca aacgtgctct 2340 cattggtcga gagtttccag atttgtactc gctacaaatg gcggggcaag aaatcgacgc 2400 aacaaacacc tatctcttca ccaagccaca ggggaatgcc caaaccaatg cgattcaggc 2460 gacagggaaa ccagcgccat tcaacacggc gtcgcgtgaa cacggtcaag gatcgcaaag 2520 gggcggaaac aagggctcaa accaacaaca ggcttcaact caacagttcc gacccaagca 2580 aggagcgaat ccaggaaaac ccggaggaaa accgtttctc aaaaaggatc aaaccaaccc 2640 caaaacggag gacaaggcaa aacgggattc tagacccgaa aaaggcgact caagtgaagg 2700 tgaggcgccg gaaagtcaga aaggaaataa tggattggaa ggccagctgc gtaaacatgt 2760 gccgttgaac gagaaattcg tgtgcttcaa ctgccgaagt caacagcatt tcaccggtca 2820 gtgcactcaa ccctacaaag ttcactgcca agtgtgtgga tttttaggtt ttcccacaca 2880 tcggtgtccg ttctgttcaa aaaactccca gcgtcggaag gacaggagtc cttccaagta 2940 actgttttta gcttgaccga ttctcttatt gcgttcgggt atttgccctt acccgaatgc 3000 gacgataata ctgaagattt tcaatgctcc tcagttttat caataataaa tgataataga 3060 ccatacataa aaccaaaaat ttttgatgca gtggttcgaa cactgttgga ttgtggaagt 3120 cagagaacat tgatctctgc cacaacgtcc ccgctttgga aaacgtctac cacgaagata 3180 ttgccaagca agttaactct cactagcgcc tcaggggatt cactggacgt ggtgggacgt 3240 gtgtttcttc cattttggtt tgaaggccaa accaaaataa ttgagaccac aatcgttgaa 3300 gacttacccg tcgaatgtat cgccggcatg gacttcttca cggcttttga tcttcatata 3360 accagaggag aagaaattct cttcttggtg aatgaagagg aatgccccaa tagtaaccca 3420 cctcgtatcg ttgagctaag ctccgaacag gttgcggcac tagaaaaagt taagagaatc 3480 ttcaagccag ctatgtctga tcagctcgag gtcacctcgc tgatcgaaca ctccatagaa 3540 ctgaaggagg aatttaaaga atctccacca atacgggtat ccccttaccc gtactccccg 3600 acaattagta aagccttgaa cgtggagata gatcggcttc ttcaactagg agtaatcgaa 3660 gagtcatatt ccgattgggc tctcaacgca gtgcccatca aaaaacccaa tgggactgta 3720 cgattatgtt tagacgcgcg aaagatcaac gcccgaacaa agcgagacag ttacccattg 3780 gctcacgtcg gccgtatact gggacgttta ggcaagacga ggtacctgag cactatcgat 3840 ctgaaagacg cttttctaca gataccgcta agtgaagact cgaagcctct taccgctttc 3900 tgcattcaag ggcggggaat gtttcagtac actcgcctcc cgttcggctt gaccaatagt 3960 ccagcaacct tatctcggtt gatggataaa atcctcggag ctggagagct agaaccctgg 4020 gtgttcgtct atttagatga catcatagtg gctagcgact gtttcgagga acatatccga 4080 cttctcgagg aagttgcaca ccgactaagg aaagcaaacc tatcgataaa tatcgataag 4140 tcgaaatttt gtaggtctga agtaccattt ctgggatatt tgctgtcatc tgacggactc 4200 cgacccgatc cttccaaagt ccaaggaatt ttggatttcg aaccccccaa gaccattcgc 4260 caggtgcgaa ggtttcttgg gatggtaaat tattaccggc gtttcattcc cgactttagt 4320 gccatcacca cgccgatctc ggacatccta gctggcaaac caaagatcgt gcgatggtct 4380 aaagaagcgg acgaagcatt ccggctaatc aaagagcgtc tcattaccgc tcccatactc 4440 tcaaacccgg attttgaaga agagtttatt attcagacag acgcgagtga tcgcgccgtg 4500 gcaggagtgc taactcaaat ccaggacggg tcagagaagg ttataagctt cgtgtcccag 4560 aaattgaact cagctcaaca aaattattcg gccactgaaa aagaagcgtt ggctgtttta 4620 atggcggtag aaaagttccg tggttatgtt gaaggtagcc actttacatt ggtgaccgat 4680 gcttccgctc ttcagtacat acgtaacaat aaatggcgcc catcgtccag actcagccgc 4740 tggagtttag acttgcagca tctggacatg agtattatac accgaaaagg aactgagaat 4800 atagtgcccg acgctctttc gcgaagcatt tgcgcgataa aggcttccag ggcagggaca 4860 tcatctttcg aagaattggt gcgaaaggta gagaaagagc cagcggattt tccagacttc 4920 cggctagaag acggacagtt gtggaaatac atcccggtag atgacgagcc tttcgacgtg 4980 aggttcgaat ggaagatggt gccaccacca gaaaatcgtc cacggataat acaagaagaa 5040 catgaacagt cgttccatct cggaatcgag aagacggtat ccaggttgca gctgcggtac 5100 tactggccac acatggccac agaaacccgt aggtggattc agagatgcac cgtctgcaaa 5160 gaaagtaagc cagccaccgt ccctacaatc ccggttatgg gtaagcaaaa gttagccgat 5220 catccttggc aaatcgtcgc catggattac ataggacctt tgccgaaaag taaggcggga 5280 tatatacata tcctggtggt gcaagacctt ttcagcaaat ggtgtcagct ccatcccgtc 5340 cgcaggatcg aagcggggaa cctctgcaaa acgatccgag aatgttggtt tttgaaaaac 5400 tccataccgg agatagtcct gacggataat gcaacgacct tcctttcgaa ggaattcaaa 5460 gcattgctgc aacagtacaa tgtcaagcac tggacaacct ctcgacatca cagtcaggga 5520 aacccagtcg aacgcctaaa taggtctata aacgctgcaa tacggacgta ctgccaaaaa 5580 gatcaacgag gctgggacac caagattgct gatattgagc acgttttcaa caatacggtg 5640 cactctgcca caggctttac gccgttcttc gtgacccgta atcacgagat cgcgataagt 5700 ggagaggacc acctgcgaat gcgacggaag gaggagtatg aaccggaaga acggactaac 5760 cagcagaaaa agattagcgg cgaaatatac gacctggtat cgaaaaacct gttgaaagca 5820 tacacgtcta gcgctcaacg ctataatctt cgaaagcgtg cgcgccccga ggatctcaaa 5880 gtgggtcaga atgtataccg ccgtaacttc aagcagtcca acgcaggaga atactataac 5940 gctaagttag cgccaatgta tttaccctgt cgtgtggtag ctaaacacgg gtccagctcc 6000 tacgaactcg aagactccga tgggaaaaac ttgggcgttt ggcccgctga gcacattaag 6060 ccctagtttc ctatcagtgc gaaccagcgt tcgtttcttc ctataccttc ctatctgttt 6120 cctatcctga agatgaacat cagcgatgga aaaccatcaa aatctcactt tcaaagccac 6180 atgagacagc atttggctct ctcgcgttcc cagagagagg attgtatgct gtgagtggtc 6240 aagatctcct aatgtgaatg tccaatgttc tctctcagtg ggagtagggt ggtaaggatt 6300 agcttaggag attttctcat tctgaaaacc tcagaacatc ttttaggccg gccgaatatt 6360 tcataagttt tatcaaggaa atgcggattc cttctcaact taaaacagat cgctgcgaaa 6420 actaatgtgt gtcttctaaa aactgaagat ggccaaaatt agggaattct ctagcgttca 6480 aacaaagtgc gagtatgatg taccagcgtg agaaaatcat tgaatgggac agatttgaaa 6540 ggaaaagggg aggttgattc gtcgttcata aatacttccc aatgacagtc aaggatgtga 6600 ccgcgactga tcagtgacgg ttagttggga cgaagctaat tagtcaacca gcgttactct 6660 cgccagttag cggagtacta cgggtgttcg aggagtcgat gactttattt tggattcact 6720 gtgatggcgt gccagatcac agggatcccc caattgcgcg tacccgagtc gaacacctta 6780 taaatccttt tttttttctc tctattgaat ggctttttga tgtatataca tattcataca 6840 ttttttgtaa atatccaatt cacagttttg tttgtttttc taattcgttg agttcgtttc 6900 tctattattt aatttttctt ttttttgttt ataccgttgt acaaatttgt tgtttcttta 6960 gtaattataa ttggtcattt tgtttatatt aaaaaaaaat aacttcatta tttttttttt 7020 attctacctc gggggaaa 7038 // ID Gypsy-202_AA-I repbase; DNA; INV; 4498 BP. XX AC AAGE02024456; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-202_AA_; KW Gypsy-202_AA-LTR; Gypsy-202_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4498 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024456; Positions 8716 4219. XX CC Positions [3449-3826] - Integrase core CC 'GACAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1040..3445,3449..4393) FT /product="Gypsy-202_AA-I_1p" FT /translation="MSVSTTIEPYLPNSIPFAQYIEQLEYVFLNNNVPEER FT YKTSFLAVCGVTVFSEIKKLFPGQDVKTLSYQQITEQLKKRFDKCDSEVIN FT SYKFWSRRQGRNEKSEDFVIAVKVLAEQCGFGTFKDRAIRDLLVIGVYNRD FT IQKRLCDEDDLSAARAEKLILNHEISNFRTSVLKDDDDKNTSIVARLGRKE FT VRSRSKQRYRGRSKSGNRSVSFSPRHKQIRYGDRRYDSEKPTYSCSFCKKT FT GHTRKFCYKLHGRGQRSADVKFLGSPKKSTSATTSGSGNFKRPVYTDDEDD FT EDLPCMMISSINRINEACYVEVSVEKSVLTMEIDCGSAETVISEELFLRNF FT KHIKLSPCNKKLAVIDGKRLKVLGKISVPVHLKGRRQQLCMIVLRCENDFM FT PLMGRTWLDCFYEGWRNIFSNSGIQNENIHKIDEEHVVDELKSKFSSVFDG FT DFSTPIVEYEGDLVLKDDKGLFRKAYGVPLRLRDKVIEHLDSLERDGVITP FT IETSEWASPVVIVMKKDQSIRLVIDCKVSINKVIIPNTYPLPVAQDLFASL FT SVVCAVKKFHKFLFGMKFVIYTDHKPLIGIFGKEGKNQISVTRLQRYVIEL FT SIYDYEIIYRPSSKMANADFCSRFPLPQPVPKVLDREFVKSLNFTDQFPLD FT CKEISKETTRDDFLVAIMTYLRQGWPDRLDRRFRDVYSHHQELEEVEGCVL FT FQDRVIIPDVLKPQVLKMLHRNHNGISKMKQLARRTVYWFGMNKDVEEYVK FT CCRACNQMTPVSKPAAYSSWIPTKQRFSRVHADFFHLDHKTFLVIVDSCTK FT IELEYMKSSTDSNNVIKVFLNVFARYGLPDVLVTDGGPPFNSMNFVDFFEN FT QGVKVMKSPPYHPESNGQAERMVRLVKDVFKKFLIDPEMKKLETDLQVSYF FT LLNYRNTCQDDDGRFPSVRLLSYKPKTLLDLINPKSNFKHNLIIRHDDYSP FT TVSTSTDNFNDQFVNLKCGDLIYYKNANKSDIRRWLPATYLKRISSNVFQI FT SLGGRMITAHRRQIKISDVPKSTTGSRFVIRRENVQSNSRKRRREVDVEDD FT ERISDQEPDFYGFSAESFIFREETPIGDQIDRQEINLASNQTRKSSRNAKK FT KRKGDFVYY" XX SQ Sequence 4498 BP; 1380 A; 748 C; 1002 G; 1368 T; 0 other; gctggcgacg aggagtaagt acggtaaaaa aaaagtgatt tttgtgacag tgaatcgttt 60 tagtgataaa agtgtttcgc cctacattcc cagagtcaag taagtttttt tccctgggaa 120 gagctaggag attttcggcg gacaaagcgt cattttgaag tgcatttttt atattgtttg 180 tagtcctttt gttctttgct ttttgtttga ctttcaaaga ggtgcgaaat tacaaccgat 240 aaaacaaaag tgatagtgaa agagttttta attgtgtcaa aaagcattgt tttgtgtttt 300 caacgactat caatcattta aacaaacgaa ttgtggatat tttttaagag aaattgaatt 360 ttccagcagc agtttttgct tgttttatct tatttttaat tttcaccatt ttgttttttt 420 tttatctgtg tagctattcc gtctgacggg tcgttgcgag agtgagttcg tcgatttcac 480 tgatcattta cgacggacga agttcactca ctaatccgtt gctacatgtg tgagtggtag 540 gaaaacaaat cgagcagttg agttaccagt cgtaccgaac aaaccaccat acacactact 600 acatagtagc aggcaacgac tcgggtagcg aggcagacag cgcggtgcag cattttgtac 660 gtgaacgaga aaacggtgca tcctcaggtg cattgtttca tttgttttcg ggcaacacaa 720 agaaaaggat cagctgtttc aaaaacatcg ttttgtgcga cgtctgtgat taagttccgt 780 tggtgcctgc tagctgctgc tgttggaggc tgtttaatac ttggtgtaat accacaagcc 840 ggtgtttgcg gcggtttttt tctcctgcat gctgtttgta tccgccacaa tattctccca 900 aggtgaagta cttcggtgtc aaggtacgtg actttgtttt tgttcaatat tttggaagtt 960 tttatattga tattttattg cgaacgcgca tgattgtagt ttgatatgtg cttttgtttt 1020 gttttatttt caataaagaa tgtcggtgtc aacaaccatt gagccatatt tgccaaattc 1080 aattccgttt gcgcaataca ttgagcagct tgaatatgta tttcttaata acaatgtgcc 1140 tgaagaacga tacaagacat cctttttagc cgtgtgtggt gtaacggttt tttctgaaat 1200 caaaaaactt tttcccggtc aagacgttaa aaccctatct tatcagcaga ttactgagca 1260 actaaaaaag aggttcgaca aatgtgattc ggaggtgata aatagttaca agttttggag 1320 tagaagacag ggaagaaatg agaagtctga ggattttgtg atcgcggtaa aggttctagc 1380 cgaacaatgt gggtttggta cctttaaaga tagggcaatc cgagatcttc tagtgatagg 1440 agtatacaac cgagatattc agaagcgctt gtgtgatgag gatgatttgt ctgctgcaag 1500 agcggagaag ctcatactga atcatgagat ttccaatttt agaacaagtg ttttgaaaga 1560 tgacgatgat aaaaatactt ccatagttgc tcgtttagga agaaaggagg tacgttctcg 1620 ctcgaaacaa agatatcgtg gtaggagtaa gagcggtaat cgcagtgttt cattttcgcc 1680 aaggcataag caaataagat atggggaccg gagatatgat tcagaaaaac cgacatattc 1740 ctgctctttt tgcaaaaaga cgggccacac acggaagttt tgttataagc tgcatggtag 1800 agggcagcgt tcagcagatg tcaagttcct aggatcgccg aagaaatcca ctagtgctac 1860 cacaagtgga agtggaaatt ttaaaaggcc cgtttacact gatgacgaag acgatgagga 1920 tttgccttgc atgatgattt cctcgatcaa tcgcataaat gaagcttgtt atgtggaggt 1980 ttcggttgag aaaagtgttt tgacgatgga aattgactgc ggctcagcag agaccgttat 2040 ttccgaagaa ctgtttctaa ggaactttaa acacatcaaa ctctcaccat gcaacaagaa 2100 actggcagta attgatggca aaagattgaa agttttgggc aaaataagcg taccagttca 2160 tctgaaaggt cgtaggcagc aactgtgcat gatagtgctt aggtgtgaaa acgatttcat 2220 gccattgatg ggtcgcacct ggttggattg cttttatgaa ggatggcgaa acattttttc 2280 gaactcagga attcagaacg aaaacattca caagattgat gaggaacatg tggttgatga 2340 gttgaaaagt aagttttcat cagtttttga tggggatttt tcaactccaa tagttgagta 2400 tgaaggagat cttgttttga aagacgataa aggccttttt agaaaagcat atggagtacc 2460 gcttagattg cgggacaaag tcattgagca tttagattct ttggagagag atggggttat 2520 tacaccgata gaaaccagtg aatgggcttc accggtagta atagtgatga aaaaggatca 2580 aagcatccgt ttagtgatcg attgcaaagt ttcaataaat aaagtgataa ttcctaacac 2640 ttacccatta cctgttgcac aggacctttt tgcctctcta tcggttgttt gtgcagttaa 2700 gaaatttcat aaatttctgt ttggaatgaa gtttgtaatt tacactgatc acaaaccttt 2760 aattggaata tttggtaagg aagggaagaa tcaaatttcg gtaacaaggc tgcaaagata 2820 tgtcattgaa ctttctattt acgattatga aatcatatac cggccttctt ccaaaatggc 2880 aaatgcggat ttttgctctc gttttcccct ccctcagcct gttcctaaag ttctagatcg 2940 tgagtttgtg aagagtttaa attttaccga ccaatttcct ctggattgta aagaaatatc 3000 aaaagaaaca actcgagatg attttttggt ggcaatcatg acttatctca gacaagggtg 3060 gccggatcga cttgacagaa gattcagaga tgtttattca caccaccagg aacttgaaga 3120 agtagaagga tgtgtgttgt ttcaggatcg agtgattatt ccagacgttc taaaaccaca 3180 ggttttgaag atgctgcata gaaaccataa cggtataagc aaaatgaaac aattggctcg 3240 aagaacggtg tactggttcg gtatgaacaa ggatgtggaa gaatacgtca aatgctgtag 3300 agcatgcaat cagatgacac ccgtttcaaa accagcagcc tattcttcat ggattccaac 3360 taaacagcgg ttcagcagag tacacgcaga cttctttcac ttggatcaca aaacattcct 3420 ggttatagtg gacagttgca cgaagtagat cgagctggaa tacatgaaat ctagcacgga 3480 cagtaataat gttatcaagg tttttctcaa tgtatttgcc agatatggtt taccggatgt 3540 tctggtaacg gacggagggc caccgttcaa ctcaatgaat ttcgttgatt tttttgaaaa 3600 tcagggggtg aaagtaatga aaagcccgcc ttatcacccc gaaagcaacg ggcaggccga 3660 acgtatggtt cgattagtaa aggatgtttt taaaaaattt ctcattgatc cagaaatgaa 3720 aaaacttgaa acggatttac aggtttccta cttcctttta aattatagaa atacctgtca 3780 agatgatgat ggacgatttc catctgtaag actcctttca tataaaccca agacgctatt 3840 ggatttgatt aatcccaaat ctaattttaa acacaattta ataataaggc atgatgacta 3900 ttcacctact gtctctacaa gtaccgacaa tttcaacgac cagttcgtta atcttaagtg 3960 cggtgacctg atatattaca aaaatgctaa caaaagtgac attaggcgat ggttacctgc 4020 aacgtatctt aaacgaattt cttcgaatgt ttttcagata tcacttggag gcaggatgat 4080 tacggcacac agacgacaga ttaaaatttc cgacgttcct aaaagtacaa ctggctcacg 4140 ttttgtgatt cgcagagaga atgtgcagag taattctaga aagagaagga gagaggttga 4200 cgtcgaagat gacgaaagaa tttcagatca ggagccggat ttttatggct tctcagcgga 4260 atcgtttatt tttagagaag aaactccgat cggagatcaa atagataggc aagagattaa 4320 tttagcgtcg aaccaaacaa gaaaatctag tagaaatgct aagaaaaaac gtaaaggaga 4380 ttttgtatac tattgatttt tattctgaat tttaaatcgt tcaatattat agtcaaagtt 4440 ttgatttcga attctattag attttgaatt agttttcatt cttaatggaa ggaggagt 4498 // ID CR1-59_AAe repbase; DNA; INV; 4935 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-59_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4935 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1146-1146 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 21 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 376..1206 FT /product="CR1-59_AAe_1p" FT /translation="MATQCGTCMEPITGMDVVVCRGYCGGSFHMNGCTSVT FT RAMQSYFTTHKKNLFWMCDRCAELFENSHFRTISCNADEKSPLSMLTSAIT FT ELRTEIKHLNSKPTVSFPPATNVDWPAIDQRRSAKRRRETESVVRASGNCR FT TGSKKTQDDVISVPIFNEEVDEKFWLYLSKIRPDVTVEAVSAMIKANLDLT FT SDPTVVKLVPRDKDISTLSFVSFKVGLDPSLKNKALDPETWPQGLLFREFE FT FFGAPKFRKLLPKKLATPLLQPQASSSPITPIMDLS" FT CDS 1092..4856 FT /product="CR1-59_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="VRVFRSPKISEITAQKTSYTVITTASFIITNHAHHGP FT ELNFDNPGCNNSSILEAPSSPSTVLPIQPALISRPGPVSGCGDGVFHNVAT FT GEYSSFKHNRMTVTSEGSIHSHVRRLDSCRRNISSSSNIESSVRNSSPGCM FT PASLTEAPNPLVTVEPFLPATSSHPGPVYEKGEGVFQPVNAGKYDCIKNTS FT LPSTFITFSKGKQKSSCSNQHQFSAHRHSSSSTDRLLSRHSPLGCTPASFM FT EAPEPIVTVEPILPATNSHPGPVFDTGEGVFQPVDAGKYESNSSNHLPSSS FT ITFSRDNANDIWIYYQNVRGLRTKIDSLLLTATDCNFDVIMMTETGLDNCI FT TSQQLFGAGFNVFRCDRSPANSSKSRFGGVLIAVAQRYSSRVVHTESGQNL FT EQICVSAMIKGRKIMMCVIYVPPDKSNDVNVINGHVSAMEELCAKCTTGDA FT VLICGDYNQPRMRWCFVDSVIQCDGSQLPLSSSTLLDGMNFLCLGQKNLVR FT NSLDRILDLVFCLSDREAEVNSCSMPLLPVDSHHPPLEISLPSRMVHDDAQ FT MRTGNERPLNFRMIDFDALLAYLSTIDWNVIVTTVDVDEMAESFCSILNDW FT FASNVPRSRPAVSPPWSTSRLRELKRVRNACQRKVRRHRTSFNKRNFHQAS FT NAYRLLNVTLYKSYVLRVQSSLRRNPRDFWRFVNSKRKDSVIPANVYLNEA FT TAASAIESSKLFARHFESIFVASSATNLEIEEAIRDVPCDLADIRTFMITP FT EMVSQAAKKIKKSFVPGPDGLPAVVICRCIAALAQPLSDIFNRSFQQAKFP FT RIWKQSYMTPVFKRGDRHNVSNYRGITSLSAVSKLFETIVCGVMLDATKCY FT ISKDQHGFMPGRSVTTNMLNFTSKCIMSMENKAQVDVIYTDLKAAFDKIDH FT MILLRKLSRLGTSNQLVSWLESYLTERELRVKIDSCVSPPFTNKSGVPQGS FT NLGPLLFVLFFNDAASALGDGYKLVYADDMKLYMVVRTEDDCLCLQHSLNL FT FADWCRRSKLVLSIEKCQVITFHRIVNPILFDYRIEGNTLIRVSSVNDLGI FT QLDSRLTFDLHRSSIITKATRQLGFISKVARDFSDPHCWKSLYCALVRPIL FT ENVSVVWNPYQVTWSLRIERIQKRFVRSALRNLPWRDPANLPPYPDRCRLL FT GLDTLEHRRKIQQSLLVAKVLTGEIDSPELLSLINFRVPNRSLRNTTLLQP FT IFHRTAFGYNEPITTCIREFTAVEDLFNFDEPTERFANRIRSVVT" XX SQ Sequence 4935 BP; 1319 A; 1129 C; 1084 G; 1403 T; 0 other; ttggcatcac tgcttttgat gtatgttatg caaactgctt acggtcgtga ttttttctcg 60 tttttgtgtg ttttcataat cgtaaaatca tcgttttgcg ttctcgtgtt aaaagattga 120 cattgtgatc gtttcgatac ataagttgat gtttagtgtc gtgtcgtagt gaaataatat 180 caattagcta gagtgaattg tttttctcta ccacgtcacc cgctacaaaa gcctgctacg 240 tcgttttatt ttttcgactg aagcgacctc tgctggaata aatcagaatc gctacttgca 300 tgcaaataat ctcctgtgct tttatcaacg taagccttgg acgcaaactt ggaggctaca 360 ttcaggcgct acatcatggc aacacagtgt ggaacatgca tggaaccgat cactggtatg 420 gatgttgtgg tatgtcgtgg ctattgcggt ggctctttcc atatgaatgg atgtacaagc 480 gtcactcgag ccatgcagtc gtattttaca acccataaaa aaaatctgtt ttggatgtgt 540 gatagatgtg ctgagttgtt tgaaaattct cattttcgca caatctcgtg caatgctgac 600 gaaaaatcgc cccttagcat gctcacgtct gcgatcaccg agttgcggac tgaaataaaa 660 cacttgaatt cgaagcccac agtatcgttt cccccagcaa ctaatgttga ttggcctgcg 720 atcgatcagc gtagaagcgc taaacgccgt agagagactg aatcagtcgt acgcgccagt 780 ggaaactgcc gcactggtag caaaaaaacg caggatgatg ttatatctgt tcccattttc 840 aatgaggaag tggatgaaaa attctggctt tacctatcga aaattcgtcc agatgtgact 900 gtcgaagccg tatcagccat gataaaagcc aacctcgatt taactagcga tcctactgtg 960 gtgaaactcg taccgagaga caaggatatc agcacattat cctttgtttc tttcaaagta 1020 ggactcgacc cgtctcttaa aaataaagct ttggatcctg aaacatggcc acaaggacta 1080 ctatttcgtg agttcgagtt tttcggagcc ccaaaatttc ggaaattact gcccaaaaaa 1140 ctagctacac cgttattaca accgcaagct tcatcatcac caatcacgcc catcatggac 1200 ctgagctaaa tttcgacaat ccgggatgca ataattcaag tattttggaa gccccttcgt 1260 cccccagcac agtcctgcca atccagcctg cgctcatcag tcgtcccggc cctgtgtctg 1320 ggtgtggtga tggggtcttc cacaatgttg caacaggcga gtactcttca ttcaaacaca 1380 atcgtatgac agttacttca gaaggttcta tacattcaca tgtgcgtcgt ttggactctt 1440 gtcgccgcaa tatctcttcg tcttcaaaca tcgaatcatc tgtgcgtaat tcatcaccgg 1500 gatgcatgcc tgctagcctt acggaagctc ccaatccact cgtcacagtc gagccattcc 1560 tgccagcgac cagcagtcat cccggtcctg tgtatgagaa gggagagggg gtcttccaac 1620 ccgttaatgc aggcaagtac gattgtatta agaacacatc actcccgtca acattcatca 1680 ctttcagcaa aggaaaacag aaaagttctt gttcgaatca acatcagttc tctgctcacc 1740 gtcattcatc atcttcaacc gatcgattac tatcacgtca ttcaccactg ggatgcacgc 1800 ctgctagctt catggaagcc cccgagccga tcgtcacagt cgagccaatc ctgccagcga 1860 ccaacagtca tcccggtcct gtgttcgata ctggtgaagg ggtcttccaa cccgtagatg 1920 caggcaagta cgaatccaat tcgagcaatc acctcccctc atcgtccatc actttcagtc 1980 gtgacaatgc caacgatatc tggatctact accaaaatgt gcgtgggctg aggacgaaaa 2040 ttgacagcct gttactaaca gccaccgatt gcaactttga cgttatcatg atgaccgaaa 2100 ctggactgga caactgcatc acctcacaac agctatttgg tgcaggcttc aatgtttttc 2160 ggtgtgatcg aagccctgcc aatagtagca agtcccgctt tggtggtgtt ttgattgctg 2220 tagcacaacg atatagcagc cgtgttgttc acactgaaag tggacaaaat ttggagcaaa 2280 tatgtgtatc tgcgatgatc aaaggtagga agattatgat gtgcgtaata tatgttccac 2340 cggacaaaag caatgacgta aacgtgatca acggacatgt ttccgcaatg gaggaactct 2400 gcgcgaaatg cacgactggt gatgctgtcc tcatatgtgg tgattataac caaccacgta 2460 tgcgctggtg tttcgttgat agcgtcattc aatgcgacgg ttctcaactc ccattgtcca 2520 gtagcactct tctggatggt atgaattttc tgtgccttgg tcaaaagaat ctcgtccgga 2580 attcacttga tcgcattctt gatttggttt tttgcctgtc ggatcgtgaa gcggaagtaa 2640 attcttgttc catgccattg cttcccgttg attctcacca ccccccgctg gagatctctc 2700 taccatcccg catggtgcat gacgatgcgc agatgcgcac tggaaatgaa cgaccgctta 2760 acttccgtat gattgatttc gatgctcttt tggcatattt gtcaacaatt gattggaatg 2820 tcattgtaac caccgttgat gttgatgaaa tggcagaatc tttttgttcc attctgaatg 2880 actggtttgc ctcgaatgtc cctcgaagtc ggccagcagt ttcacctccg tggagcacga 2940 gtcgtttgag ggagctaaaa cgtgtacgca atgcctgtca acggaaagtg cgtcgtcatc 3000 gaacatcatt caataagcgg aatttccatc aagccagtaa tgcgtaccga ctcttgaatg 3060 tcactcttta caagtcttac gttttgcgcg tacaatcgag tttaagaagg aatccacgtg 3120 atttttggcg cttcgtgaat tctaaacgta aagattcagt gataccagcc aatgtatatt 3180 tgaatgaagc gacggctgct tctgcgatag aatcaagcaa gttgtttgca aggcactttg 3240 agtcaatttt cgtggcaagt tcagcaacaa atttggaaat agaagaagcc atacgtgatg 3300 ttccttgtga tttagctgac ataagaacgt tcatgataac tccagaaatg gtttcacagg 3360 ctgcaaaaaa gataaagaag tcgtttgttc cgggtcctga tggtttacct gcagttgtta 3420 tttgccgctg cattgcagct ttagcacaac cgttaagtga tatcttcaat cgttcattcc 3480 aacaagcaaa atttcctcgt atctggaaac aatcgtatat gacgcccgtt ttcaagcgtg 3540 gtgaccgtca taacgttagc aactaccgtg gtattacgag cctatctgct gtctcgaaac 3600 tgttcgaaac catagtatgt ggagttatgc ttgatgcaac caaatgctac atatccaagg 3660 atcaacacgg attcatgccc ggccgttcag ttacaacaaa tatgctgaac ttcacgtcaa 3720 agtgtattat gagtatggag aataaagcgc aagtcgatgt aatttacacc gatttgaagg 3780 ctgcgttcga taaaattgac cacatgatac tcctgcgcaa actatctcgg cttggaactt 3840 ctaatcagct cgtctcttgg ctggagtcgt acctcacgga acgggaattg cgggtgaaga 3900 ttgatagctg cgtatcacca ccgttcacga acaaatccgg cgttccgcaa ggtagcaacc 3960 ttgggccgtt gctgttcgta ctgtttttta acgatgctgc ttcagctctg ggcgatggat 4020 acaagcttgt ttacgctgac gatatgaaat tgtatatggt agtacgaaca gaagacgatt 4080 gtttgtgcct gcagcattca ctgaatctgt ttgctgactg gtgccgtaga agtaaactgg 4140 tactaagcat cgagaaatgt caggtgatta cttttcaccg tattgtgaat ccaatcttat 4200 tcgactaccg tattgagggc aatactctta tcagagtttc cagcgtgaac gatctgggga 4260 ttcagctaga ctcaagattg acattcgact tgcaccgttc atcgatcata acgaaggcaa 4320 cacgtcagtt aggattcatc tccaaagtgg cgagagattt ttcggatcct cattgctgga 4380 aatcgctata ctgtgctctt gttcgtccaa ttcttgaaaa tgtgtccgtt gtatggaatc 4440 cgtatcaagt aacgtggagt ctgaggattg aaagaattca aaaacgattc gttcgctctg 4500 ccttgaggaa cctgccgtgg agagatcctg ccaacttacc accatatcca gataggtgta 4560 ggctgcttgg acttgatacc ctggaacatc gacggaaaat ccagcaatcg ctgcttgtgg 4620 ccaaagtgtt gaccggtgaa attgatagcc cagaactgtt gtcgctgatc aactttcgag 4680 taccgaaccg atccctccgg aacacgacgt tgctgcagcc aatatttcat aggaccgcgt 4740 tcgggtataa tgagcctata actacctgta ttcgtgagtt tacggctgtt gaagatctgt 4800 tcaattttga cgaaccaact gaacgatttg caaatcgaat taggtctgta gtaacttagt 4860 atagtttttt attcattaag acgaaatatt gtcagatgaa ttattttaaa tacaaataca 4920 aatacaaata caaat 4935 // ID PFRP4 repbase; DNA; INV; 2446 BP. XX AC L11892; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Plasmodium falciparum interspersed repetitive element. XX KW Transposable Element; PFRP4; Repetitive sequence; KW Interspersed repeat. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RP 1-2446 RA Dolan A.S., Herrfeldt A.J. and Wellems E.T.; RT "Restriction polymorphisms and fingerprint patterns from an RT interspersed repetitive element of Plasmodium falciparum DNA."; RL Mol. Biochem. Parasitol 61(1), 137-142 (1993). XX DR GenBank; L11892; Positions 1 2446. XX SQ Sequence 2446 BP; 677 A; 481 C; 304 G; 984 T; 0 other; ctttactata atttgttgtt tctttactac aacacacaca tataatttta ctactttctc 60 caatttaatt accacttccg cgtttacttc aacatattac attgttttac caccttgtgt 120 tcaacctcaa cagtgtgttg cactaaaggt tcaaggttct ccccccgggc tgtttcttta 180 gcttaggggg gtgaacattg tgatgtgtta gtgtatatgt gtgtgtgtgt aaggggtact 240 accactagtt ttgtagttgt tggatgtgtt tttagttgtg ttgatgttgt ttgttgggtt 300 agttgtaccc atttgtggtc atagtttacc aatttttaaa aatttttttt tttttttttt 360 tccgactttt aactatcaca ttcatgtctc cttgatgtgc ttttaatgat gatatggttt 420 tttttaattt ttatgaaaat tttacaaaat cgatattttc acctacctgc accctcatgc 480 atggatgatg ttgtagatga tgtcgttttt gttatactat gttgtgtgca tgatatatat 540 gtgatgtgtt gacgttgtgg atgtgttttt tatctacatt atgatttact tttatgcttt 600 tgtttgcatt gtgtgtttac ttatatgcat gggggtgcat ttttttactt acatgatcat 660 ttttactatc atatgacttc attgagtaaa tttacttctc tttttcttct cttcttggtt 720 ggactaaatg gttccaaata tcaaagaaac tgagaaagat gtgtgcaaag tgcagcatta 780 aattcttttt ttacaattta taacatgcac ctctttgcat ttaacaatca tgaacttact 840 aactgccaat aattactact tcctaacaac atcttcctct ttttcatgaa ctcataatta 900 tcatgtccaa aatagttaac ccatattcta tttcatgtct cctttgctct ctttttatca 960 aaacaacctc attttttgct catctatggt cacttacgaa ctaatgaaca cgtccaactt 1020 caattacgaa acgtcgactt caatactgac aactccaaat aggtctatct catttttcat 1080 ttttcatcaa aatttctatt ccttaattac ccccaaaaca cactcatttt tacctaatca 1140 ttccattatg gtgcatggta gtgcatttcg tacttacttt gacactacta ttattaacat 1200 atttactcat acaacacata atcatcctct gtttattcat tgaaatgcat tgttttcatt 1260 ttggctaaat tttcccttgc actttccttg cacattgcat gcacttcctc gcatctaatt 1320 actattactt tgcacacaca ttttacacaa atctgatttt cgacactact atgtaggaaa 1380 atggtccaac attccccaaa tattactact ttcccatagt aaaaatttag ccatttttga 1440 aattttccat aggggggggt atttctcttt cggccaattt ttcctaaaaa ttttggacca 1500 ttttttcaac taattgtcat gactacttca ttcaaaagct aatactgctg catttaagtg 1560 cttcacattt tcattattat gattttttag tcatcccaat tttggccttc attttcgacc 1620 aaaagacgtc tttattattt gcacacagaa ttacatacaa tacatccgaa catgtcatta 1680 caaaacttcg gttccatatt ttgtcttaca actcattact ctcacccaca actaatattt 1740 ctttgtcaaa aaagtacttc ttttataggg ggaaattgaa cgtgaaattc gggattttcg 1800 gtcatttttt tacaacgtca tcaaaatatg gtacaccaat ttcgagtgtt ccattatata 1860 gtccatgact tactatacat acaaaatggt agaacatact tcctactacg tttatatttt 1920 gtgtgcactt caattttcct tcatagtcca ttttccccta gcactttgct tgcacaattg 1980 acatgcactt caaagcattc aattactata attttgcaca cacattttta acatgaaaat 2040 atgaattctg ggtcacttac tatatggtca aatactccaa ttttctctta acattactac 2100 tttcccatag taaaaattta cccatttttg aaatttttca tagggggggg tatatcaatt 2160 tttgtcaatt tttcataaaa tttttggtcc attttttcac ttaattgcca caactacttc 2220 aatcaaaagc taatactgct gcatttaagt gcttgatatt tctataatta ccattttttg 2280 gtcacccaaa ttttggcctt gattttcgac caaaacatct ctttactatt ctcacacaaa 2340 attacttaca acacttccga acatgtcatt acaaaatttt cgatccatat tttatcatac 2400 aacttattac tatcacccac aactaatatt tcttggtcaa aatggg 2446 // ID Copia-130_AA-I repbase; DNA; INV; 5044 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-130_AA_; KW Copia-130_AA-LTR; Ty1_copia_Ele193; Copia-130_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5044 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2579-2977] - Integrase core CC 'GAAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1666..2610 FT /product="Copia-130_AA-I_1p" FT /translation="MEEQALKIGGRGKQQNTAECYFCGIQGHRKKDCRAYL FT AEQNSNEKSKKRSYKEKAKVVHEDDQNKAFTFMVRNRRQASKSRIVDSGAT FT SHLANDHKSFQRLDSSARPEITGADGNSIRTAGIGDCVIQCLDSKGKEVKI FT TLTEVIYAPDVEGNLLSITKLTEKGVRAMFEANRCSMWINGEEIATADRVS FT GLYRLNTVEESRSLMVANRQHNKDCQHLWHRRLGHRDPDVLGEIERKNLVS FT GMQVKNCGIQLTCECCIEGKMARPTFAKEAVKTSKEILDIVHSDVSGPMTK FT TQGGCRYYLTMIDDHSRCTWYTF" FT CDS 2579..5044 FT /product="Copia-130_AA-I_2p" FT /translation="MITAVARGILFEKKSEVEGRIREYVRFVETQFGRKPK FT AIRSDRGGEYTSHSLRKFYVSEGIKAEYTAGYAPQQNGVAERKNRSLNEMG FT LCMLLDAGLPRKFWAEAVNTAVYLQNRLPTKATDITPFEVWFGRKPDLSHL FT RVFGCGVYAWKPQQKRKKFESKAVKLTFVGYATDSKAYRLLNTKTGEIVIS FT RDVRFLEVDELAKKIPDGSMEKPAIDEAEETEVPLQLTDLENVAPEVAEHR FT DEFEDPEVEDDEEAYELAEQTLSDGDESQLNDDTMKQAVVRRSTRPNIGVP FT PDRFQEVVSLAVDLAEPKTFAEAVSGAEQVQWKAAMKEEMTSHRENGTWEI FT TELPPGRTAIGCKWVYKRKQDETGQVTRYKARLVAQGFSQKFGTDYDLVFA FT PVIKQATFRTVLTLASRDRMLVRHVDIKTAYLYGELQEEIYMKQPPGFMSS FT NVNDVCRLRRSLYGLKQAARVWNHRIDGVFKKLGFHSSTADPCLYVRRTKN FT GFLYLLIYVDDIVVVVRTESEYRALVAALKEEFKITELGDLRFFLGLQVRK FT KNGRYSLNQRSYIIKVLSRFRMENAKTSRIPMMTGFLQQKEEELMQNQGEF FT QNLIGCLLYIAVNTRPDVAIATSILGRRVSNPTTADWNEAKRILRYLKGTA FT DHELHLGGGEPCEMECYVDADWAGEIGERKSNTGYIFKLGGGLVNWRCSKQ FT TSVSLSSTEAEYIALAECLQELQWLRKLMDDLGESMKLPVRVNEDNQSCIA FT LAVADRTTRKSKHIDTKYCYVKDLVKNGIVTIQYCPTDKMEADLLTKPLGA FT VKLKQLREAIGVRLANVEEE" XX SQ Sequence 5044 BP; 1514 A; 992 C; 1416 G; 1121 T; 1 other; ggttatgggc ccagataaaa accactaggc ggatttgaag tccagtagga gtgattattt 60 ggaatccaaa gtcaagaagc gtagcagttg atatcctgtg atggagtggg gacgagtccc 120 gaagctcaac agttccaact atcagtcgtg gtcgtttgcg gtggaagcgt tgctgagacg 180 ggagcaatta tggaaatttg tcgacccagg gacagcgccg gctgaagtaa ccgacgcgtg 240 gacgaatggt gacgaaaagg ctctgtccaa ggtacctgga attctaggta aagtaaccta 300 agcagtcagt ggaagcgggg tgcgattcac cgttgtcgag gcggaggcga cggcaaaatc 360 gacttggaag gcgttgaaag aatcttcaat cacaktgaga ctgagtcgct cgaagctatc 420 cgtgttggag agagcagtgg aagcttatct tgcatcggat accctcttta ttttactgaa 480 tgaactactt aagactggag agttacggta ttcgagatgc aagaatgctt gaaagttgca 540 cttatttttc gaggacttcc ggaatcttct aaccctttga cgacggcttt ggaagctcgt 600 acaagaagaa gatcgtgacg cttgaatttg tgaagatgaa actcatcgac gaagctcaat 660 ttccaaacgc agtccgaacc gaagggcgga atggaagaac aagcttcgaa gatagggggc 720 agatgtaagc agcaaaaatc tgtggcgtgt tacttccgat gtggaattca agatcatcgg 780 attccataaa aggattgccg ggcctgtcct gcggagcgag agctccaacg aaacacggtc 840 caagaagagg tcctgcggga aaaggacgga agtgagtgca ggggggtgca ccaaaacaag 900 gcgttcacgt ttatggtgcg aaatcgtaga caggcatcga aatcctggat agtggactcc 960 ggcgcgacat cccatcttgc cgaatgatca caagtacgct ccagcgtttg gacagttcag 1020 cgcgtccgga aatcacaggt gcagatggaa actcgattcg tgctgctgga ataggagatt 1080 gcgcgatcca gtgttctcga tagcaaggca tgctagtatt gcatagcata gactgactgt 1140 acagctgtca tatgcgttgc tactccgtga ttgatcggaa ctggcaattc cacgaattgc 1200 actacgaccc aaatagatac cagggctctg tctacaattc aacttacagt ggaagagcag 1260 tgcagattca ccattatcga ggcagaaacg acgccggcaa aatcccgcac ttggaaggcg 1320 ttgaaggaat atcacagtga agactgtcgc ttggacagaa agtgactctt ttgaagcaga 1380 tcaccaatca aaacgggtac cgtgttggga gagagcagct ggaagcccct tatcgggttg 1440 cggccatata ctgaagaact actcaagact ggagcagttc cggtttccag acgtgcaaga 1500 atgcttgaaa cgtcgcactt atcctgtcga ggacttccgg agatctttta accctttgac 1560 gacggctttg gaagctcgta aagaagaaga tctgacgctt gaatttgtga agatgaacca 1620 ctcagtcgga cgaagctgac agagcagtcc aaaccgaagg gcggaatgga agaacaagct 1680 ttgaagatag gaggcagagg taagcagcaa aacactgcgg agtgttactt ctgtggaatt 1740 caaggtcatc ggaagaagga ttgccgggcc tatcttgcgg agcagaattc caacgaaaag 1800 tccaagaaga ggtcctacaa ggaaaaggcg aaagtagtgc atgaagatga ccaaaataag 1860 gcgtttacgt ttatggtgcg aaatcgtaga caggcatcga aatcccggat agtggactcc 1920 ggcgcgacat cccatcttgc gaatgatcac aaatcgttcc agcgtttgga cagttcagcg 1980 cgtccggaaa tcacaggtgc agacggaaac tcgattcgta ctgctggaat aggagattgt 2040 gtgatccagt gtctcgatag caaaggcaaa gaagtgaaga ttacgctgac tgaggtcatc 2100 tacgcccctg acgttgaagg aaacctgttg tcgatcacga agctcacgga aaaaggtgta 2160 cgtgctatgt ttgaagccaa tcgatgcagt atgtggatca atggtgagga aattgcgacg 2220 gcagatcgag ttagtggcct gtaccggctg aatacggtag aagaatcgcg ttccttgatg 2280 gtagcaaatc gtcaacacaa taaggactgt caacacttgt ggcatcgcag acttggccat 2340 agagatccgg atgttcttgg agaaatcgaa cggaagaatt tggtatctgg tatgcaagtt 2400 aagaattgtg gtatacagtt gacgtgtgaa tgctgcatcg aagggaaaat ggcccgaccc 2460 acatttgcaa aggaagctgt gaagacttca aaggagattt tggatatcgt tcacagtgac 2520 gtgagtggcc cgatgacgaa aactcaaggc ggttgcagat attacctcac catgatagat 2580 gatcacagcc gttgcacgtg gtatactttt tgaaaagaag tccgaagttg aaggcagaat 2640 tcgtgagtat gtacgttttg ttgaaactca gttcggaaga aaaccgaaag ccattcgatc 2700 ggatagaggg ggcgagtaca cgagtcacag tttacggaag ttctacgttt cggaaggtat 2760 caaggcagaa tacacggctg ggtatgcacc ccagcaaaac ggcgtggcgg aacgaaaaaa 2820 tagaagcttg aatgagatgg gactgtgcat gctgctcgac gctggactac cgcggaagtt 2880 ttgggcggaa gcagttaata cagcagtata tttgcagaac agacttccga cgaaagcgac 2940 tgacatcaca ccattcgaag tatggtttgg aagaaaaccg gatttaagcc acctgagggt 3000 gttcgggtgc ggagtgtatg cttggaaacc acagcaaaaa cgtaagaagt ttgaatcgaa 3060 ggcagtgaag cttacgtttg ttggatatgc taccgacagc aaagcgtaca gattattgaa 3120 caccaagacg ggagaaatag tcattagccg ggatgtgaga ttcctggagg tggacgaatt 3180 agcgaagaag ataccagacg gttccatgga aaagccagca attgatgaag ctgaagaaac 3240 agaagttcct ctacaattga ccgatttgga aaacgtcgcg ccggaagtag ctgaacatcg 3300 tgatgagttt gaagaccccg aagtcgaaga tgatgaggaa gcctatgagc ttgcggaaca 3360 aacattatcg gatggtgacg agagtcaact aaatgatgac acgatgaaac aagcagtagt 3420 acgacgatca actcggccga acattggagt gccgcctgat aggttccaag aagtggtaag 3480 cctggctgtg gatctagctg aacccaagac gtttgcagaa gcagtatctg gtgctgagca 3540 agttcagtgg aaggctgcaa tgaaagaaga gatgacgtca catcgagaga acggaacatg 3600 ggagattacc gagctaccac ctggtcggac agcgatagga tgtaaatggg tctacaagcg 3660 gaagcaagat gagactggtc aagtgacaag atataaagcg agactagttg ctcaaggatt 3720 ttcgcagaaa ttcggaacag attatgatct agtgttcgct cctgttataa agcaagcaac 3780 atttcgaacc gttttgacgc tagcaagcag ggaccggatg ctggtacgcc atgtagacat 3840 caagacagcc tatctgtatg gagagctgca ggaggagatc tatatgaagc aacctccggg 3900 tttcatgagc agcaatgtga acgacgtgtg caggttacga cgcagtctgt atggactcaa 3960 gcaagctgca agagtatgga accataggat cgacggagtg ttcaagaaat tgggcttcca 4020 tagttcaact gcggatccat gcctctacgt gcggcgaacg aagaatggat tcttgtattt 4080 gctgatctat gtagatgaca ttgtcgttgt tgtacgaacg gaatccgagt atcgagcatt 4140 ggtggcggcg cttaaggagg agtttaagat aacagaattg ggagatttac gttttttcct 4200 cggactgcaa gtgaggaaga aaaatggaag atactcattg aatcaaagaa gctatataat 4260 aaaagttttg tctcgattcc gaatggagaa tgccaagaca tcgagaattc caatgatgac 4320 cggctttcta caacaaaagg aggaggaatt gatgcagaac caaggagagt tccagaacct 4380 gatcggatgt ttgttgtata tcgccgtcaa tactaggcct gatgtggcaa ttgcaacttc 4440 tatccttgga cgacgagttt caaatccaac cacagcggac tggaatgaag caaaaagaat 4500 acttcgatat ctcaaaggca cagcggacca tgaattgcac ttgggtggag gagagccatg 4560 cgagatggag tgctacgtgg acgcagactg ggctggagag atcggtgagc gaaagtcaaa 4620 caccggctac attttcaagc taggaggagg tcttgtcaac tggagatgca gcaaacagac 4680 tagcgtttcg ttatcaagta ctgaggcgga atacatcgca cttgcagaat gtcttcaaga 4740 gctgcagtgg ttgcgcaagc tgatggacga tctaggcgag tcgatgaagt taccagtacg 4800 tgtgaacgaa gataatcaaa gctgtattgc ccttgctgtt gcagatagaa ccactcgaaa 4860 atctaagcat attgacacta aatactgcta tgtgaaggat cttgtgaaga acggaattgt 4920 aactatccaa tattgcccaa cggataagat ggaagctgat ctgttaacca agcctttggg 4980 agcagttaaa ctgaagcagc tgagagaagc gattggagta cggctggcca atgttgagga 5040 ggag 5044 // ID BEL-107_AA-LTR repbase; DNA; INV; 205 BP. XX AC AAGE02022366; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-107_AA_; KW BEL-107_AA-I; BEL-107_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022366; Positions 140695 140491. XX SQ Sequence 205 BP; 53 A; 38 C; 39 G; 75 T; 0 other; tgtttggacg tattgattat aattcgaaga ttgatatttt gttcagtgtg tttgcgcttt 60 caattttctt tttcttttta gcacacacac acattttctg taacagtcag tactagagag 120 acggaacacg aataaagacg ttgttgaatt acagtccgcg cgttttcact cgtctctaaa 180 gcaaagcgac tttttcggtt gcaca 205 // ID Gypsy-51_CQ-I repbase; DNA; INV; 1948 BP. XX AC AAWU01036160; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-51_CQ_; KW Gypsy-51_CQ-LTR; Gypsy-51_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1948 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 481-481 (2011). XX DR Genome; AAWU01036160; Positions 12693 10746. XX CC 'ATTTT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 60..1871 FT /product="Gypsy-51_CQ-I_1p" FT /translation="MSSDPELKNAILKLTSLIATQQEQITALQQANANPSG FT SEKIIESLATAIEEFRYDPDDGIFFDAWFARYEDVFKEDGKHLDDKAKVRL FT LLRKIGTQFHERYVNSILPKHPRDFKFDETITKLKRLFGRQTSLFHARYQC FT LQYAKNEADDFSSYAASVNRHCEAFQLKKLSSDQFKALRFVCGLQSPRDAE FT IRTRLISKLEAEEANPQPEPPAKGAAPRLTLENLVEECHRIINLKQDTQLV FT EKTEKLNIHAVAKQPRPTATEKKVPSSPCWLCGDMHYVRDCSYKDHMCSRC FT KRQGHKEGYCSMAEQKSSKQVEQPRSKDFVTSKGIRLVKQTDLKQKRKYIR FT IGLNGTTVQMQLDSASDITMISKNVWAQIGQPHIHAADVNAVGAAGGRINI FT IGKFNANITVQSVTKEGCIYVTGDPDFNVGVGDRHNRSIRSVVDSVQLAVR FT LESRSTPRIAWTLQLNSAACQESGGINGSVWTQRLHTQSAIQRGRRRSGTT FT RRPGRLKSNLRRSTCGACRQCSRTRSSSVADIERRHQAYRQQFLRQRRPSG FT CPAIISEAVNKVQPSHFSDGCKRSPESCRRGESGGHKHRLVQAVKRRSRTT FT PVLDAT" XX SQ Sequence 1948 BP; 499 A; 571 C; 526 G; 352 T; 0 other; gtttggcgac gaggaaaagc agaaaagttt tcaagcttcg cgaagttcaa ggtctcaaga 60 tgtcgtctga tccagaattg aagaacgcca tcctcaagct gaccagtctg atcgccaccc 120 agcaggagca gattacggca ctccaacagg ccaacgcaaa tccctccggc agcgagaaga 180 tcatcgagtc attggcaacg gccatcgagg agttccgtta cgacccggac gacggaattt 240 tcttcgatgc gtggttcgcg cgctacgagg acgtcttcaa ggaggacggc aagcacctgg 300 acgacaaggc gaaggtgagg ctacttctac gcaagatcgg cacgcagttc cacgagcgct 360 acgtgaacag catcctgccg aaacaccccc gagacttcaa gttcgacgag acgatcacga 420 agttgaagag gctcttcggg cgacaaactt ctctcttcca cgcccggtac caatgtctcc 480 agtacgcgaa gaacgaagcg gacgatttct cttcgtacgc ggcatcggtc aaccggcact 540 gtgaagcgtt ccagctgaag aaactgtcca gtgatcagtt caaggcgctc cggttcgtgt 600 gcggtctcca gtcaccgcgc gacgctgaaa tccggacgag gctgatcagc aagctggaag 660 ccgaggaggc caacccccaa cctgaacctc ctgctaaggg cgcagcacca cggctgacgc 720 tcgaaaatct ggtcgaagag tgccatcgga tcatcaacct caagcaggac acgcaactgg 780 ttgagaaaac cgaaaaactc aacatccacg ccgtcgctaa gcaaccacgg cctactgcga 840 cggagaagaa ggtgccaagt tccccgtgct ggctctgcgg tgacatgcac tatgtcagag 900 actgcagcta caaggaccac atgtgcagca gatgcaagcg gcaagggcac aaggaaggct 960 attgctccat ggccgagcag aagtcgtcga aacaggtcga gcagccaagg tctaaggact 1020 tcgtcacgtc caagggcatt cgcttggtca agcagacgga cctgaagcag aagaggaagt 1080 acatccgcat cgggctcaat gggacgaccg tgcagatgca gctggattcc gcttccgaca 1140 tcacgatgat ctccaagaac gtgtgggcgc aaatcggaca accccacatt cacgccgcgg 1200 acgtcaacgc cgtgggtgcc gctggaggtc gcatcaacat catcggtaaa ttcaacgcca 1260 acattaccgt ccagagcgtc acgaaggaag gatgcatcta cgtcacggga gatcctgact 1320 tcaacgtcgg ggtcggggat cgacacaatc gaagcattcg atctgtggtc gattccgttc 1380 aactcgctgt gcgccttgaa tccagatcaa caccacgaat cgcttggacg ctgcaactca 1440 attcagctgc gtgccaagaa tctggtggca tcaacggatc cgtttggaca cagcgcttgc 1500 acacacagtc agcaattcaa cgaggccgtc gtcgctccgg aacaacgcgt cgacctggcc 1560 gtctcaagtc caatctacga cgatcaacct gcggagcctg tcgacagtgt tcccggaccc 1620 ggtcatcgtc agtcgccgac atcgagcgac gtcatcaagc ttaccgacag cagttcctgc 1680 gccaacgccg gccgtctgga tgtcctgcga tcatttcgga agctgtcaac aaagttcaac 1740 catcgcattt ctccgacggc tgcaagaggt ccccagaatc ttgccgtagg ggggaaagcg 1800 gcggtcacaa gcatcgtctg gttcaagccg tcaagcgtcg ttcgcggacc acacctgtct 1860 tggacgccac gtgaagacaa gatttggcca caaccagatg ccggtcacat gaaggtttta 1920 catccctcaa acctcttaaa gagggaga 1948 // ID Jockey-2_DYa repbase; DNA; INV; 3616 BP. XX AC . XX DT 16-MAY-2009 (Rel. 14.05, Created) DT 16-MAY-2009 (Rel. 14.05, Last updated, Version 2) XX DE Jockey-type non-LTR retrotransposon: consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-2_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-3616 RA Jurka J.; RT "LINE-type retrotransposon families from fruit fly."; RL Repbase Reports 9(5), 967-967 (2009). XX DR [1] (Consensus) XX CC Probably 5'-incomplete. XX FH Key Location/Qualifiers FT CDS 323..1264 FT /product="Jockey-2_DYa_1p" FT /translation="MTQTNNRETFKIVFWNACGANNKVDELSLLIQREKAH FT IVIVTETRLDRKSTKLDFPGYVTYLAQNPVSSKRGGVATIVSTRVRHSALE FT AIEKECIQSAPIVLLPENNRRNEMMVIASVYCPPQPKWSSSHFSDILNYAE FT KILGGRTKFVLCGDWNAKHRQWGCSRGCQRGAALYDAVKADTMTEIIATGC FT ATHFPFDSRKNPSAIDFSISKGLGMYEKKISSSSDLSSDHLPILLEINLDR FT NTFFLHKQNNSILKKNTNIVLFKNALDRKVLLNTEIRVAQDINDAINIFIK FT NIMEQLPPQTTRQPQKQARAN*" FT CDS 1479..3548 FT /product="Jockey-2_DYa_2p" FT /translation="MRQLWRLLDEGKKKIQPNFPLKLETKEGYKWTKTTQE FT TTEAFVSHLEGRFKPNMIIPDCHISKINTGLKTIKESNTAERENTNKNPHN FT QPVTLKELREELKNLNNNKAPGKDLITNKLIKNLPTKATLYLILIYNSILR FT IGHYPDNWKHATVKMILKPGKSANDPKSYRPISLLSGFSKIFERLLLKRLV FT RIDSFKKAIPLHQFGFRKDHGAEQQIARVTQFILEAYERKEYCSAVFLDIT FT EAFDRVWHEGLLLKLAKILPFNLYTILESYLTNRTFEVKDPTGEPSRTGQI FT GAGVPQGSNLGPILYSIFSSDMPIPRTHSVSPTGEMLLSTYADDTVVLSTD FT QLPTAAIRNNQNYLETFSDWADKWGIKVNAAKTGHVIYTLKNDLPTNLKTI FT KIRGNKIKNQNKQSYLGVILDNKLTLSSHVTKLVGKFTAAHKKLTWILSER FT SKLPDNTKMLIFKTILSPIWQYAIPAWGPLVTDAQIKRIQVEENLKMRKVC FT RAGRFTRNQFIRDKYGVKTVEDYYKQATHRFYETIKTHPNKAVRRIVTRHY FT IPKRLERSRQRLLKLTKEHIPQGPTGLTSPKLPKIPELYECKTLKTRKERD FT SIRTRHLEDLPTLLRLEEMEDIRIRNQEEMEKRERENEKWPPDRWCELEIN FT MYNKRYRNGDLTRQEIIKKFRGQPIGIQRIILPDYEGDTT*" XX SQ Sequence 3616 BP; 1381 A; 836 C; 703 G; 696 T; 0 other; tagtattgct acgttagaga aagagcgaaa aaacccgaaa agcgtcctaa acccggcaaa 60 cacccatctg tatcacttcc gtccaccccc aatggcacaa aatctacccg aagagataca 120 gaaaaacgcg atcctcgagc caaatcttct aaaacgcata gaaagcatgg aagagaagat 180 aaacaacctt cttgagatag tcactcgcct cgttcaaaaa gatgtaactg gccgaatttt 240 cccgaaaaat ccctctgagg caaaaggaca aatctttgtc taaaattttc cctccacaca 300 tatcggaata gtgacatcta ggatgacaca gacgaacaat agggaaacct tcaaaatcgt 360 gttctggaac gcttgtgggg cgaacaacaa agtggacgag ctcagcttgc tgatacaaag 420 ggaaaaggct cacatcgtca tagtcacgga aacaaggcta gacaggaaat ccaccaagct 480 ggattttcct ggatatgtga cctatctagc gcaaaacccc gtatccagca aaagaggagg 540 agtcgccact attgtaagca ccagggtccg tcattcggcc ttagaagcga ttgagaagga 600 atgcatccag agtgccccaa tcgtattgct accggagaat aacagacgca atgaaatgat 660 ggtaatagcg tcagtatact gccctcccca acctaagtgg tcgtctagcc acttcagtga 720 catactcaac tatgctgaga aaatcctagg aggtcgaact aaatttgtcc tatgtggcga 780 ttggaatgct aaacatagac agtggggctg ctctcgcggc tgccagcgtg gcgctgcact 840 atacgatgct gttaaagctg ataccatgac cgagatcatc gctactggct gtgcgactca 900 tttccctttc gactctagga aaaacccatc agcaatagat ttttccataa gtaaaggtct 960 tggaatgtac gaaaagaaaa tctcatcgag ctcggatctc tcctcagacc accttcctat 1020 cctgctggaa atcaacctcg acagaaacac cttcttcttg cacaagcaaa acaacagtat 1080 ccttaagaaa aatacaaaca tcgtactctt taaaaacgct cttgacagaa aggtccttct 1140 aaatactgag ataagagtag ctcaagatat aaacgacgcc ataaacatct ttataaaaaa 1200 catcatggag caactccccc cccagactac cagacaaccc cagaagcagg cacgggcgaa 1260 ctaacagaca tagtaatact ctcacactag acgaaaacac cagtagatta ttggaagaaa 1320 aaaggacact aaagaggatt tttaaagcca ccaggacaaa tgaggacaaa gcgaaactga 1380 aagcagctga aaacagacta aaaaaagcta taaaaacctt aagagagaag actataaaca 1440 aacaagtcga aggaatcgat acaaaaaacc cagacaggat gagacaactg tggagattgc 1500 tggacgaagg aaaaaagaaa atacaaccaa attttccact taaactagag accaaagaag 1560 gatacaaatg gactaaaaca acccaagaga caacagaagc ttttgtttct cacttggagg 1620 gaagattcaa gcctaatatg atcatacccg actgccacat tagcaaaata aatactggac 1680 ttaaaacaat aaaagaatct aatacagctg aacgagaaaa tacaaacaaa aacccccaca 1740 accaaccagt cacgctaaaa gaactaagag aagaattaaa aaacctaaac aataacaaag 1800 ccccgggaaa agacctaatt acaaacaaac ttattaaaaa cctacctacg aaagcgacac 1860 tttacctaat attgatctac aattccatac taagaatagg acactacccg gacaattgga 1920 aacatgctac agtcaaaatg attctaaaac cgggaaaaag cgcaaatgac ccgaagtcat 1980 ataggccgat cagcctttta tcgggttttt caaaaatatt tgaaaggctt cttctcaaga 2040 gactggtcag gatcgactct ttcaaaaaag ctatcccact acaccaattt ggattcagaa 2100 aagaccatgg tgccgaacaa cagatagcaa gggtcacgca attcatcctt gaggcatacg 2160 aaaggaaaga atactgctct gcggttttcc tggacatcac agaggcgttt gacagggtgt 2220 ggcacgaggg gctactacta aaactagcta agatcctgcc tttcaatctc tacaccattc 2280 tggagagcta tctgacaaac agaacatttg aagtcaagga cccaacagga gagccatcta 2340 gaacaggaca aataggtgct ggtgtgcctc agggaagcaa cctgggcccc atactgtact 2400 caatcttctc ttcagacatg cctatcccta gaacacacag cgtctcacca acaggagaaa 2460 tgctgctgtc aacgtatgca gatgacacag tagtcctcag cacggaccaa ctgccaaccg 2520 cagccatacg taataaccaa aattacctag aaaccttttc tgactgggca gacaagtggg 2580 gcattaaggt taatgctgcc aaaacaggac acgttatata cactttaaaa aatgacctgc 2640 cgacaaacct aaaaacgatt aaaattaggg gcaacaaaat aaaaaaccaa aacaaacagt 2700 cctaccttgg agtaattctt gacaataagc ttacccttag ctcccacgtc accaagctag 2760 tgggaaaatt cactgcagcc cacaaaaaac tgacttggat attgagcgaa agaagtaaac 2820 tcccggacaa cactaagatg ctaatcttta agacaatcct atcgccaata tggcagtatg 2880 ccatcccagc ctggggtccc cttgtcacgg atgcccaaat aaaaaggatt caagtcgagg 2940 aaaaccttaa aatgagaaaa gtttgcaggg cagggagatt tacaaggaac cagtttataa 3000 gggacaaata cggagtcaaa acggtagaag attactacaa gcaggccacc cacaggttct 3060 acgaaaccat aaaaacacac cccaacaaag cagtccgaag aatcgttacc agacactata 3120 tccctaaaag actggagaga agcagacagc ggctactaaa attgacaaag gaacacatcc 3180 cccaaggacc gactggacta acttcaccca aactccctaa aatccccgaa ctctacgaat 3240 gtaaaaccct aaaaacacgc aaagaaagag acagcataag aacaaggcac ctagaggatc 3300 tccccacact tctaagactg gaagagatgg aggacataag aattaggaat caggaagaga 3360 tggaaaaaag agagagagag aacgaaaaat ggcctcctga cagatggtgc gagttggaga 3420 taaatatgta caataaaaga tatagaaatg gggacctaac taggcaggaa ataataaaaa 3480 aattcagagg gcaacccata ggcatccaac gaataatcct acccgattac gaaggggaca 3540 caacataata actaaaacaa tttaaaaaca agaaggctaa gcaaaaacaa acaaaagaaa 3600 aaaggggaaa aaaaaa 3616 // ID Copia-14_AA-LTR repbase; DNA; INV; 123 BP. XX AC supercont1.71; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_AA_; KW Copia-14_AA-I; Copia-14_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-123 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.71; Positions 1307799 1307921. XX SQ Sequence 123 BP; 40 A; 31 C; 16 G; 36 T; 0 other; tgttgatcta agcaaccgga agtccatgtc catgatcaaa gcacccatag aggaagtgac 60 gtcttctttt tcttcttcat tcctattcca actactaacc aataaacacg tattataaaa 120 cca 123 // ID hAT-29_HM repbase; DNA; INV; 2568 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-29_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2568 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2018-2018 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 191..2176 FT /product="hAT-29_HM_1p" FT /translation="MSRVNSRKYLEKWEADYPWLKKSSDGTESAFCTLCRV FT TIQPRKSSVEQHSGTFRHASREKHLNTKSQITFPLVSKKADCEADIHLALC FT VCCHAPISTINHLGEIISLYGKNSPLENLRLHRTKCTQIIKNVISSSIESE FT ISESMADKPFSILIDESTDVSSTKHLCICARYYSDEQKSVIDDFLGLAAVI FT STTGADLFAVLETKLTSVGLSLKNCIGYSSDGASNVIGFHNSVWSRVRAVS FT PNCILMRCTCHSLNLIVQHSFELMPSSVGFLLSEIPSFFSKSSIRREEFKV FT LFQTMDPSNERMGTPSPFQKYSATRWLVRGKCLYNLMVNWYELKAYFLCAE FT AQAPACIRYRVTTINNMFKDETLYLYVVFLTPIVQEFERLNALFQSTNVDP FT EKLVSELNLHYKSLQQRVLDNQGVQLPLKRVDFGAKFLSDCQKYIDTYPQT FT QKEIARKNIESVLTRCLQFLLRLLKELDQRLPENKEIFQGLSYLSPSKVLH FT QSSKMPFTHLPLRHIMKDKAQNIEDQYRKVSLVDWSSTELFENSCIPKDTT FT EFWTAVRHYKNTSGDAAFKELADYCLTCLVLPISNAFVERIFSQVTFVKNK FT PRNRMSDKLLSSVVRVKAYLASRNKCCKDFKISDKMLTKLGTDMYSDDDEL FT VEDDLTLIDDLN*" XX SQ Sequence 2568 BP; 817 A; 434 C; 438 G; 879 T; 0 other; cagtgttgcc aactggcatt ttttaatgcc aaatttagta aaattggcat tattttaaca 60 tgttggcatt ttttggcatt ttggcattat tttggcattt tggcattttt ttggcatttt 120 taaaaaaact gttgaattga tttttttttc agttggtttg tatttaattt gtattagaac 180 attaagcaca atgtcgaggg taaattcaag aaagtacctt gaaaaatggg aagcggatta 240 tccttggctt aaaaaaagct ctgatggtac tgaatcagca ttctgtaccc tttgtcgtgt 300 gactatacaa ccacggaaat cgtctgtaga acagcacagt ggtactttta gacatgcttc 360 aagagaaaag catcttaaca caaaaagcca gattacgttt cctttagtat caaagaaagc 420 agactgtgaa gcagatattc atcttgctct ttgcgtgtgt tgtcatgcac ccatttcaac 480 gataaatcac cttggagaaa tcatttctct ttatggaaaa aatagtcccc ttgaaaacct 540 tcgtctgcat cgaaccaaat gtacccaaat tattaaaaat gtaatttctt catcaatcga 600 atcagaaatt tctgaatcaa tggcagataa acctttttcc atcctgattg atgaatcaac 660 tgatgtttct tcgacaaaac atctttgcat ttgtgctcga tactattcag atgaacagaa 720 atcagttatc gatgactttc ttggtttagc tgctgtgata tcaaccactg gtgcagatct 780 ttttgctgtt ttggaaacaa agttgacgtc tgttggattg tctttaaaaa actgcattgg 840 gtatagctca gacggagctt caaatgtcat tggatttcac aattcggttt ggtctcgtgt 900 tcgggcagtc tctcctaatt gtattttaat gcgctgtaca tgccactcct tgaacctcat 960 tgttcaacat tcctttgaat taatgccatc aagtgttggc tttctgctca gtgaaattcc 1020 ttcatttttt tcaaaaagtt ccattcgacg agaagagttt aaagtacttt ttcagacaat 1080 ggatccaagc aatgaaagga tgggaactcc atcaccattt caaaagtatt cagctactcg 1140 atggctcgtc agagggaaat gtctttacaa tttgatggtg aattggtacg aattaaaagc 1200 ctatttctta tgtgctgaag ctcaagctcc tgcttgcatt aggtatagag ttacaacaat 1260 taacaacatg ttcaaagatg aaactctcta tctttatgtc gtctttttga caccaattgt 1320 tcaagaattt gagcgtttga atgctctttt tcaaagcact aatgtggatc ctgagaagct 1380 agtttcagag ctaaacctac actacaaaag tctacaacag cgtgttcttg acaaccaagg 1440 agttcaactg cctctaaaac gtgttgactt tggggcaaag tttttatcag attgccaaaa 1500 atatattgat acttatcctc aaacccaaaa agaaatagcc agaaaaaaca ttgaaagtgt 1560 tctaacacgt tgtcttcaat ttcttttgcg tcttttgaaa gagcttgatc aacgtctacc 1620 tgaaaacaaa gaaatttttc aaggtctatc ttatctctca ccatcaaagg ttctgcatca 1680 aagttctaaa atgccattca ctcatctgcc tttaaggcat attatgaagg ataaagcgca 1740 aaacattgaa gatcagtaca gaaaagtaag tttggttgac tggtcttcta cggagctgtt 1800 tgaaaattct tgtataccaa aagatacaac tgagttctgg actgcagttc gccactacaa 1860 aaatacatca ggagatgcag catttaagga actagcagac tattgcctaa cttgtcttgt 1920 tctccctata tcaaacgctt ttgtggaacg tattttttct caggtcacct ttgttaaaaa 1980 caaaccaaga aacagaatgt cagacaaatt gctcagcagt gtagtgcgag taaaagccta 2040 ccttgctagc cgcaacaaat gttgtaaaga ctttaaaatc agtgacaaaa tgttaactaa 2100 attgggaacg gacatgtatt ctgacgatga tgaattagtc gaagatgatc tcactctaat 2160 tgatgattta aactaaagga agtttgatat tgagacattt tttataattc ttttaacaga 2220 atttgaaatt ttctttaact catatttttt aactttagat ttatatttat ttgaaagttg 2280 agtgaacata cttatctatt taataaatac ataaataatc tatgttaaac gtcgtttttt 2340 tattgaaaaa tatataaaaa atatgttaaa actgattaat atatgacttt taaaatagtt 2400 ttattgaata tatagttaat aaataaggtt caatgttttt ttataaagtt tttctgctat 2460 tttaaaagtt ggcatttttt ttggcattat taataaaaat ttggcattat ttggcatttt 2520 ttagacaagc atttggcatt ttagcgcaaa aaaaatctgg caacgctg 2568 // ID Copia-5_AA-I repbase; DNA; INV; 3992 BP. XX AC supercont1.211; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_AA_; KW Copia-5_AA-LTR; Copia-5_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3992 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.211; Positions 1838483 1842474. XX CC Positions [1441-1953] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(97..1371,1375..3990) FT /product="Copia-5_AA-I_1p" FT /translation="MDASKVMMERLNNANYEAWRFRIEMLLIREGLWKHIS FT EEVPNPVTSEWKQGDDKARATIALFVKDNQYPIIRDCKTSRETWEVLKKHH FT QSVSLTSRFTLLKRICQLQFRDGDDMEQHFQEMERLFSRLSNAGQKLDSQL FT MVAMVLRSLPESYDTLTTALESRSDDELTMDLVKCKLMDESMKRRKYLSRG FT ESLLRVESSKKQILCHNCGKPGNVEKDCRLQSYQKVDESRRKFKPKARTVK FT EEESSMFCFTVRNKPTAMNSWVIDSGASTHMCAEKRFFKHYKTCSGITVTL FT ADGSEVSVAGKGSGTLQCTVDNRETQVTITDVLHVPKLDANLVSVGKLTEK FT NAEVIFTGETCKILKHGHVAAIAVKKSDLYQLETLNNSKSITAMSVKHAEN FT CIHKWHRILGHRDEEAIRKLVRDDLVTGIQIKCEVNADCECCLEGKMTRLP FT FPKQAGPQSSKMLELVHTDLCGPMNTVTPGGARYFMTIIDDFTRYCTVYFL FT RKKSEVTERMEEYIWMVETRFGKTPAIIRSDCGGEYKSSNLDRLYRQKGII FT PQYTAAYSPQQNGVAERKNRSLVEMARCMLIDAGIDYKYWAEAINTASHIQ FT NLLPTRATARTPYEAWYKKTPDLKHLHIFGSSAFIHIPDQRRTKLEPKGIK FT LTFVGYSDNQKAYRFVDLATNKVYHSRDVRFVNEGKPLKIFSKPSILVEYE FT SILEPSISNDLSIDEGNQEFETEDDQSTSDESYLDCSQGIDTEDELVTLQT FT PPENIRRSERSTKGIPPTRYCEVADMIKKNIPEPRTIKEALASPESELWRA FT AVEEELQSHKENCTWNIVSLPHGRKTIGCKWIFKRKIDETGKVVRYKARLV FT AQGFTQKFGTDYDEVFAPVVKQTTFRTLLSVAHQRNMLIKHVDVKTAYLNG FT ELQETVYMKPPPGCDYEDPSLVCHLQKGLYGLKQAANIWNKKLDSVLRKLR FT FKPSESDPCLYSKHNDDGTMSFIAVYVDDLLIVCKSEKEHEAIFKNLNSHF FT KVTSLGDITHFLGIHVIRSQDGVSLNQKAYIQKLLTRFGLQDAKSSKIPLD FT PGYIQSKEEETERLPNNHQFASLIGGLLYIAVNTRPDIAVAVSILGRKTSN FT PSQDDWNEAKRVLRYLKGTIDHRLVLGIDSSQLEVFVDADWAGDSKDRKST FT SGFLFRYAGGMIGWNARKQDNVTLSSTEAEYVAMADCCKELTWILRLFNDL FT KIVTTLPVKINEDNQSCIKQLSAPNINRRSKHIEKKYHFIRQLKQDGIINP FT QYLPTEDMIADMMTKLLYGIKLSKFRDQARVISSRR" XX SQ Sequence 3992 BP; 1302 A; 787 C; 931 G; 972 T; 0 other; agaggttatg ggcctttaag aaactgaaat tttgattttc aattctgaag aactctgaat 60 ttaaagattg gtcggaactt tggatctgtt ttcaagatgg acgctagcaa ggtcatgatg 120 gaacgtctga ataacgccaa ttatgaagct tggagattca gaattgaaat gctcctgatt 180 cgtgaaggat tgtggaagca catctcggag gaggtaccca atccggtcac aagcgaatgg 240 aagcaagggg acgataaggc cagggcgact attgcacttt tcgtcaagga caaccaatat 300 ccgattatcc gggattgcaa gacttcgaga gaaacttggg aagttctgaa gaagcatcac 360 cagagtgtgt ctctcacatc tcggtttacg ttattgaagc ggatttgcca gttacagttc 420 cgggatggtg acgacatgga acaacatttc caagaaatgg aacggctctt ctcccgccta 480 tctaacgctg gccagaagct agattctcag ttgatggtcg cgatggtatt gagaagtcta 540 ccagaatcat acgacactct cacgacggct cttgagagtc gctcagatga tgagttaact 600 atggatttgg tgaagtgcaa attgatggac gaatccatga aacgtagaaa atatctgtca 660 cggggagaat ctttattgcg cgtcgaatcc tcgaagaagc agattctttg ccataactgt 720 ggaaaacctg gtaacgtcga gaaagactgt cgattgcaat cctatcaaaa ggtagatgaa 780 tcccgccgga agttcaaacc taaagcaaga actgtgaaag aagaagaaag ttcaatgttc 840 tgtttcacgg tacgtaacaa gccaactgcg atgaactcct gggtcattga ttctggtgca 900 agtacacaca tgtgcgcaga aaaacgtttt ttcaagcact ataagacatg ctcagggata 960 accgtcacat tagccgatgg cagtgaagtt agtgttgctg gaaagggctc tggaacactg 1020 caatgtactg tagacaaccg tgaaacgcaa gtaacaatca ccgatgttct ccacgttccg 1080 aagttggacg caaatctggt atcagtcggt aagctaactg agaagaatgc agaggtgatt 1140 ttcactggag aaacatgcaa aattttgaaa catggacacg ttgccgccat tgcagtgaag 1200 aagtcagacc tatatcaact ggaaacattg aacaacagca aatcgataac cgctatgtca 1260 gtgaagcatg ccgaaaattg tattcataaa tggcatcgaa tcctagggca ccgagacgaa 1320 gaagccattc gtaaactagt acgtgatgat ctagtaactg gtattcagat ttagaaatgt 1380 gaagttaacg ccgactgcga gtgttgcctg gagggtaaga tgacgcgatt gcctttccca 1440 aaacaagctg gcccccaatc atcgaaaatg ttggaactcg tgcataccga tttgtgtgga 1500 cctatgaata ccgtcacacc aggaggcgcc cgatatttta tgacgattat agacgatttt 1560 acccgatatt gcaccgtgta ttttcttcga aagaaatcgg aagtgacaga aaggatggag 1620 gaatacatct ggatggttga aacacgtttc ggaaaaacac ctgctatcat tcgatccgat 1680 tgcggcggag aatataaatc aagcaacctg gacagattat accgccaaaa gggaatcatt 1740 ccgcaatata cagcagccta cagtcctcag caaaatggag ttgcggaacg taagaatcga 1800 agtctagtcg aaatggcgcg atgtatgctc atcgatgctg gtattgatta taaatactgg 1860 gcagaagcta tcaacaccgc ctcgcatatc caaaacttgt tgccaactag agctactgca 1920 agaacaccat atgaagcctg gtataagaag acgccagatt tgaaacattt gcatatattc 1980 gggagcagtg cgttcatcca tattcctgac cagaggagga ctaagctgga gccaaaagga 2040 atcaagctta cgtttgtagg atattctgat aatcaaaaag cataccgctt cgtggatctg 2100 gcaaccaaca aagtctatca cagtagagat gtacgatttg tgaatgaagg aaaaccgttg 2160 aaaatttttt cgaagccttc catcttagtg gagtacgaat caattctaga gccaagtatt 2220 tcaaacgatc tgagtattga tgaaggtaat caagaattcg aaacggaaga tgatcaatca 2280 acgtcagatg aatcttatct ggactgtagt caaggaatcg atacggaaga tgaactagta 2340 accctgcaaa caccacctga gaatataaga cgttcggaga gaagcacgaa gggaatacca 2400 cccacacgtt attgtgaagt agctgatatg ataaagaaga atatccctga accacgcaca 2460 atcaaagaag cgctagcgag tccggaaagc gaactttgga gagctgcggt tgaggaggag 2520 ttgcagtcgc acaaagaaaa ctgcacttgg aatattgtat ccctgccaca tggacgtaaa 2580 acaatcggct gtaaatggat ttttaaaagg aagatagatg aaactggcaa agttgttcgt 2640 tataaggcaa gattggtggc ccaaggcttt acgcaaaaat ttggaacaga ttacgatgaa 2700 gtttttgccc ccgtggtgaa gcaaactacg tttagaactt tgctgtctgt ggctcaccaa 2760 aggaatatgc taatcaaaca cgtggacgtg aagacggctt acctaaacgg agaactccag 2820 gaaactgtct atatgaaacc acctcctggt tgtgattatg aagatccaag tctggtgtgt 2880 catcttcaga aaggactata tggattgaag caagccgcaa atatttggaa caagaagctc 2940 gattcagttt tacgaaaact tagattcaag ccctctgaaa gcgatccgtg tttgtattcg 3000 aagcacaacg atgatggtac gatgtctttc attgctgtat atgtggacga cctactcatt 3060 gtttgcaagt ccgagaagga gcatgaggcc attttcaaga atttgaatag tcacttcaaa 3120 gtaacatcac tgggagatat aacacatttt cttggcatcc atgtaatacg ttcgcaagat 3180 ggtgtgtcat taaatcagaa ggcgtacatc caaaaacttt tgacaagatt tggactacaa 3240 gatgcaaaat cttcgaagat tccacttgat cccggctata tacagtcaaa ggaggaggaa 3300 acagaaagat taccaaacaa tcatcagttc gccagcctga tcggaggatt gctgtacata 3360 gccgtgaata ccagaccaga tatagctgtg gcagtgtcta tccttggacg aaaaactagt 3420 aacccaagtc aagatgactg gaatgaagca aaacgagtac ttcgctatct caaaggaacc 3480 atcgatcata ggctagtatt gggcatcgat tcatcacaac tagaagtatt tgttgatgct 3540 gattgggcag gagattccaa ggatcgcaag tcaacatctg gtttcctatt tcgttatgct 3600 ggaggaatga ttggatggaa tgcaagaaag caagataatg tgacactcag tagtacggaa 3660 gccgaatacg tggccatggc agattgctgt aaagaactca cctggatact tcgtttgttc 3720 aatgatttga agatcgtaac aacattgccg gtgaaaatca atgaggataa tcaaagctgt 3780 atcaagcaat taagcgcacc gaacatcaac cgaagatcca agcacattga gaaaaaatat 3840 catttcatcc gacaactcaa acaagacgga attatcaatc ctcaatactt gccaaccgaa 3900 gacatgatag ccgacatgat gacaaaacta ttatatggaa ttaagctctc aaaatttcgg 3960 gaccaagcaa gagtaatttc gtcgaggagg ag 3992 // ID Gypsy-4_RP-I repbase; DNA; INV; 3891 BP. XX AC ACPB02026971; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_RP_; KW Gypsy-4_RP-LTR; Gypsy-4_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-3891 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02026971; Positions 15369 11479. XX CC Positions [1482-1937] - Reverse transcriptase CC Positions [2958-3476] - Integrase core CC 'CTCCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 768..3866 FT /product="Gypsy-4_RP-I_1p" FT /translation="MGDHQPAVNKWDAVDGKAEAATANKFRRLFVTDIKTG FT TRYLVDTGAAISIIPNCNKARERDSGFSLYAANGTPIPTYGEQRKSLDLGL FT RRLFVWDFIVANVTQHILGADFLAGSNLVVDLAQRCLIDRTTGLATIGSIE FT VVELLELSTVPPHVPFRELLLQFPTLTKVGSHKVRQNNGIFHHIETTGPPV FT CSRARRLPPGKYQAVKEEFQQMMNDGICRPSKSPWASPLHVVEKKDGTLRP FT CGDYRLLNSKTVPDRYNVPNLHDFTIQLYGTKIYSALDINKAYHHIPVAEE FT DIPKTAIITPFGLFEFVKMTFGLRNAAQTFQRYMDHLFRDLPFVFVYVDDI FT LIASKDEREHEAHLKVVFQRLSERGLTLRTDKCVLGQPTVSFLGYTVSPEG FT ICPLQEKVKAIRELPKPATIYDLRRFLGMVNFYRKCLPNAATTQRELNKYL FT HHAKRRDKTPIVWSREAEAAYEKTKEDLANATCIAHPSPDSQLSVACDASN FT IGMGAVLQQQEEGTLQPLGFFSKAFNKAQQNYSTYDRELLAVYEAIRYFRP FT MIEGRHILIYTDHKPLTYAMLQSNKTLCPRRLRHLNYISQFSTDIRYQRGE FT DNTVADALSRIAEVTVLNNFQNLAEAQKRDEEIDTQEKRETLRLQLVPVPG FT AEYSLFCETTKGTPRPYVPGELRKPIYENLHSLSHPGIRASRKLVTARFFW FT PNMEKDVAYWAKHCIPCQRAKVNRHTSTSLGSFMPSKKFEHVHLDLVGPLP FT PSDGKEYLLTMVDRATGWPEAIPLTSITATAVAEAFYRHWIARYGVPVRLT FT TDRGRQFESTLFNQLAQLLGITHLRTTAYHPQSNGMVERWHRSLKSALRTR FT LEEASKWTKTLPTVLLGLRSALRRTTGVSAAEAVLGQPLRLPGEIFMPPRT FT QEPTDASLQLLRQEWARTIGPTQTKCFVSPDLEKCTHVFLREEVHKRSLTP FT PYSGPYLVRKRDDKTATVQLPGREDRVSLDRLKPAYLLEESQQDAPKPPEP FT STANDEGSEHRSTRPSRTIRRPVRFAT" XX SQ Sequence 3891 BP; 1107 A; 1032 C; 1005 G; 747 T; 0 other; attggtgacc ccgacgtgat gactgaagcg agggacgcac cgctcgagcc tgcaccggac 60 acagaggctg taagcctcgt gaagttaccc cagttctggg aggaaaatcc agcattatgg 120 tttctacaaa tcgaagctct atttggggcc tacaagatca agagcgaggc tggacgattt 180 caagccctgg ttggtcagct accatatcag gtgcttacac aaatagccgg ctcagtgcag 240 tcacctggtc cagtgcctta cacgtcaacg aaggaaaaac tgatcgctat ctacagtcaa 300 agccaggaga gaaggatact aaggcttctg gaacaaacac agcttggcga catgactcct 360 tcacagctgt tgcgacaaat gcaaaaccaa gcgggtacag cgatgtcgga cggcgtctta 420 cgtaccgttt ggctgcgagc actaccgcag agagtgagag ggattctagc cgctattgaa 480 caggacgacc tagacaagtt ggccactgtg gcagacaagg taatggaggt agactccaat 540 acctctgttc acgctgtgag ggatactgac ggcgacagac ttgctcgact ggagaagcaa 600 ctgcaagttt tgcacgagca attttcttca ttggccacta ccattcagcg cagcaaccgt 660 ggaagaagta ggtcccgtaa ccgtgaccaa ggagctccca aggatgggta ttgctactac 720 caccacaagt tcggggagaa ggccaccaag tgcaagaaac cttgcaaatg ggggaccacc 780 agccagccgt caacaaatgg gacgcggtag acgggaaggc ggaagccgct accgctaata 840 aattccgccg tctcttcgtc actgacataa aaacagggac aagatacctg gtggacactg 900 gcgcggcgat atccatcatc cccaactgta acaaggccag ggaacgagac tcaggattca 960 gcctgtatgc ggctaacggc actccaatcc caacttatgg agagcaaagg aagagcttgg 1020 atcttggctt acgccgactg ttcgtgtggg atttcatcgt ggccaacgtc acgcagcaca 1080 tacttggggc agactttctc gccggaagca atctggtggt tgatttggcc cagcgctgtc 1140 taatagaccg gaccacagga ctcgcaacta tagggtccat tgaagtcgtg gagctactag 1200 agctatctac tgtaccaccg cacgtaccat ttagggagtt gctgctacag ttccctactt 1260 taacgaaggt tggctcacat aaggtacgac aaaacaacgg gatatttcat cacattgaaa 1320 ctacagggcc tcccgtatgc agcagggcgc gcagattgcc gccaggcaag tatcaagctg 1380 tcaaggagga gttccagcaa atgatgaacg acggtatctg tcggccatcc aagagcccat 1440 gggccagccc actgcatgtg gttgagaaga aggatggaac attgcgacct tgcggtgact 1500 accgcctgct gaatagcaag accgtgccag accgatacaa tgtgcctaac ttgcacgact 1560 tcacgattca gttatacggc actaagattt actcggctct ggatataaac aaggcgtacc 1620 atcatatacc agtggcggaa gaagacattc cgaagacggc tataatcaca ccgtttggcc 1680 tattcgagtt cgtcaagatg acgttcggtc taagaaacgc agcccagacc tttcagcgat 1740 acatggacca cctttttagg gacctaccat tcgtgttcgt ctacgtagac gatatactca 1800 ttgcgtccaa ggatgagcga gaacacgaag cccacctaaa agtagtgttc cagcggctga 1860 gtgagcgagg gctgacgcta aggactgata aatgtgtact gggacaaccg acagtcagct 1920 tccttgggta caccgtgtca ccagaaggga tctgcccact ccaggagaag gtgaaagcta 1980 ttcgggaact accaaagcca gccaccatct atgatctaag gagatttctt gggatggtaa 2040 acttctacag gaagtgcctt cccaacgcag ccaccaccca gagagagctc aacaaatatc 2100 tccaccacgc taaacgcaga gataagacgc ctatagtgtg gtcgagggaa gcagaagcgg 2160 cttacgagaa gaccaaggag gaccttgcga atgcaacatg catagcacat ccttcaccgg 2220 actctcagct atcagtagct tgtgacgcca gcaacatcgg catgggagca gtgctacagc 2280 aacaggaaga gggcacacta cagccgctag gtttcttctc taaagcgttc aacaaggcgc 2340 aacaaaacta cagcacctac gaccgagagc ttctggcagt gtatgaggca atacgctact 2400 tcagacccat gattgagggc aggcatatcc taatatacac cgaccacaag ccactaacct 2460 acgcaatgct acagagcaat aagaccttgt gtcccaggag attacggcac ttgaactaca 2520 tctctcaatt ctccacggat atacgctacc agcggggaga ggacaacacc gtggctgatg 2580 ctttatcgag gattgcggag gtaactgtac tgaacaactt tcagaaccta gctgaagctc 2640 agaaacggga cgaagaaatc gatacccaag agaaaaggga gactcttcgc ctccagctag 2700 ttccggtacc tggtgcagag tacagtcttt tctgcgagac cacaaaggga acacccaggc 2760 catacgtacc aggggaatta aggaagccaa tctacgagaa tctgcacagc cttagccatc 2820 cagggatcag agcatcaagg aagctggtga ccgctagatt cttctggcca aacatggaaa 2880 aggacgtagc atattgggca aaacactgca taccctgtca acgggccaag gtcaaccgac 2940 acacctcgac atcgcttggg tccttcatgc cgtccaagaa atttgaacac gtgcacctgg 3000 acttagttgg accactccca ccatcagacg ggaaagaata tctcctgaca atggtcgacc 3060 gagcgaccgg gtggccggaa gcgatacctc tgactagcat caccgctacc gctgtggcag 3120 aggccttcta tcgccactgg atagctcgat atggagtacc agtccgccta acaactgata 3180 gaggaaggca attcgagtcg acgctgttca accagctagc gcaacttcta gggatcacgc 3240 atctcagaac aacggcttac catccacaaa gcaacggaat ggtagaacga tggcaccgtt 3300 ccctgaagag cgcacttcgg acgaggctag aagaggccag caaatggaca aagaccttgc 3360 ctacggtcct gctaggacta aggtcggctc taaggagaac caccggggta agcgcagcag 3420 aggcagtgtt gggacagcca ctacgtttgc ccggagaaat attcatgcct cctcgaactc 3480 aagaacccac agacgccagt ttgcaactgt tacggcaaga atgggcacga accatcggtc 3540 caacgcaaac aaaatgtttt gtcagcccgg acctggaaaa gtgcacacac gtattcttac 3600 gagaagaggt tcacaagcgg tcactgacgc caccttattc aggaccgtac ctagtgcgga 3660 aacgggatga caaaacggct acagtgcaac tacccggcag ggaagaccga gtttcgttgg 3720 accgcctcaa gccagcctac ctactggagg agagccaaca ggatgcacca aaacctccgg 3780 agccttccac cgccaacgat gaaggaagcg agcatcggtc aactagacca tcgaggacca 3840 tcaggcggcc agtgcgattt gcaacgtagc tcttcaatta gggcggaggc a 3891 // ID BEL-15_DPu-I repbase; DNA; INV; 6627 BP. XX AC scaffold_147; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_DP_; KW BEL-15_DPu-LTR; BEL-15_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6627 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_147; Positions 95392 102018. XX CC 'ACACT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1297..6600 FT /product="BEL-15_DPu-I_1p" FT /translation="MAEARLTASRGGNRAAATRLINRINTIVADVAITRAQ FT KIHELQDKIESLEEKMATIANIDNAIQDELDPEDVQAEIEAADTNNQTYRD FT ARAGFTFQLKTLQDEEAAANAILALATAPVVPAAAAATPGPSASMLPKLDL FT PSFKGDILQWSSFWDVFESEVDSKSYGGATKFNFLIYKLEGEAKASLLGLT FT SSNDNYTKAKDILRQRYSQPRKVITAHYKALINLPVANATRSSLRAFADQL FT ESHIRGLEALGTAPAYYGDLLVCFLIDKLAIDVRRNLTRHQGNADWTLDEL FT RDAIAREIEIMGDTSELPPPRPASKQVLFNSSHPSQATKKRICPFCSGEHS FT PTRCSEFKTPEDRAKIVKIKKLCYNCLHSGHTNLKDCPNRFRCQHCSQQHH FT TSLHIEKSPKPGTPPKPSSSHTGCLQTKAVSKVVAAIVTESLPFVFLKTAV FT APVCSHLTSTFANLLIDDGSQITFITARLVKLLRLYPIRRVSLLLSGFKGI FT SASSMEPSYYDVVHFWLQGLNGERLLIQAVVLDSIVEPLEDPHRAILSTLP FT HLQNLQLAHPTSTDDHFSVDILIGADSYWSIVGDENIHGNGPTAVNSFIGY FT LVSGPLQDYHASTQRSSFHISAVQLDDITRLWSLETLGILPDVENADAAAT FT YQADCITYQGNQYTARLPWKDDHPELPSNYLICQRRTRHTVKSLLKKPQLL FT QLYSKIIRDQLSAGFIERVPRTQQAPHGCHYIPHFAVQKDSATTPIRIVYD FT CSCKTRNGASLNDCLLIGPPLQNDMLHILLRFRVHRIGFSSDIEKAFHKVQ FT LHEADRDFIRFLWLSNDNDPDSDFVIYRFKVVPFGASSSPFILNSVIKKHL FT KRSSTPISVDIEANIYVDNLISGSETPEEALNYYAESLSTFQAASLSLQSW FT AFSDPSLNERAISEGVADVSTLTKTLGLLWDRSTDTLQLPLFFLTPFCSTS FT PTKRDVLRGLSSIYDPLGFITPLSIPARILMQEIWKEELQWDDPLPASFAD FT RWLGLCSSLSDAKPKVSRSYFGDRSRVKSLHIFVDASQQAYGAVAYLLDGQ FT NSSFVISKARVAPLKGNGPQLTLPQLELMAALIGTRIATTIITAFQVLGIS FT LSVTMWSDSQIVLYWLSKNDKQKNLFVANRVSTIITFNRLHQASWHYYPTA FT SNPADLVTRGLTLHQLQSSPIWTRGPLWLSSEDEWPQWSVSSLSQIKILHL FT AEISTAAAPERLPLPLDISTVIDIARFNWSSLKRTTAYVSRQFSNFQRPKA FT EWNREFLSTLELIQAERKWIVSFQFRFFANEYEYLRGDRKGRRPALISQLD FT LYLDADSIIRCRGRLNNVDLSSDARNPILLPKNTELTRLIIQHFHERSLHS FT GVSYTISSIRQRFWIPSIRQQVKAIVRLCTRCRRVNGAPYRAPNPATLPSF FT RVRGDKAFAVTGIDFAGPFPVRGPSSQPDSKAYICLFTCTTSRSIHLELVE FT DLTAASFILAFRSFISHHFTPTMVLSDNATTFECAARALKSIFNSSEVTNY FT LSDCQIEWRFIPKRAPWYGGFWERLVGLTKEALKKMLGRTKLKFAEFRTIV FT TEIEAILNDRPITYVSSDLNDPQALTPSHLMYGDRLTPLPYNPAVEEELLD FT PTFGEKPSHLLDMFSRRQRILKAFWSRWQKDYLTSLRERHITCKEKLQPSI FT KVGDVVLVHNEGPRIYWKLAVIESLIISPDGEIRAANIRTAEGKTNRPVSK FT LYPLEVTAPIDPSPSAQVLPNPASSLPTRPRRKAAIAAEKRIRNMAEE" XX SQ Sequence 6627 BP; 1688 A; 1820 C; 1306 G; 1813 T; 0 other; tttggtgccg tgaccaattg tacaacgtaa cgatttcggc catccattct cagattgctc 60 gcattcagca gtagtcgcca ttttgttctc gttgctcgca ttcagcagca acttctcatt 120 tctcttttct cgtttggagg ctttgattta ctttaatttg tcattcttct ggtcgttaat 180 ctattccgat tattcgtccg ttgaacttcg ataaattatt cgtttcattc ccccctctca 240 gtaatttctc ccgccaagaa accgcgtggc tgcgcacgac actgccttct tttgtacaag 300 tcgccatcgc tcaaggctaa ccacagcgtc tctcccacgt cgtaattcca aatctttgcc 360 tttttcgaat ttgttcttcc accgtcttgg tcctaatcgt ccggtcgtta cctttacggg 420 tgttcggtcg tcgatattgg tccagattct ttcatttgtt catcggtttt cgttcatacg 480 gcccagccat tatagtcatc agcatagaga aaattcccac gtcttggtcc taatcgtccg 540 gtcgttacct ctacggtgtt cggtcgtcga tattggtcca gattattttc tttatttctc 600 tctgaacatt tgatgatact aaatgccccg gagtcgtaat agtcccgcaa gccgtcagct 660 acgggcaaaa gccttcaagg aagaaagaat caagaacaag gaagctaatc aacttccgga 720 agaaagacga gtcgtcattt tacgacacga agaatttcaa gaagccaacg aagacgttcg 780 agtcgccatt tttcaagaat ctcaagtcga caatcaagtt gccgaacctc aagtcgacaa 840 tcaagttgcc gaacctcagg tccctgatca gaaggtattg gaagatcgca aacagcgggc 900 aattgaaatc ttcagctgta ttcaagggct ttatcgcaaa ttacaccctc agctgctgcc 960 tccagtgaac attctcgctc aacaggcgtt gagatatttt cacacaaatc agccagcaca 1020 accaagagaa gaagaacatc cgcagttggc gattgaagac ggtgactttc tccaattaac 1080 atatgaagag gacgcctaaa cttttatatt tttgaattgc atggtcgata gctcgaaaga 1140 gtggttcggt cggtggctca gaaactcttc atgtattagg ccaatcatca ataaaatcat 1200 gttgccactt caatcatctt catagtcgtc tctctctctc tcttccgttc tacttatcgc 1260 cctacacttc atcagtcatc aattttgttt ttcatcatgg cggaggctcg tctcaccgca 1320 tcacgtggag gcaatcgtgc cgctgcgact cgtttaatca acaggattaa tactatcgtt 1380 gcagacgttg ccataacgag agcccaaaaa atccacgaac ttcaagacaa aattgaaagt 1440 cttgaagaaa agatggcaac catagcaaac atcgacaacg ccattcaaga tgaattagat 1500 cctgaagatg ttcaagcaga gattgaagca gccgatacga ataatcaaac ttatcgtgac 1560 gctagagccg gctttacatt tcaactgaaa acgctgcaag atgaagaagc tgcagctaac 1620 gcaattctcg ctctcgctac agctccagtc gttcccgccg ccgctgctgc tacacctggt 1680 ccttctgcca gtatgctccc gaaacttgat ttgccctcat tcaagggcga cattcttcaa 1740 tggtccagct tttgggacgt ttttgaatcc gaagtcgatt ccaaaagcta tggcggcgct 1800 acaaaattca actttctcat ctataaactc gaaggagaag caaaggcctc tcttcttggt 1860 ctcacctctt cgaacgacaa ctacaccaag gctaaggata tccttcgaca acgctacagc 1920 caacccagga aggtcatcac cgctcactac aaggcgctga tcaacctacc agtcgcgaat 1980 gccactcgct caagtctacg tgcgttcgcc gatcaacttg aatcccacat ccgtggtctc 2040 gaagcccttg gaacggcccc tgcctactac ggcgatctcc tcgtctgttt cttgattgat 2100 aaactcgcca tcgatgtgcg tcgtaacctc acacgccatc aaggcaatgc agactggact 2160 ctcgacgagt tacgggatgc catcgcgcgt gaaatcgaga ttatgggtga tacgagtgaa 2220 cttccacctc ctcgtcctgc atcaaaacaa gttctcttta attcatcgca tccttcccaa 2280 gcaacaaaga agcgcatttg tccattttgt tctggagagc attcgccgac aagatgctcc 2340 gaattcaaga caccggaaga tcgcgcaaaa atcgtgaaga taaagaagct atgttacaac 2400 tgtctccatt caggacacac taatttgaaa gactgcccta atcgttttcg gtgtcaacac 2460 tgctctcagc agcaccacac cagccttcat attgaaaaga gcccaaagcc gggtactccc 2520 ccgaagcctt cctcttctca tactggatgt cttcaaacca aagcagtttc aaaagttgta 2580 gcagccatag tgactgaatc tttaccattt gtttttctta aaacagcagt tgcccccgtt 2640 tgctcacatc ttacttctac atttgctaat ctactgatcg atgatggatc acaaatcacc 2700 ttcatcacag ctagactggt caaattactc cgtctttatc caatacgacg agtatcgctc 2760 cttctctccg gcttcaaagg aatttcagca tcctccatgg aaccaagcta ctacgacgtc 2820 gtccatttct ggctccaagg attaaacggc gaacgtctct taattcaagc tgttgttctt 2880 gactcgattg ttgaaccttt agaagaccct catcgagcaa ttctctcaac tctacctcat 2940 ctacagaatc ttcagttagc tcatcctacg tcaacagatg atcacttttc cgtcgacatc 3000 ttgattggag ctgattcgta ttggtccatc gttggagatg aaaacattca tgggaacggt 3060 cctacagctg tcaacagttt tatcggatat ctcgtttctg gtccacttca ggattaccat 3120 gcatcgactc aacgatccag ttttcacatc tccgctgttc aactagacga catcactcgt 3180 ttatggtcac tcgaaacttt aggaattctt cctgatgtcg agaatgcaga tgcagcggca 3240 acctatcaag ctgattgcat tacttatcaa ggtaaccagt acacagcgcg gctaccatgg 3300 aaagatgacc atccagaact gccatctaat tatctcattt gccaacgacg aactcgccac 3360 acggtcaaga gtctcctgaa aaaacctcag ctcttacagc tgtacagtaa aattattcgt 3420 gatcaacttt ccgccggttt catcgagcgg gttcctcgta ctcaacaggc gccacacggt 3480 tgtcactaca ttccgcattt tgctgttcaa aaggattctg ctacaactcc gattcgtatc 3540 gtgtacgatt gctcctgcaa gactagaaat ggtgcaagcc tcaacgactg cctcctcatt 3600 ggtcctcctc ttcaaaacga tatgctccac attctgcttc gatttcgtgt gcatcgcatt 3660 ggtttttctt ccgacattga gaaagcgttt cacaaggtac aacttcacga agctgatcgt 3720 gacttcattc gcttcctttg gctcagcaat gacaacgatc ccgattccga ctttgtcatt 3780 taccgtttta aagttgttcc attcggagca agtagcagtc cattcatttt aaattcagtg 3840 atcaaaaagc atcttaagcg tagttcaact cccatcagtg ttgacatcga agctaatatc 3900 tacgttgata atctgatttc gggaagtgaa actccagaag aagcattgaa ttactacgct 3960 gaatcactct ctacttttca agccgcatcg ctatccttac aatcatgggc tttcagcgat 4020 ccatctctca acgaacgtgc tatctcggaa ggtgtggcgg atgtttcgac tctcaccaaa 4080 accttgggtc ttttgtggga tcgctcgaca gacactctac agctacctct cttcttttta 4140 acgccgtttt gttccacttc acccaccaag cgtgatgttc ttcgcggact ttcttccatc 4200 tacgacccac ttggattcat tacgcctctt tctattcctg cccgcattct tatgcaagaa 4260 atttggaagg aagaattaca atgggacgac cctttaccgg cttctttcgc tgatcgctgg 4320 cttggtttat gcagctccct ttctgacgca aaacccaagg tttcccgttc atacttcggt 4380 gataggtctc gcgtaaaatc tcttcatatt tttgtcgacg ccagtcagca agcatatggt 4440 gctgtcgcct atctccttga cggccaaaac tcttccttcg ttatttccaa ggctcgagtc 4500 gccccgttga aagggaacgg ccctcagttg actcttccgc agctagaact catggcagcc 4560 cttattggaa ctcgcatcgc gacgacgatc atcaccgcct tccaagtcct cgggatctct 4620 ctatcggtca ccatgtggtc tgatagccaa attgttttat attggttgtc aaaaaacgac 4680 aaacagaaga atctctttgt cgccaaccgt gtctccacga tcatcacatt caaccgtctt 4740 catcaagcca gttggcatta ctaccccact gcttcaaatc cagctgatct ggtaacgaga 4800 ggcctaacgc tccaccaact tcaatcgtct ccgatttgga caaggggccc actatggtta 4860 tcttccgaag atgaatggcc gcaatggagc gtctctagcc tcagtcaaat taaaattctt 4920 catcttgcag aaatttccac cgccgcagct ccggaacgac tccccctgcc actggatatt 4980 tcaacagtta tcgacattgc ccgtttcaac tggagttcgt tgaaaagaac aacggcatat 5040 gtttctcgtc aatttagcaa ttttcaacga ccgaaagctg aatggaatcg cgaatttctt 5100 tccacactgg aactcatcca agcggaacgc aaatggatcg tctcttttca atttcgtttt 5160 tttgccaacg agtacgagta tctacgcgga gaccgcaagg gacgtcgacc agcactgata 5220 tcccagttgg atctttattt ggatgctgat tcgatcattc gctgccgtgg ccgtttaaat 5280 aatgttgatt tatcaagcga tgctcgaaat cccatcctcc taccgaagaa taccgaactc 5340 acccgcctaa taattcaaca ttttcatgag cgcagtcttc attccggtgt ttcttacact 5400 atttctagca tccgccagag attttggatc ccatccatcc gccaacaggt gaaagccatc 5460 gtgcgcctct gcactcgctg ccgccgggtg aacggggctc cttacagagc ccccaatcca 5520 gctacattgc caagttttcg agtccgtgga gacaaggcct tcgccgttac cggcatcgat 5580 tttgctggcc catttccagt ccgtggtcca tctagccaac cagattcaaa ggcttacatt 5640 tgcctcttta cgtgcactac atctcgttcc atccacctgg agttagtaga agacctcacg 5700 gcagctagtt ttatactagc cttcagaagt ttcatctccc accactttac cccaacaatg 5760 gttctctcgg acaatgccac aaccttcgaa tgtgccgctc gtgctctcaa gtccattttc 5820 aacagttcgg aagttaccaa ttatctttcc gactgtcaaa tcgaatggcg ttttattccg 5880 aaacgtgccc cttggtatgg gggcttttgg gaacgacttg tcggcctgac caaggaagct 5940 ctcaagaaaa tgctcggtcg taccaaactc aaattcgctg agttccggac tatcgtcaca 6000 gagatcgagg ccatcttgaa cgatcgtccg atcacgtatg tgtcgtcaga tttgaacgac 6060 cctcaagctc ttacaccgtc tcatctgatg tatggtgatc gcttgactcc actcccatac 6120 aacccagccg ttgaagagga gctgcttgat cccaccttcg gcgagaagcc ttcccacctc 6180 cttgacatgt tcagccgaag acaacgaatc ctcaaagctt tctggtcccg ttggcaaaag 6240 gactacttga ccagcctccg ggaacgacac atcacatgca aggagaaatt acagcccagc 6300 atcaaggtcg gtgacgtggt tctagttcat aacgaaggac cccgaatcta ttggaaactg 6360 gctgtgatcg aaagcctcat catcagtccc gatggagaaa tccgagcagc caacatccgc 6420 accgcggaag gaaaaacaaa ccgcccggtt tcaaaactgt atccgctgga agtgactgcg 6480 ccaattgatc cttctccaag cgctcaagtt cttcccaatc ccgcctcttc tttaccaacg 6540 agacctcgtc ggaaagcggc catcgccgcc gaaaaacgga tccggaatat ggcagaagag 6600 tagaagaaag cttccgggcc gggagta 6627 // ID LOA-6_CQ repbase; DNA; INV; 2444 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2444 RA Kojima K.K. and Jurka J.; RT "LOA non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 153-153 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 1..2358 FT /product="LOA-6_CQ_1p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="KVAWAVHSFEPFKSPGLDGIFPALIQHGGKKLIEHLS FT NIFKASLTLGHIPQIWSKTRVVFIPKAGKKDKTIPKSFRPISLTSTMLKIM FT EKIVDEYIKSKFLKKRPLSRFQFAYQSNKSTITALHELVTRIGKSLEIKEI FT ALAAFLDIEGAFDNTSFNSIKNCMIKRGFDSCIVQWIMTMLDNRIVSAELG FT ISTLTVKTTRGCPQGGVLSPLLWSLVVDDLLNKLTDDGFEVIGFADDLVIL FT VRGKFDLTISERMQHALNLTSQWCRNEKLNVNAAKTVLVPFTRRRKLTLKT FT LKINGLDLQYSKHVKFLGVLLDXKLNWNIHLEQVVNKATNALWISRKTFGK FT HWGLKPKMIQWIYTAIIKPRITYAALVWWPKTKQTTTQKKLEKLQRLVCLS FT ITGAMRSTPTKALNASLHIPPLYQFVQMEAAKSALRLKKIQSKVESIEHLK FT IISFIKINQNVFMTSDWMESTLNLDEPFKVIETNRNEWESGGPRVLPGSVI FT FYTDGSKMGNKTGAGITGPGVSVSIPMGQWTTVFLAEIYAILECASICLRR FT NYRLAKICIFSDSQAALNALKSPTCQSKLVWECRKLLKQLASKNQVHLYWV FT PGHRGIDGNEKADFLARNGSEGHFIGPEPFCGVSKSVLKMEFVKHEEKTIQ FT SNWTKTQLLRQSREFITPSTQKSKKILNLNKKYLNIYIGLITGHCPSRYHL FT KKLGLSQIDICRFCDCETETSKHLICDCSALSARRRQILNRPILSPKEIWQ FT ENPSKVVDFILEIIPEWGIPQRQPMTATLNGNVSS" XX SQ Sequence 2444 BP; 863 A; 459 C; 478 G; 641 T; 3 other; aaagtagcgt gggcagtgca ttctttcgaa cctttcaaat ctcccggatt agacggtatt 60 tttccagctc ttatacaaca tggggggaaa aaacttattg aacacctttc aaatattttc 120 aaagcaagtc tcactttggg acacattcca caaatatgga gtaaaactag agtagttttc 180 attcctaaag ctggaaaaaa agataaaaca atcccaaaat cattcagacc aataagtctc 240 acatctacaa tgcttaaaat tatggaaaaa atcgttgatg aatatataaa atcaaaattc 300 ttaaaaaaac gtcctcttag caggtttcag tttgcttatc aatcaaacaa atcaacaatc 360 acagccttac atgagctggt aacacgaata ggaaaatcac tggagattaa agaaatagct 420 cttgcagcat ttctggatat mgaaggtgct tttgataaca cgtcattcaa ttctataaaa 480 aactgtatga ttaaaagagg ttttgattcc tgcatcgtgc agtggattat gacaatgtta 540 gataaccgaa ttgtctctgc agaattggga atatcaactt taactgtwaa aaccactagg 600 ggatgcccac agggaggtgt attatctcct ctgttgtggt cactggttgt agatgatctt 660 ctcaataaac tgacagatga tggatttgaa gttatcgggt tcgcagatga tttggttata 720 ctagtacgtg gaaagtttga tctcactatc tccgaacgta tgcagcatgc tcttaatctc 780 acttcccaat ggtgcagaaa tgaaaaactt aatgttaacg ccgcaaaaac agtcctggtt 840 ccgttcacgc gcagaaggaa actgactctt aaaaccttaa aaattaacgg cttagattta 900 caatactcaa aacacgtaaa attcttgggc gtgttgttgg acgamaaact gaattggaac 960 atccatttag agcaggtggt taacaaagcc acaaatgctt tatggatcag cagaaaaacg 1020 tttgggaaac attggggttt aaaaccaaaa atgatacaat ggatttacac agctattata 1080 aaacctagaa taacatatgc cgctctcgta tggtggccaa aaacaaaaca aacaacaact 1140 caaaaaaaat tagaaaaact acaacgctta gtttgtctct ctataaccgg ggcaatgcga 1200 agtacaccaa ccaaagcctt aaacgcatct cttcacattc ctcccttgta ccagtttgtg 1260 caaatggagg ctgcaaagag tgcgctgagg ttaaaaaaaa tacaaagcaa agtggaaagc 1320 attgagcatc taaagataat atcttttata aaaattaacc aaaatgtatt tatgactagt 1380 gactggatgg agagcacgtt gaacttagac gaaccattca aggtgattga aacaaatcgc 1440 aatgagtggg agtcaggagg gcctcgcgtt ttaccaggat ctgtcatatt ttacacagat 1500 ggttcaaaaa tgggaaataa gacaggtgca gggattacgg gtccaggagt tagtgtatcg 1560 attccaatgg gtcagtggac aacagtattt ttggcagaaa tttatgctat cttagaatgt 1620 gcatccattt gtcttagaag gaattacaga cttgcaaaaa tttgtatttt ctcagacagt 1680 caagctgcgt taaatgctct aaaatcacca acatgccagt ctaagctagt atgggaatgc 1740 aggaaacttt taaaacaatt agcatctaaa aatcaagtac atctgtactg ggttccgggc 1800 caccggggta tagatggaaa cgaaaaagcc gactttcttg ctagaaatgg gtcagaaggg 1860 cacttcattg gcccagaacc cttctgcggg gtctcaaaaa gcgtattaaa aatggagttt 1920 gtaaaacatg aggaaaagac gattcagtcg aactggacga aaacacaatt actgagacaa 1980 tctagggagt tcatcactcc ctctactcaa aaatcaaaaa aaatactcaa tctgaacaag 2040 aaatatctaa atatctatat cgggcttata acaggacact gtccgtctag atatcatttg 2100 aaaaagctag gtcttagtca aatagatatc tgtcggttct gtgactgtga aaccgaaaca 2160 tcaaaacatc taatatgcga ctgcagcgca ctatctgcaa ggcgaaggca gattcttaac 2220 aggcctattt taagccctaa agagatctgg caagaaaacc ccagtaaggt agttgatttc 2280 attttagaaa ttatacctga atggggcata ccgcaacgtc agccaatgac cgctacccta 2340 aatggtaacg tgtcatcctg aagtctgcga taataaaatg ggtataccgc aatagatcaa 2400 agtcatggtc gcagtggtct caacccaaca aaaaaaaaaa aaaa 2444 // ID L2B-6_AAe repbase; DNA; INV; 4215 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4215 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1411-1411 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >98% CC identity. CC Closely related to CR1-1_AG and L2B-1_CP. XX FH Key Location/Qualifiers FT CDS 22..1218 FT /product="L2B-6_AAe_1p" FT /translation="MKNAVYACDTCIALHEFDDTEEMQTKMNEVLHGVAKL FT QRVLDFVEGFDVRVRKIVREELVESGKIPVTPTCDPVRYNLRSTSKAKREN FT GRMENRNNPPVNGTVVNNNINYSFADVVKTSKPVVLSKDTPKTLNSKNEEN FT NNKAIHAVSPKKPNSRIVIKPKTGKNASETKKLLSKKVKPSNFRVKDIYTR FT KDGSILVDVQDHSTMLKLKETIEKELCDHCEVEVSDTLKPTLKIVGINEEM FT NEDELKTTLIENNEVMENVKHFKVKSIVAKDKENNDNFDAIIEVDAITFHK FT VLKQKKIMCGWERCQVVDALDVVQCYKCCGFNHKSVKCTARKEACPRCAGE FT HLIRECNSSEVKCINCERSKLNGDKDADSNHCAWSERCPLYMRMKERKRQM FT IDYSV" FT CDS 1222..4026 FT /product="L2B-6_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSSVQVLYANVAICKHIEEIRLLLKRRRPVVTILTET FT HLTGDHDINQLEVQGYSMINCPSTSRHTGGVTMFVRSGVKLKVISNESVAG FT NWFLAIDVSTGNFSGIYGAVYHSPNASDAAFLQHLEDSWLPNVFDDERTNL FT IIGDFNINWMKVGDRRELKNVMDSVSLRQIIQFPTRVSIRSSTMIDLAFTN FT DDRITAIELIDDKLSDHETIGLKFNITDQLRQSTKISVKSWRQYSKEKFCS FT KLQHKLRDFNLLDSTDESAALLGCSLESAVNEMIIDKEIEIYDDCNWYNGK FT LHRMKESRDRAYAKFRRTGDATDWRLYQRLRNKYVIEIRKTKAEYISKDLQ FT ECQGDSKKVWKVLKKMMKPIRNRDPGVEFENASDEIDVTVKSNKLNEFFVE FT SIKDIHNSIPDVPIVEGNVELPQIGVVEWNVFHEISMTNLEKVVNGLKVCG FT GVNNLSTKVFWDGFQTIKNVFLCVINKSLMRGEFPSSWKKSLVVPLPKVAQ FT SKRPEDRRPINMLPLYEKVLELVVKEQLCEFLENKNILINEQSGFRRNHSC FT ETALNLLLLKWKRAIESKKVILAVFIDLKRAFETIDRTKLLGVLFKIGVRG FT SVLKWFESYLTGRQQTTSYDGRESAAIDVELGVPQGSVLGPLLFIIYMNDI FT KEIMTNGEVNMFADDTVIFIIANDFVEAYNALEAELLRVSNWLKSKKLLLN FT VKKTKYLVITNKIKEGNIQPLHIDGQEIERVDNLKYLGVILDDKLSFNDHV FT DYTIRKAAQKLGVICRINSFLDVGSKIMLYKAIVASHFEYCPSVLFLANRR FT QKRRMQKIQNKSMRLILGCHRTTSSATMRECLQWMNVEERIIVRTLEFVFK FT IKNQLLPTYLTENIQYGTDIHNYNTRAASDLRLPNFKKTGTQNCLFFKGFQ FT LFNRLPTEVKQANSIYSFKRMCIRCVKDGLI" XX SQ Sequence 4215 BP; 1547 A; 625 C; 938 G; 1104 T; 1 other; gagctagacg tacttcggaa gatgaagaat gctgtatacg catgtgatac gtgtattgcg 60 ttgcacgagt ttgatgatac agaagagatg caaactaaaa tgaatgaagt gttgcatggt 120 gtagcgaaac tacagagagt actcgatttt gttgaaggct ttgatgttag ggtgcgcaaa 180 attgttagag aagaattagt cgaaagcggg aaaattcctg taactcctac ttgtgatcct 240 gtccggtata atttgcggtc gacaagtaag gctaagcggg aaaatggaag gatggaaaat 300 agaaataatc cgccagtaaa cggtactgtg gtgaataaca atattaatta ttcttttgct 360 gacgttgtga aaactagtaa gcctgtggta ttatcgaaag atactcccaa gacgttgaat 420 tccaaaaatg aagaaaataa taataaagct attcatgctg tatcaccaaa gaaaccaaac 480 tcaagaatag taatcaaacc gaaaactggt aaaaatgcga gtgaaacaaa aaagttgctt 540 agcaagaaag tcaagccatc aaactttagg gtcaaagata tttatacacg gaaagatggt 600 agtattcttg tagatgtaca ggatcactcc acgatgttga agttaaagga gacgatagaa 660 aaagaattgt gtgatcactg tgaagtcgaa gtgagcgata ctttaaaacc aacattgaag 720 atagtgggga tcaatgaaga aatgaacgaa gacgaactca aaacaacgtt gatcgaaaat 780 aatgaagtta tggaaaacgt gaagcatttc aaggtaaaaa gtattgttgc aaaagataaa 840 gaaaataatg ataactttga tgcaattatt gaggttgatg cgatcacttt tcacaaggtc 900 ctgaaacaaa agaaaattat gtgtgggtgg gagcgatgcc aagtggtaga cgccttagat 960 gttgttcaat gttacaaatg ctgtggattt aatcataaat ctgtaaaatg tacggcgaga 1020 aaagaagctt gtccacgttg tgctggtgaa catttgataa gagagtgcaa ttcttctgag 1080 gtgaagtgca ttaactgcga gagatcaaaa ttgaatggtg acaaagatgc ggattctaat 1140 cattgtgcgt ggagtgagcg ttgtccgctg tatatgagaa tgaaggagcg caaaaggcaa 1200 atgattgact atagtgtata gcaatcatcg gtacaggtgt tgtatgcgaa tgtagcgata 1260 tgcaaacaca ttgaagagat aaggttattg ttgaaaagac gaagacccgt agtgactata 1320 ttaacagaaa cgcatctgac aggtgatcat gatattaatc aattagaagt gcaaggatat 1380 tctatgataa attgtccgtc aacatcacgg cacactgggg gagtaactat gtttgtgaga 1440 tctggagtca aactaaaagt gatttcaaac gagtcagttg ccggtaattg gtttctcgcg 1500 attgatgtat cgactggaaa tttttctggg atttatggtg cagtatatca ctcaccgaat 1560 gcaagtgatg ccgcatttct acagcatctt gaagactcgt ggctgccgaa cgtatttgac 1620 gacgaaagaa caaacttgat aattggagat tttaatatta actggatgaa ggtaggtgac 1680 agacgtgagc taaaaaatgt aatggattct gtgtcattga gacaaattat acaattcccc 1740 actagagtaa gtattagaag tagtactatg attgacctgg cattcacgaa tgatgataga 1800 ataactgcta ttgaattgat agatgacaaa ttatctgacc atgaaacaat tggactcaaa 1860 ttcaacataa cggaccagct aagacagtcg acaaaaatta gcgttaaaag ctggagacaa 1920 tattccaagg aaaaattttg tagcaaactg caacacaaat tgcgtgattt caatttgctt 1980 gattcaactg acgaaagtgc agcacttctg ggatgttctt tggagtctgc agtgaatgaa 2040 atgataatag ataaagagat tgaaatatat gacgattgca attggtacaa tggaaagcta 2100 catagaatga aggaatcaag ggacagggca tatgccaaat ttagaaggac aggtgacgcc 2160 acagactgga gattgtatca aagattgaga aataagtacg ttatagaaat ccgtaaaacg 2220 aaagctgaat acataagcaa agacttacag gaatgtcaag gtgactcgaa aaaagtatgg 2280 aaagttttga aaaagatgat gaaacctatc agaaatcgtg atccaggagt agaatttgaa 2340 aacgccagcg atgaaatcga cgtaacagtg aaatcaaaca aattgaatga atttttcgtt 2400 gagagcatca aagatatcca taacagtatt ccagatgtac caatagttga aggaaacgtt 2460 gagttgcccc aaattggagt tgtagagtgg aacgtatttc atgaaataag catgaccaat 2520 ttggaaaaag tagtaaatgg gttaaaggtt tgcggaggtg ttaataatct aagcacaaaa 2580 gtcttttggg atggatttca aacaataaag aatgtatttt tatgtgttat aaataagtcg 2640 ttgatgcggg gagaatttcc ttcatcttgg aaaaaatctt tggttgtgcc tttaccaaaa 2700 gtggctcaat caaagagacc agaagatcgt aggcctatca acatgttgcc actatacgaa 2760 aaagtgctag agttagtcgt aaaagaacag ctatgcgagt tcctagagaa taaaaacatt 2820 ttgattaatg aacaatctgg cttcagaaga aatcactcgt gtgaaacagc actcaattta 2880 ttattattga agtggaaacg agcaattgaa agcaaaaaag tgatcctagc agtgtttata 2940 gacttaaagc gagcatttga aactattgat aggactaaat tgttaggggt attgttcaag 3000 attggcgtga gaggttcagt cttgaaatgg ttcgaaagct atttgaccgg cagacagcaa 3060 acaacctcgt atgatggaag agaatcagca gccattgatg tggagttggg agttcctcaa 3120 ggaagtgtat taggaccctt gttatttatt atttatatga atgatattaa ggagattatg 3180 acgaacggtg aagtaaacat gtttgctgat gacacagtaa tattcatcat cgctaatgat 3240 tttgttgagg cgtataatgc acttgaagca gaactgctca gagtaagcaa ttggctgaaa 3300 tctaaaaagc ttttgctcaa cgttaagaaa acaaaatatt tagtcataac taacaaaatt 3360 aaggaaggaa atattcaacc gctacatatc gatggacaag aaatagaaag agtggataac 3420 ttaaaatact taggagtaat tcttgacgat aagctttcat tcaatgatca cgtggactac 3480 acaataagaa aagctgcaca aaagctaggt gttatttgca gaattaattc atttctagat 3540 gtcggatcaa aaatcatgct atataaagca atcgtggcgt cacattttga atactgtccg 3600 tctgtgctgt tccttgccaa cagaagacaa aaacgtcgaa tgcagaaaat acagaacaag 3660 tctatgcgac ttatactagg gtgtcatcgt acaacttctt cagctactat gcgtgagtgt 3720 ctccaatgga tgaatgtcga ggagaggatc attgtaagaa cattagaatt cgtattcaaa 3780 attaagaatc aattactacc aacgtacttg acagaaaata ttcaatatgg aacggatata 3840 cataattata acacaagagc agcatcggat ctacgactac cgaactttaa gaaaactgga 3900 actcaaaact gtctcttttt caaaggattc caactcttca accgactacc taccgaagta 3960 aagcaggcaa acagcattta ttcattcaag agaatgtgta taagatgtgt caaagatgga 4020 ttgatatgaa tcatgatgat ggacatacgc gggagcaaga atgaaagtta aaaagaaaat 4080 gaagaaacat gaattaaaaa ccaaatatca ctatgataac tcgtccaata agcctcccaa 4140 ctacwaacga agcgtatggg gtggaggtgg gaccaaatac ggcccaatgg gaccatcaga 4200 aaaaaaaaaa aaaaa 4215 // ID BEL-188_AA-LTR repbase; DNA; INV; 494 BP. XX AC supercont1.88; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-188_AA_; KW BEL-188_AA-I; BEL-188_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-494 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.88; Positions 1280214 1280707. XX SQ Sequence 494 BP; 212 A; 90 C; 78 G; 114 T; 0 other; tgtggatgag cccaccttga gcactaccac ttctctggtc actaaaaggg attaccttcc 60 atacaaactg cgacctacga agcattgagt gaaggtcgca caaaacaaaa acaaaaacga 120 tagacagata acgggagtta aaacgaaacg tggtacaaca aattattctc tttaaaatct 180 cactaaatta cttaaattaa ctaattgaat tcgacgataa ttgtaagtac aaacacaatg 240 ttattaaaat taaaatgaaa tataaataat gaatacgtta acacacagat taaaagtcat 300 atgagcgttg cggtcagata gacaggcagg aaaacagaat taaaagagac tagaaacgta 360 agtaataaca gacaacattt ataaaattat aaaactaatc aaaataaaaa atccgcagct 420 taaagcattt cccagaaaaa cctggtgtgc taaaaaggcg tccgaaaagt cttgccatct 480 aatccaccgt aaca 494 // ID hATm-59_HM repbase; DNA; INV; 2390 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hATm-59_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2390 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 797-797 (2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 402..2171 FT /product="hATm-59_HM_1p" FT /translation="MSKNSVASNKNKKVWESNLVRFEKAKTSRRCTTVVKS FT ISEMNKLPTEQRIIDRFKSLSSNVKSRKERITIVAKETKLLWRKLNIPIQQ FT GKSAAERKIENVLKKLDKDSRKPGTQNFTRLFDITDEKGEWLCQEDKEFYC FT LQKSTNGRAGYCTTIKDAEGIHPRKLVMIKKQMSISQGQDFVESASEGDPS FT ESEEQCSSSDIEAEGPSSSFKKQKTNSAVNLVISAKISTKKAHKVCKTLSE FT SGISIYTPTQSGIYKGVMKEGEKLKVNYMENLKNEKWSLHFDGKHIKKMEY FT QVVVLKNENREVRLAVLELMNGKSETIYNGIKHVLDEYELWPAIKMIVSDT FT TSTNAGLRNGVVTRLQKHFKTIGLEKPVFIGCQHHILDTLLKHVMNDHFEG FT STMSPYLHYPFVTRLQQDYENLKLLFCNIGNPLTKVKRTRWRDDMDFLNHL FT IACYRKFKSTGNFPKVNFKSLPAISNARWNSRAIFILLAYILIPEYQESES FT TQAACDFICGSWGDIWFGGHYFNPADFVNLLEACNEHPKALKSLKRFWSTE FT PTPIPTQRSNICAERAVKVMQDLMPLCHSIEKLNIKYLLSNTQ" XX SQ Sequence 2390 BP; 884 A; 339 C; 435 G; 732 T; 0 other; ttaggggtat gactaaaaaa aaattttaga attttttatt ccccatgtaa ccatgagtgg 60 agaaattata tttataaaca taaaatctat tttttttatg taaattgtca aaaaagatca 120 cccctcaagc tgaaaataat ttttcctatc tttttagtga gttataaatt acttaataga 180 gacacaacac aatcttactc aatatacgac tgtgattaca aacatcataa attctacaga 240 tttctttacg agataactgc aatttttaat aactgcaatt ttcgatttca accctgttgg 300 accgtgtgga agtgcttcag cattttataa ttggtttttt gtatataacc ggaattgtct 360 tctttaagtt tgtcaaactg ttatcttgtg ttttgaacat tatgtccaaa aatagtgttg 420 cttcaaacaa gaataagaaa gtttgggaat caaatttagt aaggtttgaa aaagctaaaa 480 cttccagaag atgtactact gttgttaaaa gtattagtga aatgaataag ttgccaacag 540 aacagcgtat cattgatcga tttaaaagtt tgtcatcaaa tgtgaaaagt agaaaggaga 600 gaattacaat tgtggctaaa gaaactaaat tgctatggag aaagctaaat attcctattc 660 aacaagggaa atcggcagca gaaagaaaaa tagaaaatgt attgaaaaaa cttgacaaag 720 atagtcgaaa acccggaact caaaatttta cacgattgtt tgacataacc gatgagaaag 780 gtgaatggtt gtgtcaggaa gataaagaat tttattgtct tcaaaagtca acaaatggaa 840 gagctggata ttgtaccaca attaaagatg ctgaaggtat tcatccaaga aaattagtga 900 tgataaagaa acaaatgtcc atcagtcaag gacaagattt tgttgaatca gctagtgaag 960 gagatccttc agaaagcgaa gagcagtgtt ccagcagtga tattgaagct gaaggtccgt 1020 catcatcatt taaaaaacaa aaaacaaatt cagcagtaaa tttagtgatt tctgcaaaga 1080 ttagtaccaa aaaagcacat aaagtatgta aaactctctc agaaagtggt atttctattt 1140 atactccaac gcaaagtggt atctacaaag gtgttatgaa agaaggggaa aagttgaaag 1200 taaattatat ggaaaaccta aaaaatgaaa agtggtcttt gcactttgat ggaaaacaca 1260 taaagaaaat ggaatatcaa gtagttgttt tgaaaaatga aaatagagag gttcggcttg 1320 ctgttctgga gctgatgaat ggaaaaagtg aaacaatata taatgggata aagcatgtgc 1380 tagatgaata tgagctgtgg ccagccataa aaatgatcgt atctgacaca acatctacaa 1440 atgcaggatt aagaaatggt gtagttacac ggttacagaa acattttaaa accattggac 1500 tagaaaagcc tgtttttatt ggatgccaac accatattct agatactctt ttaaaacatg 1560 tcatgaatga tcattttgaa ggatcaacaa tgtcaccata tctccattac ccttttgtga 1620 caaggttgca gcaagattac gagaacttga agttattatt ttgtaatatt ggaaatcctt 1680 tgacgaaagt gaagcgaact cgctggagag atgacatgga tttcctgaat cacttaatag 1740 catgttatag aaagtttaaa agcacaggta actttccaaa agttaatttc aaatcacttc 1800 ctgctataag caatgcaaga tggaattcga gagccatttt tattctactt gcttacattt 1860 taataccaga ataccaagaa tcagaatcta ctcaagcagc atgcgacttc atctgtggtt 1920 catggggtga tatatggttt gggggtcact acttcaatcc tgcagatttt gttaatctat 1980 tagaagcatg caatgagcat cccaaagcat tgaaatcatt gaaaaggttt tggtccactg 2040 agccaacacc aataccaaca caacggtcaa atatatgtgc agaacgtgct gtgaaagtga 2100 tgcaagactt aatgccatta tgccatagca tagaaaaact aaatattaaa tacttattgt 2160 caaatacaca gtagacaaat tttaattata catgtttatt ggtttctgta aagtagtgga 2220 agatgtgttg caaaactaca tttttttaat ttttttataa atatttttta attggggggt 2280 gaactttttt gacatatagt aaaaatgctt gttatttttt tgatttttga acaaaaactc 2340 cactaaattt agtatgggga tatagggttg caaaattgtc atatccctaa 2390 // ID BEL-36_CQ-LTR repbase; DNA; INV; 564 BP. XX AC AAWU01012157; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-36_CQ_; KW BEL-36_CQ-I; BEL-36_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-564 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 226-226 (2011). XX DR GenBank; AAWU01012157; Positions 4725 5288. XX SQ Sequence 564 BP; 145 A; 122 C; 146 G; 151 T; 0 other; tgtttgatag taagctaatc atatttgggg cttcaaccct gcatattgta tcggtgggat 60 tttggtaatc gatttttgtg agatattgat ttacgcggga aaatgggctt gaaccgagag 120 ttgtactctc ttctcatgag catcagaacc cgaactataa aacatcgcaa gggcccaaag 180 ggcaacttta gggcttgtgg tctcagatcg cagctaggac aagttagtca gaaaaggtct 240 gctcaggaca acgggaaccg ggtcgccggg actccggcac gccggacact ttaactattg 300 taaccaacga taattgtatt ttaatataat gtgctaagtg taataaacgt tagttttcgt 360 atgtgcgaat gtttctaagt ggtgcgtttt attggacatt ggcggtggca aactagaagc 420 gttcctgtag cagttgaacg ctacgacccc cccccccccc cccacaacac tttgttacgg 480 ggcagcacaa gtccgtcggc caggaatttg gctgtgttga ggttacggag gaactggtct 540 gaatcgaagg tccgtagccc aaca 564 // ID REP_DE repbase; DNA; INV; 1416 BP. XX AC X97823; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 01-NOV-2007 (Rel. 12.12, Last updated, Version 3) XX DE D.etrusca highly repeated DNA. XX KW Polinton; DNA transposon; Transposable Element; Nonautonomous; KW Repetitive sequence; REP_DE. XX NM REP_DE. XX OS Dugesia etrusca OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Dugesia. XX RN [1] RA Batistoni R., Filippi C., Salvetti A., Cardelli M. and Deri P.; RT "Highly repeated DNA elements in planarians of the dugesia RT gonocephala group (platyhelminthes tricladida)."; RL Unpublished. XX RN [2] RP 1-1416 RA Jurka J.; RT "REP_DE: classified as a fragment of Polinton transposon."; RL Direct Submission to RR (01-NOV-2007). XX DR Genbank; X97823; Positions 1 1416. XX SQ Sequence 1416 BP; 527 A; 198 C; 189 G; 502 T; 0 other; aagcttttct gagttctgcc aataatatat gtattgaatg gtctcgcttg aaatcttgac 60 aacattgaca tccgttaatg caagatactc tcaatgtttc attacatgca atgcaattcg 120 gaacacaaat atctcttaat ccttgaacca tatcaaagtg catttcgaga agttcttcta 180 tattgatccc agtgaatcga tgtattttaa aaacaagtaa ctctggcatt gcacaataat 240 ccaatgtatt tttaactctt ggataaagtt cttgaatctt ctttagaatt atttcaaata 300 gataaactaa ataataataa taataataat aataataata ataaataaaa ataattcacc 360 tttgacatat tttaactaga taatgaaaaa cagaacaaag ggtttaaaca aaatatttat 420 taattgatta acatttttat ttctgtaata atatcttcta catatttatt atattccaag 480 gattttatac caagaattac tactttccca cttgaaaata acaattgaca cataatggat 540 tatatttctg tagacgcaat gctggaaata tttcagcttc aaatatacat tcgactttgt 600 tttgtaaagc atataaattt ataaattttc caatatcaat tgttactgtg attgattgaa 660 tttgcaaatc ctttatcttg actttcaaat ctccttcttt aagaggttgc ttgcatcccc 720 attacacgac acttttccgg taccgtaaaa acgataatcg gatactttcc tgttcgatca 780 acaacttgcg taggttttct gatagggaat ttcatatcag gtactaaata tcgtccttta 840 taatttgata ttgctcaaca ttttgaatta attttattca agtaaaacag ttaatttaaa 900 aaaatataaa caattcaaat aaaattaatt aattaaatta actaaattaa aatggaatta 960 agaaatgatg ctgtatatca aattgtagga ccaagtggta gtgggaaaac ttatttgtgt 1020 gtaaattatt acaatcaaca tgtttaaatc taaattcaat aaaatatatt gggcatagag 1080 gagtgcacga tgagagtggt gacactcaaa atcaattctg taaatttaaa aatgacaata 1140 attaaaggat ttgataaaaa ttggtcatct agattgctgc aaggggatgt aataattatt 1200 gatgatttgt atcaggaagc aaataaagga aaaaggattt taataattta tttacaaaaa 1260 tttcgcagac atttaggagt aacagttatt ttcattactc aaaatctttt tcatcaaggt 1320 ggaggacata gaactcgtaa tttaaatgtt caatatctcg ttatttttaa aaaccccgtg 1380 atgctacaat tattgatttt cttgccagac aagctt 1416 // ID BEL-78_AA-LTR repbase; DNA; INV; 427 BP. XX AC supercont1.278; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-78_AA_; KW BEL-78_AA-I; BEL-78_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-427 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.278; Positions 451126 450700. XX SQ Sequence 427 BP; 138 A; 84 C; 80 G; 125 T; 0 other; tgttacaaac accagcaaag gacggcgtac ctactttttg agcgacgttg cggtcactga 60 cggcgacgaa caatatccat tgtatacaaa agaatcaatc cgtaatagag atcgcatggt 120 gataaacgta gattaatgtg ctatgcaagt gaattatata tgaatttgat agttaataac 180 agcattatat tcataaacta gtgtgattat attgatcgac gtaaatttag gttagcttct 240 ataaacaagt ttgtaatccc caaatactca cgaataccgc tgtaatacgt agtacttgcc 300 gatttctgag atccaacttc agttgcccat cgagcttgcc aaatccaggg attggaaact 360 aattattttg agctgattcc ataaaacgcg ctgcttcaaa acttagtatc cgctctgtgt 420 ccgaaca 427 // ID BEL1_LTR_Dpse repbase; DNA; INV; 300 BP. XX AC Unknown_singleton_86; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_Dpse; KW BEL1_I_Dpse; BEL1_LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-300 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1009-1009 (2009). XX DR Genome; Unknown_singleton_86; Positions 13062 13361. XX SQ Sequence 300 BP; 100 A; 76 C; 63 G; 61 T; 0 other; tgatatgggc acggcaacta caaaacatcg aagaattaag ggttccacga tatctcggtc 60 taacgaaaga aacatctaat atccaattac acatctttac agacgcgagt ataaacgcct 120 ttgcagccgt cgcctacata aggatcgaac aaagaggaga agtcagatgc acactactcg 180 catccaagac aagagtggca ccattgaaac caatttcgat tcccaggatg gagcttatgg 240 cagcgatcct cggcctacga ctagctaaac tcataacaca ggagctctcc gtcgaggtca 300 // ID BEL-5_DWil-LTR repbase; DNA; INV; 448 BP. XX AC scaffold_181039; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_DWil_; KW BEL-5_DWil-I; BEL-5_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-448 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181039; Positions 409833 409386. XX SQ Sequence 448 BP; 132 A; 116 C; 87 G; 113 T; 0 other; tgttgccgcc gaaattcgct aactgcccag cagacccgac gcagccacca atgggttaac 60 ccatatcact cattctcttt cattttcata cttattctta agcaccagcc tatgtgctgg 120 gaaattaacc gttgcatctt gcgctttaag cttacatgtt aggttctatg cttgttacta 180 agcggcggac agaacagttt ggttacaacc gcgagtaagg gcgattaatt tatagcggct 240 cgcttacaat catagcagct ctcaaaaacg gctctcgctt acaatcatag cggctcgcat 300 aataacgact cgcatacggc acacacacca tacacacccc attagcataa gaaatacacg 360 tggaagaaat aaatgaagta tcacgtgaaa agtggagtgt tttcttgtgg gcccgctaaa 420 acatcgaacc ttagcgccaa actcaaca 448 // ID SMAR30 repbase; DNA; INV; 1637 BP. XX AC . XX DT 22-JAN-2008 (Rel. 13.01, Created) DT 22-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR30. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1637 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(1), 18-18 (2008). XX DR [1] (Consensus) XX CC The youngest copies are >96% identical to consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 232..1434 FT /product="SMAR30_1p" FT /translation="MKIDENISLDKSENQLNNTTSNEQNFANQQIRSIFQQ FT QAELDLPNPLTAETNSDTAVIQIRRYNRSAYNTTSLDKKKIFIKCYEEGKT FT IVEAAEISGINKYTAASLLKRYKNDGCLILQKKRGGKRTAKITSEIMSAIE FT DIVEQNPAITLKSISKKIMDEKHINLTTTSINNALKRLRITLKTATLNLDR FT LNSPSTILQRKEYALNFSLNAPIIREKIVFIDESGFNCHLRRTKARSKINT FT GAHVIIPTLRGRNVSLIAAMNLHGIVHKQIIANSTVNANVFINFLDGLFQT FT LREANITCAWLVLDNCTIHKTQEVRDKVGTTSHTLVYLPPYSPMLNPIEKV FT FSKTKLCARNLLAESRTDLNLENVIQESLATVTQSDCNNYYVDMTMKLPLA FT TGGQPLH" XX SQ Sequence 1637 BP; 621 A; 243 C; 251 G; 522 T; 0 other; ttttaaactt caaattcact tttttaaatt tcaaattaat tgaaaagcaa aatcaaaata 60 attaaaatgt tgttttaatt taatttaatt gaaattaaaa ttaaaattat attcaattcg 120 aacttcaagt tattaaacgt caaaattttc cttgcaatac agagagatgt aattaatagt 180 agttataatt cgatatttaa cctataaaac agaaccccaa ttaagatata catgaaaatt 240 gatgagaaca tttcacttga caaatcggaa aatcaactta acaacactac ctctaatgaa 300 cagaactttg cgaatcaaca gattcgttct atatttcaac aacaggcaga acttgatttg 360 cctaatcctc taacagcaga aacaaattcg gatacggctg ttattcaaat taggcgatat 420 aataggtcgg catataatac cactagtttg gacaagaaaa agatttttat aaaatgttac 480 gaggaaggga aaactattgt tgaagcggca gaaatatcag gtataaataa atatacggca 540 gcttcattgt tgaaacggta taaaaatgat gggtgtttaa ttcttcaaaa aaaacgtggt 600 ggaaaaagaa ctgctaaaat aaccagtgaa ataatgtctg caatagaaga tattgttgag 660 caaaatccgg ctatcacttt aaaatccatc agcaaaaaaa taatggacga gaaacatatc 720 aatctgacaa caacttctat aaataatgca ttaaaaagat taaggatcac attgaagaca 780 gcgacactga atcttgatcg tttaaattcc ccctccacga tcttgcaaag aaaagaatat 840 gcattaaatt tttcactcaa cgcacctata ataagagaaa aaatcgtctt tattgacgag 900 tctggtttta attgccatct tcgtagaacc aaagctcgat caaaaataaa tacaggagct 960 cacgtaatta ttccaactct gcgagggcga aatgtctccc ttattgcagc tatgaattta 1020 catggcattg tacacaagca aataattgca aactcaacag taaatgcgaa tgtttttatt 1080 aattttctgg atggattatt tcaaacatta agagaagcaa acattacgtg tgcatggctt 1140 gttttggata attgcactat tcacaaaact caagaagtga gggataaggt tgggacaaca 1200 tctcatactc ttgtttatct cccaccttat tcgccaatgc tgaacccaat tgaaaaggtt 1260 ttttctaaaa ccaaattatg tgcgcgaaat ttgttggctg aatcaagaac cgatttaaat 1320 ttggagaatg tgattcaaga atctttagca actgttaccc aatccgattg taataattat 1380 tatgttgaca tgactatgaa attgccattg gctacaggtg gccaaccatt acattaaatt 1440 aaaagcgctt attatttttg gtaaatagtt gatagaatgt atttgttgtt tagaaattca 1500 cagtcatgtt tattaatata aggttgattt tcgaaattta atttaatttt taactaaaga 1560 agctatttaa ttattttgat tttgattttt aattaatttg aatttgaaat ttaaaaaagt 1620 gaatttgaag tttaaaa 1637 // ID Waldo-6_AAe repbase; DNA; INV; 5611 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Waldo non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; Waldo-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5611 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1463-1463 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. Both sides are (AC)n microsatellites. XX FH Key Location/Qualifiers FT CDS 578..2083 FT /product="Waldo-6_AAe_1p" FT /translation="MASLLANITMGSRRRLLTTGEDSGLEGGPTKMENFQN FT NAETMEFNAAGVENASAFQKSGKVPRSPVHTTENIPTASSSSKQCETQGPQ FT GGLGIQSPVFTPKSSNNTQGGQLPENLRLTEVKNKVDELIQFVKDRHNVHT FT VIKSKLTSIKSAVNAAMKEQEALIMRAEFAEKSLKKVTEQAAADKLRTPKV FT RSDKPTKKRDRESPGEEEDPKKQRNDQGKNVPKKDKNGEGWQTVVNAKDKK FT RKQNEKVEKKKDGKKKAKRRPQWSKGDAILVKANDQITYADILRKVKDDPN FT LKDLGENVIRTRRTQKGEMLFELKSDPAIKSSAYQELVAKSLANEADVRAL FT TQEAVVECRYLDEITTVDEVSEELRKQCNLGEEAMAIRLRKSYDGTLSATI FT RLPVDTANKLVEKGKIRIGWSICPLRLVTRADRPPMRCFKCMDFGHPAASC FT KGPDRTELCRKCGDRGHFGKDCKNRPRCLLCSSEEGNTHSTGSFNCRAYQK FT AIAAQQ" FT CDS 2041..5088 FT /product="Waldo-6_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="LQLPRVPKGDRSPTVTEVTQINLNHCDTAQQLFWQST FT TETKCDVAIIAEPYRVPLNNANWVADSTGAAAIQVTGRFPIQEVVASSNAG FT FVIAKINGIYFCSCYAPPRWTLEEFGHMLEELTDELVGRKPVLIGGDFNAW FT AVEWGSRVTNARGYNLLEALANQDVVLCNEGTVSTFRKDGRESIIDLTFCS FT PSLAGNMNWRVSEEYTHSDHLAIRYSIGRRNTAAVQRNTIGDRQWKTKAFD FT KDLFVEALRADSDALNLDAVGLTTAMARACDTTMPRKREPRNDRRPAYWWN FT ETLNELRAACLKARRRYQRATNVEIKEERKVVFQAARAAFKREITLSKSEC FT YKQLCREADANPWGDAYRVVMAKVKGPSIPVETCAEKLRVIVEGLFPTHDP FT TPWPPTPYVDEEEAITEDRQVSNNELVAAAKALKVKKAPGPDGIPNVALKA FT AIPAYPDMFRKVMQKCLDEGLFPDIWKIQKLVLLPKPGKPPGDPSSYRPIC FT LLDTLGKLLERVILNRVAKCTESANGLSERQFGFRKGKSTVDAIRTVLERA FT EKASKQKRRGNRYCAVVTIDVKNAFNSASWEAIATALHRMRVPDYLCRILQ FT SYFENRVLVYATDAGRKELRVTAGVPQGSILGPTLWNGMYDGVLSLELPMG FT VEIVGFADDVVLTVIGETLEEVEVLTTEAMGTVENWMNAVKLKIAHHKTEV FT LLVSNRKAVQHAEITVGGHVIASQRSLRHLGVMIDDRLNFNSHVDYACEKA FT ARATNALTRIMPNAHGPRSSRRRLLASVLSSILRYGVPAWGAALQTKRNRD FT KLNSTFRIMAMRVASAYRTISSEAVFVITGMTPICITLGEDIECYQRRGTS FT QVRKLARIGSLGKWQQEWDASEKGRWTHRLIPNVSTWVNRKHGEVNFHLTQ FT FLSGHGCFKQYLHRFGHAPSPFCPVCSEREETPEHVVFDCPRFNMERSAAV FT AAFGQDFNADMIVSKMIGDANLWNLVEKMVVEITSILQRNWREEQRSERQR FT AVPGIGSSGR" XX SQ Sequence 5611 BP; 1622 A; 1320 C; 1654 G; 1013 T; 2 other; gcggtgggag ctagggactt gttccctagc gccaatgaaa aaccaccagt tggcttgacc 60 gggccctggt tccggtttac cggtttccag tcagcccaag gggaggttgt ctagggcgtg 120 gcgggggttc aatgcagggc tctgttgaac tcctacaaaa acccacaggt ccgcaatcac 180 acctccgtac aagcgacccg taccgctttc aaagtactct agcccatcac caggagcgcc 240 acccggcgca ttaggatcca ggccagcaag atcctgatta tggaagactg gcagcataag 300 aacatggaca cgactatgga ccacgcatag gaactacgct acagagctga cagtcgtact 360 attctacacc cttagtaagg atggatgaca cctccctgaa acggcgagtg ggttaatggt 420 acggttgccc ccgtcccgtt aaaaccttgg caggcctcca agtacgttcc aaacccgcca 480 ccttagctgg tggaagtagg ctcagttgaa cctttcctga cttgcgccga tctggctctg 540 aacttggcac cccataaggg gccgggtcaa ccctagcatg gcgtcactgc tggcaaacat 600 caccatggga tcacgaaggc gactacttac gacgggcgag gatagcggct tagagggggg 660 acccactaaa atggagaatt tccaaaacaa tgccgaaaca atggagttta atgcagcagg 720 agttgagaac gctagcgcgt tccagaaaag cggtaaggtg ccacgatcgc cggtgcacac 780 cacggagaat ataccgacgg ctagcagtag ctcgaagcaa tgtgaaacac agggaccaca 840 aggaggatta gggattcaaa gtccagtgtt cacaccgaaa tcgagcaata acactcaagg 900 aggccaacta cctgaaaact tgaggttgac tgaggtgaag aataaggtcg atgaactcat 960 ccagttcgtt aaggaccggc acaatgtgca tactgtgatc aagagcaagc tgacgagcat 1020 caaatccgct gtcaacgctg ccatgaagga gcaggaagcg ctgatcatga gggcagagtt 1080 tgccgaaaaa tcactgaaga aggttacgga gcaagcagcg gctgacaagc ttaggacccc 1140 aaaggtacgt agtgacaagc ctacgaagaa gcgagataga gagtcaccag gagaagagga 1200 agacccaaaa aagcagcgca acgatcaagg aaaaaatgtg ccgaaaaaag acaagaatgg 1260 tgaaggatgg caaaccgtcg taaacgccaa agataaaaaa aggaaacaga atgagaaggt 1320 ggagaagaaa aaggacggga agaagaaagc gaagcgacgt ccgcagtggt cgaaagggga 1380 tgccatactt gttaaggcaa acgaccaaat aacgtacgca gatatccttc ggaaagttaa 1440 ggacgatccg aacttgaagg acctcggaga aaacgtgatt aggacgcggc gcacccagaa 1500 aggggaaatg ctgtttgagc tgaagagtga cccagcgatc aagagctcgg cctatcagga 1560 gctagtcgcg aaatctttgg cgaacgaggc ggacgttaga gcacttactc aggaagcagt 1620 tgttgagtgc agatacctgg acgagattac aactgtggac gaagtgagtg aggagctgcg 1680 taagcaatgc aaccttggtg aggaggccat ggcaatccgt ctgaggaagt cgtacgacgg 1740 cacactgtca gcgacaattc ggctaccagt cgacacggcg aacaaactgg tggagaaagg 1800 caagataaga attggctggt cgatatgccc actgagactt gtcacgcgag cagataggcc 1860 accgatgcgt tgcttcaagt gcatggactt cggacacccg gcggcgagct gtaaaggccc 1920 agacagaacc gaactgtgca ggaaatgtgg agatagaggt cactttggga aagactgcaa 1980 gaatagacca aggtgcttgc tctgctcatc ggaggaagga aacacccatt cgacgggtag 2040 cttcaattgc cgcgcgtacc aaaaggcgat cgcagcccaa cagtaacgga ggtaacgcag 2100 atcaacctga atcactgcga caccgcacag caactgtttt ggcagtcgac aacagaaact 2160 aagtgcgacg ttgcgataat agctgaaccg tatcgagtac cactcaataa cgccaattgg 2220 gtggcggata gtacaggcgc agcggcgata caagtgacgg gtaggtttcc tatccaggaa 2280 gtggttgcca gctcaaacgc ggggttcgtg atcgccaaaa tcaacggaat ctacttttgc 2340 agctgctacg cgcctccaag atggacgctc gaagaatttg gtcatatgtt ggaggagttg 2400 accgacgagc tagttggtcg aaaaccagtg ctgattggag gtgacttcaa tgcatgggcc 2460 gtggagtggg gtagcagggt aaccaatgcc agaggttaca acctactgga ggctctagca 2520 aaccaagacg tagtattgtg caacgaaggc accgttagca catttcggaa agacggacgg 2580 gagtcgatca ttgacctaac attttgtagc ccatcactgg cgggcaacat gaactggagg 2640 gtaagcgaag agtacaccca tagcgatcac ctggcgatac gctacagtat tggccggcgg 2700 aacactgcgg cagtgcagag gaacacaatt ggcgaccggc agtggaagac taaggctttc 2760 gacaaagatc tctttgttga ggcacttcgc gccgacagcg atgccctgaa cttggatgcg 2820 gtagggctga caacagcgat ggcaagggca tgtgatacaa cgatgccacg gaaacgggag 2880 ccccgcaacg atcgacgtcc cgcgtactgg tggaacgaga cactcaacga actccgagct 2940 gcttgcctta aagccaggag gcgttaccag agagcaacga acgtagaaat caaagaggag 3000 cgaaaggtcg ttttccaagc agccagagct gctttcaaac gcgagataac actgagcaag 3060 tctgagtgct acaagcagct gtgccgagaa gctgacgcca acccctgggg ggacgcttat 3120 cgagtggtca tggcaaaagt caagggtcca tcaattccgg ttgaaacgtg tgctgaaaag 3180 ctgagggtta ttgtcgaggg tctcttccca acgcatgatc cgacaccgtg gccacccacg 3240 ccgtatgtcg acgaggaaga agcaatcact gaagatcgtc aggtttccaa taacgagctt 3300 gtagcagcgg cgaaagcatt gaaggtgaaa aaggcccccg gaccggatgg aataccgaat 3360 gtggcactaa aagcggcgat tccagcgtac ccggatatgt tccggaaagt gatgcagaaa 3420 tgcctggacg aaggtctttt cccagatata tggaagatcc agaagctggt gctgctaccg 3480 aagccaggaa aaccacccgg tgatccttca tcatacaggc ctatatgctt gctggataca 3540 ctgggcaaac tcttggaacg ggtcatccta aacagggtgg ctaaatgtac ggagagcgcg 3600 aacggactat cagaaaggca gttcggattc cggaagggaa aatcgacggt agacgctatt 3660 cggacggtcc tggagagggc cgagaaggca tcgaagcaaa agcgaagagg gaatcgttac 3720 tgcgccgtag ttacgataga cgttaagaac gcgttcaata gtgccagttg ggaggccatc 3780 gccacagcgc tgcatagaat gcgggttcct gactatctgt gccgaattct acagagttac 3840 ttcgagaatc gggtgttggt gtatgcaacc gacgccggcc gaaaggagtt aagagtaacg 3900 gcgggagttc ckcaagggtc catactgggt ccgacgctgt ggaatgggat gtacgacggg 3960 gtcctatcat tggagttacc catgggcgtc gagatcgtcg gcttcgctga tgacgtcgtc 4020 ctaactgtaa taggcgaaac gctggaagaa gtggaagtgc taacmacgga ggcaatgggt 4080 acggttgaga actggatgaa tgcagtcaag ctgaaaatag cccaccacaa aacggaggtg 4140 ctactagtca gcaatcgcaa agcggttcaa cacgctgaga ttaccgtcgg gggacatgtc 4200 atagcgtcac agcggtcact cagacacctg ggcgtgatga tagacgatcg gctaaatttc 4260 aacagccacg tcgactatgc atgtgagaag gcggcaaggg cgaccaacgc actcacaagg 4320 atcatgccga acgctcatgg tccgagaagc agtaggaggc gtcttctggc tagcgtatta 4380 tcgtcgatac tgcgatacgg agttccggcc tggggtgcag cactgcaaac caagcgcaat 4440 cgggacaagc tcaacagcac gttccggatc atggctatga gagtagcaag cgcatacaga 4500 acaatatcgt cggaggcggt atttgtgatc accgggatga ctccgatttg catcaccctg 4560 ggagaagaca tcgagtgtta tcagcggaga ggcactagtc aggtgaggaa attggcgagg 4620 atcggctcgc tgggcaagtg gcagcaagaa tgggacgcct ctgagaaagg tagatggacc 4680 cacagactca tcccgaatgt gtcgacctgg gtaaacagga agcatggcga agtaaacttc 4740 cacctgacac agttcctgtc aggtcacggt tgcttcaagc agtatttaca tcggttcggc 4800 cacgcgccgt caccgttctg tcctgtgtgt agtgaaagag aggaaacgcc ggaacatgtt 4860 gttttcgact gcccgaggtt caacatggaa cgaagcgcgg cggtggctgc ttttggacag 4920 gacttcaatg cagacatgat tgtaagcaaa atgataggcg atgctaacct ttggaatctg 4980 gtggaaaaaa tggtggtgga gataacatct atcttgcaga gaaactggcg ggaagagcag 5040 cgaagcgaga ggcagagagc agtgcctggc atcgggtcgt cggggcgcta acgaaccgga 5100 agtcaacccc caaccggaat cgtttgaccg acctcggcac tcgagtcagc ctgacgaaga 5160 aagaaggcga agatctcccc tgcgatagcc gccggtagtc ggggcaccat cagtgcggac 5220 gtctccccaa ccggtactgg tgacaacctc cgacgccggg ggaagtcagg aggagaagga 5280 agcgagaagg gatgaacgat ggaaggagac agaaaagcgg aagaaggtgg aacggcagtc 5340 tgctcaacat gcaagagcaa accactggca gactgcgcag agtgcaagag cacagagcgg 5400 gaaaacgtca agaacgtaag agaagtgcag atgcacagcc ccccccgacg aagtcgcatc 5460 tagtggaatc cggggggatc caggctaccg ccgaaagtga ggactaggtt ttagtggata 5520 ggcacgcaag tgaatcccac acagtgccaa taattaacag gccaggtttt tgaaactttt 5580 ttgatacccc actataataa aaaaaaaaaa a 5611 // ID CR1-94_AAe repbase; DNA; INV; 5634 BP. XX AC . XX DT 11-APR-2011 (Rel. 16.04, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-94_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5634 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1182-1182 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >91% identity. CC The ORF1 protein of this family, combined with that of CR1_Ele20, CC shows high similarity to those of L2B elements, such as CC L2B-1_AAe. It indicates ORF1 shuffling between CR1 and L2B. XX FH Key Location/Qualifiers FT CDS 704..1573 FT /product="CR1-94_AAe_1p" FT /translation="MDNLENRVEEDVTPKSTFAEVLKKRAEKRVMEQPKKV FT QPIQKPNPVVVVKPKAGVQVEDVRAELRKKVDARQLNVEKVMSGKSGEVAI FT ALKDEQSAKLLKENVEKNMGGLYDVNVRESIKPTIKLIGMSEEMNEQELKE FT TLVDQNIAFENLKHFKLCKLYCNEKLRFNNVSAIIELDAETFRKAVLEEKL FT NCGWDRCRVKDGLRVTRCYNCCAFNHKSKDCKAATPKCAVCSGSHLVSECQ FT SNVKECANCKKMNTDRKLRLDTNHAAWSDSCPVYQKQLEQRKSFVDYSV" FT CDS 2326..5535 FT /product="CR1-94_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MTCRLPDHDDPLPGSEEVSRNPFGESCRSAGCSVAVI FT PALSMLVGDTGLQDEGNCTGIIEAVDTPAPSGNSMDSICPAEARLDAAPLL FT NVNAFQAAHLLPLSNGSISSHADAPEESLRIYYQNVRSLRSKADDILTATC FT EAEHDVIVLTETWLNDQFHSAQFFGNHYTVYRKDRCAVSTGKSRGGGVLIA FT VSNRLISDVAPIVTDDLEHLWITLQCSNRRLWIGVAYLAPDISSDVSTIDR FT HLSAVNTVTSAMRYGDVHVLLGDYNQPHLCWKRAPTGVVILDAEKSTFNTA FT SSKFVDGMSFNNMRQLNTVTNVRDRTLDLLFINEDASTHCSVFEELDSYVS FT ADLYHPALVAVLKISLPCKFVDAAEVNRYNFNKADIASLNIAYENIDWTQI FT ESMNDIDLAVSFFNQTVTSMFEQYIPVVQPRCKPPWSNAYLRRLKRARASA FT LKAYSRERNDYTKHCFVSASNRYRRHNRFLYSKYVERKQSELKQNPKSFWK FT FINEKRSEHGLPTNMFLGETTANSTEAKCNLFADHFSSVFNPDAATDQQIE FT TALRDVPEHIISLSAVEFTDNDVFEALCGMKSSSVPGPDGIPSVMLKKCAG FT ALCQPLRMMINLSLRSGRFPECWKRSYMFPVFKKGDKKDIVNYRGITSLCA FT GSKLFEILMGEVLFRKMQPYIATEQHGFFPGRSTSTNLLEFTSLCMTGMDG FT GAQIDTVYTDLKAAFDRIDHRILLAKLDRLGVSYDIVLWLRSYLCNRKLSV FT KIGDTESFMFCNTSGVPQGSNLGPLLFAIFFNDVCCLLPQGCKLVYADDLK FT LYLHIRSTGDCLELQRLIDLFTEWCNRNLLIVSVSKCYVISFSRKKNDIEY FT PYCIDNQLLTRVSEIKDLGVILDKKLTFHSHYLQTIAKANKNLGFIMRVGY FT EFRDPYCLRALYYSLVRSVLDSNSIVWSPHSNNWITRIESIQSKFVRFALR FT FLPWANRYELPPYEDRCQLLRMEPLYIRRNHQKALFVAKLLTSKIDSPKIL FT GTIGIYAIERPLRTRGFLYLPSRRTLYGLNEPIRSMCNVFNSVYYLFDFDS FT SIEIFKNRLRNAR" XX SQ Sequence 5634 BP; 1595 A; 1133 C; 1299 G; 1606 T; 1 other; ttttttttac tgtgccgacg gtgtgattag atcgtttact tcgtctgatc accaaaatta 60 tgttttatgt gcgtaaatct cgcgtaatag tatccaaagt ttgtgcttgt gttgtgggaa 120 cattgtgctc ataaaattaa aaagcaatca aggtgatacg ttaagtggaa accccgaaat 180 tcagtgattc atattagcat attactgaag ctagtttgta acctgtatcg tttgttttcg 240 cccgggcgac atgcccgtgg agatggatgg tgaggagagt gagcacggct gctgtagcca 300 accgtgcgcg cagtgcgctt tgccgatcag agatagtgac gmcagtgtcg aatgcttcgg 360 tcgttgtaaa ctgtctatac atatccactg tttacccgat gcgacgaacg aggagataat 420 actgttgcgg aagatcagaa atgctgtgtt cgtatgcgat acgtgtttga gtcttgccga 480 attcgacgat aggcacagtg aaaaacgatt ggacgaaatt gctaagaaat tggaggacct 540 cgcgggtgta gtgggaaatc gtgaaaaatt tcgatcaaac tgtgaagaga gtcgtgtgcg 600 aagaactagc acgagcgaat aagcgtgatg cagctgctct tagtgaaaaa gacgatgtat 660 cccaccgtag gtggttacac gctcgtctgc taaacggaga aaaatggaca atttggagaa 720 tcgcgttgag gaagatgtca cgccaaaatc aacttttgcg gaagtgttga aaaagcgtgc 780 tgaaaaaaga gtgatggaac aaccaaagaa agttcagccg attcagaaac caaatccagt 840 ggtcgttgtg aaacctaagg caggtgtgca agttgaagat gtacgagctg agctgcgaaa 900 gaaggtggat gctagacagc taaacgttga aaaggtgatg agtggaaaat ccggcgaagt 960 tgcgattgcg ttgaaagatg agcaaagcgc gaaattgtta aaggaaaacg ttgagaaaaa 1020 tatgggtggc ttgtacgatg tgaacgttag agaaagcatt aaaccaacta tcaaattgat 1080 cggtatgagc gaagaaatga atgaacagga actcaaagaa accttggtcg atcaaaacat 1140 agctttcgag aatctcaagc acttcaaact gtgcaagtta tactgcaacg aaaaactgcg 1200 gttcaataat gtcagcgcaa tcattgagct cgatgctgaa acgttccgaa aagccgtgct 1260 tgaagaaaaa ttgaactgtg gatgggatag atgtcgagtg aaagacggct tgcgagtgac 1320 gcggtgttac aattgttgtg ctttcaacca taagagtaaa gattgtaaag cagcaactcc 1380 gaagtgcgct gtgtgtagtg gaagtcatct ggtgagcgaa tgtcaatcga acgttaaaga 1440 atgtgctaat tgtaagaaaa tgaacactga tcgcaaactg agacttgaca caaatcacgc 1500 tgcgtggagc gattcatgcc cggtgtacca gaagcaactt gaacagcgta agagtttcgt 1560 cgactattcg gtatagcaac tattgccctg tcacgaagaa ctcgaaaaca gtgagcacag 1620 ctgtgatgga aatatagaag cagtcgaaac tcctgctttc tcaggtaatt tgattcagag 1680 tatatggcct gccgaagcta ggcatggtac tgccccccac ctgaatttaa acgcacttca 1740 gataacttgt cgattgcccg attctgacga ctttctccct ggatccgaag aagtttcacg 1800 aagtcccttt ggcgaaagct gctgttctgc tggatgttcc atagcggttg atatccctgc 1860 tttgtcaggt aattttgaaa tggtatatgc ctcgccgaag ctaggcaaga tgctgcgccc 1920 gttttagcaa tgcgattcac agacgacatg acttgccgat tgccgtgaag accctctccc 1980 tggatccgaa gaagtttcgc gctgctgttt tggatgttct atagcagttg aaaccctgct 2040 tcgtcaggta attttgagaa tagtatatgc ctcgccgaag ctgggcaaga tgctgcgccc 2100 ttttaagcca aagcgattca cagaacgata tgacttgccg attgcccgtt ctgacgaccc 2160 tgaatccgtt tcacaagtcc ctttggagaa aactgctgtt ctgctggatg ttctatagca 2220 gttgataccc ctgctttgtc aggtaacttt gagaatggta tatgcctcgc cgaagctagg 2280 caagatgctg cgcccgtttt aagtcaatgc gattcacaga ttgatatgac ttgccgattg 2340 cccgatcatg acgaccctct ccctgggtcc gaagaagttt cacgaaatcc ctttggcgaa 2400 agctgccgtt ctgctggatg ttcagtagca gttatccctg ctttgtcaat gctcgttggt 2460 gatactggtc ttcaggatga aggcaactgt acaggaataa tagaagcagt cgacactcct 2520 gctccgtcag gtaattcgat ggacagtata tgtcctgccg aagctagact tgatgctgca 2580 cctctcctaa atgtaaacgc attccaggcg gcccatctac tgccattgtc taacggatcc 2640 atctcaagtc atgctgatgc tccggaagaa tctcttcgca tttactatca gaacgtcagg 2700 agcctacgta gcaaagccga cgatattctt accgctacgt gtgaagcaga gcatgatgtt 2760 atagtgttga ctgaaacttg gcttaacgat caatttcatt ctgcacagtt ttttggaaac 2820 cattacaccg tataccgtaa agaccgttgt gctgtcagca cgggtaaatc cagaggcgga 2880 ggagttctga ttgctgtatc gaaccggctg atctccgatg ttgcccctat agtaacagat 2940 gacctggaac atttgtggat cactttacag tgctctaata ggagattgtg gatcggtgta 3000 gcgtaccttg ctccggatat ttctagcgat gtttcaacaa tcgaccgtca cttatctgcc 3060 gttaatacag tcacttcagc catgcggtac ggcgatgttc atgtactgtt aggagactac 3120 aatcagcctc atctttgttg gaaaagggct cccactggcg ttgttatctt ggatgccgaa 3180 aaaagtactt tcaatactgc cagctcgaaa ttcgtcgatg gaatgtcctt caataatatg 3240 cgacagttga acactgttac aaacgttcgc gatcgtacac tggatctact atttattaat 3300 gaagatgcta gtacacattg ttcagtattt gaagaactag attcgtatgt atcagccgat 3360 ctttaccatc ctgcgcttgt agctgtattg aaaatttcgt tgccgtgtaa gtttgtggat 3420 gcagctgagg ttaatcggta caacttcaac aaagcggata ttgcttctct taatatcgct 3480 tacgagaata tcgactggac ccaaatagaa agcatgaacg atattgactt ggctgttagc 3540 ttcttcaatc aaacagtgac ctctatgttt gagcaataca tccctgttgt ccaaccacgt 3600 tgcaaaccac cttggtcgaa tgcctactta cgtcgcttga aacgtgcacg cgcatctgct 3660 ctgaaagcat attccagaga acgcaatgat tatacgaagc actgttttgt gtctgctagc 3720 aaccgctacc gccgccataa tcgttttctg tactccaagt atgtggaaag gaagcagtcg 3780 gagctaaaac agaatcctaa atcattctgg aaattcataa acgagaagcg tagtgagcat 3840 ggtctgccaa ctaatatgtt tcttggagaa actactgcca actcaaccga agcaaaatgt 3900 aacctcttcg ccgaccattt ctctagtgta ttcaatccag atgctgcaac ggatcaacaa 3960 attgaaactg ctttacgcga cgttcctgaa cacattattt ctttaagtgc tgtagaattc 4020 accgataatg atgtcttcga agctctttgc ggaatgaagt cttcatctgt acctggtcca 4080 gacggtattc catcggtaat gttgaaaaaa tgcgctggag ccctgtgcca gccattgcgt 4140 atgatgataa atctttcttt gcggagtggt agattccctg aatgctggaa acgctcgtat 4200 atgttcccag tttttaaaaa gggtgacaag aaagatatcg tcaactacag aggtataact 4260 tcactgtgcg cgggttctaa gctgttcgag attttgatgg gagaggtgtt gtttcgaaag 4320 atgcaaccgt acattgctac agagcagcat ggctttttcc ctgggcgttc cacaagcacg 4380 aacttgcttg agtttacttc gctttgcatg accggaatgg acggaggggc gcaaatcgac 4440 acagtgtaca ccgaccttaa agcagcgttt gacagaatag atcatcgaat tcttctagca 4500 aaacttgatc gtctaggtgt atcatacgat attgttcttt ggcttcgatc ttacttgtgc 4560 aaccgtaagc tgtctgtaaa aattggagac acagaatcgt ttatgttttg taacacttcg 4620 ggagttccac aaggcagtaa tttaggacca ttattatttg ctatattttt caacgacgtg 4680 tgctgtttgc tcccgcaagg ctgtaagcta gtgtacgctg atgacttgaa attgtacctc 4740 catattcgtt ctactggtga ttgtctcgag cttcaacgat tgattgactt gtttactgaa 4800 tggtgtaatc gaaatcttct tattgttagt gtttctaaat gttatgtgat aagttttagt 4860 cgtaagaaaa atgacattga atatccctat tgtattgaca atcagctact gacaagagta 4920 tctgaaatta aggacttagg tgtaattctg gataaaaagt taacgttcca tagtcattac 4980 ctccaaacaa ttgctaaggc caacaaaaac ctagggttta taatgcgtgt cggttatgaa 5040 ttccgcgatc cgtattgttt gagagcttta tattactcgc ttgtccgctc tgtgttggac 5100 agcaattcaa tagtatggag cccgcatagc aataattgga taaccagaat tgaatccata 5160 cagtcgaaat ttgtgaggtt tgctttaaga tttcttcctt gggccaatag atatgagctg 5220 cctccatacg aagatcgttg tcaacttttg agaatggaac cattgtatat aagacgaaac 5280 catcaaaaag cattatttgt agctaaatta ttaactagta agatagattc acctaagatt 5340 ttaggaacaa ttgggattta tgccatcgaa agaccgttaa gaactagagg atttttgtac 5400 cttccttctc gtcgtacact ttacgggctt aatgagccta tacgatccat gtgcaatgtg 5460 tttaattcag tgtattattt gtttgatttt gactcgtcta tagaaatttt taaaaatcga 5520 ctccgtaatg ctcgttaata ttagctgtgt tagaataact ttagattaag tagatcatgt 5580 agaccatgtt gttcgatgat tatttcacaa taaataaata aataaataaa taaa 5634 // ID EnSpm-11_HM repbase; DNA; INV; 6548 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6548 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 382-382 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 576..3095 FT /product="EnSpm-11_HM_1p" FT /translation="MQCFKCSIYFADLFDLFYHFRNFHGLATRGSELRCMF FT NNCPRVFLSFSKLKKHFQHHHSVVPVQTFGFNNNESCSNNLXSESHEDAFD FT TNFPSDLDQDCNNATIVNSNSITRDSIEVSVMKFVSEITSKPNITLANSQL FT VMENMSGLLNDVAXYASKTIKTLAINLKVAPTNKFVFDAIKDVQSMFSIFN FT KTDTIYKRNKWLKEHGYYIEATEISLGIREEQKYSMALQSMHSVLVEDKCY FT VVPIDILLAKILEQASSRIAFPQCTPXSLPNKIQDFVDTHTFHHHSFLQRH FT QKTFLLHFFIDAFEVTNVLGSHTRVHKLEALYMMIRNIAPEYQSKTDSIFL FT IGLWYALDAHTDKYSYYPILHSVVSLLQQLESEQGVVVVVNGVEETIHAIL FT ILFSADNLGAHTLFGFMESFNAKFFCRFCKCAHXEIQTSFKAETFVRRTRA FT EYDMCVSECRNNNYNASESGIKHGCPFNCLSQFHCIDQSAVDCMHDVLEGV FT VPLEICLVLQSLIDKKYFLLADINNVINNYNYSLADKNSKPPEVNLKNIRI FT QAAESWCLIRNLPLMIGNLIPEGDAHWRLLLMLLDCMDAIFAPTITVGLIR FT QLEFNLEEHHAQFKNLYPEKRLIPKHHFMLHYPEFMAKYGPLSRYWCMRFE FT AKHRFAKELASVVRNFKNICKTIMHRSQLKLANTLFSGSLYTDYVQIGICD FT TVIVGNLQSAVSNSLRTVLNVDNCDEIYVAHSVKFGCYEIKPGCVLCHGIV FT ESHPQFGELKCIVSLAKQIYFVCYPFETLHFHEHFHSFEXKKSNNPNTLLV FT LHTNQLKDYHPYNVIXIAGEXGKYSTLCKSEIHNFLE*" XX SQ Sequence 6548 BP; 2166 A; 951 C; 1046 G; 2364 T; 21 other; cactctaaaa aaaaatccgt taaaatatca acggagtaac tccgttgaca gtaactccgt 60 taaactccag atttaattga tacaaattaa gggagtaatc cgttgtttta caacggatta 120 ctcctttaaa tataatataa cgttatatta tasttaacgt tatattatat tcatgaaytc 180 ctttaaattc ttatgttata acagaataac tttgttaagt taacggagtt caagcgtgta 240 taaaggagtt cataaatatt actttgaaca cttaatattt atttgaaact aaaaacttat 300 ttaattagtg tattaaattt taattagtga atattatcat gtgtctttaa ggttagttat 360 tttcattatt aatgtcattt cttgattaag ttgaaattta acttacatta attaattaat 420 taaatgtata atttaacgtt ttatacattt acagaatgtc tgtgctaatc aaagtatggt 480 cttctgatcg aattattaag aagttagttg ccatcaataa cagtgaaraa tttgctgcaa 540 aaggtgacaa tataaattaa tattttaaaa caaacatgca atgttttaaa tgctcaattt 600 attttgcaga tttattcgat cttttttacc attttcgaaa ttttcatggc ttagctacaa 660 gaggatctga attacgctgt atgtttaata actgtccacg agtattccta agctttagta 720 agttaaaaaa gcatttccag caccatcatt ctgttgtgcc agttcaaact tttggcttca 780 acaataatga aagttgcagc aacaacttac waagtgagtc acatgaagat gcatttgaca 840 caaattttcc aagtgaccta gatcaagatt gtaataatgc cacaattgtt aactctaatt 900 ctataacaag agacagtatt gaagtatctg ttatgaagtt tgtatcagaa ataacttcaa 960 aaccaaacat cactttagct aattcgcagc ttgttatgga aaatatgtct ggcctactaa 1020 atgatgttgc trtttatgct tcaaagacaa taaaaacttt agcaattaat ttgaaagtag 1080 ctccaactaa taaatttgtt tttgatgcga taaaagatgt ccaaagtatg tttagcattt 1140 tcaataaaac tgacactata tataaaagaa acaaatggct aaaagaacat gggtactata 1200 ttgaggcaac agaaatatca ctgggtatac gtgaagagca aaaatactct atggcattgc 1260 aatcgatgca ttctgtttta gtagaggaca aatgctatgt tgtgcctatt gatatcctac 1320 ttgccaagat attggaacaa gcttcatcaa ggattgcttt cccacagtgc acaccaartt 1380 cactcccaaa taaaatacaa gattttgttg atacacatac ttttcaccat cattcatttc 1440 ttcaacggca ccaaaaaaca tttttgctac acttttttat tgatgcgttt gaagtaacaa 1500 atgtgcttgg gagccataca cgtgtacata aacttgaagc tttatatatg atgatacgca 1560 atattgctcc tgagtatcag tctaagactg atagcatttt tttgataggt ttatggtatg 1620 cgctagatgc acacacagat aaatactcat actatccaat tcttcatagt gttgtatctt 1680 tattgcaaca attggaaagt gaacagggtg ttgttgtcgt agttaatgga gtggaagaaa 1740 ctattcatgc cattttgatt ttgttttctg ctgataattt aggagcacac actttgtttg 1800 gcttcatgga aagctttaat gcaaaatttt tttgtcgatt ttgcaagtgt gcacattktg 1860 aaattcaaac ttcattcaag gcagaaacgt ttgttaggag gactcgagca gaatatgata 1920 tgtgtgtatc agaatgtaga aacaataact acaatgcttc agaatctggc attaagcatg 1980 gttgtccttt caattgtttg tcacagtttc actgcataga ccagagtgca gttgactgca 2040 tgcatgatgt attagaaggt gttgtacctt tagaaatatg cctggttttg caatcactga 2100 tagacaaaaa gtattttctg cttgcagata tcaacaatgt aatcaataat tataactatt 2160 cattggctga caaaaacagc aaaccaccag aagttaactt aaaaaatatt cgcattcaag 2220 ctgctgaatc atggtgtttg ataagaaatc tgcctttaat gattggaaac ctaattccag 2280 aaggtgatgc tcattggaga cttctactaa tgttacttga ttgcatggat gcaatatttg 2340 caccaactat aactgttggc ctcattcgtc agctggagtt taatttggag gaacatcatg 2400 cccaatttaa aaacttatat cctgagaaga gacttattcc aaaacatcat tttatgctac 2460 attatccaga atttatggca aagtatggac ctttaagcag gtactggtgt atgcgttttg 2520 aggcaaagca tcgatttgct aaagaacttg cttctgttgt tcgcaatttt aaaaacattt 2580 gcaaaactat tatgcatcgt agtcaactca aactagcaaa tacattgttt agtgggtctt 2640 tgtatacaga ctatgtacaa attggcatat gtgatacagt aattgtaggc aatttgcaat 2700 cagcagtatc gaatagtttg cgcacagttt taaacgtgga caattgtgat gaaatatatg 2760 ttgctcattc agtaaaattt ggctgttatg aaatcaagcc gggttgtgtt ttgtgtcatg 2820 gcatygtrga aagtcatcct cagtttggag agctaaaatg cattgtttct cttgcaaaac 2880 aaatatactt tgtatgctat ccatttgaaa cattacattt tcatgaacat tttcattctt 2940 ttgaarttaa aaaatctaac aatccaaata cactgcttgt tttacataca aatcagctca 3000 aagactatca tccatacaat gtcatatsta tagccggaga agamggcaaa tayagcacac 3060 tttgtaaatc tgagatacac aattttttag aataaaaata ttgtagaaat attgattgtt 3120 tttttattta tgcattacaa actgaaaaat atataaacct ttagttatta aatatgtata 3180 attgtttaat tgtatatgaa gttgttactg tgcattcata tatatattca ctttaactag 3240 tctgaatttt tttgtgttgt gtatataaaa agctaaatga ctgtgtattg ttattccagt 3300 gcgttgcaag tttggtttga aatcagatca acttgttggc tttgtctacg aaaatgatgg 3360 cacagaagtt gatgaagttg attttgaaat aataaaacat ttagcagaat caaacacaat 3420 tctcattcta ctcatagatg gtcagcagtg gactaatgct aatttggtaa atcaatttaa 3480 attacaacta aagactgcta ctgtgtaaat tctttaataa actttttttc atcataaata 3540 ctttgtaata ttttgaaatt agcattttct agtgtgtttt aactttataa actgttttta 3600 taaattttaa aaaaatcatt tgttttaata cagacaaaat caaattcacc attagtgttg 3660 ccttcgttgc agccaaatca aactacatta acttccagta ttacrctctc agaatctagt 3720 gtttctgtta acaacagcca tataatgaaa gctgcaaatg aggcgtttgt ttcagctaat 3780 tcgattaaag cctttcttgc gcaacaccct gtaagtacaa tacttaaata gtttttacat 3840 acatacatag ttgtacgtat ttttacgtaa taatttttta gcaataattt tactttactt 3900 gcttttttag caataaaagt tttntcaagg ttttgtgaac agtttttttt taatgaatag 3960 caatatcatg tattatcttt atagagtact tgtcaatata ctgaagagat tgaatctggt 4020 aaagtttcaa tctctactag acagcacatc attcgagttg ttgtgcacaa aatgatgagt 4080 gtgtatggta attatcctga tcgatatcaa aagattttag tggcagcagt gctaagtgag 4140 gcactkgtac ttccagcatg cattttttat gattctacta attatcatgg cttcctggag 4200 agaggattgg aaaatgctag aagaaagtta cctggtgttt attaaaatta gttaaattaa 4260 ttgaatataa atttaactgt agatgcttac caactttttt ttatattagc aacttgcttt 4320 gtaatctcct tgttgtgtgt tattgtgatg taatttctcc actgtatgtc tygattgcaa 4380 ataattcatt ttagttacta aaagtttatt tgaatcattg ccttattttg tgtcatacaa 4440 gcagctacat ttacaagcta tgttttcttt tgtatactag gatcagaaaa aaagtttgtt 4500 tggtctcgta aaaggggaca cractcaact tgttgtgata caactcaact tgcaggtact 4560 gaaytatctc aaccagtgga tcatgcakta gacctacaaa atctagaagg gtgtccccgt 4620 ggtttcatgt tttgcgcatt atgtttaggt tatctaatta tctatatctt gtaaaaacct 4680 tttgttataa aaaacatata tgttgtaaag acttattatt tactaattat ttgttgctaa 4740 caattttact aagtaaaata aaataaaaaa agacctagta atataggtct ttcaaaatta 4800 ttgttgatta taattttaat atctatattg ttgctctgtg tttttcatat ttgaattaat 4860 ttcaggaatg agcaacctcg atgaaattga aaagcttgat tcattatctt ctacacgtgt 4920 agaagaagtt aaggcttcta caaagtcaac gttttgtttt cggcagaatt gggttcattc 4980 aacagatcaa ccttcattat cccgactgat ggttaaattt ccaaagttca ggattattcc 5040 agagctggta ttttttttgt caaaattatc tataatgtgg ctctttttat cagatatcaa 5100 actttagtgt ataatattca tacatgtata agtgataata aaataattgt acagatagtt 5160 tgttattaaa tgatttatag caattaaatt gataagatta atacagtttt aggcattgta 5220 ttcaaaaata tttattttta gattgatatg gaatttcaat taaaatattc agactgcatg 5280 gttgatatcc tcaaaacctt taatcagact gttgcagacc cagtattaaa atatgcttca 5340 ctgtcaaata gtgattacat caaagctcta gctataatgg catctaaaaa acatcatggt 5400 ttgcattaat tttattaaag atattctttt tttttgttca caaatatttt gcataaaatt 5460 acctttgtat gtttggcact tttacgtttt cgtatggttt atatatttgt tttagattgg 5520 aaagcttatt atgcatgcaa gcttttgtgc tcgcttctgc gacaaggggg gaaacagaaa 5580 cgttctgctc cagaagtagc caatgaaatc attcggatat gcagtgtaag aaattttttt 5640 tttagttgat ttacagcgca cataagtttt aaattaaaaa aaaaaacaaa aaattataat 5700 ttataaatat ataaatacaa tttataataa tattgtaacc gagataacaa aagtttcatt 5760 gcaaatttag gctgagcatg gactgcaaga tgttgcagaa aagttggagt ccccttatcc 5820 ggtaatatta atacagggga ttgagggtga aaaaattctc tctataaaga ttgcatttga 5880 aaaaactttg attgaagccg gcacatctat ygtagaaggc atcgatcgtt tattcaagat 5940 tttttggtgt tttaacatcg aataccagcc atcaacaatc atgttctggc agtttatgca 6000 agatatttat gggctgcagt acggttcgtt gcctaatgga gtggttgagc tgcgatcaac 6060 actctctaaa tatgttttta cttaaatgta tcatttattt atcttaatat acatgaagat 6120 ttacaatttt gttacaacgt gttattttta tttattttta caaatgtttt gttgctgttt 6180 atgttgtcta aaaccctcat ctctaataca aatcagttaa actaattttt agataaacac 6240 taataattag tttgttaagt ttataggagt acttagttgc aaaaataata cggaataaaa 6300 catctataac tgttttattt cagcaatttt ttatgataaa caaaatagtt ttttttatgt 6360 tacggagtat tttgtaactt caacgaacta tccgttaaca tttacggagt aatttgttac 6420 attagagtta cggattgaat ccgttaaatt attactccgt taaattaacg gagtactccg 6480 ttggtgctag aatgagggag tactccgtaa ccaactccgt tgattttaac gggatttttt 6540 ttagagtg 6548 // ID Gypsy-248_AA-LTR repbase; DNA; INV; 263 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-248_AA_; KW Gypsy-248_AA-I; Gypsy-248_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-263 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1098-1098 (2011). XX DR [1] (Consensus) XX SQ Sequence 263 BP; 76 A; 61 C; 39 G; 84 T; 3 other; tgtcatacct ttgctacact tcattgataw atgttgaacc atacctttct wmtgttacat 60 acacctgaat atgtataact accttcccat acatacacct tgctccgcac gttttgacag 120 ttcagtacag tcagccattg cataaatatt cttccccgcg tgcgtgagca tgtgctcatt 180 aaattccctc aactaaatcg ttaagaataa aagatagtaa gtgttaaaac aaatagctgt 240 tgtgtgtatt actcggccag tca 263 // ID Copia-1-LTR_HS repbase; DNA; INV; 114 BP. XX AC . XX DT 16-JUN-2009 (Rel. 14.06, Created) DT 16-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE LTR retrotransposon from Hydractinia symbiolongicarpus: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSD; KW Copia-1-LTR_HS. XX OS Hydractinia symbiolongicarpus OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydractiniidae; Hydractinia. XX RN [1] RP 1-114 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydractinia symbiolongicarpus."; RL Repbase Reports 9(6), 1261-1261 (2009). XX DR [1] (Consensus) XX SQ Sequence 114 BP; 35 A; 17 C; 20 G; 42 T; 0 other; tgttgatatt gcgatttgtg ttaattctgt tgttatcttt cagtaaccaa tacaatagta 60 ttagagactg gtgaagcctt caagttacct catcaaggta agaatcattt aaca 114 // ID BEL-10_AA-I repbase; DNA; INV; 2938 BP. XX AC supercont1.13; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_AA_; KW BEL-10_AA-LTR; BEL-10_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2938 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.13; Positions 1060869 1063806. XX CC 'ACAG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 408..2849 FT /product="BEL-10_AA-I_1p" FT /translation="MQQDRTPKSRSVAEAAAEAAKSNKREAVFHDQTAEWK FT FSDDPITLKELSSLCRQRDQVSLKLVRVRRALGTAHHQINQSLLRTYLKNA FT DDAYTEFSTVHSKIIAIIPDEAFRQEEKQYIAFEERYRQIRTTIDELMLAS FT EVEPARVPDPKPQVIIQQQPLKAPIPTFDGSYANWPKFKAIFQDLMANSPD FT SDAIKLYHLDKALVGEAAGALDAKVIREGNYQQAWAILTNRYENKRIIVET FT HIRDLFDIKRMSSGSSKELRRLLDECIRHVESLKYLNQEFSGVSDLFLVYI FT VTSAMDNATRMAWEATQRKGELPQYSQTISFLEARHQMLENCENAFQSSSQ FT YFSDPSLKPSQSCTKTSNICTATTERPKKYICDFCRGPHRNFQCYILASLP FT FDRRMDKVRKTGACFNCLRKGHSFRECSSSKACQKCQKRHHTQLHREKMSQ FT DPNPNFSDPARPDHTQDRVSRQAHVEENSNNSLTCNHADCCKIVFMPTAVV FT DVLDTKGQSHRCRVLLDSGSQVNLISSDLANRLGVSRQPTKVPLRGINDIK FT SMVQETVVVNFRSRITNYRASVEFLITPTVTSSIPSAGTDASTWRFPSGVC FT LADPDFHKPNKIDLLLGIELFFELLQRGHIKIDEDYPELRETQLGWVVTGV FT LKNRSVIEEIQNSFTTYVHHSSRPRRRREQFEENTNIDPEPIHYNMYMRHD FT DRPVAVLTTSQTVMRQDDKKSISSIVSDPPSPIENHRIYPTSSAQRIGIDV FT PSSDSVVHQREIPSSMPLNYRVNRGEYVPYGTSVPNHCRSSMKNNDTHSIV FT TEAHDELTLSMT" XX SQ Sequence 2938 BP; 839 A; 764 C; 678 G; 657 T; 0 other; acatagtggt ccttcgagcc ggatccagcg ccgctcaccg taggaaagta ttcgacacat 60 ccgctcgtcg tgacgagata atccgacacg ttagcgtggc tttagccagg acgtgtatta 120 cagagaggtc tactaatcac catcgccgtc cgacacgtct tcgtggtacc tataccggga 180 cgtgtattag tgaaggaatt gttcacatcg tcactcgcta cgaatcccaa actgttctga 240 accacaacga ataacaccag gaggaatcgc ccaccgatta tccgacctga aattcaaccc 300 gacgaagtag tgtgaagcca aatcgagtcc gacgcttctt tctccaggaa tcactgttat 360 cgtttcgact caacagttac ccgctaattg aggatcgctt cggagccatg cagcaagatc 420 gtacgccgaa gtccagatca gttgctgagg ccgcagctga agccgctaaa tcgaataaac 480 gagaagccgt tttccatgac caaactgccg aatggaaatt ttccgatgac cccattacac 540 tcaaggagct gtcgtcactt tgccgccaga gagaccaggt gagccttaag ctagtacgag 600 tccgaagggc acttggtacc gcccatcacc aaatcaacca gtcgctattg aggacatatt 660 tgaagaacgc cgatgatgcc tacaccgaat tcagcaccgt acatagcaag ataattgcaa 720 tcattcccga tgaggctttt cgacaggaag aaaagcagta cattgcattt gaggagcgtt 780 accgtcagat ccggacaacc attgatgagc taatgcttgc cagcgaagtg gagcctgcca 840 gagttcctga cccaaagcca caagtcataa ttcaacagca accattgaag gctccgatcc 900 ccaccttcga tgggtcctac gccaactggc cgaagttcaa ggccatattt caggacttga 960 tggccaactc gccagattcg gatgccatca agttatacca tctcgacaag gcacttgttg 1020 gtgaagctgc aggagctctg gatgcgaagg tgatccgcga gggaaattac cagcaagcat 1080 gggctattct cacgaaccgc tatgagaaca agcgaatcat cgtagagacg cacatccgcg 1140 atctgtttga tattaagcga atgtcatcag ggtcgtccaa ggagttgcga cgcttactcg 1200 acgaatgtat ccgccacgta gagagcctga agtatctcaa ccaggagttt tcaggtgttt 1260 ccgacctgtt tctcgtctac atcgtgacgt ccgctatgga taacgctact aggatggctt 1320 gggaagcaac ccagaggaaa ggcgaactac cccagtattc gcaaaccatt tcgttccttg 1380 aagcgcgaca ccaaatgttg gagaactgtg aaaacgcttt ccagtctagc tcacagtatt 1440 tttccgaccc gagtttgaag ccttcccaat cgtgtacaaa aacttccaat atttgcaccg 1500 ctacaaccga acgacccaag aagtatatct gtgatttctg ccgaggacct caccgcaact 1560 tccagtgtta tatactagca tcgctgcctt ttgaccgaag gatggacaag gtaaggaaga 1620 ctggagcatg cttcaactgt ctccgaaaag gacactcgtt cagagagtgc tcatcttcaa 1680 aagcgtgtca gaaatgtcaa aagcgacatc acacccaatt gcatagggaa aagatgagcc 1740 aagatccgaa ccccaacttt tccgatccag ctcgaccaga tcacacacaa gaccgagtat 1800 ccagacaggc ccatgtagaa gagaactcga acaattcact gacctgcaat cacgccgact 1860 gctgtaagat tgtatttatg ccaaccgctg ttgtagatgt gcttgatacg aaaggccaat 1920 cgcaccgatg ccgtgttctg ctggatagtg gctcacaagt gaaccttatt tcgagcgatc 1980 tggctaaccg attgggagtc tcaaggcaac ccacaaaagt tccgctacga ggcatcaacg 2040 acatcaagag catggttcag gagacggttg tagtgaattt ccgatccaga atcaccaact 2100 atcgtgcaag cgtggaattt ttgataacgc caaccgtaac cagcagtatt ccgtcagcag 2160 gaactgatgc ttcgacttgg aggttcccat ctggtgtttg tcttgctgat ccagacttcc 2220 acaaaccgaa caaaatcgac ctgctgctgg gtatagagtt gttcttcgag ttactgcaac 2280 gaggacatat caagatcgac gaggactatc ccgagttacg agaaacccaa ctaggatggg 2340 tggttaccgg agttttgaaa aacagatccg tcatcgagga aatccaaaat tctttcacca 2400 cttatgtaca ccattccagc cgaccacgta gacgccgtga gcagtttgaa gaaaacacga 2460 atatcgatcc ggaaccgatt cactacaaca tgtacatgag acatgatgac cgaccagtag 2520 ccgtgctaac aacatcgcag accgtcatgc gccaagacga caagaagagc atctcatcca 2580 tcgtttccga tccaccaagt cccatagaga atcatcgaat atatccgacg agttcagcac 2640 aacgaatagg tattgacgtt ccttcatccg attcagttgt tcaccaaaga gaaataccgt 2700 ccagcatgcc tttgaactat cgtgtcaacc ggggggagta tgttccgtat ggaacgagtg 2760 tcccaaatca ctgccgcagt agcatgaaaa acaacgacac acattctatc gtcactgaag 2820 cacacgatga actcaccctc tcaatgacgt aggcagtgtg ggtcgaatca ggggctgcaa 2880 ttgggtgagt tcatcgtgtg cttcagtgac gatagaatgt gagtgtgagt gccaacag 2938 // ID Jockey-1_TCa repbase; DNA; INV; 3076 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.04, Created) DT 05-APR-2009 (Rel. 14.04, Last updated, Version 3) XX DE Jockey-type retrotransposon: consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-3076 RA Jurka J.; RT "Non-LTR retrotransposons from Tribolium castaneum."; RL Repbase Reports 9(4), 739-739 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 365..2746 FT /product="Jockey-1_TCa_1p" FT /translation="MGANQLIQNSFQVAFWNANGLRAARDELEEFVDRLQL FT DIVLVCETKLQPQTTDPKIRGFTLHRADRGIGPGGGTAIFVSNRIKHSVLA FT TPDLRNMEAVGINVATANGPLRLFACYNRPQIPILEEDLQTLFDGNTPTIA FT AGDFNAKHINWGSRRSNRNGNILNGFTDQHLDISVMAPVEPTFYRNSDGXA FT DILDVAIVKNVVHQVRLTAINDLSSDHNPVLMQIGNEANDPIVCRYTSVDW FT RKFTKHLVNNFGTIPPIRSTEEIDEAVQTFETKIRDAITPPPGNXELGPKT FT GDXITGPHRRSTRADXXHVEEIEDHVESIATEDPDEPLTPTTPEEVSGVIR FT KLKKRKASGPDEISNRALKNLPLKVIVELTGILNAMFSFRYFPQRWKMATV FT IFIPKPGKDPKFPQNHRPISLLSAVGKVAERLIRSRLLQLTQERHIVPDEQ FT FGFRSNHSTTDQLLRVVEHASISIERKQVTGAVFLDVAKAFDAVWHDGLIY FT KLHQTGIPLAMVQMIRSFLDGRRFQVRINNSVSDPQDLEAGVPQGSVLSPL FT LYSIFTHDIPKTDRTTLAIYADDTAILTRSKQPYMATRYLQESVERIENWC FT RRWLINVNPDKSRALLLARRRVSPDGFVRMFNADIPWSDQVKYLGVILDKK FT LSFGPHLDYALAKGKMATGMLRSLVCRRSALSIDNKLLLYKSVIRPTMTYA FT SVAWAFAPCKTRMHKLQTFQNKFLRQAFNAPWFVRNNQLHREAKMPTMEEF FT FRETAERAFSKAEAHPNPLVREAVDYDENGPSRCKRPRMALL*" XX SQ Sequence 3076 BP; 819 A; 896 C; 737 G; 615 T; 9 other; aagtgcgggg aagcacatga cacaaaagtg tgcacgaagg agagcaagga gcctcccaag 60 tgtgccaatt gtaatggccc ccacacggcc aattacagag gctgtccgca gttccccaag 120 ctgcagaaaa ccgccacccc caggacaacc gcccccgcca aggtcgccgc ccccaaggca 180 gcagccccca aaaaggccgc cgctcccaag ccaacagccg cccccaaggt cgtcgctccc 240 aaggcaaacc ccaccgccac caaaggcaaa ggaggcgcca aaaagaccgc tcaagtcaaa 300 cccaacgcca ccaatggcaa aggagtaaaa ctctcagcct tcgtcgctct tctttgaatc 360 gttgatgggt gcgaatcaac tgatccaaaa ttcttttcag gtggcgttct ggaacgccaa 420 tggccttcga gcggcccggg acgagctcga agaattcgtc gacaggctgc aacttgacat 480 cgtacttgtg tgcgaaacaa agttgcaacc tcaaacgacg gacccaaaaa ttcggggttt 540 cactctccac agggcggaca gagggatcgg cccaggagga ggaacggcaa tttttgtaag 600 taacagaatt aaacattctg ttctagcaac acccgatctt cgcaacatgg aagccgtggg 660 catcaacgta gccactgcaa atgggcccct gcgcctgttt gcatgctaca accgaccaca 720 aattccaatt ctcgaagagg acctgcagac acttttcgac ggcaacacac caacgatcgc 780 tgctggtgat ttcaacgcca agcacatcaa ctggggaagc cgccgttcga acaggaacgg 840 aaacatcctc aacggcttca cggatcaaca cctggacatc tccgtcatgg ctccggtaga 900 acccacgttc taccgaaact cggacggkrc tgcagacatc cttgacgttg caatcgtcaa 960 aaacgtcgtg caccaggttc ggctgacggc aataaacgac ctctcttcgg atcataaccc 1020 cgtyctgatg caaatcggga acgaggcaaa cgatccgatt gtatgtcgtt atacctcagt 1080 ggattggagg aagttcacca agcacctggt gaacaacttc ggaacaatcc caccgatcag 1140 atccacagaa gaaatcgacg aggcggtcca aaccttcgag acgaagatcc gggacgccat 1200 cactccgcca ccagggaacg ragaactcgg ccccaagact ggagattyca taaccggacc 1260 tcacagaaga agcactmgag ctgacckgra ccacgtcgaa gagatcgagg aycacgtgga 1320 atccatcgcc accgaggacc cagatgaacc cctgaccccg acaacccccg aagaagtttc 1380 cggagtgatt cggaaactga agaaacggaa ggcatccgga ccagacgaaa tcagcaacag 1440 ggctctcaag aacctcccgc tgaaagtcat cgtcgaactg acaggtattt taaatgcaat 1500 gttcagtttc agatattttc cacagcgttg gaaaatggca acggtcattt tcatcccgaa 1560 accggggaag gacccaaagt ttccccagaa ccacaggcca atcagcttgc tgtcggctgt 1620 gggaaaggtc gcagaacgtc tcatccgctc ccgcctcctc cagctcaccc aggagcggca 1680 catcgtcccc gacgaacaat tcggctttcg gtcgaatcat tcaacaactg atcagctact 1740 acgagtcgta gagcacgctt ccatcagtat cgagcgcaag caagttactg gtgcagtgtt 1800 tctcgacgtg gcaaaagcat tcgatgcggt ttggcatgat gggcttattt ataagctcca 1860 tcagactggc attccgctcg caatggttca aatgatcaga tcgtttctcg atggacgccg 1920 tttccaggtg agaatcaata actctgtttc agatccccag gacctggaag ccggagtccc 1980 tcagggctcc gttctttcac cccttcttta ctccattttc acccacgaca tcccaaaaac 2040 tgaccgtacg acactggcga tatacgcaga tgacacagcg attctcacca gatcgaagca 2100 accctacatg gctacccgct acttgcagga gtctgtggaa cgcatcgaaa actggtgtcg 2160 caggtggctc ataaacgtca accccgacaa aagccgagcg cttttgttgg cacgacgtag 2220 agtcagccct gatgggtttg ttcgaatgtt caacgcggat attccctggt ctgaccaggt 2280 caaatatctc ggggtcattc tggacaaaaa actctccttc ggcccacatc ttgactacgc 2340 actcgcgaag ggcaagatgg ccacgggaat gctgagatct ctcgtatgtc ggcggagcgc 2400 gttgtcaatt gacaacaagc tcttgttgta caaatcagtg atccgaccta ccatgactta 2460 tgcttctgtg gcttgggcgt tcgcgccctg taagactcgg atgcataagt tacaaacttt 2520 ccaaaacaaa tttctccgtc aagcattcaa cgccccgtgg tttgttcgca acaaccaatt 2580 gcaccgagag gcgaagatgc cgacgatgga ggaattcttc cgtgagaccg ccgagcgagc 2640 cttttcgaag gcggaagccc acccgaaccc cctggtccga gaggccgttg actacgacga 2700 aaacggtccg tccagatgca agaggccaag gatggctctc ttgtgatcga aaatgccttc 2760 tcactcagat cttccgaatc tggtaagcat caaatgtcat tgccacattt atctgtcgca 2820 gctcttttca gattccatca aatggatcac ttcgggggaa tccccggagc gccctcttct 2880 tttcttctgg cgccatgcaa accggcatgg ctgcccagcg tgcgttcagg gatgaaggac 2940 caatggttcc gatccctaag gtgacgcgag gggttttagt gggtattgtc ggcttgcaca 3000 tttaccgacc gagtcccaca taaccagacc tagtctggta tgcgtaaaag catttcccct 3060 ctccaaaaaa aaaaaa 3076 // ID Copia-13_SI-LTR repbase; DNA; INV; 301 BP. XX AC AEAQ01018600; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_SI_; KW Copia-13_SI-I; Copia-13_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-301 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01018600; Positions 374 74. XX SQ Sequence 301 BP; 62 A; 85 C; 59 G; 95 T; 0 other; tgcattgata ggaggagtgt tagaaatatc aatgcgttct atgcgtgctt aatttcgatt 60 ttaccgcgct ataatgtccc tatgctttcg ccccgacatg cgcgatgcgc agcgcttatc 120 ccgcgcttct tccgctctgg ctggcgcttg ctctcttgct ttgcttcgct cgccacgcgt 180 gcacgcactc acgttcctgt atcccatata tattattaaa ggagtttaac gtataaacaa 240 ctgtgctttc ctgagtctct actcccacgc ctgatcgcta ctctaatagt aactaaatcc 300 a 301 // ID Gypsy-589_AA-LTR repbase; DNA; INV; 141 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-589_AA_; KW Ty3_gypsy_Ele15; Gypsy-589_AA-I; Gypsy-589_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-141 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 141 BP; 52 A; 23 C; 27 G; 39 T; 0 other; tgtagtgaat gatattaatg tatttgatca aggtaggcac agacaatgac tcagagtagg 60 caatgacagt cagtaaagtt aataaatgat cattctgtaa aaccacctta aacagagcag 120 ttgtaattcc tgacctctac a 141 // ID Gypsy-4_AA-LTR repbase; DNA; INV; 229 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_AA_; KW Gypsy-4_AA-I; Gypsy-4_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-229 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 978-978 (2011). XX DR [2] (Consensus) XX SQ Sequence 229 BP; 77 A; 62 C; 33 G; 57 T; 0 other; tgtggcgtat ccgacacagc aacccattgt acacacgaat tatataccac taccctatat 60 tccttgaatt attgcgtaat aaccatacat acacacatcc cacagaatta tcacacgatg 120 atgtaaatat agcttttagt tcagtacaga ccaccaaacc cgatagtcta gctatcgtcc 180 gaaagaacgt ctcgaatcgg tatcaaaacc gtaggcacta gtccctaca 229 // ID BEL-35_AA-LTR repbase; DNA; INV; 463 BP. XX AC supercont1.126; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-35_AA_; KW BEL-35_AA-I; BEL-35_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-463 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.126; Positions 1383578 1383116. XX SQ Sequence 463 BP; 175 A; 71 C; 76 G; 141 T; 0 other; tgtaacatca aacgcccctc ggctcaaaaa ttaccacagg acaagggaga acaataagct 60 gttgggagac aatggaaaat gtcatactga cttttgatga ataggagtta gattatacga 120 attcgatcgg taaaaccata gcttaaatct tgagttcttg aacaagttat agttattcgc 180 tgtatcttaa atcttatatt taccacgcta tgtaagtgaa tttggaagat taaaatgaat 240 ttatctaatt atttacctaa cttctctaaa cagtatatag cagtatgaat tgtatcggcc 300 ttaaatctaa agaactacta cgattgtaac tgaaaaattt gtaagtatta tgattgtaaa 360 ctaaaaaaca aatgaaatga aatctattaa aaacttacag ctaaagctga tctcaacaaa 420 cgaatcgtat ttcttgtggg actgctaaga aagttcagct aca 463 // ID Gypsy18-LTR_Dpse repbase; DNA; INV; 184 BP. XX AC Unknown_group_563; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy18_Dpse; KW Gypsy18-I_Dpse; Gypsy18-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1110-1110 (2009). XX DR Genome; Unknown_group_563; Positions 98855 99038. XX SQ Sequence 184 BP; 44 A; 54 C; 31 G; 55 T; 0 other; tgtagggatc cttggtgggc tccctacaag tgagtaagaa tccgctccag aaactccatc 60 aaatatacgc cacacttagc atagttcttt tcgcacgtct gctcgcttct gccgtcgaac 120 tcttacttac tatcgatata tcttcctcta cccttgacta tccctagcga tagatcgata 180 ctca 184 // ID Sat4_Cis_ repbase; DNA; INV; 124 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat4_Cis_. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-124 RA Smit A.F.; RT "Sat4_Cis_ - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC Ci000004. XX SQ Sequence 124 BP; 36 A; 52 C; 8 G; 28 T; 0 other; caactgaacc ccttacaact gaaccccttt caactgaacc ccttacaact gaaccccttt 60 ccaactgaac cccttaccaa ctgaacccct ttccaactga accccttacc aactgaaccc 120 cttt 124 // ID CR1-33_HM repbase; DNA; INV; 4177 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-33_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4177 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1861-1861 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(211..1056,908..4054) FT /product="CR1-33_HM_1p" FT /translation="MTKLTDQQKNCIALMINKAFEELKDDFQVKLNQQTKL FT FEVKLKEKDAEIAELSEKVKRLEENNANLPPSSSPAVCYSQIAKQLAKPGS FT ELYNASIKVSKSHEILSSKKSKNVVIVGLPNSQKTIIEDRKTDDENMVNEL FT IRSLNRRIIIKSTFRLKSKVPLVNNASEPLVVEFENEVARNEMLAAARELS FT TIEAYKKVYIRPDRTPDEQAEFAQLNKERKEANEDLQRNNNLDQPFRFVIR FT SGKLLCIDVTTTKDIGNRKVHPFVKESVAKQARRGSYQPILTSPFDLSLEA FT ENYCASTSLQQKTLATAKFTHLSKRALPSRPGGAHISQSYKSYQNLSNVKL FT PIYLKQNTSTNYKPIKFYYTNATSCNSNKLSELSTLALAYEIDVIAIAETW FT FKLDSIKIIDGYELYQADRDRIGGGVAIYVNKRLTNIEIVNVHLNNKEIEQ FT VWCSINCNNFHLIVGCVYRPPNSTADYSKLILDSIRTASKLKSGKKYTDLI FT VVGDFNFPNIEWTDDGPIIHSNGNSIEHEFIKVLNDCSLYQIVNFPTFKQN FT DSVSAVNTLDLVLVCNPNCIDFLQCGPQLGHYEKGRYHYSITWQYFTSLSN FT NTVLTKDKFCFKKGNFFQMNKYLKQVNWVEHFYNKSACECYELFLHTYNES FT CRLFIPILNNKKKNSQPWLDAKVKKAIKIKHSMFYSNLHRQSQESRKAFNK FT MCANVKKQIKRSTREYEMDLVKRAKLAPKLLFSYVNRKQNIKTQIRAIKNK FT KDQVTTEISDISDILNRQFQSVFVNDNDQPIPEFLQRCENQCDCSKLSNMI FT TYEKILKMLEGIDESKAMGFDEISPFVLKHAAEGFVDPIVHIFKLSFDTGC FT VPYHWSVANISPIYKGGSKLEPENYRPVSLTSVICKMFEKLIRDTILQHLV FT SNRLISEEQTGFVPNKSCTTNLIETIDTITFETAKGKPLCVIYLDFAKAFD FT KVEHKRLLTKIKAYGITGKIYNWIESFLANRKQRVAISNTVSEWVSVTSGV FT PQGSVLGPMLFLLFINDLPEVVLNTFKMYADDSKIIAIEDSIDKQAEIQTD FT LDNIVKWCNNWGMELNCKKCKVMRFGNQKVFRGIDKTYTMNINGMIYDLSN FT SVLERDLGVFVQPDMRWKHQTQVCVNKASKILGLLKNAFESRDCYMWKILY FT TTYVRPHLEFAAPAWSPPNVAEISMLERVQRRATKIPYSNRMLSYEKRLVK FT MDLINLNLRRKRGDLIHLYKILNNLEFVNLNVKPVYKSSIKLDGPCSSIRG FT LKHKIRMQCFTPREINDFHASVSSRIHFFTNRVCQIWNKLPEKVVDSENLN FT SFKASLDEWLKKNKKILI*" XX SQ Sequence 4177 BP; 1562 A; 680 C; 734 G; 1201 T; 0 other; tatatacgga aaaaaacaca tactgaaact ttttagtccc tgtgtaataa gatataaaat 60 attctaataa ttctgaacgg gaggtaatcc actggtattc agagtggtag ccctcctata 120 aagaccaact caacatcaaa gaagtaatcc actgttgttt gtagtggttg tttctctttc 180 atttcattgg tggaaatcga ggattgagat atgactaaac tcactgacca acaaaagaat 240 tgtatagctt tgatgattaa caaagcattt gaagagctaa aggatgactt tcaagtgaaa 300 ctaaatcaac aaactaaact atttgaagtg aaattaaagg aaaaagatgc agaaatagct 360 gaacttagtg agaaagtaaa aagactagaa gaaaacaatg caaatttacc accttctagt 420 tcacctgctg tttgttacag ccaaattgct aagcaactag ccaaaccagg ttctgagctg 480 tataatgcta gcatcaaagt gtcaaaaagt catgaaatat tatccagtaa aaagtctaaa 540 aatgttgtaa ttgttggttt gcctaactca caaaaaacta taatcgagga tcgcaagaca 600 gatgacgaga acatggtcaa cgaattaata cgttcgctga acagacgaat aataataaaa 660 tcaacattca gattaaaatc taaagttccc ttggtaaaca atgcatctga accgttagtt 720 gtcgaatttg agaatgaagt agcacgaaac gagatgctgg ctgcagcaag ggaactgtca 780 actattgagg cctacaaaaa agtatatata cggcctgacc gcactcctga tgagcaggct 840 gaatttgcac aattaaataa agaaagaaag gaggcgaacg aggacttgca acgcaataat 900 aacctagacc agccctttcg atttgtcatt agaagcggaa aattactgtg catcgacgtc 960 actacaacaa aagacattgg caaccgcaaa gttcacccat ttgtcaaaga gagcgttgcc 1020 aagcaggccc ggaggggctc atatcagcca atcttataaa tcttaccaaa acctgagtaa 1080 tgtaaaactt cctatttatt taaagcaaaa tactagtact aactataaac caattaaatt 1140 ttattatact aatgctacct cctgtaactc taataaatta tctgagctgt ccacacttgc 1200 tttggcatat gaaattgatg ttattgcaat tgcagaaact tggtttaaat tggattcgat 1260 taaaataatt gatggctatg aactgtatca ggcagataga gatagaattg gaggaggtgt 1320 tgccatttat gtaaataaaa gacttacaaa tatagaaatt gtaaatgtgc atcttaacaa 1380 caaagagatc gaacaagtct ggtgctcaat aaattgtaat aattttcatt taattgttgg 1440 ttgtgtatat agaccaccaa actctacagc tgattactca aaattgattc tagactcaat 1500 cagaaccgct tcaaaactca aaagtggaaa aaaatacact gatctcatag tggtcggaga 1560 tttcaatttt cctaatatag aatggacaga tgatggtcca ataattcatt caaatggaaa 1620 ctcaattgaa catgaattta ttaaagtttt aaatgattgt tctctatacc aaatagtaaa 1680 ttttccaaca ttcaaacaaa atgactctgt ttcagcagtt aacacccttg atcttgtttt 1740 agtttgcaat ccaaattgca ttgattttct tcaatgtggt ccacagttag gccactatga 1800 aaagggaaga taccattact ccatcacttg gcaatatttt actagtctta gtaacaatac 1860 agtactcaca aaagataagt tttgctttaa aaaaggtaat ttttttcaaa tgaataaata 1920 cttgaaacaa gtcaactggg ttgagcattt ttataacaaa tcagcctgtg aatgttatga 1980 gttatttttg catacatata atgaatcttg tcgattgttt attcctattt tgaataataa 2040 aaaaaagaat agtcaaccct ggttggatgc aaaagttaaa aaagcaataa agattaaaca 2100 ttctatgttt tattcaaatt tgcatcggca aagtcaagaa tctcgaaagg cttttaacaa 2160 aatgtgtgct aatgttaaaa agcaaatcaa aagatcaact cgagaatacg aaatggattt 2220 agtaaagaga gcaaagcttg caccaaaact tttattctca tacgtaaata gaaaacaaaa 2280 cattaaaacg caaattagag caatcaagaa caaaaaagat caagttacaa ctgagatatc 2340 agacatttca gatatattaa ataggcaatt tcagtcagtc tttgttaatg ataatgacca 2400 accaataccc gaattcttgc aaagatgcga aaaccagtgc gattgcagta aattatctaa 2460 tatgattacc tatgaaaaaa ttctcaaaat gttagaaggt atagatgagt caaaagcgat 2520 gggttttgat gaaattagtc cgtttgtttt aaagcatgct gcagaaggtt ttgtggaccc 2580 aattgttcat attttcaaat taagctttga cactggatgt gtcccttacc actggtctgt 2640 tgcaaatata tcaccaattt acaaaggggg atcaaaactt gaaccagaaa actacagacc 2700 agtatcccta acatctgtga tttgtaaaat gttcgaaaag ctcattcggg acacaattct 2760 gcaacattta gtttcaaata gacttatatc tgaggagcaa actggttttg ttccaaacaa 2820 atcatgcaca acaaatctta ttgagacaat tgacacaatt acatttgaaa ctgcaaaagg 2880 taaaccgtta tgcgtcatat atcttgactt cgcaaaggct tttgataaag tagaacacaa 2940 aagattattg acaaagatta aagcgtatgg tataactggt aaaatatata actggattga 3000 aagttttctt gcaaatagga aacaaagagt tgctattagt aatacagttt cagaatgggt 3060 atcagttacc agtggcgtgc ctcagggttc cgttttgggt ccaatgttgt ttctgctatt 3120 cataaatgac ctcccagaag ttgtgctaaa cacatttaaa atgtatgctg atgatagcaa 3180 aataatagcc atagaagatt ccattgataa acaagcggaa atacaaaccg accttgacaa 3240 cattgtgaaa tggtgcaaca attggggtat ggaactaaac tgcaaaaaat gtaaggtgat 3300 gagattcggc aatcagaaag tttttcgagg tattgataaa acatacacaa tgaatataaa 3360 tggaatgatt tatgacctct caaacagtgt attagaaaga gatctaggtg tctttgtaca 3420 accggatatg aggtggaagc atcaaacgca agtatgtgtt aacaaagctt caaaaatact 3480 ggggttattg aaaaatgcct ttgaaagtag agattgttat atgtggaaaa ttttgtatac 3540 tacttatgtg cgacctcatc ttgaatttgc ggctccagcg tggagtcctc cgaatgtggc 3600 cgaaatctca atgcttgaaa gagttcaacg tagggccaca aaaataccat actcaaaccg 3660 aatgctaagt tatgagaaac ggctagttaa aatggattta attaatttaa atctcagaag 3720 aaaacgagga gaccttattc atttatataa gatattaaac aatctagaat ttgtcaatct 3780 taatgtcaaa ccagtctata agtcaagcat taaattagat ggcccatgtt cttccattag 3840 aggtctcaaa cataaaatta gaatgcagtg ttttactcct cgcgaaataa atgactttca 3900 tgcatcggtt agctccagaa tccatttctt cacaaacaga gtttgccaaa tttggaacaa 3960 acttccggaa aaagttgtag attccgagaa tttgaatagt tttaaagcaa gtttagacga 4020 atggcttaag aaaaataaga aaatattaat ttaaaagaaa aagaaaaaaa aaaaaagtac 4080 tgggctgcta tagctgaacg ttacaacgtc tcagcttgtg gataagatcc acgcagttta 4140 ccactaacta actaactaac tatatatata tatatat 4177 // ID Gypsy-4_IS-LTR repbase; DNA; INV; 157 BP. XX AC ABJB010633793; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_IS_; KW Gypsy-4_IS-I; Gypsy-4_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-157 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010633793; Positions 4055 4211. XX SQ Sequence 157 BP; 41 A; 43 C; 40 G; 33 T; 0 other; tgatgtatac gccctcacta gatggcggga ccgacccgct ctgcaacgac ggagccgact 60 cgggaaccgt acccaggaag gctgctgcca ttccggccgc ttgtgttaat aaagtgttat 120 tgttatacgg aacccgaaat atccgaagta cataaca 157 // ID Vingi-1_APis repbase; DNA; INV; 2702 BP. XX AC . XX DT 01-FEB-2010 (Rel. 15.02, Created) DT 17-AUG-2010 (Rel. 15.12, Last updated, Version 4) XX DE A family of Vingi non-LTR retrotransposons from Acyrthosiphon DE pisum. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Ingi-1_APis; KW Vingi-1_APis. XX NM Ingi-1_APis; Vingi-1_APis. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2702 RA Kojima K. and Jurka J.; RT "Ingi non-LTR retrotransposons from insects."; RL Repbase Reports 10(2), 147-147 (2010). XX RN [2] RP 1-2702 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX RN [3] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [3] (Consensus) XX CC Originally classified as Ingi [1] and re-classified as Vingi [2]. CC There are only 5 sequences with >90% identity to consensus in the CC aphid genome. This consensus is likely 5'-truncated. The 3' CC termini are composed by (TAC)n microsatellite. XX FH Key Location/Qualifiers FT CDS 466..2700 FT /product="Vingi-1_APis_1p" FT /note="includes a part of endonuclease domain, and FT a complete reverse transcriptase domain." FT /translation="IYKANKFRKPKKKIVIGDFNSHNSAWGYNETNEDGEE FT VEKWTEANDLTLVHYTKLPSSFNSGRWRRGYNPDIIFVSNNIKQQCKKKVE FT APIPNSQHRPITCQIKAIIHPENVPMIRRYNFKKANWDNFKTQLDTEINKI FT TPIPENYGQFSNLVKNISKQNIPRGCRTQYIPGMGKENKELLENYINQFNN FT DPFAEETIITGESLVEKLSEARREKWRSLISETNMTQNSSKAWRMLRNLGN FT DPKNAESHINVTPNEIAHQLLKNGKCPGIKKVKPKIGREPKNEANLLQTPF FT TLEELDVAMARMKEKKAPGLDEIRTEQIKNFG*QTKRWLLELYNNCIKCVK FT IPVIWRKSRITALIKPVKQPTEANNFRPISLLCHTYKLPERLILXRINDIV FT DKSLINEQAGFRGGKSCTGQILNLTQNIENGYEEKLVTGAVFIDLTAAYDT FT INHRILFKKVYEITRDYNLTSFIAEMLRXRRYFVELQGKKSRWRTXKNGLA FT QGSVLAPLLFNIYTNDQPTPLGTQRFXYXDDLALTAQNHSFGXIESTLNNA FT LEEMSIYYKKNWLKPNPXKTQVCTFHLRNKEAKRKLRIIWEDVEIENTEYP FT KYLGVTLDRALSFKLHCGNVKQKTHARNNLIRRLTGTSWXADPNTVRTSAL FT ALCYSTAEYAAPVWARSCHAKKVDTALNETCRIISGCLKPTPVEEIQVLSG FT IAPPDIRREIASEIERHKQQNDPRHPLHGKSPPQPRLKSRKSFLT" XX SQ Sequence 2702 BP; 1094 A; 588 C; 488 G; 523 T; 9 other; gggggcatgg taagagcctg cacgcggcgc cccctgtata gcgatatttc taaaccagaa 60 tttatcgcta accaatctta ccaactccgg acggatctca atgtcaacca aaacgccaaa 120 acggttcttt tcaggaccag ccctcataac gacttcaata aatatcgaag gattctcaaa 180 caataaaagc gacatcctac aagaactatg ccaaaaaaat acatgcgacg ttatatgcgt 240 ccaagaaaca caccgagaca agtacaacat ccgtccgaag ataaagggaa tgaagttggc 300 aatatagaga ccacataaga aatacggcag tacgatattt gtaagagata atctaaagat 360 cctctcaaca agtcgtacag aaacaaacga catagaaatc ctcacagtag aacttacaaa 420 ctgcactgtc acatcagtat acaaaccacc taatatccca tttaaattta caaagctaac 480 aaatttcgaa aaccaaagaa aaaaattgta attggcgatt tcaacagtca caacagcgcc 540 tggggctata acgaaacaaa cgaagatgga gaagaagtag agaaatggac agaggcaaat 600 gacctaacat tggtacacta tacaaagctc ccatcgtcgt ttaatagcgg cagatggaga 660 agagggtata atccagacat aatctttgta agcaacaata tcaaacaaca atgcaaaaag 720 aaagtcgaag caccaatacc aaattcccag cacagaccaa taacctgtca aataaaggct 780 attatacacc cagaaaacgt accgatgata agacgctata atttcaaaaa agccaactgg 840 gacaatttta agacacaact ggacactgaa atcaacaaaa tcactccaat acctgaaaat 900 tatggccaat tctcaaactt agtaaagaac atatcaaaac aaaatatacc aaggggctgc 960 agaacacagt atatcccggg aatggggaaa gaaaacaaag aactacttga gaactatatc 1020 aaccaattta acaatgatcc ttttgctgaa gaaacaatta taaccggaga atccttagtg 1080 gaaaaactgt cagaagcacg acgagaaaaa tggagatcac taataagtga aacaaacatg 1140 actcaaaaca gctcaaaagc atggagaatg ctgagaaacc tgggaaatga cccaaaaaat 1200 gcagaatcac atataaacgt tacaccaaat gaaattgctc accaactcct aaaaaatgga 1260 aaatgccctg gcattaagaa agtgaagcca aaaataggca gagaaccaaa aaacgaagcc 1320 aacctactac aaactccttt cacgttagaa gagctcgatg tggcaatggc acggatgaaa 1380 gaaaaaaaag caccaggtct agatgaaatc agaaccgagc agattaagaa ctttggataa 1440 caaaccaaaa gatggctatt agaattgtac aacaactgca taaaatgcgt aaagatacca 1500 gtaatatgga gaaaaagccg aataacagca ctaataaaac cggttaagca acccacggag 1560 gcgaacaatt ttcgcccaat ctccctctta tgccacacat acaagttgcc agaaagatta 1620 atactcaama gaataaacga catagtggac aaatcactta taaacgaaca agccggtttc 1680 agaggaggaa agtcatgcac tggtcagata ctgaacctga cgcaaaatat tgaaaatgga 1740 tacgaggaaa aactggtgac tggcgcggtt ttcatagacc taacagcggc atatgatacg 1800 attaaccaca gaattctttt caaaaaagtg tacgaaataa caagggacta caacttaacg 1860 tccttcatcg ctgaaatgct aagaaawaga agatacttcg tggagttgca aggaaagaaa 1920 agccgctgga ggactcwaaa aaatggactt gcacaaggta gcgtattagc acctttgcta 1980 tttaacatat ataccaacga tcagccaaca ccattaggaa cccagcgttt cmtatacgmt 2040 gatgatctag cgctcactgc tcagaatcac tcgtttggaw atatagaaag cactcttaac 2100 aacgcactag aggaaatgtc aatttattac aaaaaaaatt ggttaaaacc aaaccccaam 2160 aaaacccaag tatgcacttt tcatctaaga aacaaagaag caaaacgaaa attaagaata 2220 atatgggaag atgtwgagat tgaaaatacc gaatacccaa aatacctcgg tgtcacacta 2280 gaccgagcct tatcattcaa acttcattgc ggtaacgtaa aacaaaaaac ccacgcgaga 2340 aacaacctga ttagaaggct cacaggaaca tcatggkgag cagacccaaa tactgtaaga 2400 acatcagcac tagcactatg ctactcaaca gccgaatatg ctgcccccgt atgggccaga 2460 tcatgtcacg cgaaaaaagt tgacactgct ttaaacgaaa catgcaggat aatctcagga 2520 tgcctcaaac caacaccggt cgaggaaata caagtactat caggaatagc cccaccagat 2580 attaggagag agattgcatc agagatagaa cgacacaaac aacaaaacga tccccggcac 2640 ccactacatg gaaagagccc gccgcaacca cgacttaaat ccaggaaaag tttccttact 2700 ac 2702 // ID Copia-53_AA-LTR repbase; DNA; INV; 121 BP. XX AC supercont1.140; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-53_AA_; KW Copia-53_AA-I; Copia-53_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-121 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.140; Positions 1411268 1411148. XX SQ Sequence 121 BP; 44 A; 22 C; 18 G; 37 T; 0 other; tgttagcagt aggcgtttgg acctgagttt aatcatagta accctagata ataaatgaaa 60 aaaataaatc ttgtcattca atgttcaact ctcaaaccaa acagtagttc tttctacgac 120 a 121 // ID Copia3-NVi_I repbase; DNA; INV; 4130 BP. XX AC AAZX01004352; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia3-NVi; KW Copia3-NVi_I; Copia3-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4130 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1129-1129 (2007). XX DR Genome; AAZX01004352; Positions 19163 15034. XX CC Positions [1633-2169] - Integrase core CC 'TTAAT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 181..4095 FT /product="Copia3-NVi_I_1p" FT /translation="MAGEISTRSLVKFNSKNFQHWKFQITAAFVANGLLGQ FT VDGTVPTLANKNDEEGQRWVKEDAKAMFLISSAMETEQMENLLVCETARNM FT WERLTSVHKMKSQTHKLLMSQRFHEYRMDPNDTIVQHISKVQNLARQLIDI FT GENIPDLVIMAKILASLPAKYRHFRTSWGVMESTRQTIELLQDRLIEEEAY FT KTAEEKEETALAATTRNATQGAPKFNAKKKKNFRKSKNNIQCYVCSEKGHY FT VRECPQKNSKQEKESRDNCALVASSRTATGHGQGDLAQRDAGVYFSEPTSE FT QRKKILNAKLEDIWLTDSGASAHITFRREWLVNYRPRYEGSTIVLGDNHEC FT AVVGEGSVRVKRLINGEWLDAQIDNVLYVPSMGKNLYSVEVCTTNGLNISF FT SGDVVNISREGEIIATGVKQTNLVYRMCLLVQTSEQREANVATADVKVWHK FT RMGHLNLRSFQSLVSKNLVKGVSVSGADEFFCEGCQIGKSHKLPFSKEVER FT STESGEMFHSDVCGPMSEVTLGGASYYVSFIDDATSYRVVYFIKHNSDMTD FT RFMTFEKLIRNKFDRTMKTLRSDNGKEYVNRRLEEYLESRGIVHERTAPYT FT PEQNGTAERENRTIVECARTMLEASKWPNTLWAEAVNTAVYTLNRVLTPSK FT NKIKTSYELWTGKKPDLSHLKVFGCSAYKYVPKIHTRKFDDQATKMIMVGY FT EDNSTNYILLCPENMKVSVARHVKFNEAGIGLEKAEKTEEVDDEDKEDLLL FT VPETNKEVREDVNNPPRNPEKVKEVENEDNGIGIRPSENAAKGAAESVSKR FT QTSIPPAQRELRDRTKLQKPAKSLVNIAEIKLPTTFREATSGPDSEKWREA FT IQEELGAHERIGTWVIVPRQENTRIIKSKWVFKVINNQSTGEARFKARLVG FT KGFMLEEGIDYSETYAPIVRYDSLRVLLAYITLEDMEMVSFDVCAAFLYGD FT LEEEIWMEVPEGVHVEKVDSGSVVCLLRKSLYGLKQAPRCWNVKFSNFLRR FT FNFKEREADKCVYVGELEGHTVYLALFVDDGLLASKSSQIINKILNELRSE FT YSITVGDASYFVGLQISRNRTEKLMFVNQNAYTQEILAKYGMGNAKCMSVP FT VDPNVHLLAADPDCKNDCKAPYREAIGSLMYLAIVSRPDIAYAVNLLSKFV FT CNFNEAHWAAVKRVFCETWSSQRQKMVTLSTTESEYVAAAAAAKEICWLSK FT LENGIGCRCEDGVTLLVDNQSAIRLAKNPQYHKRTKHIDIRYHFIREICES FT GEINIVYVASENQLADIFTKALPRDRFLHLRQSIGLVSEKELLE" XX SQ Sequence 4130 BP; 1362 A; 754 C; 1030 G; 984 T; 0 other; ggttatgggc ccagacattg tctggaagcc aaagtaagag agactataag tgagtagtac 60 ataaagacgc gagttagtcg ggcagtgaca tcaagccaaa aactgttcgc tcaagtgtgt 120 ttctcgttcg tacccatgtc atataaactt ctacacgaga caaggtgtga aatttgagag 180 atggcaggag aaatatcaac tcgtagccta gtgaagttca acagtaagaa tttccaacat 240 tggaaatttc aaatcacggc ggcttttgta gcgaacggtt tactcgggca agtggatggc 300 acagtgccta cactggctaa caaaaacgat gaggaaggac aacgctgggt caaagaagac 360 gccaaggcta tgttcctaat ttcttcagcg atggaaacgg agcaaatgga aaatttactc 420 gtgtgcgaga cagctcgaaa tatgtgggag agacttacgt ccgtccataa gatgaagtct 480 caaacacata aactgctaat gagccagagg tttcatgaat accgcatgga ccccaatgac 540 accattgtac aacacatatc gaaagtgcag aacttggcaa gacaactgat cgacattggt 600 gagaatattc cagatctggt tataatggcg aaaatactcg caagtctacc tgccaagtac 660 cgacattttc gaacatcttg gggagtcatg gagtcaactc gtcaaactat cgagctttta 720 caagatcgac tcatcgagga agaagcctac aagacagcgg aagaaaaaga ggaaaccgcg 780 ctggcagcta cgacgagaaa tgctactcaa ggtgctccaa aatttaacgc gaagaaaaag 840 aaaaatttcc gtaagtctaa aaacaatatt cagtgttatg tgtgctccga aaagggacat 900 tacgtgcgcg aatgtccgca gaagaatagt aaacaagaga aagagtctcg cgacaattgt 960 gcgctggtcg cttcgagcag gacagcgacg ggtcacgggc aaggtgactt agcacagaga 1020 gacgctggtg tttatttttc ggaaccaact tcggagcaac ggaaaaagat tttaaacgca 1080 aagctcgaag atatctggct taccgacagc ggtgcgtcag cccacataac tttcagaaga 1140 gagtggctgg taaattatcg accgaggtac gagggcagca ccatcgtgct cggtgacaat 1200 cacgagtgtg cagttgtggg agaaggctcg gttcgtgtta agcggctgat aaacggcgag 1260 tggctagacg cacaaataga caatgttctc tatgtgccaa gcatgggcaa aaatctatac 1320 tcagtggaag tctgtacaac aaatggattg aacatcagct tcagtggaga cgtcgtcaac 1380 atctcgcgcg aaggcgagat tattgcaaca ggtgtcaagc agacgaactt agtctatagg 1440 atgtgcttgc tagtacagac gtctgaacag cgggaagcta atgtagccac agctgacgtg 1500 aaagtttggc acaagaggat gggacactta aatctccgtt cttttcaaag tctagtgtcc 1560 aaaaatctag taaaaggagt cagtgttagt ggtgcagacg agttcttctg tgaaggctgt 1620 caaataggaa agtcacacaa actacccttc agcaaagagg tagaaaggag cactgagtcg 1680 ggtgaaatgt ttcacagtga tgtgtgtggt cctatgtctg aagtcactct aggaggagca 1740 tcatactatg tttcttttat agatgatgcg acttcataca gagtagtgta tttcataaaa 1800 cacaattctg atatgacaga tcggttcatg acatttgaaa aactcatcag gaacaagttt 1860 gacaggacta tgaaaacact aagaagcgac aacggcaagg agtacgtcaa cagaagactt 1920 gaagaatatt tggagtcaag aggcatagtg catgagcgaa cagcacctta tacaccagaa 1980 cagaatggta cggcggagag agaaaaccgt accatcgttg agtgtgccag aacaatgctg 2040 gaagcatcaa agtggccaaa tacactttgg gcggaagctg taaacacagc tgtatacaca 2100 ctgaacagag tgctgactcc aagtaagaac aaaatcaaga cgtcatatga attgtggact 2160 gggaagaaac cagatctgag tcacttgaaa gtattcggct gtagtgcata caaatatgta 2220 ccaaaaattc acaccagaaa gttcgacgat caggctacca agatgatcat ggtcggctat 2280 gaagacaact caacgaacta catactacta tgtccagaga atatgaaggt gtcagtagct 2340 cgtcatgtaa aattcaatga ggctggaata ggacttgaaa aagctgaaaa gactgaagaa 2400 gtagacgatg aagacaagga agatctgctg ctggtacctg aaacaaacaa agaagttaga 2460 gaagatgtca acaacccacc aaggaatcct gaaaaagtaa aagaagttga gaacgaggac 2520 aatggaattg gaatcagacc atctgagaat gcggcaaagg gtgcagccga gtcagtctcc 2580 aaacgtcaaa ctagtatacc acctgctcaa agggaactac gagataggac caaacttcag 2640 aaacctgcta agtctctggt aaatatagct gaaattaaac ttcctactac ttttagagaa 2700 gcaaccagtg gtccagattc agaaaaatgg cgtgaggcga tacaagaaga gctaggagct 2760 catgaacgca ttggtacatg ggtcatagtg ccaagacaag agaacacaag gatcattaaa 2820 agcaaatggg tcttcaaggt catcaacaat cagtcaactg gtgaggctcg tttcaaggct 2880 aggttggttg gcaaaggatt catgctggaa gaaggtatag actacagtga gacctacgct 2940 cccatagtga ggtatgactc tctacgcgta ctcttggctt acatcacatt agaagatatg 3000 gaaatggtgt cattcgatgt ctgtgctgct ttcctgtacg gtgatctgga agaggagatc 3060 tggatggaag ttcctgaagg tgtacatgtc gagaaagtag acagtggtag tgttgtgtgt 3120 ctgttgcgaa aatccttata tggattaaaa caagcacctc ggtgctggaa tgtaaagttt 3180 tctaatttct tgcgtagatt taattttaaa gagcgggaag cggataagtg tgtgtatgtt 3240 ggagagttag aaggtcacac agtgtatctg gccttatttg ttgacgatgg cttgcttgca 3300 agtaagtcgt cacaaattat aaataaaatt ctgaatgaat tacgtagtga atactccata 3360 actgtaggag atgccagcta ctttgtggga cttcaaatta gtaggaatag aacagagaaa 3420 ttgatgttcg taaatcagaa tgcgtatact caagaaattt tagctaaata tggtatgggt 3480 aatgctaagt gtatgagtgt tccggtcgat ccaaatgtac atctgcttgc agcagatcct 3540 gactgtaaga atgactgtaa ggctccatat agagaagcca taggatcact tatgtatcta 3600 gcgattgtct ctcgaccaga tattgcttac gcagttaatc tgctaagcaa gtttgtgtgt 3660 aattttaatg aagcccactg ggcagcagtg aaaagagtgt tttgtgaaac gtggagttct 3720 cagcgacaga aaatggtaac gcttagtacc acagagtcag agtatgttgc tgcagctgct 3780 gctgcaaagg aaatctgttg gttgagtaag cttgaaaatg gaatcggttg tcgatgtgaa 3840 gatggtgtaa ctttgttagt tgataaccaa agtgctatta gattagctaa aaatcctcaa 3900 tatcacaaga gaaccaaaca catcgacata cgataccatt ttataagaga gatatgtgag 3960 agtggagaaa taaatattgt gtatgtagcc tctgagaacc agttagctga tatctttacc 4020 aaggctttgc ctagggaccg atttttgcat ctacgacaga gcataggtct agtgagtgaa 4080 aaagagttgt tagaataatg tagctggaga gcatacacaa aaggcggaag 4130 // ID Gypsy-51_AA-I repbase; DNA; INV; 7460 BP. XX AC supercont1.286; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-51_AA_; KW Gypsy-51_AA-LTR; Gypsy-51_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7460 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.286; Positions 1356486 1349027. XX CC Positions [4175-4636] - Reverse transcriptase CC Positions [5657-6133] - Integrase core CC 'TGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1551..3350 FT /product="Gypsy-51_AA-I_2p" FT /translation="MRVQDLGCCLLPNLLKTEGALSAPRSQCTGQAPVTAS FT AADRTVCTKYASLAFTPLDTATGLSTISPRCVECLSTHSWQRQALGMGILP FT ASTSTTLPARSRAQPGTTKPSLNLDFPNTGHASKWTSERQASSLAKSGSPD FT SPTYDLWPISAPEHVALSGLTNPCGCRTGYSPVSTGLAALRSNLLPPAQSA FT TLVIGSLSNMLPQSRLATIVILFVVSLLIWIHHRYYPRWSESEAPPPVEHW FT NLPDATFIPSDPPVPRAQSTLPTYAGNQNPAQRERRQPFFQPRKVIPVSQW FT KIQKYKADDQGLGMNEFLENVGQMALSEHVCEEELFDSAIHLFDGPALSWY FT TAMRSQNRLRDWNHLIQELQKAFRHPDLDAVLRVKIYQYRQQRNETFQQYY FT LHMEKLFRSMSQPMSESDKVEVLKMNLRFDYRKLLVGRQITSLQTLLNLGN FT DLDAADSSAFARVFGNSKKETCAINSGGVNPQNTRVFSNKNHNPGKSNQQS FT KGETNFKTYTASKQPKADAKPANQSKAKMDNPSPKHEDRGARDPFSKIVNN FT YRPPPNGECFNCRDNHNTSDCPLPRRLFCQICAFPGVAYKNCPYCSKNPTR FT ES" FT CDS 3587..6511 FT /product="Gypsy-51_AA-I_1p" FT /translation="MFNSTIFKRLSNVRLQDPAEPLDLVTASGDALEIAGE FT VLVPYTFQGKTRVLPTLFVPGLAVDCICGIDFWRAYRIRPTVATLAITSQT FT TKLNITVEDPPILTDKQKEILNQVRQSFKLASPEKLDTTPLIEHEIILGGE FT FQNSKPVRQYPYPISPKIQAGLFSEIDRLLARGIIEESNSDWSLNIVPIRK FT SSGDIRLCLDARKLNERTIRDAYPLPHPGRILGRLPQARFLSTIDLTEAFL FT QIPLARGSRKYCAFSVQGKGMFQFTRLPFGLINSPATLARLMNRVLGQGVL FT EPNVFVYLDDIVIVTETFEEHIRLLNEVARRLIEANLSIKLEKSHFCVAEI FT PFLGYILSPRGLHTNPEKIRPIVEYERPTTVTKLRRFLGMSNYYRRFIADY FT SRITAPLSELLKSNTKNLKWSVEAEEAFQNIKEKLITTPILASANFDHEFI FT IHTDASDQAIAGVLTQEIDGREHVIEYYSKKLTTPERSYHATEKEALAALL FT SIEHFRGYVEGSHFCLVTDSSALTFIMRTKWKTSSRLSRWSLTLQQFDLTI FT LHRRGKDNVVPDALSRSVLAVSMRSDSDWYNTLKRKVMESPEDYPDFKVED FT EQLMKFVFSSYPTDHRFDWKIVPAPEARKLIIASAHDDAMHIGVDKTIGKI FT RQKYYWPRLTSDVRLHVQKCAICKEIKPANAPTVPLMGDMRPSNQPWQIIA FT LDYIGPLPRTKSGKQYILVVMDLFSKWTQLHAFPSISVASLKTVLRDHWFF FT RNSVPSIVLSDNASTFTSKEFGALLENYGVKHWLTSRYHSQANPVERVNRS FT INTAIRSYARDDQRSWDTKLSEIEMIINSTVHSATGYSPFLVAKGQEVIVD FT GRDYNRFQSDEELSLRERVDRIRETSPKIFDLVAKNLKSAYNLSSQRYNLR FT HRKLAKPLNVGDKVYKRNTKQSNANEFYNAKLAAQYLPCTVVEKHGSSSYE FT LVDSSGRNIGIWPANLIKSA" XX SQ Sequence 7460 BP; 2085 A; 1841 C; 1689 G; 1845 T; 0 other; tttggcgccc aacgtggggc cggaataagg actggccgaa cgtttgtacc tgagccattg 60 gcagcaggcc gctgttggcc actccgttta gaaagtttcc attgagattc tgggagttca 120 gagccagact aactcccagg ctaagaagag ctccagcagc aatcgcatac tgcagcaaag 180 tggaagcagt cccctgctct agaactctta tctcgtcccg cggcaccatc tcaacctcat 240 cagatttcat ggcgtttagc cacctgtaga tggggttgga atcacttggt gcttgccaac 300 ggaaatggga gaagggctcc agcgtgcatg tgaaccagta gttcactgca ggggcagccg 360 gccttgcgtg ctcacgagtg cacaccaaag ccagcgcctt cacaaatcct aacgctagct 420 gttgcatttc accacgactg cgcgctacat ggactaaagt cgctcgcatc tcctccgctg 480 tgaaatccgc aatagcggga agagctccct gagccacacg ataatacacc gcggttggca 540 cggcgaacca cagaatgcgt gccgccaccg gattggcgcg atccggtatc gcgttgaata 600 cgcgctggga tcgcgacagg tggcgcaggc agttgatcat agaatcctgc aaatcggctg 660 cgagatagac gaatcggcgt gctgttacgt cagcttcaaa attcgtcccc aagtcagcgt 720 cggcaaaact acgtacataa aagtcgttgc catccagcca gacgtgccca agcgggtcac 780 ctggatggcg gtcgattatc tctcgcaaaa acattagcag ccagcctttc gcatacatgg 840 ccgtattgtc catgatgtca ggtgacaacg tgacgtaatt agccatgatt tgggctacgg 900 ttaggtccac gcgaatggct gcagctagag aagcggctaa agccctggta ttataattcg 960 tcttatcgga taaaaagccg gcaagcaggg tgtcatcggt ccgccctagt ctatgaccgc 1020 cgatgacaga caagttcaca ttggccacat cgacgtgcca cagtgagaag tccccttgtg 1080 ccacccactc attcgtttgg gtataagggg catttgggat caaatgagtc acaacaccca 1140 ttgacccatt gacaacatcc tcaacaatgt tgtcacctaa agaagtagaa attcgattcg 1200 ctagccaaga tagaatattc tgctgggatg cgctcgatgc caaagagcgg ttcatctcag 1260 cgcaaccatt gccgacgatc gaatggtcac cgaagtgagc aaggcgtggc gcggttggct 1320 tagaaccggt tgcaatggcc ttccccttgg atcgcactcg tttgcggccc ttcgtgccgc 1380 ccgacgtcgc atggctgttg tcaatagtca aattcgatag tgagcccccc ggatatggtg 1440 ggggcccgcc agctggaagt tcagaaatgc ggattttgga gaacttagaa catgtcggat 1500 tgatgccagg cttccccaaa gctacctgga ccccgacgtg acggtgtttc atgcgtgtgc 1560 aagacctggg gtgctgcctc ttgcccaact tgctgaaaac tgaaggtgct ctttcggcac 1620 ctcgttccca gtgtactgga caagcgcctg tcacagcatc agcagcggac aggacagtgt 1680 gcacaaaata tgcttcactg gcctttacac ctctggatac tgcgacaggg ctcagcacaa 1740 tctccccgcg ctgcgtggag tgcctaagca cccactcatg gcagaggcaa gcactaggca 1800 tgggaatact cccagctagc acatccacta ccctaccagc acgttctcgg gcccaacctg 1860 gcacaaccaa accgtctctc aacctagact tcccaaatac tgggcacgca tcaaaatgga 1920 cttcagaacg ccaagcaagc tccttggcga aatccggcag cccagactcc cctacatacg 1980 acctttggcc tatttcagcc ccggagcatg tcgccctctc cggattgacc aacccttgcg 2040 gttgtaggac ggggtacagc cccgtctcaa ccggcttggc agcccttcga tctaacttgc 2100 ttccaccagc gcaatcggcg acgctagtca ttggttcctt gtctaacatg cttccacaat 2160 cgcgactcgc aacgattgtc atactgtttg tggtcagttt actcatctgg atacaccacc 2220 gctattatcc ccggtggtcg gagtcagaag ctccaccacc ggtggagcac tggaacctcc 2280 cagatgcaac tttcattcct tctgaccccc cagtccccag agctcagtcc actcttccaa 2340 cgtatgctgg aaaccaaaat cctgctcaga gggaacgacg tcaaccgttt ttccaacctc 2400 gaaaagtgat cccggtatca caatggaaaa tacagaaata caaggccgat gatcaggggc 2460 tgggtatgaa tgaatttctg gaaaacgtgg gtcagatggc tctttctgaa catgtctgtg 2520 aagaagagtt gtttgactca gcaattcatt tatttgatgg ccccgcactc agctggtata 2580 cggccatgag aagccaaaac cgactgaggg attggaatca tctgattcag gaactgcaaa 2640 aagcatttcg tcaccctgat ttggacgccg tgttaagagt caaaatctac caatatcggc 2700 agcagcggaa tgagactttc cagcaatatt acttgcacat ggaaaaactc tttcgcagta 2760 tgagtcaacc aatgtctgag tcagacaagg ttgaagtact aaaaatgaat ctgagattcg 2820 actatcgtaa gctgttagta ggaaggcaaa taacttcctt acagacattg cttaaccttg 2880 gaaacgacct tgatgccgcg gactcctcgg cctttgccag agtattcgga aactctaaaa 2940 aggaaacctg tgcgataaac agtggagggg tgaatccaca gaatacacga gtgttctcta 3000 acaaaaatca taatccggga aaatccaatc aacaatcaaa aggtgaaacg aatttcaaaa 3060 catatactgc ctcaaagcaa cctaaagcgg atgccaaacc tgctaatcaa tctaaggcaa 3120 aaatggataa cccatcccct aaacacgaag acagaggagc cagagatccg ttttcaaaaa 3180 ttgtgaacaa ttaccgccct ccacccaatg gggaatgttt caattgtcgg gacaatcaca 3240 acactagcga ctgtccattg ccaagacggt tgttttgcca aatttgtgcc tttccaggag 3300 tagcgtacaa aaattgtccg tactgctcaa aaaacccgac acgggagtcc taaagtcgca 3360 ggttcctcgt ctagcgccaa tagaaaacca cttcaagaaa cttattttgg agctcaaaga 3420 tcactatgag ccccgtttgc ccgaaactga gcacgaaaac cgtctgccac aagtaaataa 3480 gatcactctt acctccccag atggcgagga tgatcgtcct cacgtacatt tgaaaatctt 3540 tgacgtgcct gtttatgcgc tcttagacag tggaagtcat aggactatgt tcaattccac 3600 tattttcaag cgactctcta atgtccgtct tcaagatcca gctgaaccgc ttgatctagt 3660 cacggccagt ggtgatgctt tggaaatcgc aggagaagtt ctagtaccct acactttcca 3720 agggaagacg cgagttttac caacactgtt tgtcccagga ttagccgttg attgcatttg 3780 tggcatcgac ttttggcgag cttatcgtat tcgaccgaca gtcgccaccc tggccattac 3840 ttctcagaca accaaactca acatcacggt tgaagatccc cctattttga ccgacaaaca 3900 aaaggaaatc ttaaaccagg tgaggcaatc ttttaagctt gcttcccctg aaaagctgga 3960 tactactccg ctaatagaac atgaaatcat tctcggaggt gaattccaaa atagtaagcc 4020 tgtccgacaa tatccttatc cgatttctcc aaaaattcag gccggtctgt tctctgaaat 4080 agatcgttta ttggcccgag gaataattga ggaatctaat tcagattggt ccctgaatat 4140 agttcctatt cggaaaagtt caggggatat ccggctatgt ctagacgctc gcaaacttaa 4200 tgagcggacg attcgcgatg cttatcccct acctcatcct ggacgcatcc ttggtcgact 4260 gccacaggcc cggtttttga gcacaataga tctaacagaa gcatttcttc agatcccttt 4320 ggctcgtggt tcacgaaaat attgtgcttt cagcgttcag ggtaagggga tgttccagtt 4380 caccaggctc ccattcggcc tcatcaacag tccagcaaca ctggcccggt tgatgaaccg 4440 agtgttgggg cagggtgtac tggaaccaaa cgtcttcgtg tacttagacg atatagtcat 4500 agtcacggag acgtttgaag aacatattcg cctactcaac gaggttgcga gacgcctaat 4560 agaggccaat ttgtcaatca aattggagaa atcccatttc tgtgtggctg aaatcccttt 4620 tttgggatat atcctgtctc ctcgtgggtt gcacaccaat ccagaaaaga tcaggcctat 4680 cgtggaatat gagagaccaa cgactgttac taagctccgc cgatttctcg gcatgtcaaa 4740 ttactatcga cgttttatcg ccgattatag tcgcatcacc gccccccttt cagaactttt 4800 gaagtcaaac acaaaaaatc tgaaatggag tgtagaggcg gaggaagctt tccaaaatat 4860 aaaggaaaaa ctcattacca ctcccatact cgctagcgct aacttcgacc acgagttcat 4920 tattcacact gatgctagcg atcaagctat tgcgggtgtg ttgactcaag agatagatgg 4980 ccgcgaacat gtgatcgaat actactccaa aaagttgact actcccgaac gtagctacca 5040 tgccaccgaa aaggaagcgt tggccgcatt gctgtctatc gaacactttc ggggctacgt 5100 ggaaggtagt cacttttgtc ttgtaaccga ttcatccgca ttaactttta taatgcggac 5160 gaaatggaaa acctcctcgc gattgagtag atggagcctg acactccagc agttcgacct 5220 caccatactc caccgtcgtg gaaaggataa tgttgtaccg gacgccttgt cccgaagcgt 5280 tctggcggtt tcaatgagat cggactcgga ctggtacaat accttgaaac gaaaagtaat 5340 ggaatctccc gaagattatc cggactttaa ggtcgaggac gagcaattga tgaagtttgt 5400 tttctccagt tatcctactg atcataggtt cgattggaag atcgttccgg cccctgaggc 5460 aagaaaactt atcatcgctt cagcacatga tgatgccatg catatcggag tcgacaaaac 5520 gatcggaaaa atccgtcaga aatattactg gcccaggtta acatccgatg ttcgcttaca 5580 cgtgcagaaa tgcgcgatat gcaaggaaat taaacctgct aatgccccta cagtcccatt 5640 gatgggagat atgcgcccct ccaaccagcc ttggcagata atagcccttg attatatcgg 5700 ccctttgcca agaaccaaat caggaaagca gtatatcctc gtggtaatgg acttattcag 5760 taaatggacc cagctgcacg cttttcccag tatatcagtg gcttctttaa aaacggtact 5820 acgggatcat tggttctttc gcaattcagt tccgtcgatt gtgttgtcgg ataacgccag 5880 taccttcaca tcgaaagaat ttggcgctct actggaaaac tatggagtga aacactggct 5940 tacgtccaga tatcactctc aagccaaccc agtagagcgc gtaaatcggt cgataaatac 6000 ggccattcga tcgtacgccc gagacgatca gcgatcgtgg gacacgaagc tgtccgagat 6060 cgaaatgatc ataaattcga cagtccattc agctacaggt tattcacctt tcttagtagc 6120 taaaggtcaa gaggtcattg tcgatgggag agactacaac cgttttcaaa gtgatgaaga 6180 actatctctt cgagagagag tagaccggat aagagaaact tctccaaaaa tctttgattt 6240 agttgcaaaa aatttgaaaa gcgcatacaa cttgtcgagc caacgttaca atttgcgcca 6300 ccgaaaattg gccaaacctc tgaacgtggg agacaaagtg tacaagcgaa acactaagca 6360 atccaatgca aatgagttct acaatgctaa gttggctgca caatatttgc catgcactgt 6420 cgtagaaaaa catggatcct cctcttatga gttagtcgac tcatcgggtc gaaacattgg 6480 gatttggcct gcaaatctga tcaaatctgc atagacagat ttatgggaac aatcaaacaa 6540 tagttctgta gttctaactc ctctgattcc tacctctcaa tttccatgga tgaacctaga 6600 aagaacaaaa gaattagttc atgactatgt ccctctgagg gtaatgacat gtagagctta 6660 ccggttgata gtgaatcagc tgtccaattt gaaggccact gcagggctgg atatgctgga 6720 tagcattaga accgtcgaaa atcggtaaat gttcccaaca agcaataaaa acttcatccg 6780 cgaattcgaa acaaaaaaga tgcatcattg aattatatcc atgcgtgcgc ggatcatgga 6840 ccatgctgat cagatatcga tcactcactg aagtcggatc attgtccgct gtctcggtta 6900 gccgaatcat gagtaaccga attgtgtata ttatatagta attgtataga tccgaagcaa 6960 tagtgccgaa gtctcaaaat tcgagcagtt tcctttgtag cagtttatat acagaaaatc 7020 agtttgcgtt cgagaattga acgatcataa gtttagggtg gatgaaatca agaaccgatc 7080 atctctccag tgtcaaataa aacgtcgatg gacttggatc ggtcttaaaa agttacgtaa 7140 cgtcaaccgc gcgtgtaaat atttaagtgg cttcgttagt tataagttta atttcattat 7200 tagcatgttc aaaattcgtt tattttccgt ttgcgttcaa attgcatgta tttttcgttt 7260 cgttaaaata agtttatgaa aactctcgct aagtatttcg aagaagaaaa ttatctaggc 7320 ataataaaac ggtcgttcgt tcatatattg ccactcaagt agtggacgca ctagcttcat 7380 aagaaaaagt aagtgaggtg gtgtttgggt ttaacacaga aaaagaaata tttctttttc 7440 ttactcgacg acagggggat 7460 // ID Copia-25_CQ-I repbase; DNA; INV; 4045 BP. XX AC AAWU01017673; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-25_CQ_; KW Copia-25_CQ-LTR; Copia-25_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4045 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 365-365 (2011). XX DR GenBank; AAWU01017673; Positions 43428 47472. XX CC Positions [1447-1947] - Integrase core CC 'GGAGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 22..4044 FT /product="Copia-25_CQ-I_1p" FT /translation="MTSEKSDYPIFDGKDFADWKFRLELTLAERELLDHVR FT KEPKVEQLLLDNWVKVDVKAKNVIVKSLGREHSHIVRSEKSAHGMLKALSE FT VFEKSGAVQERHLRMKLNKMKCEEGQPLDVHFAAFDSVVMDLRAAGAAMTE FT TEQVTHLLMSLPRSYEHIVTVLLAMEELTLRKAKARLLNEEVRRSEMNEEE FT PTGRAAFLGAPKKFTGKCWRCNEPGHRAFECPKASGEEARNTEKRRPGPQK FT RAFNLSKREDDRGQEEDTSRAFAFVAARGTESLGSSGHAPREVLWIIDSGC FT TDHMTNDDKLFLVERRLSAVYPVSLAEKGQAIEATKIGFIASTSVVNGREY FT DCHISDVLFAPGLRHNLLSVSRLEKAGFVAVFREGGVELWAGDVLFAEGQR FT RNDLYELKFTVKERQSNVAAAVDTNALWHRRLGHIGMRSINHMVKKGMVTG FT IKPNITDTLGVCEPCVMGKHSRATFHRCERRASRPLELVHTDVCGPITPTT FT WDGKRYFVTFIDDYTHFTIAYLISTKDEVYECLKEYYSMVCAGFGSSVARM FT RCDNGGEYISRRCKEFCKEKGVVLEYTAAYTPEQNGRAERMNRTLVEKARS FT LIYESQLPKEMWGEAIMTALYLTNRSCTVALDDLTPTELWYGKKPDVSNLR FT VFGCRAYAHLPKCKRRKMDAKSEKLIMVGYSGNGYRLWNPEERKVKLSRDV FT IFDEAVLPGKEREPDQDPVSKQIVGEPVPEENAVEHEMTDDEEAATSDDEP FT EGASEAGAPESVEREAAAREADTREAHTRGADVQEADAREAEASEAGRRKS FT ERNKKRPVWHKDYQMDSTALLTNGIEDVPESYNSIEGRRDAADWYAAVRDE FT LNSLTENQTWDVVDLPRGAKVITSKWVFKKKVDKNGDLERLKARLVARGFQ FT QTAGVDYHDTFAPVAKLSTVRFLLAVGIQRSFEFWHMDVTTAFLYGELDET FT IFMKPPPGLDVEAGKVCELKRSLYGLKQSPKCWNNRLHSFLSSLNFERSDA FT DYCLYVRIAHTERLYILLYVDDLLICGNNSSEIAVLKQQLSAEFRMKDLGQ FT LEYFMGMRIRIDDAAGMVTIDQTAFAERILERFGMEACNAVATPLEPGVKF FT DSAESSDEVRTNFRQLIGSLMYLMLGTRPDLCFAINLFSRYQDKATCEHWN FT CVKRVLRYIKGTTGMHLMYTRKDEATPLVGYVDSDWANDPDDRRSTSGYLF FT EVYGNLISWTTKKQGLVTLSTAEAEYVAASQAVCEAIWFKKLFQDVGYPVD FT HPIPVYEDNQACIFISKNPESKRTKHVEVKYHYVREKVWRKEVELRYISTK FT EQKADILTKPLARVAFEKMRLELGLERGG" XX SQ Sequence 4045 BP; 1074 A; 907 C; 1224 G; 840 T; 0 other; ggttatggga accagcacga aatgacctcg gaaaagtcgg attatccgat cttcgacggc 60 aaggacttcg cggactggaa gtttcgcctc gaactgacct tggcggagcg ggagttgctg 120 gaccacgtta ggaaggagcc taaggtggag caactgctcc tggacaactg ggtgaaggtc 180 gacgtgaagg cgaaaaatgt catcgtgaag tcgcttggcc gggagcactc gcacatcgtg 240 cgctcagaga agtccgcgca cgggatgctg aaggccctgt cggaggtttt tgagaaaagt 300 ggagctgtgc aggagcggca cttgcggatg aagttgaaca agatgaagtg cgaagagggg 360 cagccactgg atgttcactt tgcggccttt gattcggttg tgatggatct gcgtgccgct 420 ggagctgcta tgacggagac ggaacaagta acacatctgc tgatgtcgct tcctcgatcg 480 tacgagcaca tcgtgacggt gcttttggca atggaagagc tcacattgcg gaaggctaag 540 gccaggttgc tgaatgaaga ggttcggcgc tcagagatga acgaggaaga acccacggga 600 cgagcagctt ttctcggagc ccccaaaaag ttcacgggaa aatgctggcg ttgcaacgag 660 ccaggacaca gggcctttga atgcccgaag gctagcggag aggaagcaag aaacacggag 720 aaaagaaggc cgggcccgca gaaacgggcg ttcaacctca gcaagaggga ggatgatcgc 780 ggccaggagg aggatacgag ccgagcattt gctttcgtcg cggcgcgagg gactgaatca 840 cttggcagtt ccggacacgc tccgagagag gtcttgtgga tcatcgactc gggatgtacg 900 gatcatatga ccaacgatga caagctgttt ctcgtcgagc gtcgactgtc cgcggtgtac 960 ccggtgtcgc tcgcagagaa gggacaggcc attgaagcga caaagatcgg gttcatcgct 1020 tccacaagtg tagtcaacgg ccgagaatat gactgtcaca tctcggatgt tcttttcgca 1080 cctggtttga ggcacaacct cctgtccgtc agcagattgg agaaggctgg atttgttgca 1140 gtttttcgtg aaggtggcgt tgaactctgg gctggagatg tactttttgc ggaaggacaa 1200 cgcaggaacg atttgtacga gcttaagttc acggtcaagg agcgtcagtc gaacgtagcg 1260 gccgcggtgg acacgaacgc gctctggcat cgacgacttg ggcacatagg catgagaagt 1320 ataaaccata tggtgaagaa aggtatggta accggcatta aaccaaacat tacagacact 1380 ctgggcgtat gcgaaccgtg tgttatgggc aaacacagtc gagccacctt tcaccggtgt 1440 gaacgcagag cttccagacc actcgagcta gtgcatacgg acgtgtgcgg tccgataact 1500 ccaacaacgt gggacggcaa gaggtatttc gtgacgttta ttgatgatta cactcacttt 1560 accattgcct atttgatcag cacgaaggat gaggtctacg aatgcttgaa ggagtactac 1620 tcgatggtct gtgctggatt tggctcgtcc gtggctcgaa tgagatgtga caacggtggt 1680 gaatacatct cccggcgatg taaggagttc tgcaaggaga aaggagttgt tttggagtat 1740 acagctgctt atacaccgga gcaaaatggc agagccgaac gaatgaatag aacgctagta 1800 gagaaggcaa ggtcgctgat ctacgaaagt caactgccga aggagatgtg gggcgaagcg 1860 attatgacag cattgtacct gacgaaccgc agctgcacag ttgcgctgga tgacctaacc 1920 ccgactgagc tgtggtacgg gaaaaagcca gatgtgtcga acctcagagt gtttggatgc 1980 cgtgcctacg cccatttgcc caagtgtaag cgccgcaaga tggatgccaa gagcgaaaag 2040 ttgatcatgg tcggttactc tggcaacggg taccggctct ggaacccaga ggaaaggaag 2100 gtaaagctgt cccgagacgt catttttgat gaagcggttc tcccgggaaa agaacgtgag 2160 cctgatcaag atccagtgag taagcaaatc gttggggaac cggttccaga ggagaatgct 2220 gttgagcacg agatgacaga tgacgaagag gccgctacga gtgacgacga acctgaaggt 2280 gcgtctgaag caggtgcacc agaatcagtt gaacgcgaag cagccgcgcg cgaagcggac 2340 acgcgcgagg cacacacgcg aggagcagac gtacaagaag cagacgcgcg agaagcagaa 2400 gcgagcgagg caggacgtcg caagtccgaa cgcaacaaga agagacctgt ttggcacaag 2460 gactaccaaa tggactccac ggcactgttg actaacggta tcgaagatgt tccagaatcg 2520 tacaactcca tcgaaggaag acgggacgca gctgactggt acgctgcagt tcgtgatgag 2580 cttaactcgc tgacagaaaa ccaaacgtgg gatgttgtcg atctgccaag aggagccaaa 2640 gtgattacgt ccaagtgggt gttcaagaag aaggtcgaca agaacggaga tttagaacgc 2700 ttgaaggcac ggttggtggc aagaggcttc cagcaaaccg ctggtgttga ttaccacgac 2760 acttttgcac cagtggccaa gctgtcaacg gtgcgtttcc tgctggcagt tgggatacag 2820 aggagtttcg agttctggca tatggacgta accaccgcat tcttgtacgg tgagctcgac 2880 gagacgatct tcatgaaacc gccacctggt ctggacgtcg aagctggtaa ggtttgcgaa 2940 ctgaaacgat cgttgtacgg tttgaaacag tcaccgaagt gctggaacaa cagactccac 3000 tccttcttga gcagtttgaa cttcgagcgg tctgatgctg attattgcct gtatgtccgt 3060 atcgcccaca ctgaacgctt gtacatttta ctgtacgttg acgacctcct catttgtggc 3120 aacaacagca gtgagatagc ggttttgaag caacaacttt ccgccgagtt ccgcatgaaa 3180 gatcttggcc agctggaata tttcatggga atgcgtatca ggatcgacga tgcggctgga 3240 atggtaacta ttgatcagac agcgtttgcc gaaagaatcc tggaaagatt cggcatggaa 3300 gcgtgcaatg cagtggcgac cccactagag ccgggagtca agtttgattc agccgagagc 3360 tcagatgaag ttcgaacaaa ctttcggcag ctgattggaa gtttgatgta tctgatgctc 3420 ggaaccagac cagacttgtg cttcgcgatc aatctgttta gtcgctacca ggacaaagca 3480 acgtgtgaac actggaactg cgtgaaacga gtgctacggt acatcaaagg gacgaccgga 3540 atgcatctga tgtacacacg gaaggacgaa gcaacccccc tggttggata cgtggattcc 3600 gactgggcga acgatccaga tgaccgtaga tcaacgagtg gctacttgtt cgaagtttac 3660 ggcaacttga tatcctggac aaccaagaaa caaggcctgg tgaccttgtc gactgcagaa 3720 gcggagtatg tggcagcgtc gcaagctgtc tgtgaagcaa tctggtttaa gaagctgttc 3780 caggacgttg gatatccagt tgatcatccc atcccggtct acgaagacaa tcaagcttgt 3840 attttcatct cgaagaaccc ggaatcaaag cggacgaagc atgtggaggt taaatatcac 3900 tacgtgcgag aaaaggtttg gcggaaggag gttgaactgc ggtacatctc gacgaaggaa 3960 cagaaggcgg atatccttac gaagcccttg gcccgtgtgg ctttcgagaa gatgcggttg 4020 gagctgggtt tggaaagagg aggag 4045 // ID BEL-81_AA-I repbase; DNA; INV; 6736 BP. XX AC supercont1.280; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-81_AA_; KW BEL-81_AA-LTR; BEL-81_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6736 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.280; Positions 1395090 1388355. XX CC Positions [5790-6347] - Integrase core CC 'ATTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3516..6734 FT /product="BEL-81_AA-I_3p" FT /translation="MAERRLKCLERRLAKNPQLYDSVRQQVADYESKGYIH FT VVTKEEKAKFDPRRTWYLPLGVVLNPNKPGKVCVIWDAAAKVNGISLNTML FT LKGPDLLTPQLTVTFKFRERQVAFSGDIQEMFLQVGIREQDRSALLFVFRD FT SPSQPIITMASDVAIFGATCSPAQSQFVKNLNAAEHALEYPRAAAAITDRH FT YVDDYLDSVDTEEEAVELALEVAAVHRKAGFHIRNWVSNRPKVLEAIGEAN FT PATVKNLTMNNQSGFERLLGISWLPDEDVFCFKLGLQGDLEELAKGEIAPT FT KRMMHSFLMRIYDPLGLVGSLVIQGKILFQDVWRAQVEWDQQIPEDLFGRW FT KSWILVLKELDNVKIQRCYFQGYDRASLDSMELHIFVDGSAQAYSAVAYFR FT VQDKGLIRCSLVTSKTKVAPLQLLSTPRIELQAAVIGARLRKTIEEGHSIR FT IKRTCFWSDSSTVISWIRSDTRRYRQFVAFRVNEILNLSKVEEWRWLGTKS FT NVADEATKWGRGPNCKPDSRWMNGPAFLYEEESDWPSNELTLSEETTEEVR FT PAFVGSHLIAFPAIDVDRFSKYERLLRSIAFVHHFCYVATQRSRKQLRLQP FT GVLNSEDLRKAERSLWRIAQAEHYPDELAILKRNVDAEPAQRIALDKSSPI FT SKLPPLLDVHGIIRLDSRIAAAEYVSYDTRYPVVLPRQHQVTKLLLDWYHR FT KYQHANNETVVNEVRQKFYIAKLRVEVRLAARRCQWCRVYKVKPIAPKMGP FT LPRIRITPFLRPFTFVGIDYFGPYYVKVGRSSVKRWVALFTCLTVRAIHME FT LAYSLTSDSCKKAIRRFIARRGAPQEIYTDNGTNFVGASRELQNELQAINT FT TMSSTFTDVNTKWRFNPPAAPHMGGCWERMVRSVKTALGALPSARKLDDEE FT LMTLLAEAEHMVNSRPLTFVPLENELQESLTPNHFLMLNSSGVRQAVKAPA FT DARKALRGGWNLIQHMLDTFWQRWIVEYLPIIARRSKWFEDVPPIKEGMLV FT VIADQGSRNEWIRGRVIRTYPGKDGTVRRVDVETPNGVLCRPVNKLAVLDV FT IPDGDAGKDSGRHGGE" FT CDS join(848..1291,1295..2518) FT /product="BEL-81_AA-I_1p" FT /translation="MVNVQQYARLLEESQSVSNLVPTSEQLMTGTNPKIIK FT KITNPTPFEMWHRETCGLRRQRESEILYHGDDERKRYENRLQEIEASLKLQ FT REMEQKHLQENELRRRREMELINRLNRLEDQHAEERKQSKDAECALRKQLE FT ESQLRFQTLAERQQQLASQEEQLRRYREREHHFTQQLESFRLHDQHEQLQS FT VRSQKATMLQTQSAAVPLQQSDVASMGQASYVSNPRKTLVSEAQCNHLEQD FT SCAGFYSGRDTPFSSMSGRAGTYCTYNINQDLGPKHVGHQVQTQFNPYSPP FT ELNGLSSNNFLQGQFEPTRQHLAARQVIPKELPIFSGDPTEWPLFISSYKH FT STISCGYSDSENLLRLQRALKGVAKESVSSFLLHPSTVSQIMSTLQTLYGR FT PEQIVHHMVAKVRATPAPKADRLETLVQFGLVVQNLCGHLKAIGMDSHLSN FT PILLQELVEKLPPNIKFSWALYQEQQPIVNLSTFGDYMAKVTAATSGITNV FT FLTPKHPKDDNPRTKEKAFVNTHSAYEKKFGSTNGIQKGEYQQSANTKNSR FT SEEIRNCPM" XX SQ Sequence 6736 BP; 1986 A; 1453 C; 1732 G; 1565 T; 0 other; taatctcaaa gatttattac ggaccgtttg gagaatggag aaatcgttcg tccgtcaaac 60 tcggtcaaag acgcaggccg ccaatatcga aatatcttct ggacctgtag gccaagatct 120 ggatctgtcg gaagatgagt ttgtgccatc tataaccgat ttcgagcgcg gagaggacct 180 cgactgtgcg gtatgcacac gtcccaacaa tgccgagttg tatatggttc aatgcggaag 240 atgcaagcgt tggcaccatt tcacttgtgc gagagttaat tcgaaggagt tttgctgtgc 300 gaagtgttca tcgaagcgaa gttcagtacg gtcatttaga tctgtatctg gccggtctag 360 cgtgacaagc ggtcgcagtt agcacattga ccttgaactc caacgtttgg aagaggaaag 420 gcgtgctgag gaagaagtag aacgagagcg cttgcggcaa gagaaaattc ttatcgagaa 480 agccgctaag gaaaagttag accgtgaaaa gcagttcata gctagaaagc atgaactact 540 ccgtcaaaag gacgaagaaa acggaagtca gttgagccga cgaagtagca gaagcagcat 600 aaaaaaagtg gaggattggg ttcagcaaca tgtcacttcc gctagtggcc ccaacgacga 660 taaggtccct gttcagccga taaagcaacc atcggttgta tcgtctacgc cgctgggaag 720 cagtgaagtc agtaccggtg agcgtcaggc cgtcactacg cagcgagcat cgttacctcg 780 tacaataggt agcgttacga tcggggatag tccagagcgt accgatgtgg agttcccaaa 840 gctagaaatg gtgaatgttc aacaatacgc acgactattg gaggagtcac aatccgtttc 900 aaacctcgtg ccgacttctg agcaattgat gaccggaacc aatccgaaga ttatcaaaaa 960 gataacaaac cccactccat ttgaaatgtg gcatcgtgaa acttgtggac ttcgacggca 1020 gcgtgaatca gagattttat accatggcga cgacgagaga aagcggtatg aaaatcgatt 1080 gcaggaaatc gaagctagct tgaaactcca acgagagatg gaacagaagc acctgcaaga 1140 aaacgaactg aggcggagac gagaaatgga attgataaat cgactgaatc gtctagaaga 1200 ccaacatgcg gaagaacgga aacaatctaa agatgcggaa tgtgcgttga ggaagcaact 1260 ggaggagagt caactccggt ttcaaaccct gtaggcagag cgtcaacagc agctagcctc 1320 acaagaagaa cagctacgcc ggtatcgaga gcgagagcat cactttactc agcagttgga 1380 atcgtttcgt ctacacgatc aacacgaaca actacagtcg gttcgatcac aaaaggctac 1440 gatgttgcag acacaaagtg ccgcagttcc tctccagcaa agtgatgtag cttcgatggg 1500 acaagctagt tacgtgagta atccgcgaaa aacgttggta agtgaagccc aatgtaatca 1560 tttggagcaa gattcatgtg caggattcta ctctggtaga gatacaccct tttctagtat 1620 gtcaggtagg gcaggaacat attgcactta caatataaac caagatctag gaccgaaaca 1680 tgtagggcac caggttcaga ctcagttcaa tccgtatagt cctccagagt tgaatggtct 1740 tagtagcaat aattttctgc agggacagtt tgaacctact agacaacatt tagcagctag 1800 gcaagtgatt ccgaaagagc ttccgatatt ttctggggac cctactgaat ggccactctt 1860 cattagtagc tacaaacact caacaatttc ttgtggctac tcagattcag aaaatctact 1920 tcgtctacaa cgggctctga agggtgttgc caaagaatcg gttagtagtt ttctgcttca 1980 tccgtcaacg gtgtcacaaa ttatgtctac gctacagacg ttgtacggta ggccagaaca 2040 aatagtgcat catatggttg ccaaggtccg cgcaactcct gctccaaaag cagaccgcct 2100 cgaaacgttg gtccagtttg gtctagtagt gcaaaacctt tgtgggcatt tgaaagcgat 2160 tggaatggac agtcaccttt caaatcccat tttgctacag gaacttgtgg agaaacttcc 2220 gcctaatatc aaattcagct gggccttgta tcaagagcaa caaccgattg tcaacttgag 2280 tacgtttggc gactacatgg caaaggtaac agccgcaacc agcggaataa ctaacgtatt 2340 tctgactccg aaacatccga aggacgacaa tccaaggacg aaggaaaagg catttgtgaa 2400 tacccattca gcatacgaga agaagtttgg atctactaat ggtatccaga aaggagagta 2460 tcagcaatca gcaaacacga aaaacagcag aagtgaagag ataaggaact gtccgatgtg 2520 aaaatctcac gaacacgcca ctgaaaagtg taccgagttc aagcaactgt cggttaacga 2580 gcgctggaag ttcgtgaaag agcagaaact atgtagaaga tgtctcattg cacacactcg 2640 ctggccttgt gaaggggaaa tctgtggagt gaacgaatgt caaaaacgtc atcatcgact 2700 gcttcattac gaccccgaat ccgagaaaat gttggtcact gcgaacgcta ccgtcacaat 2760 ccatcgacaa cccgttgctt cgaccttgtt caaggtacta ccggtaacgc tttatggaag 2820 aaccgttcgc ttttcttgac gatggctcgt cagctacgct tctggaaaag gcgatagctg 2880 acaaactagg ggtggacggt aagatgctat ctttatgcat gcagtggacc ggtggaatcg 2940 ataagaaggt tgctacaact cagatcgcaa atctgggtat ctcggaactc gggagtaata 3000 tgatgcaaca gttgtccgaa gtctacactg tgaagaatct gggacttccg gaacagacag 3060 taaactatac cgaattggca aagcagtaca aacatctgcg gcgcttaccg gtaaagagtt 3120 ttgacagagc taaccctgga ttgctgattg gagtcaataa tattcacctc ctaaccacat 3180 ctaaagtacg tgaagggaag gaacatgaac cgatcgcagc gaagacacgt atcggatggg 3240 tagtttgtgg acatttgcgc ggagaagaga accaactcaa acatcgccaa atgcacattt 3300 gcgcggaatc caccgaactc gatcttcaca attacgttcg agagtttttc tcggtggaaa 3360 gcataggtgt tgctgttgcg ccaaatcttg aagggacaga agaccaacgt gcccgctgaa 3420 tcctggaaga aacgaccgtg cgtaagacga atggaaaatt cgagactgga ttgctatgga 3480 aacaggacga cttcgagttc cctgatagtc gaccaatggc ggaacgccgt ttgaaatgtc 3540 tcgaacgtcg tttggcgaag aatccacaat tgtatgacag tgtgcgtcaa caggtggcgg 3600 actatgagtc aaaagggtac atacacgtag tgacaaaaga agagaaggcc aaattcgatc 3660 cacgccgcac ttggtatttg ccgcttggag tggttttgaa cccgaataaa ccaggaaaag 3720 tgtgcgtgat ttgggatgcg gcagccaagg tgaatgggat ctcgttgaac acgatgttac 3780 tcaaaggccc ggatttgttg accccgcagc tgaccgtcac gttcaagttc cgggaacgac 3840 aagtagcatt ctccggggac atccaggaga tgtttctcca agttgggatt agagagcaag 3900 atcgtagtgc attactcttc gttttccgtg actccccttc ccagcccatt attacgatgg 3960 cctctgatgt agctattttc ggagcgacgt gctctccagc acagtcgcaa tttgtgaaga 4020 atctgaatgc ggcagagcac gccttggagt atccacgagc ggctgcagca attacggaca 4080 gacattatgt cgatgattat ctcgacagtg tcgacacaga agaggaagct gtcgaattgg 4140 ctttagaagt agcggcagtt catcgaaagg ccgggttcca cattcgaaat tgggtttcga 4200 atagacccaa ggttctggag gctattggcg aagcaaatcc agcaacggtg aagaatttaa 4260 cgatgaataa ccagagtgga tttgaacgtt tgctgggaat atcatggcta cctgatgagg 4320 acgttttctg cttcaagctc ggcctgcaag gagatctgga agaactggcg aaaggagaaa 4380 tagcccctac taaacggatg atgcatagtt tccttatgag aatatacgat cccttagggc 4440 tagtaggttc cctagttatt caaggaaaaa tactttttca ggatgtatgg agagctcaag 4500 tagagtggga tcaacaaatt ccggaagacc ttttcggacg ttggaaatcg tggatattag 4560 ttctgaaaga attggacaac gtgaaaatcc aacgctgcta tttccaaggc tacgatcgtg 4620 cgagtctcga ttcgatggaa ctccatattt tcgttgatgg aagcgcgcaa gcatattccg 4680 ctgtcgccta ctttcgcgtg caggataaag gactgatacg atgctcatta gttacgtcaa 4740 aaacaaaggt tgctccatta caattgttgt caaccccccg aattgagcta caagcagctg 4800 ttatcggcgc acgcctgcgc aaaacaatag aagaaggtca ttctataagg atcaagcgaa 4860 cttgcttttg gagcgactca agtacagtaa tatcctggat aagatcagac acgcgacgct 4920 accgacaatt tgtagcattt cgcgtcaacg agattttgaa cttgtcaaaa gttgaggagt 4980 ggcgatggct gggtacaaag tcaaatgtag cagacgaggc aacgaaatgg ggaagaggtc 5040 cgaactgtaa accagatagt agatggatga atggaccagc gtttttgtat gaagaggaga 5100 gtgattggcc gagcaatgag ctgaccttat cggaagaaac aacagaagaa gtacgtccag 5160 cgtttgtggg tagccatctc attgcctttc ctgctatcga tgtggatcgt ttctccaagt 5220 atgaacgact attgcgaagc atagcatttg tacatcattt ttgttatgtt gcaacgcaac 5280 gatcaagaaa acaacttcgc cttcaaccag gagttttaaa cagtgaggat ctacgaaaag 5340 ccgagcgaag cttgtggcga attgcgcagg cagagcacta tccagatgaa ttggcaatat 5400 tgaagcgtaa cgtggacgcg gagccagcac agcggatagc gctggataaa agtagtccta 5460 tttcaaaact accacctctg ctggatgtcc atggaatcat acggctcgat agccgcattg 5520 cagcagctga gtacgtcagc tacgacacca gataccctgt tgttctgccg aggcaacatc 5580 aggtaaccaa acttctactt gattggtatc accgcaaata tcaacatgcg aacaacgaaa 5640 ccgtcgtgaa tgaagtccga cagaagttct atattgccaa gttgcgggtt gaggtacgtt 5700 tggcagctcg tcgttgccaa tggtgccgag tttataaagt taagccaata gctccaaaga 5760 tgggaccgct tccgaggatc cggatcaccc cattcctacg gccatttacg tttgtaggta 5820 tcgattattt tggaccctac tacgtgaaag tcggccgcag ctcagtgaaa cgttgggttg 5880 ctctttttac ttgtctgact gtccgggcaa tacatatgga gctagcctac agtttgacgt 5940 cagattcctg caagaaggct atacgtcggt tcatcgctcg tagaggagca ccccaggaga 6000 tatatacgga caacgggacg aatttcgttg gtgctagtag ggagcttcaa aatgagttgc 6060 aagcaattaa cactacgatg agcagtacgt tcacggacgt aaacactaaa tggcggttca 6120 accctccagc agctccccat atgggaggct gttgggagcg catggtgcgc tctgtgaaga 6180 ctgcattggg cgccttacct tcggcgcgga agctggatga tgaagaactt atgacgctct 6240 tggctgaagc tgaacacatg gtcaactcac gaccattaac atttgtccca ctagaaaacg 6300 aattacaaga gtcgttgaca ccaaatcatt ttttgatgtt gaactctagt ggagtacgcc 6360 aggcagttaa ggctcctgcg gacgctagaa aagctttgag aggtggatgg aatttgatcc 6420 aacatatgtt agacacattt tggcagcgat ggatcgtcga gtatctgcca attattgcca 6480 gacgaagtaa atggttcgaa gatgtgccac cgattaagga aggaatgtta gtggtaatag 6540 cagatcaggg tagccgtaat gaatggatac gaggtcgagt gatccgtaca taccctggta 6600 aagatggaac agtgcgaaga gtagacgtgg agactcccaa tggagtatta tgcagaccgg 6660 tcaacaaact tgcggtactg gatgtaattc cagatggtga cgccggaaag gactccgggc 6720 gccacggggg ggagga 6736 // ID R4_AL repbase; DNA; INV; 4686 BP. XX AC U29445; XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Ascaris lumbricoides site-specific non-LTR retrotransposable DE element R4 in 26S rDNA, complete sequence. XX KW R4; Non-LTR Retrotransposon; Transposable Element; ALR4; R4_AL. XX OS Ascaris lumbricoides OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-4686 RA Burke D.W., Mueller F. and Eickbush H.T.; RT "R4, a non-LTR retrotransposon specific to the large subunit rRNA RT genes of nematodes."; RL Unpublished (1995). XX RN [2] RP 1-4686 RA Burke D.W.; RT "R4_AL."; RL Direct Submission to Genbank (19-JUN-1995)William D. Burke, RL Biology, University of Rochester, Hutchison Hall, Rochester, NY RL 14627, USA. XX DR GenBank; U29445; Positions 1 4686. XX SQ Sequence 4686 BP; 1269 A; 1058 C; 1365 G; 994 T; 0 other; ggggccggtg ggtttactca cttctgaccc accaccaacg gaacgaggga aagcagagct 60 ggggccctct tccgattggc atggaaccga cctccacgtg gtggccctgg gcaacggaat 120 tcaagagagg atttaatcct ctctatcatt tgcaagatgg atgagatcga ggtatccggc 180 aaacaggttc caagtgagca cctttcccat agctgggaat atgggttagg cgtcctctga 240 catataagag gaatcagact cgttcgcgcc cggtcattaa catcgatcag cgggagggcc 300 ggactgaagt aaatttcctg ttggcccgag tgcaggtgga gctcggaccc gaaaaacgat 360 ccctaagagg accacaaccc gaagggatgg acgcagtcgc cccggcacgt ttggtgttgg 420 ttatcctgga gtgttgtggg acgaatagct atgccttgtt caactaactc tttttttgaa 480 cgagggaccc ctgagcccca tcgtgagcca atctcaggga ctgactcttc ggagtcctta 540 ggtatgggta cccaccgttc acctcggcta aatgatgacg aggtgataaa cggccccaag 600 ggtcacgaaa gtgacccggt tcatgtagtg cgtgcgcctc gaacgctaca cccgaggcga 660 ttggaactcc caatcggagt aaataatctc ggagaagcct ctcaactacg gcaagattca 720 gcaatagctg aggaagccca attggagagc accgagaacc atgatggacg tcgccctcct 780 ttgagaggcg gcaggaagct ctggagtgaa aaggagatcg ccacattaag gagattgtgt 840 gaggcctatg gaaataggca ggtatgctgg aaagaggttc agaggaagtt tgcggacttc 900 cacgaggaaa gaacagtggc tgcgttagcc acgaagtggg gggctcttaa acgcccgaga 960 gctccgatgg ttggtgcgcc accgactcct gatcatgatc cagagcgtgg accagcgggc 1020 gagggagatg gaggtacaac ctcgcaagag aatgtgccta ccgatgatcc aatacccgcg 1080 aacggcccga cagagggcaa ggaatcggat gtgagaccag ccgtggcctg taggtgcacg 1140 gaaccggagg agcaactgat ggagtcggat gtgagaccgc ctgcagtcgt aagacttgca 1200 gatccggagc agcatacgat gaagtcgggt gtgaaaccgg ttgcacttga cgggtcagcg 1260 gatctggagg agcgtccgaa agagaaggat atcgagcaga tgggtgttga ctttgaaggg 1320 gaaccgaggt ttcgtgcctt tcggaaagcc ttttacgggt atttcagatg ggctgtcaac 1380 tcgtttgata gggaacctgt caagcgagtt aggcgggact gtcccaaggt attttacgct 1440 tatgcggatt accttatcgc aaccggaagt tctaaggcgt tgggaccgaa tcaatccagg 1500 attggtcgct tgaacggatt ggtctatgcg gcagctagaa caatccacca attctggaga 1560 gaagaggtag gccatcgtca gcaaggcgag aagggatggt acacgaaaac caaggcaacc 1620 cgtgaagacc ttcagatgct catctctatg atggaatctg aacttgcaag aagaaaggag 1680 aagaggaagc ccggcgcgaa agagctggag aatatccata agctagttgc aagacttgga 1740 acgcgcagca catcgggcat cgtcagaagg ctggagatga caagacagag gctcaaactg 1800 ttggaagaca gaataagttt gcatgagcaa gagaagaggc ggaaacgatt gcgcaagcaa 1860 tttgcggaaa ccccttctct aaaattactc acaaaaggag ccaaggatag gggcgatacg 1920 atggtaacca tgaaatctgt aatggacttc tggagaccaa taattggcag acgagttacc 1980 tccaatccgg accaattgca agtcttgaga gactggagag atgagcagaa gaaggcttat 2040 ccggcagacc tggatttaga aaaggccgat cttgaggaga aatatgaggg agcaatcagg 2100 agaatccaac cgtggaaagc tcccggtccc gacggattac atgcccactg gtggaaagct 2160 ttaccgtcgg ccaagaggct actgggtgaa ctggtggttg attggctgac aacaggtaag 2220 gttaccactg gctggatgtg ccgagggagg acaatcctga tccctaagaa gggtgatagg 2280 ggcgaccctt ctaattaccg acctataaca tgtttaaata catgttataa ggtgctaaca 2340 tcggtaatga attcagtcat tctgagtcac ctgagcagag gcgaagcttt accaatgaac 2400 cagcgagcaa tgcggaaacg cgagtggggt tgcacccacg ctatggtcct tgacagggcc 2460 atggtaatgg atgcaatggc tcaaaagaaa cactcattaa gtgtggcctg gcttgactat 2520 cgtaaagcat acgatagtgt gtcgcatgaa tatattcgct gggcgattaa ctccgtgaat 2580 ataccccgga gtgttcagct gacgctcaag aggctcatga gtgactggga gacacgcttt 2640 gagtcgacgc aatgccggcc gaagttaagg tctgacaaaa tgaaagtgct gaatggcatc 2700 tttcaaggtg actcgttatc accaaccctt tttgtattat gcatagcacc tatcagctac 2760 gcactcaata agggtgtcgg ccagtgtcaa tcctcatccg gctggagtgc aggttacggt 2820 tttgagattg gacatcagtt ctatatggac gatcttaaac tgtacgctag gacgcctgcg 2880 atgctagact cccaaatcca ggtggtgtct gaggtgtcgg aagcaatggg actccatttg 2940 aatttgagta aatgtgcgaa agcacattat gctccgcatg gggcgggcgg agcgcaagaa 3000 gctgtggaag gtgcagaagg atcgaggaag ggagaaatcc cgatactcgg gcttcgaagc 3060 acctataaat atcttggagt ggaacaacga ctccttccga tggaagtagc cctcaaagag 3120 ttcgaggata agtttatgga tcgagcagaa acaatcttcg ctagcgaact cacatggggg 3180 cagatggcca cagcgtataa tactatagct atcgctggtc tacgatacgt ctatagtaat 3240 acaaatggag catcaccaaa gcttctagaa gccctgaaaa gggcggccac cttagacacg 3300 cggataagag atcttttgag gcgacataaa tgtcggtttc gaaatagctt tgtcgagagg 3360 ctgtatatcc ctagagaatg cggtggatac gggttaaaat cagtggagga tacgctgcga 3420 gagagtatcc tcgctacgtg gagttacatc gccacgaacc cgcatttggc tggacaacag 3480 tatttcttcg agaggcttgc agcaagaggc aagcgcaccc cgatggcgga cggggtaaag 3540 atattgttgg atctgggagt ggaaccccag gtggacttga agcgaaggac ggtgaccgtc 3600 gacggtatag tcttcgaaga cccgaccaag cttcatcgat acctggtggg aaagctctta 3660 aaagcaagaa ctgaggcgag gattcgaaga tggaaggaag ccagcttagc tggacggttg 3720 gtaaatgaca caagtattga tatgcgacta tcatgcttgt ggatgaaaaa aggttttgtg 3780 agtgcgagga acctcagaga tgcgcttgct gtgcaagagg ggagtttgct tactagagca 3840 tgccctgctc taaagggtaa aggcggccaa gaagtttgcc ggtgttgcca tgcagcgccg 3900 gaaactgcag agcacataac atcagcctgc cgctattggc ttccaagtct ctacgttgag 3960 agacatgact cggtagcaag gaacctctat tacgtcatat gctgccgcta cggcataaca 4020 ccggtgcatt actcaaatag ggtatcaccg ctctcggaga atagccaatg ccgcgttctt 4080 tggaacatgg atatgcagac tcggacgcca atgaagcatc gaaagcctga tatagtcgtc 4140 tttgatctca agagggaaaa gatcctcatg ttcgaagttt cgatagccca tgccagcggg 4200 ttattgaaac agcgggaaat caagatcaat cggtatacgg tgaactccga agagttgcct 4260 gatgagacca taacaccgta tccgcctggg ccgaatttgg ccgctgacct cgctgccacc 4320 tatggttggc aggttgaatt tgccccagtg gtggttggca cgtgtggtga gcacgtacca 4380 gccgtcaaag aggacctgca aagaacgttg gatctaaaac ctcatcaagt cgaagccctt 4440 cttgaaagga tatcccgatc ggcggtgatc ggaacggcta gagtagtccg agcacacctc 4500 gcctgctcct agtcgctaag gggtccggaa atggtccggt cctgcgctac ccggttctgg 4560 tagcacgttc aagcgctcaa tcgcctgcct tgtaggcagt ccatctgtgg aagtcgcgct 4620 cttgatacag atgtggacgg atggaagcag atgatagagc cggtgacggc cctactagcc 4680 aaacgc 4686 // ID BEL-37_AA-LTR repbase; DNA; INV; 524 BP. XX AC supercont1.244; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-37_AA_; KW BEL-37_AA-I; BEL-37_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-524 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.244; Positions 1596698 1596175. XX SQ Sequence 524 BP; 182 A; 91 C; 91 G; 160 T; 0 other; tgtgacgacg gggtccctcg ttacgactac accgatggat gcgatgcata aaccgataac 60 cgacagcaaa tgaagatgac atattagggc acaaaaggaa gagagaaaac gtcattagaa 120 aaagtgaaag tcatttgata cgcagaacta cggattgcga aggaaattta ttgaagctct 180 aaaattatat attctatcta aaatagtatc ctaaaatcaa gtacggataa tttgttattc 240 tacttagcct aatctagtac ggttggctat caatactatc gtaagtactg tcttaattat 300 gaaatcttaa atctataact gaaattgttc attaggttgg aattagtgtg gaatcaattg 360 cagactcaat tttcgcttct tttttgtcgg ttcactaaaa atgtgagtat ctacatattc 420 aaattgtaaa atgaattact aaataaaccc ttcttgtagc ttgaagctta ctatcaacaa 480 atttcgcgtt tgctctcaag acttagtgca acctaacccc aaca 524 // ID BEL-51_AA-I repbase; DNA; INV; 6531 BP. XX AC supercont1.362; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-51_AA_; KW BEL-51_AA-LTR; BEL-51_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6531 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.362; Positions 701665 695135. XX CC 'CATCG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 84..1103 FT /product="BEL-51_AA-I_2p" FT /translation="MMPTTTRAAAKASAGGSQCICEKCKIPNHVANMVTCD FT RCRRWYHRTCAGVAESLNGKWTCKDCVLVVTISEASISGRTSSTSRSTRVQ FT LQLMRLEEEKKSQEKLILEQQEQDRIRQEKALAAKAALDKRYLDEKYALLI FT AEADDGEASSQRSRRSRLSRNSQVQEWIQDVDEAVGGVVNVDDIFPPISVG FT TEVNVPIGGMVAKYTGTIPKVPSNPLIGGVSEGEAHSSSRRAIDWLADNVA FT TEGMCSTTGMRDPITASTPIRSVGYCVSANKSTFNTYAIDVYTTNFNTTSV FT YPVDDSTTNVTSTPYNRSWFHGRVYQHFCLLSQYQGQQLIRHVLHYHQ" FT CDS 1563..3140 FT /product="BEL-51_AA-I_3p" FT /translation="MQNQQAMWGQFQQQLSARQVVPKDLPVFSGSPEEWPL FT FVSSYRNSTAMCGYSQSENLMRLQKCLKGKALEAVRSNLLLPSSVPKVMET FT LETLFGSPERLVQSLLNKVRSVPTPRAERLETLVNFGLVVQNLVGHLKAAN FT QEAHLTNPTLLQELVDKLPPHIRLDWALYKKNFGRVDLGTFCDYMSAITSA FT ASDVAHFTDFDGARSGGHEKQRKEKVVINAHVSAKPRKFDQKAKKTENQER FT PCYVCQSVRHRIKDCNKFKSLSIEERMKVVETHQLCMVCLVPHGKWSCKST FT RTCGVGDCTKRHHPSLHVGQQTPSDCSGTRQKSEAVINIHRRIESSTIFRI FT VPVVLYGKEAKLSTFAFLDEGSSSTLIDKEVADLLNLEGKLQPLCLTWTSK FT VSRHEAGSRMVSLRISEHESSESFPLIDVSTVSQLDLPVQTLRYEELSRRY FT SYLAGLPVESYDDAVPRILIGLENIKLSLPLNIREGRTHGPVAAKTRLGWT FT VFGSTDGRKLIRGPPFCTSARTPKKLSCTT" FT CDS 4106..6529 FT /product="BEL-51_AA-I_1p" FT /translation="MAWLPEEDVFVYFVKIPNGGECSGSAREVTKRSILRF FT VMSVFDPLGLISNLLVHGKVIIQDLWRAQVGWDDAIPSNILENWTRWIEQL FT TKLDHLRIPRCYFSGYNPDSYRTLQLHVFVDASESAFACVAYFRIIDHGQP FT RCTLVASKAKVAPLKPLSIPRLELQAAVIGSRLAKSIEGYHTLPVSRRFFW FT SDSKTVLSWIGSDARNYRQYVAVRIGEILDESKAEEWRWVPTKLNVADEAT FT KWGKGPVFHPDSRWFTGPNFLFQPENEWPQQQVALVDTEEEIRIVHVHRTK FT LMKPFIEYDRFSKWERMLRTTAYAYSFVDRYLALGKQGASQDKQTLRQEHL FT QRAERALWRLAQAEAYAVEIAVLQSGPDGKRSSKLEKSSPISKLSPFLDDY FT GVVRMAGRTEASELAPYDAKFPVILPKTHRVTELLIDWYHRRFGHHNSETV FT VNELRQRYHISALRTVVRRVAKRCQWCKVYKANPVVPRMAPLPEARVTPFV FT RPFSLVGIDYFGPYMIKIGRSQVKRWVALFTCLVVRAVHLEVTASLSTEAC FT KLALRRFIARRGAPNQIYTDHGTNFVGASRELANQISAMNQELSETFTDAN FT TRWFFIPPSSPHMGGAWERMVRAVKTAMESINNSRAPSEEVFQTMLCDAEA FT MVNSRPLTYVPLETSDQEALTPNHFILLSSNGVKQPEKAPVAEGEALRNGW FT NLCRSLLDQFWARWIREYLPDLTRRTKWHEETKPIQEGDVVFIVGETTRNL FT WPRGKVVKVIPGKDGRIRQADVQTGAGVLRRPVAKLAVLNILPSGNPAKSE FT QHYGGG" XX SQ Sequence 6531 BP; 1761 A; 1532 C; 1705 G; 1533 T; 0 other; atctttaaag atttcgactc gttcagcctg tgctgctgag ttagcaccga ttggttcgta 60 gaggtcacgc aaaggacgaa tccatgatgc ccactaccac ccgggctgcg gcgaaagcca 120 gtgctggggg gtctcagtgc atttgcgaga aatgcaaaat cccgaatcac gtagcaaata 180 tggtaacttg cgatcgatgt cgtcgttggt accatcgtac gtgtgcaggg gtagctgaga 240 gtctcaacgg aaagtggaca tgcaaagatt gcgttttggt ggtgactatc agcgaagcct 300 ctatttcggg aaggactagc agtacatctc gttcaacacg ggttcagttg cagctcatgc 360 gacttgaaga ggagaaaaaa tcccaggaga aactgatcct tgaacagcag gagcaggacc 420 ggattcgtca agaaaaggcc ctggccgcga aagctgcgct agataagcgg tatctcgacg 480 agaaatatgc gctgttgatc gctgaggcgg acgatggaga agctagcagt caaagaagtc 540 gtcgcagtcg attgagccga aatagtcagg ttcaggaatg gatccaggat gtcgatgaag 600 cggtcggagg tgttgtgaac gttgacgata tcttcccgcc gatttcggtt ggcaccgagg 660 tcaatgtgcc cattggagga atggttgcga aatacactgg aacaattccc aaagtcccat 720 cgaatccgtt gatcggtgga gtgagtgagg gagaagcaca ttcttcatcc agacgtgcga 780 tcgactggct ggccgataac gttgcgactg agggtatgtg ctctacaacc ggtatgagag 840 acccgatcac agcatcaact cctatccggt cggttgggta ttgtgtctca gccaacaagt 900 caacgttcaa tacgtacgca atcgatgtat atacaaccaa ttttaacaca accagtgtat 960 acccagtcga tgatagcaca accaatgtca cctctacccc atacaaccga agttggttcc 1020 acggccgcgt ttaccaacat ttttgtttgc tgagccaata tcaaggccaa cagctgatca 1080 ggcacgtgct tcactaccac caataagtga atccccatta ctaccctcga gcgttgaaca 1140 gcatgcagag acgagtcagt cgcattctac aacacaacaa ccactggccc cccgcacttt 1200 ggtgccgctg ccgtcgcagc caccgccggt atcgcagtcg tcgatcaacc catcatttca 1260 acccaacgag cagatatcga gcatgcaact accaccaaca tctcaagcgt cgttctctca 1320 gacgtcggtc tgtcagacgt cgttccagca gcagccgtat acactcgcag gtcaacaagc 1380 gacgatgtgc gatgagttag agaaacaatc gcaacctcca tctgtgtgcg agaatcggta 1440 tcatttactc cacaagcaca gcacccgcac gagtcggaga atttgtgcct gtccatcaat 1500 cattgtcctc gcatatatcg ctacaacaac cagcagagcc gtatgttacc cagctgcaac 1560 agatgcaaaa tcaacaggca atgtggggac aatttcagca acaattgtcc gccaggcagg 1620 tagtgcccaa ggacctccca gtcttttctg gtagtcccga ggaatggcct ctttttgtga 1680 gcagctaccg caattccact gctatgtgtg gctattctca gtccgagaac ttgatgaggc 1740 tacaaaaatg ccttaaaggc aaagcgttag aggctgttcg gagtaacctt ttgctacctt 1800 catcggtccc gaaagtaatg gaaacacttg aaacattatt cgggagccca gagcgactag 1860 ttcagtcgct gctcaacaag gtgcgcagtg tccctactcc gagagccgaa aggcttgaaa 1920 cgttggttaa tttcggtctc gttgtccaga atcttgtggg ccacctcaaa gctgccaatc 1980 aagaagctca cctcaccaat ccaactttgc tgcaggagct ggtggataag ttgcccccac 2040 acattcgttt ggattgggct ttgtataaaa agaactttgg acgagtcgat ttggggacgt 2100 tttgcgacta tatgagcgct atcacgtcgg ccgcaagtga cgtagctcac ttcactgatt 2160 ttgacggtgc tcgttccggt gggcatgaaa agcagagaaa ggaaaaggtg gtgatcaatg 2220 ctcatgtttc tgccaaaccg cggaaattcg accagaaggc gaagaagacg gaaaaccagg 2280 aaagaccctg ttatgtctgc cagagcgtca ggcatcgaat caaggactgt aacaagttca 2340 agtccttgtc catcgaagaa cgtatgaagg tggtagaaac tcaccaactt tgcatggttt 2400 gtttggtgcc gcatggaaag tggtcctgca aatcaacacg cacctgtgga gttggagatt 2460 gtacgaagag acatcatccc tcgctacacg taggtcaaca aaccccctcc gactgttctg 2520 gaacaagaca gaaatccgaa gccgtcatca acattcatcg tcgaatcgaa agctctacaa 2580 tattccgtat agtaccagta gtattgtacg gaaaggaagc aaaactatct acctttgcct 2640 tcctcgacga aggttcatcg tcgacactga tcgataagga agttgcggat ttgttgaacc 2700 tcgaaggcaa actgcaacca ttgtgtttga cttggacctc taaagtttcc cgtcacgaag 2760 ctggttccag gatggtttcg ctgaggattt cggagcacga aagcagcgaa agttttcctt 2820 tgatagatgt tagtaccgtc agtcaattgg atcttcccgt gcaaactctg cgatatgaag 2880 aattgtctcg tcgatactcg tatttggctg ggctaccggt ggaaagttat gacgacgccg 2940 ttccaagaat cctcattggg ttggaaaaca taaagttgtc gttacccctc aatatacgtg 3000 aagggcggac acacggaccg gttgccgcaa aaaccaggtt gggatggacc gtttttggga 3060 gcactgatgg ccgtaagttg attcgtgggc ccccgttttg cacatctgca agaacaccaa 3120 agaagctgtc ttgcacgacc tagtaaaagg ctatttcgcg atggagaatc tccgaagttt 3180 ccattgcttg tggacccgaa acagaagacg atcgccgagc gaaggagatt cttcgacaaa 3240 cgactatcaa gcgtgctgac ggtcactatg aaaccggatt gttatggcgg tacgatgttg 3300 tggagttgcc atccagctat aatatggcgg aacgtcgttt gctctgctta gagcgaaagc 3360 tgcggtctaa tccggaactg caagcgggct tggaaaagca gatttccgaa taccaggcca 3420 aaggctatgc ccacaaagca acgcctcagg agttggcggg aagtgatccg caacgtacat 3480 ggtaccttcc acttggggta gtgacaaacc ctcgcaagcc agggaagata cgcatcattt 3540 gggacgcagc cgcaaaagcg aatggtgtgt cactgaacga cgtactgttg aaaggcccgg 3600 acctgttaac atcgttgccg gcggttctgt gtcgttttcg gcaacgagaa gttgcgattg 3660 caggagatat tcgggagatg tatcaccagc taaaaatcaa gaaggaagac tgccaagttc 3720 agcgctttct ctaccgaagt gacccatcga aaaaaccgga tatctttgtc atggatgtgg 3780 cgatcttcgg gtcgacttgc tcaccctgct ccgccaattt cgtgaagaac atgaatgcac 3840 tggagtggaa agaaaaattg ccagaagctg ctgcagcggt gatagataac cactatgtgg 3900 atgattatct cgacagtcga gatacagaaa cggatatggc aatactggct tcggatgtgc 3960 ggaaggttca ggcagaggca ggtttcgaat tacgaaattg gcgttccaac tctaagaagg 4020 tattgcaggc cctcggagaa gacgcagcaa tctcaaggaa ggatttcagc atcgacagga 4080 aagccaggtg gaacgcgtcc ttggaatggc atggttacca gaagaagatg tatttgtata 4140 ttttgtcaag ataccaaatg gtggcgagtg ctcaggatcg gcaagggagg tcacgaagcg 4200 cagtattctg aggtttgtca tgagcgtctt tgatccactt gggttgatat caaacctgct 4260 tgttcacggt aaggtgataa tccaagacct ttggagagct caagttgggt gggatgatgc 4320 tattccatcg aatatcctgg agaattggac ccggtggatc gagcaactca ccaaattaga 4380 tcaccttagg attccacggt gctatttttc cggatacaat ccggatagtt atcggacgct 4440 tcagcttcat gtttttgtgg atgcaagtga gtctgcattt gcttgtgtcg cctactttcg 4500 catcatagat cacggtcagc cacgatgcac tttggtagct tcaaaggcca aggtagctcc 4560 attaaagccg ctttcaatcc ccagattgga gctgcaagcg gcagttattg ggagtcgatt 4620 ggcgaaatcc atcgaagggt atcatacact acccgtaagc cgcagatttt tctggagcga 4680 ctcaaaaacg gttctatcct ggattggctc ggatgcacgg aattatcgcc agtacgtagc 4740 agtacgaatt ggtgaaattt tagacgaatc gaaggcagaa gagtggcgct gggtacccac 4800 caagctgaac gttgcggacg aggccactaa atgggggaaa ggacctgtct tccatccgga 4860 cagccgctgg ttcacgggac caaacttcct cttccagccg gagaatgaat ggccacagca 4920 acaggtggca ttagttgata cggaagagga gatacgaatc gtgcacgtac atcgtacaaa 4980 gcttatgaaa ccattcatcg agtatgatcg attttccaag tgggaacgaa tgcttcgaac 5040 cactgcgtac gcgtattcat ttgtggaccg ttacttggct ctcggcaagc aaggagcttc 5100 acaggataaa caaacgttga ggcaagaaca tttacagcga gcggaaaggg cattgtggag 5160 acttgctcaa gcagaagctt atgccgttga aattgcagtt ctacagagtg gaccagacgg 5220 gaaacgatcg tcgaaacttg aaaaaagcag tccaataagc aaattatcgc cgtttctaga 5280 tgactacgga gtagttcgta tggcaggtag gactgaggca tctgaactag ctccatatga 5340 tgctaaattt ccggttatcc taccaaaaac tcatcgagtg accgaactgc tgatagattg 5400 gtaccacaga cgatttggcc atcacaatag cgaaactgtg gtcaacgagt tacgtcagcg 5460 atatcatatc tctgctcttc gaacagtggt acgtagagta gcgaaaaggt gccaatggtg 5520 taaagtatat aaagcgaatc ctgtggtgcc tagaatggcg cccctacctg aagcacgtgt 5580 aactccgttc gttcgtccct tctcactggt cggtatagat tattttggcc cgtatatgat 5640 caagatcggc cgaagtcaag taaaacgttg ggtggcactc ttcacgtgct tggtcgtaag 5700 agcggttcac ttggaggtta cagcttcgct ttctacagaa gcctgcaagt tggccttgcg 5760 acgattcatc gcgagacgtg gtgcgccgaa ccaaatctac actgaccacg gaacgaattt 5820 tgttggcgcc agtcgggagt tagctaacca aatatcagca atgaaccaag aattgtcgga 5880 gacgtttaca gatgctaaca cacgctggtt cttcatccca ccttcctccc cgcacatggg 5940 tggggcttgg gaaaggatgg taagagcagt taaaaccgct atggagtcga tcaacaattc 6000 tcgtgcgcca tctgaggaag tttttcaaac aatgttgtgc gatgctgagg caatggtcaa 6060 ttccaggcct ttgacctacg tgccgctgga gacttcggac caagaagcat tgactcctaa 6120 tcatttcatc ctccttagct caaatggtgt caagcagccg gagaaagctc ctgttgcaga 6180 aggtgaagca cttcgaaacg gatggaatct gtgtcgcagc cttctagatc aattctgggc 6240 aagatggatc cgggaatatc ttccggactt gacccgacgc accaaatggc atgaggagac 6300 gaagcctatt caggaaggag atgtagtctt cattgttgga gaaacaactc ggaacctttg 6360 gcctaggggt aaggtcgtga aagtaatccc agggaaagat ggtcgtatca ggcaggcaga 6420 cgtgcagaca ggtgcaggag ttcttcgccg tcctgtggca aagctcgctg tgctcaacat 6480 attaccatcg ggtaatcctg ctaagtcgga gcagcattac ggaggggggg a 6531 // ID Gypsy-1_AA-LTR repbase; DNA; INV; 137 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_AA_; KW Gypsy-1_AA-I; Gypsy-1_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-137 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 972-972 (2011). XX DR [2] (Consensus) XX SQ Sequence 137 BP; 36 A; 19 C; 37 G; 45 T; 0 other; tgggtaagtg atcggaagct ttgcttacgg cgggttataa ttcgatagtt agttatcagt 60 agttctctta attaatcggt cgagttctag tggtgggagt gctaaaaccg aagcggttat 120 aaagtataat cgcttca 137 // ID BEL-3_SI-LTR repbase; DNA; INV; 248 BP. XX AC AEAQ01011211; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_SI_; KW BEL-3_SI-I; BEL-3_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-248 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01011211; Positions 7684 7437. XX SQ Sequence 248 BP; 64 A; 60 C; 67 G; 57 T; 0 other; tgttccggcc gattgatacc gaccgggtgt agcttgttag aagcgacaag gcgtcgttag 60 tggcgctagt aacgttcagg ccccggcgtt acgagggcga gtagcggtag gtcagagact 120 ctcgagacac gtgaagtgca agcgaagggc accgtatact gtgttagatt ccagcacaat 180 aaagcactct aaaagaattc actgtccgtt atctggatca atctcttcat cccgtcgaac 240 attcaaca 248 // ID RTEX-15_BF repbase; DNA; INV; 1325 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-15_BF autonomous non-LTR retrotransposon - DE incomplete consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; RTE-2_BF; KW RTEX-15_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1325 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1325 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1732-1732 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The 3' terminus is composed CC of the (ATTGTC)n microsatellite. XX FH Key Location/Qualifiers FT CDS 2..1132 FT /product="RTEX-15_BF_1p" FT /note="RT." FT /translation="LKKTKIIIFNRKGEILSNFKFYLLDKDLDVVSSYCYL FT GVVFTAGCNFSTAMKTLQKKALRAMFSIRTTLGENNPSVSLQCKLFDACIV FT PILLYGAEVWGNAGLNDNSPLEQAHLKFCKAIIGVRRNASNLATRAELGRF FT PLRIEAYSRLYSYYSRLTREVPTNSLQYQALLVQQDLHRRKVPCWLSNVSK FT ILEESGYGYLLLNQDYNIPSAQSQVKQRLKDIFLQTFYSELNSNGKKENQG FT NKLRTYRKFKDTYEREQYLNIHNFSHRRAMAKLRISDHPLQIEVGRYNRTP FT PSDRICTLCNAKDIENEVHFVLECELYSDIRNSFINALNFDTRAKQLVKDE FT LFVYIMKSSDNIVLQKLCEFIYTCFKKRKETCKL" XX SQ Sequence 1325 BP; 462 A; 234 C; 247 G; 382 T; 0 other; cctcaagaaa acaaaaatta tcatttttaa tagaaaagga gaaatattgt caaactttaa 60 attctattta ttagacaaag acctagatgt agtctcctca tattgttatt taggcgtggt 120 attcacagcg ggatgtaact tcagcacagc catgaaaacc ctccaaaaga aggcgctaag 180 agcgatgttt agcattagaa caactctggg cgaaaataat ccctcagtat cactgcaatg 240 taagttattt gatgcatgta ttgtacctat attgttgtat ggtgcagaag tgtggggcaa 300 tgctggttta aatgataatt ccccactaga gcaagcacat ttgaaattct gtaaagctat 360 cattggagta agaagaaatg ctagtaacct cgccacaaga gcggaacttg gcaggtttcc 420 tttgcggatc gaggcttaca gcaggctcta ctcttattat agcaggttaa ctagagaagt 480 accaacaaat agtttacaat atcaagcact ccttgtacaa caagatctac accgacgtaa 540 ggtaccatgt tggctttcaa atgtaagtaa gattctcgag gagtctggtt atgggtacct 600 actgttgaac caagactata atataccatc agctcagtcg caagtgaaac aaagattgaa 660 agacatattt ctacagactt tctattccga actcaactct aatggaaaaa aggaaaatca 720 aggaaacaaa ctaagaacgt acaggaagtt caaagacacc tacgaaagag aacagtacct 780 aaatatacat aatttcagcc atcgccgagc catggctaaa ttaagaatta gtgatcaccc 840 tttacaaata gaagtaggaa gatacaaccg gactcctccg agtgatagaa tctgtacact 900 ttgtaacgcc aaagatattg aaaatgaagt tcattttgta ttggagtgtg agttatatag 960 tgatatacgt aattctttta ttaatgccct gaattttgat acacgagcta aacagttagt 1020 aaaagatgag ctttttgtat acataatgaa atcatctgac aacattgtac tgcaaaaact 1080 ctgtgaattc atatacactt gtttcaagaa gagaaaagag acatgtaaat tgtagaatag 1140 tatagcatta catatttgac tattttgtaa acgatagatg tacttatagt gtagcatagc 1200 cttagttgtt ctgtctctcc cactgccact gtaaagattc tttacagcaa agttttatgt 1260 aactactgca tttagcccat atgggcaagg tcatgcaaat aaagatcatt gtcattgtca 1320 ttgtc 1325 // ID DNA8-18_AP repbase; DNA; INV; 820 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-18_AP. XX NM DNA8-18_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-820 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1760-1760 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 820 BP; 304 A; 108 C; 131 G; 276 T; 1 other; caagcccgga ttaagggggg ggaagggggc acggtcccgg ggcccattga ttttgggggc 60 ccatctttat ccccaaaata tattttttta acgcgtccaa tacagttttc aaaaaaaaaa 120 aattatttaa gtaggtaggt atagtaggta aattattatt aataattata aaataaaatt 180 gttctcaatt ttatatcacg aaaaattact ataaataatg aactatgttt gccaacaatt 240 ttgtttgtgt tttttttttt gagcatttta gattaccgat ttaactattt aagtatataa 300 aaaggtagtt ataagaaaaa antatttagc tcttgtaaat taggtatttc agttaaaagt 360 attaatgcat attatactaa actttagagg accatagtta ataacctagc gaaccaaaat 420 tgatcatttg actgtcaaaa tgttcagaaa aaaagcttta ccagaatcca tcaaaaaata 480 tagtggttac catttgaaaa aataaagtcg attttattat tggtacaact gcgtgtttaa 540 aaattttgaa aatgttataa aaaaattgcc tgttaaaaat aaagaaacac gtcttttttt 600 gttttaacga aattccatat catcgatcca taattgttaa aaaaaaaccc gcgtacaatg 660 acataagata caccctgtat attttttgta tagtaaaaaa ttatacaata atataaagtc 720 gttaatatgt aatgatacaa ttattttgaa gttaaagttg aaagggggcc cactaagtgt 780 atggtgtccc ggggcccaat tgatccttaa tccgggcttg 820 // ID Gypsy-22_OD-LTR repbase; DNA; INV; 203 BP. XX AC CABV01001706; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_OD_; KW Gypsy-22_OD-I; Gypsy-22_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-203 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001706; Positions 17158 17360. XX SQ Sequence 203 BP; 57 A; 46 C; 47 G; 53 T; 0 other; tgttaggtcg agaaggcgcc gttagatctg ataaggatcg aggcgctgcg ctctgttcac 60 ttgcttgtca gtcaacactt tcacccttgt gaataaacag ctaaaaccta cttttccgtg 120 tttacctcaa gagtgcaact acagttagag aaggagctag acctagaaat aagtgcagtc 180 gtagacggac ttacgaccta tca 203 // ID Gypsy8-NVi_LTR repbase; DNA; INV; 2026 BP. XX AC AAZX01004364; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8-NVi; KW Gypsy8-NVi_I; Gypsy8-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2026 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1127-1127 (2007). XX DR Genome; AAZX01004364; Positions 5940 3915. XX SQ Sequence 2026 BP; 656 A; 461 C; 404 G; 505 T; 0 other; tgcggagacc attgcacgcg cagctacaac ggttgctaac acggtaacag atgctcgatt 60 cccaatcccg gccgaaggtt tatcaaccat tccaataatg aaaatttcaa aattaacagt 120 catattttcg aatggctcct taatttacca catagcaatc cctttattaa gtatagaaaa 180 attcaacctg ttcaaggcat caccgttacc ggcagttcag catgttttta gcataccgaa 240 tgtggatagc cgcttatata tggcccgagt ttgtttattt cgcggtaagc aagtcgaacc 300 ggacgtacat gccgctcctg ccggaggaag tcgtgggcct ccgaaaatta ggtgaactaa 360 ggatcgtggt agatccggaa ccagtccgcg agatacgcga aaacactgct tgcgaaataa 420 aaatcgccgc cggcaaaaag atcgccaacc ccgaaatatg cgatatacgc atcagacagc 480 ttagagactt attctggcta cggctatata aagccaacgc ttgggtcttc tcaactagga 540 ccccagagtc gatcttcgtg cagtgtttgc gagccgaaca tatcaccgat aaaatcaacg 600 gtatcgggat actagaactc cgccccgggt gcttcgctta taccgccaat gcacgcctga 660 cagcttcacg ttccatcacc tcgttcttaa acgattccca tttcgaaaca gtacagttca 720 acgtatcgaa aattctagcg gtgctgaatg aatccgctca tgccgaagcc gagttagaaa 780 aaacgattgc aaccgcggcg gacagtcgag tcaaaggctc gcacggggta aatctagaaa 840 gcctgaaaac gggagccagg cttcacgaaa tagctaataa agctagcgaa atcgctcagc 900 gcaaatcaac gaattttgaa ttacgaaatc taggtaatta tacctctatt ttcgggtttt 960 ctttaacatc tgtcttaaca gtaattagtg gaataatcat cggaggatgt ataatttacc 1020 gccgacgcgc gaacaatcct cttgctcgct taatcaagca acaagaaaat ttgattgctc 1080 tagagcaaac gcgaaaattt tctcgcgaga gaagagctca gttttcggaa aatatagccg 1140 aataagaagc atcttatttt cactttctct tttcttccta agtaagttct agcaaagcgt 1200 agagttaaga aaggcggtat tcgggtgcgt aaaatagaac accaccttgt cgcgcggcat 1260 aacacaacat ctcgtcgcgc tatcgctgct cgctcacctg gcggctaccg gggaaggagt 1320 aacatcacgt cccttcccat gaatacgcac cgggctcggc acctagtcgt attaagaata 1380 ttcgcaataa tgtcaaaaaa aagcaaaatg gaaagccaag ctgggaaaaa aagggacaca 1440 aggtcatgcg agaccccgcg atcctaatcc taagttagtc ttaagattta cagtcacgca 1500 agataaacgg cggactggaa aataaacaaa atgttccctt atattataat cgcagtttca 1560 attacagctt ctggcaaaac caggccatga aaaaaacata atagataagc aacaattctt 1620 gtacgataag gtaaggcctc aaggtcgacg tactgtattt tgtagccctt tacactataa 1680 ttaccatata taatctctta tagaataaag ccacgaggcc aatattttgt acttccattt 1740 ttataatctt ggaaactaag tttcacacta ttttgtactt ttgctgtgta agttatagaa 1800 aataagattc atgtaaaaag gggagctcta aacacttttg taaacggcag cggccgcatt 1860 tagctacaca atcgggacaa caacacaaac actcacgcac accaaatgca aagatattag 1920 aatctaatag aaatccaaca agcgctggca tccctattaa ttgtaagatt tgtattttgt 1980 ttaacaaaaa aaaaaccatt tcagaattgt aaagtaaaat cgaaca 2026 // ID Gypsy-14_DWil-LTR repbase; DNA; INV; 805 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_DWil_; KW Gypsy-14_DWil-I; Gypsy-14_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-805 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 5616265 5615461. XX SQ Sequence 805 BP; 235 A; 248 C; 141 G; 181 T; 0 other; tgcgaggcct cctcccatac aaccagcgcc cctggaacgc cttatccttc agtcaccaga 60 ccatgcccca cacctcggca tctgctatca caacaggact cgaccgacac tttctaccaa 120 aaacgtctgg acgagcatcg ggcaaaatac cctctatcaa tctaagcctt gaccttacac 180 tcacatttca tgcactcaca tttcacacat gttagggaca ttaacagacc acacaacacg 240 cgtgcatcgc gacccacact agcaatctaa gtgacaccag cccactcaca tttcacacac 300 tcacatgtca cacaccaaca gaccacacac agcacgcgtg tatcttttag ccgactaatt 360 tcctgtaacc acacttagcc aagtaagcga cttatgtccc ctagtactta tgcgacccta 420 gcaaaacgcg accctcgtcg ttcacactag gcacttcact tccccttcta gcgtaagcac 480 tttgaatatt gccgataagc agaattcgcc ggaagaggcg acccacttct gacacgatcc 540 caacgccgct gccagcgaca cgacgccgca gtcagcgccc cttgggcatc ttggagtata 600 tttaaaggat cacgctcgac agggagggca gtcttaatct tgacattgta caagtgacat 660 ggaactagta tcgaataagt gaataaaacc aatttataac gtttaattgt gtgttgtatc 720 gttattctca acaaagatcc tgaatatcca agccttaatc ctcttcacaa ggaaggacat 780 ggtaagataa ccccatttaa ggcca 805 // ID LOA_Ele2B_AAe repbase; DNA; INV; 5931 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW LOA_Ele2B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5931 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1421-1421 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >98% CC identity. ~86% identical to LOA_Ele2. XX FH Key Location/Qualifiers FT CDS 436..2097 FT /product="LOA_Ele2B_AAe_1p" FT /translation="METSEDNFDDGLQSDSLVDVHTPSPSILNSPLKELDD FT SMSDNDEDSVNVTIRPLKPPPLVVPKDDKKISDKGGETNDGDRKVQGKRRG FT LCGSARRRLHKMIKGGMDYAEARERAKAPFVSTPKRQRNDLDRTISSDEKP FT AQKKIRDVRPPQAGNMGKRVDKAQSQEVSSNIASKLNAMTSADRNVGNRVD FT KAQPQEVSNTIASQSDLMVTIDRNVGNRVAKALPQEVRSNIAFQSNETLTA FT DRNVGVNARDSINSGHSRDSKTMRDRVRDALPAPSYGEIASRVRVGIMPKK FT YPAVELSNNQQQVVQEALLLEVIQQRREQFKPKFACCRFRPGFLVLTCQDK FT HTADWVKNKVPSLSLWEEADLVVVDEDKIPRPDELVSFFAFSAKYSNDMIL FT SLIESQNDNLYTDTWRILRRHVKGDHVELVLSVDDSSLRKLGERQFVLNYR FT YGQILMKKKRPVKIVASDQNPKNVPEDTVMDSIQVQQNLAIPGTSGLGRGL FT LKPSPGKTTEQVNSHQTCAGHGGLNIQGQSNAHGNNKTTSDIPSTTKKSTI FT QCRLERE" FT CDS 2168..5821 FT /product="LOA_Ele2B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNLIKIIQVNLHHAKGASAVLNRRFTKENIGVGLIQE FT PWVNNGRVLGCSSQNSKLLYDDSQSKPRTAILVNRDIKFVPITEFISRDIV FT AIKIEVPTIRGRTEVCIASAYFPGDVEDVPPSLVVDFVNYCRKINAQFIIG FT CDANAHHTIWSSTGINKRGEDLLNYLSSNKIDLCNRGDSPTFINAIRQEVL FT DLTICSDLLSGNIVNWHVSNEESLSDHKHIRFDYKAGSAIEESYRNPKKTN FT WDLYRFHLINKNSYDGEQFTTVSQLDKASDGIIHQMISSYHASCPIQQRSS FT NRDVPWWNDKLAELRKKSRRLFNRAKITSDWVQYKKALTEYNKELRQAKRK FT DWRRVCENINSAPTAARLHKVLSKDHSNGLGSVKKEDGSFTVDSSKTLEVM FT MRAHFPGSIPYSVEEQVLDKVGHIWSTNAYQKACEIFTPSKVEWALSSFQP FT FKSSGKDEIFPALLQQGKEALCPLLTEIFKASVTLSYIPKAWRKVRVVFIP FT KAGKRDKTTPKAFRPISLSSVLLRTMEKIIDDFIKSTSLIEMPLCKYQFAY FT QTGKSTITALQTLVSKIEKSLEAKEIAVVAFLDIEGAFDNASYSSMRSAME FT TRGLDKSIIEWVMAMLRDREISADLGGAQLSIRSKKGCPQGGVLSPLLWSL FT VVDELLKNLKDQGFEVIGYADDVAIVIRGKYDDTISNRLQSALNSTLKWCR FT SEGLNVNPSKTVIVPFTRRKKVNLKQPFLDGIRIQFSEEVKHLGVTLDEKL FT NWNSHLNKIITKGTNALWVCNKALGKTWGLRPNMVHWIYSAIVRPKITYAS FT LVWWPKTNEVTTQKKLAKLQRLACISMTGAMKSTPSVALDALLNLLPLHQF FT VKLQAAKNALQFIRYNKILDGDLMGHLKIIKEFNLNSDIKTVEDWMITKTN FT YDVPFKVVKPNRNTWESGGPSLRPGSVVFYTDGSKMGENTGAGVFGPGISK FT SIAMGCSPTVFQAEIQAILECTNVCLKRNYRFAKICIFSDSQAALNALKAF FT TCQSKLVWECILSLKQLASRNEVTLYWVPGHCGIEGNEIADNLARQGAASS FT FVGPEPFCGVPESTLRMKLRTWEMSMVETNLNATDTAKQAKRFIRPSLSKA FT RAILNLNKKNTRVITGLMTGHCPSRYYLYKIGKIQFSECRFCLNENETAEH FT LLCSCGALFNQRLSIFGKGLLEPSEVWQCNPNRVINFIKRVEPSWDNVLHQ FT PLSTTSQL" XX SQ Sequence 5931 BP; 1896 A; 1150 C; 1352 G; 1533 T; 0 other; agtgagtgac tgtcgatcag aaactcgcga ataattttgt gctctcggga ataagtgatt 60 tactaggtgc tttctaagcc aatcggctgt taaaagtgta gaatagtgtt aatcgtggaa 120 atactcacta ttattccatg aaattataag cgaattcatc aagaatacgc cgaaatatcc 180 cgataaagat tttcattcct cgtgggccgt gacggagata tatctttgtg cgataaaaag 240 cctgatagcc atttgtatgc cgcagtgagc tcccgttgta cggcagtgtg cagatacaca 300 gttgtgttat cagtgtcttc ggttactaca accagctctt agacgtgtgt gccatacaaa 360 tcagtctctg ataagttcgg ggggaaattc gagcagttaa cagtgaacgt taaacatatt 420 tgaacgctgt gcattatgga aaccagtgaa gataattttg atgatgggct tcaaagtgat 480 tcgcttgttg atgtgcatac cccatctcca tcgattctga attctccgct caaagagcta 540 gatgatagta tgtccgataa tgatgaggac agtgtgaacg tcaccatacg accattgaaa 600 ccaccaccac tggtggtacc aaaagatgat aaaaagatca gtgataaggg gggagaaaca 660 aatgatggag atcgtaaagt gcaagggaaa cgtcgaggtt tgtgtggttc ggcgcgtaga 720 cggcttcaca agatgataaa aggtggtatg gactatgctg aagcaaggga gcgggctaag 780 gccccctttg taagtactcc gaaaaggcag aggaatgatc tggaccgcac catcagcagc 840 gatgaaaagc cggcccaaaa aaagattcgg gacgtcaggc ccccacaagc aggtaacatg 900 ggcaaaaggg ttgataaagc ccaatcccag gaggtaagtt cgaatatcgc ctccaaattg 960 aatgcgatga catcggccga tcggaatgtg ggtaacaggg ttgataaagc ccaaccccaa 1020 gaggtaagta acactattgc ctcccaatcg gatttaatgg taacaatcga ccgaaacgtg 1080 ggtaacaggg ttgctaaagc tcttccccag gaggtacgtt ctaatattgc cttccaatcg 1140 aacgaaacat taacagccga tcgcaacgtg ggagtaaatg ctagggattc gataaactct 1200 ggacactcac gcgattcaaa aacaatgcgc gatagggtac gtgatgctct tccagcgcca 1260 tcgtatggtg aaattgccag tagggtaagg gtagggatca tgcctaagaa atatcctgct 1320 gtagagcttt ctaacaacca acaacaagtt gttcaagaag ctcttttgct ggaggttata 1380 caacaaagac gggaacagtt caaaccaaaa tttgcttgct gtaggttccg gcctgggttt 1440 ttagtgctca catgccagga taaacacact gcagattggg ttaagaacaa ggtcccgtct 1500 ttgagccttt gggaagaagc tgacctggtg gttgtggacg aagataaaat ccctcgtccc 1560 gacgagctag tctctttctt cgcatttagt gcaaagtata gcaatgatat gatactttca 1620 ctcattgaga gtcagaatga caatctctac actgacacgt ggaggattct taggcgccac 1680 gtcaaaggtg atcatgtaga gcttgtgcta tctgttgatg actcttcttt gcgtaaactt 1740 ggtgaaaggc agtttgtact taactacaga tacgggcaga ttctcatgaa aaagaaacgc 1800 cctgtgaaaa tagttgcttc tgatcaaaac cctaagaatg tcccagaaga tactgtcatg 1860 gattctattc aagtccaaca aaacttagca attccaggga caagtgggct tggcagaggg 1920 cttctaaaac cctctccggg aaaaacgact gaacaggtca actctcatca aacatgtgcc 1980 gggcacggag gtttaaacat ccaagggcaa tcaaatgccc atggtaacaa taaaaccacc 2040 agtgatatac ccagtacaac taagaagtcg acgattcaat gccgtctgga acgggaatag 2100 agtgcccctg gaaagaaggt aagtctaagc taataaaatc agtaattaaa aataatgatt 2160 tacaaacatg aatcttataa aaattattca agtgaacctt caccatgcaa agggtgcgtc 2220 agcagtgctc aatagaaggt tcacgaaaga aaatattggc gtgggactaa ttcaagagcc 2280 ctgggtaaat aatggtaggg tacttggctg ctcttcgcaa aatagtaaac ttttgtatga 2340 cgatagtcag tccaaaccaa ggacagcaat tttagtaaat agggacataa aatttgtccc 2400 aattacagaa tttatatcaa gagacattgt tgcgattaag attgaggtac caacaattcg 2460 tggaagaacg gaggtatgta ttgcttctgc ttactttccc ggagacgttg aagacgtgcc 2520 tccatctttg gtcgttgatt ttgtaaacta ctgtagaaaa ataaacgctc aatttataat 2580 agggtgcgat gcaaatgccc atcatacaat ttggagcagc acgggtatca ataaaagagg 2640 agaagatctt ttgaattact tatcgtccaa caaaattgac ctatgtaaca gaggggattc 2700 tcctactttt ataaatgcta tccgacagga ggtgctggat cttacgatct gtagtgacct 2760 actgtcgggc aatattgtga attggcatgt ttccaatgaa gaatcattgt cagatcacaa 2820 acacatccgg tttgattata aagctggatc agcaatagaa gaaagctata gaaaccctaa 2880 aaaaaccaac tgggacctct atcgttttca tttaataaac aaaaattcat acgatggtga 2940 acagtttacg actgtttctc aactggataa ggcctcagat gggatcattc accaaatgat 3000 ctcatcgtac catgctagct gtcctattca acaaaggtcc tctaacaggg atgtgccttg 3060 gtggaatgac aaattggcag aattgaggaa gaaatccagg cggttattca atagagcaaa 3120 aattacttct gattgggttc aatacaaaaa agctctaaca gaatataaca aagaattacg 3180 tcaagccaaa cgtaaggact ggagacgggt atgtgaaaac atcaacagtg cccctactgc 3240 cgcaaggctt cacaaagtcc tttcgaaaga ccactccaat ggtctaggca gcgtcaaaaa 3300 ggaggatggc agttttactg ttgattcttc taaaacatta gaagtaatga tgagagctca 3360 tttcccagga tcgatcccat attcggttga agagcaggtt ctagataagg taggacatat 3420 atggtctaca aatgcctatc agaaagcttg tgaaatcttc actccatcga aggtcgaatg 3480 ggcactgagt tcttttcagc ccttcaaatc ttccgggaaa gatgaaattt tcccagctct 3540 gctacaacaa gggaaggagg cgctttgtcc gctccttact gaaattttta aggccagcgt 3600 tacactgtct tacattccta aggcgtggcg aaaggttcga gttgttttta ttcctaaagc 3660 tggcaaaagg gataaaacaa ctcccaaagc ctttagaccc ataagccttt cgtctgtttt 3720 gctaaggaca atggaaaaaa taattgatga tttcatcaag tcaacaagct taatagaaat 3780 gcctctttgc aaatatcaat ttgcgtatca aacgggtaaa tctactatta cagcactaca 3840 gacgctagtg agtaagatcg agaaatcact tgaagccaaa gaaatagccg tggttgcatt 3900 tctcgacatc gaaggagcat tcgataatgc atcctacagc tccatgagat cagcaatgga 3960 aacaaggggc ttggataaaa gcattattga atgggtaatg gctatgctca gagatcggga 4020 gatatccgct gatctgggag gtgcgcaact atctataaga tctaagaagg gatgtcctca 4080 gggaggagta ttatcgccct tactttggtc gttagtagta gacgaactcc ttaaaaatct 4140 aaaggatcaa ggttttgagg ttattggata cgcagatgat gtggccattg tgattcgtgg 4200 aaaatatgat gacacaatct ccaatcggtt gcaatctgcg ttaaacagta ctctcaaatg 4260 gtgcagaagt gagggcttaa atgtaaaccc ttcgaaaaca gttattgttc cttttacaag 4320 gaggaaaaag gtgaacctca aacagccttt tttggatgga attcgaattc agttctcgga 4380 agaagtaaaa caccttggtg taactctgga cgagaaattg aactggaatt ctcatctaaa 4440 taagatcata actaagggta caaatgccct gtgggtctgc aataaagcct tggggaaaac 4500 ctggggccta cgcccaaaca tggtacattg gatttattca gcaatagtgc gacctaaaat 4560 tacttacgct tcactggtgt ggtggcctaa aacaaatgag gtaaccactc aaaagaagtt 4620 agctaagtta caaaggcttg cgtgtatatc tatgactggg gcaatgaaaa gcacaccatc 4680 agttgctttg gatgcccttc tcaatttact acctttgcat caatttgtta aactgcaagc 4740 tgcaaaaaat gccttgcaat ttatccgtta caataaaata ctagatggtg atctgatggg 4800 acacttgaag atcatcaagg aattcaactt gaactcagat ataaaaacag tagaagactg 4860 gatgataacg aagacaaact atgatgtgcc cttcaaagtg gttaaaccaa accgcaatac 4920 gtgggaatct ggtgggccaa gtttacgtcc agggtctgtt gtattttaca ccgatgggtc 4980 aaagatgggc gaaaataccg gggctggagt ttttggtccc ggtattagta agtctatagc 5040 tatgggatgc agccccactg tatttcaggc tgaaattcaa gcaattttag aatgcacaaa 5100 tgtttgtctc aaaaggaatt acaggtttgc taagatctgt attttctcgg acagtcaagc 5160 agcattaaat gcgctaaagg catttacatg tcaatcgaag ttagtgtggg aatgtattct 5220 ctctttgaag caattggcca gtaggaacga agtaacgttg tactgggttc ccggccattg 5280 tggtattgaa gggaatgaaa tcgccgacaa tctagcaaga cagggtgcag cttcgagctt 5340 cgtcggccct gaaccttttt gtggagttcc tgagagtact cttaggatga aactaagaac 5400 ttgggaaatg tccatggtag aaactaattt gaatgccacg gatacagcca agcaagcgaa 5460 aagatttatt agacccagtc tgtcaaaagc ccgggccata ttaaatctta ataaaaagaa 5520 taccagggta ataaccggtc tgatgactgg tcactgcccg agcaggtatt acctttacaa 5580 gattggaaaa atccaattct cagaatgtcg attttgtttg aacgaaaacg aaaccgctga 5640 acacttgctt tgcagctgtg gagcattgtt caatcaaagg ttgtcaatat ttggaaaagg 5700 gttattggag ccctccgaag tttggcaatg caatccaaac agggtaataa actttataaa 5760 acgagttgag cctagttggg ataacgtgct ccatcaacca ttgtccacca catctcaatt 5820 gtgagaggat atttgatgag cacaaaatta aatatgggtc ataccacaat attcctaatt 5880 attggacgca gtgggtaaaa ggctctacag acaaggaaaa aaaaaaaaaa a 5931 // ID Copia-32_DPu-LTR repbase; DNA; INV; 315 BP. XX AC scaffold_96; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-32_DP_; KW Copia-32_DPu-I; Copia-32_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-315 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_96; Positions 227968 227654. XX SQ Sequence 315 BP; 83 A; 68 C; 57 G; 107 T; 0 other; tgttgaagtt ggttagacaa ccacaagatg tgggaactcg tttccctttg gtgtgttatg 60 cagtgccatc tacctctgcg tgtttagctc aaccctctgt taccctcacc tatgattctt 120 gtatagtcaa tgtcagaact tgctgtgtca agttctctct ctctgtcttg tgaaacctct 180 tgtcatacag tcaacagaag tattggtaac tatcatacaa aatcatatgt gtatccatta 240 tgttatatga agttttcctg ttgaaataca gaagccttct cagtgggcaa gttagtcaaa 300 catcaaactc taaca 315 // ID Gypsy-86_AA-I repbase; DNA; INV; 5502 BP. XX AC supercont1.246; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-86_AA_; KW Gypsy-86_AA-LTR; Gypsy-86_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5502 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.246; Positions 1424091 1418590. XX CC Positions [2732-3160] - Reverse transcriptase CC Positions [4538-5014] - Integrase core CC 'TAAA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1610..3229 FT /product="Gypsy-86_AA-I_1p" FT /translation="MSLNTTNNVSNWRTLKDFLLKEFEFSENCAVVHERLR FT NRKRNLGENVLEYFLQMREIGAKANVDILSIITYTINGINDNGPDKTILYG FT SKSIDEFREKLRIYQSIKDSKYGRDGNFHPLNSEKRNFSGRPNGFNRNLTN FT QFKSPPRPINCYNCGRDGHISRECPKKFGNKNANICNSNTAVSKGTFMMIL FT VGTIPFEAFFDSGSDVSLLKEEWKNKLNLNMNQNDRKRLITLKGAIWTLGS FT VVLDIEIENTPLRVTFDILRDEDLPHDILLGRNILTFGDESVTADGAKFSP FT KEELFALRIRTDEVSDDFFDHIDDVYIRDKVKQMVEDYCPSKVENNRVKLK FT IVLKNESPIQQLPRRLAPLEKQIVREQIDKWKEDGIIRPSTSEYSSPIVLA FT HKKDGTRRLCVDYRRLNKVIVRDHFPIPLIEDIMDDLHSARVFSTLDLENG FT FFHVPVEENSIKYTSFVTPAGQYEFLKTPFGLCTSPTVFQRFINDIFRELI FT EKKIVIVYIDDILIPARNEIEALERLKMVLKVAQSHGLIIKWKK" FT CDS 4400..5500 FT /product="Gypsy-86_AA-I_2p" FT /translation="MHELLTREYWIENLSTKLDRFVKSCVPCILACRKAGK FT REGYLYPIPKDGTPLSTYQIDHLGPLTASNKLYRYIFTVIDAFSKFVWIYP FT VKSTTADEVLKKLEVQRCVFGNPKRIISDRGAAFTSTVFQNYCENEGIEHL FT QITTGVPRGNGQVERIHGVIVPALAKLAVDNPENWYRHVGKLQEFLNNTLQ FT RSIARTPFELLFGTKMRTTQDLRLAEIIEQEWIRIFEESREEIREMARANI FT ENCQRKMKQQFDRNRKEAEDYEVGDLVAISRTQFGTHLKIARKFLGPYRVI FT KKKRNDRYDVEKVGNEEGPKRTSTCAEFMKRFIPGEDEEEETYDNMEQPDI FT GRSNQEDTNDQITPSGPDGGQDDR" XX SQ Sequence 5502 BP; 1798 A; 894 C; 1225 G; 1585 T; 0 other; aatttggggg ctcgtccggg aacgcgagaa gaaactgtcg ggtactcgtt agtgcgcgcc 60 ggtgggcgag tgtgaaaaaa aaaattgttg aaaattgaaa taattttgta aagaaatttt 120 tcactaaagc gaattgtaaa atatattgtt ttttttttgt ttttttgaat tgaacaaatt 180 agtaataaga ctgaattgaa ttaagtgata attaatgata agtaataaag tgaattggat 240 ttttatttga attaagttat agagtacttt cttaagtggc agtgtaaaac taacaaccag 300 ccacattacc gtgctgtaaa tttcgcacat tctgtgcacc gtgaaaaaga agaaaaattg 360 tgagttcttt tgcgagcatc gaattgcata gaactacatg tgtgacctga ggttaagtgt 420 cattgtccag tgctacaaat aaataagccc tagctcgaac gactgtgact gtgtgtacgg 480 tgccggcata acaggcgaag aaagcgaaag taacaaacgt tttgttctct tgtggctaat 540 ttatattgca acagtttgag gaaatgcagg acggaaacat cccgccgttc aggcctagtg 600 gtggtaacat acctcccgtt tctaatcctc aattttcccc ttcctctaat aggcttccaa 660 tgaacattat agagaacact ttcacacgtt taatagacca aatcaatttt tcaaatacct 720 ctatgtccaa cttaacgcaa gagcttcgcg cgatgcgcag tgaaatgaac tgtatgagag 780 aagaaacatc tcggttgaac gtagtagtag aagaaatgcg atctagttcc atggtgcctc 840 agcaggcaag tactcctaat ggaggtagcc ctactggatg tgatgaagat agactagcga 900 gaagcaatgg cagaagtgcg atagacaaga acactcgtgt aaccgttcaa aacaaagtaa 960 gtgacggcga cggtcgtgat catggtggtg tgtcgagaac aaattatggt cgagatgttt 1020 tgaatggtga tgctaacgat ctgttgggca acggcccgaa tgttgattta tttagtggct 1080 caaatgggca ccacttactt ggtggtcgga atcgtacggt ggaaaagttg agtgtcaatg 1140 aataccaaca tgctattagt gagattggaa gctgttgtgg ctctgaggcg tacggtggct 1200 tgcgtggtaa ctaccacatg gaggcccatg cgcagaacgc agacagagaa ctagaaccgc 1260 ggctatcggt aagtgcagag tctatataca cagagctgcg cgaagagact tcttcaaatg 1320 cttgttcttc ttcgtcttcg aaaatagccc ctatctgttt tgctgggctg ctcgataggt 1380 agacggtttg acagttcgat aggtagattt ggtttcactc ttcgcgcaac tcttcattca 1440 tagactctgg gtaagtgaag ctgagacagc tttctctgaa ttctctggga ctgattttta 1500 tcctgtgcgt aaatggattg ctgattttga tgaattgtca gattccattg ggttatcaga 1560 gctacgtaag ttcgtcgtta ggaaacgtaa attaactggg ttagcaagaa tgtcattgaa 1620 cacaaccaat aatgtctcta actggagaac attgaaagat tttttgttaa aagaattcga 1680 attttcagaa aattgcgctg ttgttcatga aaggttgcga aatcgcaaaa gaaatttagg 1740 agaaaatgtt cttgagtatt ttttgcaaat gcgagaaata ggggcaaagg ccaatgtgga 1800 tattttgtcc attattacgt acactattaa tggaatcaat gacaatggtc cagataaaac 1860 tattttatat ggttcaaagt caatcgatga atttagagaa aagcttcgta tatatcagag 1920 cattaaagat agtaaatatg gaagggatgg aaattttcat cctttaaatt cggaaaaaag 1980 gaattttagt ggaaggccaa atggcttcaa tagaaacctt acgaaccaat tcaaaagtcc 2040 tccaagacca ataaattgtt ataattgtgg aagagatggc catatctcca gagaatgtcc 2100 gaagaagttc gggaacaaaa atgccaatat ttgtaactct aacaccgctg tttcaaaagg 2160 aacatttatg atgatattag ttgggacaat tccctttgaa gctttttttg attcgggatc 2220 tgatgtctct ttattgaaag aagaatggaa gaataaactg aatttaaata tgaatcagaa 2280 tgatcgaaag cgtttaatta cgctaaaggg agccatatgg acgttgggta gcgttgtgct 2340 agatattgaa atagaaaata ctcctttgag agttacgttt gatattctgc gggatgagga 2400 tttgccgcac gatattttgt taggaagaaa tatattgacg tttggtgacg aaagtgttac 2460 agctgacgga gcaaaattca gcccaaaaga agagcttttt gctctgagaa ttagaacaga 2520 tgaagtttcc gatgattttt ttgatcatat tgatgacgtt tatatacgcg acaaggttaa 2580 acaaatggtt gaggactatt gtccaagcaa ggttgaaaat aacagagtga agctgaaaat 2640 agtattgaag aatgaatcac ccattcaaca acttcctcga cgcttagctc ctttggagaa 2700 gcagatagtt cgagagcaaa ttgacaaatg gaaagaggat ggaataattc gtccaagtac 2760 atccgaatat agcagtccta ttgttttggc ccataaaaaa gatggaacca ggcggctttg 2820 tgtagattac cgtagattga acaaagttat tgtacgggat catttcccga ttcctttaat 2880 tgaagacata atggatgatt tacattcagc acgagtattt tcaacccttg atttagaaaa 2940 tggattcttc cacgtaccag tagaagaaaa tagcatcaaa tatacgtcgt ttgtgacacc 3000 tgcagggcaa tatgagtttt taaaaactcc atttggatta tgcacttcac cgacagtttt 3060 tcaaagattt atcaatgata tttttcgtga attaattgaa aagaaaattg ttatagttta 3120 tattgatgat atattgattc ccgctcgaaa tgaaattgaa gcattagaaa ggctaaagat 3180 ggtattaaaa gtagcacaaa gccacggact tataattaag tggaaaaaat gacagttttt 3240 aaagcgcaag attcaacatt taggatatga aatagaaaac ggaaatattc gtccaatgtc 3300 ggaaaaaaac aattgctgtt tctaaattcc ctgaaccaag aagctataaa gaaatacaac 3360 aattcttggg attaactgga tattttagaa agttcattga tgggtattcc ataattgcca 3420 agccgttgac tgatttgttg cgtaaggacg ctgtatttat ttttggacca gaacaacgta 3480 ccgcaatgga aaaattgaaa gaagcattgt gtaatagacc aacactaaaa ttgtactctc 3540 ctaatgctat aactgagctt cacactgatg caagtagggt tggatatggt tctattttgc 3600 tgcagaaatc tgccggagag aatgttttcc atcctgttta ttattatagt agaaaaacaa 3660 catctgctga agcaaattac ccaagttacg agctagaggt tttagcggtg atttctgcct 3720 tgaaaaaatt ccgggtatat ttactgggta ttccttttac catcgtaact gattgtgctg 3780 cttttaagat gacaatggta aagaaagata tatccccaag aattgctggg tgggcattat 3840 tgctggaaga ataccaatac accttagaac accgccctgg tactcgcctg aaacatgtag 3900 acgcgcttag cagaaatcct gtagttatgc ttttacagag taacataata gatcagatta 3960 aagtgtgtca acagagagtt cagtatcact gcaatgatcg atttctcttc atcgattttc 4020 tctttgacat atttctcggc cgggtgactg ttttcgattg ctatttttgg ttcagtagaa 4080 agtactcatc gtccttcgtc ttgctacatc gtaaaatgtt acgaaaatct tttgattttc 4140 tcgtattatt gtaaagagaa aatcgacgaa gagaaatcga tcattgcaga tacgcctgta 4200 tgatactcaa cgcgagagat gaaagattgg cggcaataat gaaagtttta aatactgagc 4260 catttggtga ttattttgtt gatcatggaa tattatataa gaacttcgga ggtggaaagc 4320 ttctagtcat ccctaattcg atgcataatt tcataataag aaatgttcat gagcaaggac 4380 attttggggt gaaaaaaaaa tgcatgaact tcttactaga gagtactgga ttgagaactt 4440 gtcgacaaaa ctagacagat ttgtcaaaag ttgcgtgcct tgcatactgg cttgtcgcaa 4500 ggctgggaag agggaaggat acctttatcc tattcctaaa gatggaacgc cattgtctac 4560 ttatcaaatt gatcatttag gaccgttaac agcatcaaat aagctgtatc gatatatttt 4620 taccgtgatt gacgcgttta gcaaatttgt gtggatatat cctgtgaaat ctacaacagc 4680 cgatgaagtt ttgaaaaaat tagaagttca gagatgtgta tttggaaatc ccaaacgaat 4740 tatttccgat aggggagctg cttttacttc gactgtgttt caaaattatt gtgaaaatga 4800 gggcatagaa catttgcaaa ttaccactgg agtaccacgc ggcaatggcc aagtcgaaag 4860 aattcatgga gtcattgtac ctgctttagc taaactagca gttgataatc cggaaaactg 4920 gtaccgtcat gttggaaaac ttcaagagtt tttaaacaat actcttcaac gaagcatagc 4980 tagaactcct tttgaattgt tgttcggaac taaaatgaga actacacaag atttaagatt 5040 agcagaaata attgaacagg agtggataag aatatttgaa gaatctagag aagaaattag 5100 ggaaatggca agggccaaca tagagaattg tcagagaaag atgaaacaac aatttgaccg 5160 taacaggaaa gaggccgaag attatgaagt aggtgacttg gtggcgataa gtcgtacgca 5220 gtttggtact catttgaaga tcgctcgaaa atttctcggg ccatatcgcg tcatcaagaa 5280 gaagaggaat gatagatatg acgtcgaaaa agtcggaaat gaagaaggac caaagaggac 5340 atcaacttgt gcggaattca tgaagcggtt catacctggt gaagatgaag aggaggagac 5400 ctacgacaac atggagcagc ccgacatcgg tagaagcaac caggaagaca ctaatgatca 5460 gatcacacca tccgggccgg atggtggtca ggatgaccga tt 5502 // ID L1-40_AAe repbase; DNA; INV; 4640 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-40_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4640 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1393-1393 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 131..1249 FT /product="L1-40_AAe_1p" FT /translation="MAVSPYHLTSNGPRINTIVIDLATVPTRKVTMDLIVR FT FINHNLKIQFSSVQSLQHNTGKSLVFLECQSEEQALAVADKNDGKHEITIE FT NIKYAVNVYMEDGATTVRIHDISPQSENSVIEQALEQYGDVIWLKEETWTE FT PAILKGIKSGVRSVRIRLHSAIPSYINIRGEVTLVTYKNQQQTCRHCNKPV FT HWGRKCIEANYMELQLQTGLTGNISDRLRASGVDYAGALKQVAQEPVTSVG FT TGSANKQVSANFTSLNQLLRVEQNANTSKSVDPVIHASNSEIPHTKKGNSV FT TIHRTTDPSVGNEMNTESVSSATKSPSFLAEGSQLGIPTNNTFMALISDDE FT SLHNDGTRSRSTSPSHQKVQKRSRPRKTSK" FT CDS 1261..4452 FT /product="L1-40_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNALSYNIGSINMNGISNKNKIDALRAFVHLMELDIV FT MLQEVPEQIDFNGFDSFSNIDHKKRGTEIMVKSHIKTSGIQRSIDSRIISL FT QIDQHVTICNIYAPSGYALRAEREKFFTDTIPHYINHGTGELILGGDFNAV FT INKKDANHTSNHSSSLKFLVESLKLSDAWEHVHKTVVDYTFVRNGSGARLD FT RFYVTEGIKQNVVDIKSNVCCFSDHKSVILKVKLPNLGKPYRKQFWRFNDT FT ILTEEVLSEFRTKWLWWVRQRRYYDSWFSWWNELAKKKIVSFFKWKNSIAR FT KRNHDTMEFYYAALNLLHREYVDKPALISEINLVKGKMLALQKQISNNFYS FT HSKTMIAGEDASIFHVGEQIKNRGKAFIKSLDIAENTERDADKIEEEVVNY FT FQKLYSASDVTSDNSFNPQKTVDVNLPANNDLMSPVTEQELLLTIKTSQSR FT KSPGIDGLSKQFYLKCWNIIKTELVAVVNDVLNGKMSKKFVEGIIILIHKK FT GSRSSIENYRPITLLNFDYKLVARILKTRLSKFSPLLLSKAQKCANAPNTI FT FDTICGIRDKILEVNCRRRTCILVSFDFEKAFDRVSHTFLEKTLDRMGINN FT RFIDFLKQIHANSYSRIMVNGRFSNQIKIGRSVRQGDPLSMLLFCLYLEPL FT IQKLQRCCNAEFDMLYSYADDISLILNDFTKLEEVKRIFENFGKVSGAKIN FT FLKTKAMCIGTNNTSNVPDWIQLAESVKILGIYYHNNSRTMMNLNWDNVIQ FT KLKFSTWNNQFRNLNLIQKVIFLNTFASSRIWYVSSVVPLDKKRNAKIRNI FT YGNFLWYRATLRVAFDQLCLPRKRGGLGLIAPNLKSNALLLNRFLRLESES FT PFLHNLTNQIDNPPNIKDVPLNSPFLKTVYLELPYIPENIRQNSTANALYN FT HYIARLPTPHIEQKYPQCTWQRIWKNVFNKKIISENQVSWYLFVNEKIPCG FT KQQFRFGRTASERCVFCPQEIEDWKHRYANCEKVRKCWWYSLTQIKLINRN FT RTRNLSFDDFKFPVLSRLNMLERTEASKIFIEYLNYIKSRSREELDLSELV FT FNLNCKT" XX SQ Sequence 4640 BP; 1611 A; 827 C; 931 G; 1270 T; 1 other; cagttacgct ttggcagccg gcgagatcgc acgcttttgc aattgttcgg agtttaaaag 60 tgaagctatt attttgccag tcaatacaga ctgtgctcat caacacaaca caatacaaca 120 cgtgagaaca atggccgtat cgccatatca cctcacctcc aatggaccac gaatcaatac 180 tattgtcatc gatttggcta cagtgcccac gaggaaggtc acgatggact tgattgtacg 240 tttcatcaat cacaatttga aaatacaatt ttcgtcagtt caatcacttc aacacaatac 300 ggggaaatcc ttggtatttc tggaatgcca atctgaagag caagcgttgg cagttgcgga 360 caaaaacgat ggaaaacacg agatcacgat cgagaacatc aagtatgcag taaatgtata 420 catggaggat ggagcaacaa ccgtaaggat tcatgatatc tcaccgcaat cggaaaattc 480 ggttattgaa caagcgttgg agcagtacgg agacgtaatt tggctgaagg aggaaacatg 540 gactgaacca gctattttga agggaatcaa gagtggagtc cgatcggttc gtatccggtt 600 gcattctgcc ataccgtcgt acatcaacat ccggggtgaa gtcacgttag tcacgtacaa 660 gaaccaacaa cagacatgca ggcattgcaa caaaccggtg cactggggac gaaagtgcat 720 tgaggccaac tacatggagt tacaactgca gaccgggttg actggaaata tcagcgatag 780 attgagagct tctggcgtag attacgcagg ggcgttaaag caagtggctc aagaacccgt 840 aacaagcgta ggaacaggta gcgctaacaa acaggtaagt gccaatttca caagtctcaa 900 tcaactgctc cgagtagagc agaatgctaa tacgagcaag tcggttgatc cagttattca 960 cgcttcaaac tcagaaattc ctcacacgaa aaaaggcaat tccgtaacta ttcatcgcac 1020 cacggatcca tctgtgggta atgagatgaa cacggagagc gttagctcag ccaccaaatc 1080 accatctttc ttagcagaag gctctcagtt gggaattccg accaacaaca cgtttatggc 1140 actcatttca gacgacgagt cattgcataa tgacggtact cggagcagaa gcacctctcc 1200 ctctcaccaa aaagttcaga aacgtagtag acccagaaaa acctcgaagt aatttcgaaa 1260 atgaacgctc tctcgtataa cattggttcc attaacatga acgggatctc caacaagaac 1320 aagatcgatg ctctcagagc attcgttcac ttaatggaac tagatatagt tatgcttcag 1380 gaggttcctg agcaaattga ttttaatggc tttgattcct tttcaaatat cgatcataag 1440 aagcggggta cagagataat ggttaagtcg cacataaaaa catcagggat tcaacgcagt 1500 attgattctc gtatcatcag tcttcagata gatcaacatg taaccatttg caacatatat 1560 gcaccatcag ggtatgcttt gagggcagaa agggagaagt ttttcacgga tacgattcca 1620 cactatatca atcatggtac tggcgaactg attttggggg gcgattttaa tgctgttatc 1680 aacaagaaag atgctaatca cacttctaat cacagtagtt ctttgaaatt tctcgtagaa 1740 tctttgaaac tgtcggatgc atgggagcac gtacataaaa cggtagtaga ttacactttt 1800 gttagaaacg gatccggagc aaggttggac agattttatg tgacagaggg aattaaacag 1860 aacgtagtag acattaaatc taatgtgtgc tgcttctctg atcataaatc agtgattttg 1920 aaggtgaaat taccaaattt aggaaagcct tacagaaaac aattctggcg tttcaacgac 1980 acaattttaa cggaggaagt tctgtcagaa tttagaacta aatggttgtg gtgggttaga 2040 cagaggaggt attatgattc ttggttctca tggtggaacg agttagcgaa aaagaaaatt 2100 gtttcgtttt tcaaatggaa aaatagcatt gctaggaaac gaaatcatga caccatggaa 2160 ttttattatg cagctttgaa cctgttgcac agggaatatg ttgataaacc agctttgatt 2220 agtgagataa atcttgtcaa aggtaaaatg ctagcacttc agaagcaaat atcaaacaat 2280 ttttactccc actcaaaaac aatgatcgcg ggggaagatg cttcaatatt ccatgtcggt 2340 gagcaaatca agaacagggg gaaggcgttt atcaaatcac tggatattgc agagaacacg 2400 gaaagagatg cagataaaat agaagaagaa gtagtgaatt attttcaaaa actttactca 2460 gcgtcagatg ttacatctga taatagtttt aatcctcaaa aaacagttga tgtcaatcta 2520 cctgccaaca atgatttgat gagccccgtg accgaacaag agttgttgtt gacaatcaaa 2580 accagtcaat ccagaaaatc ccccggaatt gatggtctgt ccaagcagtt ttacttgaaa 2640 tgttggaata ttattaaaac agagttagtt gcagttgtta atgatgtact caatgggaaa 2700 atgtcaaaaa aatttgttga aggtatcata atcttaatac ataaaaaagg atccaggagt 2760 agtattgaaa attaccgacc aatcacattg ttaaattttg attacaaact tgttgcaaga 2820 attctcaaaa ctcgtctaag caaatttagt ccgttgctct tgtctaaagc acaaaaatgt 2880 gctaatgcgc caaacaccat atttgataca atttgtggca tcagggataa gatactcgaa 2940 gtaaattgca gaagaagaac ttgcattctc gtatcatttg atttcgagaa agcatttgat 3000 agggtcagtc acaccttttt agaaaaaact ttggacagaa tgggaattaa taacagattc 3060 atcgattttt taaaacagat tcatgctaat tcatattcca gaattatggt caatggtcgt 3120 ttttctaacc agataaaaat tggaagatca gtcaggcagg gcgaccctct ttcaatgtta 3180 cttttttgtt tatatctgga acctctaata caaaagttac agagatgttg caatgctgag 3240 tttgatatgt tgtatagtta tgcggatgat attagtttga ttttgaatga tttcactaaa 3300 cttgaggagg tcaaacgtat ttttgaaaat tttggaaaag tctcgggggc aaaaattaac 3360 ttcttgaaaa ctaaagctat gtgtatagga acaaataata cgtcaaacgt gccggattgg 3420 attcagcttg ccgagagtgt taagatctta ggcatttatt atcacaacaa cagcagaaca 3480 atgatgaatt tgaattggga taacgtaata caaaagttaa aattttcaac atggaataac 3540 cagtttcgga acttaaattt gatacagaaa gttatctttt taaatacatt tgcttcatca 3600 agaatttggt atgtttcatc ggtggtgccg ttggacaaaa aacggaatgc taagatcagg 3660 aacatttatg gtaactttct ttggtaccgt gctacactaa gagtcgcttt tgatcaatta 3720 tgcttgccta gaaaacgtgg aggtttgggt ttgatagcac ccaacctcaa aagcaatgca 3780 cttttattaa atagattttt gcgcttagag tctgaatcac catttctcca caacttaact 3840 aatcaaatag ataatcctcc gaacataaaa gatgttcccc taaattcacc ttttctcaaa 3900 actgtatatc tggagcttcc atacattcct gaaaatatca gacaaaactc aacagctaac 3960 gctttgtata atcattatat agctagacta ccaacaccac atatcgaaca aaaatatcca 4020 cagtgcacgt ggcaacggat atggaagaat gttttcaaca aaaaaatcat ttcggaaaac 4080 caagtatcat ggtatctttt tgtaaatgaa aaaatcccgt gtggtaaaca acaatttcgt 4140 ttcggtagaa ctgcaagtga aaggtgtgtg ttttgtcctc aagagattga agactggaag 4200 cacagatatg caaactgcga aaaagtgcgg aaatgttggt ggtattcctt aacccagata 4260 aaattgataa ataggaatcg aactcgaaat ttgagtttcg acgattttaa atttcctgta 4320 ttgagtcgcc ttaatatgct tgaaagaacg gaagcttcaa aaatatttat tgaatatttg 4380 aattacataa aaagcagaag cagagaagaa ctggatttgt ccgaacttgt atttaatttg 4440 aactgtaaga catgaaattg gcactttaac aaaaataaaa cgagaaggga taaactmact 4500 aacacataat taacatcaag tagcaaccaa tgtcaaacaa aatagaaaaa aaaaaacaca 4560 aaaaacatac atacttattg tattacaata ttaaacggaa ataaaacaag tttttagaat 4620 ttttagaaaa aaaaaaaaaa 4640 // ID Copia-13_DPu-I repbase; DNA; INV; 4310 BP. XX AC scaffold_22; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_DPu_; KW Copia-13_DPu-LTR; Copia-13_DPu-I. XX NM Copia-13_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4310 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 689-689 (2010). XX DR Genome; scaffold_22; Positions 1125443 1129752. XX CC Positions [1664-2188] - Integrase core CC 'CAATA' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS join(548..2683,2687..4300) FT /product="Copia-13_DPu-I_1p" FT /translation="MSHISAIELLASQLKDLNEPVTEAQVMTKILVTLPPS FT FRHFLSVWDNVPAKNRNIQTLTQRLLKEENVTKIYNNGQSDAADSAFFSNN FT FPPQQSDRHSRGGFNNRRSRGGRPGRGAGTRHPHIRKCNYCGDTTHLYATC FT RERLRNERDKKGSPGEKSNLAKDSNKNHGDHSYHSSTQQTIRNDSTWYADS FT GATRHMTDQRNILWNFKPDDSTPRYVTGIGNTQLLTEGQGDVKATTMINGA FT EVPITIKNVLFVPKLGTNLFSIGASTNEGVEAVFTNNEAKFYRNGNLELTG FT KRTDNTLYQLNITAKVAYEDSALAANLVLPLSTWHERLAHVSNSSILKMAN FT QNLVAGLNLSSHHSDSKVPCVGCTLGKMHRSPFPQGRTRGTDIGSLVHSDI FT CGPITPPTPAGSRYFVTFKDDYSNWTTVHFLKTKDEVPDLFRAYTMFLKNS FT TGKVVKTLRSDNGGEYTSKNFSTWLQNSGIRHETTAPHTPAQNGVAERANR FT TILEAARSLIYTRNIPLKLWAEAIAYAVYTLNRVVSKAASVTPFEMWFHKK FT PDVSYFRIFGARTFVHTPDATRKKLDPKSQEGIFVGYCDASKAFRIWVHEK FT QRVVISRDVLIDENNVFKGPSNEEKHILIFPESDFIEERTDGNPTPDEESS FT GSTSGPEFEQLRKDPTNAEREQNPEGEQRPEGEQRPEGEQRPEGEQHPEGE FT QHPEGEQRPEGENLDEMEENSTLVQPRRSSRQPVYSEKFKNWRRDLGLLSC FT ANQPHEPLNYTEAITSDEANLWKPANDDEYASLMKNETWARLVAKGYTQSQ FT GIDYQDTYAPVVKQCSLRTVLSLVAALDLEFTQLDIKTAFLYGELEEELYL FT EQPEGFVIAGRENEVCRLKKSLYGLKQSPRAWNTKFNEFLVKFGLTRCDSD FT SCVYYRRQEEGEITIVCIFVDDGLICTNKKSVSTSILNFLNMLFEIRSLPA FT DRFIGLDILRDRPNKKLFISQIDFVNKILKKFNLDLCHPKSIPADPNARLT FT AAMSPSTQPDPISPHLTRYREAIRCLMYIMTTTRPDIAFAVGQASQFCQNP FT GDGHWNGVKRIFAYLAGSAHLGLCFDGQQKNNLTGFTDSDFAGNLDNRRST FT SGFVFLYNGASVSWSSKLQQCVSLSTTEAEFVAASETSKEAFWLQQFLREV FT KGDEIGPIPILCDNQGAIRLIKNPEFHQRTKHIAVKYHFVRHQQHNGNIEV FT SYVPTENQLADMLTKPLPGPRFTFLRDQIGVKPLPQTPTV" XX SQ Sequence 4310 BP; 1393 A; 1087 C; 870 G; 960 T; 0 other; ggttatgggc ccagttcaca ttgtcttaat ctaagtgtta cactgatatt gattcaaaca 60 tgtctatgtc gacgtcaaaa gacattagcc acatagtcaa atttgacgga agtaactttc 120 aacaatggaa gttcggttgc agattattgc tagaatcatt caatttactt gatattgtag 180 atggagtaga aaagattcct gctgctgtaa gttatttttt gtctacgagt aaattgtttc 240 atttactcat gcactttctc ataggattcc aatcaaaagt tgattgatgc ttggaaatcc 300 aaagatgtac atgctcgtca ttacctattt gctacaatag agagacagca acagaatacc 360 ctttatagat gccagtcagc aaacgacatg tgggtcaggt tgaccacaca acatgctcag 420 aatgcggctg agaataagca tcttctgatg caacaattct ttgaatacaa ataccaccca 480 ggtagtcact acccctacgc ttaacagtac acaccactaa atttattctt gtgtagatca 540 cagtgtcatg tcccacatat ctgccatcga gttacttgcc tctcaactca aagacctcaa 600 cgagcctgtg actgaagctc aggtaatgac caaaatactg gtcacacttc cacctagctt 660 ccgacatttc ctgtctgtct gggataacgt accggccaag aacaggaaca tccagacact 720 tacccagagg ttacttaaag aagaaaatgt taccaaaatc tacaacaacg gtcagtcgga 780 tgctgcagat tcagccttct tttcgaacaa cttcccccct caacaaagtg atcgacacag 840 tcgtggagga ttcaataacc gtcgcagtag aggaggaaga cctggaagag gagctggcac 900 cagacacccc cacatcagaa aatgcaatta ctgtggcgat actacccatc tctacgccac 960 atgtcgggaa agattaagaa acgaacgaga caaaaaggga tcccctggag aaaaatctaa 1020 tctagccaag gacagcaaca agaaccatgg agaccacagc tatcattcat ccacccaaca 1080 gaccatccgt aacgactcca cctggtacgc cgactcagga gctactcgcc acatgacaga 1140 tcagagaaac atcctgtgga acttcaagcc tgatgactct acaccaaggt acgttactgg 1200 tattggtaac acccaactcc taactgaagg ccaaggagat gtcaaagcca caacgatgat 1260 caatggtgca gaagtcccaa tcaccatcaa aaacgtactc ttcgtaccaa aactcggaac 1320 gaacttattt tccattggag catcaaccaa cgaaggagta gaagcagtat tcacaaacaa 1380 cgaggccaag ttctatcgca atggtaattt agaacttact ggaaaaagaa cagacaacac 1440 gctctaccaa ctgaacatca cggcaaaagt ggcctatgaa gactctgccc tcgcagccaa 1500 cctggttctt cctctatcaa catggcacga gcgtcttgca catgtgtcaa acagcagtat 1560 tttaaaaatg gcaaatcaaa accttgttgc aggtcttaac ttatcctctc accattcaga 1620 ttcaaaagtc ccatgtgttg gatgcactct aggaaagatg caccggtcgc catttcctca 1680 aggaagaaca agaggaactg atattggatc cctcgtccac tcagatatct gtggcccaat 1740 aactccacca acgccagccg gatcacgtta ctttgtgacg ttcaaagacg actacagcaa 1800 ctggactact gtccatttct taaaaacaaa agatgaagtt cctgatcttt ttcgtgccta 1860 cacaatgttt ctcaagaact caacaggaaa agtggtcaaa acactacgca gtgacaacgg 1920 aggagaatac accagcaaga atttctccac atggctccag aactcaggca ttcgtcacga 1980 aaccaccgca ccacacactc cagcccaaaa cggagtagca gagagagcga atcgaactat 2040 tttagaagca gcccgcagtc tcatctacac aagaaacatc ccattgaagc tatgggcgga 2100 agcgatagcc tacgcagtat acaccctaaa tagagtcgta tcaaaagcag caagcgtaac 2160 accatttgaa atgtggttcc acaagaaacc agatgtctcc tacttccgaa tttttggagc 2220 aaggacattc gtccatacac cagatgcaac cagaaaaaag ctggatccaa aaagccaaga 2280 aggtatattt gttggatact gtgatgcgtc aaaagcattc agaatctggg tacacgagaa 2340 acagagagta gttataagtc gagatgtgct aattgacgag aataatgtct tcaaaggtcc 2400 ttctaacgaa gaaaaacaca ttctgatttt tcctgaatca gatttcatag aagaaagaac 2460 agatggcaat ccaactcccg acgaagaatc atcaggcagc acatctggac cagaattcga 2520 acagctgaga aaagatccaa ccaacgccga acgtgaacag aatcccgaag gtgaacagcg 2580 tccggaaggt gaacagcgtc cggaaggtga acagcgtccg gaaggtgaac agcatccgga 2640 aggtgaacag catccggaag gtgaacagcg tcccgaaggt gaatagaatc tggatgaaat 2700 ggaagagaac tcaacactag tccaaccgag gcgttccagt cgacaaccag tctattcgga 2760 gaaattcaaa aactggagac gtgacctagg actactctcg tgtgccaacc aaccgcatga 2820 gccactcaac tacaccgaag ccataacatc agatgaagcg aacctctgga aaccagctaa 2880 cgatgacgaa tacgcctcac tgatgaaaaa cgaaacgtgg gcccgactag tagcaaaagg 2940 ctacacccaa agccaaggaa tagactacca ggacacctac gcaccggtgg tgaaacaatg 3000 ctctctccgc accgtcctct cccttgttgc agccctcgat ctggaattca ctcaactgga 3060 tataaaaaca gcgttcctct acggcgaact agaggaggag ctctacctag aacaacccga 3120 aggtttcgtc atagcaggac gagaaaatga agtttgccgc ctcaagaaat ccctatacgg 3180 actgaaacag tcccctagag cctggaacac aaaattcaat gaatttctgg taaagtttgg 3240 actcactcgc tgtgattcgg acagctgtgt ctactaccgt cgccaagagg agggtgaaat 3300 cacaatcgtg tgtatatttg ttgacgacgg tcttatatgt acaaacaaaa aatctgtatc 3360 cacctccatc ctcaatttcc ttaacatgct ctttgaaatc cgctctttac cagcagaccg 3420 ctttattggt ttagacattc ttagagacag gccaaacaaa aagctcttca tatctcaaat 3480 cgattttgtc aacaaaattc ttaagaagtt taatcttgat ttgtgtcacc caaagtcaat 3540 cccagctgac cccaatgccc gtctaacagc tgcaatgtca ccaagtaccc aacctgatcc 3600 aattagccct catttaactc gctacaggga agcaatcaga tgcctcatgt atatcatgac 3660 aacgacaaga ccggacatag cctttgctgt aggacaagcg tcccaattct gtcaaaatcc 3720 aggagacgga cattggaatg gtgtaaagag aatatttgcc tacctagctg gatctgccca 3780 ccttggactc tgctttgacg ggcagcagaa gaacaaccta actggattca ctgactccga 3840 ctttgctggg aacctggaca accgccggtc aaccagtggc ttcgtttttc tgtacaatgg 3900 agcctcagtg tcatggagca gcaagctcca acaatgtgtc tccctctcaa caactgaagc 3960 tgagttcgtg gcagctagtg aaacttcaaa agaagccttc tggttgcagc aatttctgag 4020 agaagtgaaa ggcgatgaaa tcggaccaat tcctatactc tgtgataatc aaggagcaat 4080 ccgtctcatc aagaatccag aattccatca gcgcaccaaa cacatagcag tcaagtatca 4140 ctttgtgcga catcaacaac acaacggtaa catcgaggtg tcgtatgtcc caacagaaaa 4200 ccaactagcc gacatgctga caaaaccact cccaggacct cgattcactt tcttacgcga 4260 ccagatcggt gtaaagccac ttccgcaaac ccccacagtt tgagaggagg 4310 // ID Gypsy-60_AA-LTR repbase; DNA; INV; 216 BP. XX AC supercont1.279; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-60_AA_; KW Gypsy-60_AA-I; Gypsy-60_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.279; Positions 1278167 1278382. XX SQ Sequence 216 BP; 64 A; 58 C; 31 G; 63 T; 0 other; tgtaatgaac tttatactaa atgggcagaa cccagtgaac tgtagaacac taatacctac 60 acaactacac taatttgata tactttgaac tttacaattt tacaatatac agtcagtcta 120 ttccaagcca ccggagtaat cacttcgagt ccgaaacgct ccaattcccc ccgagtcctt 180 gtccgccgtt gacccccctg tagtttgctt tttaca 216 // ID DNA8-80_AP repbase; DNA; INV; 849 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-80_AP. XX NM DNA8-80_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-849 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2016-2016 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 849 BP; 261 A; 112 C; 101 G; 375 T; 0 other; tagggttagg atgtttatga ttttgcatat tttttttagg ttgtccaata gcatccattc 60 ttatacagaa atatgtttca tgatatctgt attcaaaata tcaaaatttt attttgcata 120 taattgcata ttttaccatt ttttttttta aagtgcataa tttgtgtttt ttctgttttt 180 aagcgaatat ttgacggttt tcgatgcaaa atgctgtaaa acaccgtaat tttatatttt 240 actgaatctt atataagttt gattatttgt atttcacaaa ttatcactta tttattgttg 300 agaattcaat atggttacac ctataggtac ctacggagta cctacttaaa tagaattttc 360 gataagcctc gtcaattgtc acagtcaaaa cttatcttaa aactgcgagc cattacaatt 420 ttatcagata ttttaacaga acgtttaata atttatagtt agtttccatc ttgtctacga 480 ccgatgtaga gtaccttata ttgcgcgttt cgtgttttat aataatttaa ataaatacaa 540 ttttttacat agagattttt cgtttaatag attcaaaaac gtattagcac caaacagacg 600 aagatttaca gcggaacatt taaaacaaac tttaattatt caatgcaact ctttttaatt 660 ttttttatat tttatttaat ttttttcagt tttttcaatt tgtctatatg tttatttcat 720 tttttgtcca ataatttgtt tattttttaa tgcatatttt tcaaattttt catgcatatt 780 tagctggttt ttagtgcata agtgcatata tattcataca ttttttagtg cataaatatc 840 ctaacccta 849 // ID Chapaev-14_HM repbase; DNA; INV; 4624 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4624 RA Bao W. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 9(2), 360-360 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1330..1962,2091..3482) FT /product="Chapaev-14_HM_1p" FT /translation="MPNQAKNHDENRKFVCLLCLKKCLRQLTTFQADRICQ FT LYKTKINISDTRVPKGFCESCRITLRRKQEGKEVALPPLFDFKSIQVRPET FT RGTMCCCLICRIGRSKLQEKHPLIEQTTIEKKPKVRCSKCLSPLGKGLPHQ FT CSQIMLRENIQSLAAKDAKSSEQIASRVISNKELSPEGTILLAKASGGPPL FT TVTPGKYLYLNNFNIYIEIKSNMYVLVLGQSSAQTPSSAPKLTTVALVNVQ FT LNTGLSNNGLKKLVRTVNKTSSSKMVERNFYMNFLRLGQQLSDFFTCSDLK FT ITDEKTDSCTVHVVAHCNDLSSLVNHVIQSRKISGGYFVKLGIDGGKGFLK FT FCLSIVDTALRDETSSPHQQQPLTQRTGKDTGVKRQIIVGVSEDLPETHSN FT IEQIWSLINANGVKLILACDMKVANIICGLQTHSSTHPCTWCDVKSNCLAK FT SGNLRTLGSINSSLESFQQSGGVVANAKLFGNVIRKPIICGPVNVLILEII FT PPMELHLLLGVVNHLFKILKTAWIEAVKWPAALHIQLSPYHGGQFEGNECR FT KLLKNVDMLQQLAETNMVPNIVCIIEAFKSFNLVVKACFGYILQPDFAEKI FT XXFKKCYLMIIGTSITXKIHAQSSTTSRISSVTKDPPWGPIVNKLSKLLIK FT IFINIGRGISEIIIIQSTKKDCCAA*" XX SQ Sequence 4624 BP; 1562 A; 781 C; 820 G; 1453 T; 8 other; cacggttagc acggatcctt ataatgccta cctagctgtc ccaagaacgc cacttatacg 60 gaaagttagc gtacacatca aggtaccctg tgatgcaaat tttgaaatca tttaattttg 120 tccgcggaaa ataatttaat ttgaatgttc aaattaagtc aaattttgac ttcatagtgc 180 gccaataaaa acacttgtag aatttgaacc actcaatata atttgataaa aattggtgca 240 aatgtagcca atagtcacac caaactctgg acaaaatttc agcctgattg gatgactggt 300 gtaggattta ttgcagtttt agttgaagtt ttgtcagttt tttgctactt atacctcaaa 360 gcgttaccga ttactttgaa attcggtcac tatttttaaa ataacggtta catgagtgtt 420 ctactatgta ataccaacaa tttttgttat aattatttta tcggtacatg tacagttttc 480 tattttatag accaaaatag tgatttgcta cttaaagcgt gatttgaatt ggcttatgcc 540 aatggcgttg acacggcctt caaagtggag ggggtcaacc cctcaatttc taaaataaaa 600 tccataattt ctaaaataat tgcctatatt tctaaaacaa aaggtcctcg caaaaaaaaa 660 gtgggcaggc catggctccc ttgacggcca tgtatgcata gcgtgaatat accaacattt 720 cctaaggtat aaactactgc cagaatgcaa aaaaattcta aaaaaatggt taagaaggat 780 ttctagggat acctacatag aaaataaaaa tatttattta tctgaaaaca gaacaactgt 840 gaaaaaacca aatatttttt tatattgaca aagataattg acttaaaaaa tttagataag 900 ttaactaaga tatctgacca atacaaagtt tggagggcca gtcggaggaa aattactcta 960 actgaataac taataattag taaacccttc aacctaatat taggttgaag ggttgttttg 1020 aaataaaaca acccgtgtga caagtatcat tagtttaata agtgtgacaa ttatcaataa 1080 tgtaatttcc ttacaagtaa atagtttaaa caattgtttc ctggtaagga aagaactatt 1140 caagaattat ttccttgtaa tttaatatta cctcaattaa atcaagtaaa tatttccttt 1200 attttaatag tttttaatta ataatatgtg ttactatgtt aagtggattg ttatcaaata 1260 taaatttagg tcaaattttg taaaaggcag agtttgattt agtgaagtca aatttagtga 1320 attgccacga tgccaaatca agcaaagaat catgatgaaa atcgaaagtt tgtgtgcttg 1380 ctctgtctga aaaaatgttt aagacagctt actacttttc aagcagatcg aatatgccaa 1440 ctgtacaaga ccaagatcaa tatcagtgat actcgggtac caaaaggatt ctgcgaatcg 1500 tgtagaatta ctttgcgcag gaaacaagag gggaaagaag ttgcactccc tcctttgttt 1560 gattttaagt caatacaagt gagacctgaa acaagaggca cgatgtgttg ttgcctaata 1620 tgccgaattg gtcgttcaaa gctgcaagaa aaacatcctc ttatagagca aaccactatt 1680 gagaaaaaac caaaggtgcg ttgcagcaaa tgcctcagcc ctttaggaaa aggtttgccc 1740 caccaatgct ctcaaataat gttaagggaa aacattcaat ctttagcagc aaaagatgcc 1800 aaatcatccg aacaaatcgc atctcgagtt atttccaata aagaattatc tccggaagga 1860 actattctac tggctaaagc atccggcggg cctccactta cagttacccc tggtaaatat 1920 ttatacttaa ataattttaa catatatatt gaaataaagt cttgactatt aagaaataac 1980 actagataga tttcaagaag tatatttttg acaagtttct aatttcttta aaaacgaata 2040 atttgaaagt tttacacaaa cttactaaac aaagtttatt gtaactataa aatatgtatg 2100 ttttggtttt aggtcaaagt agtgcgcaga caccatcaag tgctccgaaa ttgaccactg 2160 tagcccttgt aaatgtgcaa ttaaatacgg gcctgtcaaa caacggatta aagaaacttg 2220 ttagaactgt aaacaaaaca tcaagcagca agatggtaga gcgaaatttc tacatgaatt 2280 ttctaagact aggtcaacaa ttgagcgatt tcttcacctg cagcgactta aaaatcacag 2340 acgaaaaaac agattcttgt acagttcatg ttgttgctca ctgcaatgat ttatcaagtc 2400 ttgtgaatca tgttattcag tcccgcaaaa tttctggtgg ctattttgtt aagctcggaa 2460 ttgatggtgg caaaggtttt ctcaagtttt gcctaagcat tgttgatact gcgttaaggg 2520 atgaaaccag ctcaccgcac caacaacaac ctctcactca aaggactggt aaagacactg 2580 gagtcaagag acaaattatt gttggggttt cagaagattt gccagaaaca cactccaata 2640 ttgaacaaat ctggtcactg attaatgcaa atggagttaa actaattttg gcttgtgata 2700 tgaaagttgc aaatataatt tgtggtcttc aaactcattc aagtacgcac ccttgcacct 2760 ggtgtgacgt caaatccaat tgtcttgcca aatctggaaa cttgcgaaca ctgggctcaa 2820 ttaattcaag ccttgagtct tttcagcaat ctggcggggt tgttgctaat gcaaagctgt 2880 ttggcaatgt cattcgaaag ccaatcatct gcgggcctgt caatgttctg attttggaaa 2940 tcattccacc gatggaattg catctcctct taggagttgt caaccatctt ttcaagatcc 3000 ttaaaactgc ttggatagaa gctgttaagt ggccagccgc cctccacatt caactgagcc 3060 cgtatcacgg gggccagttt gagggaaatg aatgcaggaa gcttcttaaa aatgtggata 3120 tgctgcagca acttgctgag acaaacatgg ttcccaatat tgtgtgcata attgaagcat 3180 tcaaaagctt caatttagtt gtcaaggctt gctttggata catccttcag ccggatttcg 3240 ctgaaaaaat tsaaartttc aaaaagtgtt acctaatgat aattggtaca agtataacac 3300 scaaaattca cgcccaatct tctaccacat caaggatttc atcagtcaca aaggatcccc 3360 cttggggccc tatagtgaac aaactgtcga aacttctcat caagattttt atcaacattg 3420 gtcgaggtat aagcgaaata ataatcatcc agagtacaaa gaaagattgt tgcgctgcat 3480 aattgatcta aacagtaagc atctttaaag aacatcattt gagttattag aaattaggct 3540 gctgaaatty atattttacg gcgtataatg aatattttca ccatctttaa aagtagtatt 3600 atagagtatt tgtattaaaa gagtacattg agctatttat taataatctt ttggtggaga 3660 aaatcaaaaa aacatttctt aaaatgtaat tagatatttg atggtaatac aatgtaaaaa 3720 atagggttta aatagttttc acttcatttt aaaagtaaat gtatttaagg aggtcaatac 3780 aaaatatttt aggagatcta cttttttttt gttttactaa ctggtagtcc tttgaccagg 3840 ttttgcactt ttttttagct tttttttttg gcagcagttt ataccttagg aaatgttggt 3900 atattcaagc raagcataca tggccgtcaa ygccggggag gctagggggt catgtccttc 3960 tctacttttt ttttgcgagg accttttgtt ttagaaatat aggcaattat tttagaaatt 4020 atggatttta ttttagaaat tgaggggttg accccctcca ctttgaaggc cgtgtcgaca 4080 ctattagcat aagactattt aaatcacgct ttaagtagca aatcactatt ttggtctata 4140 aaatgaaaac tgtacatgta ccgataaaat aattataaca aaaattgttg gtattacata 4200 gtagaacact catgtaaccg ttattttaaa aatagtgacc gaatttcaaa gtaatcggta 4260 acgctttgag gtataagtag caaaaaactg acaaaacttc aactaaaact gcaataaaty 4320 ctacaccagt catccaatca ggctgaaatt ttgtccagag tttggtgtga ctattggcta 4380 catttgcacc aatttttatc aaattatatt gagtggttca aattctacaa gtgtttttat 4440 tggcgcacta tgaagtcaaa atttgactta atttgaacat tcaaattaaa ttattttcyg 4500 cggacaaaat taaatgattt caaaatttgc atcacagggt accttgatgt gtacgctaac 4560 tttccgtata agtggcgttc ttgggacagc taggtaggca ttataaggat ccgtgctaac 4620 cgtg 4624 // ID CR1-32_BF repbase; DNA; INV; 3220 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-32_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-32_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3220 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3220 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1603-1603 (2009). XX DR [2] (Consensus) XX SQ Sequence 3220 BP; 993 A; 624 C; 797 G; 806 T; 0 other; aaaaccacga gattaggtaa ggtaactccc ggcaaaaacc ggctgttgct tgtatctcta 60 aacgatccat ctgacagata ccacttgttg aaaggggcca agctgcttaa caacagcagt 120 agctttaaga acattttcat tgcaccagac ttgactagga aggaaaggga aatcagccgt 180 aggcagcgtg aggaaaagcg ccagtgtggt ttggtagggg agaacaatag aaatagaata 240 tgacaggcag acgaggatgg gcacaacact acacaggatg ggggggaacc cttctattgt 300 aatgacaggc cattgtcagt attatacaca aactctgacc aatttcccaa caagcgggag 360 gaattgatgt tttctatagc aaatgaccca ccagatttga ttgttttgac tgaagtaatt 420 ccgaaggctc aagttaaccc actcagaatg tgtacaatag ctctagatgg gtacgagatt 480 tacacaaatt ttgatcctga cgagctaaat cttggaccca gtgggaccag gggcatagct 540 atatatttca agaaatcact cagcttacag gtgcgggtac tcgactatgg ctctgtggga 600 ttccatgaac agttatttct gaactttggg ttgggaaatc aggtctcctt aattgtagga 660 tgtgtgtata ggagcccatc ctctgagcca gtagatagca ctaccaaact ctgcaactta 720 ctgtctaggg tgtgtaagga gaacgccaag catgttgtta ttactgggga ttttaacata 780 aaggagattg attggtgtag tgggatgtcg acagtgggta caggtcacca tggtcagttg 840 ttactggatt gtctgaatga caacttcctc acacaacatg tcatggaacc cactagacac 900 aggctaaacc agcaaccttc agtcttagac ctggtaatta caaatgaagt catgactata 960 ggtaatcttc agtatgcacc cgggctgggg agaggggatc attgctgctt aactttttcc 1020 atagactgtg tgttggacaa aaccacagag tctttgccaa gacgaaactt tgggaaaggc 1080 aactatggga aggctaagga ggacctcagc aaagtagatt gggacgaaat gctgggggat 1140 ctggatgtga caggagcatg gtctcagttc tgtggtatcg ttgagaaggt tgtagatgac 1200 tgcatacctt tgacgaaacc caaacagaaa ccaaacaagc tgtacatgaa tagacaggca 1260 atgagggcta gaaagaaaaa aagtagggca tggtctaagt gggcaaagtc tggtaaagtt 1320 tatgactacg tgaggtatgc taaggcaagg aactatttga ggtctttgac aagacaacta 1380 tgtagttcgt acgaacgaaa gctagttaag gatttgaaaa gtaatcccaa agtgttctgg 1440 cggtatgcaa aatctcgtat gagtaccaga cccaagatag ctagtcttgt tcagcctgat 1500 ggcaatatag cagagtcaga ctacgacaaa gcagatgtgt taaataaata ctttgccagt 1560 gtgttcacta atgaggacat ggccaatatt cccagtatga gtaccaaacc aggagttaaa 1620 acccttgaca ggattatgac cagtcaggaa atagttaggg ataaacttgc caacttaaac 1680 cctgccaagt cggcgggtcc agatgatttg cacccaagtc ttttgaagga actagctgac 1740 cagttagctt acccgcttac acaaatattc aataaatcac taagtgttgg gaaactacct 1800 agcaattgga aagaggcaca tgtaacaccc atacacaaga agggcagcag gactgagcct 1860 ggtaactaca ggccagtcag tttgacgtcc gtggttggga aggtgttgga atcaatcatt 1920 cgagacattt tggtggatca ttttatggtt aacaatcttt tcactgactc acagcacggt 1980 ttcgtcccta aaagatcatg tacaacccaa ctattagatg tcatgaatga ttggtcccta 2040 agcctggaga ggggtgaacc agtcgactct gtatatttgg actttaggaa ggcctttgat 2100 tctgtgcccc accagaggtt attagtgaag cttaaagcat atggaattga tggtacactt 2160 cttacatgga tgcgagactt cttacaccaa agaagacaga gggttgtgat aaatggttca 2220 cagtctcctt ggtgtgaagt tacaagtggc attccgcagg gtagtgtact tgggccgact 2280 ctatttgttg tctacataaa tgacttaccc gatgtgataa caagcacagt taagattttt 2340 gcagatgata gtaaaattta ccgacccatc tgttcacatg ctgaccaggt ggctttacag 2400 cgtgacttat gtgcggtgga gcactggtca gaaatctggc agttaccctt taacgctggc 2460 aagtgtaaga ttctacatct tggaagtaag aaccagaggg ctaagtacac actaggtggt 2520 catgagctag agcaaactag agttgagaaa gatctaggtt tggcagtgga tgatcagctt 2580 aagtttcatg ttaatactgc agcagcagct ctgaagggta accaggtttt agggctagtg 2640 aagagagcgt ttaccaattt agatgaaagt tcagtaccca ttttgtataa atgtatggtc 2700 aggccacacc tggaatatgc aaatgtggtc tggggtcccc attttagtac tgaccaaaag 2760 attatagaga gagtccagca tagggcaaca agactggttc cctcgctgaa agggatgccg 2820 tatagttcca gactcagaaa gttaaaacta ccaactctca agtacagaag agaaaggggc 2880 gatatgattc agctctataa gatcatgacc aaaaaagagc gaatcgatcc cgagagcttc 2940 tttgagctag caaacctcga caaaacaaga ggtcactgct tcaagattaa agtgccacta 3000 gccagatcta atgtgcgacg ccagtcgttc tccgcaagga cagttcgatt gtggaattct 3060 ttgcccgagg aagtggtaac agcaaacagt gtgaacactt ttaaaacaat gctggacaag 3120 ttctgggagc ataagagata caaggaatga agatctagag gttgtgccaa caggcgggag 3180 ccttactaca acgaagtatc ctcaaggtat cctcaaggta 3220 // ID EnSpm-15_HM repbase; DNA; INV; 8152 BP. XX AC . XX DT 16-JAN-2009 (Rel. 14.02, Created) DT 16-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-8152 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 386-386 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(356..856,801..1646,1538..2707) FT /product="EnSpm-15_HM_1p" FT /translation="MINCAYCEYTASCSNSYKCHLKVFHNKHSYGERLVCS FT QGGCPMEYNSFQSLSKHIEKSHQEVILGPTLEEADTMQRCFHSNNLSLAVH FT NTEPVCPDNMLKFGIKFDETDGIFEDSAYFISKLRSYSNIPLTTTLKIVDA FT CSDFVSSIVTGISNETQGIFDKHGLQKRQELVMRPKEYLTSMVSKKDEYNN FT FFENILRLQKPFQGLESLHKQTNYFEKKGYYIKPKSYPIGHFINTQKTAKG FT VVCSTCIATGQYVSLKQCLEVFLCLPGVLDEIRSNTKPSTNGLVSDFKDGE FT LWQNHQLRRQYAHSQNTFVIPVFCFFDDLETANPLGSHSTVHKIGALCTIL FT KCLKPLHNSKLENILLSAVIYSSDRIKYSNKDAFSIYIEEMTELETKGFTI FT KIDGNEFKIYVVLAQIIGDNLGLNGILGYVESFTATYPWVMWKVLLLHTHA FT EYAKQSWFKWNIRLCRKFYCNIPMGYVESFTATYPCRICKMKRKDFDSIFV FT ESVELLRTRECYDYDVSLHNCSLNGIKEKCCFNKIPSFHVNDNIYCDIMHD FT LLEGICKYVFQKMLNYLVFQKKFFNLGDINSRXRNFSYDHSSIPSCPTEYQ FT IKNGSINIGAIEMLNLVLSFPLLVGDLVPFEDNVWVVFLLLRQIVLYSFGL FT YFSKPDLLXFGSLITEFLQEYCFAFKCGLTLKFHNLIHYPRIIKMLGPLSH FT MWVMRCEGKLRGFKRTASSVGNFKNVCKTVAIRHQMDQSTRFMAKHGLKNN FT EFCVAKTEPILLCHVIDGKTISELIGNYGLYREIFQTKCVSVNFVSFKIGD FT VVIYAVEELYPAFCLIKQIFVSDTNLFYLSKVVYYN*" XX SQ Sequence 8152 BP; 2881 A; 997 C; 1153 G; 3114 T; 7 other; cactgcaaaa aaatgttttg ttaatttaac caaatatgtt tggttaagta aatttggtaa 60 aagttacaaa cttgtttggt aaaatatacc aaatcttttg tgaactttac caaattgatt 120 ttgttatttt aaccaaatta tttggtttga tgatcttttc ggtttttgtc ttaacgcaaa 180 atataacaaa tttatttggt taaatttaca aaattgtttt tgttattttc accaaatgcg 240 ccacgggatt gcaaatatta aaaaaaacta ggttttgtcg ccattttttt tcttcattaa 300 ttgtcaggta tcgtagtttc aactaaaaag aatttttatt aatttcacta taatcatgat 360 taactgtgct tattgtgaat atactgcaag ctgcagcaac tcttataaat gtcatctaaa 420 agtatttcac aacaaacata gttatggcga aagattagtt tgcagtcagg gtggttgtcc 480 gatggaatat aattctttcc aaagtctttc aaaacatatt gaaaaatctc atcaagaagt 540 tattttagga cctactttag aagaagcaga tactatgcaa cgatgtttcc attcaaataa 600 cttgtctcta gctgttcaca ataccgaacc tgtttgccca gataatatgt taaaatttgg 660 tataaaattt gatgaaaccg atggtatatt tgaagattcc gcttatttca tttctaaact 720 aagaagttat tctaatatac ccctaaccac tacattgaaa attgttgatg catgttcaga 780 ctttgtgtca tcaattgtaa caggaattag taatgagacc caaggaatat ttgacaagca 840 tggtctccaa aaaagatgaa tataataatt tctttgaaaa tatcttacgc ctccaaaaac 900 cttttcaagg cttagaaagt ttgcataaac aaacaaatta ttttgaaaag aaagggtatt 960 atataaaacc aaaatcatac ccaataggtc attttataaa cactcaaaaa acagctaaag 1020 gagtagtttg ctcaacttgt atagcaacag ggcaatatgt ttcattaaag cagtgtcttg 1080 aagtttttct ttgtcttcct ggtgtacttg atgaaataag aagtaataca aaaccatcta 1140 ctaatggttt agtttcagat tttaaagatg gagagttgtg gcaaaatcac caattacgcc 1200 gtcaatatgc tcattcccaa aatacttttg taattccagt tttttgtttt tttgatgatt 1260 tagaaactgc taatccatta ggttcacata gcactgttca caaaattgga gctttgtgta 1320 caattctaaa atgtttaaaa ccactacata attcaaaact tgagaatatt ttgttaagtg 1380 cagttatata ttcttctgat cgcattaagt attcgaataa agatgcattt tctatataca 1440 tagaagaaat gacagaatta gaaacaaaag gttttactat aaaaatagat ggcaatgaat 1500 ttaaaatata tgttgttctt gctcaaatta ttggtgacaa tcttggttta aatggaatat 1560 taggttatgt agaaagtttt actgcaacat acccatgggt tatgtggaaa gttttactgc 1620 tacataccca tgcagaatat gcaaaatgaa acgtaaagac tttgacagta tttttgttga 1680 atctgttgaa ttacttagaa ctagagaatg ttatgattat gatgtttctt tacataattg 1740 ttccctaaat ggtataaagg aaaagtgttg ctttaataaa attccatcat ttcatgttaa 1800 tgataatatt tattgtgata ttatgcacga tttgttagaa ggcatttgta aatatgtttt 1860 tcaaaaaatg ttaaactatc tagtatttca gaaaaagttc tttaatctag gtgacataaa 1920 tagcagaatr agaaattttt catatgatca ctcgtcaatt ccttcatgtc ccactgagta 1980 tcaaataaaa aatggttcaa ttaatattgg tgcaattgaa atgctaaatt tggttttaag 2040 ctttccttta ttagttggag atttagttcc ttttgaagac aatgtttggg ttgttttttt 2100 attgttaaga cagattgtcc tttactcatt tggtttgtat tttagcaaac ctgatttatt 2160 ayattttggt tctctcatta ctgaattttt acaagaatac tgttttgcat ttaagtgtgg 2220 cttaacttta aagtttcata accttattca ttatccacgt attattaaaa tgcttggccc 2280 gctaagtcat atgtgggtaa tgcgttgcga aggtaaatta agaggtttta agagaactgc 2340 ttcgtctgta ggaaatttta aaaatgtttg taaaacagta gctattagac accaaatgga 2400 tcagtcaact cgattcatgg caaaacatgg cttaaaaaat aatgaatttt gtgttgctaa 2460 aactgaacct attttactat gccatgtgat tgatggaaaa actatatcag agttgattgg 2520 taattatggg ttgtacagag aaattttcca gacaaagtgt gtttctgtta actttgttag 2580 ctttaaaatt ggtgatgttg ttatatatgc agttgaagaa ttatatcctg cattttgtct 2640 aatcaaacaa attttcgtat ctgatactaa tttattttat ttgtcaaaag ttgtttatta 2700 taactgaatg ctatcactta caatcttttg aggttgacat tgaaacagaa ttaattgttt 2760 taaattgctc tcactttgat accattatct caccttggcc gttgaaacta cgaaatttaa 2820 ataaaagtaa atacttgtca ttaatgcaca aaatataact attgatcwtg gcagcattta 2880 acttagtagc atttaaattt atttaatttt tttcttttta ataaaaaaaa tttgtgtaag 2940 tattttatat tctttagggc aaaattgaaa agtcttgttt ttcaaaggag aaatgacata 3000 ctacctataa attttactaa ccatgtcttt actccaaatt aaaaagatat tatatctatt 3060 taatttggag taaagacatt gttagtagca ttctaaattt ttagttatac tcattctaaa 3120 tttgtgtcac gaaaggaact ataggtacct tcaaggaaac aagtttcaat tgttataaca 3180 aaaaatttga aattctttaa acaatgtctt taccttaaac taaataagtt ttataagtag 3240 tatgtaagat attattgcca tctgaattca tctgaacatt tttagttttt ttcatttcta 3300 ttttagtttg caaaggttac tttccttttg aaaaaacgaa cttttcaatc ctgctctttt 3360 ttttgtttct ctaagtttta aacaaatgta gatgtaattt gttttgtttt tagtgaactg 3420 atgttaatta tttatttaaa aacaataaat tatattattt tatattataa tatataaccc 3480 acaaaattat tatattatag ataagtatta tttttgctat aacttctaaa atgatatatt 3540 tacattactt catttatcgg tttccttatt catttagtgt tgttttttct aaataatttt 3600 atataataaa ataaaaaact tgattataaa atgattaagt aaagttgttt ttctgttact 3660 ttatttttcc tcatttttat agtgttatta aaaatgaagg ataaaaaagg gagaagtaga 3720 atgcaatgtc tcaaagcagg atgtgattgt gatgaataca caaaagagaa aggttttgat 3780 ttgtgtgcct attgtgatca tgcaccagtt ttacataaag gtattttgat atcacaagta 3840 tttaatttat attataacat ttctaaataa ttagtttgta tctgtggaaa cttgatgctt 3900 agtattgttt ttttttaaaa catataactg acaagtaatt cttttgtata ctttagtgga 3960 agacaatttg aatcttgttg aagatattga gttagctcat ttctctaaga gcaatgcaat 4020 actaaacacg ttatccagta cagttatttt tttatacttt tttgtcatgt gtttaaaaat 4080 tcacctaaca gtaaaatatg ttagtggtaa aagttacgaa tagcaaatca ttatatagtt 4140 aaaaccaaat taatgcatgt tactcgtttt tttgttttta atattatata cataatattt 4200 ttatcaaaaa tatacctaat agtttataaa taatatttag cctctctcag tatgtcaagc 4260 gtgttatttg aaccccaaaa ctgtaagaaa gaaaacggat atgctaactt tggtatctat 4320 tataattatt tttttgtctt acatattatc ctataataag gtaacagttg ttagggactt 4380 agggtgagtg atttaaaaga ggctataaca tttttaaaaa taagagttta cttaaacatc 4440 ttgatttttt tatgtaactt aacataatgt ttctcactct cacaatacta gtaatactgt 4500 ttgcatccta tgtgaataat gcaacctatg tgaatttgat tatagaatca gctactttga 4560 gccaaagtga aatagacttc agtttggaca ataacaaggt aacaagccca ttgaagaaac 4620 agtgtataca acaagctaag ccatttcaag atacagtatt gactggtaaa atagtttttg 4680 catcaaaaac tgtacttcat cataaaaaca atactctctt tgaggtttga aatatttttc 4740 tttataatgt ttaactattt ttataaattc tgtgtaaaga aaatattata atacttaatt 4800 tgcattattg ataacatatc atttgtatca taagtatgat gattttaatt ataaattata 4860 ttttgacttt agtaacactt aaatattatt tataaattta atagaatggt ttagccccga 4920 tcctgaattc ttgcacagat gggattattg tgttaagtgc tttaaatcta acctttagtt 4980 tacgcagcaa gttggctrga atagtagtca accatattat tcaaaagaaa ggattgcagt 5040 aagtaataag ttattagatt ttcaaaatta gttttattta ttattgtctt tacttattat 5100 ttattactaa agttacaaat ataaaattat aacattgtag taaagcaaat gcatatgaaa 5160 atttgataaa attatttttg atatataaat tctttttaaa tttttctgta gaaatcgtcc 5220 gactaatgca gatgtctgtg aatcagcaaa tgccattgtt acacattttc cctcagaaaa 5280 gcttgtaagt aacttataaa ctttatgagt tgatgtgatc tatatgattt ttaacgaaat 5340 cgttgttagt gatatatatg tttaatatct tctgtattag gaaacatggt atactgctcc 5400 tacgagtaaa aaaactatag ctggtggaaa gttgtccgag aagcttcgca acacatggcg 5460 aaatattaaa tcagtgggta ttttaaaaaa ttctaacggc caacaattag ttgaagaaca 5520 tgatagtagc gatggtatgc tttataagat agtttagtta ttataaatag tatattgtgt 5580 tttatatttt tagaaaattt tatatatcat tatgttaatt attaaaattt tctttgataa 5640 attgtatttt gataaaatgt ataaatcttt gcacaatttt ttatcatata atggagaaag 5700 ttttagtatt ttattagttt tatttttaaa cattaaaaaa attatataga agaagcaatt 5760 caagatgacc ttaattggct taaatttaat agtcaaccat ggactgaagt gtgccataaa 5820 tggaaatcaa cttcaaaatt taggttgaaa tgttttcaat ctggcaccaa agacattcca 5880 aattttaaac ttcttttgga tctaaaagca gatgcattgg taagtatttt aaatatttgt 5940 tatgtgttat ttcatatgtc catatttcac atactaaatg ttcgccacag aaatacaaaa 6000 taagtatctt accttttatt aggtaaatat cgattttact caattgagta tgcaaaagaa 6060 ttatgagcat tgtagcaact ttatgtttca aaaaattcca ctacatgcaa aaaaattgat 6120 aggcttagtt gccaagaaat attccatgca agatggttag tgttttattt ttaataaaag 6180 tttattaatt agcatatata gtatgattag cataatatga ttagcataca tatatcacag 6240 tagcataata tgataagcat acatatatca atgacagctt agtcggcaat ctgacagacg 6300 gacagttttt aagatggcgg aaaattacct atgttcatac acctaactga caatggaaaa 6360 aayatattta tacattcatg tgtattatat atgtatatat cagggattat ctttttccag 6420 atattcccgg aattccggat atttttcaag attagttttt ccggatatcc aaaaggtttt 6480 ctatacatac aagtttaagt atatatttgt gtaatatatt tgatttttaa ggttttttag 6540 tcatcactca tacaaaaatt aaagaaattc tcaaaaaaat tgagatacaa ctcagctaaa 6600 agaaaattaa gaggaaaata aggggaaaaa aattccagat tttttccgga tattttaagc 6660 aaataatttt ctggattttt taaatatgac ctggatcatc tatgatatat atatatatat 6720 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 6780 atatatatat atatatatat atatatatat atatatgtat atatatatat atatatatat 6840 ttagatatat atrtatatat atatataaat atatataata taggggtaga agatgagaca 6900 gctgctcttt tggctattgt gcgattgaca tgctgcaaat gttccatcca tttatgcaaa 6960 gaagcaaaga gaaaatcctg gactcctagt gttaaagaat gtggcgaatc atttcttttt 7020 ctagccaatg taaattaaat ataaataaam tacaaaagtt tttattcaat agttattatt 7080 taaagacact aaaatttatt aaatataaaa attattataa taactaagtt gttacttata 7140 ttgcttaatt attattattt ctttattttt tcagtcagtt gaaaatgctg agcaatcaat 7200 taataacatc aggatcttat ataatgacct gaagctgaca cttcaacctt tggtttatgc 7260 tgttccacca aacattgctg ttgttgtgta caacaatcac agatggtcat ttgaaacccc 7320 tttgaaagca atggattttt cgtttaaatt gattcaagtg agtttaatgt tattactgat 7380 atttgcattc ctgtatacag aatttttatc attctcaatt atttgtcttt agatcttgga 7440 taaggaatac aatattgcaa gttttccagc ttggatgttt ttgcaaatat atatatatat 7500 tgttttaatt gccccgattt tgacgtatca tgcccgaaag caagtgaatt ggcttctgag 7560 ttgaatttat aagtttagtt gactgtcttg aattgttata agtatagact gtatagttgc 7620 ataatgcgta aaaaaaaaaa atgtttgtag atgcggtaat aatacaagct aaaacaaatg 7680 taaatattat gtttttttta gagttatatt acctaaactt tagtttttga gagaaagttc 7740 cattttgtct tcgaaatcaa agaaaattta ttggtttttt tacacgcaaa aaaaaagagc 7800 aattttaatg aaatttacca aaatatttgg ttatttcaac aaaaacattt ggtgaatatt 7860 tatgcaccaa atatttttgg taaatttcac caaacggatt tggtgaaaaa tatttcacaa 7920 aaccattttg tgaaatttac aaaatcattt tgtgaaattt acaaaatcat tttgtgaaat 7980 ttacaaaatc attttgtaaa atttacaaaa tcattttgtg aaatttacaa aatacatttt 8040 accaaattat ttggttactt taacaaaaac atttggtgca taaatattca ccaaatgttt 8100 ttgttaaagt aaccaaattt tttgtgaatt tcacaaaacg ctttttgcag tg 8152 // ID Gypsy-140_AA-I repbase; DNA; INV; 7427 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-140_AA_; KW Gypsy-140_AA-LTR; Gypsy-140_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7427 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1013-1013 (2011). XX DR [2] (Consensus) XX CC Positions [3602-4105] - Reverse transcriptase CC Positions [5156-5635] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 4232..5734 FT /product="Gypsy-140_AA-I_1p" FT /translation="MGLMGFYQKFIPRYSHLTAPITDLLKKSKKFKWTDEA FT ELALEELKSVLTSAPVLANPDYTKPFIIETDASQLAVGAALLQQFNDGKRI FT IGYFSKKLSSTQRKYAATEKECLAVLLAIENFRHYIEGTTFTVVTDCKSIT FT WLFSITAANANSRLLRWALKLQSYDFTLQYRKGKDNVLADCLSRIDSIQVI FT DDEHQKLIDNIKKQPQDYKNFKVIDDKIYKLVDDSSKFSDKRFEWRYYPTL FT AEREPLIESIHNIAHLGYEKTLNALKQKYFWPKMASDVKKYCKQCLACKTS FT KGINVNPTPPMGSQKKSCDYPWQFLTLDYVGPFPPSGKGRSTCLLVVTDVF FT TKFIMVQPFKQATASSLVQFLEQAIFLLFGVPEIILTDNGSQFTSKEFATL FT LKRHGVKHWLTPSYHPQVNNTERVNKVVTTAIRATLKGNHKQWTENLQEIA FT CAIRNSVHDSTKFTPYFLTFGRNMISNGKEYENIRNGGASPNTTLSEEERE FT NCIKM" XX SQ Sequence 7427 BP; 2320 A; 1308 C; 1638 G; 2078 T; 83 other; ttggcgccca actaaaaaga tttaaacgtt gcatttcttt caaaagctat ttaaattgga 60 aaaagtgaat tttgaagttg aattttattt attctctaag aaattttgaa attgaattat 120 attattttac attgattttt ctagctataa gtttgaagtg aattttgttt gtagtttcta 180 agatcttgaa tttaatttta aactgaactg aacttgaatt tgattttgaa ctgaattttg 240 attgccaatc ttatttttac tttgaattga attttgttta tagttttaag gatctgaatt 300 gaatttctat tgaattactt tctagttttt aagttttcga attgaattga attttttaat 360 agtccgtaag tttttaattt gaatttttcc ttccataatt gaattgatct gaattgaatt 420 taaactgaac tgaattactt ttccagccta gcctattttt gattcttttt tctacattag 480 ttcctatttt tttcctaaga tttgttttga atttcctttt ttgaatcatc gaattgagta 540 tttttcattg aaaattataa gatttcgcta attttgaact gagttttggg agttgaaaat 600 ggctggtcca tacattcgac cgacggttga tcatttgttg gaggaggaag tagattttga 660 gttacgtctc cgcgaacaaa cggttgatag aagagagaat ttagaggaca agaagagatt 720 gttgagaagg ttgttcaaag aagataggaa gtataatgtc gttcatgaaa gcgtaatcaa 780 gtattcggaa gagattaaga cgatcgcgaa agtggtagaa gagatagtag cgcaattacg 840 aaagaagttt gatagagctt tagtatcgag actgagacac tttttaattc gggtttcatt 900 tcgtttgttg aaaatgagga ggaagcaatc gcaaggaaag aagcatgtga gcagattgag 960 aagattctca tagaattagg tcagcacgtt cgagaagaaa tcggtaaaga agatacagat 1020 acaggtagga aaggttctgg tagtgattca tggagaaatc cagatcctaa ctcagctgta 1080 aagtctaaga aaggagaaga ggaagattca agagcaagga acagttaggt gaggawaagg 1140 ataaagattc gaaatatcgt ttaatcgatc ctggtgctct tttaaagaca ccacggaggt 1200 cgattcgaag ctctttcgag tccctccatt ttgacaaact tgagttgaaa tgttcagaaa 1260 ttaattgtaa atgcggtgag tgtgatggga aaggagcgca gaagaaagtt tcaacgaatt 1320 ctaggggata tcatggagat ttagataaga ataaagaagc tgaatcagga gaaaatgata 1380 gtacgaaagg aagatttgtt tttaggaaag aagtttttgg gaattactca acaaaacaag 1440 aagcgaactt agataaagga aggaaaagcg agataggaga tgagagcgat tatgagaagg 1500 agcgtaaaaa ttacaggttt agagatcctc agtacacaaa taatagagtt gatagatcac 1560 agcgtgatag gacgctgagc ccggtgataa attttagaga aacacggtca aatagaagag 1620 gaatggaaga tagggaaggt aatagaggaa taggaatttc gtctaggcgt agacgatact 1680 acagtgggtc ggaaagtgat tctgagagag atcatagagg aaggtttatt aagcggagaa 1740 gtagtwsaag caggaggcac agacgttatt cstcagcgtc gagtagctcc tcagtaggcg 1800 ctgctgctcg ttacggaaaa tcgagagttg agagctggga tttggtattt tccggagata 1860 gtagatcgat kcaagtwgaa gatttcctgw acagaatcma gaagctwgca agacatgaag 1920 gagtttcmga gagggaactt ttgmtgaaca ttcacmawmg gctaaaggga gaagcttacg 1980 actggtggtt taccagggaa gacmgmttka ctagwtggag aaggtttgag gacgaaatac 2040 gtttccgata tggaaatcca aatcgcgata gaggtatcag ggcccaaatc cgcgaactga 2100 agcagcgkaa gggmgagacg tttatcgcat acgtgacgga ggttgagaag ctgaaccaat 2160 gccttacagc gacctttttc mtckagwack ctwtttgagc tggtttggga gaatatgcgc 2220 ccwcactacm gmtcgaggct ktcggttgtg gacgttgaag atcttgagsa cctgatcgag 2280 atcaaccata agattgacgc maacgatccm agcttctwca ggwmtcmgga kggtcatmga 2340 agwgaasttc wtcasttaga ggsagwcgaa gattcwkcsa gcgastactc ggaagacgag 2400 ccaaacgtts ctcctgtgaa agcaaaccgk mcwgaaagaa caaagacagg acctaksagg 2460 aacsaaccas gaacacaaca tcagcgaatc agcggcagcc ttccacagag cgaaaggatg 2520 gacagcagag gtctccgcta acctgctgga actgtcagag ggtctggcca cgcttggagg 2580 gactgtagag aacagaaagt tctgttctgt tacgcctgtg ggaaaccggg aaggacgaca 2640 agaaattgcg agagsaacca tgtcccamtt tcccagagaa ctgmkcaaag acaatgcgcg 2700 accaaaaaac taatttcggg gtgcttgcca gggaatccga acacctcgcc ttttacttcg 2760 attcccaacc aacaaaatga tcaaaaatta agtaatttct cacttcttga aattcgagtg 2820 gaccctactt catgtcctca cgttagagtc aacgttttag gtacagacat gacagctctt 2880 ctggattcgg gagctgcaat gagcgtcatg agcgctgtcg atatggttga agcccatgga 2940 ttgaaaattc tgaaatctaa tatcaaagtt tgcaccgctg acgacacaaa gcattcgtgc 3000 ttgggctacg tgaatatccc gtacaccctt gggaatgaca ccaaagtggt cccgacgtta 3060 atagtcccgg agattcggaa gccattgatc ctagggatgg atttctggcg tgcattcaaa 3120 attcggccaa tgattacgtc agaggacgga atccaggacc tagaactcac ttggacgtca 3180 tcagcttgac tgaaaatgcg ttaaatctcc tgagatttga gccaggacct tccgctgaag 3240 gcgaggaatc aatagaactc tcggttcagc tggttttgcc tctggagcct gaagtttcct 3300 tcaagtcaaa gccagaggag gacgagtctt tggacgtccc taccttggat ttgccggaag 3360 acccgctcaa atcgattgag acggtggaaa ccgagcataa tctcagtaac ggcagaggaa 3420 agacggcaac ttcaggaagc cttgaaacat tttccgttgt accggcgcgg ggaatttggg 3480 caggacaact cttctcgagc atgagatcga gctggtttcg gaaggaaaga tgagagatct 3540 tcctatgtat cgatactckc caaaaatttg ggagaagatc gaggaggaat tggagcgatg 3600 gaagagctta aacgtcatcg aagaatgctc ctcggagttt gctagtcctc tcgtccctgt 3660 gaagaaggcg aacggaaaga ttcgagtttg tctcgattcg agacgaatca actccgtcac 3720 cagaaaggac gcctatccga tgcgaaacat ggctgatata tttcaccggc tgcagaaagc 3780 taagtatttc agcatcgtgg atctcaagga tgcttatttc caaattcctt tgaaggaatc 3840 ttcgcggaac tttaccgctt tcagaacccc taagggactc ttccgcttca aggttgttcc 3900 tttcggtctg aagaacgctc cgttcactat gaacaggttg atgaacatgg caattggttt 3960 cgatttggag cctttcgtgt ttgtctacct ggacgatata attattgcca cggaaacgat 4020 cgaggagcat ttccgcctgc tgaaggaggt ggctctcaga cttaagaagg caggattgac 4080 tatctcggtc gaaaaaagtc gtttttgccg caagcaagtg aaatatttag gatacwtttt 4140 gacggaaaat ggtttgtcaa tcgacagcgc aaaattagag ccaattctga attatccgag 4200 gccaaaawcg atcagggacg tcaggagact gatgggcctc atgggcttct atcagaagtt 4260 catacccagg tacagtcact tgacagcgcc wataacggat ttattgaaaa aatccaagaa 4320 attcaagtgg actgacgaag ctgaactggc gctagaagag ttaaagtcag ttttaacgtc 4380 ggcaccagta ttggcaaacc cagactacac gaaacctttt attatagaga ctgacgcctc 4440 acaattagct gtcggagcgg cgttattgca gcaatttaac gatggaaaac gtataattgg 4500 atatttcagt aagaaattat ccagcactca aagaaagtat gcwgcgacag aaaaagagtg 4560 cctagcagta cttctggcta ttgaaaattt ccgccattat attgaaggaa ctactttcac 4620 tgttgtaaca gattgcaaga gtattacatg gctcttttca atcacggccg caaacgccaa 4680 ttcgaggctt cttcgatggg cacttaagtt gcaatcgtac gatttcacct tacaatatcg 4740 caagggaaag gataatgtac tggcagactg cctatcacga atagacagta ttcaagttat 4800 tgacgacgaa catcaaaaac taattgataa tatcaagaaa caacctcaag actacaaaaa 4860 ctttaaggtg atagatgata aaatttacaa attagtcgac gattcttcaa aattctcaga 4920 caaacgcttc gagtggagat attaccccac tttagcagaa agggaaccat tgattgagtc 4980 tatccacaat attgctcatc ttggctatga aaagactctc aatgcgctta aacagaagta 5040 tttctggcca aaaatggctt cggatgttaa gaagtactgt aaacaatgtc tagcgtgcaa 5100 aacttcgaaa gggattaacg tgaatccaac tccgcctatg ggctcacaaa agaaatcttg 5160 cgactatcca tggcaatttc tcacgcttga ctacgttggc ccttttcctc cctcaggcaa 5220 aggaaggagt acttgtctat tagtagtcac agatgtgttt actaaattta taatggttca 5280 accattcaaa caagccactg cgtcgtcgct cgttcaattt ctagaacagg ccatatttct 5340 gttgtttgga gtgccagaaa ttattttaac ggataatggt tcgcagttta cgtcgaagga 5400 attcgccaca ttactcaaac ggcatggggt aaaacactgg ctaacacctt cgtaccaccc 5460 tcaggtaaat aatactgaga gagtaaacaa agtagttaca acagccataa gagctacact 5520 taagggtaac cataagcaat ggactgaaaa cttgcaggaa attgcatgtg caatacgcaa 5580 ttcagtccat gattccacaa aatttacacc atactttctt acgtttggtc gtaacatgat 5640 ttccaatgga aaagagtacg aaaatataag aaatggaggt gcatcaccta atactacctt 5700 aagtgaagaa gaaagagaga attgtataaa aatgtaagga aaaacttagc agaagcctac 5760 caaaaacaag cgaaatacta taatttgcga tctaataaga aagctccgca atatcaggta 5820 ggagagaggg ttttgaaaaa gaatacmacc caatcsgaca aatcwaagga cttctgtgct 5880 aaattagcac ctaaatacat cgaagcttac gtcaaaagaa ctttaggaga cagttacgaa 5940 ctggtagata agaataacaa caatttagga atatttcacg ctaatttttt aaagaaattt 6000 tagagaaaaa gagttcaagc tatgacatga gagtaaaatc gaaatgtaca agaataacac 6060 aaaaaacatc aggaaaatcc accataaacg atggtaaaaa aggtatattc waagctagtg 6120 ttagtaaaat acaaaaaatc gcaaacctag ccctaaaagt tacttaaaaa ctagattaat 6180 tgcttaaaat tattagcwta ctactacatt ctcaccatag attgctgtaa atgtcsttat 6240 tcgttgaatc ctmtmgtagc cgtgcaaatt tttcatctcc gtcgttgatg tctgtagtct 6300 ggtcatcaaa ttsaacgttt atggtcctgt aattakaaka tatgtttast ttctaamaaa 6360 aataagtgtt tagtttcctt acttacctta aaactcaatt ttagctgtcc agtcgttgtt 6420 ttctttattt tcaagttctt ctcgktgtta ttttgcttcc aktaagactc cwtaggttcc 6480 gtccaaatag ttgaattwat ttgtttttcc tccatcgcat cccatagttc caatagccaa 6540 tagtttttag aaagacggaa gtccagtata aagtccaatg tccaatgaag tcaatagccc 6600 agtccaataa gaatcaatta tcgtccagta gttgtccaat tagaatccag tagcatccag 6660 ttcaacagaa tcctccatct tgtagttgtt cgttcagttc gtttccgttg agttacctgc 6720 gaagaaaaaa gaaacagaga agacattaat gccataatca acttacctaa ttaagcacta 6780 aaatatwgat actcccccgg tttaagctat ccaatatgtt ttttccgctt atttgccgta 6840 tttcactata attttccttg ctgtccgttg ttttcacttt gttgcgtcca ctctgaccga 6900 cggtttgacg tttgtcggga tggcatagtc tttttcacat gcagagcgac cgcgcggatg 6960 acagtgacag ttcgctaagc tgtcaaagta gttcatgcga tgttataggt ctgttcgata 7020 ttctcatttt cggttttggt tagaattcta tggtttttcg tcaagtttca aattgatggg 7080 tagaattcgt attgtggtcc tgtagggatt acgtttaggc aattttaaat gcagattgtg 7140 tcaaaatgag ttttgatgtt gcatttaatt gcagttcgtt gagtgatcaa gttattgacg 7200 gtaaagtttc cgttgtttga tcatggttta cgttacgttc agttcagttg agttcagcta 7260 gagagattat tgaatagaat ttgtacgttg atgaatgaat gaccttagtg tatgttactt 7320 tttcagttca gttaagtttt tgtggagttg caacgcttaa gttttttgta agttatgaaa 7380 atttgatcaa gttcttgatc aaattttcat aaaaatggat ggggtga 7427 // ID BEL-12_DPu-I repbase; DNA; INV; 7924 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_DP_; KW BEL-12_DPu-LTR; BEL-12_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-7924 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [6794-7342] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 1742..7762 FT /product="BEL-12_DPu-I_1p" FT /translation="MTTPTNEASRRAIKGHVTRWINNIQQYDNVQMDLTVH FT NLVLGAESNLRNMYNKYKRLSEGVARDMEQAGATQEQFEAEVDSQIQVEED FT VGDALIIVKRKREEFKEIQAAEERKRQEETLLLMFKTQQIAADAARAQEKA FT DQDAARAQEKADQDAARAQEKIDQDAARAQERAIRQQENLDQQNLFRQLIA FT AIPAAAAPGAPAAPAAVASTKLPKRQIKPFKGDVLEWTAFWEGYNAAVHES FT AIPAVQKFGYLKDYLKGEAQLCVENLELTDANYTVAVTLLKAMYGKPDVLI FT EAHTHKLDTLQPVRDVADTAALRCFQLTIQSHINALETLGVARTSHGCLLG FT SRILRSIPLKLQAEWAKSATNKVTDIDQVLKFIEEQVEAAERLSRLRATTP FT KPAQNSQQPAKXTPPTTPTASQLGVSSKPTPQSKHPSKTRRNGSPPPRREA FT TASSPRKPMLPCVFCKEMHWATNCPMELKEKKAVITSERRCSNCFGQHETT FT VCFNPHRCQRCRAKHHTSLCAEKDTRFGSSTIIPSKPAAGSSSTTACASSF FT GEVILKTATVYIXGPNGKQIRAILFLDDGSHRTWIKKQISKELQLKIIQVE FT QIATRAFRQTEAPPAETHNVVEMTVRGTWPGAPTVRIEALEATDKVGSTGP FT YQTTEFARKLWLENENLADDRFEREGGDEDVGILVGMDQMFNIMFNEPATT FT SPCGLRAYTSKLGKVIGGPSQEKXSKQSQTIVSQLLINSNRSLPQITSQPS FT KFFNSIQPDQSESRETGNSSLLPLPENSEQICFFDTTKENKRFEMEKLNFD FT LSLFWRLENFANLNDCVAVEKDDQFRSFCEEITRFEDGRYCTPIPWTTDRW FT RLEINHQMAAARLRSMLXKLRKSPVDLANYTKEIDQLIANGFVEEADFNYD FT GLHTYLPHHPVYRTDKATTKIRPVFDGAARSKYGPSLNDVLETGPNLNPDL FT LSVLMRFRMNRIAWIADIEKAFLNIALQPEDAEAVRFLWPREPENPGSDFI FT AYKWKXVPFGLSSSPFLLRVTINKHLLSVKPRFPKTVEQIEEQLYVDDYLG FT GADNVPTAXTTVGETVTLFSEAQLNMRSWATNNKQLRDFLTEKEMSNQIVG FT ILSPTIDGQQKALGLRWDTSSDSFKFDPTSIMEAAVEIGEKITKRKILSIS FT ARIFDPIGFLAPTVLLLKIIYQKLWEGDIGWDDDATPEVKNTWSSIMKGLK FT DLHNLEIPRWIGYSKPSVVSAEIHVFGDASEAAYGAVAYARLQLENGTPYT FT ILLASKTRVAPLPKKKVTLPRLELLSSLLAIRLGEKIRTSLHTESWKTTYW FT TDSLVTLGWIRGDPYRWRPFVRNQVETIRKFSNAEWWRHCPGLENPADLAS FT RGAPAHALVESKLWWHGPAWLTEGESEWPNSPENHSTETQEKIEEEETKKT FT ATVSFAAVETAQPIEWHLEKISTWTKLLFRTAWILRALNRMKKRTRDPGLE FT LKEIIIVAGKEITIDKIVTEELNEAELAICRQLQVERYPKAFRTLQLGLQI FT HPKEKIASLRPVWDNRDRLIRVTGRVALALRDRDIQPPILLPANHPVVTML FT ITNKHVFNSHTGVKTTLSELKEKYWIVKGRQQVRNVWXACVKCQKLTSPPF FT RQVAAPLPANRLRDARAFEITGTDFAGPLYYKNATPKRKSKSAPSVEQVIP FT EPPPEEENPDAAELPTEEAPEEEDDGDQPDPAQQVNKKQPKSYVCIFTCAV FT TRAIHLELTKDMTARSFLLAFRRFSARRGPVSVMYSDNAQTFRCVSRYLKN FT IRSDPSVHDLLAMRRTSWIFSVSLAPWWGGFWERMVRITKDLLRRSNGRAC FT LAYDELEVSLIETESVINARPLNYIGEGADDPLPITPNQFLNNRRSTCATP FT EPAVNLLAPDSTSELLKKMDQNRRDYVSDLCSRFVSDYLMQLDNFHAKGAS FT GRKIRVGEVVVIHDKLSKRLMWETGVVKELLPSCDGLVRSAIVKFPNGKFI FT NPSHSMFISRRTERRSTGRCGDRRRCIESGAS" XX SQ Sequence 7924 BP; 2440 A; 2110 C; 1837 G; 1514 T; 23 other; tggtccttcg aaacctnatc tttcgaacca aattttgtga aactgactga acgagaacgc 60 ttcccagcaa agcgttcggg tctctttggc aacaaaagac ccgctgactc tctctcatct 120 acaaactgtg cttccctatc aattcaaact gtccctttct agtaaaattt caatcaattt 180 agttaagtgt gtatgtgtgt gtgaattccg acatttcgtc tagccgtcat cgccgttttt 240 ttgggaaacc gcgtggcggc ctgaggcgtc agaaaatctg agcgcctcac acacacacac 300 acacacactc gactgtaaaa tctttctcat tttcttaaaa gtaaaagcat ttcactcttc 360 ttttttgatt tttcttagat cgtaaaagca ttacatcttc tctctctctc cctaatatca 420 cttaaaatta aattaaaaag tagacaatta attcttcacc agctcttctt gcgttgataa 480 ttttttccct ctctttttcc cgcattcatt tcccacaatt taaccttttc tttctctcaa 540 cgctccactt tagcggagct ccaaaagcgg gctccttttc ctcatcaaaa ttaggcccga 600 acaaaaacga aagcggaacc agctccgcct ggaaggaaag attcccccat ggacgggccc 660 gatccgngag ccactccgaa aaaacacgag agaaggtcga cgcgaggagt accggaatcg 720 gccgatcgtc ccgcgcggac ccgtcgagcc tttcctcggc ccggtcatcc cggctccatc 780 accggaacaa gaggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaagaagc agagtatcac 840 cggccaatca tcggcacctc ccgtccaaga ctncaggatc accagggggc tgtttcacgg 900 ccccaaacag cgtcaancga tcgcgccagc ccaggcagcc gccgaagtgg aagaagaggg 960 agccgtcggc ggaacagaga aaaagaagaa ggagcggtgg agggagacca gcgaggagga 1020 cgacactctn cagatgctct gcgagagcct gtcaatcgaa gagacgacnc aagcccccat 1080 cccgccaaca ccaatacaaa cagagccaac acggcaaggc cgntcaagag ggagggggct 1140 gctgatgttc tcgcgagaga gtctcgaaga agccttcaag gagagcccca gggagcgatc 1200 caaatccagg atcgcctgga tcagaagact ggcggagcaa taagcagaaa taattttccc 1260 atcactgtat tctagggatg tcgaatcgtc gataaataca ggatgcgcct ctcttcaatt 1320 atctcactct aacttttcaa tctcaaaaaa aaaaaaacat tcttacatca aacgggagac 1380 aagccaaggc tgacctccat aacaaaaact ttttcttttc ctctcaacaa acaggaacgc 1440 agcattccaa aggtgattag gtgaacgagg gtcgagccaa aacttagccg acccgccgct 1500 cgtcacgcca tctactgaga ggaaggagag acgaaaaact ttcactctcc aacaacaccg 1560 gcagaggcgc tgcagtcaaa cttgaagcag ccactctcna acaaacaccg caagaggcgc 1620 tgcactcaga acttgaagca gacacctcaa gcaaacaaac aacaaacaaa caggctcgac 1680 aattaataat taaacaaaaa caaactaatc taaatctttt tctcttccaa aaacaatcaa 1740 gatgactact ccaacgaacg aagcgtctcg gagagccatc aagggacatg tcacgcgctg 1800 gattaataac atccagcagt acgacaacgt gcagatggac ctcacagtcc acaacttagt 1860 nctgggagct gagtccaacc tacgaaacat gtacaacaaa tacaagcgac tctcggaggg 1920 ggtcgccaga gacatggagc aagccggagc gacccaagaa caattcgagg cggaagtcga 1980 cagccaaatc caggttgaag aagatgtcgg tgacgcactc atcatcgtga aacgaaagag 2040 agaagaattc aaagaaatcc aagcagccga agaaaggaag cggcaagaag agacgctatt 2100 gctcatgttc aaaacgcaac aaatagcagc ggatgccgcc agggcccaag aaaaagccga 2160 tcaagacgcn gccagggccc aggaaaaggc cgatcaagac gctgccaggg cccaagaaaa 2220 aatcgatcaa gacgctgcaa gagcacagga gagagcaatc cgtcaacaag aaaatctaga 2280 tcaacagaac ttattccgac aactgattgc tgccattccg gcagcagctg caccaggagc 2340 gccggcggct ccagccgcag ttgcgtcgac taaactgcca aaacgtcaaa tcaagccgtt 2400 caaaggggac gtgctcgaat ggacggcctt ctgggaaggt tacaacgcag ccgtccacga 2460 atcggccatt ccggcggttc aaaaattcgg atacctnaaa gactacctga aaggcgaagc 2520 acagctgtgc gtggagaacc tggagctgac ggacgccaac tacaccgtcg cagtgaccct 2580 cctgaaggcg atgtacggca aaccggacgt cctgatcgag gcccacaccc acaaattgga 2640 cacgctgcaa ccagtgagag acgtagctga taccgcggcg ctcaggtgct tccagctgac 2700 tatccagtca catatcaatg cgttggagac actgggagtc gcaagaacga gccacggatg 2760 tctccttgga tcaagaattt tacgctcgat cccactcaaa cttcaagcag agtgggccaa 2820 atcagcaacg aacaaggtaa cagacattga tcaagtatta aaatttatcg aagagcaagt 2880 tgaagccgca gagcgactca gccgtctgag agctacaaca ccaaagccgg cccaaaattc 2940 tcaacagccg gccaaancaa caccacctac cacaccaaca gcttctcagc tcggagtcag 3000 cagcaagccg actccacaat caaaacatcc aagcaaaaca agaagaaatg gcagtccacc 3060 accaaggaga gaagcgacag catcatctcc aagaaagcca atgctgccat gcgtgttctg 3120 caaggagatg cattgggcta cgaattgccc aatggagctg aaagaaaaaa aggcagtcat 3180 caccagcgaa agaagatgct ctaactgttt tggtcaacac gagaccactg tttgttttaa 3240 tccacacagg tgccaacgat gccgagccaa gcaccacaca tccttgtgtg ccgagaaaga 3300 taccagattc ggatcctcaa ctataattcc aagcaaaccg gccgctggga gcagctcaac 3360 aacagcgtgc gcaagctcct ttggtgaagt tattctaaaa acagcaaccg tttacattnc 3420 aggcccaaat ggaaaacaga tccgagcgat tctcttcctc gacgatggca gccatagaac 3480 atggattaag aagcagatct cgaaagaact tcaactgaag ataatccagg tggagcagat 3540 cgcgaccagg gccttccgcc agacagaggc ccctccagca gaaactcaca acgtggtcga 3600 gatgacggtg cggggcacct ggccaggagc cccaacggtg cgtattgaag ctttggaggc 3660 aaccgacaaa gtnggaagca ccggcccgta ccaaactaca gaatttgcaa gaaagctgtg 3720 gctggagaat gaaaatttgg cagacgaccg cttcgaacgt gaaggaggag acgaggacgt 3780 cggaattctt gtcggaatgg accaaatgtt caacatcatg ttcaacgagc cagcaactac 3840 cagcccgtgc ggactcagag cctacacttc caaacttgga aaagtcatcg gcggtccatc 3900 gcaagaaaaa ncatcgaaac aaagtcaaac aattgtcagt caactactaa tcaattcaaa 3960 tcgctctcta ccacagatca cctcacagcc atcgaaattt ttcaacagca tccaaccaga 4020 ccagtcggaa agcagggaga caggaaattc cagtctcctt ccactaccag aaaattctga 4080 acagatctgc tttttcgaca cgacaaaaga aaacaaaagg tttgaaatgg agaagttgaa 4140 ttttgatctc tcgttatttt ggcgtctaga gaatttcgcn aatctgaacg actgtgttgc 4200 tgtggagaag gacgatcaat tccgctcctt ctgcgaagaa atcacgcgat tcgaagacgg 4260 caggtattgc acgccaattc catggacgac tgacagatgg agactggaaa tcaaccacca 4320 gatggcagcn gcaagactga gaagcatgct gnccaagcta cgcaaatctc cagtcgactt 4380 ggccaactac accaaagaaa tcgatcaact gatagcaaac ggattcgtag aagaggctga 4440 tttcaattac gatggcctcc acacttatct accgcatcat ccagtctacc ggacggacaa 4500 ggccaccaca aaaatccgcc cagttttcga cggcgccgca agatccaaat atgggccgag 4560 cctcaacgac gttttggaga ccggaccgaa tctnaatcct gatctcctct cggtcttaat 4620 gcgcttccgg atgaaccgga tcgcctggat tgcggatatt gaaaaagcnt ttctcaacat 4680 cgcgctgcag cccgaagacg ccgaagcggt cagattttta tggccgaggg agccggaaaa 4740 tcccggctcc gatttcatcg cgtacaaatg gaaangagtt ccattcgggc taagttcaag 4800 tccattcctt ttaagagtca ccattaataa gcatttgctc tcagtcaaac ctagatttcc 4860 gaagacagtc gaacaaatag aagagcaact ttacgtggac gactacctcg gaggagcaga 4920 caacgtgcca acagccatna ccacggtggg agagaccgtc acgttatttt cagaggccca 4980 actcaacatg aggagctggg caacaaacaa caagcagctc cgtgactttc tcaccgaaaa 5040 agaaatgtcg aaccagattg tcggaatcct ttcgcccaca atagacggcc agcaaaaggc 5100 actgggtctt agatgggaca cgagctccga ttcatttaaa tttgacccta catctataat 5160 ggaagcagcc gtggagatag gagaaaaaat caccaagaga aaaattctga gcatctcagc 5220 aaggatattc gacccaattg gtttcctagc tcccacagtt ttacttttaa aaataattta 5280 tcagaaactt tgggaaggag acataggttg ggacgatgac gccacacccg aagtgaagaa 5340 cacctggagc agcatcatga aaggactgaa ggatctacat aatttggaga tcccacgatg 5400 gatcggatat tcaaaacctt ccgtcgtctc cgcagaaatt cacgtgttcg gcgatgcatc 5460 agaagcagcc tacggagcag tagcctacgc acggctccaa ctggagaacg gaactcctta 5520 caccatcctc ctcgccagca aaacgagagt ggcgcctctt ccaaaaaaga aagtaacgtt 5580 acccagactc gaactactaa gctctttgtt agcaattcgt ttaggagaga aaattcgaac 5640 ctcccttcac accgagtcgt ggaaaacgac ttactggaca gactccctcg tcacacttgg 5700 atggataaga ggagatccct atcgctggag accgttcgtg cgaaaccaag tggaaaccat 5760 cagaaaattt tccaacgcgg agtggtggag acattgcccc ggattggaga atccagccga 5820 tctcgcttcg cggggagcgc cagcgcatgc gctggtggaa tcgaaacttt ggtggcacgg 5880 accagcctgg ctgacagaag gagaaagcga atggccgaat tctccagaaa accactccac 5940 cgaaactcaa gagaaaatcg aggaagagga gacgaagaaa acggcgaccg tctcctttgc 6000 agcggtcgaa accgcccagc caattgaatg gcacctcgaa aaaatctcta catggaccaa 6060 actacttttc agaacagcct ggatcctcag agcgctcaac cgtatgaaga aaagaacgcg 6120 cgacccagga ctcgaactaa aagaaatcat catagtcgcc ggaaaagaaa taacaatcga 6180 caagatagtc acagaagagc tgaacgaagc ggagctagcc atctgcagac agctccaagt 6240 agaacgctac cccaaagcct ttcggacact ccagctcgga ctgcagattc acccaaagga 6300 gaaaatcgca tcgctccgac cggtctggga caatcgagat cgactcatcc gagtcactgg 6360 aagagtggcg ctcgccctca gggatcgaga cattcaacca ccgatccttc tcccagcaaa 6420 ccatccggtc gtaactatgt tgataaccaa caaacatgta tttaattcac acacaggagt 6480 aaaaacgaca ctgtcagaac taaaagagaa atattggatc gtgaaaggac gccaacaagt 6540 gagaaacgtt tggtncgcct gtgtgaagtg ccagaagctg acttcacctc ctttccgaca 6600 agtagcagcg ccgctaccag ccaaccgact ccgcgatgcc agggcattcg aaataaccgg 6660 aaccgatttt gctggcccgc tgtactacaa aaacgcgact ccgaaaagga aatccaagtc 6720 ggcgccatcc gtcgaacaag tcattccgga acctccacca gaagaagaga acccggatgc 6780 agcggagctt cccaccgaag aagctcccga agaagaagac gatggcgacc aaccggatcc 6840 agcgcaacaa gtcaacaaaa agcaaccgaa aagttacgtg tgcatcttta cttgtgccgt 6900 gacgagagcg attcatctgg agttaaccaa ggacatgacg gctcgctcct ttctgttagc 6960 tttccgtcga ttctccgcca gaagaggccc agtgtcagtg atgtacagtg acaacgccca 7020 aactttccgc tgtgtgtctc gatatcttaa aaacattcga tccgacccat ccgttcacga 7080 cctcctcgcc atgagaagaa ccagctggat cttctccgtt agcctcgctc catggtgggg 7140 cggattctgg gagaggatgg tgaggatcac gaaagatctc ttgagacgat ccaacgggcg 7200 cgcatgtctc gcgtacgacg agctggaagt gagtttaatc gaaacggaaa gcgtgattaa 7260 cgcgagaccg ttgaactaca ttggagaagg agcggacgat ccgctcccga tcactccgaa 7320 tcagtttctc aacaatcgtc gttcaacttg tgctacgccg gagccagcag tcaatctatt 7380 ggctcccgat tcaacaagtg aactactcaa gaaaatggat cagaacaggc gtgactacgt 7440 cagcgacttg tgctcgcggt tcgtctccga ttacctgatg cagctggata attttcacgc 7500 aaaaggagcg tcgggcagaa aaatccgcgt cggagaagtc gtcgtaatcc acgacaaact 7560 ttccaaacgg ctcatgtggg agacgggagt agttaaggaa ttactcccca gctgcgacgg 7620 cctagtccga tccgcgatcg tcaagttccc aaacggtaaa tttattaacc cgagccattc 7680 aatgtttata tcccgtagaa ctgagagaag atcaaccgga agatgtggag atcgcagacg 7740 gtgcattgaa tccggagcca gctgaagcaa ctgctccgca atttccaaat ccacccgatc 7800 cggctccact cccattgctc ccgatcgatc cagctgacgt cgaagaagat gatcatgtgg 7860 ccaatccggc cgcaactgac gtcggcgagg tcgaggccca aggcatgggc tctggtgggg 7920 agta 7924 // ID DNA8-28_AP repbase; DNA; INV; 1034 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-28_AP. XX NM DNA8-28_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1034 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1770-1770 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1034 BP; 292 A; 165 C; 214 G; 362 T; 1 other; cagggccgta tttaaggggg ggcaacgggg tacatttgcc ccgggcggcc aatttttgac 60 acatttaggg gggcggccag gcgccggtgt atattatatg aataacattt tttttggtaa 120 gaatttgctt gagtacctat acttggattt tttgcattgc atgactatga tacggccatt 180 tggcgtcatt tactcgtttt gtcaatcaat gataactact aatttttgta atttttttta 240 ggctaaaatg cactttacca cgccccttac gaatttcaaa atttaaaaga cgtgtttttc 300 aacgctgatt tcgaaactgt ttaaattttt ttgcggaaac cattattttg gaagttatag 360 ccatccaaag tttgcgattt tacgtttata cgcatgcgcg caacactact ttaatgctgc 420 gcggaggaac tcgtgtataa tatacgatag taaatatata gtataatata cttatgttac 480 aggtatggct tttcaacaag attgtttaat tctgaaggct acccaaggtg gttttacaca 540 gnaaatcggg ggatgttttc gatttttttg ttcgagcgca gtacattatg tgctttttaa 600 caaccttgtt taattccgag gcctgcccaa ggtggtttta cacaacaaat cgaggcatat 660 ttttgttttt tttgttaaaa atcgtaaaaa aaattttatg gtgctgcgcg tggcagtcaa 720 gtagcgttgc acgcatgcgc ataaacgtaa aatcgcaaac tttggatggc tataacttcc 780 aaaattttta aattcgtgag gggcatggta aagtgcattt gttccttttt ttatagtgta 840 caattttatg ttgttatacg ttttgacagt aattaaaata atgcatcatc gtggtctgtt 900 tttaacaaaa tgttcataaa taaataacta taaccaaacg attaaaaaac ggattaatta 960 tttttttttt ttgggggggg gggggcgtac aaatattgtt gacccagggc gcaaaaattg 1020 taaatacggc cctg 1034 // ID Copia21-NVi_I repbase; DNA; INV; 2798 BP. XX AC DS265683; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 02-JAN-2008 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia21-NV; KW Copia21-NVi_LTR; internal portion; Copia21-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2798 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1211-1211 (2007). XX DR Genome; DS265683; Positions 397815 400612. XX CC Positions [36-545] - Integrase core CC 'AATAT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(3..1238,1242..2702) FT /product="Copia21-NVi_I_1p" FT /translation="MENLPFKNNRSGSDRPLHTIHVDTMGKISPPSFPGEN FT NFIIVFIDDFTRYARAYSVKHKNESGKCLENFLKHMRNLIGKNEKVCYIRG FT DNGTEFTGGEFAKIMEREGISNNFVLPYTPELNGTAERFNKTIQMKIRALM FT IESGLPTTMWVLAVEAAVHTYNRTPHKGIDFKTPLQMINPNRKSHIEELKR FT FGSLAYIKVSIPERKFSERAIKSILVGYIPTGYLLWQPQTQRLLNSRHVRF FT NEKVVYKDISGLSKQAEIQIGDCKEHKTTNDSEKQTVHPSENTETANIDKP FT KRGRTMKIKATEVKQKTRENPKRKAKEQPLRDPNFVYCIHEENADELKKKS FT EDEICYVRLAELNEDPTSYREAVNSEQGEKWKEAIREELKAMNDNYVWEIV FT DKPENISEKRRMNIIDSRIFKRKCNNGEEKFKARLVIRGFKNKNQYELSET FT YTPVSRLSVIRSALVIVNKHNLDVHQLDVKTAFLNGILEDEVFMEIPEGLD FT LNREKKMCKLRKTLYGLKISPKRWNLRFSEEANKLGLERDINGPCIFTWRK FT EGKMVLLVLYVDDILLAGNNSEKIHEVKTKLCEVFEMKDLGEPKLFLGMKI FT QRNKEKKIMILSNPEYTKKILERFNMNGSKPQSTPMVTRQVKNRESEQHEK FT LRDQETPCKARYREAIGSLLYLAGTTRPDIAYAVIFLARKQTALSESDWKD FT VKRIFRYLKGTTELGLTNRESGEKLEAFTDASFRDCEESKSTGGYVIQLYR FT NTIAWRSYKQSYTSLSTCQAEYLAMSEACQELISLDKAIRDITGKTNYPVT FT VWCDNASAGKCTEMDGVHKLKSFDDDVENIQRKLKERESTGTKSHIAETHG FT DFIKSCVMENKVMVKWIGTKENQADIMTKPLPASAHIELRDKILRI" XX SQ Sequence 2798 BP; 1084 A; 486 C; 612 G; 616 T; 0 other; agatggagaa tctcccgttc aaaaacaaca gaagtgggtc agataggcca cttcacacaa 60 tacacgtaga cacaatggga aagatatctc caccttcgtt tccaggcgaa aacaatttta 120 ttattgtctt catagatgat ttcacaagat atgcgagagc atacagtgtg aagcacaaaa 180 atgaatctgg taaatgtcta gagaatttcc taaaacatat gaggaatctg ataggcaaaa 240 acgaaaaggt gtgctacatc agaggagata atggaacaga gttcacagga ggtgaatttg 300 caaaaattat ggagagagaa ggaattagca acaacttcgt tctaccatat actccagaat 360 taaatggtac tgctgagaga ttcaataaaa ccattcagat gaaaatccga gctttaatga 420 ttgaatctgg attaccgact accatgtggg tactagcagt agaggctgca gtacacacat 480 acaatcggac accacacaag ggaattgatt tcaaaacccc attacagatg attaatccga 540 accggaagag tcacattgag gagttaaaaa gatttggaag cctagcatac atcaaagtat 600 ctattcccga gagaaagttt tctgagagag ccattaaaag tattctagtg ggttatattc 660 caaccggata cttattatgg caaccacaaa cgcaaaggct gctaaactct aggcatgtga 720 ggttcaatga gaaggtggtg tacaaagata tttctggatt gtcgaaacaa gcagaaattc 780 aaataggaga ttgcaaagaa cacaaaacaa caaatgattc tgagaagcaa acagtgcatc 840 catcagaaaa tactgagact gcaaacattg ataaacccaa acgaggtaga acaatgaaaa 900 taaaagctac agaggtaaag caaaagactc gagaaaaccc aaagagaaaa gctaaagaac 960 agccattgag agatccgaat ttcgtctact gcatacatga agagaatgca gacgagctca 1020 aaaagaaatc ggaggatgaa atctgttatg tacgtttagc tgagttgaac gaggacccaa 1080 ctagctacag agaagcagtg aattcagagc aaggagaaaa atggaaagaa gccatcagag 1140 aagaactgaa agcaatgaat gacaactacg tttgggaaat agtggacaag ccagaaaaca 1200 tctctgaaaa gagaagaatg aacataatag actccagata gatttttaag agaaaatgca 1260 acaacggaga agagaaattc aaagcccgac tggtaatcag aggatttaaa aacaaaaatc 1320 agtatgaact aagtgaaacc tatactccag tctccagact gtcagtcatc aggtcagctc 1380 tagtgatcgt caataaacat aatctagacg tacatcaact tgatgtaaaa accgccttct 1440 tgaatggaat actagaggac gaggtgttca tggaaattcc agaaggtctc gatctcaaca 1500 gagaaaagaa aatgtgtaaa cttagaaaga ctctctatgg actaaaaatt agtcctaaga 1560 ggtggaatct aagattttca gaagaagcca acaaactagg gttggaaaga gatatcaacg 1620 gcccttgtat atttacctgg agaaaggaag gtaaaatggt gttattagta ctgtatgtag 1680 acgatattct actagcagga aacaattctg agaaaattca tgaagtgaaa acaaaactat 1740 gtgaagtttt tgaaatgaag gacctaggtg aacccaaatt gttcctagga atgaaaatac 1800 agagaaacaa agaaaagaaa attatgatac tgagtaatcc tgagtacact aagaaaatcc 1860 ttgagagatt taatatgaac ggtagcaaac cacagagcac tccaatggtc acgagacaag 1920 tgaaaaatag agaatcggag caacacgaga aattgcgcga ccaagagaca ccgtgcaaag 1980 cccggtatag agaagctata ggcagtttac tgtacttggc tggaacgacc agaccagaca 2040 tagcttacgc tgtcatcttt ctagcgagaa agcaaaccgc actttctgag agtgattgga 2100 aagatgtaaa gagaattttt cgctatctaa agggtactac ggagcttgga cttactaaca 2160 gagaatccgg agaaaaactc gaagctttta ctgacgcaag cttccgagat tgtgaagagt 2220 caaaatcaac tggaggatac gtaatacaac tctaccgaaa cacaatagct tggagaagct 2280 acaagcagag ttacacgtcc ttgtctactt gtcaagccga atatcttgct atgagtgaag 2340 cctgccaaga actgatttca ctggacaaag ccataagaga tataactggc aagacaaact 2400 atcctgtgac tgtttggtgc gacaatgcat cagcaggaaa atgtaccgag atggacggag 2460 tacataaact aaagagcttt gacgatgatg tagagaacat tcaaagaaaa ttaaaagaaa 2520 gagaaagcac aggcactaaa agccacattg cagagactca tggagatttt atcaaaagct 2580 gcgtgatgga gaacaaggtg atggtaaaat ggattggtac gaaagagaat caagccgaca 2640 ttatgactaa acctctacct gcttctgcgc acatagaatt aagagataaa attctgagaa 2700 tttaagagaa aagacttaag tacaaaatca attccttttt caattgtctg ttcacaggat 2760 cccatgatta aacgtagtgg aggcacgtcg ggagggag 2798 // ID SMAR9 repbase; DNA; INV; 2474 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR9. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2474 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1086-1086 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 506..2182 FT /product="SMAR9_1p" FT /translation="MSTKRKGENKDTSAKRKWLTLEQKLNIIKLHDDGASF FT AKISRDKGINESSIRLIIKNKDQHKSQGIATASYLSKSLVSKHRTSRMINM FT ERLLGIWIEDLNQKRIPLSQMEIQAKALSLFEDLKDEDSGEINETFTASHG FT WFFRFKKRCGIHNVRIVGESASADKEAALNYPIELKKIVEEGGYQDEQIFN FT VDETGLFWKKLPTHTFIAEKEKACPGYKISKDRLTLLLGGNAAGDFKLKPM FT LVYRAENPRALKGFSKNTLPVLWRANKKAWVTRTLFEDWFQNYFCPATERY FT CKENGLDYKILLILDNAPGHPTGLSDLNEHIKIIFLPPNTTSLIQPMDQGV FT IATFKAYYLRRTFSQAIKATSGENAPTLTEFWKSYNIRNVIENIGEAWHEI FT TASNLRAVWKHILPHCANDFVGFETHFAEVTHDIVEIGRDLGFSEIDSANV FT IECINSNTNELDNETLLNMEEQRAFEERDSDLDNAVAVDTPTKELSTEELS FT KIINMADKLSEHIMEVDPNFERSIHVRRNIGSLIRCYRELFEEKKPKKISK FT QTSILQFLETKK" XX SQ Sequence 2474 BP; 931 A; 367 C; 418 G; 758 T; 0 other; tacagcaata cctcgcttag tgcggtttca ctatagtgcg ttttttttgg gagggaaaaa 60 attaaaattt ataatttttt atcaataaaa tgcgcaaaaa cgtatcaaca agccatttaa 120 aatataaatt tcttaataat acaaataggc ttactattca cgaaacataa taaaatcaca 180 aaaagtgtaa gtttataata ttagaaaatg taattaagaa taaataaaat attaattgga 240 tactgtacat aatacgcaat agataaggct tattcgcaat ttgcagagtg aaactccgat 300 taaaatatat ttacttaaat tactgtttag taattaatat tatgcgaaaa atgtatgtta 360 caataaagcc atgtaaaatt tataaaatta taattttata tcaataaaaa aaatttttat 420 atcaataaat tttatatcaa ccagctctag cctaattaag ttgaattaat taaatttttg 480 taattatcaa atttttattg taaccatgag cacgaaaaga aaaggggaaa ataaagatac 540 gtctgctaag cgcaagtggt tgacattaga acaaaagtta aacataatta agcttcatga 600 tgacggtgca tctttcgcaa aaattagcag agataaaggt atcaacgaat catcaattcg 660 gctaataatt aaaaataaag atcagcataa atcccaagga attgctacag catcttattt 720 atcaaaatca cttgtctcta aacacagaac ctctcgaatg attaatatgg aacgtttatt 780 aggcatttgg attgaagatt taaatcaaaa gagaattcca ctaagtcaaa tggaaattca 840 agcaaaagct ttaagtcttt ttgaggactt aaaagacgaa gatagcggcg aaattaatga 900 aacttttact gcaagtcatg gctggttttt tcgtttcaaa aaaaggtgtg gaattcacaa 960 tgtacgcatt gttggtgaaa gtgcaagtgc cgataaggaa gcagctttaa attatcctat 1020 agaattaaaa aaaattgttg aagaaggtgg ctatcaagat gaacaaatat ttaatgtgga 1080 cgaaacagga ttgttttgga aaaaattacc gacacacaca tttattgccg agaaagaaaa 1140 ggcatgtcct ggatataaaa tttcaaaaga ccgtctaaca ctattattgg gtggaaatgc 1200 tgcaggggat ttcaaactaa agcctatgct agtttatcgc gctgaaaatc ccagagcact 1260 taaaggtttc tctaaaaata cgcttcctgt attatggcgt gcaaataaaa aagcatgggt 1320 gacaagaact ttatttgaag attggtttca aaattatttt tgtcctgcaa cagagcgtta 1380 ttgcaaagaa aatggcttag attataaaat tttattgatt ttagataacg caccaggtca 1440 tccaactggg ctctctgatt taaatgaaca tataaaaata atttttcttc ccccaaatac 1500 tacctcatta attcaaccaa tggatcaagg tgtcattgca acattcaaag cctattattt 1560 acgccgaaca ttttcgcaag ctataaaagc aacgtcaggg gaaaatgcac caactttaac 1620 tgaattttgg aagagttaca acatcagaaa tgttattgaa aatatcggcg aagcttggca 1680 cgaaattact gccagcaatc tgagggcagt ttggaaacat attttgccgc attgcgcaaa 1740 tgatttcgta ggctttgaga ctcattttgc tgaggtcact catgacatag tagaaattgg 1800 aagagaccta ggttttagtg aaattgattc tgccaatgtg atagagtgca tcaattcaaa 1860 tacgaatgaa ttggacaatg aaactcttct taatatggaa gaacaacgag cattcgaaga 1920 aagggacagt gacttagata acgcagttgc agttgatacg ccaacaaagg aattatcaac 1980 tgaagaactt tcaaaaatta ttaacatggc cgataagctt tcagaacata tcatggaagt 2040 tgatcctaat tttgaacgca gcattcatgt tcgccgaaac attggcagtt taattcgttg 2100 ctacagagag ttgtttgaag aaaaaaaacc aaagaagata agtaaacaaa catcgatcct 2160 acaatttttg gaaactaaaa aataaaatat ttttttagct cgaatttgcg agtgcttctg 2220 tactgtaacg cataataaat acttttaaat tatttatttg gttttatttc attcatatcg 2280 taatattatt aattcgtgac attttttatt tattttgaac ttgaaaacat tgataatata 2340 tcgatatgta ttaaaaaaaa atttggaacc aattacaatt tttcccatag actacatagt 2400 tcaattagtg cggtttcact tagtgcggca tatcggcgga accaattaac cgcactaagc 2460 gaggtattgc cgta 2474 // ID Vingi-4_HR repbase; DNA; INV; 3544 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-4_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3544 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 167..3514 FT /product="Vingi-4_HR_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="MLLLRAGIESNPGPKIWICEICNKRVNKTHYSVRCRT FT CCKWTHLNNCATNLGSYYLCNNCPHPTTIPSSPVRGAQPNRPIIRNKTSAT FT PNINPISHSDSKLNILQFNCNGLKNKSAEIAHHLSIHNISIAALQETKLST FT RSAVPVFKNYAIYRKDRKSGRGGGLMMLIHHKIPYTIKQLPDTDTTESQGI FT TIMADGTEINIVNVYIPPQSACPPHFCASIADLLEFPNIILLGDLNAHDGL FT WYSGISDARGETLAAEIDDSDCCTLNLDCPTRLPSNGQPTSPDLSIASISL FT TNCLNWSAVTTLSSDHLPIHISLDLRITPTKNPKVTFINLSKANWPGFTAE FT SELAFQNTLFPNTVTAAERAFRRILGNAGDHNIPSGRLINSQPGLSKEVTE FT LIRRRDALRLSNPSSDNISTLSNDIHNKINSIKTTKWKDFLSTFNRHTNPK FT RLWSVIKSLNGRPKLNPNNAINFNGKPISNGLDMVNLFNKSFTCFSQHKTN FT KLNRSINREAKRNCLSDHPTFTVDQVSAAIKTAKASKASGPDNITMLHLKN FT LGPLGLAFLTRIFNESLRSCCIPDIYKKARIIPLLKPGKSDEDAKSYRPIS FT LLCPAVKILERCLLPFISDHLSPASHQHGFRSGHSTTTAVSLLSMDVANGF FT NQKRPAHRTTLVALDLSKAFDMVNHRILISDLLKSSLPSSISRWLSNYLHG FT RSAVTEFRNQSSKQRNIHTGVPQGSVLSPALFNYYVSRIPEPPSDIKVISY FT ADDFSVYTTGPDVDILTQRLNNYLPLLINFFSLRDLEISTSKSSITLFTPH FT TAQYHYHPQVKVNGQVLPLEHNPKILGLTFDPMLTFTQHHKLSASKATKRN FT NVLKAISGTTYGQDKETLLQVYKAIGRSTKEYACPAWSSLSSESNLCRLQR FT AQNAALRIVTGCHKIASEKHLLSECQLLSIKQHTDLLSKQFITKCFDPQHP FT CHTLLSAELPPRVMKNTLMSKWGATVRSILHTNSDSASINMAIRNLHGMAA FT KAAAEAYHAPILLNGWPPPQPQIDSSEESLPRAIRRTLAQLRSGHSIILGS FT FREIIDGGTGNMCPHCHTHVDDVAHLFSCPANPTNLKTIDLWQNPTAVADW FT LRLRLTNG" XX SQ Sequence 3544 BP; 971 A; 1041 C; 670 G; 862 T; 0 other; aggttctgga gcaacggcct tcttgtggtg gaaccgattg gcgtcgtggt caggatcgct 60 cctctcctcc tcgtcaacaa accgttgcag caacccctcc ccaacacgac ttcaccgttc 120 tgggtttttg cctggggagc agtcctcgcc ggccggcctg ctccgtatgc tattactgcg 180 ggccggcatc gaatccaacc caggtccgaa gatatggatc tgcgagatat gtaacaaacg 240 tgtcaacaaa acccactact ctgtccgctg cagaacatgc tgcaagtgga cacacctcaa 300 caactgcgcc actaacttag gaagttatta tctctgtaat aactgtcctc accctactac 360 aattcctagc tcgccggtgc gtggtgcgca acccaatcgt cccatcatcc gcaataaaac 420 atctgcaaca cccaacatca atccaatctc gcactcggac agcaagctga atatccttca 480 attcaactgc aacggtctca agaataagag tgccgagatt gctcatcacc tatccataca 540 caatatctca atcgccgccc tccaggagac gaaactttca actcgctctg ctgttccggt 600 tttcaagaat tatgctatat accggaaaga ccggaaaagt ggtcgtggcg gcggcctcat 660 gatgctgata catcataaga tcccttacac cattaaacag ctacctgata cagataccac 720 ggagagccag gggatcacca tcatggctga tggcaccgag ataaatatag tgaacgtata 780 tatccctcct caatctgctt gcccacctca tttctgcgct tcaatagcag acttattgga 840 attccctaac atcattctac ttggcgatct aaatgctcat gatggcctct ggtactcggg 900 tattagcgat gcacgtggcg aaacgctggc ggcggaaata gacgattccg actgctgtac 960 tttaaatctt gactgtccaa ccaggcttcc atctaatggt cagccaactt ctccagatct 1020 ttctattgcc tctatctctt tgactaactg ccttaattgg tcagcagtca caactctttc 1080 ttcagaccac ctgccaattc atatatctct cgacctccgc attactccaa ccaaaaaccc 1140 aaaagtgacg tttatcaacc tttcaaaagc caactggccg ggttttacag ctgaatcaga 1200 acttgcattt caaaatacac tcttccccaa cactgtcacc gcagccgaaa gagcgttccg 1260 caggatctta ggcaacgccg gtgaccacaa cattccttct ggtaggctca tcaattccca 1320 gccagggctc tcgaaggaag tcaccgagct gattcgccga agagatgctc ttaggctctc 1380 caacccatca tccgacaata tctcaactct ttcaaatgac attcacaaca aaatcaatag 1440 tatcaaaaca acaaaatgga aggacttctt aagcactttc aaccgacata ccaaccctaa 1500 acgtctttgg agcgtcatca agtctctcaa cggcagaccc aaactcaatc ccaacaacgc 1560 cattaacttc aacggtaagc caatctcaaa tgggcttgat atggtcaacc ttttcaataa 1620 atcctttacc tgctttagtc agcataaaac caataaacta aacaggtcca tcaaccgtga 1680 ggcaaaacgg aattgcctat ccgatcaccc tacattcacg gtggatcagg tgtccgccgc 1740 catcaaaacg gccaaggcat ccaaagcctc tggcccagac aacatcacca tgctccacct 1800 aaaaaactta ggtcctttgg gtctggcgtt cttaaccagg atctttaatg agtcactcag 1860 atcctgctgt atcccagaca tctataagaa ggcccggatc atccctcttc tgaagccagg 1920 taagtctgat gaagacgcca aatcctacag acccatttca cttctctgcc ctgccgttaa 1980 gatcctcgag agatgcctcc tcccatttat ttctgatcat ctgtccccgg ccagtcacca 2040 acatggtttc cgttcaggac actccactac cactgctgtc tctctccttt ctatggatgt 2100 tgccaatggt ttcaatcaga agagaccagc acatcgaacc acgctggtag ctttggacct 2160 ctcaaaggct tttgacatgg tcaaccacag aattcttatt tctgaccttt tgaagtcctc 2220 gctgccaagc tccatttcac gctggttgtc aaactacctg cacggtcggt ctgctgtaac 2280 ggaattccga aaccagtcgt ctaaacagag gaacatccat acaggagttc ctcaaggttc 2340 agtcctgtcc cctgcgttgt tcaattacta tgtctcacgc attcctgaac caccatctga 2400 tatcaaggta atatcatatg cggacgactt ctccgtctac acgaccggcc cggatgtgga 2460 tattctgaca cagcgactca acaactatct ccctctcttg ataaacttct tctctctccg 2520 cgacctggaa atatccactt ctaaatcttc tataactttg ttcacacctc acacagcgca 2580 ataccactat catccccaag taaaggtaaa cgggcaggtt ctaccgctag aacataaccc 2640 gaaaattctt ggcctcactt tcgacccgat gctgaccttc actcaacacc ataaattgtc 2700 agcatccaag gccacaaaac ggaataatgt tttaaaagct atcagcggca ccacctatgg 2760 ccaagataaa gagaccctct tacaggtcta caaggccata ggcagatcaa caaaggaata 2820 tgcctgcccg gcctggtcgt ctctctcttc ggaatccaat ctatgccgtc ttcaacgcgc 2880 tcagaatgct gcactgagaa tcgtcactgg ctgtcacaaa atcgcatccg agaagcatct 2940 actgtccgag tgccagttgc tgtcaataaa acagcataca gatctactgt caaaacaatt 3000 catcacaaaa tgtttcgacc ctcaacatcc atgccataca ttgttgtcag ccgaacttcc 3060 ccctcgtgtg atgaaaaaca ctctgatgtc aaaatgggga gccacagtca gatccatctt 3120 acacactaat tcggatagtg caagtatcaa tatggctatt cggaatttac atggcatggc 3180 tgccaaggcc gctgcggagg cgtaccacgc gccgatcctg cttaacggat ggccccctcc 3240 acaaccacaa atagacagct cagaggaatc tctgcctcgg gcaataagaa gaactctggc 3300 ccaactccga tctggccaca gtatcatctt aggcagtttc agggaaatta ttgatggcgg 3360 tacaggtaat atgtgtcctc actgtcacac gcacgtagac gacgtggctc atctcttctc 3420 ctgtcctgcc aatcctacga atctcaagac aatagacctg tggcaaaacc caaccgcggt 3480 tgctgattgg ctcaggcttc gcctgaccaa tggataggtg gtcgtgaagg ggcaacaaca 3540 acaa 3544 // ID Gypsy-64_AA-I repbase; DNA; INV; 3579 BP. XX AC AAGE02025748; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-64_AA_; KW Gypsy-64_AA-LTR; Gypsy-64_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3579 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025748; Positions 24686 28264. XX CC Positions [2603-2875] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 146..2875 FT /product="Gypsy-64_AA-I_1p" FT /translation="MPKEEVPSVSGSQAAKVKAVSGVKLIGTLENFVPSAD FT FDDYLERAENFFELNDIKDDTFKRKLIVHFIGLPALKKLQQLLYPDTHRDV FT TYKTVIEKLKSYFSPKKNRIAQSVEFFKRNQKEFENVADFAVELQALSKHC FT VFGEYLDKALRDKFIAGLRNAKIQGELLNSDDDTTFERAVSKAKNLEQIEA FT DLVKMKSKQEYTNRINAGQYMRRSRSKTRGQDDGKHSQQGSRERRSFSKKS FT SRNQGRKVITCFGCGKKVPYTLREKVSDELNRLISENILRPVRHSDWATPI FT VVVPKPDGSVRICMDCKVTVNKVICNEHYPLPNINDVFANLSGYKYFAKLD FT LQGAYMQIKVTEESQKYLVVNTHKGLYAFTRLPFGISSAASTFQRIMDEIL FT ADIEGVHCYLDDILMGSDTIEGLIATLYKVLDQLQRHKVKVNLSKSKFIVQ FT EIGYLGHKVSEKGLSPSAEKVKAIAEAPRPRDVMQLKSFLGMINYYSKFVP FT NLSMKLSPLYALLKKNVRFEWNAQCEKAFTECKKLLLSHNLLELYNPELPI FT VIICDASSNGVGAVLCHRVGEFEKPVFYVSSTLSASEKNYPNLHREALAVV FT FGLTKFFKYIYGKKFTVVTDNKPLASIFDFRKGIPSLTVTRLQKYVHMLSI FT FDFEIEYRPGSKVTNADALSRLPIQGETGVEDVADLRWLSDEAEIIDLEMI FT GQATQDDPRMRELYLSVKNGWNESAVPADLKFYFSNQSCLSLYRNCVMYSE FT RVIVPMSCRKRTLELLHGCHLGVIRMKQSARRYVFWQGLDKEIEEFVQQCE FT VCSSMGRSVKKKFSEWPKALRPFRRIHLDFFHLAGKTFLILMDAYSKWMDI FT TMMQSTDAEALIDALNSVFRIFGVCDTIVSDNGPPFNSNAFLKYAKSLKIE FT LLKSPP" XX SQ Sequence 3579 BP; 1063 A; 591 C; 965 G; 960 T; 0 other; agaaatagtg gcgacgagaa aatagcaaaa agtttgcggt gagcgcgtgt ggataaatag 60 tctatcgtat cgaaaagtaa tcggagcacg tggtcgcgag tgtcgtttga atctacggtg 120 tgcacagtaa agtaatcgct aggagatgcc gaaagaagaa gtgccttccg tatccgggtc 180 tcaggctgcg aaagtgaaag cagtcagtgg agtgaagctg ataggaacac tggaaaactt 240 cgttccaagt gcggacttcg acgattacct ggagagagcg gaaaatttct tcgagctgaa 300 cgacatcaag gatgatacgt tcaaacggaa gctgattgtc cacttcatcg gattgcctgc 360 gttgaaaaag ctgcagcagt tgttgtatcc cgacacacac agggatgtga cctacaaaac 420 ggtaatagag aagctgaagt catacttcag tcccaagaag aaccgaatag cccaatcggt 480 ggagtttttc aagcgaaatc agaaagaatt cgagaatgtg gcggatttcg cagttgagct 540 acaagcactt tcaaagcact gcgtttttgg agagtatctg gacaaggcgc tcagggataa 600 attcatagct ggccttcgga acgccaaaat ccaaggcgag ttgttgaaca gcgacgacga 660 caccacgttt gaacgagcag tatccaaggc gaaaaacctg gagcaaatcg aggcggattt 720 ggtgaagatg aagagcaaac aggagtacac caatcggatc aacgcggggc agtacatgcg 780 acgaagccgt tcgaaaacac gtggccaaga tgatggtaag cacagtcagc aaggaagccg 840 ggagagacgg tccttttcta agaagagctc aaggaatcaa ggtaggaagg tgataacgtg 900 ctttggctgc ggtaagaagg tgccgtacac gctcagagaa aaagtctcag atgagttgaa 960 taggttgatc agtgagaata ttctaaggcc agtgcggcat agtgattggg ctacgcccat 1020 tgtggtggtt cctaaaccag atggtagtgt gagaatctgt atggattgta aggtgacagt 1080 gaacaaggtc atttgcaatg aacattatcc gttgcctaac atcaacgatg tgttcgccaa 1140 tctgagtggt tataagtatt ttgctaagct ggatctacag ggtgcgtaca tgcagattaa 1200 ggttacggaa gagtcgcaga agtatttggt tgtaaatact cacaaaggtc tgtatgcgtt 1260 tacgagattg ccgttcggta tttcaagtgc agcgtcgact ttccaaagaa ttatggatga 1320 aattctggct gatatagagg gagtgcattg ttatctggat gacattttga tgggtagcga 1380 tactattgaa ggattgattg ctacactgta caaagtactt gatcagctgc agaggcacaa 1440 ggtaaaggtc aacttgagta agagcaaatt catagttcag gagataggtt atttgggtca 1500 caaggtgtcg gaaaagggac ttagtccatc ggcggaaaag gttaaggcaa tcgcggaagc 1560 tcctcgacct agagatgtga tgcagttgaa atcgttttta gggatgatta attattattc 1620 gaagttcgtt ccaaatttgt cgatgaagct tagtccactc tacgcgctgc taaagaaaaa 1680 tgtgcgcttc gaatggaatg cacagtgtga aaaagctttc actgagtgca aaaagctttt 1740 gctgagtcac aacctgcttg agttgtataa tcctgagtta ccgatcgtta ttatttgtga 1800 tgcgagctca aatggggtcg gagcagtgtt gtgtcataga gtaggggaat tcgagaagcc 1860 agtcttctat gtgtctagta ctctctctgc gtcagagaaa aattatccga atcttcatcg 1920 tgaggcgctt gcggtagtat ttggattgac gaagttcttc aaatacattt acgggaaaaa 1980 gttcacagtt gttaccgata acaaaccgtt ggcaagcatt tttgatttta ggaagggtat 2040 tccgtcattg accgtcacta ggttgcagaa gtatgtccac atgttgtcaa tatttgattt 2100 cgaaattgag taccggccgg gttcgaaagt tacgaacgct gatgctttga gtaggttgcc 2160 cattcagggt gagacaggag tggaagatgt tgcggatctc cgttggttga gtgacgaagc 2220 agaaattatt gatctggaga tgatcggtca ggccactcaa gatgatccga gaatgagaga 2280 gctttatttg agtgtgaaga acggatggaa cgagagtgcg gttccagcag atttgaaatt 2340 ctacttttcg aatcagagct gtttgtcgtt gtacaggaac tgtgtgatgt attcggagag 2400 agtaattgtt ccaatgtctt gtcggaaacg tactttagag ttgttgcatg gatgccatct 2460 tggtgttatt cgcatgaaac agtcggctcg aaggtatgtg ttttggcagg gtttggataa 2520 ggaaatcgaa gaattcgttc agcagtgtga ggtgtgtagt agtatgggac gatctgttaa 2580 gaaaaagttc tctgaatggc caaaagctct tcgcccgttt cgtcgaatac atttggattt 2640 tttccactta gcagggaaga cgtttctgat tttaatggat gcctactcta aatggatgga 2700 tattacgatg atgcaaagta cggatgcaga agcgttgatt gacgcgctta attcggtgtt 2760 cagaattttt ggggtttgcg ataccatagt cagtgacaat ggtcctccgt ttaatagtaa 2820 tgccttcttg aagtacgcaa aatcgttgaa aattgagttg ttgaaaagcc cgccttaatc 2880 gccagaaagt aatgggcttg cagagcgagc agtccagact gctaagagtg ggattaagaa 2940 gctacttgtc gatccgaagt ttaatcacat gaagttgctg gcgattgtag atatattttt 3000 atttagttat cgtaacagtt atagtcaggc tatcgattgt acgcctgcta gtaaggtatt 3060 ttcgtttatt ccgaaaacgg atctggactc taatttgggt atgaaattcg aacagttgaa 3120 gaagaaagtt aggtttaatt tagacgtaaa cgatcaggaa actttgaaag atggtggaag 3180 gagttgccaa ccaaaatata aagtaggtga tctagtgtgg tataaaagtg aggttaggtc 3240 caccagatct tggctggaag cagaaattgt gggagtgaat agtactaata cgttttatgt 3300 tgacatcaat ggagcggtaa aattagctaa tagaaatcag ttgaaaccca gagttctcaa 3360 aagaaatgac tacatctgcc ctaaagttgg aattgaaaaa agagttccag tgagaacccc 3420 caatgaaccc agattattga gaagtgttag tgttgattct ccgagaagtt ctcctgataa 3480 aacaatagaa gttcgtagga gtagaagagt agggaagaag ccggagagat tagtagtagg 3540 ttcattataa ttgtatagaa tcgaatttaa gcagggagg 3579 // ID Gypsy-135_AA-I repbase; DNA; INV; 4970 BP. XX AC AAGE02025177; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-135_AA_; KW Gypsy-135_AA-LTR; Gypsy-135_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4970 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025177; Positions 34575 39544. XX CC Positions [3817-4287] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 716..2713 FT /product="Gypsy-135_AA-I_2p" FT /translation="MSTFIGSVEPYVVGTSFSDYAERLQHVFNYNRVPEGN FT RKSLFITVSGAAVFSEMKKLYPGVDLDTLEYNDIINKLKGRFDKTDGRMVQ FT KALFYERCQRKDELAEDFILDVKLLAESCGFGAMKDSIIRDRLVLGAYDKK FT TRERLLEEEEPSLVETERILIAREQMARSSLRMEQTSDRVSAIERSTTRDR FT RDDHTRRNWDHRNGRREVYPHSSRRGQDRYNYRSRSRSRSAHRWKTNNNNN FT VMCSYCKIRGHVRKNCFKLQRSRESIRFVEDENSTEQSSAFDFKRLGPPNP FT TEEDSEEDLDCMSIEAAAQVNEPVFINAKINGLGFDMEIDTGSAVAVVGEQ FT IYKSHFSKLPLKKCKRRLSVVDGARLKVSGILDVTVSIRNMIANVTLVILQ FT GSTNFVPLVGRDWLDVFFKGWRTKFLNLSSINNVNVMDKTIDIIDKIKQKY FT NSVFSKDLSVPIEGFEGELTMRECAPIFRKAYTVPLRLKEKVIQHLDSLEE FT SGIITPIQASDWASPVVAIVKKDNDIRLVIDCKVSINKFIVPNTYPLPLAQ FT DIFATLAGCKVFCSLDLAGAYTQLQLSERSRKYMTINTIKGLYTYNRLPQG FT AASSAAIFQKVMDQVLHDIEGVSCYLDDVLIAGKNFEECKNKLLLVLDRLD FT KAKIKVKWNKCKFFWDM" FT CDS 2683..4815 FT /product="Gypsy-135_AA-I_1p" FT /translation="MEQMQIFLGHVITENGLSPSPDKIDTIANAKAPNNPT FT ELKAFLGLINFYGKFVPNLSSKLICFYNLLKKDAKFIWDSKCNENFEICKK FT YLLKPNVLEFFDPKKPLIVVTDACGYGLGGVLAHEINGQEKPISFTSFTLN FT DAQRKYPILHLEALAVVSCVKKFHRYLYGQHFVIYTDHKPLLGVFGKETKD FT AISVTRLQRYIMELSIYSYEIRYRPSKKMGNADFCSRFPLANTVPRSLDRE FT YVNSLNFTNDIPIDFKDVAKETKKDVFLTQIIKYLQCGWPERVERELKDIK FT SQAYDLELTEDCLLFRNRVVVPKSLQQNIMKLLHANHAGIVKIKQLARRTV FT FWFGINADLEKYVKNCDVCNRTLIVPKTKYESSWSPTTRPFSRIHADFFHL FT GEKVFLLIVDSYSKWVELDYMRYGTDCKKVIKKFVNVFARFGLPDVLVTDG FT GPPFNADYFTEFMKNQGILVLKSPPYHPASNGQAERTVRTVKEVLKKFFLE FT PQIKTLDVEEQVNYFLLNYRNSCLTKSGTFPSEHVFSYKPKILLDLVHPKS FT HYKHNLAPRCQNSPIKENSRWNRVDELKYLNSGDALWYKNYRLKHVERWLE FT ATFVKRLSLNVFQISVNGHVLSAHRDQIRLRNNTKPTGFTQIKNNSKRSRE FT DDSEDEPDFLGFPEESFPSQSSLTISNKNSKPIISGVRRSDRLKKQRQEVN FT RDTESRRRSNQ" XX SQ Sequence 4970 BP; 1566 A; 916 C; 1044 G; 1444 T; 0 other; ctttaatggc aacgaaggat agatcaaaga cccagtgcgt gcaaatttaa caaatcggtg 60 tgatttgccg agtatttttt tgtgaaaatt tagatcgttg agccgtactt agaaaaataa 120 aaatcattaa ttttatgtgc tttaaacggc tgccgactca agtttgggta gtatacaaaa 180 gtgatacttg tatcgtgtag aacgttaagt tcttaaaaaa aaattgtgac attgcctact 240 tttcttatca tcacgcggtt caatcaccgc taacatctca gccaccacca acaattttgc 300 caccttcgag accgtcgcca cgcggattgt cgccaccaat atcttcgcca tcgccgtcac 360 cgccattgct gtccactacg gggtcattgt cagtattgtt tcctcatcag tattgttcag 420 accatcactg agttgaagtt acctagctgt tgaagatctg agtgccgtga tcgtgtatta 480 acgattgttg cagtaaacca agaattcgtc tcttgtgtgc acagcatctc aaacgagagg 540 tgagtgactg gacttgacca ttataaaaga gactcgctct ttatagcaat aaatcgatat 600 atcgtgtgct tgaattttga tagtagttga gtgcgtcctg agcagtggta ttacaattga 660 tgagaaatta ttagtgattg gtttctgttt ttgttcaaga tcatacattg caacgatgtc 720 gacgttcatc ggttccgtcg agccctacgt agtaggtaca tcgtttagcg attatgcgga 780 gcgattacag catgttttca actacaatcg ggtacctgaa ggcaatcgca agtcgctatt 840 tatcactgtg agtggagctg cagttttttc tgagatgaaa aaactatacc ctggtgttga 900 tcttgacacg ctggagtata acgatattat aaacaaactt aaaggccggt tcgacaaaac 960 ggatggacgc atggtacaaa aggcgctatt ctacgagcgg tgccagcgca aggacgagtt 1020 agctgaggat tttattcttg atgtcaagct tctagcagaa agttgtggat ttggcgcaat 1080 gaaggattcc atcatacgcg atcggttggt actgggcgca tatgacaaga aaacacgtga 1140 acgtttactt gaggaagaag aaccatcact cgtagaaact gaacggatcc ttatcgcgcg 1200 agagcaaatg gcacgtagtt cgttgcgaat ggagcaaacc agtgatcgag tgagtgcaat 1260 tgaacgatct accacacgtg atcgcagaga tgatcacaca cgacgaaact gggaccatcg 1320 taatggaaga agagaagtgt atccacatag tagtcgtcgt ggtcaagacc gttacaacta 1380 tcgcagccga agtcgttcaa ggtcagcaca tagatggaaa acgaataata acaacaatgt 1440 catgtgcagc tactgtaaga ttagagggca cgtgaggaaa aattgcttca aactccaacg 1500 ttctcgggaa tcaattcggt ttgtggaaga tgaaaattct acggagcaat cgtcagcgtt 1560 cgatttcaag cgtttgggac cacccaatcc caccgaggag gattccgagg aggatctcga 1620 ttgtatgtcc attgaagctg ctgcacaggt aaatgagcca gtttttatta atgcaaaaat 1680 caatggatta ggttttgata tggagatcga tacaggatct gctgtcgctg tggtaggtga 1740 acagatttat aaatctcatt ttagtaagct acctctcaaa aagtgcaaac gaagattatc 1800 tgtcgtggac ggtgctagat tgaaggtttc gggtatatta gacgttacag tttcgataag 1860 aaacatgatt gctaacgtaa ccttggtaat tttacagggt agtaccaatt ttgtgccact 1920 tgtaggaaga gattggcttg acgtgttctt caaaggctgg agaacaaaat ttttaaattt 1980 atctagtata aacaacgtga atgtaatgga caaaacaatt gacataattg ataaaatcaa 2040 acagaaatat aactctgtat ttagcaaaga tttgtctgtt ccaattgaag gttttgaggg 2100 agaattaact atgcgggaat gtgctcctat ttttagaaaa gcttacacag taccattgcg 2160 tttaaaagaa aaggtcatcc aacatttgga ttctcttgag gaaagtggaa taattacacc 2220 tattcaggcc agcgattggg cttcgcctgt tgttgcaata gtcaaaaagg acaatgatat 2280 ccgtttagtc atcgactgta aggtatcgat aaacaagttt atagtgccca atacttatcc 2340 gcttcctcta gctcaagata tttttgccac tttggctggg tgtaaggtat tttgttcgct 2400 ggatttagcg ggtgcttaca ctcaactgca actttcagag agatcgagga agtatatgac 2460 tattaataca ataaaaggcc tatatacata caatcgcctt ccacaaggag ctgcctcttc 2520 cgcagcgatt ttccaaaaag tcatggatca ggtcctgcat gacatagaag gagtgtcatg 2580 ctaccttgat gatgtactta ttgctggaaa aaatttcgaa gaatgcaaaa ataaactcct 2640 tttggttctt gaccgcctgg acaaagcaaa aattaaagtt aaatggaaca aatgcaaatt 2700 tttctgggac atgtgataac agaaaacgga ctttcgcctt cccctgacaa aatagatact 2760 attgctaatg ctaaagcacc aaataatcca acagagttaa aagctttcct tgggttgatc 2820 aacttttatg gaaaatttgt accaaacctg tcatccaaac taatttgttt ttacaatttg 2880 cttaaaaagg atgctaaatt tatttgggat tcaaaatgca acgaaaattt tgagatttgt 2940 aagaagtatt tattgaaacc gaatgtttta gagtttttcg atcccaaaaa accgctcata 3000 gtggtcacag atgcttgtgg ctatggctta gggggcgttt tggcccatga gatcaacggt 3060 caagaaaaac ctatatcgtt cacatctttt actcttaacg atgcacaacg caaatatccg 3120 attttacacc tcgaggcgtt ggcggtagtt tcttgcgtca agaaatttca tcggtatttg 3180 tatggccaac attttgtgat ctacaccgat cacaaacctc ttcttggagt gtttggaaag 3240 gaaactaagg atgctatttc cgttacacgt ttacaacgtt atatcatgga actatccatt 3300 tatagctatg aaattcgata tcggccatcc aagaaaatgg gaaacgcaga tttctgctcc 3360 cgatttcctc tagctaatac agtacctaga agtttagacc gagagtatgt aaacagccta 3420 aattttacta atgatatacc aattgatttc aaggatgttg ccaaagaaac gaagaaagat 3480 gtgtttctca cacaaatcat taaatatctt caatgtggct ggccggaaag agttgaacga 3540 gagttgaaag acatcaaatc ccaagcttat gaccttgaat taaccgaaga ttgtctcctg 3600 tttcgtaatc gagtagttgt gccgaaatct ctccaacaaa acatcatgaa gcttttgcat 3660 gctaatcatg cgggcattgt aaaaattaag cagcttgctc gtcgcacagt attttggttt 3720 ggcataaacg cagacctgga aaaatacgtt aagaactgtg atgtttgtaa tcgtactctt 3780 attgttccta agacaaaata cgagtcgtcg tggtcgccca ctaccagacc ttttagtcga 3840 atccacgctg atttttttca tcttggggaa aaagtatttt tgttaatcgt agacagttac 3900 tcgaaatggg tggaattaga ttacatgcga tatggaacag attgtaaaaa ggtcataaaa 3960 aagtttgtaa atgtttttgc tcgattcggc ctcccagacg ttcttgtaac cgatggtggg 4020 cctcctttca acgcagacta ttttactgaa tttatgaaaa atcaaggaat tttagttctt 4080 aaaagccctc catatcaccc agccagtaac ggacaggctg aaagaactgt tcgaacggtc 4140 aaggaagtat taaaaaagtt ttttttggaa ccacaaatca agacccttga cgtggaggag 4200 caggtgaact attttctctt gaactacaga aatagctgtc tgacaaagag tggaacattt 4260 ccctctgagc atgtgttttc atataagccg aagatattac ttgatcttgt tcacccaaaa 4320 agtcattata agcataactt agctccccgt tgtcagaatt ctcctataaa agagaattct 4380 agatggaatc gggttgacga gctaaaatat ttaaattcgg gagacgcatt atggtataaa 4440 aactaccgtc tcaaacatgt cgagcgatgg ttggaggcga catttgttaa gagactatcg 4500 ttgaatgttt tccaaatttc ggtgaatggt catgtgttat cagcacatcg agatcagata 4560 cgattacgga ataatacaaa accaactgga tttacacaaa tcaaaaataa ttccaaacgg 4620 tctcgtgaag acgactctga agacgaacca gatttcttgg gatttcctga agaatccttt 4680 ccgagtcaat catctttgac aataagtaac aaaaattcta aacctatcat ttcaggagtt 4740 agaagatctg accgtttgaa gaaacaacga caagaagtaa atagagatac cgagtctcgt 4800 cgaagatcga atcagtgatt atattaataa agatttaaaa ttaagaaata gccttgaagt 4860 gttaaatagc aattgttcag aattgcatgt acgttagaat ttcagaaatt ttataaatag 4920 tatcattgaa tcacatttat ttaattctta agtttaaggg aatggaggac 4970 // ID IFP2 repbase; DNA; INV; 2475 BP. XX AC J04364; XX DT 13-MAY-1999 (Rel. 4.04, Created) DT 13-MAY-1999 (Rel. 4.04, Last updated, Version 1) XX DE Trichoplusia ni transposon IFP2. XX KW DNA transposon; Transposable Element; IFP2; TTAA superfamily; KW Autonomous DNA transposon. XX OS Trichoplusia ni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Noctuoidea; Noctuidae; Plusiinae; Trichoplusia. XX RN [1] RP 1-2475 RA Cary C.L., Goebel M., Corsaro G.B., Wang G.H., Rosen E. RA and Fraser J.M.; RT "Transposon mutagenesis of baculoviruses: analysis of RT Trichoplusia ni transposon IFP2 insertions within the FP-locus of RT nuclear polyhedrosis viruses."; RL Virology 172(1), 156-169 (1989). XX DR GenBank; J04364; Positions 1 2475. XX CC TTAA target site duplication. Transposase encoded by IFP2 CC transposon has been found in human (LOOPER transposon in CC humrep.ref). XX SQ Sequence 2475 BP; 802 A; 418 C; 496 G; 759 T; 0 other; ccctagaaag atagtctgcg taaaattgac gcatgcattc ttgaaatatt gctctctctt 60 tctaaatagc gcgaatccgt cgctgtttgc aatttaggac atctcagtcg ccgcttggag 120 ctcggctgag gcgtgcttgt caatgcggta agtgtcactg attttgaact ataacgaccg 180 cgtgagtcaa aatgacgcat gattatcttt tacgtgactt ttaagattta actcatacga 240 taattaatat tgttatttca tgttctactt acgtgataac ttattatata tatattttct 300 tgttatagat atcgtgacta atatataata aaatgggatg ttctttagac gatgagcata 360 tcctctctgc tcttctgcaa ggcgatgacg agcttgttgg tgaggattct gacagtgaaa 420 tatcagatca cgtaagtgaa gacgtccaga gcgatacaga agaagcgttt atagatgagg 480 tacatgaagt gtcagccaac gtcaagcgta gtgaaatatt agacgaacaa aatgttattg 540 aacaaccagg ttcttcattg gcttctaaca gaatcttgac cttgccacag aggactatta 600 gaggtaagaa taaacattgt tggtcaactt caaagtccac gagcggtagc cgagtctctg 660 cactgaacat tgtcagatct caaagaggtc cgacgcgtat gtgccgcaat atatatgacc 720 cacttttatg cttcaaacta ttttttactg atgagataat ttcgcaaatt gtaaaatgga 780 caaatgctga gatatcattg aaacgtcggg aatctatgac aggtgctaca tttcgtgaca 840 cgaatgaaga tgaaatctat gctttctttg gtattctggt aatgacagca gtgagaaaag 900 ataaccacat gtccacagat gacctctttg gatcgatctt tgtcaatgtg tacgtctctg 960 taatgagtct gtggatcgtt ttggattttt tgatacgatg tcttagaatg gatgacaaaa 1020 gtatacggcc cacacttcga gaaaacgatg tatttactcc tgttagaaaa atatgggatc 1080 tctttatcca tcagtgcata caaaattaca ctccaggggc tcatttgacc atagatgaac 1140 agttacttgg ttttagagga cggtgtccgt ttaggatgta tatcccaaac aagccaagta 1200 agtatggaat aaaatcctca tgatgtgtga cagtggtacg aagtatatga taaatggaat 1260 gccttatttg ggaagaggaa cacagaccaa cggagtacca ctcggtgaat actacgtgaa 1320 ggagttatca aagcctgtgc acggtagttg tcgtaatatt acgtgtgaca attggttcac 1380 ctcaatccct ttggcaaaaa acttactaca agaaccgtat aagttaacca ttgtgggaac 1440 cgtgcgatca aacaaacgcg agataccgga agtactgaaa aacagtcgct ccaggccagt 1500 gggaacatcg atgttttgtt ttgacggacc ccttactctc gtctcatata aaccgaagcc 1560 agctaagatg gtatacttat tatcatcttg tgatgaggat gcttctatca acgaaagtac 1620 cggtaaaccg caaatggtta tgtattataa tcaaactaaa ggcggagtgg acacgctaga 1680 ccaaatgtgt tctgtgatga cctgcagtag gaagacgaat aggtggccta tggcattatt 1740 gtacggaatg ataaacattg cctgcataaa ttcttttatt atatacagcc ataatgtcag 1800 tagcaaggga gaaaaggttc aaagtcgcaa aaaatttatg agaaaccttt acatgagcct 1860 gacgtcatcg tttatgcgta accgtttaga agctcctact ttgaagagat atttgcgcga 1920 taatatctct aatattttgc caaatgaagt gcctggtaca tcagatgaca gtactgaaga 1980 gccagtaatg aaaaaacgta cttactgtac ttactgcccc tctaaaataa ggcgaaaggc 2040 aaatgcatcg tgcaaaaaat gcaaaaaagt tatttgtcga gagcataata ttgatatgtg 2100 ccaaagttgt ttctggactg actaataagt ataatttgtt tctattatgt ataagttaag 2160 ctaattactt attttataat acaacatgac tgtttttaaa gtacaaaata agtttatttt 2220 tgtaaaagag agaatgttta aaagttttgt tactttagaa gaaattttga gtttttgttt 2280 ttttttaata aataaataaa cataaataaa ttgtttgttg aatttattat tagtatgtaa 2340 gtgtaaatat aataaaactt aatatctatt caaattaata aataaacctc gatatacaga 2400 ccgataaaaa cacatgcgtc aattttacgc atgattatct ttaacgtacg tcacaatatg 2460 attatctttc taggg 2475 // ID CR1-82_HM repbase; DNA; INV; 4575 BP. XX AC . XX DT 13-JAN-2009 (Rel. 14.02, Created) DT 13-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-82_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4575 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 369-369 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 62..802 FT /product="CR1-82_HM_1p" FT /translation="MDLTLKNIEKLITSKLEEQKAYILSETSKLLKEQEKN FT FTVIISGNLKILAERLDKLEXEVKRNKSNNTIIEKDIHEIKESLNFQEANV FT TEKXEKVKANLGYDINYLVKKTVDLENRSRRNNLRIDGVKEIEGESWEDCR FT NSVKEIIKNKLKISEEIEIERAHRIGSAREDKXPRTIIFKLLNFLDKTKVL FT NSAKKLRETGIYINEDYAKETIEHRKKLWEEVKRLRKAGKYAIIKFDKIYC FT REFKK*" FT CDS join(857..1288,1272..2084,2011..3051,2942..3934) FT /product="CR1-82_HM_2p" FT /translation="MVNETIDLETLRFNIFEIANKHLPRNRYDEFSFLNEK FT YSDSVYFDIKNFKVELKTTINNFTXIHINIRSLPINIDKLKNFLLECKHTF FT SMICLTETWCSDETAEKNSDLQIPNYKIFSKERSVNKRGGGIVIYIQNDLN FT TKVKKTLKLRNDLSNSDGNIEVLTMEVTNASKNFLVSTCYRPPDGDTCKFS FT NYMKYLITKNREQKKLFIIGDININILKYNEHAVTKNFFDDMFQLNIFPII FT NKPTRVTSKSITAIDNILTNSIQDTSLKSGIIKTDISDHFPIYFSLSLDSR FT KIDNSKILFYKRIIDETSTQKFKDLLSATNWQKVYRECNSGNTNSAYIKFI FT NLFMLEYNKSFPIQEKEIKAKYLNCPWITKEIRKSSKQKQKLYIKYLKNRS FT EANLFLYKQYKNLFEKIKKKTKIEAKRTYFSTSNTKTYLKKLKKKHKILYY FT SNQMQKFNGDLKKTWDIMKEIIGKGITNSNKMPSRVIINQKESNNKRDISN FT EFNSYFANIGVDLASKIQCPNNSFKNYLTGXHCSLNFNELNQDELEIAKKS FT LKIKSPGIDDISCKVVIDVFEEIRNPIYQIFNSSVTTGIVPDLLKISKIIP FT IYKSGEANSLNNYRPISILSVFAKLLERIIYNKLNEYLKTNKIINKGQYGF FT QKRHATEHAILDLSNRISKSFKQKKFTLGIFIDLSKAFDTVNHEILLTKME FT NYGVKNKALTWFKNYLNDRHQCVIIDKKRQLEFTENKMWCTPRVHSCSSFV FT SYLHKRSPKCLIKKGNSNLLKTKCGVPQGSILAPLLFLIYINDLPNASNIF FT KTIXFADDTNLFYSSKTIEDLFKKTNNELKKINIWFKSNKLSLNIDKTKYI FT LFHSTQQIKKLPLNLPTINIENITIERAQNTKFLGVLIDEHISWKPHINYI FT NTKISKNIGLLYKARSFLTQKSLILVYFSFIHSYLMYANIAWGSTHITKLK FT PLYLRQKHASRLVFNKNKLTHAQPLLEQMNALNIFQINIYQNILFMFKYKL FT SLVPEHFTAHFFENNVNKYNTRATGNFKIPFEKTKLSKFSIKHRGPYLFNK FT IISRNESLINLNNEQSLKTKLKSTIIKLNNYKEFF*" XX SQ Sequence 4575 BP; 1908 A; 706 C; 573 G; 1375 T; 13 other; accgtgaaca tcaccgcgag cggacgtgtt ttttttttat tatcttacta acaaattaaa 60 aatggatcta acgttaaaaa acatcgaaaa gttaataaca agcaagcttg aagaacaaaa 120 agcttatata ttatcggaaa caagcaagtt attaaaagag caagaaaaaa actttacagt 180 aataataagt ggaaacttaa aaattctcgc agaacgacta gataagcttg aayacgaagt 240 taaacgaaat aaatcaaaca atactatcat tgaaaaagat atacatgaaa ttaaagagag 300 cctaaacttt caagaagcaa atgttacgga aaaaatrgaa aaagttaaag caaatttggg 360 gtatgatata aattatttag taaaaaaaac tgtggactta gagaaccgct cacgcagaaa 420 caatttaagg atcgatggtg ttaaagaaat agaaggtgaa agctgggaag attgtagaaa 480 ctcggttaaa gaaataatta aaaataagct aaaaatatct gaagaaattg aaatcgaacg 540 agcccatcgt attggaagtg caagagaaga caaacyaccg agaacaataa tcttcaaact 600 actgaatttt ttagataaaa ctaaagtttt gaacagcgcc aaaaaattac gagagactgg 660 gatttacata aacgaagact acgcgaaaga aacgattgaa catcgtaaga aattatggga 720 ggaagtgaaa cgacttcgta aagctggtaa gtacgccatt attaaatttg ataaaattta 780 ttgcagagaa tttaaaaaat aaacagctgt ttttagcgag caaattttta aaaaatttca 840 tttctaaaaa atcattatgg ttaatgaaac aatagactta gaaacgctgc gttttaatat 900 ctttgaaatt gccaataaac atttaccacg aaatcgttat gacgagttta gctttttaaa 960 cgaaaaatac tctgattctg tatattttga tattaaaaat tttaaagttg aacttaaaac 1020 cactataaat aattttacgg yaatacatat aaatatcaga agtttaccta taaatataga 1080 caaattaaaa aactttctcc ttgaatgcaa acatactttc agtatgatct gccttacaga 1140 gacttggtgc tccgacgaaa cagcagaaaa aaactctgac ctccaaattc caaactataa 1200 aatattttct aaagaaagat ctgttaacaa aagaggagga ggaattgtga tttatataca 1260 aaatgattta aacactaaag ttaagaaatg atctctcaaa ctctgatgga aatattgagg 1320 tcctcacaat ggaagttaca aacgcgtcaa aaaactttct agtttctacg tgctacagac 1380 cacccgatgg tgatacctgt aaattctcaa actatatgaa atatttaata acaaaaaata 1440 gagaacagaa aaaattgttc ataattggag atataaacat aaatatctta aagtacaacg 1500 aacatgcggt tactaaaaat ttctttgacg acatgtttca gcttaatatc ttcccsatta 1560 tcaacaaacc taccagggta acttccaaat ctattactgc cattgataat atattaacaa 1620 actcaataca agatacttct ctaaaatcag gaataataaa aacagacatc tcagatcact 1680 ttccaatcta cttttcttta tcactagatt ccagaaaaat tgataattca aaaatcttgt 1740 tttataaaag aattatcgat gaaacatcta ctcaaaaatt taaggacttg ctatcggcaa 1800 caaattggca aaaagtctac cgagaatgta actcaggaaa cacaaactct gcgtacatta 1860 aatttattaa tttatttatg ctagaatata ataagagttt tccaattcaa gaaaaagaaa 1920 taaaagcgaa atatttaaac tgtccttgga taacaaaaga aataagaaaa tcctcaaaac 1980 aaaaacaaaa actttacata aaatatttaa aaaatagaag cgaagcgaac ttatttctct 2040 acaagcaata caaaaactta tttgaaaaaa ttaaaaaaaa aacataaaat attatattay 2100 tccaaccaaa tgcaaaaatt taatggagac ttaaaaaaga catgggacat tatgaaggaa 2160 ataattggta aaggtataac taattctaat aaaatgcctt ctagagtaat aattaatcaa 2220 aaggaatcaa acaataagcg tgatatttct aatgaattca acagctactt tgccaatatt 2280 ggtgttgatc ttgcatcaaa aattcaatgc ccaaacaatt catttaaaaa ttacttaaca 2340 ggcwcacact gctccttaaa tttcaacgaa ctaaaccaag acgaacttga gatagctaag 2400 aaatcactta aaatcaagtc cccaggaatc gatgatatct cttgcaaagt agtgatagat 2460 gtttttgagg aaatacgtaa tcctatttac caaatattta actcatcagt tacaacaggt 2520 attgtacctg atttgcttaa aatctcaaaa attataccaa tctataaatc aggagaagcc 2580 aactctttaa acaattatag acctatttca atcctttcgg tattcgcaaa acttcttgag 2640 cgaattatct acaataaatt aaatgaatat ttaaaaacta ataaaattat caataaaggt 2700 caatacggtt tccaaaagcg acacgcaaca gagcatgcaa tccttgatct ctcaaatagg 2760 attagtaaat catttaaaca aaaaaaattc acgttaggaa tttttattga tttgtcaaag 2820 gcgtttgaca ccgtcaacca tgagatttta ctgacaaaaa tggaaaatta tggcgttaaa 2880 aataaagctt taacttggtt taaaaactac cttaatgaca gacaccagtg tgttattata 2940 gataaaaaaa ggcaactcga atttactgaa aacaaaatgt ggtgtacccc aagggtccat 3000 tcttgctcct cttttgtttc ttatttacat aaacgatctc ccaaatgcct ctaatatttt 3060 caaaactata atktttgccg atgatacaaa cttattttac tcatcaaaga ctattgaaga 3120 cttatttaaa aaaacaaata atgaactaaa aaaaattaat atatggttta aatcaaacaa 3180 actatcactc aatatagata aaactaaata tattctattc cactctactc aacaaattaa 3240 aaagcttccc cttaacctac caacaattaa tatagaaaat ataaccatcg aaagagctca 3300 aaatacaaag tttttaggtg ttctaattga tgaacacatc tcttggaagc cccacatcaa 3360 ctatataaat acaaaaatat caaagaacat aggtttactt tataaagcaa gatctttctt 3420 gacacaaaaa agtctaatac tcgtttattt ttcattyata catagctatc ttatgtatgc 3480 caatatagca tggggcagta cccacataac taaattaaaa ccgctttatc tgcgtcaaaa 3540 gcacgcttca agactagtgt tcaacaaaaa caaacttact catgctcaac ccctactgga 3600 gcaaatgaac gctttaaata tttttcaaat taatatttat caaaatatac ttttcatgtt 3660 taaatataaa ctaagtctgg ttccagaaca ttttacagca cacttttttg aaaacaacgt 3720 aaacaaatac aatacaagag caacgggaaa ctttaagata ccctttgaaa aaactaaact 3780 ctcaaarttt tcaataaaac atcgtggtcc ctatttattt aacaaaataa tttctagaaa 3840 cgaatcgcta ataaatttaa acaacgaaca gtccctraaa acaaagttaa aatctacaat 3900 aatcaagtta aacaactaca aggagttttt ttaaactcca atttaaaatt taaaaataaa 3960 aataatataa aacaacccta ataaaacaaa aaatgtaact tttcaaagga atcaataaaa 4020 ataatactaa aaggagatag taacattaat aattttaagg catatatgta ycacatacat 4080 atttgccacg tatgcaactg atttttatgt gtattacacg tctagaaggt actcgatgac 4140 aagactgacc ccagtcttct gcgagcttcc tataactaca aacaattaat tattttcttc 4200 aatcttatac aaatcttgtt tttatcatac ttttatatat atatatatat atttatattt 4260 taattgaaag atatatagat aggtttattt mttgggtaat ttttacgttc acgtatatat 4320 ctatatgtgt actttgtatt tatttttatc atgtctcaaa gttgaatgta acaatatttt 4380 agatttatcc cttatttatt ttggacttta tgtttacgat atttatattt gtatatcttt 4440 attcgccaat cgatgattgg tcaatatgta aatactttgt agttgtaaaa taaacatgta 4500 taaaaaaaaa aatatatata taatttatat atctacacat atatatatat atatgtatat 4560 atatatatat atata 4575 // ID Gypsy-30_CQ-LTR repbase; DNA; INV; 216 BP. XX AC AAWU01003903; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_CQ_; KW Gypsy-30_CQ-I; Gypsy-30_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 440-440 (2011). XX DR GenBank; AAWU01003903; Positions 8546 8761. XX SQ Sequence 216 BP; 59 A; 39 C; 72 G; 46 T; 0 other; tgtagggtta tgcatttata gctacattat tacttaaaat gggtaaacac agttaacagt 60 gtagtcaggg agttttgaca gctcgggcgc gagggagaca gttgaaaatc taccaccgca 120 gcgtgaggac gcgtgattga aggacgggga cggttgacga ggtggaccag gtaacggcag 180 gcttgaggag ctctcgccgg gggaagaatc tcgaca 216 // ID Copia-125_AA-I repbase; DNA; INV; 5439 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-125_AA_; KW Copia-125_AA-LTR; Ty1_copia_Ele14; Copia-125_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5439 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2918-3202] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 348..1826 FT /product="Copia-125_AA-I_3p" FT /translation="MTEKYLFARLNNQNYSIWKTRMEMLLRREERWYVVES FT AKPEPVTDPWKKDDSKCHAIIVLYVEDSQLNLVKDCKSAKEVWDCLKQYHE FT KTTMTSRVSLLKKICSLNLADGEDLEKHIFDMEELFVRLACAGQALEESLQ FT IAMILRSLPDSYGGLVTALESRPEADLTMSLVKQKLLDEHQRRLERNTNSG FT DRALKLQQREQKKRLCFHCQKPGHFRKDCRQWLHQQQKKSDGESVSEKATK FT QNQKKKSDVKAKLSAEKDVPICFTVGNGRRKGCWYIDSGCSSHMTNDRSFF FT FDLDESKQVQVVLADGSVSKSHGIGEGFVKCIDADGNVLEVKLTEVLYIPA FT LDSGLISVRKLTKKGFRVNFVGSACSVVSAAGKTVALGELNGNLFVLKTVE FT YANLSKELQHLPNCQHMWHRRFGHRDPAVLEKLKAGNLSVGFEMADCGLRQ FT VCEDCLKGKLPRTSFPKSSDNRASRIRSVSKSSRKHSKRYISEHSSSF" FT CDS join(3062..4300,4304..5287) FT /product="Copia-125_AA-I_1p" FT /translation="MATCMLLDAGLGKQYWGEAVMTAAYVQNRLPSRVVDK FT TPFEKWFGKAPKLNHLRVFGCPAYVHTPDVKRTKFDCKSQKLIFVGYCEDR FT KAFRFLDPKTNGITISRDARFIEQLSIPHMAKPEAQTSEEIEIELDWPESG FT ASQEEAGAQGEQCVQEEQQAAEESDQDSDEEFYDGEMQDQADDRVADPADG FT APGEQQNSGKRSTRGMLPSRLRDYVVDVALVVEDDPVSYEEAVQGPEQDLW FT LAAMKEEYDNLMQNETWTLVELPAGRKPIGSKWVFKKKQDSAGNVTKYKAR FT LVAQGFSQQFGVDYDSVFAPVAAHTTLKVLLTVAGHKQMKVRHLDVKGAYL FT HGRLKEELYMQQPKGFQQRGQEKLVCRLQRSLYGLKQGARVWNETIRGILE FT AMHFKQSSADPCLFTKRTPGWIYILIYVDDIIVACTSEEEIARVERSLKEN FT LTLSSLGDVSCFLGIRIRKDVNGFYSMDQQGYILRIANRFNLDRARGSKIP FT LDTGYYRSREGSKAMPDNKQYQCLIGALLYLAVNTRPDVAASVAILSRRVS FT APTEADWVELKRVVRYLVTTNDYELKLTSKREEPLKLIGYSDADWSADTDD FT RKSNTGFLFKLGQATIGWVSRKQTCVSLSTMEAEYIALSEACREVLWLRKL FT LQDFNERQDGPTTVLEDNISCIDFVAIDRSSRRSKHIDTRVHFAKDLSEKG FT IVTIEYCPSASMIADILTKPLGSQKQLQFSEMLGLTKSVGR" XX SQ Sequence 5439 BP; 1581 A; 1060 C; 1394 G; 1403 T; 1 other; ggtcatgggc ccagtttcgg tggttgaaaa acgagtgtag tgaactttgc atgcgggaac 60 gcttaagtgc attttgagcg aagccatttt gttttcattc gatttggctg attgaatgct 120 gccagcagct gcagagctcg gacgactgtc agtgtgttag tgcgacagtg tttcgtccgg 180 tgtgtgagcg gttgctcgaa taacggtttc ttctcaagag tcggtgtggc accgacgagt 240 gtgtgcgtat ccattcgatt gattgtgtgt ttcgacgcga tcgacccagc agcggtcgga 300 gatcagtgaa agtgtttctc aaatcgcggt tcggtgaata ggaaacgatg actgagaagt 360 acttattcgc ccgattgaat aatcaaaatt attcaatatg gaagacgaga atggagatgc 420 ttttgaggcg agaagagcgg tggtatgttg ttgaaagtgc gaaaccagag ccggtcacgg 480 acccgtggaa gaaagatgat tcgaaatgtc atgcgataat agtgctctat gtggaagaca 540 gccagctcaa tttggtgaag gattgcaaat ccgctaaaga agtgtgggac tgtttgaaac 600 aataccatga aaaaacgacc atgacatcgc gcgtgtcctt gctcaaaaag atttgtagct 660 tgaatttggc cgacggtgaa gatttggaaa agcacatttt tgatatggaa gagcttttcg 720 ttcgtctcgc gtgtgcgggc caagcgctcg aagaatcgtt gcagatcgca atgatattgc 780 ggagcctacc ggattcatac ggtggtctgg tcacggcttt ggaaagccgg ccggaggctg 840 atctgacgat gtcgctcgtc aagcaaaagt tgttggacga gcatcagcgg cgattggagc 900 ggaataccaa ttcaggagat agggctctga aactacaaca gagagagcag aagaagcgac 960 tgtgttttca ttgccaaaag cccggccact tccgaaagga ttgccggcag tggctgcacc 1020 agcagcagaa gaaaagtgac ggggaatcag tgagcgaaaa agcaacaaaa caaaatcaga 1080 agaagaagag tgatgtgaag gccaagcttt cggctgaaaa ggacgttccg atttgtttta 1140 cggttgggaa tggtcgtcgt aaaggttgtt ggtatatcga tagtggctgc tctagccata 1200 tgaccaacga tcggtcattc ttcttcgact tggacgaaag taaacaagtg caagtagtgc 1260 tcgctgatgg ctccgtgtcg aagtcgcacg gaatcgggga aggtttcgtg aagtgtatcg 1320 acgcagatgg caatgtactg gaagtgaaat tgacggaagt gctttacata ccggcgctag 1380 acagtgggct gatttccgta cggaagctga cgaagaaagg ttttcgggtg aactttgtcg 1440 ggtcagcatg ttcggtagtg agcgccgccg gtaagactgt tgcgttgggc gaactgaacg 1500 gaaacttgtt tgtgttgaaa acagtggaat atgcaaacct gagtaaagag ttacagcatc 1560 tgccgaactg ccaacacatg tggcatcggc ggtttggaca ccgagatccg gccgtgttgg 1620 agaagctgaa agcaggtaac ctcagtgtag gcttcgagat ggcggattgc ggtttacggc 1680 aggtatgcga agactgcctg aaaggtaagc tcccacgcac ttctttccct aaatcttccg 1740 ataacagagc cagtagaata aggagtgtat cgaaaagtag tcgcaaacat tcaaaacggt 1800 atatctcaga gcatagttca tctttttaaa agcttttttc acgcaaatct tcatcatggc 1860 atcaaatatt gaagcaacta aaaaaataat gaaaaatata cagtcttact ataaatacaa 1920 caaagtttat aaagcatgga aaataaaatc ttgcacaaaa ttacattttt gaagtgtctt 1980 ctgaagattc gatggtttgt attgaacaaa atgtatatca aacaaaatag gatgaaaagc 2040 ttattctatc ggaaaatatg tttacttcac acaacacgca aaaagaagat caaaaaaata 2100 aattattgct cttaaaaagc tcaattattg gaacaaggtc ggttttagaa agtcactttt 2160 gttgaatcaa cttttgaaac tccgtcatat acagcggtaa ttatatagag taacattttt 2220 tacttacatt actacataaa tatgcaaaga aacttttcca atcaaaaaac aatttttatt 2280 cgttacaatc gttcgatatt accattaacg tgtagaaaga aaaaaaatca aaaaacgagg 2340 tccttaaaaa tcgacttttt tcgatttttg tatttgctct caaatgcttg ttaatgaatt 2400 tttcaaaata ttttttacac tatgtcataa gagttgtggc acagaatata ttagcataaa 2460 caaaaaccat tttttcctaa aatttaacat atttattaac atatcaattt tttcagaggt 2520 gtgcaagatt tttttcgaca tctgttagta gtttgcaaaa ttatcatgca aaatgccatt 2580 ttattgaaac aatatcgccg catcataatt tttaaataca ttgtagagca tttctgatca 2640 aaaaattgtt tgtaagttca acccattatt ttttagactg ttttgaatgt ttgcgactat 2700 ttttcgatac actccttact cgacttggtg catactgacg tatgcggacc tatggctaac 2760 gtgaccccag gcggctgtag atatttgatg acgctcatcg atgatttcag tcgctacacc 2820 gttgtatgct tgctgcgaca aaaatccgat gtagccgatt gcattaaacg atatgtggct 2880 cacgttaaga cgcgctttgg aagggcacca tgtgtgatta ggtccgaccg ggggaggaga 2940 gtacgtcaac ggcgtgctta aagcgttcta cgagaaggaa ggtattcaag ctcagtacac 3000 cgctgcttat agcccccagc agaacggtgt agctgagcgg aaaaaccgaa cgttgcagga 3060 gatggcaact tgtatgttgc ttgatgcagg actgggaaag caatactggg gcgaagcggt 3120 catgaccgcc gcttacgttc agaaccgtct cccttctcgt gtagttgata agaccccatt 3180 cgaaaaatgg tttggtaagg ctccaaaatt gaaccaccta cgcgtgtttg gttgtccggc 3240 atacgtgcat accccggacg tgaagagaac gaaatttgac tgcaaatcac aaaaactgat 3300 tttcgtcgga tattgcgagg atcgcaaggc attcagattt ttggatccaa agaccaatgg 3360 tatcacgatc agtcgtgatg cacgctttat tgagcagttg agcattcccc acatggctaa 3420 gccagaagcg caaacatcgg aagagatcga aattgaactc gattggcctg aatctggagc 3480 atctcaggag gaggcaggtg ctcagggaga acagtgcgtt caggaggagc aacaagctgc 3540 cgaggagtcc gatcaagatt cggatgaaga attctacgat ggtgaaatgc aagaccaggc 3600 agatgatcga gttgcagatc cagcggatgg tgcacctggt gaacagcaga acagtggtaa 3660 gcggtcgacc agaggaatgc ttccgagcag actacgggac tatgtagttg acgtggccct 3720 agtagttgaa gatgatccag tttcgtacga agaggctgtc caaggaccgg agcaagactt 3780 atggctagca gcaatgaagg aggagtacga caacttgatg cagaatgaga cgtggacgtt 3840 ggtcgaattg cctgccggcc gaaaaccgat aggcagtaag tgggttttca aaaagaagca 3900 agatagtgct ggaaatgtaa ccaagtataa agcccgctta gttgcgcagg gcttctcgca 3960 acagtttgga gtagactacg actccgtctt cgcaccagta gctgctcata ctacgttgaa 4020 ggttttgctt actgttgccg gccataagca gatgaaggtt cgtcacctgg acgttaaagg 4080 tgcatattta cacggtcgat tgaaagagga gctgtatatg cagcagccta agggtttcca 4140 gcagcgtggt caagagaagc tagtgtgcag actacaacgc agcctttacg ggctgaagca 4200 aggagctcgt gtatggaatg aaacgatccg tggtatcctt gaagcgatgc atttcaaaca 4260 atcatcagca gacccttgtt tgttcacgaa gcgtactcct gwaggttgga tttatatttt 4320 gatatacgtg gacgacatta tcgttgcttg tacttcggag gaggagatag caagagtcga 4380 acgttctttg aaggagaatc tgaccctgtc atctttgggc gacgttagtt gtttcctggg 4440 tatccgtata aggaaggacg ttaacggctt ctattcgatg gaccagcaag gctatatctt 4500 gaggattgcg aaccgattta acctggaccg tgctagggga tcgaagatcc cattggatac 4560 cggctactac cgcagcagag aaggcagtaa ggcgatgccc gataacaagc aataccagtg 4620 cttaatcgga gcgttacttt acttggctgt gaatacgcgg ccagatgttg cagctagcgt 4680 tgcaattctc agccgcagag ttagcgcccc aacagaagcc gactgggtgg agttgaaacg 4740 ggtggtccga tatcttgtaa caaccaacga ttacgaactc aagctgacca gcaaaagaga 4800 agaaccactg aagctaatcg ggtacagcga cgccgattgg agtgccgata cggatgatcg 4860 caagtccaat accggattct tatttaaact tggacaagcg acaattggat gggtcagccg 4920 caagcagact tgcgtatctc tgtcaactat ggaggcagag tatatagcct tatccgaggc 4980 ttgtagagaa gtactgtggc tacggaagtt gcttcaagat ttcaacgaaa ggcaggatgg 5040 accgacaact gtattggagg ataacatcag ctgtatcgac tttgtcgcca ttgaccgttc 5100 ttctagacga tccaaacata ttgatacgcg tgtgcatttt gcaaaggatt tgtcggagaa 5160 gggtattgtg actatcgaat attgtccgtc ggcatcgatg attgccgata tcctcaccaa 5220 gccattggga agtcagaaac agctgcagtt ctcggagatg ttgggcttaa ccaagtccgt 5280 aggccgctaa agatatcgag gaggagtgtg gaaagtgcga tatctttccc tcgcatctgt 5340 gatgcggcga agagcaaaaa cagaaatgag cataccttta tgaattcgat gtactttgtt 5400 tttatttcca ttccccaagt tactctcgtg aatataaga 5439 // ID Mariner-15_SM repbase; DNA; INV; 1011 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-15_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1011 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1864-1864 (2009). XX DR [1] (Consensus) XX CC 86% identical to consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 157..948 FT /product="Mariner-15_SM_1p" FT /translation="MECQVNKSEHFRHYLLFAFNRGATATEAAREICAWVP FT HLLSENNKNQRSTVSASLLXRHRSTHGHKQRFLYRIVTGDEKWCLYVNMKQ FT RKEWLSPEQQATPRAKQDLHPRKTMLCVWWDWEGIIHYELLERNQTVNAEL FT YVQQMHRLNNAIQQKRPDRRNRVLLQHDNARPHIAYMTKEAVQTLGWEVLP FT HPPYSPDLAPSDFHLFRSLSNALRGVSFNNDVELRAWLGDFFETRPGDFYR FT RGIEKLVERWEEVVDNNGEYIID" XX SQ Sequence 1011 BP; 292 A; 222 C; 218 G; 277 T; 2 other; agttcccgcc gtttttttaa attcaaaatt gcgcgttgaa aataaaatgt tttggtataa 60 tgttttgtgt tattcgaaag ataatttttt cctctacaag agagtgtatt tggtttttca 120 aaatacttaa attttgttta tttttagagc atttaaatgg agtgtcaagt aaacaaaagc 180 gagcatttcc gacactatct tctgtttgcg tttaatcgag gtgctacagc cacagaagct 240 gcccgagaga tttgcgcgtg ggttcctcat cttttgagcg aaaacaacaa aaatcaacga 300 tccaccgttt ccgctagttt gctcgnccgt caccgctcaa ctcatggcca caagcaaaga 360 tttctttacc gaatcgtcac tggcgacgaa aaatggtgtc tgtacgtcaa catgaagcag 420 cgcaaagaat ggttaagccc ggaacaacaa gcgacaccgc gagcgaaaca agatcttcat 480 ccncggaaga cgatgttgtg cgtctggtgg gactgggaag gcattattca ctatgaattg 540 cttgaacgga accaaacagt gaacgcagag ctttatgttc aacagatgca ccgcctcaac 600 aacgctattc aacaaaaacg tcccgatcga cgaaatcgag ttcttctcca acatgacaac 660 gcccgacctc atatcgccta tatgaccaag gaagccgtcc aaacgcttgg ctgggaagtg 720 ctgccacatc ctccgtattc tccggacttg gcgccttcag atttccacct cttccgatcg 780 ctttcaaatg ctttacgcgg cgtttctttc aataatgacg tcgaattacg ggcttggttg 840 ggcgactttt tcgagacgcg tccgggtgac ttctaccgaa gaggcatcga aaaacttgtt 900 gaacgctggg aggaagtcgt agacaacaac ggagagtaca ttattgatta aattgttgtt 960 atttatagtt gaaaaattaa atttttaaaa agtaaaaaaa cggcgggaac t 1011 // ID Gypsy-33_AA-I repbase; DNA; INV; 7805 BP. XX AC supercont1.283; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_AA_; KW Gypsy-33_AA-LTR; Gypsy-33_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7805 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.283; Positions 20927 28731. XX CC Positions [3754-4179] - Reverse transcriptase CC Positions [5239-5715] - Integrase core CC 'CAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 827..2989 FT /product="Gypsy-33_AA-I_2p" FT /translation="MAEARSDEYTRAVERIREWYHHLRVDHLSTDELDFEL FT NIRSIVIREDPTYSRRRRSLREHLKAEKSQKAIIEVELEVDWEETLRFCRQ FT NYEEIYAGLQQNDRNLKTRGQAKLLHFGHRIVLLLKRAEGEAESFLKAMLI FT DVVRLLDHYFYSEPEASGAGLREILGDQDLLDLFRPEDDAVAPMHQNVASG FT NQAANTERPESTTSVLEALLSRINELAVELSKRDKPNGESQNRESVLPSGS FT QQLSANSQLYNEFSSFVPISLPSSSLQSQTISSASMGIPSRPVTSVTFSLP FT QSNSTSQPQTSSSFQNHFLKNPSFQPSSSAHTCNPGWGIPSSGSVPPLGQW FT HGASAVYNAPNSNPWQGNLAPGFQQAFHPWAGTAIPFSNPSSNPSFSPFNS FT VRPGNQVRHTLPVSKWTITKYDGEDQGLKLNQFLGMVHAMSVAEQVSEHEL FT FESAIHLFEGPAMQWYMTMRASNRLMNWQHLVFELRRTFMHPDLDTLIKMK FT VYQRHQSRNETFLQYYHAMEELFGTMSVPLPEYEKVQILLQNMRLDYKRQL FT NFLPISDLPTLVSAGQKIDAVNFSVYNKVFGPERSVNAVESVDSDTKSKKG FT FSKDKQNKPSTTAQQFPPSCNTNTYPPQSRQQQNSSQTGTNPSRVPTRPPI FT TLEEIIDSHQPLSGRHCFNCGKFGHRMDRCDLPKGVLCANCGFRGYPTNNC FT PYCIKNAIAASQNRRPLNPNF" FT CDS 3301..6093 FT /product="Gypsy-33_AA-I_1p" FT /translation="MVKVVPTLVVPYLAINCICGMDFWKAFHIQPTITGCA FT MVENIGDDSNLELPSSSSVLTASEQQTIDHVKTLFIAARPGKLSTTPVAEH FT KIEIAEEWRKKPPVRQFPYVMSPKTQGLVSVELQRLLDAGIIEPSNSDWSL FT NCVPVIKPHKVRLCLDARKINERTVRDAYPLPHPCRILGQLPKAKYLSTID FT LSEAFLQVPLEKSSRKYTAFSVQGKGLFQYTRMPFGLVNSPATLARLMDRV FT LGHGVLEPYVFVYLDDIVVVTETFEHHVQLLREIARRLNRANLMINIEKSQ FT FGVPEIHFLGYLLSPEGLRGNPDKIRPIVEYERPTTITKLRRFLGMANYYR FT RFIPNFSKTCEPLSDLLKSKTKIIGWNEAAERAFCLIKEQLISTPILGFPD FT FSKEFVIQTDASDVAVAGVLTQQQEDGERVISYFSRKLTTPQKNYHAAEKE FT ALAALLSIEAFRGYIEGYHFTLVTDSSALTHILKTKWKVGSRCSRWSLDFQ FT QYDMTVVHRKGKENVVPDALSRSIAVVQVSSTSPWYTSLMEKVKNNPDDYV FT DFKVEDGRLFKFIVAKNIPFDSRFEWKQVLSTDEIPEILKSVHDDQFHPGY FT EKTLARVRQRYYWPKMANDIKVYIQACSTCKEVKPSYVQTTPEMGKMRCAS FT RPWQIISLDFVGPLPRSRRGNQHILVICDYFSKWVMIHPVKKLDSSTMCVI FT LKNQWFYRNSVPETIITDNGSTFVSKEFKELLDHFKITHWLNSRYHSQSNP FT VERVNRTINAAIRTYVQEDQRAWDTKIPEIELMLNTSVHSSTGFTPFFITH FT GAELAEQGSDHRLARHGKELSVEEREERQKQMFTKIHEIVRQNLEKAHNVS FT RDRYNLRNRTFAKSFSVGQLVYRKNMKQSSAVDNYNAKLGRQYLPCKIKAK FT IGSSSYELEDLTGKNLGIWPAVHLKPG" XX SQ Sequence 7805 BP; 2245 A; 1712 C; 1631 G; 2217 T; 0 other; attggcgccc aactaaaaag taaaaagaaa aggtttccat tttattctta aagtggaacc 60 cttttcaaga aagctccggg aggaattgga atccattatt cccaattgtt ctttaaagaa 120 aagagattga attgtatcga aatagtattg ttttcagtga aagtgaaaat ctattggaag 180 tgttcggatg atgagaattt ctggtattgc tattatagtt cgttcttttg aatcagcttt 240 agaattttaa ccttttgttc tagataaagt gcctgaatca aattagtgaa ttagcagatt 300 aaaacaaata tttagtgact tagtttcatt tttcattaca cagtgcttca ataatttttt 360 tttcacaatt attttttttc gtgatttatt tatttttata tttagtgttt aatttagtgg 420 atgttagaat aagctagttc ttgttctctt tccattcctt tttgcattta ttttttttgt 480 aaatttatta gctatcttga tttatgaata cctaatttta gctacttttc cgcataattt 540 aaagcgaata ttagcttttt agtttaatta ggtgttattt gggtgaactt taaaattttt 600 attatgcttt ttttcctatt gtagcatatt tttaaggaat aaaagctctc ttatataatt 660 tagttaatac tttagataat tatctttatc tttcattatt atttttttaa atttgtttgt 720 ttttgttatt ttttattttt atattacaat ttatttcttt agttgataat tttcttgaat 780 atttttcttg attttttttc tattttaata acctttttgt ttgaaaatgg ctgaagcacg 840 tagcgatgag tacacacgcg cagtagaacg aattcgtgaa tggtatcacc atttaagggt 900 ggaccatttg tcgaccgatg agcttgattt tgagctcaac atccgctcaa ttgtaattag 960 ggaagatcca acctattcgc gaagacggag gagcctacgt gagcatttaa aggctgaaaa 1020 atcgcaaaaa gcgattattg aggtagaact tgaggttgat tgggaggaaa cacttcggtt 1080 ttgccgacaa aattacgagg aaatttatgc gggcctccag cagaatgatc gtaacctaaa 1140 aacaagaggt caagcaaaat tgttacattt cggtcatcgg attgttcttt tgttgaagcg 1200 tgcagaaggg gaagcagaaa gttttttgaa ggcgatgctt attgacgtgg tgagactatt 1260 agatcactat ttttattcag aacctgaggc atcgggagcg ggtcttcgag agatactggg 1320 ggatcaagac ttattagatc tgtttcgccc tgaagatgat gctgtggccc caatgcacca 1380 aaatgtggca agtgggaacc aagcagcgaa cacggagcgc ccagaaagca ccacttcggt 1440 gttggaagct ttgctgtctc gtataaacga gcttgcggtg gaattgtcaa aacgggacaa 1500 gcccaatgga gagtcccaga atagagaatc tgtgcttcca agtggaagcc aacagttgtc 1560 ggcgaactca cagctttata atgagttctc ttcatttgtt cccatttctc tcccctcaag 1620 ttcactgcaa tctcaaacga tcagctcggc tagcatgggt atcccttcaa ggccggtcac 1680 tagcgtgact ttttcactac cgcagagcaa ctcaacgtca cagcctcaaa cttcttcatc 1740 gtttcaaaat cactttttga aaaatccctc gtttcaaccg tctagcagtg cacacacttg 1800 caaccctgga tggggaattc catcttcagg aagtgtaccg cctctaggtc aatggcatgg 1860 tgcttccgca gtttacaatg cgccgaattc taatccttgg caaggaaacc tagctccagg 1920 ctttcaacag gcgttccatc catgggccgg aactgcgatt ccattttcta atcctagttc 1980 taatccatca ttctctccct tcaattccgt tcgtccaggt aatcaggtgc gccacacctt 2040 acctgtgtct aagtggacga taacgaagta tgatggtgaa gaccagggtt tgaagttgaa 2100 tcagttcttg ggaatggttc atgctatgtc ggtggctgaa caagtttcag agcacgaatt 2160 attcgaatcc gcgattcacc tttttgaggg gccggccatg caatggtata tgaccatgcg 2220 cgcctcgaat cgtctcatga actggcagca tttagtattc gagctcagga ggacattcat 2280 gcatcctgat ttagatacat tgataaaaat gaaagtttat cagcgccacc agtctagaaa 2340 cgaaactttc ctgcaatact accatgccat ggaggaactt tttggaacta tgagtgttcc 2400 attgccagaa tatgaaaaag tacaaattct gctgcaaaat atgaggcttg actataagag 2460 acagctcaac tttctcccga tatccgacct cccgaccctc gtgtccgccg gtcagaaaat 2520 cgatgctgtg aatttctcgg tttacaacaa ggtttttgga ccagaaagat cggtgaacgc 2580 tgttgagtca gtagattccg acacaaagtc gaagaagggg ttttcgaaag ataaacaaaa 2640 taaaccttcg acgacagcac agcagttccc acctagttgc aacaccaata cttatcctcc 2700 tcaatctcgt cagcaacaga acagttccca aaccggaacc aatccctctc gtgtcccaac 2760 gcgtccacca atcactttgg aagaaattat cgattcccat cagcccctat ctggccgaca 2820 ctgctttaat tgcggtaagt tcgggcatcg aatggatcgt tgtgatcttc caaagggggt 2880 cctttgcgcg aactgtggat ttcgagggta tccaaccaac aactgcccct attgcataaa 2940 aaacgcaata gcagcgagcc aaaaccgccg cccgctgaat ccgaacttct aggcgtattt 3000 cccccaccgt gttttgaacc ggctacgagg aatctctatc cggtatctcc cgcttcgttt 3060 acgatttcat atccatttcc taacgataat cgtccctaca cgtcagtgaa agtttacggt 3120 gtcaactttc gggctcttct tgacagtgga agcaacctca ccctgatcaa tgaatccgtt 3180 tacaattcca ttaaaccaaa gcgcatcttt ccgcttcaaa acccagtaaa tctgcgcact 3240 gcgagcggag aaagactcag tgttcgcggg cagatttatt taccattttt atggaatggc 3300 atggtcaagg tcgtccccac tttagtggtt ccctacctgg caatcaactg tatctgtgga 3360 atggactttt ggaaagcctt tcacatccaa cctaccatta ccggttgcgc catggttgaa 3420 aacataggag acgactcaaa tcttgaattg ccatcctcat cttctgtgct aacggcttct 3480 gagcagcaaa cgatcgatca tgttaaaaca ctgttcatcg cagctcgccc cgggaaattg 3540 tccactactc cagtggcgga gcacaaaata gaaatcgccg aggaatggcg gaaaaagcct 3600 ccagtgcgac aattcccata tgtaatgtcc cccaaaacgc aaggattagt gtcggtggag 3660 cttcagagat tgctcgatgc tgggataatc gagcccagta actcggactg gtcgttgaac 3720 tgcgtgccag ttataaaacc tcataaggtt aggctctgtt tagatgcgcg taagattaac 3780 gaacgtactg ttcgtgatgc ttaccccttg ccgcacccct gtcgtatact tggccaactt 3840 ccgaaggcaa aatatctttc gacaatagat ttgtcggaag ccttcctaca agttccgttg 3900 gaaaaatcat cacggaaata cacggccttc agcgtgcagg gcaaggggtt gttccagtac 3960 acaaggatgc catttggtct agtgaacagt cccgctactc tagcaagatt aatggatcgg 4020 gtacttggcc atggtgtact ggaaccctat gtcttcgtct atctcgacga tatcgtcgtg 4080 gtgacggaga catttgagca tcacgtacag ttgcttcggg aaattgcgcg tagactgaac 4140 agagccaacc tgatgatcaa cattgagaag tctcagttcg gggtgccgga aattcacttc 4200 ctaggatatc ttcttagtcc agagggcctt cgtggaaatc ccgataaaat tcgaccaata 4260 gtcgagtacg agaggcccac tacgattacc aagctaagaa gattcctagg aatggcgaat 4320 tattaccggc ggtttatccc gaactttagc aagacttgcg aaccactgtc ggatctcctc 4380 aaatccaaaa ccaaaataat tggttggaac gaggccgcgg aaagagcctt ctgcttgata 4440 aaggagcagc tgatcagcac tcctattcta gggttccctg acttctccaa agagttcgtc 4500 attcagacgg acgcaagcga cgtggcggtt gctggagttc tcacacagca acaagaggat 4560 ggtgagaggg ttatttctta tttttcccga aagctcacca ctcctcaaaa gaattaccac 4620 gccgctgaga aggaggccct agccgcgctg ctctcaatcg aagctttccg ggggtatatt 4680 gaaggatacc acttcacctt agtgacggat tcgtcggcac tgacccacat cctcaaaaca 4740 aagtggaaag taggatcacg atgcagtcga tggagcttgg actttcagca atacgacatg 4800 acagtagtac atcggaaagg caaagaaaat gtcgtcccgg atgctctgtc acggagcatt 4860 gctgtggttc aagtttcatc gacttcaccg tggtatacct ccctgatgga gaaggtaaag 4920 aacaaccccg acgactacgt cgacttcaaa gtcgaagatg gtaggctttt taaattcata 4980 gtggccaaaa atataccatt tgactcgcgg ttcgaatgga agcaagtcct ttccaccgac 5040 gagattcccg aaatcttgaa atcggttcac gatgatcaat ttcatcctgg ctacgaaaaa 5100 acgctagctc gcgttcgtca acgctactat tggccaaaaa tggccaatga cattaaagtc 5160 tacatacagg cttgctccac ctgcaaagag gtcaagccgt cgtatgtgca gacgacccca 5220 gaaatgggca aaatgcgctg tgcatctcga ccatggcaga taatttccct ggactttgta 5280 ggtcctctgc cacgtagtcg gaggggaaac cagcatattc tggtaatatg cgattatttt 5340 tccaaatggg taatgatcca tccggtgaag aagctggata gttcaaccat gtgtgtgatt 5400 ttgaaaaatc agtggttcta caggaactct gtccctgaaa cgataataac ggacaatggt 5460 agcacattcg tttccaaaga gttcaaagaa ctgttggacc acttcaaaat cacacactgg 5520 ttgaactccc gctatcattc ccaatcaaac ccagtggagc gtgtcaatag gacaataaat 5580 gcggcaatcc gcacatacgt ccaagaggac cagcgcgcat gggacacgaa aatccctgaa 5640 attgagctga tgctcaatac aagtgtccat tcctccactg ggttcacacc ctttttcatt 5700 acgcacggcg cggagctagc ggagcaggga tccgatcacc gtctcgcccg tcatggcaaa 5760 gaactatctg tggaagagag ggaagaaagg cagaagcaaa tgttcaccaa gattcacgaa 5820 attgtccgtc aaaacctcga aaaagcccat aacgtgtctc gggaccgtta caatttgcgg 5880 aaccgtacgt tcgcaaaatc cttctcggta gggcaactag tctaccgaaa gaacatgaaa 5940 caatcgtcgg ccgtcgacaa ctacaatgca aagttggggc gacaatactt accgtgcaaa 6000 atcaaagcta aaataggctc ttcttcctac gagctggaag atctgaccgg gaaaaactta 6060 ggcatttggc cagctgttca cctgaaaccc gggtgatacg gacaatacaa aatcttgggg 6120 aacatttgca ctgctccctc tcttcaccgc ttctgcatca aatcacctga aaagacaatt 6180 actaccgaag aacattcaat gcttacccgt ttatttacat tatttcattc ttaattggtt 6240 tcaattcatg cctgccctgc cctcgtcgtc gtcagataaa ctgttgttgt tttgatttct 6300 cctcctcatt ctcattattt tcatcagctc cgttttctca ctctctgttt acattttctg 6360 ggagatttta ccagagtcaa attcaggggg ttgtttacca gcaaaataaa agctgcccaa 6420 ggtagttaca ttcctacctc tgaaatcgta acgctcgagg ggtctattgc gacgaagcta 6480 gacccccaaa aataattagg atagggatcc ctaaaattac aaataatatg tcgtagttta 6540 aataataaaa tgcgtgtgta cgtaaaaaaa tgttgttcct ggtggatctt gttttagcca 6600 gacaactgta tgaatgtaac gagtgaatga atgaatttgt atgaatgttg ggggaataaa 6660 tggatgtgtg agtgaagaat attttttttc ccccgaatca atttttcggt gattttcgaa 6720 aacaaaatga agaggtttca aaatgtttat tttctcttct aattgtaagc tagttttaag 6780 ggtagtagta gctagtttta aaaataaagt ttaatttcta atgttgatgg aaagaggatg 6840 gacaagatcc gatttattct gacgaaccat ttttctataa ggctgaattt tcctatttca 6900 tatactattt cgatacaaat caaacattta gctcctattt ccgcttatga taattaggga 6960 gtatagtggt caggctcgcg ctgcagtttt atgtaatttg aaggaaaaga gcctccacta 7020 gtgatcaatc attgcttcaa attataactt aaactgcagg ggtagagagc cactgtagga 7080 gaactaactt tcagggaatc tatcacgaat ttaagaactg attcgtccat ttcttcaatt 7140 cgatcatgga tcgtgaaagt tagcctctcc cggtaattcc ttcaatcttg gagatgacaa 7200 ctaatctcga aaagaagagc tcccattgag gaaacaatca cttttccaga ttgggtgtca 7260 cctacaagag aaaaaggttt accgctacct aattatccta atatgaggtg ggaatctact 7320 ggcagccaga ctctatggct gaaagttacg tgatttggtt ggaaagagct cctctctaca 7380 cggaaaaatc atccaattaa atcaaattaa cagtcagcta aaagagagct gccaagacaa 7440 gaactcctaa atcgaccctc tcctagtgga cagactctat tattaagtgt tacgtgattt 7500 tgtgaaatag agctcctctc catggaaaaa tcatttcaca aaatcaatct aacgcttaac 7560 aagaggaggg ccacacttac cccatacttc ctacttcctg aacgacttcc ttcccagtgg 7620 acagtctcta ttattgaatg tttatcgatt tagcgggaaa gagatcctct gtatggaaaa 7680 atcattccgt taaatcaaag aacattcaat aagaggaggg ccaaaaaaaa aatccttaca 7740 accccaaccc catctccacc taccaaaaaa aaacaaggtt tttttttatc ggagcgaggg 7800 atgaa 7805 // ID Gypsy3-SM_I repbase; DNA; INV; 4093 BP. XX AC Contig139; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3-SM_I; KW Interspersed repeat; LG_I; internal portion. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-4093 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 758-758 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1040..4090 FT /product="Gypsy3-SM_I_1p" FT /translation="MGYPQGAEKWENVGRVSEKVYRFLKREMFEEEVERIG FT TIETEQVVEILYNRMQGRPSITVRMNGENFDCLLDTGARINVMSVNCFNKL FT RGQQLTKSDDKLRCANESTIETIGKTKVQVTIGNVSKEVIFIVAEKVTPDV FT IGGIELQETFGFRLLKIKDIEASEKDKNYICNIEAKFGRKIKDEERLIRAL FT EQLKINEDSKLKEIITNSGNVFMADKWDVGRTHLVKHKIVTRGEPINIKPY FT RQPLNLESKIEEAIQNLYENGIIRKCNSPWNTPLICVWKKEKKDIRLCLDF FT RQLNAVTERQAFPMPNIDEMLDLLGGSVFFSSIDLGNAYYQVELEEESQEK FT TAFSTKNGQYCFNRMPFGIAAAPGTFQEMMTKVLGKIKGAMVYLDDILIYS FT SSKEKHYTILGEVLKAIEEAGLRINPEKCQIIKEEIRFLGHIISKEGVQTD FT PSKIQAIQNFGKPNCVKKLRSFLGICNYYRRFISDYAKKARMLEQLCSGPS FT NKKLEWSEGTNSVFEGLKLALTTAPILCFPDLKKEFILDTDASFDTIGAVL FT SQRDKDGKERVIAYGSHSMTSHEKGYCITRKELLAVYYFCIHFKHYLYGRR FT FTLRTDHKAITFMMTTKKPITSQFQTWINYLSSLDIRVEYRKGINHQNADM FT VSRTKCDTCVQCQMAHEEAKKGKIKTRLLDSIREEGRSNIQHGIVEEVRKK FT TMIPENELQETIKEIHRLLCHAGVEKIADYMKDRYVGKHLWSKIQEIVRSC FT EICQKTKALTITTKEPVKRQDSKEMFEIIFVDICGPLAETKGRKKYILGII FT DQYSKYQVLTAITKQDEETIKKTILEKWILRFGCPKEIRVDCGKAFESGMM FT REFTKMLDIKLCFSSPYHHQTNGQIERQFRTIRDLINATLQDRKKTDWAEI FT LPEVEFTLNATKQKTTGKSPAEIVFGKKISREKWYGSKEIPIKEELEEQTR FT RKFNVGEEVLVKVETRHKGQDRYEGPYKVIEKVHDRRYILRNEDGKRIERN FT VEKLKNFLRRG" XX SQ Sequence 4093 BP; 1656 A; 564 C; 989 G; 884 T; 0 other; cggacgcaat caccgaaaga ttgctaccaa aggtattctg ccaaggggac gatttggaaa 60 agttcattaa aagatgcgaa atttatttcg acgccaaggg aatcacgcag aagaaagacc 120 aggaaagatt aacatcttgc atgattgatg aggatctcct ggaagtgtac gccaaaacac 180 tgaaaaaagc cgaaggattc caggctaggt tgcgattggc attcgatcgt gaaagagatg 240 ttttaaagga tctacaaaac ttgttgaatt acaagcggaa gggtgaagaa atagaaacgt 300 attttaaaaa ggtggaggac cttgttgata ggttgttcga aaataaagaa tttaccaaag 360 agaaattaac agcaaggatg ttaatagtat gtgcggagaa taaagaagtt gagaaagaag 420 tgatactacg gaagataaaa agcacagaag aaataaagga aacactaact ctgatggaca 480 aagtagaaaa gcaaagagag cagataaaca gtatgaaaag ttacgccaga gtggtctatc 540 aggaaccaca gggaagagta caatacaaga agaaagaaca gagagaagaa agtatccgaa 600 gagaaatgcc agaatgctgg ttatgccaca aaataggaca cacgaaaatt gattgtccaa 660 taaaaggtaa aatagaatgc tggacatgtc atagaagcgg acatataagt agaaactgtc 720 cagataaaaa agctccaaga tgttttgggt gtgggaaaga aggtcatata agaagattgt 780 gtcaggaaat acgctgtgaa agatgtagca gaaatggtca tagatccgag gagtgttata 840 caaagatgag gtacgggcaa gcgacggaga gacaaagatt cacgggaaac agatacagtc 900 gaataaatgg tatcgaggaa gaagaagaaa gggaatcgtg tgtagagaca aatgaagata 960 attttaggaa gacgtaccca aacaacaggg ctccaccagt agaggagttg gttggagcca 1020 tattctaaca gaaaatttga tgggatatcc tcaaggggca gaaaagtggg agaatgttgg 1080 aagggtaagc gagaaagtat acaggttcct aaagagagaa atgtttgaag aggaagtgga 1140 gagaattgga accatcgaaa cggagcaggt tgttgaaatt ttgtataaca gaatgcaagg 1200 aaggcctagt ataacagtaa ggatgaatgg ggaaaatttc gactgtttat tagacactgg 1260 ggcaagaatc aatgtaatga gtgtgaactg tttcaacaag ctacgaggac aacaattgac 1320 gaaaagtgac gataagttac gatgcgcaaa cgagagtacg atcgagacaa taggaaaaac 1380 gaaggtgcaa gtaaccattg gaaatgtttc gaaggaagtt atattcattg tggcggagaa 1440 agtgacacca gacgtgattg gaggaatcga attacaagaa acatttggat ttagactgtt 1500 aaaaattaaa gatatcgagg ctagtgaaaa agacaagaac tatatttgta acattgaagc 1560 aaaatttggc agaaaaataa aggatgaaga acgattgata cgagcgttgg agcagttaaa 1620 gatcaatgaa gatagcaaat taaaggaaat tataacaaac agtggaaacg tgtttatggc 1680 agataaatgg gatgtgggtc gcacacattt ggtaaaacac aagatagtta caagaggtga 1740 gccgataaac ataaaaccat atcgccaacc actaaatttg gaatcaaaga ttgaagaagc 1800 aatacaaaat ttgtacgaaa atggaattat acggaaatgt aactctccat ggaatacacc 1860 gctaatatgt gtatggaaaa aagagaaaaa ggatattaga ctatgcttgg actttagaca 1920 gctgaacgcg gtaacagaga ggcaggcatt tccaatgcca aacatagacg agatgttgga 1980 tttgttagga ggatccgtct ttttcagctc aatcgatcta ggtaatgcat attaccaagt 2040 ggaactagaa gaagaatcac aggagaaaac agctttctca acaaagaacg gacaatattg 2100 tttcaacaga atgccgtttg gaattgcagc agcaccaggt acgttccaag aaatgatgac 2160 aaaagtatta ggcaaaataa agggagcaat ggtgtatttg gacgacatct tgatttattc 2220 aagcagtaaa gagaaacact atacaatact cggagaagtg ttaaaggcga ttgaggaagc 2280 aggcctaagg attaatccag aaaaatgcca aataataaag gaagaaatca gattcttagg 2340 acacataatc agcaaggaag gagtacagac agatccatct aaaattcagg ccatacagaa 2400 ctttgggaag cctaactgcg taaagaaact ccgaagcttt ttaggtattt gcaattatta 2460 tagaagattc atcagtgatt atgcaaagaa ggcaaggatg ttagaacaat tatgtagcgg 2520 accaagtaac aaaaaattag aatggagtga aggaacgaat agcgtgtttg agggattgaa 2580 attagcactc acgacggcgc caatcttatg ttttccggat ctgaaaaaag agttcatttt 2640 agataccgac gctagtttcg ataccattgg agcagtactt tcacaaagag ataaagatgg 2700 aaaggaaaga gtaattgcgt atggatcaca ctcgatgacg agccacgaga aaggatattg 2760 tattacaaga aaggaactat tggcggtata ttatttttgc attcatttta agcactatct 2820 ctacggaaga agattcacgc tcaggacaga ccacaaggca atcacgttta tgatgacaac 2880 caagaagccg attacttcac aattccaaac atggatcaat tatttgagca gtttagatat 2940 aagagtagaa tacagaaagg gtattaatca ccaaaatgca gacatggtgt cgagaactaa 3000 atgtgacacg tgtgttcaat gtcaaatggc ccacgaagaa gctaagaaag gaaagataaa 3060 aacaagatta ctagactcga tcagagaaga gggaagaagc aacatccaac atggaattgt 3120 agaagaagta cgaaagaaaa cgatgatacc tgaaaatgag ttacaagaaa caataaagga 3180 aatacataga ctattgtgtc atgctggagt cgagaagata gccgattata tgaaagatag 3240 atatgttgga aaacacctgt ggagtaagat tcaggagatt gttcgaagtt gtgagatttg 3300 tcaaaaaacc aaggctttaa caataacaac gaaagaacca gtaaaaagac aagattcgaa 3360 ggaaatgttc gaaatcatat tcgtagatat atgtggaccg ttggcagaaa caaaaggacg 3420 taagaagtat atattgggca taatagatca gtatagtaag tatcaagtct tgacagcgat 3480 aacaaaacaa gacgaagaaa caataaagaa aacgatttta gaaaagtgga ttttaagatt 3540 tggatgccca aaagagataa gagttgattg cggaaaggcg tttgaatcag gaatgatgag 3600 agaattcaca aagatgttag atattaaatt atgtttttcg agtccatacc atcaccaaac 3660 aaacggtcag atagaaagac aattcaggac aatacgggat ttgataaatg ctactttaca 3720 ggatagaaag aagacggatt gggcagagat attaccagaa gtcgaattca ctttaaatgc 3780 aaccaaacaa aaaacgacag ggaaaagccc ggcagagata gtttttggaa aaaagattag 3840 cagggaaaaa tggtacgggt ccaaagaaat accgataaag gaagaactag aagaacaaac 3900 aaggagaaaa ttcaatgtag gagaagaagt attagtaaaa gtagaaacac ggcacaaagg 3960 ccaggacagg tatgaaggtc cgtacaaagt gatagagaaa gtgcatgaca gacgatacat 4020 cttaagaaat gaagatggaa aaaggatcga aagaaatgtg gaaaaactta aaaatttctt 4080 aagaaggggg ata 4093 // ID Gypsy-65_AA-I repbase; DNA; INV; 4450 BP. XX AC supercont1.236; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-65_AA_; KW Gypsy-65_AA-LTR; Gypsy-65_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4450 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.236; Positions 1368227 1363778. XX CC Positions [1816-2355] - Reverse transcriptase CC Positions [3433-3909] - Integrase core CC 'CTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 184..4098 FT /product="Gypsy-65_AA-I_1p" FT /translation="MSTSEEYVSEGDHGDGNKNISSSSERNDKFVTLRKKH FT RMSRQAEQDALALRAELDEKVAEIERLQQRLTIATRATASPQDRIAVRRPD FT FRELRELVSRFNPKEATCLSAQEWIQEIESTAAHYDWDDATKLHCARLNLE FT GSSKLWWAGVQNEVNTWALFSQKLVRAYPSARDPIYYHNQMTKRRKMRDET FT VEEYVYSQVALGKRAGLSEAVIVKYTIAGLGEFVSKCRVQLGGKIETIDEL FT ISQLKWMEGMVDTTPELTTTRVTEKKREKVVNCYRCNTPGHKAVACPNSPN FT IAGRCCFNCGVSGHMTKNCPTPKPRQGPSNMQVVDEENNFVKVVEVGGIKL FT NALVDSGCKVSTIQCRFADKVGDSEVASTTLVGFGGKKVNVDRVVRSCVKL FT DDVEISVKLNVVPNWVQSTAIILGRDMLNQEGVVMVHRNGRVEFRRDSDAG FT QARMVEVQSRKYESMYTIDSSVQCSEIIADDLNVEGPSEEILKVVNQYRSC FT FAKNMNELGVAKGTEMKIVLSDQEPVYVKPHRMEYAREAALSGIVEELIDA FT GIVEESNSPYNSRVVMVPKKDGSFRMAVDYRLLNKKTVKDRYPMPDIEWCL FT NKLSGAEVFITVDLYSGYYQIPVAEDSQECTAFSTRDGHYHFLRMPFGLAN FT GCSVFQRAMNQLTGKLRKEGIVAYIDDLVIGGKSVEEVLAKFEKLLEVLQE FT SGFTINLKKSHFFKTSVEFLGFEVSKNGIRPGAMKTRAVEEFPTPESVLQV FT QQFLGLAGFFRRFIPRFSIVAAPLFTLLKKDTKFEWKSEQMIAFEKLKKSL FT SERPLLVLFDPKCDIELHTDASKDGLAGILLMRTEEDWRPISYYSKRTSEA FT ERKYHSYELEVLAVVASVERFRHYLIGTFFVVRTDCSAVRDTYSKREMNAR FT IGRWFLKLTEYDFRLEHRPGSSMRHVDALSRNAIEDEKSVDQVLCDMLSIE FT VSNVDFLVALQRQDARLCDIIDVMGRDPLSDEERQIQQNYRLEQHRLMRVV FT DGVKCFVVPTRVRWRLVHTYHDEMGHFAEEKVLNRLREKFWFPKMRKYIRS FT YIEACPQCAFHKSKGGRPEGYLHCVPKEPVPFSTVHMDHMGPFVRSARGNS FT YVVLLICGFSKFVVIRAVRSTKTGPVLNFLGDVTAMFGTPSRIVTDRGTAF FT TAKIFEEYCRVNAITHIKTAVGSPRANGQVERANRTVLNSIRCMVGEHNMR FT WDEQVPKIQWSVNNMPNSTTNLSPSALVLSYKPRDIHQNTIVLALHDEVNN FT VLESPHERKQQAMQAIERRQRLQKQYFDTRRLSLQTPIRSVIWC" XX SQ Sequence 4450 BP; 1323 A; 814 C; 1236 G; 1077 T; 0 other; agctgtagac aggattcaca gccgcgcgtg gtacgattcg ggtttcgatc gcggacaaag 60 gttctaggac agtattgttc tgggcacaat agtgtcagac aaaagaacca cgtgaaaatt 120 tcgatacaca tggctgtcgc gggtatctaa aagtgaccgg taattttgag tgcaccatat 180 cgtatgtcca cttctgagga atacgtgagt gaaggtgacc acggagacgg taataagaac 240 atatcgagct ccagtgaacg gaacgataag ttcgtaactt tgcgaaaaaa gcatagaatg 300 agtcgacaag ctgaacaaga tgcgttggca ctgagggcag agttggacga aaaagtggct 360 gaaattgagc gcttacaaca acggctcact attgccacta gggccacagc aagccctcaa 420 gatcgcatag cagtacggcg tccagatttc cgcgagttga gagagctggt cagtaggttc 480 aatccaaaag aagctacatg tctttcagcg caagaatgga ttcaggaaat cgagtctacg 540 gccgcgcact atgactggga tgacgcgacc aagttgcatt gtgctcgtct gaatctagaa 600 ggatcgtcta agttgtggtg ggctggtgta cagaacgaag tcaacacatg ggcgctattt 660 tcacaaaagt tggttcgagc gtatccatca gcacgagatc ctatctacta ccacaaccag 720 atgactaaac gtcgaaaaat gagagatgaa accgtagaag aatacgttta ctctcaagtt 780 gccctcggaa aacgtgctgg gttatctgaa gctgtgatag tgaaatacac gatcgctgga 840 ttgggagaat tcgtgtcgaa gtgccgagtg cagctgggag ggaaaattga aacaattgat 900 gagttgattt ctcagttgaa gtggatggag ggcatggtag atacaacgcc ggagttaacc 960 actacacgag taaccgaaaa gaaaagagaa aaagtagtta actgttatcg gtgtaatacg 1020 cccggacata aggccgtagc ttgtccgaac agtccgaaca tagctgggag gtgctgtttt 1080 aattgtggtg ttagtgggca tatgacgaaa aactgtccga cccctaagcc aaggcaggga 1140 ccctcgaata tgcaagtggt agatgaagaa aataattttg tgaaagtggt cgaggtaggt 1200 ggaatcaaac tgaatgctct ggtcgattcg ggatgtaagg tatccaccat ccaatgtcgc 1260 ttcgcggata aagtcggtga tagcgaggtg gccagtacaa cgttggtcgg cttcggaggc 1320 aaaaaagtga acgttgacag ggtggtgaga tcgtgtgtaa agcttgatga tgtggaaata 1380 tcagtgaagc tcaacgtggt tcccaactgg gttcagagca ctgcgattat tttgggacgt 1440 gatatgctca accaggaggg cgtagtgatg gtgcatcgca acggaagagt agaatttcgt 1500 cgtgatagcg atgctgggca agctagaatg gtagaggtac agtctcgaaa gtatgagtcg 1560 atgtacacca ttgacagtag tgtgcagtgt tcagaaatta tagctgatga tttgaacgtt 1620 gaaggaccaa gtgaagaaat tctgaaggta gtaaaccagt accgatcctg cttcgcgaaa 1680 aatatgaacg agttgggggt cgcgaagggt accgaaatga agattgtgct gagtgaccag 1740 gagccagtgt atgtgaagcc gcatcggatg gaatatgcga gagaagctgc gctgagtggt 1800 atagtggaag agctgataga tgctggtatt gtagaagagt cgaattctcc gtataatagc 1860 cgggttgtaa tggtgccgaa aaaggatggc tcgttccgaa tggcggtgga ttatcgccta 1920 ttaaacaaga agactgtgaa ggatcgctat cccatgccgg acatcgaatg gtgtttaaat 1980 aaactgagcg gagcagaagt gttcattacg gtggatctct attcaggata ctatcaaata 2040 cccgttgcag aagacagtca agagtgtact gcgttttcaa ctcgtgacgg gcactatcat 2100 ttccttcgga tgccattcgg gttggcaaat ggatgctcag tgttccagag agctatgaat 2160 caactgaccg gaaaactccg gaaagaagga atagtggcat atatcgatga tctggttatt 2220 ggtgggaaaa gcgtggaaga agtgttggcc aaatttgaaa aacttttgga agtacttcag 2280 gaaagtgggt tcactataaa tctcaagaag tcacatttct tcaaaacatc cgtggagttt 2340 ttgggtttcg aggtgtcgaa gaatggaata cggccaggtg cgatgaagac tcgtgctgtt 2400 gaagaattcc caacgccgga gagtgtgcta caagtacagc agtttctggg attagcggga 2460 ttcttccgta gattcatccc acgattcagt attgtagcgg cacctctatt cacgctactg 2520 aaaaaggata cgaaatttga gtggaagagt gagcaaatga tcgcgtttga aaagctgaag 2580 aaatccctaa gtgagcggcc tctgttagtg ttgttcgacc cgaaatgtga tatagagctc 2640 cacacggatg cgtcaaagga tggcttagcc ggaattttgt taatgcgcac agaagaggat 2700 tggcgcccaa tcagctatta cagcaaaaga acatcggaag cggaacgaaa gtaccatagc 2760 tacgagttgg aggtgttagc ggtcgtggca agtgtggaaa gattccgaca ttacctaatc 2820 gggacatttt ttgtggtgcg aaccgattgt agtgcagtga gagacaccta ctcaaaacgt 2880 gagatgaatg ctcgaattgg ccggtggttt ctgaagctga ccgagtacga ttttcgacta 2940 gaacataggc ctggcagttc tatgcgacac gttgatgctc tgagcagaaa cgcaatcgaa 3000 gatgagaaaa gtgtagatca agttctgtgt gatatgttat ctattgaggt gagcaacgta 3060 gatttccttg tggctctaca acgacaagat gctcgcttgt gtgacatcat cgatgtaatg 3120 ggaagagatc cgttgagtga tgaggagagg caaattcagc aaaattatcg attggagcag 3180 cacagattaa tgagagtagt ggatggagta aagtgtttcg tagtgcccac aagagtacgt 3240 tggcgactgg ttcacacata tcatgatgaa atgggtcact tcgccgagga gaaagtgttg 3300 aatcgattac gtgagaaatt ctggtttccg aagatgcgca agtacattcg ctcttatata 3360 gaagcgtgtc cacaatgtgc gttccacaaa agtaaagggg ggagaccaga agggtattta 3420 cactgtgtgc cgaaagaacc tgtcccattt agtacagtac acatggatca catggggccg 3480 ttcgtcagat ccgcgcgagg aaattcgtat gtagtgcttt taatctgcgg attttctaaa 3540 tttgtggtta tccgagctgt gcgttcgact aagactggac cagttttgaa ttttttgggg 3600 gatgtgacag caatgtttgg tacaccaagt cgcattgtaa cggatagagg cacagcattt 3660 actgccaaga tttttgaaga gtactgtcgg gtgaacgcaa ttacccacat caagacagct 3720 gttgggtcac cgagagcaaa cggtcaagtg gagagagcca accggaccgt gctaaattcg 3780 attcgatgta tggtgggaga gcacaacatg cgatgggatg agcaagtgcc gaagattcag 3840 tggtcagtta acaatatgcc aaactcgact actaatttat cgcctagtgc attggtgttg 3900 tcctacaagc cacgagatat acatcagaac accattgttt tggcgctaca cgacgaggtg 3960 aacaatgtat tggagagtcc acatgagcgg aaacagcagg caatgcaagc catcgagaga 4020 aggcagcgtc tacagaagca gtatttcgac acgcgacgtt taagcctcca aacacctata 4080 aggtcggtga tatggtgtta gtggagcgag accttactgc gatgggccat agtcgtaaac 4140 tcgaagccag atacaaaggc ccgtatgtgg tgaaaactgt gttggacaac gatcgttacg 4200 agctagaaga tctcccagga gtgcgtagat cgaaccgact cagaacaaca gtttatgcag 4260 cagatcgtat gaagcaatgg tgccaattag tagatgtagg cgatgaagat gatgaagttg 4320 atggagaatt aacagatagt gaagacgttt aataccaatt gaacttattg tatgaaagtt 4380 tttgaaaaaa aaaaaatatt aagaattgaa ttacacctaa tttgaaatca ggtgaatctg 4440 ggtgtccgat 4450 // ID CR1-7_NVi repbase; DNA; INV; 4435 BP. XX AC . XX DT 11-MAY-2009 (Rel. 14.05, Created) DT 11-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-7_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4435 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(5), 934-934 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 305..1297 FT /product="CR1-7_NVi_1p" FT /translation="MSSCKKQSKTYADIMANRQPIKSQRVPRIIVKSLKSE FT NKDPKEFSKKVCNLLIEKKDIQTKQVKSINDSEIEIKSMNEKSAKATIELL FT SRKLSNNYKTTIEKIESPKMKIVGVDNYNKMNLEDLEKDLNNRNFNEYDEK FT CKVLHINKNNSTKTSSIIIEVPRIIYKHIKDNQNRIFIGYQSCKVWDLINI FT KPCYNCGRFGHNPQKCRNEPCCLKCAGEHLTMQCDNSSNSTCINCFFSNSK FT YKTKFEVNHMANDTEKCAILKSKINRYIDMTDYPMRPTIPKYLGKVEIHKE FT IHKETTTVIKSPSKLLKNSQLSTSAPILPLRTKTQRDK*" FT CDS join(1297..2340,2368..3108,3032..4192) FT /product="CR1-7_NVi_2p" FT /translation="MEQHIDLIEELQKEVLQEEISYPDLKTLNKNLYKNDS FT YILHLNIRSFLNANFNKLEIFIKSLKIKPSIIVCTETWKMFNCQIFALKGY FT KLYYNNGKINQNDGVVVYIQDGISETTEVIEIGKLKILNTRITLSNNSNVQ FT LSSLYRCHDLSSLEFIANLKTYVNIKKNLKNHLIIGDFNIDIMKEDNYSQE FT LLNNMLEKGFAPGFRHTTRPKNGSGNGTCIDNIYIKTKSIDTKSFKLMIPF FT TDHYPIFLKLNKPPKPKKEIVRHYYNYTKLQRIASSKNWHIIETMHDPNSA FT TAYLINEIQDCLEKSKSLINMKISKKTPRKEWITKAIMVSCEKKEQLYKKN FT AQRTKEYYIHILYTKRLDKLIYEAKLRHDRNFFTNNSNNTKQLWKHIKNKL FT GTNSQKDTTINKIAIDKNCTIEEPIQIANYMNSFFCKIGKDLSDKIINPQG FT KKLELPPINNNTIFIKPTNNSEILQIIQNMKMKNGGIDNIPSKALKVLSPL FT IVNALTHIFNLCIDKSIWPDALKSAEVVPIFKSGSKMEITNYRPISLISNL FT AKILEKIIYNRFQSFFDKFNLLAKNQYGFRKKYRDQRCSKLYHEHNLQPKI FT NMVLEKNIGTKDALSYITNIIYSKLDKSKPIAITFLDLAKAFDTVNHAILL FT DKLYNYGIRGKAHKIITSYLCNRKQKVKIGTADSRYETVITGVPQGTILGP FT LLFIIYVNDLLVDRTESTISYADDTAIYTTDDTWTKVENKMNMYLEDISTW FT LLLNKLSLNISKTVYMTFGNYCDSVPKKNDIKIFIQGNELNRVEQYKYLGI FT NFDFNLKWDKHIEYIIKKTKYLIYILYKLSKYMQTDILKIIYYAFFHSIIT FT YGNIAWGGAYKNNLELLQRSQNRILKIINKNKFDQNNPLNLKQIFAYECVI FT YHYNELKNQHMTSNSITRNKLIKPPKCSKAVSDKNNHIRAIQVFNCLPKSN FT PNLKHLDISKNYNKKKVKKWISTHIL*" XX SQ Sequence 4435 BP; 1921 A; 626 C; 659 G; 1229 T; 0 other; agtatcagta gtaatttgtt tagtgtgtga aaacgtttat cataaaagtg attacgacaa 60 gtgtaaagac aagaaataca ttagtaatat aatatgccca gaacacgtgg agctaaacat 120 aaccttgaaa gatcaagagc ctaggctaaa tgattcagca aaaattttaa ttgcaagtat 180 caaaagtact aaagaagatc aaattaaaca taaattcaga actggaaaat ctgcatgttg 240 agaataaact attgaaacaa ctgaatgaag aattaatgga caaaaatgca ctattaaaag 300 aactatgagc tcctgtaaga aacaatcaaa aacatatgct gatataatgg caaatagaca 360 acctatcaag agtcaaagag taccaagaat aattgtcaag agtcttaaga gtgaaaacaa 420 agatccaaaa gagttttcga aaaaagtatg caatctgtta atagagaaaa aagacataca 480 aacaaaacaa gtaaaatcaa taaatgatag tgagatagaa ataaaaagta tgaatgaaaa 540 aagtgccaag gctacaatag agttactcag tagaaaacta tctaataact acaaaacaac 600 gattgaaaaa atagagtcac ctaaaatgaa aatagttgga gtggataact acaataaaat 660 gaatcttgaa gatctggaaa aggacttgaa taatagaaat tttaatgagt atgatgaaaa 720 gtgtaaagta ttacatataa ataaaaataa ttctacaaaa acatctagta taataattga 780 agtgccaaga ataatatata aacacatcaa agataatcag aatagaattt ttattggtta 840 tcaaagctgt aaagtgtggg acttaataaa catcaagcca tgctacaatt gtggaagatt 900 tgggcataat ccgcaaaaat gtagaaatga accctgttgt ctaaaatgcg caggagagca 960 tttaactatg caatgtgata attcaagtaa ttctacatgt atcaattgtt tcttcagcaa 1020 ttctaaatat aagacaaagt ttgaggttaa tcatatggcc aatgacacag agaaatgtgc 1080 aatattaaaa agtaagataa acagatacat agatatgact gattatccta tgagacctac 1140 tatacctaaa tacttaggta aggtagaaat ccacaaagaa attcacaaag aaaccactac 1200 ggtaattaaa tcgccaagta agcttctgaa gaattcgcaa ctaagcacta gtgcaccaat 1260 actacctttg aggacaaaaa cgcaacgaga taaataatgg agcaacacat cgacctcatt 1320 gaagagctac aaaaggaagt attacaggaa gaaatatcat atcctgatct gaaaactctt 1380 aataagaatt tatacaaaaa tgacagttat atcttgcatc taaatatcag aagttttcta 1440 aatgccaatt ttaataaatt ggaaattttt attaagagtt tgaagattaa acctagcata 1500 atagtatgta cagaaacgtg gaagatgttt aattgccaaa tttttgcact aaaaggttat 1560 aaattgtatt acaataatgg taaaataaat caaaatgatg gggtagtagt ttatatacag 1620 gatggtatat cagaaacaac agaagtgata gaaataggta aactcaaaat attaaacaca 1680 cgaataacgt taagtaacaa tagcaatgtt caattatcat ctttatacag atgccatgat 1740 ctctcttctc tagaattcat agcaaactta aaaacatacg tcaatataaa aaaaaatctt 1800 aaaaatcact taatcattgg tgactttaac atagatatta tgaaagaaga taattatagc 1860 caagaactat taaataacat gttagaaaag ggttttgccc caggatttag acacacgaca 1920 agacctaaga acggtagtgg aaacggaaca tgcattgaca atatctatat aaaaactaaa 1980 tctattgaca ctaaatcatt caaacttatg atcccattta ccgaccatta cccaatcttc 2040 ttaaagctga ataaaccacc taaaccaaaa aaagagattg ttcgccatta ttataattac 2100 actaaactac agaggatagc tagcagtaaa aattggcata tcatagaaac aatgcatgat 2160 cccaactccg ctacagccta cttaataaac gagatacagg attgcttaga aaaatctaaa 2220 agtctgatta atatgaaaat aagtaaaaaa actccaagaa aggagtggat cactaaagct 2280 atcatggtct cgtgcgagaa aaaagaacaa ttatacaaaa aaaatgcgca aagaaccaaa 2340 taataataat attaaagtag agtataagaa tattatattc atatactcta taccaagagg 2400 ctagataagc tgatttatga agcaaagctt aggcatgata gaaatttctt tacaaataac 2460 agtaacaata ctaaacaact ttggaaacac attaagaata agttaggaac aaactcgcaa 2520 aaggatacaa caattaataa aatagcgata gataaaaact gtacaattga agaaccgata 2580 caaattgcca actacatgaa ttcctttttt tgtaaaatag gaaaagactt aagcgacaaa 2640 attattaacc cacagggtaa aaaattagaa ttacctccca taaacaataa tacaatcttt 2700 atcaaaccaa ctaataactc agaaatactg caaattattc agaatatgaa aatgaaaaat 2760 ggtggaattg ataatattcc ctccaaagct ctgaaagtac tatcaccttt aattgttaat 2820 gcattaacac acatatttaa tctttgtata gataaatcaa tatggccgga tgccttgaaa 2880 tcagcggagg tagttcccat tttcaaatct ggtagcaaaa tggagataac taactataga 2940 ccaatatcgt taatatctaa tttagcaaaa atattagaaa aaataatata taacagattc 3000 caatccttct ttgacaaatt taatctacta gccaaaaatc aatatggttt tagaaaaaaa 3060 tatagggacc aaagatgctc taagttatat cacgaacata atttacagta aactagacaa 3120 aagcaaacca atagcaatta cctttctaga tctggctaaa gcttttgata cggtcaatca 3180 tgctatttta ctagacaaat tgtataacta cggtataagg ggtaaagcgc acaaaataat 3240 aactagttac ttatgtaata gaaaacaaaa agttaaaata ggaacagcgg acagccgtta 3300 tgagacagta attacgggag tgccacaagg gacaattctt ggtccacttt tatttataat 3360 atatgtcaat gacctgcttg tagataggac tgaaagtaca atctcatatg ctgatgatac 3420 agctatttat acaactgatg atacttggac caaagtggaa aataaaatga atatgtacct 3480 agaggatata tctacttggt tgttgcttaa taaattatct cttaatataa gtaaaactgt 3540 atatatgaca ttcggtaatt actgtgatag tgttccaaaa aagaatgata taaaaatatt 3600 tatacagggt aatgaactga acagagttga acagtataag tatctgggta ttaattttga 3660 tttcaattta aaatgggata agcatattga atatataata aagaaaacta aatatttaat 3720 atatatatta tataaattgt caaagtacat gcaaaccgat attcttaaaa ttatatacta 3780 tgcatttttc cacagcataa taacctatgg aaacattgca tggggtggtg cgtacaaaaa 3840 caacctagaa ttactacaaa gatctcagaa tagaatatta aaaattataa acaaaaataa 3900 atttgatcaa aataatccat taaatcttaa acaaattttc gcttatgaat gtgttatcta 3960 tcactataat gaacttaaga atcagcatat gacatcaaat agtataacta gaaacaaact 4020 tattaaaccg ccaaaatgct ctaaagctgt tagtgataag aataatcaca ttagagcgat 4080 acaagtattt aattgtctac ctaaatctaa tccaaatcta aaacatctag atatctcgaa 4140 aaattataat aagaaaaagg taaagaaatg gatcagtact catatattgt aaaataaaaa 4200 aaaaaaaaaa aaaaaaaaga ggtgcaatct tagtaagatt ttcattttgt tagtctttta 4260 gtttgtaagt tagttttgta agatcatttt attaagtata tataccagac ctatgtacag 4320 gtagctgtgt ctacctctat aggatgtcag ttaacccaga tataccaagt atatcttgga 4380 gttaactata acatgtattt attactctgg atgaataaat aaataaataa ataaa 4435 // ID BEL-3_BMa-I repbase; DNA; INV; 5433 BP. XX AC AAQA01001266; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Brugia malayi genome: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_BMa_; KW BEL-3_BMa-LTR; BEL-3_BMa-I. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-5433 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Brugia malayi genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AAQA01001266; Positions 7438 2006. XX CC Positions [4349-4882] - Integrase core CC 'ATATA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 24..2087 FT /product="BEL-3_BMa-I_2p" FT /translation="MSDSIRKGSELSKSRTLELLKEASELDLTSPSQTLSR FT GELQHQYEIKLRIVKEKIAHLEYYAGLLETANQNWLDNIQSLTIATRRKEE FT ETKYANMVDDPQGILSLITRTRETIITLKIHTDEYCSVLQHLKQEEVKESF FT PKNSMIHTNQAEIKLPQLPLITFSGDPKLWRQFWCSFKAAVHEQSIPDIQK FT LNYLISCLKGDALLSVRGYDIAPENYDVIIGLLKEKYGKPFLIKKSLYNEL FT NTIKKNDREWKATIEAMERTFRQLEAIGENLEHSSIEITIENKLPAWIIDK FT LYQLKEDSTSWSVKDIRQYLTKLVLRNEEVARSQADTREKKTNKNNVRGET FT SAFATINQTKGDNTIKSKKVPGKISINGKRKPCVFCNKDHWNNECNIYPTL FT KLRMERLREITACFKCLRKGHATSDCKKGKPLCFHCKSPHNTALCPNNYKE FT LTQAVDSNKERLGSVTIANSITEKGNQWKKRTLLLCKEINVTNPNRPKIQQ FT KALILFDIGSQLSFISKDLAYRLNLQETNEMEIKIASFGDKTPKTCITTKT FT EIGVKIGNKEIIRLDVITVNHLTNELQVVNLNKKELLMIDKRNQFSNFKNK FT WKQPDILIGADHFFKFIQFDQAENLKSGFSLIHTKVGPIIAGSGYINEIHD FT SISETTSVSPNFSSQCRQYSRLRSLLETGINGYTRQNKL" FT CDS 2870..5431 FT /product="BEL-3_BMa-I_1p" FT /translation="MNVREFLSNDQEVNEKLPVQDRAEVSSIKKILGLNWY FT HKQDIIQVTLKPWFGKKLTKRIILQFVASQYDPLGFIVPILIRFKLFLQTL FT WKKNLSWDQTISEEDETTWKLLTNEWPKKLKDLPRFTTNCSKQVQFHVFTD FT ASTAAYSAALYIRTQEREVFLVFAKSRIAPIKGMSIPRLELLAILIGVRMV FT QFVLKQKELEDVITILWSDSQCALHWVHNSSRLLPTFIQNRVEEIRKAKIH FT FRYIPSEQNPADIATKGISPNKLKGYDLWWKGPKWLTENESKWPQWKYNYN FT EDYQDEEIVAHITEPIIINKHFGILDASRFSKWSRLIRTTGWVLKFIRLTM FT KREIPWLKMATKEKNILTTQDYELSEIVLFKQAQSEGVTKDEIIKWNLFYD FT NELWRYKSRVVNSEMKESNLHPIYLPRHNTITELFIQQRHEQLYHAGIAHT FT LSNLRSTIWIPKGRTEVKRILNKCMKCRRWTTKPFKLPTMADLPASRTTRS FT RPFARVGLDYLGPVNIRSDNGLTKRWVALFTCFTTRAVHLEVVETLSAESF FT LHVFRRFTARRGFPELILSDNAGQFQLIFKIIVKQQLNEFLAERKMIWKNI FT IPKAPWNGGVYERLIGLTKRAMKRAIGRKLLWERELITLVAEVESILNTRP FT LTYVNFDDCIILRPIDFILPNAHLIMPAKNKNEMDDFIPHKLDSREKLIQY FT WSNTLKALDAFWEIWKEEYLNTLKERAQREHSSPRDVVKRTPHEGEIVLLN FT EPEIPRGMWKLARIREIKTGKDGEVRSVSIELPKGKILNRPVNMLYPLEVK FT GEEIDPQPTMFNKSVQNVNEQEPIALRTRSAIRRSNQPKLSNELKHLAPSV FT G" XX SQ Sequence 5433 BP; 2088 A; 961 C; 1034 G; 1350 T; 0 other; attttggtgc cacgagccgg gacatgtccg acagtataag aaaaggaagc gaattatcga 60 agtcgagaac tctagaactt cttaaagaag caagtgaact cgatctgacc tccccgtctc 120 aaacactatc acgaggggaa cttcagcatc aatacgaaat taaattgcga atcgtcaagg 180 agaagatcgc acatctcgaa tattatgctg ggttactcga aacagctaat caaaattggc 240 tggataatat tcagagcttg acaatagcaa cgaggagaaa ggaagaggag acgaaatacg 300 ccaatatggt tgacgatcca caaggtattt taagtttaat cactagaaca agagaaacta 360 ttattaccct taaaatacat actgatgaat attgctcggt actacaacat ttaaaacaag 420 aagaggtcaa agaatcattt ccaaagaata gtatgattca cacaaatcaa gcagaaataa 480 aacttcctca attaccacta ataacgttta gtggggatcc aaaattgtgg aggcaatttt 540 ggtgtagttt caaggcagca gtccatgagc aaagcatccc agatattcaa aagttaaact 600 atctgatatc atgtcttaaa ggagatgctt tgttatctgt aagaggttat gatatagcgc 660 cagaaaacta cgatgttata ataggtttac taaaagagaa atatggtaaa cctttcctta 720 taaagaaatc actctacaac gagttgaata ccatcaaaaa aaatgaccga gagtggaaag 780 caaccataga agccatggag agaacatttc gacaactgga agctataggt gaaaatttag 840 agcattcaag catcgaaatt acaattgaaa ataaactacc agcgtggata attgataaat 900 tataccaatt gaaagaggat tcgacgagtt ggtcagttaa ggatatccgg cagtacctga 960 caaaactagt tctaagaaat gaagaagttg ctagaagtca agcagacaca agggagaaga 1020 aaacaaacaa aaataatgtt agaggcgaaa cgtcagcttt cgccacaatc aaccaaacaa 1080 agggagataa tacaatcaaa tcaaaaaagg ttccaggaaa aatttctata aatgggaaaa 1140 ggaaaccatg cgtattttgc aataaggacc actggaacaa tgaatgtaac atctatccca 1200 ctctcaaact ccggatggaa cgtttgagag aaatcactgc atgtttcaaa tgtttacgaa 1260 aaggacatgc aacaagtgat tgtaaaaaag ggaaaccttt gtgtttccat tgtaaaagtc 1320 cacacaatac agcactgtgt cccaacaact acaaggaact cacgcaagca gtggattcca 1380 acaaagaaag gttgggttca gtcacaatcg ctaactccat taccgaaaaa ggaaatcaat 1440 ggaaaaagcg aaccttactg ttatgtaagg aaataaatgt aaccaatcca aatcgcccaa 1500 aaatccaaca aaaggcttta attctattcg atattggttc tcaattatcg ttcatatcaa 1560 aggatttggc ttatcgattg aatctacaag aaactaacga aatggaaata aaaattgctt 1620 catttggtga taagacgcca aaaacgtgta tcactactaa aactgaaatc ggagtaaaaa 1680 ttggaaataa agaaataatc cgattagatg taataacagt taaccacttg acgaatgaat 1740 tgcaagtagt taatctaaac aaaaaggaat tgttgatgat agacaaacga aatcagttta 1800 gtaatttcaa aaacaaatgg aaacaaccgg acatactcat aggtgcggat cacttcttca 1860 agtttatcca gtttgaccaa gcagagaact taaaatcagg attttcattg atccatacaa 1920 aggttggacc tataatagct ggaagtgggt atataaatga aatacacgac agtatttcag 1980 aaaccacatc agtaagtccg aatttctcaa gtcaatgccg tcaatattcc agacttagat 2040 cacttttgga aactggaatt aatgggtata caagacaaaa taaactctga caatgatgaa 2100 aaagcattgc agcaatttaa acaaactatc actagggaca agggaagata tcaagtttgt 2160 tggccttgga aagattcaaa gaacaaacta agcgataatt tcggactttg ccttggaaga 2220 ttaaagtcgt taattaggag gcttcaaatg aaacctcaat tactaagtcg atacaatcaa 2280 actattgaag aacaactcaa ttccaatata attgagaaag tgtcatcgga aatgaatgaa 2340 gttggtatta ttcattatct gcctcatcac gaagtgataa caccaaataa aaccacgaca 2400 aatctgagga tagtttatga tgcatcagct cattgtaaag gaatgaaaag tttaaatgat 2460 gttttatatc ggggaccaat aacactccct gatttggttg gagttttgct acgtttccgt 2520 acaatgaaaa atgtaataat agcagacgta gaaaaggcgt tcttgcaaat agaattgcat 2580 ccaacagata gaaactgcac tcgatttctt tggttgaaag acattcgaag tccagttaac 2640 gaagaaaacg taagttgtta ccgtttccaa cgagttccat ttggaattgt atcatctcca 2700 tttttattgt ccgctacact caattatcat ttggaaactt gtaaaagcaa aatcgcactc 2760 gagttaagga aaaatctcta tgtcgataac attataatac caaccaaagg aaccaacgaa 2820 gctctttcca attataaggg aaattaaaat tatattcaat gaagcatcaa tgaatgtacg 2880 ggaatttctc tctaacgacc aagaagtcaa tgaaaaactt ccggtacaag atcgagcaga 2940 ggtaagttca attaaaaaaa ttctaggtct caattggtac cacaaacaag atatcattca 3000 agtaacgtta aaaccatggt ttggtaagaa attaactaaa agaattatat tgcaattcgt 3060 agcctcacaa tatgacccac taggattcat agttcctata ctgataagat tcaaattgtt 3120 tctccaaact ttgtggaaaa agaacctttc atgggatcaa actataagtg aggaagatga 3180 aaccacgtgg aaattactca cgaacgaatg gccaaagaaa ctaaaggatc taccaagatt 3240 cacaacaaat tgttccaagc aagtacaatt tcatgttttt actgacgcat caactgctgc 3300 atactcagca gctttataca ttcgaactca agaaagagaa gtattcttag tgtttgctaa 3360 atcccgaatt gcgccaatta aaggaatgtc aatacctcga ttagaattac tggcaatact 3420 catcggagtg agaatggtac aattcgtatt aaaacaaaaa gagcttgagg acgtgataac 3480 aattttatgg tccgattctc aatgtgcgct tcattgggtt cacaatagtt cccgtttact 3540 accaacgttc atacaaaata gggtagaaga aatacgaaag gcgaaaatcc attttagata 3600 tatcccaagt gaacaaaacc cagctgatat agctacaaaa ggaatctctc cgaataaact 3660 aaaaggatat gatctttggt ggaaaggacc aaagtggtta acagaaaacg aatcaaaatg 3720 gccacagtgg aaatacaatt acaatgagga ttaccaggat gaagaaattg tggcccacat 3780 cacagaacca attatcataa acaaacattt cggaatctta gatgctagtc gttttagcaa 3840 atggtcaaga ttaataagaa caactggttg ggttctgaaa ttcataagat taacaatgaa 3900 aagggaaatt ccttggctaa agatggcaac gaaagaaaag aatatattga caacacagga 3960 ttacgagttg tcagaaattg tgctattcaa acaagcacag tcagaaggag ttaccaaaga 4020 tgaaataatc aaatggaatt tgttctatga taacgaacta tggagataca aaagccgcgt 4080 agttaactct gaaatgaaag agagtaactt acacccaata tatctcccga gacacaacac 4140 cataactgaa ctctttattc agcaacgaca tgaacaactg tatcatgctg gaattgctca 4200 tacactctca aatctgagat cgacgatatg gatcccaaaa ggtagaacgg aagtcaaacg 4260 aatcttgaat aagtgcatga agtgcagacg ctggacaact aaaccattca aattaccaac 4320 tatggctgat cttccagcat ctcggaccac aagatcaaga ccctttgcgc gagtaggact 4380 agactattta ggtcctgtga atatcaggag tgataatggg ctaacaaaaa gatgggtagc 4440 actattcaca tgtttcacaa caagagctgt acatctagaa gtagtagaaa ctctttcagc 4500 cgaaagcttt ttacatgttt tcagaagatt tacagcaaga cgaggctttc cagaattaat 4560 cttaagcgac aacgctggtc aattccagtt aatttttaaa atcattgtga aacaacaatt 4620 aaatgaattc ctagcggaaa ggaaaatgat ctggaaaaat ataataccaa aggcgccttg 4680 gaacggagga gtatatgagc gacttattgg gttaacaaaa agggcaatga aaagagccat 4740 cggcagaaag ttattatggg aacgggaatt gattacgcta gtagcggaag tggaaagtat 4800 tttgaatact cgaccactaa cttacgtaaa ttttgatgat tgtatcatat tgcgtccaat 4860 tgactttata ctaccgaatg cgcatctaat catgcctgct aaaaacaaga atgaaatgga 4920 tgattttatc cctcataaat tggattcgag agaaaaactc attcaatact ggtcaaatac 4980 actaaaggcc ctcgatgcct tctgggaaat atggaaagag gaatatttaa atactcttaa 5040 agaaagagca caaagggaac actcttctcc gagagatgta gtaaaaagaa ctccccacga 5100 gggagagatt gtgttactaa atgaaccaga aattccccgt ggcatgtgga aactagccag 5160 aataagagaa atcaaaacag gaaaggatgg agaagtgaga agtgtctcga tagaattacc 5220 gaaaggaaaa atactcaaca gaccggtgaa tatgttatac ccactggaag tcaaaggtga 5280 ggaaattgac cctcaaccaa caatgtttaa caaatcagtg caaaatgtca atgaacaaga 5340 accaattgct ctacgaacta gaagcgctat aaggcgaagc aatcaaccaa agctgtccaa 5400 cgaactaaag catttagctc cttcggtcgg gag 5433 // ID SINE-6_CQ repbase; DNA; INV; 225 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-LTR retrotransposon from Culex quinquefasciatus - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-225 RA Jurka J.; RT "Non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 598-598 (2011). XX DR [2] (Consensus) XX CC >99% identity to consensus. Many are 100% identical. Present in CC >6,000 copies in the genome, very likely to be active. XX SQ Sequence 225 BP; 37 A; 56 C; 71 G; 61 T; 0 other; gtgggtctcg tggcgcaggg gtagcggctt cggctgccga tcccgatgat gctatgagac 60 gcgggttcga ttcccgcctt atccactgag cttctatcgg atggtgaagt aaaacgtcgg 120 tcccggtttc tcctgtctcg tcagaggcgc tggagcagaa atcccacgtt agaggaaggc 180 catgccccgg ggggcgtagt gccaatagtt tcgttttttt ttttt 225 // ID hATN-3_SM repbase; DNA; INV; 598 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hATN-3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-598 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1851-1851 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 598 BP; 164 A; 114 C; 100 G; 220 T; 0 other; caggggtggg caacctacgg cccgcgggcc gcatgcggcc cgcggcgacg ttttatgcgg 60 ccctttagac ttttcacata tttttttaaa actgcgtaaa caaataaaaa aaaatgcaaa 120 gaaatattat tttcctacat ttcattcaga aataaccttt cctctgtttt tgttcttgtt 180 ggatgttttc ttgtactagt tgtgatcccc agggcaaagg tttattttcg atatcgttac 240 aaatctttgt attattatat tattattatt atcgccgcaa tcgttataac ggtttttaaa 300 ataatgttaa attaacacta gtatctttta ctgttttcgt tgttggcgaa tgtaacgtgt 360 ctgctaaaac tttacagttt agcagctata gtcataccta gctactagta acgagttatt 420 cactagtaac tattcagtat gttatatcat tatagttttt taatttgctc aaaatgtgtt 480 gattttattg tctgatccgg cttagaattc ttgattattg aaacaaaata gcatgcggcc 540 cttcaaacta ttttaatttt catacggccc cctgcccaaa aaaggttgcc cacccctg 598 // ID BEL-625_AA-LTR repbase; DNA; INV; 377 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-625_AA_; KW Pao_Bel_Ele108; BEL-625_AA-I; BEL-625_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-377 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 377 BP; 102 A; 77 C; 96 G; 102 T; 0 other; tggctacgcg agggttaatt gagagcaatg agaagtagca aacatgttcc acaggtagcg 60 atactgttga tccctgtctg tcgtacgctt ttaagctacc cctttcgttg cgagatagca 120 catgatgaag agatggaagc aatctgatgg gagcagtcga tctctcagca gccagctgat 180 cgtgtacata ggagaataaa gttagttact agtagaccgc cgggaaatat acgcctcgtc 240 cgttcagtcc gcaataaagt tgtttttatc cgcgaatagt tgtgttaatt aattgccgga 300 ataaagtgat ttctacggag tgaaattgag gactagtttc tacggtatcc gaaatcgatc 360 cgtttcccgc gcgcgca 377 // ID Gypsy9-LTR_Dpse repbase; DNA; INV; 762 BP. XX AC Unknown_group_264; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9_Dpse; KW Gypsy9-I_Dpse; Gypsy9-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-762 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1074-1074 (2009). XX DR Genome; Unknown_group_264; Positions 11863 12624. XX SQ Sequence 762 BP; 228 A; 242 C; 151 G; 141 T; 0 other; tgtaaagaac cttaaaggtt tcgatacaat ccgtcccccg acttataatg agcgcatcgc 60 accacacccc acaaccaacg ccccgtctcc gcacgcaacc accgatcaac acgcgtcaac 120 tccataacgc tgaccaaaga catcagatca cgcggacttt ccccactcca ggccaagcca 180 tcggccaatt gcggtcttcc gggtcttcat caactgcgga aagatgacgc cgcgaaaatc 240 tgcgacgatc accaagaacg gaaaagtgca ctccgacgcg gaagtgcatt ttcaaacctg 300 cagacccagc aatacgggcc tataaaaaga cgacgacgag aaccaagaag caacttacgc 360 agaaacccgg ctcagagtac agacgttgta ccaaaactta cgcagaaaac ctggttcgga 420 gtacagacgt tgtacccggg acttacaaga aacaccggcg atcaattaaa ctaagttcat 480 ttaaagtgct aagaattata ataaagtgtc gaatcatcaa agtaacaccc gattgcgtaa 540 gttcactcca actagccgag gcaccgacac taccccaccc cgagtccacg cacgccgttt 600 cgtatgcacg acgagcgacg ttcactccag ccccgatcca cctctctacg ataggccgcc 660 ccccgacgcc gcccttaagg ttccatccat ttctacaaat ttcttgtgga ttgacccctg 720 gacgtgactg agcttacctc agcctgaaat aggtacatca ca 762 // ID DNA8-8_AP repbase; DNA; INV; 311 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-8_AP. XX NM DNA8-8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-311 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1750-1750 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 311 BP; 98 A; 70 C; 53 G; 90 T; 0 other; caggggcgga ctggcccagg gtgctataca ccaagtagca ccccgggccc ccgtttgtat 60 ttatgatccg ggcccccgat taccacagtc ggtatacggt ataatactgg tataatacgg 120 tattatacaa aataaaaata aaataacata tttcaacatc tctacaaata aacaacatgt 180 tattcttaat ttttttttta tatttactta taaaactaag ggctctttcc taatttaggt 240 aggtttgaaa aaaaattacg ggcccctaat taattttgca ccccgggcca aaaatagcca 300 gtccgcccct g 311 // ID Copia-119_AA-LTR repbase; DNA; INV; 314 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-119_AA_; KW Ty1_copia_Ele34; Copia-119_AA-I; Copia-119_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-314 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 314 BP; 70 A; 85 C; 57 G; 99 T; 3 other; tgttggacgg tgcaatgcgt acctctgcca tccgaacctt gcgctgttac acacctagat 60 ctacctatgc gtatctgccg tacgcacaca aacacttttt atattaacac tcactgacag 120 ttcgcaacat attttagtct wtgaaacgta cgcgtccgwa gtaccacgta tttcttattt 180 ttaaccgctt tkaataaagt ttctgtgttg atcgtattcc gcgagttcaa atagtttccc 240 ccgggaatag atcgtttccg aaattccgca cacgtcgtgt ttttccactg gcctccggac 300 tttgcctgcc gtca 314 // ID Kolobok-N1_HM repbase; DNA; INV; 2397 BP. XX AC . XX DT 16-JAN-2009 (Rel. 14.02, Created) DT 16-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW Kolobok-N1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2397 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 427-427 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 2397 BP; 847 A; 316 C; 351 G; 883 T; 0 other; ggtggtactt gggtgaaaaa aaagtaaaaa ttgaaaattt caaagtgaat gtttttaagt 60 aaactaagct atgcagaaaa cgcctaagta atttgtttca tttaaaataa ttcagatgat 120 acttaacttt ccattttaaa aaaggcaatc ttaaactcca aactcaactt tcgtagcaac 180 gcccatagca accaattttg atttcttgaa agcctgtaaa attatgttta aaaacctatg 240 gttggtttgc tttaaagtca ttaaattgtt ttacagaaat ataaacttca tgcaaaaagt 300 caccaattaa atacatgtgt tacatggcat ttaaattgtt taaaagttga aaagctgcaa 360 gtgaaataat atctgtttat gtttttaatc atcgtttatt tgatttagtt acatggtttt 420 ttatcttttt tttttagctt tgttatagtt aacttaattg gttttttatt tgttattatt 480 ttcttttttt attcaagtta tttaactata atgttcatta ttaaagaaca acaacaatga 540 aaaaaaaatt aggcttagtt acactttttt gttattattg tttttttatt ctttatagtt 600 gtaataataa ttaatattaa aaaaagagtc gagctatgtt agaaagaaag tctggccact 660 caaaagcatt ttctaaacgg acatttaaag gtaaccaaca cacacaaaaa aaaacacaag 720 aagaaccagt tgagattcct ccttctaact tacctatttc ttcaaataaa tctgtcatat 780 ttatcatcag gtacaacatt accttttata atcactaatg aaaccagacc aataaagttt 840 gcatcatctt cattcaccaa tgacaccagt ttgtttactt gtgtaaattc ttcaaaccca 900 actgtatatt atcgattttt atactttttt ctattgacat ttttatggaa attattaatt 960 tagttgtaac atgccctgaa tgtgacaaaa agactatgag tcgaacattt aacgtcttga 1020 gaaagcaagg cttgtctata ccattattgt tacaatgttc aagttgttgt tagtgtacaa 1080 cctttcattc aagtaaagaa tgtaaatgca aaagcagtcg tggtagacct gtttgaggtt 1140 tatgttcgtt ttgtttttga tatgagagaa attggaaaaa gttataccag tttagaaaaa 1200 ctctgtggat gaaaaaagta attagaacag tgcttttcca ttgctcttaa gcaaaatata 1260 ttattagaat atagaaggca tcaaatgtgc ccacgtactc ctgatagttg gtgtaaatat 1320 caggctgaca aaataaatgg catagcaaca tacaaacata agggaggaat taccaattgt 1380 tgtccttgaa gaaataaagc ctgttttttt taagtttaag tgatgacaat ttataaaaaa 1440 atatctttat ggcaaaactc aaaataacta ataaataaat gagtcactaa atggaatgat 1500 ttggaaacgg tttcaaaaag atatctacgt ttgtaaaacc actgttgacc taggtgtagc 1560 ttctgcagtt ataaatttta atgatagttc aactggaact ctaaaagttt taaagggcat 1620 aggtattcca ggattttatg caaatcagtt ttgtgtttca aataacaacc aaagagtaat 1680 gtcaatggaa aggaaatcga aaaaagcata ttcaaagagt tcataaaagg tttcacaaat 1740 actaatcaat gtagaagaaa tgaagtatgg ttatggttta tattagagag atttttatta 1800 ttttttattt acggtcatca atcatgaaaa tctttttgtt tttgaatatt tgaaagaaat 1860 ttagtgatat tttagtatat tattacttaa attgcattta tctcaaaatt gaattttttg 1920 aacggtgcgt catataactt ttgatttatt tgagctttac tcttgaaatt ttatatggtg 1980 gttatttttt ataagataca ggtttagaac caacattttt tgaaaaaata tttttcggtc 2040 cagagttacc caagcccaaa gttttatttt taagcttttt tataagcaag gtatttgaat 2100 tttaattacc ttagatctag ttatgtaatc aggctaattc tgctcccata cctgtaaaat 2160 tgtactataa acaaactgta aaaattttgt tatcattcta caggtggttt tttctgaatt 2220 tgacgcaccg caaagtgacc attttgagac ctttggaccc aaaaaagggt tttctgtaat 2280 tttttttctt ttttttttta aaaaaagaca tagttacctg tatataattg ttgtaccaaa 2340 atatcaaatt taaatagtaa gaattaaaga atttggattt ttcacccaag taccacc 2397 // ID Gypsy-12_TCa-LTR repbase; DNA; INV; 197 BP. XX AC chrUn_5; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_TCa_; KW Gypsy-12_TCa-I; Gypsy-12_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_5; Positions 417153 417349. XX SQ Sequence 197 BP; 49 A; 37 C; 41 G; 70 T; 0 other; tgttgtcgat atttgaaatt ttgattttca cgctttattg ctctgattgc gcgcatgagc 60 gagttgagat ccgctttacg attgcgcgca tgagtgagtt gagatccgct ttacgttaac 120 tgctcgcacg ccatgtattc cgtattttat ctaaataaat attgaaaatt cggggagatt 180 tattaaaatt ccccaca 197 // ID DNA4-2_CQ repbase; DNA; INV; 1351 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA4-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1351 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 72-72 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. ~640 bp TIRs. 4-bp TSDs. XX SQ Sequence 1351 BP; 435 A; 248 C; 244 G; 424 T; 0 other; gggtaattct ccgccaactc acacagcagt tgccccgacc cctcttcgat ttgcgtgaaa 60 ctttgtccta aggggtaact tttgtccctg atcacgaatc cgaggtccgt tttttgatat 120 ctcgtgacgg aggggcggta cgaccccttc catttttgaa catgcgaaaa aagaggtgtt 180 tttcaataat ttgcagcctg aaacggtgat gagatagaaa tttggtgtca aagggacttt 240 tatgtaaaat tagacgcccg atttgatggc gtactcagaa ttccgaaaaa acgtattttt 300 catcgaaaaa aacactaaaa aagttttaaa aattctccca ttttccgtta ctcgactgta 360 aaaaattttg gaacatgtca ttttatggga aatttaatgt acttttcgaa tctacattgt 420 cccagaaggg tcattttttc atttagaaca aaatttttca ttttaaaatt tcgtgttttt 480 tctaactttg cagggttatt ttttagagtg taacaatgtt ctacaaagtt gtagagcaga 540 caattacaaa aattttgata tatatacata aggggtttgc ttataaacat cacgagttat 600 cgcgatttta cgaaaaaaag ttttgaaaaa gttggtcgtc atcgatcatg gccgttcatg 660 gtcacccgcg acagacacgg acgacgaaac aaagagaaac gcaaaaagta actttttcaa 720 aacttttttt cgtaaaatcg cgataactcg tgatgtttat aagcaaaccc cttatgtcta 780 tatatcaaaa tttttgtaat tgtctgctct acaactttgt agaacattgt tacactctaa 840 aaaataaccc tgcaaagtta gaaaaaacac gaaattttaa aatgaaaaat tttgttctaa 900 atgaaaaaat gacccttctg ggacaatgta gattcgaaaa gtacattaaa tttcccataa 960 aatgacatgt tccaaaattt tttacagtcg agtaacggaa aatgggagaa tttttaaaac 1020 ttttttagtg tttttttcga tgaaaaatac gttttttcgg aattctgagt acgccatcaa 1080 atcgggcgtc taattttaca taaaagtccc tttgacacca aatttctatc tcatcaccgt 1140 ttcaggctgc aaattattga aaaacacctc ttttttcgca tgttcaaaaa tggaaggggt 1200 cgtaccgccc ctccgtcacg agatatcaaa aaacggacct cggattcgtg atcagggaca 1260 aaagttaccc cttaggacaa agtttcacgc aaatcgaaga ggggtcgggg caacttttcc 1320 cgatttcgtg tgagttggta gagaattacc c 1351 // ID SAT350_TG repbase; DNA; INV; 1046 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Toxoplasma gondii satellite TgSat350. XX KW SAT; Satellite; Simple Repeat; SAT350_TG; Repetitive DNA; KW tandem repeat. XX OS Toxoplasma gondii OC Eukaryota; Alveolata; Apicomplexa; Coccidia; Eucoccidiorida; OC Eimeriorina; Sarcocystidae; Toxoplasma. XX RN [1] RA Clemente M., De Miguel N., Lia V.V., Matrajt M. and Angel O.S.; RT "Structure Analysis of Two Toxoplasma gondii and Neospora caninum RT Satellite DNA Families and Evolution of Their Common Monomeric RT Sequence."; RL J. Mol. Evol 58(5), 557-567 (2004). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX CC Consensus of TgSAT350-2 and TgSAT350-4 sequences. XX SQ Sequence 1046 BP; 185 A; 239 C; 366 G; 256 T; 0 other; ggtatccttc agagagtgag tgtccggcgt gggatgcgtg ggccaatcca gttgcggaag 60 cgaacgatcc gagttcactc gtttcggaat acgccgcctg tggcatgtgg tggccccaca 120 tagtcctgcc tttttcccgt atgactctcg tcgcgtacct gcgcggatgg gtgagcggtt 180 attggttgtg tgtcgttgtg cagtctgtct gcgacattgt tgggcgtttt cttcaggcgc 240 cggaggtaag agaagaaacg gatgcgcggg cgtgttctgc gtttcgagac aactgccgca 300 ccacgtagag actactgctg atggaggtca ccgcgttgag gggttgttag ctggcggcgg 360 aggcactgtg cgggcggtat ggcattgttt cggttgcggc acaggtgctc ccagaggtcc 420 gacatcactc tggcacgaat tgtcgacgtg ttgggtgtgg cggcgccaca tggcccggcc 480 gttgtatggt atgcgactcg tcgagtgcat gcgcggatgg gtgagaggtt actggttgtg 540 tgtcgttatg cagtctgtct gggagatggt cgggcgtttt gttcaggcgc gggaggtaag 600 agaagaaaag gactccgggg gcgtgtggtg tgtctcgagg caactgcggc accacgtaga 660 gactactgct gatggggtca ccgcgatgag gtgtagttag ctggcggcgg aggcactgtg 720 cgggcgggac gaccatgttt catttgcggc acaggtgctc cccgaggtcc gacatcactc 780 gggcacgaat tgtcgacgtg ttgcgtttgg tggcgccaca gctgtcagga ggttgtaaca 840 gtctggggta cgttgtaaga tattatcgct gtatagcctg ccgcaggctg tgtcaatgtc 900 cagtgctttc acgccgtggc aacggctgta caggaaggga atacaagggt gaacgatgta 960 gcatctttta tctttggagg cgaatacgac cccgtggaga aacgacaact tgtacccgca 1020 cctcgatgac gtgttgtcgg tatcct 1046 // ID Gypsy-68_CQ-LTR repbase; DNA; INV; 212 BP. XX AC AAWU01020187; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-68_CQ_; KW Gypsy-68_CQ-I; Gypsy-68_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 516-516 (2011). XX DR GenBank; AAWU01020187; Positions 20021 19810. XX SQ Sequence 212 BP; 58 A; 55 C; 39 G; 60 T; 0 other; tgttggaccc ctaggaacaa ccctggatgc aaccctggat gagcgacact tgtcgtcatc 60 ccgttgacat ccacgaccgg aaaccggttg tcataatccg cgagctcaag aagacacacg 120 cataactttt taataaattt acttattcgc aataaatgct taattatttg tctccactcg 180 cgtgtatcat tcgttgtttg cacattgcaa ca 212 // ID Copia18-NVi_LTR repbase; DNA; INV; 203 BP. XX AC AAZX01010628; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia18-NV; KW Copia18-NVi_I; Copia18-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-203 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1149-1149 (2007). XX DR Genome; AAZX01010628; Positions 4955 4753. XX SQ Sequence 203 BP; 51 A; 62 C; 37 G; 53 T; 0 other; tgttaaagcg agtttctcac actataagaa tcgcgaacgc gagcgacctc tggtcgtagg 60 ctggcgtatg cgcctgacgt ctcctcgccg acgctctctc tctcgctcct tacctcgagc 120 cgttcggacg ctgtcactta ctaactacaa ataaatacct ctttacgata aagtgtcttc 180 tcattcaaca cacaacaata aca 203 // ID CR1-113_AAe repbase; DNA; INV; 4065 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-113_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4065 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1201-1201 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 21 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 224..1018 FT /product="CR1-113_AAe_1p" FT /translation="MESICFSCTLTVKKEDEIVCNGFCKSSFHLECVHQSI FT EIRNTIASSSQLFWMCKACSKMMANANFRQAISSTNNVLELMAEQYSITLN FT ELRKEIAHNTTKINTILQRTPSQATPQIPRSQLPSSSRKRPRLLVDAPSHH FT DNASVGTREFAPDESIPLAQKEDTFWLYLSGFDPQATTQQIEKLVKSNLNT FT DKTVNVVKLVPKGRTLEELTFVSFKVGLEQQMKEVALSVSSWQKGIIFREF FT DFSHSASSRQTFRFQPSPGDNCAQ" FT CDS 891..4004 FT /product="CR1-113_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KRLPYLYRHGRKVSSSVNSTLVTLLLRVKLFVFNHHQ FT EIIVHNNHHATNTTEILTRNDQRQTTLIRNAQQLLISSADNSTISQNPATK FT LETFTIYYQNVRGLRTKTNDLFLALSDSDYDVISFTETWLNNDVNNSELTQ FT NYTIYRCDRNANNSCFQRGGGVLIGVRNGLQSTSVCFADSDRLEQIAVRIT FT LPDFELYVCTIYLPPNSETILYEQHSACVQHLLDLAGDRGRVVVIGDYXLP FT LLCWKYDDEINSFIPVNASSEQELALIESVGCCGLHQINDFSNANGRLLDL FT VFVNEEKYVELFEPPLPLLKVDLHHKPFVLKFEINVMCDDDNISNPLDFDF FT HRCDFASINTAISEVNWNEELNECSIDEAVSRFYQRVYSILQNAVPMKKRR FT RMLGRRQPWWNNQLQRLRNRLRKARKRYFRSRDEQRKVELRVIESEYNSLH FT AQSFRCYIHRTEEKIKRDPKSFWSFVSSRKQVNGIPQRVHYNDVEADTSFD FT SANLFSSFFRSVLSNNRSPSSDAYLNSLPQHDFNVALFNFSPEDVLTKMQG FT LDESKGAGPDRLSPVFIKYCADSLAAPTSIIFNRSIAESVFPSAWKIASIT FT PIHKAGSIHNVENYRGISILSCLPKVFESLVHDSLYPHVHHIISEFQHGFV FT KRRSTTTNLMTYVSTLIDALEKRQQVDAIYIDFTKAFDRVPHVLAVNKFNK FT MGLPDWLTRWIHSYLTERSAYVRINGTSSNPFEITSGVPQGSHLGPLFFLL FT FVNDLCESIKSQKLMFADDLKFFRVVSSLVECCAIQTDIDTLLNWCLLNGM FT EVNTRKCSVITFSRVRNPANFEYKMSSEIIARVTTVKDLGVLLDSKLNFAQ FT HIASTTAKAYAVLGFIKRNAQQFEDVHCLKTLYCALVRSILEYGVLVWAPY FT HAAQSARIERIQRNFLRFALRRLPWTDPIRLPPYEHRCNLVHLTTLASRRV FT LLQRLFVFDILKHNIDCSMLLYKVNFNVPIRITRHTEFLRLPVHRTLYGQN FT NPFDVCCRRFNEVFVNFDFNVSKPMFRSRIRY" XX SQ Sequence 4065 BP; 1158 A; 914 C; 852 G; 1136 T; 5 other; gtktgtgtga ggtaaacaat tcgtwaactg tgtttttttc ctgacgctaa ttttacgttt 60 ttctctacgt tatcgtgaat ttttatgtgc gattttgttc tatctggacg catcttttat 120 cgttccaccg gcgttatctg gagttcatca agttacgtcc tcatcttgag ctattaccgt 180 caaaattgtc acaacacgga accgaagatc aacttcaaca acaatggagt caatttgctt 240 ttcatgtacg ctcaccgtga aaaaggaaga cgaaattgtg tgcaatggct tctgcaagtc 300 atcatttcat ctggagtgtg tgcatcaatc catcgaaatc cgcaacacca tagccagcag 360 ttcacagctc ttttggatgt gcaaagcctg ttcgaaaatg atggccaacg ctaactttcg 420 tcaggctatc tcatccacca acaacgtact ggaactgatg gctgagcaat attccattac 480 gctgaatgaa ttgaggaaag agatcgccca caacactaca aaaataaaca caatcttgca 540 acgcaccccg tcacaagcta ccccacaaat tccgagatcg cagcttccat catcgtcaag 600 aaaacgaccc cggcttcttg tagatgcgcc ttcccatcat gataatgcat ccgttggtac 660 cagagaattt gctccagacg aatcaatccc gttggcgcaa aaggaagata cgttttggtt 720 gtatctttcc ggcttcgacc cgcaagccac aacccaacag atcgaaaaac tcgtcaagag 780 caatctcaac acagacaaga cagtgaacgt cgtcaaattg gttccgaaag gaaggacact 840 ggaggaactt acgtttgtgt cattcaaagt aggtctagaa cagcaaatga aagaggttgc 900 cctatctgta tcgtcatggc agaaaggtat catcttccgt gaattcgact ttagtcactc 960 tgcttcttcg cgtcaaactt ttcgttttca accatcacca ggagataatt gtgcacaata 1020 atcatcacgc aacaaacacc accgaaattt tgactaggaa tgaccaacgg caaacaacac 1080 tgatccgaaa cgcacagcaa ctacttattt cttccgcaga caacagcact atatcgcaaa 1140 atccagcaac caaactcgaa actttcacga tctactacca aaatgttcgc ggactacgta 1200 ccaaaacaaa cgatttgttt ttggctctat ccgacagtga ctacgacgtt atttcattca 1260 cggaaacctg gctcaacaac gatgtgaaca actccgaact cacacaaaac tatactatct 1320 atcgctgcga tagaaacgct aacaacagct gttttcagcg tggtggtggt gtattgattg 1380 gcgtacgaaa tggactgcag agcacatctg tttgctttgc agacagtgat cggttggaac 1440 agatcgccgt tcgcattact ctgcccgatt tcgaattgta tgtctgcacg atctacctac 1500 cgcccaacag tgaaacgatt ctgtatgaac agcattctgc ttgtgttcaa cacttgctcg 1560 atcttgctgg tgatcgcggg cgtgtagtag tcattggtga ttataawctt ccccttttat 1620 gctggaagta cgacgatgaa ataaattcat tcataccggt gaatgcgtcg tctgagcaag 1680 agcttgcctt gattgagtct gtgggatgtt gcgggcttca tcaaattaat gatttttcga 1740 atgcgaatgg aagacttctt gatctagtgt ttgtcaatga ggaaaagtac gtcgaactat 1800 tcgagccacc gttgcccttg ttgaaagttg acctccatca taaacctttc gttttgaaat 1860 ttgaaatcaa tgtaatgtgc gatgatgata acatttcaaa tcccctcgat ttcgattttc 1920 accggtgcga ctttgcatcg atcaacactg caatctcgga agtgaattgg aatgaagagc 1980 taaacgaatg cagcatagac gaagctgttt cacgtttcta tcagcgggtg tactcgattc 2040 ttcaaaatgc tgttccgatg aaaaagcgac gtcgaatgtt ggggagaaga caaccatggt 2100 ggaataatca gctgcagcga ctacgaaacc gtttacggaa ggcgagaaag cggtattttc 2160 gttctaggga tgaacagcga aaagtggaac tgcgtgttat tgagagcgaa tacaactctt 2220 tgcatgcaca aagctttcgc tgctacattc accggacgga agagaaaata aaaagggatc 2280 ccaaatcmtt ttggtcgttc gtgagtagcc gcaagcaggt gaatggaatt cctcagcgag 2340 ttcattacaa cgatgttgaa gccgatactt cgttcgattc ggcgaacctg ttctcgtctt 2400 tctttcgaag tgtgctaagt aataaccgat caccttcgtc tgatgcatac ttgaacagtt 2460 tgccgcaaca tgatttcaac gtagcacttt tcaacttttc tccggaggat gtgttgacta 2520 aaatgcaagg attagacgag tctaaaggcg ctggaccaga tcgactttct ccggttttca 2580 tcaaatactg tgccgattcg ctggccgctc ctacttccat tatcttcaat agatcgattg 2640 cggagagtgt ttttccaagc gcatggaaaa tcgcttccat aactcccata cacaaagctg 2700 gtagtataca caatgtggag aactaccgtg gcatctcaat tctaagctgt ttgccaaaag 2760 ttttcgaaag cctagtccac gattckctct atccacatgt gcaccacata atctctgaat 2820 ttcagcatgg gttcgtgaaa agacgctcta caactacgaa tttgatgacg tatgtatcca 2880 cgcttattga cgcactggaa aaacgacagc aagttgatgc aatatatatc gacttcacta 2940 aagcatttga cagagtacct catgttttag cagtgaacaa attcaacaag atgggcctgc 3000 ctgattggct gactcgatgg atacactctt acctcactga gcggagtgcc tatgtgcgaa 3060 taaacggaac cagctcaaac ccgtttgaaa taacgtctgg tgtgcctcaa ggaagtcact 3120 tgggcccttt attttttctg ttgttcgtca acgatctttg cgaatcgatc aaatcgcaaa 3180 agcttatgtt tgcagacgat ttgaaattct ttcgagttgt ttcatcgcta gtggagtgct 3240 gtgccatcca aacggacatt gacacgttac taaactggtg cctgttaaac ggaatggaag 3300 tcaacacccg aaaatgcagc gttattacct tcagtcgagt gagaaaccca gccaactttg 3360 agtataaaat gtcatcagag attatcgcca gagtcaccac agtgaaagat ttaggtgtgc 3420 tactcgacag taagcttaac tttgctcaac atattgcttc aacaacggcg aaggcgtacg 3480 cggtattggg cttcattaaa cgtaatgcac agcagtttga agacgttcac tgtttgaaaa 3540 cactctactg tgctctggta cgcagcatac tagaatatgg tgtgctggtg tgggcaccct 3600 atcacgctgc tcaaagcgct aggatcgagc gaattcaacg gaatttcctg cgatttgcgc 3660 taagacggtt gccatggact gatcctatcc gactgccgcc gtacgaacat cgctgcaacc 3720 tggtgcacct gacgacgttg gctagcagga gggttctttt gcaacggctt ttcgttttcg 3780 atattttaaa acacaatatt gactgctcta tgctattgta caaagttaat tttaatgttc 3840 caatcagaat tactcgacac actgaatttc tacgtttacc tgttcaccgt acgctttatg 3900 gtcagaataa cccgtttgat gtttgttgtc gtcgttttaa tgaagtgttt gtaaattttg 3960 attttaatgt gtcgaaaccc atgtttagaa gtagaattag atattaagtg tatctgtctg 4020 tacggttaaa atcgaagaca agtaaataaa taaataaaat aaata 4065 // ID Gypsy9-SM_I repbase; DNA; INV; 16308 BP. XX AC Contig472; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9-SM_I; KW Interspersed repeat; LG_I; internal portion. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-16308 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-16308 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 755-755 (2007). XX DR Genome; Contig472; Positions 135259 118952. XX CC Positions [8227-8706] - Integrase core CC 'AAAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3606..5099 FT /product="Gypsy9-SM_I_2p" FT /translation="MAEGNGNLNKNNEKEPTKLPATGETVREYREKFRGEP FT FNGNANKLNTFLRNFGMYVEVCGWTDNMIKTRMPLYLCDSALDIFLGAKER FT GKPLNNWKEIQEFLKLTFGVAKLTNQGIQELFNRKQRHGESNTMFASEIRR FT LAKTATDGKLDEAHLIGIFVDGLRRSELRSAVGMQMLTTLDEAVARANQAE FT IHLPSQIALMQDETIIATVAVKTEESEQNGVKMNQFIPGAHQIFKPSVQRT FT NNIRPNNNYNNNQRSNYKGGNNSTQGKQCRTCGKIGHWESECYQNVPCGRC FT GRKGHNINRCETRTCFICGKQGHVANKCRGGNNQNQGPPRNNNYVQLAPRQ FT LKVEQTKQVNVIQEISHLKDMMSKMIRTSQPQQQQSNIHMMQRVENARNRE FT TEMTQQRQAQDWQQLQMEEERVRQEKCKQRQDNQPRINMMRMIKDVKEQYC FT CNIYKKLPKKINSRNQYQKNQEKTKPSPMKQSDQLKQTNEEHKKYYSGKLK FT KG" FT CDS 7659..11789 FT /product="Gypsy9-SM_I_3p" FT /translation="MRWAMQLQGYSPFIKFKAGRANANADCMSRFIFPDLM FT DEEVRRICIMIAEEIDFSQLMIDQQEDQELKLIIKYVKTGKPEVYENDLQL FT KEHMDKHRHRYVIEKKWLLIADGEQRLMVIPENRRKELLTQYHDGKLCGHM FT SVKKTLARMKQKYFWPNMSESVKDWIKNCLICATRKNTGSKWKAPLKPMPV FT PAEPMTMISMDILEPLRETLNGNIYILVVTDYLTKFPEAFAMKDQKAKTIA FT QKLVEEICCRYGTPKQILTDQGTNFMSEIMKEVTNYFKIAKLRTSPYHPQC FT NGLTERFIGTLIQMLSNYVDRYQRDWDEYINLCLMAYRMSIHAATKMSPFK FT LMYGRECNMPMDLEYQPPISQYMEEDDYVTKFKERMQEIWRTAGLNIKFNQ FT ESYKDLHDQKRNVKPHTFNVGDWVIIETPELIPGTTKKLQRNGKGPYEILY FT ASETNVRVKLVNNPFTRSIYVNVARCKAIPKTITAQPEERRIMTRSQTTRI FT SMLRCTNQDYPYIDPDEYDEDEYQEYDDEEWEDHHTGKLEPEIAENIVKEA FT KTLLPEWSKEKRNIEILGGLMRIIDNYKRDEPDIEHITIIIINWSILFTIG FT DKESNTPLHTIMQTNNHERIQEIINVWERLKKSVDPRNEDGETPLMIAAGR FT DIPTWIIKALLKAGSDVTATNNKGNTALKCVILTNLPKNLGAMLDNITKEN FT IDNLEDLPDLLRVALECNKQEMAIRLIQWKRPDNGKWTINTNQTYEQGENE FT FMKLALFQYNTLVAIYELIKRGDLLLTNFSGTEQQALREKQQLFKLIREFR FT HKEQYKAWELHQCSNLMSQFNYIQRVYQKIIYNLQANRNIRQTLASEDIKR FT EVFNRFRITQKDQLNEILRAARQITPARFGLQNKTQILTRLGIIAQSYTIG FT DEIDPDTELIMRRLPRAFTLTIPSGMIPLHTAIHQRQDNRSLKIIKLLEKC FT GEKSNKRGKNGKTALMYAALSGVKMAVIQELIRTVSELTERSDDWSSILDL FT ATNEPNVAMIKFITENATRREIEELCADTQRLSWRTIYRNKEAIKILLNWK FT IPRSDEYAIDVNERYEDGENTLLKKLMIKGGNPKIIMMVLQRTELRFTDNV FT REQMRIGNEKTELVNLVENWKVTKTRVRKERTSLDEIIKARDKAIAAYKAL FT IRKLNVGDDISAKTKGDTDNNDENDEDEDNGEDERNDENGERNQRDKIEQR FT NDRHNNHENKKNDKNNRNENKKSNYEKRTVCLPKIELNWKNNLLTFSIVIT FT TWKLAVAQYILNDSRRDKENIIIKPNHVKITNIEGQAITQVCITRIQQNWI FT GDKLCEKEMANLKDPIYIKGIEALRRYRICVYRNKKRSSRCVIKRAPREST FT RHKIDNKKNNESNH" XX SQ Sequence 16308 BP; 7235 A; 2787 C; 3244 G; 3042 T; 0 other; gatttttggg ggctcagccg ggattgatta aaacctctcg caattggaaa gatgaaaacc 60 cagggaaatt caaaactcaa cagagaatgc ataaataaaa ccccagccga gatgggaata 120 aaattatacc aacaaaaaga ataagagtaa cgagaatcgg agagagaatc gtggaaatat 180 ctgaaataac ctagctaaac tacgcgaaat aaaacgcgaa tacgacattc tgaagtaaca 240 aaaaccgaat aaataaaaac gtagcataaa ggttatctaa ttaattaaaa aatgaatata 300 ataaaataaa attcgtaaaa caataaaagt attaatagtc aaaaagggag aaataataaa 360 aatcggaaat aaaataaata gagtagaagt agtaatccaa ttattaaaca aaataaaata 420 ataaaataat gaggaaataa aatcaacaaa tttaaattaa tactagaata ataaaaccat 480 taattaaaaa taaatttttg tagatgaacc aatacaacaa gaaattagat caactacaaa 540 gctttttgaa tattgaaaca gacccaatag aaaagttagc caaaaagatg aagcaaataa 600 acctagcaag acaaagtaac ccacaaacaa taagccgcga aaacacccga acaacactag 660 gattaatgca gacgacagaa acggagaacc aatacgactg gtaccgagca agggtaaata 720 acccagaatt gaatatgtaa taccatgcaa tagaacaaca agtcgaaaca atgtctgacg 780 ttatacaaga tataatacgc cgagaagtag cgccactaca ccaatgctac atttgccaaa 840 tcaaacgaat aaaagtgctg acactcccgt gcgggcattt ttgctattgt cgaaaatgtg 900 ccgagccaat gacaatgtgc gtaatttgta aaactgacat tgacgaaaag gtgatagtgt 960 attatggata aacctgttaa aaaaataaat aattagttag ttactttctt cctgctcata 1020 tatctatata taaaatatat gaacaattat atatatatat atatataaaa gtaaaaagta 1080 aaaagcaaaa aggaatgaaa ggcgagacat tacgaacgtc gcaagtaaaa gcataaaggg 1140 aaaatacctt gagctaccct cgggtagtac aactaaaagc cgaatcggga ggggataacc 1200 gatcacccag aagagagata taagacgaac tgtcgacaag ttttgcccag ttgaaaagct 1260 actgtggtaa ggtagatcac cttaccgttg cagtggtgga tcaacgaagc cgtgttgtat 1320 tttaaatctg agggggtagg cgaagaattc tcgaagaggc agaatgagaa acgcgcgtat 1380 aagaaataaa taaaggaata ggccgaagaa atagaaagta ataattaaga gaaaatgatt 1440 aacgagctgg agtagagaat aaataaaaga tgaaaaacag gaacgatcgt gaagccggca 1500 agaaaacgta ataccaatca gtactaaaaa gacgaatcgg aagcacaata atggccaaat 1560 cgagaatagg ggacatgtat taacaaaaca tgaaccccat gagatgtgta ttaacggaac 1620 acagtctcag ctcggagaca ggtattaacg gaacctgatc tcaaataagg gacatgtatt 1680 aacggaacat gaaccccatg agatgtgtat taacggaaca cagtctcagc tcggagacag 1740 gtattaacgg tacctgatct caaataaggg acatgtatta acggaacatg aaccccatga 1800 gatgtgtatt aacagaacac agtctcagct cggagacagg tattaacgga acctgatctc 1860 atataaggga caggtattaa cgaaacctga gctcaaataa gggacatgta ttaacggaac 1920 atgaacccca tgagatgtgt attaacggaa cacagtctca gctcggagac aggtattaac 1980 ggaacctgat ctcatataag ggacaggtat taacggaacc tgatcacatt tgtggaaaac 2040 aaaagacatg cgttagacat gatctaattc gggaaattaa agagatgtgt aggaatagaa 2100 cacagtctca atgcgaagta aaagaagacg tgcattaacg gagcgtaatc tcaatgagag 2160 ttagaaagga catgcgataa aaagacatga tccacgaata tagtagaaaa gataaaatgg 2220 gaaattatgt aattgataat gaagaaaaag aaacaacgag aaacaaatca aagagacaaa 2280 atgaatcaat aaataaaatc cgtgcacgga gaagaaaata tactgtataa aagtggtaat 2340 agataacgat attgaagttg acaggctaag aaaacggaaa atagtataaa taaggaataa 2400 taagaataaa gaaaatagat taagaaaacg aaggagcaaa acaaaaagat gaaaccgaat 2460 caaacaagat gggaagaaaa gtagaaagaa gaggggaaat aaaattatat aggtataaat 2520 aaaggggaat gtgaagcgca tagtaaacaa aataaaataa ggggggtaat agaatctggg 2580 aaacggcata gccgaacgat ttattataaa tagataattt ctacgaccat atagataatt 2640 attactaccc catatagata attaaaaatc gacgtaaaat gagtcaaaag ggaaaacaaa 2700 acgtatggga attatactaa acgggataac accggggaaa tatgaattat tataatcatg 2760 ggtactataa tcaaatcgag atggcggaca agccggaaaa agaagcgata aataaaagat 2820 aataataatg aagaaacgca gacgaataaa agcaaagaaa cgacaaataa aataataaat 2880 aaacccagta tgtttatttg aggcggttcg aaagattgaa agcgtgtaag cggattatgc 2940 tacacggttc attttttcgt tctagaactc aacaaagcaa tcaaagcaat caaagcaatc 3000 aaagcaatca aagcaatcaa agcaatcaaa gcattcaacg caagtaattc acaatcaaga 3060 caaacaaaac caacaaagca ttcacaaaaa caaaccaaat aaacacgtaa tccaaacaca 3120 aagacaaacc aaccaatact cgacgacaac taggaaacga taaccaagca aagttcagtg 3180 aaatactcgc atccaaggaa ccgactacgg acaaaattac agacgagacg acagataaca 3240 gacctcgacg caaaaaatcc atcaaagtaa aggaaaccac ccaaacagaa gctatcgagg 3300 agagaacaac gttacgatcg cgtaacagct acctactatc acggaaaagc gccaagaaat 3360 ccagatcgac ctcaatcgcg agaaaactac acagaaccaa atcactcgaa gatatcgatt 3420 ccaacagcca aggatcaggc gacaaagaaa accaaggatt aacaagaacc acgagtcaat 3480 cgaatatcgc gaacggaaac gaaaataaca atcatcaagg agaagacgca gaggttcaag 3540 aacaaattat atcaaacgaa ccagaaaacc aattgaataa cccagaatca acaaacaaaa 3600 acaatatggc cgaaggtaac ggaaatctaa acaaaaacaa cgagaaagaa ccaacaaaac 3660 taccagcaac aggtgaaaca gtcagagaat atcgagaaaa attccgagga gaaccgttca 3720 atggaaatgc aaataaactg aatacgttct taaggaattt tgggatgtat gtagaagtct 3780 gcggatggac agacaacatg ataaaaacca gaatgccact atatctatgc gatagcgcac 3840 tagacatttt cctgggagca aaggagagag gaaaaccact gaataattgg aaagaaatcc 3900 aggaattctt aaaactaacg tttggagtgg cgaaactgac aaaccaagga atacaagaat 3960 tatttaacag aaaacagaga catggagagt cgaacacaat gttcgcgagt gaaattagaa 4020 gattggcaaa aacggcgacc gacgggaaat tagacgaagc ccacttgatt ggcatatttg 4080 tagacggatt gagacgatca gagctgagat cggcagtagg gatgcagatg ttaacaactt 4140 tagacgaagc agtggccagg gcaaatcaag cagaaataca cctaccatca caaatcgcac 4200 ttatgcagga tgaaacaata atagcaacag tggcagtaaa aacggaagaa agcgaacaaa 4260 atggcgtaaa aatgaatcaa tttataccag gggcacatca aatattcaaa ccatcagttc 4320 aaagaacaaa taacattcga ccaaacaata attataataa taatcagaga tcaaactaca 4380 aaggcggaaa caattcaacg caaggcaaac aatgtagaac gtgcggaaaa attggacatt 4440 gggaaagtga atgttatcaa aatgtgccat gtggtagatg cgggagaaaa ggacataata 4500 tcaatcgatg cgaaaccaga acttgcttta tttgcggaaa acaaggtcac gtagctaaca 4560 agtgcagagg aggcaacaac caaaatcaag gaccaccaag aaataacaac tatgtgcaac 4620 tggcacctag gcaactaaag gtcgaacaga caaaacaagt gaacgtgata caagaaatat 4680 cacatctcaa agacatgatg tcaaaaatga ttcgaaccag ccaaccacaa caacaacaat 4740 caaatatcca catgatgcaa agagtcgaaa acgcgagaaa tagagaaacg gaaatgacac 4800 aacaacgaca ggcacaagat tggcaacaat tacaaatgga agaagaacga gtgagacaag 4860 aaaagtgtaa acaacgacaa gacaatcaac caagaataaa tatgatgaga atgattaagg 4920 atgttaaaga acaatactgt tgtaacatat ataaaaagtt accaaaaaag ataaactcgc 4980 gaaaccaata ccagaaaaac caggagaaaa ccaaaccgag cccgatgaaa caatcagacc 5040 aactaaagca gacaaacgaa gaacacaaaa aatattactc gggaaaatta aagaaaggat 5100 aactcgcgaa aacgaacaga caaacgaaaa tgagacaact ataggaacaa aacagacaaa 5160 tggaaattcc aaattaaaac gggatataat agaaacatca atcaacaaca tggaaaaaga 5220 atttgaaata ccaccaagag gattaataaa agtagaaaca aagagaaatt tacgaataaa 5280 tatgctacga aacatcagag attcatcacc caaggatgga gaaacaagtg aaaccagtag 5340 tctgatggga gaagcactac gagtatctga aaacgaagaa caaaataaac atatcatacg 5400 accaatgaat cgatggatct gggatcgacc actatcacca acagtaaggg aaatgaaatt 5460 aatagataaa ctaatcgaaa ccatagccga aacattaata atcgaggagt tcatcaacga 5520 cgaagtcttc aagagataca tgaggaacat cctacaactc gaggggtcaa aaggagacgt 5580 gataccgaaa caggagaaac cactgtaaaa ccatatctgg ataagcaaaa gttagaggac 5640 ataacgaagt ttcaaatact catgcaaacg ccagacttcc tagaacgtat ctggatcaaa 5700 caaccaaaca ctgatgaaat attcgattgg ataaaagaaa gaatgagaag ctccaagaaa 5760 aatgataacc tataccataa tggatggctg ataaaaccag acatgtatct ggtggatgca 5820 ctaatatggc caggtaaaac aacgtatgta actttagaaa aggaaagaac aataccggat 5880 ttcggactat gggcagagta cgcaaaaaac gaattcgaaa tcaatattaa aaagaactta 5940 caaggagcaa ccaagagaat actttgtaaa aacagagaaa aactgatcac acagataata 6000 gatcaaatcc acaagaattg ggaaatatta atgtatctca aaccagtaga gatgaaggca 6060 tgtatggaaa acctgcgtga tacagtaatt tttagacacg ctaattataa gaactcgaca 6120 ctaatcggga aatacggaac tcaccgaata aaaaaagaaa gggcaaagga gttgaggata 6180 aagttaaagt taccaaacga ggaacaaatg acgttgaaaa tgagtaacaa aacgcgggta 6240 gacgagttga taaaaatcat caatgagatg atctttgaac aagcaaaaaa tcaaagcgag 6300 aacggaatga tagaagatgt agcagtaaaa tacctccacc attatctaca cgaagaggac 6360 gaaacatatc tatatgagtt gggaatatgg gatgatgacc aagaaattag aatagtacca 6420 atggaagaag cttgatgggg aacggatacc aagtggaaag aaatggacga agtagaaaaa 6480 gaaggccacc aaaaatctaa gagaatcata gaaacctatc ggcgatggcg aaaggatcac 6540 cgtcttaaag agaagctcca cgaactagat acaaacgaat gtaaaccaac cccatggacg 6600 aaacaatgga atgatcaaga gaaaagatgg gaaccaaaac ttggagcaat ggaagacgag 6660 tcattggaag aggaaagaat caacaatcct aaagaagaaa ccccgaaata cagaatgaaa 6720 gaagccgaac ataaaacaag gtccgaaaac gaatgtagga aaataaatgt aatgaaaaga 6780 caaagatccc aaccagtaaa tattgacgaa gcacaaataa aattggaaaa cgatcacttc 6840 gaaaaattaa tagaaatatt acagcaaggc gcaaggagcc aaagacgagc agcaacgacc 6900 catgcgaaat acatggcaga aaatggtgaa aatctagaga acttaagaga attagtaaaa 6960 gagatcgagg aatatagcag aattccaaac aaacaattca tgaataatgt agaactaaca 7020 gcaatgatta tacagcaaga aatcgaaccg ggtatactgc agaatggtgg gtggaaataa 7080 tttataaata aacattcttg ggtcacatag tatcagcaaa agggaaggaa ccagatccca 7140 gaaacatcga aaaaatcaag aattgtcctg caccaaaaac ggtaactcaa atacaagaat 7200 ttttagggct atgtggatat tacagaaaat tcatcaaaaa ttacgctggc atagcgaaac 7260 caatacaagc cctagtcaaa aaagatacat cattcgtgtg gggtgacgaa caacaacgag 7320 cattcgaaac attgggagac acgctaatcg aagcaccaat cctaggccac cctgacttca 7380 aaaaaccgtt cctactggca accgatgcaa gtggatacgc atctggtgca atattgggac 7440 aattagacga agaaggaaaa gaacgcgtaa tcgggtatta tagcaaaaca ttcaaaaagc 7500 atgagaaaaa ttattcagtc accgaaagcg aagcattagc aatcatacaa gcgattaaac 7560 atttcaaata cctactgtgg ggtcacgaaa tatgtattac tacggatcac cagtcactag 7620 tatggctcgg tcaacataaa gaagcatcga gtcgactcat gagatgggcg atgcaactac 7680 agggatattc tccgtttata aaattcaaag caggccgagc aaatgcgaac gcagattgca 7740 tgtcaagatt catcttccca gatttaatgg atgaagaagt cagacgaata tgtataatga 7800 ttgcggaaga aatcgacttc tctcaattaa tgatcgatca acaagaagat caagaattaa 7860 agttaataat caaatatgta aaaacgggta aaccagaagt ctacgaaaat gatctgcaac 7920 taaaagaaca tatggacaaa caccgacata gatatgtaat cgagaaaaaa tggctattaa 7980 tagcagatgg tgagcaaaga ttaatggtaa tacccgaaaa tagacgaaag gaattgttaa 8040 ctcaatatca cgatggaaaa ttgtgcggac atatgtcggt aaagaaaacg ctcgcaagga 8100 tgaaacagaa atatttttgg ccaaacatgt ccgaatccgt gaaagattgg ataaaaaatt 8160 gcctcatctg cgcaacaaga aaaaataccg gatcaaaatg gaaagcacca ctaaaaccta 8220 tgccagtacc agctgaacct atgacaatga tatcgatgga tatattagaa ccattacggg 8280 aaaccttaaa cggaaatata tatatactag tcgtcacaga ctatctgaca aaattcccag 8340 aggcatttgc aatgaaagat caaaaggcca aaacgatagc acaaaaattg gtggaagaaa 8400 tctgttgcag atacggaact ccaaagcaaa tattaacaga ccagggaacg aatttcatga 8460 gtgaaataat gaaagaagtg accaactact tcaagatagc aaagttgaga acgtcaccgt 8520 atcacccaca atgtaacgga ttaacagaaa gatttattgg gacattgata caaatgctgt 8580 caaactatgt agacagatac caaagagact gggacgaata catcaaccta tgtttaatgg 8640 catacagaat gtccatacat gccgcaacca aaatgagccc atttaaacta atgtatggca 8700 gagaatgtaa tatgccgatg gatttagaat atcagccacc aatatcgcaa tatatggaag 8760 aggatgatta tgtcacaaaa ttcaaggaaa ggatgcaaga aatatggaga acggcagggt 8820 tgaacattaa attcaaccaa gaaagctata aagatctaca cgaccagaag aggaacgtta 8880 aaccacatac cttcaacgta ggtgactggg taatcattga aacaccggaa ttgatcccgg 8940 gtacaaccaa gaaattacaa agaaacggaa aaggaccgta cgaaattcta tacgcaagcg 9000 aaactaacgt gagagtgaaa ctagtaaaca acccattcac tagatcaata tatgtgaatg 9060 tagcacgatg taaagcaata ccaaaaacaa taacagccca accagaagag agacgtataa 9120 tgacacggtc tcaaaccacg aggatatcga tgctaagatg taccaaccag gattatcctt 9180 atatagaccc agacgaatac gacgaagacg aataccaaga atatgatgat gaagaatggg 9240 aagaccacca tacgggaaaa ctggaaccag aaatagcgga aaacatcgtg aaagaggcta 9300 agaccttatt gccagaatgg tctaaagaga aacgaaatat tgagatattg ggaggattaa 9360 tgagaataat cgacaattac aaacgcgacg agccagatat tgaacacata accataataa 9420 taataaactg gtcaattcta tttactatcg gtgacaaaga aagcaatact ccgctgcaca 9480 caattatgca gacgaacaat catgaaagaa tacaagaaat aattaacgta tgggaaaggt 9540 taaagaagtc agtcgatcca agaaacgagg acggagaaac cccactaatg atcgcagcag 9600 gaagagacat cccaacatgg ataatcaaag ccttactaaa agcaggatca gatgtgacag 9660 caacaaataa taaagggaat acggcattga aatgcgtaat attaacgaat ttgccaaaaa 9720 acctgggagc catgctagat aacattacca aagaaaatat cgataactta gaagatctac 9780 ccgatctgtt acgtgttgca ttagaatgca ataagcaaga gatggcaata agattaatcc 9840 aatggaaacg acccgataac ggtaaatgga caataaatac caaccaaacg tatgaacaag 9900 gagaaaacga gttcatgaaa ttagccttat ttcagtacaa tacgttggta gcaatatacg 9960 aattaatcaa aagaggagat ctgttattaa caaacttcag cggaaccgaa caacaggcat 10020 taagagagaa acaacaattg ttcaaactga taagagaatt ccgacataaa gaacagtata 10080 aagcttggga actacatcaa tgttccaatt taatgagtca attcaattac attcaacggg 10140 tctaccagaa aatcatatat aatctacaag ccaatcgaaa cattcgccag acgttagcaa 10200 gcgaagatat aaaacgggaa gtctttaacc gatttcggat aacgcaaaaa gatcagttaa 10260 acgagatatt acgagcagca agacaaataa cgcccgcaag attcggattg cagaataaaa 10320 cacaaatcct cacaagattg ggaataatag cgcaatcata cacaatcgga gacgaaatcg 10380 atccagatac ggaactgata atgcgaaggt tgccaagagc cttcacctta acaataccat 10440 caggaatgat accgctgcat accgccatac atcaacggca agacaatcgc agtctaaaaa 10500 ttataaaact attagaaaaa tgcggagaaa aatcaaataa aagaggaaaa aacggcaaaa 10560 cagccttaat gtacgctgca ctatctggag taaaaatggc agtaatacaa gaattaataa 10620 gaacagtatc agaactaacg gagagatcag atgattggag tagcatactt gatttagcaa 10680 caaatgaacc aaatgtagca atgataaaat ttataacaga aaatgcaacg agaagggaaa 10740 tagaagaact ttgtgcggat acccaacggt tatcgtggag aaccatttat cgaaataaag 10800 aggcaataaa aatcttatta aattggaaaa taccaagatc cgacgaatac gcaatagatg 10860 taaatgaaag atatgaagac ggagaaaata cgctactaaa aaagctaatg ataaaaggag 10920 gaaatcctaa aataataatg atggtactac aacgaacaga gttgagattt accgataacg 10980 tcagagaaca aatgagaatc ggaaatgaaa aaacggaact cgttaacctg gtagaaaatt 11040 ggaaagtcac gaaaacaaga gttcgaaagg aacggacgtc attagatgaa ataatcaaag 11100 caagagacaa agcaatagca gcttataaag cattaattag aaaactcaat gtcggggacg 11160 acatttcagc gaagaccaag ggagataccg ataataatga cgaaaatgac gaagacgaag 11220 ataacggaga agacgaacga aacgatgaaa acggagaaag aaaccaaaga gataaaatag 11280 agcaaagaaa cgacaggcat aacaaccacg aaaataagaa aaacgacaaa aataatagaa 11340 acgaaaataa aaaatcgaat tatgaaaaac gaactgtatg cctacctaaa atagaactaa 11400 attggaaaaa taatctatta acattctcaa tcgtcataac aacctggaaa ttagcagtgg 11460 ctcaatacat attaaacgac agtcggagag acaaagaaaa tataataata aaacctaatc 11520 acgtaaaaat aacaaatatc gaaggacaag caataactca agtatgtata acaagaatcc 11580 aacaaaattg gataggagac aaattatgcg aaaaggagat ggcaaaccta aaagacccca 11640 tatacataaa agggatagaa gccctacgaa gatatcggat atgcgtatac agaaataaga 11700 aacgctcaag tagatgtgta attaaaagag ccccgagaga atcaacgagg cacaagatcg 11760 acaacaaaaa gaacaacgaa agtaaccact gaaaaatcgg ggataacaac aaccaaagac 11820 aacaaccacg cagtgaactc catacaagag gaaacaaacg acaccaaatc caccaggaat 11880 aacgtagaaa tagaaaaacg tatacgcaca aagtaaacaa aatatgcata gcataaatca 11940 acggccgaag aatgggtcaa tgccaaaaag ccacgctaaa cttaggagat cagctaataa 12000 tacgaggagt acacctacaa acagaatata gaatctgtac acccagacca gaagcgaagc 12060 ctacaaaatg tgcaacagag aagacagaaa aggaaacaac aacagaggcc acaatccgat 12120 ggataacggg tcgcgccata acaacattgg atctcgacca acaaggaact atcatcggag 12180 gaagaaaaga tacccatata tctgtattct tcgtactagt aaaaggagaa tgggacgagc 12240 tgcaagcatg gccattcagt aacaaaataa cactagtaat actatcggca aaagacgtac 12300 atgtggagag actcaaaccg gacggagcac cgtgctacca taaaccaaca gcggacgtag 12360 gtgtagcaac gggagcgcca aagataatga gacaaaataa aatatgggaa tatctagaag 12420 gagacactct gttcgtaaag ttatcggtcg aaagagaagg aacgggacgg accgattggg 12480 aaataaggaa cgtgtaagat gggaataagt ataaatgtat atatctgtat atataacgta 12540 aaagtaatat cccaaaaccc aaaatatata tgcctaaact caaaaccgat aaccaccatg 12600 aataacaaat ttaattgtaa ataccaattg tagaatgaac aataaagaaa catcaatcga 12660 aataataaac cacctcccaa taatgaaaat gatgcagatc atgtgtaacg agaaccaaag 12720 agcagtgttg ataaaaagtg aggagaaaga gacaatgaat aagataaaga cagaaacgga 12780 aaatgcgaga gaatttggca acaattgcag aatggacatg agaacgagtt gattatgaaa 12840 caaagtaaca acgtaatgaa tagagtcaga gaaatatacg aggaagggat ggaggagttc 12900 gcaaaagtaa taaagaaaaa aatagaacaa gatatacgag aggaaaagaa aaggtatgca 12960 tcacaacaaa tagaagaaac ccagaaagcc gaagtggaaa atctccaaag acaagtaaaa 13020 caacaagaaa taaagaaccg cgagagagac gatcagctga aacaattaat ggaaatcaat 13080 caagaacaaa atacgaaaat aaattacgag aaagtattac gagaactaca gaatcgaaac 13140 agcgaccaac tgaaaaggga gataaatgag ctgaataaaa gaatagcaaa tcgagcatgg 13200 gaaagcataa aaagacaagc tcaatatcac agcgaaatca tagatgcggc atacgcaaga 13260 tgcctgacag aaaaaggaaa aagaaaagta aaggaacttc aaaaactagt agaagaacaa 13320 gaattaaaaa caaaccatgg gaaagtgatg atggagttac taaataaaaa tataacggaa 13380 ctaaaggaaa ggacaaatga aataaaagtg aaatattaca gagaagtcat caacgcagct 13440 catcaacaaa gccaagcgga caggtcagtg caggagaaaa gggaactcta aaccaaaacg 13500 caagaatcta aaaagaaaat aactttcggt aaagtagtgt tgaagctggc taaccaaaga 13560 ggcggtcgac tagaagaaga agtggaagaa ctaaacgaaa gattactaaa gaaaatggac 13620 gaattagaaa aaacgcgagc aaaatatcta agtcagctgg caagagcgac ctttaatcaa 13680 tgcagagtcg aaggggctgg aagaaagaac aaagaactcg aagcccaaat cgaacaaatt 13740 ggaaacatcc aagttagaat cagaaaggaa ttagaaaaca ccaagaagaa acaaagttta 13800 caggtaacca gatggtcaga aacgatgcat aaaagtgaaa gaaagataaa aagcctaacg 13860 aagcaggccc agacgtggaa acgaaaacat cagataacga tgaaagaaat gaaagaactc 13920 cccaagaaat ggatacctcg agaaaatctg cagatgtggg agcagatgga agaacaacga 13980 gccgaaatcg agaacttgca agaacaacta aactacaaga gaaaagagaa ggaaacctca 14040 ggaaattcat tacggtgaag gttaaccaac tcacaaagac gcagggccga ttacgaaaac 14100 tcgctgagga caaagagcaa agcgacctgc aagtagaaat agaaaagaat aggttagaat 14160 gcgcagaaca ggaagtagag aaatacagga aaatgcgcga acaaagagat ctcgcaatcc 14220 accgacgaaa ggcgcttaga ggaagcgaaa aggaggtatt tgacgaaaaa gtaagagaaa 14280 tgcgggctga gatagaaaga ttagaagccc agagagatac ctcaatacta gaaagagaga 14340 tagaggatct catcgaagaa gtaaaccagc tatcctatat gctggacggg gaaatgaatg 14400 cctcgcaagg attgcgggaa agaataaatg agatgttaaa catcccaccc atggtttgga 14460 cctcacttat acctatggtg ccaatgccaa tcgtaaatac aaatgtgggt gaaccactta 14520 aacaaccaca agaaaccgta ttagatgaag ggtttgtgga agttttgggc acatgggtat 14580 aggaagattg gaagaaaaag gttgtaaaaa gtgtagaaag atgtaaataa acctcgatga 14640 aaaataaaaa taaaaataaa aaataaaaaa gccaaataaa aacccataaa aagaataaaa 14700 aatgcaaaaa atcataaaaa ataaaagaaa ccaaaaaata acaaaaaatt aatgtacaaa 14760 aatttgtaga aaatgaataa tcagagagaa ctaataacaa atgaaggaga gtccggcgag 14820 accactgtca atgcgagtgg agaacgagta tccattacca ataatatgta agaccgagga 14880 ggtagtggaa ggagaaggta gtgagcagga agagatgata agaaacataa agatagcgat 14940 ggagcagcaa atacaggaga tagactgtat aacaaatcaa tggtttcaat cacgaaactt 15000 aataaagtca ttaaaaagag aatgcgagga attaaaaaga cagagggaag aggaagaaga 15060 cgaaaacgaa tggctaaaac aaaaatggaa ataaacaagg gttaggaaga aagggtaaac 15120 aaactaaaaa ccgagttagc tgcaaaatgg tcgagacacg tgttagaaga ccattggaaa 15180 acccaagcaa agaaaatgaa gagcaaatac caaaaaataa agaaattaaa tcaagcatac 15240 gggaagctcg agaaatatgc cataaaacaa tacgaaaaag gacgccaata caaacaacaa 15300 atgcaaatat taagagtaga aaatgaaacc caaagaattg ttatagagga aaaagacaaa 15360 ataatcacta ttattcctga gatgagaggt agattaatga gagcagaact attgttgatg 15420 agcagcccaa ccttgagtag tcacgacgag ttgaactccg aataaaataa caggaaaatg 15480 tacaaatgta aataaacaaa agttgaaaat catataaata gaagaatgaa ctcacaccca 15540 ctaatctatg ccttcttaat catcgtcctc atcaaaaaat caaacccaaa tcttcagaga 15600 acccaagaag ccatcgaaat aacccaaact atcaaaacag aactaacccg gatgcagaaa 15660 tacttcctta aaaaatcttc atgtcaaaat ctcgatcttc atccagtgca gaaaatcaac 15720 caactcactg aactcgaaaa taccctccta caagtattaa tatcctcacc tcacccaatt 15780 ccagacaatt caaaaccaaa caaactcgaa atcataatcc tctgcttggg aatcacccaa 15840 atcatcatta tcatcattgc agcaatctgc caattcaccc aaatcgcaaa atcatgccga 15900 aaaactccaa taaatcgaaa tcaaaaaaga agaacaatcc gcttcataaa gccaaccaac 15960 tcgaatctgc aatatcaaaa cagttctgca atagttaatc aaactcgaat catcgaacct 16020 caacccaaac caatccagct ggacgacatt ccaggtccaa gccacggata tcgaacctga 16080 caataattta atcgctatcc catcgtgtgt aattataaat aaaaacttac taactatatt 16140 gacttatgta taaaatccca tcaccttaag tgatgcctgc ttgattcttg taaattgtca 16200 tttcatcttt ctatttgaat agtaaatgta tacaagtaaa agtgtcagta tgtgaatgtg 16260 ctaatgagta attgcggggt acaatttaaa tggccccggg agtgattt 16308 // ID BEL-84_AA-LTR repbase; DNA; INV; 373 BP. XX AC supercont1.284; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-84_AA_; KW BEL-84_AA-I; BEL-84_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-373 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.284; Positions 100090 100462. XX SQ Sequence 373 BP; 120 A; 70 C; 79 G; 104 T; 0 other; tgacagttct caaaactgga agctgcttaa agcaggtatg ccatgctgtt agtgagactg 60 acagcagcct gaaatattta aatttatggt agttttttat ggtatattca aaatggaatt 120 atttcacaca acgtagggtt aaaagtagat gtaagacaat aaattttagt cagttgaaat 180 cgaacattcg agaaataaag ttatcctttc taaacgcaaa taaagtgagt tgaagtgcca 240 gaaaagttaa aaagttgttt aagtggtgaa tcttgtggtg cccaattccg tcgatagtga 300 ccaccccggt agtgtcctac cctgtccgtg tgtgagccca atcccagccg gaaaccacga 360 tcctgcccca aca 373 // ID hAT-2N1_BF repbase; DNA; INV; 1036 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-2N1_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2N1_BF; hAT-2_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1036 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1036 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 910-910 (2008). XX DR [2] (Consensus) XX SQ Sequence 1036 BP; 310 A; 225 C; 213 G; 288 T; 0 other; tagtggtgcg taccggttcg ccgaaccgga cttggacttg ctataacttt cggttcaagt 60 ccggttaaaa ccggaacgtc gaaaaccggt tttctggaaa accggatatc tctttcttat 120 tgtatactag tatcatctaa aacaacgcta gatgcctatg taactttatt aatcgcagcc 180 aacgggccag aaaataacgt atttctgcga aaataaattg tcctaccgcc ataagcgggc 240 caagcacatg gtcgcgccac gggaaccttc tttcaaaatc cggcaacgcg gcaaaaatga 300 caggttcact ttagcaatat tacccatgaa aataaagtat gtggatttgt ttatctcctg 360 gtatgggttt taagtcccat atcttctgaa aatgaagaaa atttccgaga taaacaagct 420 aaaacaaatc tccggcgact gttgacattt gccgcttgcc gggatacttc actgaaagca 480 tgggtgagca gcttgttaaa aagaaacacc gacacgtttt attaaaaaaa aagtttaagt 540 tcataaactt attttatacc tcctgccttt atctttgtaa cgaataaatt tacaattaat 600 ttgtaaggtt aaatgtagag tataaaaaca ccgcacgcat ccttttctac ccgcattaca 660 gtaggctatc ctattgaccc ctatgacctc gcaagcgcca ttttgaaatt tcggccgtca 720 aaaaaaaaaa gcggaaaagg tcggattttg ccgtgatttc ccgatagttt tcattaaaag 780 tcgatgtttc tacaactttg aaaatgatgc agtcgtcttt ggcaatatct ttatagcatg 840 cggcccggtc gaatgtggtt gcgggcgccg gagacgcaaa ttatgtgtcg cctagagcaa 900 gtcgctttat gcttgtgtct atatgggaaa atttagcgac cgtcaaaaac cggacttata 960 agtccggacc tgaaccggaa tctttggact cgagtccgaa cctgaaccgg aacgtgagct 1020 tcggtacgca ccacta 1036 // ID DNA-2-3_NVi repbase; DNA; INV; 1043 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 2-bp; KW DNA-2-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1043 RA Bao W. and Jurka J.; RT "DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(5), 939-939 (2009). XX DR [1] (Consensus) XX SQ Sequence 1043 BP; 356 A; 158 C; 168 G; 361 T; 0 other; cactgagaga aaagaaacat tgcagcaaac atatatgtta actgttaaca taggcatgtt 60 cgatttgagc caataatacg tatgttaata gttaacgtag ctatgttata tgtcaaatag 120 ctgtatgtta atagttaaca tatggctgcc ctttacgatt tttcatagac tttatgaagt 180 aatcagataa aaattattag aaaacacact tgtatgaatt ttattaatgc tgagtaactt 240 tacagtgata taatgtcaaa attcaaaatt ttgtctttca tcggcctttt gggtctaaat 300 accagatctg atagttctag agataaaata atgagtgtga ttccaactga ttcagcagcc 360 tctggcattt ccaaaaaggt atatatcaac tagtttatca ctattagttc aaaagttatt 420 ggaaaaattc aaatctgaaa aatgaatgat tacgatatga tgtttaacac actactgaat 480 ttttccaata acttttgaac taatagtgat aaactagttg atatatacct ttttggaaat 540 gccagaggct gctgaatcag ttggaatcac acttattatt ttatctctag aactatcaga 600 tctggtattt agacccaaaa ggccgatgaa agacaaaatt ttgaattttg acattatatc 660 attccaaagg cactttgcat gcttccttag gtatattcag ttgctttcgt tctattatta 720 atgatttatt gattattcat gataaataaa ccctcaagaa tattttttgt atgaaagagg 780 atgacggatt ttgaggttag gcggtacgag agaaacgcca tctgcacgca acactacgca 840 cgatagccat gaaattcata tgttaactat taacatacag atatttcccg caatattttg 900 tatgtatact atagaaatct atagtataca tacaaactgt ttttgtttcg tatagaatat 960 ttgtaagatg tacacgaata tattagcaat cattgctcct aacgtgtgta catgtttacg 1020 gcaatgatat ctttctctca gtg 1043 // ID DNA8-5_AP repbase; DNA; INV; 234 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-5_AP. XX NM DNA8-5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-234 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1747-1747 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 234 BP; 68 A; 45 C; 49 G; 72 T; 0 other; cagtggcgcc gaagtccatg gtgacgtcgg gccgtggccc gacctctttt ttaaaaaatg 60 tggccggccg gccatatttt atttacttca taactttaat atgagcatga gtatatttaa 120 accgtgatta tgaataaatg tatattaggg tacctctata ttaaataatt aagatattga 180 ggccgggaag ttttaaaatt gcccgaccac attttaaaaa cttcggcgcc actg 234 // ID CR1-86_AAe repbase; DNA; INV; 4990 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-86_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4990 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1174-1174 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 19 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 361..1284 FT /product="CR1-86_AAe_1p" FT /translation="MNCGRCQKNVIDAERIMCRGFCGATFHTICVEVDMPL FT LEQLRPHERNVFWMCDKCAELFSNGHFRNLTNRCDATNADLRDTINDMKAD FT IAKLNTAIGTLARRSEVIPSSPLLSPANPWRLAVRNKSLVNSAKRQRGNDG FT LTVTSDQKPESIKGTRVSNGSIQTVKRNAEQLVWVYLSAFHPHTTVDQIAA FT LARECVGLSDKDDIKVTKLVSKNADTSNMSFVSFKVGFEAQHKPTALSPDT FT WPDDISFREFVNYGQKNMPNIVKLAGSIINESPLDHSNSEGTPPITGNMTS FT CADVNHATSFPVDSNP" FT CDS 1236..4826 FT /product="CR1-86_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MCGCQSCYQFSSGFEPIISSCFPVSDLTSKHSVPVSQ FT VPLNVXEISLPTTNNYPTMPPSPSSFFDISEYHQPERTACRIEEVPEPLDP FT VASAVSRLHSRSGPVVEFGDGIFRPLSSGKYKLSQNNTRPDKISLSSSTSL FT RPRLLDELQVLKPDIVATLQPERTARSTEEALVPLDSVAIAGSSHLSRSGP FT VVELGDEGFHTSTPGKSLLVNHSSSNDARQCFSKQATTGNTSTMNELRIYY FT QNVRGLRTKIDDFFLAVSENEYDVIILTETWLDNVIFSSQLFGNQYSVYRT FT DRNATNSVKSRGGGVLIAVSSKYSSYRDPATVCNSLEQLWVRIDMRSHNIS FT VGVIYLPPDRKDNLNDIQHHIDSIGSVISLLGPNDLALQFGDYNQSAISWT FT ATEPPSVDMDHSRLSVASCALLDGFDLHGMTQINTIKNSRGRLLDLILAND FT FALPSCVLRTPVEALLAIDENHPPLEVGICHTMPLTFESSFDGTRYDFGKA FT NYSELNSALQALDWNFLNLTADVDEAVEYFTNAVNTIIASHVPLARPPPKP FT IWSNSRLRSLRRRRSAALRKYCNNRSQYFKQQLNIASREYRTYNKLLYARY FT VSRIQRNLRSNPKQFWAFVKTKRNENGLPTSMHLGSQQANSAAEKCELFAA FT QFKTAFISSSTPSSQVELAVCDTPHDVLDLNTIQIDEDVVLQAIRKLKSSN FT SAGPDGIPPVILKRCSSALAEPLTKIFRLSLQRQRFPLNWKKSLLFPVYKK FT GDKRDISNYRGITSLSACSKVFEIIVNEALFICCRQYISCDQHGFFPKRSV FT TTNLTNFTSACIQAMDAGKQVDAVYLDLKAAFDRVDHPILLKKLEKLGVAX FT NFVEWFRSYLTGRSLRVKIGSSLSVPFNNESGVPQGSNLGPLLFSLFINDV FT SLILPPGVRLFYADDAKIYIVVGCIDDCVELQTLLCRFERWCSNNSLTLSI FT EKCQVITFSRKRKPITFQYSLCGQSLERVNQVRDLGVLLDQGLTFRHHYND FT IITRANRQLGFIMKVTNEFQDPLCLKSLYCSLVRSIIEFAVVVWCPYHGTW FT KARMETVQKRFVRFALRNLPWRDDQHVTPYFDRCQLLGIETLETRRQTMQA FT MFIAKILNGDIDSPSLLTQVNVNAPERVLRRRHFLRLDGRNSRYGQHDPIR FT FASNTFNGVAHLFDFDVPLASIQQRFTTCIRNMSRNNP" XX SQ Sequence 4990 BP; 1356 A; 1176 C; 1037 G; 1419 T; 2 other; tactttacat tgcgaaattg tcgtggtttt cgacttgttt ttctggtgtt atctttgcaa 60 taataatact gatcaatcgt tttttatgtt ttgtggagtt gaccgtttta tgaacagttg 120 aaattagttg ttattacatg ttaaattcag tgtttttggt tatcgaaagt gtgctcccgt 180 atcgttttgt ggaagataac attgtgtcat acccacacgt cattttttgt tctcaaagcg 240 ccatctgtta ttgactagtg aaaacacgct tcgtttcata ggagtgttca tatacgtggc 300 ttactccaca gacaactaga caactttcgt ccgtgagagt ggtatttcac caattttact 360 atgaattgtg gtcgctgcca gaagaatgtc atcgatgccg agcgtatcat gtgcagaggc 420 ttttgcggtg caacgtttca tacgatttgc gtcgaagtcg atatgccgct cttggagcaa 480 cttcgaccac acgagagaaa cgttttttgg atgtgtgata aatgcgctga gttattctcc 540 aacggtcatt ttcgaaactt gactaatcgc tgtgatgcta cgaatgcgga tttgcgcgat 600 acgattaatg acatgaaagc cgatatcgcc aaacttaaca cggctattgg aacactagca 660 agacgcagcg aagtaattcc atcttcccca ctgctttccc ctgccaatcc atggagactc 720 gcagtacgta acaagagctt ggtgaattca gcaaaacgtc aacgaggaaa tgatggactc 780 acagttactt ccgaccagaa acctgaaagt attaagggaa ccagagtctc gaatggctca 840 attcaaacgg taaaaaggaa cgccgaacaa cttgtctggg tctacctgtc tgcctttcat 900 ccgcatacta ccgttgacca aatcgctgca ctcgctcgcg aatgtgtcgg attgagcgat 960 aaagatgata ttaaagttac caagctcgta tctaagaacg cagatactag caacatgagc 1020 ttcgtatcat tcaaagttgg attcgaagca caacacaagc caactgcgtt atcacccgat 1080 acttggcccg atgatatttc tttcagagaa tttgttaact acggtcagaa aaacatgccg 1140 aacatcgtga aattggcagg tagcatcatc aacgaatccc ctttggatca ttcgaactcc 1200 gagggtactc caccaataac gggaaacatg acttcatgtg cggatgtcaa tcatgctacc 1260 agttttccag tggattcgaa cccataatct catcatgctt ccctgtttct gatttgactt 1320 cgaaacactc ggtccccgtt agtcaagtcc cactcaacgt agawgaaatt tcattgccaa 1380 cgactaacaa ctaccctaca atgcctccat ctccatcaag tttcttcgac atctccgaat 1440 atcatcaacc agaacgcact gcctgtcgta ttgaggaagt ccctgagcca ctcgacccag 1500 tcgcgtctgc cgtttcccgc cttcacagtc gttctggccc tgtggtcgag tttggtgacg 1560 ggatcttccg cccactctca tcaggcaagt acaaactttc gcagaacaat actcgtcctg 1620 ataagatatc actttccagc tcaacaagct tacgccctcg tttactcgac gagctacaag 1680 ttctgaagcc tgatatcgtt gcaactcttc aaccagaacg cacagcccgc agcactgagg 1740 aagccctcgt gccgctcgac tcagtcgcga tcgcaggctc cagtcatctc agtcgttctg 1800 gtcctgttgt cgagttaggt gacgagggct tccacacttc aactccaggc aagtctctac 1860 tcgttaatca cagttcatcc aatgatgctc ggcaatgttt cagcaaacag gcaacgactg 1920 gaaatacctc cacgatgaat gaactgagga tatactacca aaatgttcgc ggtctgcgaa 1980 ctaaaatcga cgatttcttc ttagctgtta gtgaaaacga atacgacgtg attatcctga 2040 ctgaaacttg gctggacaac gtcattttct cttcgcaact tttcggaaac caatattcag 2100 tatatagaac cgaccgcaac gccacgaata gtgtcaaatc caggggtggt ggagtactta 2160 ttgccgtctc atcgaaatac agcagctacc gtgatcctgc tacagtatgc aattctcttg 2220 agcagttgtg ggtacgtatc gacatgcgaa gtcataatat cagcgttgga gttatttatt 2280 tgcctcctga ccgtaaagac aatctcaacg acattcagca tcacatagac tctatagggt 2340 cggtaatatc tctgcttgga cccaatgatc tcgccttgca attcggagat tataatcaat 2400 cagcaatttc gtggaccgcc accgagcctc catctgtgga catggatcat tcacgcttat 2460 cagttgctag ctgtgcacta ttggatggct ttgatcttca cggaatgacg caaattaata 2520 caatcaaaaa ctcgaggggg cgtcttctcg atctcatact tgcgaatgat tttgctttac 2580 ctagctgtgt tttgagaact cccgtagaag cactcttagc tatcgacgaa aaccatccac 2640 cgctcgaagt tggaatttgc catacaatgc cacttacgtt cgaaagctct tttgatggca 2700 ccagatatga ttttggtaaa gctaactatt ctgagctgaa ttctgcgtta caggcgttag 2760 attggaactt tcttaatctg actgccgacg ttgatgaggc agttgagtat ttcactaatg 2820 ccgtcaacac aatcattgca tcgcatgttc ctttggctag acctccacca aagccaattt 2880 ggtcaaacag tagattacgt tcactaaggc gccgccgttc tgcagccctt cgcaagtatt 2940 gtaacaaccg atcgcagtat tttaagcagc agttgaatat tgctagtaga gagtatcgta 3000 cttacaacaa actcctgtat gctcgttatg tcagccgtat ccaacgcaac ctacgctcaa 3060 atcccaaaca attctgggca ttcgtcaaaa cgaagcggaa cgaaaatggt cttcctactt 3120 caatgcacct agggtcccaa caggcaaact cagctgcaga gaagtgcgaa ttgtttgcag 3180 cacaattcaa aacggccttc attagttcct ccacaccttc ctcccaagtt gagttagctg 3240 tatgtgacac tccacatgat gtgctggact tgaacactat ccagatcgac gaggatgttg 3300 ttttgcaagc gataagaaaa ctgaaatctt caaactctgc cgggcccgat ggtatccctc 3360 cagttatact aaagcgctgc tcatctgcac tagcagaacc gctgactaag atattcagat 3420 tgtctctcca acgtcaacgt tttccgctaa attggaagaa atcgctcctg tttcctgtat 3480 acaaaaaagg ggacaaacgt gacatcagta actaccgagg aattacatca ctctctgcgt 3540 gctcaaaagt tttcgaaatt atagtgaacg aagcactctt catctgttgc cgtcagtaca 3600 tttcttgcga ccaacatgga ttttttccaa aacggtccgt tactacgaat ttaaccaatt 3660 tcacgtcagc gtgcatacaa gcaatggatg caggaaaaca ggtagatgct gtttatcttg 3720 atctgaaggc cgcttttgac cgtgtggacc atcccatact gctgaagaaa ttggagaaac 3780 ttggtgttgc tmtgaacttt gtagagtggt tcaggtcgta tctcaccggc agatctctac 3840 gtgtgaaaat tgggtcatca ctctctgtgc ccttcaataa tgagtccggg gttccacaag 3900 gtagtaacct cggacccctg ctgttttcgt tgtttattaa cgatgtttcg ctcattttac 3960 cacctggagt aagacttttt tatgctgatg atgccaaaat atacattgtt gtaggctgca 4020 tcgacgactg cgtagaactt caaactcttt tgtgccgatt cgagagatgg tgctcaaata 4080 acagccttac gcttagtatc gaaaaatgtc aagtcatcac attcagcaga aagcgcaaac 4140 ctattacatt ccaatactcg ttgtgtggcc agtctctgga gcgagtgaat caagttcgcg 4200 atctaggtgt tctactggat caaggcctca ccttccgtca tcattataac gacattataa 4260 cgagagcgaa taggcagctt ggatttataa tgaaagttac caacgaattt caagaccctc 4320 tgtgcctaaa gtcgctgtac tgctcactgg tgcgttcaat tattgaattt gcagtcgtcg 4380 tttggtgccc ctatcatggt acttggaaag cccgtatgga aactgtgcaa aaaaggttcg 4440 tgcgcttcgc tctgaggaat ctgccttggc gtgatgatca acatgtgact ccatacttcg 4500 atcgctgcca gttactaggg atcgaaactc tagagaccag acgacaaacg atgcaggcaa 4560 tgtttattgc gaagattctc aacggtgata tagattctcc atcgcttttg acccaggtaa 4620 acgttaacgc tccagaaaga gtattgcgta ggcgacattt cttgagattg gatggccgga 4680 atagtcgcta tggacagcat gatcctattc ggtttgcatc caatacattc aatggtgtcg 4740 ctcatttgtt cgatttcgat gtgccattgg cctctattca gcaacgtttc acgacatgta 4800 ttcgaaacat gagtcgaaat aatccatagt ttagggttac tatactaatt tgacaacttt 4860 ttagattcta gtgctgagtt agttaagtaa ttgtgactgt cgttattgta accgagtttt 4920 attttctttc ttcattaaga cactcagtca gatggaggta aactgtaata aacaataaac 4980 aataaacaat 4990 // ID BEL-58_AA-LTR repbase; DNA; INV; 524 BP. XX AC supercont1.17; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-58_AA_; KW BEL-58_AA-I; BEL-58_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-524 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.17; Positions 2151700 2151177. XX SQ Sequence 524 BP; 159 A; 96 C; 108 G; 161 T; 0 other; tgttcgcata tgcgagaagc gacacctatg agtcagcgca gatcatcgat atccagcggt 60 cattccctcg ttcaacgatt accgattcga cgacagagat acagcttgcg ttttttgacg 120 ttttctgatt tccgcttgtt ccttcttccg tcaactgaca agtataaaag ggacctccat 180 agcgtatgta attctctttt tctgtgtcat cgatcgaacc gacaggtgga gtccaaatca 240 gtttcgacga gtggattgta actcctaaat ttggacatct agtgctcaag ttattgaatt 300 attatttgaa ttagtgtgaa ttacccaatt atgaactgta aatagtagaa ttaagtaata 360 aagttgaatt agagtagtgc gagtagataa caataaatta gtgaattagt taacctaaac 420 ggaaatttct taattgtgtg agaagaactt acctgaagaa cttacctgtg tgggaaggtg 480 aaaaagccgt taaatttcgt gtcacaacca ctgagcattc gaca 524 // ID DNA-TA-6_CQ repbase; DNA; INV; 931 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-931 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 56-56 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >91% CC identity. ~30 bp TIRs. TA TSDs. XX SQ Sequence 931 BP; 354 A; 128 C; 136 G; 313 T; 0 other; ccgtaaaacg gggtgacttt gatagccggg gtgactttga taggtttgcg atttttccgc 60 aaaatgaaga gtacaattaa aatacgtaag gaatggttcg gaaacatact gaccgtggta 120 gagaagtgtt caaagtacct caagaagaac ttttcataaa attttgaaaa gtttaaaaag 180 ttagttaact atagttaaga aaatgttgat gaaagtcatt attttaaact tctcaaagtg 240 tcatgatttt ctcaatgaac atgattttta atcggaaaac ggaatgcatt ttcggattct 300 ttggacaatt ttccactagg agaaggttaa ataagtttgt aaataataaa taatatgtgt 360 ttttgaaaca caatttaaaa aaatctccaa atttataggc aatttcagtt gaacaaattt 420 catgtaaaat gtgaaaactt gtgattcgtg cttcgaattc agtataaaat gcaatataaa 480 tcgataattt tataaacaaa actagtttta acaaatttca ggcaaaattc cgacttttta 540 acaattttac ctaaaattta tatgtatttt gttaaaaagc ttataaactt agttaactta 600 atataaacat tgattttttt tcttacaaac tatatcagct actttagtga tggtacattt 660 aacgtacaaa taaagtttga acatcttaaa tatgatttta acaagaaaaa ctatgactat 720 caaagtcacc ccggaattaa aaccaagaat ttttaacgta actatttttc taaacactat 780 tgaaaaaact ttttttccaa aatagtgcat ggactttgtg tggcctaccc cagtacatgt 840 tttaaaaata ataatcttga gaaaaacctt acctgttgga aaatattcta aaaacaaatt 900 gaaatcctat caaagtcacc ccggtttacg g 931 // ID BEL-61_AA-LTR repbase; DNA; INV; 785 BP. XX AC supercont1.17; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-61_AA_; KW BEL-61_AA-I; BEL-61_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-785 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.17; Positions 731666 730882. XX SQ Sequence 785 BP; 269 A; 160 C; 131 G; 225 T; 0 other; tgtgaactgt cctagtgaac agcgcgcgcc cattatgttt gtgcctccta acaggtctgg 60 ccatgctgct ctgggtgcac tgattcggaa tgacagagct gccgcctgtc acgagcgtga 120 gtagacaaga aaccgaaaac aagagaacga gtgatacaag tgaaatacct actgctaaat 180 taagttgaat ttggtaataa tcctgttaaa ttaatactta aacctaaaat aagtgtattg 240 aaatcaggta catgagccat tcgcttatat atatacccaa aactaatgta tttctataac 300 tagtgcttta aactattgta caaagtattg tgccaaccgt tgaatcccca acctacaaga 360 tcctacacct aagtcgatac gggtaagagt atttaaacta taaaactgct tcgaaatact 420 taaaaaaaga tcattgtagc tgcacaggaa taccaacacc ttttccaaac cctaagaccc 480 atccggatta gactagggga attaaacgta agtcgaacct ttgaacttaa atgaattatc 540 tacagctaaa ctacctatat aatataatat gtacttatga ctacacttat gttaaaagtt 600 atataccaaa ttaccatgca atctgtattc ttaaacaaaa tttatatttc atccttcatt 660 ccatgcagga atttattaat ctcccgtaat aaagagctgt aaactacaga ttgctccgga 720 ggttaactat acggaagaat ccccctttca gtctgagtga ctgtttgtcg tgacgaaatt 780 cgaca 785 // ID Gypsy9-SM_LTR repbase; DNA; INV; 1799 BP. XX AC Contig472; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9-SM_LTR; KW Interspersed repeat; LG_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1799 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-1799 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 756-756 (2007). XX DR Genome; Contig472; Positions 118953 117151. XX SQ Sequence 1799 BP; 645 A; 411 C; 348 G; 395 T; 0 other; tgcgctcaga gccacaattt aaacaaaaaa taaaaataaa agttaaaaag taaagtaaaa 60 ataattgaac gagtcactga aaataagcac gcaccgcacg aagataccaa aaattaatga 120 aaaacatatt aattaaaaca aaaaaaaaat ataacaaaat taaataaaaa caaataatta 180 ataaaaatca ttaaaaagca tacagctgca gaaagaaaca tatgaaaagt aaaataaact 240 gccaaaacct tgcaaaataa ggtttagaaa aaacaaaggc agaagtggcg accagaaaaa 300 aaataaaaag ggctaaaaat aatattataa gaaaatagta ataaaaataa aatccaaata 360 taacataaat agtatatcaa aaaataaaaa aaaatatata aaaaataata aaaacagttt 420 aaaattatta aaataattaa aaagccctaa aaatggcgaa aaatcgccca ttggtcgcac 480 acgagcataa aaatagtata tacacagtga tgaaattcat taatttgctt ttaaacaaaa 540 ttgtgatttt gcagaacgtt tactaatctg aataagacaa tggataataa cctcaattcg 600 gagataaatc gcctagcgaa gattccccga cttcccccat ccatagttgc cgtcaggtac 660 caatacgaag atttaaaaca aaaaacacaa cgagcttcgg gttaaatacg agcgggtgaa 720 aggcacctgg gccccgccag aaacggtaag agaactgaag cagaaaatca ctaggctcac 780 tcaggctcgc gatgagctaa tcgcagaatc acgcagccaa cggctgttaa tccgggagca 840 atcaaccgca ctccggcgag cattcacgga cgtagagctt cggactcacg tccctgggag 900 tgtccgggcc agccagctaa atatcatcat ggtccatcac ctccctatcg ccgagacccc 960 tccaaccccg tcagttccca ttcccccgcc agctgagatc gtggcgagca ctatcgtaaa 1020 tacggctcga cgtcggccaa atcgaaaccg gtgacgtcgc gaccgacgcc gccgggagga 1080 ggaactcatc cgagagacgg agtccatcga aggcccatgt cggatcgatg acttaccgcc 1140 cctttatcgg gaattcgaga ttccggggct tgatccagac atttgttatg ctcccatcgt 1200 caccttcacg tcgccctccc gatccatcca cataaatcta gtggatgagg atcctgaccc 1260 tccccgggtc gcacaagctc tcaaccatct gggactaggc accgacgata tcgccatcct 1320 cacgggcgaa gtgcctaata actcgaaacc gcaggaccct ccacccatag tgagggcatc 1380 caccaagcga atcccccgcc ctatcgaccg gccaagagga ggtggctacc caccgacatt 1440 gatgccgaag aagattaaat tgaaaatggg ccagttctat tcagcccgga agtgctcata 1500 tggattattt atttctacga aatatgttat caatgtgcgt ccccttcaat ataaattacc 1560 tgtgttttct tgtactgttt tgttttcttc ttttcgagga gtagtgttag caatgacgct 1620 cacctgctcg aatggcattc ggctattgcg tacagtacca acatgatttc ggtcatgggt 1680 acgcaatggt tattaaaatt gtgcggcgta tcaggaagat atccctagta gttaaacgag 1740 aaggtcatgt caataggata agtgatatag tacgctgaaa attcctatac gtccaacaa 1799 // ID Gypsy-606_AA-LTR repbase; DNA; INV; 285 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-606_AA_; KW Ty3_gypsy_Ele7; Gypsy-606_AA-I; Gypsy-606_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-285 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 285 BP; 94 A; 62 C; 43 G; 86 T; 0 other; tgtagcatca tcaacactac aagtagcgca tcccttgcaa catcttccat gcaacacttt 60 gcgatttact cgcaaggcac attgagacaa gcagcctgat cacaaaatgt attcacttgc 120 attacaactc tattgaacta ttcaaatatg gaactatgat gtattcaaat ctagaatgta 180 ttgtttcttg aactatataa gggaaaatcc ccaaccaaat agattagttt ttatctagaa 240 ttagaacacg ctgttttact cgttagaatc cgtgaagtcg tccca 285 // ID Gypsy-193_AA-I repbase; DNA; INV; 5221 BP. XX AC supercont1.84; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-193_AA_; KW Gypsy-193_AA-LTR; Gypsy-193_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.84; Positions 83875 89095. XX CC Positions [2915-3448] - Reverse transcriptase CC Positions [4195-4566] - Integrase core CC 'TTAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1224..2204 FT /product="Gypsy-193_AA-I_3p" FT /translation="MFTIFFQMDYDKNCSLGPFNDTVDASDLRREWEEWLR FT AFELFLELKQIDSQHERLVLMLTRGGRGLQRIYYNLRPVFGEIHPEPVRVP FT LAPQETPEYDNAIKRLNHFFVGKCNERVELEVFRSLTQSSGESFNHFILKL FT RTQAARCDFGDREEKEILQQVTMGALDDRVKDKGLENMMDLDELTNYAINR FT EILMKQKQKLHPFKVEAESAGVSAVKQVWERKPQFKARTFGKPWNEGSRAR FT LECDRCGSWKHQKDSQSCIARNSRCNNCGRTGHFARKCKAIRKMQIKARST FT WKRAGGEANALQDGDRDDVMQQRRNPVLEDSMKVE" FT CDS 2405..4210 FT /product="Gypsy-193_AA-I_2p" FT /translation="MNSRNDGIITCKIDQQPVDFLIDSGAAINTVTQQVWE FT ELINSKANLFKKRFHCDRKFLAFATQEPLRVLTIFEAWISVNESKPKSYAE FT FFVIEGSRKSLLSKRTAEDLRVLKVGLEVQTIEVKNDAFPKFPNVQVKLSI FT DSSVQPRKIAYLRIPAAMEEKVDQKLLEMLKSDVIEPVVGPPEWISPMVVV FT PKGKDDIRLCINMKYPNEAIQREHYPLPMIDTFLNKLRGSTIFSKLDITSA FT FYHIELHPESRGITTFMTSRGLMRFKRLMFGINCAPEIFQRVMSGMLAGIE FT GVVVYIDDVVVAGRTKEEHDARLQEVLAVLKENNAILNKDKCLIGVSELEI FT LGFKVSAAGICPSDEKVSAIQNFRRPETKEEARSFLGLVNFVGQFIPHLST FT RSEPLRQFIRGDVENFGESQQRAFDDLRNALSNTVRRLGFFDPKDKTELYV FT DASSVGLGAVLTQRNSDQAPRIICFASKGLTKTERVYPQTQREALAVVWAV FT EKFYPYLFGTHFTIFTDHKTLEYIYEEKHQQGKRACSRAEGWALRLQPYDF FT HVKHIPGSTNISDALSRLSAQFDTASDTPFDEATEHYLCAVGEGPTAISVQ FT VGFW" FT CDS 4219..5139 FT /product="Gypsy-193_AA-I_1p" FT /translation="MKGTTAAKTIEALESIFQEQTYPETIRSDNGPPFSSE FT EFSDYCSSKNIRLIRTIPYWPQMNGLVERQNQGILRTLRIAKAINTDWRKA FT IRDYVYAYNTTPHSITGKSPMELMTGRPVKDLLPSLRTEPSWRRDEETKDK FT DAIKKMQGKLYADQRRHARPSEIDVGDEVMLKNYETGKLEPKFRLDKFTVI FT KKSGNDVIVTNEEGVMYRRPVSHLRKWPSGEGTESSQEASKNHQPHNSKVP FT EPFLVTNDSTMEKQPKSNSFKIHSPPEPPAMKGNSSTKKGLQEQSEGSSTA FT KRPKRHKKLPSRYSS" XX SQ Sequence 5221 BP; 1630 A; 1033 C; 1262 G; 1296 T; 0 other; atttggcgca gccgaacgga ttgaataagg tgagttgatt taagaaagga atcccggaaa 60 acattttttt gcgttctgaa gtgagtgcat cgaggccatt tgcaaacgat tggaaaagag 120 ctagagagaa agaatgtgtg cgtttttttc cttcctgatc tgcctgaaga gaagtgataa 180 ttatgctact ttattcgtgc aactgtacaa gttttgaaag tgcaaatttg aatcgcgtcg 240 aagaattaga caatcgtgaa gggaggattg aaaagtgaac acaaaatgga ggaaccaata 300 attgaggatt agaagcactc acacacacgc cagccgagaa aagtgagaga aaaaaaacat 360 accaaaaaga tgcgagacca gttaaatact gaagcgcaca caaacacgaa gatgatagac 420 ggacgaaaga atcacacttt gaaaagatgc gagaccggta gaggccgtag cgcacaaaac 480 gtatgaagaa ttgcggccgg ttgatgaccg agcacacaca cacacaaaaa gatgcgagac 540 cggctgaaga ccgaagcgca cacatggacg aaagaaagag gcaaaaaaaa tatgtacatg 600 tttgctgatg tggaaatgaa acatgtggaa ggcagaagac agggttgttc ccgaattttc 660 ttgatgtaaa tattggttta gattaaagtc atcctaataa atactgtgat cagtggtaaa 720 aagtttagca aaagttgtac gctatttagg atacatacaa aagcaaaata gatgataatg 780 atgcagtttc agatttaagt gaatattgag gtaattcaaa aactaacgct gaattcgtaa 840 aaaaatctga ccagttcgct gcttactaca cttttcataa tgtgtacagg gctgttacac 900 tgatttcgca atctcattgg accttgttgc aaattttgcg gatcgattgt atttatttcg 960 taacagccct atgcatggca tgttagatgt cgcaagacag tagggttttg catgtgaact 1020 ttgaaatcag cagtgatcat tatagaagta acagaaagtg tattttgtgt ggcagaagat 1080 aactaattgt cgtgtttttt ttgtggaagt ggtccatatt ttgcttcttt tcaagaactg 1140 tttactttca cgtattgaga ccgatttttc atgctgagtt cgacgatgtg aacataaaac 1200 gtcatgctca ctaaccaaac aaaatgttca caattttttt tcagatggac tacgataaaa 1260 attgctcact ggggccgttc aacgatacag ttgatgcttc agatctccgt agagagtggg 1320 aagagtggct aagagccttt gaactttttc tggagctgaa acagattgac tcgcagcatg 1380 agagactggt tctcatgctt actcgtggtg gtcgaggcct gcagaggatc tactataacc 1440 tgaggccagt gttcggagag atacatccgg aaccggtaag agtaccgcta gcaccacaag 1500 agacgcccga atacgataac gccatcaaac gactgaatca ctttttcgtg ggaaagtgca 1560 acgagcgggt tgagctcgaa gtctttcggt cactaacaca gtcttccggt gagtcgttta 1620 atcacttcat cctcaagctt cgcacgcagg ccgcccggtg tgacttcggc gaccgcgagg 1680 agaaggaaat tttgcaacaa gtaacgatgg gcgctctgga tgacagagtc aaggacaagg 1740 gtcttgagaa tatgatggac ctcgatgagc tgacgaacta tgccatcaat cgagaaatct 1800 tgatgaagca aaagcagaaa ttacacccgt tcaaggtcga ggctgaatca gctggcgtat 1860 cggcggtaaa gcaggtttgg gagaggaagc cacaattcaa agcgcgaact tttggaaaac 1920 catggaatga aggtagccgt gccagattgg agtgtgatcg ttgcggttct tggaaacacc 1980 agaaggactc gcaaagttgc atagctagga actcacgatg caacaattgc ggccgcactg 2040 gtcacttcgc cagaaagtgt aaggcgatac ggaaaatgca gatcaaagcc cgtagcactt 2100 ggaagcgtgc cggcggtgaa gctaatgctt tgcaagatgg agatcgagat gacgtcatgc 2160 aacaacggcg caacccagta ttggaagatt cgatgaaggt agaataattt tgtctttatt 2220 tctattttac tttatgacat gaattcgtgg agcattgagc tcagggtttt attttaacca 2280 ttcttatctg ataaacttat ttgagggatt gtagtgtctc gcacgtccta gtacatgata 2340 acattgaaat gataataaac aaaattaaaa taaatttggt ctggaatcta tctcgtttgc 2400 ttagatgaac tcaaggaacg atggaattat tacatgcaag attgatcaac aaccagtcga 2460 tttcttaatc gattcgggag cggcaattaa tacggtcact caacaagttt gggaagaact 2520 cattaattcg aaggcgaatc tttttaagaa aagatttcac tgtgaccgaa agtttctggc 2580 tttcgctacc caggagcccc tgcgcgtctt gaccattttc gaagcatgga tatcggtcaa 2640 tgaatcgaag cctaaaagtt atgccgaatt cttcgtcatc gaagggtctc gaaaatcgct 2700 cctcagtaaa aggacggcgg aagatttgag ggttttgaaa gtcgggcttg aggttcaaac 2760 tattgaagtt aaaaatgatg ccttcccgaa atttcctaat gttcaggtga aattgtcgat 2820 tgacagcagt gtgcagccga gaaaaatcgc gtatttgagg attcctgcag caatggaaga 2880 gaaggtggat caaaaactac ttgaaatgct gaaaagcgat gtgatagaac cggtagtggg 2940 acccccggag tggatatcgc ccatggttgt agtgcccaag ggtaaagatg acattagact 3000 gtgtatcaat atgaaatacc ctaatgaagc gatacaacgt gaacattacc cactcccgat 3060 gatcgatact tttctgaata agctcagagg ttcgaccatc ttctcgaagt tggacatcac 3120 gtcggcattc taccacattg agctgcaccc ggaatctcgt ggaatcacga ccttcatgac 3180 aagcagaggg ctcatgcgat tcaagcggtt aatgttcggc attaactgtg ctccggagat 3240 cttccagcga gtgatgtccg ggatgttagc tggcattgaa ggagttgtgg tgtacatcga 3300 cgacgttgtt gtagctggaa ggactaagga agagcacgat gctcgactgc aagaagtact 3360 ggccgtcttg aaggagaata acgctatatt gaacaaagac aaatgtctaa tcggagtatc 3420 tgaacttgag attttgggat tcaaagtgag tgcagcagga atttgtccat cagatgaaaa 3480 ggtttcagct atccagaact tccgaagacc ggaaacgaaa gaagaagcta gaagttttct 3540 tggccttgtc aactttgttg gacaattcat cccacatctg tcaacgagat ccgagccatt 3600 gcgccagttt atcaggggtg atgttgagaa tttcggtgaa agtcaacaga gagcttttga 3660 tgatcttcga aatgcgttgt caaatactgt tcgcagattg ggatttttcg acccgaagga 3720 caaaaccgaa ctgtacgtcg atgcttcttc ggtaggtctt ggagcggtac ttactcaacg 3780 aaacagcgat caagcaccaa gaatcatatg ctttgcttca aaagggttga ctaaaacgga 3840 aagggtgtac ccacaaacgc aacgagaagc attagcagtt gtgtgggcag tcgaaaaatt 3900 ctacccatac ttgtttggca ctcatttcac aatatttacc gaccacaaaa cattggaata 3960 tatttatgaa gaaaagcacc aacaaggaaa gcgtgcttgt tcaagagctg aaggatgggc 4020 gttgcggctt cagccgtacg actttcatgt gaagcatatc cctggttcaa ccaacatttc 4080 ggacgctctc tcaagattga gcgcgcagtt cgatacggca tcagatacac cttttgatga 4140 agctactgaa cattatttgt gtgcagttgg agagggtcca acagctatat ctgtacaggt 4200 tggtttttgg tgacggagat gaaggggact actgctgcaa aaacgattga agcactcgag 4260 tctattttcc aggagcaaac atatccggag acaattcgga gtgataatgg accaccgttt 4320 tccagtgagg agttctctga ctactgttcc tctaaaaata ttcggcttat tcgtaccatc 4380 ccatactggc cacaaatgaa cggcctcgtg gagaggcaaa accaaggtat tttgcgtacc 4440 ttgcgtattg ccaaggccat taacacagac tggcgcaaag ccatccgcga ttatgtgtat 4500 gcgtacaaca cgacgcctca ttcgataaca gggaagtcac ccatggaact gatgacgggc 4560 agaccagtta aagatctgct accatctctg cgaaccgaac cctcttggcg ccgagatgag 4620 gaaaccaaag acaaggatgc gattaagaag atgcaaggaa agctttatgc agaccagcga 4680 agacacgcca ggccatccga gatcgatgtt ggagacgaag tcatgctcaa aaattacgaa 4740 actggcaagc tcgagcccaa attcagactt gacaaattta ctgtaataaa gaaaagtgga 4800 aatgatgtga tcgtcacaaa cgaagaagga gttatgtatc gccggcctgt gtctcatcta 4860 agaaaatggc cgtccggaga aggaaccgaa tccagccagg aggcttcgaa gaaccatcag 4920 ccccacaatt caaaggtacc ggaacccttt ctggtgacaa acgattcgac catggaaaaa 4980 caacccaaat cgaactcctt caaaatacac tcaccacctg agccaccagc gatgaaagga 5040 aattcatcta cgaagaaggg actacaagaa caaagcgaag gcagcagtac tgcaaaacgt 5100 cccaaacgtc acaagaaatt accatctaga tacagttcgt aacttcagca atcagtgtag 5160 gctagactag aggttagatt gttttttttt gttgtttttt tctatcagag tagaaaaggg 5220 a 5221 // ID Gypsy-21_SI-I repbase; DNA; INV; 4821 BP. XX AC AEAQ01024002; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_SI_; KW Gypsy-21_SI-LTR; Gypsy-21_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4821 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024002; Positions 3479 8299. XX CC Positions [3189-3767] - Integrase core CC 'ACCG' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1655..2962 FT /product="Gypsy-21_SI-I_1p" FT /translation="MLDDDVIERSRSQWSSPVVIVRKKDGKFRFCIDFRRV FT NDATEPDAYPLPQIPATLDKLRGAKFLSTLDLKSGYWQVPLSPASRLITAF FT TVPGKGMFQFKVMPFGLHSAPATFQRLLDSILGPDFEPRVLVYLDIIVVSV FT TFEEHLQTLREVFHRLREAKLRINPEKCRFCVERLKYLGHIVDRDGIRTDP FT EKVKAVTDWPPPATVKQVRQFLGVAVPSVYPRLRYRGRAADRANKKTRQVE FT MGGRGTGRVSNAKTHPHRCPRADLPRLLQAVRTANRREKHRSRSRVDAILS FT RSRARNSIREPNAESGRTELQRHRARVLGRDLRDPPDARISRRVPLHRRHR FT PPVATVATETRHAHGTAGTVGVQATAIRRRNQISERRAEPRRGRLITATHK FT RRGKSHPTGPLVQARATRCTRKSRRATRLRDPRGEAVPPRPA" FT CDS 2640..4169 FT /product="Gypsy-21_SI-I_2p" FT /translation="MREYLEGYPFTVVTDHQSLRWLQKLDTPTGRLGRWAF FT KLQQYDVEIKYRKGALNRVADALSRQPTSAAANLIPPDRWYRRALRAAREN FT PAVQPDFAIREGRLYRHVLHSLDFNESPAEEQWKCCLPRDEWPEILRRNHN FT DAAAGHLGVTKTIARIAQTYYWPGMFREIAAYVQKCENCLRHKVEQRRPAG FT NVHATNVTRLWQLVTVDLVGPLPCSRKGHTWLLVMQDRFSKWTELHPLRQA FT TAPAVTRGIAEQVLLRHGCPESIISDNGTQLKSRELRELLHAYRIRHVCTP FT THAPHCNPVERTNRVIKTMIAQYVDRDHCNWDERIPELAYNTARHEATGYT FT PAFLNLGREIFTPARVGEATPPPTPPPDNTRRYLEEAYDLVRVHLARAFQR FT QQKYYNLRRRPWQPKIGEWVWKREHPLSNKASAFNAKLAPKFRGPLEVRYK FT ISPVIFDLRDRHGRWTRHVHIQDLKPAPRETDDRGNTDNKATDPGEKTEDE FT QPNNNNGDDSSGEA" XX SQ Sequence 4821 BP; 1216 A; 1577 C; 1285 G; 743 T; 0 other; tggcgcccga acagggacaa gaccggaaga aaggaagggc cgccggccta aaaagctgga 60 tctacgagct acataaggac gttctcctcg aggagatgga gacgctcggg ctagatacag 120 agggaaacct cgatacttta cggcaacgca tgagccaata tgtcgaggaa aatccggacc 180 tgttccgcgc agccgcgctt cccccgccga ccgaaccagc aacgaccaga ccgacggcat 240 taacggcgac ctccgccgca catcccgcac cgacgctgac gctcctttga ccgccgcgac 300 tcaccctgca ccgtgcaacc cccaccgaca ccagcggacc tccggatggg tccacccccg 360 ctaaaatcat gaatcaaatg cggaaatggg gggtacattt cgacgggaaa acccgtgggg 420 ttttctggag cgggtcgagg aactgcgcgc cggctacgat ttttcggatc atcaggtctt 480 gctggggcta tcagaattat tgaaagggga tgccctcctg tggtaccgaa actcacggaa 540 cgcgtgggac aaatggcgcg acttcgtcag tgacttcaag gcgacctacc tgccaccgcg 600 ctaccggagc tatctactgc gctagatacg cgaccgcgtt caaaagccag gcgaaccgta 660 tctcaaatac gcaacgacgg tcctcttctt aatgcgaagg gcagagaatt tctccaccga 720 ggagaaagtc gaccagctgt acgataacat gcgccccgaa ttccaactgc acgtacgacg 780 agccgacatc cgcacccccg ccgagctact ccaacgcgtc gtcaagatcg aacgttgccg 840 agcgctcccc tccagtcacg cgcaggaacg caagaacccc accgttgccg cgacgtacga 900 tcggagcgaa tgttgttggc gatgtaaaca gcgggggcac acgcgtttcg aatgccgctg 960 attgccgcgc aaattttgct cgcaatgtgg caaagacggc gtgctgaccc gcgactgcca 1020 tccgccggcg ggaaacggcg ccaggaccgg cgacactccg gccgaccacc ggtcctccga 1080 ataaaattta cgccccgacc gcacctaacc gtgcgtttgc gcgaccagga cctcacggca 1140 cttctcgata ccgggtcgga ggtgtcgttc attaacctcg ccaccgcccg ccacgccgag 1200 gcaagcggct ttaagatcca acgcgagagt aataccgtac agctggccga cggtcagtcg 1260 atggaactgc ccggccacgt acgcttgacc gtgacaatcg gacaccgaca catctgacac 1320 aaattccgca taatgccaag catgaaaacc accatgctcc tcggtataga cgcgtgggca 1380 aaagtaggcg cgcccatacc tatccccccc gccggactac ggggccgcga ggagaatcac 1440 gccgtgacca acgtatccac gttgctcacg cccaccgaat cagctgagct gaaccgcttc 1500 ctcacaaccg aactcgcgca gttcgagggc gtagccagcc cgacgacaat aacgcagcac 1560 atcatccggt tgaaacctgg ccgcgaaccg atcaaacagc ggtaccgacc gcgaaatccc 1620 gctatgcagg ccaattgacg ccgaaatcac ggcaatgctc gacgacgacg tcatagaacg 1680 gtcacgaagc caatggagct cccccgtcgt aatagtgcgc aagaaagacg gcaagttccg 1740 attttgcatc gacttccgtc gcgttaacga tgcgaccgaa ccagacgcgt acccgcttcc 1800 ccagatcccc gctacgctcg acaaactacg cggcgctaag ttcctatcca cactagactt 1860 aaaaagcggc tactggcaag tgccactatc ccccgcgagc cggctaatca ccgcgttcac 1920 agtacccggc aagggaatgt ttcaatttaa agttatgccg ttcggcctgc actctgcgcc 1980 cgccaccttt cagcggctgc tcgattccat actcggaccc gacttcgaac cgcgcgtgct 2040 ggtctacctc gacatcatcg tcgtcagcgt cacgttcgag gaacacctgc agacgctacg 2100 cgaagtcttt caccggctcc gcgaggccaa gctacgaatc aacccggaaa aatgccggtt 2160 ctgcgtggag cggctcaagt acctcggcca catcgtcgac agggacggta tccgcaccga 2220 ccccgaaaag gtcaaagccg tgaccgactg gccgccgccc gctaccgtca aacaggtccg 2280 acagtttctt ggcgtcgcgg taccgtcggt ttatccaaga cttcgctacc gtggccgcgc 2340 cgctgaccgc gctaacaaaa aaacacgcca ggtggaaatg gggggccgag gaacaggccg 2400 cgtttcaaac gctaaaacgc accctcaccg ctgcccccgt gctgacctgc cccgacttct 2460 ccaggcggtt cgtactgcaa accgacgcga gaaacaccgg tctcggagtc gtgttgacgc 2520 aattctttcc cgaagtcgag cgcgtaatag catacgcgag ccgaacgctg aatcaggccg 2580 aacggaatta cagcgccacc gagctcgagt gcttggccgt gatctgaggg atccgccgga 2640 tgcgcgaata tctcgaaggg taccccttca ccgtcgtcac agaccaccag tcgctacggt 2700 ggctacagaa actcgacacg cccacgggac ggctgggacg gtgggcgttc aagctacagc 2760 aatacgacgt cgaaatcaaa tatcggaaag gcgcgctgaa ccgcgtcgcg gacgccttat 2820 cacggcaacc cacaagcgcc gcggcaaatc tcatcccacc ggaccgctgg tacaggcgcg 2880 cgctacgcgc tgcacgagaa aatcccgccg tgcaacccga cttcgcgatc cgcgagggga 2940 ggctgtaccg ccacgtcctg catagcctag actttaacga gtcaccggcc gaggagcaat 3000 ggaagtgttg cttaccgcgc gacgaatggc cggaaatcct ccgccggaat cacaacgacg 3060 ccgccgcagg ccacctcggc gtgaccaaga cgatcgcgcg aatcgctcag acgtactatt 3120 ggccgggcat gttccgagaa atcgccgcgt acgttcaaaa atgcgaaaac tgtctgcgac 3180 acaaagtgga gcagcgcaga ccggccggca acgtgcacgc gacgaatgtc acgcggctct 3240 ggcagctggt caccgtagac cttgtaggac ccctcccctg ctccaggaaa ggccacacgt 3300 ggctactggt catgcaggac cgattctcaa aatggacaga gctacacccc ctccgacaag 3360 cgacagctcc cgccgtaacg cgcggcatcg cggaacaggt cctgctgcgt cacggatgcc 3420 cagaaagcat catctccgat aacggaacgc agctaaaatc gagagagcta agagaactcc 3480 tacacgctta ccgcatccga catgtgtgca cgccgactca cgcgccgcat tgtaatccag 3540 tagaacgaac gaaccgcgtg atcaagacga tgatcgcgca atacgtggac cgcgatcact 3600 gcaactggga cgaaagaatc cccgaactgg cttacaatac ggcgcgacac gaggcaaccg 3660 gttacacacc ggcgttcctg aatctcgggc gagagatatt cacccccgcg cgcgtcggcg 3720 aagcgacgcc ccctcccact ccgccgccag acaacacacg acgctacctc gaggaggcct 3780 atgacctagt acgcgtacat ctcgcacgcg cgttccaaag gcaacaaaaa tactacaacc 3840 tccgtcgtcg cccatggcaa ccaaaaatag gcgaatgggt gtggaaaaga gagcaccccc 3900 tgtccaataa agcctccgcg ttcaatgcca aattggcacc gaagttccga gggcccctcg 3960 aggtacgcta caaaatctcg cccgtgatct tcgacctacg cgacagacat ggacgatgga 4020 ctcggcacgt acatatacaa gacttaaaac cggcgcctcg cgagaccgac gacaggggaa 4080 acaccgataa caaagcaacc gatcccggag agaagaccga ggacgagcag ccaaacaata 4140 ataacggaga tgactcgagc ggagaggcat aaacaaaggc aaaccgtgac gacagccaaa 4200 gtcaaaaccc cacagcaacc gtcacttgca agacgaccgc tcgacgagat ggatcgcacc 4260 gagcaagaaa tcataggcaa aatcaatcaa ctcctggcgg agtacgagga gtacgatccg 4320 gcgaatccag ggatcgatcc ggcctggaac cagctccggc ttcgtcaacc gaccccgccg 4380 ccgcagacgg cagcaacaga aatcccgcgc agtatcgcga tcgccggcca aaagggagga 4440 cgtctcgtct agccctccga tcgacggaac cgggccaaaa tggagtggct aatcaatcca 4500 cccaccgcac ggccgaaatc ccgccgacag cctgcggcaa catcaccgca gcacccgtcg 4560 cttgaaccgg cgcgagtcat gggaccactg ccacctccgc cgattgaagt ggaggtcgaa 4620 ccagggcacc gaatcagcgt gccacacttc gctgcccacc gcgcatgaca gtggaagatc 4680 cgctcgggag gaaagcggtg gcacgtacgc tacaaccacg acggaagcgt acgccgcgtc 4740 cgagaaattc cgccgaatta caacaaggga acacatatgt ctcgtgcctc aaattcaggc 4800 acttcctaga aggggaggtg a 4821 // ID DGLT-A1_LTR repbase; DNA; INV; 268 BP. XX AC AF298204; XX DT 15-SEP-2005 (Rel. 10.09, Created) DT 15-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Dictyostelium discoideum gypsy-like LTR retrotransposon DGLT-A1 DE (LTR portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; DGLT-A1_LTR. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-268 RA Glöckner G., Szafranski K., Winckler T., Dingermann T., RA Quail M.A., Cox E., Eichinger L., Noegel A.A. et al.; RT "The complex repeats of Dictyostelium discoideum."; RL Genome Res 11(4), 585-594 (2001). XX DR EMBL/GenBank/DDBJ; AF298204; Positions 1 268. XX CC LTRs differ by 1 bp substitution. This appears to be a recent CC insertion. XX SQ Sequence 268 BP; 134 A; 21 C; 13 G; 100 T; 0 other; tgtaataaag ttactctata aaattaatag atattaaata tgcatattac attactaatt 60 atagttgaca cacatttaaa taatataata ataataataa taataatact gtcaactcat 120 tataaataat ataaatacta aagaataatt ttataaacaa tattgaatat tataaatata 180 aactaatatt aaataaagat aacaaagtat tattcaatta tataattatt aattatatat 240 atttataaag ctaacaagct ttattaca 268 // ID BEL-5-I_HM repbase; DNA; INV; 4566 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4566 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 438-438 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(28..1080,1074..2012,2016..2579,2536..4230) FT /product="BEL-5-I_HM_1p" FT /translation="MDKGRKRNYVGNIRFTTQNLHTQGGREFNKCLYCQGN FT HSSTKCSVVTNIEARVATLRKLARCFVCLRSGHTSKNCSSNFVCNKCGKRH FT HISICNKDSDHNKYKXTKNQEAKETNATHTWSNNSNTVLLQTATTRVSDCQ FT NFLSKKVCLXFDSGSQFSYITEELRNALKLKTLRKEKLIINTFGNSIGQVK FT VLDVIQLRVKHKINKGWTYLEALCIPEICTPLKMQNISEAVNNYKHLKHLS FT LADNNFNTTIKVDILVGIDFYYSFVTGEMIRGSDGPVXIYSSLGWILAGQS FT KLSIPVDSNKHYSSHSMKCSVFPLRNNVDSDLKDQLEKFWELEEIGELKDS FT VMHQFEKDKRLIKFNGKRYVTKLPFRPDHMMLSDNFNVAKKRLKNLYIKLK FT SDKKLLKDYSNIFAQYEKDGIIERVTGXEITKPPGSVCYLPHHPVIREEKE FT TTKVRAVFDASCAKNSPSLNDILYPGPNLLSKIFNILXRFRVNVIGITSDI FT KQAFLNIEIVPEHRDFLRLLWFSNLSRDDQLVIYRFTRVVFGLTSSPFILN FT GTMQHHLNKYLSSYPEFIKKIIPDWYVDDLVSGCDSLNDGVQFYHTAKDIM FT LKGGLELRKWTTNNIELQNHINKCEGVNLYIDDKVDINTQLSNKTVLGVNW FT DIKNDYFVFKFQSLFKEKLLPKTKRNILRLAASIYDPLGLISPITTRVKTI FT FQILCKDKLEWDEEIPDSIKLLWESFVLQLSKIHXICFPRYVIKEIKEDII FT SIELHAFSDSSIQAYASVIYIRVISRTSIKTCLLTSKTKVAPLKTLSIPRL FT ELLGCHLSSKLMKQVQHSICDKVVISKIVAWSDSLVALDKRKIMLLEAMGL FT HWIKEKSCCWKPWVENRVVKIRQIIDCDSWHFVHGEINPADIPTRPDDLSH FT CSSSRWVEGPEFLKEKSIIFPMFEFDKNNLKGEANKECSKRGKPSEVVSNL FT SSCTLFCGEVVLKNIIDINRYSSLQNLVRVTSYVLRFAKKVLKNKRSIDDN FT ILTNDILTVNEYNYGLHLWVKEVQKDIRNDKKFNKIKSTFKLFEDQYGLLQ FT LKGRYCNSQVLNFEQKHPIILLRNNFLCYLIIQDAHKKVFHQGVESTLNEV FT RKLFWITQGRKAVKSVLHKCVICKFFQGXTLTKTIEPDLPEFRIQSTRSFK FT YTGLDYAGPLFIKENNDKANILKVYVLLLTCATTRAIHLELTNDLQTPSFL FT RALRRFISRRGSPELLISDNAKTFKAKPVREFMLNRGIRXQFILPASPWWG FT GFYERLVRSVKLCLKKMLHKEIVTYXELSSVLCEVEYXINQRPLXYVSDDE FT LXESLTPFHLIFGADISRRESIVSENAPICLYPQDCSNRTKYIRKLFESIW FT NRFYNSYLNELRQHNLYRKECSASNNCQLVVGDVVLIKDENKTPRGLWRIG FT KC*" XX SQ Sequence 4566 BP; 1626 A; 640 C; 828 G; 1444 T; 28 other; tagcaaaaga aagatgtcaa agtcttaatg gacaaaggga gaaaaagaaa ctacgttgga 60 aacattcgat ttacracaca aaatttacat acacaaggtg gtcgagaatt taataagtgt 120 ttgtattgtc aaggtaatca ctcttcgact aaatgcagcg tagtcaccaa tattgaagct 180 cgtgtagcca cattaaggaa attagcaaga tgttttgttt gtctgcgaag tggtcataca 240 agtaaaaatt gcagtagtaa ttttgtatgt aacaaatgtg gaaaacgtca tcatatttca 300 atttgtaata aggacagtga tcataataaa tataaaawta caaagaatca agaagcaaaa 360 gaaaccaatg caactcatac ttggtcaaat aatagtaata ctgtacttct ccaaactgcc 420 acaacccgag tttcagattg tcaaaatttt ctctcaaaaa aagtttgttt aatrttcgat 480 agtggaagtc agttttctta cataactgaa gagttaagaa atgctttaaa actaaaaacc 540 ttgcgaaaag agaagctcat tataaataca tttggaaact caattggtca agtaaaagtt 600 ttagatgtta tacaacttcg agtaaaacac aaaattaaca aaggatggac ttatttagaa 660 gcgttatgca tcccagagat ttgtacaccc ttaaaaatgc aaaatatttc agaagcagtg 720 aataattaca aacatttaaa acatttatca ttagcagaca ataattttaa taccacaatt 780 aaggttgata ttttagttgg aattgacttt tactattcct ttgtgactgg agaaatgatt 840 cgtgggtctg atggtcctgt arccatttat tcatcactag gttggattct ggcaggacaa 900 agcaaattgt ctattccagt tgactcaaat aaacattaca gtagtcattc tatgaagtgt 960 agtgtctttc ctttacgtaa caatgttgat agtgatttaa aggatcaatt agaaaaattt 1020 tgggaacttg aggaaattgg agaattaaaa gatagtgtaa tgcatcagtt tgaaaaagat 1080 taataaaatt taatggcaaa agatatgtca cgaaactacc ttttagacct gatcacatga 1140 tgttgtctga taatttcaat gttgctaaga agcgtttaaa aaatttatat ataaaattaa 1200 aatctgataa aaagctatta aaagactata gtaatatatt cgcacaatat gagaaagatg 1260 gtataataga gagagttaca ggtratgaaa ttaccaaacc acctggatca gtatgttatc 1320 taccacacca tccagttatt agagaagaaa aggaaacaac aaaagtgcgt gctgtttttg 1380 atgcttcatg tgccaaaaat tcaccttcgt taaatgatat cttatatcct ggcccaaact 1440 tgttatcaaa gatatttaat attcttmttc gtttccgtgt aaacgttatt ggcatcacgt 1500 cggatataaa acaagccttt ttgaatatcg aaattgtgcc tgaacataga gactttctac 1560 gtttgctttg gttcagcaat ttaagtcgtg atgatcaatt agttatttat cgatttactc 1620 gtgtagtgtt tggtcttact tcaagtccat ttattttaaa cggaacaatg cagcatcatc 1680 tcaataagta tttgtcttct taccctgaat ttattaaaaa gattatacct gattggtatg 1740 tggatgactt agttagcgga tgtgatagtt taaatgatgg tgtacaattt tatcatacag 1800 ctaaagacat tatgttaaaa ggaggtttag agcttaggaa atggacaaca aataatatag 1860 aattacaaaa ccatattaat aaatgcgaag gagttaatct ttatattgat gataaagttg 1920 atataaacac tcaattaagt aataaaacag tattgggagt caattgggac attaaaaatg 1980 attattttgt atttaaattt caatcacttt tttaaaaaga aaagttgtta cccaagacaa 2040 aaagaaacat tttacgttta gctgcatcaa tttacgatcc tcttgggcta atatcaccaa 2100 ttactactcg tgttaaaact atatttcaaa tactttgtaa ggataaactg gagtgggatg 2160 aggagattcc tgatagcatt aagctcttat gggagtcctt tgtattgcag ctttcaaaaa 2220 ttcatyttat ctgctttcca agatacgtta taaaagaaat aaaggaagat attatatcca 2280 tagagctaca tgccttttcg gacagttcaa tacaagcata tgcttctgtt atttatattc 2340 gtgttatttc aagaacttca ataaaaacct gcttactaac ctcgaaaact aaagttgctc 2400 cacttaaaac tttgtcyatc ccgcgattag aactattagg ttgtcatttg tcatctaaac 2460 tcatgaaaca agtccaacat tcgatttgtg ataaagttgt aatatcaaaa attgtagcat 2520 ggtcagattc tttagttgca ttggataaaa gaaaaatcat gttgttggaa gccatgggtt 2580 gaaaatagag ttgtgaaaat cagacaaata atcgattgtg atagttggca ttttgttcat 2640 ggagaaataa acccagctga cattcccact cgtcctgatg atttgtctca ttgtagtagc 2700 agtagatggg ttgaaggtcc agaatttctt aaggagaaaa gtattatctt tccaatgttt 2760 gaatttgata agaataattt gaaaggagaa gctaacaaag aatgtagtaa acgtggtaag 2820 ccaagtgagg tagtgagtaa cttgtcatcg tgtaccttgt tttgtgggga ggtggttctc 2880 aaaaatataa ttgatataaa caggtatagc tctctacaaa acttagttag agttacctca 2940 tatgttttac gttttgccaa aaaagtgttg aagaataaaa gaagtattga tgacaatata 3000 ttaacgaatg atatactgac agttaatgaa tataattatg gtttacatct gtgggttaaa 3060 gaagttcaaa aagacattcg taatgataaa aagtttaata aaataaaaag tacatttaaa 3120 ttgtttgagg accaatatgg tttgttgcaa ctaaaaggac gttattgtaa tagtcaagtt 3180 ttaaattttg aacaaaagca tccaattatt ttgttgagaa acaatttttt atgttatctt 3240 ataattcaag atgctcacaa aaaagtattc catcaaggtg ttgaaagcac tcttaatgaa 3300 gtaagaaaat tattttggat tacacaaggc aggaaagcyg tcaaatctgt gttacacaaa 3360 tgcgtaattt gtaaattctt tcaagggyga acacttacca aaacaattga acctgacctt 3420 cctgaatttc gaatacaatc aactaggagc tttaaatata cgggacttga ttatgctggt 3480 ccattgttta taaaggaaaa caacgacaaa gccaatatat taaaggtata tgtgttatta 3540 ttaacttgtg caacaactag agctattcat ttagagttaa caaatgatct tcagactcct 3600 tcatttctaa gagctttrcg aagatttata tcgagaagag gaagtccaga acttttaata 3660 agcgataatg caaaaacctt taaagcaaaa cctgttcgag aatttatgtt aaatagaggc 3720 atccgamaac agttyatcct tcctgcttca ccytggtggg gtggttttta tgagcgatta 3780 gtacgttctg taaaactttg tctcaagaar atgttacaca aggaaatagt tacatayrac 3840 gagttgtcat cagttttatg tgaagtcgag tatgyrataa atcagcgacc gttartttat 3900 gttagtgatg atgaactgwg tgagtctttr actccatttc atttaatttt tggtgctgac 3960 atttctcgta gagagagtat agtatctgaa aatgcaccga tttgtttata tccgcaagac 4020 tgctccaatc gaacaaagta cattcgaaaa ttatttgaaa gtatttggaa cagattttac 4080 aatagttatc tgaatgagct gagacaacac aatctgtacc ggaaagaatg ctctgcctca 4140 aacaactgtc aattagtcgt aggcgatgta gtgttgatta aagatgaaaa caaaactccg 4200 cgtggattat ggaggatcgg aaaatgttga gaagttaatt actggtcgag atgggattgt 4260 gcgaggagca gagttagcag ttatttctaa agaaaaaagg tgtacaaaaa ctatcaggcc 4320 tttacagaaa ttaattccgc tagaagttag tggtgaagca caacargttg acgcgttaac 4380 acaaaataaa tttgaaaata ctraacaawt tgawaatact cgttctaggc gaagagcagc 4440 aattgtcggc caggaaacta gaaggatctt agatcaaatt tattaaaatg amgttcttat 4500 attatttatt gttttgttac tatatytgta ttgttgacta cgatggtgta gtcaacgggg 4560 ggagtg 4566 // ID DNA4-8_AP repbase; DNA; INV; 247 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-8_AP. XX NM DNA4-8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-247 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1955-1955 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 247 BP; 81 A; 43 C; 39 G; 82 T; 2 other; gaagggctcg gtgcaacaac cagacaagtc cacttttatc aattgttcag ganaagcaaa 60 acaaaattct ttatctcatt taccntaatg ggcgggaaat aattcatttt gagatatctt 120 tggaaataat aaagcgattt ttctgatttt tctaccaaaa tatagatatt tcttccatga 180 ataatattcc atgaaaaact catttttgat aaaaatggac ttgtctggtt gttgcaccga 240 gcccttc 247 // ID Gypsy-68_AA-LTR repbase; DNA; INV; 1719 BP. XX AC supercont1.280; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-68_AA_; KW Gypsy-68_AA-I; Gypsy-68_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1719 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.280; Positions 1007921 1006203. XX SQ Sequence 1719 BP; 450 A; 352 C; 428 G; 489 T; 0 other; tgtaacggta gccctttttg ttatttgttt atttgttaat tgtttattta tttattgctg 60 ttctattaaa agttatgtaa ttatattgga attcatttaa tttaattatt catttaactt 120 aattatcatc taaagttaca tggacattat aataaccgaa atgaaacgaa caatattcag 180 aattgcatgc aacttttcta tgcgtagaga ggttaaccgc tacgcagcgt agatgcgagc 240 gctgtaacga gaatcacttt tcgtgttctc gttatttggc tagggaaccc cgaagtagtt 300 ggcttttgaa tgctcggctt tacaacggca tgaaatgggt agacaatcca aaaacgaagg 360 agatttctat aactacagtt gcacgaacat gattaacgga gaccgttgag cttccgtttc 420 ctggagccat accaaagggt cctatctgag gagtcacaga gattgtctcg ttcactttgt 480 cccagatcgt ggcagaagaa ggatcgcgaa gcagtgtagt ttccgtggtg ttgtaattgt 540 aagttattaa aacgtgaggt gtactttgtc ccgtattaat tgtctaaccg tatccccgtg 600 taggcttttg taagatagct agttgtgcgg tgagagattg tgcgagtgaa cttcggtcag 660 cgagttcaaa ggtaaatgac ttcttatgtg tgtcctatat gtgctaatcg tgtcgccaaa 720 gccaagcgtg agctacggct ttctcgccct aaaaatagaa aaccgcacgt acatcgcaca 780 aaaaggagaa cgagaagctt gctccgcacc accttgcgtc gcgcatccat ctgcgttctg 840 gcgccaaagt ccgcccggcg cctcgaagcc ggaagggagt agcgaccgtc accaacgcac 900 tccaagcgca tttccgtcaa tcaagcttcg gaagttggat cacgagcccg agaagtgcct 960 gtgaacctcg tgagcagccg tgaaagtcgt cgaagcgtca gcatcccttg agcgaccatc 1020 ggcatccagt cgacgtcatc gatagcgtca tcgtcgacgt agtaagatac aaaccgtgag 1080 tacgttccac gaaaccttgc atgcaagagt gttagcacgg acacagagaa gttgagttag 1140 aaatccgcac aagttagggc cagaagttca aaggggattg tagaagaaag gccgtcggcg 1200 tgaccgatat ggaaacctag gttatcaaat tatagttcaa atatacgtcc tacagaaagt 1260 ccttgtgtta atggaaattt cgtgctagtt agtatggtta agaaagttca aagtcctttt 1320 cacgtttcgc tggtcctggt tagttcccgt tgtctgttga ataatccgtc ttctaaaacg 1380 gaatgggctt gagtgggtgc gatatctctg ccagtgatga tacataggat taggtcgagt 1440 tttgggagtt cggtttgttt ggtggaatct tttggttatg ttcgtgtggg ctttagttgt 1500 cgaaatagga gattccttcg aggcaaattt ggataacgag tctgcttact tgttctcgcg 1560 tattgttgag catttgggag ctgactcttc ggtcagtatc tagccgggca acatagaaca 1620 cgacgggtgg tcctcctagg aggtggcgca taagtcactc gtttaagaaa ccagtccgga 1680 atcaggtagc agactcctag ccgccctatg gccgctaca 1719 // ID hAT-37_SM repbase; DNA; INV; 3645 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-37_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3645 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1840-1840 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(1311..1754,1758..3476) FT /product="hAT-37_SM_1p" FT /translation="MEDDNCNCIIEKLFEQSFGSRLFDDKMEIIKNGKPTP FT SLENLRTRTKKCVRHFSTDKYQQIEWLTGCKSFCALYCWSCLLFKHEKNVW FT NDKVGFSDLNNFEKAVKRHAASASHLQCMIALKKFGTQQRIENQMDTMRKI FT SVQQFNEKVKNRQILSWLIDVVCFLGQQELAFRGNDESAASINRGNYIEII FT NLLSEYNPLLKEHLDNATVFSGLSSDVQNDLINAISNVVTEKILSEIKETD FT FVSIILDETTDSSNKSQLAIVLRYVSNDGAILERFIKFIDVSLTRDAKALS FT DIILTFLENEKLNHKLVAQSYDGAAVMSGESGGLQALIKNVIKSANFVHCL FT AHRLSLVLQKSCEKENQCKVFFTTLSGLARFFSKSSKRAAAFDQYSSQRLP FT KVSPTRWEYNGRLVKTVSTHLQNLKLYFSSIKNQTSDFDWDKDSLILATGF FT LQFLKDPDFLVLLYFFNEIFPQSEVLYDILQKKIYDISYCIRKITEFKNMI FT DELMIQKFEEHFEIINLELNILQLASQKRELSGKNISQHYKVLINKIGQKM FT ISEMENRFDSLKQIKFIELTDTSQYENFQKQFPVDLINNLKINYMQYFDIQ FT KLETELKVIFCDSDIKNKKYADEILSFMRNCDLDNCFNEYTKLCKLVLTIP FT PTTATAERVFSSMKRIKNYARNKMGSERLSSLAVISIEKELLKTLKSTENF FT YDAVINIFSQKDRRIFLKYKY" XX SQ Sequence 3645 BP; 1340 A; 502 C; 566 G; 1237 T; 0 other; agtgacgaag cgtccatgga agcaacggaa gcaatgcttc ccttacagtt tccaactttt 60 tttcccctaa tttcctttaa aatgttatag gtcaaatgat attttcacag aaagaacaac 120 atgatgaaat tgtaaagaaa aaaaagaatc aattttataa ttacttttac tgatttgtca 180 gttaatacgt agaacaaaat ttcttatagt aggtcggtac atatttaatt ataaataatt 240 ataatatagc tgtatacttt tgtttctcat atgcaagcaa gggaatcaat tttaaaatta 300 cttctactaa ttttcttaat acgcagaaaa aaaatcgtat agtaggcgct gcaaattcaa 360 ttatatataa attataatat agcggtatat ttttgtttct gatatgcaaa caagggaatt 420 aattttaaaa ttacttctag taattttgtt aatacgcaga aaaaatattg taagtaggct 480 ctgtgtattc aatttcaatg atacataagt aaaagtaata taattctaat aactgtacat 540 ttttgtttct catatgcaat gcaaaatgca agcaagggaa tcaattttga aattacttct 600 atcaattttg ttaatactta gaacaaaatt tttatagtag gcgatgcata tttaatgata 660 caaatttatg ctataatgaa aagcggtgac cacaatagcc ttaaagcaaa attaatgacc 720 cttaattttt cattaatttt ggtacgttac acgttttctc atataattga ccacaatgga 780 cttttacaaa atttcgtcaa gttttcgtta aacttaggca ccttaatttt gtacggttta 840 aaccgtacaa aattaatggg taaccttttt tcatacgata tgattaattc cattatggtt 900 gtgtttaatt tgcatgagtt ttgacagtaa ttaatcaaaa ttaaatgaaa tgaacgaaaa 960 aaaacaaaaa aaacgtaagt tgttaatttt gtttaaggct attgtggtca ccgctaaagt 1020 gatacaattt attctaatta gtatattttg gttttttcat atgcaaggaa tcaattttaa 1080 aattacttcc agttttgagt taataggaaa aataaatttc gcaaggtagt tttattttaa 1140 tttttggtag caatcgtgat tcgtgacggt ttatgtttta attgtttttt atcgttattc 1200 atattttaaa agttaaattg attttggtaa gcattttgaa tattctgcaa ataatattat 1260 aaactaaata taatccccta tttaacagaa ctttattgaa taaaagtaat atggaggatg 1320 ataactgcaa ttgcataata gaaaagttat ttgaacaatc atttggttca agattgtttg 1380 atgataaaat ggaaataatt aagaatggca agccaacacc atctttggaa aatttaagaa 1440 ccagaacaaa aaaatgtgtc cgacattttt caacagataa gtatcaacaa attgaatggt 1500 taacagggtg caaatcattt tgtgccctat attgttggtc atgtttgtta tttaaacatg 1560 agaaaaatgt atggaatgac aaagttgggt tttctgattt aaataatttt gaaaaggcgg 1620 taaaaaggca cgcagcttca gcctcacatc tacagtgtat gattgctttg aaaaaatttg 1680 gcacacaaca gagaattgag aatcaaatgg atacaatgag aaaaatttct gtgcagcagt 1740 ttaatgaaaa agtttaaaaa aatagacaga tccttagttg gttaattgat gtcgtgtgct 1800 ttctaggaca acaggaattg gcctttcgtg gaaatgatga atcagcagct tcaattaaca 1860 gaggaaatta cattgaaatc ataaatttac tttctgaata taatccgctt ttaaaagaac 1920 atttggataa tgccactgtt ttttccggat tatcaagtga tgttcagaat gatctaatca 1980 atgctatttc taacgtagtg actgaaaaaa tattgagtga gattaaagaa accgattttg 2040 tgtcgataat actggacgaa acaacagatt catcgaacaa atcgcagtta gcaattgttt 2100 taagatatgt ttcaaatgac ggtgccattt tagaacgatt cattaaattt atagatgtgt 2160 cattaactcg agatgctaaa gctttatcgg atattatttt gacctttttg gagaatgaaa 2220 aattaaatca caaattagta gctcagagtt acgatggcgc tgcagtaatg tcaggtgaat 2280 caggagggct tcaagcgtta ataaaaaatg ttatcaaatc agccaatttc gttcactgcc 2340 tagctcacag gctaagtttg gtattgcaga aatcatgcga aaaggaaaat caatgcaaag 2400 ttttttttac cactttatct ggtcttgcaa gatttttcag caaatcatca aaacgtgcag 2460 ctgcatttga tcaatattct tctcagcgac taccaaaagt ttctcctacg aggtgggagt 2520 acaatggaag gcttgtcaaa accgtttcca ctcacttaca gaatctcaaa ttatatttct 2580 catctattaa aaatcaaaca tccgattttg attgggataa agatagtctt attttagcta 2640 ctgggttctt gcagttctta aaagatccag attttttagt gctactatac ttcttcaacg 2700 aaatttttcc acaatctgaa gttttatacg atatattaca aaaaaagatc tatgacatat 2760 cctactgtat tcgaaaaatt acagagttta aaaatatgat agatgagtta atgattcaga 2820 aattcgaaga acattttgag ataattaact tggagctcaa tattctgcaa ctagcatcgc 2880 aaaaaagaga actttctggg aaaaatattt cacagcatta taaagtatta attaataaaa 2940 ttgggcaaaa aatgatttcg gaaatggaaa atcgctttga ttcattgaaa caaattaaat 3000 tcattgaatt aactgatacc agccaatatg aaaattttca aaaacaattt cctgtggacc 3060 ttataaacaa cttaaaaatt aattatatgc aatattttga tattcaaaag cttgaaactg 3120 aactcaaagt tatattttgt gattcagaca ttaagaacaa aaaatatgct gatgaaattc 3180 ttagctttat gaggaattgt gacttagaca attgttttaa tgaatataca aaactttgta 3240 agctggtgtt aacaattcca ccaacaacag caacagcaga acgagttttc tcaagtatga 3300 aaagaattaa aaattacgct cgcaacaaaa tgggatcgga aagattaagt tcattggccg 3360 taatttcaat agaaaaggaa ctacttaaga cacttaaatc cacagaaaat ttttatgatg 3420 ccgtaattaa tatattttcc caaaaggaca gaagaatttt tcttaaatat aagtactagt 3480 tattaaataa aaaaatgcaa taaaatttca tttgtttaaa aaaactttaa atattatttt 3540 caactcctag gtcctacatt ttcagtaact tggtctgtga attctttttt ccaaaatgat 3600 ttaataggtt gcttccctaa gataaaattt cacgcctcgt cactg 3645 // ID DNA8-98_AP repbase; DNA; INV; 796 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-98_AP. XX NM DNA8-98_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-796 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2035-2035 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 796 BP; 267 A; 96 C; 107 G; 326 T; 0 other; gagctcggat ttatatgcat ttgcatgttt ttactaagtg tatgcacgtc tagggggttc 60 taaaatgtcc acggttcatc tcacactgaa tttaaacacc aaattttata ttgcatattt 120 tgcatattta gaggattatt gcatattggt tcataaatgc atataacatg catatttaat 180 gggtttttaa taaattgcct attttattta agataaatat attttagtta ttttaaaaga 240 gaaaaaaatc gttgacaggg tctcttacaa gaatattctt aaaatattgt catttttgca 300 acacttcatg agatcggagc acgtatagat aatattaaat tatattaaca acaataattc 360 acatacaccc gcatttcgac atactatgtt gttacttcga cgtgcgcaac gaataaggca 420 gtaactcaat tttacaattt ttttttataa cgaataatat ttatagatga ttttattatt 480 tttattattt ttttttaagt tttctgagtg ttgtagacat tgtattttaa acaatttaaa 540 atatgaaaaa atgaagatta taaaatattg cattttggtt tttaaagtat taaatgttta 600 actatttaaa tattcttttg tttatgatag aatatctagt aggtactact ttccttggtg 660 taaaaatgtg ttttttttta aaaaaataac attttataat gtttaacata tggcatataa 720 ttgcatattg agccactttt tccttgcata tttgcatcat atttgacact ttttaaatgc 780 atataaatcc gggctc 796 // ID Gypsy-188_AA-LTR repbase; DNA; INV; 1974 BP. XX AC supercont1.105; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-188_AA_; KW Gypsy-188_AA-I; Gypsy-188_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1974 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.105; Positions 1876847 1874874. XX SQ Sequence 1974 BP; 559 A; 423 C; 406 G; 586 T; 0 other; tgtagcatgg tcactttttt gtaaatatta ttttaagatt tattcaatat aaatatagat 60 gtacataaat tgttacataa ttatattgtt ttttcatgtt cttcaatgtt aatttattat 120 tcgtgttaaa ttatttttca tttaatcata tatcatttca ttgccaaatt taacaaattc 180 aatagttcac ttaaatatat gcgatattca atatcacagt tcaaaatgaa atgcactttt 240 ccaaaaccgc atacaaaact tgcatacaag tcgccttcta ccaatactat atcaaataaa 300 agtccctgtc gccttacaca ctacaccacg agacaggaat aagcctaagg ttcaaagttc 360 aagcccgaac gttaacgatc gttataaata gatcggagtc caaaaataga cggaacgata 420 caggtcattc agatggaatg gtagcgaaag agatagcgct tcaatcggtt ctcattcgta 480 gttgatgagg atcgtcgttt gaaaaggtcg tgaaattact tcggtgattt tacgtacttc 540 cgtccacgaa gatgtaaaat cacaccttat ctgaggactt tcagaatctc tactcgtcca 600 tttttgacct ggaccgtgga agaggcgaga gtgcgtgaat atttttagtg tcgcggtgtg 660 tttaaattgt aaagttcgtt agccaagttc aaaggttacg tgcggtcaag ctagtttgtg 720 tgtgcgtttg tgaataagaa aagccgttcg acggtttttt atttaggatc gaaatctagc 780 gatctacggc ccaagctcgg cacactgcat ctgcaagcta aacgctgctc tcgaccccac 840 gcttcagcgt tcagaagtcc cggtttgagg ggaaagccgg aagggaacaa accgttccga 900 cagagcggcc accaccattt ctattacctc gtggacaggc cacgtgtgaa gcccccgacg 960 atccccaccg tcgtcaactg cgtcatacgt caccgacgca agtgatttcc ggcgtgcgac 1020 acttcatcaa ggagtaaatc caaaaagtcg cagttcgttg ccgtatttgg agtagtagcc 1080 gcaataccat catcgtcgat gtgctaattt cggaatcctg ctttgccgtc ccattcttcg 1140 tccggcactt ctgctttgcc agcccattga acggccgtcg cttctgcttt gccggcggca 1200 ttcatcgcag catctaaaag ggtccgtgag tacaagccaa accaaactaa ccatgcgcat 1260 gcgttaaaac cccacacaat aaccgagctc gaagataaaa agaagatgat cgataaatgt 1320 cgatagaatt agtttagcaa atccgcacac acattaggga aagtaggaag aagaagtaga 1380 agaggaaagg aacacacaat tatgaataaa acctaggtta tcaaatttga ataaatttct 1440 aaatgttttt gctgttctca accagattat gtatttttgt aatatgttag tgctttccct 1500 gtaacagttt ttttggtctt cgagatttct cacgtttcgc gtcgttctac ctgtttcttt 1560 ttctgttttt tgtacgccag gatagactgc ctgggaaatc agtctctcca ctcacgtgtc 1620 tgaagcgaac agggaaattt gaattagtga gttcttccag tgatgatact cgtgagtggt 1680 gtctttttgt gtcggtagaa cgacggtcaa catcaactat tttttgagaa tcgcttaagg 1740 cgacctctat aactgagaaa tctcgatcca ctattccttt gagttttctg tatcgtttcc 1800 taaattgcta tttggactct tataaaacaa acccgccctg aggttgggtt tcttgccggg 1860 caaacataac tcgacagacg gtcctcttcg gaggtggcgc taaagccgtc tgtttcttga 1920 gccagggaac gaaacaggag acggaaaacc tttagccgta ttttacgtgc taca 1974 // ID Gypsy-1_DFa-I repbase; DNA; INV; 5095 BP. XX AC ADHC01000033; XX DT 21-APR-2011 (Rel. 16.04, Created) DT 21-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Dictyostelium fasciculatum genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DFa_; KW Gypsy-1_DFa-LTR; Gypsy-1_DFa-I. XX OS Dictyostelium fasciculatum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-5095 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Dictyostelium fasciculatum RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; ADHC01000033; Positions 287210 282116. XX CC Positions [2037-2597] - Reverse transcriptase CC Positions [3594-4079] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1014..5093 FT /product="Gypsy-1_DFa-I_1p" FT /translation="MYVDTSFSLFDSPSLIDSGAAVNVISPSILGLYNIPY FT SKSLIPIMLADGSHTHVNGSISLDISVMGRSVNMPFLILESPHKIILGMPW FT LLATNPSIDWTAGTLSFKDPATVATLAVPVPPVTCAQAGSRPISVTLLDHP FT IVHTTTSPSAHVTPYIPPPVSPVAPSIHVTPSISPPASPIAPSIVSPVSPP FT IVSTVIPPIVLSVTPPVTSPIIPSSPLAPPVPPPIHVVVQSVIPELSVSAI FT QRLFVDDDVDEFFIVKLYPSPSSNSGLAHVASVSSANPRFAKLLADYQDVL FT VDALPNYLADHRLHDMEIVEVDGAKPVYRRNNHLSSEENNVMFTTVEKGLA FT SGRIAPSKSPYNSAVLFVRKKDGTLRMCVDFRALNKQTVADRFPLPRIDQL FT IEKIAKAKIFSKIDLKDGFNQIRIKDEHTHKTAFSTPSGHYEYTVIPFGLR FT NAPSAFVRAINAAFADILDTFVIIYIDDILIFSENENDHYEHIKQVLDRLR FT SNKLFANKAKSSFLVKEVEFLGHLITPGYIRPLADKLAAVKDWPTPSTVTE FT LQSFLGLCNYYRNFIDHFADHAAPLYDATTKKTLSWSDSLSAAFGKIKSLL FT CECTSLFIPDMDGPFTVTTDASDFGIGAVLEQDGRPVAFESRKMTKAERGM FT STRDKEALAVYHSFKKWRHYLVGKDFILFSDHTAISRIEDAVGTDDKVGRW FT ARFIGEFRYTFKHKPGKLNVVADALSRRPDYKEAISSPIVDPHFNDRIKKG FT YKSDLFYSDIINGKSRLHFHVSTNKLVYFTGNDHRRLCIPNDPDLRKDIIQ FT AAHNTGHLGVDKTYWHLSQSVYWRGIYEDCKEYVIACQSCQRNKSPTSGAL FT GKLHPLPIPHRPFHTVTLDFMVDLPPARGTGHNALLTIVDKFSKFVVYIPT FT FVNATAEDTAVLFYDQFYLRYGLPLNIVSDRDAKFTSNFWEALFKLLGSKL FT SFSSSFHPQTDGQSERMNRVTNEMMRSHVNYKMDNWVELLPTLAFTYNTTI FT HSSSNHSPAHLIHGFELINPLRLLSSSDIDSDTPAVNDFIDNRIMSWRHAT FT QCLAEAQSSQKEYHDRGKVQESLNVGDMVLLNRKNITLAADSRRHSWKWYN FT KWIGPFKIISVITNHNDEKSAYKLELPDTMAIHPVFHSSLLKPFHHSSRFS FT DTTYQVSIPTSGLIEEHLIEAIVDFRLHYNKPQYLVKWTGLDNYNNRWSTR FT DQIPTHILDEFEQLNPEHAMHNSTNKKSPTDDSTTDILSNDTTLNLNSTPN FT GHVDSSDSNVTVNLVPSVFNTTQSPVITEAPDATPNPVLSKPLPPSILLRR FT ILQKPVSPSATAKPPSPAPSITAKAHSPSLANTPASGKDFKVDNNIRR" XX SQ Sequence 5095 BP; 1264 A; 1495 C; 989 G; 1347 T; 0 other; ttggtagctt ttgtctatcg gtcaattgct cgttcattca tcacatcatc gtttcctttt 60 tatatatagt tgtttattgt tgtcattgtc atgtcaggtt tctcagcaga acagaaagag 120 tcactccaaa aggagtttgt tgcactacag tcggtcatga gtgcaacaca acaagctaag 180 ttcgatgagc tacacgctgt tattaaccgt ctcgttgccc aagcctcagc accttcaccg 240 gttatagcac cagcgcctgt tcctggtaag cgtctcaaat taggtgctcc tcctccgttc 300 aacggtgatg cctcttccat gcgccaatgg atcgctgccg tcgaaaacca gttcaactat 360 tacaaaacgc cgaacgacga gaggatcccg ttcgccagat ctttcttgac aggccctgcc 420 tcgtcttggt tcatctcagt cgaggcatcc ataaattctt gggatgcgtt caagtcggct 480 ctcaagaaca gattctcccc agtcaacgcc gccgatgtgg cgcgtgccaa gctatggaaa 540 ttatctcaag ataaactcag catacaagag tacactagca cgtttgtaga actagtttcg 600 aactgcccat ccatgtcgga ggacgaccgt atcgaccggt acctccatgg tctctcgact 660 ggtgtcaggt ctattttcaa tttgctcggt ggtgagttcc gcccaaaaac ctttgacgac 720 accatctccg tttgttgccg catcgcctct gaccaacacc actacggtca gaatgtcgcc 780 ccaccttcca cggtcgttcc catggacatc tcggccatct cgttgtctgg ccccatgact 840 ccagccatcc acgattattg taagacacat cgactgtgct tcttctgccg ctcgtctgac 900 aagcacactg caatcgattg ccctcgtaag tcaaagaaaa agaacccatc gaatttaaac 960 tagtgtcgtc ccctgctccc tttgttggga cgatctctcg atcaataaaa tgtatgtacg 1020 tcgatacatc cttttcatta tttgattcac cctctctcat tgacagcgga gcagctgtca 1080 atgtcatttc cccatctatt ttaggcctct acaacattcc atattccaag tcactcattc 1140 ccatcatgct agctgatggc tcccacaccc atgtcaacgg ttccatctcg ttggacatta 1200 gcgtaatggg acgtagcgtc aacatgccct ttctgatctt ggaatcacca cacaagatta 1260 tactgggcat gccgtggttg ctggccacca atccatctat cgattggacg gctggcactt 1320 tgtctttcaa ggacccggcc acggtcgcca ccctcgctgt accggtgcca cctgtcactt 1380 gcgcccaggc tggatctcgc cctatatcgg ttaccttact tgaccaccct attgttcaca 1440 ccaccacttc accctctgca catgttactc cttatatccc tccacccgtt tcacctgttg 1500 caccatctat acatgttact ccttctatct ctccacccgc ttcacctatt gcaccatcta 1560 ttgtttctcc tgtttcacca cccattgttt caactgttat tccgcctatt gtcttgtctg 1620 tcacaccacc tgttacctcg cctatcatcc cctcgtcgcc actcgcacca cccgttccac 1680 cccccattca tgtcgtcgtc caatccgtca ttcccgaatt gtcggtctcg gccattcaac 1740 gcctctttgt ggacgacgat gtcgatgaat tctttatcgt caaactgtat ccctcccctt 1800 cttccaactc tggcctggcc cacgtcgcca gcgtctcgtc ggcaaaccct cggttcgcca 1860 aactcctcgc cgactaccaa gacgttttgg tagacgcttt accaaactac ctcgctgacc 1920 accgcttgca cgacatggag atcgtcgagg tggatggtgc caagccagtc tatcgccgca 1980 ataatcacct ctcatcggaa gagaataacg tcatgttcac cactgtcgag aagggtctcg 2040 catctggtcg gatcgccccg agcaagagcc cgtacaactc ggctgtcctc tttgttcgaa 2100 agaaggacgg cactctccgt atgtgcgtgg atttccgcgc gctaaacaag cagacggtgg 2160 ccgaccgttt cccgttacca agaatcgacc aactgatcga gaagatcgca aaggccaaga 2220 tcttttcaaa gatcgaccta aaggacggct tcaatcagat tcgtattaag gatgagcata 2280 cccacaaaac agcatttagt actccgtctg gtcactacga gtacactgtc attcctttcg 2340 ggctacgtaa tgctccgtcc gcctttgttc gtgccatcaa cgcggcattc gctgacattc 2400 tcgacacttt tgtaattatt tatatcgacg atattcttat cttctcagag aatgaaaacg 2460 accactacga acacatcaag caagttctcg accgactacg ttctaacaaa ttgttcgcca 2520 acaaagcaaa gtcatcgttt ctagtgaagg aggtagagtt cctcggtcac cttatcactc 2580 ctgggtacat ccgtcccctt gcagataaac ttgccgcggt caaggactgg ccgacaccgt 2640 ctaccgtcac agaactacaa agcttcctcg ggttgtgtaa ttactaccgt aattttatcg 2700 accactttgc cgaccacgcg gccccgttgt acgatgccac cacgaaaaag acgttatcat 2760 ggagcgactc gctgtcagcc gcgtttggca agatcaagtc gttgctttgt gagtgcacgt 2820 ccctcttcat tccagatatg gacggcccgt tcaccgtcac cactgatgcg tctgactttg 2880 ggatcggtgc tgtcctcgaa caagatggta gacctgtagc gtttgaatcg cgcaaaatga 2940 ctaaagcaga gagaggtatg tccacgcgtg acaaggaggc tttggcggta taccactcat 3000 tcaaaaagtg gcgccattac cttgtgggca aggactttat ccttttctct gatcacactg 3060 ctatcagtcg aattgaagac gctgttggta ccgatgacaa ggttggtcgt tgggcccggt 3120 tcataggtga gtttagatat acattcaagc ataagcctgg taagctcaat gttgtggcgg 3180 atgctctctc gcgtcgcccc gactacaaag aggcgatatc atcaccgata gttgatccac 3240 actttaacga tcgtatcaag aaaggataca aatcagactt attttattca gatattatta 3300 atggcaaaag ccgtctccat tttcatgttt ctacaaacaa attagtttat ttcacaggta 3360 acgatcatcg caggttgtgc attccaaacg acccggatct gagaaaagat atcattcaag 3420 cggctcacaa tactggtcac ctcggcgtcg acaagactta ttggcacctc tctcagtcgg 3480 tttactggcg tggtatctac gaggactgca aggagtacgt gattgcctgt cagtcttgcc 3540 agcgcaacaa gtcgccgacg agtggtgcgt tgggcaaact tcaccctctg ccaatcccac 3600 atcgcccttt ccacactgtc acacttgact tcatggtaga tctcccacca gcaagaggca 3660 ccggccataa tgcattactg acaattgttg acaagttctc taaattcgta gtctacatcc 3720 ctactttcgt caacgcaacg gcagaagata ccgccgtatt attctacgac caattctatc 3780 ttcgctatgg tctgcctttg aatatagtat ctgacaggga tgctaaattt acttccaact 3840 tttgggaagc tctatttaaa ctacttggtt ccaaactatc attctcgtcc tcctttcacc 3900 ctcaaactga cgggcagtca gagaggatga atcgggtgac gaatgaaatg atgagatcac 3960 acgtcaacta caagatggac aattgggtag aattgttacc tactcttgct tttacataca 4020 ataccaccat ccactcctct tccaaccatt cccctgccca cctcattcat ggttttgagc 4080 ttattaaccc ccttcgctta ctttcttcta gtgacatcga ctctgacaca ccagccgtca 4140 acgactttat cgataaccgt atcatgtctt ggagacatgc cactcaatgt ctcgccgaag 4200 cccaatccag ccaaaaggaa taccatgacc gtggtaaagt tcaagagtct ctcaacgtcg 4260 gtgatatggt tctactaaac agaaagaata tcactcttgc cgccgactcg cgtcgacaca 4320 gttggaaatg gtacaacaag tggataggtc cattcaagat tatcagtgtc ataaccaacc 4380 acaacgatga gaagtcagcg tacaagctcg agttgcccga tactatggcc atccaccctg 4440 ttttccattc ctctttgttg aaaccgttcc accactcatc ccgtttcagc gacaccacgt 4500 atcaagtatc catccctacc tctggtctca tcgaggaaca tctcatagag gccattgtcg 4560 atttcaggtt gcattacaac aaaccacaat atctcgtaaa gtggaccggt ctcgacaact 4620 acaacaaccg ctggtctact agagatcaaa taccgacaca catactcgac gagtttgaac 4680 agctaaaccc tgagcatgcg atgcacaaca gcaccaacaa gaagagtcca accgacgact 4740 ctacaaccga catactttca aacgacacca ctttaaacct taactctaca ccgaatggtc 4800 acgtcgactc ttccgactcc aacgtcacgg ttaaccttgt tccctctgta ttcaatacta 4860 cacaatctcc agtcatcacc gaggctccag acgccacacc caaccccgtt ctttccaaac 4920 cattgccacc atccatcctg ctccggagaa ttctccaaaa accagtctct ccttccgcca 4980 ctgctaaacc accttctcct gccccatcca tcacggccaa ggctcactct ccctcgctag 5040 ccaatactcc agcttcgggg aaggacttta aagtagataa taatataaga aggaa 5095 // ID BEL-81_CQ-LTR repbase; DNA; INV; 726 BP. XX AC AAWU01022158; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-81_CQ_; KW BEL-81_CQ-I; BEL-81_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-726 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 304-304 (2011). XX DR GenBank; AAWU01022158; Positions 12831 13556. XX SQ Sequence 726 BP; 204 A; 155 C; 187 G; 180 T; 0 other; tgttcggttg aagagtaaaa ctgactcgcc accctgccgt gtagcataaa ttctcttgct 60 ataatttgcg tatatacacg gcagggtggt ggtagtggta gtcttgatcg tcgcgatcac 120 gcgcacttct gcatggtttt ctgcattcct cttcacactg cctgggccaa atatcagtcg 180 aaatagatcg tcgtcgcggt cggatcgcgt tgagaagaac acttttgttt actttttgga 240 cttaggaaat aaagtctcgg tcggtctgaa taaattttgt agttttatga agaaaagtcg 300 tttctgtgtt tttacctgtg aagaagaaaa tacttaccgt tacttacccg tttgaccact 360 gcgaaaccct ggtcgattcc gccgttctcg tgcgccgttt gcctggaaaa gaagaagatt 420 ctgctgccga agtttttgcc tgctgtgggg agaaaattac tgctgccacc gctgccggat 480 tgaacctgct ggaagatact tacctgtgga acctacaaaa gaaagaagga agaaagcggg 540 acggaaacgg gacggacagg atttggaagg aattttggta cggaaaggga cggaacacac 600 agaggacgac acaacgcaca aacacaacac acgacgaaaa cacaacgtag aaattagacg 660 tccaacgaac aggataaaat caggtgagcg cggatttcga ggttggccaa ttcacgggtc 720 cgaaca 726 // ID Gypsy-38_CQ-LTR repbase; DNA; INV; 1928 BP. XX AC AAWU01014491; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_CQ_; KW Gypsy-38_CQ-I; Gypsy-38_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1928 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 456-456 (2011). XX DR GenBank; AAWU01014491; Positions 82159 84086. XX SQ Sequence 1928 BP; 455 A; 493 C; 486 G; 494 T; 0 other; tgtagcagta ccatcagtac aaccctttgc cgcaagacag acagaaaaga tgaattcgcc 60 aatattttag cgtgccataa ttatggttgg cttatcttct cgcgctacgc ctcgacctaa 120 ttatgtcatg ggttgagcgc caccccttat gggagacttt caaaatactt tgccttgacc 180 tgttatgaga tccgcacaag aaggaagaat cttcagatgc taaagtttgc gacaagagca 240 ctttttgact cgaacgggag gtcgatgcac tccagttcgg cagtcagaga ttgcaaattc 300 ttaagagatg cttcggctaa actcgcacaa tgattgcaat tgtaatgttg ctcgagttct 360 agagaaccga agttgtccta attgcacctc tactgagggt tctaggactt cgactcggtc 420 acttttaccc tgatcgtgga ggaagcagga gtggtcgaag gaccgttttg gaatcgtttg 480 agttaggtag tttaaagtcg tgcgaagtgt ttcagtggca gaggttaatc ctgcaccttt 540 acttttagag ctcgcggcta gtaggagtta ggagtagggt agtggcagat taaagggcga 600 aaccctgaca catccagagt gaacgtagta caatcgaggt taccgtagtg cagttggcta 660 attttgggga attggttggg cgatcttctc ggcacaggtg gcaaccggca tcatccgtag 720 cagagagacc actccaaccg tcaacccagc gcttttggac tccaccaagg ggtcaaagtt 780 ccggggtgaa cccccaagca gctcgtccgt caacatacga gctgggcggc gtccgtcaac 840 acacgccgcg gtcgaaccct cgatcaccaa ccgaccggac cgggacggcc tcgtggaggc 900 ctaggtccac cgccaagttc ccgcgcgcac ccgcgccgtc gtcgcaaccg acgctatcat 960 cgcagcatcc tagctgcacg cctgtgcgcc cccgcacacc ggaagtgatg ctgaacaccc 1020 gccgatagtg ccaacccgtc tgctgttagc cgccggcgga accatctcgc cgtagccgaa 1080 gcggtaagcg cgccaccacc tgatcgccgg atcgccaacc aacacacgta aatccatggt 1140 cgtgtaagta caaacccttt ttcgacatgc gcatgatcga tcgttggcca acactctttc 1200 cgccaaccca gtgggcaaca taagccaccc ccatgttatg ttgcgtgatc gcaagcccca 1260 gtgaattata gcgatccccg aatgaatttc gccccccaag ccaacgcaca ctactagcca 1320 gccagtgcac tgccgtcagg tagagagaca ggaattctct tgtggtaccg aggtaccgtg 1380 gtagactagc atgcagaaga tctgcggatc gaatctcgta ggccgaagta ttttttttgt 1440 aaataaataa attgtagcta ctgaattatg cgaacctgaa tatacgtttt ttgagtttct 1500 agagtaattt tgtagttttt tctgatgttt tctgggatat ctttccacga ctttcgcggt 1560 agttttctga tgtttggttc cgccattctg gcttctgaag cgttctgggg aaagttgatt 1620 ggggagattt tctacctgtg atgataagaa cggggcaagt acaatctttt gttttctaga 1680 tctgaggtat ttttttttga gtttgcgatc tttttcttcc gtttgtagcg cctcggctaa 1740 ccgccgagtt ctcggatgag tcggatcgtt ctcaatcaca aattttggca tcaaggatac 1800 acccgccctg tggcagggtc tcatagctgg gcaaagcagc acatgatgaa tggtctccct 1860 ttgggagtgg cgcttaagcc attcgtttaa ttacgccaat gcgatccgac cgaccgcctc 1920 cggttaca 1928 // ID Ginger1-5_HM repbase; DNA; INV; 7091 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.01, Created) DT 02-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-7091 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 45-bp long. Tpase gene contains 4 introns: 373-620, CC 809-984, 1291-1381, 1534-1621. XX FH Key Location/Qualifiers FT CDS join(231..372,621..808,985..1290,1382..1533, FT 1622..2882) FT /product="Ginger1-5_HM_1p" FT /translation="MPKINAKSQELRLEEIRQYVLEGKYANTVLNSEKRGL FT RIAARNFKGIDGRLYYIRDESLRLVLYSQEERKRVLKECHNDAGGAHQGIV FT RTQNKIKNLYYWMSITSDVEAWIKTCKQCQSFEKIKTIAPQLHPIQVTERW FT AVLGVDLIGPLPETVNGNKYVLTITDLFTKWIVARVLPEKSAAAVATALVN FT TFHTYGPPKKVITDQGREFVNELNNKIFQLMGIKHSVTSAYHPQSNGQDER FT TNQTFKRALSKYTNNEMNDWDSYTSAIAYGLNISCQKSTKMTPFYLMYLRH FT PPGIEMINVLNETDDSLNSSFLEKISDSTVEKLVNDCEAVEKKVRENIKIA FT QEKQCKQHMKKVAKKGIKTFTFKINDQVMKLNARKRGRKGEPLSKEWMGPY FT TLADLNIKGQCTLKNLNGEILKTKVNVSQLKPYYPCQDIETNIQHLEATHL FT DLSYSSYPATNTVKKSEPGQITSSQNQFYIKNIDSIDTYSSKNSITFSVPI FT SAFESILNQNMLGDIEINAAQVILKNQFPDIQGMQDTLLGTNLMFRIQSQE FT MLQILHDGSLHWLLISTLGCSDRVVKIYDSMKTEMPTLHVQKQIASIISCS FT KNTVMCQYQSCQQQVGGIDCGLFAIAFAVDICMGQNVASISYNQSAMRQHL FT HNCFKAKHFTSFPRSYSESTVNKCESSTIFNVC*" XX SQ Sequence 7091 BP; 2475 A; 985 C; 1071 G; 2560 T; 0 other; tgtcgtacaa atttatttcc cccataaatt tatttccgag tcgcacgccg attggttaaa 60 gaggccaccc gaaataacta aacaacgcca tacaaactaa acaacaacac ccgtaaattt 120 tcttcctatc ctgattagaa gtgagtttaa atttacatta ggttttaata aataaattaa 180 taactttcat gccaagtatg atatattatt tttatatttt ttcttaagca atgcctaaga 240 tcaatgcaaa atctcaagag ctacgtctcg aagagataag acaatacgta ttagaaggaa 300 agtatgcaaa tactgttttg aacagtgaaa aacgtgggct cagaatagct gcaagaaatt 360 ttaaaggaat aggcaagtga attttaatta tttggtatct gtagtagacc cgatcaagac 420 taatacaagt aatttagtag taaggttttt attatattat ctaaaatata atggaacaca 480 atatacataa tctaaatata ataacaatat tgcaatatca cagtatctaa tagtttccac 540 ttagttttaa atattttgtt ttaagtttag gcttttaaca tatgtttgaa ctaagtttta 600 aaatttggtt aaatttttag atggacgttt gtactatatt cgagatgaat ctctacgact 660 tgttttgtat agtcaagaag aaagaaaaag ggttcttaaa gaatgtcata atgatgctgg 720 tggggcacac cagggaatag taagaactca aaataaaatt aaaaatttgt actattggat 780 gtcaattaca tctgatgttg aagcatgggt taagtactat ttgcttttta cgaaggctat 840 atgtttattt tcttaataat ataaagataa taaacatgta actatcatta ttaatataaa 900 atttaaatgc aatattgtac atgcaataat aatatacact ataaattgta ttattctaat 960 taaataatat atagtctatt atagatcaag acatgtaaac aatgtcaatc atttgaaaaa 1020 ataaaaacaa ttgcgccaca actacaccca attcaggtta cagaaagatg ggcagtttta 1080 ggagtagatc taattggtcc attaccggaa acagtgaatg gcaataaata tgttttaaca 1140 ataacagatt tgtttacaaa atggattgtt gctagagtat tgcctgaaaa atctgctgct 1200 gctgtagcta ctgctttagt taacactttt catacttatg gtccaccaaa aaaagtaatt 1260 acagatcaag gaagagaatt tgtgaatgag gtaaagacta aaaaagtttc taaaactttt 1320 gttttctttg tgttattgaa aatttatttt attagttttt tttattattt tttatgtata 1380 gcttaacaat aaaatttttc aattaatggg tattaaacat tctgtgacat cagcatacca 1440 tcctcaaagt aatggtcaag atgaacgtac aaatcaaact tttaaacgtg ctttaagcaa 1500 gtatactaat aatgagatga atgactggga taggtttagt agtctatatt cttatttcat 1560 tttccttata tatacactat aatttattat tacaataatt tttttttatg acttgtatta 1620 gttacacatc tgcaattgct tatggactga acatatcatg ccagaaatct acaaagatga 1680 ctccgtttta tctaatgtat ttaagacatc ctccaggtat tgagatgata aatgttctaa 1740 atgagactga tgattcttta aatagtagtt ttcttgaaaa aatatctgac tctacagtag 1800 aaaagttagt taatgattgt gaggctgttg agaaaaaagt tagggaaaat ataaaaattg 1860 ctcaagaaaa gcagtgtaag caacatatga aaaaagttgc aaaaaagggt ataaagacat 1920 ttacatttaa aataaatgat caagtgatga agttaaatgc aagaaaacga ggtcgcaaag 1980 gtgagccttt atctaaggag tggatgggac catatactct agctgattta aacataaaag 2040 gtcaatgcac tttaaaaaac cttaatggtg aaattttaaa aacaaaagta aacgtttctc 2100 agcttaaacc ttattatcca tgtcaagaca tagaaactaa tattcaacat ttagaagcaa 2160 ctcatttaga tttgtcatac agtagttacc ctgcaacaaa tacagtcaaa aaatctgaac 2220 ctggtcaaat aacgtcatca caaaatcaat tttatattaa aaacattgac tctattgata 2280 cttattcatc caaaaatagc attacattta gtgtaccaat atctgcattt gagagtatat 2340 tgaatcaaaa catgttagga gatattgaaa ttaatgctgc tcaagtaata ttaaaaaatc 2400 aatttcctga tattcaaggt atgcaagaca ccttattagg aacaaatctc atgttccgta 2460 tacagtcaca ggagatgcta caaattttgc atgacggatc ccttcactgg ttgttgatta 2520 gcacacttgg ttgcagcgat agagtcgtta aaatttatga ctcaatgaag acagaaatgc 2580 caactttgca tgtacaaaaa caaattgctt caattatatc atgttctaaa aatacagtta 2640 tgtgtcaata ccaatcatgt cagcaacaag ttggaggaat tgactgtggt ttgtttgcaa 2700 ttgcatttgc agtagatatt tgtatgggac aaaatgttgc atccatttcg tacaatcaat 2760 ctgcaatgcg ccagcattta cacaattgct tcaaagctaa gcattttaca tcatttccaa 2820 gatcatattc agaaagtaca gttaataaat gtgaaagttc tacgattttt aatgtttgtt 2880 aatgttcctc taatattgga tttgaaaaat tattcagttt aatagatgtg aagcagtcac 2940 aaacacatat tcagataaag ttaatgcatg aaaatgtctg ttgtgtatta tttaatagtt 3000 tacttttagt atagctttga tttattttag ttgtttttat tttagttcca gttttccttt 3060 ttttgttagt tgaatgagtt atttatgata ggttttctca aaagattatg ctattatttt 3120 ctcttttttt tatttagaca ttctagtata tttaatgaga atctactatt ttatttacta 3180 gatttttgac ttaaattgaa aatttttttt tttacagttt atgctttaaa cttgaaacaa 3240 tttgtattga agtttaattt agttacattt ttgaaaaatt tgttatgtta ttttatcatg 3300 tcctttattt tttacatgaa ccaatttcaa aaaacttctt gtattgttta cgcttttctt 3360 atcttttatt ttgtgtattt tagctgtttg aattaactat gtgcaataca cctcaaaaat 3420 tgtattttaa ctagtcctct aatgttcata atggtagttt ttcatgctat aaacgagctg 3480 atgagtaaga ttttttttat ttgtttttat ttgttttttc tgaatttctg attcacattt 3540 atgtcaaata ttttactgtc agtttcagct ggatttcctt cagctggatc atcaagtagc 3600 ttcctgataa tactgctgaa aaaaaaaatg acattaaaag ttccagtttt ttctaattta 3660 tatatatata tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 3720 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtattta aattagtaaa 3780 aacacttatc taatttttag atttcttcaa cataatgttt ctccatcagt aggttcatca 3840 ggatcatggc ttcctgatga acctactgat ggagaataca attttagcta ctaaataaaa 3900 aatttaaata tgtgtgtaaa ttttgttagt gtatgctaca aaaagagtgc ttgattttca 3960 taaaaaacaa aaatgtaata aaatagcttc ttgaggatcc tatagatggt gagactgttc 4020 aaatcatttt ttatttatta tatatagtag tataaaataa gaaaatattt ttttaaataa 4080 tacaatataa taatatcatg atatttttga atatatcact atattattat attgttattg 4140 caataataat tgcagttatg caattagtaa caaaaatata aaggaaagaa tccagaaatg 4200 aagggctttt ttttttgtca ataacaatac ttttacagtt taacaatact tttacgccta 4260 ataaacattg gaaagaattt agaattggag atctaacctt aggcagcatt acaacttaga 4320 tatgaggtgc aaagtcctat ggcttacctt ctagttggtg cacttcgtta agtaagttag 4380 acattgtatg tctaatttac gaacagaata gccaaggcaa atttcatatt ttattattag 4440 ttattttact agaatccatt caatatatat atatatatat atatatatat atatatatat 4500 atatatatat atatatatat atatatatat atatatatat atatatatat ataatttata 4560 taataaacaa acacacatgt gtgtgtgttt gtttattata taaataataa gtatataata 4620 ctttttacag tagcttttat caaacaaaag atgcaagttc agagaattct agttcagaga 4680 aatagtatat acagaattgc cagaaaaagt atagtgctaa acaaatttct caagaaataa 4740 atctaactct taaaaatgac atctcaagac agacaataaa tagaagatta attgatcgta 4800 aactatgctt gtatactgct gctaaaaaac caatgcttaa acgatcagtt cggctcaaac 4860 ggcttcaatt ttgcaaaaaa aaatttaaga tgagtgataa tgatatttaa aaaaatagta 4920 tttagcgatg aatcaaatca tacaaaactt taaaagagtt atatgatggt ttatcctttg 4980 aatattgccg aaatcttttt gaatctaata aaagaatatg tagactttga atagataaca 5040 aaggcggtca cataccatat taacttgact tacagtttaa agtaaaaaat aatgagtcta 5100 aacttatgat caagatattc gtagatatct ttttacataa ttttgttatt ttcaaaataa 5160 aatactttta actgtttttt ttttttgttt tttttcttgt cttttatata tattaaaaag 5220 aatttcataa ataataaagc ctattaaaaa aaaattcaag taataaaaat aaaggcattt 5280 ttttaaatca caaatcactt agaaactgta agttttattc taattctaca gtgggtggcc 5340 attgaaatag ggccattgag ttttctacac atttttgtga aattttattt atctccgtgg 5400 catgctgtca atgcatttct gtgccaattc ttttatgtga ggatttcgat gccaaccttc 5460 atttagtttt tcaatatgct gggttttatt tgtcaccagt tgttttgaag aaacctcccg 5520 ttttaatagc tcccacaggt tttctatagg attaagatcc ggggaatttc ctggccaatc 5580 cagcagtgga atttttttct ctccaagaaa ctttttaatg gattttgccg tatggcatgg 5640 ggcagaatct tgcataaata taaatttgcc atctggaaac cattccgaga cttgaggaat 5700 gagcttttgc tccaaaactt ctttgtattg gccctgcctc attgtccctt gcactatgta 5760 aagccttccc gtacccttac cacatataac ggaccacatc ataattgaca atggatgttt 5820 tactgtctcg ataatgcagt ctggattaaa tttttcaccc acccttcgtc gcacaaatga 5880 ggctttgtcg tccaagatct gaaacgtgga ttcatcagaa aagcacacct gaaaacaaat 5940 gtttcaaaaa atgttagtct ttcattaatc taaagaaaat cgagaaataa agcagagcat 6000 gacatgttta agaaataaat acctttctcc aatcatctgt tgtcatattg cggtgtttta 6060 tagcccattt cagtctttta tccttcatgg caggcgttaa ttttggtttt ttcgcaggac 6120 ggtaacactt aaagcctaac tcatctaagc gtctcctgac tgtgcgatca ctaatattaa 6180 ttccagcatc tttaatcaat gttgtgatta tttttctggg tttctttcta ttctccaagc 6240 aaatattcct gataattctc tcagaccttg gcgttgtaag gcgttttcct ttacagtttt 6300 tacggagtgt ggatataggg ttttcgccat tttttatttt tgatgctatt ctttgcacgg 6360 tctgcctaga aacgccaatt cttcttgaaa tttctcgttg agaatagttg gtgttcctta 6420 aaagtgacat tatttcactt tttttacaaa cattcacatc ttttttttta cccattgtca 6480 gaaactttaa aaaatctaaa agacaaataa acacactaca aaaatatgta aaagaaaatt 6540 aaaaaagttg aaataatgta tatttttgga aaaactactt tcaaagaagt ctttcactaa 6600 ataaaacact taaaaaatta aaattttgaa aaatagtcgg ttaaaaataa cgtttgaaga 6660 aagatcctta acacacacac aacaagcttc ggcaatctac aaatggtaac tgaacacaat 6720 ttcacggttt tacttgtcaa aacatgcaaa atacagtgtt gctagcaaca aaatagtgtt 6780 gcctacacaa taatttgtaa taatgtcgta aataattatg aaaaattgtg aattttacgt 6840 ttgaaaggtg aattaatgct gacaatataa caaaaaacgt gatggcccta tttcaatggc 6900 cacccactgt aaatttttga ttactattgt atattcagtt gttgtagaat gttctgcgaa 6960 tttatttcca catcggaaat aaatttatgt cgggaaacaa ttttatgcgt aagccgcgta 7020 atattatttc cttagtaacc gaaacttgcg ccacggaaat aaatttatgg gggaaataaa 7080 tttgtacgac a 7091 // ID DGLT-A1_I repbase; DNA; INV; 4518 BP. XX AC AF298204; XX DT 15-SEP-2005 (Rel. 10.09, Created) DT 06-OCT-2005 (Rel. 10.09, Last updated, Version 1) XX DE Dictyostelium discoideum gypsy-like LTR retrotransposon DGLT-A1 DE (internal portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; DGLT-A1_LTR; internal portion; DGLT-A1_I. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-4518 RA Glöckner G., Szafranski K., Winckler T., Dingermann T., RA Quail M.A., Cox E., Eichinger L., Noegel A.A. et al.; RT "The complex repeats of Dictyostelium discoideum."; RL Genome Res 11(4), 585-594 (2001). XX DR EMBL/GenBank/DDBJ; AF298204; Positions 269 4786. XX CC LTRs differ by 1 bp substitution. This appears to be a recent CC insertion. XX FH Key Location/Qualifiers FT CDS 193..4503 FT /product="DGLT-A1_I_1p" FT /translation="MSNNENIIFSGDGKGLSFRKFKHLLLLIHGEEISKAD FT IIRKLRNNALEYAATLDTDLTAMEIIEDLKDAFDIGDSDDLTELTKLGEYK FT YNTLDILIGKFLSKCSAVDISEKFKIQLFYSAVGTELGIEITKCAPKSLKR FT AIDIAKSCEDGSIRIKGRSMYTGQLVAKTLQEFGNQRYLGSGSGYSATSPQ FT YVTSQNEQQSIESEEISTPIFKNQYYNRSPQMSPQSSPYKAPQNQVNSITK FT FCKYCKKKGHVIQECWSKDKKNNYNENKQSYNNHSTNAITMVNDKTINSIG FT TDSMNAALLINDKNVRGLVDTGSSLTIIWESIAKKLNLKVEGKSFSVNSAS FT NNEIKIVGSCDTIIKLGKASAKVNVNIVKDNDTSIDCIFGVDTLIGLKLII FT DCKEMIIKNLEFNVGTRLITRETKPIVCKIDLNILEKVSKPMAELLIKNEE FT IFETKLSKPGSLIDVEHSIKLTDENVSVYTPPYKTSPADKEFIEEYIKDAL FT DKGIIEKSDSSQYGSPIVLSRKNDKIRFCVNYKKLNDLTIKDRYPLPLISD FT CWYYLKDAKVFSKIDLTSGYYQIKMKEEDKDKTTFVSHMGQFRYVVMPFGL FT CNAPATFQRYMNELLKDELRKSNIGFIDDCIIYSKNNEYHIEHVKLLFEKF FT KEKGAKLQITKCEFEKEEIKFLGYIVSKNGITYDSSKFNELLKTPPPSNQK FT ELMKLLGTVNYFRTFIKNFTHLTSSFYPLLKKGVNFKWTDELENDRVKLFT FT TLAQTNLLSFPQDTDDNVIETDASIIGIGGVMIQNGKPVSYYSRTLNNAEK FT NYSVTERECLAIIESIKYFKSYINGRKVKIITDHQPLKYLLTGKFTDRITR FT WTVLLQEYNYEIIYRPGKENFLADALSRCPDRDQQPIDCDIPEKIFTITHY FT PVASPSTTILPRVLPSRSNQTSNVSPATTIQPPNSSSSNQSSNIPPTTNQS FT SNITNQPINTSSMNLNQSLSNQATTARLLNSVPISINNDNINTTTPQFTQE FT SIRKELEKENYYRAMYKYLEETRLPDNAQAARRILFESEFYSMINGFLCHG FT LKHTLSSKKSHFKHLQIVVPKTMTKWIMEIFHDSPLTGGHFGLLKTVAKIK FT ERFYWIGMIKDIKEFIDKCTTCLQIKRKYGPKEGLLIPIEIEPEPFNTIGI FT DFIGPITKDNAQVYLLVVMDYFTKWPEVFFTLDMEGETVAQLLLYEIYTRY FT GVPKKLVSDRGKNFLSNVVSGVNKLFGVHKLTTTAYHPQCDGQTENFNNTL FT IKMLKAFIGNELYGNWGELLRCVLYSYRITPHVSTGFSPFFLLFNRQPTLP FT LDTTLNVNFYSNSLRLDFADNYAQSVNNNLKRAFWFTKNNLDKAQELQKTN FT YDKGRRPTNIQTGDFVYLHTPYSQTSIGPKKFYKPWRGPFKVQEKVSDVTF FT KLDMGNLRDHKVVNIERLKKIFN" XX SQ Sequence 4518 BP; 1826 A; 609 C; 648 G; 1435 T; 0 other; tttggcgaca tcgtctttca aaaaaaaaaa aaaaaaatat actataaata taaagttatc 60 aaattattca atcaattaaa attttacatt agattaaatt taatttaata aaatcttaaa 120 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gaaaaaattt aaaaaaaaaa aaaaaaaaaa 180 aaaaaaaaaa aaatgagtaa taatgaaaac ataatttttt cgggtgatgg aaaaggttta 240 tcatttcgta aattcaaaca cttactctta ctaattcatg gcgaggaaat atcgaaagct 300 gatattatta gaaaactaag aaacaatgca ttggaatatg ccgcaacatt ggatacagac 360 ttaacagcaa tggagataat tgaagatctt aaagatgctt ttgatattgg tgatagtgat 420 gatttaacag aattaacaaa gttgggagaa tataaataca atacattaga tatattaatt 480 ggtaaatttc tatcaaaatg ttcagcagtt gatatttcag aaaagtttaa aatacaattg 540 ttttattcag ctgttggtac tgaattgggt attgaaatta caaaatgtgc accaaaatca 600 ttaaaaagag caatagatat tgcaaagtct tgtgaagatg gttcaattcg tatcaaaggt 660 agatcaatgt acactggtca attagttgca aagacgttgc aagaatttgg taatcaaaga 720 tatctaggtt cgggttctgg ttattctgca acatcccctc aatatgttac atctcaaaac 780 gaacaacaat caattgaatc tgaagaaatt tcaacaccaa tttttaaaaa tcaatattat 840 aatcgatcac cacaaatgtc accacaatca tccccttata aagcaccaca aaatcaagta 900 aattcaatca caaaattttg taaatattgc aaaaagaaag gtcatgtaat tcaagagtgt 960 tggtctaaag ataaaaagaa taattataat gagaataaac aatcatataa taatcatagc 1020 actaatgcaa ttacaatggt taatgataaa acgattaatt ctattggaac tgatagtatg 1080 aatgcagcat tattaataaa tgataagaat gttagaggtt tagttgatac tggttcttct 1140 cttaccatca tttgggaaag tattgctaaa aaattaaatt taaaagttga aggtaaatca 1200 ttttcagtta attcagcatc aaataatgaa attaagatag ttggtagttg tgataccata 1260 attaaattag gtaaagcatc cgcaaaagtt aatgttaata ttgttaaaga taatgatact 1320 tcaatcgatt gcatttttgg tgtagataca cttattggat tgaaattaat aattgattgc 1380 aaggaaatga tcattaagaa tcttgaattt aatgttggta caagattaat caccagagaa 1440 accaaaccaa ttgtttgcaa aattgatcta aatatattag aaaaagtatc aaaaccaatg 1500 gctgagttat taatcaaaaa tgaagagatt tttgaaacca aattgtcaaa accaggatca 1560 ttaatagatg ttgaacattc aattaaatta actgatgaaa atgtttctgt ttatactcca 1620 ccatacaaaa ctagtccagc tgataaagaa ttcattgaag aatacattaa agatgcatta 1680 gataaaggta taatcgagaa atcagattca tcacaatacg gttctccaat agtcttatcc 1740 agaaagaatg ataaaattag attttgtgtt aattataaaa aactaaatga cctcacaatt 1800 aaagatagat atccacttcc attaatttct gattgttggt attatctcaa agatgcaaaa 1860 gttttctcaa aaattgatct tacttctggt tattatcaaa tcaaaatgaa agaagaggac 1920 aaggataaaa caacctttgt ctctcatatg ggtcaattta gatatgttgt aatgccattt 1980 ggattatgta atgcaccagc aacttttcaa agatacatga atgaattatt aaaggatgaa 2040 ttaagaaaaa gtaatattgg gtttattgac gattgtataa tttatagcaa gaataatgaa 2100 tatcatattg aacatgtcaa actattattc gaaaaattca aagaaaaagg tgcaaagtta 2160 caaataacta aatgtgagtt tgaaaaagaa gagatcaagt ttttgggata tatcgtttca 2220 aagaatggta ttacttatga tagttcaaag tttaatgaat tattaaaaac accacctcca 2280 tcaaatcaaa aagaattaat gaaattactc ggtactgtca attactttag aacattcatt 2340 aaaaatttca ctcatttaac ttcatcattt tatccactat taaagaaggg tgttaatttc 2400 aaatggactg atgaattgga aaatgatcgt gtaaaattat ttactacatt agcacaaact 2460 aatcttttaa gttttccaca agatacagat gataatgtta ttgaaactga tgcttcaatt 2520 attggtattg gtggtgtaat gattcaaaat ggtaaaccag tttcttatta ttcaagaaca 2580 ttaaacaacg ctgaaaagaa ttattctgtt actgaaagag aatgtttggc aatcattgaa 2640 tcaattaaat attttaaatc ctatatcaat ggtagaaaag taaaaatcat cactgatcat 2700 caacctttga aatatctctt aactggtaaa tttactgatc gtattacaag atggactgta 2760 ttacttcaag agtataatta cgaaattata tatagacctg gtaaagaaaa ctttttagct 2820 gatgcattat caagatgtcc agatagagat caacagccaa ttgattgtga tattccagaa 2880 aagatattta caattactca ctatccagta gcttcaccat caactacaat tttaccaaga 2940 gtcttaccat caagatcaaa tcaaacatct aatgtttcac cagctacaac aattcaacca 3000 ccaaattcat caagttcaaa tcaatcatca aatatcccac caacaacaaa ccaatcatca 3060 aatattacaa atcaaccaat aaatacttca tcaatgaatt taaatcaatc attatcaaat 3120 caagcaacaa ccgctagatt attgaatagt gtacctatct caatcaacaa tgataatatt 3180 aatacaacaa ctccacaatt tacacaagag tcaataagaa aagagttgga aaaagaaaat 3240 tattatagag caatgtataa atatcttgaa gaaacaagac ttccagacaa tgcacaagca 3300 gcaagaagaa ttttgttcga gtctgaattt tattcaatga ttaatggatt tctttgtcac 3360 ggtttaaaac atactttatc aagtaagaaa tctcatttta aacatcttca aattgtagtt 3420 ccaaaaacta tgactaaatg gataatggaa atatttcatg attcgccact tactggagga 3480 cattttggat tattaaaaac tgtcgcaaaa atcaaagaaa gattctattg gattggtatg 3540 attaaggata taaaagaatt tatcgataaa tgtacaactt gtttacaaat taaaagaaag 3600 tatggtccaa aagaaggttt actcattcca attgagattg aacctgaacc atttaataca 3660 attggtattg attttattgg tccaattaca aaagataatg ctcaagttta tttgttggtt 3720 gttatggatt attttactaa atggccagaa gttttcttta cccttgatat ggaaggggag 3780 actgtagctc aattattgtt gtatgaaatc tatactagat atggggttcc aaagaagttg 3840 gtttctgatc gtggtaagaa ttttttaagt aatgtagtct ctggtgttaa taaattattc 3900 ggtgttcaca aattaacaac tactgcttat cacccacaat gtgatggtca aactgaaaat 3960 ttcaataata cattaattaa gatgctcaaa gcttttattg gtaatgaatt gtatggcaat 4020 tggggtgaat tattaagatg tgttttatac tcatacagaa ttacaccaca tgtttcaact 4080 ggtttttcac cctttttctt actttttaat cgtcaaccaa ctttaccatt agatacaact 4140 ttaaatgtaa atttttattc aaattcacta agattagatt tcgctgataa ttatgcacaa 4200 tccgtaaata ataatcttaa acgtgccttt tggtttacaa agaataatct cgataaagct 4260 caagaattac aaaaaacaaa ttatgataaa ggtcgtcgtc caacaaatat tcaaacaggt 4320 gattttgttt atttacatac accatattca caaacttcta ttggtccaaa gaaattctat 4380 aaaccttgga gaggtccttt taaagttcaa gaaaaagtta gcgatgttac attcaaatta 4440 gatatgggta atcttagaga tcataaagta gtcaatattg aaagattgaa aaaaattttc 4500 aactaaaagg gggggata 4518 // ID SINE2-1_CQ repbase; DNA; INV; 684 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A SINE family from Culex quinquefasciatus - consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW SINE2-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-684 RA Kojima K.K. and Jurka J.; RT "SINEs from the southern house mosquito."; RL Repbase Reports 11(1), 623-623 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. ~19bp TSDs. XX SQ Sequence 684 BP; 182 A; 158 C; 171 G; 173 T; 0 other; ggagcggccg tggctgactg gttacggtgt tcgctttgta agcgaatggt tctgggttcg 60 atgcccatct gctcccaacg agaaagttaa gaacacataa atttgaaatg atgaatatga 120 acgaaaaatc aaagtcgctc gaggcggggt tcgatccccc gtcctttgga ttggtaagca 180 aaaatgctaa ccactaggcc atgacgactt ggtgagcgtg gactggaatt aggaatactg 240 ttacagagag cgagttgtgg cgctataacc acggcaaata gtgtccaggg cattcgtgga 300 aatacaatgt cccatcccag agggtcccgg agtaccaaac cttcttagca tggtgctccc 360 aacgaataca accaaaatct cggagcgtat ggtggtgtgt ccccacgctt cttcctcccc 420 tgtcgattca gaattgtgtt gttgttcgaa cactcagtgc tcaaactcaa cccaattacg 480 agtcatctct gcgatacggc tttacgcagt aggcctggcc gctttaacgc ttgtgatgtt 540 tcaatgcatt ccgatgcaat ggcgcactac aaatgttaat aaatgacaag aagagtgcta 600 ggcgtcatct aacctaaggc actctccagg atcccttcga aagattggct gcgctagggt 660 ctgattagat tagattagat taga 684 // ID Copia-108_AA-LTR repbase; DNA; INV; 235 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-108_AA_; KW Ty1_copia_Ele43; Copia-108_AA-I; Copia-108_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-235 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 235 BP; 62 A; 59 C; 43 G; 71 T; 0 other; tgttgagtaa caatttcagt accgggtaca tcagcacaac cctacctgtg tgacagccct 60 gaagcggtaa agccatgacg tggcataaca gagaacaaaa gcgcgcgaca gcctttcgcg 120 ctaaaactat tcttttttca ttctgtgtac aaccttcgtt gaataaacat cttttcattt 180 tcgttctgta atcgcgttct gtttaattac tgaccggccg aattccctat tctca 235 // ID Gyp1a_Cis_LTR repbase; DNA; INV; 520 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gyp1a_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-520 RA Smit A.F.; RT "Gyp1a_Cis_LTR - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000068 and Ci000091. XX SQ Sequence 520 BP; 160 A; 107 C; 106 G; 147 T; 0 other; tgtagcgacc tcatgtgtta cataagattt acgatcccat cagctggcat ttgcgctgac 60 attccgtcat gttgaaatat cattattagg tttacgatta taaccacagc tgtcgtactt 120 tgcctatata agtttatttt acttttgctt aaaaaagaaa agagaaggag agagtcctaa 180 cgcattaagt aacatcttgt atgtgtctat actgagagtt tatctggaat ctatgtcgtt 240 taaacatcta acgatttaac tcgcaattgg taccaagtta atcgcctcaa ttgctcaata 300 caattgatgt tataaaacta ctgaggcgtt tatcggcgaa ctacaggaga tcagtcaaag 360 agacatttac gcagaaaacc tcaattaatt agcgcacctc cgagagtcga gatagcgcgt 420 agggaatcgc agatcgcatc aggtgttgta accggcttac ctgggtgcct acaaggaaca 480 atctagcgaa ataggcagcg acctgcaacg gctccctaca 520 // ID CR1-8_CQ repbase; DNA; INV; 5392 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5392 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 10-10 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 308..2278 FT /product="CR1-8_CQ_1p" FT /translation="MTKSSNCDADVCLGKAANRSPRVRCEFCSASLHIKCA FT NLDNTVVKPLRDSAGVCWLCSKCRDPVKRKATSNSNTMDLILQRTTSTMKL FT VGALLDTVQILSQLTRTLCSRPNCSTIRPPVPDLDTVAGGNDPLNFTEIFE FT SAMNGEKRPRSSSLSRQSLSQPEKITRVGVSVNQTAASNITVPPVSDSNDH FT LAVDTEQFCSDPLFDAASKVALEVAALAAEFSAGYSTAPTEQTNPPASPKI FT LASQVATAQQAAAAQQLATAQQAASAQQLATAQHAATAQHAASAQYAASAQ FT HAASAQQLVTAQHAATAQHAASALYAATAQHAASAQQLATAQHAASAQHAA FT STQHAASAQHAASAQQLAQQAAAQQAAAQQAAAQQAAAQQAAAQQAAAQQA FT AAQQAAAQQAAAQHAATQQAAAQQVAAQQAAAQQAAAQQAAAQQAAAQQAA FT AEQVATQQALQTAATANAQQTATINLAVASASASTDSTQKWYYVTRFQPHE FT KSANIIRYIQSKTNCDPDQIVCNKLVSNNKSNGRPLSFLSFKITVPDSIET FT IVNTHTFWPXGVTIQPFTENRTRAIASTKSKSNGPPRTRSTVPSRDRDVLP FT SAVPNTRNTNLPPFHGQTPFPSGFRLPHPPLPPYLTPFPMIPQYQLYPNLR FT NELRSLQP" FT CDS 2374..5163 FT /product="CR1-8_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MYAFTETWLTADTLSSQLFGPEYEVFRSDRTSVNSDK FT MSGGGVLLAIRSALKPRQLFPPNCSIPEQVWVNVPLATSTLFICVVYIPPK FT RDNDRPLIEQHRDSLAWVTSMMKINDSVMIIGDFNFPALRWTRSPTNKLIP FT NLAXTPTNQLKLDVLDDYSVANLKQLNDIPNISNNVLDLCFVSSDTPTCCS FT LLPAPQPLVKIDRFHSPLLVSISCKTHAFRASSCSFFHDFNNADFAGMCVY FT LNNINWNDLLHNLDANGAAETLMNILRQAIDTFVPRKERQPARYPPWTNER FT LRILKTAKRAALRKYSKHPTDRWRNFFWAASNRYSHLNNRLYHQHLHSIQN FT RLKRDPKKFWNHVNEQRKESGLPTVMVRDDTEATNPEDICDLFRSQFSSVF FT NDEIVDDAIISKAANNVPVRPPIGQHPFVNPDAVRRACLRLKGSTSCGPDG FT VPALVLKKCGDSLAAPLSLLFNLSLRTGVFPSCWKKSFVFPVHKKGPKRDV FT RNYRGIAALCATSKLFEQIVLDFIKFQCSSYIAQEQHGFMAKRSTSSNLVA FT YTSFILRTMQHRKQIDALYTDLSAAFDKLNHRIAVAKLERLGFGGSLLDWL FT RSYLAGRTMCVKIGDVVSAAFAVFSGVPQGSHLGPFIFLLYMNDVHLLLKC FT HKLSYADDIKLYAVIETRSDALFLQDQLNIFANWCMDNRMLLNASKCAVIT FT FTRKRSAIAFDYKLTNTLLSRTSCVKDLGVMLDSKLTFSDHLAYTTAKASK FT TLGFIFRIAKKFRQISCLKALYCSLVRSTLEYCSVVWSPFYQNSIQRIESV FT QRKFVRYAQRHVLWADPLNPPCYVERCKMLQLDLLSTRRDVAKVTFVTDLL FT QSSIDCPSVLELIDFNIRRRTLRTHYFLRVPRALTNYGHHEPLSSMCRIFN FT SCSDFFDFSLPRNTIKNRVLNHLRNL" XX SQ Sequence 5392 BP; 1373 A; 1584 C; 1106 G; 1325 T; 4 other; agagcatctc ccattgtgcg taaacgtaaa caactctttc atatgcgcgc ttttaaaatc 60 gcttttttca agtgtaaaac taaccgtttt tcttcgcgag tactattttg gtgcaaacta 120 gtgaaggatt taacaggaca ctgtgataga taggtgtaat ttacgcgata aagtgatttc 180 gtgtcgttta attacacatc acaacttgcg agtgtctccc aacgatacac tctttctcgc 240 tccctcacgc acccggaccc gctatagtat ttttcgcttc caaacaacaa caatccaacg 300 catcgaaatg acaaaatcga gcaattgcga cgcagatgtt tgccttggca aagcggccaa 360 tcgctcgcct cgcgttcgat gcgaattctg ttcagccagc ttgcatataa aatgtgctaa 420 tctggacaat accgtcgtta aaccactacg ggattccgcc ggcgtttgct ggttgtgttc 480 gaaatgccgt gaccctgtaa aacggaaagc aacatctaac agcaacacca tggatcttat 540 cctacaacgg acaacatcga cgatgaaatt agttggcgcc ctcctggata cagttcagat 600 cttgagccaa ctaaccagga ccttgtgctc tcgacccaac tgcagtacca tccgcccacc 660 tgttcctgac cttgacaccg tagccggagg caacgatcct ctgaacttca ccgaaatttt 720 cgaatccgcc atgaatggcg agaaacgacc acgttcgagc agtttatcca gacaatcatt 780 atcgcagccg gagaaaatca cgagagttgg cgtttcagtt aaccaaactg ctgcttccaa 840 tattacagtc cctcccgttt cagactccaa tgaccatctc gctgttgaca cggaacagtt 900 ttgttctgac cccctcttcg acgcggcctc caaggttgct ctagaagttg cagcccttgc 960 ggcggagttc tccgctggat attctactgc accaaccgag caaaccaacc cacccgcctc 1020 gccaaaaatc ttagcgtcgc aagtagccac cgctcagcaa gccgccgctg cacagcaact 1080 tgccactgca caacaagccg cctctgctca gcaactcgcc actgcccagc acgccgccac 1140 tgctcagcat gccgcctctg ctcagtatgc cgcctctgct cagcatgccg cctctgctca 1200 gcaactcgtc actgcccagc acgccgccac tgctcagcat gccgcctctg ctctgtatgc 1260 cgccactgcc cagcacgccg cctctgctca gcaactcgcc actgctcagc atgccgcctc 1320 tgctcagcat gccgcctcta ctcagcatgc cgcctctgct cagcatgccg cctctgctca 1380 gcaactcgcc cagcaagccg ccgcccagca ggccgcagcc cagcaagccg ccgcccagca 1440 agccgccgca cagcaagccg ccgcccagca agccgccgcc cagcaagccg ccgcccagca 1500 ggccgccgca cagcaggccg cagcccagca tgccgccaca cagcaagccg cagcccaaca 1560 ggtcgccgcc caacaagccg ccgcccagca ggccgccgca cagcaggccg cagcccagca 1620 ggccgccgca cagcaagccg ccgccgagca ggtcgccacc cagcaggctt tacaaactgc 1680 cgccactgca aacgctcaac aaacagccac tatcaatctt gcagttgcct ccgcaagtgc 1740 atccaccgat tcaactcaga aatggtatta tgttacacgt tttcaaccac acgaaaaatc 1800 tgccaatatc atcaggtata tacaaagcaa aacaaactgt gacccagacc agatagtatg 1860 taacaagctc gtaagcaaca acaaatcaaa cggtaggcct ttgtcatttt tgtcgttcaa 1920 aataaccgtg ccagattcga ttgaaaccat tgtaaacaca cacacttttt ggcctgawgg 1980 cgtaactatt caaccgttca cggaaaaccg cacgcgtgca atcgccagta ccaaatccaa 2040 atctaacggc cctcctcgta cacgatcgac cgttccctct cgagatcgag atgtcctacc 2100 aagtgcagta ccgaacacac gaaataccaa tctaccaccc ttccacggcc agactccttt 2160 tccatcgggc ttccgcttgc cgcatcctcc gttgccacca tacctgacac cattcccgat 2220 gataccgcag taccagttgt acccaaacct acggaacgag ttaaggtccc tccagcccta 2280 gaaacgaatc tgctcctttt ttaccagaac attggtggct taaacactac aatcgctcag 2340 cactcactcg ccatttctga tgcatcttac gatatgtacg cctttactga aacgtggctg 2400 acagctgaca ctctatccag tcagcttttc ggccccgaat acgaagtgtt ccgttccgat 2460 cgcacctctg tcaatagcga taaaatgtca gggggtggtg ttctacttgc tattcgatcc 2520 gctctgaagc cacgccagct gtttccacca aattgttcaa ttcccgaaca ggtctgggtc 2580 aatgttccac ttgctacatc aacgctgttc atttgtgtag tttatatccc cccgaaacgt 2640 gacaatgatc ggcctctgat cgaacaacac agagattccc tggcctgggt gacatccatg 2700 atgaaaatta acgatagtgt tatgatcatt ggtgatttta attttcctgc tctccgctgg 2760 acgcgtagtc cgacaaacaa gctcataccg aacctcgcca wcactccaac gaatcagctt 2820 aaactggatg ttttggacga ctactctgta gctaacttga aacagttgaa tgatatcccc 2880 aacatctcta acaacgtgct agatctatgt ttcgttagct ctgacacacc gacttgctgt 2940 agtctcctgc ctgctccaca gccgcttgta aaaatagatc gtttccactc tcccctactg 3000 gtatcaattt catgtaaaac tcatgccttt cgagcttcaa gctgcagctt tttccacgat 3060 tttaacaacg ctgactttgc tggtatgtgt gtctacctga acaacatcaa ctggaacgat 3120 ttgctgcata acctcgacgc gaatggagcc gccgaaactc tgatgaacat cttacgtcaa 3180 gcaatcgaca cgttcgtgcc caggaaggag cgacaacccg ctagatatcc accatggaca 3240 aacgagcgcc tacggatctt aaagacagca aaacgggctg cgcttaggaa gtactcaaaa 3300 catccaaccg accgctggag aaactttttc tgggcagcta gtaaccgata cagccacctg 3360 aacaatcggt tataccatca gcacctacac tccatccaga atcgcttgaa acgggacccc 3420 aagaaattct ggaaccacgt caacgagcag aggaaagaat ccggcctacc aaccgtgatg 3480 gttcgtgatg acaccgaagc gacgaacccg gaggatatct gcgatttatt ccgctcccag 3540 ttcagcagtg ttttcaacga cgagatcgta gacgacgcga tcatctctaa ggctgccaac 3600 aacgtcccag ttcgccctcc gattggacaa catccgtttg tgaatcctga tgcggtccgc 3660 cgagcttgct tgcgcctgaa aggatctaca agttgtggtc cagatggcgt tcccgcccta 3720 gtgctgaaga aatgtgggga cagcttagca gcaccattgt ctctcttgtt caacctctcg 3780 cttcgtactg gtgtgttccc gtcttgttgg aagaaatctt tcgttttccc ggtacacaag 3840 aaggggccaa aacgtgatgt tcgcaactat cgtggcatcg ccgctctttg cgcaacaagt 3900 aaattgtttg agcagattgt tttggacttc atcaagttcc agtgctccag ctacattgcg 3960 caggagcaac acgggttcat ggctaaacga tccaccagtt ccaacctagt ggcatacacc 4020 tcctttatcc tgcggaccat gcaacatcga aagcagatcg atgccttata cacggacctg 4080 tctgcggctt ttgacaaatt gaatcaccgt atagctgtgg caaaactgga gcgtctggga 4140 tttggtggtt cgcttcttga ctggctaagg tcctatctcg ctggtcgaac gatgtgcgtc 4200 aagatcggtg atgttgtttc tgctgctttc gctgtatttt caggcgtacc ccaaggaagc 4260 catcttggtc cattcatttt tttgctctat atgaacgatg tacatctact tttgaagtgc 4320 cacaagctgt cttatgctga cgacataaag ctgtatgccg tcatcgaaac gcgtagtgac 4380 gcattgttcc tgcaggacca gcttaacatc ttcgccaact ggtgcatgga caacaggatg 4440 ttgttgaacg cttccaaatg cgctgtaatc actttcactc ggaaacgctc cgctattgca 4500 ttcgactaca aacttactaa cacactcttg tcmaggacat cttgcgttaa ggatctcggc 4560 gtaatgctag atagcaaatt aacattttct gatcacctag catacaccac agcgaaggct 4620 tcaaaaactc tagggtttat ttttagaatc gctaaaaaat ttcgccaaat aagctgcctc 4680 aaggcattgt actgttccct tgtccgwtcc actttggagt actgctcggt tgtgtggtcg 4740 cccttttatc agaacagtat ccagcggatc gagtctgttc aacggaaatt tgttcgttat 4800 gcacaacggc acgttttatg ggccgacccg ttgaatcctc catgttacgt cgagcgctgc 4860 aaaatgttac aacttgacct tctctctact cgacgtgatg tggctaaagt aacattcgtt 4920 accgaccttc tccagtcctc aatcgattgc ccatcagttt tggaactcat cgacttcaac 4980 atacgccggc gcactctccg cacccactac tttctgaggg ttccccgagc acttacgaat 5040 tacggacatc atgagccact atccagcatg tgtcgtattt tcaattcctg ttctgacttt 5100 tttgatttct ctctccctcg caacactatc aaaaatcgag tcctcaatca tcttaggaat 5160 ttgtagcgaa gatctgtgat attattttta aggaaagtga cataaatgtt aatcgttaat 5220 tgtaataatt tgttagattt aagtatgctc gtcttgtatc attggagttt gtaaacttgt 5280 tgatgcgtta agacgaggtg gttttgtgcc tctctgagag agtgtcctct gacacaactc 5340 aaaggggctt ttccccacct ccaaataaaa aacaaaacaa aacaaaacaa ac 5392 // ID Gypsy-3-LTR_HM repbase; DNA; INV; 227 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-227 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1973-1973 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 227 BP; 83 A; 22 C; 41 G; 81 T; 0 other; tgttctatta ttatgttggt gaattgtacg tgttagaaat cacgtagatg cgtgttgaaa 60 gtcacgcatc tacgtaagtt atttcaactg taattaacag tatattctca tgggactttt 120 taatttaata tatatagaga aaagattttt gtataggatg ttagaaagaa tatataaagc 180 tatagattat aaacgttatt aaattaaatg gacgcgacga taaaaca 227 // ID Gypsy9-LTR_Dya repbase; DNA; INV; 352 BP. XX AC chr2R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9_Dya; KW Gypsy9-I_Dya; Gypsy9-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-352 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1075-1075 (2009). XX DR Genome; chr2R; Positions 1397220 1397571. XX SQ Sequence 352 BP; 82 A; 71 C; 96 G; 103 T; 0 other; tgtcggagaa tggaatattc tagatcggaa tttcgtagta tttgaatttg ctcgtctctc 60 tgttccttcg tttgagcgtc ggacaacttc taatttcgta gtatttgaat ttgctcgtct 120 ctctgttcct tcgtttcgag gaaccgttgg agcaaggcga cggtggcgcg tcgcctgtgt 180 agcggtgcaa gagttaccac aaatgtggcg cgacatcgtc gtcgagtgcg gttcaaggaa 240 tcatataaaa agaggtagta gtttgagggc gatgagttag tcttgatcga tcgtcgctac 300 cgtaaagaac gaaacgcaca gtgccccgat ggatcctgaa cgtaatttga ca 352 // ID Sola1-N1_AAe repbase; DNA; INV; 3137 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3137 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1291-1291 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. 4-bp TSDs. TIRs are 32 bp long. CC This non-autonomous Sola1 element includes a CR1-like sequence. XX SQ Sequence 3137 BP; 1005 A; 575 C; 561 G; 996 T; 0 other; ctgcccataa aagcataact gtcccatatg gatttcctcc aaaaaatttt ttttggcggg 60 ctacgcatat ttgtatgtat attttcatat gcacacataa aattgatcag tttgttcgaa 120 aacattgcga aaaaatttca aactagtgct gtcccatatt gaaaaataaa ggcataacag 180 tctctatata gatcagcctg caaaaagtat aattattctt tagtaaaaat tatctgcgat 240 acaaatttat cgcgagcaca ttttgattga aatttagcat catttataca ttcaattcat 300 tccattttgt aatgtagaat atcctatata tttaacataa gaacagtagt gtcttactga 360 atattgtaac atgacccaaa accgcgccct ttactgtaca caaagcagaa ccgcttctgg 420 cggcaatgtc gatgtggtca gctagtcttg ggttgaatac gttaaatctt tacgaaagct 480 aatttaccgt atttcatttg caatattgat gattaattcg ttttcaatca aataattgaa 540 gttgcataaa caatggagct gaaatagctg ttgcgagaga agcacatctc ttcatccaga 600 ccatggtgaa ggtggtaaga atgcatataa gtccggacga ggatctactc gggttgaaca 660 ttttcttgct ttcacagggc atagtatatc ttcgtaccac tcgttataaa aatgtgaaaa 720 cagttgattg gcaaagaaaa ctctcagatt gaaaaactat cacaacaaga aactcacagc 780 actaaaaaac ctggacgcat ttaaatgagc aggaccggat ggcattgcac ccgtgttctt 840 caagaatcgt tcagaagagc ttaccgttcc cctaaaacgc cttttcgaca tgtttctaaa 900 taatggaaaa ttcccaaaag cttgaaaata ctcatatttg gtgcctgttt tttaatcagg 960 cacaaaatct gacatacgga actatcgcgg aattgccaat atctcttgaa aacatgtaac 1020 agctccaatt gaagtatttt taggctatca tgtagtgcag atatgtaaag ttgtaaggga 1080 tcatagataa tagactcaaa actcactttt gtagaacatt ggaatttcat aattaaacaa 1140 gcaaacagta agcttggctc cattaagagg ttcagcaaca atttgcataa cccttatact 1200 attaaaatat tgcatattac atatgtaaga ccaattctga gatactgcca cctagtatgg 1260 aacccgaacc atatcataca cgaagaaccc atagaataag tacagaagca attcctttta 1320 tttccggaga ttgaattgta ctgaagaata tgaagaagaa ttaacaaatg agttaggttt 1380 ttttaagcta taaaaaagtc tatgtattcc aaatcatgcg ttgttttagt atttatttaa 1440 tcaaaattca attgtgaaca gcagtttgta gatagaaact tgctcatcct agtctttcct 1500 cattattcag gatattttgg tttattgtaa agattttgag catacggata cccatgggca 1560 tcggctgcca gatcactgta tagcaggttc ccagttcccc gggggatgtg atcatgttcc 1620 gaattggcca cttatctagg agcccctgga atctactata gagtgattac tacatttgtg 1680 attttgaaaa tgctccacat cttccaaggc atataaaacc tactgttcat ggtagacggt 1740 gattctaaaa ctacttccag aagcatgaat tttaaatata ttgaagtttg ttacccatat 1800 tggttttatt ccgtacaaaa aaatccgtgt aaaattatcc actttcatca gggtacctcg 1860 aacctactat acagtagtca tgacagccgg aggtttatga agtactttct gaagcatcac 1920 cttttaaaat cttgatattt gatacccatt ttgatgtgat tccggacatg tttttaattg 1980 gccacatccc ttggggcacc tggatactac tagagagtgg tcatggcagt cggtggttct 2040 gcaaattctt ccggaattat cacctttaga aatcttgaag tttgatgccc atattgatat 2100 ggttcagaac atgttcctaa ttgggcactt ctcccagggc acctggaacc tactatagaa 2160 tggtcaattc gtctaaaggt tctgaatatg ctgccggaag catcatcttg aagtataaga 2220 ggaaaataac catcatcgtt ttaaaaaatt tcaaaaccag tttttttttg ctcaaatttg 2280 aagcaatgtt catgaaaatc tatcattgca ttgcacttgc aatctgaaat gcaatgatag 2340 attttcatga agatttttta aaattgaacg aaaaaactgg gttttcattt tggatgattg 2400 ctgatgattg aatgatgtat tcttgagcat taaaggtcac taggatatag ctccggatag 2460 tatatacaat acatgatagg aaatagaaac atgactgatc atattttccg gaaccagttc 2520 tacgtaaatg gttatatttc tgaagatttt caacaaattt aaatgcgttt gccggcattc 2580 aacttcctat tagttctagt ctctaggaaa attgtaaaca ctcatacaac ttcaccccgc 2640 acaaattaca agcgacagaa aaatgccatt tggacatgac ctcggagaat ccggaacatc 2700 tttggtgtca ggtggccaat atgcagggcc aagaaagttc ctgaaacaag accaaatttg 2760 ccgccgacta cttccagatg cattgaattt catccattct tcataaagtt atgagcattt 2820 ggaattagcc aaaattttgc tcatcgttct gagctggcgg tgagcctgat atggctttat 2880 ttctgttgcc ttgcaataaa ataaaggcaa aagagaactt tcttcatttt tatgctctta 2940 taattatatt tccgtgagaa actactcaaa atacatctga gcgctaagta ttgaacacga 3000 tgatgttctg ctgggcattt attttggcat tgtcgacgct aaattggtgt ttttttgacg 3060 acaagatatt taacttcatc tatagatttc tcgttcgata gcaattccat atgggacagt 3120 tatgctttta tgggcag 3137 // ID Gypsy-40_OD-I repbase; DNA; INV; 6151 BP. XX AC CABV01005033; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_OD_; KW Gypsy-40_OD-LTR; Gypsy-40_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6151 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01005033; Positions 749 6899. XX CC Positions [4059-4532] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 206..1396 FT /product="Gypsy-40_OD-I_1p" FT /translation="MGEEIYYKKVVDLSTKEMMLIYDRHQDCQYPMLIPRF FT DKTGIVMNLANIEPNFLLDLEKKYVFRKIGKDLKYDPDYDSEAVQKTARAD FT HVLNIDKINDLIMLNDYNSVLAKIKEESDEDQTKTQDATGSSKTGPAPGPA FT KDDGRETKADPAPGPATDQPRGTTTGSAPESASGPAPGPNFGHAQAPNAKS FT NTTPRSGPAPTSTAPGPDFDNKTTGSQFRMKTPKFDSQLPIENWICAMDIY FT MTCYNLSEENIIKISLSQLLTEDSGTSLIESINPEERKNWDAFKSKLIAVL FT GKDQEHFKHLYNTFQRGNESQAMALTKITAFFKKGYKKSALDEADEQIVCE FT KFISAQEPRLMELLTREKSSLNLRNIAQRATELERSFYNREKVFVAEEQKK FT SNS" FT CDS 1961..2974 FT /product="Gypsy-40_OD-I_2p" FT /translation="MDTLKNITIGESSVTLHDKMIPTIDPHSSDLLLTGII FT EIDKNSGTEKEKYDEFIRRRTENAQKTNFIPAIGSFGAANPEQKAQLEKLV FT MKYRLSFNMDSDDLGRLFGFRYTLPMFDEKQSSHQPPRPIPIHSQAQVEEQ FT IGKWKQLDIIEKTQSAFNIPLIILRKSDKTIRISLDARGINSLLVKDRYPL FT PHFTTVFTRIGERLTNGKECFISSVDFFRGYWQVLINESESHKMAFSYKNE FT HYQANRMLYGSCNAPAAFSRIMMRLMNHPSIVLYLDDLVIIDSSWEEHLRS FT LEFVFKTCLDHGLILSSKKCQLASHELDFIGHRITASGIKPLKKHS" FT CDS 2991..4802 FT /product="Gypsy-40_OD-I_3p" FT /translation="MDPPTDKTELKRYLGMVNFNVKFVRKGSEILAPLYKL FT TSAKVDYEWSDVHQQAFDTIKAELLKEPTLAHYKPGSKLVLVTDSSGHAVG FT GTLYQVKEGVFSPLGFFSKSLQGPDLKRSMRIKELFALCWGIQFFEYFLIN FT TEFDCFVDHKSLLFLFREARRSKLDIKLCNVHNYLQAFDFNIIHKNGTSPE FT MASADFFSRLPKYRSSDLERDCIDFFDVDETIFAAHTLSEDDIIFSFENRS FT FTSEEMATAQNECQNTRNLIIKSKINTKQSRLKLKDNVLYSKDRLVLPSQL FT ADEFIQYLHTITAHAGSQQLLQMLRKFFISNIQEKVRAITSSCETCIRLKP FT LKELRPSLIKDRKFAQLPFEKVFIDLVDYGRPDSSGKRYLLTCVDSLTGYL FT DGEPLSNKTDNLVSKALLKLILRHGISQNCVSDNGREFGPLTKKILDRFEI FT RHTTTTAYRSRSNGKCERIHRELHQKLKAQLATNRSWSQFWPLAQYYINNS FT PKGSLDNLSSNEAVFGRQFHVPFTLKDPVTDSIQPFIKALNEYMKKLHPSL FT LNFQVQRYQKLLKRDKNDCPVLDLGTVVLCWKPDLLAGKLGINWVWSIQNS FT PTIVKG" XX SQ Sequence 6151 BP; 2088 A; 1418 C; 1121 G; 1524 T; 0 other; gattaaaacc tactaaatat ttggtgactg aagaaaacgg cacctcgaca tccagccaga 60 gagacttttg gggttcaaga acggagctct tatccagcga gaatccaagc aagcggagtg 120 tcaaccataa gttgcaccac cgataggttc tcgtttagga ttataagaat cctccttagc 180 cacttagatt agtaatctct ataccatggg ggaagagatt tattacaaga aggttgttga 240 cctctccacc aaggagatga tgctgatcta tgatcgccat caggattgtc aatatccaat 300 gctcattcct cgctttgata aaaccggtat cgttatgaat ttggctaata tcgagccaaa 360 ttttctctta gatcttgaga aaaagtacgt tttccgaaaa atcggaaaag atctcaaata 420 tgatccggat tatgatagtg aggctgtgca aaagacagca agggctgacc acgtcttgaa 480 tatcgataaa attaacgatc tgatcatgct caatgattat aattcagtct tggctaagat 540 taaggaagag tccgatgaag accaaacaaa aacccaagat gctactggct cttcaaaaac 600 tggtccagct ccaggaccag ccaaagatga tggtcgagaa acaaaagctg atccagctcc 660 cggaccagca acagatcaac ctcgaggaac aacaactggt tcagctcccg aatcagcttc 720 aggtccagct ccaggaccaa acttcggaca tgctcaagct ccaaatgcga aatcaaacac 780 gactccacga tctggacctg ctccaacttc tactgctcct ggaccagact ttgacaacaa 840 aacaactggc tcccaatttc gaatgaagac accaaaattc gattctcagc tcccaattga 900 aaactggatc tgcgcaatgg acatctacat gacttgctac aacttgtctg aagagaacat 960 aatcaagatt tctctttctc agctactgac tgaagacagt ggaacaagtt tgatcgagtc 1020 catcaatcca gaggaacgca aaaactggga cgccttcaag tctaagctga tcgcagtcct 1080 gggcaaagac caagaacact tcaaacatct ctacaatacc tttcagcgcg gaaatgagtc 1140 acaagcaatg gctcttacaa agatcaccgc gttcttcaaa aaggggtata agaagagtgc 1200 tcttgatgaa gctgatgagc aaatcgtgtg tgaaaaattc atatccgcac aagagccacg 1260 tctgatggaa cttcttacta gagaaaaatc aagcctcaat ttgcgaaata ttgctcagcg 1320 agcaactgaa cttgaacgat ccttctacaa tagagaaaaa gtcttcgttg cagaggaaca 1380 gaaaaaatca aactcgtaaa ttgatcaaat ctgttctaaa ttagagaaaa tgatcgccaa 1440 cgctacgact caagataaga agaaaaatta ttcagaaaat aagagaaatc gaatcgacac 1500 caccaagata caaggtcact gcatctctta tgtaaaaagt ggaaaatgca aatacggaaa 1560 gaagtgccgc tacattcaca gcgatgatgt cccagctgaa gtccaaaaag tgataaataa 1620 ggaggcatga ctattaccgt cagaaacttg caaatatgct aattttgctc cgacctcgct 1680 caaatttatc aagatcacgc tcaatggact taactttcct gcactaatcg actccggttg 1740 cagtagaacg tgcttacgat ccgacgttct caagctccta ccaacctctg tcaaagaaaa 1800 accttcaaat attaagctca agtgtgcgaa ttcagaagtt gtaacggtaa ctacgctcac 1860 attaccgata gaagcaaccc tgaaaacaag ctcatcgcca cttaaactaa aactgaaccc 1920 gctcatcgtt cagaatcttt cctgtcctat tatccttgga atggacactc tcaagaatat 1980 aacaattgga gaaagctcag taactctcca tgacaaaatg atcccgacaa tcgaccctca 2040 ctcatctgac ttactcctca ccggaataat agagatcgat aaaaattccg gaacggaaaa 2100 agagaagtac gacgaattta taaggcgacg caccgagaat gctcaaaaaa ccaatttcat 2160 tccagcaatt ggatcatttg gcgctgcaaa tcccgaacaa aaggcacagt tagaaaagct 2220 cgtcatgaaa tacagactaa gcttcaacat ggattccgac gatcttggac gactttttgg 2280 gtttcgctat acactaccca tgttcgatga gaaacaatcc tcacatcaac ctccaagacc 2340 aatacctatc cattcacaag ctcaagtgga agaacaaatt gggaagtgga aacaacttga 2400 catcatcgag aagactcaat ctgctttcaa tattccgctt atcattctgc gaaaatctga 2460 caagaccatt cgcataagtc ttgatgctag aggaatcaac tcacttctcg tcaaggatcg 2520 atatccacta ccacacttca cgacagtctt cacacgaatt ggagaacgcc taacaaacgg 2580 aaaagagtgc ttcattagct cagtcgattt cttcagagga tactggcaag tcctaataaa 2640 cgaatctgaa agtcacaaaa tggccttcag ctacaagaac gaacattatc aggcgaaccg 2700 aatgctttac ggctcttgca atgctcctgc tgcattcagt agaataatga tgcgactcat 2760 gaatcaccct tcgatagttc tctacctcga cgatctggtc ataatcgatt caagctggga 2820 agaacactta cgctcactgg agtttgtatt caaaacctgc ttggaccatg gccttatcct 2880 aagctcgaag aagtgtcaac ttgcaagtca cgagctcgac ttcattggac atagaatcac 2940 cgcatccgga atcaaaccct tgaagaaaca ttcttgaagc aatcaaaaac atggatccgc 3000 caactgacaa aacagaactt aagcgctact taggaatggt caatttcaac gtcaagttcg 3060 ttcgtaaagg aagtgaaatt ttggctccac tctacaaact cacttcagca aaagtcgact 3120 atgaatggtc agatgtacat caacaagcat tcgacacaat aaaagcagaa cttcttaaag 3180 aaccgacact cgctcactac aaaccaggat caaagctcgt tcttgtgaca gattcatcag 3240 gtcacgctgt cggcggtacg ctctaccaag taaaagaagg cgtcttttct ccacttggat 3300 tcttcagtaa gagtctccaa ggtcctgatc tgaaaagaag catgcgaatc aaagaacttt 3360 tcgctctctg ctggggtatt cagttcttcg aatatttcct gatcaacaca gaattcgatt 3420 gttttgtcga tcataaatcg ttgctattct tatttcgaga agcaagaaga tcaaaattag 3480 atatcaaatt atgtaatgtg cataattatc ttcaagcttt cgacttcaac atcattcata 3540 aaaatggaac aagtcctgaa atggcctctg ctgacttctt tagcagacta ccaaaataca 3600 gaagttccga ccttgaaaga gattgcatcg acttctttga cgtcgatgaa acaatattcg 3660 ccgctcacac tctttcagaa gatgatatta tattttcttt cgaaaatcgt tcgttcactt 3720 cggaagaaat ggcaacagct caaaatgaat gccaaaacac aaggaatctc attatcaagt 3780 cgaagatcaa tacgaaacaa agtcgactta agctcaagga caatgtgctc tattccaaag 3840 atagacttgt actgccaagt caacttgccg atgaatttat tcagtatttg catacgatca 3900 ctgctcatgc tggaagtcaa cagctactgc aaatgctcag aaaattcttc atcagcaaca 3960 tccaagagaa agtacgcgca atcacaagct catgtgaaac gtgcatccga ttaaagccgc 4020 tcaaagaact tcgaccttcg ctcatcaagg atcgaaaatt tgctcaacta ccatttgaaa 4080 aggttttcat cgaccttgtc gactatggaa gacccgacag ctctggaaaa cgatacctgc 4140 tcacatgtgt cgactccctt actggatatc tagatggtga accactttca aacaaaaccg 4200 acaatttggt atccaaagct ctgttgaaac tgatactgcg tcatggaatt tctcaaaatt 4260 gtgtatctga taacggtcgc gaatttggtc ctcttacgaa gaaaatactt gaccgattcg 4320 aaatacgaca tacaacaaca accgcttatc gcagtcgatc gaatggcaaa tgtgaacgca 4380 ttcacagaga actgcaccag aaactgaaag ctcaactggc aacaaaccgc tcatggtctc 4440 aattttggcc tcttgctcag tactacatca acaattcacc aaaaggaagt cttgataacc 4500 tcagcagcaa tgaagctgta tttggaagac agtttcacgt tccattcacg ctcaaagatc 4560 cagttaccga cagcatccaa ccatttataa aggctctcaa tgagtatatg aaaaaattac 4620 atccctcatt actcaatttc caagttcaaa gataccagaa attgctcaaa cgagacaaaa 4680 acgactgtcc tgttctagat cttggaacag ttgttctctg ctggaaacct gatctgcttg 4740 ctggaaaact tggaataaac tgggtctggt ccatacaaaa ttcaccgacg attgtcaaag 4800 gatagttata ttgtcaagtg tccaatcacg aagaaagaat accggcgcca tataagtctc 4860 attcgacctc ttcgccaaaa acttcaagga gacacaacag agttcataaa agaagataaa 4920 aaagaaaatg cggaagttca actcactaat cggaagatcg acactcttaa agatcactca 4980 gctcacgatg ataaagatga gataagttca acgaaattag ataaaaagct cacatcttca 5040 ccgctcaatc aaaaagatcg acatgagcct gtcgatgaag ttgattatga agatatttgg 5100 aaaaaccgtc tcagatcaag acaaaactaa catgataata gctcaattat catttcacac 5160 ttgattttgc aatttgctcc tgtggtttca aagtcctgtt ctcaaggata tcctgtgctc 5220 caaatgtaaa ttatcacaag ctgcagcgat ccaatgcaca ccggaatggc tcaatacatc 5280 cctgctcgtt aaacaacaca aaggaaattc cagaaacaaa agttcaaatt cgaactcgct 5340 gttcaagctt caatacaggc ctttgcggac gtgtgtcgtc ataacccagt caatgtgctg 5400 attcatcggc cgaacgaaaa gtacctcttt gctgaaactt ggaacactca tctgcaaatg 5460 tttagcaatg cgtgcagttg ctatgtcaca cccagaaaat cttaaatcaa gttgcacaat 5520 tgcatacgat ctatttgcaa aagaaaatat gtcagaactc accctgatcc gtggaaagtt 5580 catctatttc cctgttctgt ggaaatctaa tcttcggaaa aaatgcccga ttgcagcaga 5640 tccgactgca agaagaatga cgaaaatata gatttccctg ctctgcggaa atctgatcat 5700 cggaagaagt atccgattgc agcagatccg aatgcaagaa attatatcat taacaactgc 5760 tctgattgaa ataagaagac cagatgaaat tcatccctgc tcgaaagaag aatttcaaag 5820 ctcaaaaata aatcaaaaag gaaaaggaag ggtaagacaa gaaggaagaa gcaacggcaa 5880 gctcagtatc gcaaacagaa gatttgcctc atctcaaatt tccgagcgaa tcatcaaaaa 5940 tttccgagag acctgttctt acgacctagt cgtctcttgc tctttcgact ctactgactc 6000 ctgtttcatt atcacaagaa tctgctatga attttctctt cccagaagac caaattacca 6060 acatcaagat ttcgactctg tcgtcactct caactaatct tcaaatccca gtaatttcat 6120 cgctcaaaat gaaaggggga agaatatata g 6151 // ID EnSpm-N3_BF repbase; DNA; INV; 2857 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-N3_BF non-autonomous DNA transposon - consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N3_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2857 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2857 RA Kapitonov V. and Jurka J.; RT "EnSpm-N3_BF - a family of non-autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 792-792 (2008). XX DR [2] (Consensus) XX CC This transposon contains imperfect 34-bp TIRs (10 mismatches) CC and is characterized by 3-bp TSDs. XX SQ Sequence 2857 BP; 821 A; 565 C; 594 G; 877 T; 0 other; cactgcatga aaaccgctct ttaagctctc ttaaagtgtg cttaagggtg ccgtattggc 60 attctttaag ttactttacg caatgcttac gcattcgtaa atattatgtt taagccataa 120 ttacatgcgt ccgtaaacgg cacgtagtgg cacacattta agcaacattt aagcaccgta 180 aagcatactt aattccaaat gattacgcta cctttaagct ctgtaaagtt ccgtaaaaga 240 tgtgatttaa gttgtcttaa agttttctta aagagtaatt aatggcactc atttaagatc 300 tgtaaatcag ttgtaaatat cgatatttaa gcttgatata agctccttta tttcaatgat 360 tttgcatact acgatcatgt gtcagtgaat gtatctttta tgtaatgtgg tgtaatgtta 420 tgtaattcag gtatcctttc ttgtttgttt tttatggcca gccgttgttg tgttccaatc 480 tttgtacatg cacgcaatca aatatgccat gtttagctcc accctctatg tctcggcgtg 540 atgcattcac tcctgggcca gtagaacatc ttgctttctg cattcttcgt ccagttccat 600 ccattcagat gtctcaggcc tccaggtcat ctggaaacga tcttgaacta cgtagacttt 660 cgttcagggt tgaaggagat acatgcccca taaatcgact agccggcgcg gcggcgtaac 720 tccccgggct tgatacgggg catgttgcta ggggcccggg gagttacacc gctgcgctcc 780 cttgcctact ctaggaaacc aaactgggta acgagaatac cgatccccgc ctgcccaacg 840 cgcgcccgca ctaccagctg gccccactta cgtcggcccc cgactatcgc ttgcttctcc 900 gagaggcaaa gcgatagttg ctagctggat cggcagacgg gcgggtgttt tgggcccggg 960 gcgcggggat tggtaaacct ttgcgcgctg gctcttagga gttctggctt cacagggtat 1020 ccattggccg gctagcgctt tcctccaata agcagggtga tccaattctt gtccagggca 1080 gcaatagtcg accgaccggc caggctcacg gctttgttgt tcttggtgac cgtacggtgt 1140 ttgtgtcctg gagccattat attctgcgtc ccctgggctg agttatacgt tggagctgta 1200 gcaggtccac acccaagcaa gctacgatca ccaaatgatg acagactaga tttgaattag 1260 gagcgcgagt gctgattggt ccaaatcgcg cgttgtttaa attgaaacaa tgatttgcat 1320 agtcagcaga catgatggtc tatgttatgt gtattgctat tgttattcat agacaggctt 1380 gctgaccatt tcacaattga acaatagagt ctaagtcaac agcccaagaa gggtgtaaaa 1440 tgtattccag tctaaataca tgctaaaaag cttgcccatg cagaaatcac gtctagaaat 1500 tgtacatttt accaccgcat agaaaaaaaa tgaataagtt cagtcaaata caaacatgat 1560 aaggtcaatt aaagccagca ggaaaagtca aatcatgaaa gtgagctcag atgagcctgg 1620 cggaaatcaa agacattaaa aaacgtaata tagaatgtcc ttgcagaaat gaattcttat 1680 tgtgccgagg aaattcagcc cccaggctgg gagagactgc tgcagcgcgt tatatctaat 1740 aatgaaagtt atgaaaacat gaactttatg tgatccagta tgaacgactt tctgtggtgt 1800 ctacggtaca atttggtact tgattgtgtt ttaaagctta gaaatggatt gaaacaaaga 1860 tggcgtgggc tgagtcagtg gatatgtaaa caaagaaaaa atcccgctcg gaacttgaat 1920 ttgtttttat gacctagata cacttgctgg ctctcattaa acccgctact tcacaatttg 1980 tgttctgcaa ttcatattgg tcgatctttt gatactgtgc aagtatttac aaggctgaaa 2040 atgtgcataa ttcgagaaaa tcagttatca catcggtaaa ccaaatattt gtatgatgct 2100 ttgttttgtt ttgttttagt ttgcgtagac tgcacaggcc gtctcctacg cgacgccttc 2160 gaatctaaaa caattaggca ttgcatgcta tagctttagg atctaagtcc tgttacgttt 2220 tatgtcttta tatctgaaac tctatcacat atatcagtaa attgtactac atattttttg 2280 gcagtattat gacaatgcag agttggtgtt gtccgcttct gcaggacttt tgcctcctca 2340 ctgtaattat aaaaggcgtg tgcgcagaag cttttgaata aactctagta tgacaagata 2400 tcaaggagta aagtattcaa aatccattaa ttgaagaaaa aattagactg atgacagacg 2460 ctgctaccac caactttaag cagcttaaat aaagcttaaa taaacgactt taaactctcc 2520 taatgggcgt ttaaaagtat cgtatatgaa ataattaaaa tgaagagtaa aagcgatgat 2580 tattttagtc aatacatata aactttgttg gcaaaactgc gcaaattaat gtcatatctt 2640 taaggcaaag ttaagtgtac cttaactaat atgtaaagtt atttacatgt ttaagtggac 2700 ttaacgtttt tcttaaatgt gtcttaaata tcgattttta gttgtggcat ttacgacttc 2760 ttaaatgtag cttaataata gcttaatcat attcaaatgt ttaagcttca tttacgaatc 2820 ccattaagca tgcttaaata tccaatttgg tgcagtg 2857 // ID Gypsy9-NVi_I repbase; DNA; INV; 4135 BP. XX AC AAZX01007771; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9-NV; KW Gypsy9-NVi_LTR; internal portion; Gypsy9-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4135 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1141-1141 (2007). XX DR Genome; AAZX01007771; Positions 13889 18023. XX CC Positions [3108-3566] - Integrase core CC 'ATCTT' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 68..1075 FT /product="Gypsy9-NV_I_1p" FT /translation="MSTKEEIERLTAELNKQKKEVDQKIAELQALDKKVKE FT DNKKLEATRQLVELGNSSARHTTLVRGTIGSITEFSMDEDWQMWYERLEQY FT CVTNEVPAEKHVSLFLTLLGKDGYTLLRNLCTPRIPSQMLIIELAKIMKDH FT LQPAPSVITERYKFKECRQRTGEDIKGYVANLKKMSTFCEFGSNLESHLRD FT QFVWGVASEAIKKRLLGETDLTYHRAIELAASLESAGRDAAEMGTTASVAA FT DAVNYVADKKVTPGKQTRATGGDSRNCYCCGRPNDRASDCTFRNYKCNSCG FT KTGHLTIVECVEIEILRKRVCGIRLSLTKIKRVIQRDRTKNRIT" FT CDS join(1203..2063,2067..2996) FT /product="Gypsy9-NV_I_2p" FT /translation="MMIEGAMMEFEVDTGSPITAISSRVYKNAKNLSKLKL FT EKTSRVFKTYHDKKICPLGTVQVKVIYKKKSLVLELFVLPSDSDTPIVGRQ FT WLRHLGMIQVNGDKEGVNVLTIHTLCGISMSDLLREFDSVFSGKLGTYRGG FT KLSLPLKPNAVPVFYKPRPVPFAIRKPLDKELERLIRENIIEPSNSSDWAT FT PIVPVIKSNGEILFQKEMEKLLDSVKGTAVYFNDVFISGKSQAEHDSNLRK FT VLNRFKEAGLTVKLEKCQYSEQKIQFLGYELDSKGLHVSESKIKAIKMEAP FT TKVAELKSFLGAMQCYAKFIKDFAKVVSPLYELTREDVKWDWTKEREAAFQ FT TTKKCLMSHDVLVHYSTEVPIKVTCDASPKSIGAVLSHIFPNGDEKPVAFA FT SRVLTRAERNYSQLDREALAIVFGVKSFHRYIYGRDFTLETDHKALSFIFG FT SKKGILQIAASRVQRWAIFLSGYNFRIKYINGKDNGPADALSRVLVDEFKA FT KHKESEEYTYLNYLSDDIQSINVDTVRVETARDPILTKVYEFVSNGWPKKV FT DDNLQAFKNREFELSIENGCIMWGHMLVIPYVLRFNYFQKLHSAHAGIIK" FT CDS 3168..4109 FT /product="Gypsy9-NV_I_4p" FT /translation="MYLIIIDAHSKLIDIKEMSDITAKNTIGAFKEYFSVW FT GLPNALVTDNGPTFTSEAFKDFLDKNNVKHHRTALYHPASNGAAENAVRSF FT KNKFKKLCKDKLSRQDALLRYLLYYCSTPHSTTGVCPAELQIGKKLRTRLS FT AISSVACKNVKRSQERQCRNFKGNRNIEFKKNDVVMAKDYAVDKWRVAEVV FT DKLGPVTYSVSTNDTRTWKRHTDQLRSYENPMYAQSNVSTDNVNSGFLIPN FT SFCKEPSIISNSNFVKHQSFGIAKEATASQSEPMSTETVNINEQPNATIAS FT TGLSQNVRRSTRNRRAPERLNL" XX SQ Sequence 4135 BP; 1384 A; 742 C; 980 G; 1029 T; 0 other; ttggcgacga ggattaagtt cgtaaacacg tgagacacag taataggaat cacggatacc 60 tatcaagatg tccacaaaag aagagattga acgtttgaca gcagagctga acaaacaaaa 120 gaaggaggta gaccaaaaga tagctgaact gcaggcatta gacaaaaagg taaaagaaga 180 taacaaaaag cttgaagcga caagacagct tgttgagcta ggtaatagca gtgcaagaca 240 cacgacgttg gtgcgaggca caataggcag catcacggaa ttctcgatgg atgaagattg 300 gcaaatgtgg tacgagcgct tagaacagta ctgtgtaaca aacgaggtac cggcggagaa 360 gcatgtgtct ttgtttctca cgcttcttgg aaaggacggg tacacattgc tgcggaactt 420 gtgtacgccg aggataccgt cgcaaatgtt aataatcgaa ctagccaaga taatgaaaga 480 ccacctgcaa ccagcaccga gtgtaataac ggaacgctat aaattcaagg aatgtaggca 540 gcgaacgggt gaagacatca aaggatatgt ggctaactta aagaagatgt cgacattctg 600 cgaatttggc agcaacttgg agagtcacct tcgcgaccag ttcgtttggg gagtggcaag 660 cgaggcgatc aagaagaggc tactgggcga aacagacttg acctatcatc gggcaatcga 720 gttagcagca tcattggagt cagcaggtcg ggacgcagca gagatgggga cgacagcgtc 780 ggtggcggca gacgctgtca actacgtagc cgataagaag gttacgccag gtaaacaaac 840 gcgcgcgacc ggtggcgatt cgcggaattg ttattgttgc ggtagaccga atgaccgagc 900 atcggactgt acttttagga attacaaatg taacagttgc ggaaaaacgg gtcatcttac 960 aatagtagag tgtgtagaaa tagagatact caggaagcgg gtttgcggaa ttcgactaag 1020 tttgacaaaa ataaaaagag taattcaacg ggatcggacg aaaaacagaa ttacgtagaa 1080 gaagtagagt cgttgggtaa aatgttgatt attaccgatg atatattttt gatatatttt 1140 ttgtattaga ggaaatcaac gcaattcgcg aacatgacga cgctaaacca ttgtgtgtgc 1200 aaatgatgat tgagggggcc atgatggaat ttgaagttga cacgggctca ccgattacag 1260 ccatttcgtc gcgagtttat aaaaatgcga agaatttatc aaagctcaaa cttgagaaaa 1320 cgtcacgtgt tttcaaaaca taccatgata aaaaaatttg tccattagga actgtacagg 1380 ttaaagtaat ttataagaaa aaaagcttag ttttagaact ctttgtgtta cctagcgata 1440 gtgatacacc gattgtagga cgtcaatggc tacgtcacct aggtatgatt caggtaaacg 1500 gggataaaga aggggttaat gtactcacta tccacacgtt atgcggcatc agtatgtcag 1560 acttgttaag ggagtttgat tccgtgttct cgggcaaact tggtacttat agaggcggca 1620 aattgtcgtt acccttaaag cccaatgcag tgccagtttt ctacaagcct agaccagtgc 1680 cgttcgctat acgcaaaccg ttagataagg agttagaaag attaattcgc gagaacatta 1740 ttgaaccaag caatagttca gactgggcaa ctcctattgt cccagtaatt aagtcaaatg 1800 gagagatttt atttcaaaaa gaaatggaaa aactactaga tagcgttaaa ggtacagcgg 1860 tatattttaa tgatgttttt atatcgggaa aatcgcaagc agagcatgac agtaacttac 1920 gcaaagtatt aaatcgtttt aaagaagccg gactaacagt gaaattagaa aagtgtcagt 1980 atagtgagca gaaaatacag tttttaggct acgaattaga ttcaaaaggg ttacatgtgt 2040 ctgaaagtaa aatcaaggca atttaaaaaa tggaggcacc tacaaaagta gcggaattaa 2100 agtcgttttt aggagcaatg cagtgctacg cgaaattcat taaggatttt gcaaaagtag 2160 tgagtccact gtatgaactg acaagggagg atgtaaaatg ggactggact aaagagcgtg 2220 aagctgcgtt tcagacaaca aaaaaatgct tgatgtcaca tgatgtgcta gtgcattaca 2280 gcactgaagt tccgatcaaa gtaacttgtg acgcgtctcc aaagagcatt ggcgccgtcc 2340 tatctcacat ctttcccaat ggtgatgaaa aaccagttgc tttcgcgtcg agggtactga 2400 cgcgcgcgga gcgtaactac tcacagcttg atagagaagc gctagcgata gtatttggag 2460 ttaaaagttt ccaccggtat atttatgggc gtgattttac gcttgaaact gaccacaagg 2520 cattatcttt tatttttggc tctaagaaag gtattcttca gatagctgct agccgagtac 2580 agagatgggc aatatttctg tcagggtata attttcgtat caaatacatt aatggtaaag 2640 ataacggccc agctgatgcc ttgtcacgcg tgttagtaga cgagtttaag gcaaagcaca 2700 aggaatctga agaatacacg tatttgaact atttaagcga cgacattcag tccattaacg 2760 tagacactgt tagagtagaa acagcgagag atccgatact aacgaaggtg tacgaattcg 2820 tgtcaaatgg ttggccgaag aaagtagatg ataatttaca agcgtttaag aatagagaat 2880 ttgaactttc aatcgaaaac ggatgtatta tgtggggtca tatgctagtt ataccgtacg 2940 tgttgcgttt taattatttc caaaaattac atagcgcaca tgcgggaatt attaaatgat 3000 tttggtggcc taaattagac cacgaaattg aaaatatagt gaaatcttgc aaactatgcc 3060 tagaagtagc cgaaaaccca cccaaatcaa cattgcacgt ttggaaatgg ccagaagaac 3120 ctaaccagag gattcacgca gatttttgcg gaccagttaa tggttttatg tatcttatta 3180 ttattgacgc tcattctaaa ttgattgata ttaaggaaat gtctgatatc acagcgaaaa 3240 atacgatcgg ggcttttaag gagtatttca gtgtttgggg acttcctaac gctttagtaa 3300 cggataatgg accgacattt acgtcggaag ctttcaaaga ttttttggac aagaataacg 3360 taaagcacca ccgaactgct ctttaccacc ccgcatcaaa cggtgcggcc gaaaacgcag 3420 tgcgttcttt taaaaataag tttaagaaac tttgtaaaga taaactgtca cgccaggacg 3480 cgttactaag atacttactg tattattgtt caacgccaca ttcaacgaca ggcgtatgcc 3540 cggctgaact tcaaattggt aaaaaattgc gaacacgtct tagcgcaata agttcagtag 3600 cttgcaaaaa tgtcaaacgc agccaagagc ggcaatgtag gaatttcaaa ggaaatagaa 3660 acatagaatt taagaaaaac gatgtagtga tggcaaaaga ttacgcggtg gataagtggc 3720 gagtagcaga agttgtagat aagctaggac cagtcacgta ctctgttagc acgaatgata 3780 ctcgaacgtg gaagcgccac accgatcagc tgagatcgta cgagaatccg atgtacgccc 3840 aaagtaacgt gtcaaccgac aatgttaata gtggatttct aatacctaat agcttttgta 3900 aagaaccgtc tattatatcg aacagcaact ttgtaaaaca tcaatcattc ggaattgcaa 3960 aagaagccac ggcatctcaa tccgagccca tgagtacgga aactgtaaac attaacgagc 4020 aaccgaatgc aactattgct tctacagggc tatcacagaa cgtgagacgt tctactcgaa 4080 accgtcgagc acccgagaga cttaatttat aattgccaaa ttaaggcgtg aggat 4135 // ID Bag320_BR repbase; DNA; INV; 243 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Bacillus rossius rossius satellite Bag320 sequence. XX KW SAT; Satellite; Simple Repeat; Bag320_BR; satellite sequence. XX OS Bacillus rossius OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Phasmatodea; Verophasmatodea; Areolatae; OC Bacilloidea; Bacillidae; Bacillinae; Bacillini; Bacillus. XX RN [1] RA Cesari M., Luchetti A., Passamonti M., Scali V. and Mantovani B.; RT "Polymerase chain reaction amplification of the Bag320 satellite RT family reveals the ancestral library and past gene conversion RT events in Bacillus rossius (Insecta Phasmatodea)."; RL Gene 312, 289-295 (2003). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 243 BP; 69 A; 31 C; 30 G; 113 T; 0 other; ttcggttatt tcacaatttt ttgttcaaga agattacttt tccacacatt tcagtccatt 60 tctatgattt atttcgttac aatccattca tttaatcgtt gaaatttggt tctatttaaa 120 taagattata tcgttataaa tgggtttttt gatacctttg caagtaatta tatcaaactt 180 catttttttt atttaagtaa atttgggtgc aagatgattt atttctattg tatccagaga 240 ttt 243 // ID Gypsy-3_OD-LTR repbase; DNA; INV; 135 BP. XX AC CABV01000151; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_OD_; KW Gypsy-3_OD-I; Gypsy-3_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-135 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000151; Positions 84017 84151. XX SQ Sequence 135 BP; 32 A; 34 C; 32 G; 37 T; 0 other; tgtagtgctt actacttgat aattactaaa tgacttttgc tcgctatttt atcaggcctg 60 caaagagaaa ttggcagcgc cggcgggcag ctataacagc gccgcgcggt gcgttctccg 120 actcgattcc aatca 135 // ID Gypsy-3_PPP-LTR repbase; DNA; INV; 611 BP. XX AC ADBJ01000006; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PPP_; KW Gypsy-3_PPP-I; Gypsy-3_PPP-LTR. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-611 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2163-2163 (2010). XX DR GenBank; ADBJ01000006; Positions 94778 94168. XX SQ Sequence 611 BP; 266 A; 92 C; 44 G; 209 T; 0 other; tgttctaaaa ttaacatttt agaattataa taatcaaatc aactgacact tcaataattt 60 attaagaatt caactatcac aattcaaaca gtttatggac acgatttcaa ataacttaaa 120 ttaataattc aactgtttaa caataatcaa atatattcga ataatatcaa cattcagtga 180 atcaacaaaa tatcaatcta ccaaagttaa cataaagaaa taaccaacaa gttaatctac 240 aatgaatacc aaactagtta atcaagaaca gatttctgag attgggcaac tgcactggtc 300 gaagaaatcc aacagtataa ataccatgga aatccattat agaaataaaa actaattaat 360 taattattca acattattca acattaccaa atattattct tatcttcaaa ttgcaaaata 420 aataaactat tatctttaaa tattatactg taaattatat ctctaaactt tattacattc 480 attaaatcta tattcctcgt accaaaacta ctcctaaagt aattcctaat cttatatcaa 540 agttgtaagt attatattta tatgtttata tgtatattat aatatgatat tattaccatt 600 gtaaaataac a 611 // ID I-45_AAe repbase; DNA; INV; 6935 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-45_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6935 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1318-1318 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. CC Both termini are uncertain. XX FH Key Location/Qualifiers FT CDS 440..1831 FT /product="I-45_AAe_1p" FT /translation="MSLSGGYPPLPGDPGGGTASYIDGTYTGATLPSYMDP FT QGNCGELIVLRMEPVNGKMPEHPFTLRQSIEKRVNGRIEGAIPEAQGRSYA FT LKVRSRHHIERLLTMTQLTDGTAVKVSYHPGLNSTRCVISCRDLMKVKDDN FT EILDCLKDQNIIGIRRITRKVGEDRELTPTVILTVSGTTLPEHIDIGYQRI FT RTRPYYPAPMLCYQCYQFGHTRQKCQKQIPTCGNCNQEHELTQGVRCQNPA FT YCARCKSSDHSLGSRKCPVYQNEDAIQHIRVDRGLSYPEARRAFEASTGQR FT SFAGIAVHSKDKTISDLSAKLEALSVQMVEKDSKIKXLEDQISSRNVPNSS FT NEFQRLEKLIINLQNQIQKKDERILNLEKALEKDSRLDLVRKHGTIEDLVA FT KVSLLQETVTKKDNEIRILRAHTKTTSLDSSNSKLQRQQPTKTQVPPPHFP FT ALKGTTTRKIEQARQHQSP" FT CDS 2165..6577 FT /product="I-45_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MSYKPKIISCRDKNNMHQTEDDTPNKLLNKLSFPANI FT AAQTALPNVQTLGLETSGSRDNFTLPLLDSEGLINTSSKQDKASGSRGPTS FT AEDFGEPELVDNLRHPLANSLEINCVGTIGTPFVENNGSRDNIILPHPDFD FT RTKKTSQQDKVSDSRGPTSAEDFAGPEPVDNLRRSLANSFGTVSEGVKALH FT CKENNGSHDNIILPQDDEQTNKENASSSRSTTSVRASCESDRLDNPRRLLA FT PRQETVPPAQLQFDNPIPPEARSDAQKYDQWKYRTDNAAIPAAGPSSLLTF FT IPVSSISSPLERPTLAVSACEDPLRSASSLKSSCKPVGLARRRSTRTRTPP FT SHPDNASSKKTKHSSLAIQWNMNGYSNNLADLELLIAESTPVIIALQEIHR FT SDTKTMEKTLQGQYNWILKGGANIYQSAAIGIHRSTTYVDITPNSDLIAVA FT ARVESPFPITVVSIYIPTLDCKNIEQRLSNFLDQLLPPFLILGDFNAHHVS FT WGSRRNTEKGFQILRASEKANLRILNDGSPTFTRPLISSAIDISFASENLL FT GRLQWSISKDPMGSDHHPISIYCNFPTPKTTRRPRWKYELADWQLFQNTIE FT ISPIEDDTPLEGLISTILTAASEAIPKTSPNAGRRALRWWSPETKTAIKLR FT RKALRKMKRIPKDHPNAEKIAADYCSKRNACRDIVRAAKRKSWDEFLNGIS FT SDSSSSELWKSINALQGKRGINGIAIKTSNGISRDPKEIANILGGFFAKLS FT SRSGYNPDFTQLDDFSTTVESISIPRSRPGDRLDKPFSSAELSFALGNGKG FT KSAGPDDIGYPLLRHLPPIGKRVLLKNYNRIWEDRTFPDQWRHSLVIPIPK FT NDSNRTEPSGYRPISLTSCCGKVMERMVNRRLTQFIRDNDLLDHRQHAFRQ FT GCGTNTYFATLGQVLDDALEKGLHVDLATLDLSKAYNRTWTPGVLRTLASW FT GISGNTFAFIRNFLTNRTFQVCIGNTKSNSFPEETGVPQGSVLAVTLFLIA FT MNGVFETLPKGVFIFVFADDIVILVAGHKPTLIRRKLQNAIKAVHDWALSI FT GFDLSPSKSHFSHICSGRHRTKKNPPTIDGEEIPFDATIRVLGLYLDRHLS FT FQPHCDRIKKQCESRLNLLRIITSKHTKNNRDVGLRVTKAIVCSRLLYGAE FT LLCRNSENVISKLAPTYNRCIRSLSGLLPCTPATSTCVEAGIPPFRYIFAI FT SVCSKAVSFLEKTSGDDEGVFLLKEANKLLQQFAHQILPPVSTVHWVGDEA FT WDFSPPRIYWTLNTPTKQNLNPTTVIPRFNELIKRKLNGYELFYTDGSKTS FT SEVGFGIFGPNINKCGGLPKSTSIFSAEATAILSAVTEYSFTWKAIVTDSA FT SVLRALENPKNKHPLVQAIRTASNSNTIYVWVPSHCGIKGNEAADKLAAEG FT RSKIPAHPEVPAADLKIWIKTVFTSTWENEWNLARDSFLRKIKGDRHKKLE FT GPKNTF" XX SQ Sequence 6935 BP; 2123 A; 1828 C; 1458 G; 1525 T; 1 other; acatcacatc gacacgttcg gtcgcttact atttcacggt cgagtttttt tcgcgtttct 60 cttcaagtaa ttgcgctaat tacttccgaa aaatagtgaa aaatatccct tcgctgccag 120 tgatttagtg gatagtgaag attttgataa cagctgtccc agtttagtgg aaaaagagat 180 ttagttttac aattgctcaa acgaaaggac gccacacaac gtcgccatcg tgataaaata 240 accccacatc agctttctca ctcgaacttg cacgcaagtt cgtttgcgcc gcttgcatct 300 tctgctcgag ccagtcactc aaatcgacgg ttggttttcg cgtacgcagg ataagaaaat 360 aattaactag cacttaaaaa gaaacatacg aaatactcga ttcattggtg ttccacataa 420 tcaacacgcc ccaccagcga tgtcgctgag tggcggctat cctcccctcc ccggtgatcc 480 tggaggaggg actgctagtt acatcgacgg aacgtacact ggagctacat tgcccagtta 540 tatggaccca caaggaaact gcggtgagct gatagtgctg cgcatggaac cagtgaacgg 600 caaaatgcca gaacatccgt tcacactacg tcaatcaatc gagaaaaggg taaacggaag 660 gatcgaagga gcaattcctg aagctcaggg ccggtcatac gcactgaaag tcagaagtag 720 gcaccacatc gaaaggcttc taacgatgac gcaactcacc gatggtaccg ctgtcaaagt 780 ttcctaccac ccgggactca actcaacacg atgcgtgatc agctgccgcg atctaatgaa 840 ggttaaggat gacaacgaga tccttgattg cctgaaggac caaaacatca taggtatccg 900 tcgaataacc aggaaagtag gtgaagatag agaactcaca cccacggtga ttctcaccgt 960 aagtggaact acacttcccg aacacatcga catcggttac cagcgtattc gaaccaggcc 1020 ctactaccct gcaccaatgc tctgctacca gtgctaccaa tttggccaca ccaggcaaaa 1080 atgccagaag caaatcccga cgtgtggaaa ctgcaaccag gaacatgaac tgacacaagg 1140 tgttcgatgc caaaaccctg catactgcgc tcggtgtaag tctagcgacc attcgctcgg 1200 cagccgtaaa tgtccagtct accaaaacga ggacgcaata cagcacatac gagtggatag 1260 ggggttgtct tatcccgaag ccagaagagc ttttgaagca agcactggtc aacgttcttt 1320 cgcaggcatt gcagttcata gcaaagataa aaccatctcg gacttgtcgg cgaagttgga 1380 agctctctct gtacaaatgg ttgaaaagga tagcaaaata aagastctcg aagatcaaat 1440 ctcctcaaga aatgttccca acagcagcaa cgaattccag cgtctcgaaa aactgattat 1500 taacctgcaa aatcaaatac aaaagaagga tgagaggatc ctaaatctag agaaagcctt 1560 agagaaggac tccaggttgg atctggtccg caagcatggg accattgagg accttgtcgc 1620 taaagtctcg ctccttcaag agaccgtaac taaaaaggac aacgaaatta gaattctacg 1680 cgcgcacacc aaaaccactt ctctcgacag cagcaactcc aaattacaac gacagcaacc 1740 caccaaaact caagtccccc cccctcattt cccggctctc aagggcacaa caaccagaaa 1800 gatagaacaa gcaaggcaac accagtcacc gtagcgcaga acctatcgat agtccccgcg 1860 acgccgatgg atgtagtcgc ttctgggacc aacaccggaa caaaaccgaa attgaggaaa 1920 aaaaatgcag aaaaaaacag cggcgatctc catgctaaac gaaccaaggc caacgaatca 1980 gtaatgtaca tctccgatac agaagatttt ccagcacctg aaaacgatat ttgcaatgag 2040 tcagaggtag aacaccacat ttcgtcgtct tcagacgaag caatggaaga gagcgtggca 2100 tcaggaacat tttagcaagc tacgtcctac ctttttccac catccgtaat aacaaataac 2160 tacaatgagt tataaaccaa aaattatttc atgtcgagat aaaaacaaca tgcatcaaac 2220 agaggacgat acccccaaca aactcctgaa caaactttcc ttccccgcta acatagcggc 2280 tcaaacagca ctcccgaacg tccaaacctt gggattggaa acttctgggt cgcgtgataa 2340 cttcactctc ccccttctgg actccgaggg actcataaac acatcaagca aacaagataa 2400 ggcctcgggt agtcggggcc ccaccagtgc ggaagacttt ggtgaaccgg aactggtgga 2460 caacctccga catccgttgg caaactcttt ggaaataaac tgcgttggaa ccatcggtac 2520 tccgtttgta gaaaacaacg ggtcgcgtga taacatcata ctcccccacc cggacttcga 2580 cagaaccaaa aaaacaagcc agcaggataa ggtctcggac agccggggcc ccaccagtgc 2640 ggaagacttt gctggaccgg aaccggtgga caacctccgg cgttcgttgg caaactcctt 2700 tggaacggtc agcgagggag ttaaggcttt gcactgtaag gaaaacaacg ggtcgcatga 2760 taacatcata ctcccccagg acgacgagca aactaataag gaaaatgcct caagtagtcg 2820 tagcaccacc agcgtgagag cctcgtgtga atcggatcgt ttggacaatc cccggcgttt 2880 gttggctccc agacaggaga cagttccgcc tgcacagtta caattcgata atcctatccc 2940 tccggaagct cggtcggatg cgcaaaaata tgaccaatgg aagtacagaa ccgacaacgc 3000 cgctattcca gctgctggcc cttcatcctt attgaccttc attcctgtga gctcaatatc 3060 ctcaccgcta gaaagaccca ctttagctgt ttctgcctgt gaagatccat tgcgatctgc 3120 ttccagtctt aagtcttcct gcaaacccgt aggactagca cgcagacgaa gtacgcgaac 3180 aagaacccct ccctcacacc cagacaacgc gagtagcaag aagacgaagc attcatcgct 3240 agcgatacag tggaatatga atggctattc gaacaacctt gctgatctag aactgctcat 3300 agcagagtct acacccgtta taattgccct tcaggagatt catagatcag acaccaaaac 3360 aatggaaaaa actctacaag gccaatacaa ctggatcctg aaaggtgggg caaacattta 3420 tcagtctgca gcaattggaa tccatagatc aacaacctac gttgacatta ctcctaactc 3480 agatctgatc gcagtagcag ctagagttga atctcccttt ccaataaccg ttgtcagtat 3540 ttatatacca acattggatt gcaagaacat tgagcagagg ttgtccaatt ttctcgatca 3600 gcttctcccg cctttcctga tcctgggcga ttttaacgcg catcatgtta gctggggttc 3660 acgaaggaac actgaaaagg gcttccaaat tcttcgggcc tcggaaaaag caaaccttcg 3720 tatactgaac gacggctctc ccactttcac gcgaccgctt atcagttccg ccatagatat 3780 atctttcgct agcgaaaacc ttttaggacg ccttcagtgg tcaatttcca aggatcccat 3840 gggtagtgac catcatccga tatcgatcta ctgcaacttc cccacaccaa aaacaaccag 3900 acgtccacga tggaaatatg aattagctga ctggcaactt ttccaaaata ccattgaaat 3960 cagccctata gaagatgata ctccactaga agggttgatt agcacaatcc ttacagcagc 4020 ctccgaagcc attccgaaaa caagcccaaa cgctggccgt cgggcgctaa gatggtggtc 4080 acctgaaacc aagaccgcta ttaaattgag gaggaaagcc ctaagaaaaa tgaaacggat 4140 tccgaaagac caccctaacg cagaaaaaat agctgcggac tactgttcaa aacgcaacgc 4200 atgccgggac atagttcgtg ctgcaaagcg caaatcatgg gacgaattcc tgaacggtat 4260 aagcagtgac agctcatctt ccgaactgtg gaaaagcata aacgcgttgc aagggaaacg 4320 cggcatcaac ggtattgcca ttaaaaccag caacggcatc tccagggacc ccaaagaaat 4380 tgcgaacatt cttggtggtt tcttcgctaa attgagctct cggagcggtt ataaccctga 4440 ttttactcag cttgacgatt tttcgactac cgttgaatcc atctcaatcc cgcgtagccg 4500 cccaggtgat cgattagaca aacccttttc ttctgctgaa ttgtcctttg cattaggtaa 4560 cggtaaagga aaatcagctg ggccagatga catcggctac cctctccttc gtcaccttcc 4620 acccatcgga aaaagagttt tacttaaaaa ttacaataga atttgggaag atagaacttt 4680 tcccgatcaa tggcgacaca gcctagtcat tcctattccc aaaaatgact cgaatcgtac 4740 tgaaccgagc ggataccgcc ccatctcctt gaccagctgc tgtggtaagg tcatggaacg 4800 aatggtcaat cgaagactaa cacaattcat ccgtgacaac gatctcctcg accatcgaca 4860 acacgcattt cgccaaggat gtggcacaaa cacgtacttc gcaacactcg gccaagttct 4920 agacgacgcc ctcgaaaaag gcctccacgt ggacttggcc acactggacc tttcaaaggc 4980 gtacaatcgt acttggactc caggagtgct tagaacactg gccagctggg gaatcagtgg 5040 caacactttt gctttcatcc gtaattttct gacgaacaga acctttcaag tatgcatcgg 5100 aaacacgaaa tcgaactctt ttccggagga aaccggcgta cctcaaggct ccgtgttggc 5160 ggtaaccctc ttccttatcg ccatgaacgg agtttttgaa acgttaccaa aaggcgtttt 5220 tatatttgtg tttgcggatg acattgttat cctcgttgca ggacacaaac caactttgat 5280 ccgcaggaaa ctccaaaacg caatcaaagc agtacatgat tgggcactat ctataggctt 5340 cgacctatca ccatccaaaa gccatttctc ccatatatgc tcaggaaggc atcgcaccaa 5400 aaagaatcct cctaccatcg acggcgaaga aattcccttt gatgctacga tccgcgtcct 5460 gggattatac ttagaccggc atctgagttt tcaaccgcac tgcgatcgta tcaaaaaaca 5520 atgtgaaagt cgactaaacc tgctacgaat tatcaccagc aagcacacca aaaacaaccg 5580 cgacgtgggt ctgcgtgtta ccaaagcaat tgtctgcagc aggctactgt acggtgccga 5640 actgctgtgt agaaactccg aaaacgtgat cagcaaactt gcaccgacgt acaatcggtg 5700 catccgcagt ttgtctggac tcctaccctg tacaccagca acatccacat gcgtcgaagc 5760 tggaattccc cccttccgat acatttttgc gatatccgta tgctccaaag ctgtcagttt 5820 cctggagaaa acaagcgggg atgacgaggg agtgttcctc ctcaaagagg caaacaaact 5880 ccttcagcaa ttcgcccacc aaatcctccc cccagtgtcc acggtccact gggttggaga 5940 tgaagcgtgg gacttctctc cacccaggat ctactggaca ctcaatacac caaccaagca 6000 aaacctgaat cccactactg tgattccacg tttcaatgaa cttatcaaaa gaaagctgaa 6060 cggctatgaa ctcttctata ctgacggttc aaagacttct agcgaagttg gctttggcat 6120 tttcggaccc aatatcaata aatgcggagg cctacccaaa tccacctcga tcttttcggc 6180 cgaagctacg gctattcttt cggcggttac agaatacagc tttacctgga aagctattgt 6240 taccgactca gctagtgtac tgagagctct tgagaaccca aagaacaaac accctttggt 6300 gcaagcaatc agaacagctt caaacagcaa caccatatat gtctgggtcc ccagtcattg 6360 cggtatcaag ggaaatgaag ctgctgacaa gctggcagca gaaggcagat caaaaatacc 6420 agcgcaccct gaagtacctg cagccgattt gaaaatatgg atcaaaacag tttttacttc 6480 aacatgggag aacgaatgga atctagctcg agattctttt ctacggaaaa taaaaggtga 6540 ccgacacaaa aaactggaag gaccaaagaa tactttctag actacgctgt ggttacacta 6600 gattttcaca cgacctcgga tcccgggatg gtttcctcaa acaatgctcg gtgtgctcaa 6660 cccatatgtc ggtagagcac ttaattataa attgccctgc attccaaatc atccgagatc 6720 aacatgacat cggcctgagc attagagacg cattatctaa tgacgcagac agagaaaaat 6780 cactaatcga attcctcaaa atcaccggct actaccaatc aacatagacc tgcgtaacta 6840 cttacaacta ggtaaacgga tgggactaga tgaaacttgc ttactgacag gacgaacgac 6900 gatgacaccc cgaaggccag aaactaaaga aaact 6935 // ID CR1-81_AAe repbase; DNA; INV; 2715 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-81_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2715 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1169-1169 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >95% CC identity. 5'-truncated. XX FH Key Location/Qualifiers FT CDS 2..2239 FT /product="CR1-81_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="SQHQRGGGVLIAVKKALQSNVILINDANQLEQISVRI FT TYGTMSLVVCTVYLPPNTEPALYQQHAASVNQLCKLLGTQDEIVVLGDYNL FT PYLRWRFDDDIGAFLPINASSEQEISLVEDIVATGLMQMNHLLNSNERLLD FT LAFVSNASVCELLEPPLPLMKVDRHHTPFVLTLDVIQTPLENRKNETFFDF FT RQCNYDELNAEIAQINWRDILAFGSLDDAVTRFYSELNVIFERHVPRRSRA FT CRPDNKQPWWNAQLRNLRNRLRKARTRYFRVRNRLNRDAVRDLESQFNELN FT VSCFQSYIDRLESNMKDDPKQFWSYLRSRTTSRGFPQLINYDGKTATTPDD FT AVKLFSSFFQSVLSITPPPSSASYLSALKTFDLNLQPTSFTQHEVYKKLCS FT IDGSKGPGSDGLQPSFIKACSASLAEPAALLFNRSLAESVFPAKWKEALIT FT PIHKAGNVHDVKNYRGISILSCLPKVFESMLLDFLYPAVRHIITEDQHGFV FT RKRSTTTNLMSYVSTLIDKIEKRQQIDAVYVDFTKAFDRVPHQLAVEKFQR FT MGFPDWLTKWVLAYLINRSASVRLGTTLSDPFEITSGVPQGSHLGPLLFIL FT FVNDLCSELKSSKVMYADDLKIFRVINSPVDCCALQMDIDKILDWCSRNGM FT DVNIAKCNVITFTRKRSPTVFEYSMHGATIERVRLVKDLGISLDCKLRFVE FT QISSVIAKAYAVLGIIRRNTKDFRDVYCLKALYIYHLFAAFWNMEL" XX SQ Sequence 2715 BP; 768 A; 598 C; 573 G; 776 T; 0 other; tagtcaacat cagcgaggtg gcggagtttt gatcgctgtt aaaaaggctc tacaaagtaa 60 tgtaatacta attaacgatg ccaatcagct tgagcagatt agcgttcgaa tcacctatgg 120 aactatgtca cttgtcgttt gtactgttta tcttccaccg aacactgaac ctgctcttta 180 tcaacaacat gctgcgtctg ttaatcagct atgcaaactg ttgggcacgc aggatgaaat 240 cgtagttctt ggtgactaca acttgcccta ccttcgttgg agatttgatg atgacattgg 300 agcattctta ccaatcaatg cttcttcaga acaagagatt tctttggttg aagacattgt 360 ggctacagga ttaatgcaga tgaatcactt gctcaactcg aatgaaaggt tactcgatct 420 ggcatttgtg agtaacgcca gtgtatgcga attactggaa ccacctttac ctctgatgaa 480 agtcgaccgt caccatacgc cttttgtttt gaccctcgat gttatacaaa ctccattaga 540 aaataggaaa aacgagacct tcttcgattt ccgccaatgt aattatgacg aattgaatgc 600 tgaaatcgca caaataaatt ggcgggatat tttagccttt ggatcattgg acgatgctgt 660 taccagattc tacagtgaac tgaatgtaat tttcgaacga catgttcctc gcagaagtcg 720 tgcatgtcga cctgataata aacaaccatg gtggaatgct caacttcgca atcttcgtaa 780 tcgactgcga aaagcacgga cacgttattt tcgagtgaga aatcgtttga acagggatgc 840 agtacgtgat cttgaatctc aattcaatga actgaacgtg tcatgcttcc agtcatacat 900 agaccgacta gagagcaaca tgaaagatga tccgaaacaa ttctggtcct atcttcgcag 960 tcgaacaaca tctcgtggat ttcctcaact cataaactac gacggcaaaa ccgcaacgac 1020 gccagatgat gccgtgaaac ttttctcatc tttcttccaa agcgtattga gtattacccc 1080 accgccctct tcagcatcct atttgagcgc tctcaagaca ttcgatttga atctgcagcc 1140 tacatcattc acgcaacacg aggtttacaa gaagctatgt agcatcgatg gttccaaggg 1200 accaggatcc gatggacttc aaccttcctt catcaaagcg tgctctgcgt ctttagctga 1260 accagcagct cttctgttca accgttctct tgctgaaagt gtttttccag ccaaatggaa 1320 ggaggctctt attacaccga tccacaaagc cggcaacgtc catgatgtca aaaactacag 1380 gggaatttct atactcagtt gtctaccgaa agtttttgaa agcatgctgc tagattttct 1440 gtatcctgcg gtccgacaca ttattactga agatcagcat gggtttgtca gaaagcgatc 1500 tacaactacg aacctaatga gctacgtgtc gacactgatc gataagatag aaaaacgaca 1560 gcaaatcgat gcagtctacg tggattttac taaagccttc gatagagttc ctcatcagct 1620 agcagttgaa aaattccagc gaatgggttt ccccgattgg ttgacaaaat gggttttggc 1680 gtacctcatc aaccgctcag catcagtgcg acttggaaca acgctttcag atccattcga 1740 gattacttcc ggcgtcccgc agggcagcca tcttgggccg cttctgttca tactatttgt 1800 gaatgacctc tgcagcgagt tgaagtcatc gaaggtgatg tacgcagatg atctgaagat 1860 ttttcgagta atcaattcgc ctgtggactg ttgcgcctta cagatggaca ttgataagat 1920 cctagattgg tgttcaagaa atgggatgga tgtcaacatc gcgaaatgta atgttatcac 1980 cttcacgcgg aaacgttcac cgacggtttt tgaatattca atgcatggtg ccacgattga 2040 acgagttaga ctggtcaagg atctgggtat ctcgcttgac tgtaagctcc gttttgtcga 2100 gcagatttcg tcagtgatag caaaggctta cgcggttctt ggaattatca gacgcaacac 2160 taaagatttc agagacgtgt actgccttaa ggctttatat atatatcact tgttcgcagc 2220 attttggaat atggagttat agtttgggcc ccgtatcact ctgtacacat caatcgactt 2280 gaacgagttc aaaaaagttt tattaggtat gcactccaga gactaccttg gcgaaatagg 2340 atgctgctgc cgccatacga gcatcgctgt gcactcgtcg acttacctac actacgcaac 2400 aggagaactc ttctaaagca gcttttcatt tttgatctgc tcgaaaataa catcgactgc 2460 tctggtctgt taggcaaatt aagtttcaac acccctgtga ggctcctacg gaggactgca 2520 ctattccgac acgctgctca tcgaacgtta tatgggcaaa ataatccgct tgatgtatgt 2580 tgtgaaatgt ttaataatgt gtatcacttg tacgatttta acatagctaa ggaaacattt 2640 agattaagga ttctacggaa ttgaagtctg tgcgatgtat ttttatttaa atcgaagacc 2700 aaataaatca aatca 2715 // ID BEL-167_AA-I repbase; DNA; INV; 6510 BP. XX AC supercont1.348; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-167_AA_; KW BEL-167_AA-LTR; BEL-167_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6510 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.348; Positions 904168 897659. XX CC Positions [5558-6118] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 675..5414 FT /product="BEL-167_AA-I_2p" FT /translation="MNQSPKSVGCAPDVTVVKTPSNNRISQCAPGTSRNNQ FT RSNPPPRDAASNRSVGKSSRRSDTKRRIELQLQKLEEEKTLQLKYLEQKYS FT LLEELENESSPAASEINSFVDNATKVEQWMADTANCGDDSGLVDVNVVDEV FT GLNYSSGEEAEPESRVKFSEVQAHSSTRLTKSRIPNRTFDLHIPFSHPSRE FT ETVSNYQQVQRAQSTGRTEIPFSHPPREAMVQTYQRMPRHQTTGRSEVHFQ FT AANHREAAAYRNPQNFIPRQGSTPVHQSRSMDPAVMNDNEDIVCSLNRSQM FT AARQAVSKDLPEFSGNPEDWPLFFSIFNSSTQLCGFSNEENMLRLRKCLTG FT KALDAVKCRLLHPSNVQGVMTTLKMLYGRPETIIQAIVRKIRSLQSPNIER FT LDTVIQFALNVENLVATVEACEIGDFMYNASLRYELVEQLPPTLKLDWAKN FT SRNNPSPNLLDFSLWLRSIAEDASAVSISVGSEYRSRAGKKDGYLNLHAEE FT EPMGNKSNAQTTQKSTKEVIRECVGCKGSCTTLAQCERFKELGCDSRWAVI FT REFKICRKCLRQHNGPCKQRKECGTDGCSFLHHPLLHSDRKQSTGSNPMSA FT ATPAQSQSNPTTNTSPTCNVHQAQSEILFRIVPVLLHGPSKTLKTYAFVDD FT GSELTLMEQELANELGVQGPKKSLCLRWTGGTNRTEGLSQVVSLQVSSITN FT PTNKFELSAVHTVGNLQLRPQTLLVPEMQKKYRHLDGLPLESYQEVSPRLL FT IGLDNASLGNVMKCREGKLNEPIAIKTRLGWTIYGSCSRDTNASAQINHHS FT VELRQCDCKSDEDLHDTMKQYFSLESLGISKPNQLLLSNEDRRANDLLETL FT TRRTNGRYESGLLWRYDNVRLPDSRPMALRRWECLDRRMQKDRQLAQALTE FT KVNDYLAKGYIRKLSTEELETRHARIWYLPMFPVVNPNKPGKTRLVWDAAA FT TSHGVSLNSVLLKGPDQLTSLLSVLIRFREFRVAVSGDIREMYHQVQIRPA FT DQHCQRFLWKENESDSSPSTYVMQVMSFGACCSPSVAQYVKNTHAMKYEHN FT YPEAVHAIIHQHYVDDMLISAETEDKAIQLADEVKKIHESGGFEMRNWVSN FT STTVVAALEGEGAKEKSMSIGEANTNEKVLGMWWDTSSDCFMFKLSARHDA FT TLLSGCKRPTKREVLRTLMMVFDPLGLISHFLMFLRTLLQEIWRSSVDWDE FT QIHDTHFEKWLSWLRILPQVASIRIPRCYRSVTSLETGGIVQMHTFVDASE FT NGFAAAVYLRFEEGSIVECSLAGAKTRVAPLKFISIPRSELQAAVIGVRLA FT DTILGSLTTKVQKRFFWTDSRDVMCWLHSDHRRYSQYVGVRISEILDTTNL FT GDWRWVPTKLNVADEGTKWKGTLDLTNSSRWFRGPEFLWKPEEEWPASSQS FT VQVTNTELRPYLNLHIKTLEPIVNVARFSRWSRLLKTTAYVFRAIKNLQRS FT IRKTSRTTGPLIREELVRAESYLYQIAQRSTYDEEILAMSSNPSGQISKSS FT PLFRLCPFMDEKGGSCGSVGRTNACQLKIVASSILSFFLVSTVSQDFWYLT FT FIRNFCTKITRQQ" FT CDS 5528..6508 FT /product="BEL-167_AA-I_1p" FT /translation="MGDLPSCRLAVYSRPFSHMGVDYFGPMTVSIGRRTEK FT RWGVLVTCLTVRAVHLEVAHSLSADSCIMALRNVIARRGVPVVIYSDRGTN FT FVGSNTELKAALEVLDQEQLIKEFTTTHTKWSFIPPLSPHMGGAWERLVQT FT VKPNLGRLRSNRRLTEETLRSMLIEVEQIVNSRPLTDIPLDDDQSPVLTPN FT HFIMGSSNGLPPWTCFDDNPVSLKENWRLSQIMANQFWKQWLHDYLPSLTR FT RAKWFTEVKPIEINDIVVIVDERLPRNCWPKGRVIATKVAQDGQVRGATVQ FT TISGIYERPAVKLAVLDVGVRGNATQMGLRCIKGG" XX SQ Sequence 6510 BP; 1866 A; 1553 C; 1578 G; 1513 T; 0 other; gatcttgtaa ccttaaacta atgtcttctt cttgtgctgc aggaattttg taactggatt 60 aataaacttc catttcacaa aattgggatt ttcgtcttct atcgccaaca aattacaaaa 120 ttcttcaaaa gactacggaa tggcgagcaa acattccagc ggtagctccc atggccgacg 180 ggatgccggt acacaaggcg gtgtaaccga tgtaggtgat gcgaatagta gcagcaagaa 240 ccatccggag caaattcagc cggagacgac aggacaatta cagttggaga atgctgtcca 300 gccaagtaca cctgcagcgc aaggacagac agtaccacag acggtaatcc atcaaagaag 360 tatgtctcag tcggaggtag tcgctgaatc tcttcctcgc caacagcagc aggtaatccc 420 caccgaagtg gtgagtaccg aaaagggagc tattcataag tctcgtggaa tacaaccctt 480 agctatgaaa aataaagatt tagctgaggt tccccatatc ccccctgtca acgtgcctct 540 cagacgaaat cccccgcgcc gcgtccgcga aaagatcttg agcagttgcg ggctatgcac 600 cgaggcagac gacgacagaa tggtccagtg tgacgattgc gaaacttggt accacttcga 660 gtgcgtcgac gtaaatgaat caatcgccga aatcagttgg ttgtgccccg gatgtgacgg 720 ttgtaaagac cccaagtaat aaccgaatat cacagtgtgc tccaggaact tcacgaaaca 780 accagcgttc gaatccacct cccagagatg ccgcttcgaa caggtcggtt ggtaaatcca 840 gtaggcgcag tgacactaaa agaagaatcg agctgcaact acaaaaactt gaagaagaga 900 agacattgca gttgaagtat ctggagcaga aatacagttt gttggaagag ctagagaatg 960 aaagttcacc tgcggcaagc gaaataaatt cgtttgtcga caacgcaacg aaagtggaac 1020 aatggatggc tgacacggct aactgcggcg atgattctgg ccttgtcgac gtcaacgtag 1080 tagatgaagt cggtttgaac tactcttcgg gcgaagaggc agaaccggag tccagagtga 1140 agttttccga agtgcaagca cactcttcca caagactcac aaaatctaga ataccgaaca 1200 gaacttttga tttgcatatt ccgttctcac acccctcccg tgaagaaacg gtctcaaatt 1260 atcagcaagt gcagcgtgca caatccaccg gtcgtacgga aattccgttt tcacaccctc 1320 cacgtgaagc aatggttcaa acctaccagc gaatgccgcg tcaccaaacc accggtcggt 1380 cagaagtcca ctttcaagcc gcaaaccata gagaagctgc agcctatagg aacccacaga 1440 atttcattcc acgacaaggt tctactccag tccaccagtc aaggtctatg gatcccgcgg 1500 tgatgaacga taatgaggat atcgtctgct ccctgaatcg cagccagatg gccgcgagac 1560 aagccgtttc gaaggacctg ccggagttta gtggaaatcc agaagactgg ccgctattct 1620 tttcgatctt caattcgtcc actcagctat gtggattttc aaatgaagaa aatatgctaa 1680 ggcttcgaaa gtgcctcaca ggcaaagcct tggatgcagt taagtgccgc ctacttcatc 1740 cctccaacgt acaaggagta atgaccactc tcaagatgct ttacgggagg cccgaaacga 1800 ttatacaagc cattgttcgg aaaatccgct cgctacagtc tccaaacatc gaaaggctgg 1860 ataccgtaat ccagtttgcg ctaaacgtcg agaatttagt cgcaacagtc gaagcatgtg 1920 aaataggcga tttcatgtat aatgcatccc tgagatacga gctggtggaa caactgccac 1980 caaccctaaa gctagattgg gcaaaaaatt cccgcaacaa cccaagtccc aacttgttgg 2040 acttcagctt atggttgcgg tccattgcgg aggatgctag cgctgtatct atttctgttg 2100 gcagtgaata tcgttccagg gccggaaaaa aggacggtta cttgaatctt cacgctgaag 2160 aggagccaat gggtaataag tcaaatgctc aaacgactca gaaatctacc aaagaagtta 2220 ttcgagaatg cgtaggatgc aaaggaagct gcacaactct tgcgcagtgt gaacgcttca 2280 aggaacttgg atgtgactct agatgggcgg taataaggga gttcaagata tgtcgaaagt 2340 gtctgagaca acataacgga ccctgtaagc agcgaaaaga gtgcggaacg gacggctgtt 2400 cattcctgca tcaccctcta ctacatagcg acaggaagca gtcgacgggt tcaaacccta 2460 tgtcagcagc aacgcccgcg cagtcccaga gcaatccgac aaccaataca agtccgacct 2520 gcaacgtaca tcaagcacag tctgaaatct tgtttagaat cgtcccggtt ctccttcatg 2580 gcccatcgaa gacgctgaaa acatatgctt tcgtagacga cgggtctgaa ctcacgttga 2640 tggagcaaga attggctaac gagcttggag tacaggggcc gaaaaagtct ctatgcttgc 2700 gatggactgg aggaaccaat cgaacggaag gtctctctca agtagtaagc ctccaggttt 2760 caagcattac taatcccacc aacaaattcg agctttcagc tgtacatact gtcggaaatc 2820 tgcagcttcg accgcaaaca cttctggttc ccgaaatgca gaaaaaatat cgccatcttg 2880 acggtttgcc actagagtcg tatcaagaag taagccctcg tctcctcata ggtttggaca 2940 atgccagttt gggaaatgtg atgaaatgtc gcgaaggaaa gctaaacgag cccatagcga 3000 tcaagacgcg acttggatgg acaatatatg gaagctgctc cagggataca aacgcatctg 3060 ctcagatcaa tcatcacagc gtagaacttc gccagtgtga ctgtaaatcg gacgaggatc 3120 tgcacgatac catgaagcag tatttctcgt tagaaagctt aggaatcagc aagccgaacc 3180 aactattgct ttccaatgaa gaccgcaggg caaatgattt gctggaaaca ttgaccagac 3240 gtactaacgg acgatatgaa tcgggccttc tatggcgata cgataacgtc cgactcccag 3300 acagtagacc tatggcccta agacgttggg agtgcctaga ccgtcgtatg caaaaggatc 3360 gtcaacttgc tcaggctcta accgagaagg taaacgacta cttagccaag ggttacatca 3420 ggaaactttc tacagaagaa cttgagacac gccacgcccg aatctggtat cttccgatgt 3480 ttcccgttgt aaatcctaac aagccgggca agacaaggct tgtctgggat gctgcagcta 3540 cttctcatgg agtttctcta aattccgtcc tcttgaaagg acccgaccag ttaacttctc 3600 ttctatctgt tttgattcgg tttcgtgagt ttcgggtggc agtctccgga gacatccgcg 3660 agatgtacca ccaagtccaa ataaggccag cagatcagca ctgtcaacgt ttcctttgga 3720 aagagaacga gtctgattca tctccgagta cctacgttat gcaagtaatg tctttcgggg 3780 cgtgttgttc gcccagtgtc gctcagtacg tcaaaaacac ccatgcaatg aaatatgagc 3840 acaactaccc cgaagcagtc cacgccatca ttcaccagca ctatgttgat gatatgctga 3900 tcagcgctga aacggaggat aaggcgatac aactagcgga tgaagtgaaa aagatccacg 3960 aatccggagg cttcgagatg cgaaattggg tctcgaactc aacgacggtt gttgcagcgc 4020 tagaaggaga gggggctaag gagaaaagca tgagcattgg ggaggccaat accaatgaaa 4080 aggttcttgg aatgtggtgg gatacttcgt cggattgctt catgttcaaa ctgtccgccc 4140 gccacgatgc tacgctgttg tcgggctgca aaagacctac gaaacgcgag gttttacgca 4200 cattaatgat ggtgtttgac ccactcgggt taatcagcca cttcctgatg tttttgcgaa 4260 cacttttgca agagatctgg agatcatccg tcgactggga cgaacaaatt cacgacactc 4320 atttcgaaaa gtggctttca tggctgcgaa ttcttcctca agtggcaagt atccggatac 4380 ctagatgtta ccgttccgtt acgtccttgg aaacaggcgg aatagtccaa atgcatacat 4440 ttgtagacgc cagcgaaaat ggtttcgcgg cagccgtata cctgaggttc gaagaaggga 4500 gtattgttga gtgctcactg gcgggagcga aaaccagagt tgcaccacta aaatttatat 4560 cgataccacg ttccgaacta caagctgcag tcataggagt acgactggcg gacactatac 4620 ttggatctct cactaccaaa gttcagaagc gctttttctg gactgattcc agagacgtaa 4680 tgtgttggct tcattcggac catcgacggt atagccagta tgtgggagta cgaatcagcg 4740 aaattctgga cacaacaaat ctcggtgatt ggcgttgggt accaacgaag ctaaatgttg 4800 cggacgaagg gactaaatgg aagggcactc tagacctgac aaattctagc cggtggtttc 4860 gtggacccga gtttctctgg aaaccagaag aagaatggcc cgcatcatca cagtctgttc 4920 aagtcaccaa cacagagctt cgcccttact tgaaccttca tattaagacc ctcgaaccga 4980 tcgttaatgt ggctcgcttc agcaggtggt ctagattatt gaaaactaca gcatatgtgt 5040 tcagagctat caaaaatttg cagcgaagta ttcggaaaac ttctcgaacg accggaccac 5100 tcattcgcga agaactagtc agagccgaaa gctatctgta tcagatagcc caaagaagca 5160 catatgatga agagatcttg gcgatgtctt cgaatccatc aggacaaatt tctaaaagta 5220 gccctctttt ccgactttgt cctttcatgg atgaaaaagg gggttcctgc ggatccgtgg 5280 gacgaacaaa tgcttgccag ttaaagatag tagcatcgtc aatcctatca ttcttcctcg 5340 tgagcacggt gtcacaagac ttctggtatt tgacgttcat cagaaatttc tgcaccaaaa 5400 tcacgagaca gcaataaatg aattgaaaca gcgctattac attcctcgag ttaaagctgt 5460 ctacaaatcg gtacgcaaga actgccaatt ctgcaaaaac gaacgaattc gtccctgcgc 5520 gccgctgatg ggtgacttac catcttgccg ccttgctgtc tacagccgtc cattcagtca 5580 tatgggggtg gactacttcg gccccatgac ggtttccatt ggtcgccgaa cagagaaacg 5640 ctggggcgtc ttagtcactt gtcttactgt gcgtgccgtc catttggaag ttgcgcactc 5700 attatctgcc gactcttgta ttatggctct tcgcaacgta atcgctagaa ggggtgtacc 5760 ggtcgtcata tacagtgacc gtggaacgaa ttttgtgggg tccaatacag aactgaaagc 5820 cgctctcgag gtgttggatc aagaacagct cataaaggaa ttcaccacaa cgcacacgaa 5880 atggagtttc atacctcctt tgtctcctca tatgggcggg gcatgggaga gattagtcca 5940 aactgtgaaa ccgaatcttg gcaggctgcg ttcgaatagg aggctaactg aagaaacgtt 6000 gagaagcatg ctgatagaag ttgagcaaat cgtaaattca aggccgctga ccgatattcc 6060 ccttgacgac gatcaatctc ctgtactgac gccgaaccat ttcattatgg ggtcatccaa 6120 tggactaccg ccgtggacct gtttcgatga taaccccgta tcattgaagg aaaactggcg 6180 cttgtcccag atcatggcaa accagttttg gaaacagtgg ctccatgact atctaccatc 6240 tttaactcgt agagctaaat ggtttacgga agtgaagcct atcgagatca acgacatcgt 6300 ggtaattgtg gacgaacgtc tccctcgcaa ttgctggcca aaaggacgag tgattgctac 6360 taaggtagca caggatggcc aggtcagggg agctacagtg cagacgatta gcggaatata 6420 tgaacgacct gcggttaagc tcgctgtact agacgttggc gtaagaggaa atgctactca 6480 gatgggtctt cggtgcatta aaggggggag 6510 // ID BEL-646_AA-I repbase; DNA; INV; 6813 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-646_AA_; KW BEL-646_AA-LTR; Pao_Bel_Ele195; BEL-646_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6813 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5876-6424] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 3977..6813 FT /product="BEL-646_AA-I_3p" FT /translation="MFHQVGIRPGDRNSLRFLWSENSDGPVEIYLMDVATF FT GATCSPASAQYVKNLNAMEHLQQYPRAVEGILKSHYVDDYLDSFGSESEAK FT KVSAEVRLVHHNGGFHLRNWRSNSESVTATLGESAKEDSKNLYLESGEHVD FT RVLGMLWSSSTDELSFSTSMSEEVRQLLRNGTRPTKRQVLRCVMSLFDPLG FT LLSFFIIHGKVLIQDLWRAGTEWDEAVGEEVFDHWIRWTKMIEFVASIRIP FT RCYFPRATEETYKAAQLHVFVDASEIAYSCAVYIRTAARDGAFQCALVSAK FT TKVAPLKPMSIPRLELQGCVLGARLLKFVQDSHPIAFSKRFLWTDSTTARS FT WIRSDPRRYKPFVAHRVGEILESTDANEWRWVPSKLNPADEATKWGRGPYF FT SRDSQWFNGPEFLRHPEEAWPRRDDEIGSTAEDIRPSVLFHFAVDTTLNFE FT RFSSWNRLLRATAYVLRFLFNITKPEQKLRGPLAQPELLSAECTIWKLVQR FT ESYPDEVVTLEQNKSDTQKTPLDRKSNLFKLMPILDEAGLVRQRSRIIAAE FT NVTYDARYPIVLPSKHRAVNLLIEDYHRRYRHGNNETIVNELRQRYTISNL FT RRLVKRVAFSCQLCKIRKARPSNPPMAALPPARLAMNVRPFSYVGLDYFGP FT LLTKVGRSNVKRWIALFTCLTVRAVHLEIAYSLSTASCISCVRRFIGRRGS FT PIEIFSDNGTNFQGAERILREQIGKELSMTFTNTTTKWSFIPPGAPHMGGA FT WERLVQSVKSAMKDAYAEGKLSDEELQTLVVEAESIVNSRPLTYLPLDSEE FT SEALTPNHFLLGASSGAKQSGVGVGESQKHLPDTWRGIQCQLDKFWQRFLV FT EYLPVIRRQPKWFDETRPLQVGDLVLVADGARRSEWVRGRVTQIFEGSDGR FT VRQAAVRTRQGIFRRPTTRLAVLDVHGQGAAPEDAGEHRGET" XX SQ Sequence 6813 BP; 1859 A; 1666 C; 1779 G; 1506 T; 3 other; aaaatcggtg tagcctttcg tktcctgtcg caacattctt caaaattttc atgatgagtc 60 atcactctac caccaacaag aaggatggat gtgctggttg caaccgtccg gacgaagcgg 120 acaatttggt ccaatgcgat gcttgcgatg tgtggtggca cttcagttgt gctggagtca 180 caggatcgat cgccgatcgt tcgtgggtat gtacccgctg caaacgatca tctcgtgctt 240 cgtcgagagt cagcgttacg agtcgatcgt catcgcattt ggcggaaagt atggcccgtc 300 tacgagaacg tcaggaattg gagaagcagc gcgcagaagt agaactggaa aggaagttct 360 tagacgagca acgaaggctg ttggaagctt cgattatcgc cgaagaggaa cgtcggagcc 420 aagcaagtcg cggtgatagc caaaggagag ttcgagactg gatagagaac agcgcggatc 480 gggaaaccgg cgcagcagga ggaaacttct gccagcagct actacgattc agcagcaaaa 540 ttcgattcta cgtcaagagg actccgaggc tgcggcggta aaagatgtgg aaaaggagct 600 accttcagag tttccgtcaa acttagacag ccaacatcac agttcacctc tccgagatag 660 gactattcct gcctttgaga cgtgtgacat tagcgagctt caaccgtttc cggaagtcaa 720 tcagcataac gccaaggacc taatccgcga attacgagtg agactggagc ggtgcttaca 780 gcaatccgag ccaacatcgc ctcagatggc agatttgcag cagcaattaa ggctgtgcag 840 ggaggaaatc gaaggaatgc attcaaccgt gtaccgttta acgacggtgc cgacgcccac 900 aacccagatg gaggtagtac aacccagagc agcgacgtta ggcgctgtac caaagcagcg 960 tacgaacgaa atttcttcag gtaagtgtca tgcctccgaa agccaaaccg ataaagcaca 1020 caaaaaccat gttttacaac cgacatcatt tccaaccatg caaaatcctc taaccgagca 1080 gcagcgtata gctcacgcgc gctcaaatcc gatatttaac aacgttttgt tagaaaagcc 1140 gactgttacc caatgcacac ctctcccctc tacaaaaacc caaccttcgc acccgctacg 1200 tcagcaaacg cagtatgctt tggtttacca accatttgta caacaacagt cattgcgtcc 1260 gcccctagaa cagcctccag tgcagcagcc ttcggtgcag caattttcgg tgcagcagct 1320 tcaggcagca gcctccagtg cagcagcttc gatgcagtag cttcagagcg gcagcctcca 1380 gtgcaacagc cttcagtgca gcgatctttg gtgcagcagc cttcagtgca gcaccttcag 1440 tgcagcagcc tccagtgcag cggccttcgg tgcagcaatc ttcggtgcag cagccttcag 1500 tgcagcagcc ttcagtgcag cagcctttag tgcagcagcc tccagtgcag cagtattcag 1560 tgcagaaatc tccagtgcaa caattgcagc acaacaaccc ccgatacagc aacctcaagc 1620 gcaacaacca ccagagagca atatacagcg caacagcctc ctatacacca gttttcagtg 1680 caaccgtcat cagtgcagga gtcttcaatg cgacagccgt tggtgcagca gtattcagcc 1740 cagcagtatg tagtgcagcc accttcgctt cagcagccgt tgacacagcc accatcagca 1800 cagcaatata ctatgcagca gcctgtggtg cggcagtatc tagcacagca acctttgctg 1860 caacaatctc aagtcggagc ctctccttct ccggtagtgg ttccagacat gaatgatcta 1920 gagagaggga tggacgaacc gcccacgcaa cagcagctaa cgtctcgtcg atcgttggcc 1980 cgagatcttc ctacattttc tggggatcca gccgaatggc cgatattcat ctccaacttc 2040 gaatatacga cgaggacttg cggctacaca aatggagaga acatgctccg cctacaaaga 2100 tgcttgcgag ggcgtgcgct agaaagtgta cgaagccgtt tagtgtttcc ggcagcggta 2160 ccccaagtaa ttgagacgtt acgtatgaga tacgggcgtc cggagttgtt aatcaacgtt 2220 ttgttgcaga aggttaaagc gatgcccagc atacgaggag acaagttgga atcgctcata 2280 gaatatggga tggcggtcca agagctatgc gatcacattg aagcggcgaa tgagagggct 2340 catttgtcaa atcctacact gttgcaggag cttattggga aattaccagc ggatcagaaa 2400 ctgatgtggg cggctttaag cgaggacgtg ccttggtaga tctcgaaact tttagcgaat 2460 atatgtccgg tgtaatgcag gacgcatcaa gtgtggtggt gtacgaaccg gattcgcgaa 2520 aaataggagg aagggagaaa acaaaggggt acgcgaactc gcacattacc aaggttgacg 2580 gaacttcatc ttcgacgcca actcagaaca agcctagtga gtgtctacat tgcgacaaag 2640 aaggccataa acttcgggaa tgccaagcgt ttagaacgct ttccgttgac gatcgatggc 2700 ggcgtatccg aacactgaac gtgtgtcaaa tctgcttgtt cagtcatgga agaagagcct 2760 gccggattaa caatcgtgca atgtaagtgg atgtcagttc aaacatcatc cactactgca 2820 cggaaaatcc accactccat caacacagat tggaaacact cacactcacc agcttctcga 2880 ttcggggtcc tattccgcat cataccagtg acgctacatg gaaagtccgg gaaaatcaac 2940 acctttgctt ttctggacga gggctcttca tcaacgcttg ttgacggtgc tctggtagcg 3000 cagcttggtc tagtggagaa ccaaatccta tttgcttaag atggactggg aacacatcgc 3060 gggtagagaa ggattctcag ctggtgacca tcacaatatc aggaatggag cagaagcact 3120 tatataagtt agtagatgct catacggttc ggaacttaaa tctaccgact caatctttcg 3180 aactagcaga agcagcaaag aagttcgcat acttgaagca gctgccgatc caaagctatc 3240 gcaacgccag accggagatt ttgatcggag ttgataatat cagacttgct gtgccactta 3300 agataaacga aggagatgga tctggaccaa ttgccgttaa gacacgattg ggctggtgcg 3360 tttacggtcg gcaaggtccg cagagtaatg aaggattcag cttccatatt tgtggctgta 3420 cgaaagacga cgaattacac gagaccgtaa aacagttctt cgccgttgag gaaagcggag 3480 taaagtatct cgaaacttct ctcagtgcag aagacaaacg agctcgagag ttgctggaat 3540 caacaactaa gcgagttggt acccacttcg agacaggttt actgtggaag gacgacgaga 3600 tagagttacc ggacagctac ggcatggcgt tgcgaagaca tcagtgcctc caacgtaaga 3660 tggaacgtca acctgctctg aaggaaaata ttaatcgtca aataaaagaa tacgtcgaga 3720 aagggtacgc tcatcgcgca acttctgccg acctggacac tgcggaccca gaaggatatg 3780 gttcttaccc ttaggagcag tgacgaatcc aaacaagccc ggaaaggtac gccttgtatg 3840 ggatgcagca gcgaaggttt ctggagtttc gttgaatagc gtcttgctta aagggccgga 3900 ccaacttaca ttgttgccag ctgtactgtt tcgcttccgg ctgtattccg ttgccgtcag 3960 tgcggatata gagcaaatgt ttcatcaggt cgggattcga ccaggcgaca gaaactccct 4020 gcgctttttg tggagtgaaa attctgatgg accagtggag atatacctaa tggatgtcgc 4080 tacgtttggt gccacgtgct cgccggcatc ggcgcagtat gtgaaaaacc tgaatgccat 4140 ggaacaccta cagcaatacc cgagggccgt cgaagggata ctaaagagcc actacgtaga 4200 tgactatctg gacagttttg gcagtgaaag tgaggcgaag aaggtatcag cagaagtacg 4260 tctggttcat cataacggtg gatttcatct acgaaattgg aggtcgaaca gcgaaagtgt 4320 cacggcaaca cttggtgaat cagcgaagga agacagcaaa aacctttacc tggaatctgg 4380 tgaacatgtg gatcgagtac tcgggatgct gtggtcttca agtacagacg agttgagctt 4440 ctccacctcc atgagcgaag aagtcagaca actccttcgt aacggtaccc gcccaacaaa 4500 gcgacaagta ctgcggtgcg ttatgtcact gtttgaccct ctggggcttt tgtcgttctt 4560 cattatccac gggaaggtgc ttatacagga cctgtggcga gccggtacgg agtgggacga 4620 agcagtaggt gaagaagtat ttgatcactg gattcgctgg acaaaaatga tcgagttcgt 4680 agcatccata aggataccga ggtgctattt cccacgagca actgaagaaa cgtacaaggc 4740 tgctcaacta cacgttttcg ttgacgccag cgaaattgct tactcatgtg cagtgtatat 4800 ccggacggct gcccgagatg gagccttcca atgtgcttta gtctctgcca aaaccaaggt 4860 ggctccgctc aagccaatgt ccattccgag gttagaactg caaggatgtg tgcttggtgc 4920 tcgtcttctc aagttcgtgc aagatagtca ccctatcgca ttttctaaga ggttcctttg 4980 gacggactct acgacagcga ggtcatggat caggtcggat ccgagacgct acaagccttt 5040 cgtagctcac cgagtaggag agatattgga gagtacggat gcaaacgaat ggcggtgggt 5100 gccgtccaag ctcaaccctg cggatgaagc cacaaagtgg ggccgtggac catactttag 5160 cagggatagt caatggttta acggaccaga attcctgcga catcccgagg aagcttggcc 5220 tagaagagac gacgagattg ggtcaacagc tgaagacata agaccttcag tattgttcca 5280 ttttgcggtt gatacgacac tgaacttcga acgtttttca tcgtggaaca gattgctgcg 5340 cgccacggct tacgttttac gtttcctctt caatatcacc aaaccagaac agaaactgcg 5400 tggaccgctt gcacaacctg agctgctgag cgcagaatgt acaatttgga aacttgtaca 5460 gcgagaaagt tatcctgacg aagtagtcac cctcgagcag aataagtccg atacgcagaa 5520 aacacccctt gatcggaaaa gtaatctttt caagttgatg ccaatattgg atgaagccgg 5580 attggtgcgg caacgaagcc gcattatagc tgcggagaat gtgacctacg atgccaggta 5640 tccgatagtt ctaccatcga agcatcgagc tgtcaacctc ctgatagaag actatcatcg 5700 tcgctaccgt catggaaata acgaaaccat cgtcaacgaa cttcgacagc gttataccat 5760 ttcgaactta cgtcgkctcg tgaagcgggt tgcgtttagc tgtcaactat gcaagatccg 5820 caaggctcga cctagcaatc cgccgatggc agcccttcct ccagcccgcc tggcgatgaa 5880 cgttcgtcca ttcagctacg ttggtctgga ctactttgga ccgcttctca caaaagtagg 5940 gcgctcaaac gtgaagaggt ggatcgccct ctttacgtgt cttactgtac gagccgtaca 6000 cctagagata gcgtatagct tatccactgc gtcgtgcatc tcctgtgtac gacgtttcat 6060 cggtcgccga ggatcgccga tagaaatatt tagcgacaac ggaaccaact tccagggggc 6120 ggaacgcata ctccgagagc agatcggaaa ggagttatcc atgacgttta ccaatacgac 6180 cacgaagtgg agcttcattc ctcctggagc tccacatatg gggggtgctt gggaaaggtt 6240 ggtgcagtct gttaagtctg ccatgaagga tgcatacgca gaagggaaac tgagcgatga 6300 ggaattgcaa acactggttg tagaagcaga aagcattgtc aattcgaggc ctctgacata 6360 cttgcctttg gactccgagg agtctgaagc tttgacaccm aatcactttc ttttgggagc 6420 gtctagtgga gcgaaacaga gtggtgtggg cgtaggtgaa tcccaaaaac atctacccga 6480 tacttggaga ggaatccaat gccaactgga caagttttgg caacgcttcc tggtagagta 6540 tctaccagtc atccgaagac aaccaaaatg gttcgacgaa accagaccct tgcaggtcgg 6600 tgacctcgtc ctggttgcag atggcgctag gcgcagcgag tgggtacgag gaagggtaac 6660 ccagatattc gaagggtcag atggtcgagt ccgacaagct gccgtgcgaa cccgacaggg 6720 aatcttcaga cgaccaacga ctaggctggc tgtcctcgac gtacatggac aaggtgctgc 6780 cccagaagac gcaggggagc accgggggga gac 6813 // ID BEL-92_AA-I repbase; DNA; INV; 5834 BP. XX AC supercont1.183; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-92_AA_; KW BEL-92_AA-LTR; BEL-92_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5834 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.183; Positions 915568 909735. XX CC 'CGAAA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 10..5834 FT /product="BEL-92_AA-I_1p" FT /translation="MNRQGPNLNSTEITDCNCAICKGPDHDDVGMVGCDNC FT SQWFHFKCVGVSADVKDISWTCVNCKERPLVAAEDSGQAEPAAQPKVVPNQ FT DAVPGAKPYTEKDPDAEVSAIEELERELEQIEEMRNEQMRRMELEKRVHRR FT RLEVQRELAEQRQQMEREKRQIELEFEKEQLQKAISEEEAFRKKQQAMRDE FT LQVNLDQLRIRQTGNPMQMKAICGPESSRPTVADDVVEIAAMPPAQTRDYR FT GAYSKHSTPKGAEDSLSPVPPKEQFGAPGINEERNVAFDSEEDLPDVPSVS FT QTSARTVRPTSAQLSARQFLAKKLPIFTGRSEDWPMFISSYETANEACGFS FT NVENLARLQECLKGQALEAVRSRLLLPNAVPQIIDSLRMLYGRPEQLLNTL FT LFKVRKAESPKADRLNTFISFGMIVQQLVDHLEATKLRDHLVNPMLIQELV FT EKLPASTKMEWVRYKRQVVGVTLRTLSNFLTEIVRDASEATLFNDSPLPSR FT MAQENRSNKKGKASKECYDGYVHTHDFAGSMSSGNKTVEFQQEKRTPCPMC FT DRTDHRVRNCAEFRKLDPSTRLKAVEEWELCRICLNDHGKAKCKLNIVCKV FT DNCGQRHNTLLHQSRSVLRSDCHTHTQVGLCQPVIFRMVAVTIYNGVREIN FT TLAFLDGGSSYTLVESSLTDRLKIKGQTIPLRVTWTAGMTRLERESQQVHL FT TLSARGSKEKYLIHNVHTVQSLKLPEQALRYAEMANQYGHLQNLPVVDYER FT EPPKILIGLKHLHLFAPLESRIGKPGEPIAVRSKLGWTVYGSQTPEEVTSS FT YVGYHSCDGVTNQELHDLLRSHYALEEPGVSVSLLPESSEDQRARKIMENT FT TVKIGDRFQTGLLWKHDNPMFPDSYPMAQRRMKCLERRLTKDPDLKKNVET FT QIVDYQHKGYCHKATKEELTGVDPEKVWYLPLNVVLNPKKPGKVRLVWDAA FT AAVQGISLNSQLLKGPDFLTSLPSVICQFREHRIGFGGDIREMFHQVKIRP FT EDRQAQRFLFSGQVFVMDCAIFGATCSPSQAMYVKDCNAKQWESMYPEASM FT AVVKKHYVDDYFDSAHNIEEAVQRASEVRFIHSQAGFEIRNWVSNSEEFLQ FT QLGETERSEAVHFNRDKSTETERVLGIVWNPVEDVFKFSTKIRDDLLPYLF FT EGKRPTKRVVLSCVLSLFDPLGLLTPFTIYGRIFIQSLWRTGCEWDEMIDD FT ESARKWALWISRLSVVESVKVPRYYFGDGLLLNYSSLQLHVFTDASECAYG FT CVAYFRILAGGVPRCSLVQAKSKVAPLKPCTIPRLELMAAVLGARMVDTVK FT ESHNLDIGKVFLWTDSRTVHSWIRSDLWRYKIFTALRVGDILNRTMASDWY FT WIPTRLNVADELTKWNNGPQLEPDSRWFKGPSFLYQAEEMWPAQPEIKPNV FT REELRTTFLFHDVKLAEPILEASRISKWSVLVRTVACVLRFISNLRRKRQG FT QPIEVLPVTSKISCLVRKLVPFKRQALRREEFQEAERFLFKIAQGERFSAE FT IRILQQSYSGKDLPKADKTSDLHKLTPVLDEFGILRHEGRTVEAEHLPFEV FT RFPIVLPKKHPITIKLLEHFHQKFGHANRETVVNELRQRFYIPHIRAQLKK FT VMQECVWCKVHKCRPKVPRMAPLPVQRITPNLRPFSFTGVDYFGPLIVTVG FT RRAEKRWVSLFTCLTTRAIHLEVVHSLSSQSCVMAIRRFACRRGMPIEFFS FT DNGTNFTDARTKWNFTPPSAPHMGGAWERLVRSVKEAMRVFDDGRKLTDEI FT LLTTLADAEEMINTRPLTYIPQESAESESLTPNHFIRGLPSTGCDVFKSIV FT GDADALRDNYKRSQWLANKMWKRWLVEYLPSINKRSKWHVDSEPIIEGEVV FT FMADDENRKLWVRGVVEELIRSADGRVRQAMVRTTKGLYRRPVAKLAVPEI FT RSVNSNLTASAKESRGGA" XX SQ Sequence 5834 BP; 1710 A; 1219 C; 1515 G; 1390 T; 0 other; atctcaaaaa tgaatagaca aggacctaac ctcaacagta ccgagataac tgactgcaac 60 tgtgctattt gcaaaggacc ggatcatgac gatgtcggta tggtgggatg tgacaactgt 120 tcccagtggt tccatttcaa atgtgttgga gtttcagctg atgtgaagga catttcatgg 180 acctgcgtta attgcaaaga gagaccttta gttgcagcag aagatagtgg tcaggccgag 240 ccagctgctc aaccgaaagt agttccaaat caagatgcag tccccggagc aaagccgtat 300 acggagaaag acccggacgc tgaagtgtcc gccattgaag aactggagcg ggaactagag 360 caaatagagg aaatgcggaa cgaacaaatg cgcagaatgg agctggaaaa gagagtacat 420 cgacggcggt tagaggtgca gcgcgaattg gccgaacaaa gacagcaaat ggaacgcgaa 480 aaacgacaaa tcgagttgga gtttgaaaag gagcaactac aaaaagcgat ttccgaagaa 540 gaggccttcc gtaagaaaca acaagcgatg cgagatgaac ttcaagtaaa tttagatcag 600 ttgcgtattc gccaaacggg aaatcccatg cagatgaaag ctatttgcgg acctgaaagc 660 tcgagaccaa cggtggcaga cgatgtggtt gaaatagcag caatgccacc agcgcagaca 720 agagattacc gcggtgccta cagtaaacac tcaacgccga agggagcaga ggatagccta 780 tcaccggttc caccgaagga acagtttggt gcgcccggaa ttaatgagga aagaaatgtc 840 gcgtttgaca gcgaggaaga tttgcccgat gttccatcgg tttctcaaac ctcggctcga 900 acggtaaggc caacaagtgc tcaactttca gctaggcagt ttcttgctaa aaagctgcct 960 atattcacgg gtcgttcgga agattggccg atgtttatct cgagctacga gacagcgaac 1020 gaggcctgtg gtttctctaa cgttgaaaac ctagctcgtc tacaggaatg tctgaaaggt 1080 caagcgttag aggcagtccg cagtcgtcta ctgttgccaa atgcggtgcc gcagatcata 1140 gactcgcttc gaatgttata tggccgtcca gaacagttac taaacacgtt attgttcaag 1200 gtgcgaaaag cagaatcgcc aaaagcagat cgtttaaata ccttcatcag tttcggtatg 1260 attgtgcagc agttggtcga tcacttagaa gcgacgaaat tgagagatca tttggtgaat 1320 ccgatgttga ttcaggagtt ggtggagaaa ctaccggcta gtaccaaaat ggaatgggta 1380 cggtacaaaa ggcaagtagt gggtgtgaca cttcgcactt tatcgaattt tctaacggaa 1440 atagttcgag atgccagtga agcaacactg ttcaacgatt caccgctgcc gtctaggatg 1500 gcacaagaga accgctccaa caagaaggga aaggcaagca aagaatgtta cgacggttat 1560 gttcatacgc acgatttcgc aggaagcatg agtagtggca acaagactgt ggaatttcaa 1620 caagagaaaa gaacaccatg tccgatgtgt gacagaactg atcacagagt ccggaattgt 1680 gcggaatttc gtaagttgga tccatctact cgcctgaaag ctgtagaaga atgggaacta 1740 tgtcggattt gtctcaatga ccacggtaag gcaaaatgca agctcaatat cgtttgtaaa 1800 gtagataatt gtggccaacg acacaatacg cttctacatc agtcccgatc ggtgcttcgt 1860 tctgattgcc acacgcacac gcaggtcggt ttgtgtcagc cggtaatatt tcgtatggta 1920 gcggtgacca tttacaacgg cgtacgtgaa atcaacacgc ttgcgttctt ggacggaggc 1980 tcgtcctaca cactggtaga aagttctctg acggacagat tgaaaattaa gggacaaact 2040 ataccactcc gggtaacatg gacagctggt atgaccagac ttgaaagaga gtctcagcag 2100 gttcatctaa cgttgtctgc acgaggatcg aaagagaagt acctgatcca taatgtacac 2160 acagtgcaat ctttgaaact tccagaacaa gcgttgcgtt atgctgaaat ggccaaccag 2220 tacggacatc ttcaaaatct accggtggta gactatgaaa gagagccacc gaagattctc 2280 attggcctaa aacaccttca tctgtttgct ccactggagt cgcgaattgg caaaccagga 2340 gaaccaattg ccgtaaggtc gaaactagga tggacggtat acggatctca aactcccgag 2400 gaagtaacgt cgtcttatgt tggttatcat tcatgcgatg gcgtgaccaa tcaagaacta 2460 cacgatctgt tacgatcgca ttacgctttg gaggaacccg gtgtttcggt gagtttgtta 2520 ccagagtcgt ccgaagatca acgagcccga aagattatgg agaacacgac tgtaaagata 2580 ggagatcggt tccaaacggg actgctatgg aaacatgata atccaatgtt ccctgacagt 2640 tacccgatgg cacaacggcg aatgaaatgt ttagaaagaa gactaaccaa ggacccagat 2700 ttgaaaaaga atgtcgaaac gcagattgtt gactatcagc acaagggtta ttgccacaaa 2760 gctacgaaag aagaacttac cggcgtcgat cccgaaaagg tgtggtactt accactgaat 2820 gttgtactca accccaaaaa gccgggtaaa gttcgtttag tgtgggatgc tgcggctgct 2880 gtgcaaggaa tttcgctcaa ttcgcagttg cttaaaggac cagatttcct cacgtcactt 2940 ccatcagtaa tctgccagtt tcgggagcat cgtattgggt tcggtggtga tatccgagag 3000 atgtttcacc aggtgaaaat taggccagaa gacagacaag cacagcgatt tctgttctct 3060 ggacaggtgt ttgtaatgga ttgtgccata tttggcgcta catgctcacc cagtcaagcg 3120 atgtacgtga aagattgtaa tgctaaacag tgggagtcaa tgtacccaga agcgtccatg 3180 gcagtggtta aaaagcacta tgtggacgac tattttgata gtgcgcataa tattgaggag 3240 gcagtccaac gcgctagtga agtacggttc atacattccc aggcaggctt cgaaataaga 3300 aactgggtat ccaattcaga ggaatttttg caacagcttg gcgaaacaga aaggagtgaa 3360 gcagtacatt tcaacagaga caaatcaacg gaaacagaac gagttctggg tattgtctgg 3420 aatccagtag aagatgtgtt taagttctct acaaaaataa gagatgatct gctaccgtat 3480 ctatttgaag gtaaaaggcc aactaagcgt gttgtactca gctgcgtttt gagcttgttc 3540 gatccgttag gacttctgac accattcacg atctacggca gaatcttcat tcaaagctta 3600 tggagaacag gatgtgagtg ggacgaaatg atagatgacg aatcggcacg aaagtgggcg 3660 ttatggatca gccgtctgtc agtggtagaa tccgtaaaag ttcctcgtta ctacttcggt 3720 gacggtctgt tgttaaacta cagttccctg caattgcacg tattcacaga tgccagtgag 3780 tgcgcttatg gttgtgtcgc gtattttagg attctggccg gcggagttcc cagatgttca 3840 ttagtgcaag cgaaatcaaa agtggctcca ttaaagccat gtactattcc acgattagag 3900 cttatggcag cagtactagg agcccgaatg gtagatacgg tcaaggagag ccacaatttg 3960 gacattggga aggtgtttct gtggactgat tcacgcacgg ttcactcgtg gattagatcg 4020 gatttgtggc gctacaagat tttcaccgca ttgcgtgtag gtgacatttt gaatcgtact 4080 atggcttcgg attggtattg gataccgaca cggcttaatg tagcagatga attgacgaaa 4140 tggaacaacg gtccgcaact agagccagac agtcgatggt ttaaaggacc atcgtttttg 4200 taccaagcag aggaaatgtg gccggcacag ccagagatca agcccaacgt acgtgaagaa 4260 cttcgcacaa cgtttctgtt tcacgacgtg aagttagcag aaccaatttt ggaagcgtcg 4320 agaatatcta aatggagtgt cttggttaga acggtggcgt gtgttttacg tttcatctca 4380 aatcttcgta gaaaacggca ggggcagccg attgaagttc taccggtaac tagcaagatt 4440 agctgtcttg tcagaaaatt ggttccgttt aaaaggcaag cattgagaag agaggaattc 4500 caagaagctg aacgttttct ttttaaaatt gcacaaggtg agcgattttc agccgagatc 4560 cgtattctcc agcaaagtta cagtggcaag gatctgccga aggcagacaa gactagtgac 4620 ctccataaac ttaccccagt tctcgacgaa tttgggatac ttcgtcacga ggggcgaaca 4680 gtggaagctg agcatcttcc cttcgaagtg cgattcccga ttgttctgcc aaagaagcac 4740 ccgattacga tcaagctgtt ggaacacttt catcagaaat ttggccatgc aaatcgagaa 4800 acagtggtaa atgaattgag gcaaagattt tacatcccac acatcagagc tcaactgaaa 4860 aaggtgatgc aagagtgcgt gtggtgcaaa gtacacaaat gtcgcccgaa ggttcctcga 4920 atggcaccac ttcctgtcca acgtataact ccaaatttgc gtccgttcag cttcactggt 4980 gtggattact ttggtccact gatagttaca gttggtcgtc gtgcagagaa gcggtgggta 5040 tcgttattca cctgcttgac aaccagggct atacacttgg aagttgtgca tagtttgtct 5100 tcacaaagct gcgtcatggc catccgtcgc ttcgcatgcc gacgaggtat gccgatagag 5160 tttttttcgg acaacggaac caacttcact gatgccagga ctaaatggaa ctttactcca 5220 ccaagtgcac cgcatatggg cggagcttgg gaaaggttag tgcgttcagt aaaagaggcg 5280 atgcgtgtgt tcgatgacgg gcggaaatta acagacgaga ttctactgac gactttggca 5340 gacgcggaag aaatgataaa cacacgtccg ctaacataca tccctcaaga gtcggcagaa 5400 tctgaatcgc tcaccccgaa ccacttcatt cgaggacttc cgtcgactgg ttgcgacgta 5460 ttcaaatcaa tagttggaga tgctgacgct ttacgtgata actacaagcg gtctcaatgg 5520 ttagcaaaca agatgtggaa gcgttggctg gtagaatacc ttccctcgat aaacaagcgg 5580 tcaaagtggc atgtagactc ggaaccgatt attgagggag aagtcgtttt tatggctgat 5640 gatgaaaatc gcaagctttg ggtacgagga gtcgtagaag agctgattcg tagtgctgat 5700 ggacgtgttc gacaagcgat ggtacgtacg acgaaaggtt tgtaccgtcg gccagtggca 5760 aaactagcgg taccagagat tcgttcggtg aactctaact tgacggcgtc tgcgaaagag 5820 tcacggggag gggc 5834 // ID Gypsy18-LTR_Dya repbase; DNA; INV; 311 BP. XX AC chrU; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy18_Dya; KW Gypsy18-I_Dya; Gypsy18-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-311 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1111-1111 (2009). XX DR Genome; chrU; Positions 4988592 4988902. XX SQ Sequence 311 BP; 110 A; 46 C; 65 G; 90 T; 0 other; tgtaaggtag cttgtcgttg atctcggcct aaaagtaggc acttaattag gtcgttcgta 60 aggtcgggaa aaatgttaat taggctgtca tgaagaacat ttctccaact gcatttttgg 120 agttcagaca aggggcatta attgagttta aaccagcgag taaggaagtg cgtccgagct 180 atccaaaaat aaaatttagt gtctctcaaa attacaaaaa aaaaaagaga atttaaccaa 240 aatttaaatt tgtggatcga gaagataacc cagtgaatcg gaaaatttat atttcgttaa 300 ctttagtgcc a 311 // ID Kolobok-20_HMa repbase; DNA; INV; 2728 BP. XX AC . XX DT 15-SEP-2009 (Rel. 14.09, Created) DT 15-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Kolobok-type DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-20_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2728 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1919-1919 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 424..2211 FT /product="Kolobok-20_HMa_1p" FT /translation="MANKNNKQRKKLKRKFSGNQHTNRKIESLDKTSLQDE FT KNNPSLSLCKKSSITPNESNCNLFIELQIFKCVIDMVGICPDCTSKINLKL FT INEKHQGFANNMFLQCYKCDWTHEFYSSSFISKKDNTELRGKKKFDINTRL FT VIAFREIGHGFEAIEKFSTFMNMPSSMSKNAYNNINKSVYQAYADCANESM FT KKAAKEVRQIIKPNSNAVELLESDISIDGSWQKRGYNSLNGVVTGIARENQ FT KVLDVQVFSEFCYGCSKWEDKFGSAEYIDWKAKHICQINHRTSSGSMESQG FT AVNIFASSIEKYNLRYTHYIGDGDTDSFEQVLDSKPYGEIAPKKLECIGHV FT QKRLGTRLRKLRSDTKXXKLADGKSISGKGRLTDKXINXLQNYFGMAIRQN FT IGKLYAMKKAVFAVFRHFTDIKNLDVRHQFCPRSAISWCKYQSDKITGLSK FT YNSKGIIPEXIKKEIEKIFXDLSSDELLSKCLQGXTQNANEXXNQILWRKC FT PKNTFVXKDILEMGTFSSVVSFNDGFLALDNVLKNLGLTPGKYFFDRALVF FT DSKRVKNIQKKSTXIVKNKRKKLRALRKGFIDKEKHKEGGDSYSKGSF*" XX SQ Sequence 2728 BP; 995 A; 353 C; 433 G; 931 T; 16 other; ggtggtcgtc tagtaaaaaa atatgtaaaa aatcaaattt ttgttattgg tgtaatttaa 60 aagaaaaatt tttgctcgtt cagaaaatat atacttttac acatgtttct taattagatt 120 ttttcaaaat tcactttctt taaagacatg tattagtttg ttcccttagc aacgtcctta 180 gcaacaacat ttttaggttg ttttgggcca aaattattta ccaaacttga taatataaac 240 tttcttaata atcgagaaac ttaaagcttt tgtatagtgt aaacagagct ttaaattata 300 cggaacttca attttaacag ctgacagaac tatttgaatt attgctgttt ttttattaga 360 gcatggtgtg ttttgtttta tctaaaactt ttatgtgaaa ttttatttaa taactttgtt 420 tagatggcta ataaaaataa taaacaaaga aagaaattaa aaagaaaatt tagtgggaac 480 caacacacta atagaaaaat agaatctctg gacaaaacaa gtcttcagga tgaaaaaaac 540 aacccatctt tatccttgtg taaaaaatct tctattacac caaatgaaag caactgtaat 600 ttatttattg aactacaaat atttaaatgt gttattgata tggttggtat ttgtcctgat 660 tgcacctcaa aaattaattt aaagcttatt aatgaaaaac atcaaggatt tgcaaacaat 720 atgtttttac aatgttataa atgtgattgg acacatgagt tttactcttc atcattcatt 780 tcaaaaaaag ataacactga acttcgtggg aaaaaaaaat ttgatattaa tactcgttta 840 gttattgctt ttcgtgaaat tggtcatggt tttgaagcca tcgaaaagtt ttccactttt 900 atgaatatgc cttcatcgat gtcaaaaaat gcatacaata atattaataa gtctgtttat 960 caagcatatg cagactgtgc taatgaaagc atgaaaaaag cagctaaaga agtgagacaa 1020 attataaaac ctaattcaaa tgcagttgaa ctactagaaa gtgatatatc aatcgatgga 1080 tcatggcaaa aaagagggta taattcttta aatggagtag ttactggcat agctcgtgaa 1140 aatcaaaagg ttttggatgt tcaggttttc tccgaatttt gttatggttg ttcaaagtgg 1200 gaagataagt ttggatcagc tgaatatatt gattggaaag caaaacatat atgccaaatt 1260 aatcatagga catcgtcagg gtcaatggaa tcccaaggcg cggttaatat ctttgcaagt 1320 tcaattgaaa aatataattt gagatatacg cactatattg gagatggtga cacagattcc 1380 ttcgaacaag tgctagattc aaaaccatat ggtgaaattg cacccaaaaa attagaatgt 1440 ataggacacg ttcaaaagcg ccttggaacr agattaagaa agcttcgcag tgatacgaaa 1500 tstawaaaac ttgctgatgg aaaaagtatt agtggaaaag gaagattgac tgataaastc 1560 attaacamac tkcaaaacta ttttggtatg gcaatcaggc aaaatattgg caaattgtat 1620 gctatgaaaa aagctgtttt tgcagttttt cggcacttta cggacattaa aaatytagac 1680 gttagacacc agttttgtcc tagatctgca atatcatggt gcaaatatca aagtgataaa 1740 ataacaggct tgtctaaata caattcaaaa ggtattattc ctgaagycat taaaaaagaa 1800 attgaaaaga tttttaytga tttgagttct gatgaacttt tgtcgaaatg yttacaaggc 1860 maaactcaaa atgctaatga akcattkaat cagattctct ggcgraaatg cccaaaaaac 1920 acatttgttk caaaagatat actagaaatg ggtacgtttt catcagtagt aagttttaat 1980 gatgggtttt tagctctaga taatgtcttg aagaatcttg gtttaacacc agggaaatat 2040 ttttttgatc gtgctttagt attcgatagt aaaagagtca aaaatataca aaaaaaatca 2100 acaratattg taaagaacaa aagaaaaaag ttaagagctt taaggaaagg atttattgat 2160 aaagaaaaac acaaggaagg aggagatagt tattccaaag gatcatttta acttttttat 2220 ttttattttg ttagttttct ctgtttttga tatttcagac ttaaaacaag catatctcag 2280 cttctatgaa ttatttttga atgaaatttt cagtggttgt tcaactgtat ataagttagt 2340 atgtgaacca gaatctaatt aattctttga gtattttaag aattatggga tgttttttgt 2400 tgaattttac ccctttttta atttccccaa atacgcagag ctgccattat ggatctatac 2460 ataattataa gtttttactg gttcacatac taattttatg tatatcctgc atccaaacca 2520 aacttcatgc aaaatgaatg tacagataac tagaactctt tgttttaaga aagtgatgtt 2580 ttaggcattt ttttgctgat tcagcattaa aaaacgtaaa attttttttt tttttttgtt 2640 tcataattca aatataaatt ctttatgttg aaaaataaaa aaactctccc acaaattgca 2700 ttttttattt ttttactaga cgaccacc 2728 // ID Loner_Ele2 repbase; DNA; INV; 6333 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Loner non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; Loner; KW Loner_Ele2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6333 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6333 RA Kojima K.K. and Jurka J.; RT "Loner non-LTR retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (07-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >99% identical to consensus. The consensus CC is ~100% identical to the original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 849..2297 FT /product="Loner_Ele2_1p" FT /translation="MGVDDDGGGTSYPLTESSNVLNQQNTNTSMPNPCVSP FT QSDVSQMDTNILPSPTSPRLKAYPPDSGGPFVVFFRSKGKRLNTIQISKDL FT TKRFSSVVAIDTVGSGKLRVTVSDRKQANEIVACELFTREYQTYLPSHRVE FT IAGVVTEGSMTCEELMQGCGRFKNPSLPSVPILECKQLHSVSLVGEKKIYS FT HSESFCVTFSGSALPNFLVIGKLRLPVRLYVPKVMNCTNCKQLGHTAQYCC FT NKPRCASCGERHVDGACSIPPKCVYCNESPPHALENCPTYVRRQQHQRRSL FT EQRSRRSFAEMLKKAAPSAESQNIYSSLSLDDQGSDSEVGEGISFVFKGSS FT RKRVRLQRPTKKPRNLSSSDDPPAMKNTKPVAKVSKLSPPGFKLQEERDFP FT PLPGKSKIPNIPKFQTAQPKESSHSEPLGQPQGVPMFTLSGIVDIILNFFN FT ASDPVKNIVKGLLPCVTPLLKQLASRMPLLATIISFDG" FT CDS 2293..5985 FT /product="Loner_Ele2_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDKLSTEVEDMISVLHWNCRSIMPKLDSFKFLVSNLQ FT CDVFALSETWLTPDVTLPFHDFNIIRLDRSDSYGGVLLGIKKHHSFYRVDL FT PPMSGIEVVACQVTIRGKSLSVASIYLPPRSAISRRDLSHICSVMPEPRLF FT IGDFNSHGTGWGELYDDHRSTLIYDLCDDFNMTILNTGEVTRVAPPAQDGS FT PRDSRLDLSICSSSLSLDCSWRVIQDPHGSDHLPIVVSISNGTQQSSSIDL FT AYDLTKHIDWGKYAETIIDGEQAVEVLPPLEEYRFLSELILNSALQAQRRP FT VPGPSVRKKPPNPWWDEECKALYREKSAAFKDFRKRGSIDNYERYSSLERK FT FKSLVKAKKSGYWRHFVEGLSRETSLRTLWTVGRRMRNTSSVNEDRESSPR FT WIIDFAKKVCPDSVPVDRVRRDIPVDRDAMDRPFSMVEFSLALLSCNNSAP FT GMDRIKFNLLKGLPDVAKRRLLNLFNHFLECNIVPDDWRQVRVIAIQKPGK FT PASNYNSYRPIAMLSCIRKLLEKMILFRLDKWVESNGMLSDTQFGFRRAKG FT TNDCLALLSSEIQLAFAQKEQMGSVFLDIKGAFDSVCVDVLSDKLHASGLS FT SVLNNFLYNLLSEKHMSFTHGNLSVSRISYMGLPQGSCLSPLLYNFYVRDI FT DDCLMENCSLRQLADDCVVSVTGPTAAELQGPLQDTLDNLSTWALKLGIEF FT SAEKTEMVVFSKKHKPAKFPLTLMGKTITHSLSSKYLGVWFDAKCTWGKHI FT VYLTQKCQKRINFMRTITGTWWGAHPEDLIKLYQTTILSVLEYGSFCFQSA FT AKTHMLKIQRIQYRCLRIALGCMNSTHTMSLEVLAGVQPLTDRFAELTLRF FT LIRCEVLNPLVIENYEKLLEHNPQTRFMSVYYWYMTLEVSPSSVNTNRDNF FT LEFDCSSVVFDLSMKQDIHGIPDHFRSKFIPPMFASKFGHVSESRQFFTDG FT SKMDDITGFGVYNVSHSASFMLQKPCSVYVAELAAIHYTLEYISSLPPEHF FT FIFSDSLSSLEAVRSMKPVKHSAYFLNRIRQALSALSIRSYTITLAWVPSH FT CSIPGNEKADSLAKVGAKEGDVYERQIAFEEFFALARRETLISWQQKWRDG FT EMGRWLHSIFPQVSKKPWFKGLGMSRDFIRVMCRLMSNHYSLSAHTHRIGL FT SESNLCVCGVAYQDIEHVVWGCVEHRGVRSELTETLRIRGKQQKPVREVLA FT GLDLEYMNLIHLFLKRINIRV" XX SQ Sequence 6333 BP; 1592 A; 1493 C; 1438 G; 1810 T; 0 other; tgcgagacag tcgatagttc tcgtgtgtcc gggttcaccc ccgtttttcc aaatctttgg 60 tatttcgacg cctattgcaa agtgttttta gtgcagtgaa gagccttagt tcggcttgga 120 agcggaacac ttttggcgaa gccattgtgt tgccgtttaa tcgagttttg cgcaaaagta 180 ttgaacgttt tcgccattgc gtggttattt tccgatcacg tgtatcgtag aagacagctt 240 gcacctcgtc ccgtgtgcta cctgagtgat catcgtttgt ggtgtgcata tagcgtgcac 300 agcggtgcgc tctccacacc gacagtggtc accatcgtta tcgacgcctg tgttggttgg 360 tggcaggcag tgagagacca accacacata caccgtacca gcgaaccagg aaccagaccg 420 tccttttttg atcacgtgta tcatcgaagg cagctcgcac cccgtcccgt gtgctacctg 480 tgtgcagtgc agagttcatc agtgcgatca tcgtctgtgg tgtgcgtata gcgtggacag 540 cggagatcaa ccacacatct acgcgtccca gcgaatggtc gtacgttact tgctacctct 600 ccccaagcgt agacgcccat cggttacagc atcgacgacg ccagcgtgga tagagtaaaa 660 gataagtacc tctcgttgtt cttctcctct tcacgtcttt tttgattagt ttttactgcg 720 tgtgaaagtt ttccctcttc cctttcccct ttgtatgttt ttttttttca gtttgattca 780 atttccattt ctgtgaatag tgttagtttg agttattcgt gcgccctttg cggtgttaag 840 ctaccgcaat gggtgtagat gatgatggcg ggggaacgtc gtatcccctc actgagtctt 900 cgaatgtgtt gaaccaacaa aacacaaaca cctctatgcc caatccatgt gtgtcccctc 960 aatctgatgt gtctcaaatg gacaccaata ttctaccatc tccgacgtcc cctcgcctca 1020 aggcctaccc gcccgactct ggtgggcctt ttgttgtttt ctttcgatcc aaaggcaaac 1080 ggttgaacac tattcagatc agtaaggatc tgactaagcg attctcttcc gttgtcgcca 1140 ttgacacagt gggttctggt aagctgcgcg tcaccgtcag tgaccgtaaa caggccaatg 1200 agattgtggc ctgtgagctc tttactcgtg agtaccaaac ttatctgcca agccatcggg 1260 ttgagattgc gggagtggtc accgaaggca gtatgacctg cgaagaattg atgcagggtt 1320 gtggtcgctt caaaaaccct tctctcccat ctgtccccat actggaatgc aagcaattgc 1380 attccgtatc cctggtggga gaaaagaaga tctattcaca ttctgaatcc ttctgtgtga 1440 ccttttccgg atctgctttg ccgaattttc ttgtaatcgg caaacttcgt ttacctgtgc 1500 ggctgtacgt accgaaggta atgaattgca caaattgcaa gcagctgggc catactgccc 1560 agtattgctg caataaacct cgctgtgctt cttgcgggga gagacatgtg gatggtgcat 1620 gtagtatacc gccaaaatgc gtttattgca acgaaagtcc accacatgcc ctcgaaaact 1680 gcccgacgta cgtacgccgg cagcaacatc agcgacgatc attagagcag cgctctcggc 1740 gtagttttgc cgagatgttg aaaaaggctg ctccgtctgc tgaatcccaa aatatctact 1800 cttctctgtc tcttgatgat cagggctccg actctgaggt tggtgaagga atttccttcg 1860 ttttcaaggg ttcatcgagg aagcgtgtga gactccagag acccacaaaa aagcctcgga 1920 atctgtctag cagtgatgac ccaccagcca tgaaaaatac taagcctgtc gcgaaagtct 1980 caaagctttc acctcctgga ttcaaactcc aagaggaaag agactttccg ccactaccgg 2040 gaaaatctaa aatcccaaat attccaaaat ttcaaactgc tcagccaaaa gaaagttcgc 2100 attcggagcc cctagggcaa cctcagggcg ttccaatgtt tacgctttct ggcattgtag 2160 atatcatcct taatttcttc aatgcttctg acccagtgaa gaacattgtc aaaggattgc 2220 ttccatgtgt gactcctctt ttgaagcaac tggcttctag aatgcccctc cttgcaacga 2280 tcatatcttt tgatggataa attatccaca gaggtcgaag atatgatttc tgttctacac 2340 tggaactgtc gtagtattat gcctaaatta gatagtttta aatttttagt tagtaattta 2400 cagtgcgatg tttttgcgtt atcagaaaca tggctaacac ctgatgtgac ccttcctttc 2460 catgatttta atattattcg cctggatcga tccgactcat atggaggagt gcttttgggg 2520 atcaaaaagc atcactcgtt ctacagagtc gatctgccgc cgatgtcagg cattgaagtc 2580 gttgcatgtc aggtgactat ccgaggcaaa agcctcagtg tcgcctccat atatcttcct 2640 ccgagatcgg cgatatctcg cagagatctc tctcacatct gctctgttat gcctgaacca 2700 cggttgttca ttggggattt caactcccac ggaacaggct ggggggaact gtatgacgac 2760 caccgatcaa cgttgatata tgacctctgc gacgacttca atatgacaat tctcaacact 2820 ggggaagtta cacgagtggc acctccagct caagatggaa gtcctaggga tagccgttta 2880 gacctctcaa tatgttcaag ctcgttatcg ctggattgtt catggagggt aatccaggat 2940 ccccatggta gcgatcactt gccgatcgta gtttcaattt ccaatggaac acaacagtct 3000 tcatccatcg atctcgccta cgacctcaca aagcacatag actggggaaa gtacgcggaa 3060 acgatcattg atggagaaca agcggtagaa gtacttcctc cgttggaaga gtatcgcttt 3120 ttatccgagt tgattctcaa cagcgctctt caagcccaac gccgtcctgt gcctggtccg 3180 tcggttcgca agaagcctcc caatccgtgg tgggatgaag agtgtaaggc actctatcgc 3240 gagaaatccg ccgcgtttaa agacttccgg aaacgtggtt caattgacaa ctatgagcgc 3300 tattcttccc ttgaacgcaa gtttaaaagc ttggtcaaag cgaagaaaag cggttattgg 3360 cgtcattttg tggaaggatt atcacgtgag acctcgttga gaacgctttg gaccgtcgga 3420 agaagaatgc gtaatacatc gtcggttaat gaagatcgag agagctctcc tcggtggatt 3480 atcgatttcg ctaagaaagt ttgtccggat tccgttcctg ttgatcgcgt gcgacgagat 3540 attccggtgg atagggacgc catggataga cctttttcga tggttgaatt ctcacttgct 3600 ctcctttcat gtaacaattc cgctccagga atggatcgaa tcaagttcaa tttgcttaag 3660 ggcctacctg acgtcgcaaa gaggcgccta ttgaacttgt ttaatcactt tctggagtgt 3720 aacattgttc cggatgactg gaggcaggtg agagtaatag ccatccaaaa acccgggaaa 3780 cccgcgtcga actataattc gtaccgccca attgcgatgt tgtcgtgtat tcgcaagttg 3840 ttagagaaga tgattctctt tcggctggat aaatgggttg aatcgaatgg catgctgtca 3900 gatacacagt ttggttttcg cagagccaaa ggaacgaacg actgtcttgc gctgctttct 3960 tcagaaattc aacttgcctt tgcccaaaag gaacagatgg gttcagtgtt tttggatatt 4020 aagggtgctt ttgactcagt ttgtgttgat gtcctttcag acaaactcca cgcaagtggc 4080 ctttcatcag ttttgaacaa ttttttgtac aatttgttgt ccgagaagca tatgagtttt 4140 actcatggca acttgtcagt ttcacgaatt agctacatgg gtctccccca gggatcatgt 4200 ctgagtcccc ttctttataa tttttatgtg agagacattg atgactgtct catggaaaat 4260 tgctcgttaa gacagcttgc agacgactgt gttgtttctg tcacaggacc aacagcagcc 4320 gaactacaag gacccttaca agatactctg gacaatttgt ctacttgggc cttaaagctg 4380 ggtatcgaat tctctgcgga gaaaactgag atggttgtct tttcaaagaa acacaaacct 4440 gccaagtttc cgcttacact catgggtaag acaatcactc atagcttgtc ttctaaatat 4500 ctcggggtct ggttcgacgc caaatgcacc tgggggaaac acattgtgta tctgacacag 4560 aaatgccaaa aacgaatcaa cttcatgcga actataaccg gaacatggtg gggagcccat 4620 ccggaagatc tgatcaagct gtaccaaaca accatcttgt cggtcttaga gtacggtagc 4680 ttttgctttc aatccgcggc gaaaacacat atgctgaaga tacagcggat tcagtaccgc 4740 tgtcttcgca tcgcgctagg ctgcatgaac tcgactcaca caatgagttt agaggtactt 4800 gcaggagtac agcccctgac agatcgtttc gcggagttaa cactcaggtt cctcatccgt 4860 tgtgaggttc tcaatccatt ggttattgaa aactacgaaa agctgcttga acataaccct 4920 caaactcgtt tcatgagtgt gtactactgg tacatgacgc tggaggttag cccatcttcg 4980 gttaacacca atcgtgataa cttcctagaa ttcgactgtt cctctgtagt atttgatttg 5040 tccatgaagc aagatatcca tggtatacca gatcactttc gttcaaagtt tatccctccc 5100 atgtttgcaa gtaaattcgg gcatgtcagc gagagccgac agttcttcac tgatgggtca 5160 aaaatggatg atatcactgg tttcggtgtt tacaacgttt ctcatagcgc ctccttcatg 5220 cttcaaaaac catgttctgt atatgttgct gaactagcag ctatacatta caccttagag 5280 tacatcagtt ctctcccacc cgagcacttc ttcatttttt ccgacagttt aagttctctg 5340 gaggctgttc ggtcaatgaa gccggtgaag cactcagcgt acttcctgaa tagaatacgc 5400 caggcattga gtgctttgtc aattcgctct tacactatta ccctagcttg ggtcccttcg 5460 cattgctcca ttcctggcaa tgagaaagcg gactctctgg ctaaggtagg cgctaaagaa 5520 ggcgatgttt acgagcgtca aatcgccttc gaagaatttt ttgcattggc ccgtcgggag 5580 accttgatca gctggcaaca gaaatggaga gacggagaaa tgggtagatg gctgcattcc 5640 atatttccac aggtgtcgaa aaagccatgg tttaaggggt tgggcatgag ccgtgatttc 5700 attcgtgtca tgtgtcggct tatgtccaat cactattcgt taagcgcgca tacccaccgt 5760 attgggctct cagaaagcaa tctctgtgtt tgcggcgtgg cttaccagga tatcgaacat 5820 gttgtgtggg gatgcgttga gcatcgtggc gtcagatctg agctgactga aactctccgg 5880 atccgaggaa aacagcagaa acctgtcagg gaagtgttgg cgggccttga tttagaatat 5940 atgaatttaa tccatctgtt tttgaagcgt atcaatatcc gggtttgatt gtgttcgttc 6000 cctgtctttc tttttcccat tcaaattgtc ctcttctcgt cgtgtctccc acaaaccgtt 6060 tctgtttagt ttcattacag atctggacat cctgtccctt caagcttcat cagcttagta 6120 gtctaagtag cttaagaaat tccaaaaaat cctttccttc cctgtttcaa atgtatccac 6180 taaccttaaa acttccgcaa actaacatag tttttaaaat aagttcaaat ttcaaaatgt 6240 aaataatgta aaaaaaagaa aagatttcgg ctctgttatg ccctacggcg cctgagcctg 6300 ccaaataaac gagataagta aaaaaaaaaa aaa 6333 // ID I-73_AAe repbase; DNA; INV; 6820 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-73_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6820 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1344-1344 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 2953..6639 FT /product="I-73_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASNNCSNRSQSTIVTDSHHRSKPLALQWNINGFYNN FT LVELERLVHDHAPLVIALQESHRATPTAMNATLGKKYLWTSKFGRNIYHSV FT AIGVSIELPYESVTVNSDLPLVAVRITWPFAVTIASLYIPNGRIPDLKTRL FT MEAFDTIQGPIIILGDGNGHHETWGNNSNNIRGRIILDFTTDMNLSILNDG FT SPTFWRGQQETAIDISLASSTIANRLMWKIDTDLSGSDHFPIWIHTNNVTP FT ETSRRPRWRYNQADWSAFQLAIANKMESNPTSIEELSNNIYEAAVETIPRT FT SSTPGRRALHWWSDEVKKAVKKRRKTLRAAKRLPFDHPYKESAMKAYREAH FT YACRQLIREAKEGSWSTFLDGINDSQSSTELWSRVNAIQGKRRTRGIALKI FT DNRTSREPSIIVDALADFFADLSSHRKYPHNFRVKHPIPKVELYSFVVPPD FT RGQVFNQPFSINELNFALNKAEGQSAGPDEIGYPMLKNLPFEGKVALLENF FT NKLWTTGTFPHSWKTSFVVPIPKHTKSAQDVEHYRPIALTSCLAKIIERLV FT NRRLIEHLESNRLIDPRQHAFRAGFGTGTYFATLGQVLNDALQKGDHIEMA FT SLDLAKAYNRAWTPEILNQLVTWNVTGNMLNFIKNFLSDRSFRVIIGNCQS FT KLVNEETGVPQGSVIAVTLFLVGMNSVFKFLPKGVYIYVYADDILIVVTGK FT HPKALRRKLQTAVNAIVRWTSKVGFELSAEKCTRTHICSGNHPVLRKPIMI FT DGKAIPTKKAVKILGVTFDRHLNFRQHFDLVKENCKSRVNLIKSIANKRTR FT SDRKIQMRVADAIICSRLFYGTEITCHAFEDLITRLGPVYNNTIRAISGLL FT PSTPAMSACIEAGALPFKYKAAIAISTKVVSFLERTKSRAATTYITNQANQ FT LLNFTANISLPLVAKRNRFGAQSWQAKEPRIDSYIKSNLNRECLPEVAQAH FT FRKRVRTEYPFTETRYTDGSKTSGANSRRVGFGIYSTSLEEAYGLTEYCSV FT FSAEAAAIFQAASQPCDGPLLIVTDSASALSALQTTTNTHPWIQATQELIS FT ESRRITFMWIPGHKGITGNEEADRLANLGRNERPLIRTTPAADIKLWLKGK FT IKDAWDSEWAQEKECFTRKIKNTTDRWHDLPNRWDQRVLSRIRTGHTRISY FT NFGGSTGFRKQCEICRVHNSVEHFICCCPSMEHLRLKFGIYSIRSALQNER FT NSESSLITFLKNAGMYKAI" FT CDS join(557..2122,2079..2948) FT /product="I-73_AAe_1p" FT /translation="MAASSRGPTPSSWGDGPSHALNPSQIDGSYTGATLPP FT YMDNEQNYGALLLLRISGVDRPLPNVPFIIRKSVQQYVGGRIEGAFPEANR FT ATYALKVRSLRQFNRLLTMSQLIDGTPVHITEHPTLNSTRCVVSCRDVINV FT TDSELLEELKEQGIKEVRRITRSVGGERVNTPSIILTFRGTNRPDHIDFGY FT IRCRTRPYYPSPMQCFNCWCFGHTKLRCQSKSATCGKCSGDHPIPEDKTCP FT NAIFCKQCQTNDHAVSSRSCPQYKVESTIQRVKVDQGLSYPAARRMVEKDL FT GTKSYASTVEASNRDDLGLLNSKIDYLTTLVSNKDSEIADLRAALAERNTP FT SASARSQIDELKTIVANQAKQIETLTNQLSAFLSMVMPAGPSFTTTSSNTP FT VGIAVFPARHATEPTLPTSFVNDQPAVKTQQQIDDLPIGIISDSDSATPDH FT SPNLKDTPRPTRFTKSQSVPKHPRSGTPIPNSSKPSPSPNKTPNKRSINRI FT EPANIQQQKRVKHKSASEGSSAIAKRNIKAPRKAPVLSLNASITCLFSIIS FT LSPPPLDKTASKGSRNTPPNYRRKPERSHIHNKNARREIDGPEPLQSKQVG FT PPQHPATHQCLRLCLDSRGASVPEAIPQPESLDRPRRLGRVPDEEIGASDS FT QGVDRPEAKDPPERSNRLRHPQDEKVIRSYNMKNPTMYPSSLGPFSADVFP FT TPELEDSPWRWEEIGRGTDKGWLSHTPRKDDEGIEAGTEKVEGLKSKKDTT FT SENQRTNEEEKEKIFGGHPGLLSSGGKSHLVLLRLSSLQLSVAFSFNTRAS FT SRWILSPNFY" XX SQ Sequence 6820 BP; 2143 A; 1686 C; 1476 G; 1514 T; 1 other; gttttgttaa aattactttg wtaaaagtcc cccttatccc accattttca gggtgagggg 60 gcagtgagat ctataataga actcatatgc ggtgctaaag cgctttcgaa aagccaaaga 120 gggcaaaaaa cctctgtgat tcaaaactcc cgtgaatcct ctctccgtcg gttcgcgctc 180 gtgagtagtt tcaaacaaca aacagtgcgg catcgataac ccgccgaacg tacttgaaga 240 caaaaagggt aaagagagtg taaaaagttc ggtccgaaat atcaggtgtt gtgttgtgtt 300 gccaaagagc gtgagaacca tcaaagcgaa gagcaaaata gaagcgaaac gagaaaaagt 360 gaaccgtggc attatctctt gagataccca gtgattagta aacactcacc aatctatacg 420 tgagacagtg aagaacatta tcgtgtagtg gttcccaaac aatttctccg gcgattcgta 480 aaaacgtgca atatctttgc gactcatcgt tcataaacat actgcaagtg ggtggtggtg 540 gtgactgaag gccgcaatgg cggccagttc tcggggacca actccttcgt catggggaga 600 cggcccatca catgcattga atcctagcca aatcgatgga tcgtacacag gggccacatt 660 acctccttac atggataacg agcaaaacta tggtgctttg cttttgctta gaatctctgg 720 agtcgaccgc ccattaccaa acgtaccttt tattatacgg aagtctgttc aacaatacgt 780 tggtgggcgc attgagggtg cattcccaga agcgaatagg gccacctacg cgttgaaggt 840 gaggagcctc aggcagttca accgtctatt aactatgagt caactgatcg atgggacccc 900 cgtacatatt accgaacatc caacgcttaa ctctacacga tgcgtagtaa gttgccgcga 960 tgttatcaat gttactgact ctgaactctt ggaggagctc aaagaacaag ggattaaaga 1020 agttcgcaga ataacacgca gcgtaggagg agagagggta aacacacctt cgatcatcct 1080 aactttccgt ggtacgaatc gtcccgacca catcgatttt ggatacattc gatgccggac 1140 taggccttat taccctagtc cgatgcaatg ttttaactgc tggtgttttg gccacacaaa 1200 actacgttgt cagagcaaat cggccacgtg tggaaagtgt tctggagatc atcccatccc 1260 ggaagacaag acctgtccca acgcaatctt ttgtaaacag tgccaaacaa acgatcatgc 1320 tgtctcgagt cgctcatgtc ctcagtacaa ggtagaaagc actatccagc gtgtaaaagt 1380 ggaccaaggg ctatcatatc cggcagcccg tagaatggta gaaaaggatc tcggaactaa 1440 atcctatgct agcactgtag aagcgtcgaa cagagatgac ctaggacttc tcaactcgaa 1500 aatagattat ctcaccaccc tcgtttcgaa taaggatagt gaaatagccg acctgcgcgc 1560 agctctggct gaacgaaata ccccatcagc atctgcgaga tctcaaattg atgaattgaa 1620 aacaatagtg gcaaatcaag cgaaacaaat cgaaactctt accaaccaac tttctgcatt 1680 tctgagtatg gttatgcccg ccggaccttc ttttacaaca acatcctcga acacacccgt 1740 tggtattgct gtctttccag ctaggcatgc aacagaaccg accctcccaa catctttcgt 1800 aaacgatcaa ccagcagtca aaacccagca acaaatagat gatctgccaa taggaattat 1860 ctccgattcc gattccgcca cgccggacca tagccccaat ttaaaagata ctccccgtcc 1920 aacaagattc acaaagagcc aatcggtacc caaacatcct cgatcgggaa caccaatccc 1980 aaactcctcc aaaccatccc ccagtccaaa caaaacacca aataaaagat ccataaatcg 2040 aatcgaacca gccaacatac aacaacaaaa aagagtaaaa cataaaagcg cctcggaagg 2100 ctccagtgct atcgctaaac gctagcatta cttgcctatt ctccatcatc tcgttgtcac 2160 caccacccct agacaaaaca gctagtaaag gtagtcgaaa cacgcctccc aattatcgcc 2220 gtaagcccga acgaagccat atccacaaca agaacgcgcg tcgtgaaatt gatggaccgg 2280 aacccttgca atccaagcaa gtgggtccac ctcaacatcc agccacccat cagtgtctgc 2340 gtctctgcct ggatagccga ggcgccagcg taccggaagc tataccccaa ccggaatcac 2400 tggaccgacc tcggcgttta ggtagagtgc cagatgaaga gattggtgcc tcggatagcc 2460 agggcgtcga cagaccggaa gctaaggacc cgccggaacg gtcgaaccga ctccggcatc 2520 cgcaggacga aaaagtcatc agaagctaca acatgaaaaa cccgacgatg tatcctagta 2580 gccttggccc cttcagtgcg gacgtctttc ccacaccgga actggaggac tccccttggc 2640 gctgggaaga aatcggaaga gggacggaca agggatggct atcccatacc cctagaaagg 2700 acgacgaggg aatagaagca ggaactgaga aagttgaagg tttgaaatcg aaaaaggata 2760 ctacaagtga aaaccaacgg accaacgaag aagaaaaaga aaagattttc ggtgggcacc 2820 ctgggctact tagctcaggc ggtaagtccc atcttgtact tctccggcta tcttctctac 2880 aattaagcgt cgccttctcg tttaacacac gtgccagcag ccgatggata ctctccccaa 2940 acttttacta aaatggcttc aaataactgt tctaacagat cgcaatccac tattgtaaca 3000 gattctcatc acagaagcaa accgctggct ctgcagtgga acataaacgg gttttacaat 3060 aatttggtgg aactcgaaag gctggtgcac gatcacgcac cccttgtgat tgcattacaa 3120 gaatcccaca gagcaacgcc gacggcgatg aatgcgacct taggaaaaaa gtatctttgg 3180 acgtccaaat ttggtcgaaa catctaccac tccgtagcca taggagtttc tattgagcta 3240 ccatatgagt ctgttacggt taactctgat ctaccgttag ttgctgtcag gataacttgg 3300 ccttttgcag ttacgatagc ttcactatac atcccaaatg gcagaatacc agacctgaaa 3360 acccgtttaa tggaagcctt tgacacaata cagggaccaa taatcatcct gggagacgga 3420 aatgggcatc atgaaacatg ggggaacaat tcgaataaca ttagaggacg aattattctc 3480 gatttcacta cggacatgaa cctctccatc ctcaatgacg gttctccaac cttctggagg 3540 ggacaacagg agaccgctat tgacatctct ttagcatcgt ctactattgc aaaccgcctg 3600 atgtggaaaa tcgatacgga tctatcgggc agcgaccatt tcccgatatg gattcatacg 3660 aataatgtca caccggaaac ctctcgaaga cctcgctggc ggtataatca agctgattgg 3720 tcagctttcc aattagcgat agccaataag atggagtcga atccaacatc aattgaagaa 3780 ctttccaaca atatctacga agccgcagtc gaaaccatcc caagaacaag ctctacccca 3840 ggtcgtcgcg ctctccactg gtggtctgat gaggttaaaa aagccgtcaa aaaacgaaga 3900 aaaacgttgc gggccgcaaa acgtttgcct ttcgaccatc catacaaaga aagtgccatg 3960 aaggcctacc gagaagctca ttacgcatgc cgacaattaa ttcgtgaagc caaagaaggg 4020 tcttggtcta cgtttctgga cggaattaat gattcgcaat catccaccga actttggagc 4080 cgtgtcaatg ccatacaagg aaaaaggcgc actcgtggca tcgctttgaa aattgacaac 4140 cgtacttcaa gggaaccatc tatcatcgtt gatgcactgg cggatttttt cgccgatcta 4200 tcctcgcatc ggaagtaccc acacaacttt cgtgtaaagc accctattcc aaaggtggaa 4260 ctatacagtt ttgtagtccc tccggataga gggcaggtat ttaatcagcc tttttcaatt 4320 aacgaactca acttcgctct caataaagca gagggtcaat cagcgggccc agacgaaata 4380 gggtatccta tgctcaaaaa ccttccgttt gaaggtaaag tagctctgct tgaaaatttc 4440 aataaactat ggacaactgg tacttttccg cacagctgga aaactagttt cgttgtcccg 4500 attccaaaac acaccaaatc agcgcaggac gtagagcact accggcccat tgcattaacc 4560 agttgcctgg ccaaaataat agaacggttg gtaaatcgca gattaattga acatcttgaa 4620 agcaacagac ttatagatcc tcgacaacat gccttccgag ctggttttgg aactggcaca 4680 tacttcgcca cattaggaca ggttctcaac gacgccttgc agaaaggaga tcatatcgag 4740 atggcttcgc tagatctagc gaaggcgtat aatagagcat ggacaccgga aatcctaaac 4800 caattagtta catggaacgt taccggaaat atgctaaatt tcatcaagaa cttcctttcg 4860 gaccgtagtt tcagggtcat aattggcaac tgtcaatcca aactggtcaa cgaggaaaca 4920 ggcgttcctc aaggctcagt aatcgccgtg acgcttttcc ttgtaggcat gaacagcgtt 4980 ttcaaatttc taccgaaagg ggtgtacatc tatgtgtacg ctgacgacat tttaatcgta 5040 gtcaccggta aacatccaaa agctttaaga agaaagctcc aaacagcggt gaacgccatt 5100 gtacgctgga caagcaaggt tggatttgaa ctctcggccg aaaaatgcac cagaacccat 5160 atctgttcag ggaatcatcc tgtgctccga aaaccaatca tgattgacgg gaaagccatt 5220 ccgaccaaga aagcggtgaa aattttggga gtgaccttcg atcgacacct gaacttccga 5280 caacacttcg atctagtgaa agaaaattgt aaaagccggg ttaacctcat aaaaagcata 5340 gcgaacaaac gaacaagaag cgacagaaag atccaaatgc gggtagccga tgcaataatt 5400 tgcagccgtt tattctacgg aacggaaatc acatgccacg cattcgaaga cttaattacc 5460 agactggggc cagtatacaa caacactatt cgcgcaatat cgggtctact accttctact 5520 cctgccatgt cggcatgtat cgaagctgga gcacttccgt ttaagtacaa agcagccatc 5580 gccatcagta cgaaagtagt cagcttcctc gaacgaacga aaagcagagc agcaacgacc 5640 tacatcacca accaagcaaa ccagttactc aactttacag ccaatatttc actccccttg 5700 gtagccaaac gcaaccggtt tggagcacaa agttggcaag cgaaggaacc tcgtattgac 5760 tcctacatca agtccaatct taatcgggaa tgcctcccgg aagtagctca agcgcatttc 5820 agaaaacggg taaggactga ataccctttt actgaaacgc gttacacgga cggatcgaag 5880 acttccggag ctaatagcag aagggtgggt ttcggaatct actccacctc cctcgaagaa 5940 gcatatggcc ttactgaata ctgctcagtg ttttcagcgg aggcagcggc gatattccaa 6000 gctgcatccc aaccatgcga tggtccatta ttaatcgtga cggactcagc aagtgcatta 6060 tcagcactcc aaacaaccac taacacccac ccctggatcc aagccacaca agagctaatc 6120 tcagaaagta ggcgcataac ctttatgtgg attccaggtc ataagggcat aaccggaaac 6180 gaagaggctg accgcttggc caaccttgga cgaaatgaac gaccattgat acgaactacg 6240 ccggctgcag atataaaact gtggcttaaa gggaagataa aggatgcatg ggattctgaa 6300 tgggcacaag aaaaagaatg ctttacacga aaaataaaaa acactaccga tagatggcat 6360 gatctgccca acagatggga ccaacgagtc ctctcgagga tcagaactgg tcacacccga 6420 atctcttaca attttggagg ttcgactggt ttccggaagc aatgcgagat ctgcagggtc 6480 cacaattcag tagagcattt tatctgctgt tgcccctcca tggaacatct aaggttgaaa 6540 ttcggaatat acagcatcag atcagcgcta caaaatgagc gtaatagtga atccagcctt 6600 ataacttttc tcaagaacgc ggggatgtat aaggcgatct aaaaggaatg gcgtacgaat 6660 acttgataag aaggcaatta atcaaaggtt tcacgccata aacatcatac gtaaagctca 6720 ataacgtaca aaatatgtat taaatagaac aaaatagtgg tgaactcgcc tggggcgaaa 6780 aaccacttta ataaagaaaa aaaaaaaaaa aaaaagaaaa 6820 // ID Gypsy-1_DWil-LTR repbase; DNA; INV; 2614 BP. XX AC scaffold_179905; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DWil_; KW Gypsy-1_DWil-I; Gypsy-1_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2614 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_179905; Positions 9487 6874. XX SQ Sequence 2614 BP; 993 A; 492 C; 648 G; 481 T; 0 other; tggctcaaca cgaagggcca agaatagtaa ggatagtgtg ggcgggacaa actagatata 60 ggattaggag agctcagggg gtaattcggg tatatcggga aaggaggcta taataagtag 120 gataattaga tgtaagttta aaagatgcgg aacgggccaa aaaaaaaaaa aaaacaatga 180 aaagatgccc taatccgcca acatttaacg aaaagatact cagtaatcgc caacattaga 240 agatgccaaa tggccaataa atgatgcgag ctgagtccca aagaaaaaaa aaaacacaaa 300 acaagcaaag cacatgcaaa aactacaaag gcaaaacata aaaaaatgca cgaatacgaa 360 aatgaaaaat acacatacag aaaacaaaca caaggtacaa gggacaggga cgggcaaatc 420 cctcccgaag aaggggggag tgtgaggaac attatataga caagaataat gtcccacaca 480 acaaagccac gccaagctac aaagccacaa caaacaagat acaacaaaca gccagataca 540 aacgcagcca agacacaaac taaatacacg atataagacc aagcacagaa ttggttttat 600 aagaatagaa ttggcggtat aagagtgaaa tcggctgtaa atgaacaaat gaaacaagcc 660 ggccgaacat ggccgatcga atcccacgca accgctggca atttcgtaca agcgggaaga 720 tgcgtgggaa acataatgaa aaaccaatat acacgctcat gcgatgggca accaagtcca 780 acatggctcg atcgaaccaa gtcgaaacca tgatttgcaa tcgaacatga cggatcgaac 840 cacacgcaac gcgatttcgt acaagcggga agatgcttgg gaggcataat gaatattagt 900 ggctgtataa gagtaaatga aaccgggctt gagggcataa ttgcatgcaa aaagggggca 960 ttggcctcat tagataaatt aagagcatgg agaggcaaat ataaatgcat gcattagcgt 1020 acatacacgc acgtcacacg tacactatgc gtgagtgacg gcaagggaga ataccggtgg 1080 ccaatgggtc gcgctatgcc tacgtcacgc tcaggagagc gaaagcagtt agaatgcgag 1140 caccaaaaga atttcgtgag cgagagaaaa caaaaatgag ggccaagcaa gcccgaaaga 1200 cgggtgcggg ggacgattgg taccgaaacg aaaccaagtg agggctatgc ttgtgaatcg 1260 gggagtaggg gggaaagcat agatgcgaga agcaataatg gtttgtggat gtgtgagggg 1320 aatatgcaag ccgaaggggg agggatgagc catgggggaa gccctctatc caatggatgc 1380 gagatgcgta tgaaagtgta tttgcatggg gaattaaata atgtaatata ttaaattaat 1440 caggctaatt atgtttaatt atgataattt agttaataca attagtatat atgtatgtaa 1500 ataataatat tgagcaaata tgggtgaaat tgggtatgac gagtggatca aagggagcga 1560 cgggcgaagg tagcgctggc agagaaagca gagagagtca acgggatacg aagggcggcc 1620 aatagaaagg cgacgccaat gcgtgaaagt gatggaagca tgtagataca agggagctag 1680 acaaggacaa gggaataaat gggcgctcag ccacaaagaa agtggacata tgcgcaaagg 1740 atttacggga ctttcaccac gcatagcaac aagtggaggc gtgggtgaaa gcagagggcc 1800 gatggaaatg taccatctgt atagcggtac tacaccaaga tattcctgag gccgtgggaa 1860 ctgcctatat aaaggcaacg cccatcaagc aaagcatcaa gtcatcaagt cgtgatcaag 1920 taacaattaa ttaaggcctt acgagtaacc ttaagaaact tatcaaggaa acacaaggag 1980 caacagaaaa actaagaact atacaggccc tacgagtaac ctgtacagtt tcatttgata 2040 gtaacaagtc cttaagtaag gataacaaac accgtatcca agctagagta aattggagag 2100 aaaccaagca atctccagga gtatccaaag tcagtggtag gcacgaggat atatcctact 2160 gccgaaattc caggatctat atatcctgac gagtaccaat ttctgaggaa catcctgctt 2220 cgtgataaca gctacgggat tcgtcccaaa actttaaagc aaaagcattc tcaacgaact 2280 tagcgaaaag ggtgcagagg tgtgtagcgc aaatcaagca cactatacag atcgagacag 2340 taatccagca acggctacag aagcaagaag aagacaagaa aggacgacga gggattcgat 2400 atgactcacc aacgctgtcc aagcccagaa ctaccatcga gaatccgtca tatccaaccg 2460 agtccactaa cgatttgaat aaacatacaa ttaagcatta gcaactgaaa tcaaacgtgt 2520 tttaatttca ccggggcaca aaataaattt gttgtgccga acctagccca agagatattg 2580 taaatatccc gaaggacgaa aaaggatcgt taca 2614 // ID R2-1_SM repbase; DNA; INV; 5417 BP. XX AC . XX DT 20-DEC-2008 (Rel. 13.12, Created) DT 20-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE R2-type retrotransposon from Schmidtea mediterranea: consensus. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5417 RA Jurka J.; RT "R2-type non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 8(12), 2247-2247 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1091..4186 FT /product="R2-1_SM_1p" FT /translation="MKKVLNNETEKLPGSNLTFMCGFCDREFDTARGRGVH FT ESRGHLVERDAAVQSRVKAVVSKKYYYSNEEDVALAKMQLXHADLAKSEXL FT EAMYLALGKGRTREAIEQHIRKSLRYKGVLEEQRKLLETARGNVRQNNVGV FT PASNATKNLQRFLESLPLGTNRREERLDRIIRSNSIESQRLELIHYCNDMC FT QDFVQLDCQXNPINAIRRRNPKRLSKKQLKRAKFSALQRLWIRDRKAAAQL FT VLKDKLDSLLSNKEDSKDLGSYWQQVFERESELDRRPIPQVVENEELNSPV FT LEKEVEWAVKNIKKSTAAGPDGLTALALKKIPYSELVKLFNIILLVGFLPD FT VLKNSRTILIPEVDNPQGGGDYRPISINSVLTRTLNKILAKRVSEGDFGIN FT GQKGFKSVDGCLENLATVESILADARMKKNKLAVVFLDMSKAFDSVNHESI FT VRAGEIKGYPKLLMTYVKECYNDATTNVAGVTAKFNRGVKQGDPLSPALFN FT NVIDLAIERVSGTGIGYNMGGKKYSVVAYADDLVLFGESREGLQIALTALL FT EELKLNGLTPNPAKSASLTFERSGPHWFASTDTVTALGDQIPAMGNIETYK FT YLGIKFNSCGVVKGSLPGIYTKKLELISKAPLKPQQRLAMLTDFLIPGVLH FT QAVFGQTNAGDLRSLDKRTRRAVRSWCHLPSDTSTAFIHAKAKDGGMGIPS FT IRAEVQFGKLDRFGKLPNVKDERSKVLADNAHIKKKMLEKLGVGIPIKGVR FT CKNKLEFYNKMREELIKSNDGIGLKEASLVPSANTWLKLSDLHMSGRTFVG FT CLKTRGNLMATVTRTSRGGQNPGIELNCKKGCQYQGSLNHIVQKCPVVKGL FT RIKRHDEVVKYVEEITKKAGWSATMEPIIPFEGSHRKPDLVLVRGDLGKVV FT DIQIVSDHCGLDEKNSCKIGKYDNDIIRNYVRGLGPSRVEVAAITLNWRGV FT WSRDSFNLIKRLGMTEMDAKIISMRVLASTAKMFKTCKKVLEPVCRTKTAD FT CDGYGPEETSARPCHELNLKESSGT*" XX SQ Sequence 5417 BP; 1691 A; 966 C; 1334 G; 1399 T; 27 other; cagtgctatt cgaatgtcaa tgtgaagaaa ttcaactaag ctctggttaa cggcgggagt 60 aactatgact ctcttaagga attaagaatt tacctgcckt aktaaaartg aaatcmgttg 120 ttcatwgcaa gtggtattgt acaccttccc gcggtgctag tcgtttaaaa ctaagttaca 180 aaccacgagg ggcgtcctga cggactgsaa aagcattgag rgtcmtgaag agaggctctt 240 attgtacgaa tctcttcaac gatcgaagtc tggaccgata tgagaactaa tacattagtt 300 gacaggtgaa aaatactgtt gattacttag ttctcagtca tgtggtatat tgccagtcaa 360 ttactacatw aatattagtg tggctctcaa aggaacacga ttgrtcggca gtccaatgcg 420 cgactggcgg gcttgttgtt tgcatttgtt accggctact tgaaaaggtt atatatagca 480 gacgcttaaa gcgcgactgt aatttacaty tcattgccca gtatttgtct tttgtcagat 540 ttagcaaaat ttcatatttt gttaattacc ttaactggtt aaacgatccc ataattgctt 600 gcaattatta taaagtaatt caggtaaaaa ttacatatct ggctgatcct gccagtagtc 660 attttacttc cgccgcgcta taaaacagtt taaaaactga ataggaatca aaaagaacat 720 ggcaagcgac tatatgtaac tgggcattca acattcccta ttacatatgg tggtgcctgg 780 ggtctgtttt atataatggg tacccgggaa gtggatctgt atcaccagtc atggtgccat 840 atctttkgat aaagatacag tttaaaactg cgatgatact aatagagatc ctcttagacc 900 ttcgtaaaga agtggggatt gatgacatta gcattggaag aattaaatct ccaaggaaat 960 ggagtaactt caatgaagtc ccacaacccc gttgaagggc tgggttcgag tatcgagaga 1020 aaactctaaa ttctcttcgg ttmtgtccaa cggaggggac attactgtaa aatatcctct 1080 aaaaacaact atgaaaaaag tcttaaataa tgaaaccgaa aaattaccgg gaagtaatct 1140 aactttcatg tgcgggttct gtgatcggga attcgacacg gcwaggggca gaggagtgca 1200 cgagagtaga ggtcatttag ttgagcggga tgcggcggtt cagagcagag tgaaagccgt 1260 ggtgagtaaa aagtattatt atagtaacga agaggatgtg gcattagcga aaatgcagtt 1320 asagcatgca gatctggcca aaagcgaacw attagaagcc atgtatcttg cattgggaaa 1380 gggaagaact cgtgaagcca tagagcaaca cataaggaaa tcgttacgtt ataagggggt 1440 ccttgaagaa cagcgaaagc tccttgagac agcaagggga aatgttcggc aaaataacgt 1500 gggtgtgcca gctagcaatg ccactaaaaa tctgcaaaga ttcttagaat cgttaccctt 1560 gggaacgaat aggcgcgagg aacgattgga taggattatc cgatctaact cgatcgaaag 1620 ccaaagactt gaattgatcc actattgtaa cgatatgtgt caagactttg tgcaactaga 1680 ctgtcaaarg aaccccatca atgctataag gcgcagaaat ccgaaaagac tatcgaaaaa 1740 gcagttgaag agagctaaat ttagcgctct tcaacggctt tggataaggg atcggaaagc 1800 tgctgcacag ttagtgttga aggataagct tgatagtttg ctcagcaata aagaggattc 1860 caaggatttg ggatcgtatt ggcaacaggt cttcgaacgt gagtccgaat tagaccgcag 1920 acccatacca caagtggtgg aaaacgaaga gttaaattcc ccggtattag agaaggaagt 1980 agagtgggct gtcaaaaaca ttaagaagtc cactgccgca ggaccagacg ggttaacggc 2040 acttgccttg aagaaaatac cgtattccga gctagtcaaa ctatttaata taatactgtt 2100 ggtgggattc ctacctgatg tattaaaaaa tagtagaact atcctaatac ccgaagtgga 2160 taatccccaa gggggyggtg attatagacc gatttcgatc aattcagtgc tcactagaac 2220 actaaataag atcctagcga aacgagtctc ggaaggtgat tttggtatca atggtcaaaa 2280 aggattcaaa agtgtagatg gttgtctaga gaatctagca acagttgaat caattttggc 2340 cgatgctaga atgaagaaaa ataagcttgc ggtagtattc ttggatatga gtaaagcttt 2400 tgattctgta aaccacgaga gtattgttag agctggagaa atcaaaggtt atcccaaact 2460 attaatgacg tatgtgaagg agtgttataa cgacgctact acgaacgtcg caggtgtaac 2520 agccaagttt aaccggggag ttaaacaagg tgatcctctt tcgccggcgt tatttaataa 2580 cgtaatagac cttgcaattg agcgagtttc cggtactgga attggatata atatgggcgg 2640 taagaagtat tctgtggtcg cgtatgctga cgatcttgtt ctattcggtg agtcgagaga 2700 ggggttgcaa atagccctga cggcgctact ggaggaatta aagctaaacg gtctgacacc 2760 caatccagcg aaaagtgctt cgttgacatt tgagagatca gggccacatt ggttcgcaag 2820 taccgatacc gttacagcac taggagatca aataccggcc atgggtaaca tcgaaaccta 2880 taaatactta ggaatcaagt ttaattcgtg cggagtggtt aaagggagtc tacccgggat 2940 atacaccaag aaattggagt taatctctaa ggctcctttg aagccgcaac agcggttagc 3000 tatgctaacc gatttcttga ttcctggagt attacaccaa gccgtgtttg gacagacaaa 3060 tgcaggggac ctgcgcagtc tcgataaacg gacgagaaga gcggtgagat cttggtgtca 3120 ccttccctct gatacgtcaa cggcattcat ccatgctaaa gctaaggatg gaggaatggg 3180 aattccatct ataagagccg aagtccaatt tggaaaattg gatagatttg gaaagttacc 3240 caatgtcaaa gatgaacgat ckaaagtttt ggccgataat gctcacatta aaaagaagat 3300 gttagagaaa ttgggagtgg ggatcccaat caaaggagtg cgttgtaaga acaaactcga 3360 gttctacaac aagatgcgtg aagagttgat taagtcgaat gatggcattg gcttaaaaga 3420 ggcatctttg gttccctctg cgaacacttg gttaaaactg agtgacctac atatgagcgg 3480 tcgcacattt gtaggatgtc ttaaaacacg gggaaacctt atggctaccg taacaagaac 3540 tagcagagga ggtcagaacc cgggtataga gttgaactgt aaaaagggat gccaatatca 3600 gggaagtctg aaccatatag tccaaaagtg cccagtagtg aaagggttga ggataaaacg 3660 acatgatgaa gtcgttaagt atgtggaaga gatcacgaaa aaggctggat ggtctgcaac 3720 aatggaacca ataattccgt tcgaggggtc acaccgaaaa ccggatttag ttctggtccg 3780 gggcgatctc gggaaagttg tggatattca aatagtttct gatcactgcg gtctggatga 3840 aaaaaatagt tgtaagatcg gtaagtatga taatgacatc atacgaaatt atgtaagggg 3900 gctaggacca tcgagagtgg aggtagcggc gataaccctt aattggcgag gcgtgtggag 3960 tagagactca ttcaatctca ttaagagatt aggaatgaca gaaatggatg cgaaaatcat 4020 ttctatgaga gtgctggcga gcactgcgaa gatgtttaag acgtgtaaaa aagtgttaga 4080 gcccgtttgc agaaccaaga ccgctgattg tgatggttac ggaccggagg aaacctccgc 4140 ccgaccatgt catgagctta acttgaarga gagttcgggg acgtaaggcc acgcgcgtcg 4200 tccttgtttg atcactagtg gatcaacctt cgactccccg gaactgtggg agtggcggaa 4260 gaaaggccag aggatgtcct gaaaccatat atttatttat agaagtttta cttcatccta 4320 tttacgtatt tcagtatgaa aatgagtaaa gttctcgact cgatgagttg ggggcaacca 4380 ttgggggtcc tgaagagagg ctctcactgt aaaaaatctc ttcgtgtctg tttattccta 4440 ggcacntgct gcattatgaa gcggwgaaag taaagttaga gctgagagat aggtacttgc 4500 tgcattatga agcggagaaa ggcctcgaat aaatagggtg ttagagttat tgatggagag 4560 tatactagta agcttaagct gcgcctcgcg cggtgcccaa aaatatactt aatgagagca 4620 ataactcaag gngagtttaa ttcatatgcg catgcggcac caaggtgctg aatggcatcg 4680 attaaacctc tcctgttgta gaagcaggtc ataaatggag grgggcaacc actgaaactt 4740 atgagccaga agaagcttaa ctacawaagt tttaggcaat tactgaacgg agttaactgt 4800 tagttaacac taccatgtag ttgtttataa agcaaatatc aggtttcagt ctatatacta 4860 aaagtatttt ttgataccgt ggtatatagg caactagtta ggaaatagta agggatgacg 4920 cattgtctct ctttatggta ctgaggaaac ttatcgactc gcgagggatg aaacccgtaa 4980 accgatcgat ytagcctata agtaccagcg acagttaaac catcttacgc gaggggtaaa 5040 acctgaggac cgattatggt ataacttctc aagattagca caaaatgcga gtgcaacttg 5100 aggaggagga tttgagtgtt aattcataat gtactaatct aattaaactg tgacgggaat 5160 tgcagcttcg gctgtaatta ctttgaggcc tatcacggat tgtaaggaac atattgacac 5220 cgtaagtcta acgtgttccc gatttccaac caggtcatat gaagggctgc ccttgataag 5280 gcggatttga cccaattctt catatgagag gcttattcca gccttcccgt agtaccgtga 5340 ggttttcccg cctcgaacgg aacaatgttg cagggtaatt aagtacatcg ggctatatmg 5400 cgatatttaa cgtttta 5417 // ID hAT-2_DSe repbase; DNA; INV; 2568 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.12, Created) DT 08-OCT-2009 (Rel. 14.12, Last updated, Version 1) XX DE Hosec2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; transposase; HOBO; KW Hosec2; hAT-2_DSe. XX OS Drosophila sechellia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2568 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 942..2096 FT /product="hAT-2_DSe_1p" FT /translation="MRRISLHFINECYGLKSAVLSTHKLVDATNHCASNIA FT NSLSAVLNECNLFNKVVTIVTDNDASMKEKCELMKMKHLPCVSHTFNLLVQ FT DLLKIDSIVPLLAKCKRIVSLFKSSSIATKKLKNAQTSKSRYGLIQEVPTR FT WNSAYYMIQRIILVNDSLSALLLNSTKDPRPLSADEMFVLVEICQVLAQFH FT EATVQISTNKTLSISVIIPLIYELHQKSCKVYSQVISHPALDICEAVKKRL FT PERFSHYETRTVTRIATLLDPRFKKDGFLFTSNSEKAAKALQLELFSSLSR FT VLSTPDISTPPNTPQEKRNSFFTFLETKISNKVHSNIRMLLRTYFEGPNEP FT EDKDPLAYWMVRISFLLFKLNIHDFIPFQANSESLKPITKLF" XX SQ Sequence 2568 BP; 839 A; 471 C; 451 G; 807 T; 0 other; ggaaaaccat cgatggcact atcgatacta tcgatatttt taagtttttt caaaccatcg 60 atgcttttcg aaaacaatcg aacaatccgc actggcgtca ctaccacttt tgatacagcg 120 agagagacat gtaagtttgg taaattttat acggaatgga aatatagttt tctttctcca 180 gattatagaa aggaaataac tgtttgagaa gattggtggg gtttggaaga agttgggaga 240 tggataaata cttgaaaagt aagtactttt aatgcaaaca ctcaatcaga ctcgttcgcg 300 tacaatgtgc acatgtgtat gtatatatat atgtgcatgt agccacaaaa tatgcatttg 360 cacgtgtttt ttaaatttat attaagtaac ccctgtattc ttgatttcag accctttaac 420 aactgtggat aacacgattc caaaccttca aagagattac tgtatggacc atttagaaac 480 taagatgctc aggagcgttc acagatgtgg taagctctat aaaactctgg gatcgcatcc 540 atttattgga tcacttaaac atgcacaatc ccgaatatct gaagctgaaa aggttgctcc 600 ttccactagt ttggaacggt tcctaaaaac tgacctcact tataaaaatg ggtcagcaaa 660 aaaaaaacgg atctggacaa ggcagtaatg cgtatgatcg cgctggatgt ccaacccttt 720 ttttttttgt tgaagatagg ggattctgcg atttgattcg aaagctatac ccgaggtaca 780 aggtacccag ttaaagccat ctcagaaatg ttgatcttcc acgtgaatat gtaggcttaa 840 aattgaagtt ggaggaggat ttaagttacg ttgacaatgt atccattact actattacta 900 ttactttaac tataattact attgttggac ctcgcgtgct aatgagacgt atctcacttc 960 acttcattaa cgaatgctat ggacttaaat cagcagtttt gtcgacgcat aaacttgtag 1020 atgcaactaa ccactgcgct agtaatattg cgaactcact cagcgccgtt ttgaatgagt 1080 gtaacctttt taataaggta gtaacgattg taacagacaa cgacgcgtct atgaaagaaa 1140 agtgcgaatt aatgaaaatg aaacatttac cttgtgtctc acatactttt aatctattag 1200 tgcaagatct tctaaaaata gattctatag tgcccttact agcaaagtgt aagagaatcg 1260 tatcgctttt caaatcgagt tcaatcgcaa caaagaaact taaaaatgca caaacttcta 1320 aatcacgcta tggacttata caagaagttc caacgcgttg gaatagcgcg tactatatga 1380 ttcaacggat tattttagta aatgactcgc tttctgcatt gcttctaaac tcaaccaaag 1440 atccaagacc attatcagcg gacgaaatgt tcgttctggt tgagatatgt caagttctag 1500 ctcagtttca cgaagccaca gtgcagatat cgacaaacaa aactctttcg atttccgtaa 1560 taatacctct tatttatgag ttgcaccaaa aatcatgcaa agtatacagc caagtaattt 1620 cacaccctgc gcttgatatt tgcgaggcag taaaaaaacg attgccagag cgtttctcgc 1680 attacgaaac tcggactgtt actagaattg ctacattact ggatcccaga tttaaaaagg 1740 acggtttttt gtttacatcg aactcagaga aagccgcgaa agctttgcag ttggagttat 1800 tttcaagttt gtcccgagtt ctttccacac ctgatatttc aacaccacca aacactccac 1860 aagaaaaaag aaacagtttc tttacttttt tagaaactaa aatttcaaat aaagttcact 1920 caaacatcag aatgctcttg aggacatatt ttgagggacc taatgagcca gaggataagg 1980 atccccttgc ttattggatg gtaagaatat catttttact ttttaaatta aatattcatg 2040 actttatccc ttttcaggca aacagtgaga gtctgaagcc aataaccaaa cttttttaaa 2100 agtacctgtg cattccggcc agttcaacag aatctgaaag aactttcagt aaacctggtc 2160 ggattgtatc tgaccgctga gcttcgctta aaccaaaagt agtagatgtg ctactttttg 2220 tgaacaaaaa ccagaaacaa tgatctcttg ttgtcttaat atgaatataa aaaaaaaata 2280 ttttgttatt actaaatatt ttgaatactg tgttaattta aataaagttt gcattgaagc 2340 aaataaaacg ttttagtttt tgtttaaaaa atgcatttta aatcctttgc aggcccttat 2400 gcatatttct aaagggctac tgtaatatat tatttttatt ctgcaagggt atataaactt 2460 cggttcacgt tatattatag ttccattttt ttttttaact atcgttaata tcgatgtttt 2520 tagaaaacca tcgacaccat cgatagtgcc aaccatcgat ggttttcc 2568 // ID Gypsy-13_IS-LTR repbase; DNA; INV; 188 BP. XX AC ABJB010321873; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_IS_; KW Gypsy-13_IS-I; Gypsy-13_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010321873; Positions 229 42. XX SQ Sequence 188 BP; 64 A; 37 C; 58 G; 29 T; 0 other; tgtaatgatg tcatcaatac ggcgacaaaa gaggaagaaa gtcagactac taacggagaa 60 aaaagagaaa gaagagggag gacgaggaga ttgagggcgg agatcggacg cattcggaaa 120 agcttggctc cggaatacac cggcgaagct gtcgtacata ggagtctgcc gacctgtcct 180 caccgtca 188 // ID BEL-59_CQ-LTR repbase; DNA; INV; 372 BP. XX AC AAWU01004145; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-59_CQ_; KW BEL-59_CQ-I; BEL-59_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 272-272 (2011). XX DR GenBank; AAWU01004145; Positions 394 765. XX SQ Sequence 372 BP; 95 A; 91 C; 93 G; 93 T; 0 other; tgttggatcc ccaacctatc gaggaaaaat ggcaacactt gtcatcgtcg caatgtttgc 60 tgcgatcgat gcaatctgtg ttcagcgcgc aacggcagcc gttgctgcga tcgacgacag 120 aaaagggaga aggaagaggc gcgtgaaagg agtcgaccga gaagaagaag ttttctccac 180 gtcagtctcg cgccaccagc aggaagaaca atttcgctct ctagttttag ctattattta 240 ataaactagt atttagtgag aaactattga gtgcttcttt ccgacgccca cggccagtcc 300 agagtccact ctttcgtctt ttcggaaaac aattccgccg ttttttcgtg gggccagtct 360 gtggcccgaa ca 372 // ID Kolobok-10_HM repbase; DNA; INV; 2763 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2763 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2068-2068 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 375..2171 FT /product="Kolobok-10_HM_1p" FT /translation="MDKSKTEYKSDRVKFSTKRLRKRKCTFSGLRKQEKCV FT NIGVNNVNSNNENMVDSVNIESESVFKSASKKKVKQVMVPSGNSKCVSGYR FT FIDLEILASIFDKLGCPECLRPSQLSLSENKKSCKGYASNLSLNCVCGFKL FT DFDTSKNVAGFDINKRLVYAMRTLGQGQAGLHRFATLMNIPKGLCNKSYNV FT IVQKLATAAQTVAVETMNEAVQELREGKSLNEILDIGVSVDGTWQKRGFNS FT HNGVTAALSIKNGKVLDVEALSRKCKICDKNQDVGNSKQQLTAKKSHICNK FT NYEGSAGGMETVGAIKIFNRSLQKFNLRYTEYLGDGDSKSYTSVKNTYKGI FT EVIKLDCIGHFQKRIGSRCRQLKKQVKGLNGKGGLTNATIDRLQNYFGICI FT RQNCGNLEGMRSSALASLFHVASSSKNNWHYPHCPTGKDSWCKYNRDKANG FT TKEYKPGPGLPIEIVHKLKPIYVELTSDAYLNRLLHGRTQNQNESFNAMIW FT SRIPKSKYVSLTQLKFGLYDAIGAFNIGLKSSILIFERLNMIPGQHTISGC FT NYLNQKRVMMSEYKNQDTIKKKRKILRSRKLKKFDKVEQKEGKLYGAGVHD FT *" XX SQ Sequence 2763 BP; 1004 A; 404 C; 478 G; 876 T; 1 other; gggggaaaac ctctcagaaa aaattttttt ttcaattttt tttttccaga tattttaaaa 60 gtaaatttat taagaaatca aatggtaaaa accttattcg aaaatcgcta ttttttatat 120 caataaatga cacttttgta caaccacctg agttcaaaat tacctagcaa cgcccctagt 180 tacagaaaaa ctgtgaataa aacacatcta gtatagcttt ttatccggtt ttttttttgc 240 agtttagttt gaaatgcttc ccattatctc taaagccata ctttaaaacc acatatttga 300 ctgatagtgc taaaattagt catcttttca agcttctaca caatcaactc aagctttcta 360 tataattaaa aaaaatggac aagtctaaaa ctgaatataa aagtgataga gtaaaatttt 420 ctacaaaacg tcttcgaaaa cgtaaatgta cattttctgg tttaagaaag caagaaaaat 480 gtgtaaatat tggtgtaaat aatgtwaata gtaataatga aaatatggta gactctgtaa 540 atattgaaag tgaaagtgta tttaaaagtg catcaaaaaa aaaagtaaag caagtaatgg 600 ttccaagtgg aaactcaaaa tgtgtcagtg gctacaggtt tattgattta gaaattcttg 660 cttccatttt tgataaactt gggtgtccag aatgtttgcg accatcacag ttgtcgttaa 720 gcgaaaataa aaagtcttgc aaaggatatg catcaaattt atcattaaat tgtgtttgtg 780 gtttcaagtt agactttgac acatcaaaaa atgtagcagg gtttgatatt aacaaaagat 840 tagtgtatgc catgagaaca cttggtcaag gccaagctgg tctacatcgg tttgccactc 900 ttatgaatat cccaaaagga ttatgcaata aaagctataa tgttattgtt caaaaattgg 960 caactgcagc acaaactgtt gctgttgaaa ctatgaatga agctgtacaa gaactcagag 1020 aaggcaaatc tcttaatgag atcttagata ttggagtttc tgttgatgga acatggcaaa 1080 aaagaggatt taattcacac aatggggtga cggcagcatt atcaattaaa aatgggaagg 1140 ttttagatgt tgaagcatta tctagaaagt gtaaaatatg tgacaaaaat caagatgttg 1200 gtaattcaaa gcagcagcta acagcaaaaa aatcacatat ttgcaataaa aactatgaag 1260 gatcagccgg tggaatggaa acagttgggg caataaagat atttaatcgc tctttgcaaa 1320 aatttaattt acgatacaca gaatacttgg gtgatggtga tagcaaaagc tacacttctg 1380 tcaaaaacac atacaagggt attgaagtaa ttaaactaga ttgtattgga cattttcaaa 1440 agcgcattgg atctcgttgt agacaattaa aaaagcaagt caagggttta aatggtaaag 1500 gcggtttaac aaacgctaca attgatcggt tgcaaaatta ttttggtatt tgcattaggc 1560 aaaattgtgg caaccttgaa ggtatgagat cttcagcttt ggctagctta tttcatgtag 1620 catcttcaag caaaaataat tggcattacc cgcattgtcc cacaggaaaa gatagttggt 1680 gtaaatacaa tcgagataaa gctaatggaa ccaaagagta taaacccgga ccaggtcttc 1740 ctattgaaat agtacacaag ttaaagccca tttatgtaga actaacatca gatgcttatt 1800 taaatcgttt gcttcatgga agaacacaga atcaaaatga aagttttaat gcaatgatat 1860 ggtctagaat acctaagtca aagtatgttt cgcttacgca attgaaattt ggactctatg 1920 atgctattgg tgcctttaat ataggattaa agtctagtat acttatattt gaaagactta 1980 atatgatacc agggcaacat accatttctg gatgcaatta cctgaaccag aaaagagtta 2040 tgatgtcaga gtataaaaat caagacacca tcaaaaaaaa aaggaagatt ttaagaagcc 2100 gcaagcttaa aaaatttgac aaagttgaac agaaagaagg gaaattgtat ggggctggtg 2160 ttcatgatta aaaaatacaa tcagtatgct ctttttatat tatatttttt gtatctttgt 2220 tattatataa tatttttcat cacaatttca acttatttgg catttttctc agaacgcgtg 2280 tttttaatct cccggcagaa gtatcttgag aaccgcttga ccgattttaa tgaaattttc 2340 agggagtatt cctctcataa agatgtatgc aatgagcaga caacttttta aaaatgtttt 2400 ttaaactaaa gaaattaaaa aaaacatcta tcatttttgt gcttaaaatt tttgccaaaa 2460 ttaaacaggc ctactacctg acagaaaagt tattttcaaa aactgtctgc ttattgcata 2520 gatctatctt ttaactataa gttagaaaaa aatcagctca aagtatcatt ctaattgact 2580 gcaatctttg ccagggtgaa gagactaatt tgacattttc ccttagtaac cgcgttgcca 2640 tggcaacaga gtgccatttt ttatgataga tcttcttaac atttagtata cttttatttg 2700 gaacatgtaa taactttatt aatgctttac tttagaaaat aagtttttga ggggttttcc 2760 ccc 2763 // ID hAT-24_HM repbase; DNA; INV; 3951 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-24_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3951 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2013-2013 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 905..3064 FT /product="hAT-24_HM_1p" FT /translation="EAESLEYHLPSNDELFKLNKNPDILQKLNFIKKHPIQ FT PSFFSNIQLDLQKLYFRKTSDGKKISRKWLSVLCIDNELKALYCPFCIAFS FT SSPTTFSNGCNNFKHIHEVVKIHEESLTHRHSVEAYIQSSNDKSVEFGINN FT NLMALKKSQIQDNIHVISEVFEIVKFLGRQNLPFRGPRNSESLYKWNDEDN FT LNKGNFLELVKFTAKRDATLYKHLNAAIKNSKRRKDNLEKKSLISCGRGSL FT VTLLSKTTVNKVILSIVKSIRDRIKLELGEQNFSLQIDSTQDVGVVDQAAV FT CIRYIYEGEVKERLFALLKVVDSSGNGYYDMLKKLFSEHSINFNHIIGESF FT DGAANMRGEYSGLQSKIKSQENQKSVYVWCYSHVLNLCVCDTCKGVEAKKL FT FGFLNRLSTFFSDSYKRMHIWLSEIDSSLGSNKLKKLQKIGENNTRWWSRE FT KALFWIFDGDKCLFPTVVSALHHISISRNFEPKVCSEASSLLDNLCNFKII FT LTAHIFLNIFKIIGPTSRYLQTRGMDLLSAWSMVDSVKQEIGHLTFDNILE FT NSVHFSNNMNDLLNSMNLPDDLLVPSTLPITRQSRKKRMYDELCADETPEL FT PLDKFRVETYQSIVDHINNSLNERFTDNNQLIADVQYLIPKNFKQIEQMPN FT TALKHLANLANLDHIQLCLELNNFSKMYPKLKMSTTERTKQIYRQFDTDSD FT ENTDNDEENMNYDAAFSCE*" XX SQ Sequence 3951 BP; 1435 A; 563 C; 612 G; 1341 T; 0 other; cagaggcgga ctggccaggt gggacagtgg gaatagtcct actgggccga tgttccaaaa 60 atcgagttta aaaaatttga gaaacctggc taacgcaaaa aactcaaaat caaacaaaaa 120 ttagtttgtt ttagcagcac aatgacttca aatgtatttc tgtattactg agctttctaa 180 agccgccggc ctgaatcatt tcatattgat actgtgcatt ttaatttata aatatgaatt 240 ctaaaaaaca aaaaggagga gcggccaaag aacgcgaaaa aaaaaatcaa attacaaatt 300 gctgcccaat catgtcataa cttattaaat ttgtttaaga aaccaataaa tcagactgca 360 ataaattcta ctgctataat tcaaaagaat catctgtatg aaagtgattt aacaagtgac 420 gttgatatta ctttaaattc atccattatt ataaatcagg tagacacagc cacagagaat 480 ttattggtaa gattttcaat attatattat taaaaaaggt gtctatctac attattattt 540 tgtacaccat taatatacaa tattataaga atgttatcga ttatacaaat tatgtataaa 600 agtatatttg ttcaaatttt agtgtgatag taattctact gcaatagttc ataagaatca 660 tctacatgaa ggtgatttaa caagagatat tgatattgat ttaaattcat ccatttctat 720 aaatcagata tgcccagcca cagagaatgt attggtaagg tttttaatat taaataacta 780 gtaactgcta attatattgg tcttacaaac aaattaaacg cattttattg ctaaaagtag 840 gtttttatta ttaatttagg tttttaggtt tttattatta atttaatttg aacttatatt 900 ttaggaagct gaatctctag agtatcattt gccatctaat gatgaattat tcaaactaaa 960 caaaaatcct gatatcctcc aaaaattaaa cttcatcaaa aaacatccta tccagccatc 1020 attcttcagc aatatccaac tagatttaca aaagctctat tttagaaaaa catcagatgg 1080 caaaaaaatt tcacgaaagt ggttatcagt cctctgtatt gataatgaac tcaaagcttt 1140 atactgtccg ttttgtattg cattcagttc atctcctact acattttcta atggctgtaa 1200 taattttaaa catatccatg aagttgttaa aatacatgaa gaatcgttaa cacacaggca 1260 ttcagtagaa gcctatattc aatcgtcaaa tgataagtct gttgaatttg gtattaataa 1320 caatttaatg gcattaaaaa aaagtcaaat acaagataat atccatgtta taagtgaagt 1380 ttttgaaata gttaaatttc ttggtcgtca aaatttacca tttagaggtc ctagaaactc 1440 tgaaagtctg tataagtgga acgatgaaga caatttaaac aaaggcaatt ttttggagtt 1500 ggttaaattt acagccaaac gggatgctac attatacaaa catttgaatg cggctataaa 1560 aaacagcaaa cgcagaaaag acaatttgga aaaaaaatca ttaatttctt gtggcagggg 1620 ttcacttgta actttattgt ctaaaactac tgtgaataaa gttattttat ctatcgtgaa 1680 atcgataaga gatagaatta aattagaatt aggtgaacaa aattttagtt tacagattga 1740 tagtacacaa gatgttggtg ttgtagatca ggcagcagta tgcatccggt acatttatga 1800 aggagaagta aaagaaaggt tatttgcttt attaaaagtt gtcgattcat caggaaatgg 1860 ctactatgat atgttaaaaa aacttttttc cgaacattcc attaatttta atcatattat 1920 tggcgaatca tttgatggag cagctaatat gagaggggag tattcaggac tacaatctaa 1980 aattaaaagt caagaaaatc aaaaaagtgt ttacgtatgg tgctatagtc atgtgcttaa 2040 cttatgcgta tgtgatactt gtaaaggcgt agaagctaaa aaactgttcg gatttttgaa 2100 tcgattatca acatttttta gtgattctta taaacgtatg cacatatggt taagtgaaat 2160 cgattcaagt ttggggtcta ataaattgaa aaaacttcaa aaaataggag aaaacaatac 2220 gcgatggtgg tcgcgagaaa aagcactgtt ttggattttt gatggtgata aatgtctttt 2280 tcctacagtt gtaagtgctt tgcatcatat ttctatttct agaaattttg aacctaaagt 2340 ttgttcagaa gcatcttctt tattagataa tttatgtaat tttaaaataa ttctcacagc 2400 tcatatattt ttgaatatct ttaaaattat tgggccaacg tcacgttatc ttcagacaag 2460 aggtatggat ttattatcag cttggagtat ggtagattct gttaaacaag aaattggaca 2520 tttaaccttt gacaatattt tagaaaattc tgtacatttt tccaacaata tgaatgactt 2580 acttaatagt atgaatcttc cggatgattt attagtacca agtactttac ctattactag 2640 gcaaagcaga aaaaaaagaa tgtatgatga gctatgtgca gacgaaacac cagagttacc 2700 attagacaag ttcagagttg aaacttatca atcaatagtt gatcacatta ataatagttt 2760 aaatgaacgc ttcaccgaca acaatcaatt gatagcggat gttcaatact tgatacctaa 2820 aaattttaaa caaattgaac aaatgccaaa cacagcttta aaacacttgg ccaatttagc 2880 aaatttagat cacatacaac tttgcttaga actaaataat ttttctaaaa tgtatcccaa 2940 gttaaaaatg tctacaacag aaagaactaa acaaatatac cgtcaatttg atactgattc 3000 agatgaaaac acagataatg atgaagaaaa tatgaattat gatgcagcat tttcttgtga 3060 gtaaaattgt gcttatatat tgttctatat ataatttaaa atattatttt gactgaactt 3120 tatgttggat tttaggtaac aacaaaaaca gcatacatga tactaacaat tgtttacaat 3180 gtgtatacaa acttgtatac aatttaaaca tgcatgtctc ggcttatact caactatgtg 3240 cttcatatga gttttttctt actttgtcag tcaccgaagt caactgcgaa aggacattca 3300 gcaaacttaa gcttgttaaa acaagactca gagccaatat aagtcaagat aatttagaag 3360 cactcttaat tatgagtgtt gaaaaacaat tactcaatga aattgaaatt tcaaatgtca 3420 ttgaatattt aaaagctagt tctacagtaa tgactaaaat gttttcaata taaaattgtc 3480 atttaagttt tgttttatat aatttaagta ttttcttaac tataataaat aataatgcat 3540 taatttgtat ggattatgat tcttcttctt cattctctgg catgactact ctctaagaga 3600 ttttgctgac attactaaac tttgccacct atctctatct atgaattata ttatatgaaa 3660 aaaattgttt tgagtctaat tgttgattat catgtttttt aatgttgaaa ctagacccat 3720 agtatcagta tgtaacataa cacataccag agtacgtatt taacctaatt aaattaaatt 3780 gtacatgcta tttataggta ccttataaca attaattata tttatttatg attttttttg 3840 ggtggggggg ggtgtttatt ttttgggggg ctaatcaccc ttcaggccac ctctatttgg 3900 gccggtgacc gtgatttcct actgggccga acacccctca gtacgcctct g 3951 // ID Kiri-20_AAe repbase; DNA; INV; 4639 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-20_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4639 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 715-715 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >94% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 276..1070 FT /product="Kiri-20_AAe_1p" FT /translation="MNQNKPLTRSTSSSSSSLNAIGKQQTLQHNNNVESLT FT DLWDKIQEMFAQSKNDIEAKIDSCKTDLEKKIDALEQKLSDLRLDCGAEIE FT KIANVVSDVENDLACTKRNVVRLESSHELIVSGVPYTNDENPMLIFQNIAK FT ALSYCATAIPMVHVKRLAKLPISVGSSPPLLIQFALRNLRDEFYGRYLRER FT SLNLRHIGFNNDNRIYMNENLSQADREIRTQAIKLKRQGRIHQVYTRSGMV FT YARIKNQDDAVPLQSLEQLFSYVN" FT CDS 1658..4480 FT /product="Kiri-20_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MANSHNNTSASMLIPRAVFNTILDKNAINICHINVQS FT LCARNFSKFHELKANFFNSKMDIVCMTETWLGNVVTDQMISIEGYNLLRND FT RDRHGGGLCIYIRNNLSFKILQSSIPCNPMNSTSTTEFLVIEVRSGLEPFL FT LSLYYNPPDNDCSELLRVHFEQYTVKYNHTFFIGDFNTDLLKNNSRSHRFR FT DTLSSMSYTCLNSEPTFFYTSGCSLLDLIITDSPSIVSSFNQVSIPFASRH FT DLIFASLNIAKADDKIDIYRDYKNYNERSLHDAFHNMNWDSFYSITDSNIL FT LNVLNNRLKFLHDEFIPLRKQKHKTNPWFNRDIELAIVDRNIAYSNWIRSR FT DASDELHFKRQRNRVNLLIRDAKRRYDSNKFNASLPCKQLWSNIKKLGVSK FT TNSLENNLDFCGDDINSYFTENFTVDDLSTPTVSYRSEGFRFRLVDDTEII FT YAINTIKSGAVGLDGIPIIFIKIMLPLMLPYFKHLFNTIISSSKFPSGWKA FT VKIIPIKKKPRQNSISNLRPISILCALSKVFEKLLKTQISAFVSEMEFLHP FT FQSGFRQHHGTNTALMKVHDDIAKVIDKKGIAILLLIDFAKAFDRVSHRKL FT LLKLGEVFQFSHAATKLMHSYLTDRSQAVFCNDKLSRFESIVSGVPQGSVL FT GPLLFSLFINDLPSTLRYCSVHLFADDVQIYLCELEPSNYGSLGEKVNYDL FT QRILDWSQRNLLPINPAKTKAMLINRGRNIFECPDLVMDGENIEFVDRVES FT LGIIITSKLNWDDQINSQCGKIYGCLKRLNLVTRHFSIPIKLKLFKTLLLP FT HFIYGDFINLNASVASLDKLRLALNSCVRYVYNLSRFSRVSHLHKELIGCS FT FSNLYRYRSCITIFKILLLKKPEYLFTKLSPMRSDRNRSLLIPQHRSAYYS FT QSLFARGIVFWNQLPLHIKTNVSMVNFKRDLKQLLAA" XX SQ Sequence 4639 BP; 1461 A; 830 C; 824 G; 1524 T; 0 other; ttctgaaggg aatgagtgct atagagtgaa acaataggtg gctgcagtta cgttggctta 60 taagtgatgc tggagtcttt ctaaaatcaa gttccaaggc agttcatcat atatagtgaa 120 agtttgatct tcatcaacta aagttttccg cctacaatta cattccgcag tgaattcgtt 180 gaataattac tgctaagcac tactacgtca agtccccttc agaattgtat gccaattggt 240 tactctcttc agtttgtcct ttgttctttg acaaaatgaa tcaaaacaaa ccactcactc 300 gctccacatc ttcatcgtcc tcttcgttga acgcaatcgg taaacagcaa acacttcaac 360 ataacaataa tgtggaatcg ttaactgacc tatgggataa gattcaggaa atgtttgcgc 420 agtcgaaaaa cgatattgaa gccaaaatcg attcttgcaa gacggacctg gagaaaaaga 480 tcgatgctct tgaacaaaaa ctatctgatc tcagactgga ttgcggtgca gaaatagaaa 540 aaattgctaa tgttgtttcg gatgtggaaa acgatctagc ttgtactaaa cgaaatgttg 600 ttcggctgga gtcttctcat gagctaatcg tttctggagt accgtacacg aacgatgaaa 660 atccaatgtt aatcttccag aacattgcta aggctctatc ctactgtgcg acagctatac 720 caatggtgca tgtcaaacgg cttgcaaaac ttccaatcag cgttggatca tcaccgcctt 780 tacttattca atttgcgcta cggaatctac gtgatgaatt ttacgggagg tatctacgtg 840 agagatcgct caaccttcgg cacattggat tcaacaatga caacagaatc tacatgaatg 900 aaaacctaag ccaggctgac agggagatca ggacgcaagc aatcaaactt aaaagacaag 960 gacggattca tcaagtatac acacgaagtg gaatggttta tgcacggata aaaaaccagg 1020 atgatgcagt accgttacaa tctttggaac aacttttttc gtacgtcaac taacctatcc 1080 tatgaatttc ttattcctgc tcatattatc catgtttcac ttccatgtta atccatgaaa 1140 cccagtccgc tcctaaaagt cactgaatcg aactgaattg ccgaatttcc ccttccttgt 1200 tatatacact tttttcctac caagcaaatc catcaatccg atcccaagtt gtccttgtct 1260 ccatccttcc tgaaagttaa tggaagaaca tcgaggagat cacatcgtcg aatttaatta 1320 tggtaatgct gctgctggtg ctgaatggat gctgctgttg acggattgct gatgctgttg 1380 ctgtgatttt ttttgctact caattgaaga tgacactgat gatctattgt atatggccta 1440 cgtgaaactc gatactgttg tttgcgatac tgctgttatt tttgtgcttt actattgaat 1500 tagcttcata agccaattag actgcaacta ttgtttattg aattcatgtc cagaaactta 1560 attaggttag gatacttcat tgatgatact gtttcatatc tgctgcgact gttatatggg 1620 tttcttttga agttcgttct atttttcgta tacgataatg gctaattcac ataacaatac 1680 ctctgcgtct atgttgattc caagagcagt ttttaacaca attttagaca aaaatgcaat 1740 aaatatttgc catataaacg tacaaagctt atgtgctaga aatttttcca agtttcatga 1800 actcaaagct aattttttca acagtaaaat ggatatagta tgcatgacgg aaacttggct 1860 tggtaatgtc gttacagatc agatgatttc tattgaagga tacaacttat tgcgaaatga 1920 ccgtgatcga catggtggag gtttgtgcat atatattaga aacaatttgt ctttcaaaat 1980 actacaatcg tccattccat gcaatccaat gaattctaca tcaacaactg agtttctggt 2040 gattgaagtg cggagtggct tagagccatt cttgttatct ttgtattaca atcctcccga 2100 taatgactgc tctgaactct taagagtgca tttcgaacag tataccgtca aatacaatca 2160 tacatttttt atcggagact tcaatacaga tcttttaaaa aataattcta gatctcatcg 2220 ttttcgggac acgttgtctt ccatgtcgta cacatgtctc aatagtgaac caacgttttt 2280 ctatacaagt ggttgctctt tactagatct tatcataact gattctccca gcatagtctc 2340 aagttttaac caagtttcaa taccatttgc ttccaggcat gatcttattt ttgcttcgtt 2400 aaacatcgcc aaggctgatg ataagataga tatttacaga gattacaaaa attacaatga 2460 acggagcttg catgacgcat ttcataatat gaattgggat tccttttata gcatcacaga 2520 tagtaacatt cttcttaatg tattaaataa tcgtttaaaa ttcttacatg atgaattcat 2580 tcctttacgg aagcaaaaac ataaaaccaa tccttggttt aaccgggata tcgagcttgc 2640 tattgtagac agaaatattg cttattctaa ctggatacgc agtagggatg cttctgatga 2700 gctccatttt aaacgacaaa ggaatagagt aaatctgttg atacgtgatg ccaaacgacg 2760 atatgattcc aataagttta acgcaagttt gccttgtaag caattatggt ctaatataaa 2820 gaaattagga gtttcaaaaa ctaattcttt ggaaaataat cttgactttt gtggagatga 2880 tattaatagc tattttactg agaatttcac agtagatgat ttatctacac cgactgtaag 2940 ttatagatct gaaggattca gatttcgctt ggtggatgac actgaaataa tttacgccat 3000 caatacaata aaatcgggtg ctgtaggttt agatggaatt cctataatat ttatcaaaat 3060 aatgctacca cttatgttgc cgtattttaa gcatttgttc aatactataa tttctagttc 3120 aaaatttccg agtggttgga aagcagtgaa aattattcct attaaaaaga aacctagaca 3180 aaactcgatc agtaaccttc ggcctatcag tattctgtgt gctctttcca aagtttttga 3240 aaaactgtta aaaacgcaaa tttctgcctt tgtttctgag atggaattcc tacacccatt 3300 tcaatcagga ttcagacaac atcatggtac aaatacagcc ttaatgaaag ttcatgatga 3360 tattgccaaa gttatagata aaaaaggaat cgctattttg ctgctgattg attttgcaaa 3420 agcatttgat cgagtgtcgc atcgaaagct tctattgaaa ttaggagaag tgttccagtt 3480 ctctcatgcg gcaactaagt taatgcattc atacctaact gatcgatctc aggctgtttt 3540 ctgcaatgac aagttatctc gctttgagag tattgtatct ggagttcccc aaggatcagt 3600 actagggcca ttgctttttt cattatttat taatgatttg ccctccacct tacgctactg 3660 ttcagttcat ctctttgctg atgatgtcca gatatatttg tgtgaattgg agccaagtaa 3720 ttatggttcc ttgggagaaa aggttaatta tgatctacaa agaattttag attggtcaca 3780 gcgaaatttg ttaccaatca atccagcaaa aactaaagca atgctgataa atagaggtag 3840 gaatattttt gagtgtccgg atttggtaat ggatggagaa aacatagaat ttgttgatcg 3900 tgtagagagt ctaggaataa tcataacgtc taaacttaat tgggatgatc agataaactc 3960 acaatgtggt aaaatatatg gttgcctaaa aagattaaat ttagttacca gacattttag 4020 tatccctatc aaacttaaac tttttaaaac tttacttctc cctcatttca tttatggtga 4080 ttttattaat ttgaatgcgt cggttgcttc gctagataag ttgagacttg ctctaaattc 4140 ttgtgtaaga tacgtttata atttgtcgag attttcaaga gtgtctcatc tccacaaaga 4200 actcattggt tgcagttttt caaaccttta tagatatagg tcatgcataa caatattcaa 4260 aattctattg ttgaaaaagc cagaatattt gtttacaaaa ctttctccta tgcgatccga 4320 tagaaataga agccttttaa ttcctcaaca tcgttctgca tattacagtc aatctttgtt 4380 cgcaagaggt attgtcttct ggaaccaatt gccactacat attaaaacaa atgtctcaat 4440 ggtgaatttt aaaagggacc taaaacagtt gttggcagca tgattcaggg atttgatgaa 4500 gtatagtttg aattaacatt gaaaattaaa tattagaata tctttggcgt attattttct 4560 tatcaagtgt aatgatttac aaggctttta gccttatatt acatgaattc acaaataaat 4620 aaataaataa ataaataaa 4639 // ID Gypsy-168_AA-LTR repbase; DNA; INV; 305 BP. XX AC supercont1.332; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-168_AA_; KW Gypsy-168_AA-I; Gypsy-168_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.332; Positions 1232367 1232063. XX SQ Sequence 305 BP; 92 A; 59 C; 50 G; 104 T; 0 other; tgaagtgttt aatattagcg tgaacatatt acgcattcaa tagttgtttt aattaatagt 60 gtaattccgc ttaacggaaa ccaacactat caagtgtaat gactaactga ttcacttata 120 cgaatgcaga cgaaggtctg atcgatcata ataataaacc tacgagtcag tagattctga 180 aacctagctg agtacagatc gcctttttat ttcattccat cacagttctc gcgctttatc 240 tttagcctcg gttcaagtca gttgtagtta aaacgtgaaa tctttctctt ttctcggact 300 tatca 305 // ID Gypsy-70_CQ-LTR repbase; DNA; INV; 197 BP. XX AC AAWU01039681; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-70_CQ_; KW Gypsy-70_CQ-I; Gypsy-70_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 520-520 (2011). XX DR Genome; AAWU01039681; Positions 7152 6956. XX SQ Sequence 197 BP; 71 A; 43 C; 44 G; 39 T; 0 other; tgagaggcga gagaagaaga gagcaatctg ggccgaacac gatactactc tcgggtagta 60 gaaccgaacg ttaactatta ataaagcagt ctcaacttga actacaacca cgaaggacgt 120 attttgttaa gctatccgaa aagcactctt taaacccaaa cccaagataa gctacggtcc 180 gagttaagga cgtagca 197 // ID CR1-56_HM repbase; DNA; INV; 4558 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-56_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4558 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1884-1884 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 72..812 FT /product="CR1-56_HM_1p" FT /translation="MEINVKNIEKIITSKLEEQKRSILAETERLLKDHEKN FT FTSIISSNFKIITARLDKMDAEINNNKLNIRKIENEVNEIKVSLNFQEEKS FT RDEFTLIKTKYENEIMMLKKKSIDLENRSRRNNLRIDGVKETPEENWSDCE FT KVVKDIFKTKLNILGEVVVERAHRVGTVKDRKTPRSIVLKLLNYQDKNKIL FT SSVNKLKGSGIYINEDYAKETMLERKKLWEEVKQLRNEGKFAVIKFDKIYC FT RDFRK*" FT CDS 861..3944 FT /product="CR1-56_HM_2p" FT /translation="MTNLTIDFEKLHFNVFKSANVLLNDFSDADLHFFNEK FT NFSSSFFEIKTFKNELKTLKSNFTVIHINIRSINNNMDKLKYFLIECDYLF FT SMICITETWCSDESCRINSSLEIPKYKLLSFERKTNKRGGGIITYIRNDQI FT TKIREDLSTSDADSEVFTIEITSNKSKNILVSTCYRPPEGDLLKFSNHLKQ FT IFIKNNNEQKKIFCIGDINIDCLQYEKNANIKLFFDEMLQYYIFPIINKPT FT RVTPTSISAIDNILTNSIFDTSLKAGIIKTDISDHFPVFFSLTQDIKSINS FT CKIKIYKRKINKFAIQQFKDSLSAEQWDKVYQECNLGNTNSAYNYFEKIFL FT KNYNMHFPIKEKLIKEKYLKCPWITKGIKKSSKKKQKLYIKYLKNRSEANL FT NVYKQYKNLFEKIKKNSKKNFYSNKIKNSNGNIKKTWDIIKEIIGTKHCKS FT NSLPAQIVIDNNEYNDYNVISEKFNSFFVNIGPNLASKIHCPNNSFETYLT FT SVNSELIFKELKIDELENAINSIKINKSPGVDDISSNIVANVSSQIRKPIF FT EIFKSSIKTGTVPDKLKVAKIIPIFKTGETFQINNYRPISILPIFSKILER FT IIYNRLYEYLIQNKLLDKKQFGFQSQHSTEHAILDIVNSISDSFNKKQFVL FT GTFIDLSKAFDTINHDILLKKMEKYGIKNTTLDWFKSYLCNRQQCVISNDN FT KYSNLLEIKCGVPQGSILGPLLFLIYINDLPQSLKKLDAVMFADDTNLFYS FT SASIENLFESVNDDLENLNTWFKVNKLSLNTEKTKYILFHSNHQKNNIPRM FT LPLLKIDNINIERTKTVKFLGIIIDETISWKAHINTINTKISKSIGILYKV FT KPLLSQKNLKSLYFSYIQSYLTYANIAWGSTNKSKLNSLYVHQKHASRLIY FT NKNKFTHADPLLKNLNALNVYQINIYQNILFMLKYMLGLVPSHFTNNFFQT FT YANRYSTRGTGNFTLPIKKTKFSRFSIAYRGPYLYNKIISQNIELTKLDNL FT TILKKKLKDLIINKTNFIDMY*" XX SQ Sequence 4558 BP; 1916 A; 673 C; 571 G; 1397 T; 1 other; tttttttttt gcgtttgcgg cacaaacgag aaagacgtgt tttttttgag aataacaatt 60 ttacaataaa aatggaaata aatgttaaaa atatagaaaa aattatcaca agcaaactag 120 aagaacaaaa aagaagtata ctagcagaaa cggagagact actcaaagat catgaaaaaa 180 acttcacttc aataataagt tcgaacttca aaataataac agcaagatta gacaaaatgg 240 acgcagaaat aaacaataac aagttaaata tccgaaaaat cgaaaatgaa gttaacgaaa 300 taaaagtcag cttaaatttt caagaagaaa aatctaggga tgaatttact ctcattaaaa 360 caaaatatga aaatgaaatt atgatgctaa aaaagaaatc tattgactta gagaatcgtt 420 ccagacgaaa caacttacgt atcgatggtg taaaagaaac ccctgaagaa aactggagtg 480 attgtgagaa ggttgtaaaa gacattttta aaacaaaact aaatatatta ggtgaagtag 540 ttgtggaacg agcgcacaga gtcggaactg ttaaagatag aaaaacacca agatcgatcg 600 ttttaaagct attaaattat caggataaaa ataaaatatt aagctcagtt aataaactta 660 aaggatcagg catatatatc aacgaagatt atgcaaaaga aacgatgcta gaaaggaaga 720 agctttggga agaagttaaa cagttacgca acgaaggtaa atttgcagtt attaagtttg 780 ataaaattta ttgtcgagat tttagaaagt aaacacgcga agttttatta agcgaaacgc 840 tattttaatt attttaaata atgacgaact taacaataga ttttgaaaaa ctacacttta 900 atgttttcaa atcggcaaat gtgttactta atgacttttc tgacgctgat ttgcattttt 960 ttaacgaaaa gaattttagc tcatccttct ttgaaataaa aacttttaaa aacgaactaa 1020 aaaccttaaa gagtaatttt acggtaatac acattaacat tagaagtatt aataacaaca 1080 tggataagct aaaatatttt cttattgaat gcgattatct attcagtatg atatgcataa 1140 cagaaacgtg gtgttctgac gaatcatgca gaataaactc aagtttagaa attcctaagt 1200 acaagttatt atcgtttgaa agaaaaacta acaaaagggg aggtggaatt ataacttata 1260 ttaggaatga tcaaattact aaaataagag aagacctttc gacctcagat gccgatagtg 1320 aggtctttac aattgaaata actagcaaya aatcaaaaaa catactggtc tccacatgtt 1380 accgaccgcc tgaaggtgat ttattaaaat tttccaatca tttaaaacaa atttttataa 1440 aaaacaacaa tgagcaaaaa aaaatattct gtattggtga tataaacata gattgtttac 1500 aatatgaaaa aaatgccaat attaaacttt tttttgacga aatgcttcaa tattacatct 1560 tcccaattat taataaacca acccgagtaa ctccaacttc aatctctgca atagacaata 1620 tattaactaa ttcaatcttt gatacttctc taaaggcagg aataataaag acagacattt 1680 ctgatcattt tcctgttttc ttctctttga cacaagatat aaaatctata aatagttgta 1740 aaataaaaat ttataaaagg aaaatcaaca aatttgctat tcaacaattt aaagactcac 1800 tatcggcaga acaatgggac aaggtatacc aagaatgcaa ccttgggaac acaaactctg 1860 cttataatta ctttgaaaaa attttcctaa aaaactataa tatgcatttc ccaattaaag 1920 aaaaattaat aaaagaaaaa tatttaaaat gtccttggat caccaaaggt attaaaaaat 1980 cctccaaaaa aaaacaaaaa ctttatatca aatacttaaa aaatagaagt gaggcaaacc 2040 taaatgttta caaacagtat aaaaacttgt ttgaaaaaat taaaaaaaat tcgaagaaaa 2100 acttttactc aaataaaata aaaaactcaa atggtaatat taaaaaaact tgggatatca 2160 taaaagaaat aattgggact aaacactgca aatcaaatag tttacctgca caaattgtca 2220 tagataataa tgagtataac gattataatg taatttcaga aaaatttaac agtttttttg 2280 tcaacatagg cccaaatcta gcttcaaaaa tccactgtcc aaacaactca tttgaaactt 2340 acttaactag tgtcaacagc gaactaatat ttaaagaact aaaaattgat gaactcgaaa 2400 atgcaataaa ctctattaaa ataaacaagt ctccaggtgt agatgacatt agcagtaata 2460 ttgttgctaa tgtctcttcg cagatacgca aacctatttt tgaaatattc aaatcatcaa 2520 ttaaaactgg aactgtacca gataaattaa aagtagctaa aattatacct atttttaaaa 2580 ccggtgaaac atttcaaata aataattaca gaccaatttc tattcttcca attttttcta 2640 aaatacttga acgaataatc tacaatagac tatatgaata tctaattcaa aacaaactct 2700 tagataaaaa acaattcggt tttcaatcac aacactcgac agaacatgca attttagata 2760 tagttaatag cataagtgat tcttttaata aaaagcaatt tgtattagga acttttatag 2820 acctatccaa agcatttgac acaattaatc atgatatcct acttaaaaaa atggaaaaat 2880 acggaataaa aaatactacc ctagattggt ttaaaagtta tttatgcaac agacaacaat 2940 gcgttatttc gaacgataat aaatattcaa atttactaga aattaaatgc ggagttcccc 3000 aaggttccat tcttggccct cttttatttt taatttatat taacgatctt ccacaatccc 3060 taaaaaaact tgatgctgta atgtttgctg atgacacaaa tttgttttat tcatcagcat 3120 caattgaaaa tctctttgaa tctgtgaacg atgaccttga gaatctaaac acctggttta 3180 aagtaaataa attatcttta aatacagaaa aaacaaaata catcttgttt cattccaacc 3240 atcaaaaaaa taatatacca agaatgctac cattactaaa aattgataac ataaatatcg 3300 aaagaactaa aaccgttaaa tttcttggga taattattga cgaaacgatt tcttggaaag 3360 cccatataaa tacaataaat acaaaaatat caaaaagtat cggcatactc tacaaagtca 3420 aacctttact ttcccagaaa aatcttaaaa gtctctactt ttcttatata caaagttatt 3480 taacatatgc taatattgcc tggggaagta caaataaatc taaactaaat tctctatatg 3540 tacaccagaa acacgcatca agattgattt ataataaaaa taaattcact catgccgatc 3600 ctttacttaa aaacttgaat gcgttaaatg tctatcaaat taacatctac caaaatattt 3660 tatttatgct caaatatatg ctcgggcttg tcccatctca ttttactaat aacttttttc 3720 aaacctacgc caacagatat agcacaagag gaacaggaaa tttcaccctg cccattaaaa 3780 aaacaaaatt ttcaagattt tcaattgcat accgtggccc ctacttatac aacaaaataa 3840 tatctcaaaa tatagaactt actaaattag ataatcttac tatccttaaa aaaaaattaa 3900 aagatctcat aattaacaaa accaatttta tcgatatgta ttaaatgtaa aagaaaatga 3960 atataataaa tgtaacaaga acaatttaaa ttttaaaaaa taaataaaaa aaaaataata 4020 ataataataa taataaatat agtagcaact atacatagta aaatattact gaacagtgga 4080 cctcaaaagt taatgtgtgt tttgtgaaat tactctttaa attttttgtt cattttgcat 4140 tttaaaggtt ctcgatgaaa agacttctct tagtcttctg cgagtttcct tacaacaaca 4200 tagcgttatt atgatatatt aatatatttt ttcaacgata accacaagca agtccctagt 4260 agccctgtgg tgcgacggac ttgcgtatat tttttaaatg catgggttaa tagccagcac 4320 tttatattat aaaccacgat tttttacttt ttattttttt tatatattta tttttttttt 4380 gtttttctct taacatttct aacccactgt aatacgccaa gcgtgaaaca gcattgttaa 4440 cgctctttag atattgcaaa tatgtgtggt ggacatttat ataaactgga gtcatgtaaa 4500 tatttgttct tgaaatatat aataattgtt gtaaaaaaaa aaaaaaaaaa aaaaaaaa 4558 // ID Gypsy-9_RP-LTR repbase; DNA; INV; 404 BP. XX AC ACPB02035853; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_RP_; KW Gypsy-9_RP-I; Gypsy-9_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-404 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02035853; Positions 106851 107254. XX SQ Sequence 404 BP; 96 A; 61 C; 102 G; 145 T; 0 other; tgtaactggg cactagaatt agttgctagg tttatgaaca cagctgggtt tttattggaa 60 gaaaaagtat attattgact ttcgggcatt ctgagttggg acttgctgga atacagacga 120 ggattgagag tttgtagaaa agccaagttt ccctcttgct gacctgcgac ttttcttgtc 180 tgcggcccat gcggcgtatg tatgtttatt catgtaccaa atattttgtt aagtaaatcc 240 gttaagtaaa ttcatcagac ttattgttat catagctttg ttcaagtagt ttagttgtcc 300 tatctatctg tgagtgtatg tacgtgtgtg tgagtgtgtg cttgagtgtg tgtaagttgc 360 tctctataga agtgtagttc gtacgaggta gcggctggcc taca 404 // ID BEL-177_AA-LTR repbase; DNA; INV; 436 BP. XX AC supercont1.8; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-177_AA_; KW BEL-177_AA-I; BEL-177_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-436 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.8; Positions 2743997 2744432. XX SQ Sequence 436 BP; 145 A; 70 C; 81 G; 140 T; 0 other; tgttcgccgc aacagcccct cgtatcgaac acccctactg agcaccatgg tgagctacct 60 aaaagtcaac agacagtatg agatcttctg gtatgacaat ttaaattgaa tgcgtaggag 120 attatttgta ttgagaaatc tattttgatc gattgtgaaa ttttctgaat taaaaaggta 180 caagagtttg aaattaaact gaatttaagt agtattcaat agtacatttt cattatagca 240 tttttgaagt taaattcatc atctgtttgc gaattggtag tgatacagtt aaatcggaga 300 ctaaaatgtg agtcgtaata taagtcaaga gagttgaatt gattgctaat aaaattacta 360 tttacagctt tgagcttaat ctactcctaa aacgagtgtt ctgctgctag acctccgaaa 420 gaatccgttc gcaaca 436 // ID APAIB_ME repbase; DNA; INV; 167 BP. XX AC X61119; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE M.edulis ApaI repetitive DNA sequence. XX KW Satellite; Simple Repeat; APAIB; APAIB_ME; KW ApaI repetitive DNA family; highly-repetitive sequence. XX OS Mytilus edulis OC Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia; Mytiloida; OC Mytiloidea; Mytilidae; Mytilinae; Mytilus. XX RN [1] RP 1-167 RA Cornudella L.; RT "APAIB_ME."; RL Direct Submission to Genbank (30-JUL-1991)L. Cornudella, Centro RL de Investigacion Y, Desarrollo del Csic, Joedi Girona 18-26, 0834 RL Barcelona, SPAIN. XX RN [2] RP 1-167 RA Ruiz-Lara S., Prats E., Sainz J. and Cornudella L.; RT "Direct submission."; RL Unpublished. XX DR GenBank; X61119; Positions 1 167. XX SQ Sequence 167 BP; 47 A; 38 C; 25 G; 57 T; 0 other; gggccctttt ttggccccta attcctaaac tgttgggacc aaaactccca aaatcaatcc 60 caaccttcct tttgtggtca taaaccttgt gtttaaattt catagatttc tatttactta 120 tactaaagtt atggtgcgaa aaccaagaat aatgcttatt tgggccc 167 // ID R2_HC repbase; DNA; INV; 1565 BP. XX AC AF015816; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Hippodamia convergens retrotransposon R2 reverse transcriptase DE gene, partial cds. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_HC. XX OS Hippodamia convergens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Coccinellidae; Coccinellinae; Coccinellini; Hippodamia. XX RN [1] RP 1-1565 RA Burke D.W., Malik S.H. and Eickbush H.T.; RT "R1 and R2 Provide an Estimate of the Age and Stability of RT Retrotransposons."; RL Unpublished. XX RN [2] RP 1-1565 RA Burke D.W. and Eickbush H.T.; RT "R2_HC."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015816; Positions 1 1565. XX SQ Sequence 1565 BP; 314 A; 400 C; 490 G; 361 T; 0 other; gcgtttgcgg acgacgtaat cctctgcggc accacttctt ggggactaca gagaaatctg 60 gaaatcttcg aggaggagct ccgaagaagc ggactttcct taaatcctgg gaaaagtaag 120 tgcgtttccc ttgttgcttc tggaagggaa aagaaggtta agctagtgat gaccccgaca 180 ttcagagcgt cgggtagttg gcttagccag gtcgatggga ccaccttctg gaagtacttg 240 ggattacagt ttaggggttg tgggatggcc ggttgtggtt cggacgatgt agctgagtgc 300 ttggagcggc tcactcgtgc gccgttgaag cctcagcagc ggatgcactt gctgcgggtc 360 ttccttttgc ccagatttta tcacgtctgg acatttggaa gactcaatgc gggcatcctg 420 cgtcgtctgg acattcgtgt tcggaatgcc atccggacat cggtgcgcct accccacgac 480 gtaccggtgg gatacttcca cgcgccgact aacgccgggg gactcggtat tccgcaattg 540 tcgcgcttca ttccgctgtt gcgtcttaag aggtttgagc gtttggctca ctcctccgtc 600 gaaagcgttc gggaatgcgc gaggaccgaa cctgccgtgg cgaaggtccg gtggtgtcgg 660 gagcggttag cggacgtcgt agaccgggtc gcggatggaa cgcagagtct tcgcgaattt 720 tggacgcgtg agctttatcg gtcgatggat gggagggcgc ttagggagtc tgtgagagag 780 actccgagta cacagtggct ccgctgctgt acccgtgtca taccagccag agactggctc 840 aattacatct ccgttcacat taatgcatta ccatcgcgtg ttcgaacatc acgtggcaga 900 cgtgacggcg tggatgtgac ctgtaggggc ggatgcttga cagctgaaac ccctgctcac 960 tgcatccagg tttgtcaccg aacccacggc gggcgggtgt tgcgacacga tgccatcgcg 1020 aaggcgctgt cagtacatct tacgcagcgg ggttggagtg ttaggagaga agtctcctat 1080 cagacggttg ttggagtccg acgacccgac attgtgcttc ttgcggggcg agaaatcgcc 1140 gtcgtggacg tgcaagttgt cgcacctaac ccttctctgg acagcgcaca ccgcaaaaag 1200 gtcgccaagt atcgtgacga ggctcagctc gccacgtgct tggtgcgagg agcttcggta 1260 caaccaagga ggagagcgga agaagaaacg caagtgcgat ttgccagcgc aaccatctcg 1320 tggcgaggtg tctggtcttc cgagtccgcg cggtccctgc gtgagctggg gctgacggac 1380 cgggagctgg cacaatacag cacctacgac ctgcgtggct cctggatgaa ctgggtaagg 1440 tttggtgcgt ctacgtctac tcggatggcg tggcgcccgt aaaatctcca aaggtaatcg 1500 ccaaaccgtt tggtttctac atctgcgttc ccttcaggaa cggactcggt ttcatcattt 1560 cctaa 1565 // ID Mariner-3_SM repbase; DNA; INV; 1698 BP. XX AC . XX DT 08-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA-transposon from Schmidtea mediterranea. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1698 RA Jurka J.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 147-147 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 202..1509 FT /product="Mariner-2_SM_1p" FT /translation="MNTENLHSTYHLSNEERSLIFQDFLMNMSSRNPPKLK FT HGTINDISAKYNVSRASISKLWRNASEQLRTDEIVIDSSSMRINCGRKRKN FT YDDKLNDIKKIPLNRRGTLRSLSCASGIPKTTLFRNLKEFKLLRRVSSTVK FT PLLTDRNKIERLTFCLSYIKTDGCFDNFFNHVHIDEKWFYLSKVKRSYYLT FT MDEEIPHRTCQSKRFITKVMFLAAVARPRGDIGSENYFDGKIGIWPFVYKE FT EAKRNSKNRMKGTYVTKNIESINSKEYKKMIKQNVFPMIRAKFPRSNDIIY FT VQQDNAKPHFNEDDPDMLEEGSKDGFSIMFKNQPPNSPDMNVLDLGFFNAI FT QSIQHQHSPKTIDELISCVSNAFDELHPDKLDNVFLTLQQCMEETMLANGG FT NNYKLPHISKAKLRNNGTLPVSLTCKPEAILTARQNIDEQTL" XX SQ Sequence 1698 BP; 623 A; 269 C; 285 G; 521 T; 0 other; taaaataata ctccctctgt tccgaaataa tgtttacatt tgaaataaat ttttccgatt 60 atcaatgttt ttggtcaact ttaaacttta ttaatttata tagatagata atgatctagt 120 atggatggat tgacaaaaat ttccatgatt ataggttttt ttattgttaa attttaaata 180 ctcctaaaat acaaataaaa aatgaataca gaaaatttac attcaactta tcacttatct 240 aatgaagaaa gaagtttaat ttttcaagat tttttaatga atatgtccag cagaaatcct 300 cccaagctga agcatggaac cataaatgac atttccgcta agtacaacgt atcgagagct 360 tcaatttcga aactttggag aaatgcttcc gaacaacttc gaacagatga aatagttatt 420 gattcatcat caatgaggat aaattgtgga cgaaaaagga aaaactatga tgataaatta 480 aacgatatca agaaaattcc attaaaccgg cgaggaacgc ttcgaagttt atcatgtgcc 540 agtggtattc caaaaacaac attattcagg aatttgaagg agtttaaact attaagacgg 600 gtttcatcta cagttaagcc attactaaca gatagaaata aaattgaacg cctgacattt 660 tgtctgtcct atataaaaac agatggatgc ttcgataatt ttttcaatca tgttcatatt 720 gatgaaaaat ggttttatct ttcgaaagtt aagcgcagct attacttgac catggatgaa 780 gagattcctc atagaacatg tcaaagcaaa agattcataa caaaggtgat gtttttggca 840 gcagttgcta ggccccgagg tgatattggc tctgagaatt attttgatgg caagattggc 900 atctggccgt ttgtttataa agaagaagcc aaaagaaata gcaagaaccg aatgaagggt 960 acttacgtca ctaaaaatat tgaatcaatt aattcgaagg aatataaaaa aatgatcaag 1020 caaaatgtgt ttccaatgat cagagcaaag tttcctcgat caaatgatat catctacgtt 1080 cagcaagata atgcaaagcc gcatttcaat gaagacgatc cagatatgct cgaagaagga 1140 tcaaaagatg ggttttctat tatgtttaaa aatcaaccac ctaatagtcc cgacatgaac 1200 gtattagatc ttggattctt caatgccatt caatccattc agcatcaaca ttctccgaag 1260 accattgatg aattgattag ctgtgtgagt aatgcctttg atgagctaca cccagataaa 1320 ctggataatg tatttttaac actccagcag tgcatggagg agacgatgct tgcaaatggg 1380 ggaaataatt ataagttgcc gcatattagc aaagcaaagc tccgcaacaa tggtactcta 1440 ccagtatccc taacttgtaa accagaagcg attcttacgg ctcgccaaaa tattgatgaa 1500 cagaccctat aaaattgaat cctccctcct tgtcacgaat aaaatgtttt atctaataaa 1560 tattaacatt ttttacaata attattatga ctgtttgtta ataatgctat aaaatttttt 1620 aaatgtaaac attatttagg aacaaaaaat cagggtaata tgtaaacatt atttcggaac 1680 agagggagta ttatttta 1698 // ID APO1_AP repbase; DNA; INV; 519 BP. XX AC AF407668; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Acanthamoeba polyphaga isolate Apo1 repetitive DNA sequence. XX KW APO1_AP; Repetitive sequence. XX OS Acanthamoeba polyphaga OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RA Webb R.S., Garman C.G., McIninch P.S. and Brown L.B.; RT "Amoebae associated with ulcerative lesions of fish from tidal RT freshwater of the James River, Virginia."; RL J. Aquat. Anim. Health0-0 (2002)In press. XX DR Genbank; AF407668; Positions 1 519. XX SQ Sequence 519 BP; 95 A; 126 C; 168 G; 128 T; 2 other; tccatctcgc cgtggtcggc cagcgtctcg tacagcgaat cgaagcgcga ctgctcatac 60 tggctggcgc cgccgccgtg tcggcgcgtc ggcgcgtcgg ggtcatatcc gagggtgtct 120 gtgcagcatt cctcttacgt gagagaaaag cttcgccaat gggtgtggtg cctgtgtgtt 180 gtgtgtgtgt gtgtgtgttg tgtgacgcgt gtgtgacaag taccgtagta gtcactcttt 240 tctgtgctct gcctcgttat gaggtcccgc agcgccttgt gcttggatgg tgcggccttc 300 tggaagattt ccctcctgca gacgggcggg gtcaggggaa agaagccaaa agaaagaaag 360 aaagaaagaa aggcgcatca gtgtgtatgt ggagcgttac cagacgtcgt agagcgactt 420 gagcacgtct ttgtgtccga cagcggtcga ngncagcatg tccgcccgtg agcaccttgc 480 ctttcgcttg atgaactatt tcgacgacag tgagttgca 519 // ID BEL-591_AA-I repbase; DNA; INV; 6494 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-591_AA_; KW BEL-591_AA-LTR; Pao_Bel_Ele161; BEL-591_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6494 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX DR [2] (Consensus) XX CC Positions [5557-6105] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 512..1903 FT /product="BEL-591_AA-I_2p" FT /translation="MWVNECAEQKDDINVPVITDAGCLLPPAPSNQTKQNA FT ATGSCIPTQVELPDKENQQSPLKHPSMPNHRSTSAVNPTMDDSKTDPNPDE FT DATRHAVFSKITGNVPLAQSTPAKDPSFEQQAVVTASSVYPRAEEVANLAI FT APSGMPVHTYNSVQSAPEPATYSFSSSVPAARLMPPAIAIPQLAPIPEYSS FT ELQSSVAQTVPRIPGPSVQQHQHGSLASSHFSTSVLPPMLHATAMPPPVSA FT PYRPPSYPVFPPGYQAEPASVLAPAHGPVLAAPPVSSFSSTSAMSLLPPSA FT PVVPNAVPLREVSLPQPPGKPDNCPLPVNMSSASNDDNLQHLIRQFGSIAL FT SPSQSPLLNANLATFAPSPSQLAARQVMPRDLPPFSGNPADWPVFISSFMN FT TTLACGYSSAENLCRLQRSLKGAAYEAVQSRLLLPESVPHIMETLHLLYGR FT PELLIAALLDKVRSAPPPK" FT CDS 3634..6492 FT /product="BEL-591_AA-I_1p" FT /translation="MFHQILIRKEDRHSQRFLWRNDPSKTPDVYLMDVATF FT GSTCSPASAQYIKNRNAKEFQGLYPRAVEGIVDNHYVDDSLESYESAEEAI FT KVSQEMRFIHKEGGFELRNWLSNSRDVLDALGEGKPGEDKRFAADKQNDYE FT RVLGLLWLTQEDAFSFSTAMKQEISDLIRADEHPTKRQMLKCLMSLFDPLG FT LLSLFVVHGKILLQEVWREGIQWDEKVNEELHQRWKNWIRLFEEVRSLKIP FT RCYFSCATKQRYQDLQLHIFVDASESAYCAIGYFRTSNADGITECALVAAK FT TKVAPLKTQSIPRLELLAAVLGARLSQFIEENHTVRITRKVFWSDSATVLC FT WLRADHRRYKQFVACRVGELLTITDVENWRWVPTKHNPADIATKWGHGPDL FT TANSIWFKGPAFLGSSEPAWPEQKVNKPSTEEELRPCYTHKAARIPERLVE FT LERFSRLTKAIRATAYVYRFVENLKRKIARCERFVGPLSSEELQSAKNLLI FT RETQWQCFPDEMTVLKHNQSKSITEQIPLDKSSALHQLCPMLDEQAIIRVD FT GRIGAAPNVDKETKFPVILPRKHKFTMLLLDNYHRRFLHGNTETVVNEVRQ FT SYYIPRLRMAVNSAAKACQWCRIYKSLPKIPRMAPLGPLARMASFVRPFTY FT TGLDFFGPLMVKIGRSSAKRWIAVFTCLTIRAVHVEVAHSLTTDSCVKCIR FT RFICRRGAPAEIYSDNGTNFVGAARLLKEQEEQLAVTFTGTATKWVFNPPG FT TPHMGGAWERMVRSIKTAVETAYNNNRKLDDEALETFMVEAEAIVNCRPLT FT YLPLTSEESEALTPNHFLLGNSSGVRQPAVEATSSSDALRSSWHQVQQQLD FT VFWRRWIREYLPTLTKRPKWCGEARAIADGQLVLVIGEGNRNEWTRGRVVQ FT VIQGADGRIRQAIVQTARGLVRRPVARLAVLEVDEDSKTGPGGQCYGGE" XX SQ Sequence 6494 BP; 1737 A; 1649 C; 1650 G; 1455 T; 3 other; atctttagaa aaatcgacag cagtttcgat caacgcgtcg atatggaatc tattccacat 60 aaatcccttc cggatatgca cggagatcta acacggcatc ccgcacacac gctcgacgaa 120 aatcgtacag attgcgtaca gtgcgaccgt cgaaacggtg aggatgcaat ggtgcagtgc 180 gacgcttgtc agatgtggtg gcacttctcg tgtgccggcg tgacagactc gatcaaggac 240 cgatcgtggt catgcccaaa atgtcatgcg aacgatctgt gcagtagcgt gtccgtttcc 300 cgacattccc gaacctccaa cagttcggcg aaatccgaga gggtaaatct acagctggag 360 aagctcgcag agcttcacgc aatgaaggag aagttcatcg aggagaaata taagttgctc 420 gaaagtcaac tcgaatcgga acgagaccac gaaagtgttc ggagcaaacg aagccgagtt 480 aagcaagcaa accagcctcg cgaaggtagc gatgtgggtg aacgaatgtg cagagcagaa 540 agacgacatc aacgtgccag tgataacgga tgcaggttgt ttactgccgc cagccccttc 600 gaaccagacg aagcagaatg ctgctactgg aagttgtatt ccgacccaag tcgagttgcc 660 agataaggag aatcagcaaa gtcccctcaa acatccttcg atgccgaacc atcgctcgac 720 ttcagcagtc aatccaacga tggacgattc caaaacggat ccgaaccctg acgaagatgc 780 aacaagacat gccgtcttca gcaaaattac agggaatgtg cccttggccc aatctactcc 840 agcgaaagac ccttcctttg agcaacaagc tgtcgtaaca gcatcttccg tgtacccgag 900 agcggaagag gtggccaatt tggcgatcgc gccgtcgggc atgccggtgc atacgtacaa 960 ctcggtacaa tccgctccgg aaccagcaac atattcgttt tcctcatcgg tacctgcggc 1020 tagattgatg ccacctgcaa tcgccattcc ccagttggca cccattccag aatattcatc 1080 agaactgcag tcgtcggtag cacaaacagt gccaaggatt ccaggtccat ccgttcaaca 1140 acatcaacat ggaagtttag cgtcatctca cttcagcacg tccgtattac ctccgatgct 1200 tcacgctaca gcaatgccgc cgcctgtttc tgcaccatac cgtccaccat catatcctgt 1260 atttccaccg ggataccagg cagaacctgc atccgttcta gcgcccgctc atgggcctgt 1320 gttggctgcc ccgccggtga gctcgttttc atctacgtct gcgatgtcgc tcctgcctcc 1380 gtccgcgccc gtggtgccta atgcagttcc tcttagagaa gtatcactcc ctcagccacc 1440 aggtaagcca gataattgtc ctctgccagt aaatatgtca tcagccagta atgacgacaa 1500 ccttcaacac ttaataaggc aatttggtag tatcgcttta tcgccctcgc aatcaccatt 1560 gctcaatgca aatttagcca cctttgcccc ctctccctcg cagttagctg ccaggcaagt 1620 catgccgcgc gacttacccc ccttttccgg caacccagcc gactggccgg tattcataag 1680 cagttttatg aacactacac twgcttgtgg gtactccagt gccgaaaacc tttgccgcct 1740 tcaacgaagt ttgaaaggcg cagcgtacga ggccgttcaa agccgattac tcttaccgga 1800 gtcagtgcct cacatcatgg aaactctcca tttgctttac ggtcgcccag agctgctgat 1860 tgcagccctc ttagacaagg tgcgctcagc accaccacca aaaktggaca aatttcaaag 1920 tataatcgac tttggcatgt ctattcaaag tctctgcgat caccttgagg tcgctggaca 1980 aaccgcacac ctstcaaacc cttcccttct cgcggaactt gtcaccaaac tcccaccgca 2040 tcttcaaatg gaatggggat catatctcca gggattctcc gaggtgaatc tcaaaacgtt 2100 tggcctcttc atgtccagtg tagttaaagc agtgagtaaa gtgacagtgt atgctagcag 2160 cagcggaaga aatagtgtgc acccgaagtc aagaggaagt gtgaattcgc atctcagtga 2220 tactaacgac agtgcggaag ccaagccatc tactgttcga gaggaagtga aggagtgtcc 2280 cgcttgcaaa aacgttggtc atcgtatcca ggattgccga aattttcaat ccctttctgt 2340 ggatggccgt tggaagtgtg tccagttgaa caagctttgc cgaaattgtt tgagctcgca 2400 tggccgaaga agctgcagga aaaccagcaa ttgtggaaca aacggctgcg aataccgcca 2460 ccatccaatg cttcattcga cgcggagtac cacggattcg caggcgcagc catcagcgaa 2520 taacgcagag aaccttacgc atcgccagcc taaacagtcg ctgctcttcc gaatagtacc 2580 agtgaccctg catggtccca gaggaacagt ggacactttc gcgtttctcg acgacggatc 2640 atctttaaca ctcatcgagg atagcctggt taaagaacta ggtgccgaag gtgtaacgat 2700 gcctctgtgt ttgttgtgga cggctaatgt gacacgcaca gaaaaggtcg aagcaaatgt 2760 cgctggtcgt ctcatcgact gaaggtaggc agtacaagtt agatgaagta cagacagtta 2820 aggagttgtc gctgcccatt caatcgcttt cgtacgagca tctttctgcg cagtttgcgc 2880 acctcaaagg acttccagtg aagagctaca ctaaggcgat tccgaagcta ttaattggaa 2940 tcaacaactt ggatttgatg gttccgttga gagtacgaga aggacttcgg cgcgagccta 3000 tcgctgctaa aaccagatta ggctggtgta tttatggcgg tgcacaaacg atggtcaaag 3060 ctatcagtaa ctaccatgcc tgcggctgtt ctctgatcaa atctccacaa tctggtcaag 3120 gacttcttcg cgacgaggat gctggcgtgc aaccccagtg ctctcttgtc ggagtcgata 3180 agcgtgctct acggatatta gcggatacaa cggttcgaat cggagccgct ttgaaacagg 3240 tctgctctgg aaattcgacc atgttgagct acctgatagc tatggaatgg ctctccgacg 3300 gctccagtgt ctgaaaaacg aatgagtcgc aatcctgaac tcaaacactc tgcagcaaca 3360 actggaggac taccagataa aaggctacgc gcatcgtgcc acagaggacg agttggccaa 3420 cgccgactgc gacgcgtttg gtacctaccg ttgggagcaa ttacaatccc cggaagccag 3480 gcaaggttcg catgatctgg gatgctaggg cggccgtgaa cggaatctcc ctcaactgta 3540 ctgctgaaag gtccagatca gctgacttcc cttcccggag tgctggtacg attccggcag 3600 ttcaaagtgg gcgtctcctc tgatataaag gagatgttcc accaaattct aattcgcaaa 3660 gaagatcgcc actcccagcg gtttctttgg cgcaacgatc cttccaaaac acctgatgtg 3720 taccttatgg atgttgctac gttcggctct acatgctccc cggcgtctgc tcaatatatc 3780 aagaacagaa acgccaagga gttccaaggg ctataccctc gagcagtcga aggcatcgtg 3840 gacaaccatt acgtggacga ctccctagag agctacgagt ccgcagaaga ggcgatcaag 3900 gtgtcacagg aaatgcgttt cattcacaaa gaaggcggat ttgagctaag aaactggcta 3960 tcgaatagca gagatgtcct ggacgcttta ggggaaggaa aacccggaga agataagaga 4020 ttcgcagcgg ataaacaaaa cgattacgag cgggtactcg gactcttgtg gttgacgcaa 4080 gaagacgcct tcagtttctc aactgcgatg aagcaggaaa tcagtgatct gatcagggct 4140 gacgaacatc ctaccaaaag gcagatgctg aaatgtttaa tgtcgctgtt cgacccgctg 4200 ggcctattga gcctgtttgt agtgcacggg aagattcttc tacaagaagt ctggagagaa 4260 gggatacagt gggatgaaaa ggtgaacgaa gaactgcatc aaaggtggaa gaattggata 4320 aggctattcg aagaggttcg cagcctcaaa attcctcggt gttacttctc atgtgccacc 4380 aaacagcgat accaagatct ccaactccac atatttgtag atgccagcga atctgcgtat 4440 tgtgcgattg ggtattttcg tacgtcgaac gctgatggaa tcaccgaatg cgcgctcgtc 4500 gcagcgaaaa ctaaggttgc ccctctgaaa acgcaatcaa tcccccgcct ggaactgctg 4560 gccgcagtgc taggcgctcg tctgtcgcag ttcatcgaag aaaaccatac cgtacgcatt 4620 acaagaaaag tgttttggag cgattcagca accgtcctct gctggctgcg ggctgatcat 4680 cgacggtata agcaattcgt cgcatgtaga gtgggagaat tgctgacgat caccgatgtg 4740 gaaaattggc gttgggttcc caccaagcat aaccctgctg acatcgctac aaaatggggc 4800 cacgggcctg acctaactgc gaactccatt tggttcaaag gtcctgcgtt tcttggatcg 4860 tcggaaccgg catggccgga acagaaagtg aacaagccat caacagaaga agagctacgt 4920 ccctgttaca ctcacaaggc agcacgtatt ccagaacgtc tcgtagaact tgagcgcttt 4980 tcccgattaa ctaaagcaat ccgggctacc gcttacgttt accgtttcgt ggaaaaccta 5040 aagcggaaga tcgcacgatg tgaaagattc gttggtccac tatcatcgga agaactccaa 5100 agtgccaaaa acctacttat tcgagagaca cagtggcagt gcttcccaga tgagatgacg 5160 gtcctgaagc acaatcaatc gaagtcgata acggaacaaa tacccttgga taaaagcagt 5220 gctcttcatc aattatgtcc aatgctagat gagcaagcca ttattcgcgt cgatggtaga 5280 attggagcgg ctccgaacgt tgacaaagaa acgaagttcc ctgtgatcct cccaaggaaa 5340 cacaaattta cgatgctgtt gctggacaac taccatcgta gattcctgca cggaaacact 5400 gaaactgtgg tgaacgaggt gcggcagtca tactacatcc cacggctcag gatggcggtc 5460 aatagcgcag ccaaggcgtg tcaatggtgt cgtatctaca agagcttacc gaaaattccc 5520 cgcatggcgc ctttgggacc attagcacga atggcgtcct ttgtaagacc gttcacctat 5580 accggcttag atttctttgg tccgctaatg gtaaagatag ggaggagtag cgcaaaacgc 5640 tggatagccg tttttacgtg cttaacaata cgagcggtgc acgtagaagt tgctcatagc 5700 ctgactactg attcgtgtgt caaatgcatt cgccgcttca tctgccgcag aggagcacca 5760 gccgaaatct attcggataa cggcactaat tttgtgggag ccgctagatt gctgaaggag 5820 caagaggagc agttggccgt tacctttaca ggaacggcaa ccaaatgggt gtttaaccca 5880 cctggcactc cgcacatggg cggagcatgg gagcggatgg tgcgctccat caaaactgct 5940 gtagagactg cgtacaacaa taaccgcaag ttggacgacg aggcgttgga aacatttatg 6000 gtggaagctg aagcaattgt gaactgccgt ccattaacgt atctgccact gacgtcggag 6060 gaaagtgagg cacttactcc gaatcacttt ttgctgggca attcaagtgg cgtccgtcag 6120 cctgcagtag aagcaaccag ttcgtcggat gccctccgtt catcgtggca ccaagttcaa 6180 caacagctgg acgtattttg gagaaggtgg ataagggaat atctgcccac gttgacgaaa 6240 cggcctaagt ggtgtggtga agcacgagcc atagctgacg gacagttggt gctggtaatt 6300 ggcgaaggaa acagaaacga gtggactaga ggccgcgtag ttcaagtgat acaaggggcg 6360 gatgggagaa ttcggcaagc tattgttcaa actgcgaggg ggcttgtgcg tagacctgtg 6420 gctaggttgg cagtactaga ggttgacgaa gatagtaaaa ctggacctgg tggccagtgt 6480 tacgggggag agga 6494 // ID Gypsy-10_AA-I repbase; DNA; INV; 4129 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_AA_; KW Gypsy-10_AA-LTR; Gypsy-10_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4129 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 989-989 (2011). XX DR [2] (Consensus) XX CC Positions [3141-3617] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 63..4058 FT /product="Gypsy-10_AA-I_1p" FT /translation="MTSENKSNPFQGSSVQKVAMLGRMDPFIPGEDFEVYM FT KRLKMYFIANGIPDSLKAAVLITVMGNETFQILDSLFSPEDPCEKTFDQIV FT DKLKNQFKPMVIVAAERYKLYTRKQKVNEPNAEYVVALKHLTQSCNFGTFL FT ADALRDAFIIGIQNQKIRRRLLSEELCQAYKIASSMELANVEETKLNVDNG FT LNHVVYRKKNVTSESVKRGGGITKQTIDQQKVKHCTRCGRSNHVNNKCPAA FT NVMCYKCRLRGHFANQRRTRNVHSLQENDGDGSDDEIVDVISWVAEGNDPL FT HVPVHVEGKEVIMEIDSGACKSIISEDEYREKFNHIKLKQGSKSFQVITGQ FT KFRPLGVGEFRVSLPNTKNMIKLEMSVVQSEFKFKPLLGRSWLNALYPGWK FT SRLVTRENNDLHLIDNVMNEAVKTDTEEMNSNFLKVLRDTYPQAFSEDSSV FT PIKEFEVDIIMKEHTPVFHKAYELPFKMRESVEEELNRMVDGGILKPVKHC FT RYASPIVAVPKSDGKSIRICVDCKRTINPFVQTEHYPLPRIDDILASFPGC FT KVFCVLDLKGAYQQLALSEESKKYLTINTHKGLFQFQRLCYGVSSAPSIFQ FT QVIDQILRGLKFVKSYIDDTLIGGKNYADCLNNLKLVLERLNMYNVHVNIA FT KCSFFLPQVDYLGHTLSEKGISPNNDKIKAIVHAPSPTNNSQLLSYLGLLN FT YYSRFLPNISSLLRPLYDLLKKNASFRWTEVHQKAFEDSKQLLLSNSLLEM FT YDPNKKLIVSCDASPYGVGAILSHEINGELKPVIFASSTLSPAEQNYSQLH FT REALAIMFAINRFHKYIYGYQFELHTDHEPLQAILNPKKCKNAIAVARLQR FT WAVQLSMYNYVVKYKSSMKMRHVDALSRLPLMDETKVDFVKSLNISEEVPI FT DLEMIRECSKTDPLIIKLIKLCQSGWTKMNHADPELKYYQKLQHCLSVEDG FT CLVYRDRLVIPDQVKTNILRLLHEGHIGIVRTKMLARQYVWWRNIDSDIEE FT FINECSVCQQTQKKTPSVTVPWPKPSAALERVHVDLFYFGKFTFFIIVDAF FT SRWIEVFRLNTSNAKTIIDYFRKFIVTFGLPKEIVSDNGPPFNAYQFIDFC FT NRQGIKVTKSPPYHPESNGLAERGVQTVKNTLKKCLLDLRYRTLTIDEQID FT NFLFKYRNTPNAVSGFSPSEVMFKFKPKTLIDKLSIKQQCNPKTDYKTNNE FT YVSKENIKSFKKGEKIMYMNHFKSFCKWIPGRIIDKLSRTLYRIEINDNVR FT IVHVSSIRKSKLSDKYHPNVSTSSQVETRLPQVIIKKKIIKKNQNMQSKWS FT DFSKLRRSQRKRKPPVFYKC" XX SQ Sequence 4129 BP; 1400 A; 696 C; 864 G; 1169 T; 0 other; gttttggcga cgaagtggat taatcggtcg tgtgataagt cacgtgtgtc gtgcaagaaa 60 caatgacgag tgaaaataaa tcgaacccgt tccaggggtc tagtgtacag aaagtggcga 120 tgttaggccg gatggatccg tttattcctg gcgaagattt cgaggtgtat atgaaaagat 180 taaaaatgta cttcattgct aacggtattc cggatagttt gaaagcggca gttttaataa 240 ctgtgatggg taacgaaacg ttccaaatac tagattcgtt gtttagtcca gaagatcctt 300 gcgagaaaac attcgatcaa attgtcgata aattgaagaa tcagttcaaa ccgatggtga 360 tagtagcagc tgaaagatac aagttgtata cgcgaaaaca aaaagtgaat gaaccgaacg 420 cggaatatgt cgtggctcta aagcatctca ctcagtcgtg taactttggt acatttcttg 480 ccgatgctct ccgtgatgcg ttcataatcg gtattcagaa ccaaaaaatt cgaagacggc 540 tactctctga agaactttgt caagcataca aaatagcatc tagtatggag ttagctaatg 600 tcgaagaaac gaaacttaat gttgataatg gactgaacca cgtggtgtac agaaagaaaa 660 atgttactag tgaaagtgtg aaacgtggtg gcggtataac aaagcaaaca atagatcaac 720 agaaagtgaa gcattgtact cgatgcggga gatctaacca tgtgaacaac aaatgtccag 780 ctgctaacgt gatgtgctat aagtgccgtt tgagaggtca ctttgcgaac cagcgcagaa 840 cgcgaaatgt ccattcgctt caggaaaatg acggagatgg gagtgatgat gaaatcgtag 900 acgtgatcag ttgggtcgca gaaggtaatg atccgttgca cgttccagtt cacgtagaag 960 gtaaggaggt tattatggag attgatagcg gagcgtgtaa gtcaattatt agtgaggatg 1020 agtaccgaga aaagtttaat catatcaagt taaaacaagg atcgaagtca tttcaagtta 1080 ttacagggca aaaatttagg ccattaggag tgggagaatt cagagtcagt ttgccgaata 1140 cgaaaaacat gatcaagtta gagatgagtg ttgtgcagtc tgagttcaag ttcaagccac 1200 ttttaggccg gtcttggtta aatgcgttgt accctgggtg gaaaagcaga ttagttacac 1260 gtgagaacaa tgatctacat ctgattgaca atgttatgaa tgaggcagtg aaaacggaca 1320 cagaagaaat gaatagcaac ttcctcaaag ttttacgaga tacttatccc caagctttta 1380 gcgaagacag tagtgtgccg ataaaggaat ttgaggtaga tattattatg aaagaacaca 1440 ctccggtctt ccacaaggcg tacgaactgc cttttaaaat gcgggaatct gtggaagaag 1500 aattgaatag aatggtagat ggtggaattt tgaagcccgt gaaacactgt aggtatgcaa 1560 gcccaatagt agcagtcccc aaatcagatg gtaaatcgat tcgaatttgc gtagattgta 1620 aacgaaccat taaccctttt gttcaaactg aacattatcc acttccccga atcgatgata 1680 ttttagcgag ttttccaggg tgtaaagtgt tttgtgtcct tgacctgaaa ggcgcatacc 1740 agcagctagc tttatccgaa gaatcaaaaa aatatttaac tatcaacacc cataaaggtc 1800 tgtttcaatt tcagaggctg tgttacggag taagtagtgc tccttcaatt tttcaacaag 1860 tcatagatca gattttaaga ggattgaaat tcgtaaaatc ctacattgat gatacgttga 1920 taggtggtaa aaattacgca gattgcttga ataatttaaa actggttttg gaacgtctaa 1980 acatgtacaa cgtacatgtg aacatagcga aatgtagttt ttttctgccc caagttgatt 2040 atctcggaca cactctaagc gaaaaaggca taagtcccaa taatgacaaa atcaaagcaa 2100 ttgtacatgc tcctagcccg acaaacaact ctcaactgct atcctattta ggactactga 2160 attattattc acgatttctt ccaaacatta gcagtttatt gaggccactg tacgacctac 2220 tgaaaaagaa tgcctctttt cggtggactg aagtacatca aaaagcgttt gaagatagca 2280 aacaactttt gctgtcgaat agtttgttgg aaatgtatga tccaaataaa aagttgatcg 2340 tttcttgtga tgcttctcca tatggagtag gagcaatttt gtctcatgaa attaacggag 2400 aactcaaacc agtaattttt gcttctagca cattatcccc cgcggaacaa aattattcac 2460 agttgcatag agaagcacta gcaataatgt tcgcgattaa tcgttttcat aagtatattt 2520 acggatacca atttgagtta cacacagacc atgaacccct acaagctatt ttaaatccca 2580 agaaatgtaa aaatgcgatt gcagttgcga gattacaaag gtgggcagtc caattatcta 2640 tgtataatta tgtggttaaa tataaatcct ccatgaaaat gcgacacgtg gatgcgttat 2700 caagattgcc attgatggac gaaacgaagg tagattttgt taaatcatta aatatctcag 2760 aagaagtacc tatcgattta gaaatgatcc gtgaatgttc caaaacagat ccattgataa 2820 ttaaattgat aaaattgtgt caatcgggat ggacgaaaat gaatcatgcc gatccggagt 2880 tgaagtatta tcaaaagctt caacattgtt tatcagtaga ggatggttgt ttggtttatc 2940 gagatcgatt agtgattcct gatcaagtta aaacaaatat tttgcgtttg ttacacgaag 3000 gccacattgg aatagttcgt acaaaaatgc tggctaggca atatgtttgg tggagaaata 3060 tcgatagcga tattgaagag ttcatcaatg aatgttcggt ttgccaacaa actcagaaaa 3120 aaacaccttc tgttactgtg ccttggccta agccatcggc cgctttagaa cgagtgcatg 3180 tagatctgtt ctattttgga aaattcacat ttttcattat tgttgacgct tttagtcgtt 3240 ggatagaagt ttttcgtctg aatacatcta atgcaaaaac gattatagat tatttcagaa 3300 aattcattgt aacgtttggg ttgccaaaag aaattgtgtc ggacaatggg ccaccgttca 3360 atgcctatca gtttattgat ttttgcaacc gacaaggcat aaaagtcacg aaatcacctc 3420 cgtaccatcc cgaatcgaac ggattagccg aacgtggagt tcagactgtt aaaaatacgt 3480 taaaaaagtg tttacttgat ctcagatatc gtacattaac aatagacgag cagatagaca 3540 actttttgtt caaatacaga aatactccta acgctgtctc tggattctct ccttctgaag 3600 taatgttcaa atttaagcct aagacactaa ttgataaatt gtctattaaa cagcagtgta 3660 acccaaaaac tgattataaa acgaataatg aatatgtatc aaaagaaaat attaaatcgt 3720 ttaagaaagg agaaaagata atgtatatga accattttaa atctttttgt aaatggattc 3780 ccggtagaat tatagacaaa ctctcaagaa cgttgtaccg aattgaaatt aatgataatg 3840 taagaatagt tcatgtttca tccatacgaa aatctaaact cagcgataaa tatcacccca 3900 atgtttctac tagctctcaa gttgaaactc gattaccgca ggttatcata aagaagaaaa 3960 tcattaagaa aaaccaaaac atgcaatcaa aatggtcaga tttttcgaag cttagacgat 4020 cgcaaagaaa aagaaaacct ccggtttttt ataagtgtta ggaaaactaa aaggggagaa 4080 gtgtagtgta gagtcctacc tttcttgtag aagactatct tgcaaatgt 4129 // ID Gypsy-242_AA-LTR repbase; DNA; INV; 148 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-242_AA_; KW Gypsy-242_AA-I; Gypsy-242_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-148 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1086-1086 (2011). XX DR [1] (Consensus) XX SQ Sequence 148 BP; 43 A; 32 C; 22 G; 51 T; 0 other; tgtagcagag cagtaggctt tgctaacctt atgtcattat gttaaattct gttctttgat 60 tattatcagt tagacctcca aagaagaagc acacaaatat actttgttcc ttacaaatac 120 ctcgtcttcc ttcaatccgg agtctaca 148 // ID BEL-130_AA-I repbase; DNA; INV; 6169 BP. XX AC supercont1.259; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-130_AA_; KW BEL-130_AA-LTR; BEL-130_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.259; Positions 1321591 1327759. XX CC Positions [5196-5753] - Integrase core CC 'CTTGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 48..6167 FT /product="BEL-130_AA-I_1p" FT /translation="MPETDRFDCQMCDLANNIDDMVQCEGCSKWSHYGCVG FT FDDGKKEEHWKCHKCANSSSIGEAHGNSSAQVSGGQQQINVPLRGGDSISE FT LARLNLELLEERKAFLMKEIEMQQAAQLEQRKLQLEKEAWKVRYDIVNARS FT DATNGEGNTGGLGDWITRLNQVAVSQNQQISAPASTVTVPTTISGTRPLHQ FT ATGGMLSAMQTTTSLFGCDLGHVSQPAPSGISTQANLQTSHQATSTHPMGQ FT ATSFMHDFQPGGEANRPGSSTPITTANWVNPVMGPTSSTQCLPQSISSVGQ FT FASYPMQSGYVSQPYQVTSSISNYVPVGQVACSQYANCSVGPVYPQINTIN FT AHSSYAPFPLPMSLPNSNPHISQAPSAAQNVSTSHIDQSAPTSRQLAARHV FT MPKELPLFSGNPEEWPMFYSAFNTSTEACGYSNVENLGRLQRCLKGGALEA FT VRSRLLLPTSVPQVMQTLQMLYGRPEQIIYALLQKVREVPAPKADNLCTIV FT AFGMAVQNFCEHLEAAGQVLHLSNPVLLQELVDKLPANLKLEWVTFKRQFL FT AVDLRVFSRYMANLVSAAAEVTLTLDPRGAKPKKDEKQKGFVNAHATSTGA FT ASPEKPKGQYAKAASSIACLICGDQEHRVKECEVFKKMDRDDRWKAIHNHF FT LCRICLGKHGRKPCRSSARCNVQGCQLRHHPLLHSDSERPPTAVKPTPQGN FT ETVPRRNPTEGVNAHHTQSSTLFRMIPVRLFSKGRSVETLAFIDEGSSVTL FT LEKSIADALQAEGTEMRLCLTWTSNISREEEGSHQVEVDISSIGGGKRYPL FT KDVRTVESLALPVQTLRYKELADRFEHLRKLPVTDFESRAPGLLIGAKNTH FT ITATQQLREGREGEPIAAKTRLGWAIYGSMPNGSNIASCNLHICGCDFDKT FT LHELVKQYFTVENVGVSVDRSPDSDENKRARALLEKTTRRIEGGFETGLLW FT RQDIVEFPDNYAMAVRRLRCLERRFNTNPALFEKVQRQIEQYQKKGYIHEA FT AEDELAATDPRRLWYLPVGIVQNPKKPNKIRIVWDAAATVDGISLNSMLLK FT GPDLVQPLPEVLCGFRERKIAVVGDIMEMFHQLKIRRDDRYSQLFLWPSES FT GKTPRTFVIDVATFGSTSSPCSAQFVKNLNATEHAVEYPRAAEAIVRRHYV FT DDYLDSFDSEDEACQVVEEVKLVHQRGGFTIRNWLSNSATVLQRVGETDAG FT TIKIVSDGGSQIERVLGMQWKAAEDVLVFSSEVDVETVVPTKRGILRCVMS FT QFDPLGLLSHFLIHGRVIIQDIWRTKAGWDDTVDEQILERWRSWTAKFKEL FT EEVRVPRAYFPGVRASDIKDLQLHVFVDGSETAYACVAYFRATIHGEFRCA FT LVGGKAKVAPLKTLSIPRLELQAALIGCRLMKTLCHSHSLPISKRVFWTDS FT KTVLSWIHSDHRRYRQFVACRIGEILSKSDPIEWRYVPSKVNVADDGTKWT FT SQGASGPNLKADSRWFKGPEFLWKHEESWPLQAQPEATNVEARPCHVHLTT FT ENRELFLEWNRFSKFERLRRTVAYIYRFVDNCRRKKMKHPLQNSHVTHEEL FT TKAENALWRLVQADTFPEEVSQLQKAKKGLKLKVNRASPIYKLSPFIDECG FT VVRMDTRISGAIFLPFDTKFPIILPKGHRLTRMVVEWYHRYYLHANNETVI FT NEVRQRFHVPCLRALVRNLSRSCVQCRISKAKPLCPREAPLPEARLSPYIR FT PFSFVGLDYFGPIQVKVGRSLVKRWVALFTCLTIRAVHLEVVHSLSTEACK FT MAIRRFIVRRGSPLEIRSDNGTNFQGASHELQQQIRDMNQQLSAVFTNAAT FT KWVFNPPSAPHMGGSWERLVRSIKVAFSALTTSRNPDDETLLTLMIEAEGI FT VNSRPLTNVPLETDTQAALTPNHFLLLSSQGVTQPPMTIPERPESLRTNWR FT LTTNLVNQFWNRWVREYLPTIAGRTKWHEDAKEPNVGDLAVIVDPSVRNGW FT LRGRILSVVKGRDGRCRQVLVKTSGGVLRRPVTKVAILDIKAANNDEGQGN FT AVPSEALDVHYGSG" XX SQ Sequence 6169 BP; 1640 A; 1554 C; 1681 G; 1294 T; 0 other; caaactcaaa gatttgttga gaaccacgtc ggaacgggat atacaggatg ccggagaccg 60 atcgcttcga ctgccagatg tgcgatctgg ccaacaacat cgacgacatg gtgcagtgcg 120 aagggtgcag caagtggtca cactacggct gcgtcggctt cgatgatggc aagaaggagg 180 aacactggaa gtgccataaa tgtgccaact cttcgtctat cggcgaagcc cacggaaact 240 ccagtgctca agtcagtggc ggacagcagc agataaacgt tcctctccgg ggaggggact 300 cgatcagtga gctcgctcga cttaacctcg aattgctaga ggaacggaag gcctttctga 360 tgaaggagat cgagatgcag caagcggccc agctggaaca gcgaaaactg cagttggaga 420 aggaggcatg gaaggtacgc tacgacattg taaacgcaag gagtgatgca accaatggtg 480 aagggaacac cggtggtctt ggagactgga ttactcgatt gaaccaggtc gctgtcagcc 540 agaatcagca aatttctgcg ccggccagta cggtgacggt tccgactacg atttccggga 600 caaggccgct gcatcaagcc actggaggaa tgctatcagc catgcaaaca accacgtcgc 660 tcttcggctg cgacttaggt cacgtttccc agccagctcc cagtggtatt tcgacacaag 720 caaacctgca aacaagccat caagctacaa gcactcatcc gatgggacaa gctaccagtt 780 tcatgcacga tttccagcct ggaggagaag caaaccgtcc gggatcgtcg acaccgatca 840 ccacagcgaa ttgggtcaat cctgtgatgg gtcccaccag ctctacccag tgtctgccac 900 agtcgatcag ctcagtcgga cagtttgcca gctacccgat gcaatcgggt tacgtaagcc 960 aaccatacca ggtaacaagt agcataagca attacgtacc agtaggtcag gtagcttgtt 1020 cgcagtatgc aaactgtagc gtagggccag tctatccgca aatcaataca ataaatgcac 1080 actcgtccta cgctcccttc cccctcccga tgagccttcc caattccaac ccacacattt 1140 ctcaagcacc atccgctgct caaaatgttt ccacgtcgca tatcgatcaa tcagcaccta 1200 cttctcgtca actcgcggcg cgccacgtta tgccgaaaga actaccgttg ttcagtggca 1260 atccggaaga gtggcccatg ttttacagtg cgtttaacac ctccacggaa gcatgcggct 1320 acagcaatgt ggaaaatctc ggaagactcc agcgttgctt gaagggcggt gctttggaag 1380 cagttcgcag ccggttgctg ttacctacgt ccgtgccgca agtgatgcag acgctgcaga 1440 tgctgtatgg acgaccggag caaattattt acgcgctgct ccagaaggtg cgagaagttc 1500 cggccccgaa ggcggataat ttgtgtacga tagtggcctt tggtatggca gtgcagaatt 1560 tctgtgagca tttagaggcc gcaggacaag tgctacacct atcaaacccg gtgctactcc 1620 aagagcttgt ggataagctc ccagcgaact tgaagctcga gtgggtaacg ttcaagcgcc 1680 agtttttggc cgtagatctc cgtgtgttta gtcggtacat ggcgaatctg gtgtcggcag 1740 cagctgaagt caccctcacg ttggacccga ggggggcaaa gccgaagaag gacgagaagc 1800 aaaaggggtt cgtcaacgcc cacgctactt caaccggtgc agcttcgcct gagaagccaa 1860 aaggacagta tgcaaaggca gcttcatcga ttgcatgttt gatctgtggg gatcaggagc 1920 atcgagtgaa ggagtgcgaa gtcttcaaga agatggatcg cgacgatcgt tggaaagcca 1980 ttcacaatca ttttctttgc cggatctgcc taggaaagca tggaaggaag ccgtgtcgat 2040 cgtcagcgcg ttgtaatgtg caagggtgcc aacttcgtca tcatccgctc ctgcacagcg 2100 attcggaacg tccgccaacc gcagtcaagc cgacaccaca aggaaatgag acagtgccac 2160 gaagaaatcc caccgaaggc gtgaacgctc atcacaccca gagctctacg cttttccgca 2220 tgataccagt gcgactcttc agtaagggac gaagcgtaga aaccctagcg ttcatcgatg 2280 aaggctcatc cgtaacgctc cttgaaaaga gcatagccga tgctctacag gccgaaggaa 2340 cggaaatgcg tctttgcctg acctggacga gcaatatcag tcgcgaagaa gaaggttcac 2400 atcaggtgga ggtggatatt tctagcatcg gaggcggtaa aagatatccg ctcaaagacg 2460 tcagaaccgt agaatctctg gcattaccgg ttcaaacact gcgctacaaa gagctcgctg 2520 atcgttttga gcatctacgg aagttgccgg tcacggattt cgaatccaga gctcctggat 2580 tactaattgg agcaaagaat acccatatca ccgccactca acaactacga gaaggacgag 2640 aaggagagcc tatagcggcg aagactcgtc tcggatgggc aatctacgga tcaatgccga 2700 atgggtcgaa catcgcgagt tgtaacttgc acatctgtgg ttgcgacttc gacaaaacac 2760 tgcacgaact agtgaagcaa tacttcacgg tggaaaatgt aggagtttcg gtggatcgga 2820 gtcctgattc ggacgaaaac aagcgggcca gagcattact cgagaagacg actagacgaa 2880 tcgaaggcgg tttcgagacc gggttgctgt ggcgccaaga catcgtggag tttcccgata 2940 actatgctat ggcagtgaga aggctacggt gtctggaaag gcgcttcaac acgaatccag 3000 ccctattcga gaaagtacaa cgtcagatag agcagtacca aaagaaaggg tacatccacg 3060 aggccgcgga ggatgagttg gccgccaccg acccaagaag attgtggtac ctcccagtcg 3120 gaatagtaca aaacccgaag aagccaaaca agatccgaat cgtgtgggat gcggcagcaa 3180 cagtagacgg catttccttg aacagcatgc tgcttaaagg tccggattta gtgcaaccac 3240 tcccggaagt gctctgtgga tttcgggaac ggaagatcgc cgtagtgggg gacatcatgg 3300 aaatgttcca tcagctcaag atccgacggg atgaccgata cagccaacta tttctttggc 3360 caagtgaaag cgggaaaact ccaaggacct ttgtaatcga cgtagcgacg ttcggctcaa 3420 caagctcacc atgttcggcg cagtttgtaa agaacctgaa tgcgaccgaa cacgcagtgg 3480 agtacccaag ggcagcggag gcaatcgtgc gtcgacacta tgtcgacgat tacctggata 3540 gctttgacag cgaggatgaa gcatgtcagg tggtagagga ggtcaaactg gtgcaccagc 3600 gaggcggatt cactattcgc aactggctct caaactcggc tacagttctg caacgggtcg 3660 gggaaaccga tgccggtacg ataaagatcg tgagcgatgg tggttctcaa atagagcgcg 3720 tgctgggcat gcagtggaag gcagccgagg atgttctcgt attctccagt gaggtcgacg 3780 ttgaaacagt agttcccacc aaaagaggca tcctgcgatg tgtaatgagc cagttcgatc 3840 cgttgggact tctttcgcac ttcctgatcc atggccgagt tattatccag gacatctggc 3900 gtacgaaagc tggttgggac gatacggtag acgagcagat tctggagcga tggcgttcat 3960 ggaccgcgaa attcaaagag ctggaagaag tgcgagtacc gcgcgcctat tttccgggtg 4020 tgagggcatc cgacattaag gacctgcaac tccacgtctt cgtcgatggt agcgagactg 4080 cctatgcgtg cgtcgcctat ttcagggcaa ctattcacgg agagtttcgc tgcgcactcg 4140 tcggagggaa agcaaaagta gcaccgctga aaacgctctc cattccacgg ttggagcttc 4200 aggcggcgtt aataggctgt aggctaatga aaacactctg ccatagccac agtctaccga 4260 tatcgaaacg agtattttgg accgattcga aaacagtcct ctcttggatt cactccgatc 4320 atcggcgcta tcggcaattc gttgcctgta ggataggcga aattctctct aaatccgatc 4380 caatagaatg gcgctatgtt ccgtcgaaag ttaatgtggc cgatgatgga acaaagtgga 4440 ctagccaagg ggctagtggc ccgaacctga aagccgacag ccgttggttc aaagggccgg 4500 agttcctatg gaaacatgaa gaatcctggc cacttcaggc tcaaccagaa gccacaaacg 4560 tcgaggctag gccctgccat gttcacctta caacagagaa cagggaactg tttctcgaat 4620 ggaatcgttt ctcgaagttt gaaaggctta gaagaaccgt cgcctacatc taccggttcg 4680 tagataactg tagacgaaaa aagatgaaac atccgctaca gaattcgcat gttacccacg 4740 aagagctaac caaagccgag aatgcgctct ggcggctggt acaagcagat acgtttccgg 4800 aagaggtgtc tcaactccag aaggcgaaga aagggctgaa gttaaaagtt aatcgagcta 4860 gcccaatcta caagttgtcg ccgttcattg acgagtgcgg cgtggtgcgg atggacacaa 4920 gaatttccgg agcaatcttc ctgccgttcg acacgaagtt tcctataatc ctccctaaag 4980 gacatcgtct gactaggatg gttgtcgagt ggtatcatcg atactaccta cacgctaata 5040 acgaaaccgt tatcaacgag gttcgtcagc ggttccacgt tccctgcctc cgtgcgctgg 5100 tacggaacct gtcaagatcc tgcgttcaat gtaggatttc caaggctaaa ccattgtgtc 5160 ctcgcgaagc acctctacct gaggcgcgct tgagcccgta tattcgacct ttcagctttg 5220 tcgggctgga ctacttcggt cctattcagg tcaaggtcgg acgttcgctg gtgaaacgtt 5280 gggtcgcact atttacatgc ctcaccattc gggccgttca cctcgaggta gtgcattccc 5340 tatccactga ggcgtgtaaa atggccatac ggcgatttat cgtacgtcga ggatcaccat 5400 tggagatcag gagcgacaat ggaaccaatt tccaaggagc cagtcacgag ttacagcagc 5460 agattagaga catgaatcaa caactttccg ccgtcttcac caacgcagcc acgaagtggg 5520 tttttaaccc accgtcagcc cctcacatgg gtggctcatg ggagcgacta gtcaggtcta 5580 taaaggtcgc gttctctgcc ttgacgacta gcagaaatcc ggacgacgag acactgctta 5640 cgctgatgat tgaggcggaa gggatagtga actctagacc actaacgaat gtaccactgg 5700 agacagatac ccaggcagcc ttgactccaa atcatttttt actcctgagc tcgcagggag 5760 tcacacaacc tccaatgacc atcccagaac gcccagagtc cctcaggact aactggcggc 5820 tgacgacgaa cctggtaaac caattttgga atcgatgggt acgggaatat ctgccaacta 5880 ttgctggtcg tacgaagtgg catgaagatg cgaaggagcc gaatgttgga gacttggcgg 5940 tcatcgtgga cccgtcggtc cgaaacggat ggctgcgtgg gcgtattctc tcagttgtaa 6000 aaggacgaga cgggcgatgc aggcaagtcc tagtgaagac atcaggagga gtactgcggc 6060 gtccggtgac gaaggttgcc atcctggata tcaaggcagc aaacaacgat gaaggtcagg 6120 gtaacgcagt accatcggaa gcgctggacg tgcattacgg gtcggggga 6169 // ID BEL-123_AA-I repbase; DNA; INV; 5559 BP. XX AC AAGE02026663; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-123_AA_; KW BEL-123_AA-LTR; BEL-123_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5559 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026663; Positions 6411 853. XX CC Positions [4595-5176] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 203..5557 FT /product="BEL-123_AA-I_1p" FT /translation="MAAERKASILSFQRTLQALKQRANKILTFVASFKDGD FT DPLQLEIRRTGLEDMRESFFETETKLYGLIKDDEVSAVELAAEEYYDLLGE FT ITYQIAAKLKPLSNPSEAKPHLDTSVSQKPFPDSSIKLPDITLPRFSGHYE FT EWIYFRSQFNLLIRRNESLNDQQRLHYLRSCLTGEAAGIETPEESFSSLWK FT ALETRYENRRWLVDRHLAEIFQLKSIPCESSTALRDLVNVVQKNLRALSSL FT KLNLDTLSESMVVHIVASKLDKQTHKDFQSQIVGTNLVKWQEMVEFLQNRC FT RILENLEQDNKLASRVVSKSVGTKQFPKVFVSSKEDDSKRRFACFHCSGGH FT YINECRSFLAKSPSQRFQRVKELRLCVNCFSNRHVVADCKSSSCKDCGQRH FT NTLLHFEPKASSNLSGRPKINQNTRSDNGGNESAPEPGPSRVELCAVTEVV FT DVIESTNMVETIVPEPDAFRTPSSLSSTRATKRVQLITATDQALLHTAIVN FT VRGEDGELQECRAVLDSCAMSTFMTSACVQKLRLKAFPSDVSVVGFGGAGR FT KIAEAVVAHVSSKCSYFDSISEFLVTPNITTKLPLRPFKYSHWNIPEWIDL FT ADPNFNQPARVEILFGILDWDKMMLSRSYKLADSLPTLRQTVFGWVAGGPV FT MERSSYPIVQALPITNEQLDDQLSKFWELEAYGQERIYCSDEQKAEEHFSS FT THYRDESGRYVLALPFKDTDVKLGDSKHMAHRRFLLLEKRLEKDEHLYSEY FT RKFMQEYLDMEHMEKVGTFSPVEQSTEECYFIPHHAIQRPDSSTTKVRVVF FT DASAKSSNGVSLNDLLLVGPTLQPHLVNILLAFRVHKVVATTDVSKMFRQI FT SIREEDRRFLHILWRSSPEEEISVYRLTTVTYGTACAPFQATRTLLQICDD FT EAEKFPLAAQFGKKSVYVDDALFGAETVEEVQKMRKQFEAMLATAGLELHK FT WCSNREEVLADLPTEKLEQKVLFSEEGRTKTLGLTWQPTQDIFSFQIPSAA FT FSEGVPSKRKVLSDISKLFDPLGFAGPVVMTGKLLMQELWKQKQSWDGALD FT AKQEKLWYEYRRQLQEMETIKISRCVLPLSNTISVDLLGFSDASEKGYGAC FT LYLRSRNIQGQISIRLLCSKSHVAPLKANRATIPRLELCGAVLLAQLVDQV FT IRSLPVDIDQIKLWSDSTIALSWINTCPSKLDVFVANRVTKILRITKPTNW FT YHVSGHENPADLVSRGIMPNELESARLWWRGPLFLKDPETTWYKNVNLITE FT DELPELTTITTRPVVTERFQLFDNSSKYNTMVRVVARIKRLFQNRKRGPDH FT QFRGEYTMQELRLAKLTLVRLVQKEAYPEVFDQLQNQVTTRNNSLIPLSPF FT VDNHGILRVGGRLSKSSCSYDQKHQMILPSSHPFTEAVIRSVHTSNMHVGA FT QTTLNAIRREYWIVRGKSAVKRVIRGCVRCFKTSARPIHQYMGDLPSCRLE FT AEYPFYRVGIDLCGPIPIKQRNKRSTVEYKGWIVLFVCLETKAIHLEIVSE FT LSTGAFMAAFDRFISRRGKPATVWTDNGTNFVGTSNVLKEWKEFFESSETQ FT EGIQQASDSEIEWCFNPPEAPHFGGIWEANIRQTKSLLIKYTAGATLCFEE FT LSTVLARIEAVLNSRPITSLSDDPEDFEPLTPGHFLIFRPLTAIPRPEVSL FT GSKHPRSRFEHINHIVKHFWDRWSSEYLSSLQQRYKWHKKVEVKQGQLVLI FT KKDNLPIQKWLLGRIVELFPGTDGVTRVVNVKTKNGILRRSVSKLCFLPVD FT PEQTVGRDVFQRGE" XX SQ Sequence 5559 BP; 1540 A; 1331 C; 1320 G; 1368 T; 0 other; ttaaaaaggt ccttcgaacc ggatctggct gaggcggcat ccgtcggaaa aggagagcta 60 aggtaggtgg atcgttttga gccagagcaa gcgcgttgaa tatatcatcc gaatagaatc 120 gtgtgccatt ttgtgctact gaccttgaat gcgatagcgt cgtttctttt gtctttaata 180 cacaaccaat tgctatttca aaatggctgc tgagcgaaaa gcttccattt tgtcgttcca 240 aagaacgctt caagcactga aacaacgtgc taacaaaatt ttgacctttg ttgcctcctt 300 caaggacggt gacgacccac tgcaacttga aattcgcaga actggtttag aagacatgcg 360 tgaatccttt ttcgaaaccg agaccaaatt gtacggttta attaaagatg atgaagtttc 420 ggcagttgaa ttagctgcag aagaatatta tgacctcttg ggggaaatca cgtatcagat 480 cgctgcaaaa ttgaaacctc tctcgaaccc ctcagaagca aagcctcatc tcgatacaag 540 cgtttcccaa aagcccttcc ctgattcgtc catcaaacta ccggatatta cgttgccacg 600 tttcagtggc cattatgaag agtggattta tttccgaagc cagtttaacc tcttgatccg 660 ccgaaatgaa tccctcaacg atcaacaacg attgcactat ctgcggtctt gtctcaccgg 720 tgaagctgcg ggaatcgaaa ctccggaaga atcgttttcg tctctatgga aggcgttgga 780 aacaaggtac gagaatcgcc gctggttggt agaccgccat cttgccgaaa tctttcaatt 840 aaagtccatc ccatgtgaat cttctaccgc tctgcgtgat ctggtgaacg tcgtacagaa 900 aaatcttcga gccttgtcct ccttgaaact gaatctcgac acgctttcgg aatcaatggt 960 agtccacatc gtcgcttcca aactcgacaa gcagacccac aaagacttcc aatctcaaat 1020 tgtcggtacc aatcttgtca aatggcaaga aatggttgag tttctgcaaa atcggtgccg 1080 catattagaa aatctggaac aggacaataa attagcttct cgtgtggtca gtaaatctgt 1140 gggcaccaaa cagtttccga aggtttttgt cagctcaaag gaggatgatt cgaaaaggag 1200 gtttgcttgt ttccattgtt ctggtggcca ttacatcaac gagtgtcgtt ctttccttgc 1260 aaagtctcca tcgcaacgtt tccaacgagt aaaagagctt cgattgtgtg tcaattgttt 1320 tagcaaccgc catgttgttg ctgattgcaa gagtagctcg tgtaaggatt gcggtcaacg 1380 acacaatacc cttctccatt ttgaaccgaa agcgtcgtcc aacctgagcg gcagaccaaa 1440 gatcaaccag aatacgagat ccgataacgg aggtaatgaa tcggcgccag aacctggacc 1500 aagccgtgtc gaactgtgtg cggtaacgga agtcgtcgat gtgatcgagt ccaccaatat 1560 ggtcgaaaca attgtgcctg agccagatgc gtttcgtacc ccaagcagct tgagttcgac 1620 tagggcgacg aaaagagttc aattaattac cgctactgat caagcgcttc tgcatacagc 1680 gatcgtgaat gttcgtggtg aggatggtga attgcaagag tgcagggctg tgctggattc 1740 ctgtgcgatg tctacgttca tgacaagtgc ttgtgttcaa aaactacgtc tcaaggcttt 1800 tccatctgat gtttctgtgg ttggttttgg aggtgcaggt cggaaaattg ctgaggctgt 1860 cgtcgcacat gtgagttcga aatgctccta cttcgattcc atctcggagt tcctagtaac 1920 accaaacatt acgaccaagc tgccattgag acctttcaaa tactcgcatt ggaatattcc 1980 ggagtggatc gatctcgcag atccaaactt caaccaacct gctcgtgtgg aaattctctt 2040 cgggattctg gattgggaca aaatgatgct aagtcgaagc tataaattgg cagacagtct 2100 gcccaccctt cgacaaacag tttttggttg ggtagctgga ggcccagtca tggaaagaag 2160 ttcctacccc atcgtgcaag cgctcccgat aacgaatgaa cagctggacg atcaactttc 2220 caagttttgg gagctagaag cctacggaca agagcgaatc tactgtagcg atgaacagaa 2280 ggctgaagaa catttctcct cgactcatta tcgagacgaa tctggtcgtt acgtcctcgc 2340 tttgccgttc aaggacactg atgttaaact tggcgattcc aagcacatgg ctcatcgaag 2400 gtttcttctt ctcgaaaaaa gactggaaaa agacgagcac ttatactccg agtaccgcaa 2460 atttatgcag gagtatctcg acatggaaca catggagaag gtgggaacgt ttagtcccgt 2520 cgaacaatcc accgaagaat gctatttcat cccccaccat gcgatacaac ggccagatag 2580 tagtactaca aaggttcgtg tagtttttga cgcgtccgca aagagcagca atggagtgtc 2640 tttgaacgat ctgctgctag tgggaccgac actacagccg catctggtca acattttgct 2700 cgctttccgt gtacacaagg ttgtcgcaac cactgacgtt tcgaaaatgt ttcgacaaat 2760 cagtattcga gaagaagatc gtcgtttttt gcacattttg tggaggtcca gtcccgagga 2820 agaaatcagc gtctatcgac tcaccactgt gacgtatggt acagcttgcg ctccttttca 2880 agccacccgc acgctgctcc aaatttgcga tgacgaagcc gaaaaattcc ctctggcggc 2940 acagtttggg aagaaatccg tctacgtaga tgatgcattg ttcggtgcag aaacggtcga 3000 agaagtccag aaaatgcgga aacaattcga agccatgttg gctacagctg gccttgaact 3060 gcacaaatgg tgttcgaatc gagaagaagt tttggcggat ctacctaccg aaaaactgga 3120 gcagaaagtt ttgttcagcg aagaaggtcg tacgaaaacc cttgggttaa cttggcaacc 3180 cactcaagac atctttagct tccaaattcc atcggcagcg tttagtgaag gagtcccctc 3240 aaagcggaag gtattatcgg acatatccaa gttgttcgac ccgctaggct tcgccggacc 3300 ggtagtgatg acagggaagt tgctgatgca agagttatgg aagcagaaac aatcttggga 3360 tggagcacta gacgctaagc aagaaaaact gtggtacgag tatcgacgtc aactacaaga 3420 aatggaaacc atcaagatca gtcgttgtgt gttacctctg tcgaatacga taagtgtcga 3480 tcttttgggc ttcagtgatg catccgaaaa gggctatggg gcttgcttgt atctaaggtc 3540 acgaaatatt caaggtcaaa tttcaattcg gttgttgtgt tcaaaatcgc atgtagcacc 3600 tctcaaggca aaccgtgcca cgattccgcg cttagaactt tgcggagccg tattgctcgc 3660 tcaattagtc gaccaagtta tacgatccct accagttgac atcgatcaaa tcaaactttg 3720 gagcgattca acaattgctc tcagctggat taacacctgt cccagcaaac ttgacgtttt 3780 tgttgccaat cgggtcacca aaatattgcg gatcacaaag ccaacgaact ggtatcacgt 3840 cagtggtcac gaaaatcccg ccgacttagt atcgcgcggt atcatgccaa acgaactcga 3900 atccgcaagg ctctggtggc gaggtcctct gttcctgaaa gatcccgaaa caacctggta 3960 caagaacgtt aatttgatca ccgaagacga acttcccgag cttactacga tcactacaag 4020 gcctgtggtg acagaaaggt tccagttgtt tgacaacagc agcaaataca acaccatggt 4080 tcgagttgtt gctcggatca agcgcctgtt ccagaatcgt aagagaggtc ctgatcacca 4140 gtttcgaggg gagtacacca tgcaggagct tcggctagcc aagctgacac tagtgcgtct 4200 cgtccagaag gaagcttacc cggaagtatt cgaccaattg cagaatcagg taacgacaag 4260 gaacaattcg ctcatcccgc tctccccctt cgtagacaac catggcatcc tcagggtcgg 4320 tggacgccta tcgaagtcat cctgctccta tgaccaaaaa caccagatga tccttccttc 4380 aagtcaccct ttcaccgagg cagtcatccg ttcagtgcat acgtccaaca tgcacgtggg 4440 cgcccaaaca actctgaacg ccattcgaag ggagtactgg attgtccgtg gaaaaagcgc 4500 cgtgaaaagg gtcatccgag gatgcgtccg atgcttcaaa accagcgctc gcccgattca 4560 tcaatacatg ggagatttgc catcttgccg tctcgaagca gaatatccgt tctatcgggt 4620 tggcatcgat ttgtgtggac caattcccat caaacaacga aacaaacgat cgaccgtgga 4680 gtacaagggc tggatcgtac tctttgtctg tctagaaacc aaagcgatcc acctcgaaat 4740 cgtcagcgag ctatcaaccg gcgcattcat ggcagcattt gaccgattca tcagtcggcg 4800 aggcaaacca gcgactgtgt ggaccgataa cggtaccaac tttgtcggta catcgaacgt 4860 tctgaaggaa tggaaggagt tcttcgagag cagcgaaact caggaaggca tccaacaagc 4920 aagtgactcc gaaatcgagt ggtgtttcaa ccctccagag gcaccacatt tcggaggaat 4980 ctgggaggcg aacattcgac agaccaagtc cctgctaatc aagtacactg ctggagcaac 5040 tctgtgtttc gaggaattat cgacagtatt ggctcgtatc gaagccgtgc taaactcaag 5100 acccataact tcgctgtcag acgatccaga agatttcgag ccattgacac cgggtcactt 5160 tttgatcttt agacctctaa ccgcaattcc ccgaccggag gtcagtcttg gttccaaaca 5220 tccaaggtca cgtttcgagc atatcaacca catcgtcaaa cacttttggg accgatggag 5280 ctcggaatat ttatcgtcgc tgcagcaaag atacaagtgg cacaagaagg tcgaagtaaa 5340 gcaaggacaa ctggtgctta tcaagaagga caacttgcca attcaaaaat ggttgcttgg 5400 tcgcattgtt gaactgttcc caggtaccga tggagtaaca agagtggtca acgtgaaaac 5460 gaaaaatggt atattgcgaa gatctgtttc gaagttatgc ttcctcccag tcgaccccga 5520 gcaaacagtt ggaagagacg tcttccaacg gggggagga 5559 // ID Chapaev-10_HM repbase; DNA; INV; 3013 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3013 RA Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(3), 177-177 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 874..2496 FT /product="Chapaev-10_HM_1p" FT /translation="MCLKCSGVVKSGIIHLCSDVQAVKNLVNAAYALGSAS FT AERVASSILKTKMEKENICRGEKFTIATHGKPLTIVAGTTDNKCDRVNFSQ FT VSFGTIIELIKCLELSQCKTKKMCSIFRKSLCFQSSIEGNIVKKLEALKAK FT INEYFSCKEEEFFEAGDIVNKSMVYIKDIDEFIHDIITERNIDPLSTIVRV FT AVDSGQGFLKVTMNVFNPHDKTSNQPDLDDAGVKRCFIVAIVEGVSEHNGN FT LRKLVDTLNLQNIKYSVAFDLKCANSMFGITSHAGKYSCLWCEGESLLDGG FT EKRTLGSLDYHYSKYTEAGKPKSKMADYKNVINPRLLYLEEDPDTIIENLV FT PVPELHTLIGIVSTFGKLLSKLWPGFEKWLNSNYIFFRGYHGVGFDGNNAN FT RLLDKLDVLSRDIADQGKLDLLPVIECLRKFQAMKQATFGEKIIGDIESVV FT FEFKMSYANLMEYINTYFENISLNIPWKVHIAVNHIVPFLKSTNTDNGLGI FT YSEQAGESVHHEFKKTWIRYKRRQSHPDYAKRLKSSVVEFSTFNK" XX SQ Sequence 3013 BP; 1086 A; 374 C; 496 G; 1057 T; 0 other; cactatgaac cacattgacc gccaggtttg aagtctcccc acatcagatt tttttttata 60 aatacttagt tgtatatatg aaaataaata tttggtttgt aaaaacctct aggggaatgg 120 aaaaatgtat gaaaatcaaa aaataaaaag tataattttt agttaaaaat atccgccatt 180 ttcaacaaaa tagctataat ttgataatca aataccggcg ttttttgatg ttaaatatac 240 tctagagcac aacgacttga attttattaa atggcggtca acaatttatg tgctgtttgt 300 tttaaaatct ttggtaaaaa gtcagcaaag cttcataaaa tatctaaagc aatcgaagaa 360 aaaataaaaa cttttatttg ggatagctat gatcaaaatt tagtaaacca ttctaaagtt 420 atttgtagca attgttataa aaacctttat tgtttggaaa aaaacgatac aaaatacctt 480 gataagtggc taaagcaaat ttcccaggtt agtttataaa attaataaaa aaactatatt 540 taagacttag tgtttaatta agtaaatttg ttttattttt attaattttt aaaaaagaaa 600 aagctttgtc tcaatgaacc ttcataaact tgaaaaaatt aaaccattta ttatatataa 660 cagtttgttt aaaattccta tatataggaa gtaatatgtc atcgatgtgt ttatttttaa 720 tgtgttatat tttttatatt actagtttat atagttttag atttttttgt ttagataaac 780 cgtcaagata tcagaagaac ttctgatgtt tcagaaatat caaaagatat taatattatt 840 acttctgagt taaatgaaga taaaaatgaa agaatgtgct taaaatgttc tggagtagtt 900 aagtcaggca ttattcatct ttgtagtgat gttcaagctg tgaaaaattt agttaatgca 960 gcatatgcat tgggatcagc aagtgcagaa cgtgtcgctt caagtatttt aaaaacaaaa 1020 atggaaaagg aaaatatttg tagaggtgaa aagttcacaa ttgctactca tgggaaacca 1080 ttgactattg ttgccggaac tacagacaat aaatgtgacc gtgttaattt tagtcaagta 1140 tcttttggaa caatcattga gcttataaaa tgtttagaac tatctcagtg taaaaccaaa 1200 aaaatgtgtt ctatatttag aaaaagtctt tgctttcaaa gtagcattga aggaaatatt 1260 gttaaaaaat tagaggcttt aaaagcaaaa ataaatgaat atttttcatg taaagaagaa 1320 gaattttttg aagctggcga tatagtaaat aaatcgatgg tttatattaa agatattgat 1380 gaatttatac atgatataat cactgaaagg aatattgacc ctctttctac aattgtaaga 1440 gttgctgtgg attctggtca aggtttttta aaggtcacta tgaatgtttt taatccgcat 1500 gataaaacta gcaaccaacc tgaccttgat gatgctggag ttaagcgttg ttttattgtt 1560 gctattgtag aaggagtgtc tgaacacaat ggaaatctta gaaaattagt tgatactctt 1620 aatcttcaaa atattaagta ttctgttgca tttgatttaa agtgtgcaaa ctcaatgttt 1680 ggcattacaa gccatgctgg aaaatatagt tgtttgtggt gtgaaggaga gagtcttttg 1740 gatggtggag aaaaacgaac tctcggttcc ctcgattatc attacagtaa atacacagaa 1800 gctggaaaac caaaatcaaa gatggctgat tataaaaatg ttattaatcc aagattatta 1860 tatcttgaag aggatccaga cacaattatt gaaaatttag ttccagtacc tgagttgcac 1920 acattaatcg gaatagtttc aacgtttgga aaactattat caaaactatg gcctggcttt 1980 gaaaagtggt taaactcaaa ttacatattt ttcagaggtt accatggcgt tggatttgat 2040 ggcaacaatg ccaacagact tttagataag ttagatgttc tttctcgtga tattgctgat 2100 caaggaaagc ttgatttact accagtaatt gaatgtttaa gaaagtttca agcaatgaag 2160 caagcaacct ttggagaaaa aattattggt gatattgaat ctgttgtttt tgaattcaag 2220 atgtcttatg ctaatttaat ggaatatatt aatacttact ttgaaaacat ttctttgaat 2280 ataccgtgga aagttcatat tgcagttaac catattgtac catttctaaa aagtaccaat 2340 actgataatg gacttggaat ttattctgaa caagcaggtg aatctgttca tcatgagttt 2400 aaaaagacat ggattagata taaaagacga caaagccacc cagactatgc taaacgatta 2460 aagtcaagtg ttgtggaatt ttcaacattt aataaatagt taacaagttc acttttcaat 2520 ttaatatatt gttgatccac aactaagtca aaattatttc caaagtattt taatagcatg 2580 aaagatactt ttaattgagt gttgataaaa tataatgcta tttttgtatt tcaaagctta 2640 gtcaagatat ttttcacatt aaatgcattt tgttgttgtt gtttttatac ctacattctt 2700 ttcctatgta atctgttata gaaatgtttt aacacttttt taatgccttt atattagggg 2760 agtaaacaag tgcagaataa aaattaggtt tcttcatgaa attatgtaaa atatatgaac 2820 ttcaatagag ttagcgatgt accgcccctc ccctttgtat ctaaagttta gatgtgtaaa 2880 aaatttattc tagccattcc tatggaggtt cttaatacct ttacttaatt tatatactat 2940 gttacttctt aataagaaaa aaaaattgga tgtggggaga cttcaaacct ggcggttatt 3000 gtggttcaca gtg 3013 // ID Baggins1_Cis repbase; DNA; INV; 5611 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 23-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE Transposable Element from Ciona savignyi. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LINE; KW Baggins1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5611 RA Smit A.F.; RT "Baggins1_Cis - Transposable Element from Ciona savignyi."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Ci000545, Ci000042, Ci000006. A LINE-like element related to CC Lian in mosquito and Baggins1 in Drosophila . ORF1 from 1023 - CC 1808 without clear similarities. ORF2 from 1815 - 5513 36% CC identical (53% similar) to pol-products from above LINEs. Copies CC 0-2% diverged from consensus. XX SQ Sequence 5611 BP; 1493 A; 1464 C; 1318 G; 1334 T; 2 other; tttcctctct atggattgcc accttaacgt ggtagggggg tttgagtttc caatgaaccc 60 aagagcaatg ccatcggggg tggaattttc tacccctggc agggtcaccc atggtggtaa 120 ggtcgaaggt gaggaactga caaagctgca atccactttc gggttaccgt taagctactg 180 taatggtatg tgaaagggga atttctcgac cccacccacg gcagggataa actggttcta 240 gccctggcca gtttatgcgg cttccgaact gcaaaggcca cctggtgctt gaggattact 300 ccacttcaga taagtggtgt aactccaaca cggtactttg ggggttggcg cactaggagc 360 accgcggtac cgagagtagc gcctcggcta ctagggggta aacccttgcg agggtgcgag 420 gatcaagtcc agtcgtagac atatcaatgg cctgagtctc agtgggctga ttccccatct 480 gtgtacgagg gtcggcggga gctgttgaaa gctggtcggg gtcctacgga gagccgaaac 540 tgcgggaggg cgttttgcct ggccttgttg cctcggccgt gccgagaacg taagtcggac 600 cctagagggg gcaaaaaccc tattgtatag catggaacag aaaaaaaatc acgaaaaaca 660 acccgctgat gatggtttgt cctccacctc tttatcgagg ttggccgaca aaccggaggc 720 aaacccagac aangcgagca ttcaatgctc tgcggcgtca ccccctacaa cctctgcggc 780 accccaagct ttcttcagcg atagcgatgg ggtttccgct gtcacttcgg tgacatctcc 840 actggcgcat aagctctcaa atctaacgat ggagcaagcg gagctgtctg acttgtcgga 900 cagtggtgaa gcaaactcca ttcggcggac acgagaagcc tgtgagccag caccttccaa 960 ggcggagtat aaaagagaaa agaatcggcg gcagaaggct cggaaacgag cccgaaaagc 1020 caatgataat ctctgcaatc gtcccggccc cagcaacacc tcagtgccca cagctgcccc 1080 cactggggaa tggcaaagtg agcgccagcg ggtccgctaa acggaaaagg cagagcaaaa 1140 cttcctcttc caccacccct actccggcca caaaaaagcc ccgtggtcca tctgcgtcgt 1200 ccacacgagc tacgtcatat gcagatgcgg caaacagcag tcttagactt gatgttctac 1260 ccgctagcgg gtcagaaagt cttgtccagg ctgatatccg ctacatcgag agtgctatgg 1320 cagactccat gttggacggt acacttcctc ttataaaggc caaagggctt gtgctgctaa 1380 agtcaggtac catgaaggtc acctgtatgg atgcagcatc actggcggcc ttcaggaggt 1440 ttatccgtgg tttaccacca atgccaggta agaacattgg ctatacagca aatgggcccg 1500 aagatcgtct caacgaacgg gtctatactg tctgggtctc tgacccgaag gttcgaacaa 1560 acccggaacg gtttattcgt cttctcgggt tccagaactc cgaacttccg attgagaagt 1620 tccacgtccg tgggtacgtc ggcgaagtga cggaagaagg cacacaccta cgtgtaggtg 1680 tggagatgtc tgcgattgaa gatctgcgca agatggactt cagccctcac tacatggcag 1740 gccttctcac ttttgtggga aaccggtccg tggaagccac tacccactca gagtctctta 1800 aacactagtg tgtgatggag gctctgaaaa agactagcaa ggaccttgtt ctttgccaga 1860 tcaacttgca ccattgtata gcagccgccc aacaattaat gggtcggttc caaacaatgg 1920 ggcaaacacc acttgccctc atacaggaac catactgtaa gcgaggaaaa gtggcctttg 1980 ttcctcagaa tgcttgctgc ttttctgatc ctaattcaag caagccacgt gcttgcatcc 2040 tgtctgcaaa gaatctggtg gcgtggcctc tactccaatt catggataga gacacctgca 2100 caattgcaat tgcagacaac actaacagca aaaaccctgt tgttttctgt tctgtttaca 2160 tggctggtga tgaccccacg gaaccaccac cgcacaagat gcgagaactt gtcacccatt 2220 gtgaacaatc aaaccttcgt ctgctcattg ggacagatgc aaacgctcac cattactact 2280 ggggaagcac tgactgcaat agtcgggggc atgcccttat ggagtattta atgagcacaa 2340 ggctgtcagt atgcaatact ggcaactctc cgacctttgt caccagaaac agagctgagg 2400 ttctcgatct cactctgtgc tctgccaatc tgctttcaaa ggtgtctgac tggaaggtca 2460 gcaaagaagc ttctttgtcc gaccaccaat taataacttt cagggtgaac aacctttgta 2520 agaccactcc atcgcatcac aggaatgttc gcaaaactga ctggatgctc tacctgcaag 2580 aactaactct cctgtgtgag gcaattccag ctgcacctcc aaaaactgct aaggaggtcg 2640 acatgctggc gaaagctgtc gaaacagcca ttaccttatc atatgagaaa tcgtgtcgcc 2700 tgatcaaaag aacatcaaat gacactcctt cctggtggtg cgcagaactc acacttcttc 2760 gtcgctgtgc gagacgctcc caccggaaag ctcttaagct taagactcct gccgcatggg 2820 atgagtccaa ggcaagcacc cgagcttaca aaaatgccat tcgctccaga aaacgggcgt 2880 cttggcgcaa actgtgtgag gaggtgaaca ccttgccagc aatggcacgg cttcatcgaa 2940 tcctcaagaa gcatcgctct cttcagattg gatctctacg gaaatcaaat ggtgatttca 3000 ccagcactcc atctgaaacc ttacacgttc tactggacgc acacttccca gagaacgtaa 3060 ataatcaagg atcatctact aaccacgtta aacaatggtc ctcaattcaa gctagcgatg 3120 aagtgctagt caacaacatc atcacagctg aaagaattaa ggctgcttat acatcattta 3180 agccttttaa aagcccaggg cctgatggaa tctatccaat acttattcaa aaaggattgg 3240 aaatcctgtg cccccttctt atccagatct ttaaagcatc tctcctgttg ggttttatcc 3300 cacaaagatg gctggtcgcg agagtggttt tcccacccaa accagggaag gaagactact 3360 caacggcaaa agcctatcga cccctgagtc tcatgtcatt cgtgatgaag ggcttcgaga 3420 gactcatact ttggtatctg caggagggac ctattggaca atccccccta caccccaacc 3480 aatatgcata tagagcggga ttttctaccg aagatgccct ccataatatg atcatcaaac 3540 tggagaaggc ggtctttcat aaccaatatg cacttggagt atttctggat attgaaggag 3600 ccttctctaa tgccactttc gattccatga tagttccttt gcgaaagtat catgttaatg 3660 aaacagttat ccggtggata agttacatgc ttaaacaccg aacagccaca gcagaactgc 3720 atggctgtac agagaaaaaa caggtcaaaa aaggttgtcc acaaggggga atcctatccc 3780 ccttgttatg gaacctcctg gtggacactc tactacgcca attcactagc caagaggctg 3840 cacttgtgca agcatatgct gatgacttga gcgtcattat atctggccca gacccatcaa 3900 cgattgggac gattgcccaa tcaaccctta atcgtctgga gaaatgggca ataaacaacc 3960 accttcgctt tgccccggca aagacagtgg tggtaatgtt cacaagaagg agaaaatggt 4020 ctttaagacc aatttttctc cacaatatcc agcttagcct atcaaaggaa gctaagtatc 4080 ttggtgtcat cctggacaac aagctgtctt ggaagaatca ccattcaagt cgtatcaaga 4140 aggcaactgc tgctctagca caatgccgta gagcagttgg ccctacatgg gggctacagc 4200 caaaaataat gatgtggctc tactgtcaga ttatccgacc tatcctgacc tatgcttcat 4260 tagtgtcagt cagcgcaaca taccagcgca atatacagga caacctgcgc aaggttcaaa 4320 ggttggcctg cctatgcatc actggcgcct ttaaaggcac ccccacaagg gcaatggaga 4380 tgctacttaa cctccccccg ctgcacctgt acctccaggg cacagccatt aaaacggcac 4440 accgcctgag atttttaggc cactggaagg ggactggtta ctcagtgtgg cacaagaaaa 4500 gtcatattga catctgcaac caagcaatgg aagatatcca tcttctctca ttgccagtgg 4560 attgcagttc ctgcactctc catctggaac gccacttcgc gattcagatc agcgaccgtg 4620 acaacatcac tccacctgta cgaccatctg acccatcaca gcttgtctgt tacactgatg 4680 ggtcacgact agatggatcg actggtgctg gagtacttat tcaatccacc aagtttgatt 4740 acaaattcag ctttccactt ggaagcttcc cctcagtttt tcagacagag attcttgcaa 4800 taaatcatgc tgcaaggatt ctgcagtctt ggaacactgc aaaaatgtcc atcaccattc 4860 acagtgatag tcaggctgct ctgaaagctc ttatgagctt gaagatacat agccccttgc 4920 tgcgagagac atggtacctt cttcacaatc ttggagggct taatcttgtg gaactttgct 4980 ggattccagg ccactcaaac tttgatggca atgatcaagc tgacgagttg gctcgtgacg 5040 catccagcac tgccttcatc ggtccagaac catccctctg tattcctgtc tgctctgtca 5100 gaacagaatt ggaggcgtgg atgagccgaa aacatgctga agaatggaga caacactgtg 5160 gctgtcgtca gacgaaggag tatcttggac aaggcctatc cctgtccagg gaacttctcc 5220 ggctgcggcg gagacagtta cgatctgttc ttcagattat cactggtcat gctaatttgg 5280 caaagcacca atccatcctc atgcgcacgg catgtgcaac ctgtcccttc tgccaagaag 5340 aagacgagac tcctttacac tatgttggcc attgtccagc atttgctgta accagatgga 5400 ggaattttgg agtttttaaa tctgatgatc tcgcatctat tccggtcaag catttagtca 5460 gtttccttca ggacactggc agacttgacc gctatatccc ggatccatgc tagccctggg 5520 cgctggagta atgggccttc catggcctaa attcctggtc ccggtagcta ctgtggaccc 5580 gcccntatta atgtaatgta atgtaatgta a 5611 // ID hAT-7_AP repbase; DNA; INV; 2951 BP. XX AC Contig56589; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-7_AP. XX NM hAT-7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2951 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(8), 1791-1791 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(317..874,909..2714) FT /product="hAT-7_AP_1p" FT /translation="MPRLQIYLIGPMNNQITGSKLPSKRDCLSVLFYNMRL FT VNLNLHDSSRLVIDECSIFWKKARIPTHDNSDCIKKLKKLYEEWRKLDKNK FT TRTTELQKTHENKFEEQLDNLFDISHANALNLIKIEEDKQFLFRQREKGRP FT GCMLGTDMKLAGIEKRKATLKENEDGRKKRREMETLKLTGKLLNIENYYNL FT FLNNDFLERIRLYSSSSSEDQDNTLLEEVLNETPGCSLPMPSSSNPIRAKK FT SIMTSRLSAALDKCKISDRDAVHLLTAAAESFQVNCLEFTINRSTIKRARE FT NFRKQSSEAIKSTFIDQNLNYVIVHWDSKLLPDILSKENIDRLPVIVSASN FT VEKLLGVPALTSGTGEKISSAVHNLLLDWNLVDKVQAFVFDTTASNSGRLN FT GACVLLEQLLNRDILFFACRHHIFEIVLQAVFAHVKITVMNGPDIPLFKRF FT KNHWKNIDLSNFRIFSTFQHSHTNLKDVANNISLFCIQRLNDNFPRDDYRE FT FLELVLIFLGGSPPRGIKFRQPGAYHLARWMAKAIYCIKILLFSDQFKITT FT REQNSLDEVCCFIVKCYIQEWITAPNPVTAPMNDLLFLKKLKNYNGKLGEV FT AIRKFINHLWYLNEECAVFSIFDDRVNNETKQRMAKYILEGKEQYSNNSEE FT ENEESKKKLNLKFDDLTHFLNKDHPFDLLTNKSWKLFERFQISDIFLRAHP FT STWKDIEEYRQAKEVISSLKIVNDAAERGVKLMEEYNQKFSKNEEQKQFTL FT QVNIKNQSNLQSKMPNNSIYFSGCSRISQNIFRLQS*" XX SQ Sequence 2951 BP; 1120 A; 387 C; 455 G; 989 T; 0 other; aggcaaaaat attggtttta cggtttttaa aatatatttt acgaacggtt gaagatacgt 60 tcaaatattt attattatat taaaaataat gtgaaattac cgtcatcgtt atcaaaagta 120 gataaaactc cgatagcgat aaaataaaac ttaataacag ttatgaatta tacagataaa 180 caatataata gttataaaag gacattttat attttagttg ttgtatttca cttgaacgtt 240 tcgcgttttt atagttttat agtgattatt ttaagttata atttatacgt tactgtttca 300 aaatttattt tacaaaatgc ctagactaca aatttattta attggaccca tgaataatca 360 aataactggt tcgaaattgc cttccaagag ggactgtttg tctgttttat tttataatat 420 gcgcttagta aaccttaatt tacatgatag ttcacgcttg gttattgatg aatgttcaat 480 tttttggaag aaagctcgaa ttccaacaca tgataactca gattgcatta agaaactgaa 540 aaaattatat gaagaatgga gaaaattaga caaaaataaa acacgaacaa cagaattaca 600 aaaaacacat gaaaacaaat tcgaagaaca attagataat ttatttgata tatcccacgc 660 caatgcctta aatttaatta aaatagaaga agataagcaa tttttattta gacaacgaga 720 gaaaggtcgt cctggttgta tgcttggtac ggatatgaaa ttagcgggca ttgaaaaacg 780 taaagcaact cttaaagaaa atgaagatgg gcgaaaaaaa cgacgagaaa tggaaacctt 840 aaaattgacc ggtaagttat taaatataga aaattaaatt gtatacatat tttcttaatt 900 ttgattagta ttataattta tttttaaata atgatttttt agaaagaatt cgactttaca 960 gttcatctag ttctgaagat caagataata cacttttaga ggaagtttta aatgaaactc 1020 caggttgctc attaccaatg cctagttctt caaatccaat acgggctaaa aaaagtatta 1080 tgacatctcg cttatcagca gctttagata aatgcaaaat tagtgatcgg gatgcagtgc 1140 atttattaac agctgcggcc gaaagttttc aggtaaattg tttggaattt acaataaata 1200 gatcaactat taaacgagcc cgtgaaaatt ttcgtaagca gtcatctgaa gctattaaat 1260 caacatttat tgatcaaaat ttaaattatg taattgtaca ctgggattct aaactattgc 1320 ctgatatatt aagtaaagaa aatatagata gacttcctgt aattgtatca gcttcgaatg 1380 tagaaaaact tttaggggtt cctgctctaa cttcaggtac tggagaaaaa atatcttcgg 1440 cagtgcacaa tttattatta gattggaatt tggtggataa agttcaggcg tttgtgtttg 1500 ataccacagc aagtaatagt ggtcgtctta atggggcatg tgttctttta gaacaattat 1560 taaaccgtga tattttattt ttcgcatgtc gtcatcatat atttgagata gttttgcaag 1620 ctgtatttgc tcatgtgaaa attactgtta tgaatggccc tgacatacct ttatttaaac 1680 ggtttaaaaa tcactggaaa aatatagatt taagcaattt tagaatattt tctacttttc 1740 aacattcgca tacaaatttg aaagacgttg caaataacat atctttattt tgtatacaaa 1800 gattaaacga taattttccg agagatgact acagagaatt tcttgaactg gttttaatat 1860 ttctaggagg ttcacctcct cgtggaatta aatttcggca accaggtgct tatcacttgg 1920 cacgatggat ggcaaaagcc atttattgta ttaaaatttt attattcagt gatcaattta 1980 aaattactac tagagaacaa aactctcttg atgaagtttg ttgtttcata gttaaatgct 2040 atattcaaga gtggatcacc gcaccaaatc cagttacagc accaatgaat gatttgttgt 2100 ttttaaaaaa actaaaaaac tacaatggta aattaggaga agtcgccata agaaaattta 2160 ttaaccattt atggtatttg aacgaggaat gtgccgtgtt ttctatattt gacgatcgtg 2220 ttaacaatga aacaaaacaa cgtatggcaa aatacatttt agaagggaaa gaacaatata 2280 gtaataacag tgaagaagaa aatgaagaat cgaaaaaaaa attaaattta aagtttgatg 2340 acttaactca ttttttaaat aaagatcatc catttgattt attaactaat aaatcgtgga 2400 aactatttga acggtttcaa atttctgata tatttttacg agcacaccca agtacatgga 2460 aagacataga agaatataga caagctaaag aagttatatc gtcattaaaa attgttaatg 2520 atgcagcaga aagaggtgtt aagttgatgg aggaatataa tcaaaaattt tctaaaaatg 2580 aagaacagaa acaatttact ttacaagtaa atataaaaaa tcaatcaaat cttcaatcaa 2640 aaatgcctaa taattcaata tatttttcag gttgttcaag aatatcgcaa aacattttcc 2700 ggctgcagtc gtgaagtctt aaaaaaaaat tttgaatatt aatagtatta actataattt 2760 gtataatatt cttataactt aactgagtaa tttaactgca acaacaaaac tgttgattcg 2820 atactatttt cacccataac attatttgta gaaaaatcaa ttgtatttaa ttagtttatc 2880 aaacattttt ttcatacctt caaccgttca taaaatagat ttaaaaaacc gtaaatccaa 2940 tatttttgcc t 2951 // ID R1-2_AP repbase; DNA; INV; 6085 BP. XX AC Contig3122; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE Non-LTR retrotransposon. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-2_AP. XX NM R1-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-6085 RA Jurka J.; RT "Non-LTR retrotransposons from pea aphid."; RL Repbase Reports 9(8), 1795-1795 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC The termini are approximate. XX FH Key Location/Qualifiers FT CDS 762..2123 FT /product="R1-2_AP_1p" FT /translation="MSTHSLEDITQEPQSPLPGTSQQQMRSPPSPGDSPQT FT KKPKGLPTPGPVARVTIKWIRHTLDLASTRKTAMAVDVTRNLFSKLDELDT FT AVIDLVIENLQLRSQVEEARRSAEICVGAAAAQFGTELRLREAAHEQTLEA FT VVTRYAEKEAIRSLETAPVRQNVTEEQMCTEEPPTFAVVTRRKQSRTVDRI FT ADRSRSRATTRNKRIRETRQAEHLPSFILNESPGKSNTEVRDLIWNQVVAK FT NSRPKCHTITTKTGKTILKPSDKETVDALKYISKVSTLLKEDSLRWPRVII FT RGVSSDMKFDHQIQQCIIAQNPELGFDDNEDEMVIKPVFKSGPRDRSTTNW FT IVEVNPKHFEKFENTTIFLGFMRCRVSAYEEVTQCHLCLRYGHPASKCLEK FT DCVCAHCGRKAHKAADCPAAEADPVCINCRGKHSARDKTCSARTAFLVNQV FT RRTDYGKSQ" FT CDS 3468..5228 FT /product="R1-2_AP_2p" FT /translation="MIRVLWPAIAPRLVTIVNESLRTGRFPNSWKMAQVIP FT ILKEQDRDTALPKSFRPVSLLPVMGKITEKAINSRLSEQIRPRLSGKQFGF FT TPGRSTLDAIQSLLTWINLSAEQHVITIFLDISGAFDNLLWPALQRDLCSL FT GASLHMRHLIADYLRGRTATMTIGGVSKSVRVMKGCPQGSILGPVLWNVTV FT EPLLQTDFPDYVNIQAYADDIAISVLGPNRRTLIERAEATLLPALAWAHER FT GLTFSAQKSVAMITKGTLVPGFTLSFGDERIVSADNGRYLGIRLDQKKSFM FT PHFEALKSSSETLFSRLRGTIGAGWGLRRENIMILYRGVFVPKIAYGSQFW FT VHKIKTRQMIRNFGSIQRRALLGMTSAYNTNSTDALQVIAGVPPLDIEIQW FT LVQKAEVALLPAQLRNETLKAKREALLDEWQTRWSNSIKGRWTFRFFPDVR FT ERLLLPLSLGHEVVQFMSGHGNSRAKLASFNLQPDSRCACGMGEEFVDHVL FT YNCTLHAEHRARGTQSRSPMAVRSGHVGFHIQALYGASKILQNGCLHGMPM FT RWKNTRTDKTLWAPECQTPHVFWTSVKARDGRSGSHVGPS" XX SQ Sequence 6085 BP; 1598 A; 1579 C; 1673 G; 1235 T; 0 other; gtgtccactt tacagtggag cctgtaagac gtgtgttgat acagttagtc gagctccgag 60 gggcgccctc tagcggattt ctacggaaca tgcgctaagt cagtgagttc tcaaaaattt 120 tttgtgtttc cagttcgagc cagtgtataa aaagttcgcg aacagaccgg tgccggtttt 180 gagtcggtct gtccaaaacc ccctctgtgg ggaccgtttt tgtgtctgcc cataaggtta 240 acacggccgc ccgccggtat tgccgagcga tttttggttt gtcgtcagcc acgagaccag 300 gcatcaagag cctgacgagg tacgtttggc agttcactta cgtaataaga cttatattta 360 agtgtcccct cctagggctg gtataggtag tataccaccc ccactccaac actttgggtg 420 gccataactc cgtgggtttt gggccgcaag accccggacc aagggggaac ctattttttg 480 ggctcaccat tgaatctccc gcattcggga acaaagcccc cacccccccc cccctgtttt 540 gagttaagta taactccgcg ggaaatcgcc gtaagagcct agaggcaacg ggcatcggtt 600 accctaggtt ggggccgcaa tttgacccaa tcgggaacgg ggcacctttc caccccccat 660 tttggatttt cgtttttgag caaaaatctg tcagttagac gaccagcacc aatacccccg 720 aggggaaact cggggttttg agttccctcc cctccagagg gatgtcgaca catagtttgg 780 aggatatcac gcaggaacct caatcaccgt tgccgggcac cagccagcag cagatgaggt 840 ctccaccttc accaggggac tcgccacaga cgaagaagcc aaagggtttg ccaaccccgg 900 gaccggtagc cagagtgaca ataaagtgga ttcggcatac cctagacctg gcctccacaa 960 ggaaaactgc catggctgta gatgtgaccc gaaatttgtt ctctaagctg gacgagctgg 1020 acacagcagt catagatcta gtcatagaga acctccagct gaggagccag gtggaggagg 1080 caagaaggtc cgcggagatc tgtgtggggg ccgctgcagc gcaattcgga acagagctac 1140 ggctcagaga ggcggcacat gagcaaaccc tggaggctgt tgtgaccaga tacgccgaga 1200 aggaggccat caggtccctg gagacggcac ccgtgaggca aaacgtaact gaagaacaaa 1260 tgtgcaccga agaacctcca acatttgctg tggtaaccag gaggaaacaa agccgaacgg 1320 tcgacaggat cgcggatcgc tcaaggtcca gagccacaac gaggaacaaa aggatcagag 1380 agacacggca ggccgagcac ctaccttcct tcatactcaa cgagtcgccc ggcaaatcaa 1440 atactgaggt acgcgacctt atttggaacc aggtggtagc caaaaactcc agacccaagt 1500 gccacaccat caccaccaaa actggcaaga ccattctgaa gccctctgac aaggagaccg 1560 tggacgcact gaaatacatc tccaaggtat caaccctgct aaaagaggac agcctgcgtt 1620 ggccaagggt gatcattcgg ggtgtcagct cagacatgaa gtttgaccac cagatacaac 1680 aatgcatcat agcgcaaaac ccagagctag gtttcgacga taacgaggac gaaatggtga 1740 ttaagccggt attcaagagc ggtcccaggg acaggagtac cacgaattgg atcgttgagg 1800 taaatccaaa acacttcgag aagttcgaaa acaccacaat cttccttggc ttcatgaggt 1860 gtcgagtcag cgcctacgag gaggtcacac agtgtcatct atgcttaaga tatgggcatc 1920 ctgcttccaa gtgcctggaa aaagattgcg tctgtgctca ttgcgggcgg aaggctcaca 1980 aagcggctga ctgtcctgct gccgaagcag atcccgtgtg cattaactgc agaggtaaac 2040 acagtgccag ggacaagaca tgctctgcaa ggacggcttt cttggtcaac caggtcagga 2100 ggaccgatta tgggaagtcg caatgagcgt ccatccccca ctcaagattg tgcagctaaa 2160 catggggaga gcggccgcgg tcaacgacca gctcttggcc tactgccagg aatcgggagt 2220 ggatattgcc atggtccaag agccttacac caacaggggc aaactcacag gtttcgagac 2280 agccccgata aggggttatc tttctaaggg cacacgacga agaggcgccc ccaattacgt 2340 ggactatggt gcggcaataa tcgttttcaa tgccaacttg gtgatagcga ccagatccgt 2400 gggaaccact gaaaatttcg tctccataga cctggattgc gatgcagatg gcactgtgac 2460 cttgatcagt gggtatttca agtaccgagt tcccacagag gtgcatgtgg atgtacgggg 2520 acccctgttc cagaccgcag cacaggaggt actcattacg ctggacgcca atgcgttctc 2580 aacccgttgg tttagcagaa tcaacgacag acgagggaag gctttggtca cgtggctgga 2640 cgaccataac ctgaatattg caaacaaacg cagtccacac aataccttcg acggaccccg 2700 aggtagaaca aacatcgata tcacagtctg cagtgattac ttattgggca aaatccggga 2760 atgggctgtc attccagctg ccacttccag cgaccacaac cttatttcct tcactgtgga 2820 tctgagtcta agggaattca tccaccgcag catacgtttg aacctcttga ggagaaatta 2880 taataccttc gtccaagaat atgaagccag gacaacacaa agaaccgaaa ttgctctcga 2940 tctagatact atggccaccc aggtatttga ggacgtaaca tattctgcaa attgacatgc 3000 caccagaagc actcgcaaaa ggaaagtcac accgccttgg tggtctcctg agctgaccaa 3060 caagaggaaa gaggtacgcg cagcagcaag acacaaggct gacgggggtc gacaacacta 3120 caactccata agaaacgagt acacaatgct gctgaggaga aataaagtaa cctcgtggaa 3180 aaacttttgc accctggagg gggtccaacc ctgggggaaa ttataccgat ggatgaaggg 3240 cggaaacaaa cccctaacgg caattgggct aatgaggctc cccgatcgac gaatcggtgt 3300 ccacgctgct aaatgtgctg atcccgaatg acccaactac ccaggaggat gcacgtcccg 3360 cgctagcaga aggggacctt gatccagtaa cagaggagga gctcaaggtg cacgcatggg 3420 gcctgtcgcc aaacagagca ccggggacgg acagcatcac ggccagaatg atcagagttc 3480 tctggccggc aatagccccc cggctcgtta ccatagtaaa cgagagtctg aggacaggac 3540 gattcccaaa cagctggaag atggctcagg tcataccaat actcaaggag caagataggg 3600 acacagcact ccctaagtct ttcaggcctg tgagtctgtt gcctgtaatg gggaaaatca 3660 cggagaaggc aataaattct cgactgtccg agcagataag accaaggctc tcaggtaaac 3720 agtttggctt tacccctggt cgctcaacat tggatgctat ccaaagccta ttaacatgga 3780 tcaatctcag tgcagaacaa catgtcataa cgatattcct tgacatttct ggggcttttg 3840 acaacctttt gtggccggca ctacagcgcg acctgtgtag tttgggtgct agtctacaca 3900 tgaggcatct cattgctgac tacctgagag gtcgcacggc taccatgacg ataggaggag 3960 tatccaaatc cgtgagggta atgaaggggt gcccccaagg atcaatactg ggtccagtat 4020 tgtggaacgt cacagtggaa ccgctactac aaactgattt cccggactat gtgaacatac 4080 aagcctatgc agatgacata gcaatatctg tcttagggcc aaacaggcgc acgctaatcg 4140 agcgggctga agccacgttg ctgccagccc tagcctgggc acatgaaagg ggcttgactt 4200 tctctgccca gaaatccgtt gcgatgatca ccaagggcac acttgttccg ggtttcactc 4260 tctccttcgg tgacgagagg atagtttcag ccgataatgg aagatacctg gggatacggt 4320 tagaccagaa gaagtcgttt atgccacact ttgaggcttt aaaaagctcg tctgagaccc 4380 tgttctcgag attaagagga accataggcg caggttgggg acttaggaga gagaatatca 4440 tgatcctcta tcgcggcgtt ttcgtgccaa agattgccta cggttcgcaa ttctgggtcc 4500 acaaaataaa gacacgacag atgatcagaa actttggctc catccagaga agagcactgc 4560 ttggtatgac cagtgcatac aatacaaatt caacggatgc cttacaggtg atagcgggag 4620 tcccaccgct ggatatagag atccaatggc tggtgcaaaa agccgaggtt gctttgctcc 4680 cagcacaatt gagaaacgaa actctaaagg ccaagagaga agccttgctg gatgagtggc 4740 aaacaagatg gtcgaactcg atcaagggca ggtggacctt tcggtttttc cccgacgtca 4800 gagaaagact actgctaccc ctttccctgg gtcacgaagt ggtgcagttc atgtctggcc 4860 acggaaactc ccgggccaaa ctagcgagtt tcaacttaca accagattcg aggtgcgcat 4920 gtggtatggg tgaagagttc gtggaccatg tgctgtataa ctgtaccctg catgccgaac 4980 acagggctcg cggtactcag agcaggtcac ctatggccgt gcgatccgga cacgttggtt 5040 tccacatcca agctttatac ggcgctagta agattctgca aaacggctgc ttacatggaa 5100 tgcccatgag gtggaaaaac accaggacgg acaagacact ctgggcgcca gagtgccaaa 5160 ccccgcatgt gttctggacg agtgtcaagg cgagagatgg cagaagcggt tcacatgtgg 5220 ggccgtcctg aggggacact cgttgcgtcg gggacgcggg cggcgctaac gcgtacaaca 5280 gagcatcgta caacatggag gcgccgcggt ggccgaaagc cctggtgagt agttcacacc 5340 tacatcgggc cctgtgtctg gggccatgga agaatacaga cacagtctac cccgcttgtc 5400 gtatgaggcg actaaagggg tggggttgcg acctacgcat cgcgggcgag cgcccgcggt 5460 aaaggcccac caagagcctg ctgaaacagc gaatcctggg agatcgggct tgccttgtcg 5520 aacatcctac cgacgatgac ctacctaacg ggggtgaggt cggtgccacg ggaagggggg 5580 gagccccact gccccgagtt atgcccgggt cagtgcccct ctactctggg cgcgtgtcgg 5640 cttgccggcg ggcgctcaga gccggtttgg tttcgtggtg gctgtagcgg gagctactga 5700 tatggttaga gccaaagctg gcaggcgaag gtctccggtg aacggctgta ccgtcacggg 5760 gatacaaatc tctttggcgt tgcagtttct cccgggcaca cagctggtct accgttggat 5820 aaaaacagcg gaggttgccc gtaaccaaaa acgggaagtg atgggatagg tcacgaccgc 5880 accagaggtg gcacacggtg gctccagacc tgtgccgtct ggtacggcgc ttcggcaccg 5940 gtgctggtgg tccatagacg acacgctcgc ggcgtgggca cttcagcgcc cccgcacggg 6000 tgaaccttta tggctcgtaa caagcgcaca gcgcttacac acggtccata gttgtagccg 6060 caaggcatca aaccgaggat actct 6085 // ID AeBuster3 repbase; DNA; INV; 3019 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A hAT DNA transposon family from Aedes aegypti. XX KW hAT; DNA transposon; Transposable Element; AeBuster3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3019 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3019 RA Kojima K.K. and Jurka J.; RT "hAT-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 7 CC sequences with >97% identity, and is ~100% identical to the CC original sequence in [1]. 8-bp TSDs. XX FH Key Location/Qualifiers FT CDS 1038..2813 FT /product="AeBuster3_1p" FT /note="transposase." FT /translation="MDKKRKYCEEYIKYGFTVLTINGIDKPQCVLCNIVLS FT VEALKPSKLKRHLETKHPEHIQKDTAFFQRHETGLIRSRFGATGSIQQQNT FT AVLQASYEAALEIAKNKKPHTIGETLIKPSMLKMVSLVLGEASAVKMRQVS FT LSNSSVQRRIADMSEDVKIQILEEIKQSPLFAIQLDESTDVSSCAQLLVFV FT KYVHLDNLKEEFLFCSDLETTTKSDDIMAKVKTFFDSNGLQWEKLCGVCTD FT GAPAMLGSKSGFQKKIKELAPQAKGVHCMIHRYALASKTLPKDLQKVLDSV FT IKIVNHIKSGALNTRLFKELCKEMNSDHETLLFHTAVRWLSKGNVLNRVFE FT LKHEIKTFLETQGKTDLLGNFYETGWNKRLAYLADIFDQLNSLNLKLQGKQ FT TNLILFQDSLQAFVAKIKNWRRKVKANNVAMFERLCSMLDEEAEEIETELK FT TEIIAHLDSLDEQFERYFPELTQEEAALVRNPFSPSLDVASLPDRIQDEYL FT DLRNDSSVRDLYNEKNLNQFWCTMYRSYPAVSLQALKILVPFASTYLCESG FT FSCLLQIKTKARNRLNVEDDMRLALSDTQPRITKLVSQMQTQQSH" XX SQ Sequence 3019 BP; 951 A; 595 C; 595 G; 878 T; 0 other; cagcgtttct caaccggtgg tccgcggacc cctggggggc cgcggagcct gatcaggggg 60 tccgcgaagc cttttgacgc gaccattcct tttaagcaac aaattttccg caagatattt 120 caaaaattga aaaaaaataa tttaaatgca cttataaatt tgatacaata ccatttttta 180 acaatatttg caaaatttgc agtattctat ccaatattaa taagattttt gcaaaccaga 240 aaaatttaaa atcgatggaa actttcaatt aagctttccg tcctcggctg gtcaacctta 300 aaattaaaca gtcgttggtt aattaatgtc cgtgtcctct tcaattatta aatttcagac 360 aaatgttcct ttctggcatg cataacatca ctcaaccaag gtgtcagtag gacgcgtgcg 420 taaaaaccga attatgattc agacgacatg gatcaaaaga tttaactttc attcacgttg 480 ggcttgccgt ttttccgttg ttgtcgcaat gccgtttcat tatgtttggg tcttaaccaa 540 gcagtcatga ttgctatgaa tgaagatctt ttgttttgtt tatattctta aggtgaatta 600 gcgcaagagc gtatttttcg acgcagcgtt ttttttttaa cttttcaatc atatctgttt 660 gcaatacaat acatattaag ttttcaattt gcaaaaaaaa cacacatgct atttttgtgt 720 caaacttcat tgataacatt tgaggtattc atgagatttt ggtaattgaa ccggcacagt 780 ggctttgtcg taaatgctgt cccgtatccc cttaatgaag aaatgtcatt cacaaaactg 840 gccggtcgag ctttgcatta ttcgttttcg ctctttgcaa aattcgtagt gaaattaacg 900 tttgtaatca atttgcttcg ttgtatttta aacgtttgag ctcgagttcc acggtaattt 960 tatcaattcg tgtgggtaaa acactttttg atgaattgga acagctacaa gtgttattta 1020 ttacaggcgc cgcaaaaatg gataagaaaa gaaagtactg tgaagaatac atcaaatatg 1080 gattcacggt tcttaccatc aatggaatcg acaagccaca atgtgtattg tgtaatatcg 1140 tgttaagtgt tgaagcactg aaaccatcga aactgaaacg gcatttggag acgaaacatc 1200 ccgagcacat tcaaaaggac acggcatttt ttcaacgcca tgaaaccggc cttatacgat 1260 cgcgattcgg cgccacaggc agcattcaac aacaaaatac cgctgttctt caggcgtcat 1320 atgaggcagc attggaaatt gcaaaaaaca aaaagccaca cacaattggc gaaactctga 1380 tcaagcccag catgctgaaa atggtaagtt tggttcttgg cgaggcaagt gctgtcaaaa 1440 tgcgacaagt atctttgtca aatagttctg tgcaaagacg aatagctgat atgtccgaag 1500 acgttaaaat tcaaatcttg gaggaaatca agcagtcccc tttattcgca atccaactgg 1560 atgaatctac ggatgtaagc tcatgcgcgc agctgttggt gttcgtaaaa tacgttcacc 1620 ttgataattt gaaggaagag tttcttttct gcagtgattt ggaaactact accaaaagtg 1680 atgacattat ggctaaggta aaaacctttt tcgattccaa tggtttgcag tgggagaaac 1740 tttgcggggt ttgtactgat ggtgctccag ctatgcttgg atctaaatca gggttccaga 1800 aaaaaattaa agaactcgct ccacaagcta aaggtgttca ttgcatgatt caccgatatg 1860 cccttgccag caagacactg ccgaaggatt tgcaaaaagt gctggattcc gtaataaaaa 1920 tcgtaaacca tatcaagtct ggagcattga acactcgttt attcaaagaa ctctgtaagg 1980 aaatgaactc agatcacgaa actcttcttt tccatactgc tgttcgatgg ttgtcgaaag 2040 gcaatgtcct taatcgtgtt ttcgaattaa aacacgagat caagacattc ctagaaacgc 2100 agggaaaaac ggatcttctt ggtaacttct acgaaaccgg atggaacaag cgtctcgcct 2160 atctggcaga catctttgat caactcaatt cattgaactt gaaactgcaa ggaaaacaaa 2220 ccaatcttat ccttttccaa gacagcctgc aagcatttgt ggcaaaaatc aaaaactggc 2280 gtcgaaaagt aaaggctaat aatgtcgcca tgtttgaaag actttgcagt atgcttgatg 2340 aagaagctga agaaatagaa acggaactca aaacagaaat cattgctcat ctggattcac 2400 tggacgagca attcgagcgc tatttccctg aattgacgca agaagaagct gctttagttc 2460 gaaacccgtt ttctccttca ttggatgtgg caagccttcc tgatagaatt caggatgaat 2520 acttggacct acgtaatgat tcctcggttc gtgatttgta caacgagaaa aacttgaatc 2580 agttttggtg tactatgtac agatcgtacc ccgcagtctc cttgcaagct ctgaagattc 2640 tcgttccatt tgcatcaacc tacctctgtg aaagcggatt ttcatgccta ctccaaatca 2700 agacaaaagc cagaaataga ctgaatgttg aggacgacat gagattggcg ctgtcagata 2760 cgcaacctcg gatcaccaaa ctagtatcac aaatgcaaac tcaacaatcg cactaaaaat 2820 ggttcaatga tctcaaatgt aatattcctt gaattgaatt gatgtttaaa tatacgccta 2880 ataaaaataa aacagagtaa taaatcaaac gtgtgctatg tacttttatt gaattctgtc 2940 tctgaaaatt ttaggggtcc gcagcttaat tgaaaatacg tcaaggggtc cgcgatccca 3000 aaaaggttga gaaccactg 3019 // ID BEL-139_AA-LTR repbase; DNA; INV; 483 BP. XX AC supercont1.251; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-139_AA_; KW BEL-139_AA-I; BEL-139_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-483 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.251; Positions 620527 620045. XX SQ Sequence 483 BP; 145 A; 104 C; 117 G; 117 T; 0 other; tgttgacgaa caaggtatct gccctgaacg aacgcagcca agcaagggaa tctgtcggac 60 aaggcgagat ccgtattgat cactcaagcg gtcactcaga ttacccatac ccatatgagc 120 attggactgg acagaatgta gcggagagat cgagagaagt cagttgtcca ctaccactgc 180 tcgcgatcgg tcatcggagg gcgaacctcc gcacgaattt ccgttttagt gtttttatgt 240 tacgaataaa tgttaattta agtttaataa agtgtaatca gtaaatgccg agaaaatata 300 gtgtgatttt gtgtaacctg tgttgtgtaa cctgctcttg aagaagaaac ccacttggaa 360 cgttgccatc gacgaaatcc tccccaagta atttggatga cgacgattcc taccagcaag 420 acgaaggtca tctgaagggc agccaagcca gacgagagga cgatttttga aggtatccca 480 aca 483 // ID Poseidon-3_HM repbase; DNA; INV; 2718 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Poseidon-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2718 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata (Poseidon RT group)."; RL Repbase Reports 8(12), 2086-2086 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 190..2466 FT /product="Poseidon-3_HM_1p" FT /translation="MKEYSKKLLIHARNEAKHRLYSSKKLINELNIFLQTK FT VRLHDYELINSITNKSKNYHYIKKKXEMKEKYHKLQITSIIKNTFRINPPN FT QVIKPAILNLTNVILDKSTTELLNLGPNFVPLQKKIPYMDIITNIETCALQ FT LEKLKKPTQAETLRQNCSQILTNSLKLRLKDNLTKQQRLSINKLKNNSNVK FT IYPFDKGTGFALLNQKDASLKLEEQIKNSKIIDYDPTSTLTTKFQRQLCKL FT RKEGKLDTNTYFKMYPSDCNPPRIYGMIKAHKPEKDYPMRPVVSTINTPPY FT GTSDYLVKIIQPTLNKNKSRLMNSNSFVNEAKQWIIDPNEVQVSFDIVNLY FT PSIPIDEAIPVIIDILNADIDDLKTRTKLTLADIHQLIELSLSICYFLYEN FT NIRIIPNSGPIGLSLMVVMAEAFLQNIERKALNIAIIRTSEPKTYERYVDD FT CHARFASIKQQQMFLNILNEQNPAIKYTVELENDLKQLNFLDINITNTGSG FT TYEFQIHRKEAITNVQLKPNSNINPNIIIGVFKGFLCRAKRICSQKHLQKE FT IDFLIDIFVENGHNKNILNNITIDYLKNGSKNTTFQPIDTQPFIKLPWIPI FT VGPKLRKEFRKQNIKVIFTSTPNLKNILCNNKTKLPPNTNPGVYQLKCSCX FT SIYIGETKKKVISRCIEHQKNSLKGKWSSSGATEHSETCHGKFDWSHPKTL FT SVAPEYXIRKIRESLEINKAKVRQEFGSXEKVLNRDDGDYVTTKTWGPFFI FT XIXDVNI*" XX SQ Sequence 2718 BP; 1117 A; 462 C; 334 G; 789 T; 16 other; tcaaagatta tattactact ttttacggca aaagatttac gatgatacca aaaagttaca 60 agatttgaag atacgaaatg cgaactcaaa aaaccaactc gtttttcttc aaaagtgctt 120 atccaataac ataactccaa aatcgtttaa aataaaaact ccattcatac taaaaaagca 180 aaaaacataa tgaaggaata cagtaaaaaa ctactaattc atgctcgtaa cgaagccaaa 240 cacagattat attcaagtaa gaaattaatt aatgaattaa atatatttct tcagacaaaa 300 gtaagattac acgattatga gttaattaat agtataacca acaaatcaaa aaattaccat 360 tacattaaga aaaaaaygga aatgaaagaa aaatatcata aattacaaat tacatcgatt 420 ataaaaaata cttttagaat aaatcctccc aatcaagtta ttaaacctgc tatactcaat 480 ttaactaatg ttattctgga taaatcaaca actgagctac tcaatttagg tccaaatttt 540 gtaccactgc agaaaaaaat tccttatatg gatattataa ctaatatcga aacttgcgct 600 ttacaacttg aaaaacttaa aaaaccaaca caggcagaaa ctctacgaca aaactgttca 660 caaattttaa caaattcgct aaaattaaga ttaaaagata atttaactaa gcaacaacgt 720 ctttccataa acaaattaaa aaacaattct aacgttaaga tatacccttt cgataaaggc 780 acaggtttcg cattattaaa ccaaaaagat gcatctctca aattagaaga acaaataaaa 840 aatagcaaaa ttattgatta tgatccaaca tccacattga ctactaaatt tcaaagacaa 900 ttgtgcaaat taagaaaaga aggtaaacta gacacaaata catactttaa aatgtatcca 960 tcagattgta acccaccaag gatttacgga atgattaaag ctcacaaacc agaaaaagat 1020 tacccaatgc gccctgttgt atcaacaatt aacacaccac cttatggaac atctgattat 1080 cttgttaaaa ttattcaacc gactctgaac aaaaataaaa gtagacttat gaattctaat 1140 agttttgtaa atgaagctaa gcaatggata atagatccta acgaagttca agtttcattt 1200 gatatagtca atttatatcc atccataccc attgacgaag caattccggt tattattgat 1260 atattgaacg ctgacattga tgacttaaaa actcgaacta aacttactct tgcagacata 1320 catcaattaa ttgaactttc attaagtata tgctattttt tatatgaaaa caatatccga 1380 ataataccta attcaggtcc tataggttta tctttaatgg ttgttatggc ggaagcgttt 1440 ttacaaaata tagaaagaaa agcccttaac atagcaatta ttcgtacatc cgaaccaaaa 1500 acttacgaga gatatgtaga tgattgtcat gctcgctttg cttcaataaa acaacaacaa 1560 atgttcctga acatccttaa tgaacaaaat cctgccataa aatacacagt ggaacttgaa 1620 aacgacctca aacaactcaa tttcctagat attaatatta caaacacagg ctctggaact 1680 tatgaatttc aaatacatag aaaagaagct attacaaatg ttcaacttaa acctaactcg 1740 aacattaatc ctaacattat tattggcgtt ttcaaaggat ttttatgtcg tgcwaaaaga 1800 atatgctctc aaaaacacct tcaaaaagaa attgattttc taatagacat ctttgttgaa 1860 aatggccaca acaagaacat tctaaataat attactatag actatttaaa aaacggttca 1920 aaaaacacca catttcaacc aatagatacc caacccttta taaaactacc ttggatacca 1980 attgttggtc caaaacttcg taaagaattt cgaaagcaaa acataaaagt tatttttaca 2040 tcaactccca atttaaaaaa tattttatgc aacaataaaa ctaaactacc tccgaacaca 2100 aacccwggcg tmtaccaact aaaatgttca tgcarttcka tatacattgg cgaaactaaa 2160 aagaargtga tatcgagatg tattgaacac caaaaaaata gcttaaaagg aaaatggtcc 2220 agttcaggtg caactgaaca ttccgaaaca tgccatggta agtttgattg gtcgcacccc 2280 aaaacattat cwgtagcycc agaatatcaw attcggaaaa tcagagagtc actcgaaata 2340 aacaaagcta aagttcgaca agagtttgga agtgrtgaaa aagttytaaa tagggatgac 2400 ggtgattacg tgacaacaaa aacttggggg ccatttttta ttaraataty tgacgtcaac 2460 atttaaagct attttgtatt tcgttatgct ctttttaatg ttatcttttt aacggttwgt 2520 ttttattacg acgattttct ttgtaacgat tttatttatt atacctgatg acgctgatca 2580 caatagatca gcgaaatatc gaaaaaatta tatattaaaa taaaaacaaa tatatatttt 2640 taaaaagttt atacgcagcc gtwtttattg aattctacat attttatcgt ataacaagaa 2700 aatgacattt aaagatta 2718 // ID BEL-614_AA-LTR repbase; DNA; INV; 586 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-614_AA_; KW Pao_Bel_Ele18; BEL-614_AA-I; BEL-614_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-586 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 586 BP; 154 A; 129 C; 149 G; 147 T; 7 other; tgtctgtgag gaccacccta acacctcata cgagctgacc agacggcaga cggcagacgt 60 agttcatatg gcttttgttc tgccaatcgc tcggctaccg gctacgttga tccacagaca 120 tcactttcgc tttctatcga gaggccgtga cgtcaccgat gcactatgag aaggcagatc 180 tgagcgggag aaaatagccc atgtgaatcg taggtagcgg tcaaattcac cacaggatat 240 ttgggagaaa atcggggata tttagagcag aattgggagt gatctagcag tagaaggtag 300 aacacggtcg aagacggtcc tgcgcgacca cggttwgwwa cwgatcgatt tgttagtgtt 360 agttcgaata aatgtgttgt gttagatktg cgttkaatta aagagctgtg ttttcgtaaa 420 tttgtgtttt gtgctgtgtg aatcggaaga gtcttgacca aggattaggc aggtccctca 480 caatttggag cctaaatatc gsacgaccaa accacccagc ctactgagcg tccaatcatt 540 accagcttta ccagccccag ctatcgaacg gttcaggtat cctaca 586 // ID L1-7_HM repbase; DNA; INV; 4790 BP. XX AC . XX DT 27-JAN-2009 (Rel. 14.02, Created) DT 27-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4790 RA Bao W. and Jurka J.; RT "L1-like retrotransposon from Hydra magnipapillata."; RL Repbase Reports 9(2), 431-431 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 3..473 FT /product="L1-7_HM_1p" FT /translation="SVKLENIPHSIEIDGITIALHFAGKSFLCKVCGNSHP FT PQKRCISQQKAENVNINVSETKKVTTTNNEPRILMSNANNTPKQVDSKIPI FT ITNTETNFEKVKSKKQKQEEKRKNELKDNDSDTEEGEIIESKNDKNYHKKN FT PPTNKEDQKRIPRQRAV*" FT CDS 672..4538 FT /product="L1-7_HM_2p" FT /translation="MESIKISVNICGLNDKNKQNVVITKFQEMNYDIIMLQ FT ETHLSKNNADDFLLKWNQLGFFSIGERHSKGTLTLINQAKNNLKLIEHFAD FT DSGNFNYVIIENQYHKYLIINLYASSGTSFKHERERKFLFKKITEKIKKLN FT YQNYVTILGGDFNMVLSDIDKYPSIFRKQCSSSEYLKALLNTLNVEDTWRI FT FNPQSLEYTYKSTNNITYSRLDRIYLSNHMRQYIDIYHEPMTHSDHHNAVH FT AHIKINNFKVGKDIWILNNNHLKNKSFKKSITNIINSHTTIINTSTYKNDE FT WDNFKTNIKIFSRKFSKDEAKNRRRKEFTMKKQLKNALKKAHLNPHMKNLS FT QNLKTNLENYEKKKAKGAAIRAKIDWRFQGEKCSKFFFHLEKEKTRKQLIT FT NINDSNGNEHNNTPEIIKSFHDFYSKLYKESKNSVIHQTILLENTDIIKIT FT EQQRQTLDQTITLTEVKAALQSMKSDKSPGSDGLTAEFYKENFDTLGETLL FT IVFNNIIKENELPNSLKQAIITCIYKKGNKSDIANWRPISLLNTDYKILTK FT IISNRLKHILPYIIKETQTACIHHRTIFESLSYTRDIIHIANEMKLDASII FT SIDQIKAFDRVDRTFLFDCLSAFSFSNSFINIIKTLYYDINSRLKINGFLS FT EEIKIHRGVRQGCPLSMSLYIIQAEIFAGYIRNNSQIKGITINNKETKLQQ FT YADDTNFFLTTDDSIKALGEALALNKLATGTKINANKCQGIWLGKNRNGNK FT ENYLNFSWKEHSFKCLGVVFSNTNFSHSKQWDECTNKIEKKIKSWKRFKLS FT FKGKRIVINQSLLSSIYHLAFTIPLPNDKKIKSIERLTEKYLWDNSTIKTK FT KEISKLSIEKGGLNIIDIQTKLKSIQLSWISKLFDQQYNGAWKDAASYILN FT NYREANQGNQVFFTSHSNDSLKTLPLFYSQMLKNWNSIYSQCDNVDKLSYE FT ELLNQPLFYNPYIKHLNTSLKPTPNTEKNNITKIADISKVFVPGFLDYQLV FT DMKPRQISIIINSLPPQWKLRINTETQKYDHNKAIPYLYSKNNRPIPITEL FT KPKVLYKILNTQKYDKTTVNLSKWKHYFKKSMSNITDEEWVKLFLTTXKNN FT TNNKADEIRYKLIHFKLPTNEIFKKRGYKINELCPQCMKTKENLPHMIYNC FT KKVQPLIKYSISLINKIYTQIPHLKNTFKTFLFAPSQILFLCNIGRIIINE FT CLYKIYCNRQRSFHENKKFSRIALLVEFQNRIKKILNEEYQFARENDSLEA FT FLKEIKQIWCKRNKTLKLELDINRFLN*" XX SQ Sequence 4790 BP; 2057 A; 830 C; 597 G; 1300 T; 6 other; catcggtaaa attagagaat attccccact caattgaaat agacggtatc acgatcgcgc 60 tacactttgc tggaaaatct ttcctctgta aagtttgtgg aaactctcac cctcctcaaa 120 aaagatgtat ttcgcaacaa aaagcagaaa atgttaatat aaatgtttcc gaaacaaaaa 180 aagtaacaac aacaaataac gaacctcgaa ttttaatgtc aaatgcaaat aacaccccga 240 aacaggtcga tagcaaaata ccaataatca caaataccga aacaaatttc gaaaaggtaa 300 aaagcaaaaa acaaaaacaa gaagaaaaga gaaagaatga attaaaagat aacgatagcg 360 atacagaaga aggcgaaatt atagaatcaa aaaatgacaa aaactatcat aaaaaaaatc 420 ccccgactaa taaagaagat caaaaaagaa tcccaaggca gagagcggta tgatatagtt 480 tttctttata ttcttttaca tatcttttat tattttgacg gatctatcta ttttatttta 540 acaaaaacgt atcttgatat tacaaattat ctgtcagtat aaccatctat tcatttcgtc 600 atatcaatta ttatcatacc acacactcat tataaaccga aaaaacgaca aataactgaa 660 cttttcatga aatggagtca attaaaatct cagtaaatat ttgcggtttg aacgacaaaa 720 ataaacaaaa cgtcgttata acaaaatttc aagaaatgaa ctacgatata attatgctac 780 aagaaacgca tctaagcaaa aataacgctg acgatttcct tttaaaatgg aaccaactcg 840 gtttcttttc tataggcgaa aggcattcca aaggaactct aactctaata aaccaagcaa 900 aaaacaattt aaagctaatt gagcattttg cggatgactc aggtaatttc aactatgtga 960 ttatcgaaaa ccaatatcat aagtacctaa tmataaatct atacgcaagt tcagggacaa 1020 gttttaaaca tgaaagagaa agaaaatttt tgtttaaaaa aattacagaa aaaatcaaaa 1080 aacttaacta tcaaaattat gtgacaatac ttgggggcga ttttaatatg gtgctaagtg 1140 acattgacaa atatccctca atttttagga aacaatgttc gtctagcgaa tatctaaaag 1200 ctttattaaa taccttgaat gttgaagaca cgtggcgtat ttttaacccc caatccttag 1260 aatatactta taaatctact aacaatataa crtactctcg tcttgataga atttatttaa 1320 gcaatcacat gcgtcaatat attgatatat atcatgagcc gatgacgcac tcagaccacc 1380 acaacgcagt tcatgctcat attaaaataa ataacttcaa agtaggaaaa gacatttgga 1440 tccttaacaa caaccacctt aaaaataaaa gctttaaaaa aagtataaca aacatcataa 1500 attctcacac tacgattata aacacctcaa crtataaaaa tgacgagtgg gacaacttta 1560 aaactaacat aaaaatcttt tcaagaaaat ttagcaaaga tgaagcaaaa aaccgccgcc 1620 gcaaagaatt tactatgaaa aaacaattaa aaaatgccct gaaaaaagca caccttaacc 1680 cacatatgaa aaatttaagt caaaacctaa aaactaattt agagaattac gaaaaaaaaa 1740 aagcaaaagg tgctgctatt cgagctaaaa ttgattggcg atttcaagga gaaaaatgta 1800 gtaaattttt ttttcacctt gaaaaagaaa aaacacgtaa acaactaatc actaacataa 1860 atgattcaaa tggtaacgag cacaacaata ctccggagat aataaaaagt ttccacgatt 1920 tctattcgaa actatataag gaaagtaaaa acagtgtaat tcatcaaacw atactactag 1980 aaaacacaga tattatcaaa atcaccgaac aacaacgaca aacactagat caaaccataa 2040 ctttaacaga agtaaaagct gctcttcaat ctatgaaatc agataaatca cccggaagcg 2100 acggcttaac wgcggagttt tataaagaaa attttgatac ccttggcgaa acattactta 2160 ttgtatttaa taatataata aaagaaaatg agttgccaaa ttcactaaaa caagccatca 2220 taacgtgtat atataaaaaa ggaaacaaat cagatatagc aaactggcga cccatttctc 2280 tacttaatac agattataaa attctaacaa aaattatttc aaacagactt aaacatatac 2340 ttccatatat cattaaagaa acacaaactg cttgtattca tcatagaaca atattcgaaa 2400 gcttatccta taccagagac attattcata tagctaatga aatgaaacta gacgcatcca 2460 taatatccat agatcaaata aaagcatttg acagagttga cagaactttt ttatttgatt 2520 gtctgtcagc tttcagtttt agtaattctt ttataaacat cataaaaaca ctatactatg 2580 acatcaattc acgtttaaaa ataaatggtt ttttatccga agaaatcaaa atacacagag 2640 gagtaagaca aggctgccct ctatccatgt ctctatacat aatacaagca gagatttttg 2700 caggctatat cagaaacaac agccaaatta aaggcattac tataaacaac aaagaaacaa 2760 aactccaaca atatgccgat gataccaatt tctttcttac aactgacgat tccatcaaag 2820 ctcttggaga agcccttgcg cttaacaaac tagcaacagg aacaaagata aatgcaaaca 2880 aatgccaggg tatctggcta ggtaaaaaca gaaatggaaa taaagaaaac tatctgaatt 2940 tttcttggaa agaacattca ttcaaatgtt taggtgtcgt cttttccaac acaaacttca 3000 gtcacagtaa acaatgggat gaatgcacta acaaaatcga aaaaaaaata aaaagttgga 3060 aacgttttaa gctgtccttt aaagggaaaa gaattgtaat taaccaaagt ctccttagta 3120 gtatatatca ccttgccttt acaattccgt tgccgaacga caaaaaaatc aaatccatag 3180 aaagattaac agaaaaatac ttatgggata atagcaccat taaaacaaaa aaagaaattt 3240 ctaaactttc tatcgaaaaa ggcggcctca atatcattga cattcaaaca aaattaaaat 3300 caatacaact aagctggatc tccaaacttt ttgatcaaca atacaatgga gcctggaaag 3360 atgcagcatc atacatctta aacaattata gagaagcaaa ccaaggaaat caagtctttt 3420 tcacaagcca ttcgaatgac tctttaaaaa ctttgccttt gttttatagt caaatgctta 3480 aaaactggaa cagcatatat agccaatgtg acaacgttga taaattatca tatgaagaat 3540 tgttaaacca acctttgttt tacaacccat acataaaaca ccttaacaca tctctaaaac 3600 caacaccaaa tacagaaaaa aacaatataa ctaaaatagc agacatttct aaagtatttg 3660 taccaggatt tcttgactac caactggttg atatgaaacc tcgacaaatt tccataatta 3720 taaactcgct accgccacaa tggaaactaa gaataaatac tgaaactcaa aaatatgatc 3780 ataacaaagc cataccatat ctatactcta aaaacaaccg tccaatccca atcacagaac 3840 tcaaaccaaa agtactatac aaaattttga atacacaaaa atatgataaa actactgtaa 3900 acctttcaaa atggaaacat tatttcaaaa aaagtatgag taacattaca gatgaagaat 3960 gggtcaaact atttctaaca acayacaaaa acaacacaaa caacaaagca gatgaaatac 4020 gatacaagtt aatacatttt aaactaccaa ctaatgaaat tttcaaaaag agaggatata 4080 aaatcaacga gctttgtccc caatgtatga aaacaaaaga aaacctacca cacatgattt 4140 ataactgtaa aaaagttcag cctttaataa aatactcgat ttcactcatt aataaaatat 4200 acacacaaat cccacatcta aaaaatacat tcaaaacatt tctttttgca ccatcacaaa 4260 ttttattttt atgtaatatt ggaagaatta taataaacga gtgtttatat aaaatttatt 4320 gtaacagaca aagatcgttc catgaaaata aaaagttttc aaggattgca ctcttggtag 4380 aattccaaaa cagaattaaa aaaatactga atgaggaata ccaatttgca agagaaaacg 4440 atagtttaga agcatttcta aaagaaatca aacaaatctg gtgcaaaaga aataaaaccc 4500 tcaaattaga attagatata aatcgctttt taaactaata tgaccagatt caattttgaa 4560 acgttttgtt tttaaactct tctttatttt aacatttttg aaataatctc tttgtttttt 4620 gatttttgat tttttcgttt ttgtttttct tcgatatttt ttgtttttaa taatggaaca 4680 gtaggcaaaa attgtaaaaa taatataaaa aaaaaaacaa ctactgtatt cattattaat 4740 atctctacat ggcataggcc ggaggagaga aataaactac aatacagtta 4790 // ID BEL1-LTR_SM repbase; DNA; INV; 737 BP. XX AC . XX DT 08-FEB-2008 (Rel. 13.02, Created) DT 08-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE A consensus sequence of BEL-type family (LTR). XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1-LTR_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-737 RA Jurka J.; RT "BEL-type family of LTR-retrotransposons from Schmidtea RT mediterranea."; RL Repbase Reports 8(2), 40-40 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 737 BP; 241 A; 105 C; 140 G; 248 T; 3 other; tgagaatttg gttagtctgt tcccttagcc ccatgttgtt tggtaacttg gagaatggat 60 tcaaaatttg agtcttttct ccaagttatg gttaaagtgg atgaaaaaga ttagaaaaca 120 atgtaaatgg tagaaatttg aatgattaaa agtgaaattt aagttgtaat tataatttgt 180 aaattaattt gaataaattc ctcaaaatgc ctgctattct taaatttttg aataattctg 240 ataaattatt tgcagttggt ggatttatgt ttatgacata caaatgtggt gtctggtcag 300 ataatgattc tagtcggagc tcatttaata cattacggca ttctattatt gcaaaatctt 360 gtgacgttat tcctagtaga gccggcgaaa ttaaagaggc tagcacagct cgagaagctt 420 tgtctactct taaaccacaa gttggaatta yatctcatat tatttctcct caagctgcac 480 aattcgataa aagtgaagca ggaaaaactc ttcgttggat ggatatgtta aatgctacag 540 ttccaagtcg tattaaaact gcacttgata aaatcccaac aaatattcaa gaatgtgcta 600 tggatgaata tcataacttt ctgtttgatg gtctagtttt gggatatttg atggcagcgt 660 tgggttcagg tcatagtgaa caattaaatg gaaaagcctg gayaaaytct gacgcaaaca 720 ttctcatcct caatcaa 737 // ID CR1-76_HM repbase; DNA; INV; 3855 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-76_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3855 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1903-1903 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(64..738,742..1305,1292..1459,1441..3777) FT /product="CR1-76_HM_1p" FT /translation="MVLKADFEALEKRVSALEKAIKAKDDQLTEKXQQIEE FT LIKKLEDKNCNDVRGNEELWTDVVKGKQKRKSINQLNLINAVAKESNEQKM FT REKNVIIFGLAASTNKNKDNAFDEDKKNVISIFNKIDTNVNIEHIIKLKTR FT NENQPPFIVVLKDKHERNKVLKSSKVLKDDDSFSKVFINPDLTEAERHKAK FT LLREECHRMNTERKQRNENNYYFSIRYDRVVKITVQLPKNYKINKIDYQKS FT TDRNKKTINKVFKKINYESKSIESINVLSENTKKPTINNNDVTENFLSCWY FT TNATSLNCKYNEFIAEIVTSHPQILFISETWFNEMSLSNVPGYNIFRKDRI FT GKRGGGVCIYVKENIKSVTVSDKMLNDDDLEQVWTSLHIGNEKILCGCIYR FT PGTIDYELNLKIKINYNINITISIKRSYNLLIKSKYSGLLICGDFNYFSIK FT WNKENIGELINNSDISAKEFLKSSRXFKKFLKCLNDCFITQNVIKPTFQIN FT EFKSTNVLDLILTESNKRIIDLNHFDNLGALSHGHQILKFLYSFRNNLKKT FT ETVKKNITLYKHGNYNGFSDMLNSYDWHLEFKNLDANVSYNKFVDIYNKGL FT IYFIPTITKGVVKTKAPWMNSNLKKLIRKKRNLWRKCLHNRFLNKGNVLEY FT KSIRNKVKKEIKNQIKLYETTIALNVKKNPKAIYAYINKKSKVKEGINALN FT VNGKLETNYLSIAHSLNNYFSSVFTNEISDNIPFFKNRTNIICEDPTFSPD FT TVSIILKSLNPNKCTGVDGVHPFPLKCCADAFSLPLSLIFNKSYDTGIIPS FT MWLNANLTPLFKSGDKSEPSSYRPVSLTSVISKVMERILKATFMSHLENNN FT LLTKEQHGFSKKKNCCTNLIESMDLITQAMEDNLPVDIIFLDMKKAFDSVP FT HNRLCLKLKSYGFNYKAVRWCKNFVCNRVQRVVCNGEFSNWSPVTSGVPQG FT SVLGPFMFIVYINDLPNELSTNCKMYADDIKLINIIRSSSDIEKTQTDLNK FT LMTWSQLWLLKFNVSKCKRMHIGSKNTKHIYTMYDSVLNRVELPETKIEKD FT LGVMISNNLKWSSQVNYVSGKANKMLGIIKHSFKNLDINASKLLYTSLVRP FT YLDYANSVWYPYLEQDKKKIESIQRKATRIKCLQGKNYESRLTYFKLTTLE FT KRRXRNDLIQLYKIVKKIDNLTLEYPPDLIASNTRGHNMRFHRQLTKNPKR FT YYFLTNRVINSWNNLDQNIVDSKTLKQFKSKLDLFLT*" XX SQ Sequence 3855 BP; 1621 A; 545 C; 565 G; 1119 T; 5 other; ccgcttctaa cgcgataaga cgtgtttttg aggaggagaa aaaatttata aatttttaat 60 attatggtat taaaagctga ctttgaggct ctagaaaaaa gagtatcagc tttagaaaaa 120 gcaatcaaag ctaaagacga ccagttaaca gaaaaaaamc aacaaataga agaactaata 180 aaaaagttag aagataaaaa ttgtaatgat gtaagaggta atgaggagct ttggactgat 240 gttgtaaaag gtaaacagaa gagaaaatca ataaatcaac tcaacctgat taatgcagta 300 gccaaagaat caaatgaaca aaaaatgcga gaaaaaaatg taattatttt tggtttagca 360 gcatctacta ataaaaataa agataatgca tttgacgagg ataaaaaaaa tgtgataagt 420 atttttaaca aaattgatac aaatgttaat atagaacaca ttatcaagtt aaaaacaagg 480 aacgaaaatc aaccaccttt tattgttgtc ttaaaagata aacatgaaag aaataaagtt 540 ttaaagtcat caaaagtatt aaaagacgac gactcgttct cgaaagtatt cataaatcct 600 gatcttacag aagcagaaag acataaagca aaactcttaa gagaagaatg tcaccgaatg 660 aacactgaaa gaaaacaaag aaatgaaaat aattattatt tcagtatcag atatgatcga 720 gtagtgaaaa tcactgtata gcaattacca aaaaactata agattaacaa aatcgattat 780 caaaaatcaa cggacaggaa caaaaaaacc atcaataaag tatttaaaaa aataaattac 840 gagtcaaaat caattgagtc aataaacgta ttatctgaaa atacaaaaaa accaacaatt 900 aataataacg acgtaacaga aaatttttta tcatgttggt atacaaatgc tacatcatta 960 aattgtaaat acaatgaatt tattgctgaa atagtcacat ctcacccaca gatacttttt 1020 atcagtgaaa catggtttaa tgaaatgtca ttaagtaatg ttcccggtta taatatattt 1080 cgcaaagatc gaataggtaa acgaggaggg ggagtatgca tatatgtcaa agaaaatatt 1140 aaatctgtta cagtttctga taaaatgtta aatgatgacg acttagagca agtttggaca 1200 tcattgcata ttggcaatga aaagatactt tgcggatgta tttataggcc aggcacaatt 1260 gactatgaac ttaatctaaa aattaaaata aattacaata tcaattaaac gaagttacaa 1320 tctattgata aaaagtaagt attccggact attgatatgt ggagatttta actacttttc 1380 tattaaatgg aataaagaaa acatcggcga acttattaat aattcagata tttctgctaa 1440 agarttttta aaaagttctt aaaatgttta aatgattgtt ttataacaca aaacgtaatt 1500 aaaccaactt ttcaaattaa tgaatttaaa tccactaatg ttcttgacct aatattaaca 1560 gaatcaaaca aaagaataat tgatttgaac cattttgata atttgggtgc acttagtcac 1620 ggtcatcaga tcctaaagtt tttatatagc tttagaaata atttaaaaaa aacggaaaca 1680 gtaaaaaaaa atataactct gtacaaacac ggcaattata atggcttttc cgatatgtta 1740 aatagctatg actggcatct tgaattcaaa aatttagatg ctaatgtttc atacaacaaa 1800 tttgtcgata tttacaataa aggacttata tatttcattc caacaattac aaagggagtt 1860 gttaaaacaa aagcgccttg gatgaactct aacttaaaaa agttaattag aaagaaaaga 1920 aacctctgga gaaaatgttt acataatcgt tttttaaaca aaggaaatgt attagaatac 1980 aaatcaatta gaaataaagt aaaaaaagaa attaagaacc aaattaaatt atatgaaact 2040 accatcgcct taaacgttaa aaaaaaccca aaagcaattt atgcatatat aaacaaaaaa 2100 tcaaaagtta aagaaggaat taacgcatta aatgtaaacg gcaaacttga aacaaattat 2160 ttatcaatag cacattcctt aaataactat tttagctctg ttttcaccaa tgaaattagt 2220 gacaatatac cattctttaa aaataggaca aacattattt gtgaggaccc tacattttca 2280 ccggacacag tttctataat cctgaagtca ctaaatccaa ataaatgtac aggagtagat 2340 ggagttcacc catttccttt aaaatgttgc gcagatgcgt tctcccttcc tttatcactt 2400 atttttaata aatcgtatga cacaggaata attccatcga tgtggttaaa tgctaatcta 2460 acaccattat ttaaaagtgg agataaatcg gaaccatcga gttatcgacc agtatcactt 2520 acttcagtta taagtaaagt aatggaacgt atcttaaaag ctacrtttat gtcgcactta 2580 gaaaataaca atctcttgac taaggaacag catggatttt ctaaaaaaaa aaactgttgt 2640 acaaacctta tagaatctat ggatctaata actcaagcaa tggaagataa tttaccagta 2700 gatattattt tcctrgatat gaaaaaagct ttcgattccg tgcctcacaa cagattgtgt 2760 cttaaactaa aaagttatgg ttttaactat aaagcagtta gatggtgcaa aaattttgta 2820 tgtaacagag tgcaacgtgt tgtctgcaac ggcgaatttt caaattggtc tccagttaca 2880 agtggagtac cgcaagggtc tgtcttgggt ccgtttatgt ttattgttta tataaacgac 2940 ttaccgaatg agttatcaac gaactgtaaa atgtatgctg atgatattaa actaattaac 3000 ataattagat catcttctga tatagaaaag actcaaacag atctaaacaa attaatgaca 3060 tggtcacaat tatggctatt aaaatttaat gtcagcaaat gtaaacgcat gcatataggt 3120 agcaaaaata ctaagcatat atacacaatg tatgattcag tgttaaatag agttgagcta 3180 ccagaaacca aaattgaaaa agatttagga gttatgattt caaacaactt aaaatggtct 3240 agtcaagtta attatgtgtc tggtaaagct aataaaatgt taggaataat aaaacattca 3300 tttaaaaact tagacataaa cgcatctaaa ttattgtaca cctctcttgt aagaccgtat 3360 ctagactacg caaattcagt gtggtatcct tacttagaac aagataaaaa aaaaattgaa 3420 tcgatacaac gtaaagcaac acgaataaaa tgtcttcaag ggaaaaacta tgaaagtaga 3480 ctaacctact ttaaattaac gactcttgaa aaaagacgcr aacgtaatga cctaatccaa 3540 ctttacaaaa tagtaaaaaa aatagataat ttaacattgg aatacccacc agatttaata 3600 gcttcaaata cgagaggtca caatatgagg tttcatagac aattaactaa aaatccaaag 3660 aggtactatt tcttaacaaa ccgtgttatt aacagctgga ataacttaga tcaaaacatt 3720 gtagattcta aaacacttaa acaatttaaa tcaaagttgg acttattttt aacataaatc 3780 ggctgttaaa gcctagatca actaggctcc gtacattgtc attgtacatg tacacagtta 3840 ttaaataaat aaata 3855 // ID Kiri-5_AAe repbase; DNA; INV; 4644 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4644 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 700-700 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >93% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 595..1101 FT /product="Kiri-5_AAe_1p" FT /translation="MDYIAESVRRCDKNKELIISGVPYQNQEDLVEVFKRI FT SGSIGFKEXKIPLVELQRLSRSPIAPGLTPPILCEFALRNHRNEFYRNYLS FT KRSLCLRHIGFESENRIYINENLTPNARRIRSEAVKLXKAGRLDSVTTRDG FT IVCVKPKGSEKATALHSLPQVASFIRNPIQ" FT CDS 1661..4474 FT /product="Kiri-5_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDNDTTFGNNASDGSYIPKAVLNSILMNGKLNVCHLN FT VQSLCARQFTKFEELKNTIFESKIDIACFSETWLDSSISDGMIEIKGFNLI FT RNDRNRHGGGLCVYIRKGLSYRLIHKSESFISTEFLIVELLLGRDRLLICV FT YYNPPNIDCSDILRGHFDEYSVKYDSTFFIGDFNTDPNKHTRKSQCFRDTI FT SSMSYSIVNSEPTFFYKTGSSFLDLFITDSQDKVLKFNQISMPGISKHDLI FT FASLDYSHVKGEHGYWFRDYCHFDAASLREEFHIINWNEFFSINDSDIMAN FT ILNNHLSALHEQFFPLKFQRFRKNPWFNNSIQKAIIERDLAYRNWKRSKTA FT EHETIYKRLRNKVNSVVEKAKAGFDRQRLNVNLPSKQLWQNVRNLGIATKV FT TRPVDNDLTADSINEYFSSNFCADDSRSPEIIGNESGFKFRPVLDYEIINA FT IFEIKSNAVGLDCIPISFIKIITPLVLPMLEHLFNAIISTSKYPSVWKRSK FT VIAINKKPNSSSLTNLRPISLLSSISKVFEKLIKRQISEFVNRMNLLHPFQ FT SGFRKYHSTETALMKVHDDIARTIERNGMTILLMIDFAKAFDRVSHNKLVS FT KLISLYDFSTEAARLIKSYLCNRSQTVWHNGHFSDFIPAISGVPQGSVLGP FT LLFSLFINDLPSVLNFCSVHLFADDVQIYICTDDSSDLTAIANMMNNDLRL FT IMNWSNENLLPINPSKTKAMLISKQKTNITPPKIYINESEVSFVQKINNLG FT VIFTNNLEWEDHIIYQCSKIYGSLKRLNLTTRNSNIDTKLKLFKTLLLPHF FT IYGDFVYTNALTGSLHKLRLALNACVRYVYNLSRYSHVSHLQKNLIGCPFN FT NFYKYRTCINIFKIIKTSNPVFLNEKIIPLRGTRTGNYRVPQHSSMYYSQS FT FFVRGIVNWNSLPTSIKLINSKACFKRDLLLYLQ" XX SQ Sequence 4644 BP; 1476 A; 860 C; 821 G; 1468 T; 19 other; gtttctgaag ggatgtacta cwgtacataa gctaaaagtg ctagcagcca agccgtacgc 60 ttatcactag aaaactatcg ttccttagtg aaatactgct cattttckac gaattgaagt 120 cctctgaagt gsaaaaacat cgagtttwga tcaaaagtgc tttcgtcgtt acaatttktc 180 catcttawaa tacgccactt tacgttatta tagtggtgtg aacccttcag aatatcttta 240 tctggtgttt acggttcacc acttgtcatc gcttcgcaat gwtgcaaaas aacaaaatcg 300 tcacacgwtc gggatcaacg tcgtctctga atacagtggg atcgcaagcc aaacggtctm 360 gtgaggatat ttctaaacat gatgacgagg aaatcgctag cttgaatgat ctgtggacga 420 aaatgcaaac atgatgttca gcaccagtga aaggattgag gcaaaaattg aaagctgcaa 480 taatgtgctg gaaaagctat ctcgttggtt gaaagtcagc tagctgcggt tcgggatkag 540 ttttctggca gagtcgttag actcgaggaa gaagtcacag ctacacgtaa tgagatggac 600 tacatcgccg aatcagttcg tcgctgtgat aaaaataagg agctaatcat ttctggtgtt 660 ccttatcaaa accaagagga tttggtagaa gtwttcaaac gaatttctgg atccattgga 720 ttcaaggagm caaaaatacc gctggttgag ctgcaacgtt tatcacgatc tccgatcgct 780 cctgggctga caccaccgat actatgtgag ttcgctctcc gcaatcacag aaacgagttc 840 tatcgcaact atctctcaaa gcgctcgctc tgtcttcgtc acatcggatt cgagagtgag 900 aaccgtatct acatcaatga gaacctcaca cccaatgcta gacggattag gagtgaagct 960 gttaagctga mgaaagcagg acgattggat tctgtaacaa cgcgtgatgg aatcgtatgc 1020 gtgaagccca aaggatctga aaaagctacg gcacttcatt cattgccaca ggtcgcttcg 1080 ttcattagga accctatcca ataaatgctc twaacttcct tcctgctgca tccgtgtttc 1140 caatcsttmw ttccmtcatt ccaacctctc ctaaaagtca atttaaaagc taaaaaaaaa 1200 cctatccttg caagctctca aactcctacc atgccattcc atgaatccgt tccactgcaa 1260 tttccatgat ctcttccctc ctaaaagtca accgctggat tgctgttgct gttggagttg 1320 ttgttcgacc atggtgctgc tgttgctgtt gttctaattt tgctgctgct gttgctggaa 1380 acacttttgc tatgattttg atatctcttt tttgaattat ggagaattta attaaagtta 1440 tagaatatct agaagtaaaa ctttacatta taaattttga tctactggat gttggtacgt 1500 aacggtttgg ctgtaagcca agaatgcgat aggcgattca aaattatgtg gattttttct 1560 agtttagttt ctaggtttag gttctcttta attatacatt actcgtcctt tgttcgtatg 1620 ttaggtaggt ttcttttgaa gcaaatcgtt ctatttgaca atggataatg atactacgtt 1680 cggtaacaat gcgtctgacg gatcttatat accaaaagct gttttgaatt caattctaat 1740 gaacggaaag ctcaacgtct gtcatctcaa cgtgcaaagt ctatgtgcac gtcagtttac 1800 taaatttgaa gagcttaaaa acacaatatt cgaaagtaaa attgatatag cttgtttttc 1860 ggagacttgg ttagatagtt ctatttctga tggtatgata gaaattaaag ggttcaacct 1920 gataagaaat gatcgtaatc ggcatggtgg tggactttgt gtttatataa ggaaaggttt 1980 atcatatcgt ttgatacaca agtcggagtc gtttatttct accgaatttt tgattgttga 2040 actacttctg gggcgtgatc gtttgcttat atgtgtttac tacaatcctc ctaatataga 2100 ttgttcagat atactacgag gacattttga cgaatattcc gtcaaatatg attctacatt 2160 ttttattggt gatttcaata cggatcccaa taaacatact cgtaaatcac aatgcttcag 2220 agacactata tccagcatgt cttacagtat agtcaattca gaaccaactt tcttttacaa 2280 aacagggtct tcatttttag atctttttat aactgattcg caagataagg ttttaaaatt 2340 caatcaaatt tcaatgcccg gtatatctaa gcatgacctt atttttgctt cgctagatta 2400 ttcacatgta aaaggtgagc atgggtactg gttccgtgac tattgtcatt ttgatgctgc 2460 ctcattacgt gaggaatttc atatcatcaa ctggaatgaa ttcttcagta taaacgattc 2520 agatatcatg gctaatatct tgaataacca tcttagtgca ctgcatgaac agttttttcc 2580 attgaaattt caacgttttc gtaaaaatcc gtggttcaac aacagtattc aaaaggccat 2640 aatagaaagg gaccttgcat atcgtaattg gaagcgtagt aaaaccgctg agcatgaaac 2700 catttataag agactcagaa acaaggtcaa ttcggtcgtg gaaaaggcaa aagcagggtt 2760 tgatagacaa cgattaaatg taaacttgcc gagtaagcag ctttggcaga atgtacgaaa 2820 cttaggaatt gcaaccaaag ttactcgtcc tgttgacaac gatctcactg cagattctat 2880 caatgaatat ttctcgtcta atttttgcgc tgacgactcc agaagtcctg aaataattgg 2940 caatgaaagc ggtttcaaat tccgaccggt tttggattat gaaataatca acgctatatt 3000 cgaaattaaa tcgaatgctg ttggattgga ctgcattcct ataagtttta tcaaaataat 3060 tactccgtta gtgctaccca tgttggaaca tctatttaat gcaataatat ctacaagcaa 3120 atatcccagc gtatggaagc gatccaaagt aattgctata aataagaaac ctaactcttc 3180 atcattaact aatttaagac cgataagcct attatctagt atctccaaag tctttgaaaa 3240 acttattaaa cgtcaaattt cggaatttgt aaatagaatg aatttgttac atcccttcca 3300 atctggattt agaaaatacc atagcactga aactgcttta atgaaagttc acgacgacat 3360 tgctcgaaca atcgaaagaa atggtatgac tatcttgtta atgatagatt ttgccaaagc 3420 ttttgatcga gtgtcccaca ataagctagt aagtaaactt atttcgcttt atgacttctc 3480 cactgaagct gcaagactca ttaaaagtta tttgtgtaat cgtagtcaaa ctgtctggca 3540 caatggacat ttttcggact tcataccagc tatatcagga gttcctcaag ggtcggtact 3600 tggtccactt ctattttctc tttttataaa tgacctcccg agtgttttga acttttgctc 3660 cgtacatcta tttgctgacg atgtacaaat ttacatttgc acagacgata gttcagactt 3720 gacagcaatt gcgaacatga tgaataatga tctacgcttg attatgaatt ggtccaatga 3780 aaatttgttg cctataaatc ctagcaaaac aaaagccatg ctaatttcca aacaaaaaac 3840 caacattacc cctccaaaaa tttatatcaa tgaatccgaa gttagttttg tacaaaaaat 3900 caacaattta ggtgttatat tcacaaataa tttagagtgg gaggaccata ttatatatca 3960 gtgttcaaaa atttatggtt cattaaaacg actaaatctt acaactagaa atagcaatat 4020 tgatacaaaa ttaaaacttt ttaaaacatt gctgcttcca catttcatat atggagattt 4080 cgtttacaca aatgctttaa ctggatcact tcacaaatta agactagcgt taaacgcgtg 4140 tgttagatat gtatacaacc tttccagata ctcacatgta tcacacctac aaaagaacct 4200 tataggctgt cctttcaata atttctacaa atatagaact tgcatcaata ttttcaaaat 4260 aataaaaaca tcaaatccag ttttcctcaa tgaaaaaatt ataccattga gaggcacacg 4320 aacaggtaat tacagagttc cacaacatag ttcaatgtat tatagtcaat ccttctttgt 4380 cagaggcatt gttaattgga acagtttgcc gacttccatt aagctgatta attctaaagc 4440 atgcttcaaa agggatctct tattgtactt acaatagacg gttgtgttag ttaaagagaa 4500 caatttttca aaattcaatt atgtacgtta gagtcatttt ttttcatcaa ttgcaatctt 4560 aactgtaaca ataaaaaaga tgtatatctt acgttacaag gtgatcaata aataaataaa 4620 taaataaata aataaatata taaa 4644 // ID Gypsy-18_DWil-I repbase; DNA; INV; 5607 BP. XX AC scaffold_181039; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_DWil_; KW Gypsy-18_DWil-LTR; Gypsy-18_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5607 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181039; Positions 240686 246292. XX CC Positions [4714-5193] - Integrase core CC 'ACAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1233..2285 FT /product="Gypsy-18_DWil-I_2p" FT /translation="MPNQRRARPNRGNNTRPNREIDYTVIQNMIEQTVSRV FT VSSLPMNLGQREVPLDTFQTRSLISSPCPTTVQNAKIANVMQKWNLRFDGS FT TEGLGVEEFIYRVKALTEETLDSDFIGVCKHIQILFGGKAREWYWRYHKQV FT PRIVWSEFCSAIRQQYKDYRSDFMSKEMIRARKQKAGESFVCFYDEVASLI FT DKFGIKFEEEELMEILQNNLLPEIRQKLLYQSIASIGHLRRLVQMQENLTQ FT ELSPQVKVISNTKANGRRQIFEIQEGETEDNKIVPDPDEEYEVAAVQGKFV FT CWNCREAGHTWQNCLEERTIFCYGCGTANVYKPQCQDCIKRLTENRPKGAS FT RQRQMLPQ" FT CDS 2959..5562 FT /product="Gypsy-18_DWil-I_1p" FT /translation="MFPNASHHLGRTSLIKHHIEIGEAKPVKQRFYAISPA FT VEKLVYTEIDRMIQLGVIEPAVSAWSSPIRPVIKPNKVRLCLDARKLNAVT FT IKDAYPLPSIDSIFSRLPKANIISKIDLKDAYWQIELTEASKPLTAFTVPG FT RPLYQFTVMPFGLCNATSTMSRLMDEVVPADLRHCVFCYLDDLCIISEDFS FT SHLSVLARIAEQFRKANLSINIEKSQFCVASISYLGYVIDSKGICTDPSKI FT ECIKNWPPPRNLKQTRGFLGVCGWYRRFIPNFADLTYPITELLSTKRNFVW FT TPEAQASMNKLKDLLTSAPILQNPDFNKKFFLHCDASDYGIGAVLVQLSDA FT EEEKPIAFMSKKLSRAQRNYSVNERECLAVILAIEKFRCYLELQEFEVVTD FT HSSLLWLMRQQNVSGRLARWIFRLQQFKFNMSHRKGKEHLIPDALSRLPNL FT SIEAIEMGPIIDLDSEAFLDDSYMYLKKKITENPDKFPDIKIVNKHLYIRT FT EHANGDSYQEKLSWKLWVPDKLRVDVLKQTHDSHTSSHSGMQKTVDKIRRT FT LFWPGMVKDVRDHIRQCDTCKESKAPNYTLRPPMGQHIPTCRPFQRLYIDL FT LGPYPRSKNGFVGLLIVLDHFSKYHWLQPLKKFRTEDIQEFLLKQIFHCFG FT VPEIIVSDNGVQFKANSFNSFLTELGIKHQYTALYSPQSNAAERVNRSLLA FT AIRSYLKNQDQTLWDQHLTSISCALRSSLHQSLGCSPFKALFGLDMVTHAS FT DYKPLKELKLLQEPITPLNQTDQLNLLRQDIRENIKKAYDRNVAQYNLRSK FT SVSFKEGQEVFRRNFALSKFSQNFNAKLAPKFIKCRIKKKLGNCFYLLENM FT QGREVGTFHAKDIRN" XX SQ Sequence 5607 BP; 1848 A; 1035 C; 1102 G; 1622 T; 0 other; taaaattttt ggcgccgtgg ggcaggataa aagcatactc aactccttga caatattaga 60 tatagacaag aagtttattt aagttacggc atgtggttat tttgacaaca gaccacctta 120 gaagtatgta gttctaccct aagtgatata aagtgttgag ttcaagtata tcaggaaagt 180 cttaagatca aaacatgagg tttcaaattt ttcgtcgatt atgatagact atattatggt 240 ttaacttacg cggataaatg aacctgcacg gtttaagtct gcttggcgaa gccgatgttt 300 ctgttaggaa tattcagccg caaagttaaa agcaagaatg aatgtaaaca agaatttcaa 360 taccctgtat cacaaattgc tcgtttagtt aaggtcctcg ataatttagc actagtttag 420 taaagtcgtt taaacttttc ttagcaattt agattattgt atattggatg tttcgatatc 480 agtactgatg ggatttaaga tcatgttacg acagctgaac gtctaaagaa atcgaagaga 540 acacgaacat attgttttat agtaagactc ggatgattta ataatctggt gtctggatct 600 tggtaataca gatagtatag ccttgccgat cagaagattc aaaaggccga agcctgacta 660 aaagaaacaa ttttgtagac gcaatagccc aaaaacaaaa ataataaagt taccaatatt 720 ctttaggttc atttagctag ttacgatttc aaatcatgta cccagtcatg tcacgcttag 780 gaagataatt ttaaaaaaag aatgactcta gttcaacata gtcccccaaa tacggacact 840 ccagcagaaa acgtatgcaa aatttgttcc ttggttttgg acaacacgca aacatgtttc 900 tgtactccct gtaacacatt tttcataaat cctgtttgga agctctacta gaaaaggaaa 960 agcattgccc aatttgtgag agtgagataa atgtgtctag tataaagctc gtagtaggca 1020 agtgttcacc aaaagcgaag gaaaaacatt tgtcattggc acccagcact tctcgagcat 1080 ggactcgtaa ttttacaaaa cagctaagtc aagtcccaag agacagagca gaacagctca 1140 acgaaccagc tatatcagtt agcgaagcac cagcatcatg cttagatgcc agtcacagta 1200 gtaatcaacc agaaccgcaa ctacctagaa ccatgcctaa tcagcgaaga gcaagaccaa 1260 atcgtggtaa taatactcgt cctaatcgag agatcgatta tacggtcata caaaatatga 1320 tagaacaaac agtttctcgt gtagtatcgt cgttacctat gaatttaggc cagcgagaag 1380 tcccattaga cacgttccaa acccgatctc tgataagttc gccctgccct acaactgtgc 1440 aaaatgcaaa aattgcaaat gtaatgcaga aatggaattt acgttttgac ggatctacgg 1500 aaggtctagg agtagaagaa tttatttata gagtaaaagc cctaacggag gaaactcttg 1560 atagtgattt tattggggtg tgtaaacata ttcaaatatt atttggagga aaagcccgcg 1620 agtggtattg gagatatcat aagcaagtcc ctcgaatagt atggagcgag ttttgttcag 1680 ctattcgtca gcaatataaa gactatcgat cagattttat gagcaaagaa atgattcgtg 1740 ctagaaagca aaaagctgga gaatcttttg tttgttttta cgatgaggta gcttctttga 1800 ttgacaagtt cggtataaag tttgaggaag aggagctgat ggaaatttta caaaataacc 1860 tgttgccaga gatacgacaa aagttactat atcaatctat cgcatcaata ggtcaccttc 1920 gtagattagt acagatgcaa gaaaatctaa cacaagagct cagccctcaa gtaaaagtaa 1980 tttctaacac caaagcgaat ggacgcagac aaatttttga aattcaagaa ggggaaaccg 2040 aagataataa aatagttcca gacccagacg aggaatatga agttgccgct gtgcaaggga 2100 agttcgtttg ttggaattgt cgcgaagccg gtcacacctg gcaaaactgt ctagaggaac 2160 gcacaatttt ttgttacggt tgcggcacgg ccaatgttta caagcctcaa tgtcaagatt 2220 gtataaaaag gctaacggaa aaccgaccga agggtgcatc cagacaacgt cagatgctcc 2280 cgcaatagtt acatcaaact cggaaaacgt tcaacaaagt atcaatattg tgaatcacaa 2340 agtaaacgta cgaaagtctc aaaaactccc actaagattt ttgtcatatc ctgagcgttt 2400 acaacattac cttactgtta aagacagaat tttcagacct gaccaaccct tgtctaaacg 2460 cacgttaaga ctaaggaaat attatgatca cgcaagacaa accaagaaac tttttgttgc 2520 aacaattttt agaaacgaca acgataatag gagctatgca gaagtctcat ttcttacgtt 2580 cacagaatta ggtttattgg ataccggtgc cagcattagt tgtataggct cggatttagc 2640 acaaacagat ttttcacatt ttccccaatt ggtaaaaact aaaggtcaag tgcgcacagc 2700 ggacggaaac ggccaaaatg ttattggtat gttagaagta atagtacggt acggtagatt 2760 ggaaagaatg ttgaaattat ttgtaattcc tacgttgaag aaaagattaa tattaggtca 2820 tgatttttgg agaattttcg aacttgcccc gtctataatt agttcgatcg agattggttc 2880 tgaaaaaagt tagagactga ccctagttct tattctctca cgtattccca aattagacag 2940 cttgaagccg tcaagcaaat gtttcccaat gcaagtcatc atctcggacg tacgtcctta 3000 ataaaacatc atatcgagat tggcgaagcc aaaccggtaa aacaacgttt ttacgctatt 3060 agtcccgctg ttgaaaaatt agtatacaca gaaattgatc gtatgattca actaggagtt 3120 atcgaaccag ctgttagcgc ctggagctct cctattcgac cagtgataaa gcccaataaa 3180 gtgcgactgt gcttagacgc tcgcaagtta aatgcagtca caattaagga tgcttatcct 3240 ttgcccagca ttgacagtat tttttcaaga ttacccaagg caaatattat ttccaaaatc 3300 gatttaaagg atgcctattg gcaaatagaa ttaacggaag catccaaacc tcttactgca 3360 tttacagtac caggaagacc gttatatcaa ttcacagtta tgcctttcgg cctttgtaat 3420 gccacatcta ctatgagtcg tctaatggat gaagtagtcc ctgctgacct tcgccattgt 3480 gtattttgct atttagatga tctgtgtata atttcagagg atttctcatc tcatctttcg 3540 gtgttagcaa gaattgcgga gcaatttcgg aaagctaatc tgtccattaa catagagaaa 3600 agtcagtttt gtgtcgcaag cattagttat ttaggatatg taatagacag taaaggtatc 3660 tgcaccgatc cgtcaaagat tgaatgtatt aaaaattggc caccacctag gaatttgaaa 3720 caaactagag gatttcttgg agtttgcggt tggtatcgcc gttttatacc gaattttgcc 3780 gatctcactt atcctataac ggaactgttg tcaacaaaaa ggaattttgt ttggactccg 3840 gaagctcaag cttctatgaa taagttgaag gatctattaa cttctgcccc tattctacaa 3900 aacccggact ttaataagaa atttttctta cattgtgatg ccagcgatta tggtattggc 3960 gctgtacttg tccaattatc agacgcagag gaagagaagc ctattgcttt catgtctaaa 4020 aagttaagtc gtgcccaacg caattatagt gttaatgaac gcgaatgttt agccgttata 4080 ctggctatag aaaaatttag gtgttatttg gaattgcaag aatttgaagt agttactgat 4140 cactccagtt tattgtggct aatgcgtcaa caaaatgttt ctgggagatt agctcgatgg 4200 atttttcgct tacaacaatt taagtttaat atgtcgcata gaaaaggaaa agaacacttg 4260 attcctgacg cgttatcaag attgcccaac ctttcgatag aggcgataga aatgggtccg 4320 attatagatc tggactctga agcatttttg gacgatagtt atatgtatct gaagaagaaa 4380 atcactgaaa atccagacaa attcccagat atcaaaattg taaataaaca tttgtatatt 4440 cgaaccgaac atgccaatgg agattcttat caagagaagc tttcttggaa actatgggtg 4500 ccggataaac tcagagttga tgttctaaaa caaactcacg atagccacac ctcttcacat 4560 tcaggtatgc agaaaactgt tgacaagata cgtagaactc ttttttggcc tggtatggta 4620 aaggatgtgc gggaccatat tcgtcaatgt gacacttgta aggagtccaa agcaccaaat 4680 tatacgctca gaccacctat gggacagcat attccaactt gtagaccttt tcaacgtctg 4740 tatatcgatt tacttggtcc ttaccctcgt agcaaaaatg gatttgtcgg tttattaatt 4800 gttcttgacc atttcagtaa atatcattgg ttacaacctc tgaaaaagtt tcgcacggag 4860 gatatacaag aatttttgtt aaaacagata tttcattgtt ttggagtacc ggaaatcatc 4920 gttagtgata atggtgttca atttaaggca aattcgttca actcttttct tactgaacta 4980 ggcataaaac atcagtatac cgctttatat tctccgcaaa gcaacgcagc tgagcgtgtt 5040 aatcgttctc tgttagctgc gatccgttcg tatttaaaaa atcaggatca aacgctttgg 5100 gatcaacatt taacgagtat atcttgcgca ttgcgatcct cgttgcatca atctttaggt 5160 tgctcaccct ttaaagcttt gttcggactg gacatggtta ctcatgcatc agattataaa 5220 cctttaaagg aattaaagtt gttacaagag ccaataacac cgctcaatca gactgaccag 5280 ttaaaccttt tgcgacagga tataagagaa aacattaaga aagcatatga tcgaaatgta 5340 gcacagtata atcttcgcag taagtctgtc tcttttaaag aaggccaaga agtttttcgc 5400 agaaatttcg cgttaagcaa attctctcaa aacttcaatg ctaaattagc ccctaaattc 5460 ataaaatgtc gaatcaaaaa gaagctcgga aactgctttt atttgttaga aaatatgcaa 5520 ggtagagaag taggaacatt tcacgccaag gatatacgga actaattccc actaatatgc 5580 tgcctaacgt tgcatattat cgggtgg 5607 // ID Kolobok-1_NaeGru repbase; DNA; INV; 2434 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-1_NaeGru. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-2434 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2434 BP; 907 A; 398 C; 393 G; 736 T; 0 other; ggtggcggta ggtgtttccg aaaatattcc ttcaataaaa aaaaaaattg ttgtgatgaa 60 atacttttct gttcctttgc tctcctttcc gtctgctaat agtgaaccaa tatgtgctaa 120 tcttttatat cgtaaataat attattgtct agttgttttt caaattttga tcaatagata 180 ctagctttca cagagaaaat ttagagtttt acatttaaaa attaaattag atttttcact 240 atttgatcct ccttgatccc acgatcggaa gagtttctaa attcaaaata atgtagactg 300 tgtgttggtt acttttcgtc atgctttgtc atgctttaat tgtgatttta ccttcagttc 360 cacaagaacg aaaattggca aacaaagaac aacaacaaca aacaatcaca aaaaaatgca 420 aagaaaattt tgttttgttt gtggagtctt gacaggtgtt tttgcagcaa aattctacac 480 agttaagaat ggatttgttg gtaaaatacc tattcatcat gcatgccacg atggtataat 540 ttgtggaaaa cattataaca tattcaggag acaatttaat aatttagatg agaactttac 600 aaatattgaa tttggaacaa atgaaacagt acaacataat cactctaaca aaactcaatc 660 atatcaattt cacattcata atgaaagtaa cattttaccc ttagacccac agtcatttaa 720 aagtttagaa aatcaaaata tcggccttac cattattcca actgaccact tgattcgtat 780 tgtgaaacca gtacagtgta aaatctgtga gagcacaaat atacatttaa gagaaattac 840 taaaaatgga tttgaacaga tattccgttt tacttgtttg gcttgtaata cggtatttgt 900 ttttagaaca tgtccaagga atcttaaatt gaatagacat attattcaat catttgaact 960 ctcgaactct acctatgaat catttaaaac atttacaacc cttgtcggat tgccaataat 1020 taataagaaa tcatattatc aagcaacaaa caagttatgg agcagtagta gtcaagtatt 1080 aaatcaagaa atggacaagg aattggaaaa gattgcaaat cagaaaatga aacaagtgct 1140 tggtccttta atttctgcat ttgaactggg aagagaagtt ccggcattga tttactcctt 1200 cattcatgat gcaatatttt tggatatatc agttgatgca agatattcat ctaggagaaa 1260 tgtttacgaa tgtactcttg tggtattcga aacaaaggca aaaaaaacaa ttgaacgatc 1320 acacgttatc aagaaaagag catctaatag aaaaagtgcc ttacaatggt ttatgggagc 1380 ctctaaatta atggagccag aagcatgtcg tttagcaatt acatctttga aacacaagtc 1440 atttccagta atgggttatg gtattttaca tttcaaaatt ggctcttttg ttcatgacaa 1500 agactcaact gtggcagcag tcatcaagaa attagaacca acagcattag aaaagttaga 1560 tccaaatcat gtaatcaaaa atctgagcaa ggaagtagaa gaaaaagcac cccgtattgc 1620 cagtattatt gtatcatcct ttaggaaagc tctaaaatta gccaacacca ccactaattc 1680 aaatgagaaa ctccaagcat tattgaaagc ttatccacaa catttacaaa ataaccattc 1740 tctgtgcgat tcaggttgtc cacacacaat taaaccagaa aaattgatca ctaaagagga 1800 agcacaagtt gttaaagaaa tatttgacaa acgaatcaaa tatagtcaca aattttgtga 1860 tattgcatgc aatagccaaa acaatgagtc catacatagt actatttgtc atacaacacc 1920 aaagacatta gatttcagtc gaaattattg tggaagagct gatctggcaa taggtgaacg 1980 attgcttgga aaatttgcac aattaagaaa aattcaaaga tttattggta gtctcttcgt 2040 gattccaaaa gagcaggcca tcaagttaga tactcaatat catcaggata aaataaggaa 2100 acagtctaaa gcttataagg aaagaagagt ggagcttcgg aaaacaaaac ttaaggtaaa 2160 agaatctcca cagccaaagg gaacaattgg aagttgcaca tgttcgaagg gttcgacatg 2220 tacaaaagct tgtccatgca gaaaagccgg gaaatcatgt acttccaaat gctcttgttc 2280 ccttaccaaa tgttcaaaca acaaaaaagt ttgacttgta ataataaatg attttttatt 2340 aaaattgcta aatttgtaat tttgccaata atagatattc gaatatttgt aaaccaacat 2400 ttgcacttcc aattaaaata ggtagaccgc cacc 2434 // ID BEL-203_AA-I repbase; DNA; INV; 6478 BP. XX AC AAGE02027244; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-203_AA_; KW BEL-203_AA-LTR; BEL-203_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6478 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027244; Positions 116517 122994. XX CC Positions [5561-6142] - Integrase core CC 'ATACA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 684..5207 FT /product="BEL-203_AA-I_2p" FT /translation="MDDEQEYLKQKYELLMQIGDEKESSSRKSAISVKSKR FT DQLEKWMTAGSSADRMGHSKPMIIQTGVTLSLPIASEDGTGAKPPLSRNSL FT AKKIGNPSVPLNEGMLYAGLTVPVQGGSVIALTQSYEQLATAAVSESVSCC FT PTTLGLAPVITVSTIAPTGSSLTRTVPATATVTNARVQRAEKVRVTQSAIS FT GMPPVYSYPEMSIPSSSQNFNGIGGNQLAFPISATGPIPSRGSFPPSSAQL FT PVGSRSAFQYSLATEVAATQNQARYPTVAESTNSLFHSIPSAVDATFGIPV FT NSRVVPGNPASVVNPVIPGSMGIGYVQSSPPVQTIGPTSAQLAARQVMPRN FT NSTSKFNGDPQEWPIFYSAFKNTTDVCGYTDAVNLARLQRCLRGPALEAVR FT SRLLLPASVPYVMDTLHKLYGRPEILINSLLKKVRNVPPPKSDNLNTIVAY FT GLAIQNLVDHIVLADQQAHLSNPMLLQELIEKLPTPLNMQWGSYKQQFVSV FT NLATFNSFMSGLVNLASELSFDVDSAQNHPKQARAEKPRQKEKLFTHVSES FT SNPTKKENAVGETSSKSCSFCDKDDHQILNCAAFKALDIGGRWKAVRQKNL FT CRLCLIPHRKWPCRSKKECSAEGCRIRHHTLLHSTRSGNIDRSTNETVHQN FT HHRTKSFSLFRYLPITLYGNGKQVDTFVFLDDGSSSTLLEEGIAKQLGIEG FT EPDSLWLSWTGKIGRHEKTSRRLSVKISGVNKEEMYLLNDVRTVRELGLPI FT QTLNYPELAKVFPHLQGLPVSSYVNAKPGMIIGIEHVRMLTSLKTREGGNS FT DPVAARTRLGWCVYGRNTPSDESVEQLHVHAVESISNVELHDSMRKFFAVE FT EAAVTHQLESDEDKRARNILETTTVRREGRMEVGLLWRRDNFCFPDSFQMA FT FSRLKGLENRLAKDSVLRKRANEQIESFEQKQYVRKVSPEELRNTDSHRRW FT YLPLGIITSPKKPNKIRMVWDAAAKVGGVSFNDMLLKGPDLLVPLVDVLLR FT FREGKIAVCSDIREMFLRILIRDEDRWSQCFLWRNNPKEDVQVYMINVAIF FT GATSSPCSAQFAKNKNALEYAEQYPRAVRAIVKGHYVDDFLDSVNSVDEAV FT QLVQQVQHIHASAGFEFGKILSNSPKVLERLGETSPSTVKSLNLDKDAVHE FT RVLGVMWVPIADHFTPTKRQVLRAVMKLYDPLGFVAHFVVHGKILMQEIWR FT SGTNWDEPIAEHLRELWTKWIKLYESINEVQIPRCFFGDFQPQAIDGIEIH FT VFTDASIAACACVAYLRMLHQGDSWCSMVAAKTKVAPLRPLSVPRLELQAA FT LMGSRLLQRICTALTLNIRKRFLWTDSSTVLAWIRSDGRRYHQFVSFRIGE FT ILSLTSVDEWHYVPSKLNVADDATKWSAGPSFDPENRWFKAPSFLRESEEC FT WPTELAKDLSTADAASEEMHTVAVHQGIDELVEVERFSSWSRLVRTMVYVY FT RASKVWKRILNNHKSSLVPDRSEFVVAEETLWKQAHIHKRFKN" FT CDS 5288..6223 FT /product="BEL-203_AA-I_1p" FT /translation="MDSRISDAPHVSYTMKYPVLLPKDHRITFLLVQAFHQ FT RFLHANGETVCNELRQRFYIPKLRVLVRKVSRKCQLCKIKKAKPLAPMMGA FT LPKVRLMPFIRPFTFVGVDYMGPFLVKVGRSSVKRWICLFTCLTIRAIHLE FT LVHSLSTTSCVMAFRRFVARRGAPSEIFSDNGTNFVGANRQLSEEKRKIER FT INEDCSSTFTNANTQWHFNVPVAPHMGGPWERMVRSVKVAMQAISDSARYP FT SDEVMEMIMLEAEAIVNSRPLTYVPLEDENQEALTPNHFLLYGSTGIKQPA FT MNPANDAGVSTYADETNEMV" XX SQ Sequence 6478 BP; 1822 A; 1441 C; 1652 G; 1563 T; 0 other; atctccaagg taattcatcc taagttggat gattgattgt tgcccgtcgg acgtttctgc 60 ttcagttccg acgaatcgcg aaggcagaaa ggcgatctat ccaagaagac atctttcgcc 120 acagtacgct aagcgcgggg gaggaaccta acttcacatt cactgactgg atgcagccat 180 gggacggtgt acgagaagca gctaagtgca gtaaatttag ctcgtattta ctgtatccct 240 tgttaccgca tttgctcgtt cgcaaatcgc tctactagag ataagttagt cagagttaga 300 gttaggttca ttagtgcaac gatgtcggat aaaaatgtca atgtgtcgac tccgggcggc 360 gagaagcact gcgatcgcgg tgatatcgat gataatatgg tgtgttgcga cgtttgtgaa 420 gcttgggtac actttcagtg cgctggtgtt accgattcca ttggagatcc cgataggagt 480 tggaaatgta gtgcttgttt agccgactgg aatggctcct cgaagtctga ggtgtcattc 540 aacgagtcgt cggttagcaa gagtagtcgt gtttcgttga gactacagct cagcctgcaa 600 ctgctcgaag agcaacagaa gttgaataaa aagcgtgcag cggaagagtt gaaaattcgt 660 agacaggagg aaggttgaag gagatggatg atgaacagga gtacttgaag cagaagtatg 720 agttgttgat gcagatcggt gatgagaagg agtcttcgag cagaaaaagc gccattagtg 780 taaagagtaa gcgtgatcaa ctggagaagt ggatgactgc aggatcctca gcggatagga 840 tgggacattc gaaaccgatg attatccaaa cgggggtaac cctttcactg ccaatcgcca 900 gtgaagatgg caccggagca aaaccaccgt taagcaggaa ctctttggcg aagaagatag 960 ggaacccttc ggttccgcta aacgagggta tgctatatgc aggtctgact gttccagttc 1020 agggcggaag cgtcatagct ttgacgcaat cgtacgaaca gctggcgacg gcagcagtgt 1080 cagaaagcgt gtcatgctgc ccaactacgc ttggtttggc accagtgata actgtatcta 1140 caattgcacc gaccgggtct tccctaacac gcactgtccc ggcgactgcc acagttacta 1200 acgcgagagt tcagagagcg gagaaggtta gagttacaca atccgctatt tctggcatgc 1260 ctccggtcta ttcgtatcca gagatgtcaa ttccatcgag ttcgcaaaat ttcaacggta 1320 ttggtggaaa tcaactagcg tttcctatat cagctacggg tccaattcca tctcgaggtt 1380 cgtttccgcc aagttcagct caactaccag ttggaagtcg ttccgcattc cagtattcat 1440 tggctacaga agttgctgca actcaaaatc aagcgcgcta tccgacagtt gcagagtcaa 1500 caaattcgtt atttcactct atcccatcag cagttgatgc aaccttcggc attccagtta 1560 attcacgagt tgttccggga aatcctgcat cagttgtgaa tccagtcata ccagggtcaa 1620 tgggaatcgg ttatgtgcag agcagtccgc cagttcagac gattggacca actagtgctc 1680 aattggcggc acgacaagtg atgccacgta ataattcaac gtcaaaattc aacggtgacc 1740 ctcaggagtg gccgatattt tatagcgctt tcaagaatac tacggacgta tgcggttaca 1800 cagatgcagt gaatctggcc aggctacagc gttgtctaag aggaccggca ctggaggcag 1860 tgcgtagtcg tctgttattg ccagcttcgg ttccttatgt gatggacact ctgcacaagc 1920 tatatggacg acccgaaata cttatcaact ccctactgaa gaaggttaga aacgttcctc 1980 ctccgaagtc ggacaatttg aatacgatcg tagcatacgg tttggcgata caaaatctcg 2040 tcgatcacat tgtcctagcg gatcaacaag cgcacctatc aaatccaatg ctgttgcagg 2100 agttgataga gaagttaccc acgcctctta atatgcagtg gggttcctac aaacagcagt 2160 tcgtcagcgt caacttagca acgttcaaca gctttatgtc gggactagta aatctcgcat 2220 ccgaactaag tttcgatgtg gattcggccc aaaatcaccc gaagcaagct agggctgaaa 2280 aaccgagaca aaaagagaaa cttttcaccc acgtgagtga gtcatcaaat ccaacgaaga 2340 aggagaacgc agttggagaa acttcttcaa aatcctgctc attctgcgat aaagatgatc 2400 atcagatcct gaactgtgcg gccttcaaag cgttggacat aggaggacga tggaaagctg 2460 tgcgacagaa gaatttatgc cgattgtgtt tgattccgca tcgaaagtgg ccttgtcggt 2520 cgaaaaagga atgtagtgct gaaggatgtc gtattaggca tcatactctg ctgcatagta 2580 ctcgcagtgg aaacatcgat cgaagcacca acgaaactgt ccaccaaaac caccacagaa 2640 caaaatcctt ctccctcttt cgttaccttc ccataacgct ctacgggaac ggtaagcaag 2700 tagacacttt cgtgttcttg gatgacgggt cgtcatcgac gttgctcgaa gaaggaatcg 2760 cgaaacagct tggaatcgag ggcgagccag atagcctctg gttaagttgg acaggaaaaa 2820 ttggtcgaca tgagaaaact tctaggcgac taagtgtgaa aatttctgga gtcaataaag 2880 aggaaatgta cttgctgaac gacgttcgga cggtccgcga gcttggactt ccgattcaga 2940 cgttaaacta tcctgaacta gcgaaggttt tccctcacct tcaagggctt cctgtatcca 3000 gttatgtcaa cgccaaacca ggcatgatca ttggaattga acatgtgcga atgctcacca 3060 gtttaaaaac ccgcgaagga ggcaacagtg acccagtcgc tgcaagaact cgactagggt 3120 ggtgcgtgta tgggagaaac accccaagcg atgagtctgt cgagcagtta cacgtccatg 3180 ctgtcgagag cataagtaat gtcgaactgc atgattctat gcgaaaattc tttgcagtcg 3240 aagaggcggc tgttacgcac caattagagt ccgacgagga caaacgagct cgaaatatcc 3300 ttgaaactac caccgtgcga cgagaaggtc ggatggaggt aggtctgctg tggcgtagag 3360 acaatttttg ctttccagac agctttcaaa tggcatttag tagattgaaa gggctggaaa 3420 accgtttggc gaaggattca gttctacgaa aaagagcgaa cgagcaaatc gagagcttcg 3480 aacagaaaca gtacgtccgt aaagtttcgc cggaagagtt aaggaacaca gactctcacc 3540 ggagatggta tcttccactc ggaataatta cgagccccaa aaaaccaaac aaaatcagaa 3600 tggtgtggga tgcagcagca aaagtcggtg gagtatcctt caatgacatg ctactcaaag 3660 gaccggatct tctagtcccg ctcgtagacg tgttgctacg gttcagggag ggtaaaatag 3720 ccgtctgttc tgacatccgc gaaatgtttt tgagaatatt aattcgggat gaagatagat 3780 ggtcgcagtg ctttctgtgg cgcaacaatc ctaaagagga tgttcaagta tatatgatta 3840 acgtcgccat attcggagct accagctccc catgttcggc gcaattcgca aagaataaga 3900 atgccctcga atacgcagaa cagtacccca gggcagtgag agcgatagtt aagggtcact 3960 acgtagatga cttccttgat agcgtcaatt ccgtggacga agcagttcaa ctagtccaac 4020 aagtacaaca cattcatgca tcagcgggtt ttgaatttgg gaagattttg tcaaactcac 4080 cgaaagttct agaacgttta ggagaaacca gcccgtctac tgtcaagtcg ctgaacctag 4140 ataaagatgc ggtccatgag cgtgttttag gagtaatgtg ggttccgata gctgatcact 4200 ttacgcctac aaaacggcaa gtactgcgag ccgtaatgaa gctatatgac ccattagggt 4260 ttgtcgccca ttttgtggta catgggaaaa tcctgatgca ggagatttgg agatcgggaa 4320 caaactggga cgagcctatt gcagagcatc tacgagaatt atggaccaag tggataaaac 4380 tatacgagag catcaacgaa gtacaaattc ctcgctgctt tttcggtgat tttcagccac 4440 aagctataga tggaatcgaa attcacgtgt tcacggacgc cagcatagca gcgtgcgcat 4500 gtgttgccta tctcagaatg ctgcaccaag gcgatagctg gtgttcgatg gtggcggcga 4560 aaaccaaggt agcgccacta cgaccgcttt cagtacctcg tctggaactc caagcggctt 4620 tgatgggatc gcgtctactt caacgtattt gtacggctct cactctcaat attcggaagc 4680 ggttcttgtg gacggactca tctacagttt tggcgtggat tcggtcagat gggcgacgat 4740 accaccagtt cgtttccttc cgcatcggtg aaattctctc ccttactagc gtggacgaat 4800 ggcactacgt tccatcgaaa ttaaatgttg ctgacgacgc caccaagtgg agtgctggtc 4860 catcattcga cccagagaac cgatggttta aagcaccgtc attcctgcga gaatccgaag 4920 agtgctggcc gactgaattg gcgaaagact tgagtacagc cgatgcagca agtgaagaga 4980 tgcatactgt agctgttcat caaggaattg acgagttagt tgaagttgaa cgtttctcga 5040 gctggagtcg tctggttcgg actatggtat atgtgtaccg agcatcgaaa gtctggaagc 5100 gaattctgaa caatcacaaa agcagcttag taccagatcg aagcgaattc gtagttgcag 5160 aagaaacgct ctggaaacaa gcgcatatcc acaagagatt caagaattga gaaatggaaa 5220 aagagttccc aaacatagcc cgctgtatac gttgtctccc attctcgacg agtatgggat 5280 tatacggatg gatagccgaa taagcgacgc accacatgtt tcatacacaa tgaaataccc 5340 cgtattactc ccaaaagacc atcgcataac gtttctcctt gtgcaagcct ttcatcagcg 5400 atttctccat gcaaacggtg aaaccgtctg taatgaacta cgacaacggt tttatattcc 5460 gaaactgcgg gtgctcgtcc ggaaagtaag ccgcaagtgt caactctgta aaataaagaa 5520 agctaaacct ttggccccaa tgatgggagc tttaccgaaa gtacgtttga tgccgtttat 5580 tcgacctttc accttcgtag gcgtcgacta tatgggacca ttcttggtca aggttgggcg 5640 tagcagtgtg aaacggtgga tttgtctctt cacctgctta accattcgcg ctatccatct 5700 agagctggtg cacagcctat caacaacttc ctgtgtaatg gctttccgga ggttcgttgc 5760 tagaaggggt gctccatcag agattttctc ggataatggc acaaattttg tgggtgccaa 5820 tcgccagtta tctgaagaaa agcggaagat cgagaggata aacgaagact gttcgtccac 5880 ttttaccaat gcaaacactc aatggcactt caatgttccc gtagctcctc atatgggagg 5940 accatgggag cgcatggtgc gatccgtcaa ggttgcaatg caagctatat ctgatagtgc 6000 acggtacccc agtgacgaag tgatggaaat gattatgtta gaggctgaag caatagtgaa 6060 ctcccgtccc ctgacctacg taccactaga ggacgagaac caggaggcac ttacgcctaa 6120 ccattttttg ttgtacgggt ctaccggtat aaaacaacct gcaatgaatc ccgcaaacga 6180 cgcgggagta tctacctatg ctgacgagac gaacgaaatg gtttgaaccc acaaggccgt 6240 tgaaaccagg cgacttagta attgtagtgg atgaggcctc gaggaatagt tgggagagag 6300 gccgtatttt ggagacctat ccggacaaat ccgggaatgt gcggtgtgcg aaggtacaga 6360 caaatagagg agtcttcagt agacctgctg ttaaattggc cgttcttgat gtgacggcaa 6420 atgacgggaa cctagaagat attccgaggg aaccggaagt agttcacggg gtggggaa 6478 // ID Gypsy-38_NVi-I repbase; DNA; INV; 8383 BP. XX AC . XX DT 06-JUL-2009 (Rel. 14.07, Created) DT 06-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Gypsy LTR-retrotransposon from Nasonia vitripennis, interanl DE region. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8383 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1391-1391 (2009). XX DR [1] (Consensus) XX CC The 3'-end is incomplete. XX FH Key Location/Qualifiers FT CDS 1067..2773 FT /product="Gypsy-38_NVi-I_1p" FT /translation="MIVVPRRIEVERTNVTPERLNVGPAEKRVVHRSQRSN FT RSRRPADRVHRSRDRTVSRSPSRHSRHDGHRRERRDRRHRSRDSTPRARRE FT RRHRHRDRSDSEDSFSDDRSSHYSRDSRRNRRRRHRYSESSSSSRSDSLSS FT GWSSDGSTRSERVRSRKIETLTKSLKNWTPRFSGTEGRSKAENFLSSLTEY FT VRTRRPPARRLLSCLSCLLTDDAARWYRGAQSDFRSWRQFRRAFLFRFIGE FT TDEQEVKMDLARRTQGVGESISAFIACYKYLRGHLRHPPSMRDQVATVYAG FT LRPEYRNFMVNLPRYSLDDIERYGHQYERHCNLNAHWAPPPPSDKMNLPGS FT AFVADSGGKKKGKEVSAIEPPPETGNGQGXSSAGGTNPPNNTNKGKNNRNR FT KGNNNNGNQSNAGSNSAPAGGSSPTALVQVTPAGAQPANVAASDGLRIGNQ FT PWGGGNPQNCAPNPPPQQYFAPAQPPNNGLQVGRGNGSPKIGPCFGCGIEG FT HRAANCPKMNCFECGQSGHRTASCPKRLARQVYCQLCNKEGVTVREWSNCN FT ILQRLGNGVAGTQLNPLGPAK*" FT CDS 3284..6667 FT /product="Gypsy-38_NVi-I_2p" FT /translation="MTLGKGSAESESEKLNRLLKQALGPELSEGLADDASA FT DYGDPKRGLPTDPESTGAVPRPVPAVKCSVRLWEPKTPLSVVSEGGETDEE FT SVQNDPVATDNVPPVEDIDAIVDDNRKFMRLCLGGRWYTALVDSGASVSLM FT KADVAERFSSRLEPCDTVIRGFNGYSATVKGVLNRLKIRVVEQCYHDIILG FT IDFLEKWRVELAPWDNRWRLPDGMWHDFCGKVASNGKELYAECAGLSELTQ FT SEHDRLLAVVKRFLPENPPTDLKATWLIKHKIVYVGNKTVKHKHRRWSPVR FT WAIAMREIEEMRRMGVIERSNSDFSSAPVVANKEGGEKRFCVDYRDVNDGT FT VKDAYPLPPIDSILDKLRNARYITKIDLRKAYFQVPLDEDSKKYTAFSMPG FT SGLWQFTRMPFGLANAPATFQRLVDSLFGPEFEPYVFGYLDDIIIVTENFE FT DHLKWVEIVLKRITEAGLTVNRKKCEFGCSQVRYLGYLLDREGLRPDPERI FT APVLEYTPPKTKKQLRRFLGMVGWYSRFIPNDSEIKVPLLKLLRNTQPWVW FT EEEQQEAFEALKLALTRAPVLARPDFTKRFTIQCDASDFAIGAVLTQEFED FT GEHPIVYLSRVLSPAEKNYTTTEKECLAMIFAIKKLRPYIEGYGFTVITDH FT SALRWLQNLKDPTGRLARWALELQQWDFEIVHRKGANHHVPDALSRSKGED FT EEVVSAFDVITDEWYLQRRRDVITSPKKFREWKVEDEMLYRYRRDPLLDPI FT THPSEGWRLVVPIEYRPRVLSDAHREASSGHLGVEKTYDRVAREYYWPGVW FT HDVNHYVQECDECQRYKTSQLGTAGQMGQRIVERPWAVVAADLMEFPTSKS FT QYKYLIVFQDLFTRWVELKPIRKATGQAVAKAFEELILFRWETPDFLLTDN FT GKEFINKTLKDTLEEYGVTHVTTPPYHPQANPVERSNRTLKTMIATFVGKD FT HRTWDQHLHEFRHAVNTAMQSSTRVSPAFLNYGRHPQPVKSLRREVEVRGP FT KIELDPSVWIDRVKRLDALRDIVLKHLDQAWGKQAAVYNQGKRVLEFEVGD FT VVRRKVHSLSKGVDGKCNKLDANYSDPCVILQKLSPLVYILETKDKRQTAK FT VHLSELRRYVPPRNRAANT*" FT CDS join(6686..7405,7390..8130) FT /product="Gypsy-38_NVi-I_3p" FT /translation="VVLFLFCAIASLRLVTGMRRLTDSILMDVKSPGSTMK FT TVSWAEHIAEMESECERELRAIEDQREKLRQSRPRTDMTRAAEIWARRSPA FT CPSXSSMEEEERWSAGVTDTRIPRKTMIGAAGPLPLTEEEREAYQREGIPV FT EEEPSALEESRAAVDELEALVGDGDDLTITISNERAEAVGEYPRLRGPNGK FT RRKYNPLRHLELSRWWQDHRATVTTPRRVHQSVAKPWRYVYHSISATGFRR FT HRFPLTFATFQVDIPFYQPPPAPRNAPLMPRLEIPGAESDSSGSPSTGAVP FT RRYGGTTNTGSRSRSSRRRAAVRLDGVMFGGLTPIATDPTLDPPPRTCFNC FT WQPGHRRTGCRAPPSRFCFNCGRQETDLANCPRCRDAHRRHMRHQGFRPRG FT RGNPKGEAGMPGNWEYPAQYPQIDAPPASYDEAMAGREPPYRPAIRMPPPR FT DERDAGQYPRPLPSLWMEALHHIATAPPEVLALLRSHLSNA*" XX SQ Sequence 8383 BP; 2124 A; 2039 C; 2481 G; 1722 T; 17 other; acatggcgcc caccgtgggg cttttacgct aggtagcaac aaagattttc ccttttcctg 60 gcgtggcaag taggaggtga taaagtaatc gacgattgcg tcattttgtg cctcagtatt 120 ttccccgttg cggcatattt ccgacagagc ggagcctgcg agcccacatt aagattgcgc 180 caagtatcga gatgttggat aatcgggtaa acgattatcg ggcgcggatg cgcaccatct 240 tgtcgcgatc gagagattta cgagcggtgt tgaagaattg taaaagattg tccgtggacg 300 aggttctcat gcgtttgacc gaatcaaagt ttagtacggc tgggtcacat acagaaagag 360 ccgagcggtt attctgtttc attattatgg tcacgggaca ggcgaaagat gtcccgtggt 420 accccgaatt cgacgaggaa cagaatattc cggtaggaaa cgttcctggg ccgcagttag 480 tgatagatgc gtctgacgga ccgatgatga gaagagggag gcgaaggagc cgcggcggcg 540 gtggtcgaag cgagtgaaag cgtggacgca aatccgaatg cgccagatcc cccacctgag 600 cgacagcctc caacgggtga taaggcgagc gagggaaaca aatcgctgcc tttggtgcaa 660 cgtagcacca gttcaacggg gactataccc aagcagacac gatcggccgt gctaaatcct 720 cgccccgcgc ccgtaggctt ggaatcgctg caacaacagt tagagcaggc tgcatttaag 780 cgtctagacg cctctgagcc gacagcgcgg actcctagaa ggcgaattga accgattcta 840 gaggtagaca cgccggaggc cgataaatct agcgccccgt tgacattgcc aaaccctaat 900 gaattgtccg tgaatcctcc cttcaaggaa atcagcgttg cgccggcccc tggggccaag 960 aagaaagcgc cagttccgcg cactccgaca gagggaaagg aggtcgagaa accgaaaggt 1020 aaacgggaaa ataggacgcg ttcaaacgaa aatcgtccgg tttcggatga tcgtcgtacc 1080 acggcgaata gaagtcgaga gaacgaacgt aacgcccgaa cgcctgaacg tcggccccgc 1140 cgagaagcga gtcgtgcatc gcagccagag aagcaatcgt tcgcgccgcc cggctgatcg 1200 tgtacatcgc tctcgtgatc gaactgtttc tcgttctcct agtcgtcatt cgcgccatga 1260 cggtcatcgt agagaacgac gtgatagacg acatcggagt cgagattcca ctccccgagc 1320 gcgtcgagaa cgtcggcatc gtcatcgcga caggtcggat agtgaggatt cgtttagcga 1380 cgaccgcagt agtcattaca gccgtgattc gcgacgaaat cgaagacgga gacaccgata 1440 tagtgagtca tcgagtagtt cgcgatccga ctctcttagt agcgggtggt cgagcgacgg 1500 aagtacgcgt agtgagcgtg tgcggtcgcg aaaaatcgag acgttgacga aatctctaaa 1560 gaattggact ccgcggttca gtgggacgga agggcggtcc aaggcagaaa attttttgtc 1620 tagtttgacg gaatacgtgc gtacccgtcg gccgcccgct cgtcgcctgc ttagttgtct 1680 gtcgtgtttg ctcacggacg atgccgcgcg ttggtaccga ggtgctcagt ccgatttccg 1740 tagttggcga caatttcgcc gggcttttct gttccggttc atcggcgaga cggatgagca 1800 ggaggtcaag atggacctgg cgcgtcgtac tcaaggggtt ggagaatcaa tctccgcctt 1860 tatcgcgtgc tacaagtacc ttcgcggtca tttgcgccat ccccccagta tgcgggatca 1920 agtagcgacg gtgtacgccg gactgcggcc ggaatatcgg aatttcatgg taaatttacc 1980 gcgttattcg ttagatgata tcgagcggta cggtcaccag tacgaacggc attgtaactt 2040 aaacgcgcac tgggcccctc cgccgccttc tgataagatg aatcttccgg gttcagcgtt 2100 tgtagccgat tccggaggga agaagaaagg caaagaagtc tctgccatag aaccgccgcc 2160 ggaaacggga aatgggcagg gacakagtag cgccggtgga actaatccac ccaataacac 2220 gaataagggg aaaaataacc gaaatcggaa aggcaacaac aataatggga accagtcgaa 2280 tgcgggctcg aatagtgctc cggcgggagg ctccagtcca acggctttgg tacaagtcac 2340 gcctgcggga gcacaacctg cgaacgtggc tgccagcgat ggactacgca taggcaatca 2400 gccctgggga ggaggaaatc ctcagaattg cgcgccaaat cctccgccgc aacagtactt 2460 tgcgccggcc cagccgccga ataatgggct ccaagtaggc cgcggtaacg gatcccctaa 2520 aatcggaccg tgttttgggt gtggaatcga gggacaccgc gccgcaaatt gcccgaagat 2580 gaattgtttc gaatgcggcc aaagcggaca tcggacggct agttgtccga aaagattggc 2640 acgacaggtg tactgccagc tgtgcaacaa ggagggggtg acggtacgcg agtggtcgaa 2700 ttgtaatatc ctccagcgtc tgggaaacgg agtagcggga actcagctca accccctggg 2760 acccgcgaag tgaggggggt tgaacattca gtagcgccga tcgttccagt agatcctggt 2820 ttggtcattg gcgccgatac cccagtagta gcgtcagggg gtgtatcaga cgaggcggaa 2880 ggggaaggag agtcggattg gggagaagag tggccggagt atatagacgc tcacccgcga 2940 gagaagtcga agtcgaagag atgtacgaag gaagaattgg cgccatccga agggtttggc 3000 gaccagttga agtcgaagtc gaagagaggt acgaaggaag aactggcgcc aaccgaagta 3060 attggcgacc agccgaagtc gaagtcggag agaggtacgt aggaagaact ggcgccaacc 3120 gaattaattg gcgaccagcc gaagtcgaag tcggagaaag gtacacwgga aaaacgggyg 3180 cctaccgmrg taattggtga mcagttggaa tcggtgakaa maaagtcgga gattcwarcg 3240 gagcccaagt tggcgccgac cgcgatstcc awcgagtggg cggatgacgt tggggaaagg 3300 tagcgccgaa tccgagagcg aaaaactgaa tcgtttgcta aaacaggctc taggccccga 3360 gttaagtgag ggtttggccg atgacgcgag cgccgattat ggggatccga aaaggggttt 3420 accgacggat cccgagtcga cgggggcggt gcctcgaccc gttccggccg tgaaatgctc 3480 agtgcgtctg tgggagccta aaacccccct tagtgttgtt tcggaggggg gtgagactga 3540 tgaagaatca gtacagaacg accctgtagc gaccgataac gtacccccgg tggaggacat 3600 cgacgcgata gtcgacgaca accgaaaatt catgcgatta tgcctagggg gacgttggta 3660 tacggctctc gtggattccg gggcatcggt ttcgttaatg aaggctgatg tagccgaacg 3720 tttcagttcg cgcctggaac cttgcgatac ggtcatccga gggtttaacg gttatagcgc 3780 cacggtcaag ggagtactca atcggttgaa aattagggtc gtggaacagt gctaccatga 3840 tattatttta gggatagatt tcctagaaaa atggcgcgtg gaattagccc cgtgggataa 3900 tcgttggcgc ctgccagatg gaatgtggca cgacttttgc ggaaaggtag catcgaatgg 3960 caaggaactc tacgccgaat gtgccgggtt atcagaactc acccagtcgg aacatgatcg 4020 attactagcc gttgtaaaaa gatttttgcc ggaaaacccg ccaactgatt tgaaagcaac 4080 ctggttgatt aaacataaaa tcgtgtatgt gggaaataaa accgttaaac ataagcatcg 4140 tcgttggtcg ccggttcgtt gggctatcgc gatgcgcgaa attgaagaga tgcgccgaat 4200 gggggtaatc gaacgttcaa atagcgactt cagtagcgcc ccggtagtag cgaataaaga 4260 aggcggggaa aagagatttt gtgtagatta tcgcgacgta aatgacggaa ccgttaaaga 4320 cgcgtatcca ctaccgccaa tagatagtat tttagataaa ttacgaaatg ctcggtacat 4380 cacaaagata gatttgcgga aagcctactt tcaagtgccg ctagacgaag atagtaaaaa 4440 gtataccgcg ttttccatgc ctggatcggg tctatggcaa ttcacgcgaa tgccatttgg 4500 gttggctaat gcaccggcta cctttcagcg cctggtggat agcttatttg gaccagaatt 4560 tgagccgtac gtgtttggtt atttggatga tatcattata gtaacagaga atttcgaaga 4620 tcacctgaaa tgggtggaaa ttgtattaaa gcggattaca gaagcggggt tgaccgttaa 4680 ccggaagaaa tgtgagttcg gttgttcgca agttaggtac ttgggctatt tgctagacag 4740 agaagggtta aggcccgatc cagagcgaat tgcgccggtg ttagagtata ctccaccaaa 4800 aacaaagaag cagttgcgcc gatttctagg aatggtgggg tggtatagtc gttttatccc 4860 gaatgactcg gaaattaaag tkcccctgtt gaaactactg aggaacacgc aaccgtgggt 4920 atgggaagag gagcaacaag aagcgttcga ggcgctaaag ttagccttga ctcgcgctcc 4980 agttctcgcg cgccccgatt ttacgaagag attcacgatt cagtgcgacg cgtccgactt 5040 cgcaattggc gcggtactca cgcaagagtt tgaagatgga gaacatccaa tcgtgtatct 5100 aagccgcgta ttatcgccag cggaaaagaa ttacactacg accgagaaag aatgtctagc 5160 gatgatcttc gcgatcaaga agttgcgtcc gtatattgaa ggatatggtt tcacggtaat 5220 cacggatcat agcgccttgc gttggctcca aaatctgaaa gacccaactg gccgattagc 5280 gcgttgggcc cttgagttgc aacaatggga cttcgaaatc gtacatagaa aaggagccaa 5340 ccatcacgta cccgacgcgt tatctcgtag taagggagaa gacgaagaag tagttagcgc 5400 ctttgacgta attacggatg agtggtacct tcagcgccgg cgagacgtga ttacgtcgcc 5460 aaagaaattt agggagtgga aagtcgagga cgagatgcta taccgatacc gccgcgaccc 5520 attattagac ccgattactc acccgtcaga gggttggcgc ctggtggtgc cgatagaata 5580 tcgcccccga gttctgagtg acgcccatcg cgaagcttcc tccggtcatt taggagtaga 5640 aaagacttat gatcgcgtcg cgagagaata ttactggccg ggagtatggc acgatgttaa 5700 ccactatgta caagagtgcg acgaatgcca gcgttataaa acttcccagt taggaacggc 5760 cggtcaaatg gggcagagaa ttgttgagcg cccgtgggcg gtagtagctg ccgacctgat 5820 ggaatttccc acgagtaaga gccaatacaa gtacctgata gttttccaag acttattcac 5880 gcgttgggtc gaattgaagc caattagaaa agcgacgggt caggcagtag caaaagcatt 5940 cgaagaatta attttattcc gttgggaaac gccagatttt ctgttaacgg ataacgggaa 6000 ggaattcatc aataaaactc tgaaggatac cttggaagaa tatggtgtca cccacgtgac 6060 caccccaccg tatcacccgc aagcgaatcc ggtagagcgg agtaaccgaa ctttaaaaac 6120 aatgattgcg acgttcgtgg ggaaagatca ccgtacttgg gatcaacatt tgcatgaatt 6180 tcgtcacgcg gtaaacacgg cgatgcaatc gagtactagg gtttcgccgg cgttcttgaa 6240 ctatggacgc catccccaac cagttaaaag tttgcgccgg gaggtggaag tmagagggcc 6300 caaaattgaa ttagaccctt ccgtgtggat agatagagta aaacgtttgg atgctcttcg 6360 cgatatagta ctgaaacatt tggaccaagc ctggggaaaa caagccgcag tgtacaacca 6420 aggaaaacgc gtattagaat ttgaggtggg cgacgttgta cgtcggaaag tacattccct 6480 ctccaaagga gtagacggaa agtgcaacaa gttagacgcg aattatagcg acccgtgcgt 6540 gattctccag aagttatcgc cactcgtgta tatcctggag acaaaagata agcgccagac 6600 cgccaaggtc cacttgagcg agttgcgtag gtatgtgcct ccgcgaaatc gtgccgcgaa 6660 cacgtagcgc ccacaacgaa ggtaggttgt gctgttcttg ttttgtgcca tagcgtcgtt 6720 gcgtctggtg acgggtatgc gacgacttac tgactccata ctaatggacg ttaaatcccc 6780 aggaagcaca atgaagaccg tgtcatgggc cgagcacatc gcagagatgg aaagcgagtg 6840 tgagcgggag ctgcgcgcca tcgaagatca gcgggagaag ctgcgccaga gtcgtccsag 6900 gaccgacatg actcgggcgg cggagatttg ggcccgtcgg agccccgcgt gccctagcrc 6960 aagctccatg gaggaagagg agcgctggag tgctggggtc actgacaccc gaatcccccg 7020 caagaccatg attggcgccg cgggccctct gcccctcacg gaggaggara gagaggccta 7080 ccagcgggag ggaattcctg tggaggagga gccctccgcc ctcgaggaga gtcgcgccgc 7140 ggtcgacgag ctggaggcat tggtgggaga tggcgatgat ctcaccatca cgatctccaa 7200 cgagagggcg gaagcagtgg gggagtaccc tcgacttcga ggtcccaacg gcaagaggcg 7260 aaaatacaac ccgttgcgcc acctcgagtt gagtcggtgg tggcaggacc atcgcgccac 7320 cgtgaccact ccccgccggg tccaccaatc cgtcgcgaag ccgtggaggt atgtatacca 7380 ctccattagc gccacaggtt tccgttgacc ttcgcgacgt ttcaggtgga tattccattc 7440 tatcagccgc ctcccgcgcc gagaaatgcc cctctcatgc cccgtttgga gattcccggc 7500 gccgagagcg actcgagcgg gagtccgagt acgggcgcgg tgccccgtcg ctacggtgga 7560 acgacgaata cgggctcccg ttcgcgttcg tcgcgccgga gagccgcggt acgactcgac 7620 ggggtcatgt ttggagggct gactcccata gcgacggacc cgactctaga cccgcctccc 7680 cggacctgtt ttaactgctg gcagccggga caccgccgga caggatgcag agcgccgcca 7740 agccgtttct gttttaattg tggccggcaa gaaacggact tggcgaactg tccgcggtgt 7800 cgagacgccc atcgtcggca catgcggcac cagggattta ggccacgggg acggggaaat 7860 cccaaaggag aggccgggat gccggggaac tgggagtatc cagcccaata cccccaaatt 7920 gatgcgccgc cagcgtcata tgacgaagca atggccggcc gggaaccgcc ctaccgtcct 7980 gccatccgca tgccgccgcc tagggacgag agggacgccg gccaataccc tcgaccgctg 8040 ccgagcctct ggatggaggc actccaccac atcgcgacgg ctccgccgga ggtcctagcg 8100 ttgctccggt cgcacctgag taacgcctag gactttcctt tctttttttt ttttaataaa 8160 gataaggatc taaccagtgt gatgattctc gtctcccttg catgagtcac gtaatgtcca 8220 gattttgtgc cgtctgttga gtcatagcgc cagttcaacc atctcgagcc gtgtgttgat 8280 gttcaaatta tttacagaaa tgtccaggac ccaaggagtt ccaggacgag gtcagcgccg 8340 acgtaccctg gcgaggggag cctctccgga atcgcgttgg ttg 8383 // ID Gypsy23-LTR_Dpse repbase; DNA; INV; 2153 BP. XX AC Unknown_group_816; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy23_Dpse; KW Gypsy23-I_Dpse; Gypsy23-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-2153 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1125-1125 (2009). XX DR Genome; Unknown_group_816; Positions 161 2313. XX SQ Sequence 2153 BP; 553 A; 449 C; 530 G; 621 T; 0 other; tgtcgcaact gacaacaaac cggcaaggtt caaatgattg tcaaaattgt ggaatgtaga 60 tgtgacctta cattcccaac cagtatcggg acagatgtgt ctttctagtt gtcattctgt 120 tttgtttgtt ttggctttat cactttatgc tggatcagcg caatgggtta ttaaactatt 180 gccaccgttg taagtgacca agcgggctcc tatgatgcag gccggaatga tgagtacata 240 taacaccggt accaccccga tgcatcgagc ttattactat tcgtttcgtt ctttctttct 300 ttattcgttt gtcttccttt ccttcctgtt ccttctggcc aatgctggag gaatgctcct 360 tttttccggt ctggattatc cgatcaggaa cggtgggtga aaggcctcaa ttagacagga 420 aggagttgtg aatggactta tactcatcgg gctgggtatc tggtatcgtt ggatggaaac 480 gtggtataaa actattaccc accgcgaaag acccggaccg cctatagcta cgagttgtgg 540 gttattatta atttctttat gcatgtagtt gagtaaattt agtacatggg aaacatcata 600 cacatacaca cacatagata taccctatta ttccctttgt cttaagcatg gtcctagcag 660 attgtttttt ttgtttgata ctgtatcttt ttttttatta atttcttaaa ttttatttat 720 ctatttaatt acgttattat tatattttat attttatatt atgcctgtat ttggcagttg 780 cgagaattta gcagacgaag aattacaaga aagtgtttgc ttgatctgcc aggctcaggt 840 ggaatatttg gctcagctcc ttaccacgtc ttgtggtcac actttccata gggcctgttt 900 tggcgcacga gtgggttcta agaaggtttg tcctgtctgc aaaacgtctt ttcagtcatc 960 gacgttgaat ttattagacg aaactgctgg ggataccgtc ggaaccttat tggtaccaaa 1020 aatgcctgat acaaggagta aagcccatgt gggggctcca cagggagctt cgacgtctgc 1080 ttcggccaat cagccgcaaa ctagggaagc tggtccggca gtggtggtag ctccgccgcc 1140 tagtaacgag cttcagcgaa tgatagccga agccatggct gaagcctcgt tgacacaaac 1200 cgagatgcta agcaatgcga tcgctaatgg ttttcaaggt attctcagct ctctcatggg 1260 ccaattggaa aatacagcgg atgttcccta gagggatatc caagctggag aaggatcggc 1320 acgacggccg acacctttgc ccgggtttca gggtttcgaa acagccagca ggaattcccc 1380 catgaatgtg gatattcgac cagagcgtgt tagtcaaatt ttagtcaatt gaatgcttag 1440 gttctcttgg cggactggaa tttccgtcga cgatttcatt tatcggatag aagcgctcac 1500 gatccaaaca ttggacggga atttcgagtt gctttcgaga tacgtgagca atctctttga 1560 gggaagtgcc agtgactggt attggcgcta ccataagagt gttaggacgg tcgtttgggc 1620 cgacccaaaa tcttcattgt tagtacgatt caaggatgcg aggacagacg tcaacattcg 1680 agctgcaata gaccgaagga agcagaagga gaacgaatca tttgatgagt tctacgaagc 1740 gattgtccga ctggcagata ccctgacaac gcccttgtca gaaggttccc tgatggagtc 1800 actccgatcc aatttattga cggagataca gcatgaactg ctatatgaaa agatctttag 1860 cattgcacag ttaagacact tggtgaagac tcgggaactg ttcatgcaaa ctgttggtaa 1920 agatccggta gatccggacc acgaccagca ccaagacggt tggttcacgc ggtagaaagc 1980 ccagggcaag aggagtcaga cgaaagtgag ggggagatag cagcgttcaa cttagtctgt 2040 tggaactgcc gtggtaatgg tcatcgctac caggtttgcg tagcagagcg agtagttttt 2100 tgctatgggt gtggccagcc tgacacttac aagccatcgt gccaaaaatg cca 2153 // ID piggyBac-3_SM repbase; DNA; INV; 2457 BP. XX AC . XX DT 29-MAY-2008 (Rel. 13.05, Created) DT 29-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-3_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2457 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 522-522 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-3_SM is a young family of piggyBac transposons, CC characterized by 14-bp TIRs and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of three copies (they are ~99% identical to the CC consensus). This transposon may be still active. XX FH Key Location/Qualifiers FT CDS 369..2087 FT /product="piggyBac-3_SMp" FT /note="piggyBac transposase." FT /translation="MAGINPKLFYGKRKKVVDSSDSEEADFDRSDSDEYIP FT SISSTDVDDVDVSDVSVSDDCSDIPTPTITRNVWNTVSGNTLHKFTLTAVA FT ASPGISHLDTSSIKPIDIFNIMVSSDIYDLIVQQTNLNAYKKISNSRISRS FT SRMKKWTDTNVSEIKRFLGIVMYMGIVTYPTIDMYWSKDNFYLNSFVPRQM FT SRNRFQLLLKFIHFADDDTIQNNRLGKVAYLLELLEKFFIAAKIPGELLVV FT DESMIPWRGRLMFRQYNPGKAHKYGVKVYKLCDPIGYTYTSEIYCGKTDDS FT ARRGRPTPGVTHSTQIVLDLAAPYLNNGRTVVTDNFYTSRSLANALITQQT FT HLLGTLRKNRIGNPTEVTNAKLRKHEVIGRENSGIVVAKWRDKREVLMLST FT KHDLTEVRTGKTNRNNEDIIKLQMIIDYNFAKSGIDLSDQMASYFTPLRRT FT IRWYHKIAFEYLLSTAVVNALILYKLSKPNIKIAEFRKSICESLCETSISS FT VSVVVQRPPARRHLIQTYTTRDNRNRIIRKRCVECYKQLQQQKGREEATKK FT TKRVTTFCSGCEDNPPLCTNCFNKLH" XX SQ Sequence 2457 BP; 813 A; 425 C; 455 G; 764 T; 0 other; cacgttgact gccaccatat atataaatac tacattccta cggtgccgta gtacattata 60 gactgtggca gccggcggta cacgcggcac ctagtaaata ccgttttcat tatgtgccgc 120 atgtgcgccg gctgtacaca ggttatttgt tatctttttg aaaaaaattg ttttgtggta 180 taattcacag tctttaatta tcatcaattg cataattagt tgattgcata attagttctc 240 taattatcat cgattgcata attagttata gtaaggttta attttatact gttttatatt 300 tatagtgata agtggaaaga tattgtgtgt gttagatagt ggtatataaa agtatccctg 360 tagtaattat ggcaggtatt aatccaaaat tattttacgg aaaacgtaaa aaagtggtgg 420 atagttctga ctccgaagag gccgacttcg accgatcaga ttcggatgag tatattccat 480 caattagtag cacagatgtt gatgatgtcg atgtctccga tgtttcagta tcagatgact 540 gtagtgatat acctacaccc acaattacaa gaaacgtatg gaacacagtt agcggcaaca 600 cattgcataa atttacgttg actgcagttg ctgcttctcc tggaattagt cacctagata 660 caagctccat caagccgatt gacattttca atattatggt ttcatctgac atatatgacc 720 ttatagttca acaaacaaat ctgaatgcat ataaaaaaat ttccaatagt cgtataagcc 780 gcagttcacg catgaaaaag tggaccgata caaatgtttc agagataaaa agatttcttg 840 gcatagttat gtacatggga atagtgacct atccaacaat tgatatgtac tggagcaagg 900 ataactttta cctcaactca tttgttccta gacagatgtc ccgaaataga ttccaattat 960 tgctgaaatt tattcatttt gcagatgacg ataccataca aaataatcgt ttgggtaaag 1020 ttgcctattt gcttgaactc ctggaaaaat tttttatagc tgctaaaata ccaggtgagc 1080 tgttggttgt ggatgagagc atgatacctt ggcgtggaag actgatgttt cgtcaatata 1140 atcccggaaa agcacataaa tatggtgtta aagtttacaa gctttgtgat ccaattgggt 1200 acacctatac aagtgaaatt tattgcggaa aaacagatga ttcggctaga agaggacgac 1260 caacaccagg tgttacacat agcactcaga ttgtactcga tctggctgcg ccttatctaa 1320 ataacggtag aactgttgtt actgataatt tctacacgtc taggtcgtta gctaatgcct 1380 tgataacaca gcaaactcat cttctgggaa ctcttaggaa aaaccgaatt ggtaacccta 1440 ctgaagtcac taatgctaaa ctccgaaagc atgaagttat aggtcgagag aattcaggca 1500 ttgtagtagc taaatggagg gacaagcgag aagttcttat gttgagcaca aagcacgacc 1560 taacagaagt tcgaactggc aaaacaaatc gcaacaatga agatatcata aagcttcaaa 1620 tgataattga ttataatttt gccaaatcag gtatcgattt atctgaccag atggctagct 1680 attttacgcc tttacgcaga actattcgct ggtaccataa gatagcattt gagtaccttc 1740 tgagtacagc cgttgtaaac gcgcttattt tatacaaact aagtaaacca aatataaaaa 1800 tagctgaatt cagaaaatct atttgtgaat ccctttgcga aacatcaatt tcgtcagtat 1860 ctgttgtagt acaacgtcca ccagctagac ggcatttaat acaaacgtat accaccagag 1920 acaaccgaaa tagaattata cgtaaaaggt gtgttgaatg ctacaaacaa ttgcaacagc 1980 agaaaggacg agaagaagca acaaaaaaaa ccaaaagagt tacaactttt tgcagtggat 2040 gtgaggataa tccgccatta tgcacaaatt gctttaataa attacattaa tttttatttt 2100 atattatatt tatcatattt catatatgta taccgaagtt ttcttttgtt ttgttaattt 2160 ttataataca taattaatga atttttatgt ttgcaataaa atttttatgt ttgtaatctt 2220 ttcgtttttc tccattttgt tccagctata cctctaaatg tagatgtaaa ccataacaaa 2280 aaaaccttaa tttagtttta tcaaggtgcc actgtatata aataaacagt ttaagcccac 2340 tgccacaaga aaaacatgat tattcagtag tattatgtat gtaccgccgg ggctacacgt 2400 ggcatatgta taaaaatgta ctgtgtactg cgggcggggc atatggcagt caacgtg 2457 // ID Gypsy-92_CQ-I repbase; DNA; INV; 1951 BP. XX AC AAWU01007055; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-92_CQ_; KW Gypsy-92_CQ-LTR; Gypsy-92_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1951 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 563-563 (2011). XX DR GenBank; AAWU01007055; Positions 58543 60493. XX CC 'CAAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 216..1733 FT /product="Gypsy-92_CQ-I_1p" FT /translation="MDSKQFAEFMGKFQEMIGSLKKAPAAPVERPAAVASV FT PLPPPLELDGDMERNFDFFEESWKYYASAVGMDKWPEAQNEQKTSILLSVV FT GTEALKKYFNFELTAVEKKDPKLALAAIKRKVVRERNKFIDWFDFFSLSQG FT MEERIDDYLCRLKSQAKVCKFGVLEEDLLKYKLVTSIKWSKLRTKLLTMQN FT LTEAQAVDFCRAEEIAEQHPISEQRNASVNKVKKHAKKCKFCGEMHDFTKG FT SCPAFGKRCYRCDGKNHFEAVCKAERRKKLKRKSRVQKVLVDSSTDGDLTE FT TESTSSESATIGKVSDGSGGHVEANLDMCIAGKWQSVQCELDTGANASLVG FT HSWLQRMTGKAHSDLQPSKYRLHGFGGGEIPVIGQTTIRCRQYGRKYNLVL FT QVVDVEQGPLLSAHVCRTLGFLEFGKMTRVNMPVSTKRSICRIEAQCAGCV FT AGQDRQPTRTRHVRYRCDCTTGGPRQTQTDDPVRARRSTNGASCSLGRYTK FT FSRLGESGST" XX SQ Sequence 1951 BP; 517 A; 420 C; 564 G; 450 T; 0 other; tggtgtcaga agcactcgtg gactaaaggt gatttccggc gtcatttgaa atcgtggaaa 60 gtgttcggcg aggaaaagtt cgatcgagcg acgaagaaaa tcatcagcgg aagtgatcgc 120 gaagcagttg cgcagtgaag agcgaacttg tttgatcctg tccagcgcca tatttttctt 180 taacgtcgtg tcgaggcgaa gtgttttccc acaaaatgga ttcaaagcag tttgccgagt 240 ttatgggaaa gttccaggag atgattggat cgttgaagaa ggcgccggct gcgcctgtag 300 aaagaccggc ggcagtggca tcggttcctc ttccgcctcc gctggaactg gatggagaca 360 tggagagaaa cttcgatttt ttcgaagaaa gttggaagta ttacgccagt gcagttggga 420 tggacaagtg gccagaggct caaaacgagc agaaaaccag catcctgctg tcggtggttg 480 gaacggaggc tttaaagaag tatttcaact tcgaactgac tgccgtggag aagaaggacc 540 cgaagctggc attggcggcg atcaagcgga aagttgtacg tgagcggaac aagttcatcg 600 attggtttga tttcttctcg ctgtctcagg ggatggagga aagaatcgat gactacttgt 660 gtcgattgaa gtcgcaagcg aaggtgtgta agtttggagt cctggaagaa gatttgctga 720 agtacaagct cgttacgtcg atcaaatggt cgaagctgcg cacgaagctg ttgacgatgc 780 agaatctgac ggaggcacaa gcagttgatt tttgccgcgc ggaggaaatt gccgagcagc 840 acccgatttc cgagcaacga aatgcaagtg tgaacaaagt gaagaagcat gcaaaaaagt 900 gcaagttttg tggcgaaatg catgacttca cgaaaggcag ttgtccagcg ttcggcaagc 960 ggtgctatcg ttgtgacgga aagaaccatt tcgaagcagt gtgcaaggca gagcgtcgga 1020 agaagctaaa gcggaagtct cgtgtgcaga aagtgcttgt ggatagcagt accgatggtg 1080 acttgacgga gacggagtcg acgagcagcg aaagtgcaac aatcggaaaa gtttccgatg 1140 gaagtggtgg gcatgttgaa gctaatctag acatgtgcat cgctggaaag tggcagtcag 1200 ttcagtgcga attggataca ggggcaaacg ccagcctcgt gggacacagt tggttacagc 1260 ggatgaccgg gaaagcccac agtgatcttc aaccgtcgaa gtaccgcctg catggctttg 1320 gtggcggcga aattcctgtg atcggtcaga cgactatccg ttgtcgacaa tacggccgta 1380 agtacaacct agtcctacaa gtggtcgatg ttgagcaagg tccgttgcta tctgcgcacg 1440 tgtgtcgcac tttgggcttc ttagagttcg gcaaaatgac tcgcgtcaac atgccggttt 1500 ctaccaagcg cagcatctgc cgcattgagg ctcagtgtgc aggttgcgtt gccgggcaag 1560 atcgtcaacc aacaagaacg cgccacgtga gataccgctg tgattgtacg acaggcggcc 1620 cgcggcagac gcagacggat gatccagtcc gcgcacgacg aagcacgaat ggtgctagct 1680 gtagtcttgg ccgctacacg aagttttcga gacttggtga gtccggatcc acgtaatgaa 1740 cccaacacgc ccaaggtaac ccaccaaatc tcgataccag tcatcaagct ttatgcctct 1800 agcagtcaca catcatccga acaagctcca gtggtcttca gaaacatcca gcattttgtg 1860 gtttagtttt aggtctttat ttttagttac acattaacct cattctcaag cacagttcta 1920 tttagttttc tttttgataa aacagggaag a 1951 // ID Saci-2_LTR repbase; DNA; INV; 471 BP. XX AC BK004069; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 29-MAR-2007 (Rel. 12.04, Last updated, Version 2) XX DE Schistosoma mansoni Saci-2 LTR retrotransposon: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Saci-2_LTR. XX NM Saci-2_LTR. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX DR Genbank; BK004069; Positions 4476 4946. XX SQ Sequence 471 BP; 115 A; 108 C; 84 G; 164 T; 0 other; gcgaccaaac ataaactctt tagaggaaga agagcaagcc ctatgtacac taagcttaca 60 ggatgactca taagactact tcgtaaactg gcttttcccg ctgtaatctg acgtcccgcg 120 ctttttgatt ttgaaataaa gaggctcgtg attcccactc catgttcttt cttgatgtga 180 agtcttcgtt taacattgtg tgctgtgctt tttcagatct gtcctgtcct tttctgaccg 240 ctagttccac gtgttacaaa cgttgtatcc gcattctatt cttctgcttt aatgcccacg 300 gcctttgttt gaagattata atagtattct aacattcttt tagtcctttc catactccat 360 tttgctgatg gtgccgtcac atgcagagga acctttatta accaagtgaa cttatttgtg 420 gttgttatct ccgaaagccg cttacaagta acattcatta cataacactg g 471 // ID MITE_AA repbase; DNA; INV; 521 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Aedes aegypti MITE repeat - consensus sequence. XX KW MITE; MITE_AA; miniature inverted-repeat transposable element. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Tu Z.; RT "Molecular and evolutionary analysis of two divergent subfamilies RT of a novel miniature inverted repeat transposable element in the RT yellow fever mosquito, Aedes aegypti."; RL Mol. Biol. Evol 17(9), 1313-1325 (2000). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Aedes aegypti MITE repeat - consensus sequence."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 521 BP; 172 A; 91 C; 85 G; 173 T; 0 other; taccgttttg actcatattc cgaacactta aggccaacag tgacttcaaa tgcatctgat 60 aggcataaat tagctgatat ttgtgtaaat tttaatttct tcgcaaagtc taactgttag 120 ctgttgggtg taccaataaa aatgttgatt taattagttt tagtattgtt tttacgtaaa 180 agtatggaac aacattttga ttcaaatgcc gaacactgtg ttcattctgt ctcatattcc 240 gaacaccttg attcaaattc cgaacagcac gaataaatcg tattcaaatg aataatttcg 300 caaataaatt tatctgagct tgttctactg gtctcaaact agagaatcat cactactccc 360 gaggtataaa atatattgga agacgttaaa attgaattgc aattgactgc catttccttg 420 ttatttgatg acatatttca gcgaaacatt tcaaccaaat cgccatacaa aaaccgagtg 480 ttcggaatat gagtctgttc ggaatttgag acaaaacggt a 521 // ID BEL2-I_Dpse repbase; DNA; INV; 7585 BP. XX AC Unknown_singleton_95; XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_Dpse; KW BEL2-LTR_Dpse; BEL2-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-7585 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1011-1011 (2009). XX DR Genome; Unknown_singleton_95; Positions 60489 68073. XX CC Positions [2105-2686] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2..4852 FT /product="BEL2-I_Dpse_1p" FT /translation="MGHMKIVPASKVPKRHYFIPHHCVLKPESTTTKLRVV FT FDASCKSTSGKSLNDILQAGPTVQSELMSILLRFRIHKYVFTADIEKMYRQ FT VWMNPEDQFHQMIVWRNNPSEEIRYYRLKTVTYGTTAAPFLATKCLDHLAI FT QYKDRLPVGSTTLKTDFYVDDCLTGANTIPEAIHVRQELNKILLPSGFKLR FT KWCSNNEKLLQGIPKEDIVTDVKLGETLEHYSVKTLGLIWMPKKDTLCGRT FT QKSQATRITKRVVCSELAQIFDPLGLFAPVVVKAKIFMQQLWELKLDWDDE FT LPLQHQTEWKEYRDDLQTLNKMQIPRHIYDGKIPVTQELHTFVDASERAYG FT AAVYVRATYKNNQVSVRLLCSKSRVAPTAKQSLPRLELCAAVLGAELTNRV FT RQDLHMDRNTVSGWLWSDSTVVLSWINASSSTYHTFVANRIAKIQALTNQS FT QWRHVSSANNAADVLSRGIAASRLADHNLWLYGPLFLHGSRDNWTKAPNIE FT PSNLERKAKHVSLPITPSALGDELYTCKHSNSFHKLQRIVAYMLRFCYKTN FT RANTTTLCAEELTHARTIILRTIQQVEFTAELKQFQRYKTVDKKSSIISLS FT PFLGDDELIRVGGRLEEALLPYNAKHPVLLPYNDPIVKMILRELHEENMHC FT GAQALRAIARQQYWIINDKTMARSIIHSCVRCTKARPKLMHQIMGNLPAQR FT VTQARPFLNAGVDYCGPIAIHHKIRGKRPDKAYIAVFCCFSTKAVHLELVS FT DLTTEAFLAALRRFLSRRGKCQTIHCDNATNFVGANNQLKELEDSIFSSKA FT SALITNHCNKKQVDFKFIPPRAPSFGGLWEAAVKSAKRLLITTTNTASLTF FT EELNTVIIEIEAILNSRPITPMSNDPTDTTALTPGHFLIGEPLTTPPDVNA FT DHQGKTLVKRWELVSRLKHSFWKQWSTEYLQELQARHKWKTASPNVPEGLL FT VLVKEDNLPVMKWPMGRIIKTYPGQDNHVRVVDVKTASGVYKRPITRLAPL FT FPEDVAIKRPHNVISDTTEIDEPPTQRTTKLAPTLTSTLVVLLLMLPLAWS FT QSINVTQFANKPGIFYEKLGTSRIATSEWTMIVYYDLDPYWKDMQLMSAGI FT SELRYMCPELKNVPACNSIVQHFQHVYNELQAGSVLMKNSRKRRSPLDIIG FT NVASSLFGVLDAKYAEEMTQVIEKVETNEDFLLHLLQNQTSIVDATINVVR FT KDKLTTDNRLKTLEQHLATINRERGSYHSDHLTQAYLGLVAQLTLLTTTLQ FT RMQSDISDVLIGIHHRRISPTLVSPEQLIDQLDQIRDHLPQDQMLPVTTKE FT VIQLYRIMRAEGTPTAKHVIFKLTLPLVSTAQLEVFSIIPIPFWDSTSWSL FT FDLRTTIVTVNIHRDQFMGTTQEEFRRCLKVHNEEFICFDHQTIFNVDNRC FT EFQLFNNKTETLCQVTHTKADSVWLKTQHPNKWIFATHSTTKLQAVCNGVP FT STITLEGTGLLSMSPGCTARNPKVSISTFSTTISEVKSTYTRFGDVNPPET FT SPSPPMWQLKNQTSDFLQELDDLRQQLSAINQHELPHHLTTVQHHQLVAYA FT ALAISILLIIAFIFRKKYQLWKPRPSNPDPATSSPPIPFTRRFTVDLNDG" FT CDS 6500..7585 FT /product="BEL2-I_Dpse_2p" FT /translation="MEAYLTLESKVQRALAKSATPNTDNVQLPRISIPKFD FT GDLLKWTQFFDLFSCMVRETNMPTVKKMWYLKTTVTGEAERLIRHIDLKEE FT NYEAAWELLVDAYKNPRDVSNTVLNRFLNHQTINDDPKSLKELYITTIESL FT ASLKGLGINISTWDPILIAIMRNKLDVTNKALYEQSLGSSKDIQALDTMLQ FT FLEQRIRVLGTSKRPPASRREPKGTTCASANTGRARAPLCTFCKKASHWLY FT GCEEFRSKSPSERLNWVQKQKMCVNCFRTNHKTNACPGRNCFKCDKKHNTM FT LHLETASGAATESRTPTVGAASNNSYNLRATAKVAVEAENGQVANFRALLD FT SGSQINLISERMASMLSKN" XX SQ Sequence 7585 BP; 2470 A; 1874 C; 1573 G; 1668 T; 0 other; tatgggacat atgaagattg tcccagcatc aaaagttccc aaaaggcact acttcattcc 60 acatcattgt gtactcaaac ctgagagcac caccacaaag cttcgggtag tattcgatgc 120 ctcatgcaaa tccacaagcg ggaaatcttt aaacgacatt ctgcaggcgg gacccaccgt 180 tcagagcgaa cttatgtcca ttctgctacg atttcggatt cataaatacg tgttcacagc 240 ggacatagaa aagatgtacc ggcaagtgtg gatgaaccct gaggatcaat tccatcagat 300 gatagtctgg aggaacaatc catccgaaga gatcaggtac taccgcctga aaacagtaac 360 atatgggaca acagctgctc cgttcctagc aacgaaatgt ctggatcatt tggcaataca 420 atacaaagac agactaccag tcggatcaac aacattaaaa accgatttct atgttgacga 480 ctgtctaact ggagcaaaca cgattccaga ggcaattcat gttcgacaag aactaaataa 540 gatattgcta ccgtccggat tcaaattgcg caagtggtgc tccaacaatg agaaacttct 600 ccaaggaatc cccaaagagg atattgtaac cgatgtaaaa ctaggagaga cactagagca 660 ctacagcgtg aaaactctgg gtctcatctg gatgccgaag aaggatacat tgtgtggacg 720 aacacagaaa agccaggcta cacgcatcac caaacgggtg gtatgctccg aactggccca 780 aatctttgac ccactaggac tctttgcacc agtggtagtc aaggcaaaaa tcttcatgca 840 acaactttgg gagctcaaac tggattggga tgacgaatta ccattgcaac accaaaccga 900 atggaaagaa taccgggacg acttacaaac tttgaacaaa atgcaaattc cccggcatat 960 ctacgatgga aaaattccag ttacacaaga actccatacg tttgtggatg catcagaacg 1020 agcatatggc gcagcagtat atgtacgcgc tacatacaaa aacaaccaag tgtcagtaag 1080 actgctctgt tcgaaatcac gtgtggcgcc gacagccaaa caatcgctac cacggttaga 1140 actttgtgct gcagtcctcg gtgccgagct cacaaaccga gtccgacagg atctgcacat 1200 ggacaggaac acggtttcag gttggctttg gagtgattcc acagttgtgt tatcctggat 1260 aaacgcatca tcatcaacat accatacctt tgtggcgaat cgaatagcca aaatccaggc 1320 attaacaaac cagtcacaat ggcgtcacgt gtcatcggcc aacaacgcag ccgatgtcct 1380 gtcgagagga atcgcagcaa gcaggctggc agatcacaat ctgtggttat atggaccact 1440 attcttgcac ggatctcgag acaactggac gaaggcaccc aacatagaac ctagcaatct 1500 tgagagaaag gccaaacatg tgagcttgcc aatcacacct agtgcacttg gggatgaatt 1560 atatacttgc aaacatagca actcgtttca caagttgcaa cgcattgtgg catatatgtt 1620 acgcttctgc tacaaaacca acagagcgaa caccacgacg ctctgcgcag aagaactgac 1680 tcacgcccgc actataatcc tgcgaaccat acaacaagtc gagtttacag cagagttaaa 1740 acagttccaa cgatataaga ctgtggataa aaagagctcg atcatatcgt tatctccctt 1800 cctaggagac gatgagctca tccgagttgg tggtagactt gaagaagccc tacttccata 1860 caacgcaaaa cacccagttt tgctgcccta caatgaccca atcgttaaaa tgattttgcg 1920 ggagctacat gaggagaaca tgcattgtgg tgctcaagca ttaagggcaa tcgcaagaca 1980 acaatattgg attatcaatg acaagacaat ggctagaagc atcatccaca gctgtgttag 2040 atgcacaaag gcaagaccaa agctgatgca ccagatcatg ggcaatctcc cagcacaacg 2100 agtcacacaa gcacgcccgt ttcttaacgc aggcgtcgac tactgtggac caatcgcgat 2160 ccaccataag atccgaggca aaagacctga taaggcatac atcgccgtgt tctgctgctt 2220 ttccacaaag gcagtacacc tggagctggt aagcgacttg accacagagg cattcttggc 2280 cgccctgcgt cgattcttaa gtagacgtgg gaagtgccaa actattcatt gcgacaatgc 2340 aacgaacttt gtgggtgcaa acaaccagct taaagaatta gaagactcta tattcagctc 2400 aaaggcaagc gctctgatca cgaatcattg caacaaaaaa caagtggatt ttaaattcat 2460 tcctcccagg gccccatcgt tcggcggcct ctgggaagcg gcggtaaaat cagcaaaaag 2520 gctgttgata acaaccacga acacagcctc actgacattt gaagaactaa acaccgtgat 2580 aatcgaaatt gaagcaattc tcaactctcg acccatcact ccaatgtcaa atgatcccac 2640 tgatacaaca gcgctcaccc caggacactt cttgattggt gaaccattga caacacctcc 2700 tgacgtcaac gctgatcatc aaggaaaaac attggtcaag cgctgggaat tggtatcgcg 2760 cctaaagcat tcattctgga aacaatggtc tacagaatac ctccaagaat tgcaagctcg 2820 gcataaatgg aagaccgctt caccaaacgt acctgaagga ttattagttt tggttaaaga 2880 agacaacctt cctgtaatga agtggccaat gggtcgcatc attaagacat atcctggaca 2940 ggacaaccac gtacgagtgg ttgacgttaa gacagcatct ggcgtataca agcgcccgat 3000 cactcgtcta gcgccgctct ttccggaaga tgtggcaatc aaacgccccc acaacgtgat 3060 cagtgacacc acggaaattg atgagccacc cacacagcga acaacaaaat tggcacctac 3120 actaacatca acgttagtgg tgctgttgct catgcttcca ttagcatggt cacagtcaat 3180 aaatgtgaca caattcgcaa acaagcctgg aatattttat gagaaacttg gaacatcaag 3240 aatagcaacg tctgagtgga ccatgatcgt atactatgac ttggacccat attggaagga 3300 tatgcaacta atgtctgccg gaatttcaga gttacgatac atgtgccctg aactaaaaaa 3360 tgttccggca tgtaactcca tagttcaaca ctttcaacat gtgtataacg agctacaagc 3420 aggaagcgtt ttaatgaaaa actctcgaaa acgcagatcg ccgctggaca tcataggaaa 3480 tgtagcaagc agcctgtttg gtgttctcga cgcaaaatac gcagaagaga tgactcaagt 3540 gattgagaaa gtggaaacca atgaagactt tctactacat ctgctccaaa accaaacatc 3600 aattgtcgac gcgacaataa acgtagtcag aaaggacaaa ttaacaactg ataatcgcct 3660 aaaaactttg gaacagcatt tggctactat caaccgagaa agagggtcgt atcacagcga 3720 tcatctgact caagcgtact taggactagt agcacaatta acgctgctca caacaactct 3780 ccaacgaatg cagtccgaca tatctgatgt cctaatcggt attcatcatc ggagaataag 3840 ccccactctg gtgtctcccg agcaactaat agaccaacta gaccagatac gggatcattt 3900 gccacaagat caaatgctgc cggtcacaac gaaggaggtg attcaactat acagaatcat 3960 gcgagccgaa ggaaccccta ctgcaaagca cgtcatcttc aagctgacgc taccgctggt 4020 ctcaacagct cagctcgaag tcttcagcat catcccaatt ccgttttggg attcaacaag 4080 ctggagtcta tttgacctca ggaccaccat agtaacagtt aacatccacc gagaccagtt 4140 catgggcact acccaagagg aattccgacg ttgtcttaaa gtacacaacg aggagttcat 4200 ctgctttgac caccagacga tcttcaatgt ggacaatcgc tgtgagtttc aactgtttaa 4260 caataaaaca gaaacactgt gtcaggtaac acacacaaag gctgactcgg tgtggctaaa 4320 aactcagcat ccaaataaat ggatcttcgc aacgcattct acaactaaac ttcaagcagt 4380 atgcaacgga gttccgtcaa ccataacact ggaaggaact gggctactct ccatgtcccc 4440 tggttgcact gctcgcaatc ctaaagtaag catcagtacc ttcagcacca caatcagcga 4500 agtaaaatcg acgtacactc gatttggtga cgtgaaccca ccggagacat caccctcgcc 4560 accaatgtgg caactgaaga accagacatc agacttcctt caagagcttg atgatctacg 4620 ccagcaacta tcagccatca atcagcacga gctgccacac catctcacca ccgtgcagca 4680 ccatcaactc gttgcttatg cagccctagc aatatcaatt ctactgatca ttgccttcat 4740 tttccggaag aagtatcaac tatggaaacc aaggccatca aatcccgatc cagccacgtc 4800 atcaccccct atacccttta cacgccggtt taccgtcgac ttgaacgatg gttgaacaca 4860 gtttcaacgg ccgggagaat gttcagttgc agaagtatgc ttaagcaata gcttataaga 4920 cccgtcggca taacacagtg acaccgcgac acagtgttcc atttaacatg caatataata 4980 cctcacgaca acacactagc cactgcagtg agttctgctg acaacacact agccactgca 5040 gtgagttctg ctgacaatgc actctctccg ccactgcagt gagttctgct gacaatgcac 5100 tctctccgca gagtgcgaag acagcggagg cagttgctga ccatcaaaat gtcgcaattg 5160 cccaccagac actaaaatta tccataatac acaaaacata tattttacag actcagcact 5220 tttgtacctt ttattaccta taaataagaa tcatttttgt caataaaatc agtatcagac 5280 acagtaccat actgcctgtc actcatcttc catcgcaatc atcagaggct gtcgccgtgt 5340 cttcagaccc tcccttgcgc accgtaggat acccactccg cagtccaact ggttcaagtt 5400 ctagttgcga cgaccaaact gtagacggtt tatgtaccgt atgcaaccca gcttcctaca 5460 cacacacaca cattaacatt tggtgacctc gaccttggtc atcctttcaa actgtcagac 5520 aagctacagt gaaaggattc gatctggaca tcagattgaa gcagtcaaca ggacatcttc 5580 aataaaaatt tggcaccagg ccatacactt acggtagcat ctcctaccaa taagtatcac 5640 tctggcaaat ctgacatatt caggttatca gcccacagaa aaacagctat agaattcgac 5700 taccaccaag gtaatcaatt tttgttgttc cgtgcgttcc atagattgga aacagcacaa 5760 atccctccat ccaacacatc acctgtacta ttaccttgct gaccaacaac ggttgcgatt 5820 cggagcgtat ccagctgcaa agcaaataaa acactcccaa cgggtcaaaa agtaataagt 5880 acaaacagat agaatggata cacatatgga ggcagaagct gccgctacaa tggaaatgga 5940 tgtgcagcaa gaaattggtg aggatatatc gaggtgcatt cggaactata aaaaagattc 6000 ccaagaacga aagcaaacgg cctcttattt tcaagccaaa atatcagcac tagaaaatgc 6060 gtggaatcgc ttcaacaaaa acgactccaa acttcgcgat acagacaaaa cggatgaata 6120 cgtaaaattc cgggaagact tacgtacaac atgtaagcaa tacattgcga tgtttaccga 6180 aggcatggat aagctatcac agacgtctag tgctcgtggg gaacgtcaaa tcgtctttaa 6240 agaaaggcat gagaatcctc cggtagctcc caaagaaagc ggtctgatcc agcagtcctc 6300 gaaggattac agcaaacttc tttcactatg tcgccgacaa aaggctatag tagagtccgt 6360 taagcgcaat ctgaaacaac tttcagaggc ggaaggacgc acataaagtg atttgataaa 6420 atcactatgg actcaagcgc aggaagtcca cttcacaatt catgaggaat acgaggaccc 6480 agtggaagga ggatacgaca tggaagccta tctaacactg gaaagtaagg tccaaagagc 6540 attggcaaaa tcagccacac caaacacgga taacgtgcag ctgccacgga tcagcatccc 6600 aaagtttgat ggagatctat taaaatggac ccagttcttc gatctgttct catgcatggt 6660 gcgtgagacc aacatgccta cggtgaaaaa gatgtggtat cttaaaacca ccgtcaccgg 6720 cgaggcagaa agactgattc gacatatcga tctaaaggaa gaaaattatg aagccgcttg 6780 ggaactacta gttgacgcat acaagaatcc cagggatgtt tcaaacacgg tactgaaccg 6840 cttcctcaat caccagacaa ttaatgacga tcccaagtca ttaaaagagt tgtatataac 6900 aaccatagaa tcccttgcat cactaaaggg attgggcatc aatatatcta catgggatcc 6960 gatattgatt gccattatga ggaacaaact ggatgtaaca aataaggcct tgtatgagca 7020 atcattggga agctcaaagg acattcaggc gctggatacg atgcttcagt tcttggagca 7080 aagaattcgg gttctaggaa ccagtaagag gccaccagca tcaaggagag agccaaaagg 7140 cacgacatgt gcatcggcaa acacagggag agcgcgagca cccttatgca cattctgcaa 7200 gaaagcaagc cattggctct acggatgcga agaatttcgc tccaaatctc catcagaacg 7260 actaaattgg gtacaaaaac aaaaaatgtg cgttaattgt tttagaacaa atcacaaaac 7320 aaacgcatgc cctggtcgaa actgcttcaa gtgtgacaag aagcacaaca cgatgttaca 7380 tttggaaaca gcatcaggcg cagcaacaga atcaagaacc ccgacggttg gtgcagcaag 7440 taacaatagt tacaatcttc gggccacagc aaaggtggca gtagaagccg aaaacgggca 7500 ggttgctaat ttcagagcac tgctagattc aggatcacag ataaacttga tctctgaacg 7560 aatggcatca atgttgtcaa aaaat 7585 // ID Gypsy-158_AA-LTR repbase; DNA; INV; 185 BP. XX AC AAGE02017317; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-158_AA_; KW Gypsy-158_AA-I; Gypsy-158_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017317; Positions 50100 49916. XX SQ Sequence 185 BP; 60 A; 27 C; 57 G; 41 T; 0 other; tgtcgggacc aactaggccc tttgtaatac tgggtcagag aataggtaag tctggcaaca 60 caggacgttg agtagaggag agaggaaaag cagttgagga tcggacggta gcgtgaacgg 120 atagtcgcaa ataaacggtc tttgtaaaag tttaagaagt gcgttgttaa taagtgaatc 180 cgaca 185 // ID Crack-6_CQ repbase; DNA; INV; 1957 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1957 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 37-37 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 3..1592 FT /product="Crack-6_CQ_1p" FT /note="reverse transcriptase." FT /translation="XQXTFLLLPTDQFEIITLIDSFDNNTSCGXDKISSSI FT LKKVKLIIAPILTEIFNLCMLNGCYPDKLKIAKVTPVYKSGSASLFNNYRP FT ISVLSLLXXIFEKILFTRLNNFLCLNKFFCAQQYGFRHKSSTKNAVIDLVN FT KIQTHLDQKDDVLGLFLDLSKAFDTVDHKILLTKLNFAGVRGVALELFKXY FT LXNRVQFVSLDGIDSLTALINVGVPQGSVLGPLFFLVYLNDLALLPLKGDL FT KLFADDSSLFYFNKSMVINDINLNNDLRLLTDYFRLNKLTLNINKSNIINI FT KNSNRSTVNPLTLTRTNFADIKIVDECKYLGVILDNKLNWSSHINNLLNKL FT NQIIGIIYKIKHKLTANVLLIIYHSLFNSHLSYITSVWGNSCNILINKLQI FT AQNKILRIIYKKPLRSHSADLYNINKNIIPVRAIYVIQTCCFIYLCLHEQT FT HSNTKFKLSNHQHFTRSHNLLDRPNISTLAGERCISFKGAQLYNYFWNRFG FT DCQSVSIFKNKLLSFLNEPAVIEHILKSFDFLVT" XX SQ Sequence 1957 BP; 636 A; 329 C; 253 G; 728 T; 11 other; aawgccaama aacgttttta ttgctaccaa ctgaccagtt tgagataatc accctaatcg 60 attcatttga taataataca agttgtggtt wtgataaaat ttcttcttcc attttgaaaa 120 aagttaagtt aataatwgca ccaatcctca ctgaaatatt caacttatgt atgttgaacg 180 gctgttaccc tgataaactt aaaatwgcta aagtcactcc cgtttacaaa tctggttctg 240 cttcattgtt taataattat agacctatat cagttttatc cttgctcmat amaatttttg 300 aaaaaatcct gttcactaga ctaaataact ttttgtgctt aaataagttc ttttgtgctc 360 agcaatacgg tttcagacat aaatcctcta ctaaaaatgc tgtcattgat cttgttaaca 420 aaattcaaac tcacttagat cagaaagacg atgttttggg gctattttta gacttatcaa 480 aagcgttcga cactgttgat cacaaaattt tgctaacaaa acttaacttt gcaggtgtaa 540 gaggggttgc attagaattg ttcaaawgtt atcttwmtaa tagggtccaa tttgttagtt 600 tagatggtat tgatagttta acagctttga ttaatgtagg ggttcctcag ggttccgtct 660 tgggtccact tttcttctta gtttacttga atgacttagc gttactgcct cttaaaggtg 720 atttaaaatt gtttgctgat gattcatctc ttttctactt taacaaatct atggtaatca 780 atgatataaa tttgaataat gatcttcgat tgttaactga ctattttcga ttaaacaagc 840 tcacactaaa cataaataaa tccaacataa tcaatattaa aaattcgaat cgctcgacgg 900 tgaatccatt aactttaact agaactaatt ttgctgatat aaaaatagta gacgaatgta 960 aatatcttgg tgtcatttta gacaataaat tgaattggtc gtcacatatt aataatttat 1020 tgaacaaact caaccagatt attggaatta tctataaaat taaacacaaa ttaactgcta 1080 atgtactttt gataatttat cattcactat ttaattctca cttatcttac ataacttcag 1140 tttggggaaa ctcctgtaat attttgatta acaaacttca aatagcccaa aataaaatct 1200 taagaataat ttataaaaaa ccactgcgaa gtcactctgc cgatttgtac aacattaaca 1260 aaaacatcat tcctgtcaga gctatttacg taattcaaac atgttgcttc atttatttat 1320 gtttacatga acaaactcac agtaatacaa aattcaaatt atctaaccat caacacttta 1380 ccagaagcca taatttatta gaccgaccca atatttcaac attagccggc gaaagatgca 1440 tatctttcaa aggtgctcaa ttatataatt atttttggaa tagatttggt gattgtcaaa 1500 gcgtatcaat atttaaaaac aaattgctct cttttttgaa tgaacccgct gttattgaac 1560 acatccttaa atcctttgat tttcttgtta catagtagac tattttgttt tccaattttc 1620 tttttgtttt ttttttctct tttttttctt tcttttttca cttttgcttc cttctaaaac 1680 tcagtttaaa aagttagttt tccgacacaa ttatctttwt tctataattt tgtctttatc 1740 gccataatct atgcactttt ttatatcatt tcagatttct cttgccagga tcaaccaaac 1800 tccttcacag gtggaaacca atggagtttg accggatatt taaaagttaa ttgtaaacac 1860 taaaatttat tgtatattgt attaaaacga tgtaggtttt tggtgctact gctttggtgg 1920 cttttcctac cgaataaaaa atcaatcaat caatcaa 1957 // ID Gypsy-617_AA-LTR repbase; DNA; INV; 342 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-617_AA_; KW Ty3_gypsy_Ele49; Gypsy-617_AA-I; Gypsy-617_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-342 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 342 BP; 109 A; 71 C; 73 G; 89 T; 0 other; tggacactcc accagtaccg gcagctaacc ccattgcaca acatgcgacc ggatgcattg 60 taagcacctt acgcacctag caacaacgac gagtgacgac aacaacgacg acgcatcatc 120 ggttggtcaa cattttaggc tacatgcaga ctcagcagat tatgctatta aatagggccg 180 ataggattcc gatcaggcag tcatcagtgg ttattatgtg aagatagtcg aattgtgtta 240 attaattgtg taaaagtgaa taattgtcga taaagtaaat gtaacagtga ttaaatcttt 300 ttttaaaaat ccgatcacga tagccctgtg attattggcg ca 342 // ID Dparag30 repbase; DNA; INV; 422 BP. XX AC GU229943; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mellifera subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dparag30. XX OS Drosophila paraguayensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup II. XX RN [1] RP 1-422 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229943; Positions 1 422. XX CC Clone Dparag30. XX SQ Sequence 422 BP; 106 A; 101 C; 114 G; 101 T; 0 other; tttgggtgcc acatgagttg acggaaaata tctcttggac cgaatcaacg cctgcgatgc 60 actgctcaaa acggaacgga ctcgtctcat ttttgaagcg gatggtggct ggtgatgaaa 120 tgtggatcac gtacgtcaaa acggcacaaa ccatcgccaa gcccggattg acggtaagga 180 aggttttgct gtgtgtttgg tgggattgga cggggaatca tctactatga gctgcttaac 240 tatggccaaa ccctcaattc ggtcctctac tgtgagcggc cagaattggt aaatagaaat 300 ggtgttgtgt tccaccagga caatgctcgg actcacacat ctttgatgac ccgccagaag 360 ctacgggagc tcagatggga tattctatcg cacccaccgt attgcccgga cctcgcgcca 420 ag 422 // ID hAT-N5_AP repbase; DNA; INV; 1039 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; hAT-N5_AP. XX NM hAT-N5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1039 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2104-2104 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1039 BP; 374 A; 125 C; 159 G; 380 T; 1 other; actaggctta gggacttcta gcatttgcat acttttgtta attattgtta aaaatagcag 60 agtgcgaaat aaattcattt ctctgtgcca tgatatcaga tctaaaattt gaatgtgcat 120 atttttgcat atttgataga ttttgcagga aagtgcttat tttaaaatta atatgcttat 180 tttgcatatc ttgcatattt tcacaatttt ttgtaaaaaa aagtatttgg aaaagaataa 240 ccagtgtaac ttggcattta ttaaattaaa tttcggatat ttacccaaat ccataacatt 300 tttggaaaag aaaggaataa aattatccga ttcattaaaa actgtcgaag atgcgaaaaa 360 taaaattgtt gacttaaaat gtacaaaagg caaagcagtc gtacagaagt taaatgacgt 420 tttagacaaa aatcatggtt acaaagcttt attaaaaatt tcaaaaattt tgagcggcga 480 agttgaagat atggaaggat tgcctgagga cttgacaagt aatgatttgg tttattttat 540 gtatatgctc caatgtcgtc tgtagatgtc gaaagaagtt tttccgcgta taaaaattta 600 ctgtcaagta atcgtcgtag attcacgttt gaaaacatca ggaaatatct ctttgtncag 660 tgtaattttc aaggtaaaga aattaataaa aaaaataaga attaaaacat ttaattattt 720 aaacattgtt ttgttcttac tttaggatcg gtagacacgg aagaacacta aaatgtaagc 780 caattacctt ttaaaaatta aatttattaa atcattttaa attcaagtac ctacctaata 840 atattctaat atagtaatat ataatataaa ttctaataaa acaaatattt aattttacca 900 catatttgtt atacttcatt ttttttgggc atatcttcgt gcatatacgg tttttttagg 960 tgcatatttt agagcatatt tggccatttt ttggtgcata ttttggtatt ttttagagct 1020 agaagtccct aagcctagt 1039 // ID Gypsy-31_DWil-I repbase; DNA; INV; 7641 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_DWil_; KW Gypsy-31_DWil-LTR; Gypsy-31_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-7641 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 3538737 3531097. XX CC Positions [5164-5739] - Reverse transcriptase CC Positions [6825-7193] - Integrase core CC 'CTGAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 4327..6024 FT /product="Gypsy-31_DWil-I_1p" FT /translation="MVIPSDADIVKKIAVQTKRNDRTNDVCGRCGAADHDE FT RSMACPARNSKCNKCSRFGHYARMCKIALKRRFGTPYDNQRKRPRTSQFQV FT RAIDEEETKCHSDNSWSNCFKITGEPEGEETVSCLVGRSQLQIIIDSGSRY FT NLISLRDWLQLNSVNATVFNVRNNSSTQFGAYASDDVLRVTNAFEAPIRIQ FT ERPEMIATFYVIENGRQSLLGRDTAVQLNVLKIGLGVNKVDSYAHFPKWKA FT SPVRLAIDHNVRPVQKPMRRFPTALEDRIAERISQAVQQDIIEPVHGPSSW FT ISPVVIAYKGSGEIRLCLDMRRANLALSRENYPLPTFDMFMTKLREARCFS FT RLDLKNAYHQLELDESSRQITTFITHKGLFRYKRLFFGVNSASEIAQRRLE FT ELLAGCPNALNYIDDVIVFGKNEEEHDKALTAVLKIFDTHNVVLDKQKGVY FT KTTKLKFLGHILSDHGIEADPEKVKVITDFRDPKTKEETRSFLGLVTYVGK FT FIPDLANTTELLRRLIKANTKFDWTNKEQQAFDTLKKQVVEVTKLSYFNKN FT LRTCLIADASPVALGAVLV" XX SQ Sequence 7641 BP; 2300 A; 1557 C; 1805 G; 1979 T; 0 other; ttggcgacga aggagaagaa ggaaaatgcg gcctaacaaa aaaaaagaaa aaaaaagaga 60 aaagagaagt gaggttatac acgaagggac gcgtgccaga cgcggccata ggaaaaaaat 120 ttggaaaaaa tatatatatg tacatatata caaattacaa agctccagtc ctgttaaggg 180 tgaagattag gttgagaggc aaactgagcc tcttattgct ggccttgggc tcagacaata 240 aaacaaatta ttttacggct aagtgccgga gtgaatattt atttaagcga cttactggaa 300 gagcgctgta ttcaaactga tttcttaata catagaatat ttcacttaag ttatgccaaa 360 attcgcatca cgccacaatg tcatggtcgg caagccgact cagcaaatct tcggcgataa 420 cggaaactct tgcagcaaaa caaaagtgtc cggttaccgt cgctgaccat tgcagctata 480 gcgcgctgat atgcaatgtc gcaattgcga gggctgctac ttaacttatg gagcacgcga 540 tagcgagggt agattatgaa cttataaaat atcctatgtt aagtgttaat tgttaactac 600 tcccccttca agtgtaaggc cgtcctcggc cgacccaatg ccggcgacga cttctcgtaa 660 ctgcattgca gacttctttg caccgatgag tcgacaaaga gtgaggccaa tgcagatcaa 720 agcacagcat atggctccga tgatgaatgc cacacgatag gcctgatcag ttccggcgtt 780 tttttttggg attgctgtat aacctccaga ttacgttcat tcaaacgatg aaggtaaggg 840 aggttgagga catcttgggt ggcagtgatg ttcagcaagg gagaattggc aatacccggg 900 actctattgc gcacgttgtt gtgattataa aacagagtcc cgataattgc gactcggtct 960 ccgaacgtga tgagatgcat gccttgtacc tggatctcag tgccgttgtc cacctgaatc 1020 cttgcggacc cgtcgttgag gattgtgatt ccgtcatcta cctgggtgat tatttccagg 1080 ttgctcggct ggatctcgca gtgtgccgtg acacctgcac ggagctcttt ggcacatgaa 1140 ctttccaaag ccagacgaca aaatgtggct ccaggtgaag cgtcgcaact ctgcagtgtg 1200 tgggtttctc catcgcattc ggctatgacg ttgtcaaaca atcgcactat gaagttcttg 1260 tggataacag gataaatcgt gatctttctg caagctcttt taatcttggg gaatttaatt 1320 ataaagtgta tcgtgttaat ggattgaaga acttttacgg acgcaacgga cataagatct 1380 cctatgggag tgttggtggg ttcctctagc caaattgatt ccaaatctgt gtggtccaaa 1440 atgtttgggc taacaatgtt ggcctttgcc agagtaatag caagcattaa attttgcaac 1500 tccatcaata gcatcctgtt tctacacagt agtgtttcaa ataaatggct agtgtcaact 1560 cgtgagtttt tggctatcct gagaattctg ttgacagtga ttgataattg gttaagctgg 1620 tcctgaactc tagtgtttat ttctactgtt tattattggc atttattaat tgcatttcag 1680 taaacctagc gtgctctaaa ctatgccaag tggctagagg taagttgaat aactatatct 1740 aacttatacg agtttgccta gtggcttaaa aaaaggtgta aaagaaaaac aagtcacctt 1800 aggttgtcct tgtggaccac cctccctcta atgaggacct cggtccccag gtctgcttcc 1860 actgccttct catcataagg gggtgagttt gtttccgagc cgtcggttgg acttgaccaa 1920 aactctgtca ccaacgtcga agaccctatt ctgccgcttt gcgttctccc tagctctgag 1980 catggtctga gcgtttctga tcttttcttg ttttttgggg tgtctatcat atttagccct 2040 agcacatgtt ttgcaatttt gtactatttc attggccatt cttgccatct ttggaaagta 2100 gtactccgag agtacctgct taatattttc ctgtggaaac tcaggtaaac atttgacggc 2160 atctagtggc tcgtcacagc gaatgtcgtt tctaattcct atagtagcat atgctactac 2220 ttgtggggcg gcaacgttag ctgctgctat ttgtggggcg gcagcgtttg ctgctgctat 2280 ttgttgagcc aactcattaa ttctttgagt caatgctcga tccctctgct cgtttgcagc 2340 ggcctgttca gctaaagcat cgctgatcgc tgaccttatt atttccctaa gctgatctgg 2400 atccatttta cctaatgggg agaaactgca gggagaggtc ttgtagccga ggcttcgata 2460 cgaatgtcac agtcggaatc tgagtcgcta aagtgcttgc gaggtgttct gtattggctg 2520 aacatgcgca ggccagtggc cgcgaaaagc aaaacgggtc gaattacgtc gtagctctga 2580 accacgagaa ctgtatgtat tgatatagca caattttaga cacagaaagt aacacaaagt 2640 aatttcatta acgatctgga tgataaattg ttttaactag aattctcaag gcagtcactc 2700 tcacgagttt cagaaacctg gttcagaata ctgttattaa ttatgatctg ttgcaatatt 2760 aaacactcac aagtactaaa ctaattttta caaaaaatta agacttttgt ataagaaaaa 2820 attttggcta gagtatccac ttacatgatt ccactaacac aaaaaaaaat gtttctttat 2880 tatgttggtc actcgcagct ccctcgctgc tgtcttaatt ttttgttggt cactggcagc 2940 tccctcgctg ctgtcttatt tgttgttgct tgttgcagct cctgtgctgc tgtttggtta 3000 ttagtggatg tggagtccga ccactagttg ggcgccaatt acaaagctcc agtcctgtta 3060 agggtgaaga ttaggttgag aggcaaactg agcctcttat tgctggcctt gggctcagac 3120 aataaaacaa attattttac ggctaagtgc cggagtgaat atttatttaa gcgacttact 3180 ggaagagcgc tgtattcaaa ctgatttctt aatacataga atatttcact taagttatgc 3240 caaaattcgc atcacgccac aatgtcatgg tcggcaagcc gactcagcaa atcttcggcg 3300 ataacggaaa ctcttgcagc aaaacaaaag tgtccggtta ccgtcgctga ccattgcagc 3360 tatagcgcgc tgatatgcaa tgtcgcaatt gcgagggctg ctacttaact tatggagcac 3420 gcgatagcga gggtagatta tgaacttata aaatatccta tgttaagtgt taattgttaa 3480 cttacataca tataagtgcg tacatgtaag tatacaatgg tgatggaaac agaatataaa 3540 ttaattatct gcatttgaca gaggccgcgt cgagagagtg caaaggcgac tctaagacaa 3600 gccaaaagaa gaaaaaaaaa gcaaacagct gagcaaggca gaactaacgc ccgtgcaagc 3660 tgaaaaatat ctggaaaatg ccagaacttg tacacagcat ggctgtccaa gcaaaatcat 3720 ctgaaaagaa tcagaggctg agcacgaaat aaagtgtaag ctaaaacaaa tcagcaagat 3780 ggctgaaatt aaaccatttc tgtgcgccaa catagaaaaa tcgttgtggc gcaacgagtt 3840 ggaaaaatgg cttcggtcat tcaatatcta tgtggacaca gaagaaattt catcggtaat 3900 taaaaagcgg aacaaattat tgcatctcgg cggaccacag ctacaagagg tggtgtacag 3960 tcttccaggc gcgttagtgg catacgatgc gacaaaggaa aatgatgaat ttactccttt 4020 ggtggataaa ttaaatgaat atttttcacc acaacgtaac gccgtattcg aaaggcacat 4080 gtttcgcagc atggtgcccg ccaaggtgat tgcttcgccg agttcttgct gcggctaaga 4140 caacaagcca gtaaatgctc ttttggagcc tccaagaccg aaatagagga aatctgcgtc 4200 gctgataaaa taattgatgc ctgggctaag actgcgttga aaaagaagct attagcggag 4260 ctcaagctgg aggagataat aaatacttgc actgttgaag agcaagtgaa tcagcaatca 4320 gaggcaatgg tgattccgtc ggatgccgat attgtgaaga aaattgctgt ccaaacaaaa 4380 agaaatgaca ggacgaatga tgtgtgtggc cggtgtggag cagcagacca tgatgaacgc 4440 agtatggcat gcccggcaag aaactcaaaa tgtaataaat gttcccggtt cggacattat 4500 gctcggatgt gtaagatcgc attgaagaga cgttttggaa caccatatga taatcaacgc 4560 aagcgaccac gaacctcaca gttccaggtg cgcgccattg atgaggagga gactaagtgt 4620 cacagcgaca acagttggtc gaactgtttt aaaataactg gtgaaccaga aggagaagaa 4680 accgtatcat gtcttgttgg gagatcccag ctgcaaataa taattgactc cggatcacgg 4740 tacaacctga taagccttcg cgattggctg caactgaaca gtgtaaatgc gacagtgttc 4800 aatgttcgta acaactcgtc tacccagttt ggagcctatg cttcggatga cgtgctgagg 4860 gtgactaacg cttttgaagc tccgataaga attcaggaaa gaccagagat gattgctaca 4920 ttttatgtaa ttgaaaatgg tagacaatcg ttactaggac gggacacagc cgtacaactc 4980 aatgtcctaa aaattggttt aggcgtaaat aaagtggatt cgtacgccca ttttccgaaa 5040 tggaaagctt cacctgtgag attggccata gaccacaatg tgagaccagt gcagaagcca 5100 atgcggagat ttccgactgc gttagaagat cgaattgcag aacgaataag ccaggctgtg 5160 cagcaagaca ttatagagcc agtccatgga ccgagttcat ggatttcacc agttgtaatt 5220 gcgtataaag gaagcggcga gataaggcta tgtctggata tgcgaagggc gaatttggcc 5280 ctctcgaggg aaaactaccc actgccaaca ttcgacatgt tcatgaccaa actacgggag 5340 gcgagatgtt tttccagact ggatttgaaa aatgcttatc accaattgga actcgatgag 5400 tcaagtcgac agattaccac atttattaca cacaagggac tatttcgata taaaagactg 5460 tttttcggag ttaattcggc ttcagaaatt gcccagaggc ggctggaaga attgttagca 5520 ggctgcccca atgctctgaa ctatattgac gatgtcatcg tcttcggcaa aaatgaggag 5580 gaacacgaca aggcgctaac tgcggttctg aaaatctttg atacccacaa cgttgttttg 5640 gataagcaaa aaggtgtgta caaaacaaca aaactcaagt ttcttggcca tattctatcg 5700 gaccatggta tcgaagcgga tccggagaaa gttaaagtga tcacagactt tcgtgatccg 5760 aaaactaagg aggagacaag gagtttcctg ggcctcgtaa catacgtggg gaaatttata 5820 cccgatctcg ccaacacgac ggagctactg agaagattaa taaaggcgaa taccaaattt 5880 gattggacca acaaagaaca gcaagccttt gatacgttaa agaaacaagt cgtcgaggtg 5940 acaaagctgt catacttcaa taaaaatctc cgtacatgtc ttatagccga tgcaagtcca 6000 gtggcactgg gagctgttct ggtataggtt gatgatcaca acgaccatca tatagtatcg 6060 ttcgccagta aaagcctaac tgatgttgag aagagatatt cgcaaacgga gaaagaaagc 6120 ttggcactgg tatgggcagt ggagaagttt tactactacc tagctggact gcaattcgat 6180 ttgataacgg atcataagcc attggagtcc atcttcaaac caacttcgaa accaccggca 6240 cgtatcgaac gttggttact acgcctacag tcctaccgat tccgagtcat ctataaagca 6300 ggaaaggaaa acatcgcaga ttccgtgtcg cgtctttgtc agcttacaga agcagtagct 6360 ttcgactggc aggatgagca gattatcttt caaattgtag ccaaaggagt ctcaaagtct 6420 ctaaccatat cggaaatcat ggaaaaaagt atgcaggacg aagaaataat tgatgcaatg 6480 acctatctta gcaccaactc atgggaggcg aactcatcca gtccatatta tctgttccga 6540 tgggagttat cgacagttga aaatgtactc ttacgcggaa cgaaaatagt tataccaaca 6600 atactacgat cgcaagtgct cgaactggct cacgagggcc atccgggtga atcggctatg 6660 aaacgaagat tacgttcaaa ggtgtggtgg ccaaggatcg accgggaagc agaaggattt 6720 gctaaggcct gtagagactg catctccgta tcgctagcaa gtaaaccagc gcatatggat 6780 cggcatgcgt ttccagcagg cccttggcaa tgtgtcgcat ctgacttact aggaccgttg 6840 ccgagcaacg actatgtact ggtgttgatc gactattttt ccagatacat gaaaccattt 6900 cgtcgaaggt gctgatcgag caaatggatg agatattttc tcgtttgttt tctctcgaac 6960 tccaagattt ttgtcggacc aacggtatta ctgaaataac gacaccacca tattggccgc 7020 aggcgaatgg cgaagttgaa aatatgaaca agtctttggt gaaacgcctt aaaattgcgt 7080 ggtcgaaagt gaaagattat aaaagagaga tccaaaactt tgtcatgatg tacaacgtaa 7140 caccacatgg aaccacagaa gcagctcctt cggaattgat gtttaaccgg atcattagag 7200 acaagatacc ggatgttcga gacgttgtgg gtgatgtcgg tgaatcctcg gaaagagact 7260 tggattgctg gaacaaacag aagggaaaaa agaaagcaga caagggaaga ggcgccaaag 7320 catcggatat ccaacccgga gacaaaattg tgctgaagaa tgtcttgtgc ccacataagc 7380 tgacacccac gtttgataca accgagtacg tggtagtcaa cagaaaaggg aatatagtcc 7440 aagtgcaagg aggagggaaa acgctaacca gggatgtgag tcatttaaaa aaaattccag 7500 ttacaggtac ccatgaagct gcaacgacaa ccgatagtgt tcagggcccg agcacaccca 7560 gacaaacgtc ccaagcaccg gaaaatgaag cctgcgctca ggacgaagga ttgaagctga 7620 agctgaaaaa tattggaggg a 7641 // ID CR1-30_BF repbase; DNA; INV; 2263 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-30_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-30_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2263 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2263 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1601-1601 (2009). XX DR [2] (Consensus) XX SQ Sequence 2263 BP; 807 A; 457 C; 537 G; 462 T; 0 other; gctcagagga tctcactaga ccagagtact tcaagtacaa taagggggac tatcaacaga 60 tggcgcaaga acttgatacg gactgggagg aactcctact aggcaaatcg gtagaagaaa 120 gttggctact ttttgccgac agggtaaaag gcgccacaga gaaatgcata ccaaaagcga 180 ctctaaaaat gaaagaaggt ctaccatgga gaaacaggga cctcaaaagt aagctgaaca 240 aaaaacaaaa agcctggaag aaatatcaag aatcaaaaaa agacaaagat tacaaaagct 300 atacaaaggc aaggaaccaa gctcgctggg cagccagaaa agcagcaaaa aactacgaga 360 aagaaatggc atacaacatt aagggcaacc caaaactgtt ttggaaatat gtaaatagta 420 aatccaagac aagagagagc atcccagaac taaatgatgg aaacgaaatt gcaaaaacgg 480 acagagaaaa agcagaaact ttaaataaat tctttgtgag cacgttcact gaggaaagaa 540 ctggcaccat accacaagca aaaaaacaaa atgtgaatgc actgctgtct atggttgaac 600 tgtcagtgga ggacattggc aagagactga aagaggtaaa ccagaataaa accatgggcc 660 cagacgggat tcatccgaga attctgaagg aactggcaga aatcctggca aagccactgt 720 acattatttt taataagtca ctagaagagg ggaggttacc gaatgactgg aaagtcgggc 780 atatcacgcc aatattcaag aaagggagta aaaaactacc gagtaattat agaccagtga 840 gcttaacatc agtaacagga aagatactgg aaagtatcat cagagacgag atcgtagatc 900 atatgcgagc tcacagtctg tttacagaga accaacacgg cttcctaccg ggaagatcca 960 cgacgacaca aatactagaa tgtcttgatg actggacgaa ttggattgag caaggctggc 1020 cagtggacgt aatatatctc gatttccaaa aagcttttga ctcggtacca attgagagac 1080 tactgagaaa ggtcgaaagc tacgggatat gtggtaactt gctccaatgg gtgaggtctt 1140 tcctgacgga acgcaagcag cgagtttgcg tgaacggagc acgctcaagt tgggcaaatg 1200 tcactagcgg agtaccgcag ggtagcgttc tcggtccggt tctatttaca cttttcatca 1260 acgacatgcc agaagtagta cagagcaaat caaaactttt tgcagacgac acgaaagttt 1320 atcgaagtgt atgtaacatc aaggactgtg aatacctaca gaaagacctt aacagcttgc 1380 aagaatgggc aaacaagtgg caattaaggt tccaccccag caaatgtatt gttctgaggg 1440 tgggctctaa gcacccagag tttacatacc acatgattga ccagtctgcg gcgaaggtga 1500 acctgacctt caacaaagag gaaaaagatc tgggaattac tatagatcaa gacttgatgt 1560 ttgaaaatca catcttatcc atagcatcaa aggccaatca aatggcgggt atcatgtggc 1620 gaacattcca ccatgtggat aaggaaatct tcactctgtt atataagtca ctaataagac 1680 cccacattga gtatggtggc ccgatatggt ccccgagtac gtggagattg actgacctct 1740 tagaaaatgt gcaaaggaga gcgacgaaaa gggttcccgg tctgaaggac ctatcatatg 1800 aagaaagact ccgagccttg aaattgccaa cactactgta tcgacggctg aggggagacc 1860 tcatcaatac gtacaaatat gtgcatggac tgtatgacac aaagccttgc ttaacagaaa 1920 aaataaggga cagcagaact aggggtcaca gtctacgcct aacaagatgt gcatcaaaca 1980 acagcagaag acaggctttc ttcagtagta gagtggtacc ttggtggaat gggttgacag 2040 aggaggtggt aacagcaccg tcagtaaaca gttttaagat aagacttgac agactgatgg 2100 agaaccaccc agtagtgtac aactacaggg ccttggataa tccccacaaa ccaaacatca 2160 cagtcagctg attgcagaga ggaagaagca gttgatcagg agcggttcca tggagtgcga 2220 actcctactc aaccgaagga cctctactct actctactct act 2263 // ID Gypsy-19_CQ-LTR repbase; DNA; INV; 2107 BP. XX AC AAWU01010823; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_CQ_; KW Gypsy-19_CQ-I; Gypsy-19_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2107 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 418-418 (2011). XX DR GenBank; AAWU01010823; Positions 23678 25784. XX SQ Sequence 2107 BP; 527 A; 525 C; 536 G; 519 T; 0 other; tgtaaccgtt gtactaaatt ggtttatttg attctaaaaa acatcacaaa tttagcgctt 60 tgcgcaaact atatttatat aacattttca tttattatta ttatcttatt aattgaattt 120 aaaatacgac ttcgaagata gcccctttgt aaagacgctc actgcttcac agtgagtgac 180 ggagtgcaag cactcgcgtt tacccctgat gattaggcaa cccccaatac gtgagttcgc 240 tgcgcgcgct ttgacgagcg gtcgctcaaa ccgacacgtg attggaactc agttacgatc 300 ttaagtggcc gcacgtgcag attctgaacc aagggtttga ctcgtttatc ttcgtgagaa 360 ttggaaaagg agagcgtgta ggccgtgtac cttcagcgta ccgagtggga gcgggacaca 420 ctattagaga gcgatatttc ccgaataggc taaaaccaac gcaacgttta cgtgcatccg 480 gtgggttcgt acgtaacttc ggcgatcagt ttcccaggct cgagtctgag aaattttagt 540 aactcgatca cattctccta acccgtggaa cgattcggaa gtgcagtgaa tttgctggag 600 ttgtaacgta ggtaatctgc gcgaggttcg atctcgggtg ttaatttaaa gttaactttt 660 cagaagctag gccgatagaa aagtagtgcg acagatagct agcttttgtg acctttgtgc 720 gagaacctgc gaagaaaggt aatgtagtca gtgtgcgaaa tagtgcgtgc tctaaaacga 780 tcgtttcgac ggtccgtgtt ccctttaggc agcgtggtca caagtttact tgtctccggc 840 ggcttccctc gtccgcggac gggattgctg gccaagctac aaccctgtgg ccagccgcca 900 cgccgtcaac atcgtggcag aaaacccccg gtcgtaccgc aaccttcgac cgagccactc 960 cagtcacctg tggccttttc gacgtaaacg cctggccgca ccgcaaccta gttgccgcgg 1020 cgatcttcgt cgattcgtga ggattcttca cgaatcaccg ctcgaaactg cctggccccc 1080 aacaaccacg tggcccgcag cgaacgcctc gaacacgccg tgacgcggtc atggaccgca 1140 ggtcggcaga cctgaagacc gaccacgtgg caaatcgatc ggttccagat gcctggtagg 1200 tgagtcgatc atccgccaca caaccatcgt gatcgcacaa cagcagcagt agcaggtagc 1260 agaactcgca gagcagcagt ctacggcgcc ggcaagggcg aagtagagga gaaagaacgc 1320 gcacaacgac gacgacgacg acgggctcgt aagtacaaaa ctctgtgtgt tagttgtccg 1380 tgtgcacatg tgacgtcccc tcgaaccaca gagttcgcaa gcatgtacaa cccgcgctag 1440 cccgcaccgc tagccccccc aaaaacgccc cattgttgaa gatctggaga agaattaaga 1500 gcacgcacac tgtaggatag ggtggtcgta attagaagat ggattgtaac tattagggaa 1560 taaaactgaa ctcaaaaatg ttaatctgcg atctttaaat gccgtcttta ataaagtccc 1620 tatttaaccc tactcaagtt gctttggttg atgcttggga aagaatttag ttatgagcgc 1680 ttgcgctacg cttcagcggt ctgccgcgca ccggtcactc tgcgacacct ggtggtggga 1740 tcagtaacta tcgatgatga atgagtattg atgttgggag cattcgttga gcacgttggt 1800 tttaggtcaa ttcgggcatt atgggctcgg ataatactga agttcggtaa gtcggacgca 1860 gggaaactgc ggggaaagct gaatcgttct ccaaacctct tctcacgagt aaactggacc 1920 ttcctaagga agagtctcaa gctgggcaac tttagatcag gcgggtggtc ctctaacgag 1980 gtggcgctta agctatccgc ttcttccagt gcgcttcggc gatctcgcac atttccctgt 2040 gacattttct tggaattttt ccctgtaccg taccatgacg ttctggtcca acgtccgtac 2100 ggttaca 2107 // ID CR1_Ele13 repbase; DNA; INV; 4460 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele13. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4460 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4460 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 702..1529 FT /product="CR1_Ele13_1p" FT /translation="MPSVCEKCANDLVGEIIKCGGFCTGQFCLRCSKIPSE FT LYPTIKTHQHLVWMCDACRNLLKNSRFTKAMTSISAANDCIVEALKTDIRD FT SILTEVRNEIRSSFKKMIDSVPLTPLNPIPLQLPFSSRKKRPRDAENEDEN FT NRRPAKLICGTSSSSNAVVPTVNIAEPRQEQFWLYLSGIHPSVPDENIKQL FT AQESLGTNELTVAKLVPKDKDIRSLTFISFKIGMPSNLKEKAMSATTWPIG FT IKFREFESAGTTQPVFWIPPVTQPETNNPTAIQSM" FT CDS 1439..4399 FT /product="CR1_Ele13_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="VRIGWYYTTGFLDTTGDSAGNEQPDSHSIDVIPVQIP FT SVHDATIPVTVYYQNVRGLRTKTNALHLSLSSCDYDIIAFTETWLRDDISN FT AELSSNYSLYRCDRSSSTSNLTRGGGVLIGVRKSLQCSFVDVPNACKLEQV FT AVKVSLANLSMYICCIYVRPNSDPEIYRQHSRSVRYLLDLANDTDMVVVLG FT DYNLPNLSWNFDDDLNAYLPANASSEHEITLTQSILDTGLSQVNDITNCNG FT RLLDLSFVSDTQKIESLEPPNSLLPIDAHHRPFVLKCDISSVTDTQDNIFC FT YDFKRCDVEVLNDALARVDWSSLLTDCHIDTAVSRFYETINNIIAELVPLK FT KIWRTHHKCPWWNRELRTMRNRLRKFRKKYFKHRSADNKQSLQRMENLYDE FT LRTTAFRNYIRRLESDFKHDPFSFWSFVKSRKHGQGVPSDMVYRSCKASSF FT RDTANLFADFFSTVYSNQQFNAVQTASNSSVHLPRLEVSTNDVHTALSDVD FT PTKGAGPDRLPPVFFKKCASSLSLPVSIIFNKSLLDGIFPEAWKIASITPI FT HKSGSIHDVENYRPISILNCLGKVLEKMVHEVLYRSVRHVISDSQHGFMKN FT RSTTTNLMTFANKLTKRIEKRQQVDAVYIDFSKAFDKVPHTLAVEKMRHIG FT LPDWIINWTSSYLTNRKSYVKISDAFSDTFSVPSGVPQGSHLGPLIFIIFV FT NDICSLLRCDCLMFADDLKIYRTVLSPLDCCALQEDLNIVLDWCGRNGMQV FT NISKCKVISYSRQRSPFFFPYRLDSEQLERVDKIRDLGVVIDSKVRFNEHI FT SIVTAKSYALLGFIRRNTAAFDDVYALKTLFCSLVRSTLEYAVQVWAPYHN FT EQCHRIESVQKSFLRYALRRLPWSNPIELPAYEDRCQLIDIETLANRRKKL FT RRLFVFDLIMNRIDCNYLLGNICFYAPVRTLRNRSILWIPTRRTQYAFFDP FT FDDSCRLFNDVSDKFDLNISKNVFASRIRNLD" XX SQ Sequence 4460 BP; 1215 A; 1102 C; 908 G; 1234 T; 1 other; actggcaaca ctgtcgctac aagttgttat tgtttgtgtc tcgctccaca taaattgtgt 60 ttcttcgctg ttaccgatcg cctaaaaacc accgctcgtc aaattccacc tacaacaacc 120 tcccgcaccg tgttcattcg ctgctcacct gtatcaaaca aggtagggct gtgaaatagc 180 tccgcaaaat ttgggcttga gagtgattag ctgtgcatac ccagccatca tttcacctcg 240 ccactcaccg ccgcctccac cacccaaaac tcgccactta ccgccgcctc accgtcgccg 300 ccaccaccct aaaacactcg aaccaacaag caacccacac gatagtgcgt taatattcca 360 cgtcaagcca ctgtaacgaa aaccaccacc aaattcgcta ccaataaaca tccacctgct 420 gctgtgtttt catttcccga ccatccgaaa gaatctccag tacctgccaa aaataaacaa 480 tctgcatatc atctggtcat ctggtcacga tmacatcgct acatcatcaa cgtcattgtt 540 gcaaggtaac ccacacatcg gtgctcgttg tacgtgttga tcgaagcata tctccatttg 600 acatcccgtc attcgtcgtt tgtttgagat cagcgccatc tgttggtgga tacgtgattc 660 agccctattt atccctttca agactggttg cataattcga aatgccttct gtctgtgaaa 720 aatgcgcgaa cgatctggtc ggtgaaatca tcaaatgtgg cggtttttgc actgggcaat 780 tttgtctcag atgctctaaa atcccatctg agctgtatcc gaccataaaa acacatcaac 840 acttggtgtg gatgtgcgat gcgtgtcgca atttgttgaa aaactcaagg ttcacgaaag 900 caatgacatc gattagtgca gccaacgatt gcatagttga agctcttaaa acggacattc 960 gagatagcat attaacagag gttcgcaacg aaattcgttc gagttttaaa aagatgattg 1020 attccgttcc actgactcct ctaaacccta ttccgctgca actgccattc tccagcagaa 1080 aaaagcgccc acgtgatgct gaaaacgaag atgaaaataa tcgccgccca gctaagctaa 1140 tttgcggcac tagttcgtct tctaacgcgg tggtcccgac ggtgaatatt gctgagcctc 1200 gtcaagagca gttttggctt tacctgtctg ggattcatcc cagtgtacct gacgagaaca 1260 taaagcagct tgctcaggaa tctttgggta ccaatgaact caccgttgca aagttggttc 1320 ccaaggacaa ggacatacga tcgctcacct tcatttcttt caaaatcgga atgccatcca 1380 acttgaagga gaaggcaatg tctgctacaa cgtggccgat tggtatcaaa ttccgtgagt 1440 tcgaatcggc tggtactaca caaccggttt tttggatacc accggtgact cagccggaaa 1500 cgaacaaccc gacagccatt caatcgatgt gattcccgtt caaataccat cagttcatga 1560 cgccaccatt cccgtaaccg tatactatca aaacgttcgc ggtctacgca caaagacgaa 1620 cgctctacat ctctcgctga gctcgtgcga ttatgacata attgccttca ccgagacttg 1680 gcttcgtgat gatatctcca atgcagaatt atcttcaaac tactctctgt accgttgcga 1740 ccgcagctct tccaccagta atctaacgcg tggtggtggt gtgctgattg gggtgcgaaa 1800 gagtttgcag tgtagtttcg ttgatgttcc gaacgcctgt aaactggaac aagttgcagt 1860 caaagtaagt ctcgctaacc tgtcgatgta tatctgctgt atttatgtta gaccaaatag 1920 tgatccggag atttaccgac aacactctcg cagcgttcgg tatttgcttg atctcgcgaa 1980 cgacaccgac atggtcgttg ttcttggtga ctataacctg ccaaacctct cttggaattt 2040 cgacgatgat cttaacgctt atctacctgc caacgcctct tctgagcatg agatcacact 2100 aacccaatcg attctcgaca ctggactctc tcaggtgaac gatattacga actgcaatgg 2160 gcgtttgttg gatttgtcat ttgtatctga cacccaaaaa attgaatctt tggaaccgcc 2220 aaattcatta ctcccgatcg acgcgcatca cagaccattc gttctgaaat gcgacatttc 2280 gtctgtaact gacacacaag acaatatttt ctgctacgat tttaagagat gtgatgttga 2340 agtgcttaac gatgcgttgg ctcgagtgga ttggtcatcc cttctaaccg attgccatat 2400 tgacactgcc gtttctcgtt tctatgaaac cataaataac atcatagccg aactagtgcc 2460 cctcaaaaaa atctggcgaa ctcatcacaa atgcccctgg tggaaccggg agttgaggac 2520 aatgcgaaac aggcttcgaa aattccgtaa aaagtacttc aagcaccgat cagcagataa 2580 taagcagtct ctccagagaa tggaaaacct gtatgatgag ctgcgaacaa ccgcatttcg 2640 gaattacatc agacgactgg aatccgactt caagcacgac cccttttcgt tctggtcttt 2700 cgttaaaagc aggaagcacg gccagggagt cccgtcggac atggtttacc gcagctgtaa 2760 agcgtccagt ttccgagata ccgcaaatct ctttgccgac tttttttcaa ccgtttatag 2820 caaccaacaa ttcaacgccg ttcaaactgc ttctaacagt tccgtacatc tacctcgcct 2880 ggaagtatca acaaacgatg tccatacagc gctgtcagat gttgacccga cgaaaggtgc 2940 tgggccggat cgccttcctc ccgttttctt caaaaagtgt gcgtcctcgc tctcgttacc 3000 agtatcgatt attttcaaca agtcgttatt ggacggcatc tttcctgaag cgtggaagat 3060 tgcatctatt acgccaattc acaaaagtgg cagtatacac gatgtcgaga attatcggcc 3120 gatctcaata ctaaactgct tgggtaaggt acttgagaaa atggttcatg aagttcttta 3180 ccgctccgtc cgacatgtga tatctgatag tcaacatgga ttcatgaaaa accgttcgac 3240 gactacaaat ctcatgacat ttgcaaacaa attaacgaag aggatcgaaa agaggcagca 3300 ggttgacgca gtttatattg atttctccaa agctttcgat aaagtgcccc acacgttagc 3360 tgtggagaaa atgcggcaca ttggtttgcc cgactggatc atcaactgga cgtcatcgta 3420 cttaactaat cgtaagagtt acgtaaaaat tagcgacgca ttttccgata ccttctccgt 3480 tccctccgga gtcccacaag gcagccacct tggcccgtta atttttataa ttttcgtcaa 3540 cgacatctgt agtcttcttc gctgtgattg tttgatgttc gcggatgact taaaaatcta 3600 ccgcaccgta ttgtcaccgt tggactgctg tgctcttcag gaagatctga acattgtttt 3660 ggactggtgc ggaaggaatg gaatgcaagt gaacatctcg aaatgcaagg tgatatccta 3720 cagtcgtcag cgctctccat tcttcttccc gtacagactc gacagtgaac aattggaacg 3780 tgttgacaaa attcgggatc ttggcgtcgt tattgactct aaagttaggt ttaacgagca 3840 tatctctatc gtcacagcca aatcgtatgc actattgggt ttcatccggc gcaacacagc 3900 tgccttcgac gacgtgtatg ctttaaaaac tcttttttgc tccttagtgc gtagcacttt 3960 ggaatatgct gtccaagtct gggcaccgta tcataacgaa cagtgccatc gaatagagag 4020 cgtccagaaa tcgtttctta gatacgcgtt acgtcgtcta ccgtggtcca atcccatcga 4080 attaccggcc tatgaagacc gttgccaact aattgacata gaaacgctag cgaaccgtag 4140 gaaaaaattg cgcagattat tcgtgtttga cctgattatg aatcgtattg actgtaatta 4200 tttgttaggt aatatttgtt tttatgctcc tgtacgcact cttcgtaatc gttctatttt 4260 gtggattcct actagacgca ctcaatacgc tttttttgac ccttttgacg atagttgtag 4320 attatttaat gatgtttccg ataaatttga cctgaatatc tctaaaaatg tttttgctag 4380 taggataagg aatctagatt aagtcacaat ctgtgcaacc ataagtgttg aagatgtaga 4440 accaaataaa taaataaaaa 4460 // ID Gypsy1-LTR_Dpse repbase; DNA; INV; 201 BP. XX AC Unknown_group_134; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_Dpse; KW Gypsy1-I_Dpse; Gypsy1-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-201 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1030-1030 (2009). XX DR Genome; Unknown_group_134; Positions 17 217. XX SQ Sequence 201 BP; 49 A; 65 C; 64 G; 23 T; 0 other; tgcaggacga agggaacaga cacccactga ctcgcgggaa cgaatgggac tcgccccttt 60 gggccgagtg gaacctggcc ggtcaaggtc gggcagccac ccctccagca gtacaccacg 120 aggacgaatc cggggacggt gctgaccaca ctactccgac gtgtcccacg caagcgatac 180 aggcggggaa gtaggacccc a 201 // ID Gypsy-620_AA-LTR repbase; DNA; INV; 1256 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-620_AA_; KW Ty3_gypsy_Ele185; Gypsy-620_AA-I; Gypsy-620_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1256 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 1256 BP; 357 A; 286 C; 273 G; 329 T; 11 other; tgtaaagcca ttaaaaaaac caatgatcca atttgttctt aaatccaatc ggttccacta 60 gactaatatg ctcaatcgcc aacmttacat cgcaaagcga aatgcatgac aaaagctcag 120 cacggacaac aacccctatt cgcccttctt tgcgaaaggg aaattactga gtgcgttcgg 180 taaaatgatt ttgaaattgg ctccactctc tcgttctctt aatctctatc cgcatgagtg 240 ctttgcctgg tgtttaaatt gcgcctttaa tctttgcata ctgcaatgca cggattctca 300 aatggcttta aaaatgttaa accaaagggg attctttgtt ctgcatggat taggcagttt 360 ctgttgaaac aataagtagt ggcgcttggc caaaccaaca atagctcact tgagtatcga 420 tttcctacca gtgtggatgc gttaagctat agctaatctg tagtcagaag tgcagtatag 480 ttggcatkga agccaacttw gaaaattgtt ctccwagtgt gatctaaacc gaaaatagtg 540 cctagaaaat ccaagtaata ttcagattgg taagactttg tgaawcctag tatwaagtct 600 gtacctaawt ttgaatttcc gacttaactt ktagtkagta catgttactt aggcaacaca 660 cgtgaacaac cgagkgctag ttgtaattta cctagtaaat ttcgaggtaa gccaatgtga 720 ctgaaacaag taggctccaa ttaaacwaat acgttgcagg aaacccctac aacgtggggt 780 gttcgtgtag gctcggatta agcaccttca gccacccacg tgcattgcat cctcggtgtg 840 gcctgcagtc ggattaccgc caataagctc tcctgagaac cacttctccg ctcgcctctc 900 atctgcgata cgactaatcc agcaatactg ctcgaacgca tcgactggtc gaattccatc 960 acgttttcga atggttcatt gcctaacccg tctgaggcaa agcggctaaa ttatctgcgg 1020 gtgaagtggt gccgagtgag ctgcgaatcg acgactgcaa accgacgcct ccccacgtgg 1080 tgaaccattg caaggtagga gtgaatcccg catgcgaagc gattgaccaa tagttgaacc 1140 ttgtcggggc cccgatcaac gcgtcgttgg accaccggtg ccggcaagta ctaaaaaaga 1200 agaacctagg ttaagagaag ttagggtagc gtttagtaga tcgatgtgca cgaaca 1256 // ID Gypsy-25_DYa-LTR repbase; DNA; INV; 237 BP. XX AC chr2h_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_DYa_; KW Gypsy-25_DYa-I; Gypsy-25_DYa-LTR. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-237 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2h_random; Positions 3469780 3470016. XX SQ Sequence 237 BP; 97 A; 32 C; 42 G; 66 T; 0 other; tgttagcagg gatggtataa gtaccatccc atttaccatt tagtatttag tagtgtgtat 60 tcagcattta gtattttagc atttagcatt agatcagaga agtagagatt atgtaaagaa 120 gaggcctacg gacacacttg attattggac acgagagaga taacatataa acagaataaa 180 aacacaatta aaaatcaaaa gaattaaaat acatcttaaa cctaaacagt ggtaaca 237 // ID Gypsy-32_OD-LTR repbase; DNA; INV; 189 BP. XX AC CABV01001277; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_OD_; KW Gypsy-32_OD-I; Gypsy-32_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001277; Positions 9198 9386. XX SQ Sequence 189 BP; 47 A; 48 C; 37 G; 57 T; 0 other; tgtgaggttc cgcgagctgc accatatttg gtcattctca actcgcgtcg ggccgccgta 60 ttttcaaggc tgcaaaagcc ggacaccttt cgctgccttt tcgaaaactt gttcacttaa 120 attttgacga atagataaac aaccgctaat tgactttttc gtatttctcg cagatcacgg 180 aatctaaca 189 // ID RTE-3_PPac repbase; DNA; INV; 2915 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 2) XX DE A family of RTE non-LTR retrotransposons: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-3_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2915 RA Jurka J.; RT "RTE non-LTR retrotransposons from nematodes."; RL Repbase Reports 10(7), 1062-1062 (2010). XX DR [1] (Consensus) XX CC ~99% identical to consensus. CC This sequence was derived from sequence data generated by Genome CC Sequencing Center at Washington University School of Medicine in CC St. Louis. XX FH Key Location/Qualifiers FT CDS 95..2884 FT /product="RTE-3_PPac_1p" FT /translation="MTIFMALLSLATLNCLTLSKDARIIELENALDKIKYD FT IVGLSEVRRKSAGEMDLSWSNGRLYHSARLPNHTAGVGFIVSGSVKQKVVR FT FCVLTARICFLDVAISDGILIRVFQIYAPASIDMLEYSTFIHEVEQAFHQP FT VTGSHRYRFVHKVILGDFNGKVGVCEPGEXSVGNFGYGNRNDKGQIVVDLC FT ERLHLRVGNSLFRKRDARKFTWVSPNGRARNEIDFILFPKGIRVMDVDVVN FT RLXFSTDHRMVRSFISFPFRFRRGRNRPAKGDAHLEKDLFKFSLEAEMKRV FT GGRVEYGTVVSAVQTAGKIATTFTPSGKRVSQKTRDLLIRRRELKIDSVGV FT SRVEWLLVNRAIRESFKADRDQGYLXYINEAIRMGKGFKRAPVKLAIGRER FT ISRIRNGNGDLELTQAGIVEAFQSFYNNLYKGDGGIGYREVIERDPFVPIC FT ASEVDQVLRHFKSGKAPGKDRITGEMLRIGVDQLVPTLTTLFNNIIMGNIP FT QGFGDSRTILLPKTGDATLPKNYRPISLLPVLQKALTGLLNKRYAPVLDRN FT KSWEQMGFRQGHSTIDAIHTLKSVIAHCVDYRLPIYLLFVDISKAFDSVFT FT EAVFRSVEMEGIPSEVVSFLAGLSQGSKNALVVNGSEVEIKVGRGVRQGDC FT LSPRLFTAVLDRAFRQLDWSKKGICINGRFLSHILFADDAVIISHDERQIQ FT SMAVELEKVLAGVGLQLNGDKTIGMTSKPLPRIVKVAGKVVKLQEKVVYLG FT SGITINGTDDWEITRRIQAGYGAFHKHRSFLINRGIPMAHKRRLFNGCILP FT AVLYGCECWALTNAQMNRLSVAQRRMERWMVGCTVLDHVSNERLRDSTRIR FT DFVRDAQKRKWFWLHKIANDDNWKWSRSVIEWFPTRKRRRGRPMTRWSDIF FT RKTVGPNFLNEARKASWNAMHIRALT" XX SQ Sequence 2915 BP; 801 A; 586 C; 778 G; 745 T; 5 other; ttcgagtgcg gcgctcgatt acctgccggt cggttcgcat ttgcgctccg tctctcgttt 60 cctctaccaa gccggttggt caccgtggta gagtatgacc atattcatgg ctcttctctc 120 tctggcgact cttaactgct taacgctgag caaggatgcg cggattatcg agctggagaa 180 cgcgctggac aagatcaagt acgatatcgt cggcctatca gaagtacgga ggaagtcagc 240 gggtgaaatg gatctctcgt ggagcaatgg acgactctac cactctgctc gtctacctaa 300 ccacactgct ggagttggtt tcatcgtgag tggatcggta aaacagaagg ttgttcgatt 360 ttgcgttcta actgccagga tctgtttcct cgatgttgca atttcggatg gaatcctaat 420 tcgagttttc cagatctacg cacctgcctc aattgacatg ctcgagtaca gtactttcat 480 tcacgaggtc gagcaagcgt ttcatcagcc agtgacaggt agccacagat atagattcgt 540 gcataaggtc atcctgggtg atttcaatgg gaaagtgggg gtttgtgaac cgggggaacn 600 atctgtggga aactttgggt atggaaatag aaacgataag gggcaaatag tggtcgattt 660 atgcgaaagg ctgcatctac gggtgggaaa cagtttattc cgtaagaggg acgctagaaa 720 attcacatgg gtttcgccta acggacgcgc naggaacgaa atagatttca ttttatttcc 780 aaaggggatt cgggttatgg acgtagatgt ggtgaatcga ttagantttt ctaccgatca 840 ccgcatggtt agatccttca tttccttccc ctttcggttt cggcgnggga gaaatagacc 900 ggcaaaggga gacgctcacc tcgaaaagga cctgttcaag ttcagcctag aagctgaaat 960 gaagcgcgta ggagggaggg tagaatacgg tactgtggtt tcggcggttc agacggcggg 1020 caaaattgct actactttca cccccagtgg aaagcgggtt tcacaaaaga ccagggactt 1080 actaattaga aggagggaac ttaaaattga ttcggttggt gtatctcggg tagaatggtt 1140 actggttaat cgggcaataa gggaatcttt taaagctgac agggaccaag ggtacctcna 1200 ttatatcaat gaggcaatta gaatgggaaa gggtttcaaa agggcaccgg taaaattagc 1260 tattgggagg gaaagaattt cgcgcattag gaatggtaac ggagacttag aactaactca 1320 ggctggcatt gtagaggcgt ttcaatcttt ttacaacaac ctttacaagg gagatggtgg 1380 gataggatac agggaggtta tagaacggga tccatttgta cctatttgcg cgtcggaagt 1440 agaccaagtg cttaggcact tcaaatctgg caaagcacct gggaaggata ggataacggg 1500 agaaatgcta aggattgggg ttgaccaact agttcctact ctcaccaccc tttttaacaa 1560 catcattatg ggtaacatcc ctcaaggttt cggtgactcg cgcactatcc tcctgcccaa 1620 aactggtgac gctactctac ctaaaaacta cagaccaatc agtttgctgc ctgttttaca 1680 gaaagcatta actggactgc tcaataagag gtacgcgccg gtcttggata ggaacaaaag 1740 ctgggaacaa atgggtttta gacaaggcca ttccactatt gacgcaatcc atactcttaa 1800 atcggtcatt gcacattgtg ttgattacag gctgccgatc tatctgcttt ttgtggatat 1860 atcgaaggct ttcgacagtg tattcacaga agcagtattc agatcggtag agatggaagg 1920 aattccgtcg gaggtggttt cattcttagc gggtctctct cagggaagta agaatgcgct 1980 ggtggtcaac gggagtgagg tggaaattaa ggttggaagg ggggttcgtc aaggtgactg 2040 cttgtcccca agactcttta ctgcagtttt agatagggca ttccgccaat tagactggtc 2100 taagaaggga atttgcataa atggtcggtt cctttctcat atcctatttg ctgatgatgc 2160 tgtcattatc tcacacgatg aacgacaaat tcaaagcatg gccgtggaac tagaaaaggt 2220 tttggcggga gtgggactgc agctcaacgg ggataagacc atcggaatga catccaaacc 2280 cctacccaga atagttaaag tagctggaaa ggttgtaaaa ttgcaggaaa aggtggtata 2340 cttgggaagt gggataacaa ttaacgggac tgacgactgg gaaataacac ggagaattca 2400 agcagggtat ggggcatttc acaaacaccg ctcatttctc atcaatcggg gtatcccaat 2460 ggctcacaaa cgtcgcttat tcaatggctg cattctcccc gcggtattgt atggttgcga 2520 atgttgggct ctgaccaatg cacaaatgaa tcgattatca gtggctcaac gtcgaatgga 2580 aagatggatg gtaggttgta ctgtgctcga tcatgtctct aatgagcggt tgcgagattc 2640 aactcgaatt cgggactttg ttcgagatgc gcagaaaagg aaatggtttt ggttgcacaa 2700 gatcgcgaat gacgacaatt ggaagtggag ccgatctgtg atcgaatggt tcccgacaag 2760 aaaaagaaga agaggtcgac caatgactag atggagcgat atcttccgca agactgttgg 2820 cccgaacttt ctcaacgaag caagaaaagc cagttggaat gccatgcaca tcagagctct 2880 cacctaaccc gacctcttga taaatgtgta ataat 2915 // ID Gypsy-3_DPu-LTR repbase; DNA; INV; 640 BP. XX AC scaffold_140; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_DPu_; KW Gypsy-3_DPu-LTR; Gypsy-3_DPu-I. XX NM Gypsy-3_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-640 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 722-722 (2010). XX DR Genome; scaffold_140; Positions 135290 135929. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 640 BP; 180 A; 178 C; 139 G; 143 T; 0 other; tgtaaggatc gggttaactc accgctctgg cagagagtag tacccgaacc cagccacggg 60 agggcacccg caaccagcaa acaagcggcg catctatagc accaacagcc ataagcgcgc 120 aattcaaaag acgtatgcgc aacgagtccg ttgagtaaat aggatccgct cacggacacc 180 gtgatgctga ctcacgggaa gtatccagtg ccgacaccgt gaggctgact cacgggaagt 240 atccagtgcc gacaccgtga ggctgactca cgggaagtat ccattgccga caccctgatg 300 tcgacacacg ggaggtatcc attcccaaca ccgtgatatc gacatcagcc agagtatcaa 360 tcccaacacg gcaataacag caataacgag atcaatctac gacgctgact tccgccttga 420 ccagtataaa agcccccttt tttcccttgt ataatcagtc ttccccgagc aggccttcac 480 gctggtaaac tctttctctc ttattattct aacattgaat aagtcgcatt cgtgtatctg 540 atgttaaaac ttgtgtattc aaatacaaaa cgtcatcctc ttacaagtcg ttgcgtctcc 600 atttttgtac aagtaaagta aggctgggac gagccttaca 640 // ID SGRP1 repbase; DNA; INV; 193 BP. XX AC X73982; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE S.gregaria repetitive element. XX KW Repetitive element; SGRP1. XX OS Schistocerca gregaria OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; OC Acridoidea; Acrididae; Cyrtacanthacridinae; Schistocerca. XX RN [1] RP 1-193 RA Dawes R., Dawson I., Falciani F., Tear G. and Akam M.; RT "Dax, a locust Hox gene related to fushi-tarazu but showing no RT pair-rule expression."; RL Development 120(6), 1561-1572 (1994). XX RN [2] RP 1-193 RA Akam M.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (08-JUL-1993). M. Akam, RL Wellcome/CRC Institute, Tennis Court Road, Cambridge CB2 1QR, UK. XX DR GenBank; X73982; Positions 288 480. XX SQ Sequence 193 BP; 51 A; 63 C; 43 G; 36 T; 0 other; attcttctgt aacaccacat ttcgaaagct tctattcccc tagacttaga actgcttaaa 60 cctaactaac acaaggacag cacacacatc catgcccggg gcaggcttcg aacctgcgac 120 cgtagcagca gcgcgggttc ggactgaagg gcctagaacc gctcgcccac agcggccggc 180 catcctggta aca 193 // ID Gypsy-36_DWil-LTR repbase; DNA; INV; 295 BP. XX AC scaffold_181150; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_DWil_; KW Gypsy-36_DWil-I; Gypsy-36_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-295 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181150; Positions 4248367 4248661. XX SQ Sequence 295 BP; 107 A; 51 C; 53 G; 84 T; 0 other; tgtagcagga gagacggtca cacttgctct cttgctactc ttcgatcccg gtattttaat 60 tgactctccg agagagagaa aagtttagcg ggtctgacgt gtgagtcagt cgttgctgag 120 cgttgaggag tgtaagacaa atatcgagga aacattaatt atatctaata aataataacc 180 ataataaata ataataaaaa caattaagaa ataaataaac tattattaat ctaaagaaac 240 taaaacccga atctcctaaa tattctggca ttggttaatt tatctgccgc ccaca 295 // ID hATm-44_HM repbase; DNA; INV; 2872 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-44_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2872 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1938-1938 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 779..2104 FT /product="hATm-44_HM_1p" FT /translation="MSYRYSRNSIESNRCIWIKNLPYSSKNKRNPKIPSVE FT KLTNLTRMPTNRQILGNFTFLFNSEKRRLDRIKIMITDIKKLWEEILDFPL FT LSSALIRKKLNSLIDLFNKNKKKPSDKFTATLSQLFDVTNVNGEWKNTEDK FT KLYQRQVESGGKIGYSMVKQAPLESIHPRKRIRLSKTQSQASSSQLHPTQT FT HKTVSDSSDSETSVFSPDSQTSSSTSSPECKPKYQRASTTAAGLLVLKNNI FT STQKAHDVLATIAKQGHDVPVPSQAGVWKRVISEGQKMKTKIAQKLKGKDP FT YCLQFDGKRVHGEEYQVVLLKNMTTEIKLGILKCESGSAAAIHKELHQLID FT EYNAWENICMIICDTTAVNTGRLHGIVKLIQDDVLSKGFQKPQYIGCQHHV FT LDLLLKHVMNFFIQEPTSKPELNYSFIDKLTENYVSLQDDYRLYAHR*" XX SQ Sequence 2872 BP; 984 A; 537 C; 456 G; 894 T; 1 other; ttagggtaat aactattttc attttttttt tgtaatgaaa ctgaaaccta tcacattcga 60 ggaacaatta tatgcagatg acagtagcaa cgttaatttt tttttaaatt gaaaaaagtt 120 cccccacctt tcaattttta ttttttctta tatcgctcaa tgaggtattt tcacgaataa 180 acaataccat gataatgatc aacgatcacg tgatggataa gagatgcttt tcgattactt 240 tgagcctatt tgtttacata actataacct ctcaatcact gatctgtata tattatatct 300 atgctctcaa aaaaaaacaa aaaacaaagc cattattccg ttattaatcg atcaattata 360 tttatccatc acgtgacgtt tttaaaaagt tttccggctc attaaaatac ctcatttgtt 420 taacaattta aatttcacac tgacttttta tgtaatgttc taggctgctt ttaagcacat 480 ttctgtcaac tgtctgtttc tatttgttta taagaaaaaa acatttacag atggtagtta 540 acttttttat ttttattgat cccagcttaa acagtggaaa ctcgctccaa catggggcaa 600 gtttagtaaa aataattatt ttattattga tatattttat tttctgtttc aggtgcaaac 660 ataaaatata tatattctgg agtattttat aacattttgg aatattgtag attaattacc 720 tcgttttcag tgcaagattg caagtcatcc agaaaagaca ttttgtcact tcaataaaat 780 gtcttataga tattccagaa atagtataga atcaaacagg tgtatctgga taaaaaatct 840 accctattct tccaaaaata aaagaaatcc aaagattcca tctgttgaaa aactaaccaa 900 tctcacgcgg atgccgacaa accgtcaaat tcttggaaac ttcacttttt tgttcaatag 960 cgagaaacgg cgactggatc gaatcaagat tatgattaca gatataaaaa agctttggga 1020 ggaaattcta gactttccac tgttatccag cgcactcata agaaagaagc tcaacagtct 1080 gattgattta tttaacaaaa acaaaaagaa accatctgat aaattcactg caacactttc 1140 tcagttattt gatgtcacca acgttaatgg agaatggaaa aacacggaag ataaaaagct 1200 ctaccagagg caagttgaga gtggaggaaa gattggttat tcaatggtca aacaagctcc 1260 tctggaatca atccacccaa gaaaacgaat ccgactatca aaaactcaat cccaagccag 1320 ctcttctcaa ttgcatccca cacaaacaca taaaactgtc tcagattctt cggattctga 1380 aacttctgtt ttctcacccg attcgcaaac ctcttcttct acgtcatctc ctgaatgcaa 1440 accgaaatac caaagagcca gcactacagc agctggttta ctagtattga aaaacaacat 1500 ttctacacaa aaagcacatg acgttctcgc aaccattgct aaacaaggcc acgatgtgcc 1560 ggttccaagt caggcgggtg tctggaaaag agtaatctct gaaggacaga aaatgaaaac 1620 caaaattgct caaaaactta agggcaaaga tccatattgt ctccagtttg atggtaaaag 1680 agtgcatggg gaagaatacc aagtagttct gctgaaaaat atgactaccg aaatcaaact 1740 cggtattctt aaatgtgaga gtggttctgc tgcggccatt cacaaggaac tacatcagct 1800 aattgatgaa tacaatgctt gggaaaatat ttgcatgata atctgtgaca cgactgcagt 1860 aaatactggt cgtctgcatg ggattgtaaa gctaatacaa gatgatgttt tgagcaaggg 1920 ctttcagaaa ccacaatata ttggctgtca gcaccatgtt ctcgacctcc ttctgaaaca 1980 tgtcatgaat ttttttattc aagagccaac gagcaaacca gaactcaatt attccttcat 2040 tgataaatta acagagaatt atgtttctct tcaggatgat taccgcctat atgcgcacag 2100 atgatgttaa cacaggcaaa aaccctgcat ggcgagatga tttcaagttc ttatatgaac 2160 tatgtcaagc gtatcgtcat tacaaaacaa cttcacgctg gcctcgaata aactggaaga 2220 aactcccaaa tgtacaccaa gcaagatgga actcacgtgc aatatatgct gttatagcat 2280 tattcctgat gccagagtgg cgcaccaccc tcgttccagc atgcgacttc atctgctacg 2340 aatgggcaaa tgcttggttc cattcgcaac actacaatga agcatccttc aatgaactat 2400 taacctcact catcaagata aaatgcaaga aagcaattaa gtgtttaaag acacattggt 2460 caatcgagga gtcattcatt gatattccca gatctaacca aattgctgaa cgtgcggtaa 2520 aactgatgga ggaacttcat gaagaatcaa gaaaatttga gtttttgaat ttgaagttcc 2580 ttggaaaaaa taacttttaa acatttcatg tatattttgt acatattact aatgttctcc 2640 atcttatttg ttcataaaag ttgcttagat aagctattac ttttattatt ttggataaar 2700 ctaaaaaaca tgtccttttt tgaaaaaaaa cattttttca tttttctttt cctgtgtgag 2760 tgggggaact tttttaaaaa acatcaaaaa tgcaatttat attcttgaat tacacataaa 2820 gtttcctcga ttgcaagtgg tttcataaaa tttttttttg ttataaccct aa 2872 // ID Gypsy-23_DPu-LTR repbase; DNA; INV; 376 BP. XX AC scaffold_35; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_DP_; KW Gypsy-23_DPu-I; Gypsy-23_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-376 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_35; Positions 686152 685777. XX SQ Sequence 376 BP; 80 A; 83 C; 100 G; 113 T; 0 other; tgttgcagac ggccccgtcc gtctccttac tccatctatg agcccctttc tgtattgtat 60 gggtgccctg agggcccttg tgattgcgat tgtgaccgcc attgggcgcg actgtcatgt 120 gacggccggc acacgaaaga gtaggcagat ctagtgtagc agttgagacg agtgtgtgtc 180 tctgtaagcc ttactgtcgt tagcaagtta gagagagagc tttggctgaa tacctggctc 240 cctcgcgtaa aacgtgttga ttgtgagtaa tatatttgtg taagcaccct agtttattta 300 tcgtgtattt attcattcgt gtaacgtaga caatctgatt agtcgaatcc gctcgtcggc 360 tgagcagatt gcaaca 376 // ID Kiri-13_AAe repbase; DNA; INV; 4404 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-13_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4404 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 708-708 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 274..969 FT /product="Kiri-13_AAe_1p" FT /translation="MNSHHSRQMNHKTTTTINCAERDSXGKIGIRNCEQFL FT LQMKLLFXEXDRRIDAALASYMHTVRDKPSEQSSDDVANIMIESPPDSVSM FT RIDAEEGTVNNDLSNDASDSILHLSTIQQSVGFCRESSTNPSVSTAPRTWY FT NCCYYRRRGARKSLTASSFLFRGHLLGTCSRRRLACRKFYLSHRRSGLQAV FT FKDAERFVGDRISIHNVLAIHPSKNIHLPCSIYPMNPSQQYP" FT CDS 1400..4249 FT /product="Kiri-13_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPVESVHGNARTNEGFSHIPRAVMHSALSSNNINICH FT VNIQSLCARQMSKFEEFKMCFVNSKIDLMCLTETWLSDEVSSELVAVDGYK FT LYRNDRGYSRGGGIGIYCKNDLRCNILSASELFVGMGDIDRTEYLFAEICY FT NADKFLLGVIYNPPRGDCAEKLEQKLFELSLQFQNIVLIGDFNTDFSKPST FT KRSRMQAVFDNFGFQCVSREPTHFYSGGCSLIDLLLTNNDDFVFNFNQVSV FT SGFSKHDMIFAALNIARRESIVPSFFRDYKRIDNVSLQNALNNINWDTLYS FT IDDPDIALNLLNAHLLHLFDSFVPVRLVKKSNNKKWFNNDIMHAMIERDLA FT YRSWVANKNEQNHCQYKRLRNRVTHLINVAKVNYVSSTLSGNSSSKQLWSK FT LKNLNITKPTESDPQYSNSKEEINNFFASNFSNDSSHVPLLSSNSNGFSFC FT LTNEHEVINAIGSIKSNALGLDGIPIKFVKYILPLIVTQITYIFNLIIKTS FT KYPRTWKIVKIIPISKKAGKTDLNNLRPISILCALSKAFEKILKYQMRSFL FT DNFQLLSPYQSGFRSSHNTTSALLNVHDDIHSHIDKKGVAFLMLLDFSKAF FT DRVSHRKLLLKLSRKFFFSNSALSLIKSYLVDRQQSVCIGGQNSSLINIVS FT GVPQGSVLGPILFTMFINDLPSVLKHCKIHLFADDVQVYFFSSDLSLQEMA FT SLINSDLSNIFGWSERNLLPLNASKSRAMFISRSRHPPDLPTIKLGNEELE FT YVNKFSNLGIILQNDLEWEGHVNSQCSKIYNGLRTLRITANMLPCDVKLKL FT FKSLLLPHFSYGCELLLNASARAIDRLRVALNCCIRWIFNLSRYSRVTQYH FT NQLLGCSFYNFFKLRSCLVLYKIINDEKPPYLFQKLQPFHGTRHRNFRLIH FT YNTSHYGGTFFVRGIVHWNQLPPAIKSLRNLQSFRREVTAWFATRN" XX SQ Sequence 4404 BP; 1324 A; 820 C; 811 G; 1443 T; 6 other; tactcgggat gtactcttat agtgagtgaa atttgtggtc ctagttcagt tgaggttttc 60 gacctgtttt actacttttt tcgtcgctaa tcatcgctta tgaaacagac ccagtactga 120 ggcgagcggt ctatataaag tgcaccttkt tgatcttaaa ttgaccggtg ttttcgagga 180 atttcgttgg tgcagtccaa gtaatcccga gtggcattgg cagccatcat tgagttggaa 240 ttacaacagt ttgttcatca acctgtcact acgatgaatt cacaccactc acggcagatg 300 aaccacaaaa caacaacaac aatcaactgt gccgaacgtg attccwttgg taaaattggg 360 atcagaaatt gtgaacaatt tctactgcaa atgaagctct tgttcgwcga aamcgatcgw 420 aggattgatg ccgctcttgc atcctacatg catacagtgc gtgacaagcc aagcgagcaa 480 tcaagtgatg atgttgcaaa cataatgata gagtctcctc ctgattccgt ctctatgagg 540 attgatgctg aagaaggtac tgtcaacaac gatctctcaa atgatgcctc cgatagtatt 600 ctccacttgt ccacgattca acaaagcgtc ggcttctgca gagaatcttc tacgaatccc 660 agtgtttcta cggctcctag aacttggtac aactgctgtt attaccgacg tcgtggtgca 720 agaaaaagct taactgcgtc atcgttttta ttccgaggac atctgctggg tacctgtagt 780 cgaagaagac ttgcttgtcg aaagttctat ctttctcatc gaagatccgg gttgcaagct 840 gtctttaagg atgctgagag atttgtagga gaccgtattt caattcataa tgtgctcgca 900 atacacccat ccaaaaacat tcatcttcct tgctcgattt atcctatgaa tccatcccag 960 caatatccat gaatattccc ctcctgaaag ttataccttt ccaacatagt tattgattaa 1020 tcctactctc gaaattccgt gcttccaatc ctaatgctat ccatgttacc ttccatccta 1080 aaagttatgg gggggtaagg gaatagtatg acaactcaat tgctgttggc gctgctggtg 1140 ttgttgctaa tgatgctgtt atgttgcttg ttcctgctgc tgttgctgat ctgctcttcc 1200 tgctgttgtt gctgatatgc tgttgcgatg atcttattca tgattactac ctaattgaac 1260 tctaaatcaa ctgaactacg aattwttgtt catgaatttt ctagaagcat taaggaaatg 1320 tattgtaaaa ctttagtctg tagttctaat tccttgttca ttgtgcctgt cgtgatttct 1380 atgcggttgt tttccactaa tgccggttga atcggtacat ggcaatgctc gtacaaatga 1440 gggtttctcg catattcctc gtgcggtgat gcactctgct ctctcttcca ataatatcaa 1500 tatttgtcac gttaacattc aaagcttatg cgctcggcaa atgagcaaat ttgaagaatt 1560 caaaatgtgt ttcgttaata gtaagattga tttgatgtgc ttgacagaaa cgtggttatc 1620 tgatgaagtt tcaagtgagt tggttgctgt tgatggctat aagctgtaca gaaatgatcg 1680 tgggtatagc agaggtggcg gaatcggtat ttattgcaaa aatgatttga ggtgcaatat 1740 tttatctgct tctgagttgt ttgttggtat gggcgatatt gatagaacag agtatttgtt 1800 tgcagagata tgctacaatg ctgacaagtt tcttttagga gtaatctaca accctcccag 1860 aggggattgt gcagaaaaac tggaacaaaa attattcgag ctatctctac aattccaaaa 1920 catagtgtta attggagatt ttaatactga tttcagcaaa ccaagtacaa aacgttctcg 1980 aatgcaagcc gtgttcgata attttggatt ccagtgtgtt agtagagaac caactcactt 2040 ctactcaggc ggttgctccc tgatagattt gttattaacc aataatgatg attttgtatt 2100 caattttaac caagtgagtg tttcaggttt ctcaaagcac gatatgattt ttgccgcctt 2160 gaatattgct cgccgtgaaa gcatcgttcc ttcgtttttt agagattata aacgcataga 2220 caatgtttcg ttgcagaatg ctctgaacaa tataaactgg gatacacttt actcaattga 2280 tgaccctgat attgcactaa atcttctaaa tgctcactta cttcacttgt ttgattcatt 2340 tgttccggtt cgtttagtga aaaaatcaaa caacaaaaaa tggtttaata atgacattat 2400 gcatgccatg attgaaagag atttggctta cagatcgtgg gtagctaata aaaatgaaca 2460 aaatcattgc caatataagc ggctacggaa ccgtgtaaca catctcataa atgttgctaa 2520 ggtaaattat gtctcgagta ctttaagtgg gaatagttcc agtaaacagt tatggtccaa 2580 attaaaaaat ttaaacatta ccaaaccaac tgagtctgat ccgcagtatt caaattctaa 2640 ggaagagata aataactttt ttgctagcaa tttttcgaat gattcatctc atgtcccatt 2700 gctttcttcg aactcaaatg gttttagttt ctgcttaaca aacgaacatg aggtaataaa 2760 tgcgatagga tcgattaaat ccaatgcatt ggggttggat ggaatcccaa tcaaatttgt 2820 caaatatata ctacctttaa tagttaccca gattacgtat atctttaatc ttataattaa 2880 aacctctaag tatccacgta cctggaaaat tgttaaaatt attccaatct caaaaaaagc 2940 aggaaaaaca gatttaaata atttgcggcc cattagcatt ttatgtgcat tgtctaaagc 3000 attcgagaaa attctaaagt accaaatgag atcatttttg gacaactttc aactattaag 3060 tccatatcaa tcaggttttc gttcaagtca taatacaact agcgctctat tgaatgtcca 3120 tgatgatata cactctcata tagacaaaaa gggtgttgca tttctgatgt tgcttgattt 3180 ttccaaggct tttgatcgag tgtctcatag aaaactattg ttgaaattgt ctagaaagtt 3240 cttcttttca aatagcgctt taagtttgat taagtcctat ttagttgatc gtcagcaaag 3300 tgtgtgtatc ggaggacaaa actcaagttt gatcaacatt gtctctggag ttccgcaagg 3360 ttcggttttg gggccaattc tgtttacgat gtttattaat gatcttccat cagttttgaa 3420 gcattgcaaa atacatttat ttgcagatga tgttcaggtt tattttttct catccgactt 3480 atctttgcaa gaaatggcaa gtctaatcaa ttcggattta tctaatattt ttggctggtc 3540 agaacgcaat ctattgcctc taaatgcctc aaaatcgcgt gctatgttca tttcccgatc 3600 acgccatcct cctgatttac ctacgattaa acttgggaat gaagaactag aatacgtaaa 3660 taagttttca aatttaggta taatcctaca gaatgattta gaatgggaag gtcatgtcaa 3720 ttcgcaatgc agtaagattt ataacggact tcgcactctt agaatcactg ctaatatgtt 3780 accctgtgat gtcaaactta agctattcaa atctctacta ctgccacatt tttcttatgg 3840 atgtgaactt cttttaaatg cttcagctag ggcaattgac agattgcgag tagctttaaa 3900 ttgttgcata cgatggattt ttaacctttc aagatactcc agggtaacac agtaccacaa 3960 ccaacttctt ggatgctcat tttataactt ttttaaatta cgctcctgtc ttgtgctgta 4020 caaaataata aatgatgaaa aaccacccta tttgtttcaa aaactacaac cttttcatgg 4080 tacgcgtcat agaaatttca gattaattca ctacaacact tcgcactatg gtggaacttt 4140 ctttgtacgt ggcattgtcc attggaacca acttccaccc gcaattaaat ctctaagaaa 4200 tttacaatct ttccgtagag aggttacggc atggtttgca acaaggaatt agaaattaaa 4260 cgagtaaaga ttagttaagt aagaattacc aaaatatcat gaacttgatt atcaatctaa 4320 ttgtagaatt aaaaaaggag aattccttac tctacatgta taacagcata aataaataaa 4380 taaataaata aataaataaa taaa 4404 // ID Copia-1_DWil-LTR repbase; DNA; INV; 203 BP. XX AC scaffold_180634; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DWil_; KW Copia-1_DWil-I; Copia-1_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-203 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180634; Positions 6921 6719. XX SQ Sequence 203 BP; 54 A; 41 C; 31 G; 77 T; 0 other; tgttgaaaat accatatgtt tctgtttgca gccctgtatt tatccctggt tatatcgatt 60 gtactaagct tgtaggttat cgatataaga tgctttgtaa ctctgtatat tttggcttct 120 tcttcctttt ccgcccttga aagtgtacgg acacttaata aagataaact tataatttaa 180 cgcaatccac tcgatcctca aca 203 // ID TRAS1 repbase; DNA; INV; 7850 BP. XX AC D38414; XX DT 04-JUN-2009 (Rel. 14.06, Created) DT 01-JUL-2010 (Rel. 15.08, Last updated, Version 3) XX DE Complete sequence of retrotransposon TRAS1. XX KW R1; Non-LTR Retrotransposon; Transposable Element; TRAS1. XX NM TRAS1. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-7850 RA Okazaki S., Ishikawa H. and Fujiwara H.; RT "Structural analysis of TRAS1, a novel family of telomeric RT repeat-associated retrotransposons in the silkworm, Bombyx RT mori."; RL Mol. Cell. Biol 15(8), 4545-4552 (1995). XX DR EMBL/GenBank/DDBJ; D38414; Positions 1 7850. XX FH Key Location/Qualifiers FT CDS 2434..3813 FT /product="TRAS1_1p" FT /translation="MYANIRTPVTKNLNSPPKNADHAGSSNTDDPNITRSV FT RRSVGEWESAKADQAIKLKTPTSPTKKPSEAPQKQKIQGFPSKTAEARACL FT SKAKLNLENSRNLKTDIKMAVLQAVERLYELVKEAEGKVTRGQKIEEKGTH FT KEKEAETKEKDKGQDETLNALKKKLEENCSLLRENSEKMEELKQMFRSQKA FT TYADVVVSQPGRQPPKCTTLHSIMVSSKDENETGDGILTELRKTASEDEGW FT VRVERVRKIKDRKIIMSYRTEEERTKATQRLKKSEGELVVEEIKNKDPLLI FT LYNVLKMHSDEDLQKALRSKNKDLFRNLNKEDDRIEVKYKKSARNPHTHHV FT VLKVSPTIWNRALSMGSLHIDIQPVRVADQTPLVQCTLCLGFGHSRKFCKE FT ALPSCSHCGGPHMRADCPDRLTGIEPTCCNCRKANMTTTAHNAFSRECPVM FT AKWDNIARRAVEYHC" FT CDS 3788..7462 FT /product="TRAS1_2p" FT /note="endonuclease, reverse transcriptase, and FT ribonuclease H." FT /translation="HGEQWNTTAKVRPKNGPPSPPYRVLQANLQRKKLATA FT ELAIEAATRKAAIALIQEPYVGGAKSMKGFRGVRVFQSTAQGDGTVKAAIA FT VFDHDLDVIQYPQLTTNNIVVVGIRTRAWEITLVSYYFEPDKPIESYLEQI FT KRVERKMGPKRLIFGGDANAKSTWWGCKEDDARGDQLMGTLGELGLHILNE FT GDVPTFDTIRGGKRYQSRVDVTFCTEDMLDLIDGWRVDEDLVSSDHNGMVF FT NIRLQKSKSIKIERTTRIFNTKKANWSLFHEKMAQLLLDNNMTTLIDTIDN FT KTKVESAINTYTNIITKTCEQSIPKKTSREILTIPWWSEKLAEMRKETNTM FT RRRIRNAAQDRRQHVVDEYLKQKEKYESEVREAQAGSWKEFCGKQDREGVW FT EGIYRVMSRARTREEDCPLTDVNGAPLDPEKSAKLLAETFYPEDREEEDND FT EHAEIRRRAKISEGVHDEYVPSFTVNELKHALKSFNPKKAPGADGLTSDIC FT THAVDLDQKLFLGLINKCLEHRYFPKIWKAATVVILRKSGKDSYTEPKSHR FT PIGLLPVLGKLYEKMLVARLKYHLLPRMSTRQFGFMPQRSTEDSLYTLMQH FT VKEKLKEKRIITIVSLDIEGAFDSAWWPALEVRLAEEKCPEYLRRVISSYL FT SDRRVSVRYAGAEYERATSKGCVQGSIGGPILWNLLLDPLIHQLQARGEYI FT QAFADDVVLVFDGDSALQIERQANTSLEHVQAWGVRNKLKFAPHKTCAMTI FT TRRLKYDTPRLNMGGTEIATYKELRILGLTIDDKLTFNTHVRNVCKKAIGM FT YKILARTARVGWGLSPEVIRVIYVAVVEPTVLYAAAVWHESVYKLGVQKQL FT NVIQRGFAQKLCRAYRTVSLNSALLMAGILPLDLRVREAASLFEAKKGVCQ FT SWLGDREIERMSSAMDAPHPAEQQSLEFGNLVDEEQYNNLNHLDVRIFTDG FT SKIEGRVGAALSIWDGEVEIRSLKLALAPYCTVYQAELLALSYAVKEAQLR FT NGSTFGVFSDSKAALLTVINHGSLHPLAVDIRKMLKQCALQNKTVALYWIK FT AHAGLEGNERADQLAKEAALLSKKSPNYDLCPVSYVKRIIRSGSLDEWNRR FT YRDSDRASVTKMFFPDAVAAYSTVRKMRITGHITQFTTGHGGFSEYLARFK FT CKGDPSCACEPGMPETVEHLLTSCPIFGKQRFELENKINKIVNKENLCKLI FT VEKYTKELFIVMLYK" XX SQ Sequence 7850 BP; 2622 A; 1686 C; 1836 G; 1704 T; 2 other; cgagttcccc ctcagctctc gtggcggtcg gatcgttttg cgtgcgctcc gcgtttaaaa 60 aaatcgaacg aacgggttat tttacgcaaa atataaagtg ttatatatca aaaggtgcgc 120 gttcgcgaat actatccgtg tgtgaaaaaa gtgcgaaaat cgaactactc tgtgagacac 180 caggtccaaa aaacggattc taacctgcaa attagatatt ccagacgcaa cgctgcctga 240 ttttcagcga caacttacgt ttcttgaact gcagttcgaa gcgcaccagg ggcagctaca 300 atcaggtgaa gtctcggatt tttccgggta aaactcaccg agccacaaag gaaaaaccga 360 aaaccacgtg gccacgtgct cctagttccc gacgtcttgg agaaatttca ccaactcgaa 420 atcgtgaact atgtcgagac acgtcatttg acacttcgaa cttcgaaatc ggagcatccc 480 cctggaagtt ggaagacttt gaaggcacta cgaacttaga tctcatccag agctgaaaca 540 agatctatga cgtcaccttt tctggacccc tagatattgc ccaccacttg gacaataatt 600 gtacaaaatt tcaggaattt ttgacaggta gttttcgagt tacagaggac ataaaaaacc 660 tgtgacgtca cgcagagccg aaagtaccta tgacatcctg ctcccggcag caccctaaag 720 aaaacctcct acaaggaagg aaactcctcg ggaccaaccc cctgagacgc gaagtttctg 780 cactagaagc tatttgcaga aaattgtcag ccacctccag cttcggcagc ttcctgaggg 840 tggtgtggac tacggaggac ctccggacag gcgggacatc agggaccggt tcacctcctg 900 tggaacaaaa tttttagatt gtaagtagga ctaaggatag attttatata gaatttaagg 960 tggaacaagg gtaccggccg gcattaccgc ttaagtttgt acaccccctg gagcctcagg 1020 ggacgagata tcgcaaaaat aacaaaaaat cggttgtctt ggggacaacg atcatagggt 1080 cgactaacat attggcaccc caaaaaacca cccccaacga gattttttgg aaaaacggga 1140 aaaatttcgg ttttcccgat ttccaaagca gaaaacaagg gacttgacgg taaatttggg 1200 ccgcgaccaa taccccagta tttttccccg tctcgatatc ccgttcctaa ggccaaaacc 1260 aggcaaacga agctggcaaa aaactacccc taagtttgag ggcgaataaa aaatcgaccg 1320 agtgacctat ccctttgtga tctcgggacc ggacggccca aattaaaata ctgcaccccg 1380 gtacttttcc cggcgaaatc gaaaaattgg aactaaatta acaggctgtt taaaatactt 1440 attcaagcag ttcgacggca gagggataaa gttgtttagg tatttaaaaa ttccaaaaat 1500 acctcatatc tcgtaaaccg tgcgtcctag cccaaaaata acgaaaaatt tctanncccc 1560 tccctaaccc accccccctg ctccgcaaac cctaaccctt aaacctcccc tccccccgca 1620 cccaacccct agggtcccaa ccccctaccc ctaaaattaa cttatgacca ttattacttg 1680 gactctgaaa aatttttagt agcaaatttt ataattataa aatattaaaa aattattgac 1740 tattattttg gcatactcat aaataggtag tataaggaca ctttaggggt agcaaccccc 1800 aaattaaata aactgcctta tctcgcgaac gggaagagct acagtaacgg ggttttttgc 1860 atgcaatagg gcttggtctc gagcagtttt ttttggacag aggtggagac tctacgtaga 1920 gtcgaaactt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaatttt acattatacg 1980 atcgaaatcg aattgattga ccgtgtatct tttaagttgt aaaaatacct acctagttac 2040 attctcatac tgactctagg tagagaaaaa ttttgtgcga tccaataaca aaattttttg 2100 catacattcc tacacacaca cacacacacg cactcacaca catatacaca tacacacaca 2160 tacacacata cacacacaca cacacacaca cacaagtata taaataaaaa aaccttctta 2220 ccgacactcg acgagtctag aactcgtata gcctgacgga atctgtaaca aaaaacaaaa 2280 aacagagata taatttttta tttattttat tttgcttatt tgtaccgtcc cttatataac 2340 ttatacaaac cttttgtgac gtcctgttta tataattata ttacgtgaca ttgacgcgtc 2400 ctcactgcaa ggaaaataaa aattaaatta acgatgtacg caaatattag aaccccagtg 2460 accaaaaacc taaactctcc tccgaaaaac gcagatcatg ctggatcatc caatacagat 2520 gacccaaata ttacaagaag cgtaagacga agtgtgggag aatgggaatc agccaaggcc 2580 gaccaagcca taaaattgaa aacaccaaca tctcccacaa aaaagccatc tgaagccccc 2640 caaaaacaaa aaatacaagg ctttccaagc aagactgcgg aagctagagc ctgtcttagt 2700 aaggccaaat taaatcttga aaattcgaga aatttgaaga ctgatataaa aatggcagtc 2760 ttacaagcag tagaaaggct ttatgagctt gtaaaagagg cggaaggaaa agttacaagg 2820 gggcagaaaa tagaggaaaa gggaacgcat aaagagaaag aagcagagac caaggaaaag 2880 gataaaggtc aggatgagac cttaaatgca ttaaagaaaa aactggaaga aaattgcagt 2940 ctcctaagag aaaactccga gaagatggaa gagctaaaac aaatgttcag gagtcaaaaa 3000 gcgacttatg cggatgtggt ggtatcacag ccggggaggc aacccccaaa atgcaccaca 3060 ttacattcaa taatggtctc atctaaagac gaaaacgaaa caggcgatgg tattctcacg 3120 gaactccgga aaacagccag tgaagatgaa gggtgggtga gagtggaaag agtgagaaag 3180 attaaagaca ggaaaattat aatgagttat agaacagaag aagagagaac aaaggcaaca 3240 cagagattaa aaaagtcaga gggggaactc gtcgttgaag agataaaaaa taaggatccc 3300 ttattgatac tctataatgt tttaaagatg cattccgacg aggacctcca aaaagcctta 3360 agaagcaaaa ataaagacct cttccgtaac ctcaacaaag aggatgacag gatagaggta 3420 aaatataaaa agagtgcccg taaccctcac acccaccacg tagtgctaaa agtgtcccca 3480 accatctgga accgagccct gagcatgggg tcgctgcaca ttgatataca accagttaga 3540 gtggcggacc aaacgccact ggtgcagtgt accctttgtc tggggttcgg acatagcagg 3600 aaattctgta aagaagcttt accgagctgc agccattgcg gcggaccaca tatgagggcc 3660 gactgtcccg acagactgac aggcatagag cctacctgct gtaactgccg gaaggcaaat 3720 atgacaacca cagcacacaa cgcttttagt agggagtgcc ccgtaatggc aaaatgggac 3780 aatatagcac ggcgagcagt ggaataccac tgctaaagtt agaccaaaga atggaccccc 3840 atcacccccc tacagagttt tgcaagcaaa cctccaaagg aaaaaattag caaccgctga 3900 gttggccatt gaagccgcta ctcggaaagc tgcaatagcc ttaattcaag agccatacgt 3960 gggcggggca aagagtatga aaggattccg gggcgtaagg gtcttccaaa gcactgcaca 4020 aggagatggg actgtcaaag ctgcgatagc tgtctttgat cacgacttgg acgtgataca 4080 gtacccgcaa ctcaccacca ataacatcgt ggtggtgggg atccggacca gggcctggga 4140 gatcacgctg gtgtcctatt acttcgagcc agacaagccc atagagtctt atcttgaaca 4200 gatcaaaagg gtagagagaa aaatgggacc caaaaggcta atctttggag gtgacgcgaa 4260 tgccaagagt acctggtggg ggtgcaagga agatgatgca cgaggagatc aattgatggg 4320 gactctcgga gagttgggcc tacatattct aaacgaggga gatgtcccga catttgatac 4380 gatcagagga ggtaagaggt accaaagccg cgtggatgtg acgttctgta ccgaagacat 4440 gctggacctg atagatggat ggcgagtcga tgaagaccta gtgagctcgg atcacaacgg 4500 tatggtattc aatattagac tccaaaaatc aaaaagcatc aaaattgaaa gaactactag 4560 gatatttaac actaaaaaag ccaattggag cctatttcat gagaagatgg cccaattact 4620 gctagataac aatatgacaa ctttgataga tacaatagac aataaaacaa aagtagaatc 4680 tgcaatcaat acatatacaa acataataac aaaaacttgt gaacaatcaa ttccaaaaaa 4740 gacaagcaga gaaatactta ccataccgtg gtggtcggag aagctcgcag agatgaggaa 4800 ggagacgaac acgatgagac gccgcatcag gaacgcagcg caggatcgaa gacaacacgt 4860 ggttgatgag tacctgaaac aaaaagaaaa atatgagtcg gaagttagag aggcccaagc 4920 cgggagctgg aaggagttct gtgggaagca ggatagggaa ggggtctggg agggaatata 4980 tagggtgatg agtagagctc ggacaaggga ggaagattgc cctttaacag atgtgaatgg 5040 agccccgctg gaccctgaaa agtcagcaaa gctcctggca gagaccttct acccggagga 5100 ccgggaagag gaggataatg atgagcatgc cgaaatcaga aggagggcta aaataagcga 5160 aggtgtgcat gatgagtatg taccctcgtt tactgttaat gaactcaagc atgctctaaa 5220 aagttttaac ccaaagaagg caccaggagc agacggactc acgtccgaca tatgcactca 5280 tgcagtggac ctggaccaaa aactctttct cggcctaatc aataagtgct tagagcatag 5340 atactttccg aaaatctgga aagcggcaac agtggtgata ctgaggaagt cgggaaagga 5400 ctcgtacact gaaccgaagt cgcacaggcc gattggtctt ttgcctgtac tgggaaagct 5460 gtacgagaag atgctggtgg cccgactcaa gtaccatcta ttaccgagga tgagtactcg 5520 ccaattcggc ttcatgcccc aaaggagcac cgaggactcc ctttatacac taatgcagca 5580 tgtaaaagag aaactaaagg aaaaacgcat tattaccata gtgtcattgg atatagaggg 5640 agccttcgat agcgcgtggt ggccggcttt ggaggtccga ttggctgagg aaaagtgtcc 5700 ggagtacttg agacgggtca tcagcagcta tctcagcgat agaagagtct cagtgcgcta 5760 cgcaggcgct gaatatgaaa gggctaccag caagggttgc gtgcagggat ccatcggagg 5820 cccaattttg tggaacttgc tcctcgaccc attaattcac caacttcaag ccagaggtga 5880 atacatacaa gcatttgctg atgatgtggt cctggtcttc gatggagact ctgcactgca 5940 aatagaaaga caggcaaaca cctccctcga acatgttcaa gcgtggggcg tccgaaataa 6000 gctcaaattc gcgccgcaca aaacttgtgc gatgacaata acgcggaggc taaagtacga 6060 cacaccacgc ttgaacatgg gcggaacgga gatcgccacc tataaagagc tccggatctt 6120 ggggctcact atcgacgata agctcacatt caacacacac gttaggaatg tatgtaaaaa 6180 agctatcgga atgtataaaa tcctcgcgcg tacagccaga gttggttggg ggttgagtcc 6240 tgaggttatc agggtgatat atgtggcggt agtagagcct acagtgctgt acgcggctgc 6300 agtgtggcac gaatcggtat ataagctggg agtccaaaaa cagctaaacg ttattcaaag 6360 gggcttcgct caaaaacttt gcagggcata tcgcacggtc tccctgaact cggcactgtt 6420 aatggctggg atactccctt tggatcttcg agttcgagag gcagcctctc tatttgaggc 6480 caagaagggg gtatgccagt catggcttgg ggacagagag atcgagcgaa tgtcatcagc 6540 gatggatgcc cctcatcccg cagagcaaca gagtctggag tttggcaatc tggtagacga 6600 agaacaatac aacaatctca accaccttga cgtgagaatt ttcacggacg ggagcaagat 6660 tgagggccga gtcggggccg ctttatccat ttgggatggt gaggtcgaga taaggtccct 6720 caagcttgct ctcgcaccct actgtaccgt ctatcaggcc gaactcctgg ctctgagcta 6780 tgcggttaag gaagcccagc tccgaaacgg gtctactttt ggggtcttta gcgacagcaa 6840 agcggcactc ctcacagtca taaatcacgg ttccctccat ccactagccg tagacataag 6900 aaaaatgtta aaacaatgcg cactgcaaaa taagaccgtt gccttatact ggattaaggc 6960 acacgcggga ttggagggaa acgagcgagc agatcaactc gctaaggaag ccgctctgct 7020 gtcgaaaaag agcccaaatt acgacctgtg ccccgtttca tacgtcaagc gaatcatccg 7080 gagtggttcg cttgacgaat ggaacaggcg atatcgggat agcgacagag catcagtcac 7140 taaaatgttt tttccggatg cagtagcggc gtacagcact gtaagaaaga tgaggataac 7200 tgggcacatt acccagttca ctacggggca cggagggttc tccgaatact tggctaggtt 7260 caagtgtaag ggggacccct cgtgtgcctg tgaacctggc atgccggaaa cggttgaaca 7320 cctgctgacc agttgtccca tttttgggaa gcagaggttt gagcttgaaa ataaaataaa 7380 caaaatcgta aataaggaaa atttgtgtaa attgattgtt gaaaagtata ctaaggaatt 7440 gtttatagtt atgctgtaca aatagtaaaa gtagtgaata agaaaaacaa agaatgaaat 7500 tatgttatca caatatcgtg agtatattgt agagtagcat taatggaaga agttaaaata 7560 aatagtagtt aagtatagcg taagatatag tcagtaagaa aaattgtaag taaattatta 7620 tgtagattaa gaaattgtaa taaaagtaaa ctgcccataa catcctagac cccaagttaa 7680 gattagaatt aaagtatgaa tgagaagaac gaagtacgag agggataaat gatagaactg 7740 gaataataga ggacggaatg cctgagaaga ccctagacta agcaacaaaa atgaaagacc 7800 ttatgtccag ccttcaaagc tttgcaccct tgcagagaaa agagtgactt 7850 // ID hAT-9_SM repbase; DNA; INV; 3270 BP. XX AC . XX DT 21-OCT-2007 (Rel. 12.1, Created) DT 21-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-9_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3270 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1038-1038 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 218..2770 FT /product="hAT-9_SM_1p" FT /translation="MAEQMSYELTEPCTTSSEGKSIADHSRSSELPTTHPL FT SVSVPSDCKNKPPTRRATEVYGIGPAICSDDATITGIRLPTCMQVLRCMMY FT HCNVASHSQRPGSTGAQSRFTTAKTVLKQVTKFYEKANIPMVSERRACEKI FT VKLLDDNNKLRSIDKTRRDTPATQCKLEAMQTLLASTFQLWPPNVASLVRN FT AEDLAFLESMKGDRTASFGAFDKSLALKISRRHLRDAATSERLKRSRKDIE FT ASTSTVSSQALNDDSSGSGGSTEEEDVTEQALTSEDDLEAKAASQERTQSQ FT RRKPTGTMAFIPPDVLSRPNLVSLATRLKMTPTQQAAFTHGLITESGGDVT FT RVATSYATADRSRRRVVGEIATDIHTNWEPPKLCTLHWDGKQTPTLQNQHV FT MEDRMTVVVGDASQLKLLGVPSYYKASDKSCGEIIAELTMKLMTEWHCADR FT IVNMTFDTTSSNTGHLTAACIAIQDKLQRAVLWSGCRHHIGEVVLSHVFED FT LKIEASRSPDVTLFTRLRTNWDLVPHDSSQIRPFCPADHNTEAQQLLAEMK FT DEVVAVATGKIEFLRDDYHEFTELALLYLSAKAGVVTLQRPGALHKARWMA FT KLIYSIKIALCENQIGELPPGTITTRHQVSKIRVFVTFVTHVYIVWWLTCK FT KTADAPWNDLQLYKHLLEYEAIDKLISQSAVRAFQRHLWYLTAEMVPLALF FT SDCTPRSEREALAGALLKIKPSIELQAPLNRFGNGWGKPKFPTSINASTRL FT CDLVGEDSWFTIYRLQLDSSFLELPVSEWNNSEAYMNSAKNVAAVNVVNDC FT AERGVKLSSDFVDTARSDEHFQNVLQVVEKDRQEHPDLRRQIKFKNEKL" XX SQ Sequence 3270 BP; 996 A; 714 C; 714 G; 846 T; 0 other; gggtgcatca ccaaaaaaaa atttcgttta ttgagctgtc tgctatttag ctttgttcca 60 ctgaccaaaa taagcaatag tatgcaaaag aaacaatgtt gcataatatt tagaggtccc 120 cctggcccat gagattttct gttattgtat ttaccttata ttattattaa ccttaacaat 180 ttatataatc agttttgcat attagctgca gaagaacatg gcggaacaaa tgtcatatga 240 gctaacagaa ccatgtacaa catcatcaga aggcaagtct attgctgatc acagcagatc 300 atcagaactg ccgacgacgc atcctctctc tgtgtcggtg ccttcagact gcaagaataa 360 accgccgact cgtcgggcta ccgaagtata tggtataggt cctgcaatat gtagtgatga 420 tgcaacaatt actggtattc gcctaccaac atgcatgcaa gttctacgtt gtatgatgta 480 ccactgcaat gtagcatcac acagtcaacg cccaggatct acaggtgctc aatctcgatt 540 tacaacggca aagacggtgt tgaaacaagt taccaagttc tatgagaaag ccaatatccc 600 aatggtgtca gaacgcagag cctgtgagaa aatagttaaa ctccttgatg ataacaacaa 660 acttcgttca atcgacaaaa ctcgccgtga cactccggca acacagtgca aactcgaagc 720 aatgcagacc ttgttagcct caacattcca gctgtggcca ccaaacgtag cgagtttggt 780 gcgaaatgcc gaggatctgg cattcctcga gtcaatgaaa ggtgacagaa ctgccagctt 840 tggtgccttt gacaaatcgc ttgccctgaa aatatccaga cgccatttac gagatgccgc 900 aacatcagag cgactaaagc gttcgcgcaa agatatcgaa gcatcaacat caactgtgtc 960 ttctcaagcc ttgaatgatg acagcagtgg cagtggtggc agtactgaag aagaggatgt 1020 gacagagcag gccctgactt ctgaagatga tttagaagct aaagctgcct cacaagagag 1080 aacacagtca cagcgaagaa agccgactgg tacaatggcc ttcataccac ctgatgtgct 1140 aagccgacca aatctggtat cactggcaac aaggcttaaa atgacaccaa cacagcaagc 1200 tgctttcaca catggactga ttacagagtc cggtggtgat gtaactagag ttgcaacctc 1260 atatgcaacg gctgatcgct ctcgtcgtag ggttgtaggt gaaatagcca cggacattca 1320 caccaactgg gaaccgccca aactctgcac gctgcattgg gatggaaagc agacgccgac 1380 tctacaaaat cagcatgtga tggaagaccg catgactgta gttgttggcg atgcttcaca 1440 gctgaagctg ttgggagtgc ccagttatta caaagcctca gataagtcat gcggagaaat 1500 cattgctgag ttaacaatga agctgatgac cgagtggcat tgtgcggacc gaatcgtcaa 1560 tatgacattt gacacaacca gctcgaatac gggccatttg actgctgctt gcattgccat 1620 acaggacaaa ttgcaacgtg cagtcctgtg gtcaggctgt cgccatcaca ttggtgaagt 1680 ggtcctctct catgtttttg aagatttgaa gattgaagca tcaaggtcac cagatgtgac 1740 attgttcacc aggttgcgaa ctaactggga cttggtgcca catgattcca gtcaaatccg 1800 accattttgc ccagctgatc ataacactga agcacaacaa ctgttagcag aaatgaagga 1860 tgaggtggtt gctgttgcaa ctggaaagat cgaatttttg cgagatgatt atcacgagtt 1920 cactgagctt gcacttcttt atctgtctgc caaggctggt gtggtgacac tccaaagacc 1980 aggagcactg cacaaggcac gctggatggc caagttgatc tatagtatca agatagcact 2040 ttgtgagaat cagattggcg aactgccacc aggcacaatt acgacacgtc accaagtgag 2100 caagatacgt gtatttgtca catttgtcac tcacgtgtat attgtatggt ggttgacctg 2160 taagaaaact gcagatgctc catggaatga cctccagctg tacaaacacc ttctggaata 2220 tgaagccata gacaagttaa tctcacaatc agccgttcgg gcattccagc gacatctgtg 2280 gtatttaacg gctgagatgg tcccactagc gctgttcagc gattgcacgc cacgatctga 2340 acgagaagcg ctggctggtg ccttgctaaa gataaaacca tcaattgaac ttcaagcgcc 2400 attgaatcgg tttggtaacg gttgggggaa accaaaattc cccacatcca tcaatgcgtc 2460 tacgcgcctg tgtgatttgg tcggagaaga ttcatggttt acaatatacc gcttgcagtt 2520 ggattcaagc tttctggaac ttcctgtcag tgaatggaat aattcggaag catacatgaa 2580 cagtgctaaa aatgtggcag cagttaatgt cgtcaacgat tgtgcagaaa gaggcgtcaa 2640 gctatcatca gattttgttg acactgctcg ttcggatgaa catttccaga atgtacttca 2700 ggtcgttgag aaagaccgtc aagagcatcc agatcttcgt cgacagataa aattcaaaaa 2760 tgaaaaactg taattatatt gacaatcgta tttgtgactg ttcaggacta cacagtacac 2820 atgtaattgc attgatctaa taatgatgat tgaatactgt aaaacaaacg atttactcca 2880 taaacatgta catcagttca gttcagctgt ttacatatta tgatttgcgt ccttttggtg 2940 actttttcgt atgtataatt gtgttatgcg agataaaaaa ataagttttt tgtcctaact 3000 aattgttttg accagaaaag aacttgccct tcacacatca acacgtaaac acttaatttt 3060 gaataagtct ccaacattaa gccttttaag gctgcatccc gattagtaac gggtactgac 3120 atcatatttt gtctacgggg gacctctaaa tcaaatcgga tgttaaaaat atttcaccaa 3180 agttatctat aatcatactg gaacacatca gaaacacaga cagatcaaaa gtagacaaaa 3240 aaatttttga catgcagttg tgatgcaccc 3270 // ID BEL-118_AA-LTR repbase; DNA; INV; 580 BP. XX AC supercont1.120; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-118_AA_; KW BEL-118_AA-I; BEL-118_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-580 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.120; Positions 1281045 1281624. XX SQ Sequence 580 BP; 168 A; 149 C; 137 G; 126 T; 0 other; tgttccgtac ggaacgtgcg gaaaaagcgc aaacgacaat tcgtttacac acccaatccg 60 aatgtagcga aacactcacc gaatagagcc cccagcgtga cccagcctag ctacgtcact 120 gagatcgcga gttacttcga ttgatgtaag ttccacccgt gagtgcccgt gagtgattcc 180 cgccatcatt gtcatcgtca tcgaggtact cactggcact caccacaaca gtaacgaaac 240 gatgagtgca gacgcgtcgc gaatgttcca cgcgcgtctg gaacacgtgt ccggctagag 300 cgcgaacttt ctagaacgtg tacgcgctag tataaatacg tgaccatctc acccgtacga 360 tcagtattgt tttaaccctc gcgcgcgtat aatcgagtaa aagtaaaagt gaattaacag 420 tgcaagtaaa gtgtaaatag ttaagtgcag agtgaataaa gtgaagtagt atagtgtagt 480 gattgaaaag tgtggtgtat cctttcatcc gaaatcccga cccacgatat acagtccacc 540 ccgacccgac cggagttggc cacggtgaac agcgccaaca 580 // ID BEL-80_CQ-LTR repbase; DNA; INV; 287 BP. XX AC AAWU01021955; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-80_CQ_; KW BEL-80_CQ-I; BEL-80_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-287 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 302-302 (2011). XX DR GenBank; AAWU01021955; Positions 39457 39171. XX SQ Sequence 287 BP; 80 A; 64 C; 73 G; 70 T; 0 other; tgttctggtt gcactgccat atttccccaa ccagcgtgat gcacaaccct gcgcatctcg 60 ttcagctgtg gtttggcgcg ccgttaacgg cggatggaag gttcgagaaa gcaagcaaca 120 aaaggaaggt tcaagaagaa gaaaaatcgg cgttcatcag catcagtctt gctttaacac 180 gcgaagcaaa aggaagaata aaagcggaag ttagttagtt attccggcgt gttttatttc 240 gtgtccgttc cgggttcagc gaccaagtcc aagttctacg ttaaaca 287 // ID Gypsy-230_AA-I repbase; DNA; INV; 2691 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-230_AA_; KW Gypsy-230_AA-LTR; Gypsy-230_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2691 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1063-1063 (2011). XX DR [2] (Consensus) XX CC 'TGTATG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 530..2689 FT /product="Gypsy-230_AA-I_1p" FT /translation="MQNENISPLRASSGGEPLAFKPQLQCSSSTSGIQLNM FT IENTFARLIDQLNFSKTSVTNLTQELSAMRSEINSMRADASRLNLVVKELR FT TSSLVPQQISNPNGGGYFGCDESKHAKNQEVNRGAIIKKVAISSRLTAKEY FT VLSDDDGRSHGSVFRKSIDRDVVNQSNNNVLGNGPNYYSTDRPNVQNLIDG FT RDRACEELNVKIGEGRHATDRVSGSCFGSEAYGGLCGDYHNLAHAHENGIG FT QEPVLSVSEAETAFSKFSGSDCYPVRKWINDFEELSDCIGLTELRKFVIAK FT RQLTGLAKMSLNTTSNVSNWRTLKDFLITEFDFYENCAVVHERLRNRKRNL FT GENVLEYFLQMREIGAKANVDISSIITHTINGINDNGPDKTILYGSKTLDE FT FREKLRVYQGIKDNKYGRDREYHSPTSKRFFSKGRPNGQNNFEDQFNSPLK FT FVKCYNCRKDGHISSECTREIQNNYVYLCYSNPKSSKTSKLSLLINKIPFQ FT ACFDSGSNVSLLREAWATKLNLRIDRNDRKRLMALKGAIWTLGSVSLTIEI FT EKTPLRITFDILRNKDMPHNIVLGENLLMFGDVNITANETKFCPKENLIIR FT KEPSNSGIYHSSHFKNQPKNAKNYFEMCGITKKKPSLKYDVYMTETVKIPY FT KHQHTHISRSQKQATIMMRRPRTESKSEIMRSGRDWKRSDESRSSGMSKAR FT HQTGRSDDTIRAGWWSGWPI" XX SQ Sequence 2691 BP; 905 A; 420 C; 613 G; 753 T; 0 other; aatttggggg ctcgtccggg aacgcgggaa gaaactttcg ggatattcgt tggagcgcgc 60 cggtgggaag atgttttaaa agaacagcag ttgttaaacc gccagttgcc ttgtaattga 120 attgcaaaga aaattatttt aaagtgaact gaagggaaat taaaatgaat tgttttaatc 180 taatatttta tacaataagt tacttttaag acattgaatt tgattgatta gtgataagtg 240 gatttgaatt gaaaaccaac gatcgtggca gtgtaaatta attagcaagc tctgcatgat 300 agaggtgttc tctaaattca gttgaaaaaa agctaagtgt gaactcatat aaaggagcag 360 agtgccacgt cataatttac gggtttgacc tggggttaag ggtcattgta cggtgcgtgc 420 gaatgaacaa acatcaaact atactcgtgc gatacagtgg cggtaaatag tagagaagcg 480 agtgttctgt ttgcttgttt ttttctcctt tttgaaagta acggagacaa tgcagaacga 540 aaacatttcg cctttgcgag caagcagtgg tggggaaccc cttgctttca aaccccaact 600 gcaatgttcg tcttccacaa gtggaattca attaaacatg atcgaaaata cttttgctcg 660 tttaatagat caattgaact tttccaaaac gtctgttaca aacttgactc aagagcttag 720 tgcgatgcgc agtgagataa actctatgcg ggcagacgca tcccgtttaa atttagtagt 780 taaagaatta cgcactagct ccttggtacc ccagcaaata agtaatccta atggaggcgg 840 ctattttggt tgtgatgaaa gtaaacatgc aaaaaaccaa gaagtgaacc gtggtgcaat 900 aataaagaag gttgcgataa gctctagact gactgctaaa gaatatgtgt tgagtgacga 960 cgatggtcgt agtcatggca gtgtatttag gaaaagtatt gatcgagatg ttgtgaatca 1020 atcgaacaac aatgtgttgg gtaacggccc aaattattat tcaactgata gaccgaatgt 1080 gcaaaatctg atagatggtc gtgatcgtgc gtgcgaagag ttgaacgtca agatcggcga 1140 aggccgacat gctacagaca gagtgagcgg gagctgtttt ggctctgagg cgtacggtgg 1200 cttgtgtggt gattaccata atttggccca tgcgcatgaa aatggtatag gacaagaacc 1260 agtattatcg gtaagtgaag ctgagacagc attttctaaa ttttctggaa gtgattgtta 1320 tcctgttcga aaatggatca atgattttga ggaactgtca gattgtattg gcttgacaga 1380 attacgtaaa tttgtgattg cgaaacggca attgactgga ttagctaaaa tgtctttgaa 1440 caccactagc aatgtctcta attggagaac gttgaaagat tttttgataa ctgaatttga 1500 cttctacgag aattgtgctg ttgtacatga aagattgcga aatcgcaaaa gaaatttggg 1560 cgagaatgtc ttagagtatt ttctgcaaat gcgagaaatt ggggcaaagg ccaatgtgga 1620 tatttcatca attataacac acacaattaa cggcataaac gacaatggtc cagacaagac 1680 tattttatat ggatcaaaga ctttagatga gtttagagaa aagcttcgtg tttatcaagg 1740 gataaaggat aataagtatg gaagagatag ggaatatcat tctccaactt cgaaaagatt 1800 tttttctaag ggacgaccaa atggtcagaa caattttgaa gatcaattta acagccccct 1860 taagtttgta aaatgttaca attgtagaaa ggatggtcat atttcaagtg aatgcacaag 1920 ggagatacaa aacaattacg tttatctttg ttattccaat cctaaatcgt caaaaacaag 1980 taagttgtcg ttattgatca ataaaattcc atttcaagca tgctttgact ccggctctaa 2040 tgtttcttta ctacgagaag cgtgggcaac taaattaaat ttgaggatag atcgaaatga 2100 tcggaaacgt ttgatggcgc tgaagggagc aatttggact ttgggtagtg tttcgttaac 2160 catagaaata gaaaaaacac ctttgagaat tacatttgat attttaagaa ataaagatat 2220 gccacacaat attgtgttag gggaaaattt gttgatgttt ggtgatgtaa atattacagc 2280 aaatgaaact aaattttgtc caaaagaaaa tttaattata agaaaagaac cttccaattc 2340 aggcatttac cattcttctc attttaaaaa tcagccaaaa aatgctaaaa attattttga 2400 aatgtgtggt attactaaga agaaacctag tttgaaatat gacgtttata tgactgaaac 2460 cgtgaaaatc ccctacaaac accaacatac acacatttca aggagtcaga aacaggcaac 2520 gataatgatg aggagaccta gaaccgaatc caaaagcgag atcatgcgaa gcggtagaga 2580 ctggaaacga agcgacgaaa gcaggagttc cggaatgagc aaggcaagac atcagactgg 2640 acggtcagac gacaccatcc gggccggatg gtggtcagga tggccgattt g 2691 // ID hAT-40_SM repbase; DNA; INV; 2894 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-40_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2894 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1843-1843 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 704..2518 FT /product="hAT-40_SM_1p" FT /translation="MASKKRKVDTECRNFNDDWKWKYFFTVVKEKAVCLIC FT NSGVAVFKEYNLKRHFETKHENYANNLNEEGRKKHCEDLATKLSRQQNIFV FT KQNHLQESVTEASFLIAKRIAECGKPFSEGEFSKKCMLDVCNVLCPELKNK FT FENISLSRQTIARRVEDLNVDITNQLLNNIELANFYAIALDESCDIKDTAQ FT LLIFIRAINDDFKILEEVLEIVPMKGTTTGEDLFQEIEKCFLKYHIDWKKL FT VNVTTDGAPNMTGKNIGVIKRIKDKIFEIDPNHKIIPIHCIIHQHVLCKNV FT LKIDHVTKIVVKLVNFLRAKGLNHRQFIDFLEELNTTYSDVIYYNKIRWLS FT LGKVLKRVWELQEEIRIFLVLKENEEFPELTDPNWLCDLAFSLDILTYLNE FT FNTSLQGKKLFAYDLMQKLKAFKMKLQLFSTNLAECKFDHFPSLKTISAII FT LKEKAQMYTENILALRLDFENRFECFNRIDKILSIVRNPFTANIEEASPDI FT QLELIELQCDQLLHDKYREEEDNLCNFYKQLDSKKYVNLKNLAKKYLVIFS FT STYICEQTFSLMNHNKSPTRSRLNDLHLEAVLRLATTEIEPDIKKIINEVN FT RLNISH*" XX SQ Sequence 2894 BP; 1019 A; 402 C; 451 G; 1022 T; 0 other; ccaggcatgg ccaacatacg gcccgcgtaa cgaattcatg cggcccgcga gtaatttttc 60 aatatttacg tttgcctgac acttacatag tattaatcaa gtaaattcaa gatatttctg 120 gtttttcata tgaattaaac ttttcgtgct tatttgttat ctataatcac actacaaact 180 ctataatcta tgtatttgaa gtggaacagt acgccagttc attcaatgct ttccgtaagt 240 agcaaacttg ttagtaataa ctttgtcgct tggaagttaa tttgcaatat taattcgttt 300 attggatgtt tattatcatc gcattaaaaa agcaattgct ttcttaattt aaaaatggaa 360 ttaattttgt tacttgctgc taatatcact tcggccgcca ttttttttca atttgaattc 420 actagcatag attcactgtc tcgagatcgc cgtgaaggtc caccgaacct gcgcgtctcc 480 ccactagggt tataaatatc ttaaaacatc tctagtagcc ggttgttttc aggctttagt 540 gctcgctatc acccagtgca gggtacataa tcggctcggc gttatttgta ttatattata 600 tatattaact ttttaattta tttattaatt gattaataag gtatgttacg taatttctgt 660 ttataaaaaa tattttatat taagaaatat tttcatttct aggatggctt ctaaaaaacg 720 aaaggttgat acagagtgtc gaaattttaa tgatgattgg aaatggaaat atttttttac 780 ggtagttaaa gagaaagctg tgtgtctcat ttgtaactct ggtgtggcag tttttaaaga 840 atacaattta aaaagacatt ttgaaacgaa acatgagaat tatgcaaaca atttaaacga 900 agaaggccga aaaaaacatt gcgaagattt ggccacaaaa ttgtctcgtc aacaaaatat 960 atttgtaaaa caaaatcact tgcaagaatc ggtgactgaa gcaagttttt taatagctaa 1020 acgtattgct gaatgcggga aacctttttc tgaaggagaa ttttcaaaga aatgcatgtt 1080 ggatgtttgt aacgtgttat gtccagaact taaaaataaa ttcgaaaata taagcctatc 1140 taggcaaact attgccagaa gagtagaaga tttaaatgtg gatataacca accaattatt 1200 aaataatata gagttagcca atttttatgc aatagctctg gacgaaagtt gcgatataaa 1260 agatactgca caattgttaa tttttatcag agctataaac gatgatttta aaatcttgga 1320 agaagtttta gaaattgttc caatgaaagg tacaacaacg ggcgaagatc tgttccagga 1380 gattgaaaag tgttttttaa agtaccatat tgactggaaa aaattggtga atgttacaac 1440 agacggagca ccaaacatga caggaaaaaa tattggagta attaaacgca ttaaagataa 1500 aattttcgaa atcgatccca atcataaaat tataccaata cactgcatta ttcatcaaca 1560 tgtcctatgt aaaaatgttt taaaaatcga tcatgttacc aaaatagttg taaaattagt 1620 aaatttctta agagccaaag gattaaatca taggcaattc attgatttcc ttgaagaact 1680 taataccaca tattccgatg taatatacta taataaaatc cgttggttga gcttgggaaa 1740 ggtattaaag cgtgtgtggg agcttcaaga ggaaatacga atttttttag tgctcaaaga 1800 aaatgaggag tttcctgaat taacggatcc aaattggctt tgtgatttgg ctttctctct 1860 agacatcctc acatatttaa acgaattcaa cacaagttta caaggaaaaa aattatttgc 1920 ttatgattta atgcagaaat taaaagcctt taaaatgaaa cttcaacttt tttccacaaa 1980 tcttgctgaa tgcaaatttg atcatttccc atccttaaag accatttccg caataatttt 2040 aaaagaaaaa gctcaaatgt atactgaaaa tattctggca ttacgacttg attttgaaaa 2100 cagattcgaa tgttttaaca gaatagataa aatattgtcg attgttagaa atccatttac 2160 tgcaaatatt gaagaagctt ctcctgacat tcaattagaa ttaattgaat tacagtgtga 2220 tcaattattg catgataaat atcgagaaga ggaagataat ttgtgtaatt tttataagca 2280 attggatagt aaaaaatatg taaatctaaa aaatcttgcc aaaaaatatt tagtaatatt 2340 tagttcaact tatatttgtg aacagacatt ttctttaatg aatcacaata aaagtcctac 2400 aaggtcgaga ttaaatgatt tacatttgga ggctgtatta agattagcca ccacagaaat 2460 tgaacctgac attaaaaaga ttataaacga ggtaaaccgc ctcaatattt ctcattaatt 2520 attataatat atttgttatt attaaaatgt ttattttgta ttattctgta tagatatttg 2580 ctgtaagtta cctatgattt ataattattc atatatatat atatatatat atatacttct 2640 atgtatttgt tatatcattt atagtctcat aataaaaatt tgttcatttt ttgtttaaaa 2700 tttttgtatt ttttattata atgttaattt tataattatt atatattgta tataggtatt 2760 tgttacatta tttatatttt cataataaaa ttttgttcat tttttgttta aaatttttgt 2820 attttttacg cctggcccgc gaaaattttt tttctcaatc cggcccgtga caaaaaactg 2880 ttggccatgc ctgg 2894 // ID Gypsy-1_BM-I repbase; DNA; INV; 1933 BP. XX AC nscaf1299; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_BM_; KW Gypsy-1_BM-LTR; Gypsy-1_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1933 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 977-977 (2010). XX DR Genome; nscaf1299; Positions 206879 204947. XX CC Positions [987-1628] - Reverse transcriptase CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 249..1931 FT /product="Gypsy-1_BM-I_1p" FT /translation="MISPDVIAGTSLEHCIEREPFTVKTAHATTRHEHVAI FT LPLPYLFNTDIHHKFLLFDFDPKYKGLIGMDLFKKLGCNIDFKNKVLRTFN FT TEIPIYFDYPVKQVKIEPNCERFITVKTNYTDGYYICDDFTWSKGLKSPAA FT IVTVKNGTFRTSIINYNDTQQIVSNNRMLALSPLPSHLIENSQKYEINKIE FT TETDIDKELCENLKKIRTSHMNEEEKREITKICYQYRDIFYSENIPLSFTH FT TVKHELRLTDDTPIFVRSYRQAPQQRTEIQKQVDSLLKQGIIRESISPWSC FT PVHIVPKKPDASGKVKWRLVIDYRRLNDRIIEDKYPLPNINDILDRLGRAQ FT YFTTIDLASGYHQLEMHPKDVEKTAFTTERGHYEFLRMPFGLKNAPSTFQR FT LMDHILRGIDNVFMYLDDVIIAATSLQNHNEKLKLVFQRFKMHNLKVQLDK FT SEFLQKHVNFLGHELTDQGLNPNKDKIKAVLNFPIPQTQKDIKAFLGLVGY FT YRKFIKDFAKLTKPLTACLKKNAKVEHTNEFLDAVDKCKQILTNAPILQYP FT DFDKPFILTTDASD" XX SQ Sequence 1933 BP; 707 A; 401 C; 340 G; 475 T; 10 other; gagatagaaa ctaatgnnnn nnnnnnacta taaccataac atgcaataca gcgaccaaaa 60 ctacgagtat atgccagata tgtcgggaga agaatcaaat caacaggatt tcttcaggac 120 gaggaccaat cgaacctcag ctgataaaga ctgttaacct caaccaccac ggcaaagacc 180 acctccctta catcgaatta ccggagtttc aaggtaaatt tttattagat actggtgcct 240 cacgttgtat gatatcaccg gatgtaatag ccggtactag tctcgaacat tgcatcgaac 300 gtgagccatt tacggtaaaa accgcgcatg ctactactcg tcacgagcac gtcgcaattt 360 tacctttacc ttacctgttc aacaccgata tacaccacaa atttttattg tttgattttg 420 acccaaaata taagggcctt attgggatgg acttgtttaa aaaattaggc tgtaatatag 480 atttcaaaaa caaagtgttg cgaacattta ataccgaaat accaatctat ttcgactatc 540 cggtaaagca agttaaaata gaaccgaact gcgagaggtt tataacggtt aaaacaaatt 600 acaccgacgg ttattatatt tgtgacgatt tcacttggtc taaaggatta aagtcaccag 660 ccgcgatagt gacagtaaaa aacgggactt tcaggacatc tataatcaat tataatgata 720 cacagcaaat agtgagtaac aatcgtatgt tagccctaag cccactaccg tcacatttga 780 tagaaaacag tcagaagtac gaaattaaca aaattgagac ggaaactgac atagataaag 840 aactttgtga aaatttaaag aaaattcgta ccagccacat gaacgaggaa gaaaaacggg 900 aaattactaa aatctgctat cagtaccgtg acatattcta ctcggaaaac attcctttat 960 cgtttaccca tacagtaaaa cacgaattaa gactaaccga cgacaccccc atctttgtac 1020 gaagttatag acaggctccc caacaacgaa cagagataca gaaacaggta gatagtctgt 1080 taaaacaagg aatcattagg gaaagtatct ccccttggtc gtgcccggta cacattgttc 1140 cgaaaaaacc ggatgcatca ggaaaagtta aatggagact tgttattgac tatagaagac 1200 ttaatgacag aattatagaa gacaagtacc ccttaccaaa cattaacgac atccttgaca 1260 gattagggcg cgcacaatat ttcacgacca tagatttagc aagcggctac catcaattag 1320 aaatgcaccc taaagacgta gagaaaacag cgtttactac tgaaagaggc cactatgagt 1380 tcctaagaat gcctttcgga ctaaaaaatg ccccgagcac tttccagcgt cttatggacc 1440 atatactccg aggtatagac aacgtattta tgtacttaga tgacgtcata atagccgcga 1500 cgtccctaca aaaccacaat gaaaaactga aattagtatt tcagcgattc aaaatgcata 1560 atttgaaagt tcagttagac aaatcagaat ttctacagaa gcacgttaac tttctaggac 1620 atgaattgac tgaccaagga ctaaatccta acaaggacaa aattaaagca gtattaaatt 1680 tccctatacc acaaacgcaa aaagacataa aagctttctt aggcctagtc gggtactata 1740 ggaagtttat taaggacttt gcgaagttga cgaaaccttt aacagcatgt ctaaaaaaga 1800 acgcaaaggt tgaacataca aacgaatttt tagacgcagt tgataaatgc aaacaaattc 1860 taacaaacgc cccaatcctg caataccctg acttcgacaa accgtttatt ttaacgacag 1920 acgcatctga ctt 1933 // ID R8Hm-A repbase; DNA; INV; 4327 BP. XX AC . XX DT 28-MAY-2010 (Rel. 15.05, Created) DT 28-MAY-2010 (Rel. 15.05, Last updated, Version 2) XX DE R8Hm-A - 18S rDNA-specific non-LTR retrotransposon from Hydra DE magnipapillata. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R8Hm-A. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4327 RA Kojima K.K., Kuma K., Toh H. and Fujiwara H.; RT "Identification of rDNA-specific non-LTR retrotransposons in RT Cnidaria."; RL Molecular Biology and Evolution 23(10), 1984-1993 (2006). XX DR [1] (Consensus) XX CC R8Hm-A and its close relative R8Hm-B belong to the R2-A clade (or CC R2 clade), but specifically insert into 18S rDNA instead of 28S CC rDNA. TSDs are AAGCTGAAA. XX FH Key Location/Qualifiers FT CDS 338..3817 FT /product="R8Hm-A_1p" FT /note="includes RT and restriction-like FT endonuclease." FT /translation="MNLLIVTSSIKESDVPSSGKGGVAVNNITAGASGKDT FT CVIIHPGTDGIWCCTECVEIHNSGKDLKRHLAKRHPSVTISGYKCNLCPFV FT SERQLSVGTHLRYCRGVKEVVKREFACASCSFSSDTFSGLQVHMQRKHIAE FT WNDQLKEKTEFAWTDRELRELAEKELTTPSFRYNKIFYAALGTSRTYDAVR FT KIRYNDRYKSAIAEMRSQIADAAAAAQERDVERGLVSAHSDRGKEMLPVVE FT TKSDIQVNNDIKKDIELTPNSRQKQTNLALARPAVIEVEEDLGRQDVKQYL FT ASLRQDDYTSPAERSIFAYCREETNWSATKRQVLKISRTTRGLRQPKKVRP FT FEFPEGFKPNRNMRKWRKYRFLQECYREKRAETVSKILDGTFIDEPEEEIR FT PELEEVQRMYIDRLEKRTQLDTTKIVQTDEVFCLQSYGRITIGEVRDALGA FT SKKDSASGPDGLLLQDVRRLGPLLLCNIFNMWYLHGIPVEENRCRTILLYK FT SGDRHLASNYRPVTIGNMLNRLYAKIWDKRIRKNVRLHVRQKAFIPVDGCF FT ENVKTIQCVLQSYRKRKLEHNVVFIDLAKAFDTVLHDSIRKALWRKGVPSG FT VVKVVDSLYAGAVTSISVGKTKTRSICINSGVKQGCPLSPLLFNLILDELA FT ERIEATGCGLDLDGHVLSSMAFADDYVLLAKDSVEMNELIRVCSTFFKEKG FT LSVNPGKCQSLRVLPVKEKKRSMKVLVRPHRWWRIKDQDVDIPSMTYDSLG FT KYLGVSIDPTGKIALPIEEWKNWMTKLKECKLKPEQKVKILKEVVCSRVNY FT VLRMSECGISELRSWTRFVRNWAKNIIHLPTWCSSDWIHSIKGLGIPDVSK FT GIVIQRMRASEKMSTSEDGIVRVVGARLVQKNRVLWEKAGFEGIELKAARR FT HCEVERLNNIGNITNGVALKTIAAVSSVNRYWMIEDNLKSGNKILVWKAMA FT GAIPTKINLSRGVADQTLKKCRRCGLTAETDGHILAGCHTSSDAYSKRHNM FT LCDKLAKELKLNGGPNRRVWRERTCFTSTGRRYRPDIIVKDDSKITVIDMT FT CPYEKSEGHLIQCESAKVTKYEPLKLDKYWTRELEGANGIVAEKVELMGLA FT IGAIGTIMRSTLRKLCELKSGRIVRRLQMIACNNSAQIIKGHLSRATRRNL FT R" XX SQ Sequence 4327 BP; 1307 A; 837 C; 1135 G; 1048 T; 0 other; ttcaagtgga tgaagctggg aaggtaatct gtagttggtt gagttggttg cagattactg 60 ctgtcgattt tgctttctat tgaaagcctg tctctacggg tcctgaagct tgaattttgg 120 tagctatagt tttgtgggag gaaagtggaa ttttgtacca tcttttgtct ctcgtatcta 180 ctatagtaaa tccggtcatg cagcctctac gcggcgcaac tagaaacttg gatcagtgat 240 caaggctaat gcatgccggg tctcctcaga ttaggagtat aatacaaatc tgacttcatc 300 actaagaggc tatggggcta acgatcctat agtctcgatg aacctattga ttgttactag 360 tagcataaaa gaaagtgacg taccctctag tggaaagggg ggtgtagcag tcaataacat 420 aacagcagga gctagtggaa aagatacgtg cgtgatcata cacccaggta ccgatggtat 480 ttggtgctgt actgagtgtg tagagataca taacagcggt aaggatctga aacgacatct 540 tgcaaaacgt cacccgagtg taacgataag cggttacaaa tgcaatctgt gtccatttgt 600 tagtgaacgc caactaagtg tggggacaca tctgaggtac tgcagaggcg taaaagaagt 660 ggttaaaaga gagtttgcat gcgcgagctg ctctttttct tcggatacgt tctcaggact 720 tcaggtgcat atgcaaagaa agcatatagc agaatggaac gaccagctga aggagaaaac 780 ggagtttgct tggacagacc gagaattgag ggagcttgct gagaaggaac ttaccactcc 840 ttccttcagg tacaacaaaa ttttctatgc tgcgctaggt acctcccgga cctacgacgc 900 tgtgaggaaa attcgctata atgacagata caaatctgcc attgctgaaa tgcgatcaca 960 gatagcagat gcggctgccg ctgcacaaga gagggatgta gagcggggtt tagtttcagc 1020 acactcagac agaggaaaag aaatgctccc tgttgttgaa accaaaagtg atatccaagt 1080 aaacaacgat atcaaaaagg atattgaatt aacaccgaat tcaagacaga aacaaactaa 1140 tctagcgctg gcaaggccag ctgtaattga ggtggaggaa gacttgggta ggcaggatgt 1200 gaaacaatat ctcgcatccc tgcgccaaga cgactacaca agtccggccg agcggtcaat 1260 ctttgcatac tgcagggagg aaaccaattg gtctgcgaca aaaagacagg tattaaagat 1320 atcgagaact accagaggtt taagacaacc taagaaggtt cgtccatttg agtttccgga 1380 agggttcaaa cctaacagaa atatgagaaa gtggagaaag tatagattcc ttcaggaatg 1440 ctatagggaa aagagagctg agactgttag caagatcctg gacgggactt ttatcgatga 1500 accggaggaa gagattagac cagagttaga ggaagtacaa cgtatgtaca ttgaccggct 1560 ggagaaaaga actcagctgg ataccacgaa gattgtgcaa acagacgagg tgttttgtct 1620 gcaaagctac ggtcgcatta cgatcgggga agtaagagat gcactcggtg caagcaagaa 1680 ggactcggcc tcgggtcctg acggcctgct tctacaggat gtgaggaggc tgggaccact 1740 attattgtgt aacatcttta acatgtggta cttacatggg atccctgtgg aagaaaacag 1800 gtgtcgaaca atactcttat acaagagtgg cgatagacat ctggcatcaa actatagacc 1860 tgtgacaatc ggcaacatgc tgaacaggct ttacgccaaa atctgggaca aacggatccg 1920 gaagaacgtg cgtcttcatg tgaggcaaaa agcatttatc ccggtggatg ggtgctttga 1980 gaacgtaaaa acgatccaat gcgttctcca gtcttacaga aagcgtaagt tggaacacaa 2040 cgtcgtattt attgatcttg ccaaggcctt tgacacggtc ttgcatgact cgataaggaa 2100 agcgttgtgg cggaaaggtg ttccgtctgg ggttgttaaa gtggtagaca gcttatatgc 2160 gggagctgtc acaagcataa gtgttggaaa aacgaaaact cgttctatat gtataaactc 2220 tggagtcaag cagggttgtc ctctgtcacc tcttctattc aacctaatac tggatgaact 2280 agcggagagg atagaggcaa ccggctgcgg gttagatctt gatggtcacg ttctatcatc 2340 tatggccttt gctgacgact acgtgttgct agcgaaggac tccgtggaga tgaacgagtt 2400 gataagagtg tgtagtacat tcttcaaaga gaaaggctta tctgtaaacc caggtaaatg 2460 tcaatcgcta agagttcttc ccgtaaagga gaagaaacgg tcaatgaagg tccttgttag 2520 acctcataga tggtggagga taaaagacca ggatgttgac atcccatcta tgacatatga 2580 cagcttagga aaataccttg gtgtttcgat tgacccaact ggtaagatag cgcttccgat 2640 tgaggagtgg aagaattgga tgaccaagct aaaagagtgt aagctcaagc ccgagcagaa 2700 agttaaaatt ctgaaagaag tggtttgctc tcgggtaaac tacgttttgc ggatgtcaga 2760 gtgtggcatc agcgaacttc ggagttggac acgatttgta aggaattggg cgaaaaacat 2820 cattcactta cccacatggt gcagtagtga ctggatacac tcgatcaaag ggttaggcat 2880 tcccgacgtt tcgaagggaa ttgtcataca acgtatgagg gcttcggaga aaatgtctac 2940 gtctgaagac ggtatagtcc gcgtggttgg tgcacgactt gttcagaaga acagagtctt 3000 gtgggaaaag gccggtttcg aaggtatcga actgaaggca gccaggaggc actgcgaagt 3060 ggagagactc aacaacattg gtaacattac caacggcgtt gcactcaaaa ctatcgcagc 3120 agtctcctcg gtaaatcggt actggatgat tgaagacaac ttgaaatccg ggaacaagat 3180 tctcgtttgg aaagcaatgg cgggtgccat tccaacaaag attaaccttt cgcggggcgt 3240 agcagaccag accctcaaaa aatgtcgtcg atgcggttta acagcggaaa cggatggaca 3300 catcttggct ggatgccata ctagcagcga cgcgtactca aaacgtcaca acatgctctg 3360 tgataaactc gccaaagagc tcaaactcaa tggtggacca aacagacgtg tgtggcgcga 3420 gaggacgtgc ttcactagta caggcaggcg atatagacct gacattatcg ttaaagatga 3480 cagtaaaatc acagtcatcg atatgacttg tccgtatgag aaatcagaag gacacctgat 3540 ccaatgtgaa agtgcgaaag taactaaata cgagccactc aagctagata agtattggac 3600 tcgagaactc gagggagcaa atggtattgt tgctgaaaag gtagagctga tgggattggc 3660 aataggggcg atcggcacaa tcatgcgtag tacccttcgg aaactctgtg agttaaagtc 3720 gggcaggatc gtaagacgtc tacaaatgat tgcttgtaat aatagcgccc aaattataaa 3780 gggtcacctg tcaagggcga ctcggaggaa tttgcggtga taaatgccaa aagttgcttg 3840 ggctaaatga tacgtacgct agaaaaagcg acttgctgca cggatgacgg ttcatcagag 3900 cccgatatgt gcatgtcaag gcggcaggga gaatcactag tgtagctgtt ctttccatta 3960 cgacttacgc ggttaacgtg gcacgataga tttacaccag gaaataatac gtgaagggtt 4020 ccaccatata ctggagttta gatctatgag ggaaacattt gtaataagtc agtctggtaa 4080 cctggcgccg ctgttgagtc aaattaacta tgtcaatact cattaagtta tcgactttga 4140 tatggcatgg ggtgattccg cgttatatca aagtcaaaca tgatgattgc aatgagaaac 4200 taccacgctt ggtcacgttt gtgaggagaa catctcattc aagcctcccg gatgtcggca 4260 cccgctgaca tcttctggct tatgaaaatt ttcattaatt tttgtaagtc atgggcggct 4320 tgaaagc 4327 // ID BEL-29_CQ-I repbase; DNA; INV; 5640 BP. XX AC AAWU01003415; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-29_CQ_; KW BEL-29_CQ-LTR; BEL-29_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5640 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 211-211 (2011). XX DR Genome; AAWU01003415; Positions 9074 3435. XX CC Positions [4669-5250] - Integrase core CC 'CATTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 369..4400 FT /product="BEL-29_CQ-I_2p" FT /translation="MGPKIPRTPLKKEGEEEEDLENLVFLREEQTEVIDRL FT KATIDATLVEERTEAAAKVHRWRLDTCFAEFAAIKERIYRADPKKRNDHKK FT VVVEFEALFDQLAMTLGGWNAGAAPAASSALVERHDRPIVIQQPLPRVFPS FT FDGKYENWEKFKVMFVDVVDKTNESARMKLYHLEKALVGDAAGFIDAKTIQ FT DGNYAHAWKQLTDHYEDKRRMVDLHIGGLLSVQKLACEGHLELRALVDSVV FT GNVENLKYLGQAFTGVSEQIVVYLLGHALDDTTKKIWESTVARGELPKYED FT MIKTLKDRISVLERCETSSDSIKKQQPRTSSKPSTGKQPFQRANAATTPAQ FT PEFQCGVCGEGHLTFRCANLTGLTVAQRREKVRNKNLCFNCLRSGHSWKRC FT SSKNSCGKCQQRHHSLLHDEAKAEPVPKSTPQQSAAPPKNPTPVVQPAGQQ FT APPTTEATCNHTQTTKTVMLLTAVVHLSDTQNRPVPCRVLLDNGSQVNFIS FT ESMAKRINIQRVATTVPICGIGAVKTYAREAVTVELQSRYNSFAVNVECLI FT VPKVTGLIPSSPVNIADWPIPECIQLADPDFQTPDRIDLLLGVAMFFRLLK FT SGHIELGNNLPDLRETYLGWVVAGEVGDTVPGPQYSHTATLEDISEAIQRF FT WQVEEIESVTPITTEQEDCETFFRSTHKRDPTGRYEVRLPFRPVVARLDDN FT RSLALRRFLSLEKRLNRDPDLRKQYGEFIQEYQSLGHCKTVDETVDSPKLG FT RYYLPHHAILRPSSSTTKLRVVFDASAKMSPSATSLNDALQIGGNVQNDLF FT SILLAFRKHPVVFTADISKMYRQIRVASADTSYQRIFWRNDPSDFVQVLEL FT TTVTYGTAAAPFLATRCLVQLCEDEGENFPLAASIVRNACYVDDILAGADT FT PEEAIECLRQLQGLLGRGGFPIHKWSSNTPAVMDQIPESDREKLIDLDGLS FT GGVVKALGLYWCPGDDDFRFTATQSDAEATKRRVLSEIGKFFDLLGLLSPV FT IIMAKMLMQKVWNEGLTWDVLLNGELGTTWQQFQTALPDVREIRIPRYVIG FT AGSVGLELHGYSDASKLAYGAAIYIRSLLPNEKTEMRLLCSKSKVAPNKQL FT DIHRLELLGCRLLSRLVVKVIAALKLPFRAVVLWSDSQVVLAWLKKSLDLL FT QTFVRNRVAEILQETAGFIWNYVRTKDNPADLISRGMFPAPLMKCKKWWEG FT PSYMAPVVYPVEPAREVPDEELPELRKVKMVAMLAFNSYELPIFETCSSFR FT KLQRVMAYVVRFVNNCKQKNPADRVHRRHPSIPELRAALNLIVKVVHARRA FT VQGDTAGGRERFGRKAPVSGTFLGQRSSASRR" FT CDS 4642..5640 FT /product="BEL-29_CQ-I_1p" FT /translation="MGMLPSSRVSQAFPFEEVGVDYAGPVFVKVGLRKPQM FT VKAYIAVFVCMTTKAVHLELVSDLTTEAFLAALQRFVSRRGVPRQIHSDNG FT TNFKGAKTELHELYLLFKQRAFNDQIETYCQPKEIAWSFIPPGAPNFGGLW FT EAAVKSTKYHLKRILKNAQLTFEQYATVLAEVEAVLNSRPLFAMSSDPSDP FT EILTPGHFLIARPLTAIPEPNYDGIPTNRLSKWQHLQRLREHFWKVWKNEY FT LTSLQPRGKNLKEKPNVRPGMVVLLEDKEAAPLQWKLGVVTNTYPGPDGLV FT RTADVRVGGTTVRRPITKLAILPILDNEPTPAGSAPSPGGG" XX SQ Sequence 5640 BP; 1309 A; 1654 C; 1632 G; 1045 T; 0 other; ttggtccttc aacccggatt ggaccccggc gtggacgctg gaaagaagac tggtgaagtg 60 aggaggcaaa ccggcctcag cgcgaacagt taccgagtgt ctcgggaagc aaaccggctt 120 ccgcgtgaaa acaaaagaac attcgcgggg tgcaaaccgg cttccgccgc gagtgtgcaa 180 ggcaaaccgg ccttggcaca cgaaccgttc cgtggaacca tcgggggagg caaaccggcc 240 tcagaccgac tgcggaagaa caaaaagtgc cacccagcgt ggaaaaaagt gaaacaaaca 300 cgaaagaaca aagtgaaagt accacgtgct gttcctaacc tcaaaagtgt gcaaaagtga 360 caaaaaaaat gggccccaaa ataccacgca caccgctcaa aaaggaaggc gaggaggaag 420 aagacctgga gaacctggtt ttcctccggg aagagcagac ggaggtgatt gatcggctga 480 aggcaacgat cgacgcgact ctcgtcgagg aacgcactga ggccgctgcg aaggttcacc 540 gatggaggtt ggacacgtgc ttcgccgagt tcgccgccat caaggaacgc atctacaggg 600 ctgatccgaa gaagcggaac gaccacaaga aggtggtcgt ggagttcgag gccttgttcg 660 accaactggc catgacgctc ggaggttgga acgccggtgc cgctccagct gcatcgtccg 720 ccctcgttga acgacacgac cggcccatcg tgatccagca acctcttcca cgtgttttcc 780 catccttcga tgggaagtac gagaactggg agaagttcaa ggtgatgttc gtcgacgtcg 840 tggacaagac gaacgagtcg gcgcgcatga agctgtacca cctcgagaag gctttggtcg 900 gcgacgcggc cgggtttatc gacgccaaga ccatccagga tgggaactac gcccatgcct 960 ggaagcagct gacggatcac tacgaggaca agcgccggat ggtagacctt cacatcggag 1020 gtctactgag tgtgcagaag ctggcctgcg agggccacct ggaacttcgg gctttggtgg 1080 attccgtcgt tgggaacgtc gagaacctga agtacctcgg ccaagcgttc accggcgtgt 1140 cggagcagat tgtcgtctat ctcctgggcc acgccctaga cgacaccacc aagaagatct 1200 gggagtctac agtcgcgaga ggtgaattgc ccaagtacga ggacatgatc aaaacgctca 1260 aagatcgtat ctcggttctt gaacggtgcg agaccagttc cgactccatc aagaagcagc 1320 agccacgaac aagttccaaa ccatcgactg gcaagcagcc gttccagagg gccaacgcgg 1380 cgaccactcc ggcccaaccg gagttccagt gtggagtctg cggagaaggc cacctcacct 1440 tccgatgcgc caacctcact ggcctaacgg tggcccagcg cagagagaag gtccggaaca 1500 agaacctgtg cttcaactgt ttgcgcagtg gccacagctg gaagaggtgc tcgtcgaaga 1560 actcgtgtgg caagtgccaa caacgccacc attctctgct gcacgatgaa gcgaaggctg 1620 aacccgtgcc gaagtccaca ccgcagcaga gtgctgcccc accaaagaac ccgacaccag 1680 tggtccaacc cgctggccaa caggcgccgc cgacgaccga ggcgacgtgc aaccacaccc 1740 agacgacgaa aaccgtcatg ctcctgacgg cggtggtcca cctgagcgac acccagaacc 1800 gacctgtgcc gtgccgagtc ctgctggaca acggttcgca ggtcaacttt atttcggagt 1860 ccatggcaaa acgcatcaac atacagagag tggctacaac cgtcccaatt tgcggcatcg 1920 gcgccgtgaa gacgtacgcc agggaagcgg tcacagtcga gctccagtcc cggtacaaca 1980 gcttcgcggt gaacgttgag tgcctgatcg tgcccaaggt gactggactg attccgtcgt 2040 cgccggtaaa catcgctgac tggccgattc cggagtgtat ccagctggcc gatcccgact 2100 tccagactcc ggaccgcatc gacttgctgc tcggagtggc catgttcttc cggctcctga 2160 aaagtggaca cattgagctg gggaacaacc tgccggatct acgcgagacc tacctgggct 2220 gggtcgtcgc aggagaggtc ggggacaccg tccctggtcc acagtatagc cacaccgcaa 2280 cgctggagga catcagcgag gcgattcaac gcttctggca ggtcgaagag atcgaatctg 2340 ttacgcctat caccactgaa caggaagatt gcgagacctt tttccgctca acccacaagc 2400 gggaccctac cggtcggtac gaggttcgct tgccgttccg ccccgtcgtc gccagactcg 2460 atgacaaccg cagcctcgcc ctgcggcgtt tcttgtcgct ggagaagcga ctcaaccgag 2520 acccggacct gcggaaacaa tacggtgagt tcatccaaga ataccaatcc ctggggcact 2580 gcaaaacggt tgacgaaact gttgactcac cgaaacttgg ccgctactac ttgccccacc 2640 atgccattct ccgcccctcg agctcaacga ccaagttgag ggtagttttc gatgcctcgg 2700 ccaagatgtc tccctctgcc acctccctga acgacgcgct gcagatcggg ggaaacgttc 2760 aaaacgatct cttctcgatt ctgttggcct tccgcaaaca ccctgtcgtc ttcaccgcgg 2820 acatctcaaa gatgtaccgc caaatccgag tcgcttcagc tgacaccagc taccagcgga 2880 tcttctggag gaacgatccg tccgacttcg tccaggtcct tgaactcaca accgtgactt 2940 acgggacagc ggcggctcct tttctcgcga cgaggtgtct ggtccaactg tgtgaagacg 3000 agggtgaaaa tttcccgttg gcagccagca ttgttcgcaa cgcctgctac gtcgatgaca 3060 tattggcagg tgctgacaca cccgaggagg cgatcgaatg tttgcgacaa cttcaaggct 3120 tgctaggtcg tggtggattc cccatccaca aatggagctc gaacacgccg gcagtgatgg 3180 accagattcc agaaagtgac cgggagaagc tgatcgacct ggacgggcta tccggtggcg 3240 tggtcaaggc tctgggactg tactggtgtc ctggagatga tgatttccgg ttcaccgcaa 3300 ctcaatccga cgccgaggcc accaagcggc gtgtactttc cgagattggg aagttcttcg 3360 acctcctggg actcctgtcg ccagtcatca tcatggccaa gatgctgatg cagaaggtgt 3420 ggaacgaagg cctgacgtgg gacgtgctac tcaacggaga actgggcacc acgtggcagc 3480 agtttcaaac cgcactaccg gacgtgcggg aaattcgtat ccctcggtac gtaatcggcg 3540 ccggcagcgt tggccttgag ctccacggct acagcgacgc ctcgaagctg gcatacggtg 3600 cggcgatcta catccgaagc cttctaccga acgaaaagac cgagatgcgg ctgctctgca 3660 gcaaatccaa agtggcaccc aacaagcagc tcgacatcca ccgtctggaa ctgctcggct 3720 gcagactgct gtcgcggttg gtggtcaagg tcatcgctgc gctcaagctt ccgttccgcg 3780 ctgtggtgct ttggtcggac agtcaggtgg tgctggcctg gctcaagaaa tcactagatc 3840 tgctccaaac tttcgtgcgc aaccgcgtcg cagaaatcct ccaagaaacg gccggtttca 3900 tctggaacta cgtccgcacc aaggacaacc cggccgacct gatctcgcgc ggaatgttcc 3960 cggctccgtt gatgaagtgc aagaagtggt gggaaggacc gtcatacatg gcacccgtcg 4020 tgtaccccgt cgaacctgca agagaagtgc cagatgaaga actgcctgaa ttgaggaagg 4080 tcaaaatggt cgcgatgctt gccttcaact cctacgagct gccaatcttc gagacgtgca 4140 gttctttccg gaagctgcag cgcgtgatgg cgtacgtcgt gcgctttgtg aacaactgca 4200 agcaaaagaa cccggcggac cgagttcacc gacgccaccc gtccattccg gagctgcggg 4260 cggcgcttaa cttgatcgtg aaggtggtgc acgcacgacg tgctgtccaa ggagatacag 4320 caggtggccg agaacgattc ggccggaaag ctccagtgtc tggtaccttt cttggacaac 4380 ggtcttctgc gagtaggagg tagactgcag caatcggagc tgccgttcgg gacgaaacac 4440 cagctgattc taccgaagca tcgggtcacg gacttgatca tccgagctta ccacgaggag 4500 cacctgcacg cggggccgtc ggccttgctt gccatcttga gaaggcaatt ctggctgctc 4560 gacggacggt cgacagtccg gaacgtcacg agaagttgtg tgacctgctt tcacgtgaag 4620 ccgcgcaatt cgagtcagct gatggggatg ctgccgtcca gccgggtgtc gcaagcgttt 4680 ccgttcgagg aggtcggcgt ggactacgct ggacccgtct tcgtgaaagt tggactccgg 4740 aagccgcaga tggtcaaggc gtacattgcg gtcttcgtgt gtatgacgac aaaggcggta 4800 cacctggagc tcgtgtccga tctgacgaca gaagcgttct tggccgctct ccaacgtttt 4860 gtcagccgac gcggcgtgcc tcgacagatc cactccgata acggcacaaa tttcaagggg 4920 gccaagaccg agctgcacga gctgtacctc ctgttcaagc agcgtgcctt caacgaccag 4980 atcgaaacgt actgccagcc aaaggagatt gcctggtcgt tcatccctcc cggtgcaccg 5040 aacttcgggg ggctttggga ggcggctgta aaaagcacca aatatcatct caagcggatt 5100 ttgaaaaatg ctcaactcac ctttgagcag tacgccaccg tgctggccga ggttgaagcc 5160 gtgctcaact cccggccgct gttcgcgatg tcgtccgacc cctcggatcc ggagatcctg 5220 acgccgggac atttcctgat cgcgcggccg ctcacggcca tccctgagcc aaactacgac 5280 ggaatcccga cgaaccggct gtccaagtgg cagcatcttc aacgtcttcg agagcacttc 5340 tggaaggtct ggaagaacga atatctcacc agtttgcagc caaggggaaa gaacctgaaa 5400 gagaaaccaa acgtccgtcc gggaatggtg gttctactgg aggacaagga agcggcgccg 5460 ctgcagtgga agctgggagt cgtaacgaac acctacccgg ggcccgatgg cctcgtccga 5520 actgctgacg tgagagtcgg cggcactacg gttcggcggc caatcaccaa gctggccatc 5580 ctcccgatcc tggacaacga acctactcct gctgggtctg cccccagccc ggggggagga 5640 // ID Crack-5_AAe repbase; DNA; INV; 3169 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3169 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1221-1221 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >93% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). This consensus is likely CC 5'-truncated. XX FH Key Location/Qualifiers FT CDS 112..2988 FT /product="Crack-5_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="ITLIKINYKAELINELNINFCLEEIKLDKENLNILYL FT NTRSCRNKEEELYQLINEFDGSVHVVVFAETWLYKNEIFNLLGYDAYHCCR FT DGRGGGVSIFVLSDYRSQLMLEISSEDNNFLMVELIDQNVKIMGVYNPGRS FT FNLFFEQLEQILHNYKRIIVCGDFNINLLDSSGELIRQYTYKIQSLGYIIL FT NKLDARYATRISNTIATVIDHIITDIFEYDLQFGIKDTERLISDHRTLTLS FT MRIENHKKYKPDHKKICVNYENIIRSTSQNTFEICQQFSDIVVVLNEIITK FT NRTEILSRKKAMVIRKPYINNALIAEIHNKNELYRQYRTAPEGVLKDELYI FT LYARKRNMLKNKIKFAKEQYYGRKIHQNINNYKKTWSTLREIIFQNGSENI FT TPQALTLKENGKIIDSAAQIAKCFNEYFINIGWNIQQNGTGNNNFLIYLNT FT MPETRFSFSVVETEKVKNVINSLKSGASVGLDKISTKCIQKCVTLIAPKLT FT EIVNNMILSSQFEDCLKIAKIVPLYKSGDKFSKSNYRPVSILPALSKIPEC FT IMNEQIKHHLLQNELLNKNQFGFLPKKSTETASLELLNFIVKGLDDGEYVG FT CIFIDLRKAFDCISHELLIAKLSFYGFTPSAVSLLKSYLANRKQQCQVNGV FT LSEPLLIKSGVPQGSIIGPILFNIFINDLFNIPLKGKLQCYADDAVLKYRA FT KDLETLQSMMQHDLQLLPSWFSANKMQINNKKTNFIIFSLKTSNVSLDLYI FT NNERLSQVTEVDYLGLAIDESLNWKSHITLVKNKITPFIFALRRVRGCLST FT KSCWQLYNAFILPHLSYMNTVWGSAAASHLNVLKVLQNRVIKTIRKLPLLF FT PSASLYSPEVLPLNLLHKYNCLILVYKIKYKLVESNIRLIEFHDLHSHSTR FT ARQNFYVSRPRTELALKHFFYSGITQFNALPNHLKLELNLSSYKKKLKKYL FT FENN" XX SQ Sequence 3169 BP; 1180 A; 455 C; 535 G; 999 T; 0 other; ggtattccaa cgggaatgtc ttcttacgta aggatgaaag ttctaaggca atccatgtga 60 gtgatgttag tgttctatca tcaattgtgt aatatttcct gattaagtta aattactttg 120 attaaaataa attataaagc agagctaata aacgagttga atattaattt ctgtttagaa 180 gagataaaac ttgataaaga aaatcttaat atcctgtacc taaatacaag aagttgtcgt 240 aataaagaag aagaattgta tcagttaata aatgagtttg atggcagtgt acatgtggtg 300 gtttttgcag aaacttggtt atacaaaaac gaaattttta atctattggg atacgatgcg 360 tatcattgct gtagagatgg aagaggaggt ggcgtatcaa ttttcgtgtt atcagattat 420 cgaagccaat taatgttgga aataagttcg gaggataata atttcctgat ggttgaattg 480 atagatcaaa atgtaaaaat aatgggagtg tacaatcctg gtaggtcatt taatttattt 540 tttgaacaat tggaacaaat tctacataac tacaaaagaa ttatcgtatg tggtgatttc 600 aacataaatc tactagattc atcgggagag ttaattcgcc agtacacgta taaaattcaa 660 tccttgggat atataatact taacaaatta gacgcacgct acgctacacg tatatctaat 720 acaattgcca ctgttataga tcatattatt actgatatat ttgaatatga cctacaattt 780 ggaataaaag acacagaacg cttaatatca gatcacagaa ctctaacttt gtcaatgaga 840 atagaaaacc ataaaaagta taaacctgat cataaaaaga tttgtgtgaa ctacgaaaat 900 attataagat caacttcaca aaacactttt gaaatctgtc agcaatttag tgatattgtt 960 gtagtactaa atgaaataat aactaaaaac agaacagaaa tattatcaag aaaaaaagca 1020 atggtcatta gaaaacctta tataaataac gcattgattg ctgaaatcca taacaaaaat 1080 gagttatata ggcaatatag aacagcccca gaaggtgttt taaaagatga gctttatatt 1140 ttgtatgcaa ggaagcgaaa tatgttgaaa aataagatca aatttgccaa agagcagtac 1200 tatgggcgca aaatacatca aaacataaat aactataaga aaacttggag cacacttagg 1260 gaaatcattt tccaaaacgg ctccgaaaat ataacaccac aggcgttgac attaaaagaa 1320 aatggtaaga ttattgattc agctgcacaa atagcaaagt gctttaatga atattttatt 1380 aacattggtt ggaatattca acagaacgga acaggaaata ataatttttt aatctattta 1440 aatactatgc cagaaactcg tttttctttt tctgttgttg agacagaaaa ggtgaaaaat 1500 gtgataaatt ctttgaaatc aggagcatca gttggactag ataaaatttc tacgaaatgt 1560 attcaaaaat gtgttactct tattgcccca aagctaactg aaatagtaaa taatatgatt 1620 ctttcttcgc aatttgaaga ctgtttaaaa atagcaaaaa tagttcctct atataaatcg 1680 ggagataaat tttcaaaatc aaattacaga cctgtttcga ttctaccagc cctgtcgaag 1740 atccctgaat gcattatgaa tgaacaaata aagcatcatt tgctgcaaaa tgaacttctg 1800 aacaaaaacc agtttggttt cctgccaaag aagagtactg aaactgcttc gttagaactt 1860 ttgaacttta tagtgaaagg gttagatgac ggagaatatg ttgggtgtat ttttatagat 1920 ttacgtaaag ctttcgactg tatttcccat gaacttctaa tagctaaatt aagtttctat 1980 ggttttacgc cttcagcagt gagtttacta aaatcatacc tagcgaatag aaagcagcaa 2040 tgtcaagtca atggagtact tagtgaaccc ttgttgataa agtctggtgt acctcaaggg 2100 tcaatcatag gacccatact atttaacatt ttcattaacg atttattcaa tataccattg 2160 aaaggaaaat tacagtgtta tgcggacgac gctgttctta agtatagagc aaaagattta 2220 gaaacgttac aaagtatgat gcaacacgat cttcagttac ttcccagctg gttctcggca 2280 aataaaatgc aaattaacaa taaaaaaacg aatttcataa tattttctct aaaaacttca 2340 aatgtatctt tggacttgta cattaataat gagcgtctgt ctcaggtcac tgaggtagac 2400 tatttaggtt tggcaattga cgagagtttg aactggaaaa gtcacataac attagtaaag 2460 aataaaataa ctccttttat atttgctcta agaagggtta gaggttgtct aagcacaaag 2520 agttgctggc aattgtataa tgcattcata ttacctcact tgtcttatat gaatacagtt 2580 tggggatcag ctgcagcaag tcacttaaac gttttgaaag ttctacaaaa cagggttatt 2640 aaaacaattc gaaagcttcc acttctcttt ccaagtgcaa gtttatactc accagaagtt 2700 ttgcctttaa atttgctgca caaatacaat tgtttaattc tggtgtataa aatcaaatat 2760 aaattggtag aatcgaatat tcgtcttatt gaatttcatg atttgcattc tcatagcaca 2820 agagcaagac aaaactttta tgtttcaagg ccacgtacag aattagctct taaacatttt 2880 ttctactccg gaataactca gtttaatgcg cttcctaatc atttgaaatt agaattaaat 2940 ttatcatctt ataaaaagaa attgaaaaag tatctatttg agaataattg agatgtccat 3000 attgaattat tgaaatgtaa tgaattaatg ttagtattaa gtagaaaata agaaacattg 3060 ttagattata agcttgctcc ctcgaaatat tttacttttc gagtggctgg acacagttaa 3120 ttatgtaact tctacagctg cgttgttaaa taaagcaaaa aaaaaaaaa 3169 // ID Tx1-17_BF repbase; DNA; INV; 5006 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-17_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-17_BF; KW Tx1-17_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5006 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5006 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 854-854 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1090..4695 FT /product="Tx1-17_BF_2p" FT /note="endonuclease and RT." FT /translation="MRGVMILFRKNTKFVVHNCIKDNDGRLLILDVSLHDT FT RVCLVNLYAPNSDDPAFFEDIVNKIATLEECSENIIIAGDFNTVQNTMLDR FT AGRNPTEYHPRAQQKILDMVLELDLVDVWRFKHKNTVKFTWRRRHQASRID FT YFLISFSLLTNVVSATIAGRLKSDHNVICISVQTTTFSKGKNYWKFNQTLL FT EDATFVKETEVFIREFFENNNGTSDPLTVWDTFKCCLRGHTIKFASNRYKM FT YSNKEADLTKQISELSKELDETDTPSDSLLEEIKTLQSELESLHERKTEQT FT HYVRKADWMEFSERCNKFFLLSHRNYSRKNVTNITTSDGDNTQDPLQILAN FT LRSFYSTLYSFKEPPKPLDEKNCSAFFQSTDNVNKHLNSEQQNSCEGPITE FT KELWIAINSFKNGKAPGLDGLPIEIYKVFFSSLKQHMLQSFNYAYEKGQLS FT NTQRVGLITLLLKQNSEGQDKDPSKLTNWRPLSLLCCDVRIISKVISLRLK FT NCISDLISEDQTGYLKNRFIGENIRRILELIEYYDTNEKPGLIFIADFEKA FT FDSLRWDYMFKVLNYFGFGESLIKWISVLYNNISSKVINNGHISDAFPLLR FT GVRQGCPLSPYLFILCVELLATKIRTNEHIKGLKIHTLETKVSQFADDSNF FT PLQPEAQSLVALVEDLKNFSIISGLKPNFDKCTILRIGSLKNTTFSLPCDL FT PIKWTNGPVQLLGIMLGLGTTGYTEKLDQNYWTKLAKIDKILLPWRGHSLT FT LFGKIALVNSLIVSQFTYLFMALPSPSAKFFKEYEKKIFNFIWNGRPEKIK FT RKVIYNTYDCGGLKLIHLPTFDLTRKASWIPRLWQAKTPLSKYLTDFPDPV FT CKLYPFLQLSKYHIDCIFTKFSRSVSPFFINALKAWLSFQFKPPETHQDIQ FT SQLLWLNSNITIDSKPIHFRKQATGQILFISDLLDTNGKFLSYKELSLKFG FT KTLQMMEYNQIISAIPSTWKRHLKDKPLTIGPNAPFISNYSWLGARKINKD FT IYMTTLLTNNVVQTPVKIQTYWEEILDIIIPWKQVFRLVYKTTICPRLRFF FT QFKMLHKFLPTRKMLYIWKLSDSPICIHCGNEQEDTLHLFWECPRVSQFWR FT DVKKWHERVSSEPLHIDLASVILGDLTVNSPRLNNTLILMAKFFIYKQQEP FT ERTNWQFTTVNGEKYFTSFLKPLYFLFVFCYSCGFALFNYFH*" XX SQ Sequence 5006 BP; 1663 A; 899 C; 881 G; 1563 T; 0 other; gaagtttgaa aaagaaaaag ttactgcaca gcaaaaacaa gtgcaactgg aagcgtactc 60 taggagggag aaccttcttt tctacaacat attggaagaa aataatgaaa agcctaagga 120 aactgaaagg ctgttaactc aggtattgat acatagcctg aagttacctg aagatatggt 180 gaaaggtatc aaattccaga gagtacaccg tctcggccca gtaaagattg gggagaatgt 240 tcggcccaga cctatcatcg caaagtttgt ttggtacaaa gaaagagaga gagtgaggct 300 agctgccaaa aatctcaaag attcaatgat aggtctatct gaagatttcc ccagacaaat 360 cagagaagtt agaaggaagc ttctccctgt cttaagagcg gcacgtaagg ctcaacctcc 420 caaatatgct attatgaatg tcgatcagct ctatatagat ggggtactat acaagggtcc 480 tgaagcaaaa cgcccttatg tgacttagct tgttgaactg atttagagat ttttgttttt 540 gttttttcaa atgagattga tattcctctg tatgcgacag taccccagag ccttgtttgt 600 atacacttgt gttgttgttt actcaccacg gatccctgta agatatgtac gcttattgat 660 gtcaaactgt agttacacta tctcatgtta gtttttctga gagaattaag agttcggtta 720 ttatatccag acaatattta tgaattgtct gcttgaatat gctatgtact ccagccacaa 780 cagcagcttt ttcccttcat tatatattgc actgcacgct tctgcaatca gtacctttat 840 aactggaact gaaagattca gtttatgttt tcataaacca ttcctttgac taaatcattg 900 actcagatcg gctcttacaa cgccaatggt ttggctaaca acaaaaagcg aagagaagtg 960 tttgaatggt tacaagaaaa atcgctagat gttttatgta ttcaagagac tcactcgaca 1020 gtgaaagatg aaaagacttg ggtgaacgac tgggatggca cgattatttt taaccacggt 1080 actagtaata tgcgtggagt catgatatta tttagaaaaa atacaaagtt tgttgtgcat 1140 aattgtatta aagataatga cggccgactg ttaatactag atgtctctct tcacgataca 1200 cgagtatgct tagttaactt gtacgctcca aatagtgacg accctgcctt ttttgaagac 1260 attgtaaata agatcgcaac tttagaagaa tgcagcgaaa atataattat cgcaggcgat 1320 tttaatacag tccaaaatac tatgcttgat agggcaggac gcaatccaac agaatatcat 1380 ccgcgtgctc agcaaaagat cttagacatg gtattagagc tagatttagt tgacgtttgg 1440 cgtttcaaac ataaaaatac cgtaaaattt acctggcgta ggcgccacca agcgagccgt 1500 attgattatt ttttgatatc cttttctctt ctcaccaatg ttgtcagtgc aacaattgca 1560 ggacgattga aatctgacca taatgtcatc tgcatttctg ttcaaacaac aacattctct 1620 aaaggtaaaa attattggaa gtttaatcaa actttattag aagatgctac atttgtaaaa 1680 gaaaccgagg tatttattag agaatttttt gagaacaata atgggacaag tgacccgctc 1740 acagtgtggg acacgtttaa gtgttgctta aggggacata ctatcaaatt tgcatcaaat 1800 agatataaaa tgtattcaaa taaagaagca gatttaacta agcagattag tgagttatct 1860 aaggaattgg atgagacaga caccccatcg gattccttac tagaagaaat aaaaacactt 1920 caaagtgaac ttgaaagtct acacgaacga aaaactgaac aaacgcacta tgttcgaaaa 1980 gctgactgga tggaattttc ggaaagatgt aataaattct ttttattgag tcacagaaac 2040 tattcaagga aaaatgtcac aaatataaca acttctgatg gagacaatac ccaagatcct 2100 ttacaaatat tggctaattt aaggtctttt tactcaaccc tttattcatt taaagaaccc 2160 ccaaagcctc tagatgagaa aaattgctct gccttttttc aaagtacgga taacgtcaat 2220 aaacatttaa actcagagca acaaaactcc tgtgaaggtc ccattactga aaaagaattg 2280 tggatagcaa ttaatagctt taagaatgga aaagctccag gactagatgg gctgccaatt 2340 gaaatatata aggtattctt ctctagctta aagcaacata tgctccaatc ctttaactat 2400 gcatatgaaa aaggacagtt gtctaacaca caaagagtag gactaatcac tcttttacta 2460 aagcagaact cagagggcca agacaaggac ccgtccaaac ttacaaactg gaggcctctt 2520 tcacttttgt gttgcgatgt gcggatcatc tctaaagtaa tttcgcttag attgaaaaat 2580 tgtattagtg atttgataag tgaagatcaa actggttatt tgaaaaacag attcattgga 2640 gagaatatac gaagaatttt agaattaatt gaatattatg acactaacga aaaaccagga 2700 ctgatcttta ttgccgattt cgagaaggca tttgatagcc taagatggga ttatatgttt 2760 aaagttttga actattttgg ctttggggaa tctcttatta aatggatttc ggttttatac 2820 aacaatattt ctagtaaagt aataaacaat ggacatattt ctgacgcctt tcctcttctc 2880 cgtggtgttc gacaggggtg tcccctctcc ccctacttgt ttattttatg tgtagaatta 2940 ttagccacaa aaatcaggac caacgaacac ataaaaggac ttaagataca cactctggaa 3000 accaaggtct cacagtttgc tgatgattct aattttcctc tgcaacccga agcacaatct 3060 ctagtagccc tcgttgaaga tttaaagaat ttttcaataa tttctggctt aaaaccgaac 3120 tttgacaaat gtacgatttt aagaataggc tctttaaaaa acacaacttt ttctctccct 3180 tgtgatcttc ctatcaaatg gactaatggc cctgtacaac ttctagggat tatgctgggt 3240 ttaggcacaa ctggttatac cgaaaaatta gatcagaatt actggacaaa actagctaag 3300 atagacaaga tcttgctccc ctggagagga cattccttaa cactttttgg taaaattgcc 3360 ttggtgaact ccctaatagt atcacaattt acatatctat tcatggcatt accttctcct 3420 agcgccaagt tctttaagga gtatgagaaa aagatattca atttcatttg gaatgggcgc 3480 ccagaaaaga ttaaacgcaa ggttatatat aacacctatg actgtggtgg tctgaaatta 3540 atacacttac ccacttttga tctcactcgt aaggcatcgt ggatacctag actatggcaa 3600 gcaaaaacac ctttaagtaa atacttgact gatttcccag atcctgtatg taaactttac 3660 cctttcttgc aattatctaa atatcatata gactgtattt ttactaagtt ttctcggtct 3720 gttagtcctt tcttcatcaa tgcccttaaa gcatggcttt ccttccaatt taagccacct 3780 gaaacacacc aggacataca atcacagctt ttatggctca actcaaatat tactatagac 3840 agcaagccta ttcactttag aaagcaagct actggtcaaa tattatttat aagtgacctt 3900 ttagatacaa atggaaaatt tttgtcatat aaggaattat cgctgaaatt tgggaaaact 3960 cttcaaatga tggaatataa tcaaataatc tccgctatcc caagcacctg gaaaagacat 4020 cttaaggata aaccacttac tataggacct aatgcaccct tcatatcaaa ctattcctgg 4080 ttgggggcac gtaagataaa taaagatata tacatgacaa ctcttttaac aaataacgta 4140 gtccaaaccc cagttaaaat ccaaacatat tgggaagaaa tacttgacat aataatacca 4200 tggaaacaag tttttagact ggtatacaag actacaatat gtccgagact aagatttttc 4260 caatttaaga tgttacataa atttctacct accagaaaaa tgttatatat ttggaaactc 4320 tcggacagcc ctatttgcat acactgtgga aatgagcaag aagatacact gcatctattt 4380 tgggaatgtc caagagtttc ccaattttgg agagatgtta agaaatggca cgaacgagta 4440 tcgagtgaac cattacacat agatttagca agtgtaattt tgggtgatct aacagtaaat 4500 tcccctaggc tgaacaatac actaatatta atggcaaaat tttttattta caaacagcaa 4560 gaaccagaaa gaacaaattg gcagttcact acagtaaatg gggagaaata cttcacatca 4620 tttcttaaac ccctctattt tctttttgtc ttttgttact cttgtggttt tgcattattc 4680 aattattttc actagtattt gaagcgaaat gttgtattga tggtttctta tgtgtctgta 4740 ctttatttga gattattctt gtttacaaat ttaccattaa tttacactga tgttactttt 4800 cttatgtaaa cgatactacc ttatactgct gtattagttt atttggttgg tattattttc 4860 ttcccgctgt agtattgtta tctatttttg gatccgtggt ttatgtttat gttttgtctc 4920 agttattgtc tccatctcta ttgtcattat gtattcttct gttatttctg tttatttgaa 4980 atgcaataaa tattaaaaaa aaaaaa 5006 // ID BEL-212_AA-LTR repbase; DNA; INV; 809 BP. XX AC AAGE02025927; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-212_AA_; KW BEL-212_AA-I; BEL-212_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-809 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025927; Positions 27328 28136. XX SQ Sequence 809 BP; 273 A; 141 C; 152 G; 243 T; 0 other; tgttggcgaa cctacgccag cgcgcctcta tgatttaatt ggctctatac gaccgacggt 60 tgtacccccc ggcattaggt acgaagagaa gccaaatcgg cttcgatgga tgagatgaca 120 gcttcacaca caattatagc ggtgagtctg tatcgttcat tataaagatg ataacgttga 180 taagagtgaa atagtaaaca tcgtgcgata aaattgaact tacaaatttg cctattagtg 240 cagtaaatgt atagtaggtt tagctcaata agtgaattta tctgtattta taaggtagta 300 taacgttgaa ttgtgttatc gatattcgct aatcttttaa tgaccaatca ggaatctatt 360 ggccaaattc gttacacacc taacctaaag tgatgaattg aatctacgaa ttagtttggt 420 aaaataacat agaatcatga aatcatgcaa atacttaaaa actaccttaa aattatagta 480 cggattaaaa gtacggctgg aacaactgtt cccgagtcat tcggtgtttc gaggacaggc 540 agttaaacat tgacctaaca ttagactggt tcatgatcaa gacatagact ggactagaca 600 caggaacaag actaaacgta agttgaacac ataagacttt atgattgcct ataaatttaa 660 aagaattatt cagttttgct gtgaaattat aacattactc atgcctttca ggaaaataat 720 aatcccccta tcagtaaatc ccagaaacgg tcgtttaatt tcggggttat tatttcggaa 780 atatcttctt ctgttggcga tttccaaca 809 // ID PPSAT2 repbase; DNA; INV; 86 BP. XX AC K02943; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.pollicaris satellite, consensus sequence. XX KW SAT; Satellite; Simple Repeat; PPSAT2; Repetitive sequence. XX OS Pagurus pollicaris OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Anomura; OC Paguroidea; Paguridae; Pagurus. XX RN [1] RP 1-86 RA Fowler F.R. and Skinner M.D.; RT "Cryptic satellites rich in inverted repeats comprise 30222034f RT the genome of a hermit crab."; RL J. Biol. Chem 260, 1296-1303 (1985). XX DR GenBank; K02943; Positions 1 86. XX SQ Sequence 86 BP; 15 A; 18 C; 27 G; 26 T; 0 other; caggtccgga cctgcaaaaa aagttgagtt ttgggccccg aaattttttg ggctgttttt 60 ggcggttttc ggcaggtccg gacctg 86 // ID DNA8-79_AP repbase; DNA; INV; 833 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 09-NOV-2010 (Rel. 15.12, Last updated, Version 4) XX DE A family of DNA transposons - a consensus sequence. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW DNA8-79_AP. XX NM DNA8-79_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-833 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2015-2015 (2009). XX RN [2] RP 1-833 RA Kojima K.K. and Jurka J.; RT "Classification as hAT."; RL Direct Submission to Repbase Update (09-NOV-2010). XX RN [3] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [3] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. CC [2] Weak similarity to hAT-9_AP in both TIRs and a protein CC remnant indicates it is a non-autonomous hAT. Similar TIRs are CC seen in DNA8-43_AP, DNA8-45_AP, and DNA8-73_AP. XX SQ Sequence 833 BP; 284 A; 120 C; 123 G; 305 T; 1 other; tagggctcgg atttatatgt aaatacatat ttttactgtg acttgataaa ttaatttagg 60 taagtgataa attcgtttca agggatttcc attgtaatga actatatttt attttacata 120 tttttacata tttctcaata attccatttt tccctacata tttcgacaat ttacgattta 180 ttaatagttg tatacgatgt atacgatttc ccatgtttca aaaagtacgt aaacaaaata 240 ataccaatta ttgaactaat cgaccgtaat ctacggtcga tataatatta tgttcgacgt 300 ttgtgacatc gataaatgat agataacgat ttattaatgc atagaacaag tacaaggacc 360 cttgggagtt gcagtaaaax aaaaaatgca cagcgtatta caaaaaaatc ctggtttaga 420 ctgcctaaaa ttaattagag atacacattg cgaaattaat ggggtaatgt ttccggctga 480 actcactgct ttggacattg caaatatgaa atttgcaccc attacctcgg tggaagtcga 540 gcgctccttc agtcggtaca aatctgtgtt acgaccaaac cgtagatcat ttaacttcga 600 caatttgagc atgtatatgg tatctcattg ctttcaagat caagatcaag atgaataatg 660 gtagtaaaat aagtaaatta gataataatg tcatttataa agcaatgttt tatatttttt 720 tgtaattttt ttgtattcca tattttcatt ttttaatcac atataaagtg atttattatt 780 acatatatat ttacatattt tggttttttt aatacatata aatccgagcc cta 833 // ID BEL2-LTR_AP repbase; DNA; INV; 545 BP. XX AC Contig56303; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2AP; BEL2-I_AP; KW BEL2-LTR_AP. XX NM BEL2-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-545 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 432-432 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 545 BP; 136 A; 134 C; 100 G; 175 T; 0 other; tgttccgtac agttttgcga caatgcatta acgcatagtc gcttgtgact atctactcaa 60 tataattatc aaccggtccg ccgatagcat cggctcgcat atctaatata ccaactagtc 120 agtcttggcc cgcaggcgta cgcagtggtc gtctccgttg gtttgcgccg tacgtgcctg 180 tgaccaaata acgacccgcg cttccatacg catatgggtg agctaatacc cgcgcttgca 240 tacgcacatg ggtgattaat attcgcgcct tcatacgcat acatgtcgtg ttgtatctta 300 gtgtcgttat aaagagattg gacacacctt ggagcacaca gttatcagcg gtcaatccat 360 cttcaccatc ttcctcccac tacgtttcat cacttcatac tcatcaatac caggttccgt 420 ccatttgtgg ttgtgtgagt cggctatacg ccacattatt gtattgtcct ttgagactat 480 ttattattat tgttataata ctaacaccat aaattaaact atatatatat tatttattgt 540 attca 545 // ID Gypsy-22_IS-LTR repbase; DNA; INV; 257 BP. XX AC ABJB010933243; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_IS_; KW Gypsy-22_IS-I; Gypsy-22_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-257 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010933243; Positions 2617 2873. XX SQ Sequence 257 BP; 49 A; 84 C; 49 G; 75 T; 0 other; tgttgtattg tcactagctt tttgtttctg cattgtcact cccgcgaatg ctggcttcgt 60 ttcttccgtt agccgacacc agagatagtc ggtgccccgt cgttcctcct ccccctttcc 120 ctcaccctat cagtatcctc ctcccgctca gaaaactata tattgcggaa cacaaccgaa 180 taaagacgtt gatacgctac caccaactgt tcctcggttt tcttcacgcc gcacgcggtg 240 gcttatgcga cgcagca 257 // ID DNA2-3_AP repbase; DNA; INV; 632 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-3_AP. XX NM DNA2-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-632 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1736-1736 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 2 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 632 BP; 201 A; 86 C; 89 G; 256 T; 0 other; gagcataaaa cgattgagca tagacctttt tagcaacacg attgagcata gaatatttta 60 tgcgatgttg ccaaattcac gttgccaaaa ttttgtaatt tattattttc acgtattatt 120 acaccaatta ttatgatatc tgacgcgagc cgtgagccga gcccctcagc tttatcgaca 180 atgttatcga ttaataaatt agttccggag tactattccg gactactatt gagactatta 240 ttgacttata aatattttga ttttattatt ttcataagtc ataactatta ttattattat 300 tattattatt tcattattag caaaaagatc tatatactgt attataatgt gactactatt 360 gagactatta ttgatttata aatattttga ataatgtgac taggtatatt acatattgac 420 tatgaaaaca tgatattttt ttgccatgtt atattttgaa tcattataat tatttaaata 480 aatatttttg tataatacag ttcaaaattg aaaatggcaa cgttgcagag ttttttatgc 540 tcaatcgtgt tggttttctt ttgtatttca atttcctttg atacgccgac aatttatgct 600 caatcgtgtc gcaaaaaagg tgttttatgc tc 632 // ID Gypsy-82_AA-I repbase; DNA; INV; 4458 BP. XX AC supercont1.148; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-82_AA_; KW Gypsy-82_AA-LTR; Gypsy-82_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4458 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.148; Positions 1978855 1983312. XX CC Positions [3374-3892] - Integrase core CC 'GTTGC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 158..4426 FT /product="Gypsy-82_AA-I_1p" FT /translation="MAEEIAQLLKQQNQIFAYWAERMGNFQITIPNPASPA FT HMTSQSVPLPPPLCLEGDMEENFTFFESNWKNYAVAVGMNDWPETENKKKV FT SFLLSVVGTDALKRYFNFELSEQQKSSPEAVLEAIRNKVVPVRNIIVDRLE FT FFSAIQIPTETIDEYVSRLKVLGKPCRFGPLEQDMMVFKIVTSNKWSSLKA FT KMLTMPDVNLTKAIDLCRMEEITARHSKQLSLEPPVEVNKVTDQVCKFCGG FT RHPFQRGSCPAFGKRCDKCGGRNHFQKDCKSSGSFGRPKQGRRVNAVTTSE FT ARSIRAADDDEEREIWIQEEEETVIGKIMDNSATGGAVLAELSLKFGNVWK FT NVCCELDTGARASIIGIDWLKKLSGEVDPELKASKCRLRAFNGTSIEVLGK FT IEVQCQHKGRRYTMVFQVVNINHPPLLSVAVCTTLGLVNFCNTVRRDSAAY FT QVLDKYQEEARNIVRNYQDVFEGYGKMEGTVRLEIDKTVKPVIQTPRRIPI FT GFREDLRRKLDQLVQDGIIQKESQHTDWVSNLVIVKKNTPGSDSLRLCLDP FT IPLNKALKRPNLQFTTLDEILPELGKAMIFSTVDTRKGFWQVVLDEASSKL FT TTFWTPFGRYRWLRMPFGISSAPEIFQAKLQEALQGLEGIECLADDILIIG FT TGDNFEEALACHNQRLENLLIRLRSNGVKLNITKLNLCQTSVNFFGHVLTS FT EGLKPDMRKIEAIQNYPAPTSPKEVHRFVGMANYLARYIPNLSSKLVSLRK FT LIPENATWRWTSEEKTEFDNVRQSISEHTALQYYNRTEPLLIECDASCQGL FT GVAVYQQKGVVGYASRTLTKTEKGYAQIEKELLAIVFACTRFDQLIVGNPK FT VIIKTDHKPLLNIFEKPLLTAPKRLQHMLLALQRYNLVLQFVKGKDNVVAD FT AISRAPSDSDSNGIMAKQHVYKILAGVEEVQLQNCLNISSERISEIIRHTA FT NDEGLQCVTEYIRRGWPKSIDGVPCHAKMYFKYRDELATQDGLVFRNEKIV FT IPPILWRSMIGRVHIAHSGVEGTLKLARANIFWPGMSNHIKHTISLCATCA FT KFSASQCKPPMQTHPVPVHPFQFISMDVLTVPYKGKARYFLVTVDHFSDFF FT ELDLLPDLTMHTMVDVCKKNFSRHGKPQRILTDNGSNFVNSEMARFTKEWD FT IEHVTSSPYHQQANGKAESSVKIAKKLIKKAEDSKQDLWLMLLNWRNTPNK FT LGSSPVCRLFSRSTRCEVPMSATNLMPRIVQNVPEAIQDNKRKVKYQYDKS FT TRRLPSLEVGDPVFVQLNPDVSKTWVPAEIKNKLSERSYLVERNGTTYRRD FT VVHVKPRNVGGSSFPIAESNPHEEPTPNVSLQLSSSLPSAETPGDAWRNTA FT LLETQSVDQSSKQAPAIAVHSPNRKHLGTPVTKMTPQPERMNKPKREVKMP FT TKFKDYVMS" XX SQ Sequence 4458 BP; 1340 A; 930 C; 1076 G; 1112 T; 0 other; tggtgtcaga agtggagaaa tctgcctaat aaaagttaat tgagcagatt tgcgatcaat 60 ataatcgata aatcgtcggt tgttcgagtt tttcgtagtt gcggcgcgaa atacgtatat 120 aaatatcgcg gtcgaaagca ttgcaagcct catcaaaatg gcggaagaaa ttgcgcagct 180 gctaaagcag caaaatcaga ttttcgccta ttgggcggaa cgaatgggaa atttccagat 240 aaccatcccg aatccagcgt caccagcaca tatgacttcc caatcagtgc cgttaccacc 300 accactttgc ttggaaggag atatggagga gaactttact ttcttcgaat ccaattggaa 360 aaattatgca gtcgctgtcg gcatgaacga ctggcctgaa accgaaaata agaagaaagt 420 cagttttttg ttgtcagtcg ttggaacgga tgcactgaag cgttacttca atttcgagtt 480 aagtgagcag cagaaatcgt ctccggaagc tgtattagaa gcaatcagga acaaggtagt 540 gccagttcga aatattattg ttgaccggct ggagtttttc tctgcgatcc agattccgac 600 agaaacaatc gacgaatacg tgagtcgttt gaaggtgctg ggaaaaccgt gtcgatttgg 660 tccattggag caggatatga tggtgtttaa aatagtgacg tcgaacaagt ggtctagttt 720 gaaagcgaaa atgttgacca tgccagatgt gaacctgacg aaagccatcg acctttgcag 780 gatggaagaa attacagcac gacattcaaa gcagctgtct ttggaaccgc cggtggaagt 840 caacaaggtt accgatcaag tttgcaagtt ttgtggtggc agacatcctt tccaacgagg 900 atcgtgcccg gcctttggaa aacgttgtga taagtgtggt ggcaggaatc atttccaaaa 960 ggactgtaaa tcgagtggat catttggaag gccgaagcaa ggtagaagag tgaatgcggt 1020 aacaacatcg gaagcgaggt ctatccgggc tgctgatgac gatgaagaac gagagatctg 1080 gatccaagaa gaagaagaaa ctgtgattgg gaaaattatg gataattctg ccacaggagg 1140 tgcggttttg gctgaactct cactgaagtt tggaaacgtg tggaaaaatg tttgctgtga 1200 gctggatact ggtgcccgcg ctagtataat cggaattgac tggctgaaaa aattgtcagg 1260 agaagttgat ccagagttga aagcttccaa gtgccgtttg cgagccttca acggcacctc 1320 gatcgaagtc ctgggaaaaa ttgaagtgca gtgtcagcat aagggccggc gatacacgat 1380 ggtattccag gtggtgaaca taaatcatcc accgctgcta tcagtggcag tttgtaccac 1440 ccttggcctg gtcaattttt gcaacaccgt tcgtcgtgat tcagctgcgt atcaagtgct 1500 agacaagtac caagaagaag ccagaaacat cgtgcgaaac tatcaggatg tatttgaagg 1560 atatgggaaa atggaaggca ccgtcaggtt agaaattgac aaaaccgtga agccagtgat 1620 tcagacaccc cgtcgcattc caattggctt cagagaggat ctccgacgaa agttagatca 1680 gctggttcag gacggaatca ttcagaagga gtcccaacac accgactggg tgagtaactt 1740 agttatagtg aagaaaaata cccctgggtc ggactctctt cgcttgtgtc tggatcccat 1800 tcctctaaat aaagccctaa agcgtccaaa ccttcagttt acaacattgg atgaaatatt 1860 accagaactc ggaaaagcaa tgatattttc aacggttgac accagaaagg gcttctggca 1920 ggtagtatta gatgaagcca gtagtaagct tactactttt tggaccccat tcgggcgcta 1980 ccggtggctc agaatgcctt ttggtatctc ctcagcgccg gaaatattcc aggcaaaatt 2040 gcaagaagcg ctacagggac ttgaaggtat tgaatgtttg gcggacgata ttttaataat 2100 cggaacaggg gataactttg aagaagcctt ggcatgtcat aaccaaagat tggaaaatct 2160 gttgatacgt cttcgaagta atggtgtgaa actcaacatt acaaaactta atttgtgcca 2220 aacgtccgtc aacttctttg gtcatgtcct tacctccgaa ggactgaaac ctgatatgcg 2280 gaaaattgaa gccatacaaa actaccctgc gccaacgtca cctaaagaag ttcacagatt 2340 tgtgggcatg gctaattacc ttgcgaggta tattcctaat ttgagcagta aacttgtttc 2400 tctgaggaag ctcattcctg aaaatgcaac ttggcgctgg acttctgagg agaagaccga 2460 gttcgataac gtaagacagt ctatttccga gcacactgca ttgcaatatt ataaccgtac 2520 cgaaccgttg ttaatagaat gcgatgccag ctgccaaggg ttgggtgtag ccgtatacca 2580 gcaaaagggc gtcgtcggat atgcttctcg aacgctcacc aaaactgaaa agggttacgc 2640 tcaaatagag aaagaattgc tggcgatagt atttgcctgt acacgctttg atcagttgat 2700 cgttggtaac cccaaggtca ttatcaaaac tgatcacaaa ccgctattga acatttttga 2760 aaaacctctg ctcaccgcac caaagcgctt gcaacacatg ctgctggcct tgcagcggta 2820 taatttggtt ctgcaatttg tcaaaggcaa ggacaatgta gtggccgatg ccatatcaag 2880 agcacctagc gactctgaca gcaatggtat catggcaaag cagcatgtgt acaaaattct 2940 tgctggggtt gaagaagtac aactgcaaaa ttgccttaac atctccagcg aacgaatttc 3000 tgaaattatc cgccatactg ctaacgatga aggacttcag tgtgtgaccg agtatattcg 3060 cagaggatgg ccgaaatcca ttgatggggt gccttgtcat gctaaaatgt atttcaagta 3120 tagagatgag ttggcgaccc aagacgggtt ggtattccga aatgaaaaga tagttattcc 3180 tccaatattg tggcgttcca tgattggccg agttcacatc gctcattctg gtgtagaagg 3240 aacgctcaag ttggcacgag caaacatatt ttggcctgga atgtcgaatc acatcaagca 3300 caccatctcg ctttgtgcca cttgtgccaa attttctgca tcccaatgca agccacctat 3360 gcaaacgcac cctgtccctg tgcatccatt ccagtttata tcaatggatg tattgacggt 3420 tccatacaag ggcaaggcaa gatattttct tgtaacggta gatcactttt ccgatttttt 3480 tgaactggat ttgctaccag atcttacgat gcacacaatg gtggatgttt gcaagaagaa 3540 tttttcgcgg catggaaaac cacaacggat tttgaccgat aatgggagca atttcgtcaa 3600 cagtgagatg gcacgattta ccaaagaatg ggacattgag catgtgacat cctcgccgta 3660 ccaccaacaa gcgaacggga aggccgaatc ttctgtaaag atagcaaaaa agcttattaa 3720 gaaggctgaa gattcgaagc aggacttatg gctgatgctg ctgaactgga gaaacacacc 3780 gaacaagctt ggatccagcc cggtatgcag gttattctct cggagcacga ggtgtgaggt 3840 tcccatgtct gccacgaacc ttatgccaag aatcgttcag aatgtaccag aagctatcca 3900 ggataataaa aggaaggtga agtaccagta cgacaagtca actcgccgac tcccttctct 3960 tgaggttggt gatcctgtat tcgttcaact aaatccggat gtctctaaaa cttgggtgcc 4020 agctgaaatt aagaataagc tcagtgaacg ctcgtatttg gtagaaagaa acgggacgac 4080 gtataggagg gatgtggtac acgtaaaacc gaggaatgtt gggggttcta gttttcctat 4140 cgctgaatcg aatccgcatg aagaaccaac accaaatgtt tctctgcaat tatcatcttc 4200 attaccctcc gcagaaacac ctggcgatgc ttggaggaac acagccttgc tagagacaca 4260 atcagtcgac caaagttcca agcaagcacc agcaattgcc gttcattcac cgaaccgaaa 4320 acatttaggg acacctgtca cgaaaatgac accacaaccc gagaggatga acaaaccaaa 4380 gagagaagtt aaaatgccta ctaagttcaa agattacgtt atgagttaat tacttttctt 4440 tgtttgaaaa agggaaga 4458 // ID Crack-1_IS repbase; DNA; INV; 4398 BP. XX AC . XX DT 22-JUL-2009 (Rel. 14.07, Created) DT 20-FEB-2011 (Rel. 16.02, Last updated, Version 2) XX DE Crack-1_IS is a non-LTR retrotransposon - a consensus sequence. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; reverse transcriptase; endonuclease; L1 ORF1; KW Crack-1_IS. XX NM Crack-1_IS. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4398 RA Kapitonov V.V. and Jurka J.; RT "Crack-1_IS, a family of Crack non-LTR retrotransposons from the RT deer tick genome."; RL Repbase Reports 9(7), 1344-1344 (2009). XX DR [1] (Consensus) XX CC Crack-1_IS is a non-LTR retrotransposon that belongs to the Crack CC clade, a sister clade of L2 and Daphne. Like in other Crack CC retrotransposons, the ORF1 protein in Crack-1_IS contains a CC domain similar to the central domain conserved in vertebrate L1 CC ORF1 protein The ORF1 protein contains also in its N-terminal CC part the PHD finger (C4HC3), which is probably typical to Cracks CC in arthropoda. XX FH Key Location/Qualifiers FT CDS 388..1296 FT /product="Crack-1_IS_1p" FT /note="ORF1." FT /translation="MSDEDDDVCPKCHEVLSDDKPNITCPECQFTYHFGTC FT CGIAESTFKARNAKSQKAWRCETCSVAKRSQTGKAKKDAEIDVNSLLTSMN FT LKLNSLLSMKETVSAVETSVQNMSDQYDELLKSVGQHDKDIKSLTKRVEKI FT ENAGTEGQLTELKRELNRLEWHNRKQNLEIHGVPQTANENLLSLVNEVAAK FT LDVPTLVPCEIVSMHRLPSKPEKIPGIILHFTQQQTRDRWLDKRRNLKRPN FT DNEYILENMTRQDKALLWSTKQWAREHNFQYVWHRNGNIFVRRKTGDPAKT FT VKSEHDLPNMS" FT CDS 1375..4197 FT /product="Crack-1_IS_2p" FT /note="ORF2: AP endonuclease and RT domains." FT /translation="MYCLLSLKLLSEKSFPYQLTVYILIFDQRKINVLILS FT VFLLSFPISRLLCFLKRGTRTIPMFSDCLCINVFFINRSETRGGGVSILIK FT ESVDCELVPSFTVITKDYEALSLRTKNDFVSVIYRPPGGALSPFLNFLSSL FT FDFANENKLNIICGGDFNVNMLSNTVSKRELENLILINGCRTVINSPTRVN FT VTSKTLIDLFITNYEESRIRSGIITYDLSDHLPIFLCVKMNNAKKKDPTPS FT AQVQCVTPDGLERFAECVANANWDDIFSKNDPDEAYNFFLERFKSLYQSNF FT PYKLIKVKKRIRKPWITVDLLENIKVKNVLYRKFTKTRDPNDLRTFKSYRN FT QLTNALRKARERYHYDSFRNSVGRSDLMWKNLNSVIEGKTSHCVECIISAG FT KKLTGVSLANTFNDRFVDLPQINCRDAACDNITFKTRDSIFLLPVTDTEVS FT TVFMELKNSHSCDADGMQIRPVKYVLPLITPFLTYIFNLCFSTGVFPQLMQ FT KARVSVLYKKGDKNDINNYRPISILPVFSKGLEKLIHIRLSNFFNKYRVIT FT NDQFGFMKHRSTDLALLEQKEFILESFEANQIALAVFIDFTKAFDYLNHNL FT LLKKLYLYGIRGQALTLLQSYLSNRKQFVYLNGFSSNTKSVFSGVPQGSIL FT GPLLFNIYINDIVNIDSLAKFVIYADDTTLLFSAKTSDELINCANDTLTGL FT QNWTSDNGLKINSSKTKAIFFRTKNKSVFITKSLVINSNVIEIVENFKSLG FT VIFNQYLSWDDHVSYILPRLSRVIGLVSRNRFILPNNVKLLIYQTLFASHL FT NYCHLVWGTTTSSNLQKIHLLQKKFVRLIANVSYISSSKGLFEKYNLLRVF FT DMYTYKLMVRFKVEKKRNVDILAKLSKLREHVTSYPTRSAARWAVPLCHNN FT YSMQSLKYMLPTLLNFLLNKNVDIMSVTMQDLFSYLN" XX SQ Sequence 4398 BP; 1328 A; 854 C; 861 G; 1355 T; 0 other; gactcggcag cggcagcggc aaacagtaca gtgtgtttga tcacttgcct gtgacttctg 60 cagatctgac gccgcaaatt gccccgacca ccggcgaacg ctatctacac caagtggaag 120 tccgaatctg gaaccgtcat ccttacgtgg tcaccacgtg ccggcgtcgg agcgacacct 180 acctcgacga ctcgactaca gaaaccgcca gcaagatctg ctttcttcac tcggcagcgg 240 cagcggcaaa cagtacagtg tgattgatca cttgcctgtg acttcctgca ggttagtgcc 300 cttgttattg aaagcctttc taataactcc cgtgctgctg ctgccgctgc tgcctttgcc 360 cttttcctgt attgtgttca cgtcacgatg agtgatgaag atgacgatgt ttgtccaaag 420 tgccatgaag tgttatcaga tgacaaacct aatataacgt gtccagaatg tcaattcaca 480 tatcactttg gaacttgctg tggcatcgcg gaatcaacct ttaaagcgag aaatgcgaag 540 tctcagaaag cctggcggtg cgagacctgt agtgtcgcga aaagaagtca aaccgggaag 600 gctaagaaag atgccgaaat cgatgtgaac agcttgctta caagcatgaa cctgaaactg 660 aacagtttgt tgagtatgaa ggaaaccgta tccgccgttg aaacttctgt tcaaaacatg 720 tcagaccagt atgatgaatt actgaagagt gttggccagc atgacaaaga cataaaaagt 780 ttaacaaaac gtgtggaaaa aatcgagaac gccgggacgg aaggacaact tacagagttg 840 aaacgggagc tgaatcgact ggaatggcac aatagaaaac aaaacctgga aatccacggg 900 gtcccacaaa ccgcaaatga aaacttgctg tccctggtta atgaggtagc tgctaagtta 960 gacgttccga ctctcgtccc atgcgaaatt gtttcaatgc acaggctccc ttctaaacca 1020 gaaaagatcc ctggtataat tctgcacttc acgcaacaac agactcgcga tcgctggctg 1080 gacaaaagac gcaaccttaa acgccccaat gataacgagt atattctgga aaatatgact 1140 cgtcaagata aagcgctctt gtggtctacg aaacaatggg caagagagca caattttcag 1200 tatgtttggc accgaaatgg taacatcttt gtcaggagga aaaccggtga ccctgccaaa 1260 acagtaaaga gtgaacacga tttgccaaac atgagctaag caactggtgt gtgcaccctt 1320 tctctgcatt tttttttctg tttatacttt cttttgaaat ggcagtctca gaatatgtat 1380 tgccttctga gtttgaagct tttatcggag aagagttttc cttatcaact cactgtttac 1440 atattaatat tcgatcagcg aaaaataaac gtgttgatct tgagtgtttt tttgctgagc 1500 tttccgattt cgagattatt atgctttctg aaacgtggta cacggaccat accgatgttt 1560 tcagattgcc tttgtatcaa tgtttttttt atcaatcgca gcgaaacccg agggggcggt 1620 gtgtcaattt tgattaaaga aagcgttgat tgcgagttag tgccttcgtt taccgttatc 1680 actaaagatt acgaagcttt atcattgcga acaaaaaacg actttgtttc ggtcatttac 1740 cgcccccctg gaggcgcttt gtcccctttc ttgaattttc tgagctccct ttttgatttc 1800 gcaaatgaaa acaagctgaa cattatatgc ggcggcgatt tcaatgtgaa tatgttaagt 1860 aatactgttt caaaaaggga gttggaaaat ttaatactta taaatggttg tcgtactgta 1920 ataaactctc cgaccagagt taatgttacc tcaaaaacct tgattgattt atttataaca 1980 aattacgagg aaagtcgtat aagatctggc ataattactt atgacctcag tgatcatctt 2040 cctatttttt tgtgtgtcaa aatgaataac gcaaaaaaga aagatccaac accttctgcg 2100 caagtccagt gtgttacgcc ggatggcttg gagaggttcg cggaatgtgt ggcaaatgca 2160 aattgggatg atatcttttc aaagaacgat cccgatgagg cgtacaattt ttttcttgag 2220 cgttttaaaa gtctttatca gagcaatttt ccttacaaac taattaaagt gaagaaacgc 2280 attcggaaac catggataac tgtagatcta cttgaaaata taaaggttaa aaatgtgcta 2340 tacagaaagt tcacaaaaac tagagatccc aatgaccttc ggacttttaa atcgtaccgt 2400 aatcaactga caaacgccct gagaaaagca agggagcgct atcactatga ttcctttaga 2460 aactctgtag gccgttctga ccttatgtgg aagaacttaa attcagttat agaaggcaaa 2520 acgtcccatt gtgtagaatg cattatttct gcagggaaga aattaactgg cgtgtcgttg 2580 gcaaacacat ttaatgacag gtttgttgat ttgcctcaaa ttaactgtag agatgctgca 2640 tgtgataata taacctttaa aactagggat tctatttttc tattgccagt tactgataca 2700 gaggtgtcta ctgtctttat ggaattaaaa aatagccata gctgtgatgc cgatggtatg 2760 cagattcggc cggtaaaata tgttttacca ctgatcactc ccttccttac ttacattttt 2820 aatttatgtt tttcaactgg ggtgtttccg cagcttatgc aaaaagctag agtgtctgtg 2880 ctttataaaa agggagacaa gaacgacata aataattata gacctatctc tatcttaccg 2940 gttttttcta agggcttaga aaaacttatt cacattcgtc tgtcaaattt ttttaataag 3000 tatcgcgtaa ttactaatga tcaatttggg ttcatgaagc atcgttcgac agatctagca 3060 ttactagaac aaaaagaatt tattttagag agttttgagg caaaccagat agctttagcc 3120 gtatttattg attttacgaa agcattcgat tatctaaacc ataatttatt gctaaaaaaa 3180 ctttatcttt atggaattcg gggccaagca cttacgttgc ttcagtcata tttgagtaat 3240 cgtaagcagt ttgtctattt aaatggcttt tcatctaata caaagtctgt tttctccggt 3300 gtaccacagg ggagtatctt gggacctttg ctatttaaca tttatattaa tgatattgtt 3360 aacatagatt ctcttgctaa atttgttata tatgcagacg atacaaccct tttattttcg 3420 gctaagacta gtgatgaact tatcaactgc gcaaatgata ctttaacagg gttgcaaaat 3480 tggacatccg acaacgggct taaaattaat agtagtaaaa ccaaagcaat tttctttcga 3540 accaaaaata aaagcgtttt tataacgaaa agtctcgtaa taaattcaaa tgttatagag 3600 atagtggaaa atttcaaaag tcttggcgtt atttttaacc agtatttgtc ttgggacgac 3660 catgtaagct atatacttcc tcgattgtcc agggtcatcg gattagtctc acgaaaccgt 3720 tttattttac caaacaacgt taaactactt atttatcaaa ctctatttgc atcccactta 3780 aactactgcc acctggtatg gggtacaacc accagttcaa atctacaaaa aatacatcta 3840 ttacaaaaaa aatttgttcg cctcattgct aatgtttctt atatttcaag ctccaagggc 3900 ttgtttgaaa aatataattt gttgcgtgtt ttcgatatgt atacatataa acttatggtc 3960 cgttttaagg tagaaaaaaa aagaaacgtg gatattttgg ccaaactttc taaactaagg 4020 gaacatgtta cctcctatcc aactcgttcg gcggcacgtt gggctgtgcc tctttgccat 4080 aacaactatt ccatgcagtc tctaaagtat atgctaccta ctttactaaa tttcctctta 4140 aacaaaaacg ttgatataat gtctgtcaca atgcaagatt tattttctta tttgaattga 4200 atatattact tttctgcgat aggttataat ctatttttgt atacaatgtt gatatttttg 4260 ctgttgcttt gtatttctaa tctgtcatgt ggaactgtaa gggagcaggg ggcttcgtca 4320 agctgttaaa cagctttttc cctctgtttc ctccattcct gtatcgaatg ggaaataaac 4380 ttcattatta ttattatt 4398 // ID Kolobok-2_TV repbase; DNA; INV; 2566 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-2_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-2566 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 118-118 (2007). XX DR [1] (Consensus) XX CC Kolobok-2_TV is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the T. vaginalis genome CC in a last few million years. The Kolobok-2_TV transposon is CC characterized by 16-bp imperfect terminal inverted repeats, TTAA CC target site duplications, and it encodes the 465a Kolobok-2_TV1p CC transposase. Kolobok transposons, including numerous families of CC non-autonomous elements, constitute >2% of the T. vaginalis CC genome. See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS 588..1982 FT /product="Kolobok-2_TV1p" FT /translation="MTSQEVHDATNSEIIRVQYEGFGDLIKLDGKSFTFAT FT LICGKTYQEASNLLREAGIDVPSSQYFYRIQKVILEKAKELAEKSVKNHAL FT QLEPGTVITIDAHFSHVRNADECEFVAMGPKGDIVAKFVIVRSRKGKAGNY FT TGASCQMESVAADQALQILEDTVGKGIVDGFCHDRDNKSRNIVHKHFPDAV FT EYQDGFHCKKKFELCWKRLTSNEYAKSDMPIVDSKIFHGIKNRLKIFFNTL FT LDKHISKEQKLFYWLHAVDHLIGNHSPEYCLHSTKDEEKEHFIWTAGLENE FT NARNHLLEFLKDSGSVFQIVNPDYATHINESYNARSCRFNPKHTNYLSSYE FT GRTDIATIYHNEGYQGLLELRNLSGIKPIHPILRDSLIEEHKEKKLRSERE FT KTEEYRSARSERRKARKLIVKGKGDYGNGQDEEIISNESQNSSEIPKYVIA FT GMPQEKREPFPLPQIHGFQE" XX SQ Sequence 2566 BP; 941 A; 365 C; 401 G; 859 T; 0 other; aggtcgagtc aacaaaaatc tgacacatga cttcactgat atcatttcta gttaaaaaac 60 atagaataca tcattgattt tcatatgaat atttttatca tttgttatac ttctaaatat 120 tcaaatcata atggatataa ttaagatttt attgtttcat tatatttaag atttcgacaa 180 aaaatcgatt ctggcttttt tgttgtagaa taaagcatct gtctatattg tattacaata 240 atatggtaag gaaaattcca acaatttata ttttcactct aaaaaagata ttttctagaa 300 aaaaatacat taaaatcaat ttattccata gctctaagtt atttatggtc aatttccata 360 gacttatggc aatctctttt taggggaata aacattcagg gttgagacag aaatatttaa 420 aaatgactta atattatatt aaatttttat tatttcgaaa cattctcgtt tttacgttga 480 atgtcagtat ccccaaaaag aatactgtta gtaaattgta aattattaga aataaaccca 540 taattcaagg tgtcgtattc agaagtgaag aaacaaaaaa ctgacaaatg acatcacagg 600 aggtacatga cgccacaaat tctgagatta ttcgcgtcca atatgaagga ttcggtgatc 660 taattaaatt agatggcaaa tcatttacat ttgccacttt gatttgcggt aaaacatatc 720 aagaagcaag taacttatta agagaggctg gtatagatgt accatcgtca caatactttt 780 acaggattca aaaggtgatt ctcgagaaag cgaaggaatt agctgaaaaa agtgtaaaaa 840 atcatgcatt gcaacttgag cctggaactg ttatcaccat cgatgctcat ttctcgcatg 900 ttagaaatgc agacgagtgt gagtttgtag caatgggtcc taaaggtgac attgtggcaa 960 aattcgtaat cgttagatct cgtaaaggta aagctggaaa ttatactggt gcctcatgcc 1020 aaatggaatc tgttgccgct gatcaggctc ttcaaatttt ggaagatacg gtgggaaaag 1080 gtattgttga cggtttttgc catgataggg ataataaatc ccgtaatatt gttcataagc 1140 attttcctga tgctgttgaa tatcaagatg gcttccattg caaaaagaaa ttcgaattgt 1200 gctggaaacg tcttacttct aatgagtatg caaaaagtga catgcctatc gtcgactcaa 1260 aaatttttca tggtatcaaa aacagactca aaatattctt taatacttta ttggataagc 1320 atatttccaa agaacagaag ctattttatt ggctccacgc cgtagatcat ctaattggaa 1380 atcattcacc agaatattgt ttacattcca caaaagatga agaaaaagaa cattttatat 1440 ggacagctgg acttgaaaac gaaaatgcaa gaaatcatct actcgaattt ttaaaagatt 1500 ccggatcagt ttttcaaatt gttaatccgg attatgcaac tcatataaat gagtcatata 1560 atgctcgatc atgccgtttt aatccaaaac atacaaatta tctcagttca tatgaaggca 1620 gaacagatat tgctacaatt taccacaatg aaggatatca aggattactt gaactacgta 1680 atttatcagg aattaagcca attcatccta tccttcggga ttctcttatt gaagagcaca 1740 aagaaaaaaa gttgcgctcg gaacgagaaa agacagaaga atatcgatct gcgcgctcag 1800 aaagacgaaa ggcaagaaaa ttgattgtaa aaggaaaagg agattacgga aatggacaag 1860 atgaagaaat aatttcaaat gaatcacaaa attcttcaga aattccaaag tatgtaattg 1920 ctggaatgcc tcaagaaaag agggaaccgt ttccattacc tcaaattcat ggttttcaag 1980 aataatttaa taatttaaac aattttattt tttacaactt ttaactttac tataacatta 2040 cataatgtat acacaatata tttgttatat tattttaaat tttacaaatc caagagattc 2100 tgattgcaaa aaaattatga tttaatgcga taattctatg atgtttgtaa tatactattc 2160 atgtatattc agtatgacta ttattaatgc atcttaatgg ggataaagaa ataatagata 2220 attgattttc acatatgata tatttattgt tgccaatata aatatttttg taagtatctt 2280 atactgttac tcttatgatg cttattataa aactcaagat aaaaagattt caacaaattt 2340 tcacatatga tatatttatt gttgccaata taaatatttt tgtaagtatc ttatactgtt 2400 actcttatga tgcttattat aaaactcaag ataaaaagat ttcaacaaat ttcattatat 2460 ctcattttta ttaatgtgat tattattatt agtagtagtg ttttatatta gaactcttaa 2520 atacgatatt tgaatttcaa gtgaagtcac tttgttgagg cggcct 2566 // ID LmSIDER2 repbase; DNA; INV; 537 BP. XX AC . XX DT 08-NOV-2007 (Rel. 12.11, Created) DT 08-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE DIRE-derived SINE element: consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; DIRE; KW LmSIDER2. XX OS Leishmania OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae. XX RN [1] RP 1-537 RA Bringaud F., Mueller M., Cerqueira G.C., Smith M., Rochette A., RA El-Sayed N.M., Papadopoulou B. and Ghedin E.; RT "Members of a large retroposon family are determinants of RT post-transcriptional gene expression in Leishmania."; RL PLoS Pathog 3(9), 1291-1307 (2007). XX DR [1] (Consensus) XX SQ Sequence 537 BP; 107 A; 142 C; 185 G; 103 T; 0 other; ccctgatgac gagggacacc tcagtgtggt atcagggtcc agtgcacccc actctgtgag 60 gaagccaagc agctccctct atccctgcca atgccgagcc acttctggtg gtgacagggt 120 caagcgccta cgacgtaggg gaggtcagag cgatgcatcg ctgctgatgt cggcggtcag 180 gtcctggatg gcgttgcgtc ggagcgacct gcgacagtga acacgcttgt gccatccata 240 tgatgggcag agtgtcagcg tgactcgagc gtatcccacc cggccctcac tgcctactgg 300 tggggagcct gagccacccc gagggatgca ccaggtggcg accggcatga tgggagcggc 360 tgtgaggcga cctgcggagc gggtgggtag agtttgaggc aggggccgtg ctcagatgac 420 tgagtcggcg cattgctgta acgtgtgtct acggctgctt cgcaccacgc gatgggcctg 480 tgacaggccg ggagttgact catgttgtat ggcagaatgg acgttgaaaa aaaaaaa 537 // ID Gypsy-37_NVi-I repbase; DNA; INV; 19389 BP. XX AC . XX DT 01-JUL-2009 (Rel. 14.07, Created) DT 01-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Gypsy LTR-retrotransposon from Nasonia vitripennis, interanl DE region. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-19389 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1390-1390 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 16714..18924 FT /product="Gypsy-37_NVi-I_4p" FT /translation="TKFNGNIGINITNPQVDHTIKMTAIGVKNENNNNNLT FT QNIVAITALSHRENATTRHRSYRCAASKETNGGITIQAINHNPGLLVEKLN FT PLATSSTSWKMIQRINLETFFSRWNKLLGSLGILKTFCXQINTACPYEQQL FT AKIRQQIEETARNADKVKELLQGHETTQKPRIKRAWFSAIGKISRTIFGTL FT DEEAGEEIQQLISITANDTRQLAKLLANQTELIHTEFGILHEKADELDNAM FT DELIANQANLQKQEEIAEALNLLEDNVIQYELDTEILTDAVLFATQGVVHP FT RFVTPEQIEKSEALAKESVADSRFPSTEADMPITELIKISDVSILFQDMHL FT VYYIGMPLLDYGDYDLYKASALPIKQKSLNASDTFAYIWPETEYFALDQGS FT NAYIPISLDALNKCKKLSKLYICRNTEPVHEINDGASCEIQISTGKTPINL FT SECKIKVMKSRDTFWIRTSTPNTWVFSTNKKERLFSSCKRLTDNTIDIDGV FT GTLRLPHGCTAHTNTVRLVASREVRVNYLTVLFNETELDIASLINSTGNTS FT LAFIKDTLEKEGTAKSLNLRRDMFSRSLKTGRDFRDIIDKAKRLGEYKDIY FT GRFSSLQSTVTYSGTGILTFVIIVVIIWKVLTKRGAPLLNRDRLISGHNTH FT AQLGTERPQMYEARQPEMANQGLPYYLPMQSAGLSLPNLQQLAAANGMLLQ FT PALMPRDSRLNLQMIGAKNNEDNQSRRNIRLSESIF*" FT CDS join(3299..8095,8129..8686) FT /product="Gypsy-37_NVi-I_2p" FT /translation="NLDRLLYHLFQDERFIIPARTRQVIYARASDRSEQVG FT FAPLQNLGEGLYFGNFIGINQEGKIHALCVNTTEHEVELNPPEVTLEACEA FT IKEGGEEFNLEEDEPEETVNILSIFSNESKDRAERIFELLDPETLKDLTEE FT EITHVKEIIREKPHIFGLPGEKLKATTLVSHKIPTTTNVPVRSKGYRPSQE FT EKEELGRQIEQMLKDDIIRPTDSPYSSPTYVIPKKPAADGTKRWRLVTDFR FT KLNEVTVGNSYPIPNTADIIDSVASARYITAIDLRTGFYQIPMDPADAHKT FT AFTVPFGHYCYQRLGMGLKGAPSTFQSLMDLALAGLQGTELYVYLDDIIVF FT ARDLEEHGKKFRRLMKRLDDANLTIEPMKCQFLQREAHFLGHIAGGGKIRA FT DPKKIKAVNKFPVPTNAKKIKQFLGLAGYYRRFIKDFAKSAAILSRLLRKN FT EPFVWKEEHQQAFDKLRTALCEEPVMKAPDLTMPFLVITDASDYAIGAILS FT QGQLGSDQPCAYASRVLKGPELRYSTYDRELLAVVFAKDQFRHYLAGRKFT FT VVTDHEPLKHFFTTKKPDLRFNRLKAELRGYEFDIVYRPGARNGNADALSR FT NPILDEGEENPERPKAELYELADKQEHDATILRFKTTRKRGKSKKLPESAD FT TADKERISTESENSGNPRERKRRVVVYTESTGPDNSSDDEWQPKAGTRNGK FT KRVYKKVHGEKTGKIYPTCHKLSDSRETSEPILSQADPNRVLGKLCKEENM FT RLGVQAASTPQTRGTNYNDDPKRPIRKQTLNRSLPKFHRNDMEISEDSNSS FT LGSDAEDEPLRAGQVEREQEVEISRKLIEKEPSLNRHDLSQPLTPRLTPSS FT NNSHETSRPIPQKTNSEPSNVDPGNKPTSAESPIANLRQTLFGKENKFRTR FT LTVDVEEISKRDKSGVLIKIWESGAQSSNEIPDEKRVNKRNPSAETPYESI FT DSPQPILTTHIPLAPHDSHPSTLKHTLPIYRPRKDFYHITPQIDDQANAKP FT LPESSASEASRGDIDLRLHNMSKHDNTSPIKGGSRHKSVEPMEGSFNDQPA FT TDRTAATMGTSPRAQPTPTPYTGTDLDAIAQRYCVYSQHEDCAEPSKRLTH FT DKNIPMEPPXNCLFNSIIKILNLQMTLSQLRYKLLCSPHVETYGDPEEARR FT ILLSEVEYGDADCIYVFSREFGYNVCVHFQTNNRNNYCHFNLNEGKEFIHL FT HLAGNHFTPYIITEDESDEESESESEQEEQPANFQRNPHVREAMARYLIAE FT ARPEQDSPEWERDPNDKELFTGVQSLNDHPFAHKCNLAYILAEDMILDTEV FT LNALIERRYIHAEELLETQRFVGEIIVTKYREAYMFGIVLKKHLHDKPLRP FT DIAGCIKTLQLLTEKYGITTFGFIRDLGIITLADWEHVIEQCNKTFKGKAI FT SIVLFKNNLPVPKVEDRYRLIKEYHESSIGGHKGTNKTYNRIASEYYWRNM FT QADVRQFVRGCAGCQEKKLVRVKTKLPMLITDTPSHPFDKISIDFCGPMPQ FT TERGNRYILTIQDNFTKFCILVPVEHATAREVTRALTEKLICYFGSPVTLI FT SDQGAHFMNRVMEEFARIFRVNLKYYNDRVLSAQRMHHTLKEYIKMYIKDN FT DNWDEVVPLAXHXYNSRETDAXXXTXRQKXVFXRRARTPSSFPPREHLQTY FT XDXLADTTEAYAQLRTMAAMNLVQSKHRSKYYYDRKLNVQHFREGEMVYVL FT KEPKVGQFKSEYKGPCEIIAINYSTHNAKIQKGDKTRIVNINKLKRYIPEI FT EQEEGTDISMNKK*" FT CDS join(11274..10015,9933..9217) FT /product="Gypsy-37_NVi-I_3p" FT /translation="LDISLIDRLLRYFSXCXFXFFLSARYPRRSRTPIRGF FT VETTSPILNLSGRVAQGVSQFPYNPSAVSNPPAPLPPITSTYPPSFPAGIS FT EERSGSEVSHTPSVYPRATTRDSVATSVCNGGSDLTRENISGWAGYWYNSA FT TVSAPEAIPASTVRSDFHDGALAYTEYFSANLSGSHPVIHDSSFDVTRTNS FT ENYESVFPSAGGLLPPFGSVPSFEVSRAVPPRESLSQDALPHLSMPALGAP FT PERSVVEVDLSNDSSNPSFSMLSNDSFSSSHSAEVVARPSSRLSISNNSGP FT VLVPCPICCDYMFTPIYQCLNGHSVCSFCWDNTTVCAICRGPRGGVRNLVL FT ESLAGEQTQACRYQYLGCRQFLKPDDWTSHARHCSFSPVNTLVRGSPSPVL FT AVSSESAVRSANPRRIVTDSDSDSLFSLSLLQVRPSLRIPYEPGRALVPAR FT PNPDPVDTWVGITPPASPWDVDQPXAQVPSSPTSSVPDDLLGNLFLDSDED FT GVSDDVPGVVAVAESGELAAAQSTSPALPAENPDEAEALLRDLFTCPICDN FT FIYPDFRQCDNGHAVCESCLSRCSVCPVCRGPAPVWRNRLLQGVGSRMHFP FT CRHHHLGCPEVPAADAWSRHSDSCAYRPRPLSPATRRDIDRRTLFTALKLC FT LIYMRLIHI*" FT CDS join(306..842,743..2746,2668..3393) FT /product="Gypsy-37_NVi-I_1p" FT /translation="MAESNPSDYKMSNFNEHNGLETQNIFTREANDPDTFQ FT RRLQEFEMSAEPALLLHTRLTIALQLADVAMAYPSRGMIDTAISHIQGIVQ FT QERLIPTDKIQLLKVAKEHLLQVRDKIELSETNDKTSQQSNLHVTHNNGTI FT PKCKIFNENDPFALDIKEEKLDDYVTDRKQQLFAKTCVSGRKRPIRARYQR FT RKIRRLRYRPKAATIRENMCIGLNSGALDTVNDIYDKKPIFASARNDRVSL FT IDTRHDSNDAPYTEHRGLTFTEQLPADRRLNRSLLGELPDPKNHGLPSIGN FT FFDKLDAEQQPGPQPGDSDFIEQYEQFKKRMQDKVHEDLNSKPSSDPIIRH FT TQNVQQHRDRSRDGGWMFSLHQALNTIPHYDGNPDMLALFCRAVRDVIHEF FT GPESERWVLNSLASKFRGRAADGYTSRLTQYDTVERLLADLTTQYSGVGGA FT DKVLADLKVIRQYAGESAGDYGLRVQTLHNRLLNIYDASRDMRNWERATYK FT NNADKEALEQFLFGLNGELQHQVRSKNPLCLREAITEAVAFEHRTSARRAY FT SQQPDLIREDSTKIPSWAAELMSIAKAAYEPKATPVASEAVRYTTGMSVCN FT YCRQQGHLEKDCKEKICRYCNIVGHTYVECFSLKNAINNGRLSKEKLEKLA FT SAPAAAAQPVVYTPFPIPTPAPMPYVIPAPQANNATQNNSAPQENNGKQNG FT AYRKNNVRRYSGERNGGDNRNYNNNNGYNNRDDNRNYNNNNRYNNDRDDNR FT NYNRRDDGGYDNRRNDRGFENRRNNRDFDNRRNDRGFDNRRNDGRGNNNNN FT RYNDQNNGNYNNDRNRQDNRNARNQSGDSLNYQDARYQSGVSGNQQKCQKS FT VRRFFKLPGCSLPVGGIGQSISNNPPNPKPTPTQEKKVEGQISKKQEKIEN FT SKRQIKEIAVQFNKLKEQSKQEKEANVQEKINNRKYLIRFATAGRKPPIVE FT LTSEKLEGRKARFYADSGADVTLIKASNVTSRTIIDKQYVIAINGVTPGEC FT LSLGRTNMSLEGLECNVHVVPDEFPIDTDGILGWDVLSKYGGKVNAADRCI FT EFGQTIIPFISRRKIYYTCTYETSDLRKSKR*" XX SQ Sequence 19389 BP; 6928 A; 4344 C; 4275 G; 3804 T; 38 other; ttttctggtg ctaagtgtgg ggttttgttc gcaaaatata cgggcgtatt ttaacaccgc 60 gatagagata gattactcgg ttatcatttc aatccaatag acgggaagga aggatgaacc 120 ttgcctgggt actcatcatt gcagccacct taaggtaagt caagtaaatt ttaataaaaa 180 ctaacaatta taccctggga ttaacaccag tgagaacttc tccggttcgg ataactaaca 240 actaacttaa accgctccga aatatcaaat tccgtggtct tagataccgt gataaaatac 300 gaaatatggc agaatcgaat ccatctgatt ataaaatgtc gaactttaac gaacataatg 360 gtctcgaaac acagaacata ttcacacgcg aggctaacga tcccgacact tttcaaagac 420 gtttgcaaga attcgagatg tcagcggaac cggcgctact tctgcacact cgattaacga 480 tcgcattgca actcgcggac gtggccatgg catatccgtc aaggggaatg atcgacaccg 540 cgataagtca catacaggga atagtacagc aagaaaggtt gatcccaacc gataaaatac 600 aattgttaaa ggttgcgaaa gaacacctac tgcaagtccg agacaaaata gaactttcgg 660 aaacaaatga caaaacgtcg caacaatcta acctccatgt tacacataat aacggaacga 720 tccccaagtg taaaatattt aacgaaaacg acccattcgc gctcgatatc aaagaagaaa 780 aattagacga ctacgttaca gaccgaaagc agcaactatt cgcgaaaaca tgtgtatcgg 840 gctaaactcc ggtgctctag acacagtaaa tgacatctat gacaaaaaac caatcttcgc 900 ctcagctcga aacgacagag ttagcttgat cgacacgaga cacgactcca acgacgcacc 960 ttataccgaa caccggggat taacctttac cgaacagcta cccgccgata gacgtcttaa 1020 ccgatcgctg ttaggtgaat tacccgatcc taaaaatcat gggctaccct ctataggtaa 1080 tttttttgat aagcttgacg cagagcaaca acctgggccg caaccgggag actccgattt 1140 tatagagcaa tacgaacagt tcaaaaaacg catgcaggat aaagtacacg aggatttaaa 1200 ttcaaaaccg tcttctgatc cgattattcg acacacacaa aacgtacaac agcatagaga 1260 ccgctctcgg gacggaggct ggatgttctc gttacaccaa gcattgaaca caatccctca 1320 ttacgacgga aatccagaca tgcttgcact attttgtcgc gcagttcgcg acgtaattca 1380 cgaattcggc ccggaatccg aacgctgggt actcaactcg ttggcatcga agtttcgggg 1440 acgtgctgct gacggctaca cttcgcgcct gacgcaatac gacaccgtag agcggttgct 1500 agccgattta actacgcagt acagcggcgt tggcggagcg gataaagttt tagctgatct 1560 taaagtaatt cgccagtatg caggagaaag tgccggagac tacggactac gcgttcaaac 1620 attgcacaat cgattgttaa acatctatga cgcgagccgc gacatgcgca actgggagag 1680 agctacctat aaaaataacg ccgataagga ggccctcgaa caatttttat tcggtctaaa 1740 cggagaatta caacaccaag ttcgttctaa aaacccacta tgtctcagag aagctattac 1800 ggaggccgta gcatttgaac accgtactag cgcgcgccgt gcatacagtc agcaaccgga 1860 tctgattcgc gaagacagca ctaaaattcc gtcgtgggca gccgagctta tgagtatcgc 1920 aaaggcagca tatgaaccga aggccacgcc agtagccagc gaagctgtca gatacactac 1980 gggtatgagc gtttgcaact attgtcggca acagggacac ctagagaaag attgcaaaga 2040 aaagatatgc cgatattgta atatcgtggg ccacacctac gttgaatgct tctcgttgaa 2100 aaacgcgata aataacggac gtctcagtaa ggaaaaactt gaaaagctag cgtcagcacc 2160 cgctgccgct gctcagccag tagtgtatac ccccttcccc atacccactc cggccccgat 2220 gccatacgtc atacctgcac cgcaagcaaa taatgcgacg cagaataata gcgctcctca 2280 agaaaataat ggaaagcaaa acggcgctta tcgaaagaac aacgtgcgca ggtattcagg 2340 agaacgcaac ggcggggata atagaaatta taacaacaat aatgggtaca ataatcgaga 2400 cgataataga aattataaca acaataatag gtacaacaat gatcgagatg ataatcgaaa 2460 ttataatcgg cgcgatgacg ggggatacga caacaggcgt aatgaccgcg ggtttgaaaa 2520 tcgtcgaaac aaccgggatt ttgataatcg gcgaaatgac agaggatttg acaataggcg 2580 caacgacggt agaggcaaca ataataacaa tcgttacaat gatcagaata acggcaatta 2640 taacaatgat cgtaatagac aagataacag aaatgccaga aatcagtccg gagattcttt 2700 aaactaccag gatgctcgtt accagtcggg ggtatcgggc aatcaataag caacaatcca 2760 ccgaatccga aaccgacgcc aacacaagaa aagaaagtag aggggcaaat cagtaaaaag 2820 caggagaaaa tcgaaaattc aaagaggcaa ataaaggaga tagccgtgca gtttaataag 2880 ttaaaagaac agtcaaaaca ggagaaagaa gccaatgtgc aggagaaaat taataatcgg 2940 aagtacttaa taagatttgc tacagcagga cgtaaaccac cgatcgtaga gcttacgagc 3000 gaaaagttag aggggaggaa agcacgtttc tatgcggata gtggagctga tgttacgttg 3060 attaaagcga gtaatgtcac tagtaggacg ataattgata aacaatacgt aattgccatt 3120 aacggagtta cgccgggcga atgtttaagt ttaggtagga caaatatgag tttagaggga 3180 ctagagtgca acgtgcacgt ggttcctgac gaatttccca ttgataccga tggaatctta 3240 ggatgggatg tgctttcaaa atacggtgga aaggtcaacg ctgcggacag atgtatagaa 3300 tttggacaga ctattatacc atttatttca agacgaaaga tttattatac ctgcacgtac 3360 gagacaagtg atttacgcaa gagcaagcga tagatcagag caagtaggat ttgccccttt 3420 acaaaattta ggagaagggt tatattttgg aaattttatt ggtattaacc aagaaggcaa 3480 aatacacgcc ttatgcgtta atactacaga acatgaggta gagttaaatc caccagaggt 3540 aacactggaa gcctgcgaag ccataaaaga gggaggagaa gaatttaatc tagaagaaga 3600 cgaacctgaa gagactgtca atatactatc gatattttcc aatgaaagca aggacagggc 3660 cgaaagaatt tttgaacttc ttgatccaga gacacttaag gatttaacgg aggaagaaat 3720 aactcatgtt aaagaaatca tccgagagaa gccgcatatc ttcgggctac ccggggagaa 3780 acttaaagca acaacactag tgtcgcacaa gatacctact actacaaatg tcccagtacg 3840 atcgaaagga tacagaccaa gtcaagaaga gaaagaggag ttagggcgcc agatagagca 3900 gatgctgaag gacgacataa tcagaccaac agattcaccc tacagctctc ctacatacgt 3960 gatacccaaa aagcccgcag cggacggtac gaaaagatgg cgattagtaa cagactttag 4020 aaaactgaac gaagtaacgg tgggaaatag ctacccgata cccaatacag cggatattat 4080 tgacagcgtc gcctcagcta ggtacataac cgccatagac ttgaggaccg ggttctacca 4140 gataccaatg gaccccgcag acgcccacaa aacagccttt actgtaccgt tcggccatta 4200 ctgctatcaa cgtttaggaa tggggctcaa aggcgctcct tcgacatttc agtccctgat 4260 ggacttagca cttgcaggct tacaaggaac agaactgtac gtctatttag atgatataat 4320 cgttttcgct agggatcttg aagagcacgg taagaagttc cgacgcttga tgaagagact 4380 agacgacgcc aacttgacaa tcgaacctat gaagtgccaa ttcctgcagc gagaggcaca 4440 ctttttagga catatcgctg gaggaggaaa aattcgagca gatcctaaga aaataaaagc 4500 agtaaataaa tttccagtac caacaaacgc caagaaaatc aaacaattct taggactcgc 4560 aggatattac aggcgcttta tcaaggactt cgcgaaatca gcagcaatcc tatctagact 4620 gttgagaaag aacgagccct ttgtgtggaa agaagagcat cagcaagcct ttgacaaatt 4680 acgaacggca ttgtgcgagg aaccagtgat gaaggcaccg gacctaacta tgcccttttt 4740 agtaataacc gacgcatctg attacgccat aggagccatc ttgagccaag gacagttagg 4800 cagcgaccag ccctgcgctt atgcatcgcg agtccttaaa ggacccgagc taagatattc 4860 cacctacgac agggaactac tagctgtagt gttcgcaaaa gaccaattta gacactacct 4920 cgcaggacgc aaatttacag tagtaacgga tcacgagccc ttgaaacatt tcttcaccac 4980 gaaaaaaccc gatctaaggt tcaacagatt gaaggcagag ctcagaggct acgaattcga 5040 cattgtatac agacccggag ctcgcaacgg aaacgcagat gcactctcta gaaatcctat 5100 cctcgacgaa ggagaagaaa atccagagag gccaaaagca gaattgtacg agttggcaga 5160 taagcaagag cacgacgcta cgatactacg atttaaaaca acaaggaaaa gaggcaaatc 5220 caaaaagctt ccggaatcag ccgatactgc agacaaagaa agaatatcga cagaatctga 5280 aaactctgga aacccaagag aaagaaaacg acgagtcgta gtttacaccg agagcacagg 5340 tccagataac tcctcagatg acgaatggca acccaaggcg ggaacgagaa atggtaaaaa 5400 gagagtttac aagaaggttc acggcgagaa aaccgggaaa atatacccca catgccacaa 5460 attatccgac tcacgggaaa catccgagcc aattctcagt caggcggacc cgaacagggt 5520 cttggggaaa ctctgcaagg aagaaaatat gagacttgga gtacaagccg caagtacacc 5580 gcaaactcgg ggaacgaatt acaacgacga ccctaagagg cccatacgga agcagaccct 5640 taatagaagt ctgccgaagt ttcacagaaa cgacatggaa atcagcgaag acagcaacag 5700 tagcttaggt agtgacgccg aagatgaacc cttaagagcc ggtcaagtag aacgggaaca 5760 agaggtagaa atatcacgca aactcatcga gaaagaacct tcactgaatc gccacgactt 5820 atcacaacca ctaacgccga ggctgacacc aagcagtaac aacagtcatg aaacatcaag 5880 gccaatccct cagaaaacga acagtgagcc gagtaacgta gaccccggta ataaaccgac 5940 gagcgcagaa tcaccgatag caaacctccg acagacgttg tttggcaagg agaataaatt 6000 tcgaactcgc ctcaccgtag acgtagagga gataagtaaa cgagataagt caggagttct 6060 aattaaaata tgggaaagcg gagcgcaaag tagcaacgag atacccgacg aaaagagggt 6120 caataaaaga aatccaagcg cagaaacacc atacgaaagt atagactctc cacagccgat 6180 tcttaccaca catattccct tagctccaca cgacagtcat cctagcactc taaaacacac 6240 tcttcccatc tacagaccaa ggaaagattt ttatcacatt acaccgcaaa tagacgatca 6300 agctaacgca aaacccttgc ctgagagcag tgcttcagaa gcatccaggg gagacatcga 6360 cctacgcctt cataatatga gcaaacacga taatactagt cccataaagg gaggaagccg 6420 ccacaagtcc gtagagccaa tggagggctc gtttaacgat caaccagcaa ccgatcggac 6480 ggcagcaact atgggcacgt cacccagggc tcagcctacc ccgacaccat acacaggaac 6540 agatttggat gcaatagccc aacggtattg cgtgtactca cagcatgaag attgcgcgga 6600 accatccaaa cggttaacgc acgataaaaa catacccatg gagccaccag raaactgcct 6660 gttcaactcg ataataaaaa tcttaaatct acaaatgaca ctgagtcaac tgagatataa 6720 actcttgtgc agcccgcatg tggaaaccta tggtgatcca gaagaggcac gccgcatctt 6780 gctatcagaa gtcgagtacg gagacgcgga ttgcatatat gtattctcga gggagtttgg 6840 atacaacgta tgtgtacatt tccaaacgaa taaccgaaat aattactgtc acttcaatct 6900 aaacgagggt aaggaattta tacatttaca cctagcgggc aatcacttca cgccgtatat 6960 tataacagag gacgaatccg acgaagaatc agaatctgaa tcagagcagg aagaacaacc 7020 ggcaaacttc caaagaaacc ctcacgtgag agaagccatg gcacgctatt taatagctga 7080 ggcgcgacca gaacaagatt caccagagtg ggaaagagat cctaatgaca aagagctatt 7140 cactggggta caaagcctga atgatcaccc gttygcgcat aaatgcaacc ttgcttatat 7200 cttggcagaa gacatgatcc tcgacacaga agtactaaac gctttgatag aaaggcgcta 7260 catacacgca gaggagctct tagaaacaca acgtttcgta ggcgaaatca ttgtaacaaa 7320 atacagagag gcgtacatgt tcggaatcgt tttaaagaaa cacctacacg acaaaccctt 7380 aagaccggat attgctgggt gtataaaaac actacagtta ctgactgaaa agtacggaat 7440 aacaacgttt ggctttatca gagatttggg aataattacg ctcgcagact gggaacacgt 7500 aattgagcag tgtaacaaga cctttaaagg aaaagcaata tcaatcgttt tatttaaaaa 7560 taacttacca gtaccaaagg tcgaggacag atacagactt atcaaagaat atcacgagtc 7620 atcaatcggt ggtcataagg gaacgaacaa aacatacaac agaatagcca gcgaatacta 7680 ttggcgaaac atgcaagcag acgttagaca gttcgttcga ggatgcgcag ggtgccaaga 7740 aaagaaacta gtgcgggtaa aaacaaaact gccaatgctc atcacggata ccccatctca 7800 cccatttgac aaaatatcaa ttgatttctg cggaccaatg ccccaaaccg aacgaggaaa 7860 ccgatacata ttaaccatcc aagataactt cacaaaattt tgcatattag tgccagtcga 7920 acacgcaaca gcacgggaag taacgcgagc tcttacagaa aaactgattt gctatttcgg 7980 ctcacccgta accctcatct ccgatcaggg agcacacttc atgaaccgag ttatggaaga 8040 attcgcacgt atattcagag tgaatctaaa atactacaac gaccgcgtac tatcgtgatc 8100 agcgtaaaaa cccagtccag cggaataggc ccaacgaatg catcatacac tyaaagagta 8160 cattaaaatg tatatcaaag acaacgataa ttgggatgaa gtagtgccat tagcaymgca 8220 cgsatacaat tctagggaaa ccgacgccra ckaggwtact saccgacaaa aaawagtctt 8280 trgccgacgy gcacgcacac cgtcaagctt cccaccacga garcacttac aaacgtacka 8340 cgackatctt gctgacacaa cagaagctta cgctcaactt agaacgatgg cagcgatgaa 8400 tctagtacaa tcaaaacacc gctcgaaata ctattacgac aggaaactaa atgtgcaaca 8460 ctttagagaa ggagaaatgg tctatgttct taaagaaccc aaagtcggcc agtttaaatc 8520 cgaatataag ggcccgtgcg aaattatagc tattaactac tcaacacaca acgcgaaaat 8580 acaaaaaggg gataaaacac ggatagtaaa tataaataag cttaagcgat atataccgga 8640 aatcgaacag gaagagggta cagatataag tatgaacaaa aaataataat aaaattgaac 8700 aaaaaaaaag cataatacta aggttaaagg ttataattca ataaaataat aaaataatta 8760 tatagggata gtaggataag ggaatacaat actaatataa gagaaaacat attagccatg 8820 gtactaggat aaggatatat acacacgcga tttgaaatta aacattgcag agtaaacagt 8880 aattaaaagt agaaacgttt aaattaaaca atattgacat ttaaggaaat aattatattg 8940 aattaataaa tgataaacaa ataagaaaag ctcataaaca aactaacttg aacatctagc 9000 gacccccgga cgccgcataa aataaaggaa catttaatcg aacatctagc gacccccgga 9060 cgccgcaagg ttcgagcagt ctaggaaacc aagtacagaa aaatatacta atatgaatca 9120 ataaataaac aaaatttaac gaaagaacca ggaggggata ttgattggat cagaactagg 9180 tttttaaata aataatcact ggaattccca gtaaaatcat atatgtatta atcgcatata 9240 aatcagacaa agtttaagcg cagtaaataa agtacgacga tcaatatccc gacgggttgc 9300 aggcgataag ggccgcggac gataagcaca ggagtcacta tgcctagacc aggcatcggc 9360 cgctggcacc tcagggcaac ctaagtgatg atgacggcaa gggaagtgca tcctggatcc 9420 gactccttga agcagacggt tcctccagac cggagcaggc ccccgacata cagggcaaac 9480 actacaacgg gataggcagc tctcacacac ggcgtgaccg ttatcacact gccgaaagtc 9540 tggataaata aaattatcac agataggaca agtgaataaa tcacgaagca gcgcctctgc 9600 ttcgtcaggg ttctccgccg gaagagccgg ggaagttgac tgagctgcag ctagctcgcc 9660 cgactcagca actgcaacga ctccaggaac atcgtcagaa acaccatcct catccgaatc 9720 caggaacaga tttcccaata gatcgtccgg cacagaagaa gttgggctag acggcacctg 9780 agccraaggc tgatcgacgt cccagggcga agcaggcgga gttataccga cccaagtatc 9840 aacggggtct gggttaggcc gcgcagggac tagcgcccta ccaggctcat aaggaattcg 9900 gagacttggt cgaacctgca acaaggagag ggattatata gaaacaagta agcggagcgc 9960 agagtagaag ctaattcacg taaccgtaac agaagatgca taaaagatac gctagaagag 10020 actgtcggaa tcgctatcag ttacaatccg acgaggattg gcagaacgta ccgccgactc 10080 agaagaaaca gctaatacag gcgaaggcga cccgcggacg agagtattca cgggagaaaa 10140 ggagcaatgc ctagcgtgac tagtccaatc atcgggcttt agaaattgtc gacagcccag 10200 atattgatac cgacaagcct gcgtttgctc accggctaga gattccaaca ctaagtttcg 10260 cacaccgcca cgcggaccgc ggcaaatagc acacaccgta gtattatccc aacaaaaact 10320 acataccgag tgaccattta aacattgata aataggcgta aacatgtaat cgcaacaaat 10380 cgggcagggt actaatacgg gcccagaatt attcgaaatc gataaccggg aagacggacg 10440 cgcgactacc tcggcactat gcgatgagct gaacgaatca ttygatagca tagaaaagga 10500 aggatttgac gaatcattag agagatcaac ctcaaccacg gatcgctcgg gcggcgcgcc 10560 gagcgccggc attgataaat gaggtaaagc gtcctgagat aaagattcgc gtggaggtac 10620 cgcgcgggag acctcaaatg acggcacgga gccgaaaggc gggagcaagc ctccggcaga 10680 cggaaacacc gactcgtaat tttccgaatt cgtacgcgta acgtcaaaag aactgtcgtg 10740 aattacggga tgagagccag ataaattcgc ggaaaaatac tcggtataag caagagcacc 10800 gtcgtggaaa tcgctcctaa cagtcgacgc gggaattgcc tcgggagcag aaaccgtagc 10860 actattgtac cagtaaccgg cccaccctga tatgttctct cgcgttaaat ccgacccacc 10920 attgcaaaca gacgtagcga ccgaatcgcg agtagtagcg cgcggataca cggagggtgt 10980 atgagacacc tcggaaccgg agcgctcctc gctaatccca gccggaaagg aaggcggata 11040 ggtactcgtt attgggggca atggtgcagg aggattggac actgcagagg gattatacgg 11100 gaattgggat acgccctgcg cgacccgacc ggacagattt aaaattggtg acgttgtctc 11160 gacaaaacca cgaataggcg tccgactacg cctaggatat cttgcactta aaaaaaaaar 11220 aaaasaacac astgaaaaat aacgcaayaa ccggtcaatt aatgatatat caagttaaaa 11280 caaaattatt acttacacga gaactaggcg aagccacaga aatgggtcca ggcagaggtc 11340 gtccacgccg gaagacacga gaaccaaaag aagaaggtgc gaattccgag gacggacgga 11400 agcctcggcc agctgctcgg actgaagtag acgaggaatc acgttctcca ggcacccttc 11460 cgacagcacc tcggccacga ccagtwcgwc ggtgggatgg tcgcgcaccc gattgaggta 11520 acgtgtagaa gacttgrtgg atccacgatc gggtcacaac gtccgrgacg attataggcc 11580 gggtagactg ccggacggac cggaggaaca gcatgtcgtc aatcagccgg tctagtgaga 11640 tgctggaagc atcaacccga gccaaaatgc ggtccatagc ggccatcayg caaktcggag 11700 gacgaggagg cgcaaaggga ccctgcttcg aggccgccag gataaggcct aaaagcacga 11760 caggtccaga tccacacgaa ccagaagaag ccatctttct ttctcttctc tgaaaacgcc 11820 agagaaataa gtaagggaag aacgcagcct ccacagaaaa taagacgacg aagcgcaatt 11880 aataaacaaa cacttaccaa taaattcacg agaagatgaa ccaacagcac gaggcaccag 11940 aaagcactcg aaggaaaata cgtaacggtc aacaaaacag acggacagca attcggctag 12000 cagtgaggta gagaggagcg atcgcgatca atttatacac acgagcagag attcccctga 12060 tttcaaattc gcacaaactc tgtcagtggg gatagacgaa aaagtgtcga gacagcgtca 12120 gcacttagtc cggaaggcaa ggtgcgacgg cagacggtga agcgcgcggc gaatatcggg 12180 aacgaaaaac gcgaacgtat acaaaaatat tacgaacgat attacgatac aattatttaa 12240 ccgcgagacg attacgattt ttttaactat gtaacgccta aattttataa tcgcgaagac 12300 ggagatgtga aacgattacg gcaccggtaa taataacaat taggaaaagg tcattcgcta 12360 tgtaaaacac agggataata ttacgaggta taattaaagt acctaggaaa ataatactaa 12420 tttaaataat gagtaaatta gaagcgcggt agatgcgaag gatagtaygg aataaaataa 12480 tttaaaaacg agcagcaaat gcaacgaaac aaattcgcag taaggcgaac ggggaaatgt 12540 aaataggaac cccatcgccc caccgcggcc aacaccaggc aaatttaaka aggagacgcc 12600 agccgaatca gcaaacgctc gctgaagaat actrcaattg agcattckgc ggccaaacgc 12660 tcagagagag aaaagaaaag cacatacaaa aataagcaac gaccagataa ctgaaaatta 12720 cgaacttatt tttgcccagg gtacgggagc tgaaaaccgc cacatacagt atgcactagg 12780 cctccggtac gcgagataaa aatgtaggca acaaggcact aggtcaaaag ttcggcgata 12840 ctaaataaaa aaaaggatac acttgttaaa acaataatag tgacaggcaa taataaaatg 12900 gaactgtaca aaccatgtca gagctgcgcc gaagacgcac atgacaacaa gaacgaaccc 12960 agggggaaag ctaggaaaaa acatatgtac acataaaagc gtagtcccct cgaggagtat 13020 ggtacaataa gtcgggacaa attggaggct gaaagaattc agagaagcat aaatatacat 13080 ataatcccgg cagacacaca tctcacacgc aaaacgtaga atctatcgtt tgcaatcgac 13140 ccaagaagag aaaaggatct agtacgatat ccgtacaagg atagccttct acaagcctta 13200 cgcactagaa atgggcaaac aatcaagcgt gaacatatac aaggcaagga caaaaaaaca 13260 tcagaagcaa aaaaagaaac tatgcgcccg aaagaagcaa attaggagtg cagaaaaccc 13320 taaggcacac gcgcgcgcgg gcacaatata gatagacact aagacattta aaagcacatg 13380 tataaacaaa aaaaaaaaac ttaccacatg gacgcaatga gtagaatccc tcgcgatttc 13440 cgtatgcatm acctccaaca ctttagtgta cggcactcta tcgaactcgt gcacgaacga 13500 ttgacggata ccacgaggca gattgcactg agacatgatg cctaaaacta atatacggac 13560 gagccagtaa aagaaagcac taaacatata aagatagaaa aaagaagcaa ggaaattact 13620 cacataaaaa aaagaaaacg caagaaacca caacgaaatc gagtaaaaga gcaagcacga 13680 gaagagcacg tgttgacgcc ggaccaggaa aggctaggtg cagctgcatc ccgccaaatg 13740 caagtgcagt gaagttcaca accgaagtat acaagaacgc gggaacggaa ccacaagcgc 13800 ggaactagag gcacacgcag agcgctaccg ctcggccgga gtaggtacga cggccgtgca 13860 gcccaggaat gcacaactgc cgaaggcggc aatattcaaa tcaggatttt gggaatcgac 13920 gatataaatc gaaccaataa tccgatttca ccctccttcg agtcaaaggt gcggacacac 13980 cccccgagag gactcacacc ccaagaatcg caaacaaata attargggck gataaacaaa 14040 taaaaatttt tttatgcctg acaaaaagtt cggaaaaatt tctctataaa gggtggattt 14100 gtatagtcag gggatgtacc aagaccaccc ctatcataaa aatttttaat tctaaggaaa 14160 ccacgagtcc taggaaaaag tccttcgggg tacgtatgga gcatccccag gacatctaat 14220 ggacaaatag ggtgacccat tatttaaaca atcttacatg ttaaatgcaa tatctcggaa 14280 agttaccttc agaagggaaa aatctttacg gccctatatt gggcacacat aacgattaaa 14340 taatcaggaa aaagtccccc aaatgtttaa acaatcttag atgcttaatg aattgctcgg 14400 agagttacac tcacataaga aagatattat acatacatgt taggcatcta taacagttaa 14460 tgaaaagcaa aaattaaaca ccaattattt aaacaatctt agatgtttaa tgaattgctc 14520 ggagagttac aytcacagaa gaaagatctt acacatacat attaggcatc tctaacagtt 14580 aacgaaaagc gaaaatagtc gcccaaatat ttaaacaatc ttagatgact aatgcattac 14640 tcgaaaagtt acaataaggt caagaaaatt ttgtagacac gtatcgggca tccctaaagc 14700 ataagcccaa cacacacaca cacacacaca catacacacc cgacataaaa caagtctaac 14760 attcggacaa aacgaaagaa aaaacgcggt aagcaaaaaa cccatctaga gacagcgtca 14820 gatatcaagc gcctacactg gcatagaagc ggtcagaccg acgtcaacgg caaagcagag 14880 aacaccgtca gaccagcaga aaggaattaa accagtacac acgcagaatg gtaaagaaaa 14940 cgaaaagcat tcgcttagaa aaggtttatt aatagagtca tcccttaaag gatctctggc 15000 agaaagaatt agccgagacg agggagttgc tgaagaaatg gttcacggct gaacactacg 15060 agtacacttg cccttgcgga accgtagtaa caatggtaaa ctcgaccttc cgacggcaaa 15120 tagtcgtcgc aggagcagtc ctatacgact cgacgggaga cacatcctac gtaatgtcta 15180 aggaccccga aataaataag gaaagatttg ggaaaaagat ggacgagaag agtaaaagag 15240 aaagaggaga aatataagat cagaggtgta aataaagaaa gatcaataaa ccctaaccct 15300 aaccttaaca aaaaaagaaa aaacacaaaa tcacaggctg aggactaatg ccttaaaagg 15360 acaacggtcc cagtcaagac cgcaattacc aacagaagga aagactaact aggcatcgtt 15420 agccgataca gcgccctcct cgttagccga tacagcgccc tcccttaccg acggagcacg 15480 aaagcgaaac cagcataaga ataagatagt atgaagcacg acaagaagcc gacatggcaa 15540 cgcgaacaag gtgcccgcct cggaagtcgg ggacctgagt tcaagcccag gacaaaattt 15600 tttaattttt tatatttctg tttaggccaa cttacaccac ggtcgtaaca gcttccactc 15660 ggtcgctgag gcgcaattct ttttatattt ttttgttttt gttctttttt ttatattttg 15720 gggttgttgg gtttttggga agtcgcttta ttawtttcta tttagagaaa agggataaat 15780 aagaaaaaaa gggaataata ataataaata aattaattat aagaaacatc tggcactggt 15840 atccaccaat ggccagggag ccgcatcatc ccaggggcct cgtcgatgga ccggccacat 15900 acagagcacg gggccgtatc cgcggggatg ccccactcct cctcgtggca cgggaaccac 15960 gttccctcag caacgaaagg gatcatcgaa ttatacaact cataagacac gatgacccca 16020 taggaccgtt gctggaccct gaggtaagac ggcgctcccg gaggttgccg gcagaagacg 16080 caacgccctc cgatgaagag ggtagagcct tccactccgt ggcatggata gactccaccg 16140 tcaggcccta ggagctcttc gaacccttca tcatcggaca gggcctagga gaaaagcatt 16200 attaacactt atctagaaaa acctcaatag caaactctta cctcctccgg gtcgtccacg 16260 ggctcttcgc caagccccca gaactttgag aatcctaact ccccagccat tgtacaggaa 16320 tagtaaaaag ggaaaagaag aattaaaagg agaaaacggg aggatagtgt atggaggata 16380 gtcgcaatcg gactgcgaat aaacacacaa ccgacaaacc aagaagagct tgacgaaatg 16440 caatcatggc agaaaacccc accaagagaa gagctaccgg atgaagaaac acgaagatcg 16500 cgtctcgcga aatctgagcg cccgaaagac agcgcgcaag aaaagccacg aagaatcgga 16560 taagcaacga aagccttatc acggctaaaa ctttgaatca aataaatatt ttgctagcat 16620 taaaaccctt ggctaattaa agggatccca taagggagcg tcccataatg ctcctcatct 16680 agcaccatat agcctaataa ttcaaataaa taaacaaaat ttaacgggaa tatagggatt 16740 aatattacta acccacaagt ggatcataca attaaaatga cagccatagg cgtgaaaaat 16800 gagaataaca ataacaatct aacacaaaat atagtagcca tcacggcgct atcacaccgt 16860 gaaaacgcaa ctacaagaca ccgctcttac aggtgcgcgg catccaagga aaccaatgga 16920 ggaatcacca tacaggccat caaccataac ccaggactat tggttgagaa actaaaccca 16980 ttggcaacct ccagcacatc atggaaaatg atacaacgca taaaccttga gacattcttc 17040 tctagatgga acaagttatt aggaagctta ggtatattaa aaacattctg caygcaaata 17100 aacacagcat gcccgtacga gcaacagctg gctaagatac gtcagcaaat cgaggagaca 17160 gcgaggaacg ctgacaaagt aaaagagcta ctacaaggtc acgaaacgac acagaaacca 17220 agaatcaaaa gagcctggtt ctcagcgata ggaaaaatta gccgcacgat attcggaacc 17280 ctagatgagg aagccggaga agaaatccag caactaatca gcataacggc caacgatacg 17340 agacaactag cgaaattact agcaaaccag accgaactca tacatacaga attcggcatt 17400 ttgcatgaaa aagccgacga actagacaat gccatggacg agctgatagc caatcaagca 17460 aacctacaga aacaagagga aatcgcggaa gccttaaact tactggaaga caacgtaata 17520 caatacgaac tagatacgga gatcctaaca gacgctgtac tctttgcgac ccaaggagtc 17580 gtacacccac gcttcgtcac accagaacaa atagaaaaat cagaagccct cgctaaagaa 17640 tcagtagctg actcacggtt tccatccacg gaagcagaca tgcccattac ggaattaatt 17700 aaaatatccg acgtctcaat actgttccaa gacatgcatt tagtatacta tatagggatg 17760 ccactgctgg actacggcga ttacgaccta tacaaggcgt ctgcattgcc gatcaaacaa 17820 aaatcactga acgcatcaga cacattcgct tacatttggc cagaaacaga atattttgct 17880 ctagaccagg gatcaaacgc atatatccct atctccttag atgcactgaa taaatgcaaa 17940 aaacttagta aattatacat atgtagaaac acggagccag ttcacgaaat caatgacggc 18000 gcgagctgcg aaattcagat cagtaccggc aaaactccca taaacttatc ggaatgtaaa 18060 ataaaagtca tgaaatcaag ggacacattc tggatacgta caagcactcc gaacacttgg 18120 gtattctcta caaataaaaa agagagacta ttttctagct gcaaacgact gacagacaac 18180 acgatagaca ttgacggcgt aggcacgctg cgcctgccgc atggctgcac cgcacacacg 18240 aacacagtac gactcgtagc gtcgcgcgaa gtacgcgtta actatttgac ggtactattt 18300 aacgaaacgg aattagacat agcctcgttg ataaacagta cgggcaacac gtcgttagcc 18360 tttattaaag acacgctaga gaaagaaggg acggcaaaat ccttaaacct tagaagggat 18420 atgttctcgc gttcgttaaa aacgggacgg gattttcgcg atattattga taaggcaaaa 18480 aggctaggcg aatataaaga catttacggt agattctcgt cccttcagtc aacggtgact 18540 tactccggga ctggcatact cacgttcgtc ataatagtag taattatatg gaaagttctc 18600 acaaaaaggg gggccccgct actaaaccgc gatagattga tttcaggaca caacacccat 18660 gcgcagctcg gcacagagcg cccacagatg tatgaggcca gacaaccaga gatggcaaac 18720 cagggtcttc catactacct gccgatgcaa tcagcaggcc tcagcctccc aaacctacag 18780 cagctggccg cagccaatgg gatgcttctt cagccagccc tgatgccaag agattcaagg 18840 ttgaatctac aaatgattgg tgccaagaac aacgaagata accaatcaag acgcaacata 18900 agattaagcg agagtatatt ttaagaaact taaatgaaag agagattcgc taggtagtta 18960 taatgtataa taggattacg caaacacagc caacggctca ctacactaca cacaataatg 19020 taacaattac acgccagaaa aaaaaaaaaa ttgttaaccg ctcgaccaac ctaagcaaaa 19080 aaaaaaaatt gttaaccgct cgaccaacct tagcaaattt atgtgaccgg agaaaaacgc 19140 gtattacttt agaaattttt ttaacaaggg aattagggaa atagcatgta aatcttagcc 19200 taacctagca tattttgtat gaaactagac tcacttacct ttatcttgat tcgggactag 19260 attatactgt aaggacttag agtaaaaggg aaattcatat gagaaaaatc tcaaattttt 19320 tttacgtgga attttatttg catagtttta gtttgcgggc gagggcgccc ggtagcgacc 19380 ccaaggacg 19389 // ID hATm-18_HM repbase; DNA; INV; 2863 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2863 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1912-1912 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 319..2550 FT /product="hATm-18_HM_1p" FT /translation="MAVTRKKVSCVIFDAPSEMPDNQLPTYADVMKFYNLT FT KQKLKYELNNKDPPHHNVAEIVTTKVEQIWDKASIPHVSHRRVQEMLNKYH FT KTFMNLLKPYKSRKDSGPYREKISQFKNESKKLFDICSCKCSFISQCQCEK FT SRKIPAIERDFLEDQRGPRGMIIAKVDEAESIKLQKRYIRKELTQRQMCSK FT SDIDDISEKQSDVCEEFSSTEEKTDASDDESDDDCMPDPNEQFRKLSDKQM FT RVKLPSLALACDRHGVSDRAAAGIASAVLQDVGIIHERDVSKVIDRSKVRR FT ERCKKRGTLCANVSNTITGLYFDGRKDSTITLIKECDGKFHRKLTTEEHIS FT IISEPGSIYFGHVTPIAGSSKVIAAELIAYLSKKNVDLGGIKAVGCDGTVA FT NTGNRGGIIRLLEVALNKPLQWFVCQLHANELPLRHLFEHLDGPTNGPKGF FT SGPIGKSLVACEKLPVVKYAPINCVLPTGLNSDDLSTDQKYLFDLCTAVSV FT GVCPVGLSLRDPGVINHSRWLTTANRILRLYIATRCPSDELKTMVTFVVTV FT YAPAWFEIKNTSSCKDGARHVHNTILKSRYLTDELKKVVDPVIQRNAFFTH FT PENLLLAMLTDEKSEIRELALRRIMKSRKQKRTSSVRSFCVPVINFEATSY FT IDMIDWQKTPITEPPIVMDIDDDTFLTMIREEDTPRLDFARYPCHTQSVER FT HIKLVTEASQAVCGPEKRDGFIRARLESRSKMPKLDTKAQHHL*" XX SQ Sequence 2863 BP; 943 A; 530 C; 560 G; 830 T; 0 other; ttagggtgga gtgaaaacgt gaaaatattt tttttttgat ttgcttgact ttggatagtt 60 attaaaactt gctctgtgat acaatcaatt actccgtaaa aaaaaattaa tttaggtaac 120 gtcttgaggt tccacaacca acttcaaaat ttaaccattt ataagcattt actatccggt 180 atttttttat tgtagttcac ttttatacgt attcacgtgc gtttttcctg tctagagtaa 240 acaaacagcg tttattgcaa aacattttat tcatataaga taatacttta attcatttat 300 aataacaatt catatttaat ggctgtaacc aggaaaaaag tatcttgtgt catttttgat 360 gctccctctg aaatgcctga caatcaacta cctacttatg cagatgttat gaagttttac 420 aatctgacaa aacagaaact taaatatgaa ttgaacaata aagatccacc acatcacaat 480 gtggcagaaa tagtgacaac aaaagtggaa caaatatggg acaaagcttc tataccacat 540 gtaagccacc gccgtgtcca agaaatgttg aacaagtacc acaagacgtt catgaatttg 600 ctgaagccat ataagtctag aaaagactcc ggaccatacc gagagaaaat atctcagttt 660 aaaaatgaat ccaaaaaact ctttgacatt tgctcctgca agtgttcttt catatctcag 720 tgtcagtgtg aaaagtcaag aaagataccc gcaattgaac gtgattttct agaagatcag 780 cggggtccac gaggtatgat cattgctaaa gttgatgagg ccgaatccat caaattgcaa 840 aaacgatata ttcgtaaaga gttgacgcag aggcagatgt gcagcaaatc tgacattgat 900 gacatcagtg agaaacagtc agatgtttgt gaggagtttt cttctacgga agagaaaact 960 gatgctagcg atgacgaatc tgatgacgat tgtatgccgg atcctaatga acaatttaga 1020 aagttgagtg acaagcagat gagagttaag ttaccatctc tggcgttagc atgtgatcga 1080 catggtgtat ccgacagggc cgcagcagga atcgccagtg ctgttcttca agatgttgga 1140 ataatccacg aaagggacgt ttctaaagtg attgacagaa gtaaagtgcg acgagagcgt 1200 tgtaaaaaac gtggaacatt atgtgccaat gtcagtaaca caatcactgg gctatatttt 1260 gatggaagaa aagacagtac aataaccctt attaaagaat gtgatgggaa atttcatcga 1320 aaactaacaa ctgaagaaca tatatccatt atatccgaac caggatccat ctattttggc 1380 cacgttacac caattgcagg gtcaagcaaa gttattgcgg ctgagttgat tgcatatttg 1440 agtaaaaaga atgttgacct gggtggtata aaagctgtag gttgtgacgg aaccgttgct 1500 aatacaggca atcgaggagg tatcattcga ttacttgaag tggcgttgaa caaaccgctg 1560 caatggttcg tatgtcagct acatgctaac gaattaccgt taaggcattt gttcgaacat 1620 ctagatggac ctaccaatgg tccaaaagga ttttctggtc caatcggaaa atcattggtg 1680 gcatgtgaga agcttcccgt tgtcaaatac gcacccataa actgtgttct accgacaggt 1740 ttaaattctg acgatttaag tacagatcaa aaatatctgt ttgacttgtg caccgctgtc 1800 agtgttggcg tttgtcctgt aggcctttca cttcgcgatc ctggtgttat caatcattca 1860 agatggttaa caacggctaa tcgaatacta cgcctgtata tagccacgag atgcccatcg 1920 gatgaactga agacaatggt taccttcgtc gtaacagttt atgcaccagc atggtttgaa 1980 ataaaaaata cttcttcatg taaagatggc gctcgacatg tccataatac aatactgaaa 2040 tcacgctacc tcacagacga gctgaagaaa gttgttgatc cagtaattca acgcaacgca 2100 ttttttactc atccagaaaa cctgttgctt gcgatgctta ccgacgaaaa atcagaaata 2160 cgcgagttgg ctctgagacg tataatgaaa tccaggaagc agaagcgcac gagttcagtt 2220 agatctttct gtgtgccagt aatcaatttt gaagctacca gttacataga catgatcgat 2280 tggcaaaaaa ctccaatcac agagcctcca atcgtaatgg acattgatga cgatactttt 2340 ttaaccatga tccgggaaga agatacacca agactggatt tcgcacgcta tccttgtcac 2400 actcagtcgg tggagaggca tattaagcta gttacagaag cttcccaagc ggtttgcggt 2460 ccagaaaaac gcgacggatt cataagagca cgtttggaat cgagatcaaa aatgccgaaa 2520 ctggacacta aagcacaaca tcatttgtag ctataaaaac atgtttgact agagttttca 2580 tgtgtttttc ttttcaaata tttagttcta tttacttgta actttcgtat agactgttaa 2640 aaatgtatac tgctacacaa atatgttgtc atatataaac ttgaaatttc aaaaaatatt 2700 gttttttatt tgaaatatgc ctataattca ggtcaattgt ggaacctcaa gatatgcttt 2760 aaatcgtttg attgtttgcc acattactaa ttgcatatgt cagcaaattt ttatagctat 2820 ccaaccaaga tattcaacaa aaaatttttt cactccaccc taa 2863 // ID TC1A_HC repbase; DNA; INV; 1590 BP. XX AC AF099908; XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 10-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Mariner-type DNA transposon from Haemonchus contortus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; TC1A_HC. XX OS Haemonchus contortus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Strongylida; Trichostrongyloidea; Haemonchidae; Haemonchinae; OC Haemonchus. XX RN [1] RP 1-1590 RA Hoekstra R., Otsen M., Lenstra J.A. and Roos M.H.; RT "Characterization of a polymorphic Tc1-like transposable element RT of the parasitic nematode Haemonchus contortus."; RL Mol Biochem Parasitol 102(1), 157-166 (1999). XX DR EMBL/GenBank/DDBJ; AF099908; Positions 1 1590. XX FH Key Location/Qualifiers FT CDS 286..1314 FT /product="TC1A_HC_1p" FT /translation="MARHTGIRNLRQDQVDAIIRSFHAGLTSRQVSEIQGV FT TIRCVQRIWKKYKDTGSVEVKKHPGAARTTSRLVDRNIVRLARNDPRLTAA FT EILREISTPEGSNLSLSTVQRRLREAGLFGRRPAKKPLISAKNRKARLDWA FT QAHKNWTVRQWRKVIWSDESKFLLFGTDGIKFVRRPVGTRYHPSYQLPTVK FT GGGDSVMVHGSFCGKGAGPLHRIEGKMDAKMYLNIMETVIWPFVRSTARRG FT FIFQQDNDPKHKSKLLTKWFRDNNVPLLMWPSLSPDLNATENLWERLKHQV FT KGLRARNEHEKFNQLKTAWENIPQEEIDKLIESMPCRCQAVIDARGHATKY FT " XX SQ Sequence 1590 BP; 471 A; 345 C; 379 G; 395 T; 0 other; cagtaccggt cagtgaagta ttcgttttca ctttttcaaa cataacttcc tcaaaactca 60 agctttctgc tagaaactca aacagtagaa aacttaggca gttagtaata ttatactgct 120 gtaatggccc agcagcacat ttccaagcaa aactggcgct ttattttcaa atcgcccgcc 180 aacacacatt tctaccttct ataactggtg ggtttttata ttaaaaattt cgggtggttc 240 ttaatgtttt gagtcacgtc atgcacccac tacgttatta aggctatggc aaggcatacc 300 ggaatccgta acttgcggca agaccaagtt gatgccataa ttcggagctt ccacgcaggc 360 ttaacatcaa ggcaagtttc agaaattcaa ggagtgacaa ttagatgtgt tcaaaggatt 420 tggaagaaat ataaggacac agggtcggta gaagtcaaaa aacacccagg agcagcgaga 480 acgacgtctc gtcttgtgga caggaacatt gtgcggctcg caaggaacga tccgcgtctc 540 actgccgccg agattcttcg cgaaatatcg acgcctgaag gatcgaacct atcattgagc 600 actgtacaac gtcgactgag agaggccggt ttgtttggac gacgtccagc aaagaaacca 660 ctgatttccg ctaagaatcg caaagcgcgt ctggactggg ctcaagctca caagaactgg 720 accgttcggc agtggcgtaa ggtcatctgg agcgatgaat cgaagttttt gctgtttggg 780 acggatggaa tcaagtttgt gcgacgtcct gtcggtacca gataccatcc gagttaccag 840 ctgccgaccg tgaagggtgg tggagattcg gtgatggtcc acggtagctt ctgtggcaaa 900 ggagctggtc cccttcatcg gattgagggt aagatggatg cgaagatgta ccttaatatc 960 atggaaactg ttatctggcc gtttgtccgc agtactgctc gccgtggctt cattttccaa 1020 caggacaacg acccgaaaca caaatcgaag ctgctcacca aatggttccg ggacaacaac 1080 gtccctctgc tgatgtggcc gtccctgtct cccgatttga atgccaccga aaatttatgg 1140 gaaagactca agcatcaagt aaaaggcctc agagctcgaa acgagcacga aaagttcaat 1200 cagctgaaga ctgcatggga gaacatccca caggaagaga tagacaaact catcgagtcc 1260 atgccatgcc gatgtcaagc ggtcattgac gccagaggtc atgcgacaaa gtattaatgt 1320 cgtagttgtt atgttgatat tatacaagtt gttccgctga gaatttggaa atccagacag 1380 aataaagatt ttacaggtat gatggtgttt ggacacaaaa tgatgtgaat tctcttaaaa 1440 tatttcataa tccacttaga cagctgagaa attgctcgct catcacattc attgataggc 1500 attaacaatg aaaaaatcag gcgaaaagat ggacatttga ggaagttatg tttgaaaaag 1560 tgaaaacgaa tacttcactg accggtactg 1590 // ID Gypsy11-LTR_Dpse repbase; DNA; INV; 817 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy11_Dpse; KW Gypsy11-I_Dpse; Gypsy11-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-817 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1082-1082 (2009). XX DR Genome; Unknown_singleton_87; Positions 20889 21705. XX SQ Sequence 817 BP; 209 A; 217 C; 235 G; 156 T; 0 other; tgtgaggacc catactggga ccctcacata gtaacaagaa taataataat aatattaata 60 attggattac tcacaaccat atgctatagt agtaaataag ctagttatgt tttgcatagc 120 cgaaatcccg ccaaacggcc agctcacacg gaacaaccgc aaagaagacg gcgaggagag 180 agagccatgc tcagccggca gcagagcgga tctctttcgg aaaagtcccg ttggcccgaa 240 aacgagcgcg aggcgtggcg agaacgtcga gcgaccgtag cgagagaggg agtcgattct 300 gggaagtgag cgaagacgga acgtcgcgga catcgaagtg aagagaaaaa aaaagtgttg 360 aacggccgcg cgccagagag gagagaaaag catggaggag ctgcgggaac acaggcgcgg 420 gatccgcggg tgagctggcc gctggccaag gtagtagtag gagtaggagg gagtgcgagg 480 ggaaccaatc caataaaact tcgatcatag aattaagaac gcgttggctt ccatttattt 540 tctctggtcg gaccaccgac tgcggtcatc tctccacccc ctccccaccc gttttcctga 600 ctgttctcgg tcactacgct cccgctctga ctctggcaac agggttttac cctgctggag 660 tgagaagtgg tacgcgtttc gacagaggcg cccccccctc atccttttcc ttcacctccc 720 cggacccgcg gcccacttcg accttggcaa caggggtttt accctgctga agtcggggag 780 tgcagcgtct cgggtgagtt tcgcgccgcc cggccca 817 // ID Gypsy-38_DWil-I repbase; DNA; INV; 3062 BP. XX AC scaffold_181152; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_DWil_; KW Gypsy-38_DWil-LTR; Gypsy-38_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3062 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181152; Positions 2783 5844. XX CC Positions [387-893] - Reverse transcriptase CC Positions [1815-2294] - Integrase core CC 'TAATG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3..1718 FT /product="Gypsy-38_DWil-I_1p" FT /translation="MRATCSSCLLDEDLLLGEEFCKIAEVKINRDGIIVKK FT SDSIELETEISVLMKINASNVNEIDLNIEANASNSAKYKVRELIKNYKPVK FT AKSTNVKMRIVVNDDTPIYSRPRRIGYNERSVVDQKVDEWLKEGIVEHSAS FT EYSSPVVLVKKRDGSPRLCIDYRRINKVILRDHFPLPLIEDQLDRLVSATI FT FSTIDLKNGFFHVDVEEESRKYTSFVTHNGQYQFLKVPFGLANSPSVFQRH FT INAIFRELTRKNIAIPYVDDIIIPAKDEDEAISNLKEVLKQCENYGLMINL FT KKCSFLKKRVEFLGHEIQDQKICISEEKTKAVTKFPQPTNQKQLQSFLGLV FT GYFRKFIKNFGLITKPLTNLLKQNARFHFNDFEIESFKRLKILITERPVLK FT IFNFNHETELHTDASIDGFGAVLLQVSPEDGQLHPVYYMSRITTDAQRKFS FT SYELEILAVIEALNKFRVYLLGIRFKLVSDCNAFTKTMEKRDLGTRVARWI FT LMLQEIDFEVEHREGTRIRHADALSRNPVMLITDGSAISKLKYLQEHDDEI FT KAIIGHFSVRKTKDIIAKEYYIPKL" XX SQ Sequence 3062 BP; 1109 A; 565 C; 622 G; 766 T; 0 other; taatgagagc gacctgctct tcttgtcttc ttgatgaaga cttgctgtta ggtgaagaat 60 tttgtaaaat tgcggaagta aaaataaatc gtgacgggat aattgtaaaa aagagtgaca 120 gtatcgagct tgaaacagaa atttcggtgt tgatgaaaat aaatgcatca aatgttaacg 180 agattgattt aaatatcgaa gctaatgcaa gcaattccgc aaagtataaa gtacgcgaac 240 tgataaaaaa ctataaaccc gtaaaggcaa agtcgacaaa cgtaaaaatg cgaatcgtag 300 ttaatgatga taccccaatc tactcgcgtc cgcgtagaat cggctataac gaacgttccg 360 tagtagacca aaaagttgat gaatggctaa aagaaggtat agtcgagcac tctgcgtcag 420 agtacagtag ccccgtagtc ctggtaaaaa aacgagatgg ttctccaaga ctatgtatcg 480 actatagaag gattaacaag gtaatattac gagaccattt tcctttaccc ttaatcgaag 540 accaactgga cagattggta agtgcgacga tattcagcac tattgacctg aaaaatggat 600 tcttccatgt tgatgtcgag gaggaaagca gaaaatatac atcttttgta acccacaacg 660 ggcaatacca gtttctgaag gtaccatttg gactagcgaa ttctcctagt gtttttcaaa 720 gacacatcaa tgctatattt cgagagctga ctcgtaaaaa tattgcgatt ccatatgtag 780 atgacatcat tattccagct aaagatgaag acgaggccat atcaaatctc aaagaagtcc 840 tgaaacaatg tgaaaattac ggcctgatga tcaacctaaa aaaatgcagt tttttaaaga 900 aaagagttga gtttttaggg cacgaaatcc aagaccaaaa gatatgtatt tctgaagaga 960 aaaccaaagc tgtaactaaa tttccccagc ccacaaacca gaagcaactt caaagtttcc 1020 taggattggt tggatacttt aggaaattta ttaaaaattt tggattaata actaagcctc 1080 taacgaatct acttaaacaa aatgctagat tccactttaa tgattttgaa atcgaatcat 1140 tcaagaggtt aaaaatattg atcacagaga ggcccgtatt aaaaatcttt aactttaacc 1200 atgagacgga gctccacacc gatgcttcga ttgacgggtt tggagcagta cttttacaag 1260 tctcacctga agatggacaa ctacaccccg tatattatat gagcaggatc acaacggacg 1320 cacaaaggaa gttttcaagc tacgaactgg agatactagc cgttattgaa gctctcaaca 1380 aattccgagt ctacttactt ggcatccgct tcaaactcgt aagcgactgc aatgctttca 1440 ccaaaaccat ggagaaacga gaccttggca cacgagtggc ccgatggatc ctaatgctgc 1500 aagaaattga cttcgaggtg gagcatcgag aaggaacacg tattcggcat gctgacgcct 1560 taagccgaaa tccagtaatg ttaatcaccg atggaagcgc aatcagcaaa ctaaagtatc 1620 ttcaagaaca tgacgacgag ataaaagcaa ttatcggaca tttttctgtt aggaagacga 1680 aagacataat tgcgaaagaa tattatatac cgaaacttta ggccaaagtt gaacgacata 1740 ttaggtgctg cattccgtgt attatctcaa accgcaaaca aggaaaaaag gagggactgt 1800 taaatccatt gcctaaggaa gaagaaccac tgcaaacatt ccacattgat tttttgggtc 1860 cgcttgaatc tactgacaag agatacaaac acatcctagc tgttatagat gcattcacca 1920 aatactgctg gttatatacg actaagacga catccgccca agaagttata tcgagattgc 1980 aagcacaaag tctcacgttt ggaaacccag tccagattat tacagataga ggatctgctt 2040 tcacagcaga agactttaag gagtattgcc gttcagagaa tatacttcac cacgctgtta 2100 ctaccggact gcctagaaca aatgggcaag tagagagact taatgctatt attattatat 2160 ttgttttatc caagctttct gttgatgacc caagcaaatg gtacaagttt gttggtagag 2220 ttcagcaaac aataaattca acccactgta gaagcacaaa tattacgccg ttcgagttat 2280 tagttgaagt caaaatgcgt acgaagtttg atatccaatt gaagcaaatt attgatcagg 2340 aaatgatagc tatgtttaac gaaaaaagag atgacctaag gaagcatgcg aagcagcaaa 2400 tcataaatct acaagaggaa aacaagaaca cttataatct acgacgaaaa catgcatcaa 2460 tttataaaaa aggcgatctt gttgcaatta aacgtactca attcggaggt ggactgaagt 2520 taaagccata gtatttaggt ccttatcgag taactaaagt caaatcaaag gatacttatg 2580 atgtcactaa agatgcacac tttgtagatg gtccgaaaaa tacgactact tgcgccgaat 2640 atatgaagaa atggatccct tatgtagacg atgatgaaga tattgctgat aactaagaaa 2700 acttacgaaa ggaaaggaca atgctgttaa cgatgataaa tacaatgatg aaaatgatga 2760 ttacaacgac gtttatgatg atgactactc cgaagatgac gataatgatg attacaacga 2820 tgataatgag aatgacgatt acgaagatga tgacgacgat gctgaaaacc aatgatgacc 2880 accatgaagg cgatgcggaa aaccacgaat ttaatgatga cacctctaac gacgatacag 2940 acgactccaa aactagagtg atgaagacga cgaaaatgaa gatgactcca aagccgatga 3000 tgacgacttt gaagacgacg acgacgcatt tggggcaaat gcatttaagg atggccgatc 3060 tg 3062 // ID Gypsy10-I_Dya repbase; DNA; INV; 7347 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10_Dya; KW Gypsy10-LTR_Dya; Gypsy10-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-7347 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1077-1077 (2009). XX DR Genome; chrU; Positions 1150746 1158092. XX CC Positions [4809-5234] - Reverse transcriptase CC Positions [6597-6899] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 4626..6593 FT /product="Gypsy10-I_Dya_1p" FT /translation="MDKARLTNVINCFPSFSQDGLGKTNLISHSIDVGMAK FT PIKQRHFPVSPAVEKAMYSEIDRMLQLGVIEESDSSWSSPIVIVTKPGKVR FT ICLDSRKVNSFTEKDAYPLPQISGILSRLPKAEYITSLDLKDAYWQVPLEK FT ASRDKTAFTVPGRPLYQFKVMPFGLCNATSTMSRLMDKVVPAHLRNEVFIY FT LDDLLVVSSSFERHLEVLREVALQIRRAGLTINIGKSHFCMLRVRYLGHII FT GDGGIRTDPEKVAAIRNFPLPKTLRSLRSFMGLCGWYRKFVPNFASLAAPL FT TDLMTTKRRFSLTEAAVESFEKLKQCLSEAPVLCSPDFSKPFAIHCDASKT FT GVGAVLAQVSEEGDERPIAFISKKLNKAQQNYTVTEQECLAAIVALKSFRA FT YVEGHRFKIITDHASLKWLMSNHDLNSRLARWALALQRYDFAIEHRKGSMN FT VVPDSLSRVNEDAVTAIDLREGLLVDLNSDHFKSAEYTELVQKVSANQKNF FT PDLRTDSGYIYRKAEHLTGEQVHVEYAWKLWVPKELVSEVLSHAHDNSLSA FT HGGIHKTIERVRRYYFWPGLVADVKAYINACEVCKTTKAPNYVLRPPLGKA FT PESQRFFQRLFIDFLGPYPRSRSGNIGVFIVLDHFSKYVFLKPVKKIDSGV FT VIKYLEG" XX SQ Sequence 7347 BP; 2251 A; 1393 C; 1540 G; 2163 T; 0 other; agtgtttaat tggccaacag cgcgcatgcg caataaccta cagtggccag ctttggcgtt 60 tcatgaaaca tttaatgcaa attggcagtg ccaaaagtga ccccaaattt tgacccctag 120 ttgacccctg ttgaccccag cacccatgtg ggattaacct aaaaagattc tgaaagtgaa 180 cctaagtgta gatccgagca tcctatgtcc gactttaaat ggtcttgtac attaaagcaa 240 aacccttaaa tgggttttaa cccagttctg ctgaattaag atttcttgta attttcgcca 300 caatttttga cactcatatg aaatgttttt tttgtgaatt tgattctggc aatagaggcc 360 atgaatagtt gatgaaaacc ctgaaaaatt agaatacatt tatttttata atttaagtta 420 agttagttaa agtctacgaa aatttctcct gttgcgaagt gaaatgaaat gtgtagaaga 480 atgcctattg actctattgt ttaaaccaat gccttaaaac cccaaagtgg cctgccaaag 540 ccatataatt ttcagcacac cttcgaggga gatcgatctt tcgccgatgg ataagatgcg 600 tgagttgata taaccttgtt ttaagcgcta accaaaactt aagagatatt tattaaatgc 660 caatgatata aaagaagttg aaaacgaatt aaatgccgca acaactgaaa tttggctatc 720 gctgtaataa attaaatata tttttataat ttgtcgtcaa ttaaaatgaa gtttaattga 780 atttaattaa aagaaatttc acaatgaagt taaacgagaa gttgtctatc aatgaattaa 840 attagataat caaaactatt tcttattaat ttaattctta ttacaattaa catttatatc 900 aaaatcccac attttctttt actacatatt attaaatcat gcgatacata aatattctta 960 taactatcta agcttataat cacactacct aaatacgatg ttagtcttaa ttactttatg 1020 tttattaaat ttaatgctta acgaaattat gagcaactaa acatttactt taccgactct 1080 actgatgact ccacggtcgt aaggtttcaa gggatcatcg gagccattct cccgcggaag 1140 atgatcatag gttgggtggg cctcaagcga gcacaacaac atccgtgtct tcgcaaatga 1200 tagttttctc tctgtctttt cctctatctt tcgtcctttc ttctaacttt cattttctag 1260 aatttattta tttatttttc ttccggttta gttctagctc ctattttttg tcgcaagcac 1320 gctcaattct tgtctttatt aatagtaatg tgtagtccgt gtccgagatc tgggtgagca 1380 agaccaccac ccccaaagcc gaagagctag agccggaatc taaagcgtgc cccgcagctc 1440 agatcctcga aagcctgcct aacgcaagga gttataattc cttcttgtac tgaggacagt 1500 ggcgcactac gttacacact acagttataa tttattatta ttaataagaa aagtggcgcc 1560 caaacgtgtg gagccttagt agaaaaataa ttttttttat taagactcgc gaataaataa 1620 atttattttc tttaacttca gttatcaaga attctcggat tacagttaag tgaatattag 1680 tattcctagc tgtccagact taaatcgggt agtgcatagc aaaaaaatga ttttttatta 1740 ggaatcccag ataaacaaag tctttctttc ttttatctga ataattcttg gaaaacattt 1800 gaggtgaata tcagtattcc caactgtcca gtcttacaaa gaagctaaag attcttccta 1860 agatgtaggt gagattacaa ggtgtctatt taaaatgagg aaaactgagg tggtcatcga 1920 aggctcatct tctgtcccat attatagcac taagtcattg aactaagtct acgctaaaac 1980 tcaacgctac ggaaaatact ggcaatttct tccgagtcgt gccatagcca gagagagtta 2040 atgcaaattt tcctttattt tttttatttt tctagaagtt ctatattttt tgatggtatt 2100 ctggaataca tgcgtttctg tttttgaacc aatcgctgtc cgaattcgaa atcctaaact 2160 atatatcttc cctccttaat aggtcaaacc ttaacaaaag cctattactt tccaatcttt 2220 caatttctga gaatattaat tatcattaag aaaacgatac cttgcgatac gtgcgacatc 2280 tagtatagag ttcggtaaat acatatacat atttttgatt actttatcat attttacttt 2340 tcgaattttt gggttgaatt atcgaaacat tttctgtttt taattacaaa acagagcatc 2400 aatttctaaa agaacaattt cgattgtagt tggttaaccg ttgtcggaac gtgaatattt 2460 ttattttttt tttgcactca aaccagattt ttatatcttt tgggaatgtt ggggttagac 2520 cgatctcctg aaagaggggc ggactcaagc ccaaaggttg tttgtccgtt atgtacagca 2580 gatataaatc ctgctgaggg ttttataacc agttgtcaac acgagttcca taagcggtgt 2640 atagaagctt attggaagaa gagcgcacaa tgtcctgtgt gtaaagtagc tagtagacca 2700 gagatcccga agcctcaaga agtaactgag agacaaactc gaagtgggtc gagaaaacga 2760 cagcccgtag atacagttgg tcagcaagtc tcaggtaata tagcttcgac ctcgggacaa 2820 agtacgcaag ctaataacat tgggggcagc aattcgaata gtttaacaga taatgctatg 2880 atattactag atatggaaca aaggttatta gctacgcttt cggagaagat gactgactta 2940 attcagaatt ccataaccga aactataacc agggcgttag caagcaccag tggtggtaat 3000 gacaattctc agagggaaca tttttcacat cgatcgctta gcaatggaga tagggtcgaa 3060 acaagaaatg aaaccggtag caattcaagg caagtcctct cggctccggg ctctcccgct 3120 tcacagagaa gcactacgtc cgatttggta aaccgaccag ataaggttgt acatatcctc 3180 aacggttgga agataaaatt ttctggaaaa ggagtatatg ttgataattt tatttataga 3240 gtaaatactc taacatgaga gacgttagac aataatgtcg atctgctttg caaacacgcc 3300 agcgtacttt ttgaaggaaa agccaatgag ttttattggc gttatcataa agcgcacggc 3360 gatattcagt ggaattcttt ctgtacagct ttgcgtctcc agtttcgcga tagcagagac 3420 gattgcgata tcgaagaatt aatcaggaat tctaaacaaa agcaaaacga gtcttttgat 3480 agtttttatg atacaatatc aggtttagtc gatcagctgg agcatccttg gactacgagt 3540 aaattaattc gagttctcag aaacaatttg cgtccagaga taagacatga gatcttaaac 3600 attgatatta aggcggtttc agatctcaga cagatttgtc gcaggcgcga aaccttcttg 3660 gccgatgtta agcgggttag cggctacgtg cgaagcactc cgtttaaacg ggaagtggct 3720 gaatttagcc aggagtatga tacgcagtta gaatcagaac ctgaaaatga ggctgatgtc 3780 gaagcgttct ccttattatg ctggaattgc cgcaaggaag gacaccgata tcaggattgc 3840 gtatcggaga gaagaatatt ctgttacggt tgtggagctg caaacactta caagccaagt 3900 tgtaataagt gttcaaaaaa ctccaaagcc ggcacgtcga agtcgcaatt caaacagaag 3960 acttcgattg cctcccggag tcaatccaca atgaccgaag agtaagagat gcagcaaatg 4020 ttgcacaacc tccaatccct gacgaacaac aactacctgc tagtcctttg ctacatagca 4080 aaatccaaac actgaaggaa attaaacatg aagagactta tttacaaaat attgggaaac 4140 gaaaggctcg aaattctagt cgaattaaag cattctggca aaatgtcaag agctgtagaa 4200 ttggagtcaa ttatataaag tcgctacctg aaggaccaaa agatcctcgt ccgtttttgc 4260 caattcgatt gtttaatcgt acggtttacg gtctgttaga ctctggcgca tcggcaagtt 4320 gtgtaggagg taacttagcc cgtgaagtag aagctgcagg aaatagcagt gccattaata 4380 gtagtgctac aacggctgac ggccgatcac agcagatacg aggacatatt caaacagagg 4440 tagaatacgg cgaccagaag aaggttttaa aaatttacat cgtaccgtcg cttaaacaag 4500 acctatattt agggatagat ttttggaagc tttacgattt acttcccaga agacttatca 4560 tatctgagct agattcaact gaagttgatt tcaaagttaa agttgaccaa cataagttat 4620 ccgagatgga taaggcgagg ctaacgaatg tgatcaattg ttttccatca tttagtcaag 4680 atggtcttgg taagaccaac cttatatcgc actctattga tgtaggtatg gcaaagccaa 4740 taaaacaacg ccattttcct gtttcaccag ccgtagaaaa ggcaatgtat tccgagattg 4800 acaggatgct acaactagga gtgatagaag agtctgacag ttcttggtca tctccaatcg 4860 ttattgtaac gaaacctgga aaggttagaa tttgtctaga cagtcgtaag gtgaatagct 4920 tcaccgaaaa ggatgcatat cctctgcctc aaattagtgg cattttgagt aggttgccga 4980 aggccgagta cataactagc cttgatttaa aggacgctta ttggcaagtg cctttggaaa 5040 aagcttcgcg tgacaagacg gcgttcaccg ttcctggtag acccctctat cagtttaaag 5100 ttatgccgtt cgggctatgc aatgccacaa gtacaatgtc tcgtttgatg gacaaagtgg 5160 tgccagctca tcttaggaac gaggttttta tttacctaga tgatctgctg gtggtttcat 5220 caagttttga gaggcacctc gaagtgctaa gagaggttgc gctacagata agacgtgcag 5280 gtttgacgat caatattggc aaaagtcact tctgcatgtt gagagtgcgt tacctaggtc 5340 acataatcgg agatggagga atccgtaccg atccagaaaa agtagccgca attaggaatt 5400 ttcctttgcc taaaacgttg cgcagtctgc gaagtttcat gggtttatgt ggctggtatc 5460 gcaagttcgt tcctaatttt gcgtctcttg ctgctccttt aactgatcta atgaccacga 5520 aaagaagatt cagtttgact gaagcagcag tagaatcttt tgaaaagcta aagcaatgtc 5580 ttagcgaggc tcctgtttta tgcagtcctg atttttcgaa accgttcgca atacattgcg 5640 acgcgagcaa gacaggagtt ggagccgtat tagcccaagt ttcggaagaa ggagatgagc 5700 gtcctattgc cttcatttca aaaaagttaa ataaggcgca acaaaattat acagtcacag 5760 agcaagaatg tctcgccgcg atcgtggctc ttaaaagttt tagggcgtac gtggaaggac 5820 ataggttcaa gatcataact gatcatgcgt cactaaaatg gttgatgtct aaccatgacc 5880 ttaattcgcg cctagctcga tgggctttgg ccttgcaaag gtatgatttc gcgattgagc 5940 atcgcaaagg ttcgatgaac gtagttcccg actccttgtc tcgggtgaac gaagacgcgg 6000 taactgcaat tgacttgcgg gagggacttc tcgtggatct aaattctgac cactttaaat 6060 cagcggaata caccgagtta gtgcagaaag tgagcgcaaa tcagaagaat tttcctgatc 6120 ttaggaccga tagcggttac atttatcgga aggctgagca tttgacagga gagcaggtgc 6180 acgtcgaata tgcctggaag ctatgggtac ctaaggaatt ggtctctgaa gtactgtctc 6240 atgctcatga taattcgttg tctgctcatg gtgggataca caagaccatt gagagagtta 6300 gacgttatta tttctggcct ggacttgtag ccgacgttaa ggcctatata aatgcgtgtg 6360 aggtgtgcaa aaccacaaaa gcacctaatt acgttttgcg accaccgtta ggaaaagcgc 6420 ctgagagtca aaggtttttc caacgcttat ttatagactt tcttggtcct tatccaaggt 6480 ctagaagtgg gaacattgga gtttttatcg tccttgacca tttctccaaa tatgtctttc 6540 ttaaaccagt aaagaagata gattccggtg ttgtcataaa atatcttgaa ggataattat 6600 ttatgtcgta tggcactccc gaaacgatag tgtcagataa tggatctcaa ttccgttcac 6660 acgcttttca aaagttaatg cagctgtatg gcattactca tacatttact gctgtacatt 6720 cgccacaggc aaatgcctcc gagcgcgtga accgatcagt tattgctgca attagagcct 6780 atgtgagacc cgatcaaaaa gactgggacg aatacttaaa caagattagt tgtgcgttga 6840 gatcgtccgt acattcaagc ctaggaactt ctccttatta tatggttttc gggcaacaca 6900 tggttacctc tgcttctacg tactctttgt tgaggaagct caacctattg gatgatcgat 6960 ctttgaagtt caatcggtca gactcgtttg atattgtacg ggagaaagca tgtaagaaga 7020 tgatggaaaa acactccgag aatgaaaaga agtacaatct tcggtctcga gttgttgctt 7080 atacggaagg gcaagaagta ttccgaggaa attttaagca aagctgtttc caaacaggat 7140 acaacgcaaa atttggtcct tcttttgtga aagcaatggt tcgcagaagg ctaggaagcg 7200 cctactacgt gctagaatat ttacagggaa gactaatagg tacataccat gcgaaggata 7260 ttcggcaatg acgtgtcttc agtcgtgtca tccgtctagg gtaccagtcg aacatatacc 7320 ctagccagat gttaccgggg gggcttt 7347 // ID Art1_Cis repbase; DNA; INV; 3223 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; Art1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3223 RA Smit A.F.; RT "Art1_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC 16 bp TIRs; 0.5% div; ORF from 435 to 2960 encodes transposase CC 37% identical (56% similar) to the Arthur1 transposase in CC mammalian genomes. XX SQ Sequence 3223 BP; 976 A; 581 C; 692 G; 974 T; 0 other; cagtagcgga ttaagggggg ggggctaaca gggctgcagc ccggggcccc catcattaag 60 gggcccccaa ttcctaacta gatcatacga ttaaattatt atcggtactt caataactgt 120 ttaaatttat actgattgaa cgttatactt tgtaataagg ctttgttgcg tctatatgta 180 cattgttacg tcacacacta ctgttttttt cgaggacgca attgcgagcc ttggtcttgt 240 catattttca gtattagacg tgaacaaagt gtaagttaac aaagtttaca tttacagttt 300 aattttagat taaattagtg caggcggtgt aacttttgat ggtcaaaact ttcgcccggt 360 atgattaggc tagccggtag cctgttagcc aacagatgaa atgggtgtta gtcttcagca 420 tgaaatttgc ctaattttgt atttttgtta aaccggtggc ccatgagttt ttatacttta 480 ctcttaggat gcagcgtaga tatcctagtg gagcacaaaa gcgtgcagct gctttagaaa 540 agaagaaaaa agaatcacat cttgctgaaa agattcctaa acttacaact ttccttaaac 600 taagtagaag tacaacagca ggtgaggatg aacccatttc gcctaagttg gacattggaa 660 atcagactat acatggatct tctatgcatg tattggatga ggatgttaat gcttgttctg 720 aacctaaaag tgtaatggat gatgttatta gtgatgatct taacttttgg ccacaaaaaa 780 tttctgaaaa cttgagaaat tactggatac agcgtggttc atcttcatgt cagcataaaa 840 atcatggttt taaagaatca gtggtacagt tggagaaaga atcacaacgc cgatactgta 900 gtatagcttt gtttcagcgc gttcatgcaa gaacaggaga acagtttgat agaacttggt 960 tatgctattc taaaacaagt ggacgagttt attgttttgt atgcaagcta atgtcaagta 1020 cagtgagtaa gctgaccagt ggattaaata attggaaaca tgctcatgaa atacttatga 1080 atcatgaaaa ttctaaacag caccttgatg ccatggcagc tttatgtgct cgaaagtcat 1140 ccagtcagat agacagcgat ctggtgaagc agtatgaagg cgaagtccag tattggcaag 1200 aacttcttcg gaggctagtc agtgttgtga aatttctctg tgtccgtggc ctcgcgtttc 1260 gtggcaagaa tgaactaata ggttcatcca acaatggtaa ttatctaggc atacttgagc 1320 ttctgagtga gtacgacacg tttctagctg aacatattag taagcatgca aacaagggtc 1380 gaggccatgc gtcttattta tcttcaacta tttgtgagga gttgatcgag ctcatgggtc 1440 aaaaggtttt atgtgtaatt attgatgaaa tcaaaacagc taagtatttt tcaatatctg 1500 ttgactctac tcctgatatc atgcatgtgg accagctcac agtggttatt cgatatgttc 1560 tacaatctgg ccctgttgaa cgattcttaa agttcatacc gatatattct cacactggat 1620 cagaaattgc tcgaataatt cttcagttcc tggaagaaaa tggcataaac attcaaaact 1680 gtcgtggcca gtcatatgat aatgcttcta acatgagcgg caagtacaaa ggcgtgcaag 1740 caatcatccg tgaaaggtgc agtgttgctt attacatacc gtgcacagcg cactctttaa 1800 atttgattgg aaaatgtgca gcagaatgtt gccccaatgc agttgaattt tttaatcttc 1860 ttcaaaatct ctattcatgg tttgtatcat ctacccaccg atggcaggta caccgaaagc 1920 atctaagggg gttgcctgtt acaaaggcct tgtctgacac aaggtggtca gcacggtatg 1980 atgcggtacg ggctttaaat aaagggtacc atgagaatat gtcagcacta gaagaactgc 2040 aatctgatga aaatcagccg cgggacagta ggcttgtggc tgaaggattt ctaaagaagc 2100 ttcagcagct ggaagttgcc attttggtgg aagtttggga cacggtctta gaaagatttc 2160 aaaaaaccag tttaactctc caaggaagta aactccctgt gaattcagca gttcatttgt 2220 tggaatctct cctcgaattc gttagaagcc agagatctga atttgaatat tatgaagata 2280 aagggatagc gaagtctgtt gtcaaagtgt acaaagaaac ttcaaaacga gaaaagacga 2340 ggaagcgcca attcgatgag ggttgttcga aagaaacaat attctcaccc agagaacgat 2400 tcaggtgtga ggtatatttg ccaattatgg acgcactgtc atcagctgta caacatcgtt 2460 tgcactcgta tatgcacatt cagaacaagt ttggattcct atccaacata tgtgagctga 2520 gtaaagacga cattcgtcat gcagcaatcc agttaatgga aacatatgta aatgattttg 2580 aagactgctt cccatcggaa atgattcatt tttctgaatt ttacaaaacc gttagtggtc 2640 aagagaagaa aaaatcacgg aactcaagta cagagataga aatgcttttg ctgttgaatg 2700 aaaatatgtg tagccacatc tttccaaacg ttcacatcgc acttcgcatt tatctgtgca 2760 tggcaccatc caactgctgt ggagagcgat ccttttcgaa attgaaaaga ataaaaaatg 2820 aggcgaggaa ttccatgggt caagagcgac tcaattttct ctcacttatg agcatcgaaa 2880 atgatgttgt tgactcactt tccttcactg atttgatccg cgacttcgcc ctcagaaaag 2940 ctcgcaaaaa atctatctaa cacattattc gtaggatgaa tgcttattag cttgatgaat 3000 ttgtatattc gttcgttgta tatgtgaatt tcattgtaat ttttgatagt attgtattaa 3060 tgattgatgt tatctgatgc tttgtttcaa taacacaaaa tgttcagttg gttcaggtcg 3120 tgttttcagc ctattaacgt taattttata tttatgctat atatcgatag ggcccctgtc 3180 gaaatttcag ccaggggcct cgtacaacct taatccgccc ctg 3223 // ID Kolobok-1_BF repbase; DNA; INV; 6258 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 01-MAR-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6258 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 116-116 (2007). XX DR [1] (Consensus) XX CC Kolobok-1_BF is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the lancelet genome in a CC last few million years. The Kolobok-1_BF transposon is CC characterized by 212-bp terminal inverted repeats, TTAA target CC site duplications, and it encodes two proteins: (i) the 835-aa CC transposase, Kolobok-1_BF1p, composed of the THAP DNA-binding CC domain and catalytic "DDE" domain, which is conserved in all CC Kolobok transposases, and (ii) the 281-aa Kolobok-1_BF2p protein. CC The second protein is conserved in highly diverse Kolobok CC transposons present in the genomes of vertebrates (frog, fish), CC chordates (lancelet, sea urchin, sea squirt), and cnidarians CC (starlet sea anemone). See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS 6025..5372 FT /product="Kolobok-1_BF2p" FT /translation="MERLYREEFDPANIIAYNFQPRRQAPAPALGGSGEGD FT VGPLPVDPVAPPSEPECLGAHSTAYWDGEDVETEEWRLDDFLWCRCTNCRP FT MTTVRECVCCHDLTEAETKGVGQWDGIHCLRDHPDFSAVVLNKAVLDAALN FT FRVDIKLEPLRDEYPPRTYRLQAYRQCTAWLHQRLGRKIRRVLPSCAVWAI FT REAYPEPAGGNYRGFLDADDEIYDYIYE" FT CDS join(416..778,1374..2888,3132..3360,3555..3701, FT 3926..4176) FT /product="Kolobok-1_BF1p" FT /translation="MPSCVSCHNNNINTKSKAISFHRFPSEDKIRLQRWLI FT AVRSNLREPWTLEKIQDSIASKNPAFVCSEHFSADSCIDNPKARYVPYSVP FT AKVLLQDAFPTQFGRSTSRVSSERQREKRDRKELLQKLLQQSNDACDHQMT FT EPMSSVDETGMTEASDAGAASPSAPQTKFNSGIVTSQGLLDSGVSTSSLLS FT DSGSSTAQSMLDSGVSTSSLLSDSGSSTAQSMLDSGVSTSSLLSDSGSSTA FT QAEVAHGYGLKFHTYCKPPSAEPDKFVTASTQTELTGEMIESLMSSTLCST FT PKAKANMRELSFATLSPVASPSDKNDPTFNLSSFETDMDDTDTDYEYDSEE FT EEEGNGSQFYIVHRSKLLERFQTCECGQPLAVWNMKSTGSMLAIEYECSSC FT SNRGTWHSQPKIGSMAAGNLLIPAAILFTGGTYKKFADICDTLRLQKFSES FT HYNNVQRTYLLPAVNDYYLNEQQLILRRFQATAEEEAQQVTLLGDGRCDSP FT GHCAKYCSYTLMEEKTQFILDFQLAQVTETGTSQAMERHAFEKSLEFVRDN FT GIDVECIVTDRHRGIGASLKQRNNRHINHQYDVFHMAKSIQKKLSKSAKRK FT ANRALGPWIKFIKNHLWYSSSTCEGDDVLLQEKWLSLIDHIANRHTFRKNQ FT LFKKCAHHRLTPDEKENITWLRPGSAPHRAMREIVSNKTFVKDMAHLTGFK FT HTGVLEVYHNMLTKYTPKRLHFPYLSMRARLQLSVLDHNNNVFREQAKTLE FT GDPRWSVVYPKRTSLWVARKLFEAKTYEYRQQLMEEVVRRKESGQWKYGQH FT VLPDPEMPANIAPVERGSKQEAVDAYRTRFRPR" XX SQ Sequence 6258 BP; 1803 A; 1360 C; 1357 G; 1738 T; 0 other; aggggagcac cagggattag caggagtttt tgtttcatta acaatatatc ccaataaaaa 60 cgtttttgca actgtcgctc tggtccaccg tacttgttgt ttttgccggc cgtctcaagt 120 taccgcctgc ccccggctcc ggggctgcca ttttggcagg ggaatcccaa catccgggac 180 cttattgacg tcactgtgaa tggaatcaac actgtcactt tggagtgcgt tcgcaaacca 240 ccggttttcg cttgattttc gccctgaagt cgccgtagag gcagtgccgt aaaggcgtgt 300 acttgaatct tcctatgatt tgcacagtta aaggccccgg ctcgagcggg agtgagtgtt 360 tatttttacg ccagcttcgc ggcccggaag tggggagggg gagtgccgcg ccattatgcc 420 gtcgtgcgta agctgtcaca acaacaacat caacacgaag agcaaagcga taagcttcca 480 tcgtttccct tcggaggaca aaatcagact acagaggtgg ctaatcgccg tccggtcgaa 540 tttgagggaa ccatggacac tggaaaagat tcaagatagt atcgcctcga aaaacccggc 600 attcgtctgc tccgagcatt tttcggcaga ctcttgtatc gacaacccca aggcgagata 660 tgttccctac tctgttcctg ctaaagtcct tcttcaagac gccttcccca ctcagtttgg 720 gagaagcacg tcacgcgtgt cgtcagagag acagcgtgaa aaaagggatc ggaaagaggt 780 atgtaattac gccgtgattt tgccgaattt atctgcatgt aggatgtgtt tgttttgatc 840 ctgaaaagta gtgatataat ttacctgcaa ggtttttgta acatttatat gatatatcgc 900 gtcatcgaag acaaaccaat gtaactttca acttgacttc cacatttttg gtaactttac 960 atgtattttt aaggattttt attctacttt tcctggaatt gttatgttat cataatatga 1020 catacaaatt acagtgtaac atgataatac caaacatttt gatgttttgg ttttgttgtg 1080 ttgtctacat ttatcatgga taatttttgt gttttatact tatggatttt ctaatgtatg 1140 gatgtaatct aatgtaaaaa ttttttttta aaatttttta tgcaaatgtt ttgttatgtc 1200 catagcatag ctataatgtg tctgtgtgtt tggtggattg attctggaaa aattcagtcc 1260 agttttattt tgaagttctt tggatggatt cttaaattga tatttgtacc tattttttat 1320 taaatattgt catttctttt atcataagtt atcttttatc atccatcttt cagcttcttc 1380 agaagttact tcagcaaagc aatgatgcat gtgatcatca gatgactgag cccatgagca 1440 gtgttgatga aacaggaatg accgaggcaa gcgatgctgg tgctgcatca ccatctgcac 1500 cccaaacaaa gttcaattca ggcatcgtta catcacaagg attgttggat tcaggcgtgt 1560 ctacatcctc gctcttatcc gactcaggtt catctacagc acaatccatg ctggattcag 1620 gcgtgtctac atcctcgctc ttatccgact caggctcatc tacagcacaa tccatgctgg 1680 attcaggcgt gtctacatcc tcgctcttat ccgactcagg ctcatctaca gcacaagcag 1740 aagtagcaca tgggtatgga ctgaaattcc atacatactg caagccacca agtgcagaac 1800 ctgacaagtt tgtcactgct tctacacaaa cagagctgac cggtgagatg atagaatcac 1860 taatgtcctc aacgttatgt tcaactccga aggcgaaagc caatatgcgt gaactgtcat 1920 ttgcgacatt gtctccagtt gccagcccgt cagataagaa tgaccccaca ttcaatctat 1980 caagttttga gacagatatg gatgatacag acaccgacta cgagtatgac agtgaagagg 2040 aggaggaagg aaatggctca caattctaca tagtacacag aagcaagctt ctcgaaaggt 2100 tccagacatg tgagtgcggc cagcccctcg ctgtttggaa catgaaatca actggctcca 2160 tgcttgccat tgaatatgaa tgttccagct gctccaaccg aggaacttgg cactcccaac 2220 caaagattgg cagtatggct gcaggtaacc ttctcattcc tgcagccata ctgtttacag 2280 gtggtaccta taagaagttc gctgacatat gtgacactct aaggctgcag aagttctcag 2340 agtcacacta caacaatgtc caaagaacat acctccttcc agcagtcaac gattactacc 2400 tcaacgaaca gcaactcatc ctcagacgtt tccaagcaac agcagaagag gaagcacagc 2460 aggtcacatt gctgggagat gggaggtgtg actccccagg tcactgtgcg aagtactgta 2520 gttatacact gatggaggag aagacacagt tcatcctgga cttccagctg gcacaggtga 2580 ccgaaacagg aacctcccag gcgatggaaa ggcatgcctt tgaaaaatca ctggagttcg 2640 tccgggacaa cggtattgat gtcgagtgta tagtcactga cagacaccgg ggtattggag 2700 catcacttaa gcaacgcaac aataggcaca tcaatcacca atatgatgta tttcatatgg 2760 ccaagtctat tcaaaagaag ctgtcaaaat ctgcaaagag gaaggccaac agagccctcg 2820 gcccttggat taaattcata aagaaccacc tttggtatag ctcaagtacc tgcgaaggag 2880 atgacgtggt aagattactt ctataaatgc ggttaataag taataaattc ccagccaaac 2940 atcagaggcc ctattattat tgcagccatc cttgttgaaa gttaacatac atgccatcat 3000 atggacagta gatgtaatat tggtcaacag aaaatactga acattttgag taacagggct 3060 ttaaagtgac atgtgattaa tctggtggat tttgcccttc taaactagta ttctttatta 3120 tgtccatgta gctcctacaa gagaagtggc tctctctgat tgaccacatt gccaaccgcc 3180 acacattcag aaaaaaccag ctgttcaaga agtgtgctca ccatcgactc acccctgatg 3240 agaaggaaaa catcacttgg ctgagaccag gaagtgcccc acaccgtgcc atgcgggaga 3300 tagtctccaa taagactttt gtcaaggaca tggcacatct gacagggttt aagcacacag 3360 gtaattatat attgtcctca gaatgtccca ttttttgtac tataactgag gatataagta 3420 gccaaattca tatgtgttta gataaatgtg ttagaggata catgaagtgt gtgtatgtct 3480 attgtattac tttggattcc aatgttgtaa aggtattcaa atgtacagta tgaaaataat 3540 gttccgtttt acaggggtac tggaggtcta ccacaacatg ctgacaaagt acaccccgaa 3600 gcgcctacat ttcccctacc tgtccatgcg ggctcgtctg cagctttctg tgctggatca 3660 caataacaac gtgttccgcg agcaggcaaa gacactggaa ggtgagatca caacttttgt 3720 acaatatata gacattacca atagagaaga ataagcatac tgtgtttgtt acaattcatt 3780 gttcatatat gttagtcagg gtttgtttga gaaaaagtaa cttatctatg atacactgaa 3840 ttaccatttg ctgtctccat gtgctgtaac aatttgctgt gacaatcatg tatgccatac 3900 ttttcttctt atgtcactgc cacaggggac ccgagatggt ctgttgtgta tcccaaaagg 3960 accagtctgt gggtagcccg gaaactgttt gaggcgaaaa cctatgagta ccgacagcag 4020 ttgatggagg aagttgtgcg caggaaggag agcggacagt ggaagtatgg acaacacgtc 4080 ctgcctgatc cagagatgcc tgcaaacatt gcccccgtgg aacgtggcag caaacaggag 4140 gccgtagatg cataccggac cagattcagg ccgcgttgaa ctgtagctgc ataagacagc 4200 atttttgctg tatgtagttc ttgataatga tatacatgta caagcttgtg tagctttgta 4260 tgatatagct aaggtcaaaa catttttttt gacaagtcag aactattata ggtatgttat 4320 aatataccaa catgctgcat taatgagaaa gacacttgat gaagaaacaa tactttttac 4380 agaggaatgc tgacatgtat cagagtgtga tagtgtatac atgaacattt tttagaaaag 4440 ctctaggcaa actatctagc ctgtcacatt agataatctg acaaagtgta actagtgata 4500 cctggaaaat acaaatttat gtaggttaca caatgctttt aactttctac cttccatcta 4560 acttttactg acttggtgta aaatgtaagt tgcattttac tatacaaaca ttagtattta 4620 tagaggaatg ttgacaatat tgtatcagat tgtgatagtg tatacatgaa catgttccag 4680 aaaagcacta ggcaaactgt cacattaaat aatctgacaa agtgtatcta gtgttacccg 4740 gaaaatacaa atttatgtag gttacacagc tctttaactt tttaccttcc atctaacttt 4800 tactgacttg gtgaaaatgt aagttgcatt ttactataca aacattagta tttatagagg 4860 aatgttgaca atattgtatc agattgtgat agtgtataca tgaacatgtt ccagaaaagc 4920 actaggcaaa ctgtcacatt aaataatctg acaaagtgta tctagtgtta cccggaaaat 4980 acaaatttat gtaggttaca cagctcttta actaaaataa agagtattag caagttacat 5040 cctatctttt cctaattcaa acattagaaa aaatgctgat ggtgaaacag gttaacaaga 5100 atctagtaat ctggtgtatt aataataata attatctagt ccagttaaac atcaacttgc 5160 ataattatat tcgaaaattg catctttacg gacttgacag caaagtgtat acaaactata 5220 tacatatata acgtgtatat aatgtcaagg agcatgaaag gggtactcaa aaaaaaattt 5280 taacacaact tcaaaacata ttgtgagtga gtgtttggtc cgcacataac tttaaaaaca 5340 ttatgtagaa ctctcagaaa cgtccgcctc attcgtagat gtagtcgtag atttcgtcgt 5400 ctgcgtcaag aaaacctcgg tagttgccgc cggccggctc ggggtaggcc tccctgatcg 5460 cccagacggc acaagacggc aggacacgcc ggatctttct ccccagccgc tgatgcaacc 5520 aggcagtgca ctgccggtac gcctggagac ggtacgtgcg tggtgggtac tcatccctca 5580 gcggctccag cttgatatca acccggaagt tgagtgccgc gtccaggacg gctttgttca 5640 gaacgacggc gctgaagtcg ggatgatccc gcaggcagtg gatgccgtcc cactgcccta 5700 cgccctttgt ctccgcttct gtcaaatcat gacaacagac gcactcgcga actgttgtca 5760 tcggccggca gttggtgcac ctgcaccaca aaaaatcatc gaggcgccac tcctccgtct 5820 ctacgtcttc cccgtcccag tacgccgtac tgtgtgctcc caagcactcc ggttcggaag 5880 gtggggcgac aggatcgacc ggcagcggac caacgtctcc ctccccactt cctcctaacg 5940 ccggcgccgg cgcctggcgc ctgggttgaa aattgtaggc aataatgttg gccgggtcaa 6000 actcctctct gtaaagtctc tccatgattt tgtcgaacgt gatgttgtgt tgattccatt 6060 cacagtgacg tcaataaggt cccggatgtt gggattcccc tgccaaaatg gcagccccgg 6120 agccgggggc aggcggtaac ttgagacggc cggcaaaaac aacaagtacg gtggaccaga 6180 gcgacagttg caaaaacgtt tttattggga tatattgtta atgaaacaaa aactcctgct 6240 aatccctggt gctcccct 6258 // ID BEL-127_AA-I repbase; DNA; INV; 5849 BP. XX AC supercont1.130; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-127_AA_; KW BEL-127_AA-LTR; BEL-127_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5849 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.130; Positions 368850 374698. XX CC 'CAGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 793..3216 FT /product="BEL-127_AA-I_1p" FT /translation="MKTRRIDDCDDPIDAGKKNQSSKQSPVCSRLGMERNS FT VSSIRSHVDSVSQDGLGRRQVGPTKAQLAARKGLTFKLPKFSGKPAQWPLF FT YAAYNASNDACGYMNHENMMRLQEALEGDALELVSGQLLLPESVPRVIEKL FT RRHYGRPEQLLESLLEKVKRLDPPKPDSLRSFVPFGNTVEQLCGHLEAADL FT RQHLVNPLLIKSLVAKLPDREKREWVHYQRGRGEATLRTLTDFLMNIVADA FT CEANVDVEFKPSHQPRGGSHSEKLRVKEKGRLYAHAEVSSPAVNASEEKLL FT RPCQKCRRTDHRLRHCAEFKKLRYTERLKLATREKLCHVCLNEHDGKCRFK FT IRCNIGECREFHNPLMHPVGNVVGINAHIRTNCTVMFRIVPVQLHCGGKST FT TVLAFLDEGANVTLMEKALADRLDAVGGDQVRLTVKWTGNISRTEESRRMN FT LWATGTTAGANNKMMLYNVHTIGKLMLPRQKLDYEELAAQHSHMRGLPIES FT YDGRPQLLIGANNIYSFAPMEAKVGTPMEPIAVRTNLGWTVYGPRQSTTVA FT TGGFFGYHQQVTNEDLHELLKSHYALEESVVVIPQETAEEKRAREILERTT FT KRVGDRFETGLLWKTEDPRFPDSYLMALRRMKQLEKRLEKNPVLHQNVCKQ FT INEYQQKGYAHLATAEELANTPSDQAWYLPINVVLNPKKPEKVRLIWDAAA FT TVLGVSLNSQLLKGPDMLVPLVKVLSGFREWRIAFGGDLKEMFHQLKIREE FT DKQKQRFIFRKNPADPPSTGTQKSSPHNIPKRLRPSRTDITSMTTLIALTQ FT WKRRLP" FT CDS 2982..5333 FT /product="BEL-127_AA-I_2p" FT /translation="MEDCIRWRPEGNVSPAEDSRRRQAEATFHLSEKPGGS FT TKHRNAEEFATQYPEASAAITHRHYVDDYFDSVDTVEEAVTLAQDVRSVHQ FT KAGFEIRNWVSNSPEVLLKLGEEKPATSVHFGRDKQTSKEIVLGVIWDPEM FT DQFSFSTKHREDLIPYLYEGQRPTKRLVASCVMGFFDPLGLLSPFTIHGKI FT IIQHLWRSNCDWDQEINSNSWELWKRWTSLLPEVEAIRIPRCYLGCAKSAE FT VDALEVHIFTDASEHGFGCVAYLRAVIRGEVHCSLMMSRAKVAPIKRQSIP FT RLELMGAVLGARMNQTVLSTHSYQINRTVFWTDSRTVCSWLNSDQHRYKQF FT VAFRVGEIQELTKVADWRWIPTKLNIADVLTKWGQGPPLQSEGEWFNGPSF FT LYQPPEMWPTQESTVEETEEEARGVVLFHEVIDVQPISRWTKLLRVTANVI FT RFIANCRRKREGEPIVVTRATAKQRQLMGSTAVKYASVTAPVGQEELQQAE FT TILWRQAQWDSFPDEMSALTGNLKRAPDTPMETVKKSSQIYKSSPVLDDEG FT VLRMEGRLANSEESSFDKKHPIILSRFHEVTQRLIQHYHETFGHANSETVF FT NEMRQRFQIPKLRPAIQQVVRKCVWCKVNKCRPRSPRMAPLPVERITPHLR FT PFSSVGVDYLGPVEVTVGRRREKRWVAVFTCLAVRAVHLEVVHSLTTESCR FT MAITRFQSKFGKPEQIFSDNATCFRGASNEMVRMEKINQECAEVVSSSSTV FT WNFIPPGTPHMECGNAWCGRWKRRCVRLTTEGN" XX SQ Sequence 5849 BP; 1582 A; 1434 C; 1741 G; 1092 T; 0 other; atctcaaaaa gaagtgacgc aaaccaagtg tccctgtgag aagccggacc tgccccaaga 60 tgccttccga tgctgaatcc gtgaaggatc aaaccctcct ggatctcacg ggaactccgt 120 gtgggatatg cggtccgtcg acatgcgacg aagcgatgat cggttgcgat ggatgccccg 180 gttggttcca cgttcgctgc gtgggcctta cggaaggtaa gctgcctaaa aagtggtact 240 gcaaaagcga agcctgtcaa gagaaggctc aggagtacca gaagcagaag gacagcaaaa 300 agccgacgcg caaccggaag caaaccaacg agtcggaaag ttccaacgtg tccaacgtcg 360 gaacgaaggt gcgcgcactc gaggatcggc aaaagcagca gttggcagag ctggaggcgg 420 ccatgcagct gaaaaggaag gaaaaggagt tccagcgcgc gctcgagagg aagaaaatgg 480 aaatggaaga ggagatgcgt gctgaggagg aggaggagca aaaggcttgg caagcggaaa 540 tgctccagcg taaaagggag catattcaga ggatggaagc caatcgaaaa tcattcgaga 600 tgcagatggc ggatatggac aagaagttgg aagagctttc ggtcccccat gtgccacaat 660 cggggtcaaa gctcgtcggc agcggtacgc aagcaggaga accttccgga aaaatcaaca 720 aaaacgtgtt gaagctgacg aaggccaacc taaaaagact ggcggaggat accgacgagg 780 acaccgacga agatgaagac aaggaggata gatgactgcg acgatcctat cgacgcgggt 840 aagaagaatc agtcgagtaa gcagagtccg gtttgctcga ggctgggaat ggagaggaat 900 tcagtttcat cgattaggag tcacgtcgac agtgtgagcc aagacgggct ggggcgtcgg 960 caagtagggc cgacaaaggc gcagctggct gcgagaaagg gcctgacatt caagctcccc 1020 aagttctcgg gcaagccagc acagtggccc ctgttttatg cggcctacaa cgcgtctaac 1080 gatgcgtgtg gttatatgaa ccacgagaac atgatgcgac tgcaggaggc gctggaagga 1140 gatgccctgg agttggtatc aggccagctt ctcctcccgg agtcggtccc gcgggtgatc 1200 gaaaagctac gacggcatta cggacggccg gagcagttgc tggagagtct attggagaag 1260 gttaaacgtt tggatccccc aaagccggac agtctaagga gttttgttcc atttggaaat 1320 acggtggagc aactctgtgg tcatttggag gccgctgacc tgagacaaca tctcgtcaac 1380 ccactactca tcaaatcgct ggtcgctaaa ctcccagacc gtgagaagcg agagtgggtc 1440 cattaccaga gaggtcgcgg tgaagcaacg ttgaggacac tgacggattt ccttatgaac 1500 attgtggcag atgcctgcga agccaacgtt gacgtggagt tcaagccatc gcatcagcca 1560 agaggaggtt cccattccga gaagctaaga gtgaaggaga aaggtaggct ttatgcccac 1620 gccgaagtca gcagtccggc cgtcaacgcg agtgaggaga agttgctgag gccctgccaa 1680 aaatgccgac gaacagatca tcgcctgcgg cattgcgcgg aattcaagaa gctgcggtat 1740 acggaaaggt tgaaattggc aactcgcgaa aaattgtgtc acgtctgcct caacgagcac 1800 gacgggaaat gccggttcaa gatccggtgt aacatcggcg aatgcaggga gttccacaat 1860 cctctaatgc acccagttgg aaacgtcgtt ggaataaatg cgcacattcg gacaaactgc 1920 acggtcatgt tccggatcgt tccagtccag ctgcactgcg gagggaaatc aaccactgtg 1980 ctggcgttcc tggacgaagg cgccaacgtc acgttgatgg agaaagcgct cgctgatcgc 2040 cttgacgcag taggaggaga tcaagtgcgg cttacagtca agtggaccgg aaatatttcc 2100 agaacggagg aatcccggag aatgaacctg tgggcgacag gcacgacggc cggtgccaac 2160 aacaagatga tgctgtacaa tgtccacacg atcggaaaac tgatgctgcc gcgccagaaa 2220 ctggactatg aggagcttgc agcacagcac agtcatatga gaggattgcc aatcgagtcc 2280 tacgatggaa ggcctcaatt gctcattggg gcgaataaca tttattcctt cgctccgatg 2340 gaggcaaagg tgggcacgcc gatggagcca atcgccgttc gaaccaacct tgggtggacg 2400 gtttatggac cacgacaatc aaccactgtg gcgaccggtg gctttttcgg ttaccaccaa 2460 caagttacca acgaggacct gcacgagctg ctcaaaagcc actacgcgct ggaagagtcg 2520 gtggtagtaa ttccacaaga aacggcggag gagaaacggg cccgtgaaat attggagcgc 2580 accaccaaac gtgtcggcga ccgcttcgaa accgggttgc tgtggaagac agaagaccca 2640 cggtttccgg acagctatct catggccctg cgaaggatga agcagctgga gaagcggcta 2700 gaaaagaatc cggtgctgca ccaaaatgtc tgtaagcaga tcaacgagta tcagcagaaa 2760 gggtacgctc atcttgccac cgctgaggaa ctggccaata caccatccga ccaggcctgg 2820 taccttccga tcaacgtcgt cctaaatccg aagaagcccg aaaaggttcg tctcatctgg 2880 gatgcggctg cgacggtcct aggggtctcc ttaaactccc agctgctaaa aggaccggac 2940 atgctcgttc cactggtcaa agttctctcc ggctttcgcg aatggaggat tgcattcggt 3000 ggagacctga aggaaatgtt tcaccagctg aagattcgcg aagaagacaa gcagaagcaa 3060 cgtttcatct ttcggaaaaa cccggcggat ccaccaagca ccggaacgca gaagagttcg 3120 ccacacaata tcccgaagcg tctgcggcca tcacgcaccg acattacgtc gatgactact 3180 ttgatagcgt tgacacagtg gaagaggcgg ttaccctagc ccaggacgtg cgatcagttc 3240 accagaaggc tggcttcgag atccggaact gggtatccaa ttcaccggaa gttctcctaa 3300 aactgggaga ggaaaaacca gctacgtcgg ttcatttcgg ccgggataag cagacatcga 3360 aggagatagt tctcggagtg atatgggacc cggagatgga ccagttctca ttctcaacga 3420 aacaccggga agatttgata ccgtacctgt atgaaggaca acggccgacg aaaagactcg 3480 tcgcaagctg cgtgatggga ttcttcgacc cgctcggact gttgtcaccg ttcaccatcc 3540 acgggaaaat catcatccaa catctgtggc gatccaactg cgactgggat caagaaatca 3600 attccaattc ctgggagctg tggaagcggt ggacaagctt attgccggag gttgaggcca 3660 tccggatacc ccgctgctat ctcggctgcg ccaagtctgc agaagtcgac gcgctggaag 3720 tacacatctt taccgatgcc agcgaacatg gattcggttg cgttgcatat ctgcgagcgg 3780 tcatcagagg ggaggtccac tgcagcctaa tgatgtcacg ggcaaaggtg gcaccgataa 3840 aacgtcagtc gattccacgt ctggagctga tgggtgctgt tctgggagcg cggatgaatc 3900 aaacagtcct gagcacgcac tcctaccaga tcaaccgcac cgtcttttgg acagattccc 3960 gcactgtgtg cagctggctc aattccgatc aacaccggta caagcaattc gttgcattcc 4020 gggtcggtga gatccaagag ctgacgaaag tagcggactg gcgatggatt ccgaccaaac 4080 tcaacattgc tgatgtgctg acgaagtggg gacaaggccc gccattacaa agcgagggtg 4140 agtggttcaa cgggccgtct ttcttatacc agccgccaga aatgtggcca actcaagaat 4200 cgacggtcga agaaacagaa gaagaagcga gaggtgttgt actgttccac gaggtcatcg 4260 acgtccaacc aatttcacgc tggacaaaac tactgcgggt gacggccaac gtaatacgct 4320 tcatcgccaa ctgccggcga aagagggaag gagagcctat agtggtcaca cgagccacgg 4380 ctaagcagcg ccagttgatg ggatcgacgg cagtgaagta tgcgtcggtg acagcaccag 4440 tcggccaaga agagctccag caggcggaaa caattctttg gcgtcaggca cagtgggaca 4500 gttttccgga cgaaatgagt gcgctcactg gcaacttgaa gcgtgcgccg gatacaccga 4560 tggagacggt caagaagagt agtcaaatct acaaaagttc accggttctg gacgacgaag 4620 gagtcctgcg catggaggga aggctggcca actccgagga aagttccttc gacaagaagc 4680 acccgataat tctgtcgcgg ttccacgaag taactcagag actgatccag cactaccacg 4740 aaacgtttgg gcacgctaat tctgagaccg tgttcaacga gatgcgacag cgtttccaga 4800 ttccaaagtt gcgcccagcg atacagcagg tggtaagaaa gtgcgtgtgg tgcaaggtca 4860 acaagtgtcg tccacgatca ccgagaatgg ctccacttcc agttgagcga atcactcctc 4920 atcttcggcc gttcagttcg gtaggtgtgg attaccttgg tccggtggaa gttacggttg 4980 ggcggcgtag ggagaagcgc tgggtagcag ttttcacctg tctggcggtg cgggcggtgc 5040 acctagaagt cgtgcacagt ctcacgaccg aatcatgccg aatggccatc actcgtttcc 5100 agagcaaatt cggcaagcca gaacagatct tctccgacaa cgcgacatgc ttccggggag 5160 cgagcaacga gatggtcagg atggagaaaa taaatcaaga gtgtgcggag gtcgtcagca 5220 gttccagcac cgtctggaat ttcattccac ccggtacccc acacatggag tgtgggaacg 5280 catggtgcgg tcggtggaag aggcgatgcg tgcgtttgac gacggaagga aattgaccga 5340 cgagatcctg gtgacaacac tagccaaagc tgccgacatg atcaacacac gtcctctgac 5400 gtacctgccg caggagtcgg gcgaaagtga agcgctgacc ccgaatcatt tcctacgggg 5460 aacggtgtct ggcacagatc tgaaggtgga cgaaggatcg acgaataccg cagaatcatt 5520 gcggatcata tacaaacgct cgcagttcct cgccgatcga atgtgggaga gatggagtaa 5580 ggagtacctt ccgacgataa atcaacggtc gaaatggttc aacgagcaga agccgctgat 5640 agagggcgac ttggtattcg tggtcgacgg aaagaaccgg aagtcctgga aaagaggtgt 5700 agttgaggcg gtgatcaagg gctcggatgg aagaattcga caagtggacg tgagaacagc 5760 cgatggtaag gtgctaagac gaggcgtggt caacctagcg gcgctggaga tcaggtaaat 5820 ccgggaattc cggatgttac gggctgggg 5849 // ID NONAUT-3 repbase; DNA; INV; 2609 BP. XX AC BN000787; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni NoNaut-3 LTR retrotransposon (EST). XX KW Gypsy; LTR Retrotransposon; Transposable Element; Fugitive; KW NONAUT-3. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-2609 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000787; Positions 1 2609. XX FH Key Location/Qualifiers FT CDS 161..2608 FT /product="NONAUT-3_1p" FT /translation="MDPSKLELFLEQQMKLIQMLTETKITTNNQPSSSNPT FT TTAPSVDGIASSISEFHYDPESNVTFDMWFRRYEDLFKFDFANQDDAWKVR FT LLLRKLGPSELDKYCNLILPLNPRDRSFSDTVQSLSQQFGDNSSLFNTRYR FT CLKLTMNEDTDFLTHVGIVNRECERFWLKSLTEDRFKALILICSLQSQKFG FT DIRTRLLSRLDQDPKLTLNDIANEYQRLVNLQHDTTMVQRGVSDRREVHVV FT QQQTKPVKPAPAIPSSSHSNSVQQTKTNPPAPCWHCGAWHYVRSCPYKQHR FT CRKCKAVGHKHGFCLKRKPNSSRSTRTPVSRSCTNSLSLVATCQSSTPVEI FT TFITVNINEQPFRLQIDTASDVTILSQKSWIKMGRPRMATTTQKPRTACGS FT YLRLLGQLHCEVSFRDSTFTGVCYIAPGDLNLLGLDWFDQLHLADVPLNTV FT YQLVKQPHDPEAYPKELVTKFSTVFQPVLGRCSAMKTTLRLKSGVKPVFRP FT KRPVPYAALQKVEEELNRLQREGVITPVSYSAWAAPIVVIKKANGAIRICA FT DFSTGLNAALEQHHYPLAVPADLFTMLNGGKFFAXLDLADAYLQVEVAEDQ FT ESYSLLLLIGGLFQYNRLPFGGQDRPIYFQQLMDTILSGIPGVATYLDDIL FT IVATTSEQLRERTTAVLQRVSDNGFRLRPEKCQLFLKSVKYLGFIFDAAGR FT RPDPENIRAIRTMPTPTNISTLRSFLGLVSYYSAFVPSMHDIRAPLNYLLN FT KDISWNWTKECENAFCKLKTIISSELLLTHYDPSLPIIVAADASAFGLGAV FT ISHQFPDATEKAIM" XX SQ Sequence 2609 BP; 716 A; 631 C; 545 G; 715 T; 2 other; acttcnccgc tccaacttgg acttcttact ctagttatct gggactgggt cgcgtcagta 60 tagctagtta tcggtctctg gtcacgtgtc tttgttccgt tcgctactag tgaattcgta 120 agcttcagat atatcaattg gcgacgaggc tggtgctaat atggatcctt caaaattgga 180 attattcctc gaacagcaga tgaaacttat ccagatgcta acagaaacaa aaattacgac 240 caataaccaa ccaagcagtt ccaatcctac cactactgca ccttcagttg atggtattgc 300 cagcagtatt tctgagtttc attacgatcc tgaatctaat gttacgtttg atatgtggtt 360 tcgacgttat gaagatttat tcaaatttga ttttgccaat caagatgatg cttggaaagt 420 ccgtttgctg cttcgaaagc tgggtcctag tgaacttgat aaatattgta acctgatttt 480 gcctctaaac ccacgcgacc gctcattttc agacactgtc cagtcattga gccaacagtt 540 cggagataac tcctcactgt tcaacacacg ctacaggtgt ttgaagctga ctatgaacga 600 agacaccgat tttctgacgc atgtgggcat tgtcaaccgc gaatgcgaac gcttttggtt 660 gaagtcgcta actgaagatc ggtttaaagc cctcattctc atctgcagcc tccaatcgca 720 aaagttcggt gacattagaa cacggttgct cagtcgattg gatcaagacc ctaagctgac 780 tctaaatgat attgctaacg aatatcaacg tctagttaat ctgcagcatg acactaccat 840 ggttcaacgt ggtgtttctg atcgccgtga agtacatgta gtccaacaac aaacgaaacc 900 tgttaaaccc gctccagcta tacctagttc gtctcactct aactcagttc aacaaacaaa 960 gacgaatcct cctgctccat gctggcactg tggtgcatgg cattatgttc gaagctgccc 1020 gtacaaacaa catcgttgca ggaagtgtaa agctgtgggc cacaaacatg gattttgtct 1080 aaaaaggaag ccaaatagct ctagaagtac tagaacgcct gtttctagat cctgcacaaa 1140 ttcgttaagt ctagtagcaa cctgtcagag ctcaacccca gttgagataa catttattac 1200 tgttaatatt aacgagcagc catttagatt acaaatagac acagcatcgg acgtgacaat 1260 tctttcacaa aaatcttgga ttaaaatggg cagaccacgt atggcgacaa caacacaaaa 1320 gcctcgtaca gcatgcggta gttacttacg tttgttaggt caactacatt gtgaagtgtc 1380 tttccgcgat tcgacgttca ctggggtatg ttacattgca ccgggtgatc ttaatttact 1440 tggattagat tggtttgacc aactgcattt agcagacgtc ccgttaaaca ctgtgtatca 1500 actagtaaag caaccccatg acccggaagc ctatccaaag gaactagtta cgaagttttc 1560 gaccgtcttc cagcctgttt tgggtcggtg ctcagccatg aaaacaacgc tacgattgaa 1620 atctggggtc aaacctgttt ttcgtccgaa gagaccagta ccatatgctg cattacagaa 1680 agttgaggaa gaacttaatc gccttcaacg tgaaggagtt attactcctg tctcttattc 1740 cgcatgggca gcacccattg ttgttatcaa gaaggctaat ggtgcaatcc gaatatgtgc 1800 agatttttcc acaggtctca acgcagccct agaacagcat cactacccgc tggcagtccc 1860 ggcagatttg ttcactatgc tgaatggcgg aaagttcttt gccaanttgg atcttgctga 1920 tgcctaccta caagtagaag ttgctgaaga tcaagagagc tactcactat tattactcat 1980 cgggggcctg ttccaatata atcgactgcc gtttggtggt caagaccgcc ccatctattt 2040 ccagcagcta atggatacca ttctatcagg tatcccgggg gtcgcaactt atctcgatga 2100 cattctcatt gttgccacaa cgtcggaaca gctacgagaa cgcacaacag ccgtgctcca 2160 acgcgtcagc gacaacgggt tccggttacg tccggaaaag tgccagcttt tcttgaaatc 2220 cgtgaagtat ttggggttca ttttcgatgc tgcaggccgc agaccagatc ccgaaaacat 2280 tcgagctata agaactatgc ccacacctac caatatctct actctccgat cattcctggg 2340 attggttagt tattactcag ccttcgttcc atcgatgcat gatatacgcg ccccactgaa 2400 ttatctcctg aataaggata tcagctggaa ctggacaaag gagtgtgaaa atgccttttg 2460 taagctaaag acaattatta gttcagaact actattaacg cactacgacc cgtctttgcc 2520 aatcatcgtg gccgcagacg cttcagcctt tgggctcggt gctgtcattt cacaccagtt 2580 ccccgacgcg acggagaaag ccatcatgc 2609 // ID BEL4a_Cis_LTR repbase; DNA; INV; 336 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL4a_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-336 RA Smit A.F.; RT "BEL4a_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 336 BP; 80 A; 45 C; 73 G; 138 T; 0 other; tgttactgca tcgctgcagt agttgttttg gttcagtgct ttttatggtt ggttgatttt 60 gttacatttt atttctatga ttgtccttgg cgaatttacg ttttgatttg tatagtcctt 120 ttttgtgtac gccggctgcg gggaagcagc aatttattca gtaaatggta aatggtcaaa 180 gaagttacgg acgtgcaatt gttaaggtat tattttgtta tggtttttct tttttataat 240 gtctcgtact aatttatgta ttttttagtt accgctaaaa caagtaataa aggaattgga 300 aatagcgcgt caacttcatt gcgtggcacc gtaaca 336 // ID P-12_HM repbase; DNA; INV; 3906 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3906 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 358-358 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1643..2608,2665..3465) FT /product="P-12_HM_1p" FT /translation="MYIKPSVSFRGGHLIGYSEDNPTKIAKTLLVFMIKPM FT FGKPSFVCRIVPIFKLSVHFLHDLIININKQIIDAGGELLCLLSDNHPTNR FT SVYQSFVVNKSFPWQGVIEKEKPVIMLHDPVHLFKSIRNNWFTEKTQTISL FT KINNQPLSGDWSHIVQLYNRQKICVAKTTKLSRKSVYPSLIDRQNVANMVA FT VFNEKTVAALRLANFENTACFLAPIVSLWNMLNVKRKGCDVLLNDINRSPF FT QTLDDLRFKEILAFADATSKMSEGKGLKRHNTFTPETKHALVNTLQGLVEL FT IKKLLSEDHQYVLPGIFQTDRLEGEFGIYRKTNISLIYLNVLFFNIHKFND FT TFLFDNCCFNYNLILGSTQFIWLYLFANFLLICRQLSGGNYFISAEQVLNS FT LHLQRLKLFSKIVSLDETDIHCSRIHSDCCTMDLMEEDIIYLDECLINADN FT ESINDQENAALFYMAGYVQHKIDPQRVVDTTTSQSASSEFTDYVSRGKLRY FT PGDTLLQFICVCYLLFNSVPMSEKRFQCVVYLKKLFLFLHSTLPFDLETDS FT IGISKVISIITNCFLKGVTTLDNTSDIVSLNIEDRKRKKLNT" XX SQ Sequence 3906 BP; 1325 A; 545 C; 597 G; 1439 T; 0 other; cagggactac taaaataacg ggacgcgaaa cttcaaatcg atgctaaaac gacgacaggg 60 ccgattgtaa atggcgatgc tagtttgatt taacaaagaa gttatgaaaa acttacatta 120 atctaattat attcacattt tagaatacat taaagtgata taaatttata ttactaatat 180 aatgtatata tttttcatca gtatcgtact gataaaatat agattaatgc aatagaaata 240 ttgggagata tttacaacat ttttaaaaaa aaaacctttt ttttttttga aaaaagttgc 300 tatttttatt gaaaaatcta aagtttttta atttttgaaa gtagtaagat taaaaaattg 360 tattaagtag tttttaattt gagtaattta ttttaagtaa tgaatctatt aagtgttaca 420 aaatgttatg tcatttataa taaataataa tggtgttact attattagtt ataatcatgc 480 caagaaaatg ttcaatagtt aattgtacca caaattattt aacatcaact tcagaagaaa 540 agcttcatgt ttacatgttt ccaagtgaca cacaaaagtt acaatattgg ttgtcatcta 600 ttccaaatgc gtttgaaaaa gtaacttcaa atatgggtgt gtgtgaaaaa cactggccac 660 caggctgcaa aatgcacaag cctcctcgat caaagtttga ggtaagcatc cttaaagttg 720 tttttattga taagttttac aatgactaat tttaataata tcttttttag gttccttttg 780 atcccccatc tatttttcct ggatgcacat cttcaatggt gcgccaaaca gtctcaacac 840 aaccaagatc tacaaatata aataaaattt ctttaacatc acgaaatgca atacctgatg 900 aaatggatga tttcaatcaa attgatttaa ttagatttta tgatgttgaa ataatgttgt 960 ctgaaataaa agaaaggtaa aattaatttt ttctccttgt tttatgatat ttgtttgttt 1020 tgatttaaag tttacttgtt attaaatcta ttttagatat cctcaatact ttgcattcat 1080 tgaaaattca aaagtcatat tgtttgatgt ttctcgatca tgtctacctg aaacaatatt 1140 ttctgttgaa attaactgtc attcacgtca agttaatatt tattgtaggc atacaaaagt 1200 taactgctat gacctgctca gtttttctca ttgtcttact cggtggtcac agcttgaagc 1260 tattgtcaaa agagtttgta cctctagtcc tgaaataaaa gatgaggtat taagtttgat 1320 tagtcaaatt gaaaatacat acagtgacac ttctgaagaa atttctttcc tcattgaaca 1380 actacgcctc accacttatt ctccaaatgg actgcgcttt tctactttat gtatacgtcg 1440 tgcacttgaa ctgtatttgt cctccagatg ttgttatcga catttaagaa atctgctttg 1500 tttgcccagt ccaaagactc tcatttcaaa attaggaaga gttggagaag ttggaaatga 1560 aaaagagtgc aataatacaa tcaaagttgt ttttgaaaat tttaatagtt ctgaaaagcg 1620 ttgctgcctt ctctttgatg agatgtatat taaaccaagt gttagtttca gaggaggaca 1680 cctcattgga tatagtgagg acaatcccac taaaattgca aaaacgttat tagtctttat 1740 gattaaacct atgtttggta aaccttcttt tgtttgtaga attgtaccaa tatttaaact 1800 gtcagtgcac tttcttcacg atttgattat aaacataaat aaacaaataa ttgatgcagg 1860 aggagaatta ctgtgtcttt tatcagataa ccatcccaca aatcgaagtg tttaccaaag 1920 ttttgttgtt aataaaagtt ttccatggca aggtgttata gaaaaagaaa aaccagttat 1980 tatgctacac gatccagttc acttgtttaa aagtataaga aataattggt ttacggaaaa 2040 aacacaaaca attagtttaa aaattaataa tcaacctctc tctggtgatt ggtctcacat 2100 tgttcaatta tataacagac agaaaatttg tgttgccaaa actacaaagc tttcacgcaa 2160 gtcagtgtac cctagtttga ttgatagaca aaatgttgca aacatggttg ctgtatttaa 2220 tgaaaaaact gttgcagcat tgagattagc taattttgag aacactgcct gttttttagc 2280 accaattgtt tcattatgga acatgctaaa tgtaaagagg aaaggttgtg atgttttact 2340 taatgatatc aataggtctc ctttccagac actcgatgac cttagattta aagaaatttt 2400 agcatttgct gatgcaactt ctaagatgtc agagggaaag ggattaaagc gtcataatac 2460 ttttacacca gaaacaaaac atgctttagt taacacccta caaggattgg tagaattaat 2520 aaaaaaactg ctttcagaag atcaccaata tgttcttccc ggtatttttc aaacagatcg 2580 tcttgaaggt gaatttggca tctacaggta aaattatttt tgtgttactt atagttaaga 2640 aaattttgta gatttgaaat ctaaaaaaca aatatttctc tcatttattt aaatgtatta 2700 ttttttaaca tacataagtt taacgatacg tttttatttg acaattgctg ttttaactat 2760 aatttaatac tcggttcaac tcaatttatt tggttatatc tttttgcaaa ttttttgctt 2820 atttgtagac aactaagtgg cggaaattat tttatctcag cagaacaagt attgaacagt 2880 ttgcaccttc agcggttaaa attattcagc aaaatagttt ctttggatga aactgacatt 2940 cattgctcac gcatacattc agactgttgc acaatggatc taatggaaga ggatattatt 3000 tatttagatg aatgtttaat taatgcagac aatgaatcta ttaatgatca ggagaatgct 3060 gccctttttt atatggctgg ttatgtacag cacaagatcg atccgcagcg tgtagttgat 3120 acaacaacat cacaatcagc tagttctgaa tttacagatt atgtttcacg tggtaaattg 3180 cgttatcctg gtgatactct tctccagttt atttgtgttt gttatctact gtttaacagc 3240 gttccaatgt ctgaaaaacg ttttcaatgt gttgtttatt taaagaaatt atttttattc 3300 ttgcattcaa cattaccatt tgatttagaa acagacagta taggtatttc aaaagttatc 3360 tcaataataa ctaactgttt cttgaaaggt gttacgaccc ttgataatac atctgatatt 3420 gtttcattaa acattgagga cagaaaaaga aagaaactca atacttagtg gtctgattat 3480 ttagttgacg ttggtgatat aattaataac gttttgaaat gtatttatgg ctttatattt 3540 atatggttta aaatttattt ataattatat atttattatt atgatatgaa tataatatag 3600 tatgataagt agtgattaat tagtttatta acattgtctt taaaaataat tatttacagc 3660 taatttacgg ataatttaca gtttctttgt aaaaagttgt tttattttta acacttaatc 3720 aacacaattt aatttttgtt tgaaaatagt gataaataat tgagccataa cattttttta 3780 ttaaatttct attcataatt atatttttgt ttaataatag aagtgtttcg cttttttttt 3840 catcggccct gtcgtcaaaa ctacatgaaa ttagagtttc gcgtcccgtt gttttagtag 3900 tccctg 3906 // ID TDD-5 repbase; DNA; INV; 3783 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Dictyostelium discoideum Tdd-5 transposable element. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Ginger; KW Ginger2; TDD4; integrase; TDD-5. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-3783 RA Wells D.J.; RT "Tdd-4, a DNA transposon of Dictyostelium that encodes proteins RT similar to LTR retroelement integrases."; RL Nucleic Acids Res 27(11), 2408-2415 (1999). XX RN [2] RP 1-3783 RA Glockner G. and et. al .; RT "The complex repeats of Dictyostelium discoideum."; RL Genome Research 11, 585-594 (2001). XX RN [3] RP 1-3783 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [2] (Consensus) XX CC TIR is 297-bp long. XX FH Key Location/Qualifiers FT CDS join(634..780,792..1070,1083..1337,1431..2708) FT /product="TDD-5_1p" FT /translation="MHKGHIGRDATYQKFKSMYFCTGMWVMVDNAVKQCDI FT CQRNKIKGKILIKLLFISNDINNFNIGINKEYVAIEDTEEYSRMVFDLTSL FT KGEHRNNDIIIESTDRFSSNWETIKNDNVKEANDSNIVYILICVNSFTKFA FT TGRYSPKSSILFILTIILKDVLHLKKQFQFTIFWPLHTKINLSRSGIAIME FT RNSKTKCLINFVSLHFLHPKLLMEHLVHQQLKEWWKEHLFTKFDSEKICKT FT QTISSYLELALEVYNNRKHRAIGMSPYQAIGIKPLFQTATLDNGFQVNLEE FT NLVSIPDISYKQRQEIILKNIEKYNNNWSSKSTKKQFKIGDTVFLLEILNK FT KKTLVKSKIIEIHNDDQNNKKTYRIQFLEDGINNKQKTGMAWEYYVGPNKL FT VHYNQAKERTNDFQPQYNDLNDLKDSKDSNDSNDSNDLTSFTSNLYLSNHR FT KVVDILLSFTKKIGYDLTNYNLNAIAIQDDPLKVLEDSFSKINVLNNNLPN FT EANPCFHTFKNEKLKHECLKLLDGNVTAPTLINQLRKELGLLTIPQHVVQK FT SVTCNPSLNVARQLVPSIEHISNIHPLANALQQPPTSNTSSSFVHTLNPPD FT IVSQTLSNPPTIVTSSTIIYTNHCYSSTIKSTSNCHSDTIKYTSNCHSNTF FT KPSR" XX SQ Sequence 3783 BP; 1587 A; 477 C; 403 G; 1316 T; 0 other; tgttgtaggg aatttttttt gaaatttcgc tttctaaacg gcagtgaatt tttacactaa 60 aatacactgt ttatgcatgg tgtaaaaatt cactaaattt tttttttttt tataattaaa 120 ttaaaaaaca atttatttaa tcagaacaat aaaaaaccaa ttttattcag ttatttgctt 180 gttatttatt tttttataac aaaaaaattt agtttgcgcc attttttttt tttttttttt 240 tttttttttt tttttttttt tttttttttt caattatttt tttcaatttt tttttttcga 300 aacaatttta ttttattccc caaaaatgaa acaatcaaaa ataaatgaaa taaaaaatta 360 tttaagaaat aaaaataata gtaataatct aaatagaaaa tataaagatt atgtactagt 420 ctctaaagag tatgtttata agtttcataa tgttgaatca aattcattat taaaatatct 480 aataatataa tcttaatttt ttaaagaaat ggtcacgaag agttacttaa atccgaaaag 540 aaagttttta caaccaatga tggccaaacc atttcatcta gtgtcctccg taaggtcatt 600 aatgatgttg agttttcaaa ccattgggac gagatgcata aaggtcatat tggaagagat 660 gccacttacc aaaaattcaa gtcaatgtat ttttgtactg gtatgtgggt aatggttgat 720 aatgcagtca agcaatgtga tatatgccaa agaaacaaaa ttaagggtaa aatattaatt 780 taaattaata aaaattatta tttatatcaa atgatattaa taattttaat ataggtatca 840 ataaagaata tgttgcaatt gaagatactg aggagtattc aagaatggtt tttgatttaa 900 catctttaaa aggagagcat agaaataatg atatcattat agaatctacc gatagattct 960 cttctaattg ggaaactatc aaaaatgata atgtaaaaga agcaaacgat tcaaacattg 1020 tctatattct tatttgtgtc aattctttca caaaatttgc aactggaagg taattacctt 1080 aatactctcc taaaagttca atattattta tattaaccat tattttaaaa gatgtcttac 1140 atctaaaaaa gcagttccaa tttacaattt tttggccatt acatacaaag ataaacctat 1200 caagaagtgg cattgcgata atggaaagga attcaaaaac aaagtgtttg atcaatttcg 1260 tgagtttgca tttccttcat ccaaagctgc tcatggagca cctcgtacac caacaactca 1320 aggaatggtg gaaagagtaa atcaagaaat taaaaaatta ataagaaatt ttcaaaagga 1380 agagttagta ccctcttcta atttacaccc tatatttttt aaattactaa catttattta 1440 ccaaatttga cagcgaaaaa atatgtaaaa ctcaaacaat tagctcttat ttagaattag 1500 ccttggaagt atataataat agaaaacaca gagctatcgg catgtcacct tatcaagcaa 1560 tcggaattaa accattattt caaacagcta cattagataa tggatttcaa gtaaatttag 1620 aagaaaattt agttagtata ccagatatat catataaaca aagacaagaa attattctta 1680 aaaatattga aaaatataat aacaattggt catcaaaatc aacaaaaaaa caatttaaaa 1740 ttggtgatac agtgttttta ttagaaatat taaataaaaa gaaaacattg gtaaaaagca 1800 agattattga aatacataat gatgatcaaa acaataaaaa aacctacagg atacaatttc 1860 tagaagatgg tattaataac aaacaaaaaa ccggtatggc ttgggaatat tatgttggac 1920 caaataaatt agttcattat aatcaagcaa aggaaagaac caatgatttt caaccacagt 1980 acaatgattt aaatgattta aaagattcaa aggattcaaa cgattcaaat gattcaaatg 2040 atttaacttc ttttacatca aatttgtacc tatcaaacca tagaaaagtt gttgatatat 2100 tattatcttt tacaaagaaa attgggtatg atcttaccaa ttacaatttg aatgcaatcg 2160 caattcaaga tgatccttta aaagttttag aagactcttt ttccaaaatt aatgtattga 2220 acaataattt acctaatgaa gcaaatcctt gtttccatac ctttaaaaat gaaaaattaa 2280 aacatgaatg tttgaaattg ttggatggta atgtgacagc tccaacatta attaatcaat 2340 taagaaaaga attaggttta cttactatac cccaacatgt tgttcaaaaa tctgttacct 2400 gtaatccttc tttgaatgtt gctcgtcaac ttgtgccatc tattgaacat atatccaata 2460 ttcatccttt ggcaaatgct ttgcaacaac caccaacttc aaacacatca tcatcttttg 2520 ttcatacttt aaatccacca gatattgtct ctcaaacact atcaaatcca ccaaccattg 2580 ttacctcaag caccatcata tacaccaacc attgttattc aagcaccatc aaatccacca 2640 gcaattgtca ctcagacacc atcaaataca caagcaattg tcactcaaac actttcaaac 2700 catccagata atgtcattgt acccatgttg tctccaatca aaattcaaca taacaaacag 2760 atcatcctaa gatccctaaa acatgactcc cctcaaaaac caaaaaaaaa gttaaaattt 2820 tcaagaaagg aattattagc ccaaggtgat ttaaaataaa acaatgttga aaaaattaaa 2880 aaaaaaaaaa aaaaatatat tttaaaaaat tgaaaaaaat taaaataata gaaaaaaaaa 2940 gaaaaataaa aaaaaaaaat ttataattca caaaacactt tttattcaag tcgtataaaa 3000 ttaaacagtg ttaataacaa aaatcaaata tgaaatcatt aagaaattaa ttatataata 3060 agaataataa agccaggagc acatttattt attaatttat ttattaattt atttatttat 3120 ttattttttt atttatttat ttattatttt atttcttttt ttttttgaat accaagtcac 3180 caaatgcaat tgcatcttta ttatttaata atatatttaa atttaataat tatatatttt 3240 gttatattta tttatggttt atattttaat tatggtattc attattgttt atataattat 3300 ttttagtatt tatattattt ataattataa actcgagtta atagaattat tattattatt 3360 attattattt aaatgtgggt caactataat tagtgtctta gttcacttat ttaatattaa 3420 cttatttttg tgtcttcgtt acataaaaaa aaaaatgaaa aaaaaaaaaa aaaaaaaaaa 3480 aaaaaaaaaa taaataaaaa aagaaaataa aaaataaaaa aaaaaataaa aaaaaaaaaa 3540 aataaaaaaa aaaaaaaaaa ttggctcaaa tctaacaatt taaattgtta tttaattctt 3600 aaacaacaaa taattgttta aaattagttt ttttattatt ttgattaaat aaattgtttt 3660 taatttataa aaaaaaaaaa aaaaatttta gtgaattttt acactatgta taaacagtgt 3720 attttagtgt aaaaattcac tgccgtttag aaagcgaaat ttcaaaaaaa attccctaca 3780 aca 3783 // ID BEL-73_AA-LTR repbase; DNA; INV; 726 BP. XX AC supercont1.277; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-73_AA_; KW BEL-73_AA-I; BEL-73_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-726 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.277; Positions 1013606 1014331. XX SQ Sequence 726 BP; 270 A; 134 C; 125 G; 197 T; 0 other; tgttgcggcc aagccagttc ataatcctga aaaagaaata gaaatgaaaa tgtaattaaa 60 ttaagaatat aaaatgaatt tatacttacc gatttcattc cccatcaatt ctcgcatcaa 120 ttccaaaaca atgttatgag tggtgagctg atttgacagc tacatattta tgctttaagg 180 atagtagtag taggcaattg agttaccata cacacattag agcgttggca gaactgtgcc 240 gctgtaaact tgcatacaag ttctctagac agggaaagca aatgtcagtt caaagtatgt 300 gtttacatat acacaaattg accgttgaac aatcatgatc acgcaagcaa aagaaaatta 360 taaaggcttt agaagacaaa tgaaagatga aaaaacagct ttaaatgaaa cagatctatc 420 acaataggtc ccatatgaat ccatggatga aaaatgatat aaatacgcac cccaaatcat 480 gtacaattca gttcaaaaca agcatcgaag tcgaaaatac aatcttatat tcaaagtgaa 540 taaacccagt tttttatcat ttcaaaccgt cggaatcgtc ataccgtacc gtatagtcgc 600 gctagttcgg atcagtgtcg aaattgaaag tcaaagttcg tgaagaggtg accgttttta 660 attaaccctt tactgcccag tgtcctaaat tcttcacccc gtgaaaatta gatcgctgcg 720 aaaaca 726 // ID Crack-9_AAe repbase; DNA; INV; 4755 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-9_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4755 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1225-1225 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 341..1369 FT /product="Crack-9_AAe_1p" FT /translation="MMSAENVIICVICDKEEPDVNKVLECLHCHKCYHYRC FT KKLVGSAARKMRSQQFYCTVECNEMGMRTQHAVASENAVIQELRNVIKEVH FT AMRDEAAASRRFLEDSIKEVEKSQDFLCTKFDEVVSELKQLKIDYSKMEKE FT MFSVREDYSQLSDAVVTLEAEVDRFKRAELVRNVIVLGVPITREDNASAIV FT NGIARVMGYALDEKIIEAYRLKASKTDGNYAPIRVVFADSNVKEEFFAKKK FT IFGQMKVSALGGSFAALTGKITLRDEMTSYGLAMLREVRAMQEQLKAKYVW FT PGRDGVILMKRTDTSKVEYIRNRMDIRALECSTSKRARGLSVSPEGPPAKR FT " FT CDS 1373..4312 FT /product="Crack-9_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="ICCFIIYLNQDKLNLKMESINHYYDTVEDCINNKAEL FT IKNNVSLRIFQWNIRGTNSMPKFDLVKQFLDKYQDRIDVIALGETWLKEGC FT TELFGVHGYKSIFSCRQESHGGLALYVREDFVVNVITNEHDDGFHCIHAEI FT ASASNRFNIICVYRPPNFSYTDFQIKIESKLSSISNQKNVFLIGDTNVPVN FT LSQNNVVREYRRILQSYNLSVTNTSVTRAASGNILDHLVCSVDQMGSIINE FT TIYSEMSDHSMILTTLTSSSMKGMQTLTKRIINHTQLNERLSSALMDIPMN FT LSANEKLLTVIEKFHEIQASATKEVIIQVKIKQQCPWMTFDAWKMMRIKDH FT ILKSSRRHPNDQHLQNLLKHASKRLQQIKERCKRDYYHRLLLHSSNKKSWQ FT MMNEMLGRKTNVNRAVLLRIDGKDVTDGFQVSNTFNDFFCRVGEDLANDID FT SDKNIWKFRTLPMQSSSIHLRQATVSEVILLIKELDXNKSPGPDNISAETI FT KTHHISFAKILTEVFNEIIETGEFPECLKIARVVPIYKSGDSANTNNYRPI FT SVLSIMSKLLEQMLASRLSDFLFGNNLIYERQYGFRTGCSTLTATCEMVDE FT IYSSLDKRKFSGALFLDLKKAFDTINHVLLLRKLECYGIRGKALLLLENYL FT GNRQQFVTVNGERSCLRNISIGVPQGSNLGPLLFLVFINDLSRLKLHGRMQ FT FFADDTVLLYSATEAANIARLIQTDLQCLQDYFTSNVLSLNLQKTTYMIFH FT STRRIPQELPTVRIGTVEIRNVDSFQYLGLTLDSVMSWEKHIDNLKNKLAA FT LCGTFRRLSSFIPLRWMLQLYYSMVHSRLQYLVCIWGKASASRLRELQVLQ FT NRCLKVIYKKPHLFPTHQLYNDRNSSLLPIRGLQMLQTLIHMHNIITNANT FT HHNVQIIRNTSARSSRYHGNIRLARPNSEMGKKRFSYLGSKSYNELPSSIK FT NLSSPYLFKKKLKQYLKLHITRYII" XX SQ Sequence 4755 BP; 1531 A; 967 C; 996 G; 1257 T; 4 other; ctgttgttgt ggaagtggtc gtgcctctat cgtgctgcat taagataagc aaaattgcta 60 acaagaaggg ctataatatc gccaaatgtg tatatttgaa cacaagatca ggtgttgcgt 120 cgcaggtggt attgtccaac tgaaatattt tgtgcaaatt gattaaaacg tccttaacta 180 tttgccttga tagcagaacg attccatttt ggaacaaagc ccgttccgat ttggcataca 240 atacagaatt gtacccacca tcgcctgttg gtgtagtgcg ttctaaattc acgccgcgcg 300 attctgtgtt gtacgtatat acaacacatc ggtacagtca atgatgtctg cggagaatgt 360 tatcatctgt gtaatatgtg ataaggagga accagatgta aacaaggtgc tagagtgttt 420 gcactgccat aagtgctatc actacagatg caaaaaactc gtcggttctg ctgcgcgtaa 480 aatgcgtagc caacaattct attgcaccgt ggaatgcaac gaaatgggaa tgcggacaca 540 acatgctgta gcgtccgaaa atgcagtaat tcaggagctt cgcaatgtca ttaaggaggt 600 gcacgctatg agagacgaag cggctgcgag tcgacgtttc cttgaagatt ccatcaaaga 660 agttgaaaag agccaggatt ttttatgtac aaagttcgat gaggtcgtca gtgagttgaa 720 gcagctgaaa atagactaca gcaagatgga aaaagaaatg ttcagtgtta gggaggatta 780 tagtcagctg agcgatgcag tcgttaccct tgaggccgaa gtagatcgtt tcaagcgagc 840 ggagttggtg aggaatgtca tcgtactggg agttcctatt accagagagg ataatgcttc 900 agccatagtc aacggaattg cacgtgtcat ggggtacgca ttggatgaga agatcatcga 960 agcctacaga ctgaaggctt cgaaaacgga tgggaattat gcgccaatca gagtggtttt 1020 tgcggatagc aacgtaaaag aggagttttt tgctaagaag aagatcttcg gtcagatgaa 1080 ggtatcagct cttggaggta gctttgctgc tcttactggg aagattacgc tccgcgacga 1140 aatgacatcg tacgggctgg ccatgctgcg agaagttagg gcaatgcagg agcagctgaa 1200 agcgaagtac gtttggccag gaagagacgg agttatcttg atgaagcgaa cagatacatc 1260 gaaggtggaa tacattagaa atcgaatgga tatcagagct ctggaatgca gtacgtccaa 1320 acgcgcacga ggattatctg tgtcgccgga aggaccaccg gcaaagcgat aaatttgttg 1380 ttttattatt tatttgaatc aagataaact caatttaaaa atggaatcga ttaatcatta 1440 ttatgatact gtagaagatt gtataaacaa taaagctgaa ttaataaaaa ataacgtctc 1500 tctccgaatt tttcaatgga atatacgagg aacgaattca atgcctaaat ttgatctggt 1560 gaagcagttt ttggacaaat atcaggatag aattgatgtc attgcgctag gcgaaacatg 1620 gttgaaagag ggttgtactg aactgtttgg ggtgcatgga tacaaaagca ttttttcatg 1680 tagacaggag tcccatggag gtctagcgtt gtacgttaga gaagattttg tagtaaatgt 1740 gatcactaat gagcatgatg atggattcca ttgcattcat gcagaaattg catcagcaag 1800 taatcgattc aatatcatct gcgtatatcg tccaccaaat ttcagttaca ccgattttca 1860 aatcaaaatt gagtctaaac tcagtagcat ttccaatcag aaaaacgtgt tcttgatagg 1920 agatactaac gttcccgtga atctgtcaca gaacaacgtt gtcagagagt acaggaggat 1980 tctacaatcg tataacctat ctgtcaccaa cacatcagtg acacgtgcag caagtggaaa 2040 tattctcgat catttggtgt gcagcgtaga tcagatggga tcgataatca acgaaacaat 2100 atattcagaa atgagcgacc attcgatgat actaacaacc ctaacatctt cttccatgaa 2160 agggatgcaa acgctgacga aacgaataat caaccacact caactgaacg aaaggctttc 2220 ttctgcgctg atggatattc ccatgaattt gtctgccaat gagaaactct taacggtgat 2280 cgaaaaattc cacgaaattc aagcgtcagc aacaaaggaa gtaatcattc aggtcaaaat 2340 taagcaacaa tgcccttgga tgacctttga tgcatggaaa atgatgagga ttaaggatca 2400 tatactcaaa tcaagtagac gccacccaaa cgaccaacac ctgcaaaacc ttcttaagca 2460 cgcatcgaaa aggcttcagc aaattaagga acgatgtaaa cgtgactact atcatcgatt 2520 actcttgcac tcgtctaata aaaaatcatg gcaaatgatg aatgaaatgc taggacgcaa 2580 gaccaatgtt aatcgcgctg tattattgag gatcgatggc aaagacgtaa ccgatggctt 2640 ccaagtaagc aatactttta atgatttttt ctgtcgagtt ggagaggatc tagctaatga 2700 tatagacagc gataaaaata tctggaaatt ccggacctta ccaatgcagt catcctccat 2760 tcatcttcga caagctacag tatcagaagt aattctactc attaaagaac tcgattscaa 2820 caagtctccg ggacctgaca acatatctgc tgaaactata aaaacccacc acatttcgtt 2880 tgccaagatt ttgacggaag tattcaacga gattatagaa acaggggaat ttccggaatg 2940 cttgaaaatt gctcgtgtgg tgccgatcta caagtctgga gattctgcga ataccaacaa 3000 ctatcgtccc atatcagtgc tgtcaatcat gagcaagttg ctggagcaaa tgctcgcgtc 3060 aaggctttca gactttcttt ttgggaataa ccttatatac gaaagacaat atggcttcag 3120 aacaggctgt agtactttaa ccgccacgtg tgaaatggta gacgaaatat acagttctct 3180 tgataagcgt aagttctctg gagcattatt cctggaccta aaaaaggcat tcgacactat 3240 aaaccatgtt cttctgcttc gtaagctcga atgttacggc atacgaggaa aagcgctgct 3300 gttgttggaa aattatctgg gtaatcgaca acaatttgtt accgtcaacg gtgaaagaag 3360 ctgcttacgc aatatatcga ttggtgtccc acaaggtagt aatcttgggc cactattgtt 3420 ccttgtattt atcaacgacc tatcacgcct taaactwcat ggaagaatgc agttctttgc 3480 tgatgatacg gtacttctgt atagtgccac agaagccgct aatatagcgc ggcttattca 3540 aacagatctg cagtgtctac aagattattt cacatcgaac gtcctttcgt taaacctaca 3600 aaaaactacc tacatgatat tccactcgac tagacgaata cctcaagaat tacctacagt 3660 tcgaatcggt acggtcgaaa ttcgtaacgt ggacagcttc caatacttgg gcctcacctt 3720 ggatagcgtt atgagttggg agaagcacat cgacaatctg aagaacaaac ttgcagcatt 3780 gtgtggaacg ttccgaaggc tatcttcatt cattccattg agatggatgc tacaactgta 3840 ttattcaatg gtacattcaa gacttcagta cctagtttgc atttggggaa aagctagtgc 3900 atctagacta cgtgagctcc aggtgttaca aaaccgttgt ttaaaagtca tctacaagaa 3960 accccatttg ttcccaacac atcagttgta caacgatcgg aacagttcat tgcttccaat 4020 tcgcggactt caaatgctkc aaactttaat tcacatgcat aatattatca ctaacgccaa 4080 cacacaccac aacgtgcaaa taatccggaa tacttcggca agatcatcca gatatcatgg 4140 caacattcgt ctagcccggc ccaatagcga aatgggcaaa aagagattta gctatctcgg 4200 tagtaaatct tacaacgagc taccctcttc gatcaagaac ctttcatctc cttatttgtt 4260 caagaaaaaa ctgaagcaat atttaaagct ccatataacc cgctacataa tttgaaaata 4320 tggttgctaa acctcaaacc gtttcttgtc tgcctttttt ttcgtttccg ctcctccgcc 4380 accctcaacg ccacccgcca ccacccagca caacccgcca ccgccagcca ccgccgccca 4440 acgccacatc aaagcaccac atcgccaaca aataatccca cttgttaaat aattttaagt 4500 tagttaccac tccacaatta ttgtgaaaac ttaaaatccc acctccttca aagagtgcac 4560 actcactgga ggtgaatatg waaatctttc tgttattgga attgtgaaat tgaaaagatg 4620 aggaggtttt atgcctgttg gaggaaggat aaatatggag aatattctac tccagcgggc 4680 ttttccctgc tccggtaaaa aataaaaata aataaataat taaattaaaa aaaaaaaaaa 4740 aaaaaaaaaa aaaaa 4755 // ID Gypsy-33_CQ-LTR repbase; DNA; INV; 140 BP. XX AC AAWU01012402; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_CQ_; KW Gypsy-33_CQ-I; Gypsy-33_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-140 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 446-446 (2011). XX DR GenBank; AAWU01012402; Positions 193 332. XX SQ Sequence 140 BP; 42 A; 24 C; 23 G; 51 T; 0 other; tgtgatatat tgtaatatta tagtagtcta tgatttagta gatcatagta gtcaatgata 60 tgtaggtcaa taaagattca ttaccaccct tgtcattcta ctgaacggac ctgtttggac 120 gtctctctct caacattaca 140 // ID Chapaev-7_HM repbase; DNA; INV; 5136 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5136 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 33-33 (2008). XX DR [1] (Consensus) XX CC Chapaev-7_HM is a very young family of autonomous Chapaev DNA CC transposons that can be still active in the hydra genome (they CC are <0.2% divergent from their consensus sequence). The consensus CC sequence was obtained based on a multiple alignment of 15 copies; CC it codes for a 983-aa Chapaev transposase (ten exons). CC Chapaev-7_HM is characterized by 4-bp target site duplications, CC 12-bp terminal inverted repeats, and 28-bp subterminal inverted CC repeats (separated by a 10-bp and 1-bp regions from the 5' and 3' CC TIRs, respectively). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(462..1382,1461..1753,1905..2117,2237..2476, FT 2557..2633,2775..2983,3066..3395,3524..3679, FT 3806..3998,4113..4429) FT /product="Chapaev-7_HMp" FT /note="Transposase." FT /translation="MDHLTKLSLICRICGEHITKDSVDVCKYSERIGNTFY FT INTTIDNKNIHPQKMCRRCYIIMRNIEKGSSTSLKVVKWPSPCPDKCSCFS FT KVSGRKVKKCKAGRPCIIEQNQKRWTRPIINNLIASIPKQTLRLNAIDIDP FT VANPHLDLCICQLCNNIMYKPLILKECQHSFCSECLFREIEGKLETEAKCF FT ICNQHIPLYTILNSVNVTLLIEHILLGCNKKCKLKFAVKDNDLKKLHEEIC FT IGEQLLPTMICKKIHDGLDTTLADVFLLKENDAIPRIVEDAALHVIKQKLV FT TSESNLFAFPSGGPRPLHFTSNSIAYKESSDVSKRTIKKRQKAVINSLNVI FT SGMSSTSQLQQTSAILNSFEVKDQVKILKMSDIPLSVISAEEMISMKAHMG FT ITYSNMKILARWLKTKDIECASNKKQRKVAKSWSGEDWVVCNAPFNFLLKD FT SSDSFEVKNAPWGYIKDLPLHIINKLNLLKSKNLLKHSNIKNDEIHIKIGG FT DYGGGSFKMCYQIVNVDKPNSRRNTNIFSIFEAKEYKPNLIVGLSMFTQQI FT NQLQSLCWNEKKIRVFIFGDYEFLCSAYGITGANGRHACLFCHITRAGMKK FT SDTQQKHSPCSNRTLETLDTDLQKFIQKGRIPKLAKYCNNVIDSPLFNIPL FT TQVGIPLLHISLGIYLKFFNMLEDGCHLIDIKIAAKMCLKNQTVNNKSFDE FT YIVIQLRIYEQEKAIAEYCEKITLIHEAMSIQVLRSPENKEYLYEIFQPRV FT VHFMKKKNAKIIELEELKTKTFEKSHGPLVKKLDEVLCGLNVQRQAYHGKC FT FIGNHVHKMLKLNSVLDLCNSIPKTVSDLGFIDTDVFIEAKVLSEKFVELF FT SKYATCYNFMNSSEIINNNPLLELEGAIGELMSYFRKTWPNESITPKMHLL FT ESHCADFIRNWGLGLDIYGEQGLESMHAEFNSMNSTFCHMKGKQRLESILS FT NHYIKNSLEALEIRPTIQERKSYKRKAS" XX SQ Sequence 5136 BP; 1901 A; 646 C; 734 G; 1855 T; 0 other; cacggcagtt tcaaattgat acttattgga actttattct gagcatgcgc agtcaacact 60 tataaaatat gagaaaatca atggtttttt aattatttct ttctgaagct cagctactgc 120 aacttaattt ttttaatgtt tgatatttta aattttttcc aaaccaaagc gatttgtttc 180 agagataact gataaaacat tgttttataa attttgttgt tgaaattagg gttttcaaca 240 aaaactttta aatcaaatgg taaaatattg ttcttaaaat tttttatttt gtaatcagct 300 atgttataat gaaacgaatt ttctgtatct gaaagtaaat atttttgttt tcatcagttt 360 attgttcttc ctttaaactt aattttggag accgattttt ttttttttcg ctaatttttt 420 atttatttta cataactgct atatttagat attttgtaaa catggaccat ttaaccaagt 480 taagtctcat ttgccgaatt tgtggagaac atataacaaa agactcagtt gatgtgtgta 540 aatattctga aagaattgga aatacttttt atataaacac caccatagac aataaaaata 600 ttcacccaca aaaaatgtgt aggaggtgct atatcattat gaggaacatt gaaaaaggtt 660 ctagcacaag tcttaaagta gtaaaatggc catcaccatg tccagacaaa tgttcctgtt 720 tctccaaagt ttctggaagg aaagttaaaa aatgtaaagc tggtcggcca tgtataatag 780 aacaaaatca gaagagatgg accaggccaa tcattaataa cttaattgct agtataccaa 840 aacaaacatt gcgtttaaat gcaattgata ttgatcctgt agcaaatccc catttagact 900 tatgtatatg ccagttatgc aacaacatta tgtataagcc actaatatta aaagagtgtc 960 agcattcttt ttgctcagag tgtttattca gagagattga aggaaagctg gaaacagaag 1020 caaaatgctt tatttgtaat caacacattc cattatatac aattttaaat tctgttaatg 1080 taacactatt gatagagcac attcttttag gttgcaataa gaaatgtaaa ttaaaatttg 1140 cagttaaaga caatgacctt aaaaaattac acgaagaaat ttgtattggt gaacagctat 1200 taccaactat gatttgcaaa aaaattcatg atggacttga tacaactcta gctgatgtat 1260 ttttattaaa agaaaatgat gctataccac ggattgtaga agatgctgcg ttacatgtta 1320 taaagcaaaa gctagttacc tctgaatcaa atttgtttgc atttccttcc ggaggaccaa 1380 gggtaagcac atttttaata tgtagcaaaa gaaaatgaac tatattatgt tttgttataa 1440 ttattaaaat ttttttctag cctttacatt ttacttccaa ctcaatagca tataaagaga 1500 gctcagatgt tagtaaaagg acaattaaga aaagacaaaa agccgttata aattctttaa 1560 atgtgatctc tggtatgtcc agcacatcac aacttcagca aacttctgct attttgaatt 1620 catttgaagt aaaagatcaa gtaaaaatat tgaagatgtc agacatacct ttatcagtta 1680 taagtgctga ggaaatgatt agtatgaaag ctcatatggg aataacatat tcaaacatga 1740 agatacttgc aaggtaaatt atatatatat atatatatat atatatatat atatatatat 1800 atatatatat atatatatat atatatatat atacgaacct taatcaagct aaaatgaaag 1860 tgtgaattaa taaaaaatat atatatattt aaaattttat ttagatggct caaaaccaaa 1920 gatatagaat gtgcatcaaa taaaaagcaa cggaaagtag ctaaatcctg gtctggagag 1980 gattgggtag tttgtaatgc tccttttaac tttcttttaa aagactccag tgattcattt 2040 gaagtaaaaa atgctccttg gggatacatt aaagatcttc cattacatat aataaacaaa 2100 ctcaaccttc tgaaaaggta tacatagtat tttatttaaa taagtgtgta tttaaatgtt 2160 gtcattttat atatttttta ttaatatttt atttactttc tctaaaaaaa aattaaattt 2220 atttaaaatt gagcagcaaa aatttgctaa agcattccaa tataaaaaat gacgaaattc 2280 acatcaaaat tggtggtgac tatggcggtg gatcttttaa gatgtgttat caaattgtga 2340 atgttgataa gccgaattcc agaagaaaca caaatatttt tagtattttt gaagcaaaag 2400 aatacaaacc aaaccttatt gtgggtctct caatgttcac acaacaaatt aatcaattgc 2460 agtcattgtg ttggaagtag gttttatttt atcataattt taaaaatgta tttagattaa 2520 tagattttac tctattttga aaaaaattat ttttagtgag aaaaaaataa gagtatttat 2580 ctttggtgat tacgagtttt tatgttcagc atatggaatt actggtgcaa atggtataaa 2640 ttcaaataat aaaattgtgc aacaaacaaa accacataac cacaatataa aattattgaa 2700 gttgatgttt atattattta gtgacatttt atgacttttg atatcctata tttaaactga 2760 tattatcata ttaggtcgac atgcttgctt gttttgtcat ataactcgtg caggtatgaa 2820 aaagtcagat actcaacaaa aacacagtcc ttgttccaat cgcacgctgg aaacacttga 2880 tacagatctc caaaaattca ttcaaaaagg tcgaatacca aagttagcaa agtattgtaa 2940 caatgtcatt gattctccac ttttcaatat tcctttaaca caagtattac acttttctat 3000 tatattacat ttttacttgt ttatataatt gtaaatattt ttttatattg attttatgaa 3060 tctaggttgg cattccattg ttgcatatct cattggggat ttatcttaaa ttttttaata 3120 tgctggaaga tggctgtcat ctcatagata taaaaattgc agctaaaatg tgtttaaaaa 3180 atcagactgt aaacaacaag agttttgatg aatatattgt aattcagtta agaatatatg 3240 aacaagaaaa agctattgct gaatattgtg aaaaaataac acttattcat gaagcaatgt 3300 caatacaagt tctacgttct cctgaaaata aagaatatct atatgaaatc tttcagccca 3360 gagtagttca ttttatgaag aagaaaaatg ctaaggtata ataagtttat atttgaaata 3420 atgaaataaa attttctata attctatgtg aatgaaacat aaatcaatct aagagcaaat 3480 gcttatatgt gtgtgtgtta atgtaaatct gttttttata tagattattg aattagaaga 3540 gttaaaaact aaaacatttg aaaaatcgca tggaccactt gtgaaaaaac ttgatgaagt 3600 tctatgtggg cttaatgttc aaagacaagc ttaccacgga aaatgtttta ttgggaatca 3660 tgttcataaa atgttaaagg tatttctata catatttttg catacttgta aatatgaatt 3720 ttataaatat tttctcaatc acatagtttt gtaatgcata gaggtactaa ttttgtaaat 3780 cataaaatta tcttttatct attagttaaa cagtgtccta gatctttgca attcgatacc 3840 taagacagta agcgatcttg gatttattga tacagatgtt tttatagagg caaaagtctt 3900 gagtgagaaa tttgttgagt tgttctctaa atacgcaaca tgttacaact ttatgaattc 3960 aagtgaaata attaataata atccattatt agagttaggt aaaaatatat ttttattatg 4020 ttacttgtat taaaattcct cttgcttgca taattattct ttctttgttt tctatttacg 4080 ttttcttatt tatttattta ttttggattt agaaggagcc attggagaat taatgagtta 4140 ctttcgtaaa acttggccga atgagtccat tacgccaaaa atgcatttat tagaaagtca 4200 ttgtgctgat tttattagaa attggggttt gggtctcgat atttatggag agcaaggttt 4260 agaaagcatg catgctgaat tcaatagtat gaacagcacc ttttgccata tgaaagggaa 4320 gcaaagacta gaaagtattc tgtctaatca ctacatcaaa aactccctag aagcattgga 4380 aattagacct actattcaag aaagaaaatc ttataaaaga aaagcctcgt gatatttatt 4440 aaataattgt atatatgatt gtaattgcat aacgacttaa ttttaatgtt tattgtaaga 4500 aaaaatatat atactcttct tccgaatata ataaatattt aataaaataa atagttaatc 4560 ttttggttac tagacaaaaa cgaatatttt atttcattag attagataga tgtgttattc 4620 cttacgtttt ctttgacatc ttggaaagta accatagtta cttacagaag ttttaggcgg 4680 aaaacgtata tttttatttt acaaaaaaag ttcaaaaaat ctacaaaaaa aagtaatcgg 4740 ttcttcaaaa tttgacagaa tctttaaata ccttaactct ttctcttaaa catttcagct 4800 tttctcaaaa aattcattta agtacttatt tattaagaat tactgacttc cttaatattt 4860 ataactaaag aaactgcaaa gttctagaaa ttataaaagt tggataaatg tttttttata 4920 tatatttttt tttcataaaa actagcataa cttaatattt attaaaatag aaaaatttaa 4980 ttactcattt ttatcaaaaa gtatataaaa ttaggttatt aaaaaaagtg tttttaaaag 5040 ttttttttgc gtttcgcatc tttattgaat agttaagcaa ataaatattt gtcaatgcgc 5100 atgcccagaa taatgttcca ttacgaaact gccgtg 5136 // ID Copia-132_AA-LTR repbase; DNA; INV; 230 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-132_AA_; KW Ty1_copia_Ele73; Copia-132_AA-I; Copia-132_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-230 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 230 BP; 50 A; 59 C; 40 G; 81 T; 0 other; tgttaacgaa caacctctgg cagcacggtt gtgtatccat gcaaccgccg ctactgtggt 60 gtgcccctac cagttaggca acactgtggt agctgtcatt attttccctc cactttgtta 120 ttcttaaatg tatcttcgat taatacaaac gtgtttttag tttctctcta aacaagagta 180 gttcttttat tcacttccgg ttcctccgtg tcgttccgaa acccatttca 230 // ID ORTE-2_AAe repbase; DNA; INV; 7050 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-LTR retrotransposon family encoding cysteine protease from DE Aedes aegypti. XX KW Non-LTR Retrotransposon; Transposable Element; ORTE-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7050 RA Kojima K.K. and Jurka J.; RT "A lineage of non-LTR retrotransposons encoding an OTU cysteine RT protease from the yellow fever mosquito."; RL Repbase Reports 11(4), 1125-1125 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. CC This family encodes OTU superfamily cysteine protease upstream CC of apurinic-like endonuclease. It is positioned at the sister CC lineage of the lineage including RTE and RTEX in RTclass1. XX FH Key Location/Qualifiers FT CDS 1538..5515 FT /product="ORTE-2_AAe_1p" FT /note="OTU cysteine protease, endonuclease and FT reverse transcriptase." FT /translation="MENIAVNGDIGEXYGSLHKALIEQIQVADSRFRSFNF FT SKEVFDRLIAPAVNNKLYKKIITNREVIKLFNRNSIKDVRIICATAEIFCT FT IINIFFNGIYKLSVIPDNGCRMEEGIEFNIDCTTRKSALQFKSCKTQNNHE FT IESRIRDKMAKKIRQNKDEEFGIYETVIFDKATKIRIKKIKGDGNCLIRAL FT IDQLRGKTQDDYVTSKDIQDLRNRIADHMTKNKERYEPFLIESLEEGGFEK FT FINLSVRQMGVWLGHEVMVAVSEIFGKNIIVHQVDEPDVIIGENGRSEEQT FT LRIFYTRTVEGGEKNHFESVIEVLDVENSSQGQIIIQENENDDTMEVGNVV FT EELNVENDNIIRIKQMSRSDNETERLQNNQPFQRFIRVASLNVRGCIKEEK FT RNEIDELMTLHNIDVVALQEVNALGEIIETKNYKWIAPLNRPNKTRGLALM FT IRKNEEISLGDVRMKSSSIMSAEIIINDCWKFLLVNIHGPNKAAYNFLAEL FT GTVIGKDHIRNKTLILGDFNAQIGRGDVDEEDSQIVGDKLGFDXSNENGEE FT FIMFLKLHKLINISSKIGVNTNVTWRIGKKESQIDHVLLPKDSDLRXRYVR FT GFWTSLNTDHKLLMVGLTFENNTIKRXRKKFKKKLDYSALKYDVVKKKYCD FT ALTQYDNDEVREVRKDIHKLYKEVIEKLKSSSDKALETSKAPLTPKRKKAL FT GRLNKALKLWKKHPDMLPYKWKVLNRKEEFKIAEKEHIEKTIKEFYRNLKD FT FDVGVRIRKSYIFLKDFMKKRSRKNAYIPTRLWNEVLKQSEGPNVTFLEKN FT DTCPLTEPPSKEEMSSIIFNSCNGKSAGPDGIRXEMIKYADEKTFNDLVLI FT WQRMWIDNVMPKDLSCTTQIPIPKVSYPRKVEDFRRISLCNVIYKPYAKWV FT KNRLKEFTGDPGLHQAAFTEGRSTDDHIFITRRIMEEYWNGGEQLYIASLD FT IRKAFDNVKLENMKEVLLELNVPTHIIDRVLLCLNEDRTRVKWQNQWSEEI FT NRGKGIKQGCPLSPFLFNLVMQNILKAVVEKVPELKLMEIELLDLPLILVF FT ADDVLIIARSRKELEKIVKALEECLPTVGLEINEEKSHVIIRAPNKMGDIP FT STILLNNKSYKTCKTLKYLGITLTENLNRPATVKQRCINTAKASKVVVEFC FT KQFKPSWTIGKLIYNTVIAPAILYGTKVSTLTKRSRKQLAKYEKIIVKSIW FT HHCTKENDNKLNIRKELNGKTINRKIRVGRISYYGHIKRRPQNHPLKMAYR FT FKFNKKKEGRPSYTWKDSLKQDLDRYRNISKEEWKTLANDKHKLKQKAEEI FT YKQEESENSDGEDS" XX SQ Sequence 7050 BP; 2698 A; 850 C; 1497 G; 1996 T; 9 other; gttgtgattg aagtgagacg caaattcggg gtatgaaaac gagtttcatg ggactgtttt 60 tggtggattg tgattcggag ggaggttggt gaaggtgggg gaagtttgtt ttttgtacag 120 aaacggtcag atagattgaa gttggaggga aaaaagaaaa tagtcgagaa atagcggaaa 180 actagtccag tttgccgtgc agatatgggt gcatcagtgt gagtgtgtgc agtagagtcg 240 aatcagagag ttttattgag agatcttgtg ggagtgttta agtattgaaa acagtggggt 300 actggaagcg gtctggaaac acggaggaaa gtgtttgttt tggaagtttg tttttccaag 360 tttgtgatga agcggaaggg ttatgtaggt tgaaattaga gtcatctcat aagaatacct 420 tattatgtgg ctttgagaag attgaagcag tgtgtacatg acaagggggg aagttatgtt 480 ggaaactgag cagaagaaga agaagaagaa gcaagcggat cacagtacgg atagaaagtt 540 gatgaaagga aagtgcgtta ctagctagaa agacgggaag ggtgcgaatg tactagtagt 600 ggtgttcggg tttttgatga agaggtgtgt ggagcagacg gcatgttggt attggtaaga 660 tagagtgaac ggaagtcaaa acgtgcgcaa aacatgtaga agtgaatcaa aatagtcaag 720 tgattgatta gggtggacca taaatagatt ggaatttggt ggtctgaaat ccataaactg 780 tcctcttttt gttgatgtgc tattgggagc aatgtaggaa ttgttgatca tcaagataat 840 ttttatcaat ggagatcata ttagtgaagt ggaaaagatg aacaagaagg ttttgttcag 900 agaggaaatt ttaaatgatg agaattgagg attacggatg tcttaaacag aaaatgcttt 960 aattgaagtg taaatcgaac atacggttgg atcattaaat gtgaaaagcc aagacgggca 1020 gctataacaa atcgtaattt cgggaataca ggtattataa tttaaagtta agttacaata 1080 aaatacctaa ctataataaa taatacataa taaaagagaa aagttaattc ctcgggcggg 1140 ctagtggaaa tcattgcaag gtaaggaaga tttgaatctt atattgctat tatcatattt 1200 attcttccat tgaatttgtt ttacataatg aaataatctt tattagcttc cagttcttca 1260 gttatcatta ataaaaatta tatatttctt gaaacatcag tagaattatt taacaacatt 1320 taaatagtac agaggaaaac aaacagggca aaaataggcc cagtagatat cattgatgtt 1380 atcaattaat gcgcagaggt atagctaatt ttcatttcta cttattactt atttcttact 1440 wattttttac gcataattca agttatatat tgaaatttct cattaaaaat taattattgt 1500 tgaataatta gggtaaggtt cataaaaaca aaatagaatg gaaaatattg cagtcaatgg 1560 ggatattgga gagkgatatg gcagtctaca caaagcatta atagagcaaa ttcaagtagc 1620 tgatagtcgt ttcaggtcat tcaacttttc caaagaggtt tttgatcgtc ttattgcgcc 1680 cgcagttaac aataaactat acaaaaagat aattacaaac agggaggtca ttaaattgtt 1740 caacagaaat agtattaagg atgttcgtat catatgtgct acggcggaaa tattctgtac 1800 gattatcaat atatttttta atgggattta caagctatca gttattcctg ataatgggtg 1860 tagaatggaa gaggggatcg aattcaatat tgattgtaca acaagaaaat cagcattaca 1920 attcaaatcc tgtaagacac aaaacaatca tgaaattgaa agcagaatta gggataagat 1980 ggccaagaag ataagacaaa acaaagatga agagtttgga atttatgaaa cagttatatt 2040 tgataaggca acaaaaatta gaattaagaa aattaaaggt gatggaaatt gtctgataag 2100 agcgcttatt gatcaattaa gagggaaaac gcaggatgat tatgtcacaa gtaaggacat 2160 tcaagattta agaaatagaa tagcagatca catgaccaag aataaggaac ggtacgaacc 2220 ttttttgata gaaagtttgg aggaaggcgg atttgaaaag tttattaatc ttagtgtaag 2280 acaaatgggt gtatggctag gacacgaagt aatggtagca gtcagcgaaa tttttgggaa 2340 gaacataatt gtacaccaag tagatgaacc agatgttatt ataggtgaaa acggaagaag 2400 tgaggagcaa acgctgagga tattttatac cagaactgtg gaaggtggag agaagaatca 2460 ttttgaaagt gttattgaag tgttggatgt ggagaacagt tcacaaggac aaattataat 2520 acaggaaaac gaaaacgatg ataccatgga agtaggtaac gtcgtagaag aattgaatgt 2580 agaaaatgat aatataatca gaataaaaca gatgagtagg agtgataatg agacagagag 2640 attgcaaaat aaccaacctt ttcaaagatt tatcagggta gcatccttga acgtaagagg 2700 atgtattaaa gaagaaaaaa gaaatgaaat agatgaacta atgacattgc ataatataga 2760 tgtagtggcg ctacaggaag tgaacgcttt gggagagatc atcgaaacaa agaactataa 2820 atggatagca cctttgaaca ggccaaataa aactaggggt ctagcactaa tgattaggaa 2880 aaatgaggag atttcgttgg gggatgtaag aatgaaaagc tcatcgatta tgtcggcaga 2940 aattattatt aatgattgtt ggaaattcct tttggtaaac attcatggac caaataaagc 3000 agcatacaat ttcttagcag aactgggtac agtaatcgga aaggatcata ttagaaataa 3060 aacactaatt ctaggagatt ttaatgccca aattggtcgt ggtgatgtag atgaggagga 3120 tagtcaaatt gttggagata agttaggatt tgacamaagc aatgagaatg gagaagaatt 3180 tatcatgttt ctaaaattac ataaacttat taatatttct tcaaaaattg gtgtgaatac 3240 gaatgttaca tggaggatag gaaagaagga aagccaaata gaccatgtac tattacccaa 3300 agatagcgac ctaagastaa gatatgtgcg agggttttgg acaagtttaa atactgatca 3360 taaattattg atggttggat taacttttga gaataacact atcaaaagaa awaggaagaa 3420 gtttaagaaa aagttggatt attcagcact aaaatatgat gtggtaaaga aaaaatattg 3480 tgacgctcta acacaatacg ataatgatga ggttagagaa gtgaggaaag atatacataa 3540 gctttataaa gaagtaatag aaaaattgaa aagttcatca gataaagcat tggaaacatc 3600 gaaggcgcct ttaaccccta aaaggaaaaa agctttagga agattgaata aggccttgaa 3660 attatggaaa aaacatcctg atatgttgcc ttataaatgg aaggtactga ataggaagga 3720 agagtttaaa atagcagaaa aagagcacat agaaaaaact atcaaggagt tttatagaaa 3780 tttgaaggac tttgatgtag gagtaagaat taggaagtca tacatatttt tgaaagattt 3840 tatgaaaaaa aggtctagga agaatgcata tattccaact aggttatgga atgaagtttt 3900 aaaacagagt gaagggccta atgtcacttt cttagagaag aacgatacct gtcctttgac 3960 cgagcctcca tcaaaggagg aaatgtcttc gatcattttc aattcgtgca atggtaaatc 4020 agcaggacca gatgggattc gaatsgaaat gataaagtat gctgatgaga aaacttttaa 4080 tgatttggtt ttgatatggc aaagaatgtg gatagacaat gttatgccta aggacctttc 4140 ttgtacaact caaattccaa ttcccaaggt tagttatcct agaaaagtag aagattttag 4200 aagaattagt ctttgcaatg ttatttacaa gccgtatgcg aaatgggtta aaaaccgact 4260 aaaagaattc acgggagacc cagggcttca tcaagcagcg tttactgaag ggagatcaac 4320 agatgaccat attttcatta cgcgtagaat aatggaagag tactggaatg gaggagagca 4380 attatatatt gcgtctttag atatcagaaa ggcgttcgat aatgtcaagt tagagaatat 4440 gaaagaagta ctattagaat tgaatgttcc tacacatatt attgatagag ttttattatg 4500 cctaaatgaa gatagaacta gggtaaaatg gcaaaaccaa tggtcagaag aaattaatag 4560 aggaaaagga attaagcaag ggtgtccatt gtccccattt ttattcaatt tagttatgca 4620 gaacatatta aaagcagtgg ttgaaaaggt tcctgaacta aaattgatgg aaatagaatt 4680 attagatcta cctttgattt tagtttttgc agatgacgtt ctcataattg ctaggagccg 4740 aaaggagcta gaaaaaattg tgaaagcctt ggaagaatgt ttaccaacag taggtttaga 4800 aatcaatgaa gaaaaaagtc atgttataat acgagccccc aataaaatgg gagatattcc 4860 aagtacaatt ttgctaaata ataaatcata caaaacgtgt aaaactttga aatatctagg 4920 aataactttg actgaaaact taaatagacc tgctacagtg aaacaaagat gtattaatac 4980 tgcgaaagca tcaaaagtag ttgttgaatt ttgtaaacaa tttaaacctt cttggaccat 5040 aggcaaatta atttataata cagtgatcgc accagcaata ttgtatggca caaaagtatc 5100 aacgcttaca aaaaggagca gaaaacaatt agcaaaatat gagaaaataa tcgtcaagag 5160 tatttggcat cattgtacaa aagaaaatga taataaattg aatatcagaa aagaattaaa 5220 tggtaaaaca ataaatagaa aaataagagt agggagaata agttattatg gccacattaa 5280 aagaagacca caaaatcatc ccttgaaaat ggcatataga ttcaaattca acaagaaaaa 5340 agagggcagg ccaagttata catggaaaga ttccttgaag caggatctag acagatatag 5400 aaacataagt aaagaagaat ggaaaaccct tgctaatgac aaacataaat tgaaacaaaa 5460 agcggaagaa atctacaaac aggaagaaag tgaaaattca gatggtgaag atagttaaag 5520 ctagaaagaa taaaagagat gaatattaca gtacagaata cgcaatggga taacaaatgt 5580 aaaaaaaaaa aaaacaaatc taaataccta acacttataa aatgatttag ccggaaaagg 5640 aatgacatat tgagaagata atgtatgtga actatgaaat gattcatata tttaacagtt 5700 ttttcactca ctaccccatt agtcaaataa taatttgaat tatttatata tttattattc 5760 atttatatat atatattaat ctatttactt tcttatttat ttacttattt attatatatt 5820 tatttattta ttaaattatt tacaagaaca tcatttacag aaaattcacg aaaaacaatt 5880 caatcaacac atgtccacat ggcacactgt cacgcttatt agttacgcta ctccatgctc 5940 ttcaaattga aacggatcgt aatagtggtt tacctataat agtgtatcca gtgwatatac 6000 catttttatg cgtaacgaat cgatctgaaa tactcacaag atattagagc atttagacat 6060 tcaacaactt atgaaaatgt tattttmaaa ttaattttaw gaaattatct cataattaaa 6120 cattgtttgt tacacattat tataattaaa tactggttga attgaaattc gttttttaga 6180 tgatgatgta agatagacag aatgagaggg taaagggaaa aaatagtgag tggataaaca 6240 aataaatata tattaatagg aggatgttat agacaaacaa gttcacatgg aggaaaaaaa 6300 ggaaggaata atgttactag aatgaaatag ttggtcaata aagtatcgac aagttaggat 6360 ttaccggtaa gaatcgtttt ttttagatac tcgtaaagat caatttataa tcaatgagaa 6420 aacaaatata aggatcagat aggaaataca ataggctagt ttttaagtgt atttcacagg 6480 ttgaataaat ggtgatgctg gctcaaattt tgacgcaagg tactggtggt cggattatta 6540 aggtaagata ttgattattg ataaattata ttgaattgtt taacatactt aacatagtgg 6600 gggtagagag ggagggtatc agactaattg cagactggat gagggattcg tggaaaccag 6660 gcccaagtca aggaggtgat cgctgctatg gccaaaattc ttggattttg aacataattg 6720 atgatgtctt ccaacagtct gcaattattc gacgcaccct ataggcaata ccttgttaca 6780 aaccaccaga cggaaatctt aatatgtatt gcgagttgct gaatcatata atatgatcag 6840 tcggttggaa ggagattctc tacccctgaa gtggattgcg cataggtcta agtacttatg 6900 ggtccacact ccattttggc ccttcataca accgattaga gagtatttta tagtccagca 6960 ccttgatctt gctattaaga ggaaccgtac gacatggcaa gtcctgcact ggtggtggct 7020 ttatatcaca tcacatcaca tcacatcaca 7050 // ID Gypsy-33_DWil-I repbase; DNA; INV; 4207 BP. XX AC scaffold_181130; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_DWil_; KW Gypsy-33_DWil-LTR; Gypsy-33_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181130; Positions 825422 821216. XX CC Positions [1747-2289] - Reverse transcriptase CC Positions [3301-3777] - Integrase core CC 'TACA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 113..1240 FT /product="Gypsy-33_DWil-I_2p" FT /translation="MLRSGKKKGAKNQKESNYQVQMDSLNSTFTSNVEITE FT QPDLKVVSARETDIYPCASKNDADESRQTGAEETKSEDGNADNICTSVRSS FT ELETLIRMLAVKFEDSGSQRNDKFSNVTVDSFAKVVPDFDGTSIPIGQWLK FT NFEENAEAYQLSEKQKYVNARNKMMGTAKLFLETIVIGNYEELCIALMDEF FT DQKLSSAEIHKQLRSRKKLQSENFHDYVLQMRKIAALGEIEDTSIITYIVD FT GLEVREDLKYPLYSSTSFKQLRKCYDLVGTMTRNAQNKRRQIVHTKPVVQA FT DSNYKGKQLHCFNCGATDHIKADCNVGTKCFKCNGIGHISKDCPERKHQVN FT IVRKSDEDSRLKEFSIDGKKLFVWWIPAQTLLS" FT CDS 1240..4206 FT /product="Gypsy-33_DWil-I_1p" FT /translation="MRRDKYEELFLDRQMQNHVEKLFGLGNVVTEVIGEVT FT TTVKVDGLSTEHKILIVPNSKLEADVLIGYDFVKKFTVMMDPNGCTFSLHA FT DEKNQLNVLNVVSSVNEIDVEPKYADTVAALIKDYKPKDNPLSCPIMLSIL FT PNCNNIVFRESPSRLSANEQTVVKDQVEEWLKQGIVRQSNSDVASRVVLAK FT KKDGSHRLCIDYRKLNTLVLKDRFPVPIVEDVLEKMQQAKYFTVMDLKNGF FT FHVDIEEESRKYTAFVTKQGLFEFNKAPFGFCNSPAVFVRFIYYIFQKLIN FT AGVMEIYVDDIVVFGSTAEECLDNVKTVLEEARKYGLQIKWSKCQFLKRRI FT NFLGHEIEDGVVCPGKEKVKAVKNFQLPKNIRGVQAFLGLTGYYRKFIRDY FT AIIAKPLTTLLKKDAEFRMDSPQVKAIEKLKECLMNKPVLKTFKHSAETQL FT HVDASKNGFGATLLQQHDDKWHPVFYWSKKTSDQEEKLHSYFLEVKAAYLA FT TKKLRHYLMGIEFVLVTDCAAFKMTAAKKDMPRQVVQWLMHLEDFSCKIEH FT RAGDRMKHVDSLSRFPVMIVTTEVHAQMLRAQSNDEKFAAVFEILQHKPYA FT DFKVKNKLLYKQVDGLDLLAVPKSMEKEIIRNTHNFGHMGVQKTKHAIQQD FT YYIPHLDKKVGQFIANCVQCITHNRKLGKQEDFLHCIDKGEVPLETLHVDH FT LGSLDPTLKNYKYIFAVVDGFSKYTWLYPTKSTDASEVVKHLTDWVTIFVN FT PRRIVSDRGAAFTSKHFKEFCTNHKIEHVETTTGVARGNGQIERVNRVILS FT VLAKMSAAEPQKWYRYTGQVQRTINSNVHSSTKFTPFEVMFGLKMRNESDL FT ELQRILEEELLDGLQNERNQIRAEARQQILEAQTRYKRHYDQRRKPCHGYQ FT LGDLVAIKVPQFVSGKKLSNKFIGPYEIIKVKRNGRYEVKKSAEFQGPNQT FT STSEDNIKLWRYMENNDETDLEGSSEADDLQDDR" XX SQ Sequence 4207 BP; 1497 A; 714 C; 955 G; 1041 T; 0 other; tgggggctca accgctcaac cggtgaaaca aaaaagaaaa agaacagaac tgtaatacaa 60 gttgaaaaag gtgaaaaaag tgaaaagaag agacagaata aagttgataa aaatgttgcg 120 ttctggaaag aaaaaaggtg ccaaaaatca aaaagagtct aactaccaag ttcagatgga 180 cagtttaaac agtacattta cgtcaaatgt tgaaataacg gaacaacccg atttaaaagt 240 ggtaagcgca agagagacag atatataccc atgtgcttcc aaaaacgacg cagacgaatc 300 gcggcagact ggcgccgaag aaacgaagag tgaagatgga aacgcagaca acatatgcac 360 gagtgtgcgt agcagcgagt tggagacgct gatcagaatg ttggcagtta aatttgaaga 420 cagcggttcg caacgtaacg ataaattttc aaacgtaaca gtcgatagtt ttgcgaaggt 480 ggtgccagac ttcgatggga catcaattcc aattggacag tggctcaaga attttgaaga 540 aaatgcggag gcatatcagc tgagcgagaa acaaaaatat gttaacgcac gtaataaaat 600 gatgggcact gcaaagttat ttttggaaac gattgtaatt ggcaattatg aagaattgtg 660 tatagctctg atggacgagt ttgaccaaaa attaagcagc gcagagattc ataaacagct 720 gagaagtcgc aaaaagttac agtcagaaaa ttttcatgat tatgtattac agatgcgtaa 780 aattgctgct cttggtgaaa tcgaggacac atcgattatt acatacatcg ttgacgggct 840 ggaggtacga gaggatttga aatacccatt atatagttca acttctttta agcagctaag 900 aaagtgctac gatctagtag gaacgatgac cagaaacgcc caaaacaaac gtcgccagat 960 tgtccatacg aagccagttg ttcaagcgga ttcaaactat aaaggtaaac agttgcattg 1020 cttcaactgt ggagctacag atcatataaa agctgattgc aatgtgggta ctaaatgctt 1080 taagtgtaac ggcataggcc acatatcaaa ggattgtcct gaaaggaagc atcaagtaaa 1140 tatagtacga aagtccgatg aagacagccg attaaaggaa ttttcgattg atggcaaaaa 1200 gttatttgtt tggtggatac cggcgcagac gttactatca tgagacgaga caaatacgag 1260 gaactttttc tggatcgtca gatgcaaaat cacgtcgaaa aattgtttgg tcttggaaat 1320 gttgttacag aggttatagg cgaggttact actacagtca aggtggatgg attgagtaca 1380 gaacacaaaa tactgatcgt ccctaactcc aaattagaag ctgatgtact tattggatac 1440 gacttcgtca aaaagtttac ggtgatgatg gatcccaacg gatgtacgtt tagccttcac 1500 gctgatgaga aaaaccaatt gaacgtttta aatgtggtaa gctctgtaaa tgaaattgat 1560 gtagaaccaa aatatgctga tacagtcgct gcattaatta aggactataa gccgaaagat 1620 aacccgttaa gttgcccgat tatgttaagt atattgccaa actgtaataa tatcgtattt 1680 agagaaagcc ccagtcgcct atcggcaaat gaacagacag ttgtaaaaga ccaagtagaa 1740 gagtggctaa agcaaggaat cgttcgtcag tccaactcag acgtggcaag cagagtagta 1800 ctagcaaaaa agaaagatgg atcgcacaga ctgtgcatag attatcgaaa attaaacacg 1860 ctcgtattga aggatagatt tccagttcca atcgttgaag atgtgttaga gaaaatgcaa 1920 caagcgaaat actttacagt catggatctt aagaacggat ttttccacgt tgacatagaa 1980 gaagaaagtc gtaaatatac agcattcgtg acaaagcaag ggcttttcga atttaataaa 2040 gcgccttttg ggttttgtaa ttccccagca gtttttgtaa gattcattta ctacattttc 2100 cagaagttga taaatgcagg agttatggaa atttatgttg acgatattgt ggtattcgga 2160 tctacagcag aagaatgttt ggataatgtt aagactgttt tagaagaagc acgaaaatat 2220 ggtttacaga tcaaatggtc gaaatgccag tttctaaaaa gaagaatcaa ttttctgggc 2280 cacgaaatag aagatggagt agtatgtccg ggtaaagaga aagtcaaggc agtcaaaaat 2340 ttccaattgc cgaaaaatat cagaggtgtt caggcatttt taggactaac aggctactat 2400 agaaaattca ttcgtgatta tgcaataata gcaaaaccgc tgaccacgtt gctaaaaaaa 2460 gatgcagagt ttagaatgga cagtcctcaa gtaaaagcga ttgaaaaatt aaaagaatgt 2520 ctgatgaata aacctgtttt gaaaactttc aaacactcag ctgaaacgca attacatgta 2580 gatgcatcaa aaaacggatt tggtgctacg ctcttgcagc aacacgatga taagtggcac 2640 cctgtatttt actggagcaa gaaaacaagc gatcaggagg aaaagttaca cagttatttt 2700 ctggaggtca aggcagcata cctagctacg aagaagttgc gtcattattt gatggggatt 2760 gagtttgttc tggttacaga ctgtgcagct ttcaaaatga cagcggcgaa gaaggatatg 2820 ccccgacaag tcgtacaatg gttgatgcac ttagaagatt tttcctgtaa aatagaacat 2880 cgtgccggcg ataggatgaa gcatgtcgat agcttaagtc gctttccagt aatgattgtc 2940 acaacagaag tacacgcgca gatgctaaga gcacaaagta atgatgagaa gttcgcagca 3000 gtattcgaga ttttacaaca caagccgtat gccgatttta aagttaaaaa taaactttta 3060 tataaacagg tcgatggatt agatttgtta gcagtcccga aaagcatgga aaaggagata 3120 ataaggaata ctcacaactt tggacatatg ggtgttcaaa aaacaaagca tgctattcaa 3180 caagattatt atattccaca tttggacaaa aaagttggtc agttcatcgc aaactgcgtt 3240 cagtgcataa cacacaacag aaagttgggc aagcaagagg attttttaca ttgcattgat 3300 aaaggagaag ttccactgga gacgctacac gttgatcatt tagggtcatt ggatccaact 3360 ttgaagaatt acaagtacat attcgcagtc gtagatggat tctctaaata tacatggcta 3420 tatcctacaa aatcgacaga tgcctcggaa gttgttaaac acttgactga ttgggtcaca 3480 atttttgtaa atccacgtcg tatcgtcagc gatagaggtg cagcatttac gtctaagcat 3540 tttaaggaat tttgtacgaa ccataagatc gaacacgtcg aaaccactac aggagttgca 3600 agaggaaacg gacaaataga gagggtcaac agagtcatac tttctgtctt agccaagatg 3660 tcagctgcag agcctcagaa gtggtacagg tacaccggac aggtacaacg tactataaac 3720 tcgaatgttc actcgtcgac caagtttacg ccttttgaag ttatgttcgg attgaaaatg 3780 agaaacgaat cagatcttga gttacaaagg attttggaag aagaactact agatggactt 3840 cagaatgagc gaaatcaaat aagagcagaa gcaagacagc aaatactaga ggcacagacc 3900 agatacaagc gccattacga ccagagacgc aagccatgcc acggttatca gttgggcgac 3960 ctagtagcaa ttaaagtccc acagttcgtg tcgggcaaga agctgtcgaa taaattcatt 4020 ggtccatacg aaatcatcaa agtaaagaga aatggtaggt atgaagtcaa aaaatctgct 4080 gaatttcagg gtccaaatca gacgtccacc agtgaagaca acattaagct ttggcgttac 4140 atggagaaca acgatgaaac cgatctcgaa ggatcatctg aggcagatga tttgcaggat 4200 gaccgaa 4207 // ID Gypsy-16_RP-I repbase; DNA; INV; 2492 BP. XX AC ACPB02041007; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_RP_; KW Gypsy-16_RP-LTR; Gypsy-16_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-2492 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02041007; Positions 31327 33818. XX CC Positions [1812-2084] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 894..2492 FT /product="Gypsy-16_RP-I_1p" FT /translation="MPVSTRKQNMEELMSLMREMKEDMKNMREGQEETNKR FT LDDVNKGQDELKKSLDETNKGQEKMKEEIIRKIRREVIKMPQERKAETNNI FT QEEDEGSQDDADSRSDLKGDQDKGRRSCVEVQEAAWEEERIENCSEKEEQD FT RGKSTIIVQPGVEWSPEATREEQHQGENIVSVLKCQEEPDRKRVDSKEEGN FT GVILPGSRKEEKLSTEIHGGIAGRHFGVKRAWAKRRQRFSWVGYQEDGADW FT ASRCTEWVARKGPSRETHGALRQYKVGVPCKRIASDMAGPSPDYRRRMRCN FT LFSMDHFIKRAEAHDLAGQESSPAAAASAVYQILGVRRKKTTPLHPHSDGM FT VEQLNRTLGGHQRTLRKDQQEDWAKYTPPMTMANRSAVQETTGRTPASVVF FT GRELPLPCDILLGSLEAEESDVSGYVAELRRKWRAARGKAGNRMKLQSDQM FT KTSYDWQVIRRKFPRGEKIWAYKRVRKKGKPPKLLGRNLHHCLSAQRPGVT FT DSTHAQIQDEGGTLGPIAAQHHEGDFCSGRTVLRGEQ" XX SQ Sequence 2492 BP; 770 A; 435 C; 763 G; 524 T; 0 other; gctaagcgaa aggcctcagt tggggtggtg gcatatgtgt ttcccagcct gtgattgaca 60 ggtgagagtg agacagacgg agttggagag agacggagaa tgggaggagg cggctcacag 120 ctgaggtggt ggcagtatgt gttttccaac caatgaacgt caggagggcg tggaaactta 180 tgctcagaga gttggaggag ggcagtccca gttggaggga ggcacgcagt taaaaggagg 240 ccgacacagc tggagaagtc agtttcaatc atagtttcaa ttaagtgtta atcgagtttc 300 aattgagttt ttcaatctag tttcaatcac agcgttaatc aagtttcaag caagttttaa 360 ggagttatta gagtttatag ttattagtca agcttatgag ttagtcttgg agagtcagtg 420 ttgcggagtg cttcagtggt agctagttgg gcaattagtc agttgatgag tagagctgtg 480 ttagtgtagc tgagactagt gcaattcggt gcagtgcgga ccagtggtca gtcggggaac 540 agttttgtgg cgttacgaag tgagtaagta acagtaaact gttgtgactg ttgtaaatgg 600 tgactgtaat gaatagcgtt agttgcttgt aaatgttaat gtgagtaatg ttaacgtgtt 660 aataaatcag gttattactt acaaatttag cgagttttat tagttaatca aacccatatt 720 atttaataac tagtggcagg cattagttgt gtaagtaaag tttttatttt gggggggaaa 780 aactttacat tggtgtcaga agtgggatct agtaaaaccg ctaaacaagt aagtgtcgat 840 tttattagtt cgggggaatt gccggaaagt tttggaaaat agacgagtgt tatatgccag 900 tgagtactag gaagcagaac atggaagagt taatgtcact gatgagggag atgaaagaag 960 atatgaaaaa catgagagag ggccaagagg agacgaacaa gcgcctagat gacgtgaata 1020 aagggcaaga tgagctgaaa aagagcctag atgaaacaaa caagggccag gaaaagatga 1080 aagaagaaat aataaggaaa ataagaaggg aagtaatcaa gatgccacaa gaaaggaagg 1140 ccgaaacaaa caacatccaa gaagaagacg aaggcagcca ggatgatgca gatagcaggt 1200 cagacttaaa aggtgaccag gataaaggga gacgtagctg tgtagaagtt caggaagctg 1260 cgtgggaaga agaaagaata gaaaactgca gcgaaaagga agagcaagac cgaggtaaga 1320 gtaccattat tgttcagcca ggagtggaat ggagccctga agcaaccagg gaggaacaac 1380 accagggtga gaacattgtg tctgtgctca agtgtcagga agaaccagac cggaaaaggg 1440 tggacagcaa agaagagggg aatggagtaa tcctgccagg atcgagaaag gaagagaagc 1500 tatcaacgga aattcacgga ggtatcgccg ggagacactt tggagtgaag agggcatggg 1560 ccaagcggag gcagcggttc tcttgggtag gctaccagga agatggtgca gactgggcta 1620 gtcggtgcac cgagtgggta gcaaggaagg gacccagccg cgaaacccat ggtgccctga 1680 ggcaatacaa ggttggagtt ccgtgcaaga ggatagccag cgatatggca ggaccttctc 1740 ctgattacag acgaagaatg cgctgtaacc tgttttcgat ggaccacttc attaagcggg 1800 cagaggccca cgacttagca ggtcaggaat cgtcaccggc agctgcggca tcggcggtct 1860 accagatcct cggcgtcagg aggaaaaaga caacgccgct acaccctcac tctgacggca 1920 tggtggagca actcaacagg actctaggag gccatcagag gactttgagg aaagatcaac 1980 aggaggactg ggctaagtac acacctccca tgacgatggc gaaccgatcg gcagttcagg 2040 agaccactgg ccggacacct gctagcgttg tgttcggtag ggagcttcca ctaccctgtg 2100 acattttatt gggcagcctg gaagctgaag agtcggacgt gagcggctac gttgcggaac 2160 ttcgccggaa gtggagagct gcacgtggaa aggcaggcaa caggatgaaa ctgcagagtg 2220 accagatgaa gacatcgtat gactggcagg tcatacgaag aaagtttccg agaggtgaga 2280 aaatttgggc gtataaacgt gtccgcaaaa agggaaaacc tcctaagttg ctgggaagga 2340 acttgcacca ttgtctgtcg gctcaacgac ctggtgtaac ggattcaacg cacgcccaga 2400 tccaagatga aggtggtaca cttggaccga ttgctgccca acatcacgaa ggagacttct 2460 gttcgggacg aacagttcta agaggggagc ag 2492 // ID MuDR3x_AP repbase; DNA; INV; 2411 BP. XX AC Contig17480; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR3x_AP. XX NM MuDR3x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2411 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1352-1352 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(494..1126,1130..1462,1523..1951) FT /product="MuDR3x_AP_1p" FT /translation="MDKIQILKTDRGGLKIVFKDYMYTKQKNLANNKIRWE FT CVEKKKKNCKGSVTTDQMIFNVLKNIEHKNHPPSPAQIEVAKVMLNMKTKA FT QDNFDTPSRFFATETSNLSNEAMVLLTEEKSVKTSLRRIRNKKYPSLCPLS FT ELKINGIWATTGGPEPKPLLLFDNENNTNRIIVFASPEGMSALSKSVKWCM FT DGTFFTCPKEFYQVYIIHACINTYIPCVYALLQRKTKEIYIELLSTLRSML FT SELKLKTISIDFEQSMIQAIELVFVDINIQCFYYHLSQCIWRKVQNIGLAT FT KYKENQNVRQIVGMLKGLALLPLKYVKKGMMFCYIYWLGMSVLYDLSNDLN FT DPDIDVLLFYFDRTYVNGTYKRTTTKSNGLSLWWSSPIFPPYLWNVHNAKK FT KNTGRTNNISEGFDNKFKSLIRTQHPTIWMFIEALQRSDSLANIKLLNIQT FT GTVFTKKKNKYQKRIKICV*" XX SQ Sequence 2411 BP; 911 A; 293 C; 331 G; 876 T; 0 other; ggacatttcc ccccgataaa ttgttggatg cggacaattt ccccccgttt tttttttgtt 60 ttttttattt ttaattattt accttcgaat atataattat tatttttaac acatattttt 120 tagcttttgt tcatatttat ttattttttt tttttgtaaa aactatgcgt taactaaaaa 180 gtctttaatc ttaatttcaa taaaaaataa caaataatat ttaaaacaag ttaagtaaaa 240 atatagttaa agtaaaatag tagtatacta atattgaata gatgggtaac atcgaaatct 300 aaaataaatt gtgtccaaaa cccagtcaat tactaagact attaagataa cattcttata 360 caatcttaca tcgtcatgaa cattgagtca atattttata tcaaaactgt tcaaagcagt 420 ttagtagtgg tttagtggtt aaagtggttg tatacgttta aaataaaagt tttaaatttt 480 gttttttgtg gacatggata aaatacaaat tcttaaaaca gacagaggag gtttaaaaat 540 tgtttttaaa gactatatgt atacaaaaca aaaaaatctt gccaataaca aaattagatg 600 ggagtgtgtg gaaaaaaaaa agaaaaattg taaaggtagt gtaacaaccg atcaaatgat 660 ttttaatgta ttgaaaaata ttgaacacaa gaaccatcca ccgtctccag cgcaaataga 720 agtagcaaaa gtcatgttga atatgaaaac taaagcccaa gacaattttg atactccttc 780 cagatttttc gcgacagaaa cgtctaattt atcaaacgaa gcaatggttt tattaacaga 840 agaaaaatcg gtcaaaacat cactaagaag aattcgtaat aaaaagtatc cttctctatg 900 ccctttatcg gaattaaaaa taaatggtat ttgggctact acgggagggc ctgaaccaaa 960 accgctttta ttatttgata atgaaaataa taccaatcga ataattgttt ttgcgtctcc 1020 cgaaggaatg tctgcgttgt caaaatctgt caaatggtgt atggatggta cattttttac 1080 ttgtcctaaa gaattttatc aagtatacat aattcatgca tgtatttaaa acacatacat 1140 tccatgtgtt tatgccttat tacaaagaaa aactaaagaa atctacattg aacttttatc 1200 aacacttaga agtatgttgt ctgagttgaa attaaaaact ataagtattg actttgaaca 1260 atccatgatt caagctattg aattagtatt tgttgacatt aatattcaat gtttttatta 1320 tcatctttct caatgtattt ggcgtaaagt tcaaaacata ggattagcta caaaatataa 1380 ggaaaatcaa aatgttcgtc aaatagttgg catgttgaaa ggtttagctc tattgcctct 1440 gaaatatgtt aagaaaggta tgtgatatta tatattatgt taataatatt attaccatta 1500 cttctattat aatttataat aaatgttttg ttatatttat tggttaggta tgtctgtgct 1560 gtatgattta tcaaatgacc tcaatgatcc agatatcgat gtactattgt tttactttga 1620 tagaacatac gtcaatggaa cgtacaaaag aactactaca aaatcaaatg gtttgtcatt 1680 atggtggtct tcaccgatat ttccacctta tttatggaat gtacataatg ctaaaaagaa 1740 aaatacaggt agaacaaata atattagtga aggtttcgat aataaattta aaagtttgat 1800 aagaactcag catccaacta tttggatgtt cattgaagct ctccagagat cagattcatt 1860 agcaaatatt aaattattaa atattcaaac aggtacagtt tttacaaaaa aaaaaaataa 1920 atatcaaaaa agaataaaaa tatgtgttta ggaataaaaa ataaatcaca aataaagatt 1980 tttttacaaa ctgtttcatt tttatgccga cattttttct attagttatt cttttttatt 2040 tttatgtcaa aatttttagt attaataatt cattatattt tgtttataca atcttttttt 2100 atttttacgt cggatttttt ggccagctgt attgtactat attatattat ttaggcaata 2160 tcttacttaa cttgttttaa atattattta ttattttttt attgaaatta agattaaaga 2220 ctttttagtt aacgtacagt ttttacaaaa ataaaaaaat aaatatgaac aaaagctata 2280 tgtgtgtccc tatattaaaa taattttaat tttaaattat aattttatag tcgaaggtaa 2340 ataattaaaa ataaaaaaaa aaacgggggg aaattgtctg catccaaaaa tttatcaggg 2400 gggaaatgtc c 2411 // ID Academ-1_BM repbase; DNA; INV; 5870 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5870 BP; 1518 A; 805 C; 825 G; 1404 T; 1318 other; tagtccggtc tatttaagcc taaaatcagg aggaaggtta cataaattcc cataaagctg 60 aatattggca tagttgttca aaacatcatt ataaagaaaa tctcaaaagt cctcatcgtt 120 ccgaggtgtg ctaaaaattt ggcacggggt caaaggtcaa aaaaatgagt ttttttgcga 180 tttttagcga aacggtgagt tttttcgcaa aattacccca gtacaaaatt gtagatcacg 240 aaattctcta caaaaaatat attaataatt tttttcctaa aagccaccat tttggagata 300 taacgttccg aaaagttgga tcagtcgtaa tccctctatt ttgcacacgt acgccatata 360 tgaaagtcac taaaggtgta atagaactaa aataagttcg ctattattta ttgtaacttg 420 cttatgtaaa atataagtta aactacgaga atctcaggta taatgaannn nnnnnnnnnn 480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 540 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt tccgatgcta 960 tatcgaaagc tgttgttgct cgaatcgaat ttgactatga tttagttgct gctgaagcaa 1020 agtatcacca agaatgttat acttcttttt tgaaaccgac tactggtggt acagttggtc 1080 gtcctaaaga tgaagcaaca gatctggcaa tgcaggaaat atttacatac attgagagta 1140 gcaatgactg tcagtttact ttggctgagt taaaagatgt tagcaaaatt ttaactttag 1200 ataacaggac tattaagctg cgacttaaac tgaaatatgg cgataaaatt attatcactg 1260 aaaaaccggg agcgttaaca tttgtatgtt tgataaacag tcaccatgaa attttaaatc 1320 aatcttggtc cgaaaataaa aaaattaata aacaagaaga acgattaaaa actctagaag 1380 cagcagcatc tattattcga gaagatattc aatcctcggt atttgataat tcctactacc 1440 ctccaccagg tcgaatgttt gaagatttaa ataatgacat cccacagtca ctgacatttt 1500 ttttggagca aatgatctta aaaaataaac gatcaaattt cgatcattta aaactagtat 1560 gcaccaatat ttgtcatagt atcatgactg ctattcgacc aagatcgttc aagtccaaat 1620 tacagttagg tctttcagtt ttttttcatc ggaggttcgg gtctaagcgt cttatacaaa 1680 ttttgtcgtc ttttggtttg tgtgcttctt ataatgatac attgatgtac gagtcagcag 1740 cagtttttca tcctcttcct catgtcctac cacctgaaag tggcacactt attcaatacg 1800 ttgcagataa tgctgatata aatgttaata cgctagatgg taacaattca ctccacataa 1860 tgggtataat tcaaattgta actccaaagc actctgtttt actagnnnnn nnnnnnnnnn 1920 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1980 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2040 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc cattcatcca tcaaccagct agtaactata 2160 atacggtcta tacgacttta ctttgtgcat tagagaacgc gaaacgttac ggccacactg 2220 tatgcatcgt gactttcgac caacccttgt atgcgaaagc tcgagaaatt gtatcagccg 2280 cacctgacgg ctctgaagta tcgaagatta ttgtcagact tggaggattt catctactca 2340 tgtcttatct cggagcaatt ggttatatta tgcaaggtag cgggataaaa gaagtactct 2400 cagaaatcta tgcaccaaag tcattagaaa aaatgctcaa tggtcatgct tacgcaagag 2460 ctgttcgagc acatacacta ctccagctgg ctttagcaat tacaatatta aaaacaatag 2520 atattaatga tatcatggga gcagatttaa tcataaatat tgaccatata ttagatagca 2580 ctctttcata ttcagatatt gaaaatgata atgaagtatc aggtgcttta cttcataaat 2640 ttaatatgaa attgaaagat tttgaggaac gaggaccaac agcgcaactt tggatacaat 2700 attttaatat ggtttcactt gcaaaagaat ttttgagagc tgagcgtatg ggggactgga 2760 aagctcattt aagttgtgta aaagaaatgt taccttattt ccactcgtca ggacattttc 2820 catatgctaa atctgcacac ttgtatttac aagacatgat gcaattacag gactcgatgg 2880 atcctgaagt ttacgaaaaa tttaccgaag gatttttcac cgtgagacgc tcagataaat 2940 tgagttgtgg aacttctaca gacatggtta tcgaacagtc tatgatgaaa gccatgaaga 3000 cagatggagg tattgctcga ggaagaagta caaaacaaag tgtaatcagt aaatgggttt 3060 acagcatgca tgcaatgaac acagtttgtg agaaattaga agatcttgct aatgttagga 3120 tggatacgac tgaacagcat gttgatgcca gtgattcacg tgtaaaaaaa gacgctaaag 3180 acatacgaat gcttcttgag tggttctcga ttcatgatcc tttcccagag gttaacaaag 3240 tcatctcgat tgcaagtggt gtcgttggtg ataatacaat taattgctat aaagcacgtg 3300 aacttgggct tatctctatt gctaaaataa ctggactgac atttaaagat attaaactta 3360 aacgcgctga caaagttgtt cctcttttag ctatgagtag caccgtaaaa gttcacgaag 3420 aaaaagtacc tgttgaccca gtgttattat tccaacgtat gagtgttact gcggcttttc 3480 aagatgaaat tgagaaatat ttcgagtatg aattagctcc ttatccgtta acattatttg 3540 atgacattgg aatgcgcaag acccagaagt cggctattta cgattgtttt cagatnnnnn 3600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3660 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3720 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3780 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4020 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4080 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4140 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaat 4200 caatcggaaa tatattcatc aaacagttta tccaattttc ctaaatgtca aaatcatgta 4260 ttgtttctac acgcaataac tggctgcgat acgacatctg ctttttacag gagaggaaaa 4320 acaacagttt tcaaaatgtt tgaaaaaaaa gatttactac aatgtgctga agtttttaaa 4380 aatagtaatt caactcaaca acatgttatc accaacggag ttcaatttct tcttgctatg 4440 tacggagccc ccaaaaaaac cacttgctta gataagttgc gatacacatc tttcgtgaaa 4500 aatacgcgta ataaaaaaca agtcaactta gcttctcttc ctccaacctc agtagctgct 4560 catcaacatc tgtttcgggt atattaccaa gttcaagtgt ggcttggtaa tcagctggac 4620 cctaaagact ggggttggaa gttgatcgac aatacactag aattagtatc ctattagcat 4680 gtctcttatt gtccatattg ttcaataaca ggaaatcctt gtacttagct taagtttaga 4740 catcattata ttctgttatt acatccttta ttgtcaacat taagtagaat caggaaacct 4800 ttgtacttag cctaagccta tatattgttg taatccaata aagataagca gttgagtttc 4860 gctcttcaag ccgaacacac gcctccaaat cgctttttaa aaaccctctt gccaacgccc 4920 acaacctaac tcgagacagt cttaatattg atagatttgt caaaaaatgt tcgttgtgtg 4980 tttaatggga tgcacttaaa gaaaaaaaaa agcgtaaata ccattttcaa aatatccacc 5040 tggtgccaga aaaatggcct taacgctcca tttttactcc cctgctgctt ataagtacgt 5100 gcgtaaaatg ttcaacacat gcttaccaca ccctaatacg ttgtacgaat ggtatcagag 5160 ggttgatgcg gatcccggat tttgtacaga aactttaaac aggttagaag caaaggcaaa 5220 agaatttaag aaaaaaatat tatgcgccct tgttgccgat gaggtggcaa taaggagata 5280 aaaaatatgg acgggcaagc gttatgannn nnnnnnnaca gaaatcttgg acagttatga 5340 atcagacgac taacttttta ttttaagtat tttatttaat tacaccgccg ttgtttttgc 5400 ccactataat atataatatt attttgcaat agttttaaaa catataaaat ctcagcagac 5460 ccgtggaata ggcctactgg ggaaaccacg ttaaaaaaaa cgcgctttgt gtacctcata 5520 ggtggcgtgg aaacgtgtgc aaaatagagg gattacgact gatccaactt ttcggaacgt 5580 tatatctcca aaatggtggc ttttaggaaa aaaattattg atatattttt tgtagagaat 5640 tttgtgatct acaattttgt actggggtaa ttttgcgaaa aaactcaccg tttcgctaaa 5700 aatcgcaaaa aactcatttt ttgacctttg accccgtgcc aaatttttag cacacctcgg 5760 aacgatgagg acttttgaga ttttctttat aatgatgttt tgaacaacta tgccaatatt 5820 cagctttatg ggaatttatg taaacttgaa tgtgtaaata gaccggacta 5870 // ID Gypsy-593_AA-LTR repbase; DNA; INV; 267 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-593_AA_; KW Ty3_gypsy_Ele155; Gypsy-593_AA-I; Gypsy-593_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-267 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 267 BP; 85 A; 56 C; 66 G; 60 T; 0 other; tgtggtggat gcaccccctg tttacgccta tgtgcggcat gaaggaggtc gtggaatcga 60 acgatgagtg ttatgctttg taagcataaa catccagcgt tcgagaacga taacctaccg 120 taatgcacac gcagcgaaat gggctgtagc ataccacctc ggtagaatgc tagagttaag 180 taagaatgaa aaataaatga gaatgaaagt aaccatagac cggaccagtt cattggctat 240 cactcaccca gtcattggga cataaca 267 // ID Gypsy-111_AA-I repbase; DNA; INV; 5468 BP. XX AC supercont1.213; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-111_AA_; KW Gypsy-111_AA-LTR; Gypsy-111_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5468 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.213; Positions 726093 720626. XX CC Positions [1942-2370] - Reverse transcriptase CC Positions [4318-4794] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 163..2307 FT /product="Gypsy-111_AA-I_1p" FT /translation="MASPKKRTVVVKEENGRQQLSASETVFTKFVEQAKQE FT PLSPGIERRFSNVKVYDIMEEESDEGGEATVSVCSRGSTVTESSDRSRAHL FT RHSREIFLHPIEVERIIPSFNGKEGGCSRWIGKIEGYAEVYGWSPRARLHY FT AHSRLTGTARKWFEAQEEATVDWECFKTSICEAFPSRMNEADVHFKLAKRF FT RGKDEDEESYVYEMQKIAKEGRLSDEAIIAYIIRNVNDDKIQEYLISKDLR FT HVKDLIRCLQRYLRMRAVVNTKPYVKKDTITHPVSRINERKESTVFEDTNK FT QPQVRRCFNCNESGHKSAKCPLPQKKPRCTKCGKVGHLEKECLIATGIAPK FT QAFESNQIRFLQNDDQREDAGIEADPYYLEVDEGSRTPAKILFKKSCQDFN FT ALVDLGSPVSLIKVSKLPSDFDYSSNVPPHEIIGINGTRLMMLGSFRGLII FT VGSRVFVVSLIVVPDDTISKEVNVLLGRDFTNGNGITGIVFRRESVVSDYN FT PQLMIEAFGSRDDCLCVLIEDKEQIDLNVGDSQETTRLKTVVEKVFNDCYV FT NRPSPSQPLVKMEVEIKLKEHKPFTVTPARLSVFEKQELNKIIEEILDKDI FT IRKSQSNDYCRVVLVKKKNNTYRMCVNFKPLNKLVERDNFPMPVIEDLVIK FT LRDKRYFSSLDLKNGFYHVDIAENSKKYTSFVVDGGQYEFNKLPFGYANSP FT AAFVRYINKVLDELI" FT CDS 4120..5277 FT /product="Gypsy-111_AA-I_2p" FT /translation="MGASVIYRYHNDMGHFGADKVCAEIRRLFWFPEMKKK FT VLEHGKSCITCITFNPRDKKYDGYLNNIDKGTIPFDTVHVDHLGPLEITKS FT KNVHIFAVIDVFSKFIKLYATRSTKTYEVLKSLRSYISYYSKPRRIISDRG FT TAFTSKEFESFCVTRGIEHVLIATGCPKANGQIERYNRTLVPLLAKLVEER FT KTSWDNVLFEAEYLLNNSYNRSIANSPAVLLFGVRQRNHFESDLERFLENL FT NKEVDRDIDCIRSDAVEKIKQLQEYNKKKYDVKCKRNTVFKQGDLVAIRTV FT KVSGENSKLKAKYRGPYQIKRVLDNNRYIVTDLEGYQVTSTYFDGTFDPLN FT LRLYKKAATRDTEIFSSSCSSEDLSFKGYSEDESDFYGFPDEE" XX SQ Sequence 5468 BP; 1726 A; 1022 C; 1192 G; 1528 T; 0 other; gaagtgggat ttgaccgaaa aatgattgtg attagtacgc ggatgttcag tgtttaattt 60 gactaatccg tatcattcaa ttttccctga aaaacgaatt gtgaaggaat tgctgattcg 120 acagcacgca acattttcaa cgccatcttg tgtttcggaa acatggcgtc gccgaaaaaa 180 cgcaccgtcg tcgtgaagga ggaaaatggc cgacaacaat tgtctgcatc ggagacagtc 240 ttcaccaagt ttgtagagca agcaaagcaa gaaccgttgt ctccagggat tgaaagaaga 300 ttctcgaacg ttaaagtgta cgacataatg gaagaagaat ccgacgaagg aggtgaagca 360 accgtgagcg tatgctctcg tggatcaacc gtgactgaat ccagcgaccg tagtagagct 420 catttgcgtc attctcgtga aatatttctt catccgattg aggtggaacg aattattccg 480 tcgttcaacg gtaaagaagg aggttgcagt cgatggattg gtaagattga aggatatgcc 540 gaagtctatg gatggtcgcc gcgcgcccgt ttacactacg cccactcaag gttgaccggc 600 acagcgagaa aatggtttga ggcacaggaa gaagccacgg tggattggga atgtttcaag 660 acgtcaattt gtgaagcgtt tcctagtcgt atgaacgaag cagatgtgca cttcaaactg 720 gcgaagcgat tccgaggtaa ggatgaagac gaggaaagtt atgtgtacga gatgcagaaa 780 attgccaaag aaggccgatt gtcagacgaa gctatcatcg cgtacattat aaggaacgtg 840 aacgatgaca agatacagga gtatctgatc tcgaaggatt tacgtcatgt caaggatctc 900 atccgttgtc tacagcgtta tcttcgaatg agagctgtcg tcaatactaa accatatgtt 960 aagaaggata ccataacaca cccggtgtcc cgcatcaacg agcgcaagga gtctacagtc 1020 ttcgaagata ccaataagca acctcaagtc agacgctgtt tcaattgcaa tgagagtggt 1080 cataagtcag ccaaatgccc attgccacag aagaagccgc gatgcactaa atgtggaaag 1140 gttggacact tggagaaaga atgtttgatc gctacaggaa tcgcaccgaa acaagccttc 1200 gaatcaaatc aaatccggtt tttgcagaat gacgatcaac gggaagacgc tgggatcgaa 1260 gcggatccat attatttgga agtagacgaa ggaagccgga caccggcgaa gattttgttc 1320 aagaagtcat gtcaggattt taacgctctt gtcgatctag gatccccggt aagtttgatt 1380 aaggtttcta aactaccgtc tgattttgac tattcatcta acgtaccgcc acatgagatc 1440 attggcatca atggcacgcg tttaatgatg cttggatcat tccgtggttt gattattgtt 1500 gggagtcgcg ttttcgttgt ttctttgatc gtcgttccag acgataccat ttcgaaagaa 1560 gttaatgtcc tcttgggccg agacttcacg aacggcaatg gaataacggg tattgttttc 1620 aggagagaat ccgtagtgtc ggattataat ccacagctga tgattgaggc ttttggtagt 1680 cgcgatgact gtctttgcgt attgatcgag gataaagaac aaatagacct aaacgtagga 1740 gactctcagg aaacgacacg tttgaagaca gttgtagaaa aagtttttaa tgactgttat 1800 gtgaaccgac caagcccttc acaaccgtta gtaaagatgg aagtggaaat caaattgaag 1860 gaacataaac cttttaccgt aactccagcc agattgagtg tatttgaaaa acaggaatta 1920 aacaagataa tcgaggaaat tttggacaag gatattatta gaaaaagcca gtctaacgat 1980 tactgtcgcg tggtattggt taagaagaaa aataacacat accgaatgtg tgttaatttt 2040 aaaccgctta acaaattggt tgagcgcgat aatttcccaa tgcccgtcat tgaggatctc 2100 gttattaagc ttcgtgataa acgttatttc agtagtcttg atttgaaaaa cgggttttat 2160 cacgttgata ttgctgaaaa ctccaaaaag tatacttcat tcgtagttga cggtggccaa 2220 tacgaattta ataagcttcc attcggatat gccaattcac cagccgcgtt tgttcgctac 2280 attaacaagg tcttagatga gttgatctag tcgggtaaaa taacggtttt tatcgatgat 2340 attgttataa gtacagaaac tatcgaagaa cacatggacg ttttggcaga tgttcttcga 2400 actctaaacg ataatcatgt aaaactccag ttacagaagt gttcttttct taagacccga 2460 atcgactatg ttgggtacaa tataacgtac aatttgatac gtccaagcga tcgtcacatc 2520 gattcggttc aatctatccc tattccggtg aatgttcacg cgctgcatcg tttcgtaggg 2580 ctagtaagct acttccggaa atttattcaa gattttaata agttggctca tccgttatac 2640 gaattgttaa agaaagatgc cccgtttaaa ttcgaagaga agcacgttgt cgcgttcaat 2700 gcccttaagg ctagtttgat tcagaagccg gtgctcgcaa tatattcacc atctctcgaa 2760 acagaagttc acactgatgc ttctagcgct ggattcgcag gtattttgtt tcaacgccag 2820 gaagatgaca agaagtttca tccggttttg ttttatagtc gtaaaacaac tgccgcggaa 2880 gccaagctcc atagcttcga gttggagact ctcgctgttg tatactcttc agagatttcg 2940 catgtatttg ttcggaatga aattcgtaat cgtaacagat tgtgaggcta tgaaaaagac 3000 attagagaaa agagatgtga atgcaaaaat atctcgatgg agtctttatc tcgaacaatt 3060 tgattacact attgtacata ggccaggcga acgcatgcgt cacgtcgacg cgcttagccg 3120 tgttaacatg cttttgcttg agctagaaga acatccgaac gatgtgttca ccaactcgat 3180 ttacgtcgcg cagcttcaag atgaggatat ccaaaagatc aaagccgctg tacttgtagg 3240 cactcataga gactacgaag tcagggacaa catcgtctat aaaaaggaaa aaaagaataa 3300 actattactt tgcgtaccgt aaaatggggt gaagttgatc actttcgagc gattctgttc 3360 tataactcct agtagaattc atgtattatt atccaaatat tttttaaaca tgtactggtt 3420 ggatatctat cgaattatac aaacgaaatt ttgacgaaaa gttcactaaa ttacaaaaac 3480 atttatttta tatcaaaaag tgaagttttt gattccgggg tgaagttgat caattattgc 3540 gataagtttc ctgatataaa gagatatgct attcactaaa tttggggtca ctgaatctgg 3600 aacaagttcc aaaaatatta caactcaatg atatatcgat caattttgaa ttttaatgtt 3660 ggaaatttca gaacacacca cattaaacgc ctagaggtat gcaattttcc ttgaaattag 3720 caactcgcgc acaaaaaaat acatttttct cagaaaatga ttgtttatcc acaatgcaaa 3780 tttaaaactg gatttacaga acatatatgc aatgtaagaa atagtttatt taaatggtat 3840 ctttatatag ccttgtccat agtcgattgt ttggagaaga accattcaac agttcatcaa 3900 acatttaata aaaaaatctg tttttccatg gtataattat cttgaatgtc aaaccaaact 3960 taggaacatg aagagcagcc atcaggtaac acgacgtcac agaaattgtg ctaattgaaa 4020 tcaaatcaca gctctattga cagtttatac atgtgggtac atgaatgatc aacttctccc 4080 cactgatcaa ctacaccccg aattacggta ccgaaaaaca tgggagcgtc tgtgatctat 4140 cgataccata acgacatggg tcatttcggc gcagataagg tctgtgctga aatacgtcgt 4200 ctcttttggt tccctgaaat gaaaaagaag gtccttgaac atgggaagtc gtgcattaca 4260 tgtataacgt tcaatcctcg cgataagaaa tatgacggat atttgaataa catcgataag 4320 ggcactattc ctttcgatac cgtccacgtt gaccacttag gtccgctaga aataacgaaa 4380 agtaaaaacg tgcatatttt tgccgtaatt gatgtattca gtaagttcat taaactttac 4440 gcgacgcggt ccacaaaaac ttatgaagtt ttgaaatcgc tgcgatccta catttcttat 4500 tattcaaagc cacgaagaat catttcagac cgcggaactg cttttacgtc taaggaattc 4560 gaatcgtttt gtgtaacacg cgggatagag cacgtgttga ttgcgactgg atgtccaaag 4620 gcaaatggac aaattgagcg ttacaacaga acattagtac ctttgctagc taaattggta 4680 gaagaaagga aaacgtcttg ggacaatgtt ctgttcgaag ctgaatacct tctcaataac 4740 agttataatc gttcgatagc gaattctccg gcagttctgc tctttggtgt tagacagaga 4800 aaccactttg agtcagatct agagcgattt ttagagaatc tcaacaaaga ggttgatcgc 4860 gatattgact gtatcagaag cgatgctgta gagaagatta aacagctaca ggaatacaat 4920 aagaaaaagt acgacgttaa atgtaaacga aataccgttt ttaaacaagg cgatttagtt 4980 gcgatcagaa ctgtcaaagt atcaggcgag aatagtaagc tcaaggctaa gtatagaggt 5040 ccgtatcaga ttaagagagt attagataac aatcgataca tcgttactga cctagaaggg 5100 tatcaggtga caagtacgta cttcgacgga acttttgatc cacttaatct gcgactgtac 5160 aagaaggctg ccactcgcga tactgagatt ttttcatcaa gctgttcatc cgaagattta 5220 agtttcaaag ggtattctga agatgagtct gatttctacg ggtttcctga tgaggaatag 5280 tccagtgatg tcattgccgt tgcgggtgta aatcaaaatt cgcaattcat ttggcaacca 5340 tacttcagca atgaccgaac cactaaagtt tcacgcatta tattgattat ttttttgttt 5400 aagtcatcag acagtgatta gattcctaac tcaagctatc gagggcgata gcagcaggac 5460 ggccgagc 5468 // ID Homo9 repbase; DNA; INV; 2542 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo9 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo9. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2542 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 267..1583 FT /product="Homo9_1p" FT /translation="MYLFMYVHMYIYLYDKINCFRITTKTSFCWKHFAKID FT GNSAKCNICGKTIKTAGNTTNMKDHLRNVHKIREEDQTPVSNSGTPRQASI FT AASFQNITDFGGEGQKTKKINDAIVYMIAKDIQPFSIVENAGFIYLMNTVA FT PRYEIPSRYKVTKWLDEKYSQMKDLWRTRLTGKSISLTMDIWSDQMSMRSY FT LGITAHFLLEQEMSSLTIGAMQLSERHTGPYLSEMLEICCSDWNIDKKQVT FT AIVTDNGTNIKKAAEITFGPKIHLPCFAHTINLIARSAIQRESVNGCIAKT FT KSIVAFFNHSGVASDHLRKATDKVLIQDVPTRWNSTYYMIERFLEVKTQVN FT DILLSIPNDQMLLTGKEFELLNNVLTMLRPLEEATKIVSGDKYCTASTVIP FT IVSIIKDKLEKLSDNTQEAKDMKEFLLLEIERRMGSIEQVYFKYS" XX SQ Sequence 2542 BP; 864 A; 424 C; 489 G; 765 T; 0 other; tagtgatgtg aaaaaacatc gatgcatcga caatcgatgt ttttttggcg atgtttaccg 60 atgttttccg atgtttttat tgataccgat gttaagcgat gcaccgatgt ttttaagcga 120 tgcatcgatg ttttccaaaa cgtcgctggt ttgattttat acatattttg cgtatattgt 180 cgtgtgttgc ctgaaagtgc agttatacaa acttaagcag caaatattgc aatgccgaag 240 taagtaaata tagtaacatt acttatatgt atttatttat gtatgtacat atgtacatat 300 atttatacga caaaatcaat tgttttagaa taacaacaaa gacgagtttt tgctggaagc 360 atttcgccaa aattgatggc aatagtgcga aatgtaatat ttgcggcaaa acaatcaaaa 420 ctgcagggaa cactacgaat atgaaagatc atctacgcaa tgtgcataaa ataagggaag 480 aagaccagac acctgtgtct aacagtggga cgccacgaca agcttcaata gctgcaagct 540 tccaaaatat cacggatttt ggaggagaag gacaaaaaac aaaaaaaata aatgatgcaa 600 ttgtttacat gatcgcaaag gacattcagc cgttttcaat tgttgaaaat gctggattca 660 tttatttgat gaatactgtc gcacctcgtt atgaaattcc ttcgcgttat aaagtaacca 720 aatggctcga tgagaagtac agtcaaatga aagatctgtg gagaacaagg ctgactggaa 780 agagcatatc cttaacaatg gacatatgga gtgaccaaat gtcaatgcgc agctatttag 840 ggatcactgc tcattttctc ttggagcagg aaatgtcttc attgacaatt ggagctatgc 900 agttaagcga gcgtcacaca ggaccatatc tgagcgaaat gctggaaatc tgttgctcag 960 attggaacat tgacaaaaag caagtaacag ctattgtcac agacaatggt acaaatatta 1020 aaaaagctgc tgagataacg tttgggccca aaattcacct gccctgcttt gcgcacacga 1080 ttaatcttat tgctcgctcg gccatccaaa gggaatccgt aaatggatgt attgccaaaa 1140 ctaaatcaat cgttgctttt tttaaccata gtggtgtggc cagcgaccat ttaaggaaag 1200 caacagataa agtacttatt caggatgttc caacgagatg gaacagtaca tattatatga 1260 tagagcgctt tttggaggtc aaaacgcaag tgaacgacat tttactgagt attccaaatg 1320 accaaatgtt gcttaccgga aaggaatttg agctgttaaa caatgtgctt acaatgctac 1380 gcccccttga ggaagcaacg aaaattgtca gcggagacaa atattgtacg gccagtacgg 1440 tcatacccat tgtcagcatc atcaaagaca aattagaaaa attaagtgac aacacacaag 1500 aagcgaagga tatgaaggag ttcctgcttc tcgaaataga aaggcgaatg ggatcgattg 1560 agcaggtata tttcaaatac tcttaattgg tagtttcgtt aaatatatat ttgtaatgta 1620 ctgttttagg tatcgattct agcaatggcc acgttgcttg atactagatt taaaaaatta 1680 cattttaaag atccagttgc atgtgccaat gcactgatga aaattaaatc tatgtttaaa 1740 gaaaatgaaa gcaaagaggt cataccaccg acaacaggga actcggattc tttttggaac 1800 caccatcacc agctcgcgca gtgtcacaac ccgaatgtgg atatggacgt agaaataagc 1860 tcttatctac gaataccgct tacttccttc gaaagtaatc caataactgt ttgggaaagc 1920 atgaaatgta catatccgaa tttgtacaaa attgctatgc aatttttgcc ggtgatggga 1980 tcatccgttc cctctgaacg ggttttctca gcggcttcaa atattttgtg ccgtaaaaga 2040 aacagaatga cacctgagcg actaagtcgt attttgttac tgcaaagcat tgacaaaaaa 2100 tatttttttt aattgtttct ttttaatatg tagtttataa gtaacattga aagataaaaa 2160 aactgctatg aaagcatacg tttttttttt ttttttgttt gtttaataat ttttaattaa 2220 ctgctatgaa agcatacgta ttattttttt ttttttttgt ttgttgttta ataattttta 2280 agataagata agtctcatga atatgaaaag attgctataa gtttttcatt tttaagttaa 2340 aaataattta agtgaaataa taacgctatg aaagcaccta tgaagaagct ttaaaacccg 2400 aaaaaaacgt ttcttaatag aatgggaaaa tacatcgatg catcgaagtt ttgacccgat 2460 gtataaacca aacatcgatg cttgacaacg aaaacatcta tgggcatcga catcgatgtt 2520 tttactcaaa tttacatcac ta 2542 // ID hAT-8_SM repbase; DNA; INV; 3184 BP. XX AC . XX DT 11-OCT-2007 (Rel. 12.1, Created) DT 00-0000 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-8_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3184 RA Jurka J. and Obukhanych T.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1036-1036 (2007). XX DR [1] (Consensus) XX CC Although this DNA transposon has the characteristics of a hAT CC element, such as short ~12 bp TIRs and 8 bp TSDs, the protein it CC encodes bears no similarity to other hAT-encoded transposases, CC and is most closely related to hAT-7_SM. A similar protein of CC unknown function is encoded in the genome of Cotesia congregata CC bracovirus, a polydnavirus from a parasitoid wasp. XX FH Key Location/Qualifiers FT CDS 508..2769 FT /product="hAT-8_SM_1p" FT /translation="MTTATRSSAAVWLVGKGTKLLSSSRLSTNGDVLRCLL FT FHHQEEHLNVSESIHRTITELLEIWKKARIPTQRVDSGERKLRKLYDVYQL FT LKKNRTTSLESCRIKEQSFKDDLQKLFDIAPRDVMEIMTNVEDKQFLAMQR FT IDVNSSSMAGVDRKLLEKEARKRAREQASDRRKLNEAEKSQQLEAHGDLAE FT ALSSTPSISEGSDSNTGDDTDTDFLPSSVDEAQSIPKTKRRKSAKNIISSP FT EVAGALDRVNLPDRGAMFVVASVAKALGHPLEDLVLSRSTIRRSRMATRKL FT LTEADKDSFSIECPLLLHWDGKLLPDIAGAEETVDRIAVIVTGNGIEKLLA FT VPKIGRGTGNEQAAACITVLDEWKLRDVVQGLVFDTTSSNTGIHKGACVLI FT EEALGRELVNIGCRHHVLEVVLSSVFTVLFGGTGGPEVGLFKRFKKKWPYI FT KQNDYSPAKDELFNSATEILRKEMVTFYTNAITHQQPREDYLELLRLCLVF FT LGGGSAGTDITFRAPGAMHHARWMAKAIYSLKMVLFQGQMTLTARELKGLT FT ELALFVALVYGRFWHEAPLASHAPLNDAQMLRLLQTYPNRTVADAAFSALC FT RHLWFFSEHLVGLAFFDDRVESHVKKAMVANLQLPKTRSALHRVDSSDADF FT NNLETFVTERTNRLFELLSATGVEESRSFLLKEPEAWEADASYQKLYERVK FT MMKVVNDSAERGIALIEKYNQSLTKDEDQKQFLLRFVQNHRQQFPSSSKAE FT LAT" XX SQ Sequence 3184 BP; 921 A; 648 C; 722 G; 893 T; 0 other; tagggcggtc cacagctata tgaaaaaaat aagtcttcct aaattctctc cacaagccta 60 actttgttcc actgggcatg tggagtagtc tggcaaaaaa tggaccattt gaagctgttt 120 taggggttgc tatgattcaa caattttggc gatataattg ctttttgaaa tattttgtcc 180 cttagccaat agccgtctca ttattatatt aaatttactt acttaatcag catctgtaat 240 tatataaaat ttgagcaaat aaatattgtt tgagtgttat agtttgtcaa cagaatttat 300 gtctgctatt ttggtgagtt aattttgaac tattatgtta ttttgttgaa gtaattacag 360 aaaaaagtga acatttgtta caagcttaca caagtaataa tcatagccaa tgcattgttt 420 ctagctacca tgtaaaattg tattgacttg tgtgattgca atgcttattt atattatata 480 attttcagat tcactacagt aaagtagatg actactgcta cacgaagttc agcagccgtg 540 tggctcgtgg ggaagggcac taagttgtta tcttcatcca gattgtcaac aaatggtgat 600 gtcctccgtt gtctcctctt tcatcaccag gaagagcatc tcaatgtgag tgaaagtatt 660 caccgtacca tcactgagct tttggagata tggaagaaag ctagaatccc aacacagcga 720 gtcgattccg gagaaagaaa actcagaaag ctctatgatg tatatcagct actgaagaag 780 aacaggacaa catcactgga aagctgcagg attaaagaac aatcattcaa ggatgatctt 840 cagaaacttt ttgacattgc acctagggat gtgatggaga taatgaccaa tgttgaggat 900 aagcaatttc ttgccatgca gaggatagat gttaacagca gcagtatggc gggagtggat 960 cgcaaactgt tagagaagga agcgagaaaa agagcgcgtg agcaagccag tgacaggcga 1020 aagctcaatg aggcggaaaa gtcacagcaa ctagaagcac atggtgactt agctgaggcc 1080 ctaagcagca ctccatccat atctgaaggc agtgacagta acacgggtga tgacactgac 1140 acggacttcc ttccatcttc tgtcgatgaa gcacaatcta tcccaaagac taagcgccgt 1200 aagtcagcaa agaacataat atcaagccca gaagttgctg gtgctctgga tagggtcaat 1260 cttcctgatc gtggtgctat gtttgtggtg gcatcagtag ccaaggcctt aggacatcct 1320 ttggaggatc tggtgctgtc tcggagcacg atcaggagat ctagaatggc cactcgcaaa 1380 ctgttgactg aggcagacaa agacagtttt tccatagaat gtcctctcct tctacattgg 1440 gatggtaaat tgcttcctga catcgctggt gcagaagaaa ccgtggaccg aatcgctgtg 1500 atcgtgaccg gtaacggaat tgagaagctg ctggcggttc ctaaaattgg aagaggaact 1560 gggaatgaac aagctgctgc ttgcataaca gttctagatg aatggaaact acgcgatgta 1620 gtgcagggtc tggtgtttga taccacatcg tccaacactg gcattcacaa gggagcttgt 1680 gtcctcatcg aggaagccct tggtcgtgag cttgtcaata tcggctgtcg tcatcacgtc 1740 ttggaagtcg tccttagcag cgtattcaca gttttgtttg gtggaactgg agggcctgaa 1800 gtaggcctgt tcaagaggtt taagaagaaa tggccatata ttaaacaaaa cgattactcg 1860 cctgctaaag atgaactctt caacagtgct actgaaatcc tccggaaaga aatggtgaca 1920 ttctacacta atgccattac tcatcagcag ccaagagagg attacctcga actccttcga 1980 ttgtgcttgg tcttccttgg aggaggctca gctggaacgg acatcacatt tcgggctcct 2040 ggtgcaatgc accacgcaag gtggatggcc aaggccatct attctctgaa gatggtgctg 2100 tttcaaggcc aaatgacttt gactgccaga gagcttaaag gactgactga acttgctctc 2160 ttcgtcgctc ttgtttatgg tcgcttctgg catgaggctc cactggcctc tcatgctcct 2220 ttgaatgatg cacagatgct tagacttctt caaacatatc caaaccgaac cgttgcagat 2280 gccgcatttt ctgccctttg tcgtcatctt tggttcttct cggaacacct tgtgggtctg 2340 gcatttttcg acgatcgagt agagtcacat gtgaagaagg ccatggtagc caacctccaa 2400 cttcctaaga ctcgttcagc actgcatagg gtggactcca gtgatgctga cttcaacaat 2460 ttggagacat tcgtgaccga gagaaccaat cgtctcttcg aactgttaag cgcaacaggg 2520 gtagaggagt ccaggagctt cttgttgaag gaaccagaag catgggaggc tgatgcatct 2580 taccagaagt tgtatgagag agtcaaaatg atgaaagtgg taaatgacag cgcagaacgt 2640 gggatcgctc tcatcgaaaa atacaaccag tctctcacga aggacgaaga tcaaaagcag 2700 ttcctgcttc gttttgtcca aaaccaccgc cagcagttcc caagttcttc taaagctgaa 2760 ttagcaacct agcagagtag caattggaca gggactaatg gacttcgttc gctgctcatt 2820 gaacaatttg ccgaaaaaca atctacatgt caaactaatt tttttctcaa agttatagtg 2880 gacactggtt ggactagata gtactgagca actaaacatc tattatttat cattctagac 2940 atgcttgtaa ctgtgactta aatgtttcta tgttatattg tgaaatgttt gaaacaaaga 3000 atagcattct gaatagtgga ttttcgtact ttttcacatt ttttttaaaa gtgacaattc 3060 atagcgaccc ctaaatattg ctcgatttgc ctcaaatttt tttgtcagtc cactacacca 3120 catggaacaa agttagcctt gtggagacat gattttgaat acttcttatt tttggaccgc 3180 ccta 3184 // ID Gypsy15-NVi_LTR repbase; DNA; INV; 1404 BP. XX AC . XX DT 02-MAR-2008 (Rel. 13.03, Created) DT 31-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy15-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1404 RA Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 8(3), 248-248 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX SQ Sequence 1404 BP; 370 A; 339 C; 349 G; 344 T; 2 other; tgtgacggcc gagtgcacgt caagtgtcta accgtcaatc atattataat tgtaataaga 60 yttatgtttc agcaagctgc caacgagcgc cgaaacgagc gagaacgaga aacgaggcaa 120 gatagagaca gagagagtga gagcataggt agcgcggttg ccgcctggcc gcacagcgtt 180 ggcgagatat gacaacgatg tgccgacggc cgggagagct accgcccgcg tgccgcgcgc 240 tggcgactag agttagtctc gctgcgctat ccacacgagt agctcgcgct rctatatacg 300 cgggtttgcc cttgtgagct cctgcgagca tctcgagagc gaacgatatt cgtccgtgcg 360 ttaccgacga gtcgctgcga cgcccgtccg aactgttgca catctccacg tccgagactg 420 gaccacgtcc gttgcgttca agtactactg ccactgaccg agtctgtaag tacctgcact 480 ctcactctct cacgtatcga cagggtaata tctctcgtct ctctttatcg ttcgtcgaag 540 tgtgcggcgt ttgcacctcc gtgtatagca ttacaattga atatacattt ccttatctaa 600 tacttctcgt gtgttatttc ctacgacccg acctacgccg cctccgcggg ctcgtgtaaa 660 aacacgtgca tgatctgagt cggccgagcc gtgccttctg cacgcccgac gttaataatt 720 acaataaaca gggcccttct ggccctggtt ataacacggc agaaaagata ttaaccgagt 780 gttataaatg gcgccccaaa acgtgtgtct aggttaagct aaataactcg cgagtgtacg 840 aaaaattgtg tagcagctac tcgtgctaca gtgtcgacac attgtaagcg tacagtgaga 900 ctcggaaaaa ccgaccgcaa cggtaaaaag caatagacag tgtcgaataa atcgcacagt 960 ttagcttcgt agcggcaggc atcagccttc gaaattcgaa acgaccaacg gccgagcggc 1020 cagcgacagc gagcgtacat tgtttacatc gcgcgccgcg aaccgagagc tcggatatac 1080 aaacagtgtc gtccgttagg cagcgaattt tcggctgagt gcgagtgaga cggaggaact 1140 acagtcggta taggatattc agattattgt atgaattttc gagaattctt tttgcacttt 1200 gatgcaattc ttgaacccga ctacagttta aaccgtaatc tcgcagtcct caataaattg 1260 ttgtgtttat taataacgtg taaggcataa atcgcgataa taatttggaa tatttcgtga 1320 gattttagcc tggttttgtg agaaatgtag ctgtgtagtt gagtacaggg acgtacaagt 1380 aaacatatat atagtcaaca caca 1404 // ID I_Ele35 repbase; DNA; INV; 6543 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele35. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6543 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6543 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 5 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 336..1775 FT /product="I_Ele35_1p" FT /translation="MMPVSATSPRFRDPGGPGGGGPSNMFRGDYAGQLVPF FT HMDNDGCSGAIQVLQMKATEGKLPCDPFLLRFSIEKWIGKAIDGAFKENQG FT ISYALKVRNTAHFQKLLKMDQLADGTKIQIMEHPTLNKRECVVSNYDCVDL FT SDEYLLEQLSDQGVRGIRRIKKKNPDGKWVNSPTIILTISGTVIPDHIDFG FT WSRCRTRNYYPAPMLCYKCWEYGHTKKRCSQPHETCGVCSQVHDAGSNMDP FT ASRINESTVETSRSPPCTNSAFCKLCKSNDHPVSSRKCPVYLKEVQIQHIR FT IDKGYTYPQARREFESMQENHNSNRDSFAGIAAHSKDKEISDLSDTVRRLI FT EDSRKKDNRIAELEKSLQGRSTTYRMDQVKQHGTIEDLIVKVDQLSRNVTE FT LQDIIKKKDLEIEYLRKIYVNPTESQLNSTLNVTIPETQDMTTDSTSQIDA FT YDSIPDNKLTSQCLDWINQSGKNPKKKKKSSKNGP" FT CDS 2353..6339 FT /product="I_Ele35_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDSVGVMFGHSTSSSPSNSEQTQRRYPTRTRREPLRF FT QSETSVFQPLGPEATNADLYDRLHARRYGNSGTNQPGTSQHYPSLRTNQYS FT PAPVLSKSPIAVPPPNEVLSRSNAKKTPYASRCCKQNSCFALQWNMNGFFN FT NLPDLQLLVRERKPAILALQEIHKASISWMETSLSGGYKWITMCCENFYHS FT VALGISAEIPFSQIDLDTELPIVAARISWPFPVSVVSFYLPNGTFSNLENQ FT IRQILAQIPEPVILLGDCNGHHRAWGSKANNTRGIMMVEVANELNLTFLND FT GSTTFISGQRETTVDVTLTSASISHVLQWAADIDPLGSDHVPITLFLNDSP FT PETSRRPRWLYDQADWSSFQNLIEVSLDHTTPDSIEEFVELIHDSARSTIP FT KTSPTPGRRALPWWSNETKKAVKLRRKALRAAKRLPAGHPNKDEATKKYRE FT ARNECRQTIRKAKECSWISFLDSINDNQSTADLWKRLNAILGKRRAKGIAL FT KIDSCTTRNPAIIADSLADYFADLCSLQRYTTVFLNRIPTPAEAIKNFVVP FT PDRNQQFNEPFSINELALVLQQGKGKSSGPDDIGYPMLKRLPVRCKSILLG FT LINREWTNGTFPNIWKHSLVVPIPKNSGIASETKHYRPIALTSCLCKIMER FT LVNRRLIQYLQENNRLDHRQHAFRPGFGTGTYFAALGQILDEAMARGDHIE FT IASLDLSKAYNRAWTPNVLQKLAQWGVTGNMLTFIKNFLSDRSFQVMIGNH FT RSKTVPEETGVPQGSVIAVTLFLVAMNGVFQILPKGIYILVYADDIILISI FT GKTPKNTRRRLQTAVNAVTKWTDLSGFEISAEKSARLHICSSSHIPPRNPI FT TIRGTPVPTKKTMKILGVLFDRSFSFRQHFTNLREDCKSRINFLKTISNKR FT TKSDRISRIRVADAVINSRIFYGIEITCRSFELLVQNMSPTYNEAIRAISG FT LLPSTPAIAACIEIGTLPLVHRTVIAVGSRAISYLERTRGDGSVVFTVREA FT NRILESVAGVKIPPVAGVHWNGSSRWQSRLPKIDFKIKGNFRAGCNPTILR FT SAFQERISTKYTQHTIRYTDGSKALGRVGIGIYGDGISESHQLPDKCSVFS FT AEAAAILQAVSHPYDGSLLVVTDSASALLALASPVSRHPWIQATQIAVAES FT NNITFTWVPGHCNITGNESADHLANFGRTCRLRTRKTPGEDLKKWLKSTVK FT IAWASEWSQSRDLFIRKVKGVTTPWEDVPNRKDQVVLSRLRTGHTRVSHSM FT GGEPEFRKKCLLCNIHNSVEHFICNCPAFDHLRQEYGIGSIRSALQNDRAS FT ETALICYLKEAGLYQDI" XX SQ Sequence 6543 BP; 1999 A; 1588 C; 1375 G; 1578 T; 3 other; cagttgccgg ctcacattgg tcgagtttgg tcacgttttg ctttcgctcc gtagcctctc 60 ggttatagaa aggtctattg ttcgattgtt tttttttgct gcttttattg atcagccgaa 120 acctttgttt cctgctttgt gtttcgatca cacccggtgc gttatcgtcg cgatatagtt 180 ttcttggttg tggcccggct ggttgaccag cagactgtcc agtcgttttc ggagaggtgt 240 attttaaact tgtgcgcgtg cgtttcagtg ttccgtttca tatttccact gtgcgaaagc 300 attaatccgt gattgtgaat ccccgcgccg ccggcatgat gccggtgtcc gcgacctccc 360 ctcgtttcag ggatccaggg ggccctgggg gtggaggtcc aagcaacatg ttccgtggtg 420 attatgctgg acaactcgtt ccgtttcaca tggacaacga tggctgctcc ggtgctattc 480 aagttcttca aatgaaggca accgaaggta aactaccatg tgaccccttt ctcctgcggt 540 tttctattga aaagtggatc ggtaaggcaa tcgatggtgc cttcaaagaa aaccagggta 600 tctcmtacgc actgaaggta cgaaatacag cccatttcca gaagctgttg aaaatggatc 660 agcttgcgga cggaactaaa atccaaatta tggagcatcc gacattaaac aagcgggaat 720 gtgtggtaag taactacgac tgtgttgatt taagcgatga atacctcctg gagcaactaa 780 gtgatcaagg cgtacgtggt atccgccgta tcaaaaagaa gaatcccgat ggcaaatggg 840 tcaacagtcc aacaataatc ttgacaatct ccgggacagt gatacccgat cacatcgact 900 ttggctggtc acgctgtaga acacgaaatt attatccggc tcccatgctt tgttacaaat 960 gctgggagta cggccacacc aagaaacgtt gttcccaacc ccatgaaact tgcggtgttt 1020 gctcccaagt acacgatgca ggcagcaaca tggacccagc ctcccggatc aatgaatcta 1080 cggttgaaac aagcagatct ccaccatgca ctaattcggc attctgcaaa ctatgtaagt 1140 cgaatgacca cccggtgtcg agccggaagt gtccggtata cttaaaagag gtacagattc 1200 aacacattcg aatcgataaa ggatacactt acccacaggc caggagggaa tttgaatcga 1260 tgcaggaaaa ccataattcc aaccgtgatt ctttcgccgg aatcgcagcc cacagtaaag 1320 acaaggaaat atccgacctc tccgatactg tacgacgatt aatcgaagat tcgagaaaaa 1380 aggacaacag gattgccgaa ctggaaaaat cgttgcaagg gcgcagcact acttatcgaa 1440 tggaccaagt caaacagcat ggaaccatcg aagatctgat cgtcaaagta gaccagctgt 1500 ccagaaatgt taccgaattg caagatataa taaaaaagaa agacctggaa atcgaatacc 1560 tgagaaaaat ctatgtaaat cccaccgaaa gtcagttaaa ctctacatta aacgtgacta 1620 ttcctgaaac acaagacatg acgaccgatt ccacatctca aattgatgca tacgattcca 1680 ttccagacaa caaattgacc tcgcagtgcc ttgactggat aaaccagtct ggaaaaaatc 1740 cgaaaaagaa aaaaaaatcg tcaaagaatg gaccttgaca atggcatgga aatttcatcc 1800 gacgatagca tagaaagtaa taagtcaatt acatccgtca aaacgaccac atctatgaat 1860 ggaccatcta agagaaacca cgtcgactgt ggccacggca gcgactccag caagccttcg 1920 aaacccaaam gaagcgaccg tggaggccag agaaccacta atcgactctg aaaaccttac 1980 ctctccgatc agtcatcccc gaaaaaaaaa aaaaaaaact tcatcacctg gtagtacgcc 2040 ctawgcacta ttttcaaccc aaagaagaga atcagcgaaa agtcttcaac tagggtgact 2100 gatcatgtac aaaacgaagc agttacgaac caggatccaa ccgacgacaa caacattcac 2160 acggcagagg ctctgggtag tcggggcccc gtcagtgcgg acgctacacc cctcccggta 2220 ctggcggaca acctccggca tccagcagcc agctgttttc gcgggacgca caagggttcc 2280 caaacccgtt ccccgccgtt cccaagcaca tccggaagag agaaagacaa ggaagttcac 2340 accaattccc caatggacag cgttggagtc atgtttggac actccacttc atcttcccca 2400 tcgaacagcg agcaaaccca gagaagatac cctacccgca cgcgtcgtga accactccga 2460 ttccaatcgg aaaccagtgt tttccaacca ctgggcccgg aggccaccaa tgcggattta 2520 tatgatagac tgcatgcccg aagatatggt aattcaggta caaatcaacc aggtacgtca 2580 caacattacc cctctcttcg aacaaatcag tattccccag cccctgtatt aagtaagagt 2640 cccatagcag tgcctcctcc caatgaagtt ctatccaggt ccaacgctaa gaaaacacca 2700 tacgcttctc gttgctgcaa gcaaaattct tgttttgctc tacaatggaa catgaatggg 2760 ttttttaata acctacccga tctccagctg ctcgttcggg agcgaaaacc agcaattcta 2820 gcgttacagg aaattcacaa agcctcaatc agttggatgg agacctctct ttccggagga 2880 tacaaatgga tcaccatgtg ctgtgaaaac ttctatcatt ccgttgcact cggcatctca 2940 gcagaaatcc cattctctca gatcgatctc gataccgaac ttcccattgt agcagctcga 3000 atttcttggc ccttccctgt ctcagttgta tcgttctatc ttccaaatgg cacattttca 3060 aatttggaaa atcaaatccg tcagattcta gcgcagatac cagaaccagt tatcctatta 3120 ggggactgta acggacacca tcgtgcatgg ggaagcaagg caaacaacac tcgtgggata 3180 atgatggtcg aagtagcaaa tgagctcaac cttacatttt tgaacgatgg gtccactacc 3240 ttcattagtg ggcaaagaga aaccactgtg gacgtcacac taacgtcagc ttcaatttca 3300 cacgtgctac aatgggctgc ggatattgat ccactaggaa gtgatcatgt accaattaca 3360 ttgttcttaa atgacagccc tcccgagact tcacgtcgac ctcgctggtt gtatgaccaa 3420 gcagattggt catctttcca aaatttaatt gaagttagcc tagatcatac aacaccagac 3480 tcgatcgaag aatttgttga acttatccat gactccgcaa gatcaacaat ccccaaaacg 3540 agccccaccc ctggacgtcg agctcttccg tggtggtcaa atgaaaccaa aaaggcggtt 3600 aaactaagaa gaaaagctct tcgtgcagcg aaacgattgc cagctggcca tcccaacaag 3660 gacgaagcta ccaaaaagta tagggaggct agaaatgaat gtcgccaaac catccggaag 3720 gcaaaagaat gcagctggat tagcttctta gatagtatca acgacaatca atctacggcc 3780 gatttatgga aaaggttgaa tgcaatccta ggtaaacgac gagcgaaagg gatcgccctc 3840 aagattgaca gctgtactac gcgaaatccg gctataatcg ccgattcctt agcggattat 3900 tttgctgatc tctgttctct tcagcgatat acaactgttt tcctcaaccg tataccaaca 3960 ccagctgaag ctatcaaaaa ctttgtggtt ccaccggaca ggaaccaaca atttaatgag 4020 ccattttcga tcaatgaact ggctcttgtt cttcaacaag gtaaaggaaa atcatcggga 4080 ccggatgaca taggctatcc gatgctaaag cgacttccgg tacgttgcaa atccattttg 4140 ctaggcctaa ttaacagaga atggacaaat ggcacctttc ccaacatctg gaaacacagc 4200 ttggttgttc caatcccaaa aaattctgga atagcaagtg agacgaaaca ctaccgccca 4260 atcgccctca ccagctgttt atgtaaaata atggagagac tggtgaacag aaggcttata 4320 cagtatctac aagaaaataa tagactagat catcggcaac atgctttccg gcctggattt 4380 ggtacaggta cgtactttgc ggctttggga cagattttgg acgaagcaat ggctcgcggg 4440 gatcacatcg aaatcgcctc tcttgaccta tcgaaggcat ataatcgagc atggacacca 4500 aacgtgcttc agaaactagc acaatggggc gttaccggca atatgctaac atttattaag 4560 aattttttaa gtgatagatc attccaagta atgataggaa atcaccgatc gaaaacagta 4620 cccgaagaga caggcgttcc ccagggctct gtcattgcgg taaccctctt cttagtggcc 4680 atgaatgggg tcttccaaat tcttccaaaa gggatatata tactagtata tgcagatgat 4740 atcattctaa tatcaatcgg taaaactcct aaaaatacta gacgtaggct tcagacagcg 4800 gttaatgctg tgacaaaatg gactgaccta tctggatttg aaatttcagc ggaaaagagc 4860 gcaagactgc acatatgctc aagtagtcac atacccccac gaaatccgat aaccatcaga 4920 ggtaccccag taccaaccaa aaaaacaatg aagatactag gagttctatt tgatcgtagt 4980 ttcagtttcc ggcaacattt tactaatctc cgtgaagatt gtaaaagtag aatcaacttt 5040 ttgaaaacca tctctaacaa acgcacgaaa agtgacagga tttcacgaat tagagtcgca 5100 gatgctgtga tcaatagtcg tattttctat gggatcgaaa taacctgccg ttcatttgag 5160 cttctagtcc aaaacatgag tcccacctat aacgaagcaa taagagctat atccggctta 5220 ctcccatcaa ctccggccat tgcagcatgt atcgaaattg gtacccttcc actagtgcac 5280 cgaactgtaa ttgcagtagg tagcagagca atcagttact tagaaagaac tcggggtgat 5340 ggatcggtgg ttttcaccgt tcgtgaagct aaccgaatcc tagaatcggt tgctggcgta 5400 aaaatcccac cagtggctgg ggtccactgg aacggatcta gtcgctggca gtctagactt 5460 ccaaaaattg attttaaaat caaaggaaat tttcgtgctg gttgcaatcc caccatttta 5520 cggtccgctt tccaagagag aatttcaacg aaatacacac aacataccat ccgatacacc 5580 gatggttcta aagcccttgg tagagtcgga attggtatct acggtgacgg catctcggaa 5640 tctcatcaat taccggacaa atgttcagta ttttcagctg aagcagcagc gatactacag 5700 gcggtatcgc acccatatga tggatcgctc ttagttgtca cagattcagc tagtgcactt 5760 ttagctctgg cctcaccagt ttcgcgtcac ccttggattc aggccacgca aatcgctgtt 5820 gccgaatcaa acaatattac atttacctgg gtacctggac actgcaatat tactgggaac 5880 gaatcagcgg atcatctagc aaactttggc aggacatgca gattaaggac ccgtaagacc 5940 cctggagaag accttaaaaa atggctcaaa tcaactgtca aaatagcttg ggcctcggaa 6000 tggtcccagt ccagagatct ctttattaga aaagtgaagg gtgtaacaac accctgggaa 6060 gatgttccaa atcgcaagga tcaagtggtt ctttctcggc tccgaacagg acacaccaga 6120 gtatctcaca gcatgggagg cgaaccagaa ttccgaaaga aatgcttatt atgtaacatt 6180 cataactcgg tagaacactt tatttgcaat tgtccagctt tcgatcatct tcgtcaggaa 6240 tacggcatag gaagcattcg ttccgccctt caaaatgacc gtgctagcga aacagcactc 6300 atttgttacc ttaaagaggc aggcctttat caagatatat aactacaacc aaacaaagct 6360 aaacactgaa agatcgaata atattaatat taacattgta ttaaacaaac tctgtcaaac 6420 tctgtaactt ctataaactc tgtaaattcc tggagtgccc ttatggtacc ccaccttttt 6480 cctcgagacg aaccagccaa gggctgaaag tctcgttaat aaagacaata ataataataa 6540 taa 6543 // ID BR1_CP repbase; DNA; INV; 286 BP. XX AC X06431; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Chironomus pallidivittatus BR1 gamma ( Balbiani ring) gene pCp20 DE repeat constant region. XX KW Satellite; Simple Repeat; BR1_CP; CPBR1; Repetitive sequence. XX OS Chironomus pallidivittatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RP 1-286 RA Wieslander L.; RT "BR1_CP."; RL Direct Submission to Genbank (21-DEC-1987)Wieslander L., Dep. of RL Molecular Genetics, Karolinska Institutet, 10401 Stockholm, RL Sweden.. XX RN [2] RP 1-286 RA Lendahl U., Saiga H., Hoeoeg C., Edstroem E.J. and Wieslander L.; RT "Rabid and Concerted Evolution of Repeat Units in a Balbiani Ring RT Gene."; RL Genetics 117, 43-49 (1987). XX DR GenBank; X06431; Positions 1 286. XX SQ Sequence 286 BP; 123 A; 68 C; 61 G; 34 T; 0 other; accaagcaaa tcaggaccta gaccaagcca atcaggacct agaccaagca aatcaggacc 60 taaaccaagc acatcaggac caagaccaag caaatcagga ccaagaccaa gcaaatcagg 120 accaagacca agcaaatcag gacctagacc agagaaatgt ggtagtgcaa tgagaaaggc 180 tgaagctgaa aaatgtgcca gaagaaaggg tagattcaat gcaaataaat gcagatgtac 240 ctcagctggt aaaccaagca aatcaggacc tagaccaagc aaatca 286 // ID MOGWAI1_NA_EI repbase; DNA; INV; 80 BP. XX AC MOGWAI1NA_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE mMogwai-Ei1 (MOGWAI1NA_EI), a nonautonomous MITE related to DE Mogwai DNA transposons from the single-celled eukaryotic DE reptilian parasite Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Tc1-Mariner; MITE; mMogwai-Ei1; KW MOGWAI1_NA_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-80 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR [1] (Consensus) XX CC mMogwai-Ei1 is a consensus sequence reconstructed from multiple CC multiple members of a small family of MITEs (~15 copies) CC identified in the genome of E. invadens. mMogwai elements are CC homogeneous in size (80-bp) and sequence and essentially consists CC of two abutted 38-bp TIRs flanked by a TA putative TSD. The first CC 29-bp of the TIRs are identical to those of Mogwai-Ei1, CC suggesting that mMogwai are nonautonomous members of the Mogwai CC clade of transposons. XX SQ Sequence 80 BP; 33 A; 6 C; 7 G; 34 T; 0 other; tattcctttt tgatttaaaa aatgaataaa atcatacaaa tttgagtgtt tttattcatt 60 ttttaaatca aaaaggaata 80 // ID LOA repbase; DNA; INV; 7779 BP. XX AC X60177; XX DT 04-JUN-2009 (Rel. 14.06, Created) DT 04-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE LOA non-LTR retrotransposon. XX KW Loa; Non-LTR Retrotransposon; Transposable Element. XX OS Drosophila silvestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC planitibia group; planitibia subgroup. XX RN [1] RP 1-7779 RA Felger I. and Hunt J.A.; RT "A non-LTR retrotransposon from the Hawaiian Drosophila: the LOA RT element."; RL Genetica 85(2), 119-130 (1992). XX DR EMBL/GenBank/DDBJ; X60177; Positions 1 7779. XX CC This DNA sequence may contain a portion of a Gypsy-like LTR CC retrotransposon whose protease domain is fused with the LOA ORF2 CC protein. XX FH Key Location/Qualifiers FT CDS 749..1816 FT /product="LOA_1p" FT /translation="MVVAVSDEIILAIRNLNISNNQQEARIESGVEQNIRT FT AVDNRSTDSIERPDKVSSIISSWHVQFCGSFNEISVEDFIYRINRLTDECL FT NGNWNLLYQFAIILFCGPALQFYWRFQRTNSRSNWFQLSDALRERYREQRP FT DEKIKDSLRSRKQRSGERFVQFLDAIQCIADTLREPMTDRELVANIKRNVK FT VEMRLELLHVASPNIATLRTECHKHEQFCLSMSSKPVPRPTNTSHFLNEVI FT HEEESIDTSHIYSTESTEINAIRSVDRIKCWNCNEVGHRYQDCIKTRRRPG FT VFCYGCGRFDTTHRKTCNRMSTEHNQLISTYSLGSTICTPPIPNNNTSLSK FT NLRCKQLHGNLN*" FT CDS 1786..7716 FT /product="LOA_2p" FT /translation="MQTVAWQLELILFKKKIIASVERKYSICKSIRFPKVR FT IDEFWRNKRSPKNITFISTIRNRNDSASIFTNLMLFGQSYLALLDSGANKS FT VIGGQLAIRLLAADLKLKKLKGNFRTADGQHENVLSALLIPLEYDLLQKEF FT EFMILPSITQDIICGMDFWKSFGINISTTTVISELDYSDQTCNNRCLVLQR FT GPGLKNVWDGSERAELRHQAGRGLRIGNGLPAGRLAANNTRFVVNKQSRTN FT SGRIMSIVPTRRLYCRLPKCSLPQPPLSPHIFTYPILSPHLTPGWTKASFE FT PCRESAWLAARIKTSQVVNPSILHTVLLQQLLGDAAGTRNFGCEAWGRWLV FT HPDHRAQHRTPRGNPLSAQRGRESSCHLGVNSVDWMTRTGGISAVQGSNGT FT SNQELTPDEALKKAKEPREEIAAHAPRPKRRTSNLTPPVATPKRICPEVVG FT ATRGSATAKVLNKNGAGNLDLRPSTSRAAATKPSSPTPTEPRLSYADMAKN FT VRVAVLPVDFPRVMLSHGDLSVLEEAIIDEVIASGGDIAASFTGIHFRVGF FT LLIECSGEASAAWLRTATSRLKSWKGVPLKCKVGDDIPSPHCITLFCPRSV FT GRSTESLLVLLRNQNRIETDTWKVISRRNEGGGALLVIAIDELSKCILWRR FT GTMSSPLRTIPVSGLKKKTGAKPQPAPTGKEDVSVGNITTSDTPASEQSSP FT TAQPADPSEDLAEDEELADVTLCQEGILGEEDIPMSSQELAEELQDAAVAD FT VVMTNSGDGCSSRPHSPCQLEPSQKYMTKGINTGLAQVNIHRAKAASAVLA FT RMFTNKHLGLALVQEPWVNNGIKGLFTADSKVIWDRRDPAPRACIMVRKSI FT NFNILSEFLTRDCVPILVYTKGSAVSMVIVSAYFAGDAPCPPPEVERLVEY FT CRKEKMPVLIGCDASAHHTIWGSSDIHLRGECLTDFIFKYNLELENVGSAP FT SFVTRIREEVLDITLISRSLKPHLREWHVSQEESMSDHRTILFNLKLNTDS FT STPGRNPRRTNWEGYKSTLGLNLANGLSGTPRNPIELDRATDDLNKCIISA FT FEENCPVGKRFMEKDAPWWNDSLERLRVTTRRLFNKAKRDGIWEQYRECLT FT SYNKEIRKAKRKNYRDFCESIVSTSEGARLHRALAKRTPDANLALKRGDNS FT FTISNKQRLELLFETHFPGCTPLQEEAIVGISRYRPSTDDWACAKSTVTKE FT KLNWAIGTFQPYKSPGMDGISPAFLQTGQDILLSRIRKALVSSLALGHIPS FT ACRRARVVFIPKAGKKDITDPKSFRPISLTSFLLKTLEKMVDYKIRSTLLK FT QRPLHPAQHAYRVGRSTDTALYQLQRTLSAAIDYKEVALCAFLDIEGAFDN FT TSHDAIKDTLSRRGLDPTTSRWILALLRSRQVTASVHDSTVTVLTTKGCPQ FT GGVLSPLLWSLLVDELLNRLTNSGIQCQGYADDIVIMARGKFEESLCDMVQ FT SGLRITYDWCKEVGLNLNPTKTVIVPFTRRHKLQRMRQIWLSGTPLERSRE FT VKYLGVIFDSKLNFGTHVQNAMLKCSRALYTCRSIAGKSWGTSPKIVRWLY FT LMVVRPMLTYGVIAWGDRARLITVKKQLQKLQRMACVCMTGVMCTCPTMAL FT EALMELTPLHHIIRLKQKATLLRMSAEGVGCPTLSNELPLLLQPRDEMKVE FT YIFERNFTVYMSSKRNWTTLEEVHPMKPHTIRWYTADHSPTRAQVSVWWAL FT GCHTTNPYGTHTSIFQAEVCALGKCAILTLKRNYRNTDTSILSDSQAALNA FT ITGTKITSKIVQEARSKLNLLGTHNRLACDGSRATGIYRVTRRLISRREWG FT QKGPLIGPEPYCGIGRHTIRLVLRNEEKQVRQQSWSEAVGLRQARCLLGGY FT NLKRFKQVITMGKNNLRILTGLMTGHCRLRSHLTRLGIYSSDLCRFCEIEE FT ESSVHILAECVALARRRCSILGMHVLNFRDIEDLNPTKILTFVREVGLMEE FT L*" XX SQ Sequence 7779 BP; 2297 A; 1808 C; 1900 G; 1774 T; 0 other; ttaaccaccg ttagcaaaaa acattaatga ttttgatttc tttattggct tcatgtcgat 60 ctaaactagt tgatcgattt ccaaatcaat accacaatta ttgttaaaag agaatgagat 120 tgttcttggt agattatttc gccactttcc tcttttccga aaacaattct tcaaattaaa 180 gtagaatcag taattggtta agggaaatca cgattgatat atttataatt tcaagatgaa 240 accattttat taacggttaa gtctcaacgt gattcataaa ttcataaatg gcagtgcagc 300 gaagtaacga agatatagca gctagcataa atgctgatat ggtattgtgt aacatttgta 360 gagccccagt tctgattaat actgaaattg cggagacccc ttgcaaacat aaattccatc 420 ggtaacctga ttacgtacta atgaaacatg tccgtcgtgt agaacgccat gtacactttc 480 ccagttaatt gatgtcaaaa gccagtcatc gaaagggcct tttcaagttc ctcgaacagt 540 tcgtggtggt catcgagttg gtgctacaca taggaatata cctaatatca aatccaataa 600 ctctggagca caggcttaca ggcctaatac taatttcaat gtcgggaata gacgggattc 660 attcaatgct tcgcttgacc catccatccc ttctgaacag cgaattcagc aacttatttc 720 aaactcgtta gatacatttc gagccagtat ggttgtagca gtgtctgacg agatcatttt 780 agctataaga aaccttaata tctcaaataa ccagcaggaa gctcgaattg aatcaggagt 840 cgagcaaaat attcgaacag ctgtagataa tcgaagcact gattccattg aacgacccga 900 caaagtctca agtattattt cgagttggca tgtacaattc tgtggatctt ttaatgaaat 960 ttccgtggaa gattttattt accgcatcaa tcgcttgacc gacgaatgct tgaacggaaa 1020 ttggaatttg ctttatcaat ttgcaattat tttgttctgt ggacctgcat tacagttcta 1080 ctggcggttt caaagaacaa atagtcggtc taattggttt caactatccg atgccttgcg 1140 tgagcgatat agagaacaac gcccggatga gaaaataaaa gattctttaa gatctcgtaa 1200 acagcgaagt ggagaacgtt ttgttcaatt tttggatgct atccagtgta ttgctgatac 1260 actgcgagaa ccaatgacag atcgggaatt agtggctaat attaaaagaa atgttaaggt 1320 tgaaatgagg ttagaactac ttcatgtagc ttcacccaat atagcaaccc ttcgaactga 1380 atgtcacaag cacgaacaat tctgtttaag tatgtcttca aaacctgtgc ctcggccaac 1440 taacaccagt cattttctta atgaagtcat acatgaggaa gaatcgatag atacttccca 1500 tatatattct acagagtcaa ccgagataaa tgcaatacgc tcagttgata ggataaaatg 1560 ttggaattgt aatgaggtag gtcatagata tcaagattgt ataaaaacca gacgcagacc 1620 aggcgtattt tgctatggtt gtggccgttt tgataccacg catcggaaaa cttgcaacag 1680 gatgtccacc gagcataatc agttgatatc cacctacagt ctcggatcaa caatttgtac 1740 cccgccaatt cctaacaata acacgagcct atcgaagaat ctaagatgca aacagttgca 1800 tggcaacttg aattaatatt attcaaaaaa aaaattattg ccagcgtaga gcggaaatat 1860 tctatttgta aatcaattcg cttcccaaaa gttagaatag atgaattttg gagaaacaag 1920 cgatcgccaa agaacattac atttatctct accatccgca accgtaatga ttccgcgtcc 1980 atatttacaa atcttatgtt gtttggtcaa agctatttag cattattgga tagcggtgct 2040 aataaaagcg tgatcggagg acaattggcc atacgattgc ttgccgcaga cctaaagtta 2100 aaaaaattga aaggcaattt tcgcactgca gatggccaac acgaaaacgt attaagtgct 2160 cttcttattc cgttggagta tgacttatta cagaaagaat ttgaatttat gattctacct 2220 tcaatcacac aagacatcat ctgtggaatg gatttttgga agtcatttgg tattaatatt 2280 tctaccacga ctgtaataag cgaattagac tacagtgatc aaacgtgtaa taataggtgt 2340 ctagtgttac agagaggacc tggtctaaag aacgtgtggg atggttcgga aagggctgag 2400 ctgaggcacc aggcgggtcg cgggttgcgg atagggaacg gccttcctgc agggaggtta 2460 gctgcgaata atacgaggtt cgtcgtaaat aagcagtccc ggacaaacag cgggcgcata 2520 atgtcgatcg taccgacaag gcgcttgtat tgccgccttc caaagtgtag cttaccacaa 2580 ccgcccctat ctccccacat atttacctac cctatcctat cccctcatct cactccgggc 2640 tggactaagg cgtcattcga accgtgccga gagtcagctt ggctggcggc acggattaaa 2700 actagccaag tcgttaatcc atccatcctc cacaccgtgc tgctccagca actattagga 2760 gatgccgcag gcacaagaaa ctttggctgt gaggcatggg ggcgatggct ggtacaccca 2820 gaccaccgcg cccagcaccg aacgccgcga ggcaacccgc tcagcgccca aagaggaagg 2880 gagagttcct gccacctagg ggtcaactcg gtggactgga tgacccgaac cggaggaatc 2940 tctgcggttc aagggtcaaa tggtacctcc aatcaggaac tcacccctga tgaagccctg 3000 aaaaaggcca aagagcccag ggaggagatt gcagcacatg cgccacgccc gaagaggcgt 3060 acgagcaatc tcacaccacc ggtggccacc cccaagagga tttgcccaga ggttgtgggc 3120 gctacccgcg gctcggcaac ggcaaaggtg ctgaacaaaa atggggcagg gaatctagac 3180 ctgcgcccaa gcacgtcgcg tgcagcggcc acgaagccca gctcgccgac acccaccgag 3240 cctcgcctgt catatgcaga catggcaaaa aacgttaggg tagccgtgtt gcccgtggat 3300 ttcccacggg tcatgctcag tcacggagat ttgtctgtgt tggaagaggc catcatagac 3360 gaagttattg cgtctggtgg agatatcgcg gcctcattca cgggcattca cttccgggtt 3420 ggattcctgc taattgaatg ctccggtgag gcctcagccg cctggctgag gaccgcaaca 3480 tcgaggctga aatcgtggaa gggcgtgccc cttaagtgca aggtgggaga tgacataccg 3540 tcgccccact gcatcacgct gttttgcccc aggagtgtgg gtcggtccac cgaatccctg 3600 ttggttctgc tgaggaacca gaacaggatc gagaccgaca cctggaaggt gatctccagg 3660 aggaacgagg gtggaggagc cctcctggtg atcgcgatcg acgagctatc caaatgtata 3720 ttgtggagaa ggggcaccat gtcttctccg ctacggacca tccccgtaag tggactaaag 3780 aagaagactg gagcaaagcc ccaacctgct ccaacgggca aggaggatgt ctcagtgggt 3840 aacatcacca catcagacac tcctgcctcg gagcagagca gtccaaccgc gcagcccgcc 3900 gacccgtcgg aggacctcgc agaagacgag gaactggcag acgtgacgct ctgccaggag 3960 gggattctgg gggaggaaga catccccatg tcctcgcaag agctggcgga ggagctgcag 4020 gatgcagctg tagctgacgt cgttatgacg aacagcggtg acgggtgcag ctccaggccc 4080 cattcaccgt gtcagctgga gccttcccag aagtacatga cgaagggcat caacacaggg 4140 ctggcccagg taaacatcca ccgggctaag gcagcctcgg cggtcttagc aaggatgttc 4200 accaacaaac accttgggct ggccctggta caggagccgt gggtgaacaa tggcattaag 4260 ggcctgttca cagccgactc aaaggtaatc tgggatcgga gagatccagc acccagagcc 4320 tgtatcatgg taaggaaaag tattaacttt aatatccttt cagaattctt gactagagac 4380 tgcgttccca tattggtgta taccaaaggc agcgcggtct ctatggtcat cgtgtctgca 4440 tatttcgcag gggatgcgcc ctgcccacca ccagaggtcg aaaggctggt ggagtattgc 4500 aggaaagaga agatgccggt gctcatcgga tgcgatgcca gtgcgcacca tacgatatgg 4560 ggcagcagtg acatacattt aaggggtgag tgcctaactg attttatttt taaatataat 4620 ctagaactag aaaatgtcgg atctgctccg tcatttgtta ctaggatcag ggaagaggtg 4680 ctggacatca ccctaatcag tcggtcccta aagcctcacc ttagggaatg gcatgtttcc 4740 caagaggaat ccatgtctga ccataggact atcctattta atttaaaatt aaatacggat 4800 agtagcacac caggtcgcaa ccccaggaga accaactggg aaggctataa gtcgacctta 4860 gggctcaacc tagccaacgg actatctggt acacccagga atccgataga gctggacaga 4920 gccacggatg acctcaataa gtgcataatt agtgcatttg aggagaattg cccggtaggg 4980 aaaagattca tggaaaaaga tgccccatgg tggaacgaca gtctggaaag gctgcgcgtc 5040 accacacgtc gcctcttcaa taaagccaag agagacggaa tatgggaaca ataccgcgaa 5100 tgccttacct cctataataa ggagataaga aaggctaagc ggaagaatta cagagacttc 5160 tgtgaaagca tagttagcac aagcgaaggc gccaggctcc acagagcact ggcgaaacga 5220 acgcctgatg ctaaccttgc actgaagcgt ggggataatt ccttcacgat tagcaataag 5280 caaagattag aattactctt cgagacgcac ttcccaggct gcacacccct gcaggaagag 5340 gctatcgtag gaataagcag atatagaccg tccactgacg actgggcatg cgccaagtcg 5400 acagtcacaa aggagaaact aaattgggca attggcacat tccagcccta taaatctccc 5460 ggaatggacg gcatatcgcc agccttcctt caaacgggcc aggatatact cctctcccgt 5520 attaggaaag ctctagtaag tagcctagcc ctcggacaca taccgagcgc atgcaggaga 5580 gcaagggtag tcttcatccc gaaagcaggg aagaaagata ttaccgaccc gaagtccttc 5640 agacccatca gcttgacatc gtttttattg aaaacgttgg aaaagatggt ggactacaag 5700 attagaagca ctttgctcaa gcaaaggccg ctgcacccag cgcagcatgc atatagagta 5760 ggcaggtcta cggacacagc actttatcag ctgcaacgca ccttgagtgc ggcaattgat 5820 tataaggaag tagctttgtg cgccttccta gacatagagg gtgcctttga caatacatca 5880 cacgatgcga tcaaggacac cctctcgaga aggggcctgg atcctaccac cagcagatgg 5940 attctcgcac tgctgcgatc caggcaggtc acagcatcag tgcatgatag caccgtaacg 6000 gtcctaacca ccaagggctg tccccaaggg ggggttctgt ctccgctact ctggagtctg 6060 ttggtagacg aactactaaa cagactcact aacagtggta tacaatgtca aggttatgcc 6120 gatgacattg ttatcatggc gcgaggaaaa tttgaagaat cactctgtga catggtccag 6180 tctgggctaa ggataacgta tgactggtgt aaggaggtcg gactcaacct taaccctacg 6240 aaaacagtca tcgtcccctt taccagacga cataaactac agaggatgag gcaaatatgg 6300 ctctcaggta ccccactaga aagaagtagg gaggttaaat acctgggcgt catatttgac 6360 agtaaactta actttggcac ccatgtgcag aatgccatgt taaagtgctc cagagcgctt 6420 tacacatgtc gcagcatagc cggcaaatca tggggcacat caccaaagat agtaagatgg 6480 ctatacctaa tggtagtaag acccatgcta acctatgggg taatagcatg gggtgacaga 6540 gcacggttga tcaccgtgaa aaagcaactg caaaaattgc aaagaatggc ctgtgtctgt 6600 atgacaggag taatgtgcac ctgcccaaca atggcccttg aagccttaat ggagctcacg 6660 ccactccacc acatcataag gctcaagcag aaagcgacgc ttttaaggat gtcagcagaa 6720 ggagttggat gcccaactct ctcaaatgaa ctgcccctgc tattgcaacc cagggacgaa 6780 atgaaagtcg aatacatctt cgaacgtaat ttcacagttt acatgagcag taaaaggaac 6840 tggacaactc tggaagaggt ccaccctatg aagccgcaca ccataaggtg gtacacagcg 6900 gatcactcac caaccagggc acaggtctcg gtgtggtggg ccctcgggtg tcataccacg 6960 aatccctacg gaacgcacac aagcatattc caggctgagg tatgtgcgtt aggaaaatgt 7020 gcgattttaa ccttaaaacg taactatcgg aatacagaca catccatact atccgatagt 7080 caagcagcat taaatgcaat aacggggact aaaataacat caaagatagt ccaggaggct 7140 cgttcaaagc taaacctact tgggactcac aacaggcttg cctgcgatgg gtcccgggcc 7200 acagggatat accgggtaac gaggcggctg ataagcaggc gagaatgggg gcagaaaggc 7260 cccctgatag gaccagaacc gtattgtggc ataggcagac acactatacg gctggtactt 7320 agaaatgaag agaaacaggt gcggcagcag agctggtcgg aggcagtagg tctcaggcaa 7380 gccagatgtc tcctcggtgg ttataatctc aagcgattta agcaagttat aaccatggga 7440 aagaacaacc ttaggatcct caccggtcta atgacggggc actgtcgact aagaagtcac 7500 ctaactagac taggtatata tagtagcgat ctctgcaggt tctgtgaaat agaggaggaa 7560 tcctcggtac acatcctagc agaatgtgtc gcactagcta gaaggagatg cagcatcctg 7620 gggatgcatg tcttgaattt tagagacata gaagacctca acccaacaaa gatcctcaca 7680 ttcgttcggg aagtggggct gatggaagag ctataggctc agaagggggc acaatagatc 7740 taaaaggtcg cggtgcaact tccccaataa taataataa 7779 // ID Gypsy2_MH-LTR repbase; DNA; INV; 232 BP. XX AC ABLG01001375; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_MH; KW Gypsy2_MH-I; Gypsy2_MH-LTR. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-232 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1521-1521 (2009). XX DR Genome; ABLG01001375; Positions 762 993. XX SQ Sequence 232 BP; 64 A; 52 C; 25 G; 91 T; 0 other; tgttgggaac gaaagaaaat gacatttgat cccatttgac ccttttgtct tttacccatt 60 ttcactcgct ccctatataa acatttcact ctcccttaaa tgtatccatt ctcccattca 120 ttctaatttc agcaccctaa gctgacctat tgaattgtat tttatttgga gataataaac 180 ttttattatt ttattcttta ttcactgttt gtacctaaac gggtacacaa ca 232 // ID hAT-61_HM repbase; DNA; INV; 6222 BP. XX AC . XX DT 29-DEC-2008 (Rel. 13.12, Created) DT 29-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-61_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6222 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2049-2049 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1309..4554 FT /product="hAT-61_HM_1p" FT /translation="HNINYVASLILIFFFIIRTSPNGNCLFNACSISLIGN FT ESLSEYLRCLTSIELYTNVGFYLNHPIIFSQTDVTKSRRIDENTAFSIIIS FT QKAYNSFCKNNKTLSVISESINIATNFSYSSFLTLCALSSVVGIPIESYYP FT IEKDRKEIEKTVYEILFNCTVRPRVEHSLSTSVKKIHILRCSSVIAGFKQT FT SNIFINKDHFVPLISLKCIPKKYQAPNIFAPKVVKLPEINQLTSSKSSILF FT PDSTQETDILDSVKILSTFKKRKQLTIDQFAIKSPFPTDKKICVVSSPIKL FT KSSTTLSIAECSSKYIHSPIKANLNSKSSSSAFGNKCELSFVNKYDIGLYE FT TYGINKLGNEDKYDLLKNVYKPSSSFHFESNKSGRSFQFSWLSLYSWLAYS FT EIKKGAYCVNCVIFGSEESSHNASKLVQLYKSPFVAYSKAINKFNKHAESS FT PIHQTATLKATHFRQCMEQKINFIDLNLNKVIDEQVKKNREILKPIVLAII FT MCGKQNIPLRGHRDDSFYYENENSNSGNLQSILKLISDCAENSLVNGRIST FT PKNATYRSKTTQNELINICNDIIISKLKCEVKKAKFFSILADEASDVSNME FT QMPLVLRFVNDNCEICELFFGFIPCDSGLSGEAIGSQILNSIKDLGLEMKF FT CVGQGYDGAGNMAGNCSGAAVRIKAIYPKALYVHCGSHVLNLCVANACNIQ FT IVSNMMSNVRVISQFFNFSPKRFDVLKKKIEEMFPKAHHSRLIDVCRTRWI FT ARIDGLDVFIEVLPAIIKCFELMINNDEKSWNHDSVRDANSYFFATRSFQF FT LVTLVVVSRCLEVTRPITKQFQSPNFDVFASHEKINLLHLALQRLRSDIEH FT SHGAWYSEAVVLAASVGVYEHKPRTIGKQLNRNNCPSESINEYYKKVITIP FT FLDHLSSQIQLRFSNKNIAVYNGFFAMPTNVLNVAEWRSNFLLFSNEYEDE FT LPEPRYIMTELNMWEDYWRSFKNSLPTTLTKLLPLIDRITFPNIFTLIQIL FT ATIPVTTCTCERSISVIRRLKTYLRNTMSQNRFNSLALMHVHQDISIDVNE FT VIDCFARKNPRRMKLVDILNSDPI*" XX SQ Sequence 6222 BP; 2120 A; 881 C; 954 G; 2267 T; 0 other; caggggcgga tccagagggg ttcaaagggt gctccagcac ccgtcaactt ttaacaagta 60 aagtttccta tttcgttgca ttttttttat aaataaaaat tgaattgtta aaatagttta 120 ttcaactcaa aatttgcgat ttcatttttt gtgaaggaag tcaaaaataa ataaataatt 180 atctttgttg ttgtgtaata ctttgcgttc agcgcttgcg tattttgtgt atacttatgc 240 ttcgactagg aaactccagt atagtcgctt tttgattggc ctgctcccct cagaattatt 300 tttgattggc ctgcacccct cagaagctat ttttgattgg cttggtttct taagctaatc 360 aaaacaggtt ctgaggggtg ctgacccaat cagaaagatc tctgaagggt gcaagccaat 420 cacgcaaaaa tcagaaaaaa ggtttttcac taagcttttt gttaactcgt aggttaagtg 480 ttctaaacat ggcggattca atgcttcgaa tcattttaaa atgtatttta gatgaaaata 540 gggttcaatc agaaaaggaa tcgtttttat ctaatgttat tgtcgaagct cataataaga 600 ttagagaaat atttttgttt aatgaaaaat ctaaagtaag tatattgaat aaaatatgca 660 aagtttcaag tatgcagttt cccgtatctc aatcatttcg agcagaaagt gattttcaac 720 cagtgaggta ataagttgtt tcttaaataa ttatattaaa ttaaaaaaac aaaaaacaaa 780 cgaagtgttt atgtatgtgt gtgtgtgttt aataatagca aattaagaaa tcatatatgc 840 tttaaatgat ttctaattgc tattatttta accattaatt aacacaacac acacacatat 900 tttgtatagc ctacacacac acacattttg tttagcctat actataggat atacaaaaag 960 ctatactata gctttttgtg cctaatattt tactacaaac ttttttattt aagctgaaag 1020 atgtgtaaaa ttttgttcta ggtgcttaaa tttaaaaaac ttttttttaa gtaaaattgt 1080 gttcatcctt aagtatatat acttaagcat gaacacaatt ttactaaaaa aaaaagtata 1140 caaaaaagta tataatatca tatttcatat aatatcatat ttcaaggaga aaataatcaa 1200 ttatgttata atttttttta tgtttgaata ataccaaatt tacatttagt aaatcattta 1260 ctagtaagta attttatgat ataaaattat gtaataatta ttttgtgaca taacataaat 1320 tatgttgctt cgcttatatt aatttttttt tttattatta gaacttctcc aaacggaaac 1380 tgtttattta atgcttgttc tatatcattg attggcaacg aatctttgtc agaatacctt 1440 agatgcctga ccagtatcga attatataca aatgtaggtt tttatttaaa ccatccaatt 1500 attttttctc aaactgatgt taccaaatct cgaagaattg atgaaaacac agctttttct 1560 attattattt cccaaaaagc atataattct ttttgtaaaa ataataaaac actcagtgtc 1620 atatcagaat ctatcaatat tgcaacaaat ttttcttatt cctcgtttct tactttgtgt 1680 gctctctcaa gtgttgttgg aataccaatt gaatcatatt atcctataga aaaggataga 1740 aaagaaatag aaaaaactgt ttatgaaata ttgttcaatt gcactgttag accacgtgtt 1800 gaacattctt taagcacatc tgtaaaaaaa atacacattc ttcgttgctc ttcagttatt 1860 gcaggtttta aacagacctc taatatattt ataaacaaag atcactttgt tcctttaata 1920 tctttaaaat gtattcctaa aaaataccaa gctcctaata tatttgcacc taaagtagta 1980 aaattacctg aaattaatca acttacttct tcgaaatcta gtatcctgtt tcctgattct 2040 actcaggaaa cagatatttt agattcagtt aaaattcttt caacttttaa aaaaagaaaa 2100 caactcacaa ttgatcaatt tgctattaag tcaccttttc ctacagacaa aaaaatttgt 2160 gttgtatctt cgccaatcaa attaaaaagt tctaccacat tatcgattgc agaatgttct 2220 tcaaaatata tacactcacc gattaaggca aatttaaact ctaaaagtag ttcttctgct 2280 tttggaaata aatgtgaact atcttttgta aacaaatacg atatcgggct ctatgaaact 2340 tatggaatta ataaactagg caatgaagat aagtatgatt tactaaaaaa tgtttataaa 2400 ccaagctcta gttttcattt tgaatccaat aaatcaggtc gttcctttca attctcttgg 2460 ctctcattat actcttggct tgcatattca gaaattaaaa aaggtgctta ttgtgttaat 2520 tgtgttattt ttggatctga agaatcaagt cataatgcaa gtaaacttgt tcagctttat 2580 aaatctccat ttgtggcata ttcaaaggcc attaataagt ttaataaaca tgctgaaagt 2640 tcaccaattc accaaactgc aactctaaag gcaactcatt ttcgccaatg tatggagcaa 2700 aagataaatt ttattgactt aaatctcaac aaagtaattg atgaacaagt caagaaaaat 2760 agagaaatat taaaaccaat agttctggca attattatgt gtggaaaaca aaacatacct 2820 ctccgaggtc acagagatga ttcattttac tatgagaatg aaaatagtaa ctctggcaat 2880 cttcaatcaa ttttgaaatt aatatctgat tgtgcagaaa acagtttggt gaatgggcga 2940 atttctactc caaaaaatgc tacatatcgc tctaaaacta ctcaaaatga gttgattaat 3000 atttgtaatg acataataat ttctaaattg aaatgtgaag ttaaaaaggc taaatttttt 3060 tcaatattag cagatgaagc atctgatgtg agcaatatgg aacaaatgcc acttgtatta 3120 cgctttgtaa atgataattg tgaaatttgt gaactatttt ttggttttat tccttgcgac 3180 tcaggcttgt ctggagaagc tataggcagc caaattttga atagtattaa agatttggga 3240 ttagaaatga aattttgtgt tggtcaaggt tatgacggtg ctggtaacat ggctgggaac 3300 tgttccggtg ctgcagtaag aataaaagct atttatccaa aagcattata tgttcattgt 3360 ggatctcatg tcctcaatct ttgtgttgca aatgcctgta acatacagat agttagcaat 3420 atgatgtcaa atgttcgtgt tatatctcag ttctttaatt ttagtccaaa gcgttttgat 3480 gttctgaaaa aaaaaattga agaaatgttt ccaaaagctc atcactctcg tttaatagat 3540 gtatgtcgta caagatggat tgccagaatt gatggtttag atgtgtttat agaggtactt 3600 cctgctataa ttaaatgctt tgagctaatg attaataacg atgaaaagtc ttggaatcat 3660 gactcagtgc gtgatgcaaa tagttatttc tttgctacta gatcttttca gtttcttgtt 3720 actttagtag tggtgtctcg ctgtttggaa gtaacaagac caattacaaa acaatttcaa 3780 tctccaaact ttgatgtttt tgcatcgcat gaaaaaataa atttgttaca tcttgcttta 3840 cagcgactaa gatcagacat tgaacacagt catggtgctt ggtatagtga ggcagttgta 3900 ttggcagcta gtgttggtgt ttatgagcac aagcctcgta caataggaaa acagcttaat 3960 cgaaataatt gtcctagtga atctattaat gagtattaca agaaagtcat tacaattcct 4020 tttcttgacc atttatcatc ccagattcaa ttacgttttt ctaataaaaa cattgctgtt 4080 tataatggat tctttgcaat gccaactaac gttttaaatg ttgctgaatg gagaagcaac 4140 tttcttcttt tttcaaatga gtatgaggat gaacttccgg aaccacggta tatcatgacc 4200 gaattgaaca tgtgggaaga ttattggcgt agctttaaaa attcccttcc tactactctg 4260 accaaactcc ttcctcttat tgatagaata acttttccta atatatttac cttaattcaa 4320 attcttgcaa ctattccagt aacaacatgc acttgcgaaa gatcaatctc tgttattcgt 4380 cgtttaaaaa cctatctacg taacacaatg tcacagaata gatttaatag tcttgcttta 4440 atgcacgttc accaagatat cagcatcgat gtaaatgaag tgattgattg ttttgctagg 4500 aaaaatcctc gtagaatgaa acttgtagat attttaaatt ctgatccaat ttaaactgtt 4560 ttatttaata atataatata aatatattat ttactgtttt atatatttta ttgagtaatt 4620 taaatagtgg tcctagaaag ttctgaaagt tctagaaagt tttttttgta ttttttaagc 4680 atgaaattat acttcaaaaa atatatacag tagaatcttg ttaagtctga ttctcaaggg 4740 aaatagataa ataatcagac ttggcgaatg tcagagttag gggaagctct cgttatgtat 4800 agaaagctca ataggaattg aaaagctgac ttagtaaaag tttgagttat cagacataat 4860 tataaagttt tcattattaa taaattagca ttattgatag ttataattat agtttaatta 4920 taattaaagt taaattataa ctggatttaa aaataagtct gttcttacat ttttatttat 4980 atttttgggg aatatccttg gaaaataaat ggctccagaa tatcatgaaa ctcagaatat 5040 ttattttatt actgttacat aattttgttt ataaacagtg ttatataatg ttattaaatc 5100 tattcactat tattgatgta tttagtgtaa gagtttatta ttattgttat tatttttgtt 5160 ataatttata aataattatc gtgtataatt atatatttta tatatcatag tattaattat 5220 ttttaatagt gagttttaat acaattgtgt tataattgca atataatgtg catgacacag 5280 gtgtttttag aagttaaact agatggttac ttgttttatt ttcaagtttc tgtttaaaga 5340 tcatattaga agcaaatttt gagttccatt gaaactagat ttttattaac agtttttttt 5400 atcagttctt cattttgtaa ctattttact attttatttt agttttccta ctgttgaaat 5460 atggttttaa tttagtaaag tttcatgatt tgaaatttac tttattggtc ataagctaac 5520 agtttagaaa aggaaattta gcacagttag ggtaagaaat ttgtttactt ttaacatttc 5580 aaaactgggt tttactgttc aattcttatg ctattcctag tttatgttaa atgtcatacg 5640 aataataaac aaattgtgtg ttcattgttt ggggtcatga aattattgta tcattattga 5700 ttgattcttt atataacggt cgatcactgc caatatatag ttgctgcatt actttgcata 5760 caaaatattt ttatgtacac aaaattactt tcgcactgta ctctaataaa taggagggtc 5820 ggaaattttt ctaaattata ataaacgggg gagagggggt ttcttaataa accgaagggt 5880 cggaatttta aaaagcttaa aatagttttt ttggttggtt attcatattt tcatctatta 5940 ttgagagaaa tatttttctc aaagtaaaaa aatagcaaaa atattttata ttgtttcaaa 6000 ttaataatca aggggggggg ggggggattg ggatcctaat aagctccggg aggtcggaaa 6060 aaaattcccc tccccgtact ttataaaaac cttttttaca accaacccct ttcatgcaac 6120 gcaaagcgcg gggggggggg ggtaatgaaa acaccccctc atcgtcgcag cacaacacat 6180 atttgccagc acccctcaaa aaaaaagcta gatccgcccc tg 6222 // ID CR1_Ele35 repbase; DNA; INV; 5162 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele35. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5162 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5162 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >96% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 409..1257 FT /product="CR1_Ele35_1p" FT /translation="MSLVCCSCAGDIGDIQVECQGFCKAIFHPRCCAVAAD FT VFEEVMRNNQLFWFCPSCTALMKDMRLRNTARAAYEVGQGHALNSHSDIMK FT NLKTEIMDELKAEIRTNFAKLINSSTCTPKSSKRVGVDPRFTRSRRLFSTV FT AKSIPNQQPPLLLGTGSTPSPSLDIVTVPPNQPKFWLYLSRIARDVSVNQI FT CALAKKRLGTDDVQVIRLVGKGRDMSTLSFISFKIGMNMEMKSKALSTSTW FT PKGVVYREFTDNRTDENFWRPVPAAASDDPLSLSTEDVVLME" FT CDS 1176..5066 FT /product="CR1_Ele35_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RKFLASCTCCRIRRSAEPFDRRCCFNGVRMEEVNSYS FT SPGRKLATSIKEASVPLTAVEPLPPATSSRPGPACEKEDEVFQNFSTGKYV FT ILPRTSSAVPFVDFSISPSIQILPTSSSWTPGRIFAACIKETPTLLTAVEP FT FPPTTSSRPGPACEHEDGVFRTPTTGKYADLSSISSAVSFVVSSITSSIHS FT RSTSLKSTPGRTFSASLKEAPIPFTAVEPFQPATSSRPGPACDLEDGVFRA FT PMKGKYTDLPSISSAVSFVDSRNLVPKDALTDSRLYDLWTTALGDPQLAIA FT GPSRNSACFSAIPKLQLTAKPSIPFNVSGGSTQHKRPERCPENPSDMPFDL FT RIYYQNVRGLRTKIDDVFLSTAELDYDVYVFTETWLDECIESRQLFSPDYS FT VFRVDRCATNSSRRRGGGVLIAVKEKFGTTVVSTRANIEQLWVKMSTQVTD FT IFIGALYIPPDKSQDGTFMQLHLDVISEVCSSRSDASPLVVFGDFNQPRLA FT WLLSNGYAIVDALNSHISSASQILLDNMAFQGLRQRNSIRNSSGRVLDLVF FT SDDSSKKSVLATPATEHVVPLDRYHPALDFSVQIPSSMTFYEDIDLSERDF FT RRCNFDALNSLLSQVDWSEVHQCSDVDSAVTCFNAIIHNYIAETVPLNRPP FT RKPAWSNRHLGVLRRRRDKLLQQYNRQKCAHFKRQLDEASRIYRGYNRFLY FT KNYVSRKQNDLRRNPKSFWNFVNSKRKEHGLPTTMHRGDVFTSSQDEKCAL FT FAQQFSSVFVGRIPTVQDTEAATCIVPRDVVDLDIFHVNESMVYAALQKLK FT LSNCPGPDGIPSCIFRKCGASLVAPLVSIFNKSLQSQLFPTAWKTSYMRPV FT FKKGDKTDFANYRGITSLSAGAKCLETIVNKVMFSSFCCYISEAQHGFFPR FT RSVESNLVDFTSTCIRSMDNGAQIDAVYTDIKAAFDTVNHEILLAKLLRLG FT VSTRMCNWLQSYLRNRSLCVKVGSTVSHPFHPASGVPQGSNLGPLLFSLFF FT NDVTIVLANGGLLIYADDLKIFLVVRTEADCRDLQELLNQFARWCTMNFLA FT VSVSKCCVISYRRCKSPILYDYNINGQELERVDKVKDLGVLLDYRLTFKLH FT YATIIDKANRQLGFILKITKEFNDPMCLRSLYCSLVRSILEFASIVWSPYE FT AVWITRIEAVQRKFVRHALRDLPWRDPLNLPPYDHRCRLLGLEPLHVRRKN FT SRAVYGSKIVRTEVDCPALLQHVQFYAPERVLRTRQLIQIASRNTGYGAND FT PLNSICNEFRQAYGVFDFNLTTNTFRQRLSRTRFIN" XX SQ Sequence 5162 BP; 1342 A; 1178 C; 1134 G; 1508 T; 0 other; attcaatcac cgtgctttgt ttacatgttc gatgtttcca aagctgctaa agaaatcgta 60 tattttgcgt gtttaagtac ccttttccgt cgaatcttat tttgttgcgt agtggatcgt 120 tgagtgcatc attagcaagt gaagataccg gaaagagatc ggaataattt gagaatatta 180 tccactaaac tctcgtcgcg tttctgcggt ttgcaaaaat ctgtgttgtt ttgcttcgtg 240 tgtaccctta ctgcactttg ctctggctac cgtacctata cacccgtaca aaacaggtta 300 actctactga gtgtgttgtc ttgtttatta tcggacatca tagtttatcg atacgtgttg 360 gtcgtgagtg aaaatttcaa gagtaacatt tgaaaaaatc ctatcaatat gtcactggta 420 tgttgttcat gtgctggtga tatcggtgat atccaagtcg aatgccaagg cttctgtaaa 480 gccatttttc atcctcgatg ctgtgcggtt gctgccgatg tgtttgaaga agtgatgaga 540 aacaatcagc tgttttggtt ttgtccttca tgtacggcgc ttatgaagga tatgcgcctt 600 cgaaatactg cacgtgcagc ttacgaagtg ggtcaaggtc atgccctcaa ctcccatagc 660 gacattatga aaaatctcaa aacagaaata atggatgaac tgaaggcgga aattcgaacc 720 aacttcgcca aactgataaa ctcaagtact tgtactccga aatcttccaa acgcgttggt 780 gttgacccca ggttcaccag gagtcggagg ctgtttagta ctgttgctaa atcaatcccg 840 aaccaacaac cacctctgtt attgggaact ggtagtactc cctctccgtc actcgacatc 900 gttacagtgc ctccaaatca accaaagttt tggttatatt tgtcacggat tgctagagat 960 gtatcagtca atcaaatatg tgcattagct aaaaaacggc tcggtactga tgatgtgcaa 1020 gttatccgac tggtgggcaa aggaagggac atgagtacat tgtccttcat ctctttcaaa 1080 attggaatga acatggagat gaaatctaag gcgctatcca cttcaacgtg gccgaagggt 1140 gtggtctaca gagaatttac tgacaacaga actgacgaaa atttttggcg tcctgtacct 1200 gctgccgcat ccgacgatcc gctgagcctt tcgaccgaag atgttgtttt aatggagtaa 1260 gaatggaaga agtcaactct tactcctctc cgggacgcaa acttgccaca agtattaagg 1320 aagcctctgt tcctctcacc gcagtcgagc ccctcccgcc agcgaccagc agtcgtcccg 1380 gtcctgcgtg tgagaaggaa gatgaggtct tccaaaattt ttcaacaggc aagtacgtta 1440 tccttccaag aacttcgtct gctgtaccat tcgttgattt cagcatatct ccatcaatcc 1500 aaatattacc gacatcttca agttggacac cgggacgcat atttgccgct tgcatcaagg 1560 aaacccctac tcttcttacc gcagtcgagc ccttcccgcc aacgaccagc agccgtcccg 1620 gtcctgcgtg tgagcatgag gatggggtct tccgaactcc tacaacaggc aagtacgctg 1680 acctttcgag catttcgtcc gctgtatcgt ttgttgtttc cagcataacg tcatcaatcc 1740 attcacgttc aacatctctg aaatcgacac cggggcgcac attttccgct agtcttaagg 1800 aagcccctat tcctttcacc gcagtcgagc ccttccagcc agcgaccagc agccgtcccg 1860 gtcctgcgtg tgatttggaa gatggggtct tccgagctcc tatgaaaggc aagtacactg 1920 atcttccgag catttcgtcc gctgtatcgt tcgtcgattc cagaaacctc gttccgaagg 1980 atgcactcac cgactcacga ttgtatgacc tatggacgac tgcattgggc gatccccaat 2040 tggcgatagc gggcccctca cggaattctg cttgtttttc tgcgattcca aaacttcaat 2100 taactgcgaa gccgtcgatc ccattcaatg tttctggagg atctacacag cataaacgtc 2160 cggaacgatg ccctgaaaac ccaagtgata tgccattcga tctgcgaatt tactatcaga 2220 acgtgcgtgg cttacgaacg aaaattgatg atgtgttcct gtctacggcg gaattagact 2280 acgatgtcta cgttttcact gagacgtggc ttgatgagtg cattgaatct cgtcagctgt 2340 tttctccgga ttattctgta tttcgggttg atcgttgcgc taccaacagt tctcgccgtc 2400 gtggtggtgg cgttcttatc gccgttaaag aaaagtttgg aacgacagtt gtttctactc 2460 gagctaatat tgaacaactg tgggttaaaa tgtcgactca agtgactgat atttttattg 2520 gtgctttgta cattcctcct gacaaaagtc aagacggaac ttttatgcaa cttcatctcg 2580 atgttatttc cgaggtctgc agttcacgca gcgatgcaag tccgcttgta gtttttggtg 2640 atttcaacca accccgtttg gcatggctat tgagcaacgg ttacgctatc gttgatgcac 2700 taaactccca catttcatcg gcaagtcaaa ttcttctcga caacatggca ttccaaggct 2760 tgcgacaacg taattcgatc cgtaattcca gtggccgcgt gttggacctt gtgtttagcg 2820 atgactccag taaaaaaagt gtacttgcaa ctccagcgac agaacatgtt gtcccacttg 2880 atcgatatca tcctgctctt gatttctccg tgcaaatacc atcatcaatg acgttctacg 2940 aggacatcga cctttcggaa cgagacttcc gtcgttgtaa ttttgatgca ctaaacagtc 3000 ttctttccca agttgattgg tccgaagtac accagtgctc tgatgtagac agtgcagtaa 3060 catgctttaa tgccattatt cataattaca ttgctgaaac agttccgctg aacagaccgc 3120 caagaaaacc ggcttggtct aaccgacatc ttggtgtcct ccgacgtcga cgtgataaat 3180 tgcttcaaca atacaatcgt caaaaatgtg cacatttcaa acgacagttg gacgaggcta 3240 gtcgtatata ccgtggatac aatcgctttc tgtacaaaaa ttatgtctcc aggaagcaga 3300 atgatcttcg tcggaatccg aagagttttt ggaactttgt caactcaaaa cggaaagagc 3360 acggccttcc aacaacaatg caccgaggag atgttttcac aagcagtcaa gatgaaaagt 3420 gtgcgctttt tgcacagcag ttttcaagcg ttttcgttgg ccgcattcct actgttcagg 3480 atactgaagc tgccacttgt atagtgccac gtgacgttgt tgaccttgat atctttcatg 3540 tcaatgaatc aatggtttat gctgcattac agaagcttaa attatcgaat tgccctggac 3600 ctgatggtat tccgtcctgc atctttagaa aatgcggtgc ttctttggtg gctccactgg 3660 taagcatttt taacaaatcg ctgcaatcgc aactgtttcc aactgcttgg aaaacatcct 3720 acatgcggcc ggtattcaag aagggcgata agaccgattt tgccaactac cgtggaataa 3780 catccctttc agcaggtgcg aaatgcttgg aaaccattgt taataaagta atgttcagct 3840 cgttctgttg ctatattagt gaggctcaac acggcttctt ccctaggcgg tcggtagaaa 3900 gtaacttggt tgattttact tcgacctgca ttaggtccat ggataatgga gcgcaaatag 3960 acgctgttta cacggatatc aaggctgcct ttgacactgt gaaccacgag attctgttgg 4020 cgaaattact ccgactcggt gtttcaacac gaatgtgtaa ttggctgcaa tcgtacctta 4080 gaaacaggag tctctgtgtg aaggttggat ctactgtatc gcatcctttt caccctgcat 4140 ctggggtacc gcagggcagc aacctgggac ccttactgtt ttcgctgttc ttcaatgatg 4200 tgactatcgt tcttgcaaat ggcggtttgt taatctacgc agacgatcta aaaattttcc 4260 tcgtcgtgag aactgaagcg gactgcagag acctccagga gctattgaac cagttcgcac 4320 gctggtgtac catgaatttc ctggctgtta gcgtttccaa atgctgtgtg atttcatacc 4380 gtcgctgtaa atcaccaatt ttgtacgatt acaatataaa tggtcaggaa ctcgaacgtg 4440 tggacaaggt caaagatctg ggcgttttgt tggattacag attgacattt aaactgcatt 4500 atgccactat cattgacaag gcgaaccgtc aactgggctt catcttaaaa attacaaaag 4560 aattcaacga cccgatgtgt ctgcgttcct tgtactgttc tctggttaga tccattttag 4620 aatttgcttc tatagtatgg tcaccttatg aggcagtgtg gataaccaga atagaagcag 4680 ttcagcgtaa gtttgtgaga catgctttga gggatctacc gtggcgtgat ccgttgaact 4740 tgcctcctta cgaccatcgt tgcagattgc tgggactgga accgctgcac gtccggcgca 4800 aaaatagccg agctgtttat ggatcgaaaa ttgttcgtac ggaggttgat tgtcctgctt 4860 tgcttcaaca tgtgcaattt tatgctccag agcgcgtact acggacaagg caactgatcc 4920 agatagcttc caggaatact ggttatggag caaatgatcc gttaaattcc atttgcaatg 4980 aattccgaca agcttatggt gtatttgatt ttaacttgac gacaaatact tttagacaac 5040 gcttaagcag aacacgtttt ataaattgac tatgtttaca ttttttattg tgttagtatt 5100 agtatttcat taagaccatg atgtccgatg gaagtaatca aataaataaa taaataaata 5160 aa 5162 // ID Gypsy-229_AA-LTR repbase; DNA; INV; 228 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-229_AA_; KW Gypsy-229_AA-I; Gypsy-229_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-228 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1062-1062 (2011). XX DR [2] (Consensus) XX SQ Sequence 228 BP; 86 A; 38 C; 46 G; 58 T; 0 other; tgagtaaaaa tgtatataat atggctaaat aagcccttga gaataaatgt gtaaaaacac 60 gaatgaattc cgttgaacag ttaacgaatg cataaaaggt tgtctgatga tgaccacgca 120 gacgaaggtc tgaactgagc cgaaataaag aacgataagc agtcgtattt gatacctcaa 180 agcaacaagt ttacttttac caatcgtatc acagtcttgc ttagggca 228 // ID BEL-173_AA-LTR repbase; DNA; INV; 654 BP. XX AC AAGE02025208; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-173_AA_; KW BEL-173_AA-I; BEL-173_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-654 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025208; Positions 12666 12013. XX SQ Sequence 654 BP; 217 A; 124 C; 106 G; 207 T; 0 other; tgtcacgatg aaacccctcg tcgcgtcaaa gctgcaaaca acaagcagcg cttcgaaagc 60 cgagctgtca acctgagccg tcgttgacgt taccatacgt gcgttgtcag actagaggga 120 ttcgcagaaa ttgaagcata ttaccaacaa cttacctaat ctacatacta tacgtaaaag 180 tacctattat ctaaaattga tcccattcga attgttagtg atacatgatg tacaagtagt 240 ttaaagacag tggatgaaat tatgaaatta tgtatgtaat ctacgaatga tctcctatac 300 aaattgttta tttatatctt cactttttgc cctttgtagt tatgctctaa gaatattcaa 360 tcagtgaatt caattattat acctttcaaa ttgaatatat agaagactcg aggtaattac 420 atataactag aactaaggta gctttgaatt aacctacaat tattcaatta ggtcttcgta 480 gcagtcagtc tgtgacacga tccgcaaaag ttttcctggg attttcacca aaattgtaag 540 acgacctctg taatttgttg aaaatgaatc actaaatata attctatttt tagcttaaag 600 cttaacccac aacataaggt ctgcttcatt aagatttggg aaatcttcgc caca 654 // ID TAQI_CC repbase; DNA; INV; 170 BP. XX AC M15723; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE C.captiva TaqI repetitive sequence. XX KW Repetitive sequence; TAQI_CC. XX OS Caledia captiva OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; OC Acridoidea; Acrididae; Gomphocerinae; Caledia. XX RN [1] RP 1-170 RA Arnold L.M., Appels R. and Shaw D.D.; RT "The heterochromatin of grasshoppers from the Caledia captiva RT species complex. I. Sequence evolution and conservation in a RT highly repeated DNA family."; RL Mol. Biol. Evol 3, 29-43 (1986). XX DR GenBank; M15723; Positions 502 671. XX SQ Sequence 170 BP; 46 A; 28 C; 34 G; 62 T; 0 other; gatgtgcgca ttctatgtac tattgcctgt ggtttgcttt attacttatt ttaacgctca 60 actcgccttt tttcattaca ttggcattaa atggagtgta tattatgaaa attgagagat 120 aaaaatactt agaagggctg ccactggaac agtggttcac tgcatagttc 170 // ID Copia-31_CQ-I repbase; DNA; INV; 4311 BP. XX AC AAWU01021680; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-31_CQ_; KW Copia-31_CQ-LTR; Copia-31_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4311 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 371-371 (2011). XX DR GenBank; AAWU01021680; Positions 40189 44499. XX CC Positions [1433-1969] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 122..3883 FT /product="Copia-31_CQ-I_1p" FT /translation="MAHATGPSPVVERLLGRENWCTWKFGMQTYLETEELW FT EAVEPNLKDDGTPEAIPKAKDRKARGKIILGLDPSLYVHVQGTTTAKGAWD FT KLEQTYEDNGLTRRWGLLHKLMTMNLTNSVSMEAYVTRMVATANQLGGVGF FT PISEEWVGMLLLAGLPQSYKPMVMALQNSGSKITGDLIKTKLLQEDHEPME FT TIEPAFAVKGKPSYKQENIGKPQSKGPKCHSCHKFGHIARDCPAKSSSKPP FT ERKGGSKPGRAFGAVLTTMDEMKSEDWFFDSGATCHLTRTKSLLRDAHQAD FT GKMFAANKGEMQVVAEGTAVLHPTCGEEIDVNGVQLIPELVVNLLSVGKIV FT DSGHTVVFGQKGCEVLDLEGTCIATGSRSNGLFKLEQDVPKVLVCQPTTVE FT LWHKRTGHLSIKNLTRLRNGMVTGVQFLEADSAAGCKICPMGRQTRLPFNK FT SGTRATEVLELVHSDICGPMEQVSIGGSRYYITFIDDRTRKMFIYFLRHKS FT EEEVTRCFKDFHSMSERQTGHKLKVLRTDNGTEYKNKGFEKLLSQLGIRHQ FT TTVEYTPEQNGLAERANRTIVEKARCLLFEANLPKTFWAEAAAVAVYLINR FT SPCTGIQTTPEEAWTGRKPDLSNLRVFGTTTMAHVPKQKRRKWDPKAVECI FT LIGYDEETKGYRLYNVKSKQIFMSRDVTFLDEGLADCDGRTTLKATNGPQS FT TYVRLDFEELVQEPEPVREPEPAQEPEPAQEPEQVEEEEASDTSSDEFEDA FT DDSVTLPALPPRQSSHPPQSQALRRSGRERAIPGKYKDFNVQCSGKRSSRV FT SDAPMMDGSSTSAARLENAASTAPPSSSNQRPVDDVFQHGLMLNDRGCPPH FT GKYSHVDVHDGDTGSNCSQRNLDDPCTHQEALTRDDSDRWRRAMQEEYDAL FT ISNGTWEMTELPADRKAIRCKWVYKTKVDADGKVDRYKARLVIKGYSQRKG FT VDYEETYSPVVRHSSLRYLFALAARMNLQIDQMDAITAFLQGELTEEIYME FT QPPCFDDQDKTNVCRLKKALYGLKQSSRVWNLKLDAALKKFGLTCADYDPC FT VYYKVVGEKVLFLALYVDDLLIFSNCKRWRKEVKDQLCGEFRMKNIGPAKH FT VLGIRVSRTKDSVSLDQEAYVEFVLARFQMSNCKPVATPMNAAENLTKSME FT PTDPEEAKRMANVLYKEAVGCLMYLGQCTRPDICHAVNVLCHFNDNPGDKH FT WNAVKHLMRYLRGTSKFKLTYRKDGNADIKGYTDADWAADCEGR" XX SQ Sequence 4311 BP; 1114 A; 1057 C; 1255 G; 885 T; 0 other; ggtgtgggtg gaaagtgtct aagcttctag aagattgaaa attttcgcaa gaagttttcc 60 caagaatttt aaacaagaag ttcacgcaag cagttttcct gaagaaccgt tttcctcaag 120 aatggcgcac gctacgggac catctccggt cgtcgaacga ttgctcggca gggagaattg 180 gtgcacgtgg aaatttggta tgcagaccta cttggaaacg gaggagctgt gggaagctgt 240 ggaaccgaac ctgaaggatg acggtacgcc tgaagcaatt ccgaaggcga aggaccgtaa 300 agctcgagga aagattattc ttggcctcga cccttcgttg tacgtccatg tccaggggac 360 aacaacggcc aaaggcgctt gggacaagct ggagcagacc tacgaagaca acggattgac 420 gcgtcgctgg ggtctcctgc acaagctgat gacaatgaac ttgaccaact ccgtatccat 480 ggaggcgtac gtgacgagaa tggttgctac cgcaaaccaa ctcggtggtg tcggattccc 540 gatttcggaa gaatgggttg gaatgctgct gcttgctggc ctacctcaaa gctacaagcc 600 gatggtgatg gcgctacaaa actctggatc gaagattacg ggcgacctga tcaagacaaa 660 gttgctgcag gaggatcacg agccgatgga gaccatcgaa ccagcgtttg cggtcaaggg 720 caagccgagc tacaagcagg agaacattgg taagccgcag tccaagggtc caaagtgcca 780 ttcctgtcac aagtttggac acatcgcgcg ggattgtccc gccaagtcgt cgtcgaaacc 840 gccggagaga aaaggtggta gcaagccagg tcgtgcattt ggtgccgtgc tgacgacgat 900 ggacgagatg aagagtgagg actggttctt cgattccgga gccacttgcc atctcacccg 960 gacgaagtcg ttgctgcgtg acgctcatca agcagatggg aagatgttcg ccgcaaacaa 1020 aggcgaaatg caggtcgtcg cggagggcac tgccgtgcta catccgacct gtggagaaga 1080 gatcgacgtc aacggagtgc agctgattcc ggaactggta gtcaatctac tgtcggtagg 1140 taaaattgtc gatagtggcc acaccgttgt tttcggccag aagggctgtg aagtgctcga 1200 tctcgagggc acgtgcatcg cgaccggaag tcgctcgaac ggactattca agctggaaca 1260 ggacgtaccc aaggtgctgg tgtgtcagcc gacaaccgtt gagctgtggc acaagcgtac 1320 cggtcatttg tcaatcaaga acttgaccag attgcgcaac ggcatggtga ctggagtcca 1380 attcctggag gctgatagtg ctgctggctg caaaatctgt cccatggggc gacagaccag 1440 gctgcctttc aacaagagtg gaaccagagc gaccgaggtg cttgagttgg tgcattccga 1500 tatctgcgga ccaatggagc aggtatccat cggtggaagt cgatattaca tcactttcat 1560 cgacgatcgg acgaggaaga tgttcatcta cttcctgcgt cacaaatccg aagaagaggt 1620 gactcgctgc ttcaaggact ttcacagcat gtcagagagg cagacaggac acaagctgaa 1680 ggttttgcgt acggacaacg ggacggagta caaaaacaag ggtttcgaga agctcctgag 1740 tcagctgggg attcgccacc agaccaccgt tgagtacaca cccgagcaaa acggtctagc 1800 ggagagggcc aaccggacga tcgtcgagaa ggcacgttgc ttgctattcg aggccaactt 1860 gccgaagacg ttttgggcgg aagcagctgc tgttgccgtg tatttgatca accgatctcc 1920 gtgcacgggg attcagacga caccagaaga agcctggact ggtcgtaaac cggatctgtc 1980 caatttgcga gtgtttggga cgacaacgat ggcccatgtc ccgaaacaga agcggcgcaa 2040 atgggatccg aaggcggttg aatgtatcct gatcgggtac gacgaggaga cgaagggcta 2100 tcgtctctac aacgtgaaat cgaagcagat cttcatgagc cgggacgtga ctttcctcga 2160 tgaaggattg gcggattgcg acggaaggac aactctcaaa gctaccaacg ggccacagtc 2220 gacgtacgtg aggttggatt ttgaagaact tgttcaagaa ccagaacctg ttcgagagcc 2280 ggaacctgct caagaaccgg aacctgctca agaacccgag caagtggaag aagaggaggc 2340 atccgataca agttcggacg agttcgagga tgctgatgac tccgtgacac ttcctgcgct 2400 cccgccgcga caatcttcac atccgcccca gtcacaggcg ttgaggcgca gcggtaggga 2460 gcgcgcaatc cctggcaagt ataaagattt caatgtccag tgcagtggta agcgctcttc 2520 gcgtgtttca gatgccccga tgatggacgg atcgtccact tcagctgccc gactagagaa 2580 tgctgccagc actgcacccc cgtcgtcatc aaatcaaaga ccagttgacg acgttttcca 2640 acatgggttg atgctcaacg atagggggtg cccaccccac ggcaagtatt cccatgttga 2700 tgtccacgat ggcgatactg gttccaattg ttcacagcgc aatctcgatg acccgtgtac 2760 gcaccaagaa gcgttgacac gtgacgattc cgaccgctgg agacgtgcga tgcaggagga 2820 gtacgatgct ctcatcagca acgggacctg ggagatgact gaactaccag ctgatcggaa 2880 ggcgattcga tgcaaatggg tctacaagac gaaggtggac gctgacggca aggtggatcg 2940 atacaaggca aggctggtga tcaaagggta ctctcagcga aagggggtgg attatgaaga 3000 aacctattcc cccgtggttc gtcatagttc cctgcgatac ctctttgcgc tcgctgccag 3060 gatgaatctc cagatcgacc agatggatgc catcaccgcg ttcctccaag gtgagctgac 3120 cgaggaaatc tatatggaac agccgccctg ttttgacgac caagacaaga cgaatgtttg 3180 ccgactcaag aaggcactct atggcttgaa gcagtccagc cgcgtgtgga acctcaagct 3240 ggatgctgcg ttgaagaagt ttggtctcac ttgtgccgac tacgacccgt gcgtatacta 3300 caaggtcgtc ggagagaagg ttctgtttct tgcgctgtac gtggatgact tgctgatctt 3360 cagcaactgc aagcgctgga ggaaggaggt caaggatcaa ctttgtggtg aatttcggat 3420 gaagaatatt ggcccagcta agcacgtctt gggcattcgt gtttcaagga ccaaggactc 3480 cgtttcactg gaccaagagg cgtacgttga gtttgtgttg gctcgtttcc agatgtccaa 3540 ctgcaagccg gttgccacac cgatgaacgc cgccgaaaat ctgaccaagt cgatggaacc 3600 aaccgatcct gaagaagcca agcggatggc gaacgtactc tacaaggagg cagtgggctg 3660 cctgatgtat ctgggacagt gcacccgtcc agatatatgc cacgccgtga acgttctgtg 3720 ccacttcaac gacaaccccg gagacaagca ctggaacgcc gtcaaacacc tgatgcggta 3780 cctgcgaggg acatcgaagt tcaagttgac gtatcggaag gacggtaacg cagacatcaa 3840 ggggtacacg gatgctgatt gggccgcaga ctgcgagggc cggtagtctg taacgggata 3900 cgttttcatc gcgcaaggtg gagctatttc gtggtgcagc aaacgacaac aaaccgtggc 3960 gttgtcaacc tgcgaggcgg agtacatggc cttgtctgcc acagtacaag aagctttgtg 4020 gtggaaacgt ctacgcgccc ggatcgagaa gaatgaggag atcgtcatct actgcgacaa 4080 ccagagcgcc atagccgttt cgaagaacgg tggtttccac tccagaacga aacacattga 4140 cattcgtcac cactttatac gggacacact agatcgtggc gatgtcgaga ttacctatat 4200 caacaccgaa gtgcaggtag ccgatggatt gaccaagcct ttacagaagc agaaactgga 4260 gttacatcgc gctgctatgg gacttcaaga tcactgattg aggaggagta a 4311 // ID Gypsy-8_OD-I repbase; DNA; INV; 5725 BP. XX AC CABV01000161; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_OD_; KW Gypsy-8_OD-LTR; Gypsy-8_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5725 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000161; Positions 18717 12993. XX CC Positions [3945-4418] - Integrase core CC 'GAGA' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 215..1453 FT /product="Gypsy-8_OD-I_1p" FT /translation="MSEIYYKHVADLSTKELMQLYNSLDKSHFPAHFPIVD FT KKSVVLNIFNIEPDFFKDDPKFVFEKNQQGEVTYLKDYSDHCIQKFARPHQ FT KVNLEKIFSLLKASSIAEILANQSDVEQSDLDDEQDDDDDDLNETVRENDP FT KVKTETNESKPRPSAPRRSKPKSTKTSSKSKENTAVKSDYRVKAPKYYSSM FT PIETWLRNMEIFGKCSDIEDSKLITVAISNLLSGDAGANIIESLDDDEMAD FT WNLFKRNLVSSLAQSDEFYKNQFHKYQRGSDSFGLCMANLKAYFKKGYKKT FT SLNDDNREAILERFINCQNPRLKEILKREQSQLTIENAAKRATEIEISIPS FT SENLFMADAQQAENPAISELCALLKDQMKRSENASSSKMTKSSPRRGGGFK FT EASRSLHLICQTTKMQEDD" FT CDS 1620..4976 FT /product="Gypsy-8_OD-I_2p" FT /translation="MLDSGCSRTCIRSDLPFIEDSIKSDIKLTCANQETVS FT TRLTKNPIHMTAISDTGTLSISTTPLITDRLSVPIIIGLDVLHDITISKSS FT AFVELNGHRIQTVAPQLSCNSLRISAIIEKVEEEESLRNFQSRRLKLAETS FT KFSPIIGSFGDATESQRSSLQHLVDKFRLAFTMGDSDLGQLFHYRFSLPFH FT DETQTSHQPPRPIPIHIRDQVANEIKTWQDLGIISETQSSNNIPLIILRKP FT DKSIRISLDARGLNQLLVKDRFPLPHMTTVFSRIGIKLAAGESCFISSLDL FT SRAYWQVRVAEEDAFKLAFSYNGRHYQANRMLYGTATAPSAFSRIMGKIMT FT HPSIIIYLDDLICIDSCFSEHLKTLEFIFKTCYDHGLLLSAKKCNLCMRET FT DFLGHKISSDGISPTSKHLKAVEDFQPPTDRNELKRYLGMVNYNIKFIKSG FT SAILAPLYELTSHKVDFKWTDRQQSAFEKIKQKLLKKPTLAHFQLGSKLLL FT TTDSSGKTIGATLYQYQNNDLRTIGYFSKALQGPDLKRPMRIKELFALTWA FT IKHFEFFLIGTEFDAYVDHKSLIYLFREQQQSRLDIKLTNVHNYLMQFDFK FT IIHKPGNDPAMASADFFSRLPTSKSSDLDEEAAIYTDLPETVFMFHVEDQQ FT SNGNIFAFENRVYNAAEFSSLQNQCDSCKNLIAKTKLLKKSSFKFVNDVLY FT NGERLVLPSSLADEFINYLHLITGHAGSKQILHMLRKFYISNVQERVRAIT FT SSCATCIRIKPSKQLKPSMIQNRHFESVPFEKSFIDLVDYGRPDSTGKRYL FT LTCCDSLTGYLDGEPISSKSDKLVARSLLKIILLHGMSGVCVSDNGREFGP FT LCKEIFEKFEIRHVTTTAYRSCSNGKLERLHREIHIHLKNMNANDRNWSLF FT WPEACYYINNLPKATLDGLSANEALFGRSFHLPYFQADNNDRHKEPFITSL FT SKYLKELHPSLLTFQMERYQKQLKKDTNNCPILEIGTRVLAWKPDILSGKL FT GCNWSGPYKVYRRISKDSYIVKCEKTNREYRRHISLLRPLKIRTDCKTQFI FT TESPTIDDSNLTTDDDSDSTPEIIEPKSTNTSTESEDKEEPMESSLKNLFK FT EDWSNRLRPRL" XX SQ Sequence 5725 BP; 1800 A; 1398 C; 1042 G; 1485 T; 0 other; aattggtgac tgaaagagtt caacccagcc tgcaagagaa ccacgatagt ttttaaggag 60 aaatatagag atttttttaa aacagaggtt tactatagga tttccagcga gaatcaaagc 120 aagcggagtg acaaccttag gttgcaccac cgataggttc tcgtttagga ttttaataaa 180 tttactgaag tcctagaaat tttgaatttt caacatgtct gaaatttatt ataaacatgt 240 tgctgacctt agtacgaagg agctcatgca actttataat tcactcgata aaagccactt 300 cccagctcac tttcccattg tggacaaaaa gagcgtcgtt ctgaacattt ttaacatcga 360 gcctgacttt ttcaaagatg acccaaagtt cgtgttcgag aaaaatcagc agggagaagt 420 gacttatctt aaagattatt cagaccattg tattcaaaag tttgctaggc cacatcagaa 480 agttaacctt gaaaaaatat tctctctctt gaaagccagc tctatcgcgg aaattcttgc 540 aaatcaatca gatgttgagc aatcagatct cgacgatgag caggatgacg atgacgacga 600 cttgaacgag acggtccgcg agaacgaccc taaagtcaag acggagacaa acgaatcaaa 660 acctcgaccg tcagctccgc gacgctcaaa gccaaaatct acaaagacgt catcaaaatc 720 aaaggagaat actgccgtca agtctgacta tagagtcaag gcgccgaagt actactcttc 780 catgccgatc gaaacttggc tccgtaacat ggaaattttc ggcaaatgct ccgacatcga 840 ggactcaaaa ctgatcacag ttgcaatctc caatctcctc agcggtgatg ctggagcaaa 900 catcatcgag agtctcgacg acgacgagat ggctgattgg aatctcttca aacgcaattt 960 ggtcagctcc cttgcccaat ctgatgaatt ctacaaaaac cagtttcaca agtatcagcg 1020 aggatctgac agctttggcc tctgcatggc caacctcaaa gcctacttca aaaagggcta 1080 caagaagaca tccctcaatg atgacaatcg cgaagcgatt ctggaacgat tcatcaactg 1140 ccaaaatccg cgactcaaag agattctcaa gcgtgaacaa tctcaactca cgatcgagaa 1200 tgcggcgaag agagcaactg aaatcgagat ctcgattcca tcatccgaga atctgtttat 1260 ggcagacgct caacaagccg aaaatccagc tatatcagag ctatgtgcgt tgctcaagga 1320 tcagatgaaa cgatcggaga acgcaagctc gtcaaaaatg actaaaagtt ctccacgtag 1380 aggaggtgga ttcaaagaag cttcaaggtc attgcatctc atttgtcaaa cgacaaaaat 1440 gcaagaagat gactaactgc aagtacctcc acagcgacga tcctccaaag tcagttgtcg 1500 actacgtcaa aagtctatga ctaaacgaca ctcctagatt cctatgtccg aatttttcgt 1560 ctgcgcctac atcactgaag ttcatcaacg tacgaattgg cgaatacgtc tatccagcaa 1620 tgctcgacag tggctgttcg cgaacctgta tcagatctga ccttccattt atagaggatt 1680 caatcaaatc cgatataaag cttacttgtg cgaatcaaga aactgtaagt acgcgcctta 1740 cgaaaaatcc gatacatatg actgcaatct ctgacactgg aaccctatca atctcgacaa 1800 cccctctcat cactgatcgt ctttctgtgc caatcatcat tggacttgat gtcctccacg 1860 acatcacaat ttcaaaatcg tctgctttcg ttgaacttaa tggacatcgc atccaaactg 1920 ttgcccctca actttcctgc aattctcttc gcatctccgc gatcatcgag aaagttgaag 1980 aagaagagtc acttcgtaac ttccagtcaa ggagattgaa gcttgctgaa acttcaaaat 2040 tctctccaat aattggctca ttcggtgatg caactgagtc ccaaagaagt tctctacaac 2100 atctcgtcga caaatttcga cttgcattta caatgggcga cagcgactta ggacaactct 2160 tccactaccg cttcagccta ccatttcatg atgagaccca gacttctcat caacctccac 2220 gtccgattcc aattcacatt cgcgatcaag ttgccaatga aatcaaaaca tggcaagatc 2280 ttggaatcat ttcggaaact cagagttcga ataatatccc tctgataatt cttcgcaaac 2340 cagataagtc catccgaatc tctctcgatg ctcgaggtct caatcaactg ttggtgaagg 2400 accgatttcc cttaccacac atgacgaccg tcttcagcag aataggcatc aagcttgctg 2460 ctggagaatc ctgcttcatc tcgtcacttg acctgagcag agcatattgg caagtacgag 2520 tcgccgaaga agacgcattc aaactggcgt tttcctacaa tggacgtcat tatcaggcaa 2580 atcgcatgtt gtatggaact gcaactgctc caagtgcttt ctccagaata atggggaaga 2640 tcatgactca tccatcaatc atcatctact tggacgacct gatctgcatc gactcttgtt 2700 tttcagagca cctcaaaact ctggaattca tattcaaaac ttgctacgac cacgggcttc 2760 tgctttctgc aaagaaatgt aacctgtgca tgagagaaac cgacttcctt ggacataaaa 2820 tctcctcaga tggaatatcg cccacttcaa aacatctgaa agcagttgaa gactttcaac 2880 ctccaactga tcgtaacgag ctcaaacgtt accttgggat ggtcaactac aacatcaaat 2940 tcatcaaatc tggttctgca attctagcgc cactttacga actaacaagt cataaagttg 3000 acttcaagtg gacagacagg cagcaatctg ccttcgagaa gatcaaacaa aaacttctca 3060 aaaagccgac actagcacat tttcagcttg gttcgaaatt actgcttact acagacagtt 3120 ccggaaagac aattggtgca acgctgtatc aatatcaaaa caacgatctg cgaacaattg 3180 gctacttctc gaaagcgctt caaggtcccg acttaaagcg cccaatgaga atcaaagagc 3240 tatttgcact gacttgggca atcaaacact tcgagttttt cctaattgga actgagtttg 3300 atgcatatgt cgatcataag tcgcttatct acctgtttcg tgagcaacag caatcgcgac 3360 tcgacataaa actcacgaac gttcacaact atcttatgca gttcgacttc aaaatcatcc 3420 acaaacctgg aaatgacccg gctatggctt ccgccgactt cttctcgaga cttccgacat 3480 cgaagagcag tgacctcgat gaagaagcag caatttacac tgatctacca gaaaccgtct 3540 tcatgttcca tgttgaagat cagcaatcaa atggcaatat ctttgcattt gaaaacagag 3600 tgtacaatgc ggcggagttt tcctctcttc aaaatcaatg tgattcatgc aaaaacttga 3660 ttgcgaagac aaagctgctc aagaaatcat cattcaaatt cgtcaatgat gtcctctaca 3720 atggtgaacg tctcgttcta ccatcatctc tcgctgatga gtttataaat tacctccatc 3780 tcatcactgg ccacgctggt tcgaagcaaa ttctacacat gctacgaaaa ttctacattt 3840 ctaacgttca agaaagagtt cgagcaatca caagctcctg cgcaacctgt attcgtatca 3900 aaccgtcaaa gcaactgaag ccgtcgatga tccaaaatcg acacttcgaa tctgttccgt 3960 ttgaaaagtc ttttatcgat cttgtcgatt acggaagacc agacagcact ggaaagagat 4020 acctgctcac atgttgcgac agtctaactg gatacctcga cggtgagccg atctcgtcaa 4080 aatcggacaa acttgttgcc agaagccttc tcaagatcat cctgctacat ggaatgtctg 4140 gtgtttgcgt ctctgacaat ggcagagaat ttggacctct ttgtaaagaa attttcgaga 4200 agtttgaaat tcgtcatgtc accacaacag catatcgcag ctgctcaaat ggaaaactcg 4260 aacgtctcca cagggaaatt cacattcacc tgaagaatat gaacgctaat gaccgaaatt 4320 ggtctctctt ctggccagaa gcttgctact acatcaataa tctaccaaaa gcaactcttg 4380 atggactctc agcaaacgaa gcacttttcg gaagatcatt ccaccttcca tatttccaag 4440 ccgacaacaa tgaccgacac aaagaaccat ttatcacaag tttatccaag tatctcaaag 4500 aacttcatcc ttcattgttg acatttcaaa tggagcgtta tcagaagcag ctcaaaaagg 4560 acacaaacaa ttgtcctatt ctcgaaattg gtacaagagt tcttgcttgg aagcctgaca 4620 tcctgtctgg aaaacttggc tgcaattggt ccggccctta caaagtgtat cgacgaataa 4680 gcaaagactc ctacattgtc aaatgtgaaa agacaaatcg ggagtatcga cgccatataa 4740 gccttcttag gcctctcaaa attcgaactg actgcaagac tcagttcata acagagtctc 4800 caacaataga tgactctaac ttgaccacag acgatgactc agactcaact ccggaaatca 4860 tcgagccaaa atcaacgaat acttccacag agtcagagga caaagaagaa cccatggagt 4920 catcactcaa gaatcttttc aaagaagact ggtcaaatcg acttcgtcca cgactataat 4980 ctctttttct ttcaaagggg gaagaagaca tcgcatatct tcccgaactt tctggtcctg 5040 gccaccataa cccctatcta cagatcagct tgtcaccgat tggataggct gttgacgata 5100 cagacaagca agacctgcaa ttcttccgtg cacaaatttc ctttcgacaa actaacactg 5160 tttgtggatg tggaccccat acaagcagta caggcaggcc aacataatct cttcaaagtt 5220 tcaaccgaat cgacacttcc tccaacctgc agtacgaaga agatgaacat tttacctaaa 5280 caatgacgag tttatgttga agtacataag atgaagtgtg actggcgtcc caactcctat 5340 aaactacctt acatgtactt cgtaagatcg gcaacactaa cagtatattc ccgtatacta 5400 aaaaacagcc aatttatact cctggatgat ccgctaacaa tcatcaaatc agcaaagcta 5460 cgactcaatt tgcgaataca tccggctcga tttcaatttc aacttcgcaa aaacgacctg 5520 atgagtcctc tcttctcagc tgttattctt taagaactgt ttcaaaattc taacattctt 5580 ttattctaat ctttcttctt ttataatctt actttacgtt tctgataact ttctagaacg 5640 agttaactgt atttcaactc tttttacgtg aacactttct ttgaaaaacc agaacaaact 5700 tcttcttttc aaaaggggga agatt 5725 // ID BEL-7_DPu-I repbase; DNA; INV; 8142 BP. XX AC scaffold_26; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_DPu_; KW BEL-7_DPu-LTR; BEL-7_DPu-I. XX NM BEL-7_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-8142 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 661-661 (2010). XX DR Genome; scaffold_26; Positions 1028037 1036178. XX CC Positions [7041-7604] - Integrase core CC 'CTACC' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS join(3759..5267,5271..8120) FT /product="BEL-7_DPu-I_1p" FT /translation="MLLGAKPSTKEFPHRKLFCPFCTGDHWPTDCDTAKTV FT EERYDIAKVKKLCFNCLRKSHQASSDCPSKYRCRVCNRAHHTSLHKADPSR FT VTGATILSWSPPTSVVLSATDVGEQNSFVFLETAIAKVQSQFVKLDGNILV FT DKGAQRTFITSKLTKLLNLPPLRRESLILFGFTSCRGVAEHYDVVQFSIID FT RHGSPIVVQAIVIPHIVDPLSDPHRAELLSLPHLKKLKLAHPVSTKSTFEV FT DILVGADTYWNIVGDQVIRGSGPTAVDSKIGYLISGRLQYSGEKVEQKTLG FT LHISVEEALDVTKFWNLETLGIQPDLESTKTTEFYQKNSVEFRDGKYVAKL FT PWSENHPPLSSNLQVCQKRTRGTVNRLASKNPNGLKLYSKIILEQLDRGFI FT EKVPSSEMSKPSHYIPHFGVFKESATTPLRIVYDFSCKTPTGVSLNDCLEV FT GPPLQNDMLAILLRFRAHTIGLTADIEKAFHQVGLHEQDRDFVRFLWLKDP FT YDSKSDFLYRFRVIPFGASSSPFILLSVIKKHLQNSSSLLADDINRNIYVD FT NLISGCETSEEAVSYFSEANNVLKSAGLNLQSWGSNDDQLTVKAKSEGVGD FT KSQVTKVLGLDWEREEDRLHVPCVKLSHLSHPQTSKREVLRGISATYDPLG FT FITPLTIPARMLIQGIWKLKLDWDDPIPFELIERWKGIATSIEDSKTSFNR FT SYFKTGEIKELHVFVDASQLAYGAVAYFCNDDSSSFVFSKSRVAPLKTEKK FT LLTIPQTELMAAVIGTRVASSILSALLPLGLKPNCYLWSDSQIVLYWIQKM FT GKIKCQFVHNRVETIRSFTRDTNASWNYCPTTCNPADLLSRGSTLRQFLSS FT DIWLTGPSWLPKRSEWPSWEGNSEPAAVFHLSVITSASASNPPPSHDGGIE FT KILDISRYKFSYLIRVTAIVRRFIGNIKLKDKSRASWKLGYITVSELQEAE FT KVWLLAVQKAYFTEEINYCTKPTVKRPALVSQLDLFIGQDGLLRCNGRLSN FT SQLKKDARHPILLPKNSTLSTLVIAAHHALMMHGGVKLTVASIRQRYWIPQ FT IVQSVKKCLKKCVNCSRVRGPPYTAPNHAPLPVSRSSFSFPFTVTGIDFTG FT AFTIRGPKEAPKVDRTVYILLFTCASTRAIHLEVVEDMTTLSFLDAFGCFV FT SHHSRPAVIMSDNAKTFEKGAKLFQKIFRDPIVTRRLSDQQIEWRFIPKRA FT PWYGGFWERLIGLAKSALSKILGRTKPTLSAFRALVADAEVVLNDRPLENP FT SSSVGDEESLSPAHLMYGRRLNTLPYSEETNEEKFDASYGDKPDELKKAVA FT RHQTLLQHFEKRFLASYLPALREYHQATKKSNPTVIKERDVVLVHDDKPRK FT EWKMAVVEKLIRSQDGQIRAADIRTANGKTNRPISKLYPLQVAEPSDDPAG FT SSDGPLSDPAPSSPSRPTRKCAAQTNAAIKKIFEQEDAE" XX SQ Sequence 8142 BP; 2316 A; 1952 C; 1647 G; 2227 T; 0 other; ttggtgccgt aaccaggatt tgtcggccta gcgtggtcgg caatatcttc gtccctcgtg 60 ttcatctctt gttccttgtg cgattaacac ttactttgac tttaaaacta acttccctga 120 aattttaaac atcggatgtt tgaattatca ccctgattta tcgagattcg tgacacgtgt 180 cgctcatgac gaatctcgtg gctgtgctca gcctcgtgtt ctcgatcttt cgttagccac 240 cgcgtgctga cctcgtggcc agacttggcc aatctctact ctcatttcgt ctctcaagtg 300 aatagtcgac tggcttttct ctcatctttc acagcagtga aatcatcaac ctcgtggcca 360 cgcttggccc aactcttcct ctggttacca agcttggttc ccttcgctag caactgacaa 420 gctagacatc atcatctttt acgctcacca tttccttcga aaaaaatggc gagcgtggct 480 actctcacgt ccaaacgagg tggcaatcgc ggtgcggtca ctcgactcat cgccaaactt 540 tccgacatca tcaacgatgc tgccatggac agagaccgaa aaatccacga actcaacaag 600 aagttggaag atcttcacga caaaatcaag gtagttgaaa cgcttgattc tgaaatcgtt 660 gagctattac ccgctgctga cgtggaagct gaaatgtcca atgctggtac taccaatgct 720 gtagcgtacg acgctcgcga cgcagcagaa tttactctca agactctgat ggatgaaaaa 780 gccgcagaag tagcagcagc tgctgctgcc tctgcggctg cagcagcagc tgcgaatcca 840 aatcctccag cacctacaat cactccaacc atcaacgtga ctgctgccac tgattcaagt 900 catctcccca agtttaatct ccctgaattt ggtggaaata ttctactctg gaatgcgttc 960 tgggacgtat tcgaagtgga agttcatctc aagaagagct actccaatgc tactaagttc 1020 aactttctta actctcgtct atccggcgac gcaaaggcgc ttcttctcgg cctcgtgcca 1080 accaacgaca actacaatgt cgccatcgat ctagggtaaa acagcgaatc cggacacggc 1140 ccgagaaaat gcgaattccg gacacaaaat gctatttttt aagttttgta aatacgagaa 1200 aacggaaaga ttataataaa atctattaaa tgagctttga actgcaaggg aggatacata 1260 tttgggcgaa agggcattaa aaaaatattg acgattaatc aagatatctt taattgaaat 1320 tatctttttc gcttacgggc taaacagcgt ttctgtgtat tggctgctag ctgacaactt 1380 tacgccgtaa ggaatatcaa aagtaaaaac tgagacaatc taataaataa ttatataaaa 1440 agatagagct tgaaaaactc tacattactg tataaatatt tgtcaatatt gtgcagagtt 1500 tacgaaatat tagcatacca ggaaatgtat aaattatgcg aatgtccgag cagctgatgc 1560 gaatgtcgga cataaacaat actcaaatta aaattttcat gattttatga agtatgtata 1620 tcaatcgacg cagaatttca aggcaaatcg attgattacc tagtgtatac gtttaattgc 1680 tttatttagc tttatatgcg aaaacaattt ttgagtgaac aaggttcgag tcttgccaga 1740 tccaaaaatg tagtcgttat aataattttg ttgttttttt atttcgcaaa taatatcttc 1800 aacgagcgta ggtttctagt aatattcgtg aagttttcat aattttattt caagtattca 1860 attagttaat agcttctgtt tctgaaaatg cgctagctta tgtccggaat tgatttattt 1920 gtaaaatact tcagtttttt ttctagtttt tagcttaatt tagtacctct agccctagtg 1980 tacgataatt gagcaagaaa aatgcacctt taataagaaa ataacgaata ttttcgatag 2040 tgtgcttgta gtcattctcc cataaggtgc cgtgatgcga agtccggaca gttactgact 2100 gctcattatt ccttatcgat ttggtaaatt tattaatata tgttgtggct ctgtggataa 2160 aagttgccgt caagataacg aggctagttg ttcgaatccg ggttaattcc cgcatatgaa 2220 tcaagttgcg atattaatac agccaaatga ttgaattaat cttctctgac gaatgttgta 2280 agatattttt ttatagactt tgaaaaagaa ttttttagct taagagaaag gcgaacagga 2340 aaaggagata tgttatctat aggatttgaa ccttaaggtc tcagatccgt aatctaaggc 2400 tttaccactt ggcaaccgaa acttacattt tgtctattat tttgacaata tattccaatg 2460 aactacttta attgtgcgta catagatctt tacataccaa taacccattt caaatttgcg 2520 ggtataatac agaacttcat gctagtacgt aaggaaaata tatttcaatg ctatctatgc 2580 aaatatgcgt tgagaaaatc atgaaaacag tgagatttga acattagacg atcagattca 2640 gagtctagcc tcttatcatt cagccatccg tcacaaattg attaaacgta aaatggaatt 2700 gatgttaatc tgcagtaacg ttgtccggaa ttcgcatcaa gggcttaaat gggggaacaa 2760 cgtcattaat atagttaaat tatatggata gtttgttaca tgtatgtagc tcaattttaa 2820 gaggggatta ttctgaacat accgatacca atgtttcttt aaaaaacatt tatctgaagt 2880 atgtattaaa tattcaatat ttcgttaagt tccggacatt cgcaacacaa atgtctaact 2940 ttacatacaa tttttttgaa aactccgcca cggattttag ataactttct tattttaaga 3000 ttctatgact attcgcttaa gtaaaagata gttaacttta aaatatataa catttttatg 3060 agaaatcatt tttgtgttgc ttgacaggtt tgaacccaag taaaaaagtt caggcagacg 3120 aaaaatggtt ttgacagctc aatgtttggg ttttttcaag tcaaaatatc tcaatatatt 3180 tctattgcac catacatggt gcctcaagca attcgattca gaattaaaaa ctgtatcgaa 3240 tgatataatt agtttcgaaa aatatttctg tccggaattc gaatttttgt ccggggcatt 3300 cgcattttgt ccgaacattc gcatttcggc tttacacgca aacgccataa ggaagattaa 3360 tagggaaatc gctgtccgga ttcgcttttt taccctactc aaaaagcgtt ttggccagcc 3420 ggcgaagatc atcatggctc acatgcgagc gctagtggcc ttaccgaagc ccggaactga 3480 ccgagcttcg ttaaggaagt tcgtggattc tctggagtcg cacatccgtg gactcgaagc 3540 tctgaacaag acgccagatt catatggcga tctgctcgtc tgcatccttc tcgacaaact 3600 ctcagctgat ctgcgtcgta acctggcacg acaaagtgac gccactgagt gggacctcga 3660 tatccttcga aagagcctgc tcaaagaaat cgaaattctc gaagacagtg aaagttcaat 3720 ctcccactca tcggcgttaa aacctccaaa gaaaaccaat gctcttggga gcaaagccgt 3780 ctacaaaaga gtttccccat cgcaagctct tttgtccctt ctgcaccgga gatcattggc 3840 ccaccgactg cgatacagca aaaactgtgg aagagcggta cgacatcgca aaggtaaaaa 3900 agctctgttt taactgttta agaaagagcc atcaggcatc atccgattgc ccatccaagt 3960 atcgctgtcg tgtatgcaat cgagctcatc acaccagtct gcacaaagcc gacccttcta 4020 gagttactgg cgctactatt ctttcctggt ctcccccaac gtctgtggtg ttatcggcca 4080 cagacgttgg agaacaaaac tcttttgtat ttttggaaac ggcaattgcc aaagttcagt 4140 cacaattcgt taaactggat ggcaacattt tagtcgacaa aggtgctcaa cgcacattca 4200 ttacgtcaaa gctgactaaa ttgttgaatc taccacccct cagacgtgag agcctcattc 4260 ttttcggttt cacttcgtgc cgtggagttg ccgaacacta cgacgttgta caattctcta 4320 ttatcgatcg tcatggatct cccatcgttg tgcaagcgat cgtcattcct cacatcgtgg 4380 accctctctc cgatccacat cgagccgagt tgttgtctct tccccatctg aaaaagctca 4440 agttggcaca cccagtgtcg acaaaatcaa cattcgaggt ggatattcta gtaggtgcag 4500 acacctactg gaatatcgtt ggtgatcagg tcatccgtgg atctggccca acggccgttg 4560 actcaaaaat cggctatctc atttctggac gtctacaata ttcaggtgaa aaagtcgagc 4620 agaaaactct tggtctgcat atctcggtgg aagaagcact cgacgtcact aaattctgga 4680 atcttgagac ccttggaatc cagccggatc tggagtcaac aaagacaact gagttttatc 4740 aaaagaattc agtcgagttt cgtgatggaa aatacgtcgc gaagcttcct tggagtgaaa 4800 atcatccgcc cctctcgtcc aatctacaag tttgccaaaa gcgaacgaga gggaccgtca 4860 accgactagc atcaaaaaat ccaaatggac tcaagctcta cagcaaaatc attctagaac 4920 agctggaccg aggcttcatc gagaaggttc catctagtga aatgagcaag ccatctcact 4980 acattcctca cttcggagtg tttaaagagt cggctactac accccttcgt attgtctacg 5040 atttttcttg caaaactcca actggagtga gtttgaacga ctgcctcgaa gttggccctc 5100 cactacagaa cgacatgtta gctattcttc tccgtttccg cgcacacaca atcggcttga 5160 cagccgacat tgaaaaggcg tttcatcaag ttggtctaca cgagcaggat cgagatttcg 5220 tgcggttcct ttggctgaag gacccatacg actcaaaatc ggatttttag ttgtaccgtt 5280 tccgcgtgat cccgtttggt gccagcagct cgcccttcat cctcctgtca gtgatcaaaa 5340 agcatctgca gaatagttca tcgctactag ctgacgacat caatcgcaac atttatgtcg 5400 acaacctcat ctcaggctgt gaaacttccg aagaggcagt gtcttacttt tctgaagcca 5460 acaatgtgtt aaaaagtgcc ggtctcaatc ttcaatcgtg ggggtcaaac gatgaccaac 5520 tcactgtgaa agcaaaaagt gaaggagtcg gcgacaaatc tcaagttact aaggtccttg 5580 gcctcgactg ggaacgtgaa gaggatcgtc ttcacgttcc ctgtgttaaa ctctctcatc 5640 tttcacatcc acaaacttcg aaacgggaag tgttgcgtgg catctcagct acctacgacc 5700 ctcttggctt cattactccc ctcaccattc cagcaagaat gctcatccag ggaatttgga 5760 aactcaagct ggattgggac gatcccattc cattcgagct gatagaaaga tggaaaggaa 5820 tcgcaacttc catcgaagac tccaaaactt ctttcaatcg ctcttacttc aagactggag 5880 aaatcaaaga gctccatgtt tttgttgatg caagccaact ggcctacggg gccgtcgcat 5940 atttttgcaa tgacgactcc tcttcgttcg tgttctccaa gtctcgcgtc gccccgctga 6000 aaacggagaa aaagttacta accattccgc agaccgagtt gatggcagct gtcatcggaa 6060 cccgtgtcgc gtcatccatt ctcagtgctc tcctccccct aggcctcaag ccaaactgtt 6120 atctgtggtc agacagtcag attgtgctct actggatcca aaaaatggga aaaatcaagt 6180 gtcaattcgt tcacaaccga gtcgagacca tccgcagttt cactcgagac actaatgctt 6240 catggaacta ttgtcccacc acttgcaatc cggcggatct tctatctcgt ggttctactc 6300 tccgtcagtt cctctcttcc gacatctggt taaccggacc ttcgtggctt ccaaagagaa 6360 gtgagtggcc gtcgtgggag ggaaattctg aaccagcagc agtgtttcac ctgtccgtca 6420 tcactagtgc atcagcatcc aatccaccac cctcccacga cggaggaatc gagaagattc 6480 tcgacatatc gcgatacaag ttctcttatc tcataagagt caccgcgatt gtccgacgat 6540 tcatcggaaa catcaagcta aaagacaaat ctcgagcaag ctggaagtta ggctacatca 6600 ctgtttccga actgcaagaa gcagaaaagg tttggcttct tgcagttcaa aaagcgtact 6660 tcactgaaga aatcaactac tgcacgaagc caaccgtaaa gcgtccagct ctcgtttctc 6720 aactagatct ctttatcggt caagacggtc tgttgcgctg caacggccgt ctctcgaatt 6780 ctcaactaaa gaaagacgca agacacccta tcctgctacc aaagaattcc actctctcca 6840 ctctcgtcat agccgctcat cacgctttga tgatgcacgg aggagtgaaa ctaacggttg 6900 catcaattcg ccaacgctat tggatcccgc aaatcgtcca aagtgtcaaa aagtgtttaa 6960 aaaagtgcgt taactgcagc cgtgtaagag gaccccctta cacggctcca aatcatgccc 7020 cgcttccagt gtcgcgttca tcattctctt ttccatttac tgtgaccggc attgacttca 7080 cgggagcgtt caccatacga ggtcccaaag aagctccaaa ggtagaccgg acggtctaca 7140 ttctactgtt cacctgtgcg tctactcgcg ctattcatct ggaagtagtc gaggatatga 7200 ctacactttc ctttctggac gccttcggct gctttgtctc tcaccattct cgcccagctg 7260 tcatcatgtc agacaacgca aaaacttttg agaaaggtgc taagcttttt cagaagattt 7320 tccgcgatcc aatcgttaca cgacggctct cagaccagca aatcgagtgg cgattcattc 7380 ctaaaagggc accgtggtac ggtggctttt gggaaaggct gattggcctt gccaaaagtg 7440 ctctttcaaa aattctcggt cgtaccaagc ctacactcag tgcatttcga gcactcgtcg 7500 ccgacgctga agtagtgctg aacgatcgcc ctcttgaaaa tccgtcgagc agcgtcggcg 7560 acgaagagtc actttcccca gctcatctga tgtatggccg acggctaaac acactaccct 7620 acagcgaaga aacaaacgaa gaaaaatttg atgcatcgta cggcgacaaa ccggacgagc 7680 tgaagaaagc cgtcgcccgc catcaaactc tgcttcaaca tttcgaaaaa cgttttctgg 7740 cttcctatct ccctgccctt cgcgagtacc atcaagcaac gaaaaagagt aatccaacag 7800 tcatcaaaga aagagatgtt gttctcgttc acgacgacaa accacgaaag gaatggaaaa 7860 tggctgtcgt cgagaagctc attcgcagtc aagatggaca gattcgcgct gccgacatcc 7920 gcacggcaaa cggaaaaact aatcgcccca tctccaagct ctacccacta caagttgctg 7980 aaccgtcgga cgatccggcc ggatcgtccg acggcccact ttcggatcca gctccgtcgt 8040 caccttctag accaacccga aaatgtgccg cccaaaccaa cgcagcgatc aagaaaatct 8100 ttgaacaaga agacgctgag tagctgcctg gccgcgggag aa 8142 // ID LTRP1 repbase; DNA; INV; 215 BP. XX AC L42495; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 2.02, Last updated, Version 3) XX DE Leishmania tropica DNA repeat. XX KW LTR Retrotransposon; Transposable Element; LTRP1; KW Repetitive element. XX NM LTRP1. XX OS Leishmania tropica OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania tropica species complex. XX RN [1] RP 1-215 RA Piarroux R., Fontes M., Perasso R., Gambarelli F., Joblet C., RA Dumon H. and Quilici M.; RT "Phylogenetic relationships between Old World Leishmania strains RT revealed by analysis of a repetitive DNA sequence."; RL Mol Biochem Parasitol 73(1-2), 249-252 (1995). XX DR GenBank; L42495; Positions 1 215. XX SQ Sequence 215 BP; 51 A; 68 C; 65 G; 31 T; 0 other; gcaagaatca agaggcagtg tcacagagat gcgcgaaggg gggcggtggg agcgggagag 60 agaccgcggg cacgtggcga cgtccgtgga accaaaaaaa agcagaagac gagtattccc 120 ttttgctgat gtgtgaccac ctctctgcca cagatcacga gctcagctcc cctccaccct 180 aacgcctccc ccgcgcggcc ctgtcacagg ctccc 215 // ID PENEL1_NVi repbase; DNA; INV; 3197 BP. XX AC . XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE Penelope-type element: consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW PENEL1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3197 RA Jurka J.; RT "Penel1_NVi: Penelope-type element from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1176-1176 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1064..2719 FT /product="PENEL1_NVi_1p" FT /translation="MFTLADKGNVTVAMLRSDYIEKMELQLSDRNTYTEIA FT KNPLSKIQNKVKKFMNNWNTNGFFGKKYSKFKFTQTDTTLPKCYGLPKIHK FT KDIALRPIISTIGSPTYFLSKSLLKILQNSIKKPDSYVENSYDFIEKIKNV FT VVPPNHRLISLDVKSLYTNIDKSLVKKSIEKRYIQIVQKSEIPLNEILLAC FT ELLMENTFFQFNGKFYKQLFGTPMGSPISGFFADIVMDDLETDCLSKLSFK FT PTFFYRYVDDIITCVPNDKVDEILKTFNSYNNRLQFTHETEIDNSISFLDV FT LLIKDNDKIVTDWYTKPTFSGRFLNYTSQHPMSQKIAMVYNLVDRAIKLSN FT KKFHNKNIQNIKKLLHENHYPPAFVNKYIRKRMSTLTSNTLSRQQISSNEK FT RNIVSLPYIQGFYEQTSHAFKTLNIETVPKINSQMKNIIIKGKDKKDKDLE FT NNVVYKLECNDCSATYVGQTKRNVCIRINEHKKSKNTVISDHVNNLNHTFN FT FENYHIKDREINLNKRLVSEMLHIHLQENGINKKEDTQNLNTVYTHLIHQL FT KTKQT" XX SQ Sequence 3197 BP; 1239 A; 666 C; 454 G; 838 T; 0 other; gtgccttctc gacaaaactt tgtaccgatc ttctccataa ttgtcagttc ttcacactgt 60 acatatgttg ttatacaaca agtttttaat aaatttttcc acatcagtct cctgatgagg 120 gcggcattgt ccgtcgaaac gtcgagaaga aaagagaaaa tgtcttaagt tttttctctc 180 ctttctcacc acacaacctg ccccgggaag accgatagag acaaaccaat atttttgatc 240 accaccgagg gacagcatca gatagagaaa aggtttttct ctaaatatat ttttgttatc 300 cgacaccggt ctcatccacg actccttctt caacaacatc gagaggaatt agaatcgact 360 ctcgacatct tttgaagatt tcctacaaca gaacttgaca agcgccagat cacaacgact 420 tctttcggta cagaacgcac gacaagccta caaacgtcac caacatcgca agtaacctaa 480 gccattttaa ttttcgcagc aattcgggta gatgtaaatt caacaattat ctcaaatttc 540 caaagcaaaa tcctgaatta gaaatcagat ctctatacac atgaattttt taaaacttaa 600 atccgaaaaa ttaaaagaac atttacaaga aataatacca cataacgtca tcagtatttt 660 cgaagatttg aataatgatc acggcgacaa acttaacaaa attaaaaaat tttctaagca 720 ttaacgacaa atacgaactc cgaatcgaat tgcgaaaaat ggtttcaaaa tctcatcgat 780 atagacatat caaccgatct ataatttgta acgtcactag gacaaaaatt taatccaaaa 840 agtgaacttg atgacacaaa aacaatcgaa tgtataaaaa acatagaaaa cttgattact 900 tacaataact tatcggataa aaccgccaat gacatacgaa atatactgat caaaaaaatc 960 acacaacaca gacacaaaac taaacacata aaaatagaag aaacaatttt taagaaaaaa 1020 ttacaagaca ctaaagattt tattaaaaat aacaaaaaac atcatgttca ctttagccga 1080 caaaggaaat gtgaccgttg cgatgctgag atctgattac atcgaaaaaa tggaactaca 1140 attgtcagac agaaatacat acactgaaat agcgaaaaat ccattatcaa aaatccagaa 1200 caaagttaaa aaattcatga acaattggaa cactaacggc tttttcggaa aaaaatacag 1260 taaattcaaa ttcacacaaa cagacaccac cttgccaaaa tgttatggac tacctaaaat 1320 tcacaaaaaa gatatcgctt tacgacctat tatctcaaca ataggctcac ctacgtattt 1380 tttatcgaaa agccttctta aaatattaca aaatagcatt aaaaaacctg actcatacgt 1440 agaaaacagc tatgatttca tagaaaaaat taaaaatgtt gtagtcccac cgaaccaccg 1500 cttaatcagc ctcgatgtaa aatcactcta taccaacatt gacaaatcat tagtcaagaa 1560 gagcatcgaa aaaagatata tacaaattgt acaaaaatct gaaatacctc tcaatgaaat 1620 cttgttggct tgtgaattac ttatggagaa tacgtttttt caattcaatg gcaaatttta 1680 taaacagcta tttggcacac caatgggctc tccgatctcg ggtttctttg ctgacattgt 1740 catggatgat ctcgaaacgg attgcctatc taaactatcg ttcaaaccaa cattctttta 1800 tcgatatgtc gacgacatta ttacatgtgt gccgaatgac aaagtcgacg aaattttaaa 1860 aacattcaat tcttacaaca atagattaca atttactcac gagactgaga tcgataactc 1920 cataagtttt ctagatgttc tattgattaa agacaatgat aaaatagtaa ctgattggta 1980 caccaaaccg accttctctg gcagattttt aaattacaca tcacaacatc ctatgtctca 2040 aaaaatcgcc atggtgtaca atcttgtcga tagagcgata aaattatcca acaagaaatt 2100 tcacaataag aacattcaaa acatcaaaaa attactgcat gaaaatcatt atccacccgc 2160 cttcgttaac aaatatatac gaaaaagaat gtcaacgtta acatcaaaca cattgtcaag 2220 gcaacagatt agcagtaatg aaaaacgcaa tattgtaagc ctaccctata ttcaaggttt 2280 ttacgagcag accagtcatg cattcaagac tctcaacatc gaaactgtac cgaaaataaa 2340 tagccaaatg aaaaacatca ttataaaagg caaagacaag aaagataaag atctagaaaa 2400 caacgttgtc tataaacttg aatgtaatga ttgctctgcc acgtatgtag gccaaacaaa 2460 aaggaatgtg tgtatacgta taaatgaaca caaaaaatcg aaaaatactg taatctccga 2520 tcacgtcaac aatctgaatc acactttcaa ttttgaaaac taccacatta aagacagaga 2580 gatcaattta aataaaagac tagtttccga aatgctacac atacacctac aagaaaacgg 2640 tattaataag aaggaagaca ctcaaaattt aaatacagta tacacacacc ttatacacca 2700 gctgaaaaca aaacaaacat aagccaggct ctgacgtcaa tagcggaaga aattgtcaac 2760 acggccgctc gactctcttc tcactgctcc gcgctatata aacacgacga aaatctcgac 2820 tcgcgcgaat tagaatcgaa cagctgagag agaaacactg ttcacctacg ttccccccgc 2880 cgcgcggcgg ggaagcgcct tctcgacaaa actttgtacc gatcttctcc ataattgtaa 2940 gttctattct cacactgtac atatattgtt atacaacaag tttttaataa attttttcac 3000 atcagtctcc tgatgagggc ggcattgtcc gtcgaaacgt tgagaagaaa agagaaaatg 3060 tcttaaagtt ttttctctcc tttctcacca cacaacctgc ctcgggaaga ccgatagaga 3120 cgaaccaata ttcttgatca ccaccgaggg acagcatcag atagagaaaa accttttctc 3180 taaatatatt ttttttt 3197 // ID BEL-220_AA-LTR repbase; DNA; INV; 710 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-220_AA_; KW BEL-220_AA-I; BEL-220_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-710 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 898-898 (2011). XX DR [2] (Consensus) XX SQ Sequence 710 BP; 220 A; 174 C; 142 G; 174 T; 0 other; tgttcaagag tacgatacac ttttccaaac cctgttatca agaggacgcc gacacataca 60 gcagtgcaag ccatcaacgg tgatacgtaa tccactacca acaaaccgaa gcccagctat 120 cacggttggc tactccacgc tcatctagcc aaatctccct atctctacat caacaatcat 180 cagtcatcat cagaactatc gattccacat gagtgataag gatgaccgat accacaacag 240 catgtgataa gagcggtgag aatatcactc gctcccagaa ccgacgataa catcgcagcc 300 tcgaaggaat gacctacgaa cagtgatctt tcggaagcat cgggcagttc acttctacca 360 gaccgccgga gagcatggaa ccgattaacg tccgatttta acgtgttttt atgtaactag 420 tttttaagaa taaatatacg gtttaagcat gtttaagtgt taaatttaga taataaagtg 480 ttgtgtttaa tgtaaaccgc cgcgagttta agcatgcaag gaaactagaa tccgaggagt 540 gatccctagc cggaatatcc taccaccatt gtttggattt agtggcaaat atccaaccaa 600 gccaccatca ttgctgcagg atatcatctc tattcgatgt cggggaatta aaggctcaac 660 cacttctcag ctgagtgaac gtcgaaggta actggctcaa tccccggtca 710 // ID Gypsy1-SM_LTR repbase; DNA; INV; 999 BP. XX AC Contig17113; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1-SM_LTR; KW Interspersed repeat; LG_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-999 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-999 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 749-749 (2007). XX DR Genome; Contig17113; Positions 81 1079. XX SQ Sequence 999 BP; 389 A; 135 C; 113 G; 362 T; 0 other; tgttagatga cgaaggaaaa ccattagaca ccttcgtgct gacaaagtac tacagaacgg 60 ggcctaacag atacagaatc aatatccatg cagaaaaacg atcaacagtc gaatcctaaa 120 caaatcccaa accagatcca aagccgaatg atgaacagta ggcaatctat caatatcata 180 gacaatattc taattcatcc cattgtaatt attgacaatt atttattttc ttagtatttg 240 tttattttat tgacaatatg tatctgtttg tattgtaatt catttcattt tatttttgat 300 cttatgtatc tgttttgttt aatatatttt gataaacaat aaatttttat acaagggtaa 360 aatcattatt attattataa ttattattat tattattatt attattatta ttattattat 420 tattattatt attattatta ttacaaataa ttttattaat tgcgtgacaa agaatagtcg 480 acagctgtga aaaaaatagc ttgtaatata ctgttatgaa aaggaaaaaa taataataaa 540 tttatttcat tttctgtatt gtgaaaaaat aataatagta aatttattta atttcctgtg 600 cgtgataata aaataaaaaa taaattaaaa tcataaaacc cataataaaa tcagtatata 660 tatactgacg ctcagtatat tcgcccaaaa atcttgcgac atacttttct tttagagtta 720 atctcgaaaa gaacttaaac aggagcagag ctcaacaaaa tccttttttg caaaaataaa 780 cacaacacgc gtatataaca tacaaacata aacacagttt tttgcacgca cttatcagaa 840 cttgtaaaat acattttatc tcagattgaa ccatgtagaa cttggtaatt tttcatttaa 900 taaattttat ataacaattg tctttttata ttaaagttta tatatatata ctaatacaag 960 tgcacgttga gatctcgatc taaaatagtg agacctcaa 999 // ID LIN5_SM repbase; DNA; INV; 6403 BP. XX AC . XX DT 15-FEB-2008 (Rel. 13.02, Created) DT 11-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon from Schmidtea mediterranea: consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN5_SM. XX NM LIN5_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-6403 RA Bao W., Tempel S. and Jurka J.; RT "Non-LTR retrotransposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 163-163 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 8..2926 FT /product="LIN5_SM_1p" FT /translation="MPKKHSNSTGQKSEADDMEPMNCNSNLISEMNRLFQS FT FINQVMEKNTVQNNNRKSRSKSISRPNSKDNQKSKIIPKNKNNKKGFKIEN FT HLRNDKNFNDLKECFYNHEKFYKNLETCDLNELIFKIFLLKVKINKFEFKI FT SEFKNLKIEKKENKMMKNIEDKTTKSKNHVNFKNLKKNENDENKTKINPEN FT QNSEILNKNNEDKTKNLKSFKDVLINGKNKELKKNKKDLNNEKVKNNELTQ FT KNLSKIINLEKIRKWTDDFFNNLTWKFHKNIDVPETMETNTNLFNYLLIKF FT QLMQTKNLTPSRTIIEIKEEVSENDIKEMIDKYFNLFNNEIFLNFKNRFEN FT LNYESFIKDFLYIIHLKIKTFLNLFKYLNQSNLINNNAKKIFRAHQARETN FT GRSSSTFGRLAKNHVIKEGTVTTLHSSSTCGRLTDHPDTDQWEPATTRNPP FT SPNLLPINQIVKNISSIQKPDTTHWIRSTNSFVNNIKSIKIKPKLTSKNEN FT HFGPLINFSEDINEETKPLKIKTTSIEIKIDLEDTPKHITEERINKTSTIT FT NVDLDAPITKISAQTSSNTNDSISYSPLPTAEQLPKTPISPISKTITTEIC FT DITESPYPIPLINVDLNKILDNIETNMESDVFSSISDYESEIDNDNNIKTN FT NARESDIDLFSQSICENNQINIMQKNNQTRSNTPENAFNIMTKASEKNTIK FT KILNVKKKVIAALPPIPPMKTKPDSTLNCQAEDKENLIVYRFGSKGKFSCN FT TTHKSQECGEIAVYDYDALVEHAESNHIIQFNEQSYIDCHSCHPKKGKDSN FT TPVPVKYADFFNHAEAHGHIITTAAADTMKTYLRLTKENQLYCSHRNSNRK FT SKCNEIFNLDSDLSDIIEHMKSHPKYSFDSTKSIQCYCGIWKPFTNLIDHI FT KTVHLKEFINSISDSKTIKMDINTNITPINLAMILGTDETQNIPDEEAIKP FT RSLPENLAFNRDIEKELS" FT CDS 3194..6241 FT /product="LIN5_SM_2p" FT /translation="MHLELATDILKTILNVQNIFGDLHFSLTEMEYPICHE FT ANLSAFYVCHFLKCLMSDIPIDIPDINTMKEAMRPIIRKYNCAKFPESDVR FT SYRILIEDLIYQLNLDTISCEELLNEIERINGRINPKRYFKDNKPKSDIIH FT LQKKKAAELLCVKRLKFQINQKIEIGKIWENNEIDHKPAMAKFLKTFASKD FT CPMSNTSPIKLPFYRDVDSDTYTDCENLSHIMKNLDSSAPGMDLITGGDWK FT KISPKHELITAICNCVLRNKISPQRWKLYRTVMILKPGKLSESFKANSWRP FT LAIMDTAYRIFTTLINNRLLHWIRSGTLISQNQKAIGVPDGCAEHNATLHI FT AIDRAKRCKTELHIVWLDITDAFGSLPHDLIWYTLAGMGLKNETLTLIKEL FT YKDVRTMFDCQGTMSEPVPITKGVKQGCPLSMTLFCLSIDYILKSILDNHP FT FLLHNLNISILAYADDLVLLSDSYQGIRKTLENTVKLAGFANLKFKPSKSG FT YLSINSAGSVNKKLHLYDEEIPTISENNKYRYLGVDFSYKRNQDIDGRLDS FT ALALTRSLFMSYLHPSQKLNAYKTFIHSKLIFSLRNCIIGHRILDCDRNRI FT TQGREKQLGFDQKIKGLLKTMIGDKFQALNNYFLYTHCKMGGLGITATIDE FT YLIQSITGITRLFHSSNLGFRDMLIAELAHSRGSKNFEAGLKWMNCEISKE FT FHNTSFFVKFQKSALALKRKFSICISLKFAEDKFSLEMIYKKRTSYIDHRN FT LNTLSKELHDFVGLHYAEQWYKMKVQGRIAASIGDSITAKYLIASDTLNDA FT QYCFLVRARNNILNLNYSAYRLKYNLNTKCRLCHLDEETQTHVFNHCRAKP FT NARRVKHENVLVSIVAFLEKIGFEIDVEKSPKYVSIPTKLKPDMVIRSKRN FT KDIHVLDLKVPYDSVEGFEKAREDNYVKYKDLSLQIGKAFNQTATISAIVI FT GCLGTWDKKNNAPLSKIGLTKTEIISLARIACPNAVIACYHIYREHVSFTK FT NVIPLPFSLV" XX SQ Sequence 6403 BP; 2458 A; 1240 C; 1001 G; 1703 T; 1 other; aatawcaatg ccaaaaaaac attcgaatag caccgggcag aaatctgaag ctgatgacat 60 ggaaccaatg aactgcaatt ccaacttaat ctctgaaatg aataggctat ttcaatcgtt 120 catcaatcaa gttatggaaa aaaatacggt tcaaaataat aacaggaaat ccagatcaaa 180 atcaatttct agacctaact ccaaagataa tcaaaaatca aaaataattc caaaaaataa 240 aaataataaa aaaggattta aaatcgaaaa tcacctccgt aatgataaga atttcaatga 300 tttaaaagaa tgtttttaca atcatgagaa attttataaa aatcttgaaa cctgcgattt 360 aaatgagctt atctttaaaa tatttttact aaaagtaaaa attaataaat ttgaatttaa 420 aatctcagaa tttaaaaatt taaaaatcga gaaaaaagaa aacaagatga tgaagaatat 480 tgaagataaa acaacgaaat ctaaaaatca tgttaatttt aaaaatttaa aaaaaaatga 540 gaatgatgag aataaaacga aaataaatcc agaaaatcaa aattcagaaa ttttaaataa 600 aaataatgaa gataaaacta aaaatttaaa aagttttaaa gatgtgttaa ttaatgggaa 660 aaataaagaa ctgaaaaaaa ataaaaaaga tttaaataat gaaaaagtta aaaataacga 720 attaacacaa aaaaacctct cgaaaataat taacctcgaa aaaatccgaa aatggacaga 780 tgatttcttt aataacctga cctggaaatt tcacaaaaat attgatgtac cagaaactat 840 ggaaacaaat accaatctct tcaattacct actaatcaaa tttcaactca tgcaaaccaa 900 aaacttaact ccatcaagaa caatcattga aatcaaagaa gaagtttccg aaaatgatat 960 taaagaaatg attgataaat atttcaactt atttaataat gaaatatttt taaattttaa 1020 aaatcgtttt gaaaatttga attatgaatc ttttataaaa gattttttat atataattca 1080 tctaaaaatc aaaacgtttt taaatttgtt taaatatctt aaccaatcaa atcttataaa 1140 taacaatgcc aaaaaaattt ttcgcgcgca tcaggcgcgc gaaactaatg gacgcagctc 1200 gtcgaccttc ggtcgactcg caaaaaatca tgtcatcaag gaaggaactg ttactacttt 1260 acacagctcg tcgacctgcg gtcgactcac tgatcatcct gatactgacc aatgggaacc 1320 agcaactacg agaaatccac caagcccgaa cttactacca ataaaccaaa tcgtaaagaa 1380 catatcatct atccagaaac ctgatactac ccattggatc agatcaacca actcttttgt 1440 aaacaatatt aaatccatca aaattaagcc aaaacttact tcaaaaaatg aaaatcattt 1500 tggtccatta ataaatttct ctgaagatat aaatgaagaa actaaaccac taaaaataaa 1560 aaccacatct attgaaataa aaatagatct tgaggatacc cctaagcata taactgaaga 1620 gcgaataaac aaaacatcta ctataacgaa tgtggatcta gatgcaccga ttacaaagat 1680 aagtgcacaa acatctagca acacgaacga ttctatatca tactctccat taccgacggc 1740 tgagcaactt cccaaaactc caatctcgcc aatctcaaaa accattacta ccgaaatttg 1800 cgatattaca gagtccccct acccaatacc actaatcaat gttgacttga ataaaatatt 1860 agacaacata gaaactaata tggaatccga tgttttcagt agtatatcgg actacgaatc 1920 tgagatcgat aatgataata acataaaaac aaataatgct cgagaatctg acattgacct 1980 tttctcccaa tcaatatgtg agaataacca aattaatata atgcagaaga acaaccagac 2040 aagaagtaat acacctgaaa acgccttcaa tatcatgacg aaagcatccg aaaaaaatac 2100 tattaagaaa atacttaatg tgaaaaagaa agtgattgct gcactgcctc ccataccacc 2160 aatgaaaacg aaacctgatt ctactctaaa ttgtcaagct gaagataagg aaaatctcat 2220 tgtatatcgt tttggaagca agggaaagtt ttcatgcaat actacacaca aatctcaaga 2280 atgcggtgag attgcagtat atgactatga cgcattagtt gaacatgctg aatctaacca 2340 tataatacaa ttcaatgagc aaagctatat agattgtcac tcttgtcacc ctaaaaaagg 2400 caaagatagc aatacacctg ttccggtcaa atatgctgat ttcttcaatc acgctgaagc 2460 acacggacac ataataacga ctgcagctgc tgacacaatg aagacttatc tacgattaac 2520 aaaagaaaac caactctact gctctcaccg taacagcaac cgaaagtcga agtgcaatga 2580 gatatttaac cttgattcag acttatctga cataatagag catatgaaat ctcaccccaa 2640 atacagtttt gattcaacca aaagcattca atgctactgc ggtatctgga agcctttcac 2700 caacctcatt gatcatatca agacagtgca cttaaaagaa ttcattaact caatatcaga 2760 cagtaaaact atcaaaatgg atattaatac caatattaca cctataaacc ttgctatgat 2820 acttgggact gatgaaactc aaaacatccc agacgaagag gcaattaagc ccagaagcct 2880 tccagagaat cttgccttca accgtgatat cgaaaaagaa ctatcttgat ggtcgcagca 2940 cttggtcaaa gcatatattt tctcacatgc tattaaatca tcaaccatct tcatcaatcc 3000 ttatacttgc aatgctttga tccagtgcaa ctacaagatt ttctttgaaa ctttcccgtt 3060 caaagacttt gccagatgga acgagataat cttgccaata cacaacaact cttcttcttg 3120 gtcattcttt ttcttaaaca agaaaaagcg aattgcattg attatagatc catcagcgga 3180 tgatagtcac accatgcacc tcgaactggc aacggatatc ctcaaaacca tactaaatgt 3240 acagaatatt tttggtgatc tacatttctc acttactgaa atggaatatc ctatatgcca 3300 tgaagcaaac ctgtctgctt tttatgtatg ccactttcta aaatgtttaa tgtcagatat 3360 accaattgat attcctgata taaatacgat gaaagaagca atgagaccaa ttattagaaa 3420 gtataactgt gccaaattcc ctgagagcga tgtcaggagt taccgaatac taatagagga 3480 cttgatatac caattgaacc tcgacacaat ctcctgtgaa gaattattaa acgaaattga 3540 aagaatcaac ggaagaataa acccgaaacg atatttcaag gacaataaac caaagtcaga 3600 tataatacat ctgcaaaaga aaaaagcggc agaacttcta tgtgttaaga gactaaaatt 3660 ccaaatcaat caaaaaatag aaattgggaa gatatgggaa aacaatgaaa tagatcataa 3720 accggcaatg gccaagttct tgaaaacatt cgcaagcaaa gactgcccta tgtcaaatac 3780 atcaccaata aaacttcctt tctataggga tgtagactcc gatacatata ctgattgtga 3840 aaatctctcg cacatcatga agaacttgga tagctccgcc ccgggaatgg atctcataac 3900 aggtggagat tggaaaaaga tctctccaaa acatgagctt ataactgcga tatgtaactg 3960 tgtacttcga aataagataa gcccacagag atggaagcta tacagaactg tcatgatcct 4020 aaaacctgga aagttatccg agagcttcaa agctaactct tggagacctc tagcaatcat 4080 ggacacagct tatagaatat ttacgaccct tataaataat cgcctactac attggataag 4140 gagtggcacc cttatcagcc agaaccaaaa agcgatcggt gtccctgacg gatgtgccga 4200 acataatgca actctccaca tagcaattga tcgtgctaaa cgatgtaaaa ctgaactaca 4260 catcgtttgg ctggacatta ccgacgcttt tggttcgctg cctcatgacc tgatttggta 4320 cacactggct ggcatgggtc tgaaaaatga gacacttaca ttaattaaag aactatataa 4380 ggatgtgaga actatgtttg actgccaagg aaccatgtct gaacctgtcc caattactaa 4440 gggagttaaa cagggatgtc cattatcaat gacactcttc tgcctgtcaa ttgattacat 4500 cctaaaatct atcttggata accacccctt tcttttacac aatttaaaca tcagtattct 4560 ggcatatgct gatgatttgg ttcttctctc ggactcttat caaggaatca ggaaaacctt 4620 ggaaaacact gtgaaattgg caggctttgc aaacctaaag ttcaaaccgt caaaatcggg 4680 atatttatca atcaatagcg ctggctcagt taacaaaaaa ctacacctat atgatgaaga 4740 gataccaact atatctgaga ataacaagta cagatatctt ggagttgact tctcttacaa 4800 acgtaatcaa gatattgatg gtcgacttga ctctgcactg gcactaacca gatccttatt 4860 tatgtcatat ctgcatccat cacaaaagct gaatgcatac aaaacattta ttcattccaa 4920 gcttatattc tccttacgca actgcatcat cggtcataga atccttgact gtgataggaa 4980 tcgaattacg caaggacgtg aaaagcagct gggtttcgat caaaaaatta agggattgct 5040 aaaaacaatg attggagata aattccaggc ccttaataac tatttcctat atactcactg 5100 caaaatggga ggtcttggta ttaccgctac tatcgatgaa tatcttattc agagcattac 5160 aggaataaca agactgttcc actcctctaa tctcggcttt agagatatgt tgatagctga 5220 gcttgcccat tctagaggaa gtaaaaactt tgaagctggt ctgaaatgga tgaactgtga 5280 gatcagcaag gaatttcaca acacctcttt ctttgtgaaa ttccaaaaat cagcactggc 5340 tctcaagaga aagttcagta tatgcatctc cttaaaattt gctgaagaca aattctctct 5400 tgaaatgatc tacaaaaagc gcacctctta tatagatcat cgtaacctta acactctttc 5460 taaagagctc catgactttg taggtctcca ttatgccgag caatggtaca agatgaaagt 5520 acagggacgc attgcagcct ctattggaga tagtatcact gctaaatacc taatagctag 5580 tgataccctt aatgatgcac agtactgctt cttggtgcgt gcaagaaata acatcttaaa 5640 tcttaattac agcgcatacc gtctaaaata taatcttaac acaaaatgta gactttgtca 5700 tcttgatgag gagacccaga cacacgtttt caatcactgt cgtgcaaaac caaacgctcg 5760 aagggttaag catgaaaatg tactggttag catagttgcc ttcctagaaa aaattggctt 5820 tgagatagat gtggaaaaat cccccaaata tgtctcaatt ccaacaaagc ttaaacccga 5880 catggtaatt aggtccaaac ggaataaaga tatacatgtc ctggacctaa aagtacccta 5940 tgactcggtt gaaggctttg aaaaagcaag agaagacaac tatgtgaagt acaaagatct 6000 atccttacag atcggaaagg cttttaatca aacggccacc atatctgcta tagtgattgg 6060 atgtctgggc acatgggata agaagaataa tgcccctctc tctaaaatcg gattgacaaa 6120 gactgagatc atatctctgg ccaggatagc atgcccaaat gcggtgattg catgctatca 6180 catataccgg gaacatgtat catttacaaa aaatgttatt cccctccctt ttagtcttgt 6240 gtaaagtatg tttacgaggc aatgctgata tctcatctgc gttgcagttt tgtgtaagta 6300 gaataaaaag ctaaatagta tgaagtgctg agcctcgctc gcacagttgg ccgaaaggca 6360 gcaaatgaat aaagaccaat taaaaaaaaa aaaaaaaaaa taa 6403 // ID BEL-610_AA-I repbase; DNA; INV; 6780 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-610_AA_; KW BEL-610_AA-LTR; Pao_Bel_Ele203; BEL-610_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6780 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5813-6373] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 443..1654 FT /product="BEL-610_AA-I_1p" FT /translation="MHSVQEEDEHRSFRSRVSSKQIRDSTREWVGASKPTD FT VVNAQIATTSTHLCSSQLGGTQQNADVASIDQIASTATNTEQAKEVIRTPP FT TGAQQGRAPITSTSAIGAFSEQQTTQIIEHQGEASPAIIGPRADMPVETVP FT SSHRHTGVPKTNAFVQQKTAESLMRANMETPSLADGARIENPITAGLSTES FT QVRTSVSTLHVRLQPIAEVTSTPLIPSIKRSVDSQSGYKAHHKIIAQPSNQ FT LGVVQGQLTRNPNTVPGAESGEPPPGLNVTGQIPLSKFAQLGKSKKAVLQP FT PLERKRLNDPNPSAIIPGQCNESTLLSLPPTGYQHFSASNRQELPPTSGRE FT QYTVTSAKHQSVIQPPPGYENFGEETDCSIWDSAVVISPSAWNKQSPAIIA FT TSTRIRLSGV" FT CDS join(2108..3412,3416..6778) FT /product="BEL-610_AA-I_2p" FT /translation="MREIPSPRGDDLKTPIKFGMGVENLVEHMILAQQFQH FT ISNPMLLQEMVDRLPPNLKLQWASYKRNYNPVNMATFNGFMKDLVTMASDV FT TLLTSVGQLPAEKHDRGRREKSSKEKLFVHQQSPASTDMSGAESSGSIPKP FT CVNCSKTDHRIAECPVFKRLTVDERWKVLRQKGLCRICLVPHRSWPCRSKQ FT ECGVGDCRMRHHALLHLRKAASGVPSSTTAAERNVVHQNHHSVTSCALLRY FT LPVTLHYNGKSVEIFAFLDDGSSSTMMDAEVAEQLGAVGLAEPLYLGWTGD FT ISRTEKDSQHIRVAISGFNMAKEFPLKARTVGQLKLPSQTVNYEELCLDHP FT YLRKLPLSSYTNAAPRLIIGVDNAKLISALKSRESRTGQLVAVRTRLGWCL FT YGRHATTDSSLVDYVNTHMELHEADTDLHNLFRQFMAVDEAVKRSPLSNED FT KRASDILQKTTRRVDGRLETGLLWRSDEPSLPDSYDMAVRRMEALERKLSK FT DDQLREKVRGLIEEYLASGYAHRITTAELDTTEPGRVWYLPLGVVRNPRKP FT EKVRLIWDAAARAKGISFNDMVLKGPDLLTALPVVLLRFRQKSVAFSGDIK FT EMFHQFRIRKEDRQAQRFLYRENPGKPPQIFVMDVATFGAACSPCIAQYLK FT NKNAEDYKEQYPEAARAIIENHYVDDFLDSVDTVDEAVELIEDVKHVHAQA FT GMEIRNFASNSSEVLERIGEASRTQQKSINLEANVERVLGMVWNPSEDTFT FT FELDLKEEVQNIVANKIAPTKRQVLRTVMSLFDPLGLVAHFVVHGKLIMQR FT IWREGLDWDEQISGEILEDWCKWSRLLTKLNEVTVPRCFFSGNGRAHNDAQ FT IHMFVDASENAYACVAYLRSYGNGAPQCTLIAAKTKVAPLKPLSIPRLELQ FT AALIGSRLLETICKALTIPITTRYLWTDSTTVLAWLRSETRRYHQFVGFRV FT GEILTTTTIDEWRKVQTKLNVADQATKWKDGPSFDPEDWWYSGPRFLSDPA FT ERWITDLTLHSSEELATTVDLRSSFMHHRSLHQQPFIAVDRFSSWQRLLRA FT TAYVIRAAKLFLGLRATGPLIQEELANAEILLWRQVQMQVYPDEYVTLVYN FT KEHPKEEPRRLEKTSPLYKMSPILDDNGVLRMNSRITRAPIVSRDLKYPVF FT LPKEHRVTALLVESYHKRFLHRNNETVLNEMRQRFFIPQLRSVVSKIAKQC FT QHCRVWKATPQVPMMAPLPTIRLTPFIRPFTHTGVDYFGPIFVKQGRSMVK FT RWIALFTCLSIRAVHLEVVHSLSTQSCVLAIRRFVARRGSPATFCSDNGTN FT FVGASNILKEQIRTIGKSCAATFTNTDTKWLFNPPLAPHMGGPWERMVRSV FT KVAMSAIADHPRHPSDEVLETIVLEAEAIVNSRPLTYVPLEHENQEALSPN FT HFLLYGTQGINQPCQELTEEHVLLRDSWKLAKYLVDTFWTRWVREYLPTLT FT RRTKWFQPVRPLKPGDLVVVVEEGKRNGWIRGRIIEVLPGKDGQVRRAVVQ FT TGHGSITRPATKLALLEVQGPPTEDSGTPRYGVPELHGPG" XX SQ Sequence 6780 BP; 1926 A; 1605 C; 1752 G; 1496 T; 1 other; acaaaatctt caagatttgc gattgattcg atgtccacgt ggtagtcatc cggcatagga 60 atggatcctt cggagagcca ctgtatgttg tgcaatcgac caaacaacgt cgacaacctg 120 gtgcagtgtg accgttgtga tggatacgtg cattactcgt gtgcagaggt gggagattcg 180 attgccgacc ctgatcggag tttcacctgt aagaggtgcg tcgaaaacga tgatatcgtc 240 actgtttcct cacatcgcac gagccatgta agctccttac gaacctctcg tagtagtaac 300 tcagcacgga tagcattacg actgcaacag ctagagaagg agaaggaaaa ccgcctccga 360 gaactcgaag acgcagagaa gttccagagg ttgcgacgtc aggtcgaggt ggaatttgag 420 cagcaacaat ttgccattct agatgcacag cgtacaagag gaagatgaac atcgcagctt 480 tagaagtaga gtgagttcta agcagatccg agatagtacc agggagtggg tcggtgcgtc 540 gaaaccgact gatgtagtga atgcccagat agctacgacg tcgactcact tatgttcctc 600 ccagttgggt ggaacccagc agaacgctga tgtcgcatcc atagatcaga tcgcatcaac 660 agcaactaat accgagcaag caaaagaggt aatccgaaca ccgccgacag gggcacagca 720 aggaagggcg cctattacgt cgacgagcgc aatcggtgca tttagcgagc aacaaaccac 780 tcagatcatc gaacaccagg gagaggcatc gccagctatc atcgggccaa gggcggacat 840 gccagtcgaa accgttccat catcccatag gcatactggg gtgccgaaga caaatgcgtt 900 tgtacagcaa aaaacagcag aatctctgat gagggccaac atggagacgc catctctcgc 960 ggatggagca cgtattgaga accctatcac tgcaggtttg tctacggaga gtcaggtgag 1020 aactagtgtc agtacgttgc acgtaaggtt acagcctata gctgaggtaa ctagtacacc 1080 tttaatacca tcaataaaac gaagcgtaga tagtcagtca gggtataagg cacaccacaa 1140 aattattgcc cagccaagca atcagttagg agtagttcag ggccaactaa cacgaaatcc 1200 taacacggtt cctggggcag aatcggggga acctcctccg gggttgaacg taacagggca 1260 aataccgctg tcaaaattcg cgcagttagg caaatcgaaa aaggctgtgc tccaaccgcc 1320 attggagcga aagcgtttaa acgatccgaa tccgtccgct attatacccg gtcagtgtaa 1380 tgagtcaacc ctgttatctt tgccaccaac gggttaccag cactttagtg cgtcaaatcg 1440 ccaagaatta ccaccaacat cgggccgtga gcagtataca gtgacaagtg ctaaacatca 1500 atcagtgatt caaccacctc ccggatacga aaatttcggt gaggaaaccg actgcagcat 1560 ctgggattca gccgtggtca tcagtccctc agcctggaac aagcaatctc cagccatcat 1620 tgccacatca accaggatac ggctatccgg ggtataagct gaattcgact gagtacatgc 1680 cgtcaaaaat ctcaacgatt cccaagtcat cccaggctgg agaatcaacc tgaacaacag 1740 atgtactcgg aaccactgtc gcggtttagt catcagtgcg cccgccagat cagccgaatc 1800 aatcgcgtcg agcagggccc acagcagagc aaatggctgc tcggcaagtc atcccaaagg 1860 aattgcctgt gttttccgga gatcctcgga ttggcctctg ttcctgagtt ccttcaataa 1920 ctcgacggaa gcgtgcgggt acaacgatgc cgaaaatttg gcaaggctcc aacgctgttt 1980 acgtggacat gcgctggaga gcgtgagaag tcgattgctg attccggagt cagtgccgta 2040 cgtacttgcg acgctggagc gtttatacgg cagaccggaa gtgattatca atgcacttct 2100 aaaacgaatg cgggaaattc cctcgcctcg tggcgatgat ctgaaaacgc ctatcaaatt 2160 tggaatgggt gttgagaact tggtggagca catgattctt gcccaacaat tccagcacat 2220 cagcaacccc atgttgctcc aagagatggt agataggctt ccgcctaacc tgaagctaca 2280 gtgggcgtcc tacaagcgca actacaatcc agtgaacatg gcgacgttca acggtttcat 2340 gaaggatcta gttacgatgg ccagcgacgt cactctactg accagtgtag ggcagctgcc 2400 agcagaaaaa cacgacaggg gaagacgcga aaaatcctct aaggaaaagc ttttcgtcca 2460 tcaacaatcc cccgcttcaa ccgacatgtc tggggccgaa tcaagcggtt ccatccctaa 2520 accctgcgtc aactgctcca aaacagatca tcgtattgcc gagtgtccag tattcaaacg 2580 actgactgtg gatgaaagat ggaaggtttt gcggcagaaa gggctctgcc gaatatgctt 2640 agttccacat cgatcatggc cttgccggtc aaagcaagaa tgcggagttg gagattgtcg 2700 catgcgccat catgctctac tgcacttaag aaaagcggca tctggagtac cctcatcgac 2760 aacagcagcg gaaaggaacg tcgtccacca gaatcaccat tcagttactt cttgtgctct 2820 gctgcgctat ctgcctgtga cattgcacta caatggaaag agtgtggaaa tattcgcgtt 2880 cttagatgat ggatcttcgt caacaatgat ggatgcggag gtagcggaac agctaggtgc 2940 agtaggactt gctgaaccgt tgtatttagg ctggacaggc gatatatcga gaacggaaaa 3000 agactcccag catataagag tcgcgatatc cggattcaac atggccaagg agtttccact 3060 gaaagccagg actgtaggac aattaaaact cccgagccaa actgtgaact acgaggaact 3120 ttgtttggat catccatacc ttagaaagct acctctgtcc agctacacca atgccgcacc 3180 tcgacttatt attggcgtag acaacgctaa gctaatcagt gctcttaaaa gccgtgaaag 3240 cagaactgga caactagttg cagttagaac tcgactagga tggtgcctct atggacgaca 3300 tgccaccact gacagcagcc ttgtagacta cgtgaacacg catatggagt tgcacgaagc 3360 agacacagat ttgcacaatc tgttcaggca atttatggcg gtcgatgaag caascgtcaa 3420 acggagtcca ctttcaaacg aggataagag agcttcggat attctacaga aaaccaccag 3480 aagagtagac ggaagactgg aaactggact gttgtggcgg agcgatgagc cttctcttcc 3540 agatagctac gatatggctg tccgtagaat ggaagccctt gagcgtaaat tgtcgaagga 3600 tgaccaactg cgggaaaaag tcagagggtt gattgaggag taccttgcga gcggatatgc 3660 tcatcggatt acaacggcag agctggatac tacggaacca ggacgtgtgt ggtacttgcc 3720 cctgggggta gtgaggaatc cacgcaaacc agaaaaggtc cgtctgattt gggatgctgc 3780 agcccgagcc aaaggaattt cgttcaacga tatggtgttg aaaggaccgg atctgttgac 3840 tgctcttcca gttgttctat tacgcttccg tcagaaaagc gtggcattca gtggagacat 3900 taaagaaatg tttcatcagt ttcggattcg gaaggaggac agacaagcgc aaagattcct 3960 atatcgggaa aaccctggaa aaccaccaca gatcttcgtg atggatgttg ccacgtttgg 4020 cgctgcgtgc tcaccttgca ttgcgcagta tttaaaaaac aaaaatgctg aggactacaa 4080 agagcagtat ccggaagcag ctcgtgccat catcgaaaac cactatgtag atgattttct 4140 ggatagtgtt gacacggtgg atgaagcagt ggagctcatc gaagatgtga agcatgttca 4200 cgcccaagca gggatggaaa ttcggaactt tgcatccaat tcttccgagg ttcttgagcg 4260 tattggagaa gccagcagaa ctcaacagaa gtcgatcaac cttgaagcga atgtggaaag 4320 ggttcttggc atggtatgga atccgtccga agatacgttc acgttcgagc tggacctgaa 4380 ggaagaggtg cagaacatcg tagcgaacaa aatagcacca acgaagcgac aagttttgcg 4440 aacagtgatg tccctatttg atccgttggg tctggtggcg cattttgttg tgcacggaaa 4500 gcttatcatg cagcgaatct ggagggaagg attagactgg gatgaacaaa taagcggcga 4560 aatactagaa gactggtgta aatggagcag attgttgacg aaacttaacg aagtcactgt 4620 tccaagatgc ttcttttctg gaaacggtag agctcacaac gatgcgcaga ttcacatgtt 4680 cgtggatgct agcgaaaacg cgtatgcttg cgtagcgtac cttagaagtt acggcaacgg 4740 ggctccacaa tgtaccctga tagcggcgaa gacaaaagta gcacctctaa aaccgctatc 4800 gattccgagg cttgaattgc aagcggctct tatcgggagc cgtttactag agacaatttg 4860 caaggcttta accatcccaa ttaccacgcg atatctttgg accgactcta caactgtttt 4920 agcatggcta agatccgaaa cacggcgata tcaccagttt gtcggctttc gagttggaga 4980 gattcttacc acgactacaa tcgacgaatg gaggaaggtt cagaccaaac taaatgtagc 5040 tgatcaggcg acaaagtgga aggacggacc tagttttgac ccggaagatt ggtggtattc 5100 cggaccaaga tttctatccg accctgcaga gagatggatt actgatttga ctctgcattc 5160 gtctgaggaa cttgcgacaa cggtagatct tcgttcttca ttcatgcacc atcgatctct 5220 acatcaacag cctttcatag ctgtagacag attttctagc tggcaaaggt tgctgcgagc 5280 tactgcatac gttatcagag cagccaaact attccttgga ctcagagcaa cgggcccgtt 5340 gatacaggaa gagctggcaa atgcggaaat actgctgtgg cgacaggtgc agatgcaggt 5400 ttacccggat gaatatgtaa cgctagtgta caacaaggaa catccaaagg aagaacctag 5460 gcggttggag aaaacaagcc cgctgtacaa gatgtcaccg atactggacg acaatggagt 5520 attgcggatg aacagccgaa taaccagagc accaatagta tccagggatt taaagtatcc 5580 ggtttttctt cctaaagaac accgggtaac tgcgttgctt gtggaaagct accataaacg 5640 tttcctgcat agaaacaacg agacggtgct caacgaaatg aggcagcgtt tcttcatccc 5700 tcagcttcgt tcggttgtgt ccaagattgc aaaacagtgt cagcattgtc gtgtatggaa 5760 ggcgactcct caagtgccaa tgatggcacc tctacccaca attcggctaa cgcccttcat 5820 ccggccgttc acgcacactg gagtggacta cttcggtcca attttcgtga aacaaggacg 5880 cagtatggtc aaacgatgga tagccctttt cacgtgcctg tctatcaggg cagtgcacct 5940 tgaggtagtc catagtttat caacacagtc atgtgttcta gccattcgga ggttcgtcgc 6000 acgcagaggt tcccctgcta ccttttgttc cgacaacgga actaactttg tgggtgcgag 6060 taatatactc aaggagcaaa ttcgcaccat agggaagagc tgcgcagcaa ccttcacaaa 6120 cacggatacg aaatggctct tcaatcctcc acttgcgcct catatgggcg gaccatggga 6180 gcgcatggta aggtcggtga aggtcgcaat gtcggcaatt gcagatcatc ctcgacatcc 6240 cagtgatgaa gttctggaga ccatagtact ggaggctgag gccatagtga actccaggcc 6300 actaacctac gtgccgctgg aacatgagaa tcaggaagca ttgtctccaa accatttcct 6360 gttgtacggg acacaaggga tcaatcaacc atgccaggaa ttaactgagg aacacgtact 6420 gctgagagac agttggaagt tggcaaagta cctcgtcgat accttctgga ctcgttgggt 6480 gcgcgaatat ctcccgaccc ttacgagacg caccaagtgg ttccagcctg ttagaccatt 6540 gaaaccgggt gatctggttg tggtggtcga agaggggaag cggaatggat ggattcgcgg 6600 aagaataata gaagttctac ctggaaaaga cggacaagta cgtagagcag tagttcagac 6660 gggtcatgga tcaattaccc ggccggccac taagctggcc ttactggaag tacaaggacc 6720 gccaacagaa gactcaggga cacccagata tggcgtaccg gaactacacg ggccggggga 6780 // ID Penelope-2_HM repbase; DNA; INV; 2199 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2199 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2092-2092 (2008). XX DR [1] (Consensus) XX CC It is flanked at both ends by (TA)n. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 33..1943 FT /product="Penelope-2_HM_1p" FT /translation="MRAFECDLFKLIKNIQFRIVPCSFQNKLKKDISTIRK FT STYIYTRADKTSNLYKLSKDEYNKLLMNATTSNYKKTNPIIKDRINKEGKT FT ILKNHEVFNKININSSSNCFITLKDHKPNFLNNPTVRLINPAKNEVGRISK FT HILSDINTNLRNILQLNQWNSTQNVINWFKNINDKHFYKFLIFDIVDFYPS FT INEKLLIDSIKFAEQYVSIDADHKSLILHARKSLLFTNDQVWIKKSSGLFD FT VTMGAFDGAEVCELVGIFLLFQISQFYNKTDFGLYRDDGLAVFKNASGPKM FT EKIKKHFTQIFKQNNLSISIQSNMKIVNYLDVTFNLNNNTFQPYHKPDSTL FT NYIHANSNHPPSILKQLPISIEHRLSSNSSSEKIFQKTAPIYNDALEKSGF FT HTILQYHSNLNSSITNKRKRNIIWFNPPYSKNVITKIGHLFLNLIDLHFPL FT HHKLHKIFNRNTIKISYSCMPNIRSIINSHNQQILRNKLTDNDINCNCINK FT TTCPLNNQCLSKNVVYKATLESPNTSLTEKSYIGISENTFKLRFANHLKSF FT NATRYRNDTELSKEIWKLKDAGNMPIIKWNIIKRCQSXNPSTKKCNLCINE FT KYFIMTFDSRLLLNKKSELVSACRHRKKFLLSNFDSGD*" XX SQ Sequence 2199 BP; 842 A; 369 C; 265 G; 720 T; 3 other; aaacaccgca atgtccacaa cagattaatg atatgcgtgc atttgaatgc gatttattca 60 aattaattaa aaacatacag tttaggattg ttccatgtag cttycaaaat aaattaaaaa 120 aagacatttc tactattcga aaatcaacgt atatttacac tcgtgctgat aaaacatcaa 180 atttatataa attatcaaaa gacgaataca ataagttatt aatgaatgct actacatcta 240 attataaaaa aaccaaccct ataataaaag atcgcattaa taaagaagga aaaactattc 300 ttaaaaatca tgaagttttt aacaaaatca atataaacag ttcttccaat tgtttcatta 360 cattaaaaga ccataaacct aattttttaa ataatcctac tgttcgtcta attaacccag 420 ctaagaatga agttggtaga ataagtaaac atattttatc agatattaac accaacctta 480 gaaatatttt gcaattaaat cagtggaata gtacacaaaa tgtaataaat tggttcaaaa 540 acatcaatga caaacatttt tataaattcc ttattttcga catcgttgac ttttaccctt 600 ctattaatga aaagctactt atcgactcca taaaatttgc agaacaatac gtttccattg 660 atgctgatca taaatcttta attcttcacg cgcgtaaatc tttattattt accaatgacc 720 aagtttggat taaaaaatca agtggtttgt ttgatgttac catgggtgca tttgatggag 780 cggaggtttg tgaattagtt ggaatttttc ttcttttcca gatttcgcaa ttttataata 840 aaactgattt tggattatat agagacgatg gcttagccgt atttaaaaac gcaagcggtc 900 caaaaatgga aaaaattaag aagcatttta ctcaaatatt taaacaaaac aacctcagta 960 tttccattca atctaatatg aaaattgtta attaccttga tgttacattt aatcttaata 1020 acaacacttt tcaaccttac cataaacctg atagtacgtt aaattatatc catgctaatt 1080 ccaaccatcc accaagcatt ttaaaacaat taccaatctc aattgaacat aggttatctt 1140 ccaattcttc aagcgaaaaa atattccaaa agactgctcc aatttataat gacgcattag 1200 aaaagtctgg atttcacacc attctccaat atcattcaaa tctaaattct tctattacta 1260 ataaacgaaa aagaaatatt atatggttta accctcctta cagtaaaaat gtgataacaa 1320 aaattggaca tcttttctta aacttaattg acttgcattt tccattacat cataagctcc 1380 acaaaatatt caacagaaac acaataaaaa ttagttacag ctgcatgcca aatattcgat 1440 caataattaa ttcacacaac caacaaattt tgcgcaacaa acttaccgat aatgatataa 1500 actgcaactg cattaataaa accacatgcc cgttaaataa tcaatgtcta tcaaaaaacg 1560 ttgtatataa agccacttta gaatcaccta ataccagtct taccgaaaaa tcctacatcg 1620 gaattagtga aaacacattc aagctcagat ttgcaaacca tcttaaatca tttaatgcaa 1680 ccaggtatag gaatgatacg gaactttcca aggaaatctg gaaacttaaa gacgcgggta 1740 acatgcctat aattaagtgg aacataatta aaagatgcca atcttwtaac ccatctacaa 1800 aaaagtgtaa cctttgcatc aatgaaaagt attttattat gacttttgat agtcgactac 1860 tacttaacaa aaaaagtgaa ttggtttctg cttgtaggca taggaaaaaa tttttactat 1920 ctaactttga ttccggcgat tagcaaatat cggatcacgt ataattgacg tcagatgccg 1980 ttgcaacgct aaatttgtat tttttataac ggtttttttt tttttttata tgaataatat 2040 ggctgatgat tgccgaatgg catgaaactt ttagttccat cataaagttg tatttttttc 2100 atttaaatta ttttattata atttgatcta ttttttaaaa aaagatattg atcactctta 2160 aacagacaag agttgtattt mcagcaatat acaacgttt 2199 // ID Gypsy-40_DWil-LTR repbase; DNA; INV; 261 BP. XX AC scaffold_181155; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_DWil_; KW Gypsy-40_DWil-I; Gypsy-40_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-261 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181155; Positions 3019769 3019509. XX SQ Sequence 261 BP; 85 A; 53 C; 46 G; 77 T; 0 other; tgttgtatgt tgtatgccaa atgagcatat gcccacagta ttgtagagta tacaatataa 60 caagtggtca tatgctcaac attaaagtaa gagtaacata ccaacacttt gttgctatgt 120 ttatgcaacc attcataata ataatatgat ccgccactta tgcggactgg ccgctgagcg 180 tgggatcgct agcgtaagca cttttgacta tgtaccctct gtacatatat gctaacacat 240 acaaacatat atactgcaac a 261 // ID BEL-104_AA-I repbase; DNA; INV; 6772 BP. XX AC supercont1.298; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-104_AA_; KW BEL-104_AA-LTR; BEL-104_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6772 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.298; Positions 1135927 1129156. XX CC Positions [5497-6081] - Integrase core CC 'GTATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1105..6441 FT /product="BEL-104_AA-I_1p" FT /translation="MPVKEEKQLAVSMHQRKCLIKMLDALERFIRDYDHDR FT DSYQLCVRIDSLDKLNTDFADCQSAVERYDDPEALDVHIDERVVFEQRYCK FT AKGFLLAKRNADANQSALNESVLAPQHHPNFHLRLPKIEMPKFNGDFSRWI FT SFRDTFTSMVHVNGGIPAVAKLQYLLQSLEGSAKKPFESVDIEADNYATTW FT DALLKRYDNRKYLKKQLFKALYDLPAIKQECATRLHGLVDDYQRHVRALSK FT LGEPVDHWDLPLIYLLCYKLDPATLRAWEEKTSQKDDVKYDELVEFLYQRV FT RILNSVGSESHHPASSKVAGYPQKSFKQKVSANAATASPPTYPCPLSCSDS FT HSLRTCPVFLGKTVSQRRELVSQKRLCWNCLSFGHQSKKCGSKYSCRTCRE FT KHHSLLHDTAPVKVSATPAIQANPSELSVSSKLAVSAAPLVQQPLPSTSQQ FT ASGSRQVSMAVQTTCTMTLLETVVLNIVDDHGNTHKARALLDSASMSNFIS FT KPLAKSLYSRRSTVDVSVAGIGSSTQKIRSAITATIESGDRTISMKLQFLV FT LKQPSSELPTTPIDISAWKFPNVKLADPQFSTPGCVDLVIGSETYWELHTG FT RKISLGKGLPWVVETPFGWAVSGPASRSATCIPRFCYLSTADDRLESALRK FT FWDMETIPLAPVRSTEESRCEELYAATTTRNATGRYVVRLPRTEDPEKVLG FT ESRSIADRRFLSLERRLERDPATKDSYNRFMEEYLQLGHMNKIQEPVDDSI FT PHCYIPHHVVFKQSSTTTKVRVVFDASCKTSSGYSLNDTLLVGPVVQQDLY FT SIYLRFRTQIVAVVADVEKMYRQVLHHPEDLPFLRIRHRSSPSDPITTYEL FT QTVTYGTASAPYLATKTLQQIANDQAELYPAAVDPVVEDFYVDDLLSGAPD FT VDSAATLRQQVTAMLGSAGFTLKKWASNVQEVLQDVPPEDLAIQPLHDLQD FT EQTISTLGLLWDPKADNLRFKVEIPLPAAVLTKRKVMSYIAQIFDPLGLVG FT PVIAKAKLFMQRLWALKQDGVSCEWDSPLPEKLQHEWKQFHTTLPMLAEVR FT VPRLVLSPGGDSIQLHFFADASSVAYGACCFVRTVSADGIQVRLLTSKSKV FT APLSTHHSIARLELCAAHLATQLFKKIEASLKINSIAYFWSDSSTVLQWLR FT SSPTRWKTFVANRVSKIQNTTCIDHWRHIAGSDNPADDISRGLNPTDILGC FT FRWWQGPAWLALAPDCWPKTIIDQENSSAVSEEGRKNSIVALTAVQISFCT FT DLFSRYSRYNKLRRVVAFCSRYLHRLKERAQQRQTRSHIATLGGSIRSKSP FT SVPPLTTAELQHAEHLLCRLAQRETFAEEISDLSNGERVAKSSALKWLKPY FT VDEDGTLRVGGRLRNAALSVDNKHPIVLSAKHPLSALLASSYHVSLLHAGP FT QLLLATLRQKFWILGGRNLVKSVFHQCHTCFRSKPTLVQQSTADLPVSRVS FT PTRPFSVCGVDYCGPFYVKSAVRNRGPIKVYVAIFVCFSTKAVHIELVSDL FT STPAFLAALRRLVARRGRIVELHSDNATTFKGASHVLNRVYRMLKIEAADR FT QQIFDWCSGNEIVWKFIPPRAPHFGGLWEAAVKSAKTHLLKVVGNVNVTYE FT DLLTLLAQVEMCLNSRPLTPIPEDPTDLDVLTPGHFLVGSNLQAVPEPNWK FT ETAENRLDHWDLTQKRLQQIWSRWYPEYLQQLQSRASKGCNPPVTIEVGAI FT VIIKEDNVPPANWPLGKIVKLHPGKDGVVRVVTLKTASAKEVVSPVARIAL FT LPSPRSSK" XX SQ Sequence 6772 BP; 1613 A; 1823 C; 1683 G; 1653 T; 0 other; tttggtcctt cgaaccggat cgaaggcaat tcagaccgag aagtgattcc gcggaaaaag 60 tgatcgaaag caatcagacc aaacgcgcgc gttggtcatc ggtggtggct gccattcaat 120 ttggcaattt ctattgcttt gtgcgaacaa acagtgcgag tgcgctggga ggttttacca 180 aggcgggtaa acctcgtggg tgtgtggaga acagcaatcg attgcaatca gtgcactttc 240 ggtgactgct ttgtgcgtgg tgatagtatc acagcgggta ctattaggca gccgtgggag 300 ataattaatt atcgcaaacg attttgcgac ggtacattgg gtgaacgata gtgactatcg 360 gttctagtgc ttcgatcgcg ggtgaaagaa agcaaacttt cgtagcattt tctgccgtcg 420 gcaaacgttt cgcgtgagtg ttgtgcgaag aacaattttg cgcttgagag tgcactaagg 480 cgagtgttac tccattgtag tgaagaagat cggacattgc cctggccatt gtgaacgtcc 540 gtggccattg tgtggaaggt gattcaaccc gtttcgccat catcgccatc ttcaacgctt 600 gctagcgtgg cttggggacg cgactcggcg ccatcgcttc ggtgccattg ggtccatttg 660 gcgcaaatcc atccagcctt tggaaagcat ccgatcccta ggattgctcc cggtgagtcc 720 gttccaaact ttattgcatg tggctagcag tagccgtgac ttagtacgta agaaagtggg 780 tagctttttt catttgctat tgtggcttgg tggttaagac ggccgcgaat cgagtggaaa 840 tccgttcgcg ccctcgattt gcgataccgc gtagaagtgg gttcgttcgc gacgggaagg 900 agagatcgat tcgattatta aacgatccac ttccgtcgat ttcaaacgtc gcgcagggtt 960 tttacaccgc gacgggaagg agcaatatcg atcgattgtg tatcaccctc gccggttcgg 1020 tttcgtgtag aaacggttag tttcgcgacg ggaaggattt ttgcgttcga ttaaaactac 1080 cgtttccgtt cgatttcgtt cgcgatgcct gtcaaagagg agaagcagct ggcagtgagc 1140 atgcatcaac gcaaatgctt gatcaaaatg ctggatgcgc tggagcggtt tatccgtgat 1200 tacgatcatg atcgcgacag ttatcaattg tgtgtgcgga ttgattccct ggataagctg 1260 aacacagatt tcgcggattg tcagtcagcg gtggagagat acgacgaccc agaagcgctg 1320 gatgtacaca ttgatgaacg tgtggtgttc gagcaacgtt actgcaaggc aaagggtttt 1380 cttcttgcga agcgcaatgc agatgccaac cagtcggcgc tcaatgaatc agtgctagct 1440 cctcaacacc atccgaattt ccacctccgc ctacccaaaa tcgaaatgcc gaagttcaac 1500 ggcgattttt cacgctggat ctcattccgg gatacattta cgtcaatggt ccacgtgaac 1560 ggtggcattc ctgccgtcgc caagctccaa tatcttcttc aatctctcga gggtagtgcc 1620 aagaaaccct tcgaatcggt ggatatcgag gctgacaact acgcaacgac atgggatgcg 1680 ctgctgaaaa ggtacgacaa tcgcaagtac ctcaagaagc agctcttcaa ggctctttac 1740 gatctgccag caatcaagca ggagtgtgca acgcggcttc acggtttggt cgacgattac 1800 caacgtcatg ttcgcgccct atccaaactg ggcgaaccgg tggaccattg ggatttgccg 1860 ctcatctatc tgctctgcta caagttggat ccagctacac tgcgcgcgtg ggaggagaaa 1920 accagccaaa aggacgacgt taaatacgat gagctggtcg aatttctgta tcagcgcgta 1980 cggattctca attcagtggg atcagagagt catcatccag cctcgtcgaa ggtggccggc 2040 tatccacaga agagtttcaa gcaaaaggtg tcagccaacg cagcaactgc ttcgcctcca 2100 acctatccgt gtccgctttc ctgttcggac agccattcgc tccggacttg tccggtattt 2160 ctcgggaaaa ccgttagcca acgccgagaa ttggtttccc aaaaacgact gtgttggaac 2220 tgccttagtt tcggtcatca atccaagaag tgtggttcca aatattcgtg tcgtacgtgc 2280 cgcgagaagc accattcgct tctacacgat accgctccag taaaggtttc cgctacacca 2340 gccatccaag caaatccatc tgaattatcc gtgtcatcaa agctcgctgt ttcagcggct 2400 ccattggttc aacaaccgct accatctacg tctcaacaag cttctggttc ccgtcaggtc 2460 agcatggcag tccaaaccac ctgcaccatg actcttctgg aaacggttgt tctcaacatc 2520 gtcgatgacc acggtaacac gcacaaggcc cgagctctcc tcgattcggc atcgatgtcg 2580 aattttattt cgaaaccatt ggcaaagagt ctctacagcc gtcggtctac agtggatgtg 2640 tccgtcgcag gcatcggatc atctacacag aagattagga gtgccattac tgcaaccatc 2700 gagtcgggag atcgaacgat ctcgatgaaa ctgcaattct tggtactgaa gcagccatcg 2760 tcggagctgc caacaacgcc gatagacatt tcggcgtgga aatttcctaa tgtcaagtta 2820 gcagatcccc agtttagtac tcccggatgt gttgatctag ttatcggtag tgaaacatac 2880 tgggaattac atacaggacg gaagatttct ctgggcaaag gtcttccgtg ggtagtcgaa 2940 actccatttg gttgggcggt ctctggtccc gcttctcgct cggctacctg cattccacgg 3000 ttctgctacc tttctactgc cgatgatcga ctggaatccg ctctacgcaa attctgggac 3060 atggaaacca ttccattggc tccagttcga tccaccgaag aaagccgttg tgaagagctg 3120 tacgctgcaa cgacgacacg caatgcaact ggacgatacg tcgtccgcct gccgcggact 3180 gaagaccctg aaaaggtttt gggtgaatcc agatcaatag ccgatcgtcg ttttttgagc 3240 ctggaacgcc gacttgaaag agatcctgca accaaggact catacaatcg gttcatggaa 3300 gagtacctcc aactcgggca catgaataag atacaagagc ctgtggatga tagtattcca 3360 cactgctaca ttcctcatca cgtcgttttc aagcaatcca gcacgactac gaaggtccgg 3420 gtagtctttg acgcgtcatg caagacctcg tccggctatt cgttgaacga cacgcttctc 3480 gttggacctg tcgtccagca agacctctac tccatctacc tgaggttccg gacgcaaatc 3540 gtcgctgttg tagcagacgt cgaaaaaatg tacaggcaag tgttgcacca tcccgaagat 3600 cttccattcc tccgtattcg tcatcgcagc agtccgtccg atcccatcac tacgtacgag 3660 ctacagaccg tcacgtacgg tacagcgagt gcgccgtact tagccaccaa aacccttcag 3720 caaatcgcca atgatcaagc agaattgtat cctgctgctg tcgacccagt tgtagaagat 3780 ttctacgtcg acgatctact gtccggcgca ccggacgtcg actctgctgc aactctccga 3840 cagcaagtca ccgctatgct cggctccgcc ggttttaccc tcaaaaagtg ggcgtctaac 3900 gttcaggaag tattgcagga tgtaccgccg gaagatctgg ccatccagcc tcttcacgac 3960 ttacaggatg agcagacaat ctccacgctc ggtcttctat gggatccgaa ggccgataat 4020 ctccgattta aggtagaaat acctctccct gctgccgttc ttaccaagcg gaaagtgatg 4080 tcatacatcg ctcagatttt cgaccctctg ggcttggtag gtccagtcat tgcaaaggca 4140 aagttgttta tgcagcgcct gtgggcgttg aagcaggacg gtgtttcctg cgagtgggat 4200 tcgccattgc cagagaaact tcaacacgag tggaagcaat tccatactac attacccatg 4260 ctcgctgaag ttcgagttcc tcgtttggtg ctctctcccg gcggtgacag tattcagctt 4320 catttcttcg ccgacgcgtc atccgttgcg tacggagcct gttgctttgt ccgtacagtt 4380 tcggcggatg gaattcaagt tcgcctgttg acgtcaaagt caaaggtagc cccactgtct 4440 acacaccatt ccatcgccag actagaactc tgcgccgcgc atttggcaac gcaactcttc 4500 aagaagattg aagcatcctt gaaaatcaac tccatcgcct atttttggtc agactccagt 4560 accgtgctac agtggctgcg ctcatcacca acccgttgga agacattcgt cgctaatcgg 4620 gtgtctaaga tccagaacac tacgtgcatc gaccactgga ggcatatcgc tggctccgat 4680 aatcctgctg atgacatttc ccgcggtctc aacccaacag acatcctagg ttgtttccgc 4740 tggtggcaag gccccgcttg gttagctctg gctccggatt gctggccgaa gaccatcatc 4800 gaccaagaaa attcttccgc cgtgtccgaa gaaggacgca aaaattcaat tgtggcgttg 4860 actgccgtcc agattagttt ttgtactgat ttgttttctc gatactcccg ctacaacaag 4920 ttacgccgag tggtggcatt ctgcagtcgt tatctacatc gtttgaagga gcgcgcacag 4980 cagcgccaga ccaggtcgca catcgctaca ttgggcggta gcatccgatc aaaatcacca 5040 tcagtcccac cacttacaac cgccgagctc caacacgctg aacatctact ctgccgttta 5100 gcacagcgtg aaacatttgc tgaggagata tcagacttat cgaatggcga acgtgtcgcc 5160 aagtcgtcag cactcaagtg gctgaaaccg tacgtcgatg aagacggcac tcttcgcgtc 5220 ggcggccgtc tgcgtaacgc cgcactatcc gtcgacaata agcacccaat cgttctgtcc 5280 gccaaacatc cgctgtctgc tctgctggct agcagttacc acgtaagcct actgcacgca 5340 gggccccaac tcctgctagc cactctccgc cagaagttct ggatccttgg cggcagaaat 5400 cttgttaagt ccgtcttcca ccaatgccac acttgtttcc gatccaagcc cactctagtc 5460 cagcagagca cagccgatct gccggtctcg cgagtttcgc cgacgcgtcc gttttccgtg 5520 tgcggggtcg actactgtgg accattctat gtgaagtcag cagttcgaaa tcgtggtcca 5580 ataaaagtct atgtggccat attcgtctgc ttctcaacca aggctgtcca cattgagcta 5640 gtaagcgact tgtcaactcc tgcgttccta gcagcactcc gtcgtctcgt cgcccgccgt 5700 ggaaggatcg tcgaactcca ttccgacaac gccactacct tcaaaggtgc ttcacatgtg 5760 ctcaatagag tctatcgcat gctgaagatt gaagccgccg atcgtcagca aatcttcgac 5820 tggtgttccg gaaacgagat agtgtggaaa ttcattccac ctagggcacc acattttgga 5880 ggcctctggg aggccgccgt caagtcggcg aaaacgcatc tgctgaaggt cgtcggcaac 5940 gtcaatgtaa cctacgagga tctgttgacc ttgttggcgc aagtcgagat gtgcctcaac 6000 tctcggcctc tgacgccgat acctgaggat cccaccgatt tggacgtcct aacgccagga 6060 cacttccttg tcggaagtaa tcttcaagcg gttcccgagc caaattggaa ggagaccgct 6120 gaaaaccgac tggatcactg ggatttaacc cagaaacgtc tgcagcagat ctggtcccgt 6180 tggtatcccg agtacctgca acaacttcaa tctcgtgcat ctaaaggttg caacccgccg 6240 gtcaccatcg aggtcggcgc catcgtcatc attaaggagg acaacgttcc cccagcaaac 6300 tggccgctgg gaaaaattgt gaagctccat cccgggaaag acggtgtcgt ccgtgtggtg 6360 accctcaaga cagcgagcgc caaagaagtg gtgagtcccg tggcccgcat tgcacttctt 6420 ccgtcgccga gatcttccaa atagaaacca ggtagagcaa cccttattgg gagaacttgc 6480 tcgatgctca cctgcgtttc tgcaaaggta gttgcaggtc caatacctta cagtttcccc 6540 gttagatcta cctggtcttt ttctttgttc ccggtcgacg atgatgaacc tccaaccgac 6600 gttactggca ttcgaggcat gaacgaactg aagtcccgtg tcaacgcagt gaagtctccg 6660 tcaacggcaa cgagaagtac tcagccgaac ttcagagaga aggccgagta aacgtgcaaa 6720 cgttgttact ttgtaaattt gttgaaacat ccatgtttca aggtggccgg aa 6772 // ID I-63_AAe repbase; DNA; INV; 5600 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-63_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5600 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1334-1334 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 382..1899 FT /product="I-63_AAe_1p" FT /translation="MKKIVDGMQIRSRNDSNTSQPQSDTXISKKKHKRQNP FT PTFYDQSRQSAGPSRMSPPPKNPHQTLEARSEPDTGRIAPWQXLNPWATPT FT AEIXKSIWILTSSNPMDQENQQNLSPTVNPSHPSAKQLLNTNLPKPRFEMW FT ESEVPHSLLNEAQLRVRSVQDAIPLPVDGETPAAAGTHALNCHQYXANETA FT TXQYSQDQQTDFPPNNPPQHFKTPSQTQCFQYSSKKMHLINSSEYNNCPSQ FT IDHRQSQTTNTQANNLCDTSPRCGEGEKKAISPLPDEGVNAWGGRNTSANT FT VRSXTPSWGSTENTFKNLPTAARTSSTIYDQLYNRNQQASSSKRLLSDRIK FT TAPFRDSTAHPPETTQRPKRMSDDAICNDQSGAGSVNPTQVLALALTIDER FT IPIAAAGARXLASSQASTINYSSETVSSAPSIPPSDLHLELESNNSTSDIG FT AGSENPKQDLAPALTTDDYIPVAAAGARKLAPQASRQYSQCEQSSSASSIL FT SPVSVPRRTR" FT CDS 1927..5505 FT /product="I-63_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MERSHVWSNYNEFRKLVDDRLPITTCLQEMMTSQTTN FT LXRNRYHWIICNRSESLGNGIAGLGIRSDIPFQVIPHDSDLHVCIARVGKP FT WNITVXSIYVPHTLNNQEFIGKLDDLLLTLEPPFLLCGDFNASHEVWGSRK FT SNQRGRLLLEWAVDHNLLALNDGSPTHYHNTSGSFTAIDISFVSNSLASKF FT IWHCEDDLHGSDHFPIHIRSNNMLPTIRSRKRWIYKDADWTKFEHAVAQKI FT PANFEASIDDLTEGIFEAALESIPRSKGVPGXKNQLWWNKEVGEKIKTRRK FT ALRKLRKMKDDDPAKAAAKQEFNTAKSEARRAVNEAKSKSWTEFQSIFTSS FT SSTSELWQNFRKLNGKRQLKGPGLVLDGEHTSNPAVIADHLANFFADTSAT FT INYPEEFLSKKQIDEQDTIDLESTETTLLDDEFTIHELFRALEHTRGTSVG FT PDEIGYPMLQRLPFHCKISLLRAINQIWRSGNIPGSWKESVIIPIPKTGGH FT RKDANDFRPISLTSCVSKTMERMVNHRLITHLETQRTLHHRQFAFRRGKGT FT TSYLAELDDVITSAAKKGEHCEIVTLDIKKAYDRVWHHNIMRAVVENQLGS FT RMNKFVFNFLQMRTARVCFGGHLSQKVPQENGVPQGSVLSVTLFLLVMNDL FT FKVVPKNVQLFLYADDIVIVGTGKRVSYLRRRLQNAVSIIEKWTMGIGFQM FT SPLKSTTMHCCKVQRHKNWHLEGPKIVLDGAEIPKVKTTRILGIHFDRKCN FT FNHHMKHLKADCASRLNLLKAISRMADRKTLLYIGNATITSKLTYGIEVLR FT KENYKVLAPVYNDIVRIAAGALKTTSILPLVVESGCLPFEFIAVQALVKKA FT CSVLEKAREQSNNLWEKANTAFHDLTDNFLPQPCKLLTTHNRPWYKKKPNI FT DWTIKTKVKAGQMPQLIVSIFREHVNSRYKDHNKWYTDGSLANGLVGFGII FT GPNYRSGASLPGQCSVFSAEAAAPAWATKETTGRSVIFTDSASCLQALDGG FT KSGHPYIQEIEKEAEGKDVSFCWIPGHSNIPGNEEADFEANRNRLEESASR FT FVPTQDIVRWSQGRIWNAHQVNWDHSTRKTTRIVKPFVGKWIDQKSRKDQI FT ILTRLRLGHTWFTKHHLFEKKPAESCDNCDRTQSVHHILVGCPLYHSYRRE FT INLAENLENILKNDKICEEKVLKFIRKIGLYNKI" XX SQ Sequence 5600 BP; 1855 A; 1336 C; 1137 G; 1258 T; 14 other; ttgaagaact gctgcagcta tcagagctat ccgatggaac caaagtcact gttacgttgc 60 atcctacgtt caataccagc cgttgtgtga tttctacgtt cagcttaata atgtctgawg 120 agcgcatcgt tgaaaaacta gccggacaag gagtaactga agccaaacga atcctgaaga 180 acaaggaaaa tacacctgcc atcattttga ccttcaaccg ggccgcctat ccagaaaaag 240 taaaagttgg tcttctgagt gttccaaccc ggcctttcta cccgaatcca ctgttgtgtt 300 accgccaaaa tagatgagct ccagaatcaa attaaacaga gagatgasag aataagaaaa 360 attgaagcgc agaatcacaa tatgaagaaa attgtggatg gaatgcaaat taggtcaagg 420 aacgattcaa acactagcca accgcaatca gatacggawa tcagcaagaa aaagcacaaa 480 cgtcagaacc cacctacatt ctacgatcaa tccaggcaat cagcaggacc atcacgtatg 540 tcacccccgc caaaaaaccc tcatcaaact ctcgaagccc gatcagaacc agatacgggt 600 cgaatagcac cctggcaaat sttgaacccg tgggcaacgc cgacagccga aatcmtgaaa 660 agcatatgga tcctaacgtc tagcaaccct atggaccaag aaaatcaaca aaatttatcg 720 cctacagtta atccaagtca tccttccgcc aaacagttat tgaacacgaa cttaccaaaa 780 ccccggtttg agatgtggga aagtgaagta ccacactcac ttcttaatga agcgcagttg 840 cgagtaagat ccgtccagga cgcaataccc ctacctgtcg atggtgaaac tcctgctgcg 900 gccggtacac atgctctaaa ttgccatcaa tataawgcca acgaaactgc aacgsttcaa 960 tacagtcaag accaacagac agatttccct ccaaacaatc ctccccaaca tttcaaaact 1020 ccctcccaaa ctcagtgttt ccaatactca agcaagaaaa tgcacttgat caatagttca 1080 gaatacaaca actgccccag tcaaatcgac catcggcagt cacaaacaac caacacgcaa 1140 gccaataatc tttgcgacac ctctccgcgg tgcggggagg gagagaagaa agcaatctcc 1200 cccctcccag atgagggagt gaacgcctgg gggggcagga atacaagtgc taatacagta 1260 cgctcccawa caccttcctg gggttcaacc gaaaacacgt ttaagaatct tcccactgcg 1320 gcacgtacca gctcaaccat atacgatcaa ctctacaatc gaaaccaaca agcatcaagc 1380 agcaagcgcc tccttagcga ccgcatcaag accgcacctt tccgtgacag cacagcacac 1440 cctcccgaaa caacacaacg accgaaacgg atgtccgacg atgcgatttg caatgaccaa 1500 tccggagcgg ggtccgtaaa ccccacgcaa gtcctcgcac tggccctgac tatagatgaa 1560 cgcattccta tagcggcagc tggtgcgagg agwcttgcat cctctcaggc aagtacaata 1620 aattattcct ctgaaaccgt ttcttctgca ccgtcaatcc ctccttcaga cctacacttg 1680 gaattggaat caaacaacag taccagcgac atcggagcgg ggtccgaaaa ccccaagcaa 1740 gatctcgcac cagccctgac tacagatgac tacattcctg tagcggcagc tggtgcgaga 1800 aaacttgctc ctcaggcaag tagacaatat tctcaatgcg aacaatcctc ctccgcctca 1860 tcaattctct ctccagtctc tgtgccaaga cgcaccagat gacatcaccg aaacctttac 1920 aattcaatgg aacgttcgca cgtttggagc aactacaacg aattccggaa acttgtagat 1980 gaccgtttgc ccatcacgac atgccttcaa gaaatgatga cgagccaaac cacaaacctg 2040 mtaaggaatc gttatcactg gatcatctgt aatcggtctg aaagtctggg aaatggcatt 2100 gctggtctcg gtatccgcag tgacatccct tttcaagtca tacctcatga ttctgacttg 2160 cacgtctgta ttgccagagt tggtaagcca tggaacatta cagtamtatc catctacgtt 2220 ccacacacgc tcaacaatca ggaatttatc ggtaagttgg atgacctctt actaaccctt 2280 gagccaccat tcttgctctg tggcgatttc aatgcatcac atgaagtatg gggaagcaga 2340 aaatctaatc aacgcggtcg gctactacta gaatgggcgg tggatcataa tctactcgcc 2400 ttgaatgatg gatcaccaac gcactaccac aacacatcgg gctcctttac tgcaatcgac 2460 atctcttttg tttcgaactc tctcgcgtcc aaatttattt ggcactgtga ggatgacctc 2520 catggcagcg accatttccc gattcatata cggtccaata acatgctccc aaccatccgt 2580 tctaggaaaa gatggatata caaggacgct gactggacca aatttgaaca tgctgttgcc 2640 caaaagattc cagctaactt cgaagcctct attgacgact taacagaggg catctttgaa 2700 gcagccttgg agtcgattcc aaggtcaaaa ggcgtacccg gamccaaaaa tcaactgtgg 2760 tggaacaaag aagtcggtga aaaaattaaa acgcgccgta aagctctacg gaagctcagg 2820 aaaatgaagg atgatgaccc cgcgaaggcc gcagctaaac aagaattcaa tactgctaag 2880 tcagaagcaa gacgggcagt aaacgaagct aagtcaaaat cttggactga gttccaaagt 2940 attttcacga gcagctcatc cacttcagaa ctctggcaaa acttcagaaa actcaatggc 3000 aagagacagc ttaaaggacc tggactcgtt ctagacggtg aacatactag taaccctgca 3060 gttatagcag atcatcttgc taattttttt gcggacactt cagctactat aaactatcct 3120 gaagaattct tgtcaaagaa acaaattgat gaacaagaca caatcgactt ggaatctacc 3180 gaaaccacgc ttctagatga tgaatttact attcatgaac ttttccgggc cttggaacat 3240 acgagaggta catcagtggg accagacgag attggatacc ccatgctcca gagactcccc 3300 ttccattgta aaatcagtct gttgagagcc attaaccaga tctggcgttc agggaatata 3360 cctggtagtt ggaaagagag cgttataatc ccgattccaa aaactggagg acatagaaaa 3420 gatgcgaatg atttccgtcc catctctctc acaagctgtg tttctaaaac catggagcgg 3480 atggtgaatc atcgacttat tacccatcta gaaacccaac gaaccttgca ccatcgacag 3540 tttgctttta gaagaggaaa aggaacaact agttacttgg cagaactcga cgatgtcata 3600 acttcagcag cgaagaaagg agaacactgt gaaatcgtca cattggacat aaaaaaggca 3660 tatgacaggg tgtggcacca taatattatg agagcagtag tggaaaatca gttgggcagt 3720 cgaatgaaca agttcgtctt taacttcctg caaatgcgta cagccagagt atgcttcggt 3780 gggcacctat cgcaaaaggt acctcaggaa aacggggtac cccaaggatc tgtgctgtca 3840 gtaaccctct tcttgttggt gatgaacgat ctgttcaaag tagtaccaaa aaacgtccag 3900 ctattcctat acgccgatga tatagtaata gtcggaaccg gcaaacgagt cagctacttg 3960 cgacgaagac tccaaaatgc tgtatcaata attgaaaagt ggacaatggg gataggtttt 4020 caaatgtcac cactcaaatc aactacgatg cattgttgca aggtacagcg tcacaaaaac 4080 tggcatcttg agggtccgaa gatcgttctc gatggagctg aaataccaaa agtaaaaacg 4140 actaggatat tgggtattca ttttgataga aaatgtaatt ttaaccatca tatgaaacat 4200 cttaaagcag actgtgctag taggctaaac ttacttaaag ccatatctag gatggctgat 4260 cgcaaaacgc tactttacat tggcaacgca actatcacct caaaattaac atacgggata 4320 gaagtactga gaaaagagaa ctacaaagta cttgccccgg tatacaatga tattgttcgt 4380 atagctgcgg gtgctttaaa aaccacttca atcttgccac ttgtagtaga atcaggttgc 4440 ctaccttttg aattcatagc agttcaagca cttgtgaaga aagcctgttc tgtattggaa 4500 aaagcacgag aacaatccaa caatctatgg gaaaaagcta atacagcttt ccacgatcta 4560 acagacaact tcctaccaca accatgcaag ttactaacaa ctcataatcg gccttggtac 4620 aaaaagaaac ctaacataga ttggactatt aagactaaag taaaagctgg tcaaatgcct 4680 caattaatag tgagcatatt tcgagagcat gttaatagta gatataaaga ccacaataaa 4740 tggtataccg atggatcact agccaacgga ctagttggat tcggaatcat aggacccaac 4800 taccgaagcg gagcaagcct tccgggacaa tgctcagttt tttcggcaga agcagcagct 4860 ccggcctggg caacgaaaga aactactggt agatctgtga tttttacaga ctctgckagc 4920 tgtctacagg cacttgatgg aggcaaatca ggacatccct acattcaaga aatcgagaaa 4980 gaagcagagg gaaaagatgt tagtttttgt tggatccctg ggcattcaaa cataccaggg 5040 aacgaggaag ccgacttcga agcaaataga aatcgactag aagaaagtgc mtcacgtttc 5100 gtcccgacac aagacattgt aagatggagt caaggaagaa tatggaacgc gcatcaagta 5160 aactgggacc attcaactcg aaaaacaaca agaatagtta aacctttcgt cggtaaatgg 5220 atcgaccaga aaagccggaa agatcaaatc atcctaacta gattgaggct cgggcacaca 5280 tggttcacaa agcaccacct ctttgagaag aaacctgccg aatcttgtga taactgcgat 5340 agaactcaat ctgtgcatca tattctagtc ggatgtccat tataccactc atatcgacgt 5400 gaaattaacc ttgctgagaa tttggaaaat atccttaaaa atgataagat atgtgaagag 5460 aaagtactaa aatttattag aaaaattgga ctgtataata agatataagt gtaaatagta 5520 attcaatcaa cgccgtaata aacaaaaaga gacgaatgcc gtaatacggt aaagtctcta 5580 taagcaaaaa aaaaaaaaaa 5600 // ID Gypsy-118_AA-LTR repbase; DNA; INV; 295 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-118_AA_; KW Gypsy-118_AA-I; Gypsy-118_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-295 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 1139590 1139296. XX SQ Sequence 295 BP; 74 A; 69 C; 66 G; 86 T; 0 other; tgtgggtgtt gagatcttct gaaatataat tacttctcgc cctttagctt caccgcagta 60 cgaagtcata tatcttgcaa cacattttat tgaatgtgta gccaacagtt gagaaacaac 120 atgatcagct tggagtgtca ccgcatcagc ccaccctgcc ttagtcggat agcaacgaca 180 caaccagcat tgattgtcgt gcggtgagtc gccagtcttc ggtcgtcggt catataggta 240 agtccaactc gcggttttgg gcgattcttt tgaacgccaa taatggattt tcaca 295 // ID Transib1_Dmoj repbase; DNA; INV; 3058 BP. XX AC . XX DT 27-AUG-2008 (Rel. 13.09, Created) DT 27-AUG-2008 (Rel. 13.09, Last updated, Version 1) XX DE A new family of Transib elements in Drosophila mojavensis. XX KW Transib; DNA transposon; Transposable Element; transposon; KW autonomous; Drosophila mojavensis; Transib1_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-3058 RA Styles P.; RT "Transib1_Dmoj: a new family of Transib elements in D. RT mojavensis."; RL Repbase Reports 8(9), 908-908 (2008). XX DR [1] (Consensus) XX CC Transib1 is a family of autonomous transposons in Drosophila CC mojavensis and Drosophila willistoni. The 3058bp consensus CC sequence was obtained by multiple alignment of 11 sequences. The CC consensus sequence encodes a 682 amino acid transposase, with CC some homology to Transib transposases, in a single ORF between CC positions 499 and 2547. Transib1_Dmoj has 40bp terminal inverted CC repeats. Transib1_Dmoj appears to be currently active in D. CC mojavensis, with nine copies showing between 99.84% and 100% CC identity to the consensus sequence, with an average of 99.93%. CC Two copies are more divergent, with 94.50% and 96.07% identity to CC the consensus. Transib1 appears to have been involved in a CC horizontal transfer event between D. willistoni and D. CC mojavensis. The identity between the consensus sequences in these CC two species is 86.99%. XX FH Key Location/Qualifiers FT CDS 499..2547 FT /product="Transib1_Dmoj_1p" FT /translation="MSCIYPHIILEIFFLATMSVSKRELFSVWHMETDVSK FT KTFAVETYIYTNIFDKNKYDDSQCAKIDKKIKNFITKLTCKWKENHRILKN FT FELNNQEWLDDDIILVDRNSNVGRPDLSFEDSQEKTKKKKISDIVKEIPSN FT ELVSAASTSLYNCGKRSASKMINLLSTDESIPKKIKSIIQSDDKQPIKYTP FT EEALALYIDGGYTKKAYTDLQRGTKMRNANIFPTYDVLRSAKKLCYPSGLV FT VTDHSFEVPLQNLIDHTISRIYKAHYKDFPSIENNNGSQLTAIYKWGCDGS FT SGHSTYRQNFNDLNACMVDQHMFSAYLVPLEILRGSEIVWKNAKPSSTRYC FT RPLKLICQKETIDLVHDVVNNFKSQITAIQPTNITNFEVHHQFHLTMIDGK FT VFSVLAESSTSVCGICGTTPKKMNDLSAIKNLPCKENLYKYGLSSLHAWIR FT FLDCILHISYRLITKNWQVRSTDKAAVDMRKLEIQKNLRKEMGILVDIPKP FT GGSGNTNNGNTARKFFQDAKLAAKITGVNYLLIYRFSVILRTLASGYEIDT FT EVFKHYAFETAILFVDMYPWFYMPSSVHKILIHGAEIIEHFALPIGMMSEE FT ALEARNKDQRKFRLNHTRKNCRNNTMEDLAHTLFISSDPFITMLAKSTVSF FT SKKDRFVDKDIISLISNKDSQVELSDCETSDSD*" XX SQ Sequence 3058 BP; 1125 A; 473 C; 514 G; 946 T; 0 other; tgcacagtgg gcgaaacggc tgatttagct ccgattaatt caaatttaaa gtaattctag 60 ttgccaattt ctcttttgta atcgatcact aatatattcg atagaccaaa aacattaatc 120 gcttcgattt tctctcaata ttcgcacagc tgtgaagctt agaagacaac atatgttttg 180 ttggttgttg caatcagctt tcccgccaat tccgattgaa tcgaaaaaaa attgaataaa 240 aattggtttt actgttttca attgaacttt ttgctgtgaa ctgaataaaa aggtatgtag 300 ttactaacca aataaatgtt tgcgacctgt gattgtgttt cgtcccaaaa cgtaatatat 360 taattagtac ttgaatttgt cttaacattt gctaaaaact gaactttagt agagtagaac 420 ttttaagatt cgactggatt cgatatcaat gtgtttttaa aaaataagtc tgtgtattac 480 gaagtgaaat cgtccacaat gagctgcata tatccgcata ttattttaga aatatttttt 540 ttagcaacaa tgtcagtgtc caaacgagaa ttatttagtg tatggcatat ggagactgac 600 gtttcaaaaa aaacttttgc ggttgagaca tacatctaca ccaatatttt tgataaaaat 660 aaatacgacg atagccaatg cgctaaaatt gataagaaaa taaaaaattt cattacaaaa 720 ttgacatgca aatggaaaga aaatcataga attcttaaaa atttcgagct aaacaaccag 780 gaatggttag atgacgatat aatattggtg gatcgaaatt caaatgttgg gcgacctgat 840 ctcagttttg aagatagcca ggaaaaaaca aagaaaaaaa aaatttcaga catcgtcaaa 900 gaaattccga gcaatgagct tgtctccgcg gctagtacta gtttatacaa ctgtggaaag 960 cgaagtgcat ctaaaatgat taatttgttg agcactgatg aaagtattcc caaaaaaata 1020 aaatcaatta ttcagtccga tgataagcaa ccaattaagt atacaccaga ggaagcctta 1080 gccttatata ttgatggtgg ttatacaaag aaggcttata cagacttgca gcgtggtacc 1140 aaaatgcgaa atgcaaatat atttccgacg tatgatgttc tgcgttctgc aaaaaaactg 1200 tgctatccta gtggtctagt agtaactgat cattcatttg aggtaccttt acaaaattta 1260 attgaccaca caatatcccg aatatataaa gcacattata aagattttcc tagtatagaa 1320 aataacaatg gttcccagct aactgcaata tataaatggg gttgtgatgg aagtagtggc 1380 cactcaactt atcgccagaa ttttaatgat ttgaacgctt gtatggtcga tcagcacatg 1440 ttttcggctt acttggtacc attagaaata ctacgaggat cagaaatagt ttggaagaat 1500 gccaaacctt catccactcg ttattgcagg ccattaaaat tgatttgtca aaaggaaaca 1560 atagatttgg ttcatgatgt tgtgaataat ttcaaaagcc aaataacagc aatacagcca 1620 actaatataa caaattttga agtacatcat caatttcacc tgacgatgat agatggtaaa 1680 gtattcagtg ttctggctga atcatcaact tcagtttgtg gaatatgtgg tacaacccca 1740 aaaaaaatga atgatctcag tgctattaaa aacttgccat gcaaagaaaa cctttacaaa 1800 tatggtttat ctagtttaca tgcttggatt cgtttccttg attgcattct gcatattagc 1860 taccgactaa taactaaaaa ttggcaagtt cgaagtacag ataaagcagc tgttgatatg 1920 agaaaattgg aaatacaaaa gaatttgaga aaagagatgg gtatattggt tgatattcca 1980 aagcctggcg ggtcaggcaa tactaataac ggcaacacag cgagaaagtt ttttcaagat 2040 gcaaaattag ctgccaaaat aactggagtc aactatcttt taatttatcg atttagtgta 2100 atattacgaa cactagcatc cggatatgaa attgatacag aagtctttaa acattatgca 2160 tttgaaactg caatattatt tgtagacatg tacccatggt tttatatgcc ttcctctgtc 2220 cataaaattt tgattcatgg tgcagaaatt atagaacact ttgcccttcc gataggtatg 2280 atgtcagaag aagctcttga agccagaaat aaagatcaaa gaaaatttcg attaaatcac 2340 acacgaaaaa attgtcgcaa taacacaatg gaagacttgg cccacacact ttttatttca 2400 tcggacccat ttattacaat gttagcgaaa tcaacagttt ccttttctaa aaaagatcgg 2460 tttgtagata aagacattat tagtcttatt tcaaataagg actcacaagt tgagctatcg 2520 gattgcgaaa catcagattc cgattagatg taaaattgtt ggtaatattt acgttatatg 2580 gattgcgtac ataatcgcac gctttaagaa aaaaaaatat ataaaataaa aatatatata 2640 ttaaaagaaa aaaaaatttg gtctattgaa cattactcta ggggaacatg cgaaagaaga 2700 gagaaagaag acactttcga taattaaaaa aaaatgataa tgctgtccaa tttatttatt 2760 tttttgatgg tttgtatata tatttctaaa gtaatagcaa ataaaaagaa gtattaaaaa 2820 tgtattgaaa ttccttaaaa ttgagcgaaa ataaccatac atcgaataga acgggcttac 2880 actatcttaa aaaaaatttt ttttaatact ttcacttaat gaaaaaattt ttttctacat 2940 taattctgat gtgtatataa tttatctaca aaacaaaagt atccgtacaa aaatagcatc 3000 atttttgcag aagttattaa ttaatcgccg ctaaatcagc cgtttcgccc actgtgca 3058 // ID DNA-TTAA-2_CQ repbase; DNA; INV; 697 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TTAA-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-697 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 66-66 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >92% CC identity. ~20 bp TIRs. TSDs arel TTAA. TSD sequences and CC terminal nucleotides indicate that it is a non-autonomous CC Kolobok element. XX SQ Sequence 697 BP; 220 A; 122 C; 122 G; 232 T; 1 other; ggggttacat acatgtaaat cgacaaaaat gtcagaggtt ggtgtgagca cacacttaat 60 tttatttaaa atctgttttc agggcattaa aatatacatt ttcatctatt atcaaaacaa 120 atttgaagac atttggttgt atcattgccg agatatagct atttgaagtt agcagtttca 180 aaaaacgggt gccacgatat ctcaacactg ccttgaccaa atcggctcaa aattttggtg 240 aagactcgtt aaaccggtcc tgtgtgcatg acgaaggccg attttcaaaa agcttatttt 300 aaaaaaagat aaaaatattt ttctgttttt catataaaaa atcgtcagtt tttgattttt 360 gtatttttta aaagcaaaat ttcaaaatcg ggcttcgtca tgcacacggg atatgtcttg 420 ggagtcttca ccccgaattt cagccaattt ggtccatccc atctcgagat atcgtggcac 480 ccgtaaatca actcggtgtt tcgagaaaaa cgctcagaaa gtttgacagt ccgctttgcg 540 cacggcaaaa tkttgagctt aaaatcgtct ctaactcagt ttaatcatga aatatcttca 600 tgaaactttc aggagtgatt gaaaatcatc tttttagtgg attcaataaa ttttctgtaa 660 tatgaaattt tgtgattttc tacatgtatg taacccc 697 // ID Gypsy-62_CQ-I repbase; DNA; INV; 11387 BP. XX AC AAWU01038191; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-62_CQ_; KW Gypsy-62_CQ-LTR; Gypsy-62_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-11387 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 503-503 (2011). XX DR Genome; AAWU01038191; Positions 8982 20368. XX CC Positions [3984-4451] - Reverse transcriptase CC Positions [6066-6488] - Integrase core CC 'GTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1147..2895 FT /product="Gypsy-62_CQ-I_1p" FT /translation="MEHLEAYIDAELSNLKKSINKPRSEQNTLDKIELVND FT YFQQYEALVRNKKSKSSAEISKIIDSNHNRVREKVEKALDILKNPKHETNS FT DVKDDNDKKSKTNNTNTAEEEKVLVLIERSTDDEIKDLKLSDQLSNLEWVS FT DKIDTFKDNCKKYNRLVEEIKTKVNSEYFKNILIGQNRFLEKADEAYEFLK FT EAEKKLIKNNVTQNQNKKPSKIIIQENTNKVQTVLGNKNYHEIQIFNFPMK FT LAIQCIPDFHGAKNELDPFLFQIDQFASQLPYGVDEASLINVVLMKLKGRA FT LDRTNKIRGKNWAETRENLIKEFSSVSSVENILQKIETLEQGQNENFKNYK FT KRALNILNEISIIEQKDNIDGQFARRSLKIHFLGGLKNCNLKQLAKTQRDK FT NFTELLDYLDQEFIECEQIDDIERRLQACKVSKYNFQKQSNNSAYRNFSRW FT NDTRPDIATNWNRNHNYLPNNRNNQRPTHQTNTNQYIRQNQYQENYNRNYN FT NYQLQPKANQQSLSNNISRNYPERMDQRNYQPQINETRNRYPNSQQNRIQQ FT IPRYQNQNRQRSENYQNNRYNNNNTNSNQDNRYPKN" FT CDS 3195..6488 FT /product="Gypsy-62_CQ-I_2p" FT /translation="MGYSVFNNHEKVTLSTISESVTLSIGTVMVHLLVGNT FT IFITKFSVMKNLPVPGIIGADFLKKHTLFISENFEFLVLKTCNNTNDNENV FT HDRKTNTIANCYENNPRFYQNRRNEVKEAEQPINNYNEAPFLDTEYQEEVT FT SQELDLDTDRDCDVIEYKQFKKVRGCERMEQLLNIIKLDHLINGSYRDIKS FT IIQKFNHIFFLEGDELTYSRAAMHEIETTTNIPINKRQYRMPESTKSHIDE FT QIAEMLKLGIIKPSKSPWNAPVLCVPKKCDANGNKRYRIVVDFRALNLITK FT PFVYPIPLINEILDNIGNSKFFSSIDLKSGFYQIPIDSRDATKTAFSTSKG FT HYEFTRMPMGLEKRPSTFQKLMNTILYEIQPVKAFVYLDDIVVFGTTIEEH FT NLNLCKILEALSKNNLKVEPEKCNFLKKEIKYLGHIIDENGIRPTDENIKT FT IKNMKRPQTIRDVRSFLGTVNFYGKFIPNMADKRKHLNNLLKKDTKFIWTK FT ECEDAFNVLKNCLISEPILVRPNFKDTFVITTDASDYAIGAVLSNEKTNDH FT PIAFASRALSHSEKKYFIIEKELLAIVWAIEYFKHYIFNQHFIVYTDHRPL FT IALWRLKETSPTLSKLRLKIQGIGCEIRYKKGKENIVADFLSRMQHEEIKI FT HNTEKISENLSNNIVAVVTRQQRRQQTNSPNNIINNSNYNKSIDDNENQTN FT KLFNDDDGEDFNTTVDSLQAMDIEDETIDVSSKQFSLDDFFDAQNNDELDF FT KLLKFSKTLIDFDSIDGTFVIINSKTAFKELSQLIDLPHGMKDYLDGKTFS FT FPMHKIWGIILNGSKRSTINIEELFDGFLEGLTKCPNFAKSAENIQIISFR FT NLQRFPGINILRFFANKFEKFFTLYASEEERIFVKVEDRETVLKEFHDAPL FT GGHVGGKRMLKRISPLFKWDNMKRDVENYVRQCESCQKNKIWPTNKIPMKI FT TTTSSEPFQKIYMDIIILVQSEDNNRYGLVIQDDLTRFLTVAPMPDQESET FT VARTFVDYFICRYGCPLEVVTDQGANFMSCLMKNVCKLLKIKKICTSAYHP FT QANLVERSNRELKIYLRQFILGNPESWDSLLPFFTFEYSLL" XX SQ Sequence 11387 BP; 4763 A; 1725 C; 1694 G; 3205 T; 0 other; gtggtgacag caaaattcca agaacggaag gagacgccat cttggattgg aagaaaaaac 60 cagtgccgac atgaaaattt ggtaagtact tttacaattt taactaatta atgcaactaa 120 tcaactttga acaatcgtct gaaccaattt ttttttaaat attagaatct tttttacatt 180 attctgatat tgtctttttc caaaacattt tctttgaaaa aaaagtgttt cattaatttt 240 gctttattga aaaagaaaga gaataaaaag tgtacaaaaa taaaattttc ttattgagta 300 ttgaaaaatc gcaaagaaaa aaacaaagcg aaaaaaaaaa acaattattt taaacgcgac 360 atcgtttact aaccaacgga ctgaagcccc cacccgtggt aagcgacagt tcaaatatca 420 caaaagtgca tactaaacca aaatttcaaa actgaaaaat aaaaataaaa tcgaaattta 480 aacgcgacat cgtttactaa ccaacggact gaagccccca cccgcggtaa gcaacagttc 540 aaatatcaca aaagtgccta ctaaatcaaa aattcaaaat tataagaaaa atgagtgatt 600 aactacactg aaataaaaaa aaaattaata aaaaataacc gtgattcgcc tactaattca 660 ggaactctag cccccaccag cggtaagcga aagttcgaat atcacaaaaa tattttacta 720 aaccaaacag atcaaacccc ttaaaaatat acagtacaag aaaaataata aaataataac 780 cgcgattcgt atactaattc aggaactcta gcccccacag tggtaagcga aagttcaatc 840 cgaaatagtg aaacactaaa ccaatcaatc aaatcataat attaaagaca aatatcaaac 900 cagtaaacga gatttcgcct actaattcag aaaccctagc ccccaccagt ggtaagcgaa 960 ggtttaatcc gaaaagtgaa attactaatc caaattctca gaaaaataat aatcaaataa 1020 acaaaaacca acactcaaca agaaaagaaa aagtaagtga aataaaaaaa aattatctca 1080 aaagtacacg tttatcagat tgaatttttt attcatttct cttcaataaa aaaaaagaaa 1140 tcaaaaatgg aacatcttga agcatacatt gatgcagagc tttcgaactt gaagaaaagc 1200 atcaataaac ctaggagcga acaaaatact ctcgacaaaa ttgaattagt taatgattat 1260 tttcaacaat atgaagcact tgtacgaaat aagaaaagta agagcagtgc ggaaatttca 1320 aagataattg attcaaatca caatagagtt cgcgaaaaag tggaaaaagc tttggatatt 1380 ttgaaaaatc caaaacatga aacaaattca gacgttaagg atgataatga taaaaaatca 1440 aaaactaata atactaacac ggctgaagaa gaaaaagttt tggtactaat tgaaagatca 1500 acagatgacg aaataaaaga tttaaaatta tctgatcaac tttccaactt ggagtgggtc 1560 agtgataaaa ttgatacatt taaagacaac tgtaaaaaat ataacaggct tgtcgaagaa 1620 attaaaacca aagtaaattc tgaatatttc aaaaatatac taatcggcca aaacagattt 1680 ttggaaaaag ctgatgaagc ttatgaattt ttaaaagagg ctgagaagaa attaatcaaa 1740 aataacgtta ctcaaaacca aaataaaaaa ccatccaaaa ttattatcca agagaataca 1800 aataaagtcc aaacagtact tggtaacaaa aactatcatg aaattcaaat ttttaatttt 1860 cccatgaaat tggcaattca atgtattcct gattttcatg gagcaaaaaa tgaactagat 1920 ccttttttat ttcaaattga tcaatttgct tcacaattac cctatggggt ggacgaagca 1980 tcgttgatta atgtggtgct catgaaatta aaaggcagag cacttgatag gacaaataaa 2040 attcgcggca aaaattgggc cgaaacaagg gaaaatttga ttaaagaatt ttcctctgtt 2100 tcatctgttg aaaatattct acagaaaatt gaaacattag agcaaggtca aaatgaaaat 2160 tttaaaaatt ataaaaaaag agctttgaat attttgaatg aaatttcaat catcgaacag 2220 aaagataata ttgacggaca atttgctagg agaagtttaa aaatccactt tctgggtggt 2280 ttgaaaaatt gcaatttaaa acagttagcc aagacgcaaa gagacaaaaa ttttactgaa 2340 cttctagatt atttggatca ggagttcata gaatgtgaac aaattgatga catcgagaga 2400 cgtttacaag catgtaaagt ctcaaaatac aattttcaaa aacaatcaaa taattccgct 2460 tatagaaatt tttcaagatg gaatgataca cgtccagata ttgcaacgaa ttggaacaga 2520 aaccataatt atctaccaaa taatagaaat aatcaaagac caactcacca aacaaacaca 2580 aaccaatata ttagacagaa tcagtaccag gaaaactata atagaaacta taataattat 2640 caactacaac caaaagcaaa ccagcaatca ctatctaaca acatttcaag aaattatcca 2700 gaaagaatgg accaaagaaa ctatcaacca caaattaatg aaacaagaaa tagataccca 2760 aattcacaac aaaaccgaat tcaacaaata ccaagatatc aaaatcagaa ccgccaacgt 2820 tccgaaaatt atcaaaataa tagatacaac aataacaaca ccaatagtaa ccaagataat 2880 cgctatccaa aaaactaata actgggggtt ttcaaaaatt ctacaaccaa tcataccaaa 2940 ggaataaaca tttatttgaa gcaaacacaa atttaagtca aaacacaaat cagaaaaatg 3000 gtagaatttt tgcaagaaaa attgaaaact tcctagaata taacccccca cacgaaacta 3060 ttccgattat ccttacacca aactaccaat ttcgaataag aattgcgtct tcgaaaactc 3120 cacgtcataa agtaaaattt ttactagaca caggagcatg cacgaactta attaggaaaa 3180 acgttttaaa ctcaatgggt tattcagttt ttaataacca tgaaaaagtt accctttcaa 3240 caatttcaga aagtgttact ctttcgattg gtacagttat ggtacattta ttagtgggaa 3300 acacaatctt tataactaaa ttcagtgtca tgaaaaattt acctgtgcct ggtattattg 3360 gtgcagattt tcttaaaaag catactttat ttataagtga aaattttgaa tttttggttc 3420 taaaaacgtg caacaacacg aatgataatg aaaatgtcca tgatagaaaa accaacacta 3480 ttgcaaattg ttacgaaaat aatcctcgtt tttaccaaaa tcgaagaaat gaagtgaagg 3540 aagctgaaca accaataaat aattacaatg aagcgccttt cttagacact gagtaccaag 3600 aagaggtgac atcacaagaa ctcgatttag atactgatcg tgattgcgat gttatcgaat 3660 ataaacaatt taaaaaggtt agaggttgtg agagaatgga acagctactg aatattatca 3720 aattagacca tctaataaat ggatcttata gagatataaa atcaattata cagaaattta 3780 atcacatatt ttttctggaa ggtgacgaat taacatattc aagagcggct atgcacgaaa 3840 ttgaaacaac aactaatata cctataaaca aaaggcaata tagaatgcca gagtccacaa 3900 aaagccatat tgatgagcaa attgcagaaa tgttaaaatt gggcattatt aagccaagta 3960 aaagtccttg gaatgcacca gttttgtgcg taccaaaaaa atgtgatgca aatggaaaca 4020 agcgctacag aattgtagta gattttcgag cgcttaactt aattacaaaa ccatttgtgt 4080 atccaatacc gttaattaat gaaattttag acaacattgg taatagcaaa tttttctctt 4140 caattgatct taaatcggga ttttatcaaa ttccgattga ttctcgagat gctacaaaaa 4200 ctgcattttc gacatcaaaa ggacattatg aatttacaag aatgccgatg ggtcttgaaa 4260 aaagaccttc aacattccaa aagctaatga atactatttt atatgaaata caaccagtaa 4320 aagcgtttgt gtatttagat gatatagttg tttttggtac aacaatagaa gaacataatt 4380 taaatttatg caaaattttg gaagctttga gcaaaaataa tttaaaagtt gaaccagaaa 4440 aatgcaattt tcttaaaaaa gaaattaagt accttggaca cataattgat gaaaacggta 4500 taaggcctac agatgagaat attaaaacaa tcaaaaatat gaaacgtcca caaacaatac 4560 gagatgttcg gtcgttttta ggaacagtaa atttttatgg aaaatttatt ccaaatatgg 4620 cggataaaag aaaacatttg aataatcttc ttaagaaaga tacgaaattt atttggacaa 4680 aagaatgtga agatgcgttt aatgttctga aaaattgttt gatttcagaa ccaattttag 4740 ttcgaccaaa tttcaaagat acttttgtta ttacaaccga cgcaagtgac tatgcaatcg 4800 gagctgtact atcaaatgaa aaaacaaatg atcacccaat agcgtttgct agtagagcat 4860 taagtcattc agagaagaaa tactttataa ttgaaaagga acttctcgca attgtatggg 4920 cgattgaata ttttaaacat tatatattca accaacattt cattgtttat acggaccaca 4980 ggccgttaat tgcattatgg agactaaaag aaacttctcc tactctttcc aaattaagat 5040 taaaaatcca aggcattgga tgtgaaatta gatataaaaa agggaaagaa aatattgttg 5100 cagacttttt atctcgaatg caacacgaag aaattaaaat tcataacact gaaaagattt 5160 cagaaaattt aagtaataac attgtagctg ttgtaaccag acaacaaaga cgccaacaaa 5220 caaattctcc aaataatatt attaacaatt ctaattacaa taaatcgatt gatgataatg 5280 aaaatcaaac taacaagtta tttaatgatg atgatggtga agatttcaat acaaccgttg 5340 attctttgca agcaatggat attgaagatg agacaattga tgtttcgagt aaacaattta 5400 gcttggacga ttttttcgat gctcaaaaca atgatgagtt agattttaaa ttacttaaat 5460 tttcaaaaac attgattgat tttgattcaa tagatggaac gtttgtaata atcaacagta 5520 aaactgcttt taaagaattg agtcaattga tcgatttacc acacgggatg aaagattatt 5580 tagacggaaa aactttttca tttcccatgc ataaaatatg gggtataatt ttaaatggtt 5640 ccaaacgttc gactattaat atagaagaac ttttcgatgg atttttagaa ggtcttacaa 5700 aatgtccgaa ttttgcgaaa tccgcagaaa acatacaaat aatttctttc agaaacttac 5760 aacgttttcc gggaatcaat attttaagat ttttcgcaaa caaatttgaa aagtttttca 5820 cattgtatgc gtctgaagag gaaagaattt ttgttaaggt agaagatagg gaaacggtat 5880 tgaaagaatt ccatgatgct cctttgggag gtcatgttgg tggaaaaaga atgcttaaaa 5940 gaataagccc actttttaaa tgggataaca tgaaaagaga cgttgaaaat tatgtgcgtc 6000 aatgcgaatc atgtcaaaag aataaaatat ggccgacaaa taaaataccg atgaaaataa 6060 caacaacttc atcggaacca tttcagaaaa tttatatgga tattataatt cttgtacaat 6120 ctgaagataa taatagatat ggtcttgtca tacaagatga tctaactaga tttttaactg 6180 ttgctccgat gccagatcaa gaaagtgaaa cggtagcaag aacatttgta gattatttta 6240 tttgtagata cggttgccct ttggaggtgg taaccgatca gggtgcaaat tttatgagtt 6300 gcctcatgaa gaatgtttgc aagcttttga aaattaaaaa gatctgcact agtgcatatc 6360 accctcaagc taacttggtt gaaagatcaa accgagaatt gaaaatatat cttcgacaat 6420 ttattttagg taatcctgag tcgtgggact cattgttacc attttttacg tttgaatata 6480 gtcttttatg agcagcctca tgaagaatgt ttgcaagctc ttaaagatta ataagatctc 6540 cgctagtgca tatcaccctc aagctaactt ggttgaaaga tcaaaccgag aattgaaaat 6600 atatcttcga caatttattt taggtaatcc tgagtcgtgg gactcattgt taccattttt 6660 tacgtttgaa tataattcaa cgataaattc atcaaccggt tactcaccgt ttgaattatt 6720 gtatggaaga gcagctagaa tccctacatc tatttttgca tacaaaaatg atagcttaac 6780 atacgacgat tatatcgccg aacttagagc aactcttaaa ggcattcatg aaaaagccaa 6840 acaaaatttg ataatttcta aaaataaaag aaaaattatt tacgaccggc attccaacga 6900 atggcaaccc atgtggggag atttagttct tgtgcaatct attccttcag gagtagggaa 6960 gaaactccaa agtctttgga gaggacctta cgaagttgtt gatattccaa gtgaacaaac 7020 aacaatcatc aaaaatggaa caaaattaga gaagattcac agcaatagat tgaagaaatt 7080 ttatgactag taacttcaat ttgtgaaatt taaacaaaaa aaaataataa taagaaaatc 7140 gaaaagaagc aaggtttatt aaagatccaa gcatctaatc tttatacgaa tatttggtat 7200 ttcttgtgtt catttttcgg ttgatttaaa acaccgttaa tatttttttt caaatccata 7260 gggctggaat gcgaatcata gggaaagcat tgatattttt agatggataa cggatggaaa 7320 tataattctt gacctacaaa tagatatttt tctggctaaa ttggattaaa agctttgaag 7380 atttatacgc gcctcctcac aaacgaaaaa gaaaggctag aacgaagcca tcggatgtag 7440 aggggactat ttcagtttaa agtattctga atagttttgt gctaacattt tccttaagca 7500 attccaattg agcttcaaaa attcaaacag actcaagaat gtgtttttgt ctcaaggaaa 7560 gaaaaacaaa agcaattgaa agttggaaat gtttcgcatt cttgtactgt ttttggaaaa 7620 catgaataaa tggtcattaa aagagataaa aaaaatataa ataaataatt caatctaaaa 7680 atgaaccaag atcaatgaag tatacacaaa aaaaaaattg taaacgaatt taagcgccat 7740 gtaacgctca aaataaatac tttcaaagaa ttgtggatgg aaaaaggata ttgatcaaat 7800 atttataatt caataaaaaa gaaattttgc ttaatcaata aaattataaa acaataaaaa 7860 gttttccaag tatgacaata atatctgata tctattttga atttcagaaa aaaatagcgg 7920 atccggatct caattttaag aaaacatata gacaaaaatt attatcaaat ctaagtatat 7980 ataatatgaa aagccaagaa aaccattctt caacaaacaa tgttggtcgc attacaaagt 8040 tgctgatcaa ataattgaaa aaaatatata ttattaaaca caatcgcgat gatgcaggaa 8100 ataattttgg atgaaaattc ttatgagaat atgaaacgcg aagcattata ctgcataaac 8160 aaaatattag aagatatcaa cattcttcta tcaaatcaac attatattct tctcatagca 8220 cgatagtttc aaattacacg gttttaatat ttaaaatgta tatagaactg gccagttcac 8280 acataaacta tccttaataa ataatcgtta agatttaaag tttagccagt tcacacataa 8340 actattctta ataaataatc gttaagattt aaagtttagc cagttcacac ataaactatt 8400 cttaataaat aatcgttaag aattaaattt tagccagttc acaaattaat tattgtaaaa 8460 aaaaaattat cgtaaatttt aaattttatc aactaattga aaatattaaa tatgtttcat 8520 aaacataaaa taattcaacg atttgtaaaa gaaaaatgtc tcaattcaaa ggagacattg 8580 cattttggaa caaatatcat ttaagcgaaa aataaaaaaa aatgaagtgt aggaaactag 8640 aacgatttaa ttagaattag accaacttta cggaaatcaa tggatcagaa aataaaaacc 8700 caaaggttga gaacaataat aattaccaaa cggcctaact aacgcaaaaa aaagacataa 8760 aactaacaag aataataaaa atgcgagcca aataacaaaa ggtctaaaca agtatgtacg 8820 aatcgaatga ctgcgattgc atgattgacg ctgtatggac aaggaggtgc accccagcat 8880 gtttcaaaag agtgtggttt tgattcaatt atcaagtggt ggtatagaac acatttttgt 8940 cacatttatt aataactatt tttcttttcc aacagattca gtcccacctt cgaaaggaaa 9000 ggggaagata agattttgta taaacaaaat ccggatacat tctacagtta atagcatttt 9060 tttggtgata gacaggactt tgactacgac aaaatatggc tactaaaaaa aagaggttgc 9120 gcattttcaa taacctaagt ttttgcagaa tgtgcgcaac cacaatacgg gacaaatttt 9180 ctcgatttaa aaaacgattt caatctacaa gtaattacgt tcaacttaat aaatataaat 9240 attaaatgtg atggatttcc ctctcgaatt tgtcgcgatt gaattaaaaa gtttttaaat 9300 atataaatga tgcattgaac agccaatata tggactaatc gtaaattgaa atcggttttt 9360 aacggaacat acttatatgt ttcagtttga catttaaatt tacactgaca catcatcaga 9420 agattcgcat gctactaatg cttttcaggc ggacaaaaat agcgaccata ataaactaac 9480 aatcaataac cagaacttac taacggatgc aattagagaa tctgacgagg aatcacaaga 9540 cacagcggtt actattttat cgaacgaatc tagcgacaac tcctcagaag aaggcgaaaa 9600 ggatgagatg aaatctacga agatgaagac gatgaggcga gagaaatcaa caaaaatgaa 9660 ataaaactat caggacaaac ataaaatatt aacaaacaga attttaacaa acattaaaat 9720 aataacatat taattttgca tgaaaaaaaa acttttgaac taataagatg attaacatac 9780 taaacataaa ataacattgc cttcaaacaa tataaaattg ttttatatgt attataaaat 9840 aattttctat tttgcgttat atatcaaatt ttaactgttg atcacaatgt gttgaaaaaa 9900 aaaacaaaaa aaaatatatc aaggaaaatt ttaaacaatt atatacaaag aatcacaaac 9960 ggaaatttat ttgtgctcga cgaagttcat tcaaaatatt atattattaa acctttctga 10020 aatgatacat tcaaagaaac tagatcaaac caaatattgt aattctcaaa taaatcaaac 10080 aatgaaaaca aaaacattat aaaattgtaa ttgaattaca gataatcgat tcgaatatat 10140 gttcaaccat tattataact tacatgtgcc aagttacacg caaaatgaac attcgaaaaa 10200 aaaaatgtaa tcagattata aaaaaaagta aagagacaac ttttatctcg taatgtaaat 10260 tatgttcatt taaaatatat attaaacaat aactacttat attcagtttt tctaaggata 10320 tttacatttt tttaatgcca cagttcagtt gaaaattatt ccatgctagt aattgtcagt 10380 aaaaatctac taccaaaatt aagaggttat aagtacgagt ttgatcttat aacataaatc 10440 taattataaa agcttaatac ttttgcagtg tcaatgcgaa tgcaaatatg tttctcattt 10500 tcagcgaact aaaatatgca aaactctcag aacaaaatgc acaatttcct cataaccaaa 10560 agtagccaat aaaatgatca taaaattatg aacatttaag aaataaaagg ccacgaacat 10620 gagtagaaaa caagtctcgg atcaaggttt ctgatgacta agtgctaaga cctgagaagc 10680 atacacaaaa aaaaaataac gctgaaagaa aaaaaagaat tcaaaaaaaa ttatataatc 10740 aaaaatgata atcaacatat ttagaggcta agcattgcac aagttgaagc acaataagat 10800 aaaccaaaga acttaattca aatcaaactc acatttataa tttccgtctg tcgcgaagaa 10860 aataaaagac aaaaaaacat tccttgatca ttattacaac tgaactggta tgcataataa 10920 aaaaataaat aaaaatagta aataagtgaa ctatgcaaac aaaccaaaga agatacacca 10980 aaatgaagga tgagaggcca acagcgcgaa accaactatg gtcatcatct ccacacctgc 11040 caaagagttt gcaagtcaat cccagaagac gagtactcac tgcggaggta tacggacatc 11100 agacgaagca gcctacgagc ggtacacaaa aattaccaat atcaaaacac caaggataga 11160 aaaaaaaaaa gtttagcttt ttgataaatc tctctccact tcttaattgt agaacagaag 11220 aatcaatcta ttgtaacaca agttgttaac agaaaaataa gatcataaag gaacaaaaca 11280 ttgcatccta agaaacatta tatcgaacca tccttgaaac caacaatcaa ctgacagtag 11340 atgatcacct attctagatg aacatctagc tgggaagtgt gggggta 11387 // ID DNA-2-2_HM repbase; DNA; INV; 3081 BP. XX AC . XX DT 14-JAN-2009 (Rel. 14.02, Created) DT 14-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE DNA transposon from Hydra magnipapillata - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3081 RA Bao W. and Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 374-374 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 3081 BP; 1197 A; 447 C; 427 G; 1009 T; 1 other; cccagcgggc attttgacgt cctaaggacg tccgtaagac gtcttaagtc tcgtccgtaa 60 gacatcctta gttggtctca aatgaaaggc ttgaggacgt ctattccaga cgtcctaaag 120 acgtcttttt aagacgtcct agagacgttt tttgcagacg ttctagagac gtctctggct 180 ggtctcatat gaaagactta aggacgtctt tttcagacgt cctagagacg tttgaatttt 240 accaagacat tcttttaaaa gttatataaa aatgcaatga tttacaacac acaccttaga 300 tttgccttaa tctaggcaag ggaagttata aaaagaaatc aatgtaaata aagtgtaatt 360 tgatatattt aaaataaatt aataacttta caaaaaataa gaataatttt tcaaacaaag 420 ttacattttt caaattaaaa actaataggt actgtagaat ataatcgata gtgcattata 480 aatagaagaa aattgcttaa aaatggaaaa aaaataaata aaatacaaca gttaattaag 540 ccatcattga tcgaaatcca tattagtcga tgttttcttt attcggtcag aagaatgcct 600 aattactctc ccaataagtt ggatagcatc cattttggtt aagtttttat ttgactttaa 660 agcagcttct aataaaaaca acaaaacatg tttcaataag atcattaaaa taaaatatac 720 atgtcatgta tacatacact tgaatacatg tcatgttcca tagccaaaga aattaaattt 780 tttagcttac taatgtgttt gattaattaa aaattaaaaa acatcatttt taatttttaa 840 ttaatcaaac acattaataa gctaataaga ataccagtga cagcgttaca tagacttgtc 900 aactctttaa actttttttt cttctttaga caattcatgc tgtattctga caaaatagtg 960 tctgttggca gtctacatat ataaaaaaga agataaatgc tgtactgtca atcataaatg 1020 ttaaaattca aacttttgct aattacttac aagagcaaaa tatttttaat tgtatcagct 1080 agatttttac ccccaattaa tgataactgt tttacctaaa aagaaacaat gtattagaaa 1140 cattgaaaat gtttataaca atttaaaata aatattgtaa taatatgtaa catataaaat 1200 ataaagttga aactcatgta tacaaaagtg atgtattgga tatctccatt tgcgaaattt 1260 ttttttcaaa acattaaatt tatctgaact tttacgttgt ttaaaacctt cttctggatt 1320 attagagaac agtgcagttc tatctactaa aatttttttc atttcagcaa gttggaatat 1380 aatttttttc taaaactctg aaaagtaaaa tatataagtg aagccatata aatagtatat 1440 aaaatagtaa atagtacttc ctatatatgg ttgaaaagag gtagatttaa actgaataat 1500 gcaaaaaatc tcaagtttca tgcttaacct taaaaaaata tagaaacaag taaagttcaa 1560 gaaaattata gggtttttgt ctaacgccaa aaactgtaaa taattgctac ctatacacaa 1620 acgcacacac tatatatatt ttgaatgtag acatcttgta tgagagtatg taaattatta 1680 ttttataaac atcaataaat acgattttct ctcaatacta aatttatttt ttatttttta 1740 ttttttaaaa gaaattttcg ctggattgtt gtgaccagcg tcttcagctt aaaagatttt 1800 gtttgttttt gtttatatgt tatttttaaa tatataaaac taaagttttc gggcttagca 1860 ggttttaagc tcttttggaa tcaaatcttt tgtaccaagt ttcagaattg cagcattact 1920 aaaagcacca tataagggta atattaaaca gaaacatttt tggcactttc aagaaataaa 1980 tcaagaaata aatctaaact acaaagaaac agcacacccg tttaccttat taagtgccat 2040 tgtaacaaaa agtatattgg tgaaaaaaaa tgcaaataaa acacgatgca acaacacaaa 2100 aacagtacat agaaataaaa cagaaagatc agcattagcc acacacatga aacattgtgc 2160 acaacaaatt aactgggaag ttcagaactc taaaagtcaa taacaagttt gataaaaaag 2220 tttgatagga aagttcgtga ggcgcttgaa atacagtacc accaatgtgc ctcaacaaag 2280 tggtatcaat ttagatgagg gacaatatgt gtcaacaaaa ttctggacac catttctaat 2340 gcacttacgt caaaaaagac gaaaccattc cattgacata taataaaatt gccgtttaaa 2400 aataacatat aaacaaaaac aacaaaatct tttaagctga agacgctggt cacaacaatc 2460 cagcgaaaat ttcttttaaa aaataaagaa taaaaaataa atttagtatt gagagaaaat 2520 cgtatttatt gatgtttata atatataaca cagacacaca aaatatcaat actattctat 2580 aaaacacaaa gtgaaaagtc ttggtgcaat tttgatgtct tttgttaaaa ttccctcatt 2640 atacttgtaa cttagttact cttttattta aatattgtat cgattatagt ttatagatca 2700 tagagtatag aatatagaaa attaaaaaat atctttaaac gataaataac taaacaaaat 2760 aatagtctta aaataatcaa ccaaaacaag tagacatatt taaataaccg agtttgcgca 2820 taatattgta tgcgccaagt tgatagtatt aattttttta cttattttta aggtaattag 2880 gacgtcttat gtaggtctta gcgacgtcaa taggacgtct aaatattgaa gtcctgatat 2940 cgtctttata ttaacgtcac aaggacgtcc taacaaagac gtctttacca gacgtcttta 3000 ggacgtccta atttggtctg agacgtcgcg acctagttaa gacgtccgta agacgtttga 3060 ttsacgtctg tgcccgctgg g 3081 // ID Mariner-37_SM repbase; DNA; INV; 2178 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-37_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2178 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1886-1886 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 249..1916 FT /product="Mariner-37_SM_1p" FT /translation="MGDNEKQENKATRKSISLETKMQVIRRLDSGERQSQI FT SAALNLATSTIRTILKNKEKILSSATATTSSSATRITRSRNNIIEEMEKRL FT SIWIDDEIERNMPLSQSIIMEKARRIFNYIQAEASDISETFVASRGWFNRF FT KHRNNLHNIKITGEAASGDTKAAAEFPATLKTIIEQGNYPPELVFNVDETG FT LFWKRMPKRTFLSREEKRAPGFKAAKDRLTLLLGGNASGDFKLNPLLVYHS FT KTPRAMKGISKSTLPVIWESNKKSWITMKIFRDWFTEHFCPSVKRYCEIKK FT LEQRALLLIDNAPSHPRNLSDLTTCIPVEVVFLPPNTTALIQPMDQGVISN FT FKAYYLRRTFKQMFEKTDGEEKQSIREFWKNYNIMNAVANINLSWNEITER FT SLKGVWKNIWPDLSKSEDIGHSVDMNEIVEEIVELAKQTGLDEVNVEDVEE FT IVQETAASFSNDELKELTAQEENKNIDSSDFEEDQKELSMAFVKRSLTTIT FT EIMDQFVENDPNFDKSSKARRGVIDALSTYQQLLTERKRKTQTTLDAFLTK FT TQKVGNTE*" XX SQ Sequence 2178 BP; 796 A; 361 C; 405 G; 616 T; 0 other; cagtaaaacc ccgcttaacg ctgtttcggt ttagcgctct tttatttcaa cccgggtatt 60 cctcgaaatt gctcgcagaa ctacggggat tccccgcgaa acagtacagt tgtgtttctt 120 gtttggtgcg gtgttgtttg acgcattttt attatttgaa acatttgttg acattattgt 180 aggtatagat ttatataata ctataacgat taatacagtg acaaatatta tgatctttag 240 tttttatcat gggtgataat gaaaaacaag aaaataaagc aaccagaaaa agcatttcac 300 tggaaactaa aatgcaagtg attagaagat tagattctgg tgaacgtcaa tctcaaatta 360 gtgctgcatt gaatttggca acttcaacaa taaggacaat tcttaagaat aaagaaaaaa 420 tactgtcatc ggcaactgca actacgtcaa gctctgcaac tagaattacc cgttctagaa 480 ataacattat agaggaaatg gaaaaacgac tgtctatatg gattgacgac gaaattgaac 540 gcaatatgcc attaagccaa tctatcataa tggaaaaagc tagaagaatt tttaattaca 600 ttcaggctga ggcaagtgac ataagtgaaa ctttcgttgc tagtagagga tggtttaata 660 gatttaaaca tcgaaataat cttcacaaca taaagattac aggagaagca gcaagtggtg 720 atacgaaagc ggcggctgaa ttcccagcga cattaaaaac aataattgaa caaggaaact 780 atcctccaga attggttttt aatgttgatg aaacaggcct gttttggaaa agaatgccaa 840 aacgaacatt tctatcccgt gaagagaaac gagcaccagg atttaaagct gcaaaagatc 900 gcttaacact cctactgggt ggtaacgcaa gcggagattt caaattaaac ccactactag 960 tttaccactc caaaacccca agagcgatga aaggcatatc taaatcaact ttacctgtaa 1020 tttgggagtc taataaaaaa tcatggataa ctatgaaaat tttccgggat tggttcactg 1080 aacatttctg cccgtctgtt aaacgttatt gcgaaattaa aaaacttgaa caaagagcac 1140 tactattaat tgataatgct ccaagccacc caagaaattt atcagactta acaacatgca 1200 ttccagtcga agtggttttc ttgccgccta acacaactgc cttaatccaa ccaatggacc 1260 aaggcgttat atccaatttt aaagcttatt atttgaggcg aacatttaag cagatgtttg 1320 aaaaaacaga cggagaagag aagcaatcaa taagagaatt ttggaaaaac tacaatatca 1380 tgaatgctgt agcaaatata aacctttctt ggaatgagat aacagaaaga tctttgaaag 1440 gagtatggaa gaacatttgg ccggatttaa gtaagagcga agacattgga cactctgtcg 1500 atatgaatga aatagtggaa gaaatagtgg aattagcgaa acaaaccggt ttagatgagg 1560 taaatgtgga agacgtcgaa gaaatagttc aagaaacagc agctagtttt tctaatgatg 1620 agctcaagga attaactgcg caagaagaaa acaaaaatat tgatagtagt gatttcgaag 1680 aagatcaaaa agaactttcg atggcgtttg taaaaaggag tttgaccact ataactgaaa 1740 ttatggacca atttgttgaa aatgatccaa attttgacaa gagttcaaaa gcaagacggg 1800 gtgttataga tgctttatcc acttatcaac aacttctaac agaacgcaaa aggaaaacgc 1860 aaactacatt agatgccttc ttgacaaaaa cgcagaaagt tggaaacaca gaatagactg 1920 aatgatctga ctgactaaaa atttaattgt ttttataata tattcgttta tattgttatt 1980 tatttttgta ataaataaat gtttggtttt atttttataa aaaacaatca taatttttaa 2040 atttaatacc taacgaattt tctatccaac ttccccaaac cccattattt acatgttaag 2100 atagcctcgt ttagcgctgt ttcgcattac gctctacttt gatggaacgt atccaccgcg 2160 ttaaacgggg tcttactg 2178 // ID DNAX-4_AP repbase; DNA; INV; 172 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-4_AP. XX NM DNAX-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-172 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2057-2057 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD duplication unclear (it could be TA or TATA) CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 172 BP; 39 A; 54 C; 39 G; 40 T; 0 other; ctctgttcaa aataagaaac gcaaacggcg gcgaccagtt ttcgtcggcg gcagtcaggt 60 aacacccgtt tgtcaccccc accgagtcac caatctatat acctgccgtg tagtattgcc 120 acgcccctgc gcacaagtcg gccgccgtct gcgtttctta ttttgaacag ag 172 // ID CR1-97_AAe repbase; DNA; INV; 4745 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-97_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4745 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1185-1185 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. Closely related to T1 and Q. XX FH Key Location/Qualifiers FT CDS 332..1216 FT /product="CR1-97_AAe_1p" FT /translation="MVSTCQSCANEITDAHVTCRGFCNEQFHLACCNITSS FT VFDEVMGNGQLFWMCKTCSHLMNDIRLRESVRVAYESGQERVLTAHSEIVA FT CLKQEILIELKNEIKSNFSKLINSSSMTPRSSKYSAPTVFSSRRRRLFGKP FT ALPQPQSVIPGTSESLSPSFENFVATAPSQKFWLYLSRISRGVTSEQVRSL FT VLRRLGTDDVEVVRLVAKNRDVNTLSFISFKIGLSANFKDRALSSSTWPRG FT VYFREFQNLRSNENFWKPQVTPNQTDAIAVNPPINACSSAEDTSMMEQGNR FT DAH" FT CDS 1228..4650 FT /product="CR1-97_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAFTLLDTVPPFAASDHRRRRGPVVESVVEVLQPAF FT TGKYLNYRNLSLPDRHSDSRLCTIMHPSIHEASSSTTPLYINETYPHIISS FT SSQSIPGRPQLSSMEASDPPSSVVPSVRAAIANQSQHTRSCRQSRPGPESS FT AGDEVFQTAIHGMYPITIGNSRPDETSTHRSSSPEVNVANADSNILMYYQN FT VGGINSSIAEYQLALSDGCYDVYAISETWLNGSTSSSQLFDSSFSVYRQDR FT SASNSNKSIGGGVLLAVRSRIKSRIINPPNSSAIEQIWVAISTVDATLYVC FT VIYIPPDRVNDAVLVERHIESVNWIVSQMGPRDNVVILGDFNLSTITWHRN FT PSGVLFPNISRSSIGQTSRELLDGYSTARLNQMNGVENEYNRTLDLCYVSE FT ELCANCTTLQAPAPLVKIARHHPPLLIKLEIDPFRCCHDSSESVSYDFSRA FT NFDGMNDFFSRMNWDEILCDSDANLAASTLSGVLLYAIDQFVPVKSKCQPT FT KPVWSNSVLKNLKRVKQAALRRHSKNRTDSTRVEYLEANTAYKQLNDRLYN FT AYMDRLQSRLKTNPKSFWHYVNDQRKESGLPSTMSNGLIDADSTTSIADMF FT RSQFSNVFTNEQLGPDDVVTATQNVPHFPAVGLQFAITNDAVVSAGKEMKS FT STGCGPDGIPSLVIKRCINSLAAPLTTVFNLSLESGVFPYCWKQSYVIPVF FT KKGCKRTVSNYRGIASLSATSKLLELIVLKNLVQCYSHYISQDQHGFIPKR FT STTSNLTCFTSYLIRQIECGHQVDAIYTDLSAAFDKMNHQIALAKFDKLGM FT NDRVLCWLRSYLTGRSMSVKIGDYVSPAFQVWSGVPQGSHLGPFLFLLYMN FT DVNFILKCMKLSYADDLKLYFTIKQPRDAIFLQHQLESFAEWCRVNRMSLN FT VSKCSVISFGRKHSLVHFDYALGDEQLKRESTVKDLGVLIDSKLSFKDHIA FT YIIAKASSQLGFLFRFAKKFKDAYCLKSLYCSIVRPILEYSSVVWAPFYQN FT EVHRIERIQRKFIRFALRHLNWRNPLNLPSYESRCQLIDLEPLVARRNVAK FT ACFIGDLLQGNIDCSTLLGMLDINIRSRNLRSHSFFSIPSARTNYGQHEPI FT RSMSRLFNKCYFVFDFNVSREANKRSFKRVFCY" XX SQ Sequence 4745 BP; 1241 A; 1108 C; 992 G; 1404 T; 0 other; gttgattgtg ttccaaccgt cgctattttg ttatcgttgt tcgacttgtt gtatttatgt 60 taaaatactt tctacttttg gattgtatat tttaaagcgt aaagtcgcgt gttccagtaa 120 aagttaatag ttgactacct gataaccgtc aatagtgaat ttgccacatc atcatccagc 180 cgacttgtag tgtcaacata ccagtcaaca tcaagcatcc ccgagctgtt gctgttgctc 240 tcaattgttg catacaatac ggtgtacttc actgctgcag acctgtaggc gcatcttcat 300 tcaacgctct aattacttgc gcacgagcat tatggtttcg acttgccagt cgtgcgcgaa 360 tgaaatcacc gacgcgcatg ttacttgccg tggattttgt aatgaacagt tccatctagc 420 ctgctgcaat atcacttcat ctgtcttcga tgaagtaatg ggaaacggtc aactgttttg 480 gatgtgcaag acgtgttcgc acctgatgaa tgacattcgt ttgagagagt ctgttcgcgt 540 ggcatatgaa tcaggccagg aaagagttct aactgcacac agcgagatcg ttgcatgttt 600 gaaacaggag attttgatag agttgaaaaa cgaaatcaaa tccaattttt caaaactaat 660 caactctagt tcaatgacac caaggtcttc gaagtattct gctccaacgg tcttctctag 720 tcgtcgccgt cggctttttg gaaagcccgc tttgccacag ccacaatcgg ttatacctgg 780 aacttcggaa tctttatcac cgtcatttga aaattttgtc gctacagctc ccagtcaaaa 840 gttttggcta tacttatctc gcatctcgcg tggtgttacc tctgaacaag tgcgttcttt 900 ggttttacgt cgcttaggaa cggatgatgt ggaagttgtc cgtctcgttg ccaaaaatcg 960 agatgtgaac acgctatcgt tcatatcttt taaaattgga ctgagcgcaa attttaaaga 1020 cagggcttta tcttcttcga cttggcccag gggagtttac ttccgtgagt tccaaaatct 1080 ccgatcgaat gaaaattttt ggaaacctca agtgacaccg aaccaaaccg atgcaatagc 1140 tgtgaatcca ccaatcaacg cttgctcatc agcggaagac accagcatga tggagcaagg 1200 caaccgcgac gcacattaga aagcactatg gaggccttca cactactcga cacagtcccg 1260 ccttttgcag ccagcgatca tcgtagacgt cgcggtcctg tcgttgagtc tgttgtggag 1320 gtcctccagc ctgcttttac aggcaagtat ttgaattatc gaaacctctc cctgcctgat 1380 cgacattccg attccagact gtgtactatt atgcaccctt ccattcacga agcttcatcg 1440 tctacaactc cgttgtatat caacgagaca tatccgcaca tcatctccag ctcatcgcaa 1500 tctatcccgg gacgcccgca actaagcagt atggaagcct cggatccgcc tagctcagtc 1560 gtgccttctg tacgtgccgc catcgcaaac cagtcgcaac atacacgttc ctgtcgtcaa 1620 agtcgtcccg gccctgagtc tagtgctggt gatgaggtct tccagactgc cattcatggc 1680 atgtatccta ttactattgg taattctcgt cctgatgaaa cttcaaccca taggtcctca 1740 tcacccgaag tgaacgttgc gaatgcagat tcaaatattc tgatgtacta ccaaaatgtt 1800 ggcggaatta acagctccat tgccgaatac cagctggcac ttagtgatgg ctgttatgat 1860 gtttatgcca tctccgaaac atggctgaac ggatctactt cgtctagcca actgttcgat 1920 tcatcgtttt cggtttatcg tcaagaccga tcggcttcta atagcaacaa gagcatagga 1980 ggtggtgtgt tgttggctgt gcgctctagg ataaaatccc gtatcatcaa tcctcccaat 2040 agctctgcaa tcgaacaaat ctgggttgca atttcaactg ttgacgcaac actgtacgtt 2100 tgcgttatct acatccctcc cgatcgtgtt aatgatgccg ttttggttga gcgccacatt 2160 gaatcggtta attggatagt atcacaaatg ggacctaggg acaatgttgt catcctcggt 2220 gacttcaatt tgagcacgat cacctggcat cgcaatccgt ctggagtcct cttcccgaac 2280 atttcacgat cctcaatcgg tcaaacttcg cgtgagctgc ttgatggata tagtaccgcc 2340 agactcaacc aaatgaacgg agttgaaaat gaatataatc gaactcttga tctgtgctat 2400 gtgagtgaag aactgtgtgc gaactgcacg actttacaag ctcctgcacc tctcgtgaag 2460 attgccaggc atcatccacc tctgctgata aaactggaga ttgatccttt tcgttgttgt 2520 cacgacagtt ctgaaagcgt ttcctacgat ttcagtcggg ctaatttcga tggtatgaac 2580 gacttctttt cacgtatgaa ctgggatgaa atactttgcg actccgacgc gaacctcgct 2640 gcatcaacac tttctggtgt attgttgtat gctattgatc agttcgttcc cgttaaatca 2700 aaatgccagc caaccaaacc agtatggtca aattctgtcc tgaagaacct caaaagagtc 2760 aaacaggcag ctttgaggcg tcacagcaaa aatcgcacag attctacaag agttgaatat 2820 ttggaagcaa atactgcgta taagcagctc aatgaccgac tctacaatgc ctatatggat 2880 cgtctgcaaa gccggctcaa aactaacccg aaaagtttct ggcattacgt caacgatcaa 2940 cgtaaggaat ctggattgcc ttctacaatg tctaacggat taattgatgc tgactccacc 3000 acctctatcg ccgatatgtt tcgttcccaa tttagtaatg tttttaccaa tgagcagctt 3060 ggaccggacg atgtagtaac tgccactcag aacgttcctc attttccagc tgtgggactg 3120 cagttcgcca tcacgaacga tgcagtagtt tctgccggta aagaaatgaa atcctccaca 3180 ggatgtgggc cggacggtat tccatcgcta gttatcaaac gttgtattaa ttcgcttgct 3240 gcaccgctaa caacagtatt caatctctct ctagaaagcg gtgtttttcc gtattgttgg 3300 aagcagtcgt atgtaattcc agtgtttaag aaaggctgta aaaggaccgt ctcgaactac 3360 cggggtatag catctttgag cgctacctcc aagttgctgg aattgatcgt tttgaagaac 3420 ttagtgcaat gctactcgca ttacatatct caagatcagc acggatttat tcctaaacgc 3480 tcgacgacct ccaacctaac gtgctttact tcttacttga tacgccaaat tgaatgtggg 3540 catcaggtag atgcaatcta caccgacctt tccgcagcct tcgataaaat gaatcatcaa 3600 attgctctgg ctaagttcga caaattaggc atgaatgaca gggtcctatg ctggcttcga 3660 tcctatctaa ctggccgtag tatgtctgtg aaaatcggcg attatgtttc tcctgcgttc 3720 caagtttggt ccggtgtacc ccaaggcagt catcttggac cgttcctgtt cttgctatac 3780 atgaatgacg taaacttcat tcttaaatgt atgaaacttt cgtatgccga tgacctgaag 3840 ctatacttca cgataaagca accgcgtgat gctatttttc tgcaacatca gctggaatct 3900 tttgccgaat ggtgtcgtgt aaatcgtatg tcactaaatg tatcgaagtg ctctgttatt 3960 tcattcggcc gcaagcattc actcgtgcac tttgattatg cattaggaga cgagcagtta 4020 aagcgtgaat caacggttaa ggatttggga gttctgatcg actctaaatt gtccttcaaa 4080 gaccatattg cttacatcat tgctaaagct tcatctcagt taggtttcct ctttcggttt 4140 gctaagaagt tcaaggatgc ctactgcttg aaatcgttgt attgttcgat cgtgcgtccc 4200 atactggaat attcttcagt tgtctgggcc cctttttatc aaaatgaagt acaccgcatt 4260 gaacggatcc agcgaaaatt tattcgtttt gcattacgac acctaaattg gagaaacccg 4320 ctcaacctac caagttatga gagccgctgc cagctcatcg atttggaacc actcgtagcg 4380 aggcgtaatg tggcgaaagc ttgttttatt ggagaccttc tgcaaggcaa tatagactgt 4440 tctacattac tcggcatgct tgatataaac atacgcagca gaaaccttcg gtcacactcg 4500 ttcttcagca ttccttctgc gaggacgaac tatggtcagc atgaaccaat acgcagcatg 4560 tcccgtctat tcaataagtg ttattttgtg tttgatttta atgtgtcacg ggaagctaat 4620 aagcgtagtt ttaaacgtgt attctgttat taagattttt tctatattgt acgtttagct 4680 taagttttgt cattggggtg atttcttacc tgttgactaa atcaataaat aaataaataa 4740 ataaa 4745 // ID Gypsy-7_SI-I repbase; DNA; INV; 4207 BP. XX AC AEAQ01015416; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_SI_; KW Gypsy-7_SI-LTR; Gypsy-7_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01015416; Positions 8549 4343. XX CC Positions [3182-3634] - Integrase core CC 'AAAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 77..4192 FT /product="Gypsy-7_SI-I_1p" FT /translation="MTTVGNLKPFDPAIDDIEVWTRTFKAFLLANNMEYTP FT IVPEEADETKILMSRRCVATLLSTLGLSVVGTLMSLLSPAQPEDKTLDELI FT TILKIHYKPPPKALAERYRFMGRKQNQGESVAQFLAELRRLAANCKFDNDL FT NIRLRDQFVFGLRSEVAQKQLFTKPDEVTLEDVVALATAQELSEQSLSLVR FT GNTNAQQKEEVNKVTKNIPGKQSNISKNNSQYKGGGKPTYNQKYKKECQRC FT GSTDHNGIKNCPHKETKCYACGKFGHLSFKCRSSGQKQYYPQSKKPQASQS FT NMVDIRATEVLSTNQMEKKKIWITLKINGIPHQMEIDTGCERSMVSKEFWK FT TLGKPTLSKSTLQFKTYTNEIFHAIGELNCKVKYNNETIEHVFPVTHGSSL FT FGRDLLRKIKIDWADIAAQCNKIQENLTLEGILKEFSDIFEEPRGRIKKFK FT AKIVLKDDATPKFMKSRPIPYAIQEKVDQELDRMEKSGIIERVDHSEWASP FT LVVVPKPNGRVRITGDFKNTVNAQLHITQYPIARPEQIFNTVGGSSIYSKL FT DGSNAYHQMEVEEECKKCLVVNTHRGLYRYNVLPQGISSSPAIFQEFSDKM FT LQGIQRTGTYIDDTILGDTNEREHLQNLRKIFKRMRECNYYVTKEKCEFGK FT SHVEFLGHILSKRGIHTDPNKVSAMEIIQRPQNVTELKSFLGLVNFYNKFV FT PNFADICEPLYRLTRVNEKWNWTSKCESAFNRVKSTLVSAPMLMNYNPDLP FT IGISCDASSIGIGSVLFHRIQEGGKTIEKPIAFTSRVLSAAEKNYSQIEKE FT GLAIIHALKKFYRYLCGRQFILVTDHKPLVHIFNPKTTLQPYAAARLHRWS FT IYMSQFQYDIEYRSTHEHGNADILSRFPDKNTLGEEEEAKEVNLIAEENME FT KLPITYKEIRIATARDKVLSKVLSFINNKWPGAIEKEEEELQSYFKKREEL FT TTHQGIIIWGLRVIIPKVLREKLLKNLHETHAGIVRMKALARQFIWWPNMD FT KEIEELARSCTNCCSNRPDPPSAPLHPWQFPEKPWQRLHIDLAGPLQNRMF FT LIIMDAHTKWPEVYDMKTDTTSKKVIEKLRDSFVRFGIPEQIVSDNGRQFI FT SAEFQRFCKNNGIRHTTSSVYHPRTNGEAERFVQTFKKAVHNSEGDLTYRI FT QRFLFNYRCTPHSTTGASPTELLIGRRPRSLLDLIKPDVFKTVAAAQARQE FT KSYNTRVKNRKFEKGEEVWVRTFSRNKAKWSLGTIIKALGPVTYLVRVGDQ FT HYKRHVDQMYNAMPLQLNFEKAPENDALSRRLIKKEPDSDSTSPEGNEEER FT PETAEEGAGSSESEQSNSREHPGQNRNENTPSCGRPQRTRKPPDRLEYKAR FT GTQKEYGSK" XX SQ Sequence 4207 BP; 1601 A; 746 C; 866 G; 994 T; 0 other; tatttggcga cgaggagaaa actgaacgag acgaataatt ctatcaagaa aagagtgagg 60 ttagaatcat cacagaatga cgacggtagg caatttgaaa ccttttgatc cggcaataga 120 tgatatagag gtatggacac gtacttttaa ggcatttcta ttagcgaata atatggaata 180 cacgccaata gtaccagaag aagcagatga aacaaaaata ctcatgtcga gacgatgtgt 240 agcaacctta ctgtccacgt taggtttgtc agtagtagga acattaatgt ctttgctctc 300 tccagctcaa ccggaagaca agacacttga cgaattaatc acaattctca agatacacta 360 caaaccaccc ccaaaagcac tagcagaaag atatcgcttc atgggtagga aacaaaatca 420 aggagaatcg gtggctcaat ttctagcaga attaagacgc ttagcggcaa attgtaagtt 480 tgataacgat ctcaacattc ggttaagaga tcagtttgta ttcggcttac gcagtgaagt 540 agcacaaaaa cagctcttca cgaaaccaga tgaagttaca ctggaagacg tggtcgcatt 600 agctacagcg caggagctct cagaacaaag tctgtcacta gtccgaggaa atacaaacgc 660 tcaacaaaaa gaagaagtaa ataaagtaac aaagaatata ccaggtaaac aatcaaatat 720 ttcaaaaaat aattcacagt ataaaggagg aggaaaacct acatataatc aaaagtataa 780 gaaagaatgt caaagatgcg gttctacaga tcacaatggt ataaaaaatt gtcctcataa 840 agaaacaaaa tgttatgcat gcggaaaatt tggtcactta tcttttaaat gtaggagtag 900 cggacaaaaa cagtactatc cacagtcgaa gaaaccacag gctagtcaga gtaatatggt 960 ggatattaga gcaacagaag ttttatcaac aaatcaaatg gagaaaaaga aaatttggat 1020 aactttgaag atcaacggca ttcctcatca gatggagata gacacaggct gtgaaagaag 1080 catggtttca aaagaatttt ggaaaaccct agggaaacct actttatcaa agtcaacgct 1140 acaattcaag acgtacacaa atgaaatttt ccatgcaata ggtgagctaa attgtaaggt 1200 aaagtacaac aatgaaacaa tcgaacatgt atttcctgtg acacatggtt catcactatt 1260 tggaagagat ttattgcgta aaattaaaat agattgggca gatatagccg cacaatgtaa 1320 caaaatccaa gaaaatttaa cgctagaagg cattcttaaa gagttttcag acatatttga 1380 ggaaccaagg ggacgcataa aaaaatttaa agcaaaaata gttcttaagg atgatgcgac 1440 acccaaattc atgaaatctc gccctattcc gtacgcaatc caggagaaag tagatcaaga 1500 actagaccgt atggaaaagt ctggaataat tgaacgagta gatcacagtg agtgggcatc 1560 acctttggtt gtagtaccaa agcctaatgg acgagtaaga atcacaggag actttaaaaa 1620 tacagtgaat gcacaactcc atatcactca atatccaatt gccagaccag agcaaatttt 1680 caatacagta ggtggcagtt ctatatactc aaaattggat ggcagcaacg catatcatca 1740 aatggaagta gaagaggagt gcaagaaatg tttagtagta aacactcata gaggactgta 1800 ccgttacaac gtccttccac aagggatatc atcgtcacct gctatctttc aagaattctc 1860 agacaagatg ctacaaggta ttcagcgtac aggtacttat atcgatgaca cgatcctagg 1920 tgacacaaat gaaagagaac atcttcaaaa tttgcggaag atttttaaaa ggatgcgtga 1980 atgtaattat tatgtaacaa aggagaaatg tgaatttgga aaatcacacg tagaatttct 2040 agggcatatt ctttcaaaaa gaggtatcca cacagatcca aataaagtgt cggcaatgga 2100 gattatacaa cgtccacaaa atgtcacaga attaaaatca ttcttaggac tggtaaattt 2160 ttataataag tttgttccta attttgcaga tatatgtgaa cctctctata gacttacacg 2220 ggtaaacgaa aaatggaact ggacaagcaa atgtgaatca gctttcaatc gtgtaaaaag 2280 cactctcgtt agcgccccga tgctaatgaa ttacaatcca gatcttccca ttggaatcag 2340 ttgtgatgca tcatcaattg gaataggcag tgtactgttc catagaattc aagaaggagg 2400 gaagacaatt gaaaagccaa tagcattcac atcacgagta ctctcagcag cggagaaaaa 2460 ttactcacaa atagagaaag aaggacttgc cataattcac gctttgaaga aattttaccg 2520 atatttgtgt ggtcgacaat ttattttggt aactgatcac aaaccactgg tacacatctt 2580 caatccgaag acaacattac agccatatgc agcagcacga cttcatcgtt ggagtatata 2640 catgtcacaa tttcaatacg acattgaata tagatcgaca catgaacatg gcaacgctga 2700 catactatca cgctttccag acaaaaatac attaggagaa gaagaggaag caaaagaagt 2760 aaatttaatc gcagaagaga acatggaaaa attgccaata acgtataagg aaatacgaat 2820 tgctacagca agagacaagg tattatccaa ggttctatct ttcataaaca ataaatggcc 2880 aggagcaata gaaaaagaag aggaggaact acagagttac ttcaagaaac gagaagaatt 2940 aactactcat caaggaataa taatttgggg acttcgagta atcataccta aagtattaag 3000 agaaaagtta ctcaagaatc tccacgaaac acatgcagga atagtgagga tgaaggcatt 3060 agcaagacag ttcatatggt ggccaaatat ggacaaggag atagaagaat tagcaagaag 3120 ttgtaccaat tgttgttcaa ataggcctga ccctccaagt gctcctttgc atccatggca 3180 atttccagag aaaccttggc agaggcttca cattgattta gctggaccgt tacaaaacag 3240 aatgttttta atcatcatgg acgcacatac gaaatggccg gaagtatatg atatgaaaac 3300 agataccaca agtaaaaagg tgatagaaaa gctaagagac agttttgtga gattcggtat 3360 tccggaacaa atagtgtcag ataatggcag acagtttata tcagctgagt ttcaaagatt 3420 ttgcaagaac aacggaatta gacatacaac aagttctgta tatcaccctc gtactaacgg 3480 agaggcagaa cgctttgtac agacattcaa aaaagcagta cataattcag aaggagattt 3540 aacgtatcgg attcaacgat ttttatttaa ttatcgttgc acgccacatt cgacgacagg 3600 agcatcgcca acggagttac ttataggacg aagacctaga agtttactgg acttaattaa 3660 accagacgta ttcaaaacag tagcggctgc tcaagcaaga caagaaaaaa gttataatac 3720 tcgcgtcaag aatcgaaaat ttgagaaagg agaagaagta tgggttcgta cattttcaag 3780 aaataaagct aaatggtcgc taggaactat tattaaagct ttgggtccag tcacgtattt 3840 ggttcgagtc ggagatcaac actacaagcg tcatgtagat cagatgtata atgccatgcc 3900 gcttcaactt aattttgaga aagctccaga gaatgatgct ttgtcaagaa gacttataaa 3960 aaaagagcca gattcagatt ctacatctcc agagggaaac gaagaagaaa gacctgaaac 4020 agcagaggaa ggagcaggtt cttcagagtc agagcaatcg aattcacgag aacatccagg 4080 acagaacaga aatgagaata ctccatcatg tggcagacct cagagaacaa ggaagccgcc 4140 ggatcgtctc gagtacaagg ccaggggcac ccaaaaggaa tacggttcaa aataaggggg 4200 gaggaaa 4207 // ID Gypsy11-LTR_Dya repbase; DNA; INV; 514 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy11_Dya; KW Gypsy11-I_Dya; Gypsy11-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-514 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1083-1083 (2009). XX DR Genome; chr3R; Positions 734632 734119. XX SQ Sequence 514 BP; 146 A; 122 C; 119 G; 127 T; 0 other; tgttaatgag tagatttaag tcgatataag ttcggggctg ttaccctata ttttatatgc 60 aaattcagca taaactctgg ggagatatgc tgtttttgct agtgtcagca ctttgctgga 120 cgagcggtcg gcttgccgac cgcgaatagc ttctgtaaaa tagaataagt tttcagtttg 180 atttgtataa caataaagag cagttgctca taaatcctac tccagctaaa gctgcattta 240 ttgaattggt tatctagcct taagggcagg taataacgga gatagttctc ccatcgtaaa 300 ctgatcacaa tcaggaacgg cgctgagcaa aggccagcga accaccccca cctagctgag 360 gagacgacta cgttgacgat tacgtcatcc ccggaggaga agatctccag cacatacagc 420 cctagcagag cagccgatcg catcggcgct gaagtactca acccctttcc ccttggaccg 480 cggcgcggga taacctaggg ctacaccaaa aaca 514 // ID TTAA25_AP repbase; DNA; INV; 512 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA25_AP. XX NM TTAA25_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-512 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2093-2093 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 512 BP; 182 A; 83 C; 86 G; 161 T; 0 other; gggtgtcggt gccggtgcgt tttttggtac aaaaacatgt cttacaggaa aaactctcac 60 gacctgtaga attgtcggat ttcgttaatt ttttttttat cttaaagtag agaacatttt 120 gtaggtcgga tcaaagcccg attttgaaaa ctataattat agaaactgga atatgcattt 180 gaaaaatgca taatatttgt cctttttcaa acgtattttt ctactactat tcaaagtaaa 240 ataaaaaatc gagctttgat ccgacctgta gaaacttctc atctttcaga taaaaaaaaa 300 attaacgaaa tccgacaatt ctacagggcg tgagagtttt tcccgtgaag tcattttcgt 360 tgatataata caccgtatcg accccgtcta aataagtatc gagctgtaga ttagtggaaa 420 agtattataa ttaaggcaat aactaaaata taaaacataa tattcaaata gcatagaaat 480 ttcgaaaatt ggaaatactg gcaccgacac cc 512 // ID Gypsy-7_OD-LTR repbase; DNA; INV; 375 BP. XX AC CABV01000577; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_OD_; KW Gypsy-7_OD-I; Gypsy-7_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-375 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000577; Positions 4084 4458. XX SQ Sequence 375 BP; 111 A; 96 C; 87 G; 81 T; 0 other; tgtcgcaacg taaggagaat agtgcactgc gcaatgagtc acgcaaagca cgggaagcgc 60 gcacgcttgc cctgaggaaa ggccattagc atggtgaccc gctacgaacg gagagcaatc 120 tcacaacgta cgttggtcac gcgaggtaag gatctagccg tcctaatctc agctaaacag 180 agttccagag aagctcacgc caccttagaa tatattagtt tacgaaccta accctagcag 240 catcatacag atattactta ccgttaaggt tcgttccacg tcaacagagc agaaaccaga 300 agacttctcc tttactatct ctgttcataa taaaagggtt cgcccggaca cgaatcagcg 360 tttgtgtgag tcgca 375 // ID Gypsy-12_AA-LTR repbase; DNA; INV; 215 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_AA_; KW Gypsy-12_AA-I; Gypsy-12_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-215 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 994-994 (2011). XX DR [2] (Consensus) XX SQ Sequence 215 BP; 68 A; 46 C; 36 G; 65 T; 0 other; tgtagtatct agataactat tatagcaata gataaattaa ttatagcacg tagctttctt 60 taacgatcga actactacga tcagtctctt tggttccgac cgttcacccg aacggacgca 120 agtgtaacaa gtctccctaa ttaaaagtga aacaataaag ttagtgctca agtgaaactc 180 caagtggtta tagtcttatt tcgtcctccg tccca 215 // ID BEL-150_AA-LTR repbase; DNA; INV; 595 BP. XX AC supercont1.243; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-150_AA_; KW BEL-150_AA-I; BEL-150_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-595 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.243; Positions 1094676 1095270. XX SQ Sequence 595 BP; 218 A; 97 C; 119 G; 161 T; 0 other; tgaagatagt gaactatttt gaatttaatt ctaactctta aaacttaata ctattatcaa 60 tgagctaaaa cttacagcta ggaataaata ggttaaaact taaactgtta tacactccgc 120 gaaacacaac acatgaaata tctatataag tacacatgca gctactgata acactgataa 180 aattattaca caacacaaca cagagataga gcacaaatga tcatatgtgt aggtaataag 240 ccgaacctac aggtagcttg tatggctagg aaagcatttg tacgagttag tagccaacag 300 attgtcgggc aataaagtac ggtttcaacc tgtaacgcaa atcaaataaa ttagtctagt 360 tcaaataaat tcgtagttcc atcctaagtt cggagtttca gtgcatgtaa ggagaaaaaa 420 ggaagtgcct gcaaatatct atcaattgag ttttgaagaa ttcgttcggt atttggtgcg 480 gttgacagat ttctcctacg ctaaataaat tgaagggcgt tagagcaacg atagaagttt 540 tggtgaagtt gggaaaggct gaatccaacc agcgaaccga ggagtgctgt gaaca 595 // ID Crack-33_AAe repbase; DNA; INV; 4113 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-33_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4113 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1249-1249 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 20..853 FT /product="Crack-33_AAe_1p" FT /translation="MIDTLASELKATVASIVTQQMVNVKSEVRLVTAAIEN FT SQDFLSAKFDNIVSEFKNLEAENERLKQQINDLSSSHSKLANIVYQLESNA FT DKYDRKSVSNNAILLGLPQVPNECVMPLVEKTVAHLGVDLPNGSIVSAAXL FT YHSNKPNTVVPIQITFSDKNIKELVFSKKKNMGTLLSTSIDHSLLINGRPT FT SVSLRNELTPLSLELLRKMREYQKELXIQYVWPGQLGGILVKKSDNCKPDV FT IKTREDLSRVIEKYTLFEKDTVSPNQGRNNQIQMHFK" FT CDS 912..3797 FT /product="Crack-33_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MAQQLNFFHNHIDEFNVTQRRNNTKQLRILQWNVRGI FT NDLCKFDEILLAIDHFVYPVDVIIVGETWLKAGNTSLYTISGYNSIFSCRN FT NSSGGLAMYIKSTIDYKIIENISCDGFHHISVELTSNGANFIVHGVYRPPS FT YDFNLFHDRLEGWLNESCTNQPCLIFGDVNVPVNLSNNNVVLKYKYLLQSY FT NFLCCNSFVTRPASSNILDHVLCKMSDAHRLTNHTILSDLSDHYLILSEFE FT LHIGSHATILTKNIVNHDRLELEFKNYIDNLGPVLNVDFCLETIISKYNTI FT LKDCTKTISKTANIKNTQCPWMNFDLWTLIKIKSNYLKKCRRDPNNVQLAE FT CLKHVSKKLDTAKKLAKKRYYENLLNNASPSKMWKNINKILGKSVNESKLS FT LVVDGVVTIESASICERFNDYFSEVGPKLAEKIGSTSRNPLRRITPIPDSI FT YLQPSTENEVILLINDLKVNKSPGPDNVPARIIKNNANSFSRILSESFNLM FT IQTGSYPECLKTAKVVPVFKSGDTRRIDNYRPISTLSVFNKILEKLLINRL FT VSFLERHNVLYSFQYGFRNGSSTTIAITELVDKILQETDANKMVGALFLDL FT KKAFDTLDHGILLRKLDHYGIRGIANNILSSYLANRKQFVSHDGSQSSYKS FT LRTGVPQGSNIGPLMFLIYINDICKLHLSGVPRLFADDTALFYPNVNANII FT INSMQEDLLLLQNYFAENLLTLNLHKTKYMVFRSPRKTIPALPRLVLGDDI FT IEKVDCFKYLGVHFDSTLSWDHHIQIVASKMSSMCGVLNRIGKFLPRKALL FT MFYFAHIHSHLNYVIVSWGRACKSKLKKLQTLQNRCIKIIFKLPLLFPSVR FT LYSDFPHNILPVTALCEEQTLILIHKILHSPTTLNNLPITIIPRIRNSRQA FT NHVTRTRAFSNFGQNRFSFIGPTKFNALPRNLQQITNLISFKTNLKRHLQT FT QIFQLLI" XX SQ Sequence 4113 BP; 1348 A; 720 C; 712 G; 1327 T; 6 other; tcaggacaag cgatcatcaa tgattgacac tcttgcttct gaattgaagg ctactgtagc 60 tagtattgtc actcaacaaa tggtcaatgt gaaatccgag gtgcgtttag taactgctgc 120 tatagaaaac tcgcaagatt ttctatctgc aaaatttgat aacattgttt cagagtttaa 180 gaatctcgaa gctgaaaatg aaaggctcaa acaacaaatc aatgatttat ctagctctca 240 ttctaaactg gctaatattg tttatcaact tgaatctaac gccgataaat atgatcgwaa 300 gtcagtatcs aacaatgcaa tcttgctagg tttgccacaa gtgccaaacg agtgtgtaat 360 gccattggtt gaaaaaactg ttgcgcatct tggcgtkgat ttacctaatg gttcaattgt 420 gtctgctgcg agkttgtatc atagtaacaa gccaaataca gtagtaccta ttcaaatcac 480 tttcagcgat aagaatatca aggaattagt attctcaaag aaaaaaaaca tgggcactct 540 tctttccacc agcattgatc actcacttct aatcaatggg agaccaacca gtgtgtcgct 600 taggaatgaa ttaacacctt tgtcattgga acttctacga aaaatgcgag aatatcaaaa 660 agaattgmaa attcaatatg tttggcctgg gcaacttgga gggattttgg ttaaaaaaag 720 tgacaattgc aaaccagatg taattaaaac aagggaagat ttaagccgcg tcattgaaaa 780 gtatacattg ttcgagaagg atactgtttc accaaatcaa gggaggaata atcaaatcca 840 aatgcatttc aagtaacctt cttaagttgt tttttktttg ttttttcatg tatttttgaa 900 ttattgttag aatggctcaa cagcttaatt tctttcacaa tcacattgat gagttcaacg 960 tgacacaaag gagaaataat accaaacagc tacgaatatt gcagtggaat gttcgaggta 1020 ttaatgacct ctgtaagttt gatgaaattt tgttggcaat cgatcatttt gtgtatcctg 1080 tcgatgtaat aattgttgga gaaacgtggt tgaaagcagg taatacttca ctctatacga 1140 ttagtggcta taatagtatt ttttcctgta ggaacaactc atctgggggg ttagcaatgt 1200 atattaagag caccattgat tataagatca ttgaaaatat atcctgtgac ggttttcacc 1260 atattagcgt tgaattaaca tcaaatgggg caaatttcat tgtccacggg gtgtatcgtc 1320 caccatctta tgattttaat ttgtttcacg atcgtctaga aggttggctg aatgaaagtt 1380 gtaccaatca accttgttta atttttggtg atgttaatgt accagttaat ttgtctaata 1440 ataatgttgt tttaaaatat aagtatcttt tgcagtccta taattttctt tgttgtaatt 1500 cttttgtcac aaggcccgct agttcaaata tattggatca tgttttgtgt aaaatgtccg 1560 acgcccatcg tctcactaat cacaccattt taagtgattt aagtgatcat tatttaatat 1620 tgtccgaatt tgaattgcat attggttcac acgcaacgat attaacaaaa aatatagtta 1680 atcatgatag actagaatta gaatttaaaa attacattga taaccttggc cctgtgttaa 1740 atgttgattt ctgccttgaa acaataatct cgaaatacaa tacgatactc aaagattgta 1800 caaaaacgat atcaaaaact gcgaacatta aaaacacgca gtgtccctgg atgaactttg 1860 acctgtggac cctaattaaa ataaagagca actatttaaa aaaatgtcgc agagatccaa 1920 acaatgtcca actagctgaa tgtttgaaac atgtttcaaa gaaattggac acagccaaga 1980 aactagcgaa aaaaagatat tatgaaaatt tattgaacaa tgcttcgcct tctaaaatgt 2040 ggaagaatat caacaaaata ttggggaaat ctgtaaatga atctaaactc tctctggttg 2100 ttgatggcgt cgtcaccatt gaaagcgctt caatttgtga aaggtttaat gactactttt 2160 cggaagttgg tcctaagcta gcagaaaaaa ttggttctac ttctagaaat ccactgcgac 2220 gaattactcc cattcctgat tcaatttatt tgcagccatc gacagagaat gaagttattt 2280 tattgatcaa tgatctgaaa gtgaacaaga gtcctggccc tgataacgtc ccagctagaa 2340 taataaaaaa caatgcgaat tctttctcac ggattttgtc tgaatcattc aatttaatga 2400 tacaaacagg ttcataccca gagtgtctaa aaacagcaaa agtggttccg gtattcaaat 2460 caggtgacac tcgtagaatc gataattatc gtcctatttc gaccctttct gtgtttaaca 2520 agattctgga aaaactgtta atcaatcggc ttgtgagttt tcttgagagg cataatgtat 2580 tgtattcgtt tcaatatggt ttccgtaatg gaagcagtac aaccatagct ataacagagc 2640 ttgtggataa aatccttcaa gaaacagatg ctaacaaaat ggttggagct ttgtttctgg 2700 atttgaaaaa agctttcgac actctcgatc atggcatttt gttgagaaaa ttggatcatt 2760 acggaattag aggaatagct aacaacatct taagcagtta tttagcaaat cggaaacaat 2820 ttgtttcaca tgacggttcg caaagctctt acaaatcgct tagaactggc gtcccacaag 2880 gaagcaacat aggtccacta atgtttctca tttacattaa cgacatttgc aaactgcatc 2940 tatctggagt tcctaggttg tttgctgatg atacggcgtt gttttatcca aatgttaatg 3000 ctaatattat tataaacagc atgcaggaag acttgttact tctccaaaat tattttgcag 3060 aaaacctcct cactttaaat cttcataaaa ctaagtacat ggtttttaga tcccctagga 3120 aaaccattcc agctttacca cgcttagtac taggtgatga tataatcgag aaagtcgatt 3180 gcttcaaata tcttggagtt cactttgact ctaccttatc ttgggatcat catatacaga 3240 tagtcgctag taaaatgtct tctatgtgtg gagttttaaa ccgaattgga aaattccttc 3300 ctagaaaagc attattgatg ttttattttg cgcatattca ttcccaccta aattatgtaa 3360 ttgtatcgtg gggcagagca tgcaaatcaa agcttaaaaa attgcagaca cttcaaaaca 3420 ggtgtattaa aataattttc aagctaccat tactttttcc ttcggttcgt ttatattccg 3480 attttcctca taacattctg cctgtaacag cgttatgcga agagcaaaca ttaatattaa 3540 tacacaaaat attacactct ccaaccactt tgaacaattt gcctattact attatacctc 3600 ggattcgcaa ttccaggcaa gccaatcatg tgacaaggac tcgtgcattt tctaattttg 3660 gtcaaaaccg gttttcgttc ataggcccaa caaaattcaa cgcacttcca agaaatcttc 3720 agcaaataac aaatctaatt tctttcaaaa ctaaccttaa acgtcacctt caaacccaga 3780 ttttccagtt attaatttag ttaaatctta ttctttattt gtatatatat gtaaaccacc 3840 ttcaaacaat agatatttta ttattaaata gtttgtaatt catatgtgct tctttaaaag 3900 gataacatat ccactagaag cacttctttt atgtaatttt gttaatatgt cattactttc 3960 cttgcttaat tgtcccaacc agtagtttaa tgtttattat tttgtagtgt tgcctgcggt 4020 tgggacttga gtgtccacta ccagggggct caagacattg agctttttgg tgtgggggag 4080 agcggagggt cactcaaaaa aaaaaaaaaa aaa 4113 // ID R1B_DAn repbase; DNA; INV; 5905 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE ananassae. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1B_DAn. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-5905 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. D. ananassae contains two subfamilies of R1. XX FH Key Location/Qualifiers FT CDS 499..1938 FT /product="R1B_DAn_1p" FT /translation="FVRNFVWRVKFELSNLNLNSEMPRGRGGRSRSGSRSS FT VASVASVASAASVAVDVRQARKSRERSISFDMESEVEPKRLDQRSSPKKNN FT GAEMAFSCPGGLNGAGLEAHGANTVAPAAATIAAIIAEPVAVAPAAVAPVA FT VLEHATARAAVAAGQGAVLAELSATQRAVRGALLSGNLQTVSASMSRYDEL FT VVALMLRVAELETRVAMPPPMTSTLHHNAVGQAAVPVAPVPVPRTKVRETW FT SCVVKGTDPELSSSQIAEKVRKEVAPALGVRVHEVRELKRGGAIIRTPSAS FT EASKVAASRKFAEVGLSVERNQAVKPRLTVFDVDTALSPDVFMEELYANNL FT KDEMTVADFKKSVHFASKPWTAADGGTTNVTLEVSEKALAALDTGRVYIKW FT FSFRCRSQVRTYACHRCVGFDHKVQECRLKVSVCRQCGQGGHVAAKCTNPV FT DCRNCRFKGLPSGHNMLSAACPVYSAVLARVNARH" FT CDS 1928..5005 FT /product="R1B_DAn_2p" FT /note="reverse transcriptase." FT /translation="MQDTNMSRLLQINCQGSYAVMCDVGQMMCERGCVVAL FT LQEPYNRYGGVRGLPANMRVFPDSRGRAAVVVNDPGIECTLVNSTDWGVCV FT GLEGIFGRLYVVSIYCKFSEPLDPYLQYMNAVLLQVSSNPVILGLDANASS FT PLWFSKMSRLGPGHRNYDRGETLAEWVITQDVQVVNKPSEWYTFNSPNGAS FT DIDVTLANEAAMRVCDFQWNVLGGQGVSDHNPIEIVVTRAVALEPNERLNG FT WCYRGANWALHGHYVREAATQVPFSTFCAMNVDEQVACVTEWVTSANDSMF FT ARHRKVTRKRVKWWTRELQAEKRSLRSLRKRFQRARRTNADGVAQLRDQLS FT TRVHEYKHRVLRVKDDEWRSFVDRNKHDPWGRVYKILRENGKEGSRDICAL FT RVEGRDLITWKDCVECLLSEFFPREDRQHSVPDAVGDADPLLMSELQWAFT FT KLRSGKAPGLDGFTGEICKSVWKSIPDHLYELYSKCVSEGAFPSAWKCTRV FT SVLLKSLDRDRSNPRSFRGISLVPVLAKVLERVLVERLQESVGSQMSDRQF FT GFREGRCAEDAWRSVKDSVRSSNSKYVLGVFVDFKGAFDYLSWASVIQRLN FT ECGCRDVALWKSYFSGRNACVVGVNEVVWKDVARGCPQGSICGPFIWNLMM FT DPLLWQLEQAWQCCAYADDLLILVEGQSRSMIEASASECLRIVGQWGDSVG FT VSLARDKTVTMLLKGSLSRSPVVRLDGSSIRYAKQVKYLGVTVGERLRFAP FT HLANLKDRLVGTVGRLRRVLRSEWGLSRKAARTIYGGLFVACAASGSPSWY FT DAVLDVRGRMKILSVQRLILLGCMPVCRTVSTEALQVLFGAVPLDLEIRRR FT AVSYKLKKGLPLLQDEWLFREMSGGNVESLGPSEWKSMLRECVLSDWQARW FT DNSENGRATYEFIPDVSFVVDRPDFGFNLSAGFLLTGHGSLNAFLHRRNLA FT GSPECRCGSRQETWKHVLCECPLYDDLRDLGGLGVVQAGGDFDFSQVLSTS FT DRISRLNVFARAVFSRRRMR" XX SQ Sequence 5905 BP; 1246 A; 1338 C; 1874 G; 1447 T; 0 other; gcgggagtaa ctatgactct cttaaggtag ccaaatgcct cgtcatctaa ttagtgacgc 60 gcatgaatgg attaacgaga ttcctactgt ccctatctac cgaaccgaac ggacgtgttt 120 tctgtgcgct ccggtgaaaa aattaatatc agccgaaata agctgccttt taagacaaat 180 taaagtgcgt tttgtgtgcg aacgttgctg ctacaaagtg cgagttgtgc ggttagtgta 240 gctaacgtat tttcgcggaa aatcgtgatt ttccaaagtg aaaattcgcc acgtggtttg 300 agcaaaacaa taaagcttta agctttggtg ctgccatatc ggtgaaaact ataaaaacag 360 ctttctatca gctgtttttg gtgctgccat atcggcataa aaaccttaag cagctgtgtg 420 tggtgctgcc atataggcat aaaacagtgg aacagctgag tggagtgtgg ccgtgtgctg 480 agaactcttt ggtcctaatt tgtgcggaac tttgtgtggc gcgtaaagtt cgaactttcg 540 aatttaaatt taaattcgga aatgccgcgc ggtcgtggag gccgcagtcg ctcgggcagc 600 cgctcgagtg tagcgagcgt tgcaagcgtc gctagcgctg ccagcgtcgc agtcgacgtc 660 cgccaagcca ggaagtcaag ggagcgttcc atttcctttg acatggagtc ggaagtggag 720 ccgaagcgcc tggatcaacg gagctcgccg aagaagaaca acggtgcgga gatggccttc 780 tcgtgccctg gagggctgaa tggagctggg ctggaagccc atggagccaa caccgtcgcc 840 cccgctgccg ccaccatcgc cgccatcatc gccgagcctg ttgccgttgc ccccgctgcc 900 gtcgcccccg ttgccgtcct tgagcacgct actgctcgcg ccgccgttgc tgctggccaa 960 ggagccgttc tggctgagtt gtccgcaacc cagcgtgccg ttcgcggtgc gttgctgagc 1020 ggcaacctgc agacggtctc tgcctcgatg agccgttatg atgagctcgt ggtggccctg 1080 atgctccggg ttgcggagct ggagacccga gtagccatgc cgccgccgat gacgtcaacg 1140 ctgcaccata acgccgttgg acaggccgcc gtccccgtag ccccagtccc agtcccccgt 1200 actaaggtgc gggagacttg gtcctgcgtg gtgaaaggga ccgatccgga gctctccagc 1260 agccagattg cggagaaggt gaggaaggag gttgcaccag cccttggagt ccgggtgcac 1320 gaggttcggg agctcaagcg tggcggagcc atcatccgca cgccgtcagc ttcggaggcc 1380 tccaaagtcg cagcgtccag gaagttcgcc gaggtgggac tcagcgtgga gcggaaccag 1440 gccgtcaagc cccgcctgac ggtcttcgac gtggacacgg cgctgtcacc ggacgtcttc 1500 atggaggagc tctacgccaa caacctgaag gatgagatga ccgtggctga cttcaagaag 1560 tcggtgcact ttgcatcgaa gccgtggacc gctgctgatg gaggcacgac gaacgtgact 1620 ctggaggtca gcgagaaggc tctggccgct ttagacacgg gccgtgtcta tataaagtgg 1680 ttcagcttcc gctgccggtc ccaggtccgg acgtatgcct gccatcgctg cgtaggcttc 1740 gaccacaagg tccaggagtg ccgcttgaag gtgagtgtgt gccgccagtg cggacaaggg 1800 ggccatgtag cggcgaagtg caccaatccg gtggactgcc gaaattgtcg ttttaagggg 1860 ctaccgtcgg ggcacaatat gctctcggcg gcctgcccag tgtatagcgc agttttggca 1920 cgtgtgaatg caagacacta acatgtctcg gctcttgcaa attaattgcc agggttcgta 1980 cgctgttatg tgcgatgtgg ggcagatgat gtgtgaaaga ggctgcgtcg tggcgctgtt 2040 gcaggagccg tataaccgct acggcggtgt gcgcggtttg cccgcaaata tgcgggtatt 2100 tcctgacagc agaggccgtg ccgcagtcgt tgtgaatgac cccggcatag agtgcacttt 2160 agtgaactct actgactggg gtgtatgtgt gggtttagag gggatttttg gtaggttgta 2220 tgtcgtaagt atttattgca agttttccga gccattagat ccgtaccttc agtacatgaa 2280 tgcggtacta ctccaggtga gtagcaatcc tgtcattctt ggccttgatg cgaacgcgtc 2340 atccccctta tggttcagta agatgtccag gcttggtcct ggccatcgga actatgaccg 2400 gggtgagacg ctggccgaat gggtgatcac ccaggatgta caggttgtta acaagcctag 2460 cgagtggtac acattcaaca gtccgaacgg agcgagtgac attgatgtta ctctcgcgaa 2520 tgaggcagca atgagagtgt gtgattttca atggaatgtg ttggggggtc aaggtgtgag 2580 tgatcataat ccgattgaga ttgtggtcac ccgcgccgtg gccctggaac cgaatgagag 2640 gcttaatggg tggtgctatc gtggtgcgaa ttgggccctt catgggcatt atgtgagaga 2700 agcggcgacg caagtcccgt ttagcacctt ctgtgctatg aatgtggatg agcaggttgc 2760 atgtgtgaca gagtgggtta cgagtgcgaa tgattccatg tttgcgaggc accgtaaggt 2820 gactcgtaaa cgtgttaagt ggtggacgcg tgaactgcaa gccgagaagc ggtctctccg 2880 gtctctgagg aagagattcc aaagggccag aaggactaat gcggacggcg tggcccagct 2940 tagggatcag cttagtacta gggttcatga gtacaagcat agggttttgc gagtgaaaga 3000 tgacgagtgg cgttctttcg tagatcgaaa taaacacgat ccctggggtc gtgtttacaa 3060 gattctgcgg gagaatggga aggaggggtc cagggatatt tgtgccctac gggttgaagg 3120 tcgagacctt ataacctgga aggattgtgt ggaatgtctt ttgtccgaat tctttcctag 3180 ggaggatcgg cagcattcag tgcctgatgc agtgggtgat gctgatcccc tcctgatgag 3240 cgagcttcag tgggctttca ctaagcttcg ctctgggaaa gcgcctggtc tggatgggtt 3300 cacgggggaa atttgtaaaa gtgtctggaa gtccattcca gatcacttgt acgaattata 3360 ctccaagtgt gtgagtgagg gggcctttcc gagtgcatgg aagtgcacga gggtgagtgt 3420 gctcctaaag tcactcgata gggaccggag caatccgcgt tcctttcggg gcatcagtct 3480 cgttccagtt cttgcaaaag ttctggagag agtgctggtg gaacggctgc aggagagtgt 3540 ggggagccag atgtctgata ggcagtttgg cttccgtgaa gggaggtgtg cggaggacgc 3600 ttggaggagc gttaaggact ctgtgagatc cagcaactcc aagtacgtcc tcggtgtgtt 3660 tgtggatttt aagggggctt tcgattacct aagttgggct agtgtgattc agaggttgaa 3720 cgagtgtggg tgccgggacg tagctctttg gaagagctat ttctcgggac gaaatgcatg 3780 tgtggtggga gtgaatgagg ttgtttggaa ggatgtggct cgtggttgcc cacagggctc 3840 catctgtggt ccatttatat ggaacctcat gatggatccc ctgctgtggc agctcgagca 3900 agcgtggcag tgttgtgcgt atgcggacga cttgctcatc ttggttgagg gccagtcgcg 3960 ttcgatgatt gaggcaagcg cttcagagtg cttgcgcatt gttggccagt ggggcgacag 4020 tgtgggtgtc agcttggcaa gggataagac tgtgacaatg ctgctgaaag gcagcttgtc 4080 caggtccccc gttgtcaggc tcgacggctc cagcataagg tatgcgaaac aggtgaagta 4140 cctgggcgtg accgttggag aacggttgcg cttcgcacca caccttgcca atctgaagga 4200 ccgattggtc ggtacggttg gacgattgcg ccgagttttg agaagtgaat ggggcctcag 4260 cagaaaagct gctcgcacca tatatggtgg tctttttgtt gcttgtgcag caagcggatc 4320 accttcgtgg tacgatgcag tcttggacgt taggggcaga atgaaaattt taagtgtgca 4380 aaggttgatt ttgttggggt gtatgcctgt gtgtcgcact gtctctacgg aggcattgca 4440 agttttgttt ggagcagttc cccttgactt ggagatacgg cgtagagccg taagctacaa 4500 gctcaagaag gggctgccat tgctgcagga tgaatggctg tttcgtgaaa tgtcgggagg 4560 gaacgtggag agtttagggc caagtgaatg gaagtctatg ttgcgtgaat gtgtcctgtc 4620 ggactggcaa gcccgatggg ataatagtga gaatgggcgg gctacttacg agttcattcc 4680 agatgtctcg tttgttgtcg atcgaccgga ctttggtttt aaccttagtg ccgggttcct 4740 tttgactggc catgggtcgc tgaatgcatt cttgcatagg cggaacttgg cgggcagtcc 4800 ggaatgccgg tgtggctctc ggcaggagac ctggaaacat gttctctgtg agtgtccgtt 4860 gtatgatgat cttcgtgatc tgggcgggtt gggtgtggtt caggctgggg gggactttga 4920 ctttagtcaa gtgctctcca cgagcgacag gattagtaga ttgaatgtgt ttgcgagagc 4980 cgtattcagt cgtcgacgaa tgcgttgatt tgtagatgcc gggatgtgtt aggggtgaat 5040 ggtgcgtgaa tcaggatcga gtgaataggt gtgtgcaaag ttcgccgggt tttcgcccgc 5100 cgccggggtt accagtcccc gagcttcatg gtagccaaac tgggagtgct tcgttgagtt 5160 gcacgaacct gaccagagga tttttctggt accacgggtg atgaggattc acggagatgg 5220 tgctccaaat ccgccttgtc acaagccctt aggggttgtt gggaacatag ctggccgtct 5280 tcgagcggtc tggtcactga ccagcagcga ttgccgcggt ttgcggaaga ggaaggaata 5340 gtcctcgcct ccgaacagtg agaaagccat gtccatgaac atggtggata aaactgtacc 5400 accttatgat ttggcttcgg ccacttcata atctattccc agcagctatg tactatccag 5460 ggccaccgac ccgtgactgt tctggagtca aaatcggtag tgtccttgtt ggcacggcgc 5520 ctgaccggag gctgtgataa tttctggtac cacgggtgcc aggagcccac ggaacttgtt 5580 ccgtcctggc tatggtgagg ccccctaggg agctacgtgg tggttgtggt ttaacacccg 5640 aatgcgggta gagcctctgg ctcgacgtgg agttgcgtta tacaaccggg tgccgtgatc 5700 ccaaagatcg gaagaggttt agataggcct ccatccttaa ccgaggatta agtcatgacc 5760 gaacagattg acttacattc ggtatccggc ggaacatggt tccaaagggg cgctgattga 5820 cgcattgttt catatcccac gcccgagggg gccgtgagat taagccaaca tggcaggtgc 5880 tcacgttaaa catatcagac tttca 5905 // ID Gypsy-1_BT-LTR repbase; DNA; INV; 1000 BP. XX AC AELG01002157; XX DT 15-JAN-2011 (Rel. 16.02, Created) DT 15-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the buff-tailed bumblebee: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_BT_; KW Gypsy-1_BT-I; Gypsy-1_BT-LTR. XX OS Bombus terrestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Bombus; Bombus. XX RN [1] RP 1-1000 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the buff-tailed bumblebee."; RL Direct Submission to RU (15-JAN-2011). XX DR Genome; AELG01002157; Positions 50 1049. XX SQ Sequence 1000 BP; 295 A; 243 C; 191 G; 271 T; 0 other; tgtcggagat gaaagaacac cggaaccttc tctttggaac tttgggaaga tccctcaata 60 ctgtaatcca gattccacca tagaagtatt gagaaatagt tgtcattcga tctgactgaa 120 tttatttgag atttatgata gtgagctcgg gctccaggcg acaaccagtc gccaaacgta 180 gccgcggtca agggataaac gttttgtcta agaaaggcag gaagtaactc tatagctctc 240 cttaaaagaa atagttggga cggtgcatga cagtaaacat ttcaatggtt tctaccccgc 300 ggttcgctac acgcagactt tgtccttcgg gtaagatgat tgccagatgt caacgcatct 360 tcacagtata tgtttagctg gccgaggacc cgttataaat cttaagtttg ggttatataa 420 agtccttcaa acagacaaat agtctttgtc ccaactacgg gaagataaaa gaaatctatc 480 cttccatgaa cgacgcttcc cactagcaac tttccccaag gacggctaac atccttctct 540 aaccaccaac atgaaaattg accaattaac agtaacgtca atttccctca cccttcgaac 600 gaagactttc ctctgcgaat ccgatgatct catgccctta gacacaccca gcatagattt 660 cctcgacagc gtcaccgtga cggagagcca ctctcttgta caatctcatc aacgattgaa 720 ctgtcatttt tatacgtagt gattgctcct tgcattaaca ttgagtataa cttagacgct 780 aaaaaagggt aaagttcggc atcccgttga ccgcggattc gttatcgaac gtgagaccac 840 tgtcattagt gccgcgagta tctaaatact gtgataacta ccgacgctat atatctctac 900 ttgtgttaat aaattgtacc tttgttgcac cgcaccgatg gctaattcca atgaagattc 960 attgaacacc cctaatccta atcataacgc caacccgaca 1000 // ID Academ-2_HM repbase; DNA; INV; 6072 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6072 BP; 2198 A; 922 C; 1053 G; 1899 T; 0 other; tagtatggtg gaattagctc aaaaaaaacc caaacccgga aaaaaatcat ataaagttag 60 gcttttgcac ataggttagt tagcatgtca acaatgacca aaaaaaaagc cccgtgtttt 120 tgagaacagg aaatgtcatt ttgggcccat ttttcaatca aaaatttggg caaaatgact 180 aattttttta tatcacaaaa attataagtt ttatatatca tttaaaagat aatttaatgc 240 cgagtaaaaa gtttggttaa gcaaagttat aaatatgtat ggtttgtgag ttatagctgc 300 tcaaagtggg aattatgacg tcatattttt tataattatt ttttttggaa atcgtaagca 360 tgatataact ttagaaatat aatttaaagc tgaataaaat actgtcaaac tgtacatata 420 ataaaattta ttaaatgtac agtttgaatg aatagttttt gttcttgaga gaaacttttc 480 gatcaaattg tgtttaaaaa aagacattgc aatgaaaata tgaaatttag aacaaatata 540 attctattca aatatataaa ataatatagt attactatta tattatattc atatatttaa 600 atttatatat attaatatta tatataggta taatactaat ataaatttaa tactaattta 660 attcagtaca aatataaata gaaagtgtgt aaagagatga atgaaacttg ttcgctgtgt 720 aatggtaatt ttgaaaagtc tccatcagtt tcagtaaaaa agaaaggact ccaaacttta 780 aagagaatct gcaatcaaag aggattatac gatttaaata ggtatattta aatataaaag 840 ataaaatagt atcaaatgct aatgataaac agatttgaac aatatagcac gtttgcatta 900 aaattgtacg gaattacctt tatttagata ttttgatcaa aagttaaaag aaaacccaat 960 cggaactgct cttgtgcatc atgattgcag aagaagattt gttgataaaa gaaatacaaa 1020 tatagaagtc gtcccaataa agaaactaag atcatcttca attattgatg tacgtttcaa 1080 ttggaaatcc tgttgcttcc tatgtaccaa atcagctgac ggtaaaaaca gtactgttac 1140 tcaagtgaga acgttaccac ttcgaaataa gcttcttaac caatgtaaaa gtagagctga 1200 tgaatgggga agagaggtgt taggtcgttt atcaagttgt aatgatcttg ttgcagagga 1260 agcagtatat cacattcctt gtatgaacaa attcagactt aatctactga ctggaaataa 1320 aaaaggtcgt ccaacagatg atttgatgct tcttaatttt tataaggttt gtgattggtt 1380 agaaaaagat gcagattgtg acctttgcac attaaacgag atgcacatta aaatgttaga 1440 acttagtaaa gactctccat gttactccat taaacatttg aaaaataaac ttgtagaata 1500 ttatggtaat catatatttt ttgcagaagc accaggtcga caaaatctta tttgcttcaa 1560 agacatgaat tggttcatta tggaaagttt taagaaaaaa gaagaaaaaa catccggtga 1620 catcattgca gccgcagcaa aaattataaa gaacgacatc agagaacttt cttgtgataa 1680 atctaactac cctataattg atcaaatgga cgatttgaat tatgccaaaa actgggttcc 1740 agcaagtctt ctcaactttt taaaatactt aatttcatca gaactgaagc aagtcagtat 1800 tggccagtgc attgctcaat gctctagacc aaaatcaatg attgcatcaa ttccttttgg 1860 gattggagtg gacattgata aatcgtttgc aacaaaatgg cttgttgatc atctatctag 1920 acttggattt agtgtttctt ctgatgaagt gaagttattt aaagaatcgg ctgctgcgga 1980 aaaaagagat gcaattgaaa tgccagaaca taagtttaca caatgggttg ccgataatgt 2040 tgaccataac atatgcactc taacaggaaa aaatactttt cacggaatgg gtataatctc 2100 aattactcca tactctgttc aaaaatccat cgctgttaag cgcttgaagc ataaacaatc 2160 tttttgcttt aaagatgcaa taaagatact accttatcat ggatcaagtc aacaagggtt 2220 atcaaaatta aaatttaagg ctgtctgcga tttagtattg ccattatttc attcacctgt 2280 tatgaatctt gatctattgt ggcaagctgc atggttttta acttcaaagg aatctccacg 2340 ccctaattgg tcaggattca tgcaacatgc tgtgactaca tgcagtgaca actttaaaaa 2400 gacgactatc aattttcttc ccattattga cttaaatcct ttggaagagt cgtgtattta 2460 ttcaactctg caatttgtga tatctcaagc caaaagattt gacatttcta caccatgtat 2520 tacttttgat caaccactgt ggttgaaagc tattggaatt gtaaaaaatg aaaacctgaa 2580 cattgtgtgc agacttggtg gatttcacac actcatgagt tttcttggaa gtattggtaa 2640 gctaatgtca ggatctggac ttgaggaagt cttcgaagaa gtgtatgcag agcatactgt 2700 tcaacatatg ttttcaggaa aggcagttgc tagatctctt agagctcaca tcataattca 2760 aagtgtttta acggtttatt taatggattt tttaatagaa gaaagtagaa ttgatcttgc 2820 tggttttata cctgcctatc aaaatgcaat aaatgatcaa ggtatgaaca aggaacaatt 2880 aattgagata ggaaatagcg atattttcaa acaaactaag agtgcattat ctgctttaat 2940 tgagaaaaag aaagctgaat cgcgtacagc cgcattatgg cttcagtata tggaatatat 3000 tgacgtggtg aaggaattta tttgtgcaga aagaacttca aactggtatt tacatctgca 3060 agccgtgaag aaaatgctga atttatttgc agcaacaggt catgtaaatt atgccagaag 3120 tgctagaatg tattttcaag aaatgttaac tctttcagag actaaccctt ggcttcatga 3180 taggtttata gaaggtgaac acgcagttcg aagatcaagt agatattggg gtggattatg 3240 gtcagatctt gtgattgaac aaacgcttat gcggtccctt aaaactagcg gtgggttgac 3300 aagagaccgt ggtcttgaag agaatgtacg cctcctttgg gtttcaagta ttaattacac 3360 agctgctgtt catgacgcaa tgacaaatct tacaggtgtt aaagttggaa ctagtgagca 3420 gcatctagaa atggggttca cacgtagatt aagcgattat gaagatttcc aaaaatttta 3480 tggttggttc gaaacaagaa acccttttga atacgaagat actcaccttc actcgttatc 3540 aagtggagtc ttttcggaaa ttaaaaaaga caatgttaat tgtgaaaatt cagaagctat 3600 tggactagca attcaaaaaa atcttgacaa tatcagtttt acagaagcga agatcaagac 3660 aaagaatcaa cttttcacac ttgaaacttt gacaaagtgc gctaaaattg acgataaaaa 3720 ctcaattttt atcaaatcaa ccaaactatt tacaagatta gctgcaattg cacagagaga 3780 agatgatgtt gaaagttact ttgattacga actcactcca tttccacaag ctttgtttaa 3840 aaataatctt atgcggaaac ctgataaagc ctcacttagg aaattacttc taacagagga 3900 aaacatttgc tcaatagaaa aattggaaaa ttgcatctat gttcttgatg gaggtgcttt 3960 attgcatcgt gtgcactggg ttaaaggaac aaagtttagc aacatattgt atggatatgc 4020 aacctatgtt cagaaaaact atggcaaatg cttcattgta tttgatggat ataagtccac 4080 gacttcaatc aagtctaatg aacacattag aagaaaaaca gctagtggtt cttcaagaaa 4140 tataatcatc aaagaagata acgatgtttc atactcaaaa gaacgttttt taggtaacgc 4200 gcacaataag gagcaattga tatcgttcct agcagcacac ctcaccaagg atggccatac 4260 ggttcatgta tgtgaagggg atgctgacac aaaaattgtt tctacggctc tcgaggttgc 4320 aaaagatttt ttaacaatag ttgtggctga tgatacagat gtggctgtaa tgcttcttta 4380 ccactggaat aagaagttat ccgatatttt ctttcttcag gaacacggca agaaatgttg 4440 gagcattaaa aaatgccaat tagaagttgc ggacttcaag gatcatttgc tttttgttca 4500 tgcatggagt ggatgtgact ccacctcagc aatatttggc aaaggaaaag caatgttttt 4560 gaactcagtc aaaaaatcag aaagtatgaa ggagatatcc gaaacattta tggactactg 4620 ggcaacaaac aaagaaattg cagagtcctc agtcaatgct ttcaaagaat tatataatgg 4680 acatcaacag acttcattag ccaaattaag gtttattctt ttttaaacat acacaaattt 4740 ttattaaact tttttaaata gtcaaactgc tcctagcatc ataagacatt tgttttatag 4800 gattgtcttt atatttaaac aaatctttct tcaaacatta gatattcaaa atatcttgaa 4860 gcattgtgca aaggtattgt tgttcctgaa aaactaccac cgactgatcg ggctgctcat 4920 ttccatggtt acagagttca tctgcaatta attgaatgga aaatgctaga cgaaggatta 4980 aatctaaaac ctactgaatg gggatggaaa tccactgatg ggcaccttga accaattcca 5040 actgacaaag aaatagctcc accaaatttg ctcaaggtca ttcgatgcaa ctgcagatct 5100 agttcaaaaa accaatgcca aacaaaaaca tgcatttgta ggaaaaatgg aatgaagtgt 5160 atggcagcgt gtgggggttg ccatggtatg gaatgcagca ataagacagt gagacgattt 5220 taatgaatta aatttataat tgattaattt ctatacaaat aaaaattaat tatctgtaag 5280 taaaaattct aatatcttat tttaatttaa tttagttgga cattgaggag gaacatgaaa 5340 acagtgataa agacagcgat gaagaagaaa actctaaaaa tatttttgat gaaatgtttt 5400 aaacatcaaa atattatcat aataaatgaa atgtcttttt gtgtcttaaa acatatatta 5460 attatgatat tataatcata tccatatcat aattatgata tggaaaatgc aatgaacaac 5520 gttgttagta ataaaaactt attatggaaa agttgcaccg aattcgaaaa aaggcagaca 5580 aaaaaattcc ctagtcccta aaagtatagg gcctaacgct atagcccccc cccccaccta 5640 gctacagcac taatattttt acgttatgag ttaataaaat attttacata gcgttaaatt 5700 ctatttctaa cggcatacca tgcttacgat ttcaaagaaa aaatattttt tcaaaatgat 5760 gacgtcataa aacccactct gaacagccat agctcacgaa ccataaattt ttttaacttt 5820 gtgtaagcaa actttttact cagcattaaa ttatctttca aatgatatat aaaacttata 5880 attttggaaa tataaaaaat ttagtcattt tgcccaaata tttgattgaa aaatgggccc 5940 aaaatgacat ttcctgttct caaaaacacg gggctttttt tttggtcatt gttgacatgc 6000 taactaacct atgtgcaaaa gcctaacttt atatgatttt tttccgggta taactccaat 6060 tccaccatac ta 6072 // ID Gypsy-256_AA-I repbase; DNA; INV; 5116 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-256_AA_; KW Gypsy-256_AA-LTR; Gypsy-256_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5116 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1113-1113 (2011). XX DR [1] (Consensus) XX CC Positions [2478-3011] - Reverse transcriptase CC Positions [4098-4559] - Integrase core CC 'GAAGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 475..1545 FT /product="Gypsy-256_AA-I_1p" FT /translation="MDGSKAFSNLPPFSFAGVELTDRRQKWHTWKRGFEIC FT LRTAKIKDATEKKDILLGQGGFELQEIFFNIPGADVSEDKENNIDPYDVAI FT SKLDDYFAPQRHEVHERYIFWAMKPEQGETLSKFIMRTQIHASKCDFGKSA FT SESADKAVVDKTLQFAPAHLREKLLQVADLTLEEAIKQVNAYETSQAASDQ FT ISGQSVPNQQVKTEYVQRVAARCRFCGKSHGPQQQCPAWNRSCSFCGKRGH FT FQAVCFDKPSTSSGTPPGGIGKVAPKRTMPVYVKGSYDKHSSTQPKPFAKH FT PRKPVYRQLNAIEDSSMEEIVIMCQIFMHLNCWYDKHRQIRPGKLYGAPGK FT LKRVRKDKMAVTKF" FT CDS 1941..4886 FT /product="Gypsy-256_AA-I_2p" FT /translation="MVWSEKERDELLWAEVGGVLIEMQIDSGVQSNIIDDS FT TWKMMTTNGVQTVGGTQRADRKFKAYAQKDCLEVPHMFEAEIVVHDKQKVL FT KAMARFYVVKDGPQPLLGKRTAIELGVLIVGLPSQHESIHSVKVGQPFPSV FT RGIKIHLPVDNSVTPVAQRLRRLPFATLERVEEKSDELLSKDVIEKVSGPS FT KWVSPMVIVVKDSGDIRLCIDMRRVNKAVLRETHPLPTLEDIRWKLNGATY FT FSRLDIREAFHQLELDEESRPITTFITHKGLYRYKRLVFGISCAPEMFQKV FT MEQILADCECAVNFIDDIIVFGKTEKEHDESLEKVLSKMRDYGILLNHEKC FT AFKLNEIDFLGHHFDANGMIPAHSKVEAIKTFRTPANAEEVRSYLGLVNYV FT GAFIPDLATISFPLRELTKNKAEFKWGKDEQKAFEQLISLISNVETLSHFD FT PNLKTRVVADASPVGLGAVLLQFPDENPRVVMYASKSLTETERRYAQTEKE FT ALALVWAVERFQIYLIGIRFDLETDHKPLEAIFSPDSTPCLRIERWVLRLQ FT AFSYDVVYRKGKSNIADPLSRLSQPTEIEPFDPDSEVYIRNVVELAAVDID FT ELENASAKDIEICELREFLDRGVWNYTSEIIKPYHFLRNELGKVGDLVVRG FT SRLIVPKCLRERMVHLAHEGHPGRTKMQQRLRNTCWWPGMDEAILRVVDSC FT EGCRVVSQPNRPEPMVRRQLPESPWIDVAIDFLGPLPSGDYLLVIIDYFSR FT YKEVEVLRKITASETIDRLEKIFVRLGYPRTITLDNGRQFVSLEFEKYCKN FT RGIVLNKTTPYWPQENGLVERQNRSLIKRLKISQALKQDWKQDLLTYLSMY FT YSTPHSTTGKTPSELMFGRNIRTKLPSLQDLATAVMPTDYRDRDLEAKEKG FT MEAENVRRKAKVSEIKVGDKVLMKNLLPGNKLTPTFNPAVLTVIEKQGPRV FT TVQNTVTKKSYDRNSSHLKRLPSQEEF" XX SQ Sequence 5116 BP; 1573 A; 987 C; 1290 G; 1259 T; 7 other; aattgacgac gaggctggct gtttatcggg waaataagag ttcttgtggt ttaataaatt 60 cgttaacatt cgaggtaagt tggatttatg cggaataatt cacgataatc atcaaaactg 120 atggtttgta tagattgttt tggctaccgc gacgagtcag cggcaattgg ctaccacgac 180 aagacagtgg gctatcgcga cgtgaaagcg acaattggtt accgcgacgt gaaagcggag 240 attggctacc acgacgagag agtggcatag agctatcgcg acgtggaagc gacgattggc 300 taccgcgacg tggaagcgga tgttggctac cgcgataaga cagcggcgaa cggctaccgt 360 gacgtgataa cgggtattgc gaagtcgtag atgattgatt gagattgtgg ctacaaaagg 420 tatgttgaaa cttgttacat aaaacataaa attacgactt gggaatttcg taggatggat 480 ggaagtaaag cgttttcaaa cctccccccg ttctccttcg ctggagttga actcactgat 540 cgaaggcaga aatggcatac ctggaagaga ggttttgaaa tttgcctccg aacggcaaag 600 ataaaggatg ctactgagaa aaaagatatt ttgctgggtc aaggcggctt cgaacttcag 660 gagattttct ttaacatccc tggggcagac gtcagtgagg acaaggagaa taatatcgat 720 ccttacgatg tggctatcag caaactcgat gattattttg ctccacagcg tcatgaggtt 780 cacgaaaggt acatcttttg ggctatgaag ccagagcaag gcgaaacact gtcgaagttc 840 ataatgagaa ctcaaatcca tgcgagtaag tgcgatttcg gaaaaagtgc atctgaaagc 900 gcagataaag ctgtcgtcga taaaacactg caattcgctc ctgctcattt gagggagaag 960 ctacttcaag tggctgacct taccttagag gaggctataa aacaagtgaa tgcttacgaa 1020 acaagccaag ccgctagtga tcaaataagt ggacaaagtg tgcctaatca gcaggtgaaa 1080 actgaatatg ttcagcgagt tgcagctcga tgtcgttttt gtggcaaatc tcatggtccg 1140 caacagcagt gtcctgcttg gaacagatct tgttcttttt gcggaaagcg tggccatttc 1200 caagcggttt gttttgataa gccaagtacc agcagtggaa cgccacctgg aggcattgga 1260 aaggtcgctc caaaacgtac gatgccagtt tatgtcaaag ggagttatga taaacactca 1320 tctacgcaac cgaaaccgtt tgccaaacat ccacgtaaac cagtataccg tcagctgaat 1380 gcgattgagg atagcagcat ggaggagatc gtaatcatgt gtcaaatctt catgcattta 1440 aattgttggt atgacaaaca cagacaaatt cgtcctggaa aattgtatgg tgcaccgggc 1500 aaactaaaaa gggtacggaa agataaaatg gcagtaacaa aattttaata tgatgaataa 1560 ttcaagtacc aaagaattta ttgacaaaag gaacaaaccg tagagtaacc tacagtttat 1620 tggacaacac ataactaata taaaagaaaa aagaaagcat aagtggccct tagcttcaat 1680 aaaacgctta ttgagtaaaa taaatgcaat aaccaaaatg cattcctgta tcaacagtca 1740 gcggaaaaaa acatgcagct tgttctgtgc cattatttaa aaataataca atattttcta 1800 gctcaacatg caatttcggg aaatcccggg attttcaaaa atatcccggg attcgggatt 1860 ttttagatcc cgggatttcc cgggattttt gtcccgggat tcccgggaca agaccctcag 1920 atcagataga atatgtggaa atggtgtggt ctgaaaagga acgagatgag ctgctgtggg 1980 ccgaagtcgg aggcgttctg attgagatgc aaatcgattc cggcgttcaa tcgaacatca 2040 ttgatgacag tacgtggaaa atgatgacta cgaatggagt tcagacggtc ggaggtactc 2100 aacgtgcgga tcggaaattc aaagcatatg cacagaaaga ctgtttggag gtaccccaca 2160 tgtttgaggc ggaaatagtc gttcatgaca aacagaaggt gttgaaggct atggcacggt 2220 tctatgtcgt gaaagacggg ccacaacctc ttttggggaa gcgaactgcc attgaactcg 2280 gagttttgat agttggcctc ccgagccagc acgaatcaat acacagtgtg aaagttggtc 2340 agccgtttcc aagtgttcga ggaataaaaa ttcacttgcc tgtcgataac tcggtaacgc 2400 ccgttgcaca gcgattgcgt agacttcctt ttgcaacact tgaacgagtt gaggaaaagt 2460 cggacgagtt gctatcgaaa gatgtcattg agaaggtttc gggacccagt aagtgggtat 2520 caccaatggt tattgtggtg aaggatagcg gggatatacg gctgtgcatt gatatgcgac 2580 gagttaacaa ggccgtcctc cgcgaaacac acccattgcc tactctcgaa gatattcgct 2640 ggaagctgaa tggtgcaaca tacttttcga ggctagatat cagagaagcc ttccaccagc 2700 tggagctgga tgaggaaagc aggccgataa ccactttcat aactcataag gggttgtatc 2760 gatacaagcg cctggttttt ggaatatcct gtgctccaga gatgttccaa aaagtcatgg 2820 aacaaatctt ggctgactgc gaatgtgcag tcaatttcat tgacgatata atcgtgttcg 2880 ggaaaactga aaaagaacac gacgaatcgc ttgaaaaagt tctgagcaaa atgcgtgatt 2940 atggaatatt actgaatcat gaaaagtgtg cttttaaact gaatgagatt gacttcttgg 3000 gtcatcactt tgatgcaaac gggatgattc ctgctcattc taaagttgaa gccattaaaa 3060 cctttcggac gcctgctaat gcagaagaag tacgcagtta tttgggactg gtgaactacg 3120 ttggcgcttt tataccagat ctggcaacga tttcgtttcc tttgagagag ttgacgaaaa 3180 acaaggctga attcaagtgg ggtaaagatg agcaaaaagc tttcgagcag ctgattagct 3240 tgataagcaa tgtggaaaca ctatcccatt tcgacccaaa tttgaaaacg agagtagtag 3300 ctgatgcgtc cccagttggt ttgggagctg ttcttctgca gttccctgat gaaaatccta 3360 gggtcgttat gtacgctagc aaaagcctta cagaaacaga acgacgatat gcccagaccg 3420 agaaagaagc cttagcgctt gtctgggcag tcgaaaggtt ccagatatac cttattggta 3480 ttcggtttga ccttgagacc gaccacaagc ccttagaggc aatattttca ccagattcca 3540 caccttgttt gcgcatagaa cgatgggtat tgagactgca ggcgttcagt tatgatgttg 3600 tatatagaaa gggtaaatct aacatcgccg acccgttgtc tcgactttct cagccaacag 3660 aaattgaacc gttcgaccct gactcagaag tatacatacg gaatgttgta gaactagcag 3720 ctgtggacat agatgagttg gaaaacgcct ctgctaagga tattgaaatt tgtgagttgc 3780 gggagtttct cgaccgtggc gtttggaact atacctctga gattatcaag ccataccatt 3840 ttttgcggaa cgaacttggg aaagtaggtg acttggttgt ccgtggctca cgattaatag 3900 tgccaaaatg tttgagggaa cgtatggttc atctggcaca cgaaggccac cccgggcgca 3960 caaaaatgca gcagcggttg aggaatacgt gttggtggcc aggaatggac gaagctattt 4020 tgcgagtagt agattcatgt gagggatgtc gggttgttag tcaaccaaat cggccagaac 4080 ctatggtaag gcgccaattg ccggagtctc cgtggatcga tgtagccata gattttcttg 4140 gaccacttcc gtccggagat tatctgttgg taatcatcga ttactttagc cgatataagg 4200 aggtcgaggt wctgaggaaa ataacagcga gtgagactat tgatcgtttg gagaagatat 4260 ttgtcagatt gggatatccg cgaacgataa ctttggacaa cggccgacaa ttcgttagct 4320 tagaattcga gaagtactgc aaaaatcggg gaatcgtgct aaacaagacc acgccgtatt 4380 ggcctcaaga aaatggcctt gtagagcgcc aaaaccggtc gttgatcaaa aggctaaaaa 4440 taagccaggc gttgaaacaa gactggaagc aagatttact gacctatttg tccatgtatt 4500 attcaactcc tcactccaca actggcaaga ctccaagtga gttgatgttc gggagaaaca 4560 twagaacgaa actaccgtcg ttgcaagact tggccactgc agtgatgcca acggattaca 4620 gagaccgaga tctggaagcg aaagaaaaag gaatggaagc agagaacgtt cgtagaaagg 4680 caaaagtgtc ggagataaaa gtgggtgata aagttttaat gaaaaattta ctaccaggga 4740 acaagttaac tccaacattt aatccagccg ttttgacggt aatagagaag caaggaccac 4800 gcgtaacagt ccagaatacc gtcacaaaga agtcctacga tcgaaattcc agtcatctta 4860 agcggttacc ttcacaagaa gaatttmctg agagcggagg gaaaactggt gcttcagttg 4920 aasttgaggt tgctgagawc atgaaatcac cagtgagcgg aaacgacacg atggaacaag 4980 cacaagagcc agatgttgca gaaacaaacc kagatgctct acgaggcaaa tctcggagag 5040 tgatcagaag gcccatcaga tttgaagatt gcgtcactga ttttgggtca taaatgatta 5100 ataaatgaag gggaga 5116 // ID Gypsy-596_AA-LTR repbase; DNA; INV; 124 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-596_AA_; KW Ty3_gypsy_Ele69; Gypsy-596_AA-I; Gypsy-596_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-124 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 124 BP; 44 A; 28 C; 15 G; 37 T; 0 other; tgttagagca gtaacccttg aactgccaag caagccatga ttcttatact aataatgaca 60 actagaaaat atattcattc tttgttcaac ctccaaacta agcagttgtt cttactacac 120 aaca 124 // ID BEL-242_AA-LTR repbase; DNA; INV; 457 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-242_AA_; KW BEL-242_AA-I; BEL-242_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-457 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 457 BP; 169 A; 92 C; 76 G; 119 T; 1 other; tgttcaggac ggcgccgtat cctcctaatt atttccctag gggtgcgtgg taccaacagc 60 agggagaaaa acattatagt gtacaacaca tctacaatac acacaatact acacgatcgc 120 ttaggaacac acagtcagta ctgaaataga caccacagaa gacgcacacg gattagaaga 180 acagaaawct aaattgtcag tgaatttatt gtacagcaaa gttagttgga caggattcga 240 acataaaacg gactaaaaat gtaagttgct gtttaaaatt ttgcataata tctaatattt 300 tgtaaaccta tgaactactc atgccttaaa actactatga actatttttg agcttatagt 360 taatgaacct aataaaattg cagctaaaag caaactccac atcataaaaa acgagtttcg 420 ctctttggat ctccgaaaat catcacctgt cgcaaca 457 // ID TE-X-1_NVi repbase; DNA; INV; 4773 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE nonautonomous transposable element from Nasonia vitripennis - a DE consensus. XX KW Transposable Element; Nonautonomous; TE-X-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4773 RA Bao W. and Jurka J.; RT "Transposable elements from Nasonia vitripennis."; RL Repbase Reports 9(4), 802-802 (2009). XX DR [1] (Consensus) XX CC The element is not flanked by obvious TIRs, but it is flanked by CC TA satellite (shown in the consensus), which is probably CC introduced after the element's insertion. The TSD is unknown. XX SQ Sequence 4773 BP; 1392 A; 1085 C; 1113 G; 1182 T; 1 other; atatatatat atataatata tatatatata tatatatagt tgttttgagc caaggtgaga 60 ttcagccccc ctcgctccct cagccccccc tcaaaataac gttaaaacgt ttaaaaaacg 120 cgttattttt tactcatgaa gcccccccct ccctcgccgc ccctttatta gccccctttc 180 cccgcccccc ccctcccagc ccccctcagt tactttattc aacatttata gtaaaaaata 240 ttattcatta tgttaaaagt tctatttttt ccacgccacc catgaggtaa atagaagagc 300 gcgttttttt ctctgatgac aaaaatttta atgttttaga attagaaatt aatatagaat 360 ctcatagtaa tttcaataag caatgatgta ttttattcgt gtatttttaa tactttattc 420 tgtaacaagc ggtcgcgaat tcatattgcg cgcgctcgtt tagtcgcgcg atgttagctc 480 ggcatgcaag cgcggtgttg ccaagtcgtg cgcggaggcg cgcgtgagac actgcgcgta 540 aaggtagact atttcgggct tgcgaccagt aaacacacgt gcgcggagac caaactaaag 600 tatcccggta aacgtatcgt gtcgacgagc atcatccgag caatcaactc ccatggctgg 660 caccttcgat ctcagcttga cgtccagaag cagctaccca ggagcctttc gaggagtgga 720 gcagcttggc taggagggac gcctcctcga ccggattctg tcgtttgacc ttggtcaccg 780 gagatcgtgc tcgacgcgtg cttcgcatct ccgtatatct cacgaatttc ttcacgcgac 840 tataaagacg ccgaagccga gattcgtgcc accgaatcct tggctcgtct accgaacacc 900 gtcctctcgc gcgacgcgcg cgactattgt aaccaattaa agtgatcaat atatgcaact 960 attctattca caagttgcgc tttttgctgt tacgaacccc gtccttccta tctcctgttc 1020 gcggacttga cagggttgga ctgggatata gccgcggcat cgtcgtcgcc gttggcagcc 1080 gagcaactct gtgcgcggag aaagcgcggc gagcttcact tgccgtgcat tttgtatacg 1140 cgagtgactt agctgcattg cacggagtcg tcattgcacc gacggctaac gtccgtgggc 1200 gactccagcg tcatcgaggg atagccgtgg cgtagtattt taacagatag agaaactgct 1260 cggtgactct gcggcatctt cgtggccgcc gtgcacaagg ccagttgccc cgttacaaac 1320 tggcgcctca acgttgcagg taacgagcaa gagcgtcgga gtaggtgggc attaaaataa 1380 attcgtgtag cgagaaggaa gaattagctc gtctcgcgga agaacgaaat ttaaccctag 1440 cgcaaatgac cgaagcagag aggatggcgg ctctgccatt cgtatttaca ggcatagccg 1500 cagaataatg cgagaacaac aaagatgagt ggcaaagctg ggaagagttc tatacggcgg 1560 cgacaagatg gtacggcacg aacaggcgat tccaacaacg gacgcagaga gaagatacca 1620 aggcgcgcac gagcccgtgc gcgattttgt gacatgcctg ggaggcatga tgaaggaaat 1680 aaagccacaa ccaagcctga aaaaacaact agacctctta cataaaaatc tgatgccaga 1740 actgcaaagg ttgacccctc gtatgaactg ccacgactgg gacagtttcc aaagaagccg 1800 aaaccgtact ggcgagcggg caagagtacc gcgcgccacc accactggag caaactctac 1860 tagcatcgct tgcgtacgtc tcaccgacga aacgcaaaca gcctggcgcc gcagctgcaa 1920 agtggtatag acaattaccg gaaaattacg caacgttggc cgagcctttg actcggctga 1980 caaagaaaga tcagccgttc gtttggggcg atgaacagca atcagcattc gagacgataa 2040 aagctcttat cgccttggct ccagtactgc actgcccgtg cttcgatcag cagtttgtca 2100 tacaaaccga tgcaagtgat acagcgttgt gttgacgcag tgcatagacg gacaagactg 2160 agtactcgaa ttcgccagca gagtactcac ccccacagag aggaattaca cggtcagcga 2220 acgagaatgc ttggccgtgc ttttcgccat acgcaagttt caccagtaca tcgaaggtta 2280 taaattcaaa gtcataacag atcacagcag cctaaggtgg ctatgcaatc tctgtcagct 2340 gagcagagtt gaacagcgcg caccaccagg tttcatgggt aaacgtgtgg tgcaaaggcc 2400 ttggcaatgg gtagccggag atatcatggg ccccttgccg aagtctgcta aaggccacga 2460 gtacgcactt atcttccagg gtttgtttac ttgatggata gagtgcgtgc cactttgcaa 2520 agccaacggc aaagcaatac ttgccgcact caaggagaga gtgattttga gattcggtgc 2580 acccgaggtg tttcactcgg ataatggtac caaattcaaa aacaagacaa taaaagagta 2640 cttggcagag caaggaatcc accactcgta ctcaccgcca tatcacccgc aggtgaatcc 2700 agtggagcga ataaaccgca ctgtcaagaa gaatttttcg atgcccgccg actagtgcca 2760 actttcaagg ttggcgacgt cgccaatagg agacacgtat tgttgtctgc ggcggaaggc 2820 atcacggcaa aattggcttt tccgtacata gacccatata cgattacggc ccaagtcgac 2880 tcgaatacat tcgagctgac cgataaaaaa ggaagagtgg aaaagctagt gtcggccaaa 2940 gaaataaaga tcttttatga caaagacgac gaagaggggg acgacgatga gtctcgatca 3000 ccgggggatg cttgagaggt gaaggcgatg agcgacccgc cagaagcgtg cgaactcgag 3060 tctgagctca agccgacgga cgtagtggcg ggagggaagc gcgagcgcga gcaacggccc 3120 ggtaaatatg ggtcgacatc cgcggtggcg gagagaagtg ggggacccgt aaacggagct 3180 accccgaaac cctctcgccg cccgcgcggg cgacctcgga aaattgtaac ggcaagcgcg 3240 ccgacggcga attagcagga tggaaagcgt aagcccggcc catctaaggg ctcgactaaa 3300 gcaaatcggc cgaaacagct aaagccgacg gcgttatcat ctcgaaagac gtgagcaagg 3360 tgccgcaccg ggcgattgac atcatcataa tatattacat tgacatcatc ataatatatt 3420 acattgacat aatcataata tattacattg acatcatcat aatatatcac attgacacgg 3480 gctcgccacg aggaagaatg cgcgcggata aaggagcagt agcgcgccga gtatctccgg 3540 aagaagaggg agraagagcc gagatgctac atctgcggcc acccgggcgt gaagcggtcg 3600 catcaatgct cggagcgagc cgcacgcaag aagatcttgg cagagatcct acgggaccgc 3660 aagaagaact aggggaaaga agaagatgta gaagaaatga tcgctgaagg gaataaagag 3720 tacctcgttt cccctatatc ctgtgtgttc ttaattgact acctgatatc ccatttcact 3780 gcatccaagc atctcattct ttcgcatgat cgcgcgacgg cgcacagtgt accaaaatcc 3840 gatttttatg gccaaaattc gctcgacaag ccaaatcgca atcattctta atgaaaaaaa 3900 ttgcaaaata cttctacgat aatttccgtt catgtagaac tacttatatt tgataacgcg 3960 ttacataccc tataagaaaa atagaaaaag ttacagcatt ggtttgataa tctataaggt 4020 gtttaagacc ccttcaagtt tctaaaatgc cgattaagtg ttggtggttt ataaaataag 4080 gatctaaaat ttatatttta tcagtttaga gttcagagat ccaaaatcag caagtatata 4140 ttcttttttt tcttcttttt tttgttctta cagttttaga gattttagta catttcgatt 4200 ataaaacatt tcgattatta cattattcag tcttatgtgt tttcctattt ttcttttagg 4260 ttatgtactc ttttattaaa tataagcagt tctacacgaa cggaaatcat cgtacaagta 4320 ttttgcaatt tttttattaa gaatgattgc aatttggctt gtcgagcgaa ttttggccac 4380 aaaaattgga ttttggtaca ctgtgtggcg atacagagtt cttttgaaat ttttatgtac 4440 tcatataatt ttgaataaaa gtgataaaag tattaaatat ttacgaatag aatacataat 4500 tcaagcatat tcaaatttct atgagataat taatctaaat tttaaatttt aaatcaagtt 4560 aaataaaaca taacattatt gtaaacagag aaaaaacgcg cctttttatt tacctcatgg 4620 gtggtgtgga aaaaatagat gtataaataa tcattaccta tttttattta gataaatgtc 4680 aaattaattg tattggggtg ggcgagcgga gcgagcaggg ggaacccccc ctagtatata 4740 tatatatata tataatatat atatatatat ata 4773 // ID Gypsy-46_CQ-I repbase; DNA; INV; 7570 BP. XX AC AAWU01015436; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-46_CQ_; KW Gypsy-46_CQ-LTR; Gypsy-46_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7570 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 471-471 (2011). XX DR GenBank; AAWU01015436; Positions 9338 1769. XX CC Positions [5889-6365] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1291..3543 FT /product="Gypsy-46_CQ-I_1p" FT /translation="MELQKLYYNMDVSHLAIDEIEHELYVRKISFKSNELE FT STKRRRLKEVMKNERESNSFKACNSWRTLPEEITLIKSKLLVITGLLENPK FT TPALQRIKLKTKLIHYRVRIFLATKAFSAHKYASQLETLGKQASKIYVTHF FT PDETGSEEVPESQDKLDEDVVKALEEVRNEIEIMNESVVGVELDSEKNLES FT VGSEKTADFQKEIQEVEVRKKSLIEMLEQFETTENLEGNKVIENLKSFVQE FT TSEQQKAMREQQIEEENRKLKEIEENMNRKIRLEQLLVKLNNHLKIAESKP FT TKQIDFPKKTDPGENSKNRQFDSEFDFDLDCFSQSSSDDSSSKSSSDISTA FT AKRKYLKRVKKKRSGRGKSRSKVKGRKHQKQKVSFSGTSDSVQSTDSESSS FT SSSSSSTSSTSSSSSDSSSDSRSARRKNKKHKKSKKKHHHLKRIPVSEWRL FT KYDGKDQGRKLSEFLKEVKMRQKGEKITDKQLFQSALHLFSGRAKDWFMEG FT FENKDFRKWSELKKELKREFLPPDLDFQTEVQASNRRQARGEKFGDYFRDM FT QKLFQSMTKPMSDSRKFEIIWRNMRYDYKNALTAARIKSLSKLKKYGRILD FT ENNCNLFQKPVDASSRPKQQYVNEIITSGNPKQKQGSNNSYNNSKVFSKTK FT PKAGGKEEEDKFERSNQEKGGKFKPRDKEEADAMEGTAAGTMQALVDRYKR FT PPLGTCYNCGRHGHHYSDCSDKRGNFCRICGFPDVHTTKCLFCQKNGQNSA FT " FT CDS join(3507..5798,5802..6752) FT /product="Gypsy-46_CQ-I_2p" FT /translation="MPILSKKRTEFSLKRQVGSAAENPPTSEIVTQEMLSS FT GFRMFSDDDYCSEKNVEELFIRLEGDNRPFAKVWVLGRELIGLLDSGAQRT FT VLGYGCKKLLREWKLKIFPAEIALKTASGSQVEVEGCVHLPVTFNNENKII FT SALVAPKLKQKLILGYDDFWKAFRIQPGVQSIGCIEKVDLTNISRVDEICT FT EDEQEEEIVEILNDLQKTQLESIKGLFKVAVDGETLETTPLITHKIDLKPE FT FVNAPPVRINPYPTSPELQKKINHELDNMIKLQVIEPSRSDWSLSTVPVIK FT PTGEVRLCLDARRLNDRTVRDAYPLPHQDRILSRLGSCKFLSTIDLTKAFL FT QIPLDPASRKYTAFSVLERGLFQFCRLPFGLVNSPATLARLMDGVLGFGEL FT EPNVFVYLDDIVVVSETFEAHLSLLSEVARRLKLANLSININKSKFCVSEL FT PYLGYILTPEGLRPNPERIEAILNYERPNSLRSLRRFLGMANYYRRFIPHF FT SDMSAALTDLLKKKPKSLIWNPKAETSFLQLKESLIAAPVLANPNFDLPFQ FT IQCDASDTAIAAILTQEHFDGERVVAYFSQKLSPAQQAYAASEKEGLAVLS FT AIEKFRPYIEGTHFIVVTDASALTHIMKGKWRTSSRLSRWSIELQGYDLEI FT RHRRGKDNVIPDALSRSLEISLLEEDGDAWYSMMYNKVLASPDEHIDFKVE FT EGKLFKFIPAQTEVLDYRFEWKLCVPERMREDILRKEHEESLHVGYEKLLE FT KLRARYFWPKLAFARKYIEKCRICKECKPTNTSQHPTMGTARLATKPFQIL FT AIDFIQSLPRSKTGNMHLLVLLDLFSKWTVLVPVKKICASVVVKILEEQWF FT RRYSVPEILISDNATSFHSNEFKDFLQKYQVQHWANSRHHSQANPVERLNR FT SINALIRTYVKQDQRLWDSKISQVEYTINNTWHSSTGFTPYKILYGHEIIS FT TGEEHRRDADASEISDNERIETKLKVDRNIYDLVYKNLVKAHEKSERAYNL FT RFRKPAPVYQVGQKVFKRNFALSSAGDAYNAKLGPMYVPCTIVSRRGTSSY FT ELIDEHGKNLGIFSAADLKPGNPE" XX SQ Sequence 7570 BP; 2414 A; 1436 C; 1733 G; 1987 T; 0 other; gggcccgtaa ttaccaattt gagtccacga tagcacgttc gttggaccaa aagcccacgc 60 ggtcaagcac cttgcccaga agggcagacc ggactctaga agctgcgcga acagcgccgt 120 cgtcggataa cctcgtcccg ccgctggcag gacgaccgac gcatcccaag ctgttggtgg 180 atcgcttgcc tagggaaacc gcatgtacct cggcgacaaa ggcgatcacc cacgaggtcc 240 aggtagcagc tcgacgaccg gcgattggcg acagcacgta cgccgccggt gacacccaac 300 aacgccagtt tcggccccac cgtagccgaa agaccgccac gagttccagc aagcggcctc 360 gagaaggcca agttcagccg acgacgaagc gcacgtggca tcgtgcaccg ttcaccacaa 420 ctccagcacc aggccatcca gatcgccgag aacggctgac gtcgcaccgc acgtggcctc 480 ccaccggcca accgcagaag ccgccacctt caagaagcct ccagcatgcc ccccgcaaca 540 cacacacaca cgcccacagt gagtaaaata aaaaccgtct aaaagtgatt attacagggt 600 ttgtcctttt gattatccga gccgccttgc aaattagagc tccttgcttt gaaaaccacg 660 ccagtcgcgc aaaagaggcc ccgagcccac ccctgtccgg tcgtggctct aagaccaacc 720 ggcaggggcg tcttaagggc ctcctggccc agagggtacg ttgtcccgta gtaataaatg 780 gcgcccaact aaagataatt aaaggtaatc aagaggtcaa acacagttcg cagcaagtgt 840 gctaggaggt ttaggtattt tttttttacg caattttttg aagactctgt tgagggaatg 900 ctgtgttggc gcggtttcat caacatttgc tttgtttcaa tagttgtgga tgattcaagg 960 ctgattgtta gtgcgcgaac gttattgaaa gttctcattg gttgaattga acagcaaata 1020 ccgggagaat tacaagaaaa gttgatgttt tttttataat tactgggtct tgatacacca 1080 actaaacaga atattgaatt attcaggatt tactgtgttt ttttttatat ttgtgcttag 1140 aatgaagctc caagaaaatt tcatgttgta gagacatttt ttttctattt ttgtatattt 1200 ttgtttcgct tttgaatatt attattaatc aagcattctt tctctcttaa atttctaggt 1260 ttacaatttt gaatatatta aaagatcaac atggaattac aaaagttata ttacaatatg 1320 gatgtgtctc atttggcaat tgatgaaatt gagcatgaat tgtatgtacg gaaaatttcg 1380 tttaaatcga atgaacttga gagtaccaag cgaagaagat tgaaggaagt aatgaaaaat 1440 gaaagggaat caaacagttt caaagcttgc aactcgtgga gaacacttcc cgaggaaatt 1500 actttgataa aatcaaaact tcttgtcata acaggattat tggaaaatcc taaaacaccg 1560 gctcttcaaa gaataaaatt gaaaacaaaa ttaattcatt accgagtgcg tatttttctg 1620 gcaacaaaag ctttcagcgc tcataaatat gcttcgcaac tagaaacttt gggtaaacag 1680 gcaagtaaaa tttacgttac acactttccg gacgaaacag gatcggaaga ggttccggag 1740 agccaggaca agctcgatga ggatgtggtg aaagccctgg aagaggtacg taacgagatc 1800 gagatcatga atgaatctgt tgtgggagtg gaattagaca gtgaaaagaa tttggaatcg 1860 gttggaagtg agaagacagc ggattttcaa aaagagattc aggaagttga agttcgcaag 1920 aaaagtctga ttgaaatgct tgaacagttt gaaacaacgg aaaatttgga aggaaataag 1980 gtgattgaaa atttgaagtc atttgtacaa gaaacttctg aacaacaaaa agctatgcgg 2040 gaacaacaaa ttgaagaaga aaatcgcaaa cttaaggaga ttgaggaaaa catgaacaga 2100 aaaattaggt tggaacaact tttggtaaaa cttaataatc atttaaaaat tgctgagtca 2160 aaaccgacca aacaaattga ttttccaaag aaaactgatc ctggcgaaaa ttcaaaaaat 2220 cgtcagtttg attcggaatt cgattttgat ttggactgct tttcacagtc ttcgagtgac 2280 gacagttcgt ccaagagtag ctcagatatc agtactgccg cgaaacggaa gtacctgaaa 2340 cgagtgaaga agaagaggag cggaagggga aagtccaggt ctaaagtaaa gggaagaaaa 2400 caccaaaagc aaaaagttag tttttctggg acaagtgata gtgttcagtc tactgatagt 2460 gaatcatcta gctcttcaag ctcttccagc acttcaagca cttctagttc gagttctgat 2520 tcttcgtcag actcgagatc agctagaaga aagaacaaaa agcataagaa atccaagaag 2580 aaacaccatc atcttaaacg gatccctgtt tcggaatgga gactaaaata cgacggaaaa 2640 gatcaagggc ggaagctttc agaatttttg aaggaagtta agatgcgtca aaagggagag 2700 aaaattacgg acaagcagtt gttccaaagt gcacttcatt tattctctgg tcgtgctaag 2760 gactggttta tggagggttt tgaaaacaag gattttcgaa aatggtcgga actaaaaaag 2820 gagcttaaac gtgaattttt gccaccagat ttggatttcc agacagaagt tcaggcgtcc 2880 aatcgacgac aagctcgggg agagaaattt ggtgattatt ttcgtgatat gcagaaatta 2940 tttcaatcta tgaccaaacc gatgtcggac agtcgtaaat ttgagattat ttggcgcaac 3000 atgagatatg attataaaaa tgctctcacc gcagcaagaa ttaaatcttt gtctaaatta 3060 aagaaatacg gtcgtattct cgacgaaaat aactgtaact tgtttcaaaa accggtcgat 3120 gcttcatccc gaccaaaaca acaatatgtt aatgagatca taacttctgg aaacccgaaa 3180 caaaagcaag gttccaacaa ttcttacaac aattccaaag tctttagcaa aacaaaaccg 3240 aaagcgggtg gaaaagagga agaagacaag ttcgagcgct ctaaccagga aaagggagga 3300 aagtttaaac caagggataa agaagaagct gatgcaatgg aagggacagc tgcaggcacg 3360 atgcaagcat tggttgaccg atacaagcgt ccaccgctag gaacatgcta taattgtggt 3420 cgtcatgggc atcattattc agattgttcg gacaagcgtg gaaatttttg tcgtatttgt 3480 ggttttcctg atgtacacac caccaaatgc ctattctgtc aaaaaaacgg acagaattca 3540 gcctgaagag gcaggttgga tctgccgctg aaaatcctcc tacttccgag atagtcacac 3600 aagagatgtt gtcaagtggg tttagaatgt tttcagatga cgattattgc tcggagaaaa 3660 atgtagagga attgtttatc agactggagg gggataatcg gccatttgca aaagtatggg 3720 ttctaggaag agaattgatt gggctattgg atagtggagc gcaaaggaca gtgttagggt 3780 atgggtgcaa aaagcttttg agggagtgga aattaaaaat tttccccgcg gagatcgcct 3840 tgaagactgc ttcgggatcg caagtggagg tggaaggttg cgttcattta cctgtaacgt 3900 tcaacaatga aaacaagatt atttcagccc tcgtggcacc aaaattgaaa caaaagttga 3960 ttttaggcta tgatgacttc tggaaggcgt tcagaattca gcccggagtc caatcaatag 4020 gttgtattga aaaggtagat ttaacaaata tttcgagggt tgatgaaatt tgcacggaag 4080 acgaacaaga agaggagata gtcgagattc tcaatgactt gcaaaagacc caattagagt 4140 caattaaagg gttgtttaaa gttgcagttg atggggagac acttgaaact actcctctga 4200 tcacgcacaa aattgatctc aaaccagaat ttgtcaatgc accacctgtc agaataaacc 4260 catatccaac atcacctgaa ctccaaaaga agattaatca cgaacttgat aatatgatta 4320 aactgcaagt gatcgagcct agtaggagtg actggtcttt aagcacagtg ccggtaatca 4380 aacctacggg agaagtgcgt ctgtgtttgg acgcccgtcg tttgaatgat cggacagtga 4440 gagacgctta ccccctccca caccaagatc ggatattgag cagactgggt tcatgtaaat 4500 tcttatcaac gattgattta acaaaagcat ttctacaaat accactggac ccagcctcgc 4560 gaaaatatac tgctttctcg gtgttggaac ggggattgtt tcagttctgc cggctcccgt 4620 tcggtctagt aaatagccca gcgaccttgg ctcgcttgat ggacggagtg ctggggtttg 4680 gggaactgga accgaacgta ttcgtttacc tcgacgatat cgtcgtggta agtgaaacgt 4740 tcgaagcgca tctctcactg ctgtccgaag tagcacgacg tttgaaacta gctaatcttt 4800 cgataaatat caacaagtct aaattttgtg tgagtgagct tccttatctg ggttacatac 4860 tcacaccaga aggattgcgt cccaacccag agagaataga ggcaattttg aactacgagc 4920 gaccaaactc tctgcgctca ttgcggcgat ttttgggaat ggcgaattat tatcggcgct 4980 ttattcccca tttctctgac atgtcagcag cattaaccga tcttctcaag aagaagccca 5040 aatcccttat ttggaaccca aaggcggaaa catcgttttt gcaattaaaa gagagcctta 5100 tcgcagcacc tgtcctcgcg aacccaaatt ttgacttgcc ctttcagatt caatgtgacg 5160 cgagtgatac tgcgatagcg gcgattctca cacaagaaca cttcgatgga gagagagtgg 5220 tagcttattt ttctcaaaag ctctctcccg cgcaacaagc ttacgccgca tctgaaaaag 5280 aaggtttggc tgtgttatca gcaattgaaa aatttcgtcc atatattgaa ggaacacatt 5340 ttattgtcgt gacggatgcc tcagcactca cgcacataat gaaaggaaag tggcgaacct 5400 catctagatt gagtcgatgg agcatagaat tacagggtta tgatctggaa attcgacacc 5460 ggcgcggaaa ggacaatgtt atccctgatg cattgtcgcg ctcgttggag atttctttgc 5520 tagaagagga tggagacgca tggtattcca tgatgtacaa caaagtactt gcgtctccag 5580 atgaacatat tgacttcaaa gtagaggagg gaaagctttt taagttcatt cctgcgcaaa 5640 ccgaagtgct cgattataga ttcgagtgga aattatgcgt accagagagg atgagagagg 5700 atatcctacg aaaagagcac gaggagtctt tgcacgttgg atatgagaaa cttttggaga 5760 aattgagagc gagatatttt tggccaaagt tagcatttta ggcccgtaaa tacatagaaa 5820 aatgtagaat ttgtaaggaa tgcaaaccca caaatacatc acaacaccca acaatgggga 5880 ccgcgcgttt agctacaaag ccttttcaaa tacttgcaat tgattttatc cagtcgcttc 5940 caaggtctaa aacgggaaac atgcacttac tggttctatt agatttgttc tcgaagtgga 6000 ctgttttggt tcccgtgaaa aaaatctgtg ccagtgtggt tgtaaaaata cttgaggaac 6060 aatggtttcg tcgctactca gtgcccgaga ttctgataag cgataatgcg accagcttcc 6120 acagtaacga atttaaagat tttttgcaaa agtatcaggt gcagcattgg gccaactcca 6180 gacatcatag tcaggccaat cctgttgagc gcctgaaccg aagcattaat gctttaattc 6240 gaacgtatgt caaacaagac cagcgtcttt gggacagcaa gatttcacaa gttgagtaca 6300 cgatcaataa cacgtggcac tcttctacgg gatttacgcc gtataaaata ttgtatgggc 6360 atgaaataat ttccacagga gaggagcatc ggcgagatgc agatgcgtcc gaaatatctg 6420 ataatgaaag aattgaaaca aagttaaaag ttgatcgtaa tatttatgat ttggtttaca 6480 aaaacctagt aaaagcacac gagaagagcg aaagagcgta caatcttcgg ttccgcaagc 6540 ctgccccggt gtaccaggtg ggacagaagg tttttaagag gaattttgca ctttcttcgg 6600 ctggggacgc ctacaacgcc aaactgggtc cgatgtacgt tccatgtaca atcgtgtcaa 6660 ggcggggcac cagttcgtat gaattgattg atgagcacgg taaaaatttg ggtattttct 6720 cggcagcaga tctcaaaccg ggaaatccag agtaagaaga gcgaaaagaa tgcattttaa 6780 aacaattttt gcgttgtttt gttgaaaaat gtaacatttt acaagtttta gagcgttttt 6840 tgtgctattg tttagaacca cgttgtgtca gcaatagttg gttaaaaatg tctgatcatg 6900 atcaatcctt ttgtctgaaa aagaaataat tagaaagaga aagataatgc aagtttgatg 6960 attgtttttt gtttgaaaag ccttttctgg atgaatgtct cagtggaaag gagaatataa 7020 ttataacaat gtatgatgaa ttctagctag gcagaagtaa gatacaagca aaacagatta 7080 atgataacaa taaaggaaaa tgaaatgctt gaaattgctg tgcgtagaaa tttttgacta 7140 gtcatttcta ccctgctgtg caaagatatt ccaaattgat taactgagta tacaagaaaa 7200 cccgatttcc tatttccatg ttaaaaggaa aagcttggaa cgcattcgaa tcacaacatt 7260 gggaaaattt gcggtagttc ctgagagaga acgatcttct cttcctcaga atggttttct 7320 gttctggaat agtgggagta gaaaagatct agtaattgtt ttgaaaagta aattcgttgg 7380 aaaatatatt ttcttattat aagattaaac tatttgctga ttcaacgaac gaatgtcaat 7440 ttttggtaaa gcctagtata tcgttacctg gctgcttccc tgagtcagcc ttgattcagt 7500 tgcagctgtg taaaaaaatt aataaatatt tataatattt ttaatttttt tggggaggag 7560 tagatggtag 7570 // ID DongAa repbase; DNA; INV; 4004 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Dong non-LTR retrotransposon family from Aedes aegypti. XX KW R4; Non-LTR Retrotransposon; Transposable Element; Dong; DongAa; KW R4_Ele1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4004 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4004 RA Kojima K.K. and Jurka J.; RT "Dong non-LTR retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (07-OCT-2010). XX DR [2] (Consensus) XX CC [1] Also called R4_Ele1. CC [2] Consensus update and chracterization of insertion sites. ~99% CC identical to consensus. The consensus is ~100% identical to the CC original sequence in [1]. This family is inserted into (TAA)n CC microsatellites, like Dong elements in Bombyx mori and Anopheles CC gambiae. The target specificity and phylogenetic position show CC that this is a Dong element. XX FH Key Location/Qualifiers FT CDS 189..3851 FT /product="DongAa_1p" FT /note="reverse transcriptase." FT /translation="MKANPAMETRSMRNRRMRSPEEGAPTGAGPGTGDRAS FT GQRLEEDNVQERPSSQRAQPVPRTRNRNGSSRNHQGHAAPTDVAVADRRQS FT LTLAGGRRQRIMWTREMNMYVIRCYYVCTRMETDMSGRSGMLEMFNERFPR FT FARQLDLNKLYTRQRAIMSHNMLTAAEVEYIKLEVQREIGEEGTRSSDTSR FT RSSVGLDAPITSESRDSPAPTAPGPTADMQRQQLRDELALHMGTAVTQFRG FT TDPMSRHRIPKLRYSYRLTSAVSILNQDILPQYLEAVENLEDLQFIVYSAA FT IAVVKTLGLRTRPQGEDGTRPNAQKPVWMRRLESRIATLRSKVGRLTQYKQ FT GNRSTRLVRHVAEIVRPAELRDLREVDITEILDTHVQRLSALAKRLRRYGE FT CSKRKEQNRMFNINEREFYNRIRNDKVDFGEGLPEIGDVTQFWANLWENPT FT EHDGDGMWLAEEERQCDGIGDMTAVVVTAQDIREATRYTRNWAAPGPDFVH FT NFWYKKLTTIHGRMAECFNTVLGDPTQLPEFITRGVTFLLPKDQHTADPAK FT YRPITCLSSLYKVLSSVIARRVQAHCDANNVMTEEQKGCRRNTQGCKDQVI FT VDAVIVGQATQKQRNLSMAYIDYKKAYDSVPHSYLLKVLQLYKVDGNVIRL FT MQHAMGMWSTSLHITDGTAVLRSRTLSIRRGIFQGDTFSPLWFCLAMNPLS FT KALNQCNYGYQLKSGERSTRVTHTFYMDDLKLFAESVQRLHQLLQLVTTFS FT NDIRMEFGIDKCRSIHLRRGQVMDASCFRVNEQEEIRNMVEGETYKYLGFL FT QLRGIRHTMIKKELQEKFLSRVNCILKSFLSAGNKVKAINTFAVPLLTYSF FT GVVKWTKTDLEAIERAVRVAFTKHRMRHPKSSIERVTLPRAAGGRGVTDIQ FT ALCVSQIQQLRAYFVESQNRHEIYRTVCEADHGFSALHLAQEDYQLNCDIK FT TVDEMIAMWKQKELHGTHPHQLELEHIDKVASNTWLVRGDLFSETEGFMVA FT IQDRVIATKNYRHYILHEDVEDRCRKCNSVGETIEHVVAGCSVLAGSAYLD FT RHNEVAKIVHQQLALKHNLVDRFVPYYKYLPDPVLENSCIKLYWDREIITD FT VLIRANRPDIVVYDKRMKRVTLIDIAVPLDHNVQSTFSNKIAKYHDLAEEL FT KQMWHLEDVRIVPVVLSATGIVPKSLLRSLDELELKKDLHSIQKAVILGTC FT SIVRRFLNHHN" XX SQ Sequence 4004 BP; 1128 A; 966 C; 1099 G; 811 T; 0 other; ggtgggagct caggcaacat aactcctggg cgtccatgaa aaaccaccat cttgagtagg 60 tgggagcttg aggtaaaata tcctgggcgt ccatgtaaaa ccaccctcgg cgagcagcgg 120 cgcttgaacc aagctattaa gcactgtaat tggaggaaat gccaatgcag aacccgtagc 180 ttccgggaat gaaagcaaac ccagcaatgg agacacgatc aatgaggaat agaagaatgc 240 gatcgcccga ggagggagcc cctactggag ctggtcctgg gacgggggac agagcgagcg 300 gccagcggct ggaagaggat aatgtgcaag agcggccttc cagtcaacgg gcacagcccg 360 taccgcgaac gcgaaacaga aacggcagca gcagaaatca ccaaggccat gctgcaccta 420 ccgatgttgc tgtggctgat agacgtcagt cactcacgtt ggcaggcggc cgacggcagc 480 ggatcatgtg gacgagggag atgaatatgt atgtgattcg ctgctactat gtctgcacaa 540 ggatggagac ggacatgtct ggcagatcag ggatgctgga gatgttcaac gagcgatttc 600 ctcgctttgc gcgccagctt gacttgaaca agctgtacac acggcagcgt gcaattatgt 660 ctcataatat gctcaccgct gcagaagtgg agtacatcaa gctggaagtg cagagggaga 720 taggagagga ggggacaaga tcgagcgata cgtcgagaag gagttctgtt gggctggatg 780 caccgatcac gagtgagagt cgtgattcac ctgcacccac tgctccagga ccgacagcgg 840 atatgcaacg acagcaatta cgagacgaac tagcactcca tatgggcaca gcagttacgc 900 agttccgtgg gacagacccc atgtcccggc accggatacc aaagctgcgg tattcctacc 960 ggctgacgag tgcagtaagc atcctaaatc aggatatttt gccacagtac ttggaagccg 1020 tggagaacct tgaggatctg cagtttatcg tgtattcagc tgcaatagct gtagttaaga 1080 cgttgggatt gcgaacgcgc ccgcaaggag aagatggcac acgccccaac gcacagaaac 1140 ccgtatggat gcgacgcctt gagagccgaa tcgcaacact gcgatcaaag gttggtcgat 1200 taacacagta caagcagggg aatcgatcaa cgaggctggt tcgtcatgtt gctgaaattg 1260 ttaggcccgc agagctccga gatctcagag aagtcgacat tactgagatc ctcgacaccc 1320 atgtacagcg gttgagtgct cttgcgaaac gattgcgacg ttatggagaa tgttcgaagc 1380 ggaaggaaca aaaccggatg ttcaacataa acgagaggga gttctacaac cgcatccgaa 1440 atgacaaggt cgactttggc gagggacttc cagagattgg cgacgttaca caattttggg 1500 ccaatttatg ggagaacccc accgaacacg acggcgatgg gatgtggtta gcagaagaag 1560 agagacaatg tgatggaatt ggagacatga ccgcggtcgt ggtaaccgct caggatatac 1620 gtgaagctac ccggtatacc aggaattggg ctgcaccagg acctgatttt gtgcacaatt 1680 tttggtataa aaaactcaca acgatccatg ggcggatggc ggaatgcttc aatacggtac 1740 taggagaccc cacgcagcta ccggaattca tcaccagggg tgtcacattt cttctgccaa 1800 aggaccaaca cacagctgac ccagcgaagt acagaccaat aacgtgcctt tcaagtctgt 1860 ataaagtgct atcgtcggta atagcgagga gggtgcaagc ccattgcgac gccaacaacg 1920 tgatgaccga ggaacagaaa gggtgtcgca gaaacacgca aggctgcaaa gatcaggtca 1980 tcgtagatgc agtcattgta gggcaagcta cccaaaagca aaggaacctg agtatggcgt 2040 acatcgacta caaaaaggca tacgactcag taccgcactc gtaccttctc aaggtactgc 2100 agttgtataa ggtagacggg aacgtcatca ggctgatgca acacgcgatg gggatgtgga 2160 gcacatccct acacattacc gacggtacag ctgtgttacg gtccaggact ctcagcatca 2220 ggagggggat ttttcaaggc gataccttta gtccgctgtg gttttgcctg gccatgaacc 2280 ccctcagcaa agcactcaac caatgcaact atggctacca actgaaaagt ggggaaagga 2340 gcacaagagt tacccacacc ttctatatgg acgacctgaa gctatttgcg gaatcagtgc 2400 agaggctgca tcagctgttg cagctagtta cgacattcag taacgacatc cggatggagt 2460 ttggtattga caaatgcagg tcaattcatc tacgtcgggg tcaagttatg gatgccagct 2520 gtttccgcgt caacgagcag gaggagattc ggaatatggt tgaaggcgaa acgtataaat 2580 accttggatt cctgcaactt agaggtattc gccacacgat gatcaagaag gagctgcagg 2640 aaaagttttt gtcacgtgtc aactgtattc tgaagagctt tctgtctgcc ggcaacaagg 2700 tgaaggcgat caacacgttt gctgtgcccc tgttgacgta tagtttcggg gtggtaaagt 2760 ggaccaagac tgacttggag gcgatagaac gagcagtacg agtggcgttc accaagcacc 2820 gaatgcgcca cccaaaatcg tccattgaga gagtcaccct gccacgtgca gcaggaggaa 2880 gaggcgtcac cgatatccag gcactatgtg tctcccagat ccagcagctg cgggcatatt 2940 tcgtagaaag ccagaaccgc cacgaaattt accgcactgt atgcgaagct gaccacggct 3000 tcagcgccct gcatctggcg caggaggatt accagctgaa ctgcgacatc aaaaccgtcg 3060 atgagatgat cgcaatgtgg aagcagaagg agttgcatgg aacgcacccc catcaactgg 3120 agctcgagca catcgataag gtggcgtcaa acacgtggct ggtgcggggt gacctcttct 3180 cagaaacaga aggtttcatg gtagccatcc aggaccgggt aattgcgacg aagaactatc 3240 ggcactacat attgcacgaa gacgtggagg accgctgtag gaagtgcaat tcagtagggg 3300 agactatcga gcatgtcgtt gccggctgtt cagtactagc tggatcagcc tacctcgatc 3360 gtcacaacga agttgccaag attgtgcatc aacagcttgc ccttaagcac aacttggtgg 3420 atcgatttgt gccctactac aagtacctgc ctgacccggt tctggaaaat agttgcataa 3480 agctgtactg ggatcgcgag atcataacgg acgtcctcat ccgtgccaac cgccccgaca 3540 ttgtagtcta cgacaaaagg atgaaacgag ttacactcat cgacatcgct gtaccgctgg 3600 accacaatgt ccaatcaacg ttctctaaca agatagcgaa gtaccacgac ttggcggagg 3660 agttgaagca gatgtggcac ctagaggacg tccgtatagt tccggtagtc ctctcagcga 3720 ccggaattgt cccgaaatct ctcctaaggt ccctagacga gctggaattg aagaaggacc 3780 tacacagcat ccaaaaagcg gtgattcttg gaacctgcag catagttaga cggttcctga 3840 accaccacaa ctgagaagct catccagcag actcattagg ataagttaaa ccgaccaatg 3900 agatccacag agcctaatcc cctttggcat acagattgcc cggggtaggt gaaagtttcc 3960 agcatttttt gctgagaagt gccaaaactc tataataata ataa 4004 // ID LIN10_SM repbase; DNA; INV; 6128 BP. XX AC . XX DT 19-MAR-2008 (Rel. 13.03, Created) DT 25-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Non-LTR retrotransposon: consensus. XX KW Non-LTR Retrotransposon; Transposable Element; L1; LIN10_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-6128 RA Tempel S., Bao W. and Jurka J.; RT "Non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 8(3), 344-344 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(147..1853,1823..2803,2811..3518,3464..4309, FT 4248..6119) FT /product="LIN10_SM_1p" FT /translation="MSKNGTDPHLLELCYKAHVFLNGSKEIPLSTAVGHWR FT EIDSTGIFDSDFSELMEQIIAKPFKTEGLQLKIENAIKTFDKFIADAVNPD FT KLINHKHLNFFYNQLYYNPLKNFNFSIPIKSILKNNNIPSHRNFFNNEEST FT PLNKKSISFKNISHSKPFSYLTALTNNLKYENIDNILISIRNCTDSFFINL FT NICFFELKHLIKNNFINYFNDYFYIFYLKIQKFNNKKLQKSNYYQNNIPLL FT FKISSLTNDFFINIENYFSKIKTKRNINLFLNDFLXILKIKINILLFKYSX FT FKKLNPKNKHEKISTRSSXASGRLALAVTPSSPLDLTINTTKQTTTDINTD FT TTNTSHKFSNNRYSLFLDQNEPLDLSIVKHPHKNTNIPSTVKNTYNPLNKQ FT KPFHKTTTKTIFSNAQNNIKKHNIDTSKRPHFSXPIPKSCNTDIKYEIPPI FT EKNTPKLNDIIXSNITFNPKTPNIQQNISLPKFILPSNLHNNTTNVDLDAE FT STLVNPHFQLNFNLKPCPNLFIPKXDLPXNLNMEIDQKLLLTNTKASDETI FT SYSPITQTQTDESVLTHKLIELISINTQANRIIEQINSDIDLFSQSPHSYE FT IPNTETKEIFTVYNNLDIAKKRVEIIIEEDILKTKTQTNDDIPMINFNNNY FT INNNSLFKNILLSNESVENLFRDISDNDSIISKSPNKKINNQDFLKKTVKI FT NKTMFNLRNDGYLICIGGKCEKKKIGIHNIDNIIEHRKTNHSDTIDKNTTF FT LCLLCEGSVKRKTILLNKIIQHTETFHSEKHFPVTREELNNKNKILELXMN FT TEGTLNCRYTNIKKHCQNDLTYKSTYAQIEKHIAKKHKRKTCNINEIICHC FT GETLEHHNIIEHHNKHIGVLPAEDKLRMIPDNIPVSVKDIGNDISILSTFF FT CMHLLPQSQQLLYNHTLLKPSYKQNFKTFFETFPFSQYPNWNEIILPFKTS FT TVNWAVFYFNKLKAFATIIDPLSPNSLQKHRKIADTILNTIKNIQNIIYPS FT NINLTELNYPMCANTNESGFFICHIIQCISINHNYITIPIISEIRKRLDNL FT INKVIIDRAPIDNILKYTNILNNIKILQNNNSENPDEGLAEINRIHECIFP FT IKHKIKKXLSLIHHLLLLKIIKLNKKXXFNPPSTIIKNNKIELTYKLRFNF FT NINYKSTINKIFNNSINTLSPHPDRIILNFEREIIIINDIKFKYKLPYYPE FT NFIHKPTTIEEITNTFKIVDLSSSPGKDQISYLDWKKIDPDNSILCSLFNN FT ILFKSITPLKWKTFKTKLIIKPGKENSSHDLSSWRPLAILDTSYRLFATLI FT NRRLLEWIKLGNLININQKAIGCPDGCAEHNSVLLLLNERAVRMGTNINIV FT WLDLADAFGSISHDLIWNVLTNMGINNNTIKLFKNMYNNCNTYYECENNHY FT NFLKICIIIVTLIMNVKTTTTKTIPINNGVKQGCPMSMTLFCLAINFIITN FT ILKDHPFIINNYNVSILAYADDIVLISDNRIKIAKILTDIIKYTSDVKLKF FT RPTKCGFYENNRNHDEKPLKIYNEIIPIIDENNIYKYLGVGFGGKSNHNID FT NILEKALTDAEATFLSDLHPSQKLFAYKTSIHSRLIFPFRTCIVKHMILDC FT DRNRIVDNRTKQLGFDQKIKRLIKNSIGDIFQNISNNFIYAHCSMGGLGIV FT PSIDEYLIQSTIQLTRLLMSSDVTMRDFIIGELIECVRARFNYLENEIEMA FT MKWLNGDIDDEKVGKPCLFLKFKKSVYRLKIKFSIICKLTYIDNIFSLEIS FT KNDFKIVINNENIDNASKIMHILTGEWYALEWYKMTHQGHIAETVGKSQFS FT KHLIKSTTLGDNQYFFLIKARNNSLNLNSNPSNHKDKKDVNCRLCRIQWET FT QAHILNHCTLSQNARRNKHNHVLEIVHNFFASKNYDIDIEKSPPNVDTTLR FT PDLIIKTTDKNKIKLSLVDIKVPYDNETNFINARKQNLDKYRNLAIETGKD FT NNCKAAVYTIIVGSLGSWDKENNKDLQKLGLGVNEIGEMARMCIKAAIEAS FT YMIFMNHVTNNK" XX SQ Sequence 6128 BP; 2450 A; 1040 C; 739 G; 1881 T; 18 other; tttacacatt gacattcgat agcaccgggc cgaaatccgg atctgataag ttattttatt 60 caaaacaacc attactttat tttccagata ataatgaatt aaatgctgat cctatctttg 120 agaaaagttt gcaagaatta cgtcttatgt ccaagaatgg caccgatccc catctattgg 180 aactctgtta caaggcacat gtatttctta atggatcaaa agagatccca ttgagcactg 240 cggttggcca ttggagagag attgattcga ctggcatctt tgattcrgac ttttctgagc 300 ttatggagca aatcatagca aagcccttca aaacagaagg cctgcaattg aaaatygaaa 360 atgccattaa aacctttgat aagttcattg cagacgcggt aaatcctgat aaattgatta 420 atcataaaca tttaaatttt ttctataatc aattatatta taatccctta aagaatttta 480 acttttctat accaattaaa agtatattaa aaaataataa tattccttcg cataggaatt 540 tttttaacaa tgaagaatct acccctctaa acaaaaaaag tatctcattt aaaaatattt 600 ctcattcaaa acctttttca tatttaactg cactaacaaa caaccttaaa tatgaaaaca 660 ttgataacat cctaatatca attagaaatt gcactgatag tttttttatc aatttaaaca 720 tttgcttttt tgaattaaaa catttaataa aaaataattt tataaattac tttaatgact 780 atttttatat attttattta aaaattcaaa aatttaataa taaaaaatta caaaaatcta 840 attattatca aaataatata cctttacttt ttaaaatttc cagccttact aatgattttt 900 ttattaatat cgaaaattac ttttcaaaaa ttaaaactaa acgaaatatt aatttatttt 960 taaatgactt yttatwtatt ttaaaaatta aaataaatat wttattgttt aaatattcta 1020 awtttaaaaa attaaatcca aaaaataagc atgaaaaaat tagtacacgc agctcgycgg 1080 cctccggccg actcgcactt gcggttacac cctcttctcc gctagatctt actataaaca 1140 ccaccaaaca aacgacgacc gatataaata cagatactac taatacatct cataaatttt 1200 ccaataacag atatagcctt ttcttagacc aaaacgaacc acttgacctc tccatagtaa 1260 aacatcctca caaaaacact aatatacctt caactgttaa aaatacctac aatcctctaa 1320 acaaacaaaa gcctttccac aaaactacta ccaaaaccat tttctctaac gcacaaaata 1380 acataaaaaa acataatata gatacctcta aaagacctca tttttcrwtr ccaattccca 1440 aatcctgtaa caccgatatt aaatatgaaa taccaccaat tgaaaaaaat acccctaaac 1500 taaatgacat tatarcctcc aacattacct ttaacccaaa aacaccaaac attcaacaaa 1560 atatctcttt gcccaaattt atattaccct caaatcttca taataatacc accaaygttg 1620 accttgacgc tgaaagtacc cttgttaacc cacactttca acttaacttc aacttaaaac 1680 cttgtccaaa cctttttatt ccaaaawacg acctaccaaw aaacctaaat atggagattg 1740 accagaaact actactaact aacaccaaag catctgacga aacaatatca tatagtccta 1800 tcacacaaac tcaaaccgat gaatcagtat taacacacaa gctaatagaa ttatagaaca 1860 aattaactct gacatagatc ttttttcaca atctccgcac tcctatgaaa ttccaaacac 1920 tgaaaccaaa gaaatwttca ctgtgtataa caaccttgat atagcgaaaa agagggtcga 1980 aataattatt gaagaggata tcctcaaaac taaaactcaa actaacgatg atattcctat 2040 gattaatttt aataataatt atattaataa taattcactt tttaaaaata tcctactttc 2100 aaacgagtct gttgaaaact tatttagaga catttcggat aatgattcaa tcatttcaaa 2160 atcacctaat aaaaaaatta ataatcaaga ctttttaaag aaaactgtta aaattaataa 2220 aactatgttt aatttacgta acgatggcta tttaatatgt attgggggaa aatgtgaaaa 2280 aaagaaaatc gggatacata atatagacaa cattatcgaa catcgtaaga ccaatcattc 2340 tgacacaatt gataaaaaca ccactttttt atgtctacta tgtgagggat cagttaaacg 2400 caaaacaata ctactaaata aaattattca acacacagag acttttcact ctgaaaaaca 2460 tttccctgtc acaagagaag aacttaacaa caaaaacaaa atacttgagt tatyaatgaa 2520 tactgaggga accctcaact gtagatacac aaacattaaa aaacattgcc aaaacgactt 2580 aacctataaa tcgacatatg cacagataga aaaacacata gcaaaaaaac ataaaagaaa 2640 aacatgtaat ataaatgaaa tcatatgtca ctgcggcgaa acattagaac atcataatat 2700 catagaacac cacaataaac atataggggt attaccagct gaagacaaat taagaatgat 2760 accagataac ataccagtct cagtaaaaga tattggaaat gactgactag atcagcattc 2820 tatcaacctt tttttgcatg catttgttgc cacaaagtca acaattatta tacaatcata 2880 cactactgaa gccctcatac aaacagaatt tcaagacatt ttttgaaact tttcctttta 2940 gtcaataccc aaactggaat gagattattt taccatttaa aacatcaaca gtaaattggg 3000 cggtttttta ctttaataaa cttaaggcat ttgctacaat tatagatcct ttaagcccta 3060 acagcctgca aaaacatagg aaaatagccg atacgatcct aaatacaatt aaaaatatcc 3120 aaaacattat atacccatct aatataaatt taactgaatt aaattatcca atgtgtgcca 3180 acactaatga aagtggtttt tttatttgtc atattattca atgtatttct attaatcata 3240 actatattac tatacctatt atatctgaga ttaggaaaag attagacaat cttattaaca 3300 aagttataat agatagggct cctattgata atatattaaa atacactaat atactaaata 3360 atattaaaat tcttcaaaat aataactcag aaaatccgga tgaaggttta gctgaaatta 3420 accgtattca cgaatgtatt tttcctatta aacataagat taaaaaaaan ctktctttaa 3480 tccaccatct actattatta aaaataataa aattgaatta acgtataaac taagatttaa 3540 ttttaatatt aattataaaa gtacaattaa taaaattttt aataactcta taaatacttt 3600 gtctccgcac ccagacagga taatattgaa cttcgagaga gaaattatta taattaatga 3660 tattaaattt aaatataaac taccatatta tcctgaaaac tttatccata aacctactac 3720 aattgaggaa ataacaaata cctttaaaat tgttgactta tcttcttcac ctggtaaaga 3780 ccaaatatct tatcttgact ggaaaaaaat tgaccctgat aattcaatcc tttgctcact 3840 ctttaacaat atcttattta aatctatcac ccccctaaaa tggaaaacct ttaaaactaa 3900 gttaataatt aaaccaggga aagaaaatag cagtcatgat ctttcttctt ggagaccact 3960 ggcaatctta gatacctctt atcgtctttt cgcaactctt ataaaccgta ggcttcttga 4020 atggataaaa ctaggtaacc ttattaatat taaccaaaaa gctataggat gtcctgatgg 4080 gtgtgcggag cacaactctg ttttattatt attaaatgaa agagcagtgc gcatggggac 4140 taatataaat attgtatggt tggacctggc tgatgccttt ggcagtatat ctcatgacct 4200 tatttggaat gtacttacta atatgggtat taataataat actataaaac tttttaaaaa 4260 tatgtataat aattgtaaca cttattatga atgtgaaaac aaccactact aaaaccatcc 4320 caatcaacaa tggagtgaaa caaggatgcc ctatgtcaat gacacttttt tgtctggcaa 4380 ttaacttcat tattactaat attctgaaag accatccctt tataattaat aattataatg 4440 tcagtatact agcgtatgct gatgatatag tacttatatc tgacaaccgt atcaaaatag 4500 caaaaatatt gactgacata attaaatata catctgatgt taaacttaaa ttcagaccta 4560 ctaaatgcgg attttatgaa aacaatagga accacgatga aaaaccatta aaaatatata 4620 atgaaataat tcctattatt gatgagaata acatttacaa atacctaggg gtaggttttg 4680 gggggaaaag caaccacaac atagataata ttcttgagaa agcccttact gacgctgaag 4740 caactttttt atctgacctc cacccttcac aaaaattatt tgcgtataaa actagcattc 4800 actctagatt aatttttcca tttcgtacat gcattgttaa acacatgata cttgactgtg 4860 atagaaaccg cattgtagac aatagaacta agcaacttgg atttgaccaa aagattaaga 4920 gactcattaa aaattctata ggggatatat ttcagaatat tagtaataac tttatatatg 4980 cacactgctc tatggggggt ttaggtatag tccctagtat tgatgagtac cttatccaaa 5040 gtactattca acttaccaga ttacttatgt caagtgatgt aacaatgaga gattttatta 5100 ttggggaact tatagaatgt gtaagggccc gatttaatta tttagaaaat gaaattgaaa 5160 tggcgatgaa atggctgaat ggagatattg atgatgagaa agttggtaag ccttgtttgt 5220 ttttaaaatt taaaaagtct gtatatagac ttaaaattaa attttctatt atatgcaaat 5280 taacatatat tgataatata tttagtcttg aaattagtaa aaatgatttc aaaatagtga 5340 taaataatga aaatattgat aatgcctcca aaataatgca catcctcact ggtgagtggt 5400 atgccctcga atggtacaag atgacacatc agggtcacat tgctgaaaca gtaggaaaaa 5460 gccagttctc taaacacttg attaaatcta caacattagg agataatcaa tacttttttt 5520 taataaaagc tagaaataac agcttaaact taaattctaa cccctctaac cataaagata 5580 aaaaagatgt taattgtaga ttatgtagaa tacaatggga aacacaggcc catatactaa 5640 accactgtac cctatcccaa aatgcaagga gaaacaaaca taatcatgta cttgaaattg 5700 tacacaattt ttttgcaagt aaaaattatg acatagacat agaaaaatcg cctcctaatg 5760 ttgatacaac cttaagacct gatcttatta ttaagacaac tgacaaaaat aaaataaaac 5820 tatcattagt tgatattaaa gtgccatatg ataacgaaac aaattttatt aatgcacgca 5880 aacagaatct tgacaaatat aggaaccttg caattgaaac tggtaaagat aacaactgca 5940 aagcagcagt atacactatt atagttggat cgttaggatc ctgggataaa gaaaataata 6000 aggatcttca aaagttagga ttaggagtaa atgaaatagg agagatggcg agaatgtgca 6060 ttaaagcggc tatagaggca agttacatga tctttatgaa tcatgtaact aataacaaat 6120 aacttcta 6128 // ID Gypsy6-I_Dmoj repbase; DNA; INV; 6971 BP. XX AC scaffold_6496; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6_Dmoj; KW Gypsy6-LTR_Dmoj; Gypsy6-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-6971 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1056-1056 (2009). XX DR Genome; scaffold_6496; Positions 26424476 26417506. XX CC Positions [5937-6413] - Integrase core CC 'ATGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1957..3324 FT /product="Gypsy6-I_Dmoj_1p" FT /translation="MEDAEYHNPGLSVGWIYQRCKEELITIAREFGLDTEG FT RVEDLRKRLSTFIQSNNYSAEAKARLAEIEKRFAKGSEFTLGGNAPSMGSR FT AYQKSEPDLRKVQEQGPVQAESLRLRVPMANGAAVRRREERKEFGNTGEDE FT AVATIPKTGRQVPASTIPLERSPPWGVAEQVRKWGMRYDGQTDPLGFVEIV FT EERAITYGIELDQMPRAMSEVFADKAAKWFLTSGLRDSSWADFMREFLDFF FT LPPQYFERLEDQIRSRHQREGESFKDYLIELRLLMRQARYSAAQELYRAYE FT NAAPDYRLYVRRHDFVTLTQLTQMAAEFENVQGQRREQGREAVADRTKTAW FT SKPAENRPNNPFRTQTPGSQVRGQAAGAAMAPTAVSQEAVHRMMVQEGESG FT TATRRTCYRCGQVGHFARECQSGQAPSCQRCGKKGKSTRECCERDDAGNGM FT GRPPLERGGAA" FT CDS join(3258..4703,4707..6956) FT /product="Gypsy6-I_Dmoj_2p" FT /translation="MLRARRRGKRNGAPPAGAGRRRLDSKNEVKNPLSVEE FT GRIVAAVVIGGRAMTATIDTGATRSCISEDCVRRQAIRGEIQEMRSRTRLA FT DGSTLEITKRISVEVSLAGKVAKMTLLVMPAMLDHLILGMDFLGAMDTTLC FT CGNAALTMRMKEEPNEETSLRSGPHESKESDQSGGPQSLARSRIREKERKR FT GLEAPCWSPTQKGGAASTGGGLPIDGQGTSQGSGPPARPIKRWREIPKREP FT GRCPQLARAKDLEEGLEERPPEVRLVGLEAEDVDEIRSSQSPQRPIYQEED FT EENTEPSPDDAGWPEDLEDELKEFLEAELALFEGLDGVSNIAEHSIRMRDD FT KPLKQRYYPKNPAMRKVIDEQVDELIRAGAIEPSRSPHSAPIVLVKKKTGD FT WRMCVDYRQLNAHSIPDAYPVPRINHILERLRHARYISTLDLKSGYWQIPM FT AADSRECTAFTVPGRGLFQWRVMPFGLHSAGATFQRADTVIGPDMEPHAFA FT YLDDIVVIGATKAQHVENLREVFRRLRKANLRVNRKKCSFFRKRLVYLGHV FT ISEEGIRTDPAKVEAIRRFPTPSTLKELRQCLGMASWYRRFVPNFASMVEP FT MTRLLKKGHKWEWSDEQEESLKQLKESLTTAPVLACPDFSARFVLQTDASD FT YGLGAVLTQEVEGEERVIAYASRKMLKAELNYSATEKECLAIVWAIRKMRC FT YLEGYRFEVVTDHLALKWLNSIESPTGRIARWALELQQYQFDIRYRRGKLN FT VVADALSRQPVESCQRMMETNPPCKWIARMRGRIKEDPEKFKDYVEENGQL FT YRNLGHRVDDEDYIPWKLCVPTHQRKRVLQECHDAPTAGHQGVRKTVMRLA FT QRYYWPGMFREATLYVKRCEECQKFKSEQRKPAGQMLTRQVAEPMSILCAD FT FVGPMPRSKRGNTMLLVFHDAFAKWVELVPLKRATAALLQAAFRERILARF FT GVPKMFVCDNGAQFASKGFRSYMESLGVRLQFTAPYSPQENPTERTNRTVK FT TMIAQYIEGHQSSWDVLLPEISLAINSSGADSTGFTPAFLMMGREPRLPAT FT LYDEVTPGSATREVDLETRKETMKEIFEVARNNLQKASKDQGRYYNLRRRE FT WRPALGSMVLLRQHQLSSAAEGFAAKLAPKFEGPYRVVHFISPNVVRLARA FT GERRKRVANLSQLKPFYTEEEETGETMLREEAGEEGSGVFQSPTSPHALQS FT DVGRRSLQRHGGGHRSRGRLLD" XX SQ Sequence 6971 BP; 2084 A; 1420 C; 2084 G; 1383 T; 0 other; agtggcgccc gagcagggac tcagggaggt gcgcggtgta tcgcatggcc gaggggaaac 60 ggcggaaatt gtaataaata tataaaagtg ctatatggtg atatataaaa ccttaaaaat 120 cgtgatcgag aagagtagaa aattcatgtg tatatatata tatgtgcgtg ggtgtttaac 180 gattttggcg caagctgtac ttttcggcac cataatcgtt ttaccgttgg ttctcttctc 240 tcgtcgcttt tgaatcttta taccctggac agcgcgatcg ctgtgaggga tatccgttcg 300 ttgcttacat ttcatcacgt atctactcat tgcttatatg tctggttgtg gtggtagttg 360 gttaaatata ttataagtcg gaattaatat atatggtata ttgaagtaaa tatatatatg 420 tggtgcatat aagcgtatat gcatgaagga gtataatggt acatgggtat gtatgtgcga 480 gtaaataccc gagaatattg ttatatgtgc aagtagtggt tgattttgac atgtgaatat 540 cttctctgag tggcaagtcg agtggatctt gcgagtgaga aaagaaaaag tagggtatac 600 tggtaacggt aaagagacga attgcgcgtc taattcgaat ttcggcgcgc cgattttgtg 660 catatttatt cgccactata tgagtatacg catttgcata tatgtatgta tgtttgtgtg 720 catgtctgta gttatgtgca aacatatata tatgcatgca gactagagcg catatacata 780 tatacgcatt ttgtccgttt gttacatata tatatatgtt tgttatatat atatatatgt 840 aggtgagagc gcataggcga gagaaagaga aagagcgagg gtaaattttg gccgcttgtc 900 agttacatat gtatgtttgt cgtttgtccg ttacgtgggt ccgctgtacc ctacgtaggg 960 catatgcgac atgtggttat gaaaacgcgt tgcatcgaga ttcgtccggt ttgtggtggc 1020 ctggggccca ctatgcagct cgtctattat agttcgctta cccgtgcgtt atttcgtgat 1080 accctaccca gggcatatcc gaataaaata aaataaaata aaataaaata aaataaaata 1140 aaataaaata aaatagaata aaataaaata aaataaaata aaataaaata aaataaaata 1200 aaataaaata aaataaaata aaataaaata aaataaaata aaataaaata aaataaaata 1260 aaataaaata aaataaaata aaataaaata aattgaaata taacaaataa acaaggcgga 1320 gcgaatcgga ataaagcagg gtgaagcggg aagtattcaa gaagtgggga ggtaaggtga 1380 agagaacgag gcagaatgca gcatggggta aagcgaagag agatgaaaca agtaggaaat 1440 gaagaagggc aagattcgac aatgaagatg gagtgaagca acgaagaaat ttaacaaagg 1500 tatcgaaaga aaacgatcaa gggaaatgaa agtgtgggaa tgaggatgaa atacagcaag 1560 ttaaacgaaa ggaaaaagga atttaagggc gtataaaaag gaagcgaaaa ggaatgcgac 1620 gaatgaaaga ggaagatcta actgaaagta aatacggaga agaggagatg aagtaaaggc 1680 acttaaacat ttagaataaa aaaaaaaaaa aaaaaaataa aaaaaaataa aataaaataa 1740 gaggaaataa agagaataca gtaaaattag aataagtgat caaaggggta ggttaagagg 1800 aataagataa taataagacg ataaacgcaa gcgaaattgt aaaaaaaaaa aaaaaaaaaa 1860 aaaaaaaagg gaaaaaaaaa aaaaataaaa aaaaaaaaac ccaacaatag agaagtaaga 1920 gctgaacaaa ccgggaggaa taaataagca ttcgagatgg aggacgccga atatcataac 1980 ccaggattga gcgtcgggtg gatctaccag cggtgcaagg aggagctgat cacgatagcg 2040 cgagagttcg gcctggatac cgagggccga gtagaggacc taaggaaaag gctctctacg 2100 ttcatacaga gcaacaacta ttcggcggag gccaaagcga gattggccga gatcgagaag 2160 cgtttcgcga aagggtccga gttcactctg ggaggcaatg cgccaagcat gggcagccgg 2220 gcttatcaaa aatcggagcc ggatttgcga aaggttcaag aacaaggtcc cgttcaagca 2280 gagtccctga gactgagagt gcccatggcg aacggcgcgg cggtgcgacg tcgtgaagag 2340 cggaaggagt ttggcaacac tggagaagac gaggcggtag cgacaattcc gaagactggc 2400 cgacaggtcc ccgcatctac gatcccgttg gagaggtccc caccctgggg ggtcgccgaa 2460 caggttcgta aatggggcat gcgctatgat ggccaaacgg acccgttggg cttcgtggag 2520 atcgtagaag agagggcgat cacctacggc atcgagttag atcagatgcc acgagcgatg 2580 agcgaggtat tcgcggataa ggcggcgaag tggttcttga ccagcgggtt gagagattct 2640 tcgtgggccg atttcatgag ggaattcttg gatttcttcc tccccccgca gtacttcgaa 2700 cgcctggagg atcaaataag gtcgcgtcat cagagagagg gagaaagttt caaggactac 2760 ctaatcgagc ttaggttgtt gatgcggcag gcacgatata gcgcggcgca ggaactgtat 2820 cgagcatatg agaacgcggc cccggactat cggttgtacg tgaggcgtca tgactttgtc 2880 acgctgacac aactcacgca gatggcagcc gagtttgaga atgtccaagg tcagaggagg 2940 gaacaaggca gagaggctgt ggcagatagg acgaagacag catggtcgaa gccggcggaa 3000 aatcgaccaa acaacccttt ccgcacacag acgccgggat cccaagtcag aggtcaagca 3060 gcaggagcag cgatggcgcc cacggcggta tcccaagagg ctgtccaccg gatgatggtg 3120 caggagggcg agtctggaac ggcgacacga cgaacttgct atcgctgcgg gcaagtaggt 3180 catttcgcac gcgaatgcca aagcgggcaa gcaccatcct gccagcgatg tggaaagaaa 3240 ggtaaaagca cgcgggaatg ctgcgagcgc gacgacgcgg gaaacggaat ggggcgcccc 3300 ccgctggagc ggggcggcgc cgcttagata gtaaaaatga ggtgaagaat ccactttcgg 3360 tggaggaggg gaggatcgtc gcggcggtgg tgattggggg tcgagcgatg acggccacaa 3420 tcgacaccgg ggccacacgc agctgcataa gcgaagactg tgtgcgcagg caggcgatcc 3480 gaggcgagat ccaggagatg cggtcccgga ctagattggc tgatggatcc acgctggaga 3540 taacgaagag gataagtgtg gaggtcagtc tcgcaggaaa ggttgcaaag atgacgctgc 3600 tcgtcatgcc ggcgatgctg gaccacctta tcctggggat ggacttcttg ggcgctatgg 3660 acaccacatt gtgttgtggc aacgcagcgc ttaccatgag gatgaaggag gagccgaacg 3720 aagagacgtc cctgagaagc ggaccgcatg aatcgaaaga gagcgaccag tctggaggac 3780 cacagtcgct tgctcgatcg aggattcgcg aaaaggaacg gaagagaggc ctcgaggcac 3840 cgtgttggtc cccgacgcag aaagggggcg cggcgtcaac gggcggagga ttgccgatcg 3900 atgggcaggg cacgtcgcag ggcagcggac caccggcccg gccgatcaag agatggcgcg 3960 agatcccgaa aagagaaccc gggaggtgtc cccagctagc gagggcaaag gatctggagg 4020 aaggactaga ggagcgtccc ccagaagtta ggttggtggg attggaggcg gaggatgtcg 4080 acgaaattcg gagtagtcag tcaccgcagc gtccaatata ccaagaagag gatgaggaga 4140 atacagagcc ttcccccgac gacgcggggt ggccagagga cctggaggac gagctgaagg 4200 agttcctgga agcggagctg gcgcttttcg aggggctcga cggagtttca aatatcgccg 4260 agcatagcat aaggatgcgg gatgataagc ctcttaagca aagatattat cccaagaatc 4320 cggctatgag gaaggtaatt gacgaacagg tagacgaatt gattagagct ggagccatcg 4380 agccgtcgcg gagcccgcat agcgcaccta tcgttctggt aaagaagaag accggagatt 4440 ggcgcatgtg tgtggactac aggcagctga atgcacactc aatcccagac gcctatcccg 4500 ttccacgtat caatcacatt ctcgagaggc tgagacacgc acgatacata tccaccctgg 4560 atttgaagag tggatattgg caaatcccca tggcagcaga tagtagagaa tgcacggcgt 4620 tcacggtccc ggggagaggg ttatttcaat ggcgagtgat gccatttggt ttacattcag 4680 caggcgctac gttccaaagg gcgtaggaca cagtaatagg accggacatg gagccgcatg 4740 cttttgcata cttagacgac attgtggtga tcggcgccac gaaagcgcag catgtggaga 4800 acttgaggga agttttccgc cgattgagga aagccaacct tcgagtaaac cggaagaaat 4860 gcagcttctt caggaaaagg ttggtctatc tcgggcatgt catcagcgaa gaagggatac 4920 gcacagatcc cgcaaaggtc gaggcgatcc gccgcttccc gacaccatcg actctcaaag 4980 aactgcggca gtgtttggga atggcatcgt ggtacaggcg attcgtgcca aactttgcgt 5040 cgatggtgga accaatgacg agactcctta agaaaggcca caaatgggaa tggagcgacg 5100 agcaagagga gtcattgaag caactcaaag agagcctcac gacagcgcca gtgttggcgt 5160 gtcctgattt ttcagcgagg ttcgtactac agacggatgc cagcgactac ggcttgggcg 5220 cagtgctcac acaagaggtc gaaggcgagg aaagggtgat tgcgtacgct agtcggaaga 5280 tgttgaaagc ggagctcaat tattcagcta cagaaaagga gtgcctagca attgtctggg 5340 ctatcaggaa aatgcgttgc tacctcgaag gctatcgatt tgaggtagtt acagatcact 5400 tggcattgaa gtggcttaat tcaatagaga gccccacggg gagaatcgcg aggtgggcat 5460 tggagctgca gcagtaccaa ttcgacatcc gctatcgacg agggaaacta aatgtggtgg 5520 ccgacgcact ctctcgccaa cctgtggaaa gctgccagcg aatgatggag accaatcccc 5580 cttgcaagtg gatagcgagg atgcgtggga ggatcaagga ggatccggag aagttcaaag 5640 attacgttga ggagaacggg cagctgtacc gcaatcttgg ccaccgggtg gacgacgaag 5700 actacattcc ctggaaattg tgcgtcccaa cccaccagcg gaaaagagtg ctgcaagaat 5760 gccatgacgc tcccacagcg gggcatcaag gcgtgaggaa gacggttatg aggctggcgc 5820 aaaggtacta ctggccgggg atgttccggg aagcgactct atacgtgaag cgatgcgagg 5880 agtgccaaaa gtttaagagc gagcaacgca aaccagcggg tcagatgctg accaggcaag 5940 tagcggagcc catgtcgatt ctgtgcgcgg atttcgtagg tcccatgcct cgatcgaagc 6000 gcgggaacac aatgttgctt gtgttccatg atgcgttcgc caaatgggtg gagctggtgc 6060 cgttgaagag ggcaaccgcg gcgttgctgc aagcggcttt cagggagcgc attttggctc 6120 gattcggagt ccccaagatg ttcgtctgtg acaacggcgc gcagtttgcg agtaagggat 6180 ttcggtcata catggagtcc ttgggggtgc ggctacagtt cactgccccg tactccccgc 6240 aagaaaatcc gaccgagagg accaatcgca ccgtaaaaac gatgatagcc cagtacatcg 6300 agggacatca gagttcgtgg gatgtgctgc taccagagat atcgttagcg atcaactcta 6360 gtggcgcgga ttctactgga ttcacgcccg cgttcctgat gatgggtaga gagcctcgcc 6420 tcccggctac cctttacgac gaggtcactc ccgggtcggc aacgagggaa gtcgacttag 6480 agacaaggaa agagacgatg aaagaaatct tcgaagtggc ccgtaataat ttgcaaaagg 6540 cgtcgaagga ccaaggcagg tattacaatc tgcgccgacg agaatggaga cccgcgctag 6600 ggtcgatggt tctgctgcgg cagcaccagc tgtccagcgc agcggagggc ttcgcggcga 6660 aattggcgcc caagttcgag ggcccctaca gggtcgtcca ctttatctcc cccaacgtgg 6720 tgcgactcgc tcgagcgggc gaacgaagga agagggtggc caacctttcc cagcttaagc 6780 ctttctacac agaggaggag gagaccggcg agacgatgct gagagaagaa gctggtgagg 6840 agggaagcgg agtttttcag tcgccgacat cgccgcacgc gcttcaaagt gacgtcggac 6900 gccggagtct acaacgtcac ggtggcggcc acaggagccg tggtcgtctc cttgattaag 6960 taaggggggg g 6971 // ID Gypsy-1_IS-I repbase; DNA; INV; 4073 BP. XX AC ABJB010258749; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_IS_; KW Gypsy-1_IS-LTR; Gypsy-1_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4073 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010258749; Positions 6655 2583. XX CC Positions [3179-3613] - Integrase core CC 'GGGGA' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 1083..2621 FT /product="Gypsy-1_IS-I_1p" FT /translation="MFFFFRFEVDSGAAFSLISEETFRSAWPTQPPQFTQE FT DIALRTCSGENLESLGTIQVTVQFKRFRAQLPLLVMKRSGCNVLGRNWFEP FT LGIGLGGVHQIGLDKSLLNVIGKHAPLFDGDLNQLKGPAVSLELQEKAAPK FT FLRARTVPFVLKPAVETELQRLTDQGILEQTQHSEWATPVILVRKKDGGIQ FT LCGDYRSTINAVTRKAAYPLPTVVEVLAKLRGVKVFSTLNLAQAYQQLKVT FT ERTADILTINTIKGLYRIKRLPFGVSAAPAIFQRCIETTLAGLPGVSAYLD FT DIIVTGATPKEHAQRLDAVLTRLQEVGLRVQKTKCRFGVAEVTYLGHRINA FT AGVHTTDDKLKAIKDLPEPTSKATLQSFLGMLAFYDRFMENRATVACDLYK FT LLEENVPWRWEKEHSDAFKRLKELLPRKTTLVYYDESKPLLLSCDASPYGI FT GAVLAQVDPNGTEAPIAFASTTLGEAESNYSQLDREGLAVVFGAVRFHQYI FT AERKWSSPLTTNLYLRS" XX SQ Sequence 4073 BP; 1101 A; 1065 C; 1047 G; 860 T; 0 other; actggcgacg agtaagacga cggaccttct acctcagtag aaatgactac tcacggctaa 60 atggagcagt tctccggcaa gggctggtcg tcatgggcag agcggctaca atattacttc 120 gaagccaaca gcatcgggga cgatgccagg aaacgtgctg ttttgctgac gctgtgcagc 180 ccgaccacgt tcgaaacagt gaaagcactg gcagcaccga aaacgccaag tgagttttct 240 tttggcgaaa tcgtcgccat gcttcaagcc catttcgatc cgcgcccctc ggaactcttc 300 agccggtgca gatttcaacg cagagatcag ctaccggacg agattgtcac tgcgtacgtc 360 gcagccctca agaagcttgc ggaggactgc aattttggaa cgattcagcc tgtcgcgacc 420 accaccgcta cgactgtcac gtccagcgtc tccggttcgg atggatcagt tggaacgtcc 480 agcggttccg gttcggatgg atcagttgga acctttggcc gcggtttaac accatctata 540 gtcaccatga ctatgctgcc aatggacatc atgctccgag accatttcgt ctgcggactt 600 cgagacgacc atctgcagca gcgtcttttc gccgaaagga acctggcgtt ttttaaggct 660 ttcgacatag ccgtgcgagc ggaaaacgcc aaggaccatc agcgtcaaat cgaagaagac 720 ttcaaaaaag tgaacacagc agaaacgtct cgtctgccag attgacggcg tgaagagacc 780 ccagtgacaa cagtaacgtc ttgttaccgc tgcgaggagc ctcacagtcc atactcatgt 840 aaatttaaga gttcgacgtg cagatactgc aagaaggtgg gacatatagc aaaagcctgc 900 ttgaagaaaa aaaagtccgc tggcagcagc acggttcacg atgtcaatac aaatcaaacg 960 agcgattcga gtgctcatga accaagtcat gcaagctcac ttcatgacct caactcccta 1020 aacaatcgac ggcaatcgac ggcagtcgac agacaaactc aaggtgagcc ttcgagttga 1080 agatgttttt tttttttcgt ttcgaggtgg actcgggggc agcgttttca cttatcagtg 1140 aagaaacgtt tcgatcggcg tggcccacac aacctcccca gttcacgcaa gaggacattg 1200 ccctgcgcac ctgctcaggt gaaaatctgg aatcgctggg cacgatccag gttacggtgc 1260 agttcaagcg cttcagagcg cagttaccac tcctggtgat gaagcgttcc ggctgcaatg 1320 tactcggacg gaactggttt gaacctcttg gtattggact aggcggcgtt catcagatcg 1380 ggctcgacaa aagcttgctg aatgtcatcg gcaaacacgc tccactcttc gatggtgacc 1440 tgaaccagct caaaggcccg gcagtcagtc tggagcttca agagaaggct gcgccaaagt 1500 tcctgagagc ccgaacggtg ccatttgtgc tgaaaccagc agttgaaacc gagttgcagc 1560 gtctcacaga ccagggcatc ctagagcaga cacaacactc ggagtgggcc acacccgtca 1620 ttttagtacg aaaaaaagac ggtggaatac aactttgcgg agactaccgc agcaccatca 1680 acgctgtcac acgcaaggcg gcgtatccgc tacctaccgt tgtagaggtg ctggcgaagc 1740 tccgtggagt aaaagtattt tcaacgttga atcttgcgca agcctatcag caactgaagg 1800 taactgagag aacagctgat atactcacga tcaatactat caaagggctc tatcggatta 1860 aacgcctacc ttttggagtg tcagcagcac cagccatttt tcagcggtgc attgaaacaa 1920 cgttggcggg actacctgga gtcagtgcct acctagacga tatcatcgtc accggcgcca 1980 caccgaaaga acacgcgcaa cgtctagacg ccgtcctgac acggctccaa gaggttggcc 2040 tacgtgttca gaagaccaag tgtcgttttg gcgttgcaga ggtgacctac ttaggacaca 2100 gaatcaacgc agcaggggtt cataccacag acgacaagct aaaggcaatc aaggatttgc 2160 cagagccaac ctccaaggcc actctccagt cattccttgg aatgctggct ttctacgatc 2220 ggttcatgga gaacagggca acagtggctt gtgacctata caagctgcta gaggaaaacg 2280 taccatggag atgggagaaa gagcactccg atgcttttaa aagacttaag gagcttttgc 2340 cacgcaaaac aactcttgtc tactacgatg aatcgaagcc gcttcttttg tcatgcgacg 2400 cttcgccata tggcattgga gctgtgctcg cccaagtgga cccaaacggc acagaagctc 2460 ccatcgcgtt cgcgtcgaca acactaggtg aagctgagag taactactcc cagttggacc 2520 gagaagggct ggcagtggta tttggagcgg tccgcttcca tcaatacatc gctgaacgca 2580 agtggtcatc gccactgacc accaaccttt acttgagatc ctaggttccc agaaactggt 2640 accccaggtt ctgtcttcca gaatgcgacg ctggtgtgtc aagctctcag cttacgacta 2700 ccagctggtc tacagaccag gaaaaaagca ccaaaacgcg gatgcgttaa gccgcttacc 2760 tttgtctgtg agcgaggacg aaccatgtcc tccaggaaac gttctcatga ttgagacacc 2820 aatggataca ccgctcacag ccgagcggat cgcagaaatg acacaagccg atccagtact 2880 gtcacgtctc tttcaagctg tgcagcaagg aaatctctat cagctaaagg aggcaacgtt 2940 ccagcacttt ttgaagaagc gcaccgggct gtcgaccgtt aaaccagagg ctgcacgtgc 3000 tgacgccatg aagctgctgc acgtaggaca caaaggaatt gtaaccatga agatgacagc 3060 ccgtagtcac ctgtggtgac cagccatcga caaggaaatt gagaaggcgg taaaaagctg 3120 cgctgcctgt cagaagtcca gaccggttcc caacaaagct ccccttccaa cgtgggatca 3180 tccatccaag ccctgggaca caattcatct cgacttcgcg ggtcccttgg aagggcagat 3240 gttcttagtc gtcatagacg cctttacaaa gttgctggaa gtgcgctatg tgacaaaggc 3300 cacgaccgca ggagttatca aagagctgag ggctctgtct gcgactttcg gactacctcg 3360 aaagatcgtg tccgacaacg ggacagtttt tgtttcaaag gaagtgatgg atttttatca 3420 caacaatgga atacatgcgg tgacgaacgc accatatgat cctgcgatga atggccaagc 3480 cgagcggatg gtatatgaac tcaagcaagg gctagcaaag gacaagaagg gacttttgtc 3540 gttgcgtata gcgaggctac tctacaaaca acacacggca gttctaactc gaccgggaag 3600 acacccgctt ttatgatgtt tggacgagaa ctcgctacaa acatcagccg actgatgccc 3660 tttccagaga gtgaaaggaa acaagtggat gaaaagttgc ctgcatccag aattttcaag 3720 gagggacaac atgtggttgt ccttaatttc agagggactc caatatggat acacgacagg 3780 ttgataaaaa aaactggccg tcgctcttgg cttgttcaaa cagctggggg aaatatccgg 3840 caacgcctaa accacataag agggtgctcg tctccgttga ctccgccagc agcctggtcg 3900 attggtagtc acgactgcac accgggttgt cctacagaac agaacatacc gcctcaagca 3960 gccacaacga atgaaaattc cggtaatgtg ggtgtagcgc ctagcagtgc ctacccgccg 4020 agaaccagac gtccacccga tcgttatggc gactgggtct aaggagggag gga 4073 // ID hAT-6_BF repbase; DNA; INV; 2529 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-6_BF autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-6_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2529 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2529 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 927-927 (2008). XX DR [2] (Consensus) XX CC The transposon is incomplete: it contains only the CC transposase-encoding region. XX SQ Sequence 2529 BP; 710 A; 606 C; 660 G; 553 T; 0 other; atgtctcaaa atgtcatttt tctttcagtc tgtgttgctg atgaggctcc tcccatcccc 60 ctcgaagaag gaagaggcct tgtagagtcc ctgtttgaca tcaagaacaa acgtgtgcac 120 agccactaca catgccgatt caacaactgt tcctccatct cacacactga aagccaaaga 180 atgaaaaccc tgaccgacaa gttcaagcac gactggcttt ttgagaagac agtttccttt 240 gatctgcacg taggggtttg gtggctagtg tatgtggagg gtgaagggat gtactgcctg 300 ctgtgcagga agcacaagca ggtaaaccag caaaacaaaa gtgacaagtt tgtcgtgctg 360 ccttccatca ggttaaggaa ggctgcggtg acagaccatg gtggttgtga cacccacacg 420 cgtagctggg aaacagaaag ccaacaaaga agaggcagca tcattcacag agaactggag 480 aaaaaacgga gtacacactt cactgcagtg cacaaagctt ttggatgtgc ttacttcctg 540 gcaagggaga atgttgccaa ccggaagttc ctgcccttga tcaacttttt ggaggatgta 600 ggtgtgccag atgtaaagta ctttgaacac agaagtcagg gctctatcag ggaagttttc 660 ttgaccatag gtgagacaat ccagaagact gtgattgaac gtatcaaaag gagtggtgtg 720 tttggtgtct tggttgacga tgtcactgat gctgcagcct tggagcaaat gataacattt 780 gtgcagtttg ttgacaccga agagaacaaa gcggctgtgg ctttccttgg cacaaaaaat 840 gtcctggaaa gccacgagtc cgctaatgca gaagctctgc tagacaattt cacagaactg 900 ttagatactt gtggattaga actgtctaac attcatggcc tgtgctcaga tggagcaagt 960 gtgatgctgg gagccagaaa tggatttgca gcaaggctga agaggctacc tggctgcgga 1020 aaagtcctgg ccttccactg tgtgtgccac aagctggcac tggcatgctg tgacactagt 1080 gctgatctga aagccatcaa gggtgtggaa gaggtgctgc tgagtgtgtg gaaatggcta 1140 agctactcac ccaagaggac agcagcattt gtcaaagccc agcttgccat caaaggtatg 1200 gatggagcca aatccccagc agcccagcag agtgtgtgca ggaagctgaa gagagcttgc 1260 cacacaagat ggcttagctt cgacgcagca gtcaaaactg tgtacgatga cttctgggcg 1320 gtactacaga cactgtctgc attcgagtcc tgtccggtag caactggtct actcaagaag 1380 atgaaatgcc ccaagtttct gggcatcatc atcatcctca agaatgtact gccacacttg 1440 gctgagctgt ctacaaaatt ccaaacaagt accctgaact tcgcccaagt ccagcccgct 1500 attgacgatg ccaaagataa gctacagaac atcacacaat ccaacatcgt ttgcaaagag 1560 ctcaaggctg acttgcaccc acagcacgga cagttcaggc acacaggcat tgtccttaaa 1620 gctgatgtcg aaggccaggt ggcaggcctt gtgcacaagt atacagacgc gctttgcgaa 1680 aacatcgaca gaagatttgc aggcagcctt ccagtgatgt cagccctgtc catcttcgac 1740 ccagaggcac taccaacacc acagtctcct ctcttcagct cctatggctc tgacaaggta 1800 gatgtgctgg ctgcccactt ctttgaagaa tcacaagaac agcagaatga gctaagagca 1860 gaatggggga agttcaagta cgacctgcac acgaacctaa agtcccaact acagtctgac 1920 acactgacag ggagtgccac tcaatggtgt ctagagagaa tcattgccct gaagacatct 1980 tacggacatg tgttcccatt attggtgcac attgccaagg tcgctctggt cctgccactg 2040 tcaaatgcat ggccagagcg aggagcaagc aaggttaagc ttatcaaaaa ccgcctccgg 2100 accagactag gtggtgatat gctcaatgcc ttgctctgca tgtcgatcaa tggaccacca 2160 gtcaacagtc cagagtcacg cagcatgctg acatctgcag tggagacctg gtttgcgcag 2220 aagaagaggc atagatccaa gaagctagcg gaggacagaa agaaccagca gcgagtccag 2280 agagccattg tcaagctgag acagctgtat gctcctcccc atcccgagga agaggctcct 2340 acccatcctg aggaggaggc taggatggac caggatcagg agagtcaggc tgctgatgag 2400 gaggttgaga tcaaccactt gagggtcctg gaaagggaag atgctgcttc agtggccaag 2460 gccatcatgg gggagacacc tccagattct gagagtgact ctgactctga tgactatgaa 2520 accatgtac 2529 // ID Copia-24_AA-LTR repbase; DNA; INV; 153 BP. XX AC supercont1.50; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-24_AA_; KW Copia-24_AA-I; Copia-24_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-153 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.50; Positions 2770549 2770397. XX SQ Sequence 153 BP; 50 A; 33 C; 22 G; 48 T; 0 other; tgagatgtga gtaaattgta gtaacctacg atcggctcat tatagtcata cctatcctag 60 caacgaatgt ttagttgtac aaacaagcaa ataaattttc agtcgttctt cagactacag 120 cctaaccaca cgtattttat taacactctc tca 153 // ID BEL-7_DWil-I repbase; DNA; INV; 5745 BP. XX AC scaffold_180739; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_DWil_; KW BEL-7_DWil-LTR; BEL-7_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5745 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180739; Positions 12871 18615. XX CC Positions [4504-5064] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1273..2997 FT /product="BEL-7_DWil-I_1p" FT /translation="MHTHTAAPTDNHKQIFCAIKGHHKPVSCEKLLRMSVE FT ARWNAINNHHLCAQCLKNHKAKHHHLLHKEVTESVLHNHRGSLTNQPECYV FT RIVPVTLHSNSKSVDIYAFMDDGSTLTLVEEKLADALEVKGFVNPLCIRWT FT GDVSRQEPEFKQLNLRISKQGQGNKLFKVNEVCTVKSIGLSAETMIADRIK FT QRYPHLKNLPIPSFVNAMPQMIIGSNNPNLIASLIVREGKWQQPIAAKTRL FT GWTIYGGLGRSGGGLNLHRCQCDLDLHEMVKKHLLTEEKLPPMFTKFPDEE FT RSLQILEDTVSYSSRTYTTGLLWKDGPRVFPNSLPVVKRRLLCMQKKMAAD FT PSFANELHSQIQNLIPKGYARKLAEEEVVEKPENAWYLPIFTVRNPHKPDG FT VRLVWDAAAQSSGVTLNDFLLKAPDLHVPLLQIMYNYRMRAIGVRGDIAEM FT FHRIAVREEDAKAQRFLWQNQTTGVIETFQLNVLTFGATCSPCIAHYVRDL FT NAKKHADEFPSAAKAIQRRHYVDDYIDSCHSEEEAITLARGGFAIQQPTKK FT NSSLIVKRHTLMSKYWACGGAHVKMPMH" FT CDS 4477..5520 FT /product="BEL-7_DWil-I_2p" FT /translation="MAPLPRARLGAFKPAFTYVGVDYFGPLTVIDGRKSLK FT RWGMLITCLTMRAISIEIVHSLTTNSCLMGLRRHISRRGIPMEIYSDNGTN FT FRGTDAFLKDKLQLEDAKMHRELSQQEISWYFNPPSAPHMGGAWERLVRSV FT KTVLYRITPNQKFSDESLLTAMAEVEMTVNSRPLTYVSLDDEYQEALTPNH FT LLLGSSNGEKPICDPEQIDYKWSLRQSEMFANIFWKRWVKEMLPNITRRSK FT WHKKVKPIGVGDIVLIVDENQRRNTWTKGRVVEVTTAEDGQVRQAKVQTQL FT GVLSRPAVKLAVLDFGKDEASSSGSLAGRGMLPKFGPANKLCTTVLQNGTL FT SEQTE" XX SQ Sequence 5745 BP; 1839 A; 1295 C; 1304 G; 1307 T; 0 other; attcttttta atttctatcg aaattctttt taatttctac tggaaattaa atttactctt 60 tcgtaaagaa ttttgagcaa attctttaat tcctacggtg tagtagagct acagagacct 120 aaagaggagg gactacacaa atcccaagga aagaacaaat ttaaatttta ccacgcgtaa 180 aattacgtat agacatggca ccgactggaa actgtggtat gtgccaggag ccggccaacc 240 aggaaacgct cacatgctcc aggtgtgcat ctcaattcca ctgcacctgc tgcggatcga 300 aaaacagtgc cacctcgagt tcctggatgt gcgagacatg tcgccacgtg cctgatgaag 360 gcgaccgtaa cctcgaaact gaatcacatt tcaaggttaa ccaaccagga gacaaagatc 420 agcacgtgga agatgaagcg atgccggaag atcagcaaat actattgcaa caattagagg 480 acgaagtgtg cctcgaaagg gaatatctgc gccgtaagta cgccttactc cacacgagag 540 tacagtcaaa cggagaagcg aagaaccatc cacaacatgc tgctggaata cctggttaca 600 taatgatgac ggatgatagt gccaccaaca ctccacaggc gacgagtacc agagcgcaga 660 gcatgccgtg caaagccaat ggcataacac cacaacaagt agctgctcga caatcaatgc 720 cgagagaatt accagttttc aaaggagatc caagagaatg gccacttttc ataagcacct 780 atgaaactag cacccggatc ggtggataca gcaacgagga gaacgctatg cgtctgcaga 840 gatgtctgaa aggtagggcc cttgaagccg tgcgagacag cttactgttc ccggaaatgc 900 tgccgagcgt tataagtact ttgagaatgt actttggacg cccggaacac ataataaaac 960 tgttgatttg cctcctccaa agggcaagct agaaccgctg atcgagtttg catttgcggt 1020 gaaaaatatg tgcgctacca ttagggcaag caagctagat gcgcacctga aaaaccctac 1080 gttattacaa gagcttattg agaagatggg acctgatatg atgcttaact gggccctaca 1140 tttaaaatcc ataccgtacc cgaacataca gcacctagcg gattggttgt ttgagttggc 1200 tgaagcagcg agtcgtgtat cgaccccaac atattaccag aaagcgacaa gaagagaacc 1260 aacagaggag tgatgcatac acataccgca gcgcccacag ataaccacaa acaaatattc 1320 tgtgcgataa aaggacacca caaaccagtg tcttgcgaaa agttgttgcg gatgtcagtc 1380 gaagccaggt ggaacgcaat aaataaccat cacctatgtg ctcagtgcct caaaaaccac 1440 aaagcaaaac atcatcatct tcttcacaag gaggtgacgg aaagcgtcct gcacaatcat 1500 cgtgggtcac taaccaacca gccagaatgc tatgttcgaa tcgttccagt cactttgcac 1560 tccaattcga aatccgtcga tatatacgca tttatggacg acggatcgac tctaacactt 1620 gtagaagaaa aattggctga tgcgcttgaa gtgaaaggtt tcgtcaatcc cctatgcatt 1680 agatggaccg gcgatgttag tcggcaggag ccggaattca agcagctgaa cttgagaatc 1740 tcaaaacaag gacagggtaa taaattattt aaagtaaatg aagtctgcac agttaagtcc 1800 attggtctct cagccgagac gatgatagca gacaggataa aacaaagata tccgcactta 1860 aaaaacttgc ctatcccgag ctttgttaac gcaatgcctc aaatgataat aggcagtaat 1920 aatccgaatc tcatagcttc gctaatagtg cgcgaaggaa aatggcagca accaattgct 1980 gcaaaaacga gactaggatg gactatatat ggcggattgg gacgatctgg tggaggactc 2040 aacctgcatc ggtgtcaatg cgatctggat ttgcatgaga tggttaaaaa acatttgcta 2100 acggaggaaa aattaccgcc gatgtttacc aagtttcctg atgaagagcg atctctacag 2160 attctggagg acactgttag ttacagctcg cggacgtaca ccacgggatt gctttggaaa 2220 gatggtcccc gtgtatttcc aaatagcctg ccggtagtaa aacgacgact cttatgcatg 2280 caaaagaaaa tggcagcaga cccttctttt gcaaatgagc tacatagtca aatacaaaac 2340 ttgataccca aaggatatgc cagaaagttg gcagaggaag aagtcgtaga gaaacccgaa 2400 aatgcatggt acttgcccat ttttactgta cgaaatccac ataaacctga cggggtcaga 2460 ttggtgtggg acgcggcagc tcaatcatca ggagtgaccc taaatgattt tcttctgaag 2520 gcacccgatt tacacgtgcc cttgctgcag attatgtata actatagaat gagagcaatc 2580 ggcgtgagag gtgacatcgc cgaaatgttc catcgaatag cagtaagaga agaagacgct 2640 aaagcacaac gattcctatg gcaaaatcag acaacaggcg tcatagaaac tttccaatta 2700 aatgtgttga catttggagc cacgtgctca ccctgtatag cgcactatgt gcgcgaccta 2760 aacgctaaaa aacacgcaga tgagtttcct tcggcagcta aagctattca gagacgtcac 2820 tacgtagacg actacatcga ttcatgccat tcagaagaag aagccattac attagcacgc 2880 ggaggtttcg cgattcaaca acctacgaag aaaaacagtt ctttaatagt gaagagacac 2940 actcttatga gcaaatattg ggcatgcggt ggagcccacg tcaagatgcc tatgcactaa 3000 agtttgtacg cctgaagcgc aacataatag cgacaaaagt cgcggaggtt tcgcgattca 3060 acaacctaca aagaaaaaca gttctttaat agtgaagaga cacactctta tgagaaaata 3120 ttgggcatgc ggtggagccc acgtcaagat gcctatgcac taaagtttgt acgcctgaag 3180 cgcaacataa tagcgacaaa agtcgcacca accaaaaggg aaattttaca agtcttgatg 3240 tcagtttttg accccatggg ttttgtatcc tgtataatga tgtatctcaa aatactgctc 3300 caacaagttt ggcgatcgaa gattgattgg gatgaggaaa taccacccaa gctgcaggat 3360 atgtggcaat catggctgtc cttcttaccc ttgatagaga acgtgcgaat tccgatatgc 3420 tattttaaaa attggcatgc acacgactgc cgagtttaaa ttcacgtatt cgtggatgcg 3480 ggagaagacg cctactctgc cgttgcgtat ttcggtatcg aacaaaataa ccaaatactt 3540 ttgagtctcg tttcagccaa atccaaggta gctccgttgc agccgctttc gataccccgc 3600 ttggagctac aggcagcagt aactggagcg cgactgatga aaaatataat agcacaacat 3660 gaaatcgact ttgaggaatg ttacttatgg accgactcga aaacggtact agcgtggctc 3720 gatggagacc cacgacgata ttaacaatat gtaatgttta ggatagccga gatatgtgag 3780 ctcaccgagg ccagaacctg gagatgggta ccaagcaaac taaatgttgc ggacatcgcc 3840 actaaacgac aacataggcc tgaaacatac acacaatggt tcgaaggacc caaattttta 3900 aaactgccgc cggcagattg gccaacggaa actaccaatg aaagtcccta atgaactgcg 3960 tcctcgtttc cttcacatgc gaagggaacc agcaagaatc aattacgagt acttctcaaa 4020 atggaatcga ttgacaaggg ccatagcatg ttggcttgcc tataagggga aattgatcaa 4080 caaaatacgt aaatccaatg agaattcaaa ccaactggca tcccggaaca tacaacaaga 4140 atgttacaac gaagccatcg aaaacggcca ccacgcttgc cggagcagtc cactatggag 4200 gttacaactc ttcctaaatg gccagggagt attgtgtgta cgcgatcgag ccagtcgctt 4260 cgctggcatc cctgaaggac gaatcgctct acccccccaa cactgggtta caaagctgat 4320 agtccggcac taccatgaaa aatactatca catgaatcat gagaaaatga ggtgaggtaa 4380 aaatatttta ttccaaaact aagacctctc cttaaaggga ttcgtcgaga ttgccaacaa 4440 tgcaagctga gtagtgctcg acccattgca cccttaatgg cgcccctacc gcgggctaga 4500 ctgggtgcat tcaagccagc gtttacgtat gtaggagttg attattttgg gccgctaaca 4560 gtcatcgatg gaagaaagtc actgaaaaga tggggcatgc tgataacctg cctcacaatg 4620 cgagcaataa gcattgagat cgtgcatagt ctaaccacaa actcatgcct aatgggctta 4680 agacgccata tatcgcgccg aggcatacca atggaaatat atagcgataa cgggactaat 4740 tttcgaggga ccgatgcctt tcttaaagac aaacttcaac tggaagatgc aaaaatgcac 4800 cgagaattgt cgcagcagga aatatcgtgg tacttcaacc caccgtcagc tcctcacatg 4860 ggaggagcct gggagcgctt agtgcgttcc gttaaaaccg tgttataccg tataacacca 4920 aatcaaaaat tctcagatga aagtcttctt accgcaatgg cagaggtgga aatgacggtc 4980 aactccaggc ccctgacata cgtgtccctg gacgacgaat accaagaagc tcttacgcca 5040 aatcacctgt tgcttggctc atcaaatgga gaaaaaccga tttgcgatcc ggaacaaatc 5100 gactataaat ggtcgctccg ccagagtgag atgttcgcca acatcttttg gaaacgctgg 5160 gtcaaagaga tgctaccaaa cataacacga aggtccaagt ggcataaaaa agtaaagcca 5220 attggtgtag gcgatattgt acttatagtg gacgaaaatc agcgtcgtaa tacctggaca 5280 aagggtcgag tggtcgaagt gaccaccgca gaagatggtc aagtccggca ggcgaaggta 5340 cagacacaac taggtgtcct ctccagacca gcagtaaagt tagcagtcct cgatttcgga 5400 aaagatgaag caagctccag tggctcgctt gctgggaggg gaatgttgcc gaaattcggg 5460 cctgcgaata aactttgtac cactgtacta caaaacggca ctttgtccga acaaacagaa 5520 tgacgctctt ttaaatctct taaactctct ctctcatctt actcttgcgg tactttatta 5580 tacataagta gacacataag cgcgaaaaat acgcagctgc attgatctaa gtacatacat 5640 aagaaccgcg tgttcaattg ttttttcgta gccatgaggc caagactaga gtaagtgaca 5700 tgagattgta caattattaa ttataagaaa atttgtaaaa taacc 5745 // ID Gypsy-2_SI-LTR repbase; DNA; INV; 181 BP. XX AC AEAQ01005246; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_SI_; KW Gypsy-2_SI-I; Gypsy-2_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-181 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01005246; Positions 233 53. XX SQ Sequence 181 BP; 56 A; 26 C; 38 G; 61 T; 0 other; tgatgcatat ggcactacgt tagaaatcag ccttaatggc tcattcgttt gtaagtcaag 60 gttggcggcc ttgcgaggtc tttagagatt tggacggttg taatataata gctattaaaa 120 tattataata aaggctacgt taatcagagt ctataactgt gtgttatttc aatacataac 180 a 181 // ID Copia-21_CQ-I repbase; DNA; INV; 1888 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_CQ_; KW Copia-21_CQ-LTR; Copia-21_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1888 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 357-357 (2011). XX DR [2] (Consensus) XX CC 'GTTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 172..1806 FT /product="Copia-21_CQ-I_1p" FT /translation="MEYEFVSHAFDGTNFSCWSFRIEADLKAQKLHHCIER FT TLEEESFFTVVAEEGVAERVRKEALQKSRKEEDEKCKLFLLKTIAEPQREA FT VRGMSSPKLMWEALKEVSDRKAARAGSNGTFPFKCRVCGKPGHTKVDCPWK FT KKVKQAAGSVQQLKKGKVHAAESSAGAKEDVPFVPEAEEANNEEADGKFRW FT VLDSDVSEHMVGDWNLLVNVSRMETPKVVNVAELESQEEVPEDKVCTDDQL FT DSEESDADDTVQEEEDVPEENAVPELEEAAGERRGRVKKHPAKPDHELKLK FT WLARLVDEIPDTNEVLQKRDDWPLWKRAIDDELRSLEKNRTWDLVEAPAGR FT RVLCCKWVLRIKKEVDGIAARYKARLVVTGCSQRAGYDYKEKNARIVAREL FT PAEAVEQLHQTEVRKLPGFERGIKVRKMNKSLVGAKALLKSRKKRYRCRML FT FKKISKLEGAQTERDRRSAARSCGRRPGKRRRRSPLNIRFFLRTDRNRHQL FT SGGKPRGEQVHYEEDLHDRKRMNLGASATTTSVIVGQEGLTELPVPPDK" XX SQ Sequence 1888 BP; 505 A; 413 C; 641 G; 329 T; 0 other; ggttgtgtgg cccagcacat cgcgtggcgc gttgccaagg aaaccgcgaa cggaaggaaa 60 aagtttttcc acgtgcgcgg acacccggga aggtgacgta gtttttttcg cgatcgaaaa 120 gtgagttaag tttttcggga gtaagacgcg aaaagaacgt tttctagtgc gatggagtac 180 gaatttgtgt cgcacgcgtt cgacgggacg aatttctcgt gctggagttt ccggatcgaa 240 gcggatttga aggcgcagaa actccaccac tgtatcgagc ggacgctgga ggaggaatcg 300 tttttcacgg tggttgcgga ggaaggcgtg gcggagcggg tgcggaagga agcactgcag 360 aagtctcgga aggaagagga cgaaaagtgc aagttgttcc tgctcaaaac gatcgccgag 420 ccgcaacggg aggccgtccg cggaatgtca tcgccgaagc tgatgtggga agccctgaag 480 gaagtgtcgg accggaaagc agcgcgtgcc ggaagtaacg gaacgtttcc gttcaagtgt 540 cgcgtctgcg ggaaaccggg gcacacgaag gtggactgcc cgtggaagaa gaaagtgaaa 600 caggctgccg gaagtgtgca gcagctgaag aaagggaagg tgcatgcggc ggaaagcagt 660 gccggtgcca aggaagatgt gccgttcgtg ccagaagcgg aagaagccaa caatgaggaa 720 gcagacggaa agttccggtg ggtgctggac agtgatgttt ccgagcacat ggttggtgat 780 tggaatctgc tggtgaatgt gagtcggatg gaaaccccga aggtggtcaa cgtggctgag 840 ttggagagtc aagaagaagt gccggaagat aaagtgtgca cggacgatca actcgacagt 900 gaagaatcgg atgcagatga cacagtgcaa gaagaagaag atgtgccgga agagaacgca 960 gtaccagaac tggaggaagc tgcgggagaa agaagaggtc gcgtgaagaa gcatccggca 1020 aagcccgacc acgagttgaa gctgaagtgg ctggcaaggc tcgtggatga gattcccgac 1080 acaaatgaag tgctgcagaa acgcgacgac tggccgctgt ggaagcgcgc tatcgacgac 1140 gagttgcggt cgcttgagaa gaaccgcacg tgggatctcg tcgaggcacc tgctggtcgc 1200 cgagttttgt gctgcaagtg ggtgttaagg atcaagaagg aagtagacgg tattgctgcc 1260 aggtacaaag ctcgtctcgt cgtgacgggg tgctcgcaac gtgcggggta cgactacaaa 1320 gaaaagaacg cccgaatcgt cgctcgtgag ctgccggcag aagcggtgga acagctgcac 1380 cagacagaag tacggaagct tccaggattt gagaggggga ttaaggtgcg caagatgaac 1440 aaatcactgg ttggtgcgaa ggcgctgctg aagtcccgga aaaagcggta caggtgtcga 1500 atgctgttca agaagatctc gaagcttgaa ggagcgcaaa cggaacgaga ccgtcgatcc 1560 gctgctagga gttgcggacg gagaccgggg aagcgacgtc gtcgatcgcc gctcaacatt 1620 cggtttttcc tgcgaaccga tcggaaccgc catcaactct ccggtggcaa gccaagagga 1680 gagcaagtac actacgaaga agatctacac gaccggaagc ggatgaacct cggagcatcc 1740 gcaacaacaa catccgtgat agttggtcag gaaggactca ccgagttgcc agttcctcct 1800 gacaagtgac cagctcgccg atgtgttcac caagggccca agctgtttcg accgagaagc 1860 accgaagcac tttaggtttg agaggggg 1888 // ID CR1_Ele41 repbase; DNA; INV; 3426 BP. XX AC . XX DT 26-OCT-2010 (Rel. 15.1, Created) DT 26-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele41. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3426 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3426 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (26-OCT-2010). XX DR [1] (Consensus) XX CC [2] Consensus update. This consensus is generated from 7 CC sequences with >97% identity, and ~98% identical to the CC originalsequence in [1]. XX FH Key Location/Qualifiers FT CDS 104..3277 FT /product="CR1_Ele41_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MTPKCNPVMSSNGQQPGRSVESLMEAPGPSDPVERNA FT VTVLHSRPGPAVGCGTGVFQSATSGEYSSIIDNAPSPSIVLNSRTCNARKM FT PISVNAKKSDDLLIYYQNAGGMNCDVDKYLLATSDDCYDIIVLTETWLDTR FT TISTQVFTPEYEVFRCDRNPRNSRKSTGGGVLIAVKKKLKANIIENDLWCS FT TEQLWVHVQFFGHSLFLCGIYIPPDRARDETLIETHLQSVTSVIEIAKPMD FT EIVVLGDFNLPGISWISSGNGFLHPDMNHSHVHSAASRLLDGYSTATLRQI FT NHVTNENLRSLDLCFVSAQDVAPPISNAPSSLVKQVNHHPPLVLKLIVQHY FT STSESIAQVTYDFTKADHEGILEFLAAVEWTEVLDTRDVDNAVSTLAHILG FT HVIERHVPKKTLASKGLPWQTRELRTMKTAKRTALRRYTKNPTPRMRDQYV FT RLNAAYKRTSRHCFNRYQQQVQRDLTAKPKSFWRYVNEQRKVSGLPPSLFL FT NGDVASEPQSVCQLFAQKFSSTFCDEVIPEVLVTQAASNVPLVNDSWATLT FT VDDDMISRAASQLKSTMKPGPDGIPSAFLKRHIDSLLVPIRHVFNLSLISG FT AFPSLWKNASVFPVHKKGERKDVNNYRGISSLCAISKLFELVVMDPLMTHC FT KNHLSDDQHGFISGRSTATNLIRLTSHIADSFAEGAQTDAIYTDLTAAFDK FT INHAIATAKLERFGISGNMLRWLHSYLIGRKLTVTIEGFQSNEFLATSGIP FT QGSHLGPLIFLIYFNDVHYTLKGPRLSFADDLKIYRRIHSDYDAMCLQQDL FT HAFASWCTLNRMTVNPGKCSVISFVRIRRPVVFRYELYNTVIQRVDSVKDL FT GVILDPQLTFKHHVAYVVAKASRTLGFIFRVAKDFTSVYCLKSLYCSLVRS FT TLEYCSVVWNPYYQNGSDRIESIQHRFIRFALRQLPWRDPLRLPSYRSRCQ FT LIDLETLQLRRDIARAMVVADTLQGRIDCPVLLQSLDLNVRPRVLRNNTML FT RLPFRRTNFGQQSAYIGIKRVFNRVASVFDFHLSRQTIRRKFKTVFLSFLD FT E" XX SQ Sequence 3426 BP; 889 A; 856 C; 736 G; 944 T; 1 other; ctcccagcct gcacttccag gcaagtatgc gtgttatacc agacgttctt tscgtgatga 60 attgccaaat ttcagctctc tatccgccgc tgatgtaaca tcaatgactc ctaaatgcaa 120 cccagtcatg tcttcgaatg gtcaacaacc gggccgctct gtagaaagcc tcatggaagc 180 tcccggccct tccgacccag tcgagcgtaa tgcagtcacc gttcttcata gccgtcccgg 240 tcctgcggtc ggttgtggta cgggagtctt ccaatctgca acctcaggcg agtactcttc 300 cattattgac aatgctccct cgccgtccat tgttttaaat tccagaacat gtaacgctcg 360 taagatgccg atatctgtga atgctaagaa gagtgacgat ttgctgattt attatcagaa 420 cgcaggtgga atgaactgcg acgtagacaa atacctctta gcgacatccg acgactgcta 480 cgatattatt gtgctgactg agacctggct cgacacacgc actatttcaa cccaggtttt 540 cacaccggaa tacgaagtct tccgctgtga ccgcaaccca agaaacagtc gtaagtctac 600 cgggggtggc gtcctgatcg cggtgaagaa gaaacttaag gccaatatta tcgaaaatga 660 tctttggtgt agcaccgagc aattatgggt gcacgttcag ttctttggtc acagcttatt 720 tctgtgcggg atttacattc cacccgatcg agcgcgtgat gagacgctga ttgaaactca 780 cctgcagtct gttacgtccg tgatcgaaat cgcaaagccc atggatgaaa tcgttgtgct 840 tggcgatttt aacttacctg gaatatcttg gatatcatct ggcaatgggt tccttcaccc 900 agatatgaac cattcacacg tccactccgc cgcttctcga cttctcgatg gctacagtac 960 agcaactctt cgacagatca accacgtaac caatgagaat ctccgaagcc tcgatctttg 1020 cttcgtcagt gctcaagatg tcgctccgcc catatccaat gcaccctcat cattggtaaa 1080 gcaagtcaac caccatcctc ctctggtgct caagctaatc gtccagcatt acagcacaag 1140 tgaatcaatc gcccaagtga cgtatgactt cacaaaagct gaccatgaag gtattctgga 1200 attcctcgcc gccgttgaat ggacagaagt tctcgacact cgtgacgtgg acaatgctgt 1260 ttcgactctt gctcacattc tcggacacgt gatcgagagg catgttccca aaaaaactct 1320 tgccagtaag ggtttgccat ggcaaaccag ggagcttcgc acgatgaaaa ccgctaagag 1380 aaccgcatta agacgctaca ccaaaaaccc tactccgcgg atgcgcgacc aatatgttag 1440 gttaaatgcc gcctataaga gaacaagtcg gcattgcttc aatcgctatc agcaacaagt 1500 tcaacgagac cttacagcaa aaccgaaatc tttttggagg tacgtcaatg agcaaagaaa 1560 agtgtctggt ctgcctccat ctttgttttt aaatggagat gtcgcttcag agcctcaaag 1620 cgtttgccaa ctcttcgctc aaaagttttc tagcaccttt tgtgatgaag taattcctga 1680 ggtactcgtt actcaagccg ccagcaacgt tccgttggtc aacgattcgt gggctaccct 1740 caccgtcgac gatgacatga tttcccgagc cgcctctcaa ttgaaatcga cgatgaaacc 1800 tggtccggat ggcattcctt ctgctttcct gaagaggcat attgatagct tgctcgttcc 1860 tattcgacat gtgtttaatt tatcccttat tagcggagcc tttccgtcat tatggaaaaa 1920 tgcttccgtg tttcccgtgc ataaaaaggg cgaacgcaaa gacgtcaata attaccgagg 1980 gatctcgtct ttgtgtgcca tctccaaact gttcgagctc gtggtaatgg acccactgat 2040 gacacattgt aagaatcacc taagcgacga tcaacacgga ttcatctcgg gtagatcgac 2100 ggccaccaat ctaatacgcc taacgtcgca cattgctgac agtttcgcag agggagcgca 2160 aacggacgct atctataccg acttgacagc cgctttcgat aagattaacc acgctatcgc 2220 aacggctaag ttggaaaggt tcggcattag cggcaacatg ctacgctggc tccattcgta 2280 cctcattgga cgtaaactga cggttaccat tgaaggtttt caatccaatg aatttctagc 2340 tacatccgga attcctcaag gcagccactt gggaccgtta atatttttga tatacttcaa 2400 cgatgttcac tacactctca aggggcctcg gttatccttc gcagatgatt taaagatcta 2460 tcgacgaatt cattcggact acgacgcaat gtgccttcag caggatcttc atgcatttgc 2520 cagctggtgc accctgaatc gaatgaccgt taatcccgga aaatgttctg tgatttcttt 2580 tgtccgtatt cgtagacctg tagttttccg gtacgaactc tacaacactg tgattcagcg 2640 cgttgatagt gtgaaagatc ttggagtgat tcttgaccca caactcactt ttaagcatca 2700 tgtggcgtac gtagttgcga aggcatcgag gacattgggc ttcatatttc gcgttgccaa 2760 agattttact agcgtatatt gtctgaaatc gctgtactgc tcgctggtcc gatcaacact 2820 tgaatactgc agtgtagttt ggaatcccta ctaccagaac ggctctgaca gaattgaatc 2880 aatacagcac cgctttattc gctttgctct ccgtcaactt ccatggcgtg atccacttcg 2940 gttgccgagt tacagaagtc ggtgccaact aattgatctt gagactcttc agctacgtag 3000 ggatatcgca cgagctatgg ttgtagcaga cacattgcag gggagaatcg actgtcctgt 3060 acttctgcag tcactagatt tgaatgtccg cccaagagta ttacgaaaca atacgatgct 3120 tcgacttcct ttccggagaa caaactttgg acaacagagc gcctacatag gcatcaaacg 3180 agtgtttaac agggtcgctt ctgttttcga ctttcacctc tcccgacaaa cgatccggag 3240 aaagtttaaa accgtgtttc tttcatttct tgacgaataa tgttgtttta tgttgttgta 3300 tttttatcgt tgattgtaat tttgttagta actgcttcat atttgttaaa ctagaataag 3360 atatacataa gttatacatc attgggacct gttttgtctg ttgatgtcat ccaataaata 3420 aataaa 3426 // ID Copia-10_AA-LTR repbase; DNA; INV; 305 BP. XX AC supercont1.241; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_AA_; KW Copia-10_AA-I; Copia-10_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.241; Positions 1460412 1460716. XX SQ Sequence 305 BP; 72 A; 75 C; 66 G; 92 T; 0 other; tgatggaata tgcaatgctg tttcctctcc gtgcagtatt cggtgagtgg taaccaaggt 60 ggcaaccctg aatgctatcc actgcaccgt cgagcgagcg cgccaaaagc gcatcattga 120 gaaggagaag gcaaagcaac agtgcaaagt acagttgatt ttttcattca ttcgttacta 180 ttcaacccga acagaagtgt attaaagttt atctcgtttc attcacacaa gtgtttcatt 240 ccttgtccgg tttccttttc ttcggtgttt cgcttccgtt gtgccgtgtc cacctctgcc 300 caaca 305 // ID DNA2-1_SM repbase; DNA; INV; 597 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-597 RA Jurka J.; RT "DNA transposons from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1833-1833 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 597 BP; 219 A; 83 C; 80 G; 215 T; 0 other; cagtgtttct caaactttct ggttcctcgg aaccctatca gtttttttcg ttttcgcgga 60 accccttcgg aaatttttat agccagatga gtattatcca cttatgttat agtcctccat 120 tttgacatat ttgtcgataa aaaggggtga atttatttta agcaatttaa aaagagaaaa 180 tttcatatta atgaattttc atagtaattt atttgaaaaa actgccacaa tgaaaataaa 240 aataaagttt tacactaaat tgtttaggta ttttgatgca ccctgatata tatatatatg 300 tgtgtgtatt aaaaataaat atatatgtat taaatataaa tatatgtaaa aaaggccatg 360 aaaagtttac aaatattttt tctacatttt taaaatgaaa ttttaaaaaa atccaccgaa 420 ttgttttaat gtttaaaaat gtattgaaat tctaactgaa actctcagaa aaaaaatcta 480 tggtttctcc aatatttttt acattaaaat tcaaattcga atggaatttc tcgcggaacc 540 ccaaaaattt gttcacggaa ccctagagtt ccgcggaaca tagtttgaga aacactg 597 // ID Gypsy-129_AA-LTR repbase; DNA; INV; 242 BP. XX AC AAGE02025245; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-129_AA_; KW Gypsy-129_AA-I; Gypsy-129_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-242 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025245; Positions 66174 65933. XX SQ Sequence 242 BP; 73 A; 48 C; 37 G; 84 T; 0 other; tgtatgcacc ttggtttgtc ttgaatttga aaatgtttat ttttcccagt gcataatgca 60 acttcttgtt taaaccttgt actcagttga aaaatgagac gaataaaacc gcgcatcgcg 120 aactagccat cctctattaa attcgcgttt tgtatcaatt tagttaaaca ataaagttat 180 attttgcggt aatcatttcg tcttaagtcc tttaattcgg gaaaccaccc acatacacga 240 ca 242 // ID CR1-7_BF repbase; DNA; INV; 4089 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-7_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-7_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4089 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4089 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1578-1578 (2009). XX DR [2] (Consensus) XX SQ Sequence 4089 BP; 1204 A; 1037 C; 849 G; 996 T; 3 other; gaagaagagt tcactgcgca ggcgcgagga gtagacgtgt aaatccgtag ctctcgcctt 60 ccccaaaatc ttcacccttt tctacccttt tctaccctag ttttcctcct cccttggtcc 120 taccttcccc tctgcctacc ctattcgacc ctgaagaagc ttttgccccg accgaaacaa 180 aattttgggt agtggatagc attttaatcg ctagcgatac aagctcagtt cacccgtgtt 240 ggtctcgacg agtaagatgg cggcacctca catgtcgtca ccgtgggatc tttctgcctg 300 actcggaaag gacacactgg gaacttgagt tccaaacccc gtagctgcac cgaactgtaa 360 gtacagaact tatttccccg gactccggaa gtctccaaac ccaagtcgga tctccagaag 420 catatttcca aacttcctac agacaagatg gcggagccgt gtaccaacat gtgactctcc 480 cgaccgtcaa gcgactacct cttgtgcact tacagtagct ttgtgccaca aactgtgaag 540 ttcaccgaat acaatttaca aaagttgacg tataaaatac tagagaaaac taaaacttca 600 atttaaacgg aagaaaattt ttctttgtga atctatgcat gtccaaacat gcccgtggag 660 ctgcacccga ctcgtgaatg aagtacttag gtctaactca caaatgaata cccgactttt 720 taggaacctc ccattccact cattttgctt attgaccctg tactatattg ctgtatttat 780 atctgtattt atatctgtga acatgtcaaa ttcaaaattg ctaaattcaa ccaagtaccc 840 atgtggcaca tgtgaccgca ccgtctcctg gagtgatcta ggagtagagt gcgaaacatg 900 tggccgctgg tatcacttat catgtcaagg gatacagtct agctcatatg accgtctagg 960 acagagtgag gtggtgtgga attgtgatat ttgtgtgccc acatgcacaa gaaactccac 1020 ccctagtcca acaaattcca ctttgaacag tgaaagtgac ttcagatcct ctcattcatc 1080 cactcccaca aattccacat tgaacagtga aagtgacttc agaccctttc attcatccac 1140 tcccacccga gtaaatcaac aaaacaagcg gtccaaccgc cctctcaggt ttttaaacat 1200 caattttcaa tctattgttg gaaaaagggc agagactatg aacttgatag acagtacaaa 1260 accggatgtt atcctaggga ccgagtccca tttagatgat accatattta gtagtgaaat 1320 tttaccacca ggcttcaaca tattcagacg ggatagaaac agaaacggcg gtggcgtttt 1380 catcgctgtt gcagacaatc tgccatgcca tgaagtcacc gagctgtctg ttcctgaatg 1440 cgaaataata tgggtaaaga tcagagtaaa aggtagaaaa catattcttg ctgcctcgta 1500 ttatagacca catacaagag atagcctaga ccaatttgaa acatcagtcc aacgtgcagt 1560 aacaactacc agtaatgcaa ctctcatctt gggaggtgat ttcaatttcc ccgcgtttga 1620 ctggaacaca aagcaactga aagtaacttc caattatccc ggtcttcatt accaattcct 1680 ggattttatc caagatatgg gtctagacca gatggtcacc ctcccaacca gaggtgacaa 1740 cacgttggac ttgtttctta caaactaccc atctcttgta ccacgaatag aatgcctccc 1800 tggcatttct gaccatgatg ttgtctttat ggaatttgac atcaatcctg tctataagcg 1860 acagaatagg cggaaagttc ctcagtactc tagagcagac tggcccgctc tccacgcggc 1920 tgcagccgac ctttctgact ccattgtaaa agactttggc aaagacaaaa atactgagca 1980 tatttggtgc gcttttaagg aaggactctc atccatgtcc caacagcaca tcccccaaaa 2040 aaccctaggc ggtaaaaaca acaaaccgtg ggtagaccat actacttctc gcctaatccg 2100 tcgtagggac agactataca aacgatggaa aaagactggt aacactaagg tacgtgaaca 2160 aatgaaagct ctcaaacata caatccagcg tcggctacgc cgagcctact ggtcatacac 2220 tgaaagtttg tttactaacc aagatgaccc tcatgcagtc cccggggcta acatgaagag 2280 gttttggact tatatcaagt cgcagcggac cgaggctgca agtgttgccc ccttgaaagt 2340 agagggcaaa cttgttacag aagccgtgga ccgtgcagaa gcacttaaca aacagtttca 2400 aagtgccttc agccaaaagg tcactttcac tgtatcagaa tttcagaata gaaccaacct 2460 ctttcagcaa ccaaatgcac caacatgtag ctcgataaac atctctgagc ccggagtgcg 2520 aaaactgatg caaaacctgg accccaaaaa ggctcctggt cccgayggca tcagycctag 2580 gttgctgaaa gaattggcag tcgagctagc accagcactc actctccttt ttcagtcttc 2640 actcgagtcg ggtgtcgtac cacgagattg gagaacagca aatgttaccc cagtctataa 2700 gaaaggggaa cgctatcgtc ctgagaatta caggccaatt tcccttacca gtattccatg 2760 taagataatg gagcatattg ttactagtac gattatgtct tatgtagaag aaaatgaaat 2820 catatgtaaa gagcagcatg ggttcagacg ccgccattcc tgtgaaagcc agttactcgg 2880 acttgtggat gacttgtctg ttgaccttga acaggggaga caaacagatg ccctgataat 2940 ggattttagc aaggcatttg acaaagtttg ccactcccta ctcatccata agttaaacta 3000 ctacggcatc acagggtcgc tgcagacctg gatccagagc ttcctcaccg accgccggca 3060 agcggtagta gtggaaggct caatttcaag ttttgtacca gtagattcag gagtgccgca 3120 gggatctgtt ctaggcccca ccctgtttct gttatatata aacgacctcc ctaccagttt 3180 gtcgtctatt gctcgtctgt ttgcagacga caccttggca cacaaaacca tcagctcgac 3240 tagggatcag gaagtacttc aagaagactt ggatcgtctt gctgagtggg agcaaacctg 3300 gatgatgagt ttccatccag acaagtgcca agcactccat atcacaagga agagggtccc 3360 ccaaacaatg aactacaaac ttcatggtca cactcttgtc tcaaccaagg aagccaagta 3420 tctaggagtg acaattacct acaatctcac ttggtgtacg cacatcgcaa acatcaccaa 3480 caaggccagc agaaccctag gttttctgag aagaaacata aaggtaggat ccaaaaagat 3540 aaaaaacctt gcttatgtta cattagtccg accattacta gaatttgcta gtcccgtttg 3600 ggacccctat acagcaaagg atatagcgaa gctcgaagcg gtccagcgta gggctgcccg 3660 gtgggtgtca aacaggtatc gacggtcctc cagtgtcagc gaaatgcttc acgaccttga 3720 gtggccaaca ctccaggaga gacgacggag ggccagattg gtcacttttt ataaatacca 3780 ctcaggggat ctggctatta actgcaaaac gagtcccgct ccacaagcta gaagttcgag 3840 atcgacccat cccgcagcat acaaggtacc gactagtaga acccaatata ggcaaaatgc 3900 cttcttccca cgtacaayta gagactggaa cggcctccca gcggaggtgg cgctatcgcc 3960 atccttgtct atgtttaaat ccaaaatata atcccccctc agtctgcgca gtgaacctct 4020 ccatgttgcc acggctgcag aatcctcaaa aagatgaggt tggcagccta accgaaagaa 4080 gaagaagaa 4089 // ID Harbinger-2_BF repbase; DNA; INV; 5681 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-2_BF autonomous DNA transposon - consensus. XX KW Harbinger; DNA transposon; Transposable Element; Harbinger-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5681 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5681 RA Kapitonov V. and Jurka J.; RT "Harbinger-2_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 798-798 (2008). XX DR [2] (Consensus) XX CC This is a young family of autonomous Harbingers characterized by CC 34-bp TIRs and TNA TSDs. The consensus codes for two proteins: CC transposase (3 exons) and DNA/protein-binding SANT/Myb protein (5 CC exons). XX FH Key Location/Qualifiers FT CDS join(178..932,1097..1202,1361..1777) FT /product="Harbinger-2_BF_1p" FT /note="transposase." FT /translation="MAGEPPAHLLLAVFQMFFFTYLVRVMSNMRQGGRQAR FT RRRLRHREMMLGRRPARIHPAYVPAALALARGGDPINRTIWSYVRLPATFW FT QKFVQGTLCDAEFYKRFRMTRDTFQMICDKLNPVLEKSDTRMRLAIPTKKR FT LAICIYWLASGDLMRSVADLFGVSEGSVSLIVHKVCDAINEVLFRDYLSFP FT TGQKLKETIQGYKERWGFPQCAGAIDGSHIPIKAPSKNPNDFYNRKGFYSV FT ILQGVVDHMSRFTMIDIGMPGSVHDARVLRESNVFDRGESGTLLPQEFAQD FT INGVTVPAVLLGDAAYPTLPWLMKPYPDNGALTREKFDFNYRQSRARMTVE FT CAFGRLKGRWRCLTKRLDVLLDNVPDIVGACCVLHNICEVHKDEDFEIQFP FT VEQGERRPAARRQGPPSAARDALARLFSQEGN" FT CDS join(5404..5047,4642..4490,4071..3991,3754..3237) FT /product="Harbinger-2_BF_2p" FT /note="SANT/Myb." FT /translation="MAGRKEGARRCIWSTPETRCLISIWGQAEIQKKLEGP FT RTKMVYAKIHQLMKNAGYEDRTAEQIKRKIADLKAAFNNAVKKNKKSGEGR FT KLCHFYNELNVFLGGREALNPQSRLESIADPESSDSEGGEDGDGGEDGDAQ FT DQALDSPRGPSPPPEDVTEDSINVSVDSSSSEKRKADEEGGGVEKKGKTAS FT QQKKGKPGFVKKAPQNKTRIENSFSALEKSFSSLQDEQEKRQREAEKQKME FT EWKEHDLQVVRMQQEHETRTVNNMMGQFMTMMGQFMTVMQQGSTQFPPTYG FT GQSRSPSPMPYNPSPYPPIAGGQSWSSSMPRSAVPSGSQYASMHGGQSQGP FT SMSNPALSMPHTGSEDPSTSEETYFQL" XX SQ Sequence 5681 BP; 1584 A; 1258 C; 1196 G; 1643 T; 0 other; ggcccacttt acattacggt tttcgcgtta gcggtgaaat ttcccgctga cgggacgccg 60 ttaacgggac ctctttacac accgcccgga aatctagcag gacacccgcg gtgcttaggg 120 ttgcggtagt gaccttacct gacccactgt gacctttgac agaaggcgcg aatcaaaatg 180 gcgggagagc cgccagcaca tttgttgctc gccgtgtttc agatgttttt cttcacgtat 240 ctcgttcgcg tcatgtccaa catgaggcag ggaggcaggc aagcacgcag gagacgcttg 300 agacaccggg aaatgatgtt gggacgtcgc ccagctagaa ttcaccctgc ttacgtcccc 360 gctgcccttg ctttggcacg aggtggcgat cccatcaacc ggacgatttg gagttacgta 420 aggttaccgg cgaccttttg gcagaagttt gtccagggca ctctgtgtga tgcagaattc 480 tacaagcgat tcaggatgac cagagacact tttcaaatga tatgtgacaa gctgaacccg 540 gtactggaaa agagtgacac tagaatgcgg ttggcaattc caacaaaaaa gcgactggcg 600 atatgcatct attggctggc ctctggcgat ctgatgagga gtgtggcgga ccttttcggc 660 gtcagtgaag gctcggtttc cctcattgtc cataaagtgt gtgacgccat caacgaagta 720 ttattccggg actatctctc cttccctacc ggacagaagc tgaaagaaac tatccagggc 780 tacaaggaga ggtgggggtt tccccaatgt gcgggagcaa tagacggctc gcatatcccg 840 ataaaagccc catcaaagaa cccgaatgac ttttataaca gaaagggatt ctactctgtg 900 atcctacaag gggtggttga ccacatgtcc aggtatgttt cccacagcta gttattattt 960 accccatctt actgcttttg gaaacattta tttgggcgat gaagacaatg tcaagtttgc 1020 catcatttat ttattccacg tgattagatc atgccaggga ttaacgatat tgtaaaacat 1080 tcatttctgt tcaaaggttc acaatgatcg acatcggaat gcccggcagc gttcatgacg 1140 cacgtgttct cagagaatcg aatgtcttcg accggggtga gagcgggacc ttgcttcctc 1200 aggtacggat cgatttctgt catcggtgga tatttaaaca tggaatccgt gcaatgcgtc 1260 atattagaaa taaaaatcat ataacagaaa tacattattg ttcatatgtt ttgtaagtac 1320 aacaaggctt atgtattttt ttccttcacg tcatttgtag gagttcgcac aagacatcaa 1380 cggcgtaacc gtgccggcag tgctcctggg agacgcagct taccccacac tgccgtggct 1440 tatgaagccg tacccagata atggcgctct gactcgggag aagtttgact tcaattacag 1500 gcaaagcagg gcgaggatga cggtggagtg tgccttcggt cggctgaaag gaagatggcg 1560 gtgcctaacc aaacgactgg acgtcctcct agacaacgta cctgacatcg tgggggcttg 1620 ttgtgtgcta cacaacatct gtgaagtgca caaagacgag gacttcgaga tccagttccc 1680 cgtcgaacaa ggcgaacggc gacctgctgc ccggcgacag ggccctccca gtgcagccag 1740 ggatgccttg gccaggcttt tcagtcaaga gggcaactag tgtgtgtgtt gtctgccagt 1800 ctactttcaa atctacactc aaatcacttc aaattacttt gtaaacatat ggcaacaata 1860 tgtacaagaa caaaatttca tttcatctag tttttcatcc cttctctacc ctcaagacaa 1920 atatagttct gtaaacttta aggacatgtt tttatttgca acaatgttct agtaaccaag 1980 aatatgtgta gtagaagtag tagttctata attagaatac atacaacttt ttactgcagt 2040 tccttgagtg atgaacacat aaatgaaatg tactttcatg ttttgttcaa cagtaaaagc 2100 tttttttaac tttgtgcaat tatgatgttt atgattatag cttgtttggt attaagagac 2160 atcatcacac agaagaatgt acatgtaaca aaagtttcag ttaaaaattt ccaacaaggc 2220 atcctgaaca actacctcat aatactgtat aacataagct gaaaactctt tacttgattt 2280 cttcttatac ctgagatatg ataatggttc ttaacttttt ttgcacttat gttgttgttt 2340 gacaccatgt attgtgtata gataaatact cttaagttaa atagcaaaaa tatattttcg 2400 gctttgtaca gtacttgtaa cttttgatat caagtatata ctatggtttg gtattgagta 2460 tcaacatcac aaagaagaag caacaggagt aacagagttt cagttgaaca ttcccaacag 2520 agaatcctaa acaactacct cataatataa cataagctga aaaccctgta cttgatttct 2580 tcttatacct gagatatgat aatggttctt aacttttttt gcacttatgt tgttgtttga 2640 caccatgtat tgtgtacata gataaatact cttaagttaa atagcaaaaa tatattttcg 2700 gctttgtaca gtacttgtaa cttttgatat caagtatata gatatggttt ggtattgagt 2760 atcaacatca caaagaagaa gcaacaggag taacagagtt tcagttgaac attcccaaca 2820 gagaatccta aacaactacc tcataatata acataagctg aaaaccctgt acttgatttc 2880 ttcttataac tgagatatgc taattatagt tcttaccctt ttttgcactt gttgttttga 2940 caacatgtct tgtgtacgta tacatagata aaatactctt atgttatctc aagcagcaat 3000 aatatgttct cgtctttgta caatacttgt aacttttgat atcaagtata tagatatggt 3060 ttggtattga gtagcaacat cacaaagaag aagcaacagg agtaacagag tttcagttta 3120 aaacagtccc aacagggaat cctaaacaac tacctcataa tataacatca gctgaaaact 3180 caacagggaa tccactatct actatagatg tgaattgaaa aaaaggcaca cctttaaagc 3240 tggaaatatg tttcctctga agtagagggg tcctcgctac ctgtgtgtgg catggacagg 3300 gcaggattag acatggaggg gccctgagac tgtccgccat gcatgctggc atactggcta 3360 cctgaaggca cagcactgcg tggcatagac gagctccagg actgaccacc agcaatgggg 3420 gggtaagggc taggattgta aggcatgggg gagggggacc tggactgtcc accataagtg 3480 gggggaaact gggtgcttcc ctgttgcatg acggtcataa actgacccat catggtcatg 3540 aactgaccca tcatgttgtt tacagtgcgt gtctcatgct cctgctgcat cctcactacc 3600 tgtagatcat gctccttcca ttcttccatc ttctgtttct ctgcctcccg ctgcctcttc 3660 tcttgctcgt cctgcaggct gctgaacgac ttctccaggg cagagaagga attctcgatc 3720 ctggtcttat tctgcggcgc tttcttcaca aaccctaaaa atcaacatag aattgtcaag 3780 agtctttcat attacaatat gttgttgttg ttgttgatat aggtctatcc atctagtcaa 3840 tatctggttc ccttatgaag cctttgaaca gaaatttaag atgattacaa aataaatctt 3900 tataaagttt atcttgatca tccaggcaac atttctaata gatctccaga aactactgta 3960 ttttgaaaac agtatttcta tgaaaattac ctggttttcc cttcttttgc tgagaggcag 4020 ttttcccctt cttctcaact cctcctcctt cttcatctgc ctttctcttc tctatgagtt 4080 gtaaaaagag cagatcagca cacattatta ctacatgtaa taataatact accgtttgtc 4140 agtctattca gactgacaat tatcccaata aagtttaaaa taatatacat gtaatacatg 4200 tactacagtg acaataatcc caataaagct gaaaacagaa tccattacaa ttggtaaaaa 4260 attgaaaatt tataacttta ctgtacctgg ctgagttgaa gtcttggctg ggacttcact 4320 ggatgtagaa gcactagaac ttgctgaagt gttcagacct accatataca aaagatggga 4380 agcagtcagg ggtacagcag caggtacttc aacacaaggg agtaaaacag gtaagtcaga 4440 caatgtgttt tcctgaccta atattcaaaa aggagaaagg aacacttacc tgatgaagat 4500 gagtctacag acacattgat actgtcttca gtaacatctt ctgggggtgg gctgggtcca 4560 cgtggagaat cgagagcctg gtcctgagca tcaccatcct ctccgccatc accatcctct 4620 ccgccctcgg agtcagacga ttcttgtaaa tatgaaggaa aagtagagtt ggcttgaata 4680 ctttaacatc aaaagtacat ctaaagtcat gcatgtaagt tcttcaaata ttcattctgc 4740 ttgattttct aattttcttt ataaacatgg cagcacccaa cccattcttc tcaactggtt 4800 taaaagggtt ctcagaccaa taatttattg caaactttct ggcaatctta cgtgataaga 4860 attaaacaag gtttcctggg tgaaagtagc atcaaaatac acaagtcaaa attcagggat 4920 attgcgtaac tttcttatat cgaccttgcc taaatactca agggatacta gggtgtgccc 4980 ccctcccccc caaaaaatca tttacgtttg aatcaaggcc atcacgtaga ctagttcaaa 5040 atttacctgg gtctgcgatg gactccaggc gcgactgtgg gttcaaagcc tccctgccac 5100 ccaagaatac gttcagctcg ttgtagaagt ggcagagttt cctgccttcg ccgctcttct 5160 tgtttttttt gacggcattg ttgaatgcag ccttcaggtc agcaattttc ctcttgatct 5220 gttctgctgt gcgatcttca tagccggcgt tcttcatcag ctggtgaatt tttgcataca 5280 ccatcttggt cctgggtccc tccagcttct tttgaatttc agcctgcccc cagatgctga 5340 taaggcacct agtttcaggg gtgctccaga tacagcgtct agcgccttct ttccttcccg 5400 ccatgttttt ctcgaggtat atgctaatta gcgcgcgctt tatgctaatc gctgaaccct 5460 gacctcccgc taacgccagc ccgacgtgtt ctctttatac gcagtttggc gtttacgggg 5520 actcccgctg agcctacggc cgtacgcttc ggagtggacg ggagaaattt cggcgttagc 5580 gggactacgt taacgggaac tctctttaca cggggctccc gctaacgcag ccccgctaac 5640 gccagtcccg ctaacgcgaa aaacgtaatg taaagtgggc c 5681 // ID Crack-26_AAe repbase; DNA; INV; 5566 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-26_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5566 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1242-1242 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 675..1661 FT /product="Crack-26_AAe_1p" FT /translation="MEKSKVICKECNKVELDSNKLISCMYCFSHTHLKCKN FT ISANAARRLKEGMFFCTTSCSEYYKRIIQIKDSKPSITSELNNDIQTVVSK FT ISEEMLSVRSEVRSITTAIEKSQDFLAEKFDFIMSDFKAIKAENECLKLEI FT AELKRLHSSLASTVHTLEYKVDKSDKNVNSNHAVITGLLCTPNENVVELSH FT KVFEKVGVKLSADSIVSAERIFQTIKPNSTAPIRVVFKSKADKERIIKKKF FT ETGNLKSTAIDNRFIINGKPSNVNIRDELTPLSLEILKALRSCQEKLGIKY FT VWPGRDGAILVKQSENCDIEIIRNRADLNDFIRCCSK" FT CDS 1917..4787 FT /product="Crack-26_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QQDYRNIHRWAPISDTGSTSLKILQWNVRGLNDLQKI FT DSILQFLDELNVPVHVIVISETWLKCGNTSLFNIRNFDPIYSCRESSSGGL FT AVFIRKTLSHKVLENLVDDGLHHISIEIKVKGLYYEIHGVYRPPCFEFNRF FT HDKMETWLQKSSVNRPCLIVGDMNVPINLVNNNVVVKYRNLLESFGFICTN FT SFITRPLSSNILDHIICQSEYAHRLTNYTVLNDASDHLPIISSFNLAKPFD FT KIILTKKIIDYRKLNFDFKNYVDSIGVVDNVSNCLEGIIRNYKLIYDRCIK FT IVSKEAKIKGTQCPWMTFDLWYLIKLKSRYLKKTKRHPLDEQTKIMLEHIS FT KKVDRLKKTCKKKYYENILSNTTHSKLWKNINSLFGLSKSENKICLTVNGT FT EVENDQHTCEILNNYFSSIGSNLADEVSLQNTADPYSAINCIQESIFLQPC FT SENEVTLLINDLNSKKACGYDNIPARLIKENCSTFSKILCQCFNTMLENEI FT YPDCLKIAKVVPVFKAGDPHSPDNYRPISTLSVFNKIFEKMLVNRLLSFLK FT RHNIIYKLQYGFRQGCSTTTAVTELVDSIINEIDAKKTVGALFLDLKKAFD FT TLNHNILLGKLERYGVRGVALNLIKSYLSNRQQYVSIGDYSSDLREISIGV FT PQGSNIGPLLFLIYINDLGNIALNGTPRLFADDTAIFYPCANPNLMINQID FT EDLLLLKYYFANNLLTLNLSKTKFMVFRSARKILPHLTEPKIGTTSIEYVT FT SFKYLGVHLDSTLSWEVHIKHVERKVASLCGIVRKVVSFIPRHVLIKFYYA FT HIHSRLQYLVSVWGRACKSKLIKLQVLQNRCVKLIFKLPILHPTLELYSNE FT LHNILPLLALCELQTLLLVHDILHNPNVHCNLVMDGGNRPHNTRQHGHLTR FT IRAVTNLGQKRISCVGPVRYNSLPIELKEIINRSLFKTRLKKYLKTKINDF FT LL" XX SQ Sequence 5566 BP; 1826 A; 880 C; 934 G; 1926 T; 0 other; tctgtgtacc cctgttttga attaaatttc aaagtgaatt tgtgaacatt aaatcatttg 60 ctgctcctct tggattgaat aatattacta tcgtgttgta taactaaaaa aacagtgtta 120 caattagtgt attacagcta aaatcgtaat cggtgcaaac tttgtttctt gccgtaatta 180 aaacaaagcc agtggtgtga atacacattc aacagtttat ccaagctagt attcgtcttg 240 ttgtgttgca catgtgttct atccagtgtc ctctatagtg attgcataca gtatgttcta 300 ctacttcatt gttgtgttca atatttgatc acactttcat gcaagtatat cataagcgta 360 gctggtggtg ggctctacaa actacacgtg ttgcaaaaat ataaataaaa aatggaatga 420 tatgttttta attttttatt ttttttttct tagagcgata tatatttatt ttattttatt 480 tattttttta tttatttatt ttttacacta attgttattt attttgatcg atatttattt 540 tccttgttat ttattttgcg tgtttgttaa cttattttga gacttatatt tatattatta 600 ttattattat tttttaacct acatgttttg aacagacttc tttgatattt taaagtgttc 660 ctacgtccgc gattatggaa aagtccaagg taatttgtaa agagtgtaat aaagtcgagt 720 tggattctaa taaattaatt agttgtatgt actgcttttc acatactcac ttaaaatgta 780 aaaatatatc tgcaaatgcc gctcgtcgtc ttaaagaagg tatgttcttt tgtacaacat 840 cttgttcaga atattataaa cgtatcatac aaattaaaga tagcaagcct tctattacat 900 ccgagctcaa caatgatata caaactgttg tgtcgaaaat ctctgaagaa atgttgtctg 960 ttagatcaga agtgcgatct atcactaccg caattgagaa atcgcaagat tttcttgcag 1020 aaaaatttga ctttataatg tctgatttta aggccattaa ggctgaaaat gagtgtctca 1080 agctagaaat agctgagctg aaaagattac attcgtcact agctagtaca gttcatactc 1140 tcgaatacaa agtagataag tccgataaaa acgttaattc aaatcatgct gtaattactg 1200 gcttgctttg tactccaaac gaaaatgttg ttgaattgtc acataaagtg tttgaaaaag 1260 ttggggtaaa attaagcgct gactcaattg tttccgctga gcgaatattt caaacaataa 1320 agcctaactc aacagcaccg ataagagtgg tttttaaatc taaggcagat aaagaaagaa 1380 ttattaagaa aaagtttgaa acaggaaatt taaaatctac cgctatagac aatcgattca 1440 ttattaacgg gaaaccaagt aatgttaata ttcgcgatga gttgacacct ttatctttag 1500 aaatcttgaa agctttgcgt tcttgtcaag aaaaacttgg aatcaaatac gtttggccag 1560 gtagagatgg tgcaatctta gttaaacaaa gcgaaaactg tgatatagaa ataattagga 1620 atagggcaga tttgaatgat tttattaggt gttgtagtaa ataaaaccat cgttttcaat 1680 tgtctttctg tgttttatta ttattcttag tttaatggcc catttaaaaa aattattctt 1740 gtgatagtat tgacgatttt aataatatgt gtaggcagtc tagcatcgaa tattaatgtt 1800 ttttaattta tgagatttgg tgcaggtgag gacatgcttt taggagcgtt ttaatgaaag 1860 tcgtttcgtg gattactggg gaactcatcg ctttgaacgt agttgtgtca ctttaacaac 1920 aagattatag aaacattcac cgttgggccc ctatatccga tacaggatca acgtccttaa 1980 agatattaca atggaatgta agagggctaa atgatcttca aaagatagat agtattctac 2040 agtttttaga cgaattaaat gttcctgttc atgttattgt aattagtgaa acctggttaa 2100 aatgtggaaa tacctctttg tttaacataa ggaattttga tcccatttat tcttgcagag 2160 aatcttcatc aggtggtttg gctgtgttta ttcgcaaaac gttatctcat aaagtgttgg 2220 aaaacttggt tgatgatgga ttgcatcata taagtatcga aattaaagta aaaggcttgt 2280 attatgaaat acatggtgtg tatcgccctc cttgttttga gtttaaccgt tttcacgaca 2340 aaatggaaac atggttgcaa aagtcctctg taaatcgtcc ttgtttgatt gttggtgata 2400 tgaatgtccc cattaatctg gtaaataaca atgtggttgt aaaatataga aatttattag 2460 aatccttcgg ctttatatgt accaatagct ttattacaag gcctttaagc tctaacatac 2520 tagaccacat aatctgtcaa tctgaatacg cacatcggtt aactaactat actgtcttga 2580 acgacgctag cgatcatttg cccataatat caagtttcaa tctggccaaa ccatttgata 2640 agattatact aacaaaaaaa atcattgatt atcgtaagct aaactttgat tttaaaaatt 2700 atgttgacag cattggagtt gttgataatg tgagcaattg tcttgaagga atcattagga 2760 actataaact gatttatgat aggtgcataa agatagtgtc aaaggaagca aaaataaaag 2820 gaacccaatg tccctggatg accttcgatt tgtggtattt aattaaactt aaaagtagat 2880 acctaaagaa aactaaacgg catcctttag atgagcaaac aaagatcatg ctagaacata 2940 tttcaaaaaa agtagatagg cttaaaaaaa catgtaagaa aaaatactat gaaaatattc 3000 ttagtaatac aacacattct aaactttgga aaaatattaa ctctctattt ggactttcta 3060 aaagtgaaaa taaaatttgc ctgactgtaa atggcacaga agttgagaac gaccaacaca 3120 cttgcgaaat ccttaacaat tatttttcca gtattggcag caaccttgct gatgaagttt 3180 cattacagaa tacagctgac ccttattctg ccatcaactg catacaggaa tctatctttc 3240 ttcaaccttg ttctgaaaat gaagtcacac ttttaataaa tgatttgaat tcgaaaaaag 3300 cttgtggcta tgataacatt cctgctaggc ttatcaaaga aaattgttcc actttttcta 3360 aaattttgtg ccagtgtttc aatactatgc tcgaaaatga aatatatcct gattgcttaa 3420 aaattgctaa ggtcgtccca gtgttcaaag cgggtgatcc gcattcgccc gacaactatc 3480 gtcccatatc aactctctct gttttcaaca aaatttttga aaagatgtta gtcaatcgct 3540 tattatcctt tttaaagagg cacaacataa tttacaaatt acaatacggc tttagacagg 3600 gttgtagtac tactactgct gttacagaac tcgtcgactc cattattaat gaaattgatg 3660 ctaaaaaaac tgtgggagct ctatttttag atttaaagaa agcattcgat acgttaaatc 3720 ataatatttt attgggtaaa ttagagcgat atggtgtcag aggcgttgcg ttgaatttaa 3780 taaaaagtta tttatccaat aggcaacaat atgtatcaat tggcgattat agtagcgatc 3840 tgcgtgaaat ttcgataggt gtcccccaag gaagcaatat tggaccactt cttttcttaa 3900 tatatattaa tgatttagga aatatagcct taaatggaac tccacgatta tttgcagatg 3960 atactgcgat attttatcct tgtgcaaatc ccaatttaat gattaatcag atcgacgaag 4020 acttgttgtt gctaaaatac tattttgcaa ataatttgtt aaccttaaat ctgtcaaaaa 4080 ccaaatttat ggtgttcagg tcagcgcgca aaattttacc tcatcttaca gaacccaaaa 4140 ttggaacaac aagtattgaa tatgttacct cattcaagta tctaggagtt catcttgatt 4200 ctactctatc atgggaagtt catataaaac acgttgaaag aaaagttgcc tctttatgcg 4260 gaatcgtacg aaaagttgtt agttttatac ctcgccatgt tttgataaaa ttctactacg 4320 ctcatattca ttctcgtcta caatatttag tatctgtatg gggtcgcgct tgtaaatcca 4380 aacttataaa actacaagta ttacagaata gatgcgtcaa attaattttc aaactgccaa 4440 ttttgcatcc caccttggaa ttatactcaa acgaactcca taatattctg cctttactag 4500 cattatgtga attgcaaaca ctccttttag ttcatgatat tcttcataat cctaatgttc 4560 actgtaattt agtaatggat ggtggtaacc gaccgcataa taccagacaa catggtcatt 4620 taacgcgtat aagagctgtt actaatttag gtcaaaaacg catttcatgc gttggtccag 4680 tgagatataa tagtttacca attgagctta aagaaattat caatcgaagt ctgttcaaaa 4740 ctcgtctcaa aaagtactta aaaactaaaa ttaatgattt ccttctataa aaatcagaac 4800 catcacaaaa ctactacatg acaatttttt tttcttgccg ttgtcaaatt attattacta 4860 tttattatct tttggtgttg gtgccttcga ataagctgca aatatcgcgt ttttttaagt 4920 tcttgtttta gtagaaaagt atttcgtaat atatgcatga ttcaaatttc tgaacgccat 4980 gcctagaaaa cacagtgata caacagtagc taacctggca ttttagatag aaaaatcctg 5040 tgaactcaga tgacacccat agttagactg gcagcttgcg ggtgccggat aactggtatc 5100 tttatgtgtt ctacaatgtt tttcttgatt gctagaacaa actacaaacg tttcacggta 5160 aactctagaa gcacaccatc agttatttgt tgctatgcat gttacaaatt taaagtcgaa 5220 taaatgaggc ttcatacttg tttggccttt gcgtttgcag ttttcgatga tcatcaaaat 5280 aacatttaat catctatgaa tcccttaaaa ggagatttct ctccactggg atatcgtata 5340 gattattgtt taaattagaa ctcgctcatt tttctttttt gtatagtttt tagcttgttt 5400 tcatctcagc tcataaacat agatttttcc tcttattaaa aagtgttgct caatttagct 5460 gagaatagat gagtccacta ccaggaggct cttctctcaa tattttgttt ttgtgagatg 5520 agctttttgg tgtgggggtg agtggcgggt aaaaaaaaaa aaaaaa 5566 // ID Mariner-3_BF repbase; DNA; INV; 1308 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-3_BF DNA transposon DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner/Pogo; KW TA TSDs; Pogo-2_BF; Mariner-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1308 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1308 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-1308 RA Kapitonov V. and Jurka J.; RT "A family of Mariner-3_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC It belongs to the Pogo group of Mariners. TPase-coding region is CC corrupted by mutations. XX SQ Sequence 1308 BP; 390 A; 287 C; 316 G; 298 T; 17 other; cgaggggtga tcaataagtt ctcagcctca ccatgaaacg aggggcggaa ctcctattct 60 tgggtataac ttaatactac aatgtcttaa atcactgcct atagaaatgc aaatttaggt 120 attcattaat tagcccgtgg tgcccattca aaatcagcag gtgtgaagta agtcagaagc 180 aaaatggagc cagacagaag ctcgcgctgt gatcaagtat ctstatwaag aagggacgca 240 cwcccaagga aayccatgaa gamatggtaa agacacatgg tgaggactcc ccttcttatt 300 ccactgtaaa gaagtgggtt gctaacttca agcaaggcca gragagcacg aaagatgatc 360 cmaggyctgg acgtccaara tcagccacca cggacaatca ggttgaggct atccatcgca 420 tggtaaagaa tgacagacgt gtcactatcc gacatatagg scattcacta ggcatcagta 480 tatggtstag ttcaaaatgt tttgtccaag catcttggcg atgagcaags tgtctgcmag 540 gtgggtgccc cgaatgttga cmccacatca gaagttagaa caggctgcaa gtgttcgagg 600 atgcttttgg cccttttcca gtcaaatcct ggcaactttc tcaagagatt tgtgacccaa 660 gatgagacwt gggttcacca ctttgatcca gaatcaaaag aacagagcaa gcaatggaca 720 cacaagggtt cacagccacc caagaagttc aagcgagtag tcatcggttg gcaaggtgat 780 ggcctctgtg ttctgggaca gcaagggggt catcatgatt gactacttgc aaaagggtca 840 aacaatcaat ggagagtact atgcttcaga actgacgaca gttaagagca gcaatcaaag 900 agaaaaggag aggaaagctt cgagctggtg tsctgctgct ccaggacaat gcacctgtcc 960 acacagcaca ggtggcagtg gcagcagcaa ccgaatgtgg ctttgaactc ttacctcacc 1020 cttcctactc acctgacctg gcaccatcwg acttctacct gtttcctaat catgaagtcc 1080 cacttgcgtg gtcaccgttt tgagacggat gatgacgtca tccaagctgt tgaggcctat 1140 ctaaagggtc aagatgaaac cttcttccag cagggaatag gaaaactgga acagcgatgg 1200 acaaagtgca ttgagcttag aggggactat gttgaaaaat agtgtaagaa aaatcttcct 1260 tctgtaacgt tttataagga gggccgagaa cttattgatc acccctcg 1308 // ID Expander1_Cis repbase; DNA; INV; 3208 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE RTE Non-LTR Retrotransposon from Ciona savignyi. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW Expander1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3208 RA Smit A.F.; RT "Expander1_Cis - RTE Non-LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci001025, Ci000255 Probably incomplete at 5' end. ORF from pos. CC 222-3191 encodes a protein 29% identical (49% similar) to the CC pol gene of Expander in Fugu rubripes. XX SQ Sequence 3208 BP; 1151 A; 635 C; 666 G; 751 T; 5 other; cttcttgcgg ctgtgggaac taacattcac cacagataga gtatgtgtaa ccttgacgag 60 gtgcagaagc tgaaagccaa ctagaaacca acgcttggat ctatgttagt agatctgtaa 120 ctttgggaag cggtttctga ggcggtacta attagcatac tctacccgga atgtccttga 180 gcaagtgtcc ggggaatcct acgcttacat gccaaaaatg aactacaaaa aaaagaaaat 240 ggccctcagt cccgggcaat acgagggtgc gaagaatctc ccggttagaa ttgctacatg 300 gaacacaaga acactacgtg aaaacgatac aatcgaagct atttctaaac gaaattaccc 360 acatgaaagt tgacataata ggtgttacag aaacacactg gacaactcaa aatcccacaa 420 tgtgggaaca agataattac gtcatcattc attctccgag aaaagacggc actcatcgac 480 agggagtagc tgtagcantt aggaaagaac tgctaagcaa cctcatctcg tacaagtgcg 540 tttctagcag attgatgaca atgaccatcc acttgaacaa aaaagtcatt acttttttcg 600 ttgcatatgc gcctgattcg agttatgatg actctcaaaa cgaagagttc tataatatca 660 tacaaagcat aatcaatgcg ctaccgcgga aaaatgagat catactctta ggagacttca 720 atgcatctgt aggagctgac aaatgcaata actggtcgga tgtgatgggg aaatatgggc 780 acggtgacat gaacgaacgt gggctacatc ttctccagtt ctgcgcaatc aacgagttaa 840 tcatatcaaa caccatattc aagcataaga gcggaagaag atacacatgg acatccccag 900 actacacctt taagaagcaa attgactata taataatgtc aaagaatttg aaagcatcaa 960 tcaaaaacag cagagcatat cagtcagccg agataggttc ggaccattct ctggtaatag 1020 caaacatgat ttttcgaatc gaaaaagaac caaaacgtgt caaaaaccca aaaataaaga 1080 agtatgacta tgacaagcta aaaaatggcg acatcgccaa agcattccga aaatctgttg 1140 gaagcaggtt tgccactcta ctgggccaaa caactgaaca tgaaccagaa tcantatatt 1200 gcaaaatgac gtcaattcta aaagacgcag ctgaaagaga aattgggttt aaacaacaca 1260 aaaccatacc tggactctcc attgaggtag tgaaactatg tgaagaacga cgagaagcaa 1320 aaatcaaatt gctgagcgaa ccaaataaca gtcttctaca aagtatattc acatctctaa 1380 acaaaaaagt taaaaaagcc gtcagacgac aaaaggaaga aaaccttcac caaaaaatat 1440 gtgaaatgga agaagccttt aagaagaata actcacacaa actctttgat tgcattaaaa 1500 atctagaaag gaaaaagcca aaaccgtttt ttggaataaa agataccaat ggagtccttc 1560 aggtggagaa aaagaaggtt ctcgatgtat ggaagtctca ttttgaactt catctaaata 1620 ctgaatttcc aagagatgaa aacgcattga tggaatttaa tgattcacac ccgaatgacg 1680 aagaaattca gcctataaca atggaggaag taaataacgc tataaggcag cttaaaaata 1740 acaaggcagc aggaatagac ggaatcacct ctgagctcat taaagcaggc ggatacatta 1800 cctcgaaaat gtttgtatcg ttatttaatg gcattattaa aagtgaaaaa atgccaaagg 1860 actggtccaa aatgatagtt accccgattt tcaaaaaagg ggataaacta gaccccaaga 1920 attacagagc tattgctttg ctttcaatac ctgggaaagt tttttgcaaa attctaatga 1980 atagatgtat ggaaaaaacc gagcagtacc tcagtaaatc acaattcggt ttcaggtccg 2040 gaaaaggaac tgtggacgct atttttatag tacggcaaat tattgaaaag gccagagaac 2100 acaatgttaa tctacatcta aatttcatag acttcaaatc agcgtttgac acaatttgga 2160 gagaggctct gtggaagatc atgttgcacg ttggaatttc tcgaaagata gttactctcg 2220 ttagaattct ctacaaagac acaacatgca ctgtacagat tgatggacag caaacagatt 2280 tctttccagt aaacattggc gtcaaacaag ggtgcataat gtcaccaatg ctcttcaaca 2340 tattcttgga ctatgttctg aaagaggtaa aaagtctgga ctctcanttt gattttaatg 2400 actctatgtt gattgacatc cgatacgcag atgacacaac actaatatct gctgtgtttg 2460 aaaagttaca aatagccacc tctgaactgg cgaaagcttg tcgaaagtgg gggcttaaaa 2520 taaacccatc caagtgtgcc gtcttatcag cagagaatgg ccgaattcag ctagaaaacg 2580 acactatccc aaacgtcgac agatttaaat atctcggaag catagttcct gaatgttcgg 2640 aagatataag gaacaggatt tcgttggcct ctcaggcatt tggtcgctta agatcgaaca 2700 tttggagctc acaccatata tcaagaaaac tgaagatacg actgtaccgg gctttaatcc 2760 tnccgatagc aatttatggc gccgaaacgt ggagcaccac ctcatctgac atgaactcat 2820 tgaaagtttt cgaaatgaaa tgcttgagag caattctcaa tgtgaccaga cgtgacagaa 2880 tcaggaatgt ngaccttagg aaaatcctga gagtggaaga aacgatcgag gatgtcgtca 2940 atgcaagacg cttgcgttgg tttggtcacg ttgcacgaag ctcggagtcg agcctgatca 3000 atctaagctt caggaaagac tttcaatgca aaaggcgaag aggacgtcca cgcaaaagat 3060 ggcgggaaaa catccgagat ctatgcggca tccccctcgc aacggctgaa cgaagggcga 3120 gaaacagaag caaatggaga gaagacgtag ttcgatggac cgcaaggggg catcatgccc 3180 tatgcacata actaactaac taactaac 3208 // ID piggyBac-8_SM repbase; DNA; INV; 2409 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; KW horizontal transfer; piggyBac-8_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2409 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 527-527 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-8_SM is a relatively young family of piggyBac CC transposons, characterized by 15-bp TIRs (one mismatch) and TTAA CC target-site duplications. The consensus sequence was CC reconstructed based on multiple alignment of 14 copies that are CC ~98.5% identical to the consensus. The consensus sequence is a CC good approximation of the active transposon. Surprisingly, a few CC ~88% identical copies of piggyBac-8_SM are present in the CC silkworm (Bombyx mori) genome (see piggyBac-1_BM). The 88% CC nucleotide identity, including non-coding regions, of these CC transposons in two species, which have diverged from their last CC common ancestor over 500 million years ago, is a first clear CC evidence of horizontal transfer of piggyBac transposons. The CC horizontal transfer has happened a few million years ago between CC a flat worm and silk worm. We report also a second example of CC piggyBacs horizontally transferred between the flat worm and CC silkworm: piggyBac-15_SM and piggyBac-N1_BM. XX FH Key Location/Qualifiers FT CDS 477..2219 FT /product="piggyBac-8_SMp" FT /note="piggyBac transposase." FT /translation="MSKKAKLSTYLENATNIPLECSDVESESYDSDGDNSS FT AIIESGDEEIPDEDNIDQSSDSDREKILNKRTRRCMRFPSSSEDECGSNIP FT NQQTEIASDGTIWTRIEEGGSAGRSPHHSAFKDEHGPTAHAKRNIMNGNLS FT SAFLLLIDNHILEHIRICTESEASRVLGKNWTITQEKLKAFLAILYARGAY FT EANNLRLQYLWNNKWGPSFFSNTMSRRDFTEIIRYIRFDKKNQRSQRLQTD FT KFALVSAVWDKFIENSKNCFKPGAYITVDEQLFPTKARCRFTQYMPNKPQK FT FGIKFWLASDVETKYVVNGFPYLGKDETRNASTPLSEFVVIKLLEPYTMKG FT RTVTTDNFFTSIPLALKLRSKNTSLLGTIRANKRELPKICKLKKDSMARFS FT TMLYQSNGCTLTVYKSKPNKKVLLLSTKHKHIKIDKAAKKLPETVSFYNNT FT KFGVDVTDQMARKYTVKSGSRRWPLQVFFNILDLAAINSWVLYKNTTGENI FT SRKDFLFQLAEELASEYQTSRQKPHETNIPTINATVPVRKWCQIGYCNNNK FT TTNICNKCKKSLCGKCTRSKIYTCRNCEQQTVNN" XX SQ Sequence 2409 BP; 859 A; 398 C; 439 G; 713 T; 0 other; cactagattt accagaccag tcatttagac tggtcgtgca atttcaatcc aaaattccta 60 ctatattaca cttttttttc cagaaatgat gttatgagaa atgatgttat gacttttgta 120 gctataatta gagagtacgt tttataaact taatgttgtt ttatatacat tagtaacaaa 180 tataattttt gttgttgaca tttataccag taccagtcaa aatgacaggt tgtgtttgta 240 tgaaaaaatg tatgattttg ggaatcatga gaccgatgca gtacatatag cactgaagcg 300 tactttgtta tcgtcttttg ttttctttca ctatcgtcat tgagttgatg acgtcatttt 360 gatgagcatt tgttttgtaa atgcactaga catgttcagt cttgtttcta gctttgtgca 420 acgcgcagca tttttcttat aaacaagatt tttatagtga ttgtgatccc ctaaaaatgt 480 caaaaaaagc aaaactttct acatatttag aaaatgcaac caacatacct ctggaatgtt 540 cagatgtaga aagtgaatca tatgatagtg atggtgataa tagtagtgct attatcgaaa 600 gtggtgatga ggaaattcct gatgaagaca atatagatca gtcaagtgac agtgatcgcg 660 aaaagattct aaacaaacgc acaagacgat gtatgcgatt cccttccagt tctgaagatg 720 aatgtggaag taacatacca aatcagcaaa ctgaaattgc ttcagacgga actatttgga 780 cgagaattga agaaggaggt agtgctggta gatcaccaca tcatagtgct ttcaaagatg 840 aacacgggcc aacagcacat gctaaaagaa acattatgaa cggaaatcta agtagtgcgt 900 tcctattact gattgacaac catatcttgg aacatattcg aatttgtact gagtcagaag 960 cctctcgagt cttggggaaa aactggacaa ttacgcaaga aaaattgaag gcatttctag 1020 caatattata tgcacgtgga gcgtacgaag caaataattt gaggcttcaa tatttgtgga 1080 ataacaagtg gggaccatca tttttctcca acactatgag tagacgtgat tttactgaga 1140 ttatacgata cattcgtttt gacaaaaaaa atcagaggag tcaacgtttg cagacagaca 1200 agttcgcttt agtctcagca gtttgggata aatttattga aaacagtaaa aattgcttca 1260 aaccgggagc ttatattacc gtggatgagc aacttttccc aacaaaggcc agatgcagat 1320 ttactcaata tatgccaaac aaaccccaaa aatttggcat aaaattctgg ttggcgtccg 1380 atgtagagac gaaatatgtg gtaaatggct ttccgtattt aggaaaagac gagacacgaa 1440 atgcatcaac tccactaagc gaatttgtcg tgataaaact tcttgaacca tataccatga 1500 agggtagaac tgtaacaact gataattttt ttacaagtat tcccttggcg ttgaaattac 1560 gatctaaaaa cacttcgtta cttggaacaa tacgcgcaaa caagagggaa ctgccgaaaa 1620 tttgcaaact gaaaaaagac agcatggcac gtttctcaac gatgttgtac caatctaatg 1680 gatgcacact tactgtctat aagagcaaac ctaataaaaa agtacttcta ctaagtacaa 1740 aacataaaca catcaaaatt gataaagctg caaagaaatt acctgaaacc gtatcgtttt 1800 ataataatac caaatttggc gtcgatgtta ctgatcaaat ggcacgaaaa tatacggtga 1860 aatcaggttc cagaaggtgg cctcttcaag tatttttcaa tattttagat ttagccgcaa 1920 taaatagctg ggtattgtac aaaaacacaa caggagaaaa tatttcacga aaagactttc 1980 tgtttcaatt agcagaagaa cttgcttcag aatatcagac ttcaaggcaa aaaccacacg 2040 aaaccaatat accaaccata aatgccacag ttcctgtacg aaaatggtgt caaataggat 2100 attgcaataa caataaaact acaaatattt gcaataaatg caaaaaaagt ctatgcggaa 2160 agtgtacacg cagcaaaatc tacacatgta gaaattgtga gcaacaaact gtaaataatt 2220 aagtgttttg ttatttggaa atatatataa atccatttat ctattataat taatattgtt 2280 caactctatt tttgtcattt ttaagagtaa atatagacta gaaaccacaa gaagcctgtt 2340 ttttatgcac cagtcatttt gactggtttg gtagaaatag gtatattcaa atcgctggta 2400 tatctagtg 2409 // ID ZENON_BM repbase; DNA; INV; 3016 BP. XX AC . XX DT 09-DEC-2004 (Rel. 9.11, Created) DT 06-AUG-2007 (Rel. 12.09, Last updated, Version 3) XX DE Bombyx mori non-LTR retrotransposon complete CDS, a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW putative reverse-transcriptase/endonuclease; ZENON_BM. XX NM ZENON_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Liu X.; RT "A complete full-length non-LTR retrotransposon on the Z RT chromosome of the silkworm, Bombyx mori."; RL Unpublished. XX RN [2] RP 1-3016 RA Gentles A. and Jurka J.; RT "Consensus of non-LTR retrotransposon ZENON_BM."; RL Direct Submission to Repbase Update (30-SEP-2004). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..2958 FT /product="ZENON_BM_1p" FT /translation="MKVRKRKRVLSATPHSTFNVDFSNVRGLHSNLDAVHH FT HLETAQPALLFLTETQISAPDDTSYLEYPGYVLEHNFLRKAGVCVFVRADV FT CCRRLRSLEQRDLSLLWLRVDHGGCTRVYACLYRSHSSDAGSALIEHVQEG FT TNRVLEQYPSAEVVVLGDFNAHHQEWLGSRTTDLPGRTAYDFALAYGLSQL FT VTQPTRVLDIEGHEPSLLDLLLTTDPAGYSVVVDAPLGSSDHCLIRAATPL FT SRPSRRTTTRYRRVWQYLSADWDGLREFYASYPWGRFCFSSADPDVCADRL FT KDVVLQGMELFIPSSEVPVGGRSRPWYNNASRDAAHLKRSAYVAWDDARRR FT RDPNISEERRKYNAASRSYKKVIARAKSEHVARIGERLKSYPSGSRAFWSL FT AKAAEGNFCRSSLPPLRKSDDSLAHSAKEKADLLVKLFASNSTVDDGGATP FT PNILRCDSSLPEICFTQCAVRRELRLLDVHKSSGRDGIPAVVLKTCAPELT FT PALTRLYRLSYCANRVPSSWKTAHVHPIPKKGDRSDPSSYRPIAITSLLSK FT VMERIINIQLLKYLEDRQLISDRQYGFRHGRSAGDLLVYLTHRWAEALESK FT GEALAVSLDIAKAFDWVWHRALLSKLPSYGIPEGLCKWIASFLDGRSITVV FT VDGDCSDTMTINAGVPQGSVLSPTLFILYINDMLSIDGMHCYADDSTGDAR FT YIGHQSLSRSVVQERRSKLVSEVENSLGRVSKWGELNLVQFNPLKTQVCAF FT TAKKDPFVMAPQFQGVSLQPSESIGILGVDISSDVQFRSHLEGKAKLASKM FT LGVLNRAKRYFTPGQRLLLYKAQVRPRVEYCSHLWAGAPKYQLLPFDSIQR FT RAVRIVDNPXLTDRLEPLGLRRDFGSLCILYRMFHGECSEELFEMIPASRF FT YHRTARHRSRVHPYYLEPLRSSTVRFQRSFLPRTIRLWNELPSTVFPERYD FT MSFFKRGLWRVLSGRQRLGSAPGIAEVHGRR" XX SQ Sequence 3016 BP; 639 A; 810 C; 829 G; 737 T; 1 other; atgaaagttc gaaaaaggaa aagggttttg tcggcgactc cccattccac atttaatgtg 60 gatttcagca atgtaagggg tctacatagc aacctcgacg ccgtacacca ccatcttgag 120 acggcgcagc ctgcccttct ttttctgacg gagacgcaga tatctgctcc ggatgatact 180 tcgtaccttg aataccccgg ctacgtattg gagcacaact tcctgcgtaa agccggggtg 240 tgtgtattcg tccgggctga tgtctgttgt cgccgtctac gaagcctcga acaacgggac 300 ctgtccctct tgtggctgcg cgtagatcac gggggctgta cccgagtcta cgcgtgcctg 360 tacaggtccc acagcagtga tgcaggttca gctctgatag agcatgtgca agaggggact 420 aaccgcgtgc ttgagcagta cccatctgcg gaggtggtgg ttcttggaga ctttaacgct 480 catcaccaag agtggttggg gtccagaacc actgacctcc cgggtcggac tgcctacgat 540 ttcgccttgg cctacggcct ctcccagctg gtgacacagc ccacccgtgt cctagatatt 600 gaggggcacg agccttctct gttggacctt ctgctgacca ccgatccagc cggatacagt 660 gtggtggtcg acgctccact tggatcgtct gatcactgcc ttatccgtgc tgccacacca 720 ctctctcgtc ctagtcgtcg aacgacgacc aggtatcgaa gagtttggca gtatttgtca 780 gcagattggg atggattgcg tgagttttac gcatcctacc catgggggcg gttctgcttt 840 tcctctgctg atcctgacgt ctgtgcggac cgtcttaaag acgtggtgct ccaggggatg 900 gaattgttta ttccctcctc tgaagtgccc gttgggggtc gcagcagacc ctggtataac 960 aatgccagca gggatgctgc acacctcaag cggtccgcat acgttgcatg ggatgatgct 1020 aggagacgtc gggatcctaa catctcagag gaaaggcgga aatataacgc cgcttccagg 1080 tcctacaaga aggttattgc cagggcgaaa tcggagcacg ttgctagaat tggcgagcga 1140 ctgaagagct atccctctgg gagccgtgct ttttggtcgc tcgccaaagc tgcagaaggt 1200 aacttttgca ggtctagtct cccaccacta cgcaagtccg atgacagtct ggcccatagt 1260 gcgaaagaga aggctgacct tctggtcaaa ctcttcgcct cgaactcgac tgtggacgac 1320 gggggtgcca caccaccgaa catcctccgg tgtgatagtt ccctgccgga gatctgcttt 1380 acacagtgtg cagtcaggcg ggaactccga ctcctggacg tccataagtc gagcgggaga 1440 gacggcatcc ctgcagtggt tctgaaaacg tgcgcccctg agctgacgcc tgcgctaacg 1500 cgtttgtatc gcctctctta ttgcgctaac agggttccgt cttcatggaa gactgcccac 1560 gtccacccta tccccaagaa gggtgaccgg tcggacccat cgagctacag gcctatcgcg 1620 ataacttcct tgctttccaa ggtgatggag cgaataatta atatacaact cctgaagtat 1680 ctggaggatc gccagctgat cagtgaccga cagtacggtt tccgtcacgg tcgctcagct 1740 ggcgatcttc ttgtatacct tactcacagg tgggctgaag ccttggagag caagggcgag 1800 gctcttgctg tgagccttga tattgcgaag gccttcgact gggtctggca tagggcactt 1860 ctgtcgaagc taccatctta cggaatcccc gagggtctct gcaagtggat cgctagcttt 1920 ttggatgggc ggagcatcac ggtcgttgta gacggtgact gttctgatac catgaccatt 1980 aacgctggcg ttccacaagg ttcggtgctc tcccccacgc ttttcatcct gtatatcaat 2040 gacatgctgt ctattgatgg catgcattgc tatgcggatg acagcacggg ggatgcgcga 2100 tatatcggcc atcagagtct ctctcggagc gtggtgcaag agagacgatc aaaacttgtg 2160 tctgaagtgg agaactctct ggggcgagtc tccaaatggg gtgaattgaa cttggttcaa 2220 ttcaacccgt taaagacaca agtttgcgcg ttcactgcga agaaggaccc ctttgtcatg 2280 gcgccgcaat tccaaggagt atccctgcaa ccttccgaga gtattgggat acttggggtc 2340 gacatttcga gcgatgtcca gtttcggagt catttggaag gcaaagccaa gttggcgtcc 2400 aaaatgctgg gagtcctcaa cagagcgaag cggtacttca cgcctggaca aaggcttttg 2460 ctttataaag cacaagtccg gcctcgcgtg gagtactgct cccatctctg ggccggggct 2520 cccaaatacc agcttcttcc atttgactcc atacagagga gggccgttcg gattgtcgat 2580 aatcccaktc tcacggatcg tttggaacct ctgggtctgc ggagggactt cggttccctc 2640 tgtattttgt accgtatgtt ccatggggag tgctctgagg aattgttcga gatgataccg 2700 gcatctcgtt tttaccatcg caccgcccgc caccggagta gagttcatcc gtactacctg 2760 gagccactgc ggtcatccac agtgcgtttc cagaggtctt ttttgccacg taccatccgg 2820 ctatggaatg agctcccctc cacggtgttt cccgagcgct atgacatgtc cttcttcaaa 2880 cgaggcttgt ggagagtatt aagcggtagg cagcggcttg gctctgcccc tggcattgct 2940 gaagtccatg ggcgacggta accactcacc atcaggtggg ccgtatgctc gtctgcctac 3000 aaaggcaata aaaaaa 3016 // ID Gypsy-11_SI-LTR repbase; DNA; INV; 228 BP. XX AC AEAQ01022239; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_SI_; KW Gypsy-11_SI-I; Gypsy-11_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022239; Positions 256 29. XX SQ Sequence 228 BP; 55 A; 59 C; 48 G; 66 T; 0 other; tgtcaagtgt ttaaataatg tgcccgcgta ccccggcctc tcaatcgcga caaatagttt 60 caaggaatct ccctgccact tattacgatt gggtagcgac cgcccgctgt atgcgcgccg 120 tgttttatcg tcatcgattg gatcgctaca atataatttg tgaaacttgt tatcggaact 180 ttccctcgac cgtctcggag cattattgta aaaggaaccg ccattaca 228 // ID Copia-12_AA-LTR repbase; DNA; INV; 294 BP. XX AC supercont1.351; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_AA_; KW Copia-12_AA-I; Copia-12_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-294 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.351; Positions 779626 779919. XX SQ Sequence 294 BP; 75 A; 57 C; 70 G; 92 T; 0 other; tgttgtgaag agcaatattt tctgtatacg cctatgtgta tgaataacag ctaccgaaag 60 ataacgaaga agaacagtat gaagaacaat tgaattcgaa gacgtagtgt gcagtcagtt 120 actatcgaac cgtttataat aaaggacgtt ttgtttcgta tgtgcgttgc ttcagttctt 180 ccgatcacgt tgcgcgagtt cccaaatccc tcggttctgt tccggttctt tgtgtccgcc 240 ggtctggata ttgtccgctg ggaagcttgg atggtcgatc agtttaatcc aaca 294 // ID hAT-9_AP repbase; DNA; INV; 987 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-9_AP. XX NM hAT-9_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-987 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(9), 2100-2100 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 175..774 FT /product="hAT-9_AP_1p" FT /translation="MCIFNKYLIKYFKFYSKFDIHADFARSPTFLTNIIRP FT NKTKYFILFILFNDNSIKNELSIILSNFQCITDTITCLEKSGVNLGKSLEL FT VQNVENKLSEGGRGIEFAKVKLSEVLSKNPGLSQIKKISEIISGEIRNDDE FT DLAELSPEDISCFKYAPIVSADVERSFSKYKVMLRDNRRSFQFDNLKAHFV FT TSCWYSFNNY" XX SQ Sequence 987 BP; 351 A; 129 C; 138 G; 368 T; 1 other; gggcccggat ttttatgcaa atgcatattc ttctacattt gacgtataat aattctgatt 60 aaatatcttg atgtttcgta aaacgtagat tctactgtta attttttttt gcatattttt 120 gcatatttgc atcaaaatgc ataataattg catatttcgg ggtaattaca taaaatgtgc 180 atatttaata aatacttaat aaaatatttt aaattttact caaagtttga tatacatgcc 240 gattttgccc gatcgccgac gttcctgaca aatataattc gaccgaataa aacaaaatat 300 tttattttat ttatattatt taatgataac tcgatcaaaa atgaactttc gataattcta 360 tcaaatttcc aatgcataac agacactatc acgtgtctag aaaaatcggg tgtcaatctt 420 ggaaaaagtc tagaattggt tcaaaatgta gaaaataaat tgagtgaagg aggacgaggt 480 attgaatttg ctaaagttaa actatcggaa gtattatcaa aaaatcccgg attatcacaa 540 ataaaaaaaa tttctgaaat tataagcgga gaaataagaa acgatgatga agaccttgca 600 gagttgtcac cagaagatat ttcttgtttc aaatacgcac ccattgtgtc agcagacgtg 660 gaaagaagtt tttcgaaata caaggtgatg ctacgcgaca atcgtagaag ttttcaattt 720 gataatttaa aagcacattt tgttacatca tgttggtact cattcaataa ttattaatta 780 atacacattt ttatgcttta tacacatttt aaaaaatgat atttaaattg tatttttttt 840 tcacttaata ctatattttt taattttttt taaattttga ttttttaaac aatanttttt 900 tttgcatatt atttgcatat ttttgaattt ttgagtgcat atatatgtgc atatttaaca 960 aaaatgtttg cataaaaatc cgggccc 987 // ID Polinton-6_NVi repbase; DNA; INV; 10272 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-6_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-10272 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 796-796 (2009). XX DR [1] (Consensus) XX CC The consensus may be incomplete at both ends. XX FH Key Location/Qualifiers FT CDS join(204..830,749..1903) FT /product="Polinton-6_NVi_1p" FT /translation="MLMFVERGIRGGISQCCNRYAKANNPYMKEGYDANAD FT EKYLMYYDANNLYGWAMTEALPYGKFKWVTNLETPNFFNVPDDGPKGYILE FT VDLEYPITLHGAHRDMPFCAEHMCPPGLKQKKLMTTLHDKKNYVIHYRALK FT QALAHGLRLKKIHRALEFEQSKWLKPYIDLNSAKRTAAKNEFEKCFSNYLI FT TLFMEKLWRTRESTSRSSWKMLFKLFNNAVYGKTMENERKHVEIKLVTKWE FT GRYGAEALIAKPTFHNCTIFDENLVAVQMFRAEVCIKKPIYIGLSVLDLSK FT TLIYRFHYDYMLERAGDKCKLLYTDTDSLVYEVSNLNMYEIMRTDIHEFDT FT SDYAENNQFDMPRVNKKIVGLMKDECNGDILLEFVGLRSKMYAMLVQNQKP FT IKKAKGVKSSVVKATIEFDDYITCLQENTILTREQRNIRTRQHKLFTEKEV FT KIALSARDDKRFLQKGKTDTLPWGHYSIIVDEQPTAAVLSSREEEEAIAVA FT GPSREGEKQQLSWLESQSLSPPQSSRRYSKREKQSWWGSAYMSGIPGARSS FT YCLLKSQRKDLFINHRRRRKERATTIPRRKNNSAKVSSIKRAHIHL*" FT CDS join(7888..9123,9065..9634) FT /product="Polinton-6_NVi_4p" FT /translation="MDEILSIQSPVSFDESLAHYELHAHQPYTVSSYNNSD FT EIRIAIQHQDLSLLPSRSSLHVCGKLTKPNGTALARTKLVNNAICHMFEEI FT RYEINAVEIDKCKNVGLTTVMKGWISHNPSQSLIMENAGWLDIAETKSLTN FT ASGYFDVNIPLSMIFGFAEDYRKIVVNVKHELVLTRSRNDLNAIIQTATLA FT DGVATFEEYKLELTKIEWLMPYVVASNTNKIRLLNYIEKNRPISMSFRSWE FT LYEYPVLPTSTKNVWTVKTSNQLEKPRFVILGFQTNRKNQQAENASQFDHC FT DISNVKLFLNSQYYPYGNLNLDINRYQYAVLYDMFANFQSLYYDKVSEPVV FT NKNDFISRLPLIVIDCSKQNESLKSAPVDVRLEFESRDNFPAGTSAYCLIL FT HDRIVQYNPVSGDIKTLISYTIASFSIILSAVISRRSYKMEYAVDMQGFKK FT PGKDFVLKELAILPLEEDAEPVVLLFKEPFPWCKLSQKYKRENLWLELYHH FT GLSWDSGDHEYTEIGNLLRDALKDASKIFAIGDVRKKWLERFNFKVTDITD FT CGYPATDHPKLVTICTNHNGAHKATCARQNVRLMRRYYFDNVFMEWDDISS FT DEA*" FT CDS join(4919..3981,3977..3543,3565..2633) FT /product="Polinton-6_NVi_2p" FT /translation="MSDFHQEKNVLHQLVKARNAVKRKYSLLKFGKDNFEK FT AVEETFRPIVDPLQKLVESENKKSITAATSPVDNSHKNPFKRKVKKIKSER FT PIDDDDETYKFDEDNESYWANSTMQDDGAETAYDTADSENETSISKPSLPF FT LNQANLDKIYGLRKENENYMLGNSIVKFEDKKIVVDNVEYPKTAGLMDLIA FT YRNPDKKIVSSDDMKNYRSILEATSVHKKKLNPKADIRSSNSKKYTDFIAP FT LFFGKREKEGGALPRYKIARMNTRMDYVYWDDPNELVDRLRLLIAEQSAGN FT PSHVNEIHSIIEELREGGYIRALPVNXIIKMSVDVFGRNLKKNEGDRGPPG FT FGFKITKDGQYDLDNKRLCNVAAPHKLNEAVDLGTLHTIHQMEHKKVRDVT FT DKLREELKDLDSLIEAHRDEMDRQVLELQADVKAIKDALKEMAEANAISAF FT KATTLEKTADNGAKKLLIMEHERLVEELHKPARRNYPRRKFVMRGLDETWQ FT ADLVEMLPYSKENKGYNYLLTVIDIFSKYAWAVPVKKKTGKEVAAAMKSLL FT SQGRVPKNLHTDRGKEFYNSDFKNLMQKHKINLYSTYSKMKASICERFNRT FT LKSAMWKKFSLRGNYKWLDILPGLLATYNGSKHRTIGMKPKDVSRTNEADV FT LKRFTDKVKPVKKPKFKVGDKVRVSNAKQVFEKGYTPNWSTEIFTITKVAA FT TKPTTYHLKDYQDQPISGCFYEQEILKAKYPDVYLVEKVLKRKGNXIFVKW FT LGFDSTHNSWINKTDNV*" FT CDS join(5767..6558,6562..6963,6989..7696) FT /product="Polinton-6_NVi_3p" FT /translation="MIVHRRQRRGRGFVNKIINKLPVELHLPGYQYCGPGT FT KLAKRLARGDPGINQLDVACKEHDIAYSQNRDNDEARTIADKILADKASKR FT ASSKDASIGEKIAAFAVSKTMKLKTKLGMGAKRKRRTKKSLRLNKIVKAAA FT KSMIPTNNARKAILSALKGARDAVKKAGGKHKVLVPRVLPVPSKVGGFLPL FT LIPIFAGLSAAGALAGGAAGIAKAVNDSKAAKLQLEETSRHNRKMEELSIG FT QGLYLKPHKTGMGLRIAHGKTLIKKKTSLKLRLPGRPLTDSDLVKYAKILK FT IPHFRGVFMRNDLPXNGPRYNESAVVNLDDASGPGTHWVAYRKRGKSVVYF FT DSFGNLQPPTDLMXYLNVNGVKYNPDRYQDYNSFNCGHLCLKFLSNKLNTP FT RRTILILNSTVMENSLTLTLSGSSSILEAQYFPPIELSPHKQYVLGLVELL FT TFNSIPNIDKGNNKFYVGKEEIVLPTGSYKIQDIDSXLREILTKKKISISI FT QPNNNTLRSIIKCNRKIDFRPQDSIGALLGFTQRVLQENXKHSSDLPVAIL FT KVNALRIECNITSGAYINGQLAHTIHEFFPAVPPGYKIIEVPSQVIYLPIT FT VKSIDHLQIRIVDQDGHLVNFRNETITIRLHLKPT*" XX SQ Sequence 10272 BP; 3074 A; 2166 C; 2112 G; 2890 T; 30 other; ttgagaattt tagaagtaat tgtctgcatg catatggttt agatccagct cactactaca 60 ctacgccagg tctgtcatgg gacgctatgc tgaaatatac acaggtgaat ataactaaga 120 taatctaaag ctatgaaaat gggcaaaata tttatattat taattatttc taggttcgat 180 tggatctctt gacggacatt gacatgctca tgtttgtcga gcgtggaata cgaggaggaa 240 tcagccagtg ttgtaaccgg tacgcgaaag ctaacaatcc atacatgaag gaaggttacg 300 atgctaatgc tgatgaaaag tacctcatgt actatgatgc caacaacctg tacggctggg 360 caatgaccga ggcgttaccc tatggaaaat tcaagtgggt aactaatctc gagacgccca 420 acttcttcaa cgttcccgac gacggcccga aaggatatat tctcgaagtg gatctcgagt 480 atccgataac gcttcacgga gctcatcgag atatgccctt ctgcgccgag cacatgtgtc 540 caccgggttt gaagcagaag aaattgatga cgacacttca cgacaagaag aattatgtca 600 ttcattacag agcattgaaa caagctctgg cacacggact tcgtctgaag aaaattcatc 660 gcgctctgga attcgagcag agcaaatggt tgaagccgta catagactta aacagtgcca 720 agcgaacagc cgccaagaat gagtttgaaa aatgcttttc aaactattta ataacgctgt 780 ttatggaaaa actatggaga acgagagaaa gcacgtcgag atcaagttgg tgacaaagtg 840 ggaaggaaga tacggtgccg aagctctcat agcgaagcca actttccata attgcaccat 900 cttcgacgaa aatcttgtgg cagtgcagat gttccgtgcc gaagtttgca ttaagaagcc 960 gatatatatt ggactgagcg ttttggactt gtccaagact ctgatatatc gctttcatta 1020 tgattacatg ctcgaacgtg ctggtgacaa gtgcaagctt ctgtataccg atacagatag 1080 cttggtctac gaagtgtcaa atctcaacat gtacgaaata atgcgcacag acatacatga 1140 gtttgacact tcggactacg ccgagaacaa tcagtttgac atgcctcgcg tcaacaaaaa 1200 gatagtgggt ctgatgaagg acgagtgcaa cggcgacatt ttgctagaat ttgttggact 1260 tcgaagcaaa atgtatgcca tgctagtcca gaatcagaag cccatcaaga aagcgaaggg 1320 tgtcaagagc tcggtagtca aggccaccat cgagttcgac gactacatca cgtgcctgca 1380 agaaaacacg atactaactc gagagcagcg gaacattcgg acgagacagc acaagctctt 1440 caccgagaag gaagtgaaga tagctttgag tgctcgagac gacaagcgtt ttctccagaa 1500 aggtaaaacg gatacgctac cctggggtca ctacagcatc atcgtggacg agcagccaac 1560 cgcagcagtt ctatcatcga gagaggagga agaagcaatc gctgttgcag gaccgtcaag 1620 agaaggggag aagcagcagt tgagctggct agagagccaa tcactgtcgc cgccgcaatc 1680 gagccgccga tattcgaaga gggagaagca gagctggtgg ggatcggcct atatgagtgg 1740 gatccctgga gccaggagct catactgtct cctgaagagc caacggaagg atctattcat 1800 caatcaccgc cgacgaagaa aagaaagagc aacgacgatc cccagaagaa aaaataactc 1860 tgcaaaggta agtagcataa aacgtgcaca cattcaccta taatatatac ataattatta 1920 ataagaattg ttaactttgt gtgttacaga tggatatgga aaatctaaac cgcatcgcta 1980 agggaggatt ccttcctaca aagaagctgt ccgagctgga aaaggatcaa atgtacatgg 2040 tcacagtctt gaaggaggtc aagaccaagt tcgggtctaa gatcgtggca gagctgaaca 2100 gagagttcga catcttcctc ccgaagaagg tctccgacac cttcctggca gacgagagct 2160 tctatgagaa catgcaggat gcggccaaca agctcgaact gttcgtgaca taccgtggcg 2220 gatacgtagt cgagttttcc agcaaatgaa tgcgtcgtga agtggttgaa ctagtctgga 2280 aytgtatgaa cgagaacgag ctrggaataa attgtttwca ctcgtagtga ctagtttagt 2340 ctgcgcattt acaaatagtt ttaagttagt ttgtaaaatc gtcatcgtgc tacatagacg 2400 agtgtgcttt tagtcagtga tagtttttaa gaatgtcgaa ttgtaagaca aagtgtgcgc 2460 gtgtgtgtac ctgtgtctgc gtgtgcgtat gtgcgtgttt ttgcttgtat ataataagtt 2520 agtaataagt atgtatctag gcgcgcggca gttataaata aataaataat tttttttgtt 2580 cacttaaaat ttgatcttat tattaaaaaa aacacaatca ttaatacatt tattatacat 2640 tatcagtttt attgatccag ctattatgag tgctgtcaaa acccagccat tttacaaata 2700 tttkatttcc ttttctcttc aaaacttttt caactagata tacatccgga tatttagctt 2760 ttaatatttc ttgctcgtag aagcatccag agattggctg atcttgatag tcctttagat 2820 gatacgtcgt tggtttagta gcagctactt tggtgatcgt gaaaatctca gttgaccagt 2880 ttggagtata tcctttttca aatacttgtt tcgcattgct cactcgtact ttgtcaccga 2940 ctttgaactt tggcttcttc acaggtttaa ctttgtcagt aaaacgtttc aacacatcrg 3000 cttcgttagt acgcgataca tctttaggct tcataccaat agttcgatgt ttgcttccat 3060 tataagttgc taataaacca ggtaaaatgt ccaaccattt gtagttgccg cgtaagctga 3120 attttttcca cattgcggat ttcagagtgc ggttgaaacg ttcgcaaatt gaagctttca 3180 ttttgctata cgtcgagtac agattgatct tatgcttttg cataagattt ttaaaatcag 3240 aattgtagaa ttctttgcct ctgtckgtgt gtaagttttt tggtactcga ccttgactca 3300 acaacgactt catcgccgca gcaacttctt ttccagtttt tttcttcaca gggacagccc 3360 aggcatactt cgaaaatata tctatcacag tgagcaagta attatagcct ttgttctcct 3420 tggagtaagg tagcatttct accaaatcag cctgccaagt ttcatccaat cctcgcatga 3480 caaactttcg acgaggatag ttacgacgag caggcttgtg aagttcttct acaagtcttt 3540 catgctccat tatcagcagt tttttctaat gttgtcgctt tgaaggcaga tatagcatta 3600 gcttcggcca tctctttgag agcgtctttt atagctttca catctgcttg tagttctaaa 3660 acttgccgat ccatttcatc tctgtgtgct tcaattagag aatccaaatc cttcaattcc 3720 tctctcaatt tatccgtgac gtctcgcacc tttttatgtt ccatttggtg tatagtatgg 3780 agcgtaccca agtcgacagc ttcgttgagt ttatgaggcg ctgctacatt gcatagtctc 3840 ttgttatcca agtcatactg tccatctttt gtaattttga aaccaaaacc aggagggcca 3900 cggtccccct cgttcttttt caaattacgt ccgaacacgt cgacgctcat tttgataatg 3960 ycgtttacgg gcaatgctca gcgtatataa ccaccttcgc gaagttcttc tatgatcgag 4020 tgaatctcgt ttacgtgact gggattgcca gccgattgtt cagcgataag caaacgtaga 4080 cggtcgacca gttcatttgg atcgtcccaa tatacgtaat ccatacgcgt gttcattcta 4140 gcaattttgt atcgaggcag tgctccacct tccttttctc tctttccgaa gaaaagaggg 4200 gcgatgaagt cygtatattt tttgctattt gaagatcgaa tatctgcctt agggttgagc 4260 ttctttttgt gaacactcgt tgcttctaaa atgctgcgat aatttttcat atcatcagaa 4320 ctgacgattt ttttgtcagg atttctgtaa gcgatcaagt ccatcaaccc agcagttttc 4380 ggatactcaa cattgtcgac tactatcttc ttatcttcga actttactat tgagtttccg 4440 agcatgtagt tttcattctc cttgcgtaat ccatatattt tgtccaaatt cgcctgatta 4500 agaaaaggta aactgggttt actaatcgat gtttcattct cactatcagc tgtatcataa 4560 gctgtctcag ctccgtcatc ttgcatagta gaattcgccc aataagattc attatcctcg 4620 tcaaatttgt aagtctcatc atcatcgtca atgggtcgct cagatttgat cttttttact 4680 tttcgtttga aaggattttt atgcgagttg tctactggtg atgtagcagc agtgatagat 4740 ttcttatttt ccgattcaac tagtttttga agtggatcaa cgattggtct aaaagtttct 4800 tcaacggctt tctcaaaatt atcctttcca aatttcaata agctgtattt gcgcttgaca 4860 gcgtttcgcg ctttgaccag ctgatgaagt acgtttttct cttgatgaaa atcagacatg 4920 ctagcacttt ggcattcaat gtcagactaa gtatacagta ctaagcacga gctatattta 4980 tagcaaactg atcgaacccc tttcgatatc ttccagcatt gattggacta tctttgtcaa 5040 tcatgagaaa tccatgattt acatctgacc aacattgcgc gcataaatcc ttgaagcgtg 5100 cgtatgtcat gtcagaattt acatgatcgt cgtataagtg ctttaaattc acatcatctt 5160 gtcgaaatac taccaacaaa ttgacgttat ctctcacaag atgtttaggc acttgcgcrt 5220 atgactgrca aagatagaag cagtcaacyt ttttatgcct acccatrcaa aagaatgctc 5280 tgacgttatt ctgcttttca caagctatat cgtcgaatat catwattgaa ttrggaagtg 5340 cgctatctgg tgctataact tcttcgtgtt cactgaaagg aaaatacttt atacccttga 5400 gaggctccaa tagttgtttt aaaaatktat actttggctg attaagcgac ttggagtaca 5460 cgtatacatt ttcaaatcta acaccgttgg gattagttat gagcgctagc aagctgttag 5520 tcttgccgca attcgacggc ccgcaaaaaa ccgcccttat gctatcaggc aatagatcac 5580 catgccgctt aactttttta tctcgctcaa cgacggaatc gaagttcggt accgggagct 5640 tgataggctg ttccttaaaa ttcatgcttt gaactgttcg gatgctcgac atctactagt 5700 cgctcaggta taaatagagc gcttatatac tctcgccatc agttaatcat aaaccttcag 5760 aggggaatga tcgtacacag acgacaacga cgtggtcgtg ggttcgtaaa taaaattatc 5820 aacaagcttc cggtggaact tcacttgcca ggatatcaat actgcggtcc aggtactaag 5880 ttagcgaaac gtttagcacg aggagatcca ggaattaatc aacttgacgt agcctgcaaa 5940 gagcacgata ttgcatattc gcagaatcgt gataacgacg aggcaagaac cattgctgat 6000 aaaattttag ctgacaaagc ttcaaaaaga gcttcttcga aagacgctag tatcggtgag 6060 aaaatagctg cttttgctgt ttcaaaaacc atgaagttaa agacaaaact cggaatgggt 6120 gcgaagagaa aaagaagaac taagaaaagc ttacgattga ataaaatcgt aaaagctgct 6180 gccaagtcta tgataccaac caataacgct cgcaaagcca ttctttctgc attgaaaggc 6240 gctcgcgatg cggttaagaa agccggtggc aagcataagg tactggttcc acgcgtttta 6300 cccgtaccgt ccaaagtagg tggattcttg ccattactca tacctatttt cgctggattg 6360 agcgctgctg gcgcactggc aggaggagct gcaggtatcg cgaaagcggt caacgactcc 6420 aaagcagcta agttacagtt ggaagaaact tcgagacata atagaaarat ggaggaattg 6480 agtatcggtc aaggacttta tctgaaacct cacaaaacgg gaatgggtct tcgcatagcg 6540 catggcaaga cattgattta aaaaaaaaaa acgtcactca aactaagatt accagggaga 6600 ccgctcactg actccgacct cgtgaagtat gccaaaattt taaaaatacc tcattttcgc 6660 ggtgtattca tgagaaacga tttgcctgyt aatggtcctc gttacaacga atcagctgtc 6720 gtaaacttgg atgatgctag tgggcctgga acgcactggg tggcatatcg taaacgagga 6780 aaatcagtag tttatttcga tagttttggc aatcttcagc cgcctacaga cttaatgatr 6840 tacttgaacg tgaacggagt caagtacaac ccagacagat atcaagacta taattcattt 6900 aactgcggtc atctctgcct taaatttttg agcaacaagc tcaatactcc tcgtcgaact 6960 atataaagat gcaaaaagtc tgagttgact cattctcaat tcaacagtca tggagaattc 7020 acttactctg actctttctg gcagttcatc catactcgaa gctcagtatt ttccacccat 7080 tgaattatca ccgcataaac aatacgttct tggattagtt gaacttttaa cttttaattc 7140 gatacccaat atcgataaag gaaataataa attttacgtt ggtaaggagg aaatcgtctt 7200 acctactggt agttacaaga ttcaagatat cgatagtyat cttcgtgaaa ttttgactaa 7260 aaagaagatt tcgatcagca tacaacctaa caacaatacg ctccggagta ttataaagtg 7320 caatcgyaag attgattttc gaccgcaaga ctcgattggt gcattactcg gcttcactca 7380 acgcgttctg caagaaaacw tgaagcattc ttctgacttg ccagttgcta ttcttaaagt 7440 gaatgctctg cgaatcgagt gcaacatcac tagcggtgcc tacataaayg gacaattagc 7500 gcatactatt cacgaatttt ttcccgctgt gccgccagga tataagatca tcgaagtccc 7560 gtcgcaagtc atttaccttc caatcaccgt caagagcata gatcacttac aaattcgcat 7620 agtcgatcaa gacggacatc ttgtaaattt tcgaaacgaa actattacta tccgattgca 7680 tctaaaacca acgtaatggg aatagtttat aatagcaaag tcggtagagg ctatataagc 7740 aaaccgacrt gtcgttcagt tatcagtcgt cggcgaaacg ccaaagygct aacacggcaa 7800 aacatacagt tcctaaaaag cctaaatttg cgagtcatcg ctcctggtgg aaaataaatt 7860 gcatactatt ctgctgtttg acaaaaaatg gacgagattc taagcataca gtcaccagta 7920 agctttgatg agtcgttagc acactatgag ctgcacgctc atcaacccta cacggtctca 7980 tcttacaaca acagcgacga aattcgcatc gccattcaac atcaagattt gagccttttr 8040 ccatcacgta gttcgctgca cgtttgcggt aaattaacta aaccaaatgg cactgctcta 8100 gcacgtacta agctagtaaa caatgctatc tgtcacatgt tcgaagagat cagatatgaa 8160 attaatgctg tagaaatcga caagtgtaaa aatgtcggtt tgaccacagt catgaaaggc 8220 tggatttcac acaatcccag tcaaagttta ataatggaaa atgcaggctg gttagacatt 8280 gcagagacta aatcactgac caatgcctcc ggctacttcg acgtgaacat accgctgagt 8340 atgatattcg ggtttgccga agattacaga aagatcgtag tcaacgtgaa acacgagctt 8400 gtactaacga ggtcgcgaaa cgatttgaat gcaattattc agacggcgac tcttgcggat 8460 ggtgtcgcta ccttcgagga atataaatta gaattgacaa aaattgaatg gctcatgccg 8520 tacgtcgtag catccaacac taataaaatt cgacttctaa attacataga aaagaatcgt 8580 ccgattagta tgagtttccg cagttgggag ttatacgagt acccggttct tccaacctcg 8640 actaaaaatg tgtggactgt caagacttcc aatcagttgg aaaaaccacg cttcgtaatc 8700 ttaggatttc aaacaaatcg taagaatcag caagctgaga atgccagtca attcgaccat 8760 tgcgatataa gcaacgtgaa gcttttccta aactcgcagt actatccata cggcaactta 8820 aatttggaca tcaaccgtta tcaatatgcc gtattgtacg atatgttcgc gaatttccag 8880 agcctctact acgacaaagt ttcagagcct gtcgtaaata agaacgattt tatttcacgc 8940 ttaccactca tagtaatcga ctgctcgaag cagaacgagt ccttgaaaag tgctccagtc 9000 gacgttcgtc ttgaatttga atctcgagac aactttccag ctggaacttc agcttactgc 9060 ttgatcttac acgatcgcat cgttcagtat aatcctgtca gcggtgatat caagacgctc 9120 atataagatg gagtacgcag tggacatgca gggcttcaag aaacccggaa aagacttcgt 9180 cttgaaggag ctggccatac tacccctcga ggaagacgct gaacctgttg tacttttatt 9240 caaagaacca tttccttggt gcaagttatc tcagaagtac aagcgtgaaa atttatggct 9300 tgaactttat catcacggat tgagctggga ctccggtgat cacgaataca ctgaaatagg 9360 aaatcttctt cgagatgcac tcaaggacgc cagcaagatt ttcgccattg gtgatgtacg 9420 aaagaagtgg ctcgaacgat tcaactttaa agtcaccgac atcactgatt gcggctatcc 9480 agccaccgat catccaaaac tagtcaccat atgcacgaat cataatggag cgcacaaagc 9540 cacctgtgca cgacagaacg tcagactcat gaggcgttac tacttcgata acgtgttcat 9600 ggagtgggat gacatctcgt ccgatgaagc ataaatactc gcgcaatcgc tcattcgagt 9660 catttcatca aaatgagttc aaaccgaagc agctccggct acacgaggct cccaaatgac 9720 gacgacgacg ataaaacaaa aaaagattcc tcggcatcat cctcctcctc ctcctctaaa 9780 ttgaaagagt attcgcaaaa gtattctaca aatacgaatt ctctgcctga acaaaatggc 9840 tacaaaagat tcaagtagcg cgctaaaaag tcacaatggg tgagtcgcta aggcgacagg 9900 ctcgagatcg aaaaatcccg agttcaaatc cccgtgactt catttttttt ttcwacttac 9960 caaaataata taatttaaaa atrtatgaat tagagtcact gaaaaatcat tagtagtagt 10020 agtagtagta gataagctta tattngtgat tyatacttaa cttaataaca atttctgatg 10080 ggtaagtcgg taagatgctc ggctggcaac ccaaaggttc ccggttcgaa tccccgcaaa 10140 accgaaattt ttttcttact tcgcagtgaa ttagraacga attttgtgaa gtaaagtagt 10200 attaccatga agtgatatca ccatgaagtg gtatcaccat gaagtggtat caccatagct 10260 tatctactgc ca 10272 // ID Gypsy-53_CQ-I repbase; DNA; INV; 3893 BP. XX AC AAWU01017349; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-53_CQ_; KW Gypsy-53_CQ-LTR; Gypsy-53_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3893 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 485-485 (2011). XX DR GenBank; AAWU01017349; Positions 44735 40843. XX CC Positions [2996-3511] - Integrase core CC 'ACTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 968..3877 FT /product="Gypsy-53_CQ-I_1p" FT /translation="MKGAAAFDLHAANGSRIKVYDKRVLTTDLGLRRRFNW FT LFLVADVGMAIIGADFLAAFGIVVDLKHRRLIDGLTKLSSTGGLTTASVHS FT VTVVATDHPFRDLLVEFRDVTLPTTMKSAAQHDVTHHIQTKGPPVSSKCRR FT LAPDKLEAARKEFQVMSELGICRPSSSSWASPLHCVQKKNGQWRFVGDYRG FT LNQVTVPDRYPVPHIHDLLNAFQGKSIFTTLDLERAYHQIPVEEDDIPKTA FT VITPFGLFEFTRMQFGLCNASQTFQRFMHKVFADMEYVVVFVDDICIASAS FT PDEHYMHVRKVFERLQMYGLVINVPKSHFAQSTVEFLGYTVSGDGILPLPD FT RVKAVTEFKLPSTVKQLRRFLALVNCYKRFLPHATDTQSHLRDLIPGNKKN FT DSRKLEWTDNAREAFEKCKQSLATATLLHYPDSTKPLSLLIDASNTAAGAV FT LQQFADGQWNPLGFYSEKFSPSQQRYSTFGRELTAMKMAVQYFRHMLEGRR FT FVIFTDHKPLTTALTSSPNSRLPHEDRYLRYISEFTNDIRHISGQHNVVAD FT ALSRIEAVAAPSPIDYGQIVADQTNDSELQRLLSSSTGLKIELRQTPLTPK FT PLYCDTSQGKVRPYIPPQHRQLVLQHIHGLAHPGIRSTRRMVTDRFVWTHI FT NKDVKHYVQSCVQCQRSKIHRHTKAPLEKFAPPNSRFRHIHIDLVGPLPPC FT NGNRYLLTAIDRYTRWPEVMPLPDMTAETVARALCEMWICRFGVPETIATD FT QGRQFESELFRELSRIIGCKHIRTTAYHPQSNGIIERFHRHLKSSIMAVDS FT KHWCDKLPLIMLGLRAALREDFGCSAAELVYGQTLRLPGEFLEQQTNNEDR FT TEFAKQLRNAMGKLKPVDPDHHAKPAVFVNRRLEKCSHVFVRIDAVKRPLV FT QPYEGPFKVVARDKKHMDVEINGKTQRISIDRVKPAYTCDPAIDEHPRSDG FT KTVVTPSGHRVRFLV" XX SQ Sequence 3893 BP; 945 A; 1115 C; 1041 G; 792 T; 0 other; attggtgacc ccgacgcgta aaagcggaag tttcgacaat gcccgacccc gatcctaccg 60 tcgtcgaaaa cgcagcagtg gccgccagcg tggccgtaag acttccggat ttctggaaga 120 acgatccggg catgtggttc gcacaggcgg aggcgcagtt cgagttggcc aacgtagtgc 180 gggaccacac caagttcaat cacattgtgg ccaagattga ccagacggtg atttgtcacg 240 tcgccgacat tattacgaac ccgccggaag ctaacaaata cgcagcggta aaggagcgcc 300 tgatttctcg gttccaagtt tcgcctcagg gcagattgga acgccttctg aatgcctgcg 360 atctggggga catgcggccg acccatctgc tcgcgaaaat gatggaactt tcggcaggtc 420 tgaaaatctc cgacgacctg ctcaagatgc tgttccttca gcgactcccg ccggacgtca 480 agaacattct cacaatcagc gacgggacga cggtcggcaa gctggcagaa atggcggaca 540 aaatgatgga ttcctcaacg cgtcacgttt ccgccgcgtc cagcgctccc acgacgaagc 600 ccgaagaccc cgatctgcgc tgccaaattg ctgctctcac cgaagaaatt cgtaacctca 660 aagctgatcg cggcagaagc ttctcccgca gccgttcagc agcgcgatcc agttcccgga 720 gcacgtatgg agacggagac gaaatctgct ggttccaccg gaagtacggc ggccgtgctc 780 tgaggtgtcg ggacccctgc aagttccaga atcaaaaaaa ctagttccgc gttcaccaca 840 gtcggcggag gtggacggaa gtactcaaag ccgccgaatt ttgttgtatg atcgcacaag 900 caacacacga ttcctgattg acacaggctc ggacgtctcc ttgattccgg ccactaacag 960 agaccggatg aaaggagcag cggcattcga tctacacgcg gccaacggct ctcgtatcaa 1020 ggtgtacgac aagcgcgtgc tgactaccga tttgggactg cgacgcaggt tcaactggct 1080 cttcctggtc gctgacgttg gaatggctat aatcggcgcg gatttcctag cggccttcgg 1140 aatcgtcgtt gacctgaaac accgccgtct gatcgacgga ctcaccaaac tgtcgtctac 1200 cggcggtcta acaaccgcaa gcgtccacag cgtaacggtc gtggccacgg atcatccgtt 1260 tcgagacctg ctcgtcgagt tccgtgacgt cacccttccg acgaccatga aatctgctgc 1320 gcaacacgac gtcacgcacc acattcaaac gaagggtcct ccggtgtcga gcaaatgtcg 1380 acgtttagca ccagataagc tggaagccgc taggaaggag tttcaggtca tgagtgaact 1440 cgggatttgt cgtccgtcca gcagcagctg ggcgagccct ctccactgcg tgcagaagaa 1500 gaacggccaa tggagatttg ttggagacta ccgcggcctg aaccaggtga ccgtccctga 1560 tcgctaccct gtcccgcata ttcacgatct actgaacgca ttccagggga agtcgatctt 1620 caccacgctc gatctcgaaa gggcttatca tcagatcccg gttgaggagg acgacattcc 1680 aaaaacggcg gtgattactc catttgggct cttcgagttc acccgaatgc agttcggctt 1740 gtgcaacgcc agtcagacct tccaaaggtt tatgcacaag gtcttcgcgg acatggagta 1800 cgtcgtagtc ttcgtagatg acatctgcat tgcttcggct tctcccgacg agcactacat 1860 gcacgtgcgc aaagtgttcg agcgattgca gatgtacggg ctggtcatca acgttccgaa 1920 gagccacttc gcgcaaagca ccgtggaatt cttgggctac accgtcagcg gtgacggtat 1980 ccttccactg cccgatcgag ttaaagctgt taccgagttc aagttgccgt caaccgtcaa 2040 gcagcttcgt cgatttctgg ccctggttaa ctgttacaaa aggttcctgc cgcacgcgac 2100 agacacgcag tctcacctgc gagatttgat accgggcaac aagaagaacg attcgaggaa 2160 gctggagtgg accgacaatg ctagggaagc attcgaaaag tgcaaacagt cgttggccac 2220 cgcgactctt ctgcactatc cggattcgac gaagcctctg agcctgctga tcgatgcgtc 2280 gaatacagca gctggagcag tcctacagca gttcgccgac ggccaatgga atcccttagg 2340 gttctactcc gagaagttca gtccatccca gcagaggtac tcaacgtttg gacgggagct 2400 cacggcgatg aaaatggcgg tgcaatattt ccggcacatg ctggagggga ggcggttcgt 2460 catcttcaca gaccacaaac cactgaccac tgctctgaca tccagtccaa acagtaggct 2520 cccccacgaa gatcgctact tgcggtacat ctcggagttc accaacgaca tccggcatat 2580 cagcggccaa cacaatgtcg tcgcggacgc gttgtcccgg atcgaagccg tagcagctcc 2640 atcgccgatt gattacggac agatcgtagc agatcagaca aacgacagcg aactgcagcg 2700 cctactgagc tcgtcgacag gactgaaaat tgagctccgt caaacaccat taacgccgaa 2760 gccactctac tgcgacactt cacaaggtaa ggtcaggccc tacataccac cacaacaccg 2820 ccaactggtg ctccagcaca tccatggcct ggcgcatccg ggtatacgat cgactcggcg 2880 gatggttacg gaccgcttcg tgtggaccca catcaacaag gacgtcaaac actacgtcca 2940 atcctgtgtg cagtgtcagc gatccaagat ccatcggcat accaaggccc cgctggaaaa 3000 gtttgctccc cctaacagcc ggttccgcca catccacatc gatcttgttg gaccgttgcc 3060 gccgtgcaat ggtaatcggt atctgctaac agctattgat cgatataccc gttggcccga 3120 agtaatgccg ttacccgaca tgacggcgga aactgtggca cgtgcgctgt gcgaaatgtg 3180 gatttgtcgg tttggagttc cggaaaccat cgctacggac caaggcaggc agtttgagtc 3240 agaactcttt cgggagctct ccaggatcat cggttgcaag cacatcagga cgacagctta 3300 ccacccacag tcgaacggca tcatcgagcg cttccaccgg catctgaaat cgtccataat 3360 ggcggtggac tccaagcact ggtgcgacaa gctcccgttg atcatgcttg gtttgcgggc 3420 cgcgctccga gaggactttg gatgctctgc ggcggagttg gtttacggac aaactctacg 3480 attgcctggc gagtttcttg agcagcaaac caacaacgaa gaccggacgg agttcgctaa 3540 gcagctccgg aatgcgatgg gtaagctgaa accggttgat ccagatcacc acgccaagcc 3600 agctgttttc gtcaaccgcc ggctcgagaa gtgctctcac gtcttcgtga gaattgatgc 3660 agtgaagaga ccgttggtgc agccttacga gggaccgttc aaggtggttg ctcgtgacaa 3720 gaagcacatg gacgtcgaaa ttaacgggaa aacacaacga atttccatcg atcgcgtcaa 3780 accggcctac acctgcgacc cagccattga cgaacatccc agaagcgacg gaaaaactgt 3840 agttacacca tctgggcata gagtaagatt tctggtgtaa ctggggggga cta 3893 // ID Kolobok1-N1_NVi repbase; DNA; INV; 2156 BP. XX AC . XX DT 06-MAR-2008 (Rel. 13.03, Created) DT 10-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Kolobok-type family. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW Kolobok1-N1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2156 RA Jurka J. and Bao W.; RT "A distinct subgroup of Kolobok-type DNA transposons."; RL Repbase Reports 8(3), 174-174 (2008). XX DR [1] (Consensus) XX CC Putative non-autonomous. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 793..1434 FT /product="KolobokN-1x_NVi_1p" FT /translation="MCDNRDAKGRFASTHPDKKYSRGRGKKRPRRFLGVTS FT EDEPANKSRPSSSARKIQTPAAEGTSEVRNRLIDINILFPALEELLICKQC FT HGKVQLGESEAQGLGFKIEVVCIECGQLAAINSCRKVGVKNHAWENNRRAV FT VTFRALGLGHAGLSTFCGLMDCLKPMAQSAYDTINEQFSEVCEKVAKDSMK FT TAVIEEKAATETGAPGSNIWRLDW" XX SQ Sequence 2156 BP; 730 A; 328 C; 430 G; 667 T; 1 other; ggggacacaa cacctttaaa cttcgaaaaa atcgattttt ttttattgct tattttgaca 60 ggaaatgttc cgtagaacac gctcctgaaa cccgtttatc aaaaaaaaaa aaaaaattcc 120 cgcaaagtta taagcaaaat actaaaacaa aatttttatc gttttttcgt aaaattgcgt 180 atttttcatt ttttgtctaa tacgactaat aattttaatt ttaagtttta taaattcaat 240 ctgaataaaa tttttattgt aaatgttgaa aatactattt ttttaccgta gtatatacgt 300 atattatgct ccgcatacat attttagtaa ataaatttga gcgtattgat aagatttttt 360 tgcgttattt tcgagataaa tttctagttt cgcttcttat ataaggctcc gatggaaacg 420 tagggacatt tatttgaatt caaatatcgc gcgcccgcgc gccacctgtg ggcaggttag 480 aatcaatgtg tcggagaagg acagagcgac cgttctgctg ttctgctagc agacagcaaa 540 tatttcaaaa cacagctgat atagtcctgg gatgtgtttt acactgagta aacacagctg 600 atgtcttaat actatwaaaa aagctgatat tttttaaaag tgtttttaaa ttaatctaag 660 tagtgtttgt agtgtttttg gggttatagc ttatagctta taaccctaaa aagtaaaaaa 720 agtgaagtgt tagtgtaaaa gcagagcagt gaagttattt aaaaaagtgt gaagtgtagt 780 gattttaaaa atatgtgtga taatcgagat gccaagggaa ggtttgcgag tacccatcca 840 gacaagaagt actctcgagg aagaggcaag aagaggccaa gaagatttct aggagtaaca 900 agtgaggatg aacctgccaa caagtcaaga ccttcaagtt cagcaagaaa gatacaaact 960 ccagcagcgg aaggaacctc tgaggtcagg aatagattga ttgatatcaa cattctattt 1020 cctgcgctcg aggaactttt gatttgcaaa caatgccatg gcaaagttca acttggcgaa 1080 tccgaagctc aaggattagg attcaagatt gaagtagtgt gcattgagtg tggtcagctg 1140 gcagctatca actcatgcag aaaagttggc gtcaagaacc atgcttggga gaacaacaga 1200 agagcagtgg ttacttttcg tgctcttgga cttggacacg ctggtctatc aacattttgt 1260 gggttgatgg actgcttaaa acccatggct cagtctgcat atgatacaat caacgaacaa 1320 ttcagtgaag tatgtgaaaa agttgccaag gactccatga agactgcagt gatagaagaa 1380 aaagccgcta cagagactgg agcacccgga tcaaacattt ggaggttaga ctggtaattt 1440 tctgtcagtt ttaacttgaa gattcaacac agtgaccaaa agaaaagata agtataaagt 1500 gttattatta tcaaggataa aataaccttt ttatcacatt tttataaaaa ttcatgaaat 1560 gccttttttt ggttgcaggt aagaaaagcc gtacagaacc cagaggtata agtgaatata 1620 cctcataatt gctgagagtc aaggtctatg tgtgtggaac atagaaaact tggtcccaaa 1680 ggatttggta agcactaatt taagttacac tttcaacaat tttcataaac tttaaagttg 1740 attttctcta aatgttgttt ttactgattt cagctcactg tccgtaggat tcaagtgaac 1800 gaatggtttt ttgggctcaa actcaaggaa gatactttgt acacctgtag ttttggtatg 1860 aatgcaggga ttttgatttg aatgtttaga gtttatacac aaaaaggtgc cagaataagc 1920 gaaaaaaaaa atcagaaaaa aactattcgt tcacttggaa tcctacggac agtgaaaaag 1980 ttcacaaatt cttgcaccga gtgccacaat gggccagcag gaataatgaa ggttttagca 2040 tgtaaaatta aaggtattta tgctaaatat atctataaaa aattttaaat gatttactta 2100 aaatttatta atttttctca gagttctttt ttttagggtt taaaggtgtt tgcccc 2156 // ID Gypsy-2_Cfl-LTR repbase; DNA; INV; 196 BP. XX AC AEAB01029725; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE LTR retrotransposon from the Florida carpenter ant genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-2_Cfl-LTR. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-196 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01029725; Positions 272 467. XX SQ Sequence 196 BP; 61 A; 45 C; 28 G; 62 T; 0 other; tgtagaaaca cgtctcaacg aaaacgcaga ataaaaccac attcgtatag cttatcaacg 60 gtattttctc ttatcccgtt gcagtactgt agaaacacgt cacgacgata acgcagaata 120 aaaaccacat tcgtttagct tatcaacggt attttctctt atcctgttgc cattttgtac 180 tacagttttt cttaca 196 // ID Gypsy-55_AA-I repbase; DNA; INV; 7739 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-55_AA_; KW Gypsy-55_AA-LTR; Gypsy-55_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7739 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 3606434 3598696. XX CC Positions [5259-5741] - Integrase core CC 'ATTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 451..2838 FT /product="Gypsy-55_AA-I_1p" FT /translation="MSYYNINANALDEEELNYELCIRGCSVDGSLDTRRRT FT LRKVLREVEVAEVRESVHSFDDEFEVVPVKLQQIERQLQTGVESGSWSRLV FT HLHKRLRRYGTISQDQFNRKSALLDAVTRMAKAYFNVDLNEIGKDTPVMTL FT VPEGVALGASGGDQPLEDFGAWQLPTGMSGVVSSDGQRLGAVPKKQATGES FT LILFSPDPNEELNSERRRQTFPATTGWSSSYGQPLPPVNASFVNASPERNQ FT PIPGAMENENGVLGTPQYRTSVQMQINRPSQVSKGFVHPRGHETNEQPLPN FT RSLHEEPVQPYGTIGAEVKQHGLLDECQHRTSDAAYTSPVWSQPMFQAGSS FT HSLRLDDRPPLMTTVMANDPNLAGKRRDESEYIHKSEIEGYIRNYLSQMYG FT TASSAPAVNQPVVQNIVDQFASLQMHTENAPRVHHPHPINQANQVPKLSLD FT PPLQLGGDRREMDGPVRSPRIQTPWINEGVQNMSTRPVSRTPFIAAPPQNL FT RISSPWVNPNNSFRTRLAHQTCNIIEKWPKFSGDSNSVPVVDFLRQLEILS FT RSYQITNQELRMHAHLLFKDDAYVWFTAYDSKMDTWETLLTYLKMRYDNPN FT RDRYIKEEMRNRKQKPNELFSAYLTDLEAMSQRMIKKMTDEEKFDIIVENM FT KLSYKRRLALEPVYSIEHLAQLCYKFDALEGNLYNLRTANKPKVLNQINVE FT DEYFDELVDGDVAELFAIQRNKNLKPVARLPLKESVRKSTEESLCWNCRET FT GHLWRDCDQRKMIFCHICGNPETTAYQCPQNHNLRPKGPSVSKND" FT CDS 2874..6116 FT /product="Gypsy-55_AA-I_2p" FT /translation="MSNEIVPQPNFQYFSQNYSITTKFRRCPHLQVRILDE FT EIEGLADTGAGVSIISSLALIEKLGLQIQRCNVKIRTADSTEYTCLGYVNV FT PFTYHDQTKVIPTIIVPQVSKILILGMDFLNAFNFRLVVTTNSEPQEIQNS FT GDEEEAVDALLIEDFNGDNDQICFEIVPEEIPKPSSLNPETDESLEMPTIE FT VPQQTIHSPADLVTEHELSAKERQELFEAIRILPATADGHLGRTSVLKHSI FT ELLPGAQPRKIPNYRWSPTVEKVIDEEIQRMLNLGVIEECTGPVDFINPLL FT PVKKSNGKWRICLDSRKLNSCTKRDDFPFPNMLGILQRIQKSKYFTAIDLS FT ESYYQVGLDESSKNKTAFRTNKGLYRFVVMPFGLTNAPATMARLMSKVLGH FT DLEPFVYVYLDDIIIISKSFEHHCQLIRIVARRLQQAGLTINLQKSKFCQK FT RIRYLGYVLSEDGLSMDASKIQPIIDYAIPQTVKDVRRLLGLAGFYQKFIR FT NYSEITTPITNLLKKDRKKFHWTEEANVAFEKLKNALISAPILSNPDFRLP FT FIIETDSSDTAIGAVLVQVQEGERKTIAYFSKKLSSTQRRYSATERECLAV FT LLSIENFKHFVEGSQFVVQTDAMSLTFLKTMSIESKSPRIARWALKLSKYD FT ITLQYKKGSENIPADCLSRSVNHVDVDLPDPYIAGLKSQIEKFPERFSDFK FT ISDEKVYKYVTNSATIEDPSYKWKHVVPLRDRINVVKEIHDQAHLGYLKTL FT AKVRERHYWPRMSSDIKRFCSSCQICKESKTPNVNVRPVCGKPKLCSRPWE FT MISLDFLGPYPRSKKGNVWILVVCDFFSKFVLVQCLRTATAPAVCTFLETN FT VFTLFGTPSVCISDNAQVFKSGLIQKLLEKYGVSHWNLAVYHPSPNPTERV FT NRVIVTAIRCALNQQTSHKDWDESIHTIAMAIRNSVHDSTGYTPYFVNFGR FT NMISNGQEYEQIRDLGDGGNYEPPQLNENMRKLFEIVRRNLQKAYEKYSYS FT YNLRANTRHHFQKGEIIYRKHVNLSDKSKDFVGKFGTKFSKARIREKLGTN FT TYILEDLDGRRIPGTFHGSFLKKS" XX SQ Sequence 7739 BP; 2367 A; 1513 C; 1698 G; 2161 T; 0 other; ttttggcgcc caacgtgggg ccgagaaaca gttagttgat tttgtaattg ggttttgccc 60 gtgaaataaa ttaggatttt gaattgaatt tagtttgaat tgaaattaat tgcttgttct 120 gcactaaact ttctaggatt ttgaattttt ctattgataa gttagcttta gatttcgttg 180 ggattagcct taagatcagt aattgataag gaaatttggg tctagggtaa ggtacaaaga 240 tctttaagaa gtttagtgct tacaactttt gatttgatag tctttctttc tttcttgctt 300 atattttttt taattggtct caattaaaat ttaattctgt gtttattcca tttcaattct 360 catttgtttc attattctcg aacgtaatat tatcatcggt ctatcgtgaa ttttctatcg 420 ttaattgaat ttgaaaattg gtttacaatt atgtcatact ataatataaa cgcgaacgct 480 ctcgacgaag aggagcttaa ttacgagttg tgtatccgcg gttgttcggt agatggatcc 540 cttgatacgc gtcgtagaac attgcggaaa gttttgcgtg aagttgaagt tgccgaagtg 600 cgtgaatcag tgcattcctt tgatgatgag ttcgaagtgg tacccgtgaa gctccaacag 660 attgagagac aactccaaac tggtgttgaa tctggaagtt ggtcacgttt ggtgcacttg 720 cacaagagac tccgccggta cggaacaatc agtcaggatc agtttaatcg aaagtcagct 780 ttgctggacg ccgtgacgag aatggcgaag gcatacttca atgttgactt gaatgagatt 840 ggtaaggaca ctccggtgat gacgttggtc ccggaaggcg tggcgttagg agcttctgga 900 ggagatcaac cgcttgaaga tttcggtgct tggcagttgc caacgggcat gagtggtgtg 960 gtgagttcgg acggtcaaag gctcggtgcc gttccgaaga agcaagcgac gggagaaagt 1020 ttgattttgt tctctccgga tccaaatgag gagttgaata gcgagcgacg tcgtcaaaca 1080 tttccggcga cgactggatg gtctagcagt tatggtcagc cgctaccacc ggtgaatgcg 1140 tcatttgtga atgcttcgcc tgaaaggaat caaccgatcc ctggagcgat ggaaaacgag 1200 aatggcgttt tgggcactcc acagtatcga acttcggtgc agatgcagat caatcgtccg 1260 tcgcaggttt caaaaggatt tgttcatcca cgcggtcatg aaacaaatga gcaaccgctt 1320 ccgaatcgtt cgctacatga ggaaccagtg cagccgtatg gaacaatcgg agctgaagtt 1380 aagcagcatg gactgctcga tgaatgtcag cataggacca gtgatgcagc ttatacttct 1440 ccggtttggt cacagccgat gtttcaggca ggatcttcac acagcctccg tctcgatgac 1500 agaccaccac tgatgacaac agtgatggcg aatgatccga atctcgctgg aaagagaaga 1560 gatgagagtg aatacattca taaatctgaa atagaaggtt atatccgaaa ttacctgagt 1620 cagatgtatg gtacagcttc tagtgctccg gcagtcaacc aacccgtggt tcaaaacatc 1680 gttgatcagt tcgctagctt gcaaatgcat actgaaaatg ctcctcgagt gcatcacccg 1740 catccaatca accaggctaa tcaagtgcct aagttgagtc ttgatcctcc gttgcagtta 1800 ggaggagatc gaagagaaat ggatggacct gtgagatcgc caagaataca aacaccgtgg 1860 attaatgagg gagttcaaaa tatgtccacc agaccagtct cgagaacacc attcatagct 1920 gcaccacccc aaaatctaag aatatcttcc ccttgggtaa atccgaacaa ctcgtttagg 1980 acgagactgg cacatcagac ctgtaacata atcgaaaaat ggccaaaatt ttcgggagat 2040 tcgaattctg taccagttgt tgattttcta cgccaattgg aaatattgag tcgttcgtat 2100 cagattacaa accaggagct gcgtatgcat gcgcatcttc tttttaaaga tgacgcgtac 2160 gtgtggttca cggcatacga tagcaagatg gatacgtggg aaaccctctt gacctacctc 2220 aaaatgcgct acgataaccc aaacagggat cgttatatca aggaagaaat gcgtaaccgg 2280 aaacaaaagc cgaatgaact gtttagtgcg tatttaacgg atttagaagc aatgtcacaa 2340 cgaatgatta agaaaatgac tgatgaagag aagtttgata tcattgttga gaacatgaag 2400 ctgtcgtata aacgaagatt agcgttggaa ccggtgtact caattgagca tctggcacaa 2460 ttgtgctata agttcgatgc tcttgaggga aatctgtata atcttcgtac cgctaacaaa 2520 ccaaaagtgt taaaccaaat caatgtggag gatgagtact tcgacgagtt agtagatggt 2580 gatgtagcag agcttttcgc tatccagcgt aataaaaatt tgaagccagt agcaaggtta 2640 cccctcaaag aatcggtaag aaagagtaca gaggaaagtt tgtgttggaa ttgtcgggaa 2700 actggacacc tttggcgaga ttgtgaccaa cgtaaaatga ttttctgtca catttgcgga 2760 aatcccgaga ccacagcata tcagtgtcct cagaatcaca atctgcggcc gaaaggacct 2820 tctgtttcaa aaaacgacta aaatccgaga atttcgggaa cgattctcag gagatgtcta 2880 atgaaatagt tccccaacct aacttccaat acttcagcca aaactacagc ataactacca 2940 aatttcgacg atgcccgcac cttcaggtaa gaattttgga tgaggagatt gaaggactag 3000 cagataccgg tgccggggtg tctataataa gctctctagc cttgatagag aaacttggcc 3060 ttcaaataca acgctgcaat gtcaagatcc gtaccgcaga tagtaccgaa tatacttgct 3120 tgggttatgt caatgtgccg ttcacatatc atgatcaaac gaaagttatt ccgactataa 3180 ttgtgccgca agtgtcaaag attttgatat taggcatgga cttcttgaat gctttcaatt 3240 tcagactcgt agttacgaca aattcagaac ctcaagaaat tcaaaattcg ggagatgaag 3300 aagaagccgt tgacgctctg ttaatagaag acttcaacgg tgataatgat caaatatgtt 3360 ttgaaatagt ccccgaagag attccgaaac ctagctctct taatccagag acagacgaaa 3420 gtttagaaat gcctacgatt gaagttcctc agcaaacgat ccatagccct gctgatctgg 3480 ttaccgaaca cgagttatcg gcaaaagaac gtcaggaatt gttcgaggca ataaggatac 3540 tacccgcaac tgcagacgga catctaggga gaaccagtgt cttaaagcat agtatcgagt 3600 tgcttcctgg agcacaacct cgtaaaatcc caaactaccg ttggtctccg accgttgaga 3660 aggtgattga cgaagaaatt cagaggatgt taaaccttgg agttatagag gagtgtaccg 3720 gacctgttga ctttatcaat ccattactgc ctgtgaaaaa gtcgaatggc aaatggagaa 3780 tttgcttaga ttctcggaaa ctgaattctt gtacgaagcg tgacgacttt ccgtttccta 3840 atatgctagg aattttacaa agaatacaga aatctaagta tttcactgcc attgatctgt 3900 cagaatcata ttaccaggtt ggtttggatg agagctctaa gaataaaaca gcatttcgta 3960 cgaacaaagg actctaccga tttgtagtta tgccttttgg cctcaccaat gctccagcta 4020 ccatggcacg attaatgtcg aaagttttag gccacgacct ggaacctttc gtttatgttt 4080 atcttgatga cataatcatc atttccaagt cgtttgagca ccactgtcag ctaataagaa 4140 tcgtggctcg gcgtttgcag caagctggct tgacgatcaa tctacagaag tcaaaattct 4200 gtcagaaaag gattcggtat ctcggatatg tactttcgga ggacggcctc tcaatggatg 4260 cgtcaaaaat acagcccatc attgattatg cgattcctca gacggttaaa gatgtgagaa 4320 ggctcttagg actcgctggc ttctatcaga aatttatacg aaactattcg gaaatcacga 4380 ctccgataac gaatcttctc aaaaaggacc gtaaaaaatt tcattggact gaagaagcaa 4440 acgtagcgtt tgagaaattg aaaaacgctc taatttccgc accgatactt tcaaaccccg 4500 atttccgttt accttttata atcgagacag acagttcgga tactgccata ggtgccgtat 4560 tagtccaagt tcaggaaggc gaacgcaaga ctatagcgta tttttcaaag aaactatcca 4620 gtacgcagag gagatatagt gcaacagaaa gagaatgctt ggcagtgctc cttagcattg 4680 agaattttaa gcactttgtg gagggcagtc agttcgtcgt ccagacagac gccatgagtc 4740 tgactttcct taaaaccatg tccatagaat cgaaatcacc gcgtattgcc aggtgggcac 4800 taaaactgtc aaaatatgac attacgctgc agtataagaa agggtcggaa aatattcctg 4860 ccgactgtct ttccaggagc gtgaatcacg ttgatgtaga cctaccagat ccatacattg 4920 ctggtttgaa gtctcaaatt gaaaagtttc ctgagcgttt ttcggatttc aagatatccg 4980 atgaaaaagt gtacaagtat gttaccaact ccgctacaat cgaagatccg tcttacaaat 5040 ggaaacatgt agtaccacta cgtgatagaa tcaacgtagt caaagaaatc catgaccaag 5100 cacatttggg ctaccttaaa accttggcta aagtccgcga acgacattat tggccaagaa 5160 tgtcttcgga tatcaaaaga ttttgttctt catgtcagat ctgcaaggag tcgaaaacac 5220 ccaatgtgaa tgtgcgacct gtttgtggaa aacccaaatt atgcagtcga ccgtgggaaa 5280 tgatatcact tgatttccta ggaccatatc cacgatcaaa gaaaggaaac gtctggatat 5340 tggtcgtgtg tgacttcttc tcgaagtttg tcctcgtaca atgtctccgt accgcaacag 5400 ctcccgctgt gtgtacgttc cttgagacta acgtatttac cttattcgga acaccatcag 5460 tatgcatatc agataatgca caagtattca agtcaggact gatccagaaa ctccttgaga 5520 agtatggtgt gtcacattgg aaccttgctg tttaccatcc aagtcccaac ccaacagaac 5580 gtgtgaatcg agtgatagtg acggcgatta gatgcgccct caatcaacag acaagtcaca 5640 aggactggga cgagtccatt cataccatcg ctatggctat cagaaacagc gtacacgata 5700 gtacaggcta tacgccgtac tttgtaaact ttgggagaaa tatgattagc aatggacaag 5760 aatatgaaca aattcgcgat ttaggagatg ggggtaatta tgaaccaccg cagttgaacg 5820 aaaacatgcg aaagctattc gaaatcgttc gtaggaatct ccaaaaggcc tacgagaaat 5880 attcctactc ctataacctc agagctaata cgcgtcatca cttccaaaag ggtgaaataa 5940 tctatagaaa acatgtaaat ctgtcagaca aatcgaaaga tttcgtggga aagttcggta 6000 cgaaattctc taaggcgaga atccgcgaaa agctaggtac taatacatac attttggaag 6060 atctggatgg tcgtcgaata ccagggacct tccatggttc attcctgaaa aaatcctaaa 6120 gtaccaccag atcgtgaact attggaaact ataacattag ctatgacggc gcacacgcct 6180 gttgcgcata aacaaatgaa cattcaatgc caaaagaaaa actccttctg aggtgcataa 6240 ggcagcagtc caacgttcga gatgttcttc gatttcctca cgttgttgac gcaaactcac 6300 tttgagttaa acaccagcta agacggcgta attgcctatt gcgcataaac aaagtatcag 6360 caatgatgaa aaaactcctt ttgaggtgga aagggcaacc gtccaacgtt cgagatgttc 6420 tatgatttcc tcacgttgta gacaaacttg attgaccata aaaacaagct atgactgtac 6480 tacactcccg ggtagtacat aaaccactaa ttagaaaact cccaagatac acttctgtgg 6540 tgctacggtg taataacgat tcgagatgtc cttgtgtttc ctcgttcgac aatgggaatc 6600 acccatttga gaatcaaaaa gcgatttcgt tggagcataa atccaattgg aattgaatga 6660 ggtgctggaa tttactcgca ctaagtacac agctatgtag aagtcacgat agagcttgga 6720 ccacgaacat actcataaaa cactcataga ggtaacaaac cttacaggtg aaacggtgct 6780 ggttgagtgt ccaataccta gattcagttc gttccgacct cccagtgatc cgaagtaacc 6840 agattcgtgt tttgttgcgt aatcgttttt gtcctacaag aaaaaaaaat atgtttgttt 6900 tggttgttct agtcggtcga agagaactgc aaatacatac cagctatagt aatccgccta 6960 cgacaatcac aatcacaata tccatagtag tccgcaatag cccggtttag tttttcaata 7020 gtttctagtt tatactttat tatcactaat taatttccaa tcacatccat ttgaatcaag 7080 cattgttttg tttttcttaa ctcgatccgt ttgacagttc tcgcgattat ctctttagtt 7140 cccgttcgta aaatttgttc gtgtatgtgt tcgtgaaacg ttcggtaacg ctatgtgaat 7200 atgcacggaa aatagggcgt gtttgatttt tagtatgatt tatggaatga gtgagttagg 7260 gaatagttta atattagata gaaaaaaatg tccagacatt tctggagtta cattcggtcg 7320 gtcggtaaat caatatccaa atatttttgg agtaattttt ttcaacagtt gttgaatttc 7380 gattaggtta tatttcggta tggataaaac aaatttccaa atatgtttgg agtacgaaaa 7440 aaatcggtat cttaaataac gcttaggtaa tagatatgtg tccaattgtt attggagtat 7500 ttcggtttag aataagtcgt aagtttggat gagaatagga atgagttgaa tatgaagttg 7560 gctaaacata gaatgatgaa tctcttagtt ggaccacata gttgtctttg atgtttttgt 7620 tgagtacata atattagctt gattaaatgt tttgttgttg agaattagtt atgttttgtc 7680 aagagtaaaa attttgaaaa tttttaattt tcaaaatttt caaaatcagt gtaagcaaa 7739 // ID Jockey_Ele10 repbase; DNA; INV; 4363 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Jockey clade non-LTR retrotransposon family from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey_Ele10. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4363 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4363 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 23 CC sequences with >90% identity, and ~93% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 309..1445 FT /product="Jockey_Ele10_1p" FT /translation="MEIGPSIQPMVATQNLFETLSTDDPDGNVNNRSQSPL FT NAPKKNRRPPPITIVSKGTKQSRELMNLANIPQSGYQLKAVKTGTQLNATD FT EDVFNAVIKVLQDSNTEFFTYTPSHQQPVRFILSGLALYDLAELKEELLLN FT EVRPLDLKIFSRKKSGPEESVLYLLCFVKGSTKLSELQGVKTLFNTVVKWR FT FFTRKPDDSVQCHRCQRFGHGMRFCNISPSCVKCGEKHLTANCKLPVKANL FT KDASVSRSLIRCANCSGNHTANFRGCPCRLSYIKQLEENRQRASKKTTAPN FT TRAPPTSLPRINSSVQNRLNNPSAAGGPSYSQVLQGSGQPDRRANLFSVSE FT FLCLARDLFHRLQGCRSKEQQFLALSELIIKYVYNG" FT CDS 1441..4113 FT /product="Jockey_Ele10_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MASLPCLRVVNWNGRSVHSKSLEFFDFVQRHGIDVAV FT VTETWLRPNTSFLHPNFSCVRLDRPSSDEGGRGGGVLIAVRKGLAFKQLNI FT STKKIEATGITISSAGNDELCIIAAYFPGTRRGSEWSKFRREIRSLVTQNS FT PYFVIGDLNARHRSWNCAKNNKAGTILFQEASHSGFNINFPDSPTFIPSGR FT GHPSTLDIVLSNNLVNMSKPTVHNELSSDHLPVTFEISLSSDPVANTESVR FT CYARADWLAFQRLINTKLCPNDPMFSNIRDENGVNAALNFFKDALLDAESV FT AVPLVTPRPYDVAQIPDETKLLIQLRNRRRRQWMRTRDPMLKQIVLSLNDR FT IRGECANARFSKFSXTLATMERGDNKLWRITKALKKTTKYSPPLRSGDTLF FT ACPKEKAKLLAESFALAHNNQSTSDRDTVEAVERSVEQIDLSPPVADNSWL FT VRPKEIATIIRNLKPKKAPGHDGIKNMLLKRLPRKGYVVLAKIFSACLKLC FT YFPDDWKHAIVVAIPKANKDHSIPSNYRPISLLPTLSKLFERVILTRIEKH FT LETVRLIPHEQFGFQKGHSTSHQIVRLVKEVRRNFQQGKSSGLILLDVEKA FT YDSVWHEAILHKMLLGNFPMSILKIMRSFLKDRSFQVAVNGCTSDRMSVPF FT GVPQGSVLSPTLYNIFSADIVKVNGVQYYFFADDTGFLASHRDAGTIIETL FT QHAQNSIQEYQRKWKVKVNPTKSQAIFFTRRRSPRHLPQSNVSCNGVDIPW FT SPNVGYLGITLDQKLKFGCHVTNCLQKCDKLVKMLYPLVKRRSRLDPGTKI FT LLYKTVLRPTVAYGFPAWYDCAQSHRNKVQVKQNRLLKMTLDLSPFHPTDD FT VHRLAGVERIDDWFRRLMPKFLDSCSSSVNPLLQELAA" XX SQ Sequence 4363 BP; 1164 A; 1152 C; 932 G; 1108 T; 7 other; cagtttcgag ctgtcaaagc tattgatgca caagcagcaa aatttgatgc gttgtttttc 60 gttgatttcg ctcatttttt tcgccgaata attaacttcg tgccactaaa gtggataacc 120 acggttcggg tcgcaatact ccgcgcaata atgtgcaaag ttcgcaatcc aacataatta 180 ttcgccgtga ttacgataaa actccgtcgg taacctcaat cgtcgactcc ggtgatcgca 240 acgatgccaa gcggctaaaa cgtacccact gtcagccttc tccagccgac gatcgcaatt 300 ctctcccgat ggagatcgga ccatcaatcc aacctatggt tgctactcag aacctcttcg 360 aaacgctgtc cactgacgac ccagatggaa acgtgaacaa cagatcgcaa tcgccactca 420 atgccccgaa gaagaatcgt cgccctcctc caatcactat cgtaagcaag ggcaccaaac 480 aatcccgcga gctgatgaat cttgccaata tcccgcagtc cggctaccaa ctgaaagctg 540 ttaaaactgg tacgcagctt aatgccacgg acgaggacgt tttcaatgcc gtcatcaagg 600 tactacaaga ctccaacacg gaatttttca cctacactcc gtcgcaccaa cagcccgtgc 660 gattcatact ctcgggtctc gcactgtatg atttggctga gctgaaggag gaattgttgc 720 tcaatgaagt gcgcccgttg gacctcaaaa ttttctctcg aaagaaaagt ggtcctgaag 780 aaagtgtgct gtacctgctt tgtttcgtga aaggatctac caagctctcg gagctgcaag 840 gagtaaagac actgttcaac acggtggtga agtggagatt tttcaccaga aaacctgacg 900 attccgtcca gtgccatcgg tgtcaacgct tcggccatgg aatgcgcttc tgtaacatct 960 ctccatcctg tgttaaatgc ggggagaaac atctcacagc aaactgcaag ctcccggtga 1020 aggccaacct caaggatgct tccgtctcgc gcagtcttat tcggtgcgct aactgcagtg 1080 gtaatcacac agccaacttc cgaggctgcc cctgccggct gagttatatc aaacaactcg 1140 aagaaaatcg tcagcgtgct tcgaagaaaa ctacagcgcc caacacccgc gcaccaccga 1200 cctctctccc taggatcaat tccagcgttc aaaatcgtct gaacaatcct agcgccgccg 1260 gaggaccttc ttactcgcaa gtactgcagg gttctggcca accagatcga cgcgcaaatc 1320 ttttttcggt cagcgagttt ctttgcctgg cgagggatct ttttcatcgt ttgcaaggct 1380 gccgttctaa ggagcaacaa ttcctagctt tatccgagct aataatcaaa tacgtttaca 1440 atggctagcc taccctgtct gcgcgtagtt aattggaacg gaagatccgt tcacagcaaa 1500 tcgctagaat ttttcgattt tgtacaacgt catggcatag atgttgctgt cgtcaccgaa 1560 acctggttgc gcccaaatac ttcattcctt catccaaatt tctcctgtgt tcgtctcgat 1620 cggccatcaa gtgatgaagg cggcagagga ggaggtgtac taatagctgt tcgaaaagga 1680 ttggctttca agcagctgaa tatttcgacc aaaaaaatcg aggcaacagg cattaccatc 1740 tcatcagctg gtaatgatga actttgcatt atcgctgcat acttccctgg aacgcgtcgg 1800 ggctctgagt ggtccaaatt tcgacgggaa atccgctctc tggtcacgca aaactcaccg 1860 tatttcgtca ttggggactt aaacgccaga catcgttctt ggaactgtgc caagaataac 1920 aaagctggaa ctatcctgtt tcaggaggct tcacactccg gcttcaacat caacttcccg 1980 gactccccta cgtttattcc ttcagggcga gggcacccgt ccacgttaga cattgtgctg 2040 tccaacaatt tggtcaacat gtcgaaacca accgtacaca acgagctgtc gtctgaccac 2100 ctgcccgtta ccttcgagat cagtctgtcg tccgatcctg tggcaaatac tgaatctgtt 2160 cgctgctacg cacgtgcaga ctggcttgct ttccaacggt tgataaacac aaaactttgc 2220 ccgaacgacc ctatgttttc maacatccgc gatgaaaacg gagtcaatgc tgcgctcaat 2280 ttcttcaaag acgctcttct cgacgcagaa tctgttgctg tgccgctggt cacccctcgt 2340 ccgtacgatg tmgcgcaaat ccccgacgaa acaaaactcc tcatccagct tcgtaaccgg 2400 cgtcggcgac agtggatgag aaccagagat cctatgctga aacaaattgt tctttcgctm 2460 aacgatcgca ttcgtgggga gtgcgcaaac gcacggttca gcaagttttc tcamacstta 2520 gctacaatgg agcgtggcga taataaactt tggcgcatca ccaaagcgct gaagaaaacg 2580 accaagtaca gccccccgct gcgcagtgga gacacgttat tcgcttgtcc taaggagaag 2640 gccaaactgc tcgctgaaag tttcgcgctt gctcacaaca atcaatccac tagcgacagg 2700 gacackgttg aagctgttga gcgttccgtc gaacagatcg acctttctcc gcctgttgcc 2760 gacaactcct ggttagtccg ccccaaggaa attgccacaa tcattcgcaa tttgaaaccc 2820 aaaaaagcgc ctggtcacga tggaatcaaa aacatgcttc taaaacgact tcctcgaaaa 2880 gggtatgtgg ttctcgctaa aattttctcc gcttgcctca aactatgcta ctttcccgat 2940 gactggaaac acgctatagt tgtcgccatt ccgaaagcga ataaggacca ttccatcccc 3000 tctaactaca gaccaatcag cctgctgcca acattgagca aacttttcga acgcgtcatt 3060 ttgacgcgca tcgagaaaca cctggagaca gttcgtttga ttcctcatga acaatttggg 3120 tttcaaaaag ggcattctac cagccaccaa atcgtgcggc tggtgaagga ggtaagacgg 3180 aacttccagc aaggaaaatc atcaggccta atcctccttg atgtagagaa agcctacgat 3240 tcagtatggc atgaagcgat cttgcacaaa atgttgctgg gcaattttcc catgtcgatt 3300 ctgaaaataa tgcgaagctt cctgaaggat cgttccttcc aagtggctgt aaacggttgt 3360 acgtccgatc gaatgtccgt accattcggt gttcctcaag gttctgttct gagcccaaca 3420 ctatacaaca ttttttccgc cgacatagta aaagtcaacg gagtccagta ctactttttc 3480 gctgacgaca ctggtttcct tgcctcacat cgcgacgcag gaacgataat cgaaacgcta 3540 cagcatgccc agaactccat ccaagagtac caaagaaaat ggaaggtgaa agtcaaccca 3600 acgaaatcgc aagccatatt ctttactcgg cgtcgtagcc ctcgkcacct accccagagt 3660 aacgtgtctt gcaacggcgt ggacattcca tggtcaccaa atgttggcta tcttggtata 3720 accctggatc agaagctaaa gtttggttgt cacgtgacca actgcctcca gaaatgcgat 3780 aagctcgtca aaatgctgta ccctcttgtt aaaagacgct cccgtttgga ccctggtacc 3840 aagatcctgc tatacaaaac cgttctcagg ccaacggttg cgtatggctt tcctgcttgg 3900 tacgactgtg cacaatccca ccggaacaaa gtccaagtta agcaaaaccg tttgctcaag 3960 atgacgctgg atttgagccc tttccatccg actgacgacg tacaccgact tgctggcgtt 4020 gagcggatcg atgattggtt tcggagatta atgcccaaat tcctggacag ctgttcgtca 4080 tctgtaaatc ctcttttaca agagctggcc gcatagatgt gatgtgatat tagatttaag 4140 cttttctttt ttctgtcttt ttcctaagca aattcgaagg tttttttttg ttcttttcct 4200 tccctgtcca aatagttgct tgtcgttcat taacacaaaa tgtgctcatt tgttctatgt 4260 cattgttgta aggaattcca tatctcgatt gttaaactct aaacacaaat tgtaatcgct 4320 aaggtaaaaa taaaaataat tgaaattgaa aaaaaattga aaa 4363 // ID Gypsy-182_AA-LTR repbase; DNA; INV; 1661 BP. XX AC supercont1.136; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-182_AA_; KW Gypsy-182_AA-I; Gypsy-182_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1661 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.136; Positions 2052707 2054367. XX SQ Sequence 1661 BP; 471 A; 320 C; 358 G; 512 T; 0 other; tgtaacgata atttaaattc actctaaaat aagcacattg tgaattcgtg tttacacatt 60 catacgtctt gctgtcagct agctcgaata caaaaatgct gtcagtttta tgcgagagtc 120 tagacaacaa tcactctaag cgccgcctag gagttggcct aggtaataaa atgatatatt 180 tcaaaattta aatccttagc gtcgagtctc ataaattgcg tcgtagagag agagtcagac 240 aggtgtagct aggaaaaggc caactagtac aaaagactat tgtaactaaa cgctctttaa 300 tcctccaata gtgggatatt gcgcagcaat ttttaggctc gatacaatta gcagcgcgtc 360 ggtcctttca tttcagtgga gtgctttgtt gaaaccagac atactttccg agtattattc 420 aaatttcaat cgttttgaac aaaagttaaa ttgtgcgggt gaataattaa actcgaagta 480 agttttcgcg cgtgcataat tatttagttt atttaatttg tgatatttct ctttttaatt 540 agcaactaaa ttgtcgagta tagtgcagtc gaaaaagatt tttttagtga atttattaca 600 aaacaaattc aggtaagcct tacataacga atagtgagca taactgtgaa ttaattccac 660 gttgattagg attagcgtgc ccaagcaacg tgggcggtta gtctggaggt gaaacctagc 720 cccgtagcta tcccgaactc gacactgttg acgaggatta tttcgagcac gtgcagtggg 780 gacatccacc gttcggcgat ctaacgtctt gctgtcggcg gctcaagcaa tcgtcccgct 840 cgtctttcgt cgatcgtgca acatcgttcc gtgtcaccgc aagttaataa gtcgcacggc 900 tggccacgct acaatccacg gccgatcact ccacgctaca aggggacgcc tataccccga 960 atgagaagcc ggcgcgcggg gcacgtaagt aagacgaagc ataattagag cccgaagccg 1020 acaacgtgtt gatgaatttg tttccaattt gaaattgaat cccgtttatt tgggccgaat 1080 cacgtcgggg gacgtccgtc gttggatcgg tcacgacaat aattggaatt gctcacggag 1140 cgaaaatacg attgcaatca aaagttagga aagcgtgacg tcacgatagg tcattaggtt 1200 tatttgaatt tggcctttga attaaaagtt taagtaacta aatacaaact cgcttatctc 1260 gatagattta gctgaatttg tttcgcgatc gagattgatt ttgaacttaa taaaacctac 1320 ttcaagttta gtttaagtac tcttcttttt ttttccgcgt cacgcaatca atttgaattt 1380 aattgttaat tgtgtttgtc taattagagc taagcgattc tcctttggtg gattattgag 1440 ttatttttgt aaattttcct ttcttatatt ttaaattaga gtagctatat ttgtttgatt 1500 tgccgtacga cgtttgggac taatttgcga catcagttct tagaacattg attgacctgc 1560 cctgagaagg cagtctcatt ccgggctaat taacggagta acggtccttc ttcgaaggtg 1620 gcgcttaagc cgattacttg gaaacggaac ggagcgttac a 1661 // ID CR1-18_HM repbase; DNA; INV; 5211 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5211 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1846-1846 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 129..959 FT /product="CR1-18_HM_1p" FT /translation="MDKNKSSDKKEFQLNTEASEISVKLVREIFSEMFKKQ FT QKDILKLISGNLKITNERIDGLLKEIAEIKETCKTLQIENNLRKSEMVKTN FT DRVSIIEHSQKDIEHSITFTQDIQEEKINKIEKKIVTKVAFSVEEKNKLRQ FT LEDRLRRNNLRLEGITESESESWNESEEKVLSIFENKLNVSGVVIERAHRT FT GKTGQSKPRSIVIKLLNYKDKVNILKNSKKLKGTGIFINEDYSLDTLNIRK FT TLFEEMRTHRTNGKYSVVIYDKLVVKDFKKSLKKA*" FT CDS join(1019..2143,2147..4075) FT /product="CR1-18_HM_2p" FT /translation="MAEAINNIVFDIPGRKNTLNRINDDDKNNNFFDEILV FT ETAYYNVDDVKDVFNSASSNFSLLHLNIRSINKNFENLKLMLNNIKNKFSI FT ICLTETWCHRDTINSNFQLSGYKSIHQPRENGIGGGVSIFVQNSLTFKHVD FT NLKVNDADCESFTIEIINKAIKNTFITALYRPPNGNYNQFENHINYLLPKL FT IKKHVYLLGDFNLNFLNNKTDNHVARFINTLLQYSVFPIINKPTRVTKTTA FT SIIDNIITNNYTRAKMQSGIIKTDITDHFPVFLITDSATPKKKIETKTIYV FT RNINNESIVTFRDLLQEINWELLTDCNDVNNGYDFFIRVFTKQYEKAFPKV FT KKIINSKNLLSPWMTKGLIKSSKKKQKLYQKFKKKNYKNEIKYKKYKNSFE FT KLKKQSKKNYYSLLLNKTFGNARKTWNVIKEITGVGNVKIDNFPKRLQRDG FT NIGDAFVNKEIAETFNDFFINIGKKLASAIPEGSKSFESFLKKSDSIMDES FT ELLVDELRLAISMLKTNKSAGLDEINPDVVKAVSDIIEKPLFIIFNLSLRN FT GIFPDQLKLAKIIPIYKSGDDSIKSNYRPISILSCFSKILERIMHNRLYNY FT LETNNILYHKQFGFRKNHSTEHAVIDLANQILNGFENNSYTLGIFLDLSKA FT FDTVNHKILINKLCNYGVAHSNLKWLQCYLENRKQVVPFDLTCSPFESISC FT GVPQGSILGPLLFLIYINDIYLSSKILNYIIFADDTNVFYSDSNLKNIFCN FT VNTELTYLNEWFIANKLSLNIEKTKYILFAKPSKSENLPLTLPELTINNIK FT IKRVFSMKILGIIFDEHLNWKSQIKLVENKVSKTLGIMYKTKPLLNKICLK FT NLYFSFIHSHINYCNIAWANNHSSFLKTIYTKQKQASRLVSGMDRYESAEP FT LLKEINALNVYKLNIYHNLLFMFKLQNELIPKYFYKNFSLINHKYKTRHSI FT NNYCIPLTLLKKSDFSITCRGPRLWNSVLDENMKKIKSIDIFKRAIKKHLL FT NLNNTYILSFF*" XX SQ Sequence 5211 BP; 2053 A; 639 C; 710 G; 1807 T; 2 other; acggaagtga tcggacgtgc ttttcttgcg ctgtaaattt ttaataaaat tgtttgaact 60 taatttttaa aatttttgtt tataattggt atttaacttt aaacattcgt atttaacttg 120 ttaataaaat ggataaaaat aaaagtagtg ataaaaaaga atttcaatta aatacagaag 180 cttcagaaat atctgtgaaa ttagttcgtg aaatattctc tgagatgttt aagaaacagc 240 aaaaagacat tctcaaactt attagtggta atttaaaaat tacaaatgaa agaattgatg 300 gactgctgaa agaaatagca gaaataaaag aaacgtgcaa aactttacaa attgaaaata 360 atttaagaaa atctgaaatg gtaaaaacaa atgacagagt ctcaataatt gaacacagtc 420 aaaaagatat tgaacacagt atcactttta cacaagatat ccaagaggag aaaatcaata 480 aaattgaaaa aaaaattgtt acaaaagtag catttagtgt agaagaaaaa aataagctgc 540 gacaacttga agatcgactg agaagaaata atttgcgcct tgaaggaata acggagagcg 600 aatctgaaag ttggaatgaa tctgaagaaa aggttttaag tatttttgaa aacaaactaa 660 atgtaagtgg cgtggttatt gaaagggcac atagaaccgg aaaaactggg caaagtaaac 720 ctagatcaat cgtgataaaa cttctgaact acaaagataa agttaacata ttaaaaaact 780 cgaaaaaatt aaaaggaaca ggaatattta taaacgaaga ttattctttg gataccttaa 840 acatacgaaa aactcttttt gaagaaatgc gaactcacag gacaaatggt aagtattccg 900 tcgttattta tgataaatta gtagttaaag attttaaaaa atcacttaaa aaagcctaaa 960 gcttatatat tttttatatt ttgtatttta aatattaatc ttagttaata ttgctaaaat 1020 ggcggaagct atcaataata ttgtttttga cataccagga aggaaaaata cacttaaccg 1080 aattaatgac gatgataaaa acaataactt ttttgacgaa attttagtag aaacggctta 1140 ttacaatgtt gacgatgtta aagatgtttt taattctgca tcatctaatt tttcactatt 1200 acatttaaac atcagaagca taaataaaaa ctttgaaaat ttaaaattaa tgttaaataa 1260 tataaaaaac aagttcagta ttatctgtct gacagaaact tggtgtcatc gagacacgat 1320 taattctaac tttcaattat caggatataa atccatacat caaccgagag aaaacggcat 1380 tggtgggggc gtatctattt ttgtccaaaa ctcattgacc tttaagcatg tagataacct 1440 caaagtaaat gacgctgatt gtgagtcatt tactattgaa ataattaata aagcaattaa 1500 aaatacattt ataacagctt tatatagacc gcctaatggt aattacaatc agtttgaaaa 1560 ccacataaat tacttgctac caaagctcat aaaaaaacac gtgtatttac taggtgattt 1620 taatttgaat tttctgaata ataaaacaga taaccacgtt gcgcgtttta taaatacact 1680 tttgcaatac agcgtttttc ccataatcaa caagccaaca cgcgtgacta aaacaactgc 1740 atccattatc gataacatca ttactaataa ttacacaagg gctaaaatgc aaagtggtat 1800 tattaaaaca gatatcactg atcattttcc agtttttctg atcacagatt cggcaacccc 1860 taagaaaaaa atagaaacaa aaactattta tgtaagaaac ataaataatg aatcaatcgt 1920 cacttttcgc gacctcctac aagaaataaa ctgggaacta ttgactgatt gcaatgatgt 1980 aaataatggc tatgattttt ttattcgcgt gtttactaag caatacgaaa aagcctttcc 2040 aaaagttaaa aaaattatta attctaaaaa tttgcttagt ccgtggatga cgaaaggact 2100 cattaagtca tcaaaaaaga aacaaaaact ttaccaaaaa ttttaaaaaa aaaagaacta 2160 caaaaatgaa ataaaataca aaaaatataa aaattctttc gaaaagttaa aaaagcaatc 2220 aaaaaaaaat tactattcct tattgcttaa taaaacattt ggaaatgcaa gaaaaacatg 2280 gaatgtaatc aaagaaataa caggcgtagg aaatgtaaaa attgataatt tcccaaaaag 2340 attgcaaaga gatggtaata taggagatgc ttttgtaaat aaagaaattg cagaaacatt 2400 taatgacttt tttataaata tagggaaaaa actagccagt gcaattcctg aggggagtaa 2460 atcgtttgaa tcctttttga aaaaatcaga ttccataatg gatgaatcgg agcttctggt 2520 tgatgaactt cggctagcaa ttagtatgtt gaaaacaaat aagagtgcgg gtctagatga 2580 aattaaccct gatgttgtaa aagcggtctc tgatattatt gagaaacctc tttttataat 2640 ttttaatctt tccctaagaa acggcatttt tcctgaccaa ctaaaactag caaaaataat 2700 tccaatatac aaaagcggcg atgattcgat aaaatctaac tatagaccaa tttccatact 2760 ttcctgtttt tctaaaatac tagagcgtat tatgcacaac agactgtaca attatcttga 2820 aactaataac attctttatc ataaacagtt tggttttcgc aaaaaccatt ccacggagca 2880 tgcagtaatt gatcttgcga accaaattct aaatggtttt gaaaacaata gttacactct 2940 cggaatattt ctggatctat ccaaagcctt tgatactgtc aaccacaaga ttcttatcaa 3000 caaactatgt aattacggag tagctcactc taacttaaag tggctgcaat gttatttaga 3060 aaatcgtaaa caagtagtac catttgattt aacatgctcc ccatttgagt caataagttg 3120 tggagttcca caaggatcaa tacttggacc tttactgttt ttaatttata ttaatgatat 3180 ttatttgtca tcaaaaatac ttaattatat tatttttgct gacgatacta atgttttcta 3240 ttctgattca aatttaaaaa atattttttg taatgtaaac actgaactaa catatctcaa 3300 tgagtggttt atagcaaata aactttcgtt gaatattgaa aaaacaaaat atattttgtt 3360 tgctaaacca tctaaatccg agaatcttcc acttacacta ccagaactaa ctattaacaa 3420 tattaaaata aaaagagtgt tttcaatgaa aatacttgga ataatttttg acgaacattt 3480 aaattggaaa agccaaataa agctagtcga gaacaaagtc tcaaagactt tagggataat 3540 gtataagact aaacctcttt tgaacaaaat atgtttaaaa aatttatatt tttcgtttat 3600 acatagtcat attaactatt gtaatattgc ctgggcaaac aatcactcat ctttcctaaa 3660 aactatttac acaaaacaaa agcaagcaag tcgattagtt tcgggcatgg atagatacga 3720 aagtgccgag ccgttactca aggaaataaa tgccctaaat gtatacaaat taaacatcta 3780 ccataacttg ttatttatgt ttaaacttca aaacgaatta attccaaaat atttttataa 3840 aaacttctct cttattaatc ataaatataa aacaagacac tcaattaata attactgcat 3900 cccactaaca ttactcaaaa agtctgactt ctcaattaca tgtcggggac cacgcttgtg 3960 gaactcggtt cttgacgaaa atatgaaaaa aataaaatct attgatatat ttaaaagagc 4020 tataaaaaaa cacttactaa atttaaacaa tacgtacatt ctttcatttt tttaattaaa 4080 aaaaaaaaaa aawatatata tatatgtatt ttgcgtgtgt atgtatatat atatatatat 4140 atatatatat atatatatat atatatatat atatatatat atatgtgtgt atatatatat 4200 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 4260 atatatatat atatatatat atatatatat atatatatat atatgtatat atatatgtat 4320 gtatatgtgt atrtgtgtat gtgtatatat gtatatgtgt atatatatat gtatgtgtgt 4380 gtatatatgt atgtgtgtat atgtatatgt gtatatgtat atatgtatat gtgtgtatgt 4440 atatgtatat atgtgtatat gtatatgtat atatatatat atatatatat atatatatat 4500 atatatatat atatatatat atatatatat atatacatgt atgtatgtgt gtgtgtatgt 4560 atgtatgtat atgtgtgtat gtgtatgtat gtatatgtat atgtgtgtat gtgtatgtat 4620 atgtgtgtat gtgtatatgt gtatgtgtat atgtatgtgt gtatgtgtat atatatacat 4680 atatccttct tatatatatt cttcttgttt ctttttgttt gtctgtttta attttgttta 4740 aaaaaaaaaa tatgagtaga tgaacttaaa tctataaaat gttcagaatc ttttttaatt 4800 caaaatatct tttgtatatt taaatatata ttatgaagtt tattggatat gtatacgcat 4860 atgtacggat ttgtaaatta ttttttcaaa attttttctt ttttattatt ttcaaacttt 4920 ttagttttgt caaacttaaa aattattttt tcaacttttt gattttttca aactttttaa 4980 tttgtttttt tttaaacttt ttttaatttc actcttttat atattattgg aaattgtaaa 5040 attttttttt tatttttttt tattttcacg aaggggcttg atgataagac attttgtctt 5100 ctacttgccc cagtcagctt gcatttgtat aaatttattt gaatgtaagg ttaacattgt 5160 aaaatgtgtc aaatgacgaa aaataaaaat taaaaaaaaa aaaaaaaaaa a 5211 // ID CR1_Ele30 repbase; DNA; INV; 4393 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 19-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele30. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4393 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4393 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 23 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 333..1187 FT /product="CR1_Ele30_1p" FT /translation="MEKKHCGECHLEINDLEPVRCGLCESPFHISQNCCGF FT NLRSCKDVFAQGKAVFVCTKCRDELNGRSIRAYIADQAHRNVSPTASADNI FT NTQIQQLSGIVAELSRKVDNIANISTPKLPVVREMRTPVWPGLGMKRRRGE FT NGQSLAPAADRGTGAMDFSDLSIPFITPAASPPKFWLYLSGFQPKISDDDV FT QKIVARCLDLRDPFEVIRLVPKGADTKNMSFISFKIGLELALKQQALDAAR FT WPTGLMFREFVDFSKNRRPLSFAREPPAVTPTQEQPTGNPETIV" FT CDS 1019..4189 FT /product="CR1_Ele30_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="TSSQTASPRCSTLAYRLDVQRICGFFKKQTSTILRPR FT AASSYPYSGATNGESRNYCLNTPRCLEEGGSPRALTRGEYFDVFTSSPALN FT QCTLQRIPDHDSDIMVYYQNVRGLRTKIDDVFLAAHDCEFDAVIFTETGLD FT DCITSLQLFGTTYNVFRCDRCPRNSNKSRFGGVLIAVAEQYTSCKIDTTSA FT RNLEQISVSTNIKGKKLSLCAVYIPPDRSQDLSVINEHIASVQELCNNCSA FT NETVLICGDFNRPRMRWVRNDTGIICGGSLLPPTSHTLLDGMEFLGLGQRN FT LETNLLGRTLDLVFCPLECEATVGGCSMPMLPVDSHHPPLAISLHTDVDEA FT LSSTVGDRIAVRPLNYRLIDFPGLSDRLNNIDWTSLFASKEVDDMAECFCG FT EINRWLETNVPRVRPSFTPAWSSSRLRQLKRERNACQRKLQRRRTNANARI FT FHRAVNAYRHLNASLYKSYVLRMQSSLRSNPRGFWNFVNSKRKTSSIPSNV FT YFGNATASSSLESCELFARHFASVFSSHVTSQQEAENVASNVPCDLVDFGI FT FVITPEMVVKAAKKLKSTFSPGPDGLPAVVIRRCITVLARPLSDIFNRSFE FT QAKFPDIWKQSFMCPIFKNGDRRNVVNYRGITSLSASSKVFEIIVSGAMLE FT RTKNYISFDQHGFMPGRSVTTNLLSFTSKCIASMEARAQMDVIYTDLKAAF FT DKIDHTILLCKLSRLGFSSQLVCWLNSYLFGRVLRVKLDNAVSTPFSNKSG FT VPQGSNLGPLLFALFFNDVALFFEDGSKLVYADDFKLFLEVRSIDDCLQLQ FT SQLQVFVAWCTKNKLVISVAKCYVITFHRTQRPIVFDYNIGGTILTRVSEV FT HDLGVQLDAKLMFDCQRSMVISKATQRLGFIFKIAKDFNDPHCLKALYCSL FT VRPILENASVVWCPHQVSWCLRIERVQKRFVRMALRNLPWRDPVNLPPYPE FT RCQLLGLDTLQRRRKIQQALIIAKLINGEIDSPELRGMLNFRVPSRSLRNT FT TLLEQRFHRTLFGYNEPMAACIRTFSMVEDLFDFDENIDKFAGKINRSRLF FT " XX SQ Sequence 4393 BP; 1181 A; 948 C; 998 G; 1266 T; 0 other; tctctggcaa cactgttgaa attagttgat tgttttttgt acgtggattt ttattacgat 60 tttgatcgtg taatacgcat attatcagtt tcggtttata tacgttgtat tcgactatat 120 cgtgaaatca gttgcatact ggagttttag tcgaaatagc cctcatcgta ctgtttttgc 180 tgaccattac ccgaccaaat tggatatttt tggcgccatc caaccgttag caccagccac 240 atctgtttag aattcgacgt cgttgtggtt caacgaagcg aacgaagctg gttctgcttt 300 ccttgcttac gaaactaagg ataaaaacga agatggagaa aaaacattgc ggagagtgcc 360 acctcgaaat caacgatttg gagcccgtac gttgtggttt atgtgaatct cctttccaca 420 ttagccaaaa ttgttgtggc ttcaaccttc ggtcatgcaa agatgttttt gctcaaggca 480 aggccgtgtt tgtttgcaca aaatgcaggg atgagctaaa cggcagaagc attcgtgcgt 540 acattgccga tcaagcccat cgcaatgtat ctccaaccgc aagtgcagat aatatcaata 600 cacagataca gcagctgtct ggtattgttg cggagctaag caggaaagta gataacattg 660 cgaacatctc tacaccaaaa cttcctgttg tccgtgaaat gagaacgccc gtttggccgg 720 ggttaggcat gaagcgccgc cgtggggaaa acggccaatc attggctcct gctgctgatc 780 gaggcactgg cgccatggac ttcagtgatc tatccattcc ctttatcaca cctgccgctt 840 cacctcctaa gttctggctg tatctctcgg gatttcaacc aaaaatcagt gacgatgacg 900 tgcaaaagat tgttgcgcga tgtctggatt tgagagatcc atttgaggtc atccgtctag 960 taccgaaagg tgcagatacc aagaacatga gtttcatctc gttcaaaatc ggccttgaac 1020 tagctctcaa acagcaagcc ctcgatgcag cacgttggcc taccggcttg atgttcagag 1080 aatttgtgga tttttcaaaa aacagacgtc cactatcctt cgcccgcgag ccgccagcag 1140 ttacccctac tcaggagcaa ccaacgggga atccagaaac tattgtttga acacaccgcg 1200 atgcttggag gaaggtggga gcccaagggc tctcaccaga ggcgagtatt ttgatgtttt 1260 tacgtcttca cctgcattaa atcaatgtac attgcagcga atccctgacc acgactccga 1320 tattatggtt tattaccaaa atgtacgagg attgcggacg aagattgacg atgtttttct 1380 ggctgcccat gactgcgaat ttgatgcggt aatattcacg gaaactggac tagacgactg 1440 catcacctcg ttgcaactgt ttggaacgac ttacaacgtt tttcgctgcg atcgttgccc 1500 tcggaacagc aataaatctc ggtttggtgg cgttttgatt gctgtggccg agcagtacac 1560 tagctgtaaa atcgatacaa cgagtgcacg gaatttggaa caaatttctg tctcgacgaa 1620 tataaaaggc aaaaagttat ccttatgcgc tgtttacatc ccacctgatc gtagccagga 1680 tctgagcgtc atcaacgagc atatcgcatc cgtgcaagaa ttgtgcaaca actgttcggc 1740 aaacgaaacc gtgctcattt gtggtgattt caatcgacca cgcatgcgct gggttagaaa 1800 tgacacggga ataatttgtg gtggttcact gttgccaccg actagccata cattgttgga 1860 cggcatggaa ttccttggtt taggacaacg caatctcgag accaatttgc tcggtcgtac 1920 acttgacctt gttttttgtc cgttggaatg tgaagctaca gtcggtggtt gttcgatgcc 1980 aatgctgcca gtggattctc atcatccgcc gcttgccatt tctcttcaca cagatgttga 2040 tgaagccctc tcgtctacgg ttggtgatag aatagcggtt cgcccactta attatcgatt 2100 gatagatttt cctggtctat ctgatcgtct caacaatatt gactggacat cattgtttgc 2160 atcgaaggaa gttgacgata tggcggaatg tttttgtggc gaaatcaatc ggtggcttga 2220 aacgaatgta ccccgtgtaa gaccatcctt cactccagca tggagctctt cacggcttcg 2280 ccagttgaaa cgggaacgaa acgcatgtca acgtaaactg caacggcggc gaaccaatgc 2340 caatgctcgt atcttccatc gcgccgttaa tgcatatcgt catctaaatg ctagcctata 2400 caaatcgtat gtgctcagaa tgcaatccag tctacgcagc aatccccgag gtttttggaa 2460 tttcgttaat tcaaaacgga aaacgtcatc gataccatca aacgtatatt ttggtaatgc 2520 gaccgcttcg tcgtctttgg agtcttgcga gctatttgcg aggcactttg caagcgtttt 2580 ttcgagccat gtcacttcgc aacaagaggc ggagaatgta gcgtcaaacg taccctgtga 2640 tttggttgat ttcggtattt tcgtcatcac tccggaaatg gtcgtaaaag cggcaaagaa 2700 attaaagagt actttttctc ctggacctga cggattgcct gcagttgtca ttcgccgttg 2760 catcactgtt ttggcgagac cgttgagtga tatttttaat cgatcgtttg aacaagctaa 2820 gtttccagac atatggaagc aatcatttat gtgtcccatc ttcaagaatg gtgatcgacg 2880 aaacgtagta aactaccgtg gcataaccag tttgtcagct tcttcgaagg tgtttgagat 2940 aatcgttagc ggcgcaatgt tggaacgcac caagaactat atctcctttg accagcatgg 3000 gtttatgcca gggagatccg tcacaacgaa cttgctgagc ttcacatcca aatgcatagc 3060 tagcatggaa gcaagagcgc aaatggatgt aatatatacg gatctcaaag ctgcgttcga 3120 caaaattgat catactatcc ttctatgcaa gttatctcgt ctcggctttt cgtcgcaact 3180 ggtatgctgg ttgaactcat atctttttgg gagagttcta cgagtgaaac ttgataacgc 3240 cgtgtcgaca ccgttctcga ataaatcggg tgttccgcaa ggaagtaatt tgggaccgct 3300 actattcgca ctatttttca atgacgttgc actgtttttt gaagatggca gcaagctagt 3360 ttatgcggac gatttcaaat tgttcctcga ggtacgatct atcgatgact gtttgcaact 3420 gcaaagtcaa ctgcaagttt ttgttgcatg gtgtacgaaa aacaaactcg tcatcagtgt 3480 tgcaaaatgc tacgtaatca cgttccatcg tacgcaacgt cctattgtat ttgactacaa 3540 tattggcgga actattctga caagagtcag tgaagtgcac gatttgggcg tccagttgga 3600 tgcgaaactc atgtttgatt gccaacgatc gatggtaatc tctaaagcta cgcaacggtt 3660 gggattcatc tttaagatag ccaaggactt caatgatcca cattgcctga aggcattata 3720 ttgttcactc gtccgtccga ttctcgagaa tgcttcggtg gtatggtgtc cgcatcaggt 3780 ctcatggtgt ttgagaatcg aacgagtgca gaaacgtttt gtccgcatgg cgttacggaa 3840 tttaccgtgg cgagatccag ttaacctgcc accatatccg gagaggtgtc aattgttagg 3900 attggacacg ttgcaacgtc gacggaaaat tcaacaagcg ttgataattg caaaactcat 3960 caatggagaa attgattccc cagagctgcg tggaatgctc aacttccgtg tcccgagcag 4020 atcgctgcgg aatacaaccc tgctcgaaca aagatttcac agaaccctgt ttggttacaa 4080 tgaaccgatg gcagcatgta ttcgaacgtt tagcatggtg gaagatctat tcgatttcga 4140 tgagaatatt gataaatttg ctggaaaaat caaccgctca agactctttt gactctttgt 4200 aatagtttat tttattattt gacatgtttt gtaatgtttt gtgatgagat gtattgtgat 4260 atgatgtcca attgtgtttg tatttaattt tgtaagttta accagttaag tttatgtaga 4320 ctataaagtc cgataaacag aatacctaat aaataaataa ataaataaaa taaataaata 4380 atgcaaaaaa taa 4393 // ID SAT-3_AAe repbase; DNA; INV; 181 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Satellite-type sequence: consensus. XX KW SAT; Satellite; Simple Repeat; SAT-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-181 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1453-1453 (2011). XX DR [1] (Consensus) XX CC 181-bp unit. XX SQ Sequence 181 BP; 53 A; 42 C; 28 G; 58 T; 0 other; atcaattcac tctgccaaaa ccttagcctc caacttttag tgggacccat gcttgaaaat 60 ccaataaaat tataaaaatc cgtagatttc ctaaaaagtt gagtaaatat caacggattt 120 tcgatttctt agcctcaatc gcttcgtctg gatcccagct aactttccgt gtgctgggtt 180 t 181 // ID Sola1-1_Lgigantea repbase; DNA; INV; 2510 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola1-1_Lgigantea. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-2510 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2510 BP; 885 A; 432 C; 505 G; 688 T; 0 other; ctgccctaaa aaggtaaaag acagtatctg caattttgac gaaaatgagt tatgggtaac 60 attatgacca gcagatacta ttctatgata taaaacaaaa tttacaggta tatatgaaga 120 ataagaagag ttattggctg taaaattcaa cagtatctat attttatgca gagttacaaa 180 gttaaaaagc agtaaatggt agatactgtc ctctccctcc ctttcttagt aagttactgc 240 ctttctcttc caaactagtt ccaatttcat atagaggttg tgagatactg ctgcacaagg 300 tggataaaaa caacatggat agcttatcta gaggaaaaca aattttgaat ttatcgctta 360 aaaacaactt atctgattct aaagaaggtg cccaattaaa aaatgatgag gcagaacctg 420 aaatggacca gaattgaaca caacaatgag ccttcaagaa taattcagaa tctttatgca 480 gacatggtgg caccaacaac accaacaaga atctttctag gatctccaga tgatcttttg 540 gaagaggtga taatagatac tggtacacat agagaccatg attatggtgt agatgataat 600 gaagaatttc aaaattctaa tgctgacgat gttaatgaag atccggatta tgaacctgac 660 aaagacgaaa ccacgacagg tgggcgatta gatgacagaa taaaacaacc taacacaaac 720 cgaaccttga aagcgaccat ccaataagag aaccctgtaa aaattgcaaa agaaattgtt 780 tagataaaat tgatgaaaaa cgtaggaaag aaatttggga agcatattgg aaacttaatt 840 acaatgggag aagacagtat ttgttcaata atgtacatag aaacgagaag agtcgacaga 900 cgactcaagc tccaagtaga agaaaactca gctatgaata ttttctaaag gatgaacatg 960 gaacagctca agttgtttgc aaggttttct tcctttgtac attaggttat catccaaaat 1020 tgataaaatg attaacacag tcatgaagct caccaggcct tcagatattt cacccccacc 1080 agacaaacgt gggatgcaag agccacccaa taagattaac agagaaccaa taaaggccca 1140 tattgagtcg ttcaatccct gtatcagtca ttataggagg gctcacgccc caaacagaag 1200 atatctaccg agtgatgttt caattcgtat gatgtatgat gattttaaat caactgctgt 1260 tcaaaaatgc agctatgata catatagaca ggttatccaa gatatgaaga tcagtttcac 1320 taaattgggt gaagaagaat gtgaggtatg tctccaacat gagctacatg tcaaatcaga 1380 ccacgctgat cttactgatg gcgttacaag tgacagatgc agtacctgga agagacataa 1440 cgaggcagct gcatcagcta gagtacacta tcaaactgat gcatctaatg cagatgcaaa 1500 catattgata agaagtgttg atcttcaaaa ggttattatg ctacctcgta tgccaggtaa 1560 caaaaccgca atttttacga aacgcatcat tgcttttaat gaaacgttcg ctgcagttgg 1620 taagcaatca aagaaaccat cacacattag agagagatct ttggctgttt tatggcacga 1680 gggaataggt ggtaggcagg gcaaagaaat tgcctccgct tttgtgaaag ctttgaagga 1740 gatgcgagat gcagagcata ttgtactgtg gctggacaac tgtgttgctc aaaacaagaa 1800 ttggtacctc atttctgcct tattaacttt ggttaacaat aatgatgttt tagctcaaga 1860 cgtaacattg aagtattttg taacaggaca tacatttatg tctgctgact ctgtccatca 1920 ccaggtagag cagcaaatga atcgtcaaca aggtggcaaa gtattagatt ttccagattt 1980 tgtagatgtt gtatcaaggt gttctaatgt agaaggttta cagatgtcaa atattgattt 2040 tcgtgattgg agtcctatcc attctgcagc caagatgaag aaatctggaa ttaagttggc 2100 ccaaatggtt gaaataaaag tgaagcgtgg gtcgaaaact ttgtcctaca aattgaacca 2160 tgatgatccc gagttcattg aattggactt cacaaaaaag gagcaatgtt ggtgtttcct 2220 aagagactgc gccagaagaa cagaggaata ccccttgaga agaaaactga tattgttgaa 2280 aagttgtgcc ccatgatgcc tacaaatcga cgtatttttt ggaccagtat tgacattgat 2340 aaaaactgtg aagatcttat caatgaatct acagaaaaat atgaagatta atttccataa 2400 aatcatcttg tttcagtatt gtatttggaa gtgggaagat ctcaaacttt aacctcgatt 2460 ttctcaaaac gccatttttg cagatactgt ctttcacctt ttcagggcag 2510 // ID BEL-235_AA-LTR repbase; DNA; INV; 595 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-235_AA_; KW BEL-235_AA-I; BEL-235_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-595 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 926-926 (2011). XX DR [1] (Consensus) XX SQ Sequence 595 BP; 217 A; 91 C; 115 G; 163 T; 9 other; tgtggaagaa ccacagggca atcgaattca gtaacgtcct tgtgagttcc gtatcaccca 60 cagcagawcg tttgacgtag aatgaagaaa cgtcaacgta tgatgagatg gcagaataga 120 aactgwtgac agaaagtttg gttggactag cgtcggcagt tgaagtaaaa actataatac 180 agtcttaaat ttgcaatttg cttcaagtwa aggtaaattt aatattctaa aaggtttgtc 240 aaaagtttga ttattcctaa atttattata gttcctagaa cgaaagwtta cagtacggaa 300 gaatacggcg ttgagtatat tccgtcaaca atttkcttam aagtaagaca tgcaatataa 360 cctaawccta awcgaacaat aaaagtaatg aatttgttat acctaggaat tawattagtg 420 gcacacagta aagtgcatca agaaaatcga gtagtttcgg tagcgtcgag ataagcacga 480 aaatgtaagt cttgagatgt aaaaatgatt aattgaaact aaaatcctta ataattacag 540 ctttaaagct acaattgctg ctaaaagagc gtgtttctcc taccgatccg aaaca 595 // ID BEL-105_AA-LTR repbase; DNA; INV; 502 BP. XX AC supercont1.4; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-105_AA_; KW BEL-105_AA-I; BEL-105_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-502 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.4; Positions 2091793 2092294. XX SQ Sequence 502 BP; 150 A; 90 C; 121 G; 141 T; 0 other; tgaatttgta tcgtataccc tattttgaga agaaaagaga gtaagtgatt acgagagtag 60 tgcgcgtttc gattaatgtc cacatgtctg cgataaatgg tttatagaca gagtgaagcg 120 aatcgtgtat ataggagagt aaggtcagtt cgagctagac cgtcggagtg ataaccttcg 180 taggttactt tgtgattgat tcggtcgttt cgcgtgtgaa ggaaagatta attgccaagt 240 gaaatagccc ttttctaata agtgtccaaa agtgtaacga atctaaattg gctaacggca 300 acaacttgaa ccagtacaag tgaccaaacc ctgaaccgaa tagcattgtc agaccagtgg 360 cctgtaattc caatgaggat ttgctaagtg cggacgacaa tatccagttt cttgtttggt 420 acactcatcg cagttatttg tctgggccgt ttatcactaa acggtggaaa ttccttcgac 480 gaagaacggg taagccctaa ca 502 // ID Harbinger-N7_BF repbase; DNA; INV; 361 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N7_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N7_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-361 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-361 RA Kapitonov V. and Jurka J.; RT "Harbinger-N7_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 820-820 (2008). XX DR [2] (Consensus) XX CC It is a very young family on non-autonomous Harbingers: copies CC are 99% identical to the consensus CC (TWA TSDs, 32-bp TIRs). XX SQ Sequence 361 BP; 136 A; 50 C; 70 G; 105 T; 0 other; ggccacacca atttaatttc ttggttcacg gattttttca taaaaaatat ggagcgagag 60 ggcgaaataa aaataaaaat tgtaaaatgg ttggggtaaa ggtaacggct aatccaaaac 120 ataaagaaaa aaagttttca gcttgaaaaa agtacaaaaa cactattatt gtacagtaac 180 agcacctgta cccacacttt aaggagctta taatataaag gccaatttgc tacaccagaa 240 agttggtgaa tggtttcatt aatggtgaaa atttgctgag tggtaatttt tatttttatt 300 tttttttcca aaaaatagga gcgagcgaat ccgtgaacca agaaattaaa ttggtgtggc 360 c 361 // ID SKIPPER_I repbase; DNA; INV; 6219 BP. XX AC AF049230; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 08-AUG-2007 (Rel. 12.07, Last updated, Version 2) XX DE Dictyostelium discoideum LTR-retrotransposon Skipper, Gag (gag) DE gene, complete cds; and Pro (pro) and Pol (pol) genes, partial DE cds. XX KW LTR Retrotransposon; Transposable Element; SKIPPER; SKIPPER_I; KW SKIPPER_LTR; internal portion. XX NM SKIPPER. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-6219 RA Leng P., Klatte H.D., Schumann G., Boeke D.J. and Steck L.T.; RT "Skipper, an LTR retrotransposon of Dictyostelium."; RL Nucleic Acids Res 26(8), 2008-2015 (1998). XX RN [2] RP 1-6219 RA Schumann G.; RT "SKIPPER."; RL Direct Submission to Genbank (17-FEB-1998)Molecular Biology and RL Genetics, Johns Hopkins University, 725 N. Wolfe Street, RL Baltimore, MD 21205, USA. XX DR GenBank; AF049230; Positions 391 6608. XX SQ Sequence 6219 BP; 2456 A; 1054 C; 955 G; 1754 T; 0 other; cttttttttt ctaattttaa cataaaacaa atacattaaa acaaataatt agatacaatc 60 cttatatttc gtatacaatg acacctcatg atgaagaagt cgactcttca gattagagga 120 ataacagtag tatttaaaac aaaataaaaa tgcattacaa tggcccttct gtttgcaaga 180 ttaacgctgt actgcttagg cggtacaacg acacttccta ctcacattgt aaacatgcat 240 cttttataaa gtaatataaa acaagattgt atcaattaat ttaataataa tgctcaaata 300 caaagacacc acaatataaa ttattagtgg aataacagtt aaacaaagac accacaatat 360 aaattattag tggaataaca gttaaacaaa gacaccacaa tataaattat tagtggaata 420 acagttaaat aaagacacca caatataaat tattagtgga ataacagtta tattgatgcc 480 attgttataa ttaaaataat ttaagataat atttattatc aattaaataa tataaagaca 540 ccacaaaaaa aatatattag tggaataaca gttaacatat attaatactt atatttaatt 600 attcaattaa aaattaaatt aaattaaatt aaataaatta aattaaatta tataattaaa 660 ttatataatt taaattataa aattaaatta taaaattaaa ttatatttaa taaattttat 720 atataattgt tggtgaaagt catgtcatca actactactc cttcaaagaa agccatccgt 780 aaagccaaag tgtcttccac cgccaaacgt gccactgaag tgacaaccgc ctccgatgat 840 gctattactg ttcaagtaaa atcaaaaaag actcttcgtc ccactgaagt cgatgatgaa 900 atttattcaa ctgaaagcac tgatgtatcc gacgtcgaag tcagagagaa agtgccaaag 960 aaatcaaaga ccaattccat tgaagctaaa accattgacg ctttaaaaaa tgctgccact 1020 gctgtcatct tcatgaatct ctttaccatc cactgctccc agaatggtgt tgaaccaacc 1080 ttgaagacca ttatggacaa tggtgctgca tcaaatgaat tatttcaatt gcttcaacct 1140 acccacaatg aatcagttaa gaacaacgat aaaattgaag gtgaaacatt gaagaagcta 1200 atacaccgta actttagatc caagactttg gttccattca aaactgacta cgaagctgcc 1260 aacactaaag tagtcgaagc tattaaccat cgtaactgct actcagccca agatgttata 1320 aaaaacatcg aagccattat gagttaccac aatgtaatta atccggtgta tctaatccat 1380 ggtactgctg aagttcaatt agagtatcta agatcaattt tagataataa tatccttgct 1440 ttgttaaaga tggtcgtacc agactggtta caaactgatg ttgcaatcaa actcaacact 1500 caaaccgata tcaccgagtg tcgtgaagcc atcaccgagt tggttcttaa caatgctgat 1560 aagttctcga gaagaagacc ttccgattcc gcatcgaata atgatgagaa gtttccatta 1620 aaaaagaatc atgctactcc aaatcataac tctcataaca ggaattcctt tgatgcacaa 1680 gctgataaga ttagagctgc aatattacct gaaataagaa aagactctaa ggtagacctc 1740 aaaaagataa gaagcagttt catcgtttac cgtcagagca aaggctactg tctcaattgt 1800 ggtaaatcaa atcactccac atctacatgt agaattgatc cggtcgatgc caaagccaat 1860 ccaggtccat cagccaaaaa gcaacaatga ctcgatgaag gtgatgagtc caagtcaaag 1920 attttaaata aaaattataa aatttcacaa tatcatcacc ttgaggataa tgattaccca 1980 gtgaacaaag attcacttta tactcttaac tctttttata atcaaaacat tgttaaacag 2040 ttacatcaac aaagccagca acatgatgtt cataaaaaca tcaacaaact taaagaatac 2100 gtcaaagact catatggtat aaaactagtt cctgatacaa ccgttccaat tccaaaagct 2160 gttgaatcag agatgaagta taaaccacag aagattgaat ttaatactcc aacagagtta 2220 acaaatgttc aagataaaaa ctatactgta aacatattca atcgtccaaa gatattgaat 2280 tctagtaaag aattaattaa cgatgaagaa gatgaattgc tttacacaat taaattcgat 2340 aaagaaacaa acaaaccaat catctcatta ttacatagta ctggtacaag tgatactgct 2400 ctcattccga ctctcaaatg tttaatcgtt ccaaacagag aaaacatgga taaacccaag 2460 attataatta actctcattc caaaggaaaa caattcaaat taattctcga tactggaagt 2520 aatgttagtt taatcaataa gaggttgatt tcaaaagata tgaagaacca cgttcacaaa 2580 acgaaagcca agattaactt cccgctacta gacgtcaaag aagagtttgt tgaacaaatc 2640 aatatacaag ttaacaatga aattcataac ttctttatta tagaattgga tatcattgat 2700 gtacttttcg gtaatgatat ccttaaggac tctatcatta accaaaaaga taaaataatc 2760 aagttgaata acagtacata caaaattaat tatcaacagt cgaatcaatt acattgtaat 2820 gtaaaagctc caagaaaaca tgttaccgta tcccaatgat aatcagtcac agtcacataa 2880 tctactagat gatgatgaag atatcaaaga acaagttaca aagtttgttg aaagcatccc 2940 atcacaactt tgtgatatac tggttaaaac taatccagaa gaagaagaga tacaagcagt 3000 acgtgatttc attaatgatt cattcgaaga tgttatagta gataaacttc ctgatattcc 3060 agatcagatt aacaattcca gaagaggtaa tatcatacat aatatcatat taaagaaaga 3120 tcaagatgtt gaaccaacaa agaaacaaat ctattattca acagatgatc ataagagaca 3180 tgtcgaagaa atggttctca aatttataga tttaggtatt atcaaaagat cagaatcaaa 3240 ttactcatct ccaatcatgt tattaaagaa gagagattca tggcgtgtag tccacgacta 3300 tcgtcaacta aataaagtca ctgttagaga cgatcatcca ttcactccgg ttgatagtct 3360 tttaaatcaa tgtaaagatt caaagctttt ctcaaagttt gacatgatta tgggttattt 3420 ccaagtgtta attaatccag aacacgcaaa gtacaccgct ttcattacac atattggtaa 3480 atttgaatat actagaatgc cacaaggttt agtcaactct ccatccacat tcgccagatt 3540 gatggtcgaa atatttggaa aaatcaaaag tttattacaa tactttgatg atcttttggt 3600 tcattcaaaa ctcgactaca tggtacattt cattgaaatc attagaatgc ttctatattg 3660 tagaaagtac ctattattca tttcacgaga aaagagtgaa atgttaaaga ctgaagtaga 3720 tttccttggt ttccacattc ataaagatgg tatatctcca agagctgcaa aggttagagc 3780 tatctctgag ttacctgaac caagaaacgc caaagaagct gaagctgcat taggtctttt 3840 tggattcttc aggagacaca ttgaaaatta cgctgaaaaa acctatcacc tttccaaaga 3900 atcaaaagga aaaaacaaga aaacactttc tgatgaatca ctcaaagaat tcaataatct 3960 aaagaaagag tttgaaggtg aaaatattgt cgctattcca attgaacaag ataactccat 4020 accaatagat atcgaaaagg ttaaagcatc aacagacatg ccaatccatt cagataataa 4080 taatctaaac aatggtagtt ttcacttgta ttgtgatgtt agtgataaag cattatcagg 4140 tgtattatat caaattcaag gtaataaatt caaagtcatt tggtttcatt gtagaaaact 4200 tactgatact caaaagaggt acagcatagg tgatagagag ttcctttcaa tcattgattc 4260 tctaaagaag tttcaacatt tattaattgg taaaaaggtt tcaatctaca ctgatcacca 4320 aaatcttaca tatattatca ataagtcaaa cgataaaccg ttcacaaaga gacaagataa 4380 ttatatgaaa tatattaaag aatttgatta tgaattaaga catataagtg gtaaaaagaa 4440 tggtatagct gatttcttat ctcgtaaata tgataatttc caatgggatg aatcgttttt 4500 aaataaaatt aaggaagaac aaatcaattc tcaatggcta ttagagatga aaaagaatcc 4560 aaatttatgc attgaagaaa ttaatgatat ctgttacctt agtgaagatg ggttcaagaa 4620 attaatcatc atagataaag aaactattca taccgtcatt agagaatatc acgataccaa 4680 atatagtggt catcacgctt tagatatcac atacaataat ataagacaag attactattt 4740 caaagaaatg ttttctatta tcaagaggta cataaagtct tgtgcaacat gtcaattgaa 4800 cattaaccgt aaagataatg gtatactcca aagtttagaa attccctttg aagtttggag 4860 agatatatca atcgatttct tatcgctacc aaaaacaatg tatgcaataa acggatttac 4920 agttgaagtc gatcaagttt gtgtcattgt ttgtagatta tctaaaatgg ttcatatagt 4980 tccatgtcat aagacaatag atgcccaaca tactgctcaa ttattgttga atcatgtgtt 5040 cagactccat ggttatccaa gaacaatcgt ctccgataga gatccaagat tcctttcaga 5100 aatatgggag agatgggcaa agacgatgga ttctaaactc aagatgactg tggcacacag 5160 agcacaagct gatggtcaaa ctgaaagaat gaatagagaa attattagaa tattaactaa 5220 agcctcaaca gagtatgggg agaattggtc agatatcatt ccacttattg aatttgcaat 5280 gaactcatca atgagcaagt ctacaaagat gtcaccattc caaatcgtat atggattcaa 5340 tccaccaact cccgttaatc atttcaattc attaacaaag actcgtatac caatgagcaa 5400 catcaaaaag attgtccgtg ataatatact agatgctcaa atcaatgctc caaagtatta 5460 caaccgaggt cgtggtgatg ttatcttcgt agtcggagaa aaggtgatgg ttaaaagaaa 5520 attcttccaa actaatcttt caaaagatct aatctctcac aaacttgaat ccaagaattg 5580 tggaccattc ataatcactg ccgttcatgg taataatgta acactcgatt tagtaggtta 5640 tccaaagaaa cacaatgtgt tcaacaagga tcaaatcgta aaactatacg aggacagtga 5700 gtggttaaga gaagagatat cgatgcccga acctgaggag atggacgaag caagttacga 5760 agttgaaagt attctcaatc acgataaagt caaaaagatg tacctagtga agttcaaagg 5820 ttacccagaa cctgaatgga tcagagaagt cgatacagac tgtgaagagt tagtcagaga 5880 gtattggaat aacgtgcaga agaaacaatt gaactcaaga agagaaagag agaatgtaac 5940 tacagaagca gttgagccaa ttgtgtcacc tccactctcc acagaagttc aaagttctca 6000 aacattgcaa ccaattactc caagacaaca aaactccatc tcaaatcaaa gaaaccaatc 6060 aaagaaaaga aaatcaagaa atcaaaatac ttcaagtgat gatgatgaac atgatcttag 6120 tttacaaatt aaaacaacta gatcaggaag aaaagttaca cccaagagtt tatagtccac 6180 ctctgagcca tagctggcct aaagaaggaa ggggaaggt 6219 // ID hAT-50_HM repbase; DNA; INV; 5247 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-50_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5247 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2038-2038 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1004..1426,1440..3260) FT /product="hAT-50_HM_1p" FT /translation="MFVCVYVGVFAYKFILMSRTPQDGNCLFSAVSIALIG FT NNTLATTLRKLCAIELYKNANFFAVHPVFRLGHLSGVFRSERSAFLLGISN FT VACTAFEAKKSKVDAILAEALNISQSGVWSSLVCVMSISNVLKVYSHLYKI FT NIILLFYLLYRYYYFSIFFFLFQIPLKLFYPTTGDACLEYLFRSTLYPQTF FT KKFDEPYQIIYLLLYNVESKKEIPDNLTHMGKMHYFKPNHFVPLVSIISTQ FT PEFRNILVRTQTSLNSSKNIVVTGEKRPSNNSLLQIPCSAVQKINLTKPSL FT SIKNFFSVNNKPLPVSSSSSVLSNKFEKPISGVVIPKLNVNCENATTEDDC FT NEFDIGLFVDKVGKLSNQEKRNILLNVWIPPSDYVFPKTNGRKCNPAWFKK FT FSWLCYSKYYDGLFCISCVLFGKASSTSQLFHEYPFRFWSVGNVRIISHNN FT TSPMHKTSIQIMLDFQAVISKKTVPINLQLNKFLSDNISRNRQILFSISRA FT ILFCCRQNISLRGHRDDAKHLALQCNNAGNFQCLLDLMALCGDKILSDHFK FT NGPKNATYRSKTVQNELIDICAGVVRSLLTNGIKKAKFFSILADEASDVSQ FT TEQMAVVVRYVDECCKVKEDFLKFVSCNSGLTGVLLSTKIKKSIHKLGLEM FT SYCRGQGYDGAGNMAGRLCGVAALILKAIYPKAPYVHCYSHQLNLCVAKAC FT AIPSIRDMMDHVRIVSDFFNNSPKRLEDLSTKKLLSYVRKLITFKRQ*" XX SQ Sequence 5247 BP; 1686 A; 805 C; 891 G; 1864 T; 1 other; caggggcgga tctaggctgt agcaagtgta gcaattgcta caaccagctt agttgtcgtt 60 tttctttttt tttatatttg atatatacat tctttttttc atgcaaaaga agaatataat 120 tttaacaaga taggttgcaa gtagtgaaga cgcggcgctc cgaaaattta ttcataattg 180 tagctgagcc aagttgacta tcttacggaa atattgcatt cagaaatata ataaaaaacc 240 caagtttttt gggttttgtc tcacatgctc aaatattttt gaaagaatgt caaacatttc 300 gcagcgtatc aacttaaaat ggactgggag ttaaataaaa tggcaaatcg actgtagcaa 360 ctgtagcaaa agtcaggttg tcattatttc aaactactct atacataccc cctgatactt 420 tataaaaatt atgcctattt atatgcaatc tttttattaa caaggaggtt ttaagttagg 480 actaaaggtc atcaagcgag aacctgtagc aagtgtagca aaaaaacaaa aatctgcttt 540 gctaaatcaa aaaatctgcg ttaattaatt tttttaatga gtgcaatgga gctctatcaa 600 gcgttaaaag caaacaacca ttctttaatt gatcagatta agttaagtaa cactgccaaa 660 ttaaacaagc gctcccagaa aaactctgat ggaatctata ttttatataa attgtctggt 720 ggagccgatg gttttatgga tgttgcagca agtaatagtt tggctagatt gtatgctgat 780 aaacttataa acgaagaaac atgttcaatg gtgcctgtca ggtaaaatct ttgttatagt 840 ttttttggtg tccagctgac taacgcaatg gatgaataga tatttagaat ttttagttag 900 tgctagattt gctatatcaa aagaacttct tgatttatag tcaaaaaatg tgtgtgtgtg 960 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgggcgtg tgtatgtttg tgtgcgtgta 1020 tgtgggtgta tttgcatata aatttattct tatgtccaga actcctcaag atggcaattg 1080 tttattttct gctgtctcca ttgctttaat cggaaacaat acgctagcaa caactcttag 1140 gaagctttgt gcaatcgagt tatacaagaa tgcaaatttt tttgctgtgc atcctgtttt 1200 tcgtctgggg catttaagtg gtgtttttcg atcagaaaga tccgctttct tgttaggaat 1260 atccaatgtt gcatgtactg catttgaagc taaaaagtct aaagttgatg ctattttggc 1320 tgaagcatta aatatttctc aatctggtgt ttggtcatct ttagtttgtg tgatgtcaat 1380 ctcaaatgtt ttaaaagtat attctcattt gtataaaatt aacatttaaa ctgatttgaa 1440 ttttattgtt ttatttatta tatagatatt attatttttc tatttttttt tttctatttc 1500 agattccatt gaaactcttt tatcctacta caggagatgc atgtttagaa tatttattta 1560 gaagtacatt gtacccacag acttttaaga aatttgatga accctatcaa ataatttact 1620 tattgcttta caatgttgaa tcaaagaaag aaattccaga caatcttaca catatgggaa 1680 aaatgcatta tttcaagcct aatcattttg tccctctagt atctattata agcactcagc 1740 cagagtttcg aaatatttta gttcgtaccc aaacaagttt aaattcttca aaaaatattg 1800 ttgtaactgg tgaaaaacgc ccttcaaata attcacttct tcaaatacct tgttctgctg 1860 ttcaaaaaat taatttaacc aaaccatcat tgagtattaa gaattttttt tctgtgaaca 1920 ataaaccttt gccagtatcc agttcgtctt cggttttaag taataaattt gaaaaaccaa 1980 tatcaggagt cgtaattcca aaattaaatg taaattgtga aaatgcaaca acagaagatg 2040 attgcaacga atttgatatt ggcttgtttg tagacaaagt agggaaatta tcaaaccaag 2100 agaagcgtaa tattctattg aacgtatgga taccaccttc tgactatgtt tttcctaaaa 2160 ctaatggtcg taaatgtaac cctgcttggt tcaaaaaatt ttcatggctc tgttactcaa 2220 agtactatga tgggttattc tgtatatctt gtgttctgtt tggaaaggct tcttcaacta 2280 gtcaattatt tcatgaatat ccgtttcggt tttggtcagt tggaaatgta agaataatat 2340 cacataataa cacttcacca atgcataaaa cctccataca aataatgctt gatttccaag 2400 cagtaattag taaaaaaacc gttccaatta atttgcaact caataaattt ctatctgata 2460 atatttctag aaatcggcaa attttgtttt caatatcacg tgcaatcctt ttttgttgta 2520 ggcagaatat atcacttaga ggacatcgag atgatgctaa acatcttgct ttgcaatgta 2580 ataatgctgg taatttccaa tgcttattag atctgatggc attatgcggt gacaagatct 2640 tgagtgacca ttttaaaaat ggtcctaaaa atgccactta tcgatcaaag acagtgcaga 2700 atgagttgat tgatatttgt gcaggtgttg ttcgatcgct attgacaaac ggaataaaga 2760 aagcaaaatt tttttctatt ttagcagatg aagcttcaga tgttagtcaa actgaacaaa 2820 tggctgtagt tgttcggtat gtggatgaat gctgcaaagt aaaggaagat tttttaaaat 2880 ttgtttcttg caatagtggg ttgactggag ttttgttatc aactaagatc aagaaaagta 2940 tacataaact tggtctggaa atgtcttatt gtcgtgggca aggttatgat ggggctggta 3000 acatggcagg gcgtctttgt ggtgtagcag cattaatttt aaaagctatt tatcctaaag 3060 ctccgtacgt ccactgctat tcccatcaac ttaatctgtg tgttgccaaa gcatgtgcaa 3120 ttccttcaat tcgagatatg atggatcatg ttaggattgt ctctgatttt tttaataatt 3180 ctccaaagag gttagaagat ctatctacaa aaaaattgtt gagctatgtc cgaaagctca 3240 tcactttcaa aagacaataa atgtttgtag aaccaggtgg attgaaagaa ttgatggtct 3300 tgaagctttt attgaattat atccagctat aattgcatct cttarcttgt gattaaagaa 3360 gatgagaagt catggaacta cgaatccaga gcgtgatgct tctgcatatg tgacaatatg 3420 ttgttcgttt aagtttatcg tcaccttaat catagtcagg aagttattag gagtatacaa 3480 gaccactgac aaaaacgttg caaaaagttg atcaagactt ctcaaaagct cgtgatgatg 3540 tggaaatttt aaaaaacact cttttatcta ttcggagctc tattgaaatt tcacactcag 3600 aatggtataa agaggctgtt accatagctg caactaatta taccttgcca tcataaacca 3660 agaacatgtg gacaacaaac acaacgtgat aatcgtcaag tagatgatat tagtgaatat 3720 tatcgagtaa cttgttacga ttccttttct tgatcacatt ctaacacagt tagattcttt 3780 attttcttta gaaaatttaa cgttgctaga tagtttcgtc gttataccat caaatttaat 3840 ctctgacaag aagtggagga aaaaagtaaa agaatttgct catacatacg ataatgattt 3900 accagaacca agttatcttt atgctgaatt ggatatgtgg gaagtatatt ggaaaaatca 3960 ctctgaaaca gctcctaaaa ctatttcaga ttcactacaa aaatgtgatg gcaatacgtt 4020 tccaaatatt tctactatac ttcgaatcct ttgcacagta ccagttacaa catgtacttg 4080 tgaaagatca ttatcagctc tgaaaagaat taaaacttcc ttgcgcaatt ctatgacaga 4140 tgatcgccta aatggcgttt ccatgttgca cattcatcga gatgtcgaaa ttgacttaaa 4200 tattgttgta gatgaatttg cgactgcatt tccacgaaga atggagttta aaaatatatt 4260 gaactctgat gaatagatat acttgttata aagttatact gattacagat agatatagga 4320 tatataaata tatatataaa ttgatagata tataatatag gatactttta tttattactg 4380 atggaatatt taaagaggat atttttttga tgtcataaat atttcatatt tattattaac 4440 tttatgttat gaggttttgt taaaaaatag ttatagtttt ttacctctta tgacattttt 4500 ttttctttat atttttgtag ttctatcttt gccattattt tgttttagtt tgaatactgt 4560 tgaaatatgg tttgaattta gtaaattttc atgatatgaa ctttacttta ttgatcatta 4620 gccaacagtt ttggatgttg aacacaatta aggtaagaat aacttgcaat cgttaattgt 4680 catggtattg ctattttatt aatggtattt gattttgaga agatagttaa ccatcatttt 4740 tggatggtaa attatcataa gattcaccca catttaacat tatcttgata actcgagtcc 4800 ttaataaacg cccccccccc cctattgttt attaatttcc gatatttttc cccatcacac 4860 taagcttatt aagacccctt tattcctttt atttgttatc aacaaaacat ttatatctca 4920 aacctaaaat ctgcaaaaat cggccttaaa aatttgaaaa caatttttga gtctccgata 4980 aaaacaaaaa gtgtaaacag gctttaaaaa ttatcgacat agtaattgta ccgaaaatgt 5040 gttttgttta tttctcacca aggtataagc atatgcatta aaataaagta attgctactg 5100 cccctccccc ttatatttac tcgtattcgg gattcgagca catttccccc tcccttcctt 5160 ggtaacggag agtcaagagt gtcagtactc tggatttgag gacctttttt ttttgctcag 5220 tcaaaattat tcctggatcc gcccctg 5247 // ID Kolobok-8_HM repbase; DNA; INV; 2737 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2737 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2066-2066 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 398..2170 FT /product="Kolobok-8_HM_1p" FT /translation="MGRSIARTKKRKFSCNQHLKSSKFIQLDSSISDPSLS FT ASSRKLVNTEISGSQKNYNIIINFNILKLMVSEFKCPKCGNEVCLENDLKS FT KKGFCFYLTLQCLCCPFSKGWHTSQFSTKKSKSTPGVPTASVNFQVVTAFR FT EIGKGYQSICKFAAVMNMPSPMNFKNYSAINNNLLNIYNEVASASMSSAAN FT ETKDIISNSSNVNDVSACQVSVDGTWQKRGHQSLNGIVTAISRENGKCIDS FT ILLTKFCKACSYWNKKKSTPGYQNWKINHVCEANHNKSSGSMEASGAVSIF FT QRSIEKHSLRYEGYIGDGDSSAYNDVVASNPYPGFEINRLYCVGHYQKRLG FT NRARNLRVTLKGQFLSDGKRINGKGRLTDKAINTLQNYFGMAIRQNVNNLY FT AMKKAAWAVLFHNSDISEESERHKFCPRTRTSWCLWQSNKVTGETTYKTKL FT SLPLAIKAVLMPIFTDLTNETLLSKCLHGQTQNNNESLNALIWKRCPKDFF FT VGKKVLEISMNSAIIAFNDGSTAVEKVISAGGIEPSMFMQEGLRKLDLIRI FT KKMNHKSSAKGKLRRKQLRAIKKGYIDIEKELEKDPQYNSGGF*" XX SQ Sequence 2737 BP; 973 A; 370 C; 457 G; 936 T; 1 other; ggtggtacat ccccattaat ttcgaaattt tttagaaaaa atcaattttt tgaaaattgg 60 ttttttaaat tcaaaatttt tcatagaata atattattaa ttattttttt aataaataaa 120 taaggaagtg tgtacaatta tgtgttttgt ggccatagta gattgagaag tccgtagcaa 180 cgccatagca acgcatttta aaacaaaaaa atttaaactt gctcaagctt attaacatca 240 aaaactcttt tctactgcag cttcttgttt gttttttgct tcatgtttat tttaaaagct 300 ataacttgaa ctactaatgc gaacatgtat caaaatattg tgcttgcatg ctgttttagt 360 ttgttaatgt tcttatagat aaatagctaa ataagatatg ggacgttcta ttgctcgaac 420 taaaaaaaga aagttttctt gtaatcaaca cctaaaatct tcaaagttta ttcagcttga 480 ttcatcaatt tctgatccaa gtttatcagc aagctctaga aaactagtaa acactgaaat 540 atctggaagt caaaaaaatt acaatatcat tattaacttt aatattttaa aactaatggt 600 tagtgagttt aaatgcccta aatgtggaaa tgaagtttgt ttggaaaatg acttaaaaag 660 caagaaaggt ttttgttttt acttaacttt acagtgcctt tgttgcccat ttagcaaagg 720 atggcacacg tctcagtttt caactaaaaa atctaaaagt actccaggtg ttcctactgc 780 atcagttaat tttcaagttg ttactgcttt tcgtgaaatt ggtaaaggtt atcaatctat 840 atgcaaattt gctgcagtaa tgaacatgcc atctcctatg aattttaaaa attatagtgc 900 aattaataac aacttgttaa atatttataa tgaagttgct tctgcaagta tgtcaagtgc 960 tgccaatgaa accaaagata ttatttctaa ttcaagcaat gttaatgatg tttctgcttg 1020 ccaagtttct gttgatggta cctggcaaaa gagaggtcat caatctttaa atggtattgt 1080 cactgcaata tcaagagaga atggaaaatg tattgattca attttgttaa ctaagttctg 1140 caaagcttgt agttattgga acaaaaaaaa gtcaacacca ggttatcaaa actggaaaat 1200 aaatcatgtt tgtgaagcta accacaataa atcatcagga tccatggaag cttctggtgc 1260 tgtttcaatt tttcaacggt caattgaaaa gcatagtttg agatatgaag gatatattgg 1320 agacggtgac agctcagcat ataatgatgt tgttgcaagt aatccatatc caggttttga 1380 aataaacaga ctttattgtg ttggacatta ccaaaagaga cttggaaatc gtgcgcgtaa 1440 ccttagagtt acacttaaag gtcagtttct ttcagaygga aaaagaataa atggaaaagg 1500 tcgtctgaca gataaagcca taaatacgct tcaaaattat tttggtatgg ctatccgtca 1560 aaatgtgaat aacctttatg caatgaaaaa agctgcttgg gctgttttat ttcacaacag 1620 tgacataagt gaagagtcag agagacacaa attttgccca cgaactagaa ctagttggtg 1680 tttgtggcag agtaataaag taactggtga aacaacatac aaaaccaaat taagcttacc 1740 attagcaatt aaagcagtac ttatgcctat atttacagat ttgacaaatg aaacattgtt 1800 atcaaaatgt ttgcacggac aaactcaaaa taacaatgaa agtctgaatg ccttaatatg 1860 gaaaagatgt cctaaagatt tttttgttgg gaaaaaagtt cttgagataa gtatgaactc 1920 agctattata gcatttaatg atggaagcac agcagttgaa aaagtaatta gtgctggagg 1980 tattgaacca agtatgttta tgcaagaagg acttcgtaaa ttggacctca tccgaattaa 2040 aaaaatgaat cacaaatcaa gtgctaaagg taaattaagg agaaaacagc ttagagccat 2100 caaaaaaggt tacatagata ttgaaaaaga actggaaaag gacccacaat ataatagtgg 2160 aggtttttaa ttgagcttgt acttcttttc ctaatataac gaacttttat tttgtttttt 2220 gtgtttttct caaaatgtca tttttaccaa atacctaaat tttaaaacgc aatattttca 2280 gtttgggacc atatgctgac ttgaaatttt cagggtgctt ttgttataaa ttgacttagg 2340 taatgaacca aaattacaaa taaatatcat gtataagtat gtttaaaatc tttttttctt 2400 tttttttttt gttaaaaatt ttcttaaatt tatagaaaat tacttttaaa atgttctatg 2460 catctttaat gcatgatttt ggttcataac ctctttgtac cttttaggaa cataatgtga 2520 aaatttcaga gtgaaaggat atatagaact ggagatatct tgttttaaat tacacccata 2580 ttttggcaat tcttatggaa taagtgaggc aaaagttggg caatcgagat ttcattttaa 2640 aaaaaataat cagcatactt ttttaaactt gtgtaatgta gttttaattt tgtatagatt 2700 attttaaaga aattttttag aatgggggtg taccacc 2737 // ID BEL-638_AA-LTR repbase; DNA; INV; 230 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-638_AA_; KW Pao_Bel_Ele78; BEL-638_AA-I; BEL-638_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-230 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 230 BP; 82 A; 56 C; 37 G; 55 T; 0 other; tgttggtctg gcaacacgga caacacgcca tcatcatgaa cacgactaga aataaacaga 60 aaaacattac atactttgtc aaacacccat cacacaaaca cacatgtatt cttcaaacat 120 ttgttcaaag taaaattgac tgttgtacta aacagacgcg ttcgagaatc gttcctccga 180 aaaaaccgaa acttgttcgt tacagtccat ttccgcgagt gtcgaggaca 230 // ID Sola3-1N1_AA repbase; DNA; INV; 3337 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 2) XX DE Sola3 DNA transposons from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Sola3; Sola3-1_AA; Sola3-1N1_AA. XX NM Sola3-1N1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3337 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC The region 2614-2330 is an inserted FEILAI-1B_AAe (~96% identical CC to the consensus). XX SQ Sequence 3337 BP; 1133 A; 602 C; 536 G; 1065 T; 1 other; gagccagttc gcgagagccg caaccccttc ttgtattgat gttctccgat caggctgaaa 60 ttttcaggga ttgttctact atataaaaga tgatgttttg caaaatttta gatttttata 120 ttagggggaa gtgggacaaa atgaccccca atgatttcat gtcactaaac atgcaaaatc 180 acaaaaactg atataactat agtaaaagtg cacggattat gttgaaattt ggcatgaata 240 ctctacgtta tatgaacttt gagattggca tggtatatgg attttgaaag tacttcaaat 300 cgagaaaaat gagtttttca taaaaaattt agtgattttc aaatcgattt ttttcttacc 360 aaccgaacta ggtcaaaaaa ctaaatatga cgttttgtag ggtaactcat gggctttcat 420 ttgaggtgta atccagatat gccgtttcaa aaattttcga aaaaaatttt tttttcgggg 480 tagtgtttga acttgagcca ttttgcgagt accgcaaccg ccttgtctag agaggttgta 540 ttgatgttct ccgatcgagc tgaaatttac aggagttgtt ttcctatata aaagaatata 600 ttccgcaaaa tttcagattt ttatattagg gagaagtggt acaaaataac cactaatgat 660 ttaatatata aaaacataca aaatcaatta aagtgctata atagcgatag taaaaataaa 720 cgaatttgca tgaaatttgg catgtttaat taacgttata tgaacttcaa aatgaacatg 780 gtatttggat tagaaatgtg tttcaaattg ataaaaaaaa tctgttttat tttttttttt 840 aataaaatta gttttttttt tcaaatcgac ttatattttt gtcaatcgaa ctagttcaaa 900 caactaaata tgaagttttg tagggaaact caggagcttt cattcgcggt gtaccgtagt 960 gaatcaaaat tcggacggta ccaatatccg gacactctga taatatttac caataaaatg 1020 ctaaattgac aacatttata caactatatt gaatgaaaaa atagctgttt ttgtagtgta 1080 gtgattgtaa cataataata cacaaagtta tagtttaaaa gtttagtgac accacattga 1140 aaaaatatgc tgtaaatatg gtttaaaaaa gtaaaaagag gctgtccgga attggagccc 1200 aatgtttttg aatctggaca ttgtattgga attgtatgtt tcgatgccgt aagtacacaa 1260 ttttcaaata aaactggaat aatactttga acatcatcaa aacatgagta atatgtaata 1320 agaacaatac tgtgcgtcgt tcattgaacc aatcagtgat gattgcartt taaaccacca 1380 acatgtctgt ccaaacaact aaatcacaaa cattgcttac acaatatgtg ttcattacta 1440 ttcaaaaaga aacccatttg gcagaactac actttttata agggaaaagt gggtcaaaat 1500 gacccccaat atttttgtgt cggaaaacac acaaaatcac aagaactact gtaccgttta 1560 aacggattgt actgaaaaat atgttttttt ttttgtaaaa aacacacaat tttccgtgga 1620 aatatgcaat ttttaacaat cttcgtctct cttttactct ttcgagaaat ttgcaaacat 1680 caaggctagt aaaagtcaaa attccattca agatataaac agtacagtgt ctcatagcag 1740 ccatatgtgc agtcagtcta aactaagcta aactcaatta tttacaaatt gatctctagc 1800 aaacatacca tcatacacac atacatgtga ccctcctaat caaacccatc aaacattatc 1860 caaattaaaa aaaaatgcgg tagttgatac ccatccaaac agcgtgagaa gaaacaacgc 1920 ttactactga accgccgaag ctgctaacga aaacattggt gcaatgactt attgttatcc 1980 acatcatcac ttttgcaaga aagtgacatt tctttcagat acggcagcgg caacacggta 2040 tgatataatc ggtattacaa ctaccgacaa gaggcctcca tacgctgctc ataccgcccg 2100 gattttgatt taccacggta caccgcgaat gaaagccctt gagttgtcct acaaaacttc 2160 aacctagttc ggttgaccaa aaataagtcg atttgaaaaa aaaaactaat tttacaaaat 2220 aacaggtttt tctcgatttg aaacattttt ttatccaaaa tgatcattca tataacgtaa 2280 agtaaacatg ccaaatttca tacaaattcg tgcattttta ctatcgctat tcttcttctt 2340 ctttctggcg ttacgtccca actgggacaa aacctgcttc tcagcttagt gttcttatga 2400 gcacttccac agttattaac tgagagcttt ctatgccgat tgaccatttt tgcatatgta 2460 tatcgtgtgg caggtacgaa gatactctat gccctgggat gtcgagagaa tttccaaccc 2520 gaaaagatcc tcgaccggtg ggattcgaac ccacgaccct cagcttggtc ttgctgaata 2580 gctgcgcgtt taccgctacg gctatctggg cccctactat cgctattata gcagtttaat 2640 taattttgta tttttttata tattaaatca ttagtggtta ttttgtacca cttctcccta 2700 atataaaaat ctgaaatttt gcgaaatatc ttcttttata taggaaaaca actcctgtaa 2760 atttcagctc gatcggagaa catcaataca acctctctag acaaggcggt tgcggtactc 2820 gcaaaatggc tcaagtccaa acactacccc gaaaaaaaat tttttttcga aaatttttga 2880 aacggcatat ctagattaca cctcaaatga aagcccatga gttaccctac aaaacgtcat 2940 atttagtttt ttgacctagt tcggttggta agaaaaaaat cgatttgaaa atcactaaat 3000 tttttatgaa aaactcattt ttctcgattt gaagtacttt caaaatccat ataccatgcc 3060 aatctcaaag tttatataac gtagagtatt catgccaaat ttcaacataa tccgtgcact 3120 tttactatag ttatatcagt ttttgtgatt ttgcatgttt agtgacatga aatcattggg 3180 ggtcattttg tcccacttcc ccctaatata aaaatctaat attttgcaaa acatcatctt 3240 ttatatagaa gaacaatccc tgaaaatttc agcctgatcg gagaacatca atacaacttc 3300 ccttggaaag ggggttgcgg ttctcgcgaa ctggctc 3337 // ID I_Ele10 repbase; DNA; INV; 6960 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele10. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6960 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6960 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 654..2231 FT /product="I_Ele10_1p" FT /translation="MVTGPGGPPPWGEAPQLDGAISRRVAGSFSRATLPAF FT MDRDGMYGDLHVLRLQGVNKNVLPHAPFMIRKSIQAHLGANIEGAYPEANG FT NSYALKVRNVRHFEKLLMMTKLSDGTPIEIIEHPALNTTRCVVSCRDVIDE FT SEQKIAEELKDQGVKEVRRITRKSGETRINTSTLIISLRGTNRPEFLDFGY FT IRCRTRPYYPSPMQCFNCWAFGHTKSRCKIEKGICGTCSGDHPFIADKPCT FT EAKYCSKCDTTQHAIRERSCPAYRKENDIQRIKIDQDVPYPEARRIYEESN FT DPKTYAKITATGTSSDFLNLNQKIDELMEMVKTRDRRIEQLESALNGQNTS FT TSNLDGNPPAVDCIPSXFQAMLDKQEAMFNRVIKNLIAANIEMQKEVQHLK FT SSINTXTSIAHNALTQEDSVIEFATPIESALSKTLTPEKQQPNDDPLDIYS FT DDSHPLDNIPSSSKHMGTPRPPVSPMIRKKETSSLTKSHPLTPKRPYSKAN FT SPDKLQDSVNASKVQRQSSLTRKNVSKNSK" FT CDS 2285..6625 FT /product="I_Ele10_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTPQPTNEDNCLDSTSSSRGAVGPEAKPLPEPSGRLL FT HSNSKLDEENLKKNSASSSRGAGRPKARTQSEQPGRLRHSTDEDPAIAPGS FT WGTDNADVLPTPVLADNPQRRGEAEGGTDKGENTYSPQDVEGIKTVMPLDL FT GTHKGGNPCPPKDSVGVKNHQPTLPEHSTSTDEINTFFPTVGTSLPLVGGR FT TPPATGLALSCSAHSSSPPASSTQTSEANRFCTQSNVSSSFTGNTNFPTPS FT LPRPTNFILQWNINGFLNNLGDLELLTQTSPPWCLALQEVNKVTAEQLNRT FT LRGQYEWTIKKGNNFRHSVAIGVLRTIPFEFLHLRSDVPAVGVRLQNPLNI FT SVINVYFPCGALPNLHEQVKSLLEAAPGPVICLGDMNAHHPAWGGNRADRR FT GTTLLDLFEELDLVVLNDGSDTFYNGHYSSAIDVTAVSRSTLPKLQWHTIT FT DLHGSDHHPIQITIAASSPITTRRPRWLYERADWEIYNASVREALRAADPT FT SIPDFNELIHKAAEAAVPRSSNKPGRKALRWWTDDTRKAVKARRKALRAAK FT RLPQGHPEKENMLNRYRMLHIQCRQTIRNAKRTSWEEFLEGMNASQTSAEL FT WNRVNALSGKRNLAPLSIQLPDKLVTDPVAVADTLGQYFSSLSAIDSYDDL FT FKRRYNPSTDSVLNFPVPYDLRGAAINEPFSLQELKFALGRSKGKSAGSDG FT IGYPLLKNLPFPGKIKLLEILNQIWLSDTFPAEWQESLVVPIPKANNPTRG FT PSSFRPIALTSCLSKIMERMVNRRLKQQLEAAGHLDRRQHAFRAGYGTNSY FT FASLGDILHNAHSEGLHTELISLDLSKAFNRTWTPLVLKQLVEWGMSGHVI FT HFIKNFLSNRTFRVLIGDNTSKLFAEETGVPQGSVIAVTLFLVAMNGVFRN FT LPSGIYIFVYADDILLVISGRTPVRTRIKAQAAVNAVVKWTSSVGFTLSAP FT KSIRCHICNTGHRITGSPITIQGQPIPLKKTVKTLGIIIDRGLTFKQHFES FT VKTNCRTRLNLVRTISRPHRSNNRAIRFRVAHAIIDSRLVYGLELTCIATG FT RLLQTLAPVYNTYVRIISGQLPCTPADAACTEAGILPFRHVISATICKKAA FT AFAEKTSGNERIFLLSEGCRILRTVANVDLPPVAKIHWYGDKSWQSGKPKI FT DSTIKDRFRAGDNSITLRASVIEWLRNNYPNHLHRYTDGSLSNRGVGIGIY FT GLDVSRSLSLAPLCSIFSAEAAAVFIAATTPAEQPILILTDSASVVSALQS FT ESPAHPWLQGILRFAPPDTTFAWIPGHCGVPGNTAADRLAGAGYASTRYTN FT KVPLQDIKRWITKSVNAQWATEWSQQSTCHLRKIKQDTQPWTDVISLKEQR FT IISRLRTGYTRLSHNMEGLPFHRICTTCNTHNKVEHFLCVCPQYEFHRRNH FT GLTGSIRENLCNDTTTLNTVICFLKDSGLYSQI" XX SQ Sequence 6960 BP; 1913 A; 1866 C; 1512 G; 1665 T; 4 other; cagtttgcaa cttgctatgg ttgaatgtag atgcgttgat cgctatctcc aactgtgaag 60 ttgtttatcc ttttattaac caaaaccggt cgatcaacaa acagttttaa attgttttta 120 ctttgttttc cgttgttata ttcgtctctt cgtacatacg gaataaccgt aatcaatcgc 180 ggctgtgttc ctgttttttt tctataccat ttgttcgttt tcgcgcggtt taacaagcga 240 ttgacattca gttttattcc aagccccttt gtcccaaagc aacttctggg caagggggta 300 gtgtggtttt taatctgtgg cgattttctc gctttcgaag atagtggtgc tgccaaattt 360 caacacgtga aagattattg gtggcgaaca aatccatctc tctgctcaca gtgcaattgt 420 gaggtcgtaa tccgggtcgt gggagtgatt ttgtgagact attacgatcc ctccccaaag 480 gggaggttga aacgcgaggg cgttcgcaac ctttcctgac atcgtaccac agacgcgttt 540 ttgaagaaag ctttagtggc agaagagctg agaagaaacc gagtgaacag tgaaactgct 600 attgcgaaat tgttacagca gtttatttag tgaacaatta gtgaattgcc tttatggtaa 660 cggggccggg gggtccccca ccttgggggg aggctcccca actggatggt gcgatctcaa 720 gacgagtcgc tggtagcttt tcacgagcta cactgccagc cttcatggac agagatggga 780 tgtacggaga tctacacgtc ctccgacttc aaggcgtgaa taaaaatgtt ctgccacacg 840 cacctttcat gattcgaaag tctatacaag ctcatcttgg tgcaaacatt gaaggagcct 900 atcctgaggc caacggaaac agctacgcgc taaaggtgag gaatgtccgc cacttcgaaa 960 agctgctaat gatgaccaaa ctctctgacg gtactcccat tgagattatc gagcaccctg 1020 cactgaacac aacgcgttgt gtggtcagtt gtcgtgatgt gatcgatgaa tcagagcaaa 1080 aaatcgcaga ggaacttaaa gaccaaggag taaaggaagt gaggcgaatc acccgaaaat 1140 ccggtgaaac tcggatcaac acatcgaccc ttattatctc tttgagggga accaacagac 1200 cggagttttt agacttcggc tacattcgct gtagaacacg tccctattac ccgtcgccca 1260 tgcagtgttt taactgctgg gctttcgggc atacgaagtc acgctgtaaa attgaaaagg 1320 gaatctgtgg tacatgctct ggagatcacc cttttatcgc cgataagccc tgcaccgaag 1380 cgaaatactg cagtaaatgt gatactaccc agcacgcaat cagagagcgg tcctgtcctg 1440 cttatagaaa agaaaacgac atccagcgta tcaaaatcga tcaagatgtg ccctaccccg 1500 aagctaggcg tatctacgag gagtcaaatg atccaaaaac gtacgcaaaa ataacagcaa 1560 caggaactag cagtgacttt ctaaacctca atcagaagat cgacgagctt atggagatgg 1620 taaaaacgcg cgaccgacgc attgaacagc ttgagagcgc tcttaatggc caaaatacat 1680 caacttcaaa cctggacgga aatcccccag cagtcgattg tatcccctcc ascttccaag 1740 caatgttaga taaacaagaa gcgatgttta atcgagttat taagaatttg atagcagcca 1800 acatagaaat gcaaaaagaa gtacaacact taaaaagctc sattaatacg cstaccagca 1860 ttgcacacaa cgcgctcacs caagaagatt ctgttatcga gtttgcaacc ccaatcgaaa 1920 gcgctctttc caaaaccctc acacccgaga aacaacagcc taatgatgat ccgctagaca 1980 tttatagcga cgattctcat cctctcgaca acattccgag ctcttccaaa cacatgggaa 2040 caccaagacc ccctgtttct cccatgatcc gcaagaagga gacgtcaagc cttacaaaaa 2100 gccatcctct cactcctaaa cgaccttaca gtaaagcaaa ttctcccgat aagctgcagg 2160 acagcgtgaa cgcgtcgaag gtacaaaggc agtcctctct tactcggaaa aacgttagca 2220 aaaactctaa atagccttcc catcaaaaca ccatcaacca ggcgaaacac cctccaccct 2280 caacatgact ccacaaccaa ccaacgaaga caattgtcta gactctacct cgagtagtcg 2340 gggcgccgtt ggaccggaag ccaaacccct accggaacct tcgggtcgac tcctgcactc 2400 gaacagtaag ttagatgaag aaaatctgaa gaaaaacagc gcctcgagca gtcggggcgc 2460 cggcagaccg aaagcccgga cccaatcgga acagccgggt cgactccgac actcgacgga 2520 cgaagatcca gcgatagccc ctggtagctg gggcaccgac aatgcggacg ttcttcccac 2580 accggtattg gcggacaacc ctcagcgccg gggggaagct gaaggaggga cggacaaggg 2640 tgagaacacc tattcccctc aggacgtcga gggaatcaag accgttatgc cgttagatct 2700 cgggacgcac aaagggggca acccctgtcc cccgaaggac agcgtgggag tcaagaacca 2760 tcagccaact ttacccgagc atagtacaag tacggacgaa ataaacacct tttttcctac 2820 cgtcgggaca tccttgccct tggtgggtgg acgaactcca cctgccactg gtctggctct 2880 gtcttgttcg gctcattcat caagtccccc agcttctagt acgcaaacca gtgaggccaa 2940 ccgtttttgt acacagagta acgtgtcttc ttctttcacc ggcaacacca attttcccac 3000 accgtctctt ccaaggccca ccaacttcat ccttcaatgg aacatcaatg gtttcctaaa 3060 caacttgggt gacctcgaac ttctcacgca gaccagtccg ccctggtgtc tggccctcca 3120 agaagttaac aaagtcactg cagagcaatt aaatcggacc ctgcgaggtc aatacgaatg 3180 gacaatcaaa aaaggaaaca attttcgaca ctcagtcgcc attggggtcc tacgtaccat 3240 cccatttgaa ttcctacatc tcagaagcga tgtccctgcc gtgggtgttc gactgcaaaa 3300 tccgctgaac atatccgtaa taaacgtata ttttccatgt ggggcgctcc cgaatctcca 3360 cgaacaagtc aaaagcctgt tagaagccgc tcccgggccc gtgatatgcc tgggggatat 3420 gaacgctcat caccctgctt ggggaggaaa tcgtgcagac cgtagaggga ctactctcct 3480 cgatcttttc gaagaactcg acttagtagt actcaacgat ggctccgaca ctttctacaa 3540 cggtcactac tctagcgcca tcgacgtcac cgcagttagt cgctcaactc tgcctaagct 3600 ccagtggcac accattactg atcttcacgg aagtgatcat catccgatac agatcaccat 3660 agctgctagc tctccgatca ctactcgtcg tcctagatgg ttgtatgaac gtgctgactg 3720 ggagatatat aatgcttctg ttcgtgaagc acttcgggct gccgatccca ccagtatccc 3780 ggactttaac gaactcatcc acaaagcagc tgaggctgca gtacccagat cgagtaacaa 3840 gcccggacga aaagcgcttc gttggtggac ggatgacacg cgaaaagctg tgaaggctag 3900 aagaaaagcg cttcgtgctg caaagcggct gccacaaggc caccctgaga aagaaaacat 3960 gcttaatcgc tatcgaatgt tgcacatcca gtgcagacaa actatcagga acgctaaacg 4020 cactagttgg gaagagtttc tggaaggtat gaacgcctca caaacttccg cggaactttg 4080 gaaccgggta aacgcactga gtggcaaaag aaacttagct ccattatcaa tacaattgcc 4140 ggataaatta gtcaccgatc ccgttgcagt cgccgatacc cttgggcaat atttctcaag 4200 tctctcggct attgatagct acgacgacct ctttaagcgc cgttacaatc cctcgacaga 4260 ctctgttctt aatttccccg ttccatacga tttaagaggt gcagccatca acgaaccctt 4320 ctcactccaa gaattaaaat tcgctcttgg ccgcagtaaa ggcaaatcag ctggatctga 4380 tggcattgga tatccgctgc taaaaaatct gccgttcccc ggaaaaatca aactcctgga 4440 aattctcaac caaatttggc tgtcggacac ttttccagcc gaatggcaag aaagcctcgt 4500 cgtccctata cccaaagcca acaatccaac tcgtggacca tcaagcttcc ggccgattgc 4560 gctcactagc tgtctctcta agattatgga acgaatggtc aatagacggc tcaaacaaca 4620 actggaggct gcaggccatc ttgaccgccg acaacacgcc tttagggctg gatacgggac 4680 taacagctac tttgcttctc tgggtgatat ccttcataac gcacattccg aagggctgca 4740 cactgagctg atctccttgg acctttccaa agccttcaac cgcacctgga ctcctctggt 4800 actcaaacag ctagtggaat gggggatgtc cgggcacgtt atccacttca tcaaaaactt 4860 tttatccaat cgcactttcc gtgtactgat tggtgacaac acatccaagc ttttcgccga 4920 agagaccggc gtaccccaag gatccgttat tgcagtcacc ctttttcttg tggcgatgaa 4980 tggtgttttc cgaaatctcc caagtgggat ttacattttc gtatacgcgg atgatatctt 5040 gctggtaatt tcaggccgca cacccgttcg caccagaata aaagctcaag ctgcagtcaa 5100 cgccgtcgtc aaatggacct cgtcggtggg gtttactctg tcggcaccca aaagcattcg 5160 ctgtcatatt tgcaacactg gacatcgaat taccggttcc cccatcacca ttcaaggtca 5220 acccattccg ctgaaaaaga ccgtcaaaac ccttggaatc ataatcgata gagggcttac 5280 attcaaacaa cattttgagt ctgtgaaaac aaattgtcgt actcggttaa acctagtgcg 5340 tacaatctct cgtccacatc gatctaacaa tcgggccata cgttttcgag ttgctcacgc 5400 aataatcgac agccgtctag tctacggtct ggaattaacg tgtatcgcta ctggaaggct 5460 gctccaaacc ctcgcaccag tgtacaatac ctacgttcgc atcatctcgg gacagttacc 5520 gtgtactccg gcggacgccg catgtactga agctggtata cttcctttcc gccatgttat 5580 atccgcaact atctgcaaga aggccgctgc tttcgccgag aaaacctccg gaaatgaaag 5640 gatcttcctc ctctccgagg ggtgcagaat ccttcgtacg gtagccaacg tggatctccc 5700 tccagtggcc aagattcatt ggtacggaga caagagttgg cagtcaggaa aaccaaaaat 5760 cgatagtaca atcaaggacc gttttcgtgc cggtgacaac tccatcacac ttcgagcgtc 5820 tgtcattgaa tggctccgta ataactaccc aaaccacctc cacagataca ctgatggctc 5880 cctatcgaac cgtggcgtgg gaatcggcat ctatggactt gacgtttcga gaagccttag 5940 tctcgcccca ctatgttcca tcttttccgc cgaagccgca gccgttttta tcgcggctac 6000 tactcctgct gaacaaccca tcctaatact tactgactca gctagtgtag tatcagctct 6060 tcagtccgag tcccccgccc atccatggct tcaaggtatt ttacggtttg ccccacctga 6120 caccaccttt gcgtggatcc ctggacattg tggtgtacct ggaaacacag ctgctgatcg 6180 cctcgctggt gcaggatacg cgtcaacaag gtatacaaac aaggttcctc ttcaggacat 6240 caaaagatgg ataaccaagt ccgtcaacgc gcagtgggca accgaatggt cccagcagag 6300 cacttgtcat ctacgcaaga tcaaacaaga tacccaacca tggaccgacg ttatttcatt 6360 aaaagaacag cggatcatct cccgactgcg gactggatac acccgtttat cgcacaacat 6420 ggaaggcttg cctttccaca gaatttgcac cacttgcaat acacacaaca aggtggaaca 6480 tttcctatgc gtctgccctc agtacgaatt tcatcgtaga aaccatggac taacggggag 6540 cattcgggaa aacctatgca acgatacgac tacactcaac actgtcatct gtttcctgaa 6600 agattcagga ctatactccc aaatataacc ttacctctac tttttcccgc agtcggggta 6660 tctctgccgg tggagggttt cgaccttcct ccgggagagc tctaccttgt tcttgcctcc 6720 cctccattcc cggtttagtg tgaaacaatt atgtttagtt tgaacctgtt gctggcgact 6780 aattttatct caccgaaaac gtgtttttca aattttgtga tagtttattt gtattttgat 6840 atgctgcttg aaagcctgta aaactcctta ttgaaagcag cctttctggc gcttttctga 6900 caaatggtga tgaactagcc gtatgcgcta aaaatcacct taataaagaa caaaaaaaaa 6960 // ID Gypsy2-I_AP repbase; DNA; INV; 4474 BP. XX AC Contig39744; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2AP; KW Gypsy2-I_AP; Gypsy2-LTR_AP. XX NM Gypsy2-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4474 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 439-439 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [3573-4085] - Integrase core CC LTRs are 91% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1696..3270 FT /product="Gypsy2-I_AP_1p" FT /translation="MADVSRSIIGADFLHYFGLLVDVRHQRLVNPADKRTV FT TCAVTPTPSARTTLLTVARSPDWDRLLQDFPGVTRESPVPESFQHRVEHVL FT KTKGPPLFARRDFELMQKQGICRPSSSAWASPLLLVPKKDGSFRPCGDYRR FT LNSVTVPDRYPLPYLHDFTANLAGKTVFTKLDLVRTYNQVPIAASDVHKTA FT VTTSFGLFEFPVMCFGLCNAAQTFQRLVNIVLAGLDFVFAYVNDVLIASTN FT AEQHVEHVRAVLSRFEEFGIAINPAKCVFAASTLTFLGHVVDAQGLRPNPD FT GVDVIRRWPQPNTKKELQRFLGSLNFYHRFVQGAANVQAPLYDISSAIKKK FT DGPLAWTDAARKAFSACREALVTSTLLVHPQRNALLRLTTDAFNIAVGAVL FT EQSVGNEWQPLGFFSRKLSGAQTRYSAYDRELLAAYLAARHFVHAIEGRFV FT TLRTDHRPLLFMFSQKAEKLIDRQARHVAFLSQYFHEVEHVSRELNVVPDA FT LSRLELLPLDNGLPNLDQWATEQASDTDL" XX SQ Sequence 4474 BP; 1038 A; 1421 C; 1061 G; 954 T; 0 other; gcgacgcgat tattaattgt ttcgctctgg tcgtcactaa gtacctatac ctactcaaaa 60 cgttggaagg ctaaccatga taacggctag ctaaccacat tggtgacctc ttcaagtttt 120 cccttataag tataggccag acggcccaga gcatcatcat cgtccccttc gctgagtgcg 180 gcgtgtaggt aagccggctg gtacatacta accagtagac acggaaacct ccgccaaacg 240 acctcacagc gccttgcctt agtctgcgta agccggccgg tactcagaga ccggtagaca 300 cggaaacgct acagactacc ccaccaacca acgacgcttg tcgcgttggc caccctgacc 360 atttttcgtt gcgaacgtcg agccatcgcc gccgacgtcc aacccttcgt aagagtgttt 420 atcgttgtga acgtcgagcc cttgctgccg ccgtccgatc cgttgtacta caattcatat 480 ccgatcaatc ctacactgga ccaccaaact accagaccag atccgtgagt tatatgttca 540 gattcccacc cctgtatgta accccaataa atatttttat tgtaaactct ccggagtata 600 attagtctaa tctatacaaa aagaaaatat tgtcaaacaa aacatcttaa ttgtattaaa 660 ttcagatgat gaacagattg gttcaaaata aataaacata catttttacc aaccaattgt 720 gtttcattac agccttttcc gcgccgagtg ctccagcgcg ttttcgcaat acacgtgtcc 780 tcttagtgcc tatccgtcgt acgagttgta taactctttt ccgcgttaat cgtcatggac 840 gtgcaaggta acgcgcctaa caaccagaac gtcgttgaca acgtgaacca agtcgtcgac 900 gccattgcaa acatgcggct tccggcattt tggaaacgct ccccgaccct gtggttcaac 960 tacgcggagt ccacgttcat aacgcatcgc gtcactgcca acgcgaccaa agtacatttc 1020 gtcggcagcg ctctagacga agaagcagtg cggaacatcg gtgaccttct gaacgcggcc 1080 gccccctact ccgacatccc cgcgcgtctc atcagcgcct atgaggtgcc gacagcgctg 1140 ctgttccgcg aaatcgtcaa gtccggaggt ttgggtgacc gccgaccgtc gcagccgcaa 1200 cagcatgcca cccggcatag gcgaagacgc gttgaaagaa ttttggctac agaaactcct 1260 gtcgaacgtc accgccatcc tcgtgggcct cgacgcaccg ttggacgaac tggccatatt 1320 atataactgg cgtatttcag acgtgtccaa cctacagagc gtcgatgtcc tatccaagga 1380 gcagtttagc gaccttgccg gtgcggtgtc cgcgttgtcg caacaaatca agtccctcac 1440 gaaaatcgtc aactccgcag aagggccctc gcgatcacgg tcacgagcac ccacgagcac 1500 ccacatcaac gttccggtaa gctgttcctc atcgacacaa gcgccgaaat ttcgctcata 1560 ccgtacaact acggttcacg ccagcagtcc gatgtgcacc tcacagccgc caacggagca 1620 cgcatcaaaa cgttcggacc caaaacactt gttcttgacc tcggactttc gcgcccattc 1680 acatggacct ttgagatggc tgatgtctca cgctccatca ttggcgccga ttttttacac 1740 tactttggcc tccttgtcga cgttcggcac cagcgcctag tcaatccggc cgacaagcga 1800 acagtcacct gtgcagtgac acccacaccg tcggcgcgca ccaccttgct caccgttgct 1860 cgctctccgg attgggaccg acttctgcag gactttccag gcgtgacccg tgagtcaccc 1920 gtccctgaat cgttccaaca cagagtggaa cacgtcctca aaaccaaagg accacccctt 1980 ttcgcgcgcc gcgactttga gctgatgcaa aaacaaggga tttgccgacc ttcgtccagc 2040 gcgtgggcca gccctctgct actagtgccc aagaaggacg ggtctttccg accatgcggt 2100 gattaccgcc gactgaacag cgtcaccgtt ccagaccggt acccgcttcc ctacctgcac 2160 gacttcaccg caaacctcgc gggaaaaacg gtcttcacta agctcgacct ggtccgcact 2220 tacaatcaag tcccgatcgc cgccagcgac gtccacaaaa ctgctgtgac aacatcgttc 2280 ggccttttcg agtttccagt aatgtgcttc ggcctatgta acgccgcaca aaccttccaa 2340 aggttagtca acatcgttct cgccggcctc gacttcgttt tcgcatacgt caacgacgtc 2400 ctcatcgcgt ccacgaacgc cgaacaacac gtcgaacacg tccgcgcagt actcagccgt 2460 tttgaggagt tcgggatagc cattaacccg gcgaagtgcg ttttcgccgc gagcacgctc 2520 acctttctcg ggcatgtagt cgacgcgcaa ggcttacgcc ccaacccgga cggcgtcgac 2580 gttatacggc ggtggccgca accaaacact aaaaaagagt tacaacgttt tcttgggtcg 2640 ctgaactttt accaccgatt cgtccaaggc gcagccaacg ttcaagcacc gctgtatgat 2700 atttcctcgg caattaaaaa aaaggacggc ccactcgcgt ggactgacgc cgcccggaag 2760 gcattctcgg catgtcgaga ggcactcgtt acctctacgc ttttggtaca cccgcagcgc 2820 aacgcactgt tgcgccttac cacagacgct ttcaacatcg cggtcggggc agtcctggag 2880 caatccgtcg gaaatgaatg gcaaccgctc ggatttttct cgcgcaaact ctccggcgcg 2940 cagacacggt acagtgcgta cgaccgtgag ttgctcgcag cctacctagc cgcacggcat 3000 ttcgtacacg cgatcgaggg caggttcgtc acgctccgca cagaccatcg cccgctacta 3060 ttcatgttct cgcagaaggc cgaaaaactt atcgaccgtc aagcccggca cgtcgcgttc 3120 ctatcgcagt atttccacga ggttgagcac gtcagccgcg aactcaacgt tgttcccgac 3180 gcactgtcac gtctggagtt gctgcccctc gacaacgggc ttcccaacct ggaccaatgg 3240 gcaactgaac aagcgagcga caccgaccta taggacattc tcaccgggaa gacagagtcg 3300 tcattaaaac tggacgcgcg gcagacggtg agcggaccga tctatttcga cattgcacat 3360 aacaggtcga ggcttttcgt cccgttgcga caacgtcgcg cggtcttcaa taccttacac 3420 cagcaggcac acggcggcag tgcggccaca gctgcgctaa tcgcgcaacg ttttgtgtgg 3480 cctggtatga accgggaaat acgacggtgg gtcaagacat gtgagcagtg ccaaaagtca 3540 aaggtacata gacacacatc gacatcgctg ctacgttcgc ggcaccggac cgacgttttg 3600 gtcacataca tttagacctg gtcggcccac tgcctgtttt ggacggcgct aagtacctcc 3660 tcacctgcgt cgacagattt acgcgttggc ccgaagcctg gcccgtcgac aacatgtccg 3720 ctcacaccgt cgcgtccacg ctggtaacaa attggatcac ccgcttcgga gtgccggccg 3780 tcatcaccac agaccaaggg cggaaatttg agtccgatct aatgcgcgca ctcaactcaa 3840 ctttcgggat tcaacacatt cgcacgtccc gctatcacct gcaggccaac gggctggtcg 3900 agcgccttca cagaacactc aaggtagcgc tcaccgtgca agaatcacca cactggtctc 3960 agcggttgcc catcgtcctc ctagccctac gcaacaccgt caaactggac accggtgcaa 4020 cacctgccga acttgtttac gggatgaccc tccgccttcc tggtgagcta ttccactccg 4080 cgccctaaga ggcgaggtcg ccggatttcg tcacggcact caagtcgtcc atggccgagc 4140 tccgaccgtc acccggatcc aatcacgacc cagcccgtcg tattttcgtg cctacgcagc 4200 tcggctccgt cagccatgtt gtcgtccgcg tcgacgcgca gcgcgctccg cttcaaccac 4260 gttacgagag gccctacgcc atgctagatc gaagagaaaa ggatttcaag ctgcagctgg 4320 gcaaccgaac gtcgtgggtg tcagtggacc gactcaaacc cgccttcgtc ctccgtgacg 4380 accagtcctg tatcattcgt acgccatgca atccgtggac agaccttcga catcgaaaac 4440 ctacggcatt ttctcctgcg tcggggggga gtga 4474 // ID BEL-15_AA-I repbase; DNA; INV; 5368 BP. XX AC supercont1.135; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_AA_; KW BEL-15_AA-LTR; BEL-15_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.135; Positions 1111928 1117295. XX CC 'TATGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 26..5341 FT /product="BEL-15_AA-I_1p" FT /translation="MDKSELEKLWAKREVMFAKAKWELSVAETLRTRNPTY FT EEVQERKDKLTELAGNFDALQTSIEESTSNLQDVASVFNYRLQFDEVYFKA FT KGLYTAFLEDNQDRMSNYSAGTGESVNDLRDAIRALVETQQALIMHQIQPS FT AAPAAGQVSNKDHHLNVKLPQLNIPVFKGERKNWYSFNDLFVSTIHSRTDL FT KDSLKMQYLLSYLDGEAKKMVSSFPISDANYSEAWETLVTHYNKKKYTVFA FT LVREFIDQPSTTTVTGLKKLVATSDDVIRQLKALGNEFESRDPWLIHLLLE FT KVDRETRSLWAQKIIDIDSPSFAEFLEFLQKRCDALETCTAFTKRPSGDIV FT KKEYTKAHGGEKVQAFHANASPSSCAKCSKDHPIYHCDRFKEMEVAARREL FT AQTSKLCFNCLRPSHSAKKCPSKSVCRSPDCKQRHHTLLCTQEARQADKKP FT EQDQSNQESGLESTLASVNVNSAQTVDENREFTLLPTVVVQVEGSDGKFHN FT VRGLVDSGSQVSLITEACVKRLGLRRSNAKLEVTGVNAEVVGKTAGKVTLV FT LSSRYEDATKLTTQAYILGKLTATLPCQRFSVSNMPYLEGLQLADPQFNKP FT GTMDIILGADLFLSILQSGQVKDHNGIPVAQRSVFGWMVAGRLPKQECIHT FT YHSIINLHQEVDIDRTLRLFWEDQELHNPKQLTKDEQRVVEAFNSTLTRSQ FT EGRFIVRLPMDDSKLKLGNSLIAATRRLRCMERRFDSDNDFKQRYMSFMQE FT YQELGHMRIVPPSEIDADCSKSYYLPHHGVIKEDSVTTKLRVVFDGSSTTT FT TGASLNDILLEAPNINADLFDVLLRFRSYPVVFLADIEKMYRQVLVHPDDT FT DYMRIVWRDSPDKPIQHFRLLTVTYGLKNSGFLAMAALHKAAEAFEERYPK FT AAERIIKHTYVDDLTSGADSAEEAIEIIKQINEILGEAGFTLRKWSSNSPE FT VLDSLSDTISSAMPIQFPDERNTVKALGTHWLPTEDMFTFKVTMPTDGPNT FT KHQLLSDSAKLFDPFGWFAPVIVRVKILYQKCWLYDLNWHDKLPPTIEEAW FT VEVKESLHLLERVKLPRWSANYNGRIELHGFSDASEEAYSAVVYLRSVESN FT NEVHVTLLAAKTKVAPVHQISIPRLELNAAELLAKLMKQVAAPLERFQIDQ FT YAWTDSTIVLQWLSGHPRKWNTYIANRTSSILDTLPRKHWAHVASKENPAD FT CASRGVSPLELVNHPLWWTGPSWLSDDSSTWNRDVPDEAYDQETLEVRKRY FT QSLNVTVSLPATIYVVESEILERRSDLSAACWQLARVRRFVNNLRCSLTGE FT NKVSGSILPSELREARMQFVRLAQQESYEGEVKALVRGEEIPAKSSIASLY FT PFLDGTGTLRVGGRLQHSVYSFDVKHPIIVPKHHRLTKLLVEEIHVNNFHA FT GPTLMTATINQRYWIQGCQTVIKQVIRRCLMCCRQKAQTAKQLMGSLPAAR FT VTACRPFAHVGVDYAGPISVRCSNTRGARCMKGYIVVFICLSSKAVHLEVA FT GDLTTDTFLGAFKRMIARRGYCNEVWSDNGTNLVGADRQLQEIYEVVTKHV FT KQGEHFFTNLGIRWRFIPPASPHQGGIWEAAVKSAKELLRPILGNEKLTHE FT ELSTVLCQVEACLNSRPLCPMSSNPDSLEALTPGHFLVGQPLNLLPEPDVT FT HLKMNQLDRWQKVQRYTEEFWQRWRDEYIATLQPRGKWKTKQENLKPGNLV FT LVKNDNSPPSAWELARIVAVHPDQQGLVRNVTLRRGKSEYQRSVQKLCPLP FT D" XX SQ Sequence 5368 BP; 1437 A; 1361 C; 1407 G; 1163 T; 0 other; ttttggtcct tcgcgtccgg gtattatgga caaaagtgag ctcgagaagt tgtgggcgaa 60 aagggaggtt atgttcgcaa aagcgaaatg ggagttgtcc gtcgcagaaa cccttcgcac 120 ccggaatccc acatacgagg aggtgcagga gcggaaggac aagctcacag aacttgctgg 180 taacttcgac gcacttcaga cctcgatcga agagagcacc agcaacttgc aagacgtcgc 240 gtcggtgttc aactatcggc tgcagttcga cgaggtgtat ttcaaggcca aagggttgta 300 cacggcgttc ctggaggata atcaggaccg gatgtccaac tacagtgccg gaaccggaga 360 atcggtcaat gacctgcggg atgcaatcag agcgctcgtc gaaacgcagc aagctttgat 420 catgcatcaa attcaaccca gtgcagcacc cgctgccgga caggtgtcga ataaggacca 480 ccatctgaac gtgaagctgc ctcagctcaa cattccagtg tttaaaggag agcgcaagaa 540 ctggtactcc ttcaacgacc tgttcgtgag taccatccac agcaggacgg acctaaagga 600 ttcgctgaag atgcagtact tactgtcgta tttggatggc gaagcgaaga agatggtcag 660 ttcgtttccc atcagtgacg ccaactatag tgaagcctgg gaaacactcg tgacgcatta 720 caacaagaag aagtatacgg tgtttgccct ggtccgagaa ttcatcgatc aaccttccac 780 gacaactgta accggcctca agaagctggt ggccacatcc gacgatgtta tacgtcagct 840 caaggctctc gggaacgagt ttgaatcgag ggatccgtgg ctcatccact tactgctgga 900 gaaggttgac agagaaaccc gatcattgtg ggcgcagaaa attatcgata tcgacagtcc 960 gtcgttcgct gaatttttgg agtttctgca gaagcgttgc gatgcactcg aaacgtgcac 1020 agcgttcaca aaacggccca gtggagacat cgtgaaaaag gagtacacga aggcacacgg 1080 tggtgaaaaa gtgcaagcct tccatgccaa cgcatcaccg tcatcgtgcg cgaagtgctc 1140 gaaggatcac ccgatctatc actgtgatcg gttcaaggaa atggaggtcg cggcgcgcag 1200 agagttggca caaacgtcca aactgtgttt caactgtttg cgcccctccc actcagcgaa 1260 gaaatgcccg tcgaagtccg tatgccgttc tccagattgc aagcaacgtc accacacctt 1320 gttgtgcaca caggaagcaa gacaggcgga caagaagccg gaacaggatc aatccaacca 1380 agaatccggt ctggaatcaa cattggcgtc ggtgaacgtc aactcagcgc aaacggtgga 1440 tgaaaatcgt gaatttaccc tactaccaac ggtggtggta caagtcgaag gaagtgatgg 1500 caaatttcac aacgttcgag gactagtgga cagcggttcg caggtgtcgc tcataacgga 1560 ggcgtgcgta aagcgtcttg ggttgagacg tagcaacgcc aaactggagg tcacaggtgt 1620 gaacgcagag gtcgtcggta aaaccgccgg caaggttaca cttgtgttgt catcgcgcta 1680 cgaggatgca acaaagctaa caactcaagc ctacattttg ggaaagctga cggcaacctt 1740 gccgtgtcag cgcttcagcg tgtccaacat gccatacttg gaaggattgc agctggccga 1800 tccgcagttt aacaagcctg gaactatgga catcattctc ggggccgatc tcttcttgtc 1860 gattctacaa tcagggcagg tcaaggatca caacggaatt cctgtggcac agcgttccgt 1920 cttcggatgg atggttgcag gaaggctgcc gaaacaagaa tgtatccaca cctaccattc 1980 gatcatcaat ctgcaccagg aggttgacat cgatcgtacg ctgcggttgt tttgggaaga 2040 tcaggagcta cacaatccca agcaactcac caaggatgaa caaagggtgg ttgaagcatt 2100 caactccact ctaacacgtt cacaggaagg tcgcttcatt gtccgtttgc caatggatga 2160 ctcgaagctc aaactgggaa actcgttgat cgccgctacc agacgactaa ggtgcatgga 2220 acgcagattc gacagcgaca acgatttcaa gcaacgctac atgtcgttca tgcaggaata 2280 ccaggaatta ggccacatga ggattgtgcc cccatcggag atcgacgcag actgctccaa 2340 gtcgtattat ctgccgcacc atggcgtcat taaagaggac agcgtcacta ccaaactcag 2400 ggtagtattt gatggctcat caactaccac gaccggagcg tcactcaacg acatactgct 2460 ggaagcgccg aacatcaacg cagatctgtt cgacgttctg ctacgattca gatcgtatcc 2520 ggtggtcttc ctagcggaca tcgaaaagat gtatcgccag gtgcttgtgc atcctgacga 2580 cacggactac atgcgcatcg tatggcgaga ttctcccgac aagccaatcc aacattttcg 2640 tcttctcacc gttacctacg gtttgaagaa ttcaggcttc ctagcaatgg cagcactaca 2700 caaggccgcg gaggcattcg aggaaaggta cccgaaagca gcggagcgaa ttatcaagca 2760 cacctacgtc gatgatctaa catcaggtgc agactcggcg gaagaagcga tagaaatcat 2820 caagcaaatc aatgaaatcc tcggagaagc agggtttact ctacgcaaat ggagttcaaa 2880 ctcgcctgaa gtactggact cgctatcgga taccatcagc agtgcaatgc cgattcaatt 2940 tccagatgaa cgcaacaccg tgaaggcgct tggaacacac tggcttccaa cagaggacat 3000 gttcacattc aaggtcacca tgccaactga cggaccgaat acgaaacatc aactgttgtc 3060 ggattcggcg aagcttttcg acccgtttgg atggttcgca ccggtgattg tacgagtcaa 3120 aatcctgtac caaaaatgtt ggctttacga tctgaactgg catgacaagc tgccacccac 3180 catcgaggaa gcgtgggtcg aagtaaagga aagtctgcat ctgctggaac gcgtcaagtt 3240 acccaggtgg tcagcaaact acaacggtcg catcgaattg cacggcttct cagatgcatc 3300 ggaggaagca tactcggctg tggtttacct gaggtcagtg gaaagcaaca acgaagttca 3360 tgtcaccttg ctggcagcga aaacgaaggt tgctcctgtt catcaaatat ccataccacg 3420 tttggagctc aacgcagcgg aactgttggc gaaactcatg aaacaagttg ccgcaccact 3480 cgaaagattt caaatcgacc aatacgcctg gacggattcc accatcgtat tgcagtggct 3540 ttccggccac ccccgcaagt ggaacaccta catagccaac agaacgtcat cgattttgga 3600 caccttgcct agaaagcact gggcacacgt cgcttccaag gagaatcctg cagattgtgc 3660 atcccgtggc gtctctccgt tggagctcgt caatcatccg ctttggtgga cagggccttc 3720 gtggctatcc gatgattctt ctacgtggaa ccgagacgta ccagacgagg cctacgacca 3780 ggaaacactg gaggtacgta agcgctacca atcgctcaat gtcaccgtta gtcttccagc 3840 cactatctac gtcgtcgaaa gcgaaatttt ggaacgtcga tccgatctct ctgccgcctg 3900 ctggcagctc gcacgggtca ggcgattcgt caacaacttg cgatgttcgc tcaccggcga 3960 aaacaaggtt tccggatcca tcttgccatc ggagctgcgt gaggcacgaa tgcagtttgt 4020 tcgattggca caacaagagt cgtatgaggg agaagtcaag gcactggttc gtggcgaaga 4080 aattccagca aaatccagca ttgcaagtct gtacccattc ctggatggta ccggcacgct 4140 cagagtagga ggacgattgc agcattcggt ttattctttc gacgtcaagc acccaataat 4200 cgttccaaaa caccatcgct taacgaaact gctggtggag gaaattcacg taaacaactt 4260 tcatgccggc ccaactttga tgacagccac tatcaaccag cggtactgga ttcaaggttg 4320 tcaaaccgtg atcaaacagg taattcgaag gtgcttgatg tgttgtcgtc agaaggcgca 4380 gactgcgaaa cagttgatgg ggagtttgcc agcagcacga gtgacagcgt gtcgcccttt 4440 cgctcacgtc ggggtagatt acgctggccc gatatctgtt cgttgcagca acacgcgagg 4500 agcacggtgt atgaaaggat acatcgtcgt ttttatctgc ctttcaagta aagccgtcca 4560 cttggaagtg gcaggtgatt taaccaccga tacttttctg ggagccttca agagaatgat 4620 cgcacgacgt ggctattgca acgaagtgtg gtcggataac ggcacgaatc tggttggagc 4680 tgatcgacag ttgcaggaga tttatgaggt ggttacgaag cacgtcaagc aaggagaaca 4740 cttcttcacc aatctcggaa ttcgttggcg attcatccca ccagcaagcc cgcatcaagg 4800 cggtatatgg gaagcggcag tgaaaagcgc caaggagctg cttcgaccga ttttgggaaa 4860 cgagaagctc acccacgagg aattatcaac agttttgtgt caagttgagg cttgtctcaa 4920 ttcgagaccc ctctgcccta tgtcatctaa tcccgacagt ctcgaagcgc ttacaccggg 4980 acattttttg gttgggcagc ctctcaacct gctgcctgaa ccggacgtta cccatcttaa 5040 gatgaatcaa ctagatcggt ggcaaaaggt gcaacgttac acggaggagt tttggcaacg 5100 ctggcgcgat gaatacattg ccacgttgca gcctagagga aaatggaaga ccaagcagga 5160 gaatttgaaa cctgggaatt tggtgctcgt taagaacgac aatagcccac cgtcagcttg 5220 ggagctggct cgcatcgtag ctgttcatcc ggatcaacag gggttggttc gcaacgtcac 5280 tcttcgcaga ggcaagtcgg agtaccagcg ttccgtccag aagctatgcc ctctgcctga 5340 ttgagccgtt gcctcaaggc ggggtgga 5368 // ID DNA-5_PPac repbase; DNA; INV; 798 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Non-autonomous DNA transposon from the Pristionchus pacificus DE genome. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-5_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-798 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 957-957 (2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. XX SQ Sequence 798 BP; 220 A; 180 C; 198 G; 199 T; 1 other; taggggccac aacgcagata gccgatttgt cgcattgggg ccaaacttgt gtctaaactg 60 ttccccgtga ccctaataac atcatacttt tgaccgtttg ttgatatctt gtctctgagc 120 tgcgctacag cccgaaaagt acgcgcgaaa cgcaccgttg tgaaggggga aaaggcgaga 180 cgagagggaa agggggcgga gcctatttcc aactcattcc cgaaattcgg aggcaagagc 240 ttcgccaagt cgtgagagac gattcaaggg agaaattacc tcttatccag gcagagcgcg 300 cgctagtcac gacgggttag aaacagtagg gagagtcgag gcggagggag agggagtgtg 360 gtgtgtctct cccccctctc ctctcatttg actatcccct ctggcgctca cactgtgttt 420 agcgngagta ggaatcgagg aaatacgcgc gtactgaaac ctttaaaaat aaagcacaat 480 tcagggagac aagattaaaa tctctaactt tctctaaaaa ttctcaaata atactattct 540 tcgatatatt gctgtaggtt taaggtacat gctcgggaaa gggataaatt gacactacct 600 aggccgtatt atcgattttt gaccaacaca gataaaactt tttcaaaacc tacaagactt 660 tactggtaga ttaggacagt agctatccaa cgcgccctag cacgcttgat ttggttaaga 720 atgaactttg tgcttctccc gtctcgtttt cattggagga gcatgtcaca ccaaagtggg 780 cggagcttgt ggccccta 798 // ID IRE_Tg repbase; DNA; INV; 1919 BP. XX AC AF177728; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Toxoplasma gondii IRE repeat sequence. XX KW Transposable Element; IRE repeat; IRE_Tg; Interspersed repeat. XX OS Toxoplasma gondii OC Eukaryota; Alveolata; Apicomplexa; Coccidia; Eucoccidiorida; OC Eimeriorina; Sarcocystidae; Toxoplasma. XX RN [1] RA Echeverria C.P., Rojas A.P., Martin V., Guarnera A.E., Pszenny V. RA and Angel O.S.; RT "Characterization of a novel interspersed Toxoplasma gondii DNA RT repeat with potential uses for PCR diagnosis and PCR-RFLP RT analysis."; RL FEMS Microbiol. Lett 184(1), 23-27 (2000). XX DR Genbank; AF177728; Positions 1 1919. XX SQ Sequence 1919 BP; 469 A; 509 C; 445 G; 496 T; 0 other; gaattcttca cggttccgga cttgcacagt tgggaccatg cgagaatggt atttagaagg 60 attgatacgg aaaccaacgt cggttggcta ctgctgtgtg tcagctaata gcgacacact 120 gcaggcacgg cctcacacca cacctcgatg caggttctat caccgtggct tgccccctac 180 actggtttct ctggtgcatt cggagctcaa tcgagatact gggagtagcg cggacgccgg 240 cgagagaaaa atatatactg ctgcattacc cttttccgat gccgtcaacc acggcgagtt 300 gctaaaaacc accaaactca cttgtaggga ctggaaaacc aaaatgacga tctcctgtat 360 ctctatatcc aggttcgttt ttctctgcgg atgcgtgtgc acacgtagtg gaggcttgaa 420 gaacaatcgg tgagctttca gggtttaggc atgtaatgca ggaccatggt cgggagtcat 480 cggatggagt ggcgcggtca tggtggactg tcgaccctac atgtgaacct gttgtcaccc 540 tctctgaaac aaaagctaag tgacctgtcg ttgttgttca ttttagcatg tgcatattct 600 gttaacggca gtaaagcagc cttagtcacc gtgtagacgc taccacacag ctaccgttaa 660 tcaagaatcg ttgaaaccgt gcagatttag tggtcactct tggggcagac cgagggacca 720 ggcgttagag atgttatgag tccaccacac ggggaaacac ttatattttg cccgcgcagc 780 cgtcgttccc gctgttccga ttgggcacgt ttcatgcaca tcatggttct gcgcctgtat 840 ctgcactcga ggtccgtttg gtaggataag ccaactaatt ctgaatccag cggctcgctc 900 catacgcggg cattttatgt ttaacgtggt gcttgccctg cgtgtgtgaa ggactctggt 960 atttcaggct atcactttaa aatttttgac ttgaccatta ttggtcacag gcacgcccta 1020 ttatttttgc ctcaccctta tcccaatacg catcttgata gaaccctact cgggcaacga 1080 caagatgcaa ccttgaaacg agttatcacg cctccgagac aactagaaat caaatgctta 1140 caggctgtaa aaggggatac ccctaaagca aacccaagga atgccaatgc aacaaggatt 1200 cggagtgaag gttcttacct agactgcaaa cccatctctc caactgtgag gagtcgtccg 1260 acactgctga ccgcgttatc agtcattttg catatcgctc ctccaccagc ctgtcgtctc 1320 cagaacgcgg gaacactgct cttggaaaac tgcacagaag attgtcgaac tctgaacggg 1380 atcccatcca tgcacgcttc gacacccgcc cgcttgctca ttctatgtta tatcttcagc 1440 ccccatgtca cgtgcgctcg tgagtcggca acgcttgcac aacaaattca tattcgcgtg 1500 gtcgtccgtt gtctcttggg gagggcttcc attaaatatg ccagttgcag atcgccaaca 1560 aacagatgta atgagacaat gcgttacgtg gagactccct atccgtgact gcaaagctcc 1620 cgcaatgcat cttaacatcg cgatggaatt gccctttgag aaggggatgg ccggagacac 1680 ttttcatttg gcctgtgttt ctccccactt cctttcaccc tcacaaccaa ccttcacact 1740 ctcaccacta aaggcgattt tctctccgta tctccccgaa ccttggcgga ctgccaatgt 1800 agatcatagg ccaaatgcac ttaccttagg cacctatgga atgctcagat tcacttgatt 1860 tactcgaagt cttgcacggt tatatcaccg tgcagcatat tttacagaag agggaattc 1919 // ID Gypsy-15_OD-I repbase; DNA; INV; 5995 BP. XX AC CABV01004379; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_OD_; KW Gypsy-15_OD-LTR; Gypsy-15_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5995 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004379; Positions 7162 1168. XX CC Positions [1781-2242] - Reverse transcriptase CC Positions [3335-3802] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1013..5239 FT /product="Gypsy-15_OD-I_1p" FT /translation="MDPILGSEALKSNEITVVLHRNKLIKSGPIERLAKLC FT AIKIEKIRIDGIKEDSFYAISEQNFTFRGKSEQLIDLRIDNLKDARNLFLD FT ESKLGNSKLELIQSFKSVDPSYPYFQVLVINPLNSIIKIPAGPVFAKLSEI FT AQIANLKSTEKDVTKIFNRITVGEISPNNIEKFKNMIKQFSFLFQDDDDLL FT PETALEKFSIDIGNNRPVSSAMYRTPLALRTEMKRILDSFIEQGIIEETRS FT EFNSPCLLVRKKNGSFRLVVDYRKLYQITTQQHHPIQQIDDVLCYLEGSLI FT YSSIDLKKGFHQCSVEESSRPALAFSNEWGQYTWTKMPMGTKNAPLHFAKC FT MDHILRDIPKTQICAYLDDLIVHSKTESEHFENLQKFFVILSQNNLRINID FT KANFFQRKATVLGFEVSRGQVKPSEDKIESVRKLNIPRNREEAQSLFGLLS FT QHRKFIPNFAALGVDISRTYRGNFLWTDKASVALEKLKKIICESTLSLKIP FT QIDQAVFVLETDASKDSFGGVLYLCIEKNKSHEHSLNCLRPFAYHSLNFTP FT SQIKYCTLEKELYAGRSCMERWKTYLAYVNFVWITDNSCVKFANSFKTSNW FT KIQRWLSEIMGFSFSIIQRKSRQMKISDCLSRNLASINKIEFSKSSFIDFQ FT KNDPILKQIRNYVSLDRWPNNTPREIIQYHLNRNNLEILDTEELVLNCPSG FT LKKFCVPDSLREEIIREYHDNSHPGIEICNDKISKKYFWPGMRKNVAEFVQ FT SCHYCQTSKPNNHPNKASLGKFKTPSGPYETLSIDLIGPLKETNMGNKYIF FT SVIDGFSKKVYAEPIYQKQSDYLLQIFRGFIFRNPKFPIYVVLDNAPEFNE FT IAKFLKINKIEPHFIPPRHPQSNGLIENANRTIKARLRARTNLINWDEHLH FT EIIHEINSSTHSVLKFSPFTVETGILDNHYFYDSNWRNYGNKINVDFEEIR FT NKIENEKNKRIEKFNNEKFTGYSLGDLVLIKNFRSSFPPFIGPFEIIYKSP FT TGSWYNCKNGDREFRRHADNLKPYFERVDKSDELKTIEPTEKSPKKVVQKA FT SELEELYSSDSDFFVQINNNEENDDSENPSESSIKNKLQNFMSTDSSSSSS FT SSESSSDDSEFQKLLLKSKELSVRNEQMKSVFESLPKPEIIGNYLNDSEEF FT ADSENSAQPDFNESVEYFEDNDLELEHSSPADVLSPINPTEPVLISYDKLT FT DSILCNDRKRERSDDSLISRKVAKFTHENNLVNMAILIKMEYEDKSFFESL FT NDKWKNDEIENKQIEQGMFLSLHELTKDLLLFILFKLNEKYCIDENVTILR FT KRIKLKIEKDFPNWRRSPSGKLLFYATFRTQKERSLYDLSLPELKVVCAHY FT NLPTPLKFTKTFLTAFIEQELPKVSNNHPTRKNEIIFIPDVSETEI" XX SQ Sequence 5995 BP; 2091 A; 1058 C; 1053 G; 1793 T; 0 other; aaaacgagaa atttgaattg tttttatcga aactcacttc gattgctgaa cagatttcgg 60 aaaaagctga tgcgaaagaa tacatttgtg atcaaaaatt tagagaatct ctttctcctg 120 atacggaaag ctttttgcgt gatatgtgca tgctcgagaa aagtccttcg gaaattgcaa 180 attttctcga caaaagagaa cgtcatattt ctggcccgac tgttaataag gttgatatag 240 attcgaaatt cggttccttt gttcagtcaa cttcggatat gatcgctaaa acttccatta 300 cttttgagga gcaaattgct cgattagaag ataaacagaa aaaaacggaa gaaatggctg 360 agcaacgcac ccaaaattta agtaatcaga taagtcaatt aacggccacg atcgcgaaat 420 tatcgtttca acagaatcag ccgcaaacgc gccaaaattt tcaaaatcga actgcccaaa 480 atgccccagc cgctcaaaac tcgtttcagc cgcggccacg tttttgcaaa ttttgtaatg 540 tcgatactca ctaccgaagt tcttgcccat atgtaacatg ctttttgtgc ggagaacgcg 600 gtcacatgcg taatagatgc cctaaaagtc agaacacggc cccatcaacg tcaacagttt 660 ctaaaccggc tgaagcccaa gcaaaaagct caaatttaaa ctaatttgcg gagcgaaaga 720 ccctggtcag gctttagctc cacaaagcgt taaaataaat agagtaggat tacagtattt 780 tgttaatatt caaatttttg aaagaaacat taaatgtctc gttgatacag gtagccaact 840 taatctttta ccgaaaacat ttattcctga aaacgttcaa attgcacctc cagatcttag 900 cgccagtaat tacggcggcg gatcaatcga aattcttggt tatgtgaatg aaaaatttta 960 tattggatct gagctctggg ggaaatcacg tttttatatt gtccccgatc acatggatcc 1020 aattcttggt tccgaggctt tgaaatcgaa cgaaataacg gtcgttcttc atagaaacaa 1080 attgattaaa agtgggccaa ttgaaagatt agcgaaattg tgcgcaataa aaattgaaaa 1140 aattcgcatc gatggaatca aggaagactc gttttatgca atttctgagc agaattttac 1200 ttttagaggg aaaagtgagc aacttattga tttgcggatt gacaatttaa aagacgctag 1260 aaatcttttt ttagacgaaa gtaaactggg aaactcgaaa ctcgaattaa tacagtcttt 1320 caaatctgtc gatccctctt atccgtattt tcaagttctt gtcattaacc ccttaaattc 1380 tatcattaaa attccggccg ggcctgtttt cgcaaaatta tctgaaattg ctcaaattgc 1440 aaacctaaaa tctactgaaa aagatgtcac aaaaattttt aatagaatca cagttggaga 1500 aatttcgccg aataacatcg aaaaattcaa aaatatgata aaacagtttt cctttctttt 1560 ccaagatgac gatgatttgc tccccgaaac agcgcttgaa aaattctcaa tcgatattgg 1620 taataatagg cctgtcagca gtgcaatgta tagaacgccc ttagctttac ggacagaaat 1680 gaaacgaatt ttagattctt ttatagaaca gggtattatt gaagaaacaa ggtcggaatt 1740 taattcgcca tgtcttctcg tacgcaagaa aaacgggtct tttcgtctag ttgtcgatta 1800 cagaaagctt tatcaaataa cgacccaaca gcaccatccg attcaacaaa ttgatgacgt 1860 tttatgttat ttagagggaa gtttaatata ttcttcaatt gatctaaaaa aagggtttca 1920 ccagtgctcg gttgaggaaa gttcccgtcc agctctcgcc ttttcgaatg aatggggtca 1980 gtatacttgg actaaaatgc cgatgggcac aaaaaacgca ccattacatt ttgccaaatg 2040 tatggaccac attcttcgag acattccaaa gacgcaaatt tgcgcttacc ttgatgattt 2100 aattgttcat agcaaaacag aatcggagca ttttgaaaat ttacaaaaat tctttgttat 2160 tttatcccaa aataatttgc gaataaatat agataaagca aattttttcc aacggaaagc 2220 gacagttttg ggatttgaag tttctagggg ccaagtcaag ccgtctgagg acaaaattga 2280 atctgttcga aaattaaata ttccgcgcaa tcgggaggaa gcgcaaagtt tatttggtct 2340 tttatcacaa caccggaaat ttattccaaa ttttgctgca ctaggcgtag acatatcgcg 2400 cacctataga ggaaactttc tatggacaga caaagcatcg gtagcgcttg aaaaactgaa 2460 gaaaataatt tgtgaatcga cattgagtct taaaattcct caaattgacc aagctgtttt 2520 tgtgcttgag acggacgctt cgaaggacag cttcggcggc gtgctctatt tatgcatcga 2580 gaaaaataaa tcgcatgaac acagtttaaa ctgtttaaga ccatttgctt atcattcttt 2640 aaattttacg cctagccaaa ttaagtattg tacccttgaa aaagaacttt atgccggtcg 2700 atcgtgtatg gaaagatgga aaacctatct cgcttatgtc aattttgttt ggataacaga 2760 taacagctgt gtaaaatttg caaattcttt taaaacctca aattggaaaa ttcaaagatg 2820 gctaagtgag attatgggtt tttcattttc aattattcaa cgtaaatcta gacaaatgaa 2880 aatcagcgat tgtctatctc gtaatttagc aagtataaat aaaatcgaat tctcaaaatc 2940 ttcatttatt gattttcaga aaaatgatcc tattcttaaa caaattcgaa attatgtttc 3000 cttagataga tggccaaata ataccccgcg cgagataatt caataccatt taaatcgtaa 3060 taacctcgaa attcttgaca cggaagagct tgttcttaac tgtccgtcag gcttaaagaa 3120 attttgcgtt ccggattcac ttagagagga gataataagg gaatatcacg ataattcgca 3180 ccctgggatt gaaatttgta atgataaaat ttcgaaaaaa tatttttggc ctggaatgag 3240 aaaaaacgta gcggaatttg ttcaatcttg ccattattgt cagacctcga aaccaaataa 3300 tcatccaaac aaagcgtcac ttggaaaatt taagacacca tctggacctt atgaaacgtt 3360 gtcaatagat ttaattgggc ctcttaaaga aacgaacatg ggaaataaat atattttttc 3420 tgtaattgat ggatttagca aaaaagttta cgcagaacca atttatcaaa aacagtcaga 3480 ttacttactc cagatttttc gagggtttat tttcagaaat cctaagttcc ccatatatgt 3540 tgttctagat aacgctcctg aattcaacga aattgctaaa tttctaaaaa tcaataaaat 3600 tgagccccat ttcattccac cgcggcaccc tcagagtaac ggtctaattg aaaatgcgaa 3660 tcggacaatc aaggctagat taagagctag aactaatctt ataaattggg acgaacatct 3720 tcacgaaata attcacgaaa taaatagctc aacacactcg gtattgaaat tttctccctt 3780 tacagttgaa accgggatct tggataatca ttatttttac gactcaaatt ggagaaatta 3840 tggcaataaa ataaatgtcg attttgaaga aattcggaat aaaattgaaa atgaaaaaaa 3900 caaaagaatc gaaaaattta acaatgaaaa atttacagga tattctttag gagaccttgt 3960 tttgataaaa aatttccgtt ctagttttcc cccatttatt gggccgtttg aaattattta 4020 taaatcaccc acgggatctt ggtataactg caaaaatggg gatcgggaat ttcggcggca 4080 tgctgataat ctaaaaccgt attttgaaag agtagacaaa tccgatgaat taaaaacaat 4140 agaaccaaca gagaagtcac ctaaaaaggt tgtgcagaaa gcttcagaac tcgaagaatt 4200 atactcgtca gatagtgatt tttttgtaca aattaataat aatgaagaaa acgatgattc 4260 cgagaatccg agtgaaagtt caatcaaaaa taagcttcaa aatttcatgt cgacagatag 4320 ttctagttcc agcagttcaa gtgaaagctc atctgatgac tcggaatttc aaaaattatt 4380 attaaaatcg aaggagcttt ctgttcgtaa cgagcaaatg aaaagtgttt ttgagagttt 4440 accgaagccg gaaataattg gtaattattt aaatgatagc gaggaattcg cggattctga 4500 aaattctgcc caacccgatt ttaacgaaag tgtagaatat tttgaggaca acgatttaga 4560 attagaacac tcaagccctg ctgacgttct ttccccgatt aatcctaccg agcctgttct 4620 tatatcatat gataaattaa cagattcaat actttgtaac gatcgtaaaa gggagagatc 4680 tgacgattct cttatttccc gaaaagtggc taaatttaca cacgaaaata atcttgtgaa 4740 tatggcaatt ttgataaaaa tggaatatga agacaaaagt tttttcgaaa gtttgaacga 4800 taaatggaaa aatgatgaga ttgaaaataa gcaaatcgaa cagggcatgt ttttgtcgct 4860 tcatgaatta acaaaagact tgcttttatt cattcttttc aaattaaatg aaaaatactg 4920 cattgatgaa aatgtaacga ttttacgaaa aagaataaaa cttaaaattg aaaaagattt 4980 tccaaattgg cgcagatccc cgtcgggcaa gcttcttttt tacgcgacat ttcgcacaca 5040 gaaagaaaga agtttgtacg acttatccct tccggaatta aaagttgtat gtgcacatta 5100 taatctgcca acaccactta aatttacaaa gacattttta actgctttta ttgaacagga 5160 acttcctaaa gtttcgaata atcatccaac gcgtaaaaat gaaattattt ttattccgga 5220 cgtaagtgaa acagagatat aaattgagtt caaaaagatt ctcgccctgg acccataggt 5280 ggtcacagtg ataccataca aggaaaatac tcaataaaaa aaatatttat ttcgttattg 5340 tcacggcctc gatcgacgct aatttggata ataaattaaa atataaaaaa tcatcctcac 5400 aagttaaaat ttctaaaaga ttgtcaggaa aagcgagaca ttccgacaca aaacctgcct 5460 tatcatttta tgcatgctca tatacataac agtcaattat cctcaatttt acagaattta 5520 aaaataagtg ttaaaatcag aattctgaaa ttgacagtat ttggcttggt tctctttggt 5580 ggattcataa tgctctcatc aaaatcaaac ggttcggaga ataattaagt gggtacttct 5640 agcaattcat gtgtgattgt gataatgaag tgtaaattaa ctgtatgtac gcatatatta 5700 aaaacctcaa ttgttatccg aatcatcctt tccatcgaca aacgctcatt atcatcattt 5760 ctggtaaata tttgtattct cttcgttgat ttcaattaac aaaatttgta attattctct 5820 tcaaacgacg gacaacaaca acgactacga tttgtatttg cggccctgtt ttctggaatt 5880 tcttgaaggt gttgtcggaa agtagcacgg tcgagttagc tcccggtggc caaacccgac 5940 tagaactgcc tcaaggtaat ttgtatttga taactctact cacgaaccaa taaat 5995 // ID Gypsy-185_AA-I repbase; DNA; INV; 4390 BP. XX AC supercont1.145; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-185_AA_; KW Gypsy-185_AA-LTR; Gypsy-185_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4390 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.145; Positions 1013967 1018356. XX CC 'CTATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 975..4286 FT /product="Gypsy-185_AA-I_1p" FT /translation="MMISAVNKINEPCYVEVLADRKWLTMEIDCGSSESVI FT SESLFLRNFKSHTLKPCNKKLVVIDGKRLAILGKVFVPVQVGDINKDLSMV FT VLRCDNEFVPLMGRTWLDVFYQGWRNTFARPSTAAGTINALTEDETINDVK FT RKFPNVFDKDFSNPIVGFTGDLVLKEDEPIFKKAYDVPLRLKQKVIDYLDT FT LEKDGIITPVEASEWASPVIVVMKKDQQIRLVIDCKVSINKVIVPNTYPLP FT LIQDIFAAISGSKVFCSLDLAGAYTQLLLSENSKKLMVINTLKGLYRYNRL FT PQGASSSAAIFQKVMDQVLRGIEHVCCYLDDVLIAGKDFEDCKNKLYLVLE FT RLAKANIKVNFKKCKFFVDELPYLGHVLTEKGLLPCPEKVQTIREAKAPRN FT VSELKAFLGLLTYYSKFIPNLSSRIHCLYSLLKKNVHYTWTKECNKAFEDC FT KQILLKPKLLEYFDPDKPIVVVTDACNYGVGGVIAHIVDGVEKPISFTSFS FT LNNAQKNYPILHLEALAIVSTVKKYHRYLYGMKFTIFTDHKPLIGIFGKEG FT KNAMSVTRLQRYIMEMSIYDYDIVYRPASKMGNADFCSRFPLAQEIPKDLA FT VEFVKNLNLSDEFPMDYLEIARATTNDDFIQKIIYFLHNGWPDRLEKCFKD FT VYSLHQDLEEIDGCVLFQDRVIVPVSMKEKVLKLLHSNHSGISKIKQLARR FT TVYWFGMNGDVEKYVKSCSICNKMNAVTKPAPYSRWIPTKKPFSRLHADFF FT YFERKMFLVVVDSFTKWIELEYMRNGTDHKKVIKVFLSIFARFGLPDVLVT FT DGGPPFNSSNFVTFFEKQGIRVMKSPPYHPESNGQAERTVRLVKDVLKKFL FT LDREIRGLDTDEQISYFLFNYRNICLEKDGEFPYERLLSYKPKTLLDLINP FT KSSFKKNLTEFYDDNQHACETKKDKKSDPFAKLRVGDLIYYKNSNKTDIRK FT WIPAKFLKQISDHILQISLGGRIVWAHKRQLKVQSESHLRMDRKVVFRGES FT LVDTTDQPQADCLLQQDNVEEEEEDIAVDNNRKRRREEDDMSDSSSSFYGY FT PADSFIFQENDFNQFDFENEVSEDQQNPIRKSKRRTKKKRKKDFVYY" XX SQ Sequence 4390 BP; 1421 A; 692 C; 924 G; 1353 T; 0 other; gtggcgacaa ggataaagtt tatatagtcg atagcgtcgc gagtgaaata gtgaaatttt 60 cctgaatttt tgcggcgtaa gtttaagaaa cttatttcag gaagtggtga tagcatagct 120 gatttttatt gaattttgtt gacaaagcat tttgtcacac gcttcctaat cttttttgaa 180 tttttaagag catttgtggc gtgctttaac gttcactaat tagttttgtt ggctggaaaa 240 tcagcatcat ttcaaagggt tatttagtga ttttagtgaa atcgagcgtg aaaatcacga 300 cgaaaatcaa ttttggcgaa agaagaaacc attttgtata gtgcatttta tacattttat 360 tgaaaaggcg tttgattctc attgttctat tgtgatttct tattcttttg tttgatgcgc 420 cgtattcatt tcattttcac cattttgagc atcatttatt ttccagtgta agcgtcataa 480 aaggaaggcg gcagtagcct cttctaactg cttctaaagc agaaagaatt atcgttaatc 540 aggaaatatc cagtgacagg acgcgttatt tgaaagagga ttatgataaa agaagtaata 600 accgcagtaa tgtgatagct agacttggtg ctagagtgaa tcgttctcgt gttagatttg 660 aacgcagaag tagaagcagg agtagaagtt acagccgaga tagatcgttt tcaagagaag 720 gccgaagaaa tcgtagattt ggcgataggc gcgaaaacaa tagcagtata ccaacttttt 780 tgtgttcatt ttgtaaaaaa cgtggacaca ctagaaagtt ttgttataaa ttgaacaaga 840 agagtccaag ccaggaaaga agccatgtta accttattca atcacctaaa tcctcaacat 900 cctctacttc agcattgttt aaaagattga aaactgattt gcaagataca gaatcggagg 960 atgattctcc ttgcatgatg atttcggcag ttaataagat taacgaaccg tgttacgttg 1020 aagttctagc ggataggaaa tggctgacaa tggagattga ttgcggctct tcagagagcg 1080 ttatttccga aagtctgttt ttgaggaact ttaaatctca cactttaaaa ccttgcaaca 1140 aaaagttagt cgtcatagac ggaaaacgtt tggctatttt gggaaaagtt tttgttccag 1200 tgcaggtagg agacatcaac aaagaccttt ccatggtagt actacggtgt gacaacgagt 1260 ttgtgccatt gatgggccgt acatggttag acgtttttta tcaaggatgg agaaatactt 1320 tcgctagacc atcgacagct gcagggacga taaatgcttt gacggaagac gagacgatta 1380 atgacgttaa acgtaagttt cccaacgttt ttgacaagga tttctcaaat ccaattgttg 1440 gttttaccgg ggacttagtt ttgaaagaag atgaacccat tttcaaaaaa gcatacgatg 1500 ttcctttgcg tttaaaacaa aaggttattg attatctgga cactttagag aaagacggta 1560 tcataactcc cgttgaagca agtgaatggg catcaccagt cattgtagtt atgaaaaaag 1620 atcagcaaat ccggttagtg atcgattgta aagtgtcaat taataaggtt attgtgccaa 1680 atacataccc tcttcctcta atccaagaca tttttgctgc tatttctggt tcaaaagttt 1740 tttgttcgtt agacctcgct ggtgcttata cacagctttt actttctgaa aattcgaaga 1800 agctaatggt cataaacaca ttaaaaggtc tttatcgtta taacaggtta cctcagggtg 1860 cctcatcgag tgcagcgatt tttcagaaag ttatggacca ggtgttgaga ggaattgagc 1920 atgtttgttg ttatttagac gacgtgttga tagcgggtaa agattttgag gattgtaaga 1980 ataaacttta tttggtttta gaaagactag ctaaggccaa tattaaagta aatttcaaga 2040 aatgcaaatt ttttgttgac gaattgccgt atttaggaca tgttttgacc gagaaagggt 2100 tactaccttg tcctgaaaaa gttcaaacta ttcgggaagc gaaagctcct cgaaatgttt 2160 ccgagcttaa ggcttttttg gggttactaa cttattactc caaatttatt cccaatcttt 2220 catcccgcat tcattgcctt tacagtcttt tgaaaaagaa tgttcattac acctggacaa 2280 aagagtgtaa taaggcattt gaagattgta aacagatttt attgaaacct aaacttttgg 2340 aatactttga cccagataaa ccaattgtag tggttactga tgcatgtaac tatggtgttg 2400 gcggagtcat cgctcatata gtggacggag tagaaaagcc aataagcttt acttcctttt 2460 ctcttaacaa tgcgcagaaa aattacccga tattgcattt ggaggctttg gcgattgtca 2520 gtacggtaaa aaagtaccat agatatttgt atggcatgaa attcactatt tttacggatc 2580 acaaaccgct tattggtatt ttcggcaagg aagggaaaaa tgcgatgtct gtcacaagat 2640 tgcaacgtta tatcatggaa atgtctattt atgattacga tatagtatac agacctgcat 2700 ctaaaatggg aaatgcagac ttctgctctc gttttccgct tgctcaagaa attccaaagg 2760 atttagcagt ggagtttgtg aagaatttaa acctttcgga tgaatttcct atggactatc 2820 tagagatagc tagagctacc acaaatgatg atttcattca aaaaattatt tattttcttc 2880 acaatggttg gccagatcga ctggaaaaat gttttaagga tgtatattct ctacatcaag 2940 atcttgagga aatcgacggt tgtgttttgt ttcaagaccg agtaattgtc cctgtaagta 3000 tgaaagaaaa agtacttaaa ttgttacact ccaatcactc gggaatcagc aaaataaaac 3060 aacttgcgag gcgaaccgta tactggtttg gcatgaatgg tgatgtggaa aaatatgtga 3120 aatcatgcag catttgtaat aagatgaatg ctgtaacaaa accagcacct tattctcgtt 3180 ggatacctac aaaaaagcca tttagtcgac tgcatgcgga tttcttttat tttgagagga 3240 agatgttttt ggtcgtcgta gacagcttta ccaaatggat tgagttggaa tatatgagaa 3300 atggtacaga tcataaaaag gtaatcaagg ttttcctgag catattcgct cgttttgggt 3360 tgccggatgt tttagtgacc gatggaggac cacctttcaa ttcaagcaat tttgttacct 3420 tttttgagaa acaaggcatc cgggtcatga agagccctcc atatcaccct gaaagcaacg 3480 gccaagcgga gagaacagtt cggttggtca aggacgtatt aaagaagttc cttttggatc 3540 gagaaattag gggtttggat acggatgagc aaatatcata ttttcttttc aattacagga 3600 acatttgtct tgaaaaggat ggcgaattcc cctatgagcg attactctca tataaaccta 3660 aaaccctact ggatttgatt aatcctaaat ctagttttaa aaagaaccta acagaatttt 3720 atgatgataa ccaacatgca tgcgagacta aaaaggataa gaaatctgat ccttttgcta 3780 agcttagagt tggagacctg atttattaca agaactctaa caagactgat attagaaaat 3840 ggattcccgc aaaattccta aaacagatct ctgatcacat tttacagatt tccctcggtg 3900 gtagaattgt atgggcgcac aagcgtcaat tgaaagtgca gtctgaatca cacttgagaa 3960 tggatcgaaa ggtagtgttt cgaggagaga gtttggttga tacaacggat caaccacaag 4020 cagattgttt actacaacaa gacaacgttg aagaagagga agaagatata gcagtagaca 4080 ataataggaa aagaagaaga gaagaagatg atatgtctga ttcaagttca tcgttttacg 4140 gttatcctgc cgattcgttt atatttcaag aaaatgattt taatcaattc gattttgaaa 4200 atgaagtttc agaggatcaa cagaatccta tcagaaaaag caaaagaaga actaagaaaa 4260 agcgaaagaa agactttgtg tattattgaa tgaattatca gagatgtgga atttaatgaa 4320 ataattgtaa atagtcgaaa ttatatttta tataaaacaa tcctggaact ttctcaaagg 4380 gatgaggagt 4390 // ID Gypsy-218_AA-LTR repbase; DNA; INV; 228 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-218_AA_; KW Gypsy-218_AA-I; Gypsy-218_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-228 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1040-1040 (2011). XX DR [2] (Consensus) XX SQ Sequence 228 BP; 68 A; 37 C; 56 G; 67 T; 0 other; tgtagagttt gatattataa tccattttca tacgttagac tacatttcaa tgtaagcttc 60 taggttggca acgttagcga cggtcgggag agttccgtgg agagttgacg tgaaagcaag 120 tgagtgagac cgtgaggaac gcgggttgta aaacgatcgg acgaataaat gtaatttaag 180 tgcttttaac tacctgtctg aattactatt ccgaaaccag actttaca 228 // ID CR1-24_HM repbase; DNA; INV; 4785 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-24_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4785 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1852-1852 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(112..894,800..1264) FT /product="CR1-24_HM_1p" FT /translation="MAITMDQVKKTIQKMFKEFKNEIDEMMKQQEKNVLNI FT LSANTKIIYDRLDKVDLKVNDYAKHIKTLEKDVEEIKRSLNFHEHLFEEKI FT KTAIASQEKKHTSGIDKLNHSSDYVKIKSKLREMEDRSRRNNLRVDGLKES FT EGETWIDSELKVSKVFEEHLGLTNIRIERAHRTGQRDMSKPRTIVVKLLDY FT KDKVEILKKTSQLKGKNIYINEDFCAETVTIRKALREQMKVERAAGKYAFI FT SYDKLVIRDWAANKSKNFSLRLNERLGNMHLSLTISWLFGIGLQTKVKIFL FT CNIFIYSTILFYFFYILLCLIILIFLLKMEANFSLNLNDENASDDNYFNGK FT NLESEYYSISESTQYLCQNKNTFSILNVNIRSINKNFENLKLLLHQLNHEF FT KIICIMYLSFEXTNDFLSFII*" FT CDS join(1579..2556,2674..4377) FT /product="CR1-24_HM_2p" FT /translation="IIETWLKTNEKNSNFELGNYVSVQQPRFNYAGGGISI FT FVHKSISFILRPDLNINETDCESLCIEIINKTTKNIVINSIYRQPAGNLRK FT FKTYLSSFLTKTSITRKHVYLAGDFNLNLLNYASNTNVQHFLDTLIEYNLI FT PTINKSTRITKTSSSLLDNIITNNFQNCRLGTGIIKTDLTDHFPIFLITDN FT ITLNNVPSNTTILTRQINENSLLYFRTLLTKNIDLNVILQSHEVNTAYELF FT LAQFCKQYDIAFPVKKIVIKSKSLLNPWMTKELIKSSKKKKKLYDKYLRHK FT TYKNEINYKNYKNLFEKNTKTLKKNLLYKVVGKNKKKIAKELNSFFLEVGP FT NLASKIPSSTTKFESYIKSCNSIMEESNLNLSEIRNAFNSLKSNKSAGIDK FT ISVNVVKSVFDIIEPSLFHIFDLSLKLGVVPKKLKIAIITPIFKSGDETNI FT NNYRPISVLPCFSKLLERIMYNRLYNYLMLNSLLYSKQFGFQKDNSTNHAI FT IELVNHVSDAFNNDYFTLGVFIDLSKAFDTVNHKILIKKLELYGIKNKNLL FT WFEDYLANRSQCIIYNKKEKEKHITCGVPQGSILGPLLFLLYINDLYLASN FT ILNVILFADDSNLFYSNKDIKALFNIFNEELFKINDWFTSNRLSLNVEKTK FT FILFSKPSKGDDVPLKLPNLLINKTFIKREPIVNFLGVIIDENLSWKPHIK FT YIENKISRNIYILYKTKSYLNTSSLKKIYFSFIHSFISYCNIVWGSTNYGK FT LKKLHSKQKHACRIIFGAHRTFQCEPLLRKLGALSVYKLNIHQVLQFMFKT FT KHELSPIVFQSYFSEISHKYPTKFSINNFIVPKTNSKLNSYKIQYRGPFLW FT KHFSKFITKNNNVQHISLEKFRKESKNLLLQKDFDLKYLF*" XX SQ Sequence 4785 BP; 1922 A; 642 C; 586 G; 1621 T; 14 other; ttattttgaa atttgaaaaa aaaaaaaaaa acttttttga ttgtttggaa ttatttaaat 60 aaacaaatat aaataaatat atatttttat atattaacat attggttaat tatggctatt 120 acaatggatc aagtaaaaaa aacgatacaa aaaatgttta aagaatttaa gaacgaaata 180 gacgaaatga tgaagcagca ggaaaaaaac gtgttaaata ttttaagcgc caacacaaaa 240 attatatatg ataggttaga taaagtagat ctaaaggtta atgattatgc aaaacatata 300 aagacattag aaaaagacgt agaagaaatc aaacgaagct taaactttca cgaacatctt 360 tttgaagaaa aaattaagac cgcaattgct tctcaggaaa aaaaacatac atctggcatc 420 gataagctaa atcatagttc cgactatgtt aaaataaaaa gtaaactaag agagatggag 480 gatagatctc gcagaaacaa tttaagggtt gacggattaa aagaaagcga aggtgaaacg 540 tggattgaca gtgaattaaa agtcagtaaa gtttttgaag agcatttggg attaacaaat 600 attcggattg aaagagctca tagaactggc cagagagata tgagcaaacc aagaacaatt 660 gtagtaaaac tgttagatta taaagataaa gttgaaattt taaaaaaaac atctcaatta 720 aaaggtaaaa atatttatat taacgaagac ttttgcgccg aaactgtcac aattaggaaa 780 gctctacggg aacagatgaa ggttgaacga gcggctggga aatatgcatt tatctcttac 840 gataagttgg ttattcggga ttgggctgca aacaaaagta aaaatttttc tttgtaacat 900 atttatatat tctacaattt tattttactt tttttatata ttactttgtt taattatatt 960 aatttttctc ttaaaaatgg aggccaattt ttcacttaat ttaaatgatg agaatgctag 1020 tgatgataac tattttaatg gaaaaaattt agaaagcgag tattattcaa tttctgaatc 1080 tactcaatat ctctgtcaga acaaaaatac attttctatt ttaaatgtaa acattcgaag 1140 tattaacaaa aattttgaaa atctcaaact tttactgcat caactaaacc acgaatttaa 1200 aataatttgt atcatgtatc ttagttttga anttacaaac gattttttaa gttttattat 1260 ttgaaatact aattactaaa aatattgtaa aattttataa tcaattttta ttaaaaaaag 1320 ttatttaaat tctttttcta atattaatat ttatatytat ttacyaagtt cttaaatata 1380 aatattttty gaattaccaa atattttaaa cgttatatac tttatttwtt acaattaaaa 1440 tattttawga tgaayatgca ttttttagkg tgmttaatra raaaatgagt tttgtcttct 1500 tcaagcccat gtaaatattt attataaatt acgaattyaa agtattttca atcgatacta 1560 acaamraaaa taaaataaat catagaaact tggcttaaaa ccaacgaaaa aaattcaaac 1620 tttgagttag gtaattatgt atcagttcaa caaccacgtt ttaactatgc tggtggtggt 1680 ataagtattt ttgtccacaa atcaattagc tttattctac gtcctgacct caatattaat 1740 gaaactgatt gtgagtcatt gtgcatagaa attataaaca aaaccaccaa aaatattgta 1800 attaattcta tttatagaca accagctggt aatttaagaa aatttaaaac ttatttaagc 1860 agttttttaa ccaaaactag cataacacgg aaacatgttt acttagctgg tgattttaac 1920 ttaaatttac taaattatgc ttcaaataca aatgttcaac attttctaga tactcttata 1980 gaatataacc taataccaac aataaataag tcaacgagaa taactaagac ctcatcgtca 2040 ctacttgata atataattac taacaatttt caaaactgcc gtttaggaac aggcataatt 2100 aaaactgatt taactgatca ctttccaata ttcttaatta ctgataacat aaccctaaat 2160 aatgtccctt caaacactac aatcttaaca cgacagatca atgaaaactc cttattgtat 2220 tttcgaactc ttttaacaaa aaatatcgat ttgaatgtta ttttacaatc ccatgaagtc 2280 aatactgcat atgagctatt tttagcgcag ttctgtaaac agtatgacat agcatttcct 2340 gttaaaaaaa ttgttataaa atccaagtct cttttaaacc cctggatgac caaggaatta 2400 ataaaatcat caaaaaaaaa aaaaaaactt tatgataaat atttaagaca taaaacatat 2460 aaaaatgaaa taaactacaa aaactataaa aacttatttg aaaaaaacac aaaaacactc 2520 aaaaaaaatt tattatacaa agttgttgga aaaaactaac ggcgacacta aaaaaacgtg 2580 gaatataatt aaacaaataa ttggtaaaaa taaatgtgaa aaaaataatc tcccacaaaa 2640 gcttttaatt gatggggaaa tgatttacga taaaaaaaaa aaatagctaa agaactcaac 2700 tcattttttc tcgaagtagg gccgaacttg gcgagtaaaa ttccgtcaag taccacaaag 2760 tttgaatctt acattaaatc atgtaattct attatggaag aatctaattt aaatctaagc 2820 gaaataagga atgctttcaa cagtcttaaa agtaataaaa gtgcgggcat tgacaaaatt 2880 agtgtgaatg ttgttaagtc agtcttcgat atcattgaac cgtcattatt tcatattttt 2940 gatctttcat taaagctagg tgttgtgccg aaaaaactta aaattgccat aatcactccc 3000 atttttaaat ctggtgacga aactaatatt aacaactaca gacctatttc cgttttaccc 3060 tgcttctcaa aattattgga acgaattatg tacaacagac tttacaacta tttgatgtta 3120 aacagtttgc tgtacagtaa acaatttgga tttcaaaaag ataattcaac aaaccatgct 3180 ataatcgaac tagttaatca tgtatcggat gcattcaata atgactattt tactcttgga 3240 gtttttattg atctgtcgaa agcctttgac actgttaacc ataagatact tataaaaaaa 3300 ctagaactat atggaataaa gaataaaaat ttactttggt ttgaagacta cctagcgaac 3360 aggtcacaat gtattattta taataaaaaa gaaaaagaaa aacatattac atgtggtgtg 3420 ccccaaggtt caatcttagg gccattatta tttttattgt atataaatga tttatactta 3480 gcatcaaata ttttaaatgt tatattgttt gcagatgact ccaatctttt ttattccaac 3540 aaagatataa aagctctctt taatatattt aatgaagaac tattcaaaat aaacgattgg 3600 tttacaagca ataggctttc gttaaatgta gaaaaaacta aatttatttt attttccaaa 3660 cctagtaaag gtgatgatgt tcctctaaaa ttacccaatc tcctaatcaa taaaacattt 3720 attaaaaggg aaccaatcgt taattttcta ggagtaataa tagatgaaaa tttgtcatgg 3780 aaaccacata taaaatatat agaaaataaa atatcaagaa atatttatat attatataaa 3840 actaaatcat atcttaacac ctcttctcta aaaaaaatat acttttcatt tattcatagt 3900 tttatctctt actgcaatat tgtatggggt agtactaact atggtaagtt gaaaaaactt 3960 catagtaagc aaaaacatgc atgcaggatt atatttggtg cccacagaac ttttcaatgt 4020 gaacctcttt taaggaagct cggtgcccta agtgtctata aacttaatat acaccaagtt 4080 ctacaattca tgtttaaaac taaacatgaa ctctctccta ttgtatttca gtcttacttt 4140 agtgaaatct cgcacaagta tcctactaaa ttttcaatta ataactttat tgttccaaaa 4200 actaattcaa aactaaattc ttacaaaatc caataccgcg gaccgtttct atggaaacat 4260 ttttcaaaat ttatcacaaa aaataataat gttcaacaca tctctttgga aaaatttaga 4320 aaagaatcta agaatctctt attacaaaaa gactttgatt taaaatatct cttttaaaaa 4380 taatacacaa ataatataaa ttatctaaaa aaatagttat tgacactact tctcttttgt 4440 tgcaatttta atctaattta attttttgtt gaaaaatttt aatactttta aattattgtt 4500 aatctttgat ataatgcgtt atatttcaac actaatattt atattatgtt attagttata 4560 ttgtaactaa cgtatttaaa attcttgtta tgtacttagc tactcttcat tttttatatt 4620 aactgacgat tttttagtgt aaattttatg atggggcttg atgataagac aatttatgtc 4680 ttctgcttgc tccagttctc ttatttatta acacgattta tttcattgta aattttaata 4740 tatgacaaaa ctctttgctt aaaaaaaaaa aaaaaaaaaa aaaaa 4785 // ID CR1-77_HM repbase; DNA; INV; 4295 BP. XX AC . XX DT 02-JAN-2009 (Rel. 14.02, Created) DT 02-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-77_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4295 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 364-364 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 52..759 FT /product="CR1-77_HM_1p" FT /translation="MAKNFSASQLKEILDIHENTMMKIFNEKFERLEKQID FT KLNEDNKILKTEVSELKKAVEFISEKYDKIIIEQEGLKADCAKPDRNLEKS FT LTINLRDKLAEIEDRSRRNNLRISGIEESVDESWDDCESKVRELIKKKLDL FT QGEFIIERAHRVGKNDKDNTIKKCRTIVVKFLNFKDKSTVLGKYIKTKLWN FT QRIYVNEDFSERTTELRKKLFTAAKDLRIKGKYAKVVYNKLITRD*" FT CDS join(799..1356,1382..1714,1674..2048,1957..3879) FT /product="CR1-77_HM_2p" FT /translation="MDSNNLNDFELSYNFFRTDEFLFNNQSDPDLNYFCEA FT HALQSNCSYYYADEIKEFLNCRYFNTFHINIRSISKNFENFYLSMKESLNI FT FSLICMSETWCNSNNSNLNSSIFLPGFNIISLDRKNGKRGGGLIVYIREYL FT QYIIRPDMSNSDDDNEVLTVEIQTKKKNKKFNYKFLLSPTFGQYRKFKKSI FT CENKLYYLIGDINLDCYQYHTNYGVKKFYNELFELGAFPIINKPTRITPTS FT KTLIDNIITTDVFNKSLKKGIIKSDVSDHFPIFFSINIDADIISQQKQTFK FT KRFFFYLNKNKLLKNVFFSNDNLKTFKEQLSYLHWKQIDTCADANLVYNTF FT FKTFYEIYDANFPKQIVNKKAKSLMSPWITKELKKSSKIKQKLYVKYLKTK FT TEQNKVLYKKYAKEFERQKKKRKKKLLFRQKLNKIKFSIKNTLKNLKDRKK FT KEKKNYYSNLLDRNNLNSKRTWEILRELTGMQKLKTSSLPKTIKINEETSF FT DQIAIANKLNEYFVSVGPNLAQQIPILKNKLNNYAFPITSFLNSFELSFEE FT FELSFKKLKSNKSVGYDDIYCNVIIDSYGVIKDILFQIFKCSIKQGIFPDN FT LKIARVSPIFKGGNSTNVSNYRPISILPVFSKILERILYNIIFNHLSSNNI FT LYQYQYGFKKSNSTEHAILQTTRNIAESFEKSQYTLGVFIDLSKAFDTIDH FT KILFKKIEYYGIKGNILMILKSYLSNRKQFVYSNTTVSSNLLDITCGVPQG FT SILGPLLFLIYINDLPKASNLMTVMFADDTNLFLAHNNITTLFQNMNIELT FT KVSDWFKVNKLSLNVEKTKWILFHQTSKKKLLPPVMPLLFIDKVQIKREQT FT TNFLGINIDENLSWKYHIDLLCNKVARNIGVMYKARNALNKHSLTQLYFSL FT IHCHINYANIAWGNTNKSKLLPLYRQQKHVARLINYKDRFAHAKPLLFQMK FT ILNIYQLNIFNILCFMFKCKTNESPVSFHNLYTLKDKNKYNLRNDNQIQQP FT LSLTNFGKSCISYRGAFLWNKIVLKNFDFSHEWNYLSFKKKLKEIVLSFED FT IFLYF*" XX SQ Sequence 4295 BP; 1683 A; 591 C; 571 G; 1450 T; 0 other; ttcaattcac gtttcaccgc gaacggacgt gtttttaatt acaaaataaa aatggcgaaa 60 aatttttcgg cttcacaatt aaaagaaata ttagacattc atgagaacac aatgatgaaa 120 atatttaatg aaaaatttga aagattggaa aaacagattg ataagttgaa tgaagacaat 180 aaaattctta aaactgaagt aagcgaattg aaaaaagctg tcgaatttat tagcgaaaaa 240 tacgacaaaa taataattga acaagaagga ttgaaggctg attgcgcaaa accagataga 300 aacctagaaa agagtttaac cattaatttg agagacaaat tggctgagat tgaagaccga 360 agtaggagaa acaatttaag gatatccggt atagaagaat cagttgatga aagttgggac 420 gattgtgaaa gtaaagttcg tgaattaata aaaaaaaaac ttgatcttca aggtgaattc 480 attattgaac gagctcatag agttggaaaa aatgataaag ataatacaat taaaaaatgc 540 agaactatcg tggtcaaatt tttaaatttt aaggacaaat caacagtgct tggtaaatat 600 ataaagacga aactttggaa tcaaagaata tatgtgaacg aagatttcag cgaaaggaca 660 accgaattgc gaaagaaatt gtttacggct gcaaaagatt tacgaataaa aggtaaatat 720 gcaaaggttg tatataataa gttaataacg cgcgattaaa ggaaattctt ttattttatg 780 gttttctaaa atggtaaaat ggattccaat aatttaaacg actttgaatt aagttataat 840 tttttccgaa ctgatgagtt tttatttaat aaccaatcag atccggactt aaattacttt 900 tgtgaagcac atgctttaca aagcaactgc tcctattact atgccgacga aataaaagag 960 tttcttaact gtagatattt taacactttt catattaata taagaagcat atcaaagaat 1020 tttgaaaatt tctatttaag catgaaagaa tctcttaata tttttagttt aatttgcatg 1080 tctgaaacct ggtgtaactc taataactct aatttgaatt caagtatttt tttaccaggt 1140 ttcaatataa tttctttgga tagaaaaaat ggaaagcgag gcggaggttt aatagtttat 1200 attagagaat atttgcagta tattatcagg cctgacatga gcaattctga tgacgataat 1260 gaggtcttaa cagttgaaat ccaaacaaaa aaaaaaaaca aaaaatttaa ttataagttt 1320 ttgttatcgc ccaccttcgg gcagtataga aaattttaat ttatttttaa atgacattta 1380 aaaaaaaagc atttgtgaaa ataaattata ctacttaata ggtgacatta atttagattg 1440 ttaccaatac cacactaatt atggcgttaa aaaattttac aacgagttat ttgaactcgg 1500 tgcttttccc ataattaaca aaccaacaag aataactcct acatcaaaaa ccttaattga 1560 caatattata acaactgatg ttttcaataa atctctaaaa aaaggcatca ttaaaagtga 1620 cgtctcggac cacttcccta tttttttctc tattaatatc gacgcagata taatatctca 1680 acaaaaacaa acttttaaaa aacgtttttt tttctaatga taatttaaaa acatttaaag 1740 aacaattatc ttatcttcat tggaaacaaa tagatacctg tgctgacgct aatttagttt 1800 ataatacgtt ttttaaaacg ttttatgaaa tatacgatgc taattttcca aaacaaatag 1860 taaataaaaa agctaaaagt ttaatgtcac cttggatcac aaaagaacta aaaaaatcct 1920 ctaaaattaa acaaaaatta tacgtaaaat atttaaagac aaaaactgaa caaaataaag 1980 ttctctataa aaaatacgct aaagaatttg aaagacagaa aaaaaaaaga aaaaaaaaat 2040 tactattcta atttgcttga tagaaataat cttaattcga aacgtacgtg ggaaattcta 2100 agagaactca ctggcatgca gaaattaaaa acgagctcat tgcctaaaac tattaagatt 2160 aatgaagaaa catcatttga tcaaattgca atagctaaca aattaaatga atattttgta 2220 tcagttggtc ctaatctggc ccaacaaatt cctatcttga aaaacaaatt gaataattat 2280 gcttttccaa taacttcgtt cttaaactcc tttgagttat cctttgagga atttgaactt 2340 tcttttaaaa agcttaaatc taataaatca gttggttatg atgatattta ttgcaatgtt 2400 attatcgact cgtatggagt cattaaagat attctatttc aaattttcaa atgttcaata 2460 aaacagggta ttttccccga caatttaaaa atagccagag tttcaccaat atttaaaggg 2520 gggaattcaa ccaatgtttc aaattatcgc cctatttcga ttttgcctgt tttttctaaa 2580 attttagaaa gaattttata taatataatt ttcaatcatt tatcttcaaa taatatactg 2640 tatcaatatc aatacggatt taaaaaaagt aattcaaccg aacacgctat actccaaact 2700 acccgcaata ttgctgaatc ttttgaaaaa tctcaatata cgttaggtgt tttcatagat 2760 ttatcgaaag cctttgatac catcgaccac aaaatcctat ttaaaaaaat agagtattat 2820 ggaattaaag ggaatatttt aatgattctt aaaagttatt taagcaaccg taaacagttt 2880 gtttacagta atacaactgt ctcttccaat ttattagata taacatgtgg ggtccctcaa 2940 ggttctatac taggacctct tctcttttta atttacatta atgatctccc taaagcctca 3000 aatttaatga cggtcatgtt tgccgacgat acaaatcttt ttttagctca taacaacata 3060 acaacattat tccaaaatat gaatattgaa ttaacaaaag tttctgactg gtttaaagtg 3120 aataagctgt cactaaatgt tgaaaaaact aaatggatac tttttcatca aacttcaaaa 3180 aaaaagctac tacctcctgt gatgccctta ctttttattg acaaagttca aataaaaaga 3240 gaacaaacta ctaacttttt aggaataaat attgatgaaa accttagttg gaagtatcac 3300 attgatttat tgtgcaacaa agttgctcga aatataggag taatgtacaa ggctagaaat 3360 gctttaaata aacacagttt aacacaatta tatttctctt taatccattg ccatataaat 3420 tatgcaaata ttgcttgggg taataccaat aaatctaaac tgctacctct ttatcgtcag 3480 cagaagcatg ttgctcggct tatcaattat aaagatcgat ttgctcatgc caaaccatta 3540 ttatttcaaa tgaaaattct taatatttat cagctaaata tttttaatat actttgtttt 3600 atgttcaagt gtaaaaccaa cgaatctcca gtttcttttc ataatctata taccttaaaa 3660 gataaaaata aatataactt aagaaatgat aatcaaattc agcaaccttt atctttaact 3720 aattttggaa agtcatgtat ttcatatcga ggagcttttc tatggaacaa aatagtttta 3780 aaaaattttg atttttccca tgaatggaat tatctttcat tcaagaaaaa actaaaagaa 3840 attgttttgt cttttgaaga cattttttta tatttttaaa ttcttttaaa aatgaatatg 3900 aagtttatta cataaggtaa tatataaata ttatatctga tccataactt atgtgtttgt 3960 gaaaacagta ctttttaaaa tatttttgta tttatatata agagtatcaa caatttcttt 4020 attccaattt tatttatata ttattacgaa atttacgttc tcgctaattt ttaatacgac 4080 attgtttaga ttttatactt catgttaatt taatgtactt aatattcacg agcggttctc 4140 gatgataaga cctgaaggtc ttctgcgagt ctccgcgttc tttctttttt gtatatattt 4200 tcttttattg tataacgatt tttatctcaa ctatcttatt gtataactat attgaatatt 4260 caagaaaaga acaaaaaaaa aaaaaaaaaa aaaaa 4295 // ID Gypsy-4_TCa-LTR repbase; DNA; INV; 205 BP. XX AC ChLG6; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_TCa_; KW Gypsy-4_TCa-I; Gypsy-4_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG6; Positions 4062846 4062642. XX SQ Sequence 205 BP; 55 A; 49 C; 49 G; 52 T; 0 other; tgttagcgga atataggcag ttcctatggc aatacccagg cgtacccgac gcaggcgcct 60 gcgaaggcgt cgaatagtac gaatacgcag tagtcgcata gtacgcagtt caaatctgta 120 tcaaataaag tggtgttttc cgaatagtgt ctttctggcg gtaaaaaccc cattcactct 180 tgtttacccc attgggacgc taaca 205 // ID CR1-74_HM repbase; DNA; INV; 4442 BP. XX AC . XX DT 29-DEC-2008 (Rel. 13.12, Created) DT 29-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-74_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4442 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1901-1901 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(124..822,826..2031,2027..3991) FT /product="CR1-74_HM_1p" FT /translation="MAVTLSEIAKLFEGYKKEFELMLKQQEKVFIDIISGN FT LKIFNGRLEKVEKDIEELKHSLNYHEELIDEKVKKANVFSKEDNSEITRIQ FT KKLREIEDRSRRNNLRVDGINENEGESWEESELKVKKVFEELLSVKNVQIE FT RAHRTGKKESNKPRTIVIKLLDFKDKVAILSKSSNLKGKNIYINEDFCSET FT TQIRKGLREKMKIERAAGKFAFISYDKLIIRDWVAKKSKNSSSYFLLINGI FT LFLLLLILNLTVFKMEENFVIKFSDENDILGDYDTDSNFLNHKMLESKYYS FT VSECSQHLKKNKSTFSILNINVRSLNKNIENLKILLNELKHDFKIICLTET FT WCKSGETNYEFDLVNYASIHQPREFNAGGGVSIFIHDSLNYILRNDLCVNE FT TDCESLCIEIVNKTEKNIIINNIYRPPSGNMKKFKTHLKTFLNKIKNSRKH FT IYLVGDFNINLINYASNNNAKNFINTLLENNFIPTINKSTRVTKKSSTLLD FT NIITNSFYNSPLFTGIIQTDLSDHFPTFLATNNILINNIATKSTIFRRQIN FT ENSLKHFKCCLKNNVDWNLILQSSDANNAYNLFLGQFCKQYEIAFPEKEII FT INTKSLQNPWMSIGLLKSSKKKEKLYIKYLKKKILEKKNKNFXXYIKYLKK FT KSYKNEITYKNYKNIFEKIKRYSKKLYYAQLLKKHNGNTKKVWNVIKDLIG FT KNNYKKNTLPKNLLINGKLVYDKSIIAEELNNYFVNIGPKLAAKIPSNSTP FT FDSYLKTYDKLMDETSLKLSELSAAFSSLKSKKSEGFDKISIDIVKSVFDI FT IEPSLFHVFDLSFKTGIVPDKLKVARITPIFKSGDKSNILNYRPISILPCF FT SKLLERIMYNRLYNYLTENDMLYCKQFGFKKKNSTEHAIVELSNQISNAFN FT NDCFTLGVFIDLSKAFDTVNHNILIKKLENYGVKNKNLLWFKGYLTNRRQF FT LYYDQNNTIPYSITCGVPQGSILGPLLFLLYINDLYLATNLLDLILFADDS FT NLFYSHRDIKTLFKIVNEQLCKVNEWFVSNKLSLNVDKTKYILFHKVNKSE FT NIPIKLPNLIINNTYIKRETNVNFLGVKLDENLLWYSHINSIETTLSKNIA FT MIYKAKPFLNVASLKKLYFAFIHSYLSYCNIVWASTGYTKLKKIYNKQKHA FT CRIVFGACRNTPCEPLLRALGALNVYKLNLHQVLLFMFKTKMGLSPIIFQT FT YFNKINHKYSTKFSDNNFVLLKYNLKLTFYSIKYRGPYLWKKFPEIVNKNK FT TTLEQFKNESKQLLLFRDLNLSDFKCFF*" XX SQ Sequence 4442 BP; 1766 A; 593 C; 588 G; 1493 T; 2 other; tgctttcttc atgaacagac gtgttttggc agcgtaaaaa tttaaagaaa aaactgtata 60 attttaaatc tttagtatat tttatttatt tttcttgaga aaataattca ttttttatat 120 aatatggctg ttacactatc agaaatagca aaattgtttg aaggctataa aaaagaattc 180 gaattgatgt taaaacaaca ggaaaaagtt tttatcgaca ttattagtgg aaacttaaaa 240 atatttaatg gcagattgga aaaagttgaa aaagatatag aagaattaaa gcatagctta 300 aattaccatg aagaactaat agatgaaaaa gttaagaaag caaatgtttt cagtaaagaa 360 gacaattcag aaattacacg catacaaaaa aagttaaggg agatagaaga tcgatcgaga 420 cgtaataatt taagagtgga tggaattaat gaaaatgaag gtgaaagctg ggaggaaagc 480 gaacttaagg taaaaaaggt ttttgaagag ttactatctg ttaaaaatgt ccaaattgaa 540 cgggcccaca gaacaggaaa aaaagaatca aataagccga gaacaatagt gataaaacta 600 ttggacttta aagataaagt tgcaattttg agcaagtcaa gcaatctgaa agggaaaaac 660 atctatataa acgaggattt ctgttctgaa acaacgcaaa ttcgaaaagg tttgagggaa 720 aaaatgaaaa tcgaacgagc agctggtaag tttgcattca tatcttatga taaattgata 780 attcgcgatt gggttgcaaa gaaaagcaaa aattcatctt cataatattt tttattaatt 840 aacggaattt tatttttact attacttatt ttaaatttaa ctgtttttaa aatggaggaa 900 aatttcgtaa ttaaattttc ggatgaaaat gacattctcg gtgattatga caccgactca 960 aattttctta atcacaaaat gttagaaagt aaatattact ccgtttctga atgttcacaa 1020 catcttaaaa aaaataaaag cactttttca attttaaata tcaatgttcg aagtttgaat 1080 aaaaacattg aaaatttgaa aatattatta aatgaactta aacacgattt taaaattata 1140 tgtctgacgg aaacttggtg taaaagtggt gaaacaaatt atgaatttga tctagttaat 1200 tacgcatcaa ttcaccaacc acgtgaattt aatgcaggtg ggggcgttag tatttttatc 1260 catgattcac ttaattatat tttacgtaac gacctctgtg ttaatgaaac agattgtgaa 1320 tcattatgca ttgaaatagt aaataaaact gaaaaaaaca taattataaa taatatttat 1380 agaccaccat ctggcaacat gaaaaaattt aaaactcatc ttaaaacatt tctaaacaaa 1440 attaaaaatt ccaggaaaca tatatatttg gtcggggatt ttaacatcaa cctaatcaac 1500 tatgcttcta ataataatgc taagaatttt ataaatactc ttttagaaaa taactttatt 1560 ccaacaataa acaaatcaac aagagttaca aaaaagtctt ctactctact tgataacatc 1620 ataactaaca gtttttataa tagtcctctt ttcacaggta taattcagac tgatttatcg 1680 gatcattttc caacattctt agcaaccaat aacattttaa ttaacaatat tgccacaaaa 1740 tccacaatat ttcggcgaca gatcaatgaa aactccttaa agcattttaa atgctgttta 1800 aaaaataatg ttgattggaa tttaattttg caatcaagtg atgctaacaa tgcttataac 1860 ttgtttcttg gtcaattttg caagcaatat gaaatagcgt ttccagaaaa agaaataatc 1920 ataaatacca aatcacttca aaatccctgg atgtcaatag gcttactaaa atcttcaaaa 1980 aagaaagaaa aattatatat aaagtatctt aaaaaaaaaa tattagaaaa ataaaaattt 2040 ttawwaatat ataaagtatc ttaaaaaaaa atcttataaa aatgaaataa cttataaaaa 2100 ttacaaaaat atattcgaaa aaataaaaag atactcaaaa aagctttatt atgctcagtt 2160 attaaaaaaa cacaacggaa acactaaaaa agtatggaat gtaattaagg atttaattgg 2220 aaaaaataat tacaaaaaaa acactctgcc aaaaaatctt ttaattaatg gaaaattagt 2280 ttatgataaa tctattatag ctgaagaact aaacaactac tttgtcaata ttggtccgaa 2340 attggctgct aaaattccat ctaactccac tccttttgat tcatacttaa aaacttatga 2400 caaacttatg gatgaaacta gtttaaaact tagcgagctg agtgctgcat ttagcagtct 2460 caaaagtaaa aaaagtgaag gttttgataa aattagtatt gatattgtca aatcagtatt 2520 tgatattatt gaaccttctt tatttcatgt ttttgatctt tcatttaaaa caggtattgt 2580 tcctgacaaa ctaaaagttg cccgtataac accaattttt aaatctggag ataagtcaaa 2640 tatcttaaac tacaggccta tatccatact accttgcttt tcaaaattgc tagagagaat 2700 aatgtataat aggttatata attatctaac tgagaacgac atgttgtatt gcaagcaatt 2760 tggttttaaa aaaaaaaatt ccacagaaca tgcaatagtt gaactttcta atcaaatatc 2820 taacgccttt aataatgact gttttacttt aggagttttc attgatctgt caaaagcttt 2880 tgataccgta aatcacaata ttctaataaa aaaacttgaa aactatggag tgaaaaataa 2940 aaacttactt tggttcaaag ggtatctaac aaaccggagg caatttttat attatgatca 3000 aaataacaca attccatatt ccataacttg cggggttcct cagggctcaa ttttaggacc 3060 actcttattt ctattgtaca taaatgactt atacctagca acaaatcttt tagacctaat 3120 tttgtttgct gatgattcca atctttttta ttctcacaga gacattaaaa cactttttaa 3180 aatagttaat gaacagttat gtaaagttaa tgaatggttc gtaagtaaca aactttcatt 3240 aaatgtggat aaaacaaaat atatactctt ccataaagta aataaatcag aaaacatacc 3300 tataaaatta ccaaatctta taattaataa tacttatatt aaaagggaaa caaacgtaaa 3360 ctttttagga gtaaaactgg atgaaaattt actgtggtat tctcatataa atagtattga 3420 aacaacactc tcaaaaaata tcgcaatgat ttataaggct aaaccttttc taaatgtagc 3480 atccctgaaa aaattgtatt ttgcctttat tcacagctat ttgtcttact gtaatattgt 3540 gtgggctagt actggctata caaaacttaa gaaaatctac aacaaacaaa aacatgcatg 3600 cagaatagtg tttggagcct gcagaaatac tccttgcgaa ccacttttac gtgcactcgg 3660 tgctttaaat gtgtataaac tcaatttaca ccaagtttta ttgtttatgt ttaaaacaaa 3720 aatgggattg tccccaataa tttttcaaac ctattttaat aaaattaatc ataaatattc 3780 aacaaagttt tctgataaca actttgttct tctcaagtat aatttaaaat taaccttcta 3840 ttcaattaaa tatcgcggac cttatctgtg gaaaaaattc ccagagattg tcaacaaaaa 3900 caagactaca ttagaacaat ttaaaaatga atcaaaacaa ttattactat tcagggatct 3960 taatttatct gacttcaaat gtttctttta aaatttaatc aaataaaaac aaatttattt 4020 atatcaatca aatttaagta atgatgttaa aaattcatct tgtgataatg ttaattatga 4080 aattttattc ttatttaaaa ctttttgttt aaatgatttt aaaacatttt agcggttgct 4140 agcatttcga caattcttag ttatttgcta ttttgaattt cttaattttc tcagttactt 4200 tctgttttaa ttcttaccta tattttatat atttatatta tatacatttg tatgttacgg 4260 ttatgtagag ttttttttat gtagagtttt ttttatgtag agtttttttt ttttatatgt 4320 ggggctcggc gataaggcaa aatgtgcctt cttcttgctc ctgccattag ttttgatata 4380 tgtatatgca atgtaaacat tctttaacgg cgaaataaat tacacttaaa aaaaaaaaaa 4440 aa 4442 // ID Mariner-18_HM repbase; DNA; INV; 2440 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 2) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2440 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1952-1952 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(555..1559,1563..2021) FT /product="Mariner-15_HM_1p" FT /translation="MPRKYKHVVGGGYKKHDPKTIERAISDIQNGLSLRKS FT AEKHGLHYSVLYRHLKRGPNLKKHGGQTALSLEEEQLFVDRLKICGKWGYP FT IDTTTLRLLVKDFLDRRGKEVKRFKDNLPGRDFVESFIKRHKEQLAVRMCQ FT NIKRSRAGVTPETINHYFDELTNELKDVPLSNIVNYDETNLSDDPGKRKVI FT IRRGTKYPERVINSSKSSTSVMFAAAADGAILPPYVVYKAMHLYQSWTEGG FT PKNARYNRTKSGWFDSFCFEDWVLTVALPYLKNKTGRKILIGDNLSSHLSM FT ESVKLCKDNNISFIFLPANSTHLTQPLDVAFFRPLKSTWRQILEWKKGPGR FT TEATVPKDKFPPFLKKLCVSLKEENVITGFKKCGIAPLNRNKVLSMLPKII FT DDEENQNPENIQKNTEAVDGSFQELLQSLRHDQTPKIRQKRTKVNVKPGKS FT IGVEDFVVDDEIPQTSRSTGQNPGTSGTKQVQPKKSRKRVEI*" XX SQ Sequence 2440 BP; 825 A; 421 C; 458 G; 735 T; 1 other; ccgtaaaacg gggtgaatag aaacagtggg gtgaatagaa acaaaacttc atttttgtat 60 atataatcta tttactacat aactatgagt aaataatggc aactaataat ttttttaggt 120 tgctacactt tagcctgaaa gtgaaataat ggtttgctta tttagcagca ttggttcttg 180 tttgtttttc atgttgaatt ttaccgcgca attcttgttt tttttttttc actttccagg 240 taagtcggaa tactcagtgt tcttgaaagt aaatagctgt acaaaacata gtttaattat 300 tcatataaaa gagttaggct aactttaagc tttaactgta acaatattat tttaataatt 360 attcgaattt tttaggttag gttttcagaa ttgacggggt gaatagaaac atataatttc 420 tagtgtttct ttacaccccg caataaaaaa gtaattctaa atcacgttta tactgtataa 480 aatacatact gtatgctata tatttatgtt agacacttaa ctgacacagc tgttttattt 540 cagaaccctg taaaatgcca aggaagtaca aacacgtggt tggaggtggc tacaaaaagc 600 acgatccaaa aacaatcgaa agagcaattt ctgacataca gaatggatta agccttcgaa 660 agtctgctga aaaacatggt ttgcattata gtgttcttta tcgacacttg aaaagaggtc 720 caaatttgaa gaaacatggt ggtcagacag cattatctct tgaagaagaa cagctgtttg 780 ttgataggct taagatatgt ggcaaatggg gttatcccat tgacactacg actctacgac 840 tccttgtaaa agacttttta gaccggagag ggaaagaagt taagaggttt aaagataacc 900 tccccggccg tgactttgtg gaatcattta taaaacgtca taaggaacaa ctggccgtga 960 gaatgtgtca gaatattaaa cgctcgagag ctggtgtaac accagaaaca attaaccact 1020 attttgatga actgacaaat gaattaaagg atgtgccttt atctaatatc gttaactacg 1080 atgagacgaa tctaagtgat gaccctggta agcgaaaggt cattatacgt agagggacaa 1140 aatatccaga aagagtcata aactcatcaa agtcatctac gtctgtgatg tttgcagccg 1200 ctgctgatgg cgctatttta ccaccttatg ttgtgtataa agctatgcac ttgtaccaaa 1260 gctggacaga aggtgggcca aaaaacgcca ggtacaatcg aacaaagtct ggctggtttg 1320 attccttttg ctttgaagat tgggtactga cagttgctct tccatacctg aaaaacaaaa 1380 cgggtcgaaa aattctgatt ggggacaatt tgtcttccca cttatcaatg gaatccgtga 1440 agttgtgtaa agacaataat atcagtttta tttttctacc agctaactcc acccatctca 1500 cacaacctct agacgtggca ttttttcgcc cmttgaaatc aacatggcga cagattcttt 1560 aagaatggaa aaaaggccca ggaagaacag aagcaactgt tcctaaggat aaatttccac 1620 cgtttttgaa aaagttgtgt gtttcattga aggaagaaaa tgtgatcacc ggttttaaga 1680 aatgtggtat agctcctcta aacaggaaca aagtgctgtc tatgcttcca aaaatcattg 1740 atgatgagga aaatcagaat ccagaaaaca ttcaaaagaa cactgaagct gttgacggga 1800 gtttccaaga gctccttcag agtctgcgtc acgatcagac tcctaagata aggcaaaaaa 1860 ggaccaaagt aaacgtgaaa cctggcaaaa gtattggtgt tgaggacttc gtcgtggacg 1920 atgagattcc ccaaacttca cgttctactg gccaaaaccc aggtacctca ggtactaaac 1980 aggttcagcc aaagaaaagc cgaaagcgtg tggagatata atcagaagat gaagaggact 2040 attccaattt gttttcttac cgtagggcac attatatgta acttcttttg tttatgtact 2100 caaataaatc attatgttga gaaataaatt tatttgtatt catttattga ccaaaaatcc 2160 atatttgtct gtttcaattc accctaaatt tttttcaata gaaaaacaca tgaaattgaa 2220 gtactgtttc tattcacccc catctgcggg gtgaatagaa acaccatgct ttatatattt 2280 ttctttaaaa tcaaataaaa ttaaaaaatt taaatactat aaacaaatag gcctaacaga 2340 gacctgtgtt tgggcaaaaa attgtttatc tagaacaact acaagtatgc acattttcat 2400 accaaagttg aaaattgttt ctattcaccc cgttttacgg 2440 // ID Merlin-1_Hrobusta repbase; DNA; INV; 2001 BP. XX AC . XX DT 10-MAY-2011 (Rel. 16.05, Created) DT 10-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Merlin; DNA transposon; Transposable Element; Merlin-1_Hrobusta. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-2001 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2001 BP; 651 A; 327 C; 338 G; 685 T; 0 other; ggcaaaacct taggatggca ctgggaaatt ccataaaata gtaaacaaag agttaaaaat 60 aaatccagct taatttgcgc catttagctc ctcccccttt ttaatattta tgtttattag 120 attacaagac gagaacaaga aaagatcttt tttcaatcat tttcatttat taatttttaa 180 ttaattttca ataatagata ttttaaaaat tttaatttgt ttttcattta aaaataattg 240 tcaataataa atattttaaa aacggcttgt gttaattatt ttaaaaagaa aatgctgatt 300 tattttatta ataaaataaa ttatgatcaa tcaatcaatc aaattaattt ttatagcgcc 360 ctttccgtaa tgtatgatgt ttcacttaat ttttatagcg cccttttcat gaatgtatga 420 tgtttcactt ttaaataaca acgcgtatgt ttctgccatt ttttcttctg ttttttctgt 480 cattttttcg tctggttttt ctgtcatttt ttcgacattt tgaatttatc tgtagtcatg 540 aactttccgg ccgttaatag tctgtcatca atggaactga caaacgccat aaattcgaaa 600 caaaaagcag tggactgggc gatgcaacac ggattgctgt cgtctggaat gacatgcgtt 660 tgtggaaaag ttatgcgttt gtacaattcc aaaaaagcag cagacgaaca gatatggcgt 720 tgtacaaacc atacggcctg cagcagaaca aaatccatta gaaatggtag tttttatgaa 780 aaatgtaagc taccaatgag cacatgtatc atggttatta tttttttatt cttaacttaa 840 attatatttt ataaatatta aattttagaa tgaaaaacaa taataatttt gttcattcag 900 ttgacatatt attggagcat gggggacatc agtcaggcag cgacagcttc gttgctacaa 960 atagataaaa ataacacgtt gcctgactgg ttcaacctcc atcgccaact ctgtaccgat 1020 tgggtacgag acaacccaaa aataattggt ggacctggtc gaattgtcca tattgatgag 1080 agtctggtgt ccagtaacaa aaggacaagg aatggaaggg ccaggccatt tcggcagcgt 1140 tgggtttttg gcggcataga taatgtttcc aaagaagcct tcttagaaga ggtggctcaa 1200 cgtgatgcgg ccactttgct gcccataata caacggcatg ttttgcccgg taatgatttt 1260 aacattttct attaaccttt cctatattaa cattttatct aattctcatt caggcacgac 1320 aatatggtct gacaagtggg ccgcttatgc gaacattccc agagttacgg gactggcaca 1380 cgacactgtc aatcatcgat atggctttgt tgctccaaat ggagtgcata ctaatgccat 1440 tgagaatctt tggaagtgtg ccaaagataa atttaaacgg atgcatggta cctgcgatga 1500 tcacgtctct tcgtatctcg atgagtttat gtggcagcgc ggtatcaagg accgtgatga 1560 gcggtttcaa aaagctcttg agctcatccg tacatattac cctgtttaaa catttaaatt 1620 aaatttaatt gttacttttt tttattgcga tttaaattgt aagtaatttg tattttccat 1680 ttattacgat aaaataaaat cattaacttc taaaaaaaaa gtgttttggc ttgcactcgt 1740 taatcataaa atttttaaat aacatcttaa aaaggacgaa aaaattaacc gctttaaata 1800 tcatttatta ttttacaaga caagaaaaat taaaaattgc gcatttaaac atttattaca 1860 tcaacaaata caacggttat taaaaatgaa cataaatatt aaaaaggggt ggagctaaat 1920 ggcgcaaatt aagctggatt tatttttaac tctttgttta ctattttatg gaatttccca 1980 gtgccatcct aaggttttgc c 2001 // ID Mariner-31_SM repbase; DNA; INV; 2296 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-31_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2296 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1880-1880 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 586..2115 FT /product="Mariner-31_SM_1p" FT /translation="MNLNESTVRTIKNNEKNKRRCLCRVCRVRDILMEKME FT KSLIIWIEDCTNKRIPLDENVIKQKSLKIYQHLKETGQISSEESSGHEFCA FT SNGWFENFKNRFALHNLKIKGETASADKEAAQKFPAEFLKVIEGGYTADQV FT FNADETGLNWKKMPQRTYISKNEKCAPGFKSAKDRITLLLCSNASGDFLTK FT PMLIYRSLNPRVMKTVVKSKLPVFWRANKKAWMTGCLFNDWFYNCFVPEVE FT DYTKKKNISFKVLLVIDNAPGHPKDLNHPNVKIVFLSPNTTSLIQPLDQGI FT ISTLKAYYIRRTFQIILDKMDANPNLTVQQLWKDFSILNCVEIVFLALKEL FT KPTTLNACWKAIWPEVVFNENVVPALDVEVSRILNLAHSLDGEGFNDMVEE FT DIHELLNDGQELTEDELVQLVSESDPNAAKPNDDESSAEDSDNSVKCFTIK FT TLRTGLDLANNLKTYFENNDPSGERSGKFNRQLEICLAPYYELQKELQKNT FT KQCRITDFLNLLKK" XX SQ Sequence 2296 BP; 830 A; 340 C; 410 G; 715 T; 1 other; cacctgaccn tcgtataacg cgatttctat ataacgcgaa ttacaatttt cgcaaaacct 60 cgtatagcgc ggaactaaac cctcgtataa cacggttttt ttgtaactac aaatttgtag 120 tatccgggta tttccctgag tgtataaaga aatatgcata aagaaattct taaaccgaaa 180 cgttgtattt atgtttcttt tccacattgt ttgtgtttat gaacaagttc atacacgata 240 aataattttt gtatgcattt tcatatttaa aaaaagtcat ttttgtatat gtacaaaatt 300 aaaagtgaaa atattaaata ataattttat cgttggactt aataataata atttttaaat 360 aagttttaat ttaattaaac aggtaattaa aagtttctga tttaataata aatttaccgt 420 tttaaattta aaaaattgtt ttgttttagc tatggctagt aaagaagcga atattagaaa 480 caagaacaag ccaacgttga aaagaaaatt tatctctttg gaagaaaaaa ttaaaatatt 540 ggatcgacaa aaaatggaga taaattagga gtcgttgcac actcaatgaa tttaaatgag 600 tcaacagttc gaaccattaa aaataatgaa aaaaataaga gacgctgttt gtgccgagtt 660 tgtcgtgtcc gagatatttt aatggagaaa atggagaaat cacttatcat atggatagaa 720 gattgtacca ataaacggat accattagat gaaaatgtca ttaagcaaaa gtcgcttaaa 780 atttaccaac acttgaaaga aactggccaa atttcatcgg aagaatcgtc tggtcatgaa 840 ttttgtgcta gcaacggttg gttcgaaaat tttaaaaaca ggtttgctct ccacaatcta 900 aaaattaaag gagaaactgc atcagcagat aaagaagctg cccagaaatt ccctgcagaa 960 tttttaaaag ttatagaagg aggctataca gcggaccagg tatttaatgc cgatgagacc 1020 ggtttgaact ggaagaaaat gccacaacgg acgtatattt caaaaaatga gaaatgtgct 1080 cctggattca aatccgctaa ggatagaata acgttgctgc tatgtagtaa tgcttcgggt 1140 gatttcttga ctaaaccaat gcttatttac cgctctttaa atcctcgtgt gatgaaaact 1200 gttgttaaat ccaagctacc tgttttttgg cgggcaaata agaaagcatg gatgactggt 1260 tgcttattta atgattggtt ctataactgt tttgtaccgg aagtcgaaga ttatactaaa 1320 aagaagaata taagctttaa agtactttta gttattgata acgcaccagg acacccaaag 1380 gatttaaatc atccaaatgt aaaaatagtt tttttgtcac ctaataccac atcgcttatt 1440 cagccgttgg atcaaggcat tatatcgaca ctgaaagcat attatattcg tagaacattt 1500 caaataattt tagataaaat ggatgctaac cctaatttga ctgtccaaca attgtggaaa 1560 gatttctcaa ttttgaattg cgtggaaatt gttttcttgg cattgaagga attaaaaccc 1620 acgactttaa atgcttgctg gaaagcaata tggccagaag tagtttttaa cgaaaatgtc 1680 gtacctgcat tggacgtaga agtaagtcgt atactaaacc ttgcacattc tttagatgga 1740 gaaggattca atgatatggt tgaagaggat attcatgaat tattaaatga cggacaagaa 1800 ttaactgagg atgaactagt acagttagta tctgaatctg atccaaatgc tgctaagcct 1860 aacgacgacg aaagttcagc tgaagattca gataattcag tgaaatgttt taccataaaa 1920 actttaagga caggattaga cttagctaac aacttaaaaa cttattttga aaataatgat 1980 ccgtcaggtg aacgttctgg aaaatttaat aggcagttgg aaatttgtct agctccttat 2040 tatgagctgc aaaaggaatt gcaaaaaaac accaagcaat gtcgtataac tgactttttg 2100 aatttactga agaaataaaa taactatttg tttgtatcgt ttaaataaaa tcatataaaa 2160 aagtttattt tctttaaaaa aatcttttgg aacgtaaccc ctgaatttgt actggaactt 2220 agtctcgtat aacgcggatt tgcataacgc gcaatttctg aggaacctaa cctccgcgtt 2280 atacgagggt caggtg 2296 // ID DNA-TA-12_CQ repbase; DNA; INV; 1687 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-12_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1687 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 62-62 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >89% CC identity. TA TSDs. The sequence 727-292 is ~87% identical to CC Zator-N1_CQ, but Zator-N1_CQ is truncated. XX SQ Sequence 1687 BP; 626 A; 196 C; 263 G; 601 T; 1 other; gggttagtga aatttcacga tttcgcggac agcgtgaaat ccgcgaaatt tggctttttc 60 cgcgaaatcc cgtgaaattt tatgtttgtg tgaaaatgta acgatttttc ataaaataac 120 ttatcaaact taaaaaggaa taaaaataat gttatgaact actgtagaat taagcgagtg 180 aataccctgg ttttattact aatatagggt taatgatttt tgacggtttc gtagacggcg 240 tgaaattcgt gaaatttatc aattttctcg aatcattgaa tgcaaatatg ggcaaaaaaa 300 aaataaaaaa tataaatatt gaaataacaa gccatagttt caacatttgc atggaaaaag 360 tgttttaaaa tgcattttac actagttcag tagttttgca atcattagtt ttcaaaaaat 420 gtaagatttg acgaaaacaa aaattttagc aaaaaaaaaa acattttgcg ataacaaaca 480 tcgaaaattt tcaaaaattc gaaagatttt taaatcaacc caaacattct taaaatgatc 540 ccaaacattc taaaaatgga atgaaaaacg caggggaatg aattttaaat ttatttcagc 600 tgatccattg aaatttttaa gtttttcgaa aaaatatttt tttgtcccct gatttttcgg 660 gccaacttcg aagggagggg gagggggggg ggagacaaaa acttttaata aaatttgtac 720 cagcctaaca acttaataaa tatttcatcc aaaaagaaaa gaaattttat tagttttcaa 780 ttaaaaagct tgattatata tgtgtgtcaa gttgatatga accatttgaa gaaatgttga 840 gatttttgag ttttttagaa actaacaaaa tttagtaata ttttcattcg ctgtattaaa 900 aattttagtg ctattttatt tgaaaggtat agttttgaaa gaaattaaaa gaatcaagct 960 ttgttttatt caaaataaaa ttaagagtat ttttttatct ttggaatcat attttaaaaa 1020 cataacaatt tttttgggcc tgttaaattt taacgatatt tgtttatttg ggtggatgat 1080 tcccktgaaa gttaaaaaat actccttttt tgttttctct tctcatttag aataaaaatt 1140 tcgattttgt taaaaattat attatttttg caaaatgttt ataaatttag tttttgaaaa 1200 atgagctaaa acccttaaaa aatatgttta atgggctaaa tgacgcagaa aattcaattt 1260 attttactca ttttgactgg acaaatctag ctttgatatg aaattcaata aaatgtaaaa 1320 tgatataaca gaagcaaatt agcgaaaaca tttgagcttt gttagtcgaa tattaaaaga 1380 gcagttcagt aaatgagaat acgcgtgccc tacattttgt ttcaaaatat atatagagcc 1440 catccaataa ccaggaggat ttttttttaa ttggaagaat ttttttgtcg cattcaaatt 1500 tagtaaattt ttttttacat aataatacat aatgaagcat aaaaagtagt ttctgtgatt 1560 taaacatagg attttcagag ttattttaac atttttcaag ttccgcgaaa tttgcgtgaa 1620 atttggtgtt ttgaaattga ggtccccgtg aaatttgcaa ttttcgagcg tgaaaaatca 1680 ctaagcc 1687 // ID BEL-44_AA-I repbase; DNA; INV; 5946 BP. XX AC AAGE02018502; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-44_AA_; KW BEL-44_AA-LTR; BEL-44_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5946 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018502; Positions 53132 59077. XX CC Positions [4963-5535] - Integrase core CC 'AATAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 25..5946 FT /product="BEL-44_AA-I_1p" FT /translation="MMYRSDDMVKDRLEATLHSCQSCNRPDAAESDMVECS FT MCKLWEHFGCAGVDERVKQPDQRYHCKVCRNKLGVISKQLPLSAANPPARS FT TRASSKKSHASATSSVRAAMLQAQLKIAEDELQRRQQELLEQAEIKKKELE FT ESERQLDEKKRIAEEERSLRERKLRDEKELKALQSQLRRESMERQHEIIRQ FT AALNCSRGGSVADSIQKVSNWLVEHTGAGGLGGNQLGDKDNHSIVGPHSKK FT VPTNPALNVDEVNQFDGETPLRQHQVLPPPPVHHSTPIHSVEPPLRVIQDQ FT LNRFSIDPIHQQARRMASGVKRAISEPTVLTQQHIAARQVLGKELPVFSGD FT PEDWPIFISSFQQSTIACGYTDTENLIRLQRCLRGRALETVKSRLLLPSGV FT QSVIQTLRTLYGRPELLIHSLIEKIHHVAAPKHDRLDTLIDFGISVQNLVD FT HLVAADQRDHLSNPMLLRELVEKLPGSMRLDWAAYKSRFQTANLETFGNFM FT TNLMTAASEVTYDLPGSSNSYRVDKRKQKESASFHTHVLSASNPPPPVSEV FT KPGKPCTCCARDGHRVADCPQFNLADVDERWKLVELKGLCRTCLNNHGKWP FT CRSWRGCGIEGCRQKHHTLLHTSGSSENMNIVANHASLDEFPAPLFRILPV FT VLFGNQSSQVVFAFIDEGSSYTLLEDSIADKIGVSGPTEALTLQWTGNVKR FT TEHKSRIVQAEISAKTDSTKYKLVHARTVSSLLLPSQTLRYRELARRFPHL FT RGLPVEEYELVQPKLLIGLDNLRLTVPLKLREGGQNDPIGAKCRLGWTIYG FT CVPESTVIKSVVNFHVAAASNNDNEMNEQMRDYFSLENAGVNSLNVILESE FT EEKRANRILSETTRRTGDGSRFETGLLWKMDNPNFPDSYPMAVRRLEALER FT RLQMNPVLAENVKNQIKAYERKGYAHKATPIELTSVDASRVWYLPLGVVTS FT PKKPNKVRLIWDAAAKVGETSFNSKLLKGPDLLIPLPKVLCQFRQFPIAVS FT GDLMEMFHQIKIRFPDCQSQRFVFREKPTEYPQVYVMDVATFGSTCSPATA FT QYVKNLNAQEFSAEYPRAAEAIVKRHYVDDYLDSFRTVDEAVETVNEVRMV FT HGKGGFTLRQFLSNSREFLNAIGEIAEEESKNLSLERGEKLESVLGMKWLP FT REDVFTYSLVLREDIRKLLEHDHTPTKREVVKVVMSLFDPLGLISFFLVHG FT KVLIQDIWARGTDWDDVIPEDLHDRWLQWTSLFPKLDQLRIPRCYFRSAIP FT ETPDCLQIHVFCDSSEAAYSSAAYLRLEINGVIEVALIGSKTKVAPLKTIS FT IPRLELKAAVLGSRLLETLQSYHNLQLSRRFLWSDSSTVLSWIRSDHRRYN FT KFVAFRIGEILTMTDPAEWRWVPTKSNVADQATKWNNGPQLSMENPWFRGP FT SFLCAPEVSWPVQCPVTTTKEELRSNVVLYHDKPRLSILINLSRFNSWLKL FT QRTVAFIFRYLGNCRRKMKKESLQLGVLTQEELKQAEELLWKIAQNEAFTE FT EISILVQSQGSPENRHRIVHKSSHIYKLWPFLDDRGILRMRGRIGAATYAP FT IETKFPAVLPKNHPITFLIVDWFHRRYNHANRETIVNEIRQKFEIPKLRSL FT VGKVSKHCVWCRIVKAAPRAPAMAALPAMRLTPFIRTFTFVGLDYFGPVFV FT RIGRSLAKRWVAVFTCLTIRAVHLEVVHSLNTESCIMAVRRFVARRGPPRE FT FYTDNGTCFQSASRELKEEIERRNEALASTFTSAETSWKFIPPAAPHMGGV FT WERLVRSVKVATSAVLDAARKPDDETLETVLLEGEAMINCRPLTFIPLESA FT DQEALTPNHFLLGSSSGVKILPTAPVDNKAVLRSSWKLAQSITEEFWRRWI FT KEYLPVIRRRCKWFEETKDLAVGDLVMVVSGSARNQWLRGRVEQVFPGKDG FT RVRQALVRTATGVLRRPAVKLAVLDVASCSKPSSMSSGDPGNHQGLRAGV" XX SQ Sequence 5946 BP; 1614 A; 1420 C; 1536 G; 1376 T; 0 other; attttcttta agaattttgt tggaatgatg taccggagtg acgacatggt gaaggatcga 60 ctagaagcta cgctgcatag ttgccaatca tgtaaccgtc cggatgctgc ggaatcagac 120 atggtagaat gcagtatgtg caagctatgg gagcattttg gctgcgcagg agtcgatgag 180 agggttaaac aaccggatca acgataccac tgcaaggtat gtcgcaacaa gttgggagtg 240 attagtaagc agctgccgct ctccgctgca aatccacctg ctagaagcac acgggcgagc 300 tcgaagaaga gtcatgcgag tgcgacgtca agtgtacgtg cggcaatgct gcaggctcaa 360 ttaaagatag cggaagatga actacagcgc cggcagcaag agttactaga gcaagcagag 420 ataaagaaga aggaactgga ggaatcagag cgtcagttgg acgaaaagaa aaggatcgca 480 gaagaagaac gaagtttgcg tgaacgaaaa ctgagggatg aaaaggagtt gaaggcgttg 540 cagtcgcaat tgcgaagaga atcgatggaa aggcaacacg aaatcattcg gcaggcagcg 600 ttgaattgca gtagaggtgg atcagttgcg gattccattc agaaggtatc gaattggttg 660 gtcgagcata cgggcgctgg aggactagga ggaaatcagt tgggagataa agacaaccac 720 agtatcgtcg gtccccactc taaaaaggtt cctactaacc cagcgctcaa tgtggatgaa 780 gtgaaccaat tcgatggtga aactccgctt agacaacatc aagttctacc acctccgccc 840 gtacaccatt caactccaat ccactctgtt gaaccaccat tacgtgtgat tcaggatcag 900 ctgaaccgtt tctcgatcga ccccattcat cagcaagcca gacgcatggc aagtggtgtc 960 aaacgggcaa tttcggaacc tacggtactc actcaacaac acatcgcggc acgacaagta 1020 ctgggcaaag aattaccggt gtttagcgga gatcctgaag attggcccat cttcattagt 1080 agtttccagc agtcaacgat cgcctgcggt tatacggaca cagagaacct tatccgcctt 1140 cagcggtgtc tacggggccg tgctttagag acggtaaaga gccgtttgct actaccgtcc 1200 ggggtgcagt ccgttataca gaccctgcgt acgctctatg ggcgacccga actgttaatc 1260 cactcactca tcgaaaaaat acatcacgtt gcggcaccta aacacgaccg acttgacacg 1320 ttgatcgatt ttggcatttc ggttcaaaac ctcgtggacc accttgttgc ggctgaccag 1380 agggatcacc tatcgaatcc gatgctactt cgggaattgg tagaaaaact tccgggttcg 1440 atgcgacttg actgggccgc gtacaaaagc cgttttcaga cagccaatct agagacgttt 1500 ggtaacttca tgacaaattt gatgacagca gctagcgagg ttacatacga cctaccaggt 1560 tccagcaatt cgtacagagt ggacaagcgg aagcaaaaag agtcggcatc atttcatacg 1620 cacgtcctca gtgcgtccaa tcctccgcct cccgtttcgg aagtcaaacc aggtaagcct 1680 tgtacatgtt gtgctcgaga cggtcatcgc gtagcagatt gtccacagtt caatttagcc 1740 gacgtagacg agcgatggaa gcttgttgaa ctgaaggggc tgtgcagaac gtgccttaac 1800 aaccatggca aatggccctg cagatcgtgg cgaggctgtg ggattgaagg atgtcgccaa 1860 aagcatcaca cgctccttca cacatctgga agctcagaaa acatgaacat tgtggccaac 1920 catgcatcac ttgatgagtt tcccgcgccg ctcttccgaa ttttgcctgt cgttcttttc 1980 ggaaatcaaa gttcacaggt ggtgttcgca ttcatcgatg aaggctcgtc gtataccctt 2040 ctagaggact ccattgccga caaaattggt gtgtcgggac ccacggaagc gctcaccctt 2100 cagtggacgg gtaacgtaaa gcgaaccgaa cacaagtcca gaatagtcca agctgaaatt 2160 agtgcgaaaa ccgattccac caagtacaaa ctggttcacg cccggaccgt tagtagcttg 2220 ttgcttcctt cgcaaacgtt gaggtaccga gaactcgcac gaaggttccc ccatctgcga 2280 ggcctcccag tagaagaata cgagcttgtt caaccgaagc tgctaattgg gttggataat 2340 ctcaggctaa ccgttccact caagcttcgg gaagggggtc aaaacgaccc aattggcgcc 2400 aaatgccgct tagggtggac tatctacgga tgtgttccag aatcgacagt tatcaaatca 2460 gttgtcaatt tccacgttgc ggcggcatcc aacaacgaca acgagatgaa cgagcaaatg 2520 cgcgactatt tctcgttgga gaatgctggt gttaattctc taaacgtgat tctcgagtcc 2580 gaagaggaga agcgtgcgaa tcgaattctg tcagaaacta cacggcgaac tggagacgga 2640 tcccggttcg aaactgggct gctgtggaag atggataacc cgaactttcc tgacagttac 2700 cccatggcgg tacgacgact cgaagcacta gagcggaggc tccagatgaa cccagtgttg 2760 gctgaaaatg taaaaaatca gattaaggcc tacgagcgta aaggctatgc tcacaaagcc 2820 actcctatcg agttaacttc agtggacgca agccgcgtct ggtatcttcc ccttggagtt 2880 gtaaccagcc caaagaaacc gaacaaagtc aggcttatct gggatgcggc ggccaaggtt 2940 ggtgagactt cattcaactc gaaactactg aagggaccag atcttcttat tccactgccg 3000 aaagttctct gtcagttccg gcagttcccc atcgctgtca gtggagatct gatggaaatg 3060 tttcaccaga taaaaatccg gttccctgac tgccaatccc aacggttcgt gtttcgagag 3120 aaacccactg aataccctca agtctacgtc atggacgtgg ccacgtttgg gtcgacatgt 3180 tctccagcaa cggcccaata cgtgaagaat ctaaacgccc aagaattctc agctgagtat 3240 ccccgtgctg ccgaagctat cgtcaaaagg cactacgtgg acgactactt ggacagcttc 3300 agaaccgtcg acgaggcggt ggaaaccgtc aatgaagtta gaatggtaca tggtaaaggt 3360 gggtttacgc tgcggcagtt tctgtccaac tcgcgtgagt tcctgaatgc catcggagaa 3420 atagcagagg aggaaagtaa aaacctatct cttgaaaggg gagagaagct tgaatcggtt 3480 ttggggatga aatggctgcc tagagaagac gtattcacct attccctagt gctgagagag 3540 gatattcgga aacttctgga acacgatcat acgccaacga agcgtgaggt cgtcaaggtc 3600 gttatgtcat tatttgatcc gctgggactg atttcgttct tcctggtcca tggcaaggtc 3660 ctgatccagg acatctgggc cagaggaacg gactgggatg atgttattcc tgaagatttg 3720 cacgaccggt ggttgcagtg gacgagtcta ttcccgaagt tagatcaact tcgcattcca 3780 agatgttatt tcagatctgc tattcctgaa acccccgact gccttcaaat ccatgttttt 3840 tgtgactcta gcgaagcagc atactccagt gcagcctact tgcgtctgga gataaatgga 3900 gtaatcgaag ttgcactaat tggttcaaaa acgaaggtag cgccgttgaa gacaatatcc 3960 atccctagac tcgaactgaa agccgctgtt cttgggtctc gtctgctgga aacccttcaa 4020 tcttatcata atctacagct ttcacgaaga tttctttgga gcgattcgag tactgtactc 4080 tcctggattc gctcggacca tcgccgatac aataaatttg tcgcctttcg cattggtgaa 4140 attctaacga tgactgatcc cgctgaatgg aggtgggtgc ccactaaatc aaacgtcgct 4200 gaccaagcta cgaagtggaa taatgggccg cagctctcga tggagaatcc ttggtttcgt 4260 ggaccgagtt ttctgtgcgc accagaagtt tcgtggccag tacagtgtcc agttacaacg 4320 acgaaggaag aactgcgctc aaatgtcgtc ctgtatcatg acaagccgcg gttgagtatt 4380 ttgatcaatc tttctcgatt caactcgtgg ttgaaacttc agcgaacggt ggcattcatt 4440 ttccgttatc tgggcaactg ccgccgaaaa atgaaaaagg aaagtctgca actgggagtc 4500 ctcacccaag aggaactaaa acaagcagag gaactgctgt ggaaaatcgc tcagaacgaa 4560 gcctttactg aagaaatatc aattctggta caatcacagg gatcgccgga aaaccgccac 4620 cgtatcgttc acaaatcaag tcacatctac aagttatggc cgttcttaga cgatagaggg 4680 atcttgagga tgcgtggcag aattggcgca gctacttacg caccaattga aaccaaattt 4740 cctgcagtcc tgcctaagaa tcacccaata actttcctca ttgtcgattg gtttcatcgt 4800 cgatataatc acgccaacag agagactatc gtcaatgaga ttaggcagaa gtttgaaatc 4860 ccgaagttga gatctttggt cggaaaggtc tcaaagcact gcgtctggtg tcgcatcgtt 4920 aaagcggctc cgcgagcgcc agccatggcc gcactgccag caatgcgtct aacccctttc 4980 atccgaactt tcaccttcgt tggactcgac tatttcgggc cagtttttgt aaggattggt 5040 agaagcctcg ccaaacggtg ggtcgccgtt ttcacctgcc ttacgattag ggctgtgcat 5100 ttggaagtag tgcactctct caatacggaa tcttgcatca tggctgtgcg tcgtttcgta 5160 gcccgccgtg gtcctccaag ggaattctac accgacaacg ggacgtgttt ccaaagtgcg 5220 agtcgagagt tgaaagagga aatcgagcgt agaaatgaag ccttagcatc gaccttcact 5280 agtgcggaga ctagttggaa attcattccg cctgctgccc cacacatggg cggagtatgg 5340 gagaggctcg tccggtcggt caaagtagcg actagtgcag tcctggacgc tgctcgaaaa 5400 ccagatgatg agactctgga gacggtcctt ctagaaggag aggcgatgat caactgcagg 5460 ccgctcacct ttataccact tgagtcggct gatcaagaag cactgacgcc taatcatttt 5520 ttgctgggca gttcctctgg agtcaaaatt ctacctacag ctccggtaga caacaaggca 5580 gtactaagaa gcagctggaa actggcacaa tctatcacgg aggaattctg gcggcggtgg 5640 ataaaagaat atctaccggt gatcagacgt aggtgcaaat ggttcgaaga aaccaaggac 5700 ctcgcagtag gtgaccttgt aatggtggtc agtggttcgg caagaaacca atggttacgg 5760 ggacgagtag agcaagtatt ccctggcaag gacgggagag tacgccaagc attggttcgc 5820 actgcaacgg gagttctccg aagaccagcc gttaaattag ctgttttaga cgtcgcgagc 5880 tgtagtaaac ctagttcgat gtcttccgga gatccaggaa atcaccaagg tttacgggcg 5940 ggggta 5946 // ID Gypsy-16_DWil-LTR repbase; DNA; INV; 222 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_DWil_; KW Gypsy-16_DWil-I; Gypsy-16_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 8553526 8553747. XX SQ Sequence 222 BP; 76 A; 46 C; 46 G; 54 T; 0 other; tgtagcaggc ttgctcactt aagacggtag tacctctctc tcatgtctca gagagagaac 60 atgctcgtct ctaaatgaga gcgaagtacg agagagcgtc gagtcagtct gggttgaacg 120 agcttgcaaa tcggtcgtgc taacgaggat aaaaatcata gatcacactt acatataata 180 aatacatata tataaacaat tcaaccgaaa cagcctatta ca 222 // ID BLTR_SS repbase; DNA; INV; 276 BP. XX AC X70645; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE S.soubrense B LTR (long). XX KW LTR Retrotransposon; Transposable Element; BLTR; BLTR_SS; KW Interspersed repeat; putative transposable element; KW Repetitive sequence. XX OS Simulium leonense OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Simuliidae; Simulium. XX RN [1] RP 1-276 RA Flook K.P.; RT "BLTR_SS."; RL Direct Submission to Genbank (19-JAN-1993)P.K. Flook, RL Zoologisches Institut der Universitaet Basel, Rheinsprung 9, RL 4051, Basel, SWITZERLAND. XX RN [2] RP 1-276 RA Flook K.P.; RT "Direct submission."; RL Unpublished. XX DR GenBank; X70645; Positions 366 641. XX SQ Sequence 276 BP; 96 A; 42 C; 42 G; 96 T; 0 other; gagctttttg agctttcatc gtgtggtgag gctgagtcag caatttttta atgaaagtta 60 gatggcgcgg tagataaagt tttcaatagg gtaaacgtga gcaagatgaa tcatattatc 120 aaaatattta gtaaacatac aaacaaacat attcaaactc aaatacctga tgaaattatt 180 ttaaaaattt tcaagtactt tttctatcaa tgatccaatt taccccaatt ttaccataga 240 caattgcgct tttcgagctt tttcataaaa actttt 276 // ID SINE1a1_Cis repbase; DNA; INV; 303 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE SINE Non-LTR Retrotransposon from Ciona savignyi. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; L1-99ext_Cis; KW SINE1a1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-303 RA Smit A.F.; RT "SINE1a1_Cis - SINE Non-LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC Ci000003. XX SQ Sequence 303 BP; 68 A; 77 C; 77 G; 80 T; 1 other; ttgtccggcc cgntggtcta gcggttaagt gcgtcgcctt tcaatcgaaa ggttgcaggt 60 gcgaaactgg tcgctagcta gtcggttgtg tccttgggca aggcacttta cggacattgc 120 ctgaacccag cggattaatg ggttctacca aattgaagga acgtgtctat catatacaac 180 acactgcaat agctccggta acccgacgat gggcgcgagg tgatctgacg attgcccgtg 240 tgttaacccc cttggttttc ccattcacgg ggataaacat gaatatccta tcctatccta 300 tcc 303 // ID BEL-13_CQ-I repbase; DNA; INV; 6213 BP. XX AC AAWU01008684; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-13_CQ_; KW BEL-13_CQ-LTR; BEL-13_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6213 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 179-179 (2011). XX DR GenBank; AAWU01008684; Positions 9222 3010. XX CC Positions [5263-5820] - Integrase core CC 'GTTAGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 28..6213 FT /product="BEL-13_CQ-I_1p" FT /translation="MDCMLCNFPNTDRMVSCCVCGSWMHYECVGVNDSIAA FT HSRSFYCPPCQRPTPRPQPPVEEPGKKHDGKAKSSKAGSKAGSKAGSKASS FT VQSGVGARTRLAQLKIQKLEEQKKLALQRLELESKQRDAEMAAKKQVQEAE FT LAALKLQVETEALEETFRLLEELEIGDDDDDGRSVASEQSSRSKVTRWQQN FT QEVMSSTLMRPTKTSAGTGQRAGNPTTSGTVAGQVGQPAGGNRTVLDRALD FT GISLNDTVAQSTLGGFIGRVSETVVSKHATQKPFAPGTHNAESSPSPIKSS FT PTPANGKPQTPTSLIQSLLSRQPVLNSEVSAGSGQGNGQQMNPGGQSQTQS FT GPQQSAGGTTQTHTGTGGGQTTGRRPQSFTKLPGFQEPPRADANQVDRNED FT TPDYYGPTPQQLAARQVISKELPVFTGNPEDWPLFISAYVNTTNACGFSDA FT ENLARLQKYVRGYAYERVQGRLLHPAGVPHAMATLEMLFGRPEMLIHSLLQ FT KVHDAPAPKPDRLETYIDFGLAVQLLCDHLEAGGHEAHLNNPMLLFELVEK FT LPANAKLDWSLYKERCSQVNLTTFACYMSALVKAASDVTLSYGSKQQQQQA FT RVAKLDKSSKDKNFCGAHSSEDVSRTPVKEVTEKKAAPACFVCKDPKHRVK FT DCAEFGKKTLEERRRIVDEYRLCGNCLGVHGKKPCRSSDKCDIDGCQQQHH FT ALLHYKQKSGGTESKRAEQKREKNEPKVSGANAITNHHYAGKTALFRIIPV FT ELHGNNRTVSAFAFLDDGSSRTMVDEEIAKELGVEGEVLPLCLQWTANVKR FT TESESQRIALEISGSEDEARFALNDVRTVKKLDLPRQSLRYAELAKTFPHL FT QGLPVKDYADAAPRILIGNDNAHVTAMLKVREGQPGEPIAARSRLGWTVYG FT KQDHTAGRVHSFHVCECQDDQSLHDLVKEFFSVESLGVEATAGPESDEVQR FT ANKILQETTKRVGQRFETGLLWKYDCFEFPDSYPMAKRRLQCLERRIQKDP FT VIGESVVRQLSEYQEKGYIHKATPAELEEADPRRTWYLPLGVALNPKKPSK FT VRIFCDAAAKVDGVSLNTMLLKGPDLLKTLLHVLFGFREKRVALCADIKEM FT FHQVKIRKEDRHAQRLLWRDDSTKEPEVYLMDVATFGATCSPCSTQFVKNR FT NAEEHSHDYPEAAEAIIRKHYVDDYLDSADSVEEATKLALEVKHVHSLGGF FT DLRNWLSNSKEVLARVGENDPALEKCMQLDKNDPMERVLGMYWKPDEDVFT FT FTMTLTFDADLPTKRQALRIVMSLFDPAGLLCFFIIHGKILIQEVWRAKTE FT WDQKIPDKLKERWMRWIAMFKHLSEVRIPRCYFSKYSIADIVSLQLHVFCD FT ASEEAYSCVAYFRAVYFDGTIQVALIGGKAKVAPLKALSIPRLELNGAVIG FT VRLRKSITNGHSLKIDKTVMWTDSRTVLAWINSDHRNYRQFVACRVGEILS FT KSDAAEWRHCPGKLNPADLATKWGPRGPSFSPDYPWYGRFFILSQEIEWPM FT DVNPTEETTEEELRACLVHSEAVTKLTIDWQRFSDWTRLLRAVAYALRYKQ FT NFVENYRKQPLTKGLLTQEELLVAETTIFRTVQSEAYPDEVATLSVARGKR FT TPRRQVELERTSLIRKLSPFMDEAGVIRSDSRISEATFASHDTRFPIILPK FT GHQVTNVLLFWYHRKFLHSNGETVVNEVRQRFFVSMLRTEVRKIPKKCPLC FT KLKRAMEGKTLAIPRMAPLPAARLQAFERPFSCTGIDYFGPIAVRVNRSTP FT KRWIALFTCLTTRAVHLEVVHTLTTESCKQAIRRFIGRRGAPAEIRTDRGT FT NFVGANNELRLEMNKLDGQLAETFTNTNTRWVFNPPAAPHMGGAWERLVRS FT VKTALAAMYTSRVPNEETLATLLVEAESIVNSRPLTFIPLQSEQQEALTPN FT HFLLLSSRGVVQPPKTSIEPRLACRGDWQLCQAMLDQFWRRWIREYLPTIA FT RRTKWLEEAMPIEVGDLVIVVEEKIRNGWIRGRVVEVCPGRDGRVRDAVVQ FT TADGFLRRPVAKLARLDLNGCKTEPEVSDQLNGSGN" XX SQ Sequence 6213 BP; 1516 A; 1570 C; 1991 G; 1136 T; 0 other; aatctaaaag atcgtccgat aggaaggatg gactgcatgc tgtgcaattt tccaaacacg 60 gaccggatgg tgagctgctg tgtctgcggg tcgtggatgc actacgagtg cgtcggggtc 120 aacgatagca tcgcggcgca tagtcggtcg ttctactgcc ctccatgcca gaggcccaca 180 ccaagacccc agccaccagt ggaagaaccc ggcaagaaac acgatggcaa ggccaagagt 240 tccaaggcgg ggtcgaaagc gggatcgaag gcgggttcca aggcaagttc ggtgcagtcc 300 ggcgtcggtg cgcgaaccag attagctcag ctgaagatcc aaaagctgga ggagcagaaa 360 aagctagcgc tacaacgcct ggagctggag agcaagcagc gagacgcgga gatggctgcc 420 aagaagcagg tgcaggaagc ggagctagcg gcgttgaagc tacaggtcga aacggaggcg 480 cttgaagaga cgttccgact gctcgaggag ttggaaatcg gagatgacga cgatgatggc 540 cgaagcgtgg cttcggaaca aagttcgcgc agcaaggtga cgaggtggca gcagaatcag 600 gaggtaatga gctcgacgct gatgagacca acgaagactt cagctggaac tggtcaacga 660 gcggggaacc cgacgacatc tggaacagtc gctggtcagg tgggacaacc ggccggtgga 720 aaccgaaccg ttttagatag ggcgttagac gggatttccc tgaacgacac agtagcgcag 780 tctacgttag gaggttttat tgggagagtc tcagagactg tagtttcaaa gcatgccact 840 caaaagccat ttgcaccggg cacacataac gcggaatctt ctccaagtcc aattaagtct 900 tctccgactc ccgcgaacgg gaaaccccaa acccccacta gtctgatcca gagtcttctg 960 tcaagacaac cagtgttgaa ttcggaagtg agtgcgggta gtggtcaagg gaacggtcag 1020 cagatgaatc ctggagggca atcgcaaact caatcgggtc cgcagcagtc agcaggagga 1080 actacgcaga cgcacactgg gacgggagga gggcagacaa ctgggaggcg accgcagtcg 1140 tttacgaagc ttccgggttt tcaagaaccg ccacgggcag atgcaaacca ggtggaccgc 1200 aacgaggaca caccggacta ctacggacca actccgcaac agctcgcagc caggcaggtg 1260 atctcgaagg agctgcccgt gttcacggga aatccggagg actggccgct gttcataagc 1320 gcatatgtca acacgacgaa cgcgtgcggg ttttcggatg cagagaattt ggccaggttg 1380 cagaagtatg ttcgaggtta cgcgtacgaa agggtgcaag gacgcctgct tcacccggcg 1440 ggagtgccgc acgctatggc aacgttggag atgctttttg gaaggccgga gatgctgatt 1500 cactcgttgc tgcagaaggt gcacgatgcg ccagcgccaa agccggaccg gctcgagacc 1560 tacatcgatt tcgggttggc ggttcagctg ctctgcgatc atctggaagc tggaggacac 1620 gaggcacacc tgaacaaccc gatgctgctg ttcgagctgg tggagaagct gcctgcgaac 1680 gcgaagctgg attggtcgct gtacaaggag cgttgcagcc aggtgaacct gacgacgttt 1740 gcgtgctaca tgtcagcgtt ggtgaaggca gcctcggacg taacgcttag ctatgggtcc 1800 aagcagcaac aacagcaggc gcgggttgcg aaactggaca agagcagcaa ggacaagaac 1860 ttctgcggcg cgcactcctc cgaagacgtg tcacggacgc cagtcaagga ggttacggag 1920 aagaaagcgg cgccggcgtg cttcgtctgc aaggacccca aacaccgcgt gaaggattgc 1980 gccgagttcg ggaagaagac gctggaagaa cgtcggagga tagtcgacga atacagactg 2040 tgtggcaatt gtctaggtgt tcacgggaag aagccgtgtc gaagcagcga caaatgcgac 2100 attgacgggt gccaacagca gcatcatgct ctgctgcact ataagcagaa gtcgggaggc 2160 acggagtcga agcgcgctga gcagaagcgg gagaagaacg aaccgaaggt gtcaggagcg 2220 aacgcgatta ccaaccacca ctacgccggg aagacggcgc tgttccgaat catcccagta 2280 gagctgcacg ggaacaaccg tacggtgtca gctttcgcat tcttggatga cggatcgtcg 2340 aggacgatgg tggacgagga gatcgccaag gagctcggcg tcgaagggga ggtgcttccg 2400 ctgtgtttgc agtggacggc gaacgtgaag cgcaccgaat cggagtctca gcggatcgcg 2460 ttggagattt ccggtagcga agacgaggcc aggtttgcgc ttaacgacgt gcggacggtg 2520 aagaagctgg atttgccacg acagtcgttg cggtacgcag agctggcaaa gacgttccca 2580 catctgcaag gcctgccggt gaaggactac gccgacgctg ctccacgcat tctcattggg 2640 aatgacaacg ctcacgtcac ggcgatgctg aaggtccgag aaggacagcc tggagaaccg 2700 attgcggcgc gatcccgact cgggtggaca gtgtacggca agcaggacca tactgctgga 2760 cgcgtacaca gttttcacgt gtgcgaatgc caggacgacc aatcgctgca cgatctggtg 2820 aaggagttct tctcggttga aagtttgggt gtggaagcga cagcaggtcc ggaatctgat 2880 gaagtccaga gagcgaacaa gatcctgcaa gaaacgacga agcgggtcgg ccagcgattc 2940 gaaacggggc tgctgtggaa gtacgactgc tttgagtttc ccgacagcta tcctatggcg 3000 aagcgtcgac tgcagtgctt ggaacgacga atccagaagg atccggtgat tggcgagagt 3060 gttgtgaggc agctgtccga gtaccaggaa aaggggtaca tccacaaggc caccccggca 3120 gaactggagg aagctgaccc tcggcgcaca tggtacctac cgctaggtgt cgcgctgaac 3180 ccgaagaaac cgtcgaaggt ccgcattttt tgcgatgctg ccgccaaagt ggacggagtc 3240 tcgctgaaca cgatgctgct caaaggcccg gatcttttga agacccttct gcacgtgctg 3300 ttcggtttcc gggagaagcg agtggcactt tgcgccgata tcaaggagat gtttcaccag 3360 gtcaagattc ggaaggaaga ccgacacgct caacgtttgc tgtggcgtga cgactcaacg 3420 aaggaaccag aagtctactt gatggacgtc gcgacgttcg gggccacatg ctcgccatgc 3480 tcgacccaat tcgtcaagaa caggaacgca gaggagcact cgcacgacta ccccgaagcg 3540 gcagaagcca tcatccgaaa gcactacgtg gacgactatc tggacagtgc cgattcggtg 3600 gaagaggcga cgaagctggc gttggaagtg aagcacgtac actctctagg cggcttcgat 3660 ctacggaact ggctgtccaa ctccaaggag gttctcgcac gggtcgggga gaacgatcct 3720 gctctcgaga agtgcatgca gttggacaag aacgacccga tggagcgagt gctgggaatg 3780 tactggaagc cggacgaaga cgtcttcacc ttcacgatga cgctaacgtt cgatgccgat 3840 cttcctacga aacggcaagc gctgcggatc gtaatgtcgc tctttgaccc ggctggcctg 3900 ctttgcttct tcatcatcca cggcaaaatc ctgatccaag aggtgtggcg agcgaagacc 3960 gagtgggacc agaagattcc ggacaagctg aaggagcgct ggatgcggtg gattgctatg 4020 ttcaagcatc tgagcgaggt gcgcattccc cgctgctact tctcaaagta ctcgatcgct 4080 gacattgtgt cgctccagct gcacgttttc tgtgacgcca gcgaggaagc ctactcgtgc 4140 gtagcgtact ttcgcgcggt gtactttgac gggacgattc aagtagcgct gatcggggga 4200 aaagctaaag tggctccact caaggcattg tctatacccc gattggaact gaacggtgcg 4260 gttatcggag tacggttgcg gaagtcgatc acgaatggcc acagcctgaa gattgacaag 4320 accgtgatgt ggaccgactc gagaacggtg ctggcctgga tcaactcgga ccatcggaac 4380 taccggcagt tcgtggcgtg tcgggtaggt gagatcctgt cgaagtcgga cgcagcggag 4440 tggcgccact gtcctggaaa actgaatcca gcggacctgg ccacgaagtg gggcccgagg 4500 ggtccaagtt tctcgccgga ttacccgtgg tacggacgtt tcttcatcct gtcgcaggaa 4560 atcgagtggc ccatggacgt gaatcccaca gaggagacga cggaggaaga gttgcgtgcc 4620 tgtctggtcc acagcgaagc ggtgactaag ctgacaatcg actggcaacg cttctctgac 4680 tggacgcggc tgctgcgggc ggtggcgtac gctctccgtt acaagcagaa cttcgtagag 4740 aactaccgga agcaaccgct aacgaaggga ctgctaacgc aggaggaact gttggtggcg 4800 gagacgacga tcttccggac agtgcagagt gaagcgtatc cagacgaggt ggccaccttg 4860 tccgtagcgc gcggaaagcg gacgccgaga cgtcaggtgg agctggagcg aaccagtctg 4920 atccggaagc tgtcgccgtt catggacgaa gcgggagtga ttcggtcgga ttctcggatt 4980 tccgaagcta cgtttgcgtc gcacgacact cggtttccga ttatcctacc gaaaggccat 5040 caggtgacga acgtgctgct gttttggtac catcggaagt ttcttcactc caacggcgaa 5100 acggtcgtga acgaagtgag acagcgattt ttcgtatcca tgctgcgtac ggaggtgcga 5160 aaaattccaa agaagtgtcc actgtgcaag ctgaagcgag cgatggaagg gaaaactttg 5220 gctatcccac gcatggcccc actccctgca gcgaggctgc aggccttcga gcggccgttc 5280 tcctgcactg gtatagatta cttcggaccg atcgccgtac gagtaaatcg cagcacgcca 5340 aagaggtgga tcgcattgtt cacgtgcctc actacgcgtg ccgtgcatct cgaggtggtg 5400 cacacgctga caaccgagtc ctgcaagcaa gcaattcggc ggttcattgg acgcaggggc 5460 gcgccggcgg agatacggac cgaccgagga acgaacttcg taggagcgaa caacgagctg 5520 cgactggaga tgaacaagct ggacggtcaa ctggcggaaa cgttcaccaa caccaacacg 5580 cgatgggttt tcaacccgcc agctgcgccc cacatgggag gcgcttggga gcggctagtg 5640 aggtcagtga agacggcgtt ggcagcgatg tacacctcca gggtaccaaa cgaggagact 5700 ttggcgactc tgctggtgga agcagagagc atcgtcaact caaggccgct tacgttcatc 5760 ccattgcagt cggaacagca agaggctttg acgccaaacc actttctact actgagctca 5820 cgcggagtgg ttcagccccc gaagacgtcg atagaaccaa ggctggcatg cagaggagat 5880 tggcagcttt gccaagcgat gctggatcag ttctggcggc gttggatccg cgagtatctc 5940 ccgacgattg ctcgcagaac gaagtggctg gaggaggcaa tgccgatcga ggtaggcgat 6000 ttggtaatag tggtggagga gaagatacgg aatggctgga taagaggacg ggtcgtggag 6060 gtctgcccag gacgagatgg gagagtacga gatgcggtgg tgcagacagc ggacggtttc 6120 ctgcgacgac cagtggcaaa gttggcacgc ttggacttga atgggtgtaa gactgaaccg 6180 gaagtttcgg accagcttaa cgggtcgggg aac 6213 // ID Gypsy-81_CQ-I repbase; DNA; INV; 7372 BP. XX AC AAWU01003442; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-81_CQ_; KW Gypsy-81_CQ-LTR; Gypsy-81_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 541-541 (2011). XX DR Genome; AAWU01003442; Positions 4489 11860. XX CC Positions [3366-3887] - Reverse transcriptase CC Positions [4851-5327] - Integrase core CC 'CCAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 571..2559 FT /product="Gypsy-81_CQ-I_1p" FT /translation="MEVKHMLSQYYGMNASHLTIDEFSHELHIRKLPLDGA FT RSQLEKVLYNHLKQERKQLNVSYEFVSDSVDEELGVCEDKLDFIKNHLETS FT RAAKAPGQVYKTRLIHMLFRMERLKKHITENADLDRLAVVAVEIVRLLSVY FT YSIVSPIPEVRAAEWELINQSITVAKKQLEENTPGDENGSGGDGTEKSDQT FT GALVERRESDKRTAGDDGEEKQKLVDQNQELRLAVSALLQRIETLELKQPE FT KDPATAANSTQIASNENGKQSGETGKAEPDFFIWLKEKVGSLDNSRDSEQN FT NKEKPKSKPEQSSGNEKSKDNRNRLPVHKWTVRYDGMDNGRRLNEFLKEVE FT FNARSEGFSEAELFQCAHHLFTHKARSWFMEVNGNNELGTWRRMVKELKNE FT FLPIDIDYVYERQVYNRKQGAREKFHDFYLDMARIFRNMSQQWDEKRKFDS FT LFRNTREDCQIAMLAANIQDIAAMKEFGKKFDSINWQLYKRKESRFTPRTA FT HVEEVGELRRPYGGNGNGNRFQNNNQDGRGYYQQQKPQVKGFQKGNWNQRP FT PKQEQQQYQQRNQQYHQQPKNREQSQREEKAKPKQDNQQRSKSPVQGPSGT FT DPLQRIVRAYIPIKRGVCYNCHEEGHGFSECPKNRNVFCERCGFPGFDTRT FT CPFCESKNLQRTAQ" FT CDS 2874..5699 FT /product="Gypsy-81_CQ-I_2p" FT /translation="MNVVGCADVPITFNEETKILPVLIAPDLKRRCILGYD FT FWQKFGIRPSVHQSIETLDEGDDQILEEERLDEHQIQQLEEVKKLFLTASP FT GNLGKTGLIEHKIELKEEFQGADPVRKNPYPWSPEIQRRIHRAVDLLLRDD FT IIEPSSSEWALPVVPVKKRDSDEVRLCLDARKLNERTKRDAYPLPHQNRIL FT SHLGPFKYLSTIDLTQAFLQVPLDQESRKYTAFSVPGMGLFQFKRLPFGLI FT NSPATLSKLMDKVLGQGALEPSIFVYLDDIVVVSQDYDDHIAKLRDLARRL FT KDANLSINVEKSSFCCHELPYLGYILSRNGLRPNPDRVRAILGYEVPNSVR FT SLRRFLGMINYYRRFIENFSELTAPLTDLLKNKPKRVQWNTAANEAFQAIK FT ERLISAPVMANPDFNLPFTVQTDASDTALAGVLTQVQGGEERVIAYHSEKL FT KGAEQNYHAAEKEGLAALRCIEKFRCYIEGTHFKLVTDSSALTFIMRAKWK FT SSSRLSRWSIILQQYDMEVIHRKGKDNVVPDALSRSIEVLDTEVEDSWYKR FT LYASVERDPEKYMDFKIENKKLYKLVSARSDVLDYRFEWKECVPESLRPQV FT MRQEHDGKMHIGYEKCVDTLKRRYYWPRMSADLKSYIAKCEACKMNKHSTT FT ASVPEMGKQRVATRPFQIVCMDYIQSLPRSKHGNAHLLVIMDIFSKYCLLI FT PVKKIAAGNLCTNLEKEWFRRFAVPEYVITDNATTFMSKEFKELLDKYGVQ FT HWANARHHSQANPTERLNRTINSMIRTYVRQDQKLWDTRISEIEYVLNNTV FT HSSTKFTPHRIIFGHEIVSKGDEHRLEQDEEFSEEERMNKLRGVTKKVWET FT VRENLKKAHESTKKQYDLRHKRYSPTFDIGQRVFKRSFKQSSAGDNFNAKL FT GPPYTPCIITAKKGTSSYAVSDMNGKPLGVFSAADLKA" XX SQ Sequence 7372 BP; 2226 A; 1460 C; 1836 G; 1850 T; 0 other; ccagttgtta tacttggcgc ccaacaagaa tagcttagct tattcaggct cgggaaaacg 60 atgcattaat gagaggtcga cagattcttt ggtaaaactt ccgcaactat actagctcaa 120 cttgcaaaaa actttacgct actttgagtg gctaaattta actaagatca tttagtattt 180 gaggggtctt gaagatgttt taagggagga agactaccaa cactggaaag acacaaattg 240 aacagaaaaa gtcggagaat tgttcatact tactcaactt gttacttttg gaaaaatccg 300 gagttcttta ctcatgaagt gcagtccttg gcaggacaaa taaaacacca gacggaaaca 360 gtcgtttatt ttcgttaaaa ttttgtaccc tgtacaattt gcttgatttt cgaggatttt 420 ctaattgatt acttctgtga tacttaggca ctatttttca attttcactg ttgcttgatt 480 gcttctagga agcgtgaata aattttaggt tttccaattt ttagtatttc tttttaaatt 540 ttaattgttg tgtttctaaa cgataccata atggaggtga aacatatgct gtcgcagtac 600 tatgggatga atgcatcgca cctgacgatt gatgagttca gccatgagtt gcacatccgc 660 aagttgcctc tggatggagc acgcagtcag ctggagaagg tgttgtacaa tcatctgaaa 720 caggaaagaa agcagcttaa tgtgtcgtat gagtttgtga gtgattcggt agacgaagaa 780 ctaggtgtgt gtgaggacaa actagatttt atcaaaaacc acttggaaac cagccgtgcg 840 gcgaaagcac ctggtcaagt gtacaaaacg agattgattc acatgttatt taggatggaa 900 cgattgaaga aacatatcac cgaaaacgca gacttggaca ggttggctgt ggtggctgtg 960 gagatagtta gactgttgag cgtgtactac tccattgttt ctccaatacc ggaagtaaga 1020 gcagctgagt gggagttgat taatcagagc attacggtgg cgaagaaaca attggaggaa 1080 aatacaccag gagacgagaa tgggtcagga ggcgatggta ctgagaagag tgaccaaaca 1140 ggagcacttg ttgagaggcg tgaaagcgat aagagaactg ctggcgacga tggtgaggaa 1200 aagcaaaaat tggtcgatca gaaccaggag ttacgactag ctgtgagtgc tttactccag 1260 cgaatcgaaa ctcttgagtt gaaacaaccc gaaaaagatc ccgccacggc cgccaacagc 1320 actcaaatcg cgagtaacga aaacggcaag caatcaggag aaactggaaa agcagaaccc 1380 gatttcttca tatggttgaa ggaaaaggtt ggctcgttgg ataactctcg agactcggaa 1440 caaaataaca aggaaaaacc gaagtccaaa ccagaacagt cgtctgggaa tgaaaagtcg 1500 aaagataacc gaaaccgttt accggttcat aaatggacag tacgatacga cggaatggac 1560 aacggtcgca gactgaatga gttcctgaaa gaggttgaat tcaacgctcg ttcagaagga 1620 ttttccgagg cggagctttt ccaatgtgcg caccacctgt tcactcacaa agccagatcg 1680 tggttcatgg aggtgaacgg gaacaatgaa cttggaacgt ggagacgcat ggttaaggag 1740 ttaaagaatg agttcttacc cattgatatc gactacgtgt acgagcggca ggtgtacaat 1800 cggaagcagg gtgctcggga gaaatttcac gatttctacc tggacatggc cagaattttc 1860 cgcaacatgt ctcagcagtg ggacgaaaaa cggaaattcg actcgctttt ccgcaacacg 1920 cgcgaagatt gccagatcgc tatgctcgcc gcaaatatcc aggacatagc tgcaatgaaa 1980 gagtttggaa agaagttcga ctcgattaat tggcagttgt acaaacggaa ggagagtagg 2040 ttcacacctc gtaccgcaca cgttgaagaa gtaggagaac tacgtagacc gtacggagga 2100 aatggaaacg gaaataggtt ccagaacaac aaccaagatg gtagaggtta ctatcagcag 2160 cagaaaccgc aggtgaaagg gtttcagaag gggaattgga atcaacgtcc acccaagcag 2220 gaacaacaac agtatcagca gcgtaaccaa cagtatcatc agcagccgaa gaatcgtgag 2280 cagagtcaaa gagaggagaa agctaaaccg aaacaggaca accagcaacg gtcgaagagt 2340 ccagttcaag gtccaagcgg aactgaccca cttcaacgga tcgtgagagc gtacattccg 2400 atcaagagag gagtgtgcta caattgccat gaagaaggtc acgggttcag cgaatgtccg 2460 aagaatcgga acgtcttctg tgaacgttgt gggttccctg ggtttgatac gcgaacgtgc 2520 ccgttctgcg agtcaaaaaa cttgcagcgg actgctcagt gaggcgagac agtcgcggtg 2580 ctaactacac aagacctcac gatggatgcg aagcggtact cagtgaactt ggctattcga 2640 gagtgagtgg agaagttacg aaggatgatg agattgcttc tttactcgtc acccttagta 2700 acgaccccag gccattcgcg agagttgagt tcctgggtat taccgtcgta ggattgttag 2760 atagcggagc agcaagaact gttcttggca aaggaggaga aaaattgatc cattcattag 2820 gactgaaggt gagagagtca gcagtgtcat tgaagacagc tgcgggacag cagatgaacg 2880 tcgtaggttg tgcagatgta ccaattacat tcaatgaaga gacgaagatc ttacctgttt 2940 tgattgctcc agatctaaaa cgtcgttgta tcctaggata cgatttttgg cagaagtttg 3000 gaatccgtcc gtcggttcac cagagtatcg agacgctcga cgagggtgac gaccagattc 3060 tagaagagga gagattagat gagcatcaaa tacagcagtt ggaggaggtc aaaaagttgt 3120 ttcttacagc gagtcctgga aatctcggca aaacggggct cattgagcac aaaatcgagc 3180 tgaaggagga gtttcaagga gcggatccag tcaggaaaaa tccgtacccc tggagtccgg 3240 agatccagcg tagaatacac cgtgccgtgg acttgttgct gagagatgac atcattgaac 3300 cttcttcttc tgaatgggct ttacctgttg tacctgtgaa gaagcgagac agtgatgaag 3360 tgcgactttg cctcgacgcg aggaagttga acgaacgtac gaagagagac gcatacccac 3420 tcccacacca aaaccgaatt ttaagtcatt taggaccttt taaatatctg tccacaatag 3480 atttgaccca agcttttcta caagtgccct tggaccagga gtcacggaaa tacaccgcgt 3540 tttctgtgcc aggtatgggt ttatttcagt ttaaacgttt accgtttggc ctgattaaca 3600 gcccggcgac tttgagtaag cttatggaca aagtattggg acaaggtgca ctagaaccgt 3660 caatctttgt gtacctggac gatattgtgg tcgttagtca ggactatgac gatcatatag 3720 caaaactacg agaccttgcg agacgcttaa aagatgcaaa tttatcaatc aacgttgaga 3780 agtctagttt ttgctgccat gaattacctt acttgggtta tattctgtcc cgtaacggac 3840 tgagacctaa tccggatcga gttagagcga ttcttggata tgaggtcccg aactcagtga 3900 gatctctgag acgattcctc ggtatgatta attactaccg caggttcatc gagaacttca 3960 gtgaactcac tgcaccactc acggatttac ttaaaaataa accgaagaga gtgcagtgga 4020 atacggcagc gaacgaagct tttcaggcca tcaaggagag gcttatatct gcgcctgtga 4080 tggccaaccc cgattttaat cttccattta ccgtccaaac ggacgcaagc gatacagctc 4140 tcgcaggggt actcacgcaa gtgcaaggtg gagaggaacg ggtaattgcg taccattcgg 4200 agaagttgaa gggggcagag cagaactatc acgcagctga gaaggaagga cttgccgctc 4260 tgcggtgtat cgagaagttt cggtgctaca tcgaaggcac tcacttcaag ctcgtgacag 4320 attcttcggc tctcaccttc atcatgagag cgaaatggaa gtcttcatcg cgactcagca 4380 ggtggagcat catcctacaa cagtatgaca tggaggtcat tcataggaaa ggaaaggaca 4440 acgtggtacc tgacgctctg tctcggtcca ttgaagtgtt ggacacggag gtcgaggaca 4500 gttggtacaa acggttgtac gcatctgtgg aaagagaccc ggaaaagtac atggatttta 4560 agatagaaaa caagaagctg tacaagctgg tgtcagctcg ctccgacgtc ctcgactaca 4620 ggttcgagtg gaaggagtgc gttccggagt cgttacgtcc acaggtcatg agacaagagc 4680 acgatggcaa gatgcacatc ggctatgaga agtgtgtcga caccctgaaa cgtcgatact 4740 actggccgcg aatgagtgcc gatctcaaat cttacattgc taagtgtgag gcgtgtaaga 4800 tgaacaagca ctcgacgacc gcttccgtac cagaaatggg aaagcaacgc gtagcgacca 4860 gacctttcca gattgtgtgt atggattaca ttcagtcact tccccggagt aagcatggca 4920 atgcacactt gttagtcata atggacatat tttctaagta ctgcctgctc attccggtta 4980 agaagattgc tgctggtaat ttgtgcacca atctggaaaa ggaatggttc cgtcggtttg 5040 cggttcctga gtacgtcatc acggacaacg cgacaacgtt tatgtccaag gagtttaagg 5100 agttgttgga caagtatgga gttcagcact gggcgaacgc tcgacaccac agccaggcta 5160 acccgacgga gaggttgaat cgtacgatca attcaatgat tcgtacctac gtccgtcaag 5220 accagaagct gtgggatacc aggatatcag aaatcgagta cgtcttgaac aacaccgtcc 5280 actcatcgac aaagttcacg cctcatcgga tcatattcgg tcacgagatc gtgtccaagg 5340 gcgatgaaca tcgcttagag caggacgagg agttcagtga ggaggaacgg atgaacaaac 5400 tccgcggagt gaccaagaag gtgtgggaaa cagtgcgtga gaacctcaaa aaggctcacg 5460 agagtaccaa gaagcagtac gatttacgac acaaacggta ctcacctaca ttcgacatcg 5520 gtcagcgggt cttcaaacgt tcattcaagc agtcatcggc gggggacaac ttcaacgcga 5580 agttgggtcc accgtacacg ccgtgcatca ttacagcgaa gaaagggacc agctcgtatg 5640 cagtctctga tatgaatggg aagccgctag gtgtcttctc agctgccgat ttgaaggcat 5700 gaatggaagt agtcagatga gcagccagca tgacgattct gcgatgaacg aggagctttt 5760 ccggcaacag catttggaaa tccgccgttc ggcgattaca gcgtcacagg gcagcataat 5820 ttggtgcttc attcatggaa ttgcgcagca tagtttgatt cattcatggt aaacctaatt 5880 ggttgtttta gaaaatttta atttgtttgt attgttgatt ttgggaaagc cccaaccaca 5940 tgtggaaaat ttgttttgga aaataggagt tggagttaat tgtgtaggag tatcttcgat 6000 acacatgcaa cagacgcagt ccttaacctt gaatgtgttg ttattacttc attgaacgat 6060 acacaatggg aggggatgtt ggcttatgca tggtgtgtgt gcaaggtagt tgtcgatcga 6120 tcttgatcaa cccatagtgc cataatttaa actttatttt catagaacgt tttcacatca 6180 cagctgcaat agtatttcaa tttacttacc tttattttcg catttttcct cataaaattt 6240 cagcatcaac aaaattttga cccaagataa ctttggaatt agtgatctac cgaggtaaat 6300 cttattactg aagtgatcaa agggccctta atcatcgatc gattgatttt aaaacgataa 6360 ttttacgccc ggtaagatca aaattcactt aattttctgg aatcacaaat tttgttgctg 6420 cgaaccacgt ttgaccactg cacatttgat tagatcttga cagatcgatc ttgacagcac 6480 gcacatgagt aaagaacgat aacgttatca gtaaagagaa gataaaaaga gaagatgaac 6540 tccgacacgc gaaacgtttc aactgatcgt tacgggtata ttgattaccg taactgcgcg 6600 cgcgtttgaa aatatttttt tatccggatt tttaatttat tttattcacg tattgtgatg 6660 tatttttgaa tttgttttga attaattgca tgcctaaacg aaaaattata aaatcaactt 6720 tgctgtacga atgggatcgc tggagaccgg agttgaatta aacgttgatg gtcagtagaa 6780 ggttccgacc ttgaaacaga gaggagttga cttcggtcga cgatgaaaac ttatgacccc 6840 gacccattca agcttagtta ccaaaaacaa tttagtagat tttgcatgcg agtagaaaga 6900 aaatttcccg ggaaacaaaa aataaataaa tgtaatgtaa atattgtaaa taattttaac 6960 ttgtaaatat tttgatttgt gtctcaccag cattggaagt agtacagtgt gattgtatcg 7020 ctggaggcta gagttgttaa aacgttgaag gccagtacaa gcaattgctt cgaaacagca 7080 cggagttggt ttagtccgac gatgaagtct tatgaccccg atcaagcccg cactgtatga 7140 aatgtttcaa acccttggct catcaacatg acttagtaga atccaaaatg atttttataa 7200 ataaagcaac ctaaagcaag taaaaaatga aaaggtgttt gctatttggt ctagttccag 7260 ttttgggttc agagagacgt ggcagtttat tatatgcagg gtgtaaaaaa aacttgtaaa 7320 ctcttcggag tttacaattt ttttaacccg agccggggga gagtgtaacc ag 7372 // ID Crypton-1_TC repbase; DNA; INV; 1729 BP. XX AC . XX DT 17-FEB-2009 (Rel. 14.02, Created) DT 17-FEB-2009 (Rel. 14.02, Last updated, Version 2) XX DE Putative Crypton-type transposon. XX KW Crypton; DNA transposon; Transposable Element; Nonautonomous; KW Crypton-1_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1729 RA Jurka J.; RT "First Cryptons from insects."; RL Repbase Reports 9(2), 468-468 (2009). XX DR [1] (Consensus) XX CC This element may represent a Crypton-like DNA transposon, first CC to date identified in insects. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS join(325..525,626..1450) FT /product="Crypton-1_TC_1p" FT /translation="MSHVPEAIRNEARSVVSGLLPKKSSARYEHQYLDFKN FT WQKKQNRRECKQDEECVIFIGILKSTVDEILKEITVVLFRFYKVIAFLKRK FT NERYTPKKAKILTKEQVETFLLEAPDDQWLLFKVVTIFGIFGACRCDELLS FT LTPNDVEDTGKYIIVTLRNTKNFTTRRFTITDEECRFQPCVLYRKYASLRP FT TQAESLRLFLTYRGGKCISLNAGQHTIGGIPKKIASYLKLSEPELYTGHSF FT RRSAATMVVDSGGDILALKRAGGWKSSAVAEGYVEDSIKQKLERAKKLFLV FT PEPTSASSSSKHASTVSEETSEKFNVSGNQNCNITINVYSREINPVNNE*" XX SQ Sequence 1729 BP; 589 A; 272 C; 347 G; 520 T; 1 other; gtaatagtat tttacagtat tagcaaggta aaacggtttt tactggcgaa aggagggaat 60 gactggcgag ccgaaggcga gcctgtcacc tcctggagcc agtaaaaacg gttaccttgc 120 gtatactgta caatacttta tttgtttctc acttgtaaac aaaacgaaag ttattaataa 180 aatttgaata aaaaatgtgg ctattttcga aaatttatca aaatgctgta gaaaattact 240 gtggaaacga catgacaaca acgttattta ttaaacaatt tctttgcaat tttgcaatta 300 ttacaccttt aacagattaa taaaatgtct cacgtaccag aagcaattag aaacgaagcg 360 agaagtgtag tttcgggttt attacccaaa aaatcttcag caaggtacga gcatcagtat 420 ttagatttta aaaattggca raagaagcag aatcgtaggg agtgtaaaca agacgaggaa 480 tgtgtgattt ttattggcat acttaaatca actgtcgatg aaatatagcc ccaattcttt 540 atggtccaag tggtcaatgt taaaaagctg cttagaaatt aaagaaaata ttggaactga 600 caggtttgct gtttactgta aataactgaa agaaataact gttgttttat ttagatttta 660 taaagtaatt gcttttttaa aaagaaaaaa tgagcgttac acgccaaaaa aagccaagat 720 cctaactaaa gagcaagtcg agacttttct gctggaagct cctgacgacc agtggttact 780 ctttaaagtt gtgacaattt ttggaatttt cggagcttgt cggtgtgatg agcttttgtc 840 tctcacacca aacgatgtag aagacacagg gaaatacata attgtcactt tgcgaaatac 900 aaaaaatttt acaacaagga ggtttacaat aacggatgag gagtgcagat ttcaaccttg 960 cgttttgtac agaaaatatg cgtctcttcg accaactcag gccgaaagtc tacgcctttt 1020 cttaacttat cgtggcggta aatgcatctc cctgaatgct ggtcaacaca ctattggtgg 1080 cataccgaag aaaattgcgt cgtacttgaa attaagtgaa ccggaattgt ataccggtca 1140 ttcttttcgc cgctcagcgg caactatggt tgtggattct gggggagaca ttttggcact 1200 gaaacgtgca ggaggctgga aatctagtgc tgttgcagaa ggatatgtgg aagattcaat 1260 taagcagaag ctagaacgag caaaaaaatt gtttctagtt ccagagccga caagtgcttc 1320 ctccagttcc aaacacgcct ccacagtttc agaagaaaca agtgaaaaat ttaatgttag 1380 tgggaatcaa aattgcaata tcactattaa tgtgtatagc cgagaaatta atcctgtcaa 1440 taatgaataa gttgttcttg tattttaata atttgtagtt taattcagta attcagtttc 1500 tgtttgcaat aaaagcttga gaattaagag taattttgcc tcaaattttg ctaaatattg 1560 atggttgtaa agaaaacaat ttctaatcaa tataaatcaa cgttgcgaac gtgttaattt 1620 tataactggc gctcctgtca gttttactag tagctaactt gtaaaagtga agtttcgaag 1680 tgattagaca gatcggataa ttgacgttta caagtgagga acaaataaa 1729 // ID BEL-89_CQ-LTR repbase; DNA; INV; 459 BP. XX AC AAWU01006427; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-89_CQ_; KW BEL-89_CQ-I; BEL-89_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-459 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 312-312 (2011). XX DR GenBank; AAWU01006427; Positions 89494 89952. XX SQ Sequence 459 BP; 120 A; 105 C; 124 G; 110 T; 0 other; tgttggagta cgagggtccg gactatcgga gggatcgctg tcgatgggtt ccagacctgt 60 cgacggagaa catcgcagcg atcccaccgt actactacga cctcatcatg cgcgcggaat 120 cgtcgggagc agcgaagagc gatcggcaag actcgctcta ttgcccactt gctgcggtgg 180 tgaacgggtc acggcaaacc cctactcaat tttattttac atttttttga tctttgttag 240 ttttaagaat aaatttgtat taaagaataa atcgcgtttg taaggtaata aatgttgtat 300 attgtgaagt atgtaaagtg cgtgttttgt gtttggccaa cttgggacaa gaagccgccc 360 agcggacacc acaacttcac caggtgagaa ggaccagcac gggggaacag ctcgacggac 420 cagagagcta tcgaaggcgg aacggctcac atccccaca 459 // ID BEL-199_AA-I repbase; DNA; INV; 7651 BP. XX AC AAGE02030545; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-199_AA_; KW BEL-199_AA-LTR; BEL-199_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7651 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02030545; Positions 17384 9734. XX CC Positions [4453-5034] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 361..5157 FT /product="BEL-199_AA-I_1p" FT /translation="MTEAKVFRFVDNKKGNCRLCEETDKKDNMVSCDDCDR FT WFHLSCVNLLRRPTEEEKFLCRKCEYLENERVGLLNFKKKFDIANQLNESL FT KLQQDVANRREGEINLSARRQALIKLPKFNGAARDWPKFISIFNDTTAEGQ FT FTNLENLSRLEEALYGDAAKSVSNLMIGAANVPKILKRLEDVFGRPETIYQ FT QLLHELIKTLNYAKVHIVDIAKALDDLVTTMETLNKPEFLRDYRLITDIVT FT KLPFNLQITWTEALQRNTSQPTLQDLNKWLQAHAKTMRLLAAPTRKEHKTR FT INVHNNSPKIQCNICKNDHQTYKCNVLKNLNVEQRQQRVQQLGLCYSCLNK FT GHGSKFCRNKRECKINGCRGTHNRLLHKGEKPKENQDAAAEKSNHHSSKKS FT TVLYQILPVTLQNGKQVLETFAFLDPGSSLTLLDKSAADKLGLQGRSEPLE FT MTWTQNVSKETPSQRVQLWISGHKGKPYSMKDVRTIEDIELPKQTLDVQRM FT KREFQYLADVPVQSYVNAKPLILIGMEHAHLLLTTERRIGDEDSPMAAKTK FT LGWLIFGKVYRGEPSYSFCIRERESSDLKSFFEKFSSTEDFGVRAVEKLPK FT SLADERADAIIKETINYENGHYSIGLLWKTDTVNFPPSYNNALKRLRMLEK FT KLSKNADVKMWALNSFDGFIKKGYARKLSLEEVLENHPNTYYLPHFIVTNK FT NKIPPKSRIVFDAAAAIQGVSFNSALLAGSDTVKPLFGVLLRFRENTIAVT FT GDIQEMFQQVKIIKKDQHTQRYLWRQCEIDRSPDVYVMESMIFGSTCSPAC FT AQAVKNHNARLFKKDFPKASEASEDNFYVDDYLDSFNEVEEAAEVVQGVIN FT IQQHAGFHLRNFITNSHELAKTIPSERLQDSEIKLLENKEIPTDKVLGIYW FT NTKMDLIVFQLKLDVLEKTMTKREVLSFVMSMYDPLGLISNFTVHGRIILQ FT RWHKQSSDWDSHLPEELSALWDSWIKALNIASNTQVPRCMSLKNALIYELH FT TFVDASEDAFAAVIYLRSVSEDKVNVDLVSAKARVAPIKMLTIPRLELQAA FT VLGTRLLETVKKELRIKITKCFMWTDSEIVLAWINSQHRRYKIFVAHRVAE FT ILETTTADQWKYVKSAQNPADKGTKIIDRTSIWQHGPSFLLDPEEFWPETQ FT IEMKTDEELRNYVNIHQDIHPYSFIKDDYFSDWWRLVKQTNIIKKFTDWIR FT NKSDFNRSIDYEDLKLVEDATFRKAQWDCYAEEVLSLRTGQAINQSSNLRQ FT LNPALDEYGVLRARGRLESAMSLPKSARTPIILPYEHHISFLIVKSYHERY FT LHQNDNVVIAAIHQKFWIVKLRALLKKVKKYCQECVIAKSKPMPPMMAPLP FT DFRTEPFLYPFTNTGVDYFGPFEVAVNRSREKRWGVIFTCMSSRAVHIEMA FT EKLDTDSFIVCLRNFQNRRGKIKHLFSDNGTNFVGADNELKGLVMDIDKRM FT RNGDAAALAIKWTFNPPAASHFGGVWERLIKTIKLSLYKMLKQYGDRLPRP FT AILRSALIQAEFILNSRPLTHVPVEDFDDEVMTPFHILIGRAGEYVPPYDS FT TAVHLEKYHWKKVQIMANIFGTVGLKNTSQCY" FT CDS 5151..7538 FT /product="BEL-199_AA-I_2p" FT /translation="MLLKRNKWTNKVEPIKVDDIVIITDDKAPPGSWLKCR FT VISVRTAKDGQVRSAEIKTSKGIYERPAVKIAVLDIRKINKPDSLPKINSE FT TGQQGQTFQDTMQLSNDKNLSTANVQGQQNNDKDERRTPHKRTLRNKNPCN FT FISIMVFVYMFSMIFGFETKGLIAYDCANPEVNMTSYSLLDVASCVPQKTN FT LTSSVISIQVLQRNVRSLTKVLQCKVIIRRSIRHCGTFSHTSDYQYGYSYI FT VKEFNSEECRKVHALGIVSLTHDRQINELKLNTTTRGETLIVGSVTGNSCS FT GGTYSTPFYTWTGALVYYEYEIALYDYIASIDLENDQIFLKNGLMCTYSIG FT TCLDSEEGYLTWDVDLNQACETTEFEVIYEGVVNKTISLDGNIKQTHSVVY FT STISDTQVFSIKTREETRICGYNGFTTDHPRILIIETDTIRSPFTRKASTG FT KNLDLFTYFNSKITLVESYIGQSLNDIYNMVMAEMCKLDKVILETKLTLAR FT LNPNEFVTSIMKRNGYTAVVAGEVLHVLECKPVYITPRFTDQCYQEIPVYY FT NNATMFLAPVTRVLQTMGTEIDCTPLLPAKFKFGNRWYTTDGRLRETTAPT FT KLSTDIVTSWTYTPLPNLMESGVYDSSSLDKMRNMIYEQGERRVASSVVYK FT VLAGQHPNTQGFRFDALVSEKIIDGAINKYWKKLLSWSTWLGNITSTAIGV FT YIIIRALKFTIDTIIHAQILYDIYGFSWHLIASFWDSLTNLLSHKNIRKET FT LNVKRDENNRCSNENQVPQNDQNTTEVTELSDYRGRIYPKFDSAA" XX SQ Sequence 7651 BP; 2652 A; 1282 C; 1586 G; 2131 T; 0 other; ttttggtggc tccagagagg aaaactagtt aactcattat cggaagtgtc gcgacgttga 60 tggtgttgtc ggcgcccaaa gatcgacaag tgtacgggat ttatcgtgag tgaaattaga 120 actccggcag taaatctgtc gttataaagt gaaaagtaaa agtgagaaac ctcataagtc 180 gctgagccgc agatggttgc aagaagtgag aactgagaag agttctgaaa cgtgataaaa 240 aaaaaaaaat tgacttctta agacgccgtg tcgcagaagt aaggaacaat ccagagtgcc 300 caagcgaaat atagtgctag ttgttaaatc agaaacagtg atttaaagac aaagtaaata 360 atgacagaag caaaagtgtt tagatttgtg gacaataaaa aaggaaattg ccgtttgtgt 420 gaggagaccg ataagaaaga caacatggta tcctgcgacg attgtgatag gtggtttcac 480 ctgtcatgtg tgaacctctt aagaaggcca acggaggagg agaaatttct ttgtcgaaag 540 tgtgagtatc tggaaaacga aagagtgggc ttattgaatt tcaagaagaa gtttgatata 600 gcaaatcaac tgaatgagtc gctgaagctt cagcaggatg tagcaaaccg ccgtgaagga 660 gagatcaatc taagtgcaag aaggcaggcg ttaattaagc ttccaaaatt caatggcgca 720 gctcgtgatt ggcccaaatt tattagtatt ttcaacgata ctaccgccga gggccaattc 780 acaaatcttg aaaacttatc aagattagaa gaagctctgt atggcgacgc agccaagagc 840 gtgtcaaatt taatgattgg ggctgcaaat gtacctaaaa tattgaaacg tcttgaggat 900 gtttttggaa ggccggagac tatttatcaa caattattgc atgagttgat caagacttta 960 aactatgcaa aggtacatat agttgatata gcaaaagcat tggacgatct agttacgaca 1020 atggaaacat tgaataaacc ggagttctta cgcgattatc gattgataac cgatatcgtg 1080 acaaagcttc cgttcaacct gcaaataaca tggacggaag cactgcaaag aaacacttcg 1140 caaccaactt tgcaggattt gaacaaatgg ctgcaagcac acgctaagac aatgcgattg 1200 ttagcagctc ccacgcgaaa ggaacataaa acaaggatca atgttcataa caattcgccc 1260 aaaatacaat gcaacatttg taagaacgat caccaaactt acaaatgcaa tgtattaaaa 1320 aatctcaatg ttgaacaaag acaacaacgc gttcaacaat tgggattatg ttattcgtgt 1380 ctgaataaag ggcatggcag taagttctgc cggaacaaac gtgaatgcaa gattaacggt 1440 tgtagaggaa cacacaatcg tctgctacac aaaggagaaa agccgaagga aaatcaagat 1500 gctgccgctg aaaaatctaa tcatcatagc agtaaaaaat caacagtatt atatcagatt 1560 ttacccgtca ccttgcaaaa tggaaagcaa gttctggaaa catttgcttt ccttgatccc 1620 ggatcttcgt tgactctact ggacaaatca gccgcggata aacttggcct tcaaggacgt 1680 tcggagccct tggagatgac gtggacacaa aacgtgtcga aggaaacgcc aagtcaaaga 1740 gtgcagctat ggataagcgg tcacaaagga aaaccatatt caatgaagga cgtcaggact 1800 attgaagaca tcgagctgcc aaaacaaacg ttagacgtac agcgcatgaa gcgggagttc 1860 cagtatctag cagatgttcc agtacagagc tacgtcaacg ccaagccgtt gattttaatt 1920 ggtatggaac acgcccacct gttattgaca actgagagaa ggattggcga cgaggactcc 1980 ccgatggcag caaagacgaa actgggttgg ctgattttcg gcaaggtata cagaggtgag 2040 ccatcatatt ccttttgcat tagagaacga gagagtagcg atttaaaatc attttttgaa 2100 aaatttagtt caacggaaga ctttggtgtt agggcagtag aaaaattacc taaatcgctt 2160 gcagatgaga gggctgatgc cataataaaa gaaacaatca attatgaaaa tgggcactac 2220 tcgataggtt tgttgtggaa aaccgacact gtcaattttc ccccaagtta taataatgcc 2280 cttaaacgtt taaggatgtt ggaaaaaaag ttaagtaaga acgcagatgt taaaatgtgg 2340 gcactaaatt cattcgatgg atttattaaa aaaggatacg cccgaaaact tagcttagaa 2400 gaagtgttag aaaatcatcc gaacacctat tatttacctc atttcattgt gacaaacaaa 2460 aataaaatcc cgccgaaatc ccgaatcgtt tttgatgcag cagctgcgat tcaaggagtt 2520 tcttttaatt ccgctttgct tgcgggttcg gatacagtaa aacctttatt cggtgtttta 2580 ttgagattcc gtgagaatac aattgctgtc actggagaca ttcaagagat gtttcaacaa 2640 gtgaaaataa ttaagaaaga tcagcataca caaagatatc tttggagaca atgtgagatt 2700 gacagatctc cagatgtata tgttatggaa tctatgattt tcggttccac atgttcacct 2760 gcttgtgcgc aagctgttaa aaatcataat gcacggcttt ttaaaaagga ttttccaaag 2820 gcttctgagg cctcagaaga taatttctat gtagacgatt acttggacag cttcaatgag 2880 gtggaagaag cagctgaagt cgttcaggga gttataaaca tacaacagca tgctggattc 2940 catctgagaa attttatcac aaactcacac gagcttgcca aaactattcc ttcagaaagg 3000 ctacaagatt cggaaattaa gttattagaa aataaagaaa ttccaactga taaagttttg 3060 gggatctatt ggaatactaa aatggattta attgtatttc aactgaaact tgacgtattg 3120 gaaaaaacaa tgacgaagag agaagttctt tctttcgtga tgagcatgta tgatccattg 3180 ggattaattt ctaattttac agttcatggt aggattattt tgcaaagatg gcataaacaa 3240 tcctctgatt gggatagtca cttgccagag gaattatctg ctttatggga ttcatggata 3300 aaggctctta atattgcttc gaacacacag gttccaagat gtatgtcgtt gaaaaatgca 3360 ttgatttatg aattgcatac ctttgtggat gcttctgaag acgcattcgc agcagttata 3420 tatttgagaa gcgtgtctga agataaggtt aatgttgatt tagtttcagc caaggcaaga 3480 gttgccccaa ttaaaatgtt aactatacca cgcttagaat tgcaagcagc ggtattggga 3540 acaagattat tggagacagt aaagaaggaa ttacgcatta aaattactaa atgttttatg 3600 tggacagaca gtgaaattgt tctagcttgg ataaacagtc aacataggcg atacaagatt 3660 tttgtagctc atcgagttgc tgagattctg gaaacaacta cagctgatca gtggaaatat 3720 gtcaaatctg ctcaaaaccc cgctgacaaa ggtacaaaaa ttattgatag aacatcaata 3780 tggcaacatg gaccttcatt tttacttgat cctgaagaat tttggcctga aacccaaatt 3840 gaaatgaaaa ctgatgagga attacgaaat tatgtaaaca ttcatcaaga catccatcct 3900 tattcattta ttaaggatga ttatttctca gattggtggc gattggtaaa gcaaaccaat 3960 ataataaaga agtttactga ttggattaga aacaaatccg atttcaaccg aagtattgat 4020 tatgaagatt taaaattggt tgaagatgct acattcagaa aggctcagtg ggattgttat 4080 gcagaggaag tgttatccct aaggacaggg caagcgatta atcagtcgag caatttacga 4140 cagttaaatc ctgcacttga tgaatacggt gtactcagag caagaggtcg attggaatct 4200 gcaatgagct tacctaaatc agcaagaacg cctattatat taccgtatga acatcatata 4260 tcttttttga ttgtcaaatc gtatcatgaa agatatttgc atcaaaacga caatgttgta 4320 atagcagcta ttcatcaaaa attttggatt gttaaattga gagcattact aaaaaaggtt 4380 aaaaagtatt gccaagaatg tgtcattgcc aaatctaaac ctatgccacc catgatggca 4440 cccttgccag attttcgaac agaaccattt ttgtatccat tcaccaacac aggagtggat 4500 tattttggac cgtttgaagt cgcagtaaat cgttcgagag agaaacgttg gggtgtgatt 4560 tttacttgca tgtcaagtag agcagtccat attgaaatgg ctgagaaatt agataccgat 4620 tcgtttattg tttgcttgcg aaatttccaa aatagaagag gaaaaatcaa acatcttttt 4680 agcgataacg gcactaactt tgtcggagct gataacgaat tgaaagggtt agtcatggat 4740 atcgataaaa gaatgagaaa tggtgatgct gcagcattag ccataaaatg gactttcaat 4800 ccaccagcag catcacattt cggtggtgtg tgggagagac tcattaaaac gattaagcta 4860 tctctctaca aaatgcttaa gcaatatggt gatcgattac cacgtccagc aattttgaga 4920 tctgcattaa ttcaagcaga atttattttg aattctcgtc cgttaaccca cgttcctgta 4980 gaagattttg acgatgaagt aatgactcca tttcatattc ttattggaag agcaggagaa 5040 tatgtaccac cttatgattc gacagcggta catttagaaa aatatcattg gaaaaaagtt 5100 caaattatgg caaatatttt tgggaccgtt ggactaaaga atacctccca atgttattaa 5160 aacgaaataa gtggaccaat aaagttgaac ctattaaagt tgatgacatt gtcataataa 5220 ccgatgacaa agctccacct ggatcatggt tgaaatgtcg tgtaattagt gtgagaacag 5280 caaaagatgg acaagtcagg tctgctgaga ttaaaacttc taaaggaatt tatgaacgac 5340 cagcagttaa gatagctgtc ctagatattc gaaaaattaa caagccagat tcccttccga 5400 aaataaatag tgaaactggt cagcaaggtc aaacctttca agacacgatg caactctcaa 5460 atgataagaa tttgagtact gccaatgtcc agggtcaaca gaataatgat aaagatgagc 5520 gcagaacacc ccacaaaaga actttgcgaa ataaaaaccc ttgtaatttt atatcgatta 5580 tggtttttgt atacatgttt tcaatgattt ttggttttga aactaaaggt ttgatagcgt 5640 atgactgcgc caatcctgaa gtgaatatga caagctattc tcttcttgat gttgcttcct 5700 gtgttcccca aaaaactaat ctgacctcat cagtaatatc catacaagtc ctgcagcgaa 5760 acgttagaag tctaactaag gttctacaat gtaaagttat aattcgaaga tctatacggc 5820 attgtggtac attttcgcat acctcagatt accaatacgg atactcctat atagttaagg 5880 aatttaattc cgaagaatgt aggaaagtac atgctttagg aattgtatca cttactcatg 5940 atagacaaat taatgagcta aaattaaata ccacaacacg aggtgaaacg cttattgtag 6000 gaagtgttac tggaaattct tgtagtggag gtacttacag tacacccttt tatacatgga 6060 caggtgctct agtttactat gagtacgaga ttgcactata cgattatatt gcaagtattg 6120 accttgaaaa tgatcagata tttctgaaaa atggcttaat gtgtacttat tcaattggaa 6180 catgtctaga ttccgaagag ggttatctga cctgggacgt agatcttaac caagcctgtg 6240 aaactactga atttgaggtc atttatgaag gagttgtgaa caagaccata agtttggatg 6300 gtaacataaa gcaaacccat tcagtagttt atagcactat atcagatacg caagtatttt 6360 cgataaaaac ccgagaagag actcgcattt gtggatacaa tggatttact accgaccatc 6420 cgagaatttt aattattgaa accgacacta ttagatcacc atttacgaga aaggcatcca 6480 caggaaaaaa tttagattta tttacgtatt tcaactcgaa aataactctt gttgaaagtt 6540 acattggaca atctttaaat gatatttata atatggtaat ggctgaaatg tgtaagttgg 6600 ataaggttat actggagact aagttaactt tagccagatt gaacccaaat gaatttgtaa 6660 ctagtataat gaaaagaaat ggttacacag cagtggtcgc tggtgaagtt ttgcacgttt 6720 tggagtgcaa gcctgtttac attacaccac gatttactga tcaatgctat caagagatac 6780 ccgtctacta caacaatgcg accatgtttt tggcaccggt tacacgcgtt ttacaaacaa 6840 tgggaactga aatagattgt acaccgttac tgcctgcaaa atttaagttt ggaaatagat 6900 ggtatacaac agatggaaga ttacgagaaa ccactgctcc gactaaatta tcaactgata 6960 ttgtaaccag ctggacatat acaccattac caaacttgat ggaaagtgga gtttatgatt 7020 ctagcagtct agacaaaatg agaaacatga tatacgaaca aggtgaaaga cgtgtggcgt 7080 cttctgttgt ctataaagta ttagctggcc aacatcctaa tactcaaggt tttagatttg 7140 acgcattagt ttcggaaaag ataattgatg gtgcaataaa taaatattgg aaaaaattgt 7200 tatcttggtc gacgtggttg ggtaatatta cgtcaacagc tattggcgtt tacataatca 7260 tcagagcatt gaaatttacg attgatacaa ttatacatgc acaaatattg tatgatattt 7320 acgggttcag ctggcatttg attgcttcgt tttgggactc tttaacgaat ttattgtcac 7380 acaaaaatat acgaaaagaa accttgaatg taaaaagaga cgaaaataat aggtgttcaa 7440 atgaaaatca ggttccacaa aatgatcaaa ataccactga agtcacagaa ttaagtgatt 7500 atcgaggaag aatttatcca aaatttgatt cagcagctta aaataaggaa gaataattat 7560 aataagcata aagctcagac gaaattacaa gttaaaacat cgcttatcga ctttttgtta 7620 tgtaaatcca gtaggtttac ggggcgcgga a 7651 // ID P-33_HM repbase; DNA; INV; 4986 BP. XX AC . XX DT 07-JAN-2009 (Rel. 14.02, Created) DT 07-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-33_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4986 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 9(2), 445-445 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(4458..3577,1826..2179,2121..2459,2338..2550, FT 2656..3498) FT /product="P-33_HM_1p" FT /translation="MHYNDFVTDIKKIVAETKADLSTNKCYPQEVISEIKD FT FEIKITENFFDEIRKMSKKLKSSGNAEKYYSNFYSNIVLFSDTYFNLKKPA FT STVFTTKLCNKLFQHFQSSPSKNVEVEEISPREVDGLKYLSGYVIRNLLKK FT AKTTKNYKSQYNQAIIALLLQTTKEVSSSEKLINTQNRGGLTAVCEECYQI FT FLLAEKHFRQETAVANLRNIESKNLTKHLLKDPNIISFYNAIVDDVKFVSI FT DKEIKLNLLEKMIELYFRVRTFSLAKNITAKNNDQSFKVKSLRKSIKRKSN FT NKRSSNVLTLPSRRTLRDYKNFIRPKAGLNPPVIEELNKISINLVGYQRYV FT TLSFDEVKIQEKLVFNKHTGDIIGFVDLGEPDLYFSTFQKEDSLASHVLVY FT FIRGIASDLKVCFMSLCYSTSFEVLLLILKFALCHFATKGLTSFQLMPTFW FT EAVGVLELTCKLYVIAAVSDGAAPNRKFYRMHSMFDDKLDDCNVVHRCINI FT YAPERYLWFFADAPHLIKTARNCLYHSGMMTVMLFIDALIFMPLNGTYGFL FT LMLPISSKQQGTVYITQVCKLYLKCIGLQAFPVENYQVMVISCWKVIIKAI FT ILCYIKIQSFFISGDGRGTRSLWNDGQQLIWYHITRIVNDEMKNGLKIIPK FT LTQDHIKLSAYSVMNVRLAAQVLSSSVSNILKNYYPDDTNGTAKFCEMLDS FT FFDCLNVRNSSEGIMKRKPFLLPYTDVNDERLTWLQNTFLCYLEKWKECII FT NRPGNFSNNAKGRMFISWQTYEGLQITVMSVIELVKFLLKSGMPFVLTEKF FT NQDVLEEYFGRQRSLGRRNDNPTLYQFGYQSNTLRLQRSIAPVTGNTKGGH FT GQKREVSWSVVDNEKLPKRKKNL*" XX SQ Sequence 4986 BP; 1729 A; 694 C; 767 G; 1796 T; 0 other; catagtcaat agaaataaac cctggtctgg aaaagctatt gtcgtatgtt attgttcgct 60 tttttttaag tcacgtgatc aaatctagtg caaaacaaac aaaactttta taatgaaagt 120 tacgaattaa taaagatttt gatgtttatt ttaaattaat atattaaaat tattattttt 180 ctttattatt aaaataatat tggttccttt ctattgtcta tggtttaaaa actttattca 240 atttaaaatg tttctttata ttctatattt aaaaaaagtc taaaagataa atccaaaatg 300 ccaggttcaa attgttgtct accaggatgt acagtttcaa gatctattaa gcacaaagat 360 atcaccttat tcagaattcc tatgaggaca ggagactttt ataaaaaatg gagacagaga 420 ttgttagcta ttatcactca gtatagagac tttgacaaag atttgaaaga aagactctca 480 aagggcaata tttatatatg tgagaatcat tttttagcag aagatattga attgacaagt 540 aaggatactt ttatatattt agttgtttaa atttgctttt acactctgtg ttatttaatt 600 ttaaaagctt taattttggg caactctgat aagtttaagt acaataaaca aataaatttc 660 ataaagctta aagtatttgt ggctttatat aaatttgtgg ataacatatt ctcttttttt 720 ctcagtattg ttcttatttt tgtgtgataa tttatattta ctttattgaa atatgttgaa 780 tttcagaaac tggaaggaaa tcattacagc ttagaggctt taccaactgt taacctaccc 840 ataaaatcac atgaaaaaga aaaaaaggta cctgaaagaa gacacataag tattgtaaaa 900 gatattgtca ccacagaagc gaaacataca gacaattatt caaacttaat tgaatttaca 960 gaaaaaaacc caaaagttaa aacttagtgg atgggagatg acatgtaatg aaaacaatat 1020 caaatttaaa aagttattgc ctttgtattt gttacctaag tatgaagttg tagttagatg 1080 acagtttagc gttcaccatt attgtatttg ggtggctctt gccagatgac catcacttaa 1140 tctataaatt atactcacga tcaatgctga acactacagt taccgaactt ttatgcaaaa 1200 tatgtacatt acagattttg ttctggtatt atagaaaaca tcagaaatat aaataatgat 1260 gattttgttg atcatattgt tccaatggaa tttgatttag aaagtgcgaa aaaagtacag 1320 atggtatgcg tgtatcaaat gaaaggttta aaagagcaaa atcatgtatt atattacata 1380 ctagtgagtc tgataatata aattctgttt gttcaatttt gtacccacac actaaaaaag 1440 aacaaaatga ttaaaataaa aaaacaacat cagttagcca cacctgctct tcgaaaagct 1500 ccacttagta aaacaaatcc tgttcgcgtt ctattagcat taaaacaaga acgttctaag 1560 atgtgcccag ctaacaagtg atttagaaaa aataaatttt caaatagact ctaaaggtat 1620 tactattgat gatgggtttg ttgatgattt aaatcaaata atgagtagta actacgaaaa 1680 agctactccc tttatgaaat tattttggga ccagcaaaag aaaatcgtgt aataaaaatt 1740 ttggctcaat aaagtatcac cctatgataa tacggttttg cctctcactt gcctcaaagt 1800 cggcctcagc ttatgatgag tttagcgctc atcaaatgta ctaacattac caagtcgcag 1860 aactctaaga gattataaga atttcattcg accgaaagct ggattaaatc ctcccgtaat 1920 tgaagaattg aataaaatta gtattaacct agttggctac caacgatatg taacgttatc 1980 ttttgatgaa gtaaaaatac aagaaaaact tgtttttaat aaacacacag gtgacattat 2040 aggttttgta gatttaggtg aacctgattt atacttttct acttttcaaa aggaagattc 2100 tttagcaagc catgttctag tctacttcat tcgaggtatt gcttctgatc ttaaagtttg 2160 ctttatgtca ctttgctact aaaggtctca cctcgtttca actaatgcca actttctggg 2220 aagcagttgg agttttagaa ctgacatgta aattatatgt tatagctgca gtatcggatg 2280 gtgcagcacc aaacagaaag ttctatcgaa tgcatagtat gtttgatgac aaattagatg 2340 actgtaatgt tgttcatcga tgcattaata tttatgcccc tgaacggtac ttatggtttt 2400 ttgctgatgc tccccatctc atcaaaacag caaggaactg tttatatcac tcaggtatgt 2460 aaattatatt taaagtgtat tggtttacag gcctttccag tagaaaatta tcaggttatg 2520 gttatttctt gctggaaagt aataataaaa taacagattt ttatattaaa aagtatgaaa 2580 tgcttactta tttagctaag attagaaggg ctatgcaatg ttttaaacct ccctagattg 2640 agagttgaat tgtaggcaat aattttgtgt tatattaaga ttcaatcttt ttttatttca 2700 ggggatggaa ggggtacacg aagtctttgg aatgatggac aacaattgat ttggtaccat 2760 atcacaagaa tagtcaatga cgaaatgaaa aatgggttaa aaattatacc aaagttaaca 2820 caggatcata ttaaacttag tgcttattct gttatgaatg taagacttgc agcccaggtt 2880 ttaagtagta gtgtcagtaa cattttaaaa aattattacc ctgatgatac aaatggaact 2940 gcaaaatttt gtgaaatgtt agatagtttt tttgattgct taaatgtgcg aaacagcagt 3000 gaagggataa tgaaacgcaa accattttta ttaccataca cggatgttaa tgacgagcgt 3060 ttgacatggc tacagaatac atttttatgc tatctagaaa aatggaaaga gtgtattatt 3120 aatcgtcctg gtaacttttc caacaatgca aaaggaagaa tgtttatctc ttggcaaaca 3180 tacgaaggac ttcaaataac tgtaatgtct gttatcgaat tggtaaaatt tttattaaaa 3240 agtggtatgc cgtttgtttt gactgaaaaa ttcaatcaag acgttttgga ggaatacttt 3300 ggtagacaac ggagtttagg tcgaagaaat gacaacccta cactatatca atttggttac 3360 cagtcaaata ccttgcgatt gcaaagatca attgcgccgg ttactggaaa tactaaaggt 3420 ggtcatggtc aaaaacgtga agtatcttgg agtgttgttg ataatgaaaa gttaccaaaa 3480 cgaaaaaaaa acctataaat aagtaatcta aatataaaca tagcatataa caaaatatat 3540 caattatttg tattaagttt aaagtttaag tctttattta ttgtttgatt tacgttttat 3600 actttttctt aaagatttta ctttaaaact ttgatcattg ttttttgcag ttatattttt 3660 agccaatgaa aatgtacgaa ctctaaaata aagttctatc attttttcta acaagttaag 3720 cttaatttcc ttatcaatag acacgaattt cacatcatcc acgattgcat tgtaaaaact 3780 tataatatta ggatccttta ataggtgttt tgttaaattc ttactttcaa tatttcggag 3840 gttagcaact gcagtttctt gtctaaaatg tttctcagct agcaagaaaa tctgatagca 3900 ttcttcacat actgctgtta aaccccctct gttttgggta tttattaact tttctgaact 3960 tgaaacttct ttagttgttt gaagtaataa tgcaattata gcttgattgt actggctttt 4020 gtaatttttt gttgtttttg cctttttaag tagatttctt ataacatacc cagacaaata 4080 ttttaatcca tccacttcac gtggtgaaat ttcttcaact tcaacattct ttgaaggaga 4140 gctttgaaaa tgctgaaata atttattgca taactttgtt gtaaatactg ttgatgcagg 4200 tttctttaga ttaaaatatg tgtctgaaaa gagaacaata ttgctataaa agttggaata 4260 atacttttca gcatttccag atgattttaa tttttttgac atttttctta tttcatcaaa 4320 aaagttttct gttattttta tttcaaagtc ttttatttct gaaataactt cttgaggata 4380 acatttattt gttgacaaat cagctttagt ttctgcaaca atttttttta tatctgtgac 4440 aaaatcatta taatgcatat ctgtaagact ctttatggta tgtttgtttt tattgtgtcg 4500 tgctaagcct ccttttgttt tatacgtttt ggatttgcaa attttgcact tgaagtctcc 4560 aatatcggtt acctaatata tattaaaaat agcatttatt atatatatat tattttagta 4620 cgattatcaa cagtaaatat taggatacaa taacttttaa aaatgttttt ttaatttaac 4680 ttaatcattt gcgttaacct ttacaccaaa ataaaccatt ctaattaaaa tttaaaaatt 4740 attattacct ctaataatgc attatcaaaa tattcttgca aatctggatt taattcatca 4800 tcagcattaa tatttattat ataatcaagc gtttcgcctg taaaaagttc ttcagaaaac 4860 gccatttaaa aatttgtttt gcactagatt tgatcacgtg actacttttt acgcactttt 4920 caagatacgg ttttaaaata agattgaaac aatagctttt ccagaccagg atttctattg 4980 actatg 4986 // ID Copia-20_AA-LTR repbase; DNA; INV; 191 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_AA_; KW Copia-20_AA-I; Copia-20_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-191 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 949-949 (2011). XX DR [2] (Consensus) XX SQ Sequence 191 BP; 49 A; 44 C; 32 G; 66 T; 0 other; tgttggagta aatcggtaac gcagtgtgct gatgacctcc ttggagaccg atggtctccc 60 tagcaaccga agttttcttt aataaacaac ggtgcaattt tttttcattc ctgttcaagt 120 ctccactcaa gctagttaaa taaaattcct ttctgctctt aaaactttgt tccggttatt 180 tccaaccatc a 191 // ID hATx-1_SM repbase; DNA; INV; 2680 BP. XX AC . XX DT 11-OCT-2007 (Rel. 12.1, Created) DT 11-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2680 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1037-1037 (2007). XX DR [1] (Consensus) XX CC This is another branch of highly diverged families of hAT CC elements CC from freshwater planarian. XX FH Key Location/Qualifiers FT CDS 417..2504 FT /product="hATx-8_SM_1p" FT /translation="MERVARGTYTKWQNHCKILKNDKFTVECNLCSDDAKD FT KLIKRTDYNTKQIHSHLWKFHQYRYVQPEKSKDSAATQSQRKTIDTYFKEK FT LTLTDEIVAKLAADVGLSFNQIANPLMQKLIESYTKVEAPKNDEKVRQTVY FT RAAAFVKNNIKNDIQLMKSQGAKFSYVSDEATFKQIKVMNVHLFGPNGYNQ FT NLGMIEMGKECDAKTLLRLQNQQLVIFKVDPDKDCIGQTMDGAPVCIKCGR FT LSNRLHHICYDHGIHNCVKRVTTRKKPKKISADLNCDDNSSSSMSDETSDG FT DCDDNEVSDVEIVGSSKGSGSVASSNGMIRTVDGSHSSDESDSESDDGCDV FT AISDEEIVAEEYEYDPALSITKTLKDVREAIRKFKYNLNLVKVLKENATFL FT GIDFKLLAVDVKTRWNSSLKMLTVFNDMMPAVKKTLKDYGHNWHEYLESSV FT PELISTLQPCKLAMDAISGDESDLNMAEGALEFLLKEMRLQQKCFLRDRLV FT EELETEIEKRRPKLTVSLLKFLNNPGVLNEDQEYPFTLSKKEEIFKEGIRL FT WRKHYFINDSDGGDDDDVEVEKVSIVSTETQQQKNDDVDDPKSRLLASLKR FT RTTSNFVPKLPKRKKPSPKAIDPNDLTVEDFNKYISTGIKSEKLSKLEEAL FT KLIRGTSIRSEQNFSIAGNHRSKRRERMTSVTMSDLSVLKSYFQKKR" XX SQ Sequence 2680 BP; 967 A; 464 C; 526 G; 723 T; 0 other; tatacttcca gatctcgctt gtctcggaat ctcggtctcg gcaatacgag attccattac 60 gagatagaat caaaaacgag attttaaaaa atcgaaaaaa ttaaacatag aaatttgcta 120 aatagaccat ttagtacttt tataatttaa caccaaaact tatttttagc atattgctat 180 gctttttcgg taattattag aatgttattt aaatgtgttt tagacaataa caacgttagt 240 cagttttgac gggaaaaaac tgtttactaa atattcttca gaaatcaatt aaaaatttga 300 acaaatcatt gaaaatttga cagtaatttg aatatgttta gacaacaacg aggttagtca 360 gttttgatgg atataaatca gccataaata aattttaaat ttaaaaaggc tcgaatatgg 420 agagagttgc gagaggaact tacacaaaat ggcaaaatca ttgcaaaatt ttgaaaaacg 480 acaaattcac ggtggaatgc aatctttgtt ctgatgacgc aaaagacaag ctcatcaaac 540 gaacagatta caacacaaaa cagattcatt ctcacttgtg gaagtttcat caatatcgat 600 acgttcaacc agaaaaatca aaagattctg ctgcaactca atctcagcga aaaacgatcg 660 atacatattt caaagaaaaa ctcactttaa ctgatgaaat tgttgcaaaa ctggctgcag 720 acgtcggcct cagttttaat caaatcgcga atcctttgat gcagaagttg atagaaagtt 780 acactaaagt ggaagcaccg aaaaatgacg agaaggttcg acagaccgtt tacagagctg 840 cagcatttgt gaaaaataac attaaaaacg acattcagtt gatgaagtcg caaggagcta 900 aattttctta tgtcagcgat gaagcaacat tcaagcaaat caaagtaatg aacgttcatc 960 ttttcgggcc gaatggttac aatcagaatc ttggaatgat tgaaatgggt aaagaatgcg 1020 acgctaaaac actattgagg ttacaaaatc aacaattggt aatattcaag gttgatcctg 1080 acaaggattg tattggacaa actatggatg gcgcacctgt ttgtatcaaa tgcggtcgat 1140 tgtctaatcg actccaccac atctgttacg atcacggtat ccacaattgt gtgaaacgcg 1200 ttacaactcg aaagaaacca aaaaaaattt ctgctgatct caattgtgat gacaattctt 1260 catcctcgat gtctgacgag accagcgatg gtgattgcga cgataacgaa gtgtcggacg 1320 tagaaattgt tggttcttca aagggttctg gaagtgttgc aagcagtaat ggtatgataa 1380 gaacagttga tggctctcac agttccgatg agtcagattc cgagagtgat gatggttgtg 1440 atgtcgcaat atctgacgaa gaaatagtcg cagaagagta cgagtatgac ccagcactga 1500 gtattacaaa aactctaaaa gatgtacgag aagccatcag aaagttcaaa tataatctca 1560 acttggtcaa agtcttgaag gaaaatgcta cttttctcgg aatcgatttc aaactactag 1620 ccgtcgacgt caagacacgt tggaacagtt cattgaaaat gttaactgtc ttcaatgaca 1680 tgatgccagc tgtgaaaaaa actctcaagg attatggaca taattggcat gaatacttgg 1740 aatctagtgt tcctgaactc atctcaactc tgcaaccatg taaattggca atggatgcca 1800 taagtggcga tgaatcggat ttgaacatgg cagaaggagc tttggaattt ttgttgaaag 1860 aaatgcgact tcagcagaaa tgcttcctga gagatcgctt ggtagaagaa cttgagacgg 1920 agatcgagaa gagaagacct aaattaacag tgagtcttct caagtttctg aacaatccag 1980 gagttcttaa tgaggatcag gaatatccat ttacattgtc caaaaaagag gaaattttta 2040 aagaaggaat cagactttgg cgaaagcatt attttatcaa cgacagcgat ggtggtgatg 2100 acgatgatgt cgaagtcgaa aaagtctcaa ttgtctcaac tgaaacccaa caacaaaaaa 2160 acgatgatgt tgatgatccc aaatcacgat tacttgcctc gctgaaacga cggacgacat 2220 cgaatttcgt tccaaaactg cctaaaagga agaagccaag tccaaaagct attgatccta 2280 atgaccttac tgttgaggac ttcaacaaat acatttcaac aggaatcaaa tcggaaaagc 2340 tctccaaact cgaagaagca ttgaaactta tcagaggaac gtcgatccga agcgaacaga 2400 atttctcaat tgctggcaac caccgtagca agagacgaga gcgaatgacc agcgtaacta 2460 tgagtgattt gagtgtcctt aaaagttact ttcaaaagaa acgttaattt gtcgatttag 2520 catgtatata tatgtgaaaa atataaaaat aaaactaaaa aatctgtttt tttaaaaaca 2580 attaatcatt taatgaaaaa tctcgtttta ttctcggtct cgtacgagat ctcgtctcgg 2640 aaaaccgaga ttataaaacg accgagatcc tggaagtata 2680 // ID Gypsy-4_AC-LTR repbase; DNA; INV; 225 BP. XX AC AASC02003433; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_AC_; KW Gypsy-4_AC-I; Gypsy-4_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02003433; Positions 6979 7203. XX SQ Sequence 225 BP; 63 A; 53 C; 61 G; 48 T; 0 other; tgtgatgtca agggacatga ttcttatctt tactgtgggg acaagataat taggggagtg 60 gcacaccctt ggatgacgtc aagagcccgg cgcgaaaaag agagacatcg tctgagagac 120 tcactgagtt atttagtcga actcaggact acgacgaggc aacacttatc ttggacgggg 180 accccctgac agacccgact catcggaccg ccaggtaact ttaca 225 // ID BEL-64_CQ-I repbase; DNA; INV; 6346 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-64_CQ_; KW BEL-64_CQ-LTR; BEL-64_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6346 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 281-281 (2011). XX DR [2] (Consensus) XX CC Positions [4993-5571] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2416..5961 FT /product="BEL-64_CQ-I_1p" FT /translation="MELYYDLLLDGFTKLGPEKPILQNTVFGWVASGKIGS FT GQAEGALKMARVCSDQSLDQLLNRFWEVESCRSASTQSMEEVACEEHFVGN FT TYRDESGRFVVTLPKQPSVLNQLGDSKKIATRRFLALERRLDANPQLKQVY FT TNFIEEYHQLGHMREVEDVSSHPAGPSYYLPHHGVEKADSTTTKLRVVFDA FT SCRTDSGVALNQALMVGSVVQDDLFAIRLRFRMNRIALIADIEKMYRQIRI FT HPSDYQLQRILWRNSSSEPLRTFELTTVTYGTASAPFLATRCLNELSKQGA FT EDYPLASLALGRDFYVDDMLTGVYDEEEGEELCKQLLSLLPSAGFSLRKWA FT SNSAEVLSKIPPELRDERSVLGLESHTTPIKTLGIQWQPATDTYSYAVPIW FT STQSVITRRVVASDLAKLFLPLGLLGPVIVLAKIFVQSLWEENKSWDAPLK FT PEQQQYWWEYRNSLEDIPSISIPRWVACASDPVLVELHGYCDASERAYGAN FT IYIRTVSRDGTISVRLLCAKSKVAPSGKSKNSVLLSLPLLELSSALLLAHL FT YHKVAASLDLKTKPFFWTDSMIVLRYIRSPPARWKKFVANRVAEIQRLTAG FT GIWSHCPGAENPADIISRGMSPAELRDTPAWWISHEWLNRESRFWPPINQP FT LPDYLPGIAQAERAVALPVQAVKPNELFAIQSSFPLLLRLVAYLRRFQHNA FT SSRDRNNRRIGHLTTLELTDATKILVRIAQREAFARDLAEIESSGQVSHRS FT ELKNLAPILVDGVLRVGGRLKHAAISEDRKHPMILPARHPLTERILVHYHE FT KHLHAGPQLLVACVRERFWPLRVRNSARTIVHTCVNCYRCKPTTSEQLMGD FT LPQQRVTPTLPFLNTGVDLCGPFQYRKAPRAAPTKCYVAVFVCLVTKAAHI FT ELVYDLSTAAFLAALHRFIARRGKPKLIQCDNATNFKGAERELEKLRKQFI FT NQQLQAAVVNRCADDGIDFKFIPPRSPNFGGLWEAAVKSFKKHLRATIGNS FT VLSQDEFVTLLARIEACLNSRPLTPLSADPNDLEVLTPGHFLTFRPLTSPP FT EPDLSEIPRNRLDRWQENQELLRRIWKRWTIDYLSGLHPRTKWTQKRDNIN FT VGTLVLLKEDNLPPLKWRYGRVIRVCRGDDDNIRVVVVRTADGEYTRSISK FT ICVLPLRQPIADVAGAANPNPED" XX SQ Sequence 6346 BP; 1509 A; 1823 C; 1630 G; 1373 T; 11 other; ttttttggtc cttcgattgc cggatgtgcg cgatttatta cggacaatcg cggacatttc 60 ttgtcgtaca tttcgctggc gtcaaaagtg gactttatcg ggtcgtgtgg ctctcgaaca 120 ccacgccgcg tgtgaagtgg gtgcaatagt gaaaaaaaag gcgctaactg tactgtcgcg 180 gtaaactgcg ctgaagatac atgtcccgga aacagtgata agtgcttaaa aattgctggc 240 gtgttcttcg attaattggc gagtttcttc gatctgctgc ctcctgcacc acgaggtcac 300 cacgtggcac ggcacggcgc ggcacggcgt ggcttgggct gtattgtgcg aggaggacaa 360 agggccttga ctggatcaaa ggtcatcgta gtctatcgat tgacgaccac gtgagtactg 420 catgtctgtt gtggtggact gattcgtttg ctttgggtgt tcgatcgtca attctgcgct 480 catctcgacc tggttggtag tttgcgcagc gtgttttttc cgcatttttg ggctttgttg 540 agcactgctg gtcgctgtgg gacatggctc ctggtctacg tgcgttgttc caccaagagc 600 ggttctatct ggacacactt accaacgcaa agcagtttgt ggagggttac aaagaggctc 660 agcatagtgg acaattagcg gggtggaagc agcgaatcga tggtcttgat gacaaatttc 720 atgctaaccg tctagccatc gaactgtcgc tggacgatga tgaggataag tccaaatcag 780 acaaggtgac ggcgaagccc gaggatgccg ctgcgcaagc taaagaggac gctgacaagg 840 atgccgctga agaaagcaat cgacttatac gcaagaaatt cgaactggac tacgttcttg 900 tgtacagctt cctggtcaac gagatccaga aacaaactgt ggctccccaa gctgaagtca 960 accctgcctc gactggacgc tcgcgcgtca agctgccgga catcaasctg cccaacttta 1020 atggctcgat cttggattgg atcacgtttc gtgacacatt cgaaagcctg atccactcca 1080 acgccsacct gccgcccatc gataagttca cktacctagt ggggtcgatg acgccgaagg 1140 cgagggagag gattgacaac atcgatgtta ctgctgctaa ctatccactc gcctggsagg 1200 cattggagga gcggttcgac aacaagaagc tcatcgtcaa ggcgtacatg gatgttctct 1260 tcgcgatcga gccgatgcgg aaggagtgct acgagtccct ggcgcascta gtcgacgggt 1320 tcgagaggaa cctgaacatg atcaagaagc tggacgtcgs aacggasggc tggggcgctg 1380 ttctggstca catggtgtgc gcscggctgg atcattctac actgagacag tgggagtcgc 1440 accacagttc caaagacgtt ccggactaca aggagctgat sgagttctta aagggtcacc 1500 tctccgttct ccagtcgctg cctacaaagt cgtcgtcgaa agagtctcac aagcaagaca 1560 caggtcggcc gctgaaatcg aagaccgtcc acgcagtcac tagtccagcg accggcttga 1620 gaagctgccc gttctgctcc aagtccacgc actctgcgtt taaatgcgaa gtcttccaag 1680 ggttgccagt cccgcaacga ttcgaccttg tgaagaagaa gaatctgtgc ataaattgct 1740 tgtcgtcgga gcacatggtt cgcaattgct cgtcaggcac ctgcagggtc aacaagtgcc 1800 accaaaaaca ccacaccatg ctccacctat cagcggacga tcgcccaacg aatcccaaca 1860 aaccccagtt gtcgtcgtcg ctcacatccg cccttgtcgg aaagcccgcc gaacccgtcc 1920 tcctcgcagt cgtcgccccc tctcccccgt cgcgcatccc aaccgccctc tgccacaagc 1980 ctccccgtgt ccaccaactc gcccggagcc cctcgagccc ttcctgccac cgtgctgctc 2040 cagacggcta tcgttaaggt cctcgacttc tacggamatc accagtgggc acgagtccta 2100 ctcgatccag cgtcgcagct gaacctgatc accgagaaca tcgtgcagaa gctgaaactg 2160 acgccgttac aagtgtcacc aagcaatcgg aggagtcgga aactcgatgg tgacatcgtc 2220 gcacgctgtc cacgttcaga ttgggtcaca ctgctcggac ttcaagtccg ttcacgatcg 2280 ttccacattc tgaaaaggat cacccaagat ctacctacca gaactttcga cccaacatcg 2340 tggaaacttc cgacaaacat caccctagca gatcccaaat tcttcgaacc tagctcaatc 2400 gatctactcc tcggcatgga actctactac gacctgctac tagacgggtt caccaagctg 2460 ggacctgaga aaccgattct tcagaacaca gtcttcggct gggttgcctc tgggaaaatt 2520 ggatctggcc aggcggaagg agcgctgaag atggcccgtg tgtgcagtga ccagtctctg 2580 gatcaacttc ttaaccgctt ttgggaagta gaatcctgtc ggtcagcaag cactcagtcg 2640 atggaagaag tcgcgtgcga ggaacacttc gttggaaata cctatcgaga cgagtcaggt 2700 cggtttgtag tcacccttcc gaaacaacca tccgtcctga accaactagg agactcgaag 2760 aaaattgcca caagacgatt tcttgccttg gaacgtcgtc tcgacgcaaa ccctcaattg 2820 aaacaagtct ataccaattt cattgaggag tatcatcagc tggggcacat gcgcgaagtt 2880 gaagatgtca gctcccatcc agctggtccg tcgtactact taccacacca cggagttgaa 2940 aaggcggaca gcaccactac aaaactacga gttgtattcg acgcctcctg tcggacggac 3000 agtggagtcg cgttgaacca ggcactgatg gtcggatcag tggtccaaga cgaccttttc 3060 gccatacgtc ttcgttttcg tatgaatcgg atcgccttaa tcgcggacat tgaaaaaatg 3120 taccgtcaga ttcgcatcca tccttcggac taccaactcc aacgaatcct ttggcgcaat 3180 tcctcctccg aaccgcttcg cacgtttgag ctgacgacag tcacctacgg aactgcctca 3240 gccccgtttt tggccacgcg atgcttgaac gagctgtcta agcaaggcgc ggaggactac 3300 ccacttgcgt cactggcatt gggtagggac ttctacgtgg acgacatgct aaccggagtg 3360 tacgacgaag aggaaggcga agaactctgc aagcagctgt tgtcactttt gccatctgcc 3420 gggttcagtc tacggaagtg ggcatcaaac tccgccgaag ttctctcgaa aatacccccc 3480 gagctgcgcg acgaacgatc cgttctgggg ctggaatccc acacaactcc aataaaaact 3540 ctcggaatcc agtggcagcc agctacagac acgtacagct acgcggttcc aatctggtct 3600 acacagtctg tcatcacacg tcgcgtcgtc gcgtccgatc tagccaagct gttcctaccg 3660 ctcggcctgc tgggtccggt aatagttctt gccaagatat tcgtccaatc tctttgggaa 3720 gaaaacaaat catgggatgc accgttgaag ccagaacagc agcagtattg gtgggaatat 3780 cgcaacagtc tagaagacat cccatcaata tccattccac gatgggtagc gtgcgcgagt 3840 gatcctgtgc tcgtcgagtt gcacgggtac tgcgatgcat ccgaacgtgc gtacggagcc 3900 aacatctaca tacgaaccgt atccagggac ggcactattt cggtgcgact gttgtgcgcc 3960 aaatcgaagg tggcgccttc gggaaagtcc aagaattcag tactactatc cttaccactg 4020 ttggagctct catcggcact tctcctggct cacctgtacc acaaggttgc cgccagtctc 4080 gacctgaaga cgaagccgtt tttctggacg gactcgatga tagttctgcg ctacattcgc 4140 tcaccaccgg cccgctggaa aaagtttgtt gcaaaccgag tggcggagat ccaacgactc 4200 accgccggag ggatctggtc gcattgccct ggggccgaaa atccagcgga cataatctca 4260 cgtggaatga gccccgcgga gctgagagac acacccgcgt ggtggatctc acacgagtgg 4320 cttaatcgag agtcccgatt ttggccgccg ataaatcagc cccttcctga ctacctcccg 4380 ggcatcgccc aggcggagcg tgcggtagcg ctcccggttc aagcagtaaa accaaacgaa 4440 ctcttcgcga tccagtcatc gttccctctc ctgcttcgtc ttgtcgccta ccttcgtcgg 4500 ttccagcaca acgcgtcctc tcgagatcgc aacaaccgga ggatcggaca cctaacaact 4560 ctcgagctga ctgacgcgac gaagattctt gttcgaatcg cacaaagaga agccttcgcc 4620 cgagatctag ccgaaattga atccagtgga caagtaagcc atcgttcgga actgaaaaat 4680 ctggcaccta tcctggtcga tggggtcctc cgtgtgggtg gtagactaaa acacgcagcc 4740 atctccgaag ataggaaaca tcccatgatc ttgccagctc gtcaccccct gaccgagcgc 4800 attttggtgc actaccacga gaaacatctc cacgctggac cgcagcttct cgtagcctgt 4860 gtcagagaaa gattctggcc ccttcgcgtg cgcaattcgg cacggactat agtgcatacc 4920 tgcgtgaatt gctatcgttg caagcccacc accagcgagc agctcatggg agacctgccc 4980 caacaaagag tcacacccac gctgcctttc ctcaacaccg gcgttgacct gtgcggaccg 5040 ttccagtatc gcaaggcgcc gagagcagca ccaactaaat gctatgtggc cgttttcgtg 5100 tgcctcgtca caaaggctgc ccacatcgaa cttgtctacg accttagtac agccgccttc 5160 ttagcggccc tgcacagatt tatcgctcgt cgagggaagc cgaagctcat ccagtgcgac 5220 aacgcaacga acttcaaggg agcagaaagg gaattggaga agttacgaaa gcagttcatc 5280 aaccagcaac tccaagccgc agtggtcaac cgctgtgcag acgacggcat cgactttaag 5340 ttcattccgc cccgcagtcc caattttggc ggattgtggg aagcggcggt gaagtcgttt 5400 aaaaaacacc tgcgtgccac aattggaaat tcggtactgt cgcaagacga gttcgtcacc 5460 ctactggcac ggattgaagc ctgcttaaac tccagaccgt tgactccgct ctctgccgac 5520 ccaaatgacc tggaggtctt gaccccgggc cacttcctca cttttcgtcc gctcacttcg 5580 cctcccgagc ccgacctgtc cgagattcca cggaaccgtc tcgatcgctg gcaggagaat 5640 caagaactac tacggcgcat ttggaagcgt tggactatcg attacctctc gggcttacac 5700 ccacgcacga agtggacgca gaagcgtgat aacatcaacg tcggcacgtt agtgctactc 5760 aaggaggata atctgccacc cctgaagtgg cgctacggtc gtgtgattcg tgtctgtcgt 5820 ggcgatgacg acaacatccg tgtcgtcgtc gtacgcaccg ccgacggaga gtacacacgg 5880 tccatctcga agatctgcgt cctgccgtta cgccaaccca tcgctgatgt cgctggcgcg 5940 gccaacccga accccgagga ctaaatcttc catcgcaacg ctgcggcgtt gcgcaccggg 6000 aggcctacgg gcctccaacc cccgtaagtt gttttaatat tgaatttgca aaaagctcat 6060 gaatttttcg tcccccggca tgttttggct catcgatggc gactccaacc cacaaccatg 6120 agcgcgccca acgcggacga tcaccgagca tcaaccgaga acgcgcagag gatctcccgt 6180 gacgtcacca ggagcagcaa tttggaagga gaggagagcg gtcgatagtc gaagcagagc 6240 tgctgtggac aacatcaaca aagatgacct tggccttctg cgacgaggca gaagggtgcg 6300 atcgcgtgta ttggttgaaa gctgaccctt tcaacggggg ccggca 6346 // ID BEL-146_AA-LTR repbase; DNA; INV; 883 BP. XX AC AAGE02027562; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-146_AA_; KW BEL-146_AA-I; BEL-146_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-883 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027562; Positions 284 1166. XX SQ Sequence 883 BP; 350 A; 144 C; 150 G; 239 T; 0 other; tgtcgcaaca ttcattgcgt cgcacctaca catccaatcg ccgttctcca gacttctgtt 60 gaagtgaatt gtctctatag cgaagcggcg ctgcgcgtca accttaaagt cagaatgaca 120 gttagggaag atgtcagaaa gaaaggagcg taaaagaaga cagatttgga agtcagacag 180 ataacaagca ggcaaagtgt ggtctagtga gtgttggaaa tttaatacgt tccaaattta 240 ttagtctact taaaattaaa acaaattagt aatgaattag taaaagttaa tttgtttaat 300 atgaagaaag gtgttaaata gagcaaaaat tacaaaatta gtacggatgt tagtataggt 360 tagttcatgc aaatacagta attagtgcgt gctcttaaaa ctaatggaat cttaattgaa 420 actagatcta cagctactat tagaaacgag tcagaaacgt aaacaaagtt aacacacaga 480 caaagaaaca taaaaacagg taaaatattg aaatattgtg aaattaactt aacctaacct 540 aaaattacac cttatagcat tcctatcacc gatctaagaa ctaaacgttg gaacacgtag 600 aaacgggaca ctaaacgtaa gtcaatcttt ctttttaaaa atgccccaga ttataccaac 660 aaatactatt gacaataatc acatgaaatc aagagaaaca aaatgtacaa aatgacacct 720 aaattcctaa aactactatg tacaacacaa attacataaa aattatgatt gtctctcgtt 780 aatagggatt ttttaacgtt gtcgatcgcg ttgaatcgca acttgaataa aacgtttaag 840 aaatcggaaa actgtctgtc aagtttcgtt gaaaattgca aca 883 // ID Gypsy-214_AA-I repbase; DNA; INV; 6234 BP. XX AC supercont1.9; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-214_AA_; KW Gypsy-214_AA-LTR; Gypsy-214_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.9; Positions 1367470 1361237. XX CC Positions [3512-4048] - Reverse transcriptase CC Positions [5150-5617] - Integrase core CC 'GTGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3044..5746 FT /product="Gypsy-214_AA-I_1p" FT /translation="MFSTLMADETYRREVFNVKQGTDRSLKGYASSSTIHV FT AATFEAYMFITNDRPMLLEKFYVVNERQALLGRFTAARYSILMLGVKVPLT FT TALKVDSSSTRGEVATIDIYEKFPKFNVPPVRIEYDKSKPPCRNVFMNIPQ FT AVKPLVEDRLRKLEMTDIIERVTDDMDASFCSSMLVVPKGKDDIRLVIDLR FT GPNRYVYRSPFAAPTLEGILAELHGAKWFSTIDLSNAFFHIELDRESRHLT FT NFCTEFGMFRFVRLPFGLCNAPDVFQETLQRKILGNCKGVKNYQDDVLVFG FT KTKEEHDENLEAVLDRLRNHNVKLNESKCVFASQTVTFLGFTITPDGWKIE FT EGKLDAIENCRRPETCSEVKSFLGLITFVDCFIPNRADLTQYLRALANADK FT FYWSDNEEQEFQFLRTRALKTIKTLGYYSQTDPIELYVDASPVGLGAVLVQ FT HDKDKTARIIACASKSLTATEKRYPQTHKEALAVVWGVERFSTYLLSRYFV FT IRTDAEANQFIFNGTHRLGKRALSRADAWALRLQSFDFSISRIPSNMNIAD FT ALSRLIEQTQDPIPVDVDDENHFLYALDVGYMNITWDEIERHSEEDPELKN FT VQSALRCNFWPPDLRKYEAQRKNLRFLGFLLFKDDRAVLPESLRQMALQSA FT HGGHVGIVAMKKILRQFFWWPGMSTAVERFVKGCEVCMQLAKKNPPIPLSS FT RVLPEGPWEILQIDFLKVPGFGTGEFLVVVDTYSRYLNVVEMKQTDADTTN FT AALHGVFKHWGFPLTIQSDNGPPFQSANFTKHWENKGVKVRKAIPLCPQTN FT GAVERQNPGIIKALAASKLEGSNWRHALETYVHHRNTLVPHSRLGITPFEL FT MVGWRYRGTFTSLWNPSSTDLDRTDVRELDDDAKLKSKKDADQYRNAR" XX SQ Sequence 6234 BP; 1955 A; 1246 C; 1478 G; 1555 T; 0 other; ttggcgcaga caacgctccg agagggtaag tattcgtatt gcagcggatt ttggaaatta 60 ctcccaaaat actacgtgaa aacatgaatt tgtttcaaga catttcaaca tggccgccca 120 ggattgatgt gatagtaaaa ttccaagaga agggaaaacg tttcagtagg ctattcaata 180 aggttggaga gtaggctgaa accaaaataa aaaccatact ttatcagtag gctcaaccaa 240 atagggtact gagtaggctg aaaagttaat tggcaaaatg gctgcctgta aatttgagca 300 catggcggtg gttcaagaga agaaaaattg gttcagtagg ctattcaata aaattggata 360 gtaggctgaa accatggaaa gtaacacagc aaaataccgt ctgcgagctt gagcacatgg 420 cgatcatgtg ataatccaat tccaggaaaa ggagtaatgg ttcagtaggc tattcaattt 480 aattgaaaag taggctgaga ccataacaaa acactatcag taggctcggc catataaggt 540 tctgagtagg ctgaaaagtt aatttacaaa atggctttga tgcgataata taatttcaag 600 agaaggaaaa ctggttcagt aggctattca atgaaattga atagtaggct gaaaccatga 660 aaaaaaacct caaccaaacg gagcactgga gcatcaaacg gctgaaaagt aacaaagtgg 720 ctaatttgca gtaggctcac tgtctgtaaa actactgttt attgaacaca attcaatatt 780 cgaagatatt taaatcgact tactaatcta gtaagtatgc taaccatact aagaaaccat 840 tcgtgcgcaa tatagtaggc tcatttatcg gtaagataga taaagttgaa tgtggattgg 900 ctatccgaat tatgagcatg atcggtaata atgccaagga ataaagaaaa taagtagaat 960 gaacaaattc aataaaaaat gcagatcagt tgtggttatt ataaacttac gagaaggtgt 1020 caattctatg ataaagaatt tagtaaattg tgttgcacct ttgaagaatg atcagtacga 1080 gtaattgaac tatcattttt tgattttcag ggttgcgtca acaaaccaga gaaagaaaaa 1140 gatggaattc aacgatcgga gttcagtagc tgtaaaacgc aagaaagcat tactagttcg 1200 caatcgttcg acaaaaagtg ttcaatccaa aaagcaaact ttgcgtaccg gattcatgga 1260 acggcaacgc attgcaaaat tggagaagga actggaagaa gaaaaacttc gcaactctct 1320 gttggtcgac gaaatgaaca aaattcgcca gaaccaggag gtgaataacg gatcttgtat 1380 tgagacacaa tcggcagata ctggtagaga agtcgtcgaa cagagttcaa ttgattttga 1440 aggaatagga ccgtcaagta cccgacgctc aaaagacatg tcgagtttgg aggaatccaa 1500 attcatggct tctatcaatc agctgtcggt ctcatcgata agcgtgcccg aatgccggcc 1560 agcagttgaa aacgaagaca tcagccgaca cactttcgaa cagtggcgcg atctgctgat 1620 tgacacaatg tcactagctg gcattacaga tgaagctaca aaattcacaa tattcaaagt 1680 caaggcagga tttcggctgc tcgatatcta tcgtaatgct aaagcagatg aaaacgcgcc 1740 ggacgcaact ttgtttccat tcacgcatgc tttgtatcga ctcaaagctt atttcggatc 1800 aggtaacatg atacttcatg aatattgcga acaaaaatgt gattattggt ttatttcgat 1860 taggctcaga tattatgttg cagcggcgtc ggctagccct aatggaacag aaggtcggtg 1920 aatcggattt ggcattcgtg acacgcgtcg gttcggctgc cagactttgc gattacggca 1980 aggacaagga acttgaggaa actgttggta caattgctga gcatgcccat agcaaggaag 2040 tccgcacgat ggctttgaaa cttttgagca ggaatggtac ctttaccgac ttggtggaca 2100 aggttaggga aattgaggcc atccgattga acgaacagtt ttttgcccag aaacattcct 2160 tgaaggatga agttgtggta gccccggtaa gagccgactt tcctttgaga agtaatgctc 2220 agcagaggta tcaaggcaga acggtcactc aacgaggcta tcgtggacat ccagccgcag 2280 caaggcgacc ggcaaatcaa ggcttccaga gatcaggaat gagaccaggc tccaatccac 2340 cacgaagatt tgtcgcgcca cagtacaata aatgttggag gtgtaccagc gtgtaccatg 2400 tccctgctaa gtgccatgcc atcaacaaag actgcagaaa ctgtggacgg cttggccaca 2460 ttcaggtggc atgtccctca atcaaaacgt tcaatccgaa cagctcgaca gttccggagc 2520 cagaggcgga gcctgctcgg caaattattg cagttgtcga aaaacgcgaa gaggccacgc 2580 cgcaggaagt aagtgagaat cttgattttt ctaccgaacg ccaggattga gtttgattag 2640 ttgaacgttt cgagaccagt ctgatcatta ttttatgtat tgtattcctt taattcggat 2700 atccatacag gaatctgaaa acaaacaaaa cagttaaatt tatcattaaa taccttgtca 2760 agcaaatact actctaagat tgtcataagc ttaaggatac tctctttcga actaacgaat 2820 gaatttgaat ttgaatgatt cttgaatcaa agtgcaaata aacgataata acgactattc 2880 aatctcatta cattgacaga tgacactgcc aacgcaaaac atttctacga ttgctatgat 2940 tgcgtccgat aatgacgatg gtttcattgt ggcaacagta gcagggatgc catgtcgttt 3000 ctgaattgac tcaggagcgc aagtcaatac cttcacggtg gacatgttta gtactttgat 3060 ggcagatgag acttaccgta gggaagtttt caacgtcaaa caagggacgg atcgttctct 3120 gaagggatat gcatcgagca gcacaattca tgtggcggct actttcgaag cctatatgtt 3180 cataactaat gataggccta tgcttctgga gaaattttac gtagtaaacg aacgtcaagc 3240 actgttgggt agatttacag cggcaaggta cagcattctg atgctcggcg ttaaagttcc 3300 tttgactaca gctttaaagg ttgattcgtc atcgacgcgt ggagaagtgg ctaccatcga 3360 catatacgaa aagtttccca aattcaatgt gccaccggtt cgaatcgagt atgacaaatc 3420 gaagcctcca tgccgtaacg tttttatgaa catacctcag gcagttaaac cactcgtgga 3480 ggatcgactg cgcaagttgg aaatgactga cataatcgaa cgtgtgactg acgatatgga 3540 cgcatccttc tgttcctcga tgctggtggt accaaaaggg aaggacgata ttagactggt 3600 gatagatctc agaggaccga atcgttacgt gtacaggtca ccatttgcag cacccaccct 3660 agaaggaatc ttagccgaac tgcacggtgc gaagtggttc tcaactatcg atttgagtaa 3720 cgcattcttc cacattgagc tggatcgcga gtcacgccac ttgacaaatt tttgtactga 3780 atttggtatg ttcagatttg tccgccttcc gttcggtttg tgtaacgcac cggacgtgtt 3840 tcaggaaaca ttgcagcgga aaatactcgg taattgcaag ggcgtaaaga attatcaaga 3900 tgacgtactg gttttcggaa aaacgaagga agagcacgat gagaacttag aagcagtatt 3960 ggacaggcta agaaatcata acgtgaagct gaatgaatcg aaatgcgtgt ttgctagtca 4020 aacagtaaca tttctcggat ttaccataac tccagacgga tggaaaatcg aagagggaaa 4080 gttagacgct attgaaaact gccgacgacc agaaacttgt tcagaagtca aaagctttct 4140 cggtctcata acgttcgtgg attgctttat cccaaatcga gcagatttga cccagtatct 4200 gagagctttg gcgaatgctg acaagttcta ctggtccgat aacgaagagc aggaattcca 4260 attcctgaga acaagggcac tgaaaaccat taaaacgcta ggctattaca gccagacaga 4320 tcccattgaa ctttacgtcg acgcttctcc ggttggactg ggagccgtct tagttcaaca 4380 tgataaggac aagacagcac gaattattgc gtgcgcctcc aagtctctga ctgcaactga 4440 gaagaggtat cctcagactc acaaagaagc actcgcggtg gtctggggcg tagagcgctt 4500 ctctacgtat cttctgagca gatattttgt tatcagaacg gacgctgaag caaatcaatt 4560 cattttcaac ggcacgcatc gtttgggaaa gcgagcacta tcaagggcgg atgcatgggc 4620 cctccgtcta caatcattcg acttctccat ttcgaggata ccaagtaata tgaacatcgc 4680 ggatgccttg tcaagactca tcgaacagac acaggaccca attccagttg acgttgacga 4740 cgagaaccat ttcttgtatg cgctcgatgt gggatacatg aatataacat gggatgaaat 4800 tgaacgacat tcagaagaag atccggaact gaagaatgta cagagtgctc tgcggtgcaa 4860 cttctggcct cctgatttgc ggaagtacga agcccaacgc aaaaacctaa ggtttttagg 4920 atttctcctg ttcaaggacg accgggctgt tttaccagaa tcgcttcgac agatggccct 4980 tcaatcggct cacggtgggc atgttggaat cgtcgctatg aaaaagattt tgagacaatt 5040 tttctggtgg ccaggaatgt caactgccgt tgagcggttc gtcaaaggat gtgaagtgtg 5100 tatgcagcta gctaagaaaa atcctcctat tcccctatcg agcagggtcc taccggaagg 5160 gccctgggag atacttcaaa ttgatttttt gaaagtacca ggattcggaa caggagagtt 5220 tcttgtagtc gtagatacgt actcaagata cttaaatgtg gttgagatga agcaaacaga 5280 cgctgataca acaaatgccg cgctacatgg agttttcaag cactggggat ttccattaac 5340 aatccaaagt gacaacggac ctccgtttca aagcgcaaat tttacgaaac attgggaaaa 5400 caaaggtgtc aaggtgcgta aggcgatacc gttgtgccct caaactaacg gtgccgtgga 5460 aagacaaaat ccgggcatca taaaagcgtt ggcagcttca aagctagagg gatctaactg 5520 gcgtcacgct ttggaaacat acgttcatca tcgaaacacc ctagttcctc attccaggtt 5580 ggggatcact ccctttgagt tgatggtcgg gtggagatac aggggaacct tcactagctt 5640 gtggaatcca tccagcacag atctagaccg aacagatgtt cgcgagctag atgatgatgc 5700 aaaattgaaa agcaagaagg acgcggatca atatcgtaat gctagatagt ccgatatcaa 5760 agtcggagat atggtacttc tagcacaaca caaacggagc aaagccgatc caaatttttc 5820 cgaggagcgt ttccaggtta ttgctagaga tggggcaaag gtggtgttgg tgagcccaaa 5880 tggaatccaa tattctcgca gcgtgaatga catgaaaaaa gcagcgatgt cttgttcggc 5940 atcgacattg cacaaatcct caagtaacca gcaggttcag atacaaaatg cagacggact 6000 actggagctc cctgtgatgg acaatttcaa tgatgcatcg agcgtatatg attctaattc 6060 caggcgtaaa gaggatacag gtggcagatt tttgcgtcgt cgaaatgatt tgcagcgacc 6120 agtccgattc gatgaaaact tcgtctacca cattttcggt tagattgcgt gataattcga 6180 tactgcgtta acaggaacaa gtaagaacgc ctaaagagta gagaagaaga atga 6234 // ID Copia-2_BM-I repbase; DNA; INV; 4040 BP. XX AC nscaf3015; XX DT 19-MAR-2010 (Rel. 15.04, Created) DT 19-MAR-2010 (Rel. 15.04, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; LTR-1_BM_; KW Copia-2_BM-LTR; Copia-2_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4040 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(4), 585-585 (2010). XX DR Genome; nscaf3015; Positions 2283287 2279248. XX CC Positions [1450-1950] - Integrase core CC 'AAAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 157..2196 FT /product="Copia-2_BM-I_1p" FT /translation="MNNQMALIEKLSGRDNFATWRFAMQTYLQHEELWDCI FT EDEVKDTKRDIKAKTKIILSVDPINYVHIQEAKTAKEVWHKLTSAFDDSGL FT TRRVGLLRDLCNTSLAGCQNVEEYVSKIINTAYKLRNIGFKVDDEWLGTLL FT LSGLPESYQPMIIAIESSGMKISSDSVKAKLLQDVRPPENDSKAFAVTKKF FT HKNSKKQSKGPRCYTCNKYGHKSPECKNKTKDTQKASNSYAAVFVASTTQD FT DAWYVDSGASSHMTKHRGLLSNENDTPSVKTIRTADNKILNVQCSGQVSLN FT VCNKKGQENKLLFRNVLCVPELATNLISVSQIVRSGGQVRFNQNGCAIINR FT GQIVATASIINNMYRLNMPGRGYACMSDVDAEDIYLWHQRMGHLNFNSLKK FT MSDNVDHVIFSEKSENLSCITCKEGKQTRLPFKSECKRAQTPLQLVHSDIC FT GPMETQTLGGSKYFITFLDDYSKKIYVYFLRNKSDALEKFKEFRNEVENQL FT EARIKVLRTDNGLEYVNNNFSDYLKSCGIIHQTTTPYTPEQNGTAERMNRS FT LIERAKCMLLNANLPKLYWADAVHTAAYIINRSPTKSLSFKTPEELWSGQK FT PYVKHMRIFGCEAMVHLPQEKRRKWDPKAQKMIFIGYCDHTKGYRFIIPNS FT RKVIKSRDATFLESTVKRDYVPIELSQKKHQ" FT CDS 2196..3974 FT /product="Copia-2_BM-I_2p" FT /translation="MNIDDSKESETSEMTDDSTSTGSYDTVCDTKSSDSDY FT LPDRSYDSLPISNITLRRKKKVQQSSDDPEENTYLCQDEVMLLDVPETYKE FT AVNSIHKEKWMRSIDEELHAHQKNNTWTLVQRPANTKVINCKWVFRIKDEP FT TGPRYKSRLCAKGYAQTKGIDYNETFAPTVRYDSIRILLSVAAQNNLKVIQ FT LDVKTAFLYGELEEVIFMSPPEGLPCEENMVCKLNKSLYGLKQAPRCWNSK FT FDTVLKKYGFVNTKADQCVYVGLVNNKKCYFCLYVDDGLLFSTDESALLEL FT TKELSIIFEIKVLNKPSNFVGMQIEQFSNCIFIHQTKYIKQMLLNSYMDNS FT NPNSIPVDPHVKLQKGNEEPNKSIPYRQAVGSLMHVAIVSRPDVMFAVSLV FT SRYLNCYNQSHWNAVKRIFKYLKDTIDHGLCYVKTTEPSETTGYSDADYAN FT DLDNRRSVTGYVFIKNGAAITWSSQRQQTVALSTTEAEFIAACAATKETMW FT IKQFLSDISEYKQNSMTLHLDNQSAISVIKNINFHKRCKHIEVRYHFIKEK FT YHQKIIALEFVGTDNQYADIFTKALCRDKFAFLRSKLGMCSYNSLV" XX SQ Sequence 4040 BP; 1463 A; 697 C; 784 G; 1096 T; 0 other; ggttatgggc ccagtaaagc ctataaataa taaaagtgct aaataaactt taaaatcagt 60 gatagacagt tttgcagtga acataacctc tcaataagaa gccggcatat tgtttttttt 120 ttaactaaaa tccaccgaat atcattatca atcacaatga ataatcaaat ggcattaatt 180 gaaaagctat caggccgtga taactttgca acttggcgat ttgcaatgca aacatacttg 240 caacacgagg agctgtggga ttgtatagaa gatgaggtaa aagatacaaa acgagatatt 300 aaagcaaaaa caaagattat tctctcagtg gatccgataa actatgttca catacaagag 360 gcgaagactg cgaaggaagt ttggcacaaa ttaacatcag cattcgatga ctccgggcta 420 acacgccgag ttggtctact gcgtgacctt tgtaatacat cacttgctgg atgccaaaat 480 gtagaggaat acgtaagcaa aattattaac actgcctaca agctaagaaa cataggcttt 540 aaagtagatg acgagtggct aggtacactg ctgctctccg gtctacccga atcttaccaa 600 cctatgataa tagctattga gagttccggc atgaaaatca gttcagactc agtaaaggca 660 aaactactac aagatgtaag accacctgaa aatgactcaa aagcatttgc agttaccaag 720 aaatttcata agaattccaa aaagcaatca aaaggaccaa ggtgctacac ttgtaataaa 780 tatggacata aaagtccaga atgtaaaaac aagactaaag atacacaaaa agcaagtaac 840 tcatatgctg ctgtttttgt agcatctacg actcaagatg atgcctggta cgtggactca 900 ggtgcatcct cacacatgac caaacataga ggtttgctgt caaatgagaa tgacacgcct 960 tctgtaaaga caattagaac tgcagataac aaaattttaa atgtgcaatg ctcagggcaa 1020 gtgagcttaa atgtatgtaa taagaagggt caagaaaata aattattatt cagaaatgtg 1080 ttatgtgttc ccgaactagc aactaatttg atatcagtta gccagattgt cagaagcggt 1140 ggccaagtaa ggtttaatca aaatggctgt gctattataa acagaggtca gatagtagca 1200 acggccagta tcatcaacaa catgtaccgg ctaaatatgc ctggaagagg ctatgcatgc 1260 atgtctgatg ttgacgcaga ggatatttac ctatggcacc aaagaatggg acacctgaat 1320 tttaatagtc tgaaaaagat gtctgacaat gtagatcatg taattttctc tgaaaaatca 1380 gaaaatctgt catgtatcac ctgcaaagaa ggtaagcaaa cgaggctgcc ttttaaaagt 1440 gaatgcaaga gagcacaaac ccctcttcaa ttggttcact cagacatatg tggtcctatg 1500 gaaacacaaa cactcggtgg atctaaatac ttcataacat tcctggatga ttattcaaag 1560 aagatctatg tatattttct ccggaataag tcagatgctc tagaaaagtt taaagaattc 1620 agaaatgaag tcgaaaatca acttgaagct cgtataaaag ttctccggac agacaatgga 1680 ttggaatatg tcaataacaa tttttctgat tacttgaaga gttgtggaat tatacatcaa 1740 acaaccacac catacactcc agagcaaaat gggactgcgg aacgaatgaa cagaagtctt 1800 atagaaagag caaagtgtat gctcctgaat gccaatttac ctaaactata ttgggctgat 1860 gcagtgcaca ctgctgctta tatcataaac cgttcaccaa caaagtcttt gtcatttaaa 1920 acaccagaag agttatggtc tggacagaag ccatatgtga aacatatgcg tatatttggc 1980 tgtgaagcta tggtacactt accacaggag aaacgcagaa agtgggaccc aaaagcacag 2040 aagatgatct ttattggata ttgtgatcac accaaaggct atcgattcat aataccaaat 2100 tccagaaaag tgataaaaag cagagacgct acattccttg aatcaactgt caaaagggac 2160 tatgttccca tagaactatc tcaaaagaaa catcaatgaa tattgatgac agcaaagaaa 2220 gtgaaacatc tgaaatgaca gatgattcta cttcaacagg ttcatatgat actgtatgtg 2280 acactaaatc atcagacagc gactacctac ctgataggag ttatgatagt ctccctatct 2340 ctaacataac tctacgccga aagaagaagg tgcaacagtc ttctgatgat ccagaagaga 2400 atacatactt gtgtcaggat gaagttatgc ttttggacgt tcctgaaaca tataaagaag 2460 ctgtgaactc aatccataaa gagaaatgga tgagatctat agacgaggaa ttacatgctc 2520 atcagaaaaa taatacatgg acattggttc agagacctgc taataccaaa gttataaatt 2580 gtaaatgggt attccgtata aaggatgaac caaccggacc aaggtataaa tcaaggctgt 2640 gtgccaaagg atatgcacaa accaaaggca tagactacaa tgaaacattt gcaccaactg 2700 tgaggtacga ttctatacgt atattattat ctgtagctgc tcagaataat cttaaagtta 2760 ttcaacttga tgtgaagaca gcattcctgt atggagagct ggaggaggtc atattcatgt 2820 caccaccaga aggcctacct tgtgaagaaa acatggtatg taaacttaat aaatccctct 2880 atggtttgaa acaggctccg aggtgctgga acagcaaatt tgatactgtt ctgaagaagt 2940 atggctttgt caacactaag gcagatcaat gtgtgtatgt tggtctagtt aataataaga 3000 aatgttattt ttgcctatat gttgatgatg gattgttatt ctcaacagat gaatctgcat 3060 tgcttgagct cactaaagaa ttgagcataa tttttgaaat aaaagtatta aataagccta 3120 gtaattttgt gggtatgcaa atagagcaat ttagtaactg tatattcata catcagacta 3180 aatatattaa acaaatgcta ttaaattctt atatggataa ctctaaccca aacagtattc 3240 cagttgaccc tcatgtaaag ttgcaaaagg gtaatgaaga acctaataaa agtataccat 3300 acagacaagc ggtcggctcc ctgatgcatg tggctatagt gagtcgtccc gatgtcatgt 3360 ttgctgtcag tctggtgagt cgttatctaa attgttataa ccaaagtcac tggaacgcag 3420 tgaagaggat tttcaaatat ttgaaagata caatagacca tggcctgtgt tatgttaaaa 3480 ccacagaacc gtcagaaact acaggctaca gcgatgcaga ctatgcaaat gatctggata 3540 accgtagatc agttactggc tatgttttta tcaagaatgg agcagctata acttggtcta 3600 gtcaaagaca acagacagta gccctgtcta ctactgaagc agagttcatt gctgcttgtg 3660 ctgctactaa agagacaatg tggattaaac agtttttaag tgacatcagt gaatataaac 3720 aaaattcaat gactcttcat ttggacaatc aaagtgctat aagtgtcatt aaaaatataa 3780 actttcacaa acgatgtaaa cacattgaag ttagatatca ttttattaaa gaaaagtatc 3840 atcaaaagat aattgcatta gagtttgtag gtacagacaa tcagtatgca gatatattta 3900 ccaaagcatt atgtagagat aaatttgctt tcttaagatc taaactggga atgtgttcat 3960 ataattcatt ggtttaataa ttgtcttaat ataaaacaat agtatttact tttattgtta 4020 aggcttaaat tgaggagaag 4040 // ID MuDR1x_MH repbase; DNA; INV; 2095 BP. XX AC . XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE MuDR-type DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR1x_MH. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-2095 RA Jurka J.; RT "DNA transposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1348-1348 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(212..520,556..885,1323..1649) FT /product="MuDR1x_MH_1p" FT /translation="MASFINNKLVYEGYIYVKCKNGKDGKLYWRCENWRSG FT KCGGTSMTDKNEVVTVGNQHNHGPSPTRIELARIKDRINQAAISSALTPRA FT IVNSQLAGISDQAKVFKTALPKLRNLEKTVGRKRYADGQQVPVPHLLSEIN FT IPEQLRRTKTDLNEDFIMADTGPEDPNRIIVLASRTDVARLASCDVWLCDG FT TFKSCPQLYYQLWVFLYFNYLNDNLVDTRNWFRQLAQIFLDVYGDGDEDGP FT YVAFVDYMEKTWVGREFVTPRFSLEMWNNRGITLDQMPRTTNSAESWHNSF FT SSIFHRHSPNPYKLVRALLDEQVYFINSS*" XX SQ Sequence 2095 BP; 630 A; 332 C; 420 G; 713 T; 0 other; cccagatgat cgaaacccaa acgatcgaca cccaaacgat cgaagcccta gtgatcgaat 60 taccctaatg atcgaaacct aaatgatcga aacccccaaa tgatcgaata aaaaagatca 120 acactaaatt tttttatttt ttgtttgtat tctcaaatta tttttctgtt ttacttctaa 180 tcattttaaa aatctagctc agttattaaa aatggcttca tttataaaca acaaattagt 240 ttacgaaggg tacatttatg ttaagtgtaa aaatggcaaa gatggaaaac tttattggag 300 gtgcgaaaat tggaggagcg gcaagtgtgg tggtacttct atgaccgata aaaatgaagt 360 tgtcacagtg ggaaatcaac acaatcatgg tccatctcca actcgaattg agttagccag 420 gattaaagac agaattaatc aagctgcaat ttcatctgca cttacaccga gggcaattgt 480 taattcacaa ttggctggaa taagtgatca agccaaggtt tgatttttat ttcttgatat 540 aaacttaatc aataatttaa gactgcactt ccaaaattga ggaatttaga gaagactgta 600 ggaagaaagc gttatgctga tggacaacaa gtacctgttc cgcatttgct gtctgaaata 660 aatattcctg agcaattgcg gcgaacgaaa actgatctta acgaagattt tattatggcg 720 gatactggac ctgaagatcc caatcgtata atcgttttgg catcgagaac ggatgttgcc 780 cggcttgcta gttgcgatgt ttggttatgt gatgggactt tcaaaagttg tcctcaattg 840 tattatcaat tatgggtttt tctttatttt aattatttaa atgattaatt atttaaaggt 900 cattcatggt cgatttcgtc aggcagcagt attgcctttc atatatgcat tacttcccag 960 caaaacacgc gagtgctaca ggagggctct tgatttggtt ttgaaagatt gatgaagtta 1020 atcttggtgc aagaccaaat gtggttgtaa tcgattttga gaaagcggaa gagcttgctt 1080 tgcgaacagc cttaccagaa gcgacaattc atggatgttt cttccatttc aaagttttag 1140 ttaaaagaaa attttctcaa gaaattttag caagcattat ggaggaaaat tcaagaacta 1200 ggatgggcag ctaaatatca agatgaagta gaggatggat ttaggttaca tttaaaaatg 1260 tttgcggcac tcacttttgt taatacaggt tttttcttat taaacttagc ggaccttctt 1320 gaaatttagt ggatacccga aactggtttc gtcaattggc acagattttt ctggatgttt 1380 atggcgatgg tgatgaagat ggcccttatg tcgcttttgt ggactacatg gaaaaaacat 1440 gggttggcag agagtttgtt accccgcgat tctctttgga aatgtggaat aacaggggca 1500 taactcttga tcaaatgcca agaacgacaa attcagccga atcatggcat aactcgttct 1560 ctagcatttt tcaccgccat tcacccaacc catacaaact tgttcgggcg cttttggatg 1620 agcaggttta ttttattaat tctagttaaa ttaaatatta attttaaaaa aggtgcggtc 1680 tgatgcgatt tctattcgaa ttttggctgg agaaaatatt ccattatttt ctagagttga 1740 atacaaacgt gcaaatgaac gtcttttgaa tgtgctcaga ggaggtggag gtttgcgaaa 1800 tcctattgaa tttcttacag cttgttccca ttacatacgt ttctgattta ttatttttga 1860 tttaatcttt ttaatgttta accgaaattg tcccaaaaaa ttattaatac tggtcaattc 1920 tttttttaat tgttattttt gtttgatatt tgatatcttt attaaaattt agtgttgatc 1980 ttttttattc gatcatttgg gggtttcgat catttaggtt tcgatcatta gggtaattcg 2040 atcactaggg cttcgatcgt ttgggtgtcg atcgtttggg tttcgatcat ctggg 2095 // ID DNA8-110_AP repbase; DNA; INV; 689 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-110_AP. XX NM DNA8-110_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-689 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2048-2048 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 689 BP; 214 A; 111 C; 110 G; 253 T; 1 other; gggcggttcc attttcctcc aaaaacgaga taccgttttc ctccactgtc cattttcctc 60 cacaaaaaaa aaaggttggt ttccattttc ctccaaatta ttttagactg taattttgtc 120 tattttcctc caatatttta gccagtgttg tagtattacg ttccgtntta taaatatata 180 gagtaaagct aggatcacat tttgcgctta tcggttgatg attacactct ggtatcggct 240 atctatatca gctatcgatt aagcctaata aagatacgct catataacat tcgggcatat 300 acgtatatac ttatttatgt tatattgtgt atcagttgtc acttattatt atgattacac 360 acaaaacgtt tgttaattcg tgatggttac aaatattttt tagcatatga gttaaaatct 420 ggattatcac gttggcgatg ttgtttgact acttgtcaag cagttatatt atacactaat 480 aaaactgatg aaatagtagc cgatgaatta cacaaaaatg tacatccaaa tcatttgccg 540 atcaagccta aaaaaaaata aatcgatttt ttttgctatg cataaattat ttgtattaac 600 tattttatta ttttattcat tatttattgg aggaaaatgg acagtggagg aaaacggtat 660 ctcgtttttg gaggaaaatg gctacggcc 689 // ID Gypsy-7-LTR_HM repbase; DNA; INV; 305 BP. XX AC . XX DT 25-DEC-2008 (Rel. 13.12, Created) DT 25-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-305 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1981-1981 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 305 BP; 124 A; 45 C; 44 G; 92 T; 0 other; tgtaatgatt gttacgcaaa acaattcaga atagcaatta gcgattgcaa aacaagcaat 60 tcccttttgc gaattgcaca aaaacttata acgaaagtat tcccttgaga gaattccgca 120 cgaggaattc tcgcttaaaa gccagtgcgc tgaatttaat tacagttgaa gtttcgacgc 180 gaaactatac agagttttat atttaaacta gttgtgataa tttaatatga aaatattgaa 240 ataaatatta aataaaaatc atagtcgaat tctctttata ataaatataa agacaaaaca 300 ttaca 305 // ID TTAA28_AP repbase; DNA; INV; 316 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA28_AP. XX NM TTAA28_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-316 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2098-2098 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 316 BP; 90 A; 59 C; 44 G; 123 T; 0 other; ccctttgagg agcatggacg tatatatacg ttttaaaata tactgtcaaa aagagcatgg 60 acgtatatat atacgttttt accgcgaatt tcttttaatt tctttgttta tgtttcatta 120 ccgtaaaata ccttatataa aatatgtgct gccatctgtg aagtttaatt tcaatcaatc 180 gatttatttt catttatcaa tcctattaaa ataggacgta catatacgtc ttcgctcata 240 attttttttc caatttttca ttcctaagcc gctccctgag ttgcatttac gtattccaag 300 gctgctcctc aaaggg 316 // ID BEL-6_CQ-LTR repbase; DNA; INV; 334 BP. XX AC AAWU01028809; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_CQ_; KW BEL-6_CQ-I; BEL-6_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-334 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 166-166 (2011). XX DR Genome; AAWU01028809; Positions 37482 37149. XX SQ Sequence 334 BP; 83 A; 87 C; 84 G; 80 T; 0 other; tgttccgtac ggaagttctt cagcgtgtac cagcgtcgaa cccctcggtg ctgagtgcaa 60 gagtgtgcgt tcataacgga agcgtgctgg caaccgggag ggagtttgac gccatgattc 120 gagcttgagc aaccgacgcg accgtaccgc gaccgcaaaa agagaaattt tctagagaaa 180 taaaggtgaa atagtgttat attttgctgc cgcttactct cgattccgat ccaaagtgat 240 ttgcctcatt gtccgcccga agtgaaccgc acccagaagt tcgccgtcta tacagtccat 300 gtgtattctc cgaacattcg attccgcccg aaca 334 // ID L1-9_CQ repbase; DNA; INV; 3983 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3983 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 139-139 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 98..745 FT /product="L1-9_CQ_1p" FT /translation="MEDGAVDVKLFDLSDEVSDDEIIEFLSDYGEVLSVRD FT LYWEDKYFFRAIPTGVRVVKMFVERNIDSYVSIGGLVTQVQYHGQQHTCRH FT CNEFXHNGISCVNNKKLMVQKTYADAAKQPKQVTKPTPKSTPRLPTGSQLN FT QVAGPSQPRLNDNIMLPPKDRLVFTKLTPQPQKSKGKGSKTDGNDTDTSST FT STRSLRSKSKKHKRDGAVSVDEGMVL" FT CDS join(1451..3115,3119..4621) FT /product="L1-9_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MANYNSYNIATININTITNTTKIDALRTFTRTMNLDI FT IXLQEVENEQLTLPGYNVICNVDHARRGTAIALKEHIRYSHVEKSLDGRLI FT ALQINDTTLCNIYAPSGTAQRADRERFFNNTLAYYLRRRTSHTVLMGDFNC FT VLRQCDATGYNLSPALQTAVRQLQLFDVWEQLQPQAQGYTYISHNSASRLD FT RIYVTQGLRNQLRNSDTHVCSFTNHKAVTARICLPHLGHETGRGYWSLRPH FT LLTPENIAEFRNRWQYWTRQRRNFPSWMRWWLEFAKPQIKKFFRWKAKEAY FT DEFNGKHQRLYAQLRAAYDQYYQNPVALVAINRIKGEMLTLQRQFSQMCVR FT VNETYVAGEPLSMFQLGERRKKTTTITQLHAEQGELIDGSPAVEEHLHRYF FT SELYAEVHEEREAVDEFQCCQIIPNADPTNEACTREITTADILAAIKSSAP FT RKSPGPDGIPKEFYQWLFDVIHREINLVLNEALRGDFPPEFVEGVVVLVKK FT KGNSNTARSYRPISLLNVDYKIFSRMLKARLDDVMKLHGILSDGQKCSNAE FT RNIFQATLIKDRLAQLISRREKAKIISYDLDHAFDRVRTAFLHQTMRSLGF FT DRRFVDLLACIADRSTSRLLVNGHLSPAFPIQRSVRQGDPISMHLFVLYLH FT PLVSRLERVCGDDLLVAYADDISVISSSVEKIQRMGELFSNFEIVSGARLN FT MEKTVSIDVGLSDGAPLLVPGLRTEIKIKILGVIYTNSIRAMVKLNWDAAV FT SRFAQQVYLHSMRTLTLLQKVALLNTFITSRIWYLSSTLSPCAYHVAKITA FT TMGTYLWKGLPARVAIQQLARSRDQGGLQLPALKSRALLVGRHLQETGSIP FT FYSSFIAQVNPNPPADCPCLKVILQNISLLPXQVQQNPSSSLIHSFFINQT FT ETPRIVRLNPTADWSRIWTNISTRXISSAVRSDLYLLVNGKTEHRQLMHTI FT GRADGASCLHCGALAETLEHKFSTCPRVAAAWVHLKRVAAPILQRMRALTI FT NDVLKPELHNIGPSRRRKFLQLINXYIHFIDTVNGRIDVNALDFHLNCEV" XX SQ Sequence 3983 BP; 1132 A; 1079 C; 901 G; 863 T; 8 other; ggtsatcgac aacgacttgg cgcagcagat tgtggaccaa cacgacggca aacacgcgat 60 cgagcacgag ggcaagtcgt acccaatccg tattgttatg gaagatggag ctgtagatgt 120 gaagctgttt gacctgtccg acgaagtgtc cgatgacgag atcatcgagt tcttgagcga 180 ctacggggaa gtgctttctg tccgtgacct ctactgggaa gacaaatact ttttcagggc 240 catccccact ggtgtgcgtg tggtcaagat gtttgtggag cgaaacatcg actcgtacgt 300 gagtattggt ggacttgtga cacaggtcca ataccacggt caacaacaca catgtcgcca 360 ctgcaacgaa ttckcacaca acggaatctc gtgtgtgaac aacaaaaagc tgatggtgca 420 aaaaacgtac gctgatgcag caaaacaacc aaaacaggtg accaagccaa cccccaagtc 480 aacaccacgc ctgccaacgg gctcccaact caaccaagtg gctggtccca gccaacccag 540 actgaatgac aacattatgc tgcctccaaa agatcgcctt gtcttcacca aactaactcc 600 tcaaccccaa aaatccaaag gcaagggaag caagactgac ggcaacgaca ctgacacctc 660 ttcaacgtcc accagaagct tgagatccaa atccaagaag cacaaacgtg acggtgctgt 720 gagtgttgat gagggaatgg tgttgtaaat atggccaact acaacagcta taatattgca 780 accatcaaca tcaacacaat aaccaacacc accaaaatcg acgcccttcg tacattcaca 840 cggactatga acttggacat catctktttg caggaagttg agaacgagca gcttaccttg 900 cctggataca acgtgatctg caacgtcgat catgctagac gcggcactgc aattgcactc 960 aaagagcata tccgttactc tcacgttgaa aaaagcctgg acggcaggtt gatcgctctc 1020 caaatcaacg acaccacact ctgtaacatc tatgctccat cgggaacagc acagcgagcg 1080 gacagagaac ggttctttaa caacacgctc gcatactacc tccgccgtcg cacttcgcac 1140 acagtgctaa tgggtgactt caactgtgtt ctgcggcagt gtgatgcaac ggggtacaat 1200 ttgagtccag cactacaaac agctgtacgc caactacaac tgttcgatgt atgggagcag 1260 ctacaaccac aagcacaagg atacacgtac atctcacaca actcagcgtc ccggctggat 1320 cgaatctacg tcacacaggg gcttagaaat cagctgagaa actcggacac ccacgtatgc 1380 tcattcacaa atcacaaagc tgtcactgcg agaatttgtc ttcctcacct cggccacgaa 1440 actggtcgcg gctactggtc cctacgcccc catcttctta caccagaaaa catcgcagag 1500 ttccggaaca ggtggcaata ctggacccgc cagcgccgaa acttcccgtc gtggatgaga 1560 tggtggttgg agttcgcgaa gccacagatc aaaaagtttt tccgctggaa agccaaggaa 1620 gcatacgacg agttcaacgg taagcatcag agactgtacg cgcagctgcg agcagcatat 1680 gaccagtact atcagaaccc ggtggcactg gttgctatca atcgtatcaa gggagagatg 1740 ctcacgctac aacgacaatt ctcgcaaatg tgtgtgaggg taaacgagac ctacgtcgca 1800 ggcgagccgc tgtccatgtt ccagctgggc gagagacgga aaaaaactac cacaatcaca 1860 caactacacg ctgaacaagg agagctgatc gacggctcac cagctgttga agaacacttg 1920 caccgatatt tctccgagct ctacgccgaa gtccacgaag agagagaagc agtcgacgag 1980 ttccagtgct gccagatcat cccgaatgcc gatccaacga acgaagcttg tacgagagaa 2040 atcaccacag cagacatcct cgcagccatc aagtcgagtg caccaagaaa atcacctgga 2100 ccagatggaa ttcctaagga gttctaccaa tggctgtttg acgtcatcca tcgggaaatc 2160 aatctcgtcc tgaacgaagc tctacgtggt gattttccgc cagagtttgt cgaaggtgtg 2220 gttgttcttg ttaaaaagaa agggaacagc aacactgctc gctcttacag accgatctct 2280 ctactgaacg tagactacaa gatcttctct cggatgctga aagcccgcct agacgatgtg 2340 atgaaactac acggaattct cagcgatggg caaaagtgtt ctaacgctga gcgcaacatt 2400 ttccaggcca cactatagat caaggatcgg ctcgcacagc tgattagccg aagagagaaa 2460 gccaaaatca tctcgtacga tctcgatcat gcattcgaca gggtacgtac agccttcctt 2520 caccaaacca tgcgctcgct cggtttcgat cgtcgcttcg tagaccttct tgcatgtatt 2580 gcggaccgct ccacatctcg tctcctcgta aacggccatc tctctcccgc attcccgatc 2640 cagcgctccg tgcgccaggg tgatcccatt tcgatgcact tgttcgttct gtatctacat 2700 ccactagtaa gcagactcga gcgtgtgtgt ggggatgatc tacttgtggc ctacgcggac 2760 gacatcagcg tgatctctag cagtgtggag aaaatacaac gaatgggaga attgttctcc 2820 aacttcgaga tcgtttctgg agcgagactt aacatggaga aaactgtctc catcgacgtt 2880 ggtctcagcg acggtgcccc gctgctagta cccggccttc ggacggagat caagatcaaa 2940 attctcggtg ttatttacac caactcgatt cgagcgatgg tgaagctgaa ctgggatgct 3000 gcagtgtcca gatttgccca acaggtctac ctacactcga tgcgaaccct gacgctgctc 3060 cagaaggtgg cattgctaaa cacgttcatc acttcgagaa tttggtacct ctcgtcaact 3120 ctctccccct gcgcgtacca tgtagcaaaa ataacagcta caatgggtac gtacctgtgg 3180 aaagggttgc cagctcgagt tgcgatccaa cagctagcac gaagtcgaga tcaaggtggt 3240 ctacaactcc ctgcacttaa gagcagagcc ttgttggtcg gcagacatct acaagagact 3300 ggctccatcc ccttttactc atccttcatt gcccaagtaa atcccaaccc accagcagac 3360 tgtccttgcc tcaaagttat cctccaaaat atttcccttt tgcccawcca agtccaacaa 3420 aatccctcct ccagtctcat tcacagtttc tttataaacc aaactgaaac tcccagaatc 3480 gtgcggctaa accctacggc tgactggagc aggatatgga ctaatatctc aacaaggmac 3540 attagttcgg cagttcggag tgacctgtac ctgttggtca atggaaaaac agaacatcga 3600 cagcttatgc ataccattgg amgagcggat ggagcatctt gtctccactg cggggcacta 3660 gctgaaacac ttgaacacaa attcagcaca tgtccacgcg ttgcagccgc gtgggttcat 3720 ctgaagcggg tggcagcgcc aatcctccag cgaatgcgwg cactaaccat caatgacgtc 3780 ctgaagccag agctccacaa catcggacca tcaaggagga gaaaattcct tcagttgatc 3840 aatsgctaca tccatttcat tgatacagtc aatggaagaa tcgatgtgaa cgctcttgat 3900 tttcacctaa actgcgaagt ttgaacaatt atgtaggtaa ttcttttaag ctgcaaaata 3960 aaccaaattt tacaaaaaaa aaa 3983 // ID Copia-5_CQ-LTR repbase; DNA; INV; 208 BP. XX AC AAWU01030339; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_CQ_; KW Copia-5_CQ-I; Copia-5_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-208 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 326-326 (2011). XX DR Genome; AAWU01030339; Positions 6881 7088. XX SQ Sequence 208 BP; 45 A; 64 C; 43 G; 56 T; 0 other; tgataagttg gcaaccaccg cactctgttt gcccctacga cgctcaagcc cactgggcac 60 acgcgggctg ctgctgtttg acagctgcac ttccgctacc agtctgtaac caaacaacaa 120 agagattaca cgctattttt cttttcgctt aagttaaacc gcgtgtttta ttccggcctc 180 cccgaaacct ctttaggttg tgggccca 208 // ID CR1_Ele16 repbase; DNA; INV; 4588 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele16. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4588 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4588 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 784..1620 FT /product="CR1_Ele16_1p" FT /translation="MVDVCESCAHELTDQIVVKCNAFCSSKFCQRCSRMPD FT EILRAVNSSQNLFWACNACTDLMKKSRFRNAMVSVNDVNHTTIDELKAEIR FT DYVLGEIKTEIRLNFKNFADNMPVTPIGTLRPLVQVSTRSKRNREDDIDQT FT RTRPQKLFRGTGVAASSGMTTTDGGPTIHEEDRFWLYLSGISPEVTDDAVS FT SLVSSALETNDSIVTKLVPRGKDMSTLSFISFKVGLRPSLKEKAMTASSWP FT RGLVFREFVDDTRDNRRGFWKPSITTADAGPTPATIIQ" FT CDS 1623..4520 FT /product="CR1_Ele16_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFPTSHLSSTPAKSKISVYYQNVGGMRSKIRSFFLTS FT VSLDYDVIVFTETWLYDGIRNSELTEEYTIYRCDRNSASSNFTRGGGVLIA FT VRKTLPCSSVHLHDSEHLEQVAVSIELPRQYVYVCAIYIRPNSDPTIYSTH FT SSSICQILDMANTDDSVVVVGDYNLPRLTWHFDDEVDCYLPINASSEQEIT FT LTENILASGLKQICDVTNENGRLLDLAFVNTVAYVECIEPPSAILRVDRHH FT TPFVLRIDVQHNDEPEPPPSPSDDIFDFNRCDYDALIDALSQYDWNVMLLG FT LTTDEAVSRFYDTIHDLLRRFAPLARRHTTRTNRKPWWNSELQHLKNVLRK FT ARKRFFRNKTVQNGLSVRRLEDELKSKMDLQYRMYLGNLQDDLKVNPSNFW FT RFIKSKRCANTIPSNITYHGQSSHSTVEATELFAEFFKTVYSSPISTISDD FT YLQSLPSYNIHLPPISVTSNDALASLKRLDPKKGPGPDGLPSVFLTRCAVQ FT LATPVSLIFNKSIAEGTFPRAWKSAAVTPIHKTGSIHSAENYRGISILCHL FT SKVLESIIYDKLYAAAQPILSEFQHGFVKKRSTVTNLLSYISTLNSSLEKR FT LQIDSIYIDFSKAFDKVPHELALKKLDRLGFPVWIVQWLRSYLSERTSFVK FT IGCHKSESFRTTSGVPQGSHLGPLIFIIFVNDLCTYLESCKLMYADDLKLY FT RIVTSVLDCVALQSDVQRLEEWCSLNGMEVNASKCKIIRFNRTRSDIVFDY FT KLMDKTLEKVTSIRDLGLIVDNKLRFSEHISATIAKAFVVLGFVRRNAADF FT EDVYTLKALYCSLVRSILEYAVQVWSPYHAIQANRIERVQKKFLRFALRRL FT PWNDPLRLPPYEQRCQLISLETLASRRTLSERMFVFDVLSGNLDCPELLQR FT INFSVPSRSLRRHSALWLPRHRTNFGENNPIDRCCRKFNEIAHLHEFGXSK FT NVFKCRIRNVL" XX SQ Sequence 4588 BP; 1264 A; 1118 C; 943 G; 1262 T; 1 other; ctgtcaaact gtcaagtcaa gctgttattg tttgttttgt gctccgcttt ttttgagttg 60 tttatgctgt tttaatcgcc caaataatat acagtctgca ttataccgct cattgcatcc 120 acccacgcca caatcagcca cagttttatc cgtcaagcaa aggtaggacc caaaaactcg 180 cctccaaaag taggctcgag agtgcttagc tgtgtacacc cagcaattgt tctacctcgc 240 ctcgctgcct cgccaaaatc caccacccac cgccaccgcc caccgctacc gccgccactc 300 aagcctcatc aacccatcca ccaataattg caacctgcca cttctgtcac caaaataatt 360 gcaagtccgt atctgctcca gttacttgca cttaataatt gcgtcgttct gccaaagagc 420 tgtatcttcg aaacaccgtt cgcaataaac atcgcccttt gctgctgtgt tatcaatcca 480 ccgttaatcc gaaagaccca caacaacatt agtctgcatt gtttcgcaca ttcatcgagt 540 aaacaaacaa cgtacctgct ttctcgtcaa cccattcgcc tcgctcatca tcctccagct 600 acaagtaagt ttgtctaata gtgtgagagt cgatacaacg tcaacgtttc tatcagtgaa 660 tcgtacgcac acgacccgta cttcgaaagg ggttcgtcga ttgccaattt cgatcagcgc 720 catctgtttg tcagtctgca tagaaaacca cggattacgt atcagaggtg tgcagctttc 780 attatggttg atgtgtgcga aagttgtgct catgagctga ctgatcaaat cgttgtaaaa 840 tgtaacgcat tttgttcgtc gaagttctgc cagcgatgct cgagaatgcc tgatgagata 900 ctacgcgctg tgaatagctc tcaaaatctg ttctgggcct gcaatgcatg tacggactta 960 atgaaaaaga gcaggtttcg caacgccatg gtatctgtga atgatgtcaa ccacacaacg 1020 attgatgaat taaaagcaga aattcgtgat tatgtgctgg gagaaatcaa aacggaaata 1080 aggcttaatt tcaaaaattt tgctgacaat atgccggtaa ctcctatcgg aacacttcgt 1140 ccgttggtgc aagtatcgac aagaagcaag cgcaaccgag aagatgacat cgatcagaca 1200 agaacacgtc cgcagaaact tttccgaggg accggtgttg cagcttcatc aggaatgact 1260 actactgacg gtggtccaac catccacgaa gaagaccggt tttggctgta tttgtcagga 1320 atttctcctg aggtgactga tgatgcagta tccagcctcg tttccagcgc tcttgaaact 1380 aacgattcca tcgttacgaa gctcgtcccg agaggaaaag acatgagcac gttaagtttc 1440 atatctttca aggtaggtct ccgccccagc ttgaaagaaa aagctatgac ggcgtcgtct 1500 tggcctaggg gcctcgtttt tcgtgagttc gtcgacgaca cacgtgacaa tcgtcggggt 1560 ttttggaaac cgtcgatcac aacagcagac gcaggtccaa ctccagcgac gatcattcag 1620 taatgtttcc gacttcgcat ctttcctcta cacccgccaa atcgaaaata tccgtgtact 1680 accagaacgt tggcggtatg cgtagcaaaa ttcgttcatt ttttctaacc tctgtttcac 1740 ttgactatga tgtcatcgtt tttaccgaaa catggctgta tgacgggata cgtaactcag 1800 agctgactga ggaatatacg atttatcgct gtgatcgcaa ttcagcatct agcaacttta 1860 ctcgtggcgg aggggttttg atcgctgtaa gaaaaacttt gccatgctca tccgtacacc 1920 tccatgatag cgagcatcta gaacaagtag ccgtatccat cgaactacca cgtcagtatg 1980 tttacgtttg cgctatttac attcgcccga acagtgatcc aaccatatat tctacgcact 2040 cttcatctat ttgccaaatt cttgacatgg ctaacaccga tgattccgtg gttgtcgttg 2100 gtgattacaa cctaccccgc ttaacatggc actttgatga cgaagttgat tgctacttgc 2160 ctattaacgc ttcttctgaa caagaaatca cgctcaccga aaatatactt gcatctggac 2220 ttaaacagat ttgcgacgtt acaaatgaaa acggaagatt actggacttg gcatttgtga 2280 atactgttgc ctatgtagaa tgcattgaac ctccatctgc tattctacgc gtggacaggc 2340 atcatacgcc atttgttttg agaattgacg ttcagcacaa tgatgagcct gaacccccac 2400 cctcaccgtc cgacgatatt ttcgatttta accgctgtga ttacgatgca ttaattgatg 2460 ctctttctca gtatgattgg aatgtcatgc tacttggctt gaccacggat gaagccgtat 2520 cgcgttttta cgatacgatc cacgaccttc ttcgacgatt tgctccgctc gctcgtcgtc 2580 acacaacccg cacaaaccgg aaaccctggt ggaactcgga acttcaacat ctgaaaaacg 2640 ttcttcgcaa ggcacggaaa cgcttttttc gaaacaaaac tgttcaaaat ggactatcag 2700 ttagacgtct ggaggatgaa ctcaagagta aaatggactt gcagtaccga atgtatttag 2760 gaaatttgca ggatgaccta aaggtgaacc catccaactt ttggagattc atcaaaagta 2820 aacgttgcgc caataccatt ccttccaaca tcacatacca tggacaaagt tcgcactcga 2880 cagttgaagc caccgaatta ttcgctgaat ttttcaaaac tgtatacagt tctcctatca 2940 gtaccatatc cgatgactac cttcagagtt taccgtcata caatattcat cttcctccga 3000 tatccgtgac tagtaacgac gcactcgcct cattgaaaag attggacccc aagaaaggcc 3060 ctggccctga cggtttgccg tcagtttttc taacacgttg cgctgttcaa ttagcaacac 3120 ctgtttcgct cattttcaac aagtccatcg cggagggaac tttccctagg gcatggaaat 3180 ctgccgccgt tacaccgatc cacaagacag gaagcatcca ttctgcggaa aattacagag 3240 ggatttccat tctgtgtcat ttatcgaaag ttctggaatc gattatctac gataaactgt 3300 atgcagctgc tcaaccaata ctctcagaat tccagcacgg tttcgtgaaa aaacgttcga 3360 cggtgacgaa tctgctttca tatataagca cgctgaatag tagcttggaa aagcggttac 3420 aaatcgactc gatctatatc gatttctcga aagccttcga caaggttccg catgaattag 3480 cattgaagaa acttgatcgt ctaggatttc cggtatggat tgtacaatgg cttcgctctt 3540 atctatcgga aagaacatcg ttcgtgaaaa taggatgcca caagtcagaa tcgtttcgca 3600 cgacatcggg cgtgcctcaa ggtagtcacc tagggccact gattttcata atttttgtca 3660 atgacctgtg cacttatcta gagtcgtgca aacttatgta tgctgacgac ctaaaactgt 3720 acagaatagt aacctctgtg ttagattgcg tcgctctcca atcagatgta cagcggttag 3780 aagaatggtg cagcttgaac ggcatggaag tgaatgccag caaatgtaaa ataattcgat 3840 tcaacagaac ccgatccgat atagtctttg actacaagct gatggacaag actctagaga 3900 aggtcacatc aatacgggac ctaggtctga ttgtggacaa caaactacgc ttttctgaac 3960 acatttcagc aacaatcgcc aaagctttcg tcgtactcgg atttgtgcgg cgtaacgctg 4020 ctgacttcga agatgtttat acgctgaaag ctctatattg ttcgttagta cgcagcattt 4080 tggagtacgc ggtccaagtc tggtctccct atcatgccat ccaagcaaac aggattgaac 4140 gagtccagaa gaagttcttg cgatttgcac tcaggaggct gccgtggaat gacccgttgc 4200 gtctaccacc gtatgagcaa agatgccaac ttatctccct ggagacgctt gcttcgagaa 4260 ggacactttc tgagcgaatg ttcgtattcg acgtgctgtc tggcaatttg gattgcccag 4320 aacttctcca aaggattaat ttctccgttc cgtcacgttc gctccgtcga cattctgcac 4380 tgtggctgcc caggcatcgc acaaatttcg gcgagaataa tccgatagat cgctgttgtc 4440 gcaaattcaa tgaaatagct catctacatg aatttggcmt atccaaaaat gtttttaaat 4500 gtcgcatccg taatgtttta tagttttaag catcagtctg tagagttttt tttaactgaa 4560 gacgtagtac aataaataaa taaataaa 4588 // ID Gypsy-267_AA-LTR repbase; DNA; INV; 210 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-267_AA_; KW Gypsy-267_AA-I; Gypsy-267_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-210 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 210 BP; 61 A; 44 C; 39 G; 66 T; 0 other; tgatggaatg ataacgccct gtaacggata tggtaatcat ataaaagatt aaagaaacca 60 tcggagaagt ctcttttggt tatcaactat caacgtatta gctcgtgttt aaattattac 120 tctgaaggaa agtcatctct aaaggattca tctccgtttc acttcgctga tccatttcgt 180 ccctggattg tcccccggct ggatatcaca 210 // ID CR1-16B_CQ repbase; DNA; INV; 3951 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-16_CQ; KW CR1-16B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3951 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 19-19 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >99% CC identity. CC This consensus is ~80% identical to that of CR1-16_CQ. XX FH Key Location/Qualifiers FT CDS 58..3837 FT /product="CR1-16B_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MGRFTKSRLEVPRAPXTVEPNHSPSSNYFSGSAARPG FT PVFGHGNGAFQAATSGKYTHPLRTSPRPDALSLSSKLRXVDEEMLAVAERC FT VADGPGRMYRGIEVEPRFPVTAVPFTTNTLSCLSSSQHSRPGPVNGTRSGV FT SRALLSGKYMWFPRTILPPDVPEVFSQRYVPTTEPQPRLADNSAFSVPEQD FT CTFQCARAEIADLPDHVDFPCFPVTITPFITANHRSDESDTSGRSHQGSEE FT CPRPPYTVESSQRSSPGVSTSHQSRPDPVYGHGSGGFRPPPTGEYNQINDS FT XPALNHQSLQQSKGTLSFITVYYQNAGGMRTKTKQFFLALASXDYDVIALS FT ETWLQDDIVDAELSSNYNLFRQDRNGLTSDRSRGGGVLIAVKKAHDFTCTR FT VLSAGYEHLEQVAVRIKARNHTVYVCCIYIRPNSPPDVYTSHGTAVQELVD FT LSSNDDSIIVTGDYNLPHLAWTFDVDVNGFIPLNASTEQELALTENVVATG FT LLQICSLANVNGRILDLAFVNDTLSVELIEPPKPILRTDRHHKPFILRVDY FT PDIPGVATRIEDAEPDFSRCDFDRVTESLSSIDWDDILQDQDANTSTTIFY FT DVLYEIVRQFVPSKRVSRNRTEKLPWWNADLRNRRNILRKARKRLFKAGTP FT ENQTAVDRLEIEYELLQDSLFRDYLNRVQVDLKDNPSSFWKYVRSRKRTSV FT LPTRISLNGSTAENPEDAANVFADFFSSVYEADAPSASNDYLNDLPTYDVD FT FPQPEFSQAEVKSALDAVDPSKGAGPDRLPPSFIKRLSEHLAKPVSVIFNR FT SLSEGTFPDEWKLAAITPIHKTGSTTAAENYRPISILSCLPKVFEVLIHEG FT MYSSAQVVISEFQHGFVKKRSTVSNLMAYVSALNENLEKFKQTDAIYFDFA FT KAFDKVPHTLIIAKLDRLGFPRWLTAWIRSYLSGRFGYVRLGGVQSRNFPI FT PSGVPQGSHLGPLIFILFINDICQRLQSQKLLYADDLKIFRTVASTIDCVA FT LQQDIDAIQEWCNLNGMKVNPSKCKSISFTRSPAPIRYDYTFDRHEMDRVC FT SIRDLGLLIDRKLSFSEHVSSTTAKAFALLGFLRRNTAEFENINALKTLYI FT TMIRSILEYAVQVWAPHHANQRDRLEKVQRRFTLYALRRLPWRNGVWRSSY FT SDRCTLLEMVSLEKRRTFLQRMFVFDVLTGRIDCPQLREEITVHRPTRTLR FT NQPLLRIPFHRTLYGYNRPIDRCCRIFNSVSDEYEPSMTRERFKRKILAL" XX SQ Sequence 3951 BP; 954 A; 1110 C; 940 G; 941 T; 6 other; taggtttcgg awtgcgagct gttcgaatgt tccagacatc tcatctgtgt tgctgtcatg 60 ggacgtttta ccaaaagccg tttggaagtg ccccgtgccc ccgwcacagt tgagccaaac 120 cattcaccat catcaaacta tttttccggt tcagctgcac gtcccggccc tgtgttcggt 180 cacggcaacg gggccttcca agctgccacc tcaggcaagt acactcatcc gttaagaacc 240 tcaccgcgcc ctgatgcgtt gtcactttct agtaagctgc gakccgtcga cgaagaaatg 300 ctagcagtcg ccgagagatg tgtcgctgat ggtccgggac gcatgtaccg aggcattgag 360 gttgaacccc gcttccccgt cacagccgtg cccttcacca ccaacacgct ttcctgtttg 420 agttcgagcc agcacagtcg tcccggccct gtgaacggga ccagaagcgg ggtcagccgc 480 gcactgcttt caggcaagta catgtggttt cccagaacta ttctgccacc tgatgttccc 540 gaggttttca gccaacgcta cgtgcccact accgaaccgc aaccacgact tgcagacaat 600 tcagcattct ccgtcccgga acaggattgt acatttcaat gtgcacgtgc tgaaatagct 660 gatctgccgg accacgtgga ttttccctgc ttccccgtca caattacgcc gttcatcact 720 gccaatcacc gctctgatga atcagatacg tcgggacgct cgcaccaagg tagtgaggaa 780 tgcccccgcc ccccgtacac agtcgagtca tcgcaacgtt ccagcccggg tgttagcacg 840 agccatcaga gtcgtcccga ccctgtgtac ggacacggca gcgggggctt ccgacctcca 900 cccacaggcg agtacaatca aattaacgac agtwgtcccg cactgaatca tcaaagtttg 960 cagcaatcca aaggcacgct gagctttatt accgtctact accaaaatgc tggtggtatg 1020 cgtaccaaga cgaagcagtt tttcctcgcc ctggccagcw gcgactacga cgtcattgcg 1080 ctgtcggaaa catggttgca ggatgacatc gtggatgcgg agttgtcgtc aaactacaac 1140 ttgtttcgcc aggaccggaa cggattgacg agcgatcgca gtaggggagg cggcgtgctc 1200 attgcagtaa agaaggcgca tgattttacg tgcacacgtg ttctttctgc gggatacgag 1260 catctcgagc aggttgcggt tcggattaaa gcccggaatc ataccgtcta cgtgtgctgt 1320 atttatatac gcccaaacag ccctccagac gtctacacct cacatgggac cgccgttcag 1380 gagcttgttg atctgtcatc taatgatgac tcaattattg taaccggcga ttacaactta 1440 ccgcatcttg catggacctt tgacgtggac gtaaacggtt tcattccgtt gaacgcatct 1500 acggaacaag agctggcgct gaccgaaaat gttgttgcaa cgggactcct ccaaatctgc 1560 tcgctggcta atgtcaacgg aagaatcctc gaccttgcgt ttgtcaacga tacgctctca 1620 gtcgaactga tcgagccgcc caaaccgatc ctcagaacgg atcgtcatca caagccgttc 1680 attcttcggg tggactaccc cgatatccca ggcgtggcaa cgagaatcga ggatgctgag 1740 ccggatttca gtcgctgcga ctttgaccgt gtcaccgaat ccctcagcag cattgactgg 1800 gatgacattc tgcaagatca ggacgccaat acttcgacta cgatcttcta cgacgtcctt 1860 tacgaaatcg tgcgacagtt cgttccctca aagcgtgtct cacgcaaccg gactgagaag 1920 cttccatggt ggaatgctga tctgcgaaat cgtcggaaca ttctgaggaa agcacgtaaa 1980 agattgttca aagccggcac accagaaaat caaaccgctg tcgaccgtct ggaaatcgag 2040 tacgaattgc tgcaagactc actctttcgc gattacttga atcgggtgca agtggacctg 2100 aaagacaacc catcgtcatt ctggaaatac gtgaggagca ggaaacgtac cagtgtcctt 2160 ccaacaagga tttcgctcaa cggttccacc gctgagaatc cggaggacgc tgcaaacgtc 2220 tttgccgact tcttcagtag cgtgtacgag gcggatgcac cttctgcttc gaacgattat 2280 ttgaacgacc ttccaacata tgatgtcgat tttcctcaac cagaattctc acaagcagaa 2340 gtaaaatccg ccctggacgc tgtcgatcca tcgaaaggtg ccggtcctga tcggctccca 2400 ccttccttca tcaaacggct ttctgagcat ttagctaaac cagtcagcgt aattttcaac 2460 cgctctctgt ctgaaggaac ttttcccgac gaatggaaac ttgcagcgat cacacccatt 2520 cacaaaaccg gtagtactac ggctgctgaa aactacaggc caatctccat cctgtcctgc 2580 ctgcccaaag ttttcgaggt gcttatccac gagggaatgt actcttccgc ccaggtagta 2640 atttccgagt tccagcacgg ttttgtcaaa aaacggtcca cggtatccaa cctgatggcg 2700 tacgtcagcg cactgaatga aaacctcgaa aagttcaagc aaactgacgc gatctacttc 2760 gacttcgcca aagcgttcga caaggtaccc cacacgctta ttatcgctaa gttagaccgg 2820 ctggggttcc caaggtggct gacagcatgg atccgttcgt acctttcggg gcggtttggc 2880 tacgtccgtc tcggtggcgt ccaatcgagg aatttcccga ttccatctgg agtcccccaa 2940 ggaagccatc tgggaccgct gatcttcatt ctcttcatca acgacatctg ccaaaggttg 3000 cagtctcaaa aactgctgta tgctgatgac ttgaagatct tccgtaccgt ggcgtccacc 3060 attgattgcg tcgctcttca acaggacatc gacgcgattc aggagtggtg taatctcaac 3120 ggaatgaagg tgaaccccag caaatgtaaa agcatcagct tcaccagatc acccgctcca 3180 attcggtacg attacacctt cgaccgccac gaaatggacc gmgtctgctc aatcagggat 3240 ctcggtctgc tgatcgaccg caagctgagc ttctcggagc atgtctcttc cacaacagct 3300 aaggcgttcg cgttgcttgg atttctgcgt cgcaacactg ctgagttcga gaacatcaat 3360 gcccttaaaa cgctctacat caccatgatt cgaagcatct tggagtacgc tgtgcaggtg 3420 tgggcgccgc accacgctaa tcaacgtgat cgcttggaga aagttcagcg ccgtttcact 3480 ctttatgccc ttcgccgtct gccttggagg aacggcgttt ggcggtcgag ctacagcgac 3540 agatgcacct tgctggaaat ggtgtcacta gaaaagcggc gaacctttct tcaacggatg 3600 ttcgtcttcg atgttctgac cggtcgcatc gactgcccgc aactccgaga agaaatcacg 3660 gtgcatagac caacgcggac gctgaggaac cagccccttc tgaggattcc cttccatcgt 3720 actctatatg gatacaaccg accaattgac cgctgctgcc gaattttcaa ctccgtatct 3780 gacgagtacg agcctagcat gaccagagaa cgtttcaaac gaaaaattct agctctttga 3840 tgcgatgtga tttttttatg ttatttgact gtttaatttt atgttagttt ttaagatatt 3900 cagtctgtgc gactctggtc gaagacggtg aacaataaac aacataaaca a 3951 // ID Mariner-2_TCa repbase; DNA; INV; 905 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner-2_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-905 RA Jurka J.; RT "Mariner/Tc elements from insects."; RL Repbase Reports 9(3), 675-675 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 905 BP; 311 A; 151 C; 187 G; 256 T; 0 other; cgagggcaag gtgagaaatt ctcggcctac cacagaatta acattgttat caataaaatt 60 tttatgcggg agcaagtcat catgtgtcat aatagcatcc ctcaagtttc gtcttaatct 120 gacaaatagt tttttttgag cattttttaa agtaaaggtg catgtctgtt ttgaacaatg 180 gataaaatgg agaatttcgc cgtcattaaa ttctttgttt tggagggctt attgccaaca 240 gaaatttatc caaagttggt gagagtttat aaagagtcaa cctcttcata ttctactgta 300 aaaaatggcc agctgaattc gaacgtggtc gcacatctct caatgatgag ttttgtgaag 360 gacgaccaaa aacagtaacc acggatgaaa acctcaaaaa aatgcacaat ttggtattgg 420 atgatcggcg aatgaaagtg tatgagatag ctaaggtcgt agggatatct gagaaaaggg 480 tgtagtttat tttgcaccaa gaattagata cgaaaaaatt gtgtgcaaaa gggtgccgca 540 tgggtccaat gcacctgccc acaaaggcgc tctggcaatg gaaaaattga gggatttaaa 600 acatgaatta ttagaacatt cttcatattc gcctgatttg acttccccag acttccacca 660 accacaaatt aaaaaaaaat tgtggttagg aaaaattatc actcgaatga ggaggtgatt 720 gcggctgtaa acaagtattt tgcagaccta ccagaaagtc acttcaggga tgggattaaa 780 aaattagaaa cacattgcac taagtgcgtt gaactaaata gagattatac aaaaaataaa 840 tgcatatttg aattaaaaaa cgaagtcttt cattgttagg ccgagaattt ttcacctcgc 900 cctcg 905 // ID Gypsy-166_AA-LTR repbase; DNA; INV; 187 BP. XX AC supercont1.387; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-166_AA_; KW Gypsy-166_AA-I; Gypsy-166_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.387; Positions 234711 234525. XX SQ Sequence 187 BP; 55 A; 39 C; 35 G; 58 T; 0 other; tgttgcatta ttagcacacc cctatgtatg tacatccctt aagaattcat ccattcattc 60 attcattact ttctacctga ctgtcccgtg tgggaccaga agtgtttaga gcgggagtga 120 gtacagttcg ctacgtaata aagaagtact aacccaaagt ctagtgtgtt ttattagaca 180 aagaaca 187 // ID Gypsy-59_CQ-LTR repbase; DNA; INV; 1143 BP. XX AC AAWU01037375; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-59_CQ_; KW Gypsy-59_CQ-I; Gypsy-59_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1143 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 498-498 (2011). XX DR Genome; AAWU01037375; Positions 15118 13976. XX SQ Sequence 1143 BP; 331 A; 280 C; 248 G; 284 T; 0 other; tgtaacgcac ctaagttcct acctaaaatc gcatcctcac ccacgcccag acctttcttc 60 taagtttatg agccagacgt acatgtaccc cgcccagcaa caaggctggc gaggtatcac 120 taagccaggg gtccgtctct ccactactct ctctttagaa caaataaaaa gtgagagaga 180 aattcaagga ggctctcact aattgactat ttgagggaga ctctcactaa ttgactaatt 240 tccccaggta gtgaccttgg cattcctgcc ctcgtttgac tcgggcccgc acacttcagt 300 gacacctctc ctcgctctac ccctttaccg attccacatt tcccaggaaa agtgcgcaac 360 tctaataaag cacacctcct attaagtccg cggtcctggt ttcatataca tttggacaag 420 gacggaatca tgatagcaca caggggcata ttggggcagg tcagggttaa gcaagtccgg 480 tagcaagtgc gcttcggact atgacgtcac cgtttggtgt ttggggcagg tcaaggcaaa 540 gaggttcaag cagaaagaag acctctcgcc tgaggatggt acccggggaa taattaacga 600 gacctccatg tccatcctcc ggaaaactaa aaactgctgg aactcatggg ctcctaacat 660 acctaattca catagaagga cccaggcgac atcttagtac aacgccctgg gtccaaaggt 720 caaaacaaca atggaaattt ccactacctg gatcgtggga aaagtccaaa gatcattcag 780 tagtggaaaa ttccgatacc tttgggatgt ccacgtgttc gcggtctttg accgaaatca 840 aggtcgtttt cccctcccat aattgagaga gtcgttaatt tagtacaaag aagtggggag 900 ggaatcctaa tctcgacgct aagtataaat agtgatcgaa gggctctcat aatcagttca 960 gtgtttacct tcagactcct acagttagtt cctacagtta ttcccttgta attcataaat 1020 gtaagttgtt tctttttgaa taaagttata aaaggtgaag aaaagtgaag tgatcaattc 1080 ccgaaagagt aatatcacga tacccctcca gaagaaccga tcgaccggag atcggtggtt 1140 aca 1143 // ID Gypsy-56_CQ-I repbase; DNA; INV; 4397 BP. XX AC AAWU01017208; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-56_CQ_; KW Gypsy-56_CQ-LTR; Gypsy-56_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4397 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 491-491 (2011). XX DR GenBank; AAWU01017208; Positions 19858 24254. XX CC Positions [3347-3811] - Integrase core CC 'GGTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 101..4384 FT /product="Gypsy-56_CQ-I_1p" FT /translation="MSTSPVPPPLSPHQQPQQGPHPPTTNAALLQILAQQQ FT QLMTELAAQVKGMSRDEVVLDSLSSNIAEFAYDPEHGCTFDSWYARYADLF FT DKDAGKLDDAAKIRLLMRKLNPAAHERFTSFILPKTPKDFKEFSVVVTKLK FT TIFGTPVSVFNRRFHCLQITKDDAEDFVSYSCRVNRACVDFKISELSEEQF FT KTLIFVCGLKSPRESDIRMRLITKLNERAEVTLEQVVEDCKNLVNLKKDTS FT LVEKQGAPALVQEVRQSRFKAKQDKPDKQKQQHSSEAGPSSGQPSAPCWSC FT GGMHYSADCKFRDHVCRDCKKKGHKEGYCACFARKGKKKPPSKGVKVVSVK FT NVSQRRKFAEIEINQVPVQLQIDSASDITVISDRCWKQIGSPSGVKPSCSA FT KTASGAPLDLALEFWCDVEISGITKRSLCRVVSPKIDLNILGADWISAFGL FT WDVPFSSFCRKIAESAPDAVQALQAQFPKVFTSEMGLCTKTKVQLALKENS FT KPVFRPKRPVAYAMQSAVQTELQRLQDLGVLVPVDHSNWAAPIVVVRKPNG FT SVRICADFSTGLNSCLEPNQYPLPLPEDIFAKMAGCKLYSHIDLSDAYLQV FT EVDPKDQHLLTINTHKGLYRYTRLTPGITSAPGSYQQLMDTMVGDLNGTCG FT YLDDILVGGHTPEEHDRNLQQVLRRLEEYGLTVRIEKCSFRMPQVKYLGQI FT LDGDGIRPDPDKTSAVATMPPPHDVSSLRSYLGAVNYYGKYIPEMRKLRYP FT MDQLLKTGVKWEWSEACQRAFNRFRDILQSPLALTHYNPKLDIVVSADASQ FT HGIGARIAHKLPDGTVKAISYASRSLTPAEANYSQIEKEGLGLIFAVTRFH FT RMIFGRSFTLETDHKPLLAIFGSKKGIPVYTANRLQRWALTLLLYDFAIQY FT IRTESFGYADVLSRLINTHIRPNEEYVIAAVELEDCMQDIVKQSLAHLPIT FT FKMVQGGTKADPVLKQVIQFVQTGWPTNRTDLTDQHVQQFHQRRDSLSLVS FT NCLMYGERTVIPAKFRDCVLHQLHKGHPGVERMRSVARNYVYWPGIDEHIT FT QLVRSCNECAKAAKTNPKTSLESWPIPTQPWQRVHADFAGPVDGLYFLVVV FT DAFSRWPEVVPTKRITTTATIAIFREIFSRHGMPETLVTDNGTQFTSEDFE FT GYCNSNGILHLKTPPYHPQSNGLAERFVDTFKRTLKKITAGGEALRQAIDT FT FLLCYRSTPCRSAPQSKTPAELLLGRRLRTSLDLLKPPTPFNKPADSEQEK FT QFNRKHGAKARNYNVKDLVWAKVHRNNTWTWEPGQVLERVGAVVYNVWLPS FT KQDLIRSHCNQLRKRHESEAVESTEEQQPAQIPLNILLDSWGLLQSAPAEP FT AVQQPDPEPQPSSSGNELLPDQEPGVVPRRPRPQRRATPQQHQGPSRTSSR FT IRRPPVRYDPYHCY" XX SQ Sequence 4397 BP; 1048 A; 1302 C; 1226 G; 821 T; 0 other; aagtggcgac gagtatcgta gcagccggtc tcgcgtagaa gtcgatttcg caaagttttc 60 gtcggaaatc cgtcgcgacc gaagttgagg ttagaacgcc atgtctactt caccggttcc 120 accgccgctc tcgccgcatc agcaaccgca acaagggccg catccgccga caaccaacgc 180 ggcgttgctc cagatcttgg cccagcagca gcagctgatg accgagctcg cggcgcaggt 240 gaaggggatg tcgagggatg aagtcgtgct agactcactc tccagcaaca tcgccgagtt 300 tgcgtacgat ccggaacatg gctgcacctt tgactcctgg tacgcgaggt atgcggatct 360 gttcgacaaa gacgcgggca aattggacga cgcagcgaag atccggctac tcatgcggaa 420 gttaaaccca gcggcccacg aacgattcac gagcttcatc ctccctaaaa cgcccaagga 480 cttcaaggag ttctctgtgg tcgtgacgaa gctcaagacc atcttcggga cgccagtgtc 540 cgtcttcaac cgcagattcc actgcctgca aataaccaag gacgacgccg aagattttgt 600 gtcctactct tgtcgggtga accgggcatg cgtcgacttc aagatcagcg aactctccga 660 agaacagttc aaaacgctaa tcttcgtgtg cggactgaag tctccccgtg aatccgacat 720 ccgcatgcga ctgatcacca aactgaacga gcgagcggag gtcacgctgg agcaggtggt 780 cgaggactgc aagaacctgg tcaacctcaa gaaggacacc agcctcgtgg agaagcaggg 840 agcacctgcg ctcgtccagg aggtgcggca aagcagattc aaggcgaagc aggataagcc 900 ggacaagcag aagcagcagc attcatccga agctggcccc agcagtggcc aacccagtgc 960 tccgtgctgg tcgtgcgggg gcatgcatta ttcagctgac tgcaagttcc gggaccacgt 1020 ttgccgcgac tgcaagaaga aggggcacaa agaagggtat tgcgcctgtt ttgcgaggaa 1080 ggggaagaag aaacccccgt ccaagggcgt gaaggtggta tcagtcaaga acgtcagcca 1140 gaggaggaag tttgccgaga tcgagatcaa ccaggtcccg gtgcagctgc agatcgactc 1200 agcgtcggat ataaccgtca tttccgaccg gtgctggaag cagattggga gtccttccgg 1260 tgtcaaacca tcgtgcagcg caaaaacggc ctctggggca ccccttgacc tcgcactgga 1320 attctggtgc gacgtggaga tcagcggcat cacgaagcgc agcctgtgcc gcgtcgtgtc 1380 tcccaaaatc gacttgaaca ttcttggcgc tgactggatt agtgcatttg ggctgtggga 1440 cgttccattc agttcatttt gccgcaaaat cgccgagtca gcccccgatg ccgttcaagc 1500 actgcaagca cagttcccca aagtcttcac cagcgagatg ggactgtgca ccaagacgaa 1560 agttcagctc gcactcaagg agaactcgaa gcctgtgttc cgtcccaagc gtcccgtggc 1620 gtatgcgatg cagagcgccg tgcagaccga acttcaacga ctgcaggatc tcggagttct 1680 cgtgccggtg gatcactcca actgggcggc gccgatcgtc gtcgtacgga agccgaacgg 1740 gtccgtgcgg atctgtgctg acttttccac cgggctgaac agctgcttgg aaccaaacca 1800 gtacccgctc cctctgccgg aggacatctt tgcgaagatg gcgggttgca aactgtacag 1860 ccacatcgac ctgtcggatg cctatctgca ggttgaagtc gatccgaagg accaacacct 1920 gctgacaatc aacacacaca aagggttgta ccgctacacc cgcctcactc cgggcatcac 1980 ctctgcaccg gggtcctacc aacagttgat ggacacaatg gttggcgatc tcaacggaac 2040 ctgtggttac ttggacgaca tccttgtcgg tggccacaca ccagaagagc acgatcgcaa 2100 ccttcagcag gtcctccgcc gtctcgaaga gtacggcctc accgtccgga tcgaaaagtg 2160 cagtttccgg atgcctcagg tcaagtacct gggacagatc ctcgatggcg atggtatcag 2220 gccggacccg gacaagactt cggccgtcgc aaccatgccg ccgcctcacg acgtctcatc 2280 gttgcgctcg tacctcggcg ccgtaaacta ttatggcaag tacattccgg aaatgcgcaa 2340 actccggtat ccgatggacc agctgctcaa gacgggcgtc aagtgggagt ggtccgaggc 2400 gtgccagcgc gcgttcaacc gttttcgtga tattctgcag tccccgctag cgctcacaca 2460 ctacaaccca aaactggaca ttgtagtctc ggcggacgcc tcacaacacg gaatcggcgc 2520 gcgcatcgcc cacaagctgc cggacggcac cgtcaaggcg atctcgtacg cgtcgcgcag 2580 tctcactccc gctgaagcaa actacagtca gattgagaaa gagggactcg gactcatctt 2640 cgctgtcact cggttccacc ggatgatttt tggccggagt ttcacgctgg agacggacca 2700 caaaccgttg ttggcgatct tcggaagcaa gaagggcatc ccggtgtaca cggcgaacag 2760 gttgcagcgt tgggcactaa ctctgttgct ctacgacttt gcgatccagt acatccggac 2820 ggagagcttc ggatacgcag atgtgctttc gcggctgata aacacacaca ttcggccgaa 2880 cgaagagtat gtcatcgccg cggtcgaact cgaagactgc atgcaggaca tcgtgaagca 2940 gtccctggcg catctgccga tcacgttcaa gatggtccaa ggcggaacga aagcagatcc 3000 ggtgctcaag caggtgatcc agttcgtgca gacgggttgg ccaacaaaca ggaccgacct 3060 caccgatcaa cacgtgcagc agttccacca gcgcagagac agtctatcgt tggtttccaa 3120 ctgtctcatg tacggagagc gaacggtcat tccagccaag ttccgggact gtgtcctcca 3180 tcagctgcac aagggtcatc caggagtgga acgaatgcgg tcggtggcac gtaactacgt 3240 ctactggcct ggtatcgacg aacacatcac ccagcttgtt cgttcgtgca acgagtgcgc 3300 caaggcagca aaaaccaacc ccaagaccag cctggagagc tggccgatac caacgcaacc 3360 gtggcagcgg gtgcatgctg atttcgctgg ccctgtggac ggactctact ttctggtcgt 3420 ggtcgacgcg ttcagcagat ggccggaggt ggtgcctacc aagcggatta ccaccacagc 3480 cacgatcgcc atcttcaggg aaattttctc gcggcacggc atgccggaga cgctggtaac 3540 cgacaacggc acgcagttta ccagcgagga cttcgaaggc tactgcaaca gcaatggcat 3600 tctccacctg aagacaccgc cgtaccaccc gcagtccaac gggctcgccg agcgcttcgt 3660 ggacacgttc aagcgcacac tgaagaaaat aaccgcgggg ggagaggccc tccgtcaagc 3720 catcgacact tttctgttgt gctaccgatc cactccgtgt cgaagcgcgc cgcagagcaa 3780 aacgccagca gaacttctgc tgggaagacg gcttcgcaca tcgctggacc tactgaagcc 3840 accgacgccg ttcaacaaac ccgcagattc cgagcaggag aagcagttca accgcaagca 3900 cggcgctaag gctcggaact acaacgtcaa agatctggtt tgggccaaag ttcatcgcaa 3960 caacacttgg acttgggagc ctggccaggt gctggaacgg gtcggcgctg ttgtttacaa 4020 cgtgtggctg ccgagcaaac aagatctgat ccggtctcac tgcaaccaac tgcggaagcg 4080 ccacgagtcc gaagcggtcg agtcaactga agagcagcag ccagcacaaa tcccactcaa 4140 cattcttctg gattcctggg gacttcttca atcggcgcca gctgagccag cagttcaaca 4200 accggatcca gagccacaac catcgagttc ggggaacgag cttctcccgg atcaggaacc 4260 aggggtggtt ccacgtcgcc ctcgccctca acgaagagct actcctcagc aacaccaagg 4320 accatctcgc acgtcttctc gtatacgaag accgcctgtg aggtacgatc cgtaccattg 4380 ctattaaaag ggggagg 4397 // ID Tx1-N1_CQ repbase; DNA; INV; 1180 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Tx1 non-LTR retrotransposon from Culex DE quinquefasciatus - consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; nonautonomous; KW Tx1-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1180 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Tx1 non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 594-594 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 134..1096 FT /product="Tx1-N1_CQ_1p" FT /translation="MERTQKKNTIGFEFERDGVMPTLKEAISFLVNDLKIK FT DTEVHSVYLELTGKTFFVKFIDETTLKEVTLRLKETEPFKYANGKVVQVRV FT AAADGFFRYVRLFNLPPEVTDSDITKVMAKYGTVRQMVREKLPLELGFDAF FT SGTRGVHMEVKTEIPPSLFIGHYKCRIFYDGLRNRCFVCRQEGHVKAACPS FT KASASRATGESGSSSEVQGVTEHVENLIVPLNVAVENPIVPNGTVPSIEDS FT NFLKEIAFGEDFEDELGKDGENESEKEWTTAKKRGRQQKDKQEESGSDSSE FT PGKSKVSRTPSLLKLQSEYNRKLRSGMSK" XX SQ Sequence 1180 BP; 386 A; 221 C; 303 G; 270 T; 0 other; agtcgcggat aaggagtggt tgaagacgga cgcgaaaaca gatagctccg taagatcttt 60 ttaagctacc caaggccaga ggccttctgc attgttcttt tgaacaaaga ggatttgatc 120 aaaaagtgaa aaaatggaac gcacccagaa gaaaaataca attggcttcg agtttgaaag 180 agatggcgta atgccgacct tgaaagaggc tatcagtttt ctggtcaatg acctgaagat 240 aaaggatacg gaagtgcatt cggtctatct ggagctcaca ggtaaaacgt ttttcgtcaa 300 attcattgac gagaccacgc tgaaggaagt aacgctgcga ttgaaggaaa cggaaccctt 360 taagtatgca aatggaaagg tagtgcaggt acgcgtggca gcagctgatg ggttcttccg 420 ttacgtgcgg ttgttcaacc tacctccgga ggttacagat tcggacatta ccaaagtcat 480 ggcaaaatac ggcacagtac gccaaatggt tcgtgaaaag ctacccttgg aacttggctt 540 cgacgccttt agtgggacta ggggtgttca catggaagtg aagacggaga tcccgccgtc 600 gctctttatt ggccattaca aatgtcgaat cttttatgac ggactacgga atcgctgttt 660 tgtctgcagg caggaagggc acgtgaaggc tgcatgtcct tcgaaggctt cggcttcacg 720 agctacaggt gaaagtggat caagtagcga agttcaaggc gtaactgaac atgtggaaaa 780 cctcatagtt ccactgaacg tagcagtgga gaaccccatc gtaccgaacg gaacagtacc 840 gagcatcgag gattccaatt tcttgaagga aattgcgttt ggtgaggatt ttgaagatga 900 actcgggaag gatggcgaga atgaatccga aaaagaatgg acgacagcaa agaaaagagg 960 acggcaacaa aaggataaac aagaagaaag cggttctgac tcctcagagc cagggaagtc 1020 caaggtttct cgaacgccat cacttttaaa gctgcagtcg gaatacaaca gaaagttacg 1080 gagtggaatg tcaaagtgaa tattaatttt tctttatttt ttgtaatgta aaaagataaa 1140 tgaaaataaa taacctcata cacaaaaaaa aaaaaaaaaa 1180 // ID Gypsy-39_DPu-I repbase; DNA; INV; 6053 BP. XX AC ACJG01004895; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_DPu_; KW Gypsy-39_DPu-LTR; Gypsy-39_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6053 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004895; Positions 6530 12582. XX CC Positions [4495-4974] - Integrase core CC 'ATAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2425..5616 FT /product="Gypsy-39_DPu-I_1p" FT /translation="MTDKGVSIGRFLLPSRVTSGTYRFPLTNFSSSAQFIP FT AGMVVGKILPIEQVVEDAPVDSPSPPSTEMPLPFASRINKDLGKEDREKTI FT ALLGKYLRCFAASPHELGCSNLVQHKIDTGNHPPLHQPPYASAWRERELIN FT DQTQKMQKDNVVVPSNSPWASPVVLVKKKDGEWRFCVDYRRLNAITTKDVY FT PLPRIEDALSRMEGSRYFTILDMQAGYWQVGVDEQDRAKTAFITADGLYEF FT KVMPFGLTNAPATFQRMMDVVLAGLKWNTCLVYLDDIVVFAPTVTQHLERL FT ESVLQRIERAGLKLKLSKCSFLEQSLKVLGFIVSSEGLSPDPEKITAVHDF FT PTPRNVKEVQSFLGLCSYYRKFVPGFAVLARSLSNLTKKNQRFLWGEEQQR FT SFEALKTILTSPPILAHPRYDLPMEIHCDASHYGVGAVLVQQHDGKERVLA FT YASRLLSKPEINYSVSEKECLALVWSIKKFRTFIWGQKVKVVTDHHSLCWL FT LKKRDLSGRLARWSLQLQDLEIDIVHRSGRLHTDADSLSRAPIGTPEEEEE FT IPLLANFPAASPEKVDIALEQQQSPWWEKIISGLKETNPTPRIRKLIHPYE FT LRNGVLFRRRVHRGRCSYQLCLPSPFVEQVLLACHSDVTSGHLGVTKTMYK FT IQQRYFWPKMRRQIVRFVLSCVDCQTKKRPREAPAGLLHPIRANQPFEKVG FT IDLIGPFPITDAGNRYAIVAVDYLTKWAITKAVPKASVKEIVDFFVRNVVL FT QHGAPVFLISDRGKCLSASFAEELYKALQTNHLVTAAYHPQCNGLVERYNH FT TFAEMLSMYVNSYHNDWDGLVDFATFAYNTTRQESTGVSPFYLLYGREAVL FT PVDVALGNNPEMNKGDDGPNARIRQLTSELPTIRDEVKRRLTLIQSKQKSR FT YDRRRRSVNFTVGNLVLVYRPIRKKGRSTKFLHRYFGPYKIVRRVSDLNYI FT IEPLCGRRKKQDCVHVSHLKPFRLNALSESARAKVPSAIRVVSENTQRRGG FT RERKKTPKKTPTVRRKRVRFSRSVNKQCDPGESTNKEQLGEHYLRPRRTLR FT SPDRLDL" XX SQ Sequence 6053 BP; 1511 A; 1398 C; 1605 G; 1539 T; 0 other; atggtggaga tgcagggtaa tggctgcact tgctgtttgt gaataattcg actttaagcc 60 ctttttgtga acaattttgt gtctgagaaa actcgaggcg tcctttgatc aacgttcatg 120 attcatcgcg ccgtttattg ttcgtctctc tacacatcgc tcctgcttct taatttccgg 180 tcacgtatcg ttcctatttg tgttgagagc agttaatcag tctgttttta attttagatt 240 ttgcccaaaa tttgtggaaa aatttggaaa ttcgaggtga ccgttgacca acgttgttaa 300 acgttacgac gtttgtcgtt cgtcgttcga cgcactcatc ttattcttat ttttttcggg 360 ttttgtatcg tttcagtttt attaattgag aaagttaatt tgtcgctgtt caattatttc 420 taggtttgat gggaaaaatt ttgtttttaa caaaaaaaaa cgttgaaacc ctgacgctgg 480 agtggcgctg ccgtcgatcg tatgatcaat gttctacccg tccctttgaa tgctgacgtt 540 gatgatgcgt tgcgacagga tttttttttg atttttgatt tttactttcg aggaccatgt 600 attaatttta tttttaattt ttatttagaa attgaagatt ccgctctttg ggtcttatcg 660 ttggatgaat tgttggatgg atttcgccga attttaagcc cttgctattg aacgatagac 720 agacccaaag gccggtgagt tgcgtcccct aaccaaacaa ctttattttg caggagagag 780 gcttgactcc gacttgatac cgctgtaaca ctcagcagtc aaagtgttgg tcccattatt 840 ggggttcctt tccctccgtt cccttctgtt cagagtgtgt tgggtgggaa tcggtttggg 900 cgtatagagt aattcagggc ccttggcaac acaaaatggc ggataggcag cgaagagaag 960 aacaggatgg tggggaacag catgggcagg acgcagatga cagacgcctg gtgcggcgtg 1020 cactccaccc cggactcgct gggttaaatg atgaagcaaa tcaggcgccc cctttagtcc 1080 acgtcgggaa ttttaaagag agggaccctc cgatcttccg tgggttgccg cacgaagacg 1140 tgatggagtg gagctatcaa ttccaaagag tgtcggcgtt taatcaatgg gggcccgaac 1200 aacagcttcg acacgttgag ttcagcctag agggggtggc ggcgaggtgg ttttcgggat 1260 tgcgaccccg tccggaaact attgatgggt taatggaagc actacaggcg gcctttcggc 1320 accataacta tgccctagaa tggatcagat ggataccact gaactgttcc gaaggctgca 1380 agcgcatagc caggccgccc tgatcaccgg acggtccatc ccgataaacc agattctccc 1440 aaccccccct gcgggcagta aggtggatat tgagaagctc gtccgggagg aggtgtccaa 1500 gagtctagat ccgatcgttc ggaaactgga ggaagcgacg cgatcgttgg gtaccgctgg 1560 aggatcgcga gatggttgga gagcgcgaac agcagggagg gggcgggacc cgccgtcacg 1620 gcagcaagat cggccaaaca aaaggacggt ggatgggcga cccatatgca atgcgtgtgg 1680 gaaaccaggc catattgctc gaagttgccc ccgagacaat tgttgctata attgtgggga 1740 gcaaggacat ttctcacgcc aatgtccgaa tgtcaaatca gaaggtggat ccaaaaataa 1800 acagaagccg acagattccg ctaagaaaac gtccgatcct cccgccgggc agtcaaacta 1860 gaaccggtgg taagggggga gcgtaccacc gtcgtcttta atttccccta cactaattcc 1920 cgtcatctaa ttcttaagga ggttgattgt caagggcgga ttgcgaaggc agtaatcgat 1980 acggggtccg gaatttcgtt ggtttcgccg acgttctgcc ggatattggg aattgaacgc 2040 tatagagaat gggaaggacc gcggttgttg ctcgctaatg ggaacccctt agtaccagaa 2100 ggatcggtca agcttagaat ctatgttgaa gggcggttgg tttgggtaac agcggcagtg 2160 agtgggatga acggattcga tctcctattg ggaaatgatg cgttgtcgca gttggggtgt 2220 ttctcggtgc agtacgataa agctggcgtg ggatcgttct cgacaaccac cacgaaagag 2280 gatgtttctc gggaaagggc ggattatatc gtgaatcatg agactgtaag cattccggcg 2340 ttctccatgg tgtacgtaaa caccgtcgta ccacaattgg gtgggcgaaa tccgtgacac 2400 ctggtggaac catcgccgaa agttatgacg gacaaagggg tctcgattgg gcgattcctc 2460 ttaccctcac gcgtaaccag cggaacatat cgtttcccct taacaaactt ctcttcgtcc 2520 gctcaattta tcccagcggg aatggtcgtc gggaaaatcc tccctataga acaagtcgtg 2580 gaagatgccc ctgtcgatag tcccagccca cccagtaccg aaatgccgtt accattcgcg 2640 agtcggatca ataaggattt aggaaaggag gatagagaaa aaaccatcgc gttattgggt 2700 aaatacttgc gttgtttcgc ggcgtcgccc catgagctcg ggtgctcgaa cttagtccag 2760 cacaaaatag acactggtaa tcatccgcca cttcatcaac ccccctatgc gagtgcgtgg 2820 cgagagcgtg aactaatcaa cgatcaaacc cagaagatgc agaaggataa tgtagtggta 2880 ccctccaata gcccatgggc gtcgccggtg gtactagtca agaaaaaaga tggggaatgg 2940 cgtttctgcg ttgactatcg ccgattgaac gctatcacga cgaaagatgt ttaccccctc 3000 ccgagaatcg aagacgctct aagccggatg gaaggatcgc gatatttcac tatcttagac 3060 atgcaggcgg ggtattggca agttggggtt gacgaacagg atcgggcaaa aactgctttc 3120 attactgcgg atggactata cgagttcaag gtgatgcctt tcgggttgac gaatgcgcca 3180 gccacctttc aaagaatgat ggacgtagtg ttagctgggc tcaaatggaa cacctgcctc 3240 gtgtacttag acgatatcgt cgtcttcgct cccacagtta cgcagcatct agaaaggctg 3300 gagtcggtcc tccaacgtat cgagcgggcg ggattgaaat tgaaactgtc caagtgttcc 3360 ttcctggaac aatccctcaa agtcctgggg ttcattgtta gtagcgaagg gctctcccct 3420 gatccggaaa agatcaccgc cgtacatgat ttccctaccc cgcggaacgt gaaagaggtg 3480 cagagcttcc ttggtctctg ctcctactat cgaaaattcg tccctgggtt cgccgtactc 3540 gcccgatcac tctccaatct gaccaagaaa aatcaacgtt tcctttgggg ggaagaacag 3600 caacgcagtt ttgaggctct gaaaaccatc ctgacctctc cacccatttt agcccacccg 3660 cgatacgacc tcccaatgga aattcattgt gacgctagtc attacggggt aggagctgtc 3720 cttgtgcagc agcacgatgg gaaggagcgt gtcctcgcgt atgccagtcg gctgctgagc 3780 aaaccggaaa tcaactattc ggtatcggag aaggaatgcc ttgctcttgt ctggtctatt 3840 aagaagttta ggacgttcat atgggggcag aaagtaaagg tggtaacgga ccaccattct 3900 ctgtgctggt tgctgaagaa gcgggatcta tcgggacggc ttgcgcgatg gagccttcag 3960 ctccaagatc tcgagatcga cattgttcat agaagtgggc ggctccatac cgatgcggat 4020 agtctgtcaa gggctcccat aggcactcct gaagaagaag aagagattcc gttgctagct 4080 aatttcccag cggcatcgcc tgagaaagtg gatatcgcgt tggaacagca gcaatcgccg 4140 tggtgggaaa agattattag tgggctgaaa gaaacaaatc cgactcctcg cattcgaaaa 4200 ttgatccacc cttacgagct gaggaatggt gttctcttcc ggcgtcgggt ccaccgggga 4260 cgatgctctt atcagctctg ccttccgtct ccctttgttg agcaggtcct tctggcctgc 4320 catagcgacg taacttccgg gcatctgggg gtaacaaaga cgatgtataa aatccagcag 4380 cgctatttct ggcccaaaat gcggagacaa attgtccgtt tcgtactctc gtgcgttgat 4440 tgtcaaacaa agaaacgacc acgggaagct ccggccggcc tactacatcc tatacgggca 4500 aatcaacctt tcgagaaggt tggaattgac ttgattggcc ccttccctat cacagatgcg 4560 gggaatcggt atgcgatcgt cgctgtagat tatctgacca aatgggctat caccaaggcc 4620 gtccctaaag cgtccgtcaa agaaatcgtc gacttctttg ttcgaaatgt cgtcctccaa 4680 catggggcac ccgtattcct catttccgac cgaggaaagt gtttgtccgc ctcattcgca 4740 gaggaacttt acaaagcttt gcagaccaat catctggtca ccgccgcgta tcatcctcaa 4800 tgcaacgggt tggtcgaacg atataaccac acattcgcgg aaatgctctc gatgtacgtc 4860 aattcgtacc ataacgattg ggatgggctc gtagatttcg ccaccttcgc gtacaacacg 4920 actcgtcaag agtcaacggg tgtcagtccc ttctacctat tgtacggacg ggaggccgtg 4980 ttgcctgtcg atgtcgcgct ggggaataat cccgaaatga acaaaggcga tgatggtcct 5040 aatgcacgca ttcgtcagct cacgtcggaa ctcccaacta ttcgagatga ggtaaagaga 5100 cgcttgactt taattcagtc aaaacaaaag tcgcggtatg atcgtcggag acgctcggtt 5160 aacttcaccg tcggaaatct tgtgctcgtt tatcgtccca tccggaagaa aggacgatcg 5220 actaaattcc tccatcggta tttcggtcca tataaaattg tgcgtcgagt gagcgatctc 5280 aactacatca tcgaacctct gtgtggacga cgaaagaaac aagattgtgt ccacgtgtcc 5340 catctcaagc cttttaggct caacgctttg agtgaaagtg cgagagctaa agtcccttcc 5400 gccatacgag tagtgagtga aaatacgcaa cggcgcgggg gaagagagcg aaaaaagacg 5460 cccaaaaaga cgccgacagt gaggcgaaag cgcgtccgtt tttcccgttc agtgaacaaa 5520 cagtgcgacc ctggcgaatc aacaaataag gagcagctag gagagcatta cctacgccct 5580 cgacggaccc tgagatcccc ggatcgcctt gacttatgat tgtatcctgc ttatttcctt 5640 gtgttcgttg tgcctatccc atagtatttg ttttgccact tgattgtttt ccctgctgac 5700 tgcgttgtga agagagtgaa gaggtgtgcc ttcacggaag tggagaggag tgccttcact 5760 gagtcttgat gagtgaagag gagtgccttc gctctggaag ctgtagtctt ccaggcgaag 5820 aggtgtgcct tcactgagta tacatgagcg tgactgaaag agtttcagtg tgtgttgtac 5880 aattagactg gccatagtgt aaggtttagc atagtgatga tgattaaaat aagtttatta 5940 tagagaccta gaaatgtgat actaactaag gaatgggtta aagtagttaa ttgtgttggt 6000 gttaatgttg tttattccaa tgatcgggac gatcattctt cgtgggggag gga 6053 // ID Gypsy-99_AA-LTR repbase; DNA; INV; 189 BP. XX AC supercont1.75; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-99_AA_; KW Gypsy-99_AA-I; Gypsy-99_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.75; Positions 894250 894062. XX SQ Sequence 189 BP; 52 A; 32 C; 37 G; 68 T; 0 other; tgttatggta ttgttacgta gcttttgaat atccctcagt ttgtagtttg tcaacgctcc 60 cctattcaac aagactttag cttacagtta aaggtagtca gtttgagtag agactttgta 120 tcgtacgaac gcagtgaata aaagctcatg tgaatttgtt aaacctgtgt tttcgttcgt 180 actaaaaca 189 // ID Mariner-15_HM repbase; DNA; INV; 1793 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-1793 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1949-1949 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 1793 BP; 703 A; 238 C; 271 G; 581 T; 0 other; cagtcgtgtg acaatttatt atggccacct atgagttttg atcattttga aatttaatct 60 catttttttc atatgaagag taaaaattca ttattgaaag ttaatcttct gtttaaaata 120 aagttttaac ttataaaatc aatgcttttt ttttgctact ggcaacactg tcacttttga 180 cagccatttt gtatgatgtc atcttaatta taatcttttt gctgtacact tgttaagttt 240 tattaccaga gttttgaatt cttttattag gttatcgtca tattacttat ttaaatcagt 300 aaaaatttct tgaatatttc tttttatttt aaattaaaaa tagttgtatt tagtttgata 360 tttttaaagt tatgagtacc aaagcaaatt cttaactgtt atgtgatact gatatgtgat 420 atcttatttt agttatgagg gaaaaaaaaa acttgtcaca aaaaattgct gaaatcaggg 480 cacttttaat aaatactgat cattctcaat gaaaaatagc cagaaaatat ggtatttcgc 540 gggtatctgt acaaaatata cacaaaaaaa tgctaatatg gagaacttga tgccaaagag 600 ggttggtaaa tgtggaaaga agagaatcct aacaccatgc aggaaatgaa tacttaccaa 660 gattgttcta gaaaatagac gggccaccaa caacaaaata actaataagc tgggccaatc 720 tggagtcaaa atgttgaaga gaacggtcca acattgccta cacaatcttg gttacatttc 780 acgacgtcca gctaagaagc ctaaactgac accaaaaatt gtacaaaaaa tgcttgagtg 840 gacaaaaaag tataggaatt taacatatga ttggtaatta ataaaattaa tttagatgaa 900 tttaaaatat ccactttctt acaccaaact ttaaatattc attcaatatt cacttaacaa 960 tccttgtata atatttgaag gtatgcttta gcggcgagag tacttttaag ttttaaaaag 1020 aaaaaaaggg aaaaaatact tattggactg catcatgcaa actgttaaac accccacctc 1080 tgtgatgatc tgatcaatga aatgtgtaaa agaaatagga aggttctata ttgcagaatg 1140 aaaaatgtta cagttacaat ataaaaatgc tggaacaaag attgttgtca caattggcag 1200 aatagtttag tccaggtgaa aaaaaagcct tcatgcagga cagggcatga ccatgtcgca 1260 cagctaaatc agttaaaaaa tatctatcag aacaaaatat tccattactt gattggccag 1320 gaaactcttc tgacctaaat cctatagaaa atgtttgggc gctcctaaag aagcaaatat 1380 ctgaaaaaat tattacaaat aaagtgtatt taatagaaag aattattcaa cattggaact 1440 ataataaaga attgaaggaa atggctcaaa aatgcattaa aagtatgcca aaaagggttc 1500 ctgcaattat agaggcaaag gaggaatcac caaatattga tcttgaatat atttgcagta 1560 atatttgaat aataaaattt atttaatttg aataataaaa tttatttaat aatattttaa 1620 taataaaata ataaacaata tgttaaatgt ttttatttgt caaaaaaata cccatttctt 1680 aaataaaagt tactttgagt acataaatca caaaaaaata ctaaaataat agactttgaa 1740 acttgataat tttgaaaaat tgtgggtggc cataataaat tgtcacacga ctg 1793 // ID Crack-22_BF repbase; DNA; INV; 2589 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-22_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-22_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2589 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2589 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 827-827 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 3..2369 FT /product="Crack-22_BF_1p" FT /translation="TSEHREIHVMGDFNVDWIKTSDVSKKLHGMTQSYGVQ FT QLVDKPTRTVNRGLSVVASCIDLLFTNQPDKCSKITIRTVGFSDHDAVMFT FT RKAKIPTTGPRTVHKRTYKNFTEDAFLYEMKEAPWHLVYNFDDVDQATEMF FT ADILQDTCDRHAPIRKFTVRSNAAPWLTPDIKELMQMRNEAKRDAKATGLA FT SDWAVYRKLRNRVVSVCRRAKVQFYRDTFEECSGNPKKTWNTINSLLGRKH FT TVSPACVQEGGKLLSKPRDIAEHFAKFYDTKVANLRSGMSNTAGDETYANL FT GQEAFVLKSVTVRDVHKILSRLPDGKTPGCDGIDNKLLRISSDIIAEPLTY FT IINLSFATGTFPAKWKHAKVIPLLKDSSKPLSGQNSRPVSLLPTCGKVCEI FT VASEQIVSYFSRSGHQSEAQHAYRKLHSTETVLLKMTDEWLDSMDKGLMTS FT VVLLDYSAAFDLVDHVHLQKKLEMYGFSEEALSWIMSYLSNRKWQVYANGA FT YSQSRVLQCGVPQGSCLGPQLYSIYVNDMSSTVENGELDQYADDSTVHAAG FT RTVQDIRTKTLVDLECVAKWSDHNMLKLNNGKTKTMLVGTSKKTRVAPPLQ FT LELRGQSLQQVPTVKLLGVHVDQNLTFDNHVDHVVKNCNRSMAQVNRVRKL FT LPRRLRIKLIQALVLTHLDYCSSVWASTSKKNINRLQVVQNRAARIALGCH FT RYTSRDALLQTLGWLSVKDRIQQKSVVTFRKIMQNKQPVSLYKQVKMQSHG FT YQTRSSKSQTIRLPKPRTNAQKKRFLYRMSSTWNGLQ*" XX SQ Sequence 2589 BP; 828 A; 558 C; 612 G; 591 T; 0 other; caacatctga gcatagggaa attcatgtca tgggagactt taatgtggac tggattaaaa 60 cgtcagatgt atcaaagaaa ctacatggta tgacacaaag ctacggtgtg caacagttag 120 ttgacaaacc aacaagaact gtgaaccgag gactatcagt ggtggcatct tgcatcgatc 180 tgttatttac aaaccaacca gacaaatgta gcaagataac cataaggaca gttggcttct 240 cagatcatga tgccgtgatg tttacacgca aggcaaagat cccaacaaca gggccaagaa 300 ctgtgcataa aagaacatac aagaacttca ctgaggacgc attcctttat gaaatgaagg 360 aggcgccgtg gcacttggtg tataactttg atgatgttga ccaggcgact gagatgttcg 420 ctgacattct acaggacaca tgtgacagac atgcaccaat aaggaaattc actgtcagat 480 caaacgcagc cccttggctg accccagaca tcaaggaact gatgcaaatg agaaatgagg 540 caaagaggga cgccaaagca actggactag catcagactg ggctgtatac cggaagctca 600 gaaacagagt ggtatcagtc tgtcgcaggg caaaggttca gttttacagg gacacatttg 660 aagaatgtag tgggaacccg aaaaagacat ggaataccat caactccctt cttggaagga 720 aacatactgt aagtccagcc tgtgtgcaag aaggcggaaa gcttctcagc aaaccccggg 780 acatagcaga gcatttcgca aagttctatg acaccaaggt tgccaacctt cgctctggta 840 tgagcaacac agccggtgat gaaacatacg caaacctcgg gcaggaggcc ttcgtgttaa 900 agtcagtcac agtaagggac gtgcacaaga ttctgtcaag acttccggat ggtaaaaccc 960 caggatgcga cggaatagac aacaaactcc tgaggatttc ttcagacatc atagcagaac 1020 cactgactta cataatcaat ttgtcttttg caacaggaac tttcccagca aaatggaagc 1080 atgccaaggt gataccgctt ctgaaagaca gcagtaaacc actgtccgga caaaactctc 1140 gccctgtaag tctccttcca acttgtggca aggtatgcga gattgttgcc agtgaacaga 1200 tcgtctcata cttctcccgt tcaggtcatc aatcagaagc acaacacgcg taccgtaaac 1260 tccactctac agaaacagtc cttctgaaga tgacagatga gtggctggac agtatggaca 1320 aaggacttat gacctcggta gtgttgctgg actacagtgc ggcgtttgac ttagtagacc 1380 acgtccatct tcagaagaaa ctcgaaatgt acggtttctc agaggaagcg ttgtcttgga 1440 tcatgtcata cttgtcaaac cgcaagtggc aggtctatgc aaacggggca tacagtcaga 1500 gccgagtgtt gcaatgcggg gtgccgcagg gaagttgttt gggaccacaa ctgtacagta 1560 tttacgtcaa tgacatgtcc agtacagtag agaatgggga actggaccag tatgctgacg 1620 acagtacagt acatgcagca ggacgcactg tacaggacat aagaacaaag actctggtgg 1680 atttagaatg tgtggctaaa tggtcggacc acaacatgct taaactgaac aacggcaaga 1740 caaagaccat gctggttggg acgtctaaga agacaagagt cgcacctcca ctacagttgg 1800 aactcagggg gcagagttta caacaagtac ccactgtcaa gcttttggga gtacatgtcg 1860 accaaaacct cacattcgac aatcacgtag accatgttgt aaagaactgt aatagaagca 1920 tggcgcaagt gaacagggtg aggaaattac ttccgcgcag actgaggata aaactgattc 1980 aggcccttgt actaacacat ctagactact gcagttcggt gtgggcaagt acgtctaaga 2040 agaacataaa ccggctacag gtcgtgcaga acagagcggc caggatcgcc cttggttgcc 2100 acaggtacac aagcagggac gctttgctgc agacgctagg atggctgtcc gtgaaagaca 2160 gaatccagca gaaaagtgta gtgacttttc gtaagataat gcaaaacaaa caaccagtct 2220 ctctgtataa acaggtcaaa atgcagtccc atggatatca aacacgctca tcaaaatccc 2280 aaactatcag actaccaaag cccagaacaa atgctcaaaa gaaaaggttc ttgtacagaa 2340 tgtccagcac atggaatggt ttgcaataga taattggtcc cattgtatat atatataagc 2400 tcaggtagtc attaagttat cttttatcaa gtgtgaattt tatgtacatt gtatagattg 2460 tatgtcttcc tgtgtgtcag gtttttgtcc tgttatgttt aataattgtt gataaagacc 2520 acaggaagag tagcttcaat gtactccatt gttatgctaa ttgtggatct taataaagtc 2580 aaagtcaaa 2589 // ID FALCI repbase; DNA; INV; 144 BP. XX AC X04970; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.falciparum repeated element falci. XX KW Satellite; Simple Repeat; FALCI; Repetitive DNA element. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RP 1-144 RA Rao S.A., Green J.T. and Guntaka V.R.; RT "Organization of Plasmodium falciparum genome: ii. sequence RT analysis of falci element."; RL Nucleic Acids Res 14(23), (1986). XX DR GenBank; X04970; Positions 1 144. XX SQ Sequence 144 BP; 55 A; 12 C; 26 G; 51 T; 0 other; aattaagact atgttagtga agttaagact tagttagtca agttaagact ataattgttg 60 aagtatgact ttattactta atttaagatc taagttagta aatgtaagac ctaagttagt 120 taaggaaaga tctacgttag tcaa 144 // ID Outcast-17_AAe repbase; DNA; INV; 5795 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Outcast non-LTR retrotransposon from Aedes aegypti. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; KW Outcast-17_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5795 RA Kojima K.K. and Jurka J.; RT "Outcast clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1431-1431 (2011). XX DR [2] (Consensus) XX CC There are 4 complete sequences with >99% identity in the genome CC of Aedes aegypti. XX FH Key Location/Qualifiers FT CDS 335..1894 FT /product="Outcast-17_AAe_1p" FT /translation="MDTTRTDDPKLPPDKGKEWMETNLPNTDDEDLFAGFS FT EQEQTPTQQENHQRKRRDNDTHLDKQTKQLKKSQPHTQLSTRSQTQKQIKN FT STDNLTSNTTNAKSNNQHINNDRTIWIQNNQHDVLFIEPISIKEGEKLLEP FT MEVGKFLHDTGLDQFTELKRAGQYRYKLIYKKPKDAEKLLNATQLLKNNNY FT RAFIPRMLLETTGIIKNVPISLTEKEIYQNTISDKKITKIERIKRRKDPKD FT QKSELINTRSIKVTFEGPELPRTVSIYGVIEKPEIYIYPVRICTKCWRLGH FT KDTACKGKKTCIKCGQQETDKHTECDKTQKRICRNCGGNHLPTNKECPERV FT RMEHINVAMTINKMTFQEAQQLYPKHTLQTSNRFALLESTVEFPQLEQPSM FT TNTKFRQVNQTTSKKLDYRGIIRNITNNDTESKPKTKTETHKNTFPTVDTY FT PEQVHVIENNPHKVTEPEKLQTEQENLTKQLQNIIEKITNKTNDNKLDSDL FT LLIEIASMIQNLIHLTKSRKTSNT" FT CDS 1907..5551 FT /product="Outcast-17_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDNCFLNDLNISQLNIHSIRPLHKRETIKTYLXTNDI FT HIFLLQETWLKPDEKYKFLNYQFIKNCRPDGYGGTGILVHSTLIYEEITIP FT DIKLEMTIVKIKNLKQPIIFISLYLPPDTPITEIKEPLEKLFNYIENSTVP FT VFLGGDLNAHHSTWDKISNKTDRRGELISELLGNHDVIFLNTGVATRWEDX FT NNNSSAIDLTITTPDLGPITSWETSSQDLGTDHKLIECHIIQYNKFNNHAQ FT KTIISKNKAIENLNQIDPNTLTNVEDLNKEINNAMKKATYDIYPNSKKKIK FT PYWNENIKKLYEIKNKKHIEFRNNLTLANKHEFKKAELNFKKALKKEVFEY FT RDKMLNEINEQTNVNQMWKIVRCVSGNFKHKENSEIINNKELATKFIELNF FT KEEAMTNHSHSNTNYDVEEEIYVEPLKIENMIKTIRERKDTSAAGANRLSY FT YFLKNINLKLLTKILELTSQVWMNEKIPEEWLLIKIIPILKQGKDKLKESS FT YRPIALININLKLINCEIKKRLNKYLEIENLIPSLSYGFRKGYSAINCINH FT INSIITEAKRQKQIVATIFLDLTKAFDSVDINILLNKLDDLRIPRKITTWL FT DLYLKNRKIIIQTTQGDIIQNTNKGLPQGCPLSPLLFNVYTRELHDINTDK FT VIFIQFADDFSITIIGESIEEVETTSNIILEQIKSKLASIKIDINPEKSAV FT ILFNSKLNDKINIRINNVQIVQKEIHKYLGYILDYKLSHKAHINYVNDKVK FT KRLNVIKMICKKNKGAHPKTALKINKAIIRSQIDYGLTLYGGTAKTNLAKL FT NSSFNTSLRTCLRLLKSTPINVLYAEAGELPIKYRARWLAKKEAINTFFHD FT KPIVQIISKFLELDDLPKCYTFLECIIFENNYLIIITNNKTESXNTIELNE FT LIKTEIPGVKSQDMNNTAIKMITINYLNQKYDQFHKIYTDGSKTANACGIG FT IWYEQKNEKIKCKVNNAMSIMNVELMAISTAIDIINTKDTNNFVICSDSKS FT ALISIKNKNTDDNFIITDIRNKINKTRKTINFQWVPAHKGIIGNEIADTLS FT KEGCTNEQMIHTRIPLKDTINLAKIETLEEWTQEYITTSTEKGVKHYNLMP FT KPTFKPWFDKLELDTIQIVTIGRLRTFHTLSKEKLYLWKLTQDDKCDICGV FT KEDSSHVLLHCTKYSQSRMQYKILTEHDNIETLLDQAGANNYREISKFVQT FT NNITI" XX SQ Sequence 5795 BP; 2568 A; 1103 C; 827 G; 1293 T; 4 other; cattcctcct caaaccgtcg ttctgagcgg accacaagag ccaatcaaaa aaccagtcga 60 gtgtagtgaa gcattttcaa accgcgtgcg cgaaaacttt tttccaaatt taaatctcta 120 tctccatttt atatttcctt taattaatag ttttattttt ccacaaattt gataataagt 180 acatagaaga gagaaaaatc ggccaataag tgttagttta taagttgtga atagaaataa 240 aaaagagaaa aactcggtag tgattagtgt aagctaggtt aagaaaaaaa agtgaaaggg 300 aggaacgcca gctgtatgtt ggctgaatcc taacatggac accacacgaa cagacgaccc 360 aaaactacca ccagacaaag gtaaggaatg gatggagacc aatttaccca acacagacga 420 tgaagaccta ttcgccggat tttccgaaca agaacagaca ccaacacaac aagaaaacca 480 ccaacggaaa cgacgagaca acgacacaca cctagacaaa caaacaaagc agttaaaaaa 540 atcacaacca cacacacaac tcagcaccag atcacaaaca caaaaacaaa taaaaaattc 600 aacagataac ctaacctcaa acaccacaaa cgcaaaatca aacaaccaac acattaacaa 660 tgaccgaaca atctggattc aaaacaacca acatgatgta ctattcatcg aacctatttc 720 aatcaaggaa ggagaaaaat tactcgaacc aatggaagtg gggaaatttc tccacgacac 780 aggactagac caattcacag aactcaaaag agcagggcaa taccgataca aactaattta 840 caaaaaacca aaagacgctg aaaagctact aaacgcgaca caactactca aaaacaacaa 900 ttacagagca ttcattccaa gaatgctact ggaaacaaca ggcataatca aaaatgtacc 960 gatatcactc acagaaaaag aaatctatca aaacacaatc tctgacaaaa aaatcacaaa 1020 aatcgaaaga atcaaaagac gaaaagaccc gaaagaccaa aaatcagaat taattaacac 1080 tcgatccatc aaagttacct ttgaaggacc agaactsccg cgaactgttt cgatctacgg 1140 tgtcattgaa aaacccgaaa tttacattta cccggtaaga atatgcacaa aatgctggcg 1200 actagggcac aaagacacgg cttgtaaggg gaaaaaaacc tgcattaagt gtggtcaaca 1260 agaaactgac aaacacacag aatgcgacaa gactcaaaag cgaatctgta ggaattgtgg 1320 tgggaatcac cttcctacca acaaagaatg tcccgaaagg gttaggatgg aacacataaa 1380 tgtagcaatg accataaata aaatgacctt ccaagaagcc caacagcttt accccaaaca 1440 caccctacaa acctccaaca gattcgcttt gttagagtct accgttgaat tcccacagtt 1500 ggaacaaccc agtatgacaa acacaaagtt cagacaagtc aaccaaacaa catctaaaaa 1560 acttgactac agaggaatca ttcgcaatat cacaaacaat gacacagaaa gtaaacccaa 1620 aaccaaaaca gaaacacaca aaaacacctt tccgactgtt gacacttatc ccgaacaagt 1680 acacgtaatt gagaacaatc cacataaagt aactgaacca gaaaaacttc aaacagaaca 1740 agaaaaccta acaaaacaac tccaaaacat catcgagaaa attacaaaca aaactaatga 1800 caacaaactg gattcagact tactccttat tgaaatagct tcaatgatac agaacctaat 1860 acatttgacg aaatcacgta aaaccagtaa cacctaaaat ttaatcatgg ataattgctt 1920 tttaaacgat ctaaatatat ctcaactaaa tatacatagt ataagacctc ttcataaaag 1980 agaaacaata aaaacctacc ttaawactaa tgacatacat atattcttac ttcaagagac 2040 ttggcttaag ccagatgaga aatacaaatt tctcaactat caattcatta aaaactgtag 2100 gccagacggc tacggtggca ccggaatttt agtacactct acgttgatat acgaagaaat 2160 aacaatccca gacataaaac ttgaaatgac aatagtaaaa attaaaaacc ttaagcaacc 2220 aataatattc atctcgctat atctaccccc agacacacca ataacagaaa tcaaagaacc 2280 actcgaaaaa ctttttaact acatcgaaaa cagtaccgtt ccagtatttt taggaggaga 2340 tttgaatgct catcactcaa catgggataa aatttcaaac aaaactgaca gaagaggaga 2400 gctaataagt gaattactag gaaatcatga cgtcatattc ttgaacactg gagtcgcaac 2460 caggtgggaa gacmtaaata acaattcttc agcaatagat ttaacaatca caacaccaga 2520 cctaggacca ataacaagtt gggagaccag ttcacaagat ttaggaacag accataaact 2580 tatagaatgc cacattatac aatataacaa attcaacaac catgcacaaa agacaattat 2640 tagcaaaaat aaagcaatag aaaaccttaa ccaaattgac cctaacactc tcacaaatgt 2700 tgaagacctt aacaaagaaa taaacaacgc aatgaaaaaa gcaacatatg atatataccc 2760 aaatagtaag aaaaaaataa aaccttactg gaacgaaaac attaaaaagc tctacgaaat 2820 aaaaaacaag aaacacatag aattcaggaa caatttaact ctagctaaca aacacgaatt 2880 caaaaaagca gaattaaact tcaaaaaggc tttgaagaaa gaagtatttg aatacagaga 2940 taaaatgtta aatgaaataa atgaacagac caatgtaaac caaatgtgga aaattgtcag 3000 gtgtgtaagt ggcaatttca aacataaaga aaactctgag ataattaaca ataaggaact 3060 tgcaacaaag ttcattgaac tcaacttcaa ggaagaggca atgacaaacc attctcattc 3120 aaacactaac tacgacgtag aagaagaaat ttacgttgaa ccattaaaaa tagaaaatat 3180 gattaaaaca atccgagaaa gaaaagacac ttcggccgca ggagcaaata ggttatcgta 3240 ttatttcttg aaaaacatca atttaaagct tttaactaaa atattagaat taaccagtca 3300 agtttggatg aatgaaaaaa taccagaaga atggttatta atcaaaataa ttccaatatt 3360 aaaacaaggt aaagataagc ttaaagaatc atcgtataga ccaatcgctt tgataaacat 3420 aaacttgaaa ctaataaact gtgaaattaa aaaacgatta aacaaatact tagagattga 3480 aaatctaata cccagcttat catacggatt tagaaaagga tactcggcaa ttaattgtat 3540 caaccatata aactcaatta taacagaagc aaagagacag aaacaaattg tagcaactat 3600 tttcttagac ctcacaaagg catttgactc agtagacata aacatcttat taaataaatt 3660 agacgaccta cgaattccta ggaaaatcac aacttggtta gatctgtacc taaaaaacag 3720 gaaaataata atacaaacaa cacaaggaga cataattcaa aataccaaca aagggttacc 3780 acaaggttgc cccctctcac cactattgtt taatgtttat acaagagaac tacatgatat 3840 aaatacagac aaagtaattt tcatacaatt tgcagacgat ttttctatta caataattgg 3900 agaatcaata gaagaagtag aaacaacatc aaacataatt ttagaacaaa tcaagagtaa 3960 actggcatca atcaaaatag atatcaaccc ggagaaatcc gctgtcattt tattcaactc 4020 aaaactaaat gacaaaataa atattagaat aaacaacgta caaatagtac aaaaagaaat 4080 acataaatat ctaggataca ttctggatta caaactttca cacaaagcac acataaatta 4140 tgtaaatgat aaagttaaaa aaagattgaa tgtaattaaa atgatatgta agaaaaacaa 4200 aggtgctcac cctaaaacag cactaaaaat caacaaagca attataagat cacaaattga 4260 ctacggatta acattatatg gtggaacagc gaaaaccaac ttagctaaac taaactcttc 4320 gtttaatact tcattaagga cgtgccttag attgttgaaa tcaacaccca taaatgtact 4380 atacgctgaa gcaggagaat taccaatcaa atatagagct agatggctag ccaaaaaaga 4440 agcaatcaat acctttttcc atgataaacc aattgtacaa ataatttcaa aattcttgga 4500 actagatgat ttacctaaat gctatacatt cttggagtgc ataattttcg aaaacaacta 4560 tttaataata ataaccaaca acaaaactga aagtwcaaat actattgaac taaatgaatt 4620 aataaagacc gaaattcctg gggtaaaaag ccaagatatg aacaacactg ccataaaaat 4680 gataacaata aactatctaa atcaaaaata cgatcaattt cacaaaatat atacagacgg 4740 ttctaaaaca gcgaatgctt gcggaatagg tatatggtac gaacaaaaaa atgaaaaaat 4800 caaatgtaaa gtcaataacg cgatgtcaat tatgaacgta gaactaatgg ctataagtac 4860 agctatagac ataattaata ccaaagatac aaacaatttt gtgatttgct cagattcaaa 4920 atctgcactg ataagtataa aaaataaaaa cactgatgat aacttcatta taacggatat 4980 tagaaataaa ataaacaaaa ctaggaaaac tataaacttt caatgggtac cggctcataa 5040 aggtataata ggaaatgaaa tagcagacac cctctctaaa gaaggatgta ctaacgaaca 5100 gatgattcac acaagaatac cactcaagga cacaattaat ttagcgaaaa tagaaacact 5160 agaagaatgg acacaagaat acatcacaac ttcgacagaa aaaggagtta aacattacaa 5220 tcttatgcca aaacctacct tcaaaccgtg gtttgataaa cttgaactgg atacaataca 5280 aatagttaca atagggagat taagaacatt ccatacctta tcaaaagaaa aactgtattt 5340 atggaaatta acacaagacg acaaatgcga tatctgtggg gttaaagagg actccagcca 5400 tgtcttacta cactgcacaa aatattcaca atcaagaatg caatacaaaa tactaactga 5460 acacgataac atagaaacat tacttgacca agccggcgct aataactatc gagaaatttc 5520 aaaattcgtt caaacaaata acataacaat ataataacta acaccaaaca gagcgatttt 5580 tctctttgac agtacaacta cagtagttta ccgactaaat tggatcggta caatctatta 5640 attaaccgaa aaaaaaagat tgatagagat ttcgatcgca ttaagtaaat ttaagggacg 5700 atggagatgc cctaatcata tgcttcatga tttgacaaca cttggcagta tagactcaca 5760 aaagtctcgc catcagagga ggagaagaag aagaa 5795 // ID BEL-56_CQ-I repbase; DNA; INV; 2554 BP. XX AC AAWU01004237; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-56_CQ_; KW BEL-56_CQ-LTR; BEL-56_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2554 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 265-265 (2011). XX DR GenBank; AAWU01004237; Positions 47259 44706. XX CC 'CTATC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 40..2553 FT /product="BEL-56_CQ-I_1p" FT /translation="MSDSTIVCGACGRILTDAEERTTCGNCNETYHSCCVY FT STAADTKWYCPEGCLQRNLENAQAKDRSDPASSASGQEASGQPAVVAESNM FT SEVAEAALEQHAEEFDADRWFREKQIEIEMELAARQAQIDRELRKREERMA FT EAFQKAMRRQEVALEVGLKKKANHERRMAELERSFQRRSSLIDKQLEQIKF FT NCEKPLPSSHIIDSAMEETAKTGKKNEREDDLPETESGGSLERRTKDADEN FT GAKLKTQEFVSQGPNGLGQQCSGLEKAQLAAGGRLTRQSPCYPNDPEQWPE FT SLGTREGSSKAHGVTDARNLVGLRHGIEGAAQDTGHCPNHWSEPVFEKLQP FT TLECPELLPVSHPERAHELQHPKADGLAKLTSLENAGNILCEHMEAKDLTL FT VNPLANEDPASVLPDGEERHWAKKKTEATNLPSRINPNTKAGTRRRRRQLE FT EATLYCRKKGSSSSSRANTQQQKCYDHQLRFCGEIGKLFDDKLFKIVTRWE FT IGDGVKDQLKDRCQSNARGKLYILEDLPVVNSVSISAPEGYVNAIIFGTGP FT GQRYGGISANANDAATVPLVVGVTAFRPRLGDAEGKNGLQQKAEVRRAWPR FT FTRMQGWISTVSSTGGNNWFLSTEHLELWMSHHKLQAEELSALYSSRAAAK FT DSRLEDMIGRRNRPAGTYAELVSGGTSEPWISEKMAGGLEHGSEPDAVLQE FT VDRDYTKAEMARDLMDEAKDHVLMNGGRYSGSKGNKLPQSNSTPLWIPRIL FT AKKMGHRPTSVDAGDVDAHMDATHGRANQVCAISEENLQVLNGAQKTTAGN FT VGKATNLDERGAVDLAVWKFLVNPTHRMARRMLRAG" XX SQ Sequence 2554 BP; 724 A; 620 C; 815 G; 395 T; 0 other; aactcaaaaa ttaaatacgg gttcggaaac ctcaccacaa tgagcgattc aacgatagtc 60 tgcggcgcat gcgggcgcat tctcaccgat gcggaggaga ggaccacctg cggtaattgc 120 aacgagacgt accactcctg ctgcgtgtac agtacggcgg cggacacaaa gtggtactgt 180 ccggaaggtt gtttgcagcg gaacctggaa aacgcccaag ccaaagacag atccgatcca 240 gcaagttcag caagcggaca agaagcaagc ggacagccag ctgtcgtggc cgaatcgaac 300 atgtcggagg tcgcagaggc agcgctggaa caacacgcgg aagaattcga tgctgatcgt 360 tggtttcggg aaaagcagat cgagatcgag atggagctgg cagcacggca ggcgcagatc 420 gatcgcgagc tgcggaagag agaagaaagg atggccgaag cattccagaa ggcaatgcgg 480 cggcaagaag ttgccttgga ggtcggccta aagaagaagg cgaatcacga gaggcgaatg 540 gcagagctag aaagatcgtt ccagagaagg tcgtcactga tcgacaagca gctggagcag 600 atcaagttca actgcgagaa gcccctgcca tccagtcata tcattgattc agcgatggag 660 gagacggcaa aaaccggcaa gaagaacgaa cgcgaagacg acttgccgga gacagagagc 720 ggtggaagtc tggaaaggcg cactaaagac gccgacgaga acggggcgaa gctgaaaacg 780 caagaatttg tcagccaagg tccaaacggg ctggggcagc agtgttccgg acttgagaag 840 gcacaactgg ccgcgggcgg gaggctcaca agacagtctc cttgttaccc gaacgaccca 900 gaacagtggc cagagtctct tggaacgcga gaaggatcct cgaaagccca tggagtgacg 960 gacgcacgaa accttgtggg gcttcgccac gggattgaag gtgcggcgca ggatactgga 1020 cactgtccga atcattggtc tgagccagtc ttcgagaagc tccagccaac gttggaatgc 1080 ccggaactgc tgccagtaag ccacccagag agagcgcacg agctacaaca tccaaaagcg 1140 gacgggcttg cgaagctcac atcgctcgag aacgcaggga acatactgtg cgagcacatg 1200 gaggcgaagg atctcactct ggtgaatccg ctcgcgaacg aagacccagc tagcgtactc 1260 ccggacgggg aggaacggca ctgggcaaag aagaagacgg aagcaacgaa ccttccgtca 1320 agaatcaatc cgaacacaaa agccgggaca agaagaagaa gaagacagtt agaagaagca 1380 acactctact gtcgaaagaa gggcagctcg agttcgagca gggccaacac gcaacaacaa 1440 aagtgctacg atcaccagtt gcggttctgc ggagaaatcg gaaaactgtt cgacgacaag 1500 ctcttcaaga tcgtcacaag atgggagatc ggcgacggcg tgaaggatca gctgaaggat 1560 cgctgccaaa gcaatgctcg cgggaagctt tacattctgg aagatcttcc ggtcgtgaac 1620 tcggtcagca tcagcgcgcc cgagggatac gtcaatgcga tcatatttgg aacgggtccg 1680 ggtcagcgat atggaggaat ttcggcgaac gcgaacgatg cggcgaccgt cccgttggtc 1740 gtaggcgtga cagcttttcg accgagactc ggagacgcag aaggtaaaaa cggcctgcag 1800 cagaaggctg aggtaagacg cgcgtggcca cgattcacac gaatgcaagg gtggatttct 1860 acggtgagct caaccggtgg caacaattgg tttctgtcta cggagcatct ggaactgtgg 1920 atgagccacc acaagttgca agctgaggaa ctgtcggcgc tgtacagcag ccgtgctgcg 1980 gctaaggaca gtcggctgga agatatgatc ggccgaagga accggccggc gggaacttac 2040 gcggagttgg tctctggtgg cacgagcgag ccctggatct ccgagaagat ggccggcgga 2100 ttagaacacg gaagtgaacc agatgcggtg cttcaagaag tcgaccgaga ctacacgaaa 2160 gcggaaatgg cacgcgattt gatggacgaa gcaaaggacc atgttctgat gaacggtggt 2220 cgctacagcg gcagcaaagg aaacaagctg ccgcaaagca acagcacccc gctatggatt 2280 ccaaggattc tagcgaaaaa gatggggcat cggccaacga gcgtcgatgc tggcgatgtg 2340 gatgcgcaca tggacgcaac gcatggccgg gcgaatcaag tctgcgcgat ctccgaggag 2400 aacctgcagg tgctgaacgg cgcgcagaaa acgacggctg gaaacgtcgg gaaagcgacg 2460 aacctggatg aacgaggagc agttgactta gcggtttgga agtttttggt aaatccgacg 2520 caccggatgg cccgacggat gttacgggct gggg 2554 // ID BEL-2_ASu-LTR repbase; DNA; INV; 693 BP. XX AC AEUI01005845; XX DT 08-APR-2011 (Rel. 16.04, Created) DT 08-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the pig roundworm genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_ASu_; KW BEL-2_ASu-I; BEL-2_ASu-LTR. XX OS Ascaris suum OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-693 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Ascaris suum genome."; RL Direct Submission to RU (07-APR-2011). XX DR Genome; AEUI01005845; Positions 200 892. XX SQ Sequence 693 BP; 219 A; 139 C; 177 G; 158 T; 0 other; tgggtaactg gcagaggcaa gccaaagtga tcattgacga gcgaaaggcg tcaataattg 60 ggtacggtgc gacaaccagg ctggtcgcaa ttggaaggaa ctcactaatc gggaacgtcg 120 acaccagcgg tgatcgtggt tggatcgacg ccagtggaca aagagataac tcgacggtgg 180 atcagagacg ccacgcgtca acacagaact gacaaaggca aatagcggaa accgatcggt 240 ggctcaggcg acatcgatca acaacaggga cgtaggccta ttttttaaca gtaatcatat 300 ttttgtgttt tttcttcact tctattgttt gttcacgaaa ctttggggac aaactatcct 360 atcatacatt caaaattgtt gaccatcttc acaccttcac aaaaatacaa tcaacatccc 420 agaaggggag aaataattga agaaattgtg tgaatggtca atacatggtg ccacgagccg 480 ggacattact gagttgccct tcagcaatca cagtaagttg gctttagata acgcaagtga 540 ggtaaaaaaa aaaaaaatat gtcgtctgct ggtctgcgaa gacagcttac gctgaccaag 600 ggaaggctgc agaagtatct gagtgagctg cctgcttaca gcgaggattt taagaatgga 660 caggagagaa tggttatgag gtcccagttg aca 693 // ID Transib-13_HM repbase; DNA; INV; 3046 BP. XX AC . XX DT 31-JAN-2008 (Rel. 13.02, Created) DT 31-JAN-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3046 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(2), 38-38 (2008). XX DR [1] (Consensus) XX CC Transib-13_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome less than a few CC million years ago (copies are ~3% divergent from their consensus CC sequence). The consensus sequence was obtained based on multiple CC alignment of 7 copies; it codes for a 754-aa Transib transposase. CC Like other Transib transposons, Transib-13_HM is characterized by CC 5-bp target site duplications and short terminal inverted CC repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 489..2714 FT /product="Transib-13_HMp" FT /note="transposase." FT /translation="MFISICGFLRFLSKFFFSGVICLQNSAVFDIWLTQQK FT KGKNKDALINYFIQKCVADKISDECYKYLTEKARMLNNNLSAKWEKCHRRX FT DDFQIKYSGWLCSETVINRQINLEIASTETQCIVELXPDEKIVSLTGRPQL FT KYSEKSERSKRRQAANLSMSEHNETNLLVQAASISARKQGQVDLAVVLKET FT IESPSRPSKIRKLYVDDVKAPIVMSDEEALAFLLENNFSKSQYCSIRTESK FT QRNANIYPPYDNILAIKSKCRPDGVLVDDKCAKVPLQMLLXHTAKRIIEMQ FT KEVFATITGNFISTEMIFSYGFDSSSGQSQYKQSFLDQSASRNADSCLLAT FT TVIPLRLLSSAGAPIWNNCFPQSTRFCRPLKLEYKSETKEVITREAADLNK FT QIQELENLRVEIEAGKIIEITFRLFLTVIDGKVLNVLTDTKSNQKCPICHA FT TQTDFLKVTDYNSVKFHPINGNLKYGISPLHCWIRFFEFVLHLGYKMEVKK FT WQIRSDEDKDAIKKKKEYIQNEFWKELSLRVDFPKAGGCGSTNDGNTSRRA FT FTEYDKFSKVLGVDTELIFRMRIILICLSCQFSLNLDKFEEYCFQTGQLYQ FT SKYPWLPMTPTVHKVIVHSKQIMQNTALPVGYFGEDAAESRNKIYKSDRLH FT HARKTSRIDNLFNVFHRALDTSDPLISSLRLNTRVHQRKRLTLPPEVKELL FT YCEDIGETCATENLENMDGDEDDYPEQLEEEXTFLDNELYLN" XX SQ Sequence 3046 BP; 1108 A; 457 C; 534 G; 937 T; 10 other; cacagtggtc ggaaacatca cttttcttgg ccaaaagtca aaatttttac actcacraaa 60 tatgtttttg ttttttaaat ctagtagtcw acactccaaa acaaccaaaa attaattttg 120 ttgttgtaat atgcaattca ccgttattck gtgagtcatc gaagttgaat ttgtttgctt 180 catttgtaaa cagaattgcg wtcggtgttt tatagtgctt acttcaattg cttgttaaag 240 ctgaaaacaa agygtgttaa ataaaagttt aaatggaaaa cgaaggtgat ggtaagttta 300 ttaattttgt aaaatataat actcaaaatt tggatactct aaaaaataaa ataacttttt 360 ccaaaagtct ttaacatttt ggaatttatt ttatttttta agtttgagta agcttaacac 420 attttgttta ggagtgttaa gcataagatt aaaaaccaaa attactttga agtgtttagt 480 ttttgaaaat gtttatctcg atttgcggtt ttttgcggtt tttgagtaaa tttttttttt 540 caggcgtcat ttgtttacag aattcagcag tttttgatat ttggttgacc cagcagaaga 600 aaggaaaaaa taaggacgca ctaataaatt attttataca aaaatgtgta gccgataaaa 660 tttcagatga atgctataaa tatttaacag aaaaagcacg catgcttaac aataatttgt 720 ctgcaaagtg ggaaaagtgt catagaagat kagatgattt tcaaattaaa tattctggct 780 ggctttgttc agaaactgta ataaacaggc aaataaattt agaaatcgct tccacagaga 840 ctcaatgcat agtagaacta ascccagatg aaaaaattgt atcactaact ggacgcccgc 900 aacttaagta ctctgagaaa tctgaacgaa gtaagaggag acaagcagca aacctctcga 960 tgtcagaaca caacgaaaca aatctgttag ttcaagcagc atcaatttcg gctcgaaaac 1020 aaggacaagt agatttggca gttgttttaa aagaaaccat cgaaagtcca tcaagaccat 1080 ccaagattag gaagctatat gtcgatgatg taaaagcacc tattgtaatg tctgatgagg 1140 aagcgttagc atttcttctc gaaaacaatt tttcgaagag ccaatactgt tccatccgga 1200 ctgaaagcaa gcaacgtaac gctaatattt atccacccta tgataatata ttagcaataa 1260 aaagtaagtg cagacccgat ggtgtacttg tagacgacaa atgtgcaaag gttcctttac 1320 aaatgctttt aaygcataca gctaaaagaa ttatagaaat gcaaaaggaa gtatttgcga 1380 caattacggg taacttcata tcaacggaaa tgatatttag ctacggtttt gacagcagct 1440 ctggtcaatc tcaatataag caatctttcc ttgatcaaag tgcatcacga aatgctgatt 1500 catgtctgtt ggccacaaca gtgattccct tgagactctt gagtagcgcg ggagccccga 1560 tttggaataa ttgttttccg cagtccacaa gattttgtcg gcctttaaaa ttagagtaca 1620 agagtgaaac gaaggaagtg ataacaagag aagctgctga cttaaataaa caaattcaag 1680 agcttgaaaa cttacgtgtt gaaatcgaag ctggtaagat aatcgaaata acgttcagac 1740 tgtttttaac tgttatagat ggaaaagttt tgaatgtttt aaccgataca aaatcaaatc 1800 aaaaatgtcc aatatgccat gcaacccaaa ctgactttct taaagtcaca gattataaca 1860 gcgtaaaatt tcatcctatc aatggaaatc taaagtacgg aattagtcca ctgcactgct 1920 ggattagatt ttttgagttc gtactacatt taggatacaa gatggaagta aaaaaatggc 1980 aaataagatc ggatgaagat aaagatgcaa ttaagaaaaa aaaagagtat attcagaatg 2040 agttttggaa agaactaagt cttcgtgttg attttccgaa agcaggtggt tgtggttcca 2100 caaatgacgg caacacgtcg cgtagagcat ttacggaata tgataaattt tcgaaagttt 2160 taggagtaga caccgaatta atatttcgaa tgagaataat attaatatgt ttatcttgtc 2220 aattttccct aaatttggat aaatttgaag agtattgctt tcaaactggt caattatatc 2280 aaagcaagta cccctggttg ccaatgactc ccacagtaca taaagttata gttcactcaa 2340 agcaaataat gcagaataca gcacttcctg ttggttactt cggagaggat gcagctgagt 2400 cacgaaacaa aatatataaa agcgacagac tgcaccatgc acgtaagact agccgcattg 2460 ataatctttt caatgtattt catagagctc ttgacacatc ggatccatta atttcatctc 2520 ttcgcttaaa tacacgtgtg caccaaagga agcgtttaac actgccacca gaagtaaagg 2580 aattattgta ttgtgaggat atcggtgaaa catgtgcgac tgagaattta gaaaatatgg 2640 atggcgacga ggacgattat ccagaacaac ttgaagagga awatacattt ttggataatg 2700 aattgtattt aaattaaaga gtttaatgaa attttaatat tttcgtaaat acgtgtgtaa 2760 atcatatttt caatttccag aaacaacata tttactcgta agtatcgtat attttctaat 2820 ttagaactaa aaaaataaaa actcagaaag aaaagaaaaa aaaaatttat gtaaaaattt 2880 tttttttaaa ttttaattct aaatgttrat aattttttta aaaaaaatta attattttat 2940 acaaaaacac ttaaaatata ctgcaaattc aaattccagt taaaaattcc gttattttaa 3000 gttttaatag ttttggccaa gaaaagtgat gtttccgacc aatgtg 3046 // ID L1-N5_CQ repbase; DNA; INV; 1064 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A HAL1-like non-LTR retrotransposon family from Culex DE quinquefasciatus - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; nonautonomous; KW L1-N5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1064 RA Kojima K.K. and Jurka J.; RT "HAL1-like non-autonomous non-LTR retrotransposons from the RT southern house mosquito."; RL Repbase Reports 11(1), 104-104 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >88% CC identity. CC This family encodes a protein similar to ORF1ps of L1 in CC mosquitoes. Thus it is likely a HAL1-type element. XX FH Key Location/Qualifiers FT CDS 81..977 FT /product="L1-N5_CQ_1p" FT /translation="MSSTRKNTIFVDFGVLPARPPLDSIKTFVEKSMGLNP FT AGFKCLQLHNTRNGILIEMADLATATKVAADNHMKHALRSGEKNFLIPVYV FT DDNAVDVRVHDLPPGMPNDDIADGMAQYGEVLTITDDVWKNFFPGIPNGVR FT VLRMKLXKXVPSYVSIKGQLSLATHAGQIPTCRRCGQKSHPEKTCSSAKAK FT KKTVNANQKNDKPTTTNTIVPAATTTTVSDPISKVTDDDGFRTVEKKRKSV FT KRQLSIESKTTPQEKRTQINFDSETEMDEQTLKQISKEDSAKLRTDNFMKW FT CELMYMQ" XX SQ Sequence 1064 BP; 342 A; 264 C; 245 G; 208 T; 5 other; cagtcggcta caatacacgg tgcctatcag acgtgctacc tmctcgcagc aaattctttt 60 gttttcgtgc aggctcagaa atgagctcaa ctcgaaaaaa cacgatcttc gtggactttg 120 gagtcctccc agctcgccca ccgctggatt ccataaaaac ctttgtggaa aaaagcatgg 180 gccttaatcc tgccgggttt aagtgccttc aactgcacaa cacccggaac ggcatcctca 240 tcgagatggc tgatctagca acagccacca aagtagcagc agacaaccac atgaagcacg 300 cgctccgttc gggtgagaag aatttcctga ttccggtgta cgttgacgac aacgccgtcg 360 acgtccgggt ccacgatctt ccgccgggaa tgcccaacga cgacatcgct gacggaatgg 420 cgcagtatgg agaggtgctg acgattacgg acgacgtttg gaagaacttt ttccccggga 480 ttccaaacgg tgtgcgggtc ctgaggatga aactckccaa gcmggtgccg tcgtacgtca 540 gcatcaaggg ccaattaagc ttggccactc atgctggcca gatcccgact tgccgacgat 600 gtggacagaa aagtcacccc gaaaaaacct gctcatccgc gaaagcaaaa aagaagaccg 660 tgaacgcgaa ccagaagaac gacaaaccaa caacgacgaa tacwattgtg ccagctgcaa 720 caacaacaac agtgtctgat cccatctcga aggtcaccga cgacgatggg tttagaactg 780 ttgagaagaa aagaaagagt gtaaaacgac aactttctat cgagtccaaa acaacgcccc 840 aagagaagcg cactcaaata aatttcgaca gtgaaaccga gatggacgag caaacgttga 900 aacaaatcag caaagaagac agcgcaaaat taagaacgga caatttcatg aagtggtgtg 960 agttgatgta catgcaataa aaatgtattt atctttattt ttmttaagag gcgaatgcac 1020 aaactgtgta aaacctctat aataaaaaca aacaaaacaa gaaa 1064 // ID Sola3-3_HM repbase; DNA; INV; 6048 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Hydra magnipapillata. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6048 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1569..4376 FT /product="Sola3-3_HM_1p" FT /translation="MRLNDNNIWEENGLIANRLKRNLEKDEQICAFHRYTF FT GIAWSPPKTCFHPHHLNIKGKKSPALRASPIHVVISMSKRYNEVFSVGAML FT CFTHLKQETALSRVNENTNLCTETQTQQEQTYMTPATPDEEFDPGEINVSQ FT EILDESLRSRESVTEVLNASPIKFRLKRKFEYLEDTTKDKLKRKYVRLENL FT LKKKFAEAVAPGQEEILIDFLNSKNVDEINSPLPIEIIRAQKVYKQSDSMG FT KLVILSLLDHTKYTKKFIMNTFECTKHRIETARKWHASHKGLAFPEKKVFV FT RSSLDQTKCEHFLDFIFTSGILHDVAYGITKLKYDSGEEQKIVHAILTTKY FT SHAIMFYRKSCSENNYIPLSDSSLWKVLHAIKPSSRKSLAGLDDVTASGMN FT GFQTLQKLAQRFSSKSLEAALEKGKRYLKTSYQTNCSVNDSNISSHSSKHA FT LSDPSEKNLQSNTEISEVVCADCYDLCKAIEMIKELTIQNSDDADSIYDLE FT IAVKDVFNYIKHLMRDSQQKKAKIEAFKQLNDETAFWLKDFCQKILPVRYR FT EGQREYFGKKGMSLHVDIFFIKIAGKLFKRVYFTSMYRCDQGIGDVVSLAT FT AVLDQFRIDQPHIKKMFTKSDNAGCYQGNLSAEAIYNVCKERDIKLLRYDY FT NEPCCGKDQCDRESAVVKTILRSYVDSGNNLLTAEDIHKAMHYSFGAKDAK FT VAVAQISNDKTVVTGPKIKNISNYHSFEFGEKSMKMWRYFNIGEGIEQEYG FT NLKIQPSIKLLLPYSKTDNSIKRNKSLKEKQKRSDRQLYSLRFCTEMNCTL FT SFESDAELEEHMLSGLHTVPKSLTSLDKVRNSFVHKMKITSQLNMPISSSS FT NSASVKDKPHCMNIFLLQGWALPVRSSFRFSNQQKELLYKYFIRGEESGNK FT MSPEQVHMQLRRELPPDQYVTSQQIRSLFSR*" XX SQ Sequence 6048 BP; 2229 A; 859 C; 1026 G; 1934 T; 0 other; gagtcatttt ttgtaagaat caggcagtca atggaagtga cctcctccaa attgcttgaa 60 ttttttatat attgatcaga atatagagaa aacctgttgg ggaaaaaaaa aaatttcaaa 120 aatgtttcgt tcactcggaa tctgggctta aatttttgag atttttttaa aaatttttag 180 aaatttagtt tttgccacct ttgaacccat ttttttggca aggcaaaaag ttgcacatag 240 cttttgtagt taatatcaga cgtaacccaa aagctctctc caaaaaatat aaaaaagttg 300 cgtgacactg tgcggttggc taattatcat agttcaaaag tacaaaatca atcagttttg 360 tcaattttta atcggagtta attctagcgg cgaagtgttg cacacaattt ttcttttagg 420 atttgaaatt atcaaaggta ttgagtctaa aaatgcaaag aaagttatgt gatacctaag 480 gcctattaat atattaaggc ttgaaattgg aaaaaaatgt tatagtatac ttacaaaatg 540 agctattacg gcagaggcgc taagttttgt aaaaatataa tcatgttaaa tggtcaatat 600 aagatgaccc taacatagtt taaaaaaaaa ctcatgtgat ctttagatat taagttaaaa 660 ataaaggtac aaattgttaa aaatgattaa gtacgaagag atattttttt ggacacacag 720 atgaggtttt cgctcacaat aaaatttcta tcatccaaaa ttagcaatta gcattacaca 780 aaacattgca cgtaatgtta agaaaaaatt taagcgtttc tgactatgat atagaaatca 840 taacaattga aataaataat aataaaaaac aatacatgat cagaagtgaa ttatttaaat 900 ttacgtcata accaaataac ttgtggtgat cgaatggaag aagaatcgct gatatcacga 960 ttgtggagaa acaagttagt gggcggttgt ttactttacc ataaaaatgt cttaacttta 1020 ttttaacgat ctttaaaatt tgcataaata gagtagatat tttatcattg ctattgagaa 1080 ataggtactg cataaataga acttgttttt agttgagtat ggaacaaaac aaatattttt 1140 atgaaaactg ttgttgcttt tcaaaaggaa attgtggatc attcaaaaaa cataaaatta 1200 taaacaaaat gttctacatt catcagctgg gagatgttga tgttaagaaa cacctcatca 1260 acttaaaggt aattatgtat tttaaattta cgcagactgt aatataattt ttataattac 1320 atactttata acttctctgc aataagattt tgagacaaag ttctttaaac aaaaaaaaat 1380 agctctttta attaattcct cactctttta ataaataatc actaaattaa tcacactctt 1440 gcaatattgt attttaataa taacaacaat gtatttcaaa cttgtaaaca tttttcgtga 1500 agagtcagat tataaacaat gctttaaatg acacacttaa taattgttta aaaaattgtt 1560 tttaggtcat gcgattaaac gataataata tttgggaaga aaatggactt attgcaaata 1620 gacttaaaag aaatcttgaa aaggatgaac aaatatgtgc ttttcaccgg tacacgtttg 1680 gtattgcatg gagtcctcca aaaacatgct ttcatcctca ccatctaaat ataaaaggaa 1740 aaaaatctcc cgctttaagg gcgtcaccaa tacatgtcgt catatcaatg tcaaagcgat 1800 acaatgaagt attttcagtt ggtgcaatgc tgtgttttac ccatttaaag caagaaactg 1860 ctttatctag agtcaacgaa aacaccaatt tatgtacaga aacacaaaca caacaagaac 1920 aaacatatat gacacctgct actccagatg aagaatttga tcccggtgaa attaatgttt 1980 cacaggagat tttggacgaa tctctaagaa gccgtgagag tgtaactgag gttcttaatg 2040 caagtccaat aaaatttaga ttaaaaagga agttcgaata tctggaagat actactaaag 2100 ataagttaaa acgcaagtat gttcgcctag aaaacttatt aaagaaaaaa tttgcggaag 2160 ctgttgctcc tggacaagag gaaatactta tagattttct taacagtaaa aatgttgatg 2220 aaataaatag ccctctccca attgaaatta ttcgtgctca gaaagtatat aagcaaagtg 2280 attccatggg aaagttagtt atcctctcat tattagacca taccaagtat accaaaaagt 2340 ttataatgaa cacatttgag tgtaccaaac atcgtataga aacagcaaga aaatggcatg 2400 catctcataa agggttggca tttccagaga agaaagtatt cgtccgatct agtcttgatc 2460 aaacaaagtg tgaacatttc ttagacttta tatttacaag cggtattttg catgatgttg 2520 cttatggaat aaccaaatta aaatacgaca gcggtgaaga acaaaagata gtgcatgcaa 2580 tactgacgac aaaatatagc cacgctatca tgttctatcg aaaaagttgt agtgaaaaca 2640 attatatacc attatctgat tcaagtttgt ggaaagtatt gcatgcaata aaaccttcaa 2700 gtcgaaaaag cttagctgga ttagatgatg ttactgcttc aggcatgaat ggttttcaaa 2760 cattacaaaa attggcacaa agatttagtt ctaaatctct cgaagctgcc cttgaaaaag 2820 gaaaaaggta tttgaaaaca agttatcaaa ccaattgtag tgtcaatgac tcaaacattt 2880 cttctcatag ctcaaaacat gccttatcag atccatctga aaaaaatctt caatccaaca 2940 cagagatatc agaagtggta tgtgccgatt gctatgattt gtgcaaagct atcgagatga 3000 tcaaagaact aacaattcaa aattctgacg atgctgattc tatttatgat ttggaaattg 3060 ctgtaaagga tgtattcaac tacataaaac acctgatgag agattctcaa cagaaaaagg 3120 caaaaatcga agcttttaag caattaaacg atgaaactgc tttctggctt aaagattttt 3180 gtcaaaaaat tcttccggtt cgatacagag agggccaaag agagtatttt ggcaagaaag 3240 gaatgagttt acatgtggat atattcttca taaaaatagc aggaaagtta ttcaaacgtg 3300 tttactttac ttcaatgtat aggtgtgatc agggaatagg tgatgttgtt tcgttagcca 3360 ctgcagtttt agaccaattc agaattgatc aaccgcatat caagaaaatg tttaccaaat 3420 ctgataatgc cggctgctat cagggaaatc tttcagctga agcaatctac aatgtatgca 3480 aagagagaga tataaagttg ctgagatatg attataatga accctgttgt ggaaaagatc 3540 aatgtgacag ggagagtgca gttgtaaaga caattttaag gagttacgtt gactctggta 3600 ataatctttt gactgctgaa gatatacaca aggctatgca ttatagtttt ggcgctaaag 3660 atgcaaaagt agcagttgct caaattagca atgataaaac tgtagttact ggaccaaaga 3720 ttaagaacat tagcaactat cactcatttg agtttggtga gaaaagtatg aagatgtggc 3780 gttatttcaa tatcggtgag ggaattgaac aagagtatgg aaatcttaaa attcaaccct 3840 cgattaagtt gttgttgcca tatagcaaaa cagataattc aattaagcgt aacaagtcac 3900 ttaaggaaaa acagaaaaga agtgacaggc aattatattc attaagattt tgcactgaaa 3960 tgaattgtac tttatcattt gaaagcgatg ctgagttaga agaacacatg ctatccggtc 4020 ttcatacagt tccgaaatca ttaacatcat tggataaagt tcgcaactcg tttgttcata 4080 agatgaaaat tacttcacaa ctaaatatgc caatttcttc atcctctaac agtgcttccg 4140 taaaagacaa accacattgc atgaacattt ttctattgca aggttgggca ttaccagttc 4200 gaagttcttt tagattttca aatcagcaga aagaactgtt gtataaatac tttattcgtg 4260 gagaagaatc aggtaataaa atgagccctg agcaggttca catgcagctg aggagagaac 4320 ttccgcctga tcaatatgta actagccaac agataagatc tttattttca aggtaggcat 4380 tgttgctatc aaaacttttg caaattagat tatgtttctg actgttaaat agttctttta 4440 caaatatgaa attaaaattt aggtgaaagt tataattttg tgtgtttgtt tacatatgtt 4500 taaaaatatg tacaattgat attgttatag atttagtaac ctgaaaagaa aaggtaagct 4560 ggtagaaccg acaacagaaa ataatgaaaa tagtcaggtt aacgataaca aagaagttta 4620 tggcgaaaat gatgacttta atctgacagg agacaatgaa gatgataata aatatgagga 4680 agacatcgcc aatcttgcaa aagaaatttg tcttgtatgg aaagtgaatg attgggttgc 4740 tgttgcatat gaaaaacaat ggaatattgg atatattgtg gaggtaatat tattcaaagt 4800 atatattagt ttcaagtttt tattttgttt tacaaaacag tgctctaaaa taaaaggaat 4860 ataaataaaa tgtaaaaata atttcagaat gacattatat aagactttac aagttttatt 4920 gaatgtaagc aaaattaaac aaatcataaa ctctttatta tattatttta aatgttgcgc 4980 taatactttt aggtatctat aactgggatc agagttaatt gtatgattaa cgggcaagag 5040 aagaatactt ttcgctggcc agttactacc gaggagataa ataaccaaac tgataaaatt 5100 atctgtttag taaatgctcc ttttctaatc agtggttgtg gcgattattc attatccgaa 5160 gaagactata atacagtcat atcattattc ttagagaaac ttactgcagc agagtagaaa 5220 ttatgtgtga tgtggtggat aatgtgtgta aatgactcta ttaataaatg aaaaactaat 5280 ttttaaatgg aaaaacattt aaatgtttga tttatgtttt atttaattag caatatagtt 5340 atcttgatgt tagatttttg ttcagttttt aattcatgtt atatcatgcg tgtgacaaaa 5400 tcacacgaat tttttttatt attatgttag gacctttttg tattgacagt tggatatgtt 5460 tacattttta caaaactaag cgcctctgcc gtaatggttc attttgtaag tatactaaaa 5520 catttttttc caatttcaag ccttaatata ttaataggcc ttaggtatca cataactttc 5580 tttgcatttt tagactcaat acctttgata atttcaaatc ctaaaagaaa aattgtgtgc 5640 aacacttcgc cgctagaatt aactccgatt aaaaattgac aaaactgatt gattttgtac 5700 ttttgaacta tgataattag ccaaccgcac agtgtcacgc aactttttta tattttttgg 5760 agagagcttt tgggttacgt ctgatattaa ctacaaaagc tatgtgcaac tttttgcctt 5820 gccaaaaaaa tgggttcaaa ggtggcaaaa actaaatttc taaaaatttt taaaaaaatc 5880 tcaaaaattt aagcccagat tccgagtgaa cgaaacattt ttgaaaattt tttttttccc 5940 caacaggttt tctctatatt ctgatcaata tataaaaaat tcaagcaatt tggacgaggt 6000 cacttccaca gtttcaggaa tttgcctgat tcttagaaaa aacggctc 6048 // ID Gypsy-36_DPu-LTR repbase; DNA; INV; 134 BP. XX AC scaffold_66; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_DP_; KW Gypsy-36_DPu-I; Gypsy-36_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-134 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_66; Positions 413563 413696. XX SQ Sequence 134 BP; 27 A; 39 C; 29 G; 39 T; 0 other; tggaatgacc cggcaatgat gtctccccct ccccccgtga tgtgagccag tgtgtaatgc 60 tcaagcgtag gttacagtct tgcttgtacc cggttccatt cctattacac ggacttgata 120 cagacttctt tcca 134 // ID CR1-15_HM repbase; DNA; INV; 4410 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4410 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1843-1843 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 756..4070 FT /product="CR1-15_HM_1p" FT /translation="MPYFIYYCFVFNLYFYMAIKYHCKFCNKAVANHQRGI FT LCDCCSSWVHAKCNSINKFSYNLLVKDSSDWYCSDCIIKNMPFSTLSDLEF FT SLTISCQTLPTSTLNSLESPLQFKNFFKDINKISNTYFNCKYYDILDLNKT FT LKSNSELYLHLNIGSLHFHIDELRSLISSLTILPTVXGISESKLYINDPNI FT TDITINGYNIEQCSTEAKKGGALLYLQSNLNYNVGRDLIVYAPKYLESIFI FT EIVNPFKKNIIVGCIYRHPSMNPKEFISDYFNPLLEKLSYENKQVMLMGDF FT NMDLLNYNESLIISDYLDSLCSYSFYPTIIQPTRVTXKSKTIIDNIFLNFQ FT IPNLISGNVTISISDHMAQFVCIPSTPFKRKKSITTKRSFKNFDDNKFIQD FT ISNIDWKFYIKNDDDVNESMNFFLKTFNKILDRHAPYKKLTKNQIKLKSKP FT WITNGILKSISIKNKLYKKYIKSKNSNTKSQLFNKFKLYRNKISNLLRISK FT KSYYISFFNKNLNNIKNTWKGIKEIINIKPSTPIKSFNLKFNGNTISDSMA FT VANIFNKYFETIQHPLKSKMISSKCNFSDYLKNPNSNSFFXDPVTENEVSN FT FIKNSLQNGKSFGPNSIPTFLLKLTSHVISKPLCTIINNSFKNGLFPDVFK FT IAEVIPIHKKGSLMDHRNYRPISLLSNISKIFEKAMYNRLYNFFDKXQCLY FT KHQYGFRNKHSTTHALIEITETIRKALDNKLFACGVFVDLQKAFDTVDHSI FT LLSKLEYYGIRGITLQWFSSYLYNRSQFVSINDAKSFLIKCSTGVPQGSVL FT GPLLFLIFINDLNISLKFSTAYHFADDTDLLIINKSLKKINKHINHDLSNL FT XQWLRSNKLSLNTNKTELIIFKSNQSKINKHLNFRLSGQKIFPVISLKYLG FT IKIDSNLSFSSHLKDLAVKLCRSNGMLAKIRHYVNYETLLNIYHAIFGSHL FT RYACQVWGQCKSNAFLRLSNLQNKALKIIYFQHYHSNSDILYFLSKILKLC FT DLVQLLNCLFVWNHKQVNLPPSFSNFFTAKESSRYYLRSFSNFKLSVPNHC FT SVKYGDNSIKNQSVKSWNNLPFQLKSLDSFSKFKTXLFNYFIEKYSN*" XX SQ Sequence 4410 BP; 1618 A; 662 C; 437 G; 1674 T; 19 other; tagaaggtct tcgcaaaggc gggttttttt aaccgttaaa aaactcaaag gtaaagaata 60 agttaaacag ataaatattg tttttaaaaa agtcaccttt aaatcataat caaatcttaa 120 tcaaaataat ttataaatta atttaaaaca aacaattaaa tcaaaaaatt tactaaaagg 180 taaacagaat ttcgaactaa aaaaaaaaag aaaaaaaaga aaagaactct tccgtaaaac 240 aaaaaaaaaa aaaaaaaaaa aaaaaaaacg agagagaaag aaaaaagtta accaataaaa 300 agaaacaaaa gaagaaaaga agaaggaaaa ataaagaaaa ttacaaacaa agaataagaa 360 gattggttgt tttatacttt ttttttaacg atttatttgt aataatatac tataattttt 420 tgtgtttcta attttattat tattaatttt cttgaattat tatattttaa ataatattct 480 tttattatat tattattttt cttgtattat attattattt ttcgttaata tatttaataa 540 ttttttgttc gatatttttt tttatttttt ttgtttttag ttgttgtatc tttaattgtt 600 atttatttat tatacatata ttatctgcta ttatattatt tattattaaa ttaactattt 660 aaaattctta atactatata tattttttta tgattactag ttaattatta tatcctttat 720 ttttaaatta tactatattt atttattatt gatttatgcc atactttata tattattgct 780 ttgtttttaa cttatatttt tacatggcaa ttaagtacca ttgtaagttc tgcaataaag 840 cagtggccaa tcatcaaaga ggcattttat gtgattgttg ttcttcttgg gttcatgcta 900 agtgcaattc aatcaacaaa ttttcgtata accttcttgt caaagactct tctgactggt 960 attgtagtga ctgcataatt aagaatatgc catttagcac actttcggac ttagaatttt 1020 ctttaactat atcatgtcag actctaccaa ctagtacctt aaattcttta gaaagtccat 1080 tacaattcaa aaactttttt aaggatataa ataaaatctc taatacttat ttcaactgca 1140 aatactacga tattcttgat ctcaataaaa ctttaaaatc aaattctgaa ttgtaccttc 1200 atctaaacat tggttctttg cattttcata ttgatgaact tcgcagtctc ataagttcac 1260 ttaccattct tccaacggta wttggaattt cggaatctaa attgtatatt aacgatccca 1320 atataactga cattactata aatggttata acattgaaca atgctctact gaagctaaaa 1380 aaggtggtgc tcttctatac ttacaatcca atctaaacta caatgtcgga agagatctta 1440 tagtatatgc cccaaaatac ctagaatcta tctttattga aatcgtcaat ccttttaaga 1500 aaaatatcat tgttggctgc atataccggc atccttcaat gaacccaaag gaatttattt 1560 ctgattattt taatccttta cttgaaaagc ttagttatga aaataaacag gttatgttaa 1620 tgggtgattt taacatggac ttactaaact acaatgaatc ccttattata tctgattacc 1680 ttgactcact ttgttcttat tctttttacc caacaataat tcaaccaacg cgagtaacct 1740 staaatctaa aacaattatt gataacatct ttttaaattt tcaaattcct aatcttattt 1800 ctggtaacgt aacaatatca atttcagatc acatggcaca gtttgtttgc attcctagta 1860 ctcctttcaa aagaaaaaaa tccataacaa ctaaacgatc ctttaagaat ttcgatgata 1920 ataaatttat tcaggatatt tctaatattg actggaagtt ttatatcaaa aatgatgatg 1980 atgtcaatga atcaatgaac ttytttttaa aaacattcaa caaaattctt gatcgwcatg 2040 ccccgtacaa aaaactaact aaaaaccaaa ttaaacttaa atctaaacct tggatcacta 2100 atggaattct gaaatcaatt tcaattaaaa ataaactcta taaaaaatat ataaaatcta 2160 aaaattccaa cacaaaaagt caactattta ataaatttaa attgtacaga aataaaatta 2220 gcaatttatt aagaatttct aaaaaatcat actacatctc cttttttaac aaaaacctca 2280 acaatataaa aaatacatgg aaaggaataa aagaaattat taatatcaaa ccytctacac 2340 caattaaatc tttcaatctc aaatttaatg gtaatactat ctcagacagt atggctgttg 2400 ctaacatttt taacaaatac tttgaaacta tacaacaccc actcaaaagt aaaatgattt 2460 cttctaaatg taayttttct gactatttaa aaaatcctaa ttctaattct ttctttmttg 2520 atccagtgac tgaaaatgaa gtgtctaatt ttataaaaaa tagtttacaa aatggtaaaa 2580 gttttggacc aaatagtatt cccacattcc ttcttaaact tacttctcac gttatttcca 2640 agcctctctg tactataata aataattctt ttaaaaatgg tctttttcca gatgttttta 2700 aaatagccga agttattcct atycataaaa agggttctct aatggaccac agaaactatc 2760 gaccaatttc tcttctatct aacattagta aaatatttga aaaagctatg tacaacagac 2820 tttataactt ttttgataaa watcaatgtc tttacaaaca tcagtatggr tttcgtaaca 2880 aacactccac aactcatgca cttattgaaa taactgagac tattagaaaa gctcttgaca 2940 ataaactttt tgcatgtgga gtgtttgttg acttacagaa agcctttgat actgttgatc 3000 actctatcct cttaagtaaa ttggaatact atggtattag aggtattact ctccaatggt 3060 tttcatcata tctttataat agatctcaat ttgtctccat taatgatgcc aaatcttttc 3120 taataaaatg ctccactggt gtaccccaag gttctgtatt aggaccattg cttttcctta 3180 tttttattaa tgaycttaat atctctctaa aattttcaac tgcttaccat tttgctgatg 3240 acaccgactt actcataatt aacaaatcat taaaaaaaat taacaaacat attaatcatg 3300 atctatctaa tcttsttcaa tggcttcgct ctaacaaact atctcttaat actaacaaaa 3360 ctgaacttat tatctttaaa tctaatcaat ctaaaatcaa taaacaccta aactttagac 3420 taagtggaca aaaaatcttt cccgtaattt cattaaaata tcttggaatc aaaattgact 3480 caaatctatc tttttcctct catcttaaag acttagcagt aaaattgtgt agatcaaatg 3540 ggatgctagc caaaattcgc caytatgtca attatgaaac tctacttaat atataccatg 3600 ccatttttgg atctcatctt agatatgctt gtcaagtttg gggacaatgc aagtctaatg 3660 cttttttaag rttatctaac ctacaaaaca aagctttaaa aataatatac tttcaacatt 3720 atcattctaa ctctgacatc ctttactttt tatcaaaaat actcaaatta tgtgatcttg 3780 ttcaactatt aaattgtcta tttgtctgga atcataaaca agtaaacctc cctccgtcat 3840 tctccaattt ttttaccgct aaagaaagtt ctcgttatta ccttcgatct tttagtaact 3900 ttaaattatc tgtcccaaat cactgttctg taaaatatgg tgataattct ataaaaaacc 3960 aaagcgtaaa atcatggaac aatctacctt ttcaattaaa atccctcgac tcwttttcaa 4020 aatttaaaac cygtcttttt aactatttta tagaaaaata ctctaattaa aatcttagac 4080 ttataataga ttattattac tattgttatt agtactatta ttatacgtaa tgttaatatt 4140 atttacwttt tattttcata tttattaytg ttattattac ttattggaat tatttaatat 4200 atgaatgtga tatttgttaa tattattatt gttattatta tttttgttaa cttcgttact 4260 attattatta ttaytattat tgcttttgtt attataatta ctgatataat cattgttatt 4320 gttattatta tatttaatag tattattggt attattataa tcgtcttcgt ttaatttaca 4380 ttattattat tgttattatt attattatta 4410 // ID Gypsy-36_AA-LTR repbase; DNA; INV; 218 BP. XX AC AAGE02023632; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_AA_; KW Gypsy-36_AA-I; Gypsy-36_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023632; Positions 22562 22345. XX SQ Sequence 218 BP; 66 A; 48 C; 34 G; 70 T; 0 other; tgtcatacct ctgtaactaa attaactttg cgctaataga atatcgcaca tacctatctt 60 taactcatga cctttctttc taaaaaccgt gcgagaaaca ttgtcattgt attcaagagt 120 aaacaaagaa taaacgtgta tgaattcgag tccgttcttt taattacgcg tccagttccg 180 tttgcggatt tcggtccaga tttaccccgg atacaaca 218 // ID I-65_AAe repbase; DNA; INV; 6369 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-65_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6369 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1336-1336 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 514..1869 FT /product="I-65_AAe_1p" FT /translation="MASACGGDPGGSNGRTLPVYLDGRNEFGAVTTLLMTG FT KDGKNLPIEPFIIGKSLELCAGPIELAKSENKSTRYVIQTRKPTQVEKLLK FT MTQLIDGTEVSVVLHPRLNVSRCVISAYDLLEKDEAEIVQEMSSQGVIAVR FT RILKANKEKTTAIILTFNRSVYPANVKIGVLNFKTRPYYPNPLLCFGCYEY FT GHPRVGCTNPKRCYNCSQNHEEKDVCDHAPFCRNCGKDHRPSSRQCPIYKN FT EMDIIRTRIDHNISSAEARKRVAAGNGSYAQVAAQPRLDQARMDALIAKLV FT EKDKKIEQLEANHERTSHELLAKLNAFMEKMEEKENQIKELLEDLRHRDEK FT IAKIEADNENMKNYIENILNRPRTSSQSSEPTPNKKTKTKRPTAQSTNEHT FT SRSNMSPPPKKQPNANRSPVMTRSNSNKSKCETVTSIDMDSSTELTFLDQT FT NYGPTEN" FT CDS 1853..6259 FT /product="I-65_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDRPKINNNQVLRPFNKFTVSTEQNVNTTHPHHSTAY FT LEPFRFEEWESAVPLSLPLEALHTGELVRAALPQPVDNEPNCTAGSRSQPV FT SSNAIITDNTSHRFTDHPFINIPNSSTSQILEYVSPVSDPFNLDCKTYPKT FT HSKAYPSTQTQCSTEESLGSKNRPQLRQQSFHHGSQIPTSSRKLTTDIPIL FT HGEGSVNPRQILAPALTPDVPSFWTEPAAAAGAGNLSVAQASIHQFPSQRS FT NDALEIPTANNIHASTSSTPSTIRSGTQGQVFALQWNVRSVWRNYAEFRKI FT VDDNNPIAICLQEMMTNKTNGLLNNNYDWAIHNRSESMGNGICGLGIRKDV FT PHKFISVTSSLHVCIARISKPWDLTIISIYVSQAMSGEELSNQLSHLIDFL FT EPPYVFCGDFNSMHEVWGSVQSNRRGRFLCEWAVDNSLLALNNGSPTHYQA FT SSGTFSAIDVTFVSNCLATKFSWQSDDDLHGSDHFPLLIHSNNPLPTIRST FT KRWLYKDADWKKYETIIAAKLPPNSMASIDIITQVIYDAAVESIPRTKGLP FT AKKRQVWWNEEVGKIVKARRKALRKLRKLRDGDPTKEEAKVHYNAVNREAK FT KMIDEAKKNTWSTFQEIFTAKSNTSQLWQSFNKLTGKRRNKISGLLIDGEY FT TDDPTKIADHLVNFFASTCSTASYSENFLRTRRQGEETVFPVDAGNDSPLN FT AKFSIQELFQALENANGTSVGIDDIGYPMLKHLPFHCKVTLLEAFNGVWES FT GNIPSTWKESIIIPIPKPGENSTEPNSFRPISLISCIAKTMERMVNRRLID FT HLEARNILHPKQFAFRKGKGTSTYLAQLDEIITEAANRGEHCEIVTLDIKK FT AYDRVWHKHLMEAVFECNLGGRMNKFLNNFLQQRTARVRYAGALSKSTQLE FT NGVPQGSVLSVSLFLLVMNTLFSETPNNIQVFLYADDIVIVATGKRVSYLR FT RRLQKAVSNIENWATGIGFQLSPTKSSTMHCCKLRRHRNWHLEGGNIFLEG FT TELAKPKTTRILGITFDRKCNFNQHTKQLKEDCRSRLNLLKAISRKADRKT FT LLHIGNAIVTSKLFYGLEVLRLENIEQLAPTYNQIIRTASSAFRTTPILSL FT MVESGCLPFEHLVLSTLIKRATAILELSPKESNHLWAKANAAFQSLTDTEL FT PKLCKLVTAHTRTWNQKGPTIDWSIKTKIRAGQTPQVAREIFRQVIEEKYQ FT GFEHWYTDGSLAEGKVGYGVIGPNLRVEAGLPNQCSVFTAEAKALSEAVKY FT ITNQAVIFSDSASCLKALESGKSKNYYIQCIESDSKEKDIRFCWIPGHSGI FT LGNENADAAARHGREHLATTTLVPRQDIITWMQKKIWRAHQQLWDQAPRNT FT LRIVKQNVGKPVDRSDKKEQIVLSRLRTGHAKFTKEHIFENRSPDRCTVCN FT RKLTVHHVLAACQRYNAERQEFGLSNKLDEILCNDKKAEDRTIKFIKKIGL FT YDRI" XX SQ Sequence 6369 BP; 2060 A; 1466 C; 1362 G; 1480 T; 1 other; cagttcgcgt tgagcaccca gtgtgatcgg tcgctttgcc attagatctg aagtgattga 60 tttattattc tcaacgttat tatccgggag cgccagcgct acgcctcccg gctgttttgt 120 ggaatcagtg ttctgagtgt agttaaatcg cgaaattatc cgcgagtatt ttcgctatct 180 ctaccaggcg cacaaacgag cgaagtagca tatccccaat taccgggtgc gggtggtcga 240 ccaatcggct ccatcagcgt tggagaattg tggagaaaaa ccgcaaccta ggttgccaac 300 ataggtagtg ataggttgag tgggtgaaga agagaagcga catcaacagg ctattaggct 360 acgaaccata tcgatcaggt gatcacaatc ttgtaaggta gggtcagatt tcctttttcc 420 tctttatcct tccgccgcgc cggggtaggt agtgggtgac cactaccctt atctgcggtt 480 tttgatctct acggtctgaa gtaggaacag tgcatggcct ccgcttgcgg aggtgacccc 540 ggggggtcaa atggaagaac gttacccgta tatctggatg gacgcaacga attcggagct 600 gtaactactc tgctgatgac gggtaaggat ggtaaaaatc ttcctatcga gccgtttatc 660 atcggaaaga gtcttgaact atgcgccggt ccgatcgaac tcgcaaaaag cgaaaacaaa 720 agtacacgtt atgtgataca aacgaggaag ccgacacaag ttgagaagct actgaaaatg 780 acacaactga ttgacggcac ggaagtgtcg gtcgttcttc atccccgact caacgtaagt 840 cgctgtgtta tctctgcgta cgatctgttg gagaaagacg aagccgaaat tgtgcaagaa 900 atgagctcac aaggagttat cgcggttcga aggatcctga aagccaacaa ggaaaaaacg 960 acagccatta ttctgacgtt caatcgaagt gtgtaccctg ctaatgtcaa aattggagtg 1020 cttaatttca aaactcgccc gtactaccca aaccccctcc tttgctttgg ttgctacgaa 1080 tatggtcatc cccgggtggg ctgtaccaac cccaaacgtt gctacaattg ctcccagaac 1140 cacgaggaaa aagatgtctg tgaccatgct cctttctgca gaaattgtgg aaaagaccat 1200 cggccttcta gtcgacaatg tcccatttac aagaacgaaa tggacatcat cagaaccagg 1260 atcgatcaca acatctcgtc tgccgaagct aggaagcgtg tcgcagctgg aaacggtagc 1320 tatgcacaag tcgcggccca accacgattg gaccaagcta ggatggatgc gctaatagca 1380 aaactcgtag aaaaggacaa gaagatcgag caacttgaag cgaaccacga gcgcacttca 1440 catgagttgc ttgcaaaatt gaatgcattc atggagaaaa tggaagaaaa agagaatcag 1500 atcaaggaac tgctggagga ccttcgccac cgagatgaaa aaatagccaa aatcgaagcc 1560 gacaacgaaa acatgaagaa ctacatcgag aacattctaa acagaccgag gacgagctca 1620 cagagcagtg agccgacgcc caacaagaag acaaaaacca aacgaccaac agcacagagc 1680 accaacgaac acactagccg ctcaaacatg tcacctcccc cgaaaaaaca gccgaatgca 1740 aacagaagcc cagttatgac taggtcaaac tcaaacaaaa gcaaatgcga gacggtaacc 1800 tcaattgaca tggactcctc aaccgagttg acttttctcg accaaaccaa ttatggaccg 1860 accgaaaatt aacaataatc aagttctacg acctttcaat aaatttaccg tcagcaccga 1920 gcaaaacgta aatacaaccc acccacatca cagtactgca tacctcgagc ctttccggtt 1980 tgaagagtgg gaaagtgcag taccactctc acttcctttg gaagcgctgc atactgggga 2040 acttgtccgg gccgctctac cccaacccgt tgacaacgaa cccaactgca cggccggttc 2100 tcgttcccaa cctgtttcat caaatgcgat aatcaccgat aatacttcac accgctttac 2160 agatcatcct ttcattaata ttcctaattc atcaacctct caaatcctag aatacgtttc 2220 tcctgtttct gaccctttta accttgattg taaaacgtat cccaaaacgc actcaaaagc 2280 ctatccgtct acgcaaactc aatgttccac tgaagagtct ctcggaagta aaaatcgacc 2340 acagctgcga caacaaagct ttcatcatgg gagccaaata cccacatcat cgcgcaaact 2400 gacaactgac atccctattc ttcacggaga agggtccgta aaccctaggc aaatccttgc 2460 accggccctg accccagatg ttcccagctt ctggactgag cctgcggcgg cagccggtgc 2520 agggaacttg tctgtagcac aggcaagtat ccatcaattt ccatcccaaa gatcaaatga 2580 tgccctcgaa atacctacag caaacaacat ccatgcgtca acatccagca ctccatcaac 2640 aatccgatcc ggtacccaag gccaagtttt cgctctgcag tggaatgtcc gtagcgtctg 2700 gcggaactat gctgagtttc gtaaaatagt ggacgacaac aacccaatag caatctgtct 2760 tcaagagatg atgactaaca aaacgaacgg tcttcttaac aacaactacg actgggctat 2820 ccataatcga tctgagagca tgggtaatgg catctgtggc ctcggaatcc gtaaggatgt 2880 gccacacaaa tttatcagtg tcacctccag tctacatgta tgtattgcta gaataagtaa 2940 accatgggac ctcactatta tctcaatcta cgtatcgcaa gcaatgagtg gcgaagagtt 3000 aagtaaccaa ctaagtcacc tcatagattt ccttgaacca ccgtatgttt tctgcggtga 3060 ttttaattca atgcacgagg tatggggtag tgtgcaatca aacaggcgtg ggcgttttct 3120 ttgcgaatgg gcagtcgata acagtctctt agcacttaac aacggttctc ctacacacta 3180 ccaagcatcg tcaggaacat tttcggcaat tgatgttacc tttgtgtcaa actgtctcgc 3240 taccaaattt tcttggcaga gcgacgacga tttacacggc agcgaccact ttccactcct 3300 cattcattca aacaatccgc tcccgaccat ccgttcgaca aagagatggc tctacaagga 3360 tgctgattgg aaaaaatatg aaactattat cgctgcaaaa ctcccaccaa attcaatggc 3420 atcgattgac atcataacac aagtcattta tgatgcagct gtggaatcta ttccaagaac 3480 aaagggtctt ccagctaaga aaagacaggt atggtggaat gaagaagtcg gtaaaattgt 3540 taaagcgmga cgaaaagctt tgcgtaagct gcggaaatta agagacggcg atccaactaa 3600 agaagaagcg aaagtgcatt ataatgcagt gaatcgggag gcaaaaaaga tgattgatga 3660 agcgaagaaa aacacatggt ctacctttca agaaattttc acggcaaagt caaatacgtc 3720 tcaactttgg cagagtttca acaagcttac gggcaaacga cgcaataaaa tctcaggact 3780 tcttatagac ggagagtaca ccgatgaccc aacgaaaatt gccgaccatt tggtaaactt 3840 tttcgctagt acatgctcaa cagcaagcta ttccgagaac ttccttagaa ctaggcgaca 3900 aggggaagaa acagtcttcc ctgttgatgc agggaacgac tctccgttaa atgctaagtt 3960 ttctatccaa gaactatttc aggcgttaga aaatgcaaat ggaacatcgg ttggtataga 4020 tgatattgga taccctatgc tcaagcatct tcctttccat tgcaaggtaa ctttgcttga 4080 ggcgtttaac ggtgtttggg aaagtggaaa tataccatcc acgtggaaag agagcataat 4140 tatcccaatc ccaaaaccgg gagaaaactc cacggaacct aatagcttta gaccgatctc 4200 cttaattagt tgcatagcca aaacgatgga gaggatggtc aatcgccgcc tgatagacca 4260 cttagaggct agaaacatcc ttcacccaaa acaattcgct tttcgaaaag gcaaagggac 4320 ctcaacctac ctggcccagc ttgacgagat tatcactgaa gcagcaaata gaggggagca 4380 ttgtgagata gtcaccttgg acatcaagaa ggcttatgat agagtatggc acaaacattt 4440 aatggaggcg gttttcgaat gcaatctggg aggaaggatg aacaaatttt taaataactt 4500 tctgcaacaa cggacagcaa gagttcgtta cgctggagcc ttgtcaaaat caacgcaact 4560 tgaaaatgga gttccgcaag gctccgttct ctcagtatcg ctttttcttc tggttatgaa 4620 cacactgttt tctgaaactc caaataacat ccaggttttt ctctacgccg atgatatagt 4680 cattgtcgcc accggtaaac gagttagtta cttacgacgt agacttcaga aggcggtttc 4740 aaacatagaa aactgggcta cgggaatcgg gttccaatta tctccgacga agtcatcgac 4800 catgcactgc tgtaaactta gaagacaccg aaactggcac ctcgaaggtg gcaatatttt 4860 tctagaagga actgaacttg ccaaaccaaa gacaaccaga atacttggca tcacctttga 4920 tcgtaaatgc aatttcaacc aacacacgaa gcaactcaag gaagattgta ggagtcgact 4980 aaatctatta aaggccattt caagaaaagc agatcgcaag acacttttgc acattgggaa 5040 tgccattgtg acctctaaat tattctacgg actagaggtg ctacgattgg aaaacattga 5100 gcaattagca ccaacataca accaaatcat taggacggca tcgagcgcat ttagaaccac 5160 tcctatactt tcattgatgg ttgaatcagg gtgtcttcct ttcgaacact tagtattgag 5220 cactttaatc aaacgagcaa ctgcaatatt ggagctatcg ccgaaagaat cgaaccatct 5280 ctgggcaaaa gctaacgccg cttttcaaag tctaacggat actgaacttc ccaaactttg 5340 caaattagtg acagcacata ccagaacgtg gaaccaaaaa ggtcccacca tcgattggtc 5400 aatcaaaaca aaaataagag ctggacaaac ccctcaggtg gcccgtgaaa tatttcgaca 5460 agtgatagag gaaaaatacc aaggattcga gcattggtac actgatggct cgctagcaga 5520 agggaaagtt ggatatggag taatcggacc taatctgagg gttgaagctg gactacctaa 5580 tcagtgttca gtttttaccg cagaagctaa agcactatct gaagcggtta agtacattac 5640 aaatcaagcc gttattttct ccgactctgc cagctgcctc aaagctctgg aatcgggaaa 5700 gtcaaaaaat tactacattc agtgtattga aagtgactcc aaagagaaag atatccgatt 5760 ttgctggata cctggccatt caggaatact aggcaacgaa aatgcagatg cagcagcgcg 5820 acatggaaga gagcatctag caaccacaac tcttgtccca agacaagata tcatcacatg 5880 gatgcaaaag aaaatctgga gagctcatca acagctgtgg gatcaagcac caagaaatac 5940 gttgagaatc gtgaagcaaa atgtaggtaa acctgtagat agaagcgata agaaagagca 6000 aattgtgtta agtagattaa gaacagggca cgcgaaattc actaaagaac acatttttga 6060 aaatcgttca cctgacagat gcacagtatg taaccgtaaa ttaacagtcc atcacgtgtt 6120 ggctgcttgt caaagatata acgctgaaag acaggagttt ggactaagta acaaattgga 6180 tgaaatattg tgtaatgata aaaaagcaga agatagaacc attaaattca taaagaaaat 6240 aggactgtat gataggatat aaatttaaaa tgtatttttg aatagtcttg gtaaataaat 6300 ctatttttgc agagacgaat gccccgtcag ggcaaaatct cttaaaacca aaaaaaaaaa 6360 aaaaaaaaa 6369 // ID Mariner-10_SM repbase; DNA; INV; 1342 BP. XX AC . XX DT 14-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-10_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1342 RA Jurka J.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 154-154 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 122..1237 FT /product="Mariner-10_SM_1p" FT /translation="MTKHLNSDEKTLILLHYERLLMKMGKVDLEYQTVEQL FT GKAFGVGKNYVKELRKGFIESGTLLRQSGSGRSAVDKEERMDLVVEAVRAK FT RDSTVRDIAAITEVPRSTVWRLLNESEFKICSRRTIPLLSQAHIDSRLKFC FT REQRRNNWNDWVDVDEKWFELNNTKRKERYHENSPKKKIPCVSKGNPQKLM FT VLSAIAKPRPEYNFDGLIGIWRITADYVAQRSSKNHARGDIYKIDTTITAD FT SYYDLMTKHVIPAVRNKMNFTDNIAIQQDNARPHVGKRNVTRIEEFGAVGG FT SSISXVNQPAQSPEHSSRIEKSTKGNIEELWQNIQDAYYSYDASXLKSLWG FT TKSAVIQEIIKARGXSINIPHTGVRNSXL" XX SQ Sequence 1342 BP; 456 A; 232 C; 283 G; 367 T; 4 other; tactacctcc gtcccagaaa aaaccgggat ttttattatt aggtatttta gttgttgata 60 tttaatagtt ttttaatatg cagtgacttt tgttttagtt gatcatttta ctaaagaaac 120 aatgacgaaa catttgaaca gtgacgagaa aactctgatt ttgttgcatt atgagcgttt 180 gcttatgaaa atgggtaaag ttgatttgga gtatcaaacg gtagaacaac ttggaaaggc 240 tttcggagtg ggcaaaaact acgtcaaaga acttcgcaag ggatttattg aaagcggcac 300 actacttcgt caaagtgggt ctggacgctc tgctgttgat aaagaagaac gtatggattt 360 agtggtagaa gcagtacgag caaaacgtga ttccacggta agagatattg ctgcaattac 420 tgaagtcccg cgtagtacag tttggcgcct tttaaatgaa agtgaattta aaatatgttc 480 acgaagaacg attccattat tatctcaagc tcacattgat agcaggctga agttttgtag 540 agaacaaaga cgcaacaact ggaatgattg ggttgacgtt gatgaaaagt ggtttgaact 600 gaataacacg aaaagaaaag aaagatacca tgaaaacagc cctaaaaaga agattccgtg 660 tgttagcaaa ggtaatcctc agaaattgat ggtgctttct gcaattgcaa agcctcgtcc 720 cgagtacaat tttgatgggc tcattgggat ttggagaata accgcagatt atgtcgctca 780 aagatcttcc aaaaaccacg caagaggaga tatttacaaa atcgatacaa ccatcacagc 840 agattcgtac tatgatctaa tgaccaaaca tgtaattcct gcagtaagaa ataagatgaa 900 cttcacggac aatatcgcca ttcaacaaga taacgccaga ccacatgttg gaaaaaggaa 960 tgttacacga atcgaagaat ttggcgctgt cggtggttca agcatatccr tcgtgaacca 1020 accagcacaa tcaccggagc actccagcag gatagaaaag tctacaaaag ggaatattga 1080 agaactctgg caaaatatac aagatgctta ctactcatat gatgcaagtg yattgaaaag 1140 tttgtggggg acgaaatcag ctgtcatcca agaaatcatc aaagcaagag gtrcatccat 1200 caacattcca catactggcg ttagaaactc tyttctttga ttaatagtat tttttaactt 1260 tcgtgtttgg tttattcatt aaacacgtta taccactgac tcaaattgaa aaatcccggt 1320 tttttctggg acggaggtag ta 1342 // ID DNA-TA-5_AAe repbase; DNA; INV; 2424 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2424 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1274-1274 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. TA TSDs. TIRs are 29 bp long. The CC sequence similar to DNA-TA-8_AAe is masked by x. CC The region 1334-1586 is an inserted FEILAI_AA (~94% identical to CC consensus). XX SQ Sequence 2424 BP; 744 A; 442 C; 412 G; 740 T; 86 other; taccgtaaag tgccctaatt cagcgcagtg cctaattccg cgcatttcat acaaaattat 60 ttatatatgg aaatacacat taatatttca gcaactttta atgctattcg aggctacttt 120 gattaagaat aagtttgaat cacaaataaa agcaaataag gcattttttc gcggcataaa 180 aaaataatca gtgaaggccc gtcggagtag acgttatttt tctaattttc ttagcgtccg 240 cgctaagact gtaggacaac aacaaacgcg atttctttat taaaaatgtt ttatttttta 300 tgcaatattc catggaacta cttgctacag atatgtttca tggtgcttct atgaaatctt 360 atcaaatatt agcataatta ttgagtttta ttcaaaaaag tattgcgcgg aattaggcca 420 cgtctttttt cataatgccc taattccgcg cactgttctt cgcataacaa cagacaagta 480 aaacatgcta tatccgtctt cacatctgtc tatttctctg aaaagtaact ttatgcatac 540 atacgtttca tgtacaagcg gaatatcggg aaagcccact taggcagcgt ccataaatga 600 cgtagcatct tttgaccaat ttttgacacc ccttccccct tcgtagctta ttttcccata 660 cttaatacat ggttcgtagc aaaatcgtag actccccctc ccccctaaaa tgctacgtca 720 tttatggacg gtcccttaca gggaaaaata ccacgtggtt tatgcacagc cccaacaaaa 780 tagtctatac acacttagat ttttttcacg agctcggctg tgcggatctc ggttgaaatt 840 agccgagatt cggcattctg aattaagtgt gtacatctgt gcgcacaaaa tttacttagc 900 ataatgaacc acacataaca ttggcaatgt atacttgttt atgattctta tttatattat 960 acaatccata tagaattatt gtaaaaacaa catatcgttc acatgttcaa caggtcgtaa 1020 tttgtagaag accccacaaa ttacgacaca ctctcgggaa atgattcacg gaaactttac 1080 gatctctaca taatttacga agcctccata caaaaagtat gataatacgg gggagaggga 1140 ttgaaaatgg ccatattttg tgtgacgtaa tttatggatg acccttaggt tcttctgagg 1200 tttctttttg cttaaaaaaa taagtttcag ttacccgtgg cattagtgga taccattaat 1260 tgatttgctt aaaacccttt gattccaagc aataatgcga actggtcctt gtattgtatg 1320 aacagtgcag tttgctacaa agcaaagcca tgctgaaggt gtctgggttc gattcccggt 1380 cggtccagga tcttttcgta gtggaaattt ccttgacttc cctgggcatg aggtatcatc 1440 gtgcctgcca cacgatatat acgaatgcaa aaatggcaac tttgacaaag aaagcttcag 1500 ttaataactg tggaagtgct cataagaaca ctaagatgag aagcaggctc tgtcccagtg 1560 aggacgttaa tgccaagaag aagaagxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1620 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxgagtattt 1680 catttctaaa ctttccagtt cagtgaataa aataattaaa ttgttagatt ctatgggata 1740 atttcacaac ataaaatagg gtgctcatat caccccatcg acagaagaat cgacgaaaaa 1800 ttcattcgtg ttttttttta agacgacgga aatcgtgcgt atctagtgta ttgatctaaa 1860 aatttcaatt attacgactt ttcttaagat ggccaaattg ccccactttc ccctagacga 1920 aattcacacg taccaatgcc acggttttgg ttatctttat ttcatgtgat agacaaaaag 1980 ttaataaaaa aaactgaacg tagactaaaa agttacggtt catacaaaac tctacgtttt 2040 ctcatacaaa aagtgttacg gaaggggagg gggttgaaaa ttttcaattt taagtgttac 2100 gtaactaatg gatgcagccc cttatcaaaa ttaatgttaa tttgattgtt tcatgagtgc 2160 gcggaattag gcattcttct gcgcggaata tggaaaagcg tttaaaaaat gcgcggaatt 2220 aggcaatttc aacgagtgaa ctatttcaaa aaaatcacaa ttgtaaatta tgaatacagt 2280 ctagtttata tttttcccat aatcttgaat agatttgcta gcgatccacc cataaagctt 2340 ttttgttgat gcaatactaa tttttaatta acaatttgaa tcgttttatt ccttaactgc 2400 gctgaattac ggcactttac ggta 2424 // ID GIZMO1_EI repbase; DNA; INV; 1691 BP. XX AC GIZMO1_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 18-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Gizmo-Ei1 (GIZMO1_EI), a new member of the Tc1/mariner DNA DE transposon superfamily from the single-celled eukaryotic DE reptilian parasite Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; Gizmo-Ei1; KW GIZMO1_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-1691 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; GIZMO1_EI; Positions 1 1691. XX CC The TIRs of Gizmo-Ei1 are 84-bp long (with only two mismatches) CC and are flanked by TA putative TSD. The element contains a CC single ORF, which can potentially encode a 294-aa protein 43% CC similar to the ISAn1 transposase from the prokaryote Anabaena sp. CC and with a DD32E motif. There are several elements closely CC related to Gizmo-Ei1 in the E. invadens, E. moshkovskii and E. CC terripinae genomes. It seems that Gizmo elements belong to a CC distinct clade of Tc1/mariner elements most closely related to CC the IS630 group of bacterial insertion sequences than to CC established eukaryotic clades of the superfamily (e.g. mariner, CC Tc1, pogo?). XX FH Key Location/Qualifiers FT CDS 607..1489 FT /product="GIZMO1_ORF" FT /translation="MREYIKGMMVMDDIVNVEDIVEIREETFKEIVEHPEI FT IVGIDKKMEKKNKIVPSITSITNFMRGKTGVDEDRELPILSFKRETTRGAE FT SNSEVNKDKRINAIQKLYARMCEGYSWVCLDETLWRIATTSAYGWSERRKK FT CFITKSKRGLRLTSVCAIDVNGMSYCDIVNGINDKPLFNTYFKRLMQYYDQ FT RNVRVVFFWCDNCGIHNDIEELVIGTRHCVVFNAAYSQELNPIENIFGIWK FT RRAEREIRTWTTLESLLEKLSNAFTSIETPDVVASLERCRNTVWQKVFSRD FT NLX" XX SQ Sequence 1691 BP; 645 A; 228 C; 323 G; 495 T; 0 other; tattaaattt tatcaacaat attatacaac attatcttta ttaaaatgaa attatccaca 60 ttttatttaa atcaaaaata aaaacaaagc atcctttttt taaaaaacaa ctttttatta 120 aaaataaaat aagtaactcg gctatggttt acataaacgc aagagaacat gttgttagag 180 aacaatttat agatagcgca ccagaaccac ttcctggtaa tccttcaaca agtacaacct 240 cggttctatt tattgaacca atgccgcccc cccaaaaaat gaagaagact aaaaatgatt 300 agagagggac ccattccatt gtcaacgagt attaccaaac ctagaaaccc gtatataaga 360 attgcattag aggcgaaaaa aggctaatcg cattttggag aaaaaaatgg agatgaatgg 420 caactaggcg aatatgcctc caattgtgtt attcaaaacg acaattgtaa gtcattgttg 480 accaagttaa gaaaggggga tagtatattg ccaaagaacc attataataa aaaaagtcgt 540 gtgactccat atcaagtata ggtaggaaga cttttggaga atgactctac gacaaaggtg 600 catcagatga gagaatacat aaagggaatg atggtaatgg atgacattgt aaatgtcgaa 660 gacattgtag aaatccgaga agaaacgttt aaagaaattg ttgagcaccc ggaaattatt 720 gtgggaatag acaaaaaaat ggaaaagaaa aacaagatcg ttccatctat tacgtctatt 780 acaaatttta tgagaggaaa aactggtgtt gatgaagaca gagaacttcc gatattgagt 840 tttaaaagag agactacaag aggtgctgaa tcgaactcgg aagtaaacaa agacaaaaga 900 ataaatgcta ttcaaaaatt gtacgcaaga atgtgcgagg gatactcgtg ggtgtgtttg 960 gacgaaactt tatggagaat agctacgacg agtgcgtacg gatggagtga acgtagaaag 1020 aagtgcttta taacaaaaag taaaaggggg cttcgattga caagtgtatg tgcaatagat 1080 gtaaacggaa tgagttattg tgatattgtt aatggaatta acgacaagcc tttattcaac 1140 acgtatttta aaagactcat gcaatattat gatcagagaa acgttcgcgt tgtttttttt 1200 tggtgtgata actgtggaat acacaacgac atcgaggaac ttgttattgg cactcgtcat 1260 tgtgtagtgt ttaacgccgc atactcgcaa gagttaaatc ccatagaaaa tatatttgga 1320 atttggaaac gaagagcaga acgagaaata cgtacatgga caacacttga atctttactg 1380 gaaaaactta gtaatgcgtt cacttcaatt gaaaccccag atgttgtagc ctcgttagag 1440 cgttgcagaa atacagtatg gcagaaagtt tttagtcgag ataatttata agttggtttc 1500 tttttgatat ttcgaatgat atttttttat tattgaaaat atcctgtcaa cctaaataat 1560 taatttttaa attataaaaa aatgttatca ataaacattt ttgttgtttt tatttttgat 1620 ttaaataaaa tgtggataat tacattttaa taaagataat gttgaataat attgttgata 1680 aaatttaata a 1691 // ID Copia-1_RP-LTR repbase; DNA; INV; 275 BP. XX AC ACPB02045731; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_RP_; KW Copia-1_RP-I; Copia-1_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-275 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02045731; Positions 1211 937. XX SQ Sequence 275 BP; 80 A; 34 C; 62 G; 99 T; 0 other; tgtttaatta atacattggt aattttgtct ttgtgggttg gcggcattgt ggcgtcagta 60 atgcggcttg tagcgtctgt acagcgtcat tgttgttgtt gttgttgttg ttgttgttgt 120 tgatgtgtac tgggagttag cgtcaagtgt atatagtacc ttgtgtaaga aagtacaatt 180 aaaataaaaa ctataaaaca tataagtgtt gtttattcgt agcagctgca gcagaaacct 240 cacaattaaa aacattaaca agaaaacttt taaca 275 // ID TCSAT1 repbase; DNA; INV; 196 BP. XX AC K00393; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.cruzi satellite dna. XX KW SAT; Satellite; Simple Repeat; Repetitive sequence; TCSAT1; KW Satellite repetitive element. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-196 RA Sloof P., Bos L.J., Konings F.A., Menke H.H., Borst A.P., RA Gutteridge E.W. and Leon W.; RT "characterization of satellite dna in trypanosoma brucei and RT trypanosoma cruzi."; RL J. Mol. Biol 167(1), 1-21 (1983). XX DR GenBank; K00393; Positions 1 196. XX SQ Sequence 196 BP; 52 A; 52 C; 52 G; 37 T; 3 other; ctcgcgaaat tcctccaagc agcggatagt tcagggttgt ttggtgtcca gtgtgtgaac 60 acgcaaacag ayattgacag agagtgcctc tgactcccrc cattcacaat cgcgaaacaa 120 aaatttggac cacaacgtgt grtgcagcgg ccgctcgaaa acgatccgcc gagtgcagca 180 cccgtgtggg caagag 196 // ID BEL-70_AA-LTR repbase; DNA; INV; 961 BP. XX AC supercont1.276; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-70_AA_; KW BEL-70_AA-I; BEL-70_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-961 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.276; Positions 187722 188682. XX SQ Sequence 961 BP; 239 A; 224 C; 242 G; 256 T; 0 other; tgttcgcaat agcgaaaact caaaacggat gcacacccct attacgagca gcggaacgct 60 ttcaacatga aacctctacc acacaaatac tcactcttgt ttgtgtgcgt aacaagtcac 120 gccacctcaa tcggaaacga agccaaccgg cttgcgtccc agcgcagacc agcccagcca 180 atcagaggga aggacttttt tttgacacta taaaagaggg agtaaccctc atacagttga 240 tcagttttgt aacgaatttt gaagaataaa catcagtgct aagtacagtg aaagtgtttt 300 tacgatcgga gagaaaccgt cccagcccag cttggagtgc tgattcctga tcgatgccga 360 tttgttcctg tgccgtgcca aaaactgctt ggtgccagtg tttcgagagg gtgatcttcc 420 tttcgctcga tgtgagctcg ctacttgtgg gaaaaatttc acgagctctt ttcagagcta 480 attaccaaga gttactcaag tagacgacgc tctttgctga gtgaattgga agacatttgt 540 ggattttcaa tcggcaccaa gcacattctc agtggcagcg agttagcggt agtgccgtcg 600 gtgtgaaaaa aagtgaacgc gggtttgttg gtagtggacc aactcgtcgt ccattcagcg 660 tcggtcgcaa gttatttctc cattttgtgt gcgtgtgtgg tagtagtgca ttgccacatt 720 gtcgcgttat tgtcgaccat ccagcgtcgg tagtcttcgc catcggcagt gagccgaatg 780 cttcgtgagt atcatcgagt gtgcatgtat tggcagtgtg ccattacgat ctcacgtcat 840 catcgtcggc agtgagccga ctcgaacgat cagtgtgtgt gtatggctag cagtagtcag 900 caatgggctg attatttgcc gccacgaatt catcgtcccg aatgttggca atgagccaac 960 a 961 // ID Gypsy-9_DPu-LTR repbase; DNA; INV; 170 BP. XX AC scaffold_14; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_DPu_; KW Gypsy-9_DPu-LTR; Gypsy-9_DPu-I. XX NM Gypsy-9_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-170 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 734-734 (2010). XX DR Genome; scaffold_14; Positions 742040 742209. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 170 BP; 46 A; 34 C; 49 G; 41 T; 0 other; tgtgatgtat tgaatgcgat cacagatggc gcacgaaagg cttacgccag agggccacgg 60 agtgaacggc attcgagttg agaggagagt ggaggaaggt cttaccattc ccgtcttgtg 120 tatttcatac gccttgtgga cttagaatac agcttaacgc agacttaaca 170 // ID TTAA4_AP repbase; DNA; INV; 433 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA4_AP. XX NM TTAA4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-433 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1784-1784 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 433 BP; 136 A; 70 C; 82 G; 145 T; 0 other; gaggacgcta cacccgcatg cgttgtctcc gtcttacaaa tgcacaacat agcaaaaact 60 gttttgcgcg ggacaactat gctccctctg tatttatagt agaattacta aaattccaca 120 acgcataggg aagaacttta tctgtgtcgt agcgttttta ttatttggat aacattaata 180 taattaaagt tattagtttg agaacattaa gttttttttg tttttttcgt ttaattatta 240 aaaactattg gtaagtgcta taaaaaaaaa aaacaaaaac gctacgacac agattaagtt 300 cttccttatg tgttgtggaa ttttggtaat tctactataa ataaagaggg agcatagttg 360 tcccgcgcaa aacagttttt gctatgttgt gcatttgtaa gacggagaca acgtatgcgg 420 gtgtagcgtc ctc 433 // ID SINE_CP repbase; DNA; INV; 284 BP. XX AC X79504; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE C.pallidivittatus Cp1 (SINE transposable element). XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE_CP; KW transposable element Cp1. XX OS Chironomus pallidivittatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RA He H., Rovira C., Recco-Pimentel S., Liao C. and Edstrom E.J.; RT "Polymorphic SINEs in chironomids with DNA derived from the R2 RT insertion site."; RL Unpublished. XX DR Genbank; X79504; Positions 1 284. XX SQ Sequence 284 BP; 102 A; 55 C; 49 G; 78 T; 0 other; ctaagcgagc gaatgcttat tcaaaatgag aataagtcgt tctgttagca gagcaagctt 60 aagcttgaca gagtctcgaa ttcatcaaaa agtagcctaa gctactacga attcagacaa 120 aacagaacat cagacttaac ttctcatata ttccctttga aaatattatc tctctaagcg 180 agcgaatgct tattcaaaat gagataagtc gttctgttag cagagcaagc ttaagcttta 240 cagtagtctc gaattcatta aaaagtagcc taagctacta cgaa 284 // ID Gypsy9-I_AP repbase; DNA; INV; 4042 BP. XX AC Contig13293; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9AP; KW Gypsy9-I_AP; Gypsy9-LTR_AP. XX NM Gypsy9-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4042 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 453-453 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [3131-3646] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 51..962 FT /product="Gypsy9-I_AP_2p" FT /translation="MQNWGDIWALGDANANNPSANADNSSASANSDPAQTN FT VNAENLPRPLAPTDASAVMPFDISAFSSVRLPGFWRHSPQQWFTHAEAIFH FT NQRVRSDLTRVNHVLAALDEDGVRTIGDLLGADVQYSAVKSRLITAYDVPQ FT ATRFRSIIQHGGMGDRRPSQMLRDMRSALPSGIGESTLKEFWLQKLPPNIL FT AIVSGLDGSLESLAERADRVADASAGHNLAAVSNETDRFHSMENAISALTA FT QIAALVSTQGTQHRPARSQDRSRSKSRNRSKSRTHNDAWCFYHNVYGTKAQ FT KCRDPCSYESEN" FT CDS 887..4039 FT /product="Gypsy9-I_AP_1p" FT /translation="MVFLPQRLRYQSTEMPRPVFVRVGKLTEAAVVNSPPA FT AHHTYRLFVTDIRSGGRFLIDSGAEISVIPPGNGPRRSSDIVLNAANGTRI FT ATYGPRELHLNLGLAREFVWTFETADVARPIIGADFLHHFGLLVDVRRKRL FT IDCVTNKCVKAISVVNIDVDAIKVVPLPQKWSDILKSFPSVTRESPVPDKF FT LHRIEHELHTTGPPLFSRPRRLPPDRHRIARQEFEYMRQKGICRPSASAWA FT SPLLLVPKKDGTFRPCGDYRRLNEVTRADRYPLPHLHDFTTNLAGNTVFTK FT LDLVRAYHQIPIAKQDIPKTAVTTPFGLFEFPVMCFGLRNAAQTFQRVIND FT MLRGLNFSFAYIDDVLIASRNHEEHEQHVRTVLGRCQDYGIAINPVKCVFA FT VDSLTFLGHNINKDACRPTSERITAIHEWPLPATKKSLQRFLGSVNFYRRF FT IRSAAELQAPLYDLASKIKKRDGPLSWTDETQKVFESCRAALADTAELAHP FT LPNVPLRLSTDASNTAIGAVLEQLSDKGWQPLGFYSKKLSAAEKKYSTYDR FT ELLAAYSSTRHFLHAIEGRLTTLLTDHKPLTYMFTQKTEKVIDRQIRQITF FT LSQYINDVKHLQGDKNIIPDALSRLEVSATQVVLPDLKKWSADQADDPELK FT EILSGATKSALQLEPRSTAEGTIYLDVSTKQARMYVPSQHRRTVFDGLHGQ FT AHGGGNATSRLIGTRYCWPGMNRQIKHWVKCCIQCQRTKVLKHTVSPLTPF FT APPDRRFGHIHIDLVGPLPPSNGCRYLLTCVDRFTRWPEAWPIDNMSAHAV FT ATTLTTQWISRFGVPDMVTTDQGRQFESELFTTLTKNLGIQHLRSSPYHPQ FT ANGMVERLHRTLKTALTAHDTVNWSLRLPIVLLALRNTVKPDIGHAPAEMV FT YGMSLRLPGDMFHPAPIEAKTAPELVRNLRDSMTQLQPTPGSNHATQRYIF FT VPPDLNKVTHVFLRVDAVQTPLQPRYEGPYAVLERLGKNFKIQRDNGTVLV FT SIDRLKPAFVLREDPTAADHTYATHAEVTKQLKKRVRFSFRPQGE" XX SQ Sequence 4042 BP; 1065 A; 1224 C; 920 G; 833 T; 0 other; aactggtgac ctctctagta aacccttcca cgttcgttca attacgcgtt atgcaaaact 60 ggggagatat ttgggctctc ggcgacgcga acgccaataa tccgagtgcg aacgccgaca 120 attcgagcgc gagtgcgaat tccgaccccg cgcagacgaa cgtaaacgcc gaaaatttac 180 cgcgtcccct tgcgcctacc gacgcttcag cagtaatgcc gtttgacata agcgcattct 240 catccgttcg attacccgga ttttggcgac actcgccaca gcagtggttt acacacgcgg 300 aagcgatttt ccacaaccaa cgtgttaggt ccgatctgac acgcgtaaac cacgtgctcg 360 cggcacttga cgaagatggt gttcgcacca tcggagatct actcggcgcc gacgtgcaat 420 attctgcagt aaaaagtcga cttatcaccg cgtacgacgt cccacaagcc acgcggttcc 480 gttcgatcat tcaacacggg ggcatgggcg accgacgtcc ttcgcaaatg ttacgcgaca 540 tgcgtagcgc gttaccaagc ggaatcggcg agtcgacgtt aaaagagttc tggctccaga 600 aactcccacc aaacatcctc gccatcgtct ccggtctcga cggttcactg gagtcattgg 660 ccgaacgtgc tgaccgagtc gcagatgcaa gcgccggtca taacttagcc gccgtgtcga 720 atgaaaccga tcgttttcac tcaatggaaa acgcaatttc agcgctcacc gcacagattg 780 ccgcattagt gtcgacacag ggcacacaac accgaccagc ccgcagccag gacagatcgc 840 gatcgaaatc gcgcaaccgc tccaagtccc gcacacacaa cgacgcatgg tgtttttacc 900 acaacgtcta cggtaccaaa gcacagaaat gccgcgaccc gtgttcgtac gagtcggaaa 960 actgacggaa gcggcggtag tcaactcgcc tcccgccgct caccacacat acagattgtt 1020 cgtcaccgat atccgttccg gcggacgctt tctcatcgac tcgggagccg aaatctccgt 1080 cataccaccg ggcaacggcc cgcgccgctc atcggacatt gtcttgaacg ctgcaaacgg 1140 cacccgaatt gcgacatacg ggccaagaga actgcacctc aacctcggcc tcgcacgtga 1200 attcgtttgg acgttcgaga ccgcagacgt cgctcgaccg ataatcggcg cagacttttt 1260 acatcatttt ggcctcttag tcgacgtacg acgtaagcgc ctaatcgact gcgtcaccaa 1320 caagtgcgtc aaagctatct ccgtcgtcaa tatcgatgtg gacgctataa aggtcgtacc 1380 cctacctcaa aagtggtcgg atattctaaa gagtttccca agtgtcacac gcgagtcacc 1440 ggtcccagac aagttcttac atcgaatcga acacgagctt cacacgaccg gtccgccgtt 1500 attttcgcgc ccgcgccgtt taccccccga tcgacatcgc atcgcgcgtc aggagttcga 1560 gtacatgcgc caaaaaggca tttgtcgacc ttctgccagc gcttgggcca gcccattatt 1620 actcgtacca aaaaaggacg gcacttttag accatgtgga gattaccgac gtttgaacga 1680 agtaacgaga gcggaccgtt acccactacc tcacctccac gactttacca cgaatctggc 1740 ggggaacacg gtgtttacca aactcgacct ggtccgagcg tatcaccaga tacctatcgc 1800 caaacaggac attccgaaga cagccgtaac cacaccgttc ggtctgtttg agttcccggt 1860 aatgtgcttc ggcttacgta atgccgcaca gaccttccaa cgtgtcatta acgacatgtt 1920 acgcggatta aatttctcgt tcgcctacat cgacgacgta ttaatcgcgt cacgtaacca 1980 cgaagaacat gaacaacacg tacgcacagt cctcggacgt tgccaagatt acggcatcgc 2040 aattaacccc gtaaaatgcg ttttcgcggt cgactctctg acttttttgg gccacaacat 2100 caacaaagac gcatgccgcc cgacgtcgga acgtatcacg gcaatccacg aatggccact 2160 accagcgacg aaaaaaagct tgcagagatt tctgggctct gtaaattttt accgacgctt 2220 tattcgcagc gccgcggaat tgcaagcacc tctgtatgac ctcgcctcga aaattaaaaa 2280 acgcgacgga ccgttatcat ggactgatga gacacaaaaa gttttcgaat cttgtcgcgc 2340 cgcgctggcc gatacggcag aactggccca cccactgcca aatgtaccac tccgcctaag 2400 taccgacgct tctaatacag ccataggagc cgtgctcgaa caactctccg acaaaggctg 2460 gcagccgctg ggtttttact ccaaaaaact ttcggcagct gagaaaaagt acagtacgta 2520 cgatcgcgag ctgctcgccg catattcaag cacacgtcat ttcctacatg cgatcgaggg 2580 acgactcacc acgctgctga cggaccacaa accactgact tatatgttta cacaaaaaac 2640 agaaaaagtc atcgatcgcc agatcagaca aatcacgttc ctgtcacaat acataaacga 2700 cgtcaaacat ctacaaggtg acaagaacat aattccggac gcgttatcaa ggctggaagt 2760 ttcagctacg caagtagtct tacctgacct caaaaaatgg tctgctgatc aagccgacga 2820 ccccgagctg aaagaaatcc tgtccggcgc taccaagagt gcacttcaac tcgaaccacg 2880 aagcacagct gaaggtacca tttacctgga tgtttcaacg aaacaggcgc gtatgtacgt 2940 accgtcacaa caccgccgca ccgtttttga cgggctgcac ggccaagccc acggcggtgg 3000 caacgctact tcgcgtctta tcgggacaag atactgctgg ccaggaatga acaggcaaat 3060 caaacattgg gtcaaatgct gcatacaatg tcagcgcact aaggtgctga agcacaccgt 3120 atcaccctta acgcctttcg caccgccaga tcgccgtttc gggcatatac acatagattt 3180 agttggtcct ctgccccctt ctaacggctg caggtatttg ttaacgtgtg tagatcgttt 3240 cacccgttgg ccggaggcat ggcctattga caatatgtca gcacacgccg tcgcgacaac 3300 tctaacgacg caatggatct cccgttttgg cgttcctgac atggtcacta ccgatcaggg 3360 ccgccagttt gaatccgagc tgttcaccac tctcacaaaa aaccttggca tccaacacct 3420 caggtcatcc ccatatcacc cgcaggccaa cggtatggtg gagagactac acaggaccct 3480 caaaaccgcc ctcacggcac atgacaccgt gaattggagt ttacgacttc ccatcgtcct 3540 gctggcactc agaaatactg tcaaaccgga cattggccac gcacccgccg agatggtata 3600 cggtatgtca ctgcgactac ccggcgatat gttccatccc gctccgatcg aagcgaagac 3660 tgcgccagaa ctcgttagga atctacgcga ttcgatgaca caactgcagc cgacacccgg 3720 ttcaaaccac gctacccaaa gatacatttt cgtgccacca gacttgaaca aagtcacaca 3780 cgtttttctc agagtcgacg cagtgcaaac gcccttacaa ccacgatatg aaggtcccta 3840 tgcagtacta gaacgactcg gcaagaactt caaaatacaa cgcgacaacg gcacagtctt 3900 ggtgtccatc gaccgtctca aacccgcgtt cgtgctcaga gaggatccca cagccgcaga 3960 tcacacgtac gcgacacacg cagaagtcac caagcagctc aaaaaacgag tacgtttttc 4020 ttttcgccct cagggggagt ag 4042 // ID BEL-5_DPu-I repbase; DNA; INV; 8133 BP. XX AC scaffold_283; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_DPu_; KW BEL-5_DPu-LTR; BEL-5_DPu-I. XX NM BEL-5_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-8133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 657-657 (2010). XX DR Genome; scaffold_283; Positions 31394 39526. XX CC Positions [7187-7750] - Integrase core CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 2456..8131 FT /product="BEL-5_DPu-I_1p" FT /translation="MFPVSHAATLSASHPTMVTASHSATLPASFQPTLPAS FT FQPTLPASHPATFSASHPAMFPVSHPATLSASHPTMVTASHSATFPASFQP FT TLPASFQPTLPASHPATFSASHPAAVSSSHPATLPASFQPTLPASHPATVS FT SSHPAMFPANIPPTVTASHHVTFQPSFPPTVTASHHAAVPASIPPILSTDH FT PAMVPSSYPAMLPASYPGTYSHQTYPPPPLPQQQPAGPSRTTDSWITDPNW FT NVTQSHPSSWLASKLPRINLAPYDGDPRNWSYFIQNFKCLVHDVVPDDAQR FT IIFLKDYLSEKVRNSIANSLRDPAQYRSALAKLQRRYGNPQLVVRAHVQSL FT IQLVGPKEGDFDALCVFSGAIQAAVADLSNGGHLHDLLAPGLLYQVTSKLP FT PALMQKWGEEMCSLQPRAATLIDFDQWLESRVMAGTWAAPALQPTSNAGRT FT RERAPKGKEETKMPAVRHPRILHTSHRTSCVACTGPHPLEKCNKFSKLSPK FT ARAELLLEKGGCFRCLNPGHNSKACEKKVACGQDNCKSYHHPMVHGAPRMF FT APPKSPQSDTPKTEPATPAAPSILKKGKNFCITSKKTSPDQVLLAVVHLKI FT EANGYSFETYGLLDPGAEMTLVTKDVLRMLKLNPPREKMQMGTLHGDGPLL FT SVQRADLQISSLDDSFSFSAYEVPALTTFNLHPGYVDWPKEKKKYPHLADL FT DLPPVDFSKITILLGSDYQEALEPLQIRKSTSESPGPWAIKTVFGWCARGP FT LYKADGQHSPPSINLIQNRKRPNLNEMVHQLWSIESFGTHPDVKAPISGDD FT ARALKLLEENVRFLGDRYEAPLLWKLDQPNLPNNYSVALSRFHTNDRSLKK FT CPVKAKSYEATINDYIMQGHARKLQEDELEGPIGLTWYVPHHAVFHPDKPG FT KCRVVFDGASTFRGISLNSCLLTGPDLLTSLIGVLLRLRERPIAISGDIKG FT MFHQVRVRSVDKHVLRFLWRPPHTAGPPDVYEMQVQIFGAASSPTVCSYVL FT RRAAADNEEEFPGLVEKVATNCYMDNYMDSFGTEEEAIEFRNKFQDGLKKG FT GFHWTQWMSTSRKVLQSVPEDLRSDPKLNLNLDALPAEKTLGVTLNWQEDV FT FVFKVKIQTGSDTYRTILSDVCGLYDPLGFWTPVTLTARILLQDICRIKPE FT WDEILPEPLLERWRIWVSNLHHLESVAVPRCLQPFPPTSVQLHIFVDASEL FT GFGAVAYLLLQNKKTSTTVFVMSKSRVAPIHHLTIPRLELSAALLGARLAR FT TIKNELRLKIDGEVFWSDSSTVLRWINSTHCRYHTWVANRVGEILTLTTPE FT QWRHVPGILNPADDCSRGVEASALTENYRWWTGPDFLLQPRTNWPAMPEHF FT AQPVDDPEINLSKWAGNVSVPTPHPLLPILQKSSSMCRLKRIIAWVLRFIH FT NKLKPAAERRSKPYPIIPELREAETILVRLAQREGYGAEYKRLSEGKELDP FT GSKIATLTPFFDEEAKLIRVGGRLGYSNFPLAIKHPILLPADHHVTKLIVN FT REHLLLRHATVERTVAGLHKKYWIPRSRVSVNRIIKNCFDCKRHRAKPDIP FT LMSALPIHRLQDDRPAFSNTGIDYFGPILTTVGRRREKRWGIIFTCLVSRA FT VHFEMAYFLDTSSFLLAFWRFARRRIRPDVIYSDNGTNLTSGEKELKAGLD FT RLNQTAIANELGAQNIEWIFSPPSAPHFGGVWESLIKSAKDSLSFILQDHT FT FPDEVLLSALIEVESIMNDRPIGRCSTDPNDFSVLTPNHLLLARANPSLPA FT DVVYESSPNSRQRWRNAQIIADHFWRRWTAEVLPSLLKRKKWYVDRRHIQV FT GDLVLIVDENVTRGRWPVSYVDEVYPGKDGIPRSALVRNKDGVYHRPVSKL FT CVVEESNLLISDASGNDAG" XX SQ Sequence 8133 BP; 2072 A; 2347 C; 1839 G; 1875 T; 0 other; caatggtgca tcgaccggga aagtgatccc gctcggcctc tcgtggcgag acgatttctc 60 ccctctcaag tcccgcgtgg caggaccaga catatcggtg attatcacaa acatctgata 120 cattcgtgag tagttacgca gttcgagtat atgtctaaat ttaacttgta aaattgaacc 180 aactattgtg cacgaagtga caagctgtaa tttgtgtgtg tgacgtcgag tgtcgtgtgt 240 ttattggccg cgtcgctttc gcagcttata gtccgccaat tcggccgact ttatcgcccg 300 aacgactcca ccaacaactc gccaccctga ttcaccatca tggcggatcc aacggccgac 360 gacgacgagg aattgcacac cgaggatccg gaagcattga aggtgcttcg agccaacgcg 420 aaacggcgat tcacaaatct gataaagctc gctcgagact tgatggccga tcatggcagt 480 cggacatcaa taaaaaatag aagacaggat ctcatcgtcg ccttccagga atgcagccac 540 cacaacaatt gctacaaagc aatccgcccc gacgacggga gaagtgacac ctggatcacc 600 acgatcgaaa cagacctcga tttctggctt gatacaatcg aggaacatct gttcatgaga 660 ataggcgagt cgaactcaag cagagcatcg ttccgttcgc ggctttccca acaaccgctt 720 ttgactgaag aaatgaatca ggtgcaatct cctcggccga tttctgaccc cgactggagt 780 gatttaaggc gtcgggtcaa ccaactagag ctccaaggag tgcatgggta ttctgggagc 840 ccaagcctac gcccaggccg ttcaccgcca caacctccac cgagaccgtc ttccgcaccg 900 actcaacatc cagacgcgga cgacggtaat tatggagcga gaccaagagt aacttcattt 960 tattctcata gaggtgagga cgcgtttcac ggttgggacc aggcttgtga gtatcgctag 1020 cgcttccgat agcgatatta gcggcgatat acttaaatgt agcgctaaaa aatcgccggt 1080 taattcgaaa ttaggctatc ggaaacacta gcgttgatat cgccgctagt gtttgcgata 1140 gcgataaaat cagaaaactt ttcgttattt cattgtaaga aattgcctaa atttagagtt 1200 tcaccatcaa attgcagaat caaattattg atttttatta tattattgaa gactttaaga 1260 ctttcgtttt gaatcccttt agttttgtga tctaatcaaa atattctcac aataaagttt 1320 tttaaaattt tttttgccga gctaataaaa ttttttaaag caaacgaatt tttttcaata 1380 tcgcttaaat atcgctatcg ctagcggcga tattttctcg cgatagtatc ggtagcggta 1440 tcggcgatac agggtaaact aagtagcgct atcgcaaccg gcgatattgc ggagactatc 1500 gatttacaac cctggttggg accagccacc agggcttgga aacagcatct ttgcaccacc 1560 accgcaagtc gaaccacgcg aaaccatcgc acacatcact ccagctgatt cgaaattgaa 1620 gatcgacgat ccagggtggg gacaacagga agtcccactt catgcaaacg acacagcagc 1680 cccaagctgc agctccgaat cgtcgtcgtt tggtttcggc cctggaacga gccattcaaa 1740 cttctttcaa agccatccag ccacggaagc agctggccat ctcgcgtcgt ttccagccag 1800 tcatcctacg tttgcatcca gctttcagcc ggcgcttcca gccagccttc agcccacgtt 1860 tgcatccagc tctcagccta cgcttccagc cagctttcag cctacgcttc cagccagcca 1920 tccggctacg ttttcagcca gccatccggt tacgcttcaa gccagcttcc agcctacgct 1980 ttcaaccagc catccagcta cgtgtctagc cagctttcag cctacgcttc cagccagtca 2040 tccggccacg gtttcatcta gccaaccggc tacgccttca gccagccaga cagctacgct 2100 ttcagccagc catccagccg ctgtatcatc tagcaatccg gctacgcttc cagccagctt 2160 tcagcctacg cttccagcca gccacccgac tacgttttca gccagccatc cggctacgct 2220 tcaagccagc ttccagccta cgctttcaac cagccatcca gctacgtgtc tagccagctt 2280 tcagcctacg cttccagcca gctttcggcc tacgtttcca gccagctatc cagccacgct 2340 ttcagccagc catccagccg cggtatcatc tagccatcct gctacgcttc cagccagctt 2400 tcagcctacg cttccaacca gccatccggc tacgttttca gccagccatc cggctatgtt 2460 tccagtaagc catgcggcca cgctttcagc cagtcatccg accatggtta cagctagcca 2520 ttcagctacg cttccagcca gctttcagcc tacgcttcca gccagctttc agcctacgct 2580 tccagccagc catccggcta cgttttcagc cagccatccg gctatgtttc cagtaagcca 2640 tccggccacg ctttcagcca gtcatccgac catggttaca gctagccatt cagctacgtt 2700 tccagccagc tttcagccta cgcttccagc cagcttccag cccacgcttc cagccagcca 2760 tccggctacg ttttcagcca gccatccagc cgcggtatca tctagccatc cggctacgct 2820 tccagccagc tttcagccta cgcttccagc cagccatcca gccacggttt catctagcca 2880 tccggctatg tttccagcca acatcccgcc taccgtcaca gctagccatc atgttacgtt 2940 tcaacctagc ttcccgccta ctgtgacagc tagccatcat gctgcggttc cagccagcat 3000 cccgcctatc ttatcaactg accacccggc catggtccca tctagttacc cggctatgtt 3060 acccgctagc tatccgggta cctacagtca tcaaacttac ccgccgcccc ctttgcctca 3120 gcaacaaccg gcaggtccaa gccgaacgac cgacagttgg ataacggacc cgaactggaa 3180 tgtgacacag agccacccat cgtcttggtt agcatcaaaa ctcccaagga tcaacctcgc 3240 accgtacgac ggagatccaa ggaattggtc ctattttatt caaaacttca aatgtctggt 3300 ccacgacgtg gtgccggacg atgcgcaaag aatcattttt cttaaggatt acctcagcga 3360 gaaggtgcgc aacagcatcg caaattcgtt gcgcgatccc gcgcaatata gatcggccct 3420 ggcgaaacta caaagaaggt acggaaaccc acaactggta gtccgagccc atgtacaaag 3480 tttaatacaa ctcgtcggac cgaaggaagg cgatttcgac gctctctgcg tcttctccgg 3540 cgcaatccaa gcggcggtcg cggatctgtc aaacggaggt cacttgcacg atctcctcgc 3600 cccaggccta ctttaccaag tcacttcaaa gctcccgccg gccctaatgc aaaagtgggg 3660 tgaagaaatg tgcagcctcc agccaagagc cgccacgctt atcgattttg accagtggct 3720 cgaatcacga gttatggccg ggacctgggc cgctccagcg ctacaaccta cttccaacgc 3780 tggccggaca cgagagagag cgccaaaggg aaaggaagag actaaaatgc cggccgtccg 3840 tcacccgcga atccttcata cgtctcaccg cacgtcttgc gtagcatgca ctggaccaca 3900 ccctctggaa aaatgcaata aattctccaa actgtcgcct aaagccagag ctgagctcct 3960 tttggagaag ggtggttgct tcaggtgcct taatcctgga cacaacagta aagcctgtga 4020 aaaaaaggtg gcttgtggcc aggacaattg caaatcctac catcatccga tggtgcacgg 4080 agccccgcgg atgtttgctc caccaaaatc tcctcaatcc gacacgccaa aaacagaacc 4140 agcaacccca gcagcgccgt caatcttgaa aaaaggcaag aacttttgta ttaccagcaa 4200 gaaaacttca ccggatcaag ttttgttagc cgtcgtacac ctcaagattg aagctaatgg 4260 ttattccttc gagacctacg gcctcctcga tcccggagcg gaaatgaccc tggtgaccaa 4320 agatgtcctg cgtatgctga aattgaatcc acctcgcgaa aaaatgcaga tgggcactct 4380 ccatggtgac ggcccgttgt tatcagtcca aagagcggac ctgcaaattt cctcgctcga 4440 cgattctttc tctttcagcg cctacgaggt cccagcgtta acgacattca accttcatcc 4500 cggatatgtc gattggccga aggaaaagaa gaagtatcca cacctggccg acttagatct 4560 accacccgta gatttctcga agataacgat ccttctaggc tccgattatc aagaagcgct 4620 cgaaccttta caaattcgaa agtcgaccag cgagtctccc ggtccatggg cgatcaagac 4680 tgtgttcggg tggtgcgctc gcggccctct ttataaagca gacggtcaac attcaccacc 4740 ctccatcaac ctcatccaga atcgaaaaag gcccaattta aacgagatgg tccaccagtt 4800 atggtcaatc gaatcctttg gcactcatcc ggatgtgaag gcgccaatta gtggagacga 4860 tgccagagcg ctaaagctac tcgaagaaaa tgtgcgcttt ttgggcgatc gctatgaagc 4920 gccgctctta tggaagctgg atcaacccaa cctccctaac aactattcgg tggcattatc 4980 acggttccac acgaatgata ggagcttgaa aaaatgtccc gtgaaagcca agtcttacga 5040 agccaccatc aacgactaca tcatgcaagg gcatgccaga aaattgcaag aagatgaact 5100 ggaaggcccg ataggcctaa cgtggtacgt cccgcatcac gcagttttcc atccagacaa 5160 gcccggaaaa tgccgtgtag ttttcgatgg ggcctctaca tttcgtggaa tttctctcaa 5220 tagctgtctg ttgacaggcc cagatctttt gactagctta attggagtgt tgctacgttt 5280 gagagaaaga ccgatcgcca tttccgggga catcaaagga atgttccacc aagtccgcgt 5340 ccgttctgtc gacaagcatg tattacgatt cctctggaga cccccccaca ccgccggacc 5400 tccggatgta tacgaaatgc aagttcaaat ttttggtgcc gcgtcttcgc ccacagtttg 5460 ctcttatgtc ctgcgccgag ctgccgccga taatgaagaa gagttcccag gcttagtcga 5520 aaaggtagca accaactgct acatggacaa ctacatggat tcctttggta ccgaagaaga 5580 agctatcgaa tttaggaaca agttccaaga tggtttgaag aaaggcggct tccactggac 5640 ccaatggatg tctacatcgc ggaaagtgct tcaatctgtc cctgaagatc tcaggtccga 5700 tccgaaattg aatttaaacc tagacgcgct tccggccgag aaaacgcttg gtgtcacact 5760 caattggcaa gaagacgttt tcgtctttaa agtgaaaatt caaaccggct ctgacaccta 5820 caggaccatc ttgagtgatg tctgcggcct gtacgacccg ctcggatttt ggacaccagt 5880 cactctcact gcgagaattt tacttcaaga tatctgccgg ataaaaccag aatgggacga 5940 gatcttacca gaaccactgc ttgaacgttg gagaatttgg gtctcaaacc tgcatcatct 6000 cgagtccgta gcagtgccca ggtgtcttca accctttcct ccgacgtcag ttcaacttca 6060 tatcttcgtg gatgccagcg aattgggttt cggcgcagtg gcgtatttgt tgctgcagaa 6120 taagaaaacc agcaccaccg tttttgtaat gtcaaaaagt agagttgctc caattcacca 6180 tcttacaatc cctcgtctgg aactatcggc cgccctactc ggcgcccgcc tagcccgaac 6240 catcaagaac gagcttcgat tgaagataga cggtgaagta ttctggtcgg attcgtcaac 6300 tgtattacgc tggatcaatt caacgcattg tcggtaccac acctgggtgg caaaccgagt 6360 cggggaaatt ctcaccctca ctactcctga acaatggcga catgtgccgg gcattttaaa 6420 ccccgctgac gactgcagcc gcggcgtcga ggcgtccgcc ctgacagaaa attatagatg 6480 gtggactggc ccggactttc ttcttcagcc ccgaacaaac tggccagcaa tgccggaaca 6540 ttttgcccag ccagtcgatg accccgagat caacctctca aaatgggccg ggaatgtctc 6600 tgttcctacc cctcatccgc ttctacccat cttgcagaag agttccagta tgtgccgact 6660 aaagcggatc atcgcctggg tgttgcgctt catccacaac aaacttaagc ctgccgccga 6720 gagaagatcc aagccgtacc caatcattcc cgagctacga gaagccgaaa caattctcgt 6780 ccgcctcgcc cagcgtgaag ggtacggagc cgagtacaaa agactaagcg aaggaaagga 6840 attggatcca ggatctaaaa ttgccacgtt aacgccgttt ttcgacgagg aagcaaaatt 6900 aattcgtgtt ggcgggcgtt tgggctacag caacttcccg ttggcgatta agcacccgat 6960 tttactgccg gcggatcatc atgtaaccaa gctgatcgta aatagggagc atctccttct 7020 cagacatgcg acggtggaga gaacggtcgc cggactgcac aaaaagtatt ggatcccgcg 7080 gagtagagtg tcagtcaatc gtattatcaa gaattgtttt gattgcaagc gccaccgtgc 7140 caaaccagat attccgctta tgagcgctct cccaattcat cgccttcaag atgacagacc 7200 cgcgttctct aataccggaa ttgattattt cggaccgatc ctgacgaccg ttggtcggcg 7260 gcgagagaag agatggggaa taatatttac atgtttagtg agcagagcgg tacatttcga 7320 gatggcgtac ttcttggaca cctcctcgtt tttgctagcc ttttggaggt ttgcacgccg 7380 aagaattcgg ccggacgtga tctactccga caacggtacg aacctcacat ccggcgaaaa 7440 ggagttaaaa gctggattag accgtcttaa ccaaaccgcc attgcaaacg agttaggcgc 7500 tcagaacatc gagtggatat tttcaccgcc ttcagctcct catttcggcg gagtgtggga 7560 aagtctaatc aagtcagcca aagactcgtt gtcctttatt cttcaagatc atacattccc 7620 ggatgaagtc ctcctctccg ccttgatcga agtagaatcg ataatgaacg accgtccgat 7680 cggccgttgt tcaacggatc ccaacgattt ctccgtcctc accccgaatc accttttgtt 7740 agctcgcgct aaccccagcc tcccagcaga cgtcgtgtac gagtcaagtc caaattccag 7800 acagcgatgg cgaaacgccc aaatcatcgc cgaccacttc tggcgaaggt ggaccgcgga 7860 agtgttgccc tcccttctga aaagaaagaa gtggtacgtc gatcgtcgcc acattcaagt 7920 tggagatttg gtcctcatcg ttgatgaaaa cgtcaccagg ggtaggtggc ccgtcagtta 7980 cgtcgatgaa gtctacccag gaaaggacgg aattccccgc tcagccctag tacgaaacaa 8040 ggacggagtt tatcatcgcc ccgtctccaa gctgtgcgtc gtggaagaga gcaacttatt 8100 aatttccgac gccagcggaa atgatgccgg cga 8133 // ID BEL4b_Cis_LTR repbase; DNA; INV; 328 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL4b_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-328 RA Smit A.F.; RT "BEL4b_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000072. XX SQ Sequence 328 BP; 72 A; 49 C; 73 G; 133 T; 1 other; tgttactgca ccgttgcagt agttttttgg ttcgttgatt gttcatggtt ggttgatttt 60 ggtaaatttc tatttctatg atttgtcctt ggcgatttta cgtttttgat ttatanagtc 120 cttttcgagt tcggctgcaa ggaagcagca gtttatttta gcgatcaagg aattaacggg 180 cgcgcattgt taaggtatgg ttttttgtag tatagtttcc cattttttat agcgtcttac 240 atgcatttat gtttatttta gttaccgctg aataccgaat aaaggatttg gaaatagcgc 300 gtcaacttca ttgcgtggca ccgtaaca 328 // ID Gypsy-105_AA-LTR repbase; DNA; INV; 188 BP. XX AC supercont1.2; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-105_AA_; KW Gypsy-105_AA-I; Gypsy-105_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2; Positions 5301854 5302041. XX SQ Sequence 188 BP; 50 A; 49 C; 42 G; 47 T; 0 other; tgtgtggcat acataacggt cagtgtgact ggagctttac tttagccaaa aacattccgc 60 cgttgcattt gacagttcat cgtcgttaaa cgggatacgc tcaacacaaa tgcatgcgtg 120 cgactcgtcc actctttcgt caaccatcct cgaggtaagt cacgctacta ggaaacagga 180 gctgcaca 188 // ID Gypsy-259_AA-I repbase; DNA; INV; 4394 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-259_AA_; KW Gypsy-259_AA-LTR; Gypsy-259_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4394 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1119-1119 (2011). XX DR [1] (Consensus) XX CC Positions [3200-3631] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1400..3631 FT /product="Gypsy-259_AA-I_1p" FT /translation="MSPDKLSIAKQEFQTMMDLGICRRSSSSWASPLHCVP FT KKSGQLRFVGDYRKLNSVTVPDRYPVPHIHDLLNTLHGKSIFSTIDLERAH FT HQIPVETEDVPKTAVITPFGLFEFTRMQFGLCNAGQTFQRFMHRIFADLDF FT VVVFIDDICIASSSIEEHYRHMRSFFDIFRSNGLVINLPKCKFAKQEVEFL FT GYLINKDGVKPLPSRVTAFLEYARPTTVKDLRRFLALVNGYKRFVRQATDM FT QAALRYLITGNKKNDSRKIEWSVQANEAFENCKKGLAEATLLYYPDPTKPL FT GLFVDASNTAAGAVLQQLHNGTWQPLGAKTRIVYRKRAPECGNRFASMNHN FT ISMELTSNEIGSTLSQQEYYSSCWRTDSAFRCRSAIIAFGPLGFYSEKFTS FT GQRNYSTFGRELIGMKMSVRYFRHLLEGRSFTIFTDHNPLTHAMTSNSACR FT LPHEERALQFISQFTTNIQHISGKQNVIADSLSRLNAISSPSPIDYKIIAL FT DQDSDPELQNLLRSTNSSLNLKTLTLQQCSKPLYCDVSVTGSTRPFIPESH FT RQVIMEHFHNIAHPGIRATRKFVSSRFVWPKMNRHIAEFVRKCEACQKSKI FT HRHTTAPLASFGLPKCRFNHIHVDLVCPLPPSNGHTYLLTIVDRYTRWPEA FT IPLEDMTANTVATALCSQWISRFGCPEYITTDQGRQFESELFRELAILLGV FT NHTRTTAYHPQANGMVERFHRKLKEAIMCVDAKKWFTRLPLRPK" XX SQ Sequence 4394 BP; 1272 A; 1037 C; 978 G; 1107 T; 0 other; cattggtaac cccgagagag gaaatccgcg gcaaaacggg acgattttat cgagtttcgt 60 acgttcgaat ttttcgcgtc cgtatcgtcg ccattttgtt ttcaaccaag ccgacaaaat 120 gctggatgaa gtggaggatc aggctgcagc aacagtcacg gcatcagtag ctattaaatt 180 gcccgatttt tggaagagtg atccggctat gtggttcgcc caagccgaag ctcagttcgt 240 gctggctggc gtaacgcgag atgcaacaaa attttactac ataatcgcaa aagtagacca 300 gagcgtactt tgccatatct cggatttagt ggcaaaccct ccacaggacg acaaatacca 360 agccatcaag agcagactgt tgagtcgctt cgaaatgtca gcgcaagcta agatggagaa 420 gcttttgaac tcctgcgact tgggtgatat gaagccgacg catcttttgg cccgaatgca 480 ggatttggca gctggactca aagttgacga cggtctaatg aagatgctgt ttttgcaacg 540 actcccggca aacgtgaaaa ctgtactgac catccacgac ggtacactga tgaaattagc 600 ggagatggcg gataagatgg tggaatcctt ctacccacaa gtggcggcca ccgcctttca 660 gccagctgtt tcgagtcagg aggaactcgt gcagcagatt gccagtctca cagcggaaat 720 tcggaaatta agagcaggaa acgacagaaa cgagaggagc cgatcttcgt ctcaagcccg 780 gcagcgttct agtgatcgtg gtagcgtctg ttggtaccac aaaaagtacg gtgcacaagc 840 tcatcagtgt cgcgagccgt gtgcgttcag aagttcgtca aaaaactaaa tacttgccca 900 tctggaacgg cggaggtggg ctcaaacaaa gaaagccgtc gcttatgtgt catcgacaaa 960 atcaacaacg ttcgcttcct ggtcgacacc ggatcggatg tctcgatcct tccagctact 1020 agacgagatc gaagtaagcc accgattccg tttatggacc aatggaccga tgcatgagtt 1080 cactaatttg acatttgagc agtgccgtgt tatttatgtg accatggaaa ccagtgaatt 1140 cagcaccgct caaacgtcaa attagtgaac tcgtgcactg gtccattttg catgggcaac 1200 taatctcaaa gccaccgttg gtgtagttca ttcgcttcta catgatgtca ctacgatcaa 1260 ctacgaccat ccgttccggc gtttacttga ggaattccaa gaaatcactc gcccttcaac 1320 gctgagggcg gaagttcatc atgatattac tcaccatatc atcacgaaag gaccacccat 1380 tgcttataag gctagaagga tgtcgcccga caagttgtca atcgcgaagc aagagttcca 1440 gacaatgatg gatcttggca tctgtcgccg ttctagttcc agttgggcga gccctcttca 1500 ttgtgttccc aaaaagtctg gacagttacg ctttgtggga gactaccgaa agttgaacag 1560 tgtgactgtg ccggacagat acccggtacc acacatacac gacctcctca acactcttca 1620 tggtaagtca atattcagca ctattgatct tgaacgagct caccatcaga taccggtaga 1680 aacggaagac gtaccgaaaa cggccgtcat cactccattc ggtcttttcg aatttactag 1740 gatgcagttt ggcctgtgta atgcaggtca aaccttccaa agattcatgc ataggatttt 1800 tgcagacctc gatttcgtcg tcgtcttcat cgacgatata tgcatagctt cgtcctcaat 1860 cgaagaacac tatcgacaca tgcgatcttt ttttgacata tttcgatcaa acggtctcgt 1920 cattaatctt ccgaaatgta aattcgcgaa acaagaagtc gaatttctcg gatatctgat 1980 taataaggat ggcgtaaaac cgttgccaag tcgtgtaaca gcatttttag aatatgctcg 2040 accaacgaca gttaaggatc ttcggagatt cttggcacta gtgaacgggt ataagcgatt 2100 tgtacgtcaa gcaaccgata tgcaagcagc attgcgttat ctcataacag ggaacaagaa 2160 aaacgactct cgaaaaatcg agtggagcgt acaggcaaat gaagcgtttg agaattgtaa 2220 aaaaggatta gcggaagcaa cattgctcta ttatcccgat ccaaccaaac ctttagggct 2280 atttgtggat gcttccaata ctgcggccgg agctgttttg cagcagcttc acaacggaac 2340 ctggcaaccg ttaggggcca aaacgcgcat agtataccgg aaacgggcac cggaatgcgg 2400 aaaccggttc gccagcatga atcacaacat ctcgatggag ctgacatcga atgaaattgg 2460 atcaaccctg agtcagcaag aatattatag ttcatgctgg cgaaccgatt ccgcattccg 2520 ttgccgttcc gctatcattg cgtttgggcc tttaggcttc tattcggaaa aatttacatc 2580 gggccaacgc aattactcta cgtttgggcg cgagctgata ggaatgaaaa tgtccgtcag 2640 gtatttcaga catctgttag agggtcgttc ctttaccatt tttaccgacc acaacccact 2700 aacacatgcg atgacatcca attctgcgtg tcgtctacct catgaagaaa gagctttgca 2760 gtttatatca caattcacta caaacatcca gcacattagt ggcaaacaaa atgtgatcgc 2820 agattcgctc tccagattga acgcgatcag ttctccatcg ccaatagact acaaaattat 2880 tgctctggat caggacagcg accctgaact tcaaaatttg ctaaggtcca caaactcatc 2940 actgaactta aaaacactca ctctgcaaca atgttccaaa cctctttatt gcgacgtctc 3000 cgtcacaggt tccactcggc cattcatacc ggaatctcat cgacaggtga ttatggaaca 3060 ttttcataat attgctcatc ctggcattcg agccacccga aagtttgtgt ccagtaggtt 3120 tgtttggccc aaaatgaaca gacatatcgc tgagtttgtg agaaaatgcg aagcatgcca 3180 aaaatcgaag attcaccgtc acactacagc gccattggcc tcatttggtt tgccaaaatg 3240 ccgattcaac catatccatg ttgaccttgt atgtccattg cctccgtcta acggtcatac 3300 gtaccttctt acaattgtag atcgatacac cagatggccc gaagctattc ctttggaaga 3360 tatgacggcg aatactgtag caacggcact ttgctctcaa tggatttctc gttttggctg 3420 ccctgaatac atcactacgg atcagggacg tcaatttgaa tccgaattgt tcagggaatt 3480 ggcaattcta cttggagtca atcacacacg tacaaccgca taccacccac aagcgaatgg 3540 gatggtcgaa cgttttcatc gaaaattgaa ggaagccatc atgtgcgtcg atgccaagaa 3600 gtggttcact cgactccctt taaggcccaa gtaaaaatgg tgcaaaagtc aaaactaaaa 3660 aagcagaaat aaacaaatcg aagaaaataa aaacacacag ctcttttatt tgccaaataa 3720 aaaggctgtg tgttcttatt tccatcaatt tattatttta aaaggagaaa actgcttttt 3780 cagttctgac ttttgagcca ttttttcttg caccttaatt ctactcggcc tacgaactgc 3840 tattaaagaa gatatggatt gttctcctgc cgagttggtg tacggtcaac ctttacctgt 3900 acctggagaa tttttcgagc cgccggaaaa aatcgttggt cgagccgatt tttcgttgga 3960 cctccatcgt gtgatggacc agattagacc agtcgaagcc aaacatcaca caaaaggcaa 4020 attattcgtg aacaaaaagc tgaaggattg cactcacgta tttgtacgac gctgttaaaa 4080 aatctctaca acgaccttat gacggccctt atcgtgtact tagccgaaca gataagtacg 4140 tggatatcct ggtaaatcag aaacagcaaa gagtttcgat cgaccgtgta aaaccagctt 4200 acaccgggtc agaaacaact accaacgatc ctgtgaataa gaaaactata gttacacctt 4260 cgggccaccg ctttaagttt ttggtgtaac tgggggggca ctgtggtggt tggcacaccc 4320 ctggatgcca ctcataccaa gcagagagag agagaaagga tggcgacaag aagagaggga 4380 acaaatccag ttag 4394 // ID Poseidon-5_HM repbase; DNA; INV; 2278 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Poseidon-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2278 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata (Poseidon RT group)."; RL Repbase Reports 8(12), 2088-2088 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 13..2142 FT /product="Poseidon-5_HM_1p" FT /translation="MFIKSKERLVKKFLILQNEXENKNDNAEKTTNYKKNA FT VLNLCDTDIPVSHNNFLNLGPHFVPSLKSIPYMDIITATESSALKLEYNNK FT IENAQDLRKNVLRELKTGKKLNNNLTKEQWKAYREIKEEKNVDIYPFDKGT FT GFVRIEHSKALEKIREQIGPTKIVSEDPTANYAAKIRRYLSKLNKRNCFSK FT EEYEKIYPSDPIPPRMYGVIKAHKPEKFYPMRIIISTIGTANYGISEFLVK FT IIQPVLNENLTRLKNSFDFIKKAESWKIDKDEVQVSFDVVNLYPSVPLKEA FT TLVLIEKLNNNTTYKKLTRLSIPEIKQLIELCLFQCYFHWNNEIHIMENSG FT PIGLSFMVVLAESFLQFHENNAIKIALNFNPALNLKSFLRYVDDSHARFPD FT LNQAKHFQYILNQQHPSIQYTIEVENELGTLNYLDIQITNKKTGKYEYKVY FT RKDAITNIQIKPHSNHDPNILNSIFKGFLHRAYSICSKSYLQNEINFLIDV FT FIENGYDKTQMMNIAHQFQHKRQNKITIPSEINNYPVVSLPWVPGLSPKLR FT KIFRKAGYRAVFKSNPNLRSLLTSKNKSKLPNNSQPGTYIIKCNCSKVYVG FT ETKMKVNTRMHQHQKSIDENKQHQSALALHKTFCKKEIIWEQTKTLKVENK FT KFERKVREALEIQKNMCSAKNGGINLDEGQYVKTKFWTPFFKFQRKRSHST FT ADVNSNVLI*" XX SQ Sequence 2278 BP; 949 A; 384 C; 328 G; 616 T; 1 other; gctcgggaga atatgtttat aaagtcaaaa gaacgattag tgaaaaaatt tctaatattg 60 caaaatgaac rtgaaaataa aaacgataat gccgaaaaaa caacaaatta taaaaagaat 120 gctgtcctaa acttatgcga taccgacatt ccagtaagtc ataataattt tctaaacctc 180 ggtccacact ttgtaccatc tttaaagtcg ataccatata tggatattat tacagccacg 240 gaatcctctg cgttaaaatt agaatacaac aataaaattg aaaatgccca agacctaagg 300 aaaaatgttt taagagaact aaaaaccgga aaaaaattaa ataataatct tacaaaagaa 360 caatggaaag cttatagaga aataaaagag gaaaaaaacg ttgacatata tccattcgac 420 aaagggacag gttttgtacg aatcgaacat tcaaaagcct tggaaaaaat tcgcgaacaa 480 ataggcccaa caaaaatagt aagtgaagac cctactgcaa attacgcggc taaaattcgt 540 agatatcttt caaaactaaa taaaagaaac tgtttttcga aagaggaata cgaaaaaata 600 tatcctagtg atcccatccc gcctcgtatg tatggtgtaa tcaaagccca caaacctgaa 660 aaattctatc ctatgagaat aattatctcc acaatcggca cagcaaacta cggaatatct 720 gagttcttag ttaagataat acaaccagtt ttgaatgaaa acttgacaag attaaaaaat 780 tcctttgact ttattaaaaa agctgaatct tggaaaattg acaaagatga agttcaggta 840 tcttttgatg tcgtaaatct gtatccgtcg gttcctttaa aagaagccac ccttgtactt 900 atagaaaaat taaataataa tacaacatat aaaaaattaa ccagactaag tatacctgaa 960 ataaaacaac ttatagaact ttgtttattt caatgttatt ttcattggaa caatgaaatt 1020 catataatgg aaaactcggg tccaattgga ctttcgttca tggtcgttct tgcagaatca 1080 tttctacaat tccatgaaaa caatgcaatt aaaattgcac taaatttcaa ccctgcactt 1140 aacctcaaat cgttcttaag atatgtcgat gatagtcacg caaggtttcc agatctcaac 1200 caagcaaaac atttccaata cattttaaat caacaacatc cttccatcca gtatacaatt 1260 gaagttgaaa acgagttagg aacactaaat tatctggata tacaaattac gaataaaaaa 1320 acaggaaagt atgaatacaa agtatatcgg aaagatgcta tcactaacat tcaaattaag 1380 ccacactcaa atcatgatcc aaatatcttg aactctattt ttaaaggatt tctccaccga 1440 gcttactcaa tctgtagtaa gtcttacttg caaaatgaaa taaattttct aatcgacgtg 1500 tttattgaaa atggttacga taaaactcaa atgatgaata ttgcgcatca gtttcaacac 1560 aaaagacaaa ataaaattac aattccatct gaaatcaata attatcctgt agtttcctta 1620 ccatgggtac ccggtctctc gccaaaactt aggaaaatat tccgaaaagc tggttacaga 1680 gcggtattca aatcaaaccc aaacttaaga tccctgttaa catcaaaaaa taaatcaaaa 1740 ttacctaata acagtcaacc aggaacatat ataatcaaat gcaactgttc aaaagtgtat 1800 gtaggcgaaa caaaaatgaa agtaaacaca agaatgcatc aacaccaaaa aagcattgat 1860 gaaaataagc agcatcaatc tgcattagct ctgcataaaa cattttgcaa aaaagaaatt 1920 atttgggaac aaacaaaaac attgaaagta gaaaataaaa aatttgaaag aaaagttagg 1980 gaagccctag aaatacaaaa aaacatgtgt tctgcaaaaa atggcgggat taatctcgat 2040 gagggtcaat acgtaaaaac taagttttgg acgccatttt ttaaatttca gcgaaaaaga 2100 agccattcaa ccgctgacgt caatagcaac gttttaattt aaattataac ggtttactaa 2160 ttactgtaac atttaacaag ctgaagaagc tggtatctaa aatccagcga aaatttctat 2220 aataataaaa aattataagt gttgagagaa atcgtatttt tgatgtttta aaataata 2278 // ID NVBRP3 repbase; DNA; INV; 94 BP. XX AC X64091; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE N.vitripennis repetitive DNA from B chromosome. XX KW SAT; Satellite; Simple Repeat; NVBRP3; Repetitive DNA; KW satellite DNA. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-94 RA Eickbaum C.D.; RT "NVBRP3."; RL Direct Submission to Genbank (27-DEC-1991)D.C. Eickbaum, RL University of Rochester, Dept. of Biology, Hutchison Hall 334, RL Rochester, NY 14627, USA. XX RN [2] RP 1-94 RA Eickbush G.D., Eickbush H.T. and Werren H.J.; RT "Molecular characterization of repetitive DNA sequences from a B RT chromosome."; RL Chromosoma 101, 575-583 (1992). XX DR GenBank; X64091; Positions 1 94. XX SQ Sequence 94 BP; 35 A; 20 C; 19 G; 20 T; 0 other; tcgagtttga atcgtcagca tatcggagga aggcgaacga gggcgagaat ttaatttaaa 60 acaataaacc ttaaacccac ctaccctata gaca 94 // ID Gypsy-610_AA-I repbase; DNA; INV; 4339 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-610_AA_; KW Gypsy-610_AA-LTR; Ty3_gypsy_Ele68; Gypsy-610_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4339 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3263-3724] - Integrase core CC 'CTCCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1256..4327 FT /product="Gypsy-610_AA-I_1p" FT /translation="MVKRGKCFVTSSPDLNVLGIDWIDAFNLWSIPFDTLC FT SQVAAEPRPCFDEVTAQLQANHPAVFDNSLGHCTKTKVKLFLKPNAKPVFC FT PKRPVPFNTISLVDAELTRLQSLGIITPIDFSEWAAPIVAVRKPNGKVRIC FT ADYSTGLNEALEANHYPLPTPEEIFAQLNGSAVFSIIDLSDAYLQVEVDEE FT SKKLLTITTHRGLFQFNRLAPGVKSAPGAFQRLVDTMIADIPGVRSFLDDA FT IVFGATWEAHKASLDKLLQRLEEYGFHVKLEKCHFFQTEIGYLGHIVDQNG FT IRPDPAKLQAIASIPAPTNVSELRSFLGAVNFYGRFVRNIHELRHPLDKLL FT KKDSKWQWNSDCQRSFEQFKQVLQSDLLLTHYDPKLPIIVAADASSTGIGA FT VIFHKFPNGSLKAIQHASRSLTPAEQATGQPEKEALALIYGVTKFHKYLLG FT RRFTLQTDHKPLLSIFGSKKGIPLHTANRLQRWALMLLNYDFEIQHVSTND FT FGCADMLSRLIDRSKQPEEDYVVAAISLEEDMVSIIRDTIKQVPISFAAIQ FT SATKADKALQAVLKFIREGWPNEAKSITNPDVRPYFIRQDSLTHVDGCILF FT HDRVVVPSKFRRQILKQFHRGHPGIVRMKSIARSFVYWPGIDNDIEDFVRH FT CVPCCTAGKTPVKTTLESWPMPDKPWSRIHIDYAGPMDGVFYLVVVDPYTK FT WPEVYATKTTTTKTTMKLLTNSFATFGIPETIVSDNGTQFTSHEFQSFCEQ FT QGIHHIRTAPYHPQSNGLAERFVDTLKRTLRKIRSGGETLEEALQTFLQVY FT RTTPTADLGGKSPAEVMFGRPIRTISSMILPPSYSTAAAQSKAESQNQRFN FT KKQGAVSRQFAPGDSVYAQVHQGNSWQWHAATVIERIGRVNYNVFLEDRQR FT LIRSHTNQLKQRAAESVPDRDANKGPSPLSVFFDGFGLDAPSAAVQATATN FT PDPSIPQNENDLSLQAEETDESQYYTEGSDEEDEGNPPQLDPAPVPRTPLA FT RERRQIRLPARFEPYWMT" XX SQ Sequence 4339 BP; 1138 A; 1245 C; 1050 G; 900 T; 6 other; gctggcgacg aggattattc tggattgtca agagctccag aattgaaccc gaggaaggat 60 ggcgacgaac gaagaactcc aagcggcgat tgtgcagatg acccagctac tccagcggtt 120 ggccgttcct caagcaacca acccagagca agtattggag tccctgtcga ccaacatcag 180 tgagttttgt ttcgatccgg agaacggcac cactttcgac aaatggtttg cccgttactc 240 ggatctgttc gagagcgacg cacgaagcct ggacgacgca gccaaagttc ggctcctact 300 ccggaagttg gacacgtcct cgcattgtcg ctacgtcaac tatatccttc cgacgctacc 360 gaaggacgtc acgtttgccg ataccatcaa gacgctcaag aaaatcttcg gcaggcagac 420 ctccattttc cacaagcgct agcagtgctt gcagctggtg aaatccgaga cggaagacat 480 catcagctac ggtggcaaaa tcaacagggc gtgtgaagag ttcgagtttc atgacttgaa 540 gatcgaccag ttcaagtgct tgatgttcgt ttgcggtctc aaggctccac gctacgcaga 600 cattcgagca aggcttcttt cccgcatcga aggtgagacg gcacaagcgc cggtaacact 660 ccaaacgctg atcgacgaat tccagcgaat cgtcaacctc aaatcggaca ccacgatgat 720 cgaacatcaa tcaagttcga aaaactcggt ccgcagcgtc tccgagaaga aaaccggcca 780 ccatcagcgt ccatcgaaac cggaaagcaa gtccgtccca cgaactccct gctggcagtg 840 cggwcaaatg cactatgtcc gcgactgtcc attckccggc catctctgca aggwatgtaa 900 ccgcaccggc cacaaggaag gctactgtac ctgcatcaaa aagtcttcaa gcgactcttc 960 aactcaaccc gggaagaaga agaccaagac cggttccaga tctcaagcca aaggaatctt 1020 cgtgaaccac atcgcgaaca gcactccaaa gcgtaagtac gtgaccatsa ccatcaacga 1080 mgtcgctatt tcgctgcaat tkgattccgc aagcgacatt accgtggttt ccaagacaac 1140 atggcaacaa ctgggacgac cgaagctggc tccgtcatcg atcgaagcat ccaacgcatc 1200 cggtggacag ctcgccctca tcggcgaatt ccattgtgaa gtcaccctca acggtatggt 1260 caagcgtggc aaatgtttcg tcacctcatc accggacctc aacgtgttgg ggattgattg 1320 gatcgacgcg ttcaacctgt ggtcgattcc attcgacacg ctctgcagtc aagttgcagc 1380 cgaaccccga ccatgcttcg acgaagtaac agcacagctt caagccaacc atccagcggt 1440 ttttgataac tcgctgggac actgcaccaa aaccaaggta aagctttttc tcaaacctaa 1500 cgccaaaccc gttttctgtc cgaagcgccc ggttccgttc aacaccattt cgctggtgga 1560 cgccgagctt acacgtctcc aatcgctggg aatcatcacg ccgatcgatt tctcggagtg 1620 ggccgctccg atcgttgcag ttcgcaagcc gaatggtaag gttcgcatct gtgcggatta 1680 ttccactgga ctcaacgaag cgctggaggc caaccactat cctctgccta caccggagga 1740 gatcttcgca cagctcaacg ggagtgccgt gttcagcatc atcgatctct ccgatgcgta 1800 tctgcaggtg gaagtagatg aagaatcgaa gaaacttctc accatcacca cgcaccgggg 1860 tcttttccag ttcaatcgcc tcgccccagg ggtgaagtcg gcgccaggag cattccaacg 1920 gctcgtcgac acgatgattg ctgacattcc tggtgtacga tcgttcctcg acgacgcaat 1980 cgtttttgga gcaacctggg aagcacacaa ggcgtctctg gacaagctac ttcagcgtct 2040 agaggaatat gggttccacg tcaagctgga aaagtgccac tttttccaaa ctgaaatcgg 2100 ctacttaggg cacatcgtcg atcaaaatgg catccgtccc gatccggcga agctccaagc 2160 catcgcctcc attccggcac caaccaacgt gtctgaattg cgatcattcc tgggagccgt 2220 taacttttac ggccgattcg ttcgcaacat ccacgagctt cgacatcccc tcgacaaact 2280 tctcaagaag gattcgaagt ggcagtggaa ctccgattgc cagcggtcgt tcgagcagtt 2340 caagcaagtc ctgcagtccg atttgttgct gactcattac gacccgaagc taccaatcat 2400 cgttgcggcg gacgcatcca gtacgggtat cggtgcagtc atcttccaca aattccccaa 2460 cgggtctctc aaggctatcc agcatgcgtc aagatccctc actccggcgg agcaggccac 2520 cgggcaaccg gaaaaggaag cacttgcgct catctacggg gtaacgaagt ttcacaagta 2580 cctcctggga cgccggttca ctctccagac ggaccacaag ccgctcctat caatctttgg 2640 ttcgaagaag ggaattccat tgcataccgc aaatcgtctt caacggtggg cgttgatgct 2700 gctaaactat gattttgaaa tccagcacgt gtccacaaac gactttggtt gcgccgacat 2760 gctgtccaga ctgatcgacc gttccaagca gccagaagaa gattacgttg tcgcagcgat 2820 ttccctagaa gaagacatgg tgagcattat tcgtgacact atcaagcaag taccgatttc 2880 tttcgcagcg atccaatctg ccacgaaggc agataaagca ctacaagcgg tgctcaaatt 2940 catccgtgag ggttggccaa acgaagcaaa gtcaatcaca aacccagacg ttcgtcccta 3000 cttcatcagg caggactccc tcacccatgt cgacggctgt attttgttcc acgacagagt 3060 tgtcgttccg agcaaattcc gacggcaaat cctaaagcaa ttccatcgcg gacatccagg 3120 catagttcgc atgaaatcta ttgcacgaag tttcgtctac tggccgggaa tcgataacga 3180 catcgaggat ttcgtccggc actgcgttcc ctgctgtact gctggaaaaa caccggtcaa 3240 gacaacgctc gaatcatggc ccatgccgga caagccgtgg tcacgcatcc atatcgacta 3300 cgctggacct atggatggcg tgttttacct ggtggtagtc gacccgtaca ccaaatggcc 3360 tgaagtgtac gcaacgaaaa ccacaacgac caaaacaacg atgaagctgc tcacaaacag 3420 ctttgcaact ttcggtatac cagaaaccat cgtctcggat aatggaacac agttcaccag 3480 tcacgagttc cagtcgttct gcgagcagca aggtattcac cacatccgca ctgcaccata 3540 ccacccacag tcaaacgggt tagcagaacg gttcgtggac accctgaagc gtaccctacg 3600 caaaattcgg tcgggaggag aaacactgga ggaagcattg caaacgtttc tgcaagtgta 3660 tcgcacaaca ccaacggctg acctcggtgg aaagtctcca gcggaggtga tgtttggacg 3720 gccgatccgc acaatttcgt ccatgatact tccaccaagt tattccactg cagcagctca 3780 gtcgaaggca gagagtcaaa accaacgatt caacaaaaag caaggagcag tttcaagaca 3840 gttcgctccc ggtgattccg tgtatgctca agtccatcaa ggtaactctt ggcagtggca 3900 tgccgccacg gttatcgagc gtatcgggcg agtgaactac aacgtgttcc tggaggatcg 3960 tcagcggcta atacgttcgc acaccaacca gttgaagcaa cgtgcggccg agtctgttcc 4020 agatcgtgac gcaaacaaag gtccaagccc actgtcggta ttcttcgacg gttttggcct 4080 agacgcacca tctgcagccg ttcaagcaac agcaacgaat cctgatcctt ccatccctca 4140 gaacgagaac gacctgagcc ttcaagcgga ggaaaccgac gagtcgcagt actacaccga 4200 aggcagcgat gaagaagatg aaggcaatcc gccgcagttg gatccagcac cagtaccaag 4260 aacgcctctg gcaagggaac gacgacagat ccgtttgcca gctaggttcg aaccctactg 4320 gatgacttaa ggggggaga 4339 // ID hAT-65_HM repbase; DNA; INV; 3807 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-65_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3807 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2053-2053 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1005..3578 FT /product="hAT-65_HM_1p" FT /translation="MAYHHKSGYQKAKDREDKKKREEAVISKTVKITELFP FT TISVNVNREDPPSNSPLPCVGQCGLPGQFGHEFERDNSVAAVCTQSISHSE FT SDDMVDAETLSQKPEAQAGSKGDDNSFSTDIGQWSEVEINDTFCNFWVTRG FT SECCQHSNADFTSSMQIDGKQKRYCSQNLFYRVHSRNGEKVPRSWLCYSPA FT TSRIYCFVCKLFSADDNSFSRNGFCDWKNAASRIQSHENSIAHRNATITFS FT QRVQVRGCVDSAIVEQYRQECAYAKSVLERVVSVIKFLAERGLAFRGHTEI FT LGSSSNGNFLGILELIAQFDPFLSSHIDNHGGKGKGSVSYLSSTICDELID FT VMGKQVMKTILSEIRGAKFYSISVDSTPDISHVDRLSCVFRYVLNDGPIER FT FVQFLSMKGHGAEDLFNSLYSFLEEAKLDIKYCRGQSYDNASNMAGKYSGL FT QARIREKNPLAEYVPCFAHSLNLVGASAVNCVTTVSGFFDFVQNLYVFLNA FT STQRWELLTKGLDSNQLVLKRLSDTRWSAHSAATKALSKGFKNVKTVLENI FT AEDPEEKVEARTTAAGIAKKMDQLEYGILLELWTPILDRFHKTSLKLQSPQ FT LDLNEAVQLLTSLKEYVNSLRAQFDFFEKKGKEITGSTSYKEENQRKRKRS FT VQLTRHDGSADETVFSAQQKFRVMVFLPVLDHLNSALCKRSEAYSGVCDKF FT GFLRNIRSLEQSDLRRASRNLIDTYVDDLEESFEDEILHFKEYLLADTYLQ FT QNEKELKCSIELYMYRALRNSSALQDIFPNVATALHIYLSLMITNCSDERS FT FSALKRIKNYLRSALGDDKLNSLTVMCTESDILRSCDFNEILSDFVERKCR FT KMNF*" XX SQ Sequence 3807 BP; 1224 A; 620 C; 753 G; 1210 T; 0 other; cagtggcgga ttatccatta ggctgagtag gctgaagcct aggggcccgt cagccatagg 60 ggcccggttc ggaagatgat tgcaaatttt ttttacagca aatatcgata taggtatgta 120 tcatggataa atgaatgtat cagaaccaaa tgttattttt aaactaaaaa atcaattttg 180 tagcccagtt atgctgccca aatgtttgtt gattactttg tcttagtcaa aaatttgctt 240 tatcagaacc aaatgttatt tttaaactaa aaaatcaatt ttgtagccca gttatgctgc 300 ccaaatgttt gttgattact ttgtcttagt caaaattttg ctttatcaga accaaatgtt 360 atttttaaac taaaaaatca attttgtagc ccagttatgc tgcccaaatg tttgtttttt 420 actctgtctt agtcaaaatt ttgctttatc agaaccaaat gttattttta aactaaaaaa 480 tcaattttgt agcccagtta tgctgcccaa atgtttgttt ttactctgtc ttagtcaaaa 540 ttttgcttta tcagaaccaa atgttatttt taaactaaaa aatcaatttt gtagcccagt 600 tatgctgaga atggtattct tgaacaaatt gacactactg acagcataca taattgtaga 660 tattttttgg gatgctttat gcatatcgtt aatataaatt aagaatcaat aatggaccta 720 ggattgaacc ctgaagaata cgacacatac tgtacagtaa attctgtgaa ataggaccat 780 agggggtagg gtgtgggtgg atggaacaga aaagctctac acgtaatatc gagggctcct 840 gacttttcta gtaatacaaa ttgctttctg ttagacaaat agctttgaat ccacttaaat 900 attgagccta ttataccgta atattgctta gttataccgt tatatatata tatatatata 960 ataatgcagt atctacattg tatgtttttc gatttcaggt cataatggca tatcatcaca 1020 agtctgggta tcaaaaagct aaagaccgtg aagataaaaa gaaacgtgag gaagcggtca 1080 tttcaaaaac tgttaaaata actgagcttt ttcctacgat tagtgtcaat gtcaatagag 1140 aagacccacc atcgaattca cctttaccat gtgttggtca gtgtggtttg cctggtcaat 1200 ttggtcatga atttgaacgt gataacagtg ttgcagctgt atgcactcaa tctatctctc 1260 actctgaatc tgatgatatg gtagacgcgg agacactttc tcagaaacca gaagcacaag 1320 ctggttccaa aggtgatgac aattcctttt ccactgacat tgggcaatgg tctgaagtgg 1380 aaatcaatga tacattctgt aacttttggg taactcgtgg cagtgaatgt tgtcaacatt 1440 caaatgctga cttcactagt tcaatgcaaa ttgatggaaa acagaagcgc tactgctcac 1500 agaatctctt ttatcgtgtt cattctcgca atggtgagaa agtcccaagg tcatggttgt 1560 gttattcacc agcaacatct cgcatttact gctttgtatg taaattattt tctgctgatg 1620 ataattcatt ctcaaggaat ggtttttgtg attggaaaaa tgcagcttca cgaatacagt 1680 cacatgaaaa cagcattgca catagaaatg caacaatcac attcagccaa agagtacaag 1740 tcagaggatg tgtagattca gcaatagttg agcaatacag acaggaatgt gcatatgcaa 1800 aatctgtgct agaaagagtt gtctcagtta tcaaattcct tgctgaaaga gggctcgcat 1860 ttagaggaca tactgaaata ttggggtcaa gttctaatgg caattttttg ggaattttgg 1920 agttgattgc tcagtttgat ccattcctct ccagtcatat tgacaaccat ggtggtaagg 1980 ggaaaggctc tgtttcatac ctgtcatcaa caatttgtga tgagttaatt gatgtcatgg 2040 gaaagcaagt tatgaaaaca atattgtctg aaatacgtgg agcaaaattt tattcgattt 2100 ctgttgactc tactccagac atatctcatg ttgataggct gtcatgtgtc tttcgatatg 2160 ttttgaacga tggccctata gaaaggtttg tgcaattttt aagcatgaaa ggacatgggg 2220 cagaagattt gtttaacagt ttatacagct ttctggaaga agcaaagttg gatataaaat 2280 attgccgagg tcagtcctac gataatgcta gcaatatggc aggaaaatat tcagggttac 2340 aggcacgcat tcgagagaag aacccattgg ctgaatatgt gccatgtttt gcacactccc 2400 ttaatttagt cggtgcatca gctgtaaact gcgtgacaac agtgagtgga ttttttgatt 2460 ttgtacaaaa cttgtatgtc tttttaaatg catcaaccca acggtgggag cttctaacca 2520 aaggattgga tagtaaccag ttagttctaa aacgcctttc tgatactcga tggtcagcac 2580 attctgcagc aactaaagca ttgagcaaag gctttaaaaa tgttaagaca gtgttagaga 2640 acattgctga ggaccccgag gagaaggttg aagctcgcac tacagctgca ggaattgcaa 2700 agaagatgga tcagttagaa tatggaattt tattagaact ctggacaccc atattagatc 2760 gctttcataa gaccagttta aaactgcaaa gtccacaact tgatctaaat gaggctgttc 2820 aactgttaac ttcactaaaa gaatatgtaa attcacttcg agcacagttt gatttctttg 2880 agaagaaagg aaaggaaata acagggtcaa cttcttataa agaagagaat cagaggaaac 2940 gaaagcgaag tgttcaatta acccgtcatg atggatcagc tgatgaaaca gtgttttcgg 3000 cacagcagaa atttcgagtc atggttttct tgccagtact tgatcacttg aattcagcat 3060 tatgtaaacg atcagaggca tacagtggtg tatgtgataa gtttggtttt ctgagaaaca 3120 ttcgatcact ggaacagtct gatttgcgaa gagccagtag aaacttgata gatacttatg 3180 tagatgattt ggaagagtca tttgaagatg aaatacttca cttcaaagaa tatttgttag 3240 ctgatacgta cttgcaacag aatgagaaag aactgaaatg tagcatagag ctgtatatgt 3300 atcgggcttt gagaaacagc agtgcactac aagatatatt tccaaatgtt gccacagcac 3360 ttcacatata cctcagtttg atgatcacca actgcagtga tgaaagatca ttctctgcat 3420 tgaaacggat aaaaaactat ctgcgaagtg ctttgggtga tgacaaactg aattccctta 3480 ctgttatgtg tactgaatct gacatcctaa gaagctgtga cttcaatgaa attttatcag 3540 actttgttga acgaaaatgc agaaaaatga atttttgatc gtgttgtttt tttctttggg 3600 tcagccattt ataaaatata tactagtatc taatatacct gataatttgt ttgtaatttt 3660 gaaataaaat tagattttat tatagatatt attattataa ataaaattaa aattgtactt 3720 tttgacgtat tttttagaaa aatataggta tatagggccc atgaattagt taagcctata 3780 ggcccataat ttgataatcc gccactg 3807 // ID Gypsy-16_SI-I repbase; DNA; INV; 4147 BP. XX AC AEAQ01025185; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_SI_; KW Gypsy-16_SI-LTR; Gypsy-16_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4147 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01025185; Positions 5084 938. XX CC Positions [3166-3624] - Integrase core CC 'GAATT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(139..2601,2605..4122) FT /product="Gypsy-16_SI-I_1p" FT /translation="MAHPTIPGSLAIEAFDPANTTWRRWLQRLQGVFLIFG FT IKEQARVPYLLHYVGPTAFDMLSDRLDPVDPFTGQYNDLVQILQDYYAPAP FT LEIAENYSFHQRKQQEGESVQQYVAALRKLSIHCKFGNYLNTALRNQLVFG FT LRNKKAQSRLLEKSDLDFEEAVRIAVTMELSDKSSAQMKDTNTEAASVDYL FT KAGKKPSRKNAGDKRPMQMATKDDNYIRNNNNKFSNQYKNSNIKCYRCGKG FT HLANKCNIDRSVRCHNCGKLGHLRAVCFRATSFTNNLEEMLSLEHVNYRDK FT FLLSLEIEGKKVKFELDSGAAVTVMCESEAIKLFPTATIHHTDLRLISFCG FT RALRCKGYITVRVKYNSTMKNLNIYLINGVRMPLLGREWIRQLNVYDSLIC FT NARVNNVTINSQSKLQEILDRYRNIRSPEFASIKNTQVHLKLKKDTSPVFM FT RARSVPFKLQLLVNKELDILEKAGVIEKVESSKWATPIVPILKKDGRIRIC FT GDYKATLNPHLIVDEYPFPTVDELFLKLANGEKFSKIDLKQAYLQLEIAPE FT DRELLTISTCKGLYKVNRMMYGIAPGPTIWQREIEKILQGIPGVAIFFDDI FT VITGSTDTEHLTRLEELLSRLHKFNVRINLDKSKFFIDKIDYCGYVVDSVG FT IHKDNRKIEAIQKMPRPKNVSEIRAFTGMINYYGRFIQNLSSILHPLNKLL FT QKGVPFVWSRDCETAFNKAKAAFTSNQVLVHFNPKLPLVLATDASSYGVGA FT VLSHQYLDGSERVIQFASQTFSSVQSKYSQIDKEAFAIIFGVKKFVQFLYG FT NNFTLITDHRPLVQIFSPSSLPIYSAMRMQHYAIFLQGFNYKIEYRKSEKH FT ANADCLSRLPVDAPQTIADVVDAYQLEIIETLPVTASKLAYETQKDKDVSE FT LLEALQTGKIIHKTKRFNIEQNEFGLVNNVIMRGSRVYVPKILRAEILKEL FT HSGHFGIVKMKNLARSHCWWPDIDNDIEKLVRNCANCNSHKNNPPKVEVHL FT WEAPSAPMQRVHIDFAGPFLRKMFLLMIDAFSKWLEVHIVRDITAKTTITK FT CREIFAAYGIPQVIVTDNGRSSTSAEFQQFLHMNGIKHKRTSPFNPATNGQ FT VERFVQTCKQALKRMNCDTTNVNFALSKLLLQYRAMPHAITNKTPAEMFLN FT RKISTRLDLMIPVHNNSKLYDNTTENVKTFSCGERVACRNYSGSVKWKFGI FT ISARKGKLHYSIRLDDGRSWERHANQMRKIGENTPTSCTENDHYYWDIEES FT REPECPSVSPSSNTENHLDAPQNAAPVSPQSNIGVDARSEVEAQRRSVRYK FT KRPDFFSECTAKYNK" XX SQ Sequence 4147 BP; 1366 A; 791 C; 894 G; 1096 T; 0 other; attggcgacg aggatgggat cgccataatt cgttctgcta aagacatatt ctctccgcgg 60 cgaagccggc taacctgtgc gtgatcgcca cgtgctgaga atacaaaggc tcgtggtacg 120 attcgtgata caaggatcat ggcgcatccg acaataccag gatctctcgc catcgaagct 180 ttcgacccgg cgaatactac atggaggaga tggctacagc gtcttcaagg agtctttctg 240 atttttggaa tcaaagaaca ggctcgggtc ccgtacttgt tacattacgt ggggcccacg 300 gcttttgata tgctgagtga tcgactggat ccggtcgatc ctttcactgg ccaatacaac 360 gacttagttc aaatattaca agattattac gcgccggcac cgcttgagat tgctgagaat 420 tattcatttc atcagcggaa acaacaggag ggtgagtccg tacaacaata cgttgctgca 480 ctaaggaaac tcagtataca ttgcaagttc ggcaattacc tcaacaccgc cttgcggaac 540 caactggttt tcgggctgcg aaataaaaag gctcaaagtc gattgctgga aaagagtgat 600 ttggacttcg aggaggccgt tagaatcgca gttacaatgg aactctccga caagagctct 660 gcgcagatga aggatactaa tacagaggct gccagcgtgg actacctcaa ggcaggcaag 720 aaaccttcga ggaaaaatgc aggtgataaa aggccgatgc agatggccac gaaagacgat 780 aattacatcc gtaataataa taataagttt tctaaccaat acaaaaactc taacataaaa 840 tgttacaggt gcggtaaggg ccatttggca aataaatgta atattgatag aagtgtacgt 900 tgtcacaact gtgggaaact ggggcacttg cgtgcggtct gcttcagagc tacgtccttc 960 actaataatt tggaagaaat gctcagttta gagcatgtga actaccgcga caaattcctg 1020 ttatcattgg aaattgaagg aaagaaagtc aagttcgagt tggatagtgg agcggcggtt 1080 acagtcatgt gtgagtctga agcgatcaaa cttttcccaa cagcgaccat tcatcataca 1140 gacttgagat taatttcgtt ttgcggccgc gcacttagat gtaagggtta tattacagtt 1200 cgcgttaagt acaattctac aatgaaaaat ttaaacattt acttaattaa cggagttaga 1260 atgcctctgt tagggcgaga gtggattagg caactcaatg tttatgattc attaatctgt 1320 aatgcacggg taaataatgt aacaataaat tcacaaagca aattacaaga aatattagat 1380 agatatcgga acattcgttc tccagaattc gcttcaatta aaaatacgca agtacattta 1440 aaattaaaga aagatacgtc tccggttttc atgcgagctc gatcggtgcc tttcaaatta 1500 caattgttag ttaataagga gcttgatata ctagaaaaag ccggcgttat cgaaaaagta 1560 gagtcgtcaa aatgggctac acccattgtt ccaattttaa agaaggatgg aagaatacgt 1620 atttgcggcg attataaagc cacgcttaat ccgcatctga ttgtcgacga gtaccctttt 1680 cctacagtag acgagttgtt cttgaagctt gcaaacggag aaaagttttc caaaattgac 1740 ctaaaacaag cttacctaca gttagagata gcacccgagg atcgggagtt gctaactatc 1800 agtacatgca agggactcta taaagtaaat cgtatgatgt atggaattgc cccggggcct 1860 actatatggc aacgcgaaat cgaaaaaatt ttgcaaggaa taccgggggt tgcgattttc 1920 tttgatgata tcgttattac aggtagtaca gatacagaac atttaacacg actcgaggaa 1980 ttactaagta gattacataa attcaatgtt cgtataaatc tagataagtc caagttcttt 2040 atagataaaa tcgattattg cggatacgtg gttgacagtg ttggcataca taaagataat 2100 cgaaaaatag aagctataca gaaaatgcct cggcctaaaa atgtgtcgga aatcagagct 2160 tttacaggca tgatcaacta ctacggtaga tttattcaga atctaagttc tattttacat 2220 ccattaaaca agctgctgca aaaaggagta cccttcgtat ggtctcgtga ttgtgagact 2280 gcttttaata aggctaaagc ggcttttact agcaatcaag ttctggttca ttttaaccca 2340 aagttacctt tggtattagc gaccgatgca agttcgtacg gagtcggagc ggtgctgtcg 2400 caccaatatc ttgacggatc cgaaagggtc attcaatttg cttcacaaac attttcaagt 2460 gttcaatcaa agtactcgca gatagacaaa gaggcgtttg caatcatttt tggtgtaaaa 2520 aaatttgttc aatttctata cggaaataac ttcacgctaa taacggatca tagaccgttg 2580 gtacaaatct tctcgccctc ttagagtttg cctatatact cagcaatgcg catgcagcat 2640 tatgctatat ttctacaagg atttaattat aaaatcgagt ataggaaatc tgagaaacat 2700 gctaatgcgg attgtttatc cagactccct gtggatgcgc cacaaaccat agcagacgtg 2760 gtagatgctt atcaattaga aataattgaa accttaccgg taactgcgag taaacttgct 2820 tatgaaaccc aaaaagacaa agacgtaagc gaattactag aagcattaca gactggtaaa 2880 atcatacata aaacaaagcg ttttaatatt gagcaaaatg aattcggtct agtgaataat 2940 gtaattatgc gaggttcacg cgtttatgtt cccaaaatac tacgagcgga aatattaaaa 3000 gaattgcatt caggacactt tgggattgtc aaaatgaaaa atttggctcg aagccattgt 3060 tggtggccag acattgacaa tgatatagag aaattagtac gtaattgcgc aaactgcaat 3120 agtcacaaaa acaatcctcc aaaagtggaa gtacatttat gggaagcgcc atcagcgcca 3180 atgcaacgcg tacatatcga ttttgcgggg ccattcttga gaaaaatgtt cttgcttatg 3240 attgatgcat tctcgaaatg gctcgaagta catattgtaa gagatatcac agcaaaaact 3300 acaattacaa aatgccggga aatctttgct gcatacggga taccacaagt gatagtaact 3360 gataatggta gaagctccac ttctgcggaa tttcaacaat ttttacacat gaatggtatt 3420 aagcataaac gtacttcacc tttcaatccc gcaacaaacg gtcaggtaga acgttttgtc 3480 caaacatgta aacaagcgct aaaacgaatg aattgcgata ctacaaatgt taattttgca 3540 ctaagtaaat tactattaca gtatagggca atgccacatg cgataactaa caaaactcct 3600 gcggagatgt ttctcaatcg gaaaatatcc actagattag atttaatgat accagtacat 3660 aataactcaa aactatatga taacactaca gaaaatgtga aaacattctc gtgcggggag 3720 agagttgcgt gtcgtaacta ttccgggagc gtaaaatgga agttcggcat aatatcggca 3780 agaaaaggaa aactacacta ttcaattcgg ctggatgatg gtcgctcttg ggaaagacac 3840 gctaatcaaa tgcgtaaaat aggcgaaaat actccaacga gttgtacgga gaacgatcac 3900 tattactggg acattgaaga gtctcgggaa ccggaatgcc cgagcgtttc cccatcatca 3960 aatacggaga atcatctgga cgcgcctcaa aatgctgcgc cagtttcccc acagagtaat 4020 ataggcgtag atgcgagatc tgaagtggaa gctcagcggc gttcggttag atataagaag 4080 cgtccagatt tcttttccga atgtacagca aaatacaaca agtaggaatt ttctttacga 4140 gagaagg 4147 // ID LanceleTn-4 repbase; DNA; INV; 245 BP. XX AC . XX DT 15-MAY-2006 (Rel. 11.05, Created) DT 15-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; LanceleTn-4. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-245 RA Osborne P.W., Luke G.N., Holland P.W.H. and Ferrier D.E.K.; RT "Identification and characterization of five novel miniature RT inverted-repeat transposable elements (MITEs) in amphioxus RT (Branchiostoma floridae)."; RL Int. J. Biol. Sci 2(2), 54-60 (2006). XX DR [1] (Consensus) XX SQ Sequence 245 BP; 64 A; 55 C; 56 G; 70 T; 0 other; ggtggtatct cactgcactt ggggcaccgg tgcggcactg cggggttcgt tcactgcggc 60 actgttgtgt tattttcgcc gattttttat aatttagata ttgcgtaata cgtaaaagta 120 tgacttagaa gacaacaaaa tacacaaaac gtaagaaaat tcgttcttta tctctgaaat 180 tcgttgagta atctttcgaa ccccgcagtg ccgcaccggt gccccaagtg cagtgagata 240 ccacc 245 // ID L1_Ele12 repbase; DNA; INV; 4433 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele12. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4433 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4433 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 5 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 151..1191 FT /product="L1_Ele12_1p" FT /translation="MSVRRENTFRIDYANVPKKPSFEELHDFVGSALGLSY FT EQVVQLQPSRALGCAFVKVVDLELAQKVVAENDNKHVXEVDGKTYKLRITL FT EDGAVEVKLTDLSEDITNDQISEFLSAYGEVLSVTEQVWDSKYRFGGIPTG FT TRIVRMFVKRNIESYITIDGQTTNVVYFGQLHTCRYCNEYVHNGISCVQNK FT KLLVQKTYANVAKQAESNPSVAKPKATKQKTSFAKLFGPKPREVQQQEPST FT KTTTVTEIFPSLPKTNVSCPTGVTPIGQILTRANLAEQRKNLMPPPVLPQN FT LSTTSRMSTRQASDGNETDISSASNSSKRRSGRPPGKKQRQNNEDDASEEL FT AGQF" FT CDS 1194..4361 FT /product="L1_Ele12_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MAFNSYNIASININTITNTTKINALRTFLRMLEIDIA FT FLQEVENEQLLLPGYHVVCNVDHTRRGTAIALKEHVNFSNVEKSLDGRLIA FT LRVQNTTLCNVYAPSGTAFRAERERFYHNTLAYYLRHSSEHILLAGDFNCV FT LRQCDATGHNPSPALHTTVQQLKLLDVWEKLRPSLPGYTYITHNSSSRLDR FT MYVSQSLRSHLRSTDIHVCSFSDHKALTARICLPYLGREPGRGFWSLRPHL FT LTAENVQEFQLRWQYWTRQRRNFGSWMQWWMSVAKPKIQSFFRWKSKTAFN FT DFHREHQRLYSNLRQAYDNYHQNPAVLPSINRIKAQMLTLQRNFTQTFVRI FT NESYVAGEPLSTFQLGERRRRKTVITQLRDENNNPIDGSEAVERHMLEFYR FT ALYATGEATEVMDEFACPRVIPENDPTSESCLSEITPADIYAAIKTSSPNK FT SPGGDSIPREFYLKTFDVIWRELTLVMNEALAGNFPADFVDGVIVLVKKKG FT NDQTAHSYRPISLLNCDYKIFSRILKQRLENVMRTHRLISSGQKCSNSGRN FT IFQATLSLKDRIAKLKHTKQRGLLASFDLRHAFDLVDRAFLARNMCSLGFN FT PAFVRLLTKIGELSSSRLLVNGYLSESFPIQRSVRQGDPISMHLFVIYIQP FT LIKRLEEIGGPDLVVAYADDISVISTSKVRLERMRETFHRFERVSGAKLSL FT TKSSSISVGYTDDMPLTVPWLKNEYSVRILGVTFVNSIRLMNKLNWDEIVG FT KFTRLVYLHIPRTLTLHQKIILLNTFITSKVWYIASVLSPSAAHKAKITAT FT MGNFLWRGIQARVPMHQLARCREAGGLKLHLPAMKCRALLLNRHVRDIDSL FT PFYKSFLNQNNPNPPSDCPCLKLILNDFPLLPPLVQENPSANGIHSVYLEE FT TDEPKISREHPEANWRKIWANISVRQLNSAQRSLLYLLINAKLEHRRLWFR FT MNRTDGENCLHCNCALETLEHKLSECVRVEAAWRLLQRNITTLLNGWRILT FT IDDLLRPQLDGVAKWKRTRILKMFQQYVFFIMECNDAIDLVALEIEIQHA" XX SQ Sequence 4433 BP; 1278 A; 1096 C; 976 G; 1082 T; 1 other; cagtttgcgc tcaacttcca tgcagaacag ttgtgtttct cgctggaagc cttaagctag 60 ctatcgatta ttctccctat cgtttaaaat accgtaaagt ttgtttcgcc gttccgtttc 120 gtcgcgagta gtaccgcggg tggtgccgcg atgagtgttc gacgcgaaaa cacgtttcgc 180 attgattatg caaacgtgcc gaagaagcca tcctttgagg agcttcacga ctttgttggc 240 tcagctttgg gtcttagtta cgaacaagtt gttcaactac aaccgagtag agcactcggg 300 tgtgcgtttg tgaaggtcgt tgacctggaa ctggcgcaaa aagtggtagc cgaaaacgac 360 aacaagcacg taascgaagt agatgggaaa acctacaaac ttcggatcac actggaggac 420 ggagctgtgg aggtcaagct gaccgatctg tccgaagata tcacgaacga tcaaatttct 480 gagttcctca gcgcatacgg agaagttctc tctgtcaccg agcaagtgtg ggacagcaaa 540 taccgcttcg gtgggattcc tacaggtact cgaattgtac gaatgtttgt gaaacgcaac 600 atcgaaagct acatcaccat agatggacaa accacgaacg ttgtttattt cgggcagctc 660 cacacatgtc gctactgcaa cgaatatgtc cacaacggta tctcttgtgt gcaaaacaag 720 aagcttctcg tacagaagac ttacgccaac gtggcgaaac aggccgagtc aaacccgagc 780 gtagcgaaac ccaaagcgac aaagcaaaaa acttcgttcg ctaaactttt cggaccaaag 840 ccccgagaag tacagcaaca ggagccctcg actaagacaa caaccgtaac cgaaatcttc 900 ccctccttac cgaaaacaaa cgtatcgtgc cccaccggcg tgacgccgat tggtcaaata 960 ttgacaagag cgaatctagc agaacaacga aagaatctca tgccaccacc ggttctccca 1020 cagaacttgt cgacaacaag cagaatgtct acccgccaag ccagcgacgg gaatgagacc 1080 gatatatcat ctgcatcgaa tagtagcaag cgccgaagtg gtcggccgcc gggtaaaaag 1140 cagcgacaga acaacgaaga cgatgcgagc gaggagttag ctggtcagtt ctaatggctt 1200 tcaacagtta taacatcgct tcgatcaaca ttaacacaat caccaacact actaaaatca 1260 atgcactccg aacgttcctt cggatgctgg aaatcgatat cgcattttta caggaagtgg 1320 agaatgagca gctcctctta cctggctacc acgtagtgtg taatgttgat catacgagaa 1380 ggggaacggc aatagcgctg aaagagcacg tcaacttctc aaacgtcgag aaaagtttag 1440 acgggcgact catcgctctt agagtgcaaa atacaacgct ttgtaacgtt tacgctccat 1500 caggcacagc atttcgtgct gagagagagc gtttctacca caacactctc gcatactatc 1560 tccggcattc gtcagaacac atcttgctag cgggcgattt taactgtgtt ttgcgacagt 1620 gcgatgccac gggccacaat cccagccctg cattgcatac aaccgtacag caacttaaac 1680 tcctcgacgt ttgggaaaaa ctccgtccaa gcctacctgg ctacacctac ataacacaca 1740 attcctcttc tcgtctggat cgaatgtatg tgagccagag cctacgaagc catttgcgat 1800 caacagacat acatgtatgc tcattttccg atcataaggc attgacagcg cggatttgcc 1860 ttccctatct tgggcgagaa cctggtcgtg gattttggtc ccttcgtccg catctgttaa 1920 cagcagaaaa cgtccaagaa tttcagttgc gctggcaata ctggacccgc caacgccgga 1980 actttggttc atggatgcag tggtggatgt ctgttgcgaa acctaaaatt caatctttct 2040 ttcgctggaa atctaaaacc gcttttaatg atttccaccg tgaacaccaa cggctttaca 2100 gcaatctgcg tcaagcatac gacaactatc atcaaaatcc agcggtgctg ccatccatta 2160 accgtattaa ggcacaaatg cttactctgc agcgcaactt cacccaaacg ttcgtgcgca 2220 tcaatgaatc gtatgtagct ggggagccgt tatcaacgtt ccagcttggc gaacgacgta 2280 ggagaaaaac agtgatcaca cagctgcgcg atgaaaacaa caaccccatc gatggttccg 2340 aagcagtaga aagacacatg ctcgaattct atcgtgctct ctacgctact ggtgaggcaa 2400 cagaagtcat ggacgaattt gcgtgcccca gagtgattcc cgagaacgac ccgacgagtg 2460 aaagctgttt gagtgaaatc acaccagccg acatctacgc agcaatcaag acaagtagcc 2520 cgaataaatc tcctggggga gactcaatcc ctcgtgagtt ctacctaaaa accttcgacg 2580 tcatttggcg agaattaacc cttgtcatga acgaggcgct tgctggcaat tttccggcag 2640 attttgtcga tggggtgatc gtcctagtga aaaagaaggg aaatgaccaa acggcacact 2700 cgtataggcc aatatcactt ctaaattgcg actacaagat attttctcgc attctaaaac 2760 agcgtcttga aaacgtaatg cgtacacacc gactaattag tagcggacaa aaatgttcca 2820 actctggtcg caacatattc caggccaccc tatcattaaa agatcgaatc gccaaactaa 2880 aacacactaa acagcgaggt ttattggcat ctttcgatct ccggcacgct ttcgatctcg 2940 tagaccgggc tttcctagct cggaacatgt gctcgctcgg cttcaaccca gccttcgttc 3000 ggttgctcac caaaatagga gagctctctt cgtcgcgatt actcgtaaat gggtacctgt 3060 cggaatcatt ccccatccaa cgatcggttc gccagggaga tccaatctcc atgcatctat 3120 ttgtcatcta tattcagcct ctgatcaaac ggctagagga gataggtgga cccgatctcg 3180 tggtggctta tgcggatgat atatctgtaa tctccactag caaagtgaga ctagagcgta 3240 tgagagagac tttccaccga tttgagcgtg tgtctggggc caagctgagc ttgaccaaat 3300 cgtcgtcaat atctgtggga tacacagatg atatgccact cactgtaccg tggttaaaaa 3360 acgaatacag cgttcgtatc ttgggagtga cttttgttaa ttccatacgc ctgatgaaca 3420 agctaaattg ggatgagatt gtgggaaaat tcactcgcct ggtttacctg catattccac 3480 gtactctcac gctgcatcag aagatcattt tgctaaacac gttcatcaca tcgaaggtgt 3540 ggtatattgc gtccgtactt tcaccgagtg cagcacataa ggcaaagata acggcgacga 3600 tgggaaattt cctgtggcgt ggcatccaag caagggtccc tatgcatcag cttgctcgct 3660 gcagagaagc tgggggcttg aaattacatt tgcctgcaat gaaatgtaga gctcttctcc 3720 tgaaccgtca tgtgcgcgac attgactccc ttccatttta caaatccttt ctcaaccaaa 3780 acaatcctaa cccaccctca gactgtcctt gcctaaaact catccttaat gattttccct 3840 tactccctcc tttggtccaa gaaaacccct ccgccaacgg tatacatagt gtttatctgg 3900 aggagactga tgagccaaaa atatcgcgag agcacccaga agcaaactgg cgtaaaattt 3960 gggcaaacat ttcggtgcgt cagttaaatt ctgctcaacg tagtttgctt tatttactga 4020 taaacgccaa actcgagcat cggcgactct ggtttcgcat gaacaggact gatggagaaa 4080 attgcctgca ctgcaactgt gcacttgaaa cgctggaaca taaactcagc gaatgcgtac 4140 gcgtcgaagc agcgtggaga ttattgcagc gaaatataac aactcttctg aatggatggc 4200 gtatactaac catagatgac ctgttgcgac cccagctgga tggagttgct aaatggaaga 4260 gaacacgtat tcttaaaatg tttcaacaat atgtcttctt tattatggaa tgtaatgatg 4320 ctatagacct agttgcactt gaaattgaaa ttcaacatgc ctaaatgatt ataatgtaat 4380 tatttttttt acgtactaat aaacacattt tatcaattaa aaaaaaaaaa aaa 4433 // ID Gypsy-131_AA-LTR repbase; DNA; INV; 161 BP. XX AC AAGE02027875; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-131_AA_; KW Gypsy-131_AA-I; Gypsy-131_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-161 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027875; Positions 107569 107409. XX SQ Sequence 161 BP; 48 A; 27 C; 31 G; 55 T; 0 other; tgtgctgtcg cgtgttccaa agcgttgtta acgtggcaag ccaagtatcg ttcgggtagc 60 aacatcctag taacataagt tgttgatttg caataaactt tattctgtat tttctcgtca 120 taaaagaaag acgtattatt taatttgtct aatacagaac a 161 // ID Gypsy-72_AA-LTR repbase; DNA; INV; 179 BP. XX AC supercont1.159; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-72_AA_; KW Gypsy-72_AA-I; Gypsy-72_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.159; Positions 467502 467680. XX SQ Sequence 179 BP; 58 A; 35 C; 34 G; 52 T; 0 other; tgtaaccata ataaatacag acgatcgttg agatccaccc taaagacgtt atcgtatcat 60 tgtagtcata ttgtgccgat gaagagaatt tggaaataaa agttagttga attcgtacca 120 cagaaacgaa cgcctcgtgt ttaagttccc tatcggtatc acagagtctc tgctttaca 179 // ID Transib2_AA repbase; DNA; INV; 796 BP. XX AC CC150426; XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 27-JUL-2005 (Rel. 10.05, Last updated, Version 2) XX DE Transib2_AA is a DNA transposon, a partial fossilized copy. XX KW Transib; DNA transposon; Transposable Element; KW Interspersed repeat; DDE-class; TRANSIB superfamily; KW Transib2_AAp transposase; Transib2_AA. XX NM Transib2_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-796 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR GenBank; CC150426; Positions 796 1. XX CC Transib2_AA belongs to the Transib superfamily of DNA CC transposons. CC The consensus sequence is not complete; termini are not known. CC Transib2_AA encodes remnants of the Transib2_AAp transposase. CC The transposase is not perfectly recovered due to available CC sequence CC data. XX FH Key Location/Qualifiers FT CDS 240..791 FT /product="Transib2_AAp" FT /note="transposase" FT /translation="RLPFKTWQVRGDVNKKTFEERKKRILNEFKVQLGLIV FT DKPKPGYGTTNDGNTARTFFAHPKVSAAITGVDETLIVKFATILRVVACGR FT EIDIPKYRELLLETREQYFSLYSWYYMPLTVHKLLIHSANIISMFELPVGE FT LSEEALEATHKIIRRARLNHTRKSSRINTNEDLMRCLLLNSDAFL" XX SQ Sequence 796 BP; 249 A; 152 C; 159 G; 236 T; 0 other; tacgttgtct cacgaaaaag catcagccaa atgttacatt tgtggtgcaa ccccgaaggg 60 catgaataat cccgaagttg atcagaaacc tgaaaataca gataatatgc gatttggaat 120 gtcggtttta tactgctgga ttaacatgtt cgattgcttt cttcatatcg cttacaggta 180 tttattcatt tatttctttc cagtgtaaac caattgatca tttttttctg tttttttaaa 240 ggctgccttt caaaacttgg caggttcgtg gagatgttaa taagaaaacc ttcgaggaac 300 gtaagaaaag aatcctgaac gagtttaaag ttcagctggg attgatagtt gataagccta 360 aaccaggcta cggcacaacc aatgacggca atacagcgcg tacttttttt gctcacccta 420 aagttagtgc tgctattacg ggcgttgacg aaactcttat agtaaaattt gcaactatac 480 taagggtagt tgcctgcgga agagagatcg acattccaaa atatcgggaa ttgttactgg 540 aaacaaggga acaatatttt tctctttaca gttggtacta tatgccatta accgtgcata 600 aactgctaat acatagcgcc aacattattt caatgtttga gttgcccgtc ggcgagctgt 660 cagaagaagc cctagaagct actcataaaa tcataagaag agcccggttg aatcatacca 720 gaaaatcgtc cagaatcaat actaacgagg acttaatgag atgcctgctc ctaaattcag 780 atgcttttct ataagt 796 // ID Copia-17_SI-LTR repbase; DNA; INV; 253 BP. XX AC AEAQ01022875; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_SI_; KW Copia-17_SI-I; Copia-17_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-253 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022875; Positions 1516 1264. XX SQ Sequence 253 BP; 54 A; 62 C; 43 G; 94 T; 0 other; tgttggagca gatggcgctt atgccaagtt ttcatagtta ttttttcgat taaggcgcta 60 tctatgacca tctcggttgt ttccgctcta ttgtgcgctt atgcaaaact ctctattgta 120 tctgttacgt ttcggccccc cccccttgcc accaaaatat atatgtgtgt gcatctctgc 180 attaatactc tttatcattc ctgtccactg cgtacgatca ttaagatatt tccgaagtcg 240 atttattcca tca 253 // ID CR1-49_AAe repbase; DNA; INV; 5159 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-49_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5159 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1136-1136 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 272..1357 FT /product="CR1-49_AAe_1p" FT /translation="MDCQICSTTLDPGRAVICSGSCGMSFHFTCVEMSKSQ FT YSSWASKIGLMWFCKICRSNFDPVVHDREKVIMKALRELLIRTDSMNTRLG FT NYGENLRRVNKTLSGMLLNSKTPNHTKQTTFRHSIDDLNLDDTLEDTTNCS FT RSCEETSFFEVLDEIDNSIANVPDKFVVGSDKRVQIVASRQKASKSSTNKA FT RIDVSTPAAKRHYFDENVEASTSHGNSNSIEECTTKRHPPAIHGSSSTTNG FT NNSNIYSEPNRTKLKVATNPQATADSESFYXTPFEPNQSEDDVKNYVMDIS FT NLHASLVKVTKLVPRGRRIEDLSFVSFKVTVCQSASAVVGDSWYWPEGISV FT RLFEPNQKNGSAVLLPNTQ" FT CDS 1405..5025 FT /product="CR1-49_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAPKPTLTVEIGQPALIRRPGPVLGSXGGVFQPFLS FT GKYASNKNFTGSDDTTSNSSGESIDSLAQQAQQAFTTGHFYPGRTTLSTME FT ASMPTHTVQTLLPALNRRPGPVLGTGRGVFQPVLSGKYTMCKNFICSDDVP FT SNSFCDDSDRFIQSSGFNRPVQIVAINESDSHPGRTTLGTMEAPKPPITVQ FT TFQPAFLSRPGPVLGVSEGVFQRPLLGKYNLAENDFRSDGSYPHSKLAIPE FT TIKTCRDDHRLLVYYQNVRGLRTKIDNFLLDATESSYDIVVLTETWLDDKI FT YSAQLFGDKYTVFRNDRNQINSAKSRGGGVLIAVSTRLSCSIDPSPICETL FT EQLWIRIKTPGRNIIVGVLYLPPNRKSNCMDIEAHVRSIGLVASTLKSADI FT FLQFGDYNQSGIVWSTHLCNYPLIDAQKSNVNEPCSILLDGFSLHGLTQIN FT TVINRNGKLLDLVLTNEPGLEICSVSEAIEPLIALEPVHPALEVSIKLIKP FT IIFEDIPESPNLDFRRADYEALNDAIAAANWEFLENATDIDDSVEYYTDVV FT KRIIADNVPIGRPPCKPIWSDPHLRKLKSLRAKALRKYSRSRCPSQKRLFN FT YASNRYRLYNRLLYKRYVLRTQDNLRRNPKQFWSFVKTKRKENGLPAEMHL FT ESETASTVLHKCNLFANHFKRTFNTDSASESHVTEALTNTPRDVIDFNMFR FT ISEINVKDAIDKLKFSYSAGSDQIPSCILKKCFNAFVKPLVLLYNASLQQR FT KFPTAWKLSEMFPVYKKGDKTNIENYRGITSLCSTSKVFEIIINDALFASC FT KNYISPDQHGGFFPRRSVSTNLVSFVSDCLRNMDSGNQIDAVYTDLKAAFD FT RVDLAILLGRLEKIGMSAPFVCWFKSYLTNRELYVKIGSEKSMKFSNLSGV FT PQGSNLGPLLFVIFINEIAMILPPGCRVFYADDVKIYMVIKSLIECMELQS FT RLSCFQTWCIRNNLTLSIGKCLVISFHRKLKPIIFDYHLDGHLLQRVEQVR FT DLGVTLDSALTFRTHYNDIISRANKQLGFIFKISDEFRDPTCLRSLFCSLV FT RPILEFSVVVWCPHHANWITRMEAVQKKFVKFALRGLPWLDPLNLPSYEDR FT CRLLGLVTLERRRFNAQAIFVAKILKGEIDSPPILQEINLYAPERRLRQRD FT FLQLGARNAQYGQYDPVRYMCSSFNAVFDLFDFNVSCDSFRFQLCQRR" XX SQ Sequence 5159 BP; 1438 A; 1201 C; 1049 G; 1468 T; 3 other; cgttttgact atagtgctgt taacgcgtta tcgagctata tttacggtga tgctatagta 60 tagctattat tgtttttcgc ccattgacaa gcataaccac gctcacttgc ctttctgaag 120 agccatctgc cagaaatgtt ttaaaactgg tggagctttc tgtggaactt atttgaaccg 180 gctgtactac acgctgcata agcgtaggct catcaacgca tctccttcca caaccgtcat 240 attgcgacac attgagtctc ccacatacac gatggattgc caaatttgct caacaaccct 300 ggatcctggc agagcagtaa tatgcagcgg atcttgcgga atgtcctttc atttcacatg 360 tgtggagatg tccaaatccc agtactcgtc atgggcgtct aaaattggac tgatgtggtt 420 ctgtaaaatc tgccgttcaa attttgatcc ggttgtccat gatcgggaaa aagttatcat 480 gaaagcctta agggagttat taatccgtac agattctatg aatacccggc ttggtaacta 540 tggtgaaaat cttcggaggg tcaacaagac cctttctggt atgctcttga actcaaaaac 600 gccaaatcac acgaagcaga ctacgtttcg tcatagtatt gatgacctta atctagacga 660 caccctagag gacacaacaa attgttctag atcgtgtgaa gaaacttctt tcttcgaggt 720 ccttgatgaa atagataact cgattgcaaa tgttccggac aaatttgttg ttggatccga 780 caaacgtgtc caaatcgttg ctagccgcca gaaagcctcc aaaagttcaa ccaataaggc 840 acgtatcgat gtttcaacac ctgctgctaa acgacactac tttgatgaaa atgttgaagc 900 tagcacttct catggcaaca gcaatagtat cgaagaatgc accacaaaac gtcaccctcc 960 agctatccac ggttccagtt ctaccacaaa tggaaataat tcaaacatct actccgaacc 1020 caatcgtacc aaattgaaag tagcaactaa tcctcaagca actgctgaca gcgaatcgtt 1080 ctacktcact ccatttgagc cgaatcaaag tgaagacgat gtgaaaaatt atgttatgga 1140 catttctaat cttcatgcat ctctcgtaaa agtgaccaag ctagttcctc gcggcaggcg 1200 tattgaggat ctttcatttg tatcatttaa agtcactgtt tgccaatcag cttcagccgt 1260 tgttggagac tcttggtatt ggccggaggg aatttctgtg agattgttcg aacccaacca 1320 aaaaaacgga tctgctgtac ttcttccgaa cacacagtaa aacagtccac tcttctacat 1380 ccgggacgca cgacgactag ctctatggaa gccccaaagc ccaccctcac agtcgagatc 1440 ggccagccag cgctcatcag acgtcccggt cctgtgcttg ggtcckgtgg aggggtcttc 1500 caacccttct tgtcaggcaa gtatgcatca aataagaact ttactggctc tgatgatact 1560 acttccaaca gctcggggga aagtatcgat tcgcttgcac aacaagccca gcaagctttc 1620 accaccggcc atttctaccc gggacgcacc accttaagca ctatggaagc ctctatgcct 1680 acccacacag tccagaccct cctgccagck ctcaacagac gtcccggtcc tgtgttggga 1740 acaggtagag gggtcttcca gcctgtgcta tcaggcaagt acacaatgtg taagaatttt 1800 atctgctctg atgatgttcc gtccaacagc ttctgtgatg actccgaccg gtttattcag 1860 tcatcagggt tcaatcgacc cgttcaaatt gttgcaatca atgaatccga ttctcatccg 1920 ggacgcacga cacttggcac catggaagcc ccaaagcccc ccatcacagt ccagaccttc 1980 cagcctgcgt tcctcagtcg tcccggtcct gtgcttgggg tgagtgaagg ggtcttccaa 2040 cgtccactac taggcaagta caatcttgct gagaacgatt tccgctctga tggatcttat 2100 ccccacagca aactggctat tcctgaaaca ataaaaacat gtcgagatga tcatcgtctg 2160 ttggtgtatt accaaaacgt tagaggactc cgtaccaaga tcgacaactt tcttctggac 2220 gccactgaat cttcgtacga catagtagtt ttgaccgaaa cgtggctgga tgacaaaatt 2280 tattcagctc agctttttgg cgacaagtac acggtgtttc gaaatgatcg caatcagatt 2340 aatagtgcaa aatctcgagg aggaggcgtc ttaattgctg tttctacgcg tttatcatgc 2400 agcattgacc cgtctcctat ctgcgaaaca ctggagcagt tatggattag aattaaaact 2460 ccaggaagaa acattatagt aggagtactg taccttccac caaaccgaaa gtcaaactgt 2520 atggacattg aagctcatgt acgatctatt ggtttggttg cttccacatt gaagtctgca 2580 gacatttttc ttcaatttgg cgactataac caatccggca tcgtgtggtc cacgcactta 2640 tgcaattatc ccttaatcga tgctcaaaaa tccaacgtaa acgaaccttg ctcaattctt 2700 ttggatggtt ttagtttaca tggattgaca caaatcaaca ctgtaatcaa tcgcaatgga 2760 aaactacttg acctggtgct aaccaatgaa ccaggtttag aaatctgttc tgtttctgaa 2820 gctattgaac cactgattgc actagaacct gttcatcctg ctctagaagt gtcgatcaag 2880 ttgatcaaac caattatttt cgaagatatt ccggaatcgc ctaacctgga cttccggcga 2940 gcggattacg aagcgttgaa tgatgcaatt gccgcagcca actgggaatt ccttgagaat 3000 gctaccgaca ttgatgattc agtggaatac tacaccgacg ttgtgaaacg gataattgct 3060 gataatgttc ctattggaag acctccctgt aaaccaatat ggtccgaccc tcatttgcgc 3120 aaattgaaga gtctgagagc aaaggctctt cgaaagtaca gtagatcgcg atgtccttcc 3180 cagaaacgat tgttcaacta tgctagcaac cgatacaggc tttacaatcg tttgctgtat 3240 aaacggtatg tattacgcac tcaagataat cttcgccgca acccaaagca gttctggtct 3300 tttgtaaaaa ccaagaggaa agaaaatggt ctacctgctg aaatgcatct tgaaagtgaa 3360 accgcttcta cggtgctgca taagtgcaac ttgtttgcca accacttcaa gcgcacattt 3420 aacaccgatt ccgcatccga atcacatgtc actgaagcac tcaccaacac gccaagagac 3480 gttatcgatt tcaatatgtt cagaatatct gaaatcaacg tcaaggacgc cattgataag 3540 ctgaaatttt cctattctgc tggctcggat cagattccat cttgcatcct gaagaaatgt 3600 tttaatgcat tcgtcaaacc gcttgtatta ctatacaatg cctccttgca gcaacgtaaa 3660 tttcccacgg cttggaagtt gtcagaaatg ttccctgtct ataaaaaagg ggataaaaca 3720 aacatcgaga actaccgagg tatcacctca ttatgctcta catcaaaagt ttttgagatt 3780 attatcaacg acgcgttatt cgccagttgc aaaaactaca tttcaccgga tcaacacggt 3840 ggttttttcc ccagacgatc tgtctctact aatttggtga gcttcgtttc tgactgccta 3900 cggaacatgg attccggtaa tcaaattgac gctgtatata ccgacctgaa agctgcattt 3960 gatcgggtgg atctagccat tttgttaggc agactggaga aaattggaat gtcagcacct 4020 ttcgtctgct ggtttaaatc ctacctcacg aatcgggaac tttacgtgaa gattggatca 4080 gaaaaatcga tgaaattcag caacctctca ggcgtccctc aaggtagcaa cctcgggcct 4140 ctgctctttg ttatttttat caacgaaatt gcgatgatcc ttccgccagg atgtcgggtc 4200 ttttatgcag atgatgtaaa aatttacatg gtgatcaaat ctctcatcga atgcatggaa 4260 ctacagagtc ggctgagttg ttttcaaacg tggtgcatac gaaacaacct cacactaagt 4320 attggtaaat gtctggtgat ttcatttcat cggaagctca aaccaataat cttcgattac 4380 catcttgatg gccatcttct tcagcgagtg gaacaagttc gagatttggg cgttactcta 4440 gatagtgctc ttacgtttcg cactcactac aacgacatta tcagtagagc caataaacag 4500 cttgggttca ttttcaagat ctctgacgaa tttcgcgacc caacttgcct acgctcatta 4560 ttttgctcac ttgttcgtcc tattttggaa ttcagcgtcg tggtgtggtg ccctcatcat 4620 gcaaattgga ttacgcgtat ggaagcagta caaaagaagt ttgtgaagtt cgccctccgt 4680 ggtcttccat ggctcgaccc attgaacctg ccctcatatg aagatcgctg tcgccttctt 4740 gggctagtca cattggaacg aagacgtttt aatgctcaag caatttttgt cgctaaaata 4800 ttgaagggtg aaatcgattc tccaccgatt cttcaggaaa tcaatttgta cgcaccagaa 4860 agaagacttc gacaacgtga ttttcttcag cttggtgcgc gtaatgccca gtacggtcaa 4920 tatgatcctg ttagatatat gtgctcttct tttaatgcag tatttgatct tttcgatttt 4980 aatgtgtctt gtgatagttt tagatttcaa ttgtgtcaga ggcgttaaat gttattcaat 5040 tgttcattaa tgtttgtttt tcatacgtga tttaagtttt gtagtagttt aattagttaa 5100 ggttttcatt aagaccaatg ttgcgtcaga tgaatgaaag ttgaataaat aaataaata 5159 // ID Sola1-4_AP repbase; DNA; INV; 3041 BP. XX AC ABLF01009849.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-4_AP. XX NM Sola1-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3041 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(846..1922,1926..2420) FT /product="Sola1-4_AP_1p" FT /translation="THHIDKATRENIFKKFWSLGDLQQQRNFIHSCMENIN FT PLYRYPKSTGSNKRKLNQAFYFSVNNINIRVCKIFFKNTLFINDRMIRTVL FT SKLKDGFVEKDKRGKHLNHKKLDSDIKDGVRAHINSIPRIESHYLRSQTTR FT EFIDGGKSLADIHRDYVNLCKENNLCFAKIDIYSRIFNREFNISFFIPKKD FT QYSHCEAYKNADEVQKVMLDENYQLHLHEKELSRTEKEKDKIFASNTSNKT FT TVLCYDLQAVLPTPRGEVSVFYYKSKLSIFNFTISNIVKSSTYCYVWHEGE FT AHRGVNVIGSCVLRYLSTECDDQNVIFYSDNCAGQNKNKFMISLYLYAIAK FT LNIISITHKFLVVGHTNEGDSAHSVIEKQIKKSLKSGPIYIASQYVTLIQI FT AKKTDTPFTVEQLSSDDIYDIKDISKNIGDNYTLNTNGEKVYWNNIRIVKV FT EKCHPGIIFYKNTYENDEFMKINVNMKKKGRRSSGIPYNIKLAYAEKPKIT FT ENKKRDLLSLCDSNHIPKVYHSFYKNL*" XX SQ Sequence 3041 BP; 1180 A; 371 C; 411 G; 1079 T; 0 other; cgcacctttg aaagacaact cgaaaaatac gtttttcgag atattaccat caaaatttgc 60 agaatttcat cctttatatt ttggtgtggt tgaaattttt gactatatat aatattaaag 120 accgtattgt cagctcaaat gtctttattt ctgcttaaaa taaaatatag tttccagctg 180 gaatatcgac acttctcaga tgtaacttta ttctggcctg gcttatgttg ttacacaata 240 tttttctagc aagaataatt tcatatctca aaagtaatag tttcacacga gaaagattat 300 tttactttaa ttaatataat aatattacca gtggctatta taagaatatc tagtagtgtt 360 atcagtatta taatcgtaat aataaccgac ttgtataata accgacttgt atcgtggtgt 420 tattattatc gtagataaat atatggatgt aaatgttatt aaatgttatt tttgttatcg 480 attctacatt ctactatcgt cttgcactct tccagtacga aatccgattt cgttagttta 540 ttttattttt ttattgttta atagtttagt taactataat aattatcgtg taatactgtt 600 ttacgtttta taagtaaaat ggatcttcta caatgatcaa gatcaaaacg tcttgttaaa 660 ttagcaacaa aaaccaaaag tcaggtatgt acattattta aaatattgta atttatattg 720 ttttcaaaaa attttaatag tatattataa aaatttcatt gaaactaatg tgtgcacaac 780 ataatatttt gttttatgtg gcttatataa ttttattatg ttttattaat tgagtacatc 840 cttaaacgca tcacatagat aaagccacaa gggaaaatat ttttaaaaag ttttggtcat 900 taggtgattt acaacaacaa agaaatttta ttcacagttg tatggaaaac ataaatccat 960 tatatagata tcctaagtct actggttcta acaaaagaaa gcttaatcag gcattttatt 1020 tttcagtaaa taatattaat ataagagttt gtaaaatatt ttttaaaaat actttattca 1080 taaatgaccg tatgataaga actgttctgt caaaattaaa agatggattt gttgaaaaag 1140 acaagcgtgg aaaacattta aatcacaaaa aactagattc agatataaaa gatggtgttc 1200 gtgcacacat aaattctata ccaaggatag agagccatta tttgcgatca caaactacaa 1260 gagaatttat agatggtggg aaatctctag ctgatattca tagagattat gtaaatttat 1320 gcaaggaaaa taatctttgt tttgcaaaaa ttgatattta ctctagaatt tttaatagag 1380 aatttaatat ttcatttttc atacctaaaa aagaccaata ttcacattgt gaagcttata 1440 aaaatgctga tgaagtgcaa aaggtcatgt tggatgaaaa ttatcagtta catttacacg 1500 aaaaagaact tagtcgaaca gagaaagaaa aagataaaat atttgcatca aatacttcaa 1560 ataaaacaac tgttttatgt tatgaccttc aagctgtatt accaactcca agaggagaag 1620 tatcagtctt ttattataaa tctaaattgt caatatttaa ttttacaatt tccaatattg 1680 taaagtcatc aacatattgt tatgtatggc atgaaggtga ggctcaccgt ggtgtaaatg 1740 taataggctc ttgtgtatta agatatctgt caacagaatg tgatgatcaa aatgttatat 1800 tttattctga taattgtgct ggccaaaaca aaaataaatt tatgataagc ttatatttat 1860 atgccattgc aaaacttaat ataatttcaa taactcataa gttcttagtt gtagggcata 1920 cataaaatga gggggatagt gcccattctg taattgaaaa acaaataaaa aaatcattaa 1980 aatcaggtcc tatatatatt gcatctcagt atgtcacact tatccagatt gccaaaaaaa 2040 ctgatacacc attcactgtt gaacaactaa gctcagacga tatttatgat attaaggata 2100 tttctaaaaa tattggagat aactatacat taaacactaa tggagaaaag gtttattgga 2160 acaatattcg aattgtaaaa gttgaaaaat gtcatcccgg tataatattt tataaaaaca 2220 catatgaaaa tgatgaattt atgaaaatta atgtaaacat gaagaaaaaa ggaagaagat 2280 catcaggtat accttacaac ataaagttgg catacgcaga gaaaccaaaa ataacagaaa 2340 acaaaaaaag agatctttta tccctctgtg atagtaacca tattcctaaa gtatatcatt 2400 cgttttataa aaatttataa gtatcttctt atattcttct catattatac ttgtattaat 2460 aaatattaca atattatgtc tgttttaata aatattatct caaaagtatt atctattatt 2520 ttttttaaat ttaagttttg aacataacca aattagtaaa cctacacttg gcaaaggtaa 2580 agtacataga tacttaagta gataggtacc taaaatgtat catgtagtta tacaaaatgc 2640 aataataaca aatatttaca tacctattaa ttattatttt gtattaagtg ttattgatct 2700 taaatgtgat aaaacaggtt ataggttttt aatgttttaa ataagattgt tattacaaat 2760 tttattttat gtgttgtaaa tcaaaaaata ttatttttcc aagcagaaat aacgacacaa 2820 gtcaaattga aatcagattt ccaataaaac gatagtgaaa tggtacataa attatcaatt 2880 taagaatata aaagtaaact atacaatatt aagaataata taagattgaa aaaagaaaaa 2940 ttcatcgaaa attctattta tagtggcata ataaaacatt aaaagtcgat ttcccaacat 3000 tttgattttt tgatttgtgt cactattctt tcaggggtgt g 3041 // ID SNAPBACK_TC repbase; DNA; INV; 1231 BP. XX AC U31526; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Tribolium castaneum mid-repetitive element SNAPBACK. XX KW SNAPBACK_TC; midrepetitive element. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RA Stuart J.J., De Gortari J.M., Hall S.P., Maxwell E.M., Mocelin G., RA Brown J.S. and Muir M.W.; RT "Useful DNA polymorphisms are identified by snapback, a RT mid-repetitive element in Tribolium castaneum."; RL Genome 39(3), 568-578 (1996). XX DR Genbank; U31526; Positions 1 1231. XX SQ Sequence 1231 BP; 380 A; 233 C; 249 G; 369 T; 0 other; aagtgataaa taaattgcta ttacaaatga aaaatttgct actttcattt tacccgtcat 60 ccgaaatggt tgtgtttgtg gattttcaag ggttcaatta cggtaattgt accgaaactt 120 taaccataaa agaattagct gtatttaata cgaacaaaag cgcaccagac atctttttgt 180 ttaaaccacc cgatgatctt tcaaccttac cgcatcgtta tcagaagcaa gccgattggt 240 tgacgaataa ctttcacggg ctttcttgga ctgcgataat tacgactacg aaaagttgtc 300 tgaaatattg aatgaaacta cgaaaaattc aaagtgtata tacgtgaaag gaatagataa 360 gaagcgtctt acttttgaaa cgtttaccaa aaagtcaaat tgtaaacatt gaagatttag 420 agtgtccatc gcttcgttac ttgagagaac attttgatac gaattgttgt gtgaatcata 480 taatcagcga ttcggtgtgc gctgttcaaa acgtacaaaa cttggctagt tggtacaatg 540 aatactttac aactcaaaac gtgtctggag atgatctctc ggaagaaaag acaccacgtt 600 cttgtttctg cgtataacac tctacctgtc ctaatcacca gaccctctat ggttattatc 660 aacagagacc ctgattactc ccgggtagtc attggatggc tgtttacatt gataaatttg 720 gtttcgcata cttctttgat agttttggaa atcagccgcc tccgaatatt ttaaaatttc 780 ttagaaaaaa tgctatgatg tggtcttaca atatttctca aattcaaaat ttttcttcaa 840 ccgtctgtgg caaatactgt actttatttc ttttaaacta cgcattaggc aattctgttg 900 atgattttct aaagttattt aacaaagatt ttcacaataa tgacaaacta tgcaacaaaa 960 tgtttaacaa atatttcatg aataaataaa gactttgcaa gatccatgtt ttttctttct 1020 ttccttaatt ctcgttactc accagcattg acgcggacgg ggccggcgcg gacgacgggg 1080 cggcggcgcg gaaacgggcg cgggacgggg tggcggcggc gcgggaacgg gcgcgggacg 1140 gggtggcggc ggcgcgggga cgggcgcgac acacacacac acacattctg ggttaaatca 1200 aaataaaatc acagaaatca tatttattag a 1231 // ID Gypsy-86_CQ-I repbase; DNA; INV; 8062 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-86_CQ_; KW Gypsy-86_CQ-LTR; Gypsy-86_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8062 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 551-551 (2011). XX DR [2] (Consensus) XX CC Positions [5336-5818] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 469..2091 FT /product="Gypsy-86_CQ-I_1p" FT /translation="MEQWYHANANDLYEEEITYELAIRQLPVEGSLSTRMR FT NLRQALREPETAKVQMVIGYSFADDYPTITYSLKELVHRLENNQQRGCWSR FT LVHYHKRVRRYVCADRTQLQKQEELLNLINDLSLKYYHQDMNQMIPHTPLF FT LVVKSPTGKLLIRTQTETPAVTSPSASVTTSANPEQEIRSTSTVAGQVISS FT LEDGAYGGASVVSSAGLVTSAGELSSQMSSLRVSNVSTTAYPMVTATVTSV FT QTGAIPKRPVLADAYEQLRVFLQKPLSTSPVLDSGDLSAMLRTPEFHRAFR FT RAIQSEMSAAESCFRFPAPTTMITAMNTSTPTVSSTTTRPTPILQVTPPTS FT AENPVGNKGPRRELDMSQYVHIKDIEGYIKACVNSIVHQGPRCPDPHEQTI FT HNLVDQITNVGVHDSEVTNISRGVGSIPHPVAGQFDSNPTLQHTVPTTVVP FT QGLMGSVESLFGGLTTAVSQAPLFGIPDTPEFNPGVSGIDRGGQQPQPQPD FT QFWRRSKFVKPTVRSKPVRSINRFAAAVFCWSTSVCPPASAASDL" FT CDS 2911..3969 FT /product="Gypsy-86_CQ-I_2p" FT /translation="MEIEGNRSSAEPISNVPTINFRYFNQNYNINTTFRRC FT PHLTVNILGETVEGLADTGAGVSIISSLELIGKIGLKIQKCNIKIKTADST FT EYTCAGYVNIPYTYESKTCVIPTIVVPEVTKDLILGVDFLEAFNFRLMVAP FT ELLEPRNETEVEEDSAAVRSVDLVFAEDFFAEEDCTVCFQIIPMEGDSLID FT PPEEDESLEMPTIEIPESHLQQPSDIETEHSLSFPQRQQLFEAVRQLPATA FT EGKLGRTSVLQHAIDLLPGTLPRRLPSYRWSPVVERVIDEEVDRMLRLGVI FT EESAGPVEFLNPLLPIKKPNGKWRICLDSRRLNSCTKRDDFPFPNMLGILR FT ESRDRVTFRL" FT CDS 4811..6193 FT /product="Gypsy-86_CQ-I_4p" FT /translation="MSLTFLRTMSIESKSPRIARWALKLSKYDITLQYKKG FT TENVPADALSRSLTAVDVCLPDPYVDGLKTQIEKCPDKYKDFKVVDGQVFK FT FISGSSLAEDSAFRWKQVVPMCERQPLIRQIHEEAHLGYWKTLCKVRERFY FT WPRLATDVKRFCFACQICKESKIPNINVRPTCGKPKLCSRPWEMISLDFLG FT PYPRSKKGNVWILVVSDFHSKFVMCQCMRNATAPAVCQFLETLIFTLFGAP FT SVCISDNAKVFQSDLFRKLLEKYGVQHWNLAVYHPSPNPSERVNRVIVTAI FT RCALNDKKNHRDWDESVHTIAMAIRTSVHDSIGFSPYFVNFGRNMISNGRE FT YDHLRHLGSEEEEDPLKRSAEMEKLFEVVRQNLAKAYQRYSQPYNLRANKR FT HQFEVGQDVYKKNVHLSDKSKDFVGKFANKFTKARVKEKLGSNTYVLEDMN FT GRRIPGTFHGSFLKNA" XX SQ Sequence 8062 BP; 2130 A; 1704 C; 2002 G; 2225 T; 1 other; agaatcgtta catttggcgc ccaacgtggg gccgtgaaaa cggattttcg tgagaatttt 60 tcactttcag tgatagtttt catcgtttgg tttaaagtta ttttgcaaat tagtttcagt 120 agtgttaatc actaaactgg ttgagatttt ggattgttgg gttagcatat ttagttggta 180 tttttagtta agttttggga ctttggtttt tggacttttt gggttagttt gggtaagatt 240 cggagattgc gagtctaaac tggtttagtg aacttccatc cacaacatca matttatttt 300 tttttctttt tatactcatt tgccaactgt ttgggttagt taaattggtc gaattagttg 360 ttttctttac tacattccaa tctttttgtt tattattaca cttaaacttt cattgcttga 420 agttaccgat tattttcttt tgattactga cttattacgc aattcaaaat ggaacaatgg 480 taccatgcca acgcgaacga tctgtacgaa gaggaaatca cgtacgagct cgcgattcgt 540 caactgccgg tggaaggttc tttgagcact cgtatgagga atctacgcca agctttgcgc 600 gaaccggaaa cggcgaaagt gcaaatggtg atcggttatt ccttcgcaga tgattatcca 660 acaattacgt acagcctcaa ggaactcgtg catcgcctgg agaacaatca gcaaagagga 720 tgctggtcca gattggtgca ctaccacaaa cgtgtccgac ggtatgtctg tgccgatcga 780 acccaattgc agaagcagga ggagctgctg aatctcatca acgacctttc gctcaagtac 840 taccatcagg acatgaacca gatgattccc catactccgc tgttcctcgt cgtgaaatcg 900 ccgaccggaa agttactgat tcgaacgcaa acagaaactc ctgccgttac gtcaccgagc 960 gctagcgtta ctacgtcagc caatccggaa caggaaattc gaagcacgtc taccgtcgcc 1020 ggtcaggtga ttagtagctt ggaagatggt gcttatggag gcgcatcagt tgtttcgtca 1080 gctggattgg ttacgtccgc cggagagttg tcttcgcaga tgtcgtccct acgcgtttcc 1140 aatgtcagta caaccgccta cccgatggtc acggcaactg ttacttccgt tcagactggt 1200 gccatcccga agcgtcccgt tttggccgat gcctacgaac agctgagagt ctttttgcaa 1260 aagcctttgt cgacatcgcc agtgttggat tccggtgatc tgtcggcgat gctgcgcacg 1320 cccgagttcc atcgtgcatt ccgcagagcg attcagtcgg agatgtccgc cgcggagagt 1380 tgcttccgat tcccggcgcc gacaacgatg attaccgcca tgaacacgtc aaccccaacc 1440 gtttcgtcta cgaccacgcg tccgactccg atcttgcagg taactccgcc cacatccgct 1500 gagaatccag ttggaaataa aggacctagg agagaattgg acatgagtca gtacgttcat 1560 atcaaggata tcgaaggata cattaaagca tgtgttaatt cgatcgtgca tcaaggacca 1620 cgatgcccag accctcatga acaaacgatt cacaacttgg tggaccagat taccaatgtg 1680 ggtgtccacg actcagaggt aacgaacata tctcgtggag ttggaagtat accgcatccc 1740 gtggccggtc agttcgattc gaatccgacg ttgcagcata cggttccgac tactgttgtc 1800 cctcaaggcc ttatggggag cgtagagagc ttgtttggag gtttaaccac tgcagtttca 1860 caagcgccgt tgtttgggat tcctgatacc cctgagttca accctggtgt cagtggtatt 1920 gatcgtggag gtcagcaacc acaaccgcaa cccgatcagt tctggcgacg ttccaagttc 1980 gtcaaaccca cagttaggtc aaaaccagtt cgctcgatca accggtttgc cgccgcagtc 2040 ttctgctggt caacatccgt atgtccgccc gcgtctgccg catcagacct gtaacatcat 2100 cgagaaatgg ccgaagtttt ccggcgacac gagtccgatg cctgttgatg actttttgcg 2160 acaaatcgca caacatagtc gatcgtatca gatctccgcc gctgaactac gagttcacgc 2220 tcatctcttg ttcaaagatg atgcttacgt ttggttctgt gcgtacgatg agcagttgga 2280 cacgtgggag aagttggtga tctacctcag gatgcgttac gacaacccga accgtgatcg 2340 tttcatcaaa gaccagatga aagcccggaa gcagcgacct aacgagaagt tcagcgcgta 2400 cttaaccgac atcgaagcgc tgtcgcaacg tttggtgaat aagatgaccg ttcaggagaa 2460 gttcagtcta attgttgaga acatgaagat gtcctaccag cgtcgtttgg ctttacacat 2520 gggatcaatc acctcaattg gccatctcgc tcttctttgc tacaagtttg actcgctgga 2580 aaccaatttg tacaatgcga aggcagctgg agtaaaccag attgagctgg aggatgagtt 2640 tggtgaagaa gctgaggaat cggacgaatc ccaagttttt gctgtccagg ttaggaaaac 2700 aacagccgcc gcacgtccgc cgaatgaagc tgttgcagta tcgagacctc gagcggagga 2760 actgtgttgg aactgccgtc agctcggaca catgtggaga gattgtgtgc agcggaagag 2820 actgttctgt cacatctgcg gtcacgagaa cacgatcgcg tccgtgtgcc cgaaccaaca 2880 caaccttcgc gcgccaaaaa acgattagag atggagatcg aagggaacag atcttctgct 2940 gaacccatct ctaacgttcc cactatcaac ttccgctatt tcaatcagaa ctacaacatc 3000 aacaccactt tccgacgatg ccctcatttg acagttaaca ttcttggcga aacagttgaa 3060 ggattagcag ataccggtgc aggcgtatct atcattagtt cgttggaact gattggaaag 3120 attggcctaa agatccagaa gtgtaacatc aaaatcaaga ctgcggacag taccgaatac 3180 acatgtgccg gttacgtaaa cattccgtat acgtacgaat cgaaaacatg tgtaattccg 3240 accatagtcg tcccggaagt tacgaaagat ctaatccttg gagtagactt tctggaagct 3300 ttcaatttcc ggctaatggt cgcgccagag ctgttggagc ccaggaatga gactgaagtt 3360 gaggaagatt ctgctgctgt acgatctgtc gatttggtgt tcgccgagga cttcttcgcg 3420 gaagaagact gcacggtttg tttccagatc atcccgatgg aaggagattc gctgatcgat 3480 ccgccggaag aagatgagag cctcgagatg ccgacgattg agattccgga aagccatcta 3540 caacaaccgt ccgatatcga aaccgaacat tcgctctcgt ttccgcaacg tcaacagtta 3600 ttcgaagcag tccgccaact tccggctacg gccgaaggaa aattaggtcg aacgagtgtt 3660 ctgcagcatg ctatcgatct gctacctgga acgctgcctc gccggttgcc cagttaccgt 3720 tggtctcccg tagtcgagag agtgatcgat gaagaagtag accggatgtt gcgtttagga 3780 gtgatcgagg aaagtgctgg tcctgtcgag tttttgaacc cgttgttgcc catcaagaag 3840 ccgaatggga aatggagaat ctgcttggat tcgcgtcgtt tgaactcgtg cacgaaacga 3900 gatgacttcc cgttcccaaa catgctcgga atactcagag aatccagaga tcgcgttact 3960 tttcggttat agatctgtcg gagtcctatt atcaggtcag cttggaggag tctgcgaagg 4020 ataagaccgc tttccggact aataaagggt tgtttcggtt tgtggtgatg cctttcggac 4080 ttacgaacgc gccagcgaca atggcccggc tcatgagtaa ggttttggga cacgatctcg 4140 agccgtttgt ttacgtgtac ctggatgaca tcatcatcac gtccgaatcg tttgagcatc 4200 attgtgagct gatcagcacc gttgcgtctc gtctcagcac cgctggactc acgatcaacg 4260 tccagaagtc gaagttttgt cagcgtcaga ttcgttattt ggggtacgtt ctttcggaaa 4320 aaggtttgtc catggatgtg acgaaaatcc agccaattct tgactacgct ccgcctcagt 4380 ccgtcaagga cattcgccgt ttattgggat tggcagggtt ctaccaaaaa ttcatacaaa 4440 actattccga aataacaaca ccgataacaa atctcctcaa gaaagaccgg aaaagttcag 4500 ttggaccccg gaagctgacg aggcgtttag gaagttgaag aatgcgttgg tgtccgcacc 4560 tgttctagcg aatcccaact tcgcgttgcc tttcgtgatc gagacagaca gctcggattt 4620 agcgattggt gctgtgttgg tccagattca ggatggagta cgtcggacga ttgcctactt 4680 ttcgaagaag ctttccagca cccagcggaa gtacagcgcc acggagagag aatgtttggc 4740 cgtcctgctg gcaatcgaga acttcaagca ctttgtcgaa ggtagtcagt tcgtcgtcca 4800 gacagatgcg atgagtctga ccttcttgcg aacaatgagc atcgagtcca agtccccgcg 4860 gatcgctcgt tgggccttga agctatccaa gtacgacatc acgctccagt acaagaaggg 4920 aaccgaaaat gtacctgctg atgccctgtc gcgaagtctc accgccgttg atgtctgttt 4980 gccggatccg tacgtagatg ggttgaagac gcagatcgag aagtgtccgg ataagtacaa 5040 ggacttcaaa gtcgttgatg gtcaagtctt caagttcatc tctggttcgt cgcttgccga 5100 agatagtgcg tttcggtgga aacaggtcgt tccgatgtgt gaacgtcagc cgttgattcg 5160 tcagatccac gaagaagctc atctcgggta ctggaagacg ttgtgtaagg ttcgagagcg 5220 cttctattgg ccgcgactcg ccaccgatgt caagagattc tgcttcgcgt gtcagatctg 5280 caaggagtca aaaataccga atattaacgt cagaccgacc tgcggaaagc ctaaattatg 5340 ttcgcgccct tgggagatga tttcgttgga ctttctcgga ccatatccgc ggtcaaagaa 5400 gggaaacgtg tggattttgg tggtgagcga cttccattct aagttcgtca tgtgccagtg 5460 catgaggaat gccactgctc ccgcagtttg ccagttcttg gaaacactga tttttacgct 5520 ttttggtgcg ccgtccgttt gtatttcaga taatgcgaag gtgttccagt cggatttgtt 5580 ccggaagctg ttggagaagt acggtgtgca acactggaac ttggccgtgt atcacccaag 5640 tccgaatccc tcggaacgcg tcaatcgagt catcgtgacg gccatccggt gtgccctgaa 5700 cgacaagaag aaccatcgtg actgggacga gtccgttcac acgattgcca tggcgatacg 5760 aacgagcgtt cacgacagca taggattttc accgtatttt gtcaatttcg gacgaaatat 5820 gatcagcaac ggaagggagt atgatcacct gcgtcacctg ggcagcgagg aggaagaaga 5880 cccgctgaag cgaagcgcag agatggagaa gctgttcgaa gtggttaggc agaacttggc 5940 gaaagcgtat caacgctact cgcagccgta caacttgcgc gcgaacaagc gtcatcagtt 6000 cgaggtggga caggatgtgt acaagaagaa tgtccacctg tcggacaagt caaaggactt 6060 tgttggcaag ttcgcgaata agttcacgaa ggctcgagtg aaggagaaat taggttcgaa 6120 cacatacgtg ttggaagaca tgaatggtcg gaggattccg ggaacattcc acggttcctt 6180 ccttaagaat gcgtaagtca gttggggaaa aatcggatta gatgacaaaa agctatgact 6240 gcacttttta ccggaagtgc acacacaaag attaagccta ataaaacaca tcattggtga 6300 tacggttcag cacgatttga gatgtccacg agtttcctca gtcgtttagg ctaagttacg 6360 caacacgagt tagttgctag gattaagcaa gctatgacta tgctgacgtg ggtcgcataa 6420 acacaattga agatattatg aaatcttcag ccagaaaaat actcttttga ggtgttccca 6480 cacacaaatc aggcacaatt tcgttgcctt tagaaagtaa agctatgaat gagtctcgat 6540 gagctcaaac tagaacgtaa acaaatcaaa acactctttt ctgaggaact ttcttgtgtg 6600 aaatgggccg tggtcgagat gcaatcaaat ttcactttag cttaccattt cctacgccca 6660 aagtaaccaa tttgttttaa tttgtagtaa agttagttgt cgttgtacct agaatgatag 6720 aaaatgttat gaatgagatg aaagtttggt tgggtttagt tagcagggaa tgtttcgctt 6780 tgtaaacaaa cattcacctt caaattttgg cctttttgca acgtttttca ctttagtttg 6840 atacttactc agtaaatctg cactttagtt tcagttttta gttaaattta gagtcaatag 6900 cccggtttaa attgtaaaat ttcagtaaac ttttggaatt cgcttcagaa agtaaatttt 6960 gggaagcaca ttttgttttg gtttgacagt ttgacatttc tctgctgctt tgagtgtttg 7020 cgttgaatgt cagttggatt ggcgttatga ttttatgttt tggtaacgct atgttgcaca 7080 gtgggagtgc agataagttt agtggtcaaa atgttggaat ggtaaagaaa agttggaaag 7140 ttagataagc ttagttggat atcaggtctg tctgagtagt ttgggttgga ttggaatgtt 7200 ttgtatttcc taaatctttt gcattagtgt aagttggagt acttgtccag acatttctgg 7260 aagttttcgg aattagttta agttgaaaat attttccaaa catttttgga gtaatttggt 7320 aaaattggtt tagttataag aaatattggt tttccaaata tttttggagt aaaattggca 7380 actttggttg aaatttagtg taaggttagt tagttggcaa atgatttagg aaaataataa 7440 acataagttg agatggttcg gaatgtgaga aggggaagta gaagatgttg gatggaatga 7500 gttcaagtaa gaggatggca acatagtttg tctatttggg aaaaataagt ttgttggact 7560 attaatcaat tggaattttt agttaagaag tatgttttaa gttaatgttg gatttttgag 7620 tttgaacttt agtttgaact taaaaatttt gtaagttttc aacttacaaa atttttaata 7680 ttagtgtggg cgaatgtaac cacatcattg ggagtccctg tatgtatgtt tgctaagata 7740 tgttgggtca gctaggacac gcaccaaatt agggaaactt tgctttttga tcgtcgccac 7800 ctatgggagg atagcagaaa cccaattttg caccgttttg ttagatgaaa atgattttta 7860 ggcaaattgt gttcggttta aaattaaagc catttgggaa gcatctcctg tgtaaatttt 7920 ttgaagctca atttgtcctt gaaaattaac aagaaagaga gcaaaatgaa aaagtctttt 7980 gtagtaaaat ttgggagaaa actaagattg tcggttcgta caagcgatga agactgactc 8040 tcgttcactt tggcagaact cg 8062 // ID I-N1_CQ repbase; DNA; INV; 1589 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A non-autonomous I non-LTR retrotransposon from Culex DE quinquefasciatus - consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW I-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1589 RA Kojima K.K. and Jurka J.; RT "Non-autonomous I non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 583-583 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. ~5-bp TSDs. XX FH Key Location/Qualifiers FT CDS 89..1471 FT /product="I-N1_CQ_1p" FT /translation="MAAAAKADSDWTVHKTKDGKPFYWNKVTQESVWKKPD FT GFQEEKQPLTDKEVEKPDPPTPRVRKYQDAPGERWQVFFRPKHKPLQTMRI FT SEELWKHYPGVTDVTKLHQNKLRATVNNPKEANAIVCDPRFCIEYRVWIPA FT RSVEIDGVVSENGLTVQQVLTAVGHFKRRNLPTIPVIEARQMGTAEGEGAS FT KRFVPSSSYRVTFAGTALPDYLVVDNMLRLPVRMYRPRVMSCSNCKKLGHT FT KAFCSNKTVCGKCGEKHPDEQCQKEVEKCLLCGGQPHEVRSCPKYKTREDK FT LKRELERRSKRSFADMLKTVTTSAGSENPFSVLSEEEDDSSEDGTEDELII FT NAKGGTSKRKRKTKKKSNSDKSRSSTLPQPGSEKFNKAFPGVSSTKTSPKD FT PTNPAAVVPQKDHGKDNRIPFSLLLEAVLSAVSGSTRTLLEPLIPFFKELG FT KILSENSALNAIISFD" XX SQ Sequence 1589 BP; 443 A; 392 C; 446 G; 308 T; 0 other; tctctcgtca gctcttgcag tgagcggtcg tctttcgagt ttttagcgcg tgtttctgac 60 ggtgttttgt ttaccttcgc cgacggccat ggcggcggcc gcgaaggcgg atagtgattg 120 gacggtccac aaaaccaagg acggcaagcc gttttactgg aacaaagtta cgcaggagag 180 tgtgtggaag aaaccggacg ggtttcagga ggagaagcag ccgttgacgg acaaggaggt 240 ggagaaaccg gatccaccaa ccccgcgcgt gcgtaagtac caggacgctc ccggggagag 300 gtggcaggtt ttctttcggc cgaagcacaa gccactgcag acaatgcgga tttcggagga 360 actgtggaaa cactaccccg gagtgacgga tgtcaccaaa ctccaccaga acaagctccg 420 cgctactgtc aacaacccga aggaagccaa cgcaatcgtc tgcgatccgc ggttctgcat 480 tgaataccgg gtctggattc cggcgcgttc cgtggagatc gacggcgtgg tgagcgaaaa 540 tggactgacg gtgcagcagg tgttgacggc ggtaggccac ttcaagagga ggaacttacc 600 aacaattccg gtcatcgagg ctcgacagat gggtacggcc gaaggtgagg gcgcgtccaa 660 acgctttgtc ccatcgagct cgtaccgagt gacgtttgcg gggacagctc taccagacta 720 cctggtggtg gacaacatgc tccgccttcc ggtgcgcatg taccgcccgc gcgttatgag 780 ttgctctaac tgcaaaaagc tgggacatac aaaggcgttc tgcagcaaca agacggtctg 840 tggcaaatgc ggagagaaac atccggacga gcagtgccag aaagaggtag agaagtgcct 900 gctctgtgga ggacaacccc acgaagttcg ttcttgtccc aaatacaaga cgcgcgagga 960 caagttgaaa cgcgaactgg agaggcggtc gaagcgctca ttcgcagata tgttgaagac 1020 ggtcacgacg agtgccggca gcgagaaccc cttttccgtc ctttcagagg aagaggatga 1080 ttcatctgag gacggaactg aagatgagct gataatcaac gcaaaaggcg gaacctccaa 1140 aaggaagaga aaaacaaaga aaaagagcaa cagtgacaag tctcggagca gcacccttcc 1200 acagccaggt tcggaaaagt tcaacaaagc gttcccgggg gtgagctcaa caaaaacatc 1260 gccaaaagat ccaaccaacc cagcagcagt cgtgcctcag aaggaccatg gtaaagacaa 1320 ccgaattccg ttttcgcttt tgttggaagc ggtgttgtcg gcagtgagcg gatctacaag 1380 aaccctgctg gaacctctga tccccttctt caaagaactc ggcaagatac tcagcgagaa 1440 ctcggcgctt aatgccatta tctcttttga ttaaactgtt aatttactat agccttaagt 1500 taaaatgtaa attacaagta aaacccggtc cgaaccagtt tacgtacagc taaggggacc 1560 taaataaaca atttcatgaa taaaaaaaa 1589 // ID MuDr-1_TV repbase; DNA; INV; 5335 BP. XX AC . XX DT 08-OCT-2008 (Rel. 13.1, Created) DT 08-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE MuDr-type DNA transposons from Trichomonas vaginalis. XX KW MuDR; DNA transposon; Transposable Element; Mutor; 2 ORF; KW MuDr-1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-5335 RA Bao W. and Jurka J.; RT "MuDr-type transposons from Trichomonas vaginalis."; RL Repbase Reports 8(10), 1198-1198 (2008). XX DR [1] (Consensus) XX CC The MuDr-1_TV consensus sequence contains 2 ORF in different CC strands. One ORF encodes a mutator-like transposase, the identity CC of the other is uncertain. TSD is 9-bp long. TIR is ~100-bp CC imperfect sequence. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 1885..779 FT /product="MuDr-1_TV_2p" FT /translation="MADDSLYQLTQLVKHNDYSSDSLNQLYQCISTMTERS FT IGCLNWDKFFKKITVTDEVLLHFLASVLNIEKISECFKDSIFYHNINLEKF FT SNLTETEQQTILNSYKADKFTYYIMCSQIVKSNWNNLSQTLVNIANQYLPF FT LDEEKQIYIFTEITGNYQNFEISVANLLLFYENNEEKLSNCFNLTLFQQIY FT SKNGIEQMIFFAANTICCNNTSINFYNLLEIEEMNPFDFFYFSIQYKEILN FT CDDFPKVTEVITGLISASNINAIMRICQKILDKITFYKCNYCHITSIWNFI FT TNAAIILPDEDLLIKLLKFIEKTEKATDELFEPENDLGFYEYLNSIFSDIT FT SARNKQFLLEFLNTHEDDFTYDQTL*" FT CDS 2220..4472 FT /product="MuDr-1_TV_1p" FT /translation="MGKSSIFNQKRANFTDENKIFIREQSDKYQKSTSRDE FT KTNIIKQVLQYFNIKYAENESKAVRRYFSYIAVKNIPAITGKYPDTIVRKD FT MLYLHDHTNIINSNNIERYYRCIDHSCQARYLVKIENSSVSEDLQKEHIPE FT CRQFKKLQNEKNDLHKSMTLTEIYNKAMAKFSEDNGTFQQIKKIMTELAVG FT TDENPLIKDSLVREVYNKSRKFSVGEKITDDEIANFCLINGKQFFIGHDFQ FT ANCETMFFGFEKAIQFAKEATDLFCDGTFKITPKHFQKGQMLTIMICEPST FT NQYLPLLFIFMKNRTFESYKKAFEYIFTVKGIQFSKLIQIHCDFEKGMIEA FT LQQIFPKVRIIGCLFHFKQALHRKLVALYTKNFNTLQNSLFKLYSITPFMS FT HEEFVLTMHIINQNKVDSIKDYIDYFNKVWLPHYNLISQYNNATAIFTNDC FT LESMHSEFSSLKHPNIYEAIKKISQIQLYKYNAIKNNQKIERHIKTVITDS FT YKNYILDCFQKELEKTIFSIKQYNVSDLSLAQNEISPYGSIESSSLFENIP FT VHDADILKSIDLTNKDTMITKFINEKREKIMALFNETLNHPDFNSSTPIQT FT NFEQADTNQGSDAPIRSENSNIADSTDLEPENQNHPSTNQMSSNLEHKLDL FT LDHIEKQLSKMRSLLGILPTTNAIVGPSKQVSVNSKQNSEKNKKKSSITTK FT SNNKKQNSSDNPPKRKAPISERILSDNNLYFQVHRKRSKSKRSTSTEKKN* FT " XX SQ Sequence 5335 BP; 1949 A; 775 C; 678 G; 1933 T; 0 other; ggtaaaaaag ggggttggcg tcctaaggcg tcctagtgta aatttctcaa aattgtgata 60 tttacatcac atttttgaga ggtttatggg cgtcctaagg gaaaattgaa tagctagggc 120 gtcccgaatt ttcattctaa gatttaaata tattatatac ttaagtatac tacaagaaat 180 ataaaaatta aatttttatt atgttaactt ataagaaatt tttttgaatt atcgttgatt 240 tattgtaatc tttccaggct atacttttga aatcaacttt acctgtcata cgcattaatg 300 tagtagatgt ccaggtgtag ctattatatc ccagcaacat tggtatttta tcattgtaag 360 atatcaaaat gaagcgtttt ttcagatata atttttcgta aatttttcaa aaataagatg 420 gtaaaaaagc ctcgacttgc tcgaaaaata acaatttttt ttacttgtaa aaataacctt 480 tttacagcat gggcagctat tcaatcacta tttctttttc taaaagtata tttttcaaaa 540 aaaattaaga tcaaattcga caatttcttg atgaaagcaa aaatattaca catacttcta 600 gatgcatggg ataaaatatg caattattca atcaattcgc atgatttgaa aaaaaaaatc 660 taaatttgat aaaaatgtag aaatttttta ttttgaactt gagattcgat ttttttgatc 720 caatttgtga attttttttt gatcggaatt tcaattaatt tgttgatcaa aaatgacctc 780 acaatgtttg atcataagtg aagtcatctt catgggtatt tagaaattcc aagagaaatt 840 gcttatttct cgcagatgta atatctgaga aaatcgaatt taaatattca tagaaaccca 900 aatcattctc cggttcgaat aattcatctg ttgctttttc tgttttttct atgaatttta 960 ataatttgat taataaatct tcatcaggaa gaattattgc tgcatttgtg ataaaattcc 1020 atattgaggt aatatggcaa taattacatt tataaaatgt aattttatcc aaaatttttt 1080 ggcaaattcg cataatagca ttaatatttg aagctgagat taaacctgtg attacttctg 1140 taacttttgg aaaatcatcg caatttaata tctctttata ttgaattgag aaataaaaaa 1200 aatcaaatgg attcatttct tcaatttcaa ggagattgta aaaattgata gaagtgttat 1260 tgcaacaaat tgtatttgca gcaaaaaata tcatttgctc gattccattt ttggaataaa 1320 tttgctggaa caatgttaaa ttaaaacagt tgcttaattt ttcttcattg ttttcataaa 1380 aaagtaacaa gtttgcaact gaaatttcaa aattttgata attgccagtt atttcagtaa 1440 atatatatat ttgcttttct tcatctaaaa atggcaaata ttgatttgca atattaacta 1500 aagtttggga taaattgttc cagtttgatt ttacaatttg tgagcacatt atgtaatatg 1560 taaatttatc cgctttgtag gaatttaaaa ttgtttgttg ttctgtttca gttaaattac 1620 tgaatttttc aagattaata ttgtgataaa aaatagaatc cttgaaacat tctgatatct 1680 tttcaatgtt taaaacggat gctaaaaaat gtaaaagaac ttcatcagtc actgtgattt 1740 ttttgaaaaa tttatcccaa ttcaagcatc caattgatct ttctgtcatt gtacttatgc 1800 attgataaag ttgatttaat gaatccgaag aataatcatt atgcttgact agttgagtta 1860 gttgatacag actgtcgtca gccattttaa aaaaatgaat aaaattgtat agtaaattat 1920 atttgtgaaa tttgtgacgc tacattcaat ctaagaccaa tatctcaaag aattcctcat 1980 ttaagtatta cttttcattg ttttttaaaa aatatctttg cttatttgtt cacttttttg 2040 acatttatta tatgcttttt atcatatttc cgatgatatg aatatatatt ctatataaga 2100 atgacgctac attcaatcta agaccaatat ctcaaagaat tcctcattta agtattactt 2160 ttcatttttg attcaatttt cgaacggttt tcgtaaattt tgcatcattt cattttttaa 2220 tgggaaaatc atctatattt aatcaaaaaa gagcaaactt tacagacgaa aataaaatat 2280 ttatcaggga acaatctgat aaataccaaa aatcaacatc gagagatgaa aaaacgaata 2340 taatcaaaca agttcttcaa tattttaata ttaaatacgc cgaaaatgaa tctaaagctg 2400 tcagaagata ttttagttat attgcagtta aaaatattcc tgcaattact ggaaaatatc 2460 ctgatacaat tgttcgcaaa gatatgcttt atttgcatga tcatacaaat attattaatt 2520 caaataatat tgaacgttac taccgttgca tcgatcattc ttgtcaagca cgttatttag 2580 ttaaaattga gaacagttca gtttccgaag atttacaaaa ggaacatata cccgagtgca 2640 gacaatttaa aaagttacaa aatgaaaaaa atgatcttca taaatcaatg actctgacag 2700 aaatatacaa taaggcaatg gcaaaatttt ctgaagacaa cggcactttt cagcagatta 2760 aaaaaattat gacagaactt gctgttggta ctgatgaaaa cccattaatt aaagattcat 2820 tggtcaggga agtttataat aagtccagaa aattttctgt tggagaaaaa attacagacg 2880 atgaaattgc taatttttgt ttaataaatg gcaagcagtt tttcattggc catgattttc 2940 aagcaaattg tgagacgatg ttttttggtt ttgagaaggc aattcaattt gcaaaagaag 3000 ccacagattt attttgcgat ggaactttta aaattacacc aaaacatttt caaaaaggac 3060 agatgctaac aattatgatt tgtgaacctt caacaaatca atatctacca cttttattta 3120 tattcatgaa aaaccgcaca tttgaatctt ataagaaggc atttgaatat atttttactg 3180 tcaagggtat tcaattttca aagctaattc aaattcattg tgattttgaa aagggaatga 3240 tagaggcact tcaacaaatt ttccctaaag tccgtataat tggatgtcta tttcatttca 3300 aacaagcttt acacaggaaa cttgtagcac tttatacaaa aaatttcaat actttacaaa 3360 attctttatt taagctatat tctattactc cattcatgtc tcatgaagaa tttgtactta 3420 caatgcatat aattaatcaa aataaagttg attcgatcaa agattatatt gattatttca 3480 acaaagtttg gttgccgcac tacaatttaa tatctcaata taataatgca actgcaatat 3540 tcactaatga ttgcctcgag agtatgcatt cagaattttc ttcgttaaaa catccaaaca 3600 tttatgaagc tattaagaag atatctcaaa tccaattata taaatataac gccatcaaaa 3660 ataaccagaa gatcgagcgc catataaaaa ccgttatcac tgatagctac aaaaattata 3720 ttcttgactg ctttcaaaaa gaacttgaga aaacaatttt ttctatcaaa caatacaacg 3780 taagtgatct ttctttggct cagaacgaga tttctccata tggttccata gaatcgagtt 3840 cattgttcga aaatattcca gttcatgatg cagatatact caaatcgatc gatctaacaa 3900 ataaagatac aatgatcact aagtttatta atgaaaaaag agaaaaaatt atggcattat 3960 ttaatgaaac attaaatcat ccagatttta attcatccac tccaattcaa actaattttg 4020 agcaagccga tacaaatcaa ggttcagatg ctccaattcg ctccgaaaat tcaaatattg 4080 cagactctac agatttagaa ccagaaaatc aaaaccatcc ttctacaaat caaatgtcta 4140 gtaatctaga acataagctg gaccttcttg atcatattga aaaacaatta tcgaaaatgc 4200 gctcgttatt gggaattctt cctactacaa atgctattgt tgggccgtca aaacaagttt 4260 cagttaatag caagcaaaat tctgaaaaaa ataagaaaaa atcttccata acaacaaaat 4320 caaacaataa aaaacaaaat tcatccgaca atcctcctaa acgtaaagct cctatttcgg 4380 aaaggattct cagcgataat aacttatatt ttcaagttca tcgaaagaga tcgaaaagca 4440 aacgttccac atctacagag aaaaagaatt gatatatgaa gtctaattcc ttttataact 4500 actatatttt tattcttatg actaaaaaat aatatattat ttaacaattt tcattttttt 4560 cttttcacaa tttcgtttta tttccattta ttgttaaatg atttttagca tacacctttc 4620 tttgttcatc ttctaaatac aatttatttt ttcttagata taattctatt cattcaattc 4680 gatcgtcatt tttcagtaca tgatcgtaat tttcaattat tgatcgcgat tttacttcag 4740 aatttgtttg caatattgct tatcaaaatt atatctctat tctctcattt ctgtttagtc 4800 aatatcatta cactcctatt gtctaataac atgctattaa ttttcttaaa attaaaaaaa 4860 aatatcaact tttaaaaaca gaaacttaac atccttaggt caatggattt tgatgtgaga 4920 tacatttaat ctgtttttct ttttttattt taaataattg aatatttata gttgttatgg 4980 gtcttacatc atgtaaaacg aattttctga aagtaaattt actttttctt caatctttct 5040 tacatattta tttaaaaaaa attatccaaa aaaagatgtt ttagaatgaa aaaaaaatag 5100 aaaaatcttc atttatagtc tttcttgaat gtatacgttt tgattttttt cttttttaaa 5160 aaattcaatt tctagaaaaa tcgagaaaaa atcattttct tctatttttt taggacgcca 5220 tacccattga aatctcctct caggacgcca taaactattc aattatggtg taaacttctc 5280 aaaaattgaa tagtttacat caggacgcct taggacgcca accctttacg cgccc 5335 // ID L2B-1_CP repbase; DNA; INV; 4834 BP. XX AC . XX DT 21-JUL-2009 (Rel. 14.07, Created) DT 21-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Culex pipiens L2B non-LTR retrotransposon. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-1_CP. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-4834 RA Kapitonov V.V. and Jurka J.; RT "L2B, a novel clade of non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1412-1412 (2009). XX DR [1] (Consensus) XX CC This family, together with L2B-1_HM and CR1-1_AG from the hydra CC and African malaria mosquito genomes, belongs to a novel L2B CC clade of non-LTR retrotransposons. This clade appears as a sister CC clade of the L2A, L2, Daphne and Crack clades. The L2B-1_CP CC consensus sequence was derived from multiple alignment of several CC copies of L2B-1_CP, which are ~98% identical to each other. XX FH Key Location/Qualifiers FT CDS 260..1600 FT /product="L2B-1_CP_1p" FT /note="ORF1." FT /translation="MSEEMDTGGRNEGETTVLAHPCGQCAAPIANGDEKVD FT CFGKCRLSMHTRCLPRATAVGIKMLSQMPNAVFVCDACLASHKFDDGNTEK FT ILQVIDAKFNNLAGVIDFVKNFDSAVKKIVREELVRANGKKTVVAEKTEES FT RKIFTRSAAKAAASANKRKAEEEVEETGASFSTPKTSFAEVVRKRVKISEE FT EVKKKQSKKPDPVVVIRPKEGVQVEDVRAEVQKKVNAKDLNVHRVTSSRSG FT AVIVSLKDEASVAVLQANVEKQLGGRFEAQLRESFKPSIKIIGMSDEMDED FT ELRDSLVEQNDVFANLKHFKLRKTFHIEKWRFNNIGVFVELDAETFFKVLD FT LGKVNCGWNRCRVFDGLQVTRCFKCNGYGHKGADCRSEKQICPICSEDHKW FT EECKATAEKCVNCEKLRVQRKVNVDANHSAWSSDCPVFQKEQEKRNRLVDF FT TT" FT CDS 1649..4405 FT /product="L2B-1_CP_2p" FT /note="ORF2." FT /translation="MKSHFDELKLIIQKTTPKIIVLTETHLTEKHDLDEFQ FT IQTYSCSFCLSRSAHTGGVAVYVDDRLKYETISNMAVGENWFLAVDVKSSA FT LNGIYGGIYHSPSSSDTEFIDSYEEWLRSVFADERRNVFVGDFNIRWNEPG FT CSRELKNVADAMGMRQIVLEPTRVGPTSSNIIDLVFTNMENAEARVVEELK FT VSDHETISIAIGNDTINLPEREESYVSWKRYSKDKLQEILRRNRSNNSRID FT ETIGEASRNFSTNLVSAVESLVDVRCCRRTETSDWYTDHLRAMKNERDNAY FT RQYRCTKSVPALERYKLLRNAYVRGLKQAKNQSVEEEIRSCHGDSKKLWRC FT LKSLIQPGGKQHAEIVFGTGCSDAETARRLNNFFVDSVEEIHAKIPPPSVV FT PVAVQEEPEESLYEFQQVTMAKLKDTVRSLKDCAGVDNVTKRVMLDALDVV FT GGELLDIVNRSLSQGEFPQHWKQTLVIPIPKVPKSTRPEDHRPINMLPLYE FT KVIETIVKEQLTAFVDRVGVIVEEQSGFRRHHSCESALNLLLVKWKQYVEE FT GKIILAVFVDLKRAFETIDRSKLKAVLHRCGIRGTALRWLSSYLSSRVQVT FT RYNSATSPATDVNLGVPQGSVLGPLLFILYMNDLKQALRQAQVNLFADDTV FT LFVVGDSLDECFDVMNAELAGFVDWLRWKKLQLNVSKTKSMIVTTRRLNDI FT SKSVMVDGEAVERVEAIKYLGVMLDEKLNFNEHINYTIRKAARKYGVMCRI FT SRYLTSEAKIHVYNSLIAPHFDYCASILFLATRTQLKRMQVLQSRVMRLIL FT KCDRLTSRQLMLECLQWMSVRQRIEYNTLVFVFRIRKGMAPKYLTDTVVHG FT SDIHQYNTRQADDLRLLQCKKTCTQNSLLYKGYNLFNQLPDEAKLTSNINE FT FKRHCKTFVLRRPLE" XX SQ Sequence 4834 BP; 1499 A; 948 C; 1284 G; 1103 T; 0 other; agagagtgtt atccaacgag agttgcaatt ttttaaatgc gctatcgtta gattgagttt 60 ttaatgtgtt ctggtgcctt tcaaaggatt aaaacttttt caaagtgaag tgtaaagtgg 120 atgcttcacg tgaacagtgc agtttaatcg tcggctggtg tccaacggtg gattttttgg 180 cccaatcaat ttaccctcaa caccaagcta agtaccgctt tcacccttgc tttttgtttt 240 gttttgctct cccggcgcga tgtccgagga gatggacacg ggagggagga atgagggaga 300 aacgactgtt ctcgcacacc cgtgcggaca atgcgcagcg ccgatcgcaa acggtgacga 360 gaaggtcgat tgttttggca agtgcagact gtctatgcac acgagatgct tgcctagggc 420 cacggcagtg ggaataaaaa tgctcagcca aatgcccaac gctgtatttg tgtgcgatgc 480 gtgtttggct tcacacaagt ttgacgatgg aaacacggaa aaaattctgc aagtaattga 540 cgcaaaattt aataatttgg cgggcgtgat tgacttcgta aaaaattttg attcggctgt 600 caaaaaaata gtgcgggaag aattagtgcg tgcaaacgga aagaaaactg tagtggcaga 660 aaaaacggaa gaatctcgga aaatatttac gcggtcggca gcaaaagcag cagcgtcagc 720 aaacaagagg aaggcggaag aggaagttga ggagactgga gcgagcttct caactccaaa 780 aacaagcttt gcggaagtgg tgcggaaacg agtgaaaatc tccgaggagg aagtaaaaaa 840 gaaacagagc aagaaaccag atcctgtcgt cgtgattagg ccgaaagaag gagtgcaggt 900 agaagacgtt cgtgcagaag ttcagaagaa agtcaacgcc aaggatctca acgttcatcg 960 ggtgaccagc agcaggagcg gtgcagttat cgtgtcgctc aaagatgagg catccgttgc 1020 tgtgctgcag gcaaacgtgg aaaagcagct aggaggacga tttgaagccc agttacgtga 1080 aagcttcaaa ccgtcgataa agatcatcgg aatgagcgac gaaatggacg aagacgagct 1140 tagagattcg ttggtggagc agaatgacgt cttcgcgaat ctgaagcatt tcaagttacg 1200 caagacattc cacattgaaa aatggcggtt caacaatatc ggagtcttcg tcgagctgga 1260 cgcagaaact ttctttaagg tactggatct ggggaaagtc aactgcggat ggaatcgttg 1320 tcgtgttttt gacgggttgc aggtgaccag gtgcttcaag tgcaatggct acggtcataa 1380 gggcgccgac tgcagatccg aaaagcagat ctgtccaatc tgcagcgaag accacaaatg 1440 ggaggagtgt aaagcaacag cggaaaagtg tgtaaactgc gaaaaactac gagtgcaacg 1500 taaggtgaac gtcgacgcaa accattccgc ttggagtagt gattgcccgg tcttccagaa 1560 ggagcaggag aaaaggaaca gattggtgga ctttacgacg tagcaaccaa aaccggacgg 1620 agatatattg tacctgaaca ttgcgggcat gaaatctcat tttgacgaac tgaagctgat 1680 tattcaaaag acaacgccaa aaataattgt actgacggag acacacttga cagaaaagca 1740 cgaccttgac gagtttcaga tacaaacgta cagctgcagt ttttgtctat caaggtcagc 1800 acacacaggt ggagttgccg tgtacgtgga tgatcgctta aaatacgaga cgatttcaaa 1860 catggctgtt ggagaaaact ggttcttggc agtggacgtg aaaagctcag cactcaacgg 1920 aatttacgga ggaatttatc attcacctag cagcagtgac acggagttta ttgacagcta 1980 cgaagaatgg ctaagaagtg tgttcgcaga tgaaagaagg aatgtattcg tcggtgactt 2040 caacattcga tggaatgaac cagggtgttc cagagagcta aagaacgttg cagatgctat 2100 gggaatgagg cagattgttt tggaacccac gcgcgtcgga ccgacaagca gcaacatcat 2160 tgatctcgtc ttcaccaaca tggagaatgc agaagcacgt gttgttgaag aactgaaagt 2220 gtctgaccat gagacaatca gtatcgcgat tgggaatgac acaatcaatc ttccggagcg 2280 agaagaatcg tatgtaagct ggaagcggta ctccaaggac aagctgcaag aaattcttcg 2340 aagaaacagg agcaataata gcaggataga tgagacaatt ggagaagcat cgcgtaactt 2400 cagcacaaat ctggtgtccg ctgttgaaag tttagtcgat gttcgctgct gtagacgcac 2460 agaaacaagt gactggtaca ctgatcatct gagggctatg aaaaatgaac gagacaacgc 2520 gtaccggcaa tataggtgta caaaatctgt tccagcgctg gagaggtaca agctgttgag 2580 gaacgcttac gtccggggtc tgaaacaggc gaaaaatcag tcggttgagg aagaaatcag 2640 aagctgtcat ggagattcta agaagctatg gagatgtttg aaatcgctga ttcagcccgg 2700 aggaaagcag catgcagaaa tcgtctttgg aaccggctgc agcgatgcag aaacagcaag 2760 gcgtctaaac aacttctttg tggacagcgt agaagagatt catgcgaaga ttccgcctcc 2820 atcagtagtt ccagtagcag tgcaggaaga gccagaagag tcgctgtacg aattccaaca 2880 agtcaccatg gccaaactca aggataccgt tcgatcgcta aaagactgtg ctggcgtcga 2940 caacgtgaca aagcgtgtga tgttggatgc tctggatgtc gtaggaggag agttgttgga 3000 tattgtgaac aggtctctga gccaaggtga attcccacaa cattggaaac aaactttggt 3060 aattccaatt ccaaaggtgc cgaaatcaac gcgtccggaa gatcacagac caatcaacat 3120 gttgccgctg tacgagaagg tcatagaaac gattgtgaaa gagcagctga cggcgtttgt 3180 ggatcgagtg ggagtaatag ttgaggagca atctggtttt cggagacatc attcttgtga 3240 gtcagctctt aacctcctac ttgtgaagtg gaagcagtac gtggaagaag gaaaaattat 3300 cttggctgta ttcgtggact tgaagcgcgc ttttgagaca attgaccgat ccaagttgaa 3360 ggcagttttg catcgttgcg gaatacgtgg tacagctttg aggtggttaa gcagctattt 3420 aagcagccga gtacaagtga caaggtacaa cagcgcaaca tcgcccgcta cggatgtgaa 3480 tcttggagta ccacagggaa gtgtactagg accgctgctg tttatcctgt acatgaacga 3540 cctaaaacaa gcgttacggc aggcacaagt gaacttgttc gccgacgaca cagttctgtt 3600 cgtagtggga gatagtttgg acgaatgttt tgacgtgatg aatgctgaac tagcaggatt 3660 tgtcgattgg ctgaggtgga agaaacttca acttaacgtc agcaaaacga aaagcatgat 3720 tgtgactaca cggcggctca atgacattag caaatcagtt atggtagatg gcgaggcagt 3780 tgaacgggtg gaagcgatta aatatcttgg agtaatgctg gacgaaaaac ttaactttaa 3840 cgaacacatt aactacacga tacggaaagc agctcgcaag tatggagtta tgtgcagaat 3900 cagccgatac ctgacctcag aagcaaagat acacgtctac aactctttga ttgctcccca 3960 ttttgactat tgtgcctcaa ttttgtttct tgcaacccgg acgcagctaa aacgaatgca 4020 agtgcttcaa agccgagtta tgcgtctgat actgaaatgt gaccgtttga cttcaagaca 4080 acttatgcta gagtgcctgc agtggatgtc cgttaggcag cgcatcgaat acaacacctt 4140 agtttttgtg tttaggatta ggaaagggat ggccccaaaa tacttgacgg atacggtggt 4200 acacggaagt gacattcatc agtacaacac aagacaagct gacgacctca gactgctgca 4260 atgcaagaag acttgcaccc agaactccct actttataaa ggatacaacc tgtttaatca 4320 actccccgac gaagctaagc tcaccagcaa catcaacgag ttcaagaggc actgcaagac 4380 tttcgtttta cgaagaccac tggagtaggt acccacgatt gtactgtgag gaagagcatg 4440 ttatgacggc cggccatctt cattatcggt acaaatcgca tgggatttac cttgggccgc 4500 atatgaaaaa gtttgatcaa aagtaacgcg aatctgggcg cggttttaac cctatgtgct 4560 catatgtgtg tgagatagca aatgatctca aatccctaaa taggagaaaa ggaagcagca 4620 gcgagtgaca taaaagactc aacaaaggat ttgtatgagc gccttgaggt gatgtaatga 4680 gagaaacgga tgggcataca cggaagtcga gtaagattta aggatactct caagcactgg 4740 aatagaatta tcgaaagata tctgctcgta aaccttccat actacaaata atgtgtatgg 4800 gcaagaggtg ggccatccaa ggaaaaaaaa aaaa 4834 // ID Copia-3_DWil-LTR repbase; DNA; INV; 240 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_DWil_; KW Copia-3_DWil-I; Copia-3_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-240 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 1909935 1909696. XX SQ Sequence 240 BP; 57 A; 58 C; 43 G; 82 T; 0 other; tgttaaaaaa ggtaacgttc aagaactttc cagagcaatg cacacatgta ttctaattgc 60 gtatgtattt gaacgcatac atctatctat ctcttttttg tatttgaata ctgcgctcgc 120 ccgactgaat taccggcacg cgcagttagt ctctaataaa cttctgagtt gtacggacgt 180 gttactattc ttttattctg cctgcgttat tcgggttgtg cgtcccctcc cccctcaaca 240 // ID BEL-110_AA-LTR repbase; DNA; INV; 754 BP. XX AC AAGE02021062; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-110_AA_; KW BEL-110_AA-I; BEL-110_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-754 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021062; Positions 54288 53535. XX SQ Sequence 754 BP; 210 A; 188 C; 195 G; 161 T; 0 other; tgttcagttc gacaatagtc gacccgaaac tgaagcagcg gcgtctagca gtggcagcgg 60 tatgcacaac gagagctccg cccatccaac cgatgacaac gacaacgatg acgatccgac 120 accgaatcat ggtgctataa aagcaagggc cgaactgcgc aacaaggtca gtttttgttc 180 acccgtcagc gtgacaacaa tcgagcgtga agtcccgagc cgtcatccag ccgaagccga 240 gttggcgaac cgtgatcgtc gcgagtaaat cgaactcgac tgccaacgcg agtgattcgg 300 ttacaaaacc gaatgcgtgt gtgtcgcaac gattagttgc acgcgagccg ccttacagtg 360 caaatcggaa agtagccgtc tgccggctga agaataagtt gaagaagaaa gtagtttgtc 420 aagagagaag aatgaataca gtagtgtaaa ttaagtcttc tttcgtgttt gatttccgac 480 caccccacat ccagtgccca tccaccaaat agttttgagt tgatttggaa aaaatattca 540 ctcgagtcgc taagaaaaaa aattcacata tggagttact ggttgctgct agctagaggg 600 ttagtccagc tgaaccggat cgcggcttca tgaggcgaag cccagccagc gtcggaagga 660 acatagagtc cttcacccat cccgctcggt tgtttgaggg aacgcgagcc acttagttct 720 gccagtgcag tgcagtccat tgccgtccgg taca 754 // ID hAT-32_SM repbase; DNA; INV; 2278 BP. XX AC . XX DT 02-MAR-2008 (Rel. 13.03, Created) DT 31-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-32_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2278 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(3), 235-235 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(481..813,817..1734) FT /product="hAT-32_SM_1p" FT /translation="MPRVKQSVSNRLKQYVSEFPDFKTDGKILFCKVCNKS FT VXAEKIFTIKQHLASAKHIELTERNVAKKSTQQFLSGYSRVSKDAQFAEDL FT CGAFIAADIPLYKMRNKKIQSFLKYTEHKVPSESTLRTNHVNSIYKENIEK FT IKSCISNRFLWLSIDETTDVANRYVANVIIGILDPDEVVSRQNFLLNTAQL FT DKANHXXIARLFDDSIRILGENFNKDSILLFISDAAPYMVKAARAIQIFYP FT KITHLTCLVHGLHRVCEQIRGLYPNVDRLIANVKKVFLKAPSRVVFKDLEP FT GLSLHPQPITTRWGTWLTAVNYYANNFEKIVRIFDXLDNEEAASIKISQDX FT LRDSTIKADLIFIVSNYGFLGASITKLETSGLELSAQIKVVTDAMGAINVV FT YGNTADIIKKTCYCNRKKTVVLL" XX SQ Sequence 2278 BP; 781 A; 331 C; 375 G; 785 T; 6 other; ttagggcaac caaaaatatg cattagcatg tttttttctg aatcatattt agaaaactag 60 cacgatatca atccatttta agaaaaacta tgccctttaa cccttatttt tggggcatat 120 tttttcatat ttcaccaaaa gtgcatattt ttgcataaaa taatttttaa aggcatattt 180 tagtaaaaaa agttgttttt aatagtattt tttcgaaata atagacacta attattgcca 240 gttttttata cagtatatta aataatgtgt ttttatagtg tacattttag ctggggttaa 300 gaaaaattat tacattttag ctgggtttta taaaaaatgt attgacttta gtttgaaggg 360 gatccccggt tattaataaa ttttgtttaa aaataaaact tcttagttat tgcaaaagtt 420 acttaatata caacattaaa ttataagttg ctatttaaat aaatttaatt tttgtaagat 480 atgccgaggg taaagcaatc cgtatcaaac agattgaagc agtatgtaag cgaatttcca 540 gatttcaaaa ctgatggaaa gattcttttt tgtaaagtgt gcaataagtc ggtgtyagca 600 gaaaaaatat ttacgataaa acagcacctc gctagtgcaa aacatatcga gcttacagaa 660 agaaatgtag ctaagaaatc gacacaacag tttttaagtg gatatagcag agtctcaaaa 720 gatgctcaat ttgctgaaga tttgtgcggt gcttttattg ccgcagatat accattgtat 780 aaaatgcgta ataaaaaaat tcaatctttt ctgtaaaagt atacggagca caaagttcca 840 tcggaaagca ctcttcgcac gaatcacgta aactccatct acaaagaaaa catagagaag 900 attaaaagtt gcatcagtaa tcgttttttg tggttgtcaa tcgatgagac aaccgatgtg 960 gctaacagat atgtggcaaa tgttatcatt ggaattttgg atcccgatga agtagtttcc 1020 agacaaaatt ttttattaaa tactgcacag ttggataaag caaatcacay cwgtattgct 1080 cgtctttttg acgactcaat taggatctta ggtgaaaact tcaataaaga ttcgattctg 1140 ctttttatat cggatgctgc gccgtacatg gtgaaagcgg cccgggcaat acaaatattt 1200 tacccaaaaa taactcattt gacttgcctt gttcatggac tacaccgagt atgtgagcaa 1260 atacgaggcc tgtatcctaa cgttgatagg ttaattgcaa atgtaaaaaa agtattttta 1320 aaggctccgt ctagagtggt ttttaaggat ttagaacctg gactttcact ccatccacaa 1380 ccaataacca cacgatgggg cacttggctt acagccgtaa attactatgc aaacaacttt 1440 gagaagattg tgagaatttt tgatgyatta gacaatgagg aagcagcctc aataaaaatt 1500 tcacaagatc wtctacgtga cagcacaata aaagctgatt taatttttat tgtatcaaat 1560 tatggatttc ttggagcatc aataacaaaa ctggaaacat ctggactaga actttctgca 1620 cagatcaaag tcgtaacaga cgctatggga gcaattaacg tcgtatacgg taatacagcc 1680 gacattatta aaaaaacttg ctactgtaat cgtaaaaaaa ctgtggtttt gctttaatga 1740 ggaatatttc agcaattttg gctggtgtat ccgtcacagc aacagaaaag tatacctgtr 1800 gtgaaacttt agcttttaaa ttcgcgccaa ttacttcagt tgatgtcgaa cgaagtttct 1860 cgatgtataa gagcgtttta cgatcaaaca gacaaagctt cctgtttgaa aatttaagcg 1920 aaatgtttgt catatattgc aataataatt taaattaatt ttgttcttaa ttaataatta 1980 tttatttttc aatcttgtca ccttgaactt ttatgacatt ttagttctct tttcttttaa 2040 atttaatttt tgaataaaat acaaataaat atgtatatat ttaatgaata ttcattttta 2100 aatctatttg gtggtttatt tttatttgtt aggttttcag atttttgaag caaaataaaa 2160 ttttgtaaaa catattttca gttttataga gcatattttt aatgcttaaa attgttttta 2220 taggcatatt ttaagcgctt aaaaccactt tttaagagca tatttttggt tgccctaa 2278 // ID Gypsy-165_AA-LTR repbase; DNA; INV; 153 BP. XX AC AAGE02017896; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-165_AA_; KW Gypsy-165_AA-I; Gypsy-165_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-153 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017896; Positions 41757 41605. XX SQ Sequence 153 BP; 54 A; 26 C; 22 G; 51 T; 0 other; tgtagaatgt tatgttataa atatgtatat aataattaga tactaagcta ccccaatcgt 60 agtagtctat gacttaggat actctctgac atttgactga ctagacatca ttcccgaata 120 aaccttgaaa ctgataagaa gtcttttact aca 153 // ID Copia-1_Cfl-LTR repbase; DNA; INV; 231 BP. XX AC AEAB01029077; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_Cfl_; KW Copia-1_Cfl-I; Copia-1_Cfl-LTR. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-231 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01029077; Positions 8030 7800. XX SQ Sequence 231 BP; 68 A; 35 C; 52 G; 76 T; 0 other; tgttgaagta tgtacattta tattcggtga cgccatctat gagtatgtag ggagcgcggt 60 tggtgtcctg tctttgatgt gtgtattctt atgttcttat aataataaaa agacatgtgg 120 tgcaacgcag tggtgcgagc acgttcttaa atcaaagata ttagaaagag ctattataaa 180 gccccagatt gtattatatc agtccttcta aagatcccgg taaatatgac a 231 // ID Gypsy-15_AA-I repbase; DNA; INV; 4370 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_AA_; KW Gypsy-15_AA-LTR; Gypsy-15_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4370 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 999-999 (2011). XX DR [2] (Consensus) XX CC Positions [3300-3761] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2334..4289 FT /product="Gypsy-15_AA-I_2p" FT /translation="MFHDESFEWKPDRIAAYEDIKSALISPQVLMPYDPNL FT PLVLATDASKFGLGAVLSHRLANGRERPIAYASCSMSATEQRYPQIDKEAL FT AIVWAVKKFFNYLYARKFTLITDHKPLTQILHPTKSLPTLCISRMANYADY FT LAHFNFDVLYKSTKENVNADYCSRITRFPTSSDVNTLSVYQGRKNAEDEFE FT LFTLNQLAQLPIRAEHIARETRKDPSLGKIVQLLENGSELARFGYKAPEAK FT YTLSASCLLFEHRVVVPAILRQSVLNDLHVSHIGMVKMKGIARSFVYWPGI FT DTDIEKTVKACSDCARQAHAPPKFSDHHWQYPKCPWERVHIDYAGPVAGSM FT LLIIVDAYSKWLEVKVTNTSTTAATIGILDELFSRHGVPVTVVSDNGPQFT FT AVEFKNFLQKSGVKFHKLSAPYHPATNGQAERYVQTTKDAMKAMGTTASTL FT HSDLNIFLQLYRRAPHATTGESPSKLFIGRNIRTRLDLLKPEDIHRKITAK FT QGANFRPSFRELDPGQQVYFLSGNCRMDKWIPGVIAVRLGDLHYEVEYAGK FT RFKRHIDQIRSRIGRQGSQVTEATISEAPTELDASRRIHFYGNGGTLSSAP FT TTPSNARNAPADQGSPEFHTPSASPMNDAIQRDCSPFILRRSTRVRRPPCK FT YSP" XX SQ Sequence 4370 BP; 1209 A; 1055 C; 997 G; 1105 T; 4 other; ttttggtgtc agaagtggga tttagaggca agttttccgg tcgagagtaa cgcgttcgtt 60 tcggtctcgc gaaaatcaag atggctaacg aagagctaat ggcatctctt acccagatgt 120 tagccaatgc cctcaaatcc tccattgaag cagtggctgg cgcacaagtg gaccccgctg 180 ctagagctgc agcgtcacag ccaaagccgc ccactttttc agtttccgaa taccgctcct 240 cggacaacgc caccgtatcg gactattttc ggcgttttga gtgggcgtta cagttgagca 300 atatccctgc tgctcaatac ggacattacg cccgagtaca catgggtacg gagctgaata 360 atgccatcca gtttctagtg agtccaacgg atccagaaac tctcacattc aatgaaatac 420 gtacgacgtt agttgaacac tttgatcaag caaaggataa gtacgtagag agcattaaat 480 tccgggcaat tgttcagcaa aakagcgagt ctgtctctag ctttgtcctc agactcagac 540 aaggagcagc acactgtgag tacggagact ttctggacag aatgctaatc gaacaactcc 600 ttcatggcct cgagtctcgt gaaatgtgtg acgaaataat cgctaaaaag ccagcgactt 660 tcaaagctgc tttcgaagtt gcacataccc tggaagcaac ccgcaacact gccaaagaag 720 ttcgaacggc tagaccgtcg tcgatcatag aggccaccaa caagctgggt tacgaaaaac 780 cgaagacccg taaagtgaac tcttcctcaa aacagaacaa acagtttagt gaaaaacaat 840 ctgactcaag tgtgtgtagt ggatgtggtg gtaaccatac ccgaaatatg tgcaagtttc 900 gtgatgcaaa gtgctacaat tgtgagcgta agggccacat ctccaaagtg tgtagatcct 960 ataagcggaa acttcaggat caatccacat cgcaggtgca atcagaagtg ttaccggctt 1020 cacaagtaga tgtagtccag tcgctgggaa aaatccactc cgtcaacacg tccccgaagc 1080 tagttattaa tgttaatatc gacggacata ccttggacat ggaagttgac accggtgcac 1140 cgtgcggtat catcagtgaa gccaaactgc gtattatcaa accccatttc actctacaga 1200 aatctgatcg gcagttcacg agctacactg gtcatcgtat aaagtgtttg ggccgacttc 1260 cagtgagcgt atgtatcggt actacaacac gtaaactcga tttgtatgtg gtagccgaag 1320 attacgatac gctattcggt agagagtgga tctcacagtt tacaaccgaa gtgaatttga 1380 gtaaactttt tggtgttagt ggtacggtaa actcgctggc gattgctcat tccacgatct 1440 cgcmggatca acagcaagct ctgtcggagt tgctggccaa ctatgatagt gtttttggcg 1500 atatcgcggg gatactaaaa ggtcctccag catcggtaca ccttaagcct ggtgcaacac 1560 ctgtgtttgc aaaagctcgt gacgttccgt tggcattgaa gagccagtac gcacaagaaa 1620 tcgacaagaa actttgtgct ggcgtgtatg agcgagtcga ctactctgaa tgggcatcac 1680 ccacccatat tgtggtcaag aaaaatggaa aactacgcat caccggaaat tacaagccca 1740 cggtaaatcc ccggatgatt atcgatgaac acccgatccc aaagatcgaa tccatcttca 1800 accaaatgcg aggagccagt ctgttctgcc acttggacgt gacggatgca tacacccacc 1860 tcccaatcga cgaagaattc cgacacgttc tcacgctcaa tacaccaaca catgggttgg 1920 ttcgtccaac cagggctgtg tatggtgcgg ccaatattcc tgcgatatgg caacgtcgaa 1980 tggaaacagt gcttcaaggt ttgcccagtg taattaattt ctatgacgat attattgttt 2040 ttgcggatag tttcgaaaac ctgttgatcg ctctcagagc tgttttcgac aaactcaaag 2100 aacacggctt acgactgaac cgttcaaaat gtgtttttgg taccccggct ctcgaatgtc 2160 tgggacacaa gatcgacgcg macggtttgc acaagtcgga ccaccatatt gaagctgtac 2220 gagacgcacc acgtccatcg aacacagatg aactccagct gtttttgggt aaagccacct 2280 actacaacgc tttcattcca aatctctctt cacgctctcg ttgtctacgt gacatgttcc 2340 acgacgaatc gttcgaatgg aagcccgata ggattgccgc ctacgaagat ataaaatctg 2400 cgctcatttc cccgcaagtt ctcatgccat acgatccgaa tcttccgttg gtgttagcca 2460 cagatgcgag caaatttggg ttgggagctg tactctctca tcgtctagcg aatggtcggg 2520 agcgtcctat agcgtatgcc agctgctcaa tgtctgctac tgagcaacgt tacccgcaga 2580 tcgacaaaga ggctcttgcc atagtttggg cagtcaagaa gtttttcaac tatctttacg 2640 ctcgaaaatt cacgcttatc actgaccata agcctttaac ccaaattttg catcctacaa 2700 aatcgctacc aactctgtgt atcagcagaa tggcaaatta tgctgattat ctggcacact 2760 tcaattttga tgttctgtac aagtccacaa aagaaaacgt taatgcagat tactgctctc 2820 gaatcactcg ctttccaacg agttctgatg tgaatacact ttcagtttac cagggaagaa 2880 agaacgcaga agatgaattc gaactgttta cgttaaacca gctagcgcaa ctaccaattc 2940 gtgctgaaca catcgctcgc gagacgcgga aagatcctag ccttggtaag atcgttcaac 3000 tattagaaaa cggctccgaa ttggcacgat tcggatacaa agcaccagaa gcaaaataca 3060 ctctcagtgc gagctgtctg ttgttcgagc atcgtgtagt ggttcctgct attcttcgtc 3120 agtctgtgct gaatgatctc catgtgtcgc acatcggtat ggtgaaaatg aaaggaattg 3180 ctcgttcatt tgtatactgg ccgggaatcg atacagatat cgagaagact gtgaaagcct 3240 gttccgattg tgctcgccaa gctcatgctc caccgaaatt tagtgaccat cactggcaat 3300 atcccaaatg cccctgggaa agagtacaca ttgattatgc cggtccagta gcaggctcta 3360 tgttgctcat catagtcgat gcgtacagta agtggttgga agtgaaggtc acgaatacgt 3420 caacgactgc agcgacgatc ggcattttgg atgagttgtt ttcacgacat ggcgtacccg 3480 tcacagttgt ttcggacaat gggccgcagt ttaccgctgt ggaatttaaa aacttcctgc 3540 agaaaagtgg agtgaagttt cacaagctgt ctgctccata ccacccagca acaaatggac 3600 aggccgaacg atatgtgcag acaacaaagg acgctatgaa ggctatggga acaactgcct 3660 cgactctgca ttccgacctg aacatttttc tccagctata ccgccgtgct cctcatgcaa 3720 ccacaggtga atcaccatcc aaattgttca ttgggcggaa tatcagaacc cgtctagatc 3780 tgctgaagcc tgaggacatc catcgaaaga tcactgcgaa gcaaggggca aatttcaggc 3840 catcattccg tgaacttgat ccgggtcagc aagtatattt tctttccggc aattgtcgta 3900 tggataaatg gattcctggt gtgatcgctg ttcgactggg agatctccac tatgaagtcg 3960 agtatgctgg gaaacggttc aagcgtcaca tcgaccaaat tagatccaga attggcaggc 4020 aaggttctca agtgactgaa gcaactatca gtgaagctcc tactgaactg gacgcatcga 4080 ggcgcattca tttctatggc aatggtggca cgctttccag tgctccgaca actccaagca 4140 atgctagaaa tgctcctgct gaccaaggtt ctccagagtt tcacactcct tctgcaagcc 4200 cgatgaacga tgctattcag cgagattgtt ccccattcat cttacgtcgt tctacaagag 4260 ttcgtcgtcc tccatgcaag tattcgccgt agacgattca tttcgaaagg agggaagaaa 4320 tgttatgtat gtaatctttt tatataamtt acttttgctt taaagtagtc 4370 // ID Gypsy-50_CQ-LTR repbase; DNA; INV; 416 BP. XX AC AAWU01002291; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-50_CQ_; KW Gypsy-50_CQ-I; Gypsy-50_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-416 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 480-480 (2011). XX DR Genome; AAWU01002291; Positions 29687 30102. XX SQ Sequence 416 BP; 128 A; 80 C; 99 G; 109 T; 0 other; tgttgtgcac acggattgca caactttgca ttcttagcgt agctccaatg aatggtttgc 60 gttaatcaca acgcacgcat tgcagctgac gctaagggaa cggacgataa atcaagtctc 120 gttttcgtaa cattcacgca ttgatgcgtg tgtcaagaga ataggtgacc cattttctag 180 aacattgatg cgtggatcag aaaaaaatag actaagagga ttacccaaaa gcgagggtga 240 cgaagttaac gattccgttg gtccgtggtg caacctgttg agatgcatgg catgagtaaa 300 aatagattaa gtgattccta ataaatatgc cctgctccga ggaatcggag tcagttatgt 360 tcgagaagta tagtgagtca gttcctttat atcacaagtc ccaagcagat acaaca 416 // ID Gypsy4-NVi_I repbase; DNA; INV; 4196 BP. XX AC AAZX01001018; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4-NVi; KW Gypsy4-NVi_I; Gypsy4-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4196 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1122-1122 (2007). XX DR Genome; AAZX01001018; Positions 30547 26352. XX CC Positions [3215-3670] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 120..2522 FT /product="Gypsy4-NVi_I_2p" FT /translation="MGETTTTTSAGTTTTTTTIANTTAALRIDNYVHGEMK FT WDRWIRKLEIGFMYHAIDESRKATCLLFHFGDKAFDSLCDFLNGVKPITLQ FT YPELKEKLKDLFSPKVIEISENFKLSNRKQLPGEDIKTFTNALNNLSLQCN FT FGTYKDKALLNQFVIGVRDTRLRRKFLETSDLTYEKAIQTVGALELTEQEN FT STLSTATNTSVNSLHVDKKVVPRSQKNQKAKSKQARGKNNKNYKGQDKTQN FT TTKYIQCYRCAGEHTAKHCTVNPNNLSCGQYGRRGHVTSVCLKGNNTTNFI FT EKILLVEHFNARDKYTTTLPVNNTEVTFEIDSGTAVSLMSEEDARRLFKGT FT TLFRTQTKLVSYCNMQINVTGFIKVEVKFANQIFNLHLYLTTNKRPPLLGR FT EWMKAMLKCGNANQLFAEVSQITESNTSASDRKKIIENLLKQYSNCTKTDM FT GKIKGITATLKLKENAQPVFLKARPAAFRLITLLDKEIGRLVKEDILIKVN FT SSKFATPVVPVLKRDGTIRMCGDYSVTLNPNLIIDEHPLPTPDDMFTSVKD FT SKHFAKIDLHQAYMQMEVDEASSALLTINTHRGLYRPTRLMYGVASACAIW FT QREIENIFRDLEGMFVLLDDIRIAGRTQTEFLHRLEAVLRRLHDYNIKINQ FT NKCEFFQNEIEFCGYVINEHVIHKSKAKMEAIEKMPRPSNVSEIRSFIGFV FT NYYSRFIDNVSEILRPLNELLRKDTKFVWARECEAAFCKAKEAFVSDKCLT FT FFNPKLGLILATDPSPTGVGAVLSHKFPDGTEKPIMYISQTLNKTQSKYA" XX SQ Sequence 4196 BP; 1527 A; 859 C; 824 G; 986 T; 0 other; aaattggcga cgaggacggt aggaattgag aaatacatac aatcacaaga aaattactct 60 acgaagcagc tccacgaggc tacctacata caaacgtaca acacgcaata cattgtacca 120 tgggagaaac tacaactaca acatctgcag gtacaactac tacaactact acaatagcta 180 acactacagc ggccttacga atagataatt acgtacacgg agaaatgaag tgggatagat 240 ggatacgaaa actagaaatt ggatttatgt accacgccat agatgagtca agaaaagcga 300 catgtttgct atttcatttt ggagacaagg cttttgatag cctgtgcgac tttctgaacg 360 gcgtaaaacc aataacactt caatatccgg aacttaaaga aaaattgaaa gatttatttt 420 cgccaaaagt gatcgaaatc tctgaaaatt tcaagttaag taatagaaaa caattaccgg 480 gtgaggatat taagacgttt acaaacgctc taaataatct cagtttacag tgtaattttg 540 ggacttacaa ggataaggcg ctactaaacc agtttgtcat cggtgtccga gatacaaggt 600 tacggcgtaa atttttggaa acatccgatt taacctatga aaaagcaata cagacagtag 660 gagcattaga gttaactgaa caagaaaact ctacgttatc tactgctaca aatacatcag 720 tgaattcttt acacgtcgac aaaaaggtag tcccgcgttc ccaaaagaac cagaaggcga 780 aatcaaaaca agctcgtggc aaaaacaata aaaattacaa agggcaagac aaaacacaaa 840 atacgacaaa atacatacag tgctacagat gtgcaggtga acatacggcg aaacattgca 900 ccgtaaaccc gaataatcta tcgtgtggac agtacggacg aaggggccac gtgaccagcg 960 tttgtctcaa gggcaacaac acaacgaatt tcatcgagaa gattttgctg gtggaacatt 1020 tcaacgcaag agacaagtac acgacaacgc tgcctgtcaa caacaccgaa gtcaccttcg 1080 agatcgacag cggaactgcg gtttcactga tgtcggaaga agacgcaaga agattattca 1140 aaggtactac attattccgc acgcaaacaa aattagtatc gtattgtaat atgcaaatta 1200 acgtaaccgg ttttatcaaa gtagaggtaa agttcgcgaa ccaaattttt aacttacatt 1260 tatatttaac tacaaacaaa cgcccaccgt tgttgggacg cgaatggatg aaggctatgt 1320 tgaaatgcgg taacgcgaat caattatttg cagaagtttc acaaattacg gagtcaaata 1380 catcggcgtc tgaccgaaag aaaataatcg agaatttgct caaacaatac agtaattgta 1440 caaaaaccga tatgggaaaa attaaaggca ttacagccac tttgaaatta aaagaaaacg 1500 cgcaaccagt atttctcaaa gcccgtccag cagcgtttag gttaattaca ctcctcgata 1560 aggaaatcgg acgattagtg aaggaagaca tacttataaa agtaaattcg tcaaaattcg 1620 caacaccagt agtaccagtg ctaaaacggg atggcacaat tcgtatgtgc ggtgactaca 1680 gtgtcacgct aaatccgaat ctaatcatag acgaacaccc actccctact ccagacgata 1740 tgtttacatc ggtaaaagac agtaaacact tcgctaagat cgaccttcac caagcataca 1800 tgcaaatgga agtcgatgag gccagttcgg cattgcttac aataaataca caccggggcc 1860 tatacagacc aacgcgactt atgtacgggg tcgcttcagc atgcgcgatt tggcaacgcg 1920 aaatagaaaa tatcttccgg gacctagagg ggatgttcgt gcttcttgat gacatacgca 1980 tcgcaggcag aacccaaacg gaatttttac ataggctaga agcggtacta cgtagattac 2040 acgattacaa tatcaaaata aatcagaata aatgcgagtt tttccaaaac gaaatcgaat 2100 tttgcggtta cgtgataaac gaacatgtca tccacaaaag taaagctaaa atggaagcta 2160 tcgaaaaaat gccacgacca agcaacgtaa gcgaaattcg ctcttttatc ggatttgtta 2220 attactatag ccgtttcata gacaacgtaa gtgagatatt acgcccgtta aacgaattat 2280 tacgtaaaga tactaaattc gtctgggcac gagagtgcga ggcagcattt tgtaaagcca 2340 aagaagcgtt cgttagtgat aaatgcctta catttttcaa cccaaaatta ggattaatac 2400 tcgcgacgga tccgagccct acaggagtcg gtgcggtatt gtcacataag tttcccgacg 2460 gaacggagaa accaatcatg tacatttcac aaacgctaaa taaaactcaa tcgaaatacg 2520 cctagattga taaagaagcg tatgcgatcg tctttgcggt aaagaaattg caccgatacc 2580 tctacggcgg aaaatttaca ctcattacgg accatcgtcc acttacacaa atattttcac 2640 aaagaaataa cttacctata tacaatgcat tacgcatgca acattatgcg atttttctta 2700 gagcctataa ttacgacata gtttataaac gatccgaaaa taattgtaac gcagatgggt 2760 tatccagatt accgagtccg aaggcaacag aacatattga cgtaattgat gtacactaca 2820 taaatacaat acaagcgatg ccggttacaa taacacaaat acgcgaggca atgcaaaaag 2880 acgcaaacat tatgaaaatt gtaaaagcgt tacaaacggg taaacagctg tctgctaaag 2940 gtacctggaa tgtaaaccca ctagaattta gtttagaaca agatttgctt gtacgcaatc 3000 aaaaagttgt aatactacgc ctgtacaact acgcgacgca gtattacagg aactacatac 3060 agggcacttc ggcgtggtta tgatgaaatc gctcgcaagg ggacactgct ggtggccagg 3120 aattgacaca aatattgaag acatggtaca taactgtaca gattgccaga ctcataaaca 3180 ccatgcccca gctgtggaaa aacacatatg ggatccaccg tcaaaaccct tcgagcgagt 3240 tcattgcgat ttcgcgggac cattcttaaa taaatacttt ttgatcttgg tcgatgccta 3300 cacgaagtgg cctgaagtac atataacacg agggcttacg agtgaggaga caattctagt 3360 ttgtaaaaag atattcgcca ctttcggcat cccaaacatt ttcgtttctg ataacggtag 3420 aaatttcacc tcacgattat tctctgaatt tatgaaattt tacgggatta cacataagct 3480 gactgcacca tacaatccgt caactaacgg acaagcagaa cgctacgttc aaactattaa 3540 agacgcgcta aagaaaatcg cgagttcaaa taaccttgaa aacaaacttc aagacatcct 3600 cctacaatac agaataactc ctcactcagc caccggcata tcgccgtcag aactaatgtt 3660 tagtagaaac attaaatcta agttagatct catgaatcct agtctaagta gtaagaatga 3720 ttctccgtat aatgtaaata agcaaattcg cgatttcaaa attaaccaaa gagtgagcgc 3780 gagaaattac gtaggagaca aaaaatggat cttcggcaga atttccaagc gtataggaaa 3840 actacattat aatgtaactc ttgacgacgg gagagtctga aaaagacatg cgaatcaatt 3900 aaataagatt ggaaataaca ttcctccctc tacactacgt tatgacaacg aagtcataga 3960 aactccgccg aacgaacaaa taaaagttta tccgtcaatt tcgaaacaaa tacatgtaaa 4020 ccgggactaa tcattgaata gtttaagcag aagctccgat tatgcgagtt gtagcaccga 4080 tacatccatc gaggcacaac cgaaaagtag aggtagaggt agaggaaggg gttgcgtacg 4140 acaaaaatca caaaaggaag aacaaattcc gggtggacct cgccgagaaa caggaa 4196 // ID Rehavkus-2_NVi repbase; DNA; INV; 9132 BP. XX AC AAZX01004877.1; XX DT 14-MAY-2009 (Rel. 14.06, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 3) XX DE Rehavkus DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus group; FB; FB4; NOF; Rehavkus-2_NVi. XX NM Rehavkus-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-9132 RA Bao W. and Jurka J.; RT "Rehavkus DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(6), 1159-1159 (2009). XX DR [1] (Consensus) XX CC TIR is 400-bp long, TSD is 9-bp. XX FH Key Location/Qualifiers FT CDS 4761..8564 FT /product="Rehavkus-2_NVi_1p" FT /translation="MVGQKPAVPAKEIIEALINFDIYQSKKKLKRKKDPIW FT QEICDILNNNRERKIKPLNLYMHVDQDRHNIQSSYRKIKDLDIYESDDCET FT IIEETNEDEISESENTKEDTVCYRPTKRKNKNIIFNFTITPEQWSQIKPIT FT ITYKDGREGMSLQEPYREMFGKEMYYKEKLPCAYNYKRNVIDDIESYVKVT FT GYCNTCNAQLMINCFDKPIDGEPVTFTVETLDSRGIPHTAKRRLAGSQRKL FT IKKELEHKKPKKWRREEASKHMKYGDPEPPFLYSQEVIQKASHEVKYEKLG FT LKPGEKLFDSLTNLKKDVEFNRYFRGIGNDKFYLMYWSPEQVSIHNDVQAK FT LRNPLSLDATGSVALKIKRPDGESADIFLTVLSTYIKSMIVPVAQVLSEKN FT DTNFLSYWLLEWRKSGAKIPEFIVTDMGKGIQNSVCLSFNTMNFSKYNDEC FT LKILLNAQYKIELNTQLRTDVAHLVHAVTKWPCFSSDKPKVKELFKRCVGF FT MTGIDNLKNFSEFLVAVFIVSNSKNNDDNCQKALKYLIERIQTYKFNANVT FT EKLELLDQKETLDIFADIEEFNTKSNQRLTRKYVDELKVKPIKNKSDNKSD FT NSSDFNIQNNDYYLPNFAERLSILCAEFPCWTKVMNVYFDNSEDVATSARA FT ESYFSDYKSSNDASQRGDVVFVNHCRQIDSDMLLARASLNNLQPDEIVSKK FT VIVKDDNEFLFATERWKRKYTTAECQEFSDFLSETVELAENSISETINGKN FT DLILDKNKLEQEQKSHIKNSMKVNDESQVTFDDDLEGFSDYIKLTSSIVGD FT EKLISSIHSPEKINSILQPINSNKTMHLNFNTDHNYSLEPEKESPKQIFQN FT TDEVLLIDHQQSESERVLLDYKKTEEVVLDYEQSDEEVDLKHSKETLLNKE FT KYSSYEVKDVDVKSSVNTVQNSVQLVVKNEQDQNSLPKNKKNRGKYVSAEP FT GLNLVLQKEKKKNKGRKVIQNGNKLQEKSVRGKKFRFNNTCSFDSITEIMT FT HSCYEVQAFEQFIKNNINSILEMQNLCYASTIINYATSSLTNTLYTNRAYL FT LHSLMPNLNNYDVNCQGNISNLFCSLMKDLTSIRETSLCQTCGYKTECNKC FT QIPIPNTDIIVKHNYTTMQKELDQYFTDKNVFCSNCKTSNAHLSRTVGSYL FT CIDTEDSYQRSLNPKMTDFNQLDITSTLDQVPVTINVKKDTFNLCGVARYI FT PSVVEKGMGHYVAYCRKINGTWDERNDLLTYKTAITRSQTKRLEMKISILF FT YVKFNND" XX SQ Sequence 9132 BP; 3378 A; 1376 C; 1490 G; 2888 T; 0 other; acccccctta ccccagcccg caaatattga atatatataa taaaaaaaat actagtcgat 60 gtatttttaa ttgcacatta caaataatct atttattgta attgaacctc cgtagttttt 120 taattaaaaa taaaaaaaga aatgtgctat atgtaccttg gaaaaatacg tttttctcaa 180 gaaatattgt gtttaccgat attctatcaa cgttaaaaaa ttgaacccta gctatttgta 240 attgagacac tgctaaatat tgcaaaaaaa gcataaaaaa atctaaaaaa tcacttttga 300 gaaatttaac atacagcacc aacgtattgc gcatgcgcaa accactccga gtgcgtcagc 360 ggtcagcgtg tggcaggcaa agcgagcagc tgttctgcag tggagaaaat acttagtgct 420 gctggctgca ctttgtgctg ctgctgatga tgtcgcatct gaaagcagta tgcagcgatc 480 caggggttgt gccattgccg caatcaagga tggacttttc ggatattcat acaggaagtt 540 ctggtggaga tgattgtgat gagaaatgac tggactgtgt gcactaggtg tgagacttac 600 agaccaccta gggcacatca ttgtaggatt tgtaaaaggt gcattaggag aatggatcat 660 caaactttct cctgatattc ttttctgtgt ccatattttg gaaaaactgt gctacttact 720 cgcacttgtt ggcaataaag cagatatgat tcatctgagg cggaggaagg tgaaattatg 780 aataaaatgt tataattctt gatgcaagga aaaaagaact attgtatgct tgtgaacaca 840 tgtaaacaaa cttgattata tattatactt atttgttttg aaatatataa atattctaaa 900 atatgaacaa ctttgagttt atttttacta attttggtaa ggaataggct tgaatcagta 960 cattaaaaat gacaaggagg tagaatttga aaatccattt attcgcctcc agactatttg 1020 tgtaatatac cttgacagca atgtctgctt gttttcacat tatcaatata tttggcttac 1080 gaactaagtt tctcttaatg tacaaacaac tatattattt aaaaaattaa tacgtggatt 1140 taattggcgt aaaaagtttg cccttattat gtattgtgca aaaaaagtgt ggattcttct 1200 tgcgtataca aaaattttaa atactcataa agcggagaac tgtggaagaa taaactggag 1260 cgcaaatctt agtccaacga gattctcgtt tcctcaatca accctcattc gcaaatcaaa 1320 tggatgcccg tcgtatttta agaaaaaatt ttataatttc gaaatctttt ataaattctt 1380 ctattgttat taatattatt atattatcat tgctctttca aatgcgaagt taaaaaaaaa 1440 cattttgtca cttttatgag tgcattaata cacatctggg gaaaatatga ttatgaaata 1500 attttttgta gctcaaggta tatatttact tatcataata aagctaaatt atttttttac 1560 ataaaaaaaa cttaaaatat attattaaat ctacagacat aaagcatgat aaaccacttt 1620 ttgtgaagtg attgtacatt atgactgtaa atagccaaga accttaagta ttcagttgat 1680 ataaatataa gcttatagag tctttataac taatattttt gttgcaatgc ctcatattat 1740 tcagatatgt attattgcat gataaaagtt acagtgttca aaaccaaacc gcgccgcaag 1800 aaaaaaaaac tcgacaaaag tgatatagag aaagagagac acgcatgaca gaatatctaa 1860 gctaggcatt caaaatttcg aaagtttaat ttccaaatga ttaaacggta taattcttaa 1920 acggttctct tctcttagaa ataaaaatga aatagaacag aacccacaat ccacgacgca 1980 cttctttttt taatattacc gatgtctttt ttcagtctac tgatcttttc atcgggtgag 2040 aggatcggcg ctgccgagaa gatctgcatg tacgactgta gaaatccata aaaatattct 2100 ggaaacactt tgcctcgtgc ctgtgacaaa tacaattcag cgctcttgcg attcgacgga 2160 tttttctcga tcatcgaggc caagatttct cgaatgcctt cgtcttctat attatcaagg 2220 tgcttgcttg cagagtagtc gccgtttcta taagctgtaa atgaaaattc ctaaaatcga 2280 atcaaaattt taaatggaaa aaaaaattaa aaatctattt acctaaaagt tgagagaaat 2340 cgaaaggcgg gtggccttca ttgtacaatt cagtcaaagc acaaccggcc gaaaaaatat 2400 ccatcatggg atgcaaatct cctgttttaa gttcttgaaa gagcagggta ttgttgggtt 2460 ctgaagtcaa cgttttaaca aatctctctg gtgcaatgta gcatgttcta aaaagaaaat 2520 cgaataagta aacattctct tctcttagaa ataaaaatga aatagaacag aacccacaat 2580 ccacgacgca cttctttttt tacttttatt aaaattgagt gtcccctcct ctggagagaa 2640 tcttcacaac acgcacacgc aggccgagca tctcatcact tcttggcgag ctcgttgctg 2700 agccagtcga aaccttcgta caatccgtga ccctgagtgg cgcaggcact ctttatgtac 2760 caggagtcag cataccactg attgtaagac ccatcttgta atgtcgtatc tatcgctgaa 2820 attaaaattt gatattacta tttaatacta gtgttaatac acatttgttg gaattgtgta 2880 tcttaaatat agtttttatt tttagttttt gtttaattat aattcttgtt caactttgaa 2940 atatactttt acaaaaatgt atacagcaac taccctgcca taatagcatt agatcattaa 3000 agtttaaaga agtgatttga gcaaatgaat gtattcctac tagatcaaat gatttaatgt 3060 tcaagttttt acagtatatt agttgtatac atttttaaat agataaccca tgcatggaaa 3120 aattattatg aatgatgcaa ctgcttatat ttaatccagt actttaatac aagtacataa 3180 agcatgaata cattgataaa aattattaat acagcacgga ggaaggtgaa attttggcca 3240 aagacttcga atgctggttc agcaaagtta tggctgccga acaggtacgt tttcctttaa 3300 tgtgaattag caatgagcct tatttagttt ttacaaaggt tgaaaacaaa tttcatttaa 3360 acgtactttt aggtggctca agtagcagaa tcctttcatg agctatgtcg agaagtgtgg 3420 caagacaaag gaataagcaa tctctgctcg acagaatgct cggtagtaaa tcaacgaaag 3480 tttattccag aggaaaaagc gactcagcgt tgccaaaaga ttgataagat gatattttta 3540 caatgaacga aactcaagct ttttaaatac ctgtggaact agatgaatgt tgagccttgt 3600 catatacaat ctctattttt tgtatttaca acaaaggatt gagctaatga aagccttttt 3660 ccgtttatta aagtttctta aaaataaatg ttcatgattt ttagtattac cttaatctta 3720 gaaaactgtg agccttatca ttatgatcgc agactacttg ttatcaatat gaaatactgt 3780 tatatgtatt ctcatcaact aataaattat tttcacacac acaacaaagt aattccaaat 3840 aatgattgtt ttcccagtct attccttgtc ttcgcttata aatattatcc atctctagtc 3900 ttgtagttca ggcaaatctt gtagttctgc aaaaaaaaat aataataaca ttcaattaat 3960 tgtaatcttt aagtaactaa aacctaaaat tttcaaaatt tagtgatttt atgtttacaa 4020 ttttacttgt tacaaaaagt tcaaagtaaa acgcttaagc cttcatatat aatatcttta 4080 atatttattt aattataaac aaaacatcat tgataaccaa tcgcgccaga acatccatcc 4140 aagttttgaa aacataataa cagctgatga gctttttcac gattatacgt caatgtaatt 4200 ataaccgtaa tatgaccgta actgacactt atgaaaatta attaaatcat gaataaatga 4260 ttgtagttct taaagtaatg ttgaaaataa tatatattta taatttataa ttaagaaata 4320 attatatgaa tcttatatta tgaatgatat taatatccct aatgaacaaa atttgtacag 4380 cccgcgataa tttatggcgg caagtaaatt ttcgaacaaa ggactgtcgt catttgttta 4440 ctatttatat tctaaatatt ctcttttcag tgatcaaaaa agtataggaa cagaaaacag 4500 ttcatatact gttctaagta aatttagtta cgtgtggata tttggattat ttgactgtta 4560 attaatcact ttgtcaatgg aataaaaaac aaaaagtata ataataaata tctgctaact 4620 atattatctt ggaaaaggtg aaattgataa gttaaaacaa tgaaaaaaga ttattttatt 4680 ccttaattaa aaattcttaa atcaaaataa ctattttttg ataaaaaaac atttcaaata 4740 ttttagattt ctataaaata atggttggac aaaaaccagc agttccagct aaagaaataa 4800 ttgaagcgct tatcaacttt gatatatatc aatcaaaaaa gaagcttaaa agaaagaaag 4860 acccaatttg gcaagagata tgcgatatct tgaacaataa cagagaaaga aaaataaaac 4920 ctcttaattt gtacatgcat gtagatcaag atagacataa cattcagtct tcctatcgaa 4980 agataaaaga tttagatata tatgaatctg atgactgtga aacaattata gaagaaacta 5040 atgaagatga aatttcagaa agtgagaaca cgaaggaaga cacagtctgc tatagaccaa 5100 ccaaaagaaa gaataaaaat ataatattta attttacaat aactccagag caatggtctc 5160 aaataaagcc tattacaata acttataaag atggaagaga aggtatgtcc ttgcaagagc 5220 cttacaggga gatgtttggg aaagaaatgt attataagga aaaattaccc tgtgcttaca 5280 attataaacg gaatgttata gatgacatag aatcttatgt caaagtcact ggttattgca 5340 atacttgcaa cgcacagtta atgattaact gttttgataa accgattgat ggtgaaccag 5400 tcacattcac tgtagaaact ttagattctc gtggaatacc tcatacagca aaaagacgac 5460 tagctggatc acagagaaaa ttaattaaga aagaattaga gcataaaaaa ccaaaaaaat 5520 ggagaagaga agaagcaagt aagcatatga agtatgggga tcctgaacca ccgtttcttt 5580 acagtcaaga agtaatacaa aaagctagtc atgaggtaaa atatgaaaag ttaggattga 5640 aaccaggaga aaagttgttt gactctttga caaatttaaa aaaagatgtt gaatttaata 5700 gatattttag agggattggt aatgataaat tttacttgat gtattggtca cccgaacaag 5760 taagtatcca taatgatgtc caagccaaac taagaaatcc tctatcgcta gatgctacag 5820 gttctgttgc cttgaaaata aaaagaccag atggagagag tgctgacatt tttcttactg 5880 ttctgtcaac atatataaaa agtatgattg tgcccgtagc acaagtattg tcagagaaaa 5940 atgacacaaa cttcttgtct tattggcttt tggagtggag aaaatctggt gccaagattc 6000 cagaatttat tgtgacagac atgggaaaag gcattcaaaa ctctgtttgc ttgtctttta 6060 atacaatgaa tttttctaaa tataatgatg aatgccttaa gattttatta aatgcacaat 6120 ataaaattga gttgaatact caattgagaa cagatgtggc acacttggtg cacgctgtaa 6180 cgaaatggcc ttgttttagc agcgataaac caaaagttaa ggaattgttc aagcgatgtg 6240 tgggctttat gacaggcata gataatctga agaacttttc agagttttta gttgctgtat 6300 ttatagtgtc taacagcaaa aataatgatg acaattgcca aaaagctttg aaatatctga 6360 ttgaacgaat tcaaacatat aaatttaatg cgaatgttac agaaaaatta gaacttttgg 6420 atcaaaaaga aactttagat atttttgcgg atattgaaga attcaataca aaaagtaatc 6480 aaaggcttac acgtaaatat gtagacgaat taaaagtaaa gcctataaaa aacaaatctg 6540 acaacaaatc tgataattct tctgatttca atattcaaaa taatgactat tacttaccta 6600 actttgctga acgattatcg attttatgtg cagaatttcc atgctggact aaagttatga 6660 atgtctattt tgataactca gaagacgtgg ctacttcagc aagagcagag tcatatttta 6720 gcgactacaa atcttcgaat gatgcttctc aaagaggaga tgtagttttt gtgaatcatt 6780 gtcgtcaaat tgatagtgac atgctcttgg cacgtgcttc attgaacaat ttgcagccag 6840 atgaaattgt ttcaaaaaaa gtaattgtta aggacgacaa tgaatttctt tttgccacag 6900 agagatggaa aagaaagtac acaactgctg agtgccaaga attttctgac tttctttctg 6960 aaactgttga gttggctgaa aatagtattt cagaaaccat taatggaaag aatgatttaa 7020 ttttagataa aaataagctt gaacaagaac aaaaatctca cattaaaaac tcaatgaaag 7080 tgaatgatga aagccaagtg acctttgacg atgatttgga agggttttct gattacataa 7140 agttgacctc ctcaatagtt ggcgatgaaa aattaatcag tagcatccat tcacctgaaa 7200 aaataaattc aatattacaa ccaattaaca gtaataaaac tatgcatctt aattttaata 7260 ccgatcataa ttatagctta gaaccagaaa aagaatcacc taagcaaatt tttcaaaata 7320 ctgatgaagt tttattaata gatcatcaac aatctgaatc tgaaagagta ttattagatt 7380 ataaaaaaac cgaagaagta gtattagatt atgaacaatc tgacgaagaa gtagatttga 7440 aacattccaa agaaacattg ttgaataaag aaaagtattc cagttacgaa gtaaaagatg 7500 tagatgtaaa aagctctgtg aatacagttc agaactcagt acaattagta gtaaaaaatg 7560 aacaagatca aaatagtctt cctaaaaata aaaagaaccg tggcaaatac gtatctgctg 7620 agcctggatt aaatctagtg ctacaaaaag aaaaaaagaa aaataaaggc agaaaagtta 7680 ttcaaaatgg caacaaactc caggaaaagt ctgtgcgtgg aaaaaagttt agatttaata 7740 atacttgttc gtttgatagc atcactgaaa tcatgactca ttcttgctat gaagtacaag 7800 cttttgaaca atttatcaaa aataatatta acagtatact cgaaatgcaa aatttatgtt 7860 atgcaagtac tattataaac tatgctacat catcattaac caatacatta tatacaaatc 7920 gagcatactt acttcattca ttgatgccaa atttgaataa ttatgatgta aattgtcaag 7980 gtaatataag caatttattc tgcagtttga tgaaagattt aacttccatt cgagaaactt 8040 cactatgcca aacctgtgga tataaaacag aatgcaataa atgtcaaata ccaatcccca 8100 atactgatat aatagtaaag cataattata cgactatgca aaaggaatta gatcaatatt 8160 tcacagacaa aaatgtattt tgttcgaatt gtaagaccag taatgcacat ctttcacgta 8220 cagtaggttc ttacttatgc atagatacag aagattctta tcagcgatct ctaaatccaa 8280 agatgactga tttcaatcag ttagatatta catctacatt agatcaagtt cctgttacaa 8340 taaatgtaaa aaaagatacc tttaatctgt gtggcgtagc taggtatatt ccttcagttg 8400 ttgagaaagg gatgggccat tatgttgctt attgccggaa gataaatggt acctgggatg 8460 aaagaaatga tctattaaca tataaaactg ctattacaag atctcaaacc aaacgtttgg 8520 aaatgaaaat aagcatctta ttttatgtaa aatttaataa tgattaataa attatataaa 8580 cataaattag aataaaaatc gtataatact tacataatca aattttgtta cgaaattacc 8640 tatgaactac gagcttacgg ccctactata cgactccgag cattagactt ttctataggt 8700 tatgtctcac acctagtcca gccactgcag aacagctgct cgctttgcct gccacacgct 8760 gaccgctgac gcactcggag tggtttgcgc atgcgcaata cgttggtgct gtatgttaaa 8820 tttctcaaaa gtgatttttt agattttttt ataaataaat gctttttttg caatatttag 8880 cagtgtctca attacaaata gctagggttc aattttttaa cgttgataga atatcggtaa 8940 acacaatatt tcttgagaaa aacgtatttt tccaaggtac atatagcaca tttctttttt 9000 tatttttaat taaaaaacta cggaggttca attacaataa atagattatt tgtaatgtgc 9060 aattaaaaat acatcgacta gtattttttt tattatatat attcaatatt tgcgggctgg 9120 ggtaaggggg gt 9132 // ID Copia-31_AA-LTR repbase; DNA; INV; 144 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-31_AA_; KW Copia-31_AA-I; Copia-31_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-144 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 957-957 (2011). XX DR [2] (Consensus) XX SQ Sequence 144 BP; 47 A; 22 C; 22 G; 53 T; 0 other; tgagtgtaac aatcttttac tcaaaatcca attgattcac tttgacagtg taaagaagag 60 aaacgtcaat aaaattcatt atgttgttgt tcttatcgaa tccgcacgtg ttttgttttc 120 tctgctaatt gaaagtaaaa ttca 144 // ID CVA repbase; DNA; INV; 465 BP. XX AC . XX DT 04-OCT-2002 (Rel. 7.09, Created) DT 04-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE CvA: putative non-autonomous DNA transposon element from oysters DE (Crassostrea virginica) - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CvA; KW nonautonomous DNA transposon. XX OS Crassostrea virginica OC Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia; Ostreoida; OC Ostreoidea; Ostreidae; Crassostrea. XX RN [1] RP 1-465 RA Gaffney M.P., Pierce C.J., Mackinlay G.A., Titchen A.D. RA and Glenn K.W.; RT "Pearl, a novel family of putative transposable elements in RT bivalve mollusks."; RL J. Mol. Evol 56(3), 308-316 (2003)In press. XX DR [1] (Consensus) XX CC Most common element observed in genomic sequence survey, CC estimated CC copy number 70,000. Modular organization includes subterminal CC inverted CC repeats (nt 25-35, 452-462), imperfect inverted repeats (nt CC 42-55/129- CC 142), self-complementary regions (nt 8-19, 275-284, 431-448, CC 450-463), CC and an (ACRG)n microsatellite region (nt 395-419). Putative CC target site CC duplication AA. Individual CvA elements contain 2-6 copies of a CC 156 nt CC core repeat unit, the first copy being truncated at the 5m end CC (nt 131-221 CC and 222-377). Portions of CvA show sequence similarity to CC repetitive DNA CC in other bivalve species. XX SQ Sequence 465 BP; 153 A; 87 C; 96 G; 129 T; 0 other; aacaagaggc ccatgggcca catcgctcac ctgagaaaac agttcaaatc aataaacaca 60 tatgattata tcataacgtt gattaagaga agaaaaaaaa cattattaac tttatgagaa 120 tttattggtt cttcatttga acaaacttga atccccttca cccaaggatg ctttgtgcca 180 agtttggttg aaattggccc agtggttctg gagaagaaga tttttaaatt tcgtcaatgt 240 attttcacta tttcgtaatt atctcccctt ggaaaagggc ggggcccttc atttgaacaa 300 acttgaatcc ccttcaccca aggatgcttt gtgccaagtt tggttgaaat tggcccagtg 360 gttctggaga agaagatgaa aatgtgaaaa gtttacagac agacagacag acagacagaa 420 ggtgatcaga aaagctcact tgagctttca gctcaggtga gctaa 465 // ID TATE repbase; DNA; INV; 6986 BP. XX AC . XX DT 12-JUL-2007 (Rel. 12.06, Created) DT 07-DEC-2010 (Rel. 16.01, Last updated, Version 3) XX DE Putative retroelement, possibly a new class. XX KW Transposable Element; TATE. XX NM TATE. XX OS Leishmania braziliensis OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania braziliensis species complex. XX RN [1] RP 1-6986 RA Peacock C.S., Seeger K., Harris D., Murphy L., Ruiz J.C., RA Quail M.A., Peters N., Adlem E., Tivey A., Aslett M. et al.; RT "Comparative genomic analysis of three Leishmania species that RT cause diverse human disease."; RL Nat Genet 39(7), 839-847 (2007). XX RN [2] RP 1-6986 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (07-DEC-2010). XX DR [1] (Consensus) XX CC The presence of tyrosine recombinase domain and a domain with a CC slight similarity to reverse transcriptase indicates it is a CC retrotransposon related to DIRS, but unclear. XX FH Key Location/Qualifiers FT CDS 357..1571 FT /product="TATE_1p" FT /translation="MRPATEKRLHIQKRFSRHILESLRQKRTXSLSSDSPT FT SEMSDKRLTEKELDEYLELFVAEEATKMTAELTRKLYREGGMTDADFHFAC FT KGMTKSLLRSVGAADPTTKNIKKEVMDAQKLFANKEIQQVLDVEERWEEIW FT DNIEASKNPLEGYKVKMEAAIVAKQMTPVIAITTATLLGLTGPRAQKATAD FT QRIAAWLAMFVDEPMTSRLLAYNTQIAQTSAGGMRQWRKVLTRVEVPLVPE FT TDRLEHLNRITFEKAHSESQAGPRTVEGSGVHRKHPCDTYRAPKRDGSDIL FT EGGAPYAPILGADGTQIGALDMAGFEALSGLTEENLLKMKDLLSQCLKEQK FT FGQRIPRNSQSQHDYQNSNDYTYGPAKHNGRGTGQRNGYRGNAGDNRRKFF FT RGGEAQDGSKN" FT CDS 1667..4612 FT /product="TATE_2p" FT /note="tyrosine recombinase." FT /translation="MWGGDMREGKTSNIELDVIYKAYLPKKRIVPSDTMVH FT LDWKRAQQLKAKVHRHGVVYFPIFIMKHWIAGLLEKGTRDSAEIQLSILDS FT APSPIVEEKLRKHFKMVWPALRLVNEFSPRQERYSDDCGLYMSAVFFGAHL FT DIQIDHSHDMAKCMRRLLYAASKHHPPREYFLEKMRKILTNHPVSRKDFFY FT NEIEHKPWLKKTTVNREDTFHLGGGERRATKSTNPKRDSRRANQPKARAPX FT PPPRQKKKEETVRTKMDRAERTKRTPQTPVPASKAPSRKRPERSAMDAPLP FT RRKAARKSDRRTPASSSGRSATGRNGNKTRSLNSSEFPSDDEDLPVIDLPW FT RKKNSVTRTPEHAPNDATVSSEGIIPTNVSPHVQHGKCISEWTEITLAEAN FT KHARKVYEAVLDHLWAATALARVGDGVAHGLTDTAVGQQRVRHKLTPLQPY FT SVQEMLKLLRKKIHMVDASPTDPNGVILREEYRSEEVTLDSSIDELYLVRG FT PSLPTLYDVIKGYRFLLGACLREAPADVNGVRYPSHYVLTKDASEATVGVY FT VPATWTHYQSSKRQRAPRHASRYIPLPRSAASEAHQDPSDGQRPRLVKRKR FT WPGDGAGRDPLQEDEEDGATYRTPGNQKGDGYADVDANISAEAMRALESLR FT SNPLNMQGRPLSKNEEAEVCPRNWFLFAAKPPHISQLAWNLVRPDTRAHHL FT RWLTRIKSMNSEQMLMRFPAACIDLILSTARARKWKWATTAKAFAAVAGAL FT RDLPLYSTQTRGIRLQDDPEWRSAFGTVQRYMKESVPDAPPFVSRQQVERI FT SKRLRLGHPRAALFLAMMWGFAARACDISTLRAKDVTLFPGTSTDTYVKVT FT LTIRKGKGAKTRGPYPIPSMLTRDLAATLQEMLVEKRPSEELFSPHVEELR FT ALIAQEVRTEMRGAQLPSIRKGALRCMAEAGVPLKDLMMISGHAKQATLLR FT YLGYGQQPTVEAETARDNAGRALFQTL" FT CDS 3727..6585 FT /product="TATE_3p" FT /note="weak similarity to reverse transcriptase." FT /translation="MELGEARHSRTPLAMVDANQVDELGTDAHALSGGMHR FT LDSVDRKGPQMEVGNHREGLCRGGGCAARPAALLDADAGYSPSGRSRVAKR FT FWHGAALHEGVGAGCASLRFASTGREDLQTAPVRPSTRRAVPRHDVGIRST FT SVRHFHAPSEGRDAVPGXIDGYIREGHADDPQGKGCQNTWPIPDPIDANEG FT PRSDAAGDAGREKTIRGAFLTTRGGAAGADRPGGANRDARCTAALDPERRA FT PLYGGSGCPVEGPDDDFRAREAGHAAALSWVWPAAYGGGRDRKGQRRKSAI FT PDPLEAAASVFRFRSQESSCANLGIAPQEVSAMVDQMSGFIQVLTERPALV FT KQWPLHLKRNTPLDMDTVLAMPTKRASTKRFLQRIQCFLDPSFYDGLRTSR FT TIKKCVLTTAEIQQAVEMGKFEPCPISDIGAQVQLPEGMHGVNVFTVPELK FT GRRRLITEPLLNRVIPKHHVPRVHYDTRLGRRQRLRYARYMLQIDFEAYYD FT AIPIAATLRNKFVFRARHDGRYYRLRTLPTGARWSVAVGQAVTWTIVDIDT FT PVTITTLIDNILVAAREGQEREFVLAVRTVVARIKAANLMTSPNRDELEAM FT SDEEILQLASANTVFLGEEYTWNGRERLIRNSVKTVAKLKLALQKTSHTIR FT SLASLISLIFFALHTTQMNPARAFKLLRAYRGIYRLTFRGYDWDDAVPYID FT SSVARSLQEIGGALVQNPWWKISDERHPTTDEATYDAVAFTDASLEGWGAV FT LHLRDAGATEMWTYRQRWTEDLERQLGGDDGEAERVLEKLRQYQLRRRVRS FT GGRFEDPDLQADRFQARYSAHAEPRAAQLMLRHLVEHHRVPNGARIALATD FT HRAIVIAQKHLNGFGGIGRGYALNKLFEYTYDLWYNRGIDVVFFYVEGARN FT PADAYSRHFGVDATGSLEVHRVEPFGVPFLRHTWCPLCEERRREEGGEI" XX SQ Sequence 6986 BP; 1755 A; 1973 C; 2018 G; 1230 T; 10 other; ccattggcgg cgtgtaaaac cacaaaggca atcgaatggc acttgacctt ataggggggg 60 gggggtggtg gttggtggga tgtggtgttg gatctgggaa atttgaccga tagctgaatc 120 ggtttcttgt atccgtggaa aaggaatcct gagtgagtgt gtgtgtctcg gaaaccgact 180 cctcacttcc acttgtgatg tgtgaccgtc tgtgtaccat aatgtttccc gattggaaat 240 tatggccaca ggtcggcaaa tatgtactct tgcatctgtt tctatggatt cctactttac 300 ctttgtccct ttatctcttc tccaaatgta tatagtaatc acactaagtg gagttcatgc 360 ggccagcgac ggaaaaacga ctacatattc agaaacgttt ctccagacac atactcgaat 420 ctctgcgaca aaagcgcacg ktttcccttt cctccgactc acctacatcg gaaatgtccg 480 acaaacgact aaccgagaaa gagctggatg agtacctcga attattcgtc gcagaggagg 540 ctacaaagat gacggcagaa ctcacaagaa aattataccg cgaaggcggt atgacagatg 600 ccgattttca tttcgcctgt aagggtatga caaaatcact actgaggagt gttggcgcag 660 cagatcccac aacaaaaaat atcaagaagg aagtgatgga cgcccagaag ttgttcgcaa 720 acaaagaaat ccaacaagtg ctggacgtgg aagaaaggtg ggaagaaata tgggataaca 780 tagaagcctc aaaaaaccca ctggagggat acaaagtcaa aatggaggca gcaatcgtag 840 cgaaacaaat gacacccgtt attgccatca ctaccgcgac tctgctgggc ttgacgggac 900 cacgggcaca aaaagccacc gccgaccaac gtattgcagc atggttggcc atgtttgtgg 960 acgagcccat gacgtcacgt ttactcgcat acaacacgca aatcgcacaa accagtgcag 1020 gaggaatgcg gcaatggaga aaagttttga cacgggtcga agtgcctttg gttcccgaaa 1080 cggacaggtt agagcacctc aaccgcatca ctttcgaaaa ggcacacagc gaaagtcaag 1140 caggaccaag aacagttgag ggcagcggag tgcaccgaaa acatccgtgt gatacctacc 1200 gggctccgaa aagagacggg tcagatatcc tagaaggagg cgcgccatac gcgccaattc 1260 tcggagccga cgggacacag atcggagcac tggacatggc aggattcgaa gcactctcgg 1320 gactcactga agaaaaccta ctaaaaatga aagacttgct ttcacaatgc ctgaaagaac 1380 aaaaattcgg acagcggatt ccgaggaata gccagagcca gcacgattac caaaattcaa 1440 acgactatac ctatggtccg gcaaaacaca acggaagagg cacaggccaa cgaaacggtt 1500 accgaggaaa cgccggagac aaccgacgta agtttttccg cggaggcgag gcacaggacg 1560 ggtcaaaaaa ctaacgcccg acggagccac agccgacgtt tcgcaggagg ccaccgcaca 1620 caagaggcaa accacattcg gtatcgcaag gtcatttgca aaagaaatgt ggggcggtga 1680 catgcgcgag gggaaaacat cgaacatcga actcgatgtg atctacaaag cgtacctccc 1740 taagaaacgg atcgttccgt cggacacgat ggtgcacctt gattggaaac gcgcgcagca 1800 actcaaggca aaggtgcacc gtcacggcgt ggtttatttc ccaatcttca ttatgaagca 1860 ctggatcgca gggcttttgg agaaagggac gcgggactca gcagaaatac agctgagcat 1920 cctggactcc gcaccttctc ccatagtgga agaaaaactg cgtaagcact tcaaaatggt 1980 ctggccagct ttgcgtttgg tgaatgaatt ttcaccacgt caggaacgct atagcgacga 2040 ctgcggtttg tacatgtctg ccgtattttt cggcgctcat ctggacatac aaatcgacca 2100 tagccacgac atggcaaagt gcatgcggcg cctgctgtac gcggcgtcga aacaccaccc 2160 gccacgcgaa tatttcctcg agaaaatgag gaaaattctg acgaaccacc cggtgtcacg 2220 gaaggatttc ttctacaacg aaatcgagca caagccgtgg ttgaagaaaa ccacggtgaa 2280 ccgtgaagac actttccact tgggtggtgg cgaaaggcgc gcaacgaaat ccacgaaccc 2340 gaaacgggat tccaggcgcg caaaccagcc gaaagcacgt gctccamagc cacctcctcg 2400 gcaaaagaaa aaggaggaaa cggtgcgcac aaagatggac cgggctgagc gcacaaagcg 2460 cacgccacag acgcctgtac ccgcgtcaaa agcaccttcc cgtaaacgtc cggagagatc 2520 cgcaatggat gccccactcc cacggcgaaa ggccgcgagg aaaagtgaca ggagaacacc 2580 agcgagttcc tcaggtagat ccgcaacagg taggaatgga aacaaaacac gctctctaaa 2640 ctcgagcgaa tttccatccg atgatgagga tctgccggtg atcgacctcc cgtggaggaa 2700 gaaaaactcc gtcacacgta cgcctgagca tgcgccaaac gacgccacgg tgagctcaga 2760 aggcattatc ccgacgaacg tgtcaccaca cgttcagcat gggaaatgca tttctgagtg 2820 gacagaaatc accctcgcgg aggcgaacaa acatgccaga aaggtgtacg aagcagtcct 2880 ggatcacctg tgggctgcga cggctctggc gcgagtcggc gatggtgtgg cacacggtct 2940 gaccgacaca gcggtgggac agcaacgcgt gaggcacaag ctgacaccac tgcagcctta 3000 cagcgtccaa gagatgctga agcttctccg aaagaagatc cacatggtcg atgcctcacc 3060 gaccgacccg aatggggtga ttcttcgaga agaataccga agcgaggagg tgaccctcga 3120 ctcctcaatc gacgaactgt accttgtccg tggaccgtcg ttaccgacat tatacgacgt 3180 gattaaagga tacaggtttc tgcttggcgc atgtctccgk gaggcgccgg cagatgtgaa 3240 tggcgtgcgc tatccgagcc attacgttct cacaaaggac gcctcggagg caacggtggg 3300 agtctacgtt ccagcgacgt ggacgcacta ccaatcgtcg aagagacagc gtgccccacg 3360 tcacgcgtca cgatacattc ctcttccgcg aagtgcggca tcagaggcac accaggatcc 3420 gtccgatggc caacggccgc gcctggtgaa gagaaaaaga tggccaggcg acggcgccgg 3480 aagggaccca ctccaagagg acgaggagga cggcgcgacg tacaggactc caggaaacca 3540 gaagggtgat ggatatgcgg acgtcgacgc gaacatctcg gcagaggcaa tgcgcgctct 3600 cgaaagcctc cgctccaatc ctttaaacat gcagggcagg cctctttcaa aaaacgaaga 3660 ggcagaggtc tgcccgagaa actggttcct cttcgccgcg aaaccgccac acatctcaca 3720 gctcgcatgg aacttggtga ggcccgacac tcgcgcacac cacttgcgat ggttgacgcg 3780 aatcaagtcg atgaactcgg aacagatgct catgcgcttt ccggcggcat gcatcgactt 3840 gattctgtcg accgcaaggg cccgcaaatg gaagtgggca accaccgcga aggcctttgc 3900 cgcggtggcg ggtgcgctgc gcgacctgcc gctctactcg acgcagacgc ggggtattcg 3960 ccttcaggac gatcccgagt ggcgaagcgc ttttggcacg gtgcagcgct acatgaagga 4020 gtcggtgccg gatgcgcctc ccttcgtttc gcgtcaacag gtcgagagga tctccaaacg 4080 gctccggtta ggccatccac gcgccgcgct gttcctcgcc atgatgtggg gattcgcagc 4140 acgagcgtgc gacatttcca cgctccgagc gaaggacgtg acgctgttcc cgggwacatc 4200 gacggataca tacgtgaagg tcacgctgac gatccgcaag ggaaagggtg ccaaaacacg 4260 tggcccatac ccgatcccat cgatgctaac gagggacctc gcagcgacgc tgcaggagat 4320 gctggtcgag aaaagaccat ccgaggagct tttctcacca cacgtggagg agctgcgggc 4380 gctgatcgcc caggaggtgc gaacagagat gcgaggtgca cagctgccct cgatccggaa 4440 aggcgcgctc cgttgtatgg cggaagcggg tgtcccgttg aaggacctga tgatgatttc 4500 cgggcacgcg aagcaggcca cgctgctgcg ctatcttggg tatggccagc agcctacggt 4560 ggaggccgag accgcaaggg acaacgccgg aagagcgcta ttccagaccc tctagaggct 4620 gcggcwtccg tgttccgttt ccgatcgcaa gagtcatcgt gcgcgaatct cggaatcgca 4680 ccacaggagg tgtcggccat ggtggaccag atgtccggtt tcatccaagt gctgacggag 4740 cgaccggcac tcgtgaagca gtggccgctg cacctgaaac ggaacacacc actggacatg 4800 gataccgtgc tcgcgatgcc gacaaaacgc gcctcaacga agcggtttct ccagcgaatc 4860 cagtgctttc tggatccctc cttctacgat gggttgcgga cgtcgaggac catcaaaaag 4920 tgcgtgctca caacggcgga aatccaacag gcggtcgaga tgggcaagtt cgaaccgtgc 4980 ccgatcagcg acatcggcgc ccaggtgcaa ttgccagagg gcatgcacgg cgtgaacgtc 5040 ttcacggtgc cggagctgaa aggacgacga cgcctcatca cggagcccct gctgaaccgc 5100 gtgatcccca aacatcacgt cccgcgcgtc cactacgaca cgcgcctcgg aagacgacag 5160 cggctgcgat acgcccgtta catgctacag atcgacttcg aagcttatta cgacgctatc 5220 ccgatcgcgg cgacactccg taacaagttc gtttttcgag ccaggcatga cgggcgatac 5280 taccgccttc gtactctccc gaccggcgcg cggtggagcg ttgccgtcgg ccaggcggtg 5340 acgtggacga ttgtsgacat cgacacgccc gtcaccatca ccacgctcat cgacaacatt 5400 ctcgtggccg cacgcgaagg ccaggagcgt gagtttgtgc tcgcggtgcg cacggtcgtc 5460 gcacgcatca aggcggcgaa cctgatgacg tcacccaacc gggacgagct ggaggcgatg 5520 tcggacgagg aaatcctgca gctggcgagt gccaacaccg tttttctcgg cgaggaatac 5580 acatggaatg gccgtgagcg kctgatccgc aactcggtga agacggtggc gaagctgaag 5640 cttgcgctcc aaaagaccag ccacaccata cgcagtctgg cctcgctcat ctcgctgatc 5700 ttcttcgcgc tccacaccac gcaaatgaac cccgcacggg cattcaagct gctgagagcc 5760 taccgaggca tataccggct gacgttccgc gggtacgact gggacgacgc ggtkccktac 5820 atcgactcct ccgtggcgcg gtcgctgcag gagatcggcg gcgcactggt gcagaatccg 5880 tggtggaaaa tctcggacga gagacaccca acgacggacg aggcgaccta tgacgcggtg 5940 gccttcaccg acgcgtcgct ggagggctgg ggtgctgtgc ttcacctccg cgacgcgggc 6000 gccacagaaa tgtggaccta tcggcagcgc tggaccgagg acctggaacg gcaactcggc 6060 ggcgacgacg gcgaggcgga acgcgtcctc gaaaaactgc gccagtacca gctgcgtcgc 6120 cgcgtccggt cgggwggccg gttcgaggac ccagacctgc aggcggaccg cttccaggcg 6180 cggtactcgg cacacgcgga accacgcgcg gctcaactaa tgctgcgaca cctggtggag 6240 caccacaggg tgcccaacgg agcgcgaatc gcgcttgcca cggaccaccg tgcgattgtc 6300 attgcgcaga aacacctgaa cggtttcggc ggcattggca gaggctacgc cttgaacaaa 6360 cttttcgagt acacctacga cctctggtac aacagaggga tcgacgtagt ctttttctac 6420 gtcgagggtg cgcggaatcc ggcggacgcc tactcgcgac acttcggggt ggacgcgaca 6480 ggatcgctgg aggttcaccg ggttgaaccg tttggcgtgc cgttcctccg gcacacgtgg 6540 tgtccgctat gcgaagagcg gcgccgcgag gagggcggcg agatatgatc gtcaccagaa 6600 acatccgagg ggagccggaa cctgacggcc gactctccgg aggacggtgc gacgacatcg 6660 ctcacaagta tggatggcta accgcgtgct atggggggga ggcttccccg gtaccggtgc 6720 caagccataa gaaaaagcac acgaaagaag tccaagacga cttgatgtgc tcctcacgcg 6780 accgagcgcg gtagaggagg ccggcggcat aatcatgacg cggcgccgcg gtcgtggttg 6840 cgtgcctgcc ttgccgcttc ccaatggggt cgcgaagact agctccatac ccatctgcct 6900 tccacaaatc tttctccctt ctcgccctct gcttgtcttc aagaaatcat atcttcgcga 6960 gcaccctatc acacaggacg gtggca 6986 // ID L2-7_NVi repbase; DNA; INV; 4786 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-7_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4786 RA Bao W. and Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(4), 757-757 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(457..774,695..1495) FT /product="L2-7_NVi_1p" FT /translation="MVKCRKCSRNVATVSKSCTTCGSTFHPRCHVTFLIGK FT PSDTCCKALVTLAADTQVEPTLTVQNHPSSIAALTSDSPSPLSFAATSEAG FT KKLRRKALATIAIRAARVVLRRPPRQAKSFAEKLSPPSPSELLASKQTRRE FT FSSSSESSNDSHNMPDSAADEAPAWFKAYLTEYRNDKAEVNQRLNNLETGM FT RKTVSSVVRNIALVDSCEITISGLPPDTSIQPRELARKVFAAIELEWLLNF FT VIRFRDWPQKAQANEASTSSRTEPGSRTIVVKLMSAPLRDSALENASLLDK FT TSAQALFGSGGARKIYINPLYPKPVYLLRKKAMRVAKQLNYARPVVQNLVV FT CMRQTKESPLIPIVSEEELAALEPYRPQQ*" FT CDS join(1496..3406,3325..3642,3566..4396) FT /product="L2-7_NVi_2p" FT /translation="RPASGPNSSGPLDSINIGHINVNRLASHFALFEEHCT FT TSHFDVIGVTETFLSVADPDDQLTLSGFNFFRVDREGQAGGGVAFYARDFF FT KVKILAHSGGNIPEFVIAELIYNNFKLLFAVVYRRPNSAHPSHFFQCLAPL FT LPHYLNAVVTGDFNADMCTLNANSNNLRSSLESLSLYLVPSEPTHHVFRYD FT RPPSHTWLDLFIVNCRDSVVALRKSDSPFIAGHDFIELTIRAAKPPACEKV FT IMCRNLKKVDPAVISAALCDILAPLANPFQHGQNRFLTASPTLPVTLGPCP FT ADTDVFQRHITSALISTYDAVAPLRRITLSSRRKPWVSPAIKALMKRRDAA FT HGLARSSGIHSDFERFRALRSEVSNLLDTAKNEYLAGRLVSAPDSNSKWRE FT LRSMYITSPRLPSPLLYFDADVLNRHYAAIVNRHPPLSEEVFQVLFDSPLA FT EPAADRVFDLRPVTLAEVRDSVLNASSKASGVDGISVPMIKAALPGVLDHL FT TDLVNACILNGTFPTDWKKALIVPLAKSKTLTSPSDTRPIAQLPELSKVLE FT RLVHSQLTSYLEAHRLLDPRQAGFRAGHSTQTALLGVLDDIRRGIDDRKLT FT ILILFDFSKAFDSIPHAKLLAKLKAMGSESARFAFFLTILDPACKTVGQTK FT SNGIRERSLRFFFNYLADRFQAVIDKGGTASDWLRASSGVPQGSVLGPLLF FT AVYINDLPRVLQFSRHMIFADDTQIYHQCYPNELPSALAAVQASSPTTRRF FT IINVTRMNCPAPWQRCKPNLNKCKAMILGSQFYVSRVDLPTLPKITVGGVR FT LDYVSEACNLGVWLTPSLDWQLHTTKVLRKIYGALYAPKTSERISSKAWFF FT PVLTTPARYTTMSMLRGTRSCNVRRTRAPDSFLAVSRSAPMLHYRLELGWL FT SVMRRREFLLGSLAYNVLAFENPIYLASGFKRITLDLSVRRSERNPPQALL FT YSAPRTEALKQSFAITASSLLNSLHQAEFSPDKIGAFKRSMRAALLERDRR FT DWADRVTNKGLTI*" XX SQ Sequence 4786 BP; 1081 A; 1358 C; 1056 G; 1291 T; 0 other; ttgccggatc agtgtaacgt gtgcgagctg cgttcgcaga taattgtgct tttgtaggtt 60 agagtacctg accacgtggc aactgaattt tgcatttctt gcttttccag tgtaacttct 120 cacggagtgt gtgcctatcg cccttgccag cttcgctgtg cccactaaaa tttgtttatc 180 gcttgtatac tacctaaaaa cgcccccgta gaaattttgt tttgcctgac cgtacgctta 240 cccgtatcac tgtgtatacg ggtgtaaaag ggcactgtag taccgacgtc tgaccgctag 300 cagaccacgt tcacttgtca ctcgcaacct ttgttttcag gcttttttct ggactatttt 360 tagtttttct ttcttcgggc gcacatttac ttagttttat ttttaatttt atttcgtatt 420 actcgcactt ttgcctcaaa cctcctagcg ctactgatgg taaagtgtag aaagtgttcg 480 agaaatgtcg ccactgtttc taagtcttgc acgacttgcg gctctacttt tcatccgcgg 540 tgccatgtga cttttcttat tggaaagccg tccgacacgt gctgtaaggc cctcgtaacc 600 ctagctgctg acacacaagt cgagcccact ctgaccgtgc aaaaccaccc gtcgagcatt 660 gcggccttaa cttccgattc tccttcaccg ctgagttttg cggcgacctc cgaggcaggc 720 aaaaagcttc gcagaaaagc tctcgccacc atcgccatcc gagctgctcg cgtctaagca 780 gactaggcgc gaatttagtt cttcttccga aagttcaaac gacagtcaca acatgcccga 840 ttcagccgca gatgaggccc cggcctggtt taaagcctac ttaaccgagt ataggaacga 900 taaagccgaa gttaatcagc gccttaacaa cctcgagact ggcatgcgca aaactgtatc 960 cagcgttgta cgaaacattg cgcttgtaga ttcctgcgaa ataactattt ccggacttcc 1020 gcccgacacc agcattcagc cgcgcgaatt ggcacgcaaa gtattcgcgg ctattgaact 1080 tgaatggctg ctcaacttcg tcatccggtt tcgcgactgg cctcaaaagg cccaggccaa 1140 cgaagcttca acctcgtcgc gaacagagcc aggatcgcgc acaatcgttg tcaagctcat 1200 gtccgccccg ctcagggact cagcgctaga aaacgcgagc ctccttgata agacctctgc 1260 gcaagcgctt tttggatctg gaggtgcaag gaaaatttat attaacccgc tgtaccctaa 1320 accggtgtat ttactgcgca agaaggctat gagggtggct aaacagctca attacgcacg 1380 tcctgttgta caaaacttgg tagtctgtat gcggcagacc aaagagtctc cgctcatccc 1440 gattgtatcc gaggaagagc tcgctgccct cgagccctat cgtccccaac aatgacgtcc 1500 agcctccggg cccaacagct ccggacccct cgattcgatt aatataggcc acattaatgt 1560 taatcgcctt gcgtcacact ttgcgttatt cgaggagcat tgtacgacat ctcacttcga 1620 tgtaatcgga gtaacagaaa cttttctttc cgtggctgat cccgatgatc agctgacgct 1680 cagcggtttc aactttttta gggttgatag agagggccag gccggtggag gagtggcttt 1740 ttatgcgcga gactttttca aggtcaagat tcttgcgcat tcaggtggca acattccgga 1800 atttgtaata gctgaattaa tatataacaa tttcaagctt ttattcgcag ttgtttatcg 1860 caggcccaac tctgcccacc catcacactt tttccagtgc ctagccccgc tcctgcctca 1920 ttacttaaat gccgttgtaa ctggcgactt caacgcagac atgtgtacgc taaacgctaa 1980 ttctaataat cttcgctctt ctcttgaatc tctctcgctc tatctggtgc cttctgagcc 2040 tacacaccat gtctttcggt acgatcgccc gccctctcat acgtggcttg accttttcat 2100 agtcaattgc agggactcgg ttgtagccct acgtaaatcc gactcgccgt tcatcgctgg 2160 tcacgacttc attgagttga ccatacgagc agccaagccc ccggcctgtg agaaggtaat 2220 catgtgccgc aacctgaaga aagtggaccc tgctgttatc tcagccgctc tgtgcgacat 2280 tcttgcccct ctcgctaatc catttcagca cggtcaaaat cgttttctga ccgcatctcc 2340 aactttgcca gtgacgctcg ggccctgtcc tgcggatacg gatgtattcc agcggcacat 2400 tacgtcagcg cttatttcta cctacgatgc agtcgcccct cttcgtagga tcacgctgtc 2460 gtcccggaga aaaccctggg tctcccccgc gatcaaggcg ctcatgaaac gacgtgacgc 2520 ggcacacggc ctggcgcgct cctcgggcat ccatagcgac tttgaacgct tccgagcgct 2580 ccgatctgag gtatcgaacc tcctggacac agctaaaaac gagtacctag caggcaggct 2640 agtctcagcc cctgactcga actcgaaatg gcgagaactt cgttcgatgt acataacctc 2700 acctcgccta ccatcacctc tactttattt tgatgcggat gtgctgaaca ggcactatgc 2760 tgctatagtt aacagacacc cgccactctc agaggaagtt tttcaggtac tatttgattc 2820 cccccttgcc gaaccagctg ctgacagagt ctttgacctt cggcccgtta ctttggccga 2880 ggtgcgtgat tctgtcctca atgcctcttc caaagcttca ggagtcgacg ggatttcagt 2940 tcccatgatt aaggcggcgc ttcccggagt actcgatcac ctaactgatc ttgtaaatgc 3000 ctgtatatta aatggaacgt ttccgaccga ctggaagaaa gcccttatcg tccccctggc 3060 aaagagtaaa actctcactt cgccttcgga cacgcgccct attgcgcagc tccctgagct 3120 gtccaaggtc cttgagcgct tagtccactc ccagcttact agctacctcg aagcacacag 3180 actcctagac cctaggcaag ccggctttag ggcaggccac agcactcaaa cggctttgtt 3240 gggtgttctt gatgacatcc gcaggggcat cgacgatcga aagctcacca tcctcatctt 3300 atttgatttt tcaaaggcct ttgactcgat cccgcatgca aaactgttgg ccaaactaaa 3360 agcaatggga tccgagagcg ctcgcttcgc ttttttttta actatctagc tgaccgcttt 3420 caggccgtga tagataaggg tggaactgcc tctgattggc taagagcgtc ctcgggcgtg 3480 ccccagggga gcgttcttgg accgctgtta tttgcagtat acatcaacga cctcccgcga 3540 gttctgcagt tctcgaggca tatgatcttc gccgacgaca cgcagattta tcatcaatgt 3600 tacccgaatg aactgcccag cgccctggca gcggtgcaag cctaacctca acaagtgtaa 3660 ggccatgatt ctaggcagtc aattttatgt aagccgcgtt gatctcccta cgctgcccaa 3720 aatcactgtt ggtggtgtaa ggcttgacta cgtttccgaa gcctgtaacc tgggtgtctg 3780 gctcacaccc tctctcgact ggcagctcca tactacgaag gtactgcgta aaatctacgg 3840 cgcgctatat gccccaaaga catcagaaag gatctcgtcg aaagcctggt ttttccctgt 3900 tttgactacg cctgcgcggt ataccacgat gtcgatgtta cgcggaacca gaagctgcaa 3960 cgtgcgcaga acgcgtgcac cagattcgtt tttggcagta tcccgttccg cgcccatgtt 4020 acactaccgt ctcgaactcg gctggctctc ggtcatgagg aggagggagt tcctgctcgg 4080 ttctctcgcc tacaacgtcc tggcttttga aaatcccatc tacctcgcca gcgggttcaa 4140 gcgtatcacg ctcgacctct ctgttcgacg ttctgagcgt aacccacccc aagcgctgct 4200 ctattctgcc ccgagaaccg aggcacttaa gcagtctttc gctatcacgg catcctctct 4260 attaaattcg ctgcaccaag ctgagttctc gcccgacaaa ataggggcgt tcaaacgctc 4320 catgcgggca gcgctccttg aacgcgaccg ccgggactgg gccgacagag tgaccaacaa 4380 gggcttaact atttaattat ttagatcttt gtttagccaa tcacctagtt ttcctagcca 4440 gcccagttcg tctacatagc ataacgcgtt tatctactta tctagtagtt taatcaccca 4500 ggcttatttg ttatttctcg tttgattaag ctcgcccctt gcgtacttcg cgcctagtat 4560 taagtcttgc gtactttctg tcctcgagat ttatcgaagc gattgcaatt agcgctacta 4620 caccgtattg cattctatgg actctattat actctgcgcc ttatcctact atgctacgat 4680 tttctctgta attattataa tttactatta cactgtattt ttttcataag cacccgatgg 4740 ggtcctaccc cagggtgaaa tatataaaca atcaatcaat caatca 4786 // ID Gypsy-1-LTR_MI repbase; DNA; INV; 230 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon from Meloidogyne DE incognita. XX KW Gypsy; LTR Retrotransposon; Transposable Element; TG-TA termini; KW Gypsy-1-LTR_MI. XX OS Meloidogyne incognita OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne; OC Meloidogyne incognita group. XX RN [1] RP 1-230 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Meloidogyne incognita."; RL Repbase Reports 9(2), 464-464 (2009). XX DR [1] (Consensus) XX SQ Sequence 230 BP; 78 A; 54 C; 38 G; 60 T; 0 other; tgtgatagga aaaagaacag ctgtcactaa ccactttttc ctcgcccact aattcttcaa 60 ttcattcata taatttttaa atcccaacct cccaaagaat aaatatatat taaaaaagaa 120 aaagtagagg ttttaaatct aggagataag cggctcgggc gcgtctccgg ttcccgcttg 180 cgggaaccgg atgacccaac gctcgagcca ccataaacta tactattata 230 // ID Copia-110_AA-I repbase; DNA; INV; 4225 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-110_AA_; KW Copia-110_AA-LTR; Ty1_copia_Ele111; Copia-110_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4225 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1493-2002] - Integrase core CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS 1682..4225 FT /product="Copia-110_AA-I_1p" FT /translation="MAERQTGKKLKTIRTDNGKEFANRKLEDHLRRLGIRH FT QTTADYTPEQNGLAERVNRTIVERARCMLFEANLPKAFWAEATATAVYLIN FT RSPTKGHSATPEEVWSGRKPNLAHVRVFGSKVMAHVPKPKRKKWDAKAFEC FT ILTGFDEDTKAYRLWDPKSKKIIKSRDVTFVSEAREKPSSSNSMQVAAPER FT KVVRLDLGDPFPVHEDPEPVRTEAIDVEDGIVPVEEHQSSSEDEEFLDPIR FT DVTNSALPPRSSSNPPESQALRRSGRERYLPGKLKDFFFPSRGLSSDRFSD FT NQPESDVYVDAGSAIDLPGPAASDRQQGLAARRQPSTDGDPRTPSEALRGP FT EAKLWKQAMDDEYRAQIENNTWELVELPPGRKAIGCKWIFKTKEDERGNVV FT RHKARLVAQGFNQKYGVDFDEVFAPVAKQVTLRTLLTVASRRQMFVKHVDV FT KTAYLHGKLDETIYMRQPEGYSTGGPNAVCLLKRSIYGLKQSARVWNQSID FT AVLRQMGFNPSSADPCLYVRRKGNSYTYVLLYVDDMLVVCRTEEEFKAIHQ FT ALRQHFNVTALGDVKHFLGIEIERDGNGFQLNQERYILKLAERFGLENAKP FT SKIPLDPGYLQQKEEEHELRLPNNTQYLSLIGGLLYVAVNTRPDIAVSVSI FT LAQKSSCPTQLDWHEAKRVLRYLKSTSNHKLILGSTADGLEMYADADWAGN FT HRDRKSNSGFLIRLGGGVVSWASRKQTCVALSSTEAELVALTEACQELSWL FT KKLMIDLGVNVSSPVTTFEDNQSCIRLVENGKIEKRSKHIATKYFFVRDLQ FT EQQQIILKYCPTEFMLADILTKPLQHLRLKTLRDKLGVLNYIVEEE" XX SQ Sequence 4225 BP; 1135 A; 999 C; 1189 G; 834 T; 68 other; awaggttatg ggccccagag agctcagata gttagaagaa cktagtwcaa gattcaagaa 60 aagttctcga agcgtatcct csaagatggc acgammcggc ggaggmkwtt cagcgagkcg 120 magagacgca gtccmgggtc ggakccattc matccatggt tcagcaagcc tgccgtcaat 180 cgagctkcts cacggmcgcg acaactggtc kacstggagg ttcgckgtgc agacgttcct 240 ggagctggaa gacttgtggg aagccgtgaa gccwgcgsag aacscmgacg ggacattcst 300 tcmggtggcg atgcatcgaa ggaccggaga gckcgagcga aaatcatcct tttgctcgac 360 ccggttaact acgtgcatgt gagggaagcc aagacagcac gggacacgtg gtcgaggctg 420 gaagcagcgt tcgaggactc cggattgacc cggaaagtcg gcctgttgag aaaactcatc 480 acaacctcct tggcaaccag caactccatg gagacgtttg ttaacgacat catagcaacc 540 gcacaccaac tgcgtggaat tggcttcgag atcaccgacg agtggatagg aacacttttg 600 ctggccggtt tacctgaaga atatcggccg atgatcatgg ctcttgaaaa ctccggagca 660 gcgatcaccg gtgacgtgat caagacaaaa ctgctgcagg agatcagagt ctgccagcgt 720 cgagtggggc cgcattcgct ggcaacaggc atggccggaa gccaatcaag caaagtgaaa 780 aaccgaatcc atcagcaagc aamggtccga aatgcaggmg ctgcmagaag ttcggmcaca 840 ttgcmaggga ktgcccgacg agaggtkccg aagaamggmg acggaaacgc ctggtgcacc 900 gtgmtgtcag csgttgaaga ggacgataam gattggtatt tcgactckgg agcttccagt 960 cattttacga agtcggmgsa gctctggagg aktttcgcma gtgcggcggm aaggttattg 1020 ccgcgaacmg mggckcsatg tmmgattgtt gccaagggaa gcgtgaaatt gaaacccgca 1080 tgctgcccga acgaccctgc gattacggtg gatgaggtgc aagtgattcc cgatttttca 1140 tcgaatttgc tttcggtgag ccagatagtg magcgaggmc acacggttaa cgttcaacga 1200 ggatggcgtc saagtcatca acccggcggg saggtaatcg cgactggtas kaggasmgcg 1260 acctgttcaa gctggaccat ctggttcagt ckcgaagagc actggcgtgc tctgcctcgg 1320 atgaacctgc agccctgtgg cacaggcgga ttggkcacat caaccgacag cttacatcgg 1380 atgaagcgtg gcttggtctc tggaatcgaa tatgcagagc cgcccaggag tagacgacaa 1440 gtgcaaggtc tgttcgatgg gaaagcagac gcgctmccgt ttmgsaagts cggttcaaga 1500 gcgtagtgaa gtactggagc tggtmcactc cgatatcggc ggaccgatgg aggtcccatc 1560 cctgggtggc agccgatact acctgacgtt cacggacgac aaatcacgca ggatcttcgt 1620 ttacttcctg gaagasaagt cagcgcagaa tgtatacgaa gctttcgaaa atttccgatg 1680 catggctgag cggcagacag gtaagaagct gaagactatt cgcacmgata acggcaagga 1740 gttcgcgaat cggaaactgg aggaccatct gaggcgtctg gggatccgtc atcaaacaac 1800 ggcggactac actcctgagc agaatggtct ggcggagcga gtcaacagaa ccatcgtcga 1860 acgagcgcgg tgtatgcttt tcgaagcgaa tctcccgaaa gcgttctggg cagaagctac 1920 agcgacagca gtctacctca tcaataggtc tcctaccaaa ggacactcgg cgaccccaga 1980 ggaagtttgg agcggtagga aaccaaatct ggcgcacgtt cgtgttttcg gttcgaaggt 2040 aatggcacac gttccgaagc cgaagcgcaa aaagtgggat gcgaaggcat tcgaatgtat 2100 tctcactgga tttgacgagg atacaaaagc atatcgtctg tgggatccga aatcgaagaa 2160 aatcatcaag agtcgagatg ttaccttcgt gagtgaagca agagagaaac catcgtcttc 2220 caacagcatg caggttgcag ccccggagcg aaaagtcgta cgactagacc tgggcgatcc 2280 gtttccggta catgaggatc ccgaaccggt acgaacagaa gcaatcgacg tcgaagatgg 2340 aatagtgcct gtggaagaac atcaatcaag ctctgaagat gaagaatttc tagatccaat 2400 tcgtgacgtg acaaactctg cgctccctcc gcgatcatct tcaaatccgc ccgagtcaca 2460 ggcgttgagg cgcagcggac gggagcgcta cctgccaggc aagttaaaag attttttctt 2520 tccgagtaga ggactctctt ccgatcgatt ttcagacaac caacccgagt ctgatgtcta 2580 cgtggacgca ggaagcgcca tcgaccttcc tggacccgca gctagtgacc ggcaacaagg 2640 actagcagcc aggcgtcaac catcaaccga tggtgatcca cgtactccgt ccgaagcatt 2700 gaggggacct gaagcgaaac tgtggaagca agcaatggac gacgaatatc gagctcagat 2760 cgagaacaac acgtgggagc tggtcgagct acctccgggc cgaaaagcga tcggctgcaa 2820 gtggatcttc aaaacgaagg aggacgaaag aggaaacgtc gtgcggcaca aagcaagact 2880 ggttgcacaa gggttcaacc agaaatacgg cgtggacttc gacgaggtct tcgcaccagt 2940 tgctaagcag gtgaccctca ggacgctact cacggtggca agccgtcgtc agatgtttgt 3000 gaagcacgtg gacgtcaaaa ccgcatacct tcatgggaag ttggatgaaa ccatctacat 3060 gaggcaaccg gaaggatact ccacgggagg gccgaacgca gtatgcttac tgaagcgaag 3120 catatacggg ctcaagcaat ccgcacgtgt gtggaatcag agcatcgatg cagttttgcg 3180 gcaaatggga ttcaacccat catcggcaga tccctgtctg tacgtgcgga ggaaaggaaa 3240 cagttacacg tacgttctac tctatgtcga cgacatgtta gtcgtttgca ggacagagga 3300 ggagttcaag gctattcacc aggcgttgag gcaacatttc aacgtgacag cactaggaga 3360 tgttaaacat ttcctaggaa tcgaaatcga gagggatggt aacggattcc agcttaatca 3420 agaacggtac atcctgaagc tcgcagaacg gttcggtcta gaaaacgcca agccttcgaa 3480 aatacccttg gatcctggct acctgcagca gaaggaggag gagcatgaac tacgtttgcc 3540 gaacaacact caatacctga gtctcatcgg cggtctgctg tatgtagcag ttaatactcg 3600 gcctgacata gctgtgagcg tttccattct ggctcaaaaa tcaagctgcc cgacacaact 3660 ggactggcac gaggccaaga gagtgctcag gtacttgaaa tcgactagta atcacaaatt 3720 gattctagga tccacagctg acgggctcga gatgtacgca gatgccgatt gggctggcaa 3780 ccatcgcgat aggaagtcca attccggttt cctaatacga ctgggaggtg gcgtcgtcag 3840 ttgggcatca aggaaacaaa cttgtgtggc gctgtcatca acggaggcgg agctcgttgc 3900 cttaaccgag gcgtgccaag agctcagctg gctcaagaag ctgatgatcg atctaggtgt 3960 caacgtttct tcgccagtta ccacattcga agacaaccaa agctgtattc gtctggtcga 4020 aaacggcaag atagagaaga ggtcgaagca tatagccacg aaatacttct ttgttcgtga 4080 tcttcaagag cagcagcaaa tcattctgaa gtattgtcct accgagttca tgttggctga 4140 catactgacg aaaccgctgc agcatctacg attgaagaca ttaagggata agttaggagt 4200 attgaactat attgtcgagg aggag 4225 // ID CR1-2_IS repbase; DNA; INV; 2569 BP. XX AC . XX DT 20-JAN-2010 (Rel. 15.03, Created) DT 20-JAN-2010 (Rel. 15.03, Last updated, Version 1) XX DE CR1-2_IS autonomous non-LTR retrotransposon from deer tick- DE incomplete consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-2_IS. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-2569 RA Kapitonov V.V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from animals."; RL Repbase Reports 10(3), 530-530 (2010). XX DR [1] (Consensus) XX CC The consensus sequence is less than 1% divergent from several CC copies of CR1-2_IS. The 5'-terminal portion of CR1-2_IS is not CC complete. XX FH Key Location/Qualifiers FT CDS 3..2294 FT /product="CR1-2_IS_1p" FT /note="RT domain; the N-terminal portion of the FT perotein is incomplete." FT /translation="RQHNNILNDQGNALDLCFSNTDGIRVARSELSMVPED FT SYHPALDVVLSVFDPPVLSPSNGPSPHTSYKYSAGDYTGMYHHLSGVDWAS FT ILESSDINGQVYLLTTVVQEAMDMYIPLKTSKKSRFPIWFSRELRTLLKRK FT DFLHRRFKKSGLPKWEVEFKICRKRCKKILNRDRQAHSDSVESDLVKNPKK FT FWRYAKSRLGKNIESEIVLNQHSDSAKGMSELFADHFASVYINNNNCSSSI FT LPHSDVFGDSLGSVVVDEECVRLCIKKMKPTLSAGHDGIPAAIVKAYHELL FT TPILCTIFNNSLTSGIFPDDWKLAVVVPVLKSGNSGDVLNYRPISLLSSFS FT KLFEMVIHQFLSFKFRSIVIPNQHGFMSGRSTSTNLVSFMSAASEVVCNRG FT QLDVFYFDLSKAFDVVNHNILLYKLSSYGVCGSVYNLLKSYLSDRXAYVRV FT NNTTSSLYKASSGVPQGSVLGPLLFNIFVNDVSRVILNSSFLQYADDIKLY FT KKISTLEDCIALQNDAYSFGRWCLDNDLRLNHSKTKVMTYSRKTHDIFFPY FT SYHGELLSRVSELRDLGVVFDSTLRFDTHVIRVVQSALRTLGAVSRITKEF FT KSPSAFFTLYCSLVRSQLEYASVVWNGISKTNSSSVERVQNKFISIFKYRY FT LENGDPCLNGVCNSQLVNLLSLQQRRVKADLLFLFKTLHGSINSPSLLSEV FT SLRVPRVPTRLQSSFYIARHFSNLNPIQRSAECYNRYSDTIDIFNSCGSKF FT GNAVQIFLLQEKG" XX SQ Sequence 2569 BP; 636 A; 549 C; 533 G; 849 T; 2 other; ttagacagca caataacatt ctaaacgacc aaggtaatgc tttagattta tgcttttcaa 60 acacggacgg tatacgtgtg gctcgaagtg aactttctat ggttcctgag gatagctatc 120 accctgctct tgatgtagtt ctttctgttt ttgatcctcc ggtcctcagt ccctctaatg 180 gtccttcccc acacacatca tataagtatt cagcggggga ctacactgga atgtatcacc 240 atctttcggg tgttgactgg gcttctatcc tggagtcgtc tgatatcaac ggccaagtat 300 atctcctcac aactgttgtt caggaggcta tggacatgta tatccctctt aaaacgtcta 360 aaaaatctcg cttccctatc tggttctctc gtgaactcag aacgctttta aaacggaaag 420 attttttgca caggaggttt aagaaaagtg gtcttccaaa gtgggaagtt gaatttaaaa 480 tttgcagaaa gcgatgtaag aaaattttaa accgtgatcg ccaggcgcat tctgactcgg 540 tggaatcaga tcttgttaaa aatcccaaga aattttggag gtacgctaaa tcgcgcttgg 600 ggaagaatat tgagtccgaa atcgtcttga atcagcatag cgacagtgcg aagggcatgt 660 ctgagttgtt tgctgatcac tttgcgtcag tttacattaa taataataac tgctcctcat 720 ctattttgcc tcactcggac gtttttggcg actcattagg ttctgtggtt gtcgatgagg 780 aatgtgttag gctatgtatt aagaaaatga aacccacatt gtctgcaggt catgacggga 840 tacccgctgc aattgttaag gcttatcatg aattacttac gcctatactg tgcacaattt 900 tcaataactc cttgacgtca gggatattcc ctgacgactg gaaactggca gtggttgttc 960 cggtmctgaa atctggaaat agtggtgatg ttcttaatta cagaccgatt tctcttctct 1020 cttctttttc caaactcttt gaaatggtaa tccatcaatt cctctccttc aagtttcgta 1080 gtatagtcat tcctaatcag catggtttca tgtccggcag atcgacatcc accaatctcg 1140 tatcttttat gtctgctgcc tctgaagtag tttgtaatag aggtcagctt gacgtttttt 1200 atttcgacct gagtaaggca ttcgacgtag tcaaccacaa catcctctta tacaagcttt 1260 cctcgtacgg agtatgtggt tcggtataca atttgttaaa aagctatttg tctgataggc 1320 awgcctacgt ccgcgtcaac aatactacct cgtccttgta caaggcaagc tctggagtgc 1380 ctcagggatc tgtcttgggc cccctccttt ttaatatttt tgtaaatgat gtctctcgag 1440 ttatattaaa ttcttcattt ctgcagtacg ctgatgatat aaaactttac aaaaaaatat 1500 cgaccttaga ggattgcatc gcgcttcaga atgacgcata ttccttcggt cgttggtgtt 1560 tagacaatga cctccgtttg aatcattcta aaacaaaagt aatgacatat tctcgcaaga 1620 cccatgatat attctttcca tatagttatc atggcgaact gctgtcacgt gtaagtgaac 1680 tgagggacct tggagttgtc tttgattcaa ccctgagatt tgacactcac gtcatcagag 1740 tggttcaatc ggccttgcgc acgttaggtg ccgtgtctcg cataacgaag gagttcaaaa 1800 gcccgtccgc tttttttacg ctgtattgtt ccttagttag atcccagcta gagtacgcct 1860 cggttgtttg gaacggcatt tccaaaacta acagttcatc tgtcgagcga gtccaaaaca 1920 aatttatttc tatctttaaa tatcgttacc ttgagaatgg tgatccttgt ctgaacggtg 1980 tgtgtaactc gcaacttgtc aacttgttga gcctccagca gcgtcgggtg aaggcagatt 2040 tgttgttttt gtttaagacc ctgcacggct cgattaactc tcccagcctt ctgtccgaag 2100 tgagcctcag agttccacgt gtgcctacac ggctccaatc atctttttat atagctcgac 2160 attttagcaa cctaaacccg atccaaagat cggcggagtg ttataatcgc tactctgaca 2220 caattgatat ttttaactca tgcgggagca agttcggaaa tgctgttcaa atcttcctct 2280 tgcaggaaaa gggctgatca gttgaagcgc tgatgtagta tttgcttttt gctctcacac 2340 ctggtgatgt ctctgttttg tgcttttgat ttcccttgtt gatctgagtg tctttcatac 2400 ttgttaatat ttttattctc agactggaga actacgttct ccttctgcac tcgtgtgtat 2460 gcgtgtttgt atgtgtgtgt ttttgctcat ttttttttag tgtgccagca ctaagaccga 2520 ttggttgttc ctggacactt taataaacat tgattgattg attgattga 2569 // ID DNA8-39_AP repbase; DNA; INV; 1069 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-39_AP. XX NM DNA8-39_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1069 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1969-1969 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. It includes Sola-like fragment at positions 493-828 CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1069 BP; 407 A; 151 C; 149 G; 360 T; 2 other; cagtcttggg taaaatactt ttgaaaagta tttggaataa atactcgaat acttatgaaa 60 aatagtattc cgaatatgta ttcgaatact tgataaatat aatattccga ataaagtatt 120 tggaatacta taaaagtatt ccgaatactt aaaaatattt ttgataaata tatgaaatga 180 taccagttat aattgaaggg tggaaatgca aaaggtggta ttgccaaaaa ccaaaatgtt 240 cacaaatcaa ggaaaactgt aacacatttt ctgacgacta taaacactca gaataagttt 300 atttatatgt aaaaaataaa aagtaaacga ttgacagtgt caaaataaat ctacctaaat 360 tgtaaacatt gtttttaatg ctttaaaaat ttcgtgtcag accaaaacgt ggtattgcag 420 gttttgaaaa ggtaatggca atatcacgtt ttaaagttgt tcctatttct gttgatgcaa 480 aacgcattat tgccggcaat accacgtttt gaagaaaaat gtaatattgg caataccacg 540 ttttgcaatg atatcaagta tgctccaata ctaataacat gggcaataat acgttttgac 600 ttgaacatga ttgaatatgt aataccacgt tttgcnttga cgttaaatgt ggcaatatca 660 cgttatgcat tgaacatcag ttttaaaatc gattttattc acaaaagtaa tcgatacaat 720 ttaattctga ctgcaaaatc gaattcctca cattttttcc cataaaaatc gatgtttaac 780 tcgattttcg aaattttggc aataccacgt tttgcatttt cacccttcaa ttttgatatt 840 gtatgatatt aatagatatt attgattcca aatttccaac tcataaaata taaattttgt 900 atttttgttt taaatcaaat gcaagggaat aacatgcaaa tcaaacaatt taatttccat 960 actataaaag tattccgaat aaagtattca gaatacatca caaaatactt ttggaaatag 1020 tattctaaaa gtattcggaa tactntattc ggaatactac ccaagactg 1069 // ID Gypsy-11_CQ-I repbase; DNA; INV; 5172 BP. XX AC AAWU01009286; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_CQ_; KW Gypsy-11_CQ-LTR; Gypsy-11_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5172 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 401-401 (2011). XX DR GenBank; AAWU01009286; Positions 7353 2182. XX CC Positions [4144-4614] - Integrase core CC 'GTCCG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1002..2891 FT /product="Gypsy-11_CQ-I_2p" FT /translation="MDESRPLPAFRCEQIEGGRLAKEWQEWKGSLQCYFDS FT YEITDQKLMRAKMLHLGGPQLQKVFRSLDGTEDFPEELVEKPWYDAAIEKL FT DAYFKPRRQDVLERHKLRNMKQAPSERFAHFVLRLRQQMQDCGFEKYRPKV FT RRIIEEMLMIDVIVEGCASQDLRRKILAKDQTLAEIEAMGESIESVLMQEK FT ELGKGNDSGRSAYSEVCKITKDKGSRRQAHFENRSKDEEHCPWTCFACGRR FT GHKAMDKECPANGRNCNKCNAIGHFGLRCPKSKRSFKQEIPEQPSKKIRSV FT EASDNRSEEMDIKKVYYTFYGGNTTNVVEVNIGGVSTEMLVDSGSDANLIT FT SKTWQKLKFRQVELLSCRKGGNKILRSYASEVPLVILGTFQAVVMVGELST FT EAEFFVVENGQRDLIGDFTSKRLGILKVGLDVQSVQGGSTETMPFSSIKGA FT KVHIQMDPDVKPVFQPLRRIPIALEDAVNKKLDDLLRRDIIEVKTGPATWV FT SPLVVANKANGEIRLCVDLRRVNQAVVRERHPMPVVEDVVAKIGRGRIWSV FT LDIKDAFFLLKLDEESRDIVTFITHRGLYRFTRLTFGLVSAPEVFQRHMDE FT MLADCEGAYWYLDDVGIVGDTVEEHDARLNKV" FT CDS 3172..5124 FT /product="Gypsy-11_CQ-I_1p" FT /translation="MGKFVHDLATIDEPLRKLTQKGTKFEWGDQAEAAFVE FT IKGRIANAQCLGFYRVEHVTSVIADASQYALGAVLVQTDSQQQSRIICFAS FT KSLTDTERRYCQTEREALALVWSVERFQIYLIGREFNLLTDCKALTFLFKP FT TSRPCARIERWVLRLQAFQYKTVFISGKDNIADALSRLSITVPVPFDLSEE FT LTIQEVSGGAASMIAVKWEEIKTASGADLEIQEVFQALESGCTDEMSLPYK FT VIAPELCRVDGVLLRGDRIVIPDRLRERVIQLGHEGHPGARVMKEHLRSTV FT WWPKIDRQVEDFVKACRGCTLVSAPNPPEPMIRKELPVGPWQQIAIDFMGP FT LPDGENLLVCVDYYSRYVEVVEMTDISTASTIKELLTVFSRYGVPESIRAD FT NGPQLSSAEFKEFCLEYGVALVSTIPYWPQMNGEVERQNRSMGKRLQIAHD FT LGLDWRVELRKYVLTYHSTKHSTTGHSPAELMFNRKLRTKLPCIPTTTADD FT GEVRDRDRIEKEKGRMYADRKRNATFSDITVGDTVVAKRMRRQNKVESEFA FT PEEFQVVRKKGSDVTIKSSVDGKQYRRCSTHLKRVRRAGSDETPFEDGPVM FT ESVVDSPEPEFAGDANIGELEENGDLNETVEEGPSNSKRKRCEPNRFKDYI FT PY" XX SQ Sequence 5172 BP; 1373 A; 1026 C; 1538 G; 1235 T; 0 other; aatggcgacg cggtgaagga aaaaaaaaag aaaaatttat cggcctttgg atttttagta 60 gcgcggaaat tgttgtaaag tggttttctc gagttaattg gtgattgtac gcggaattat 120 tgtaacaaaa tgcaggtaaa ataatgcagc ggcgatgagc tcagctaaaa gaacgaaaaa 180 acaaaatggc ggatgccaaa caaagcgtgc aaagaaaagt gaaaaaaaaa taaatgtcgg 240 gggggtttta tttgaacgcg gcggggcggc gagtgatttc tttttccaaa agaaaaaaat 300 gcgtcgggta aatgaaaatt acccattcat ggtgttccgg ccgagttcgg gcagaattcg 360 gacggaaata atttccaaca aaattcttgc tgagatgttc ggtttatata tgaggatgct 420 gacagcagtt tgtcagatct ttatttattt gttttgtgat aagaattaga tgcggatctg 480 ttaaaaaaaa agtgatcggc cgtcgttgag acgcgcaaga gatcgacgcc cgctggggtc 540 ggtatgtgtt cggccgtcgt tgagacgcgc aagagatcga cgcccgctgg ggtcggtatg 600 tgttcggccg tcgttgagac gagcaagaga tcgacgcccg ctggggtcgg tatgtgttcg 660 gccgtcgttg agacgagcaa gagatcgacg cccgctgggg tcggtatgtg ttcggccgtc 720 gttgagacga gcaagagatc gacgcccgct ggggtcggta tgtgttcggc cgtcgttgag 780 acgagcaaga gatcgacgcc cgctggggtc ggtatgtgtt cggccgtcgt tgagacgagc 840 aagagatcga cgcccgctgg ggtcggtatg tgttcggccg tcgttgagac gtgcaagaga 900 tcgatgcctg cgtggatcga tatgtgttag cggggtctga atatatgaaa aaaaagaaga 960 tgatttgatg atttttttta tcactttctg tttggttgta gatggacgaa tctcgaccgt 1020 tgcccgcgtt caggtgcgaa cagatcgaag gcggcaggtt ggccaaagag tggcaggaat 1080 ggaaagggtc tctccagtgc tattttgact cctatgagat cacggatcag aagttgatga 1140 gagcaaagat gctgcatttg ggaggaccgc agctgcaaaa ggtttttcgc agcctggacg 1200 gaacggaaga cttcccggag gaactcgttg aaaaaccttg gtacgacgct gccatcgaaa 1260 aacttgatgc ctatttcaag ccacgccgac aggatgtcct cgagaggcac aagttgcgta 1320 atatgaaaca agctccgagc gaacggtttg cccacttcgt tctccgtttg aggcagcaga 1380 tgcaggattg cggatttgag aaatatcgtc ccaaggtccg gcggatcatc gaggaaatgt 1440 tgatgatcga tgtaatcgtg gaagggtgtg cgtcacagga cctgcgccgg aaaattctgg 1500 cgaaagatca aacattggcg gaaattgaag cgatgggaga gtcgatcgag agcgttctca 1560 tgcaggagaa ggagcttggc aagggaaatg actcagggcg cagtgcgtac agcgaggtgt 1620 gcaagatcac caaggacaag ggatcaagaa ggcaagctca cttcgagaat cggtcgaagg 1680 atgaggagca ttgcccgtgg acctgttttg cttgcggacg ccggggtcac aaggcgatgg 1740 acaaggagtg tccagcaaat ggccggaatt gcaacaagtg taacgccatt ggccattttg 1800 ggttgaggtg ccccaagtcg aagcggagtt ttaagcaaga gattcccgaa caaccctcga 1860 agaagatccg atctgtggaa gcaagcgata atcggtcaga ggagatggac atcaagaagg 1920 tgtactacac tttctatggc ggaaacacga caaatgtggt ggaagtgaac atcggcggtg 1980 tttcaacgga aatgttggtc gattcaggct cagatgccaa tttgattacc tcgaaaacat 2040 ggcaaaagtt gaagttcaga caggtggaac ttttgagctg cagaaagggc gggaacaaaa 2100 ttcttcgatc gtacgcaagc gaagttccgc ttgttattct gggcaccttc caggcggtgg 2160 ttatggttgg tgagctttcc actgaggcgg agttcttcgt cgtagaaaat ggtcagcgtg 2220 atctgatcgg tgattttacg tctaaacgtc tgggcatact taaggtggga ctcgatgtgc 2280 agagtgtcca gggcggctcg acagagacga tgcctttctc tagtatcaag ggtgcaaagg 2340 ttcacatcca aatggatccg gatgttaagc cggtgtttca gcctctgagg cgaattccaa 2400 ttgcactcga ggacgctgtc aacaagaagc tggacgatct gctcagacgg gacatcattg 2460 aggtgaagac gggtccggca acctgggtat ccccattagt cgtagcgaac aaagcaaacg 2520 gagagatccg tttgtgcgtt gatcttcgac gggtgaacca agctgtggtt cgggagcgcc 2580 atcccatgcc ggtagttgag gacgtcgtcg caaagatcgg gagaggaaga atttggagtg 2640 tgctcgatat taaggatgcg ttttttctgt tgaagctgga tgaggagtcg cgagacatcg 2700 tcacgttcat aacgcatcgc gggttgtatc gcttcacgcg gctcactttt gggctggtgt 2760 cggcgccgga agtgtttcag cggcacatgg acgaaatgct tgctgattgt gagggtgctt 2820 actggtactt ggacgatgtc ggtatcgtag gagatactgt tgaagagcac gacgctcggt 2880 tgaacaaggt ataattcaat tgtgatcaat tggtggttag aaataaatcg gtcttcaaaa 2940 tctctgtttt ttttatttac atttttgtag gtgctcaaac ggtttgaaga taatggcgtt 3000 gtattgaatt ggacgaaatg taaagtacgc gttaccgaat ttgacttctt gggatacagg 3060 ttcagcccgc acggcattcg accctcgttg gcaaagcagg aagcggtgct gtcttttcga 3120 aggccggaaa acgaaagcga agtgagaagt tttctgggtc ttgctaacta catgggcaaa 3180 ttcgtgcacg atcttgccac tatcgacgaa ccgctgcgaa aattgacgca aaagggaaca 3240 aaatttgaat ggggagacca agcggaagca gccttcgttg aaattaaggg tagaatcgca 3300 aacgcgcagt gtcttgggtt ttatcgagta gaacacgtaa caagtgtcat tgctgatgct 3360 agccagtatg cgttgggggc cgttctcgtt caaacagatt cccagcagca gtcgcggata 3420 atctgttttg cctccaagtc gctcacggac acggaacgtc gatactgcca gacagaacga 3480 gaggccttgg cgctagtttg gagcgttgaa cgtttccaga tatacctcat cggaagagag 3540 ttcaacctgt tgacggactg caaggctttg acattcttgt tcaaacccac atcacgacca 3600 tgtgcacgca tcgaaaggtg ggttcttcgc ctgcaagcgt tccagtacaa gactgtgttt 3660 atttccggga aggacaacat tgcagacgct ctctcaaggt tatcaatcac agttccagta 3720 ccatttgatc tctcggagga gttgacaatt caggaggtta gtggtggggc tgcttcgatg 3780 atcgctgtga agtgggagga aattaaaacc gctagtggtg ctgatcttga aatacaagag 3840 gtttttcaag cactggagtc gggttgcact gacgagatgt cgctgccgta caaagtgatt 3900 gcacccgagc tgtgccgagt tgacggagta ttgttgagag gtgaccggat tgtaatacct 3960 gacaggctgc gcgaacgcgt tattcaactg ggacatgaag ggcaccctgg agcacgcgta 4020 atgaaagagc atttacggtc gacggtttgg tggcccaaga ttgatcgaca agttgaggat 4080 tttgttaaag cttgtcgagg atgcacatta gtctcagcgc ctaacccgcc cgaaccaatg 4140 atccgaaagg agctacccgt cggaccgtgg cagcaaattg ccattgattt tatgggcccg 4200 cttccggatg gcgaaaatct gttggtttgc gtcgattatt atagtagata cgtagaggta 4260 gtagagatga cggatatttc gacagcgtcc accatcaagg aactgctaac agtattctcg 4320 cggtatggtg tacccgaatc tattcgagca gacaacggcc cacaactttc atcagcggag 4380 tttaaagagt tctgcctgga atatggtgtg gcattagtga gcacaatacc gtactggccg 4440 cagatgaatg gcgaggtcga gaggcaaaat cggtcaatgg gaaaacgtct acagattgct 4500 cacgaccttg gtctggattg gcgtgttgaa ctacgaaagt atgtgttgac gtaccactct 4560 acaaagcact caaccaccgg tcactctccg gcggagctga tgttcaaccg gaagttgcga 4620 acaaagttgc cgtgtattcc tactacaact gctgacgatg gagaggtgcg cgatcgtgac 4680 cgaatcgaga aggaaaaggg aagaatgtat gctgacagga agcgaaatgc gacgtttagt 4740 gacatcacgg tgggtgacac ggtagtggcg aagcgtatga gaaggcagaa caaggtggag 4800 tcggaatttg cgccggaaga attccaagtg gttcggaaga agggaagtga cgtgactatt 4860 aagtcatcgg tcgatgggaa gcagtacagg agatgctcta cacacctcaa aagggttcga 4920 cgtgcgggaa gtgatgaaac accattcgaa gatggtcctg taatggaatc tgttgtggat 4980 agtccggaac cggagttcgc cggggatgcg aacattggcg agcttgaaga aaacggtgat 5040 ttgaatgaga cggttgaaga gggtccgtcg aactcaaaga gaaagcgatg tgaaccgaac 5100 cgcttcaaag attacattcc atattgatga tgtttttcag attcgtatta tttgaataaa 5160 gttaaggagg ga 5172 // ID Harbinger-4N1_BF repbase; DNA; INV; 601 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-4N1_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-4N1_BF; Harbinger-4_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-601 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-601 RA Kapitonov V. and Jurka J.; RT "Harbinger-4N1_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 800-800 (2008). XX DR [2] (Consensus) XX CC This is a non-autonomous derivate of Harbinger-4_BF. Exact TSDs CC are not clearly defined. XX SQ Sequence 601 BP; 178 A; 125 C; 134 G; 164 T; 0 other; taatagggcc ttcacactga aaccgaatta gcaggaatca agcagaacca gccggaatga 60 agtatttgca aaattcgtgc cacattcggg gccattccgg gtgggaattc caaatggacc 120 ttatatcccc ttcacacgag aaggaatgag ccagaatgcc gtcagaatga aaattctttt 180 ctattcctgt gaattcgtag tgtattctag acattcttac tgcattccag atattctttc 240 ggccacattc tggcaggatt cgaggtggct tgccgtttcg gttctccatc gaatgagata 300 agaatgtttc gaataatgtt ggaatgccgt agaatgcggt cagactgcag ttagaatagt 360 tagaatacac ttagaatcca ctttgaatgc cattcgatat ttctccactt cgaatgcacc 420 tcgacagttt ttaacatgtc aaaaactttc gagccagcca aaagaacggg gacgaatatc 480 tggaatgcag taagaatgtt tagaatacag tacgaattgc caggaattgc caggaataga 540 aaaaaatttt cattccgacg gcattccggc tcattcgtgc tctcgtgtga agggggctta 600 a 601 // ID Helitron-1_DYak repbase; DNA; INV; 9848 BP. XX AC . XX DT 31-MAR-2007 (Rel. 12.03, Created) DT 15-APR-2009 (Rel. 14.05, Last updated, Version 2) XX DE Autonomous family of Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; KW Interspersed repeat; Helitron-1_DYak; DNAREP1_DYak. XX NM Helitron-1_DYak. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-9848 RA Kapitonov V.V. and Jurka J.; RT "Helitrons in fruit flies."; RL Repbase Reports 7(3), 128-128 (2007). XX DR [1] (Consensus) XX CC This is a consensus sequence of a family of autonomous Helitron CC transposons transposed in the Drosophila virilis genome a few CC million years ago (copies are less than 5% divergent from the CC consensus sequence). The Helitron-1_DYak consensus sequence CC encodes the 2186-aa Hel-1_DYakp protein composed of the REP, HEL, CC and apurinic endonuclease domains. XX FH Key Location/Qualifiers FT CDS 2551..9111 FT /product="Hel1_DYakp" FT /translation="MPRLNRPTASQRARTQMLRNRINRENEGYAEVERVAN FT TVSQRRRREDSLVRNAEQGVNTVSRRSRRLNRPVRAIEQESNSQFRSQRRL FT DPAIRAVEQAADTRRRSARRQNSENRAVEQVQNTAGHRERRLDPALRAVEQ FT AADTRRRSERRQNSENRAVEQVQNTAGHRERRLDPALRAIEQAADTRRRSA FT RRQNSENRAVEQVQNTADHRERRLDPALRAVEQAADTRRRSIRRQNSENRA FT VEQVQNTAGHRERRLDPALRAVEQAADTRRRSVRRQNSENRAVEQVQNTAG FT HRERRLDPALRAIEQAADTRRRSERRQNSEIRAMEQEQNTANHRRRRLDTV FT YRAGEQEANTLRRALARDTQERLLERERDRANRARARAGRDRRAVEQARNT FT LARRSARLAARQATIDAIRNISQRVSQYRSLSANREAENLRNALRSQEIRL FT DIAQENRTIRIRPIEIEAHIATFNRNVKLGPDQICFCCKGLWFPKQISKLS FT RSYLEEHCEFREEIYQNARHLAFTGNLFKFCKTCHSNIKAGRKPKGAISNG FT LDFPEIHNSLRGLTPLEERLISPRLPFMIIKSIGHERQSAIKGAVVNVPIP FT VSNIVTSLPRAFNEAEVIQLHLKRRMEYGHDFMAETIRPAKIADAIRYLVN FT TELYRKHNVSVNEQWISDFTSETVPFIASQADVAFVEGQLAIQQEDENSEA FT RNSDQNTEAEELNPGGQETLLENNQTHTVGMTRITIAPGEGQKPLDVILDA FT DSEELAFPSIYAGIKRPSSESYTTIVRSELRNVDRRGCRTDKLFFNYKKLE FT LIKIRNNISTCLRKRSASSAITAANVLNEDFMGNLICHDEGYRILRDIRTS FT PAHWEDEKKKVMAMIRQFGLPTFFITLSAAETRWPELLVLLKRNVDRVNIN FT EEEASNLQFREKARLIRTDPVTCARYFDYRYREVLKLMKKPGGVFGSNFVT FT TYYWRVEFQQRGSPHIHGMFWLKDAPKVDLNNEESIRTVIAFIDQFVTVDV FT TNPDLAPYIEYQKHKHGHSCLKKVRGQSICRFGIPYPPMPQTEILNPLPEA FT TQNSLHHENFKKIRDFLLRNHADDMISHLSIFENFLSHNHVSLSYEDYIYA FT IRSSLKKPQIFLKRTFAEIQVNAYNKHILSMQRANMDIQLILDPFACCSYI FT INYINKSQRGISKLMREAANDIRRGNSSIRQKLQYLGHKFISGTEISAQEA FT VYCCLGMALSEASNKCVHINTFPPEQRVRMLRPTRDLQNMPQNSTDIFLKG FT LLDRYEQRPDFHEGLCLADFSANFEFSKSRRSTRQRANSDDETEEGNDVLG FT DVYFPLRDGSGFIRERNRASIIRYRPFNINTDRINYFRSLVMLFSPWRNEQ FT VDLIQRNCEEFYKENEEAIKANFEKYNAIDSLEDALRRAMEADSIEENEEE FT EVEHNEEFRALAIPEISSQINVLNLNNNLLDVDPDSNIRVIKLPPLISPLN FT LANLVRSLNLEQKTFLTHVLHNARTNRTFYEFVGGGAGVGKSRLISTLFQS FT LSKEYNGRVGCDPTSVKILLCAPTGKAAFGIGGSTLHSMFSLPVNQAGSAF FT RSLSPDLLNTLRSRFIDLKILIIDEISMVGATMFSHLDSRLKQIFSNVETP FT FGGISVIVFGDLRQLRPVCDRWIFQAPSHDPYSAIFGSYLWSPFRFFELTE FT IMRQRDDQPFAIALNNMASGQMTSDDISLLRSRISDESQIPNDSIHLFTSN FT QDTDRYNREKLNSIPTAQFLSEALDSVKSAQISLEQRNRCLEQARSLKTSE FT TQGLCTSLTLKTTAKYMMTVNVDTSDGLVNGATGVLREIGFSTSNTPDLLW FT IQFLDESIGVNARSKRRHCEEASWTPISKIIKSFQINSNQATTIDRKQFPV FT VPAEAITIHKSQGATYSKVVVHTNSSMQRAALYVACSRATTAAGLFIIGPF FT VPPRSAQDSASEAELRELRSTKLLNTHFDCLINSPNLHIFFHNTESLHCHI FT SDVCSDRLMLRSSLLCFVEASTYSNEVYEIPGFTTAVRLDCQSVASGARPK FT RGILVFIRNELFENVSLCSSGRFFSNISNLASPVFEFAIFKYRSLGILVLY FT KSPTYPLKKFETEFKELFQQHTFLNSNCLVLGDFNLCLHSISGCCKQIFDF FT LVDVKGFSSLLDLDSSTTNSNTHIDWAFSNIFDNRATARTFETTYSYHSGI FT LVSVREAE" XX SQ Sequence 9848 BP; 3153 A; 1743 C; 1898 G; 3052 T; 2 other; ccattacatt acaagttttt aagaatgtac gatttcaagg agtagcatgt tctatgacgt 60 cagagacagc cagccgatga cgtaatagat cggtgcagcg actttttggc agacgctaac 120 aatattcaaa actggaaaaa tgcttcaagc gttattttgt gcacaaataa aatattcgta 180 ttattaaata caccctttga aatattatgg ttctgcctcc acgaaataag cccaatctgt 240 ttaaatatta tcttattata taattttgat cggagcaatg ctttatctga gtagctgata 300 tatgatgata aataaatgaa atcatatgtt gttttgcttt tctttcccac ttctcaataa 360 ttcaatatat attgatctag tgttgaaaag aattttagtg ttaaatctcg ctcttagctt 420 ctctctctta gggcgctttt ccacgttttt tacaagtgcg tggcaggtga atgttagaga 480 gagagggacg tctctgttga cactttgaaa ggagtagcat gttctatgac gtcagagaca 540 gccagccgat gacgtaatag atcggtgcag cgactttttg gcagacgcta acaatattca 600 aaactgcaaa aatgcttcaa gcgttatttg tgcacaaata aaatattcgt attattaaat 660 acaccctttg aaatattatt gttctgcctc cacgaaataa gcccaatatc aatatctgtt 720 ttaatattat cttattatat aattttgatc ggagcaatgc tttatctgag tagctgatga 780 tatatgataa taaataaatg aaatcatatg ttgttttgct tttctttccc acttctcaat 840 aattcaatat atattgatct agtgttgaaa agaattttag tgttaaatct cgctcttagc 900 ttctctctct tagggcgctt ttccacgttt tttacaagtg cgtggcaggt gaatgttaga 960 gagagaggga cgtctctgtt gacactttga aaatagaaat cgatgcgcac tctttataaa 1020 gagagaaact cggtttttga acaaacataa tattatcaat atgtgccatt tttcctttct 1080 gaccttttat ttatcttctt ttactatagt aaaatatatg tctaaactat tttcttttgg 1140 taagataaaa gtaaaattct agaagcttga agtttgtttt tcttatgagc tggcaggcgc 1200 tacaatcatt aagcctttaa tcatatgtta tggttatttg gcttttctca ggtactcttc 1260 aataattcaa tatatattga tctagtgttg aatagaatgt aattttaata agagcattgc 1320 tcttagaaaa tgttaaatct aaatggaaga actcgctctt agcttacggt gctttcccgc 1380 gcttttcttg aaaaagtaaa tagtaaaaga aaagaaagaa gaagggagtc agtcgaagac 1440 taaattttgg tattacaaca tctttttaga gccgagtgcg atttttaatt gatttttcca 1500 gttcttaacg tttttttgcc aaccaataag tgttaagctc aatttttaac gtaaagatta 1560 cattttttaa taaaaagttg catacattaa cgtaaaatta aatttttaaa ttacaacgcg 1620 gtcgtgagtg ctgttgctgc ttcgattgac gtcgctgccg tttggcccgc tgctgtctcg 1680 tcgctgcgct cattgctaag tatcaaattt gtagttgatt tttctttatc ctaatatcgc 1740 ccacgctttt ctacctatat taatataaga actgagcctt tttaaaggct tattcttata 1800 tttttatcca tatacgtatt ccgcgtatat aaacttaata gtactaaaag tttaagtctt 1860 tttaaagact tgcacttata atactattaa tactatacta tattattata ctatataatt 1920 atactatatt caaataataa taatttaagt cttaagtctt tttaaagact tcttaactta 1980 ttattatttt ttttcacatt taatcttaat atattctata aattttccca ttaatagcat 2040 ttttaaaggc ttactcttat atttttatyc atatacgtac ttagcgtata taatcttaat 2100 agtactaaaa gtttaagtct ttttaaagac ttgtacttat aatactattg atactattat 2160 atttcttcaa ataataataa tttaagctta agtcttttta tagacttttt cttaactaat 2220 tattatttct cttttttcac atttaattaa tataaatata agaattatac tttttaaaaa 2280 aataattttt atatatacat attttacata tttatacgta tttgtttaat agtgctgcaa 2340 gcgttagtct ttgtaaagac tatcttctaa tactattaat ttaattatat tgatatttaa 2400 aataataata taagcttgag tctttttaaa tacttattct tataatatta tttttctatt 2460 aagtattttt tcctgtgttc aataaataat aatttttaaa tattatatat ttataaatat 2520 tttattaagt agtgtacatc atttttagaa atgcctcgtc taaatagacc cactgcttct 2580 caacgtgcaa gaacgcaaat gttgagaaat aggataaata gggaaaatga aggatatgca 2640 gaggtggaga gagtggcaaa tacagtatct cagcgtaggc gtagggaaga ctcgttagtc 2700 aggaatgcag aacagggcgt caatacagtg tcgcgaaggt cgagacgctt gaatcgtcct 2760 gttcgtgcga ttgagcaaga atctaactct cagtttcgtt ctcaaaggcg tctagatcct 2820 gcaattcgcg cagttgagca agcagccgat actcgacgtc gttctgcaag gcgacaaaat 2880 agcgagaatc gagctgtgga gcaggttcaa aatactgctg gccataggga aaggcgcttg 2940 gatcctgctc ttcgcgcagt tgagcaagca gccgatactc gacgtcgttc tgaaaggcga 3000 caaaatagcg agaatcgagc tgtggagcag gttcaaaata ctgctggcca tagggaaagg 3060 cgcttggatc ctgctcttcg cgcaattgag caagcagccg atactcgacg tcgttctgca 3120 aggcgacaaa atagcgagaa tcgagctgtg gagcaggttc aaaatactgc tgaccatagg 3180 gaaaggcgct tggatcctgc tcttcgcgca gttgagcaag cagccgatac tcggcgtcgt 3240 tctataaggc gacaaaatag cgagaatcga gctgtggagc aggttcaaaa tactgctggc 3300 catagggaaa ggcgcttgga tcctgctctt cgcgcagttg agcaagcagc cgatactcga 3360 cgtcgttctg taaggcgaca aaatagcgag aatcgagctg tggagcaggt tcaaaatact 3420 gctggccata gggaaaggcg cttggatcct gctcttcgcg caattgagca agcagccgat 3480 actcgacgtc gttctgaaag gcgacaaaat agcgagattc gggctatgga gcaggaacaa 3540 aacactgcca accatagaag aaggcgcttg gataccgtgt atcgcgcggg tgagcaagaa 3600 gcaaataccc tgcgtcgagc actggcaaga gatacacagg aacgcttatt agaaagagag 3660 cgtgataggg ctaacagagc tagggctagg gccggtaggg atcgcagggc agtcgagcag 3720 gcgagaaata cacttgctcg acgaagcgca cgattagctg ctcgacaagc aacgatcgat 3780 gcaattagaa atataagtca aagagttagt cagtatagaa gccttagtgc taatagggaa 3840 gcagaaaatc ttcggaatgc acttcgaagt caggagatta gacttgatat agctcaggaa 3900 aataggacta ttagaatcag acctattgaa atagaagctc acatagctac attcaatagg 3960 aatgttaagc ttggcccaga tcagatttgc ttttgttgca aaggattgtg gttccccaag 4020 caaataagta agctttctag gtcttatttg gaagaacatt gcgagttcag ggaagagata 4080 taccaaaatg caagacatct tgctttcact ggtaatcttt tcaagttttg caaaacttgt 4140 cacagcaaca ttaaagccgg ccgaaagccc aaaggggcca tttcaaatgg cttagatttc 4200 ccagaaattc ataactcttt gaggggccta actcccttgg aagaacgact gatttcacct 4260 cgccttccat ttatgattat taagtcaata ggtcatgagc gtcaaagtgc tattaaagga 4320 gcagttgtta atgtccctat tccagtaagc aatatcgtga cgtctcttcc acgagctttc 4380 aatgaggctg aggtaattca gctccacctc aaacgaagaa tggaatacgg acatgatttc 4440 atggcagaaa ccatacggcc tgccaaaatt gctgacgcta ttaggtattt agttaatact 4500 gagctttata gaaagcacaa tgtgtcagtt aacgagcagt ggatctctga cttcacatcc 4560 gaaactgttc cattcatagc atctcaagct gatgtagcct ttgtagaggg acaactagcc 4620 atccagcagg aggatgaaaa ttcggaggcg cgtaattctg atcagaatac agaagccgaa 4680 gaactaaatc ctggaggaca agaaactttg cttgagaata accagacaca tactgtagga 4740 atgacgagaa ttacaatagc tccaggtgag ggccaaaagc ctctcgacgt aattctcgat 4800 gctgattctg aagaactggc attccctagc atatatgctg gcataaaaag accatcttca 4860 gaaagctaca caaccatagt tagatctgaa cttaggaatg tagatagacg aggttgtcga 4920 actgataagt tattcttcaa ttataagaag ttggaattaa taaaaattcg aaataatatt 4980 tcaacttgct tacgtaagcg gagtgcctcc agtgccatta cagctgctaa cgtcctaaat 5040 gaagatttta tgggcaacct catttgtcat gatgagggat ataggatcct tagggatatt 5100 cgcacgtctc ctgctcactg ggaagatgag aagaaaaagg tcatggccat gattcgccaa 5160 tttgggctgc caactttttt cataacttta tcggccgctg aaactaggtg gccagagctg 5220 ttagtactgc tcaagcgcaa tgtagatagg gttaatataa atgaagaaga agcttcaaat 5280 ttgcagttta gggaaaaggc gcgccttata agaacagacc cagtgacgtg cgctagatat 5340 tttgattaca gatatagaga agttttaaaa cttatgaaaa agcctggggg tgtctttgga 5400 agtaatttcg ttactaccta ttactggcga gttgagtttc aacagagagg ttcgcctcat 5460 attcacggca tgttttggct gaaagatgcc ccaaaagtgg atttaaacaa tgaagagtct 5520 atacgtactg taatagcctt cattgatcag tttgtaactg tggatgttac aaacccagat 5580 ttggctccgt atattgaata ccaaaagcat aagcacgggc attcgtgtct taaaaaagtg 5640 cggggacaat ctatttgtcg tttcggtatt ccataccctc caatgcccca gacggaaatt 5700 ttaaatccac ttcccgaagc aactcaaaat tctcttcatc acgaaaactt caaaaaaatt 5760 cgagattttc tacttcgcaa tcatgcagat gacatgatta gccatttaag catatttgaa 5820 aattttcttt ctcacaacca tgtgagcctc agttacgagg actacattta tgctattagg 5880 tctagtttga aaaaaccaca gatttttctg aagagaacat ttgcagaaat tcaagtaaat 5940 gcttataata aacacattct ttctatgcaa agggcaaata tggatatcca gttaattttg 6000 gacccttttg cttgctgtag ctacattatt aattatatta acaaatccca acggggtatt 6060 tcgaagttaa tgcgagaagc agctaatgac attagaagag gcaattcaag cattcgacaa 6120 aagctacagt atctcggtca caagtttatt tcaggcactg agatatctgc tcaggaagcc 6180 gtgtattgtt gtctgggaat ggcgttgtcc gaagcaagta ataagtgcgt tcatatcaac 6240 accttccctc cagaacaacg agttcgtatg cttagaccta cacgcgacct tcaaaatatg 6300 cctcagaatt caactgacat atttctgaaa ggattgctgg atagatacga acaaagacca 6360 gatttccatg aaggcctgtg tttagccgat ttttcagcaa attttgagtt ttcgaaaagc 6420 cgtagaagta ctcgtcagag ggcaaatagt gacgatgaaa ctgaagaagg caatgacgtg 6480 ttaggagatg tatattttcc gctccgagat ggaagcggtt ttattaggga aagaaatagg 6540 gctagtatta ttaggtatag gccatttaat attaatacag ataggattaa ctattttaga 6600 tcattagtta tgctcttttc tccgtggcga aacgaacagg tagatttaat tcaaagaaat 6660 tgcgaggaat tttataagga gaatgaggag gctattaaag ccaactttga aaagtataat 6720 gccatagata gtttggaaga cgcgcttaga agggccatgg aagcagatag tatagaagaa 6780 aatgaagaag aggaggtcga acataatgag gagtttaggg cattagctat tcctgaaata 6840 tcttctcaaa ttaatgtttt aaacttaaac aacaatttgt tagatgttga tccggatagc 6900 aatatccggg taataaagct tccgccactt atatcccctt taaacttagc caatttagtt 6960 agatctctaa atttggagca aaaaactttc cttactcatg tcttgcataa tgcaaggaca 7020 aatcgaacct tttatgagtt cgtaggtggc ggagcaggtg tcggcaaaag tagattaata 7080 tcaacgttat ttcagtccct atccaaggag tataatggta gagtaggatg cgacccaact 7140 tcggtaaaaa ttttactgtg tgctcccaca ggcaaagctg ctttcggaat cggaggatcc 7200 actcttcatt ccatgttctc tctccctgta aatcaggctg ggtccgcatt tagaagttta 7260 agtcctgatt tgttgaatac ccttcgttca agatttattg atcttaaaat tcttataata 7320 gacgaaattt ctatggtcgg tgcaacaatg ttttctcatt tagactctcg cctaaagcaa 7380 atattttcta atgttgaaac tcctttcgga ggtatttcgg ttatcgtgtt cggcgatctt 7440 agacagttgc ggcccgtatg tgatcgctgg atttttcagg ccccgtcgca tgatccatac 7500 agtgctatat ttggatcgta tctgtggagt ccttttaggt tttttgaact cacagagata 7560 atgcgacagc gagatgacca accgtttgca attgcactta ataacatggc atccggccaa 7620 atgacgtctg atgatattag cttgcttaga tctagaattt ccgatgaaag tcaaattcca 7680 aatgactcaa ttcacctttt tacaagtaat caggacacag atcggtacaa tagggagaag 7740 cttaactcga tacccactgc tcaatttctc tcggaagctt tagactcagt taagtccgct 7800 caaataagct tagagcagag gaacagatgt ctggaacaag ctaggtccct aaaaacatct 7860 gagacacaag gattgtgtac ctcgctcact ttaaaaacca ctgccaaata tatgatgacg 7920 gtcaatgtag atacatcaga tggtctcgta aacggagcca ctggtgtcct tagagagata 7980 ggcttcagta cgtccaacac tccagatctt ctttggatac aatttttaga tgaaagcata 8040 ggagtaaatg cgcgttccaa gcgcagacat tgtgaggagg cttcatggac gccgatttcg 8100 aaaattataa aaagttttca aattaattct aatcaggcaa ctacaattga ccgaaagcag 8160 tttccagtgg ttcctgcaga agcaataact attcataaaa gtcaaggcgc tacgtacagt 8220 aaagtggtag ttcacactaa ctccagcatg cagagagctg cgctgtatgt agcctgtagt 8280 agagccacaa ctgcggcagg tcttttcatt attggacctt ttgtcccccc aagatccgct 8340 caggattcag cttcagaagc agagctccga gaattgcgtt ccactaagct actgaatacc 8400 catttcgatt gtttaatcaa tagtccgaac cttcatattt ttttccataa tacggaaagt 8460 ttacactgcc acattagcga tgtttgttcc gatcgtctaa tgcttaggtc ttctcttttg 8520 tgcttcgttg aagcatcaac atactcaaat gaagtatacg aaattcctgg ctttaccaca 8580 gcagtgagat tagattgcca gtcagtagct agcggagccc gaccaaaaag aggcattctc 8640 gtttttattc gaaacgaact atttgagaat gttagtcttt gctcaagtgg tcgatttttt 8700 tcaaatattt caaatttagc ctctccagtg tttgagtttg ctatctttaa gtaccgatct 8760 ttagggattc ttgttttata taaaagtccg acctatccct taaagaagtt tgaaactgaa 8820 tttaaggagt tatttcagca acacacattt ttaaatagta attgtttagt gttaggagat 8880 tttaacttgt gcctccactc catctctggc tgctgcaagc aaattttcga ttttttagta 8940 gatgtaaagg gttttagttc gttgctagat ttggattcgt ctacaaccaa tagcaatacg 9000 catattgatt gggcgttttc aaatattttc gataatcggg ccactgcgcg cacatttgag 9060 accacatata gttaccacag tggtatatta gttagcgtta gggaggcaga gtaggtattg 9120 tatagttatt taattaaata taaactaaat aagtatatat ttttgaattt ttttaattat 9180 ttattattaa atwaaaaaaa aaacaaaaac aaaaaaaaac aaaaaaataa aaaaaaataa 9240 ttaatattat taaataaatc tgataattta atttaattta caaaaaataa caagaaaaac 9300 ccaaaaaaaa tataattgaa ttttttaatt attaaatgta tttattatta aattaaaaaa 9360 caaaaaaacc caataaaata aataaaaaca aaaaaaacct aaaaaaataa ttaatatttt 9420 taaataagtc tgataattta atttaattta aaacaaaaac aagtgagaac ccacaaaaaa 9480 aattaagtaa aaacctaaaa aaataagtca tatttttaaa taagtttgat aatttaattt 9540 aatttaattt aataattcat tttttaatta tttaatttaa tactaacaat atctatgcct 9600 ccgtatgcta cgggtaaaca aaatttttgg ggaacatgga gttttaaaaa tttttcaaag 9660 tttgaaaact caagtcgact cgtaggtccc caaaattgat tttggtttaa acgaccccgt 9720 tgtaactcaa cggcctccac gtggtgttcc ctgggtaccg atttcaaaat tttaatcgga 9780 agctaattcg agcggcgaaa agttacgtac atatatgaag gcatatttat tgttttttaa 9840 atttagct 9848 // ID Copia-30_AA-LTR repbase; DNA; INV; 207 BP. XX AC supercont1.42; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-30_AA_; KW Copia-30_AA-I; Copia-30_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.42; Positions 2422745 2422951. XX SQ Sequence 207 BP; 55 A; 42 C; 33 G; 77 T; 0 other; tgttagaatt gcaacccctg atagtgagta gccctcattt taatttctca cagtgtgcac 60 cttctgtgct ctttttttta ctacctaaga tgtataaggt aaacattagt attataagat 120 aaccctgaat acatatcgac acgttttatt gagttccgta aatccgtgat ttgttccgtg 180 ttcttgtgac catccgaaac tatttca 207 // ID Gypsy3-I_Dya repbase; DNA; INV; 2305 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_Dya; KW Gypsy3-LTR_Dya; Gypsy3-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1040-1040 (2009). XX DR Genome; chrU; Positions 5680974 5678670. XX CC 'CTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 290..2131 FT /product="Gypsy3-I_Dya_1p" FT /translation="MESPRSMIEDAAAGVVPPSSEHCTNAESMNVHDDARP FT STSTNHRDSELAREEHFPEVSTQFDAMQRQLRLLQLENEMLKLESARRHNA FT ATSDQNSAATSGHNKATPLIRSSVAKPNQHKDATQDQEVAGGDARSEEAFV FT NKQTLLAMAKEMIPIFDGASNNKLNVSTWVAQLNAVSKMFKLSDDVIRMLV FT MSKLKDHAQIWLHSSERLLTLPVQELLFQLGEAFHGKESKIISRRKFQERK FT WKPSEDFSTYFKEKTLLATQIRIDDEELIDNIIEWIPDSLLRQQAHMHCFN FT SSAQMLHAFAKVSLRKPLLSPAGRVKVSFDKEPGSAPPKRCFNCNAVGHFA FT ADCRKPKREYGACYACGSKDHLVYNCTERKFVSDNEYVRPFKIYFSSKPNE FT CLFLECLIDSGSPISFIKKSYVEKILKKEDITLNGKILKNYFGLNGSPLDI FT IGKISCFVIINKEILNFELLIVTDRSMAYGTVLGRDFMKISNFKIVREKDS FT SNDRHCENDCLNNRNFENFIANDRHCENDRSNDRNVENFIANDRYCENYDS FT NSGNAVLNNENSEDNGLNDRKSENDVANVKNYGNKCSININNGNESKTSSP FT YDGLEILSIEYPDPIE" XX SQ Sequence 2305 BP; 767 A; 436 C; 514 G; 588 T; 0 other; aattcagaag tgggattact gtgaattttt ttttcacaaa aaactcaacc tacgaagcaa 60 agtaaaaaca atatctttaa atataatcaa atggaaacca tcggtgcgga cgattataca 120 gcagcagaac tgcgcaattg gcggtaacaa ggccacgctt gcggcgcgat tgaacggtgt 180 gcccccagaa gcccgaggag tttgcccggt gattggcgaa ttagaaggcg aaatcgctaa 240 caatgatttg gagcacgaaa tcgtcggaga gtcagcacca gaggtcgaca tggaaagccc 300 aaggagcatg atcgaggacg ccgccgctgg agtggtgccg ccaagcagcg aacactgcac 360 caatgctgaa agcatgaatg ttcatgacga cgccaggccc tcaacatcta ccaatcatcg 420 cgattcggag ttggcacgcg aggaacattt tccagaagtt tctacgcagt tcgacgccat 480 gcagcgtcaa ctaaggctgc tccaattgga aaacgaaatg ttgaaattgg aatctgctcg 540 acgccataac gctgccacat cggatcaaaa cagcgctgcc acatcgggtc ataacaaggc 600 taccccacta attcgaagca gtgttgccaa accaaatcaa cacaaggatg ccactcaaga 660 tcaagaagtc gctggcggag atgccagatc ggaggaagca tttgtaaata aacaaacatt 720 gctagcaatg gccaaggaaa tgattccgat attcgacggt gcttctaaca acaaactcaa 780 tgttagcaca tgggtcgccc agctcaacgc ggtctcaaaa atgtttaagc taagcgacga 840 tgtaattcgc atgctagtga tgtccaaatt aaaggaccat gctcaaattt ggttgcactc 900 ttcggaacgt ttgctgactt tgccggttca agaacttcta tttcaacttg gtgaagcttt 960 ccacggtaag gagagcaaaa taatatctcg acgcaagttt caagagcgca agtggaaacc 1020 gtcggaagat ttctccacat acttcaaaga aaaaacgctg ttggctacac agattcgaat 1080 agacgatgag gagctaatcg acaacatcat tgaatggatt cccgactccc ttctacgaca 1140 acaggcacac atgcactgtt ttaattcgtc tgcgcagatg ttgcacgcat ttgccaaggt 1200 ctcgttacgt aaaccgctac tttctccggc tggacgtgtt aaggtttcgt tcgataagga 1260 acctggttcg gcgcctccaa aaaggtgctt caattgcaat gctgttgggc atttcgccgc 1320 cgactgccgc aagcccaagc gcgagtatgg agcctgctac gcatgtggca gtaaggatca 1380 tctggtgtac aactgcactg agaggaagtt cgtatccgac aacgaatatg taagaccgtt 1440 taagatttat tttagttcaa aacctaatga atgcctgttt ttagaatgcc taatcgattc 1500 tggaagtcca attagtttca taaagaagtc atacgtcgaa aaaattttga aaaaggaaga 1560 tattacgtta aatggaaaga ttcttaaaaa ttattttggc ttaaacggta gccctttaga 1620 tataattgga aaaatatcat gtttcgttat tatcaataaa gaaattctaa attttgagtt 1680 actaattgtg acagacagat cgatggcata tggaacagtt ttgggaagag attttatgaa 1740 aattagtaat tttaagattg ttagggaaaa agatagttca aatgatagac attgcgaaaa 1800 tgattgttta aataatagaa actttgaaaa ttttattgca aatgatagac attgtgaaaa 1860 tgatcgttca aatgatagaa atgttgaaaa ttttattgca aacgatagat attgtgaaaa 1920 ttatgattca aattccggaa atgctgtttt aaataatgaa aattctgaag ataatggttt 1980 aaatgataga aaatctgaaa atgacgtggc taatgtcaaa aattatggaa ataagtgttc 2040 aattaatata aataatggaa atgagagcaa aacaagtagt ccttatgacg gattagaaat 2100 actatcaatt gaataccccg atccaataga ataacaaaag ttttgcgtaa cgataggtat 2160 gtcgtggagg acgttgaagg ttttcaacag tatagacttc catacaaggg agtttgggca 2220 gtagcaaata tgcgcccatg gataaacaaa agtaaaagtt aacatgtttg tagagatcag 2280 gagctctccg gtcaggatgg ccgat 2305 // ID MICRO-1_AAe repbase; DNA; INV; 240 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Microsatellite-type sequence: consensus. XX KW MSAT; Satellite; Simple Repeat; MICRO-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-240 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1446-1446 (2011). XX DR [1] (Consensus) XX CC 10-bp unit. XX SQ Sequence 240 BP; 48 A; 48 C; 48 G; 96 T; 0 other; gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc 60 gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc 120 gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc 180 gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc gttttaagcc 240 // ID hATw-1_BF repbase; DNA; INV; 5833 BP. XX AC NZ_ABEP01001525.1; XX DT 12-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Branchiostoma floridae. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5833 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Branchiostoma floridae."; RL Repbase Reports 9(2), 513-513 (2009). XX DR EMBL/GenBank/DDBJ; NZ_ABEP01001525.1; Positions 2413 8245. XX CC TSD is 7-bp long and TIR is 12-bp long. XX FH Key Location/Qualifiers FT CDS 1310..4780 FT /product="hATw-1_BF_1p" FT /translation="MSSDGGEESEEAAGVLAKLCITKQMLEGDHSHLPRVK FT PAEFTNGLVLELRKLEGTANSTVVSQLMHLAPTAVKEDIEKCNPKSLQNRV FT KDMFKKYQKHSKNDRSHLEDFLKAQFRLPVPRPPPAQDEQSASSSTMFKLQ FT QERKEKKACKRKIAQLEKEREDVELKYERLIQDYFSTVTQLQVNEENFRQT FT IQAKSNHVSELTAELASYNKANETLSAKLDSLQTQMSRCNTGKVRNLNKKV FT KRKDDKLKLQDELIKEKDQTIKELEAAASSSSEKLEDSTAHLQLSTIKKKV FT KSMQDSKRKLQQKLEKAQTDKQQLQADLEATVEVLHKKIKDLQKENTDLQQ FT IQALMEKDEVKFFEDGKYSDEIREVVMDLLTRGVSMNKVPSVITTVLSKLA FT GKTVDQLPSKALMSRLLVEADCLATLQVGEAIISGAAEVEKGNCLHQDGTS FT KFHKKYLTFDATLPSGKTLTMSMSEVPSGDAAGITEAFTERCRELAAALCD FT EGEDVAQKTAQIITSFTSTMSDRGATNPLFNSQLQALRSELLPAVHENWDA FT LADDVKGEMANVCNYFCKMHLLVNFATEANSTLKLFEDAVAEGSNPLAYTQ FT QGESGAARLIRTACTAFTEHGSEKSGAPHYFNTHISHNHGEDTNYMVTFRG FT NRFNILFYNAAAVFHHHKHITSFVKSWPDPNGLLKAVKADATQKVYLAGVR FT ALGIIDKTVTGPFFRLLGMEKSILGMNKHLHQMQLSLERWAKDASTLLGGE FT PLFNEEVVKRNKDVLFESLFAPSEDEELDMLTQQALEVVCAAILILLERQA FT EEQLPGGKYWQPSEAEQQKSQHVPTTNVVSERDFAVLDNLIRSKPNASSLS FT CEAYIMWLNNQTSTWLDNLDAEEKERHMAYARTHAASIHAKFLEKKQKIKE FT QRLEALLKKQKEKEDKKQKARQRKVALTEKVLKMGGVWKTAEEVDERVGSL FT NREQEKLEAVKNQLQFHKKVLNSEGDKELFQMTITRPGRGRHVFSSSEKIS FT HLKEVISKNVTHSTDINSESEEEDEEREYVTLKEPEARTVQGIHETLNKKR FT QAETEKREVNRQKLSLPDYLKDPSKLVGCKIRQKFNTKDQDGNKVQTWYTG FT TVTEMMDGAQDPKDTEYLIVYDDENEKDEVYNLLSDLEKGELIVLNSD*" XX SQ Sequence 5833 BP; 1877 A; 1174 C; 1351 G; 1431 T; 0 other; tagggtgttc agtcacgtga cctgacccca gcgaaaatag tgtaaacaac ccatcttgaa 60 agtttgaaag acgatatctc tgcaatcctg tgaccaaaat gaatgattta tataccaaat 120 aaaaggtaat tttgttggta aaatactctc aaaatagcag aaagattgat tcaaggaatc 180 ttgagttatc gtacgaagtt tgcagctacc tcgccgctcg acgctccggg cgcgcgcgga 240 cacgatcttc ggcccgaggc gattttttca tcggcttttt tcgggagatt tagcaactgt 300 tcgtcaccca aataaggtag tattcggcta tcatcggcta gtagacaccc tccgaagcac 360 ccacgagaag gtgcgagttc ttggaagcac tcggtggcaa aacgccaagg taggaaagtt 420 ttctttcgta aattttttgt agaatttgaa aagatatcca tatgcacctc ggacatataa 480 aaatagacat tgtaaggaac aatttgatat atagattgtg tgatttgatg agaaattacg 540 ggtgctaaat cgcagaaacg tgaccggact tgtcggggct tcggcgtggg ccgtggggtg 600 ggagggcggc cgagtctcgg ggcacgccac gctacgtaaa caagtcaatt tcctcccgtt 660 tctttgcatt attacttgtt cctgcccata tttaccgcca ttaacgtaaa atagaatcat 720 ttaggaacag attgaagtga tattttttgt ctttttggat attacgttag atctagtttc 780 gcttgggttt ctcgtaacgg tcctgtcaga atgctacatc atgtgacgaa aacaatgcgg 840 atttcccggg acatatcatt ctacattgac gtaaacgtaa atttatacac attcaggatt 900 tcttctacgt ttggatgatg tgttttttgg ggaaaatatt agcatttagt tgcatagaat 960 aagggatttg tggcgattca atgccgcaca cccaagactc ggtggggagg ggggggggga 1020 gctattcctt cttcagatca gaacacaagt gttactcagt tgatataaaa ttataacaac 1080 ataaattggg aacatttcat gtttcttcta ctgttttttt taatttgtaa catgtataga 1140 attaaatatt tttattgaat tttcatgagg aaatactgag tggctgagta aaagaaaaaa 1200 aaaaactcac cttttgtttt taaagtcaga acttttttta catgaaattt gtacttatat 1260 ttattataat tttaatcagt ctttccttct tcttatccag agcatcaaga tgtcctctga 1320 tggtggtgag gagtctgagg aagctgctgg ggtgctggca aagttgtgta tcaccaagca 1380 gatgttagaa ggtgatcaca gtcatttgcc cagagtgaaa cccgcagaat tcaccaatgg 1440 actcgtcttg gaactacgca agctcgaggg cacggcaaac agcacagtcg tctcacaact 1500 gatgcactta gctcctactg cagtaaaaga ggacattgaa aagtgcaacc ctaagtcgct 1560 tcaaaaccgt gtcaaggaca tgtttaagaa gtatcagaaa cacagcaaaa acgacagaag 1620 tcacctggaa gatttcctca aggctcagtt ccgtctcccc gtgccaagac caccaccagc 1680 acaagatgag caatctgcat ccagttcgac tatgttcaaa cttcagcagg agaggaaaga 1740 gaagaaggca tgtaaaagga aaattgctca actagagaag gaaagggaag atgttgagct 1800 gaagtacgag cgtctgatac aagactactt ctcaactgtg acacagcttc aggtgaacga 1860 agagaacttc agacaaacca tccaggccaa atcaaatcac gtgtctgagt tgactgcaga 1920 actggcaagt tacaacaagg cgaatgaaac attatcagca aaactggatt cactgcagac 1980 tcagatgtcc agatgcaaca caggaaaagt gcgaaatttg aacaaaaagg tgaagaggaa 2040 agatgacaag ttgaaactgc aggatgaact gatcaaagaa aaagatcaaa caatcaaaga 2100 gctagaagca gcagccagtt caagtagtga aaagttagaa gactccacag ctcatttgca 2160 actgtcaaca atcaagaaaa aagtaaagtc catgcaggac tcaaaacgaa agttgcaaca 2220 gaaactggag aaagcacaaa ctgacaagca gcaactacaa gctgacctag aggcaacagt 2280 agaggtgcta cacaagaaaa tcaaggacct gcaaaaggaa aatacagatc tacagcagat 2340 ccaagcactg atggagaaag atgaagtgaa attctttgaa gatggaaagt attccgatga 2400 aatcagggaa gttgttatgg atctattgac aagaggtgtg tccatgaaca aggtgcccag 2460 tgtaatcacc acagtgctat ctaaacttgc cgggaaaact gttgatcagc ttccaagcaa 2520 ggcactgatg agcagactcc ttgtggaggc agactgcctt gcgacactac aggtgggaga 2580 ggcgatcatc tccggggcag cagaggtaga gaagggcaac tgtctccatc aggatggtac 2640 ttctaaattc cacaaaaagt acctgacatt tgatgccacc ttaccttccg ggaaaaccct 2700 caccatgtca atgtcagaag ttccgagtgg tgatgcagca ggaatcacgg aagctttcac 2760 ggaaagatgt agagagttgg cggcggcgtt gtgtgacgaa ggtgaggacg ttgcacagaa 2820 gacagcgcag ataataacat cattcacatc caccatgtcc gacagaggag ccacaaatcc 2880 actattcaat tcccagcttc aagctctacg aagtgaactc ctcccagctg tacatgaaaa 2940 ctgggacgct ttggctgacg atgtcaaagg tgagatggcg aatgtgtgca actatttctg 3000 taaaatgcac ctcctggtga attttgcaac ggaggcaaac agcacactta agctgtttga 3060 agacgcagta gcagaaggat ccaacccctt agcctacacc caacaagggg agtccggcgc 3120 ggcaaggttg atacgtacag cttgcactgc tttcaccgaa cacggcagtg agaaatccgg 3180 cgcacctcat tatttcaaca cccacatttc acacaaccac ggagaagaca ccaactacat 3240 ggtcactttc agaggaaacc ggtttaatat ccttttctac aatgcggctg cagtcttcca 3300 tcatcacaaa catatcacca gttttgtcaa atcctggcct gaccccaacg gactcttgaa 3360 agctgtgaag gcggatgcaa cgcagaaagt gtacctggct ggtgtccggg cgttgggcat 3420 cattgacaag actgtcactg gtcctttctt cagactcctg gggatggaaa aaagcatact 3480 agggatgaac aaacacctgc atcaaatgca actgtcactg gagaggtggg ccaaggatgc 3540 cagcactctc cttggcgggg aacccctctt caatgaagaa gtagtgaaga ggaacaaaga 3600 tgtcttgttc gagtcgcttt ttgcgcccag tgaggatgag gagctggata tgctgaccca 3660 gcaagcactg gaagtggtgt gtgctgccat cctgatattg ctggagcgtc aggcagagga 3720 gcagctacct ggtggcaaat actggcagcc aagtgaagct gagcagcaga agtcgcagca 3780 tgtgccaacc acaaatgttg tgtctgaaag ggactttgct gtgctcgata acttaataag 3840 atccaagccc aatgcaagct ccttgtcgtg tgaggcatac atcatgtggc tgaacaatca 3900 gaccagcacg tggctcgaca atctggatgc tgaagaaaaa gaacgccaca tggcatacgc 3960 cagaacgcat gcagcgagta ttcatgcaaa gttcctggaa aaaaagcaaa aaatcaagga 4020 gcaacgtctt gaggcgttgc taaagaagca gaaagaaaaa gaagacaaga aacaaaaggc 4080 tcgacagagg aaggtggcac tgacagagaa ggttctcaaa atgggcggag tgtggaaaac 4140 tgccgaagag gtggacgaga gggtgggttc tctgaacaga gaacaagaaa agctggaggc 4200 agtgaagaac cagctccagt tccacaaaaa agtgctcaac tctgaggggg acaaggaact 4260 gttccagatg acaatcacca ggccgggaag aggaaggcat gtcttctcca gtagtgaaaa 4320 aatctcccat ctaaaagaag tcatctctaa aaacgtcaca cacagtacag acatcaacag 4380 cgagtctgaa gaagaggatg aagagaggga gtacgtgacg ctgaaagagc ctgaagcccg 4440 tacagtacag ggcatacatg aaacactgaa caaaaaacgt caagccgaaa cagaaaaaag 4500 agaagtgaat aggcagaagt tgtctttacc ggactacctc aaagacccgt ccaaactggt 4560 cggctgcaag atcaggcaga agttcaatac taaagatcaa gacggtaaca aggtccagac 4620 ttggtacaca gggacagtta ccgagatgat ggacggggca caagacccaa aggacactga 4680 gtacctcata gtatacgacg acgaaaatga gaaggatgag gtgtacaacc tcctgtctga 4740 cttagaaaag ggagagctta ttgtacttaa cagtgactaa atgaggatgt attagtatat 4800 atgtacagta actgatacaa tttgtgatat ggttgacctg ttgagacctt tattgtctcg 4860 ccttttttct ttattgttgg ctatactata ctacagtact cccttgaagc accttgggcc 4920 ttccaactta gtagacatag gtgcgatttc ccttgtgtgt tgagcattca aaagactttc 4980 tgttcgtgac atggctgtaa gtttttacga gtacagttta agttcatgca gttttataga 5040 actggtctgg tatgttagtg gacgaagaag tatctgtgtc acatgcttga gaaatctgaa 5100 agaaaggaaa agaaagatta gtatgtttta aaaggaaaag ttcaagtgca tagatttgta 5160 agtttgtaca ttagttaacc tgttacgagg aaatatcatt ctgtatgatc aatatgttgt 5220 cgtggaccaa acttgtgata cactgtttcg cagccgacta gtctaatgat gatgatgttg 5280 taatatatct atgtacatgg aaattgtatg ctgttttctc atttattgac ttgattaact 5340 taaggtatcg tattagtaaa ggtacaggca attttctgga gtcaaaacta gtcagaactg 5400 tccaagaaac aggaaaactg gtccccagca taaattatgc aaatgaccta attaattatg 5460 caaatcatac attaattatg caaattaatg tagatatgct tgatgttaac atgtatacca 5520 actttgaaga tcataggccc taccattctt aagatacgtt agcccaagta atgtcttaaa 5580 tttcttgtaa tttttaccat tgcaaacaat aggacagtgg tgccatcttc atactcattg 5640 accccatggt taaagccttt gtttttggtg acccaaatcc ctgatttcca agaaatgaaa 5700 tttttgtaac tccccaacca tacatcatag aaagatgaaa taaaaaccat tttaattgtc 5760 acaatttgca ctacaatttg atataagaat caatatccta gggcaaagta taatttttga 5820 cctgaacacc cta 5833 // ID Gypsy-163_AA-I repbase; DNA; INV; 4705 BP. XX AC AAGE02017816; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-163_AA_; KW Gypsy-163_AA-LTR; Gypsy-163_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4705 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017816; Positions 24356 19652. XX CC Positions [3555-4010] - Integrase core CC 'TGTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 321..4649 FT /product="Gypsy-163_AA-I_1p" FT /translation="MASGVDRLPQSLDCTNLSKEWPRWKQKFNIYMIANNK FT TGETERSKIATFLWLVGEHGVEIYNTLFPNNGDVESMFGGGAAGGDQPAGG FT GEENNQPAAVDPAAGAAQVRTLTQVINAFDAYCLPRKNLAVEAFKFNLIMQ FT KEKQPFAEFETALRTQLAYCEFECEACHTSYADRMLRDRIIIGIQDKKLQL FT KLLDGKNEPLANILEMCKVYEAAAENKQLLERKEVHNVIDKSAKANGEIAA FT LKTLTCYNCGQPFNGRHRRYCPAIDVVCDGCGRRGHFKKYCRTTKHDNKSG FT SQYSSNGGARSSAGSKHDVKQAAVNTNVHMVNWADAGNFFVADSNNNHSKC FT RFPGRSISNYRIDSNATSNGEAGGKWTKRYLIQDWPVNFKLDTGADVNCIP FT LSVITRMNIPVLNKRSFNVVDYSSNEIKIHGLVTLKCVDEGKGLTHTADFL FT VVDDSFEPLIGRESCVAFGLIERLDSLECLVSFPSEREKFIESNVDVFEGL FT GKLPGYCTIVLKDNAVPSLHYKKRIPMSLHDRLKTELENMVAHGIISPVDY FT PTDWVNNMQIVEKPNGSLRICLDPKPLNACIKREHFLIPKSEDLLSRLSGK FT RVFTVLDLRNGFWQMELDRQSSDLTTFMTPFGRFRWNRLPFGISSAPELFQ FT KRMIQLFGDIPGVEVYFDDVAISGENFEEHDKTLAIVLERARKNNVKFNSV FT KTQYRSDEVKFMGHIISEGQIRPDQSYISAILDMPKPKTKSDVLRLLGLLK FT YIGKFIPNLSQRTAALRELSRNDVEWSWTDVQERELKDLLGSITTTPVLAI FT FDPSKSVTVQTDSSKDGLGSVLMQDGKAVAYASRTLSNSEKKWAQIEKELL FT AIVFACERFHYFLYGREFVVQSDHKPLESLVKRDIDDVTPRLQRMFLHLLK FT YPGMNIKYTPGKDMLIADCLSRAALPGSKDLPEELSGMIHAITRSVCLSPD FT NYKLYVETLNNDERYRRIIGYVDHGWPSYHQLDDLGQLFFKHRHLLHFENG FT LLFKEHRLVIPTSLQSTISTWLHAPHLGVEKTLARARAQFFWPGMSNDITE FT LVKECNVCEKFTRNIQKEPLHQDSPPCYPFQRVGIDLYEYAGHDHVVILDA FT YSGYVISRQLNEKSARHVIDVLDRVFCDYGYPTQIRCDNVPFNSSAFDSYA FT NECNIEFVFSSPRYPQSNGLAEKGVAIAKNILKRCYEAGNVKSFRYRLFEY FT NTTPVASMGLTPAQLFFGRQLKTRLPISSNLLLRNNLKENHVAEKIKRKRR FT YQKQYYDRSSKPLPVLNVGDKVIFKKNGKEWHYGQIVRIVNSRSYIVRDGS FT GNHFRRNRRFIATTQNQSPNDNELFFEKHIDRHNRPDRSVVRPPHFDHSNS FT DQQLPVRLELQDGGRSELQSPDTSLEVDPDSFYTASDSESPIDVDLTNSMS FT QSDSDDSSINVPYKTRSGRVVRLPIRFDI" XX SQ Sequence 4705 BP; 1375 A; 907 C; 1094 G; 1329 T; 0 other; tggtgtcaga agtgaaagga tattcgtgaa aacggactga aaaccagttc gagtgtcgtg 60 aaaattctac atgagtagcg aaaattaaac catttagttg tcgccattgg cgtctgaaag 120 aaaccgcaag gaaaaccggt tcaatttgtg cagtcgtgaa tcgtattcaa tcattgatgt 180 gtatgtgtaa gctcggttcg cagaaaattc tgtgagcagt gagagaccga atagtgggca 240 gtgagagacc gaaagtaaca gcacccttgc actcacactg tgaagattga actgtcaaac 300 cagaattgaa atttgtaaac atggcaagtg gtgtggatcg actaccgcaa tccttggact 360 gcacaaattt atcgaaagag tggccaagat ggaagcaaaa gttcaacatt tacatgattg 420 cgaacaacaa aactggcgaa acagagagaa gtaaaattgc aacgttttta tggctagttg 480 gtgagcatgg tgtggaaatt tataacactt tgtttccgaa taatggtgac gtcgaaagta 540 tgttcggtgg aggagcggcc ggcggcgatc agccagctgg aggaggcgag gaaaacaatc 600 aaccagccgc cgtggatcct gctgcgggag cagcacaagt acgcacactt acgcaggtga 660 tcaatgcgtt tgatgcgtac tgtcttcccc ggaaaaatct ggcagttgag gcgttcaagt 720 ttaatttgat catgcagaaa gaaaagcaac cgtttgctga attcgaaaca gctcttcgca 780 cccagctggc atattgtgag tttgaatgtg aagcttgcca tacttcgtat gcagaccgga 840 tgttgcgtga ccggattatc atcggaattc aggacaagaa gctacaactc aaactgttgg 900 atggaaagaa tgagccgctc gcaaacattc ttgaaatgtg caaagtttac gaagccgctg 960 ccgaaaataa acaactgttg gaacgcaagg aggtgcataa cgtaatcgat aaatctgcaa 1020 aggcgaatgg tgagatagca gctttgaaga cattgacgtg ctacaattgt ggtcagccgt 1080 ttaatggacg tcatcggcgt tattgtccag ctatcgatgt agtttgtgac ggatgcggac 1140 ggcgtggcca cttcaaaaag tactgcagaa caaccaaaca cgacaacaag agcggtagcc 1200 agtacagcag taacggagga gcgcgttcgt cagcgggatc taagcacgac gtgaagcagg 1260 cagcagtcaa cactaatgta catatggtaa actgggctga tgcaggtaat ttttttgtag 1320 cggattcgaa taacaatcat agtaaatgta ggtttcctgg tagatccatc tctaactata 1380 gaatagattc gaatgcaacg tcaaatgggg aggcaggtgg aaaatggacc aagcgatatc 1440 taatccagga ctggccggtt aatttcaagc tcgataccgg cgctgacgtt aattgcatac 1500 cactgagtgt tattacccgt atgaacatcc ctgttttaaa taaacgatct ttcaatgttg 1560 ttgactacag ttctaatgaa attaaaatac atggtttggt tacgctaaag tgtgtggatg 1620 aaggtaaggg tttaactcat acggccgatt tcctcgttgt cgatgattcg tttgagcctt 1680 taattggacg tgaatcgtgt gttgcctttg gtcttattga gcgtttggac agtttagaat 1740 gcttggtttc cttcccatcc gaaagagaaa agttcattga atcaaatgta gatgtatttg 1800 agggcttggg taagctgcca ggttattgta ctattgttct taaagacaac gctgttccat 1860 cccttcatta caagaaacgt attccgatga gtctgcatga tcgtcttaaa acggaacttg 1920 aaaatatggt agcgcacggt attatcagtc ctgtggatta ccctacagat tgggtgaaca 1980 atatgcagat agtcgaaaaa ccaaacggtt ctttgcgtat ttgtttagat cctaaacctt 2040 tgaacgcgtg tataaagaga gagcattttc ttattccaaa atctgaagat ctcctcagtc 2100 gtttgtcggg aaagcgcgtt ttcactgtgc tggatctacg taatggcttc tggcaaatgg 2160 agttggatcg ccaaagctcg gacttaacta cattcatgac cccattcggc cggtttcgct 2220 ggaaccgtct tccatttgga ataagtagcg caccagagct atttcaaaaa agaatgatac 2280 aattatttgg agatatccca ggagttgagg tgtattttga tgatgttgcg atttcaggag 2340 aaaattttga agaacatgac aaaactttag ctattgttct tgagcgagca cgtaagaata 2400 acgtaaagtt caattctgtc aaaacgcaat acagatctga tgaggtcaag ttcatgggtc 2460 atattatctc cgaggggcaa atacgacctg atcaatcgta tatatctgcg atccttgata 2520 tgcccaaacc caaaactaaa tcggatgttc ttcggctgct tggattacta aaatatattg 2580 gaaagtttat tcctaatttg tcccagcgca ctgctgctct cagagagctg tctcgaaatg 2640 atgtcgaatg gtcgtggaca gacgttcagg aacgagaact caaagatcta ttaggctcta 2700 taactacaac acctgtgtta gccatatttg atccaagtaa gtctgttact gttcaaactg 2760 atagttcaaa ggatggtcta ggaagtgtac taatgcaaga tggaaaggcg gtggcttatg 2820 cttctcgaac actgtcgaat agtgagaaga agtgggctca gattgaaaag gagcttcttg 2880 ctattgtgtt cgcttgtgaa cgattccatt atttcctcta tggtcgagag ttcgtagtac 2940 aatccgatca taagcccttg gagtcattgg taaaacgtga tatcgatgac gttacgcctc 3000 gtcttcaaag gatgttttta catttgttga aatacccggg aatgaacata aaatacacac 3060 ctggcaagga tatgttgatt gctgattgcc tgtctcgtgc agcacttcca ggtagcaagg 3120 atttgcctga ggaactttcc ggtatgattc atgccataac tcgtagtgta tgcttatccc 3180 ctgataatta caaactctat gtagaaacct tgaacaacga tgaacgttat cgtcgcatta 3240 ttggctacgt ggatcatggc tggccatctt atcatcagct ggatgatctt ggtcagcttt 3300 tctttaaaca tcggcattta ctgcatttcg aaaacggatt gctgtttaaa gaacatcgtc 3360 tggttattcc aacatcactg cagtcgacaa tatctacgtg gctacatgcg ccacatttgg 3420 gtgtggaaaa gactcttgca cgcgcaagag ctcagttctt ttggcctgga atgtctaacg 3480 atatcactga actagtcaaa gaatgtaatg tttgtgaaaa gtttactcgt aatattcaaa 3540 aggaacctct ccaccaggac tctcctccct gctacccttt tcaacgtgtt ggtattgatc 3600 tgtatgagta tgccgggcat gatcatgtgg taatcctaga tgcttattcg ggatatgtta 3660 tatccagaca gttgaatgag aagtccgccc gccatgtgat tgacgttctt gatcgagtgt 3720 tttgtgacta tggctatcct acacagatta ggtgtgataa cgtacctttt aactcgtctg 3780 catttgattc atatgctaat gaatgcaaca tagaatttgt gttttctagt cctcgatacc 3840 cacagagcaa tggtttggca gaaaagggtg tggctattgc taagaacatt cttaaaaggt 3900 gttatgaagc tggcaacgtg aaatcttttc gttatcgatt gtttgagtac aatactactc 3960 ctgtagcgag tatgggcctc actccagctc aactattctt tgggcgccaa ctcaagacgc 4020 ggctaccaat atccagtaac ttgcttttgc gaaataattt gaaagaaaac catgttgcgg 4080 aaaagataaa acgtaaacgc cggtatcaga agcagtacta tgatcgttct tctaaaccgt 4140 tgccagtcct gaatgttggt gataaagtta tttttaaaaa gaacggtaaa gagtggcact 4200 acggtcaaat cgtgcgtatt gttaacagtc gatcatacat cgttagggac ggttctggga 4260 atcatttccg acgaaatagg cggtttattg caacaacaca aaatcaaagt ccaaatgaca 4320 atgagctgtt ttttgaaaaa cacattgatc gacacaatcg tcctgatcga tcagttgtta 4380 ggccaccaca ttttgaccat tccaattcag atcagcagct tcctgttaga cttgaactac 4440 aggatggtgg tagatcagaa ctacagtcac ctgatacatc gctggaagtt gatcccgata 4500 gcttctatac agcaagtgat tcagagtccc ccattgacgt tgacttaacg aatagtatgt 4560 ctcaatctga ttcggatgat tcttctataa atgtacctta taagacgcgt agtggacgtg 4620 ttgtgcgcct tccgatcaga tttgatattt agagatagat taggataagt ttttttttta 4680 tcatcaaaag aaaaaagaga aagcg 4705 // ID Zator-3_HM repbase; DNA; INV; 4338 BP. XX AC . XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Hydra magnipapillata. XX KW Zator; DNA transposon; Transposable Element; Zator-3_HM. XX NM Zator-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4338 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 669..3023 FT /product="Zator-3_HM_1p" FT /translation="MNNSEKYKIIYQQVAKALPQKSKQEQQKEANKLWYKV FT KHKELTYDSVLKELTRKTMESKSQLMSFWSNVKTSNNQSIAVKLPLTNEIS FT DEGPEALLTSDEADKIDFEINKTLATSSKFDIQKSQKSQTEVPIDQPTKSH FT TPQQEKIQTEISQLKEQLLTLVNVRRAGLITDELKIKSIFLEKQIKIKQNR FT LKKMKREAIRQRQRRASLKRTLNDIVDDTPYLKSKLSKFNREKNGRPAVIE FT DQPDLLNAIVNIASIGAAASDRRRSQELRSCKTLDDLNKSLLKLGYCLSRS FT ATYLRLMPKRSNTLEGIRHIKTVPVKLIRTANTARDRHIDAGFTFACMQYM FT DEVCSIFGPDSCFYLSLDDKAKVPLGLAAATKQSPILMHLDYQVRLPDHDF FT LVASRHKLIPSVYAACIIDPNKFADAVSYSGPTYIAIRSMKHDSSTAFTHG FT RDLERLVSIENFKLHICTPEGQVKPIFVVACDGGPDENPRFPKPLQVAIAR FT FIKWDLDVYLTGTHAPHYSAYNRVERRMAPLSRELAGLVLPHDHFGTHLNA FT SGKTIDVDLEKKXFAKAAETLAEVWSDIVIDNYSVCAEYVEPSQLSPIPEQ FT DSKWIENHVRQSQYFLMIVKCDVPNCCKPWRSCWKKYFPQRFLPGPVVMSH FT AAEGIFIPEPKVARNIPKPYFSSLAQRLGYIGSKISMEASFDLYCPSISQQ FT KIDQRTCKSCKIYHASKASVKRHKCVFRNIINTTSEENKNDDENNDTHLTV FT NESNSDQIHVIENIFDWVKSPFDLDHQQFKLID*" XX SQ Sequence 4338 BP; 1539 A; 741 C; 665 G; 1392 T; 1 other; ggggtcatcc ataaagtacg tacgccaaaa atttcaaaaa attgaccccc ctccccccag 60 ttgccatgcg tactttttag tcctctcccc cctttgaaaa gtacgtacgc ttttgaagta 120 cccccccccc ccatcttctc ttttctcttt atgagttaaa aagttaaaat ttataaaaaa 180 aaaacattat ttaagaaatt taaatattag ggatcgtcaa taaagtacgc cgggcatgag 240 taaaaataag aagtcatcaa aaagtggtga tattctttat atttcaaaac gttccatttc 300 attaaatcaa ttgctcaata cttttcaaaa acaatatgaa gtataaaaag aatttcggaa 360 tttcttcata tttcaatttt taattaatta gcgcgacaac agtcttacta agggtaacaa 420 gtttaaaagc taaagtttta aagtattgtt aaactttaaa tattaatttt tgaaattgtt 480 atgtaaatag tattttttat aattttttaa actttatttt gtaaaaaata ttttataata 540 aaaataagat actataaatt atttaaaaac taaacaaaca acccccgttt gcatttattg 600 tagcttgtaa atttaagttg tgctttaaat agtcagttaa aaattagttt gaaggcggaa 660 ttattaaaat gaataatagt gaaaagtaca aaattattta ccaacaagta gcaaaagctt 720 tgccacaaaa aagtaaacag gagcaacaga aagaagctaa taaattgtgg tataaagtta 780 agcataaaga attgacgtat gattcagttt taaaagaact cacacgaaaa acaatggaaa 840 gtaaaagtca gttgatgtct ttttggtcta atgttaaaac atcaaataat caatctattg 900 cagtgaagct tcctctcaca aacgaaatat cagatgaggg accagaagct ttgttaacat 960 ccgatgaagc agacaaaata gattttgaaa tcaataaaac actagctact tcttctaaat 1020 ttgatattca aaaaagtcaa aaatcacaaa cggaagtacc aattgatcag ccgacaaaat 1080 ctcatactcc acaacaagag aaaattcaaa ctgaaatttc gcagttaaaa gagcagctgt 1140 taacattggt aaatgtgcgc cgtgcaggat tgataactga cgaacttaaa ataaaatcga 1200 tttttttaga aaagcaaata aaaataaaac aaaaccgtct taaaaaaatg aaacgcgagg 1260 caatccgtca aagacaacgc agagcaagtt tgaaacgaac acttaatgat attgtcgatg 1320 acacgcccta cttaaaatca aaactttcta aattcaatag agaaaaaaat ggtagaccag 1380 ctgtaattga agatcagcca gatcttctta atgccatcgt taacatcgca tcaataggag 1440 ccgctgctag cgatcgacga agatcgcagg agttacgttc atgtaagaca cttgatgact 1500 taaataaaag cctactaaaa ctgggttatt gtttatcgcg atctgcaact tatttaagac 1560 tgatgcctaa gcgatcaaac acacttgagg gaatacgtca cattaagaca gttcctgtga 1620 aattgatcag gactgctaat acagctcgtg ataggcatat cgatgctgga tttacatttg 1680 cgtgcatgca atacatggat gaagtttgtt caatatttgg tccagattca tgcttttatt 1740 taagtctaga cgacaaagct aaagtaccat taggattagc agctgcaaca aaacaatcac 1800 caattttgat gcatcttgat tatcaagtac gtttgccaga tcatgatttt ttagtcgctt 1860 ccagacataa gcttataccg tctgtttatg cagcttgtat tattgatcca aacaaatttg 1920 ctgacgcagt gtcatattct ggcccaacgt atattgctat tagatctatg aagcatgata 1980 gcagcactgc ttttactcat ggccgtgatt tggagcgtct tgtttcaatt gaaaatttta 2040 agcttcatat ttgtactcca gaaggtcagg taaaacctat atttgttgtt gcatgtgatg 2100 gaggccctga tgaaaatcca aggtttccca aaccattgca agttgcaatt gcaagattta 2160 taaaatggga tttggatgtc tatctaacag gaactcatgc acctcattat tcagcatata 2220 acagggttga acggcgtatg gctccgctga gtcgagagtt agcaggattg gttttgccac 2280 atgaccattt tgggacacat ctaaatgcat ctggaaaaac aattgacgtt gatttagaaa 2340 aaaaaawttt tgctaaagct gcagaaactc tagctgaggt atggtcggat atagtaattg 2400 ataactattc tgtttgtgct gaatatgttg aaccttccca actgtcacca ataccagagc 2460 aagattcaaa atggattgaa aatcacgtaa gacaaagtca atatttttta atgatcgtta 2520 agtgtgatgt tccgaattgc tgtaaacctt ggcgaagttg ctggaaaaaa tattttcctc 2580 aacgtttttt accaggtcca gttgtaatgt ctcatgctgc tgaaggaatt tttattcctg 2640 agccaaaggt agcacgaaat attcctaaac cttacttttc ttctctagcg cagcgcttgg 2700 gatatatagg ttcaaaaatt tcgatggaag cttcctttga tttgtactgc ccatcaattt 2760 cacaacaaaa aattgatcaa agaacttgca aatcctgcaa aatttatcac gcttctaaag 2820 caagtgttaa acgtcacaaa tgcgtgttta gaaacataat caacacgacc tctgaggaaa 2880 ataaaaatga cgatgaaaac aatgatactc atttaacagt aaatgaaagt aattctgatc 2940 aaattcatgt catcgaaaac atatttgatt gggtgaaatc accctttgat ttagatcacc 3000 aacaatttaa attaattgat taatgaagac tatattaata attttgttta taaaatatta 3060 aaacgatgtt ttgaaaccta taaagataac agtttttgtt ttaaattgtg attacttaaa 3120 aaaattttat caccctgccc ttaaccacaa cctaatcttt aaaggtacca aacaaacatt 3180 ctcgccagtt tgttgatttt aaaggtttcc tgtttgccag caaaagcgct taaagatcaa 3240 ctacacaaat ctttcaacaa gacttttttg ttttattttt tgttaatgat cttttaaacg 3300 attgaaaagt aatacacttc actgtcacct tgatgttcag agttaaaaat tttcaaacac 3360 taaattgatt gcttgcattt ttttagctga aatacttttc aaaaatttat aatttccagt 3420 tttctcaagc ttcttagttt attatattta tcaaaaaaat taccctatat aagatgatgc 3480 aattttgtta cgtgacattt tttataatca agaaataata actatgttct catacacaaa 3540 gaatttttgt atttatataa gattgagcac aaaagtgttt tcctcgcggt agttattctc 3600 ctactttaat tgcgatacaa caacataggt ggagggtatg taaaaaactg tgacattaaa 3660 attttgaaag aaagtttcca ggctggagat aaaaacaatt gttttgacca tcgcaagcaa 3720 atcattaaaa ctttctctat tcctaaataa ctaataaacg taaccaagta tgtgagttgt 3780 tacatttata aaacacatca aacgataact atgaggtgcc aaataaacgt gcctgcccgt 3840 ttatttgggt taaatgttgt ggattattat actaaaaacc cacaacattt ttaccaagcc 3900 cattaaaact tattacgttt ttattgttaa atttaaactt cttttactaa ttttaaggtt 3960 cctttagtaa aaagaacttt aaagtttcca aaaaaaaatt gtaattagat cttttttaag 4020 ggcaagtata cttccctccc tccccaactg acatttcaaa gtcacggtca tggttttatt 4080 atttaaaaaa aattatttat ttattattat attcaaaaat atctagttat ttattaaaaa 4140 acatctagtt atagttaagt atttgttagt ggtgataagt tagaaataca agcattaact 4200 aagaaaaatt acttttttta attagtgcgt acgtactttt atattcaacc cctccccccc 4260 ccctccccat acgcttttgt acgctttttg aagacccccc ctccccccta tgagcgtacg 4320 tactttatgg acgacccc 4338 // ID Gypsy-23_AA-I repbase; DNA; INV; 4344 BP. XX AC supercont1.118; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_AA_; KW Gypsy-23_AA-LTR; Gypsy-23_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4344 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.118; Positions 63330 67673. XX CC Positions [1949-2410] - Reverse transcriptase CC Positions [3459-3761] - Integrase core CC 'TATCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 341..2035 FT /product="Gypsy-23_AA-I_1p" FT /translation="MEGWNISPFKFNHLPETQTRIEWMRWKRNFEVIVAAS FT DEKNSTKIKNILLAKGGLELQDLFYSIEGADVVEDIEKGVDPYHVAIAKLD FT GHFTPKQHDSFDRNEFCKLSLSITNEGKRETLGKFLMRCADQARRCNFGKT FT ETESRELRIIDKVIYHAPAELREKLLQKEKLNLAQLTRIVNSFESIKLQWK FT AIENTRFGDSSTLTVQDPIARINKLNTTFKHPGRTCFRCGQQSHHGNDREC FT PARGRKCEKCHKIGHYAKVCRSGTSFKRSYEEPIMSHPEKRRKFGNVRAIV FT ADENLEQDEPESFIFNIGDGDEYLWTKIGGVLIQVLIGSGRSKNIIDDKTW FT QYMKQHGVRSCPSNVSNMDLRGYGPEAKPLEIGHAFEANIEVENTNSKHGM FT NALFYVIKGGQQSLLGKETAKRLGVLKIGLPNQINSLTVLEKRSFPKMKNV FT QVKIPINQNVTPVAQQVRRPPIALLSKIEDKLDHLLAMDIIEPVSGPAEWV FT SPLVTIVKDNGDLRLCVDMRRANQAIQRERHMMHTFEDFLPRFKLARFFSR FT LDIKDAFHQVISGRNNPT" XX SQ Sequence 4344 BP; 1536 A; 738 C; 926 G; 1144 T; 0 other; tctggcgggc gagaataaat tgtcaagact ttcaagtcta ttcccatgaa tcacgaataa 60 gaaatcaacc aaaactaaga ataaacgaat aaaacggtaa agcaattacg aagggtccga 120 atccaaatcg cgtggaaaaa ccaaaactaa aagaaataac gattgaaaca agtaaggatt 180 tgccattact aattggctag gtaacataag taagactgtt acgacaggac acactgctcc 240 gacaggacac ttcgctgttg ttgcgaaaaa aaaacaacct atatgtggta ccatgacaac 300 catctatagt gtaattttga aattttcagc ttgagaaaga atggaaggat ggaatattag 360 cccgttcaag tttaatcatt tacccgagac gcaaacccgg atagaatgga tgcgttggaa 420 acggaatttt gaagtcattg tggcggccag cgatgagaag aattcgacta aaatcaagaa 480 cattctactt gcgaaaggag gtttggagct tcaagactta ttttattcaa ttgagggagc 540 tgatgtcgtc gaagatattg aaaaaggagt tgatccgtac catgtggcta tagcaaagct 600 tgatggccac ttcacaccaa aacagcatga ttcttttgat cgaaacgaat tttgcaaact 660 ttcactatcg ataaccaatg aagggaaacg cgagacgtta gggaagtttt tgatgcgttg 720 tgcagatcag gcgagaagat gcaactttgg aaagaccgaa accgaaagtc gtgaattgcg 780 catcatagac aaggtgattt accatgctcc tgccgaatta cgagagaagt tattgcaaaa 840 agagaaactg aatttggcac agttaacacg tattgtgaat tccttcgagt caatcaaact 900 tcagtggaaa gctatagaaa acactagatt tggagatagc tcaacgttaa cagttcaaga 960 tcccattgcg cgtattaaca agctaaatac tactttcaaa catcccggta ggacgtgttt 1020 ccgttgtggt cagcaaagtc atcatggcaa cgatcgtgaa tgccctgcac gaggaagaaa 1080 atgtgaaaag tgccacaaaa tcggacatta tgctaaagta tgtcgatcag gaacatcatt 1140 caagcggagt tatgaagaac caattatgag tcatccagaa aaacgaagga aatttggaaa 1200 cgtccgtgca attgtggcag acgagaatct tgaacaagac gaaccagaaa gctttatttt 1260 taatatcgga gatggagacg aatatctgtg gacaaaaatt ggaggagtat taatacaagt 1320 cctaattggt tcaggacgtt ccaagaatat tatcgatgat aaaacgtggc aatatatgaa 1380 gcagcatggt gttaggagtt gtccatctaa cgtctccaat atggatttga gaggatatgg 1440 tcctgaagca aaaccattag agattggtca cgcatttgag gcaaacatag aagtggagaa 1500 tacgaacagc aagcatggaa tgaacgcatt gttctatgtc atcaaaggtg gacaacaatc 1560 gctactcgga aaagaaactg ctaagcgcct tggagtattg aaaattggat tgccgaatca 1620 aatcaactcg ctgacagtct tggagaaacg ctcattccca aaaatgaaaa atgttcaagt 1680 aaaaatccct attaatcaaa atgtgacacc agttgctcaa caagttcgtc gacctccaat 1740 agcattactt agcaaaattg aagacaaact ggatcatctt ttggctatgg acatcatcga 1800 gccagtatcc ggtccagcag aatgggtatc accacttgtt acgattgtga aggataacgg 1860 tgatttacgt ctctgtgtgg acatgcgacg tgcaaatcaa gcgattcaga gagaacgtca 1920 tatgatgcac acttttgaag attttttgcc tcgtttcaag ttagcacgtt tcttcagccg 1980 tttagacatt aaggatgctt ttcatcaggt tatttctgga agaaataatc ctacgtaata 2040 tttttttcaa ggatactaat attgttgatt gttattttac ttcgatttga ttacattaga 2100 ttgaattaga agaatcgtct agatacataa caacatttat atgtcataag ggcttattca 2160 gatataaaag attaatgttt ggtatttcat gcgccccgga aatgttccaa aaagttatgg 2220 agcatatttt agcggaatgc gaaaatgttg tgaatttcat agacgatatt atggtttttg 2280 gtgaaacaga agaagagcat aacgaagcgt taaacaaagt ccttaaagta cttaaaattc 2340 gcaatatttt actgaaccaa gaaaaatgta ttttcaaagt tagccaagtt gaatttcttg 2400 ggcatgcaat gtcacctgaa ggaatccgcc caacagttag caaagtggaa gcgatacaaa 2460 aatttcgaga accaagaaca tctgaagagg ttataagttt tcttggcctt gttacatgtg 2520 taggaaaatt cattcctgat ttagctacaa tcacggaacc acttcgacag ttaatatgta 2580 aggacaacaa gttttgttgg ttacagaaac acaaagaaag ctttgaacat ttgaaaacac 2640 tcatagccaa tgtaaagacc ttgtcgtttt tttataattc attaagcact agagtaattg 2700 cggatgcttc tcctgtagct ttaggtgcag tacttgtaca atttgatgga gaatcaaact 2760 ctgatcctcg tatcatcagt tatgcgagta aaagcttgac agtaacagag aaaaggtatt 2820 gccaaacaga aaaagaggct ttagcgctag tatggtccgt ggaaaaattt tcaacttact 2880 taattggtcg agagtttgag ttggaaaccg accacaagcc attagaagtt atatttgctc 2940 ccacatctaa accctgtgcc agaattgaac ggtgggtact ccgacttcag tcattcaagt 3000 tcacggtaaa atatcgaaag gatcaggtaa tatagcggat tcgttgtcac gcctcgtatt 3060 tgaagataat tcagctgaat ttgaatccga taatcatttc tttgtacttg ctgttcaaga 3120 atctgttgcg attgatgtga gcgaaattga aaaagtgtcc aaagtggacc cagagctgaa 3180 ggcggtaagg gactgtttgg aatcaggaaa ttggaataat tcaacagcaa aattattcga 3240 accattcaaa aatgagctgg gcatgataga ggacactata gtacgaggaa ataaactagt 3300 tgttccacaa ggattacggt caagaatgct tcagcttgca catgaaggtc atccaggaga 3360 aagaatttgt tggtaatagt tgattatttt agtagatata aggaggtgga agttatgtca 3420 aagatcggtt cacgtgagac agtcaataga cttgataaaa tatttacgcg attaggatat 3480 ccaagaacga ttaccctgat aatgcgaaac aatttattgg aaaggatttt gaggaatatt 3540 gtaatataca tggcattcat tttaatcact cagcgcctta ctggccacaa gaaaatggat 3600 tagtagaaag acagaaccgg tcacttctga aaaggttgca gattaataat gcattgaaga 3660 gagactggaa gaaagatttg aacgattacc tcctaatgta ttatacgact ccacattcaa 3720 ctaccggaaa aactcctaaa gaattgtgct atggacgcac aataagatca aagctgccct 3780 ccattgacga catcgagact attccggaat cgtcggatta tcgcgatcaa gataaactac 3840 ataaacaaaa ggagaaagaa agagaggata ggagaaggca tgtccaagaa tcggacatct 3900 gtagcggaga tatggtgttg atgcaaaacc ttctaccagc aaacaaactg gcaactacat 3960 ttggtaaaac gaagtataag gtgttggaac gtagaggtca tcgagctata gtagaaaacc 4020 aagacacggg tgtaaaatac gaacgaaaca ttgcacactt gaagaaaatt aacattccac 4080 aaacgtcgac ttctccgatg cctagtcata caacaagaac ggaaactagc caaaagataa 4140 ttgaagcacc agcaattgaa gatgcagtag caacaaggtt tcaagaagat gagccaccct 4200 ttcgaggatt tgaataagta gaagaagaat ttattggaat tggaaaaaaa taacttataa 4260 caatataata aatttatgcc aaataatact tgagaattta gagaacttat gagatttgaa 4320 ctttaaatta tatgaaaaag gaga 4344 // ID ITmD37D_Ele1 repbase; DNA; INV; 1305 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37D DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37D_Ele1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1305 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1305 RA Kojima K.K. and Jurka J.; RT "ITmD37D-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >99% identical to consensus. This consensus CC is ~100% identical to the original sequence in [1], but is CC complementary. TA TSDs. 26-bp TIRs. XX FH Key Location/Qualifiers FT CDS 55..1221 FT /product="ITmD37D_Ele1_1p" FT /note="transposase." FT /translation="MYIFCIGIGVSFSFIHTLPNVRGVDILLVVVRLLRAM FT KEYRDFVIKRFLNGERPGDIFRLLKSHGVKRNFVYTTIRRYRETSSTNDRA FT RSGRPRSARTPRVIKIVRERIRRKKNRSIRKTAADLNVSIGTAHTILIKDL FT GFRPYKKRKVHGVSEATSKKRLDRAKRILSRHAGQEFVFSDEKLFVLQQPH FT NVQNDRVWAPSRDSIPESNINIPRFQSAASVMVWGAVCKRGKLPLVFIEKN FT VKINAAYYKTEVLEKVVAPSLRSLYGDEHYVFQQDGAPAHTANVVQAWCRD FT NLTDFLDKTLWPPSSPDLNPLDFFVWSYMMAKLNEYKVSTLDHFKTVILKI FT WDEMPMQSVRAACDAFEKRLKLVKEYKGGGHSKRNVVNVPCKHSFQ" XX SQ Sequence 1305 BP; 341 A; 300 C; 333 G; 331 T; 0 other; cacggtgttc aataagttcg aatacaagtt ttcatcattg cgtaggtatg cgccatgtac 60 atattctgca ttggtattgg tgtcagcttt agcttcattc atacgctacc gaatgtgcgc 120 ggtgttgaca ttctgttagt tgttgttcgt ttgttacgcg cgatgaaaga gtatcgggac 180 ttcgtaatta agcgtttttt gaacggtgag cgacccggcg atatattccg gctgctgaaa 240 tcgcatgggg tcaagcggaa ctttgtctac acgaccatca ggcgataccg ggagacgtcc 300 tcgaccaatg accgtgcgag atccggtcgg ccgcgttcag cgaggacgcc acgggtcatc 360 aagatcgtga gggagcgaat tcggcgcaaa aagaaccgct caatccggaa aacggctgca 420 gatctcaacg tttccattgg aaccgctcac accatactca tcaaggacct tggtttcagg 480 ccttacaaaa aacgtaaggt ccatggcgtt tcggaggcta ccagcaaaaa gcggttggat 540 cgagctaaga ggatcctctc tcggcacgct ggtcaggagt ttgttttttc ggacgagaaa 600 ctgttcgtgc tgcagcagcc gcacaatgtg caaaatgacc gggtgtgggc gccatcgagg 660 gacagcattc ctgaatccaa tataaacatc cctcggttcc aaagtgccgc gtcggtgatg 720 gtttgggggg cagtatgcaa acgtggtaag ctacccttgg tgtttattga aaaaaacgtc 780 aaaatcaacg cggcgtacta caaaactgag gttttggaaa aggttgttgc ccccagtctc 840 cgaagcctct acggcgatga gcactacgtg ttccagcagg acggtgcacc agcccatacg 900 gcaaatgtgg ttcaagcctg gtgtcgggac aatttaaccg actttctgga caaaactttg 960 tggcctccca gctccccgga cttgaatcct ctcgactttt ttgtttggtc ctatatgatg 1020 gcgaagctga acgaatacaa ggtcagcact ttggaccatt tcaagacggt aattctcaaa 1080 atctgggacg aaatgcccat gcagtccgtg cgtgccgctt gcgacgcgtt cgagaaacgt 1140 ttgaagctcg ttaaggagta caaagggggg ggtcattcca agagaaatgt tgtaaacgtt 1200 ccttgtaaac atagctttca ataacataaa tccaaaaaat aaaaaaacat gttttcattt 1260 ttttaacaaa ttttgaaagt gtatccgaac ttattgaaca ccgtg 1305 // ID hATx-21_SM repbase; DNA; INV; 2689 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-21_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2689 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1856-1856 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 421..2430 FT /product="hATx-21_SM_1p" FT /translation="MENMKFVQHSKGGDENSIWFYFLKEEKGNYAKCKKCA FT NFIKTGGGSTSGLHTHLKTKHEINLRKRAGAEPVAGSSSQCDKVNPLTKIT FT KYFRNTKDDSLAAVLARMTARDGLPFRVFITSDDLRKSLKALNLSDELPKS FT ANTIRKIVIDYSEKIRQLLINEISDRKLNGKGFSITFDEWTSLRNRRYINI FT NLHSENCFWNLGLLRINGSLPAVKCVELLSDKLNHYGLSLEKNIVCLINDG FT AAVMXKVGKLVPAAQQLCLAHAIQLAVVDVLYKTSTTEDTNNSPNESIYMD FT SDSETEECNNYDASEGDDYEDGLVMEYNIQGRERDIDLNHNQLGPLIQKVR FT KIVKTFKRSPTRNQVLQDYARVEFGKEYCLILDSKTRWDSLIVMLERFYKL FT KSSIKKSAIDLNLNINFTDADFETVSLTVSTLLPVKLAVESLCREDANLLS FT ADTTFKFMFDSLSEIKSSLSTEVRSALMKRVFERRTELSDLIGYLHKGYTD FT SSQIINRTFNFPRISKSTILKSLLDIVNQMDLWNQNTALLEQSNEILDSDV FT NEPEEDLIELVDKSESSLKKKLDEAILKQKNYTVIRKKPERAGDLTKCLKK FT EISLFDDEGVRGKYLEQIYNLLLTIRPTSVESERAFSAAGVILNKFRCRLD FT DKTLDALCFLRGHFKGSSKSS" XX SQ Sequence 2689 BP; 976 A; 397 C; 460 G; 855 T; 1 other; agggaatgca ataccgggat accggtcatt ttttccggta ttaaataccg gtatttcggt 60 attaaattaa aatattattt taaacaacaa aaaatgcatt ttacagctat ataaactaga 120 atttttattt taacaaataa acaaataaat gccgttctat ttacacgaaa acacttcaaa 180 atcggggcac cccctataaa ctgcacacca cataaatttt ttagtgtgaa ttaataaaca 240 agcaccagtt ttgttgtatc agagttattt aaatcgtaga aagaaaagta cagcaaaatt 300 tttatgacaa atattaatta attttggtaa gtttatttat gctaaatcca aagtttgata 360 tttttttaaa tatataaatg tatgatatat tttattcatt acaataaaat attttagaat 420 atggagaata tgaaatttgt tcaacattca aaaggcggag atgaaaattc tatctggttt 480 tactttctaa aagaagaaaa aggaaactat gcaaaatgca agaagtgtgc taattttata 540 aaaaccggag ggggctcaac aagcggcctc cacacacatc taaagacaaa acatgaaata 600 aatttacgga aacgtgctgg tgctgaacct gtagctggat ctagctccca atgtgataaa 660 gtgaatcctc taaccaaaat tacaaaatat ttcagaaata ctaaggacga ctcactagca 720 gcggtactag caagaatgac agcgcgtgat gggctaccat tcagagtatt tatcacatct 780 gacgatctta gaaaatcact aaaagcatta aatttaagtg atgagctacc aaaatcagcc 840 aacaccatta gaaaaattgt cattgattac agcgaaaaaa ttcgtcaatt attaattaat 900 gaaatatctg atcgtaaatt aaatggcaag ggttttagta ttactttcga tgagtggacc 960 tctctaagaa atcgtcgata cataaatatt aaccttcatt cggaaaattg tttttggaat 1020 ttaggacttc tacgcatcaa tggaagttta cctgcagtaa aatgtgttga gcttttgagt 1080 gataaattga atcattatgg tttgtcttta gagaagaata tcgtttgttt aataaatgat 1140 ggtgcagcag ttatgcanaa agtaggcaaa ttagtaccgg ctgctcagca attatgtttg 1200 gctcatgcga tacaacttgc ggtcgtcgat gttctatata aaacaagtac aaccgaagat 1260 acaaacaata gtccgaatga gtcgatttac atggattcag attctgaaac tgaagaatgc 1320 aataattacg atgccagcga aggagatgac tatgaggatg gcttagtaat ggaatacaat 1380 atacagggac gagaacgaga tattgactta aaccataatc agctcggtcc gcttatacaa 1440 aaagttagga aaattgtgaa aacatttaaa cgttcaccaa cacgaaatca ggtactacaa 1500 gattatgcac gtgttgaatt cggtaaggaa tattgcttaa tcctagattc aaaaactaga 1560 tgggacagtt taatagtaat gctcgaacgt ttttacaaat taaaatcaag cataaaaaaa 1620 tcagctatag atttaaattt aaatataaat tttactgatg ctgattttga aacagtttcc 1680 ttaacagtgt ccacgctatt gccagtaaag ctagcagttg agtcattatg tcgtgaagat 1740 gccaacttac tttccgcaga tacgacattt aaatttatgt ttgatagtct tagtgaaata 1800 aagtcatcac tgagcacaga agttcgaagt gccttgatga aacgcgtatt tgaaagaaga 1860 acggagctgt ctgatctaat tggctatcta cataaaggtt acacagattc tagtcaaatc 1920 atcaaccgaa ctttcaattt tcctcgtata tcgaaatcaa caattcttaa atcactgttg 1980 gatattgtta atcaaatgga tctttggaat caaaatacag ctctactaga gcaatcaaat 2040 gaaattttag attccgacgt caatgagcct gaggaagact taatagaatt agtagataaa 2100 tcagaaagtt cattgaagaa aaagttggac gaagcaattt taaaacaaaa aaattatact 2160 gttatcagga aaaagccaga acgtgctggt gatttaacaa agtgtttgaa aaaagaaata 2220 agtctatttg atgatgaagg cgtgcgggga aaatatttag aacaaattta taatctactt 2280 ttaactattc gtccaaccag tgttgaatct gaaagggctt tttctgctgc cggtgtaatt 2340 ttaaacaaat ttcgatgcag actggatgat aaaactcttg acgctctctg ttttttaaga 2400 ggacatttta aaggaagcag taaatcttca tgaattttat gttatttatg aattatgaat 2460 tttatgttta atttgtttat ttcatattac tttgtgttta ttttatcttt aatttgtgaa 2520 ttatgtaaaa ctttttttat gtttattaaa aattcattga ataaatttaa cactgaattt 2580 tttaaaaaat tatttttttg aaaaccggtt ttaataccgg tatcccggta ttatcatttt 2640 gcaataccga aataccggta ttgagatttc ctccggtatt gcattccct 2689 // ID Homo10 repbase; DNA; INV; 3922 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo10 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo10. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-3922 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1416..2828 FT /product="Homo10_1p" FT /translation="MSSVESCLERALLYDTKSKRKRDIDRALTEMVVKDML FT PCTIVENEGFIEYTRVLEPRYNLPSSRHLRDVLMLHLFKETSAKLAVILEN FT VADIAITCDLWTSSTNVSFLSVTGHFVLDYSLKTVCLARQKLIDSTDHSAQ FT NIANTLQDILNFWNILDKTVCVVTKNSSSMLEACEILKIQNHPCFAHTLNS FT VVQDGLKLEDDAMKALIAKCKTIVKFFKQSSITNEKFKNVQENFASILLEE FT SHSRWNSLHFMIERILASQEAINAVLLPIKSAPLQLTVDEIIILKDIDIIL FT SLFQEASDKISGENYVSLVIPLAHGLFRKVHNLSSQLQTSVGERMKNTILK FT SITKRLSTYEQRTVTRVATILDPRFKKDGFQLNTNAEKAAIFLQHELMNLT FT NREPSINLDLESNTKTQDSLLDFLYQPVIRKPKNEKFDASYIKRQYLETPL FT APLKMDPFLWMKVNICYQCLKIQFLH" XX SQ Sequence 3922 BP; 1339 A; 722 C; 718 G; 1143 T; 0 other; cagatgttgt gacgtcacag ttcagtgcaa tcgtactgtt tgtgcctaca tacacacata 60 cataaacatc tgtttgataa agcagagagt agacagagaa aaagggagag tgagaaatca 120 gctcgattat ctcggccgta gtggttgaaa ccatcgatgt cactatcgat gtcaatgtac 180 tctcgatgta ttgaactatc gaatagtcaa atgggccgat agtatcatcg atagcttgct 240 tttcatcgct gtaatacatt ttggcgcact ttttgcaaaa gtcgaatcat agctcagtgc 300 taaaaataca ataaaaatat cacgaactca aagctgcata ctgaaacaca ttaaaaaatg 360 tgtaaataat ttctccgaaa aaaaaagtac ctcaaatgtg aaaaagaata tagaacaagc 420 ggcttaaggt aaaatgaaaa atccagttat tcttattcag gtattaaatt tctttattaa 480 atttccttgc aggctgatca gacaaatttt ccactgattg tggatcaagt gaaatagata 540 taaatgcaaa acacgttttg ttaataaaaa tattttttgc cactatcaac aactaccgat 600 tgtttctggt ttactatcgc caactatcga ttgcacacat catcgatagt ctgacacccc 660 cacttggccg aatgcgtgaa attttgcttt cgatagcgcc atcgatagct tcgagcacta 720 ctaaaaagtt cgatggtgcc accaatagta cggcttgcac atcactgaga cacactcttg 780 cgtttttctg tatctgagag ccaagcggat ttgcaattga ttggagattt ctttttccgc 840 attattaata ctctttaaat gcacacaaca gaatcaaatg gatagatttc ttctaaaagg 900 taataatcat aacgcaattg agaagtatct taataacact gagttttctt taaggtaagc 960 gtccgaattc gacgcgggct tgggatacca gcttcacgga ttctgataag aattctacgg 1020 tcgaaaagcc aacaccaaag aaacaaaaaa ggatctcaga ggtttggaag tattttaaga 1080 ggtccgacga cagactcttc gctaagtgtc tgtgctgcgg taaactttat aaaacgagcg 1140 gaaacacatc taatatacgt gaccatttga aaagattcca ctcaaacgta tctgtacttg 1200 attccagcgc atccgctgca gaagctttag tggataacga taacaacaag gacaccaatg 1260 acaccaatga caaccatgac gacaacgacg acaacgatga caacaatgac gacaacgacg 1320 acaacgatga caaagctgac cactacgacg acaaccacga caaaaatgag aacaacgacg 1380 acaagatcgc gtctacaagc ggcagctgta gatccatgag ttctgtggaa tcttgtttgg 1440 aaagagcttt attatatgac acaaaatcaa agcgaaagag ggacattgat agagctttaa 1500 ccgagatggt tgtcaaggat atgctgccat gtaccatcgt tgaaaatgaa ggctttatag 1560 aatacactcg tgtattggaa cctagataca atcttccaag tagcagacat ttgagagatg 1620 tgttgatgct ccacttgttt aaagaaacat cggctaaact ggccgtgatt ttggagaacg 1680 ttgcggatat agcgattacg tgcgacctgt ggacttcaag cactaacgtt agcttcttaa 1740 gcgtcactgg tcatttcgtt cttgattata gcctgaaaac agtatgttta gcaagacaaa 1800 aattaataga ttcaacagat cactctgctc aaaatattgc aaacaccttg caagatatat 1860 tgaatttttg gaatatacta gataaaactg tatgtgtagt aacaaaaaat tccagctcaa 1920 tgctagaagc atgtgaaata ttaaagatac aaaatcatcc atgcttcgct cacaccttga 1980 attcggtggt acaagatggg ctaaaactgg aagacgatgc aatgaaagct ttaattgcaa 2040 agtgtaagac gatagtgaag ttttttaagc aaagttctat aacgaatgag aaatttaaaa 2100 atgttcaaga aaacttcgcc tctattttat tagaagaatc tcattcaaga tggaacagcc 2160 tccactttat gatagagcgc atattagcat ctcaggaagc cattaatgca gttttattgc 2220 caataaaaag tgcaccgctg caattgactg ttgacgaaat cattatatta aaagacatag 2280 atataatatt gtcacttttt caggaagcaa gcgacaaaat atctggtgaa aattatgtgt 2340 cattagtaat acctttagca catggtcttt tccgcaaagt tcacaatctt tcatcacagt 2400 tacaaacttc ggtaggagaa agaatgaaaa atactatatt gaagtccata acaaagcgtt 2460 tgtctacata cgagcaacgg acagtaacca gagtggcgac aatattagat ccgcgcttca 2520 agaaagacgg gtttcagcta aatacgaatg ctgaaaaagc tgcaatattt ttgcaacacg 2580 agttgatgaa tctgaccaat agagagccct caattaactt ggatttggaa tcgaacacaa 2640 aaactcagga ctcactttta gatttcttgt atcaaccggt cattagaaaa ccgaaaaacg 2700 agaaatttga tgcatcttat ataaagaggc agtatctgga gacaccctta gcccccctaa 2760 aaatggatcc ttttttatgg atgaaggtga acatctgcta tcaatgccta aaaattcagt 2820 ttttacatta aatttcattt caggcttccc agacggaatt cccacttgta aaatcattgc 2880 tatttaagta tttatgcatc cccgcaacat ctgtagaatc agagaggctt ctcagaaagg 2940 cggagcgaat tgtttctgac ggaagaacaa ggcctaaaga agaaaactta aacattttgt 3000 tgtttttgaa ccaaaatctt tggattaagt caaaatacaa aaccctatat taataaatta 3060 actaattttt tgtatgcata tatgtataaa aaaaataaat aaaacaagca aaatacagtg 3120 aataaatagt atacgttaac aattttcatt gcttagtgac aactttaatt tcgattttat 3180 agattacgct cttccagtca agtacttaaa gtggccctgt tgtcgtaaat taattctgaa 3240 tatatagtgt gtttttgaga actactgatg gtatagactt gaaattggct taaatttgta 3300 gtatatattc aaggggaata ccaaagatac atgtataagt tactataact tactattgaa 3360 cgtattaatg tataataata catcccctag atagaggatt tctattttaa ttctttgatt 3420 tttttgttct ttacgtttta taggaaatat cacaaaacaa tttgaaccac tgtaattttc 3480 acgcatctac atatttacat catgttttga atggctaacc attcgtcaag ctcgacgcct 3540 tctaaggtga gcaagttgaa caaattttca aagtgatgct cacgaacatt aggttttgac 3600 ctggcaaacg cattgaaata tcgaacaata gaatttgaat gtcctggggg gtccgttgat 3660 tcacgtatca tattgcgagc cgcaaacttg aaagcaatgt acagtctgtt acaggtattc 3720 atatgactgt tccttagtct ctgaaattca ccgttggtat attgattgag taagtcatca 3780 tctttgccaa ccagtaacat ttccttctgt gcgctaaact gttgaagagc ggcgtaatct 3840 tctactaaag tggacgcttg tggagtccat tcgatgataa agtctttcca tttctcaagt 3900 tcctctttat tcacaacatc tg 3922 // ID Gypsy-2_RP-I repbase; DNA; INV; 3597 BP. XX AC ACPB02032414; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_RP_; KW Gypsy-2_RP-LTR; Gypsy-2_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-3597 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02032414; Positions 5363 1767. XX CC Positions [2706-3182] - Integrase core CC 'CTGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..2535 FT /product="Gypsy-2_RP-I_1p" FT /translation="MTPEQKPDSTQTNVLKPERLMVEPSSANASKQWNHWF FT RTFESYVRRSGISDESVKLDYLVTLVSSDIYEYIEECTTYKEAVDILKSLY FT VKRNNEIFARHQLATRCQQSGESLDRFLQELKILSKDCNFSSVTADQHRDE FT SIRDAFITGMLSSNIRQRLLENSTLTLKTAYDQARSLEMAQKQSQSYVNLE FT VPSVNSVLERDSSTSGHSDDSPVAAATSVTCLYCGNRKHSRKVCPARDVVC FT HSCNKKGHFSKVCLANRRQIRKKTSASILAAAPSSLSKAVVKVTINGVCAD FT ALVDTGSSDSFINTEFVVRRQIPMTRAPSNVSLASKAVSSQINGYCVVDLE FT FQRYSYKAKLNVLKDLCADTIIGHDILNQHSSLELSFGGNKAPITICSVHE FT ANVSSVSLFSNLTTDVRPIATKSRRHSLADREFIRDEVTRLLKDGIIEESI FT SPWRAQPLVVSNENNKKRMVIDYSQTINRFTLLDAYPLPRIEEVVNRVAQY FT SVYSSIDLKEAYHQVPIRDDEKAYTAFEANGKLYQFRRIPFGVTNGVACFQ FT RIIDKIINDENLQDTFPYVDDVTVCGVSQEAHDRNLERFLNAAKRYNLSLN FT ENKCVFSATTINLLGYKITQGTIQPDPERLRPLTNLPVPHSTNELKRVLGM FT FSYYSRWIPKFADKIRPLVKTQNFPMDTLAVKAFEQLKTDVASSVVQSIDE FT SSPFTVETDASDFAIGASLSQSDRPVAFFSRTLSKCEQRYSAVEKEAQAII FT EALRKWKHYLIGRHFKLVTDQKSVAFMLDRKHTNKIKNDKIMRWRLEMSCF FT QFDVVYRAGRENQVADTMSRICASIEEPSGKTI" XX SQ Sequence 3597 BP; 1118 A; 800 C; 753 G; 926 T; 0 other; atttaatcaa caaaaaacaa aaacacaatt atgactcctg aacagaaacc tgattccaca 60 caaacgaatg ttctgaagcc agagaggctc atggtggagc ctagctcggc taacgcgtcg 120 aagcaatgga accactggtt cagaacattc gagtcatatg ttagaagaag tggaataagt 180 gacgagtcag taaaactcga ttaccttgtt acacttgtct catcggatat ctatgaatat 240 atcgaggagt gtactactta caaagaagct gtagatattc ttaaatctct ctatgtcaag 300 agaaataatg aaatatttgc tcgtcatcaa ctggcgacaa gatgtcaaca atcaggtgag 360 tctttagata ggtttttgca agaattgaaa atactaagta aagactgtaa cttcagctca 420 gttactgccg accaacatag ggacgaatca atacgggacg cttttattac gggtatgttg 480 tcaagcaata ttcgtcagag attactcgaa aattcaaccc tcacactcaa aactgcttat 540 gatcaggcca gatcactcga aatggctcag aagcaatcgc agtcctatgt taatctcgaa 600 gtccccagtg ttaactctgt gctcgaacgt gacagctcta cctctggcca ctccgatgac 660 tctccagttg ctgcagctac atcagtcact tgtttatatt gtggcaaccg caagcactcg 720 cgtaaggttt gtccggctcg tgatgtcgtt tgccattctt gcaataaaaa ggggcatttt 780 tccaaagtat gtctagcgaa taggagacaa attagaaaga aaacctcagc atctatacta 840 gcagctgccc catcttcctt atcgaaagcc gtggtaaagg ttacaataaa cggggtatgt 900 gccgacgcgc tagtggatac aggtagctct gacagcttca tcaacacaga gtttgtagtt 960 aggcgtcaga tacctatgac aagagcccct agtaatgtca gcctcgcctc taaagctgtt 1020 agttcgcaga ttaatggata ttgcgttgta gatctcgagt ttcagagata ttcttacaag 1080 gccaaactaa atgtcctaaa ggacttatgc gccgacacca tcataggtca cgatatactg 1140 aatcaacaca gtagtctaga gctctcattt ggtggtaata aagcacctat cacaatttgt 1200 agtgttcatg aagctaatgt aagttcagtt tcactctttt cgaacctgac cactgacgtc 1260 aggcctattg ctactaaatc caggcgacat tctctagccg atagggagtt cattagagat 1320 gaagtaactc gactcttgaa ggacgggatt attgaagaga gtatcagccc atggcgtgct 1380 caacctttag tcgtcagcaa tgaaaataac aagaagagaa tggtcatcga ttactcgcag 1440 acaatcaata gatttacgct actggatgcc tatccactgc ctagaattga agaggtagtg 1500 aaccgcgtag cccaatacag tgtgtacagt agcatcgatc taaaagaagc ttaccatcaa 1560 gttcctatca gagacgatga gaaagcttat acagcattcg aagctaacgg aaagctctat 1620 cagttccgcc gaatcccctt cggcgtgact aatggtgtag cttgttttca aagaattatt 1680 gacaaaatta ttaatgacga gaacttgcaa gacacctttc cttacgttga tgatgtcact 1740 gtttgtggag tgtctcaaga agctcatgac agaaacttgg aacgtttcct caatgctgcc 1800 aaaaggtaca acctatcgtt gaatgaaaat aagtgtgtgt tttcggcaac aaccatcaac 1860 cttttgggat acaagataac ccaaggtacg atacaacctg atcccgagcg gcttcgtcct 1920 ttgactaacc tgccagttcc gcattctacc aacgaactaa aaagagtttt aggaatgttc 1980 tcgtactatt cgcggtggat acctaagttc gcagataaaa ttcgaccact tgtcaaaact 2040 caaaatttcc caatggatac attagcggta aaagcatttg agcaactcaa gacagatgtc 2100 gcctcttcag tcgtacagag catagatgag agttctccat tcactgtaga aactgacgct 2160 tcggattttg caattggagc atcgttgtcg caatccgacc gtccagttgc atttttttct 2220 agaacattat cgaaatgcga gcagcgttat tcggcagttg agaaggaagc acaggccatt 2280 attgaagcgc ttaggaaatg gaagcactat ttaataggcc gacattttaa gctagtcacg 2340 gaccaaaaaa gtgtcgcttt tatgttggac aggaagcaca ccaacaagat caaaaatgat 2400 aaaataatgc gctggcggtt ggaaatgtcc tgtttccagt ttgatgtggt ttatcgcgct 2460 gggagagaaa accaggtggc agacacgatg tctagaatat gcgcttccat tgaggaaccg 2520 tcaggaaaaa caatttagta gaattacaca catccctatg ccatccaggg gtgacaagaa 2580 tgtaccatta cgttcgtatt aagaacttac ccttttcgct agatgatatt cgaaaaatta 2640 cgaacgcctg tagcatttgt gctcaactca aaccacggtt ttacaagcag aagggtactc 2700 taataaaagc tacgagcccc tttgaaagac taaacataga tttcaagggc cctcttccaa 2760 gtttaaccaa gaaccgatat ctcctgacaa ttgtggacga gttttcaagg ttcccctttg 2820 cctttccctg caaagacgtt acttcatcca cagtaataga atgtcttcgt cagttattct 2880 acttgtttgg gattccttgt tacatacatt cggatagggg ctcatcatca atatctaagg 2940 aactaaaatc atttcttact tctttaggca tagcaactag ccgcacaacc ccatataacc 3000 cacaaggtaa tgggcaagta gagcgctaca acggtatcat atggaaaaca gtatctcttg 3060 cactcaaatc aaaaaacatg gaacagggag aatgggaggc cgtgttggcg gattcactac 3120 atgcgattag aactcttcta tgtacagcca ccaatgaaac gccccatgaa caacaactgg 3180 actagctaca cccacttggc taattacacc gagaccgatc ctcctaagga aacatgttag 3240 acaaagcaaa tttgaccctt tagtagaaga agtggagcta ctggaggcta atccatccta 3300 tgctcacgtc agattcccgg atgggaggga atcaacagta tcacttaggc atttagctcc 3360 cagtggcgat cccgattgtt ctctcgaaac cttgcaggct ccccaaggga acgaggagct 3420 agagaaaaca atgggtgtta acgaaacgga gcatcaaggt actgaattcg caaccgctac 3480 tccaccacta gcagaaccga catctactaa agaaggaact gggagacctg ttagaatccg 3540 gcggatccct tcttatttac aagaatatca cctgaacacc ctctcaggag ggaagaa 3597 // ID DNA8-92_AP repbase; DNA; INV; 600 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-92_AP. XX NM DNA8-92_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-600 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2029-2029 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 600 BP; 166 A; 92 C; 117 G; 225 T; 0 other; cagtggttga tttggggggg gggcgcaggg ggacggagtc ccccttcttt ttaaattttt 60 ttttagcgta ccccttctat ttatttgaaa cggaacgctg ggaacccaac ataagctagt 120 gaatttgtaa atcgtgattt cctgatgcca aatttactta ataggtacac aatattttat 180 attgttatta tattttatta tagcctgtaa gttgaattaa tattataaat tacaacaaaa 240 taactaaaat cgttattttt atttttattc gtttctattg tgataactga taaacaaagc 300 gttagaaatt aaaatcccat ttttagcggt ttttttgtaa tttgtcggtg gtttttcccg 360 tggcattaaa taactattga gaaaatcgaa aaatgacccc cctataaaaa gtgtttttct 420 ttctatatgt atttttcgaa taaatttgat attgttttgg tgtgataggt tgatgggtgg 480 agagggaggt gtggagggct ttgccctcca cggcaatgtg acgattaatt ttgtgtggaa 540 cgctcgttta tccaacatca gcagagtccc ccttcttttt tatttccaaa tcaaccactg 600 // ID I-9B_AAe repbase; DNA; INV; 5795 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-9_AAe; KW I-9B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5795 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1363-1363 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. The consensus is ~80% identical to I-9_AAe. XX FH Key Location/Qualifiers FT CDS 486..1622 FT /product="I-9B_AAe_1p" FT /translation="METDGDSPNSKEDDHSAPLEAASKPFRVKLFPASFLG FT PYPVYFRKKDKPINVLLISAEVYKQYKSVKEIKKISLDKLRVVFGSRGDAN FT ALLESNLFSNLYRVYAPCDSCEISGVIYDEFLSCNDVIDYGLGIFKNKSIS FT PVKILDCNRLSKLSFNGSNTNYVHSDCMKITFAGSVIPDFVSIDNVTYSVR FT LYYPRLMHCTRCLLFGHTSNYCSNKPKCSKCGESHSSSECDKHSDVCIYCK FT LNHSSIKECSVYKANQQKFNQQIKLKNRFSYSEILKASGDFASSNIYETLS FT DYDDDNLNENSNQFIYKPPTKRKRANLLSGKINDNCDPQPSTSFDKSFPPL FT DTTSNPKSIPGFQRVNDDCSNNKNSKENIDDSAKKN" FT CDS 1792..5493 FT /product="I-9B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASIDSKTLNVLQWNCRSIIPKLDRLKSLLSNYNIDV FT FCLNETWLVECKNIHISSYNIIRKDRNTPSGGVLIGIRDGIQFKYLDLPPN FT SSVECIAVTIKHNNIQFSIICFYIPPNSVFSLTQIKNILDSVPSPFYILGD FT FNAHNIAWGSNNTDGRGNLIMELVDELNLNILNDGSFTRIAVPPAHHTCID FT LSLCSNTLSFFSSWKIINDPNGSDHLPILIEISCSNCNSTCNESFVPDLYR FT NVDWEKFSDLISFSLISFDYSLPPLENYKQFSKTLNHCLLKSQSKKIKSNI FT CTKNRRHSFWWDNDCSIALKNKSEAFKKFRRIGSRENYFSYCKAEAQFTRV FT TKYKKRNYWRNFVENLDRNSGLSSLWSVARNLRNYNFSPPNILEYSEDWID FT TFASKICPAFVPRATSFKNNQKYNYFPELCALFSLEELDLALSITKNTSPG FT IDNIKFIVLQNLPHEGKSHLLALYNSFLQQNIFPSEWRSIKVVSIVKPGKD FT PSLVDSRRPISLLSCLRKLMERMLLNRLELWAENNQILSSSQFGFRKRCST FT RDCTALLASQINLCFNRKQDMVSTFLDVSGAYDSVLIDLLFYKLNSFKIPY FT IITNFLYNLFSFKIMHFFHDGSSKLIRYSYFGLPQGSCLSPFLYNLFTSDI FT SSVIPNGCQFYQFADDKVISICGNNREIIRHFMQNALDNIDIWAHNNGFSF FT SVSKTKFILFSRKRSPVNIHLYLDGYEIEQVDEYKYLGLWFDSKLSWNKHI FT QYIQVVCAKRINFLRTITGTWWGAHPSDLITLYKTTIRSVLEYGCFTFASS FT VQSKFCKLEKIQYRCLRICLKLMNSTHTKSVEVLAGIHPLKIRLHELNCKF FT MIQCFSINHPIVNILKCLHDINPTCRILDSFNYCCSINVAPNSNHSLHFHS FT FGIDIHSFQPNIDLSLHEELKQISNTEYSRFAPLFFKRKFIGLDAKQFYFS FT DGSFTQGIAGFGVYNHYSAYFFKLQSPCSIFVAELTALYFTCSLIKQFLPN FT IFIICSDSLSCLNAIKTINFNSKTHHIILKLKEELFYLHTQGYIIKFVWVP FT SHSNIYGNEQADSLAKLGVRCGIWYDRQIASSEYYTELRKHSLNNWQISWN FT SSDKGRWCHSIFPIVNQTTWYKKLSVGRNFICTLSRIISNHYICNSHLYRI FT DISDSNLCDCCETYEDIDHIVFHCTRYIIPRNKFVSDLKNHHDHLPSSVRD FT ILGSRFLPSLKLLYRFLNDASYYV" XX SQ Sequence 5795 BP; 1805 A; 824 C; 897 G; 2269 T; 0 other; tcattgctgt agtaggcctt gaagtaacag gacgtgtttt tttttcaacg tctgacattt 60 tttctctttt ttcaggagga gaaaattttt ccgtggctgc tcaagagttg aagttcgtga 120 gtgctttggt gttgacgaaa gacgtttcgt cttggatttt tgcgaacgtt tggagggcga 180 agatttcttt gaagtttatg ctgtgttttc cactgaggag gaattgttgt gaactttgga 240 ctgaggactt ctttttcgaa ctgctggtgg caagctgttc aagagaactt ttggctttag 300 aagcgttcca gtttctgtat tgcgtggatt ttgaagtttg aagttgactt tcaagaagag 360 aaactttgac gtttttggat tgagctgagt acaattttta tgtttcttat ttctacaatt 420 ttcttcattt gtagctttta ttaggttttt ttttgttgat taatattccc cgttgttttc 480 caattatgga aaccgacggg gattcaccaa attcaaaaga ggatgatcat tctgcacctc 540 ttgaggcagc atccaaacca tttcgagtta agttatttcc tgccagtttt cttggtccat 600 atccagttta ttttcggaaa aaggacaaac ctataaatgt tttgttgatt tctgcagagg 660 tgtataaaca atacaaatct gttaaagaaa tcaaaaaaat atctttggat aagttaagag 720 tagtttttgg ttctcgtgga gatgcgaatg ctctacttga atctaactta ttttcaaatt 780 tatatagagt gtacgcacct tgcgactcat gcgaaattag tggtgtcatt tatgacgaat 840 ttttgagttg taatgatgtt attgattatg gtttaggaat ttttaaaaat aaatcaattt 900 cacctgtgaa aattctagat tgtaatagat tatccaaatt atcatttaat ggcagtaata 960 caaattatgt ccattctgat tgtatgaaaa ttacttttgc gggctctgtg attcccgatt 1020 ttgtttcaat tgacaatgtt acatacagtg tgaggcttta ttatcctagg ctaatgcatt 1080 gtactcgttg ccttttgttt ggacatactt caaattattg ttcaaacaaa ccaaaatgct 1140 cgaaatgtgg tgaatcccat tcgtcatcag aatgtgataa acattctgat gtttgtattt 1200 attgtaaatt aaatcatagt tcaatcaaag agtgttcagt ttacaaagca aatcaacaaa 1260 agttcaatca acaaattaaa ttaaaaaatc gtttctctta ttcagaaatt ttaaaagctt 1320 ctggagattt tgcttcttct aatatttatg aaacattatc agattatgat gatgataatt 1380 taaatgaaaa ttctaatcaa tttatataca aacctccaac caagagaaag agagcaaatt 1440 tattatctgg taaaattaat gataattgtg atcctcagcc atctacttca tttgataaaa 1500 gttttcctcc tcttgatact acttcaaatc ccaaaagtat accaggattt caaagagtaa 1560 atgatgattg ttccaacaat aaaaactcta aagaaaatat tgatgattct gcaaaaaaaa 1620 attgacaatc atgataattc cattttgagc attttggaac aggtaataga tttattagaa 1680 ttaaatgatt tttggaaaaa aactgattaa aaaatgttta ccagttttag cttttcttct 1740 tgataaatta aattcttttg ggtccctctt ttcttcatta ttctcttttt aatggcttca 1800 atcgattcta agaccctgaa tgttttacaa tggaattgtc gtagtataat tcctaaactt 1860 gatagattaa aatcattgct gtcaaattac aatattgatg tattttgttt aaatgaaact 1920 tggttggttg aatgcaaaaa catacatatt tcgtcatata atattattag gaaagataga 1980 aatactccat ctggaggggt tcttattggg attcgtgatg gtattcagtt taaatatttg 2040 gatttacctc ctaattcatc tgtagaatgc attgctgtta ctattaaaca taataacatt 2100 caattttcga ttatttgttt ttacattcct ccaaattctg ttttttcttt aacacaaatt 2160 aaaaatattt tggatagtgt tccttctcca ttttatatat taggagattt taatgctcat 2220 aatatagcat ggggtagtaa taatactgat ggcaggggaa atttgataat ggaattggtt 2280 gatgaactca atttaaatat ccttaatgat ggatcattca ctagaattgc tgttccacct 2340 gcccatcata catgtattga tttatctctt tgttccaata ctttatcttt tttttcttct 2400 tggaaaatta tcaatgatcc taatggtagt gatcatctac ctattttaat tgaaatttct 2460 tgttctaatt gtaattctac atgtaatgaa tcatttgttc ctgatctgta tagaaatgta 2520 gactgggaaa aattttctga tctaatttcc ttttcattaa tcagttttga ttattcttta 2580 ccaccactag aaaattataa acaattttct aagacattaa atcattgttt actgaaatct 2640 caaagtaaaa aaattaaatc aaatatttgc actaaaaaca gacgtcattc attttggtgg 2700 gataatgatt gttcaatagc acttaaaaat aaatcagaag ctttcaaaaa attccgtcgt 2760 attggatcaa gggaaaatta tttttcatat tgtaaagctg aagctcaatt tactcgagtt 2820 actaaatata aaaaaagaaa ttattggaga aattttgttg agaatcttga tagaaattct 2880 ggtttgtctt ctttatggtc ggtagctaga aatttgagaa attataattt ttctcctcca 2940 aatattttag agtattccga ggattggatt gatacatttg catctaaaat ttgtccagct 3000 tttgttcctc gcgctacttc tttcaaaaat aatcaaaaat ataattattt tcctgagctt 3060 tgtgcattgt tttctttaga agagctagat ttagcgttat ctattactaa aaacacttca 3120 ccaggaatag ataacataaa atttattgta ttgcaaaatc taccccatga aggaaagtct 3180 catttattgg cattgtataa ttcatttctt cagcaaaaca ttttcccttc tgaatggcgt 3240 tcaattaagg ttgttagtat tgtaaaacca ggcaaggatc cttcgttggt tgatagtcgt 3300 agaccaatta gtttattgtc atgtcttcgc aaactcatgg aaagaatgtt actaaatcgt 3360 cttgaattat gggctgaaaa taatcaaatt ttatcatctt ctcaatttgg attcaggaaa 3420 cgttgtagta ctcgtgattg cactgccctt ttagcttcgc aaattaatct ttgctttaat 3480 agaaagcagg atatggtttc tacttttctc gatgtttcag gagcttatga ttctgtttta 3540 attgatctac ttttttataa attaaatagt tttaaaattc catatataat tacaaatttt 3600 ttatataatt tattctcttt taaaataatg catttttttc atgatggttc ttctaaactt 3660 attcgttata gttattttgg acttcctcaa ggttcttgtt tgagtccatt tttatataat 3720 ttatttacga gtgatatttc ttctgttatt ccaaacggtt gtcaatttta tcaatttgct 3780 gatgataagg ttatttctat ttgtggtaat aatagagaaa tcattcgtca ttttatgcaa 3840 aatgcattgg ataacattga tatttgggct cataataatg gattttcttt ttccgtgtct 3900 aaaaccaaat tcattttatt ttctcgcaag cgttcccctg ttaatattca tttgtatctt 3960 gatggttacg aaattgaaca agttgatgag tataagtatt taggattatg gttcgattcg 4020 aaattatcgt ggaataaaca tattcaatac attcaagttg tttgtgcaaa gagaataaat 4080 tttctaagaa ccattactgg tacctggtgg ggtgcacatc cttctgattt aattacactt 4140 tataaaacca ctattcgttc tgttttggaa tatggatgtt ttacttttgc aagttcggtc 4200 caatccaagt tttgtaaact tgaaaaaatt caatatcgat gcttaagaat ttgtttaaaa 4260 ttgatgaatt ctactcacac aaaatctgta gaagttttag ctggaattca tccgttgaaa 4320 attcgtttac atgaattgaa ttgtaaattt atgattcaat gcttcagtat aaaccatcca 4380 atagttaata tactgaaatg tttacatgac ataaatccta cttgtagaat tttagattct 4440 ttcaattatt gttgttctat aaatgtggca ccaaattcaa atcattctct tcattttcat 4500 agttttggta tagatattca ttcatttcaa cctaatattg atttgtcatt gcatgaagaa 4560 ttaaaacaaa tttcaaatac tgaatattct cgttttgctc cattattttt taaacgtaaa 4620 tttataggtc ttgatgccaa acagttttat ttttcagatg gatcatttac ccaaggtatt 4680 gctggttttg gagtgtataa tcattattca gcatattttt tcaaactcca atcaccttgt 4740 tcaattttcg tagctgaatt aacagcctta tattttacgt gttctttgat taaacaattt 4800 ttacccaata tttttatcat atgttcagac agcttgagtt gtttgaatgc aattaaaact 4860 atcaatttta attctaaaac tcatcatatc attttgaaac ttaaagaaga actattttat 4920 ctacatactc agggatacat tattaaattt gtttgggttc cttctcattc caatatttat 4980 ggtaatgagc aggctgattc attagctaaa ttgggtgttc gctgtggaat atggtatgat 5040 cgtcaaattg catcatctga atattacact gaattaagaa aacattcttt aaacaattgg 5100 cagatttctt ggaattctag tgataaagga cgttggtgtc attctatttt tcctattgtc 5160 aatcaaacta cgtggtacaa aaaattatct gttggaagaa attttatttg tacattatca 5220 agaataattt ctaatcatta tatttgtaat agtcacttat atcgtattga tataagtgat 5280 tctaatttat gtgattgttg tgaaacttat gaagatattg atcatatagt tttccactgt 5340 actcgataca ttattccaag aaataaattt gttagcgatt taaagaacca tcatgatcat 5400 cttccttcat ctgtacgaga catattagga agtagatttc taccatcttt gaaattactt 5460 tatagatttt tgaatgatgc ttcatattat gtttgatact tgtttctttt ttatttcaga 5520 attggtttga agaaaatttt cccagatctg tcccgttttg atgatttcaa gtagattctg 5580 gttcctggtc ctccttgaag attgtggctc tgctatggat caatgccgtt tgagccttta 5640 gtttataata ttttttttat aacgatttta gaaaagataa agaggtttta tgcctttttg 5700 agaacgattt ctttgatgat aatcactcaa aggggctttt ccctctttct aaattattta 5760 gttaaaataa ataaataaat aaataaataa ataaa 5795 // ID RTEX-11A_BF repbase; DNA; INV; 693 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-11A_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-9A_BF; KW RTEX-11A_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-693 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-693 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1727-1727 (2009). XX DR [2] (Consensus) XX CC This is a young subfamily of RTEX-11_BF. The 5' terminal portion CC is missing. CC The 3' terminus is composed of the (TTC)n CC microsatellite. XX SQ Sequence 693 BP; 244 A; 120 C; 108 G; 221 T; 0 other; ggccttgggc aaatatggac gaaacctgag gcttataaaa cagattatat tgctaaccaa 60 ctgcgccccc ggctacaaga tatatacata caggagtggt ttagttctat tgaaaacaat 120 tcaaaactgt cttttcttag caaatcaaaa gaatgttatg aacaagaaaa atatctttat 180 gatataaata actttgaaat tcgcaaagca ataactcagt ttagaatcag tagccataaa 240 ttgaatattg aaacaggcag atactacaat atagcccctg accaaaggtt ctgtccattc 300 tgcccaaagc atattgaaga tgaatttcac tttataatgg aatgttctag atacgctagt 360 ttacgtaatg aattattcat atttctcgaa tcaagtacaa cagatttcaa gagactcgat 420 gctacaaaca gatttgtata tatctttaga tgtcagaact cccataatgc aaaaataggc 480 aaatatatca aggactgctc tgctattaga aagaataccg aaactaatga ttagcctact 540 ccattgtaat actatatcaa agaattatgt tagcccttat tgttatgtta actaggttgt 600 taagctttat cctatacctc tattttgtac tttgtgtaat agttgttgcc atacattgta 660 tcatgtacaa ttgtcgtgca ataaagttct tct 693 // ID Gypsy-126_AA-I repbase; DNA; INV; 3431 BP. XX AC AAGE02025415; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-126_AA_; KW Gypsy-126_AA-LTR; Gypsy-126_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3431 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025415; Positions 803 4233. XX CC Positions [3144-3425] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 186..3431 FT /product="Gypsy-126_AA-I_1p" FT /translation="MEKATITPFDVTDTASIASRWTKWKRSLQLCLEVNCV FT ALPSRKRSYLLHFAGPEVQDIFYNIEGHDADPPLGSDVFTEAIRLLDEHFA FT PLNNIPYERCVFRKMTQQENEPVEKFIHRLRDQGRLCDYGAALEMRITEQI FT FDNCVSDTLREAILKKKLMTVQEIAEEGRVLETVKRNRDQMKKTVEEEQLS FT LVKKQSREEICFRCGIVGHFANDKKCPARSKMCDKCQIVGHFKKMCKTKVK FT SGKVSKRNQKINQVLTQDIVSDSSSEETDSDDVQQVYANGSGLDKTTCFIG FT GVKTEWIIDSGAHVNVITRGTWKSLKEQGCVVSSECKSDKVLRVYGDGKLN FT VLKIIKADISTRDKRVHHEVCVVDSEKGANLLSRATSIELGLLEIRGNVLS FT VDDSGEPPIGKLKNVQVVIKLDPNIPPVQQSCRPLPIPLKALVDEKLADLL FT KQDIIEPAPLNISWASPLVVTPKDGGRTVRLCVDMRRANKAIIPEKHPLPT FT FEEIMPHLEGCKVFSKIDLVKAFHQIELSPESRDITTFVTQDAYYRYKRLM FT FGMKVAPEVFQRCIERVLKGLKGVKVFIDDVLVFGSSKEEHDIRLRAVLDR FT LKENGLTINEAKCEFGQSMVVFMGHQLSANGILPTNEKVSAIQSFRRPQTA FT SEMRSFLGLVNYVGKFIPNLSTLTAPLRDMIVKGVKFHWSREAKVSFNEVK FT RAMSNPNHLAFYSPRYKTTLITDASDHGLGAVLLQTNNSKTRPVSYASKSL FT SKTEKNYSTLDKEALAIVWATERFEMYLRGLDFSILTDHKPLIHIFSDSSC FT PNKRQERWVLRMQSYRYSVEYVPGEVNIADPLSRLCDLADMKTYDKRTEDA FT LYSIVEVNIPSAITMTEMIRSSQDDPEFAKVRKALNDDRWEDLRGFAPFKS FT ELCFSKDLLLRKDKIVVPGDLQEAVVRLSHTGHPGKEKMKRRLRTAVWWPG FT LDSDVEQACRSCVECQMVGAANKPEPLRIREIPSAPWVHLSADFLGPLPNG FT KYIFVLVDLYSRFVVAEFMTRTLSADVIRVLRQVFTRMGLPFVLTTDNAKN FT FSSQELKDYCVDYGIKLTHTTPYWP" XX SQ Sequence 3431 BP; 1014 A; 678 C; 871 G; 868 T; 0 other; tttttggcga ctaatgatgg gatctcgcga ttaaattcga taaaagtttg cttaaatgcg 60 ggaagtatcg gtttaaatac gaataaagaa gatagttgta tcggccgcga ttgttttggc 120 acgaaacggc gtcgtaatcg gcaaactttt cgaggtcata tggaacataa cctcggctag 180 tggaaatgga gaaagcaaca ataacacctt ttgatgtgac ggacacagca tctatcgcga 240 gccgctggac gaagtggaag agatctctcc agctttgctt ggaagtgaat tgtgttgccc 300 ttccttcgcg caagcgttcg tatttgttgc attttgccgg gccagaggtg caggatattt 360 tctacaatat cgaaggacac gatgcagatc ctcctctggg atccgatgtg tttacagaag 420 ccattcgttt gcttgacgag cattttgctc cactcaataa cattccgtac gaaaggtgtg 480 tgttcagaaa gatgacgcaa caagaaaatg aaccggttga gaaattcatc catcgccttc 540 gagaccaggg ccgcctctgc gattatggag ccgctttgga aatgagaata acggaacaaa 600 ttttcgacaa ctgtgtgtcc gacactttga gagaagcaat attgaagaaa aagttgatga 660 ctgttcaaga aattgctgag gaaggccgag tgttggaaac ggtgaaacgg aatcgggatc 720 aaatgaagaa aacagtagaa gaagagcaac tgagtttggt gaaaaaacaa agccgggaag 780 aaatttgttt tcgttgtgga attgtgggtc attttgctaa cgacaagaag tgccccgcca 840 gatcaaaaat gtgtgacaaa tgtcaaattg ttgggcattt caagaagatg tgtaaaacca 900 aagtgaagtc aggaaaagtg tcgaaacgaa atcagaaaat taatcaggta ctgacgcaag 960 acattgtttc ggattcatca agtgaggaaa cggattccga cgatgtgcag caggtatatg 1020 ccaatggatc tggattggat aaaacgacgt gtttcattgg aggtgtaaaa acggagtgga 1080 tcattgactc cggtgctcac gtcaacgtga ttacgcgagg aacgtggaag tcgttaaaag 1140 aacagggatg tgtagtgagc agtgaatgca aatcagacaa agtgcttcga gtctacggtg 1200 atggaaaact gaacgtactt aagattataa aagcggacat ttcaactcga gataagagag 1260 tgcaccatga agtttgtgtt gttgacagtg aaaaaggagc caatctgtta agccgagcga 1320 cttctattga actcggtttg cttgaaattc gtggaaatgt tctcagtgtt gatgacagtg 1380 gtgaaccacc gattggaaaa ctgaaaaatg tgcaagtggt tatcaagctc gacccaaaca 1440 taccgcctgt gcaacagtcg tgcagacctc tgcctattcc gttaaaagca ttagtggatg 1500 aaaaactcgc ggacctgcta aagcaagata ttatagagcc agcgccactc aatatcagtt 1560 gggcgtctcc gcttgttgta acaccgaaag acggtggacg taccgttcga ctttgtgtgg 1620 acatgcgtag agcgaataaa gcgataatcc cggagaagca ccctctgcca acattcgaag 1680 aaatcatgcc ccacttggaa ggatgcaaag tgttcagcaa gattgatttg gtgaaggcat 1740 ttcatcaaat tgagttatca cccgaatcgc gtgacattac cacctttgtc actcaagatg 1800 cgtactaccg ttataagagg cttatgttcg ggatgaaagt ggctccggag gtattccaac 1860 gatgcatcga aagagtgctg aaaggactaa aaggagtgaa ggtgtttatt gatgatgtgt 1920 tggtgtttgg atcgtccaag gaggagcacg atattcgcct gagagcagtg ttggaccgac 1980 ttaaggaaaa tggactcacg atcaacgagg cgaaatgtga atttggacaa tcgatggtgg 2040 tattcatggg acatcagctg tctgcaaatg gaattcttcc aacaaacgaa aaggtcagtg 2100 caattcagag cttccgtcgt cctcaaactg ccagcgagat gcgcagcttt ttggggttag 2160 ttaattatgt agggaaattt atccctaatt tgtcgaccct caccgctcct cttcgcgata 2220 tgatagtaaa gggtgtgaag tttcactggt cgagagaagc aaaagtatcg tttaacgagg 2280 taaagcgagc gatgtcgaac cccaatcatt tggctttcta tagtcctcgt tacaaaacca 2340 cattaattac cgatgcgagt gaccacggat tgggtgcggt tttgttacaa acaaacaact 2400 cgaaaaccag accggttagt tacgccagta aaagcctctc taaaaccgaa aagaattatt 2460 cgacgttaga taaagaagcc ctcgccatcg tttgggcaac cgaacgtttc gaaatgtatc 2520 tcaggggatt ggatttttcg attctcactg accacaaacc gctaattcac atttttagcg 2580 actcgtcctg tccgaacaag cgtcaagaga ggtgggtcct tcgtatgcag tcatatcggt 2640 actccgttga atacgtccct ggtgaggtga atattgcgga cccgttgtct aggttgtgcg 2700 atctggctga tatgaaaacg tatgataaac ggaccgaaga tgcgctgtat tcgatagtgg 2760 aagttaacat accgtctgcg ataacaatga ccgaaatgat tcgctcctcc caagatgacc 2820 cagagttcgc aaaagtacga aaggccttga acgatgaccg atgggaagac ttgagaggat 2880 ttgcgccgtt caaatccgaa ctatgttttt caaaagatct tctgctgcgt aaggataaga 2940 tagtcgttcc tggtgatctc caagaagcag tggtacgatt gtcacacaca gggcatcctg 3000 ggaaagaaaa aatgaaacga cgtctacgaa cagctgtatg gtggccaggt ctagattcgg 3060 atgtcgagca ggcatgcaga agctgtgttg aatgtcagat ggttggcgct gcaaataaac 3120 cggagccatt acgaattcgt gaaataccat cagctccttg ggtacatctg agtgcagact 3180 ttttggggcc tctcccgaat ggcaagtata tttttgtact ggtggacctc tacagtcgat 3240 tcgtggtagc cgagtttatg acccgtactc tgtcggctga cgttattcga gtattacgcc 3300 aggttttcac aagaatgggg ttgccttttg tgctgacgac tgataacgct aaaaatttct 3360 caagtcaaga actcaaagac tactgcgtcg attatggaat aaagctaact cacaccacgc 3420 catactggcc a 3431 // ID Gypsy-17-LTR_HM repbase; DNA; INV; 216 BP. XX AC . XX DT 03-FEB-2009 (Rel. 14.02, Created) DT 03-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-17-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-216 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 407-407 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 216 BP; 59 A; 27 C; 44 G; 86 T; 0 other; tgttgtaact gcgtaagccg gtttccggca aagtatagac taaaaaacgt gctggtgagt 60 ttcctgagca gtccgagagt aatgatcgaa aaaacacttg ttattttgat agacttattt 120 atggttgtta ctctttgttg tagatcttat tatttgtgat tattaattga gttttcttaa 180 agcagtagtt tttctttatt aactgtggtt ataaca 216 // ID CR1-79_AAe repbase; DNA; INV; 5089 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-79_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5089 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1167-1167 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 293..1708 FT /product="CR1-79_AAe_1p" FT /translation="MTTVLCDVCIQKITIESDRVYCFGGCHKILHAKCFEL FT NSAAVAALRQNVGLKYMCFDCRKDQTDLNAVKNHCSEIASTVDVMRESMDD FT IDSKVNAILKTQFDDFQNSLLSHVKQLIGKEVANTYVCKTASTASSYADVT FT RSGIDIPTAQNTRIELTTPSNATMYRQDSCTPVATDQDDDRGWLRSGKRRG FT SKSSSTTNVKLNQIKSTVISNSPAPKTSAITKIEQTVTFKPIEVQSAEITK FT RDIQQKLDPVSFAVKNVHFKDTGEASIRCESRELALRMMSAAKKVLSEKYA FT IDIQNALKPRVKIIGFDESVSKDKLVDTIKKQNSSLQDVDMNVVRLKKNER FT HKSNPMTALVEVDASLLVKLIKLQRVNVGWFRCRVVEDINVNQCYNCFEYG FT HKACDCTAPACCPKCAECHEFNECESDFQKCVNCYNHNRQFKLNEENQLDI FT AHSSWSTDCPIFLRRLNRAKQRIDYSS" FT CDS 1712..4999 FT /product="CR1-79_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QPIVTRGSTPTTLCSVSSGDTCVLPGNDTICICPAEA FT GGDTARNLIQTALQTNYLDPITTEITSTLTSGDSYGPSGKHRTSICPAEAD FT EDAARILIHRNLQSHQSNHCPTSSGNCANTDHCDCNASRNLESAESSSPPL FT KIYYQNTRGLRTKMEDFFLAVSEMEYDVVVLTETWLNDQHLSTQLFGNSYT FT VHRDDRDPVVTGMSRGGGVLIAVANHLSSSRVDVHTAIDLDQVWVKIDLRE FT VRIFIGVVYFSPDQAAIPSKIEDHLNSVRAVSDIVSTSDINLLFGDYNQPN FT IVWSNAPSGFAYPDPVESRFSVASSTLLDGMSLLDLKQLNVIKNTIGRTLD FT LAFIGGDKYASCKILPVSDPLVTADSFHPPILSLLDCSPPVRFVEPDESGQ FT FNFRKADFNALNSALQNTDWSVLETSDDVDDAVAMFNSTLTGMFREYVPVF FT RRRLRPPWSNNHLKNLKRKRAAALRRYSINRNMYTKRIFVLASNKYKSLNR FT KLYHRHVRQKQSNLKRNPRQFWSFVNDKRKQSGLPCSMFLGTQAANTQQES FT CDLFAEHFADAFQLVPSDDELIRVALRNVPTNVISLNFTYFTEEDVLNAIK FT KLKPSTSAGPDGIPPIVLKQCASSLCQPLLMICNKSVELGRFPESWKLSVM FT FPAFKKGDKRNVQNYRGVTSLCAGSKLLETLVGRVLFAGVRNYITTNQHGF FT FPGRSINTNLVEFTSFAIHEMEAGGQLDTIYTDLKSAFDRVNHDLLLAKIE FT RLGATERFVSWLRSYLYNRVLYVKVGNATSVCYQATSGVPQGSNLGPLLFS FT IFFNDICLVLPDGTRLLYADDLKIFLRIRSIQDCKELQRLIDVFQGWCSRN FT LLLVSISKCSVISFTRRKTPIVWSYTMCGEPLERTSVVKDLGVLLDAKLSF FT TDHYSSIISKANRNLGLIFRISSEFRDPYCLRALYVALVRSVLESASIVWC FT PYTDNWGRRIEAVQRRFIRYALRFLPWNNPENLPPYEDRCRLLGLETLSNR FT RSISKAAFVGKLLLNAIDSPTLLRQISFNVPPRLLRNRDFLRLDYHRTDYG FT QNEPIRAMCSVFNSVVSVFDFNVSVDVFITRLKIMFNRL" XX SQ Sequence 5089 BP; 1475 A; 1107 C; 1114 G; 1392 T; 1 other; cgctgatcac gataacaagt ccgcatgtga tacgattgtc cgcgtctacc ggtttttttt 60 tcgttgttgt agcgtacggt attagtcaac atttatacgt tcgtgtagct cttaggaaga 120 ccttttgaat gatacttcgg gctaaatcgt gcgattaggt ttagacgagt tacttccatc 180 ggaagtttta tcgagttggg aaaaatcatc atcagccaca acaaaattgc ctcatcataa 240 aggcccgcta cagggtaaaa attgcgtggc ggcggcagtg ccgctcctca atatgacgac 300 ggtattgtgc gacgtgtgca tacagaagat tactattgaa agcgatcgtg tctattgctt 360 cggcggttgc cacaagattc ttcatgcaaa atgttttgag ctgaattcag ctgccgtggc 420 tgctctgcgg caaaatgtag ggctgaagta tatgtgcttt gactgtcgca aagatcaaac 480 cgacttaaat gcggtgaaaa accattgctc tgagattgct tcaactgtag atgttatgag 540 agagtcaatg gatgatatcg actcgaaagt taatgcaata ctcaaaaccc aatttgatga 600 tttccaaaac tctttgctat cgcacgttaa acagctgatt ggaaaggagg ttgccaatac 660 gtatgtctgt aagacagctt caaccgcaag ttcatacgct gacgtcaccc gtagtgggat 720 tgatataccc acagcgcaaa atacgaggat tgaattaacc acaccttcca acgccacaat 780 gtatcgacag gattcatgca cccccgtggc cactgatcag gatgatgacc gaggatggct 840 ccgctctgga aaacgccgtg gatcaaagtc gtctagcacc acgaatgtta aactcaacca 900 gattaagtcc accgtaataa gcaactcgcc agcccctaag acaagtgcga taaccaaaat 960 agaacaaaca gtcacattca agccgatcga agtccaatct gccgaaatta cgaaacgtga 1020 cattcaacaa aagttggatc cagtcagctt tgctgtaaaa aacgttcact tcaaagacac 1080 gggtgaagcg tccattcgct gcgaatctag agaattggct ctcagaatga tgtctgctgc 1140 taaaaaggtt ttgtccgaaa aatatgctat cgacatccag aatgcactca aaccgcgagt 1200 gaaaattata ggatttgatg agagtgtgtc caaagataaa ctggttgata ccatcaagaa 1260 gcaaaattca tcacttcaag atgtcgatat gaatgtagtt cgattgaaga aaaatgagag 1320 gcataaatcc aatccgatga cggcattggt agaagtggac gcaagtttgc ttgtgaagtt 1380 gattaagctt cagcgggtga atgttggatg gtttcgttgc cgcgttgtgg aagatataaa 1440 cgtgaaccag tgctacaact gtttcgaata cggtcataaa gcgtgtgatt gtactgctcc 1500 tgcttgctgt cctaaatgcg cggagtgcca tgagttcaac gaatgcgaat ctgactttca 1560 gaagtgtgta aactgttaca accataacag gcaattcaag ctaaatgagg aaaaccagtt 1620 ggacatcgct cactcatctt ggagcaccga ttgtccgatt tttctgcgtc gtctgaatcg 1680 tgccaagcaa aggatcgatt actcatcata gcaaccaata gtgactcgag gaagcacccc 1740 aacaacgctc tgcagtgtgt catctggcga cacctgtgtt cttccaggta atgacacaat 1800 atgtatatgt cctgccgaag caggaggtga tactgcacgc aaccttattc agactgcatt 1860 acagactaac tatttggacc ccatcacaac ggagattaca tcaacgttaa catctggcga 1920 cagctatggc ccctcaggta aacatagaac aagtatatgt ccggccgaag ccgacgaaga 1980 tgctgcacgc attttgattc atcgtaattt gcagtcacac caatcgaatc actgcccgac 2040 gtcatctgga aactgcgcaa ataccgacca ctgcgactgc aatgcatcac gtaatctgga 2100 gtctgctgag agttcgtcac cgccgcttaa aatctattac cagaacacac gtgggttgag 2160 gaccaaaatg gaggatttct tcttggccgt atcggagatg gaatacgacg tagtggttct 2220 aaccgaaaca tggctgaacg accaacatct gtccacccag ctattcggaa acagctatac 2280 agtacataga gacgatcgag atcctgtagt caccggtatg tccagaggag gaggagtact 2340 aatcgctgtg gcgaatcact tgtcctcaag tcgtgttgat gttcacaccg ccatcgactt 2400 ggaccaagtt tgggtcaaaa ttgatttgag agaagtcagg atatttatcg gtgttgtgta 2460 ctttagtcca gatcaggctg ctattccctc caagattgaa gaccatctca attctgttag 2520 ggctgtttct gatattgtgt cgacctctga tatcaacctc ctattcggag attacaatca 2580 gccaaatatt gtctggtcca atgctccatc gggattcgct tatcccgatc cagttgagtc 2640 ccgcttttcc gttgccagtt caactctgtt agatggaatg tcgcttttag atctcaaaca 2700 gctgaatgtt atcaaaaaca cgattggtcg cacgttagat ctggcgttca taggaggtga 2760 taaatatgcg tcttgtaaaa ttttgccggt atctgacccg ttagttactg cggattcttt 2820 tcatcctccg atactgtctt tactggattg ctcgccacca gtacgttttg ttgaaccgga 2880 tgagtcagga cagtttaact ttcgtaaggc tgatttcaat gctcttaatt cagcacttca 2940 aaatactgat tggtctgtac tggagaccag cgacgatgta gatgacgctg tagccatgtt 3000 caatagtaca cttacgggca tgtttcggga atacgttccc gttttccgac gtcgtcttcg 3060 acctccgtgg tccaacaatc atctcaaaaa tctgaaacga aagcgtgcgg ctgctttgcg 3120 acgttactca atcaacagga atatgtacac gaagcgaata tttgtgttag ctagcaacaa 3180 gtataaatca ctcaaccgta agctctacca tagacacgtt agacaaaagc aatctaatct 3240 gaaacgaaat cctcgacaat tttggtcatt tgttaatgac aagcgcaaac aaagtgggct 3300 tccgtgcagc atgttcctag gaacgcaagc ggcaaacact caacaggaaa gctgtgacct 3360 cttcgctgaa catttcgcag acgcttttca attagttccg tctgatgatg aactgattcg 3420 agtggctctt aggaacgtcc ccactaatgt aatttctttg aactttacct actttaccga 3480 agaagatgtc ctcaatgcga ttaaaaaatt gaaaccatct acatctgctg gaccggatgg 3540 tatcccaccg atcgtcctga aacaatgtgc aagttctcta tgtcagccct tactaatgat 3600 atgcaacaaa tctgttgagc ttggtagatt tcccgagagt tggaaacttt ccgtaatgtt 3660 tcccgctttc aagaaaggtg ataagcggaa cgtgcaaaac tatagagggg taacttcatt 3720 atgcgctggt tcaaagttat tagaaacatt ggtgggccga gtactattcg ctggagtgag 3780 aaattacatc acaaccaatc agcacggttt cttcccagga agatcgatca acacaaattt 3840 ggttgaattt acatcgtttg caatacatga aatggaggca ggaggacaac ttgacaccat 3900 ctataccgat ttaaaatctg cattcgatcg ggtgaatcac gaccttctac tagcaaaaat 3960 tgagcgtctt ggtgcaacag aaagattcgt ctcatggttg cgctcatatc tatataaccg 4020 tgttctctac gtgaaggttg gcaatgcaac atcggtatgc taccaggcaa catcgggagt 4080 accacaaggg agcaacctgg ggccgctcct attctcgatt tttttcaacg acatctgctt 4140 agtgctgcca gatggaaccc ggctacttta cgcggatgac ctcaaaatat ttttgcgaat 4200 caggtctata caggactgta aagagctgca acggttaata gatgtattcc aaggctggtg 4260 ctcgaggaat ttattgttag taagcatctc gaaatgttcc gtcatatctt tcacaagaag 4320 aaagactcct attgtttgga gttacacaat gtgtggagaa ccattggaac gaacatctgt 4380 ggtgaaagac ttgggagtgc tgctcgacgc aaagctctcg ttcacggatc actactcgtc 4440 gatcatatca aaagcaaaca gaaacttagg attgattttt aggatttcat ccgaatttmg 4500 agacccttac tgcttacgtg ctctttatgt agcgctggta cgatcagtcc tagagtcagc 4560 atctattgtt tggtgtccct acactgacaa ctggggtaga cggattgaag ctgtacagag 4620 acggtttatc agatacgctc ttagatttct accctggaat aatccagaaa acttgccacc 4680 atatgaagac cgatgccgat tgttgggtct agaaaccctc tccaacagac gaagtatatc 4740 gaaagctgcc ttcgtcggaa aactgctttt gaacgccatc gactccccca ctctattgag 4800 gcaaatcagt ttcaatgttc ctcctcggtt acttaggaat cgtgactttt tacgacttga 4860 ttatcaccga acagattatg gacaaaatga gcctataaga gcaatgtgta gcgtttttaa 4920 tagtgttgtc agtgtgttcg atttcaatgt atctgtagat gtttttataa ctagactcaa 4980 aataatgttt aatagattat aagtttttta atatgttcat gtagactaag cagtcagatg 5040 aatagagatg taataataat aataataata ataataataa taataataa 5089 // ID Copia-42_AA-LTR repbase; DNA; INV; 470 BP. XX AC supercont1.229; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-42_AA_; KW Copia-42_AA-I; Copia-42_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-470 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.229; Positions 1334669 1334200. XX SQ Sequence 470 BP; 125 A; 87 C; 94 G; 164 T; 0 other; tgaggggaag tgttaaattt caatttatca tttagattta gtagggatca tttccaatca 60 gattttctct caccgccaaa ggcggagcaa tatctctctc atttagtttt tgctctctcc 120 cgtctgtagt gtacacaagt aaaataacga taccaatcac taatgtacct tgtagtccaa 180 taggtagtag ggacctcatg actaatcaat gtatcctttg tatatgtaca gaattcttgt 240 gaatacaaag ttttatactt tttctaagaa tttctgagta ctatttatgg atgttcgagt 300 ggtaggatta ccatcgaact ttatgagaaa agtcttttgt gtttctagct tggttggagg 360 aagtcaggca cagtctgact ctttcctcgt ccactttggt tggctgacgt atgctgtgga 420 ggaagttttt cctcaagcat agtcgcggca ttccactcat tgaatcaaca 470 // ID DNA-2_CQ repbase; DNA; INV; 1866 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1866 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 43-43 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. ~790 bp TIRs. XX SQ Sequence 1866 BP; 524 A; 395 C; 416 G; 531 T; 0 other; aaaggagaat gggcaaattt gtatgggagc tggtccaagg atgtacacga acgggaccaa 60 cttttttcga aagatctcgc cgagacatga aaaaaagtct tctggaccaa ctctctaaac 120 cccggggttc aaaagttaca agttgttgaa gtttgagcat tttgggttaa aatgtacaaa 180 aaatcgtatt tttgctattt ctaaaatcat tcataaaatc tctgtttcat tatggatttc 240 gattttcttg acttcattcg acgcgtatca gcaaaaacta tgagttctga tatgtcttag 300 catgattgaa acatgatttc tcttgttttt atccgtttga accgtttgtg caagaggcat 360 cccatagaaa caagtcgaaa cttcaaggtt ttgaaaccgg atccgaccca gactgcttca 420 gtgagtatgg aaggcttcag tgcaatgttg taacaactta ttgggatggt ctgagtcatc 480 cgagaactcc ttggagttaa gaccggtgag tctaccgccg taacaagcaa gatgtcggcg 540 gtttttgacc acggatcata cccgcatggc ttcagtgagt atggtagact actatgctga 600 tgtgttggag tgtcctgggt tctcctagga ctccctggag ttaagatctg tgggtctact 660 accgtaacaa gcaaaatgtc gaccggtttt gacaacggaa catacccgca tagcttcagt 720 gagtgtggta gactactttg ctaatgtgtt aggatgtcct gggtcatcca aggactccct 780 ggagttatga tctgtggagc tacagccgta acaagcaaga ggtcgacggt ttttgacccc 840 ggaacataca cgcatagctt cagacagtat ggtagactgc tttgttaatg tgttgggatg 900 tctttggtca tcctaggact ccctggagtt tagatatttg ggtcactgta acaagaaaaa 960 ggtcgatgat ttttaaccgc gcatcataac cgcaaagatt cagtgagtat ggcagactgt 1020 attgctgtta tgttgggatg ttctggacca ttccagggag tccttgtaac agccgtagac 1080 ctacagatca taactccagg gagtccttgg atgacccagg acatcctaac acattagcaa 1140 agcagtctac cacactcact gaagctatgc gggtatgttc cgttgtcaaa accggtcgac 1200 attttgcttg ttacggtagt agacccacag atcttaactc cagggagtcc taggagaacc 1260 caggacactc caacacatca gcatagtagt ctaccatact cactgaagcc atgcgggtat 1320 gatccgtggt caaaaaccgc cgacatcttg cttgttacgg cggtagactc accggtctta 1380 actccaagga gttctcggat gactcagacc atcccaataa gttgttacaa cattgcactg 1440 aagccttcca tactcactga agcagtctgg gtcggatccg gtttcaaaac cttgaagttt 1500 cgacttgttt ctatgggatg cctcttgcac aaacggttca aacggataaa aacaagagaa 1560 atcatgtttc aatcatgcta agacatatca gaactcatag tttttgctga tacgcgtcga 1620 atgaagtcaa gaaaatcgaa atccataatg aaacagagat tttatgaatg attttagaaa 1680 tagcaaaaat acgatttttt gtacatttta acccaaaatg ctcaaacttc aacaacttgt 1740 aacttttgaa ccccggggtt tagagagttg gtccagaaga ctttttttca tgtctcggcg 1800 agatctttcg aaaaaagttg gtcccgttcg tgtacatcct tggaccaaat ttgcccattc 1860 tccttt 1866 // ID Sola1-N2_CQ repbase; DNA; INV; 248 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Sola1 DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-248 RA Kojima K.K. and Jurka J.; RT "Sola DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 626-626 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. 4-bp TSDs are usually TATA. XX SQ Sequence 248 BP; 85 A; 49 C; 51 G; 63 T; 0 other; ctgcccatgt tcgcataaat gtcccatatg caaaaacagc aagctgagaa aaacgcgagg 60 gaagtttgtc ccacacataa ggctacgtgt taagttttca cgaaaaaact ggatttcctc 120 ctgatttcta gaacaaagta ctggatgtta taggctcttt tgaaagagca cacgattttg 180 aaccaaactg catcaataac tcaaaagtga tgaaaatgca tatgggacat ttatgcgatc 240 atgggcag 248 // ID piggyBac-4_SM repbase; DNA; INV; 3161 BP. XX AC . XX DT 29-MAY-2008 (Rel. 13.05, Created) DT 29-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-4_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-3161 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 523-523 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-4_SM is a young family of piggyBac transposons, CC characterized by 14-bp TIRs and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of 11 copies (they are ~99% identical to the CC consensus). This transposon may be still active, and the CC consensus sequence is a good approximation of the active CC transposon. XX FH Key Location/Qualifiers FT CDS 1239..2825 FT /product="piggyBac-4_SMp" FT /note="piggyBac transposase." FT /translation="MSNEIADYELLADEGSDIGSDDEIEDFDIINDRAGER FT IWSKEGNFVPKLHQFQEKPGPKNFRVENEQPIDYFEWFIDSEFMDYVVKET FT NLYQQQHPLPSQKKMAKWFDVTSNELSVFFAIVMLTGLVSKNELKDYWSVD FT PLLNTPIFPKTFTRNRFLQIMRFLHFANNEQKTNDKLFKIGNVVNMVKEKF FT KNAIIPDRNICIDESLMLWKGRLSFKQYIPSKRHRFGIKLFEMVDCTTQCI FT LNFIIYTGKSTKFRFHELLGISGSIVMELMNMHLDKGHVLYVDNWYSSPPL FT FEILHYKKTGACGTLRKNRKGLPKYSGKMKKGELESYNTDKLLFQRWMDKR FT EVFMLSTIHRNGLVESTKTNFQSFEKIMKPKTVIDYSLFMGGVDYSDMRMS FT FSECVRKTTKWYKKFFFHIIDMVVHNAYCIYRAKNKSRIQLADFRLILVRQ FT LIEKYGSQRSERVGRPPLSENPLRLQGRHFPDKIPINSGDNGKKCCVVCST FT SSDRKRTTIMCQLCDVPLCIVPCFKDYHTKVNL" XX SQ Sequence 3161 BP; 1154 A; 456 C; 514 G; 1037 T; 0 other; ccctttgagg accggctttt tttaaaattt atttaaattt ttaaatttta ttttttttat 60 aaatatttat ttatttcatt aaaaaccgga aaaaaaaatt aattttataa tttttaatat 120 aaaatttatt tacatgaata taaatatttt ataatatatt ttttataaaa tattgttcaa 180 tattcacata ggtttatttc attcattcat tcattcattc gaatcattca aacaaatgtc 240 tgatccgagg cgagtaggaa gaccacgcaa aagaattcca ccttcaaggg ctggttttac 300 tttggtgtgt cctgataaag gttgctatca ccctcctaca agagaagtgt atcctcctta 360 tttggaaaat gcaccatctt catgcaaatc aaaggaagag aaagcgatgt gggtaagtat 420 ggcccgacat ttgcctcctt tcagagattg caaagatgaa aaaagtagaa aatatatatg 480 cgcaaaggat accccatgct tgccatatat atttcaggag gtaaaaaatc attacgattt 540 tagttaccta aatcctgaag aatggacaaa ttcaataaaa cctgatttta tgtctgatga 600 tgattggaat cattttattg aatcatcatc gagaaaagga agagattctc aaaccgggcc 660 agctggcgaa ggtttgaaaa gaggtcttga ccaacaagaa acaaaaaaga aacgaaatag 720 gcaaatataa atccttttag atatgaaagg taggtcaaac cagcctcgaa atggtaatcc 780 tttcagatat gagaggtagg tctaaccagc ctagaattgg taccattctt ctggaaaggc 840 cttgtcgtgg cagaatggct tttcactaac ttaccaaatc attttttgtt aaaaatttaa 900 ttcaacgttt tttgaattat ctcacaactg agaattttat gacaaaaact attatgattt 960 tcagaaccag cataaaaaac tctataaaat aggtattttg ttttttctaa ttttccaaat 1020 ttttcacata tcttttttat attcaacaaa agtcgcatac ggatcacgtc cgattttttt 1080 tcgattatct tctaaactac gctatttaga ggtgagttga tattttttat gaatataata 1140 tacgaaaata aagattttga catcaaaaaa attgaattac gtaaaggatt agaaaagtta 1200 gcaaatttat tttatttttt atacattttt atagagaaat gtcaaacgaa atcgctgatt 1260 atgaattatt agcagatgaa ggatcagata ttggaagtga tgatgaaatt gaagatttcg 1320 acatcataaa cgatcgagct ggtgaaagaa tttggtcaaa agaaggaaac tttgtgccaa 1380 aacttcatca atttcaagaa aaacctggtc caaaaaactt ccgtgtcgaa aatgaacagc 1440 caattgacta ttttgaatgg tttattgatt ccgaattcat ggattatgtg gtaaaagaaa 1500 caaatctcta tcaacaacaa catcctttac catcacaaaa gaaaatggcg aagtggtttg 1560 atgtaacatc aaacgaatta tcagtatttt ttgcgattgt gatgcttaca ggtttggtta 1620 gcaaaaatga gttgaaagac tattggagcg ttgatcctct actgaacact ccaatatttc 1680 caaagacatt cacaagaaac agatttcttc agattatgag attccttcac tttgcaaata 1740 atgaacaaaa aaccaatgac aagcttttca aaattggaaa cgttgtaaat atggtcaaag 1800 aaaaattcaa aaatgccatc atacccgata gaaacatttg tatagatgaa agcctaatgc 1860 tgtggaaagg aagattgagc ttcaaacaat atattccatc aaaacgccat cgatttggaa 1920 taaaactatt tgaaatggtg gactgtacca cacaatgcat tctcaatttt atcatttata 1980 ctggtaaatc tacaaagttt agatttcatg aattacttgg tataagtgga tcgatcgtaa 2040 tggaacttat gaatatgcat ctggataagg gacacgtttt atatgttgac aattggtatt 2100 caagcccacc attgttcgaa atacttcatt acaagaaaac tggagcttgt ggaacattac 2160 gaaaaaatcg caaaggtctt ccgaaatatt ctggaaaaat gaaaaaaggt gaacttgaaa 2220 gctacaatac tgataaatta ttgtttcaaa gatggatgga taaacgcgaa gtattcatgt 2280 tgtccacaat tcatagaaat ggactagttg aatcaacaaa gaccaatttc caatcattcg 2340 aaaaaataat gaagccaaaa acagttatag attacagttt attcatgggt ggtgtggatt 2400 attctgatat gagaatgtca ttttcagaat gcgttagaaa gactaccaaa tggtacaaaa 2460 agttcttttt tcacatcatt gatatggtag tacataacgc ttattgtata tacagagcga 2520 agaacaagtc aagaatccaa ttggctgatt ttagactgat attggttaga caattgatag 2580 aaaaatatgg atctcaaaga agcgaacgag ttggaagacc accactaagc gaaaacccac 2640 ttcgtttaca aggaagacac tttcctgata aaattccaat aaattctggt gataatggaa 2700 agaaatgctg tgttgtttgt tcaacttcaa gtgatcgaaa aagaaccaca ataatgtgtc 2760 aactatgtga tgttccatta tgcattgtac catgtttcaa agattatcac acaaaagtaa 2820 atttataata accatattta ttgataaaaa taaaatagtt tatttttcga ttgaattttt 2880 tgtcttttct tatttttaca tattgattcg tagtttttag aaaaaaaatt tcttgaaaaa 2940 aatcattcgt tttgtttagt tttataaaaa tacacacaga aatgtgattt tcattatagt 3000 ttgatttatt ttagttttta gctaatatta tgtaaacagt gtgttttatg aaattttttt 3060 atttgaaaaa aaattttctt ttttaatttt atataagaaa aaccacattt gatattttta 3120 gaactagttt gccactatag tggcggccgg ccctcaaagg g 3161 // ID Gypsy3-NVi_LTR repbase; DNA; INV; 148 BP. XX AC AAZX01003557; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3-NVi; KW Gypsy3-NVi_I; Gypsy3-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-148 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1121-1121 (2007). XX DR Genome; AAZX01003557; Positions 9639 9786. XX SQ Sequence 148 BP; 22 A; 52 C; 43 G; 31 T; 0 other; tgtggcggtt gctccacccg gacagttggc gccatctcgc ggtcgccgca aacaatgcct 60 gaagcctcgg gacgtctact agagggctct gtcttccgca gggtgcttcc cgccagagtc 120 gagcacgccg ctcgctctct tatcgcca 148 // ID L2-1c_Cis repbase; DNA; INV; 5475 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE CR1 Non-LTR Retrotransposon from Ciona savignyi. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-1c_Cis. XX NM L2-1c_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5475 RA Smit A.F.; RT "L2-1c_Cis - CR1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000401, ORFs from 77-778 and 832-4155 (1320 bp 3'UTR). Over CC coding region 75% similar at DNA level to L2-1a and 1b. L2-1a to CC c are all probably currently active in Ciona savigny. XX FH Key Location/Qualifiers FT CDS 86..775 FT /product="L2-1c_Cis_1p" FT /translation="MESLLMEKLSKMEETMLAFKGKQEGLEARFEKEPDTA FT PSLNIAGSSDRRKEILEMRSSLNALEASFKELKALIERTDERLDGLEQYGR FT RNCLVLHGCQGIPNNRDQFLKYILNXFNKMXLPYPIVPAEIDIAHVLPTKE FT NRTPIIVKFVRRMVRNDIFNSKRCLKGTKMAITESLTARRLRIVEKAKEVF FT GFRSVWTXNGAVYIMHNNKRQVLHRLSDITNILASTPGST" FT CDS 841..4152 FT /product="L2-1c_Cis_2p" FT /note="reverse transcriptase." FT /translation="MPPSKRKTCLICSKLIRINQNFVFCDSCHSNSHLNCL FT KNTHLSLNANNIKKNIYXSHCNWLCNTCSSESHLPFHEISDSELSLLLSPS FT TTAQINLDPSKLNSFFNYLDDDVLDKEGNEETTIGCPELYITSNQCKPHMF FT ESEQNFSTLSLNIRSLANPHNFSKLEALVLSLKFKPEVIAITETWIVGNQT FT GQYSNLPGYVFVSNSRTEHRGGGVGMYIRSDLDFHVKECLSVMEERVFESL FT FVTFPNLLPQPTPNSKSSLICGVIYRPPRSDSESNSFFLENLLKLLEKIET FT RHSQCIIVGDFNYDLLIGNNNHINNFKDAMHENCYQSIINKPTRITDSGAT FT AIDHIWTTIDSSFIRAGILTDSISDHLPVIMALNTKTDNKSSIIQSKSRIF FT SQSNMSHFNTAIQTMELDSIYYESDPNIAYSKLMTLYNTKFDSSFPLVKIT FT SDKSNTWFSEDIQKLNKIKQKLHKKFIQKQTEDSKLKYHSARNIYNRKILE FT SKKAHYRKLFDFNKKNMKATWRSINNLLGRLKSKNPRSFFVDGVRTSDPKC FT IANHFNNYFSNVASGLVKQIPKSXCHHTDYLGTQSPSSMFLFPTSPSEIKT FT IIKDLKSKSSSGMDQIPTKLIKSTPENILLALSHIFNLSIQSGEYINAFKI FT AKVIPIFKKGKPTLINNYRPISILPSFSKVLEKIIYNRTSAFLTRNCFFNN FT QQFGFRKGHSTSDACNVLVNIVNEQLEKKKSVIGVFLDLSKAFDTIDHGIL FT LSKMSHIGIRGVALDWFGSYLSDRKQIVDFNGVLSSHMNPVKLGVPQGSIL FT GPLLFLIYINDLPNCLKHSQTIMFADDTSIFTPGVNQSVIYKNVNDDLGRV FT SSWLCTNKLILNIDKTKYMYFSNSTKPVPQPPPIVINNYPIEKVHSFKFLG FT LTINEHLSWKPHMLTLIKKLNTNMLMIRKIKHLIDRPSLITLYHSLIVSHI FT TYCISTWCYGNNQLLNKLQRICNKYIRMIFNLGKQENTVRIMQQHSFLTIH FT DMHKLEILSIMHKCKHHNVPPSILQCIKLKPMTSGMSTRQKSRFIIPYCSK FT TTTQQSIKYIGPKLWHQLPTSXRDIKSFNKFIKVVKQHLLDDPHPLP" XX SQ Sequence 5475 BP; 1776 A; 1093 C; 811 G; 1781 T; 14 other; cattctattt gaataaggtg aatgcaagga taggaaaact tgttaacttg gccttttaaa 60 aaactntatt atttgaaacc gaaaaatgga atctctactg atggagaagc tttcaaagat 120 ggaggaaacg atgcttgcgt tcaagggtaa gcaagaagga cttgaggcca ggtttgaaaa 180 agaaccagat actgctccat ccctaaacat tgcaggttcg tcggatcgaa ggaaagagat 240 cttagagatg agatcctctc tcaacgcatt ggaggcctcc tttaaagaac tgaaggccct 300 gattgaacgc accgacgaac ggttagatgg attggagcag tatggaagaa gaaactgcct 360 cgtccttcat ggctgccaag gaataccgaa taacagggac cagtttttga aatacatttt 420 aaatatnttt aacaaaatga anttaccata tccaattgtt cccgcagaaa tagatatcgc 480 acatgtgctg ccgacaaaag aaaaccggac cccaattatt gtgaagttcg tccgaagaat 540 ggttcggaat gacatcttta attccaagag gtgcctgaag ggaacgaaga tggcaataac 600 cgaatcattg acggcaagac ggctccggat cgtggagaaa gccaaggagg tgtttggttt 660 tcgaagcgtg tggaccnaca acggggctgt ctacatcatg cataacaaca agaggcaagt 720 gctccataga ctatctgata ttactaatat cctagcttct acacccggtt ccacttagtc 780 aatactatta tttatcggta taatttanta tgttcnaatt ggccagttta aacatcccgt 840 atgccacctt ccaaaagaaa aacatgtctg atatgcagca agctaatccg tattaatcaa 900 aattttgttt tttgtgacag ttgccactca aactcacact tgaattgtct aaagaatacc 960 catctaagcc ttaatgcaaa taatattaaa aagaatattt atanttcaca ttgcaactgg 1020 ctgtgcaaca cctgctcttc tgagtcacac ctgccttttc atgagatttc cgactctgaa 1080 ctttcgcttc tactttcccc ttctactact gcccaaatca atcttgaccc aagtaaatta 1140 aattcttttt ttaattacct agatgacgat gtgttagata aagaaggtaa tgaggaaact 1200 accattgggt gcccggagct atatataaca tcaaatcaat gtaaacccca tatgtttgaa 1260 agtgaacaaa atttttccac cctgagtttg aatattagat ctttggcaaa tcctcataac 1320 tttagcaaac tggaagcttt agtattatct cttaaattca aacctgaagt tattgcaata 1380 actgaaacat ggatagttgg aaaccaaaca gggcaatatt caaatttacc cggttatgtt 1440 ttcgttagca acagccggac tgaacacagg ggtggtggag ttggaatgta catcagatct 1500 gacctagatt ttcatgttaa agaatgccta tcagttatgg aggagagggt ttttgagtcc 1560 ctttttgtaa cttttcctaa tttactaccc cagcctacac ctaattcaaa atcatcctta 1620 atctgtggtg ttatttatcg gccacctaga tcggactctg aatcaaattc cttcttccta 1680 gaaaatttat taaagttgct tgaaaaaatt gaaacaagac actcacaatg catcatagtn 1740 ggtgacttca attatgacct tctnattgga aataacaacc atatcaacaa ttttaaagat 1800 gcaatgcatg aaaattgtta ccagtccatt ataaataaac cgacccggat tacggactca 1860 ggtgcaactg caattgacca tatttggaca actattgaca gttcattcat aagagctggc 1920 atattaacag acagtatatc tgatcatcta cctgtcataa tggccttaaa cactaaaaca 1980 gataataaat ccagtataat acaatcaaaa tcaagaatat ttagtcaaag taatatgtcc 2040 cactttaata ctgccatcca aactatggaa ttggattcaa tttattatga gtcagatcca 2100 aatattgctt attcgaaatt aatgactttg tataatacta aatttgacag tagttttccc 2160 ttggttaaaa ttaccagtga taaatccaac acttggttta gtgaagacat ccagaaactg 2220 aacaaaatta aacagaaact tcacaaaaaa ttcattcaaa aacaaaccga agactccaaa 2280 ttaaaatatc attctgcaag aaatatttac aatcgaaaaa tcctggagtc aaaaaaagct 2340 cattacagaa aactttttga ttttaataaa aaaaatatga aagccacttg gagatccata 2400 aacaatttac ttggtagact taaatccaag aacccaaggt cattttttgt agatggtgtt 2460 cgcacaagtg atccaaagtg cattgccaac cactttaata attatttttc aaatgttgca 2520 tctgggttgg taaaacaaat tccaaaatcc myctgtcatc acaccgacta tctgggaact 2580 cagtcaccct cctccatgtt tttattcccc accagtccat ctgaaattaa aactattatc 2640 aaagatctaa aatccaaatc aagcagcggt atggaccaaa tcccaacaaa gctaattaaa 2700 tcaacccctg aaaacatttt gctagctcta tcccatatct tcaacctttc tatacaatct 2760 ggagaatata tcaatgcctt taaaattgct aaggttattc caatttttaa aaaaggcaag 2820 cctaccttaa taaataacta caggccaata agtatattac cttcattttc aaaggtactc 2880 gaaaaaataa tctacaatag gactagtgct tttctaacac gaaattgttt ttttaacaat 2940 caacaatttg gatttagaaa agggcactca accagcgatg cttgtaatgt gttggttaac 3000 attgtaaatg aacaattaga aaaaaagaaa tcagttattg gtgtttttct tgacctttcc 3060 aaagccttcg atacaattga ccatggaatc ttacttagta aaatgtccca tatcggtatc 3120 cgaggtgttg cacttgattg gttcggcagt tacctgtccg atcgcaaaca gatagtagac 3180 tttaatggag ttttatcatc ccacatgaac cctgtgaaac ttggtgtgcc tcaaggatca 3240 atccttgggc cattattatt tcttatatat ataaatgacc ttccaaattg ccttaagcac 3300 agccaaacca taatgtttgc tgatgacacc agtattttta cacctggtgt taatcaatct 3360 gtaatttata aaaatgttaa tgatgactta ggtagagtat cttcctggct ctgtacaaac 3420 aaattaattt tgaacattga taaaaccaag tatatgtact tttccaactc taccaaacca 3480 gtgccacaac ctccccccat agtgataaac aattatccaa ttgaaaaagt tcattctttt 3540 aaattcctgg gattgactat taatgaacac ttgtcttgga aaccccacat gctaacactg 3600 attaaaaaat taaacaccaa tatgctcatg atacgtaaaa ttaaacactt aattgaccgc 3660 ccatcactga ttacattata tcactcactg attgttagtc atatcacata ctgcatttcc 3720 acctggtgtt atggtaacaa tcaattacta aataaacttc aacgtatatg taataaatat 3780 atacgaatga tatttaacct tggaaaacag gaaaacacag taagaattat gcaacaacat 3840 agttttttaa ctatacatga catgcacaag cttgaaatat tatctattat gcacaaatgt 3900 aaacaccaca atgtaccccc ttctatttta cagtgtatta agttaaaacc aatgacatca 3960 ggcatgtcga ccagacaaaa atcccgattt ataattccat actgcagtaa aaccaccacc 4020 cagcaatcca taaaatatat tggccccaaa ctctggcacc aacttcccac tagtatncgt 4080 gatattaagt catttaacaa atttattaaa gtagtaaaac aacacctctt ggatgatcct 4140 caccctcttc cttaaccatt gttaatttat ttaatatcat atgattttca aaccctccct 4200 tataatggcc cacttttaca attccttgta tctcagtaaa aataattctt ggttaacata 4260 cataatctta tctatatcta agcttattct tataatatcc gaacccattc ttaaattccc 4320 tcttactatc acaccactag tatattttac cttttgttct accactcttt tctaactctt 4380 tcatgttttg atttgtttta ttctttgttg tatcacctca atcttttaat gtaacacctg 4440 caatagatgt tattttttta tttttatttt ttaatacacc actttatttt tgtccacatg 4500 attactcacc aattcgttcc accatattac accgcaggct gtaactgtta catggtaagc 4560 cgatggcaat antattttga agtggatttt ttgttgatgt ttaaatttag ccggccctaa 4620 cagttacctt gtatttagaa tgctccacat ccaaccacca ctaatgtagc gcactagctc 4680 acctgcaccc gcatgtatca cttacaagta aaaccctttt tctttctctt tcttttacta 4740 ttgttttatt taagaaccct actgtccaga ttttctattt ttcagagcat tgggagatct 4800 agtgacggca ggaggctctt ctgttgttgt ggcgtcacta taaatcatca tctcaagaat 4860 ttatttttcc ggaacttgct gcatagttta ttcatttttc atttttattt aatattagtt 4920 ttgtcagtta ggcagtcttt gcctacttat tgtttgtact atgcataatt gttatgcata 4980 cttaaagtgc aatattgttt tgcacaataa tgttttttct aattaatttt aattatttat 5040 ttatgtattt natttttaat taatatactt ttaaatgcct tggtaagtgt tgcacttgaa 5100 acttttcagt ggaatttcat ttttggggtt tttctttttc cactgcggcc cacactttcc 5160 gggccacccc ttctcctaaa gccgactccc ccattgcacg ggtcgcttta ccgcacactt 5220 ttcttcgagg cttagtgcaa tttttatttt ataaacaatt aattaattat ttttaatatt 5280 ttttttattt tttttaattg ggtttagcgg ccacgtcaaa ttatacagaa ttttaatagt 5340 ttatttcttt tttttagacg tgttctaacg actgtcgttt tgccgttttt tcccgttttc 5400 caaattttta ccgatagtcc cgcttcgtgt tttcatcgtg gaaacaaaaa ataaactaaa 5460 ctaaactaaa ctaaa 5475 // ID L1-5_Cis repbase; DNA; INV; 6432 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-5_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6432 RA Smit A.F.; RT "L1-5_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000757, Ci000758; 5% div. XX SQ Sequence 6432 BP; 2528 A; 1162 C; 849 G; 1784 T; 109 other; cttggaaaaa aatctacacc atattcaaat agaaanctng ttggactatt tcaacgttca 60 aaaactgtgg atatacgtta tttgggattt actctnttac cctcaaacgg aagactcggt 120 aagtccgtag tttttgcacc ctgtctcntt aacttntttg gtatttcttc gattgctagt 180 gttgccctaa ctttttgcat aattaattca ccattttttt cctggcaaca ctaataattg 240 aaaaagcgcc cctaattatt acnttacaca ctaggactta gtttttttca gcttccatac 300 ggcgaagctc tgcccatttt ttctagacaa tactgccatc taacggctta attttcccat 360 tgtaatctaa ttcgccccat ttttncgtac tacgccacct tacgactact tcactgcgat 420 aantattnac ntggttagtt ttttgagctt caatccgcga agctctacca aantttctgn 480 cgcaatactg ccanctaacg gtcanatttt ttctcctcaa ncataattta antcgctnca 540 tctcccatac tacgccacct tncgacnact tctctgtcga ncaatatcaa cgtggtgatt 600 tcttgaacaa tagcnaaatt taacttaacg taantnaatt gcnncacaac naaacagcct 660 tatatctnta tcatatcata nactaaaata acacaatatg gcngcnncna aagaagacat 720 cccggaattg gaaatcaaca tggaaaggtc gatnatcata aaactcaccg gaaacgtcga 780 attgatcagc gtatcttcat ttatggagat atttgcagag aatggccctc tttgctatct 840 tggcccagac gtggatggtg tcatcacnga gagtttcaaa gactcngaat tcctngtcac 900 tctgaaantg atggatggnn aatgtcccga cctggangaa ataatnaaaa taatnaataa 960 taaaaaattg gaatttactg gaccagcagg ctccatcatg tccgtggagg ccaagtatcc 1020 agagcctcca ctcgagatag taaccctcta cccggttccc tgcctcgtca cggagtncag 1080 acttcgccaa ctnttgacca aatataagtg gggaaagatn cattcnttct tcttcggnac 1140 ccatcgtcac ttcccggata taaagaacgg ctggctgacc atcaagatgt ataacataaa 1200 tataaataac atacccaagg tnatgaaagt tggaggacgc tgggtcaccg tcacccggcc 1260 cggagaatcg catcttcctc tgtgcaggtt ctgcaaggag agaggacacc tccaacagaa 1320 gtgccccaaa aaaggttact gcaacaactg ctcnacntnc gggcacctna cangaagatg 1380 cagatcagcc ccaatgagac ccactcaacc caatcanaac caacctcaac acggnatgac 1440 ccaataccag accccagcca cgaccttaaa ccaatggata accccgaaac tggcaaagaa 1500 aaaaacntac caaccaacca accagcaacc natttccata caaaatatgt tctcnacact 1560 acaacaattc gacgacactg accacagcat ggatgaggag gatacgacaa tactaaaatc 1620 tggacccaac cttcaactcc cacaaagaac cccacgaaaa aataaaaagc aattcaattc 1680 ggccaattca accccaaaac tcacaattga cgactcccct accaacctca ccaccctatt 1740 tgctcaaacg gaatctggac aaacgccaac accagaatat atgatcaatc aatccgatca 1800 aagaaaatat gaccaatcca tcccatcaat cacaatcaac caagagntna caaaagaaaa 1860 aacgtcacct caacaaacaa atttcacaaa ggaaccacaa caaacatcgc aaccacaaca 1920 agactctcca tccacancat catcccaaac aaatggcata tcacccaccc caacggatcc 1980 ctctacggaa aattccccag attcggcctt tgaggtcatt aaaaaggcac acaaaaccct 2040 gggccttgat ccaaagaccc attcctctgc aaaaccncca tacattcgac gaacnataaa 2100 agaaataaca cgcgaactnt tccacccaga atctccatcc aagaaaagaa gaaccctcga 2160 acactcataa acagtcatag tgtgacatac accttatttt aataaaaaca acatatacan 2220 taattnatca cacctatatt ntcataantg aatcatggat ataacaaata ataatgaaat 2280 aacnatnaac aactatccnt taaatatngc naccatcaat attaatggtc ttaaaacnaa 2340 attgaaccat ttgnataatt acataaataa taataatatn gatataatgt gtattcaaga 2400 aactcaccgc gttgatattg aaaccatntc caaagtcgaa aatcanttcc atctttccgc 2460 cttcttcaat accgcgattt ccagtgaccc taaaganaaa tgtcattatg gtacngcnat 2520 antnttnaag caacatattc ttaataattt taatgtagca catagtatac ttatagaaaa 2580 tatggtnaat atagtaacac taactaataa taattcttaa atatnactat aattaataca 2640 tacttaccgt gtgganctaa aaatagaatn ntacgtaaag aatatataaa tataattaaa 2700 gatagcatac aagtatccga tacatcntac tatataattt tgggcgactt taatatgata 2760 acgaatgatc tcgatataaa aaataactat gataaacgaa ggaaagtaga tcgaacggcg 2820 tgggttaacc ttgaaaaaga aaaatttatt actgatacct ttagacgatt aaataaattt 2880 gaaattaatt tttctagaat aactaatgta tccgcctcga gaatcgaccg natttatacg 2940 agcaaaaacc ttaatcactt actnattaaa tattntcata aacgcaactt tttttccgac 3000 cataataact gtcctatact atccttaaca attcaaaatt cgaaacgctg gggtccatcg 3060 tattataaaa taaataattc tatattagaa gataatgatg taattcaaac aatggtatat 3120 caatgggaaa attgggcatt gcaaaaaagg aaatatgaca atatattaaa atggtgggat 3180 gatgggaaaa ttttattaaa aaatatagct acagattact ctcgaattaa agctcgaact 3240 gagaaacgaa actatgaaaa cgcaataata gaattaacta aacaaaatcg cctaccggat 3300 tcggttaata gaaataaaat aatagatatt ctagataata atatctctgt atatgaaaag 3360 aaattaaacc aaggagcctt aattcgatct aaaataaaag aaatctcaaa cgaggaaacg 3420 ccgacaaaag ctttttttga ttatgaaaaa aagcatcaaa aaagggatac aatctataaa 3480 gttttaaatt gtaacggtga aataacgcga aatgaaatag aaacggttaa aacaatatat 3540 aatttctacg anaatctttg ggggaaaaaa cctaatacgg taatacccga aaattattta 3600 aacgtattta gtccgatcgt aattgacgat ttagataatt taaataaacc tattaatatt 3660 aacgaaattg aaatagccct aagtgatact aatgtcaata gctcaccagg aagcgacggg 3720 ttaagctacc ttatatataa aaaacttttt aatacaataa aatacgatat ggaagaagtn 3780 tttaataata tctatataaa gcagaacctt tcggaatcaa tgaaaacctc tatagttaaa 3840 ttaattcaca aaaaaggtga aaaaacaaac ttaaaaaact ggcgtcccat atcgattctt 3900 aactgcgact ataagatatt aagcaaaata atcagtaata gacttaaaac tataattgat 3960 aaaataataa gcccaaacca aaggtgtggt ataaaagata gacgtattaa tgacgcacta 4020 tataatatcc aaagtgtaat agactcgtca aaatatttta ataaaaatgt aacngtaata 4080 gcngtcgatt tcgaaaaggc attcgataga acagacccan attttataat ccaaatttgt 4140 aaaaaactaa atttgcccaa aacaattatt gattggatta aaattatgta tacagacata 4200 antagtaaaa tagaaataaa cggaaccttt acaaaaaaca taaaaattaa acgtggaatt 4260 cggcaaggtt gtcccctaag catgctgctt tttttaataa acatggaagc gttaacacgt 4320 aaaattaaca ataatagtaa nataattgga tatcgcctta ataaaataga aataaaaacc 4380 gaacaatatg cagacgacct aacaattttc acccaagaca aatcctccat taaacatata 4440 tttacggaat tggaaaacta cggaaaagta tcggaccaaa taattaacat cgctaaatcg 4500 caagtaatta gtaacgatat tatatctatt aacgaaatta aaaacgtttt tcccaacatc 4560 caaatcacca aaactattaa aattcttggn atatatttta atcttaacga taattgcatc 4620 ccaaaaaatt tggagaaaat agaaaaacan attacnaata ttttaaatat aaacaagcga 4680 cgcaagctga cnctgaacgg aaagaaagca ataattaata cgttaattct gccgactata 4740 aatacaattg gtgatttatt ccanatcgat aaattaacaa taaataaaat taataataaa 4800 atatttaaat ttatatggta tccaacccac tttgaatttc taaaaagaaa taaantatat 4860 gctccctata aaaacggtgg nttgcaattc ccngatatac aaacaaaact tgatgcattt 4920 aaagcaacaa gaatttcaaa actaaaacta ttaaataaaa taactgattt ttggcaagag 4980 tgggcccgat ttaacctcgg ttcaacaatg aaacaaataa ataaattatt gtataccaac 5040 agccttccga ataaggattt tcccgacccc ttttataaaa taattcgtca ccttttttat 5100 aaattaaata aacaaaacta tgactggaat aatgggaaaa cgaaagattt ttatgccgca 5160 cttattacgg aaaaatggga aaataccgaa attaaaataa atgacaatat cgttccctgg 5220 aaaaatatta atctacaaaa caaaagcatc aagcattttt ttacgaatat agatagggat 5280 attgcataca aattggccca aaatattatt ccgcatggat tttggtatga atctaggaaa 5340 attagtaata tatataacgg taaaatatta atacggcaat gtaaattttg tagacaagaa 5400 aacgatacnt tcgagcatat atttcttaaa tgtaaaatag ttaaaaatat aatagataaa 5460 ctttttcaat atgcaaataa tatatcttct aaacaatata atagaaacga taatttaatc 5520 ttatataata aaggtaataa ctctcaaaaa cgggatcttt tggtaataaa aattttatct 5580 attgccaaaa aggaaatatt acgcgaaaaa caaaaattag atatacaaaa tatatattta 5640 tgggacaatg tagaatttgt gcgaaaaata ctttgggttg tcaaaataaa atttaaacga 5700 tcaattagaa aacaaattgc tatctttaat cttaattata ttcaggagaa actatatctt 5760 aatagtaatt ttactccata tgatggaaca taaattaaaa taataaaccc atatctaaat 5820 aatttatagt atatctaaat gcatatgtta atatatgtag aatattataa tagtaaatat 5880 ctaataaaat aataatatta ataatattta actaattaat aagtatttat tatatacata 5940 aataatatta tagagactgg gtgcaaaact atggactacg ctaaatttta ttaaatttaa 6000 atctaaacaa ttagttaaaa tatgcaaagt acttaaatat aaccttcaaa ttacatagca 6060 gggtcttccg tttgggggta actcttcctt ttcttcatat taaaagattc gtcaattcgc 6120 catgtaatct tcatcattca tcatgctcca cctgaaagtg aaaactacaa taaattagat 6180 cgtatttaaa ctcataacag attaaacgta attctttaaa attgtatata ataatgtaaa 6240 tatgtaaatt gtaaatatgt aaatatgtaa aatatgtaaa tatgtaaatt aattaatgaa 6300 tgtcttactt tttttagtga tccgttttgt tctttgtgta aattattatc attgtatatt 6360 tttcgcatta ttatgtacgg ccgggaaacc ggtatgtgtg tttattagtc gcccaataaa 6420 aaaaaaaaaa aa 6432 // ID Harbinger2-1_MBr repbase; DNA; INV; 3334 BP. XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 13-AUG-2010 (Rel. 15.09, Last updated, Version 1) XX DE A family of autonomous Harbinger transposons - an incomplete DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-1_MBr. XX OS Monosiga brevicollis OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. XX RN [1] RP 1-3334 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1216-1216 (2010). XX DR [1] (Consensus) XX CC Harbinger2-1_MBr belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-1_MBr is a consensus sequence of a family of CC autonomous Harbinger transposons that were active in the CC Nematostella vectensis genome very recently. This CC transposon is probably still active. The assembled genome CC contains only two copies of this transposon, they are >99% CC identical to each other. The consensus is incomplete due to CC the insufficient sequence data: its 5' and 3' termini are CC uncertain. The monosiga genome contains, as minimum, two other CC families of Harbinger2. XX FH Key Location/Qualifiers FT CDS join(3131..2895,2612..1662) FT /product="Harbinger2-1_MBr_2p" FT /translation="MASLSGDSAVEEELKSACSLLQIPILGRSVSPVEASE FT RCLKEVIARGMEMGLHLRRSTATRKLSLRCFDAAQLNIVCQRLFSELSITP FT SEYTQLRRNIWHTLDLPGAKSRMGPRHEWWGVFWDVTQQYLSDVQKIPHTG FT NADPSNDSSVPPGRREIWGRYSQLLRTMKMERETVLQAESRGKEQAVLRQA FT VLEEAGQATLHRGHSVFPLDRQNHVAALPSLTPSRSASEGIDHTPHVESND FT EQSAFATDAIAESQPCAEPLVSTDLPQTAAYPRLGEARTDRALVEDESSHI FT GMRPGWFEGAIGEALVTLRQPALSQQQQLVDALRSEMKADMSVMNEKVTGI FT QQSIEELKSILLGSGRQDTLPTTVDTGKRTMQETQAPEQTQRRSTRARSSQ FT KE" FT CDS join(129..890,965..1036,1123..1566) FT /product="Harbinger2-1_MBr_1p" FT /note="Harbinger TPase." FT /translation="MPRPREAIVSADRLLRDSRFSASARRAIFQKRKDYAS FT KVALAFFLQTLARLVTEHRAGLLPDDFNADEAILFLCLAATNMVKNATREL FT ERPPVVIAENGTPHVNLARFTETAWAQCFGLTNAAQFERLAAALAPHINEH FT LPVPLREALLAFFALFHRSSRLIDAIHTLGVSWTVPKMSQVINQFALAVAK FT KFQRRLAFDARFFQPDWTTAAVAAVAAKGSPLPGICAFIDGTGQNLARPKG FT KDNQAAFYSGKDSKHTTRYQGVVAPNGVLVSFAGPWRPRHDAYILEQSGLL FT DNLKRVWPEETPWCLLGDSAYPRSKRLVRPAKRDEGWTATTEALNAELGRL FT RIVIEWAFGASGYGVASRFEGVVDSRRMLVGERPVSAYMLTAVLLTNMIVC FT DDQSNPVAQYFGHKLAIPSLTEYMAPLSPSER" XX SQ Sequence 3334 BP; 746 A; 900 C; 845 G; 843 T; 0 other; agagactttg caagcgacct tcgtttgtga atcaaaaagg cacaaaaaca tgtgtaaact 60 tttttcacac ttccggtggc accctttgta atcaaaggtt tctttaccta atttaccgtt 120 actgagtcat gcctcgaccg cgcgaggcca tcgtctcggc ggatcgcctg ttgcgagata 180 gccgcttctc ggccagtgct cgccgggcaa tctttcaaaa aaggaaggat tacgcgtcaa 240 aagttgcgtt ggcgttcttt ttgcagacgt tggcccgctt ggtgactgag catcgagccg 300 gtcttctccc tgacgacttt aatgctgacg aagctatcct cttcctctgc ctggcggcga 360 caaacatggt caagaatgca actcgtgaat tggagcggcc gcctgttgta attgctgaga 420 atggaactcc tcacgtgaac ttggctcgtt ttacagaaac ggcatgggca caatgttttg 480 gactcacaaa tgcggcgcag tttgagcgac tagcagccgc gctggccccc catatcaacg 540 agcacctgcc cgtcccactc cgggaggccc tgctcgcttt cttcgctcta ttccaccgat 600 cgtcgcgcct catcgacgcc atccacacct tgggagtgtc ctggacggta cccaaaatgt 660 ctcaagtcat aaatcagttt gcgctggccg tggcgaaaaa gtttcagcgc cggttggcct 720 ttgatgcacg tttctttcaa ccggactgga ccacagccgc tgttgctgcc gtggccgcaa 780 aaggatctcc attgccagga atttgcgcct tcattgacgg cactggtcaa aacctggcgc 840 gccctaaagg aaaagacaac caggccgcct tttactctgg taaagactcg gtgagttgac 900 tgccgcccgc tttttttttt tttttttttt tgtgctgggc ctcatgttac tatacatttt 960 gtagaagcac accactcgct atcaaggtgt tgtggcaccg aacggtgtcc tcgtcagttt 1020 tgccgggcca tggcgaggta gggagtgccc ggctattcgg atgctcactt ggatgttggt 1080 tgccgggacc tggcttacct caacgtactg actccactag ggccgaggca tgatgcttac 1140 atactcgagc aatcgggctt gctcgataac cttaagcgag tatggcccga agaaacacca 1200 tggtgcttgc tgggagactc agcgtatcct aggtcgaaac gcctggtacg ccccgctaag 1260 cgtgacgagg gttggacggc gacaactgag gcactaaacg ctgagcttgg gcggctccgt 1320 attgtgattg agtgggcctt tggtgcttcc ggatatggag ttgcatcgcg ctttgaaggc 1380 gttgtagatt cacgtcggat gttggttggc gagcggccag tttcggccta catgctcaca 1440 gcagtattgc tgaccaacat gattgtgtgt gacgaccaga gcaaccccgt tgctcagtat 1500 tttggccata agttggcgat cccttccctg acagagtata tggcgccgct atctccttcg 1560 gaacgctgaa tgtgtctgac aaggcaggga ttggataaga atcaagggaa ggaattcaaa 1620 tttattctgt tgatcaaaca tgaaatattt tactctcgtc actccttttg cgatgaccgg 1680 gcgcgcgtag accggcgctg tgtttgttcc ggggcttgcg tctcctgcat cgttcgcttg 1740 ccggtatcta cggttgtcgg caaagtgtcc tgccgtccac taccaagtaa aatcgacttg 1800 agctcttcaa tagactgctg aatgcctgtc accttttcgt tcatgacaga catgtctgcc 1860 ttcatctcac tcctcagagc atcaacaagt tgctgctgtt gcgatagggc aggctggcgc 1920 aaggttacaa gggcctcgcc gatggctcct tcaaaccagc ctggtctcat ccctatatgg 1980 gaagattcat cttcaacgag ggcccgatcg gttcttgctt cgcccaatcg ggggtatgct 2040 gccgtctgag gtaaatcggt ggataccaga ggctccgcgc acggttgcga ctctgcgatc 2100 gcatcagtcg caaaggcgct ttgctcatca ttgctctcaa catgaggtgt gtgatcaatg 2160 ccctctgatg ctgaacgact cggcgtcaaa gaaggaagtg cggcaacatg gttttgcctg 2220 tcaagaggaa acacgctgtg acctcgatgc aaagtggcct gtccggcctc ctcgagcact 2280 gcttgacgca agacagcctg ctcttttcca cgagactcag cttgcaagac ggtctcacgc 2340 tccatcttca ttgtacgaag gagttgacta taccggcccc aaatttcacg acgcccagga 2400 ggcacggacg aatcattgct tggatccgca ttgcctgtgt gcggaatctt ctggacatcg 2460 ctcagatatt gctgtgtaac atcccaaaaa acaccccacc attcgtgacg cgggcccatt 2520 cgagattttg cgccaggcaa atctagggtg tgccagatgt tgcgacgcag ctgcgtgtat 2580 tcgctcggag tgattgataa ctcactgaac aacctgccat catcccagag catttttgag 2640 catcggcaga aaaatgcgca taccattcaa aaatgcttac cacagcttca tggcacgttc 2700 atagataccc acgactgtca gccaatcaat cttgccagtt gtggaccttg cagcgagttg 2760 catttctgcc caagggtcat tcaggtcgaa cgttcgaaat ctaaaaccac gagttgattt 2820 aagctgaatg gcgacagtcg gagaacgaat agcctcgcag gagagacagg caataatcaa 2880 cacctgcctt accttcgttg gcacacaatg ttcagctgag ctgcgtcaaa acacctcaga 2940 ctgagcttcc gagttgccgt tgaccgccga agatgaagcc ccatttccat gccacgagcg 3000 atcacctctt tgaggcagcg ctctgatgct tccacaggac taactgaccg accaaggatg 3060 gggatctgca atagggaaca agcacttttc aattcttcct ccaccgcaga gtctccactc 3120 agggaagcca tggtggcact gccgtttatg gcgccctcga gcttttgtgc cttgccgtca 3180 aaaaatgcaa aagctcctct tcaaaaagat caaaaaggca tgtttgcacc aaatctcgat 3240 ttttgcgccc acctttcgta ccgtttttgt ctgacaatgg atttcgtccc tcgaggagct 3300 tttgcactgg tatccaaaag cttagttaca aggt 3334 // ID Gypsy6-LTR_AP repbase; DNA; INV; 616 BP. XX AC Contig9985; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6AP; KW Gypsy6-I_AP; Gypsy6-LTR_AP. XX NM Gypsy6-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-616 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 448-448 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 616 BP; 184 A; 81 C; 102 G; 249 T; 0 other; tgtattgaca ttatattgta ttgtatattg ataatgtatt gacattatat tgtattgtat 60 attgataatg tattgacatt atattgtatt gtatattgat aatgtattga cattatattg 120 cattatgtat tgatgatgca tatgctcacc aattgcattc tattttttta aaaaaaaaag 180 actgtttatt atataatttt gttattcttt caaattgtta tgagtggagt ccactcaaag 240 tcaagtatgg ccgagttgta gtaaaattat tattattatt attaatattt attataagtg 300 gataatattt gtcaattggt cgcgccacag tcgattatca cttttaatcg actgagcacg 360 acaccaaccg attattatgt tcctggcgac cgacggtaat tcattactgt ccgcggccac 420 cgctagttgc taagatatat attgtgatat gcatttctgt accgttgttt taatttaatt 480 atcatgtgta atgtgtattc aattattaat aaatgtgcgt tgattagctt aatgaaccgt 540 ctattaagtt aattatttaa ttgtacctaa attgtcctga cacattgacg tggtggtttg 600 gagagacctc accaca 616 // ID R2Sm-A repbase; DNA; INV; 4317 BP. XX AC . XX DT 19-FEB-2010 (Rel. 15.02, Created) DT 19-FEB-2010 (Rel. 15.02, Last updated, Version 2) XX DE R2Sm-A - R2 non-LTR retrotransposon from the bloodfluke DE Schistosoma mansoni. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2Sm-A. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4317 RA Kojima K.K. and Fujiwara H.; RT "Long-Term Inheritance of 28S rDNA-Specific Retrotransposon R2."; RL Molecular Biology and Evolution 22(11), 2157-2165 (2005). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 341..3784 FT /product="R2Sm-A_1p" FT /translation="MPVSTGAETDITSSLPIPASSIVSPNYTLPDSSSTCL FT ICFAIFPTHNILLSHATAIHHISCPPTPVQDGSQQMSCVLCAAAFSSNRGL FT TQHIRHRHISEYNELIRQRIAVQPTSRIWSPFDDASLLSIANHEAHRFPTK FT NDLYQHISTVLTRRTAEAVKRRLLHLQWSRSPTAITTSSNNHTTTDIPNTE FT ARYIFPVDLDEHPPLSDATTPDASTHPLPELLVILTPLPSPTRLQNISESQ FT TSHESNRNSMHTPPTYACDSDESLGVTPSSTIPSCFHSYRDPLAEQRSKLL FT RASASLLQSSCTRIRSSSLLAFLQNASTLMDEEHVSTFLNSHGEFVFPRTW FT TPSRPKHPSHAPANVSRKKRRKIEYAHIQTLFHHRPKDAANTVLDGRWRNP FT YVANHSMIPDFDCFWTTVFTKTNSPDSREITPIIPMTPSLIDPILPSDVTW FT ALKEMHGTAGGIDRLTSYDLMRFGKNGLAGYLNMLLALAYLPTNLSTARVT FT FVPKSSSPVSPEDFRPISVAPVATRCLHKILAKRWMPLFPQERLQFAFLNR FT DGCFEAVNLLHSVIRHVHTRHAGASFALLDISRAFDTVSHDSIIRAAKRYG FT APELLCRYLNNYYRRSTSCVNRTELHPTCGVKQGDPLSPLLFIMVLDELLE FT GLDPMTHLTVDGESLNYIAYADDLVVFAPNAELLQRKLDRISLLLHEAGWS FT INPEKSRTLDLISGGHSKITALSQTEFTIAGMRIPPLSAADTFDYLGIKSN FT FKGRCPVAHIDLLNNYLTEISCAPLKPQQRMKILKDNLLPRLLYPLTLGIV FT HLKTLKSMDRNIHTAIRKWLRLPSDTPLAYFHSPVAAGGLGILHLSSSVPF FT HRRKRLETLLSSPNRLLHKLPTSPTLASYSHLSQLPVRIGHETVTSREEAS FT NSWVRRLHSSCDGKGLLLAPLSTESHAWLRYPQSIFPSVYINAVKLRGGLL FT STKVRRSRGGRVTNGLNCRGGCAHHETIHHILQHCALTHDIRCKRHNELCN FT LVAKKLRRQKIHFLQEPCIPLEKTYCKPDFIIIRDSIAYVLDVTVSDDGNT FT HASRLLKISKYGNERTVASIKRFLTSSGYIITSVRQTPVLTFRGILERASS FT QSLRRLCFSSRDLGDLCLSAIQGSIKIYNTYMRGTQRLNE" XX SQ Sequence 4317 BP; 1145 A; 1274 C; 764 G; 1134 T; 0 other; atgttttaat ttatttttga actactactg tctgagtgct tcttacaacc tgaaggctca 60 gaaactaccc actttttgct gtttatccac aacaacagtt gtgaatctat tctccaaata 120 ttccttgtgc ttttgtcaac attattctat accaactgta ccacctactt cttcatctca 180 cgttttaatt ctggtctaat tttctcatca ttagtcacgg agagggccta tgaacggtcc 240 gtgacgcgaa attcaatcca cgaattcgtc ctcttctgct agtggtcccc gaaatacggt 300 tcctctggcc tgtcagttgt gttaaaacta tataataacg atgccggtct caaccggcgc 360 agaaactgac ataacctctt ctttgcctat tcctgcatcc tcaatcgtct cgccaaacta 420 cacactccct gattcctctt caacctgcct tatatgtttc gctatcttcc ccacccacaa 480 catactcctc tcccatgcca ctgcaatcca ccatatttct tgtcctccta ctccagtgca 540 agacggttct cagcagatgt cttgtgttct ttgcgccgcc gctttttcat ctaacagggg 600 actaacacaa cacattcgcc accggcacat ctccgaatat aacgaactaa tcagacaacg 660 aattgcagtg cagccgacgt ctcgcatatg gtcaccattc gatgatgctt ctctactatc 720 aatcgctaac catgaagccc atagattccc cacgaagaat gacttatacc aacacatcag 780 cactgtatta acacgcagga cggcagaggc cgtcaaacgc cgactcctcc acctacagtg 840 gtccagatca cccacagcga ttactacctc ttcgaataat cacacaacca cagacatccc 900 caataccgag gcccgatata tttttccggt agacctagac gaacatccac cattgtctga 960 tgccacaacc cccgacgcat cgacacatcc actcccagaa ctccttgtca tcttgacacc 1020 gcttccatcc ccgactagac tacaaaacat atccgaatca cagacctccc atgaatccaa 1080 taggaactca atgcatacac cgccaacgta tgcctgcgat tcggatgagt cactaggggt 1140 tactccctca tcaactatcc cctcatgctt ccacagttat cgggaccccc tagctgaaca 1200 aagaagcaaa ctcctgaggg catccgccag cctactacaa agcagttgta ctcgcatacg 1260 gtcctccagc ctgctcgcct tcctccaaaa cgcatccaca ttaatggacg aggaacacgt 1320 gtccaccttc ctcaatagtc atggagaatt cgtcttccct agaacatgga ccccatcccg 1380 acccaaacac ccctcccacg ccccagctaa tgtttctagg aagaaaagga ggaaaataga 1440 gtacgcacac atccagacac tcttccacca ccgtcccaaa gatgccgcca acaccgttct 1500 agacggtcgg tggagaaacc cctatgtcgc aaaccattca atgattccag acttcgactg 1560 cttctggaca acagtcttta ctaaaacaaa ttccccagac agccgggaga ttactccaat 1620 catccctatg actccctctc tcattgaccc gatcctcccc tctgacgtca catgggcgct 1680 gaaagaaatg catggcacgg ccggtgggat tgatcgtcta acatcgtacg atctgatgag 1740 attcgggaag aatggtcttg ctggatatct caacatgcta ctcgctcttg cataccttcc 1800 cactaatctc tcaacagcac gggtaacttt cgtccccaag tcatcaagtc ctgtgtcacc 1860 tgaggacttc cgtcccatca gtgtcgctcc agtagccact aggtgcctgc acaaaattct 1920 agcaaagaga tggatgccgc tctttccaca ggaacgactt cagttcgctt tcctaaaccg 1980 agatggatgc tttgaagcag ttaatcttct gcactcggtc atacggcacg tccacacccg 2040 ccatgcagga gcatccttcg ccctgctcga catatcacgg gcctttgaca ctgtatcaca 2100 tgactccatc atcagagcgg cgaaaagata tggggcacct gaactgttat gccgctacct 2160 caataactat taccgacgtt caaccagctg cgtcaaccgc actgaattgc atcctacgtg 2220 tggggtgaag caaggagacc ccctgtcgcc actcctcttc atcatggttc tcgacgaatt 2280 actggaaggt ctagatccaa tgacccacct aacagttgat ggagagagct tgaactacat 2340 agcttatgct gacgatctcg tagttttcgc tccaaatgca gaactccttc aacggaaact 2400 cgatcggatc tccctacttc tacacgaggc tggatggtcg attaaccctg aaaaaagccg 2460 gaccctggac ctaatctctg gtggccattc caaaatcaca gcgctctctc agacagaatt 2520 caccatcgcg gggatgcgta taccaccgct ttccgccgcc gacaccttcg actatctggg 2580 tatcaaatcc aacttcaagg gccgatgccc agtggcccat attgacttat tgaacaacta 2640 cctcacggaa atatcgtgcg ctccacttaa gccgcagcag cgcatgaaga tcttgaaaga 2700 taatctactc cctcgactcc tctaccccct gactctagga atagtacacc tgaaaaccct 2760 gaagtcaatg gaccgaaata tccacacggc cataaggaaa tggttgcggc taccctccga 2820 caccccgcta gcatattttc actcacccgt cgctgccgga ggcctaggga tcctccatct 2880 gtcctcatcg gttccattcc accgtcgaaa acgtctagaa accctcctat cttcaccgaa 2940 ccgcctactg cacaagttgc caacttcccc aacactagct tcttattcac accttagtca 3000 actgccagtt cgaattgggc acgagaccgt aacgtctaga gaagaggctt ccaacagctg 3060 ggtgagacga ttacattcgt cctgcgacgg gaagggacta ctcctagcac cactaagcac 3120 cgagtcccat gcatggctgc gctaccccca gtctattttt ccaagtgttt acatcaacgc 3180 cgttaaatta cgaggtggct tactatccac caaagtcagg agatctcgcg gaggtagagt 3240 gacgaatggc ctgaactgtc gaggcggttg cgcccatcat gaaacaatcc accacattct 3300 gcaacattgc gcgctcaccc atgacatcag atgcaaacgc cataacgaac tatgcaacct 3360 tgtggcaaag aaactgcgta ggcaaaaaat ccatttctta caggagccct gcattcctct 3420 agaaaaaact tactgcaaac ctgattttat aattatacgt gattcaattg cttatgttct 3480 agacgtcact gtatcggacg acggaaacac ccacgccagc cgcctgttaa aaatatcaaa 3540 atacggcaat gagcgaaccg tcgcatcgat caagcgattc ctcacatcca gtggatatat 3600 cattaccagt gttcgacaaa caccagtcct tacattcaga ggtattctgg agagagcaag 3660 ttcacaatcc ctacgacgcc tatgtttttc gtcccgtgac ctcggtgacc tttgcctgag 3720 tgcgattcaa ggctcaatta aaatatataa tacctatatg agaggaaccc aacggctgaa 3780 cgaatagccc ccttcactct tagacattcc cccactgttg ttgcttatct tcatgttttt 3840 gtgttaattg actgctctct tctgggttga tgtctgattg tctctctctc tttccatatt 3900 gcttgctctc cccgcttact tccaatagtt gtcatattat gtctttgttt acttgccatg 3960 tctaacgaca attactttat ctaccttagt tggtcctctt ggtttggttg ccttcatgtg 4020 ttcatggcgg aatctgatgt ttataatgac tattcctact accaccatta caactattat 4080 tattatcact attattaaca ttattattac ttctacaatt agtattatgg ctactccttt 4140 cagcacacca ataaaatctc aatcaaacat ctcacttatt aaactctcta tttccccttc 4200 gttataaact tacaattcag tttaaccgaa tatctctctt ttacaaatct taagtatgta 4260 attttgtgcc aagcccattt gggtctgtac aatttgatac ttaaaaataa atgttat 4317 // ID Chapaev-13_HM repbase; DNA; INV; 2831 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2831 RA Bao W. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(12), 1828-1828 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(318..1040,1044..2465) FT /product="Chapaev-13_HM_1p" FT /translation="MSTKLRRLFSGKGLHKYHSFEEKFVECASKMPNSAKT FT HEDCRKSVCIICMKKGDQVLTDFLKQRIQRLIKNDLSFNDCRVPLASCQTC FT RFKLKKLEDGDPKTCKPRLYRFEDIQIKPATRSSNTCDCLICQIGRLKGKE FT THPLEQKDQPATTSQPPKSSEKRCTKCLSVLSRGLSHHCTIGSRHENLKQL FT ALSDPKGAEQVASSIVASKEASPGGTIRLSQSSGMHLPIRKGKMYTYHILL FT RMLIDFVFLGGVVAQELFPSRLSTGDMVQIQQNTGLSNNGMRKLGSFINHI FT SPTRLVEPNFQKKFAESGQKLQDFFTVSSLKISDSGENRVVVHCKSIKDLK FT DRIEEEIGSSKTAFAKIGIDGGGSFLKVSIGVIDEEENNLEPHSPVQKSAK FT LTTQVSKQTSVKKQFLVAVADDTPENYWNVQEIFRLIKLQELIPSEAAVIS FT CDMKLANIICGIQSHSSKHPCCWCDIESSKLLLCGNPRSFGAIRAQHQSFV FT GSGGDVRRSKEFQNAVHSPLIFGPDQALILEFIPPMELHLLLGVVNHLFKC FT LVQKWPKATEWPTSLHIKIQPFHGGHFNGNDCMKLLRKIDKLAEMAENEST FT SEPSQFIQTFKDFHEVVSACFGKTLHPNYEEKITIFKNSYMSLGISVTPKV FT HTVFHHVTQFIKLKKKSLGLYSEQATEALHSDFGKHWERYKRLNTHPDYSS FT QLLKCVIDYNSKKM*" XX SQ Sequence 2831 BP; 910 A; 530 C; 559 G; 832 T; 0 other; cacagtttga aaattttgga ccctaatacc aaaaacaact cctatgaaaa cccttttgag 60 gggtgttagt acatatgata gggatctcat aacaagtaat tttgaaatca gggattaccg 120 tccgcaagac ttatttgact ttattagttt tatcttaatg aaacgctgtg acttttactg 180 aaaaattttt ctgacacaag tttacacaac ttttctaaat ataacaagta tcaaaaggtt 240 aagaaagtgt tacttgatgg tatttggtgt attttaaaac accagtgtga tttttttaaa 300 ttcttatcac caaaataatg tcaacaaaat tacggcgatt gttttccggg aaagggctgc 360 ataaatatca tagttttgaa gaaaaattcg tagaatgtgc atcaaaaatg cccaattcag 420 caaaaaccca tgaggattgt cgaaaatctg tgtgcattat ctgcatgaag aaaggagacc 480 aagttctgac agacttttta aagcaaagga ttcagcgcct catcaagaat gatctgagct 540 tcaatgactg cagagttcct cttgcctcat gtcagacatg tcgttttaaa ctgaaaaagc 600 ttgaagatgg ggatcctaag acgtgtaaac caagattgta caggtttgaa gacatccaaa 660 taaaaccggc aactcgaagt tcaaacacct gtgattgttt gatctgccaa attgggaggc 720 tcaaaggtaa agaaactcac ccacttgagc aaaaagacca gcctgcaaca accagtcagc 780 cacctaaaag ttcagaaaaa cgttgtacaa aatgtttgtc tgttctctca cgaggccttt 840 ctcatcattg cacaattggt tcacgccatg aaaacttgaa gcaactggca ttgtcagatc 900 ctaaaggagc tgaacaagtt gcatcttcca ttgttgcatc caaggaagca tctcctggag 960 gaacaattcg acttagtcaa tcgagtggga tgcatcttcc aatcagaaaa ggtaaaatgt 1020 atacttacca cattcttctt tagagaatgc taatagattt cgttttttta ggtggtgttg 1080 ttgctcaaga gctttttcct tccagacttt caacagggga tatggttcaa attcaacaga 1140 atacaggatt gtctaacaac ggaatgagga aacttggatc gttcatcaat catatcagcc 1200 caacaagatt agttgaacca aactttcaaa aaaagtttgc tgagtctgga caaaaattgc 1260 aggacttttt tactgtcagc agcttgaaga tttcggattc tggagaaaac agggttgtgg 1320 ttcattgcaa aagcattaag gacttgaaag acaggattga agaagaaatt ggctcctcaa 1380 agactgcctt tgcaaagata ggtattgatg gaggagggtc attcttaaaa gtgagcattg 1440 gagttattga tgaggaagaa aacaacctag agccccacag tcctgtccaa aaatctgcca 1500 aactcacgac acaagtctca aagcaaacta gtgtgaaaaa gcaatttctt gtggctgttg 1560 ctgatgacac tcctgaaaat tattggaatg tgcaagagat ttttcgactc atcaagcttc 1620 aagaactaat cccatcagaa gctgcagtaa tttcttgcga tatgaaactt gcaaacatta 1680 tttgtggcat ccagtctcat agcagcaagc acccttgctg ttggtgtgat attgagtctt 1740 caaagcttct tctgtgtggg aatccccgat cctttggagc tataagagct caacatcaaa 1800 gctttgtggg tagtggcgga gatgtaagaa gatccaagga gttccaaaat gctgtccaca 1860 gtcctttaat ctttggtcct gatcaagcac tgattttgga gttcatccca cctatggagc 1920 ttcatcttct gctaggggtg gtcaaccatc tgttcaaatg cttagtgcaa aaatggccaa 1980 aggccacaga atggccaact tcattgcaca tcaagattca gccttttcat ggagggcact 2040 tcaatggcaa tgactgcatg aagcttctga ggaaaattga caaattggct gaaatggctg 2100 aaaatgagag cacttctgag ccatctcaat tcattcagac attcaaagac tttcacgaag 2160 ttgtctctgc atgctttgga aaaactttac atcccaacta tgaggaaaaa atcaccatct 2220 tcaaaaacag ttacatgagt ttgggaatct ctgttactcc aaaggttcac actgttttcc 2280 accatgtgac acagttcatc aagctgaaaa agaaaagcct tggtctatac agtgaacaag 2340 ccactgaagc actgcattcg gattttggca agcattggga gcgatacaaa agattgaaca 2400 cccatccaga ctactccagt caactcctga agtgtgtcat tgattataac agcaagaaaa 2460 tgtagagtag cttgctatga ggcactgacc atcaatgaac atgttgcatt tttgagaatt 2520 tattagttta tctttagaat gtatttattt tgtaataaat aaatgtttta agatgtattt 2580 ttatcatttg ttaccttttt acttcataat atggtaattt agggcagcca aatatttttg 2640 gaggaatttt gcaatttttc aggtttcgta atttaaaagc aaaaacttca atgttgaata 2700 agtcttgcgg acggtaatcc cggatttcaa aattacttgt tatgagatcc ctatgatatg 2760 tactaacacc cctcaaaagg gttttcatag gagttgtttt tggtattagg gtccaaaatt 2820 ttcaaactgt g 2831 // ID DNAX-2_Tad repbase; DNA; INV; 374 BP. XX AC . XX DT 07-OCT-2009 (Rel. 14.1, Created) DT 07-OCT-2009 (Rel. 14.1, Last updated, Version 3) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-2_Tad. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-374 RA Jurka J.; RT "DNA transposons from Trichoplax adhaerens."; RL Repbase Reports 9(10), 2144-2144 (2009). XX DR [1] (Consensus) XX SQ Sequence 374 BP; 109 A; 70 C; 75 G; 120 T; 0 other; catggactac tagaagaaga cctggactga tcactcttgg acggacagac actttcggcc 60 tcatctcgtc aaaatattgt gccaaacact tcgttatgac agtactttag tatgcagacg 120 gtcgattttg aattgactga ttttttctaa attaagtaat attacgttga atcttattat 180 ctgttactaa gttctgcttg gaaacagtta tataagatcg tcgatgttat ttaaattagt 240 accagtgatt tagaaaacag acacggaaga tccatactaa gtagccattt ttcgatattg 300 acggcctcat ttggtaagaa aagtccgtcc gtccgaggga gtgatcagtc caggtcttct 360 tctagtagtc catg 374 // ID DNAX-1_Tad repbase; DNA; INV; 130 BP. XX AC . XX DT 01-AUG-2009 (Rel. 14.08, Created) DT 01-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-1_Tad. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-130 RA Jurka J.; RT "DNA transposons from Trichoplax adhaerens."; RL Repbase Reports 9(8), 1825-1825 (2009). XX DR [1] (Consensus) XX SQ Sequence 130 BP; 38 A; 20 C; 26 G; 46 T; 0 other; tatcaatccc aacagtggca aatcagtaat tcattcattc taaggattga tattagaatt 60 cattactgat ttgccactgt tgggatgcat attggaatga attgctgatt tgccactgtt 120 gggattgata 130 // ID Copia-16_AA-I repbase; DNA; INV; 4159 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_AA_; KW Copia-16_AA-LTR; Copia-16_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4159 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 943-943 (2011). XX DR [2] (Consensus) XX CC Positions [1431-1961] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 978..2660 FT /product="Copia-16_AA-I_1p" FT /translation="MAADKGVMEIAATGTVKLFPKCCPENPPIDVNNVKLI FT PTISSNLLSVSQIVRRGHEVRFCDNGVKVIDSDGDVIATGTHDVSNGQFRF FT DESDEQKALSLAASPVNLDVWHRRMGHLNVRSLKLLKNGLATGVEFQETAI FT DNCSVCAMGKQTRLPFPSSGHRASGVLELVHTDICGPMEETSLGGSRYYVS FT FADDWTRRIFVYFLETKSEAEVLNAFDQFRALAEKQTGAKLKAIRSDNGKE FT YCNKSFQKRLAEAGIIHQKSTEYTPEQNGLAERVNRTVVERARCMLYEAKL FT PKSFWAEAVAAATYVINRSPTKGHDLTPEEAWTGRKPDLSHVRVFGAKTMV FT HIPKQKRKKWDAKSTECILVGFDELTKAYRLYDPVKKDVIRSRDVVFLNEL FT PQAEVKAKPATTKSKPTRVRTFVNLDFDDSPIEPIPRDEPARPTREEESSD FT EGASGSDEDFYSQAETSGGEYSEGAEGDVTITALPPRQSSNPPESQVLRRS FT GRERAIPGKYNDFVFPRKGFPFSPSTEQPDQAGQSNRQQGLKASKRPASKS FT GDPQTRRRRCETTT" XX SQ Sequence 4159 BP; 1062 A; 1053 C; 1211 G; 825 T; 8 other; ataggttatt ggcccaagta ttttttcaaa agaagattct cgaagcagaa aatgtctctc 60 ggcggagctg gcgacaacca agcggaacgg attccgcaaa tccagcagca gtttcaaaac 120 cactcaagcc ttccccccat cgagcggttg tgcggtcgtg aaaactggcc cacgtggaag 180 tttgcggtgg aaacttatct cgagctggag gagctgtggg aaacagtgaa gccgattccc 240 aacgcggatg gaacgctgcc ggcgatcgac gagaggaagt gtagaagagc acgtgcgaaa 300 attattctcc ttctggatcc cgtcaactac gtgcacgtga aggacgcgaa aacggcccgt 360 gaggtttggg cgaagcttga agcagcattt gaggacaccg gcctgacscg gcgggtggcs 420 ctgctcagga agctcatcac aacgatcgct ttcgtcgtgc ggttaccgtk gamacgtacg 480 taacggagat cgtttccacc gctcatcacc tccgcggcgt ggggttcgaa atttccgaag 540 agtggatcgg atcacttttg cttgctggtc ttccggagga gtacaagccg atgatcatgg 600 cgctggaaag ctctggcatc cccatcaccg gcgacagcat caagacgaag ctgctccagg 660 aagtgcaaat cccatcggac aaaacggcgt tcgttggaag aaaatcgtct cctccgaaaa 720 cgaagcaaag cgtacccgaa aaccggtcag ctgcgcagcc caaaggtccg aagtgcaagc 780 gctgccgtag gtacgggcac attgctaggg aatgccaggt gaaaagcgag gcgaggaagg 840 gatcagcgtt cagcacggtg ctaacagcaa atagttgcca tagcgacgac gattgggtgt 900 tcgattctgg cgcatcggac cacttcacca tsaacagccg cctactggtg aatgcgcgac 960 cggctagtgg cgtggtcatg gcagccgata aaggtgtgat ggaaatcgct gcgaccggga 1020 ccgtcaaact ttttccaaag tgttgcccgg agaatccacc gatagatgtg aacaacgtca 1080 agctcattcc aacgatttcc agtaatctcc tctcggtgag tcaaatcgtg cgacgaggtc 1140 acgaggtgcg gttttgcgat aatggcgtga aggtgatcga ttcggatggt gacgtcattg 1200 caaccggtac ccacgatgtg tcgaatggac aattccgttt cgacgagtct gatgaacaga 1260 aggctctatc actcgcagct tcgccagtca acttggacgt ctggcatagg agaatgggtc 1320 acctcaacgt acgcagcctg aagcttctca agaacggcct tgctaccggg gtcgaatttc 1380 aggaaacagc gatcgacaac tgttcggtat gcgccatggg taagcaaact cgtctcccat 1440 tcccaagcag tggtcatcgt gcatccggcg tgttggagct tgtccatacg gacatttgcg 1500 gaccaatgga ggaaacctcg ttaggtggaa gccggtacta cgtgtcgttt gcggacgatt 1560 ggacgcggcg catattcgtc tacttcctgg agacgaaatc cgaagccgag gtgctgaacg 1620 cattcgatca gtttcgcgct ctagcggaga agcaaactgg cgccaaattg aaagccatca 1680 ggagcgacaa cggtaaggag tactgtaata agtcgtttca gaaacgcctt gccgaagccg 1740 gcatcatcca ccaaaagtcc accgaataca caccggagca aaacgggttg gccgagcgtg 1800 tcaatcgaac cgttgtcgag cgtgcaagat gcatgctcta cgaggcgaaa ctaccgaagt 1860 ctttctgggc ggaagcggta gcagctgcaa cgtacgtgat caatcggtcc cccaccaaag 1920 gtcacgattt aacaccagag gaggcatgga ccggtcggaa gcccgatctt tcccacgtgc 1980 gagtttttgg tgcgaagacg atggtccaca ttcccaagca aaagcggaag aagtgggatg 2040 cgaaatcgac ggagtgcatc ctggtcggtt ttgacgagct gaccaaagcc taccggctat 2100 atgacccggt gaagaaggac gtaatcagga gccgtgacgt agttttcctg aacgagctgc 2160 ctcaagccga ggtgaaggct aaaccagcta ctacaaagag caagccaacg cgagtgcgga 2220 cgttcgtcaa tttggacttc gatgacagtc cgattgaacc gattccgcga gatgaaccag 2280 cccggcctac acgtgaagaa gaatcatccg atgaaggtgc gtctggatcc gacgaagatt 2340 tttattcgca agctgaaacc agcggcggcg aatactcaga aggtgctgag ggtgacgtga 2400 caattacggc gctcccaccg cgacaatctt caaatccacc ggagtcacag gtgttgaggc 2460 gcagcggtcg ggagcgcgca attccaggca agtacaatga ttttgtattt ccgagaaaag 2520 gctttccgtt ttccccttct acagagcagc ccgaccaagc cggtcagagc aatcggcaac 2580 aaggattgaa ggcgagcaaa cgtccggcga gcaagtcggg cgatccccaa acgcgacgga 2640 ggcgttgcga aacgacgacg cstcgcaatg gcaatccgcc atggacgaag agtttcgagc 2700 gctgatcgac aacgacacgt gggagttggt acagcttcca gccgacgaga aggccatcgg 2760 ctgcaaatgg ctcttcaaaa ctaagcagga cgagaagggc aacgtgatcc gtcacaaagc 2820 gagaatcgtc gctcaagggt tttcgcagcg gtacgggtcc gactacgacg aggtcttcgc 2880 gccggtggca aagcaaacga cattccggac gttgttgact gtcgccagcc ggagaggttc 2940 catcgtaagg cacgtcgacg taaaaacggc gtacttgaac ggtgtgctcg aagaaaccgt 3000 ctacatgcgt caaccagagg ggtaccatgt cggcgacgaa aggacggtct gccgattgag 3060 gaggagtctg tacggcctga agcagtcagc gcgcgtatgg aatcgcaaag tggatgcggt 3120 tttcaaatcc atggggttca agccgaccga atcggatcct tgtttgtacg tgcgacgtat 3180 gaatggttcg gtcgcgtata ttctgatata cgtggatgac atggtcgtcg tacgcaaacg 3240 acgaaggagt tccaatccat tctgaagccg cttcaggaac atttttccgt ggctgacctt 3300 ggagacgtca gccatttcct tggcatgcaa gtcgagagaa gcgagcaggg tacaatgctc 3360 aaccagsaaa tgtacatccg caagctggcc gagcggttcg gcatgcagca agccaagccg 3420 cgaagatacc actcgatccg agctacctac agcagaagga ggagatggat cagctgccga 3480 acaaccacga ctacatgagc ttgattggag gtctcctgta tgtagctgtg cacacgcggc 3540 cagacgtatc cgtgagcgtg tcgattctgg cgcaaaaatc cagctgcccg aattcgcaag 3600 attgggcaga agcgaagcga attcttcggt atttgtaacc accagcaacc taagctgcag 3660 cttggtgctt ccagcgctgg gttggaaatg ttcgccgatg cagactgggc cggcgacgct 3720 cgcgaccgga aatcgaattc cggcatgatt ttgatgttcg gggaggacca atctcgtggt 3780 gctcacgaaa gcaaacgtgc gttgctttga gttcgaccga agcggagttc gtggcgctcg 3840 ctgaagggtg tcaggagcta tatggaccaa gcgactgctg aaggagatca gtgaagcaac 3900 cgacgttccc atccccgttt tcgaggacaa ccaaagttgc atcaagctgg tggaaagcga 3960 tcgcttcgaa cggcgcagca agcacatcga tacmaagttc ttctttgtcc gagacctgca 4020 agagaaggaa agatcagatt gcagtactgt ccgacggaat cgatgctggc cgacttgatg 4080 acgaaaccac tccaacgggt tcgtctggag agattacgag tagctgttgg gaatccgtcc 4140 ggatctgccc gaggaggag 4159 // ID DNA-5_CQ repbase; DNA; INV; 184 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-184 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 46-46 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. ~40-bp TIRs. TSDs are possibly TTAA or TAA, but CC uncertain. XX SQ Sequence 184 BP; 42 A; 42 C; 47 G; 53 T; 0 other; ttaaggctag tatgcagatc ggaaggaaag gggtcaagaa acagctttac gaacagcaga 60 acaaaggaga gggagcgatt tcttggggct tttcttcacc ctctctgact tgcttgcgct 120 gccgctgctg cccatttgat ttttttcttg accggttcct tccgaagtgc gtataccgct 180 ttaa 184 // ID hAT-N3_BF repbase; DNA; INV; 1812 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N3_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N3_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1812 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1812 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 913-913 (2008). XX DR [2] (Consensus) XX SQ Sequence 1812 BP; 489 A; 421 C; 417 G; 485 T; 0 other; cagggttgta gccagcgccc gtccttccgt cctttgacgg aattttgcag ctggggacgg 60 aaaaatattt tgcccattcc gtcctctgtg acggagaaaa tcgatatgta aagatacaac 120 aaaaactaaa aatcgtgcaa agatctccaa aaattcaaag gattttttca cagcgccggg 180 gcaggggtcc gcatttcagg ctattcagaa ggcttatttt atcaggtttt tagggccata 240 tctacgataa tagtagcagt cacttaactt cttttcagtt tgacataaca atatttcaag 300 tcaaagcgtc aaagtcaact aaaaactcgt aatcaacaaa ctttactcag ctcaacagta 360 acgttaacta ttgtccggtt tgccatccga agttgaagcg cttatttata gcgctagtat 420 cttcgatgcg ccgttagccg ccgtgccgtg cgactgttgt tgacggtctt acatccctaa 480 gttgccgcct cgctgtcaga cgatatacca acggacatgt ttcaagaaaa atacccggcg 540 aaaccattgt tccgcgtcag attctattct ataaaacatg tgggttacag tctgataaat 600 attttgtaga agcaccgctg tgggaaagtg caaggaataa tcacgtagca tacgaagttc 660 gctgtcaact gtcacacgac tgtttgaaac tctaaccaac agccgcggcc tttgatgaag 720 tcggcgaaaa ggtgcactgc ctttgcaaca aggaagccga cggtctcacg gtgggcgggg 780 ccacatgacg aggcttgccc cgaaagcccc ggccccgaaa ggaaatgccc gggttttaca 840 tacaaactta cggtggcttg tactttacgt tgccgaacca aagagcttca aatttagaac 900 aggaaatttg ggaacgcgac ccgcccttgc tctagaaatg tgtcagagag aattgatttt 960 attttccgcg ctgaaaaaga aatgacgcaa ctaatagtaa gagcctatgt gatcaacccc 1020 gctccccata taaggagatt cccgggaaaa tctggctgtt ccggggaggg atcccacccc 1080 gctataatag gaccgcgccg gatcaaaccg tgggcgattc tttcaagtga ttcgttgtac 1140 atgctcgccg actcgccgtc tatgtttcca atctttgtaa agtgttaaaa tgctacttac 1200 ctcgttcaga tgaactgaca gatgaagtgt gttgtgttaa tcgacggttt ctctcatcgg 1260 cgacggtagg aatcgacgtg aactttgata cgacttaact aagtaagacg caaacggccg 1320 gtgttgaaaa tgttttattc tcgccgatgt gagggcgccg ttatcccttg tatgtgctat 1380 tgtgcttgtg tatatttctc gtaggacgta gatacgtaat aaatttttgt tgtactaaac 1440 gtgctgtttc ttttctgtgt aaattctata caatatagaa gtgcagctgt gtaacacgct 1500 tacaatctta ccctcatacc agcaatctgt gcgatcactg cggtggcggt agcatcggcc 1560 cctttcctaa ataaaatcca gcgtaaaacc ctcgtgtaat ttcttttacc aaaacagatg 1620 tattaacagc tcaaaatgct ggaaatagcg tttcagaggg tctagatttc aaaattttcc 1680 gggggagcat gcccccggac ccccctagga acgtcgcgcc tttggcgcga catctcgcgc 1740 ctacggcgct cgataggatt ttcagtcaaa agtgggggga cggaaaatga tttcgggctg 1800 gctacaaccc tg 1812 // ID CR1-53_HM repbase; DNA; INV; 3839 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-53_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3839 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1881-1881 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(118..777,781..3741) FT /product="CR1-53_HM_1p" FT /translation="MVRKEDFEALVSRVLKLEAELQIKDKEINDLKIVVND FT MKNQSKLVSNSNDVSNTWSDVVKGRKKNTSVQLDLLNVVVKESHDREIRQK FT NLVVFGFKTSTNKDESVAKEDDKQELQNLFKQINVKVTIQRVFKLNTKTDK FT PPPVIVVLTNKEERNAVLKTAKNLRNSPETNNIFINADMTEAERFKMKALR FT EERAKLNEANQDLKYYYGIRNERVVKLLKKLYESTQPYQQISPKKVLYXNL FT NNITKNPNDRHLLNSNKRLIVYYTNSTSLNNKKNDFRALIDIEKPDVFAVT FT ETWYTPLSDVNIEGYVIYRNDRLNGKKGGGVCIYVNSSIKSYEISNIELKS FT DLIEQIWTWIEYGQEKVLLGCIYRPPDSSCVTNKNIINSIIAAKRAQDKKQ FT FTSIIITGDFNYNTLKWKDSVEESHSQQAEDFVACLNDCFFEQCVTEATFQ FT SNENELTNILDLVIVDDSNRIFGLSHLPPLGLSSKGHHILKWFYKMKDRID FT DFSSSTKKFAYRRGNYEKINVYIKSINWADEFQNKCIDKCYDKFLYHYNQS FT CKNYIPIVKLKNKASEVWMSNSLRKEIKYKNNYWTTNKKMNWSIAGSKEKY FT NKMKRNILKQINNSVKNYERNLVNQVKNEPKLLYSYIRKKQKVKNHIRALN FT DNIGSTMYDKHSIVDIFNKYFHSVYIKEENINDCNFEVQTNSRFGKIFVDQ FT ALVKSYLSKLDINKSSGYDEVNPYVLKVCSESFSIPLSIIYQQSLTNSDIP FT DLWKMANVSPIFKNGSKLQAINYRPVSLTSIPCKVLEKIISDNLKNYLESN FT NLFCKQQHGFLKGKNCITNLLESLDFLTYQLMIGNSIDILYTDFQKAFDRV FT PHKRLISKLCGYGIEKEMLNWIETYLSNRKQRVIIGDTKSDWLEVLSGVPQ FT GSVLGPLLFLLYINDLPSKIKNKCELYADDNKIIAVVNNQVDSKSLQDDIN FT NLTEWSDKWMINFNIKKCKIMHFGKHNKNFEYFIKDSKLDSTTNEKDIGLL FT ISNTMKWNQHVNMVVNKANSRLGLLSKSFTYKDKNTMKILYCTFVRPILEY FT ASPIWNPYNKNDIHKLEIVQQRATKLIPELRHLPYEERLKKLQLTSLKIRR FT LRYDLIQYYKIFHNLDKVCWYQQPKTAISISANGPASNIRGHKARVDIELI FT KRNRPRENFFTNRVAKWWNKLPEEIISAKSVNIFKDKLDKHMVIMKL*" XX SQ Sequence 3839 BP; 1635 A; 500 C; 585 G; 1117 T; 2 other; cgcgcgccgt taaagatggc agacgtgact ttgtgaaaaa ataagaataa caaaacataa 60 acattttgta ataataacta aataataaaa gataaactta tcttgtcaaa tttgacaatg 120 gtaagaaaag aagacttcga agctttagtt agtagagtac taaaattgga agcggaactc 180 caaataaaag ataaagaaat aaatgacctc aaaatagtag taaatgatat gaagaatcaa 240 tcaaaactcg tatcaaactc aaacgatgta tcaaatacat ggtccgatgt agtaaaaggg 300 agaaaaaaaa atacatctgt gcaactggat ttattaaatg tagtggttaa agaatctcat 360 gacagagaaa taaggcaaaa aaatttagta gtttttggat ttaaaacctc tacaaataag 420 gacgaaagcg tggcgaaaga agatgacaaa caagagttac aaaatttatt taaacaaatt 480 aatgttaaag ttactattca gcgagttttt aaattaaata cgaaaaccga caaaccacct 540 ccagttattg ttgttttaac taacaaagaa gaaagaaatg ctgtattaaa aactgctaaa 600 aacttacgaa attcacctga aactaataat atctttatta atgcagacat gacagaagca 660 gaaagattta agatgaaagc tcttagagaa gaacgtgcta aattaaatga agctaaccaa 720 gacctaaaat actattatgg aattagaaat gaacgtgttg taaaactgtt aaaaaagtag 780 ctatatgaat caacacaacc atatcaacag atatctccaa aaaaggtatt atatraaaat 840 ttaaataata taacaaagaa tcctaatgat agacatttat taaatagtaa caaaagacta 900 atagtgtact atacaaattc aacatcttta aacaataaaa aaaatgattt tagagcttta 960 attgatattg agaagccaga tgtttttgct gttactgaaa catggtatac acctttatca 1020 gatgtcaata ttgagggtta tgtaatatat agaaacgatc ggctgaatgg taaaaaaggg 1080 ggtggtgttt gcatatacgt taatagttcc ataaaatcat atgaaatcag taatattgaa 1140 cttaaatccg atcttatcga gcaaatatgg acatggattg agtatggtca agaaaaagtt 1200 ttattaggtt gcatatatcg tccccctgac tcatcctgtg tcaccaataa aaatattata 1260 aattctatta tagcagctaa aagagcacaa gataaaaaac aattcactag tataataata 1320 acaggtgact ttaactacaa cactttaaag tggaaagaca gtgttgaaga atcacatagc 1380 caacaagctg aagattttgt agcatgttta aatgactgtt ttttcgaaca atgtgttact 1440 gaagcaacat ttcaaagcaa cgaaaacgaa ttaacaaata tacttgattt agtaattgta 1500 gatgatagta accgtatttt tggtttatcc catttgccac cacttggttt gagtagtaaa 1560 ggacatcata ttctaaaatg gttttataaa atgaaagatc gaatagatga cttctcttca 1620 tcaactaaaa aatttgcata tcgtagagga aactatgaaa aaattaatgt ttatataaaa 1680 agcataaatt gggctgacga atttcaaaac aaatgcattg acaaatgtta tgataagttt 1740 ttatatcatt ataatcaatc atgcaaaaat tatataccta ttgtaaaact aaaaaataaa 1800 gcaagtgaag tttggatgtc aaatagcttg agaaaagaaa tcaagtacaa aaataattac 1860 tggacaacaa ataaaaaaat gaactggtct atagctggct caaaagaaaa atataataaa 1920 atgaaaagaa acattttaaa acaaataaat aactctgtta aaaattacga aagaaacttg 1980 gtaaatcagg taaaaaatga acctaaatta ctatattcat atattagaaa aaagcagaaa 2040 gtaaaaaatc atataagagc acttaatgat aatattggtt caacaatgta tgacaaacat 2100 agtattgtcg atatttttaa caagtatttt cattctgtct acataaaaga agaaaatata 2160 aatgattgta attttgaagt acaaacaaat agtaggtttg gaaaaatatt cgttgatcaa 2220 gcacttgtga aatcatattt gtcaaaatta gacattaaca agtctagtgg gtatgacgaa 2280 gtaaatcctt atgtgttaaa agtctgctct gaaagttttt caataccact ttcaattatc 2340 tatcaacaaa gtctaacaaa tagtgatatt cctgatctct ggaaaatggc aaatgtgagt 2400 cccatattta aaaatggaag taaattgcaa gcaataaatt atcgaccagt atctttgaca 2460 tccatacctt gcaaagtact agaaaaaatt ataagtgata acttaaaaaa ctatttagaa 2520 tcaaataatc ttttctgtaa acaacaacat ggtttcttaa agggtaaaaa ttgtataaca 2580 aacttattag aatccttaga ttttttaaca taccaactaa tgataggcaa ttcaatagac 2640 attttatata ctgattttca aaaagcattt gatagagtgc ctcataagag attaatatca 2700 aaattatgtg gttatggaat agaaaaagaa atgctaaatt ggatagaaac atatttatca 2760 aatagaaaac aacgcgttat aattggtgat acaaaatcag attggttaga ggtgctaagy 2820 ggtgttccac agggttctgt gcttggacca ttattgttct tgttatatat taatgattta 2880 ccatctaaaa ttaaaaataa atgcgaactt tatgcagatg ataacaaaat tatagctgta 2940 gttaacaacc aggtagactc aaaaagcttg caagacgata ttaataatct aactgaatgg 3000 tccgacaaat ggatgataaa tttcaacatc aaaaaatgta aaatcatgca ttttggaaag 3060 cataataaga attttgaata ttttataaaa gattcaaaat tagattcaac aactaatgaa 3120 aaagacattg gattgttaat ttcaaatact atgaagtgga atcaacatgt aaatatggtt 3180 gtaaataaag caaatagtcg attaggtcta ctaagcaaat catttactta caaagacaaa 3240 aacacaatga agattttgta ttgcactttt gtaagaccta ttcttgaata tgcatcacca 3300 atatggaacc cttataataa aaatgacata cataaattag aaattgttca acaacgagca 3360 acaaagttga ttccagagtt aagacaccta ccctatgaag aaaggttaaa aaaactccaa 3420 ctgacttctt taaaaatcag gagattaaga tatgatttaa ttcaatatta taaaatattt 3480 cataatttgg acaaagtgtg ttggtatcaa caaccaaaaa cagcaatttc aatttcagca 3540 aatggtccag cttcaaacat tagaggacat aaagcacgtg tagatattga gctaataaaa 3600 agaaacagac caagggaaaa tttttttaca aatagagttg caaaatggtg gaataaacta 3660 ccagaagaaa tcatctccgc aaaaagtgta aatatcttca aggacaagct tgataaacat 3720 atggttatta tgaaactctg atctctatga ctattatgaa actgttaata gcaatggctc 3780 ataatagatt caacatggcg ttgaatctca aggagcataa ataaataaat aaattattt 3839 // ID Copia-36_CQ-LTR repbase; DNA; INV; 136 BP. XX AC AAWU01006152; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-36_CQ_; KW Copia-36_CQ-I; Copia-36_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-136 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 376-376 (2011). XX DR GenBank; AAWU01006152; Positions 950 1085. XX SQ Sequence 136 BP; 46 A; 26 C; 33 G; 31 T; 0 other; tggaagacga cgaagatcag atcacagtag actacgaata ggggagtcgg aggtgtaatg 60 ggacgacagt ctaaaataaa attcattcta ttgcaaacac tcaactgaag tagttcggtc 120 tctggctaat ctccca 136 // ID DNA8-27_AP repbase; DNA; INV; 763 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-27_AP. XX NM DNA8-27_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-763 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1769-1769 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 763 BP; 251 A; 140 C; 134 G; 235 T; 3 other; cagtggcggc tcgtgggttt ttaatgaagt agtgcacatc caaaaaaaat acacgaataa 60 ataataataa tataatatta atatacgtta atgtcggtcg tcgatgaaaa acagaaacac 120 gattgtcacg attaatgttt tgaatttccg ataatgatcg actactgact agctaatatt 180 atcagctgat aatcggaata taataaataa acgagactac gagagtaaaa ataaacttta 240 ataacataat aatgccgatg ccgacgcgca acgacgctgc tgcgacgcga cgtcgaaaat 300 aattcgctga aaacaattta attggctatg tgcacaccat agacctatta ctagttaatt 360 taattggcta tgtgcacaca aacgaatgtg cacaccaatt ccattataac gcgggttgcg 420 tgtttcgact tgctttaatt tttacactgt acaaatgtgc acagcgattg tgtgcacnac 480 tgcagtactg ccactataat agtacactat aataatacac gccgataccg ttccgtcgcg 540 gacggccgtt cggccgtcga aataacccac tccaaatatt aaatataata ttatatatta 600 ttattttcnt natattctga atttacgttt ttatattttt aaaaataatc ctaattacat 660 agtaggtact tagttgtatt attgtgattt atgttttcaa tttttcttga ggttgggtag 720 gtagtgcacg tgcacttgtg cacatacaca cgagccgcca ctg 763 // ID Mariner-21_SM repbase; DNA; INV; 2348 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-21_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2348 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1870-1870 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 690..2129 FT /product="Mariner-21_SM_1p" FT /translation="MDRSIIQKRAQKIYDNLKRTEQSSSEDQNSSHNFSAS FT RGWFENFKKRFSLHNIKIRGEIASADIESAKKFVPEFADIIKNKGYTPDQV FT FNADETGLYWKKMPQRTYISKNEKHAKGFKVAKERISLLLCSNSSGDSIMK FT PLFINRSLNPRALKGYEKNLPVYWRANKKAWTTSILFKDWFYNCFIPDARD FT YMRNKNLDFKILLILDNAPGHPLDLKYPNVEILFLPPNTTSLIQPLDQGII FT STFKAYYIRRTFEKILEDIDQGENFTLPQAWKKFSIRNCVDIVSASVKEIK FT QSTLNSCWKKLWPEVVSGENVLPPIEDEYQRTINICRGIGGEGFNDMDIPD FT IQEILEDRMLNEEELVEMLDEPTTINDTESSCSEAATCFTLKTINDGLELA FT IKLENFFIEKDPIMERSSKFKRELQQILEPYKEIRKDLYAKSRQSLITEFL FT KAKTTRNPESSDDENVQPISKKSKTYRIISDSDDE" XX SQ Sequence 2348 BP; 872 A; 353 C; 401 G; 720 T; 2 other; mrcactcgag cctcgtataa cacggtttta tacaacacgg ttgtataaat tgtggcatat 60 tttttataca acacgcttta tataacacgg tttcaaaaaa aaattaaata atttacatgt 120 ttaatttgta cgtccctgta atgtgcacat gacaaatctt actgagcgtg tcgagacttg 180 tcaattttgt ttgtttcgat acatttcagt tgcgctctgg cagtcaatgg tgcagtgcat 240 atttctttgt aatccgtaga ttattctggc ttattgtttg taagtataca ctttacatat 300 ttcatataac gtttgataat ttatacaaat tttatagatt taatcgtatc aatttattat 360 ttatttgtag atgaaaaata tttaaacccc aaaatgtcga cagttatgga taaaattaaa 420 aaaagagaaa ctttatctct attgaaaaaa attaatatcc tagattgtct taaaagaggg 480 gagaggccgt catcccttgc tgcaattttt aatttaaatg aagcaacaat taggacgata 540 caaaaaatga aaacaaaatt agatctatag ctttagcagg atcgtctatc agtgcgacaa 600 agattgcccg catacgctca gtaataatag aaaaaaatgg aaagagctct tatgatatgg 660 ctagaagact gcaatgacaa aacattccta tggacagaag cattatacaa aagagggcac 720 aaaaaattta tgataattta aaaagaactg aacaatcgtc atcggaagat caaaactcat 780 ctcacaattt ttctgctagt agaggatggt ttgaaaattt taagaagcga ttctcactac 840 ataatataaa aattcgagga gaaattgcgt cagctgatat agaatctgct aaaaaatttg 900 ttccagaatt cgcagatatt ataaaaaata agggctacac accagaccag gtattcaatg 960 cggacgaaac gggattgtat tggaagaaga tgccgcaacg aacctacata tcaaaaaatg 1020 aaaaacacgc gaaaggattc aaggtggcga aagaaagaat atctttacta ttatgttcta 1080 attcttcagg agattccata atgaagccat tatttattaa tcgatcctta aacccccgag 1140 cactaaaagg ttatgaaaag aatttacctg tttattggag agccaataaa aaggcttgga 1200 ccacaagtat tctttttaaa gattggtttt acaattgttt tatacctgat gcaagagatt 1260 atatgagaaa caagaatctt gattttaaaa tattattgat attggacaat gcaccaggac 1320 atccgctcga tttaaaatat cctaatgttg aaatattatt tttaccacca aatactacgt 1380 ctttaatcca gcccttagac caaggtataa tttcaacatt taaagcctat tacattcgtc 1440 gaacatttga aaaaatatta gaagacattg atcaaggtga aaattttaca cttccacaag 1500 cttggaaaaa attttctatt cgaaattgtg tcgatattgt ttctgcatcc gttaaagaaa 1560 taaaacaatc cactttaaat tcttgctgga aaaaactatg gccagaagta gtgtctggag 1620 aaaatgtttt accaccaata gaagatgaat accaaaggac aataaacatt tgtcgtggta 1680 ttggtggaga aggttttaat gacatggata ttcctgacat ccaagaaata ttggaggatc 1740 gtatgttgaa cgaagaagaa ttagtcgaaa tgttagatga acccaccaca attaacgaca 1800 cagaaagcag ttgcagcgaa gcagctacat gtttcacatt aaaaacgata aatgacggtt 1860 tggaattggc tataaaatta gaaaactttt ttattgaaaa ggatcctata atggaaagaa 1920 gttcgaaatt taagagggaa ttgcaacaga ttttggaacc ttataaggaa atacgtaagg 1980 atttatacgc taaaagcaga caatcgctta ttaccgaatt tttgaaggct aaaaccacaa 2040 gaaatccaga gtcctccgat gatgaaaacg tacaaccgat ttcgaaaaag agtaaaacat 2100 atagaattat aagtgatagt gatgatgaat gaatgccata ttatgttatt aaagtttttt 2160 tcatgaatat gtaaatgtat atttgtttat ttaaaaaaat aaagtatata tttatataaa 2220 ttatcttcgt tatttttttt ggaacgcaac ccctaatttt gtatgaactt caaaccttgc 2280 ataacacgga tttgtattac acgatgtttt tgtggaacca tataaccgtg ttatacgagg 2340 cccgagtg 2348 // ID DNA8-113_AP repbase; DNA; INV; 176 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-113_AP. XX NM DNA8-113_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-176 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2051-2051 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 176 BP; 61 A; 32 C; 33 G; 50 T; 0 other; cataggcgca actagacctc taaaacgtgg ggtgctataa tatttcatat tttaaaaaca 60 taaaatttat ggtatttcac tataataatg atagttagac ctaaaaaaac aaggggtgct 120 aaattagaac ttgggggtgc taaagaccct tttgcacccc cctagttgcg ccaatg 176 // ID Copia-134_AA-I repbase; DNA; INV; 4289 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-134_AA_; KW Copia-134_AA-LTR; Ty1_copia_Ele217; Copia-134_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4289 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1455-1991] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 78..3092 FT /product="Copia-134_AA-I_2p" FT /translation="MANPNAGSSASGDGVLVRQQQQSFNTQGHPHIERLVG FT RENWRTWKFAVKTYLEVEDLWEAVEPTPNEDGTLPAVDPRKDRIARGKIIL FT FLDPVNFIHVEEVKTAAEVWRKLSNTFEDTGLTRQWCLLHKLITTNLVSCG FT SMEAYVNRMISTAHQLIGIGFPLDDRWIGMLLLAGLPPEYRPMVMGLENSG FT IQITGDIIKTKLLQETVLAEPTTEPAFVSRHDSKSKHGKVTKATTTQGKGP FT QCRRCEKFGHIARYCNAKEPAQRNKSDKKGDAFCTVLSVREAGEDDEWYFD FT SGSVKHLTKNEMLLEHMRPADGSIYAANKGIMKIVAEGSAKLRPTCNPDTI FT DVTNVELIPELAVNLLSVGKIVDQGHSVTFTKTGCKVINPTGKIIATGQRS FT NGLFKLEQKPSKALACPSPETADLWHKRMGHLASRNLVRLRNGMVDGVRFH FT GNDLGNCKTCAMGKQTRLPFDHEGSRASEVLELVHSDIAGPMEENSLGGSR FT YYLSFIDDKTRKVFVYFLRTKSKEEVLQKFKDFHSMAERQTGKKLKVLRTD FT NGWEYKNWNFENYLKQLGVRHQTTVEYTPEQNGLAERVNRTVVERARCMLY FT EAKLAKPFWAEAVAAAVYLINRSPTKGTETTPEEAWTGREPNLSHIRTFGT FT TVMSHVPKQKRKKWDQKADECILTGYDDETKAYRLYNLKSKQIFKSRDVTF FT IDEGIIKRDNTPAGTTDDQRRSYVRLEFSENVLDEPLPTPADQIDAAQAND FT VNSSDESFQDADDTEADSALPPHQSSSTPVALRRSGRERFLSGKYKDFYIV FT NKGLPSTKFTDSKDDCPMPSTSVAEDSVALPSCSNQRLASVVTQQRLMLCD FT RGRLPQGKYLNDSDNECSDTGADFSETFLDDPVTHNEALSRDDADHWKQAM FT IDEYDALISNETWTLTDLPEGRKAIKCKWVYRTKLDVNGNVDRYKARLVIK FT GYSQRKGVDYEDTYSPVVRYSSLRYLFALAARWGCKWTKWMPLQHSCRAI" FT CDS 3038..4279 FT /product="Copia-134_AA-I_1p" FT /translation="MGLQVDQMDAVTAFLQGDLAEEIYMEQPPCFVDGGKK FT SKVCRLNKALYGLKQSSRVWNSKLDAALRKFGLESTVYDPCVYYRISGGKV FT LFVAVYVDDVLIFSNCKRWKNEIKGKLNQEFKMNDIGPAKYALGIRITRTK FT EAVSLDQEAYIDAMLDRFQMAKCNPVTTPMNTSEKLTKEMCPSTAEEEERM FT RDVPYREAVGCLMYLAQSTRPDICYAVNVLSRFNTNPGEQHWNAVKHLLRY FT VRGTSKFRLIYHRSKDSTIDGFSDADWASDHDDRKSTTGYVFTAQGGAISW FT CCKRQQTVALSTCEAEYMALSAAVQEALWWKRLRGLFEKEEAIVIHCDNQS FT AISVAKNGGYQPRTKHIDIRHHFIRDAWEKGEVKVAYIGTDKQIADGLTKP FT LPAVKMKICREVMGLQNPMG" XX SQ Sequence 4289 BP; 1216 A; 1008 C; 1176 G; 888 T; 1 other; ataggttatg ggccgtcgac acgtgtctag cggaagcgga agaaaattca agacttgaag 60 cttttttcac ttcaagaatg gcgaatccaa atgcaggaag cagcgcatcc ggcgatggcg 120 ttttggtacg acagcagcag cagagtttca acacacaggg ccacccccac atcgagcggc 180 tggtcggtcg agaaaactgg aggacctgga agttcgcagt caagacctat ctggaagtgg 240 aagatctctg ggaagcagtg gaaccaacgc ccaatgaaga cggaacgctc cccgctgtgg 300 accctcggaa ggacagaatc gcacgaggta aaatwattct gttcctcgac ccggtgaatt 360 tcattcacgt cgaggaggta aaaactgcgg ccgaagtgtg gaggaagctt tcgaacacat 420 tcgaggacac tggattgacc cgccagtggt gccttctcca caagctaatc acgacgaatt 480 tggtcagttg tggatcgatg gaggcctacg tgaaccgtat gatttcgaca gcgcatcagc 540 tcatcggaat tggctttccc ttggacgaca gatggatcgg catgttactg ttggctggcc 600 taccgccgga atatcgcccc atggtgatgg gactggaaaa ttctggaata cagataacag 660 gtgacatcat caaaaccaag ttgctacagg agaccgtttt ggcagagcct actacggagc 720 ctgcattcgt ttcgcgacac gattcaaaaa gcaagcatgg taaggtaacc aaggcaacga 780 ccacgcaagg taagggaccg caatgccgtc ggtgtgaaaa atttgggcat attgcaaggt 840 attgcaatgc caaggaaccg gctcagcgca acaaatcgga caagaaagga gatgcttttt 900 gcacggtgct atctgtgcga gaagctggag aggacgatga gtggtatttc gactctggct 960 cggtgaagca cctcacgaag aatgaaatgc tactggagca catgcggccg gcggacggta 1020 gcatctacgc ggcaaataag ggaatcatga agattgtcgc ggaaggatcc gcaaaactgc 1080 gacccacgtg caaccctgat acgattgatg tgacaaacgt cgagctgatt ccggagctgg 1140 cagtaaacct actgtcggta ggtaagatag tcgatcaggg tcattcggtt actttcacca 1200 aaacaggctg caaggtcatc aatccgactg gcaaaatcat cgctactggg cagcgatcga 1260 atggactgtt caagttggag caaaaaccat ccaaggcgct ggcgtgccca tccccggaaa 1320 cggcagatct gtggcacaag cgcatgggac atctggcatc gcgaaatctg gtaaggttgc 1380 gaaacggcat ggttgacgga gtgagattcc atggtaacga cctcggaaat tgcaaaacgt 1440 gtgcaatggg gaagcaaacc cggcttccat tcgatcacga gggatcacgt gcgagcgaag 1500 ttctggagct tgttcattcg gacatagccg gtccgatgga agagaactcg ctgggaggca 1560 gtcgttatta tctgtcgttc atcgatgaca aaactagaaa agttttcgtc tactttttgc 1620 gcaccaagtc aaaagaagag gtgctgcaaa agtttaagga cttccacagt atggctgagc 1680 gccaaactgg aaagaagctg aaagtgctga ggacggataa cggctgggag tacaaaaact 1740 ggaatttcga gaactatctc aagcaattgg gtgtccgtca tcaaaccacg gtggagtaca 1800 caccggaaca gaatggactc gctgaacgag tcaaccgaac ggtggtggaa cgcgcgaggt 1860 gcatgttgta tgaagccaaa ctagccaaac cgttctgggc agaggctgtg gccgcagcag 1920 tctatctgat caaccgatca ccaacgaagg gtaccgaaac aacaccggaa gaagcatgga 1980 caggtcgcga accaaatctg tcgcatatcc gaacgtttgg aactaccgtg atgtcccacg 2040 tgccgaagca aaaacggaag aagtgggacc agaaggctga cgagtgcatc cttaccggct 2100 acgatgacga gacgaaggca taccgcctgt acaacttgaa atccaagcag attttcaaga 2160 gccgggacgt aactttcatc gacgaaggaa tcatcaagcg agacaacaca ccagctggaa 2220 caaccgacga ccagaggcgc agctatgtca ggttggaatt cagcgagaac gtgttggacg 2280 agcccctgcc gacgccggct gaccaaattg atgcagctca agcaaatgat gtgaattcaa 2340 gcgatgaatc atttcaagat gctgacgaca cggaggccga ctctgcgctc ccgccgcatc 2400 aatcttctag cacgcctgtg gcgttgaggc gcagcggtcg ggagcgcttt ctctcaggca 2460 agtataaaga tttttatatc gtgaacaaag gcttgccgtc tacaaaattt acagactcca 2520 aagatgattg cccgatgccg tccacatccg tggctgagga ttccgttgcg ctcccgtcgt 2580 gctcaaatca aagacttgct agcgttgtaa cgcaacagcg gttgatgctc tgcgataggg 2640 ggcgcctgcc ccaaggcaag tatttgaacg atagtgacaa cgaatgttcc gatactggtg 2700 ccgatttttc agaaacgttc ttggatgatc cagtgacgca taacgaagcg cttagccgag 2760 acgatgctga ccactggaaa caggccatga tcgatgaata cgatgcttta atctccaacg 2820 agacgtggac gctgaccgat ttgccggaag gacgcaaggc gatcaagtgc aaatgggtgt 2880 accgaaccaa gctcgacgtg aacggaaacg tcgacaggta caaagccagg ctggtcatca 2940 aaggatactc gcagcgaaag ggggtggatt acgaggacac ctactccccg gtcgtccgat 3000 atagttcgct gcgatacctt ttcgctctgg ctgcaagatg gggttgcaag tggaccaaat 3060 ggatgccgtt acagcattcc tgcagggcga tttagcggag gaaatttata tggagcaacc 3120 cccttgcttc gtggatggtg gcaagaaatc caaagtttgc cgcctcaaca aagccctcta 3180 cgggttgaag caatccagcc gcgtctggaa ttcgaaattg gatgcagcgc ttcgaaaatt 3240 tggactggaa tctactgtgt acgatccatg cgtatactat cgtatcagcg gtggcaaggt 3300 gctattcgtt gctgtatacg tcgatgatgt cctcattttc agcaactgca aacgctggaa 3360 aaatgaaata aaaggtaaat taaatcagga atttaaaatg aacgatattg gaccagcaaa 3420 atacgccctg ggtatcagga ttaccagaac caaagaagcg gtttctctgg accaggaagc 3480 gtatattgat gccatgcttg accgttttca aatggccaag tgcaaccctg ttactacccc 3540 gatgaatacg agcgagaagc tgacgaagga aatgtgtccg tcaacagctg aagaagagga 3600 gcgtatgaga gacgtgccat acagagaagc tgtgggctgt ttaatgtacc tcgcccaaag 3660 tactcgtccg gacatctgct acgccgtgaa cgtgctcagc cggttcaaca ccaaccctgg 3720 cgaacagcac tggaacgcgg tgaaacatct cttaaggtac gtgagaggaa cttccaaatt 3780 tcgcttgatc tatcatcgca gtaaagattc tactatcgat ggattctccg atgctgactg 3840 ggcatcagac catgatgaca ggaagtccac gacagggtat gtatttaccg ctcaaggagg 3900 agccatatcg tggtgctgca agcggcaaca aacagtagct ctatccacat gcgaagccga 3960 atacatggcg ttatcggctg cggtacagga agctttatgg tggaagcgat tacgaggcct 4020 gttcgaaaag gaggaagcaa tcgttattca ttgcgacaac caaagtgcca tttccgtcgc 4080 aaagaatggt ggttatcaac cgaggactaa acacatcgac attcgccacc acttcattcg 4140 tgatgcgtgg gagaaaggcg aagtcaaagt ggcgtacatc ggcactgaca aacagatcgc 4200 ggacggcttg actaaaccac tacctgccgt gaagatgaag atttgccgtg aagtaatggg 4260 actgcaaaat ccaatgggtt gaggaggag 4289 // ID Ingi-4_AC repbase; DNA; INV; 4663 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Ingi non-LTR retrotransposons from a sea slug - DE consensus sequence. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; I group; KW Ingi-4_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4663 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 714..4649 FT /product="Ingi-4_AC_1p" FT /note="AP endonuclease, RT, RNase H." FT /translation="MHRTAGRPQTWVGRKATGSPANMAKKNKLNSSELGRR FT LVYQGPRHAVPLQAAGEKLATGKTEQFRLLQVNVCGLANKKIEISHLLDSR FT KIHVALLQETIHRNTDPYIPGYTHYTCTCPDCQGVITYIRNDIQGKVENIT FT TDTRTDTQKATIWYAGCKFTIFNIYSPPGNDCSFSFLQESIYTKTIVAGDF FT NGHSPQWGYRTYNKSGMAIEELCGTTNLLVLQDKKSTPTLLHRAHNTLSRP FT DLTILSSDLFQRHTTEVLDGIGSDHRPIMTSVSTPSQSHFVQKTKWNFKKA FT NWELYKTTTEKLLKDITPNPSVDVFCGAITDSLLKAAALCIPRGCRKKYKP FT FWNKNIEQAVQDRENARKTLESNPTTGNKINYKKASAKVKQTTLLAKRNKW FT STTVADLDLRRNGAKAWSLLSNLCGEKRRNNPKPMDTEEGTIVEDQKKAET FT FNKFFASINKATPHTDQHRDCLKRLKEKEHAPNASISLFEDDFTLTELNRA FT MKKLKPRKSPGPDKIHNEMLVNLSPIGKGYILQLINKTWNENTIPRAWRNA FT IITPILKKGKPAEEMKSYRPISLTSCIGKLAERMVNHRLYWWLETTGALDS FT NQAGFRAGQRTDDQLFRLSQKVLDGFQEKKHTVAVFVDLQQAYDRVWRKGL FT LWKMTQTGIHGKLYRWVKTFLTDRTIQTRINNGVSSKAVLEEGLPQGSSLS FT CTLFLVFINDLPETLKSEKALYADDLAIWHTSKQAGISARLINEDLDRFNN FT YCCKWKLKANCTKTIYTVFTKSHKVAKQNLTILLAGNKLEKVENPTYLGIE FT LDRQFSCKNHIWNLKTKANKRLNLLRRLASTSWGADKDALRQLYVGYVRSA FT MEYNLTLQTICSKPTQRSLDKIQNQAVRFISGALRSTPTAACEIHTNLEPL FT DLRREAAVVEMSERYKRLEKNHPNRKIIEQRRPKKRIHGRSILDTATELAE FT KHHLPCNREMLKPLRPELPPNTNIKNANIKTSLIKETVKKECDPVELKTTA FT LKTIDSYPSDSIHIYTDGSAFKGTINAGFGVRIEHPDKTCEELFDACGTSC FT TNYEAEAIAIEAAILHLTNIFIIHPNKIKNTIIFTDGKSVLQALENMNFES FT PACLSVAQTINTFIQTYAVDLTLQWIPGHSNIPGNDRADTLAKKGATQTQP FT HNPVSQQTAKNIIKCNTREQWLNGWAMGKTGRTVFAHMTAPNSKDAINSLS FT RREQVTIFRLRTQHIPLNSYLNRIRKDTPPDCPLCDCQDETVPHHLFNCPA FT LCDLRSTYLPASPDIGNTLYSDKAQLIKTCQFFAISTDRRARTQMAAGSEK FT " XX SQ Sequence 4663 BP; 1536 A; 1244 C; 964 G; 919 T; 0 other; ggggatcaag ggctggtgaa gggcctcatt taaaagatga gaaacaccca agcgtctcat 60 atagcttctc ccagtctagc caaccgcacc tactcgtggt gatccctgtc tgcactgacg 120 tggctctggc aaatggcgaa tcgttggcgc caacaaacca accccccttt gccaggaggg 180 gtaaaatcaa caggctggga gtaggtgtga ccccacaagg tgcggagggc agaacccacc 240 ggcgtgtggt agacgctatg ccccccacca acacactctc actctccctt gagtgagtgt 300 cctcctctgc tacctcacct acttcttcta cgcactccct gtggttggct tgtgctctac 360 ctttggcctt gtgcatctcc atggattctc ctggtccctg caaggatgaa cacaccactg 420 ggtgggatag gcctctggga catggcatgc ctaaacatgg acgtctacgt accgacacaa 480 aggatccagg acgtgacaag ccgctcgtct gtgtgatttt ggctcaacaa tgctgccagg 540 gtcagaccag attaagttga ccccccacag cccgccagga gacagcggtg gcggggaatg 600 gaccacagtg gatcgtgcct tgaccacatg taaaacggaa gccagtccct gctcagcgcg 660 ttgccccact tctgtgagag acaaaacttg tagctcagaa atctcaatgt cagatgcacc 720 ggactgctgg tagaccacag acctgggtgg gaagaaaagc cacaggctcc ccagcaaata 780 tggcaaagaa aaataaatta aattcatccg agctcgggcg acgcctcgtt tatcaaggcc 840 cgcgccatgc agtgcctctc caggctgctg gtgaaaagtt agctactggg aaaacagaac 900 agtttcgact actccaggtc aacgtttgcg gcctcgcaaa caagaaaata gaaatttctc 960 accttttgga ctcaagaaaa atccacgttg ccctattaca agaaactatt caccgtaaca 1020 cagaccccta tatcccaggt tacacccatt atacctgcac atgccctgac tgccagggtg 1080 taatcaccta catcagaaat gacatacaag gaaaagttga gaacatcaca actgacacaa 1140 gaacagacac ccaaaaagct acaatttggt atgcaggttg caaatttaca atctttaaca 1200 tctacagccc cccaggaaat gactgctctt tttcatttct tcaagaatcg atatacacca 1260 agacaatagt tgcgggtgac tttaatggcc actctcctca gtggggatac agaacataca 1320 acaaaagcgg aatggccata gaggaactct gtggcacaac aaacctcctt gttctccaag 1380 acaagaaatc tacccctacc cttttacaca gggctcacaa cacactcagc aggcctgact 1440 tgaccatcct ttcatccgac ctcttccaaa gacacaccac agaagtcctt gacggcatag 1500 gtagcgacca cagacccatc atgacttcgg tgtcgacccc aagccaaagt cactttgtac 1560 aaaagaccaa gtggaatttt aaaaaggcga actgggagct ctacaagaca acaacagaaa 1620 agctcctgaa agatattaca ccaaacccct ctgttgatgt cttctgtggc gcaataacag 1680 attccctgct caaggctgcg gctctctgca tccctagggg ttgcaggaaa aagtacaagc 1740 ctttctggaa caagaacatc gaacaagcag tccaagacag agaaaatgct cgaaaaactc 1800 ttgaaagcaa tcctacaaca ggaaacaaaa ttaactacaa aaaagcctca gctaaggtca 1860 aacagacaac cctcttggcc aaaagaaaca aatggagcac cacagttgca gaccttgatc 1920 tacgtagaaa cggagccaaa gcatggtcac ttctctccaa cttgtgtgga gaaaagcgcc 1980 gaaacaaccc aaaacccatg gacacagagg aaggcaccat cgtagaagac caaaaaaagg 2040 cagaaacctt caacaagttc tttgcttcta taaacaaagc tacaccccac actgaccaac 2100 atagagactg tctgaaaaga ctgaaggaga aagagcatgc gccaaatgca agcatctctc 2160 tctttgaaga tgacttcaca ctcactgagc tcaacagagc aatgaagaaa ctgaagcccc 2220 gcaaatcccc aggaccagat aagattcaca atgaaatgct tgtcaacctc agtcccattg 2280 gtaagggtta tattctccag ctgatcaaca agacgtggaa tgaaaacacc atccccagag 2340 catggagaaa cgccatcatc acacccattc tcaagaaagg aaaacctgct gaagaaatga 2400 aaagctatcg acccatttca ctcacctcat gcatcgggaa actagcagag agaatggtca 2460 accaccgcct atactggtgg ttggaaacaa ccggtgccct ggacagcaac caagcaggat 2520 ttcgtgcagg ccaacgaaca gatgaccagc tcttcaggct cagtcagaaa gtccttgatg 2580 gcttccagga aaagaagcac acagttgcag tattcgtcga tctccagcaa gcctacgaca 2640 gggtttggag aaaaggcctc ctgtggaaga tgacacagac aggaatccac gggaaactct 2700 acagatgggt gaagaccttc cttacagaca ggacaatcca gaccaggatt aacaatggtg 2760 tgtcatccaa agctgtactg gaggagggcc taccccaggg ttcctctctg agctgcacac 2820 ttttccttgt tttcatcaac gacctgcctg agacactgaa atcagagaag gccctatatg 2880 ccgacgacct agccatctgg catacaagta aacaagctgg catcagtgca cggcttataa 2940 atgaggatct agatagattc aacaactact gctgcaagtg gaaactaaaa gcaaattgta 3000 ccaagacaat ctacaccgtc tttacaaaaa gccacaaagt tgcaaagcaa aatctcacca 3060 tcttacttgc aggaaacaaa ctagaaaaag ttgaaaaccc gacctacctt gggatcgagc 3120 tggacagaca gttttcatgc aaaaaccata tctggaacct gaagacgaag gccaacaaga 3180 gacttaacct tctcagacga ctggctagca cttcatgggg agcggacaag gatgcactca 3240 gacagctata tgttggctat gtacgatctg ccatggaata taacctaacc ttacagacaa 3300 tctgcagcaa accaactcag aggtctcttg acaaaattca aaatcaagct gtacgattta 3360 tctccggagc tttgagatcc actccaactg cagcttgtga aatccatacg aacctggaac 3420 ccttggactt gcggagagag gcagcagttg tagagatgag tgagcgctac aaaagactgg 3480 agaaaaacca cccaaacagg aaaataattg aacagagacg accaaagaaa cgaattcatg 3540 gaagatcaat acttgacaca gccactgaac tagcagaaaa acaccatctt ccatgtaaca 3600 gagaaatgct gaaaccttta agaccagaac tccccccaaa cactaacatc aagaatgcca 3660 acatcaaaac ctccctcata aaagagacag tgaaaaagga atgtgaccca gtagaactga 3720 agacaacagc tctgaagaca attgactcct accctagtga ttcaattcat atttatacag 3780 acgggtcagc cttcaaggga actataaacg caggctttgg ggtaagaatt gaacatcctg 3840 acaaaacatg tgaagagcta tttgacgcat gtggtacttc ttgtacaaat tacgaggctg 3900 aggcaatcgc cattgaagca gcaatcctcc acctgaccaa cattttcata atacatccaa 3960 acaaaatcaa gaacactata atcttcacag acggaaagtc agtactccaa gccttggaaa 4020 acatgaattt cgaaagccca gcctgcctat ccgtggctca aacaatcaat acgttcatac 4080 agacctatgc cgtcgacctg acactacagt ggatacctgg ccacagcaac atcccaggaa 4140 acgacagagc agacaccctt gccaagaagg gagcaacaca aacacaacca cacaacccag 4200 tatcacaaca gacagccaag aacatcatca aatgcaacac cagggaacag tggctgaatg 4260 gatgggcaat gggcaaaact ggaagaactg tatttgcaca catgacagcc cccaactcta 4320 aagatgcaat aaattccctg agtagaagag aacaagtcac gatatttcgc ctgcgaacac 4380 aacacatacc actaaattcc taccttaata ggatccgcaa ggatacccca cccgactgcc 4440 cgttgtgcga ctgccaggat gaaacagtac cacaccacct ctttaactgc ccagcacttt 4500 gtgatctgcg tagcacctac ctacctgcct ccccagacat cggaaacact ctatacagcg 4560 acaaagcaca gctgattaaa acctgtcaat tctttgccat atcgacagac cgaagggcta 4620 gaacccaaat ggccgctgga tcggaaaagt aaaagtaaaa gta 4663 // ID Copia-5_SI-LTR repbase; DNA; INV; 196 BP. XX AC AEAQ01008405; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_SI_; KW Copia-5_SI-I; Copia-5_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-196 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01008405; Positions 604 409. XX SQ Sequence 196 BP; 38 A; 53 C; 40 G; 65 T; 0 other; tgttaaagtc cctatctgat ctcgcactga tgcttttcga tacgcgcgtt accatcccgt 60 gcgccctctc tctatcgccg tgacccgtgt gcgagtcttg agagtgccgt agttcatgtt 120 cttcatcact tgatctttgt ctgtaatcgc aacgtgtctg tcatacaata aagtctctgt 180 accgtgtgat taaaca 196 // ID Academ-1_DPu repbase; DNA; INV; 6119 BP. XX AC ACJG01001373; XX DT 26-FEB-2011 (Rel. 16.02, Created) DT 26-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE Academ-type DNA transposon from Daphnia. XX KW Academ; DNA transposon; Transposable Element; Academ-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Direct Submission to Repbase Update (09-FEB-2011). XX RN [2] RP 1-6119 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX DR EMBL/GenBank/DDBJ; ACJG01001373; Positions 33882 40000. XX FH Key Location/Qualifiers FT CDS join(518..1552,1580..1972,1933..2631,3076..3597, FT 4155..5198) FT /product="Academ-1_DPu_1p" FT /translation="SVEKNMCPTKTSIVLRNKCCQIIVLMIMISRQIGTRN FT NLKIDNTACIFCKINPKHNEKENISYVQTLDFKQTIIKKCKSRNDSWANEV FT LANVICIGDLFIHKTCYHISCYQRFVKGKNKPSAFHPTSEIPSKNITDLKR FT KAVKRKSEQHVPGQNTDNERKLAFLQAVQYLEEMDEKLFTIKDFRNLMGSY FT GCEPYSTTYIKSKLLDYFGCDITFFSSDGIHDIICLSASTEKILHAFYLKS FT EKNKLHSAEERKLRTMIEAAKIIIQDLKTINTNSELYDIFQKLESPKDALE FT FVPVSLQIFLTTIITAKSNQSKIASVAQSIMQLACPKTILAPLQVIYIYFK FT NIQIGLAVQLHLLHGSRALIDILHSHGFCSSYNEVLKFERSAAVTSNSNMD FT ISQESFVQYVADNADHNLCTLDGKGTFHGMGIIATITPCFKKTNYIVPRKK FT VNFTSYVFLNKNNNFFYRLQLMKYYNVVKSKVTIDEILQCGEIKIIRHSID FT YKILSTIKFRTLEPLFTEDRTTNLDLLWLASWSFKHERPGWQGCMDASLSR FT NSHPGKSEVVFLPMIDMSASDENCIYSTLHFIATQAQNNGYSPILTFDQPL FT WWKAMLIIENADPSCRIKNIILKLGGFHTLMSFLSSIGHVMAGSGIEEILS FT LIYAENTVPHMLSGKAYTRAVRGFLLLERAINALFHENTIFKNDNIVVCKK FT VICFIFYSNVLVDPSSGVSLHPKNNMYSIMMSGVSLDNLEFINQSATIRKL FT KTIVLKNIENNSNCKTGKLWIQLLDMVSIMKNFIKAERMGDWMLHLASTLK FT MLPFFAATGHNNYINSSYLYVQNMVKLAVENKNVHEKFLNGLHVGRRTDKL FT WAGLSHDHIIEQELMRSMKSTGNEIFSSVTYPLSLFNKNCVLRNANKSELA FT KEIAVMCTYDPGKEEISLQEELTFVLDGGWLLHRLSWTKQDSYIDLFSKYS FT DYVMKCYGKGTVVVFDVYNGPSTKDMAHQKRTKEVGREVLFTNDMKINFTK FT EEFLSNQRNKQKFLIELGKLLDEKGIVVKHADGDADLKIVTTAMECAIAKN FT VVVVGEDTDLLILLIHYCKESNHNMYMKSESKSKKCGKLWNIKKIQDSLGK FT ELCSSILFCHAFLGCDTTSKPFGKEKCASLKLQNTNSDFKIVSKIFYESES FT TKQDIDTAGENAMCIVYGGLVIDGIDRLRYQIFQKKVNNAKLTKSIIPEEL FT PPTQAALKFHSRRAYFQVFKF" XX SQ Sequence 6119 BP; 2228 A; 986 C; 1089 G; 1816 T; 0 other; aaaattagaa aataagaaat gggggactaa atttgaatat tttattattt ttagtacatt 60 ttatcaaact ttatacgtca ctaaacgtgt gttgaaaatt taaatgtttt tggtgtagat 120 ttactatttt ttatttaatt taaaagtttc tgaaattggt ttccgcgtcg aaattttttc 180 gccgtattta acacggcgct tatggcgcta cgcgtcttct taaaattaat gctacataac 240 aactaaataa ccttaaaatc cccgaatagt ctagttttgt agccaatata acctcgttta 300 ctttttatta attatatttt atgaaaatag cccagcagaa aagttattca cattataaaa 360 aaaaatatcc cttttgggct ttttccaaat tccgcaaaat tccaataggt tttacaacac 420 acccgccagg gtgacatttt ttggggggcc gctgcaactt ttactcgact attagtaaaa 480 cgtcttgtta ttgttatcaa aatattcaaa atgttgaagt gtcgaaaaaa atatgtgtcc 540 aacaaagaca tcgattgtgc tacgaaacaa gtgctgtcaa ataattgtgc tcatgatcat 600 gatctcaaga caaataggaa cgaggaataa tcttaaaatt gataacacag cctgcatatt 660 ttgcaaaatc aatccaaaac acaatgagaa agaaaacatc tcttatgttc aaactttaga 720 tttcaaacaa acaattataa aaaagtgcaa atctagaaat gacagctggg caaatgaagt 780 gttggctaat gtcatttgca ttggtgattt gtttatacac aaaacgtgtt atcatatttc 840 ttgttatcag cgttttgtga aaggaaaaaa taaacccagt gcttttcatc cgactagtga 900 aattccttcc aaaaacatta ctgatttaaa acggaaagca gtgaaaagaa agagtgaaca 960 acatgttcct ggtcaaaata ctgacaacga gagaaaattg gcttttcttc aagcagtaca 1020 atatttagaa gagatggatg aaaagttatt cactattaaa gatttcagaa atttaatggg 1080 ttcatatggt tgtgagccat actctactac ttacattaag tcaaagttgc ttgattattt 1140 cggatgtgat ataacgtttt ttagtagtga cggaattcat gacataattt gtttatcagc 1200 atctacagaa aagattcttc atgcatttta tttaaaatca gaaaaaaata aattgcactc 1260 ggcagaagaa agaaaactac gaacaatgat tgaagcagca aaaattatca tacaagactt 1320 gaaaacaatt aatacaaact cagagctata tgatatattc cagaaacttg aaagtccaaa 1380 agatgctctc gaatttgtgc ctgtctcact acagattttc ttgacaacaa taatcacagc 1440 caaaagtaat caatccaaaa ttgcatcagt agctcaatca attatgcagt tggcttgtcc 1500 aaagacaatt ctagcaccgt tacaggtaat ttatatctat tttaaaaaca tttaattgat 1560 ttaaatttca accgtataac agataggact tgcagtacag cttcatttac tacatggttc 1620 aagagcctta attgatatac ttcacagcca tggattttgt tcttcataca atgaagtact 1680 gaaatttgaa agatccgcag cagttacttc gaactcaaac atggacatat cgcaagaatc 1740 ctttgttcaa tatgttgccg ataacgcaga tcacaatttg tgcactctgg acggaaaagg 1800 cactttccat ggaatgggca ttatcgctac tataacgcca tgttttaaga aaactaacta 1860 catagttcca cgaaaaaagg taaattttac tagttacgtt tttctaaaca aaaataacaa 1920 ctttttttat aggttacaat tgatgaaata ctacaatgtg gtgaaatcaa aataattaga 1980 cactcaatcg actataaaat attgtccaca atcaaatttc ggacattgga acctttattt 2040 acagaagaca ggactacaaa tctggattta ctttggctag catcgtggtc atttaaacat 2100 gaaagaccag gttggcaagg atgcatggat gcatcactgt caagaaatag ccacccagga 2160 aaatctgaag tagttttcct cccaatgata gacatgtcgg ccagcgatga gaactgtata 2220 tattcaactt tacatttcat cgctactcaa gcgcaaaaca acggctattc accaattttg 2280 actttcgatc aaccattgtg gtggaaagcc atgttgatta ttgaaaatgc agatccttca 2340 tgccgaatca aaaacattat tttaaaacta ggaggatttc atacactaat gagtttttta 2400 agtagcattg gtcatgtcat ggcaggatcc ggaatcgaag aaatattatc gctcatttat 2460 gccgaaaata ctgttcctca catgttgagt ggaaaggctt acacaagggc agttcgaggt 2520 tttcttttac ttgaacgtgc catcaatgct ctatttcacg agaatacaat tttcaaaaac 2580 gacaatatcg tggtatgtaa aaaagtaata tgttttatat tttattccaa ttaattcatt 2640 tgaacatttt ataaatatag gaaggcaatt acagtgacag tacccaatcc aacacttgtg 2700 atgatgttca aacgtctaca tgtttaagta acgttgactg tgatgaaatt aatcgtctgt 2760 actgttgaga atatacaagg ttattcgagt cacatgaggc atctgtatta actaatacag 2820 ggtacaaagt tccaatgtgt ggaacataca taagtgacac attataacat tgagaaaact 2880 caccagaatc gaataagcac aatacgggtg tgtgcagttt ccaaatagtt tatttggaac 2940 aaaaaggaaa aacgagagtt ttgaatgaca agcgactatg gcgtcgttac cagttgacag 3000 gcagagagag agagagaagg ataggctaaa cagcaaccac aagtaaggaa ccttcactag 3060 attataggcc ggtgagtatt ggtagatcca tctagcggtg tgagtttgca cccgaaaaac 3120 aacatgtaca gcattatgat gagtggtgtg agcttggata acttggaatt tataaatcaa 3180 tcagcaacga ttcgtaagtt aaagacgatt gtccttaaga acatcgagaa taattcaaat 3240 tgcaagactg gaaaactatg gattcaattg ctggacatgg tgtcaataat gaagaatttc 3300 attaaagcag aacgaatggg agactggatg ttacaccttg catctacttt gaaaatgtta 3360 cccttcttcg cagcaaccgg gcataataac tacataaatt caagctattt gtatgtacag 3420 aatatggtta aattggcggt tgaaaacaaa aacgtacacg aaaaattctt gaacggatta 3480 catgttggca gaagaacaga taagctctgg gcaggtctat cacatgacca tatcattgaa 3540 caagaactaa tgcgatcaat gaagtccact ggtaatgaaa tttttagttc tgtaacttga 3600 atctttcgta accatctttt tttaaaggtg gagttactcg tagaaaaggg atggaggaaa 3660 tcgaccgatt aaatttgggt tttatcgaga cctacttgct ctaaaattaa ttcgctaatg 3720 cagaatgtca tagaagtcca actttcaaca agtgagcagc acctagagct taataattca 3780 aggaaaaaga gggacatcgc agacattcat acattaatta attatctcag acctcacatc 3840 ctttcattgg aggacacaac gaagtaaaaa acattgcaac tggagtgtta tctgttggta 3900 gtgttaacgt ttatgaagcc aaagaaatag gaaataacat tattcaaaaa atgtctggat 3960 catctattga agattttacc gcacggaaga aagatcaatg cattttgatg actgcaaaat 4020 ctaatcctgg agcaaataag aaaagcgtca atattgaccc aaatttactt tttcaaatga 4080 ttattgtctt gttaacatcc aaaaaagaag aatacggaac gttaacggac tacttcaaat 4140 acgaactgag ttaatatcct ttatcactgt tcaataaaaa ctgtgtttta agaaacgcaa 4200 acaaatctga acttgccaaa gaaattgctg tgatgtgtac ttatgatccc ggaaaagaag 4260 aaatttctct acaagaagag cttacgttcg tgctggatgg gggctggtta ctacaccgct 4320 tgtcatggac aaaacaagat agctacattg atttattttc aaaatactcc gactacgtga 4380 tgaaatgcta tggaaaagga actgtagttg tgttcgacgt atacaatggt ccatcaacta 4440 aagacatggc gcaccaaaag agaacgaaag aagttgggag agaggtattg ttcacaaatg 4500 acatgaagat caactttacc aaagaagagt ttctttctaa tcaaagaaac aaacagaaat 4560 tccttataga attaggaaaa ttgttggatg aaaagggcat tgttgtaaag catgctgatg 4620 gagatgccga tttgaaaatt gtaactactg cgatggaatg cgcaattgca aagaacgtag 4680 ttgtagttgg tgaagacaca gacttgttaa ttttactgat tcactactgc aaggaatcta 4740 atcataatat gtacatgaag agcgaatcca aaagcaagaa atgtggtaaa ctttggaaca 4800 ttaaaaaaat tcaagattct ttaggaaaag aactatgcag ctcaattctc ttttgtcatg 4860 catttcttgg ttgcgatacc acgtcaaaac cttttgggaa ggaaaaatgt gcctcactga 4920 agctccaaaa tactaatagt gattttaaaa ttgtgtccaa aatattctat gagtcagaat 4980 ctacaaagca agatattgat acagccggtg aaaatgcaat gtgcattgtg tatggaggat 5040 tggtcattga tggaatagat aggctgcgtt atcaaatttt ccagaagaaa gtaaataatg 5100 ctaagcttac caagtctatt attccagaag agttaccgcc aactcaagca gctctgaaat 5160 tccacagcag aagagcatat ttccaagtat tcaaattcta agtaaaaaaa taataactta 5220 ataagaaata aatattgtat atagatccaa caatggttag gaaaaaagtt atcagaactt 5280 gattggggat ggactatcaa tgatggatta ttttacccga agacaacaga tctccaacct 5340 gcaccaaagg agatattgaa aatgattaaa tgtggatgca taggtacatg tgacagtaac 5400 aaatgttctt gtcgtaaaaa tgatatgaaa tgtactgtag cttgtaagag aatgacgaaa 5460 tgatgtaata cacaaatgaa aagttatgaa atattattat tctattacaa ttacactata 5520 taacattata ttgacaatta cacttagacg ataaaatacc ttgaaaagat tatccatcat 5580 gcaataataa gaaatcaatc actggtgcta gcagaacatt ttgataacaa taaaaagacg 5640 ttttactaat agtcgagtaa aagttgcagc ggccccccaa aaaatttcac cctggcgggt 5700 gtgttgtaaa acctattgga attttgcgga atttggaaaa agcccaaaag ggatattttt 5760 tttataatgt gaataacttt tttgctgtgc tattttcata aaatataatt aataaaaagt 5820 aaactaggtt atattggcta caaaactaga ctagtcgggg atttttaggt tatttagttg 5880 ttatgtagca ttaattttaa gaagacgcgt agcgccataa gcgccgtgtt aaatacggcg 5940 aaaaaatttc gacgcggaaa ccaatttcag taacttttaa attaaataaa aaatagtaaa 6000 tctacaccaa aaacatttaa attttcaaca cacgtttatt gacgtataaa gtttgataaa 6060 atgtactaaa aataataaaa tattcaaatt tagtccccca tttcttgttt tctaatttt 6119 // ID Mariner-1_DMac repbase; DNA; INV; 478 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Mariner-1_DMac. XX OS Drosophila maculifrons OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; guarani group; OC guaramunu subgroup. XX RN [1] RP 1-478 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones that show less than eight percent divergence. XX SQ Sequence 478 BP; 132 A; 113 C; 124 G; 109 T; 0 other; tttgggtgcc gcacgagctg aataaaaaaa aatctcttgg accgaatcaa cgcttgcgat 60 tctctgctaa aacggaacga agcggatggt gagtggtgat gaatagtgga taacgtacga 120 caacgctaag cgaaaaagat cgaggtcgag aaggggtgag ccggcccaaa ccatcgccaa 180 acccggattg accgccacga aagtttttct gtttgtttgg tgggattgga agggaattat 240 ccactatgag ctgctcaact atggccagat ccttaattcg gtcctctact gtgagcaact 300 tgaccgtttg aagcaggcaa ttaaccagaa gcggccagaa ttggtcaata ggcatggtgt 360 tgtgttccat caagacaaag ctcgtcctct cacatctttg atgacccggc caaaagctac 420 gggagctcgg atgggatgtt ctatcgcacc cactgtactc acctgaccta gccccaag 478 // ID Dorna1cons repbase; DNA; INV; 507 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dorna1cons. XX OS Drosophila ornatifrons OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; guarani group; OC guarani subgroup. XX RN [1] RP 1-507 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with show less than eight percent divergence. CC Dorna1cons. XX SQ Sequence 507 BP; 124 A; 130 C; 139 G; 114 T; 0 other; tttgggtgcc gcacgagctg acgcaaaaaa acatttttgc ccgtatggat gcatgcgaat 60 cgcttctgaa tcgcaacaaa ttcgacccgt ttttgatgag gatggtgact ggcgatgaaa 120 agtgggtcac ttacgacaac gtgaagcata aacggtcgtg gtcgaaagcg gtgaagctgc 180 ccagacggtg gccaagcatg gattgacggc caggaaggtt tcttctgtgt gttttggtgg 240 gattggcagg gaatcatcca ctatgagctg atcccctatg gctaaacgct caattcggac 300 ctgtactgcc aacaactgga cccgcttgaa atgcagcact catgcagaag aagggccatc 360 tttgatcaac cagaagccga attgttcttc catcagggac gacgccaggc cacacacaat 420 cttttggtga cgcgcccaga aagctcccgg gagctcggga tgggaggttc ttttgccatt 480 ccacccggta ctccccccgg aactgcc 507 // ID DNA-1_PPac repbase; DNA; INV; 805 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Non-autonomous DNA transposon from the Pristionchus pacificus DE genome. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-1_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-805 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 952-952 (2010). XX DR [1] (Consensus) XX CC ~98% identical to consensus. XX SQ Sequence 805 BP; 203 A; 191 C; 208 G; 202 T; 1 other; taggggccac aacgtagatt gccgaattgt cacaatccgg ttaaatttgt gtccaaactg 60 tccctcatgg acccaataac atcacccttt ttaccgtttg ttgaaatctt gtctctgagc 120 tgcgctacag cccgagaaga acgcgcgaaa ggcaccgttg gagagggggg aaaggcgaga 180 cgacagggaa aggggcggag cctattctgc ccatttcccg catctcagcc ggccatagtt 240 tcaccnagtc gagcttgcgc tttgcgacgc tcaaagtcga aatcgaaggg agatatagac 300 gagagcgtgc gctagtccca ccgggtaaga acagcgggga gagcgaggcg gagggagagg 360 gagtgtggtg tgtctctcgc ctctctcctc tcatttgact atcccctctg gcgctcacac 420 tgtgtttagc gtgagtagga agcgaggaaa tacgcgcgta ctgaaacctt taaaacaaag 480 tcgcaagctt ttgaagcatg ataaaattct gtttttttca ctaaacttcc taaaatattc 540 gtatttgtcg aaaaattgct gcccaaccaa aaagtgcaca ctgcaacccg tcaaaacggc 600 ctattcccat cgtttaggtt caattttgac ctactcagtg aagatgtgca cattttgggc 660 agtgctcaat cgattgggaa tcccagtagc tatccagcgc actttagaat gagtgatttg 720 gtcaagaatg aaccttgcga ctttgatttc tcgcgttttg tggggagtcg tgtctcatca 780 aagtgggcgg agcttgtggc cccta 805 // ID Transib-N1_DYa repbase; DNA; INV; 1521 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.06, Created) DT 15-MAY-2009 (Rel. 14.06, Last updated, Version 2) XX DE Putative non-autonomous Transib-type sequence: consensus. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW Transib-N1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-1521 RA Jurka J.; RT "DNA transposon families from fruit fly."; RL Repbase Reports 9(6), 1154-1154 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(773..1018,1320..1520) FT /product="Transib-N1_DYa_1p" FT /translation="MSLYPWFPMSATVHKVLMHGYQIIEASIXPLGVLDKN FT ARRLATNTIRVTGGYMLERVHGRTTWQTFSTEQWIPQILLCRASIKHLLNL FT TSEFKIKRKPISYGRLGKAWSDRLKFGYVFICIDTIICTKFQEPSFKTLIF FT VEKSARWPNV" XX SQ Sequence 1521 BP; 500 A; 249 C; 267 G; 495 T; 10 other; gcacattggg ccagcgagcc gatttttcga cgaaaattca aatttttcat agaaagtaaa 60 cctgttttaa taataaaatt tgaatagtat atacttcgwc taaaatkaat caaaagattt 120 tagttgtttt gatgcatyyc tgaaattttg gcagcccttc taagtcagst gtttgattgc 180 atgcgtgcga aactaaaaag cgcaggaaaa attttgttga cttgcgagcg tgccaaaatt 240 tttgtggttt aaccaatttt cacaaaaatt atggaaaaga aggaccaagg tatgtagaat 300 atttgtatta aaaaaatttc gttcatgttt gttaacgtgt tttttgtgtg cttttatttg 360 tgttgtccaa aagtagaaaa aagctaacct gctaacccgt cwtttgtgga gtataactaa 420 ttttgtgcca atttgcgcca aaaagtaatt gtcattatag tgcaaagtgt atataaacta 480 attttaaaaa tcttttttgc kcctgcttgt tccgtttcga gttggatata taattcttaa 540 gaaaacggtt gaagttgagt tcaccttaat atgtatacca aatggcttaa gttccttcaa 600 atcgtctaca tcaagacgat ttggacgccc acctaagcta ctagctgaaa ttttaggtct 660 tgatgaattc ctcattgcat attttagcaa tattttaatt gctatttctt gtgacctacc 720 aattgatcca acacaattta aaaawtattg ccagaaaacc ataactcaat agatgtcctt 780 gtatccatgg tttccaatgt cggcaacggt ccataaagtg ttaatgcacg gctaccaaat 840 aatagaagca tcaatcyttc cattaggtgt tcttgacaaa aacgcccgga ggctcgcaac 900 aaatactata agagtgacag gaggttacat gctcgaaaga gtacacggca gaacaacatg 960 gcagacgttt tccacagagc aatggattcc tcagatcctc ttgtgtcgag catctatcta 1020 acaaaaagat tgggaggaaa ataaactacc cctaccagca gacgtaatca aacttttgaa 1080 attaccaaac tkgctaaatt ttgttgacat gaacaataat aacaatgttc ccgattttga 1140 tgctaacaat agtaatagtg attcagattc tgactctgaa atccgttttc aatagatctt 1200 gatgtagaac acgactttta tgaacatttt taaatgtaaa atacttattt atagtcattt 1260 gaatgcattg ttatttttta gttattaata ttaaggttgt gttaaaaaaa attaattaaa 1320 aacatttgct gaatttaacg agtgaattta agattaagag gaaacctatc tcatatgggc 1380 ggctagggaa ggcgtggtca gatcggttga aatttggata tgtttttata tgcatcgaca 1440 ctatcatatg tacgaaattt caagaaccta gctttaaaac tttaattttc gtcgaaaaat 1500 cggctcgctg gcccaatgtg c 1521 // ID Gypsy-195_AA-I repbase; DNA; INV; 6928 BP. XX AC supercont1.68; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-195_AA_; KW Gypsy-195_AA-LTR; Gypsy-195_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6928 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.68; Positions 2010395 2017322. XX CC Positions [4838-5314] - Integrase core CC 'GTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 600..2579 FT /product="Gypsy-195_AA-I_2p" FT /translation="MESKYASQYRSMNVNHLLDDELEYELVLRKVKFSSGE FT SRDIKRRKLRGAMKEERDANTFASRSIAEEARESESRLVEEKITMIRDSLE FT TRKAKKSEFPQLETRLVHLYFRLQRLKKVFNIDGLEQLVHRLLNEYFSCRS FT RDPEEIGEPSTRYIPRQSSQDRERVESEEDVSNPRTEDESKDDSEKDEETL FT SDELDFDVRNVRSRRSSTPKGPENKVVNADELVERIMQQVDKMITSKLKAL FT NLAGVEQSSNSENKSRQMRKKRIPQALSVEKSEQSKIVSNQRGARRVVKKS FT DEIASVSSDAISDRDIGTDLDSEVVESDIDREDVERGKQLKRRPRPVCDWK FT LKYDGRDDGRGLNKFVSEVEFMAEAENISKRALFNEAIHLFSGEARAWYIE FT GKRNREFRNWTEMIAELKLEYQPPDMDYLYEQQAAARRQKRGEKFQDYYNA FT IKEIFDQMAVPPTEQRKFDIIFRNLRSDYKNSLLVKGVRTLRMLKVWGRKL FT DSANWFLYRKSEGDQGTKSAQIHEISQGSGPRRYQPPRNDWKDNGFKGRSE FT WRNRNAGAYQGPSKTDNGGFRKPTFDQSKDNQQKPKQSTSNADPRVGNSEE FT LLRKRILAYRIPERTVCFNCRGNYHHSSACLKPKEIFCSTCGFQGFRKPDC FT PFCAKNGKQSM" FT CDS 2702..5707 FT /product="Gypsy-195_AA-I_1p" FT /translation="MGFHIIGLLDCGAQMTVLGVGCGKLLRDLKLKLHPTG FT LKLITAGGSQMEVTGFVNLPMTFNGKTKIVAAVVAPNLNRRLILGMNFWKS FT FNIEPTINQDECLVEEVEAGLEEEPCLTAAEKQLLDKVKTEFKVFDEGSKL FT EVTPLITHRIEFEEQFKNAGPIRLNPYPWSPEIQRSVNEELDKWIASGVVE FT RSNSDWALLIVPVTKKGESSAEGRIKVRMCLDARKLNDRTRRDAYPLPHQD FT RILGRLGASKYLSTIDLSKAFWQIPLDPESRKYTAFRIFGRGLFQFTRLPF FT GLVNSPATLSRLMDHVLGYGELEPNVFVYLDDIVIVSNTFEEHLRSLKEVA FT RRLREANLSINLEKSKFCVPELPYLGFVLSQRGIRPNPDKIEAIVNFERPT FT SVRSLRRFLGMVNYYRRFISEFSEVTAPLTNLLKGKPKIVQWNDEAERAFV FT ILKEKLITAPILACPDFGRPFTIQTDASDTAIAGVLTQEVNGSEHVIAYFS FT RKLTMSQRSWKAAEKEGVAALEAIEKFRPYVEGARFTLITDSSALSFIMNS FT KWKPSSKLSRWSMLLQQYDMFIKHRKGAENVVPDALSRAVEAIDLSKDNWY FT SSLYKKVLESPDDFPDFKIEEDKLYKFVASPTDAMDYCFEWKLCLPEDMRQ FT DVLKQEHDQALHPGYEKTIHRLKVRYFWPRMAIQAKKYIQSCSVCRQCKPS FT TMPTAPAMGKQRLTNKPFQILALDFIQNLPRSRSGKSHLLVLMDLFSKWTL FT LIPVRKIESKQVCQIVEDCWFRRFGTPEVVISDNATTFVGKEFQALLARRG FT IQHWPNSRHHSQANPVERTNRTINACLRTYMEKDQRLWDTKIAEVEELVNT FT TIHASTGLSPYRILYGHEKITKGDEHRLEREDREVPVEERDEARRKLNEKV FT FKIVEENLRKSHEKSKKIYNLRHRKYCPTYQIGQKVYKRNFKPSSAAEKYN FT AKYGPVYTPCVIVAKRGTSSYELADNSGKVLGVFSGADLRPGENYQS" XX SQ Sequence 6928 BP; 2178 A; 1248 C; 1700 G; 1802 T; 0 other; attggcgacc aacgttaaag tcgtgaacgg taatcggagg aacgattagt atttaaaaaa 60 aatcttgaga tttttttaac tgggcgacag gtgagtgcgc aatcaatgag ttgattgtgc 120 agtccgacac ttgataaccg ctaatcacag tccgcgagtt ggtagattga agtgggaaga 180 cattggcctt attcggtttc ggaagagtac ggaaaggaag aaaagttggc acaaaagcag 240 taaagatttg attgatttcg gagacagaga catattttgg agttggtact ggtaactaac 300 ctagaaagga cttacactga aacaaataca ctttcacact ttacaaaggt cctgaatgaa 360 ctaaattaca ctatacaaaa caacacaagg atagcacggt ttattgaatt gtttttgaat 420 ttttcttttt cattatttct tattttctag cttaagattt gaattttatt atttgtcgtg 480 tgaatttgaa tttattgtct aatttctaag tgtatatatt ttgaattcgt gaactttatt 540 gaactttagt gaactttatt gaattactct tgctttacat aaatctgatt ttaaatacga 600 tggagtctaa atacgctagc caataccgtt cgatgaacgt gaaccatctg ttggatgatg 660 agctggaata cgagcttgtt ctacgcaaag taaagttctc gagtggtgaa tctagggata 720 taaaacgaag aaagttgaga ggtgcaatga aagaggaacg agatgcaaac acattcgctt 780 cgcggagtat agcagaagag gcccgtgaat cagagtcgcg attggtagaa gaaaaaataa 840 cgatgattag ggattcatta gaaactagga aggcaaaaaa gtcagaattt ccacagttgg 900 aaacgcgtct ggtccacttg tacttccggt tgcagcggtt aaagaaagta ttcaatattg 960 acggtttgga acagctagtt cacagattgt tgaatgagta tttttcgtgc cgtagcaggg 1020 acccagagga gataggagaa ccgtctacaa ggtatattcc gagacagagt tcacaagatc 1080 gcgagagagt agaaagtgag gaagatgttt cgaacccgcg aactgaagat gaaagcaagg 1140 atgatagcga aaaagatgag gaaactcttt cggatgagtt ggattttgat gtgcggaacg 1200 tacgtagtag gaggtctagt acacccaagg gtcccgagaa taaggttgtc aacgcggatg 1260 aactagtaga acggattatg cagcaggtag acaaaatgat cacgtctaaa ctaaaagcgt 1320 taaacctagc gggtgtagaa cagagtagta attcagaaaa caagagcagg cagatgagga 1380 aaaagcgaat tccacaggct ttgagtgtgg aaaagagcga acaatcgaag atagtcagca 1440 atcaaagagg ggcgagacga gtagttaaga aaagtgatga aatagcttcg gtttcgtccg 1500 atgccataag tgatagagat atagggaccg atctagactc agaagtagta gaatcagata 1560 ttgatagaga ggacgttgag cgcggtaagc aattgaagcg aaggcctaga ccagtctgcg 1620 attggaaact gaaatacgac gggagagatg atggtagagg gttgaacaag ttcgtgtctg 1680 aagtagaatt tatggcggaa gcggagaaca tcagcaagcg agcattattc aacgaggcaa 1740 tccacttgtt ttcgggtgag gcgagggcct ggtatataga gggtaagcgc aatcgggaat 1800 ttcggaattg gaccgagatg atagcagagt tgaagctgga gtatcaaccg ccagatatgg 1860 attaccttta cgagcaacaa gcagcggcca gaaggcaaaa gagaggtgaa aagtttcagg 1920 actactataa tgctatcaag gagatttttg accagatggc agtcccacca acggaacagc 1980 gaaagtttga tataatattc cgaaacctcc gttcggatta taagaactcc ttacttgtta 2040 agggagttcg aacgttgagg atgttgaagg tatggggacg aaaattagac tcagccaact 2100 ggtttttgta ccgcaaatcc gaaggtgatc aggggacaaa atcagcacag atacacgaaa 2160 ttagccaagg ttcaggtcca cgtagatatc agccacctcg gaatgattgg aaggataatg 2220 gtttcaaggg aaggtcagag tggagaaacc gaaatgctgg tgcgtatcaa ggtcctagta 2280 agactgataa cggaggattt aggaaaccaa cgtttgacca gtccaaggat aatcaacaaa 2340 agccaaaaca atccacctca aatgctgatc ctagagtagg taacagcgaa gaacttttaa 2400 ggaagagaat tctagcttat cgaattcctg agaggacagt ttgttttaac tgtcgaggaa 2460 actaccatca ttctagcgcc tgtctgaaac ccaaagaaat cttctgttca acttgcgggt 2520 tccagggctt cagaaaaccg gattgtccgt tttgtgcaaa aaacggcaag caatcgatgt 2580 gagaggtcgt cgatgcagca ataaaaagcc tcttaccaag ttcgtttcaa tcgctcagga 2640 aattggggag cttattgtac aggtagatgg ggacaatcgt ccttttgcga aagtagatgt 2700 tatgggattt cacataattg gtttattgga ttgcggagcg cagatgacgg ttttaggggt 2760 tggttgcgga aagctgttac gagatcttaa gctgaaattg caccctacag gcctgaaact 2820 gatcactgct ggagggtccc aaatggaagt aaccggtttc gtaaatttgc ccatgacgtt 2880 taatggcaaa accaaaattg tcgcagctgt agtagcaccc aatttgaatc gtcgtcttat 2940 attgggaatg aatttttgga aaagtttcaa cattgaaccc acaataaatc aggatgaatg 3000 cttggtagag gaggtagaag caggactgga ggaagagcca tgtttgactg cagcggagaa 3060 acagttgtta gataaggtta aaacagagtt caaagttttt gatgaaggga gcaagttgga 3120 agtaaccccg cttattaccc atagaattga atttgaggag caattcaaaa atgcaggacc 3180 cattcgatta aacccgtatc catggtcgcc agaaatccaa agaagcgtta atgaagagct 3240 ggataaatgg attgcttcgg gcgtagtgga acggtctaac agcgattggg ctttgttaat 3300 tgtccctgtg actaagaaag gcgaatcaag tgcagaaggt cggataaagg tgcgaatgtg 3360 tttggatgca cgaaagttga atgacaggac tcgtagagat gcctatcccc ttccacatca 3420 ggatcgtata cttgggcgat tgggagcatc caaatatttg tcgacgattg acttgtcgaa 3480 ggctttctgg caaataccgt tggatcccga atctcggaag tatacggcat tccgaatatt 3540 tggaagggga ctgttccagt tcacccgact cccatttgga ctggtgaata gcccggcaac 3600 cctatcccgg ttgatggacc atgtattagg gtatggcgag ctggaaccaa acgtgtttgt 3660 ctaccttgac gatatcgtca ttgtaagcaa cacgtttgag gaacatctcc gaagtctgaa 3720 agaggtagcg agaaggctta gagaagccaa tctctcgatt aaccttgaga agtcgaagtt 3780 ttgtgttcca gaacttccgt atttggggtt cgtgctgtca caaagaggaa ttcgtccaaa 3840 tccagacaaa attgaggcca tcgtaaactt tgaaaggccc acgtctgtgc gatcgttaag 3900 gcgctttttg ggcatggtga actactatcg gcgtttcatt tcggagttta gcgaagtaac 3960 ggctcccttg accaacttgc tcaagggaaa gcccaaaatt gtacaatgga atgatgaagc 4020 tgaaagggca ttcgtcattc taaaagagaa acttatcaca gcgccgatat tagcctgccc 4080 agactttgga agacctttta cgatccagac ggatgcaagc gatacggcga tagcaggagt 4140 gctcacacag gaggtaaatg gaagtgaaca cgtgattgca tacttctctc gtaagctaac 4200 tatgtcacaa cgatcctgga aagctgctga aaaagaaggt gtggccgctt tggaggccat 4260 agaaaagttc aggccttacg ttgagggagc tcggttcact ttaatcaccg attcatctgc 4320 tctttcattt ataatgaatt caaaatggaa accgtcatcc aagctaagtc gctggagtat 4380 gctgctacag caatatgata tgtttatcaa acatcggaaa ggcgcagaaa acgtggttcc 4440 agacgcactt tcgcgggctg tcgaagcgat cgacttgagc aaggataact ggtattcttc 4500 gttatacaag aaagtattgg aatctccgga tgattttcct gatttcaaaa tagaagaaga 4560 taagctttat aagtttgtcg catcacccac ggatgcaatg gattactgtt tcgagtggaa 4620 actttgcttg cctgaagaca tgcggcagga tgttcttaag caggagcatg accaagcatt 4680 gcatccggga tacgaaaaga cgatccatcg attgaaagtt cgttatttct ggccaaggat 4740 ggcaatacag gctaagaaat acatccaatc atgtagcgta tgcagacaat gtaagccttc 4800 gactatgccc acagcaccgg ccatgggtaa acagaggttg accaacaaac ctttccagat 4860 tttggcactt gattttattc aaaaccttcc tcgcagccgc agcggaaaaa gccatttgtt 4920 agtattgatg gacttattct ccaaatggac tttgctgata cccgtgagga agatagagag 4980 caaacaagtg tgccaaattg tggaagactg ctggttcaga cgatttggaa cccctgaagt 5040 cgtcatctca gataatgcga ctacattcgt cggcaaggag ttccaagcac tgttagcgcg 5100 aagaggaatc caacattggc ctaattcacg ccaccatagc caggcgaatc cagttgaaag 5160 gacaaaccgg accatcaatg cctgcctacg cacctacatg gaaaaggatc aacggctctg 5220 ggataccaaa atcgctgaag tggaggaact ggtaaacacg accattcatg cctcaactgg 5280 tctctctccc tatcggattc tctatggtca tgagaaaatc accaagggtg acgaacatag 5340 attggaaaga gaagatagag aagtcccagt tgaggaacga gatgaggcgc gtcgaaaatt 5400 gaacgaaaaa gttttcaaaa ttgtagaaga aaatttaagg aaaagccacg aaaaaagtaa 5460 gaaaatctac aatttgagac atagaaagta ttgtccgact taccaaattg gacaaaaggt 5520 ctacaaacgg aacttcaagc cgtcatctgc cgcggaaaag tataacgcga agtacgggcc 5580 tgtatacacg ccgtgcgtga tcgttgccaa acgcgggact agttcatatg aactcgccga 5640 caactccgga aaggtccttg gagtattttc cggtgcagac cttcgtccag gagaaaatta 5700 ccagtcctaa aaagggaaaa taaacactta ttaaacgatt cgataaagag cgagcataat 5760 gattacgaat ccagcatgct gcattatggc tctcatcttc gatcggttgc aaacagcaag 5820 ggtcatcagt cgaccagttt cgctgaactc tcactgtgtt ggatgataac atgagagagg 5880 agacggatat ctcatgggag aaaccgtgtc tcgagagagt ggttactctg agttccaatg 5940 ccgaacgttg tatgctggga ggatcgaaag agttcaatta gatttatgta gtacagtgac 6000 gtcatagaag ggttagcgat agagatagtt gaggaaggaa gaaaattgtt cttttgaata 6060 ggcgccatac aaaagaaccg tgctagcacg catagatgag gattgctttc ctgaaatgga 6120 gagattttta agttttcgca tgttttatgt agtattttta ttattttttt tgtttttgtg 6180 tcgtagattt tgattataag atttgcatgt gtaataaaaa aaacgatcgc gagcaagaga 6240 acggagcaca cgtaccttga acaatagcca taatcaaata ggagcatcca ttgtaaagac 6300 tataggctcg ttcccagtat ttttccgccg tgatcaggac ggccgcctga tcgaattatg 6360 ctgggaataa tcctcataat tatacaccat atcgtaggcc atattttaat ttgttctgta 6420 tccgtttgat tcaccatagt tttaatgtgt ttgcgtccag aattttatac ttgttgttac 6480 gttgacaata atttttatag atatactgtc ttccaaacgt gtttgctcgt acagcgtgtg 6540 aaatcaactg atcaaccttc tcaaattcat tacctaccgt tggtagggcg ggatgaaatt 6600 gagtaacgga tgaaccaaaa tgatcgttct actataaggt gttcgtgctt atctacatgc 6660 atactaacag gataaaatca cacgtgaata attatggcaa attctttagg ggaatttgtc 6720 cattagttgt cagctgattt taaaatgttg tgttatgtcc aagtaaattt taatcatttc 6780 tatactgctt cgcgatccgt aaaaaaaata tgtagtagaa aaaattggca tttgtctttg 6840 tgtttcaatt tattgataag tgtattgtgt aggtaaaaat ataaaaataa ttgaagtact 6900 tcaattattt accgtggcgg gaggatag 6928 // ID Nimb-2_CQ repbase; DNA; INV; 5981 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Nimb non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; Nimb-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5981 RA Kojima K.K. and Jurka J.; RT "Nimb non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 582-582 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 771..2078 FT /product="Nimb-2_CQ_1p" FT /translation="MAALNPLPPGDPPDGNAGPSGTNVNARTMPEWMDRQG FT MFGQRIVLLLQPTGTSMLPNNPFVIGKSIEKTCGKIESANTEENGTKYALK FT TRSVDQARKLLAMTKLIDGTKVEVVTHPRQNVCRCVVVCREAIELSEDELV FT TELSAQGVVNARRFTKREGNNRVNTATVCLTLNGTVVPKHIWFGPLRITTR FT LFYPSPMLCFNCSDYGHTSKNCKKEAVCFNCSLQHTVEPSEGGRCILDPYC FT KHCKGAHPATSKECPKYKKEVAIIKIQVETGVTFAEARREYEKRNQAHGLP FT THAEVVGAQKRLEAIGKDNEKDNEIRLLKEEIEKLKNTAMDKNKDEVITSL FT RYELEQTKRLSSALMKKLRENKFAKNDSEIESDKDSSEKEGSPKQAKFAKG FT QAGKLQDNTSSQPEKNPRRGRGRPRKTPTLEPGKFDAEMHST" FT CDS 2084..5908 FT /product="Nimb-2_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MDLSTTNKTKDKEPAMNKNYSNIISWNTQGLRTSKPN FT LKILCSKINPSVLCLQETMCSSKDQANINGYSIYYRPRAGGDRAAGGVLVG FT VKPELDSEEIDLDTNLEAIAVKVGPPFNLTIINTYLPPGQAIQASEIVHLI FT NQVPQPRLLVGDINAHHPLWGSETFSARGLMFEEVLNECNLVVLNKTDPTH FT ISVATGTLTSIDVACCSTEIIDQLDFSVLQDAYGSDHLPLLLTLPDTAHAI FT RTRPKWKIEEADWETFQTTVQFPLMPTATQQISSITQKILEAAEQTIPKTT FT GMVNKRAVPWWNQEVASAVKSRKKALRALQSAKRNNLENQEALAAEFRAAR FT SHARKVIQEAQRTSWINFVNDFTVQTPVKEVWENFRRIQGKRKTNRISAIT FT AEGGTVSKDEDIAEALASSFASISSNDAYSPEFRAFKDQVEQTPLSIPADT FT SAEYNKPFLFTEFEEATAGLRGSSPGPDTVHYSMIVKLPLDCKRQLLDAYN FT RLWLGSVYPPEWTESLVIPIFKNCGEKANPRNYRPIFLNSCLGKVFERMVN FT NRLVHIIETRRLLHPHQFAFRKGKTTVDHLAELEKVVRAAWNKKEYVQGIF FT LDVTKAYDTTWRRLVLNQLRDWNIEGLMLKFLDRMLENRSFRVFVNGQLSQ FT SKIMETGLCQGSVLSVTLFLVAINTLVARMPPSITTLLYADDVVLLASGRD FT VEEVENDLQAALKAVECWQSSTGFKISAEKSATVIFRSYGTRKPPSRAALE FT LNGTLIPTKKQHRCLGVILDQHLPFKTHCEEVKAACRQRVQLIRCVARRSW FT GGDRKTLIKLYRATVLEKILYAAPITAAVSDNVLKILEPTHNAGLRAICGA FT FRTSPVDSLQAETGIPSLRVFFEQRTAIYAARKAAISAQSRTQPAEDDSQS FT AGSSTESSYDSNSSGEEWNSTRHRGPLRGVETAETRGKAILEELELPLPEL FT KIFTLPLCPPWERRRIRIDKTLLEAERAGATSTQLKELFVSRRNTEYRLCE FT TIFTDGSKKDGRVGYAMVRGDLVVRRRISDLSSIFAAECGAITEALRWIIG FT QNRVGTYLVCTDSLSAITALGKRKTKCRWKDEINILHNQAECNGTEIIYMW FT VKSHVGIAGNEKADEEAKQSLNDRNIWDRSVEFKEFRTVIKKRTVWRWNAE FT WSRKVGNKLREVKNSVLPYRDVFVGSRKEDVILTRLRIGHTLLTHQYLLEK FT DSAPRCTRCNLALTVKHILAECPEFEEQRRRAGVPSNVREALADDRDMAKN FT VLKFLKATDMYGKV" XX SQ Sequence 5981 BP; 1851 A; 1393 C; 1530 G; 1206 T; 1 other; acgaggtttg actgtaaagg tctatagtta taccaagtct tgagtctcgg attagtatcg 60 cttttctagc gtaaatcgga tcggttttcg aatagttttc gtatcggtcg aagcataagt 120 gaacggacaa tcggtggaat atcgagtaca gcagcaaaac tacggattag tgcgaatatc 180 gagcagaatc tggattgttg atattcgaac cgaacagacg acggagcaag tgctagatca 240 gcagagtggt gttaccaaga gagagagcga aagctagcac cgcaagcttg cctctttaaa 300 cgtgcaggca agctttctac tgctacatct gctagcttct actggatctc catcgtcaac 360 agctcgtttc attggaaacg cacctgtcgt ccgttggtgg agaaacgcac atcaactgaa 420 ctgcagtata cggactacac tagcgtgtcc caaaaagtga gaagctgaaa tcgaactaca 480 cggctggacc ttctaacgct tggtcgttgt agagtgaata gggtaaagcg tagtcgtctt 540 ccgttccgac ggggccgcaa aacaggccac ctggacctta acccttgggt gggtctgaat 600 tgaggtaaaa gggtagtcgt cacttgctta ggcggtgtcg caaaaccgat cacctggacc 660 tttaaccctt gggtgaatct ggtgtgaagg tgaaagggta gtcgttcctt gttcagacaa 720 gaacgcaacg ctgtactggt tctaaagtga gctgcaacgc tattgcggcc atggccgcgc 780 tgaacccact ccctcccggg gatcctccgg atgggaatgc tggacctagt ggtacaaacg 840 taaacgcgag gacaatgccg gaatggatgg atcgacaagg aatgtttggg caacgtattg 900 ttctgctcct gcaaccgaca ggaacaagca tgctaccaaa caacccattc gtaatcggca 960 aatcgatcga aaaaacctgc ggaaaaatcg agtcagcaaa cacggaagaa aacggtacaa 1020 aatacgcact gaaaactcga agcgtcgatc aagcaagaaa gctgctggcg atgacgaagc 1080 tcatagatgg taccaaagtg gaagtggtta cccaccccag acaaaacgta tgcaggtgcg 1140 tcgtcgtttg cagagaagct atcgaactca gcgaggacga attggtcacg gaactttcgg 1200 cacaaggagt ggtgaatgcg cgcagattca caaaacgaga gggtaataac agagtgaaca 1260 ccgcaacagt ctgcctaact ctgaatggta ctgtggtccc caaacacatt tggtttggac 1320 cgctcagaat aactacaagg ttgttctatc catcaccgat gctgtgcttt aactgcagtg 1380 actatggaca cacaagcaag aactgtaaga aagaagccgt ttgcttcaac tgctccttgc 1440 agcacaccgt tgagccgtcg gaaggtggac gatgcatttt ggacccatat tgcaagcact 1500 gtaagggggc tcatccagct accagcaagg aatgccctaa gtacaaaaag gaggtggcaa 1560 taatcaagat ccaggtagaa acgggagtca cgttcgcgga agcgagacga gagtatgaaa 1620 agcgtaacca agcacatgga ttacccaccc acgctgaagt tgttggcgcc cagaaaagac 1680 tggaagccat tggtaaagat aacgaaaagg acaacgagat tcgccttctc aaggaagaaa 1740 ttgagaaatt gaagaacact gctatggaca aaaacaagga cgaagtcatc acttctctaa 1800 ggtatgaact cgagcaaacc aagagactgt caagcgcctt gatgaagaag ttgcgcgaaa 1860 acaaatttgc caaaaacgac agcgaaattg aatcggataa agacagctca gaaaaagaag 1920 gatcgccaaa gcaagctaaa ttcgccaaag gacaagcagg aaaactgcag gacaacacca 1980 gctcccagcc cgagaaaaac ccccgccgtg gtcgtggaag gccacggaaa actccaacac 2040 ttgaaccagg aaagtttgat gcagaaatgc actccactta accatggact taagcacaac 2100 aaacaaaacc aaggacaaag aaccagctat gaacaagaac tactcgaaca ttatttcatg 2160 gaacacccaa ggacttagaa ccagtaaacc aaatttgaaa atactatgca gcaaaatcaa 2220 cccatctgta ctatgcctcc aagaaaccat gtgctcatct aaggaccagg cgaacatcaa 2280 cggctacagc atatactacc gaccgagagc tggtggcgac cgtgcagccg gaggagtttt 2340 ggttggagtg aaacctgaac tggacagtga agaaatcgac ctggacacca acctggaagc 2400 catcgctgtt aaggtaggtc ccccatttaa cctgaccatt atcaatacat atctgccacc 2460 ggggcaagcc atacaagcct cggaaattgt acacctgata aaccaagtcc cccagccacg 2520 cctactcgtg ggagatataa acgcccacca cccactgtgg ggatcggaaa cgttcagcgc 2580 ccgaggacta atgttcgagg aagtattgaa cgaatgcaac ctggtggtcc tgaataaaac 2640 agatccaacc cacattagtg ttgcgaccgg aactctcacg agtattgacg tggcatgctg 2700 ctcaacagaa ataatagacc aactagattt ttctgtacta caagacgcct acggtagtga 2760 ccatctacca ctattgctaa ctctccctga cacggcccat gccatccgaa cgcgacccaa 2820 atggaagatc gaggaagcag actgggagac attccagaca acagtgcagt tccccctcat 2880 gcctacagct acacaacaaa tctccagcat tacccagaaa atacttgaag cagcggaaca 2940 aactataccg aaaacgacgg ggatggtgaa caagagggcg gtgccttggt ggaaccaaga 3000 ggtggcctcc gctgtcaaga gcaggaagaa agccttgcgt gcactgcaat cagccaaacg 3060 gaacaacttg gagaatcaag aggctcttgc agcagaattc agagcagccc gctctcatgc 3120 acgcaaagtc atccaggaag cacaacggac ctcgtggatc aacttcgtca acgacttcac 3180 agtgcaaaca cctgttaaag aagtgtggga gaacttccgc cggatccaag gcaagcgcaa 3240 aacgaaccgt atcagtgcaa taacagccga aggaggtact gtgtctaaag atgaagatat 3300 tgctgaggcg ctggctagct ctttcgcttc catctctagt aacgatgctt acagccccga 3360 gtttcgcgca ttcaaggatc aggttgagca aactcctctg tcaatcccag ctgacacgag 3420 tgctgagtac aacaagccct ttttattcac tgaatttgaa gaggcaacgg caggtttgcg 3480 tggctcatca ccaggccctg acactgtcca ctattccatg atcgtaaagt taccgttgga 3540 ttgcaaacga cagcttcttg acgcgtacaa ccggctgtgg ctcggttcag tatacccacc 3600 cgaatggacg gaatcgctgg tgatccctat cttcaaaaat tgtggtgaaa aagcaaatcc 3660 tcgtaactat aggccaatct tcttgaacag ttgcctggga aaggtgtttg agcgaatggt 3720 caacaatcgt ctagttcaca tcatcgagac aagaagatta ctgcatcctc accagtttgc 3780 cttccggaaa ggaaaaacta ccgttgacca cttggctgag ctcgagaaag tggtacgagc 3840 agcctggaac aagaaggagt acgttcaggg gatttttctg gatgtgacca aggcgtacga 3900 taccacctgg agaagactgg tcctgaacca actgcgtgac tggaacatag aaggtctgat 3960 gttgaaattc ttggaccgaa tgctggaaaa tcgctcattt agagtttttg tgaacggcca 4020 gctgtcgcaa agcaagatca tggaaacagg cctctgccaa ggatcagtac tgagcgtcac 4080 cttgttcttg gtggctatca acacattggt agcacggatg ccgcccagca taacaacact 4140 cttgtacgca gatgatgttg tactgctggc cagtggacga gatgttgaag aagttgagaa 4200 cgacttgcaa gctgcactga aggcggtgga atgctggcag agctcaacgg gatttaaaat 4260 ttctgcggag aaaagcgcta ckgtcatctt tagaagctac ggaacgagga aaccaccgag 4320 cagagctgca ctggagctga atgggaccct gatcccaacc aaaaagcagc accgatgttt 4380 gggcgttatc ctggatcaac atttaccgtt caagacacac tgcgaggaag tgaaagcggc 4440 ttgccgacaa cgagtccagc ttatccgttg tgtagcacgt aggtcatggg gtggcgatag 4500 gaagactcta atcaagctgt acagagccac agtgttggag aaaatcctgt acgcagcacc 4560 tataacagct gccgtcagcg acaacgtcct gaagatcctc gagccaacac acaacgcagg 4620 cttgagagct atctgtggtg cgttccgtac cagccctgta gatagtctcc aagctgaaac 4680 aggaatcccg agcctgagag tgttctttga acagcggaca gcgatctacg ccgccagaaa 4740 agcagccata tcagcacaga gcagaaccca gccagcagaa gacgactcac aatcagcagg 4800 aagcagtacc gagagcagct acgatagcaa cagttcagga gaggagtgga attcgactag 4860 gcatcgaggt ccgctacgtg gagtggaaac tgctgagaca agaggaaagg cgatacttga 4920 agagctggaa ctaccgttac ccgagctgaa gatattcaca ctaccattgt gtcccccctg 4980 ggaacgcaga cggatccgga tagacaaaac attactcgaa gctgaacgag ccggagcaac 5040 ttcaacacaa ctgaaggagc tcttcgtgtc ccgaaggaac acggaatatc gcttgtgtga 5100 aacaattttc accgatggat caaagaaaga tggcagggtt ggttacgcga tggttagagg 5160 ggacttggtt gttcgaagaa gaatcagtga tctgagcagt atctttgctg cggagtgtgg 5220 tgcgatcact gaagcgctca ggtggataat tggtcagaat cgcgtgggga cctacctcgt 5280 ctgtacagat tccttgagtg ccatcacggc cttgggtaaa cgaaaaacca aatgcagatg 5340 gaaggatgag atcaacatcc tacacaatca ggcagaatgt aatggcacgg agataatcta 5400 catgtgggtg aaaagtcatg taggaatagc aggcaacgag aaggcagatg aagaagcgaa 5460 gcagtcattg aacgacagaa acatctggga cagaagcgta gaattcaagg agttcaggac 5520 ggtgatcaag aagaggactg tttggcgctg gaatgcggaa tggagcagaa aggtgggcaa 5580 caaactacga gaggtgaaga actcggtact accttatcga gatgtatttg ttggatcgag 5640 gaaggaagac gtcattctga caagactcag gataggacac accttgctaa cccatcagta 5700 tctgctggag aaggacagtg cgccgcgttg cacaaggtgc aatttggcgc tgacggtgaa 5760 acacatctta gcggaatgcc ctgagtttga agagcagaga cgccgagctg gagtaccatc 5820 caacgtcagg gaagccttgg ctgatgatcg agatatggcg aaaaatgtgt tgaaattttt 5880 aaaggctact gacatgtatg gaaaagtgta aagattcata aacaggaccc gaaagactca 5940 tagttaaaag gtcctaaata aatacaaaaa aaaaaaaaaa a 5981 // ID hATm-1_HM repbase; DNA; INV; 3852 BP. XX AC . XX DT 15-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3852 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 205-205 (2008). XX DR [1] (Consensus) XX CC This family is closer to the minor "m" branch than to other CC elements. It has unusually long TIRs (>340 bp). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(479..778,845..2101,2178..2477,2543..3145) FT /product="hATm-1_HM_1p" FT /translation="MTPKKKVRSIRLDKIFLVKKLIEKLPAKKFATAKEII FT QYYLFLRKSKSEAKVKFLVGCPNKSSSFIPNCPVDQSDCCILSKLMVPWIL FT GGFKVLTIQKLRYFYRIKIEKLLKEWMKLKDQRYRTNPAEIKKRHLFEEQM FT NKVFWIGEHPNVMFEEILKDKMRKVKDKEEDIAFLKDQLGERKGCLGGRDK FT RYEYSVIRSKRSTSKFNKNPSAELNKNDXDYLSSASYTSDELGEETDIDFD FT FNEHPNVQNSVLLPKNILQSTAHTAVGEGITPHQHTALISSVIVMAGGSVA FT NFNCSKSTSYRAAENIISARAKEIREYIKNIVISSDSLMTIHFDGKIVNEL FT TERKQFKKDRIAVLVRIDEKTELLGIPPIDSSSGVNQKQAIVELMEFFGLK FT DKVKCLCFDTTSANTGRWQGTCMKLIQQVQRPLLLIACRHHVIERHIAHFW FT KIYPSSETSGPENKLFEVLKQNWNSIDAETKELKRIIIPPGTWIYNQKLAA FT VQFCNNVISLKIFKKVIFLFLLIDRNIRKEYHELAELTLMVLSSNRFKFHL FT PGPVHHSRFMGKGLYYLKLYLLMNEIQNLTDKHKEEITGMALFISLFYTEW FT FLKAELSSLAPTQVYSLLVFYDIKAYWQMKRFEEWNLLGSQAVASSILRHT FT WYLDPTLVVFALADKECRERGDMAKKLYSLQRCPSESFPLERQVMDQAILN FT SLNFSKDEPPSLTPLITEKSWLVFDMLRHGKAETQWMKTPPEFWKFNDFYL FT EFRATVKGLDVVNDCSERAVKLVQDLINRAYSEEKRQDTFLFTNSYKKNRK FT GKKKIDYEKAASTNINKV" XX SQ Sequence 3852 BP; 1465 A; 501 C; 599 G; 1279 T; 8 other; ttagggtgtt tcaaaaatag gtcatgtctg aaaaaaaaaa agagccctag cttatcatct 60 gcccagacca tgatttagta ctaaaaattt tttttttttt ttttttgaaa actctaaggg 120 ttccttcagg gggtaaaagg ggcaattttt agggcatttc gtaccatttt taaaactgtt 180 tgtaggaagt cttgacatta attttttttt tttttaatat atatttatgt ttgtaaagac 240 ttatttaatc taaataacat gggctctatt ctcaaaaaaa aattaaaaat ttataaaaat 300 tgcttcaaaa aaccctaaaa tycataaata tttaacaaaa aaaaatattt ttacattgaa 360 catagtttta catttttaaa ataaagtttt aaattttttc agtaattaat atgttttctt 420 gaaatatatt ttatattaaa tttttatttt tagattatat atttaaaaaa ttaataaaat 480 gacaccaaaa aaaaaagtta gaagtattag attggataaa atatttttag ttaaaaagtt 540 aatcgaaaaa cttccagcta aaaaatttgc aactgctaaa gagataattc aatattatct 600 ctttctacga aaatcaaaga gtgaagcaaa agtgaagttc ttagtaggtt gtcctaacaa 660 aagttcttct tttataccta attgtccagt tgatcagtct gactgctgca tactgtccaa 720 actaatggtt ccttggatac ttggtggatt taaagtgctt acaatacaaa agttaaggta 780 atcttttcaa taagttctaa atcattttag tccatgttta atccataatt atactattaa 840 ataatatttt tacaggatta aaattgagaa attgttgaag gagtggatga aattaaaaga 900 tcaaagatac agaacaaatc ctgctgaaat aaaaaaaaga catttatttg aagagcaaat 960 gaataaagta ttttggattg gagaacatcc taatgttatg tttgaggaaa tattaaaaga 1020 taaaatgaga aaagtgaaag acaaggaaga agatattgct ttcttaaaag accaacttgg 1080 tgaaagaaaa ggatgtcttg gaggtagaga taaaagatat gaatattctg taataagaag 1140 taaaagaagc acatcaaagt tcaataaaaa tccgtcagca gaattaaata agaatgatty 1200 agattatcta agttctgcta gttacactag cgatgaatta ggygaagaaa cagatattga 1260 ctttgatttt aatgagcatc ctaatgttca aaattctgtt cttcttccta aaaatatcct 1320 tcagtcaacc gctcatacag cagtgggaga aggtataaca ccccatcaac atacagcatt 1380 gattagtagt gttattgtca tggctggtgg ttctgtagca aactttaatt gttctaaatc 1440 cacatcttat agagctgcag agaacataat atctgccagg gcaaaagaaa taagagaata 1500 cataaaaaac attgtaatat catcagattc actaatgact atacactttg atggaaaaat 1560 agttaatgaa ttaacagaga gaaagcaatt taaaaaagat agaattgcag ttttggtaag 1620 gattgatgar aaaacagaac tacttggcat tcctccaata gattctagtt caggggttaa 1680 tcaaaagcaa gctatagtgg aattgatgga gttttttgga ttaaaagata aggttaaatg 1740 tctctgtttc gacacaactt cagcaaatac aggaagatgg caaggtactt gcatgaaatt 1800 aattcaacaa gttcaacgtc cattgttgtt gattgcatgt agacatcatg taattgaacg 1860 acacatagcc catttctgga agatttaccc aagcagcgaa acttctggtc cagaaaacaa 1920 actatttgaa gttttaaaac aaaactggaa cagtatagat gctgaaacta aggaactgaa 1980 aagaataata ataccacctg gaacatggat atataatcaa aaattagctg ctgttcagtt 2040 ttgtaataat gtgatcagtt tgaagatctt caaaaaggtg atatttttat ttttattaat 2100 ttgatttcta ttctaacttt ctaactttta aaaagtaaca tttattacaa atttaaaata 2160 tatcaaaaac attttaggac agaaatataa gaaaagaata ccatgagcta gctgaactta 2220 ctttaatggt tctatcatca aataggttta aatttcattt accaggacca gttcaccatt 2280 caagatttat gggaaaaggt ttatactatt taaaattgta tcttttaatg aatgaaatcc 2340 aaaacctaac tgacaaacat aaagaagaaa taactggtat ggctttattc ataagccttt 2400 tctatacaga gtggtttcta aaagcagagt taagttcttt ggcaccaaca caggtataca 2460 gccttttagt attttattaa taagttagaa agctcattat tgattaaagt gattcaaaat 2520 gttaattaat attatttatt aggatataaa agcttactgg caaatgaaaa ggtttgagga 2580 atggaatcta cttggctctc aagctgttgc tagctctatt ctcagacata cttggtatct 2640 agatccmact ttagtagttt ttgccctagc agataaagaa tgtagggaac gtggtgatat 2700 ggccaaaaaa ctgtatagtt tacaaagatg tccaagtgaa agttttcctt tggagcgcca 2760 ggtaatggac caagctatat tgaatagcct caacttcagt aaagatgagc ctcctagttt 2820 gactccatta ataacagaaa agtcgtggct agtttttgac atgcttaggc atggaaaggc 2880 agaaactcaa tggatgaaga ctccaccaga attttggaaa tttaatgatt tttatctaga 2940 atttagagca acagttaagg gacttgatgt tgttaatgac tgctctgaga gggcagttaa 3000 gttagttcaa gacctaataa atagagcata ttctgaggag aaaaggcagg acacctttct 3060 ctttacaaat agttacaaga aaaacagaaa aggcaagaaa aaaatagatt acgaaaaagc 3120 cgctagtaca aatataaata aagtctagtt tcttatgtac atttttttgt ttataacaat 3180 ttttcaagaa agaatagaga ggatcataac tttactagga tttctgtatt ttaatttaaa 3240 agaaacttta ctattgattt raaatttttt atarttttat acttttaatt ttttttaagt 3300 ttgagaacaa accgtatggt tttaactgtt gattacaaaa aatcttcatt ccctgtccag 3360 gccctccctt taatatattt ctcaaartaa cccctgaaac tgattataaa tgagaaatca 3420 atatagtact gtattaaaag taaataaaaa cataatgtaa tttttaggtt ttaaatataa 3480 ttttaaaata ttttgtactt ttataagctt tactaaatat ttatgaattt tagggttttt 3540 tgaagcaatt tttataaatt tttaatattt ttttgagaat agagcccatt ttatttagat 3600 taaataagtc tttacaaaca taaatatata ttaaaaaaaa aaaaaaaatt ttaatgtcaa 3660 gacttcctac aaacagtttt aaaaatggta caaaatgccc taaaaattgc cccttttacc 3720 ccctgaagga acccttagag ttttcaaaaa aaaaaaaaaa aaaaattttt agtactaaat 3780 catggtctgg gcagatgata agctagggct cttttttttt ttcagacatg acctattttt 3840 gaaacaccct aa 3852 // ID Copia-2_CQ-LTR repbase; DNA; INV; 211 BP. XX AC AAWU01028584; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_CQ_; KW Copia-2_CQ-I; Copia-2_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-211 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 320-320 (2011). XX DR Genome; AAWU01028584; Positions 6255 6465. XX SQ Sequence 211 BP; 58 A; 49 C; 56 G; 48 T; 0 other; tgttgaggat acaaccctgt cgtctaggtg tgccgagtag tgcgccttca ctgtacacag 60 gaagtgtacc gcgagagaga gagagagagc tgtgaccgag ctgtcactgc aaaacagcca 120 ttaccatttg tgatcgagca gaataaacac gtgtgtgaag taaaaccgcg ttttattccg 180 tgtgttccgt tccgcaagaa tccgaaaccc a 211 // ID Copia1-I_Dmoj repbase; DNA; INV; 4030 BP. XX AC scaffold_6680; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1_Dmoj; KW Copia1-LTR_Dmoj; Copia1-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-4030 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1018-1018 (2009). XX DR Genome; scaffold_6680; Positions 20014551 20018580. XX CC 'CTTCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 121..3309 FT /product="Copia1-I_Dmoj_1p" FT /translation="MSSMYNIEKLDDSNYDCWSVMVESVLVHQNLWAVVSN FT EFVKPEDETSEAFKTWKRENDKAKAVLILSMSASQVNYVKRCIYAKDVWNE FT LKKLHQSSGPVRRVTLFKQLLNKKMAEGENLQTYIMEFSNIVENLSAIGIT FT LQEELIVIMLLASLPSSFENFVVALEARDELPNFSSIKCKLTEENERRKIS FT DQKEATQSFSARQTGQGGRERGRNETTDRRRYFKCYKCGQQGHYAANCTTK FT EANEKYGKERRGKGNEKSERENGKNRDAHSNQRSFAGMMGNYRMSKDAWVI FT DSGATSHMCARRDLFDNFEQHNESIELAGNNFTTACGRGTVVINSLCGFEI FT TLKDVLFVPGLQCNFISVGKAVANGCSVEFNSGFCKVFDSENIVILIAKRI FT SGLFLCETKNNEKLCLSKGADDIKKWHARYGHLNVQSLRELKSKDMVRGLA FT VSSDIKHFDCINCLKSKCHARPFEDSKNRSKQICELIHSDVCGPIRQVSNG FT GARYFLTMIDDKSRYIHVYFMKQKSEVTKIFKEHVTMLERQKGQKVKIIRS FT DNGTEYVNKDFDEYIKLAGIKRQLSAPYTPQQNGVAERCNRTLVEMARVML FT NSAALNESFWAEAVACAAYLRNRSPTKALDGITPFEAFTGLKPVVKHLRVF FT GCTAIALEKGPKKKFQEKGKEYTMVGYSESSKAYRLFDKNTGDVKISRDVY FT FIEDSVRESTEKASATIMVEDDNADQEGIQEDHSSTEEQPDDDADEAEQPR FT RGPGRPKIVRTGLKGRPTKQYNMANVMLSNNIKVPLTVQQAQSSPQADEWR FT AAMQEEYNALMCNDTWDLVQLPHGQKPIGCKWVFTLKTNSSGEVERYKARL FT VAKGCGQRYGLDFKETYSPVVRYSTIRCILAMAAEYELYVHQLDVSTAYLN FT GILEEDIYMQQPELYDDHSGCVLKLKRSLYGLKQAGRVWNTKLDGVLTKMG FT YRACESEPCLYTKCSDDQTINLIAVYVDDMLLACSSMGEMLAAKKAISSEF FT QVVDKGQVTNFLGFEIQREGRTGAIRVSQRKHIVGLLNEMGMSNCRTTSTP FT LEANFQ" XX SQ Sequence 4030 BP; 1345 A; 683 C; 1032 G; 970 T; 0 other; ggttatgggc ccagggctta tgtatgtgtg caacgaaata gtgtaaagca gagagaagaa 60 agtgagctcg agaagtttct gtgcataacg gacattgcaa gcagaaaagt aaagctgaaa 120 atgagttcca tgtacaatat agaaaagctg gatgacagca actatgactg ttggagtgtc 180 atggtagaaa gtgttttggt tcatcaaaac ttatgggccg tggtttcaaa tgagtttgtg 240 aagccagaag atgaaacgag tgaagcgttc aagacatgga aacgagagaa cgacaaagca 300 aaagcagtgc taattttgtc gatgagtgcg tcgcaagtaa actacgtgaa gagatgcatt 360 tacgcaaaag atgtatggaa cgaactgaaa aagttacatc aatcatctgg acctgtgcgt 420 agggtaacgt tgtttaaaca gcttttaaac aagaaaatgg cagaaggaga aaatttgcaa 480 acgtacatca tggagttttc aaatattgtt gaaaatctat cagcaattgg aattacgttg 540 caagaagagt taatcgtcat catgctactg gcaagtcttc catcatcgtt cgagaacttt 600 gtagttgcat tagaggctcg tgatgagttg ccgaatttca gcagcataaa atgcaagttg 660 acagaggaga atgagaggcg aaaaatcagc gatcaaaaag aagctacgca atcattttcg 720 gcaaggcaga cgggacaagg aggcagagaa cgaggaagaa atgagacgac agacagacgc 780 cggtatttca aatgctataa gtgtggtcag caaggccatt atgcagcgaa ttgtacgacg 840 aaagaggcga acgaaaaata cggcaaggaa cggcgcggga aaggtaacga aaaaagcgaa 900 agggaaaatg gaaagaatag agatgcgcat tcaaatcaac gatcgtttgc tggcatgatg 960 ggaaactatc gtatgagtaa agatgcttgg gttatcgata gcggagcaac atcgcatatg 1020 tgcgcacgtc gtgatttatt cgacaacttt gagcaacaca acgaaagtat cgaattggct 1080 ggaaataatt ttacgacggc ttgtggcaga ggcacagtag ttatcaatag tctgtgtggc 1140 tttgaaatta ctctaaaaga cgtgttattt gtaccagggc tacagtgtaa tttcatatca 1200 gttggaaagg cagttgccaa tggatgctct gttgaattta attcaggttt ttgcaaagtt 1260 tttgacagcg aaaatatagt catattaatt gccaaaagaa tcagcggact gtttctgtgt 1320 gaaacaaaaa ataatgaaaa gttgtgttta tcaaaaggtg ctgatgatat caagaaatgg 1380 catgcacgat acggccattt aaatgtgcag agccttcgtg aactgaaaag caaagatatg 1440 gtgcgtggat tggcagtaag ctctgacata aaacattttg attgtataaa ttgcttgaaa 1500 agtaaatgcc acgctagacc atttgaagat tcaaagaata gatcaaaaca aatatgtgag 1560 ttgatacata gtgacgtgtg tggacctata aggcaagtat cgaatggcgg cgccagatac 1620 tttctcacaa tgattgacga taagagccgt tacatccatg tttactttat gaaacaaaag 1680 tctgaggtga ccaagatatt taaagaacat gtaacgatgt tagagcgtca gaaaggtcag 1740 aaagtgaaaa tcattagaag tgataatggc acagaatatg tcaacaaaga ttttgacgag 1800 tatattaagt tggctggtat taaacggcag ctgagtgcac cgtatacccc gcagcaaaat 1860 ggtgttgctg agcgctgcaa tcggacactg gttgagatgg caagagtaat gcttaatagt 1920 gcagcattaa acgagagttt ctgggcggag gcagtcgcgt gcgctgcata tctaagaaac 1980 cgatcaccaa cgaaagctct ggatggcata acacctttcg aggcgtttac agggctgaag 2040 ccagttgtca aacacttgag agtatttgga tgtacagcaa ttgcactgga aaagggtcca 2100 aagaagaagt tccaggagaa aggaaaagag tatactatgg tcggatactc cgagtcgtca 2160 aaggcatatc gactgttcga caagaacaca ggtgacgtga agattagcag agacgtttat 2220 ttcatcgagg atagcgtacg tgagtcaacc gagaaagcta gcgctacgat aatggttgaa 2280 gatgataatg ctgatcagga aggtatacaa gaagatcatt ccagtacaga ggagcaacct 2340 gacgatgatg ctgacgaagc tgaacagcca agaagagggc caggtaggcc taaaatagtc 2400 agaacgggct tgaagggacg gccaactaaa caatacaaca tggcaaatgt aatgttaagc 2460 aataacatta aagttccttt aaccgtgcag caagcacaat cgagtccaca agcagatgaa 2520 tggagagcgg caatgcagga agagtataac gcattgatgt gtaatgacac atgggacctg 2580 gttcagttac cacatggtca gaagcccata ggctgcaaat gggtgtttac tttaaagacg 2640 aattcatctg gagaagttga gcgctacaaa gcgcgcttgg ttgcaaaagg atgcggccaa 2700 aggtatggct tagacttcaa agagacgtat tctccagtag tgcgctacag tacaatacga 2760 tgtatcctgg ctatggctgc agaatacgag ctttatgtac atcaacttga tgtttcgacg 2820 gcatacctaa acggcattct ggaagaagat atctacatgc aacagccaga actctacgac 2880 gaccattctg gatgtgtgct gaagcttaaa aggtcgctgt acggcctgaa gcaagctggt 2940 cgggtttgga acactaagtt ggatggcgtt ctaaccaaga tgggatacag ggcatgcgag 3000 agcgagccat gtctgtatac caagtgctca gatgaccaga cgataaacct gattgcagta 3060 tacgtagatg atatgttact ggcgtgttca agcatgggtg agatgctagc tgctaaaaag 3120 gctatatctt ctgaatttca ggtagtagac aagggtcaag tgacgaactt cttaggcttt 3180 gaaatccaac gagaaggaag gactggtgcg attcgcgtga gccaaaggaa gcatatagtg 3240 ggtctgctca atgagatggg catgtctaat tgccgcacca cgtcaacacc attggaagct 3300 aacttccagt aaactgcgag agagagtcgt gtaagcgcgt tgatatcaca gagtatcagt 3360 ctgtgattgg ttcgctaatg tatattgcgc tatgcacacg gccggacatc ttacattcgg 3420 tgtgcaagct ggcacaacgg aattcgaatc cacatagcga acatcttgca gcagcaaagc 3480 aagttttgag gtatctgcac accacagcag atctagcgtt agtctacgag aagacaggag 3540 aggagattgc tggctatgca gatgcagatt gggctggaga ctcaactgat aggaagtctt 3600 tcacaggata tgccttcatg tggggaggtt cggcattttc atggacctcg aagaaacaga 3660 agtcggtggc gctgagcagt actgaggcag agtatatggc gctctctgat gctgcaaagg 3720 aggcgattta tttaagaaag ctaataagcg aaatgaatcc gtattttaag agaaactgta 3780 taagaattta tgaagataat gtaagcgcgt taaatcttgt gaagaatcct atatatcatg 3840 cacgaagcaa acatattgat actaaatatc accatatacg aaattgttat gaaaataagt 3900 tgatcgattt atcttattgt ccatcaaatg aaatgattgc cgatgtatta actaagaatt 3960 tatcaagagt aaaacatatg aaatgtatag cgttaatcgg cctaatgaaa atataagaat 4020 tgaggaagag 4030 // ID Gypsy-2-I_HM repbase; DNA; INV; 3719 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-2-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3719 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1970-1970 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1..3717 FT /product="Gypsy-2-I_HM_1p" FT /translation="MALRVRDDLTLDDPDNAEAWIRCFSASARSKKLKDEI FT NGSYEITDLFMAKAGIEAVKKISLMVYPEELENMMFDDIKTVVMSHLRPQK FT RLIIAERVRFLALKQQNNENIVSYAQRLREASRFCNFEKLGRDGQSAEDDL FT IQMRLIDGLQSSDQRVKALEMIQSGESPKLGACIDFIRQLEQISCFSIQNN FT IENSIDLPSVNYIDKNKERMIKCKYCGLQHPPRKCPAFGKSCKKCGKLNHF FT KNVCKANTKYEKIVEEIENDDASVYSINMNDTNSEISVKINDYNIKMQVDT FT GSQVTIIPKNFWELMSKPKLQKCYLRLKQFDGTVIKVLGEFEATLETESKM FT NIVRIIVADCTKNHGLIGMDVLNINATKLVNSIEPLVHGRLINYKANILLK FT NGVQPTYFEARPLPIHIRPLVVDKLNGMIQQGLLQRVPPGGSRWASPLVVV FT RKTDGDLRICADYKIGVNQKICSDSYPIPNIETVFHKMAGMKYFAKIDLKG FT AYHQIEMDKEAQEITTVNTPIGLLRWTCLPFGIKTASAIFQRAIESVVGDI FT IPNMIIFQDDICVGAKNAEELKSNLNKILSKLKESGMSINKRKSVNETKSL FT SFLGYELSEKGVTPDRNLVSKILGISPPNNKKELESFIGLINFYGRHIDKY FT AEKILPLNNLRKKGVTFEWRKEHQQAFELLKRNLCEEPVVKVYDINQDLEL FT TTDASEKAISGILSQNGHPVLYLSRTLSDAETRYSNIEREALAIVWSAHRA FT RHFLLGRKFKLISDHRPLEFIFDSNKELPKVTSARILRWAIQMMAFDYEIS FT YKKGENIPHADALSRLSFLPQDEEKDQETFIHLVESDIVNLEQIIKETEND FT RVLMDIKTRIRKNNWSRCSVTERPYKSIRNKLTIEKGIVCNGDLIVPPKTM FT RKIFIKSIHDDIHCGIMQTRCRLKLEAWWPGYCQDVEEYVKNCEKCAEIKV FT KKERTLHTWPSENKPWCRVHMDHAHVQGIGLFLILVDSFSGWPEVIKVNNR FT EATTVKAVLQSVFSRNGVPEVLVSDNAAEFHDTTLHQWLKKVGCVPYKTPP FT YHPQSNGAAERMVETVKMGLRAYSPDKGLLCGYLSRMLLSYRTIPHAGRSR FT SPSEMMGRQLRSPLTMSFESGTPLWYRQKPDSKPEPASFISQAGLNTAIIL FT RNNSSGTLAHCDQINKRLNEPSILMNNEDSMRYDHTNNEKIDPCNSDKSDM FT SELRVTKEEPNQNRRSTRSTRGQKPIRFR" XX SQ Sequence 3719 BP; 1375 A; 610 C; 751 G; 983 T; 0 other; atggcgctac gagtaaggga cgatttgact ttagatgacc cagataatgc agaagcatgg 60 attagatgct tttcagcttc cgcacgttca aagaaattaa aagatgagat aaacggtagc 120 tacgagataa cagatttatt tatggcaaaa gcagggattg aagcagttaa aaagatttct 180 cttatggtat atccagaaga attagaaaat atgatgttcg atgatattaa aactgttgtt 240 atgtcccatt tgagacctca aaaacgacta attattgctg aaagagttcg atttttagca 300 ttaaaacagc aaaacaatga aaacatcgta tcatatgcac aaagattacg cgaagcttca 360 cgcttttgca acttcgaaaa attgggaaga gatggccaaa gtgcggaaga tgatttgatt 420 caaatgagat tgattgatgg tctccaatct tctgatcaga gagttaaagc tctagaaatg 480 atacagagtg gagaatcgcc taaactagga gcatgtattg attttatcag gcaactggag 540 caaatatcct gttttagtat acaaaacaac atcgaaaaca gcatagacct tccttcagtt 600 aattacattg ataaaaataa agaaagaatg attaaatgca agtattgtgg ccttcaacac 660 ccacctagga aatgcccagc atttgggaag agttgtaaaa aatgtggaaa gctgaaccac 720 tttaaaaacg tatgcaaagc aaacactaaa tatgaaaaaa tagtagagga aatagagaac 780 gatgatgcta gtgtgtattc cattaacatg aatgatacaa actcagagat tagtgtaaaa 840 attaatgact acaatataaa aatgcaagtt gacaccggaa gtcaagtcac aattattcca 900 aaaaattttt gggaactaat gtctaaaccg aaattacaaa aatgctacct tcgcctgaaa 960 cagtttgatg gcacagttat aaaagtccta ggcgaatttg aggcaacttt ggaaaccgaa 1020 tcaaaaatga atatcgtacg aattattgtt gcagactgta caaaaaatca tggtctaatt 1080 ggtatggatg tattaaatat caacgccaca aaattagtaa atagtataga acctcttgtt 1140 catgggcgtt taattaatta caaagcaaat attctgttaa aaaacggcgt acaaccaact 1200 tattttgaag ctcgacctct tcctatccat ataagaccac tagtggtaga taaacttaat 1260 ggaatgatac agcaagggtt gcttcaaagg gttcctcctg gtggaagcag atgggcttca 1320 ccgcttgtgg ttgttagaaa aacagacgga gatttacgca tttgtgcgga ttacaaaata 1380 ggtgttaacc aaaaaatatg ctcagactca tatcccatcc caaacattga gaccgtattt 1440 cataaaatgg cagggatgaa atattttgct aaaattgatc tcaagggagc ttatcatcaa 1500 attgagatgg acaaagaggc acaagaaata acaacagtta atacccctat tggattatta 1560 agatggacct gtttaccttt cggaattaag actgcaagtg cgatatttca aagagcgata 1620 gaatctgttg ttggcgatat tattcccaat atgattatat ttcaagacga tatatgtgtc 1680 ggtgctaaaa atgctgaaga actcaaatcg aatttaaaca aaatattgag caaattgaag 1740 gaatcgggaa tgagcataaa taaacgaaag tcagtgaatg aaacaaaaag tctttcattt 1800 ttaggatatg aactgtcaga aaaaggagtt accccagatc gtaatttagt cagcaaaata 1860 cttggaataa gtcctcctaa taataaaaag gaactcgagt catttatagg attaataaac 1920 ttttatggtc gacacattga taaatatgct gaaaaaattt tacctttaaa caatttaaga 1980 aagaaaggag ttacattcga atggaggaag gagcatcaac aggcatttga attgctaaaa 2040 agaaaccttt gtgaagaacc cgtagtgaag gtttatgaca tcaaccaaga tctagagtta 2100 acaacagatg ctagcgagaa agctatatct ggaatacttt cacagaatgg ccatccggta 2160 ttgtatctct cacgaacttt atcagacgct gagacaagat attctaatat tgagagagaa 2220 gctttagcta ttgtatggtc tgctcacaga gctaggcatt ttctgttggg tagaaaattc 2280 aaattaatat cagatcacag gccattagaa tttatatttg atagtaacaa agaattgccc 2340 aaagtcacat ccgctagaat tttaagatgg gccatacaaa tgatggcatt tgactacgaa 2400 ataagctaca agaagggaga aaacattcct catgctgacg cattatcgag attgtccttt 2460 ctaccacaag atgaagagaa agaccaagaa acatttatac acttagtcga gtcagatata 2520 gtgaaccttg agcaaataat aaaagaaaca gaaaatgata gggtattgat ggacatcaaa 2580 accagaatta ggaagaataa ttggagcaga tgctcggtaa cagaaagacc ctataaatct 2640 attaggaata aactgactat cgaaaaagga attgtgtgta atggtgacct cattgtacct 2700 cctaagacaa tgagaaagat cttcatcaaa tcgatccatg atgatataca ttgtggcata 2760 atgcaaacgc gatgtagatt aaaattagaa gcctggtggc ctggctactg tcaagatgtt 2820 gaagaatatg tgaaaaactg tgaaaaatgt gccgaaataa aagtgaagaa ggaaaggaca 2880 ttacatacat ggccaagtga aaataaacct tggtgtagag tacatatgga tcatgcacat 2940 gtacagggta tcggactgtt tttgatttta gtagattctt tttctggatg gccggaagtt 3000 attaaggtaa ataaccgtga agcgaccact gtaaaggcag tactacaaag tgttttttca 3060 agaaatggag ttccagaggt tttagtatcc gacaatgctg cggaatttca tgataccacc 3120 ttacaccagt ggctaaagaa agtaggttgc gtaccttata agactccacc ttaccatcct 3180 caatctaacg gagcagctga aagaatggta gagacagtta agatgggttt aagagcttat 3240 tcaccggaca aaggtttatt atgtggatat ttaagcagaa tgcttttaag ttacagaaca 3300 atacctcatg caggaagatc aaggagtcct tccgagatga tgggaaggca gttgagatcc 3360 cctctcacca tgagtttcga aagtgggaca cctttatggt accgtcaaaa accagattcc 3420 aaacctgaac ctgcctcatt catctctcag gcaggtctaa acactgcgat catattaagg 3480 aataactctt cgggtacgtt agcacactgt gaccaaataa acaaacgcct aaatgagcct 3540 agtatactta tgaataatga agacagtatg cggtatgatc ataccaataa tgaaaaaatc 3600 gatccatgca atagtgataa atctgatatg tccgaactac gtgttactaa agaagaacct 3660 aaccagaatc gaagaagtac tcggtcgaca agggggcaaa agccaatacg tttcaggga 3719 // ID Crack-17_AAe repbase; DNA; INV; 5261 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-17_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5261 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1233-1233 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >96% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 1047..1994 FT /product="Crack-17_AAe_1p" FT /translation="MYCFSEAHYRCRNIMGKAIRRIKERIYFCSTKCSCLY FT QKIVELQNNKSLLLESLATELKGTVSEIVSHELKNFRYDIKQITSAIERSQ FT EFLSSKFDAIVHDFYELKNENETLKQELVKMKEIQQTLSTTVHNLEHQTDK FT SNRDARCNNAVILGVPFNPDDNVYEVVHKVFAGFGANIEPDSITSVSRLSS FT KSLAPIRVVFRSQADREAVFCKSKEHGKLLSSSIDPKFVINGKSTNVTIRN FT ELTPLAMGLLNEMKGYKDKLEIKYVWSSRGGNVLVKKCENSKLEIIKTRND FT LLNIVDRYSEGRNIMVNVHSAPKP" FT CDS 1972..4929 FT /product="Crack-17_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFIQLQNHKPFLHIYCFFFCVTLCLTMANTINLYHDN FT VDDFNSNFSMNNQNFICIAQWNVRGLNNMQKFDDILLFLDNLHVPIDILIL FT GETWLKADNCSLFQIPNYNSVFSCRESSSGGLVMYIRSGLSFNVVKKINID FT GMHSIHSEIKINRQVYDVVGVYRPPSFDFMKFHDELENLFSSFGSRPFYLV FT GDVNIPMNMTNNNVVLRYKSLLESYDSICSNTCITRPASKNTLDHFVCKKE FT ELSKVRNDTIFSDVSDHLIVVSSVKIDSSRECVLLTRRIINKRKLNEMFTN FT ALNNFTCCEDINESIETLTTSYQQILQECTKLKSENVNIKGKHCPWLDLYV FT WQWIKLKNKYLKKVKSNPFDNHLKEMFQYISKKTDIAKKRCKHNYFERLLS FT DTCHAKLWKNLNEIMGRKKQSVKIELNYEGLRTSNNEEICEIFNNYFSNIG FT ENLAKSIKRNNRNPMTTVARIEQTIFLKPTNIREVTTVIHELDVKKSCGPD FT NFPSSIIKSNENKFAEIFSHLFNQMLNQGTYPKCLKVARVTPVFKSGDATD FT PCNYRPISTLSVFSKIFEKLLVTRLLNFLNKHNILYKYQYGFRKGCNTSTA FT IVELVNFLIEGIDKKCTIGGLFIDLKKAFDTLNHQILLEKLESYGIRGLAY FT DIIHSYLNDRKQFVAVQNAHSSTRFIKVGVPQGSNIGPLLFLIYINDLGKL FT PLKGIPKLFADDTALFYPSVSTSSIINCINDDLQLLKRFFDSNLLSLNLSK FT TKYMLFHSPRKKILPHVNPKISNTEVEEVSSYKYLGLRLDPILSWREHIES FT TTRKVSSLCGLMYRVRKFVPRNALLSYYYGCIHSHLQYLIIVWGHASKSNL FT HKIQVLQNRCLKIVYNLPRLHSTVQLYANTSHSILPLRGLCKLQTCLFMFD FT KLKNPDFHCNLTFSTSSHVHNTRYASNILRSRASTCLGQMRITFYGPSVYN FT VVPEHLKTINNRILFKTKLKHHFKSNINEFL" XX SQ Sequence 5261 BP; 1731 A; 819 C; 947 G; 1763 T; 1 other; caacactgtt gtatagttga tacttgtctt actgttaaga ttttttattt tgagtttaaa 60 gagtgtaaat cacgtaaaaa atccaagtaa tattgtatat tgatagtaaa tagtaccatt 120 tccatgtttt taaaaacagt agtatgaata ttgtgcttca tttatgttta taagccaaac 180 attttgcggc attcatagtg ctattgagag gatcatatct gtgttattgg tgggctgctg 240 aatgttgctg ctgactgctg ctgttgttgt tactgctggt ggttctggtg ttttttttta 300 ttcaagcgat aactataagt ataatcgacc ctgatttgtt gcttattatc tcaaaccgtg 360 cacacaatta ggttctggtt tactacggct attgcgaatt gcatgattga gtagttcagt 420 actggattgg cagatttagt gaaattagag cgaacgggcg tcgtacacaa tcttgccgtt 480 ggttcgtgct tggtgtatgg gtgctgaagt agctctgtga gtagcacttt gcgtggttct 540 gtcgatacta ccgagacaaa aacttttttg cgaatttgta tgcaagtttg ttgagtttcg 600 gtgggtgatg gatttagtat tgattgagta aataggcgtc attcaaagct gggcgatact 660 tgtttcggtc attatcattc aatttttgtt tccataaacg gagtaacaat ggcaaaaaag 720 cagtcttatg gcacgttcta gcttacaaaa ccagttgtga tatwttattt atattctaaa 780 ttatttattt aatcgttatt tatatatact tgtagtacaa tatttattgt ctcactgttt 840 ttgtactcat tttttttatt ttaattttat tagcagtata gctttttgtt tgtagtactt 900 acaggtgagt taagcgaatt taatccaaca tagttttttt tttagattgg tgtgttgtta 960 taatttggtg aatatgaacg ataaacaaag tgtttgtgtt ggaatgtaat gaagtagaga 1020 aagatgtgag taaaattatt acctgtatgt attgcttctc ggaagcgcat tacagatgtc 1080 gaaacattat gggaaaagcg atccgacgca ttaaagagag gatttacttt tgctctacca 1140 agtgctcttg tctctatcaa aagattgttg agttacaaaa taacaaatca ttattattgg 1200 aatcactggc gactgagctt aagggaacag tttctgagat tgtctcccat gaactgaaga 1260 attttaggta tgatataaaa caaattacta gcgcgattga aagatctcaa gagtttctct 1320 catccaaatt cgatgcaatt gtacatgact tctatgagtt gaaaaatgaa aacgaaactt 1380 tgaagcagga actagttaaa atgaaagaaa tacaacaaac cttgtctacc acagttcata 1440 accttgagca tcaaaccgat aaaagtaatc gagatgcacg ctgcaacaat gcagtaattt 1500 tgggtgttcc ttttaatcca gatgataatg tttatgaagt tgtgcataaa gtttttgctg 1560 gttttggtgc aaacattgag cctgattcaa ttacgtcagt ctcaagactg tcatcaaaga 1620 gtcttgctcc tattcgtgtg gtttttagga gtcaagcaga tagagaagct gttttttgta 1680 agtcgaagga acatgggaaa ctgttatcat cctcaatcga tccaaaattt gtgattaacg 1740 gaaaatcgac aaatgtaaca atccgaaatg aattgacacc gttagcgatg ggacttttga 1800 atgagatgaa aggatacaag gacaaattgg aaattaaata tgtttggtct agtcgaggag 1860 ggaatgtctt agtaaaaaaa tgtgagaatt caaagctgga aattattaaa actagaaatg 1920 atttgctcaa tattgtggac cgatattcag aaggaagaaa catcatggta aatgttcatt 1980 cagctccaaa accataaacc ttttttacat atttattgtt tttttttttg tgttacttta 2040 tgtttaacaa tggctaacac aataaattta tatcacgata atgtcgatga ttttaactca 2100 aatttttcaa tgaataatca aaattttatt tgtatagcac agtggaatgt cagaggattg 2160 aacaacatgc aaaaatttga tgacatacta ttatttctag ataatttaca tgttcctatt 2220 gatattttaa ttttagggga aacatggctc aaagcagata attgttcatt atttcaaata 2280 cctaactaca attcagtatt ttcttgtcga gaatcgtctt ctggtggtct tgttatgtat 2340 attagaagtg gattaagttt taatgtcgtt aaaaaaatta atattgatgg aatgcattcg 2400 attcatagtg agataaaaat caatagacag gtatatgatg ttgtaggtgt gtatagacct 2460 ccatcgtttg atttcatgaa gtttcatgac gagcttgaga atttattttc atcttttggt 2520 tcacgtccct tttatttagt tggagatgtc aacattccaa tgaatatgac gaacaataat 2580 gttgtcctga gatacaaaag tctactagaa tcgtatgatt ctatttgctc aaacacatgc 2640 atcactcgtc cagcaagtaa aaacacacta gatcactttg tatgtaaaaa agaagaactt 2700 tcaaaagtaa gaaatgatac tattttttcg gatgttagtg atcacttaat cgttgtctcg 2760 tctgtcaaga tagatagctc tagagaatgt gtattactta ccagaagaat tataaacaaa 2820 agaaaactga atgaaatgtt taccaatgct ttgaacaatt ttacctgttg cgaggacatc 2880 aatgaatcga ttgaaactct aacaacttct tatcaacaaa ttttacagga atgcacaaaa 2940 cttaaaagtg aaaacgttaa cattaaaggt aaacactgtc catggcttga tctgtatgtt 3000 tggcagtgga taaaacttaa gaacaaatac ttgaaaaaag ttaaaagcaa tccgtttgat 3060 aatcacttga aagaaatgtt tcaatatatt tccaaaaaga ctgatattgc caagaaaagg 3120 tgtaagcata attattttga gagacttttg agtgacactt gccacgccaa gctttggaaa 3180 aatttaaatg aaattatggg acgtaaaaaa caatcagtca aaatcgaact taattatgaa 3240 ggactcagaa cttcgaacaa tgaagaaatt tgtgaaatct tcaataatta tttttcaaat 3300 attggagaaa atttggcgaa atccatcaaa agaaataata gaaatccaat gacaacagta 3360 gctcgtatag aacaaaccat tttcttaaaa ccaacaaata tccgcgaggt aacaactgta 3420 atacatgaac tggacgtgaa aaagagctgc ggaccggaca attttccgtc aagtataatt 3480 aaatccaatg agaacaaatt tgctgaaatc ttttcacatc tttttaatca aatgctcaat 3540 caaggaactt atcccaaatg cctcaaagta gcaagagtta caccagtttt taagtctggt 3600 gatgcaactg atccatgcaa ttatcgaccc atttctacct tatccgtatt cagtaaaata 3660 tttgaaaaac tgcttgtgac acggcttcta aattttttga acaaacataa tattttgtac 3720 aaataccaat acggttttcg taaaggatgt aacacttcaa cggctattgt ggaacttgta 3780 aatttcttga ttgaaggtat tgataaaaaa tgcacaattg ggggcctttt tattgacctt 3840 aaaaaggcat tcgatacgct gaatcatcag atattattag aaaaattaga aagctatggc 3900 atacgtggat tagcctatga tattattcat agttatctca atgatagaaa acagtttgta 3960 gcggtacaaa atgcacatag ttctacgcga ttcatcaaag ttggagtgcc acaaggcagt 4020 aatattggtc ctcttttatt cttaatatac attaatgatc taggtaaatt accacttaaa 4080 ggaatcccta aattgttcgc tgatgataca gcgttgttct atccgagcgt tagcacatcg 4140 tcaattatta attgcataaa tgatgactta cagctgctca aaagattttt tgattcaaat 4200 ttgctctcac ttaatctgtc taagaccaaa tacatgttat ttcactcgcc taggaaaaaa 4260 attttaccac acgtcaatcc aaaaatctca aacacagaag ttgaggaagt gtcaagctac 4320 aagtaccttg gactgagatt agatcctata ctttcatgga gggaacacat agaatctact 4380 acaaggaagg tctcttccct ttgcgggttg atgtaccgtg taaggaagtt tgtgccgagg 4440 aatgcactcc taagttacta ttatggatgt attcactctc atttgcagta cttaatcatc 4500 gtttggggtc atgcaagtaa atccaatttg cataaaatac aagtgctaca gaacagatgt 4560 ttgaaaattg tttataattt accaaggcta cattctactg tacaattata tgccaataca 4620 tcacacagta ttttgccact gcgtggtttg tgcaagctgc aaacatgttt gtttatgttt 4680 gataagctta aaaatccaga ttttcattgt aaccttacat tttcaaccag tagtcatgta 4740 cataatactc gttacgcaag caacatattg cgctcaagag catccacttg tttgggtcaa 4800 atgcgaatca ctttttatgg tccgtctgtg tataacgtcg tacccgaaca cttaaaaaca 4860 ataaacaatc gtatactatt caaaaccaaa ttaaaacatc acttcaaatc aaatatcaac 4920 gaatttttat aaaatgctgc cagaactgtt atttgatttg ttgtcaattt tctttattgc 4980 tcatcttgtt ttagatttac actctagtta aattatctgt ttcatttatt tatcaactaa 5040 ttagcttttg tcgttcaagg gatcccttca aaggaatact attccactgg gaaatcccat 5100 agtcatattt gttgaacgtc gttcttcgct ctcaccttag cattgaaagt ttacttttta 5160 aattatgttt attatgtata aaaagatgcg tccactacca gggggctcat actgagcttt 5220 ttggtgtggg ggttgagtgg agggtcgcat taaaaaaaaa a 5261 // ID Gypsy-10_AC-LTR repbase; DNA; INV; 191 BP. XX AC AASC02060368; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_AC_; KW Gypsy-10_AC-I; Gypsy-10_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-191 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02060368; Positions 4345 4535. XX SQ Sequence 191 BP; 56 A; 35 C; 49 G; 51 T; 0 other; tgatactgga gttattacgt cctatttggc atttcatgtt tggccttaag gcttggggaa 60 agatctggca aggtcaggca gagaataaag atggcagccc ccaatgtgga tgctgaatcc 120 cctggttcca tgttaagtgt attcctgagt gtctgacaga atgaatagaa acttgcaagc 180 aagacaaaac a 191 // ID Gypsy-1-I_DP repbase; DNA; INV; 6090 BP. XX AC . XX DT 17-MAR-2009 (Rel. 14.03, Created) DT 17-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_DP; 4-bp TSD. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6090 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Daphnia pulex."; RL Repbase Reports 9(3), 655-655 (2009). XX DR [1] (Consensus) XX CC TSD is 4-bp long. XX FH Key Location/Qualifiers FT CDS join(842..1936,1924..5229,5160..5699) FT /product="Gypsy-1-I_DP_1p" FT /translation="MADQDQADHGEFVLPHHGPGNDVHPPNFEQNVPQNVL FT GAPNVPPLIHVGNFRERDPPIFRGLPHEDVVEWIHQFQRVSAFNQWGPVQQ FT LRHIEFSLEGVAATWLSGLQPRPQTYDGMIAALQEAFRHHNYAMELESRLR FT ARKQKPNEPVMSYCYDMIYLCSRVDPEMTEERKLQFIFPNMEPALMQKVFP FT QMDQLTTNELFRRLQAHSQASLMAERSIPVNNIVPAPPVNKKEEIEKLVRD FT EVSKSLDPIVRRLEQATRTMGTAGGSGSNSRMRTPERGRNSQWQNRDRPNK FT RTIDGRPICNSCGTPGHIARNCPKGGKGCFNCGEPGHFSRQCPKRKQDGGS FT PNEQEPKDSNQKPANSPKRQSNTSVKLEPVARGERATVVLNFPHTDSRRLI FT LKQIDCQGQGVKAIIDTGSGASLISPKFCKALGIENFREWEGPRLLLMDGK FT TLEPSGMVKLRIHVEGRLIWVIAAVSEMNGFDLLLGNDALSQLGCFSVQYN FT EAGVGSFSTTPTTSEEIPKGKAGYIVNYETLSIPAFSMMHVDVVVPQLGGQ FT NPGHMVEPSPKVMADKGVSLGRLLLPSRVTGGTHRFPLTNFSSSTQFIPAG FT MVVGKILPVDQVDENRESGPTAVPTEPSLPFASRVNTNLGDEDRERTVALL FT NRYLRCFAASPHELGRSNVVKHVIDTGNHLPVHQAPYASAWRERELINDQT FT QTMMRDNVIVPSNSAWAAPVVLVRKKDGEWRFCVDYRRLNAVTTKDVYPLP FT RIDDALSRMEGSRYFSILDMQAGYWQVEVDEQDRAKTAFITADGLFEFKVM FT PFGLTNAPATFQRMMDVVLAGLKWNSCLVYLDDIVVFAPTVSQHLERLESV FT LQRIERAGLKLKLSKCSFLEQSLKVLGFIVSGEGISPDPEKISAVRDFPVP FT QSVKEVQSFLGLCSYYRRFVPNFAVVARPLSNMTKKNQRFSWGEEHQRSFE FT AMKTILISPPILAHPRYDLPMEIHCDASNYGVGAVLVQKHGDEEHVVAYAS FT RLLSNPEINYSVSEKECLALVWSVRKFRSYIWGLKIRVVTDHHSLCWLLKK FT RDLSGRLARWSLQLQDLDIEIVHRSGRLHSDADGLSRAPTGCPEEEEEIPL FT LNIAVTPGALDVGSAQRESSWWEGILRGMKDTAPTLRIKKLIQPYELRGDV FT LYRRRIRGGVVSYQLCLPKSLVEQVLLACHSDVTAGHLGVTRTTHKIQQRY FT YWPGMRRQITRFVLSCVDCQTKKRAQEAPAGLMRPIRVSQPFEKVGIDLIG FT PFPLTSAGNRHAIVAVDYLTKWAICKAVPSASSKEVVDFFVRNVVLQHGAP FT VFLISDRGKCLTADFSEELFKALQTNHLVTAAYHPQCNGLVERYNHTFAEM FT LSMYVNSLHNDWDGFIDFVTFAYNTSRQESTGFSPFFLLYGREAVLPIDVA FT LGNNPEKDLDVGDSSDRARTLTTKLSAIREKVKKGWRLFNQDRRNGMIAAA FT GKKGEKRMAIVQSRQKKRYDRRRRQVKFAIGDPVLVYRPIRKKGRATKLLH FT RYFGPYRIVRRVSDLNYIVEPLNGRKKNQDCVHVSHLKPFRLSAPSGKASV FT KSTVVKTPVSIIKKKTTDDQVKEKTPKVVRWCDQQTRAVDQEQHCEKDASS FT TTGLSDERVGGHVLRSRRRLKSPTRLDL*" XX SQ Sequence 6090 BP; 1518 A; 1358 C; 1601 G; 1608 T; 5 other; atggtggaga tgcagggtac agttacactc gttgattgag arcaatttac attttgattt 60 tgtgtgygtg aaacaaaaac ttggacaaat ttgtgttgga agattgaggc gtcctatagt 120 caacgtttat gttacgttgc atcgttcgcc cgttcaacat caacatcrac gcytcgcttc 180 tttttgtggt cacgtatcgt tccaaattat ttgtgtggca cgattatctg tttctttytt 240 aaattttgca gattcatcat tgtggaaatt tttattgatt ggaatttgga atcgacgtag 300 tatcaacgtt gtgtacgttg gcgtggcgtg ggtgctgagt catccaacgg agctgagcca 360 cccaacgttt tttccattta cgtcttgccg acgttggtct tgtgttgcga gcagagaatt 420 ttttgattca taactttttt gatattcgtt tattaattct cttttgcttt ctatttagat 480 ttgagtcgaa gattattggc ttttgggtct tctttctgga agtggataat cccgttttga 540 tttgtttgtg aaccagaccc aagggctggt aagttgtgtc ccctaaccct catagatttt 600 ttcttacagg tacttgaggc ttgatagccg ttttgattga gagtgagaga gcattggctc 660 ctgtttggtt gtgagtcttg gactcttttc ttgttgatgg ccacttgttg attatccccc 720 cacaaaactt attgattgcg agtagctccc tccctatttt cgacaattag ttgggattgg 780 aattgggtgg cggccggggc gaatatttgt aaaatttcct tcgctacagt aaacgattaa 840 aatggctgat caggatcaag cggatcacgg ggaatttgtg ctaccacatc acgggccggg 900 caacgacgtt caccctccaa atttcgaaca aaacgttccg caaaacgtac ttggcgctcc 960 caacgtccct ccattaattc atgtgggaaa ttttagagag cgggaccctc caattttccg 1020 cgggttgcca cacgaggacg tggtagagtg gatccatcag ttccagcggg tgtcggcttt 1080 taatcagtgg ggccctgtcc aacaacttcg acacattgaa ttcagtctag agggggtggc 1140 ggcaacatgg ctatcgggcc tacaacctcg tccacaaacg tatgacggga tgatagcagc 1200 tttgcaggaa gcgtttcggc accacaacta tgcgatggaa ttagagtccc gattgcgagc 1260 gcgcaaacaa aaacccaatg agccggtcat gtcatactgt tatgacatga tctatctatg 1320 ctcacgagtc gacccggaga tgaccgaaga gcgtaaactc caatttatat tcccaaacat 1380 ggaaccagcg ttgatgcaaa aagtgttccc ccaaatggat caattaacaa ccaatgagct 1440 tttccggcgt cttcaagctc actcccaggc ctcgcttatg gcggagcgat ccatccctgt 1500 aaacaatatc gttcccgcgc cacccgtaaa caaaaaggag gaaattgaga aactcgtacg 1560 ggacgaggtg tcaaagagcc tggatccaat tgttcgtcga ttggaacagg caacgcgtac 1620 aatgggaact gccgggggat cgggatctaa ttcacggatg cggacaccag agagagggcg 1680 aaattcgcaa tggcaaaatc gggatcgccc aaacaaaaga acaatagacg ggcggccaat 1740 ttgtaacagt tgcgggactc cagggcatat tgcacgtaac tgtccgaaag gggggaaagg 1800 gtgtttcaac tgcggggagc ctggacattt ttcacgccaa tgtcccaaac gaaaacagga 1860 tggtgggtcc ccaaatgaac aagagccgaa agattcgaac caaaagccgg ctaattcccc 1920 taaacgtcag tcaaactaga accggtggca aggggggagc gtgccaccgt tgttttgaat 1980 tttccccaca ctgattctcg tcgacttatt ctcaaacaaa ttgattgcca ggggcagggt 2040 gttaaagcca tcattgatac gggatccgga gcttcgttaa tttctcccaa attttgtaaa 2100 gcgttgggga ttgaaaattt tagagaatgg gaagggcctc gattgttact tatggatggg 2160 aagacgttag aaccgagcgg gatggtcaag ttgagaatcc atgtagaagg gcgtttgatt 2220 tgggtcatag cggcggtgag cgaaatgaat gggtttgatt tgttgttggg aaacgacgcg 2280 ttatctcagt tgggatgctt ctcggtgcaa tacaatgaag ccggggttgg gtccttttcg 2340 acaacgccca caactagcga agagattccg aagggaaagg cgggctatat cgtgaattat 2400 gaaactctaa gcatccctgc gttctccatg atgcacgtcg acgtagtcgt tccacagctg 2460 ggcgggcaaa atccaggcca tatggtagaa ccatctccga aagtaatggc ggataagggg 2520 gtttcacttg ggcggctctt gttaccatcg cgtgtaactg gtgggacgca tcgttttccc 2580 ttaactaatt tctcctcgtc tacccaattc attccagcgg gaatggtcgt tggaaaaatc 2640 ctacccgtcg atcaagtgga tgaaaatcgt gaaagtggcc ccactgctgt ccccaccgaa 2700 ccgtcgctgc cattcgcgag tcgagtcaac acaaatctgg gagatgagga tcgggaaagg 2760 acggttgcgt tgctgaatcg atatttgcgt tgcttcgccg catcaccaca tgagttgggt 2820 cgttcgaacg tagttaaaca cgtaattgat actgggaacc atctgcccgt ccatcaggcc 2880 ccttatgcta gtgcgtggcg tgagcgcgag ttgatcaacg atcagactca gaccatgatg 2940 cgggataacg ttatcgtccc ttctaatagc gcgtgggctg caccagtcgt attggtcaga 3000 aagaaagatg gggaatggcg tttctgcgtc gactatcgtc ggttgaatgc ggtcacaacg 3060 aaagacgtgt accccctccc aagaattgat gatgccctta gtcgaatgga aggttcgcgc 3120 tatttctcca tccttgacat gcaggcgggg tactggcagg tcgaggtcga cgagcaggat 3180 cgggctaaaa ccgctttcat caccgcggat gggctgttcg agttcaaagt gatgccgttc 3240 gggttaacaa atgcgccggc cacgttccaa cgaatgatgg atgtcgtctt ggcaggcctc 3300 aaatggaatt cttgccttgt ctatttggac gacatcgtcg tcttcgcacc caccgtatcc 3360 caacatctcg agcgtctaga atcggtcctc caacgcattg aacgggcggg gttaaaattg 3420 aagttgtcca agtgctcatt cctagagcaa tcccttaaag tattgggctt tattgtaagt 3480 ggcgaaggga tctctccaga tcccgaaaag atctccgccg tgcgtgattt ccctgttccg 3540 cagagtgtaa aggaagtaca aagcttcctt ggcctttgtt cttattaccg gagatttgtc 3600 cctaatttcg ctgtcgtggc ccgccctctc tcgaatatga ccaaaaagaa ccaacgtttc 3660 tcctgggggg aagagcatca gcgtagcttc gaggctatga aaaccatcct gatttcccct 3720 ccaatcttgg cccacccacg gtacgattta ccgatggaaa tccattgcga tgccagtaat 3780 tatggggtgg gagccgtcct tgtacaaaaa cacggagatg aagaacacgt cgttgcgtat 3840 gcgagccgcc ttctgagcaa cccggaaatt aattattcag tatccgagaa agagtgtctc 3900 gctcttgtct ggtccgtccg taaatttcga tcctacatct gggggctaaa gataagagtg 3960 gtgacggatc atcattctct ttgctggcta ttgaagaaac gggatttgtc tgggagactt 4020 gcgcggtgga gccttcagct ccaagacctt gacattgaaa tagtgcaccg tagcgggcgc 4080 cttcactctg atgcagatgg gttgtcacgg gctccaacgg gttgtcccga ggaagaagaa 4140 gagatccccc tcctgaatat cgctgtgacc ccgggtgcgt tggatgtcgg gtcggcccag 4200 cgggaatctt cgtggtggga aggcatctta agagggatga aagatacagc gcccactctc 4260 cggattaaaa aattgattca accctatgag ctgagggggg atgtccttta tcgtcgtcga 4320 attcgtggtg gggtggtatc gtatcaactc tgtctcccca aaagcctcgt cgaacaagtg 4380 cttttagcct gccatagcga cgttaccgct ggacacctgg gagtaacgcg cacgactcat 4440 aaaatccaac agcgatatta ctggccggga atgcgacgtc aaattactcg ttttgtgctc 4500 tcatgtgtcg attgccaaac gaagaaacgg gctcaggaag cgcctgctgg tctcatgcgt 4560 ccgatacgcg ttagtcagcc gtttgagaaa gtggggattg acttgatagg ccctttccct 4620 ctcactagcg ccgggaatcg acacgccatt gtggctgttg attacttaac aaaatgggca 4680 atttgcaaag ccgtcccttc ggcgtcgtcc aaagaagtcg tcgacttctt tgtacggaat 4740 gtagtcctcc aacatggtgc gcccgtattc ctaatctccg atcgcggaaa gtgtttgacg 4800 gccgatttct ctgaagagct tttcaaagcc ttgcagacca atcacttggt tacggctgcg 4860 tatcatcctc agtgtaatgg gttggtcgag cggtacaacc atacatttgc ggaaatgctc 4920 tcgatgtacg taaattcact ccataatgat tgggatgggt tcattgattt cgtcaccttc 4980 gcctacaata cgagtcgcca ggagtcgact gggtttagtc cattcttcct cttatatggg 5040 cgggaggccg tgctaccgat tgatgttgcg ttgggaaata atccagaaaa agatttagat 5100 gttggtgatt ccagtgaccg tgcccgtacg ctgacgacaa agctctccgc tatccgtgaa 5160 aaggtgaaaa aaggatggcg attgttcaat caagacagaa gaaacggtat gatcgccgcc 5220 gccggcaagt gaagtttgct ataggcgatc cggtgcttgt ttatcgcccc attcggaaaa 5280 aaggacgcgc gacgaagctt ctgcatcgtt actttggccc ctaccgtata gtacgtcgag 5340 taagcgacct caactatatc gtggagccat tgaacggaag aaagaagaat caagattgtg 5400 tgcatgtgtc tcatctaaag ccgtttaggc ttagtgcgcc gagtggaaaa gcgagtgtga 5460 aatcaactgt tgttaagaca cccgtttcaa ttatcaagaa aaaaacaacg gacgatcaag 5520 tgaaagaaaa gacgcctaaa gtggtgcggt ggtgtgatca gcaaactcga gctgtggatc 5580 aagagcagca ttgcgaaaaa gacgcttcat caacaactgg attaagtgac gaaagagtgg 5640 gcggtcatgt tctccgctcc cggcgaagat tgaagtcccc gactcgactt gacctgtaga 5700 ttgattgcgt gatttgttgt gtggccaatt gttttgattg tttgtttggc ttgttgaatg 5760 tgtgttgtgt cgtgaagagg ttggaccgtc acgacatgac agagctagct gtgtcgtgaa 5820 gaggttggac cgtcacgact gagtgaggct gtgccgtgaa gaggttggac cgtcacggca 5880 taggctttca ctgtgtggat cctgcgtgtg tgtatgttaa gttagatgtg cacttagttt 5940 aagctagttt aagtttctga taatttcggt ggttgggtaa aaaatcaaca aaaacaaaaa 6000 aaaaaaatat tgggaaccaa gatcatttgc ttttagttgt atattccctg tttgatgatc 6060 gggacgatca tttttcgtaa ggggggaaaa 6090 // ID Gypsy-27_OD-LTR repbase; DNA; INV; 323 BP. XX AC CABV01000704; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_OD_; KW Gypsy-27_OD-I; Gypsy-27_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-323 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000704; Positions 6549 6227. XX SQ Sequence 323 BP; 78 A; 86 C; 55 G; 104 T; 0 other; tgatgcgata cgaaatattc gaacttttga attctcaaac acattgtaac actttactgt 60 atatttgtta cgcgccacta tcttcccgca cttttcggtt tgtcggcgat ccggcccagt 120 gggatttgac acaccccgct gacctgattg acgactgacc tacttgacaa ctgaccgatt 180 ttctgtactt ttaccgccag ctgtgaccga gatcccagcc catataagca agcttcctgc 240 tcagaacttt actcttttca atacaatccc catcagatat cagtttgtat agtctgtttt 300 cgtatatatt ttgcagcgca tca 323 // ID RTEX-13_BF repbase; DNA; INV; 3914 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-13_BF autonomous non-LTR retrotransposon - DE incomplete consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-11_BF; KW RTEX-13_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3914 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3914 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1730-1730 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The 3' terminus is composed CC of the (ATTGTC)n microsatellite. XX FH Key Location/Qualifiers FT CDS 216..3788 FT /product="RTEX-13_BF_2p" FT /note="AP endonuclease and RT domains." FT /translation="MNMCIPQRTQMPRSRLSLNPCKTRNFGISIGFWNIHG FT LGKKLEDSKFRNELNGFDFFSLMETWHTGELSFPDYLYFSSCRTKSKKAKR FT NSGGILFLFKKEYRNLVTKLENKSEDILWVRLDKQLFNSDRDIYLASVYIS FT PQRSTIHANRTYDIFDILEEDISHYSNLGDILVGGDFNARVGTLNDYVTDD FT TPIIDSKILPKDYSFDHPLPRSNMDETQPTVFGKNLIDLCINSKVRILNGR FT TPGDLLGKPTCYQPKGCSVVDYVLASEDIVLRKSIFFHIEALSPLSDHCKL FT TLLLEGNPKPLQSQQKVNKLSTMPYRRFTWNSDSSEKFTEIINSVSFSSNI FT DRFIHKNYELKPESLETALQDLVTPLQEAGKKSLSLHSTKKTKRQPSSKQS FT NKKKWFNQTCASARRELQNLAKLLSRNPRDPHIRGSYFVKKKQYTRLIRKM FT KKEHKEKIFKQLNSLSDNNPSGFWSLLKEYKSNDKTDDHETIPAESLHSHF FT KKLYGIDPNTSKPRSSFETHIENELYKLEKNLENNPLDFPISKAEVKQALS FT NLKNGKASGSDLIRNEMLKCSSNIILEPLVKLFNLFLSAGYFPHEWCTSHI FT VALHKGGTKDDPNNYRGISVTSCLAKLFTSILNTRLTEFLDEYKLISPNQA FT GFRKHFGTRDNLFVMDTLISKYTSENKRLYSCFIDFRKAFDSVWREGLRFK FT LLNSGIGGNFYSLIKCMYEKPSTCVKTSSGLTPPFVTNKGVRQGCNLSPTL FT FNLFINDLVHELDNTECSPPALNNLLVSSLLYADDVVLFSETEKGLQSALD FT KLNIFCTKWKLEVNLKKTKIIIFNKSGRLFSKQNFLFGQERVDIVSSYCYL FT GITITSSGTFSLAKKQLSNKARKAMHSFKCLTRSSVSPDSLLRIFDSCVKP FT ILLYGCEIWGTEKSNDTSPIEITHNQFCKNILGVSRNSSNLACRSELGRFP FT LDIDISVRLVKYWLCLTQMDYTKHPLQVNALLEQHNSVVNGRRRSLLQRVK FT EILDYSGYSYLWYSDNSLYDPVYVCNMIRNRITNMYIQTSLHNISIDTGKL FT RFYKTCKSVYAREKYLELVDFELRSAISKVRISAHPLEVERGRYKRLPVCE FT RICKYCTSKAVGDETHFISKCSFNETERITLFREVTKTYPNFPTLTDKQKA FT TFLLKSQNPQILENVGLFVSLCFQKRSQTQPV" XX SQ Sequence 3914 BP; 1334 A; 847 C; 692 G; 1041 T; 0 other; aagaatggcc ctccccagca gaggtgtatg gccgtccccc agcatgtgga gtacttcccc 60 cgtggcccag ccctcccccg ttcccgtact tcaactaccc caggaatggc cctcctccac 120 ctaggcgaat ggcggggcag tgttagcagt agacgtcgga cacgcaacca gcacggaaca 180 tctttcggac ttaatttgcg acctcaacta tttgaatgaa tatgtgtatt cctcaacgga 240 ctcagatgcc gaggagtaga ctatctttga acccttgcaa aactagaaac tttggtatat 300 ccataggttt ctggaacatt cacggactcg gtaaaaagtt agaggatagt aagtttagaa 360 atgagctcaa tgggtttgac ttcttttccc ttatggagac atggcacact ggtgagttat 420 ctttccctga ttatctttat tttagtagct gtagaactaa atctaagaag gctaaaagga 480 attcgggtgg cattctattc ttattcaaaa aagaatacag aaatctagtc actaaattgg 540 aaaacaaaag cgaagacatt ctatgggtta gattagataa acagttattc aacagtgaca 600 gagacattta tttagcgtct gtgtacatta gtcctcagag atcaactatt catgcaaaca 660 gaacatatga catttttgac atattggaag aagacatctc acattacagt aatcttgggg 720 acatattggt ggggggtgac tttaacgccc gagtaggaac cctgaatgat tatgtcacag 780 atgacacccc gattatcgat tctaaaattc ttccaaaaga ctatagtttt gaccaccctc 840 tccctagatc caatatggat gaaactcaac caacagtttt tggtaaaaat ctaatagacc 900 tttgtatcaa tagcaaagtt agaatactaa acgggagaac gcccggagac cttcttggta 960 aaccaacgtg ctatcagcca aaaggatgta gtgtagtaga ctatgtacta gctagcgagg 1020 acatagttct ccgtaaatca atcttctttc acattgaagc attatcgcca ctctccgacc 1080 attgcaaact cactctactg ttagaaggca acccaaaacc acttcagtcc caacaaaagg 1140 taaacaaact gtcgacaatg ccttatagaa gatttacatg gaattcagat tctagtgaaa 1200 agttcaccga aataattaac tcagtatctt tttcctcaaa cattgacaga ttcatccaca 1260 aaaattacga gctaaaacca gaatcattag agactgccct acaagacctc gttacaccat 1320 tacaagaagc cggaaaaaag tctctttcct tacatagcac aaaaaagacg aaaagacaac 1380 ccagttcgaa acaatcaaac aagaagaaat ggtttaatca aacatgcgct tctgctagac 1440 gtgaactaca aaaccttgct aagcttctct ctagaaatcc acgcgaccct catattcgag 1500 gatcctattt cgttaagaaa aaacaataca ccagacttat acggaaaatg aagaaagagc 1560 acaaagaaaa gatatttaag caactaaatt cattatccga taacaaccct tcgggctttt 1620 ggtcacttct taaagagtac aaatctaatg ataaaactga cgatcacgaa acgattccag 1680 ctgaaagcct acactcgcac ttcaaaaagc tttatggcat agacccaaac acctcgaaac 1740 cacgcagcag ctttgaaaca cacatagaaa atgaattata caagctagag aaaaatcttg 1800 aaaataatcc cctggacttc ccaataagca aagctgaagt taaacaggca ctctctaatc 1860 ttaaaaacgg caaagcttcg ggaagtgatc tcattcgaaa tgaaatgttg aagtgctcat 1920 caaacattat attagaacca ctagtcaaac tttttaacct gttcctgtct gcagggtact 1980 ttccccacga atggtgcacc agccacattg tggctttaca taaaggcggt acaaaagacg 2040 accctaataa ttacaggggg atctcagtga cgagttgtct agcaaaacta ttcacttcta 2100 ttttaaacac acgccttaca gagttcttag acgaatacaa actaatctca cccaaccaag 2160 ccggttttag gaaacatttt ggaacaagag acaacctatt tgtcatggat acccttatta 2220 gtaaatatac atccgaaaac aaacgactat attcgtgttt tatagatttt agaaaggcgt 2280 tcgactcggt ttggcgagaa gggctccgct tcaaattact caactcggga ataggaggaa 2340 acttttacag tcttattaaa tgcatgtatg aaaaacccag cacatgcgtc aaaacgtcat 2400 ccggcctaac acctccattt gttaccaata agggtgtcag acaggggtgc aacctaagcc 2460 ctaccctttt taatcttttt ataaacgacc ttgttcacga gttagataat accgaatgct 2520 ctccacccgc actcaataat ttattagtgt caagtttact atatgcagac gacgtagttt 2580 tattctctga gaccgaaaaa ggactgcaaa gcgctttaga taaactaaac atattctgca 2640 ctaaatggaa gctcgaagtg aatctaaaga aaacaaaaat tatcatattc aacaaatctg 2700 gccgattatt ttcaaaacaa aactttttat tcggacaaga gagagtagac atcgtatctt 2760 catattgcta cctaggtatc accataacat catcaggtac cttttcatta gccaaaaaac 2820 aattgtcaaa caaagcgcgc aaagccatgc acagcttcaa atgtctaacc cgttcatcag 2880 tctcacccga tagcctacta agaatattcg attcctgtgt taaacccatt ctgttatatg 2940 gctgtgaaat ttggggtacc gagaaatcaa atgatacatc tccgatcgaa attacacaca 3000 accaattctg caagaacatt cttggagtct ctagaaatag tagtaattta gcatgtaggt 3060 ccgaattagg tagatttcca ctagacatag atatttccgt tcgacttgta aaatattggt 3120 tgtgtttaac acaaatggat tatactaagc atccattaca agtaaatgca ctcttggaac 3180 agcataactc cgtcgtaaac ggccggagaa gatcattgtt gcagcgtgtt aaggaaatac 3240 ttgattatag tggatactct tatctctggt actctgacaa ctcactctat gaccctgtat 3300 atgtatgtaa tatgataagg aacagaatta ccaatatgta tatccagacg tctctacata 3360 atataagtat agataccggt aagctgagat tctacaaaac ttgtaaatct gtatacgcta 3420 gagagaagta cttagaacta gttgatttcg agctgcgctc cgcaataagc aaagtcagaa 3480 ttagcgctca tcctctagaa gtggaaagag ggagatacaa gagattacca gtgtgtgaaa 3540 ggatttgtaa atattgtaca tctaaagcag tgggagacga aacacatttc atttccaaat 3600 gttcattcaa cgaaaccgaa agaatcaccc tgtttagaga agtcacaaag acttatccca 3660 acttccccac attgacagac aaacagaagg ccacttttct cttgaaatcg cagaacccac 3720 aaattttaga aaatgtgggg cttttcgtct ccctctgctt ccaaaaaaga agtcaaaccc 3780 aacctgttta gaaatagcca acactagtat tagattagaa tattgtttat gactgtccat 3840 tgtcactatg tgtataccat gtacttgcaa ttagccctcg ggcacgaact tgcaaataaa 3900 cttcttatat aaaa 3914 // ID CR1-41_BF repbase; DNA; INV; 2505 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-41_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-41_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2505 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2505 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1612-1612 (2009). XX DR [2] (Consensus) XX SQ Sequence 2505 BP; 919 A; 631 C; 393 G; 562 T; 0 other; gactccgcga atccctaaac aagattaaac cgaagggtgg acaaatctgg ttgggaggtg 60 attttaacct ccctgacata gattgggaca atttgaccgt taaacccaat gcccaatatg 120 gaatcctctc cagaggtttc atagaactag ttaatgattt tggtttaaca caggtggtca 180 aacaaccgac acgtataaac aacatcctcg acatattctt gactaatagc ccatcgctaa 240 tagacaattg ttccatgtta cctgggattg gcgaccacga cggcattcct ttagtaacag 300 ctaatgtaaa tccaaccaga tcaaaacaaa aaccccgtat aacaaaccta tggaacaaag 360 ccgacgaaac tgcaatcaag gctgaactta aagactacag caactcgata accaacagga 420 acaatagcaa attcactgta gatgaactct acgatgaatt cattgacaag attaaatctg 480 taatggataa atatgtccct accaagacta caaagaacaa acatacatca ccttggatta 540 acaacaaagt gaggagatta cagaaaaaga aacaaagagc ctataattct tatcgtaata 600 acccaaatga aaccactcga aacaagtacc actccgtccg caaacttact aaaaaaacaa 660 caagacaaac ttaccgtaaa tatgtaaact ccatctgcac agattcccca aaaaagttct 720 ggtcccatat taaacacttg aaaaacgaag aaactggtat ccctagtcta aaatgcaacg 780 gcaaactcga gtcagacaat aaaaccaaag ccgacattct aaattctcaa tttagctcag 840 tattcaccaa agaaaaagac caactcccta acttaccccc tagcaacacc cctgctatgc 900 ctcacctgaa cataaccacc aatggtatca ctaaactcct aaaagacttg aacccccata 960 aagctagcgg ccctgatggc atcccagcaa gaatacttaa actggcagct gaagaaattg 1020 cccctgccct tactattata tgccaaaaat cactagagtc aggccaaata ccatatccat 1080 ggctccaggc taatattaca cctatcttca aaaaaggcga taaatctgac ccctccaatt 1140 accgcccagt gtctctcacc tgtgtttgca gcaagattat ggaacacata atccactcac 1200 agattatgaa tcactttgac aaacactcca tactaataaa caatcaacac gggttccgca 1260 agaaaaggtc ctgtgaaaca caactcattt tgacaacaaa tgacttagct actactcttg 1320 acaccagatc ccaaacagac atgatcatca ccgactttgc taaggcattc gataaagtgc 1380 cccataaccg actcctcatg aagctaaaaa actacggtat ttcagaccaa ttgcttaaat 1440 ggataacaaa ttttctaaca aatcgtaaac aaagggtagt ggttgggggt gaacactctg 1500 aatggtcaaa tgttgactct ggggtgccac aaggcactgt ccttggccca ctactattcc 1560 taatttatat aaatgactta gccgatgacc taaattccaa cattagactc ttcgcagatg 1620 actgcgtcat ctacagggaa atcaaaaatg accaagacca ttcactgcta caagaagaca 1680 ttaataaatt agacaaatgg caagaggatt ggcaaatgaa actacatcct gacaaatgtc 1740 atgtcatgag gttcacacat aaacgtaaac ccaaattata tgattataga ttaggtaacc 1800 acattctaac ggaaaccaga aaccacaagt acttgggtgt cacactaaac aaccaattat 1860 catggtccaa ccacataaac aacattacat gcaaagcaaa caagaccctt ggtttcgtta 1920 agcgcaattt atatgactgc cccaaaaaga tcaaacaaaa agcctacagt agcctggtta 1980 gaccccacct tgaatatgcc tgtgcagctt gggatcctta tcacaaagac cacattgcca 2040 aactcgaggc ggtccaaaac agagcagcca gatttgtttc aaacgtcccg aactgtccaa 2100 attctcagac aagcgtttct aaactagttt cagacctggg ttgggatact ctcaaaaaca 2160 gaagaacagc aaacagactc accatcctcc aaaaatctag atatgacctc ctagccctac 2220 cggtcgatca ttacttgcag ctgaacccta ggcaatcccg acacaatcat ccaaattcat 2280 ataaacccat taagactaac aaagactgcc tcaaatattc atttttccca agaactgcca 2340 cagattggaa tacattacca tactctacca cactccaagc taaccccact aagtttaagg 2400 aagaggcatt acaacatcta aggaacagcc aataacacaa cagcactacg ccctgtgtgg 2460 tttgccccaa caaactaggg gtgttgcaca gtacaagaac aagaa 2505 // ID I-52_AAe repbase; DNA; INV; 6324 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-52_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6324 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1323-1323 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 164..1501 FT /product="I-52_AAe_1p" FT /translation="MSGRSPGPSGGSAGAICRSNVPEWMLGPDDLGQVMVL FT ILRRKQNVNSQQDATPFDSFIVGTSIQLKVGVKEARTIQASCEGRGTRYIL FT RTSSKTIYEKLIKITQLTDGTEVEIVPHPTLNMVQGVVYDPDTINKEETVI FT LDNLKAQGVQSVRRIKKRVNGVLKNTPLLVLSFHGTIVPNHVFFGLLRIKV FT RVYYPSPMMCFNCGIYGHSKKSCQQPGVCLRCSASHEIPEGEQCTNPPSCL FT HCKSGHPTTSRDCPKYKQEEKIVRLKVDRGISFVEARRIYAEENKQGTIAE FT VVQEQVQQQLAAKDQVITSLQKQVAVLTKELVALKQTLKAYSRSQSSSPLK FT QNETVFQKPAPKTTPSKAVQQNATAQHDRLSRKDQTLLSFSSGQKELNNNT FT QDHGIHTRSRSGKRQMEISPTERTSHRGKRISASSVTNSTLTYTEKNNGPE FT T" FT CDS 1197..6218 FT /product="I-52_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MKLYSKNQHRRLLQVKPFSRTQQHNTIDYRERIRHSS FT LFHLVKRNSTTIRKITAFTHAAEAANAKWRYRPLRELVIGVNASQHHPSRI FT AHSPTPRRIMDPKPDEKIYWRTQNIDPGMKTDKKTDKRLGQRASNTTDQTR FT TLLPPLTAEDYDKIHNLTSIPTTSKQHKPQTDSPSARPPTSTLTRDVPEQS FT DEPLAAVDVGCPLYRASTFLHISPQKQPNSTLIFTNPGSWESNCTEDVTTN FT TTYATDIIPASTAINNHDSPLTVSTDLHRFDSPRARPPTPTLTRDVPVYSD FT EPLAAVGVGRPHPWASITSNLHHEQFEANIFPTYSGNNANGKRRGSPIKVI FT QRMNNSSKPPLPKTNHDSPRARPPTSTLTRDVPEQSDEPLAAVGVGRSHHR FT ASIITHSPSEESAISVVYPQXTASDEEHATSGMDLSCVVAEQGQQYHITEL FT HSPSXASTDSEGCLPKPGKTTPNIAIQWNINGLRSHLGELQLMIAKYQPFV FT ICLQETNADGSKLGSDSLGPNYQLLLSQCSPHGRQGAGMAIKKGTPFQQIS FT FRTKLQAVAIQLQAPTSMTIVSVYLPPKDKNAVGLLGDLLNELPKPILILG FT DLNAHHPAWGSRITNTISEAKKRGDKILELVLQHDMLILNNGTHTRIDPST FT GASQALDVTICSTSHASKFHWKTLMEFSGSDHLPTMISTPGNTNEPKRRPN FT WILEKANWDLYEKITSETLHPKSILTVDEFTNRIISAANSSIPKTTGNVGR FT KSVVWWNEEVETVIKARRKCLRALRRLKEDDPRKPAALKRFHDARSVCRKT FT INDAKQKSWEDLVESINPDTPASQVWNSVNRLQGKRKNTTITLNLPTGHTN FT DGEKIANALADEYQRKSSNENYSAEFRKKHKDNNCTAFKNQRPSLHKNYNV FT DFTIEELMWALNRRAGSSTGDDNISYVLIQRLPFSSKIALLMLFNRVWDSG FT SFPAQWKVGNIIPIPKPEADRSKPEGYRPITLLSCVGKLFERMVNRRLITE FT LETTGRLDPRQHAFRAGKGVDSHFAELESFISLENDEHVEIVSLDISKAYD FT TTYRPAILRTLTEWRICGRLMNIVSSFLCDRYFQVVANGSLSTLRKAENGV FT PQGSTLSVTLFLVAMQPIFEVIPSEAQILLYADDVILIVKGRNHGTIRKTL FT RKAVAAAVGWAASVGFSIAPTKSKLLHACLIPHRKRGRAIKINQTPIPLVR FT SLKILGIILDSKFNLLKHFRTVKENSGKKIHLLRILGYRIKRSSRVTLLNI FT GFALIVSKIFFGLGLTSLNIEAMQRILGPIYNEVVRRATGAFPTSPVISIM FT SEAGCLPFNLALIQRLGQLAIRLLEKYETDVEYPLVKRARDIILDTTGWSL FT PVVCKLLRLTDRAWYASPPKIDNHLKNSIRAGASKETVLPMFKSFISNRYP FT THEQVFTDGSKDNNNVGAGIATSDRHISFRLPDDCSVFSAEAFALKAAVSN FT IQAGKKTIILTDSASCIDALKKGWSKHPWIQSIEKMTENQDITFSWIPGHS FT GIRGNDKADEAAKQGRNETKLDIPLPAQDVLRSMRNKIWEVWQTEWHRTQV FT HLRQIKASPCKYPDRKCPSEQRVLSRMRIGHTRITHVYLMTNSPPPMCNFC FT GVQLTVQHFLVECRGFEQNRERCGINGSTAEILAYNTARETLLIKFLKDCN FT LFNEI" XX SQ Sequence 6324 BP; 2040 A; 1546 C; 1327 G; 1408 T; 3 other; cgatataatc acccatacac acatctgaac tgttgtgtag gcagtgttgt gatccgagtg 60 ctaattggtg acaaagagga aataaaaacg atccttcgtg aatacgttcc ataatttttg 120 gaggaccaac ggtgaagtga atagttcttg gcatctcgaa taaatgtcag gcaggtcccc 180 tggcccctcc gggggctcgg cgggagcgat atgtcgaagc aacgttccag aatggatgct 240 tggtcccgat gatcttggtc aggtgatggt actcattttg cgtcggaaac agaatgttaa 300 tagccaacaa gacgccacgc cgttcgattc gttcatagtg ggcacttcaa tccaactgaa 360 agttggtgtt aaggaagcca gaacgattca agcatcttgt gaaggacgag gtacacgtta 420 cattctacgc accagctcta aaaccatcta tgagaaactc atcaaaatca cacagttaac 480 cgatggcact gaggtcgaaa tagttcctca ccctacactg aacatggtcc aaggtgtagt 540 atacgaccca gacacaatca acaaggaaga aactgtgata ctggacaacc tcaaggctca 600 gggcgtacaa tcagtccgca gaataaaaaa gcgagtgaac ggtgtcttga aaaacacccc 660 cttgttggtt ctctctttcc acggcacaat tgtacccaat cacgtgtttt tcggattact 720 gcgcatcaaa gttcgtgtat attacccctc cccgatgatg tgcttcaatt gcggtatcta 780 tggtcactca aaaaagtcct gccagcagcc tggagtttgc ttacggtgtt ctgcgtcaca 840 tgaaatcccc gaaggagaac aatgtacgaa tcctcccagt tgtctccatt gcaagagcgg 900 acatcccaca acatcgcgtg attgcccgaa gtacaagcaa gaagaaaaaa tagtgcgtct 960 gaaagttgac agaggcattt catttgttga ggcgaggcgt atatatgcag aagaaaataa 1020 gcaaggaacg atcgctgaag tagttcagga gcaagttcag caacaactag ccgctaaaga 1080 ccaagtaatc acctctttac aaaagcaagt agctgtacta acgaaagagc tcgttgcact 1140 aaagcaaact ctgaaggcat actcacgtag ccagtcctcc tcaccactca aacaaaatga 1200 aactgtattc caaaaaccag caccgaagac tactccaagt aaagccgttc agcagaacgc 1260 aacagcacaa cacgatagat tatcgcgaaa ggatcagaca ctcctctctt tttcatctgg 1320 tcaaaaggaa ctcaacaaca atacgcaaga tcacggcatt cacacacgca gcagaagcgg 1380 caaacgccaa atggagatat cgcccactga gagaactagt catcggggta aacgcatctc 1440 agcatcatcc gtcacgaata gcacactcac ctacaccgag aagaataatg gacccgaaac 1500 ctgacgagaa aatatactgg agaactcaaa atattgaccc cggaatgaaa acggacaaga 1560 aaacggataa acgactagga caacgagcat caaataccac agatcaaact cgcactctgc 1620 taccaccttt aactgcagaa gactacgaca aaatccacaa tcttactagc atcccaacaa 1680 ccagcaagca gcacaaacca caaactgatt cgccaagcgc gaggcctccc acgtcgaccc 1740 tgaccaggga tgtcccggag caatccgatg agcccctggc ggcagttgac gtgggctgcc 1800 cactatatcg ggcaagtacc tttttgcaca tttcaccaca aaaacagcca aactccacac 1860 ttatttttac gaatccagga tcctgggaat caaactgtac cgaagatgtc accaccaaca 1920 cgacttatgc aaccgacatc ataccagcat caaccgcaat aaacaaccac gattcaccgt 1980 tgaccgtatc gacagacctg catcgctttg attcgccacg cgcgaggcct cccacgccga 2040 ccctgaccag ggacgtcccg gtgtattccg atgaacccct ggcggcagtt ggcgtgggcc 2100 gcccacaccc atgggcaagt atcacatcca atttacatca cgaacagttc gaagccaata 2160 tatttccaac ctattcagga aacaacgcca atggaaaaag aagaggatcc cccatcaaag 2220 taattcaacg aatgaacaac agcagcaaac caccattacc caagacaaac cacgattcgc 2280 caagagcgag gccccccacg tcgaccctga ccagggatgt tccggaacaa tccgacgagc 2340 ccctggcggc agtcggcgtg ggccgctcgc accaccgggc aagtatcatt acacattccc 2400 catccgaaga atcggcaatt tctgttgtat atccccaamt aacagcaagc gacgaagagc 2460 atgcgaccag cggaatggat ctctcgtgtg tagtagccga acagggtcag caataccaca 2520 tcacagaact acattcacca tcagmagcat ctaccgactc cgaaggatgt ctccccaaac 2580 cggggaaaac aactccaaac atagcaatcc agtggaatat caacggcctc cgctcgcact 2640 taggggaact ccagttaatg attgctaaat accaaccatt cgtgatctgc cttcaggaaa 2700 ccaatgccga cggtagcaag ttaggatcag acagccttgg tccgaattat caactattac 2760 taagtcaatg ctcaccacac ggtagacaag gtgctggaat ggccataaag aaagggactc 2820 cgtttcaaca aatcagcttt cgaaccaagt tacaagccgt tgcaatacaa ctacaagcgc 2880 caacgtccat gacgattgtt tcagtttacc ttccacccaa ggataaaaat gctgtagggt 2940 tgttaggtga tttgctgaat gaactcccaa aaccaattct aatactaggg gacctgaacg 3000 cccaccatcc tgcatgggga agtaggatca ccaatacaat atctgaagcg aagaagagag 3060 gtgataaaat tctggagttg gtattgcagc acgacatgct aatcctcaat aatggaactc 3120 atacgcgcat cgacccttcc acaggggcat cacaagccct tgatgttacc atctgctcaa 3180 catcacatgc atcgaaattc cattggaaaa ctttgatgga attctctgga agcgatcatc 3240 tcccaaccat gataagtact cctggcaaca caaatgaacc aaagcgtaga ccaaactgga 3300 ttcttgaaaa agccaactgg gatctttacg agaaaatcac aagcgaaacc ctacatccga 3360 agtccatttt aacagtggat gaattcacga acagaataat ttcggcagcc aactctagca 3420 tcccgaaaac taccggaaac gtgggaagaa aatctgtagt atggtggaac gaggaggttg 3480 aaacggtgat taaagccaga cgcaaatgcc tgcgtgcgct tcgtcgattg aaagaggacg 3540 acccacggaa acccgctgcg ctgaagcggt tccatgatgc acgatctgtc tgccgcaaaa 3600 cgataaatga tgccaagcaa aaaagttggg aggacctcgt ggaaagtatt aacccggata 3660 ctccagcaag ccaagtatgg aacagcgtta acagactgca gggaaaaagg aaaaacacca 3720 cgattacctt gaatctaccg acaggccaca caaatgacgg agaaaagata gcgaacgctc 3780 tggccgacga atatcaacgg aagtcatcga acgaaaatta ctctgctgaa ttcaggaaaa 3840 aacacaaaga taataactgc actgccttca agaatcaacg tccaagcctt cacaaaaact 3900 acaatgtgga cttcaccatc gaagagctta tgtgggcact caaccgacgt gctggtagtt 3960 ctaccggcga tgacaacata agctatgtgc taatacaacg tctccctttc tcttcaaaaa 4020 ttgcccttct aatgttgttc aacagagtgt gggacagtgg tagctttcca gcgcaatgga 4080 aagttggcaa cataatccct ataccaaaac ctgaagcaga cagaagtaaa cctgaaggat 4140 acagacctat aacgctcctc agttgtgtag gtaaactttt cgaaagaatg gttaatcgaa 4200 ggctcatcac cgaactggaa accaccggta gattggatcc acgccagcat gcattccgtg 4260 ctggaaaagg tgtcgactcc cacttcgccg agttagaatc tttcatcagt ctagaaaacg 4320 atgaacacgt ggaaatcgta tctcttgata tttcaaaagc ctacgatacm acgtacagac 4380 ccgcaattct gcgtacctta acagaatggc ggatctgtgg acgtttgatg aatatcgttt 4440 ctagttttct ctgcgacaga tattttcaag tagttgccaa tggctcccta tctacactac 4500 gaaaagctga aaatggtgtg cctcaagggt caaccctttc tgtaacgctt tttttagttg 4560 caatgcaacc tatctttgaa gtcattccat ctgaagcaca aatactacta tatgcagatg 4620 atgtgatatt aatagtgaaa ggaagaaacc acggtacgat tcgcaagaca ctgagaaaag 4680 cagttgcagc ggcagttgga tgggctgcga gtgtggggtt ctctatagct ccaacgaagt 4740 ctaaactgct acatgcttgc ctcattcccc acagaaaacg tggtcgtgcc atcaaaatca 4800 atcaaactcc tattcctctt gtacgaagcc tcaaaatttt gggaatcata ttagactcga 4860 aattcaactt attgaaacac ttcagaacag ttaaagaaaa cagcggtaag aaaatccatt 4920 tattgcgtat tttgggttat cgaatcaaac gaagcagtag agtaacactg ctaaacatag 4980 gattcgcttt aattgtctcg aagatattct ttggccttgg gctaacgagc ctcaacatcg 5040 aagcaatgca acggattctt ggtcccatat acaatgaagt ggttcgtcga gcaactggag 5100 cttttcccac aagccctgtt atctcaataa tgtctgaagc tggatgcctt ccctttaatc 5160 tagcactgat ccaaaggctt ggccaactgg ctatccgatt gctggaaaaa tacgaaactg 5220 acgtcgaata tcctttagtt aaaagagcta gagacattat attggatacc actggatggt 5280 ctctcccagt cgtctgcaaa ttgttgaggc taacagacag agcatggtat gcatcacccc 5340 ctaaaatcga caatcacctt aagaacagca tcagagctgg cgcaagcaaa gaaacggtac 5400 tgcccatgtt caaaagtttc atcagtaatc gataccctac tcacgaacaa gtttttacag 5460 acggttcgaa agacaacaat aatgtaggcg ccggaatagc tacgtctgat agacatatta 5520 gcttccgttt accggacgat tgtagtgttt tttcagctga agcttttgct ctgaaagccg 5580 cagtatctaa catacaagct ggcaaaaaaa ctataattct aaccgattct gctagctgta 5640 ttgatgctct gaagaaaggt tggtccaaac atccatggat acagtcgatc gaaaaaatga 5700 cagaaaacca ggacatcaca tttagttgga tcccaggtca ctccgggata agaggtaacg 5760 acaaagctga cgaagcggct aaacaaggaa gaaatgaaac gaaactggat attccgttac 5820 cggctcaaga tgttctgcga tcgatgagaa acaagatctg ggaagtatgg caaacggaat 5880 ggcatcggac ccaagttcat ctacggcaaa tcaaggcaag tccttgcaaa tatccagacc 5940 ggaagtgccc ctccgaacaa cgtgtattat ctagaatgcg tataggacac actcgaatca 6000 ctcacgttta tttgatgact aattcgccgc caccaatgtg caacttttgt ggagtacagc 6060 tgacagtgca acactttctg gtagaatgtc gaggattcga acaaaacagg gaacgttgcg 6120 gcataaatgg ttcaactgct gaaatattag cgtacaacac agcaagagaa acattattaa 6180 ttaagttcct aaaagattgt aatttattca acgaaatttg acaaattgct gtttgaaatc 6240 atgtaactct attttaatta ctatctgaca cgaatgccat atgtttggta aagtgtcatt 6300 aataaataat aataataata ataa 6324 // ID BEL-54_AA-I repbase; DNA; INV; 7557 BP. XX AC supercont1.270; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-54_AA_; KW BEL-54_AA-LTR; BEL-54_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7557 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.270; Positions 659961 667517. XX CC Positions [5163-5744] - Integrase core CC 'GCATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 807..5990 FT /product="BEL-54_AA-I_1p" FT /translation="MGEARTREQLMTRRTTLIASLSRTERFVEDYVAERDG FT LEVQIRLDSLDLLWQSLEEVQTALEDLEETNEGKALNLQYRASFEPRLFRI FT KARLKSILPPPLILQANVPVEPPPRAVSTLTGLKLPTISIPEFDGDYMQWL FT TFFDTFKALIHDNQDLPPIQKFHYLRSAVKGEPAQVIETIGISAANYPLAW FT QALVTRYSNEYLLKKRHLQDLLNIPRMKKETAAALHSTVDEFQRHIKILQQ FT LGEPTPAWSTLLEHLLCMRLHDDTIRAWEDHAETVNDQSYTCLVEFLEKRI FT RVLDSVSVNHSSSPSTPPVVPNGNHRRQMPIKTSSYVTTENSAPKCHACDQ FT QHPLVKCPTFEKMTLAERLRTVNSKRLCLNCFRQDHYARDCPSNFTCRVCK FT KNHHTLLHPGQLPSNTKTNSEGSPAVSQSNQRTNQHPTSSSRAQSRVSMQM FT VTKVEEPTVVQCNNVLRNPQPTVFMLTVLVRIVDVYGKEHFARALLDSASQ FT PNLISDRLAQLLNLKRTKVNVIIDGIGEKPEHATDSVRTNVHSRKGNFSRD FT VTFLVLKRMISNLPAEDIPVDDWKLPNDISLADPNFNRSSKIDLVIGAQHF FT FDCFPTAARIKLSEGLPDLIDSVFGWIVAGGTNLAANQESVCYKTVATSLS FT PLEQSMERFWKIEEVGTRPALSVEEKACEELFQSTVSRNDDGRYIVELPKQ FT PNFAEMIGESEAAAIRRFELIERRFSKDPKLKLEYDKFMAEYLSLGHMKPI FT PNGTAPGVVECFLPHHPVIKESSTTTKTRVVFDGSSKTSSGFSLNQALCVG FT PTVQDELLDLVIRNRKFPVAMAADVEKMYRQVLVHPKDTPLQKIKYRFKST FT EPLQSYELLTVTYGLAPSSFLATRALQQLANDEGHAYPLAAPVLKKSFYVD FT DFIGGANSIVEAIKLREELTELLSKGGFPLRKWISNKLEVLQGLPADQIGT FT QSTYRFDPGETVKTLGITWEPESDQFLFDFHVNQRDQTATKRVILSRISQL FT FDPLGLIAPVVVKGKMLIQKLWVIECSWDEKVPSEIRREWENLYDQLPLLS FT KFRIDRYAFQPNSTIQLHSFADASESAYGACVYARSVDAEGNIRVQLLAAK FT SRVAPLQRLSITKLELCAAVIAAQLHSRVVEALQMNIDESFFWSDSTVTLQ FT WLRSPPSSWKTFVANRVSEVQTLTHGSHWSHVSGLQNPADLVSRGMDVGEF FT VESTLWKQGPNWLQLSRENWPRCNATEVPDADEERRGVVVSVARILPSYNP FT IFSRYSSFNRLSRVVALCQKFVQNLRSKSRTQPSTTSPAKFQPLSVEQLIQ FT AKTTLVRLAQADCFQEELRELQQGRPLPKKSPVRLLAPFVDTEGVLRVGGR FT LKLSEQPYLSKHPVLLPSFHPLAQLIAKFYHYKLIHGGGHVTLSVMREEYW FT PINGRRLVRSIIRNCFQCARANPVPASQQIGQLPAQRVTVSRPFEITGVDY FT AGPLYLKAIHKRADPAKVYICLFVCFATKAVHLELVGDLSTPAFISALRRF FT VARRGRPRHLHSDNGKNFIGAKNDLHHLYKMLSDDAQVDRITRFCAEEEIV FT WHLNPPKAPHFGGLWEAAVKVAKKHLHRQLGNSRLSIEDLSTVLAEIEAAM FT NSRPLVPMTEDPNDFTVLTPAHFIIGSTMHAVPSPNVTDIHFTRLNHHQKL FT QQLYQCFWHQWRTEYLQELQKNTRLRQPNHQILPGKLVVIVDEFQHPVRWP FT LARIEAVH" XX SQ Sequence 7557 BP; 2045 A; 1929 C; 1640 G; 1943 T; 0 other; caattttggt gccgtgacca ggatcgtggg cttcgccgct ttcgccatct tccacgcacg 60 ccatcccacc ggatcgaaga gctattgttt cgacaccaac ctagtcgaac gttgccgcca 120 tagagcatac tgaaggacgc catccaacct aacaaaggga ccctattgtt aatcaactgc 180 atgttgttga cgccattgcg ccatccttgg ttcctgagct gatttcttga ttgattcaag 240 gcaaattgaa tcattgatca ggtaaagtac ctgtccaatc cctgcgcatc tcctcacggg 300 tgcctttaga tctccctttc tccaattttg catgctgatc gtcggtggag ctgtgcacgt 360 gtcatccctt tcatcaagcc gactggtaat ctaacaacag tggttcccga caccaatcgc 420 ctaccgccca cggtggcggc cgttttgctt ggtgcagcta cacagtgaag gaagtctgcg 480 catcccaacc caaccaaaga aagtttgcga atatcgatcg tgagacgatt ggtatcgccc 540 ctatccacag acccatccgt gaaacgaaag ccatttttcc aaacgacaaa gacaattcct 600 cgttagacaa gtcaccccta gtgccgtata ggcagagatt gttcctgcgt tccagttttg 660 aataaaatac aaggcatatt caagccttgg ttcaggttag taaccttcca agcatttcat 720 tttcgtttgt ccgggtctac ctggtctttc atatcggcag gcattctcca ctccaacttc 780 actcttcgac gtcaccgcga gcaacgatgg gtgaggctcg tacccgagaa caactgatga 840 cgcgaaggac aacgctaatc gcgtcgctga gtcgcactga acggttcgtc gaggactacg 900 tcgccgaacg ggatgggctt gaggtacaaa tacgactgga cagccttgac ctgctgtggc 960 aatcgctgga ggaggtacag acggctttgg aggatctaga ggagaccaac gaaggtaagg 1020 ctttgaacct acaatatcga gcctcctttg aacccaggct attccgaata aaagccaggc 1080 tgaaatcaat tctaccgccc ccgctcattt tgcaagccaa tgtccctgtt gaaccccctc 1140 cgcgcgctgt gtccaccttg accggtttga agttgccaac aatttccata ccagagtttg 1200 acggtgatta tatgcagtgg ttgaccttct tcgacacctt caaggcattg atccacgaca 1260 atcaggacct tcctccgatc cagaagttcc actatctccg tagtgcggtg aagggagaac 1320 ccgcgcaggt catcgaaact attggcatca gtgccgccaa ctatcctcta gcatggcagg 1380 cgctggtcac tcgttattcc aacgagtatt tgcttaagaa gcgccacctg caggacctgt 1440 tgaacattcc tcggatgaag aaggagacgg ccgctgctct tcattcaact gtagatgagt 1500 tccaaagaca catcaaaatc ttacagcagt taggagaacc aaccccggcg tggagcaccc 1560 tgttagagca tctattgtgc atgcggctac acgatgacac gattcgcgcg tgggaggacc 1620 acgcagaaac cgtcaacgac cagtcctata cgtgccttgt ggaatttttg gaaaagcgga 1680 tccgagttct cgactcggtt tcagtcaacc attcctcctc tccctcgact ccaccagtgg 1740 ttcccaacgg aaaccatcgc agacagatgc ccattaaaac atcttcgtat gttaccactg 1800 agaattccgc cccaaaatgc catgcatgtg atcagcagca tccgttggtg aagtgcccaa 1860 cttttgaaaa gatgaccctt gctgagcgcc ttcgcacggt taattccaaa aggttgtgcc 1920 ttaattgctt ccgccaagac cattacgcac gtgactgccc ttccaatttc acgtgtcgag 1980 tttgcaagaa aaaccaccac acccttctcc acccaggcca gcttcctagt aacaccaaaa 2040 cgaatagcga aggttccccg gcagtttcac agtccaacca acggacaaat caacatccaa 2100 cgtcgtcgtc cagagcgcaa tcaagagtct cgatgcagat ggtgacaaaa gtagaggaac 2160 ctaccgtcgt tcaatgcaac aatgtgcttc gaaatcctca gccaacggtt tttatgctga 2220 ccgtcttggt ccgtatcgtc gacgtctacg ggaaagaaca ctttgccaga gccctactgg 2280 acagcgcttc gcaacccaat ttgatatcgg accgactggc ccaactttta aatttgaaac 2340 gcactaaagt gaatgttatc atcgacggaa ttggcgaaaa accagaacat gccaccgatt 2400 cggttagaac caatgtccat tcccgtaagg ggaacttctc cagagatgta acgttcctag 2460 ttctcaaacg gatgatatcc aaccttcctg cagaggacat tccagtagac gattggaaac 2520 tcccgaacga catttcgctt gccgatccta acttcaaccg ctccagcaag atcgatttgg 2580 tgatcggagc tcagcacttc ttcgactgtt tcccaacagc agcgcgaatc aagttgtccg 2640 aaggacttcc agacttaatt gacagcgttt tcggttggat agttgctggc ggcaccaatc 2700 ttgctgctaa ccaagaatcc gtctgctaca aaactgttgc cacttccctt tcccctttag 2760 aacaaagcat ggaacgattt tggaagatag aggaggttgg aactcgaccc gccttgtcag 2820 tggaggaaaa ggcatgcgag gagttgttcc agtccaccgt ttcccgtaac gacgatggca 2880 gatatattgt tgagttgccc aagcagccaa attttgcaga aatgattgga gaatccgagg 2940 cggcggccat tcgccgtttt gagttgattg agcgcagatt ttccaaagat cccaagttga 3000 agttagagta cgataaattc atggcggaat acttatcgct cggacacatg aagcctattc 3060 ccaatggtac tgcccccgga gtcgtagaat gctttcttcc tcatcacccg gtcattaaag 3120 agtccagtac taccaccaaa acccgggttg tcttcgacgg ttccagcaaa acctcgtccg 3180 gattctcttt aaatcaggct ctttgtgtag ggccgaccgt gcaggacgag ttgctggatc 3240 ttgtcatccg aaaccgcaaa tttccagtag ccatggcagc agatgttgaa aaaatgtacc 3300 gtcaggttct ggtacacccg aaggacacgc ctttgcagaa aatcaagtac cgcttcaaat 3360 ccaccgaacc tctccagtca tacgagctcc tcaccgtcac atacggtctt gcaccgtctt 3420 ctttcctggc tactcgagca ctgcagcagc tagccaacga tgaaggacat gcgtatcctc 3480 tcgctgctcc tgttttgaaa aagtccttct atgttgacga cttcatcgga ggcgcgaatt 3540 ccattgttga agccatcaaa ctgcgcgaag aattgaccga gctgctgtct aaaggtggat 3600 ttccgttgag gaagtggatc tcaaataaac tagaagtgtt gcaaggactc cctgctgacc 3660 agattggaac ccagtcaaca taccgatttg accctggaga aactgtgaaa acccttggca 3720 ttacgtggga gcctgagtcc gaccaatttc tgtttgactt tcatgtcaac cagcgtgatc 3780 aaacggcaac aaaacgagtg atcctatcca gaatttccca gttattcgac ccgctagggt 3840 tgatagcacc agtcgttgtt aagggaaaaa tgctaattca gaagctatgg gtgattgagt 3900 gttcctggga tgagaaggta ccaagtgaaa ttagacggga atgggaaaat ctgtacgacc 3960 aacttcctct tctgtccaag ttccgtatcg atcgatacgc ttttcaacca aactccacaa 4020 tccagttaca ctcgtttgca gacgcttccg agtccgccta cggggcctgc gtttacgctc 4080 gcagtgtaga cgccgaagga aatattcgtg ttcagctttt ggcagctaaa tcccgagtcg 4140 ctcctttgca gaggctgtcc attaccaaac tagaactatg cgccgctgtt atcgctgcac 4200 agctgcattc ccgagttgta gaagccctgc agatgaacat cgatgaatcc tttttctggt 4260 ccgactcaac ggtgacccta cagtggctaa gatctcctcc cagttcgtgg aaaacttttg 4320 ttgctaatcg cgtgtctgaa gtgcaaacat taacccatgg atcccactgg agtcacgttt 4380 ctggccttca aaacccggca gacctcgttt ccagaggtat ggatgttgga gagtttgtgg 4440 agagtacttt atggaaacaa ggtccgaatt ggttgcagct ttcgagggaa aactggccac 4500 gatgtaatgc tacagaagtt cccgatgccg atgaagaacg cagaggagtt gtagtttccg 4560 tcgcacgaat tttgccgtcg tataacccaa tattctcaag atactcctcg ttcaatcgcc 4620 tatcaagagt cgtcgccctt tgccagaagt tcgtccaaaa cctaaggtcc aagtctagaa 4680 ctcaaccaag cactacaagc cctgccaaat tccagccact gtcggtcgag caacttattc 4740 aagccaaaac tactctggtt cgactggccc aagccgattg tttccaagaa gaactgcgag 4800 aacttcagca aggtcgtccg cttcccaaga aatctccagt cagattgctg gctccgtttg 4860 ttgacacaga gggggttctt agagtggggg ggcggttaaa attatcggaa cagccatacc 4920 tatccaagca cccagtcctg ctccctagct ttcatcctct agcccagctg attgccaaat 4980 tctaccacta caaacttatt cacggtggcg gccacgtgac tttgtcagtc atgcgtgaag 5040 agtactggcc gataaacgga cgacgacttg tgcgaagcat catcagaaac tgtttccaat 5100 gcgcccgagc caatcccgtt ccggcaagcc aacagatagg acagctgcca gcccagcgag 5160 taactgtcag cagaccgttc gaaatcaccg gagtggatta tgcggggcct ctctacctaa 5220 aggccatcca caagcgagct gatccagcaa aagtctacat ttgcctgttc gtgtgcttcg 5280 cgacgaaagc ggtgcacttg gagttggtag gcgacctctc cacaccagct ttcatctccg 5340 cccttcgtcg atttgtcgcc cgccgaggac gtccccgtca cctgcattcc gacaacggga 5400 agaatttcat cggcgccaag aacgatctgc accatctgta taagatgctc tccgacgatg 5460 cccaggttga ccggattact aggttctgcg ctgaagaaga gattgtctgg catctaaacc 5520 cgccgaaggc ccctcatttc ggcgggctct gggaagccgc cgtcaaggtg gcgaaaaagc 5580 atctccatcg tcaacttggc aactccaggc tgtccattga agacctctcg acagtccttg 5640 cagaaataga agccgctatg aattcacgac cccttgtgcc tatgactgag gacccgaatg 5700 actttaccgt tcttacaccg gcacacttca tcattgggtc aacgatgcat gctgtaccaa 5760 gcccaaacgt taccgatatc cacttcacgc ggctgaacca tcatcagaag ttacaacaac 5820 tttaccagtg cttctggcat caatggagaa ccgagtatct tcaggagctc caaaagaaca 5880 ctcgactgcg tcagcccaac catcaaatac tgcctggaaa actggtggta attgtcgacg 5940 agtttcaaca tcccgttcgt tggcccctag cccgcataga ggctgtacac tagacctatt 6000 catttgtctg gaaattttcc agtcaaccaa ttctttgatt ttttatgaca aaatgaactt 6060 ctggtcaaaa tttcagtcat tttggaaata atttaggtgt gcttcaaatc aattatgtgt 6120 ttctgagcta atttcaagct tgaaaaattc ataactttcg aacggaacac caaaaatgat 6180 tgaaaatacc tctacatagt agttgattca attctacgaa cttttgtcga acacactttt 6240 atgattggag caagttttaa catagtttgg ttgagatttg tgcttcaagt ttcttgaaaa 6300 tcactatttt tatggaattt tctctctgaa aaacacacct gtttggaaat tacggatttt 6360 taatttccag tacatgatga ctttacatgt ctcaaactca atcccgattt aagttacaac 6420 ttatcttgcg gaaataaatt ataaaaaaaa tagaaaatta ggaagcactt ttataaatcc 6480 gctttgggtg agccaagagc accaccgtga gctttggcgg gtgtaaaaaa tgaacacatt 6540 agcactgctt ttttcagtcg atgacgatga ttgttttcaa atcgttctga atcgtcaatg 6600 aaatgcgatc caaacaatca tcactcggaa agaaaaagca atgctaacat gttcaattca 6660 ttcattggtc ggtacatgtg tcatgcgtga attaactatc atcaggttgc gatgcggtgt 6720 tcgatctttg ggcatttttt gtaatttttt gcctaaagca acatgtaagc tgttaatggc 6780 taatgaattc attgatttat ttcgttcgat tcctgttgac taggtactga aaaactaatg 6840 tcaatgtgtg ttttttctac cgtatatatt cagaaaactg tgatttttgg gtactatgca 6900 gaacaaatca tgatcaaaca atgttaaaac tcaatgtaat cttatttgcg tgttcgacaa 6960 acttaaacag aatagaatta gctttcaaat agaggtaatg acaagtggtt ttgattttcc 7020 acatgctagt tattattttt caaaccttga aattagccct aaaacacatt tttaacttga 7080 agcatactga aatctgtatc aaattagctg aaaatttgac cagacattct tcttagtagg 7140 agaattcagg atactgcttg accggtagat ttccagacat tttttcaaat atgaacaggt 7200 ctactgtaca cccaggagcg gacggtcttg tgcgcgtggt tacattgcgc acagccaaag 7260 gcattttcaa gcggcccatc acgaaaattt gcctgctgcc aacggactca tcaacccaag 7320 caaacacaac gtacctgcaa caaacccaaa cattagacga tcacaaccag caaactggca 7380 acgacgatca tcatgcaaac aacggataga tcatcaagct tctgacccca ataatctgac 7440 caagtgtagt acataagaaa aaattgctaa aatgaattat tttcgattgt gtaagctaag 7500 ccatgaattg aaaatttgtt taattttgaa aatctcagtt ttcaaaggtg gcggcga 7557 // ID BEL-596_AA-I repbase; DNA; INV; 6816 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-596_AA_; KW BEL-596_AA-LTR; Pao_Bel_Ele208; BEL-596_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6816 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5977-6426] - Integrase core CC 'TCATC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 30..6074 FT /product="BEL-596_AA-I_1p" FT /translation="MSRRSNVRQTRSQAKALQDQQANLHDDIERAVLSSID FT NRNMAPSTERAEACDCGGCERPNNAEPMVLCRDCQCYYHYSCASVTSETVR FT TKPFVCSSCVRLTRLPPARSTSGRLSTSSSRQAQIARELQRLEEERQLEEE FT FHREQIEQERLLQERSRREKLERQRQFIARKYELLSQGDLEEDDASNNASD FT NNRVENWIREQQVATSAVPSDAVIHGVKTPEQAPTGDRHTPSREIFPNVQS FT TPLTPKETTGVEPNRMPCLAEQLALAVDIDTTGSITIGDTADDEDERGAIG FT TEVRSSQRFTSISPVDVRIYSQFLKEPNIAKPITGTVPKILKNTTLFEKWR FT SETENLRKCKELESERKKENEIRRKRELQLANQLKQLEIQNRDDLQVQRAR FT EADLKAQLQQLERDQALIEEQKHQELTALQDELQRLRAVEQQFLKHSQQQH FT TGGYIGVLKSDPIELLQGNHQQPTSDEQPERTVYEQPAQMIHEQPTSQHAA FT NVSNYPAPHCTIMPPMSFINVIPNNLQNASINTSPSYPRYSFNENYSPSPI FT HTPVGGRSHYNVPPIVPPHVVSGVQSQRPIPNPFVMNTAVPNIGFQSAELG FT PSPQQLATRQVVSKDLPVFSGDPVDWPLFCSSYQHSTQACGYSNSENLLRL FT QRSLKGRAREAVSSFLLHPSTVPQVLSTLQILFGRPEHIIHNMISKIREIP FT APRADKLETLVGFGLAVQNLCGHLKAIGLEQHLSNPMLLQELVDKLPANVK FT LSWALHQQLLPAVNLTEFGEYMGNIVSATSSVTCFNALPPKSSKEPRSKEK FT VYVNAHTTSNEGRSEGYRNTSGEWRGSNNRAKSPDKQIAVSKMCPACGARS FT HQTAACPSFKRLSIDDRWKLVKENKLCRRCLTPHTRWPCKGEVCGIDGCQK FT KHHRLLHSNNSSSEVSTQNPTNATVAVHRQISSSTLFRVLPVTLYGKNGQV FT NTLAFLDDGSSVTLVERSLIDGLGVNGSAESLCIHWTGGVKKQITDTNLVQ FT LQISALGSNQRFKLNEVYTVEHLGLPEQSLNFNEMSARFQHLEGLPVQSYD FT AAVPGVLIGLTNIHLLATLKLREGRQNEPIATKTRIGWSIYGCQRGNVEVM FT PHRQLHICAKPADEDLHDYVQSFFSIESLGIAMVPQVEGADDRRARKILQE FT TTIRTNSGRFQTGLLWKHDGIEFPDSRPLAEKRLRCLEKRLSKDPQLYDNV FT RRQMLDFLEKGYAHKATTEELRSFDPRRTWYLPLGVVLNPRKPGKVRIIWD FT AAAKVEGVSLNSMLLKGPDQLTSLLSVLFKYREREVAISGDIREMFHQLLI FT QEEDRSALLFLWRNCPNSPIDIMVMNVATFGASCSPAQSLFAMNLNATENE FT ADYPRAAAAIKHRHYVDDYLDSVDTEEEAAELALEVAEVHSKAGFEIRNWL FT SNRTTVLDKIGEVKPGSVKCFGADKETGTERLLGMVWRPDEDFFSFSLSFR FT EDLQQLIEGEIVPSKREMLKITMSIYDPLGIVAAFVIHGKVLVQEVWRAKI FT DWDEKIPWEIFCRWQQWLAVLRKMNTVKVSRCYFPGYSIECYNSLDLHIFV FT DASEEAYAAVAYFRVVDNGRVRCSLVSSKTKVAPIQPLSIPRLELLAAVLG FT ARLRQTIEEKHSLEIRRTFFWSDSSTVIAWISSDARRYRQFVAFRVNEILN FT LSSINEWRWVPTKLNVADEATKWGKGPSFDSKSRWYHGPGFLYDNEEEWPK FT DCRYEVNNTVEELRPAFVCNHHVTEPAVDLERFSRWERLLRSVAYVLRFID FT NRMWQHRKTSSKIGALSRDELQKAERCLWRIAQFDGFPDEVTVFKYNLRSN FT SSHRRRLERNSPLIKLTPAIDEYDVLRIDGRIQEADFVDFDTRNPIILPRR FT HRVTTLLLDWYHRKFQHANDETVVNQVRQRFYIPRLRVQVRLARKQCMWCR FT VYNATPTIPRMGPLPKARLTPFCRPFTMVGIDYFGPYNIRSPTGRCTLKRW FT VVLFTCLTIRAVPRRNCSQPIDGLLQESDSAVHRTQRCST" XX SQ Sequence 6816 BP; 1970 A; 1631 C; 1693 G; 1522 T; 0 other; aattctcgaa gatttatcct tgaggagtca tgtccaggag gtcaaacgtt cgccaaacga 60 gatcccaggc caaagctttg caagaccagc aagctaacct ccatgatgac attgaacgag 120 cggtactttc atctatagat aatcgcaata tggctccatc cactgagcgc gcagaagctt 180 gcgactgcgg tggatgtgaa cggccaaaca atgccgagcc aatggtatta tgtcgcgatt 240 gccagtgcta ttatcactat tcgtgtgcca gtgtaacgtc tgagacggta cgtacgaaac 300 cattcgtttg ctcttcttgt gtccggctaa ctcgtctccc cccggcacgc tccacgtctg 360 gtcgtttgag tacgtccagt tcacgacaag cacaaatagc gcgagagcta cagcgactag 420 aggaggaaag acaactggag gaagaattcc atcgggaaca gatagagcag gagagattgc 480 tgcaagaaag atctagacgg gaaaagctag aacggcagcg acagttcatc gctcgaaaat 540 acgagcttct gagccagggg gacttagaag aagatgatgc tagtaacaac gccagcgaca 600 ataaccgagt cgagaactgg atccgcgagc agcaagtggc taccagcgct gttccttccg 660 atgccgtgat acacggagta aagaccccag aacaggctcc aaccggggac cgccacacgc 720 cttcgcggga aatattccca aatgtgcaat cgacaccgct aaccccaaag gaaacgaccg 780 gtgttgaacc gaaccgtatg ccatgtttag cggaacagct tgctttggca gtcgatatcg 840 acaccactgg cagcattact attggggata cagccgacga tgaagatgaa cgtggcgcta 900 taggtaccga agtgagatca tcccaacgct tcacctctat atcaccagta gatgtacgga 960 tatatagcca gttcttgaag gaacccaaca tcgccaagcc gatcactggg acagtgccaa 1020 aaatattgaa gaacacaacg ttattcgaga agtggcgctc tgaaaccgaa aacctcagaa 1080 agtgcaagga attggaatcg gagcgtaaga aggagaacga aatacgacga aaacgggaac 1140 tgcagctggc gaaccaatta aagcagctag agatccagaa ccgtgacgat ttgcaagttc 1200 aacgagcacg agaagcagat ttgaaagcac aacttcaaca actggagcgt gaccaagcac 1260 taatcgaaga gcagaagcat caagagttga ctgcgcttca ggacgaacta cagcggctcc 1320 gagcagtgga acagcagttt ctgaagcatt cccaacagca gcatactggt ggatatatcg 1380 gtgtgcttaa atccgacccg atcgaactgc ttcagggaaa tcaccaacaa cctacgagcg 1440 acgagcaacc agagcgaacg gtctacgagc aaccagcgca gatgatccac gagcagccaa 1500 cgtcgcaaca tgcagcaaat gtaagtaact accccgcacc ccattgtacc atcatgccac 1560 caatgagttt cataaatgta attcccaata atttacaaaa cgcatcaata aatactagcc 1620 cctcgtatcc gcgctacagt tttaatgaaa attattcccc atcgccgatc catacccctg 1680 tgggtggtcg aagtcattac aatgtaccac caatagttcc cccccatgtt gtgtctggtg 1740 tacaaagcca acgtccgatt ccaaatccgt ttgtgatgaa cacagcagtg ccaaacattg 1800 gattccagtc agctgagctt ggtccatcac cccagcagtt ggctactagg caagtagtgt 1860 caaaagattt gcctgttttt tcgggggatc cagtggattg gccactattc tgcagcagct 1920 atcaacactc gactcaagcg tgtggttatt ccaattcgga gaatttgctt cgtttacaac 1980 gaagtctcaa gggtcgagct agggaagcag tcagcagctt tttgcttcat ccctctacgg 2040 ttccacaggt gttgtctacg ttgcaaattt tgtttgggag gccagaacac atcatccaca 2100 atatgatttc caaaatccgc gaaattccag caccaagggc ggacaaactg gaaacacttg 2160 tgggtttcgg tcttgcagtg cagaaccttt gtggacatct caaggcgatt gggttggagc 2220 agcacctttc aaaccctatg ctactgcaag aactcgtcga caagttgccg gccaacgtca 2280 agcttagttg ggcacttcac caacagctat taccagcggt aaacctcact gagttcggag 2340 aatacatggg caacatcgta tccgccacca gcagcgtgac ctgtttcaac gcacttccgc 2400 ccaaatcatc gaaggaacct cggtccaagg agaaggtata tgtgaacgca cacacaacgt 2460 cgaacgaagg tagaagcgaa ggctatagaa atacgtctgg cgaatggaga ggttcgaaca 2520 atcgagccaa gtcacccgat aagcaaatcg ctgtcagcaa aatgtgtcca gcatgtggag 2580 ctagaagtca tcaaacagcc gcatgtccgt ctttcaagag actatccatc gacgatcggt 2640 ggaagctagt gaaggaaaat aaactatgcc gtcgctgtct aacaccgcac acacggtggc 2700 cttgcaaggg agaagtctgc gggatagacg gttgccaaaa gaaacaccat cggctgctac 2760 actccaacaa ttcgtcgtca gaagtatcaa ctcaaaatcc aactaatgca acagtagcag 2820 ttcaccgtca aatcagctcg tcaacgttgt ttagagtcct tccggtaacc ctgtatggta 2880 aaaacggtca ggtgaacaca ctggcattcc tcgatgacgg ttcgtcagtc actttggttg 2940 aacggtcatt gattgatgga cttggagtaa acggttctgc agaatcccta tgcatccact 3000 ggacaggagg tgtgaagaaa caaattaccg acacaaatct cgtgcagcta caaatatcag 3060 cattgggcag taatcagcgg ttcaaactaa acgaggtcta cacagtagaa cacctcggat 3120 taccagagca gtcgctgaat ttcaacgaga tgtctgcacg ttttcagcac ttggaaggtt 3180 tacctgtaca gagttatgat gctgcagtac caggtgtgct aataggacta accaatattc 3240 acttgttggc taccttgaaa cttcgagaag gtcgacaaaa tgagccgatt gcgacaaaaa 3300 cccgtatcgg ttggtcaatc tacggatgtc agcgaggaaa tgtagaagtt atgccacacc 3360 gacaactgca tatctgtgcg aaaccagccg acgaagatct gcatgattac gtacagagct 3420 tcttttccat agaaagcctt ggaattgcga tggttcccca ggtcgaaggt gccgacgatc 3480 gacgagcacg caagattctg caagaaacaa ctatacggac aaatagtggg agattccaaa 3540 ctggtctctt atggaaacac gacgggatag agtttccgga tagccgtccg ttggcagaaa 3600 agcggttgag gtgtctagag aaacggctgt cgaaagaccc gcaactttac gataacgttc 3660 gacgacaaat gctggacttc ttagagaagg gctatgcaca caaagctacc acagaagagt 3720 tgcgcagttt cgatccacgt cgaacctggt atcttcctct cggtgtagtt ctcaacccca 3780 gaaaaccggg taaagtgagg attatctggg acgctgccgc aaaagttgaa ggcgtttcac 3840 tcaattcgat gctgttaaaa ggaccagatc agttgacttc gcttttatcc gtgctgttca 3900 agtatcgcga gcgggaggtg gccatttcag gagacatcag agaaatgttc caccagcttt 3960 tgatacaaga ggaagaccgt agtgcactac tttttctatg gagaaactgc ccaaacagcc 4020 cgatcgacat aatggtgatg aacgttgcaa catttggtgc atcgtgctcc ccagcgcaat 4080 cgctattcgc catgaacttg aacgctactg agaacgaagc tgattatccg agagcggccg 4140 ctgcaatcaa acatcggcac tatgtagacg actatttgga cagtgtcgac acagaggaag 4200 aagcagcgga attagcactg gaagtagcag aggtccattc caaagccggg tttgaaataa 4260 ggaattggct gtcaaataga acaacggtac ttgataaaat cggtgaagta aaacccggtt 4320 ccgttaaatg tttcggtgca gacaaagaaa caggcacgga gaggcttctc gggatggttt 4380 ggaggccaga cgaagacttc ttttcatttt cattgagctt ccgtgaagat ctgcaacagc 4440 taatcgaagg agaaatagtt ccgagcaaga gggaaatgtt gaagattaca atgagtatct 4500 atgatcctct gggtatcgtg gcagccttcg ttattcatgg gaaggttctt gttcaggaag 4560 tctggagggc caaaatcgat tgggacgaaa agattccctg ggaaattttc tgccgttggc 4620 aacagtggct tgccgtactc cgtaagatga atacggtgaa agtctctcgt tgttacttcc 4680 cgggatacag catcgaatgt tacaactcct tagatttgca catcttcgtt gatgcaagcg 4740 aggaagccta cgcagcagtc gcttattttc gcgtagtcga caatggtcgt gtcagatgtt 4800 ctttggtttc ttcgaagacg aaggtagctc caatccagcc cctttcgata ccgcgcctcg 4860 aacttttggc agcggtactt ggcgctcgtc tgcggcagac aatcgaagaa aaacattctt 4920 tggaaatacg tcgaacattt ttctggagcg actcatccac tgttatcgcc tggataagtt 4980 cggatgctcg gcggtatcgg caatttgtag cattccgagt caatgaaatc cttaacctat 5040 catcgattaa cgaatggcga tgggtgccca caaaactcaa tgtggctgac gaagccacaa 5100 agtgggggaa aggcccttcc tttgactcaa aaagccgatg gtatcatggg cctggttttc 5160 tgtacgacaa cgaagaagaa tggccgaagg actgtcgata cgaagtcaac aatacagtgg 5220 aagaacttcg accagccttt gtctgtaatc accacgttac cgaaccggca gttgacttag 5280 aacgattctc tcggtgggag cgattgctgc ggagcgtagc gtacgttttg cggttcatcg 5340 acaacagaat gtggcagcac aggaaaactt cctcaaagat aggcgcgctg tccagagatg 5400 aactacaaaa agcagaaagg tgcttgtggc gaatagcaca attcgacggt tttccggacg 5460 aggtgacggt gttcaaatac aatctgcggt caaattcctc gcatcgtcgg agactagaac 5520 gaaacagccc ccttatcaag ttaactccag cgatcgacga gtacgacgtg ttgcgcatag 5580 acgggcgtat ccaggaagct gacttcgtgg acttcgatac tagaaatcca attatcttac 5640 ctagaagaca tcgggtgacc actctactac tggactggta tcaccgaaag ttccagcatg 5700 caaacgatga aacggtggta aaccaagtca ggcagcgttt ctacatacca agattacgag 5760 ttcaggtacg ccttgcgagg aaacagtgca tgtggtgccg agtttataac gctactccaa 5820 caataccaag aatgggaccg cttccaaaag ccagattaac gccgttttgt cgcccattta 5880 ctatggtcgg aatcgactat ttcggaccat ataatatacg atcaccaaca ggaagatgta 5940 cgttgaagcg gtgggttgtc ttatttacct gcttgacaat cagagcagtt ccacgtagaa 6000 attgcagcca acctatcgac ggactcctgc aagaaagcga ttcggcggtt catcggacgc 6060 agaggtgctc cacgtgaaat cttctcggac aatggtacca atttcgttgg tgcgagcgga 6120 gaattggcct cagaaatacg gatgatcaat agagaaatca gtagcacgtt cactgacatc 6180 cagacacagt ggcagttcaa ccctccatcg gcaccgcaca tggggggatg ctgggaaagg 6240 atggtgcgct ccttcaaaac tgctttagga gcccttccga ccgttcgcct gttagatgaa 6300 gaagcgttcg ctacggtact cgtcgaggcg gaatctatga tcaattccag accattgacc 6360 ttcatcccac tagagaccgc tgcacacgag tctttgacgc caaaccattt ccttctcctc 6420 agttctactg gtgtacgaca acctatcaaa acccctgtag acgataaggc tgcaatacgt 6480 aatagctgga atatgattca aaacacactt gatgaatttt ggcggcgatg ggttgaagaa 6540 taccttccga ccttaacccg gagaacaaaa tggttcgaag acgtccagcc gatacgagag 6600 ggtgcgctag tagtggtagt tgacgggaat gttaggaatc ggtggcaaag ggggcgagta 6660 gtacgaacat atccgggtaa agatggcact gtacgtaggg ccgatgtgca gacatcctgc 6720 ggcatcctac gacgaccagt cgtcaaacta gctttgatcg atgtggataa ggaaagtgac 6780 accgaagcaa cagttctggt gacacgaagg gcggaa 6816 // ID DNA-TA-3_CQ repbase; DNA; INV; 418 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-418 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 53-53 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. TSDs are TA. XX SQ Sequence 418 BP; 158 A; 75 C; 95 G; 90 T; 0 other; cagtcgactc tctggttgtc aatatccaag ggaccgtcga ggaagagaat catcagttta 60 cagaacgatg caaaatgaag actcgattga aaatattttt ttcttgatac ccagctatgg 120 gagagaatca tggcaacgtc catcaaacaa aaacaaacta atgtcaaaca cccttcaaag 180 cttcgtttcg ccaagaaaaa tgtctatgca agccatgaga aagtgaaatt attgacaacc 240 ggaagagatt tttaaagcaa acagaatcca agggaccgtc gaggaagaga tccttcaagc 300 aagggaaaat atcgaggaat gaagacaatc gaagtatgca gattgaaggg actgaagaat 360 tcatcgatag atggagaatt attgatatcg agaagatcga cagccagaga gtcgactg 418 // ID Gypsy16-I_Dpse repbase; DNA; INV; 6852 BP. XX AC Unknown_singleton_14; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy16-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-6852 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1100-1100 (2009). XX DR EMBL/GenBank/DDBJ; Unknown_singleton_14; Positions 22974 29825. XX FH Key Location/Qualifiers FT CDS 935..3496 FT /product="Gypsy16-I_Dpse_1p" FT /translation="MHKIEVGSATPIKQRHWPVSPAIEKLMFAEVDHMLAL FT DVIEESSSPWSSNCVLVRKGEKNRLCLDSREVNKVTIKDAYPLPHIDGILS FT RLPPARFITGLDMKHAFWQIPLDEKSRQCTAFTVPNRPLYQYKTMPFGLCN FT AAQTLCRLMDQVIPAHLRKCVFVYLDDLLVLSEDFESHLVTLQEVSVCLKK FT ANLTINIEKSKFCMREIKYLGFIIREGQILTDPGKIKAVNEFPVPTSIKQL FT RRFLGLSGWYRRFVENYATISFPLTELLKNKKSIIWNDDAEKAFNLLKSRL FT TSAPILITPDFTSQFILLCDASTYGIGCVLAQEREGVELPIAYMSEKLSKA FT QRNYTVSELECLAVIKGIKKFRAYIEGQDFVVVTDHASLKWLMKQKDLSGR FT LARWSMKLQSFNFSITHRRGSENVVADALTRRNQPAIEEFVASGPMIDLQS FT SHFKSPEYLSVIQRVQENQSKLLDLKVIDGYVYKRTEFNRDNDGHEESWKL FT WVPSAMIPEVLQQAHDAPSSAHCGMAKTVEKLKRYFFWPRMVSQIREYISE FT CHICRSTKSPNKMLKPPMGRPLPSDRPFQCLYMDLLGPYPRTKNGNIGLLV FT VVDHNTKFHFLSPLKKFTAPKMCEYLESSIFYTFGVPEVVLTDNGSQFKSS FT CFEAFLTSFGITHRCTAIYSPQANASERLNRSVLAAIRAYIGSDHSNWDAN FT LNAINGALRAAVHRSTGYSPYFLVFGQNMILNGGDYTLLRNVDLLSNDVRL FT EHSDLLQLARQDAKKNIKKSHELNAKKYNLRSRNVQLKTGDEVFARNFTQS FT NASKKFSSKLAPVFISAKVLKKRSPYYYELVNPAGKKIGVFHLKDIQVKNK FT *" FT CDS 6089..6850 FT /product="Gypsy16-I_Dpse_2p" FT /translation="MAKMPVTHSNENLDGIREQSDPFCYICNEVILVSELV FT ASTPCNHRFHKCCIIEAITTTPACPICAACCLISQLRYGNNTLAAELGDAA FT EEAETGIKDISATDSGAHTLRRMGGLYKSRGRGRPISSKGGVTTRSKSRLT FT DPSRSEQEENLSGKSPDDIVRYIQNALSVQQENMIDDMVDILNKNIEETIQ FT RQLATLNLNIATQAVENVETGNRQSGVRASLAGNISSHSSSRSRSNAQLAP FT AKATNIIQNWRLK" XX SQ Sequence 6852 BP; 2178 A; 1333 C; 1471 G; 1870 T; 0 other; aacctaacct gttggaactg tgaaaaacct ggtcatcgat ttgatgattg tctagagcag 60 cgctctatat tttgttatgg ttgtggcgga aggaaacgtt caagccgaaa tgtcccaggt 120 gtaatccagc gggaaaccga ccggcgggtg tgtcgatcga caaaacctgg acgcacccga 180 aactgtgaag actcctagta tacccgtaaa attgtcaaat gaaacaaagg caaaaactag 240 tttagaaaat aagttagaag aagccaaacc ccatttcttt actattccat tacatgtaag 300 gatagctaat tacaataagc ttcgcgataa attgtttggc gagactataa attccaaaat 360 ccatctgacc caaagaaggt cgacgaagcg attaagacag tattggaaat cgattcgctc 420 tgacagaaaa aaactggtag ccgcgatttt tagtaaaact gataacagac tgtacacaga 480 agtatgcatt gaagggaata actacactgc gctcctagac tctggggcaa ccctgagttg 540 tgtcggaggc aaggcagcac tggagttcct agcgttagga aaaaccaaaa agtgctccgg 600 caacatacga actgctaatg gaacgccaag taaggttgtt ggcaaattac aaactcaaat 660 tacttatcga aagaaaacag catggttaga tctttttatt atcccaggtc tcaaacaaga 720 tatttatctc gggatagact tttggcatca gtttggatta acagatagca tcagcaacgc 780 gcaggtttcc gagctagatg tgggtggaga gactcatgac caagaactcc ccaagtggca 840 tcttttgtca aatgaacaac aaaatcgcct aaaggccgtt gtagaagttt ttccatcatt 900 tgccaagcaa ggtttaggca gaacaaattt gataatgcat aaaatagaag tcggatccgc 960 aacacccatc aagcaaagac attggccagt gtcaccagca atagagaaac taatgtttgc 1020 cgaagtcgat catatgttag ccctagacgt aattgaagag tcttcaagtc catggagcag 1080 taattgcgtg ttggtgcgga aaggggaaaa gaacaggctt tgcttagatt cgcgtgaggt 1140 aaataaggtc actataaagg atgcttatcc gttacctcat atagacggta ttttgagtcg 1200 attgccgcca gctaggttca taacaggcct agatatgaag cacgcctttt ggcagatacc 1260 actcgacgaa aagtcacggc aatgtacggc cttcacagtg ccaaacaggc cactatatca 1320 atataaaaca atgccgttcg ggctctgcaa tgctgcgcaa accctctgtc gactaatgga 1380 ccaggttata ccggcccatc taagaaaatg cgtctttgta tatttagatg acttacttgt 1440 gctgtcagag gatttcgagt cacatttggt cacgcttcag gaagtttcgg tttgcctaaa 1500 aaaggctaat ctgactatta atattgagaa gtccaagttt tgtatgcgcg aaataaagta 1560 tttgggattc atcattcgag aaggacaaat tctgactgat ccagggaaaa taaaggccgt 1620 caatgagttt cctgttccta catctataaa acagttgcgt cggttcttag gattgtctgg 1680 ttggtatcgg cgatttgtcg aaaattatgc caccataagc tttccactga cggaattgtt 1740 aaagaacaaa aagtctataa tttggaatga tgatgccgaa aaagctttca atcttctaaa 1800 atcgcgttta acatcagctc cgattttgat taccccagac ttcacaagcc aatttatctt 1860 actgtgtgat gcaagtacct acgggatagg ttgtgtcctg gcgcaagagc gtgagggagt 1920 ggaactccca atcgcgtata tgtcggaaaa attatcgaaa gcccaaagga actacaccgt 1980 tagcgaactg gaatgcttag ccgttatcaa gggtataaaa aaatttcgtg cctatataga 2040 aggtcaagat tttgtggtag taacggatca tgcgtcttta aaatggttga tgaagcagaa 2100 agatctttct ggaagactag cgaggtggtc gatgaaatta caatcattca atttttctat 2160 tacacatagg cgagggtcag aaaacgttgt tgctgacgcc ctgactcgac gtaaccaacc 2220 tgctatagaa gaattcgttg ccagtggtcc catgattgat ttacagtcaa gtcattttaa 2280 gtcgccggaa tatcttagcg tgattcaacg agtacaagag aatcagtcaa aactgcttga 2340 cttaaaagtt attgatgggt atgtctataa aagaaccgag ttcaatagag acaatgacgg 2400 acacgaggaa tcctggaaac tttgggtacc ctccgcgatg atccccgaag tattgcaaca 2460 agcacatgat gcgccaagct ctgcacattg tggtatggct aagacggtag aaaaactcaa 2520 aaggtatttc ttctggcccc gaatggtaag ccaaattcga gagtacatct cagaatgcca 2580 tatttgtcga tctactaaga gtccgaacaa aatgttaaaa cctccgatgg gccgaccatt 2640 accatcagac cgcccctttc aatgcttgta tatggatcta ctaggtccgt acccccgtac 2700 taaaaatgga aacattgggt tgttggtagt agtagaccat aacacgaaat tccattttct 2760 tagccctcta aagaagttca cggcgccaaa aatgtgtgaa tacctggaaa gctccatatt 2820 ctacaccttt ggagtgccgg aagttgttct aacggataat ggttcgcagt tcaaatcaag 2880 ttgtttcgaa gcttttctta caagttttgg cattacacat cggtgtacgg ctatatattc 2940 tcctcaggcc aatgctagtg aacgattaaa cagatcggtt ctggcagcca tccgcgctta 3000 tattggaagt gaccattcca attgggacgc taatttgaac gctatcaatg gtgcacttag 3060 agcagcggtg catcgaagta ctggttattc accatatttc cttgtttttg gtcagaacat 3120 gattctgaac ggaggagatt ataccctatt gcgaaacgtg gatctgctaa gcaacgacgt 3180 ccgattagaa cactccgatc tgttacaact ggcaagacaa gacgctaaga aaaacataaa 3240 gaaatcacat gagcttaacg cgaagaaata taatctacgg agtaggaacg tacaacttaa 3300 aacgggtgac gaggtttttg ctcgtaactt tacccagagc aacgcgagta agaagtttag 3360 ttcgaagtta gctccggttt tcatttctgc aaaggttctc aaaaaaagaa gcccatatta 3420 ttatgagttg gtgaacccag cagggaaaaa gattggagtt ttccatctca aagatattca 3480 ggtgaagaat aaataacagc atttttttca gataatgtac cttcgagcag gtcttgtttc 3540 attatctgtg gttgtagtgt ccaactattc gaaaatcaaa agaaaaaaaa aagaaaggaa 3600 aaggtggaac aaggtacact ttaggtacac tttctagaaa gatacttttc ataccgccct 3660 cgctagcata gctcgaagaa attggtggaa tgcaggagat ccagaagatc atccgattgc 3720 acagttatgt taaagcttaa agctcagctt aggcatgcaa taggtacagg cgaatagact 3780 ttattcatgt gaaatcattc ccctgcatag tttcacttac ggcacacgta tatatagatt 3840 ttcgtgattt ttgcaatcgc agagtgaatg gtgaagacac gtggagttac ataacttcca 3900 atttttcact attatctaat catccgctgt ttatatggat taataaatcc gaagagtggc 3960 gttagctgaa ctacccgagg gaaaagtgaa aataaagtga aattcttgaa agaagcaaag 4020 gtacgttagt ccaccgtgga aagataaggt tctgacacag ttttatttag ggataggagt 4080 actggaaaat cttaatacag ccaagccggc caaccaatct acggtatcaa ctactaaaac 4140 aaccaagctg accgtttggg agttagtgca gtgcacatcc acggcggcaa agtgaaggct 4200 gaaccgttga cgttgggcta gctcaatatc ggaaatcatt tacggctcgt gaaagaccaa 4260 ccaccataca ccactgctgg acgtccgaat ttcagtttga acctgccgat ctcataaaaa 4320 aaaagccgga ggacataatc gtcaccaacc tacgaaaact tacaaaggat tgagtaatta 4380 ggctagccta aaccacattg cctcagaacc actcttgctg tcagcagatt tgtgtttatt 4440 ttgatttccg tgtttctcaa attaaatttc aagtgcaagt tcccgtttgc ctttatcact 4500 atcacataaa aatctatacg ctgctgctct aagaaccagc tgattagatc cacatattca 4560 tatctcgttt atcattgtgt tgattaatgt agtaactaaa tagcttcctc caatgagtaa 4620 atttgtaacg taactgacga ttttgaaaac tgagcaatat gagttcggga taatgtgttt 4680 ctctttatat atatatatac atttatactg accaacctct aaattacaca gtgcatacaa 4740 atgaatccgt caaagtcatc ttgggcccaa cgatacaccg atgagtttaa ccaattactg 4800 tgagtaccac tcgtggctga aaccacaatg tctatacacc aatgcaggcg taagtttgtt 4860 gaagtagata tatatatact gatttgaaag aatattaccc ccctcaaatg ctcaaataat 4920 taagtttttt tttttttgct gatccaagaa gaaggaagtc tgtatatttt tgtttatttg 4980 taaatgtaag gaatcggatt atatctgtaa tttatgtaaa aaaaaaaaaa aaatatatat 5040 atcaatacaa tctcacctgg ttagggacat ttaacataaa aataaaagga atttgtaatt 5100 atgaatatgt ctgcctaaac taaatattat ctccgtccgt caagtactta tgcttatctc 5160 cctcaaaatg cactagaaga tgttggatct ttgaattggg acaacgcccg aatgtaaacg 5220 cgtttgtcat ggttgggtgg gtaaagacta gggacatata tcaccgctgt tccctccatc 5280 agacagaaga cgaactacac tttttctaag cgttttctaa atgcgtatcg gcgcagagtg 5340 gaaatgagtt tttttttata ggttattgat attaacgtcg agatcctgct tatttttttt 5400 ttgaagcgca ttatattgat ttgtgtcagg aaaactgtgt agctagggga gaaagaaccc 5460 cgaccaatcc gacgagctgg acccgagcac agcaagaatg cgcgcattcg aacctagctt 5520 cctattaaca tgatgtcaga aaaaaaccat ctattggcca ctgtcggagt tagagcctaa 5580 ataggctctg gccctaactc actggcaaag actgaacact acacaaacat ggcgcccaac 5640 gtggggcccg aaaagttttc tttaagtccc atttaagctc tacaaatcca tccatttatt 5700 tcacatttgc tgcacaacaa gttgttttgt attcggctgg aatcgatcgg tggacttagc 5760 gattgcaatt cggataccgg attgatcgac cctgatagat ctatgctgag caaaataaac 5820 aatcttgaca ggaaacaatc tcaaagtgtt ttggatttac gatgctctgg agaatttcgg 5880 acattgtcca gcaatttgag atggttgtag attgttgtta ttcaacaata gttacttagt 5940 taaaatagag gaatgcgctg gaggagagtt ttcgtttttt tgtctgtttt cttgtatttt 6000 gagatattga tgagctaagg aaagacggac attaggggat aagcaatttg tttcgagaaa 6060 gttttttgtg ttagtgaatt aacaataaat ggctaaaatg cctgttactc atagtaatga 6120 aaatctcgat ggcattcgcg aacaaagcga tccgttttgc tatatttgta atgaagtaat 6180 cttggtctct gagttagttg caagtactcc gtgtaatcat agatttcata aatgttgtat 6240 aatagaagca atcacgacca caccggcttg tccgatatgt gcggcatgct gccttatttc 6300 gcaactacgt tatggtaaca atacccttgc agcggaactt ggtgacgcag ccgaagaggc 6360 agagacagga atcaaagata tttcagctac ggactcagga gcccatactc tgagaagaat 6420 gggaggcctt tataagagcc gaggcagagg taggcccatc agttcgaaag gaggcgtgac 6480 cactagatca aaatcgcgac ttacagatcc tagtcggtca gagcaggaag aaaatttatc 6540 gggaaaatcg ccagatgaca tagtaaggta catacagaat gccttaagtg ttcaacaaga 6600 aaatatgata gatgacatgg tggatatttt gaataaaaac atcgaagaga caattcagag 6660 acaactggct acactcaatc taaacatcgc cacgcaagca gtggaaaatg tggagactgg 6720 gaataggcag tccggcgtac gtgcttcgtt agcaggaaac atctcaagtc acagtagtag 6780 tagaagtagg tcaaatgcac agttagcacc tgcaaaggca actaacataa ttcaaaattg 6840 gcggctaaag tt 6852 // ID Gypsy1-NVi_LTR repbase; DNA; INV; 259 BP. XX AC DS265623; XX DT 03-NOV-2007 (Rel. 12.11, Created) DT 04-DEC-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-259 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1169-1169 (2007). XX DR Genome; DS265623; Positions 469633 469891. XX SQ Sequence 259 BP; 65 A; 62 C; 78 G; 54 T; 0 other; tgtggggagc tcccttggca gccctgcgta agccaggagt ggggatgtcg gccatgttgg 60 ctagagagag ggagagagaa cgaacgagcg catgtcgcgg cctgcgagcc gcataacgtt 120 cgagacagtt gatctaacgg gatattctaa taaaagaaag tgttattcca gaggaagctg 180 gtggccttta ttcatcctgg accccatcct cgagagagta ggactctcgg catcgtcatc 240 ccagacgcat aatcctaca 259 // ID BEL-60_CQ-LTR repbase; DNA; INV; 852 BP. XX AC AAWU01016932; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-60_CQ_; KW BEL-60_CQ-I; BEL-60_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-852 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 274-274 (2011). XX DR GenBank; AAWU01016932; Positions 39149 40000. XX SQ Sequence 852 BP; 256 A; 194 C; 171 G; 231 T; 0 other; tgttacgtca tctggcgacg tagcctccgt tacaaaccgc cgaactcttc gcccgaggtt 60 ccagaaaaat acgacgcagc tctaaagtgt tcggccacac ccagtaccac cggagggatc 120 gttatcaaac gcatcggcaa tagcgaaaga agagaaatag taggaataag aagtgatctt 180 gaacttgaaa tgagtaggcc tttgtaatag cttctattgt atctccaaag tactacaccg 240 aggacgcgct gttaaaagct ggactaagat cgtagaatat tgtgcatgcc agtgacctaa 300 atttacgatc gcaggtgaaa aaagcatact atatgttgta cacgctgcgg tacttatatt 360 tttgaccccc attctagccc tactgcgcgt tgtaccccag aacaaattgt gcgccacgct 420 aatatgatcc ccgtagatcg agcgcaataa ttaggtaacg attaggctag atttgttgat 480 ttttgcgccc tcacacttat actatcgtcc tcgtagaaat tgaattcccc tcgtgcgagg 540 catcccagcc gatgaacggc ttagaaaagt ataaatatta aggtaatttc tctaaattca 600 atccctaaac aagagctaca tgcaaatcta ttgttctaac agtaagccac aacctcgctg 660 ttggacctac cgtcagtgca tgcattccgg ccgttgaacg gttataaagg gcataaacgt 720 aagttcgttc aaacttttcc ctaatgcttg attgatcaac tgaatttgta tttgtaggga 780 aaatttaatc acctcagaat aaacggattc aaatttgtgg aaaactctgt tacttgacca 840 cttctcgtaa ca 852 // ID Gypsy-150_AA-LTR repbase; DNA; INV; 181 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-150_AA_; KW Gypsy-150_AA-I; Gypsy-150_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-181 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1026-1026 (2011). XX DR [2] (Consensus) XX SQ Sequence 181 BP; 50 A; 49 C; 32 G; 50 T; 0 other; tgtagcatcc accctagcta caccactggc tgtctggtac agcagtggag atcgtatcaa 60 taaaagaatc agtttatctt ctgactgcac acccgaacag acgcgcgttt cattactgcg 120 ttttaccgta tcacagtctc ggtcataagt aattaattac cgtttagaac ccaatctccc 180 a 181 // ID Gypsy-26_DPu-I repbase; DNA; INV; 6093 BP. XX AC scaffold_5; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_DP_; KW Gypsy-26_DPu-LTR; Gypsy-26_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6093 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_5; Positions 2356056 2362148. XX CC Positions [4662-5141] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 649..1884 FT /product="Gypsy-26_DPu-I_1p" FT /translation="MPPKRGESTVPFQIRTRSNTQTAVTPVTRARVVTPTT FT LVPSPSVRGGSSRLTFNLPVPATAVARVADMALPQNLIDALTNLATAMAAD FT RTAAQNQSTALLNALDQQRIQSVALVQQLADDAAAAPAAATPRVAAAAVES FT IPCFEGKLRDFPQDFVDFVDRVAVAEDWTDAQRIQVASRRLLKTALDWHIH FT IGHTHATWAAWSGAFITNFSPRLHVGEWLRLVEERRQKIDESGIEYALDKH FT KLLRVAPIPLNDEKMVAFLIDGLASWQHVAAMTANRPANVAEFIQRIRALE FT TLGVASRVFPPPVAPPVAPPAAPTTTPPVAPPSAPDLNATLATFGNQLVNQ FT LTAQLNKMTIGSRGTGGGGGGGDRGGRSGGDRGGGGWIDPSKRKCYSCDAI FT GHMARHCPTKSGKGPTGS" FT CDS 1809..5756 FT /product="Gypsy-26_DPu-I_2p" FT /translation="MLQLRRDWTHGPSLPNKVGKRTNRKLGARPMLTGYAH FT LPCRPQLPLVKVFIESIGEVTGLVDTGASMSAIRLSVVKNVLDPRREKSFL FT NLTGVDDKKVIVDSFCSLKVKWENKVVELNEVAVVKNCPFALILGVDWIVK FT SKLNLIVEDGKIVLKSQDSNQPKVKKVRFAGIEERNICSEEDDENDFFVSD FT ELIDSLEAENKTKRCPRVIGTEVKVVESAVIPAESLCFVKAKVSNKFSGNV FT IVRPNMCAHPGMEWVIPSCVMKVSAGKLKIPVLNMKMSSLVLRRKDFIAYV FT DTDFDSNMVVVGQEEQPENPVCSFVENAGESDEKLKTLMDARVGENLSEEE FT RSAVFELLSKYLRCFPSADGELGFTNMSEHFIDTGDAQPISCVPYRVSAME FT RKIIMEKVADMLKQGIIRPSFSPWAAPVVLVKKKSGDFRFCIDFRRLNAVT FT KRDVYPLPRLDDVFDRLAGAKYFSSLDLMSGYWQVPVASADTCKTAFVTPD FT GLYEFVRLPFGLNNAPSTFQRLMDRVLARLKWQMCLVYLDDVLVFGRTFDE FT HQKRLECVLMALVEAGLTLNVSKCIFATNRIFHLGHTIDEYGIRPDSEKIS FT ALVNFKINNVKTLRAFLGLASFYRRFVPDFATIAHPLHRLLKKNSVWSWTE FT AQESAKAALIGRLVSSPVLAHFDQNIDVVVQTDASLVGLGAVLMQDAGDGP FT RPVAFISRKLTDAESKYHANELECLAIVWALKKLRSYVYGRRFSVCTDSSA FT VRWLWSKKEVTGKFARWILALQEYDFEIRHIKGVNNLVADALSRNPDASCI FT GTSGSAIGHVVCVLDSRWPVGMNNTELAFQQQLDSQLRPIITCLNSKVPGK FT IAEQFKIHGKILYRKNPTQGRKFLLCVPSILRRKIIEFSHDDPSSSHMGID FT KTIARVSERYWWPKFRSSVRKYVMSCNYCQFHKCIPGLPAGQLQPIPPPDR FT PFHTVGMDHLGPFKATSEGKKHIIMAIDYLTKYVEAAAVADTSTALVVDFV FT RDQINFRHGGTTRILSDQGTAFSSHLMEEKVNEWKTQHVFATAEHPQTSGL FT VERVNRTMTLALAAYVNTDHDDWDRHLPAAIFAINTARQSTTEISPFQLVY FT GRLPFTALENEFPWPEERPESFDVFLSRVKELRDAARLKIVKKQEKVKRLV FT DLRRRVVKDLFPGELVLVRRKLKKKGKTKKLLPKYVGPFQVVKKVCPTTYL FT VEDLPAQRKKKRFRRFNAHVVQIRKFHPRDDAEWDDWPDEPEESIEIQPSS FT GQPAEKEASVTSQESIQPASSTIDPPNEVVAPPPPTTTRAGRKVVRPGWMK FT NFVA" XX SQ Sequence 6093 BP; 1636 A; 1267 C; 1450 G; 1740 T; 0 other; acatttggtg tcagaagtcg ggcacgggcc accgaacagt tctctttttt tttgttggta 60 aaattttgtt gctttttttt tttttttgag aactgcccgt gcaatatcca agtcattttc 120 tttttttttt tggtaacgcc ccctcgtctc tttcattgat ttagttcatt atttatgggt 180 ttcgtgtctt gtgtgtgaca cacgagaatt tttctagtaa tcgattatta gttgaatcag 240 tttttttttg tcacgtcatc agcccccccc ttttttcaat ttttttttct cgtgttgaaa 300 acgccccacc caccctccat aacatttcgc gcgtaaaacc acgtggcccg tcgaggcatt 360 gaaacaattc tttttctttt tttgtggctt atggaatctg cgagatagtc gccattttct 420 tttttctttt ttttttgttg ttgacttcgt gtgaatgggc tttggcttat ggtagtagat 480 aattagtatt actttatttt tttttcctcg tgtgttcgac cacgtcgttg tgtgtcgtta 540 atatcaccta aatttgattg aagttttggg attaatattt ttggttttgt gttccgtaaa 600 tcggggtggt catatcgatg gttttttttt tgtttgtgtg tgaattatat gccgcctaaa 660 cgtggagaat cgacagtgcc atttcagatt agaacgcgtt caaacaccca aaccgcagtc 720 acaccagtga cacgtgcaag agtagtaacg ccaaccacac tcgtaccgtc accatccgtg 780 cgtggtggct caagtaggct aacatttaat ttaccagttc cggcaactgc agttgcaagg 840 gtagcagaca tggcgctacc acaaaatcta atagatgcgt tgacgaatct tgcgacagca 900 atggccgcag atcgaacggc tgcccaaaat caatcaacgg ctcttctgaa tgcattagac 960 cagcagcgta tacaatcagt ggcattggta caacaactcg cagacgacgc cgcggctgcc 1020 ccagcagcag caacgcctcg tgtagcagct gcagccgtcg aatctatacc gtgttttgaa 1080 ggtaaattaa gggatttccc gcaagatttt gtagattttg tagacagagt tgccgtggcc 1140 gaagattgga cagacgcaca gcgtatccag gtagcatcaa ggcgactgtt gaagacagca 1200 ttggattggc atattcacat cggccatacg catgcaacct gggctgcctg gtcgggagcg 1260 ttcatcacga atttttcacc acgtttacac gtaggtgaat ggctcaggtt ggtagaggaa 1320 agacgacaga aaatcgatga atcaggtatt gagtacgctt tagataagca taaattgtta 1380 cgtgttgccc ctatccccct taacgatgaa aagatggtag cttttcttat agatggcctg 1440 gcaagttggc aacatgtggc cgccatgaca gcgaaccgtc cagccaacgt cgcggaattc 1500 atccagcgga ttcgtgcttt ggagaccctt ggtgtagcat cacgtgtctt ccccccgcca 1560 gtagcgccac cagtagctcc accagcagct ccgacaacaa ccccaccagt ggcgccacct 1620 agcgctcctg atttgaacgc cactctagca acatttggca atcagctggt taaccaactg 1680 actgcacagc tcaacaagat gacgattggg agtcgtggca ccggtggcgg aggaggcgga 1740 ggtgatcgtg gtggaagaag cggaggtgat cgcggaggag gaggatggat tgacccaagt 1800 aagcggaaat gttacagctg cgacgcgatt ggacacatgg cccgtcactg cccaacaaag 1860 tcgggaaaag gaccaaccgg aagttaggag caaggcctat gctcaccggt tatgcacacc 1920 ttccttgtcg tccccagctt ccactggtta aggtattcat agaaagtatt ggtgaagtaa 1980 ctggcctggt agatacaggc gctagtatgt ctgctataag acttagtgta gtaaaaaatg 2040 ttttggatcc taggcgtgaa aaatcttttt tgaatctaac tggggtggat gataaaaaag 2100 ttatagtaga ttctttttgt tctttaaaag taaaatggga aaataaagtg gttgagttaa 2160 acgaagtagc cgtagttaaa aattgtccat ttgcattgat tcttggtgta gattggattg 2220 tgaaaagtaa attgaatttg attgtagaag atggtaaaat tgttttaaaa tctcaggatt 2280 caaaccaacc aaaagttaag aaagttcgtt ttgctggaat cgaagagcga aatatttgta 2340 gtgaagagga tgatgagaac gatttttttg tgtctgacga gctgattgat tctttagagg 2400 cagaaaataa aacaaagagg tgccctcgtg tgataggcac agaagtaaaa gttgtggaat 2460 cagctgttat accagcagag tctctttgtt ttgtaaaagc taaagtttct aacaagttca 2520 gtggtaatgt tatcgtcagg ccaaatatgt gtgctcatcc tggtatggaa tgggttattc 2580 catcctgtgt tatgaaagtg tcagcaggaa aacttaaaat ccccgtatta aatatgaaaa 2640 tgtcatctct tgtgttacgt cgtaaagatt ttatagcgta tgtagataca gatttcgata 2700 gcaacatggt cgtcgtcgga caagaagagc agccagaaaa tcccgtctgc tccttcgtcg 2760 aaaatgctgg tgaatccgat gagaagctga agaccctaat ggacgcccgc gtaggcgaaa 2820 atttgtctga agaagagagg agtgccgttt tcgagctctt gagcaaatat ctgcggtgtt 2880 tcccctcagc agatggcgaa cttggattta caaacatgtc ggagcatttc atcgacactg 2940 gcgacgctca accgatcagc tgcgtcccgt atcgcgtgtc agctatggaa cgaaagatca 3000 taatggaaaa agtcgccgat atgctgaaac aaggtatcat tcgtccgtca tttagtccgt 3060 gggcagcacc ggtagtgctg gttaaaaaga aatcgggtga ctttaggttc tgcatcgact 3120 ttagacgttt aaacgcggtg acaaaaagag acgtgtatcc tttaccccga ttggatgacg 3180 tttttgatcg tcttgctggt gcgaaatatt tttcgagttt agacttaatg agtggctatt 3240 ggcaggtacc cgttgcctcc gctgatacgt gtaaaacagc gtttgtcact ccagatggat 3300 tgtatgagtt tgttcgtttg ccgtttggac tgaataatgc accgtccact tttcaacgtt 3360 tgatggatcg agtgttagct cgccttaaat ggcaaatgtg tctcgtgtat ttagacgatg 3420 tgttagtttt tggaagaacg ttcgacgagc atcagaaaag acttgaatgt gttctaatgg 3480 ctttggtgga agccggatta actttaaacg tgtctaaatg tatttttgcg accaacagaa 3540 tttttcattt aggtcacacc atcgatgagt acggaatccg gccagactct gaaaaaatta 3600 gtgccctagt taactttaaa attaataacg ttaaaacgtt gagagccttt ttaggcctcg 3660 catcttttta tcgtcgtttc gttccagatt tcgctacaat agcacatccg cttcacaggc 3720 tcctgaaaaa gaattccgta tggagctgga cggaagccca agagtcggca aaagctgcgt 3780 tgattggccg cttggtgtcg tcccctgtac tggcgcactt tgaccaaaac atcgatgtag 3840 ttgtacaaac agacgccagc ctggtgggcc ttggggccgt tttaatgcaa gatgccggag 3900 atggaccacg tccagtcgcg ttcattagcc gaaaacttac cgacgcggaa agcaagtatc 3960 atgctaatga actagagtgt ttggcaattg tatgggcatt gaaaaaatta cgttcgtatg 4020 tgtatggtag acgattttct gtctgtacag atagctcagc ggttcgatgg ctatggtcta 4080 agaaagaggt tactggcaag ttcgccagat ggattttggc tttgcaagag tacgatttcg 4140 aaattcgcca cataaaagga gttaataatt tggtggctga tgccctatcg cgaaaccctg 4200 atgcatcctg tattggaacc agtggctccg cgatcggaca tgtagtttgt gtacttgaca 4260 gtagatggcc ggtgggcatg aataatacag aattggcatt ccaacagcag ctggatagcc 4320 aattgcgtcc cattatcacc tgtcttaatt caaaagtacc gggtaaaatt gcagaacagt 4380 ttaaaattca cgggaaaatt ttgtatagga aaaatcccac ccaagggcgt aaatttttgc 4440 tttgtgtccc gtcaatttta agaagaaaga taatagagtt ttctcatgat gacccctcct 4500 ctagccatat gggaatagac aaaacaattg caagagtgtc tgaacgttat tggtggccga 4560 agtttcggtc aagtgtccgt aaatatgtta tgtcttgtaa ttattgccaa tttcataaat 4620 gtatccccgg attacccgct ggtcaactcc agcccatacc accgccagat cgaccatttc 4680 acaccgtcgg catggatcat ctcgggccat ttaaggcaac gtcggaaggc aagaaacaca 4740 ttattatggc tatcgactac ctaacgaagt atgtagaagc agccgcagtg gccgacacgt 4800 caacagcatt ggttgtggat tttgtgagag accagatcaa ctttcgccat ggtggaacaa 4860 cgcgaatatt aagtgatcaa ggaactgcct tctcctccca cctgatggaa gaaaaagtca 4920 acgaatggaa gacccagcac gtttttgcaa ccgcagaaca tccacaaaca tctggactcg 4980 ttgagcgagt caaccgaacg atgacccttg cgctagctgc ctatgtcaat actgaccatg 5040 atgactggga tcgccatttg ccagcagcaa ttttcgccat caatacagca aggcaaagta 5100 cgaccgagat atcgccgttc cagttggtgt atggccgctt gcccttcacc gccctagaga 5160 acgagtttcc gtggccagaa gaacgaccag aatcattcga cgtctttctg tctcgagtca 5220 aggagctgag agacgcggcc cgattgaaaa tagtgaaaaa acaagaaaaa gtgaaaagac 5280 tagtggattt gaggcgtcgt gttgtcaaag acctctttcc tggtgaatta gtgctcgtcc 5340 gacgaaaact aaaaaagaag ggtaaaacga agaaactcct tccgaaatac gttggtccgt 5400 tccaggtcgt taagaaagtg tgcccaacaa cttatctggt ggaagatctg ccggcccagc 5460 gaaagaaaaa gagatttcgc cgtttcaatg cgcatgttgt gcaaatccga aaatttcacc 5520 cgagagatga cgctgaatgg gatgattggc cggatgaacc agaagaatcg attgaaatcc 5580 agccgtcatc aggccaaccg gctgaaaaag aagcgtcagt tacatcccaa gaatctattc 5640 aacccgctag ttcaacaatc gatccgccga atgaagtcgt ggcgccacct cctccaacga 5700 caacgagagc cggaagaaaa gtagttcgtc cgggctggat gaagaatttc gttgcatgat 5760 ttgtgtatac ctccctcctc cccttgtcaa cgactatttt tcgctagttt atgttgagta 5820 cgttttgtcc agtttacttt ctttatattt tatcccgtcc catcttaagt tttttttttg 5880 ggttggatag aatcagggaa aggaaaggat tagtttaagt cccctcttat ttatttattt 5940 tttttttttt gtttgttttg tttgttgtct caagtgtagt ttcgatttga caaagttgca 6000 aagtttgcgc tcgtcatact aagttttatt gttttgtgga tgttgtttgt ttcccttgta 6060 tatagtcaaa tcgagtcagg aagggccgaa tgt 6093 // ID Gypsy-17_DPu-I repbase; DNA; INV; 8720 BP. XX AC scaffold_1168; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_DPu_; KW Gypsy-17_DPu-LTR; Gypsy-17_DPu-I. XX NM Gypsy-17_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-8720 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 749-749 (2010). XX DR Genome; scaffold_1168; Positions 9631 912. XX CC Positions [4066-4566] - Integrase core CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1388..2557 FT /product="Gypsy-17_DPu-I_2p" FT /translation="MGNLKRQNRPFIKVKANKIEGTWLYDTGASVSCMSLE FT QFRRIPPEQRPVKQDAQVRLLSAAKTEIRVVGMYILTLNILGKTFSHPVHV FT CSPMNQGGIVGMDIIKKLGLTYLPIRKTFVFDTHIAVEPDKQHDAPTVFKK FT SAGVVASLATDRQIKIPPHSQKVISINCSPTHSGLSAKGATAVANIFSSQF FT PLLWGGPALIQTNFKGKTSMPVINCGPTEMTLPRGTPIGLMETIHADGAIK FT VDEGEVAEKLAAREVQMPAPPSAERRQQILQELTLTVPPTEKQAYIDLIMQ FT NHDIFSKDKNDLGRANNFTHKIDLKNKAPVYIPQYRLADTHKAALDKQIDE FT WLKMGVIQPSNSRYNSPIFVVPKKMGKIGTCWIIGRLTLARTTIGIQ" FT CDS 2557..6210 FT /product="Gypsy-17_DPu-I_1p" FT /translation="MRTVDECIAEIGKSGSTIFSTMDLSSGYHQMLLERNS FT RAATAFTVPGKGQFEWLTTSMGLRGAVSSFQRMVELTMKGIHNLIVYIDDL FT LAHSQNHQQHRQLLQLVFDRLRKTGLKVNLKKCHFGSPNVAYLGFQLTPQG FT VLPGLDKLAAVRQAKPPTDVHQVRQFLGLVNFFRAHVRNFAMVASPLTQLT FT RKDTPWRGGQLPPEALTAFKELKQILCSQPLVAYPRPDRPFALIVDAAAGV FT TKINSKGQRTFRQEGGLGAILCQPDHKGELHVIAYASRALAQHEKNYTPFL FT LEMLACCWGIDHFDVYLKGRKFVIYSDHRPLEKLSCVHEKTLNRLTHKMNE FT YDFIIQYKKGAEMPADFLSRNVLEEIQIFTPDLPLLQQRDEFAAAVVKFLQ FT VKQLPANKHQAAYIARIAPSCFLEDGILWRRITRHGAPARTVLVVPAAHVD FT QLIHETHGALMAGHEGITKTKERLLQSYFWPNMDKDIARHVQACQRCQARR FT RDVRPTPNVLTPLAQCTELNQRVHMDLFGPLKVSNQGKKFVLCLTDAFTKY FT AVMVAIDNKEASTVANAVFEQWICKFGTPLEFVSDNGKEFCNNLAKELYAL FT LRIKHSTTTPYWPQCNSQAEVANKTIQKYLASFVDATTLDWPIYMAPMAFA FT YNTSVHRSIKTTPFFLTHGVDARYPSFPTPDVQRYYGESKAAEWYNTLQHC FT RQIAAQNNMDATGQAEQQYNKTAQVHNFHQGQMVWLNETNYLGRNRKLSPT FT WTGPHLVLQVFQNGVVELLIKNRRVKVNVGRIKPATPSLPQAQEQQQQPEG FT QQAQQQSQQQQQPRDGVNDPQPFITDDNPLRPAAILPQVIHPQQDQQQQQQ FT PPPAPEQVAPQPPKRGRGRPRKVTVDPPAQQQQDQQPQPAAPAPQADAQEQ FT PTGPITRARARALERQALSGTDAIKLIRQVNNEAKTTQKFKPIPHYNAVEG FT PDFVADEYGLPKQFKGQKQPAAIIRRRNFLKSLSPTQRNLLLTGDPVFAFD FT PIAYEVFLTCRNVPPIIQQQFDYLQPAASGATSTVTSATASSTSSTPGTPT FT SPQPPQPAVTPKAPHKTYPVDDDKAPVKGVTWAADAGASPTLALDVASPGA FT RDRWMRSSTSPTPAYLVTSPTTSWEKFKSATKAVAEDLLLVPPPGWKPPKP FT PGYVKPPPSMLQKAARAIKDEIMNPPIPFVGQPLPPRGAAKLQQRDAPAAT FT GARPKTHR" FT CDS 5820..8615 FT /product="Gypsy-17_DPu-I_4p" FT /translation="MGRRRRSFANAGARRGKSRGQRSLDALIHQPHAGLLG FT HVANNQLGEIQVSHQSRRRRPTVSATPRVETAQAAGIRQTAAVNAPKSSQG FT DQRRNHEPANPVRGTAIAAERSGQTTTTRCSCRHRSTAQDAQVTQLSKVLY FT RQTALISLLSLLFTFSCILPFASAQITHADNDHFIVFDQVGHMASSLGYIH FT VAIPLNISTYQHQITLFHDFLADFSSKTTSNPTQVSFTKAIRDLATFARKR FT VDKLVEQLRFIDVVLPEDDLTNLHHPRQKRFLWGVLGVMLPTIMREKAKAE FT RELDLFDENCNLKDPSVSFENVTKILTHLSRDKRSLVRDYSNPRRQFQMLC FT FLNKTMQTTTPLPYPDYPEFHEVFVNETFPTTTTTTPPPPSHKVFHSIPTD FT PAPSLPYPYRHKRDTDNIVNANNDDAIDNMDADIDNSNNRDKRQILAGIAA FT AGGVLGTIFGLFNQLEMHSIQNHVSNLEASTNMLIHVQHKNQQQFKIITAE FT MAHLTTIIETLIQYNPALVYAKLMSQVDDIADHLNNLLDTVQQLQHQKLSI FT RLLDLQQLNTLHTSLKRSAKQNNWQLLINTPQDIFQLDVSYIRKKSDVIIM FT VHVPCLTDNNLLTIYRYANLPLPVNTLQLSPKVNESLNTLLPVHTINDLLT FT QFNSPASIAPIQEALYLLPEADLIAIGRNNGNSHRYKLLKHADLTACIQKN FT HIFLCEGHQVLNTDLEGSCLGSLYLQSERGVRENCKLERKPLRETVFQVSN FT TDHLVISPYPHTAQIFCQNGTHYPIRIRTMARLHLSPGCTLQLFNHTLRSD FT QSIRVKAEPLVFPWSFNPLMLPSELMHRAQHTDDQVNLLKQSIQTLQDVAV FT QDDEIPQTITNSLSSVSGFSVLFWLALGTAVLALGLVTCWYCGSRIRERRG FT RPPGIAAKDLPMTISRIADLNLPDADGR" XX SQ Sequence 8720 BP; 2654 A; 2428 C; 1951 G; 1687 T; 0 other; ctaatttacg atagtggtgg cagcgatttt gaagtcaaaa gaggatcatc caaaaacttc 60 gcagctaaat cctactttag tacttcaaaa ctacctatcg cctccgaaaa atggccaaca 120 ttaacgcagc actcatgagg gccgacctcg ccaagctccc cctctggtcg ggggacgcct 180 ccaaggacgg atacacaatt gaacaatggg tcacaagagt caacaaagca gcaacgacgg 240 ctgcgtggaa cgacgcggac acgatggctt acgtgtacaa tgccctacga ggaccagcat 300 tgcgatggct ggaatcactc aaacgcttca atatcaacat cgactcttgg gccgcagtga 360 gaacagaaat gctagacgcc tacagcagag tacagacagc caggacggcc atagtaaatt 420 tgtcggatct caagcaaggg caaaacgagt cagttaccga ttttggagca agagtagcca 480 gaacagtgga cgatctggag catctcatgc cagcagccag ccgggcgcct caaggagtag 540 tctgggccga tgttttcacg ggactggccg gctgggcagg agttacggcc gaacaaaaag 600 caacacagct tcagatagca gcagacaagg taatctgggc tacttataac catttgggag 660 ttcaactatt catctccaac ctaaaacccg cgctcaggga cgagttaatg aaagcaccgc 720 ctacggacct caatgcagca atcaaggcag ctcggcagct agagaaaatt cacgccaaac 780 cagaaaacgg gcacgctaca gtatcagaaa tcgggcagag caacccacag gcagcagccg 840 acgacgtcga tcagcaaatc gaagctctgt cagcccaatt ccaagcgctt ctcaaacgca 900 ggcaggcaaa caacggccgc ggcggccgag gaggccgagg ccggggcagc caaggacgag 960 gaggacgcgg cagaggccgc gggcaaacgc aaggaaacaa cggcgcgagc agcagcagtt 1020 acaacacctg ccgctactgt aagaagccag gacatctgca aaaagtttgc aactctcgga 1080 tccgcgccgg agcgccggaa gtggacgcac aaggcaaacc ttactcgcac ggcaacgaga 1140 tggaccagga ggatgaggac taccaagggg acatgtccaa tggcaaccca tggggtcagc 1200 aacaacaaca acaacaagat tggaacgaac tgcaggaagt ctacaatgaa acgccggatt 1260 ttctctaaac gacggcactg cgcccagaag tgaccgtcgc tcaggtgatt caatttttgt 1320 agaaacaatt actgtagaaa acctttctag aaaaaataaa aatgttccta caattgcaat 1380 cgatgttatg ggcaatctca aacgacaaaa tagacccttt ataaaagtta aggccaacaa 1440 aattgagggt acttggttat acgacacagg agcctcagtg tcctgcatgt cattagaaca 1500 attccggcga atccccccgg agcaacgacc agttaagcaa gatgcgcagg tcaggctgtt 1560 gtccgcagcc aaaaccgaaa tcagagtggt gggcatgtac atcttaacac taaacatttt 1620 aggcaaaaca ttttcgcacc cagtacacgt gtgtagccct atgaatcaag ggggcattgt 1680 tggtatggat atcatcaaaa aactagggct cacatattta ccaatcagaa aaacatttgt 1740 tttcgacacc cacatagcag tggagccaga caaacagcac gatgcaccca cagttttcaa 1800 aaaatcggca ggggttgtag catccctagc aacagacagg caaataaaaa taccccccca 1860 cagtcaaaaa gtaattagca ttaactgttc ccccacccac tctgggctct cggcaaaggg 1920 ggcaacagca gtcgcaaata ttttctcttc acagtttccg ttgctttggg gggggccggc 1980 gttgatacaa accaatttta aaggaaaaac gtcaatgcca gttatcaact gtgggcctac 2040 agaaatgaca ctccccaggg ggacaccaat agggttaatg gaaacaattc acgcagacgg 2100 ggctattaag gtagacgagg gggaggtagc cgaaaaatta gcggccaggg aggtacaaat 2160 gccagccccc ccctccgcag agcgacgcca acaaatattg caggagctca cgctcacggt 2220 gcctccaacg gaaaaacaag cgtacattga ccttataatg caaaatcatg atattttcag 2280 caaagataaa aacgatttgg gcagggcaaa caactttact cataaaatag atttaaaaaa 2340 taaggcacca gtgtatatcc cgcaatacag gttggcagac actcacaaag cagccctgga 2400 taaacaaatt gatgaatggt taaaaatggg ggtcattcaa ccctccaaca gcaggtacaa 2460 tagcccaatt tttgtcgttc ccaaaaaaat ggggaaaata ggtacgtgtt ggattatagg 2520 gcgcttaacg ctagctcgca cgacgatcgg tatacaatga gaacagtaga tgaatgcatt 2580 gctgaaatag gcaaatccgg aagcaccata ttttcaacca tggatctttc aagtgggtac 2640 caccaaatgt tgttagaacg aaacagcaga gctgcaacag ctttcacagt gccaggaaaa 2700 gggcagtttg agtggctaac aacctccatg ggcttacggg gggcagtatc tagctttcaa 2760 cgcatggtgg aactcacaat gaagggcatc cataatctca ttgtatacat cgacgattta 2820 ctagcgcaca gccaaaatca ccagcagcat aggcagttac tgcaactagt gtttgacagg 2880 ctcagaaaaa caggcctaaa agtaaattta aagaaatgtc atttcgggtc gcccaatgta 2940 gcgtacttag ggttccagct cacaccgcag ggcgtgttac cagggctgga caaactggca 3000 gcagtcaggc aagccaagcc gccaacggat gtgcaccagg tgcggcagtt tttaggtcta 3060 gtaaattttt ttcgggccca cgtccgaaat ttcgcaatgg tagctagccc cctcacacag 3120 ttaaccagaa aagacacccc atggagaggg gggcagctgc ctcccgaagc cctcacggca 3180 ttcaaagaat taaaacaaat tctatgttcc cagccattag tagcttaccc ccgaccagac 3240 agaccgttcg cgttaatagt agacgcggca gcaggggtaa caaaaatcaa ttcaaaaggg 3300 caaagaacat ttcgacaaga aggggggtta ggggcaattc tctgtcagcc tgatcacaag 3360 ggggagttgc atgtaattgc ctacgccagc cgggcgctag cgcagcacga aaagaactac 3420 acaccctttt tactggaaat gttagcctgc tgttggggca tagaccactt tgacgtgtat 3480 ctcaagggca gaaaatttgt catttactcc gatcacaggc ccctggagaa actgtcatgc 3540 gtacacgaaa aaacgcttaa taggctaaca cacaaaatga acgagtacga tttcattatt 3600 cagtacaaaa aaggtgcaga aatgcctgcc gactttctta gcagaaacgt gctggaggaa 3660 atccaaattt tcacccccga cctccccctc cttcaacaaa gagacgagtt tgctgcggct 3720 gttgttaaat ttttacaggt aaaacaactg ccggccaata agcaccaagc agcctatatc 3780 gcacgcatcg cgccatcatg ttttctcgag gacggcatac tctggcgccg tataacacgt 3840 cacggagcac cggcgcgcac ggtactggtg gtaccggcag cgcatgtcga ccagctgata 3900 cacgagacgc acggagcact catggccggg cacgagggca tcaccaaaac aaaagaaagg 3960 ctactgcaat catacttctg gcccaacatg gacaaggaca tagcgcgcca cgtacaagcg 4020 tgtcagcgct gccaagccag gcgccgggat gtacggccaa cacccaacgt gctaacaccg 4080 ctggcgcagt gcacggagct caaccaaaga gtgcacatgg acctctttgg gccgctgaaa 4140 gtcagcaacc agggcaagaa attcgtgcta tgtctcacag atgctttcac caaatatgca 4200 gtcatggtag caatcgacaa caaagaggca agcacggttg ccaatgcagt cttcgaacaa 4260 tggatatgca aatttggcac acccctagaa ttcgtgtcag acaacggaaa agaattttgc 4320 aacaacttag ccaaagaact atacgcgcta ctgcgaatta agcactccac aaccaccccc 4380 tactggccgc agtgcaacag tcaggcagag gtggccaaca aaacaatcca gaagtacttg 4440 gcatcattcg tcgacgccac aaccctggac tggccaattt atatggctcc catggcgttc 4500 gcctacaaca caagtgtgca caggtcaatt aaaaccacgc cgtttttcct aacacatgga 4560 gtcgacgcca ggtacccatc gttcccaacc ccggacgtgc agaggtatta cggggaatcc 4620 aaagcggcgg aatggtacaa cacactgcag cactgccggc aaatagcagc tcaaaataac 4680 atggacgcca cagggcaagc agaacagcaa tacaataaga cggcgcaggt gcacaacttt 4740 caccaagggc aaatggtgtg gctcaatgaa acaaattatc tgggtcgcaa cagaaagctg 4800 tcgccaacct ggacaggtcc gcacctcgtg ctccaggtgt ttcaaaacgg ggtcgtcgaa 4860 ctgctcatca aaaatagaag ggtaaaggtc aacgtggggc gcatcaagcc ggccacaccc 4920 agtctgccgc aagcgcagga gcaacagcag cagccagagg ggcagcaagc gcaacagcaa 4980 agtcagcagc agcagcaacc tcgcgacggg gtcaatgacc cacagccatt catcaccgac 5040 gacaaccctc tacgccccgc agcaatatta ccacaggtga tacacccgca acaggaccag 5100 cagcagcagc agcagccacc cccggcgcca gagcaggtag cgccacaacc gcccaagcgc 5160 gggaggggga gaccccgcaa ggtgacagta gacccaccag cacagcagca gcaggaccag 5220 caaccccagc cagcagcacc agcaccccaa gcggacgccc aagagcaacc aacggggcca 5280 atcacgcgag caagagcgcg agcgctagaa cgccaagcgt tatcaggaac cgacgctatc 5340 aagctcattc ggcaagtgaa caacgaagcg aaaacaactc aaaaatttaa accaattccg 5400 cactacaacg cagtcgaggg tccagacttt gtagccgatg agtacggatt gcctaaacag 5460 tttaaagggc aaaaacaacc agcagccata atccgacgcc gaaatttcct caaatccctg 5520 tcgccaacgc aacgaaacct gctgctcaca ggggacccgg ttttcgcgtt cgaccccatt 5580 gcatacgaag tatttctcac gtgccgaaac gtacctccaa tcatccagca gcagttcgac 5640 tacctccagc cagcggcatc cggagctacg tccacagtca cgtcagcaac agcatcaagc 5700 acctcatcga ccccgggcac gcccacttcg ccgcagccgc cccaaccagc ggtaacaccc 5760 aaagcaccgc acaaaacgta cccagtggac gacgacaagg cgccagttaa aggagtaaca 5820 tgggccgcag acgccggagc ttcgccaacg ctggcgctcg acgtggcaag tccaggggcc 5880 agagatcgtt ggatgcgctc atccaccagc cccacgccgg cctacttggt cacgtcgcca 5940 acaaccagct gggagaaatt caagtcagcc accaaagccg tcgccgaaga cctactgtta 6000 gtgccacccc cagggtggaa accgcccaag ccgccgggat acgtcaaacc gccgccgtca 6060 atgctccaaa aagcagccag ggcgatcaaa gacgaaatca tgaacccgcc aatcccgttc 6120 gtgggacagc cattgccgcc gagaggagcg gccaaactac aacaacgcga tgctcctgcc 6180 gccacaggag cacggcccaa gacgcacagg taacccagct gagcaaagta ctgtaccgcc 6240 aaaccgcact aatctctcta ctatcacttc ttttcacttt ttcatgcatc ttgccctttg 6300 cctccgccca aatcacgcac gcagacaacg accactttat tgttttcgac caagtcggac 6360 acatggcttc ctctttgggt tacatccacg ttgctattcc cctaaacatt tcaacgtacc 6420 agcaccaaat cacgctcttc catgatttct tagcagattt ttcttccaaa acaaccagca 6480 accccacaca agtttctttc actaaagcaa tcagagatct ggcaacgttc gcgcgcaaac 6540 gagttgacaa actggttgaa caactacgat ttatcgacgt tgtcttaccg gaggacgact 6600 tgaccaacct gcaccacccc cgacaaaagc gttttctttg gggagtgctc ggggtcatgc 6660 tgcccacaat catgcgtgaa aaagctaaag ctgagcgaga actagacctg tttgacgaaa 6720 attgcaacct taaagacccc tccgtttctt tcgaaaatgt aacaaaaata ctaacgcatc 6780 tctcacgcga taagcgcagc ctcgtgcgag attattccaa cccccgacga cagtttcaaa 6840 tgttgtgctt tctaaacaaa acaatgcaaa caacaacacc gctaccgtat ccggactatc 6900 cggaatttca cgaagtattt gtcaacgaaa ccttccccac aaccacaacg accaccccgc 6960 cacccccgtc acataaagtt tttcacagta tccccacaga ccccgccccc agtcttccgt 7020 acccgtaccg acacaagcgc gacacagaca acattgttaa cgctaacaac gacgacgcta 7080 tcgacaatat ggacgccgac atagacaatt caaacaacag ggacaaacga caaatcttag 7140 ccggtatagc agctgccggc ggagtactag gaacaatttt tggcttattc aatcagttgg 7200 aaatgcacag cattcaaaat cacgtctcaa atttggaggc aagcaccaac atgctaattc 7260 atgtgcagca caaaaatcaa cagcaattta aaataataac agcagaaatg gcacacttaa 7320 ctactatcat tgaaacgcta attcaataca atccagctct tgtatacgcc aaattaatgt 7380 cccaagtcga cgatatcgct gaccacctta acaatctgct tgacacagta caacaattgc 7440 aacaccaaaa gctttctatc cgacttttag atcttcaaca gctcaacacc cttcacactt 7500 cacttaaacg ttcagcaaag caaaacaact ggcaactttt aattaatacc ccgcaagaca 7560 tttttcaact ggatgtatct tatataagaa agaaatctga tgtaataatt atggtccatg 7620 ttccgtgcct tacagacaac aatcttttaa ccatctatcg ctatgcaaat cttccgcttc 7680 cggtcaacac cttgcaactg tcacctaaag ttaacgaatc tctcaacaca cttcttccag 7740 tccacactat taacgacctc ctaacacaat ttaactcgcc tgcctctatt gccccaattc 7800 aagaagccct gtaccttctc ccagaagccg acctcattgc aatcggcaga aataatggaa 7860 actctcatag atataaactt cttaagcacg cagatctaac ggcttgtatt caaaagaacc 7920 acatttttct ttgcgaggga caccaagtcc taaacaccga cctcgaaggc tcatgcctag 7980 gctctctcta ccttcagtcc gaacgagggg tgagagagaa ttgtaagttg gaaagaaagc 8040 cgttaaggga aaccgtgttt caggtttcaa acaccgacca cttggtcatc tccccgtacc 8100 cccacacagc acaaattttc tgtcaaaatg gcactcacta cccaatcaga attagaacca 8160 tggccagact tcacttgagt ccgggctgca ctttgcaact gtttaatcac acgctccgat 8220 ccgatcaaag cataagagtt aaagcagaac ctttagtctt tccctggtcc tttaacccct 8280 tgatgttacc ctccgagcta atgcatcgag ctcagcatac ggacgaccaa gtcaatcttt 8340 tgaaacagtc gatccagacg ctgcaagatg tggcagtgca ggatgacgaa atcccgcaaa 8400 ctattactaa ctcactttcc tctgtttccg gtttctcggt actcttctgg ctggcactgg 8460 ggaccgcggt cttggccttg ggactcgtga cctgttggta ttgcggatca aggattcgag 8520 agcgacgagg ccgaccacca ggcatcgctg ccaaggatct gccaatgacc atctcgcgga 8580 tcgcggactt gaacttgccg gatgctgacg ggcgttaaac gcggccctga agacgtcaca 8640 agcttactaa cccctaattt caatttgtgt aaacgtgtta ctaaccctta tcataaaccc 8700 ctaaaaagag ggaatgatat 8720 // ID Vingi-1_Tcas repbase; DNA; INV; 3181 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.02, Created) DT 17-AUG-2010 (Rel. 15.09, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons from Tribolium DE castaneum. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Ingi-1_Tcas; KW Vingi-1_Tcas. XX NM Ingi-1_Tcas. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-3181 RA Kojima K. and Jurka J.; RT "Ingi non-LTR retrotransposons from insects."; RL Repbase Reports 10(2), 150-150 (2010). XX RN [2] RP 1-3181 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX CC Originally classified as Ingi [1] and re-classified as Vingi [2]. CC ~95% identical to consensus. The 3' termini are composed by (TA)n CC microsatellite. XX FH Key Location/Qualifiers FT CDS 50..3109 FT /product="Vingi-1_Tcas_1p" FT /note="includes endonuclease and reverse FT transcriptase domains, and a CCHC zinc-finger FT motif." FT /translation="MGGFSPPRNYRHLDDRSRSTRSSRSSSGFLGLRSSSE FT ILELPQNLTVCQLNIEGISKDKCDFLSKMALRERIDVIVLQETHTSSEMDL FT FSRGQIPGYTLVDFINSASYGCATYVKSNFEDYQSIAKTQENNIFLLTVKI FT AGIFIINVYKPPNDQWPENLVLNYHPAIYIGDFNGHHNLWGYAENYRNGLA FT LVSWMELNKLHLVFDGKTRKSFYSARLKKEYDPDLCFVTINNEVPLQASRD FT VLSGFPHSQHRPVIIKIGNEIKQFPSLQKPRWNFQKAKWNEFAKQVDSDLR FT WIPPTSINYHRFVGAIKNAAKKFIPRGFRKNYVPCWSDETKSFYNAYVSNP FT NPENADAVLNSFNKARKEKWNNLMEDLNFTHSSRSSWKLLQKLGSSNIAQP FT KTDSAISPNAVASRLVNVSNSVKLEKRAKKEMKRRVQKQRKKMKVSEELSR FT CFNVDELKAAISSTKSKKSAGFDGIYPEFIKNLGSFALKWLTFLFNDIFIS FT ARLPSEFKKTKVIAILKKGKPPQEASSYRPISLLSVCYKILEKMIYQRISP FT LIEDILPSEQAGFRPQRSCCDQVLALTTHIESGFEKRLKTGVVLLDLTAAY FT DTVWKDGLIHKLYKVVPCKRIVSLIESMLTNRKFRVFVGEKSSKVKTLNNG FT LPQGAVLSPLLFNVYTSDLPETFSRKFVYADDIALTFQHKEFHRLEEQLSQ FT DASVLCDYFKQWRLCVNPSKTEVSCFHLSNSQKERKLNVSLNGIPLKHNFH FT PVYLGVSLDCSLTYKFHLEKLRQKLKTRNNILLKLAGSTWGANAKTLRVTA FT LALVFSTAEYCSAVWMNSSHTSKIDAQLNTAMRVITGTLKSTPTEWLPVLS FT NIAPPALRRKLAVKKLWNKYRRHPQEYPIWSDLIAPENRLKSRKPFWVETF FT NIENFFIINEWRNIWSNIQLFNKDLIEDPSKRVPGMDQPRWVWVKLNRLRT FT GHARCKSMLFKWKATENPYCQCGQVEETIQHLVEECQITSFQGGFNEIHQL FT SPQAKTWLSTIKVL" XX SQ Sequence 3181 BP; 1022 A; 607 C; 596 G; 955 T; 1 other; ctttgattga cggatgtcgg taaccaagtc tctctgattt tggtccgtta tgggtggttt 60 ttcgccaccc agaaattacc gacacttgga cgaccgatcc agatctacaa gatcctctcg 120 gtcatcgtct ggttttcttg gattgcgaag ttcttccgaa attttggaac ttcctcaaaa 180 tttgacagtc tgtcaattga atatagaagg aatttcaaag gataagtgtg atttcttgtc 240 gaaaatggct ctaagggaac gaatagacgt tattgttctt caagaaaccc atacttcgtc 300 agaaatggac cttttttctc gtggacaaat tcctggttac accttggtgg atttcatcaa 360 cagtgcttct tacggttgtg caacctatgt taaatcgaat ttcgaagact atcagtccat 420 cgccaagaca caggaaaaca acatctttct tctcactgtg aaaattgctg ggatttttat 480 tattaacgtt tataaacccc ctaatgacca atggcctgaa aatctggtgc tgaactatca 540 tccagctatt tacattggcg atttcaatgg tcatcacaat ttgtggggtt atgcggagaa 600 ctacaggaat ggccttgctc ttgtttcatg gatggaacta aataaacttc atttggtttt 660 tgacggcaaa actaggaaaa gtttctattc ggccagattg aagaaggagt acgatcccga 720 tttatgcttt gttacaataa acaatgaagt accattacaa gcatcacgag atgtactatc 780 tggttttcct catagccaac atcgtcctgt tattattaag attggcaatg agataaaaca 840 atttccttct cttcaaaaac ctagatggaa ttttcaaaaa gcaaaatgga atgagtttgc 900 taaacaagtg gatagtgatc ttcgttggat tccaccaact tcaatcaatt accacagatt 960 tgttggagcc atcaagaatg ctgctaaaaa attcattcct cgtggattca gaaagaatta 1020 tgttccttgt tggagtgatg aaacaaagag cttttacaat gcatacgttt ctaatcctaa 1080 cccagaaaat gccgatgctg tattaaattc atttaataag gcgaggaaag aaaaatggaa 1140 caacttaatg gaagatttaa atttcacaca cagtagtcgc agtagttgga agctgttgca 1200 aaaacttgga tcttcgaaca ttgcacaacc taaaacagat tctgctatca gtccgaatgc 1260 tgtagctagc cgtctagtaa acgtgtcaaa ctccgttaag ctagaaaaaa gggcaaagaa 1320 ggagatgaaa cgaagagttc aaaaacagag aaaaaaaatg aaagtttcag aagaactgtc 1380 gagatgtttc aacgtagatg agttgaaagc agccatatca tcgactaaaa gcaagaagtc 1440 ggctggtttt gatggaatct acccagaatt catcaaaaat cttggaagtt ttgctctgaa 1500 atggcttaca tttcttttta atgacatttt tatttcagcc cgtttgccct cagagtttaa 1560 aaagactaag gtgatcgcca ttttaaaaaa gggaaaacct ccacaagaag catccagtta 1620 tcggcccatt tctttgctga gtgtatgtta caaaattttg gagaagatga tttatcagcg 1680 catttctcct cttattgaag atattcttcc ctctgaacaa gctggatttc gaccacaaag 1740 aagttgctgt gaccaagtct tagctttaac gacccacatt gaatctggtt tcgagaaacg 1800 cttgaaaact ggtgttgttc ttttggatct cactgccgca tatgatacag tttggaagga 1860 tggtttaata cacaaacttt ataaagttgt gccttgcaag cggattgtca gtcttatcga 1920 gagcatgtta acaaacagaa agtttagagt ttttgttggt gaaaaatcaa gcaaagtcaa 1980 aactctcaac aatggtcttc ctcagggwgc cgtattatct ccattactct tcaatgtata 2040 tacgagtgac ctccctgaaa ccttttcaag aaagtttgtt tatgctgatg atatcgcact 2100 gacttttcaa cataaagaat ttcatcgatt agaggaacaa ctttcccaag atgcttctgt 2160 tttgtgtgat tacttcaaac agtggcgtct atgtgtaaat ccatcaaaaa cagaagtttc 2220 ctgcttccat ttatcaaact ctcaaaaaga aaggaaattg aacgtctcat taaatggaat 2280 accacttaaa cataactttc atccagtcta ccttggtgtc tctcttgact gttcgctcac 2340 ctacaagttt caccttgaaa aattaagaca aaaactgaaa acaagaaaca acattttatt 2400 aaagttggcc ggttcgactt ggggtgctaa tgctaaaact cttcgtgtta cggctcttgc 2460 attagtattt tcaacggctg aatattgtag tgctgtttgg atgaacagtt cacatacaag 2520 caaaattgac gcacaattaa acactgctat gcgagtcata actggcactt taaaatcaac 2580 ccccaccgaa tggcttccag tattgtctaa cattgctcca cctgcactca gaagaaaact 2640 tgcagtcaag aaactgtgga acaaatatag aagacatccc caagagtatc caatatggtc 2700 tgacctgatt gcccccgaaa atcgattaaa atctagaaaa cctttttggg tggaaacgtt 2760 taacatcgaa aactttttca tcattaatga gtggcgcaat atatggtcta atatccaact 2820 gttcaataaa gatttaatag aagatccatc aaaaagggta cctggaatgg atcaaccacg 2880 gtgggtctgg gtgaaattaa acagactaag aactggccat gccagatgta agtcaatgtt 2940 gtttaaatgg aaggctacag aaaatccgta ttgccaatgt ggccaagtgg aagaaacaat 3000 tcaacatctt gttgaagaat gccaaattac aagttttcaa ggaggcttca atgaaatcca 3060 tcaactgtcg ccacaagcta aaacttggtt atctactata aaagttttgt gatttcttct 3120 acatacaatg tactattgtc cttttttttc tgtcatacga atatatatta tatatatata 3180 t 3181 // ID Copia-39_DPu-LTR repbase; DNA; INV; 186 BP. XX AC ACJG01005078; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-39_DPu_; KW Copia-39_DPu-I; Copia-39_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01005078; Positions 9049 8864. XX SQ Sequence 186 BP; 48 A; 38 C; 19 G; 81 T; 0 other; tgtcatctcc attctatatt tatctcgtat ctgttcctat ctttgttcgt gtgtttcctg 60 gtaattatat atacttactt tattcacaca actcaatcta tgtttatgtc ataaattgaa 120 agatgttcta tcctgtatta atatacagaa gcctcgctca tactatcctt gtatcatttc 180 ttaaca 186 // ID Gypsy-22_CQ-LTR repbase; DNA; INV; 197 BP. XX AC AAWU01028702; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_CQ_; KW Gypsy-22_CQ-I; Gypsy-22_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 424-424 (2011). XX DR Genome; AAWU01028702; Positions 8192 7996. XX SQ Sequence 197 BP; 41 A; 61 C; 50 G; 45 T; 0 other; tgcatcccta cctgtcgtca ccacgctgga tcatcatgca ccgtcagcag gatacggtgc 60 cacgtagtgc cacgcggcat agcaacgccg ggttatgctg actcggcagg actgttcaac 120 aaaactgtcc gcgcaggtat gcgcggcctc tttcattccc ggatcgccgt gaagttaacg 180 tcgtagttca tcattca 197 // ID Baggins-1_NVi repbase; DNA; INV; 5066 BP. XX AC . XX DT 15-FEB-2009 (Rel. 14.02, Created) DT 23-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE Non-ltr retrotransposon: consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW Baggins-1_NVi. XX NM Baggins-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5066 RA Jurka J.; RT "LINE retrotransposons from the parasitic wasp Nasonia RT vitripennis."; RL Repbase Reports 9(2), 483-483 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1255..4944 FT /product="Baggins-1_NVi_1p" FT /translation="MRVAVLSNKMADRGQRNSVNNMNGAGSVPDTTIEFTQ FT INLHHCKSASAVLARRMAGVRTGICLIQEPWIHNGKIAGLNGIGTLISGSP FT VSTRTCLIVKGLQVETVPKYCSRDLCTARVSYKGTEGERKVIMIAAAYFPY FT EEACPPEEMVALIRECEAEGTKLIMGCDANAHHTCWGSTDCNSRGESLLEF FT LAATNMDFLNTGSRPTFRNAVREEVIDITLASRNVWSEVMDWRVSEEVSMS FT DHQHIVFRLGGQSTLDQLIRNPRKTNWVGYREELKAKISCFPVTYGTAEDI FT DHCSRILRDIIISSYENNCELRLKRPSKGAPWWNKSLEKRRKKVRRLYNRS FT KHTKSDKDKRAYRAEQKEYKKEIEKAGVEGWKRYCKSRSTLSDTARLCSVL FT QHPKKPLTDSIRLVTGDWAKNGQEALXGLMETHFPDFREGARSEAVQLMAR FT GEDWRLAKRVTDKARIRWAIGTFEPYKAAGPDGIFPALLQQGMDVLVPALE FT KLYRACLALGYVPEEWGQARVAFLPKPGKTQHAVAKDFRPISMTSFLLKTL FT ERLVDRYIEESSLVEAPLHSKQHAYQTGKSVDTALVDAVSFIQKGMKNRGL FT VLVAFLDIEGAFNYTTGEAISAGMEEHAIPATVARWISVMLRTRTIVAAWG FT AYSCKGVVRKGCPQGGVLSPTLWCLVVDSLLCILNEAGINAQAYADDIVIL FT IRGDDEDVLAGLMQFALGLVEKWCNKVKLGVNPNKVSVMLCTNRYKTKPME FT GLQLHGVPLKLVKEVKYLGVTLDARLNWGKHIKDKCEKAIGTFWACRRAFG FT NTWGLEPDKVRWLYDAIIKPRLTHGALVWGHKCELKTHTGALDRVQRLVMG FT GITGSMRTTPTVAMERLLELPPLGKVIRANACRTFCRIAESTNCSDFKDAA FT LLEQVMPLMEERGCDRMAERIYFSKPFSIIIPEREEWKTGSHELLENSVVW FT FTDGSKNENGVGAGAWEKGDTQEIVCSLDHYATVFQAEIRAITEAAKWLLE FT RGTGQRTVSFCSDSRAALMALDSISISSKEVLRCRQALESLAEHNAVRLVW FT VPGHSGVVGNEKADRLAGRGADGIRARRCAVAVPTCEVNRAIKDWLNSQLS FT DKWTNANGLRQARALMGSSPPEEWLRTIRGLSRNRLRLAVGWLTGHWRVGY FT HLWNLRLRDSGSCRWCEYETETTSHLLCECPAFAGTRQRVWGVPMLGLEEL FT RLKSLASICRVAEAINKGL*" XX SQ Sequence 5066 BP; 1399 A; 1136 C; 1589 G; 937 T; 5 other; gcacgtatgt ccaccagggt cactcgaggg ggactgactg cgtcgatctg agggtaccgt 60 aggaaaacta ggttgtctgg cgctcctagg ccgtcggata gtacaggggc ccgggtcagc 120 ctaaccagca tgacctggct gccctatcag ccgacgaaaa atcccgcctg acctaacagt 180 gccggttggt gaccggatag ggccatgttg ggttgggcgg ggggccctgc tgaggccaca 240 ggtgtcgggg ttggcttctc cgcggtgggt ggcttctggg gggcgtactg gtccgggatg 300 cgggggcggg gcttggtcca cacacactaa cacacccgta taacatacaa aataaaatgg 360 aaaagaataa aacgcaggaa acggttgcaa aagtgccagc aacaaaggag ctccccagct 420 caaaccaagt cctcacggac ggtattctga aagaccgggc ggctgtcgtc aagacggcaa 480 ataccggtgc aagttcccag gagagtcacc gggcggctgc ggggtccgcc ccagcggcaa 540 ataccggtgc ctcctcggga actgaaaata gtaaggtaaa gaaaagaaac aggcacgcta 600 ataggaagct agcctcgggg gcgaagctaa ggacgaagcg tctggaggac gctcctgggg 660 ggtctcaggt tgggaccctt gcgggcgcga ccaaaagaca gcactccgac agctccaccc 720 cgagggagct taaaaagagg tccgttgggg acgctccaaa gagctacagg acttaaggtg 780 gccatcgttc cggtcaagta cccggaagag cgccttgacg aggaccaggg aaagaaggcc 840 ttggaggggg agattgatga cgccccggac ggaggctatt ttccaagttt cttggacaat 900 tggttccaga ggggtgcaca gatctggtta cttacactaa atggtgtggg aggacccaac 960 tcaaggctat tgggagacct ccagacgatg atcccggagt gcatactccg gaggtcgtgg 1020 tgaagcgctt gaaaaggcag aaggtggrag ggraaaaaag gaggagacgg cattcgggtg 1080 tgctggaaag gactgaggtc gaggcactga aggcccttga cttcagacct ttctgtggag 1140 gcggcagggc acatgtcgtc ccgttcaagg aaaagacaga ggaagagccg ggacagacgg 1200 aagtagaggt tatggaggtc gatcccggcc ttaataaggt aaacatagag gtagatgcgg 1260 gtagcggtgc tctcgaacaa gatggctgat cgggggcaac gcaacagcgt gaacaatatg 1320 aatggggccg gatctgtacc ggacaccacc atagagttca cgcagattaa tctgcaccat 1380 tgcaagagcg cctcggccgt tctggctaga cgtatggcgg gggtgcgtac aggtatttgc 1440 ctaatccaag aaccttggat tcacaacggc aaaatcgcag gcttaaacgg aataggtaca 1500 ctgatcagcg gcagtcccgt ctcgactaga acgtgcctta ttgttaaggg gcttcaggta 1560 gagacggtgc caaagtactg ctcaagggat ctatgtacgg ctagggtaag ctacaagggt 1620 accgaaggtg agcggaaagt cattatgatt gccgcagcat actttcctta tgaggaagcg 1680 tgcccaccag aggaaatggt cgcattaatt agagaatgcg aagccgaagg caccaagctt 1740 ataatgggtt gtgacgcaaa cgcacatcac acatgttggg gcagcacaga ctgcaattct 1800 agaggagaaa gtctattgga gttcttagca gcaacgaaca tggactttct caatacgggc 1860 agcagaccca cgttccgaaa cgccgtaagg gaggaggtca tagacatcac cctggcctcc 1920 aggaacgtgt ggtccgaggt tatggactgg agagtgtcgg aagaagtctc catgtcggat 1980 caccaacaca tcgtgttcag acttggcgga caaagtacgc tcgatcagct cataaggaac 2040 cctaggaaaa caaactgggt aggatacaga gaagaactta aggctaagat cagttgcttc 2100 cctgtcacat acgggacggc ggaggacatc gatcactgca gtaggatcct aagggacata 2160 ataataagtt cttacgagaa caattgtgaa cttaggctga agaggccatc gaagggcgct 2220 ccctggtgga ataagtcact ggagaaacgc agaaagaaag taaggcggct atacaaccgg 2280 tccaagcaca cgaagagtga caaggacaaa cgggcgtata gagcggagca aaaggaatat 2340 aagaaggaga tcgaaaaggc cggggtggag ggttggaagc gatactgcaa aagccgaagc 2400 acgttatcag acacggccag gctgtgcagt gtcctgcaac acccgaagaa gccactgaca 2460 gactcgatca gattagtaac aggggactgg gcaaagaatg gacaggaggc cctaragggc 2520 ctgatggaaa cccactttcc agacttcagg gaaggggcga ggtcagaggc cgttcagctg 2580 atggcccgcg gagaggactg gagactggcg aaacgggtaa cagacaaggc aaggataagg 2640 tgggctatag ggacgttcga gccctacaag gcggcaggtc cggacggcat tttcccggct 2700 ctgctgcagc agggcatgga cgtattagtc ccagccttgg agaaattgta ccgagcctgt 2760 ctggcactag gatatgtgcc ggaagaatgg gggcaggcga gggtggcttt cctgcccaaa 2820 ccaggtaaga cacaacacgc ggtcgcaaag gacttcaggc caatcagcat gacctcgttt 2880 ttactcaaaa ccctggaaag gctggttgac agatatatcg aggagtcatc actagtagaa 2940 gcgccgctgc acagcaagca gcatgcctac cagacaggaa agtcagtgga cacagcgttg 3000 gtggacgcgg ttagcttcat tcaaaagggc atgaaaaaca ggggtttggt attggtggct 3060 ttcctagaca tcgagggggc tttcaattac acgaccggcg aggcgatttc agcaggtatg 3120 gaggagcacg caataccagc aactgtcgcc aggtggatca gcgttatgct gaggaccagg 3180 acgatagtgg ctgcctgggg agcgtactct tgcaaaggag tggtgaggaa gggatgccca 3240 caaggagggg tgctgtcacc gacactatgg tgcctggtgg tggacagctt gctctgcatc 3300 ctgaacgagg ctggtataaa cgcacaggca tatgcagacg acatcgtcat cctgattagg 3360 ggggacgacg aggacgtgct agcaggtcta atgcagttcg cactgggcct ggtggagaaa 3420 tggtgtaata aagtaaaact gggggttaac cccaacaaag tcagtgtcat gctgtgtacg 3480 aacaggtata aaaccaagcc gatggaaggg cttcaactcc atggagttcc cctgaaattg 3540 gtaaaggagg tcaaatacct gggagtcacg ctcgacgcca gactcaactg ggggaaacat 3600 atcaaagata aatgcgagaa agcgataggc actttctggg cctgcaggag ggcttttggc 3660 aatacctggg gtctagaacc ggataaagtg agatggctat atgatgcaat cataaaacca 3720 agactcactc atggggcact ggtatggggc cacaaatgtg agttgaaaac tcacacaggg 3780 gcactagaca gggtgcaaag attggttatg ggaggaatca cgggatccat gcgaacgaca 3840 cctacggtcg ctatggaaag actgttggaa ctccctccac taggcaaggt aataagggca 3900 aatgcgtgca ggacgttctg cagaatagcg gaatcaacga actgttcgga tttcaaggat 3960 gcggcacttc tggaacaagt aatgccgcta atggaagaaa gaggttgtga cagaatggcg 4020 gaaagaattt atttctctaa gccgttctcg atcataatcc cagaaaggga agaatggaag 4080 acgggttcac acgaactcct cgaaaacagt gtggtgtggt ttaccgatgg ctcaaaaaat 4140 gagaacggtg taggagcagg tgcctgggaa aagggggaca cacaagaaat tgtgtgctcg 4200 ctcgaccact atgctacggt tttccaggca gaaatcaggg ccattacaga agctgctaaa 4260 tggctgttgg agagaggcac aggacaaaga actgtaagct tctgctcgga tagcagggca 4320 gctctcatgg cgctagacag tatcagtatc tcctcaaaag aggtactaag gtgcagacag 4380 gcgctggaat cactggcaga acacaacgcg gtgagactgg tgtgggtacc aggacactcc 4440 ggcgtggtgg gtaacgaaaa ggcggacagg ttagcgggta ggggcgcgga cggcattcga 4500 gcaagaaggt gcgcagtcgc tgtcccaacc tgcgaggtta atagagcaat taaagactgg 4560 ctcaactcgc aactgagcga caaatggacc aatgccaatg ggctaagaca ggctagggct 4620 ctgatgggca gcagcccacc tgaggagtgg ctgcggacca ttaggggcct aagcaggaat 4680 aggttgaggc tggccgttgg ctggctgacg ggacattgga gggtaggata ccacctgtgg 4740 aacctgagac taagggactc cgggagctgc aggtggtgcg agtatgaaac ggagaccacc 4800 tcccacctgc tgtgtgaatg cccagctttt gcagggacaa gacagagggt gtggggggtt 4860 cccatgttgg ggttagagga gctaaggtta aaaagcttgg catcaatttg ccgagttgcc 4920 gaagctataa acaagggttt ataaagtaat tacttatggt acaggaggca cagcacaatg 4980 ggcttcgcct gagtgctcga ccggcacgtc cccgtgcagg ggccgccgca gttcacctcc 5040 rcccaacgtg actawgacta tgacta 5066 // ID Copia-121_AA-LTR repbase; DNA; INV; 170 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-121_AA_; KW Copia-121_AA-I; Copia-121_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-170 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 170 BP; 47 A; 39 C; 32 G; 52 T; 0 other; tgaagaagta ggccttgctg agcaagccat gaaatactga caagcgctag tggagttcat 60 tacgaatcaa gccttcaaac tgtatagacc attctctcct aataaacgtt ctttagttgt 120 ttgagattga gtctttctct tggcccatcc gcccagtcct taatattaca 170 // ID R1-4_AP repbase; DNA; INV; 4235 BP. XX AC Contig12314; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE Non-LTR retrotransposon. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-4_AP. XX NM R1-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4235 RA Jurka J.; RT "Non-LTR retrotransposons from pea aphid."; RL Repbase Reports 9(8), 1797-1797 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 618..1997 FT /product="R1-4_AP_1p" FT /translation="MSTHSTMDEDEGLQDAQQGTSQPQRTQRSPNTPGSSP FT QHKKKRGSADPVPIAREAIKWIRYTLEIASTKKTNITAETQRSLFAKLEAL FT EGSIHDMVIANLTLQSQLEESHRGAEICMSACAAQFGTELRLREAAHEQTL FT EAVVARYAAREVAREEDAARVGNTAQQPVEHEEHTFAQVTEHGRRRNRGDR FT PPAVRAAERSKSRTAKRTKLLAESRNEEHRPAFIIQSIDGAKASDTSTGIW FT KKVMSKKVMPRCQTITTKQGKVIIKPLNKETADVLKSLSLESDELQEEPLM FT WPRFIVRGVPSCMPESTIQEAILDQNPELGIEPGCIDRVFKPVFKRGPRDR FT EDTNWVMEVNPPYYENVRKVEHLYIGFTRCRIAEYDEVTQCHVCLRHGHPA FT AKCNEKQPTCAHCGRKGHLADVCPAAEAEPSCANCHGKHNARERSCSARTN FT HLVGRARRTNYGKTQ" FT CDS 1997..4234 FT /product="R1-4_AP_2p" FT /translation="MSISSLSVVQLNMGRASSVSDQLLEYCQTHNIDIAMV FT QEPYTNRGRLTGFEVAPIRSFLSMGTRRRGRPEYLDYGAAIIVFNPDLIVV FT PREAGTVENFVSVDLDCGEEGNVTLISGYFKYRVPTPVHVVALDGLIRDAN FT EKTIIALDANAFSKRWFSRINDARGEALVSCIDEHGLVVMNTRCQHTTFNG FT PRGRTNIDVTLADQVLQRRVANWSIDPGATSSDHQLIKFTVAMRTRTFEHR FT ETRFILRKAAYQQFRLAYEELAARRQHREPNLDHNASSISEDVTAAAMAHA FT PRARRRKRVKPPWWTNELHEARKAVRAAARAMTTTDGRQLFNQKRNVYTSI FT LRRNKINSWRAFCTEEGKQPWGKLYRWLRNGSKRYCSIGLMTRPDGTRCDT FT IDESVELLLNSLIPNDPGQQRPAPAEETLCDIHPISSSTLRDFAWAISPNR FT APGNDGITGRMLRVLWPCLESRFLSLTNACMEKAHFPGSWKSAIVVPISKG FT EDRDPGLPKSYRPVSLLPTMGKILEKVINHRLQEQIQPNYTGKQYGFTSGK FT STMDAVGNLLLWNTQRAEKYAMTVFLDISGAFDNLAWPALQSDLESIGATP FT HMRRWIADYLSGRTATMTVGGITKDVRVTKGCPQGSILGPILWNVTMEALL FT RTTYPEHVAIQAYADDIAISIAAQTRVSLIQRAEEALRPALEWAHHRGLTF FT SAQKSQAMITKGSLAAGFTIAFGTDRIATVDCIKYLGIWLDEERSF" XX SQ Sequence 4235 BP; 1186 A; 1084 C; 1153 G; 812 T; 0 other; gtgcgcgtac ggaccggtgt cggtgttgag tcggtccgat aaaacccccc ctcgtggggc 60 ccgtttttgt gtgcttccat aaggaataca cggccgcccg ccggtattgc cgagcgattt 120 ttggataatc gtcagcttcg agaccaggcg tcaagagcct gacgacgtga gttcggactt 180 attatataac gtaattacgt aataccacgc aaacgtgacc ccccaccaag tggctggtat 240 aggtagtata ccacccctcc ctaaaatttt aattggcgat aactaccaag ggaaaaaggg 300 tacagacccc ggacccaggg ggaactcaat ttttgagcac accgcttgat ttcccgtata 360 cgggaacaaa gcccccaccc cccccccccc cgatttgagt taagtataac tccgtgggta 420 atcaccgcaa aaccctaggg gcaacggtta tcggttaccc taggtcaggg ccgcaattcg 480 acccaatagg gaacggggta cccccccacc ccccattttg gttttcgttt tttgggcaga 540 aatctttcag ttagaccttc agtacccata cccccgaggg gaaactcggg gttttgagtt 600 ccctcccctc cagagggatg tcaacacaca gtaccatgga tgaggatgaa gggttgcagg 660 acgcccaaca aggcacgagt caaccacaaa ggacgcagag atcgccgaac accccgggct 720 cttctccgca acataagaag aaacgaggct cagcagaccc ggtacctata gcccgcgagg 780 ccatcaagtg gatcagatat acactagaaa tagcgtcaac aaagaaaacg aacattactg 840 ctgagacgca gcgcagcctg ttcgccaagc ttgaggcgtt agaggggtcg atccacgaca 900 tggtcattgc aaatttaacg ctgcagagtc aactcgaaga gtcccacagg ggggcggaga 960 tttgtatgag cgcctgcgcg gctcaatttg gaactgagct acggttgagg gaagctgctc 1020 atgagcaaac gctggaggct gttgtggcca gatatgcagc cagagaggta gccagggagg 1080 aggatgctgc aagagttgga aataccgctc aacagcctgt ggaacacgag gagcatacgt 1140 tcgcccaggt aacggaacat ggacggagga gaaacagggg cgataggcca ccagccgtcc 1200 gggctgctga gaggtcgaaa tcaagaacgg ccaagagaac aaaacttcta gctgagagca 1260 gaaatgagga gcacagacct gcgttcatta ttcaatcgat agatggcgct aaggctagcg 1320 acaccagcac aggtatctgg aaaaaggtga tgtcgaagaa ggtcatgccg agatgccaaa 1380 cgatcaccac gaagcagggg aaagttatca tcaagccgtt gaacaaggaa acggcggacg 1440 tgcttaaatc gctttccttg gaatccgacg aactccaaga ggaaccattg atgtggccta 1500 gatttattgt caggggggtc ccatcctgca tgccggaaag cacaatccag gaggctatcc 1560 tagaccaaaa cccagagcta ggaatcgagc cagggtgtat cgacagagtc ttcaaaccgg 1620 tcttcaaacg aggcccacgg gacagagagg acacgaactg ggtaatggaa gtaaaccccc 1680 cctattacga aaacgtaagg aaagtagaac acctatacat tggcttcacg aggtgtagga 1740 tagctgaata tgatgaggtc acccaatgtc atgtatgcct gagacatggg catccagcgg 1800 ccaagtgcaa cgaaaagcaa ccaacctgtg cccactgtgg ccgcaagggc catttggctg 1860 atgtctgtcc ggctgcggag gctgaaccgt cctgtgcaaa ttgtcatggc aagcacaacg 1920 ccagagagag atcctgctcg gcaaggacca atcacctggt gggaagagct aggaggacca 1980 attacgggaa aacacaatga gcatctcttc cctcagcgtc gtgcagttga atatgggcag 2040 ggcatcctcc gtcagtgacc aactgctaga gtactgccaa acgcacaaca ttgacattgc 2100 tatggtccag gagccataca cgaacagggg taggctgacg ggcttcgaag ttgcccctat 2160 cagaagtttc ctgtccatgg ggacgaggcg tagaggcagg ccagagtacc tggactacgg 2220 agctgccatc attgtcttca acccggacct gattgtcgtg ccgcgggagg ccggcaccgt 2280 cgaaaatttt gtcagcgttg acttggattg tggggaggaa ggcaacgtca cgctcataag 2340 cggttacttt aaatacaggg tgcctacacc ggttcacgtg gtggctttgg acggactaat 2400 ccgtgatgcc aatgagaaaa ccatcattgc tctggacgcc aacgctttct caaaaagatg 2460 gtttagtagg atcaacgatg ccaggggcga ggcactcgtg tcgtgcatag acgaacacgg 2520 cctagtagtt atgaatacaa ggtgccaaca cactactttc aatggtccga gaggccgtac 2580 caacatagat gtaaccctcg cagatcaggt cctgcaacgg agagttgcca actggtcgat 2640 agatccggga gccacatcaa gtgatcacca acttataaag ttcaccgtag cgatgcgaac 2700 gaggactttt gaacataggg aaaccaggtt catccttcgc aaagcagcgt atcagcagtt 2760 tcggctcgca tacgaagaac tagcagctcg cagacagcac agggaaccaa accttgatca 2820 taacgcatcc agtataagcg aggacgtcac cgctgcagcc atggctcacg cccccagagc 2880 cagacgccgc aaaagagtca agcccccatg gtggacgaac gaattgcacg aggcgaggaa 2940 agccgtcaga gccgcggcaa gagcgatgac gactacggat ggacgtcagt tgttcaacca 3000 aaaacgcaac gtctacacat ctattttgcg cagaaataaa atcaactcgt ggagggcgtt 3060 ctgtactgag gagggcaaac agccgtgggg caaactatac cgttggctaa gaaatggtag 3120 caaaaggtat tgctccatag gcctaatgac gcgaccagac gggaccagat gcgacaccat 3180 cgacgaatca gttgaactgc tattaaactc gttaattccg aatgacccgg gtcagcagag 3240 gccagcccca gcagaggaaa ctctctgtga tatacatcca atatccagca gcaccctcag 3300 ggatttcgct tgggccatat caccaaacag agcaccggga aacgatggta taacaggcag 3360 aatgttaaga gtgctttggc catgcttgga aagcagattc ttgtccctga caaatgcctg 3420 tatggagaag gcgcactttc caggctcgtg gaagtccgca attgtagtac ctataagcaa 3480 aggtgaagac agagaccccg gactgccgaa gtcttacagg ccggtgagcc ttttgccgac 3540 catgggcaag atcctggaga aggtgataaa ccacaggcta caggagcaga tccagcctaa 3600 ttatactggc aaacagtatg ggtttaccag tggcaaatcc accatggacg cggtgggcaa 3660 tctcctgctg tggaacaccc agagggcgga gaaatacgcc atgactgtgt tcctagatat 3720 atcgggagcc tttgacaacc ttgcgtggcc agccttacag tcagatctgg agtccattgg 3780 agcaacacca cacatgagga gatggatagc ggattacctc tcaggccgca ccgccacaat 3840 gactgtgggt ggaatcacta aggatgtcag agtaacaaag ggctgccctc aagggtcaat 3900 cttgggcccc atcttgtgga acgttacgat ggaagcccta ctgagaacaa catacccgga 3960 gcacgtagcg atccaggcgt atgctgatga tatcgccatc agcattgcgg cacaaaccag 4020 ggtgtcactc attcagcggg cagaagaagc gctaaggcca gcactggaat gggcacacca 4080 caggggtctt acgttctccg cacagaagtc ccaggccatg atcactaagg gcagccttgc 4140 agcagggttc acaattgcat ttggaaccga cagaattgcc acagtggact gtataaaata 4200 cctaggcatc tggctggatg aagagaggtc attca 4235 // ID hAT-37_HM repbase; DNA; INV; 3527 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-37_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3527 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2025-2025 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1193..3022 FT /product="hAT-37_HM_1p" FT /translation="MRMILHLLFVVALGESSMTTYGRTHPKQRRILAAVVK FT HLVVGASLPVSIVESEHFRRYNNVLDPQCSTISRFAVKTYLEKTYTDAKEK FT LTENLCQAASISLTLDIWSDRKMRGYLGVTAHYIHNYHCHTRLLACDRFLG FT AHTGDNIAEHFESICKTYKILDKVVYVVTDNASNMRKAFTTNFPEDPEGGG FT DTSAEELAEDLENEDLWEDIIAPDEIIQNINQLNPRREVTRLSCFAHTLQL FT VVGDGLKEAKIYSAQAKASKICTLLHTSTTFKEAFDKTFGSKQGIPRVNAT FT RWNSTFRHIKSVVNLDEKKLKNLLDDLSHRNLILNEREYVQLRELVDLLQP FT FLEATDMTQGEKNITISLVVPSIVGLYSHLEQFNTTYLSGMKQVLRNSLQK FT RFSGIFANVDLIPSIDKKLPFNDLIYIIATVLDPNFSFFWTEKLCHVNPGQ FT TDNLKTNIISKIRQFCNKQFSHTNVAMDGKVLPENLPSKKPRTNSFLYSYC FT RSTPVIQTGTTSQPPVKSFDTQFHCYEELVNEVFASNFNTLDIDCLQFWKM FT HGQKLPILQSAAMYTFVVPASSAPVERVFSQGGLIMNPKRASLNSDTVCQL FT VFLKCNTDILL*" XX SQ Sequence 3527 BP; 1116 A; 637 C; 602 G; 1172 T; 0 other; cagagctgga gactcgagtc aaccgactcg aactcgagtc cgacttaagt cacaccagtg 60 actcgactcg agactcgact tgggttttac cctgagactt gcgacttgac tcgagacttg 120 ctaatttcgg cttagcgact ttaaaattat tattttccgt ctcttcaaaa atggagcaat 180 ttatcttcta ttttaattca actgtttttt aactattttt tattttcgaa tgtttttatt 240 ttaacttacg tttaactggt ttgtagtcta agcgagatcc gaagaaaaat atcgcattta 300 ggctctttcg ttttcaaaaa attccaactt ttttgtgtgc gttctgattg gttcattttt 360 tctttaactc atgaaattta ttgtttataa atgttcggtg attttaattt cttatttaaa 420 ttttttttaa ttctttcttt ttctttcctt aacaatcttc atttttctct tatttatttt 480 tactttcaaa attatctttt attattatta ttttttcatg taatttattt ataaaattca 540 ttataaaaaa gcgtccgcca ttccggaagt ctttgcactg ctgttctaat tttaagtatt 600 taaaattgtg tttgctacta atttcggtga tttgtaaagt ttcataagtt atcttaaaac 660 tatgagtgct ataaaaattc aaaaaacgcc tgcagtacct ggaataatta attttggtta 720 tactgagtta gagaaaaaaa aaagaaaaaa ctagatcggc acgatgtaaa ttttgtcaag 780 ctttaattac cgatggagtt gcgactacaa gcaacttcat tcgccattta aaaacacatc 840 caggaaaata tgaattatat caaaagttta ttcagacaag accaacggct gcttcaccag 900 accaatctaa gatttcagac tttcgagcag gttatttact tatactaatt tgtttttatt 960 ttattagata tttaaaatat attaaaccct agagtccgtt ctatgtctac ctctagtaga 1020 caagtagtct actagaggtc tgacattcca taacagttct tatagttctt attagcttgt 1080 acggcaatac gttgcagtgt cgcactacat cttaccactt aacccgaagc ttttaaataa 1140 agctttaaac ttagcgtata aagcatggcg ttaaggatta ttactttata gaatgagaat 1200 gatattgcat ttattatttg tagttgcttt gggagagagt tctatgacaa catacggccg 1260 cacccacccc aagcaacgaa gaattttggc agccgttgtg aaacatctag tagttggtgc 1320 atcactaccc gtgtctatag tagaaagtga acattttagg cgatacaata acgttcttga 1380 tcctcagtgc tcaactatta gccgttttgc cgtaaaaacc tatttagaga aaacgtatac 1440 ggatgcaaag gaaaaattaa ctgaaaattt atgtcaagct gcaagcattt ccctaacact 1500 tgatatttgg tcagatagaa aaatgagggg ttaccttggt gtaacagcgc attacataca 1560 taactatcac tgtcacacac gactgttggc atgcgatagg ttcttaggtg ctcacactgg 1620 tgataacata gcagaacatt ttgaatccat ttgtaaaaca tacaaaattc ttgacaaagt 1680 tgtgtatgtt gtgacggaca atgcctcaaa catgcgcaaa gcatttacta caaattttcc 1740 agaagacccc gaaggtggcg gtgatacatc tgcagaggaa cttgcagagg acttagaaaa 1800 tgaagacctg tgggaagaca ttattgcacc ggacgaaata atccaaaata tcaatcagct 1860 taacccaaga agagaggtta caagactaag ctgttttgct cacaccctac aattagttgt 1920 tggcgatggt cttaaagagg caaaaattta ctccgctcaa gctaaagcat ccaaaatatg 1980 cactttgctt catactagta caacatttaa agaggctttt gataaaacgt ttggctccaa 2040 acagggtatt cctcgagtaa acgcaacaag atggaattcc acattccgtc acataaaaag 2100 cgttgtcaat ttagatgaaa agaaattgaa aaacctgctt gatgatttaa gtcaccgcaa 2160 tctaatacta aatgaaagag aatacgtgca acttagagag ctcgttgatt tattgcaacc 2220 atttcttgaa gccaccgata tgactcaagg tgaaaaaaat ataacaattt cattagtagt 2280 tccctctatc gttgggctat atagtcactt agaacaattt aatacaactt atctgtccgg 2340 tatgaaacag gtattgcgaa acagtcttca aaaaagattt tctggtatat ttgcaaatgt 2400 tgaccttatc ccatcaatag ataagaaact accatttaat gacttgatat atattattgc 2460 cacagttttg gatcccaatt tttctttctt ttggactgag aagctatgtc atgtcaatcc 2520 aggtcaaaca gacaacttaa agacaaatat tatttcaaaa attcgtcaat tctgcaacaa 2580 gcagtttagc cataccaatg tagccatgga cggaaaagtc ttaccagaaa acctaccctc 2640 taagaagcca agaactaatt catttttgta ctcttattgt agatctacac ctgtcattca 2700 gactggaact acatcacaac cacctgttaa aagttttgat actcagtttc attgctatga 2760 ggaactggta aatgaagtgt ttgcatctaa tttcaatacc ctagacatag attgtctcca 2820 attttggaaa atgcatgggc aaaaattacc aattctccaa agtgctgcca tgtatacttt 2880 tgtggtacct gcaagcagtg ctcctgttga aagggtattc agtcagggtg gcctgattat 2940 gaaccctaag cgtgcaagtt taaattctga cactgtttgt cagttagtgt ttttaaaatg 3000 taacacggat atattactat aattttttat gaaaatcagc atttgtaatc tttttttcaa 3060 aagaatgata gaataaaaac attagtaaat tggctaaata gcttatgaag tgttatgtca 3120 ctgtattgtg tattatgact ccttagctat tttattacag atggtttatt ttacttttaa 3180 agctacttta gtaaataaag taaaattata actattatca tgctgtattt tataattgta 3240 aggtgactgt ttgaatgtag ctaccagagc catcgcaagg ttcccccatg cccctcttgt 3300 aagaggaatt tcgtggtttt tatgacaggt tttggaaggg gcctccgaac tcctagccgc 3360 gtcactgata gccaccacat gcacgggaat cttttattgt aatatgactt gacttgagac 3420 ttgacttgac atgtcactat atgacttgag acttgacttg agacttgacc tcagtgactt 3480 gagacttgac ttgagacttg ctccaaagcg acttgaaaac agctctg 3527 // ID hAT-40_HM repbase; DNA; INV; 5247 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-40_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5247 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2028-2028 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1756..4764 FT /product="hAT-40_HM_1p" FT /translation="METLSETSQQANSSVSNEHKMAATKSDSTLCKSRRKS FT SGAKRRQEKKKRDLLQSGRDPKQRKLLFNNEHIINVVEEASVPASNPVLIK FT KNAFLWNITPNVIIEASTVSNTVSSSPQQDKQSDFTISKSPVKTITDCTSE FT LGAVVTEESIYDVYIELTKDDANSSILNHKHQCTETTKLIYKRLPDSAQRN FT QRIALVNNNMIQPNHTDVKHLPYDPKRVYQRLISDNKYVQRRWITARVEDN FT QIVALYCNACMAFSKEDSSFTLGFKKFSHIYQRIKDHEESKSHNAAVIALL FT SCTTEKDISSLINKGIVEKQKAQVLKNIEVFKRIFAAVIFIGKQALPYRGN FT RGESLYSLHDKSIDHGNFLELILLIAEFDVPLNNHIEKAIQASQKRKDCLI FT RKGKGSSRGRGGLVTMLSKTTVNNILAAISNLMKNCIIKEIGDQKFSVQMD FT GSQDTSVIDQETIILRYVLGEDVKERLFAVKKITDATGAGLYQLLKSELEH FT NGLKMEKIVGESFDGAANMRGEYHGVQRYINDVSPNAVYMWCYAHVLNLSA FT TDIVENILAVKNLIGLLQSTATYFGDSCKRMNVWTDIHIGQGVGSAKLKRL FT QKIGETRWWSKQMALERIFGTYDAPQVESFYALLQVFNCIRMSSRFDAKST FT FEATALHDKWCRFDVLLAAFLLLRVYTVLRQASEYLQTRGLDYLSAWNMVE FT SAKQELQTISFEDVHSKTIAFVKEINTLLDASELDSEVETELRVERVHRKK FT RMAGELAFDDIPENPLDRFRMDVFRRVIDQITTSIDERFSVNRDIIRDTAC FT LDPRRFKEIVENGIPENSLAKISQLTNLCSERLRSELHSFAINFDRLSKTL FT RDEFTVSPSDEDSEESNSVSELDTSTKATENENDRDFQNFERCVGSCKKCL FT KCCYSILYKYSLNATAYSNLFLAYEYLLTLSFSQVSCERAFSKLKIVKTRL FT RSSLCNDKLETFMLMSTEKDILDSIAVEDVIGYLCKDSVELSKMLLL*" XX SQ Sequence 5247 BP; 1797 A; 860 C; 930 G; 1659 T; 1 other; caggcgcgga ctggcccacc gggaaaccgg gaaatttccc ggtgcgcccc accatccttg 60 gcgcccccct ccccgccaaa tcataagttg tatatctcca taataccaca tgttacatat 120 aattaataac tataaaacca tttagtgtta tcgcttcaag tgcaaattca tgatccttca 180 atttctcaca atacaattga agcatcattt atatcagggg cgaattcaaa aatattttag 240 gggaggacag aaaaatattt ttgacttttt aacaaatact cgcgtgaatt agtttttctc 300 atgctgttta ctttactaaa tgatagttta aactagaccc gaaagaaact ttatttcgtc 360 cgaaattaaa atttgaaaaa agtgattagg aaaagtctga gttaggcccc ccmcctccct 420 aaacccgccc ctgatttata tcacaaactt aataacaaac tgatttgata cttcttattt 480 ttgattccaa aagcgtgtca tggagattct ttaaagaatc tccgtgacaa ttttgcattt 540 ttaaatgcaa aattgtgtgc ttttgtttaa attaaattat taaattgctt cttttcgaag 600 ggaattccat tggaacttaa tgcacggatg acgctacaaa gaacaatgga aaatcttgct 660 ctaattatat tttaaagtaa aatatataat tgttttaaaa tttaatgttt ttaaatattc 720 gagatcgttc aaatttaaaa tcgcgttttt tttgttaagg taagtggttt aatttatttt 780 ttacaacatt ttatttatag tgcttttttg catcaacgta gaaatataaa gaagactaaa 840 aaataaatga atatctaaaa aaaacctttt aaagacaaca atttaaaaat ggactaaaaa 900 gtaaaaaatg acttttatta agccagcgtg tagatataca gtagaatccc gttaaggtgg 960 tactagccca taaaaattaa aacagtcaat ttatttaatt ttgatgcaat atttttttaa 1020 aactgtgtgt attataaaat tatttagagt aaatataata aaaagtataa ttatactcga 1080 aaacatgctg agtcagcaaa aaatatgcaa ttttaacttc atttttaaac aaggatttct 1140 catgaaatag tcaatgtgtg aggctaaaat tttgcaaaaa ggttaacaac atgttgattg 1200 agggattgac cagaatcacc atcagaactt ttacagaatc aaaatggctg cactttaaaa 1260 atttttcctc cttgattttg ttacttcttt ttccttcctt tccatatgaa tgatgcgatt 1320 gttgtccttt ttaattgcac cgagaaggaa atatttttca caatgaaatc caagttgttg 1380 aaatgtttcg ttatgcgaaa cgtaacatcc attaaaagat ataactgctg agaaaacagc 1440 taactctaaa acactttttg aaacaaaatt ttggggcatt ttttccagat tattcgatta 1500 aaagtctcat taggattttg ggtatagctg tgtaaacatt ttgacaaaag attgtcagaa 1560 caaattttga tgtaaaaatt gagcaaaagg taagtttact actctaaaat gccttttttt 1620 tccaaaacga cccccctaaa aatcacggta cgggcctgcc ttgagagtcc gagttaacag 1680 gattttactg tttatatagt cattaaatgt atctatgtta ctacagtgat tgcattgtac 1740 attgtgcagg ttgctatgga aactttgtcc gaaacttccc agcaagcgaa ctcttcagtg 1800 tctaatgaac ataagatggc cgctacaaaa tctgactcaa cattatgtaa gtctaggaga 1860 aagtctagcg gtgccaaaag aagacaagaa aagaagaagc gtgatttgct tcaatcaggc 1920 cgtgatccta aacaacgcaa gctgctattt aataatgaac atattatcaa tgttgttgaa 1980 gaagcatctg ttcccgcgtc aaacccagtg ttaattaaga aaaatgcttt tttgtggaat 2040 ataacgccta acgtaatcat tgaagcaagt accgtgtcaa atacagtgtc ctccagtcct 2100 caacaagata agcaatcaga ttttaccatt agcaaatcgc ccgtaaaaac tataacagac 2160 tgtacgtcag aactcggtgc agttgttacc gaagaaagta tatatgatgt gtatatagaa 2220 ctaacaaaag atgatgctaa ctcaagtatt ctcaaccata aacatcagtg taccgaaaca 2280 acaaaattga tttataaacg gcttcctgac agtgcacagc gtaaccaaag aatagcatta 2340 gtaaataata acatgattca gccgaaccat acagatgtaa agcatttacc gtatgatcca 2400 aaaagagtgt accaaagatt gatatcagat aataagtatg tacaaagacg ttggataacc 2460 gcaagagttg aagacaatca aatcgtggca ctttactgca atgcatgcat ggcattttcg 2520 aaagaagatt ctagttttac tcttggattt aaaaagttct cccatatata ccaaagaatc 2580 aaagaccatg aagaatctaa atcacataac gctgctgtaa ttgcactatt aagttgtaca 2640 acagagaaag atataagctc tttaatcaac aaaggtatag tggaaaagca gaaagcgcaa 2700 gtattgaaaa acatcgaagt atttaaacga atttttgcag cagttatttt tattgggaag 2760 caagctcttc catacagagg taatcggggt gaatctctct acagtttaca cgataaatct 2820 attgatcacg gaaacttttt ggaactaata ttgctgatag cagagtttga cgttcctctt 2880 aataatcata tagaaaaagc tattcaggca agtcaaaaaa ggaaagattg tttgatacgt 2940 aaaggaaaag gaagctcacg tggacgtgga ggcttggtaa caatgttgtc aaagactaca 3000 gttaacaata ttctagctgc tatttcaaac ttgatgaaga attgtataat caaggaaatt 3060 ggtgatcaga aattcagtgt tcaaatggat ggtagtcaag atacgtcggt gattgatcag 3120 gagacaataa tcttacgata tgtacttgga gaggacgtga aagagcgatt atttgcggta 3180 aaaaaaatta ctgatgcaac aggtgcagga ctataccaac tattgaaatc agagcttgaa 3240 cacaatggtt taaaaatgga aaaaattgtg ggtgaatcct ttgatggagc agcaaacatg 3300 cgtggtgaat atcatggagt tcaacgttac attaacgacg tttcaccaaa tgctgtatac 3360 atgtggtgct acgcgcacgt tctaaatttg tccgcaacag atattgtgga aaatatatta 3420 gctgtcaaaa atttaattgg tctcttgcag agtacagcaa cttactttgg cgattcttgc 3480 aaacgaatga acgtttggac ggatattcac attggacaag gagttggatc tgcgaaactg 3540 aaaagactgc aaaaaattgg tgaaacacgt tggtggtcga agcagatggc gcttgaacga 3600 atttttggaa cttatgatgc tccacaagtg gaatcgttct atgctctttt gcaagttttt 3660 aattgtatta ggatgtcttc aagattcgat gcaaaatcaa cattcgaagc cacagcactg 3720 catgacaaat ggtgtcgatt cgatgtgctt cttgcagcct ttcttcttct acgtgtgtac 3780 accgttttgc gacaagcatc cgaatatttg caaacaagag gtctggatta tttaagtgct 3840 tggaatatgg tagagtcagc aaaacaagaa ctgcagacaa tatcgttcga agatgtccat 3900 tccaagacaa ttgcatttgt taaagaaatc aacaccctcc tagatgctag tgaacttgat 3960 tcagaagttg aaacagaatt gcgtgtagaa cgagttcatc gaaagaaacg tatggcaggg 4020 gaactcgcct ttgatgacat accagagaat cctcttgatc gtttcagaat ggatgttttc 4080 cgtagagtaa ttgatcagat taccactagc atagatgaac gtttctcagt taacagggac 4140 atcattagag acactgcgtg tttagatccc agacgtttta aagaaattgt tgaaaatggt 4200 attccagaaa acagtcttgc aaaaatttct caactaacta atctttgttc tgaaagattg 4260 agatctgaat tgcatagctt tgctataaac tttgataggc tctcaaaaac actacgagat 4320 gaatttacag tgagtcctag tgacgaagat tcagaagaat cgaattcagt ctcagagttg 4380 gatacttcaa caaaagcaac tgaaaatgag aatgatagag attttcagaa ctttgagcgt 4440 tgtgtagggt cttgtaaaaa atgtttaaaa tgttgctata gtattttata caaatattcg 4500 ttgaatgcaa ctgcttattc aaatctgttt ctggcttacg aatatctatt aacactttca 4560 ttctcacaag tgagctgtga acgggcattt agtaaattaa aaattgtcaa aactaggctg 4620 cgttcctcat tatgtaacga caagcttgaa acatttatgt tgatgtcgac cgagaaggac 4680 attttggatt caattgcagt tgaggacgta attggctatt tgtgtaagga ttccgtagag 4740 ttatcaaaga tgttgcttct ataaattaca ttaggagatt gtagtcagtt acagtataaa 4800 taccagataa accacagggt ccattttgat ttttttttta tcttaaatga tttaaaaaga 4860 ggtttattat tcatttatca taataaattc atcgtgttat tattattatt gttattattg 4920 ttattattgc tattattgtt attaattatt gttattattg ttattattat tattattatt 4980 attattatta ttattattat tattattatt atcattgtct acataattat catgtatatt 5040 tcatacataa gtaaaatgac atactgaatt atattgtaag cctgaatgaa ctttgaactt 5100 tgaaagtgta acattatttt acctattcat attcaaatta aaaagacctt tatataaaca 5160 ctagcctatt ttctacccct atcccgacct tgtagcgccc ctccctcttc ccttcccggt 5220 aggatttgaa gccccagtcc gcgcctg 5247 // ID Gypsy13-SM_LTR repbase; DNA; INV; 216 BP. XX AC Contig99; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13-SM_LTR; KW Interspersed repeat; LG_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-216 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 761-761 (2007). XX DR [1] (Consensus) XX SQ Sequence 216 BP; 63 A; 20 C; 35 G; 98 T; 0 other; tgtgaggcgt tttataattt gtatttaaat aatggtatga tgttgggcat gtttgattgt 60 atgttgaaat ccttacttat tcattaatta ttatttttta gaaactcctc aaacaaatta 120 acgatataat ttgttttaat aatatttgtt tgtgttttga taataaacgt gttttagtta 180 tggattctat taatacccgt agctctgtgg ctaata 216 // ID Copia-6_AA-I repbase; DNA; INV; 4141 BP. XX AC supercont1.224; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_AA_; KW Copia-6_AA-LTR; Copia-6_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4141 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.224; Positions 320413 324553. XX CC Positions [1558-2097] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 100..4128 FT /product="Copia-6_AA-I_1p" FT /translation="MEDSGKEEIRRVAMFDGHNFVSWKFRMMTLLEEHDLV FT ECIDVEIDDVEELKVEETDTAAEQQQKRKALAERQKKDKKCKSIIVSRIHD FT DQLELLHGKSSAKQMWDTLVRIFERKSVAKRMQLNRELFELRHSGGPLHDY FT FVKYDRLIRLFRNAGGKIDDIDVVCRLLLSLGSEYDAVVTSIESQPEEQIT FT MEFVKCRLLDEEIKRKSIAASDVCGDHNESAAFSGSGKVSRTPKLKKPRVW FT KCFGCQKEGHKIANCPEKKEKKAVKPSAYSAEPCAEPSDDGGGVVFLADER FT KSKPPMSRVQWFVDSGATEHICNDKSLFAKLSPLKKPMEIAVAKNGEWVTA FT KYVGDVPVLSVVGEKVIESTVSRVLFIPEARCNLFSLSKVESAGMKVVIAG FT GRLEIFRGSNVVATGERRNKLYELNFFSRRCNNDMLCFSGQISKETELWHR FT RYGHLGDRNLSSLMKNGTVKGIPSKCSGGATTICEPCVSAKQTRNPFTLSE FT ERRSSRVLEIVHSDVCGPVTPVGWNNVRYFVSFIDDWTRFTVVYLIRSKDE FT VVDCFKNYEAQVTAKFGVKISRFRSDNGGEYTSKAMRSFCASKGIKMELTV FT PYTPEQNGICERMNRTLVEKARSMLFDSAVGREFWGEAIQTAAYLTNRSPC FT SVLDSSVTPSEVWEGTKPDVSKLRVFGSPAYCHIPKERRKKLDEKTWKGVL FT VGYCANGYRVWNPETRQIVAVRDIIIDENARLADVQSKKEVVRESSVWDYA FT EENSGQQEEDDNHAEAELEESVRSDNSEIFDTCDDSAGEEIIPVQSSARRQ FT RKPPSWHQDYDMTFAGVALGAMNYVNNLPDTIAELRTRADWPRWKLAVQEE FT LESLRKNGTWTLCKLPEGRKPITCKWIFRIKPGDDEQPDRFKARLVARGFC FT QKKGFDFNETYSPVAKLDTLRVMLAIANKERMSVHQMDVRTAFLNGVLSEE FT IYMTQPEGMHQEDGLVCKLRKSIYGLKQASKAWNDKFHGFVTELGFRRSKS FT DQCLYVRGTGKKKIFLILYVDDILVIGHDSNEIELVKTSLAKGFDMTDGGE FT ISSFLGMKIERNLQRRTMRISQRRYLEALLDRFNMADCKTSSIPMECRLRL FT DKGTEENRTSKPYRELIGCLMYVALTSRPDLLAAVNYYSQFQSCPTDMHWT FT HLKRILRYIKGTLDLGLEYMGRDDADMLSAFCDADWANDISDRRSVTGYLF FT KVFDCTVVWATRKQRTVALSSTEAELCALCEATCDGVWLTRLLEDVGYRSD FT VPIPYHEDNQSTIRIVQEPRDRSRLKHIDVKDCFVRNLVQQGSVVLKYVPT FT ETQEADILTKGLPAGHFKRLRSAIGLRDSMN" XX SQ Sequence 4141 BP; 1144 A; 818 C; 1192 G; 987 T; 0 other; ttaggttatg ggcccaggac cggttccgga ttgcgtcgcg aaagtggttc tagttctggt 60 gctttgagaa acgcggagtg aaagtgtttt gtgtgcatca tggaggattc cggaaaggag 120 gagatccgtc gagtcgccat gtttgatggc cacaacttcg tctcgtggaa atttcgcatg 180 atgaccttgt tagaggaaca tgacttggtc gagtgtatcg atgtcgaaat cgacgacgta 240 gaggagttaa aagtggaaga aacagatacg gcggctgagc agcagcaaaa acgaaaagct 300 ctcgcggagc gccagaagaa agacaaaaag tgcaagtcca tcatcgtatc gcgaatccac 360 gatgatcagt tagagttgct tcacgggaaa agttcggcaa aacagatgtg ggatacatta 420 gtgcgcattt tcgagagaaa aagtgttgca aaacgtatgc agctgaaccg cgaacttttc 480 gagctacgtc actcgggtgg tccattacac gattattttg tgaaatacga tcggcttatt 540 cgtttgttcc gaaacgccgg tggaaaaatc gacgatatcg acgtcgtttg tcggttgctt 600 ttgtcgttgg gctcggagta cgacgctgtc gtgacgtcga tcgagagtca gccggaagaa 660 caaataacga tggagttcgt taagtgtaga cttctcgacg aagagataaa gcgaaagtca 720 attgccgcga gtgatgtgtg tggagaccac aacgagtccg ccgctttcag tggtagtggt 780 aaagtttctc gcaccccaaa gctgaaaaag ccaagagtgt ggaagtgctt cgggtgccag 840 aaagagggac acaagattgc gaactgcccc gagaagaaag agaagaaagc agtgaaaccg 900 tcggcctaca gcgcggagcc ctgtgcagag ccaagtgatg atggtggtgg tgtagtgttt 960 ttggcggacg aacgcaaaag caaaccaccg atgagtcgag tgcagtggtt cgtggactcg 1020 ggcgccaccg aacacatatg caatgacaaa agcttgtttg caaagctctc cccactgaag 1080 aagccgatgg aaatagccgt ggcgaagaac ggcgagtggg tgacagccaa atacgttggt 1140 gatgtgcctg tattgtcggt cgtcggagaa aaagtgatcg agtcgacagt gagccgagtg 1200 ttgtttattc cggaggcgcg atgtaatttg ttttcgctga gcaaagtgga gtctgccgga 1260 atgaaagtag tgattgctgg tggtcggctg gaaatatttc gtgggtcgaa tgtagtagcc 1320 accggtgaaa ggcggaataa actgtacgag ttgaactttt tctcgcgtcg gtgcaacaat 1380 gacatgttgt gtttttcggg acaaatcagc aaagaaaccg agctttggca tcgccggtat 1440 ggccatctcg gtgatcgaaa tttgtcgtca ctaatgaaga acggaacagt gaagggaatt 1500 ccgtcgaagt gtagtggtgg tgcgacaaca atttgtgagc cgtgtgtgtc tgcgaaacag 1560 acgagaaatc cgtttacgct gagtgaagag cgtcgttcga gtcgtgtctt agaaattgtc 1620 cactcggatg tttgtgggcc tgtgacccct gtcggatgga acaatgtgcg ttactttgtg 1680 agttttattg acgattggac acgttttacg gtcgtgtacc tgatccgatc aaaggacgaa 1740 gtagtggatt gtttcaaaaa ctatgaagcg caagtaacgg caaagttcgg agtgaaaatc 1800 tcgcgttttc ggtcggacaa tggtggtgaa tacaccagta aggcaatgcg ttcgttctgt 1860 gcgagtaagg gtatcaaaat ggaacttact gtgccctaca ctcccgaaca aaacgggata 1920 tgtgaacgaa tgaacagaac cctcgtagag aaagcgcgtt cgatgttatt cgattccgcc 1980 gtcggtcgtg agttctgggg ggaagcgata caaactgcgg cgtatttgac gaacagaagc 2040 ccgtgtagtg ttctcgattc gagtgtgacg ccgagtgaag tgtgggaagg cacgaaacct 2100 gatgtttcga agctgcgagt ttttggttcg ccagcttact gccacatccc aaaagaacgt 2160 cggaagaagc tggatgagaa gacgtggaag ggagtgctgg tcgggtactg cgcgaatggg 2220 tatcgagtgt ggaaccccga aacgcgacaa attgttgctg tgcgtgatat catcatcgac 2280 gagaatgcga gattggccga tgtgcaatcg aagaaagaag tagtgcgtga atcgagcgta 2340 tgggattacg ctgaagaaaa cagtggacag caagaagaag acgataacca cgctgaagcg 2400 gaactagaag agagcgtgag aagtgataac tctgagattt ttgacacttg tgatgacagt 2460 gctggtgaag aaataattcc tgtgcaaagc agtgctcgcc ggcaacggaa gccgccgtca 2520 tggcatcagg attatgacat gaccttcgct ggagtagcgc ttggtgccat gaattatgtg 2580 aataatttgc cggatacgat cgctgagctg agaacgcgtg ctgactggcc gagatggaag 2640 ttggccgttc aagaagagtt ggagtcgcta cggaaaaatg gaacgtggac cctgtgtaag 2700 cttccggagg ggcgaaagcc aataacatgt aagtggattt ttcgcatcaa gcccggagac 2760 gacgagcagc cggatcgatt taaagcaagg ttagtagcca ggggtttttg ccagaagaaa 2820 ggatttgact tcaatgaaac gtattcgcca gtcgccaaat tggatacctt gcgcgttatg 2880 ctggcgatag ctaataaaga gcgcatgtct gtccatcaaa tggacgtgcg cacggcattc 2940 ttgaatggtg ttctctctga ggaaatatat atgactcagc ctgagggtat gcaccaagaa 3000 gatggactag tctgcaaatt gcgtaagtcc atatacggcc ttaaacaggc ttccaaagca 3060 tggaatgaca agtttcatgg ctttgtgact gaattaggat ttcgacgatc aaagagcgat 3120 caatgcctct atgtgcgcgg cactggtaag aagaagattt ttctgattct gtatgtcgac 3180 gacattcttg tcatcggaca tgattcaaac gagatcgaac ttgtaaaaac gagtctcgca 3240 aaaggtttcg acatgacaga tggaggagaa atatctagct ttcttgggat gaagatagag 3300 cgaaatctgc aacgtcgtac gatgagaatt agtcagcgca gatatctgga agctctactc 3360 gatcgtttca acatggctga ttgtaaaaca agctctatac cgatggagtg ccgtcttcgt 3420 ttggataagg gcaccgaaga aaaccgaacg agtaagccat acagagaact aataggctgt 3480 ctcatgtatg tagctctgac gagccgtccg gatttgttgg cagcagtaaa ttattacagc 3540 caattccagt cctgcccaac agacatgcat tggacacacc taaaacgaat tttgcggtac 3600 ataaagggta cacttgatct gggactcgaa tacatgggac gagacgacgc tgatatgcta 3660 tcggccttct gcgacgctga ctgggcgaat gacatcagtg acaggcggtc ggttaccgga 3720 tatcttttta aagtattcga ctgtacggta gtttgggcaa ctcgtaagca gcgtaccgta 3780 gctttatctt ccaccgaggc cgagctgtgt gctctttgcg aggcaacctg cgatggagtc 3840 tggttgacac gtttactcga agatgttggc tacagaagcg acgtgccaat accgtatcat 3900 gaagacaatc agtctaccat aaggatcgtc caagaacctc gtgatcgaag cagacttaag 3960 cacattgatg tgaaggattg ttttgtgcgg aatttggtgc aacaaggaag cgttgtgctg 4020 aaatacgttc caacggaaac gcaagaagct gacattttaa cgaaaggttt gccagccggt 4080 catttcaagc gtctccgatc tgccataggg ttacgagatt ccatgaatta aggaggggta 4140 t 4141 // ID CR1-60_HM repbase; DNA; INV; 4118 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-60_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4118 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1887-1887 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 104..826 FT /product="CR1-60_HM_1p" FT /translation="MAPKVDYVSRKQLEDLITRVSELERKDIENLSKIDKL FT MSENVLLTARVKILEEKQTNSILDWSKLFNNKKCDKSTEEIKVTQGIIDLS FT KDVSKRDKNIIVFGIPNSTDSDAGNRKTFDENILKNLFSEIKINPDKIRRI FT HRFKNTNENNTKMETSTPLLVELPDSSDKFAILKAAKQLKDSESFKKVSIQ FT PDQNELERKFTKSLILKRNKLNEELNLKGQLNNPFRFGIRNNEVKKIRTN* FT " FT CDS join(798..1820,1801..3075,3003..3950) FT /product="CR1-60_HM_2p" FT /translation="MKSKKSEQISQKINQKDKNNLNQHMNSNDNISVCQPY FT DSLNIKTMLADSNINSKEVHLSRNFLPNNDLTQVNSVFNINSKKMLNINPK FT KPRNSIKCWYTNATSLNNKYNELIAEVEINNPDVMFICETWWNDCSPKSIQ FT GYSLFERERTYGKGGGVCIYVKDSIKSFEVSNSVLQSQQIEQIWCNIEVGN FT ESILCGCIYRPGISDIQNCIKITASIKVAYEMISNKTYSGILLNGDFNLAK FT IDWSEEFFAHLQEPNEIASMFIDCLDDCYLFQNINKPTFQVDDITETNVLD FT LVITENNLRIYKLEHHPPLGSIKHGHHVLTYNYNYSSSEEILAEFNKKKQS FT LTKKNNIYNKGNYDEINKYFDSTDWSRIFENLNANECYEKWLEHYKIICNV FT NIPLTKSNKKKRAPWFNKSLSEMIKAMIKAKKVLWIKCRNAKFRDQELEAK FT YKSIKNTTKKAIKNEIVKYEHTIAKNAKKNPKIVYQYMNNKTQTKDYIRAI FT ENNNGIVTTNLSEIANELNNFFSSVFTIEDKNLPEFPKQCDVLCPNPTFDI FT ETVSKKLATLNVYKSTGVDTIHPRVLKECCKSISKPLSIIFDRSFSSGIVP FT CLWLCANITPLFKKGDKLKVKNYRPVSLTSIVCKVYEESIIRDTLMNHLTK FT NSILSDSQHGFVSSKSCCTNLLETFDIITQALEENLSVDIAFLDFAKAFDS FT VAHKRLLLKLQSYGIQEQLLCWFKSFLSRRTQRVVLGEITSEWKKVISGVP FT QGSVIGPLLFVIYINVEKSNKWSTTRISYWTPFICHIYKLNDLTSEITNHC FT KLYADDTKIISVINTVEDSRTLQQDLCKLVEWSEKWQINFNKEKCKIMHIG FT KRNLKYNYLMCSTFDIIQQNALVMEETTLESDLGVIISNDVKWEKHINTIA FT ARANRKLGQIKKSFCRLDEISMKHLYTSLVRPHLEFAVPVWNPYYQKDIDK FT IEKIQRRATRINSLRSFCYEDRLKKFNLTTLKTRRERGDLIQFYKISKNMD FT KVKWNHPPKIIKYAITRGHTQKIQKQFTHSTTRFNFLTNRIVNNWNNLTEE FT IINSKNVNIFKNKFDRFEKNKIKK*" XX SQ Sequence 4118 BP; 1743 A; 550 C; 637 G; 1188 T; 0 other; cgcaattaaa gatggcaggc gtgtttttat aataataata aaagaaaaaa ctttttaaac 60 ttaatatatt gatatttact ttataaactt tatatattaa gatatggcac caaaagttga 120 ttatgtttct agaaaacagc ttgaagactt aattaccaga gtgtcagaat tagagcgtaa 180 agacattgaa aacttgtcga aaattgataa actgatgagt gaaaatgttt tattaacagc 240 acgagttaaa atattagaag aaaaacaaac aaatagcatt ttagattgga gtaaattatt 300 taataataag aagtgcgata aatctacaga agaaataaaa gtaacacaag gtattataga 360 tttaagtaaa gacgtaagta aaagagataa aaatatcatt gtttttggaa taccaaattc 420 aaccgattcg gatgctggaa ataggaaaac atttgatgaa aatattctta aaaatctttt 480 ttctgaaata aaaattaacc ctgacaaaat aagaagaatt catcgcttca aaaacacaaa 540 tgaaaataac acaaaaatgg aaacaagtac acctttgttg gtcgaattac ctgatagttc 600 agataaattt gcaatattaa aagctgccaa gcaactaaaa gattctgaaa gttttaaaaa 660 agtatctatt cagccagatc agaatgaatt agaaagaaaa tttacaaaga gcttgatctt 720 aaaaagaaat aaactaaatg aagagctaaa tttgaaaggc cagttaaata atccctttcg 780 attcggaatc aggaataatg aagtcaaaaa aatcagaaca aattagccaa aaaataaacc 840 aaaaggataa gaataattta aatcaacata tgaactccaa tgataatatt tctgtttgcc 900 aaccatatga ttctttgaac attaaaacaa tgcttgctga ttcaaatatt aatagtaaag 960 aagttcattt gagtagaaat tttttaccaa acaatgattt aactcaagta aattcagttt 1020 ttaatataaa cagtaaaaaa atgctaaata tcaacccaaa aaaaccaaga aatagtataa 1080 aatgttggta cacaaatgca acttctttaa acaataaata caacgaacta atagccgaag 1140 tagaaataaa taatccagat gttatgttta tttgtgaaac atggtggaat gattgttctc 1200 ccaagagtat tcaaggatat tcactgtttg aacgagaaag aacttatggc aaaggtggtg 1260 gtgtttgcat atatgtcaaa gactcaataa aatcttttga agtttcaaat agtgtgctgc 1320 aaagccaaca aatagaacaa atatggtgta acatagaagt tggaaatgaa tcaatactat 1380 gtggctgcat ttatcgtccc ggaataagtg acattcaaaa ttgtataaaa attacagcat 1440 caatcaaagt agcatatgaa atgatatcca ataaaacata tagtggcatt cttttaaatg 1500 gggattttaa cttagcaaaa atagattgga gtgaggaatt ttttgcccat ctgcaagaac 1560 caaatgaaat tgctagtatg tttatagact gcttagacga ttgttatttg tttcaaaata 1620 taaacaagcc aacgtttcaa gtggatgata taactgaaac taatgtctta gatttagtaa 1680 taaccgaaaa taatttgcga atatataagc tagaacatca tccaccgtta ggatcgatta 1740 aacatggtca tcacgtatta acatataatt acaactacag ttcatctgaa gaaatattag 1800 cagagtttaa caaaaaaaaa taacatctat aataaaggaa actatgatga aattaataaa 1860 tactttgata gtactgactg gtcaagaata ttcgaaaatt tgaacgctaa tgagtgttac 1920 gaaaaatggt tagaacatta taaaatcatt tgtaatgtaa atataccgtt aactaaatca 1980 aataaaaaga aaagagcacc ttggtttaat aaaagtctat cggaaatgat taaggcaatg 2040 attaaggcaa aaaaagtttt atggataaaa tgtagaaatg caaaatttag agatcaagaa 2100 cttgaggcga aatataaaag tataaaaaac acaacgaaaa aggctattaa aaatgaaatt 2160 gttaaatatg agcacacaat agcgaaaaat gctaagaaga atcccaaaat agtttaccaa 2220 tatatgaata ataaaactca aactaaagat tatattcgag ctattgagaa taacaatgga 2280 atagttacga ccaacctttc tgagattgct aacgaactaa acaatttttt cagctcagtt 2340 tttactattg aagataaaaa cctcccagag tttccaaaac agtgcgatgt tttatgccca 2400 aatcctacat tcgatattga aacagttagt aagaagcttg caactctgaa tgtttataaa 2460 tcaactggag tagacacaat acaccctaga gttttaaaag aatgttgcaa aagtatttca 2520 aaaccactat caattatttt tgatcgctca ttttcgagtg gcattgttcc atgtttatgg 2580 ttatgtgcaa atattacccc tctgtttaaa aaaggagata aattaaaagt aaaaaattat 2640 aggccggtat cactcacatc aatagtgtgc aaagtgtatg aggaaagcat aataagagac 2700 acactaatga atcatctgac taaaaatagt atactctctg actctcaaca tggttttgta 2760 tcatccaaaa gttgttgcac aaatttactt gaaacatttg acataataac tcaggcactt 2820 gaggaaaatt tatcagtaga tatagctttt ttagactttg ccaaagcctt tgactcagtt 2880 gcacataaga gactattatt aaaactacaa tcctatggga tccaagaaca attattatgt 2940 tggtttaaaa gttttctaag cagacgaaca caaagagtgg ttctcggtga gataacctct 3000 gagtggaaaa aagtaataag tggagtacca caaggatcag ttattggacc ccttttattt 3060 gtcatatata taaattaaat gatttaacaa gtgagataac taatcattgc aaattatatg 3120 cagatgacac aaaaattatt tcagtcatta atacagtcga agacagtcga acgttacagc 3180 aagacttatg taagctggtt gaatggagtg aaaaatggca aatcaacttc aataaagaaa 3240 aatgcaaaat tatgcatatc ggcaaaagaa atttaaaata taattactta atgtgtagca 3300 cattcgatat aattcagcaa aatgctttag ttatggaaga aaccacgtta gagagcgatc 3360 ttggtgtaat aatatcaaac gacgtcaaat gggaaaagca tattaacaca attgcggcaa 3420 gggctaatag aaagctcggt cagataaaga aatctttttg tcgattagat gaaatttcaa 3480 tgaaacattt atacacgtca ttggttcgtc cacacctgga gtttgcggtg ccggtatgga 3540 acccttacta ccaaaaggat attgataaaa ttgaaaaaat acaacgtagg gctacacgaa 3600 taaatagttt aagaagcttt tgctatgaag atagactgaa aaaatttaac ttgactactt 3660 taaaaacaag aagagaaaga ggcgatttga ttcaatttta taaaataagt aaaaatatgg 3720 ataaagttaa atggaatcat cctcctaaaa taattaaata tgcaataaca agaggtcata 3780 ctcagaaaat acaaaaacaa ttcacacatt ctaccacaag atttaacttc cttacaaaca 3840 gaatagttaa caattggaat aatttaacag aggagataat taattctaaa aatgtgaaca 3900 tcttcaaaaa caagtttgat cgtttcgaaa aaaacaaaat aaaaaaataa ataaaaaaaa 3960 aatatatata tatatataat atatataata tatatatata tattcattta aaaaaggaaa 4020 aaaaaaaact tttgcaacgg ctgtcaaagt tttaacatct acagatgtta gactcacaca 4080 ttgtatgtgt acagcaaaaa tttattatta ttttatta 4118 // ID Gypsy6-NVi_LTR repbase; DNA; INV; 474 BP. XX AC AAZX01003764; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6-NVi; KW Gypsy6-NVi_I; Gypsy6-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-474 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1125-1125 (2007). XX DR Genome; AAZX01003764; Positions 481 8. XX SQ Sequence 474 BP; 173 A; 43 C; 67 G; 191 T; 0 other; tgttatagtt cataaattag aatacacaca catatatata tatatatata tatatatata 60 tatatatata tatatatata tatatatata tatatatata tatatttagt ttcacttaaa 120 tctaaaagat ggcgctataa gaatagtaga catgttcgtt gctctaggcg tgctgctgcg 180 agagaaagaa tacgtttgac tgccgagcag tgtggacgca tggcgtcgtt aagttcgaat 240 cgggaaaaag cataattctt gttttaggca gcttgctgcg aacttattat tattattata 300 tttatttttg ggattatttt aaaataaatt tattattcta aaggtttaat gaaaagttga 360 ttagactgtg tgtgtgtgtt tattttatcc tgaccaatat atatatatat atatatatat 420 atatatatat atatatatat atatatatat atatatatat atatatatta aaca 474 // ID Copia-34_DPu-I repbase; DNA; INV; 5247 BP. XX AC ACJG01007327; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-34_DPu_; KW Copia-34_DPu-LTR; Copia-34_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01007327; Positions 11176 5930. XX CC Positions [2280-2813] - Integrase core CC 'AGAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 629..1642 FT /product="Copia-34_DPu-I_2p" FT /translation="MTIIRRLLINIFYFLSQEFDDEGEITNEAEIKQWTRK FT DVLARNCIMATTSKEMKENLYTCTSSAQMWTRLHQQYHLQTEEHLHVLWQN FT YYDYSYTDGNTYSVSLKVPFIDLLDLLSLRLGDDMRTYIQKLSITADKLRE FT RDQPLSEIQIVSKALTSLPETFRIVRSVWASVPAQDRTIDHLLQRLLTEEN FT VIKSYQKKERSTEGAFSANSGSRGGFGRGGYRGRGRGVRGGFVDKQTRGQY FT GANQDKRPRCNYCKIVGHIENECYKKHGYPDTRNRGNDKNEEGRTDNSLFS FT SSRFDPRSIFAFFADSGATKHMCDQRGFFTTFNPINTGTWSVSGNR" FT CDS 3292..5187 FT /product="Copia-34_DPu-I_1p" FT /translation="MVTDLPSSINSQDVPDDEQDFGSSFIPPVSIAHQDLA FT TDHAIQDDAIIPNCEPVEVEDAITRRNTQQTRQSSRKPKYSDKYRIYMESL FT AKQAILYGMPAEHQPEMSKPTEPCSYLEAITCEDAKFWIPAIAEEYDSLIR FT NSTWTLCQLPAGRKAIEGKWVMKFKPGFKTTAPRYKARFVIKGYSQVYGLD FT YTETYAPVVKNYSIRAIMAIVAARDLEIVQLDVKTAFLYGTLTEEIYMQQP FT EGFVIPGREQEVCRLIKSIYGLKQASRVWNIKFNEFILLFGLTRSTADPCV FT YYRHLRPGEADEEITIFILYVDDGLIISSLKTVLTEMMNFLGKEFEVRSLP FT ADRFIGMDISRNRVQRTIHLSQPEYTKTVLERYGMSNCATLSVPADPCVKL FT TPQKCPQTEEEKQEMKCVPFMECIGSIMHLTHLTRPDIAYAVGQASRYSQN FT PGQEHWKALKRIMAYLRKTVNFGLLFGGDNSELIGYCDADYAGDLGSRRST FT SGAVFTLYNGPISWFSRRQSCVALSTTESEFISAAEATKEAIWLNRILIEL FT GARALPVPLRCDNQGAIALIHDPVFHQRTKHIDVRFFFVRDAQAEEKINIS FT YIATESQLADIFTKALAAPRFEKLRASLNICEFK" FT CDS 1977..3146 FT /product="Copia-34_DPu-I_3p" FT /translation="MRDEEVILTGERKDDTLYQLDFQPELQTTETIYAHVC FT SDLFQDTALGADIRASLITWHQRLGHIGYHTLVKMIRQDLVAGLNIAGECE FT IPETLCSGCELAKFTRLPLKIGRHRASRIGELTHSDVWGPIATSSLGGARY FT FVSFKDDFSGFLTIYFMKRKNEVPAFIRLYHAMLLNETGFYMLTLRTDNGR FT AEYDNHEADKFLTEAGVRHETSAPHTPAQNGTAERVNRTLLNSVRAMLISS FT GLPPSLWGEAISYSVYIRNRVLSRTNETTPYALWNSRQPDVSAIRIFGSRA FT FIRDPTPSSKLKARSLEGVFVGRCINQNASRVYIPETRKVLISKDVKIDET FT ILYRDMTKDLPILTVKKKKKKKKKKKKYSKLFFWHHADIISFFSPTE" XX SQ Sequence 5247 BP; 1563 A; 1313 C; 1114 G; 1257 T; 0 other; ggttatgggc ccagaacaca aactgctgtg catattcaac aattagtatg gccgaacacg 60 ctgggaacga aattggatcc atgagggctg tcgcacatgt gcccaaattt gacggaacca 120 accatagaga atggaactac caactgcacc tcagtttcca aggaatggaa atcgatcaag 180 ttgttacagg aacagaagta cttcctgaag aggtaacgca gacactgtcc agctaaatta 240 aactcactat ctctttctca gtcaccgact acatttatct gttgaaagtt gaacctttct 300 ttgtggtgca atgcccagac acattacgca gtggaatttt tttttggttg ggtgtcgcat 360 accatgattt ttccacactg ttcaaagtct tcactccttt gtattgttac agtgcccacg 420 cacaacacaa atgtgaatct gttatatcac gttcattaaa aaaaaaggcc tcgagaacaa 480 atcctgatca tgggacacta aaacttgaca agccacatcc cagtcaattt acacgtgcat 540 attcaagaga gtacaatcct ttttattgaa aacagcccta acgcttcaca ctgcacatgc 600 agttgattta cgctcactgc acagtctcat gacaattatt cgcagattat taattaatat 660 attttatttc ctctcacagg aatttgacga tgaaggagaa atcacaaacg aagcagaaat 720 aaaacaatgg acgcgtaaag atgtactggc tagaaattgc ataatggcaa ccacatcaaa 780 ggagatgaaa gagaatctgt acacgtgcac aagttcagca cagatgtgga ccagactaca 840 ccagcagtac cacctacaaa ctgaagagca cttacacgtg ctatggcaaa actactatga 900 ttattcatat accgatggta acacatactc agtcagcctt aaagtcccat ttattgacct 960 cctcgatctt ttatcacttc gattaggaga tgatatgagg acctacattc aaaagctgtc 1020 gatcaccgcc gacaaactga gagagaggga tcaacctcta tctgaaattc agatcgtgtc 1080 caaagccttg acctctctcc cggaaacttt ccgcatcgtg agatccgtat gggcaagcgt 1140 tccagcgcaa gaccgtacta ttgaccacct cctccaacga ctgttaactg aagaaaacgt 1200 aataaagtcg taccagaaga aagaacgcag tactgaagga gccttttctg ccaacagtgg 1260 atcaagggga ggatttggcc gcggtggtta tcgtggacgc ggaagaggcg ttcgaggagg 1320 ctttgtggac aaacaaacaa gaggacaata tggtgctaat caagacaagc gtccgcgatg 1380 caattactgc aagattgtag gccacattga aaatgagtgc tacaaaaaac acggctatcc 1440 cgacacaaga aacagaggaa acgacaagaa tgaagaagga agaactgaca actctctttt 1500 ttcatcatca agatttgatc ctagaagtat ttttgcattt ttcgccgact caggcgcaac 1560 taaacacatg tgcgatcaaa gagggttttt cacgactttt aatccaatta acactggcac 1620 ctggtctgtt tctggtaaca gataaattca tttatttatt ttatggcatt tcttaaaaca 1680 aaaaattatt attgtggtgt ctcttccagg aattgggaac gcaaaactag atgtcctagg 1740 cgttggaagc atcaatatcg cagtcaaagt gaatggagtt accactccta gaattctgca 1800 ggatgtactc tatgtagctg gcctcggggt caaccttttt tcgataggag ccgcaacggc 1860 taacggacta aaggcatgct tcgaaaacaa caaggtagaa tccatatatc tctcaatata 1920 tacatgctta tgtatggcaa ctaacacacc ttcatttcat tctaaaggtc tcattcatga 1980 gggatgagga agtgatccta actggtgaaa gaaaagatga tacgctatat caactcgatt 2040 tccagccgga acttcaaaca accgagacca tctacgctca tgtttgctcc gatctctttc 2100 aagacaccgc acttggtgct gacatccggg cctctctcat cacttggcac caacgccttg 2160 ggcacatcgg ttaccacact cttgtcaaga tgatccgcca agacctagtg gctggtctca 2220 acatcgcagg tgaatgcgaa attccggaaa cactctgctc tggctgtgaa ctggccaagt 2280 tcacccgcct tccacttaaa attggtcgac acagagccag cagaatagga gaacttacgc 2340 attctgacgt ctgggggccg attgccacct caagcctggg aggagctcgt tattttgtat 2400 cattcaagga tgatttcagc ggcttcctca cgatttactt catgaaaaga aagaacgaag 2460 tacctgcatt catccggctc taccatgcca tgctcctcaa tgagacaggg ttctacatgt 2520 tgacgctcag aaccgacaac ggacgagcag aatacgacaa ccacgaagcc gataaatttc 2580 tcaccgaagc tggtgtccgt cacgagacca gtgccccaca cactcctgct caaaacggga 2640 cagcagaacg ggttaacaga actctactca acagcgttcg agccatgctg atatccagtg 2700 gcctaccacc atcattgtgg ggagaagcaa tttcatattc agtctacatt cggaacagag 2760 ttctctcgag aaccaatgag acaactcctt acgctctctg gaacagcagg caacctgatg 2820 tgtcggccat tcgcattttc ggatctagag ccttcatacg agacccgaca ccatcatcaa 2880 aactaaaagc acgcagctta gaaggagttt ttgttgggcg ctgtatcaac cagaatgcat 2940 ctcgagtata cattcctgaa acccgaaaag tgctcatcag caaagacgtg aagattgatg 3000 aaacgatact gtatcgagac atgacgaagg atctgcccat tctaaccgta aaaaaaaaaa 3060 aaaaaaaaaa aaaaaaaaaa aaaaaatatt caaaactatt tttctggcat cacgcagaca 3120 taatatcttt tttctctccc acagaatgaa ttagacatta taaatgacaa gcacgattcg 3180 ggtacttcca ttgccacccc cactgtcgtc gatggccatc aagacacaac agtacctctt 3240 actgctgatg atgacaatca atgcgttgac ggaacaatcc cgcaaggtga catggtgacg 3300 gatttgccat cttcaataaa cagccaggat gtcccagatg acgaacagga ttttggaagt 3360 tctttcattc ctcccgtcag cattgctcac caggatctcg cgacggacca tgcaattcaa 3420 gacgatgcca tcatccccaa ttgtgagccc gttgaagttg aagacgcaat cactcgccgg 3480 aacactcaac agacacgcca atcaagtcgc aagcccaaat actccgacaa ataccgcatc 3540 tacatggaat cgctagctaa acaagccatc ctttacggta tgcctgctga acaccagccc 3600 gaaatgtcca aaccaacgga accatgtagc taccttgaag ccattacctg cgaagatgcc 3660 aagttctgga ttccggcgat tgccgaagag tatgactctc taattcgaaa ctccacgtgg 3720 actctgtgtc agcttcctgc cggccgcaag gcaattgaag gaaaatgggt catgaaattt 3780 aagccgggat tcaagaccac ggcaccgaga tacaaagccc gcttcgtcat caagggctac 3840 tcacaagtgt acggcttgga ctacacggaa acctacgccc cggtggttaa gaactattcg 3900 atccgagcta ttatggccat tgtagctgcg agagatcttg aaatagttca attggatgtc 3960 aaaaccgcct tcctctatgg gaccttgacc gaagaaatct acatgcaaca gcccgaaggc 4020 ttcgtgattc caggcagaga gcaagaagtt tgtcgtctca tcaagagcat ctacggactt 4080 aaacaggcgt ctcgtgtttg gaacatcaaa ttcaacgagt tcattctcct atttgggcta 4140 accagaagca ctgccgaccc ttgtgtctac taccgtcatc tccgtccggg ggaggcagac 4200 gaagagataa caatcttcat cctctacgtt gacgacggtc tcatcatcag tagtctcaag 4260 actgttctca ccgaaatgat gaattttctc ggcaaggaat ttgaagttcg atcccttccc 4320 gccgaccgtt ttattggaat ggacatcagt cgaaacagag ttcagcgcac cattcatctt 4380 tctcagccag aatacacgaa gacagtgctc gagaggtacg gtatgagtaa ctgtgccact 4440 ctttccgttc cagccgatcc ctgcgtcaaa ttgacgccgc aaaagtgtcc acaaacagaa 4500 gaagaaaaac aagaaatgaa atgtgtcccg ttcatggaat gcatcggctc cattatgcat 4560 ctcactcacc tgaccagacc ggacatcgcc tacgccgttg ggcaagcatc aagatattcc 4620 caaaatccag gccaagaaca ttggaaagcc cttaagcgca tcatggccta cctacgcaaa 4680 actgtcaact ttggactgtt atttggcgga gacaacagtg agttgattgg ctactgtgac 4740 gctgactacg ctggtgacct ggggagtaga agatcaacct cgggagccgt gttcactctt 4800 tacaatggtc cgatctcatg gttcagccgc cgccaatcct gcgttgctct ttcaaccacc 4860 gaatcggagt tcatatccgc cgctgaggcg acaaaggagg ccatctggct caaccgcatt 4920 ctcatagagc ttggtgctcg tgcgttaccc gtccctctgc gatgcgacaa tcagggtgct 4980 attgccctca ttcatgaccc ggtattccat cagcgtacaa agcacatcga cgtacggttc 5040 ttcttcgtac gcgatgccca ggcggaagag aaaatcaaca tctcctacat cgcaaccgag 5100 agtcagctag cggacatctt tactaaggcc ttagccgctc cgagatttga aaaactgcgc 5160 gctagtctga acatctgcga attcaaatga aatctgctga acttgagggg cgcattatgt 5220 tccgttgtgc gctcggcttg aggggcg 5247 // ID Mariner_HB repbase; DNA; INV; 1283 BP. XX AC U68392; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Heterorhabditis bacteriophora mariner-like transposable element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner_HB; KW mariner-like transposable element; putative transposase. XX OS Heterorhabditis bacteriophora OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Heterorhabditidae; Heterorhabditis. XX RN [1] RA Grenier E., Abadon M., Laumond C. and Abad P.; RT "Mariner like elements in the entomoparasitic nematode RT Heterorhabditis bacteriophora."; RL Worm Breed. Gaz 14, 26-26 (1995). XX DR Genbank; U68392; Positions 1 1283. XX SQ Sequence 1283 BP; 416 A; 258 C; 249 G; 360 T; 0 other; tagtaagttg gtgcaaaaat aattgcgctt ttgctattta aaattttggc gcttattctg 60 gtttaaattt ttatttaact attattatgt aatatgccat tttaaagcac gaaaatcgct 120 ccactgattc atgttaatgg ttttgtcttc actgcgtgtt ttgcttattt tatgctatca 180 aattaaaagt gggaaaaagg caaattcgag tgattgtgtt gtacgaatta aaattaggaa 240 gtaaagcagt ggaaactgct cgcaatatta accaagcatt tagcgaggga atcatcaaca 300 aatgtatagc tcaacattgc cttcgaagac ttcgtaacgg agacgagcgc tttgaagatg 360 aggaaggtcg cgaatgttct ttggtgattg acgacaacca actgaagcca ttgttgaagt 420 ggagccatgc aaaacaacac gagaggttgt agaagaacta aacgttaact aatcagctgt 480 tgttcgacat ttgcaccaag tcaaaaaatc aacaaagctc gataaatggg tgccgcagga 540 gctgaacgaa taccaaaaaa atcgccgtta ccaaatatgc tctcgcttct gcggcaacaa 600 aaataatcta tttcttgatt gtattgtgac atgtgatgaa aactggattc tatacgacaa 660 ccgacggcgt tccacccagt ggctggacca tgatgaagct ctaaaacact tcaccaaacc 720 gaagctgcac caaaagaagg gtatggtcac tgtttgatgg ttggcaagtg aagtcatcca 780 ctacaacttc ttgtgtcctg gcgaaactat cacaacagag gagtattgtc acgaaatcga 840 caaagtgcat caagaactgc aacgactgcg tccagcactg gtcaatcgaa aagggccaat 900 cctcctccat gacaatgccc gaccacatgt ctcgcaaatg actctgcaga aattgaacga 960 actagcctac gagactctac cttacccagt ttactcacca gacctctctt ctaccaatta 1020 tcacttttta aagcatttca acaaccttct gcaagagaag gtttctaaca acaaaggagt 1080 tactcaaaaa gccttcgaag aattcatcgg ttccaggact ctataattct atgcaaccgg 1140 aataaataaa cttgtttgtt gttggcaaaa atgcgtagat tacagtggtt cttatttcga 1200 ctaatacttt ttattctaag ctgagatata ctactttaaa gttgatggtg gaaaagcgca 1260 attatttttg catcaaccta ata 1283 // ID Ginger1-10_HM repbase; DNA; INV; 6495 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6495 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 106-bp. XX FH Key Location/Qualifiers FT CDS 394..2730 FT /product="Ginger1-10_HM_1p" FT /translation="MVRHIDFITIENYVKEKIYPQEIRGDKGKKSNLRKAC FT KHFSIVCGQLMYRSEKLVIACKDQQQKIISDIHSGLSEDAKVKAMASHRGR FT DTTYQKISNRFYWHNIKFDVEEFIKKCMQCQKHGKIKSVSTELHSIPVKSK FT AMEQVGIDICNLPEVDGFKHLVVCIDYFTKWSEAKPLKNKTAESIALFLYE FT VICRHGCIKIQINDQGREFVNEVSKNLHTMTGTEQRVTSAYHPQSNGLCER FT QNRAIKEALVKVLDETAAEWPYIIEGILFAHRVSKHTSTQYSPFFLLYNRE FT PTLPVDIKYGLVDIKEHEMEPFDRETFNAVLVSSITLREEVYKSAYKNIQT FT AQNKQRKDYNKRHQVPLLIGVGQKVLLKNQKRMDRKGGKFSFKWLGPYTVH FT SISKNNLCCLVNQKGKILKTKYNVTLLKSFVDYNEKPLHDKMLNSCVKTEE FT NITSLWCMLSDEVVKMILTNAVQTSNNVTETYHSLSCTCIRFNKILQQKKK FT FLLPRIFLKFRDHEFSSLMQYKNIIKVSVQRLLKIFGSCSAVATSLYNIIN FT DSKWKLAWIVICAEKLSWFTVERIFWKSHQTIISVNAKEDLKPEPPWLDFP FT NLYYLKTEDKQILLSHNSWLNDSIMDAAQMLICKQLGPNAKFQSVLNVQKC FT SNSPYQPVNNEHIQLMHNGSCHWFLTFCSNRKVNICDSLYTSLNLVSLKCV FT NSLYKNNADSNGDLLINFLPVQKQTDDNSCGLFAVAFAAEILAGKPAIMTN FT FDVLKMREHLLLCLKSQYLSPFPKKKIN" XX SQ Sequence 6495 BP; 2354 A; 816 C; 927 G; 2398 T; 0 other; tgtagcagcc taaaagtact ccccgtaaat aagtactccc ggagtaatta tttcctagta 60 aataagtact cccccagtac ttttaggcta gtaaataagt actcccttta ggaaataagt 120 actcccgatt agaattcttg atccgaaaat ttaaactttg taaaacgacg atcgatttca 180 cattttgtaa ttttgcttat ggttagctgg atattttaaa aactaccaca aattatacac 240 ctaatagtat ttgaaataca ggtaataagt tgtactgaat cgagttacgg ggaacatcat 300 tttatagtat attttattca tcgaacttat ctcttataac aaatagctaa caattttaca 360 acatcaatat tctatacttt ttttaatagt aaaatggtac gacatattga ttttataaca 420 atagaaaact atgtaaagga aaaaatttac cctcaagaaa ttcgtggtga caaagggaaa 480 aaatctaacc ttcgaaaagc ttgcaaacat tttagcattg tttgtggcca attaatgtac 540 cgcagtgaaa agttagtcat tgcttgtaaa gatcaacagc aaaagataat atctgacatt 600 cattctggtc ttagtgaaga tgcaaaagta aaagcaatgg cctcacatcg tggaagagat 660 acaacatatc aaaaaatttc taacaggttc tattggcata atattaaatt cgatgttgaa 720 gaatttatta aaaaatgtat gcaatgtcaa aaacatggta aaattaaaag cgtctctaca 780 gaacttcatt ctataccagt taaaagtaaa gccatggaac aagttggtat tgatatttgt 840 aacttgccag aagttgatgg ctttaaacat ttagttgtct gtatagacta ttttacaaaa 900 tggtcagaag caaagcctct taaaaataaa acagctgaat caatagcact cttcttatat 960 gaagtaatct gtcgacatgg gtgcattaaa attcaaatta atgatcaagg aagagagttt 1020 gtgaatgaag tcagcaaaaa tctgcataca atgacaggaa ctgaacaaag ggtaacttct 1080 gcttatcatc cccaatctaa tggactgtgt gaacgtcaga atagagctat aaaagaagca 1140 ttagttaaag ttctcgatga aacagctgct gaatggcctt atattattga aggaatttta 1200 tttgcacatc gtgttagtaa gcatacttca acccagtatt cacctttttt tttattatat 1260 aaccgagaac caactttacc tgtagatatt aaatatggat tggttgatat taaggaacat 1320 gaaatggaac cttttgatag agaaactttt aatgcagttc ttgtctcatc tattacttta 1380 agagaagaag tttacaaaag tgcatacaag aatattcaaa ctgctcaaaa taaacagcgc 1440 aaagactaca ataaacgtca tcaagtacct cttttgatcg gagttggtca aaaagtcctt 1500 ttaaaaaatc aaaaaagaat ggatagaaaa ggaggaaaat tttcatttaa atggcttggt 1560 ccatacacag ttcattccat ttctaaaaac aacctgtgct gtttggttaa ccaaaagggt 1620 aagattttaa agacaaaata caatgtcact ttgctcaaat cttttgttga ttataatgaa 1680 aagccattgc atgataaaat gttgaattct tgtgttaaaa ctgaagagaa cattacaagt 1740 ttatggtgca tgttatctga tgaagttgtt aaaatgatat tgacaaatgc agtgcagact 1800 tcgaataatg ttacggaaac atatcattca ttgagttgta cgtgcatcag atttaataag 1860 attttgcaac agaaaaagaa gtttcttctt cctcgtattt tccttaaatt cagagatcat 1920 gagttttcaa gtttaatgca atataaaaat ataataaagg taagtgttca aaggttattg 1980 aagatatttg gatcctgtag tgcagttgct accagcttat acaatattat taatgatagt 2040 aaatggaaat tagcatggat tgttatttgt gcagaaaaac tctcttggtt tacagttgag 2100 cgaatatttt ggaaatcaca tcagacaatt atatcagtta atgcaaaaga ggacttaaaa 2160 cctgaacccc catggttaga tttcccaaat ctttattatc ttaaaacaga agataaacag 2220 attttgttaa gtcataactc ctggctaaat gatagcatca tggatgctgc ccaaatgctc 2280 atttgtaaac agctgggacc aaatgcaaaa ttccaatcgg tactaaacgt tcaaaaatgt 2340 tcaaatagtc cataccagcc agtcaataat gaacacattc aacttatgca taatggctca 2400 tgtcattggt ttcttacatt ttgctcaaat agaaaagtca atatttgtga tagtctttat 2460 acatcactga accttgtttc tttaaaatgt gttaattctt tatataaaaa taatgctgat 2520 agtaatggag atttacttat aaattttctt cctgtacaaa aacagactga cgataacagt 2580 tgcggcctgt ttgctgttgc ctttgcagcc gaaatattag caggtaaacc agcaataatg 2640 acaaacttcg atgtattaaa gatgcgagaa cacttgctat tatgcttaaa gtctcagtac 2700 ctttcaccat ttcctaaaaa gaaaattaac tgaaagtctg tgtttacagt ctatataaag 2760 aattagattt acatcataaa tctaaaaggt tcttggcttt tctttgagag ttttttaaag 2820 aatcttattg tgcttagaaa atacttaagt aaaagtcaac ttaaaactat taaccttaaa 2880 gttcttgaaa gaaattttgg cttattaata cagtttattt tcaaaaaact ggggagtttt 2940 ttacaaagaa gtgaaacaaa gctccatagt tcttcaaatt gatgtttttt aaatctcact 3000 tgtagtccct taaggaaatg tagtttactt gcatataaat ctttttaaaa gtggtgtctt 3060 ttttttttgg gcatttttgt tagttaaaag aaaattgtgt tttttagctt tttatatagc 3120 tagtattgat tcaacgcaac cattttttat agctagtatg aatgacataa tttttaaatt 3180 gtttttttat gaaaagaaaa aatttaaaca tttcagttgt gtttattatt tttgttgaca 3240 acaacaggga aaaaataaaa ctcaatgttt ttattttttc aaatttatat gtatgtatat 3300 atatatatat ttatatatat atatatatat atatatatat atatatatat atatatatat 3360 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3420 atatatatat atatatatgt atatatatat aatagatata taaatagata tttgttaaat 3480 atcatgtata ttattatata tatatatata atatatatat aaatatatat atatatatat 3540 atatatatat atatatatat atatatatat atatattata tatatatata tatatatatc 3600 gtaaacgatc cttacacatt agcatatatt tattccatgt tatgttgact gaaaggtaaa 3660 agctaagaac agaataaaat taaaatgtta ttatttcttt cttttaacta aaagattgct 3720 ggtccaaacc aaacccatag gtgatatcta atcctaactt aaccctgccc caactcaaac 3780 cctttgatgt gttatcaact ctcttacgtg ttctccctaa atctgtcgat atgcatatta 3840 ctccttgcat gttctatcta aatcagttta tacgttgtaa gaatcgttta aggaatattt 3900 tgttagcata tgactctttc tatgttgttg tgaaaggttt atcatttaga taaataataa 3960 ggttatttaa aacattggtt aaatggcaga ctgtatgaat tggaatgcag aatatgttgc 4020 catagaaatt tgtgatataa agtaatggaa attgttggta tttataaaag gaagattgtt 4080 ttgacaataa cctagcttgt agtgcgaatt ttttaggtaa gttaaacgat ttgagcttaa 4140 cttttttttt agaacaatga atattatata acaataaaat ataatggtta ttgaaaagaa 4200 aagtgtaaat atattaattt tagaaaaagt ctgtaaggag tgacaaaatt gtagcttgca 4260 acaaacctct tccagacttt aacatggtga ttgattgtca gaaatgtagt tggtctgcgc 4320 tagcaaaaga ttcaacatta agctatcttt gatatatatt taaatttcaa tatctttact 4380 tgcaacattg gtgagtatgt tactaatatt ttcttttata cgtgacttta ttttttaatt 4440 tatttggaaa aaagataaac tctgatacaa aaccattatt tttggttagg tacataataa 4500 gtttgtaaac tatagatggt aatcgtacta gtatctactc taggctttaa ttttgtttta 4560 tcaagccata ctccagttaa atattagttg attaatacta aaagcaaatt caggttttaa 4620 caatgtttta attattctag tcacattgag gctctcaacc ccgaatatat agccatatat 4680 atccagggct ttatctattt atatatataa tatatataaa tagataacga taaatatatt 4740 taaaaaaaga taaataggta tatatatata tatgaatata tatatatata tatatatata 4800 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4860 tatatatata tatatatata tatatatata tatataatca tttcattatt taattttaga 4920 tgattcttca aacgcaactc ttgcaatttc gttgtggctt atcgacgttt gttatattac 4980 ttatttatat caacagaata ctaaatctga aaaatgtttt atatattaca ggttgtgcac 5040 caggacaata tttttttatt tttattagct tataagaatt ttttatgtat agaggtattt 5100 agtaaataaa atttgaaata attatgaaaa aattcaaaca agacatactt gttaagaggt 5160 tatttcttag taattttaat agttaagaga tgaaagaaga gatttttgag ttttgttaaa 5220 ctattagagt ttttaaagaa tcattttgaa attgtttata tcattttgca aatatcaatg 5280 tttagatatt tattgatact aatgtcttaa atagggttaa tacattttta ttattatatt 5340 tgtatattgt gaagatcttg tttttgaaat aatatttaat atctttgaat attgttagct 5400 tgtttttaaa tgagtttgaa tattgcatat tgcggcaaga tgattcaatc aatagatagt 5460 atttttgtaa gatgattcaa tcaatatata gtctttttgt gaaaagtggt tttttcacaa 5520 aggatttaat ttaattaact ttgtgatatt taattaaact ttattatctc tcttgttgtt 5580 cttatgtgtc atcttgcggt cgttgaaaat ctattgatat tttttggata tcttaaatat 5640 tgagaccaat ttttctattt ctattttcta atgtttcaat ttttagaaga tcttttgttc 5700 atacaacaaa cccttcagat tctcaatttt gcggctttat gttaaatttt agcagtaaat 5760 agtttaataa ctattgatta ttaaaataga tctctacttc tatgggatta aaaaatatta 5820 aagtcttgga ggctcattaa agaatcttta ttggtgttct tttggcaata acttaaagaa 5880 cttttaaagg attgttaaga actatttcaa gttctttatg gatcctaact ttaaagtgtg 5940 ttcttcagta gaacctctga ggttcttgga agaatctttg aaaaacggct gtttcacctg 6000 cgcgacccta aaaaacccat taaggttcta tgaagatccg ttacatttaa gagtgtttgt 6060 caaatacttt gattaagcct ttaaataaat aatttcaaag aaatcacatc ttttaaaatt 6120 tcaaaaattt tatatatatg gttcaaaata attaaaatat aaattttagt ttaaaaacag 6180 atgctttaca aagtataagt tttgtaaaat cattattttt tgtatcaaat ggtgttttgt 6240 ctataattta caaatttaaa tcacaataat gccaaaagta ttagcactgc aaacacttag 6300 cgctgcctta aaaattgtat ttattaaata tcattagggg agtacttatt tcctaaagga 6360 gtacttattt cctagtaaat aaatactccc gggagtactt atttactagg aaataaatac 6420 tcccggagta ctaatttcct aggaaaaaag tactccggga gtacttattt acgggagtac 6480 ttttaggcgg ctata 6495 // ID RTE-2_PPac repbase; DNA; INV; 2928 BP. XX AC . XX DT 20-MAY-2010 (Rel. 15.07, Created) DT 20-MAY-2010 (Rel. 15.07, Last updated, Version 2) XX DE A family of RTE non-LTR retrotransposons - consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-2_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2928 RA Kojima K.K. and Jurka J.; RT "RTE non-LTR retrotransposons from nematodes."; RL Repbase Reports 10(7), 1061-1061 (2010). XX DR [1] (Consensus) XX CC ~99% identical to consensus. ~7-bp TSDs. The 3' terminus is CC composed by (AAAACTG)n microsatellites. CC This sequence was derived from sequence data generated by Genome CC Sequencing Center at Washington University School of Medicine in CC St. Louis. XX FH Key Location/Qualifiers FT CDS 113..2896 FT /product="RTE-2_PPac_1p" FT /note="includes endonuclease and reverse FT transcriptase domains." FT /translation="MACLLLATLNCLTLCQDSRIVELEEALIKIKCHVIGL FT AEVRRPNEASMDLAKSGSVLYHTQRLQNRMAGVGFIVSGDAKRKVVRFGGL FT TPRVCFLDIEVSNNSLLRIFQVYAPASSRDDSEYSEFVDQLEEAYHAPVSG FT SHRYRFVHKIVMGDFNGKIGCRQLGEGSVGAFGYGDRNIKGQLIVDFCERL FT KLKVGNSFFRKRPSRKYTWVAPNGRTANEIDFILCPKGVELRDVGVVNQFD FT FTTDHRLVRASISFPCRFPRRLNGLGFGDSHLDRNLFKIGLGVEIRKIGGN FT LEYGKLVEAINGASKIATIVTPRKGCISQATKDLIARKRDLKMHSAGLSRL FT QWLLLNRSIRLSFKNDRERLYLDYVSKAIENGKSYKKAAAKVAVGKGRIHK FT IRNEEGGWECTQAGISRAFKTFYNDLYAGDGGSSYRSEKERDPFIPLCIAE FT VEHALRHFKSGKAPGRDGVTGEMLRAGTDLLAPMLTGVFNNILTGGIPEGF FT GDSRTILLPKSGDLSLIKNYRPISLLPALQKALTGLINKRLAPVLDNRRSW FT EQMGFRGGHSTVDAIHTLKTVVANCADHHLPIYLLFVDISKAFDSVLGEAV FT FKALERDGVPSEIVTFLASLSRDSKSAIVVNGEEVEITVGRGVRQGDCLSP FT RLFTAVLDMAFSRLQWDKKGININGRFLSHILFADDAVLISHDERQIQSMA FT IELERELGKVGLKLNGDKTLGMTSKGNAREVRVAGKVVGLKNEVIYLGCGI FT SIDGKDGMEIGRRIQAAYGAFHKHRSFLINRSIPMVHKRRLFNGCILPAFL FT YGCETWALTEAQKERLSVAQRRMERWMVGCTVLDHLSNERLRGLTKVEDVV FT RASLKRKWLWVHKVANDYDLKWSRAVIEWTPRGPKRRRGRPNARWKDLFMR FT TVGPTFLRQARSHEWPAMHIRAYS" XX SQ Sequence 2928 BP; 836 A; 583 C; 793 G; 715 T; 1 other; cattccgagt gcggcgctca atacctgccg gtcggttcgc tttcgcgctc cgtctttatc 60 tcgttttccg ataccaagcc ggctggtcac cgtggtatcg tatgaccata tcatggcatg 120 tcttctactg gcaactctaa actgcctaac gctktgccaa gactcaagaa ttgtggaact 180 cgaggaggcg ctcatcaaga tcaaatgcca cgtcatcggt ctggcagaag tgagaaggcc 240 caacgaagct tcgatggatc tcgccaaatc tggatcagtg ctgtaccaca cgcaacgact 300 acaaaatcgc atggccggag tggggtttat cgtgagtggg gatgcgaagc ggaaagttgt 360 aagatttggt ggcttaactc ctagagtctg ttttcttgat atagaggtat cgaacaactc 420 cctcttacgt atattccagg tgtacgcgcc agccagttcg cgcgatgact ctgagtacag 480 tgaattcgtg gaccagctcg aagaagcata ccatgcgcct gtgtctggaa gccaccgata 540 tcggttcgtg cacaaaatag tgatggggga tttcaatggg aagataggat gcaggcaatt 600 aggggaagga tctgtggggg catttggata tggggataga aacattaagg gacaactaat 660 tgtggacttt tgtgaaaggc taaaactaaa agtaggaaat tctttctttc ggaagcggcc 720 atcacgaaag tacacatggg tagcgcctaa tggtcggacg gctaacgaga tagacttcat 780 tctttgcccg aagggagttg agctaaggga tgtaggggtg gtcaatcaat tcgatttcac 840 tactgaccac aggctcgtaa gagcatccat ttcattccca tgccgcttcc ctaggagatt 900 gaatggattg ggatttgggg attcgcacct ggatagaaac ctctttaaaa tcgggctggg 960 tgttgaaatt aggaaaattg ggggcaatct ggaatatggg aaactggttg aggcaataaa 1020 tggggcgagt aaaatagcaa ccatagtcac tccaaggaag ggctgcattt cccaagcaac 1080 taaagacctc atagcgcgca aaagggacct caaaatgcat tcggccggct tatcccgatt 1140 acaatggctc cttctcaata gatcaatccg actctcattc aaaaacgaca gagaaagatt 1200 gtatctcgat tacgtcagca aagccattga aaatgggaaa agttacaaaa aggctgctgc 1260 aaaagtggcg gttgggaaag gcagaattca taagattagg aacgaggagg gaggttggga 1320 atgcacgcag gctgggattt caagggcttt caaaaccttc tacaatgact tatatgcggg 1380 agatgggggg agtagttaca gatcggaaaa ggagagggac ccattcatac cattgtgtat 1440 tgctgaagtg gaacacgcac ttcgacattt caagtcaggc aaagcaccgg gaagggatgg 1500 cgtaacgggg gaaatgttaa gggctgggac tgacttatta gcacctatgc taacaggggt 1560 gtttaataac atactcacgg gcggtattcc agaaggtttc ggcgattcgc gcaccattct 1620 attacccaaa tcaggtgatc tgtctctaat aaaaaattat cggcccatca gtctgctgcc 1680 tgctctgcaa aaagccctca ccggtcttat aaataagaga cttgcacctg ttttggataa 1740 tagaaggagt tgggaacaaa tggggttcag agggggacat tccacggtag acgctattca 1800 cactctcaaa acggtagtag ctaactgcgc ggaccatcac ttgcccatct atctgctttt 1860 tgtagatatt tctaaggcat ttgattcggt gttaggagaa gcagtcttca aggctctgga 1920 aagggatggt gtcccatcgg aaatagttac cttccttgct tcattatccc gggatagcaa 1980 aagcgcaatc gttgttaacg gggaagaggt ggagattacg gtgggaagag gggttaggca 2040 aggagattgt ctgtccccca ggctgtttac agcggttctg gatatggcat ttagtcgact 2100 tcagtgggat aaaaagggta ttaacataaa tggtagattt ctctctcata ttctattcgc 2160 cgacgacgct gtactaatat ctcatgatga gcgacaaatc caatctatgg ctatagaatt 2220 ggagagagaa ttggggaaag ttgggttaaa gcttaacggg gataaaacac taggaatgac 2280 ttcaaagggt aacgcaaggg aggtacgggt cgcagggaaa gtggtggggt taaaaaatga 2340 ggttatctat cttggttgcg gaatttctat tgacggaaag gatgggatgg aaattggtcg 2400 gcgtattcaa gctgcgtatg gggcattcca caagcatcga tcattcctaa ttaatcgatc 2460 gattccaatg gtccataagc ggcggctatt caatgggtgt atacttcctg catttctata 2520 tgggtgtgaa acatgggcac taaccgaagc acaaaaggaa agattatcgg tggcacaaag 2580 acgcatggag cgatggatgg tcggttgcac tgtcttagat catttatcga acgaaaggct 2640 gcgcggactc acaaaggtag aggatgttgt tcgtgcctca ttgaaaagaa aatggctctg 2700 ggtgcacaaa gttgctaatg attacgactt gaagtggagc agagccgtga tcgaatggac 2760 tccacgaggt cccaagcgaa gacgtggacg gcctaacgca agatggaaag atctcttcat 2820 gcgaacagtc ggacccactt ttctgagaca agcaagaagt cacgaatggc ctgccatgca 2880 cattagagca tactcttgat aaatgtgtta aaactgaaac tgaaaact 2928 // ID BEL-102_AA-I repbase; DNA; INV; 6294 BP. XX AC AAGE02018809; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-102_AA_; KW BEL-102_AA-LTR; BEL-102_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6294 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018809; Positions 7282 13575. XX CC Positions [5352-5909] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 4923..6293 FT /product="BEL-102_AA-I_3p" FT /translation="MVQLEAYPNEYELLLKNQDVPLSQQMKIKKSSALYSK FT SPIIGEGNLIRMKRRTGEVDWIPNDMRFPVILPKSSCVTFLLIDWYHRKFK FT HANGETVVNEAMQKYAVSALRVLVRTVSTRCNWCRVYKAKPQTPPMGPLPA FT VRITPYVRPFSFVGLDYFGPLTVKIGRSNAKRWVALFTCLTVRAIHLEVTM FT SLSTESCKLAIRRFIARRGAPSEIYSDQGTNFQGASRELHEQIYAINSSLA FT GTFTNTTTQWKFNPPYAPHMGGAWERLVRSVKAALNATSVSKNPDEETLMT FT VMVEIESMVNTRPLTYMALENSEQEALTPNHFLLLSSSGVNQPSALPVDPK FT TVLRSNWNLVRAMLDRFWARWIQEYLPVISKQTKWFDEIRTLRVGDLVIVV FT QNDIRTRGKVIRVYPGKDGRVRKADVQTSTGIIQRPVTLLAVLNVQVPGNG FT IAEGTSCNTGGG" FT CDS join(337..3225,3229..4212) FT /product="BEL-102_AA-I_1p" FT /translation="MQRLAEEKELRARLLVEKEKQDKDMHDMAMRLEKERR FT EKAISDLMTLEKEFIDKKYQLLQARLEEDDDGISHRSHRSVRHGVDKVQEW FT MNAQPAVTLSSNSGGVGPTGLTSTASNVFHPPSSSTVNKNACSQPAGIAEE FT ISCATSFGPTFKDLISPVTTSYPGLPVETLTSTPSNVPSVPLQQSTPRSSA FT ANVLGTVSLLPTSIHPSASMPGTNTLHRSFYPNTTTISVSNRPNVVENVSH FT ALGGQGNVGTVIGSIYNRIPWASQPGGEMPVSRVSVPAPINPSWLEGLPAR FT PSVPASGPMCSSFPQSGYPSAPPGIPSITQSSLSLPSMVSSGVGGGIYNLS FT SQQYPGISVTSYQTPIFSVPQQPSIPMQQSFGPNSQQLAARHVVPKELPSF FT AGNPAEWPLFWSSYDTSTQMCGYTDAENLMRLQRSLKGEARKAVSCFLLHL FT SNVPEVMNTLHSLYGRPEAIIHTLLNEVRATPPPKPEKLETLINFGLAVRN FT LCVHLVSTRQDMHLSNPILLQELVEKLPANIKLDWALYKQRVLVADLRAFA FT DYMNVLVTAASSVTTIVEPAFQKAERQKVKTKAFVNPHSTVDSRPPWQSKE FT VPSSDVSKPDIKEDRPCVVCKTSGHKPKDCDMFKQKDLADRWKIASEVHLC FT KRCLYPHGRWPCKASTCGVDGCQQRHHRLLHPGEPQVGGNRTAMNYNTGTV FT AVHKQLRSQILFCIIPVLLHANGKSIRTFAFLDSGSDSTLIENSIVKQLGI FT VGQPAPLCIQWTNGVKRTEDMAQQVQLLISGLDSTKRFALNDVHTVRSLDL FT PRQSLKFDVLQRDFPHLRGLPVKSYDDASPGILIGLDNTKVKNTLKLREGK FT PDEPVAAKTRIGWVVFGRNRGSGENSSHRLLHLCKQSSDEVLHDLVKNFFS FT IEGTGVSSTNALDSADVRRARNILEETTERTSSGRFQTGLLWKYDHFQFPD FT SRRTAERRVCLEKRLQKSPELYDNVKQQIAEYQRKGYAHIATDKELEETDP FT KRVWYLPLGVVIHPKKPGKVRVVWDAASTVQGVSLNSVLLKGPDLLTSLPV FT VLSRFRQRQVAISGDIREMFHQVLIRPEDRHAQRFLWRDSPLLPMQVFVMN FT VATFGSCCSPCSLQYIKNLNADEQADEFPKAAEAVKKSHYVDDYLDSVDTV FT DEAIQLAEKVKAIHAKEGFEIRHWMSNSTEVIQRIGECSEDTDKIFVMDKC FT SSIERVLGMTWQPKEDVFSFSLQLRENLQSLITDDAVPTKRKALSLVMSVF FT DPLGLVAVILVHGKVLLQDIWRSTYICGRQ" XX SQ Sequence 6294 BP; 1774 A; 1450 C; 1576 G; 1494 T; 0 other; aacttcaaga atataaacat tctaacctct catctgccga cggtcgggat aagctccaga 60 aaatatgacg acattaccgt ctgagaccgt gaccaatgtc ggagaggcga cagagcatag 120 aaattgccgc gtttgcaaca gacggagtag accgcacagc cagatggtcc agtgcaacgg 180 ctgtaggcac tggtatcatt tcagatgcgc cgcggttggg gatagtattg ccaacgaggg 240 cagagtatat ctttgcggtt tctgttcaac atcacctggc tcgtaatagc acaatcgaca 300 acggcaagcg cacgagaggc caagttgagg ctcgaaatgc aacgcctagc cgaagagaag 360 gagctacggg ctagattgct ggtggagaag gaaaagcaag ataaggacat gcacgatatg 420 gcgatgcgac tggaaaaaga acgaagagag aaagcgattt ctgacttgat gacgttagaa 480 aaggagttta tcgacaaaaa ataccaactg ctgcaagccc gattagaaga agacgacgat 540 ggaataagcc acagaagtca ccgtagtgtc cgccatggag tggacaaggt ccaagagtgg 600 atgaatgcac agccagccgt gaccctgagt tccaattctg gaggagtagg ccccactgga 660 ctcacatcta cagcatccaa cgtatttcat ccaccctcat catcaacggt aaacaaaaac 720 gcttgcagtc aacctgcggg aattgctgaa gaaatctcct gcgctacgtc ctttggtcca 780 acattcaagg acctaatttc gcctgtgacg acgtcctacc cagggctgcc agtggaaacg 840 ctaacatcaa ctccaagcaa cgtcccatca gttccattgc aacaatccac accgaggagt 900 tcagcagcaa atgtgctagg gaccgtttcg ttattgccta cctcgataca tccatcagca 960 agcatgccag gtacaaacac tctccaccgt tcgttttatc caaacacgac aacaatatca 1020 gtaagtaatc gaccgaatgt tgttgaaaat gtaagccatg ctttgggcgg acagggtaat 1080 gttggaacag ttatagggtc tatttacaat cgaataccgt gggcaagtca acctggcggt 1140 gaaatgcctg ttagtagagt tagtgttccc gcaccgatca acccttcgtg gttagaaggt 1200 ttaccggcgc gcccctctgt accggcaagc ggccctatgt gctcttcgtt tccgcaaagt 1260 ggttatccct cagcaccacc aggtattccg agtattacgc agagtagcct ttctctaccg 1320 tccatggtat catctggtgt gggtgggggg atatataact tgtcctctca acaatatccc 1380 ggcataagtg taaccagtta tcaaactcca attttctcgg tgccgcaaca acccagcata 1440 ccaatgcagc agtcctttgg ccctaattct cagcaattgg ccgctcgtca tgtagtgcca 1500 aaggaactcc cttcgttcgc gggaaatcca gcagagtggc ctctcttttg gagcagttat 1560 gatacatcta cacaaatgtg cggatacacc gatgcggaaa acttgatgcg gctacagcgt 1620 agtctgaagg gcgaggctag gaaagccgta agctgcttcc tgctgcatct atcgaatgtg 1680 cctgaagtga tgaacacatt gcattcgtta tatggtcgtc ccgaggctat aatacacaca 1740 ctactaaacg aagtacgagc cactccaccg ccaaaaccag agaagctgga gaccctgatc 1800 aattttggac tggcagttag gaacctttgt gtgcacctcg tgagtacaag acaagatatg 1860 catttatcga atccgattct attgcaagag cttgttgaaa aattacctgc aaacatcaag 1920 ctggattggg cgctctacaa acaacgtgtc ctggtagcag atcttcgcgc tttcgctgac 1980 tacatgaacg tgttggtcac agcggctagc agcgtcacga ccatcgttga acctgcgttt 2040 caaaaggctg aacgacaaaa ggtgaagaca aaggcatttg taaatcccca ttcgacggta 2100 gattcacgac cgccatggca gagtaaagaa gtaccaagct cagatgtgag caaaccggac 2160 atcaaagagg atcgaccatg tgtagtgtgc aaaacatctg ggcacaagcc gaaggattgt 2220 gatatgttca agcaaaaaga tctagctgat cgttggaaga tcgcatctga agttcacctt 2280 tgcaagaggt gtttgtatcc acatggacgt tggccatgca aagcttcaac atgtggtgta 2340 gacggatgtc agcagcgcca tcatcgcctt cttcatcctg gcgaaccaca agtagggggt 2400 aatagaactg cgatgaacta caacacgggc acagtagctg ttcataaaca attacgtagt 2460 cagatactgt tctgcataat tcctgttttg ttacacgcta atgggaaatc aattcgaacc 2520 ttcgcctttt tggatagcgg ttcagattcc accctgatcg agaactcgat agtgaagcag 2580 ctaggcatag tgggccaacc ggctccgctg tgcatacaat ggactaatgg agtgaagcgt 2640 actgaagata tggcgcaaca agttcaactc ttgatttctg gattggattc gacgaagcgt 2700 ttcgcactga atgatgtaca caccgtaagg agtctagatt tgcctcgaca atccttgaaa 2760 ttcgacgtgc tccagagaga ttttcctcac cttcgcggac ttcccgtgaa gagctatgac 2820 gacgcatccc ctgggatttt gattgggcta gataacacga aggttaaaaa cactctaaag 2880 cttagagaag gtaaacctga tgagccagta gcagcaaaga cacggattgg ttgggttgta 2940 ttcggacgta accgaggctc aggagaaaat tcttctcatc gacttcttca cttgtgtaaa 3000 cagtccagtg atgaggtgct tcacgattta gtgaaaaatt tcttcagtat tgagggcacg 3060 ggggtatcat caacaaacgc tcttgattct gccgacgtcc ggcgcgctcg aaacatcctt 3120 gaagaaacta cagaacgcac ctctagcggt cgattccaga cggggctctt atggaaatat 3180 gaccattttc aattcccaga cagtagacga acagcggagc gccgttaagt ctgtctagaa 3240 aagcggctgc agaaatcacc agagttgtat gataacgtga agcaacaaat tgctgaatat 3300 cagcgcaaag gttatgcgca catagcaacg gacaaggaac tagaagaaac agacccaaag 3360 cgagtttggt atttgccttt aggggtcgtc atccacccaa agaaaccggg aaaggtgcgt 3420 gtagtctggg atgcggcctc aacggtacaa ggtgtctcac tgaattctgt tctgctgaaa 3480 ggacctgacc tactcacctc attacccgtg gtgctaagtc gattccgtca gagacaagtg 3540 gccatcagtg gagatatacg ggagatgttc caccaggttt tgatacgacc ggaagatcgt 3600 catgcacagc gtttcctctg gcgggacagc ccattgctcc cgatgcaagt attcgtcatg 3660 aatgtagcta cttttggatc ttgttgctct ccgtgttctt tgcaatacat taaaaatttg 3720 aacgcggacg agcaggcaga cgagtttcca aaggcagccg aggcggtgaa gaaaagccat 3780 tacgtcgatg attatttgga tagcgtggat acggttgacg aagcaataca gttggcggaa 3840 aaagtaaaag cgatacacgc taaggaaggg ttcgaaattc ggcactggat gtcaaattca 3900 actgaggtaa tccagcgaat tggcgaatgt agcgaggaca cagacaaaat attcgtgatg 3960 gacaagtgca gcagtataga acgtgttttg ggaatgacgt ggcagccaaa ggaggatgtt 4020 ttctcgttct ctctccagtt gcgtgaaaat cttcaatctc tcataaccga cgatgcagta 4080 ccgacgaaac gaaaagcttt aagtctggtg atgagcgtct tcgacccgtt gggattggtg 4140 gcagtgatcc tcgttcacgg aaaggtgttg cttcaggaca tatggcgctc tacatatatt 4200 tgtggacgcc agtgaaaccg cttatgcagc ctgcgcgtac ttccgtttag tggatcgtgg 4260 ggtcactcga tgttgtctag tgtcggcgaa atcaaaagtc gctccactca aaccgatgtc 4320 tattccacga cttgaactcc aagctgcggt tatgggagca cgccttatga aaacagtcgt 4380 taccaatcac acgcttaaaa tttgtcgcaa agtgctcgct taaaatttgt cgctggagcg 4440 actccacaac agtgctttcg tggctgagat cagatccacg gcgccacacg cagttcgtgg 4500 gattcaggat cggggagatt ttggaagcta ctgacatcga agattggaga tgggtcccaa 4560 caaaatgtaa tgtcgctgac gaagctacca aatggaacgc agggcctaat attgagccaa 4620 acggtcgctg gtttaaagga cccaattttc tatataaaga cgaagactgc tggcctcgac 4680 gtgaacctcc gttgcaagag tcttcagaag aattacgatc ggtgcacgta caaacccatg 4740 tcgtgatcaa acaggtgatc gagttcaagc gtatttcaaa atgggaacgt gtaatacgta 4800 ctgtcgcttt tgttcgtcgt ttctacgaga actgtcagaa gaagacgaaa ggtgaacgtt 4860 catatctcac actatctttg agtcgcgaag aattgaagaa tgcggagttc actgtgttga 4920 gaatggtgca actggaagct tatccgaacg aatatgagct attactaaag aaccaagatg 4980 tcccactgag tcaacagatg aagatcaaga agagtagtgc tctttatagt aagtctccga 5040 taattggtga gggtaatctt atccgaatga aacggcgtac tggagaagtt gactggattc 5100 ccaatgacat gcgatttcca gtgatcctac cgaaatcgag ctgcgtgacc tttctactaa 5160 tcgactggta tcatcgtaaa ttcaagcatg ccaacggaga aactgtagtg aacgaggcta 5220 tgcaaaagta cgctgtctcc gcacttagag ttctagtaag aactgtaagc acacgctgca 5280 actggtgccg agtgtataag gcgaaacctc aaacaccacc gatgggcccc ttaccagctg 5340 tacgcattac tccttacgtc cgtccatttt cctttgttgg gttggactac tttggtccgc 5400 taactgtgaa gattggtcgc agcaatgcaa aacgatgggt tgcactcttt acgtgcttga 5460 cggttcgtgc tatccatttg gaagtaacca tgagcctctc cactgagtcc tgcaaattgg 5520 caattcggag gttcatagcg cgtagaggag ctccgtcaga aatttattcc gaccagggca 5580 caaactttca aggagcgagt cgagaacttc atgaacagat ctacgctatt aatagctctc 5640 tggctggcac tttcactaat actacgacac agtggaagtt caaccctccc tatgcaccgc 5700 atatgggagg agcttgggaa aggctcgtgc gatctgtaaa ggcggctctg aacgctacat 5760 cagtgagcaa aaatccggac gaagaaactt tgatgacggt aatggtggag atagagagta 5820 tggtgaatac tcgcccttta acctacatgg cgctggagaa ttcggagcag gaggcgctaa 5880 ctccgaatca ttttttactc ttgagttctt ccggtgtgaa tcaaccatca gcgctcccag 5940 tagacccgaa aacggtgctg cgttcaaact ggaacttagt ccgtgcaatg ttagatcggt 6000 tttgggcacg atggatccag gaatacctac ccgtaatctc caagcaaacc aaatggtttg 6060 atgaaatcag aaccctgcga gtcggtgatc tagtaatcgt tgtgcagaac gacatccgga 6120 caagagggaa ggtgatccga gtttatcctg ggaaagatgg aagagtacgg aaggcggatg 6180 ttcaaacctc taccggcata attcaacggc cagtgaccct gctagctgtt ctgaacgtac 6240 aagtgccagg taatggtata gcagaaggaa cttcgtgcaa tacgggtggg ggca 6294 // ID Kolobok-4_TV repbase; DNA; INV; 5196 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-4_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-5196 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 121-121 (2007). XX DR [1] (Consensus) XX CC Kolobok-4_TV is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the T. vaginalis genome CC in a last few million years. The Kolobok-4_TV transposon is CC characterized by 12-bp terminal inverted repeats, TTAA target CC site duplications, and it encodes the 455-aa Kolobok-4_TV1p CC transposase. Kolobok transposons, including numerous families of CC non-autonomous elements, constitute >2% of the T. vaginalis CC genome. See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS 2723..4087 FT /product="Kolobok-4_TV1p" FT /translation="MPPPSRRKQQSREANEKSIEARKNSQEKNAPKEVDPK FT HWTASVIVNGDSYTRARNLFQDNNIKVPSEKEFYRHQKEIGKVILEYKEQS FT IKNAQQTMKKDTFLSTDSHYNVGRNATACQSLMMDNRGKVVGETTVIKKSS FT GGDFEGPSNMMETECTKRMMSNFDFTNVSTVVHDNDNKTKAIIQQYAPNAK FT ITLDPRHCVRAVGRAFEKLENGGYKEAAGKETVNQQKESESQPAFPIKSKA FT EEPKKRGRPLKHQEKVFHQIYSKVIKWFTFIIMLPIETTKKIFLWKNTLNH FT FIGRHENCLHEKDKEYPVFQPLTKPDLAKQFQRFTDDAAKLIPEIDPNAQT FT QQNESIHHSMLTQCPKGNNAKSFEMRTAVTILQKNEGEKCKQEIYNRVNNY FT QISPSIRELQKLEYEKNKSRNQKKATPETRKEINKARNAKRFPKDNPTGDY FT RGGKWTPQDI" XX SQ Sequence 5196 BP; 1973 A; 731 C; 737 G; 1755 T; 0 other; ggataccctt acgtgtcaga ttttttgcgc atacgaagaa gtcatgctag acatgagcta 60 taaacctgaa attttagcaa catatttaaa actttttaaa ctacaataca ctttaatatt 120 caagaacaaa aatattcttt ttagaaatta cataattatt ttatgtgaaa tctatataaa 180 aacatatttt gcgagcgtaa cttttttctt tataagcacc ttatatgact gataaactgt 240 gaaggcacat aattgttaat tcatagagtt gatatgtttc caaaattgag aggaaatttg 300 aaaattgaat accctttcaa ttcaattttc aaaagatatc agctccttca ataattgaag 360 gacgattttg atcatttgct cttaaaacaa gtatgccatg gagaaaaaag ttacgctcgc 420 aaatctgaaa tataactatc atataaaaaa attaaatttc taatttttta ctctatagat 480 attattggcc acttgtgtaa tgcataaaag tttatgtcat atgcatacat ttggaagtat 540 gggagacgag ggaaattact tctatgaggc actaggattt ttaaaaattt ctctatcgga 600 ctcaccattc aatttctcga aaattaaaaa aaatcaataa tagctaaaat gtattatgca 660 taacaaattt taatgttaca cggtattttt tgaacttcat gacatgaatt ttctttatat 720 tttttttttt ggattttcta tgatttatgc gataggctgg aacaacattg taagtgcaat 780 gatatcattt atagagaaat ctaaaacttc agcattcaga actcgcaacc acaaaaatat 840 tcatttgcta agtaagtaaa gttgattaag ttcgacactt agcttttata cgtaaaacca 900 agtgtgtatt gttatttctt gttaaattat tgagaaaaaa ataaattgta agagaaacga 960 tactgattat tgaaataata ttgagggtag ctgcagattt cttcattaac atgccggaaa 1020 tttgttaatc ttggttttct atcataaaat ctgtcgaaga atctaaaaac ggagatatga 1080 aatagagatc atgtaaatct agagtatcct gtcagaaaaa tagactaagc atgtaaccct 1140 tttttggtca taaacacttt ttaccactta gtcatacaga tgatcattaa atttaaacga 1200 gcactttttg tcagaattta ttcttttaca aagacagaag attttagtat tatttaatat 1260 tagtgctgta gcaaaattca aaaaacggtg aaaattcagt gaatttgtga attcacattt 1320 tttttcgaca tttttcttga tatatccatg cacattaaga acacagcttt gatacgctta 1380 aaaaagcata ttttatatga aattgttaaa gaaataaaca atttatataa aaattgcact 1440 tatgggcagt ttagatatat aatttatata taacatatat acaaatccat agacaaagtg 1500 caataactac aatgttgtag aaaattacag cagaaatact gactttttga aggataataa 1560 gaattcttct aaaaatactg aaaattaaca tatttctgta ctcaatttag tatatgcaca 1620 gaagttatga tttggtgtta ggtttttatg ccccaaaatc tagatttttg tgacatcaaa 1680 tcttattagt ggataaatca aagactgtca ttttgaaata agctcatatc aataaatatc 1740 catttaaaag ccttattgta atatgtaact agcaaaaagc ctaatattta tcaaaattaa 1800 tcaaatcaaa tgatcaatat tcatgatatg tcatcagcaa tagctaattg atattttatt 1860 ttgtattgat attcattttt ctcatattat acacgattat atctcttttt tttaaaaaag 1920 tttttatatt agacatgaaa gttgaattaa tacaattttg tactagagtt tgtttgtata 1980 ttaaataata atgatatata cttcaacata ttgcgtctca ccattaccaa aatgtaattt 2040 tgcatacaat tttctttcat ttaacaatga tgttatttga aaagtgatac ttccatgatt 2100 attgattaat gttccatcat atataggaat gaatgttaaa caagacacgc ctgcagctgg 2160 gtgcattcgt ggcattgtga taaggcttat atataaagac ttctctcgat cagcattacg 2220 tcaaatgaat attgacccat tattttggcg atgcgctagt tccaagtaga aatgctttgg 2280 tatatttagg ttatgtctat atgttatgaa tcatggtatt attgataaaa acgttcttag 2340 tcattaaatc aatcctagta tctaaagata taatttagat ttttttgttt attgctaaaa 2400 ttaaatacag tataactttc tttcatattg tcggtactaa caatacaaga ttgtgttgaa 2460 aaaaacttaa atatgcatga aaattaaatt taatgtgtaa atatgataaa aatttcaact 2520 ttactaataa catacataag tttctgatag tttaatatag ttcatattga tgtgactgta 2580 aactatatca aaaaaatatt tttaaatatt gttatgcaaa taatgggatt tctaaggaaa 2640 aaatgattat atttgttata taaagcaatc tttttaatta ttcactacgt ttattacatt 2700 ttttgccgac catttacgta atatgcctcc accaagcaga cgcaaacaac aaagtcgtga 2760 agctaacgaa aaatcaattg aagcaagaaa aaactcccaa gaaaagaatg cgccaaagga 2820 agtcgatcca aaacactgga ctgcttctgt aattgttaat ggtgactctt atactcgagc 2880 aaggaattta tttcaagata acaatattaa agtgccgagt gaaaaagagt tttaccgtca 2940 tcaaaaagaa ataggtaagg taatactaga atataaagag caatctatta aaaatgctca 3000 acaaacaatg aagaaagata catttttaag cactgattcg cattataatg ttggcagaaa 3060 tgctacagcg tgccaaagtt taatgatgga caaccgtgga aaagttgttg gtgaaacaac 3120 tgttataaaa aagagctctg gaggagactt tgaaggccca tccaacatga tggaaacaga 3180 atgtacaaag aggatgatgt caaattttga ctttacaaat gtttccacag tggttcatga 3240 caatgacaat aaaacgaaag ccatcatcca gcaatatgct ccgaatgcta aaattacact 3300 tgatcctcgt cattgtgtac gtgcagttgg cagagctttc gaaaagcttg aaaatggtgg 3360 atataaagaa gcagcaggaa aagaaactgt taatcaacag aaagaaagtg aatcacagcc 3420 agcattccca atcaaatcta aagcagaaga gccgaagaaa cgtggaagac cattaaaaca 3480 ccaagaaaaa gtatttcatc agatatattc gaaagttatt aagtggttca cttttattat 3540 catgcttcca attgaaacaa cgaaaaaaat tttcctatgg aagaatacat taaatcattt 3600 tatcggaagg catgaaaatt gccttcacga aaaagacaaa gaatatcctg tgtttcagcc 3660 acttacaaaa cctgatttag cgaagcaatt tcaacgtttt acagatgatg ccgcaaagtt 3720 gattccagaa attgatccta atgctcaaac acaacaaaat gaatcgatac atcactcaat 3780 gcttactcag tgtcccaaag gaaacaacgc aaaatcgttt gaaatgcgta cggctgttac 3840 aattttacaa aaaaacgaag gagagaaatg caaacaagaa atatacaatc gtgtcaataa 3900 ttaccaaata tctccttcaa tccgcgaact ccaaaaactt gaatatgaga aaaataaatc 3960 cagaaatcaa aagaaagcta caccagaaac aagaaaagag atcaacaagg caagaaatgc 4020 aaaaagattt cctaaggata accctacagg tgattacaga ggaggaaagt ggaccccaca 4080 agatatctaa ttttcaaatt tgaaataaac acatattttc tttgtatttg aaaatatttt 4140 tatattataa gatttttgaa cctataagta gcatgacaaa aattatataa tttgctcctt 4200 ttttatttta gtatacaaat atcccatttg ttatatgttc ggtatatata tgatgatatt 4260 gtaatcaatt atgcttaatg aagctttatc ccgtgaaccc ctcatattat tggaaaataa 4320 atagcgaaaa catgtagagc tgctatgttc tcaagtgtat gctgattttt ggattatttc 4380 atgaaaacta atggcgggat atcttgcatt atttaaaaaa attgaactat gcaaaaataa 4440 atagtattat attttttctc atgaaatgct attatgagac aaccataaaa aaccatatgt 4500 tgtattttta taatttgatt gatttttttt actatttcat taatctttta tgaataataa 4560 ttatttttaa tatatgtgct attaaaattt caaattttga ctaatatatc taagttatct 4620 gaacgaaaca tttagaaaat atcatactta cattgaaata gcaagtagag catattaata 4680 gcaagtttct tggctttata tctatcatct attgttgttt gatcaaacaa aattcactct 4740 attatatggt gtcaattaat taaaacaatt tttaatagtt atttatttag tgcgaataag 4800 tgcaaactca taaatgaata acttaatagt agaaatgaat tatctgatat aaatctcaca 4860 ttttgtccat cagaaatact ttaattgcat acgcataact attagtaata cgtctggagt 4920 gtcaattaat aacaatccca caacaagtaa aattcagatt tttattaaac ttattttttc 4980 ctaaaagttt tctcaaatta ctatatatac atatgtggat aaaataatag taatattgta 5040 aaaaattaga taattttttt tatttgagaa cctctcaaaa gttcattcaa tttatcaatt 5100 gtttcagttt gatgtatatt catttctgga gcctaaaact ttagcaatca tgacaatatc 5160 atattctctt tctaactacg caatgtaagg gtatcc 5196 // ID Gypsy-26_DPu-LTR repbase; DNA; INV; 324 BP. XX AC scaffold_5; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_DP_; KW Gypsy-26_DPu-I; Gypsy-26_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-324 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_5; Positions 2355732 2356055. XX SQ Sequence 324 BP; 67 A; 77 C; 88 G; 92 T; 0 other; tgcagacggc cccgtctgcc tccttacgcc atctatgaac ccctttctgt gttgtgtggg 60 tgccctgagg gcccttgtaa ttgtgaccgc cagtgggcgc aactgtcatg tgacagccgg 120 cacacgaaag agcagtcaga agatctcgtg tagtagttga gacgagtgcg tctctgtaag 180 acttactgtc gtaagcaagt tgagagagag ctttggctta aacacctggc tcactcgcgg 240 tatcgtgttt cgtgtaagta gaatatactc gtgtaagcac ccctattttg tgtgtcgcgt 300 gtttccttat tcgagtagat tgca 324 // ID BEL-6_SI-LTR repbase; DNA; INV; 401 BP. XX AC AEAQ01022971; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_SI_; KW BEL-6_SI-I; BEL-6_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-401 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022971; Positions 3 403. XX SQ Sequence 401 BP; 93 A; 119 C; 53 G; 136 T; 0 other; tgttctgata aaacctagca gttactgtta tattctgtta cattattctg tatctcgaag 60 tttctctttt cccatctgtt ttgcacatgt ctatctatac ctgatctcca tctgtttatc 120 attcctcaat ctttattcca ctcgtcctta tcgtccttcc ctcgtaaata ttatcttacc 180 tctccttgtc tccaacacta taacaatgtg ccccctctcc ataactcctg actgaatcct 240 gttctgcggc ccgaacgggc agtcgttctc tgactctgga gcaaaataaa tcgagattca 300 taacgtactc tccgcgcata ttacttttcc ttccccgaaa gattccgatg tcgccagcga 360 atcgccgcgc tcgtcgattc acataacaac ttcacaaaac a 401 // ID Copia-26_DPu-I repbase; DNA; INV; 4602 BP. XX AC scaffold_38; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: internal portion. XX KW LTR Retrotransposon; Transposable Element; Copia-26_DPu-I. XX NM Copia-26_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4602 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 715-715 (2010). XX DR Genome; scaffold_38; Positions 251765 247164. XX CC Positions [1882-2409] - Integrase core CC 'TTCAG' target site duplication CC LTRs are 97% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 565..3699 FT /product="Copia-26_DPu-I_1p" FT /translation="MWTRLTQQYEQNAAENLFVLIDRYYQYTFNENDNMMT FT HLAKIEGMVGQLRDLGSPIDDNQVVSKVLMTLPPKYANFRESWGLLATADK FT TKANLIAKLLLSESMLEHRQNQENVQSQERDIALIASRNAGPSRMKCSICG FT MNNHNHLDCRNRNKREFFKPHGPQQPKKPRVKCTYCHYQKHVYEDCRTRKR FT HEQEASTSQKQEQNNSVAHLASKQKEESDSEMFNLEHAFVSFSLGKLQADD FT WFADSASTEHMTEHRHYFTSFEPISHKWNVKGVGRDNQPLEVKGKGDIKIK FT ITVGTNVHYGILQNVLYVPGIGVNLFSIGKATDRGITAISEKDSVRMYRGT FT NLELIGTRSEKDLYQLNLLAVSPESSATSSSYQTVAGTNSLVFPAVSLNVW FT HKRFGYAHHAVIKKMEAKNSVIGLSLSNDPIDPEPCKGCALGKSHRQPFPT FT SGSRRATKVGEIGHSDLCGPMNVTSTTGSLYYVLFQDDASGFRVIYFLKTK FT SETFNYFKIYVARMLTETGHRISIFRSDNGGEFLDGDFQLYLSTNGIRHET FT CAPYTPEQNGVSERSNRTLMESAMSSLFDMDLPRDLWAEAANTSVYVNNRV FT HGRSTNSTPFEIWYCRKPDVSYFRIFGSFAYVHVPKQLRRKLDAKSQKLLF FT VGYSETQKAYRFFDRSSRRIIVSRDAIFCETPTTVPTVSENVAKPDQPIHL FT HVPVPSVNVVNSDGQIHLPQSTQTEPTSDCATESNSPFDPYDTVKNFHGFE FT AVLRRSLRIRTPKVIPSFLAKPNFPLLNDPDNLFTYNDAISGSNSKEWVTA FT MVTEIDALNRLKTWSLVPRPSDRRIVKCRWVYAVKRSITGLVEKFRARLVA FT KGFSQKAGIDFDETFSPVVKYDSLRIILSIAAAQNLDLFQFDVSSAFLYGE FT LTEEIYLEQPEGFCVSGREGDVYRLHKSLYGLKQAGRVWNAKFDGLVTTFG FT LIPSAVDPCVYRYQKDNVYLILCLWVDDGLLVCNTAQLVSDFFSYLQTHFE FT MKPKKVDRFVGLHISRDHANQKFFVSQPSYTANLCLLLI" XX SQ Sequence 4602 BP; 1361 A; 1009 C; 901 G; 1331 T; 0 other; ggttatgggc ccaggttccg taaaattttc taatacatgt aatttacctt tttagactga 60 aagcatggct tacaacgcca aaaagacaag tcacattgat aaatttaatg gaactgactt 120 cacattctgg aagaaacaag ttttgattgt actgaaagta cacaaattag acaaagttgc 180 tgacggcact tttatttgcc cagtccagga gattgatgct gatggcaatc ctctattgaa 240 cgaagctgga gtaccaacac agcaagaaac cattcaagaa tggaacgaca aagatttgca 300 agcacaagac ttgcttttca gcactactga tccaggtaca atctcaacga gattcatttt 360 tcacaatctc acagagattc tttttccaca atctctacga gattttattt gacacaatct 420 catagagatt ctcattgtat aatctcacag agattttact tcttataatc ctacagagat 480 tttattcaat atcacagaga ttaacttata tctttttgca caggtgttcg cagaattctc 540 ttagaatgca acacatcata tgagatgtgg acaaggctca cacagcaata cgagcaaaat 600 gctgctgaaa atctctttgt attgattgac cgttactatc aatatacctt caacgagaat 660 gacaatatga tgacccacct cgccaagatt gaaggaatgg tcggccaact aagggaccta 720 ggatccccaa ttgatgacaa tcaagttgtc tcaaaagttt taatgacact gccaccaaag 780 tatgccaact ttagagaatc atggggcctc ctagcaactg cagacaagac caaagctaat 840 ttgatagcta aactcttact gtcagaatcc atgctggaac accgacaaaa tcaagaaaat 900 gtccaaagcc aagaaagaga tattgctctg attgcttcaa gaaacgccgg tccctctcga 960 atgaaatgct ccatctgtgg aatgaacaat cacaatcatc ttgattgccg taacagaaac 1020 aagagagaat tctttaaacc tcatgggcct caacaaccaa agaagccccg agtgaaatgt 1080 acgtattgcc attatcaaaa acatgtatac gaagattgtc gtacccgcaa gagacatgaa 1140 caagaagctt ctacgtccca aaaacaagaa caaaacaact ccgtggccca cttggcatcc 1200 aagcagaagg aagagtctga cagcgaaatg ttcaatctcg aacatgcttt tgtttcattc 1260 agtcttggaa aacttcaggc tgatgattgg ttcgcagatt ctgcctcaac ggaacatatg 1320 acagaacatc gtcattactt cacctctttt gagcctattt cccacaaatg gaatgtgaaa 1380 ggtgttggaa gagataatca gccactagaa gtgaaaggaa aaggcgacat taaaatcaaa 1440 atcactgttg gtacaaatgt tcattacggg attcttcaaa atgttctcta tgtccctggc 1500 atcggtgtga acctattctc catcggcaaa gcaacagatc gtggaattac agccatctct 1560 gaaaaagatt ctgtccgtat gtaccgtgga accaaccttg aactcatcgg cacgcgttct 1620 gaaaaggatc tttaccaact taatctgcta gcagtgtcgc cagaaagctc agccacatcg 1680 tcatcatacc agactgtagc aggtacaaat tcgcttgttt tccctgctgt ttctctcaat 1740 gtctggcaca aacgttttgg ctatgcccat catgctgtta ttaagaaaat ggaagcaaaa 1800 aattctgtga ttggacttag tctatcaaac gacccaatcg atcctgaacc gtgtaaaggc 1860 tgtgcccttg gcaagagtca cagacaaccg ttcccaacca gtggcagtcg tcgtgcaacc 1920 aaagttggtg aaattgggca ttcagatctt tgtggtccaa tgaatgttac gtccactact 1980 ggttccctgt actatgtact ctttcaggac gatgcgtcgg gttttcgtgt catttatttt 2040 cttaaaacca aaagtgaaac atttaattat ttcaaaatct atgttgcacg catgttaact 2100 gaaaccggtc accgcatctc aatttttcga tctgataatg gtggggaatt tttggatggt 2160 gattttcaat tatatctttc cacaaatggc attcgtcacg aaacgtgtgc gccctacaca 2220 ccagagcaaa atggtgtgag tgaacgttca aacagaacac tcatggaatc ggcaatgagc 2280 agtctatttg atatggatct acctcgtgat ctgtgggcag aagcagcgaa tacgtcagtg 2340 tatgttaata atcgtgttca cggaagatcg acaaattcca ccccgtttga gatttggtat 2400 tgtcgtaaac ctgatgtttc ttattttcga atatttgggt ctttcgctta cgtccatgtc 2460 ccaaaacaac ttcggcgaaa acttgatgct aaaagtcaaa aacttctctt cgtggggtat 2520 agtgaaaccc aaaaagcgta tcgattcttt gaccgttctt ctcgccggat tatcgtcagt 2580 cgagatgcta tattttgtga aacaccgaca actgtcccaa ccgtttcaga aaatgtagca 2640 aaacctgacc agccaattca tttacatgtc ccagtccctt cagtaaatgt agtaaattcc 2700 gacgggcaaa ttcatttacc gcaatcgact caaaccgagc cgacaagtga ctgtgctact 2760 gaatctaatt caccttttga tccttatgat actgtaaaaa attttcatgg ttttgaggct 2820 gttctccgtc gttctcttcg catcaggacg ccaaaagtga tcccctcttt ccttgcaaaa 2880 ccgaattttc cactgctaaa cgatccggac aatcttttca cttataacga tgcaatttcc 2940 ggtagcaata gtaaagagtg ggttactgcc atggtgactg aaatagatgc tttgaatcgc 3000 cttaagacat ggtcattggt gcctcgccct tcggatagaa ggattgttaa atgtcgatgg 3060 gtttatgctg ttaaacgctc catcactggt ttggtcgaaa aattccgtgc tcgtctagtc 3120 gcaaagggct tctcccaaaa agcgggaatt gattttgatg aaacgttttc tcctgtcgta 3180 aagtacgact ccttgcgtat tatcttatcc attgctgctg cgcaaaattt agatttattt 3240 caatttgatg tttcttctgc ttttctttat ggtgaactca ccgaagaaat ttatcttgaa 3300 cagccagagg gcttttgtgt ttccggtagg gaaggagatg tataccgact acacaaaagc 3360 ttatacggcc ttaaacaagc tggaagagtg tggaatgcca agtttgacgg actggtaacc 3420 actttcggtc taattcccag tgctgtagac ccgtgtgttt accgttacca aaaagacaac 3480 gtttatctca ttctttgttt atgggtggat gatggattac tagtatgcaa taccgcccaa 3540 cttgtgtccg atttcttttc ttacctccaa acccattttg aaatgaaacc aaaaaaggtg 3600 gatcgctttg ttggtcttca catttcccgt gaccatgcca atcagaaatt ttttgtttca 3660 caaccttctt acactgcaaa tctatgtctg cttttaatat gacaaactgt gattcagtca 3720 gcacaccagc tgacccgaat agccgtttgt ctgtgtctaa actccctttg gattctttag 3780 taaaattccc ctaccgaaat gctgttgggg ccctcataca tttgtgtgtt acccgccctg 3840 atcttaacta tgccgtcggg caagttgcca aatattcttc caatcctgat caatcccata 3900 tcaatgcggt caaacgaatt tttgcttacg taaagggaac tgcagactat ggcatttgtt 3960 tcggcagtaa aaaagatgat tgtctaattg ctttctgtga cgcagactat gcgggagata 4020 tagacaccag acgatcaacg actgggtatg ttgtaatgtt gaacggaggt ccagtgactt 4080 ggggaggtcg acaacaacaa tgtgtttccc tatcgactac tgaagccgag tatgttgcgg 4140 cttgtgaaac aaccagacaa attgcctggt tgcgcaactt actgcaagat gttgggatag 4200 ttcaaaaagc tccaacgccg ttattttgtg ataaccaggg ggccattcat cttagtaaaa 4260 atcctgagaa ccacaagcgt accaaacatt ttgatgttca gtatcattat gttcgaaaaa 4320 gccaagcaga gggggtaatt agtacgcaat atgtgtcgac aacaaaacaa tctgccgata 4380 tgttcactaa agccttgtcc cgccccattt ttgaaatgtc tcgtgactct atatctgtat 4440 gttctattcc aggtctgatt cccacaacgg aattttgaat tgagagtttt gactcacctt 4500 ctaaattatc cctttcaaag gaatcaagat tcacggcgcc taaaaagaaa aatgtctcag 4560 aaaactcata atttgttttt cccgtttcct cttatgggga ag 4602 // ID Kiri-7_CQ repbase; DNA; INV; 1833 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1833 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 126-126 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >99% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 1..1668 FT /product="Kiri-7_CQ_1p" FT /note="reverse transcriptase." FT /translation="KNDPAGDVDADELNAFFSSGHRQLQAVDRSDNSSEPC FT HRTTADHGDNGFAFRHTDVNEISRKILEVQTNATGTDDIPISFVKLLCPFV FT LPMLAHLFNHIIDTSSFPAAWKKAIVTPIPKSSSPIQPKDFRPISVLPAVS FT KVLEKVLLGQISEHLNAAEPPLLAQNQSGYRKGYSTTTALTKVVHDVYSNL FT DENRCTVMVLVDFSLAFNCVNHQILRKKLNTEFKFTRSACDLITSFLSGRS FT QIVRFGNSMSAALDVPDGTPQGSCLSALLFSLYINSLPRNLKCEWQLYADD FT LQVYLSGPIAEVDRIVRDVNEDLAAIADWARVNQLFPNPKKTQAIVFNKTG FT TVTPQENIVFCDSIIPLSSQVINLGLHMDCNLTWKAQVNDVVRKVYHTLRT FT FRRFGSVLSLQTRRKLVQXVIVPFFTYCDTVYFPGLSAALREQLHRGFKSA FT LRFVHNLRRRDTTVALRNSIMGHDLPDNYQLRVCCFMKKAFEGTLPDYIMQ FT HLQRGRQERTAGFIIPRHTTSSGKSVLVHGASCWNSLPMAIKREARFNPFK FT RAVITHMQN" XX SQ Sequence 1833 BP; 474 A; 527 C; 408 G; 423 T; 1 other; aagaacgacc ctgctggtga tgtcgacgcc gacgaattga acgccttctt ctctagtggc 60 caccgccagc tgcaagctgt tgatcgtagc gacaactcct ctgaaccgtg tcatcgaact 120 accgctgacc acggagacaa tggttttgct tttaggcaca ccgatgtaaa cgagatctca 180 cggaagatcc tcgaggtcca gaccaacgct acgggcacag acgacattcc gatctccttc 240 gttaagttgc tgtgcccatt tgtactacca atgcttgccc acctgttcaa ccatatcatc 300 gacacaagtt catttccggc ggcctggaag aaggcgatcg tcacaccgat accgaaaagc 360 tccagcccaa ttcagccgaa ggacttccga cccatcagtg tcctgccagc agtgtcgaaa 420 gttcttgaaa aagttcttct tggccagatt tccgagcacc tgaacgctgc agaaccaccg 480 ctcctagcac aaaaccaatc gggttaccgg aagggttaca gcaccaccac ggccctcaca 540 aaagttgtgc acgatgtgta cagcaaccta gacgaaaacc gctgcacagt gatggtcctt 600 gttgactttt cgctggcgtt taattgcgtg aaccatcaga ttctacggaa gaagctcaac 660 accgagttca agttcaccag gtctgcttgt gacctcatca catctttcct cagtggacgg 720 tcccagatcg ttcgctttgg gaattcaatg tcagcagcgt tggacgtacc ggacggaacg 780 ccacaaggat cttgcctcag tgcgttactg ttcagcctgt acataaacag tttgccaagg 840 aacctgaagt gcgagtggca gctgtacgcc gacgatctgc aggtttatct ctctggcccc 900 atcgctgagg tagaccgaat cgtgcgcgac gtcaacgagg atctggcggc gatcgccgat 960 tgggccagag tcaaccagct ttttccgaac cccaaaaaga cccaggccat cgtgttcaac 1020 aaaaccggaa ctgtaacacc ccaggagaac atcgtcttct gcgactcgat catcccactt 1080 agcagccaag tgataaacct tgggctgcac atggactgca atctgacctg gaaggcacaa 1140 gttaacgacg tcgtcaggaa agtttatcac acgttgagaa ccttccgaag gttcggatct 1200 gtgctctccc tgcagacacg acgtaaactt gtacaakctg ttattgtccc attcttcacc 1260 tattgcgaca ccgtttactt tcccggattg tccgccgccc tacgcgaaca gctgcaccgt 1320 ggcttcaagt cggcactgcg gtttgtgcac aacctccgac ggcgggacac aacagttgca 1380 ctgcgtaatt caatcatggg acacgatcta ccagacaatt accaacttcg agtatgctgc 1440 ttcatgaaaa aggcgttcga gggcaccctg cctgactaca ttatgcagca tctccagcgt 1500 ggacggcaag agcgcaccgc cggtttcatc atcccacgcc acacgacatc cagcggcaag 1560 agcgtcctgg tgcacggggc atcttgctgg aacagcctac caatggcaat caaacgagaa 1620 gctcgtttca acccatttaa gcgagcagtc atcacacaca tgcagaacta gatccagatt 1680 agattttagt tttttttttc tcacccgttc agagaattga caatttcttt ggctccaaat 1740 gtacaatctc cttatccctt ccttgccctg atacaaagtg taaactaaca ggatatcgtt 1800 accaaataaa ttacaattac aattacaatt aca 1833 // ID hAT-N1_BF repbase; DNA; INV; 433 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N1_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW non-autonomous; hAT-N1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-433 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-433 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Direct Submission to Repbase Update (03-OCT-2008). XX DR [2] (Consensus) XX CC The genome contains several thousand copies of hAT-N1_BF. XX SQ Sequence 433 BP; 123 A; 87 C; 106 G; 117 T; 0 other; tagggctggg tatcggtaca gcgtaccggt acaaaaccgg tttttcttat tggaccggtc 60 cagaaaaacc ggacctgaaa aaattaggtg gaccggatgt tggaccgatt agaaaattaa 120 catattattt gatcaggcat tcacacgttt tggcgcttgc aggtggaaga aaataacaag 180 agtgaagtag agtagagttt atagtcattt ctaccaagtt ttacagtcaa tcgtacaggt 240 gcagttagcg ttgtaggatt ttaagacgcc agtgtaagtc taatactcca ccaaacagat 300 ttctttgtag tgaaatggac cattggtatg agtcatactg aatcaggtcc aggttcaggt 360 ccggacctgg acctgatcct ctggacctga accggacctg gacctgaatt ttctgtaccg 420 gtacccagcc cta 433 // ID CR1-27_CQ repbase; DNA; INV; 2382 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-27_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2382 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 31-31 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 100..2328 FT /product="CR1-27_CQ_1p" FT /note="reverse transcriptase." FT /translation="ARNCFRRVYACNXDTANLQPAELQPKTPGPAVTPVPE FT STESLTYDFTRCDFEAVNXALSQINWTGILAHCSLDEAVSRFYDALXDVIQ FT LHTPLRRLHPPSNRKPWWNATLRNLRNRLRKARTRLHKRRCQPNIAEVREL FT EELYESQQNAAFSEYIHRIETDVKQNPRSFWTFLKSRKKGSGIPTEISNGC FT VEANSVEEQASLFADFFSGVYSTSLPAPCNEVVDHIPTHDLHLPVVHFSIE FT DVSRALDDVDVTKGPGPDSIHPSFLKNCAASLALPTTIIFNRSLAESTFPT FT AWKLASITPIHKAGNVHKVENYRGISILSGFAKLFEHLVHGALYPVLKPII FT ANEQHGFVKRRSTTSNLLLFTSTLLTNIEKGQQIDAVYVDLSKAFDKVPHA FT LLLEKLRRYGLPEWIVRWIHSYLADRKAFVKVRCTSSHLLAIPSGVPQGSV FT IGPLLFILFVNDLCSVLESDKLMYADDLKVFRSVSSPLDSCALQQDIDRLL FT RWCTANGMEVNVQKCKIISFTRKRTPLLSSYRMGQNELERVNTIKDLGVTI FT DSKACFNEHIALTTAKAFATLGFLRRNAADFVDLFALKTIYCSLVRSQLEY FT AVQVWAPYYAVQAERIERIQRAFTRFAVRRLPWVRPQGLSSYDDNCERLKL FT PALASRRVLLQRIYVFDLLSCNIDCPTLRDKITLHIPARNLRNPAPFLEVP FT GHRTNYGYYSPLSACCRVFNDVADVFVFGMSKCVFKTRIKNRT" XX SQ Sequence 2382 BP; 649 A; 611 C; 509 G; 610 T; 3 other; ggataccatc gtagttgtag gcgattacaa tctgcctcac ctcaactggt ctttcgattc 60 ggacatcaac ggatacctgc caacaaacgc atctgctgag caagaaattg ctttcgtaga 120 gtctatgctt gcaacwggga tacagcaaat cttcaacctg ccgaactgca accgaagact 180 cctggacctg ctgtgacacc tgtaccagaa tctaccgaaa gcttaacata cgacttcact 240 cgttgtgact ttgaagcggt taacmtcgca ctttcacaaa ttaactggac tggcatactt 300 gctcactgct cactggatga agcagtttcc cgtttttacg atgcactatw cgacgttatc 360 caactccaca cgccgctgag acgactacat cctccatcga accgtaagcc ttggtggaac 420 gctacacttc gaaatctacg aaaccgactc aggaaagcaa ggacccgact ccacaagcga 480 cgctgtcaac caaatatagc agaagtgcgt gaattggaag agctctacga atcccaacaa 540 aatgcagctt ttagtgagta cattcatcgc atcgaaacag acgttaagca aaaccctcgc 600 tctttctgga cattcctgaa gagtagaaaa aaaggtagtg ggattccaac tgaaatctcg 660 aacggatgcg tagaggcgaa ctctgtggaa gagcaagcca gcttatttgc tgactttttc 720 agcggggttt actctacctc tctccctgca ccgtgcaatg aagttgtaga tcacatacca 780 actcatgatt tgcatctgcc agtagtacat ttctcgatag aagatgtatc aagagccctt 840 gatgatgttg atgttactaa aggtcccggc ccggacagta tccatccatc tttcttgaaa 900 aactgtgcgg cttctctcgc tcttcccaca actatcattt tcaaccgctc gctcgcagaa 960 tccacatttc ccactgcttg gaaattggct agtattacgc caatacataa agccggcaac 1020 gttcacaaag tcgagaacta ccgaggaatt tcaattttga gtggttttgc taaactgttt 1080 gaacacctgg tacatggagc actttatcca gtattgaagc caatcatcgc caacgaacaa 1140 cacgggtttg ttaagcgacg ctcaacaacg tccaaccttc tcctcttcac ttcaacactc 1200 ttgacgaaca ttgaaaaagg ccaacaaatt gacgctgttt acgtggatct atctaaagcc 1260 ttcgataagg taccgcatgc tctcttactg gagaaactga gacgctatgg ccttccagaa 1320 tggatcgtcc gatggattca ttcatacctt gccgatcgca aagccttcgt caaagtaaga 1380 tgcacttcct ctcatctcct cgcaattcct tccggagtcc cgcaaggcag cgtgattgga 1440 cccttacttt tcattttgtt cgtgaacgac ctgtgcagtg tcctcgagtc ggacaagctg 1500 atgtacgcag acgacctgaa agtgttccga tcagttagct cgcctctcga cagctgtgca 1560 ctccagcaag acatcgatcg cctgttacgc tggtgcactg ccaacggaat ggaggtcaac 1620 gtacaaaaat gcaaaatcat aagcttcacg cgcaaacgaa cacctctctt gagtagctac 1680 aggatgggcc aaaacgaact tgaacgggtg aacactatca aggacctggg tgtgacgatc 1740 gacagcaaag cgtgcttcaa cgaacatatt gcactgacaa cagcgaaagc ttttgccacc 1800 ctcggatttc ttcgtcgaaa tgcagctgac tttgtggatc tgttcgcgct taaaacaatc 1860 tattgttcgc tagttcggag tcaactagag tacgccgtcc aagtatgggc accttactac 1920 gccgtgcaag ctgaaaggat cgagcggatt cagcgagctt tcaccagatt tgctgtgcga 1980 cgcctgccat gggtgcggcc tcaaggactc tcctcgtacg atgacaactg tgagcggctt 2040 aaactccctg cgctggcatc acgacgagtt ctactacagc gaatttatgt attcgacttg 2100 ttgtcgtgca atattgactg ccctactctt cgcgataaaa tcaccctaca catcccagct 2160 cgtaatctgc gcaaccctgc tccgtttctt gaagttcctg gccatcgtac gaactacggt 2220 tattatagtc cccttagtgc ttgctgtcga gtgtttaatg atgtagcaga tgtgtttgtg 2280 tttggaatgt caaaatgtgt atttaaaact aggattaaaa ataggacata agttgtagtc 2340 tgtgcggcga gtgccgaaga tgaaataaat aaataaataa aa 2382 // ID R1D_NGi repbase; DNA; INV; 7301 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia giraulti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1D_NGi. XX OS Nasonia giraulti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7301 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 2855..6388 FT /product="R1D_NGi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="KLKIKMPGPHKKNVAEVQPEERSPTTPGNPLVAESGR FT LLTGAREDLEHPSVRTWPVRSGGGLRRLGEGEVGLRAASVRLVRDERIEKM FT TRITVEVEEPGSKNEIRFLQINAGGGQLVNAEICELIASKKIDIVLAQEPY FT SKVNKGSRYFTGLRRASRAICLKSAGTKTAPKAFVAVPNPDFHAFFVSALS FT TQHCVVAEVHTPSVTFFAVSMYFQFCDDIEVHLGQLEKVLENLRGQKVVIG FT IDANAESSLWSPRGTNEKGEKLERLIAAFSLHVVNDRTQPPTFEERGVSSY FT IDVTLVSGSMIAEVQSWKVKRDWTSSDHNAIIFKITTVAQTDRVDSSRFNT FT RRADWGLLDSTIKELSVSHLDHIILDSAEEVERMADALQKVLYEACETAIP FT RRRRIRKNNPWWTRELTDKKSELYSARRRMQQQWSLPGHNLRKAEYRALLR FT DYCRSVKGAKVGSWQEVVTVRGNEEPWGVVYKQLRGKLHNERTLSSVRCGD FT TESLSMLETANRLLEVHVPDDTPSNETPEQAQIRDSINSPPETEDAAPFEQ FT WEIALILASLKNNKAPGFDLLEVRILKAAIKAIPHHFLRLFNACLEHGVFP FT RAWKQASLIFLPKGGKESNDSKSYRPISLLPVTGKLYERLVKRRLSDTALG FT PDMISDRQFGFRAGMSTEDAIIELRKLTAASPKKQVAALLFDVKGAFDCIW FT RPAILQSLKEKDCPKNVYKLLVSYFEDRQAKVVWGTNHVSKQATRGCPQGS FT VLGPSGWNLGFDPLLRSLEQGVVTEGGGKLPINFVAYADDLAVLVEGDSRA FT EIEKVGKAVVKHIVEKCSAIKLEVSESKTVGIFVKKPKVIGSKAVKINRKD FT RRKGGARNPKIELGGKSISFEQSVRYLGVHFDANLGISAHCKYLREKLVPL FT FSDLRKLAQCQWGLGHRALETIYKGVFVPTVCYASAAWYKEGAHTDRILED FT LHRQILIAITRCYRSTSYEAACVLAGTLPICIQLRVSVAKYHLRKGEDAEI FT GGVVIRHDPEGLKENYNRVLEIANEMWQARWEASEQGNATRELFFPDVVAR FT VKSDWIRPDHFTAQVLTGHGYFNEKLHQLSLAKAAACFCCGEPDNNLHFLL FT ECPAFAEFRDELITPISGGLEAPEATLMLVSSPEGFAALKEYSRVAFECKR FT QLENALTESDDGLSSESEE" XX SQ Sequence 7301 BP; 1887 A; 1841 C; 1916 G; 1657 T; 0 other; ccggttggtt gaagtactca aggcaacagt tcttgtttgc agggtatgga tccttctgga 60 agtgccagtg atacagcgtc tacgtcgcgc tagatatcaa catctctccc acaagttggg 120 gggccctttc aaaagtgaag atggggccgc gagatggcta atagtattac ggctaagttc 180 gaacgcgcaa agatggggca gtggaatcac cttatgagaa caggacaaga ttcccctgta 240 tgtgtcagag ctgcaggctc ggtattgtat agcataaagg attacttggt ggttttagca 300 aatacgaatt tccgggagcg ctgagggttc tcccggggaa gcgctcccta aaagaagtgt 360 ggaaggaaaa atttttcctg cagaagcact ctcgatgatc ggtagatcta ttgttatgcc 420 gtatcgagct tgtatgcctg ggtcaagggc ccgactcagg ttgcgagccc ggaggcccaa 480 ttgtacacga acccgcccgt tcgaacacct caaccgttga tatacgtgag cctcctttgg 540 gtaagtgcta cgataatcaa cggttttcga aagcgaatct gcctctggca gcccaaactc 600 ccgtgcacct cttgaggcgc acgtattccc ttccttcctt ccttccttgc gctcccaaac 660 gcccgcgtaa ctccagtgag taacgtgtgg tttcttctag gccgggtagg aagttattag 720 gtgataggtt tagttaggat agaattagct ctggcactct gtgaccactt acccccgtgg 780 gtaacgccgt aattatctcg cgcggtcgcc tcgtcggctc gtggcggctg ggaactgcaa 840 gcgcgtaagc gcgagcggaa ctcggctgct gggggtcgac ggcgcgatag taggcctctc 900 cgatctagtc ttctatcctt ccttcttttc tttttcttcc accccgcgtg gcttgctctc 960 tgcgccttgg ctcttgtcag ggtgtagggg gcattctcac gttggggact tttcgagatc 1020 gaaagcaatg cgtgtggaat caaaaagcaa tgcgtgtggg atcaaaaaga aatgcgtggc 1080 agttgatact tagtcttaga ttcacttgcg tccgtaagac gagccgacca ttagggtcgg 1140 caagtcacgt taggaacttt tccagactct gaaaagcaat gcgtgtcgac tagcaaaata 1200 gattttagct tcactagcat ccataagacg agccggccat tagggccggc aagtcacgtt 1260 aaggactttt ttcaaatttc tcacttgaga aagcaatgcg tatcaactga caactaaatc 1320 atagccggcc attagggccg gcaagtcacg ttagggactt tttcacatct tccacgcgcc 1380 ccaagagtag aatcaccccg cgaatataaa ctcaacttcc tctcaacttc ctctctctct 1440 tttctatact ctcacacaca catatacact cacaacaaac acaaaacacg atgcaggccg 1500 cgcatctggg ccagtttcgc tgtttcagcg ttagggtgag accaggtaac tttgccggca 1560 tcgtttacat atacacacac acaaacaatc taaccaacta accacacgat acaggccgcg 1620 cacttgggcc aatttcgctg tcacagcgtt agggtgagac caagtaactt cgccggtatc 1680 aatcacgaac acacatacta acacttttcg atgcaggccg cgcacttggg tcagtttcgc 1740 tgtcacagcg ttaggacgag accaagtaac tttgccggca tcaatcacga acacatacta 1800 acacttttcg atgcaggccg cgcacttggg ccagtttcgc tgtcacagcg tcagggtgag 1860 accaagtaac tttgccggca taaatcacga acacatacta acacttttca atgcaggccg 1920 cgcacttggg ccagtttcgc tgtcacagcg ttagggtgag accaagtaac tttgccggca 1980 taaaataact ccttttctta acttagacaa cttctcttct cttctcctct aaacttagct 2040 aaatcaggca cgcgagcgag cgctcgccta ccatctgcgc actctaatat taattcgaaa 2100 agggattcgc gatcctctag ctcaaggcgg cctgagaaag ccaggctcga cgccagtttc 2160 cccgtgacag gcacctaaaa ctcttagcaa ctaatgtgtc ttccgtaagt ccgcgtgcgt 2220 gaccccatag gtaccgctat gttgcactga aaatcatctc gcgtaataag agctcgtcgt 2280 gctcatgtgc tgcgttgtct ttcctaagtt atctagcttg ggccgatgga gggatagcct 2340 tctcggcctt acagtgcaac tagccaggct tacatagcaa cctcaggtac tctttaagtc 2400 gaaaactcta gtcgcgtccg gcctgtcacg ggaagaaata gaaaacacga attaaagcca 2460 cgcgccgcgt atcgtgcagg gatgggtaaa cagtctgcgc ggaccatgta ggttcacgac 2520 ttcacagtcg ctaggaggcg gagccgtaag ccccgtgtaa aaccagaggc acctcctggg 2580 ccgcgtgata gtttgggagc ccgggccgtc agttagccag gcggttaggc tgctccgtaa 2640 cagcacgtgt aaaagcttcg tcgccggagc acttggttgg ctacctatgg ggacgtgtaa 2700 cggcacggag tgttgccgcg gctgctgaaa gttccgctct ctctcttttg ttcgcgcttt 2760 ttggcgcatg gtaaccgtgc aggcgtccac ccagtgggaa acaaactcga cctagattaa 2820 ggcttctaac aaatatgctt attcctgcag gtaaaaactc aaaatcaaaa tgccaggtcc 2880 acataagaaa aacgttgctg aggtgcagcc agaggagagg tccccaacca ctccagggaa 2940 tcccctcgtc gccgagtcgg gccgtcttct gaccggtgcg agggaagacc ttgagcatcc 3000 ctcagtcagg acctggccgg ttaggagcgg cggcgggcta cgacggcttg gggaaggtga 3060 agtcggcctc cgagccgcgt ctgttagatt agttcgtgat gagcgcatcg aaaagatgac 3120 acggataaca gtagaggtcg aggagccggg ctccaaaaac gagattaggt tcttgcaaat 3180 aaacgcggga ggggggcagt tagtaaacgc cgaaatttgc gaattaatag cgtcaaaaaa 3240 gatagacata gttctggctc aggagccata ctcgaaagtt aacaaagggt cgcgttactt 3300 cacaggtctt agacgtgcaa gtcgggcaat atgtctaaag agcgcaggca caaagacggc 3360 tcctaaagcc ttcgtagccg tgccaaaccc cgacttccac gccttcttcg tctcagcgtt 3420 aagcacccaa cactgcgtgg ttgccgaggt gcatacgcct agcgtcacgt ttttcgctgt 3480 ctcaatgtac tttcaattct gcgacgatat tgaagtacac ctcgggcaac tagaaaaagt 3540 attagagaac ctcagaggtc aaaaggtagt aataggcatt gacgcaaacg cggaatcctc 3600 gctttggtcc cctcgtggga caaacgagaa aggagagaag ctcgagcgac taatcgcggc 3660 tttcagcctc cacgtagtaa acgacagaac ccaacctccg accttcgaag agaggggagt 3720 ttcgtcctac atcgacgtaa ctctcgtctc ggggtccatg atcgcggagg tacagtcctg 3780 gaaagtgaaa cgggattgga cctccagcga ccataacgcg ataatcttta aaatcactac 3840 cgtagcccaa acggatcgag tggactccag tcgattcaat accagacgag ctgactgggg 3900 cctgctcgac tctacgatca aggagttgtc cgtttcccac cttgaccaca ttatcttgga 3960 tagtgcagag gaggtcgagc gaatggccga tgctctccag aaagtcctgt acgaagcgtg 4020 cgaaaccgcc ataccgcgta ggcgccgtat ccggaaaaat aacccctggt ggactcgaga 4080 acttaccgac aaaaagtccg agctctacag cgctaggcgt agaatgcagc aacagtggag 4140 cctccctgga cacaatttgc gaaaagcaga atatcgggct ctattgcgcg attactgccg 4200 atcggtgaaa ggggccaagg tcggcagctg gcaagaagtc gtcacagtgc gcggaaatga 4260 ggagccatgg ggggtagtct acaagcagct cagaggcaag ctgcataacg aaagaaccct 4320 cagttccgtt cggtgtgggg atacggagtc attgtcgatg ttggagacgg ccaaccgtct 4380 gcttgaggtg cacgtcccag acgatacacc ttccaacgaa acccccgagc aggcacagat 4440 tagagattcg ataaactcac cgcccgagac cgaagatgcc gcacctttcg agcaatggga 4500 gatagctctc attctcgcat ccctcaaaaa caacaaagct cccggcttcg accttcttga 4560 agtcagaatc ttaaaggctg ccatcaaagc cattcctcat cacttcctgc ggctcttcaa 4620 cgcctgccta gagcacggcg tcttccctcg agcctggaaa caggcttccc tcattttcct 4680 cccaaaagga ggcaaagaaa gtaacgactc gaaatcgtac cgacccatca gtctcctccc 4740 ggttacagga aaactttacg agcggttagt aaaaaggaga ctatccgata cagcgctagg 4800 accagacatg atctccgaca ggcagttcgg cttcagggct ggcatgtcta ctgaagacgc 4860 gatcatcgag ctgcgcaaac ttacagccgc ttctcctaag aagcaggttg ctgcgcttct 4920 tttcgatgtt aaaggcgcct ttgactgcat ttggcgcccg gccatcctcc aaagcctcaa 4980 agagaaagat tgtcccaaaa acgtatacaa acttctcgtt agctactttg aagataggca 5040 ggctaaggta gtttggggaa caaatcatgt ctccaagcag gcaactaggg gctgtccgca 5100 gggttcggtt ttaggaccct cgggctggaa ccttggattc gatccgctgc tccgcagcct 5160 cgagcaaggt gtagtgacag agggaggcgg aaaactccca ataaacttcg ttgcgtatgc 5220 ggatgacttg gccgtactag tcgaagggga ctctagggcg gaaatagaaa aagtaggaaa 5280 ggcggttgta aagcatatcg tcgaaaaatg ctcggctata aaattggagg tttcggaatc 5340 caagacggta ggaatcttcg ttaaaaagcc taaggtaata ggttcaaaag cggtaaaaat 5400 aaaccggaaa gaccgccgca agggaggagc gcgaaatccg aaaatagagt tgggcgggaa 5460 atcgatcagt tttgaacagt cggtacgtta tcttggcgtg catttcgatg caaacttggg 5520 cattagcgcc cactgcaaat atcttaggga aaagttagta ccgctcttta gcgacttgcg 5580 taaactggca caatgccagt ggggtctggg acacagggcg ttggagacga tatacaaggg 5640 tgtattcgtc ccaacggttt gctacgcatc cgcggcgtgg tacaaggagg gagcgcatac 5700 cgataggatt ctcgaagatc tgcacaggca gatcctcata gctattacac gatgttaccg 5760 atcgacatct tacgaggccg cgtgcgtact agcgggaact ctcccgatct gtatccaact 5820 tagggttagc gtggcgaagt atcacctgag aaaaggtgaa gacgcggaga taggtggcgt 5880 cgtaattaga cacgacccag agggtctaaa ggaaaattat aatagggttc ttgagatagc 5940 gaatgagatg tggcaggcgc gttgggaggc atcggaacag ggcaatgcca ctcgcgaact 6000 tttcttccca gacgtagttg ccagggttaa aagcgactgg atccgtccag atcacttcac 6060 cgcgcaagtt ctcacgggtc acgggtactt taatgagaaa ctccaccagc tctctttggc 6120 aaaggcagcg gcttgcttct gttgcggcga acccgacaac aacttacatt tccttctaga 6180 atgccctgcc ttcgctgaat tccgcgatga gctgataact ccgatttcgg gtgggctaga 6240 ggcgccggaa gccacgctta tgttagtatc ttccccagaa gggttcgcgg ctttgaaaga 6300 atacagtaga gtagcattcg aatgtaagag gcaattagaa aatgccctta cggagtccga 6360 cgatggatta agcagtgaga gtgaggaata gagtgactga ggtggtgggt gaaaggaaga 6420 gctgggtgaa agaatgtcta agttaggcaa acaaacgcga agtcaaaagc atgcttggct 6480 tggccctcgc gaaagtcgcc ttagggctgg actcgcgaaa caaactcgtt cgaaacttgg 6540 ccatcgtccg cgtctaacgc gtaaggcccc gtggaaggtc gctaaatgct tgacacgcga 6600 gtgcatgctc gatcgtaaaa atctggccgt cgaccgaatt gaccatcttg ggtctaccaa 6660 agaaaatagt aaattcaaga ataaatttgg tgtgcttgtg cgttactgca aatatagatc 6720 acgcctctcg aacgaggagt gtagttaggg cctttagaac cttcgcctaa aacatcatgg 6780 aggagatgtt gaaggtgtcc tgtccccaag cattgttcgc tggcgacaag ggctctggct 6840 gaaggattag aacagagccg ctcgttttgc agcggctcaa agcaggactt ttatcctgtc 6900 cccgagtgca cctttcgaag aaggtgcgcc cggcattatt ttattattac taacaacatg 6960 tatttctcct aacaggtaca aacaaaatta gtggagatcc gggcgggatc tcgcgaatgc 7020 gccttcccgt ggttccccgt ggacggtccg gtggatggta gagcttgctc accatcccgc 7080 tatgactgac taaagcattc gtcccagttg acagattgtc cccgcacagc catcctcgga 7140 agaccgggcg ggtacaatct gttgatcgcc aatgggcact tgaatttttc caggaacgtc 7200 ctccttttgg gtggttcgat agatggtgga cggaaaacaa ggtcgcgtat gcttatggcg 7260 aggtagcgag tccaaataac atcagggcta accgaaatta a 7301 // ID CR1-64B_AAe repbase; DNA; INV; 2646 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-64B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2646 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1151-1151 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >92% CC identity. The consensus is 5'-truncated and ~77% identical to CC CR1-64_AAe. XX FH Key Location/Qualifiers FT CDS 1..2574 FT /product="CR1-64B_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="EQSIVCVKLQKSSIYICGIYLRPNSQPVLYSSHSAAI FT QQLSERISRFDSIVVVGDYNLPQLIWQTDDDINGLLPSNASSEQEVTLLET FT MVASGFQQINNIVNSNGRLLDLAFVNDNNDVELLEPPTPLLRMDNHHKPFV FT LRIDVNDSCTQVGDSNHADDFDFRYCNFDELKSAIATVEWATLFIGKDTDE FT TVCTFYEVLHRILDEHVPRKRRRRTHPFKHPWWTSELQHLRNIVRKSRRRY FT FQSRTVENRDKLRLLEASYIDCQTASFRRYVARMETTAKEDPSAFWTFIRN FT RKRVNKFPAEMAFRDTVANSPEGIANLFADFLENVHCTNPPTFSPDSLRNC FT PTFDLNLEPFDFAQQDVLSALQKLDTSKGPGTDNLPPLLFKECADSFQIPL FT TIIFNKSLQSRTFPMLWKTASVIPIFKAGSTRGVENYRGISILCCLAKIFE FT ELVHNVLYTASQPLISQFQHGFVKKRSTTTNLMTFTSFLSTEIENKHQVDA FT IYFDFSKAFDKVPHDLAISKLRHLGFPNWIADWLRSYLTNRKAFVNINGTY FT SRVISVTSGVPQGSVLGPLIFVLFINDLCFRLKSGKLFYADDLKIYRTIAS FT HLDCCALQADVNELELWCQENGMELNIKKCKSIAFSRRQSRTEFDYKIGSE FT PLERVESIRDLGVIIDTKVRFNDHISVINAKAFAALGFVRRSTNDFNDIYA FT LKSLYCSLVRSILEYAACVWAPHHTTQIVRLEKVQRSFIRYALRQLPWSDP FT VNLPDYPARCMLINLEMLDARRNNLQRLFVFDLIMGNIDCPALLEDVQFYA FT PLRQLRERDLLQIRRHRTSYGFNNPLSRCFRLFNCASALFDFNVSKYVFKN FT RLKDF" XX SQ Sequence 2646 BP; 709 A; 641 C; 556 G; 740 T; 0 other; gagcaatcga ttgtatgtgt aaagctgcag aagtcgtcaa tttacatctg cggaatctat 60 cttcgtccca actcgcaacc tgtattgtac tcatcgcact ctgcagcaat ccaacagctc 120 agtgaacgga tctcgcggtt tgattcgatc gtcgtcgttg gagattataa cctccctcaa 180 ctaatctggc agactgatga tgacatcaat ggactacttc cttccaacgc ctcatccgaa 240 caagaagtca ccctgcttga aacaatggtt gcttctggat ttcagcaaat caacaacatt 300 gttaactcaa atggacgcct tctggatctc gctttcgtga atgacaacaa cgacgttgag 360 ctacttgagc ctccgactcc tctcctcaga atggacaacc atcacaagcc gtttgttctc 420 cgcattgacg ttaacgatag ctgcacgcaa gtaggtgatt caaatcacgc cgacgatttt 480 gactttcgat attgtaattt cgacgagctg aaaagcgcta tcgctaccgt cgaatgggct 540 acactgttca ttggtaagga cacggacgaa acggtatgta ctttttatga agttctgcat 600 cgcattctgg acgagcacgt tccgcgtaaa cgacgaagac gtacacatcc tttcaaacat 660 ccatggtgga catccgagct gcagcacctt cgtaatattg ttcgtaaatc tcgaaggcgt 720 tattttcagt cacgaactgt tgagaaccgt gacaagcttc gtctcctaga agccagctat 780 atcgactgtc aaacagcctc gttccgacgc tacgttgctc gcatggaaac gactgcaaag 840 gaagatccct ccgccttctg gaccttcata cggaatcgca aacgtgtgaa caaattccca 900 gctgaaatgg ctttccgaga caccgtcgct aactcccccg aaggtattgc taatttgttt 960 gcagactttc tcgaaaacgt acactgcaca aacccgccga cgttctcgcc ggacagtctc 1020 aggaattgtc ccactttcga tctaaacctc gagccttttg actttgctca gcaggatgta 1080 ttatctgcct tgcaaaaatt ggacaccagt aaaggccccg gtaccgacaa tcttccgccg 1140 ttactcttca aggaatgtgc cgattctttc caaattccat tgacaatcat tttcaacaag 1200 tcgctccaaa gcagaacttt tccgatgctt tggaagacag cctcagtcat cccgattttt 1260 aaggcaggct ctactcgtgg tgttgaaaat tatcgaggaa tttcaattct gtgctgtctg 1320 gcgaaaattt tcgaagaact ggttcataat gtcctgtaca ctgcatctca accgctgata 1380 tctcaattcc aacacggatt tgttaaaaag cggtcaacga caacgaatct catgacgttt 1440 acgagtttcc tctcgacgga aattgaaaat aagcaccaag ttgatgccat atactttgac 1500 ttttccaaag cattcgacaa ggtaccccat gatctcgcca tttctaagct cagacacttg 1560 ggctttccaa attggatagc tgattggctg cggtcatatc tgacaaaccg aaaggcattt 1620 gttaacatca atggcacata ttctcgcgtc atctctgtaa cgtctggtgt gccgcaggga 1680 agtgtgcttg ggcctctgat tttcgttttg ttcataaacg atctgtgctt tcgtctgaaa 1740 tccggaaagc tgttttacgc cgacgaccta aaaatttata gaaccatcgc atctcacctg 1800 gactgctgcg ctcttcaagc tgatgttaat gaacttgaat tgtggtgcca ggagaacgga 1860 atggagctca atataaaaaa atgtaaatcc atcgctttct cccgtcgaca atcacgcacc 1920 gaattcgatt ataagatcgg atcggaaccg cttgaacgcg tggaatcaat tcgtgatctt 1980 ggagtaatca tcgatacaaa agtgcgtttc aacgatcata tctctgtaat taatgctaag 2040 gcgttcgcgg cccttgggtt cgttcgtcgc agtacaaacg atttcaatga tatttatgcg 2100 ctcaaatcgc tgtactgctc gttagttcgc agcatattag agtacgcagc atgtgtatgg 2160 gcaccgcatc acaccactca aatcgtcagg ctggagaaag ttcaacggag ttttattcgg 2220 tacgcccttc gacagctgcc ttggtcagat cccgtcaact tgccggatta cccagcacga 2280 tgtatgctca tcaatctgga gatgttggac gccagacgta acaacctaca aaggctcttc 2340 gtttttgatc tcatcatggg aaatatcgat tgtcctgcac ttctggaaga cgttcagttc 2400 tacgcgccgc ttcgtcaact gagggaacgt gatttgctgc agataaggcg tcacaggact 2460 tcttacggat tcaacaatcc gttgtctaga tgttttcgtt tgttcaattg tgcgagtgcg 2520 ttgtttgatt ttaatgtgtc taagtatgtc tttaagaata ggcttaagga tttttagctt 2580 aagaaacagt ctgtggaatg taacaattcg agacggtgac aataaataaa taaataaata 2640 aataaa 2646 // ID BEL-1_HAS-I repbase; DNA; INV; 5911 BP. XX AC AEAC01014393; XX DT 27-JAN-2011 (Rel. 16.02, Created) DT 27-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE LTR retrotransposon from the Harpegnathos saltator genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_HS_; KW BEL-1_HAS-LTR; BEL-1_HAS-I. XX OS Harpegnathos saltator OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. XX RN [1] RP 1-5911 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Harpegnathos saltator genome."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AEAC01014393; Positions 7483 13393. XX CC Positions [4918-5505] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1300..2703,2707..3660) FT /product="BEL-1_HAS-I_1p" FT /translation="MKMTIAALSHLQRTPAELWDDHLVHIMTQSLDPSTRK FT AWILHSSDGNSMPSFDELDQFVASRIHALEDFSQSPPNKAANKANSVSRVH FT VATASISSRSSCPICKTSHFLSACPSFTRGSPEQRRELAKKHRRCFNCLSQ FT NHSAQECKSKFSCRICQKRHHSSLHAAPESRKGVAGGESSAERASRTSDSI FT DGVHSMLASSLGERRVPVLLATARVNVSSACGRSVVVRALLDQGSELTFVT FT ENVTQMLRAKRLRMPVSVSAIGDVSAGKFHYATEIILSPLSSSSAPLTTTA FT VILKSLTSYAPKRAIDSRCLQHLSGLPWADPDPLGDDPIDLLIGAELYGEI FT LRDGLRRGAAGQPIAQNTTFGWILSGPLAHPELIDSRSAPGSARDASVVTA FT SVFHCLTTPTLDVELRRFWKVEELPRVSLRSPEDERCEAHFKSIHSRSSDG FT RYIVRLPFKRDPPIDIGDSRFRARCFISISRRLQKNSEQCRAYCNFLREYE FT ALGHLKRVGNSTNSNDLEPIVFTPHHPVFRVNSLMTSLRVVFNASSPTSNG FT SSLNDHLLAGPKLQTELPAVILRWRNFKYVYVADIAKMYRQIKIDFRDLNF FT QRILWAENPRDPPMEFQLLTVTYGMSCAPFLALHVLQQLTLDKGDEFPFAG FT AVLQDNVYVDDVLFGGNHLRDVIRIHAQLISLLRRDGFELRKWSSNSPELL FT RDIDESNYGLACSKTLAADESIKILGIAWIPSVDAFQISVSLSDPIATSKR FT SILSTIAETVRPSRMGHSGYHYRQNSNATTLVRKS" XX SQ Sequence 5911 BP; 1220 A; 1641 C; 1490 G; 1560 T; 0 other; atcagtgagt gccaactaat ctaatcaggc attttttttg ttctaatcga gaatccgttc 60 gaagcatcat tttgtgtatt ttcgagaatt ttccaccgcc ggaactctct ccgatttctg 120 tcgttgtccc ggacggacac tcagttcttg ctcgctgtcc tgaacggact ctcgattaat 180 cattcttttt cttattgttt ttttttgttt cgcgaatttt gtctaaatct taaaactttg 240 taacgtccta cgccgattat cggttccgat agccgggcac gccgatcatc cgccgctcac 300 gaaaatcgct gacgtcgcga gatcaactgc tcgaagctcc gaagcacttc tcccggcgct 360 ccgattaatt gtaactcggc taaatacatt tatttaagag catttcgccg cagcgaccga 420 gattcatttc gattcttttt ttgttggtgc aacccgtcct ctgcgatccg ttccatactc 480 tcccgcgctt cttgcgaagc ggtcccggca tgcgttcgct tccttttacg cgcgaacaat 540 ttctgagcct tcgagccgga tcggtgaggg tggacgtgaa tctcaaactg cagagggatc 600 ggtgatccaa atggcccaat ggagcgacag atgcaagtgc agcgccaaat cgcacgcgct 660 cttgaaaact tcaagaagct gggacgcaat aattacaccc cggcggtcgt tcgaaatcgg 720 atccgaacgg tgaaggagct ttggctgcaa ttaaatgacg gacatgtgat gctggaaaat 780 tcggtaccag aagctacgtg ggctaatgta agctattttt ctgataaaat ctacgacacc 840 accgagaccg tttatcacgc agcgctggac ttcatgaccg agatcctcga ggaattggag 900 cctcccgtga gcccaaattc tatcgattcg tcttatcagc gcttgcccta gtccgcgttt 960 tcgttgtctc atttgccgcc tatacaattg ccgccgtttg acgggagcgt gacggaatgg 1020 gagccgtttc gtgaccgatt tacggcgctc ataatcgaca acaaagaatt gaatgatttt 1080 gcgaagatgc attttctcgt atcttttctt cgcggtcgag ctctcgagta tctcgctgat 1140 ttcgccgtca ccgcggataa cttcgcgggt gcgtggaaaa cgatgatgga tcgttacgac 1200 aacaaacgtc gcttgttgtc ggctcacatg tcgaccttgt tgagtttgcc gcgtttatct 1260 cgtgagtcta tgtccgatct ttaatctctt cgcggaaaaa tgaagatgac catcgctgcg 1320 ctaagccact tgcagcgcac tcccgccgaa ttgtgggatg atcatctcgt tcatatcatg 1380 actcaaagtt tggatccgag cacgcgcaag gcatggattc ttcattcgag cgacggtaac 1440 tcgatgccat cattcgacga attagaccaa ttcgtcgcgt ctcgtattca tgctctggag 1500 gatttctctc aaagtcctcc caacaaggcg gcgaataagg ccaattccgt ctctcgagtg 1560 catgttgcga ccgcttcgat ttcgtcgcga tcgtcttgtc cgatctgcaa gacgtcacac 1620 tttctaagcg cgtgtccatc gtttacacga ggatctccgg agcagcgacg tgagcttgct 1680 aagaaacatc gtcgatgttt caattgtttg agtcaaaatc attcagctca agaatgcaag 1740 agtaaatttt cttgtcggat atgccagaaa cggcatcatt cctcgcttca tgctgccccg 1800 gagtcgcgta agggcgtcgc tggtggggag tcgtcggcgg agcgagcgtc gaggacttct 1860 gattcgatcg atggggtgca ttcgatgctc gcttcttcac tcggggagcg acgcgttccc 1920 gtgctacttg ccacggcccg cgttaacgtg agctcagcgt gcggccgatc cgtcgtcgtc 1980 cgcgcgctct tggatcaggg atctgagctg acctttgtta cggagaacgt gacgcagatg 2040 ttgcgtgcga agcgcctccg tatgccagta tccgtgtccg ccatcggcga tgtcagcgct 2100 ggcaagtttc attatgcgac agagatcatt ctctcacctc tgagctcctc gtccgctccg 2160 ctcacaacca cagcggtaat tttgaagtcg ctcacctcgt acgcgccgaa acgcgcgatc 2220 gattcacgtt gcttgcagca cttgtccggt ctgccttggg cggatccgga tccgctcggt 2280 gatgatccta ttgatcttct cattggcgcg gaactttacg gcgaaatcct tcgcgacgga 2340 cttcgccggg gcgcggccgg tcagccgatc gctcagaata ccacgttcgg atggatactt 2400 tccgggccgc tcgcccatcc agaactcata gattcacggt ccgcgccggg gtccgctcgc 2460 gatgcgtccg tcgttactgc ctccgtcttc cactgtctga ccacgccaac tctcgatgta 2520 gagctccggc gattttggaa ggttgaggag ctcccgcgcg tctctcttcg ctctcccgag 2580 gacgagagat gtgaggctca cttcaaatct atccattccc gatcatcgga cgggagatac 2640 attgttcgcc tcccgtttaa gcgagatcca ccgatcgaca tcggcgattc acggttccgc 2700 gcttaacgct gtttcatctc tatctctcgg cgattgcaga aaaattcgga gcagtgccgc 2760 gcgtactgta atttcctccg cgagtatgag gctctcggcc acctgaaacg cgtcggcaat 2820 tcgacaaatt cgaacgatct tgaaccaatc gttttcactc cgcaccatcc ggtttttcgc 2880 gttaacagcc taatgacgag cttgcgggtt gtgttcaacg cgtcaagtcc aacgtcaaac 2940 ggatctagcc ttaacgatca tttactcgcc ggacctaagc tgcagactga attgccggcc 3000 gtaattcttc gatggcgcaa tttcaaatat gtatatgttg ctgatatcgc aaaaatgtat 3060 aggcagatca aaatcgattt ccgtgatctt aactttcaac gcattttgtg ggccgagaat 3120 cctcgcgatc cgccgatgga atttcagtta ttaaccgtta cgtacggtat gtcgtgtgcc 3180 ccgtttctcg cattacatgt gctccaacaa ttgacgctcg acaaagggga tgagttccca 3240 ttcgccggtg ccgtcctgca ggacaatgtt tatgtcgatg acgttttgtt cggaggtaat 3300 caccttcgcg acgtaattcg gatccacgct caacttatct cactgctccg tcgcgacggg 3360 tttgaattgc gtaagtggtc tagcaactcg cccgaactgc tcagagacat cgacgagtcc 3420 aattacggat tagcatgttc aaaaaccctc gcggccgacg aaagtattaa aattctcggg 3480 attgcgtgga ttccgtcggt ggacgcattt caaatctccg tttcgttgtc tgatcccatt 3540 gcaaccagta aacgttcaat tttgtcaacg atcgcggaaa ctgtacgacc ctctcggatg 3600 ggtcattccg gttaccatta ccgccaaaat tctaatgcaa caactctggt gagaaaatct 3660 taattgggac gacccgatcc cagacgcgct tctcgcgcga tggcaaccga tttatgcggg 3720 tctctcggtg ctcgatggtt tgcaaatccc gcgctggacc ggattcggcg atactgttcg 3780 gcgcgttgaa ttgcacggct tcgcggacgt ctcaaatgcg gcctatgccg cagtagttta 3840 tagcaaaatt attactcgaa caggtcaagt gaccgtcact ctgctcgcgg gccgatcgaa 3900 agtcgccccc ctccaacctc tcagtgtccc tcggcttgag ctttcggccg ccgtgctcct 3960 cgctcgtctt atcgagttcg tttgtctctc actcaatctc aaaaccgtgc catgtcattg 4020 ttggtcggat tcgacggtcg cgttgacatg gctctcgcac catccgtcgc gttggaggac 4080 gttcgttgcg aatcgtgtcg ctgacgtgca aacacgcata tccacagcgc aatggcgtca 4140 cgttccaacg agcgacaatc ccgcggattg tgcatcgcgc ggattggtgg gccatgaaat 4200 actgaatcat ccgttgtggt ggcagggtcc ggcgtggctt ttgcagtcag ttgacgcttg 4260 gccacctcct ctcgcttctt tggccgccca ggtggtgcct gaggaaaaaa tcgtcgcgat 4320 gcataacgtg cagacccccc gtcgtgggat atcctcgcac gtttttcctc atggtctaaa 4380 ttgatccggg tggtcgtcta tatatttcgt ttcgtgtccg catgtcgtcg tgtcagacat 4440 cccaatgcgg atgtcgaatc tcggggtcga gttctaaccg ctgtcgagtg ctcacaggct 4500 cgactattct tgatcaagcg gattcaaacg gagctgtttt cgcccaccac tcgcaccctc 4560 gcgaaccgaa gagcgctccc tgcaaaggac ccgcttctat cgctccggcc gtttttggat 4620 catgacggca ttgttcgtgt tggcgggtgg atttctcgcg ctccattgcc tttcgagacc 4680 cgacacccga ttctcctgtc gtcatatcct cttactaagc tcatagtaga tcatgcgcac 4740 cttcgcgcgc ggcacgccgg aatgcagctc acattgagta tccttcgtcg ccacttttgg 4800 atcattcgag cgagaagtat aattcgagag cgaattcatc gttgcattcc gtgatgttcg 4860 cgagcgtgcc gcggtccccc ttcaacttat gagcgatctt ccgcgggaaa gagtgaccgc 4920 tccaactcga aattttattc attgcggcct cattattatg cggattatgc ggggccagta 4980 tttattcgcg cctctgctgg ccggggcatc gtttcacgga aggcgtatat cgcgctgttc 5040 gtgtgcttgg cgacgcgcgc cgtgcacctt gagctcgttg ccgactactc ctcgcaagcc 5100 ttcttaaacg cgttttctcg attctccgcc cggcgggggc ttccggcgag gatgtactcc 5160 gacaatggta caaatttcgt tggagccgac agggcactcg ccactgcata ccgtgccgcg 5220 cttcgcgatc ctaattttca aaaccgtacg gcgaccgacg gtatcgagtg gaaatttatt 5280 ccctcgtcgg cccctcattt cggcggcatt tgggaagccg gaattaagag cgtcaagcat 5340 cacgtgcgtc gatcacttaa aactcagact ctcacgttcg aggagttcgc gactcttctt 5400 tgccggatag aagcctgctt gaattcgagg ccgatcgctc ccttgtcgga ttcattcgag 5460 gattatgagc ccctaactcc gggtcatttt ctcatcggat ccgctcttac gacgagcccc 5520 gagccatcct ttcttgatat tcatgaaaat cggctcaccc ggtggcagct agtgcgacat 5580 ttaacggagc gattttggcg gctgtggtat gcggactatg taaactcgct tcaacagcgg 5640 agtaaatgga agcaaatcca acccgcgatt aaaattgggc agctcgtcct gcttaagaac 5700 tcgatgcttc ctccttgtaa gtgggagttg gctcgagtta ctcagtgtca tcccggcgcc 5760 gacggacttg tgcgcgttgt gagcgtccgg acggcgagct cagagatgac gcgtccaatc 5820 ggcaaattat gcccattgcc gatcgattgc gagaaaccgg acccgctagt gtccgaatca 5880 cattgaaacc atcgtttcaa ggcgggcgga a 5911 // ID Gypsy-75_AA-LTR repbase; DNA; INV; 184 BP. XX AC supercont1.331; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-75_AA_; KW Gypsy-75_AA-I; Gypsy-75_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.331; Positions 584487 584304. XX SQ Sequence 184 BP; 66 A; 35 C; 34 G; 49 T; 0 other; tgtagtgtat tggcatcact gcaaacgtga ataccgtggt tactccctga ctacggttgc 60 taccgatcaa aacaccgatc atagtaagtg ctgttaaaag atagcgatca gaaaattagt 120 tattaagtga acgtcaaata aaccagaagc aaaactaaat ttgtctatta ttcaagacgt 180 caca 184 // ID hATw-2_HM repbase; DNA; INV; 5413 BP. XX AC . XX DT 13-JAN-2009 (Rel. 14.02, Created) DT 13-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5413 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 419-419 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(545..811,807..3854) FT /product="hATw-2_HM_1p" FT /translation="MQNRDLYLWYQSLEEKNLSNVCTVFRQRFSSNFQITD FT NDLSLFSKIKRVIQKSRLLTKKSKEEFMLLNFIPPISETNIQTFLLSTKII FT TSPKKEKNLVKDLDHLKIENKTLKRKVXAVDNLKENYKNCYKQLNNLESGN FT KKLKSVIKTSNAEKLEFVRQSKQCKALYTTTNERLSALEKKFTTLKLKNKT FT NRIRNLNKKILYRDIKLEKQEKEIDALKAQFVSESQTLNKEFNDLNQMLES FT SNTKVDILSQEKRNLQKKLCRLKNNLNKTKESISASSIEDLTFLQSEVKEL FT GAKVESLNKENIYLNGLLKLLEDEEIVTFERGRYSNDIREVIMELLSLNVS FT MNKVNDVIKIVLKKLAKKTVSKLPSVDTISRFMSEALILAQIQVSEEMLDN FT VEKDTGNCLHGDGTTKYHRHYQNFQLTSKSGKMLSFGLSELASGDAGSTLS FT SLTETLDDICDVLDTKDKEKDFAKLICSFKTTMSDLGSVNPLFNSKFKEMR FT ELLLPKVIEHWDYLTIKQKNEFKDMSNFFCKLHLLTNFATETDKVLSSFEK FT IALSDDHKDIFAFKTFESGAARLIRTACKAFHKRGSNELGVAGYFNSFLIG FT KGVDKCMFAPFIGNRFNILFYNGAALYYHSESIKEFLSQWPNPNNLLKAVN FT EDISNLLFLAESRALGIFDKLITGPLWRLIEENKNILSMNKFLFCLKMKLS FT DLCKDASPMLNQTPVFDPRDVKIHKDKLYEKLFEETGNIEFDVLVQQSLEI FT ISHAFLIILERQAIDQLPGGKYWNSDDRIQKAAENVPTTNKASESDFAILD FT LLIRTKPNAKIQTIQAYTMWYRNKTLDWLDAKSEEERYILIGKASNSVEKM FT KLKYKERQVELISKKSSILIVKQQLKADTEKKALLKKANIVNELIQLKSQV FT WLTADEAKDKSSKIENDTLRKQVIRVQLDFYRYVMGAKCLLKLFYKTKVSG FT NKRIDLSSGELMENLLTVIGNCNLPPPEKTTNTLKEKNQRNDLIKKQKEKL FT FTTLKDSRMSHLAKQKKVLFLPALLDDPFSLVGRVILHKIKEVDEEEYFWC FT KADVLKIGKVGKKIKQTLYDVIYESEPGSIYTFPLLSDFDKGDMILL*" XX SQ Sequence 5413 BP; 1974 A; 737 C; 864 G; 1832 T; 6 other; taggcacttc agtcacgtga ctttttccgt gtgacaagtt tagaatttaa acattagcga 60 atttttcggc aactttttcg ggaattattt taatgctatc taatttcaaa cccatttgcc 120 agttatattt cactggaaaa aatttaaaaa aaatgttaat ctgtggaggt tattcttcaa 180 agtttcttaa aaaagaagct agaagatttt tcaagtcgtt ttctccatag tttattttat 240 ttacactttt tgacaagtac atatactgtt ttaaattgct tttttgaggt tctagcacat 300 caaaaaagat tattcagctt atcaaattac aatatttgat atttgttttt ataataattt 360 gttttttatg tctggacttt acaattaaaa aaaaaacact tttcattatt ttttcgatgt 420 ttcttgacgc cattatgggt actccgttca acctacatat agctagttat ttatgtgcat 480 ctatagagag cgtttattta cgtaactttt atttttaata cttttttata tctagctgtt 540 taaaatgcag aatagagatt tgtatttatg gtatcaaagt ttagaagaaa aaaacttatc 600 caatgtgtgc acagttttta gacaacgatt ttcttccaac tttcaaatta ctgataatga 660 ccttagtttg ttttcgaaaa taaaaagagt tattcagaaa agtaggttgc tgactaaaaa 720 aagtaaagaa gaatttatgc ttttaaattt tataccacca atatctgaaa caaatattca 780 aacattttta ctatcaacta aaataatcac ctaaaaaaga aaaaaatcty gtaaaagacc 840 tagaccatct taaaatcgaa aataaaactt taaaacgcaa agttraagct gttgacaatt 900 taaaagaaaa ttataaaaac tgctataaac agctaaataa tcttgaaagt ggaaataaaa 960 aacttaagtc agttataaaa accagcaatg cagagaaact agaattcgta agacagtcca 1020 aacagtgtaa agcactatat acaacaacga atgaacggct gtcagctcta gaaaaaaaat 1080 ttaccacatt aaaacttaag aataaaacaa acagaattcg aaatttaaac aagaagattt 1140 tgtatcgaga tataaagcta gaaaaacaag aaaaagaaat tgatgcatta aaagcccaat 1200 ttgtaagcga atctcaaact ttaaataagg aattcaatga ccttaatcaa atgctagaaa 1260 gtagtaatac aaaagttgat attttatctc aagaaaaaag aaatcttcaa aagaaattat 1320 gtcgtttgaa aaataattta aataaaacaa aggaatcaat ttcggcttct tcgattgaag 1380 accttacttt cttgcagtca gaagttaaag aacttggtgc aaaagttgaa tcattaaaca 1440 aagaaaacat atatttaaat ggtcttttga agttgttgga agatgaggaa attgtaactt 1500 ttgaaagagg gcgttattca aatgatattc gtgaagttat tatggagctg ttatcattaa 1560 atgtcagtat gaacaaggtc aatgatgtca tcaaaattgt tttaaagaaa cttgcaaaaa 1620 aaacagtttc aaaacttccc tctgttgata caatatctag atttatgtca gaggctttaa 1680 ttcttgcaca gattcaagta tcagaagaaa tgttagataa tgttgagaaa gacacaggaa 1740 actgtttgca tggggatgga acaaccaaat atcacagaca ttatcaaaac tttcagttga 1800 cttccaaaag tggtaaaatg ctatcatttg ggctatcaga attggcttct ggtgatgctg 1860 gatctacttt gagttcttta acagaaacat tggatgatat ttgtgatgtt ctggacacaa 1920 aagacaagga aaaagacttt gcaaagttaa tttgttcatt taagacaact atgtcagatt 1980 taggttctgt caatccactt tttaatagta aattcaaaga aatgagggag ctattgttac 2040 caaaagttat agaacattgg gattatttaa caattaaaca aaaaaatgaa tttaaggata 2100 tgtcaaactt tttttgtaag ttacaccttt taacaaactt tgcaactgag acagataaag 2160 tactgagttc atttgaaaaa atagctctga gtgatgatca caaagacatt tttgcattta 2220 aaacttttga atcgggggct gcaagactca tacgtactgc atgtaaagca ttccataaac 2280 gaggtagcaa tgaacttggt gtggcaggat attttaattc ttttctcatt ggaaaaggtg 2340 tagacaaatg tatgtttgct ccttttatag gcaataggtt caacattttg ttttataatg 2400 gtgctgcact gtattatcat tctgaatcta ttaaagaatt tttgtctcaa tggccaaatc 2460 cgaataatct attgaaggca gttaatgaag atatatcaaa tttattattt ttagcagaat 2520 ctcgtgcatt aggtattttt gataagttaa ttactggtcc cctttggaga ctcattgaag 2580 aaaacaaaaa cattcttagt atgaataagt ttttattttg tttaaaaatg aaattgtcag 2640 atctttgcaa agacgcttct ccaatgctca atcaaacacc tgtatttgat ccaagagatg 2700 tcaaaataca taaagataaa ctttatgaaa aactttttga ggagactgga aatatagaat 2760 ttgatgtctt ggtgcagcag tcattagaaa taatttccca tgcgttttta ataattctag 2820 aaagacaagc aattgatcag ctcccaggag gtaaatattg gaactcagat gatagaattc 2880 aaaaggctgc tgaaaatgta ccaacaacta acaaagcttc agaaagtgat tttgcgattc 2940 tggatctact tattcggaca aagcccaatg ctaaaataca aacaattcaa gcatatacaa 3000 tgtggtacag aaacaaaacc ttagattggt tggatgcaaa atcagaagaa gagcgttaca 3060 ttttaattgg aaaagcttct aatagtgttg aaaaaatgaa attaaagtat aaagagcgtc 3120 aagtagagtt aatatcaaag aaatcttcta ttctgatagt aaaacagcaa ctcaaagcag 3180 atacagaaaa gaaggcattg ctgaaaaaag caaatattgt aaatgagctt attcaattga 3240 aatcacaagt ttggctaaca gcagatgaag caaaagataa atcttcaaaa attgaaaatg 3300 atactttaag gaagcaagtt attagagttc aattagattt ttatcgttat gtaatggggg 3360 caaaatgttt gttaaagctg ttttataaaa ctaaagtttc tggaaataaa cgtattgatt 3420 taagttcagg tgagttaatg gagaacctgt tgactgtcat tggtaactgc aatttgcctc 3480 caccagaaaa aacaaccaat acattaaaag aaaaaaacca acgcaatgac ttgattaaga 3540 aacaaaaaga aaagttgttt acaacgttaa aagattctag aatgtcacat cttgcaaaac 3600 aaaaaaaagt cctgtttctg ccggctttat tagatgatcc atttagttta gttgggcgag 3660 ttattcttca taaaataaaa gaagttgatg aagaagagta tttttggtgc aaagcagatg 3720 ttcttaaaat aggtaaagtt ggaaaaaaaa tcaagcaaac attatatgat gttatttacg 3780 aatcagaacc tggaagcatt tatacatttc cactgttatc tgactttgat aaaggagata 3840 tgatcttgtt ataaaaggta gaatcactta aaagtttcta tagtattatt agttgatgaa 3900 cctatcattg ttactattta aatgtaaaca aagattttgt tatttttttt gcatgatcat 3960 caaatttttt gttcattttt cttacagaaa catttaaaca cgatgaagtg cttgtaagat 4020 tgtggtgcaa tgcaaacaag aagaaatggt ggatgatgat tttaaataca tttttgtatt 4080 tttcttaatt ttgatttgtg tattttttga ttattgtgtt ttttccagca gtgattatcc 4140 tttaagtaat gtttattttt ttggttttta cagagatttt attgcctttg acccaattgc 4200 cccaaatggg gcaaactagc attgggatta ttacatattt gttgggtttt tataaatgta 4260 tatgaatttg tttactctta caatataaaa aagctctcat tctttaacta aagttttttt 4320 ttatggtttt tacagaaatt tcattgcctt tgacccaatt gtcccaaatg gggcaaacga 4380 gcagtggggt tcttacatat attttgggtt tttataaatg tgtatgaact tgtttactct 4440 gacaatatga aaaagcactt tttctttaac taaaattaat ttttattgtc tttacagagc 4500 tttaattgtt cccggcccag tygcctcgat tcggggaaaa aagacctgaa atttttawat 4560 atatttattg atatttataa atataaatga attcgttcac tctgaaaatt aaaaaaaaaa 4620 actctttttc ttttagtaaa gttcattttt aacggtttta acagagcttt ttgtactgtt 4680 gtcccaattt gccccataat gggaaaacgg gctctgtgta tttttacaaa ttttcctaat 4740 atttataagt ataaatgaat ttgtttattc tgacagtttg aaaaagccct tttttttaac 4800 taaaattaat ttttttttgt ttctacagag ctttgattgt tcttggcccr attgccctga 4860 atgggggaaa caaaccctaa gatttttacg cattttcttg atatttataa atataaatga 4920 atttgttcac tctgacagtt tgaaaaagct ctttttcttt aactaaagtt aatttttacg 4980 gttttatcaa agcttttata gcttttgacc caattgcccc aaatggggca aacgagcagt 5040 ggaatcctta cctaggggcc cctaatctga gaaaaatgat ttgtagaatt gaacgctttg 5100 attctttatt agaaaagaaa acataaaata aagataattt agcataagta cttcaatgaa 5160 actctttctc attgatggca ttttaaaaat aaatttatta tcaaaatttg gtgccataaa 5220 tccttaagga aaactcarca aatctggacg caaaataggc gttattgagc ccaacataaa 5280 ttgtcataac ttgggaacca gttatccaaa tttaaaaaac gaactcattc tgaaaacgtc 5340 tttattatcc gcgtactttg gtattaaata tagccagaga taatgataat agtaaaatcg 5400 accgaagtgc cta 5413 // ID P-25_HM repbase; DNA; INV; 2959 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-25_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2959 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 371-371 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 255..2735 FT /product="P-25_HM_1p" FT /translation="MPRKCCVPCCSSGYASSTIKVSTYRFPCDEYEKEKWI FT KAIPRKNLIVNKYTVVCRLHWPIDCKSFTFYRGKERPTEPPTVFSNIPASC FT LSLPISKPRTTKLSSSSIRNTKSDELDSFREHDFFRFDSIKERLLRSNDLI FT VYNINDNNDVVYIQSKEFVCGIPKFLVIINRDLTLTSFHLGSSCSIPILIS FT SKTSCCKYWSVFDEIIRYLKNKDICHKKNVLLEHMDCMSQKKVGEFTYSPD FT IITRAFEYFAISRSLYSQISNDYKLPSIRTLTRITSKVGSQDSLEFLKNVM FT MSIENTQRKCILLIDEVYIKSTLTYHGGVLFGNSADQPGVLAKTMLAIMVK FT CLFGGPEFLVKMIPISHLDSDFLFKQCKPIIETINNQPNAKLLSIITDGNR FT VNQKFFKMMNKVEEKPWCGDNENTFLLFDYVHVMKCVRNNWLTEKSGELVF FT EFEGNKHVAKWSDLVNLYELESANLLKLSKLNEISISPKPIERQKVETCLR FT VFCDETVAALKNHPSIKKDEVIGTIKFISIFVKFWKICNVKGVGADIRYKD FT EYRGVIRSHTDKSLLFLLDVADMVEKMKKSGAKRIKQLTKDTSECISHVCR FT GLVNIARYLLDCGNDYVILGWFTTDPLEKTFSKLRQGSGGTYFITAQSVIE FT KVRIQHAKLSLQLEIDSFDDCTNGHSCKNCFRSLNEQECEIFDNLPELEDK FT VQAESLLSIIYIGGYVQRKNGYDQTDTQFYYKKCGAYIDDLNRGGLVIPFD FT NIVQWCIFCFILFEQLSDEFCRTFLMKQFLSVSNKFQFGIYDRHCRTLANI FT FLKNLAIIKSPRCSKESRQKVLKLS" XX SQ Sequence 2959 BP; 1001 A; 429 C; 513 G; 1015 T; 1 other; catggccttc tagaaataca ggccttgaca tcgcgataaa atattttttg actggaggcc 60 gattctatat ggaaagattt tcatcgaata aaacattttt gttttgcgat gtttatactc 120 aacagcgatt ggtctaaaca caaattttaa gcaactttgt agcgttaatt aatcagcaac 180 garaaaaatt agcagtaaag aggtttatct ttttgtaaat tgcaacacat ttatcaaaca 240 aaaagttatc aattatgcct cgaaaatgtt gtgttccatg ctgtagtagt ggttatgctt 300 catcaactat aaaggtgtcc acttatcgat ttccttgtga tgaatacgaa aaagaaaagt 360 ggataaaggc tatcccaaga aaaaatctga ttgtaaataa atacactgtg gtgtgccgac 420 ttcattggcc tatagattgc aaatctttta cattttatcg tggaaaagaa cgtcctacag 480 aaccaccaac tgtcttttct aacataccag ctagctgtct tagtttacct atttctaaac 540 ccagaacaac aaaattgtca tcgtccagta taagaaatac aaaatcagat gaactagata 600 gctttagaga acatgatttt tttcgatttg atagcataaa agaaagactc ttgcgtagta 660 atgatcttat tgtatacaac ataaatgaca ataatgatgt tgtttacatt cagtcaaaag 720 aatttgtatg tggtattcca aagtttttag ttatcattaa tagggattta acattaactt 780 cgtttcatct tgggtcatct tgcagtattc ccattttaat atcaagtaag acatcatgtt 840 gtaagtattg gtcagttttt gatgaaatca tacgttactt aaaaaataaa gatatttgtc 900 ataaaaaaaa tgtattactc gaacatatgg actgtatgtc tcaaaagaaa gttggtgaat 960 ttacttactc tcctgatata attactcgag catttgaata ttttgctata tccaggtctc 1020 tgtacagcca aatttctaat gactataaat tacccagtat tagaactttg acaagaatca 1080 cttcaaaagt cggatcacaa gatagtctgg aatttttgaa aaatgttatg atgagtattg 1140 aaaatacaca acgtaaatgt atattgctga ttgatgaagt ttatattaaa tccacattaa 1200 cttatcatgg aggtgtatta tttggaaatt ctgctgatca acccggtgtc cttgccaaaa 1260 ccatgttggc tataatggta aaatgtttat ttggtggacc agagttttta gtaaaaatga 1320 ttccaattag tcacttagat tctgattttc tttttaaaca atgcaaacca attattgaaa 1380 caattaacaa tcaacccaat gcaaaactgt tgtcaattat tactgatgga aatagagtaa 1440 atcaaaagtt ttttaaaatg atgaataaag ttgaagaaaa accttggtgc ggtgataatg 1500 aaaacacatt tttgttgttt gattatgttc atgtcatgaa gtgtgttcgt aataattggt 1560 taactgaaaa aagcggagaa ctagtttttg agtttgaagg taacaaacat gtagcaaaat 1620 ggagtgattt agtaaatctt tatgagttgg agtctgccaa tcttttaaaa cttagcaaat 1680 taaatgaaat ttccatctct ccaaaaccaa ttgagcgaca gaaagttgag acttgtttac 1740 gagttttctg tgatgaaacg gttgctgcac ttaaaaatca tccatccatt aaaaaagatg 1800 aagttattgg caccatcaaa tttatttcta tatttgtgaa gttttggaaa atttgtaatg 1860 tgaaaggtgt tggagcagat attcgttata aagatgagta ccgtggagtt ataagatcac 1920 acacagacaa aagcctgcta ttcttattag atgtggcaga tatggttgaa aaaatgaaaa 1980 aatcaggcgc aaaacgtatt aagcagctta caaaagatac ctctgaatgt atttctcatg 2040 tgtgtagggg gttagtcaac attgcaagat atcttttgga ttgtggaaat gattatgtta 2100 tcctggggtg gttcacaact gatccattag agaaaacttt tagtaagctt cgccagggtt 2160 ctggaggaac ctattttatt actgctcaat ctgtcataga aaaagttaga atccaacatg 2220 caaaactttc tttacagtta gaaatagata gttttgatga ttgtaccaat ggtcattcct 2280 gtaaaaactg ttttcgaagt ttaaatgagc aggagtgcga aatttttgat aacttgccag 2340 aactcgagga taaagttcaa gcagaatcat tactttcaat tatatatatt ggtggttatg 2400 ttcaaagaaa aaatggctat gatcaaactg atacacaatt ttattataaa aaatgtggtg 2460 cctacattga tgacttgaat agaggaggtc ttgttattcc ctttgacaat attgttcaat 2520 ggtgcatatt ttgttttatt ttatttgagc agttatcaga tgaattttgt cgaacatttt 2580 tgatgaagca attcttatct gtatcaaaca agtttcaatt tggtatttat gaccgacatt 2640 gtcgcacttt agcaaatata tttcttaaaa atctagcaat aataaaatct ccaagatgct 2700 ccaaggagtc caggcaaaaa gttttaaaac tgtcatgaaa atgagttttt atttaaattt 2760 tattatatta tataagaagt atattatagt ttattaactt tattgtagtt ttacttttaa 2820 gacagctctt atatattgtt attgttatag ttttttgctc tccttgtccg catccacttt 2880 acttattagc gcttcgctcg gcctccagtc aataaaaatt tcggcgcgtt gtcaaggcct 2940 gtatttctag aaggccatg 2959 // ID DNA2-5_TCa repbase; DNA; INV; 411 BP. XX AC . XX DT 22-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-5_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-411 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 666-666 (2009). XX DR [1] (Consensus) XX CC TSD is TA. Based on that, it is classified as putative CC non-autonomous Tc1/Mariner. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 411 BP; 167 A; 56 C; 64 G; 124 T; 0 other; cagggtgtat cagaaatacg tgtcttaatt ttaacaggta acagagctca acaaataaaa 60 catcttttct atttgtaatt tttcaataaa aaattcgcaa atttacttaa aatttggaaa 120 aagataacat actgaaacgt gtacccgcct atgggtacaa aaataagaga gccgaactat 180 tttcgttgtg atagaaacgg acaatttaaa caatttaaaa caacaatgaa aattgttgct 240 atgaatacaa actaattatt aaacactgac cggctcgttt gcacaaaaga aaggaggaaa 300 acagtgaaaa ttataaataa gaattttgca aaatgttata tagaaaagtt gttgtatttg 360 atgagtttta ttacgtgtta aaattaacac acgtatttct gatacaccct g 411 // ID I-2_NVi repbase; DNA; INV; 5845 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.11, Created) DT 01-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Nasonia DE vitripennis. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5845 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 10(11), 1905-1905 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 4 sequences with >99% identity. CC The 3' terminus is uncertain. XX FH Key Location/Qualifiers FT CDS 237..2246 FT /product="I-2_NVi_1p" FT /translation="MGDHPQNLPRVLRPPRTYATSGLKLNNRHNTRVTTKE FT SIGENKNGLNPKRNSLIQARLNHTKSSNNLDQINTTAKGDDLKDTKMETED FT ISLEEIEKIIHTPNDNKTLIQSQPITETNTSSLDQRNHESQNTSHENSKAD FT SERKRPAEFRSPESSSPNNQRFKKKPYNDNARKNLFNDHSLPTDPNDNRND FT KDLPSEYDINIQGPYKIYLQSIDNDRLDAIKIAKSLIPKIEIKDSLHEIKQ FT LSYNKVCVIAKYRNIANLILNLTDWKDLKIRAFIPNHLLSKQGIIKGIPVE FT ITDDELKQFIELDSPFGPLQITHTRRFTKKIYNKSTDKVDILPTKTVQITV FT KGQFLPAEAKVLKVIYPIETYYPQVRQCYRCFRFGHIKNNCKAPNERCIRC FT GEPKHTDGNECPYINTQPTCLHCKQNHLPIDKSCPAKIEEQNLKNMATDQN FT ITIAELKNQLRKXPAYNKSHFPTIGNSSQPTQAQTDTLKNTMYADKVKSYS FT KQPRNHPTQPSVEPTDRGSTQSVGHNHQLTQSPSQKSKNNNFTPNFRLAKQ FT HTDCLSYVNGNTNLKIKPINHRMTAKDFFDHSYLQDTHDNLNFTPIEEKIN FT YITEIINDLPITEIRKILRLTENRINHREQQLKQSNVLLTQVQNTYNDAVN FT ETDNLSKKNDNTNEESSSTEL" FT CDS 2337..5591 FT /product="I-2_NVi_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H, C-terminally truncated." FT /translation="MEDTIQGNPNKCKIKNNLKILQWNCRSISNKTDFIKN FT IADNYDIILLSETWLSDQKKFTIKNFNMVRRDRAENNNNLNKEHGGGVCIL FT VKSXLIFEEYKPLYNSPESLETVAVTVHLNNEEEILISSIYRVPGKVTTAE FT EWYKFFQSIDKIKYKIIGGDFNAHNLLWGSTNNCKTGENLLEIIDDKDLII FT LNDGSMTYSKVVNNNITFSAIDLTIVSSDLFLQSHWKTLDDKMYSDHYLIE FT IEINKDIKICPTASSHKIKTKSINWTNFNLLVEAIVDTQYPIDQFHEFSNK FT PIDEKYNILMNILISAINELKPKPKKSHNKEKINEINNPSDDNSTMNNHPC FT YKVYSPLITDNRTQFIKNKLKASHIPQSIKPPWWNEEAEEINNTRKNLEKS FT LKQNPSLEQLKELKDYEIESKFKLKEIKLNSFKKYTETKLSREANINEVWE FT TIHKFNEKKRNNPIYLDSETINNMHQFISDFCPQTASYNQPPGSIENIEAE FT INSDIDSPFNLKELTIAIESCNKKSSPGLDQIDNIMLNNTPNNYRALLLHT FT INDIIASGRFPREWKEFLVMLLPKKEKNKFRPITLAPCMLKLAEKLIQTRL FT IYYLESNALLPDSQNGFRKAKSCATSIAYLIAQIHKAFIENSHIITLLIDI FT KSAFDMVNPNMLQNILEDLKIPIKTRTFIFNLMTNRNLYFEINHKLVGPFY FT RHIGVPQGCVLSPTLYLIYVIYLNRHINPKNNIIQFADDTIIFTNNKSVEE FT GLXTLQNNTSEIINFFQSIGLDIAPNKTQLIIFSNKNIDPNISITINGHKI FT KNEQAVKYLGVWLDSKLNWNTHAQNLIEKIQKTINIIKVLRTTWWGGHPQV FT LLNVYKGLVRSITDYNCFCINIKNPKLREKINKLQYKAIRLALGYRNSTPI FT NVMLAESKEVPITTRTEYLADKLAVKIISIESDRLSNMLCELFVLARNKNK FT IFTVSSFPLFKSFIHYFYHRGHFIKKFNLPLPYPYKLETMLNDKEDEIISI FT KEGSIIQKSENPNTAFEEMFYNNPSEETVDFFTDGSKTEDKNHTGAAIYSP FT YNNLEKKFKFNKIASIFTAEAFAIFLKN" XX SQ Sequence 5845 BP; 2438 A; 1203 C; 807 G; 1391 T; 6 other; ggagtacggt tcgagatgcc atagtgcacg gacgcgtttt ttttgcagct agtttcttga 60 tacttataaa taacggaatt caggcaaata cttgactggc gaagaagacg aaacggtgtt 120 tatacgtcca aagggattag atatagaaac ttctgattgc tgaaaccaaa aaggcccgct 180 aaagcgggcg caacccccaa cataacctgc acctgcgtgt gcggttaatc gaaaaaatgg 240 gagatcaccc acaaaatctt ccgagagtgc taagaccacc tcggacgtat gctacttctg 300 gccttaagct taataatagg cataacaccc gcgttactac gaaagagtct ataggcgaaa 360 acaaaaatgg tctgaatcca aagaggaact cactgataca agcaagactt aaccacacta 420 aaagttcgaa caacttagat caaattaaca caacagccaa aggtgatgac ctaaaggata 480 caaaaatgga aactgaagac atatcgctcg aagaaatcga aaagattatc catacaccaa 540 atgacaataa aaccctaata cagagtcaac cgataacaga gacaaacacg agcagcttag 600 accagcgcaa tcacgaatcc cagaacacgt cgcacgaaaa ctcaaaagct gacagtgaaa 660 ggaagagacc tgccgaattt agatcgccag agtcatcatc ccccaacaat caaagattca 720 agaaaaaacc ttacaacgac aatgctagaa aaaatctatt caatgaccac tctttaccga 780 ccgatcccaa tgataacaga aatgataaag atttaccatc agagtacgac attaacatac 840 aaggaccata caaaatctac ctacaatcaa ttgacaatga tagactagat gctataaaga 900 ttgctaaatc cctaattcca aaaattgaaa tcaaagactc cttacatgaa ataaaacaat 960 taagctataa taaagtttgt gttatagcca aatatagaaa tatagccaac cttatcttaa 1020 acttgacaga ttggaaggat ctaaaaatcc gtgcgttcat cccaaaccat ctgctttcaa 1080 agcaaggaat cattaagggt attccagtag aaataactga tgatgaatta aaacagttca 1140 tcgaattaga ctctcctttt ggtccactcc agatcacaca cacacgaaga ttcacgaaaa 1200 agatatacaa caaatcaacc gataaagtag atatcctccc aacaaaaaca gttcaaataa 1260 cagtgaaagg acaatttcta ccggcagagg ctaaggtctt aaaagtaatt tacccaattg 1320 aaacatacta cccgcaagta agacaatgct acagatgctt tagatttggt catatcaaaa 1380 acaattgcaa agccccaaat gaaagatgca tcagatgcgg tgaaccaaaa catacagatg 1440 gaaatgaatg tccatacatt aacacacaac ctacatgttt acactgtaaa caaaaccacc 1500 tcccaattga caaaagttgt cctgcaaaaa ttgaagagca aaacctaaaa aacatggcaa 1560 cagatcaaaa cattacaata gccgaattaa aaaaccaatt aagaaaaawg cctgcgtata 1620 acaaatcaca ttttccaacc attggtaact cctcacaacc aacgcaagct cagacagaca 1680 cactaaaaaa taccatgtat gcggataaag taaaatccta ctccaaacag ccaaggaacc 1740 atcctactca accttcagtg gaaccaacag atagggggtc tacccaatca gtaggtcata 1800 accatcaact cacccaatcc ccctctcaaa aatcaaaaaa caacaacttt actcccaact 1860 ttagattggc aaaacagcac acggattgtt tgtcatatgt aaatggcaac accaatctaa 1920 aaatcaaacc aatcaatcat agaatgacag cgaaagactt ttttgatcac agttatcttc 1980 aagacactca tgataattta aatttcacac caatagagga aaaaattaat tatattactg 2040 aaattattaa cgacttacca ataactgaaa tccgaaaaat cttaagacta actgaaaata 2100 gaatcaatca cagggaacaa caattaaaac agtcaaatgt actgttgaca caagtccaaa 2160 atacctataa tgacgcagtt aatgaaacag ataatttaag caaaaaaaat gataacacta 2220 atgaagaatc aagctccaca gagctttaaa caaacacgta aatcgaacta atagcaattc 2280 catgcttata aatgwttttc gacatctaat tttcaatctt aaattatawt taaaaaatgg 2340 aagatacaat tcaaggtaat ccaaacaaat gtaaaatcaa aaacaatctt aaaatccttc 2400 aatggaattg ccgctcaatt tcaaataaaa ctgatttcat caaaaatata gccgacaact 2460 acgacataat tcttctctca gagacatggc tctcagacca aaaaaaattt acaataaaaa 2520 acttcaatat ggtaagaaga gatagagcag aaaacaacaa caatttaaat aaagagcatg 2580 gtggaggagt ctgtattctc gtcaaatcaa wcctgatatt tgaagagtat aaaccattat 2640 ataacagccc ggaaagttta gaaacagttg ctgtcacagt acatctcaac aacgaagaag 2700 aaatattaat cagttcaata tacagagttc ctggaaaagt taccacagcg gaagaatggt 2760 acaaattctt tcaatcaatt gataaaatca aatataaaat aattggtgga gattttaatg 2820 cacacaattt actatgggga tcaacaaaca actgtaaaac tggagagaat cttttagaaa 2880 ttatcgacga taaagacctt ataattctta atgacggttc aatgacctac tccaaagtag 2940 ttaacaataa cataactttt tctgctatag acttaacaat agtatcatca gacttatttc 3000 tccaaagtca ctggaaaacc ctagacgata aaatgtacag tgatcattac ctaattgaaa 3060 tcgagattaa caaagacata aaaatatgcc ctacagcatc aagccataaa ataaaaacaa 3120 agtctattaa ctggacaaac ttcaacctac tagtagaagc tatagttgac actcaatacc 3180 ctatagacca atttcatgaa ttttcgaata aaccgataga tgaaaaatat aacatactaa 3240 tgaatattct aataagtgcc ataaacgaac taaaacctaa accaaaaaag tcacacaaca 3300 aagaaaaaat aaatgaaata aacaacccca gtgatgacaa ttctacaatg aataaccatc 3360 catgttataa agtatactcc cctttaataa ccgataacag aacccaattt attaagaata 3420 aactaaaagc atcacatata cctcaatcaa taaaaccacc ttggtggaat gaggaagccg 3480 aagagattaa taacaccagg aaaaatctag aaaaaagcct gaaacaaaat ccatcgttgg 3540 aacaactcaa agaactaaag gactatgaaa tagaatcaaa attcaaacta aaagaaatta 3600 aattaaacag ttttaagaaa tatacagaaa ctaaactgtc tagagaagcc aacataaatg 3660 aagtctggga aaccatccat aaatttaacg aaaaaaaaag aaataaccca atatacttag 3720 actcagagac aataaataac atgcaccaat tcatctcaga cttttgcccg caaacagctt 3780 cttataatca acccccagga tctatcgaaa acatagaggc cgaaataaat agtgacatcg 3840 acagtccatt taatttaaaa gaactgacaa tagcgattga gtcatgcaac aaaaaatcta 3900 gccctgggtt agaccaaata gacaacatta tgctcaacaa cactccaaac aactatagag 3960 cgctgctttt acacaccata aatgatataa ttgccagcgg aaggtttcca cgagagtgga 4020 aagaattcct ggtaatgcta ctacccaaaa aagaaaaaaa taaattcaga cctatcacat 4080 tagctccatg tatgctaaaa ctagccgaaa aactcataca aacmcgatta atctactacc 4140 tagagagcaa tgcactctta cccgattctc aaaatggctt cagaaaagct aagtcatgcg 4200 ccactagtat tgcttatcta atcgctcaaa tacataaagc ttttatagaa aattcgcata 4260 ttattacatt gctgatagac attaagtctg ctttcgacat ggtcaatccg aatatgttac 4320 aaaatatact ggaagatctt aaaataccaa ttaaaacaag aacattcata ttcaacctaa 4380 tgacaaacag aaatctctat tttgaaatta atcacaaact agtgggaccc ttctacagac 4440 acattggagt cccccaaggc tgcgttctaa gcccgaccct ctacctcatt tatgttatct 4500 atctcaaccg acacattaat cctaaaaaca atatcataca attcgcagac gacacaataa 4560 tatttacaaa caataaatct gtagaagaag gcctagstac attacaaaac aacacaagtg 4620 agattatcaa ctttttccaa tccataggac tagatatagc tccgaataaa acacaattga 4680 tcatcttctc caacaaaaat atagatccca acatatctat caccataaat ggacataaaa 4740 ttaaaaatga acaagctgtt aaataccttg gagtatggct cgactccaaa ctaaactgga 4800 atacacacgc acaaaatctt attgaaaaaa tacaaaaaac tataaatatt ataaaagtct 4860 tgagaacgac ttggtgggga ggacatcctc aagtactcct caatgtctat aaaggtttgg 4920 ttagaagcat caccgattat aactgtttct gtataaatat caagaatcca aaactaagag 4980 aaaaaatcaa caagctacaa tacaaagcaa tcagattggc actaggatat agaaactcta 5040 cacctatcaa tgtcatgctg gcagaatcca aagaagtccc cataacaacc agaacggagt 5100 acttagcaga caaactagcg gtcaagatca tatcaataga atcagataga ctttctaaca 5160 tgctctgcga attatttgtc ctagcaagaa ataaaaacaa aattttcacc gtaagctctt 5220 tccctttatt caaatcattt atacactact tttatcacag aggtcacttt atcaaaaaat 5280 ttaacttacc acttccttac ccatacaagc ttgaaacaat gctaaatgac aaagaggacg 5340 agatcatctc aataaaagaa ggttctatca tacaaaaatc agaaaacccc aatactgcat 5400 ttgaggagat gttttataat aacccatccg aagaaacggt tgactttttt actgacggtt 5460 ctaaaacaga agacaaaaat cacacaggag cagcaattta ttcaccatac aataatctcg 5520 aaaaaaaatt caaattcaac aaaattgcat ccatatttac agctgaagca tttgctattt 5580 tcttaaaaaa ttaaaaaaaa acaaatcaac aattccttta gacttgaaaa aaacatttaa 5640 aatgcctaac agcatagaag caaaatttct aacagaattt ttcaaagcct gtaatttagc 5700 tttataatca aaaatttaca cattatgtag atgatttcac ctgacttatt ctataagatg 5760 taacagaaaa accacggtgc tgatcaataa tcggcctcga gataccttac tgccaaacca 5820 aaaacaaaat cgccgccaac tacaa 5845 // ID Outcast_Ele14 repbase; DNA; INV; 5448 BP. XX AC . XX DT 23-OCT-2010 (Rel. 15.1, Created) DT 23-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE Non-LTR retrotransposon family from Aedes aegypti: consensus. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; KW Outcast_Ele14. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5448 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5448 RA Jurka J.; RT "Outcast clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (23-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 6 sequences with >98% identity, and ~98% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 229..1641 FT /product="Outcast_Ele14_1p" FT /translation="MGAEDDPPDGGGCTSXRPDWGQDGEAAAAVVRNGIQD FT DNKAKREDVIYTHRDAAPYRVYFELRNDENGAKKINKFGLGSTLRSNDTFK FT RHIVDMKYAGRQKILVFLNSYVKANQLVKSINESNSIYRAYVPQHLVCVTG FT VIAGIPAEIPIEQIEDDLECDVPIVGVRRLTRFIDGVKIPTNRVSVTFRTD FT SLPDKVRLFCCSSRVQPFVQKVVICLNCLRTNHRTANCRSAKRCQRCSERH FT DDIHEYDRCEKGKKCVNCRSTDHTTTDPNCPEIKRQDKIKKLMAKRNLTYA FT EAKEQIPIANQNLYETLYDAEDFPTPAESFADMTRGNFRWKDPLREQWMQA FT NNERVKMQAAVNIGKSPDKQPNRKRKTTTNTDKTTEKRNHILGKEAAEATN FT GVALINNHRVDEKERWENMSRQIMERVQKEQHNLLMTFYADFIAQLDNDET FT TKEKFMTCTKRHFNFAKSVIHHCDTK" FT CDS 1663..5337 FT /product="Outcast_Ele14_2p" FT /translation="MIARNQYKMKINKISLYLIPNLKFGELSKIKILQCNV FT QSLDKNKAEIHRVFFGEGFDASFLSETWTNATLETSKKYRIAGCHLIACSR FT DDGYGGSAILLRNNHSYTKINLPALSNKTQAVAISVLSMKLVLFSVYVSPS FT ISVPDFKEDLQILFDTVRQFENVIIGGDFNAHHGDWGDENMDRKGDILADM FT INDSSCLLLNDGSATFVPVRLGCRSSAIDLTMCSPQIXSKAQWRTLAYGIG FT SHHLAIEITISNEVDERPKYCYDNKKIATSIANLEPTKIRNVEDLKSAVNR FT IRKCNRRKDNRTPKFWWSDTVDQAWKEKTEARRVFNRESSVENLIEFKRKA FT AIFQKLKRAEIRNQFEEFSSEIGPQTSSKELWNKVGRLAGKRIKRIENNTL FT LDDSNAANQFLDKHFGPNDAEEWIEPVPVASYNLLDFSKWEAIIGKKKKRS FT APGEDGITYEALRQLSPVVIRNVLEDLNRMWIRGSLTDELKTIKIVAIPKP FT GRNQSSPEGKRPISLVPTMTKIMNTAVLDILQSHLHQNQILPEKSFGFRRG FT TSTSTAICFVINEIKRNKREKLLTALICVDLSNAFNAVRTDILVETLARMR FT VTVEVVTWISSFLKNRRIVLNQRNGSISRRISNGLPQGDVLSPTLFNIYTA FT VLHSIRIEGVTLVQYADDFEIIVTANNWETLGIRAQQYMDKFVETCNELNF FT AINSEKSSVMVMSASSRSLNIKINGNVVETVQKQNYLGITIDRFLTFGAHI FT RLVREKVQERLCMLKVLNGARTGTHPDTMLKVYQALIRSIMEYGCSAHNNA FT SKTTRRIVEVINNQSLRKVTGSTKSTPLNTLSAISGQPPLNFRQEMVTCRE FT IARCLSRDNVLGKQLKNIEDEEEHGQEKFSYMEQIYLRNRTTFDVLMQTEQ FT IVDVHEVEINPYLNDENPKKDHVSQVKLKQLSLGMINGKYRGRGRIWTDAS FT KDGEKCGIGIFVEGNKARYYYRLLHNTSITSAELTAIWLAMQIVEKDQLMH FT YVILTDSRSSCQILENGVDSGEGETVIAEILQVAKRWKVTIQWVPSHIQLA FT GNDVADMLAKEGTKDEAPILENKLFLKDVCLKFTKQLEEKTRRWYEELSQE FT KGKKYYGIRPVWNAQPWFAKLDLKGKDIRLLNRLMSGHDYSKYWLAKMRLK FT DSPDCDLCDEPETAEHVILHCPRYGMQRHHYSFDCRYANLEEIFKTGDKEL FT FEEVANFVSEMKLEL" XX SQ Sequence 5448 BP; 1880 A; 1066 C; 1277 G; 1222 T; 3 other; atcattcgag tgcgacgcgt tgaagcgagc gaaagttgac acgttttcgg cttggtgaaa 60 aactgaaata tcggtgcaaa tattgttaaa aagttacgaa caagcggtgt aaagcaagaa 120 taaagttaag tgctaccgga caaaagtggt tttgtggaga atattacgaa agaaagccaa 180 aaatctgtga tctcatcggt agaaggaaaa agttaggtta gcgtcggaat gggagcggag 240 gacgacccac cagacggtgg gggatgtacg tcanatcgac cggactgggg acaagacgga 300 gaggcggcag ccgccgtagt acgcaacgga atccaagatg acaacaaagc aaaacgagag 360 gatgtgatct acacccacag agacgccgca ccgtacagag tgtattttga attgaggaat 420 gacgaaaatg gagcgaagaa aatcaacaag tttgggctgg gctcaacact gcgtagcaac 480 gacacgttca aacgtcacat cgtcgacatg aagtatgcag gaagacagaa aatcctggtc 540 ttcctgaaca gctatgtgaa agcgaatcaa ctggtcaaaa gcatcaacga gtcgaacagt 600 atctacagag cgtacgttcc gcaacacctc gtatgtgtta caggtgttat cgccgggatc 660 cctgcagaaa ttccgatcga gcaaatcgag gatgatttgg aatgtgatgt gccaatcgtg 720 ggtgttcgcc ggctcacgcg attcatcgat ggagtgaaga ttccaaccaa ccgcgtcagt 780 gtaacgttca gaacggacag tctacctgac aaggtcagat tgttctgctg ctcaagtcgt 840 gtgcaaccat tcgtgcaaaa agtagtcatt tgcctaaact gccttcgtac aaatcatcgc 900 actgcaaatt gtcgaagtgc taaacgctgc cagcgatgct cagaacgtca cgatgacatt 960 catgagtacg acagatgtga aaagggtaag aagtgcgtga actgcagaag tacggatcac 1020 actacaacag atccgaactg ccccgaaatc aaacgacagg ataaaatcaa aaagctgatg 1080 gcaaagagga acctaacata cgccgaagcg aaagagcaga ttccgattgc aaatcaaaat 1140 ttgtacgaaa ccctgtatga tgctgaggat tttccgactc cggcggaatc ctttgcggat 1200 atgacacgcg gcaatttccg ttggaaggac ccgctgcgag aacaatggat gcaggccaac 1260 aacgagaggg taaaaatgca agctgcagtg aacattggga agtccccaga caaacagccg 1320 aacaggaaga ggaaaacaac cacgaacacg gacaagacaa ccgagaagcg aaatcacatc 1380 cttggaaagg aagcagcaga ggccacaaac ggagtggctc taatcaacaa tcaccgcgtc 1440 gacgaaaagg aaagatggga gaacatgtca aggcaaatca tggaacgagt gcagaaggag 1500 cagcataatc ttctgatgac tttctacgca gactttatcg ctcagttgga taatgacgaa 1560 accaccaaag aaaaatttat gacgtgcacc aaaaggcact tcaacttcgc aaagtcagta 1620 atacaccact gcgacacaaa gtaaggtaag cacttcttca ttatgatagc tcgtaaccag 1680 tacaagatga aaataaataa aattagcctc tacctcatac caaatttaaa attcggggag 1740 ttaagtaaaa taaaaatatt acagtgtaac gtgcagagtc tcgataaaaa taaggcagag 1800 atccacaggg tttttttcgg cgaaggcttc gatgcaagtt tcttatcaga gacatggaca 1860 aatgcaactc ttgaaacttc taaaaaatac cgtatcgctg gatgccacct aattgcctgt 1920 tcaagagatg acggctacgg aggtagcgca atcctcttgc ggaacaacca tagctatact 1980 aaaataaatc tacccgccct gtccaataaa acacaagctg tagcgatttc agtgttatcg 2040 atgaaattag tcttgttttc tgtgtatgtg agcccatcga tcagcgtacc cgacttcaag 2100 gaagatctgc agattttgtt cgacactgtg agacagtttg agaacgtaat tataggaggg 2160 gattttaatg cccatcacgg agactgggga gatgaaaata tggacaggaa aggagacata 2220 cttgcggaca tgattaatga ctcgagctgc cttcttttga atgatggatc cgctacattt 2280 gttccagtac ggttaggatg taggtcttcc gcaattgact tgaccatgtg ttccccgcag 2340 attcantcaa aggctcagtg gcgaacatta gcgtacggaa ttggcagtca ccacttggcg 2400 atcgagatta caataagcaa cgaagtagat gaacgtccga agtattgcta tgataacaaa 2460 aaaattgcga cttcaattgc aaatctggag ccgacgaaaa tacgcaatgt agaggattta 2520 aagtcagcag tgaaccgtat tagaaagtgt aataggagga aagacaatag aacacctaag 2580 ttttggtggt cggatactgt tgaccaagca tggaaggaaa aaacggaggc aaggcgggtt 2640 ttcaacagag aatcctctgt tgaaaatcta atcgaattca aacgtaaagc ggccatcttc 2700 cagaaattga agcgtgcaga aattagaaac caatttgaag agttttcttc tgaaattgga 2760 ccacaaacta gttccaaaga attgtggaac aaagtaggta gactcgctgg aaagagaata 2820 aaacgtatcg aaaacaacac nctattagat gactccaatg cagcaaacca atttctggac 2880 aagcactttg ggccgaatga cgcagaggag tggatcgaac cagtaccagt agcatcgtat 2940 aatttactag atttttcaaa atgggaagca atcattggta aaaagaagaa aagatctgca 3000 ccaggggaag acggtattac atatgaagcc ctacgacaac tgagtccagt agtgatacgg 3060 aatgtgttgg aggatctcaa caggatgtgg attcgtggtt cgctcacaga tgaactcaaa 3120 actattaaaa tagttgcgat acctaagcca ggtagaaatc aatcatcgcc agaaggaaaa 3180 cgaccgattt ctcttgtacc cacaatgacc aagattatga atacagcggt gttagacatc 3240 cttcaaagcc atcttcatca aaaccaaatc ctaccagaga aatcttttgg ttttaggagg 3300 ggtacttcaa cctcgacagc gatttgtttt gtaataaacg aaataaaaag aaataaaaga 3360 gaaaaactat taacggcatt gatttgcgtg gatttgtcca atgcgttcaa tgctgttcgt 3420 acggacattc tggtagaaac gttagctcgt atgagagtta ctgtagaggt cgtaacgtgg 3480 atatcgtcgt tccttaagaa caggaggata gttttgaatc aacgcaacgg ttcaatttca 3540 agaagaatct caaatgggct accacaagga gatgttttat ccccaacgtt gttcaatatc 3600 tacaccgcag tattacacag tatacgcata gaaggagtga cattggtcca atatgcggat 3660 gactttgaga ttatcgtgac agctaacaac tgggaaacgc tgggcatcag agcgcagcaa 3720 tacatggata aattcgtgga aacctgcaat gagctaaatt ttgcaataaa ctccgaaaaa 3780 tctagcgtaa tggtgatgag tgcaagtagc agatccctga atattaaaat caatggcaat 3840 gtggttgaaa cggtccagaa acaaaattac ctagggatca caatcgatcg gttcttgaca 3900 ttcggggctc atattaggct tgttcgagaa aaagtccagg agagattatg catgctgaag 3960 gttctgaatg gcgccaggac aggaacacat ccagacacaa tgctgaaagt ataccaagcc 4020 cttatccgga gtattatgga atatggttgt tctgcacaca ataatgctag caaaacaaca 4080 agaaggatcg tagaagtgat aaacaatcaa agcttaagga aagtcactgg atccacaaag 4140 tctacgccgc taaacactct atctgcgatt agcggtcaac cgccattgaa ttttagacag 4200 gagatggtca cttgtcggga aattgctcga tgtctatcaa gagataacgt tcttggaaaa 4260 caattaaaaa atatagaaga tgaggaagaa catggacagg aaaagttctc ttacatggag 4320 caaatctatc tgcgaaatag aactacattc gatgttctta tgcagaccga acagattgta 4380 gatgtacacg aagttgaaat aaatccatat ttgaacgacg aaaatcctaa aaaagaccat 4440 gtcagtcaag taaaactgaa acagctttcc cttggtatga tcaacgggaa atacaggggc 4500 agaggtagaa tttggactga tgcatctaaa gacggagaga aatgcggaat cggaatattc 4560 gtagaaggca acaaagcacg ctactactat cggttactac acaacactag catcacatca 4620 gcagaactca cagctatctg gctggctatg caaatagttg agaaagatca actgatgcac 4680 tacgttatac taaccgactc taggtcatcc tgccaaattc tcgaaaatgg agtagatagt 4740 ggtgaaggag aaacggttat cgcagaaata ctacaggtag ctaaaagatg gaaagttaca 4800 attcagtggg taccaagcca catccagctc gccggtaatg atgttgcaga catgttggcc 4860 aaggaaggaa cgaaagacga agcaccaata ctcgaaaaca aactattcct caaggatgtt 4920 tgtctaaaat tcaccaaaca actggaggag aaaactagac gttggtatga ggagctgtct 4980 caggaaaaag gaaagaagta ctacggaata cgacccgtat ggaatgcaca gccatggttc 5040 gcaaaactgg atctaaaagg aaaggacatt cgtttactaa atcggctcat gtcggggcac 5100 gattactcga aatactggtt ggcaaagatg agactaaaag atagcccaga ttgtgacctg 5160 tgtgatgagc ctgaaactgc cgagcacgta atattacact gcccccgcta tggtatgcag 5220 cgacatcact acagttttga ttgtcgttat gctaatttag aagaaatttt taaaactggt 5280 gataaagagc ttttcgaaga agttgcaaac tttgtgagcg aaatgaaact agaattatga 5340 ataacatgct gaaagcataa aataactggg catatagtct agtaactggc ccaacataca 5400 gagtttgtca accgacagct cgatacagaa agaagaagaa gaagaaga 5448 // ID Gypsy20-LTR_Dya repbase; DNA; INV; 286 BP. XX AC chrU; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy20_Dya; KW Gypsy20-I_Dya; Gypsy20-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-286 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1119-1119 (2009). XX DR Genome; chrU; Positions 1718888 1719173. XX SQ Sequence 286 BP; 100 A; 58 C; 42 G; 86 T; 0 other; tgacatattc atttcttcat acgacatatt cactcttcat ataaaaaagc taaagccgat 60 cttttggaaa gcttcactct ccaaaagcct cataaataac attcccccaa gatacgaacc 120 gcttaacggt aacgagctta atgatctgtt aagcttgaca tataaagcta agtagaactt 180 aaaaggctta tgttaatcct ctgtaaaagg tttgctgggc cgaaagaata cctttgtaaa 240 ttagttctta actgacctct ggtgaaactt attcaaataa agatca 286 // ID Gypsy-108_AA-LTR repbase; DNA; INV; 191 BP. XX AC AAGE02028575; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-108_AA_; KW Gypsy-108_AA-I; Gypsy-108_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-191 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028575; Positions 20368 20178. XX SQ Sequence 191 BP; 57 A; 26 C; 49 G; 59 T; 0 other; tgttgtatca ttgacacacc cctagattgg tagcatcttg agagtagagt agtattggta 60 gcgcagtgta gagagtagag cggaagtaag atgaacgtcc attgtcttgt attcgcgtgt 120 ggagtaaaga tcgtcagagg aataaactaa gttaatacgt ttaattctgg tcttttattg 180 cgaatataac a 191 // ID Tx1-9_BF repbase; DNA; INV; 4784 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-9_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-9_BF; KW Tx1-9_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4784 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4784 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 846-846 (2009). XX DR [2] (Consensus) XX CC ORF1 is incomplete. XX FH Key Location/Qualifiers FT CDS 655..4446 FT /product="Tx1-9_BF_2p" FT /note="endonuclease and RT." FT /translation="MNTTIGSYNVNGIANPRKRREVFNWLKEKNYDIICLQ FT ETHSSPEHEMAWKQDWHGEVFFNHGTTNQRGVLILCKENLKFTVNNCTKDD FT DGRLLIIDISINDLNFCICNLYAPNRDNPDFFDNIEKLISNYNHGNFIIVG FT DFNTVQNAKLDRAGRNPAVYHPNAQQKINDLMFALELVDVWRYKHPNTVRF FT TWRRGKQASRLDYFLVSFSLLADISNVRILDRLKSDHKIISLNIQTVKHPR FT GRNYWKFNQTLLTDVNFQQKTEEFIDEFFRFNIGTSDPLTVWDTFKCCFRG FT HTISFASKKRKEFISKENDIKKHIEQLNEKLDETDEPSQTVLDELSSLQSE FT LEILYEQKSDQMFYKQRADWMEFSDRCTKQFLDLGRRNSSKKNVINLKTTD FT GENIQDPKRAIEEIKSFYHSLYSFSDPPKDLDDDNCEPFFPGDLHVHLDTE FT QKNSCEGLITEKELWEAIDSFKEGKTPGIDGIPVEVYKTFFRNLKNPMLHL FT FNHAYESGNLSDSQKEGLITLLLKQDSNGEYKDPTALNNWRPLSLLCCDVR FT ILSKVISLRIRKVISNLIHKDQCGFLQNRFIGENICQITEIIQLYEKEKLP FT GMIFIADFEKAFDKLRWDFMFRALRFFGFGESFIRWVSVLYKDITSKVINN FT GYISTPFKLSRGVRQGCPLSPYLFILCVELLAVKIRSNNNIKGITLHETEI FT KVSQFADDSNFPLQPTLTSLLALTKDLNNFSEISGLDPNYSKCKILRIGTL FT RNTHFTLPCNLPVQWTDGPVKMLGVGITGDTNKLTKNYEDRLIKIDRTLLP FT WKMIPMSLYGKVTLINTLVVSQFTHLFMSLPSPEKSFFQRYEQKVFRFIWN FT GKPEKIKRKVIYNTYDKGGLGLIHLPTFDRTRKASWVPRMFCQHNSNRKPL FT LDTSSVVFSRHLYPFLQLSLDDITSMRPLCTISPFFKDVLKAWLSLQYKPP FT ETYKDIQNQIIWCNSNILVESKPIALETPMSCGMIHINDILDLDGKLLSYD FT DVIKKFGKVIGRMGYNSIVSAIPNDWKKELLKHSPTVGPIIPSAAHYIWLK FT TCRIHKKTYSFFLQKNNIAMSPTNIQSYWEDEFNVPLPWHHIFKLIYKTTI FT SPRLRILQYKIVHKTLATRKMLHRWKIADSPLCVHCKLEEETIRHIFWECP FT RVHELWTKTQKWLESIFLQKYNFDAMNVIMGEFRLDHPPILNMIILLAKNF FT IMHNNQQHLSMKQFMNIIKTHEHIEYIIAVRKQKLNYHLKKWETLRLALFP FT TVP" XX SQ Sequence 4784 BP; 1619 A; 818 C; 818 G; 1529 T; 0 other; atctcgacag gttaaacagg ttagccctgt gaagtttcat tatattactc ttatgttgta 60 gtacttgcta ttattttcga atgaacgaat aatacaataa ccttgtctag attttcctaa 120 gatggccaat tcttattgtt ccactgttgc gctttacttg atatgtaatt acagggcttt 180 ttttttgtta ttgatccaca taccaaaatt gatccgtatg tgacattatg acttgtaagg 240 ttagccctgt ctagaaaatt ctcggttaaa tatacatgaa taactgtaaa aataagagcc 300 aatcacgtaa attatgataa cggccggccg acggatggtc agggtgcctg aaaccctgta 360 tatatgtttt tgtttcatct tttctgggaa ccatcctagc atgttatatg ctgttagact 420 ttgtttagtt tctaaggtgg ttgacaccct acctttgttt gttaactgtg tttccgttat 480 cttgagttat atctgtatgt gtgtttatgt tcaatatctg gcttactcat gtaacccagt 540 gaactattgt ttatgttcgg ataccacgga tcctgaaatt gttatatata tgtatgtcgt 600 aatcaagtac tcccatagtt acacattacg ttatatgaat gaaaaacctc cgacatgaat 660 accactattg gctcatataa cgtaaatgga attgcaaacc cgcgtaaaag acgtgaggtg 720 tttaactggc ttaaagaaaa aaattatgac ataatttgtt tacaagaaac ccattcatca 780 ccagaacacg aaatggcctg gaaacaagat tggcatgggg aagtcttttt taatcacgga 840 acaacaaacc aacggggagt gttgatttta tgcaaggaaa acttaaaatt tactgtcaat 900 aactgtacta aagatgatga tgggcgactg ttgataatag acatttctat aaatgacttg 960 aatttctgta tttgcaattt gtacgcacca aatagggaca accccgactt ttttgacaat 1020 attgaaaaat tgatatctaa ttataatcat ggaaatttta taatcgttgg ggatttcaat 1080 acagtccaga acgccaaatt agatagagcc ggacgaaacc cagcggtcta tcacccaaat 1140 gcacagcaga aaattaatga tcttatgttt gctttagaat tggttgatgt atggcgatat 1200 aaacatccta atacagttag attcacatgg agaaggggga aacaagccag ccgtttggat 1260 tattttctgg tttctttttc gcttttagca gatatttcaa atgttagaat tttggaccgg 1320 ctcaagtccg atcacaagat tatttcacta aatatacaaa ccgttaaaca cccacgtggg 1380 aggaattatt ggaaattcaa ccaaacattg ctgacagatg taaattttca gcagaaaact 1440 gaagaattca ttgatgagtt ttttagattc aatattggta cgagtgaccc gttaacggta 1500 tgggatactt ttaagtgttg ttttagagga cacacaatca gttttgcctc caagaaacga 1560 aaagaattta tctcaaaaga aaatgatatc aaaaaacata tcgaacagtt aaacgaaaaa 1620 ttagatgaga cagatgagcc ttcacaaact gtacttgacg aacttagctc actacagagt 1680 gaattagaga tactatatga acagaaatca gatcagatgt tctataaaca gagagctgat 1740 tggatggaat tttctgacag gtgtacgaaa caatttcttg atcttggtcg acgaaattcc 1800 tctaagaaaa atgttataaa tttgaaaaca acagacggag aaaatataca ggaccctaaa 1860 cgggctatag aagaaatcaa gtcattttac cattctcttt attcctttag cgatccccct 1920 aaagacttag atgatgataa ttgtgaaccc ttttttccag gtgacctaca tgtacattta 1980 gatactgaac agaaaaactc ctgtgagggc cttattacag aaaaagagct atgggaagcc 2040 atagatagtt ttaaagaagg caaaactcca ggaatcgacg gcattccagt cgaggtttat 2100 aagacttttt ttcgaaattt aaagaatcca atgctacatt tatttaatca tgcatatgaa 2160 agtgggaact tatcagattc acagaaagaa ggattaatta ctttattatt gaaacaagat 2220 tcgaatggag aatataaaga cccgactgct cttaataatt ggcgaccctt atcattgtta 2280 tgttgcgacg tacgtatttt gtcaaaagtt atctcactta ggataagaaa agttatttct 2340 aatttgatac ataaagatca atgtggtttt ttgcaaaaca gatttatagg ggaaaacata 2400 tgtcaaataa ctgaaataat acaactctat gaaaaagaaa aattgcctgg tatgatcttt 2460 atagctgatt ttgaaaaagc attcgataaa ctaaggtggg attttatgtt tagggctttg 2520 cggttttttg gatttggaga atcatttata cgctgggttt cggtactcta taaggacatt 2580 actagcaaag taataaataa tggttatatt tcaactccat tcaagctatc tcgtggagta 2640 cgtcaagggt gtcccctctc accgtatctg tttattttat gcgttgaatt gcttgccgtg 2700 aagataagat ccaataacaa tataaaggga attacattgc acgaaacaga aattaaggta 2760 tcccaatttg cagacgattc aaattttcca ctacagccaa ctttaacatc actcctagca 2820 ctgacaaaag acctcaacaa tttttcagaa atttctggtt tagatccaaa ttatagtaag 2880 tgtaaaatat taagaatagg aacgctaagg aatacccatt tcactttgcc atgtaacctc 2940 ccagttcaat ggacagatgg accagttaaa atgcttggag ttgggatcac aggtgacaca 3000 aacaaactta cgaaaaacta tgaagatcga ttaattaaaa ttgacagaac ccttctgcca 3060 tggaaaatga ttccaatgtc cctatacgga aaggtcactc tcattaatac ccttgttgtg 3120 tcgcaattta ctcacttatt tatgtcatta ccatccccag aaaaatcatt tttccaaaga 3180 tatgagcaga aagtttttag atttatttgg aatggtaaac ctgaaaaaat taaacgtaaa 3240 gttatttaca atacatatga taagggtggt ttaggattga tacatctacc cacattcgat 3300 cgtacacgca aagcttcttg ggttccgaga atgttttgtc agcacaactc aaatcgcaag 3360 cctcttttag atacttcttc tgtggtattc agcagacatc tttacccatt tttacaactg 3420 tctttagatg atataacttc tatgcgacca ctttgtacca ttagtccttt ctttaaagac 3480 gtcttaaagg cgtggttgtc cctacaatat aagcctcctg aaacatacaa agatatacaa 3540 aatcaaatta tttggtgcaa ctcgaatatc cttgtagaaa gtaaacctat agctttagaa 3600 actccaatgt catgtgggat gattcatatc aatgatatac tagaccttga tggcaaatta 3660 ttgtcatatg atgatgttat taaaaaattc ggtaaagtta ttggcagaat gggttacaat 3720 agtatagtct ccgcaattcc aaatgattgg aaaaaggaac tacttaaaca ttccccaacc 3780 gtaggtccaa ttataccatc tgcagcccac tacatatggc taaaaacatg tcgaattcat 3840 aaaaagacat attcgttctt cctacagaaa aacaatatag caatgtcccc tacgaatatt 3900 cagtcttatt gggaagatga atttaatgta cctctcccat ggcatcacat tttcaagtta 3960 atttataaaa caacgatttc acccagatta agaatacttc aatacaaaat agtgcataaa 4020 acgttggcaa caagaaaaat gttgcatcgg tggaaaattg ctgatagtcc attgtgtgtt 4080 cactgtaaac ttgaagaaga aacaatacga catattttct gggaatgccc cagagttcat 4140 gagttgtgga caaaaacaca aaaatggcta gaatcaattt ttctccaaaa gtataatttt 4200 gatgcgatga atgtgataat gggagagttt aggttagacc atccaccaat tttaaacatg 4260 ataatcctac ttgccaaaaa ctttataatg cacaacaacc agcaacatct ctctatgaag 4320 caatttatga atataataaa aacccatgaa catatagaat acatcattgc tgtaagaaaa 4380 caaaagctga attatcattt aaaaaaatgg gagacacttc gcttagctct gtttcctact 4440 gtcccttgat tattcattta ttattgtaaa cttgtttaat tctgttgttt tcattcatat 4500 gttatcatga ttgtattttg ttacaaaaat taatttcctt atgtccagca tattatcttg 4560 tgttacagtt ttgtttactt ttatcacctt gttcagaata atttctgtat atcgatctgc 4620 gtatttattt gtaattacta ctggatttgt attcataaga attagtccat ttacgttctt 4680 tctacttgct tcttcatcac aggatccgtg gtatttcctt tgtatgtatt ctttgtttgt 4740 ccttatgcga aaataataaa aagataaata aaaaaaaaaa aaaa 4784 // ID SMAR23 repbase; DNA; INV; 1307 BP. XX AC . XX DT 07-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR23. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1307 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1081-1081 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 176..1210 FT /product="SMAR23_1p" FT /translation="MNPVEYRSVIKFLVLRQTESSEIFSQLSATYGTTAPS FT KTTVYHWIAQYRGGRQSVFDDERPGRPVEIDHENIAKRCEIIVRDERRITL FT KELQRRLNISDGKVREILKDLGIRKLASRFVPRFLSGEMCQARLECCQQCL FT SLNDQHGSRFLDNIVTVDETPLSLYLPESKRESSEWRFPDERAPRKMRAGT FT SHRRCLMLTIFWDSRGVIKVDFAEKGVTINSAYYADLIAETRRLRRKPRGS FT PLWLLQDNAPVHKSAVSKQAIDDAGFSIVPHPPYSPDLAPSDFWMFRHMKK FT TMRGKIFETPDSVKDSVVKFLSECDQNFFKTAFLELLKRWTTCVNNNGSYI FT EK" XX SQ Sequence 1307 BP; 360 A; 293 C; 310 G; 344 T; 0 other; tacgtggtac gcgcattaag tccccggact aagcaagaaa taatgagtag aagttaatga 60 gattttatat ttaatatatt atgtatatat aggttaataa tataacattt gttttaatat 120 ttcattgttt gtattgtttc agctcacgca tattcaaaat ttcgtcaaaa ttgctatgaa 180 tccagtcgag taccgcagcg tcataaagtt tttggtgcta cgccaaactg agagctccga 240 aattttcagt caactctcag caacttacgg aacaactgcg ccttccaaaa cgacagtcta 300 ccactggatt gctcagtatc gtgggggacg acaaagtgtt ttcgacgacg aaagacctgg 360 acgacccgtg gaaattgacc atgaaaatat cgcaaaacgt tgcgagatca tcgttaggga 420 tgagcggcgg atcaccctga aggagctgca acgtagactg aatatcagtg atggaaaagt 480 acgagaaata ttgaaagacc tcggcatccg gaagctggca tcacgttttg tccctcgatt 540 tctgtccggt gagatgtgcc aggctcgtct ggagtgttgt cagcaatgcc tgtctctgaa 600 cgatcaacat ggctctcgct tcttggataa catcgttact gtggatgaga cacccctcag 660 tctttacctc ccagaaagca agagggaatc ctcagaatgg cgattccctg atgaaagagc 720 acctcgaaaa atgcgtgctg gaacgtccca tcggcgttgt ctcatgttaa ccatcttttg 780 ggacagccgt ggtgtcatca aagtcgactt cgctgagaaa ggtgtgacaa tcaacagtgc 840 gtactatgcc gacttgattg ctgaaacccg aagacttcga cgcaaaccca gaggctctcc 900 actgtggctg ttgcaggaca atgcaccagt ccacaaaagt gctgtttcca agcaggccat 960 cgacgatgcc ggtttctcga tcgtgcctca cccgccatac agtcccgacc tggcgccgtc 1020 ggatttttgg atgttcagac atatgaagaa gacgatgcga ggcaagatct ttgaaacccc 1080 cgacagtgtc aaggacagcg tcgtgaaatt cctctccgaa tgtgaccaaa atttctttaa 1140 aacggcattt ttggaactgc tgaagcgctg gacgacgtgt gtgaacaaca atgggtctta 1200 cattgagaaa tgagggtata catgttgctt atttgttgtg cgaaatatga agtttctgtg 1260 acttaccttt catagttagt ccggggactt aatgcgcgta ccacgta 1307 // ID BEL-3_DGri-I repbase; DNA; INV; 5418 BP. XX AC scaffold_14822; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_DGri_; KW BEL-3_DGri-LTR; BEL-3_DGri-I. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-5418 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_14822; Positions 773933 779350. XX CC Positions [4588-5019] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1798..2973,2977..4470) FT /product="BEL-3_DGri-I_1p" FT /translation="MACEQFKALDALQRFDVVKRHRLCLNCLSKGHQMADC FT PSTNRCRHYQKSDHTLLHKKLRSTTQPSSSPATEVAHNHSQIKKRYQSHVL FT LATAMILVRDSTGTYRLGRALLDSYSQVNFVTDSFAQKLRLKREKHVVQIS FT SIGHARTEITSLTTISIRSRLSPFEVSLDLCVTSHIAYQPDINIDISRWKM FT PQNIVLAGEKFYQSRHIDMLLGTEAFFDILSVGQVKLDITGPMLQKTLFCW FT VVTGKCHPQQSREIAASFKLGATSIDDHLKRLWEIESIETSNVLLKPEHRA FT CEEHYLKTTVTNDSGRIVVRLPFKNNPSQLGHSFDIARRRFLSLERRLAQS FT SGLRLQYCQFMEEYEHLEYMSLVTEPHLSEPHYYIPHHCVLKPVFDVSCTS FT NQTSLNDHLLVGPTLQDDLYLLLLRFRLHRYAITVDGTKMYRQVNVDLNDR FT KYQYILWRALPDEPLRTYQLNTVTYGTASAPYLTIRSLHYLANLCRTEYPI FT GAVIESSFYVDDLLSGGEDIETLRTIKDQVTKILHRGHFPLSKWHSNHAQF FT IENFAIKDINTCDNGLTSALGVSWNQHTDMLLFNFKPKIVSSTVTKRTILS FT IASALFDPLGLVSPLIIVAKIILQELWIAGLTWDESVPQHLELAWKKCLES FT FQTISSLAVPRYCKQVNQSSFQLHGFCDASIRAYGCCIYVRSESACGNIGV FT HLLTAKSRVAPVKKKSLPKLELCEAHLLAQRRATCRIFLVGFANCFTLAKI FT TLRHVINICRKSSFWSWRFVPTTSNPADIVSRDATTSDLQSSIWLSGPAFL FT TQYHEHWPTSSNISNIDMQVINAEQRKSTFVATTDRNHVLEKLANISSYNR FT CVRTVAYMLRFVQITHMNLRNCHAGLKAFVALIRLEY" XX SQ Sequence 5418 BP; 1519 A; 1288 C; 1098 G; 1513 T; 0 other; ttttggcgcc caacttcgca tttcgttacg ttaacaaatt ccgcgtgcgc gtatatacaa 60 actaaacaaa ttcgccttgg cttccaattg cttggttgca acaattaaag catcaaattc 120 gtcgcatgtg aaaatagtta accagaacca acacaaagtt catcggttat tgcattgtgt 180 ctctatattc gttttgctgg ctgtctgcgc cgttttcgct catggtctac tttgttcgcc 240 gctcttattg catatttcgc tactgctaaa acggccagtg cgctgctcgc aatcgtcgct 300 tcattcgctg cctttagtgg tttatcccaa attcattctt cgttcgtttg tgtaagagct 360 gcaactgcga tcgatactgt tgtagttcat atacacacat acacacaaca atcattttcg 420 tcgtttttgg cgcatcaagc agaacttaag ctccgcgcca agttctgtaa atatgtcgca 480 ttccttggaa gactaccgac gccagcgcgc aaacaccaaa aggaacattt cccgaaaaaa 540 aattttggtc gatggacaat ctgcagattc ttctaaaaga tcacctgcgg atcttcaatg 600 tcgtttaggc attttggaat cgtattttaa gcaatcgttg gctattcaat cggaaataga 660 aactctgtat ccaaatgata atgcgtgcgc tgaactggaa gacaactatg tacacatcaa 720 gttggcgatt aagaaattgc ttggtgaaga cttaaatagt acagtcactg atgacgctac 780 atcacacgtt gcttaccatc ctcaactatc acgtctgcct tttttggcct tgcccacatt 840 cgatggcaat catctggaat ataaaacttc atttgctcgt ttaatcaaat cgtgaaagga 900 gcgcaaataa caaacattga gaagtttaac tatctactca gctgcttaaa gggtgttgca 960 ctggagtgga atcgtatttt aagcaattgt tggctattca atcggaaata gaaactctgt 1020 atccaaatga taatgcgtgc gctgaactgg aagacaacta tgtacacatc aagttggcga 1080 ttaagaaatt gcttagtgaa gacttaaata gtacagtcac tgatgacgct acatcacacg 1140 ttgcttacca tcctcaacta tcacgtctgc cttttttggc cttgcccaca ttcgatggca 1200 atcatctgga atataaaact tcatttgctc gtttaatcaa atcgtgaaag gagcgcaaat 1260 aacaaacatt gagaagttta actatctact cagctgctta aagggtgttg cactggagtg 1320 cgtgaaggca tttccagtca ccaatgaaaa ctacgctaag gcattcgaaa attaaagtct 1380 cggtatgaca agccaataga gacaatatcg tctatttttc agttgccttc agctagttca 1440 tcaaacgcta ctcagctacg gtcgttaatt cataatgcat cggcattatt tagttcactt 1500 tcgtccttgg gctctggcat cgacatagcg cgagtcatgt taatctatgt ggttctggat 1560 aaatgcgatc aggaaactag aaacaaatgg aatttatcgc tggattatac aaaaatcccg 1620 agttgagctc agtgcgtaca agtactggag cgtcattgtc aattcctgtt gtccaaaggt 1680 acaacaatta accaaacgta ttcaaaacag tttcgtaatc ctggcaaagg acataattcg 1740 tctttcgcac ttgctatcgc tgcttgtgca ttgtgctcaa gttcaactca caaattgatg 1800 gcttgcgaac aatttaaagc attggatgcg ctgcaacgtt ttgacgtcgt caaacgtcac 1860 cgactgtgtc ttaattgctt gtccaagggt catcaaatgg ccgattgccc ttcaacaaat 1920 cgctgtcgcc attatcagaa atcggaccac actttgttac acaagaagct acgttctaca 1980 acacaaccaa gctcttcccc agccaccgaa gtagcacaca accattcgca aatcaaaaaa 2040 cgttatcaat ctcacgttct tttagcaacg gctatgatcc tggtcagaga ttcaactggc 2100 acctacagac taggccgtgc tctactggac tcgtattcgc aggtgaattt cgtcaccgat 2160 tcgtttgctc agaagcttcg tctaaaaaga gagaaacacg ttgtgcaaat cagcagtatt 2220 ggccatgcac gtacagaaat cacttcgctt acaactatat caataaggtc tcgtttgtcg 2280 ccctttgaag tttcgttaga cctttgcgtc acctcgcata tagcctatca gccggacatc 2340 aatatagaca tttcaagatg gaaaatgcca caaaatattg tgctggcagg tgagaaattc 2400 taccaatctc gacacatcga catgctttta ggaacagaag cattcttcga cattctgtca 2460 gtgggccaag ttaaacttga tataactggg cccatgctcc aaaagacgct gttttgttgg 2520 gtagtgacag gaaagtgtca tccgcaacag tctcgagaga ttgctgctag ttttaagctt 2580 ggtgccactt caattgacga tcaccttaaa cgcttatggg aaattgaatc aattgaaact 2640 tcaaatgtgc tgttgaaacc agaacatcga gcttgtgagg aacactacct aaaaacgact 2700 gttacaaatg acagtggtcg cattgttgta aggctaccgt ttaagaacaa tcccagtcaa 2760 cttggtcatt cattcgacat agcacgccgc agatttctca gcttggaacg ccgtttggca 2820 caatcttcgg gacttcgctt gcagtattgc caattcatgg aggagtatga acatttggaa 2880 tacatgtcgc tggtcacaga accacactta tcggaacctc actactacat acctcaccat 2940 tgcgttttga agcccgtctt cgacgtctcg tgttgaacat ccaatcaaac ttcgctcaat 3000 gatcatttac ttgtcggccc gactctccaa gacgatttat atttgctgtt gttgcgtttc 3060 agattacatc gttatgccat aacagttgat ggaaccaaga tgtataggca agtaaatgtt 3120 gatttaaatg atcgcaaata ccagtacata ctctggcgag ctcttccaga tgaaccactt 3180 cgcacttatc aactcaacac ggttacatat ggaacagcct cggcgccata cttaaccata 3240 cgcagcctgc attatttggc aaatctgtgt cgaactgagt atccaatcgg tgctgtcatt 3300 gaatcgtcgt tctatgtaga tgacctgtta tctggaggtg aggacataga aactcttcgc 3360 acaatcaagg accaagtcac gaaaattctt catcgtggtc actttccgtt atctaagtgg 3420 cattcaaatc atgcccaatt tatcgaaaac ttcgctatca aggatataaa tacgtgtgac 3480 aacggactga ccagcgcact tggcgtcagc tggaatcaac acaccgatat gctgctattt 3540 aatttcaaac cgaaaatcgt ttcgtcaacc gtcaccaaga ggaccattct gtcaattgca 3600 tcagcgttgt tcgaccctct cggactagtg tcaccactca taatcgtcgc aaagataatc 3660 ctacaggagc tctggattgc agggcttact tgggacgaat cggtaccaca gcatcttgaa 3720 ctggcctgga agaagtgcct ggaatcattc caaaccattt catcacttgc tgtaccccgg 3780 tactgcaaac aggtcaacca aagctcattt caattgcatg ggttctgcga cgcttcgatt 3840 cgagcgtacg gttgttgcat ctacgtgcgg tcggaatctg cttgcggaaa cattggtgtt 3900 catcttctca ctgccaagtc cagagtggcc ccagtaaaga agaaatcgct tcccaaactg 3960 gaattatgtg aagcacattt gttagcacag cgaagagcca catgccgtat atttctggtc 4020 ggattcgcaa attgttttac attggctaaa ataacactca gacacgttat caacatttgt 4080 cggaaatcga gtttctggag ttggagattt gtaccaacga catcaaatcc tgctgacatt 4140 gtttctcgag atgccaccac ttctgattta caatcatcca tatggctctc aggaccagct 4200 tttttaaccc aatatcatga acattggcca acatcaagca acatcagcaa cattgatatg 4260 caagtaatca atgccgaaca acgtaaatct actttcgtcg ccacaacgga tcgtaaccat 4320 gtcctggaga aactagcgaa catcagctca tacaatcgtt gtgttcgcac tgtcgcttac 4380 atgctgcgct ttgttcaaat cactcatatg aatcttcgca actgccatgc tggacttaaa 4440 gctttcgtgg cactcattcg attggaatat tgaatcgtaa acgccagaga cttggcacgt 4500 cgcatcgtgc acacctgcat ggcgtgtgtt cggtacaagc caaaattgga aaggcagctt 4560 atgggatcgc tgccagtgga gcgccttcaa tcggaacacc cattccaacg ctgtggtatc 4620 gacttctgtg agccaataaa tacgtatgtt cgcatactag gaaagggtcc cacaaaatct 4680 tacttggccg tattcatttg tttggcatcc aaggcggtgc acatcgaagt ggtctgtcaa 4740 ttatccacga agacattctt ggctgcactc aaacgaatgg ttcctcggcg agcgctgccc 4800 actgacatat attgtgataa tggaactaac ttcgtgggtg cagcaaatga gctaaaggct 4860 ttgaaacaat ttttgtttga tcaatctatt caagacgcaa tatctgaata ctgtgcttcg 4920 gatttcgtgt cgtttcattt tattccgccc agggcacccc actttggcgg actctgggaa 4980 atagcagtta agagcgcaag acgctatcga acactcgcat aacatttgag gagttctgta 5040 cgatcacaac agagatcgaa gctgtcctca attctcgccc attgtcacct atgtcgcctg 5100 atcccaacga tctatcagta atcaccccag gacaccccgg gctcgaacca aatggacatc 5160 atcgtcgtca aatctcgcta ctggaaccat gacgacaatc ttccgcctca gcaatggaag 5220 ctcggccgca tcgaagcact tgtgcctgga aaggatggtc atgttcgagc tgttcatctt 5280 cgtatagcca atggcatctg ctgccgacca gtacacaaat tggccatctt accgatggag 5340 gcttgatgtg ttgaaagcag atcctttcaa ggtggccggg atgtttggtc caactaattc 5400 aaacattcaa tttatcgc 5418 // ID Gypsy-77_AA-I repbase; DNA; INV; 5879 BP. XX AC supercont1.255; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-77_AA_; KW Gypsy-77_AA-LTR; Gypsy-77_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5879 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.255; Positions 965312 959434. XX CC Positions [4888-5367] - Integrase core CC 'GGGGG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 797..2113 FT /product="Gypsy-77_AA-I_2p" FT /translation="MNRLNELKALFDKEYSNVKKSKNNGKSLNLIQIKKKL FT FADYIQEFKEILTLLRDSLETDLFNKLVNEYTQLKTKRDAAYMILDASSIL FT PTNQKFKEIAHKVIEQEKLRKERLKNLTMDVKTIMNILQPYDGNAEGLDAF FT IDGIQLAKELVKQEHLDTLLKLAKTRLTGKARSGLSPDINTIDGLVDDIKQ FT RCKDSTTPETIIAQMKALKIKHNETPKEFCDSIEKCCAKLKTTYINHQIPP FT AVADKMSTKVGIDSLISGINNPEIKLILKAGTFTNLAEAITKINENSLQQT FT NQGTQILSFTSTGGQASRGNFRRGSRGGYRQSNNNRSYNQNHNSYTQNRYN FT NNFRSRGNFQRYHQGSGNNFRGRRGNNRHVFSIQSGQPQPEMEQLQQLLPS FT MVIGPVNHHQPQGLHPQAIPQLQQATYQQPNFFGQIQRGYLNSSL" FT CDS 3019..5718 FT /product="Gypsy-77_AA-I_1p" FT /translation="MISDDIIEPSISNFNSPILLVPKKSEDNTKKWRLVVD FT FRQLNKKILADKFPLPRIDSILDQLGRAKYFTTLDLMAGFHQIPLAEEARK FT YTAFSSPDGHFQFKRLPFGLNISPNSFQRMMNIALAGLTPECAFVYIDDIV FT VIGCSIDHHLNNLEQIFKRLRHYNLKLNPQKCKFFRTEVTYLGHKITDSGI FT LPDDSKFQTIRDYPEPTNIDEVRRFVAFCNYYRKFVPNFAHIAKPLNNLLK FT KGMTFSWNDERRRSFQSLKHHLLSPRILQYPDFSKDFILTTDASDVACGAI FT LSQLHENGDLPIAFASKSFTKGEKSKPVIEKELTAIHWAIDYFKPYLYGRR FT FKVRTDHRPLVYLFGMNKPTSKLTRMRLDLEEYEFDIEYLPGRANVGADAL FT SRIPTKSEDLKSLVLVVNTRSMTKKQKQNNIPRRLEETSRGIDHLTVWRTE FT NPSEVNKLLKISCVVHHNQLNIMVNNHNFKKILGTVTIPLTHNNGSRTLEF FT ALLEISKLLKIYKRDKIALSEEDILFQYFSLQTIKEIANTAISSYQIIIFK FT PPRFIHKQEEIREILTRHHTTPTGGHVGQHRLYLKLREIYKWKNMKFDIKN FT FIKACEKCKINKIHRHTKEAQVVTTTPSKPFEVLSADTVGPFTRTNHGNRY FT ILTIQCNLTKYVILIPIPTKEANVIAKALVKNFILIYGNFLELRTDQGTEY FT RNEVLDQICQLLQVKQTFSTPYHPESIGALERNHRCLNEYLRSFTNAHQTD FT WDEWIKFYAFTFNTTPHTEHNYTPYELIFGRKATLPHDLELNRPTLEPIYN FT LEEYYNELKFKLQVTNKIAHQNLINKKEARQITLNNNTNPIDVKIGDHVYI FT TNENRRKLDPFYIGPFRIIEVQDPNCVIENIQTCKQNIIHKNRIIKA" XX SQ Sequence 5879 BP; 2279 A; 1072 C; 980 G; 1548 T; 0 other; tggcgaccgt gaggcagtcc ttagaataag taatcagtaa aagtttaaaa gtgcagtgta 60 aaatgggcaa tacctcagat aaagaagaaa gctccaagac ggttggagat caaactgtga 120 caatcgtcga aaatcaaaag gtacatactg gttttcatga aagccacgag tggaaactga 180 atataatact agtgcttcta gtgatacaaa ctgcaattat ggttgtgaaa acgttagcgc 240 tgatattaag aaaagctttt gctaaaaccg cagaacgagc cgtcgcagtg cacaacgtct 300 agtgagcata cggtgcataa aaacttcgct acgcataaaa tcactgctac tagcacaaca 360 gtgtggaatg gaacaaaaac attttttttc caaaaaaaaa aaacgaaccc atagcgcgcc 420 gcagtgtaat cgacaaacag taaaacgatg aattcgcagt gtatggctgc cgaacacata 480 agaacatggt gactaatatt aaaatttacg acggataccg ttagttaaat gaagactgat 540 tgagttcaat gtgttgttag ggaacagtga gcaaaaaaaa aaaacgcaaa caagtaagca 600 acatgatgtt gaggagtgcg cctccatcga gaagtaaaca acttgtatga tgcagcaagg 660 tgcggaggag cgcgccattt tcacctagcc gatccgattt gcgtctccag accaaggaat 720 gagagacaac acttagtcca aatgtgagta gtacataagt attttttttt ttattatacc 780 gatataaaag cacccaatga acagacttaa tgaacttaaa gcattatttg acaaagaata 840 ctcaaatgtt aagaagagta aaaataatgg aaagagtcta aatttaattc aaatcaaaaa 900 gaagttattt gcagattaca ttcaagaatt taaagaaata ttgaccctac tcagagactc 960 tttggagacc gacctattta acaaactagt aaacgaatac acacagttaa aaaccaaacg 1020 agacgccgca tatatgatat tagatgctag ttcaatacta ccgacaaatc aaaaattcaa 1080 ggaaatagcc cacaaagtta ttgaacaaga aaaattgagg aaagaaagac tgaaaaatct 1140 gacaatggac gttaaaacta ttatgaacat tctccaacct tacgacggaa acgccgaagg 1200 tttggacgct tttatagatg gtattcaatt agctaaagaa ttagtaaaac aagaacatct 1260 cgatacatta ttgaaattag caaagacaag gctgactggt aaagctcgat caggactcag 1320 cccagatata aacaccatag atggtttggt tgacgatatc aagcaaagat gcaaggattc 1380 cacgacacca gaaaccataa tagcacaaat gaaagcgcta aaaatcaaac ataatgagac 1440 tccaaaagaa ttttgcgaca gcattgaaaa atgctgcgct aaattaaaaa caacatacat 1500 caaccaccaa attccacccg cagtagccga taaaatgtcg acgaaggttg gaattgacag 1560 ccttatttct ggcataaaca atccagaaat taaattaatt ttaaaagctg gaacatttac 1620 taatttggcg gaagccatca ccaaaattaa tgaaaacagt ctgcaacaaa ccaatcaagg 1680 gacgcagata ttgtccttca ctagcactgg agggcaagca agccgaggaa attttagaag 1740 aggatcaagg ggaggttaca gacaaagtaa caataatagg agttacaatc agaatcataa 1800 ctcctatacc cagaacagat acaacaataa tttccgctcc agaggtaatt ttcagcgata 1860 tcaccaagga tcaggaaata acttccgagg ccgacggggt aacaatcgtc acgtctttag 1920 tatccaatct ggccaaccac agccagaaat ggaacaactg cagcaattgt tacctagtat 1980 ggtcattggt ccagtaaatc accatcaacc acagggttta cacccacaag ctataccaca 2040 attacagcaa gcaacctacc aacaaccaaa tttttttggt caaattcaga gagggtacct 2100 aaattcctct ctatgaatct aaatgcatca aattatgtca cattaaaaac actaatgaca 2160 aattgcaaat gttccttcat tgttgacagt ggagctgaca tttcaatttt caaaagtgaa 2220 aagattttac caactcaaag gattgatgta aataaaacat ttaagataaa cggtataaca 2280 agtaaaacta tccagacgat agctgagact gaaacatatt tgaccacaga agaaaattta 2340 caacttgtac ataattttca aattgtacat agcgattttc caattcctac tgatggtata 2400 cttggaagag actttttagt caagtataag tgtacaatta attatgaata ttggcttcta 2460 aatatgacca taaataatca aacaatatca ttaccaatac aagataatac aaataacgca 2520 tttactatac ctgctagatg cgaagtcatt agaaacgtac cggatttcta agtaacagag 2580 gactctgtag tgttgtcaca ggaaatacaa ccaggtgttt tctgtgggaa cactatagta 2640 tcaccgacct caaaatgtat taaatttgta aatacaacag aaaatcaagt aattataaga 2700 aattttaaac caaaattaga tgcattaaaa aattatgata ggattactac aaatagtaat 2760 acgaatttga atgataacat aagattacat gaactatact ctcaaataag tttagagcat 2820 gtaccaattt ttgccaaaaa caaactcaca aacttgatca gaaagtatca agacgtgttt 2880 tgtttaccaa atgaacattt aacaataaac aacttttacg aacaaaacat acatttacaa 2940 aatcaatctg cagtttacat accaaactat aagcagatcc attctcaagc agatgagatc 3000 gaaagccaaa ttcaaaaaat gatatcggat gacataattg aaccgtcgat atctaacttt 3060 aattctccca ttctcctcgt accaaagaag tcggaggata acacaaagaa atggagatta 3120 gttgttgatt ttcgacaact aaataagaaa atactggcag ataaatttcc attgccacgt 3180 atagattcaa ttttagacca gctaggcagg gcaaagtact ttacgaccct tgaccttatg 3240 gctggatttc atcaaattcc attagcggag gaagctagaa agtatacagc attttcttcc 3300 cccgatggac attttcaatt taaaagatta ccattcgggt taaacatcag tccgaacagc 3360 tttcagcgaa tgatgaatat cgctttagct ggactgacac ccgaatgtgc atttgtttat 3420 attgatgaca tagttgtcat tggatgttca attgatcatc atttgaataa tctcgaacaa 3480 atcttcaaaa gattgaggca ttacaattta aaactcaacc cacaaaaatg taagtttttc 3540 aggaccgaag ttacttactt aggtcacaaa atcacggact ctggaatttt accggatgac 3600 tctaaatttc aaacaattcg agattatcca gaaccaacaa atatagacga agtgcgtcga 3660 tttgtagcgt tctgtaacta ttatcgcaaa ttcgttccga attttgcgca tatagcaaaa 3720 cctttgaata atttattgaa aaaaggtatg acattttcat ggaacgatga aagaagacga 3780 agtttccaat cactgaaaca tcacctcttg tcgcctagga ttcttcaata tcctgatttt 3840 tctaaagatt ttattttaac aacagatgcg tcagatgttg cttgtggagc aattctgtca 3900 caactgcatg agaacggcga tttaccaatt gcattcgcaa gcaagagttt tacaaaagga 3960 gaaaaatcaa aacctgtaat tgaaaaagaa ttaactgcca ttcattgggc aattgactac 4020 tttaaaccgt atctttatgg tcgaaggttt aaggttagaa cagaccacag accattggtt 4080 tatttgtttg gcatgaataa acccacttct aagctcacga gaatgaggct tgatttagaa 4140 gaatatgaat tcgacattga atatctccct ggaagagcaa atgttggagc agacgctctt 4200 tccagaatac caacaaaatc agaagacctc aagtctttag tacttgtggt taacacacgc 4260 tcaatgacaa aaaagcaaaa acagaacaat attcccagaa gattggaaga aaccagcaga 4320 gggattgatc acctcactgt ttggcgtact gaaaatccgt ctgaagttaa taaattattg 4380 aaaattagtt gtgtcgtgca ccacaatcaa ttgaatataa tggtaaataa tcataatttt 4440 aagaaaattt taggtacagt aacaataccc ttaacacata ataatggaag tcgaactcta 4500 gaattcgcac ttctagaaat ttccaaactt ttaaaaattt acaaaagaga caagatagcg 4560 ctatcggaag aagatatatt atttcaatat ttttccctgc aaacaataaa ggaaatcgca 4620 aatactgcca tttccagcta ccaaatcatt atttttaaac caccaaggtt tattcacaag 4680 caggaagaga ttagagaaat cttgacgcgt caccatacga cccctacggg aggtcatgtt 4740 ggccaacacc gtttatatct gaaattaaga gaaatttata aatggaaaaa tatgaaattt 4800 gacattaaaa atttcataaa agcttgtgaa aaatgtaaaa ttaataaaat acatcgacac 4860 acaaaagaag cgcaagttgt gacgacaact ccttctaagc cattcgaagt gctttctgca 4920 gacacagttg gtccttttac acgaacaaat catggaaata gatatatcct tactattcaa 4980 tgtaatttaa ccaagtatgt tattttgatt ccgataccaa caaaagaagc gaatgtgatc 5040 gcaaaagcac tcgtcaaaaa ctttatatta atatacggaa actttttgga actacgtaca 5100 gatcaaggaa ctgagtatcg taacgaagtt ctcgatcaaa tatgccaact tttgcaagtt 5160 aaacagacat tttcaacccc atatcatcca gaatcaatag gtgcattgga gaggaatcat 5220 agatgcctta atgagtacct acgatcattc accaatgcac accaaactga ttgggatgaa 5280 tggataaaat tttacgcttt tactttcaac acaactccac ataccgagca caactatacc 5340 ccatacgaac taatatttgg acgaaaagca acattaccac atgacttaga attaaacaga 5400 ccgactttag aacctatcta taatttggaa gaatattaca atgaattaaa atttaaactc 5460 caagttacga ataaaatagc acatcaaaat ctaataaaca aaaaggaagc aagacagatt 5520 acattgaaca ataatacaaa tcctattgat gttaaaattg gagaccatgt ttacattaca 5580 aacgaaaata gacgaaaatt agatcccttt tacataggtc cattcagaat aatagaagta 5640 caagacccaa attgtgtaat tgaaaacatt caaacttgca aacaaaacat tatacacaaa 5700 aatagaataa taaaagccta aagataaaaa cattcaagct aattataaaa acaaaaatat 5760 aaatattgta aggaaagtct taagagtcac atttgttttt ctgattttca aaattacttg 5820 ttgacaacaa tttacagaat gctttcactt tgttacacca ttcttctaaa gggggaagg 5879 // ID PERERE-6 repbase; DNA; INV; 4300 BP. XX AC BN000797; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni Perere-6 non-LTR retrotransposon (EST). XX KW CR1; Non-LTR Retrotransposon; Transposable Element; PERERE-6. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4300 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000797; Positions 1 4300. XX FH Key Location/Qualifiers FT CDS 258..1355 FT /product="PERERE-6_1p" FT /translation="MVGSSFKCGQPNCLFPVDEGMQCDECKKWYHKMCTRL FT SPAAYKRCSKPNSHWLCMFCCTNKTTLIQEAMSLLALACKKKDSGCANNAS FT TDSEDCVSVVSAVTKRAEQPTLINDEVKLPLQPTRSKSPHGIDMSNHVSLV FT EIEDPDKTVTVPNSPNAAVLNDQKWTAVKRKRKGKKANDKAHLVDKPLRNS FT QSESLNALSHSDRTADHHPSATERNLISSNIIVSKLPESNDPDPKVRHTHD FT LERLGEYIRSILPDNTKGVQVKKLVRIGKRADDSVLPRPCRLLKVILGSEK FT QRNLLLSNARSHDNSDIRMRPDVSLEDRIKRKTALAELETRRQNGEENLRL FT VGFRIVRSWKRMLPRPVWIGRSP" FT CDS 1799..4144 FT /product="PERERE-6_2p" FT /translation="MTTKCSINSFDSRLLETVMEHALVQHVSKPTRFGVNQ FT GSSLLDLVITHETEDIANLNILPPLVNSDHAVLSFTFRASDMIYDQIKPRP FT NIWRANIPEIQDCATKIDWSVDTSSSVDEAWSVFKGKFRSVTSRFIPYLVP FT RRPNNSPPWINKEVKNLLRCRKKHWNMFISTGLEQYRSSYRKIRNNYKALI FT SRTRRSYEKQLVRDCKHSPKRLFSYIKRRTQRSDGIPSLLIQENPLSLARN FT DTEKAEAFSEYFSKVFSSNNEERSPIHGDCGDLLMDPVVIKKETVLSLLQH FT LKPDKSSGPDDIHPRIMKALSDVIAEPLPMLFDMSLRQSRLPRDWKDAIIS FT PVYKTGGRDLVSNYRPVSLTSAVVKLMEKIIRMAVINYVERHDLLSKEQHG FT FRKGLSCLTNLLIAREDWAEAKDRNIPVDVIFIDLSKAFDKVSHSGLKLKL FT ESFGIHYAVIDWISDFLHERRQRVRVNGALSSWEPVKSGVPQGTILGPLLF FT LLYINELPAIAKSSVLLFADDIKIWRPIYSMSDRIVLQEDLNSLVAWMNGW FT SLEVNPNKSVVMQLNNYDDSYHYTLCGLVLPKVRNYKDLGVILSNDLKTTS FT HCKAAAAKGYRALWSIRRSFRYLDGEMFRLLYPTFVRPHLEYGIQAASPCF FT KYEADMLERVQRRGTKMVKGLSSLSYEDRLRHLNLFPLSYRRIRGDLILAY FT RILNDDLGTNMSYLFLPSRAEHLRGHSKKVQKPRSNRLRLEFRFSHRVVNY FT WNSLPEHVISAPSVDIFKTRLDLHSVTNCKD" XX SQ Sequence 4300 BP; 1343 A; 824 C; 918 G; 1214 T; 1 other; gaaacggcgg gcggcttttg gaaatgctct gacttggttt tcaaaaaggg cagtgacttt 60 catacttgtt gtgattatcg ttgcttttgt gtatttgctg tatcaaaagc attcttggtc 120 tactttaact taaatcaatc tagttaaggt ttgaaatttt attgaaaact ttgtctcttg 180 gtcccttcct ttgcatatac tctgacgctt gtgctattat tgtgatcata ttgtttttta 240 aacctaatct gattacaatg gttggtagtt cattcaaatg tggacagccg aactgtttat 300 tccccgtaga tgaagggatg cagtgcgacg aatgcaaaaa atggtaccac aagatgtgta 360 cccgtctcag tccagctgcg tacaaaagat gttctaagcc taactcgcac tggctttgta 420 tgttctgctg tactaacaag acgacattaa ttcaagaggc tatgagccta ttggctctag 480 cctgtaaaaa gaaagatagt gggtgcgcta ataacgcaag cactgacagt gaagactgtg 540 tcagtgttgt aagtgctgtt acaaaacgcg ctgaacaacc tactctaatt aatgacgaag 600 tgaaacttcc gttacaacca acgaggagta aatcaccaca tggtattgac atgagtaatc 660 atgtatcttt agtagaaatt gaggacccgg ataaaaccgt caccgtcccc aatagtccca 720 atgcagctgt cctgaatgac caaaaatgga ccgcagtaaa gaggaaaagg aaaggaaaaa 780 aagctaatga taaagcacac ttggttgaca agccattgag gaactcacag tctgagtcac 840 tcaatgcatt atcgcactct gatagaacgg cagatcatca tcctagtgcg actgagagga 900 atttaatatc atcgaatatt attgtgagta aattgccaga gtctaacgat ccagatccta 960 aggttagaca cacccatgac ttagagaggc tcggagaata tatccgtagc atattaccag 1020 ataacactaa aggtgttcag gtcaaaaaat tagttagaat aggtaaaaga gccgatgaca 1080 gtgttctacc ccgaccgtgt agactcctta aggtaatctt agggtcggag aaacaacgta 1140 atcttctatt aagtaacgct cgaagtcacg acaactctga cattcgaatg cgaccggatg 1200 tatctcttga agatagaata aaaaggaaga cggcactagc tgaactagaa acccgtcgac 1260 aaaacggtga agaaaatctc cgattggtgg gttttcgaat agtcaggtct tggaagcgaa 1320 tgctaccaag gcctgtgtgg ataggtcgtt ccccctagga gttttatatg ccaatgcccg 1380 tagtctaaaa cataaattct acgagctagg aacactggtg gacaaattaa ggccactcat 1440 tgtagctgtg acggaaactt ggttagtcca tgacttcgat atcaccccag aattatccgg 1500 atatcattgt ttaagaagtg atagacatag atgcaggaaa ggaggtggcg tccttctata 1560 catagccaat agtataaata tccgctcctc cgtttgcgaa tcccatgata gtggaactag 1620 tgaagccatt agctgtgaac tcactgtcgg atgttgtaca tttgcactcg gtgtcatcta 1680 tcgcagccca atctgcttag ctgatgactt tattttagag cacattcggc tatggagtgc 1740 aaataacaga tgcttgatat taggagactt taatgcacct gacattagtt ggactgagat 1800 gaccacgaaa tgctctataa actcatttga tagcaggctc ttggaaacag taatggaaca 1860 tgcactagta caacatgtat ccaaacctac acgttttggt gtaaaccaag gttcttctct 1920 gttggacttg gtaattactc atgagactga ggatattgcc aacctaaata ttcttccccc 1980 gttagtcaat agcgatcacg ctgttttatc attcacgttt agagctagcg atatgatata 2040 cgatcagatt aagcctcgcc caaacatatg gagagctaac ataccagaaa ttcaggattg 2100 tgctactaag atagattggt cagtagatac tagctcatca gttgatgagg cgtggtctgt 2160 atttaaaggc aagtttaggt cagtcacgtc ccggttcata ccatacttgg tgccacgtag 2220 accgaacaat agtccaccgt ggataaataa agaagttaag aacctcctta ggtgtagaaa 2280 aaaacactgg aacatgttca tctctactgg cttagaacag tacagatcta gttatcgtaa 2340 gattagaaat aattataaag cgttaattag tagaactaga cggtcatacg aaaaacaatt 2400 ggttagggat tgcaaacata gtccaaaacg gttattctcg tatataaaga ggcgaactca 2460 gagaagtgat ggaataccat cacttttgat acaagaaaat ccgttaagtt tggcaagaaa 2520 tgataccgaa aaagcggaag ctttttcgga atatttcagt aaagtttttt cttccaataa 2580 tgaggaacga tcacctattc atggtgattg tggcgacttg ttgatggacc ctgtagttat 2640 taagaaagaa actgttttaa gcttactcca gcatctcaaa cctgataagt ctagtggtcc 2700 tgatgatatt catcctcgga ttatgaaagc cctctcagat gttattgctg aacccttacc 2760 gatgctgttc gatatgtccc tacggcagtc cagattgccc agagactgga aggacgcaat 2820 aatcagtccg gtgtataaga ctgggggtag ggatttagtt agtaactatc gacccgtcag 2880 cttaactagt gcagttgtaa aactaatgga aaaaattatt cggatggctg ttataaacta 2940 cgtggaaaga catgatcttt tgtcaaagga acaacatggt tttcgaaaag gtctatcatg 3000 tttgacaaat cttctcattg caagagaaga ttgggctgag gcaaaagatc gcaatattcc 3060 tgtagatgtc attttcatag atctaagtaa agcctttgat aaggtttccc attctggtct 3120 taaattgaaa ctagaaagtt ttggaatcca ttatgcagtc atagactgga taagtgactt 3180 tcttcatgaa aggagacaaa gggtaagagt aaatggggct ctctcatcgt gggaaccagt 3240 aaaaagtggc gtcccccaag gcacgatttt aggtcctctt ctctttttac tttatataaa 3300 tgaattacca gctatagcta agtcatctgt cctactattc gccgatgaca ttaagatttg 3360 gagacccata tatagtatgt cagacagaat agttttacag gaagatctta actcactagt 3420 tgcatggatg aatgggtggt cactagaagt aaaccctaat aaaagtgttg tgatgcaatt 3480 aaataactat gatgattcgt atcactacac attatgtgga ctagtgttgc ccaaagtaag 3540 aaactataaa gatttaggag tcatactaag taatgatctc aaaacaacca gtcactgtaa 3600 ggctgctgct gcaaaaggct atagggcatt atggtctatt cgtagatctt tccggtatct 3660 ggatggggaa atgtttagat tattataccc aactttcgta cggccacatt tggaatatgg 3720 gattcaagca gcgagtcctt gtttcaaata tgaggcggat atgttggaac gagtccaacg 3780 ccgcgggaca aaaatggtaa aaggcttatc tagcctatct tatgaagaca gactgagaca 3840 cctcaactta tttccgttat cttaccggcg aatacggggt gatctaatat tggcatatcg 3900 aatcctgaac gatgaccttg gtactaacat gtcctatctt ttcctcccgt ctagggctga 3960 gcatttgaga ggacattcaa agaaagtcca aaaaccgaga tcaaatcgtc tacgtctgga 4020 gtttcgtttc tcgcaccgag tagtgaatta ctggaattct ctaccagaac acgtaatatc 4080 agcaccttct gttgatatat tcaagacgag attggacctc cacagtgtaa caaactgcaa 4140 ggattaaaat aggtcactag accttctatc cttattactg aagactgaag actgaagact 4200 gatatcattt cagagagatt atatatcgtt ctgctcaaaa cccctataaa ttggcaatga 4260 ccgtactggg cctagatnaa attcttttag aaagtttcgc 4300 // ID Jockey-17_AAe repbase; DNA; INV; 4005 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-17_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4005 RA Jurka J.; RT "Non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1429-1429 (2011). XX DR [2] (Consensus) XX CC >98% identical to consensus. XX FH Key Location/Qualifiers FT CDS 181..1122 FT /product="Jockey-17_AAe_1p" FT /translation="MNLVRDLNQLIAKGLQAELRLLTDGVKITVPSTPHYK FT SVTEYLDVVKAEYFTHDIASEKPLKVVLRGLPDMETSELMDTLKIANLQLV FT QIFKMRRHNQAVKYRDQLYLLHLVKGSTNLTELKSIRALGNVVVQWERYKP FT VHREVTQCGNCLNFGHGSKHCHMRSRCSKCGENHRSASCEVEAVVACLNCG FT QNHSSMSRTCPKRNEFINIRKTVAQRTRPKPKKEINIIGENFPELSLPSQT FT QRHPPPPVSPPGFRTYAEAANTPPSDAPYSMDQLALLFSELDKQTRACRTN FT DQQVAVMMRFYYQHRSILSSTA" FT CDS 1334..3661 FT /product="Jockey-17_AAe_2p" FT /translation="MLPDFRLSITEAVGIELTSSNGPIILIAAYCPIQCKD FT ADGTTTQLKNDIQILTRRPSKFILAGDLNARHSIWGNIQSNKNGSVLANDA FT QAGHYTIVHSESPTYFSPAGVGSTLDIILTNIPDNVTTPQALTELSSDHLP FT VIFEVNTSITLREPQRRRNYHRADWPRFQQLVEESLDDHHTLNTTEDIDRA FT IQSITDAIGEAEDRFIPATTISREFLQLDSVTKKIIAVRNSIRRQFQRTHN FT LSKKILCKKLNKIIASRVQQIRNNKFSRDIENLPNHSRPFWRLTKVLKSKP FT TPIPPIVDGNVKYITPIEKANTISRHFMASHRLGQGIASPMEEAVSDSKTR FT LTETPNNFPANRKVTTEELKSAIRFSRNMKAPGFDGLFNIVLKHLGPNAHS FT LIVEIFNSCLELSYFPSAWKLSKVVPIHKPGKDPTLASSYRPISLLSSLSK FT LFEKCIYSRLLEHAEDNNILLEEQFGFRRGRSTIHQLQRVTNLVRRNKAVS FT KSTAMAFLDIEKAFDNVWHDALTHKLLRYNFPTYLVKIIANYLSERTSQVC FT IGSSSSQPYIVNAGVPQGSILGPILYNLFTSDIPALPGNGTLSLFADDSAI FT SYEGRVIRALVSKLQKGIDEYTTYLKTWKICVNGAKTQTIIFPHKNSYRMV FT PATNIQVEGAEIHWSDAVSYLGLIMDSKMLFRQHIDERVTKSTILLKRLYP FT IINRRSKASLTNKLAVYKMIISPMLEHASSIWRGCARTHLQKLQVVQNNFL FT RMILNRPRRTRTAELHRLANIEPLL" XX SQ Sequence 4005 BP; 1273 A; 1061 C; 817 G; 854 T; 0 other; ccttcataag atggagtcat tgtgcctgtc tactttctcc ggctcccaaa caccaccgct 60 gcggattcgg tggaacaagc caacaagaag ctactcagct ccaacaaata cgccactctg 120 tcttccgatg gagtccagaa gaaggagaag ttgccgccct tctatgtcaa gggaaactcg 180 atgaatctcg taagggattt aaatcagctc atcgccaaag ggctccaagc ggaactgcgc 240 ctccttacgg atggcgtgaa aattacggtc ccatcaactc cgcactacaa atcggtgacc 300 gagtacctcg acgtggtgaa agcggaatac ttcacccacg acatcgcctc tgaaaagccg 360 cttaaagtgg ttttgcgagg ccttcccgac atggaaacca gtgagctgat ggacactctg 420 aagattgcta atctgcaact ggtgcagata ttcaagatgc ggcgccacaa ccaggccgtc 480 aaataccgcg accaattgta cctgctccat ctggtcaaag gttcaacaaa cctaacggaa 540 ctgaaaagta tcagagctct gggcaacgtc gtcgtgcaat gggagcggta caaaccggtc 600 cacagagaag tgactcaatg cggaaattgc ctgaatttcg ggcacggatc gaaacactgc 660 cacatgagaa gccgttgctc caagtgtggt gaaaatcatc ggtcggcttc ctgtgaagtg 720 gaagctgttg ttgcctgcct gaactgcggg caaaaccact cctcgatgag ccgtacttgc 780 cccaaacgga acgaattcat caacatccgc aaaacggtgg cccaaaggac ccgcccaaag 840 ccaaagaagg aaatcaacat cattggtgaa aatttcccgg agctatcgct tccatcccaa 900 acccaacggc acccaccacc tccggtttcc ccacctggtt ttcgcacata tgcggaagcg 960 gcaaacacac ccccaagcga cgcaccatac tccatggacc aattggcgct cctgttttcc 1020 gaattggaca agcaaacaag ggcatgccgt accaacgatc aacaagtcgc agtaatgatg 1080 cgattctatt accagcatcg tagcattctc tcttcgactg cttaacttca atgcctcttc 1140 ggtggcaaat aaacaggtgg aaatcattga atttctccga acacacgaaa tcgacgtcgc 1200 tctcataacg gagacgcacc taaagccaaa caaaaacttc agcctaccag atcatcaagt 1260 cattcgcctg gaccgtatcg ctgcaagaaa aggtggtgta gctatagcca tccgacgtaa 1320 cctcaagttc cgaatgcttc cggatttccg ccttagcatc accgaagctg tgggcatcga 1380 actgacctcc tccaatggac ctatcatcct gattgcggcg tactgtccta tacagtgcaa 1440 ggacgcagac ggaacaacaa ctcagctcaa aaacgacatc caaatattaa ctcgaaggcc 1500 cagtaaattc atcctcgccg gtgaccttaa cgctcgccac tcaatctggg gaaacatcca 1560 gagcaacaaa aacggatccg tgctagccaa cgatgctcaa gcagggcatt atacaattgt 1620 acactcggag tcgcccacgt acttctcccc agccggagtg ggttcaactc tagatattat 1680 cctcaccaac atccccgaca atgtaacgac acctcaagcc ctcaccgaac tgtcatcgga 1740 ccacttgccg gtaatcttcg aggtgaacac ttcaatcacc cttcgtgaac cccagcgaag 1800 aaggaactac catcgcgccg actggccgcg attccaacaa ctagttgaag aaagcctgga 1860 cgaccatcac acgctaaaca caactgagga catcgataga gcgatccaat cgattaccga 1920 cgcaatcgga gaggccgagg atagattcat cccagcgact acaatatcac gtgagttttt 1980 acaactggat agtgtcacta aaaaaattat tgcggttagg aacagcataa ggagacaatt 2040 ccaacgcact cataatcttt ctaaaaaaat actatgcaag aaattgaata agataatagc 2100 tagtagagtc caacagatta gaaataataa gtttagtagg gatattgaaa atttaccaaa 2160 ccactccagg cccttctggc gtttaacaaa ggtgctaaaa tcaaaaccga cacctatccc 2220 accaatagtt gacggcaatg ttaaatacat tacaccaatt gaaaaggcaa atacgatatc 2280 acgccatttc atggcatctc atagactagg tcaaggcatc gctagcccga tggaagaggc 2340 agtaagtgac agtaagactc gcttaactga aacaccaaat aatttcccag ccaaccgaaa 2400 ggttacaaca gaagaactaa agtccgccat tagattctcc cgtaacatga aagcccctgg 2460 ctttgatggt ctattcaata ttgtcctcaa gcatttgggt cccaacgcgc actccctcat 2520 tgtggaaata ttcaatagct gcctggagct cagctacttt ccgtccgctt ggaagctgtc 2580 caaagttgtg ccgatccaca aaccagggaa agaccccaca ctcgcatcga gctaccgccc 2640 aatcagcttg ctttcgtcat taagtaagtt gtttgaaaag tgtatctata gcaggctcct 2700 cgagcatgca gaagacaaca atattctcct cgaagaacaa ttcgggttca gacgggggcg 2760 ttctacgatc caccaactcc aacgagtcac caaccttgtg aggcggaaca aagcggtatc 2820 caagtcgacg gcgatggctt ttttggatat tgaaaaggcg ttcgataacg tgtggcacga 2880 tgccctcaca cacaaactcc tccgctacaa ctttcccacc tacctggtaa agattatagc 2940 aaactacctc agcgaaagaa cgtcgcaagt gtgcatagga agctcctctt cgcaaccata 3000 cattgtaaat gcaggggtcc cccaaggtag tatcttgggg ccaatactgt acaatctctt 3060 cacgtcggat atccccgccc tcccaggaaa tggcactctc tcacttttcg cagacgactc 3120 tgcgataagc tacgagggac gagtcataag agcactggtg tcaaagctcc aaaaaggcat 3180 cgacgaatac accacgtatc tcaaaacgtg gaaaatttgc gtaaacggag caaaaaccca 3240 aaccatcatt ttccctcaca aaaatagtta ccggatggta ccagcaacaa acatccaagt 3300 ggaaggagca gaaatacact ggtctgacgc tgtaagctac ctaggcctca tcatggatag 3360 caaaatgctt ttccgccaac acatcgatga aagagttacc aaaagcacaa tactgctcaa 3420 acgcctctac ccaatcatca accgtcgctc gaaggcctcc ctaacaaaca aacttgcggt 3480 ctacaagatg atcatatccc caatgctgga acatgcctcg tcaatctgga gggggtgcgc 3540 aaggacccac ttgcagaaat tgcaagtcgt ccaaaacaat ttcttgagga tgatactcaa 3600 ccgcccaagg cgcacgcgaa cagcagaact tcatcggcta gcaaatatcg aacctctact 3660 gtaagaataa acacacgggc tgaaaaaatc caaaccaggg cacttgaatc tgaatcagca 3720 acaattagaa acatatatgc ctgaaaattg caactaattc caaaacggga tgacaatagt 3780 ttataggatt aggttaagtt aaattgtaag gaaaaaaaaa aactttttta ggaattaacc 3840 aagtggttta actgtaaact cgcggaaagc tatgtaaatt aaactgaaat tattcaaaac 3900 agacgaaact aagcagaaaa acttgtaaag ttgaaactct tacattgtaa aattactgta 3960 ccaacaaaca aaaactagaa ataaaaacaa ttgtaataat aataa 4005 // ID CR1-70_AAe repbase; DNA; INV; 4115 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-70_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4115 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1158-1158 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 25 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 188..961 FT /product="CR1-70_AAe_1p" FT /translation="MASCCFACSNDLVQEEEIVCKGFCKSTFHLRCVHLSP FT EIRTVIQSNSQLYWMCSACTKMMKNATFRQAIASTNEAMLHLEVQQNKVLE FT ELRSEIKQNTAKINSILSLTPRTPKRVGEFHFSSQLAKRNRTDGSSSVNRL FT STRDEQVGSKDDDPSIVVPLASQVQKCWLYLSQFDPKATTDDIVRLVKRNL FT ETESDVQVVKLVPKGRRIEDLSFVSFKVGIEMDLREKALLATSWQKGIYFR FT EFEFSRSARPDVFRFEP" FT CDS 942..4031 FT /product="CR1-70_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSFVSNLSSSHSFAPISPTLHSPIPSPARKPVRKNDY FT NVHSVDEIRIYYQNMNSIRSSGRLQSVFLSSVENDYDVYVFCETNLDASIT FT NGMIFHESFVTYRCDRSPLNADKPSGGGVLISVRATLPSCIINESSCNEEV FT SVRISIGKTDIVLCCIYLPPDASLARYSSHCDSVRKLVDTASPESKVIVIG FT DFNLKNTIWYRENENDLHLLPLQVRTPEETAIVDELTACALVQVCGIPNQN FT NRFLDLTFTNDDECCQMDLCEPLLHNETHHNAMSLTVNYNKLTYDESALHS FT QPTRXKLDLKRCDYADIVQELLTVNWDDIFIEYETFDAAQLNASIDSMVSY FT LRRFEMFDELGNGVHRVNNAVDFNVLAFYATIYEVFSRHCRVVNSKRGGPV FT YPEWFDSETVQLLKIKNKLHNKFRRSKSIRDEDAYRQALREFKTKQRSQYA FT AYMDELQFNIQANPKVFWKYVSKKRRDTGIPVNVTYKSTNADCIERAAELF FT AEYFNAVYSERNEIQVPVYPRAEERLPEIALSLDEVCLKMQTLDVSKASGP FT DGIPNICIKSCVEAFALPLHKLFNLSLSCGYFPALWKISHIVPVHKKGSKL FT PVENYRGIAILSAIPKMFESLIYDQLYSWIEPRISSCQHGFLKKRSTTTNL FT AEFVSRTSQWMMDGLQVDTIYTDMSKAFDVININAILSVLGKHGIEDRILS FT WLRTYLENRTQYVKLHQTRSKSFSVDSGVPQGSHLGPLLFILVMNELPSYL FT DGVYILIYADDVKIFIPISCLKDCRNLQRNVELFSNFCMQFGLKVNVSKCN FT VVSFSRKPNTIHYDYRMSADVIHRSFCVNDLGVDLDSELSFTKHIDKVIAK FT ANAMFGMIRRLGQEFDDPYTIISLYVGLVRSIIEYAGIVWQPYYETHKTRI FT ESVQKKFVRFALRNLGWRDVLPDYKSLCTLVGIDSLEIRRKTADALFLKDI FT IDGRYNSQYLAGQISLYEGRTGLRLTRPFQIPNRVQNYARNEPIYRMMSFY FT NANRDCLNVEMSKEQIKYVIRFNF" XX SQ Sequence 4115 BP; 1208 A; 826 C; 882 G; 1198 T; 1 other; gtacctaatt gcgatccaat aatttcgtag ttttcgacgc gtttggcatc gtatttattt 60 tgtacttcgt tcagaatcga cgtggaatta cctcgtgtat ttgaaaatcg tggagttcgt 120 ttgagtttcg aacaatttct cgagatttta cgttctgaaa tccttaccgt atagtttcac 180 cacaacaatg gcttcctgtt gttttgcatg cagtaacgat ctcgtccaag aagaagagat 240 cgtctgtaaa ggtttctgca agtcaacctt ccatctccga tgtgtacacc tcagtccgga 300 gattcgcaca gtgatccagt ctaactctca gctgtactgg atgtgctcgg cttgcacgaa 360 gatgatgaaa aacgcaacgt ttcgacaggc tattgcatcc acgaatgagg ctatgctaca 420 ccttgaagtg cagcaaaaca aagtgttgga agagctaaga tctgagatta agcaaaatac 480 agcgaaaatc aactcaatac ttagcctgac accacgcact cctaaacgcg taggtgaatt 540 ccatttttca tctcaacttg ccaaacggaa cagaaccgat ggtagtagct ctgtaaatag 600 gctatctaca cgtgatgaac aggtaggaag taaagatgat gatccgagca tagtggtgcc 660 cttggcaagc caagtgcaaa aatgttggtt atatctttcg caattcgatc ctaaagctac 720 taccgatgat attgttcgat tagtgaaacg taatttggag actgaatcag atgtgcaagt 780 ggtaaaactt gttccgaagg gacgaagaat tgaggatctt tcgttcgttt ccttcaaggt 840 gggaattgag atggatttga gagaaaaggc actgttggcc acgtcgtggc aaaaaggaat 900 atatttccga gaatttgaat tctcgcggtc ggctcgccct gatgtctttc gtttcgaacc 960 ttagctctag tcattcattt gcgccaatat caccgacttt acactctccg atcccgtctc 1020 ccgctcgaaa gcctgtaagg aaaaatgatt ataatgtcca ctctgtcgac gaaatacgca 1080 tatactacca gaacatgaac agtatccgta gtagcggccg actgcagtcc gttttcttgt 1140 cgtccgtcga gaatgactac gatgtttacg tattctgtga aacgaatttg gatgcgtcaa 1200 taacgaatgg aatgatattt catgaatcat ttgttaccta ccgatgtgat cgttcgccgt 1260 tgaatgccga taagccatcc ggagggggtg ttctcatctc cgtaagagcc actcttccga 1320 gttgtattat taatgagtca tcctgcaatg aagaagtctc agtaagaatc tccattggaa 1380 agaccgacat cgtcttgtgt tgcatttact tacctcccga tgcgagtcta gcacgatact 1440 cgtcacactg cgacagcgtt cgtaaactgg tcgatacagc aagccctgaa agcaaagtta 1500 ttgtaattgg tgactttaat ctgaaaaaca ccatttggta ccgtgagaac gaaaacgatc 1560 tgcatctctt accactgcaa gtccgcacac cggaggaaac tgctattgtt gatgaactga 1620 ctgcctgtgc actggttcaa gtgtgtggta ttccgaatca aaacaatcgt ttcctcgatc 1680 taacatttac gaacgacgat gaatgttgtc aaatggactt gtgtgagccc ctactccaca 1740 acgaaactca tcataatgct atgtcactca ccgttaacta caacaaactt acctatgatg 1800 aatctgcact gcacagtcaa ccaacacgac ktaaattgga cctcaagaga tgtgattacg 1860 ctgatattgt gcaagagctc ttaaccgtca attgggatga tattttcatt gaatacgaaa 1920 cgtttgacgc agcacaattg aatgcatcca tcgactcaat ggttagctac ttacgcaggt 1980 ttgaaatgtt tgatgaatta ggaaatggtg tacatcgtgt gaacaatgct gtcgacttta 2040 atgtgttagc attttatgct accatctatg aagttttttc aagacattgc cgtgtcgtta 2100 acagtaaaag aggtggccca gtctaccccg agtggttcga ctccgaaact gtacaactgt 2160 taaagattaa gaacaagctg cacaacaagt tccggcggag caaatcgatc cgtgatgagg 2220 atgcgtacag acaagcattg cgagagttta aaacgaagca acgttcccag tatgcggcat 2280 atatggatga attgcaattt aacattcagg ctaatccaaa agtgttttgg aaatatgtca 2340 gcaaaaaacg gcgtgatact ggaatcccag ttaatgttac gtacaaatcg acgaatgctg 2400 attgcatcga acgtgcagct gagttgtttg ccgaatactt caatgcagtt tactccgaac 2460 gaaatgagat tcaagtacct gtatatcccc gtgcggaaga gcggttacct gaaatagcac 2520 tgtctttgga cgaagtgtgt ttaaaaatgc aaacgttgga tgtttccaaa gctagcggtc 2580 ccgatggtat tccgaatatt tgtatcaaaa gttgcgtcga agcgttcgca cttcctttac 2640 acaaattgtt taatttgtct ctttcttgtg gctatttccc ggcgctgtgg aagatctcgc 2700 acattgttcc ggttcataaa aagggttcca agttgcctgt ggaaaattac cgtggaatag 2760 ccattctatc cgcaataccg aaaatgtttg aaagcctaat ctatgaccag ctttattcat 2820 ggattgaacc gcgtatttca tcatgtcaac atgggttctt gaaaaagcgc tctacgacca 2880 cgaatctggc tgaatttgtg tctcgaactt cccagtggat gatggatgga ctacaagtgg 2940 acacaatata tacagacatg tctaaagcgt tcgatgtcat taacattaat gccattctat 3000 cagttttggg aaaacacggc attgaagatc gaattctcag ctggcttcgt acttatttgg 3060 aaaatcggac ccaatacgtt aagctacacc aaacgagatc gaaatcattt tctgttgatt 3120 ccggggtgcc tcaaggtagc catttgggcc ctttgctctt catccttgtc atgaatgaat 3180 tgcctagtta cctcgatgga gtctacatcc tcatttacgc cgatgatgtc aaaatattca 3240 ttcctataag ttgtttgaag gattgtcgga atctacaacg aaatgttgag ctgttctcca 3300 acttttgtat gcagtttggt ctaaaggtca atgtttctaa atgcaacgtg gtgtctttct 3360 ctagaaagcc taatacaatc cactatgact atcgaatgag tgcagatgtc atacatcgat 3420 cgttttgtgt gaatgacttg ggcgtggacc tcgatagtga actatccttt actaaacata 3480 ttgacaaggt aatcgcaaaa gctaatgcca tgtttggaat gattagacgg cttggacagg 3540 aattcgatga tccgtacaca ataatttctc tctacgttgg attagttcgt tcgattattg 3600 agtatgctgg gatagtttgg cagccatatt acgaaacgca taagacaaga atcgaaagtg 3660 tacagaaaaa gtttgttagg tttgcactcc ggaaccttgg gtggagagac gttttgccgg 3720 attataaatc attatgcacg cttgttggaa ttgattcgct cgaaattaga agaaaaacgg 3780 ctgacgcttt gttcctaaag gatataattg atggtagata taattctcaa tacctagcag 3840 ggcagatttc attatatgag ggtagaacag gtttaagatt aacaaggccg tttcaaattc 3900 caaatagagt tcaaaactat gctaggaatg aacctatata tcgtatgatg tcgttttata 3960 atgcaaaccg tgattgttta aatgttgaaa tgtccaaaga acaaattaag tatgtaataa 4020 gatttaattt ttagttgtat tgacggaaga acattgtaat tagtttaaga taatgggcaa 4080 atgcctacag tcacaataaa tcaaatcaaa tcaaa 4115 // ID BEL-200_AA-I repbase; DNA; INV; 6505 BP. XX AC AAGE02032565; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-200_AA_; KW BEL-200_AA-LTR; BEL-200_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6505 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02032565; Positions 1672 8176. XX CC Positions [5461-6039] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1114..6408 FT /product="BEL-200_AA-I_1p" FT /translation="MPPITRKGSTLRQLRTQLEEVTASFDDIKIFIQRFDE FT HITASQVEVRLEKIDDLWEKFSETLVELKSHDDFEDEEKYYDKLRMNISEQ FT YYDGKSFLKDKAKSLHEPTDLEQSVREPSMLGVLDHVRLPQIKLQTFCGEI FT DDWLSFRDLFTSLIHWRTDLPEVEKFHYLKGCLQGEPKGLIDSLKITKANY FT QIAWDTLCKRYNNNKLLKKKQVQSLFQLPILTKESVAELRTLVEGFDRIVQ FT TLDQIVEPADYKNLLLVNLLTMRLDPYTRRAWEECAASKDQDTLEDITEFL FT RRRIQVLESLPPRPADPRFHQQPPSSKPKPPAVKASYNSVQSIPGRCVACK FT DSHLLYQCSAFQQLPVADRDSLLKSNGLCRNCFRFGHQARDCQSKFSCRIC FT RKRHHTLLCFRAEKDNPTTVVSVAGGNHPSTSQDSHSSMGPNSTQVANMAA FT TNVSVCNVAQGVSSTVLLATAVVVVEDDIGVRFNARALLDSGSESNFVSER FT LTQRMKVSRNKVDVSVLGIGQGTTKVKHSVQMVLRSRVSDFSREMAFLVLP FT KVTTNLPTTSINTEGWKIPDGIQLADPAFFQSKEVDIVLGIEAFFEFFETG FT RRIPIGNHLPTLTESVFGWVVCGGCSTSKVSQQINCNFSATTELERLIERF FT WSCEQVEFANFYSPEEKMCEEMYQREVQRGSDGRYTVSLPKNESTLRNLGE FT SWNIAYRRLLGTERRLAKDEQLRKQYVTFMEEYLNLGHMRKVEHIEDDSVK FT RCYLPHHPVLKEASTTTKVRVVFDASCKTSSGVSVNDALLVGPVVQEDLRS FT IILRSRTKQILVVSDVEKMFRQILITPEDRPLQSILYRFSPNEDVAVYELN FT TITYGTKPAPFLATRTLQQLATDEELKFPLAARAIREDVYMDDVITGTDDV FT GIAIRMRNQLSDIMECGGFKLRKWSCNIPAVLDGLPAENLAIPDGNGVNLD FT PDQAVKTLGLTWMPLTDTLRFQFEIPPPTLDIPFTKRRILSIIATLFDPLG FT LIGATTVSAKIFMQQLWTLQDANGDRLDWDLPVPPTVGEDWRKFHEALPLL FT NQVRVNRCVIIPNSTSIELHCFSDASEKAYGACVYVRSQDMKGIVQVHLLS FT SRSKVAPLRTQSIPRLELCGALLASQLYEKVKASTRLSVPTYFWTDSTCVL FT RWIAASPATWTTFVANRVAKIQTITDEHNWRHVAGVDNPADLISRGISPED FT IINSKLWWEGPNWLKAEKNTWPGVAIQDNEPGEEERRRTAVACTMSPVAEF FT NDYYLNKFGSFADLIRRTAYWLRLMKLLQTPREKRSGNGFLTTAELSEAKH FT AIIRNVQREVFAEEWEALRNKDNVSKRSALRWFNPYIDKSGLIRIGGRLKH FT SREPEGAKHPIVLPARHQLTRLIVRHFHERLLHAGPQFLLGVVRLQYWPLG FT GRKVAREIVHRCLKCYRIKPSTVEQFMGELPSPRVTASRPFTKTGVDYFGP FT VYIRPAPRRSAVKAYVSVFICLCTKAVHLELVSDLSTERFLQALRRFIARR FT GRPSDIYSDNGTNFVGANNKLLELFRLLRDPVHRERITTELADNGIQWHFN FT PPSAPHFGGLWEAAVRSAKTHLLKVIGESIISSEDFSTLLAQVESCLNSRP FT LVPMSDDPDDLEPLTPAHFLVGSSLQALPDTELLNIPSNRLNQYQQIQQKL FT QHFWTRWRREYLNQLQARTKRWKPAIPVEIGKLVIVKDENVPPIRWKMGRI FT IAVHPGKDDVIRVVTVKTATGEVKRPVERICILPIPINCEEDKAS" XX SQ Sequence 6505 BP; 1815 A; 1498 C; 1522 G; 1670 T; 0 other; tttggtcctt cgaaccggat gccgaatccg gggaccggtt gaattgaagc tcgccattgt 60 tgatcggtca actaacgcgc ggatatcgcc atcgtactca tcgggaatcg ccatcataag 120 gaaaatcaca atcgactaat cgccatcaaa cccaagacga aaatcgccat tcaccaagga 180 taaaaggata gcagctgcat tgtgtacatt cgttgcgcca aaaccaccat cattttgtgt 240 ttgggctgat ggaagcctga atcctgtttt gaagttggac taggagttgg atttggacat 300 cttgcgctgg aataccgaac aggccggagt atcagtttgc ccaggtaaat actcattagt 360 tcggtatatg tcttgccgaa gcatatttgc atgttactaa tgccaacatt tcgatgcatg 420 ctgttggatt cattgcatga gaggactcga caaccattgt gagttagttc agcgttcaac 480 cgtggatgtt tagactggtt cagcttgtgc gattattgga caaccggtgg accaacgggt 540 accgatctcg tcacccagat ctagtggaag cctttagtgg acggttgctg tgcaggatca 600 aggaagaggg aacgccatct tactcccttc taggtgagtg aacatttagt atgtgtactg 660 ccgaagcaag gcgtacggat aatacacgtt ttgattggag tttttgtgaa tcctcttcat 720 ttactgccac gcaagccctg atattgaact gctcgttaga gcaagtctgc tgaaaccaac 780 ccaaattgca ttctcgactg ttggatctac tttgcggagt tgttagtcct ccaggtaggt 840 tgccacagta tatgcctggc cgaagccaag gttctcctgg atattacacc ccatcttctc 900 ctcttctttc aaccccaacg tgaattgagc ccatgccgta acgagattga agtgacggcc 960 ctgtattctc gaatcgtttt gagtatttgg atcattagaa cagttacctg ttctctcagg 1020 tcagtgaatt tggacagtat atgtccgccg aagcaagttt ttacgcggca tactacacca 1080 tctcttcttc taccatccga ttgacgtcac accatgccac ctatcaccag gaaaggatcc 1140 accctgaggc agttgaggac tcaactggaa gaagttacgg cttcctttga cgatatcaag 1200 atttttattc agaggttcga tgagcatatc acagcttctc aagttgaagt tcgtttagag 1260 aaaatcgacg atttgtggga gaaattcagt gagacattag ttgaactgaa atctcatgac 1320 gattttgaag atgaggagaa gtactacgat aaactacgaa tgaatatcag cgaacaatat 1380 tacgatggga agtcctttct taaggacaag gccaaatcac tccatgagcc aacggatttg 1440 gagcaatcag tccgtgaacc ttcaatgttg ggcgttctcg accatgttcg gttaccgcaa 1500 atcaaactgc aaacgttttg tggtgaaatt gatgactggt tgagcttccg ggacttgttc 1560 acctctttga ttcattggag aaccgacttg ccagaggtgg agaagttcca ttatctaaag 1620 ggatgtcttc aaggcgagcc gaaaggtctg atcgactcgt tgaagatcac gaaggcaaat 1680 taccaaatcg cttgggacac tctgtgcaag cggtataaca acaataaatt gctgaagaag 1740 aagcaggttc agtccttgtt ccagcttccg atcctaacca aggaatccgt ggccgaatta 1800 cgaactttag tggaaggttt cgatcgaatc gtccaaacgc tcgatcaaat cgtggaacca 1860 gcggattata aaaacctttt gctggtaaat ctccttacaa tgcgactaga tccatacacc 1920 cgtcgtgctt gggaagaatg tgcagcttcc aaggaccagg atactctgga ggatattacc 1980 gaattccttc gaaggcgaat tcaagttctg gaatcccttc cgccaaggcc ggcggaccca 2040 aggtttcacc aacaaccacc ctcgtcaaaa cccaaaccac cagcagtgaa agccagctat 2100 aattcggtcc aatcaatccc aggaagatgt gtggcttgca aggatagcca ccttctatac 2160 caatgttccg cattccaaca attacccgtg gcggacagag attcactgct gaaatcaaat 2220 ggactttgcc gaaattgctt tcgctttgga catcaggcaa gagattgtca atccaagttc 2280 tcttgtcgga tttgcagaaa gcgtcaccac actctgttat gcttcagagc ggagaaggac 2340 aatcctacaa cggtagtatc ggttgctggg ggcaaccacc cttcaacctc ccaagattcc 2400 catagttcta tgggtcccaa ttctacccaa gtggccaata tggcggccac gaacgtttca 2460 gtatgcaatg ttgctcaagg ggtttcatca acggttttgc tagcaacagc ggtagtcgtt 2520 gtcgaggatg acatcggcgt caggttcaat gctcgagccc tattggactc cgggtcggag 2580 agcaactttg tgtcggaacg attaacccaa cggatgaagg tctccaggaa taaggtagac 2640 gtttcggttc tcggcatagg gcagggaacc accaaggtga agcattcggt gcaaatggtt 2700 ctacggtcac gagtttcgga tttctcgcga gaaatggcgt ttctggtatt acccaaggta 2760 actaccaatc tacccacaac ctcaatcaat acggaagggt ggaaaatacc agacggaatc 2820 caattggcgg acccagcctt tttccaatcc aaggaagtcg acattgttct tggtatcgaa 2880 gcgttttttg agtttttcga gacaggaaga agaattccaa tcggaaatca cctaccaacc 2940 ctcacggaat cggtattcgg ttgggtagtc tgcggtggct gctcaacttc gaaagtatcc 3000 caacaaatca actgcaattt ctcagcaacc acggagctgg aaaggttgat agaacgcttc 3060 tggtcatgcg aacaggtgga gttcgctaat ttctactccc ctgaggagaa aatgtgtgaa 3120 gaaatgtacc aacgagaagt tcaacgtggt tcagacggtc gatacactgt ttctttacct 3180 aaaaatgaga gtactcttcg caatctgggt gaatcgtgga acattgccta ccggcgtctt 3240 cttggaacgg aacggagatt ggcaaaggac gagcaattgc ggaaacaata cgttacattt 3300 atggaggaat acctaaacct aggacatatg cggaaggtgg aacacattga agatgactcc 3360 gtaaaacggt gttatctacc ccatcacccg gtgctgaagg aagccagcac caccactaag 3420 gttagagttg tcttcgacgc ttcatgcaaa acgtcttctg gagtgtccgt gaatgatgcg 3480 cttttggtgg gaccagttgt gcaagaagat ttaaggtcaa tcatcctacg aagtcggacg 3540 aaacagattc ttgtggtatc cgacgttgag aaaatgttcc gacaaatact gattacgcca 3600 gaggaccgac ctttacagtc aatactctat cgattttctc ctaacgagga tgttgcggtg 3660 tatgagttaa ataccattac atatgggacg aaaccagcac ctttcttagc taccaggacc 3720 cttcagcagt tggcaactga tgaagaattg aaatttccgc tcgcagcaag agcaatccgt 3780 gaagatgtat acatggatga cgtcatcacc ggtacggacg atgttggtat tgccatccgc 3840 atgagaaacc agctatcaga tataatggag tgcggcggtt tcaagctcag aaagtggtcc 3900 tgcaatattc cagcagtatt ggatggtttg ccggcagaga acctggccat accggatggt 3960 aatggagtca acttggatcc cgatcaagca gtaaaaaccc taggactaac ctggatgccc 4020 ttgactgaca ccctacgatt ccagtttgaa attcctccac ccacactgga cattccattc 4080 acgaaacgcc gtatcttatc aataatagct accctgttcg atccccttgg tcttattgga 4140 gctactacag tatctgctaa gatattcatg caacaactct ggacgttgca agatgcgaac 4200 ggtgatcggt tggactggga tctcccggta cctccaacgg tgggtgagga ttggcggaaa 4260 tttcacgaag cacttccact tctaaatcaa gtccgtgtga accgttgtgt catcattccc 4320 aattccactt ctatagagct acactgcttt tcggacgcgt cagagaaggc ttatggtgca 4380 tgcgtatacg tcagaagtca agacatgaag ggaattgttc aggtgcatct gttatcatcc 4440 cgatcgaagg tagccccact cagaactcag tcgatcccac ggcttgaact atgcggtgct 4500 cttcttgcgt cacaacttta tgaaaaggtt aaggcttcta caagattatc cgttccaact 4560 tacttttgga cagattcaac gtgtgtctta cgttggattg cagcttcgcc cgcaacatgg 4620 actacatttg tagccaacag agtagcgaag attcaaacca taaccgatga acataattgg 4680 agacatgtcg ctggagtaga caatcctgcg gatctgatat cccgtggaat atcacctgag 4740 gacatcatca atagtaaact gtggtgggaa ggtccgaatt ggctgaaggc agagaaaaat 4800 acgtggcctg gtgtagcaat acaagataac gaaccaggag aagaagagag acgtcgcaca 4860 gcagttgcgt gcacgatgtc acccgttgcc gaattcaacg actactatct caataaattt 4920 ggatcatttg ctgatctcat tcggcgtact gcttattggc taagactcat gaagctcttg 4980 caaactccaa gagagaagag atcaggaaac ggtttcttaa cgacggccga gttgagtgaa 5040 gccaagcatg ctattattcg aaacgttcag cgagaagtat ttgccgaaga atgggaggcc 5100 ttaaggaata aggataacgt atctaagagg tcagcactga gatggtttaa cccatacatc 5160 gataagagtg gattgattag aattggagga agattgaagc attcgaggga accggaaggc 5220 gcaaaacatc caatagtatt gccagcacga catcagctaa ctcgattgat tgttaggcat 5280 tttcatgaaa ggcttttgca tgctggtcca cagtttcttc taggagtagt tcgactgcaa 5340 tactggccat tgggaggacg gaaggtcgcc agagagatag tacatcgatg tttgaaatgc 5400 taccgaatta aaccgtctac ggtagaacaa ttcatgggag aactaccgtc cccacgtgtt 5460 actgcttcca gaccattcac caaaacagga gtagattatt tcggacctgt ctacattcga 5520 ccagccccac ggcgttctgc agtaaaggct tacgtttcag tgttcatttg cttgtgtaca 5580 aaggcggtac acttggagct cgtctccgat ttatcaacgg aacgattctt gcaagccttg 5640 cgtcgattta tagcacgccg tggaaggcct tccgacattt actcagataa cggtaccaac 5700 tttgtcggtg ccaacaacaa acttttagaa ctcttccgtt tactaaggga ccccgtacac 5760 cgtgaacgaa ttacaactga acttgccgac aatggaatcc agtggcactt caatccccca 5820 agtgcaccac attttggggg actctgggag gcagctgttc ggtcggccaa aacgcatctg 5880 ctaaaggtga tcggagaaag tatcatttcg tcagaagact tttctacatt actagcacaa 5940 gtcgaatcct gcctcaactc tcgtccttta gttccaatgt cggacgatcc tgatgatctg 6000 gaaccattaa caccagccca tttcttagtg ggttcctcac tgcaagccct tccagacact 6060 gaactactca atattccatc gaaccgatta aatcaatatc aacagattca acaaaaattg 6120 cagcactttt ggactcgttg gcgacgagag tatttgaacc aattgcaagc tagaaccaaa 6180 cgatggaaac cagctatccc tgttgaaatt ggtaaacttg tcatcgtcaa ggatgaaaac 6240 gtacctccga ttcgttggaa aatgggaaga attatagccg tgcatcctgg taaagatgat 6300 gtcattagag ttgttactgt gaaaactgca accggtgaag ttaagcgtcc agtggaaagg 6360 atttgcattc taccgatccc gataaattgc gaggaagata aagcatcata aattagccct 6420 acttcccttc ccacactccc atcctgtcga agaggattct cttatttctt ttcagaaatt 6480 caactaattt ctgggtgggt gagga 6505 // ID Copia-119_AA-I repbase; DNA; INV; 4113 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-119_AA_; KW Copia-119_AA-LTR; Ty1_copia_Ele34; Copia-119_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4113 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1551-2075] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1353..4103 FT /product="Copia-119_AA-I_1p" FT /translation="MAEGAMAASESVHKVDCQHQWHRRLGHRDWAAAERII FT KEELATGIGVSDCGQRVVCECCMEGKSARLPFPPVVDRKSSRVLDIVHTDL FT CGPMDIVTPSENRYVMTITDDYSRFAVTYLLKQKNEAAGNIKDYVRWVSNI FT FGRKPRVVRSDGGGEFDNRELRAFYKSEGIQPQFTTPYPPQQNGVAERKNR FT SLIEMATCMLIDAGLEKRYWGEAMLTATYLQNRLPSRSIQTTPYQMWWNRK FT PDLSHLRVFGSQAYVHVPDTRRKKLDSKATKLTFVGYAMEQKGYRFVDLET FT DQVTISRDARFIELENGTSSVEVTIPRSKKPEEEIKLLPLKEKKEEKDDPF FT EESISAPEDLIETDDGDEVGEPVASSTVRRSERSNRGALPKHFDVYELDYV FT AGIAACAVEEPVGVKEALADPVWRKAMEEELEAHRINGTWQLVKLPEGRKV FT IGSRWVYKTKKNEENRVVKFKARLVAQGYAQRYGVDVGEVFAPVTTQATFR FT TFLTVAAKHNMIVQHLDIKTAYLHGTLEEEVYMRQPPGYVKAGNEELVCRL FT QRSIYGLRQSARCWNKRLHEVLVKNGFKAAVADPCLYIRGAGKTKVLLLVY FT VDDLLLASTNATELKIIQEQLNAEFELTNLGEVRHFLGVEVRREGTVFKIG FT LRNYIDGLLQAHGMNEAKPAKSPMDPGYLKLTDSGKSFDDPTSYRSLVGGL FT LYLAVVARPDIAASAAILGRRFSAPREIDWTAAKRVLRFLKKTRSYQLQLG FT GAVDQPLVGFSDADWAGDAESRRSTSGMVFQFGGGTISWASRRQSSVTLSS FT MEAEYVALSASCQEALWLRQLLHDFGEDQDEPTVIMEDNQGCLAFVRSERT FT SRRSKHIDTRERFVQDLCAKKMIQLQYCSTDRMTADIMTKPLGPLKHQEFC FT ELLGLKFAAEIPH" XX SQ Sequence 4113 BP; 1063 A; 944 C; 1223 G; 881 T; 2 other; ggttatgggc ccaggttgga acggattcga tctcgggaag attcgatcgg gaaaaatcga 60 atggcgtgtt tcgctgaccc gtggcgtcag tgaggtgaac tgcgaaggac aatagtggac 120 aatagtgccg gtcgttgacc gattgaaaag aacagtgatt tgtgagttct gtgaatatgg 180 gcgacgctaa attcgccatt cccaagttga acggtgccaa ctggcactct tggaaggtgc 240 gggttgagat gctgctagcc cgcgaagatt tgtggcatgt tgtcgaggac gacatccctg 300 cggaagaaga aatgagtgag cagtggaaaa gtgaagatcg caaagcgaga tcgactatca 360 ttctccttct ggaggatggg caactctcga aagtgcgaaa ttgtgtgcac gcgaaagacg 420 catacaatgc gctcagaaac caccatcaga agacgacccg ctcggtacgt gtatcgctcc 480 tgaagaagat mtgttccacc aatctgtcgg ataacggtga tgtcgaaaaa catcttcaag 540 agtttgacgg cattttcgaa cgcctggacg gtgccgggat gaggctggat aaagatacga 600 agatctgtat gcttcttcgt agtcttcctg cctgcttcga cggtttgatt acggcgctcg 660 atagccgtac agacgacgat atttcgctag aggtagtgaa atcgaagctg gtggacgaat 720 ataatcgaca attggaacgg aaaggtggct ctaccagagt agagcgagcg atgcggtcgg 780 atgtgcggta tagtgatggc agacctgaaa aggatacccg tacttgccat tactgcaaga 840 agacaggaca cattaaacga aactgtcgga agtttttgtc gacgcagaaa aaggacggga 900 attcttcccg ccaagattcg gacagttatc cgaaggcaaa gacagcgcac ggtgacgtca 960 gaggagtggc gtttacggtt ggtggtgaaa gttcttcgag ttgggtgatt gacagtggtg 1020 ccagctcgca catgtccaac gacaaggmtt tcttcgggtc gttgcgtgaa ttttctggag 1080 gatccatcac gctggccgac ggccagaaga ccaaaattgc gggtgaaggt tctggtgtga 1140 tccacggtgt cgacggttcc ggaaaagtgg caagaattga agtgaacaat gcgaagtatg 1200 tgcctggact atccacgaac ttgatcagtg ttacaaaatt ggccgagaag aacttccgtg 1260 tgaactttga tgtgaatggc tgtactatta tcgatgcgga agacgttgtg gtggccaccg 1320 gaagtcgaca cggtggatta tactacctgc gaatggcgga gggcgctatg gctgcttcgg 1380 aatcagtgca caaagtggat tgccaacacc agtggcaccg gcggctcggt catcgagatt 1440 gggcggctgc tgagcgaatt attaaggaag aactggctac tggaatcgga gtgagtgatt 1500 gtggacagcg tgtggtttgc gaatgctgca tggagggcaa gtcagcgagg ctcccatttc 1560 ctccggtcgt agatcgcaaa tcgtctcgtg tgttggacat tgtccacact gatttatgtg 1620 gtccgatgga catcgttaca ccaagcgaaa atcggtatgt gatgacgatc accgatgact 1680 acagccggtt cgcggttacg tacctgctaa agcaaaagaa cgaggcagca ggcaacatca 1740 aggactatgt gcgctgggtg agcaatattt ttgggcgcaa accccgcgtt gtgcgctccg 1800 atggaggggg agaatttgac aaccgtgaac tccgagcgtt ctacaaatcg gagggaattc 1860 aaccccagtt tacgacccct tacccccctc aacaaaacgg cgtcgccgaa cgtaaaaatc 1920 gttccctgat agaaatggcg acgtgcatgc tgatagacgc gggtttggag aagcgctact 1980 ggggcgaagc gatgctcacc gccacttatc tccaaaatag gctgccttcc agatcgatcc 2040 agacaacacc ataccagatg tggtggaacc ggaagccgga cttaagtcat cttcgggttt 2100 tcgggagcca ggcgtacgtg cacgtgccgg acacgaggcg taagaagcta gacagtaaag 2160 cgaccaagtt aacgttcgtc gggtacgcca tggagcagaa aggctaccga ttcgtcgatc 2220 tcgagacgga tcaggtgacc atcagccgtg atgcccgttt catcgagctg gagaacggaa 2280 cgtcgtccgt ggaagttacg atcccaagaa gcaagaagcc agaagaagaa attaagttgt 2340 tgccgctgaa ggagaagaag gaagaaaagg acgatccgtt tgaggagagc atttccgctc 2400 cggaagattt gatcgagact gacgatggtg atgaagttgg cgaacctgtt gcgagttcga 2460 ctgtccgtag atcggaacgg tcaaatcggg gtgcactacc gaagcatttc gacgtttacg 2520 agctggacta cgttgcaggc attgcagcgt gtgcagtaga agaaccggtc ggtgtgaagg 2580 aagctctagc ggatccggtg tggcgcaaag ccatggaaga agaattggaa gctcatcgta 2640 tcaacggaac atggcagttg gtgaagttgc ccgaaggaag gaaggtcatc ggatcgaggt 2700 gggtctacaa aacgaagaaa aacgaagaaa accgtgtagt gaaattcaag gccagattag 2760 tagcccaagg ctacgcgcaa cgatacggag tcgatgtagg tgaagtgttt gcccctgtta 2820 ctactcaagc aacattccgt acgttcctga cggtggcggc aaaacacaac atgatcgttc 2880 agcatcttga catcaagacg gcctacctgc atgggacgct cgaggaggag gtctacatgc 2940 gtcaaccccc aggatacgtg aaggccggaa acgaggagtt ggtgtgcagg ctccagcgca 3000 gcatctacgg cttgcgtcag tccgctcgct gctggaacaa gcgcctgcat gaagtgctcg 3060 tgaaaaacgg attcaaggcc gcagtcgcag atccctgcct ctacattcgc ggcgccggca 3120 agacaaaagt cctgttgctg gtgtacgtag acgatctgct cctggcttca acgaatgcga 3180 ccgaactcaa gataatccag gagcaactga atgcggagtt cgagctaacc aacctcgggg 3240 aagtgcgaca tttccttggt gttgaagtac gacgtgaagg cacagtgttt aagattggtt 3300 tacgcaacta catcgacgga ttgctgcagg cgcacggaat gaacgaggcg aagcccgcca 3360 aatcaccaat ggatcccggg tatctgaaac tgaccgattc tggaaaatca ttcgacgatc 3420 cgacgagcta ccgcagtctt gtgggtggct tgctctacct ggccgttgtt gcaagaccgg 3480 atatcgcagc gtccgctgca attttgggaa gaaggttcag tgcaccgaga gaaatcgatt 3540 ggacggcagc gaagcgcgta ctgcgcttcc tgaagaagac ccgaagttac cagttgcagt 3600 tgggaggagc tgttgatcaa ccgctagtcg gattctccga cgccgattgg gctggtgatg 3660 ccgaaagtcg ccgctctact tccggaatgg tgttccagtt tggaggcggc acaatttcgt 3720 gggcgagtcg tcgccaatcg tccgtcaccc tttcatccat ggaggctgaa tacgtcgcac 3780 tcagcgcttc gtgccaagaa gcgctgtggc tgcgacagct actccacgac ttcggcgaag 3840 atcaagatga accaacagtt ataatggagg acaaccaggg gtgcttggcg tttgtacgat 3900 cggaacggac cagccgtcgc tccaagcaca ttgacacccg tgagcgattc gtacaggact 3960 tgtgtgcgaa gaagatgatc cagctccagt attgctcgac cgatcgtatg accgctgaca 4020 tcatgaccaa gccccttggt ccgctaaagc atcaggagtt ctgcgagctt ctaggactga 4080 agtttgctgc agagattccg cattgagggg gag 4113 // ID Gypsy-48_AA-I repbase; DNA; INV; 4535 BP. XX AC supercont1.113; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-48_AA_; KW Gypsy-48_AA-LTR; Gypsy-48_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4535 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.113; Positions 2191822 2187288. XX CC Positions [1941-2474] - Reverse transcriptase CC Positions [3558-4034] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 712..1785 FT /product="Gypsy-48_AA-I_1p" FT /translation="MERHIFWTLKPESGENLEKFMLKAREQAYKCNFGTSQ FT QESRDICVIDKITLLAPPDLKEKLLQRDQPSLDDVFKIVASHQSVKYQASQ FT MVVAGPSGLSQSVAAGDVNRMYASSSNRFSTECSRCGRKGHLGHDPICPAR FT DKQCNVCKREGHFARKCKTPSAKSLAAQVKPPFREVSNRYGRQAPQRVRAI FT PVQERDSKSDPEEVQSFIFAIGDGDEFIWLKVGGVMMQALIDSGCNKNIID FT DGTWSRMKAQGVVIRNATKEVDHKFRGYGKDCKPMNVVGMFDATIEIISDD FT QQSHSEARFYVIEDGNQPLLGKETARQLNVLRLGFPVQNDGVNQVCVLLVL FT LSTYFINLQTIFFVL" FT CDS 2037..4520 FT /product="Gypsy-48_AA-I_2p" FT /translation="MRMANRAIKREYHFMPTIDDFLPRLKSAKFFSRLDIK FT EAFHQIELDESSRSITTFITHRGTYRYKRLMFGVSCAPEMFQKIIEQLLAD FT CENCLSFIDDIVVFGADEAEHDRCLNKVLEVLKSRNVLLNMKKCFFKVNEL FT DFLGHRLSGDGIKPAESKVEALKAFRTPKDVEELRSFLGLANYVGKFLPDL FT GTVAAPLRSLTHNGVPFKWTEAQENAFQKLKNMISDVKHLKFFDTKLRTRV FT VADASPVALGAVLLQFEHDHDDKDPRIICYASRSLTPTEQRYCQTEKEALA FT LVWAVERFAVYLLGRRFELETDHKPLEAIFAPTSRPCSRIERWVLRLQSFT FT YDVKYRKGSANIADSFSRLTDHHAVEQAYTENDEKFLILAILESTAIDISE FT IESATINDRELTLVREALRSGIWNETEIKPFEAFQNELGFVGELVVRGNKM FT VIPTSLRKRFLQLGHEGHPGESAMKRRLRDRVWWPGMDRDISKFVAVCEGC FT RLVGLPQKPEPMRCRPLPSEAWTDVAIDFLGPLPSGEYLLVIIDYFSRYKE FT VEIMRLITAKETADRLRRIFQRLGFPRTITLDNARQFLSSDFGIFCKENGI FT FLNFTVPYWPQQNGEVERQNRSLLKRLQISCALKRDWKRDLEEYLMMYYST FT PHSVTGRTPTELLMGRTIRTKIPSLKDIETAPLSEEFRDKDCSSKYRACER FT ENATRKAAESGIKEGDKVLMQNLIPGNKLSTTYSPTQYEVVKKAGTRVTIQ FT NEESGKIYERSSAHLKKIADHLEEPTLPTENDSLPQVSHQTTPSTALAGDE FT ESRPKRACRRPARLEDYVVNDDSSVL" XX SQ Sequence 4535 BP; 1343 A; 890 C; 1118 G; 1184 T; 0 other; acagtggcga cgaggaagga attggaattt cgcgagaaat ctttattttt ttttccgatt 60 attaccgttt gttttgcgga actcgagtga ggttaatttc agctgcctgc agtggtgagg 120 taagcattat tttgtttgtt tttctttcgg gccctattgg aaaggaagag tgtaggcggt 180 atatatgccg aaggcttaaa tgccggataa cagtggctgg aaagccgttg gcttaaatgc 240 cggataacag tggcttgaaa gccgttggct taaatgccgg aaaacagtgg cttaaatgcc 300 ggaaaatagt ggctttaatg ccgtgtactg atgattctga ccatataggg ctcgaaagcc 360 atgataacat tatgttacga tacatgatta cattgtttca tgtgatctta taggataatg 420 gatagatggg atatcctgcc gtttttgttc aaatcgctgc cccagaatga aatcaggggc 480 caatggacga aatggaagag gaatttcgag tacatcgtag ctgctagtga agagaccaat 540 aagacgaaac ttaaattctt tctgctggcc aaagcgggcc ctgatgttta ggaaattttc 600 caagtgattc ctggagctga cgtcgtggaa gatgcggaaa agacgattga tccgttctgt 660 gttgcgcttg ccaaacttga cgagtacttc gctccaaaac atcacgagtc aatggagaga 720 catatttttt ggacgctgaa gccagaaagc ggtgagaatt tggaaaagtt catgctaaaa 780 gcacgagaac aggcttataa atgcaatttc ggtacgtccc agcaagaaag tcgcgatatt 840 tgcgtgattg ataaaatcac gcttctggcg cctcccgatc ttaaagaaaa gctgctccag 900 cgagaccaac cgtctctgga tgatgtgttc aaaattgtag cgtcgcacca gtccgttaaa 960 taccaagcta gtcaaatggt cgttgctggt ccttccgggc tgtcgcagtc agttgctgca 1020 ggtgatgtaa accgtatgta tgcatcgtcg tcgaatcgat tttctactga atgctctcga 1080 tgtggccgga aggggcattt gggacatgat ccaatttgtc ccgctcgcga taagcaatgt 1140 aatgtttgca aacgagaagg acatttcgcc cggaaatgta agacacccag tgccaagagt 1200 ttggccgcac aggtcaaacc accgtttcgt gaagtgagta atcgatacgg aagacaagct 1260 ccacagcgtg tccgggcaat accggtgcag gaacgtgact ctaaatcaga tccggaagaa 1320 gttcaaagtt tcatatttgc aatcggcgac ggcgatgagt tcatttggtt gaaggtcggt 1380 ggagtgatga tgcaagcgct gattgattcg ggatgcaaca aaaacatcat tgacgatgga 1440 acctggagcc gtatgaaagc tcaaggggta gtcattcgga acgcaacaaa ggaagtggac 1500 cacaaatttc gaggatatgg aaaagactgt aaaccgatga atgtagtagg tatgttcgat 1560 gccacaatcg aaattatcag tgatgaccag cagtctcaca gcgaagctcg attctacgtt 1620 attgaagacg gaaatcagcc cttgctcggt aaagaaacgg ctcgacaact gaatgttttg 1680 cgattgggat ttccagtcca aaatgatggc gtgaatcagg tatgtgtatt gttggtgtta 1740 ttaagtacgt attttattaa tttgcaaact attttttttg tactataaag atcaaaaata 1800 ccacatcgtt cccaaaagtt aaaggggtaa agcttcgcat cccgattgac acttcggtaa 1860 ccccggtggc tcaacatgcc agacggccgc cgatcgcttt acttggaaga attgaggaaa 1920 aactagatca acttgaagcg gccgatataa tcgaaaaagt aagccattac agcgattggt 1980 tgtcaccgct agtagttctt gtgaaggaca acggggatct cagaatttgt gtggatatga 2040 ggatggcgaa cagggcgatc aaacgcgaat atcatttcat gccaacaatt gatgactttc 2100 tgcctcgatt gaaatctgcg aaattcttct ccagattaga tataaaagag gcatttcatc 2160 aaattgaact agatgagtca tcgcgttcaa ttaccacgtt tattactcat cgtggtacat 2220 atagatataa acgtctgatg ttcggggtgt cctgtgcccc tgaaatgttt cagaaaatta 2280 tcgaacaact acttgcggac tgtgaaaact gcctcagctt tatcgatgat atagttgtat 2340 tcggtgctga tgaagcggag catgatcgat gtctgaacaa agtacttgaa gttttgaaaa 2400 gccggaatgt attgctcaac atgaagaaat gtttcttcaa ggtcaatgag cttgatttct 2460 tgggtcatcg tctgtctggt gatggaatta aacctgcaga gagtaaagtt gaggcgctta 2520 aagctttccg aactccaaag gatgtagaag aactcaggag ttttttaggg ttagcgaact 2580 atgtgggaaa attcctgcct gatttaggaa cggttgccgc tcccctacgt tcgttaacgc 2640 ataatggagt accattcaaa tggaccgaag cccaagaaaa tgcatttcaa aaacttaaaa 2700 acatgatcag tgacgttaag catttgaaat tcttcgatac aaaactgcga accagagttg 2760 tggcagatgc ttcgccagtt gcacttgggg cagtactttt gcagtttgag catgaccacg 2820 acgacaaaga tccccgtatc atttgctatg ccagcagaag tttgacgccc acagagcaac 2880 ggtactgcca aactgaaaaa gaagctctgg cgctagtttg ggcagtggaa cgttttgctg 2940 tgtatttact gggacgccgc ttcgaactcg aaactgatca taagccgctt gaagcaatat 3000 tcgctccaac gtcaagacca tgttcgcgca tcgaaagatg ggtcttacga ctacagtcgt 3060 tcacatatga tgttaagtat cgcaagggat cagccaatat agcagattcg ttctccaggt 3120 taactgatca tcatgctgtt gaacaagcat acactgaaaa cgatgagaag ttccttatcc 3180 ttgctattct ggaatctact gctattgata ttagcgagat cgaaagcgca acaatcaatg 3240 atcgagagtt gactttggtc agagaagccc ttcgtagtgg gatttggaat gagactgaga 3300 taaaaccatt tgaagcgttt caaaatgagc ttggatttgt tggggagcta gtggtgagag 3360 ggaataaaat ggtaattccg acaagtttgc gtaaaagatt tctacaactc ggacatgaag 3420 gacacccagg agagtcagca atgaaaagga ggcttcgaga tagagtttgg tggcctggaa 3480 tggatagaga tatctcgaaa ttcgttgctg tttgtgaagg ttgccgttta gttggtttgc 3540 cccaaaaacc tgaaccgatg cgctgcagac cgttaccatc agaagcttgg acagacgtgg 3600 ccatcgactt cctgggtcca ttaccaagtg gtgaatactt gttggtcatt attgactatt 3660 ttagccgcta caaagaggtt gagataatgc gactcataac agctaaagaa actgctgacc 3720 gacttcgacg aattttccaa cgattaggat tccctcgtac tattacactc gacaatgcaa 3780 gacagttttt gagttccgat tttggaattt tttgtaaaga aaatgggatt ttcctcaact 3840 ttaccgttcc atactggccg caacaaaatg gggaagtaga gaggcagaat cgatcattgc 3900 tgaaacgtct ccagattagc tgtgccctta agagagactg gaaacgagac ttagaagaat 3960 accttatgat gtattactcc accccgcatt cggtaaccgg aagaactcct accgaactat 4020 taatgggtcg gacgataaga accaagatac catcgttgaa agatattgag actgcaccac 4080 ttagcgagga gtttagggac aaagattgct cttccaagta cagagcatgc gaaagggaga 4140 atgctactag gaaggctgca gaatcaggaa ttaaggaagg agataaggtt ctaatgcaaa 4200 atttgattcc tgggaacaag ctgtctacaa cctacagtcc aacgcaatac gaggtggtca 4260 agaaagctgg aaccagagta accattcaaa atgaagagtc gggcaaaatc tatgagcgca 4320 gttcagctca cttaaagaag atagcggatc atttagagga accaactcta ccaaccgaaa 4380 atgattctct tccacaggta tctcatcaga caactccgtc tacggctcta gctggagatg 4440 aggaatcgcg gccaaaacgc gcctgccgtc gtccggctag actggaagat tatgtggtta 4500 acgatgactc ttcggtgctt tgaaaaaagg ggaga 4535 // ID BEL-633_AA-I repbase; DNA; INV; 5503 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-633_AA_; KW BEL-633_AA-LTR; Pao_Bel_Ele82; BEL-633_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5503 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4517-5101] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3923..5470 FT /product="BEL-633_AA-I_3p" FT /translation="MVTAINDMSVLTKFSSFRKLQRVVSYVIRFINKARNR FT RTAGDSLKFATVTEMRLALTAIIRMVQQQQLFVEIQQIKKLQTNHDGERYV FT GRLRTLNPWIDDDGILRVNGRIKYANVNYEQKHPPILPAEHPVTKLLIQAI FT HNENLHVGPSGTLSIIRQRFWIINGRNAIRQQLRKCIICFKVNPTDTKIYM FT GNLPSYRVTQANPFQRTGIDFAGPIYVRKGHPRKPVYEKGYVALFVCMATK FT CIHIEFVSNLATSSFIAALHRFVSRRGLPSDIYSDNATNFAGASSELHELY FT VLLRQQQTVEALEEFKMPREINWHFIPPRSPHVGGLWEAGVKSAKYFLKRT FT AGEAKLTEEEWHTLLVQVEGLLNSRPLIAQSSDPSDYNVITPGHLLIGRAI FT NAIPEPSYDQLKAGTLSRWQHIQKMRSDFWKRWSVDYLAELQQRHKWNKNH FT TEVKVGDLVLLREDHVPPAQWKLGRVVDVHPGKDNVTRVVSVKTTSGVYKR FT STAKVAVLPLDVPEDKAITN" FT CDS join(77..2278,2282..3865) FT /product="BEL-633_AA-I_1p" FT /translation="MTSTPTNESDFAKDVFSTPIGNQKRSHAEKQRLKKMA FT EEVQAVIYQRGAVKGKLTRIRNALQRVVQEQVAPDEFLLRTHLKTIDAAYE FT EFNVFQNKLYALAPASAEEQEAKYIEFEDLYNEVRALVCRLLERVKQEANR FT PAANQDVDPVHVPVQQIHPVRASPLLNTPLPTFDGKAENWFRFKAIFTDVI FT NRYPGESDATKLYHLDKCLIGDAAGVIDQQTINDSNFDAAWEYLTERYEDV FT RKIVDIHANGLLHLKAMTAESGKQLRTLMDDCKRHVEALKFHEYEVSGLSD FT VLLVNILASKIDLETRKLWEGSITHGEVPTYEETTAFLVKRCQILERIEEN FT SAATKQKKPVSTTAAKVPPMKVTTLAASAELRCNFCEESHNNHQCKEFLKL FT NPSERFEKAKQAKVCYNCLRKGHTTARCSSAMSCKICKKRHHTTLHANVVT FT TTGTVNAQTSTTPQNAPEINASEPASNVNTTVASTHTISASCVSYQQRALL FT CTAIVNVISGDGVTQPCRVLLDCGSQVNLITEKLAGLLQVERKPANVQVIG FT VDGATTRVTAAASVHVQAISNGFTSKIECLVMKRITGPIPSSFVDVASWPL FT PPGLQLADAAFNRPQRVDMLIGVGHFFDLLKSGKIKLAENLPFLQETVFGW FT VIGGIIDSSSSNYKMVRCNVAVPEENLEKLVEKFWESEEVATSTRMSPEEV FT ACEQYYKQTTTRDSEGRYVVKLPFRTNVDQLGESQHALQRFDYLERKLSKN FT AELKKAYCDFMAEYIRLGHCRILREEEDASIGYFLPHHAILKPSSSTTKLR FT TVFDASAKTSSGLSLNDTLMVGPTIQDSLVNICMRFRIHPIVFTSDISKMY FT RMVSVSPDQTKFQRVFWRTDPSVPLKILELSTVTYGTASAPYLATRTLHQL FT AQDEGHNFPKAAKILVKDFYIDDVLSGANSPQEAVEIHDELVALLHKGGFE FT LHKWCTNSPELLASIPEELREKQVAFEQHDVNNVIRTLGLLWDPSNDQLRF FT RVAPPDHAQQVTKRFILSEIAKIYDPMGLLSPVVVVAKLLMRQLWRSKLSW FT DEDVPATLSEPWRRFRDGLESLKNLKIERLIISPRRIAVELHAFSDASMEA FT FGTCVYLRSILEDNTVEVRLLASKSRVASKATIPRLELCGFQLMARLVNQV FT EAALQIDFDRKVMWSDSQIVLCWLRKAPHQLSVYVGNRVAEVQERTSSFEY FT LFVRSEDNPADLVSRGLEAEDLARSEIWWNGPDYLRLNAFEEQTLEHGVLH FT C" XX SQ Sequence 5503 BP; 1536 A; 1281 C; 1358 G; 1325 T; 3 other; ttttggtcct acagcgccgg attttcgtaa aagtttcggt cgattttgca agtttcgcgg 60 acggattttt acgtacatga cttctacgcc cacgaatgag tcggatttcg cgaaagacgt 120 ttttagcact ccgattggaa atcagaagcg ttcgcatgcg gaaaagcagc gcttgaagaa 180 aatggcggag gaggttcaag ctgtcattta tcagcgagga gctgtaaaag gcaagctcac 240 gcgaatcagg aacgccctac agcgagtcgt gcaggaacaa gttgctcccg acgaatttct 300 gcttcgaacg cacctgaaaa cgatcgacgc tgcatacgag gagttcaatg tcttccaaaa 360 caagctgtat gccctggcac cggcgagtgc agaagaacag gaggcgaagt acattgaatt 420 cgaggattta tacaacgaag tgcgagcatt agtgtgtcga ttattggaac gagtgaagca 480 agaagcaaac cgtcccgccg ctaatcagga tgttgatccg gtacatgttc ctgtccagca 540 aatacatccc gtgcgagcat cgcctctgct gaataccccg ttgcctacct ttgatggcaa 600 agccgaaaac tggttcaggt ttaaggcaat cttcaccgac gttatcaaca gataccckgg 660 agagtcggac gcaacaaaac tttaccacct tgacaagtgt ctgatcggcg atgctgctgg 720 cgtgatcgac cagcagacga tcaacgatag taacttcgat gcagcttggg aatacctcac 780 ggagcgttac gaggatgtgc gaaagattgt cgacatacac gctaacggat tgctccacct 840 gaaggcgatg acggccgaaa gcggcaagca gctacgaact ttgatggacg attgcaaacg 900 tcacgtcgaa gcgctcaaat tccacgaata cgaggtaagc ggactttcag atgttttgct 960 ggtcaacatt ttggcatcga aaatcgattt ggagacccga aaactttggg aaggcagtat 1020 tacgcacgga gaagtaccta catacgaaga aacgactgcc ttcctcgtta agcgatgtca 1080 aatcttggag cgaattgaag agaactcagc ggcaacaaaa cagaagaagc ccgtttccac 1140 tacggcagca aaagttccac caatgaaggt aacaactctc gcagcgtctg cagagcttcg 1200 gtgcaatttt tgtgaagagt cccataacaa tcaccagtgc aaggaattct tgaagctgaa 1260 tccgagcgaa cgatttgaga aggcgaaaca agcaaaggtc tgctacaatt gtttaaggaa 1320 aggacatacg acggcscggt gctcttctgc gatgtcttgc aagatctgca aaaagcggca 1380 ccacactacc cttcatgcaa atgtggttac tactactggt actgtgaatg cgcagacgtc 1440 tacaacaccg caaaatgctc cagaaatcaa cgcaagtgaa ccggcttcga atgtaaacac 1500 aactgtagct tcaacgcaca cgattagtgc atcttgtgtt agctaccaac aacgagcact 1560 actatgtacg gctattgtga acgttattag cggagatgga gttacccaac cgtgccgtgt 1620 cttgttggat tgcggctcgc aggtcaatct gatcaccgaa aagttagccg gcctactcca 1680 agttgaacgg aaaccagcga atgtgcaagt cattggagtt gatggtgcta ctacgagagt 1740 aacagcggcg gcgtccgtcc atgttcaagc tattagcaac ggattcacga gcaagataga 1800 gtgtttggtg atgaagagaa ttacaggccc catcccatca agtttcgtcg atgttgccag 1860 ctggccgttg ccaccaggtc tgcaattggc cgatgctgca ttcaaccgac ctcagcgggt 1920 cgatatgctt attggtgttg gtcatttctt tgatctcctc aagtccggca agattaagct 1980 tgcagaaaat ttgccgttcc tgcaggagac agtgtttgga tgggtgattg gcggtatcat 2040 cgattcctcg tcctcgaatt acaagatggt tcgctgcaat gtggcggttc cagaagaaaa 2100 tttggagaag ttggtcgaaa aattctggga atccgaagaa gtcgctacgt ctacacgaat 2160 gtcaccagaa gaggtggcat gtgagcaata ctacaagcaa actaccacga gggattctga 2220 aggacgttac gtcgtaaagc taccatttcg aaccaacgtc gatcagcttg gagaatctag 2280 kcagcatgcc ttgcagcgat tcgattactt ggagcgaaag ttaagtaaaa atgcggaact 2340 caagaaagca tattgtgact tcatggcaga atacattcgt ctcggccact gtcggatttt 2400 gagagaagaa gaagatgcct caattggtta ctttttgccg catcacgcta tattgaagcc 2460 ttcgtcgtct accacgaaat tacgcaccgt ttttgatgca tcagcgaaga cttcatctgg 2520 actctcttta aacgacacgc ttatggttgg tccgaccata caagattctt tggtcaacat 2580 atgcatgcga ttccgtatcc atcccattgt gttcaccagc gatatttcga aaatgtacag 2640 gatggtctcc gtttctccgg atcaaacgaa attccagcga gtgttttgga gaacagaccc 2700 ttcagttcct ttgaagatcc tcgaactctc gacggtcacg tatggtacgg catctgcgcc 2760 gtacctcgct acacgtactc tgcaccagtt ggctcaagat gaaggtcata atttcccaaa 2820 agctgcgaag atattggtga aggactttta tatcgacgat gttctgtctg gagccaacag 2880 tccacaggag gctgttgaaa tacacgatga gctggtggca ctattgcata aagggggctt 2940 cgagctccac aaatggtgca caaactctcc agagctatta gcgtctatcc cggaagagtt 3000 acgtgaaaaa caggtagcat tcgaacaaca cgacgtcaac aatgttatcc gcacgcttgg 3060 attgctctgg gacccatcca atgatcagct gagattccgt gtcgctccac ctgatcatgc 3120 tcagcaggtt acaaagcgct tcatactgtc ggagatagca aagatttacg atccgatggg 3180 cctgctatca ccagtggttg tggttgccaa gctgttgatg cgtcagctgt ggagatccaa 3240 gctttcctgg gacgaagacg tacctgcaac actgtcagag ccatggcgac ggttccgaga 3300 tggactcgaa tcgttgaaga atctgaaaat cgaacgcttg attatttcac caagaagaat 3360 tgcagtagaa ctacatgcct tctcggatgc atctatggaa gctttcggaa catgtgtgta 3420 tctccgaagc atcctcgaag ataacacagt ggaggttcga ctactggcca gcaaatctcg 3480 cgttgcttcc aaagctacaa ttccaaggtt ggagttatgt ggtttccagc tcatggcacg 3540 actggtcaat caagttgaag cagcgctaca aatagatttc gatcgcaagg ttatgtggtc 3600 agactcgcag atagttttgt gctggttacg aaaggctcca catcaactca gcgtctacgt 3660 cggaaataga gtcgccgaag ttcaagagcg tacgtcgtct ttcgagtacc ttttcgtcag 3720 atcggaagat aatccagcgg acctcgtgtc ccgcgggctc gaagcagagg acttagctcg 3780 aagtgaaata tggtggaacg gtccggatta cttgcggctg aatgctttcg aagaacaaac 3840 actggagcac ggtgtactac attgttaatt tttatgccgt gcactggagg aagttgacaa 3900 tctacctgaa gttcgagtgg caatggttac tgcgatcaac gatatgtcgg tcctcaccaa 3960 gtttagttcc tttaggaaac tccaacgagt tgtatcgtat gtcatccgtt ttatcaacaa 4020 ggcaagaaat cgtcgaacgg ctggtgacag tttgaaattc gctactgtta ctgaaatgag 4080 attggcttta accgcaataa tcaggatggt ccagcagcag caactatttg tggaaataca 4140 acaaatcaag aaactccaga ccaaccatga tggtgagcga tacgttggca gactgcgaac 4200 attgaatccc tggatcgacg acgacggaat tctacgtgtt aacggaagga tcaagtacgc 4260 aaatgtgaac tacgaacaga aacacccacc gatacttcca gctgagcatc cagtgacgaa 4320 attgttgatt caagccatcc acaacgaaaa cctacacgtg ggccccagcg gtactctttc 4380 gattatcaga cagcggtttt ggatcatcaa cggtcggaat gccatccgac agcaacttcg 4440 aaagtgtatt atttgcttca aggtcaatcc aacagatact aagatctaca tgggaaactt 4500 gccaagttac agagtgactc aagctaaccc attccagcgt acgggaattg attttgccgg 4560 acccatctac gttcggaaag gtcatcctcg caagccagta tacgagaaag ggtatgtagc 4620 attattcgta tgtatggcga caaaatgcat ccacatcgag ttcgtgtcga acctggctac 4680 aagtagcttc atagcagccc tacatcgttt cgttagtcgt cgcggcctgc cgtccgacat 4740 ttatagtgac aacgccacga actttgcagg tgcgagttca gaactgcatg aactttacgt 4800 tctacttcgt caacaacaga cggttgaagc acttgaggag ttcaaaatgc cccgtgagat 4860 aaactggcat tttattcctc ctcgatcacc ccatgtaggg ggattatggg aggctggggt 4920 taagtcggct aaatacttcc tgaagcgcac tgccggcgaa gcaaagctta ctgaagagga 4980 atggcacact ttgctggttc aagtcgaagg gttgctgaat tcacgtcctt tgatagcaca 5040 gtcttccgat ccaagcgatt acaacgtgat cacaccaggc cacctattga tcggtcgtgc 5100 tattaacgcg attcctgaac caagctatga tcagctcaag gctggtacac tttcaaggtg 5160 gcagcacatt cagaaaatga gatctgattt ctggaagcgc tggtcagttg actacttggc 5220 tgaacttcaa caacgtcaca aatggaacaa aaaccacact gaagtcaagg tcggtgatct 5280 agtgttactg cgcgaagacc atgttccacc ggctcaatgg aaacttgggc gagttgtaga 5340 tgtccaccca ggcaaagaca acgtgacaag ggtggtgtcg gtaaaaacca catcaggtgt 5400 gtataagcga tcaacagcta aagttgcggt acttccgctg gacgttcctg aggataaggc 5460 gataactaat tgaacttccc agaacttcaa tggcgggggt gga 5503 // ID Copia-1-I_HM repbase; DNA; INV; 3946 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3946 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 440-440 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1..3117,3048..3923) FT /product="Copia-1-I_HM_1p" FT /translation="GYGPSNQWQNLIFDGDSRKFEIWETKFLGYMKLKNLK FT DTLIGNGEIDEDENEIAFAELIQFLDERSIALIIRDAKDKGREAFKILKDH FT YASTSKPRIITLYNQLTSLKKFTNESITDYVIRAEKSATSLNTAGESISDS FT LLIAMVLKGLPDDYRAFVAIITQSETSNNFQKFKQALRSFEETEYTRTTHI FT EHSDTKIMKFKTSGGNIKKVLTCYTCGTIGHKSSECQAKTKPKIWCNYCRS FT NTHKDSSCRKQIKDKAKAVNNIQKHTYAFKINEEQYRETQNKTFLVDCGAT FT THIVNTDEYFTYIDEYFKPEEHFIELADGTRTNNIAKKRGTVKTLLRTKND FT ELVTVTLENTLYIPTYPQCIFSVQAATAKGARINFHKYSAELISKEGTVFP FT IRQYGRLYYFYKNSINETRTESLEMWHKIMGHCNTNDIIKLEDVAQGMKIN FT NIAKFDCETCILSKNVNTCNQQPDTRATYPFQLVHTDLAGPIEPVAIGGFK FT YVINFVDDYSSCLFTYFLQQKSDAAKAAEKFLADIAPYGKVKTFSFYNDIS FT PSGNVXCIRSDNGGEYLSKEFNELLLKHTIKHEFTSPYSPHQNGTAERNWR FT SLFDMARAMIIESKLPKHLWTYAISTATYIRNRCYVQRIKSTPYGLVTGLK FT PNISRLHLFGSVCYPYVHNTKKLQPRSIKGYFVGYDKDSPSYLVYYPESNS FT VLKHRLVKFTEKYESVSQIDTYNTDDFINVEPKTTESTTKPSTPDNKQNIP FT QEISSSRYPQRQRQPPKYLEDYIVDNDNDDEINYIDYCYLMNIPTSYNNAI FT NTXESDKWKNAMDEEIQSLTNNDTFIITELPANKKVVGGRWIYTIKGNNNK FT IIYKARYVAKGYNQIQGIDYLETFSPTARMESVRILMQISVQYNLILHQMD FT VKSAYLHAPIEREIYVNQPPGYEKTHNNKQLVWKLNKSLYGLKQSGRNWQN FT VLSDFLEEIQFIQSNADPCVFVTRSNTEISMILVWVDDIIIAANSNELLIK FT IKKKLSKRFKMKDLGPLTSFLVFNSNQLVIISLNERFRTINFFLGIQFKST FT SNYITMNQSDYLQNVLQKFGFDNCKPRSTPCEQVPNSYHNQESIKSNDSEI FT KLYRQMVGSLLYAMTCTRPDLSYVVTKLSQHLSKPNSGDWIMIKHVFRYIK FT HTLNYCLTFRKTDELKLYAFCDADWASSLEDRHSISGYCFSLCKDGPVXSW FT KSKKQSSVALSTCEAEYMSISAACQEISYLAKLLRELLEVKIEPVTLRNDN FT QGAIALAKNPIKHTKSKHIDIRYHFIREFHQQGRILLEYIQSNENYADIFT FT KPAKKDSLRKFKDFLFGL*" XX SQ Sequence 3946 BP; 1541 A; 626 C; 611 G; 1157 T; 11 other; ggttatgggc ccagcaacca gtggcaaaat cttattttcg atggagatag taggaaattc 60 gaaatatggg aaacgaaatt ccttggatat atgaagttaa aaaatttaaa agatacttta 120 attggaaatg gcgaaattga cgaagacgaa aatgaaatag cgtttgctga acttatacaa 180 tttctggatg aaagatccat agctttaatt ataagagacg ctaaagataa aggtagggaa 240 gcctttaaaa ttctaaaaga tcattacgct agtacaagta aacctcgtat tattacctta 300 tataaccaat taacaagttt aaaaaaattt accaacgagt ctataacaga ttatgttatt 360 agagcagaga aatcagccac ttcattaaat acagctggcg aaagtattag cgattcatta 420 cttattgcta tggtacttaa aggattacct gacgattata gagcttttgt agctataatt 480 acccaatctg aaacaagtaa taatttccaa aaatttaagc aggctttacg aagttttgag 540 gaaacagaat acacaagaac aactcatatt gaacattcag ataccaaaat aatgaaattc 600 aaaactagtg ggggtaatat aaagaaagtt ttgacttgtt acacttgtgg aactattggt 660 cacaaatctt ctgaatgtca agcaaaaact aaaccaaaga tatggtgtaa ttactgtaga 720 tcaaataccc acaaagacag ttcttgcaga aaacaaataa aagataaagc taaagctgtt 780 aataatatac aaaaacacac gtatgcattt aaaataaacg aagaacaata tagggaaacg 840 caaaataaaa catttctggt agactgtggg gcaactaccc atattgtaaa tactgatgaa 900 tacttcactt atattgatga atattttaaa ccagaagaac actttataga gttagctgat 960 ggcaccagga caaacaatat agctaaaaag agaggaacag ttaaaaccct tttacgtaca 1020 aaaaacgatg aactagtaac tgtcactctt gaaaacactt tatatatccc aacctaccca 1080 caatgcattt tttctgttca agcagctacg gctaaaggcg ccagaatcaa cttccataaa 1140 tatagtgctg aacttatatc taaagaggga actgtgtttc ctatccgaca atatggtcgt 1200 ttgtattatt tttacaaaaa ctcaataaat gaaacacgaa ctgaaagttt agaaatgtgg 1260 cacaaaataa tgggacattg taataccaat gatattataa aattagaaga tgtagctcaa 1320 ggaatgaaaa tcaacaatat tgctaaattt gattgtgaaa catgtattct atctaaaaat 1380 gttaacactt gtaaccaaca acctgacaca agggcaactt acccattcca acttgtacat 1440 actgatttag ccggacctat agaacctgta gctataggtg gatttaagta tgttataaac 1500 tttgttgatg attactcaag ttgtttgttt acatatttcc tgcaacaaaa aagtgatgct 1560 gcaaaagctg ctgaaaaatt tctagcagat attgctccat atgggaaggt taaaacattt 1620 agtttttata atgatatttc cccatctggt aatgtamaat gtattcgcag tgataacgga 1680 ggtgaatatt tatcaaaaga atttaatgaa ttacttctaa aacataccat taaacatgaa 1740 tttacatcac cttattcccc acaycaaaat ggaaccgcag aaaggaattg gcgttcatta 1800 tttgacatgg caagagccat gattattgaa tcwaaacttc caaaacattt atggacttat 1860 gcaattagca ctgcaacata tataagaaat cgatgctatg tacaacgaat taaaagtaca 1920 ccatatggat tagtaactgg tttaaagcca aacatttcga gattacactt atttggatca 1980 gtatgttacc catatgtaca taataccaaa aaattacaac cacgaagcat taaaggttat 2040 tttgttggat atgacaaaga cagtccatca tacctagttt attatccaga atcaaattca 2100 gttttaaaac acagactwgt aaaatttaca gaaaaatatg aatctgttag ccaaatagat 2160 acctacaata cagatgattt tataaatgtt gaaccaaaaa ctacagagtc tacaactaag 2220 ccatccactc ccgataataa acaaaatata cctcaagaaa tttcctcatc acgttaccct 2280 caaagacaaa gacaaccccc aaaatatttg gaagattaca tagttgataa tgataatgat 2340 gatgaaatta actatataga ttattgttat ttgatgaata tacctacwtc atacaataat 2400 gctatcaaca ccratgagtc tgataaatgg aaaaatgcca tggatgaaga aatccaatca 2460 ttaacgaata atgacacytt tattattact gaactcccag caaataaaaa agtagttggg 2520 ggaagatgga tctacacaat taaagggaac aataataaaa ttatatacaa agctagatat 2580 gttgcaaagg gttacaatca aattcaagga attgattatt tagaaacttt ttcacccaca 2640 gctagaatgg aatcagtccg aattttaatg caaatatctg tccaatataa cctcatttta 2700 catcaaatgg atgttaaaag tgcttatcta catgcaccta tagaacgtga aatttatgta 2760 aaccaacctc ctggytatga aaaaacccac aataataaac aattagtatg gaaattaaat 2820 aaatcacttt atggtttaaa acaaagtggt agaaattggc agaatgtttt aagtgatttt 2880 ctcgaggaaa tacaatttat tcaatcaaat gctgatcctt gcgttttcgt tacaagaagc 2940 aatactgaaa tatcaatgat tctagtatgg gttgatgata taattatagc cgccaactca 3000 aatgaactat taattaaaat taagaagaaa ttaagtaaac gatttaaaat gaaagattta 3060 ggaccattaa cttctttctt ggtattcaat tcaaatcaac tagtaattat atcactatga 3120 atcaatcaga ttacttgcag aatgttctac aaaaattcgg ttttgacaat tgcaaaccaa 3180 gatcaacccc atgtgagcaa gttcctaatt cataccataa tcaagaatct ataaaatcra 3240 atgattcaga gataaaacta taccgacaaa tggttggaag tcttctttac gcaatgacat 3300 gtacwagacc agatttaagc tatgttgtta caaaattatc tcagcatcta tcaaaaccaa 3360 atagcggtga ttggattatg attaaacatg tatttcgata tattaaacat actctaaact 3420 attgtttaac ttttcgaaag actgatgagt taaagcttta tgctttctgt gatgcagatt 3480 gggcttcatc tttagaagat cgacatagta tatcaggtta ctgtttttct ctgtgtaaag 3540 atggaccagt crttagttgg aaatcaaaga aacaatcgag tgttgcactg tccacctgtg 3600 aagcagaata tatgtcgata tctgcagctt gccaagaaat tagttattta gcaaaactat 3660 tacgagaatt attagaagta aaaatcgaac cagtaaccct tagaaacgat aaccaaggag 3720 ccattgcatt agcgaaaaat cctataaaac atacgaaatc taaacatata gacatacgtt 3780 atcattttat acgagaattt catcaacaag gtcgtatact attagagtat atacaatcaa 3840 atgaaaatta tgctgatatt tttactaaac cagcaaagaa agattcatta cgaaagttta 3900 aagatttttt atttggactt taaaataata tgttgtcaag ttgggg 3946 // ID BEL-3_DPu-LTR repbase; DNA; INV; 309 BP. XX AC scaffold_172; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_DPu_; KW BEL-3_DPu-LTR; BEL-3_DPu-I. XX NM BEL-3_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-309 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 654-654 (2010). XX DR Genome; scaffold_172; Positions 77134 77442. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 309 BP; 84 A; 79 C; 62 G; 84 T; 0 other; tgttgggaat agaatcgatg cccaacagcc gacgactaga atcggccggc aaactcgatt 60 gccgcgacat ttaaatgatt actgtattcg gggcccccga taggtggccc cctgtgccgc 120 cgtttacccc ccatttggcc attgttcctt tcctcctatt ttaaacccat tcgaatttat 180 gctgtaaatt atttagtgta caataaacca acgcagagga aggaactcaa gtcgaacagt 240 gtcgtaattc ttatattcaa cttaagtgtg taatagggcc gacaccccaa acgcattgca 300 ttcttaaca 309 // ID Copia-20_CQ-LTR repbase; DNA; INV; 235 BP. XX AC AAWU01016025; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_CQ_; KW Copia-20_CQ-I; Copia-20_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-235 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 356-356 (2011). XX DR GenBank; AAWU01016025; Positions 38408 38174. XX SQ Sequence 235 BP; 53 A; 70 C; 53 G; 59 T; 0 other; tgttgagcca ccctattgga ggtgacgtgc gcggatgcaa ccctgcgcac aacggcgccg 60 ccgttcaaga gaaaaagtga cacgtcacac acagttgttt ctaagccagc ttcgcaacaa 120 taaagacaca ttttgttgtt actcgcgcgt attttatttc cggcaaagtt aaccggatcc 180 gtccctcctt ttcgcgttcc ctgctgccgt cggtactctg ctaatccggc caaca 235 // ID Gypsy-2_DWil-I repbase; DNA; INV; 5615 BP. XX AC scaffold_180697; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DWil_; KW Gypsy-2_DWil-LTR; Gypsy-2_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5615 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180697; Positions 2660508 2666122. XX CC Positions [3114-3617] - Reverse transcriptase CC Positions [4687-5196] - Integrase core CC 'ATAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 837..2321 FT /product="Gypsy-2_DWil-I_1p" FT /translation="MSVRHSSEDLAAIQDDNTLCLICDKPIQYRAQLRSTT FT CGHHFHIACFNIRVGSREICPTCDQPLQSMEGDGAEALSGGPQTSHQARVQ FT TRSAAQRANLLAEMAQRDVSQERRLDESVEPAESVRNSDVNLARLIANAMA FT TAASRQNERLEIQAVRQTELLTQALENGFQRLYSVVATHQANAEESTQQAA FT NLRSQVSATAASEQASNSNSILELRPDRISQVISSWKLRFNGKSGISVDEF FT IYRAEAMTHQALEGNFVVLSRYVSNLFEGNASEWFWRYHKKVSHIKWLDLC FT RALREQFKDERTDQHIKAAISKRRQGDKESFDEFYEAIVALADKLSTPLPE FT IELLEHLRANLLPDIQHELLYESIASVSKLRQLVRTRETFMQTVRKPLGVP FT PRPMPRRMVNAVSIQTEEESEAEDDVPVVEAVNLVCWNCDTAGHPYQECVA FT ERRVFCYGCGKPNTYKPSCTHCNDHSKNTAGRALRRSARKQMISRASNTD" FT CDS 2682..4574 FT /product="Gypsy-2_DWil-I_2p" FT /translation="MSNKVNTADGTPQQIVGKISTEVNFRGNSKILKFYIV FT PGLTQDLYLGIDFWLAFDLLPSGLSVGPIQIASLSVEDTAQRMLDSNQKQR FT LSECMGLFPSFAKLGLGKTQLLSHLIDVAGAAPIKQRHYPMSPAKEQLIYE FT ELDRMLKLGVIEESQSAWSSPVVLVSKPGKHRLCIDSRRLNAVTVKDAYPM FT PIIDGILSRLPKAEYITSLDLKDAYWQIPLQPESRDKTAFSVAGRPLYHFK FT VMPFGLCNAAQTMSKLMDKIVPPHRRHQIFIYLDDLLIVSDSFDEHITLLT FT ELSKCISDAGLTINVEKSHFCRKNVKYLGHIVGEGTIKTDPDKISAIVDFP FT TPKSVKQVRRFLGMAGWYHKFIQNYAAIASPITDTLKHKRKFIWTSEAQSA FT FEDLKTRLSQAPVLHSPDFRKPFSINWDASHTGVGAVLTQLTEDKDEVPIA FT FMSKKLNQAQRNYSVTEKECLAAVLAVKKYRAYVEGQEFSIITDHASLKWL FT MSQSDLSSRLARWSLKLQGFNFTISHRKGSKNVVPDALSRVFTPDLAVLDS FT DLGIDLNSVHFLSPEYVELKTKVEKNPQSVPDVKVMGNLVYRRSEHFPGER FT LADDHCWKLWIPKGLVEQFEIGSRKPTSGPQRY" XX SQ Sequence 5615 BP; 1737 A; 1041 C; 1329 G; 1508 T; 0 other; gttagttaag tattcataat tttattaatt ggcgcccaaa cgtggggcca aatcggtttc 60 gggagccccg acgacaacat tttagtcaac aacagctttt tttatgggta tttatatctc 120 ggttttcgaa cgctagtatt ctttactgta ttgggtggtc agtcggagta cataaaagtt 180 tcgacataac tctatgtccc cccccccaac aaaaaaaaaa aaaaaaaaaa aaaaaaaaca 240 ttagggcaac gcacaatcaa ttggtcagct ggatcagcct taaattctaa aatattttcg 300 gctccgcgct taggattgtc ggtgtagcca caggtaattt tcgatagtca gaaagagtat 360 ttttttctcg ttttggatca ggtttaacaa tggtcaaaga ctatagcctc caagcggtta 420 ataattctta gaactattgg ttttttattt gcctacaagt taggtacaaa atgtttattg 480 ctttaagcag ttaggtaaat tgtaaggagt tatgtgatac ttggcaataa gctagggcat 540 acctgcaatt tcaattatac ttaaagccct cttttatcgt tattttagca gctgctcaaa 600 ccattttttt ttaacacatg gcatgattcc cctcattact tttatgaaag ttattcaatt 660 ttttttatat tttctttatt tctttgcgat attattataa taagtataac tttattaatc 720 aaacaattta gtggaaacag tttcatcagg aattaaacag atctgggatt aaaatcttaa 780 gagcgaagtc ctcagcatag ttagttaaca aataggtaca agttttgttc tgagaaatgt 840 cggtaaggca cagcagcgaa gatctagccg cgattcagga cgacaatacc ctttgcctta 900 tctgcgataa accaattcag tatagggcgc agctacggag cactacttgt ggtcatcact 960 tccacatagc gtgcttcaac atacgcgtcg gtagtaggga aatttgcccc acttgtgatc 1020 agccgttaca gagtatggaa ggagacggag ctgaagcact tagtggggga ccacagacca 1080 gccatcaggc aagagtgcaa accagatcag cagcgcaaag ggccaactta ctggcagaaa 1140 tggcacagag agatgttagc caggaaagaa ggttagatga gagtgtagag ccagccgaat 1200 ctgttaggaa tagcgacgtg aatttagcaa ggttaatagc gaatgccatg gcgacggctg 1260 catctcggca aaatgagcga ctggaaattc aagcagtcag acaaacggaa ttgttgacac 1320 aagcgcttga aaatgggttc cagaggttgt actcagtagt agcgacacac caggcgaacg 1380 ctgaggaaag tacacaacag gcagcgaatt tacgtagcca agtatcagca acagcagcta 1440 gtgagcaggc aagcaacagt aatagcattt tggagcttag accagacaga ataagtcagg 1500 ttataagtag ctggaagtta aggtttaatg gcaagtccgg tatatcagtg gacgagttca 1560 tttatagagc ggaagcaatg acccatcagg ctttagaagg taatttcgta gttctgtcgc 1620 ggtatgttag caatttgttc gaagggaatg ctagtgagtg gttttggaga taccacaaga 1680 aggtatcaca catcaagtgg ctagacttat gtagagcctt gagggaacaa ttcaaagacg 1740 agcgtaccga tcagcatatt aaagctgcta tcagcaaacg cagacaggga gataaagaat 1800 cgttcgacga attctacgag gcgatagtgg cgctagctga caaactgtct actccgttac 1860 cagagattga gttactagag catcttcggg ctaatctctt gcccgatatt cagcatgagt 1920 tattgtatga gtccatagct tcggtctcta agttaaggca gttagtgcgt actagagaaa 1980 cctttatgca gacagtacgg aaaccattgg gtgtaccacc aagaccaatg cctaggagaa 2040 tggtaaatgc agtgagcatt caaacagaag aagagtctga ggcagaagat gacgtgccag 2100 tggtggaagc agttaatttg gtttgttgga actgtgatac ggcgggacat ccctatcaag 2160 agtgtgtagc agaacggcga gtgttttgct atggctgtgg gaagccaaat acgtataagc 2220 catcttgtac ccattgtaat gaccattcaa aaaacacggc aggtcgtgca cttcggagaa 2280 gtgcacgcaa acagatgatc tcgagggcca gcaatacaga ttagaggaaa cacaagggtt 2340 agaagaatat ttgcccgaat ttacgcatat ccccgatgag aaatcacaag aaatatctaa 2400 atattcagaa acggaaccag tagagtcacc tagattagga atctattggt caaagactcg 2460 ttcaagagag cgaaggaagg tttttaataa aggtgttaag aaagagcgaa agctaatagc 2520 ctcggcggta gtcggcaatg tttttgatat ccgtccttat gcagaggtaa agttattcaa 2580 tagattagta agtggcctca tggatactgg ggctagcata agttgcgttg gtggcagttt 2640 tgccgaagaa ttattggctg ttaaaacaga atattaggcc aatgtcaaat aaagtaaata 2700 ctgcagatgg aacaccccag cagatagtag gaaagataag tactgaagtt aattttagag 2760 gaaatagtaa gatattaaaa ttttatatag tccctgggct aacccaggat ctttacttag 2820 gcatagattt ttggctcgct ttcgatttat taccatctgg actgtcagta ggaccgatac 2880 agatagccag tttaagcgtt gaggatacgg cgcaacgaat gctagacagt aatcagaagc 2940 aacgtttgtc tgaatgtatg ggtttgtttc catcattcgc taagttggga ctggggaaaa 3000 cgcagctgct gtcacattta atagatgttg ctggtgcagc accaattaag cagcgacact 3060 atcctatgtc accggcaaaa gaacagttga tatatgaaga gttagatcgt atgttaaaat 3120 taggagttat tgaggaatct caaagtgctt ggtcttcgcc tgttgtgttg gtgagtaagc 3180 caggaaaaca taggttatgc atagatagta gacggctgaa tgccgttaca gttaaggatg 3240 cttacccaat gccaatcata gacggcatcc ttagcagatt acctaaggcc gagtatatca 3300 caagtctcga cttgaaggac gcgtattggc aaattccttt acagcccgag tcacgggata 3360 agacagcctt ttcagtagca ggtagacccc tgtaccattt caaagtaatg ccttttggat 3420 tatgtaacgc ggcacaaacc atgtccaaac tcatggataa aatagtgcca ccgcatcggc 3480 gccatcagat atttatctac ttagatgatc tgttgatcgt ctcagattcg tttgacgagc 3540 atataactct acttacagag ctatccaaat gtatatccga tgcagggtta actataaacg 3600 tcgagaaaag ccatttttgt agaaagaatg tcaagtatct aggacatatt gtaggtgagg 3660 gaactattaa gacagaccca gataaaatat ccgccatagt cgatttcccc accccaaaat 3720 cggttaaaca agttcggcgc tttttaggaa tggcgggttg gtatcacaaa ttcattcaaa 3780 attatgctgc tattgcgtct ccaattacag atactttaaa gcataagcgt aaattcatat 3840 ggacgtcaga ggctcagtca gcatttgaag acttgaagac acgtttaagc caggcaccag 3900 ttttacatag tcccgatttt agaaaacctt tctccataaa ctgggacgcc agtcatacgg 3960 gagttggcgc ggtgctaacc cagttaacag aggataagga cgaagtaccc attgcgttta 4020 tgtcgaaaaa actgaaccag gcacaaagaa attactcagt tacagagaaa gagtgtttgg 4080 cagccgtgtt agcagttaag aaatataggg cctacgtgga gggtcaagaa ttttctatca 4140 taaccgacca tgcttcgtta aagtggttaa tgagtcaatc agacttgagt tctaggttgg 4200 ccagatggtc tttgaaactt cagggattta attttaccat ttcacatcga aaaggtagta 4260 aaaacgtggt gccagacgct ttgtccagag tttttacccc tgatttagcg gtattagata 4320 gtgatttagg aatcgatctc aattcggtgc attttctaag tcccgaatat gtggagctaa 4380 agacgaaggt cgaaaaaaac ccccagtcag taccagacgt gaaagtaatg ggcaatttag 4440 tttaccggcg tagtgagcat tttcctgggg agaggttagc agatgatcat tgctggaaac 4500 tatggattcc caagggatta gtggagcagt ttgaaatcgg ctcacgaaag cccactagcg 4560 gcccacagcg gtattaataa gacattggaa aaacttaggc gctactatta ctggccaaat 4620 ctagtgtcag aagttaaaga gtttataaac agatgtgaag tttgtaaagc caccaaacac 4680 ccgaattata cgttaaggcc accattaggc aagtcgggtg aaacgtcaag gttcttcgag 4740 aaactgtatg tggatttctt gggtccttac ccgcgaagca agtctggcaa tattggaata 4800 tttattgtag tggaccactt ctcaaaattt ccgtttctaa agaaggtaaa aaagtttact 4860 gcagaagtgg tcacacagtt tctagaagag gatctttttc attgttttgg ggtacctgag 4920 attatagttt ccgataatgg gccccaattt agagcacacc attttaatga gctattgcaa 4980 aggtataaag tgcgtcacac atatacggct gcgtatgctc cgcaggctaa cgtgtcagag 5040 agagtaaacc ggtcggtatt ggcagcagta aaggcttacg taagtccaag ccagtctgat 5100 tgggacgaaa agctaagtag tatcgcttgt gccttgaggt ctactattca ttcggcgata 5160 aacacaacac cctataggat ggcatttggt cagcatatgg taaccgacgg gtccgtttac 5220 caagtgctac ggaacttaga attgttagag gatcgagctg tgaggtttag tagggacgac 5280 tctttcgaaa taatgggaac taaagctaaa gaggagtcac gaaaacagca tgatacgtag 5340 tcgcgaggtg tctttttcag ttggacaaga ggtatttcgt aggaacttcc agcaaagtaa 5400 tttctcaaaa ggattcaatt ccaagttagc gccaccgttc gtgaaggcga gagtgagacg 5460 gaagctcggc aacgcgtatt atgagttaga gaatctacaa ggtcatgtgg taggcaaatt 5520 ccatgcgaaa gatattaagc aataattttt tttgcccggt ttcaagcgag tatcaccctt 5580 tccattaggt aagtgtgatt ttagtggggg ggttt 5615 // ID I-78_AAe repbase; DNA; INV; 6456 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-78_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6456 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1349-1349 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 548..1912 FT /product="I-78_AAe_1p" FT /translation="MSTPYGGDPEGSLGPRYPSFMDPQNEFGRLTILQMTG FT KDGAALPVDPDLIGKSIEAVVGNKAIESAQSEERCSRYILRVRNPVHVTKL FT LNTTTLLDGTEITIVPHPRLNSSKCVISSFDAIHYSEAEALEKLRSQNVTH FT VKRITRNDNGKLINTPALVLTFNQTTYPNHVKIGLLRIPTRPFYPNPLLCY FT ACFRYGHPKLRCPGPNRCNNCSEEHDGDSCQAPAFCCNCQGDHRPGSRKCP FT VYRKEAAVVKLKVDGNLTYPEARRRIEEGQGSYAQVTAQSRLDASKFEELM FT AENKKKDETISKLLEDNKKKDGMILKLFEEMKTKNSLLESLERKVEALQPY FT FSTPTNKQTDNVALPPSGSEHESRDQQKPVVITTSSLKGISKRKTKKEEEH FT LKQMRSVLNTSPEGSRSPPKKTMKSAISHLELDPMIEYIDEEEIVDISDGD FT PAPESSTSSI" FT CDS 1848..6350 FT /product="I-78_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MKKKLWISLMVIQHQKAVPPPFKFFKTKTQACNITPT FT NAPSQFTAQNAETPIHHISQTNEERLGRGESAVPLPLPQAEAHLRGRTVRE FT AIPQPVDGESLRAALEDNASNPPIVTRTNATPVTSKSNDQIRKSFMGFENP FT RPHPTLTLPGKITRRPREAFQAAVGVGGLISQASTFNKATEPPYVVSPSLS FT VPTHTSEDRLQDHSLRKPDHVQPNQPEEAVLHSSRAPLPSTVVSNSAEYDR FT LFHIQQTASASHTTVTHSDLILQEYPTPPRPEDCESTRMSSSSASTTSTSR FT RTATPLALQWNINGLFNNLSDLQLLTHENPPQVLALQEIHCRKSDGDAFNQ FT LLRRQYKWYIKTGSTRFQNVAIAVRHSAPHTPVQLNTPLIALAVKMEFPFP FT HTVVSIYLPQNGVDELGTLLQNLLNQLEKPVLILGDMNSHHYAWGSSKTDK FT RGSVILQVAEENDLIVLNDGSATFSRCSYQSCIDVSLASCEMLRLLRWTIH FT TDPMGSDHHPIEILSTEVLPKITRRPRWRLDDANWETYEQDLLTSINPEHV FT YDPEQLGELILDAAKANIPRTSSKPGRKALHWWSDVTKAAVKARRKALRRR FT KRTPKDHPDWSNINKEYQLLRNECRDIIRKAKQNSWEEFLEGFDSNQSTTE FT MWRRVNALSGKRRSQGITIRQHDTVSNDPAFVANSIGEYFAKISSQSLYPP FT NFLATQKKSKYPSINIAIPDDTNNHDYNMPFTVEELLFALDSTKSKSAGPD FT ELSYLLFKRLPFRVKISLLESFNNVWTMGLFPNGWRHSLVVPIPKQGTYSS FT DPSHYRPIALTSCASKIMERMVNRRLTLILQDKLDQRQHAFLKGRGTGSYF FT ASFAQIIDDALSNNLHVDIAALDLAKAYNRVWRPQVLRQLIDWGIVGNMGK FT FIQGFLQQRSFQVLIGNTRSKMFQEETGVPQGSVLAVTLFLIAMNSIFDRL FT PKGIFIFVYADDIIIVVVGKSPKIIRRKLQAAVRAVAKWAENVGFTMAAEK FT CSVTHCCTFRHHPWKTPVTIGQCEIPYKKELRILGVIIDRKCNFEAHFNKI FT KKETECRIRLIKAISGRHKSNNRRSLSNIACSIVISKLLYGLEITVRSYID FT LINLLSPVYNRVIRLTSGLLPSSPTLATCVEAGVLPFHYLLTITAANRAIS FT FLEKTYGDSRNVFVLEKAKQLLTENANIRFPPIAQVHRVGDRKWNQPAPTI FT DWSVKRTIRAGDSAARVSAVLNHLLNTKYTSHKKIFTDGSRARGKVGIGVY FT SDNLSVARRVPDEFSVFSSEAAAIRLAINLINDPAVPTVILTDSASVLSAL FT ENPQQKHSIIQEIEAAISSNTTLCWIPGHCGIKGNEQADRLADQGRNSRMW FT SLSIPGIDARRSIQQCVYESWQHQWFMNRDLFLRKIKVDIGKWNDRKLRRE FT QQILSRLRVGHTRVTHPHYISNTCRPICPACAVPITVEHLLVNCTELELDR FT TTINLQTSIAGILQNSEAEETKLIALLKKTDLYTKI" XX SQ Sequence 6456 BP; 2082 A; 1584 C; 1340 G; 1450 T; 0 other; agtgtgcgtg aatagccaaa gcgtaaatac gtcgcgaact tttcacctac gtttttcgta 60 aaattatccg tgactcgtcg tggaaaaatt cttgctccaa ccggacgcaa agatctaccg 120 accgcggaca atttcgctag tgtctctatt ggtcgaatcg accataaatc actatttagt 180 ggttgcaaaa tcggttccac ttgttctaaa caagtgtacg atccgcgtcc attacaacac 240 actcgtcgac accgaacgtt ggtttattcg ttcacagtga tagtgttgtt atcactacaa 300 ctcaacgttt tgaccgtgtg tgactgagtt gtgagtgacc atcgctgtat tcagcgcatc 360 atttgcagct atacgaaaca catagagacc agacgatcga ccgaacagtg ggagaaatcg 420 aattcaaaat tggtcactga ccaatttata ttggggtagg taacaggtga ctgttaccca 480 tcattggctc gttttcgatc tccgagtgca tgaaaaaaac taccgcatag ctgacaagac 540 agaaacaatg tcaactccct acgggggcga ccccgaaggg tcgttgggtc caagataccc 600 gagtttcatg gacccacaaa acgaattcgg tcgactaacg attttgcaaa tgactgggaa 660 agatggagct gctttacccg tggatcctga tttgataggg aagagcattg aagcagtagt 720 gggaaataaa gcgattgaat cggcacagag cgaagaacgt tgctcacgct acattcttcg 780 tgttcggaat ccagttcacg tgaccaaatt gctcaacaca accactctac tcgacggaac 840 ggaaatcacc atagtacctc atcctcggct caacagcagc aaatgcgtaa tctcgtcgtt 900 cgatgccatc cactactcgg aggcagaagc tctcgaaaaa ctgcgcagcc aaaatgtaac 960 acacgtcaaa cgaatcacac ggaacgacaa cggcaaatta atcaacaccc cagctctagt 1020 actgaccttc aatcaaacca cctaccctaa tcacgtgaaa atcggacttc tgcgtatccc 1080 aactcgtccg ttttacccaa atccccttct ttgctatgct tgtttccgat acggccaccc 1140 aaaactacga tgcccaggcc cgaatagatg taacaactgc tctgaggaac acgatggtga 1200 tagctgccaa gcgccggcct tctgttgcaa ctgtcaaggt gatcatcgcc ctggcagtcg 1260 gaaatgtcca gtataccgaa aagaagcagc cgtcgtaaaa ttgaaggtgg atgggaattt 1320 aacctaccca gaggccagaa gacgtatcga ggaaggacaa ggttcatacg cgcaagtcac 1380 tgcccaatct cggttggatg ccagtaaatt cgaagaacta atggctgaga ataaaaagaa 1440 agacgaaacc atttcaaaat tacttgagga caacaagaag aaagatggaa tgatactgaa 1500 actcttcgaa gaaatgaaaa cgaaaaactc gctgttggaa tctcttgagc gaaaggtgga 1560 agcgttacaa ccgtacttca gcaccccgac taataagcaa actgacaatg tagccctgcc 1620 gccgtcgggt tcggaacacg aatcacgcga ccagcagaaa ccagttgtca ttaccacaag 1680 cagtctcaaa ggaatcagca aacgaaagac caagaaagaa gaagaacatt tgaaacaaat 1740 gaggagtgtc ctcaacactt ctcctgaagg ctcaagatct cctcccaaga aaacgatgaa 1800 gtccgccata tcccacttgg aactagaccc aatgatagaa tacatcgatg aagaagaaat 1860 tgtggatatc tctgatggtg atccagcacc agaaagcagt acctcctcca tttaagttct 1920 tcaagacgaa aacacaagca tgtaatatta cccctaccaa cgcacctagt cagttcaccg 1980 cccaaaacgc agaaacccct atacatcaca tcagtcaaac aaatgaggaa aggcttggga 2040 gaggggaaag tgcagtaccc ctcccacttc cacaagcgga agcgcatctg agaggaagaa 2100 ctgtccggga agccataccc caacccgtag acggggaatc cctcagagcg gccttagaag 2160 acaacgcttc aaatccacca atcgtaacga gaacaaatgc aacccctgta accagcaaat 2220 caaacgacca gatccggaaa agtttcatgg gtttcgaaaa ccctaggcct caccccacac 2280 tgaccctgcc tggaaaaatc accagacgtc cccgagaagc attccaggcg gcagtaggtg 2340 tggggggcct gatatctcag gcaagtacat tcaataaagc caccgaacca ccgtatgtag 2400 tctcgccatc cctttcagtc cccacgcata cttcggagga tcgacttcaa gatcactcgc 2460 tccgaaaacc cgaccatgtt cagccaaatc aacccgagga agcagttctg catagcagcc 2520 gagcaccact accatcgaca gtagtatcaa actccgcaga gtacgatcga ctgttccaca 2580 tccaacagac ggcgagtgcc agccatacta cagtcacgca tagtgacctt attctacaag 2640 aatacccaac tccaccaaga cccgaagact gcgaatctac acgaatgtca agctcttcag 2700 catcaacaac ctcgactagt cgcagaactg caactccact ggcgttgcaa tggaacataa 2760 atggactttt caacaacctc agcgacctgc aactattgac gcatgaaaat cccccacaag 2820 ttttagcatt acaggaaatc cactgcagaa aatcggatgg tgatgcattt aatcagcttc 2880 ttcggaggca atacaagtgg tacatcaaaa ctggctcaac tcgctttcag aatgtcgcaa 2940 tcgctgtgcg ccattcagct ccacatactc cagttcaact caatacgcct cttatcgctc 3000 ttgctgtgaa aatggaattt ccctttccac atacagttgt ttccatatat cttccacaaa 3060 acggtgtgga cgaactagga acccttctac aaaatctgct taaccaactt gaaaaaccag 3120 tactaatact aggtgacatg aacagccatc actatgcctg gggatcatcg aaaaccgata 3180 aacgaggaag tgtcatacta caagtagccg aagaaaacga tctaatcgtc cttaacgacg 3240 ggagtgcaac cttttcccgt tgcagttatc aatcttgtat tgacgtttcg ctggcttctt 3300 gtgaaatgct tagactttta agatggacta tccatacaga tccaatgggc agtgaccatc 3360 atccaattga aattctaagt accgaggtct taccaaaaat caccaggcga ccaagatggc 3420 gactcgacga cgcaaactgg gagacttatg agcaggactt actcacctcg ataaatccag 3480 aacacgtgta cgaccctgag cagctaggcg aactaatcct agatgctgct aaggccaaca 3540 ttccaagaac gagcagtaag ccaggaagaa aagcgctaca ttggtggagc gacgtaacga 3600 aagcagcagt gaaagctcgc agaaaagccc tccgcagacg aaaacgaact cccaaggatc 3660 atccagactg gagtaacatt aataaagagt accaactact tcgaaacgaa tgcagagaca 3720 tcatccgaaa agcgaaacag aattcatggg aagaatttct agaaggtttc gatagcaacc 3780 agtccaccac tgaaatgtgg cggagagtta atgctttaag cggaaaaagg agatcacaag 3840 gaatcactat acgacaacac gataccgttt caaatgatcc tgcctttgta gccaacagca 3900 tcggagagta tttcgcaaaa atctcatcgc agtcgctata cccgccgaat ttcttggcca 3960 cccagaaaaa gagcaaatac cctagcatca acatagcaat ccccgatgat accaacaacc 4020 acgattacaa catgcctttt accgtcgaag aattgctatt cgccctcgac tctacaaaaa 4080 gcaaatcagc aggacctgac gaacttagct accttttatt caagcgtcta cccttccggg 4140 tgaagatatc ccttcttgaa agcttcaaca acgtctggac tatgggcctt tttccaaatg 4200 gatggagaca tagtttggtg gtacctatac ccaagcaggg aacatactca agtgatccaa 4260 gtcactatcg accgattgcg ctaaccagtt gcgcatctaa aattatggaa cgtatggtga 4320 ataggcgcct aacgttaata ctccaggata aacttgatca aaggcagcat gcattcctta 4380 aaggtagggg tacgggttca tactttgctt cgtttgccca aataattgac gatgctcttt 4440 cgaataacct ccatgtggat atcgctgctc tggatctcgc aaaagcttac aacagagtat 4500 ggcgcccaca agttctgaga cagctcatag attggggcat cgtcggaaat atgggaaaat 4560 ttattcaagg ctttctgcag cagagatcct ttcaagtgtt gataggaaac acgcgttcaa 4620 aaatgttcca agaggagaca ggagtgccac aaggctccgt cctggcggta acactttttc 4680 taattgccat gaattctatc ttcgatagac taccgaaagg cattttcata tttgtctatg 4740 cagatgacat aataatagtt gtcgttggaa aaagtccgaa aataattcga aggaaactac 4800 aagcagctgt tcgagcggtt gctaaatggg cagaaaacgt tggattcacc atggctgctg 4860 aaaaatgctc tgtaacacac tgctgcacct tcagacatca cccatggaaa actccagtaa 4920 cgataggaca atgtgaaata ccctataaaa aagagctgcg aatactcggt gtcataattg 4980 ataggaaatg taatttcgaa gcgcatttca ataaaataaa gaaggaaacg gagtgcagaa 5040 taagactaat caaagcaata agcggtaggc acaaatcaaa caaccgaaga tctctctcca 5100 atatcgcttg tagcatagtt atcagcaaac tcttgtatgg attagaaatt accgtgcgct 5160 catacataga tttgattaat ctccttagtc cagtatacaa cagagtaatt cgtctcacat 5220 ctggcctact tcccagttct cctactctag ctacatgtgt ggaagctggt gtactacctt 5280 tccactatct tctgacaatt acagcagcta acagggcaat cagctttctt gagaaaacat 5340 atggagacag cagaaatgtt ttcgtcctcg agaaagccaa acaacttctc acagagaatg 5400 cgaacatacg tttcccaccg atagcacaag tccatcgggt tggagacaga aaatggaatc 5460 aaccagcccc aacaattgac tggtcagtga agcggactat tcgagcagga gattcagctg 5520 ctagagtatc ggcggtactc aaccatttgc tgaacactaa gtacaccagt cacaaaaaaa 5580 ttttcacgga cggatctaga gctagaggga aagttggtat tggcgtatac agcgataact 5640 tgagcgtagc tagaagggtt ccagacgagt tctccgtttt ttcttctgag gccgcagcaa 5700 ttcgtcttgc tatcaatcta ataaacgatc ctgcagtccc aaccgttatc ctcactgact 5760 cagccagtgt tctttcagca ttagagaacc cacaacagaa acattctatc atccaagaaa 5820 tcgaggctgc aatctcatcc aacaccacgc tctgctggat acctggacat tgtggtatca 5880 aaggtaacga gcaggcagac cgccttgctg accaagggag aaactctaga atgtggagcc 5940 taagcattcc aggaattgat gcacgacgaa gtatccaaca atgtgtctac gaatcgtggc 6000 agcaccaatg gttcatgaat cgtgaccttt ttcttcggaa aataaaggta gacataggga 6060 agtggaatga tcgaaaattg cgcagggaac aacaaattct atcgcgactc agagtcggac 6120 acactcgagt aacacatcca cactacattt cgaacacctg caggccaata tgtccggcat 6180 gtgcagtgcc tattactgtg gaacatcttc ttgtaaattg cacggaacta gaattagata 6240 ggactactat aaatttacaa accagcatcg cgggcattct gcaaaactct gaagctgaag 6300 aaacaaaatt gatagcactc ttgaagaaaa ccgatttgta cacaaaaatc taattgatgt 6360 ttatagtaat aagttaacta tataagtatc taaagaggtg aaccagcgtt cggctgaaag 6420 cctcttaaat aaagacaaaa aaaaaaaaaa aaaaaa 6456 // ID PFRP5 repbase; DNA; INV; 1353 BP. XX AC X61386; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Plasmodium falciparum repetitive DNA. XX KW PFRP5; repeated sequence. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RA Wassermann M. and Del Castillo H.; RT "Direct submission."; RL Unpublished. XX DR Genbank; X61386; Positions 1 1353. XX CC At least 60 copies of this repeat occur in the genome. Repeat CC present in 13 chromosomes. Related sequence U06231: Exp. CC Parasitol. 81:165-171 (1995). XX SQ Sequence 1353 BP; 388 A; 291 C; 259 G; 415 T; 0 other; gaattctatg agagtaatct tatctaatat gattggttct gacatagttc tcatatactt 60 tgtctcactt acagcattta acatatactt tctacacata aagtaagaaa gttccgtaga 120 acatattata tagagtagat tatatatggc atactgaggg gtgggcgtga tatattagct 180 tcatgtctca gagtcaaatt ttatacttgc acgactgcat agtctgctct gccgacactc 240 gatctctgtt tctaacatga ctatgcgcac acacatcgta cgttcgacga catgcacata 300 gagcttacac tagcatgctg catctacggc tgcacgagcg acgtgcgtat ctgactgtat 360 gcatggtcct gagtcgatcg gtactctctc tatatctttg gcatctcagt ctattagtac 420 ggagccaaaa tgtatcttca ctctatctag ccatcacgaa cactactgta gtagagtatc 480 gaaccaagtc acgaagcagt cacattagtc actatagtct aatatatcta cgcgttagag 540 ctatatttta ctatactata cgctatgata acactagcta gtctaatcta tctcttcaga 600 gtacaataag tatatctcat tgttaagagg gggtatagtt agtccgaatt agcagagtag 660 ttaggctgct attcattaac gacagtcagt caatcaaact gtctatagcg cagcttcatc 720 tactcaattc atagtagtag tcattgtatc agtgctatac actatctaca aatactagac 780 actgtcgtgc gtggctacgc aacgatatga ttgacttgca cgtcactata tcgtcagtac 840 gtcattatgc tagcagtgtc gctgcaagtg agcatgcagt gaggcagttc gattacatag 900 ttgcagcaag tagtctacgt acatcattga gtcatgctat cagctactca tgcatgcact 960 cgtcatctgt acaatcacag tcacatatat gcaagcagca tgcaatgtga ccgcaatcca 1020 ggaaaatgaa tgcgttttga accattgctg acaacatgca caacgtattg gtgacacatg 1080 ctgtgcactg tgactgatgt gccatgcact gatgagtcca ctgattgtga gtgatgacat 1140 gacaaacatg tgtgtgactg atgatgtgtg caactactgc tctatatttg tgtgtgcact 1200 tcaacatatt gtcgcttcga taggtccatt tgtcgcccct aggcaacact agttgcctgt 1260 cagtcctaca ctaattgaca tgcacttgaa ggcatttcaa ttgacttatt tgatacacac 1320 aatttatatc ataacaatat aattctcgaa ttc 1353 // ID Gypsy-73_AA-LTR repbase; DNA; INV; 187 BP. XX AC supercont1.150; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-73_AA_; KW Gypsy-73_AA-I; Gypsy-73_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.150; Positions 1616262 1616448. XX SQ Sequence 187 BP; 54 A; 40 C; 36 G; 57 T; 0 other; tgtggagaca ccctttgctt gtcgcacaac cctaaacagt atatgttttt ggaccaccct 60 aagagaaaac gtcttcgcca ttgttattgt caccgcgtcg agagttaatg aatcgtaata 120 aacacgttcg gttaataata gttaatttct cgcagtttta ttgattccgg tgagagaaac 180 acctaca 187 // ID DNA8-85_AP repbase; DNA; INV; 466 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-85_AP. XX NM DNA8-85_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-466 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2021-2021 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 466 BP; 142 A; 84 C; 79 G; 161 T; 0 other; cataggcgtg cgcagcccgt tgtggcagag tgggccaatg atacttgaaa caaattattt 60 taggtgttac actgttatct ataaaattac attcatttac tcaccggtca tcacccactg 120 tttaaatttt aaggtataat atatatattt ttattcataa gtagggctag gatttatatg 180 catttgcata ttattttcct taagtccaaa ttgattggtt gtgatatatg tccacgattt 240 accttatcct cgattaaata gaaccatttt ttgtttgcat attttataga tttaagagaa 300 tcttgttaat tttgttgata taagaaaatc aaaaatcaga atgcttatac cactgaataa 360 gacaatacaa ctatttatat gatctaagtt gccatctata ataagtatgg tacaactcgc 420 agggtgggcc acggcccacc tggcccaccc cgtgcgcacg cctatg 466 // ID Mariner-25_HM repbase; DNA; INV; 3418 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-25_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3418 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1959-1959 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1590..3095 FT /product="Mariner-25_HM_1p" FT /translation="MEKMLFGLTSYELRKLAFELAEKNKKAHKFNKELGVA FT GYDWYQGFMLRHAKHLSLRKPEATSAARAMGFNKVAVNKFFELVENVIDIN FT KIDVERVWNVDETGISTVPKSLSKVISTKGKRQVGSLTSAERGQLVTAVVC FT CSASGRYMPPMLIFPRQRMKAELMDGAPPGAWAECHPSGWIQTDLFINWLK FT KFILHTGATKDSPVLLILDGHATHTKSIELIDIARENGVILLCLPPHCTHK FT MQPLDISFMKPLSTFYDHNLRKWLRTNPGRVVTQFQIASLFGASYLDAATM FT TNAINGFKKAGIWPVDRSVFTDADFIAAEVTDMSIITEDTESFVTTDSALT FT TVSAPATKPSDSTSTTEPSTSCTGSTSATEPSTSCRPSTSTTVPSSFAISP FT RHLLPIPKQAQRKRISKQRGKTAILTSSPYKRSLMEAKEKKNPKKNKSVKK FT SNEDTPCLYCEDLYSSSTESWVSCTECHRWAHYSCAGIDERNKKPYFLCEL FT CTSSD*" XX SQ Sequence 3418 BP; 1093 A; 578 C; 628 G; 1119 T; 0 other; ggggaaactg gggtaaaatg gggtagtggg gcaaaatggg gttttgaaaa tacaccccat 60 tcagagcctc agagcatcga aaaaaatatt ggatttgaaa tatcgcgaga agagacgtcg 120 ccatctttta cttttgttca cgttttcttc aagtatcgtc gttgagaggt aaatacaatt 180 ttgtcaactt tttcatctta ttttttaaca ttatatttta agaagcagct tctttgtaat 240 taaagctttg aattaaatat attattgttc ctgaatgata ttatatatta agcaattaag 300 actcttatat attcatgagc tttgctctgc tttgtttttt cgaacagttt tattatgcct 360 cctatgttgg ggtaaaatgg ggtgtctcaa aggggcaaaa tggggtattt ccccattttg 420 ccccggtagt gttatttatc taaattatag aaatgagatg tggtttatat ttaaacgtga 480 ttctattttt ttaaattact aatagactca ataggcctgc tataatttta ttcaataggc 540 ctaggttctt ttaataaata atactaatac ttaatactag tctattttta aaatataatt 600 ttattgttgt taaagttcag ttaaagtatt attatagcct atactataga actataaata 660 taaaataaga aagtctagtt tgagctaagt catctatgtt ggaaatctcg tctaaagagc 720 aattaacggg ttattttgca atagaacttc atacgcacgg gcttagaaga acgtaacgaa 780 cttataaacg tgaaaaacca gtttttaaac aagttttcaa aaccacttgg gaattgcgca 840 tctgcgcaat tctcacgtta tatgtgtagc tattttgtag atttagttag tgattcgcag 900 tcagtggagt ataacgttat aaacgaaata tatatatctt aaacaaatag tgattcattt 960 agtgtctcag ggaaacataa aataaattcc ccacatctaa gtctaagtag tctagctaag 1020 taaaaaaata attctgaaat tttattataa taatagtagg cctatttgtt tagttcttac 1080 tgaatcgtac ttaaattgct gatattaaag ttgttttatt ttattttttc agattattaa 1140 ctttgaaaaa aatgagaaat tacaaaagaa tatctgaacg ccaaacttgg aatgaaaccg 1200 aaatgcagca ggccgtactg tctgtgatca atggtgatca tgggtataaa aaggcagctg 1260 caatgtatgg tgtgccacag acaaccctcg aacgcagagt ggctaaattt aagaaaaatc 1320 ctgatattga ggatgcttgc aaaaagagta agtaaatgga ttaagcacaa tctaaattta 1380 ggaatttttc acatttttaa atattaaatt aaatgccaga aaataggcta tggtatatgt 1440 tctaaataaa tgaatttgta tgaaaagcat ctttatttag ttaattctaa ttctaatttt 1500 taattttaat ttctttttca gcgctaggcg cattcaaaac tatctttaca ttggaagagg 1560 aggcagaact tgttcagtat gttcagcaga tggagaaaat gctttttggc ctaacctcct 1620 atgagctgag aaaacttgcc tttgaactgg ctgaaaaaaa taaaaaggct cataaattta 1680 ataaagagtt gggggtggcc ggttacgact ggtatcaggg ttttatgttg agacatgcca 1740 aacatttgtc actgagaaag cctgaagcga catctgcagc cagagcaatg ggctttaata 1800 aagtagccgt taataaattt ttcgagcttg ttgagaatgt cattgacata aataaaattg 1860 atgtcgaaag agtatggaac gtcgatgaaa caggcatttc tactgtccct aagtcgctgt 1920 caaaagttat ttccacaaaa ggaaaaaggc aagttggctc tcttacatct gcagaaaggg 1980 gtcaacttgt tacagcagta gtttgctgtt cagcaagtgg gagatacatg ccacctatgt 2040 taatctttcc ccgtcagcgc atgaaggcag agttgatgga tggagcacct cctggtgctt 2100 gggcagaatg ccacccaagt ggatggatcc agactgattt atttataaat tggttaaaga 2160 aatttatatt acacacagga gccacaaaag attcaccagt tcttttaata ttagatggac 2220 atgcgacaca cactaaaagc atagagctta ttgacattgc cagagagaat ggtgtcattc 2280 tcctttgttt gcctccacac tgcactcaca aaatgcaacc tcttgacatt tcgtttatga 2340 aacctctgag tacattttat gaccataatt taaggaaatg gttaagaaca aatcctgggc 2400 gtgtggtaac gcaatttcaa attgcatcac tgtttggtgc gtcatatctt gatgctgcta 2460 ccatgactaa tgcaatcaat ggatttaaaa aagcaggtat ttggccagtg gataggtcag 2520 tatttactga tgctgatttc atagcagcgg aagtaacaga catgagcatt attactgagg 2580 atactgaaag ttttgttacc acggattcag cccttaccac tgtctctgct cctgccacta 2640 aaccttcaga tagcacatca accactgaac cttccacttc atgtaccggt agcacatccg 2700 ccactgaacc ttcaacttca tgtaggccta gcacatcaac cactgtacct tcaagctttg 2760 caatctcgcc acgtcatttg ctgcccatcc caaaacaggc ccaaagaaaa cgtatttcta 2820 aacaaagagg aaaaacagct attttgactt cttcgcctta taaaagatca ttaatggaag 2880 ccaaagaaaa aaagaatcca aagaaaaaca aatctgtcaa aaaatcaaat gaagacacac 2940 catgcctcta ttgtgaggat ctttattcca gcagcacaga gagttgggtc tcatgtactg 3000 agtgccacag atgggcccat tattcttgtg caggcattga tgaaagaaac aaaaaacctt 3060 actttttgtg tgagctatgt actagcagtg attgattata tttgtactag cagtgattga 3120 ttatatttgt actagcagtg attgatttat catatttgtt tgagttcctc ctttgacccc 3180 attttgcccc atccattacc ccgttttgcc ccatgcttgg ggtaaaatgg ggtttttatg 3240 ttatttttaa aaaaatgtaa taaatcaatt aatttacatt cttttgactt gctgttttgt 3300 tacaatgttg tctatgttgt attctgatgt ttcaacattt atttgtaatc attttattta 3360 tttttgtgac ttttataggg tttcattctt aagtacccca ttttacccca gtttcccc 3418 // ID BEL-618_AA-LTR repbase; DNA; INV; 487 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-618_AA_; KW Pao_Bel_Ele52; BEL-618_AA-I; BEL-618_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-487 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 487 BP; 105 A; 135 C; 98 G; 148 T; 1 other; tgatgagtgt taaagtaatt tagttaccct attttgtaaa ctgttccaaa tgaaacccac 60 ttttcatctt caccagccgg ccgaccacta ccgtgcgatc tattgtacga gtcgcctctc 120 gattgtttgt catgcttccc attgaataca gaatattgtt aactgcaaac gcacgtgtat 180 ccgcatcttg ttgctctccc gaaaattctc agttttctac tgttaattgc gccgataatt 240 gctcgtaatt agtcgcaagt tgctccgttt aattcccgcg agtgacatca agcccgttgg 300 ccccgaacag tgtcagtggt tgccccctgg agtccctaac gtactccccg tttttggcga 360 ttgatgcccc gttatcggtg tgattgatgc cacctmgcta attgcccaga ttggcgattc 420 cattagtcca cctgtgcaaa cgaatcaaag gatcctgccc gccttctgag ttgcccgtcg 480 cccatca 487 // ID BEL-601_AA-I repbase; DNA; INV; 6251 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-601_AA_; KW BEL-601_AA-LTR; Pao_Bel_Ele80; BEL-601_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6251 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5262-5765] - Integrase core CC 'GTAC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 704..1852 FT /product="BEL-601_AA-I_1p" FT /translation="MFETPLTSRGGAQTEAQSNRDGENQTIEPISEQKKKQ FT KKKFQLRRMNEELKVIKIQSNALERKLQRVHAALVVDAEVPNNNLANKHYL FT QLQLKTIEDAFAEYNTNQKRIYGLDVDEDTHNEAEERYIDFEQRYGELYVI FT ITKLLDDIAQRERPTVAPSASTAATSAVPAVTMHLPPLKVPLPTFDGTYEN FT WYAFKSMFETIMDRYQSESPAIKLYHLRNSLVGKAAGIIDQEIINNNDYAA FT AWATLTERFEDKRLIIDKHIDALFDLPKMCGENAADLRKLIDVCTRNVDAL FT KNLELPVDGLGESMLINRITKKLDGETHKAWELDQAANEMPDYETTMDFLR FT ERCRVLEKIRPPDKRVVKPVKPGRQMVETRSKGTLVAHMI" FT CDS 2562..4682 FT /product="BEL-601_AA-I_2p" FT /translation="MLVVPKITGDLPVTKVDSRAITIPEDIHLADPSFGVP FT DKVDMLLGAEIFFEMLKSGRVQLPNCSAILQETQFGWVLSGSVPEKEPSVL FT HSLCARAEEDIGHLVKRFWEIEAYCDPIAQTQTAEEECLDHFRQTHERTAE FT GRYIVRLPFNDLKQQLGESRTMAEKRFLALERRLDKAPVLKDQYMSFLREY FT ELLGHMELNNNAADKAPQSAYYLPHHYVLKPSSRTTKLRVVFDGSAESTTG FT VSINNTQMIGPTVQNDLISIVSNFRAYKYGITADIPKMYRQVEVHADDRRY FT QRIVHRSNCDQPLQTYDLRTVTYGLASSPFLATMALKQLAIDEGTQYPLAA FT VAVEKSFYIDDVLTGANTLEDALELKTQIIGLLRKGCFDVHKICSNSEAVL FT QDVPQDMRESVVNIEDPSINTVIKTLGVAWRPQEDCFTFVVLVDDAVGSDR FT LTKRMILSQIARIFDPLGFVGPVVTAAKLIMRELWSLNLDWDQPVPSEMAN FT FWMDFRSQLRWLNELKIPRWILADGARSVELHGFADASDLAYGACLYTRIV FT KDDGSAALKLICSKSRILPRKKGKQKEITTPRAELLAALLLSRLAVKQLEA FT LDVKFESVILWSDSQIVLCWLKKSPESLAVYVGNRVHEIRELTNRFTWNYI FT PSKSNPADSISRGVEPKNLQSDELWWHGPPTLRHTDSGFDEPNPLKKISCR FT SYAKLFW" XX SQ Sequence 6251 BP; 1722 A; 1446 C; 1597 G; 1482 T; 4 other; ttctggtcct tacaacccgg atttaactag ttttcggaat gatgtgaccg acttttgtgc 60 gcaaaaagtc ggaattcgcg aagtgaaagt gattttccgg aagatttctt tgggctttgt 120 gttgaatcaa ccgaatgtcg tcggcaaaca gcgacgccat ttgtgattcg gtttgatttc 180 gtgaaaaagt taacgcagca atattttgct ggtaccgcat attggggaag tttcgtcggc 240 atgcttgccg ccaagttgtg acgccagttg gcgaatcgta gtgtgaaaaa ttgcccgtag 300 aggcagaata gaagcttgtg ctttcgctcc gaagaaattt cgcgacgcca tcttggcgaa 360 tcgtgtcgtg tgtgctcttc cggtactcag ctaccaccga cttcagacgc cgtcacggcg 420 atccatgtgt gtgtgcatac ggcgcagcgt cggcttttgt cgacccagaa aaattgtgac 480 gtcattttga cgagtcgtgt ttactgaaaa caagattgat tgtgcttgtg cttgcaataa 540 cgaaaaataa aaagcatttg agaaacgaat cgattgtgtt tttgagcgga aatcgtttca 600 aggccatagt gtagagcagt gtgtgtgcgc gaatgcgttc taagctaaga tggcctatca 660 gtgtttccca accggtttcg attcgagtgt gaagtgaagc aaaatgttcg aaactccgtt 720 gacatcgaga ggtggggcgc aaacagaagc acaatcaaac cgagacggtg agaaccaaac 780 gatcgaaccg atcagtgaac aaaagaaaaa acagaagaag aaatttcaat tgcgaagaat 840 gaatgaagaa ctaaaagtga ttaaaatcca gtcgaatgca ctggagcgaa aattgcagcg 900 ggtgcatgct gcacttgtag tggatgctga ggtgccaaat aacaatttgg caaacaaaca 960 ctacctgcag ctgcagctga agaccataga agatgcgttt gctgaataca atacgaacca 1020 gaagcgcatc tatggtctcg acgtcgacga agatacccac aacgaagcgg aggagaggta 1080 cattgatttc gagcagcgat acggtgagtt gtacgttatt attactaaac tgcttgatga 1140 tattgctcaa cgagagaggc cgacagtggc gccctcggca tcgacagcgg ccacgtcggc 1200 ggtcccagct gtgacgatgc acctgcctcc actgaaagtg ccgctgccta cgtttgacgg 1260 tacttatgag aattggtacg ctttcaaatc aatgttcgaa accatcatgg accgctacca 1320 atcggagtca ccggcgataa agttgtatca tctccgcaac tcgttggtgg gtaaagcagc 1380 cggcatcatc gatcaggaga taattaacaa caacgattat gctgcggcgt gggcgacgtt 1440 gaccgaaaga tttgaggata agcggcttat tatcgacaag catatcgatg ctctcttcga 1500 tcttcccaag atgtgtggtg aaaacgctgc ggacctgcgt aagctgatag atgtttgtac 1560 gaggaatgtt gacgcgctca agaacctaga acttcctgtt gatggactag gtgaaagtat 1620 gctgataaat cggatcacga agaagctaga tggcgagacc cataaggcgt gggagttgga 1680 tcaggctgcc aacgaaatgc cagattatga aactacgatg gacttcttga gagaacgatg 1740 ccgagtgctg gagaaaattc gccctccgga caagcgtgtc gtgaaaccgg tgaaaccagg 1800 aaggcagatg gttgaaacga gatccaaggg taccctagta gcacacatga tataatagat 1860 gcagaataac tattatatga cagcagctat tatgggcagc agcttggtaa ccacatctga 1920 caagtgtcca cagtgctcta gcaaccacga attatggaaa tgcgacgttt tcaagaaggc 1980 gaatctcgcg gatcgctaca gtaccctgcg tcgaatcgga gcgtgtttca actgcctcca 2040 aaaggggcat cgtacggttg attgctcatc tgagcattct tgcaagaaat gtagcaaacg 2100 gcaccacaca acgctgcatc cgaacgacac ccccatgaaa aagaacgatt ctccgccatt 2160 agttgttgga ccatcagacg gcgataaacg aaacgagcag caagcatcga gtcaaaaccg 2220 cgattcatca gttggcgctg gtgckccgaa gcctgamggt ggccaggagc aacaacgttc 2280 taatcttctg cacacggcca caatctggca aacagatatt gctttcgaca gcggtgatcc 2340 tcgtatatgg aagagcgaac aacccttacc cttgtagggt tcttctcgac tcaggctcac 2400 atacaaactt tgtgtctgag cacttcgctg ctctgttggg attaaagaag cagcctgcga 2460 actattcgat tagtgggctg aatgaaacac gaacgaagat tcggttcaag atccacgcta 2520 aattgaaatc ccgattaaca gattacaccg cttgtcttga gatgctagtg gtgccgaaga 2580 taaccggaga tttaccagtt acgaaagtgg attcgagagc gatcaccatc cctgaagata 2640 tccacctagc tgacccgagc tttggtgttc cggataaggt tgatatgctg ttgggcgccg 2700 aaatattctt tgagatgctg aagtccggac gtgttcaact accgaattgt tcagcgatac 2760 tccaggaaac gcagttcggt tgggtcttga gtggttctgt ccccgaaaag gagccttcag 2820 tactccattc actctgcgcg cgagcggaag aagacattgg acatctcgta aagcggtttt 2880 gggaaatcga agcctactgc gaccctattg cgcaaactca aacagctgaa gaggaatgct 2940 tggatcattt tcgacagact catgagcgta ctgcggaagg acggtacatt gtccgtctcc 3000 cgttcaacga cctgaaacaa caactgggcg aatcacgcac gatggctgag aaacgcttcc 3060 tcgccttgga gaggagattg gacaaagcgc ctgtattgaa agatcagtac atgtcgtttc 3120 ttcgcgaata cgaactactt gggcacatgg agttgaacaa taatgcggcc gataaggctc 3180 cacagtcggc atattatctt cctcatcact acgtgttgaa acctagcagc agaacaacaa 3240 aactccgagt agtattcgac gggtcggctg agtctactac tggcgtgtcg atcaacaata 3300 cgcaaatgat tggaccaacc gttcaaaatg atttgattag catcgtatcg aacttcagag 3360 cctacaagta tgggataaca gccgatatac cgaaaatgta ccgccaagtc gaggtacacg 3420 ctgatgacag gcgataccag aggattgtgc atcggtctaa ctgcgatcag ccgctccaga 3480 catacgattt gcggacggtt acgtatggac ttgcatcgtc tccatttctg gccacgatgg 3540 cgctaaagca attagccatt gatgagggga cacaatatcc gttagctgct gtggcagttg 3600 agaagtcatt ctacattgac gacgtgctga ctggggcgaa tacgctagag gacgccttag 3660 agctgaaaac ccagatcatc ggattactgc gcaaaggatg tttcgacgtg cataaaattt 3720 gctctaattc ggaagcagtg cttcaggatg tcccacagga tatgcgagaa agtgttgtga 3780 acattgaaga tccaagcatc aatactgtaa tcaagaccct gggcgtggcc tggcgtccac 3840 aggaagattg tttcaccttt gtcgtgctgg ttgatgacgc cgttggatcg gatcgtctca 3900 cgaagaggat gattctcagc cagatcgcaa ggatttttga cccattgggt tttgttggcc 3960 ctgtggttac agctgcgaaa cttattatgc gggagttatg gtctctcaat ctggattggg 4020 atcaacccgt accgagcgag atggctaact tctggatgga ttttcggagt caactgcgct 4080 ggctcaacga gctgaagata ccgagatgga ttcttgctga cggagctcgc tccgtagaat 4140 tacacgggtt tgccgacgct tcagacctgg cgtatggggc gtgcttgtac accagaatcg 4200 tgaaagacga tggatctgct gcgctgaagc ttatttgcag caagtctagg attctaccac 4260 gaaagaaggg aaagcaaaag gaaataacga caccccgcgc ggagctgttg gctgctctgc 4320 tgttatcaag attggccgtc aagcagctgg aggcgctcga tgtgaaattt gaatccgtta 4380 ttctctggag tgattcacag atcgtattgt gctggctaaa aaaatcgccc gaatctcttg 4440 ctgtatacgt cggaaataga gtccatgaaa ttcgagagct gaccaaccga ttcacttgga 4500 attatatccc ttccaagtcc aacccagctg attccatatc cagaggagtg gagccgaaaa 4560 atctgcaatc tgatgaactg tggtggcatg ggccacccac tctacgacac accgacagtg 4620 gttttgatga acccaaccca ttgaagaaga tcagctgccg gagctacgca aaactgtttt 4680 ggtgacaacg ccggaacatc ctcgattgca gcttttcgat cgaatcagcc gatatccgat 4740 catgcagcgt acaatggcgt atgttgttcg tttttgtgac tatgtgaaga gcggcaggac 4800 aaccattacg aaaggtttac cgacaacatc cgaaatatca cgagcatctg cattgatcat 4860 ccgattggtg cagaaagaat ccttcaaact tgaaatcagc gcgttgcaag ccggtaagga 4920 gttcaacttt tcgactcgca acttaaatcc tttcgtcgac gaggcggatg ggattctccg 4980 tgtaggtgga agattgaaaa actcgtcgct tccatatcat cagaagcatc caccagcctt 5040 gcttcctaag aagcatcctg tgacggttgc attgatacgg tacctgcatc gttcgaacat 5100 gcacatcgga cagcgcagcc tcctcggcat tgttcgacaa aggtattggc cactagacgc 5160 gaggagcact attcgtaaat tagttcacca gtgcattcct tgtttccgga tgaagccaac 5220 gagagccact caactgatgg gaaacttacc ggtackatcg wgttcaaccg tcacctgtat 5280 ttgctaatac gggactcgat tttgccggtc ctttcaaaat acgaccaaat gcgaaaatga 5340 agaatgcacc aacgttgaaa ggctatgtat gtgtctttgt atgtatggcc acccgggcac 5400 tacatttaga ggccgtctcc gacttgacaa ccgaggggtt catgggcgca cttcaacgct 5460 ttgtgagtcg ccgtggagtc gtctcgaagc tgtattcgga caacgcgaca aattttgaag 5520 gcgcaaacaa cgaaatggag cgtctagcta ccctattccg agaagaacaa caccagctga 5580 agctgaacga gttctgcacc caacgagcca tcgactggtc tttcatacca ccacgcagtc 5640 cgcactttgg cggaatatgg gaggccggcg ttaagtcggt gaaatcacac ctaaagatga 5700 ttatggcgga acacaagcta tcgttcgaac aattttcgac agtgttagtg caaatgaggc 5760 tatattgaac tcgaggcctc tgacacaact ttcagatgac ccgaatgatg tcagtgcaat 5820 tacacccgct cacttcctga tcgggcggga atttcaagca atcctggagc catcgtacca 5880 gcacatccct caaggaagat tatcgacgtg gcaactggtc caggacttga agcaacggtt 5940 ttggaaggct tggacgcatg attacttgca cgagttacag aagctacaga gagatttcaa 6000 ggtgaccaag ttccaagtgg gcgccttagt tctcatcgtc gacgagaaca gcccaccact 6060 tcactggcag ttggctcgaa tcgtggaact acatcaaggc agcgacggcc acactcgcgt 6120 ggttactttg cgtacaaagg acggaactac gaaaagggcc gtaaagaaga tttgcctact 6180 gcccctggac aatgagcgcg accgttcagc cgagtagttt tgaaattctg caatttcaac 6240 ggccggcagg a 6251 // ID hATx-9_SM repbase; DNA; INV; 2961 BP. XX AC . XX DT 10-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-9_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2961 RA Jurka J.; RT "A distinct, diverse family of hAT transposons from Schmidtea RT mediterranea."; RL Repbase Reports 8(2), 25-25 (2008). XX DR [1] (Consensus) XX CC This is yet another family related to hATx/hATm diverse families. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 423..2648 FT /product="hATx-9_SM_1p" FT /translation="MMNTSNVATRSDTRNSVFGQPNKLPDAQLPTKEEVFK FT CFLWHRQAKTTTTREIVKVVVSDVLSIWEKASIPTIAYENVLQSAERLIEK FT GKELQKYSLARRKTKAFCEKENRFKELFDICSCKCVDKGFVDRAKCQCSVK FT ISILEWNFWIDQKTQRKMIIGSVDAEVSSQLQRRMERKEKVEKRRQEHTKK FT GYVNDVADSSETALDYAVPHKEPASSSSSNNDSDSLSEETIIQNRLKYPQL FT SEMMERTGISNRDACKIINACLKDMKLDTPDNLIEPSKLRRQRIYWRKKAI FT ESRALVLSNLLCIGFDGRIDETRLVEDGSRKRKKEDHYVIVSYPGEHYVDH FT VSPQSGKANDITTELLSVIHETNSTKTLCAILCDGTNVNTGEHKGVIRHIE FT TVLQHPVQWLICMLHLNELPFREVFKRIDGETCGPHSFQGNVGSKLNYDPR FT TLPLVDFEPIHGYVQDIDDDIKNDLSDDQQFLLRACLAIQQGKNETNQNDL FT KFLATASPGPLNHARWLTCANRVLRLYMSTQEPSTQLQKLTTFIVQVYASS FT WFQIKSHPLCFDGSRNFFYIIQKSRYLTDSLMRDAVEKTLSRNSYFAHPEN FT ILLAALTDNNLCIRKDAANKIMLARQIMQGEAQIRKFSKTQIFINFDATTY FT YDMIDWEKSAVTPPPILESLPSEELLQQVELGPISFPSIPCHTQAVERAVK FT EVTRASSKLYGHEARHGMIVAAEESRRRLKNWKRKNILINNK" XX SQ Sequence 2961 BP; 995 A; 546 C; 572 G; 847 T; 1 other; ggtggggcga aaatccgaac ctatatcatt tcagcgatta atctttgcct taagggctag 60 taaaatcatt tgacatagtc attataaaaa attgatgttt aagggtctcg cactttgttc 120 aaagtttcgt gcatgtgtga gtgatatcat attacagagc gagatcgtta ttttcaccat 180 tatctgtttg tcacttgtac aaaattaaag ttaagtacat aaaagcagtt gctttattta 240 cttggataac gcatgaatga aatgtatgac agaaagtgca ttggttgatg gtgaagggga 300 gtcacatgtc cgacacacac tcacgtttct cataagtaat tggtttgtct atctttctac 360 ctattctact ttgacatcaa acattgaggc agcgttacct tttctcttat agattattac 420 tcatgatgaa tacttcaaat gtagccacaa gatcagacac aagaaattct gtctttggtc 480 agccaaacaa actaccagat gctcaacttc caacaaaaga agaagttttt aaatgtttct 540 tatggcatag acaagcaaaa acaacaacta ccagagaaat tgtcaaagta gttgtcagcg 600 acgttttgtc tatttgggaa aaagcaagta tacctacgat tgcttatgaa aacgttcttc 660 aaagtgcaga aaggttgatc gaaaagggaa aggaattaca gaagtactca cttgcaagga 720 gaaaaacgaa agcattttgt gaaaaagaaa atcgcttcaa agaactgttt gacatttgtt 780 cgtgcaaatg tgtggataaa ggatttgtcg acagagcgaa atgtcaatgc tctgttaaaa 840 tctcaatact tgaatggaac ttttggatag atcaaaagac tcagaggaaa atgattattg 900 gctcggtaga tgcagaagta tcatcccaat tgcaaagaag aatggaaaga aaagagaaag 960 tggaaaagcg tagacaagaa cacacaaaga aaggttatgt gaatgatgtg gcagattcaa 1020 gtgaaaccgc tttagattat gcagtgccgc acaaggaacc agcttcgtct tcttcttcaa 1080 acaatgactc tgactcatta tctgaagaaa ctatcattca aaacagactc aaatatcctc 1140 aattgagcga gatgatggag agaactggga taagcaatcg cgatgcgtgc aagatcataa 1200 atgcttgttt gaaagatatg aaactagaca ctccagataa tttaatcgaa ccaagcaaat 1260 taagaaggca gcgcatatat tggagaaaga aggcgataga aagtcgcgct cttgtgctct 1320 caaatttact ttgcattggc tttgatggaa gaattgatga gacaagacta gtggaagatg 1380 gttctcgaaa aaggaaaaag gaagatcatt atgttatagt atcttatcct ggggagcact 1440 atgtcgatca tgtatcacca caaagcggga aggcaaatga tatcaccact gaacttctct 1500 cagtcattca tgaaaccaat tccaccaaaa cgctttgtgc cattttatgt gatggtacaa 1560 atgtcaatac tggagaacac aaaggagtca ttcggcacat agaaacagtg ctacagcatc 1620 ctgtccaatg gttgatttgc atgttgcatt tgaacgagtt accatttcgc gaagttttca 1680 aaagaataga cggggaaaca tgtggaccac atagtttcca gggaaatgta ggaagtaaat 1740 tgaattatga tccaaggaca ttaccactcg ttgatttcga gcctattcac ggctatgtac 1800 aggacattga tgatgatatt aagaatgacc tgagcgacga ccagcaattt ttgttgcgcg 1860 cttgtcttgc aatacaacaa ggcaagaatg aaacaaatca gaatgatctc aaatttcttg 1920 ctacagcttc tcctggtcct cttaatcatg caaggtggct gacatgcgct aacagagtgc 1980 ttcgcttgta catgagcacc caagaacctt caactcagct tcaaaaatta actaccttca 2040 ttgtccaagt ttatgcttca agttggttcc aaataaaatc tcacccgttg tgctttgacg 2100 gctcaagaaa cttcttctat atcattcaaa aaagtcgata tttgactgac agtcttatgc 2160 gtgacgctgt tgaaaaaaca ttatcaagaa acagttattt tgctcatcct gagaacattc 2220 tccttgctgc cttgacggat aacaatctct gcattagaaa agacgctgcc aataagatta 2280 tgttagcaag acagataatg caaggtgaag ctcaaatcag aaaattctcg aaaactcaga 2340 tctttatcaa ctttgacgcc accacttact atgacatgat tgattgggaa aaatcagctg 2400 tgactcctcc accaatccta gaatctttgc caagtgaaga acttttacaa caggtggaac 2460 ttggtccgat ttcatttcct tcaattccgt gtcataccca agcagtcgaa agagctgtca 2520 aagaagtcac cagagcttca tcgaaacttt atggtcatga agcgcgtcat ggaatgattg 2580 ttgccgcaga agaatcmaga agaaggttaa aaaactggaa gcgaaaaaac attttaatca 2640 ataacaaatg aatattcata tgctcgtcaa caatttattt cttggcatgg aacttccatg 2700 gatacttttt tacgatttct tcttggttgc atatgtagca ctcacttgtc atttgaatat 2760 ttgttaatat atactgttat ttggtgaata aaataagatt tcatttacat ccaattttac 2820 tcaattttcg gtgaaaatgt gggaccccta aataaaagtt tacacacgta aaattttgca 2880 tatgaactta tctcacctat tagcaacttt tccagactga aatgattagc gattcaaact 2940 ttcgtatact ttcgccccac c 2961 // ID I_Ele33 repbase; DNA; INV; 6248 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele33. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6248 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6248 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 7 sequences with >99% identity, and ~100% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 370..1674 FT /product="I_Ele33_1p" FT /translation="MEGHSRPPPCDPPDLARCPAWMIGDDELLHTIVLQMK FT VRPNDNEGANEVRLPTDPFVVGNAVLLALGTANARKVSASKEARGARYILR FT TKSKIFSEKLQKITELPDGTPVEITPHPTLNFVQGIVYDLDTVNHTEENIL FT TNLQSQGVCKVRRIKKRDGETYRNTPLLVLSFQGSVVPQHVYFGLLRIAVR FT TYYPSPLLCFRCANYGHTKKTCDNAKFPQICLNCSESHTDVVECPNPPYCK FT NCQGSHKPISRVCPIYREEDAIIHTKVDRNLSYAEARADWRAANKSRSYAN FT TVQDRLRQDDSQKDKIIKMLQEEVESLRNVILELKTQIANLSNNNQPNVST FT STQAQVRKDNSSTGKAPTVALTRLCAMEQYIKAHSVEKPQRSSSPNSNQSI FT DLNESMEFETTTHKKRKGNKNKSEPDSPERKKGVASSTKRK" FT CDS 1677..6182 FT /product="I_Ele33_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTQEPSFRKQNFAIEPDTIFGLSEHSMINNSTNNTNL FT HSTTASTFNNVIYATNTDTQTEQFSKTTIMEHKASSRFIAPSRSNSDSPSA FT RLPTSALTRDVPDESDEPLAAADVDCLLTQASTNTVFFTDENFRCLQPIPG FT LSHDECNSQAPSPNSCYRTTAVRIASDSPSARLSTSALTRDVPDISDEPLA FT AADVACLYPQASKIFHSHHEIVPPRMRFFPDPTQLDALASLPREALVNAEL FT YDHFRQVFVLPSTSSCNNRSGVAISPSNDSRNTANAENPTSQRDQETVQST FT HDNFLGIVVPHASQKQLMLQWNIRGLWANHAELSNYVIQKKPIVVNLQEMK FT TSNTEIALKGKYDWKLADHNIRNGKGIVGLGVLKEVPHCFYDLHSTIPICA FT VRLHSPYNVTIVAFYFPNNANVKDIIDGLNLVLETTDPPYVIGGDANAAHE FT AWGSSKTSVRGRSLLSWIVDNDFIVLNNGAPTFLSDTHCSFSSIDITFASK FT CLAPRYLWEVLEDTMGSDHFPIITYLDATTAKEGCRKRWLYKDANWKHFEQ FT CIDQLLQQNCDLDIVQLGNLIREAAEMSIPKSSGKLNGKSEIWWTKEVDTA FT VKLRRKALRAMKRLNADDPLKEVLKKKFQIARANARKVILRAKQASWEQFC FT ETFTPSTSSSQIWTNFNRLCGKKKAVAKTMIINKTHVTDPNEIVEHFAEVF FT ELNSKANYNISNEHAEAGTNEQSFCTVEPSVLDNNFTMQELLRAIDSAKGY FT SAGIDEVGYPMIRHLPLHGKMRMLMAYNTLWNEGRFPEKWKEGIVIPIPKP FT GQPQQHAESYRPITLLSCIGKIYERLINHRLMVHLESNDLLNQNQHAFRTG FT KGTTTYFADLNEIMTDALKRKLHCEFALLDLRKAYDTTWRPHIIQRVRELN FT LGINMTKSIESFLKDRKFRVSFGGALSSSKTQEDGVPQGSVLSVTLFIIAM FT DTVFNVIPSGVTALIYADDILLVTVGKKVSSTRKRLSLAVKAVCEWAKWTH FT FEISPNKSSLLHVCREKHRNWKKNRKEIKINGESIPDVKTARILGVWIDSR FT VTFKKHFKNTLNAIQSRINFLKAIASRANREVIWKIANATCISKLLYGIEL FT FGLRCIEPYQSSFNQLIRLASSAFRTSPSLALAIESGELPFKERVALAYIR FT SYCKIEEKSHQYRTAFRENIEQLSTTFFGEAFPQISKLHRFGQRPWYYARP FT KIDWTIKRNFKVGEGAHKARAFVNGLLNSEYKQYTRIYTDGSKDEVGVGCG FT VIGVDFIIEGQLPSSSSVFSAEAAALAIAVYHADNTPTVILTDSASCLSAL FT EKGNILHPYIQAIETFSENKHVVFLWIPGHCDISGNVKADQAAKRGRTGLA FT LEFEVPAKDLILWAKHKMKDIVQSRWIGNTTDQLALVKRSIGKWNDKMKKS FT DQRVLTRCRIGHTRLTKCHLFSKNDPDICDLCNSSLDVKHILLECRKFDDI FT RRRLGISSNISIALSNDQKEENKIIKYLKNTGLYNQL" XX SQ Sequence 6248 BP; 1974 A; 1315 C; 1331 G; 1628 T; 0 other; cagtgaatta ttacaactcg aaccatacgg tcgtgtatgc tatttcactt agcaaaatat 60 taccgataat tcgcgtgtaa acggtcggaa ttattcgcgg ttttcagcaa attgtgctta 120 ttagtaatat tccatcacat tcgaagtgct aaccatcgat tttgagactt tttggtgtag 180 ttttgcgata aattgctagt gtttttgttt actgaaaagg tggcgtgctt gactagcaca 240 gatcccgttg gatttgtggg ctctcgtcgt tgagttattt tcagtggatt gtaggtactg 300 tctgggacca tccaaaagaa gcgagcgatt gtagtgttgt agtgagttgt tgtatggaag 360 gaaggcctga tggaaggcca ctctagacca ccaccgtgtg atccaccgga cctcgctcga 420 tgcccggcat ggatgattgg ggatgatgag ttactgcata caattgtact gcaaatgaaa 480 gtgagaccca acgataatga aggcgctaat gaggttcgac tccctaccga cccgtttgtt 540 gttggcaatg cagtgctttt ggcactagga acagcaaatg ctcgaaaagt gtcagcatcg 600 aaagaggctc gaggtgcaag atacattttg cgtaccaaat ccaaaatctt cagtgaaaaa 660 cttcagaaga taactgagtt acctgacggt accccagttg agatcactcc tcatccaacg 720 cttaattttg tgcagggaat tgtgtatgac ctagacacag tcaaccacac cgaggaaaat 780 atcctgacaa atctacagtc gcagggcgtc tgtaaggttc ggcgtattaa gaaacgtgat 840 ggtgaaacct accgaaacac gccacttttg gttctatcgt ttcaaggatc tgtggtgcca 900 cagcatgttt attttggact gctgcggata gccgtgagaa cgtactatcc gtcgccatta 960 ctctgttttc gttgcgccaa ctacggacat acaaaaaaga catgcgacaa cgccaagttc 1020 cctcaaatct gtttaaactg ttccgagtct catacagatg ttgttgaatg ccctaacccc 1080 ccttattgta aaaattgtca agggagtcac aaaccaattt cgcgggtttg tccgatttat 1140 cgggaagagg atgcaatcat tcatactaaa gtggatcgca atctgtcgta tgcggaggct 1200 cgagctgatt ggcgtgcagc caacaaatct cgttcttatg ccaatacagt tcaagatcga 1260 ttgcgtcaag acgactctca gaaggataaa attattaaaa tgctgcagga agaggtagaa 1320 tctctgcgta acgtaatcct tgagctcaaa acacaaatag ccaacctcag caacaacaac 1380 caacccaacg tttcaacaag cactcaagca caagtccgca aagacaactc gtcgactggt 1440 aaggctccta ctgtagctct gacacgattg tgtgccatgg aacaatatat aaaagcacac 1500 agcgttgaaa aaccacaacg atcatccagt cctaactcga accaaagtat agatctgaac 1560 gaaagtatgg aatttgaaac gactacgcat aaaaaacgga aagggaacaa aaataaatct 1620 gaaccggact ctccagaacg taaaaaaggg gtcgcatcca gtaccaaaag aaaataatga 1680 cccaagaacc atcattcaga aagcaaaact tcgctattga gccggacacg atttttggac 1740 tttcggaaca ctctatgata aataactcaa ccaacaatac aaatttacac tcaactactg 1800 catcgacgtt taataatgta atttatgcaa cgaatactga cacgcaaacc gaacaatttt 1860 cgaaaactac aattatggaa cacaaggctt cttcaagatt cattgctcca tcaaggagca 1920 attctgattc gccaagcgcg aggcttccca cgtcagccct gaccagagat gttccggacg 1980 agtccgatga gcctctggcg gcagctgacg tggattgcct actcacacag gcaagtacaa 2040 acacggtgtt ttttacggac gagaactttc ggtgtcttca acctattcca ggcctttctc 2100 atgatgaatg caactcacaa gcaccttctc cgaactcctg ctaccgaacc actgcagtaa 2160 ggatagcatc tgattcgcca agcgcgaggc tctccacgtc ggccctgacc agggatgttc 2220 cagatatatc tgatgagccc ctggcggcag ccgacgtggc ttgcctctat ccacaggcaa 2280 gtaaaatatt ccattctcac catgaaatag tccctcctcg tatgcggttt tttccagatc 2340 cgacacagct tgacgcatta gcttctcttc ctcgagaggc gttggtcaat gccgagttat 2400 acgatcattt ccgacaggta ttcgttcttc cttcaaccag cagttgtaat aatcgtagcg 2460 gagtggcaat atcaccctcg aacgatagca gaaacaccgc caatgcagag aatcccacat 2520 cgcaacgtga tcaggagacg gttcagtcaa ctcatgacaa tttcttggga attgttgtac 2580 cgcacgcctc tcaaaagcaa ttaatgcttc aatggaatat aagaggtcta tgggccaatc 2640 acgcagaatt gtccaattat gtcattcaga agaagccgat cgttgtgaat ctccaagaaa 2700 tgaagacatc aaacactgaa atcgcactaa aaggcaagta tgactggaag ctggccgacc 2760 acaatattcg caacggaaaa ggaattgtag gattgggtgt tctgaaggaa gtgcctcatt 2820 gtttctacga tttgcactcc accatcccga tatgtgctgt tcggttacat agcccgtata 2880 atgtgacaat tgttgcgttt tattttccaa acaatgcaaa cgttaaggat attattgatg 2940 ggctgaatct ggtcctggaa accacggatc ctccgtatgt cataggcggt gacgcaaatg 3000 ctgctcacga ggcttggggc agctccaaaa catcggtgag ggggcggtcg ttactgtctt 3060 ggattgtgga taacgacttt atagtgctaa ataatggtgc tccaactttt ctaagtgata 3120 cgcattgctc tttttcttct atcgacataa cttttgcttc caaatgtctc gctcccagat 3180 atctttggga ggttttggag gacacaatgg gaagcgatca ttttcccatt ataacttatc 3240 tcgacgcaac tactgcaaaa gaaggttgcc gtaaacgatg gctatacaaa gatgccaatt 3300 ggaaacattt tgaacaatgc atagaccaac tccttcaaca aaattgtgat ttggacattg 3360 tgcaattagg taatctcatt agggaagccg cagaaatgtc aattccaaaa tcttctggga 3420 aactaaatgg gaaatccgaa atttggtgga ctaaggaagt cgatacagct gtcaagctac 3480 gccggaaggc tctacgagcc atgaaacgtc tgaatgctga cgatccgtta aaagaagttt 3540 tgaagaaaaa gtttcaaata gcaagagcca atgcccgaaa agttatccta agagctaaac 3600 aagcttcttg ggaacaattt tgcgaaacat ttacgccttc aacaagttca tcacaaattt 3660 ggacaaactt caacagactt tgtggaaaga aaaaagccgt tgctaaaact atgatcatca 3720 ataaaacgca cgttaccgat ccaaatgaaa tcgtggagca ctttgcggaa gttttcgaat 3780 taaattccaa agcaaactac aacatttcta acgaacatgc tgaggctgga acaaatgaac 3840 aatctttctg taccgtagaa ccatctgttt tggataataa tttcacaatg caagaattgc 3900 ttcgtgcaat cgattcagca aaagggtact cagcggggat cgacgaagtg ggctatccaa 3960 tgatcaggca tttgccgctg cacggaaaaa tgcgaatgtt gatggcatac aacacattat 4020 ggaacgaagg tcgttttcct gagaaatgga aggaaggtat agtgatacca attcctaaac 4080 ctggacaacc acaacaacat gcagaaagtt atcgtccaat aactctcctt agttgcattg 4140 gcaaaattta cgaacgacta atcaatcatc gactcatggt tcatttagaa tcaaatgatc 4200 tcttgaacca aaaccagcat gcgtttcgaa ctggaaaggg aaccactacc tattttgctg 4260 acctgaatga gatcatgacc gatgctctca aaagaaaact tcattgtgag tttgccctgt 4320 tggatctacg aaaagcttac gacacgacat ggcgtcctca tattatccag cgagtacgag 4380 aactgaatct agggatcaat atgacgaaaa gtattgaaag cttcttgaaa gataggaaat 4440 tccgtgtcag ttttggggga gcactatctt cttcaaagac tcaagaagat ggagtcccac 4500 aagggtcggt gttgtccgtc acactcttca ttatagcaat ggacaccgtc ttcaacgtga 4560 tcccatcagg ggtgacagca ctcatttacg ccgatgatat tttattagtc actgtgggta 4620 agaaggtcag cagtaccaga aagcgtctaa gccttgctgt taaagccgtt tgtgagtggg 4680 caaaatggac tcattttgaa atctcaccca acaagtcttc acttctgcat gtttgtcgag 4740 aaaaacacag aaattggaag aaaaacagga aggaaattaa gattaatgga gagagcatac 4800 ctgacgtcaa aacagcgagg atactgggag tttggattga cagccgagta actttcaaaa 4860 agcattttaa aaatactttg aatgcaatac aaagtcggat caactttctt aaagcaatag 4920 caagtcgtgc aaacagggaa gtgatctgga aaatagctaa tgcaacttgt atctccaaat 4980 tactttacgg gatcgaactc ttcggtttaa gatgtattga gccataccaa tcatctttca 5040 atcaactgat tcgcttggcc tctagtgcgt ttaggacttc accatctctg gctttggcta 5100 ttgagagtgg ggaacttcct ttcaaagaaa gggttgcatt agcgtacatt cgtagctact 5160 gtaaaattga agaaaaaagc caccaatatc gaacagcctt tcgcgaaaac atcgagcaac 5220 tttcaacaac cttttttggg gaagctttcc ctcaaatcag taaactccat cgttttggac 5280 agcgaccttg gtattatgct cgaccaaaaa ttgattggac gataaaaagg aatttcaaag 5340 ttggagaagg agctcataaa gctcgtgctt tcgttaacgg acttttgaac agtgaatata 5400 agcagtatac gagaatttac accgatggat ctaaagatga agtaggtgtc ggatgtggtg 5460 taattggtgt cgatttcatc atcgaaggtc agcttccttc cagcagttcc gtattttcag 5520 cagaagcggc cgcattagca attgccgtgt accacgcgga taacacacca acagttattt 5580 taacagattc tgcaagctgt ctttctgctc tcgaaaaagg aaacatcctc catccgtaca 5640 ttcaggcgat tgagacattt tccgaaaaca aacatgtagt ttttctttgg attcctggtc 5700 attgtgacat atctggaaat gtaaaagccg atcaagcagc aaaaaggggt agaacaggac 5760 tagctttaga atttgaagta cctgcgaaag atttaatatt gtgggctaag cataagatga 5820 aagatattgt gcaatctaga tggatcggta atactaccga tcaattagct ttagtaaaga 5880 gaagtattgg aaaatggaat gacaaaatga agaagagcga tcaacgagtg ctcaccaggt 5940 gtaggattgg tcatacgcgc ttaacaaaat gccacttgtt ttcaaaaaat gatcccgata 6000 tttgtgattt gtgtaacagt tctctagatg ttaaacatat tttattggaa tgccgaaaat 6060 ttgatgatat cagaaggaga ttaggaatca gctcgaatat tagtatagct ctaagtaatg 6120 accaaaaaga agaaaacaaa attataaagt atctcaaaaa taccggatta tataaccaac 6180 tttaaactaa aacagaggcg aatgaattgt attttaaagc ctctttaata aataaaaaaa 6240 aaaaaaaa 6248 // ID CR1-3_CQ repbase; DNA; INV; 4443 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4443 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 3-3 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 1..660 FT /product="CR1-3_CQ_1p" FT /translation="HRSCIDGLNRTAFEAIGKFQKNCYWLCDVCAGRFDQF FT VQSMDVDDDPSPATDVSKLSEAVEKLSGIVNELSGQMKEKSSKKSFADVVF FT PGSKREREDDVNNEPPAKIKTVCGTRTIQHKIKTVVNERELFWVYLGRLDP FT CHTDEEIAEMTQECLDLPEXPKVKRXVKKDADTSKLSVVSFRVLLPDDLRD FT TALQPDTWPTGVTVREFDFDLRPSPRFRQL" FT CDS 788..4390 FT /product="CR1-3_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MRPGPVYGHGNGAFQAALAGKYIHSTRTNPRLDASLH FT YSELPTADKVMPSPEFDADIGLGRMYRGIKVDPRFPVTTEPLTASVASSSS FT HLSRPGLVNGTRSGVNYSLLSGKYLHFPSTIPPPDDPLVSSHFHPSHAEHD FT QPLPLIGPRQQNLLSAVAEVADHPGRTYHGNVECPRSPVTVAPSVSSGEYV FT CSSSPSRPGAVCGQGCGGFRPASSGEYTCIDPNSPALNDCTLQRPAVPNGV FT ISVYYQNVRGLRTKIDDMYLASQDCGHDVIILTETNLKDDITSLQLFGSAF FT NVFRCDRSRKNSEKQSFGGVLIAVSKHYSNAHEVETARGNELEQLCVAASI FT QGENFLFCAIYVPPEKRSDVNLINEHIATIDELRGTKCADHTTVVCGDYNQ FT SHVQWSLIGSIARPTSPLTAASTMALVDGMDYLNLCQTNLVHNCFGRILDL FT VFCSSQRSVAVDVAASALALPVDSHHPPLVCHFPIFKRPQERRGNSANPAL FT RLLNFRKIDFAAMTTFLLSVDWSLCLTNLGVDEMANTFCSIIRNWMNQHVP FT TVKPRVSPAWSNSRLRQMKRLKNACQRALRRRRSVVNKMKFRRSSDAYRKL FT NSTLYKSYVLRVQTNLRRNPRGFWSFVNAKRKSKDIPDSVFLDQHESGSPE FT MSCDLFAQHFKSVFAAGCATDEEAAVAANNVPENLLDLSTFDVSADMVKQA FT LAKLKSSFSPGPDGIPSVVYRRCADALVAPLSVIFNKSFSQQTFPSTWKQS FT FIVPVHKSGDKRNVRNYRGVTNLSAASKLFEIIVSKVIQRQAMRYLSDDQH FT GFIPKRSVTTNLLEFTSTCIGHLENKAQVDVIYTDLKAAFDRIDHNVLLRK FT LSRLGLSVQLVNWLRSYLVGRSLRVRLGTSVSAVFVNNSGVPQGSNLGPLL FT FILFFNDVLFLLGPGCALVYADDLKLFLPIKATDDCCRLQALLDIFVSWCR FT RNKLVVNISKCLVMSFTRSKRMISFDYSIDGVVLQKVDQVRDLGVLMDPKL FT TFDLQRVSVIAKANRQLGFISKISKDFRDPYCWKSLYCSLVRPILEYAAIV FT WLPYQLTWILRIERVQKRFIRRALRSLPWRDPVNLPPYPARCRLLNIQPLQ FT SRRTTQQATFVAKLLNGEVDSPNLLSLINFRTPQRTLRDSTLLARRFHRTA FT YGFNEPVSSMIRAFTAVEDLFEFGESSVRFRDKINRTQTFIH" XX SQ Sequence 4443 BP; 1063 A; 1193 C; 1066 G; 1119 T; 2 other; cacagatcgt gcattgacgg tctcaaccgt actgccttcg aggccatcgg caagttccag 60 aaaaattgct actggctttg cgacgtttgt gctggtcgtt tcgatcaatt tgtacagtcg 120 atggatgttg atgacgatcc ttcacctgcg actgatgttt ctaagctgag tgaagctgta 180 gaaaaactta gcggaatcgt gaacgagctc tccggtcaga tgaaggaaaa gagctcgaaa 240 aaaagtttcg ctgatgttgt cttccctggt tcgaaacgcg aacgagaaga cgacgtaaac 300 aatgaacctc cagcgaagat caaaacagtt tgtggtacgc gcaccatcca gcacaagatc 360 aagacggtgg tcaatgaaag ggagttgttt tgggtttacc ttggccgact tgatccctgt 420 cacaccgatg aagaaattgc cgaaatgact caggaatgcc tcgacctccc tgaacmaccc 480 aaggtgaaaa ggcwcgtcaa gaaagacgcg gacacctcca agctgtccgt cgtgtcgttc 540 cgggttctgc ttcccgatga cttacgggac actgcacttc aaccggacac ctggccgact 600 ggagtgacgg ttcgcgagtt cgacttcgat ctgcgaccct cacctagatt tcgacagctg 660 tgagcatcac ctcatctatg ttgctgacct gggacgttat cacaaaagcc gtttggaagt 720 gccccgtgcc cccgtcacag ctgtgccaca ccattcttca tcaccaagct attgttcctg 780 ctattgtatg cgtcccggcc ctgtgtacgg ccacggcaac ggggccttcc aagccgctct 840 agcaggcaag tacattcatt cgacaagaac caatccgcgc cttgatgcgt cgctacatta 900 cagtgagctg ccaaccgccg acaaagttat gccgagtcca gagtttgacg cagatattgg 960 tctgggacgc atgtaccgag gcattaaggt tgatccccgc ttccccgtca caactgagcc 1020 cctcaccgct tccgtcgctt cgagctccag ccatctcagt cgtcccggcc ttgtgaacgg 1080 gaccagaagc ggggtcaact acagcctgct ctcaggcaag tacttgcact ttcccagtac 1140 aattccgccc cctgatgatc ctttggtttc cagccatttc cacccatctc atgccgagca 1200 cgatcaaccg ttgccgctca tcggtcctcg ccagcaaaat cttctcagtg ctgtcgctga 1260 agttgctgat catccgggac gcacgtacca cggcaatgtg gaatgccccc gctcccccgt 1320 cacagtcgcg ccatccgtca gttccggtga atatgtctgt tcgagctcac ctagtcgtcc 1380 cggcgctgtg tgcggtcaag gatgcggggg cttccggcct gccagctcag gcgagtacac 1440 atgcattgat cccaactctc ccgcactcaa tgattgcact ttgcagcggc ctgctgttcc 1500 gaatggcgta atcagcgttt attaccaaaa cgtaagaggc ctcagaacga agattgatga 1560 catgtactta gcatcgcaag attgtgggca cgatgtaatc attctgacgg agacgaattt 1620 gaaagacgac atcacatcac tgcaactctt cggatcagca ttcaacgtgt ttcgttgcga 1680 ccggagccgg aagaacagcg agaagcagag ctttggcgga gtcttgatcg cagtatctaa 1740 acactacagc aatgcacacg aagtcgaaac agctcggggc aatgaacttg agcaattgtg 1800 tgtagcagcg agcattcagg gcgaaaattt tctattctgt gccatttatg ttccacctga 1860 gaagagatct gatgtcaatc ttatcaatga gcacattgcg accatcgatg agttgcgagg 1920 tacgaagtgc gcagatcaca caaccgttgt ctgtggcgac tataatcaat cccatgtcca 1980 atggtcgctg attggcagca tagctcgccc cacgagcccc ctgactgccg catccaccat 2040 ggctttggtc gatggaatgg attatctcaa cctgtgtcaa actaatctgg tacataattg 2100 ctttgggcgg attctagacc tcgtcttttg ctcgtcgcag agatccgtcg ctgtggatgt 2160 tgctgcatct gctctggcgc taccggttga ttcgcatcat cctcccctag tctgccactt 2220 tcccatcttc aagcgacctc aggaacgcag aggcaactcg gcgaacccag cgctgcgtct 2280 gctaaatttc cgcaaaatcg atttcgccgc tatgacgaca ttcttgctca gcgtagattg 2340 gagtctatgt ttgacaaacc ttggcgttga tgagatggct aacacgtttt gcagcattat 2400 tcggaactgg atgaatcaac acgtgcctac agtaaagccc cgcgtgtccc ccgcttggag 2460 caacagccgt ctgaggcaaa tgaaacgact taaaaatgca tgtcagcgag cgctgcgccg 2520 tcgtcgctct gttgtcaaca agatgaagtt taggagatca agcgatgcat acaggaaact 2580 caactctaca ctgtacaagt cgtacgtgtt gcgtgtccag acgaaccttc ggcgtaatcc 2640 ccgcgggttt tggagctttg tgaacgcgaa aagaaagagc aaagacatcc ctgacagtgt 2700 ctttcttgac caacacgagt ccggatctcc agagatgtcc tgtgatctgt ttgcacagca 2760 tttcaaaagt gtttttgctg caggttgcgc cactgacgag gaagctgcgg ttgctgcaaa 2820 caacgttccc gagaaccttc ttgacctttc aacattcgac gtgtcggctg atatggtaaa 2880 gcaagcgcta gcaaagctaa agtcgtcctt ctccccaggg ccagatggta tcccttctgt 2940 ggtctaccgc cgatgtgctg atgcgttggt tgctcctctt tcggtcattt tcaacaagtc 3000 attttcccag caaacgttcc cgtccacttg gaagcagtca ttcatcgtgc ctgtgcacaa 3060 gagtggtgac aaaagaaacg tcagaaacta tcgaggagtg acgaacctgt ccgcagcatc 3120 caaattgttc gagatcatcg tcagcaaagt gatccagcgt caagctatgc gctacctgtc 3180 tgatgatcag catggtttca tcccgaagcg ttcggtgacc acgaatcttt tggaattcac 3240 ctcaacatgc atcggccact tggagaacaa agcgcaagtc gacgtgatat acaccgatct 3300 caaggccgct tttgaccgca tcgaccacaa cgtcctgctt cgtaagcttt ctcgcctagg 3360 cctgtctgta caactggtaa actggctacg gtcttacttg gtgggtcgtt cgcttcgggt 3420 tcgactaggg acgagtgttt cagctgtttt tgtgaacaat tcgggagtac cgcaaggaag 3480 caatttggga ccacttctat ttatcctgtt tttcaatgac gtgctgtttt tgctaggacc 3540 aggttgcgcc cttgtgtacg ctgatgacct gaagctattc ctgccgatca aagcaacaga 3600 tgattgctgt cgactgcaag ctcttctgga cattttcgtc tcctggtgtc ggcgcaacaa 3660 gctcgtcgta aacatttcca agtgcttggt catgtcattc acacgtagca aacggatgat 3720 ctcgtttgac tacagcatcg atggagtcgt cctgcaaaag gttgaccaag ttagagatct 3780 tggggttctg atggatccta agctgacatt tgacctccaa cgagtttcgg tgattgccaa 3840 ggccaatcgg cagcttggat tcatctccaa aatttcgaag gactttcgag acccgtactg 3900 ttggaaatcc ctgtactgtt ccctcgttcg gcccatcctt gagtacgctg ccattgtgtg 3960 gttgccctac cagctaacgt ggattttgcg aatcgaacgt gtgcagaagc ggttcatccg 4020 gagagccctg agaagcttgc cctggaggga tccagttaat ttgcctccgt atccagctcg 4080 atgtcggctg ctgaacatcc aaccccttca gtccaggcgt acaacgcagc aggcaacttt 4140 tgttgctaag ttgttgaatg gagaagtcga ctctccaaat ctgctctctc ttatcaactt 4200 ccgcacaccg caaagaacgc ttcgtgactc aacgctacta gcccgcagat tccaccgtac 4260 agcctatggt ttcaacgaac cagtttccag catgatccga gccttcacag cagtagaaga 4320 tttgtttgaa tttggtgaat cgtctgtccg ttttcgtgat aaaatcaacc gtactcaaac 4380 ttttattcat taagacaaac attgtcagat gaatgtaaat aaacaattaa acaattaaac 4440 aat 4443 // ID Gypsy-94_CQ-LTR repbase; DNA; INV; 1112 BP. XX AC AAWU01007055; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-94_CQ_; KW Gypsy-94_CQ-I; Gypsy-94_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 568-568 (2011). XX DR GenBank; AAWU01007055; Positions 62112 63223. XX SQ Sequence 1112 BP; 307 A; 299 C; 290 G; 216 T; 0 other; tgtgaccaaa cggtttgttt cgtccacgct ttcaccaggt tcaaccagta tgtcccgggg 60 ttttctcatt aaaaacggga aggtagacca caatgctaat aagctagata gggaaactaa 120 ggtagatcaa cccgaaaacg tccaaataaa gcccaaaaat caaccaataa cccaaaacaa 180 ttaaatatct gagctaaatc ttgcagaaag tcggcacaac ggtcatatca aacacacaac 240 acacaattta ggtattgtac gcagccaaaa caacattaca cgcctcacta tgcccaccct 300 gccactgaac tgcaaattgt gcgttctttg cgaagaacgc aaggcacggc taagtgctga 360 gaggaggtcg aggcttcctg gaggaccttc tcagctagac tcgtccactt gtacgagagc 420 cacaaaccga acaacacctg gaagttcgat cagcgaaaag tttacttata acccgcgacc 480 gcgtggccga ttaagccaag tcttgcctag ttggcaatcg attcgccgta cgagttttgt 540 aagtgatcta ccgtcacctg atcaccgagg ctgggcagcc tgtgcagccg tcaccatctg 600 cacgaaccgg accgaaggtg tctaccggtg ttgcgctacg cgtgcggtcc gtcaacacac 660 cgcccccatt gtgcccggta cacccggtcg aagcgaagca ggtccacgag cgacccaggc 720 cagattccgg acgggaactg tgcgcagacg agcgagttgg agcagagcgg ttgagaagac 780 cgcgaggaga ttcggagaac tatactccgg cgcgagaaac cacgtgtccc cgagggacac 840 ggacactgcg ccggatcgag gaagcagcgc aggcgatcga gaccgctgcc agcaagcagg 900 gtgcagcagt tcaaccaccg ggcccgtaca caccacacac acaaggttag ttggttaggt 960 tgttaggagt agggaattag ggtcttgagg gctgaataaa cgtgccgcat taaggagagc 1020 aaacatggtc ttttccttcc cctattgagc cgatcttggg atatttctgg gagtccttgc 1080 gactgagcgg gcgtaactga ccgacagtta ca 1112 // ID GPRP1 repbase; DNA; INV; 374 BP. XX AC X85444; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE G.pallida repetitive DNA element. XX KW GPRP1; Repetitive element. XX OS Globodera pallida OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Heteroderidae; Heteroderinae; Globodera. XX RN [1] RP 1-374 RA Burrows R.P., Smoker M. and Grisi E.M.; RT "Sequence and genomic organization of a novel repetitive DNA RT element from the potato cyst nematode Globodera pallida."; RL Unpublished. XX RN [2] RP 1-374 RA Burrows R.P.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (16-MAR-1995). P.R. RL Burrows, IACR-Rothamsted, Harpenden, Hertfordshire, AL5 2JQ, UK. XX DR GenBank; X85444; Positions 1 374. XX SQ Sequence 374 BP; 119 A; 65 C; 70 G; 120 T; 0 other; agctttacca catcatttca tagtggttcc aattgtttcc cgggctaggg aataaaaata 60 aatttattat aattttttaa attatcccaa tgcgcccaaa catagggggt acccttttgc 120 tggtgatatt tggtgtacaa aataattcat aaaaaattta tttatcaagc cctagggaag 180 caattgtcgc actgtatatt tcatctcctg atccttattg gacaaaaatt agcctcccat 240 acatgcgttg aatgggtgat atagggtggg gtcgtactac aaaaatggac aaaaatggac 300 aaaaatattt ttaaatttcc gggagtctcc tgagcatcac agcacatgta tgggaggcta 360 atttttgttc aata 374 // ID RTE-16_BF repbase; DNA; INV; 3353 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-16_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-16_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3353 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3353 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1714-1714 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 67..3339 FT /product="RTE-16_BF_1p" FT /translation="PPSRFSPGNGCCNDPGLRYHRGPGGALAGLPVVLPPP FT LAGSTMVGRFLGELQLANSTCTRTASVLDGRYTNSGGVVRWVSNPPRKNKL FT ATNPHTWNLNRTDPNGSLTMDRMDLRAKSKQSGRMRVATWNVLSLNREGSA FT ELAANELARLSIAVAGLTEVRWPGSGSHTANDYNFLWSGRDDGQHRQGVAL FT ALAPTAFRALVHWKPVNNRLLLARLRHSHGRISVIVAYAPTNVAADEEKDE FT FFTQLNDLMSTISRHDIVWILGDLNATTGPDRAGYESSLGPHGSGICNNNG FT KRLLEFCSSFRLRTERSFYPHKSIHSMTWISNDGQTRKELDHILTRRRWHA FT STDCRVYRSAVLGNSDHRLLAMTVCLRLQKNPSSKAQRKVDVSRLRSPEVE FT THFQLDLQNRFTALSAHTDEENPEKLWDVFKQNVTTAAVQVLGHKRRKKKK FT DFLSEETLSVVEAKRAARLEGNRSECRRLNAIRNKLLRKDKQRWLDDLAEE FT AEMAARRGDQGSLYRTLRTLAGRTTPPTACVKAMDGTVPDTPDQQLARWRE FT HFENLLNRPAPPPSPQLDALAAAASEDDSVPTGPPTREEVVKAIQKLKPGR FT AAGADGITPELLLHGGPALADQLLQLFTAIWSSETIPEDWLQGVILPFWKK FT GSKDLCSNYRGITLLSVPGKVFANVILGRLRPLLLRKQRKEQSGFTPGRST FT VDRILTLRLLAEKRREFRRPLFAAYVDLKQAFDSVDRQALWKILKTLGVPA FT KLLTLLSLLYSNTSSSVRVNGQMSDSFHINSGVRQGCVLAPSVFNTAIDFV FT MSRTVAQCACGASYGNVTVTDLDYADDVAILAEVMEVLQLALQAMDTETHP FT LGLQVSWGKTKVQSLSDYEPPPPGLVINSNPVEAADKFCYLGGTITSDCKS FT DTDIQLRIGRAAAAMASLDRVWTSSKISVQTKIRLYNSLVLSILLYGAETW FT TLTAAQERRLDAFDTKCQRRILGIRWHDYVSNATLRETTSQPQLSSKVRQA FT RLRLFGHIARTEPPLETAALLREPTPPNWSRPRGRPRHTWQEQLKDDLRAA FT GLDLTTAWVLALNRPMWRSVCAGATLHPGACGPE" XX SQ Sequence 3353 BP; 833 A; 982 C; 873 G; 665 T; 0 other; acaaataaac ataatttaca aaacgtacct caaacgtacc ttttggggat aaagtcagag 60 tgctaacctc ctagtcggtt ttccccgggc aacggctgct gcaatgaccc tgggttgcga 120 taccatcggg ggccgggcgg agcactagca ggtctacctg tagtgcttcc cccgcccctg 180 gcaggttcaa ccatggtggg acggttccta ggggagctgc agctggccaa tagcacctgc 240 acccgcaccg cctcagtcct cgatgggcgg tatacaaatt caggcggggt tgtgcgatgg 300 gttagcaacc caccacgtaa aaacaaactt gctacgaatc ctcacacatg gaacttaaat 360 aggactgacc ctaatggatc tttgaccatg gaccggatgg acctacgagc taagtctaaa 420 caatctggaa gaatgagagt tgcaacttgg aacgttctgt cgctgaacag ggaaggctct 480 gcagagctgg ctgccaacga actggcacgt ctgagcatcg ctgttgcagg tctcacagaa 540 gtacgctggc ctggctccgg ctcgcacact gcaaatgact acaacttcct ttggtcggga 600 agggatgacg gccaacaccg ccaaggagtt gcactcgccc ttgccccaac tgcctttcgt 660 gcactagtcc actggaaacc agtgaacaac agactgcttc tcgcccgcct gcgccacagt 720 catggaagaa tatcagtcat cgtagcctat gccccaacca acgtagcagc tgacgaggaa 780 aaggacgagt tcttcaccca gctaaacgac cttatgtcta ccatttccag acatgacatt 840 gtatggattc ttggggacct taatgctacc accggacctg acagggctgg gtacgagtct 900 agtctcggcc cccatggatc cggcatctgc aacaacaacg gcaagagact cctcgagttc 960 tgctccagct tccgactacg gaccgagcgt tccttctacc cccacaagtc cattcatagc 1020 atgacatgga ttagcaacga tgggcagaca cggaaagaac tggaccatat cctcaccagg 1080 agacgttggc atgcatctac tgactgccgt gtctaccgca gcgcagtgct gggaaactct 1140 gaccacaggc tgcttgctat gactgtctgc cttcgactgc aaaagaaccc ttccagtaaa 1200 gctcagcgta aggttgacgt gtctaggttg cggtcacctg aagttgaaac ccacttccaa 1260 ctggacctcc aaaaccgctt cactgctctg tctgcacata cagatgagga gaacccggaa 1320 aagctctggg atgttttcaa gcaaaacgtc acaaccgctg cagtgcaggt gcttggccat 1380 aagagaagga agaagaagaa agactttctc tcagaggaaa cccttagtgt cgtggaggca 1440 aaaagagcag ctagactgga aggaaaccgc agtgaatgca gacgcctcaa cgccatccgc 1500 aataagctcc tacgcaagga caagcagaga tggcttgatg acctagcgga agaagcggag 1560 atggctgccc gcagaggaga tcaagggtcc ctctacagga ctctaaggac cctcgcaggg 1620 aggacaactc cacctactgc ttgtgtgaaa gccatggatg gaacagtccc agacacccca 1680 gaccaacagc tggcaagatg gagggagcac ttcgagaacc tgctcaaccg accagcacca 1740 ccaccctccc ctcagctcga tgcactagcc gccgcagcca gcgaggacga ctctgttcct 1800 actggccccc ccacgcgaga ggaagtggtc aaagccatac agaagctgaa gcctggccgg 1860 gctgcaggag ctgacggcat cacacctgaa ctgctgctcc acggaggccc tgcactagca 1920 gatcagctct tacaactgtt cactgccatc tggagcagcg agaccatccc ggaagactgg 1980 ttgcaaggcg tcatccttcc cttctggaag aaagggtcaa aggatctctg cagcaactac 2040 agaggaatta ccctacttag tgttcccggg aaggtgtttg caaacgtcat ccttggaagg 2100 ctgcgcccac tccttcttcg caaacagaga aaggagcaga gcggcttcac accgggtcgc 2160 tcgactgttg acaggatttt aactctgcga ctgctggccg agaagagaag agaatttcgg 2220 agacccctct tcgcagccta cgttgatctg aagcaggcct ttgattctgt tgacagacag 2280 gccctgtgga agatcctcaa gactctcggt gttccagcaa agcttctgac tctcctctct 2340 ctactctact ccaacacctc gtcctctgtg agagtaaatg gacagatgtc tgactccttc 2400 catatcaaca gtggtgtccg ccagggttgc gtcttagcac catctgtgtt taatacagcc 2460 attgactttg tcatgagcag aactgtggct cagtgtgcct gcggtgcaag ctatgggaat 2520 gtcactgtca ctgacttgga ttatgcggat gacgtggcta tcctagctga agtcatggaa 2580 gtcctccaac ttgccctcca ggccatggac acggaaaccc atccgctggg cctgcaggtc 2640 agctggggaa agaccaaggt ccagagcctc agcgactacg agcctcctcc accaggactg 2700 gtgatcaact ctaacccggt tgaagccgct gacaagttct gctaccttgg aggcaccata 2760 acatcagact gcaagagcga cacagacatc cagctccgca tcggccgggc cgccgcggca 2820 atggccagtc tggacagggt ttggaccagc agcaagatct ctgtgcagac aaagataagg 2880 ctctacaaca gcctcgtgct gtccatcctt ctctacgggg cagagacttg gactctcact 2940 gctgcccagg agcgtcgact cgacgccttt gatactaagt gccaaagaag aatcctgggg 3000 atcaggtggc atgactatgt atccaatgcc accctgcgcg agaccaccag tcaaccccaa 3060 ctgtccagta aggtgcgcca ggcccgtcta cgactgtttg gtcacatcgc caggacggag 3120 ccacctctgg aaacagctgc ccttctgagg gagccgaccc cacccaactg gtcccgcccc 3180 aggggccgtc ccaggcatac ctggcaagaa caactgaagg acgacctgag ggctgccggc 3240 ctcgacctta ccaccgcctg ggtgctggcc ctcaacagac ctatgtggag atctgtgtgt 3300 gcaggcgcta cgctccatcc cggagcatgc ggacctgagt gagtgagtga gtg 3353 // ID CR1-36_HM repbase; DNA; INV; 4745 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-36_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4745 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1864-1864 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 757..1680 FT /product="CR1-36_HM_1p" FT /translation="MSANELYETSSISPSLNPIDLLENWPGPGKGKAAYLP FT KRCDAMIRSLATTVKLLLEKVAALEKTNTNLQIQLTNCQPPAVYPSVSQPI FT LKSFASQLTKPGSATNNAILNAVNINAKLQQSKAKRAIITGIPNAASTDAS FT VVDQSVKDILQYVHYSASFKVVGRIPAAISNKNNKSPPNTSISTSSTVTAP FT TQPAPPSLIIEFPTNETRNGLLAIAKNLATNESYKSVFIRPDRTPSEQDLF FT NKLNNEKFKLNNGLKANGKLDQPFRFVIRSDRVKCIDVSKKVNINGLEKHP FT FINWKGALAAAQTNQ*" FT CDS 1514..4627 FT /product="CR1-36_HM_2p" FT /translation="MALKQTVNLISPFVLSSAATESNASTLAKKSTSMAWK FT NTRSSTGKERSRRPKPTNEKSQHSTHDRFHIIKNLSSPTSQSNASKLLLNC FT FYANATSLNPNKLTELELATSLYDYDTIFITETWFNESSAPAIPNYNLFRK FT DRSSRGGGVAIYVKNSIIANEIDYSQLHIGFQCNFSEQIWLELKLPQESLL FT IGCIYRPPTNNVLAFNEIVKAISYAKSAVENKKFNGLIITGDFNFPDVIWH FT DDWTNTTGAETSFGTKFAXTIFDLHLHQHVINPTFIQANGLFKNTLDLVIT FT ETTERLTHITIGPPLGSASQGHCTMTWQHQLAKNVCNSPRFNISKFIYRRG FT NYANLNEKLSQVDWVSLFKDRYADECYQIFCDNYIKLCDKFIPRVTSKPAK FT HRQPWLTSELFLLIKRKKKLWYLNRNSKWRIIPLVKEYNQTRNLVKKTSCT FT AIRAFEHKLASDKWNPKRLYAYVNNKRAVSNQITSMCSPAGEISSDHASIA FT NILNSQFQSVFTIEPSDQQLPPFVKRTTALISSVIFSYDSVHKHLSALNAN FT KSTGGDGISPFVLKACADAAATPLAIIFQKSMDQGVVPKAWLSANVTPLFK FT KGSRLDPENYRPISITSVPCKIMEKIVKEKIMQHLVAHKLIADCQHGFVNK FT KTCVTNLLEAMDFLTYNVSRKLPVDVLFLDFAKAFDKVPHRRLLQKLESYG FT IEGNVLNWIKAFLSNRSQRVILGNTSSTWVKVTSGVPQGSVLGPTLFIIFI FT NDLPEHINSENLCKMYADDTKILSVVKTTEDKARLQSDIDSVVTWTRTWLM FT ELNIKKCKVMHFSKYNVSPHIYLMNEYDANGSVTRTVIESSTSEKDLGVQI FT SNDLKYERQAQIAAAKANKVLGTLKNTFISRDTKIWKKLYTTYIRPHLEFA FT VTAWCPHLEKDINVIERVQHRVTKVPHEIKRFDYETRCIKLGLTSLLNRRL FT RGDLIQKYKFENSIDSISWHIDPPNIPPSYGHRKRFHREMDNNCLVRYNSF FT NNRVAHPWNALPNVVVSATSVDRFKAKFDQL*" XX SQ Sequence 4745 BP; 1531 A; 1141 C; 725 G; 1347 T; 1 other; tttttttttt caaagaggtt ttgaatagat aagacgcttt ttattttgtc gtaagttttt 60 gagctatttt ttttcttatt ttgttttttt cttttaaaat tttctttcct tcttactatt 120 tttcttcttc taaatacaca ttaatattca ttaccctata ctatattata atatattata 180 tcaactaaag tcaatcgctt cgaactaact atatagtaac taaatatata gtgtttttat 240 ttttatttta tttttattgc tcgtatctat atttatatat aaatatacga ctatattata 300 aataaacctt tattattcga agatttgatt tgcgcttata ttatttaagt aacaagtttt 360 taacttgttt tttgaaaacc tatttaaatt atttcatttt tattttattt taaaagtttt 420 tcaactttct tcgtagtttt ttttttatag ttgcgttgca cctgattctt aatctgtttg 480 ttgttttcgg attaattttt gaacataaaa ataatttgtg tgtgtgtgat tgtgacaaac 540 gttaataata tatataatta tatatacgga aaaactgttg ttatatataa atacatataa 600 gtattgttta tatatataca aaaattatat ataaaatatc ctttcaacta tccaccaact 660 taataaccaa atttgttatc caaccacaac acttattcga agcataaacc tcaaatccaa 720 tattcagtca tctaattaga tcctcaacat tccaaaatgt ccgccaacga attgtatgaa 780 acatcgtcca tttcaccttc tttaaaccca atagacctgc ttgaaaattg gccgggtcct 840 ggaaaaggca aggctgctta tttacctaaa agatgtgacg ccatgatccg ctctctagct 900 acaacagtca aattactgct tgaaaaagta gcagccctcg aaaaaaccaa caccaacttg 960 caaatacaac tcactaactg ccaaccaccg gcagtttatc cttccgtttc tcaaccaatc 1020 ctaaaatctt ttgcttcgca acttaccaaa cccggatcag ccacaaacaa tgccatctta 1080 aatgccgtga acataaacgc caagcttcaa cagtcaaaag ctaagagagc cataatcacc 1140 ggaattccca acgcagccag taccgatgct tccgtcgtcg atcaatcggt aaaggatata 1200 ttacaatacg tacattactc tgcttccttc aaagttgtag gacgcattcc tgccgctatt 1260 tctaacaaaa ataacaaatc gccgccaaac actagcatat ctacttcttc tactgtcact 1320 gcccccactc aaccagctcc accgtctctt atcatcgaat tcccaaccaa tgaaactcgc 1380 aacggtcttc tagcaatcgc aaaaaacctt gccaccaatg aatcatacaa aagtgtcttc 1440 atacggccag acagaactcc ttcagagcaa gatcttttta acaagttgaa caacgaaaag 1500 tttaagctca acaatggcct taaagcaaac ggtaaacttg atcagccctt tcgttttgtc 1560 atccgcagcg acagagtcaa atgcatcgac gttagcaaaa aagtcaacat caatggcttg 1620 gaaaaacacc cgttcatcaa ctggaaagga gcgctcgcgg cggcccaaac caaccaatga 1680 gaagtcgcaa cattcaactc atgaccgttt tcatataatt aaaaatttgt cttctcccac 1740 ttcacaaagc aatgcttcta aattgttact aaactgcttc tatgcaaacg ccacatctct 1800 aaatcctaat aaactcaccg aacttgaatt agcaacgtcc ctctatgatt atgacaccat 1860 ttttatcact gaaacatggt ttaacgaatc ctcagcacca gctattccca actacaatct 1920 ctttagaaaa gatcgctcta gtcgtggagg aggagtagcc atatatgtca agaacagcat 1980 tattgccaac gaaattgact actctcaact tcacatcggt ttccaatgca atttctcaga 2040 acagatctgg ctcgaactta aactcccaca agaatctctc ctaatcggct gcatctatcg 2100 cccacctacc aataatgttc tggcctttaa cgaaatagta aaagctatct cctacgcgaa 2160 atcggccgta gaaaacaaaa agtttaatgg tctaataatc accggtgact tcaactttcc 2220 cgacgtcata tggcacgacg actggaccaa taccaccggc gctgaaacta gttttggcac 2280 taaatttgcc raaaccattt ttgacctaca tctacaccag cacgttatca atcccacctt 2340 catccaagcc aacggtctgt tcaaaaatac actagacctt gtaattaccg aaacaaccga 2400 acgattaacc catattacta tcggtcctcc tcttggtagc gcctcgcaag gtcactgcac 2460 tatgacatgg caacaccagc tagccaaaaa cgtctgcaac tctccccgtt ttaacatctc 2520 aaaatttata tacaggagag gtaattacgc gaatctaaac gagaaactct cccaagttga 2580 ttgggttagt ttatttaaag accgatatgc cgacgaatgt tatcaaatat tctgcgacaa 2640 ctatatcaaa ttatgtgaca aatttattcc gcgtgtcacc tctaaaccgg ccaaacacag 2700 gcagccttgg ctaaccagcg aacttttttt actaattaaa cgtaagaaaa aactttggta 2760 cctcaaccgc aacagcaaat ggcgaatcat tcctcttgtc aaagagtata accaaactcg 2820 aaaccttgtc aaaaaaactt cctgcactgc aattagagca tttgaacaca aattagctag 2880 cgacaaatgg aaccctaagc gtctatacgc ttatgtaaac aacaaacgcg ccgtcagtaa 2940 ccagataact tctatgtgct cacccgctgg ggaaataagt agcgaccacg catccatagc 3000 caatatactt aacagccagt tccagtcagt attcacaatc gaaccatcag atcaacaact 3060 acctcccttt gttaaacgta caaccgcctt aatctcctcc gtcatcttca gctatgattc 3120 cgttcacaaa cacctatctg ctctcaacgc caataaatca actggaggtg acggtatcag 3180 tccatttgtg cttaaagcat gcgctgatgc ggcggctacc ccgcttgcca tcatttttca 3240 gaaatcgatg gatcaaggtg tagtcccaaa agcgtggctc tcggcaaacg tgacgccact 3300 gttcaaaaaa ggctccagac tcgacccaga aaactatagg cctatctcca tcacctcagt 3360 accatgcaaa ataatggaaa aaattgttaa agaaaagatc atgcagcacc ttgtagctca 3420 taaacttatt gccgattgcc agcacggctt cgtcaataaa aaaacctgcg tcacgaatct 3480 gcttgaggcc atggactttc tgacctacaa cgtttctcgc aagcttcctg tagatgttct 3540 ttttcttgac ttcgccaaag ccttcgataa agtccctcac agaagattac ttcaaaaact 3600 tgaatcctac ggaatcgaag gtaacgttct taactggata aaagctttcc tttcaaatag 3660 atcacagcgt gtcatattag gtaatacgtc atccacctgg gtaaaggtta ccagtggcgt 3720 tccgcaaggt tcagtgctag gtcctacgct gttcataata tttattaatg acttacccga 3780 acacatcaat agcgaaaact tatgtaaaat gtatgcagac gatactaaaa tcctgagtgt 3840 cgtcaaaacc actgaagaca aagctcgtct gcaatccgat atagatagcg ttgttacctg 3900 gactcgaacc tggttaatgg aactcaacat caagaaatgc aaagtcatgc atttcagtaa 3960 gtacaatgtt tccccacaca tttatttaat gaacgagtac gacgctaacg gtagcgtcac 4020 tagaacagta atcgagtctt caacatccga aaaagatcta ggcgttcaaa tttccaacga 4080 cttaaaatac gagaggcaag cacaaatagc cgctgccaag gctaataaag tcctcgggac 4140 attgaaaaac acgtttataa gcagagatac caaaatatgg aaaaaacttt acaccactta 4200 tatacgccca catctcgagt ttgcagttac tgcctggtgc ccgcatcttg aaaaagacat 4260 aaatgtcata gaacgtgttc agcatcgggt taccaaagta ccgcacgaaa taaagcgctt 4320 cgattacgag acaagatgca tcaaactcgg tctcacctct ctattaaata gacgtttacg 4380 cggcgacctc attcaaaaat ataaattcga aaacagtatc gactcaatta gctggcatat 4440 agatccacct aacatcccac ctagctacgg tcatcgcaag cgttttcatc gtgaaatgga 4500 caacaactgc ctcgttcgct ataattcctt caacaaccgc gtcgctcacc catggaacgc 4560 attacctaac gtagtagtca gtgctaccag tgtcgatcgc ttcaaagcaa aattcgatca 4620 gctttaataa ttactacagc gcatcatctt ccgtgcaatt tcgcacgagc tagcacggct 4680 agctgttgat gggcttacgg aacacgcccg ctccgtgtaa tttaactttt aacttaacta 4740 aatat 4745 // ID Gypsy-6_DWil-I repbase; DNA; INV; 5461 BP. XX AC scaffold_180699; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_DWil_; KW Gypsy-6_DWil-LTR; Gypsy-6_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5461 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180699; Positions 658748 653288. XX CC Positions [2036-2542] - Reverse transcriptase CC Positions [4452-4931] - Integrase core CC 'GACA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 38..4171 FT /product="Gypsy-6_DWil-I_1p" FT /translation="MKYTEIIKLNKTALFELLAAKEVPCDREASIYQLRAL FT VKAVMAQNQSGESDVPEEQAVSPAVSANVLAGQLVDVGPEPLIEQPSADQV FT GANRLPTMSRGTPVGPDTNVHVREGAQASETDELASILSSIELYTARKNLA FT ELKTRVMELEENLSGPEYNCRMNLKDVETTVPKFSGDGDLSIHVWIRELEK FT AARVYVLSDAQCCSLGNRLLDGSAKAYMTYEKTTTWSELKDALVKVFGLQM FT TNYEIANQMRKRSIGKGESLLQYFIIMRNIAQQGNFEDTDVVKYIVDGLQD FT RTGLSAPLYYCTSLNELREKMMRFQTVQPPAKNYVPQARPSDRAQAITPAA FT KDKRCFNCRQIGHLMFDCPRPERPKGSCYRCFKVGHQYRNCPMRTQIAAMS FT SSEDPDTIDDKSFIQLVSLRISCKGGCTDVNNIFALLDSGSPVSFASRALM FT PHLDCDKLVKSSFHGLSGGSILTFGTVKCLVLFCKKYYEIQFLVIPDNIMP FT TQLLLGRDALNKIGLKFSLIENIRKYEKNSASLKSTIAFVYKSFCASLFTK FT YPKHNTISNNTTDLYNNCNNPNINTNASPFREWLTELQTVCATLSDTGNTT FT VGSEFGLGAMRECRDIVHEHYIDRFSINYGLPKYMMRICLTDTTPFHSSPR FT RLSYYEREIVSKTVSELLANNVIRPSNSPYASAIVLVRKKDGNIRMCVDYR FT GLNKKTVRDNFPLPLIEDCLEYLESKTVFSVIDLKSGFYHVGMEEDSIKYT FT AFVTPDGQFEYLRMPFGLKNGPAVFQRFISNILSDMIRDRELVVYMDDIIL FT ATRTVDQHLKLLAKLLDRFIQYNLELNLKKSIFLQKSIDYLGYEVSEKGIL FT PNNTHLKAIQEYPEPLNSKALHSCLGLFFYFRKFVPNFSRIAKPLTELLKQ FT GGPFPMNKTAKNVFLTLRSALSSPPVLAIFNPNYETELHTDASSNGFGASL FT MQRQTDGKMHPVSYYSRKATAAESRYHSFELETLAIIYAVRRFRVYLEHRP FT FLIVTDCNSLVQTLSKKSINPRIARWSLELENFDYTIAHRPGSNMGHVDAL FT SRQTELLENIVEGDFDPCDLLVKRDQNAPIEPCEPHITFEESCVKPCESGR FT DLTKLCAIQDDSCVESCELVKPPEFESDFIEPCVPSEKNEFSRDSIRPYAV FT LEQSCSEPCEIKPCEIKPFETKPCETDPCEIKPFETKPCETDPCEIKPFET FT KPCEIDLCEKNPCKIDPCEIKPCEIKPFETKPCEIDLCEKNPCKIDPCEIN FT PCKIEPCAPLGKNYLGPFGTNPCVHLAKSCFRPCELCLERSRLKLVKNKPC FT VSLNESSVDPRLSNQIERSDQLDRNESANLVGVVGASPQDVDLLLQSEQIR FT DPDIRKLRENFRKRQ" FT CDS 4254..5339 FT /product="Gypsy-6_DWil-I_2p" FT /translation="MQGELIRRCHEKLGHLGVEKCVNNLKEKYWFPLMNPK FT VESFIKDCLPCILHSVPRTIHNRTLHSIPKSPIPFDTIHVDHLGPLPSVTS FT KRKHLLAVIDAFTKFVKLYPVNSTSTKEVTAALEKYFDYYSRPRRVISDRG FT TCFTSFEFAEFVENRNIDHIKVATAAPQANGQVERVNRVLTPMLGKLSEPL FT AQADWYKLVNRVEFAINNSVQCSTGKTPSMLLFGCDQRGPLVDELTEYLDE FT KFSSENPCNLEATRQEASAKIQLSQERNEKRYALRHRAPLSYRVNDFVAIR FT NVDVTPGSCKKFAPKYRGPYKINRVLPNDRYEVTDIDDCQLTQLPYKGILE FT SARLKPWMHACDKTVSFCL" XX SQ Sequence 5461 BP; 1670 A; 1026 C; 1236 G; 1529 T; 0 other; aatcaggtgt ggagtttgcc ctaaagaacg tattttcatg aaatacaccg agatcattaa 60 attaaacaaa acagcattgt ttgagttgct agcggccaag gaagtacctt gcgacagaga 120 agcgtcaatt taccaattgc gagcattggt aaaagcagtc atggcacaga atcagtctgg 180 agagagtgat gtacccgaag agcaggcagt cagcccagca gtgtctgcaa atgtgcttgc 240 cggacaactt gtggatgttg gtccggagcc tctcattgag cagccaagtg cggatcaagt 300 gggtgctaac cggttgccga cgatgtctcg tgggacccct gtagggccag acacgaacgt 360 gcatgttcgt gaaggcgcac aagcgagcga aaccgacgaa ttggcttcga ttcttagtag 420 catcgaattg tatacagcac gtaaaaacct ggcggagcta aaaacacgtg ttatggagct 480 ggaggagaac ttaagcggac cggaatataa ttgccgcatg aacttgaaag atgtcgaaac 540 tactgtacct aaattctctg gtgatggtga tctgtccatt cacgtatgga ttcgtgagct 600 ggagaaagca gcaagagtgt atgtactgag tgatgcccaa tgttgctcgt tgggcaaccg 660 actgctggat ggatctgcta aagcatacat gacatatgaa aagactacaa cctggagcga 720 gctgaaggat gcccttgtga aggtatttgg cctacaaatg acaaactacg agattgctaa 780 ccagatgagg aagcgctcaa ttggcaaagg agagtcactc ttgcagtatt tcatcattat 840 gcgaaatatt gctcagcagg gcaattttga agacaccgat gtggtcaaat acattgttga 900 cggactacaa gatcgaaccg gattgtcagc tcctttgtat tattgtactt ctctgaacga 960 attgcgagag aagatgatgc gttttcaaac agttcagcca ccagccaaaa actacgttcc 1020 tcaggctcgt ccttcagacc gtgcgcaggc cattacgcct gctgcaaagg ataagagatg 1080 tttcaattgc cgccagattg gtcacttaat gtttgactgt ccacgcccgg agagaccaaa 1140 aggaagttgc tatcggtgct tcaaagtggg ccatcagtac cggaattgtc caatgcgaac 1200 ccagatcgcc gccatgagct ctagtgaaga cccggacaca attgatgata agtcatttat 1260 tcaattagta agtttaagaa tatcttgtaa agggggatgc actgatgtta ataatatttt 1320 tgccctatta gattccggaa gcccagtaag ttttgccagt agagcgttga tgccgcattt 1380 agattgcgat aaattagtta aaagttcttt tcatggactg agcggtggct ctattctgac 1440 ttttggaaca gtcaaatgtt tagttctgtt ttgcaaaaaa tattatgaaa ttcaattttt 1500 agttattccc gataatatta tgccgacgca actacttctt ggtagagacg ccttgaataa 1560 aataggtctt aagttctcat tgattgaaaa tataagaaag tatgaaaaga attccgcatc 1620 attaaagagt accattgctt ttgtttacaa atccttctgt gcatccctgt ttactaaata 1680 tcctaagcat aataccatta gtaataacac cactgatcta tataataatt gtaataaccc 1740 taatataaac acgaatgcat ccccgtttcg agaatggcta acagagttgc aaactgtttg 1800 tgcaacgctg tcagataccg ggaatactac tgttggcagt gaatttggat taggcgcgat 1860 gcgtgagtgc cgagacattg tgcatgagca ttacattgac cgattttcaa ttaattacgg 1920 gttgcctaaa tatatgatga gaatatgtct gacagataca acaccatttc atagtagtcc 1980 tagaaggtta tcgtattatg aaagagaaat cgtttctaaa acggttagtg agttgttagc 2040 caataatgta atccgaccaa gcaactctcc ctatgcgtcc gccatcgtcc tagtgcgaaa 2100 aaaggatggc aatatacgaa tgtgcgttga ttatcgaggt ttaaataaga agactgtgag 2160 ggataatttt ccattgcctt tgatagaaga ttgcctagaa tacctggaga gcaagaccgt 2220 tttctccgtc atagatttga aaagtggctt ttaccatgta ggcatggaag aggattcaat 2280 taaatacaca gcattcgtta cacccgatgg gcagttcgaa tatctgagaa tgccttttgg 2340 cctaaagaac ggaccagctg tatttcagag atttatatcg aacatactta gtgatatgat 2400 tagagatcga gaacttgtag tatatatgga cgacattatt cttgccaccc gcaccgtaga 2460 tcagcatctt aagttgttag caaaactgtt agatcgtttt attcagtaca atttagagct 2520 caatttgaag aaaagcattt tcctacagaa gagcatagat tatttaggat atgaagtcag 2580 tgaaaaggga attttaccaa ataatacaca tttaaaagct attcaagaat acccagaacc 2640 acttaacagc aaagcgttac actcgtgctt aggcttattc ttttattttc gcaaatttgt 2700 tccgaatttc tcccgcatag ctaaaccgtt aactgagttg cttaaacaag gtggtccatt 2760 tccgatgaac aaaaccgcga aaaatgtatt tcttacatta agatccgcct tatccagtcc 2820 gcccgtatta gctatcttta accctaacta cgaaactgag ttgcatactg acgccagttc 2880 aaatggcttt ggtgcctctt tgatgcaaag acagactgat ggcaaaatgc atccagtctc 2940 ttattattcc cgaaaggcga cagccgccga gtcccgttat cacagttttg agcttgaaac 3000 tttggccata atatatgccg tgcgccgatt tcgcgtttat ttagaacaca gaccattttt 3060 gatcgttacc gattgtaatt cattggtcca aaccttgagt aaaaaatcaa taaatccccg 3120 cattgcgcgg tggtcgttgg aattggaaaa ctttgattat accattgctc acagacccgg 3180 ttcaaacatg ggccatgtag acgcattaag tcggcaaacc gagttgttag agaatattgt 3240 agaaggtgat tttgatcctt gcgatttatt agtgaagaga gaccagaatg ccccaattga 3300 accttgtgaa ccgcatatta cttttgagga aagttgcgtt aaaccgtgtg agtctggtag 3360 agatttgacc aagctgtgtg ctattcaaga tgatagttgc gttgaatcat gtgagttagt 3420 taaaccgcct gagtttgaaa gcgattttat agaaccgtgt gtcccctcag agaaaaatga 3480 gtttagtaga gattcgatta ggccgtatgc cgttttagaa cagagttgtt ccgaaccgtg 3540 tgagataaaa ccttgtgaga taaaaccgtt tgagacaaaa ccgtgtgaga cagacccttg 3600 tgagataaaa ccgtttgaga caaaaccgtg tgagacagac ccttgtgaga taaaaccgtt 3660 tgagacaaaa ccgtgtgaga tagacctgtg tgagaaaaac ccgtgtaaga tagacccgtg 3720 tgagataaaa ccgtgtgaga taaaaccgtt tgagacaaaa ccgtgtgaga tagacctgtg 3780 tgagaaaaac ccgtgtaaga tagacccgtg tgagataaac ccgtgtaaaa tagaaccatg 3840 tgctccttta gggaagaatt atttaggacc gtttgggaca aacccgtgtg ttcatttagc 3900 gaagagttgt ttcagaccgt gtgagctatg tttagaaaga agccggttaa aactggttaa 3960 gaataaacct tgcgtttctt taaatgaaag ttctgttgat ccgagattga gtaaccaaat 4020 cgaaagaagt gaccagctcg atcgaaatga gtcggcaaac ctggttggtg tagtgggtgc 4080 gtctccgcaa gatgttgatt tattattgca gtcagaacaa ataagagatc cggatatccg 4140 taaactaaga gaaaatttta gaaaaaggca atgatgttgg atttgagtta atagatggta 4200 tagtgtacaa aagaaatgct aaggaaagac tgtgtttgta tgcaccggct cgcatgcaag 4260 gggaattaat aagaaggtgt cacgagaaac taggtcactt aggcgttgag aaatgtgtaa 4320 acaatttgaa agaaaagtac tggttcccat taatgaatcc aaaggttgag tcgtttataa 4380 aagattgtct gccgtgtatt ctacatagtg ttccaagaac tatacataat agaaccttgc 4440 atagtattcc gaaaagtccg attccatttg atactatcca tgttgaccac ttaggtccct 4500 taccgagtgt tacatcaaaa agaaagcacc ttttggcagt gattgacgca tttacaaagt 4560 ttgtcaagtt gtatcccgta aattccacta gtaccaaaga agtaaccgcc gcgttagaaa 4620 aatattttga ttattatagc cgaccaagaa gagttatatc cgacagaggt acgtgtttta 4680 cgtcgtttga gtttgctgag tttgtagaaa atcgaaatat tgaccacatt aaagtagcga 4740 ctgctgcccc tcaagccaat ggacaggtcg aacgtgtgaa ccgtgtgttg accccaatgt 4800 tgggaaagct gtcagaacct ttagcccaag ccgattggta taaattagtt aatcgtgtcg 4860 agttcgctat taataactca gtgcagtgta gtaccggtaa aactccgtca atgttgttgt 4920 ttggttgcga tcagcgaggc cccttagtag acgaattaac cgagtatctt gacgagaaat 4980 ttagttcaga aaacccctgt aaccttgagg ccactaggca agaggcaagt gcgaaaattc 5040 aattatcgca ggagcgcaat gaaaagcggt acgccttaag gcatagggca ccattatcct 5100 atagagtcaa tgattttgtt gccattcgca atgtagatgt cactcccggg tcgtgcaaaa 5160 agttcgctcc taaataccgt ggtccttata agataaatcg agttcttcca aatgaccgtt 5220 acgaagtcac tgatattgac gattgtcaac ttactcaatt gccctacaaa ggtatcttag 5280 aatcagcccg actcaagccg tggatgcatg cgtgtgataa aactgtgagt ttctgtttgt 5340 aatatgtatt tgataagcag tatttatatg tatattgtat gtattgtata tatgtatatt 5400 gtatgttgta tgtttgttga attgtatgat cgaggtcgat caaatgtcag gattggccga 5460 a 5461 // ID MuDR1_SM repbase; DNA; INV; 2547 BP. XX AC . XX DT 16-OCT-2007 (Rel. 12.1, Created) DT 16-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; KW Autonomous DNA transposon; Vandal-type transposon; MuDR1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2547 RA Jurka J.; RT "MuDR-type element from Schmidtea mediterranea."; RL Repbase Reports 7(10), 1089-1089 (2007). XX DR [1] (Consensus) XX CC This element has perfect 46 bp TIRs preceded and followed by CC non-complementary sequences each 5 bp long. It is flanked by 9bp CC TSDs. XX FH Key Location/Qualifiers FT CDS 323..2233 FT /product="MuDR1_SM_1p" FT /translation="MFFYVKSLFIINKIKTNTELIMANNFAEVEDYQSDLE FT FDVEVTSLKKPRAKKVSKVWTHEKVFKSAEEAKNAIKEEKIWSYSYSNKTE FT EGKKVYYRCNQVKFKGKQCDSALHLLYDSRSDEVILFRAEAEHKHEENMSK FT SAISNEAKELIKELFKLKIKPKRMLEIFHERGIKPVPTTIQLRNQLNKIRN FT EVYGPPVISLGELENWLQSSSSIPELEDEAFVVNYKITYADEYECEESNEE FT SDEDEENHGSKFRFFISTIRLLKLASASTILNADATYKLVWQGYPILIIGT FT TDLDRHFHSFGLGICTNEKTADFNFIFSSINIGLKKINASLFNPRVLISDA FT SNAIRKAYSQVFNKNEIVMCWAHMRRNCVKKIESLVAKEDQNELIEDVDQL FT QLSQSQVIFEKASSLFVKKWRQKKATEFLDYMNMMWLTTHNTWYEGYMPFT FT PSTNNNLESNNRVLKDEETIRERTPLSRFSKQALDIVKKWSLAYPRSLKEF FT ITKQSIDTALWTQSYQWAKESKAILSVPQEQSTNYYVSASDATITQEDIDK FT VEKMRWTTFDQFTKRAFTVWRINLPNDKANWADGQCNCPAFLKKFICKHVV FT GISIRLKYCKVPAVAKNTSIGSKLPRGRPTKAKKALLKQ" XX SQ Sequence 2547 BP; 981 A; 360 C; 394 G; 812 T; 0 other; gagaactatt tattcgctat gtatttttta ctaaacgcta tgtattttat aaacactttc 60 tatgtagata aatacacagt atcagtagag taaaacactt ctatgttaat taattaattc 120 tagatattta atacattcgc taagtaatta attaaaaact atgtaaaata aataaatgcc 180 aagtcaagaa aacaagttat aagtaaaata aaaaaaacga tatgtcaatt aataaaatcc 240 atcaataaat cttaattttt tctaagatgt tcttagatct tatttttttt cgacaattat 300 tacaaacttt caataaagcg gtatgttttt ctacgtaaag tctttattta ttataaacaa 360 aattaaaaca aatactgaac ttataatggc taataatttt gcagaagttg aagattatca 420 gtctgatctt gagtttgatg ttgaagttac aagtttaaag aaaccacgag ccaaaaaagt 480 gtccaaagtt tggactcatg aaaaagtatt taaaagtgct gaagaagcaa aaaatgcaat 540 taaagaagag aagatatgga gttactctta ttccaacaaa actgaagaag gtaaaaaagt 600 ttactatcga tgcaatcaag ttaaatttaa aggcaagcaa tgtgattctg cattacattt 660 actttacgac tcaagatcag acgaagttat tctttttagg gcagaagctg agcataagca 720 tgaagaaaac atgtctaaat ctgcaatatc taacgaagct aaagaactaa ttaaagagtt 780 gtttaagctt aaaataaaac ctaagagaat gttagaaatc tttcatgaac gaggtataaa 840 gcctgttcct actactatac aattaagaaa tcaattaaac aagattagaa atgaagttta 900 cgggccacca gtaataagtt taggtgagct tgaaaattgg cttcaatcat catcttctat 960 accagagtta gaagatgaag cctttgtagt caattataaa ataacttatg ccgatgaata 1020 tgagtgtgaa gagagtaatg aagaaagcga tgaagatgaa gaaaaccacg gctctaaatt 1080 tagatttttt ataagtacaa taagattact taagctagcc tcagcatcaa caattttaaa 1140 tgctgatgca acttataaac tggtttggca gggctatcca attttaatca ttggtacaac 1200 cgatttagat aggcattttc attcgtttgg cctaggtata tgtaccaatg aaaaaactgc 1260 tgattttaat tttatttttt caagtatcaa tattggcttg aaaaaaataa atgcatcatt 1320 attcaatcct cgtgtattga tttctgatgc ttctaacgct atacgtaaag cttattcaca 1380 agtttttaat aaaaatgaaa ttgtgatgtg ttgggcacat atgcgtagaa attgtgtcaa 1440 aaaaatcgag tctttagtag ctaaagaaga tcaaaacgaa ttaattgaag atgtagatca 1500 acttcaattg tcacaaagcc aagtcatttt tgaaaaagca agttctctat ttgttaaaaa 1560 atggagacaa aaaaaagcga ctgagttttt agactatatg aatatgatgt ggcttacgac 1620 acacaatact tggtatgaag gatacatgcc cttcacgcca agtactaaca ataatctaga 1680 gtcaaataat cgagttctta aagacgaaga aactattaga gagcgcacgc cattatctag 1740 atttagcaaa caagctttag atattgttaa aaagtggtca ctagcatatc ctcgaagtct 1800 gaaggagttt attacaaaac aatcaatcga cacagcatta tggacacagt cttatcaatg 1860 ggctaaagaa agcaaagcaa ttctttcggt gccccaagaa cagtcgacaa actactatgt 1920 ttcagctagt gacgcaacta taactcaaga agacattgac aaagttgaaa aaatgcgctg 1980 gacaacattc gatcaattta cgaagcgagc ttttactgtc tggcgcatta atttaccaaa 2040 cgataaagcc aattgggccg acggtcagtg caattgtcca gcatttttaa aaaaatttat 2100 ttgcaagcat gttgtcggca tttcaatcag acttaaatat tgcaaagtac cggcagtagc 2160 taagaatact tctattggaa gtaaactacc aagaggaaga ccaactaaag ctaaaaaagc 2220 cttattaaaa caataatttc aaatattttt ataatagtct ttgtgaaata gtctttgtat 2280 ttttatggat ttttttattg tgtcactttt tgtttattct tatgaatttt tcattgtttt 2340 acttagtttt tgtatttttt tacttataaa aattaaaaat attacctata ataaaataat 2400 taaataccta atgaattcta tttttttact tatataatta atataatact tagcatttta 2460 taaatattta cttagaaaaa aatacataaa ggcactaata aaatacatag cgtttagtaa 2520 aaaatacata gcgaataaat aggagcc 2547 // ID DNA8-104_AP repbase; DNA; INV; 340 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-104_AP. XX NM DNA8-104_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-340 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2042-2042 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 340 BP; 114 A; 57 C; 68 G; 101 T; 0 other; cagaggtatt caaactgtgc gtcgcggctc ccaggggcgt cgcgagagtt gtcgaaggga 60 gccgcgtcaa agcaccggaa accaaaatta atgtaggtca agtataatta atataaatat 120 tatttataat cattaaagcg attatggaaa cgataatgaa acacattacc cgactatttg 180 aagttttatt tctagtacta aagtcactgt tattatcacc catatattgg ctataatatt 240 gtatccgtat tgtttcaatt ttttcataat aatttgggag ccgtaaaaat attgtgaaca 300 cacaaaggag ccgcgggatg aaaaagtttg aatacctctg 340 // ID Gypsy-35_DWil-I repbase; DNA; INV; 5330 BP. XX AC scaffold_181143; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_DWil_; KW Gypsy-35_DWil-LTR; Gypsy-35_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5330 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181143; Positions 337177 342506. XX CC Positions [4294-4770] - Integrase core CC 'CGGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1864..2748,2752..5142) FT /product="Gypsy-35_DWil-I_1p" FT /translation="MRTTRSSVTQMPGGKRDAEQNEGGVALRAAEERVKTS FT KNCMLEIDIVNRLRPIYNKLEYKQSNLPISAYSHQQFSEKRNPLGLVEETN FT RKKAMSNRVKRVREQYRQRRTKRQEVCSALEAVSRQKENRAFATVKMQMFH FT VEGLLDTGASVSFLGDGCEDLVNKLELKRKTYYSKVQTAGGVNHAIKGKVD FT VHMTYRGRTQEMTLYLCPSLRQRLYLGIDFWRKFELAPDLLGVEELSIEDE FT EFNSKKEPVEPHVLNETQQEVLASITTKAEVVVPRGRRNAQIGSDRTKRKP FT VEQGNIGAETWEKLRLCLDVRKLNKVTIKDAYPLQNIESILSRVEDTIFIS FT SVDLRHAFWQVELEVESRKYTAFTIPGRPLYQFRRMPFGLCNAAQRLCRLM FT DKVIPQGLRSHVFVYLDDLLIISRTFEENIEHLKAVAECLKKANLTIGLKK FT SKFCFRFLKYLGFVIGDGKLKTDPDKVKAITEIPLPNTVRQLRSFLGTAGW FT YRRFVQGFAEKAAPLTDCLKSKGKFKLTKEATESFNLLKQALTNAPVLVHP FT DFSRRFFIQCDASHVGLGAALFQQDDEGNERPISFFSQKLRGAQLNYTVTE FT KECLAAVKAIERFRPYVELMPFTVITDHSSLQWLMTLKDLNGRLARWSLAL FT QSYDFVIEHKKGKDNVVADMLSRTAVIEELNFFDFETTEFESEEYRQRYKC FT VLSNPDKLPDLHAEDGLVLKRMRITDNEIADVAWKLWIPEALTHTLIQQAH FT DSEEAMHGGVARTLGRLKQFYYWPRMAVQVKNYIAACDTCKEAKHTTQTTR FT PTMGAEVCTSRPIQKLYLDFLGKYPRSRKGNAYILIVLDHFTKFVWLKTMP FT KATSAATIKFLREELFNTFGVPEIVHTDNGKQFTSKEFEEMVSRFGITHTR FT TAAYSPQANASERVNQSILAAIRTHVGEDHTHWDVKIPEIQAALRSAVHST FT IGTSPYYAMFGQHMFRNGGDYRLARKLDADAAVEIKQLLREDQLQILQEKI FT VARAHQAFEKSKKRYNLKAREIQYRPGQEVWKRNFVLSDFKRNINAKFCKR FT YSRCRIMRPVGKNMYELENLHGIRLGVFHAKDLKQ" XX SQ Sequence 5330 BP; 1740 A; 1077 C; 1378 G; 1135 T; 0 other; aatggcgccc aattaaaaaa ataaaaattt aaaacaaaga aacaaacgga agagtataac 60 aaattaaatg tcccaagtac tgctagcatt aagaaggcat aacttagtct ggttctgaaa 120 aagaaagaag agaaacagat cctttttaat attgaactag cacgatataa gaccagatac 180 ttatctaaga ccagaatcag catagtagca attcagccgt gaaccgtgga gatttcaaag 240 aagagatcac atagatgtcc gacctgaaat gaattattga aatggtaaag gtcaatagta 300 cggaatgatg ggtacagagg aaattaaaat ccaagaaata ggatcaagtt cagggaatta 360 aagttctgga atcatctcac ctccattaac aaagaattaa caaaaaaaaa aaaaaaaaaa 420 agagacaatc aaagcataga acaaatagaa aggaatgtca gtagaagaaa gagataggat 480 atcaaaaatc agaaaattta gaagaagctc agaatactta aagaatagag gaacagaggt 540 aatgggtccg ccaataactc gatcccaatc agaaaaccgt gcagactcac gaatttcaaa 600 cgtttcaccg ccgacagact tcaacatgtc aacagaccga aagatcaacg aagaggatgg 660 tgccgtaggg ggcagcgggg caatcccaaa acataatatg ggaacaagaa tgattcacca 720 ggcaccagaa gatggatcat atatgggacc ggaaaaccca cggccagccc tacccacgga 780 aatcacatcg gctataaatc tagctgtagc acaggcgcaa gagacgtacc gagcgacaat 840 gtcagcagag aactctcgat tgcaagcaca aattcgccaa gagatgctag gttttatgaa 900 gcagattcgg gaggcggttc aagaagcggc tcagcaatcc accacgatga gatccaccaa 960 tgtccaagga gagagggctc caatccatcc tacagaagcc atagaccatc gagtattgga 1020 gcgacagcaa cgtgagaatc tggaaagtcg tccaccacaa cgatcttcct ggcattggcc 1080 tgaagatacg aggcaacaat ctggagtgcc aataacaacg gaacagcgtc ccgaaatcgt 1140 acatccgaga agatggggtc tggtttttga cactggcatt cctagtaaat gaatttatat 1200 ttcgtctgga gcacttacag acggcgtacc aagtaagttg gggcgaagta ctgcgagatt 1260 tccactgctt ggtagaaggg caggccaagg actggtactg gatgcacgta cgctcagcag 1320 gaccagcaga ttggccgaat atgaagcacg cactgcagca gagattccag acgcaacgat 1380 caaactttga gcgggaacgt gagttgcgag agcggcggca aaggacagga gaaagcgtgg 1440 atgaatacct ccaagcgatg tttagcctcc gatcacgctt agaaagcaac ttctcggacc 1500 atgacctgat cagagtaatt aaggttaata tcaaagacag tatatcgaaa attatttatc 1560 caatatggat aacaaacctg gagaagttgc gagaggaatg ccatgaggcg gagagactct 1620 tggcaactag atggtctcgt gcatcgggag gagcggagcc gcatagagtg cagagggatc 1680 ggcatcaact aacaaagcgg ttttggcaga gcccgaaggt catgaagtgg agtttgttga 1740 agcggtgtat aagccacgcg aaggcagcaa ggatcggcca gaactcgcct gttggaactg 1800 tgcggagaag ggccatacat tttgggagtg cgagagtgta gttcgcaaga cattctgcta 1860 caaatgcgga ctaccaggag tagtgttacc caaatgccag gcgggaaacg cgatgcggag 1920 caaaacgaag gaggagttgc gctcagggcc gcggaagaac gagtgaaaac ctccaaaaat 1980 tgcatgttag aaatagatat tgttaataga ttaaggccaa tatataataa attagaatat 2040 aaacaatcta acctacccat atcagcatat tctcaccagc aatttagtga aaaacggaac 2100 ccattgggat tggtagaaga aaccaatagg aaaaaggcca tgtctaaccg agtcaagcgt 2160 gtgagagagc aatacagaca acgaagaaca aaacgccaag aggtttgttc ggccttagaa 2220 gcagtgtcaa ggcaaaagga gaatcgggca ttcgctacgg taaagatgca aatgtttcac 2280 gtagaaggtc tgctagacac gggagcttca gtgagtttcc tgggcgacgg atgtgaagat 2340 ctggtgaata aactggaatt aaaaagaaaa acatattact cgaaagttca aacggcagga 2400 ggagtgaatc atgctataaa gggaaaggtg gatgtgcata tgacgtatcg aggtagaaca 2460 caagaaatga cgctgtacct gtgcccgtca ctgcgtcaaa ggttatactt aggcatagat 2520 ttctggcgca aattcgagct cgcacctgat ctcttgggag tagaggagtt gtccatcgag 2580 gatgaggagt ttaattccaa aaaggagccg gtggaaccac atgtcctcaa cgagacgcaa 2640 caagaggtcc tggccagtat caccaccaaa gcagaagttg ttgttccaag aggtcgacga 2700 aatgctcaaa ttgggagtga tcgaactaag cgaaagcccg tggaacaata gggtaacatt 2760 ggtgcagaaa cctgggaaaa acttagactt tgcttagatg tacggaaatt gaataaagta 2820 acaatcaaag atgcttaccc gttgcaaaat attgagtcca ttctgagccg ggtcgaggat 2880 acgatcttca ttagtagtgt tgacctaagg cacgcgttct ggcaggtaga gttggaggtg 2940 gaaagtcgga aatacacggc atttaccatc ccggggcgcc cgctgtatca gttccgcaga 3000 atgcctttcg ggctttgcaa tgcggcgcag cgcttatgcc gactcatgga taaggtcatt 3060 ccccaaggct tgagaagcca cgtattcgta tatctggatg atcttctgat tatatcccgc 3120 acatttgaag agaacataga gcatctgaag gcagtagctg aatgtttgaa gaaggccaat 3180 ctaactatcg gattaaaaaa atcaaaattt tgcttcagat ttcttaaata cctgggattt 3240 gtgatcggag atggaaaatt aaagacggat cccgataaag tgaaagcgat cacggaaatt 3300 ccattgccaa acacggtgag acagctccgt agctttctag ggacggccgg ctggtatcgt 3360 cgattcgtcc aaggctttgc agaaaaagca gcaccgttga ccgattgcct gaaatcgaaa 3420 ggaaaattca agttaacaaa ggaggctacc gagtcattta atttacttaa acaagcccta 3480 accaacgctc cagtattagt acatcctgat tttagccgca gattcttcat tcagtgtgat 3540 gcgagccatg taggattggg agctgccctg ttccagcagg atgacgaagg aaacgagagg 3600 cctatctcat tcttttcaca gaagctacga ggagcccagt taaattatac ggtaacggag 3660 aaggaatgtt tagcagcagt caaggcgata gagagattcc gtccatatgt agagcttatg 3720 ccgtttaccg tcatcacaga tcattcgagt ctccagtggt tgatgacgct gaaagattta 3780 aatgggagat tagctcgctg gtcattggca ttgcagtcat acgatttcgt gatagagcat 3840 aagaaaggaa aggacaacgt ggtggcagat atgttgtcgc ggaccgcggt aatcgaggaa 3900 ttgaattttt ttgattttga gactaccgaa tttgaaagcg aggagtaccg gcaacgatac 3960 aagtgcgtgc tgtccaatcc agataaatta ccggacctac atgcagaaga cggattggtg 4020 ctgaagagga tgcggattac cgataatgaa atcgcggatg tggcgtggaa attgtggata 4080 ccagaggcat tgacccacac actcatacag caagcgcatg atagcgaaga ggcgatgcac 4140 ggtggagtcg cacgaacgct cggcaggctt aaacagtttt actattggcc acgaatggcc 4200 gtgcaagtca aaaattacat tgctgcatgt gatacgtgta aggaggcgaa acataccacg 4260 cagacgaccc gaccaactat gggagcagag gtgtgcacgt cgcggccgat acagaagctg 4320 tatctggact tcctgggcaa atatccacga tctcggaaag gtaatgccta tattctaata 4380 gtgttggatc actttactaa gtttgtgtgg ctcaaaacca tgccaaaagc gacgtccgca 4440 gcgacaataa agttcttgag ggaggaactc tttaacacgt ttggagtacc ggagatagta 4500 cacaccgaca acgggaaaca gtttacctcc aaggagttcg aggaaatggt gtcccgtttc 4560 gggatcacac atacgcggac agcagcctat tcaccccagg caaacgcatc ggagcgggtg 4620 aatcagtcta ttttggcggc gataagaaca catgttggcg aggatcacac gcattgggat 4680 gtaaagatac cagagataca agcggcactg agaagcgcgg tgcactcaac gattggaaca 4740 tctccgtatt atgcgatgtt tggccaacac atgtttcgca acgggggcga ctaccgattg 4800 gctcggaagc tggatgctga tgcggcagtg gaaattaagc aactactcag agaagaccaa 4860 ttgcaaatcc tccaggaaaa gattgtggcc agggcccatc aagcctttga aaagtcgaaa 4920 aaaaggtata atttaaaagc tagggagata caataccgac caggacagga ggtgtggaag 4980 cgcaacttcg ttctcagcga cttcaagcga aatataaatg caaaattctg caaaaggtat 5040 tctcgatgca ggataatgag accagtgggc aaaaacatgt atgaattaga gaacttacat 5100 ggcatccgat tgggagtttt ccacgccaag gacttgaagc agtagatgca gatgcctgta 5160 gttttgcccg gctaactaat gattgtggcg gaaaaatatg atcggctgac tgagagaagg 5220 tatctcaggg tgtgggcgag ggcctgcgtt gggcgagaat gatcctcaga acctagacaa 5280 ggcccacttt tcactcctgt ttcctggtaa gaccgaaaac cgaagtgtgc 5330 // ID EnSpm-1_AA repbase; DNA; INV; 10870 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 22-OCT-2010 (Rel. 15.11, Last updated, Version 2) XX DE EnSpm-1_AA, a family of autonomous EnSpm DNA transposons - a DE consensus sequence. XX KW DNA transposon; Transposable Element; Autonomous DNA transposon; KW EnSmp; EnSpm-1_AA. XX NM EnSpm-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-10870 RA Kapitonov V.V. and Jurka J.; RT "EnSpm-1, a family of EnSpm transposons in the mosquito genome."; RL Repbase Reports 8(12), 2107-2107 (2008). XX DR [1] (Consensus) XX CC EnSpm-1_AA is a young family of autonomous EnSpm DNA transposons. CC The consensus was built from 5 copies ~99% identical to it. All CC five copies are flanked by 2-bp TSDs. This transposon contains CC several copies of different transposable elements: pos. 2692-4554 CC (6-bp TSD), 4616-4850 (TA TSD), 7744-7996 (TA TSD), 4851-5015, CC 5013-5220, 6234-6408, 6593-6705,6772-6916, 10282-10495, CC 10540-10680.These transposons will be annotated later.TIRs are CC only 9 bp long. XX FH Key Location/Qualifiers FT CDS 7952..9949 FT /product="EnSpm-1_AAp" FT /note="EnSpm TPase." FT /translation="MFILMSIFIRHNLSNVALCDILELINLIVGFKSLPVT FT YTEFSHFFTKNSYSRHYVCKKCELYIGETMGTCPNCESSSNFFFVTFDFVS FT NLRDILTRNWDAIVEHQVKSKLSQHVTDVLNAGVATAKDIKNAITLTLNTD FT GVKIFNSNVKRSLWPLIVCINDLPANIRFQRKNILIAGLWLHDGEPNLDVF FT LKPFSEMIQQLYYHGLNLXSVLLERVYVIACCVDTKARCKIQNFKQFNGYE FT ACSFCHHPGDIAKKQIRYSYKSHIPKRNLVDTLRAMAVSNSRGVAVNGVKG FT ISPLIRIPEFDVVRNCPVDYMHGILLGVCRQLCRTWFECPTSPCYIKDKIC FT SIDDMLTTIAPFVESSRNARKISDRHSWKANEWLQWLLHYSPVCLKLYLPK FT EYYDHYYLLVSSITLLLGDEISTDDFKMSETMLQQFVQQFEQLYGLDEMTY FT NVHLVSHLVDCSRDYGPLWAFSLFVFEDINGVLKKFVKGPREPIVQITNRC FT IMSHTKNNADINFMSPSVKAFWNKLGRMPQNRKEKKVVNFHLHKSFANKYS FT ENRIFEKQNSYTHNNYIFKPKSKNEYPHEKKEHNNCYFSILEGSELLYGEI FT HCILNDTLGTYFFYKAIQPEEITKHYCKAQILEEFYLVRLNASLTKQIKMR FT VDGNDYLSKVQYKLHID" XX SQ Sequence 10870 BP; 3510 A; 1988 C; 1950 G; 3408 T; 14 other; cccgcatagc cgggaaccaa atgcgtactg tttgctgaat tatccacaac aatatctaac 60 cattaatgaa cagttccacg tgtcatcaaa ttttactata tcccaacact acagtgaatt 120 gtttttacaa tacggtatat tgtaatttgt caaattacta tgtggggaat aggcggttaa 180 gttatgtaac aataaaaaaa tgccaatttt cccacacaaa attttgcctg aacggttggg 240 gaaatggttg acatattatc cattttttcg taagattgac tataagtgga acggttgcga 300 ttaagttgaa caataaaaaa aaacaagcac tatgatcatt aatgaaaaca tcgatttatg 360 gtgggataca gtaatcttac catttatcat gttattatgt tgttgttata ctatttaaaa 420 caataccccg tacatttagg ttatattcca acagttagat ctaaaataaa ttaattgact 480 atatatacat gcaatggaat aaataaggac gtgcggaaaa cctacttcac tgtaaacaaa 540 aacaaataat gttgaaagtt acttgtaaag agcacgcttc aacgatattt gtttgtgaaa 600 tgaagtaaat tctaaccaca cgtactcaca tattccatta catctgaaag ggtttttcca 660 acagtgaata aaaagtaaat tctaaccaca cgtactcaca tattccatta catctgaaag 720 ggtttttcca acagtgaata aatgaacgcc aggatttctc tgttcaacga acgacgttaa 780 catattttga taacaatgtt ttcccattac ccatttgaaa tggttaaata aaatatcgaa 840 ttcaatacaa tatataacag tacaagaatt caattaattt gacaccaagt cagcataagc 900 ttacttcaaa ctcgaatgtt tccagcgtct tgtccgctgc cttcaacaat tccccgagat 960 ttgcggacct cggctgcttt gcgcaaatta gcgattttct ggcagatgta ttttctgatt 1020 aatgttacat ccgaatgtct tcggatggtt gccatattgt cgggcccaat ccttattgcg 1080 gtacgttcag caactttatc tgtgaaaaaa cataaaaaac acttaaaaca acttgtgatt 1140 ttaattaaat tatttcgtaa acctaccgca aacaaactcg atgatgtttg ggtcgatttt 1200 ctcggaggct ggatggacca agcttccatc agggtttttc agattcatga agcggcggct 1260 agcctgtccc gtcacactca tatttgccag cctcgctgga ccgatcagcc gcgtgactaa 1320 aagaccaaca aataagctat cattaccggc acgggcgttc agttcctcca gctcattagt 1380 tgtgattgcg atcccgtcag cctcggtaaa atgaacctaa gaataaaaat ataagctgtt 1440 atttaattaa taccaaattt ggaaaccaat catacacacc taaggtacct tctgcttgtc 1500 cttaagttgc attagttcct tgccaagacg caaaccgtag tcgtctgatg ataatttctc 1560 ttgttccaga ttggaaaact tctgtttgag tttcttatgc cgtgcaagta gttcgttgta 1620 ccgtttctcc aaaaggtgat acttttgttc ccagttatga tgttcaataa caactacttg 1680 agaagcggtg tctttctgaa cttcgatttg agcacttgat gggattgaaa gtgggtttgt 1740 atttttcgca accaaaggag aacggttctg tacaacccgt ggctgagagc tgtcgagaca 1800 acgatcatca ttaaattgtg ggctgtcctg atcaaacacg cgttgtgagt catgttgctt 1860 gccgttttga gggttttgaa tattaatcgc cggaatgcta ttagtcctgc cagtatttcc 1920 agctgtcggt actggtagct gagatgcagg tggctgcgtg tgattttctc gagtgtgaat 1980 ggaacttctg gcttggatgt ttgttagcgg catttctgga gacattggtt cttgcgattt 2040 cttggtgaca ttggtttctt cgatgtggcg cattttattt tttcagccat ttgacgcttg 2100 agagcggctt caagctaaaa ataaatcaat ttttaattaa agaataatta ttaaaataac 2160 gacctctatt tgaacatctc caaaattcac tttggcttca cggtttctta gatcaaattc 2220 tcgaataagt attctaaaag atttatctgt gagaagtgcc caaacttaga ttatccattc 2280 ctcgtattcc gcagacaaaa ttcttttatt tggcccgttg attattccac caaaatgaaa 2340 ttagaacaaa cctgggggat gcgagctgga aatcaacaag cagccaggct gctatcaatc 2400 ttaaattttc gtaaaaatga tctgagtgtg cctagaaatg tacggtcatt ataatttata 2460 taacacaaag accaccaact gttaagaaaa gtacttgcag ggctggtagc agtcagacag 2520 taaaataata gtgactttag tgaccaaaat gataaaaata gtgaccaaaa agtgaccaaa 2580 tagtgaccta aaagtgactt caaaactaca tgtaatagtc atgaaaatga ctattatatg 2640 tagttttatg tgggtatcaa ccaatctaca ttttgggtga atttagagtg tagagaactg 2700 cactgcccat gttcgcatag tcgacgtaag cgccaatatg aaaattcata ttttcgttgc 2760 tatgcatttt ttgcctaata acacactcgt tggacttgcg aacagcactt acttgttagc 2820 acatacacca ccttatgaga atattcacac ggtttgtcta gtgtttgttc gaatagttta 2880 gcgtataaac caaaaaaact tgatatttta gtttttttta tacaaaatag tcaaaatcgg 2940 aagacttgag atcatgaatt aaattaaaac tgcaaagttc aataacaaac cttctaagac 3000 tgtcttcgaa attatgcgtt agccatagac catgattgat tcctggaaaa awwtttgatc 3060 tgrtgaaytt awgwtagatt tttttaacag aaacggtatc tttgaaaaca ttttattttt 3120 ctcctttgaa gctggctaaa tttttgacat aaatctaaaa ttgcatctta caagcaatct 3180 atcaaaaata tggctaagaa ttgcgtaaaa aacttaacaa aaatttgact aagattacct 3240 aaaggacctc rcattttatt tacgttaaaa gatttttcca tgaatttatg aacaatttta 3300 cttgaaattt gttgggaatg attgsaaaaa tacactgtaa atctgcagaa atcaatgcaa 3360 aatctcatca ggggcctcat gagaactttg tcatagatca akgaataact aataggtaat 3420 ccagacaagg cttcatcaaa aatcagtatc atctcagaaa tccgccaatc gtacttctga 3480 attaatctgt caaagatttc agtgaatctc ccaagaaatt tgtcaaatat tcatttagct 3540 caggttttca taaatggcaa tatcatcaac cataacaact tactacaact tattttggtg 3600 gtttgtaggt ttagtgcttg gagatgttcg ttccaaatta ttttcccggt gaccatctta 3660 aatttccggt tttcccgtct ttttccccgg tccagtagcc accctggaat tattgaaaac 3720 taatcgtatc acttagtcac gactcgtamg ttttatctgt tttacgaccc gttttgtata 3780 gaaatagttg acctaagatg aagtgatttc tatcaatatt agtactcgct tttgatgtcc 3840 tggacgagtt tctaacaaac tctttgattt ttkgtttgtt attaaaatcc gtctctggct 3900 tatctccgga aggaattttg atgaggatct ttcaaataat aggttaamga ttccttatac 3960 agtcacatat tattgcagaa agttcactag taatttattc agcagatctg tcagataggg 4020 caatagtatt cgccaaaatt tattaaaata ttacgactac aaaattcagc ccaaaccttt 4080 cccagatttt ttttatattt tccgttgcaa atggctacga catattcagg attccctatt 4140 tttttaacct taaattttca aatcatactt atttgtcttt ataataatac atatcttttt 4200 ctttcaattt gtgtttgaaa agataaccca gctttaaatt tcaataccgt tagatcacga 4260 aaatagtgac tttagtgacc aattttctga aaaaagttac tttgatgaca ggtcgaaaaa 4320 tgtgagcaag tcactgaaaa gtgacttgct accagccctg tacttggtct aaattatttt 4380 gtatccggta gaaattaggt agaaatcaag actaacactt acagtcaact ttctttcgtt 4440 gcactaaacc cgtggcccac ctagtgaaag actttcacta atcgggctaa gtgtcaaata 4500 accgaataga aattcagcgc taccaacatt gacaaattag ggtttatgtc agaaagtgac 4560 gtttgccttc gttttaggtg ggctatagcc cacttaacga gagcccaatt gaaacgaagt 4620 gcaattaaac gtagtgcaac gaacgaaatt cgactgtatt tataaatatt gtttcctcag 4680 gccccgtgct aatatgtgaa tagtgcgctg tgttcataaa aatcgtgaga aagcagcgcc 4740 acactaaaaa actgatttca aaatgcggcg agttgaggcg cgtcgcgagt ctcactggcg 4800 caatccggtt tggtggcgat gatcgatttc gctctatcga tcctctcttt tgacatttaa 4860 catttttcca cctcagaacg caagatttcc ttactgccta gcaatctggg gtggaacaat 4920 attaaaaaaa aaactagcaa cttgtcactt ttattacaag agaagatcga tgaagggata 4980 tcgatcatgt catttacgcc ctcattctag cacatgcttg atataatctt gtataatact 5040 caccatagca cgtatttcct caacgcccaa ctcatgagca accagttcac aagactccaa 5100 aacattgttc tcagattgag cgtcctggtt tgtcggataa atttcagcta ccgtctgctt 5160 gctgactccg ctaacgcttt cgccttcttc aacggccaga tgaaagggca aatcctgaag 5220 aaacatacta ctggatgcag tttcagctgc tcccgctccg gatcctgtat cagttgatct 5280 gtttggtgtt ggtttcagca acagtgtatc gaaaaagtta cttgctcttc caccatgcac 5340 ggatgtttca ttttcagttc cgcttctgct ttcagtttcc tcaccatcgg atgcatcttg 5400 atctggatca tgatctgctt gaaactcggc cagtggatct tgtgattcaa actcggcata 5460 attcgaccca cttgccaccg acatatatgc atagccggtc ggatcgaatg tgtcgccttg 5520 gtttgccagt gctgtttgca gctgtgaata aaaaaaaaac aaaacataag cgatattata 5580 atttcaataa tataaactca cttaaggttt gctaattaaa aaaggtcatt taggccaaat 5640 attccctggc ttatgaaaaa tataaaagta tacaaaacgg ccagactgag aatattagca 5700 atattagttt gaagctcaaa aaactcacca aaacttgcaa atctgctacg ttcatctgct 5760 gagcagccat tttgttgcat ttggcagcag gctcaatcac acgggattct gaaacaatat 5820 ataattggta tttttctaaa attctaatta gatatactta ctttttcctc tcttagcaga 5880 aaactcagtg gcttgctttc gttttggagg catggtgacg attttttacg aattccgacg 5940 cgattaaaat caaaatatac agccaacttg cgcactacgg cacgtactcg ttgataataa 6000 tgcttttttc gctttgagat atcacggttt gctacacgca ccattgagct ctcaccgagc 6060 ggtgtcacga acactcgcgc aaatttgctg gttgaaaatt ctgccgtttg attttgctgg 6120 gaacagttga tctttgttta tattttcaac cagcaatctt taagtagtgt tggtgacatg 6180 taacgtgtat gtacacgagc gcagaaacca ctatatttgt tcacaaatac caatgctttg 6240 gaatgtttgc caaactgtta taataatatt tcgtacagtt agctacgact gtatatgtga 6300 rattcgatgt cgactgcttg agtttgcatg ggaagaagcc taacaaagga aaagcgctca 6360 atgacaaaaa gaaataattt aattagttgt aaaaaaaata aataaaagtg gcgtcaatgt 6420 caattgaaac agcagccaac tgtcattccc ataaaaaacg ggttccaatc gaacagtaga 6480 cctttccaaa ctggaaaata acgacactct taggggccgt ccagtaatca cgtaagaggt 6540 aataatgccc atactcgcac aacagtccca ggcagtaatg ggggaggaga ggtttgaggt 6600 aacttacgcg ccatacatat tgtgttttat attcataaaa aaataaaatg gtaaggggat 6660 gggaggtagg tatgttgaaa aaccgccaaa tttgtcttac gtaattaata gataggccct 6720 tattcaaaga aactctacgt aatgttttaa tctgtctgga gatttacttt acactctgcc 6780 tgatattgcg acgaattcaa gaataggtaa ttttcatttg ttaagcttga tactaaagtg 6840 ttttgattga cacaattttg catcgaagtt cccaatttca ctccgcagaa atattttttg 6900 aattataatt tacaatgaga cgatcacgta tgggctatat ttttgaggcc aacaaaaaac 6960 cctccagagg acacattttg agaaaggtat gtcatgatga caactataat ttccagatgt 7020 aaaaaacagg tattttgttt acagaaacaa cgccggaaca ttccagctcg tgagcccagt 7080 tatggaaacg aagtccacgg cgaacaatcc caggaaatat gtgcaagtca caacacttcg 7140 aataatcagg tttctctgca tcaaaaccga attgtgagta tcgttcatat tcctatgaat 7200 gtggtaataa ataggaatag tggtattatc actccaatta agggaagttt acgttagatt 7260 tcgcttttag tgtccattct gaactatcac agcttagata aaaagtgtaa taaaatttat 7320 tccatttcag gatccctgta tggttaacga ttattcaatt gataatgcgt ttactaagga 7380 ggatgatgtt cccggcgaaa acaataatag tccgcctgaa ttcatggaaa ttggtgaacc 7440 aatcattgac ttcagcgacc tgaattctac agataacaaa catcgaatct acgataatga 7500 tgagaatcct aatatggaag cgccgaaacg cgataacatg gtaattgtac actcagaaac 7560 acgggtagtc acagaattac taaattttag taatttatga ctacttcatg tctgcctctc 7620 ttccattctg cagggaattt tattcctatt tgtcaatacg cctgggttga agcatcgcta 7680 agctgcaacc caggcgaaat gacatttagt gtagaaattc acagcagaat gaaagagagg 7740 cagatagaaa atagtcatta atgcataaac tttaattctg tgactatttt tatttctgag 7800 tgtagtttaa tatatttcgt gtgtgctaga ataagggcgt aagaaggatt gtgtatggat 7860 agtatagaaa gattttattg ttttatttca ggaattcacc tgtacgaccg aattgctcca 7920 tcctagtttg aaattatcta aagaagaggt aatgttcatt ctcatgagta tattcattcg 7980 tcataatctg agcaacgtgg ccctctgtga cattcttgag ctaatcaatt tgatcgtagg 8040 attcaaatca cttcctgtta cgtatacgga gttcagtcat tttttcacga aaaatagtta 8100 cagtaggcat tatgtatgta agaagtgcga gctttacatt ggcgaaacaa tggggacctg 8160 tccgaattgt gaaagctcaa gtaacttttt tttcgttacc ttcgattttg tgtcaaatct 8220 tcgagatatt cttactcgaa attgggatgc cattgtagaa catcaagtaa aatccaaact 8280 atcacagcac gtaacagatg tactgaatgc aggtgtagct acggctaaag atatcaaaaa 8340 tgcaattact ttaacgctta acacagatgg agtaaaaata ttcaattcaa acgtgaaaag 8400 atccctatgg ccgttaatcg tctgtattaa tgatttaccg gcaaacattc gtttccaaag 8460 aaaaaacata ctgattgcag gtttatggtt acacgacggt gaaccaaatc ttgatgtttt 8520 cttgaaacct tttagtgaaa tgatacaaca actctactat cacggattaa acttagratc 8580 agttctgcta gagagggttt acgtaatagc atgttgcgta gatacgaagg ccagatgtaa 8640 aatccaaaac tttaaacaat ttaatggcta tgaagcctgc agtttttgtc atcaccctgg 8700 ggatatagct aaaaaacaaa tacgttatag ttacaaaagt catataccaa aacgtaattt 8760 agtagatacg cttcgggcaa tggcagtcag caactctcgt ggagtagctg ttaatggagt 8820 taaaggaatt tcgccattaa taagaattcc tgaattcgat gtagtccgta actgtccggt 8880 cgactacatg catggcatac ttctcggggt gtgccggcaa ttatgtcgca cgtggtttga 8940 atgtcctact agcccctgtt atattaaaga taaaatctgc agtatagatg acatgttaac 9000 aacgatagct ccattcgtgg aatcctcaag aaatgcaaga aaaatttcag acagacattc 9060 atggaaggcc aatgaatggc tacaatggct gctccattat agtccggttt gtctgaagct 9120 gtatttgcca aaagaatatt atgatcacta ttacttacta gtttcgtcaa tcactttgtt 9180 gctaggggat gagatatcta cagatgattt taaaatgagc gaaacaatgc ttcaacagtt 9240 tgttcaacaa tttgagcaac tctacggttt ggatgaaatg acttacaatg tacatcttgt 9300 ttctcacctg gttgactgct cgagagatta tggacctttg tgggcattct cactatttgt 9360 ttttgaagat ataaatggtg ttttaaagaa atttgtgaaa ggtccaaggg aaccaattgt 9420 tcagataaca aacagatgta tcatgtcaca tacaaaaaac aatgctgata ttaatttcat 9480 gagtccaagt gtgaaagctt tttggaataa gttaggtaga atgccacaaa atcgcaaaga 9540 aaaaaaggta gttaactttc atttgcataa aagcttcgcc aacaaataca gtgaaaaccg 9600 catattcgag aaacaaaact catatacgca caacaactac atttttaaac caaaaagtaa 9660 aaatgaatat ccacatgaga aaaaggaaca taataactgt tatttttcga tacttgaagg 9720 ttctgaactt ctgtatggcg aaattcactg tatattaaat gacacattag gaacgtactt 9780 tttttacaaa gctattcaac cagaagagat caccaaacat tattgtaagg ctcagatttt 9840 agaggaattt tatttagtaa ggctgaatgc gtctctaacg aaacaaataa aaatgcgcgt 9900 agatggaaac gattacctaa gtaaagtaca atacaagtta catatagatt gaacgcagaa 9960 tatgaaatgt attccttcat gtataagact ttaaataaaa taataaaaat taccaatctg 10020 aaaaaaatat acacctcttt gtttacattt ccatagagct atctgactta aaaaagaaac 10080 gcaacattgc atcagttgat ttgattagtg tttccctatt ttgccataac aaatcttgaa 10140 aactttcgca ttttgcatag ttttgaatgc tttagatcaa tatgtatatt ttcggtataa 10200 aaaataaaat acacaaaatg atttaaaatt tgaaaacaat catagaacca cgtacagcga 10260 aacagggaag cattatgttc ataactagta cacctaccct aattatagat acaaagtttg 10320 acctacgtga gtcgtattcg aacttatatt ccatgatgga cttatggaac catagtgaaa 10380 acaagattta ctagttactt tgatgctccc atttctgttc caaaacaata tttcgatgag 10440 ccgatgatcc cttaaatatc gactcttact tacaggtttg aatgtaataa cagatgggtg 10500 tttagatgaa aagaaatcgt ttaaaaagtt gttttaaatt gaatcttcca aaagaaaata 10560 tctgtttaca tcacacgatg atatcgaaac gtaccataac tctaccccgt tttgttacta 10620 ttactcaaat ggcctccata ttatatttta cagtcaagag gtttgcttta tcgatatagt 10680 aacacaatat ttgaatattt ttacatattg tcacaatttg gaatacagta ttcaaattat 10740 tttgtatagt tttatttgtt gtggctcata ctgttagact ctataacaac tatttcagat 10800 acagtaacaa tataccgtag aagattctta ttgtttggat ttcaaccgga atagtttttt 10860 tctatgcggg 10870 // ID HITCHHIKER repbase; DNA; INV; 582 BP. XX AC S81657; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE transposon Hitchhiker. XX KW Harbinger; DNA transposon; Transposable Element; KW 3 bp target site duplication; HITCHHIKER; transposon. XX OS Autographa californica MNPV OC Viruses; dsDNA viruses, no RNA stage; Baculoviridae; OC Alphabaculovirus. XX RN [1] RP 1-579 RA Bauser A.C., Elick A.T. and Fraser J.M.; RT "Characterization of hitchhiker, a transposon insertion RT frequently associated with baculovirus FP mutants derived upon RT passage in the TN-368 cell line."; RL Virology entry [NCBI gibbsq 177068] from the original journal RL article 216(1), 235-237 (1996). XX DR GenBank; S81657; Positions 4 585. XX CC 39-bp imperfect inverted terminal repeats. CC Trinucleotide target site, TTA. XX SQ Sequence 582 BP; 142 A; 153 C; 136 G; 151 T; 0 other; gggccctcgc ccacggcgac tttttgtagc gatgcagtcc cgcgtttagc aaacgcgcga 60 cggcatcact tctaacgcgt gttagtcgca tttgcaacgc gctactgcat cgcgattgta 120 tcgatacaaa cgctcgtgcg cttcgcgcta ttaacgcgaa agcaattcac aaaacgtcga 180 atatgacatt ctataacaac ttttgaattt aagatcataa tttttgagtt gtttgaatgt 240 tgttacatga tgtccacaaa gtaatctctg ggtattaaat ggatgcatcc aatattgacg 300 ctgcttccgt tgcctacgct tcggctgctt acgtaaaagc agcactaaag acaacgtgac 360 taatgttgcc tcaacggagt ccatcttgca actgtactga tccaagatcc actgatccga 420 atcacgtttg catcgcgacc gaatcgcttt tgcatcgcga ctgaatctgg tcccgtgggc 480 gcgggcacac agctagtgac tgcttcgcgt ctgatcgatg ctacgcgcga gagcatcgcg 540 actgcatcgc tacaaaaagt cgcccgtggg cgagggccct ta 582 // ID Gypsy-89_AA-I repbase; DNA; INV; 7106 BP. XX AC supercont1.279; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-89_AA_; KW Gypsy-89_AA-LTR; Gypsy-89_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7106 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.279; Positions 1508204 1501099. XX CC Positions [3542-4051] - Reverse transcriptase CC Positions [5156-5632] - Integrase core CC 'AATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 444..2849 FT /product="Gypsy-89_AA-I_2p" FT /translation="MAVSIQELVAEKLKQHFFDIRVDHLNEDELDFELEIR FT EIVFDDNESMTRKRRALREMLKNEKSAENPVYMLKRSPSEDFQACIAKFAE FT IEGTIKISAKGLPPRCQSRLLHLGNRLLLLEASISEEDREKLMKMQEAVLS FT YLNRYFYGKRSVVSEEQEDTGIDRLFIESENPVGRQDEAIGQITQPSSTGQ FT MKTGSLLEDEKDIVESLQRLGLLNTNNLARVEVTDVKDALMSLEYEINLLR FT RFRDLHSILPDNQTPTIIDAHLPQANRTTYTGTIPKNTRSSLAASSHSITS FT GGIDSFASIDPTVHNSMSSFVSSGTTPSYSIARAVDTRLVPELSTSYPMIN FT FQTLTSTTSTGPRVTSTMYAANPYLTMGCLSTSQVQFNVPATIGFNTTAAS FT QFERESMNWTRPSVDPWRATISPVTNSIPTLTSTKGLQGQQMSTNNNFNSL FT TSNSQLGHHTAQSVQHPASTMEACRGQFGHKSLPVSKWKLNKYAGTDQGLK FT LNEFLVLVSQLALSERISEPELFDSALHLFTGPALNWYMTMRSSGRLASWQ FT HLVSELRKAFAHPELDSLVRTRIYQRRQQRNETFQEYYYDMESMFRSMIVP FT MSDAEQLDVLKRNMRSDYKKTLLWKPIFSLPDLLEAGHIIDASNFSLYAKV FT FGHEKTSNAVSEIKIGKGDPKNFSNRPPFKPTTQQGSNPSKFKDDNSKKKG FT VTEEKKTNQPIQAGRSKEMDPKEGPSKPTRTLDMLIEAHRPPRSYECLYCR FT HTNHALEQCRSFRGSICMVCGFKGFETQNCPYCQKNGLQTARNRGPSNPNA FT " FT CDS 3239..6010 FT /product="Gypsy-89_AA-I_1p" FT /translation="MDFWQKFQIWPTIRDCAATLVDEATEKGPQQTLSKSL FT ESNQLSEIKNLFLGARPDGLTVTPLIEHRIEIADEWKDKPPVRQYPYTLSP FT KVQQKVAEELDRMLAIGIIERAHSDWCSNVVPVIKPTGKVRLCLDARKINE FT RTVRDAYPLPHPGRILGQLPQARYLSTIDLSEAFLQISLEEKSRKFTAFSV FT QGKGMFQFTRLPFGLVNSPATLSRLMDRVLGHGELEPNVFVYLDDIVIVSE FT TFEHHVQLLREVARRLAEANLSINIDKSKFGVNELPFLGYLLSTEGLRPNP FT DKIQAIVEYERPNTVTKLRRFLGMANYYRRFIEDFSGITSPLSDLLKTKSK FT VLGWNEKAEQAFTLIKEKLISAPVLACPDFSKEFTLQTDASDVAVAGILTQ FT IQEGFERVIAFFSHKLTTPQRNYHACEKEALAVLLSIEAFRGYIEGSHFTV FT ITDSSALTHVMTAKWKTASRCSRWCLELQHYDMTIRHRRGKENVVADALSR FT SVATVVTSNISNPTVKITPDSLDSAQAIYDSLLEKVYEEPDDHVDFRVRDG FT VLYKYVANTSEPHDDRFDWKIIPSPVERNGIIQECHDNSMHPGTDRTLSRI FT RLRYFWPRMVLDVREYVSKCTICKESKAPNIALAPPLGEKRVTSHPWQIIA FT LDFIGPLPRSRNQNQYILSVVDLFSKWIMLVPFRKIDSKNLCKVLRDQWFY FT RNSVPEVLITDNATCFLSHEFRALCTRFDIRHWLNSKYHSQANPVERVNRT FT VNTAIRTYVKSDQKLWDSRISEIEAILNSSAHSATNLTPFFTTHGHEMFLK FT GCDHGIGEDDPQISVADREKRQTELFNAINKLVQDKLEKAHQESHKRYDLR FT HRTYGKPFKVDQMVYRRNMKQSSAVEDYNAKYGPQYLPSKIIRRIGSSSYE FT IADLEGKSLGIWPAMHLKPG" XX SQ Sequence 7106 BP; 2232 A; 1473 C; 1495 G; 1906 T; 0 other; attggcgatc caaccaaatc caaaaatatt cttcttataa gaaatttttt tcacttactc 60 ggggtgaatc acgattctta ctctcattca tcgtgatcgt atcgtggtgt tgtagggaag 120 tattcgtccg aaaagacagc taaaaatctg tttgatccat ctcaaccatt ataatttgat 180 tatagttact attcagaagc ttttctggtc tatttcttat agttgtaatt gattttctat 240 atgtcgttaa ggaatgaatt ggaatctaga ataagttaaa aatcgaatga attggaattg 300 aaaattaata ttaaggaact gaattaaaat tgtaaataac tgaattacaa agaattatta 360 ggatttaatt acttttatag aattatagaa ttgaattatt tgattgaaca attgaactct 420 ctttattcta attttataca aaaatggctg ttagtataca agaattagtg gctgaaaagc 480 tcaagcaaca tttctttgac attcgtgtag atcatttgaa tgaggacgag ttagattttg 540 aactcgaaat cagagaaata gttttcgatg ataatgaatc catgactagg aagaggcgag 600 ctctcagaga aatgcttaag aatgaaaagt cagcagaaaa tccggtttac atgctcaaac 660 gaagtccatc agaggatttc caagcctgca tagctaaatt tgcagagatt gaaggaacga 720 taaaaatttc agcgaaaggg ttgccaccca gatgtcaatc tagattactt catctaggaa 780 accgattgct tctgttagag gcttcaatat cagaagaaga cagggaaaaa ttgatgaaaa 840 tgcaggaagc agtgctctcg tatttgaata ggtattttta tggaaagcgt tcggttgtat 900 ctgaggaaca ggaggatacg ggaatcgata ggttgttcat tgagagtgaa aatccagtag 960 gacgtcagga tgaggcgatt ggccaaatca ctcaaccgtc ttcaacgggg cagatgaaaa 1020 cggggtcatt gttggaagac gaaaaggaca tcgtcgagtc cttacagaga ttaggcttac 1080 taaacacaaa taatttggcg cgagtcgagg taaccgatgt gaaagatgcg ttaatgtctc 1140 tagagtatga aattaattta ctcaggagat ttcgagattt gcattcgatt ttaccggaca 1200 accagactcc gaccataata gatgcacact tgccacaagc gaatcgtacc acctatacgg 1260 gaacgattcc taagaataca aggtcgagtc tagctgcttc tagtcattca atcacttcgg 1320 gtggtataga tagttttgct tcgattgacc ccactgttca taattcgatg agttcttttg 1380 tgtcttcggg cacaacccca agctattcta tagcgagggc cgttgacact aggctcgttc 1440 ccgagttatc gacctcgtac cctatgataa actttcagac tctgacatcg acaaccagca 1500 caggtcctcg tgtcacaagt accatgtatg ccgccaaccc atatttaaca atgggttgcc 1560 tgtcaacatc tcaggttcaa tttaacgtac cagcaactat cggtttcaat accaccgctg 1620 catcacagtt cgagcgcgaa tcaatgaatt ggactcgacc gtctgttgat ccatggaggg 1680 caacgatttc accggttacg aattcaattc ccaccctgac gtctacaaag gggttacagg 1740 gacaacagat gagcactaac aataatttca atagtctgac ttctaatagt caattagggc 1800 accatactgc tcaatccgtt cagcatccag ccagtaccat ggaagcatgt cgtggccagt 1860 ttgggcacaa atcgcttccg gtttcgaaat ggaaactgaa caaatatgcc gggaccgatc 1920 agggacttaa gctgaacgag tttcttgttc tggtgtcaca gctagcttta tcggaaagaa 1980 tttctgagcc cgagcttttc gattctgctt tgcatttatt taccggacca gcccttaact 2040 ggtatatgac aatgcgatct tcggggaggc tggcaagctg gcaacatctt gtcagcgaac 2100 ttcgtaaagc ttttgcgcat cccgaacttg attcgttggt acgcaccaga atataccaaa 2160 gacgtcagca aaggaatgag acatttcagg agtactatta tgacatggaa agtatgtttc 2220 gatcgatgat agtacccatg agtgatgcag agcaactgga tgttctcaaa agaaatatga 2280 ggtcagatta taaaaaaacg cttctttgga agccgatctt tagcttgccg gatttgttgg 2340 aagcagggca cataattgat gcatctaact tttcgttgta tgctaaagta tttggccatg 2400 aaaaaacctc caatgccgtt tcagaaatca agatcggcaa aggtgatccg aagaatttca 2460 gcaatcggcc accgtttaaa ccaactactc agcagggttc taacccttct aagttcaagg 2520 atgataattc caagaaaaag ggagtaactg aagaaaaaaa aactaatcaa ccgattcaag 2580 caggtcgatc caaagaaatg gaccctaagg aaggtccctc gaaacctacg agaactttag 2640 atatgctcat tgaggcacat cgaccccctc ggagttacga gtgtctttat tgtcgacata 2700 caaatcatgc tcttgagcaa tgtagaagct ttaggggttc tatatgtatg gtttgtgggt 2760 tcaagggatt tgaaacccag aactgtcctt actgccaaaa aaacggccta cagacggccc 2820 gaaatcgcgg accgtcgaac ccaaatgcgt aaatccagta ccaactggaa tcactcttgc 2880 cgaattctgg gaaccagttc gggaagacct atattctagt gatgaatctc aggttctcaa 2940 tattgccatt ggagatactc acgatagtcg accttacgcc aatataaaaa tttacgggcg 3000 gccatccaag ggcctgctgg actcaggaag tcaattgacc ttaataagcg aagaagtgtt 3060 tcgcaaattg aaccgctcga aactaagacc cttgagtagg ccgattattg tacgttctgc 3120 aaatggatca gagctagaag ttcttggtca aatttctatt ccgttcaact ttgctggtca 3180 cataaaaata atccaaaccc tcgtagtaaa gaccttatct gtagactgcc tactgggtat 3240 ggacttttgg caaaaattcc aaatatggcc cactattcga gactgtgccg ctacattagt 3300 tgacgaagca acagaaaagg ggcctcaaca aactttgtca aaatctttag aatcaaacca 3360 gctctcagaa attaaaaatc tctttttggg tgctcgccca gatgggctga ccgtcacacc 3420 tctgatcgaa catcggatag aaatagccga tgagtggaaa gacaaacctc cggtacgtca 3480 atatccgtac acgctatctc caaaggtcca gcaaaaagtg gctgaagaat tggatagaat 3540 gttggccatt ggcataattg aaagggcaca ttctgattgg tgctcaaacg ttgtaccggt 3600 tatcaaacca acaggaaagg ttcgtctctg cctagacgct cgcaaaataa acgaacggac 3660 ggtacgtgat gcctatccct tgccccaccc tggacgaatt ttgggtcaat taccccaagc 3720 aaggtacctg agtaccatag atttatctga agctttccta caaatctcct tagaggaaaa 3780 atcgcgaaaa ttcacagcgt tcagtgtaca gggcaaggga atgttccaat tcaccaggtt 3840 gcctttcgga ctagtcaata gtcccgcaac cttgtcacgg ttaatggacc gagttctggg 3900 acatggtgaa ttggaaccga acgtcttcgt ataccttgac gacattgtca ttgtatcgga 3960 gacgttcgag catcacgtac agctccttcg ggaagtagcg aggcgtttgg cagaagcgaa 4020 tctctccata aacatagata agtctaagtt cggtgtcaat gaattgccat tcctaggata 4080 ccttttgtcc accgaaggct taagacccaa cccagataag atacaagcta ttgtggaata 4140 cgaacgacct aataccgtca ccaagctgcg ccggttccta ggaatggcaa attattatcg 4200 tcgtttcata gaagatttta gtggcatcac gtccccgcta tcagacttgc tgaagactaa 4260 gtccaaggta ttgggctgga acgaaaaggc tgaacaagca ttcacactaa ttaaggaaaa 4320 gctgatatca gcccccgtac tagcgtgtcc agatttttca aaagagttca ccctccagac 4380 ggatgcgagt gatgttgccg tcgcgggcat tttaactcag attcaggagg ggtttgaaag 4440 agtcattgct ttcttttcac acaaactcac aaccccgcaa agaaattatc acgcgtgtga 4500 aaaggaagca ttggctgtcc tcttatccat agaagcattc cggggataca tcgaaggctc 4560 tcatttcacg gttattaccg actcgtctgc cctaacgcac gtcatgaccg ccaaatggaa 4620 aacggcatcc cgatgtagta gatggtgtct ggagttgcag cattacgaca tgaccatacg 4680 tcatcgtcga ggaaaggaaa acgttgttgc cgacgctctg tcgcgcagcg tggctacagt 4740 cgtcacaagc aatatttcca atccaaccgt caaaatcacg cccgattccc ttgactctgc 4800 tcaggcaatc tatgacagtt tacttgaaaa agtatacgaa gaaccagacg atcatgtgga 4860 ctttcgtgta cgggatgggg tgctttataa gtatgtcgcg aacactagcg aaccacacga 4920 tgatcggttt gactggaaaa ttatcccttc tccagttgaa aggaatggta tcatccagga 4980 atgtcatgat aactccatgc atccgggaac cgatagaaca cttagccgta tccgtctacg 5040 atacttctgg cctcgaatgg tcttagacgt gagagaatat gtatcaaaat gcaccatctg 5100 taaggagtct aaggccccga acatagccct agctcctccg ttgggagaga agcgtgttac 5160 gtcacaccct tggcaaatta ttgcactaga tttcataggg ccactacccc gaagccgcaa 5220 tcaaaaccaa tacatattat ccgttgttga tctttttagc aaatggataa tgcttgtacc 5280 atttcgtaag attgacagta aaaacctttg caaggtatta cgagaccaat ggttctaccg 5340 gaattcagtc ccggaggtac ttatcacaga caacgctacg tgttttctgt cccacgaatt 5400 ccgggcgtta tgcacgaggt ttgatattcg gcattggctc aattctaaat accactccca 5460 ggcaaatcct gtggagcggg tgaatagaac cgtcaatacc gccatacgta cctacgtaaa 5520 atccgatcag aaactatggg actccaggat atctgaaata gaggccatac tgaattcttc 5580 agcacactcg gctacgaacc tgactccatt cttcacgact catggtcatg aaatgtttct 5640 aaaaggatgt gaccacggta ttggtgaaga tgatccacaa atttcagtag ctgatcgtga 5700 aaagcgccaa actgaattat tcaatgcaat caataaactt gtccaagaca aacttgagaa 5760 ggcccaccaa gaaagtcata aacgatacga tcttcgccat cggacttatg gaaaaccatt 5820 caaggtagat caaatggttt acaggcgtaa tatgaaacag tctagtgctg tagaagacta 5880 taacgccaag tatgggccac aatatctacc gtcgaaaata atccgtagaa taggtagctc 5940 ttcttacgaa attgccgatt tggaagggaa atctttggga atctggccag ctatgcatct 6000 taagcctgga taatataaat ttactatctg ctattgtatc tttgataaga atcacagacc 6060 tatccaaaca gacgaactcc tgcctgctaa atccgacgaa aattctagag tcaacgacct 6120 tcgctttccg ctcaagtagg gtaaatagga aaggtaaaca atctaaacaa ttgtcctctt 6180 tatagctcat agctcgcgag atatcttaaa gaaaaattaa ggtttgtatt acgaaagtag 6240 ttgcttgtac aagtaggaaa catcatagga tatcattttt ttttgtttgc ttaaagcatg 6300 aatgctttag aataaaatag ttccataaac tcttatcacc gtagtttagt gagtttcact 6360 tcacgtcgcg atgttgatgc agccgttctg ttttgctctc tctattcgtc ggcgcgagaa 6420 actcgtaaaa taatcatagc tagattaatg atcgccaacg ttttaggctc aaaataacga 6480 tgctataatt atagctaccg aggggaccag tcgttttgac gaaaacggct tgaccacaga 6540 tttaacacag actgtaatct tcatttttct tacagaatgt ttattggaaa aaatctaata 6600 tcgaataggt agatcttggc ctatttgaat aatcaaaaga tagtaatggt cttttgatat 6660 agttgtgcta tgaagacgca aactgtcttc tagagaaccc aggaagagaa gagtaaaatt 6720 tagctaatta gaataagaat aattaattaa aaaataaaga tttcttaaga ggtgtatcta 6780 cacgaagacc caattgaaca cagcctatct ttaaaaatcg aatatgaata atataaatgg 6840 aataagaata tgaattatat aaatggacaa tatagacaaa ttgcataact ttcggatacg 6900 tcctttagaa aacaattgca acgccaaaga aataacagat tctcgcagct agtgactata 6960 acaaatccga ataaaagcaa ataaggatcc ggatatgggc ttaatagaac gacggaattt 7020 tctggcaata ctggattcag gaacggagaa aaaaaaaaaa tactttgcat agctcagtat 7080 tttttttatt ttgagaggga gatgat 7106 // ID BEL-1-LTR_NVi repbase; DNA; INV; 969 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-969 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 743-743 (2009). XX DR [1] (Consensus) XX SQ Sequence 969 BP; 181 A; 229 C; 245 G; 313 T; 1 other; tgttcgggtt atgtaagagc gggaagggga attttgaatt tcgcgccgta gcgcggcgag 60 cgctgcagcg agagcggtcg agcgcatgcg cgcgctgatg tgggcctctc ccgtttgcgc 120 gggctccggc gcgcgggagc acgtgcagtt cagttctgac agctcctctc gagtagtcag 180 ttgacattgc gcttttgtac ttgcgattgc aaaagcctct ttgtaaattg cgtgcgaata 240 gagtcgtcat catcgtcctt ttcaggacga tcgacactct ttattgtatc accgtgcgtt 300 gcgtagtgtg tgtgcatttt gcgtgcggtg tgcggattgt gtgagttctt cgagtgttgg 360 gatttcggcc acgtgcgagc ttcctcgccg tgtcgatctc cacccggcga ttgctaccta 420 cgcggcaggg ctgaaaaggc ctgcccgtgg tccaggattg gttccctcga gacgtcgagg 480 cgtcctgcga ccgaggaagg tgcatcccac caacgtgcgg ccagggcttt acgtcgcgcc 540 gtccacgtct cgttctgcga ataaggtagg agagctctcc agcgtgcgaa aggtacctcg 600 cggcgatagg attaagytac ttgcgtttct ttctctgttg ttgtctatag tagagtctct 660 cttagttttt gtataagcgt acattgtaat tgcgtttctc tctttttgtt tagtctatcc 720 tagagtcact tttctttgta taatttcctt atttctaaaa taaagctata ggcttgtccc 780 ggcttttctc tctactccgt gagagttttt acgccgaggc aagtcgtctc tttttctccc 840 gcgaaattca ttgtgttttg cgttgattta ttcacccaaa cttccctcac tcattccacc 900 attttaattt aatatatata tatatatata tatatatata tataatttga actcattagc 960 ttttgttca 969 // ID BEL-5_AA-LTR repbase; DNA; INV; 351 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_AA_; KW BEL-5_AA-I; BEL-5_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-351 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 860-860 (2011). XX DR [2] (Consensus) XX SQ Sequence 351 BP; 89 A; 86 C; 81 G; 95 T; 0 other; tgttcggtcc aataattaaa ttgagaatca ctgttgatct ctttagatca ctttcgatct 60 agctcccgct cgcgataaga gctatcgggt ctcgcttccc gagatccacc cgaaagcggg 120 tctaatgttt atgtatatta gcaataaaat cagttttgca ccaactttca aacagacaag 180 ttgtgtctcc ctgtcggtcc gcgaaataaa gttgttgagt cccagtacgc cgcgttgtga 240 ttacttgtga gaaaggtcag tgatcggagg tcgaggtcaa ggacctcggg aacacccgtg 300 ttttgggcgg acccatctat acagtccact cccaaggttc gcgccaaaac a 351 // ID RTE-1_BF repbase; DNA; INV; 1752 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-1_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1752 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1752 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1699-1699 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..1725 FT /product="RTE-1_BF_1p" FT /translation="TLDEVSPVYGPPPKIDRARTTKAINAMKKGKAAGPSG FT IVAEMIQAGGDFMADQIASLTNAIIKEEKIPNDWNLSYIINCYKGKGDSME FT RGNYRGLKLLDQVLKVVERVLEPMIREQVHIDSMQFGFMSGRGTTDAIFIL FT RQLQEKFLSKRKDLYLMFVDLEKAFDRVPREVLWWAMRKLGVQEWLVRTVQ FT SMYCHARSSVRVNGKYSPEFDVKVGVHQGSVLSPLLFIIVMEAISRDFRVG FT CPWELLYADDLGLAAETLTLLSQRFSPWKNNLSSHGLRVNTGKTMVVHCKY FT NDSRPSKETGKFPCGVCHKGVGDNSIFCTQCKHWIHKRCTKIKGKLKEDPN FT FICHSCSNQAPTPPEPPLMEATVSGDTFKVVPIFCYLGDTIGQSGGCADAV FT TARIRSAWKSFHELLPILTNRNIPFRNRGHVYSSCVRSAMLYASETWALSA FT EDVRRLVRCDNAMTRWICSRRLAERVPSEQLRCRLGLHSIHDILRYNRLRW FT YGHVQRMSHDSWPRKVLNLSVVGQNPRGRPKRRWVDNVKEDLRKLQKANPL FT DRDGWRAAIRPKRLDDTSNLCKTGKNGR" XX SQ Sequence 1752 BP; 510 A; 375 C; 448 G; 419 T; 0 other; actctagacg aagtttctcc tgtgtacgga cctcccccaa aaattgatag agctaggaca 60 acaaaagcaa ttaatgctat gaaaaaaggt aaagctgctg gaccctcagg tattgtagcc 120 gaaatgatac aagctggtgg tgatttcatg gctgaccaga ttgcttccct cacaaatgct 180 ataatcaagg aagagaaaat accaaatgat tggaaccttt cctacattat taactgttat 240 aagggtaagg gagactctat ggagagaggc aactatcggg gtctgaaact cttagatcag 300 gtgctcaaag tagttgaaag agtgctcgaa cccatgatta gagaacaggt tcacattgac 360 tccatgcagt ttggctttat gtcagggagg ggcactacag atgcaatttt catactacgg 420 cagctccaag agaaattcct ctctaagaga aaggacctgt accttatgtt tgtagacctg 480 gaaaaagctt ttgatagggt gcccagggaa gttttatggt gggcaatgag aaagctaggt 540 gtgcaggagt ggctagttag gactgtgcaa tctatgtatt gccatgctag gtccagtgta 600 cgagttaacg gcaaatatag cccagagttt gatgtcaagg taggagtaca tcagggctct 660 gtcctcagcc cactcttgtt catcatagtc atggaagcaa tatcgcggga ttttagagta 720 ggctgtccgt gggaattgct gtatgcagac gacctggggt tagctgctga aactctcaca 780 ctcctgtcgc aaaggttctc cccctggaag aacaatttaa gctcacacgg cttgcgtgtg 840 aacaccggta aaaccatggt agtccactgt aaatacaatg actctagacc gtccaaggaa 900 acagggaagt tcccttgtgg tgtatgccac aagggtgtgg gagataactc tatattctgt 960 acccaatgca aacactggat acataagaga tgtactaaga tcaagggtaa actgaaagaa 1020 gaccctaact ttatctgcca cagttgtagc aaccaagcac ccaccccccc agaaccacct 1080 ttaatggaag ccactgtctc cggtgatact tttaaggtag tccccatctt ttgctacttg 1140 ggtgatacca tcggccagtc tggtggctgt gcagatgcag taacagccag gattaggtca 1200 gcctggaaga gcttccatga gctcctaccc atacttacaa accgcaatat cccctttagg 1260 aatcgtgggc atgtttatag ctcatgtgtt agaagtgcta tgttatatgc gtcagaaacc 1320 tgggcattgt cggccgaaga tgtaaggagg ctggttaggt gtgataatgc tatgaccagg 1380 tggatatgct ccagaaggct agcagaaaga gtgccatctg agcagcttag gtgtaggtta 1440 ggtttacaca gcatccatga cattttgcgc tacaacagac tccgatggta tggccacgtg 1500 caaagaatgt cccatgacag ctggccaaga aaggtcttga acctgtctgt agtgggccag 1560 aatccgcgtg ggcgccccaa gaggagatgg gtagacaatg tcaaggaaga tcttaggaaa 1620 ctgcaaaaag ccaaccccct tgacagagat ggatggaggg cagcaattag accaaaacgt 1680 cttgacgaca cgtccaacct ctgcaagaca gggaagaacg gacgctaaac cggatagtga 1740 gtgagtgagt ga 1752 // ID MuDR1x_AP repbase; DNA; INV; 2394 BP. XX AC . XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR1x_AP. XX NM MuDR1x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2394 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1350-1350 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(434..520,749..1345,1349..1843) FT /product="MuDR1x_AP_1p" FT /translation="MAISIFWRIHLLKMLNGGVVQIKYVQQKYSRTTIARQ FT NVSTQCKRVAVEDISIRPRKIILTEMNNHITNTDFTISDVSAIRKSIYHAR FT KKSLPSNPKSVSDVHKALNDLHLTTSKNEDFLFINDELSNIIIFTCHENLK FT FLCKSVYYIDGTFSYCPKYFVQFFVIHGFINEYYVPLVFCILNNKSTETYQ FT LVLSYIKNKAMQNFSLMFKPKYVTVDFELAIHLAVKSVPLTEIIGCRFHLT FT QAWYRKIQSLGLTSAYKDNKWLKFTFGLTFLDPNEVSDCLVDDFMSEIPDD FT PKYRKYADYLVDNYIGENANFPPNIWAAFAADLTRTTNNCESFHSHFNGQF FT YKSHPNIFTFLEILIKTVQTDVYIKINSCVKNIPNPRKNAQVITRLKKL*" XX SQ Sequence 2394 BP; 887 A; 312 C; 322 G; 872 T; 1 other; gggtgcagcc catttgcgcc aaaataaaaa aggtatgttt ttccgacagc ccatttgcgc 60 caaatttacc tgcgtaaaca ttttaaatta acgacttggt taattcttat atttcaaaga 120 cgtcgtgtct gtaacgtatg ttaaagttat ttttataagt ctatagattt taaagctttg 180 tatattattg cgtcgaaaga taagcaaaca agttttatat cagttttatt atgttgtatg 240 atgtaggtac tccgttaccg gatatatagg tatcgttatt cgtcaaaaga acgatcgcag 300 tgtgatttca tcagacattc gttacgttac actaaataat attttcgaat ttattaatga 360 taattatttg aattataaat tataagaatg gagttcacga caagcacaag agggaagaaa 420 atggttgttg aaaatggcca taagtatatt ttggcgtata catcttctaa aaatgttgaa 480 cggtggcgtt gttcaaataa aatatgtcca gcaaaaatat taattcaaga tggaaaaatt 540 atcaaaaagc atggtacgtt gcaaacaata atttttattt taataaaatg tgttaaatgt 600 atataatata taaaacaaat taactgtttt cgcaataaat ataattaatt ttttttttcg 660 ttactctgat aataataaca aaattcactt aaattatacg ttatcaaatc ataaatgatt 720 ttaattttaa ttttatagat tcgcataatc acgtacaaca attgcaagac aaaatgtaag 780 tacacaatgt aagcgtgtgg ctgtcgaaga cataagtatt cgtcctagga aaattatttt 840 aactgaaatg aataaccaca taactaatac agatttcact atatctgatg tatccgctat 900 acgaaaaagt atttaccatg ctcgtaaaaa atctcttcct tcaaatccta aatctgtgtc 960 tgatgtgcat aaagcattaa acgatctaca tttaactact tctaaaaacg aagatttttt 1020 atttattaat gacgaattat ctaatataat tatatttaca tgtcacgaaa atttaaaatt 1080 tttatgtaag tcagtatatt atattgacgg gactttttcg tactgtccaa aatattttgt 1140 tcaatttttt gtcattcatg gatttatcaa cgaatattat gtaccattag tattttgtat 1200 tttaaataat aaatcgacag agacgtatca acttgtttta tcatatatca aaaataaagc 1260 catgcaaaat tttagtttaa tgtttaaacc aaaatatgtt actgttgatt ttgaattagc 1320 tatacatctc gctgtgaaat cagtatagcc attaaccgaa attataggct gtcgttttca 1380 cttgactcaa gcttggtaca ggaaaattca atcactaggg ttaacttcag cttataagga 1440 taacaagtgg ttaaaattta catttggttt aacttttctc gatccaaatg aagtttccga 1500 ttgtttggtt gacgatttta tgtctgaaat ccctgatgat ccaaaatacc gaaaatatgc 1560 agattattta gtagataact acatcggaga aaatgccaat ttccctccaa acatttgggc 1620 agcgtttgca gctgatttaa ctagaaccac caataactgt gaatcttttc actctcactt 1680 caatggacag ttttataaat cgcatcctaa tatttttaca tttttagaaa ttttaattaa 1740 aacagttcaa acagatgttt atattaaaat caacagttgt gttaaaaata tacctaaccc 1800 tcgaaaaaat gcacaagtta taaccagatt aaaaaaactt taaaagcaat tgagaattat 1860 aaaaacaaaa aattaacgcg ttatgaatat gtacaaatag ttgcatttaa ttacaataat 1920 gatcaatttg attgatattt taaaaaaaat tgtgctattg actatcatac ctatagccta 1980 taattttcaa taaaatgtca aatgtgtacc tatactaatt atttactatt gtttaaaaag 2040 tgtattattg ataattatta ttaaaaaaat cgtattattg atttgttata aattataata 2100 gttagtaata agtatatttt gtaatttgta tagtttttaa aaacaatttg tattaattat 2160 gattattgtt tcaaaaaaaa aaaaatgtgt atatacagct attattgatt attattttaa 2220 aaacattcgt ataattaact atgatttaaa taaaattgta ttaattatta tttaattgat 2280 tattatttga gaaaaaaata tattacttta ttataattat accaggtaaa tttggcgcaa 2340 atgggctgtc ggmaaaacat acctttttta ttttggcgca aatgggtaac accc 2394 // ID TTAA20B_AP repbase; DNA; INV; 446 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA20B_AP. XX NM TTAA20B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-446 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2087-2087 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 446 BP; 151 A; 68 C; 68 G; 158 T; 1 other; gggcgttgca tgcggcgatt ttctgtcttt gtctaacaca cgtggaacat aggaataacg 60 acctaagcgc ctgacaaaaa ttaaggtact tctagttgtt cgatttaaaa aatgtaaaga 120 tgtttgaatt ggtatacaag tttactttgc atcggtcgaa gcactttttc aattgtagca 180 atattttgat gagaatttta aataaataat ttttatataa attttaaatt ttaaattttt 240 nataatttta ttttataaat tatattaaaa aatattaaaa atatgcttcg accgatgcaa 300 agtaaacttg tataccaatt caaacatctt tacatttttt aaatcgaaca actagaagta 360 ccttaatttt tgtcaggcgc ttaggtcgtt attcctatgt tccacgtgtg ttagacaaag 420 acagaaaatc gccgcatcca acgccc 446 // ID Kolobok1-1_TCa repbase; DNA; INV; 3661 BP. XX AC . XX DT 06-MAR-2008 (Rel. 13.03, Created) DT 10-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Kolobok-type family. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok1-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-3661 RA Jurka J. and Bao W.; RT "A distinct subgroup of Kolobok-type DNA transposons."; RL Repbase Reports 8(3), 171-171 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS join(1007..1765,1758..2699) FT /product="Kolobok1-1_TCa_1p" FT /translation="MYYKKKNQTTLPHKCFKNWKNTSTSMEAAIILDGFKK FT SVDMHNVRFKNLIGDGDSSVYKKIHNARPYGPNYFIKKIECRNHILRNFCT FT KIREIAKTPRSDINIKTFLRNNYLKFRTAIVSAIKYRKNENCTFDQKAENL FT KNDILNGPAHIFGNHSKCATYFCKTVGQITNAVAYSDFKNSSLFELFMQAL FT KRVANLSSSLLYDVDNNSVESFNSVVNKLVGGKRINFSERGSYHGRCFAAV FT VNINNKNKLICIFVFKIMEKISGNSGIYTTKFSNRLLKTKELQATKLNKRK FT IRIQSADSDYGATNDIXDMPTETYDKIKNEFLNDLAACDILKIPLLTIGQR FT NNEQWDKERNNRLTASNFGRIINMKCTTNTANAVKDILYRVGLSKFKKLPE FT PLQWGIDNEEKAKEKFEQITGIKVDSCGFFVDKEKKFLGATPDGLVGSNAI FT VEIKCPFSARNTNIKTAIEQKLIKYLEIDSQDDSKIVLKEKDNYMYQVQGQ FT LHITNRQTCYFIVYTSTDLKYCIIEKNDSKYWNGGMIDCLENFYMNAMLPE FT IIDSRHSRNLPLRSIIKKDL" XX SQ Sequence 3661 BP; 1387 A; 514 C; 557 G; 1195 T; 8 other; ttaaggcttc agggtaccga agcgattgtt gagaattggt gtactacaag ttgcatactt 60 tttttagacc tgttttcagt gacagctgtc aatcatcgga taggcatccg accttcgcct 120 cggggtgtct tgtcccttct cataaaatgc aaataatgta aacaaactga attccactat 180 cgcccacgat agtttctctt atttcagaga ttcaagcaac atacagtgaa aaatacgttt 240 tttattttgt taaaaaaatt tgatactaaa gaataataaa atactagcat agttttgtaa 300 aaaataatac accattacac catttattta ttgggtttta tgttttaatt aaccgaattc 360 tgttcgctat attatgtgtc atgtgcattt ctaattggat cacttgcatt gaaaaaatat 420 aaaataaaaa aattaattgg tgaaacatta ttgttctttc agaaaacagt ttaactttaa 480 aaaagttact aggaatatct cattaaaatg atgaaaactt ctttaaaccc gctttttcgt 540 ctgataattt tatccaggta ataaattctg aaaaatacat tgttgtaaaa atttattgta 600 actataccca gtaataatta tagtacaaat aacgtatcga gagattgtaa gatccagtct 660 tagggatata atttgtttac gttacgccta gtaaaataaa tgatactttt ataattatgt 720 attattttag aatttaggaa ttatttctcg tttttttatt aaatttatac aatttttcta 780 taactagctt gcgaagatat agctgcgtta ctcttacatt atataaattt tgtgtatttc 840 tttaaaatat aagtaaacat tgctaaagat cgggtacctg aaactgaaat cacatgtaaa 900 caaaaatgtt tattagaaag tttagaaagt ttttcataag ctcttgtatt tagtggaggg 960 tttwatttta tacgtgggag ttaaaaataa aaattgtaaa aaatgtatgt attataagaa 1020 aaaaaaccaa acgacgctcc ctcataaatg ttttaaaaat tggaaaaaca cttcaacttc 1080 aatggaagca gcaattattt tggatggatt taaaaaatca gtagacatgc ataacgtaag 1140 atttaaaaac ctcataggag atggtgatag tagcgtctac aaaaaaattc ataatgcaag 1200 accatacggg cctaattatt ttattaaaaa aattgaatgc cgaaatcata tactacgaaa 1260 tttttgtact aaaattagag aaatcgctaa aactcctcgt tctgacataa atataaaaac 1320 ttttttgcgc aacaattatt tgaaatttag aaccgcaatt gtttcagcga taaaatacag 1380 aaaaaacgaa aactgtactt tcgatcaaaa agcagaaaac ctgaaaaatg atattttaaa 1440 tggacctgca catatttttg gtaatcattc aaagtgcgct acgtattttt gtaaaacagt 1500 tggtcaaata acaaatgcag ttgcatattc tgactttaaa aatagttcct tgtttgagtt 1560 gttcatgcag gcattaaaac gagttgcaaa tttatcttca agtctattgt atgatgtcga 1620 taataattct gttgagtcat ttaattcagt agttaacaaa cttgttgggg gtaaaagaat 1680 aaatttttca gaacgcggtt cttatcatgg aagatgcttt gcggcagtcg taaatataaa 1740 taacaaaaac aaattaattt gtatttaaaa ttatggagaa aatatcaggg aatagtggca 1800 tttacaccac aaaattttca aatcgcttat taaaaaccaa agaacttcaa gctacaaaat 1860 taaataaaag aaaaattcgt attcaaagtg cagattcgga ttatggtgcg actaatgaca 1920 taatrgacat gccaacagaa acatatgaca aaataaaaaa tgagttcttg aatgatttgg 1980 ctgcttgcga tattttaaaa attcctttgt taacaattgg tcaacgtaat aatgagcagt 2040 gggataaaga aagaaataat agattaacgg ctagtaattt cggaagaatt attaatatga 2100 aatgtacgac caatacagcg aatgctgtaa aagatatctt gtatcgtgtg ggattatcta 2160 aatttaaaaa attacccgaa ccactacagt ggggaattga taatgaagag aaagcaaagg 2220 agaagtttga acagatcaca ggtataaagg tagattcatg cggctttttt gttgataaag 2280 aaaaaaaatt tttaggtgca acaccagatg ggttagtggg ttcaaatgct attgttgaaa 2340 ttaaatgtcc atttagcgca agaaacacaa atattaaaac tgccatagaa caaaaattga 2400 ttaaatactt agaaatagat tctcaagatg atagtaaaat agttttaaaa gaaaaagata 2460 attacatgta tcaagtacaa ggtcagctcc acattactaa tcgccaaact tgttatttta 2520 tagtgtacac ctctacagat ttaaaatatt gtattattga gaaaaacgat agtaaatatt 2580 ggaacggagg aatgatagat tgtcttgaaa atttttatat gaatgccatg ttaccagaaa 2640 taattgattc aagacatagt cgcaatttac cattacgcag cataataaaa aaggatttgt 2700 aaataatatt tcaagtttag ttttagagat ttttacataa atacataata tgatgataag 2760 cattttaata ataattataa tttttaattt gaacaataaa agtaaaaaaa aacttcctca 2820 gtgggttgac accttgtcgt ggtgagggag ctcagtagtt ctatcacaaa acctgtacag 2880 aacgaagcta agtacaacca aagttttaat catctttttt tatatttttt ttaaaacagg 2940 aataatttta ttgttaaaag tattcaaaaa aatagcgagc gtgattactc gtatttcaaa 3000 atctcataag atacgttgat agcagtgtct acaaaaaaaa acgtaatgca aatctatagg 3060 tacctaatta ttttgttaaa aaaattaatt ccgaaatcat gtgctttaat tttataccac 3120 tattagagaa atcgctaaaa ttccxxxxxx tatttgtgcc gatacatttc tctgtcttga 3180 acaaatgcaa taaaattatt atcttgtggg tcaccccaat ggtggttacc tccagcggtg 3240 cgtcctaaac ccgtcataaa tcttcttcca gctagctacg cattcgaaga aggatgcatt 3300 aaaagttgcc tacttacaat ttaatttttg ctttatatct aacaattaag aacagttaac 3360 aaaaatctaa aatatctcag gtctgtattt ttaaatttcc tataaaatgg tttttttaaa 3420 ttctcaatac aatgcaccgt tatcaaaaat tcgtatcaac ttcattaaca gacattttat 3480 ttaataaaaa aatatctagg catttatttc aatcaaaaat gactcttcga cctacattta 3540 cttataaaga aacactgcaa atgtcaagaa actcttcaag tctaaatttg cggctaaaaa 3600 cagtgaaaac agtttacacg taaaacgtat gcgccagctc ccagagggtc tagagcctta 3660 a 3661 // ID MINEX_Le repbase; DNA; INV; 129 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Leishmania mini-exon repeat - a consensus. XX KW MINEX_Le. XX OS Leishmania OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae. XX RN [1] RA Marfurt J., Niederwieser I., Makia D.N., Beck P.H. and Felger I.; RT "Diagnostic genotyping of Old and New World Leishmania species by RT PCR-RFLP."; RL Diagn. Microbiol. Infect. Dis 46(2), 115-124 (2003). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 129 BP; 25 A; 30 C; 31 G; 43 T; 0 other; actttattgg tatgcgaaac ttccggaacc cgtcttccgg caagattttg gaagcgcgca 60 ggcgctattt tttttgtgtg tgtggcggcg ccccctttca actaacgcta tataagtatc 120 agtttctgt 129 // ID Mariner-40_SM repbase; DNA; INV; 2556 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-40_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2556 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1889-1889 (2009). XX DR [1] (Consensus) XX CC Contains 2 overlapping ORFs (probably due to stop codons). CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1198..2253 FT /product="Mariner-40_SM_1p" FT /translation="MTIQVKKCVFRRSCKYPERIMNSTKASTSVMFACTAD FT GKMLHPYVVYKAEHMHDRWLLGGRTHVRYNRSRSGWFDSACFKDWFLTVVV FT PFCRRLPGKKALIGDNLSSHFSKEVIEQCSALNIAFVCLPPNTTHLCQPLD FT VSVFAALKKYWRQILTNWKLHEGRCVPVLPKEKFPMLLSLLLDAIEPSFKE FT NAINGFRKCGIVPADRTAITSRIKRTSSSSQAESAEITNAVSNVVMAKLED FT IHKSMEGSKGKQRKRKINLLSGKSISLSDFATPKGHDSDGAVSVTSDDNES FT AEDETDDDIPVLPKFPKPTSNACSSSSFHRTPGPSKFKKPLSGCTSHGRIL FT KKPSRFVD*" XX SQ Sequence 2556 BP; 790 A; 511 C; 516 G; 738 T; 1 other; ccgtcaactg gggtgagttc ggtaaccagg ggtgagttcg ttatttataa aatttttttt 60 aattgtttta atcttctaaa cgctttaact tctactgacg tgactgtttc acttaacatc 120 cgtgtttatt tcctacacca aagtatttaa accagccatt aaatttcaag gtaaggtgct 180 aaactgattg tttttataca ccatttacat ttttataaat aaataatact cattcctttc 240 ttgtatttca actagttcag cttgttgctt tgtagtgatt attttataat ttctaatgca 300 tcatatgatc attattttaa aacagagaat gaataatgga gtttgacagt ttaatgaaaa 360 tttagcattt cctttatggg ggtgagttcg gtaatgcaaa atgtacatga ttctgacaca 420 ataccatcat atattttatg gtactggtga tgttaacagg aaaaacatta agaatgaaaa 480 ggttaactag gtatagttat tcaaccattt aaataattat tcaataccgg taatcaaaaa 540 ttgtattgcc atatcaattt attttttcaa ttctatttcc atgaacaaat ttttctgcat 600 ttaacaaaga aattacattt tatttttagg tgccactaca agtaccacgc tcaccaatgc 660 cacgaaatta caagcctgta atcggaaaga cgagacgaaa gcaataccca gaagcaagga 720 agaacgaagc aattgaagcg gtagcagcgg ggatgaccca aacggaagct tctgcaacct 780 acgacatccc tcaaacaact ctgtccgatg ccctacgtgg agtgcataca cggaaatctg 840 ggggacaaac ggctctgaca gctgaagagg agaatattct agcaaraaat atcgccattc 900 taagcgactg gggttatcca gtggatgtat tggaggtgcg aatgcttgtg aaatgttact 960 tagaattccg agggcttcgc atttcaaaat tcaaagacaa cactcccggt gttgattggg 1020 tgaaggcgtt tttaaaaaga gtacgtgctg tattatccga acgtctgtgc caaaatattt 1080 ctcttaagag ggcagcagtt tcttttgctg atgtggagag ttatttcgat catttgcaag 1140 taacactaga agatgtgcca ataacaaaca tcatcaatta cgacgaaaca tgccttaatg 1200 acgatccagg taaaaaaatg tgtattccgg cgcagctgca aatatccaga aagaataatg 1260 aactcaacga aagcgtcaac ttcagtgatg ttcgcttgca ctgctgatgg caaaatgctt 1320 cacccctacg tagtttacaa agccgagcac atgcatgacc ggtggctcct tggtggacga 1380 actcacgtgc gatacaatcg gagtcgttcc ggttggtttg acagtgcatg tttcaaggac 1440 tggttcttga cagttgttgt acctttttgc cgtcgacttc ctggtaagaa ggctcttatt 1500 ggcgataacc tcagttccca tttttctaag gaagtgatag aacagtgctc tgccctgaat 1560 atagccttcg tgtgtctccc ccctaatacg actcatttat gccaacctct cgatgtgagt 1620 gtatttgcag cgttgaagaa atactggcgg cagatactca caaattggaa actgcatgaa 1680 ggacggtgtg tgccagtact tccaaaggaa aaattcccca tgctcttgtc gcttttgctc 1740 gacgctatag aaccctcatt caaagagaat gccatcaatg gattccgaaa gtgtggaatt 1800 gtcccagccg acagaacagc tattacctca cgaattaagc gcaccagttc aagcagtcag 1860 gcagaaagtg ctgaaattac caacgcggtg tccaatgtcg taatggctaa actggaagac 1920 attcacaaaa gcatggaagg gtcgaaagga aagcaacgca agagaaaaat caaccttctg 1980 tctggaaaaa gcatatccct gtccgatttt gctacgccaa aagggcatga cagtgacgga 2040 gcggtaagcg ttaccagtga tgacaacgag tctgcagaag atgaaactga tgatgacatc 2100 cccgtattgc ctaaatttcc taagccgaca agcaacgcgt gcagctcgtc cagttttcac 2160 agaaccccgg gaccttcaaa attcaagaaa ccattgtcag gttgcaccag tcacggacga 2220 attctcaaga aaccatcacg ctttgtagac taagttatct gtttatgcca ttaagatcat 2280 ttggaacgtt ttcagtatcc attcatatgg gattttataa gttttgtaat tttaccgaac 2340 tcaccccatc agggggtgag ttcggtagag aattatattc gtgtatattt cggaaataaa 2400 attagagatt tctgttcttt aaggcaggat accattctac agaagtactt cccataaata 2460 gtaaaacttg ctcgaaatca gaaatgacaa actagcagca atcattagta tttttttaaa 2520 tgttcatttt attaacgaac tcaccccagt tgacgg 2556 // ID BEL-17_AA-I repbase; DNA; INV; 6116 BP. XX AC supercont1.330; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-17_AA_; KW BEL-17_AA-LTR; BEL-17_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6116 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.330; Positions 296807 302922. XX CC Positions [5139-5729] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 37..1752 FT /product="BEL-17_AA-I_1p" FT /translation="MPRRNRRRNHVPEDSIQQTCLLCNLPDSSEMVACDRC FT AQWFHFDCAGVNEDVANQSWSCPDCSRARDPAPPLPLSSTSPEVVTGEPLP FT SISLQSFHPSSHIRTTPTTRVSPVPPHRTPTPRPRTSGLQILPPPTPLPRS FT PRPQIVLPPAVGAPEVPIMNQEVLAEADLVNRLPADLRIQFLEQQQAIEQR FT YLLRRFQLLLESPINANYARGENEIVPINQAVGHSPVCHSSPVCRNQIPPR FT PASQNRIESPFVPISQPHRSVNFPVTVSDQPSVSMIPPPQANEQANFPAFH FT SNPTVHRPIASTFIPPQADSCPSAPEFSHPCGNQRGQQSAFAPLMGRMHPP FT AQVFYPSAFHDEPTIVGGTALLNKSQIAARHAVPKELPLFAGEPEEWPLFF FT ASFENTTHLCGFSAEENMVRLQKCLRGKALEAVKCQLLHPSNLNQVLATLK FT MLFGGRPEIIVHSLLQKINNLPAPKADRLGTLVDFALAVRNMVATVKVCAL FT EEHLCNLTLLHSLTERLPPMIRLNWATHRQSLHSVTLVEFSDWLYKLAEAA FT STVTMPQLSGTEDNKSRRSRRMMDF" FT CDS 2232..6116 FT /product="BEL-17_AA-I_2p" FT /translation="MEEGLAKQLDLNGEKYPLCLRWTADTCRYEKDATIVS FT LNVSGIYDGSSQHNLKEVYTVKELKLPSQSLPVDKLTNKYNHLKGLPIVSY FT SNVQPKLLIGVSNARVMHALDGREGQLDEPAAVKTRLGWTIYGTYSSADDS FT IPSEVPRSFHICHHSNESDEKLHEAVKNYFALDSTGISAPKNQLLSKEDER FT ALAKLREITTFQNGRYEVGLLWKYDDVRLPNNRSMALRRHNCLAKRMERDP FT QLAESLRAKMIDYESKGYIRKLTSDESVKSGDRTWYLPIFPVFNPNKPGKV FT RIVFDAAAEFAGVSLNSVLMKGPDQLNALPPVLYKFRERLIGLGGDVVEMF FT HQMRMNPEDADSQRILWFASEDAMKPCDYVMQVVTFGATCSPSTALYVLNE FT NATKYETRYPLAVDAICRRHYVDDMLTSVDTEEEAIELANEVRYIHHQGGF FT HMRNWVSNSPAVVAALEEDPKSEKSMEMNAELAMEKVLGMWWSTTADVLCY FT KLCTDRNRELLTGAKHPTKRDVLRTLMAIFDPLGLIAHYLMYLKVLLQEIW FT RAKTEWDEKINEKHLEKWLTWLRILPELESVEIPRCYFRHESRIDNATVKL FT HTFVDASENGYAAVSYFRFEKNGYIECALIGSKTRVAPIKFVSIPRLELQA FT AVVGARFAKSIAEGHSIRIDRRYFWTDARDVMCWLQSDHRRYSQFVAFRVG FT EILESTNVTEWNWLGTKHNVADDGTKWRFKPDLKASSRWFTGPQFLWKSKD FT EWPGTSLSDEKTVTEIRQNLLIHSKCEARSILQSENYSSWKRLHRVGEILH FT VSRLLFYRFIRNIRQRQHGKPVVFGPLTQEELFVAEVFLIQQAQSDEYGEE FT LAVISKGDRHLNKKNRLYKLCPFIDDQGVMRIHSRLGECDFVDESERQPIV FT LPRNHPTTRLIVADVHRRYHHQCRENCVNEVRTRFYIPRVRRVCDQVRRSC FT QLCKILGARPAPSAMGPLPKARVAAFVRSFSYVGVDFFGPLHVLVGRRHEK FT RWGVIVTCLTTRAIHLELAASLNTSSCIMALKNCFARRGTPLEIRSDRGTN FT FVAAEKELKAALQELNQDRLMTEFTTPTTSWRFNPPASPHMGGCWERLIQS FT VKKILAVVKPQRVPTEEVLRSYLMQVENIVNSRPLTHVPVDDHSSPALTPN FT HFLVGSSDGSKPFVTYTDCPVRLQQSWKASEALANRFWQRWVAEYLPTITR FT RTKWFQRVKPIAVGDVVIVVDPDLPRNYWPKGRVVSVKTSSDGQVRSAVVQ FT TAAGIYNRPATKLAVLDVGANESELDQSPATGGD" XX SQ Sequence 6116 BP; 1626 A; 1488 C; 1522 G; 1480 T; 0 other; acagtttaaa attttcgttt actggatcaa tctacgatgc ccagaaggaa ccgccgtcgt 60 aaccatgtgc cggaggatag cattcagcaa acgtgcttgc tttgcaacct tccggatagc 120 agtgagatgg ttgcttgtga tagatgcgcg cagtggtttc acttcgactg tgccggagtc 180 aacgaagatg tggctaatca atcttggagc tgcccggatt gctcacgagc tagagatccg 240 gctccaccgc tgccactgtc atcgacctcg ccagaagtag taacaggtga gccactaccg 300 tccatatcgc ttcagtcgtt ccatccttca tcacatattc gaaccacacc cacgacaaga 360 gtttctccag tgcctccgca tcgaacacca accccgcgcc ctcgcacatc gggattacaa 420 attcttcctc caccaacacc tcttccgcgt agtccacgac cgcaaattgt actgcctcct 480 gccgttgggg ctccggaagt gccaataatg aatcaagaag tgctagcaga agcagatctc 540 gttaatagac tgccagctga cctacggatc cagtttttgg agcagcagca agctatcgag 600 caacgatatt tgctccgtag attccaattg ttgttggagt ctccaatcaa cgcgaattac 660 gctagagggg aaaatgagat cgtaccgata aatcaagccg tcggtcattc tccagtgtgt 720 cattcatcac cagtgtgtcg caatcagata cctccccgtc ccgcttcaca gaacagaatt 780 gaatcgccat tcgttccgat ctcacaacct catcgatctg ttaattttcc cgtcacagtc 840 agcgatcaac caagtgtgtc aatgattcca ccgccacaag ctaatgagca agcaaatttt 900 ccagcgtttc attcgaatcc aaccgtacac cgtcccatcg cttcgacgtt cattccacct 960 caggctgata gttgtcccag tgctcctgaa ttttcgcatc cgtgtggtaa tcaacgtggg 1020 caacaatctg cgttcgctcc gcttatgggt cgcatgcatc ctcccgctca ggtgttttat 1080 ccttcggcct ttcacgacga acctaccata gtaggtggaa cggcgctact gaacaaaagt 1140 cagattgccg ctcgccatgc agtgccaaaa gaactgccat tgttcgctgg tgaaccggag 1200 gaatggcctc tgtttttcgc ctctttcgag aatacgactc acttatgtgg attttcagct 1260 gaggagaata tggttcgtct acagaagtgt ttaagaggaa aggcattgga ggcggtgaaa 1320 tgccaactgt tgcaccctag caacttgaat caagttctcg ccactctgaa aatgctattc 1380 ggcggccgtc cggaaattat tgtgcactcg ctgctgcaga aaatcaacaa tctgcccgca 1440 ccgaaagcag accgtttagg tacacttgtc gattttgcat tggccgtgcg aaatatggtg 1500 gctacagtaa aagtttgtgc actggaggag cacttatgca acctcacgct tttgcatagt 1560 ttgacagagc gtcttcctcc aatgattcgt ttaaactggg ctacccatcg acagtccctc 1620 cactccgtga cactggttga gtttagtgat tggttgtaca agttggcgga agcagcaagt 1680 acagtgacga tgcctcagtt gtccggtaca gaggacaaca aatctcgtcg tagccgaagg 1740 atgatggatt tttgaacgca ctcgctgaag aatctccgaa atcgaagcat ttggaaacaa 1800 ttattggttg ccttgtctgt caggatagct gtgcagctgt agataagtgt aagcgatttc 1860 tgtcgttaga cctttccgct cgttgggacg ctctaaggga attcaagttg tgtcgtagct 1920 gtttgggcgt gcatggagga tcgtgcaagt tcgcaaagcc gtgcggcaaa agtggatgtc 1980 agtataaaca ccacatatta ctgcacaacg acgcaaagga taaaccagca gcaacgacgg 2040 tcggtcccag taatcgggcc agagatcctc cacaggaaac tgctgagaag aaacgcgagt 2100 cagagccatg caatacacat cgaggaggaa gtagagcggt attattccaa tgcattccaa 2160 ttatccttca caacaacgga attgagctac gtacacacgc gttcttagac agtggatcgt 2220 ctttgacgtt gatggaagaa ggtctagcca aacagttgga tctgaatgga gaaaaatatc 2280 cactttgtct gcggtggaca gctgatacct gccggtatga aaaggatgcg actatagtct 2340 ccctgaatgt ttctgggata tacgatggga gtagtcaaca caatctgaag gaagtctaca 2400 ctgtgaagga gttgaagctt ccttcccagt cattaccggt tgacaagctg accaacaagt 2460 ataatcacct taaagggctg ccgatagtat cctacagcaa cgttcagcca aagcttttga 2520 taggtgtgag caacgcgagg gtgatgcatg ccctggatgg ccgagaggga cagttggatg 2580 aacctgctgc tgttaagacg cgccttggat ggacgattta cggcacgtac tcgtcagcgg 2640 acgattcaat accgagcgaa gttccacgca gcttccatat ctgccatcac tccaacgagt 2700 cggatgagaa gctacacgag gcggtcaaaa actatttcgc actcgacagt acagggatca 2760 gcgccccgaa gaatcagctt ctttccaaag aagacgaacg agccttagct aaattgcgcg 2820 aaatcacaac gtttcagaat ggacgttatg aagttggtct actctggaaa tacgatgacg 2880 tacgcctccc caacaatcga tctatggcgt tgagacgtca caattgtctg gccaaaagaa 2940 tggaacgtga tccccagtta gccgaatcat tgagagccaa gatgatagac tacgaaagca 3000 aaggatatat tcgtaagctc acatcagacg agagtgtgaa gtctggagat cgtacctggt 3060 atctaccaat tttcccagtt ttcaatccca acaaaccagg aaaagttcgg attgttttcg 3120 atgctgcagc ggaatttgcg ggagtttctt tgaattccgt attaatgaaa ggcccggatc 3180 aactaaatgc tcttccacca gtgctataca aatttcgaga gcgactgata ggcctcggag 3240 gggacgttgt cgagatgttt catcaaatgc ggatgaatcc tgaagacgca gacagtcaac 3300 gtattctatg gtttgctagc gaagacgcaa tgaagccttg tgattacgta atgcaggtgg 3360 taacgtttgg tgctacttgt tcacccagta cggcgctgta tgtcctcaac gaaaacgcaa 3420 caaagtacga gaccagatat cctctcgccg tcgatgctat ttgccgccgc cattatgtcg 3480 atgacatgtt gacaagcgtt gatacagaag aagaggctat tgaattggct aatgaagttc 3540 gctatattca ccaccagggc ggatttcata tgcgaaattg ggtttcgaat tcgccagctg 3600 tagtagcagc tcttgaagaa gatccaaagt ccgagaagtc gatggagatg aatgcagaac 3660 ttgccatgga aaaggtcctt ggaatgtggt ggagtactac agctgacgtc ctttgctaca 3720 aactatgtac cgaccgcaac cgagagctat taactggcgc gaaacacccc accaaacggg 3780 atgtactccg tacgctcatg gccatctttg accccttggg actcattgca cattatctga 3840 tgtacttgaa agtgttgttg caagagattt ggagggcaaa aaccgaatgg gacgaaaaga 3900 tcaacgaaaa acatcttgag aagtggctga cttggttgcg catactgcct gaactggagt 3960 ccgttgagat acctcgctgc tactttcgac acgaatcgag aatcgacaac gcaacagtaa 4020 aacttcacac ctttgtggat gccagcgaga acggctacgc tgcagtgtcc tatttccgtt 4080 tcgagaagaa tggttacatc gagtgcgctc tcatcgggag taaaacaaga gtagctccaa 4140 tcaagtttgt atccattcca cgtctggagt tgcaagcagc ggttgttggt gctcgtttcg 4200 cgaaaagtat agcagaagga cactccatca gaatcgatcg tcgttatttc tggacggacg 4260 cacgcgatgt catgtgttgg ttacagtcag atcaccggcg ctattcacag ttcgtggctt 4320 ttcgcgtcgg cgaaatattg gaatcaacta atgttacgga gtggaattgg cttggcacca 4380 agcacaacgt cgccgacgat ggtactaagt ggagattcaa gccagacctc aaggcctcaa 4440 gtcgttggtt taccggacct caatttctat ggaagtcaaa agatgaatgg cctggaacgt 4500 cactgagcga tgaaaaaacc gtaaccgaaa ttcgtcaaaa tcttctaata cactccaaat 4560 gcgaagctcg atccattctg caatcggaaa actactcttc ttggaaacgc ttgcatcgtg 4620 taggtgaaat cctacacgtg tcacggcttt tgttttatcg cttcattagg aacatcagac 4680 agaggcagca tggaaagcct gtggtcttcg gaccgctcac acaagaagag ctctttgtcg 4740 cagaagtgtt tctgatacag caagcccagt cagacgaata cggtgaagaa ttggcagtta 4800 tatcgaaggg cgatcgacac ctaaacaaga agaatcggtt gtacaaactt tgtcccttca 4860 tcgatgatca aggtgtgatg cggattcaca gccgattggg tgaatgtgac ttcgttgacg 4920 agagcgagcg ccagccaatc gtgcttcctc ggaaccatcc aactacgaga ctcattgtcg 4980 ctgatgtgca tcggcgatat caccatcaat gccgcgaaaa ctgtgtgaac gaagtgcgta 5040 ccagattcta cattccaaga gtgcgtagag tttgtgatca agttcggcga agttgtcagc 5100 tatgcaagat tctcggtgct agaccagcac catcagcgat ggggcctctt ccgaaagctc 5160 gcgtggcagc ctttgtgcgg tcattctcgt atgtgggtgt cgacttcttt ggccctcttc 5220 acgtcctagt tggccgtcgc catgaaaagc gctggggtgt gattgtcacc tgcttaacaa 5280 cacgagcaat acacctcgag ttagccgctt cgttgaatac aagttcgtgc atcatggcac 5340 taaagaattg cttcgcccgc cgtggaacac cattagagat cagaagcgac cgtgggacca 5400 atttcgtcgc tgcagagaag gaacttaagg ctgctctgca agagctaaac caagacaggt 5460 tgatgactga gttcaccact ccgacaacat cgtggcgttt caacccccca gcatcaccgc 5520 acatgggtgg gtgctgggaa cgtctcattc agtcagtgaa gaagatcctg gccgtcgtca 5580 aacctcagcg ggtgcctaca gaggaagtgt tacgaagcta tttgatgcag gttgagaaca 5640 tagtcaacag tcgtccactg acacacgttc ccgtggatga ccactcatct ccggcgctaa 5700 ctccaaacca ttttcttgtg ggttcatcag acggatccaa gccgtttgta acgtacacag 5760 attgtccggt gaggctacag cagtcttgga aagcatcgga ggcactggca aatcggttct 5820 ggcaacggtg ggtggcggag tatctgccta caataactcg acgcactaaa tggtttcaac 5880 gtgtgaagcc aatcgcagtg ggagatgtcg tcattgtggt agatccagat ctgcctcgta 5940 actactggcc caagggacga gtagtatcgg tgaagacttc ctcagacgga caagtgcgtt 6000 ctgcggtagt gcaaactgct gcaggaatct acaacagacc ggcgaccaag ttagctgtgc 6060 tggacgtagg tgcaaacgaa agtgagctgg accagagccc agctactggg ggggac 6116 // ID LOA_Ele3 repbase; DNA; INV; 5746 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A LOA clade non-LTR retrotransposon family from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; Lian; KW LOA_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5746 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5746 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 344..1918 FT /product="LOA_Ele3_1p" FT /translation="MSANNQTKIKGVDCLPQFEKGDDCLSQSDETETENAF FT GKLGLTEDELLSSSHENMETDLPLTLPNDPGLHPSVEPMDDEEDGITVTIN FT LSGSQPSITGSNVTSGPIDGKTDQHPTDDANKQKPTGTKKITRSQRKQLKA FT LRQSGLSRPEALSRITGGEAMVSTPSKRTRQDLDKSTNAEEEHKQKRMKQH FT LNPRKRVGQQETNSSTASTINKPPPEDKKQTSLSYGEITSRRRVGIIPKDF FT PTTQLSTTQLDVLQEALLLRVEQQRNATMKPKFSNLIYKSGHMVLICKDQE FT TAEWVKEITPALNPLEGVELVAMDEDKIQRPELIRAFFPQSAQYTDDRIKA FT LIESQNDLITNNWRVMQRLTPNNKHVVWVFNVDGPSMEKLIQSKFILNFRF FT GEIQLRKVKSTTPNSNDNPTGQPTQEKSEEASSSIPQPTPTIQASLSSSDG FT KDAYLTSLPGPSGVTSMAPNKGSGNVKSVKLTTGDKPAGLGKGKDKNQNLR FT HPPKSKVDDPQHPKKDGKRPENAERLSND" FT CDS 1854..5573 FT /product="LOA_Ele3_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MIRNIQRRTANVRKMRNASAMIKILQVNLHHAQCATD FT VLCRRFTKEHLSVALIQEPWVNKTRIQGIQLNSCRLVYDDSQLSPRAAILI FT RNDTKCFPITEFIKRDIVAVRMEVSTARGSTEIIMVSAYFPGDAEDIPPPE FT MAALVSYSQKHNIPFVIGCDANAHHLVWGSTNTNIRGEHLLQFLSSKNIDI FT CNVGDKPTFENILRQEVLDLTLCSQSISDKIKNWHVSNEISMSDHKHIVFE FT WEGGLMIQKSFKDPKKTDWETYSAILRSEDYIIEPNIQTIIQLEAASDSIK FT NKILNAYQESCPIKTVSSNRDVPWWNFNLDKLRKTARKEFNRAKRTSDWSL FT YRKALTEYNKEMRRAKRKSWVLMCESIEKTPVAARLHKTLSKDHSNGLGSL FT QRTDGSLTVEPRETLSEMLRVHFPDSIPQTSLNAEGARLDVSVPHGFLSGD FT SGAKRDAIKVAKEAFTRDRVVRAVRSFEPFKSAGMDGIFPALIQKEEETLI FT PHMVEIFKASLVLGHIPNDWRQVRVVFIPKAGKKDKTNPKAFRPISLSSVM FT LKIMEKVLCEYIDSKFMKTMPLSKSQFAYQSGKSTVSALHTLVNKIEKTFN FT AKEIALIAFLDIEGAFDNASYSSIGSAMLRRNFDPCIATWVHAMLANRQIS FT SELSDSRITVMATRGCPQGGVLSPLLWSLVVDELLDSLERRGFEVVGYADD FT VVIIVRGKFDSVISSRMQIALNHTLSWCQKEKLGINPSKTTIVPFTKRRKV FT QLHPLFLNQIQLVYSNEVKYLGITLDAKLNWNTHLQTIINKGLNSLWVCSK FT TCGKTWGLKPSMIMWIYKTIVRPRITYASLVWWPKTKEATARAKLNKIQRT FT ACIAITGAVRSTPSFALDAILNLPRLDQFIKLDAEKSALRLKRSTVLLSGD FT LTGHLSILNEFSINPIVEKCSDWMEKVVNYDSPFTVVFPSREEWEGGGPTI FT SPGSIKFYTDGSKMNNLTGSGVYGPKTKISVSLGQWPTVFQAEVYAIKECA FT QLCLKRNYRHATICIFSDSQAALQSLKAFTCNSKLVWECILALKSLAERNR FT VKLYWIPGHTGLEGNEIADQLARNGSTNMFIGPEPFLGISNSALNTELNNW FT LFGQIQSNWNTVSNANQSKRFVTINTTQTQKLIGLNKRDLRTYIGLITGHC FT PSRYHLYKIGVVQNTNCRFCDETDETSQHLLCSCSAHIHRRFKIFGKHYLQ FT PADIWNASPREVVSFIRLITPDWGNYNTAT" XX SQ Sequence 5746 BP; 1833 A; 1308 C; 1239 G; 1366 T; 0 other; ggctgcttgg tcgttcatag aagttaatat tcaatgttat cttcttttcc aagcagccta 60 gatagccgtg tagtgtcggt agtggttgtc tcaactggct aagaataaca ctacggattg 120 cctgatcctg tagtaaaaat ctactaatca ggtaacccca atcccaaggt gtgatgcgac 180 ccgtgctgat ggatgaatgg ttgagggggt ttaaaatatg ctcaatcgct aacggagcct 240 ggagagcacc agggcgaact ctccagtatg tagctcttac tgcattaggg cggggcaatg 300 atgcagcgga ccgtctttcc ccagcgactc gtgggaccaa aaaatgagtg caaataatca 360 aaccaaaata aaaggtgtcg actgcctccc tcagttcgaa aaaggtgacg actgcctctc 420 ccagtcggac gaaacggaaa cagagaatgc ttttggtaag cttggactga ccgaagatga 480 gcttctcagt agttcgcatg aaaacatgga aaccgacctc cctctgacac tcccgaacga 540 cccaggactc catccttcgg ttgaaccaat ggacgacgaa gaggatggaa ttacagtcac 600 aataaacttg tcaggatcgc agccatcaat cacaggatcg aacgtcacct cgggtcctat 660 tgatgggaag accgatcaac atccaacaga tgacgcgaac aaacaaaaac ctactggcac 720 caaaaagatc actcggagcc aaaggaagca gctaaaagca ctccgacaaa gtggcttaag 780 ccgccctgag gccctatcca gaatcacggg gggtgaggct atggtgtcaa ctccttcgaa 840 acgcactcgg caggaccttg acaagtctac aaatgctgaa gaagaacata agcagaaacg 900 gatgaaacaa cacctgaatc cgagaaagcg cgttgggcaa caggaaacca acagctcaac 960 agcctcgacg atcaacaaac ccccaccaga agataaaaaa cagactagtc taagctacgg 1020 tgaaattacc agtcgcagga gagtcggaat aatcccaaaa gacttcccca cgacccaact 1080 ttcaacgact cagctggacg ttctccagga ggcactgctg cttcgggttg aacaacaacg 1140 gaacgcgaca atgaagccca agttctctaa ccttatctac aagtctggcc atatggttct 1200 tatttgtaag gatcaggaga ctgcggaatg ggtaaaggag ataacacctg cattaaatcc 1260 cctggaaggc gttgagttgg ttgcaatgga tgaggataag attcaacgtc cagaactgat 1320 tcgagccttc ttccctcaaa gcgcacaata caccgatgat cgtatcaaag ctctcatcga 1380 gagtcagaac gatctgataa ccaacaactg gcgtgtcatg caacggctca ctcctaacaa 1440 caagcatgtg gtgtgggtct ttaatgtaga tggaccgtcc atggaaaaac tcattcaatc 1500 caagttcatc ctcaacttcc gctttggaga aatacagttg aggaaagtaa agagcacaac 1560 cccaaactcc aatgacaatc caactggaca gccgacccaa gagaaatctg aggaggcctc 1620 tagtagcatc ccacaaccaa ctcccaccat acaggctagc ttgtcttcct ctgatggaaa 1680 agatgcatat ttgacctcac ttcctggccc aagcggtgtc acctcaatgg ctcccaataa 1740 gggcagtgga aatgtcaaaa gtgtaaaatt gaccacaggg gataagccag caggcctagg 1800 taagggaaag gacaaaaacc aaaatctaag acatccccca aaatcaaaag ttgatgatcc 1860 gcaacatcca aagaaggacg gcaaacgtcc ggaaaatgcg gaacgcctca gcaatgatta 1920 aaatcctcca agtgaatctc catcatgctc agtgcgcaac agatgtgctt tgcaggagat 1980 tcacaaaaga acacctttcc gtggcactga ttcaagagcc atgggtcaac aaaactcgaa 2040 tacaaggtat tcaactaaac tcgtgtaggt tggtatatga tgacagccag ctctctccca 2100 gagcagctat tctaatacgc aatgatacta aatgttttcc aattacagaa ttcatcaaaa 2160 gggacatcgt ggcggtcagg atggaggttt ctactgctag gggcagtact gagatcatta 2220 tggtttcggc gtatttccct ggcgacgcag aagacattcc tcctccagag atggctgctc 2280 ttgtctctta cagtcaaaaa cacaacatcc ccttcgtcat cggttgtgac gccaatgctc 2340 atcaccttgt atggggaagt acaaacacaa atatcagagg tgagcatctt ttgcagtttc 2400 tttcctccaa aaatattgac atatgcaatg tcggtgataa gcctacattt gaaaacatct 2460 tacgacaaga agtccttgat ctgactttat gcagtcaatc catctcggat aaaataaaaa 2520 actggcatgt ttctaatgaa atatctatgt cagaccataa acatatagtc ttcgagtggg 2580 aagggggtct aatgattcaa aagtcgttta aagatcccaa gaaaactgat tgggaaacct 2640 actcagctat tctccgttct gaagactata taatagagcc gaatattcag actattatac 2700 agttagaagc agcttcggat tctattaaaa acaaaattct aaatgcttat caagagagct 2760 gtccgatcaa aacagttagt tcgaacagag atgttccatg gtggaacttc aaccttgata 2820 aactcaggaa aactgctcgg aaggaattca accgagccaa acgcacttct gattggagtc 2880 tataccgaaa ggctctgaca gagtataaca aagaaatgag acgagcaaaa cggaaatcat 2940 gggttctcat gtgtgaaagc attgagaaaa ctcccgtagc tgctcgactt cataaaactc 3000 tttcgaagga tcactccaat ggtttgggaa gtcttcaaag gactgacggt tcactcactg 3060 tggaacctcg tgaaacactg agtgaaatgc taagagttca ctttcctgat tcaatcccac 3120 aaacgagtct aaatgctgaa ggtgccagac ttgacgtctc agttccacac gggttcctat 3180 caggggactc aggagcaaaa agggacgcaa taaaggttgc caaagaagct tttactcgcg 3240 atagggttgt tagggcagtg agatctttcg agccattcaa atctgctggc atggatggaa 3300 tcttcccagc gcttatccaa aaagaggaag agacactgat tccacacatg gtagagattt 3360 ttaaggcaag tttagttctg ggacacattc caaatgattg gcgtcaagtt cgagttgtct 3420 ttattccaaa agcaggaaaa aaggacaaaa ccaaccctaa agcattccga ccgataagcc 3480 tatcctcggt aatgcttaaa atcatggaga aggtattatg cgagtatata gattctaaat 3540 ttatgaaaac tatgcctctt tctaaatccc aatttgctta tcaaagcgga aaatcaacgg 3600 tctcagcact gcacacgcta gtgaacaaga tcgagaaaac ctttaatgca aaagaaatcg 3660 ctcttatagc atttcttgat attgagggcg cgttcgataa cgcttcttat tcgtctatag 3720 gttcggcaat gttgaggaga aacttcgacc catgcattgc aacctgggta catgctatgc 3780 tagcaaatcg acagatctca tctgagctga gtgattcgcg cattactgta atggctacaa 3840 ggggatgccc tcaaggggga gtactatctc ccttattgtg gtcattagtg gtggacgaac 3900 tgctagatag cttagaaaga aggggttttg aagttgttgg atacgcagat gacgtagtca 3960 ttattgtacg aggcaaattt gacagcgtta tttcatcaag gatgcaaatt gctctcaatc 4020 acacactctc ctggtgtcaa aaagagaaac taggaataaa cccttcaaaa acaacaattg 4080 taccgttcac aaaaagacga aaggtacaac tccaccctct tttcttgaac caaatacaat 4140 tagtttactc aaatgaggtc aagtatcttg gcatcacact tgacgcaaaa ctaaattgga 4200 acacccatct tcaaacaatc ataaataaag gtctcaattc actctgggtt tgctcaaaga 4260 cctgtggtaa aacatggggt ctaaaaccca gtatgatcat gtggatctat aaaacaatcg 4320 ttcggcctag aataacctat gcgtcccttg tttggtggcc taagacaaag gaagctacgg 4380 ctagagctaa gctgaacaaa attcaacgta ctgcctgtat tgccataacc ggtgcagttc 4440 gtagtacccc ctcgtttgcc ctagatgcta tactcaatct gccccggctg gatcaattca 4500 taaagctgga tgctgagaaa agtgctcttc ggctaaaacg atcaacagtc ctactgtcag 4560 gggacttaac aggtcacctc agtatattaa acgaattttc gataaatcct attgttgaaa 4620 aatgtagtga ctggatggaa aaagtggtaa actatgactc gccattcacg gtggtctttc 4680 cttctcgtga ggaatgggaa ggaggtggac ccaccatttc accgggatct attaaattct 4740 acacagatgg ttcaaaaatg aataatttga cagggtctgg agtatacgga ccgaaaacca 4800 aaatctctgt ctctcttgga cagtggccta cagtatttca agcagaagtc tatgcaatca 4860 aggaatgtgc gcagctgtgt ctgaaaagaa attacagaca tgccaccatc tgtattttct 4920 ctgatagtca agcagcgctt caatctttga aggctttcac ttgcaactca aaacttgtgt 4980 gggaatgcat tcttgcactg aaatccctag ctgaacgcaa tcgagttaaa ctatattgga 5040 tcccaggaca tacgggtcta gagggtaacg aaattgccga tcagctagca aggaatggat 5100 caaccaatat gttcattggt ccagagccat tccttggcat ttcaaactct gcactaaaca 5160 cggaattgaa caactggttg ttcggtcaaa ttcaatcaaa ttggaataca gtttccaatg 5220 cgaatcagtc caaaaggttc gtaacaataa acacgacaca aacacaaaaa cttataggtc 5280 tcaacaaaag ggatctcaga acatacatcg gtctaataac tggtcactgc ccaagcagat 5340 accacttata caagatcggt gtcgtccaaa acacaaattg ccgtttctgt gacgagacgg 5400 acgaaacctc acaacacctt ctctgctctt gcagtgcaca catccatcgt aggttcaaaa 5460 tatttggcaa gcactactta cagccagctg atatttggaa cgcatccccc agggaggtgg 5520 ttagctttat taggctgatc acgccagatt gggggaacta caacactgca acctaggact 5580 tctgcccatc aatggcagat ggttcaaagt tcagtcaaat gcgcaaagta ttccggaggc 5640 acatttgcta ctggaaaaca ttgtgtacat agagtaatag ggtatatcac aatagtccta 5700 aaaaatggac gcagtgatcc cacacccgac agaagaagaa gaagaa 5746 // ID BEL-12_DWil-I repbase; DNA; INV; 6536 BP. XX AC scaffold_181136; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_DWil_; KW BEL-12_DWil-LTR; BEL-12_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6536 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181136; Positions 1749297 1755832. XX CC 'AATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 537..1940 FT /product="BEL-12_DWil-I_3p" FT /translation="MGDTRVRKGTTMTTRHSPIRRSPRLNALEALQEPDAA FT IDLPVDTGENATDEAPTANMNIESSIVAGIPEAVGADEHQETVCSAQPSGF FT QEARLRARIAELERQLQMVTRPATRAEGPPSLITDTMPATHRLESDRPING FT LQPPLINTRPELPSTIGTMPLISPNTVGGWMTRKITELPEFSGLPEDWTIF FT YCAFTETNSVYGCTDLENNQRLVKALKGDARETVKSLLIHPRNVNAVINQL FT RFCYGRPEQLVRSQLQSVRDVQPIQEHNLAKIIPFATRVSNLAAFLQSINE FT SRHLANPTLLDELTSKLPLDKRVDWARRAASIKPYPTVIDFSVWLEELASV FT ICTIVDSDPREGKRRILYTKFEEETRVEQHGQPCPNCYGSHDITECAEFRL FT ASPTARWRLVKKYRLCFLCLSPGHVVRKCTTKSRCHIGECGRAHHNLLQRS FT HKSTKILLWASGAARTQSASKRA" FT CDS 2876..4948 FT /product="BEL-12_DWil-I_2p" FT /translation="MIKTQDLKYELKNVHAVDGLDLPMQTLNQDYLQVINP FT QSRFPVEAYTNAVPRILIGLDHKYLVVPSSTWETSNKGPYVAVTRLGWDVS FT GTFHKHQHTSAPKPCLFAATDTDEVHKMVSEYFELENLGVKPTPPAPSDDD FT IRAQSILNETTVELLGRYQTGLIWKRDNVNLPSSYKMALNRLETVERKMER FT DKQFAQEYKGMINDYISKGYARRVEREEDTAVGSGKIWYLPHFAVTNPNKP FT GKIRLVFDAAAKVGNVSLNSELLKGPQHYRPLLSVLFHFREGAVAVIGDIK FT EMFHQIYVQPQDRRSQRFLWRDGDRQREADVYEMQVMTFGAACSPSAAHHV FT MTLNAMRHQQVDPRAVKAITDYHYVDDYVDSFATEDEAISVSARVRDIHKT FT GGFELRRFQSNSPKVVESLGQEGTTNPVGWAEAEEKILGLYWQPATDEFRF FT NVKYQRIKDDVPTERQFLSLIMSTFDPMGFLSCFTITGKLLLREIWRRNVL FT WDEPLPKELNSAFQKWRHQLEGVGLFRCPRYYFGSEIVKSIQLHVFTDASE FT SAYAAVAYWRATYQNGDVQVSFVCGKTKCAPMRTMSVPRLELQAAVLGTRL FT MNAVKSGHCMETEDAILWTDAKMVLRWICSTHRRYKQFVGNRVTEILESTD FT ASQWRWVPTADNVADEATRPQGQVDFSQKSRWMCGPKFLQQP" XX SQ Sequence 6536 BP; 1850 A; 1544 C; 1710 G; 1432 T; 0 other; tcttttattc gcagtcctat tagatggtct ttacaacgtc gcaactggcg acgaaaccaa 60 gaataaccac agtgggttat atgtgcagtc ctattggatg gactattgag cgtcgcaacc 120 ggcgacgaaa ctaagaataa ccacagtggg ttatatgtgc aggcctattg gatggactat 180 tgaacgtcgc aactggcgac gaaaccaaga ataaccacag tgggttatat gtgcagtcct 240 attggatgga ctattgagcg tcgcaaccgg cgacgaaact aagaataacc acagtgggtt 300 atatgtgcag tcctattgga tggactatta agcgtcgcaa ccggcgacga aactaagaat 360 aaccacagtg ggttatatgt gcagtcctat tggatggact attgagcgtc gcaactggcg 420 acggaattaa gaatcactac agtgagtgat aaagctacag tcctagcgaa ggaacgagtc 480 aacgtcgaac gcggtgacat acatacattg ctgaagtcac cagtggtatc accaaaatgg 540 gtgatactcg tgtcagaaag ggcaccacca tgacaaccag acattcccca atccggagga 600 gtccccggct gaatgcactc gaggcgctac aagaaccaga tgccgcaatc gacttgcccg 660 tagacaccgg agagaatgca actgatgagg cgcccaccgc aaatatgaat attgagtcca 720 gcatagtggc tggaataccc gaagctgtcg gcgctgatga gcatcaggag accgtgtgca 780 gcgcacaacc tagcggattc caagaagcaa gactgcgtgc gagaattgcc gagttggaaa 840 ggcaattgca aatggtaaca aggccggcga cacgtgcgga gggtccgcca tctttaatta 900 cagatacaat gccagcaacg cataggttag aatcagatcg tccgattaat ggactacagc 960 caccactaat aaacactcgc cctgagctgc catcgacaat tggaacaatg cccctgatct 1020 caccaaacac ggttggtgga tggatgacta gaaaaatcac ggaactgcca gagttttcgg 1080 gactaccaga agattggaca atattctact gcgcgtttac tgaaacaaat agtgtatatg 1140 gatgcacgga cttggagaat aatcagcgtc tggttaaagc tcttaagggc gatgcccgtg 1200 aaactgttaa atctctgctt atccatccca gaaatgtgaa cgcagtcata aatcaactaa 1260 gattctgtta tgggcgtccg gagcagctcg tacgcagtca gcttcaaagc gtgcgtgatg 1320 ttcaaccgat acaagagcac aatttagcaa agattattcc attcgctacg cgcgtgagta 1380 acctggctgc gtttttacaa tctatcaacg agagccggca cctggctaat ccaaccctgt 1440 tagatgaact aacctcgaaa ttgccgctgg acaagagagt ggactgggct agacgcgccg 1500 catctataaa gccctacccg actgtcatcg actttagcgt ctggttagaa gaactagcta 1560 gcgtaatatg cactatcgtg gactccgacc ccagggaagg caaacgacga atactgtaca 1620 cgaagtttga agaggagacg cgcgtcgagc aacatggcca accctgtcca aactgttacg 1680 ggtcacacga tataaccgaa tgtgcggagt ttcgtcttgc ttcaccgact gcgagatgga 1740 gacttgtcaa gaagtaccgt ctatgtttct tgtgcttgag tcccgggcac gtagttcgaa 1800 agtgtaccac caagtccaga tgccatatag gtgaatgcgg gcgagctcac cataacctcc 1860 tacaacgcag tcataaatca actaagattc tgttatgggc gtccggagca gctcgtacgc 1920 agtcagcttc aaagcgtgcg tgatgttcaa ccgatacaag agcacaattt agcaaagatt 1980 attccattcg ctacgcgcgt gagtaacctg gctgcgtttt tacaatctat caacgagagc 2040 cggcacctgg ctaatccaac cctgttagat gaactaacct cgaaattgcc gctggacaag 2100 agagtggact gggctagacg cgccgcatct ataaagccct acccgactgt catcgacttt 2160 agcgtctggt tagaagaact agctagcgta atatgcacta ttgtggactc cgaccccagg 2220 gaaggcaaac gacgaatact gtacacgaag tttgaagagg agacgcgcgt cgagcaacat 2280 ggccaaccct gtccaaactg ttacgggtca cacgatataa ccgaatgtgc ggagtttcgt 2340 cttgcttcac cgactgcgag atggagactt gtcaagaagt accgtctatg tttcttgtgc 2400 ttgagtcccg ggcacgtagt tcgaaagtgt accaccaagt ccagatgcca tataggtgaa 2460 tgcgggcgag cccaccataa cctcctacat gatttcgagt ctaattacag gcaggcttca 2520 atgtcaaagg ctggacatgg aagccccacg aggcctcaac cagtacccaa caatggcaag 2580 cagccctcac cggaaccaag aagccaacag ccaactcaac agatgaactt aagttgtgtt 2640 gacactaaga caacgcaact attgtttcgg atcctgccag tgacgcttta tgggtcgaac 2700 cgtcgagtag agacgtacgc actgcttgac gaagcatcat cggtaaccat gatcgatgaa 2760 gtgctactca aggatttggg cctgaatgga gaacgcagtc aacttgcagt taaatggttc 2820 aaaaatatga caacccaaga gccaacaaag aggataaact tacacattag tggacatgat 2880 aaaaacacaa gacttgaagt acgaattgaa gaatgtgcat gcagtggatg ggttagatct 2940 gccgatgcag accttgaatc aggactacct ccaagtaatc aatccgcaat cacgtttccc 3000 tgttgaggca tatactaacg ccgtgcctag gatcctaata ggcctggatc acaagtacct 3060 tgtcgtgcca tccagcacct gggaaacctc caataaggga ccgtacgtgg ccgttactag 3120 gttaggatgg gatgtttctg ggacgttcca caagcatcaa catacatctg cgccaaagcc 3180 atgcctattt gcagcaacgg acaccgatga agttcataag atggtcagcg aatactttga 3240 actggagaac ctgggggtga agccaactcc accagcccct tcagatgacg acatacgggc 3300 acaaagtatt ttgaacgaga cgacggtaga attactcggg cggtaccaga cgggattaat 3360 ctggaaacgc gacaacgtaa atctaccttc cagttataaa atggcgttaa ataggttgga 3420 aactgttgaa agaaaaatgg aaagagataa acagtttgcc caagagtata aaggaatgat 3480 aaacgattac ataagtaaag gatacgctcg acgtgtagag cgagaagaag acactgcagt 3540 cggcagcggc aaaatatggt atttgccgca tttcgcagtg acgaacccga ataagccagg 3600 gaaaattcgg ctagtctttg acgcggctgc caaagtgggc aacgtgtcac tgaattcaga 3660 gctattaaag ggaccacagc actataggcc gcttctttct gttctattcc atttcagaga 3720 aggagctgtt gcggtcatcg gagacatcaa agaaatgttc caccagatct atgtacagcc 3780 ccaggaccga cgttctcaac ggttcctgtg gcgggatgga gatagacaac gagaagctga 3840 tgtgtacgag atgcaggtga tgacctttgg agctgcctgt tccccaagtg ccgcacatca 3900 tgtcatgacg ctaaatgcga tgcgtcacca acaagtagac ccacgagcgg tcaaagctat 3960 tacagactat cattatgtgg acgattatgt ggacagtttc gccactgagg acgaagctat 4020 atcagtctct gccagggtca gagacataca caagactgga ggatttgagc tacgccgatt 4080 tcaatccaac tcacctaagg ttgtggagtc cttgggtcaa gaaggaacca ccaacccagt 4140 tgggtgggct gaagcagagg agaaaattct tggattgtat tggcagccag cgaccgatga 4200 atttaggttt aacgtgaagt accagaggat caaggacgat gtgcctaccg agaggcaatt 4260 cctcagtcta atcatgtcaa cgttcgaccc gatggggttc ctgagttgtt tcaccattac 4320 cggcaagtta ttattacgcg agatttggag aaggaatgtt ttatgggatg aaccgttgcc 4380 aaaagaactc aacagcgcct tccagaaatg gcgccatcag ctagaaggag tgggtctgtt 4440 ccgatgcccg cgttactact ttggatcaga aatagtgaag tcaatacaat tacatgtctt 4500 cactgacgcc agtgagtcgg cgtacgcggc tgtggcctac tggagggcca cctaccaaaa 4560 tggggatgta caagtgagtt tcgtttgtgg taaaacgaaa tgcgctccga tgagaaccat 4620 gtcagttcca cgactggagc tgcaggcagc ggtgctcggt acacgcttga tgaacgctgt 4680 gaagagtgga cactgcatgg agacggaaga tgccatttta tggacagatg ccaaaatggt 4740 gctgcgttgg atatgcagca ctcatagacg ttataagcag ttcgtcggta acagagtcac 4800 agaaatctta gaatccacag atgcttccca gtggagatgg gtccctaccg cggacaacgt 4860 ggcggatgaa gcgacgaggc cacaagggca agttgacttt agccagaaat cgcgatggat 4920 gtgcgggcct aaattcctac agcagccgta agatttgtgg ccccaatcaa ctaccgatct 4980 gacagacatc gaggttgcag ccgacgtcga agagtcaccg tgtgagtttg ccttggttgt 5040 tgctaacgac gaatttatac aatttcagag attctcggag tacagccgac tgttgaggac 5100 tatgacttgg gttctccgat tcgtgtgtcg gtgccgtcgt cagcagcacg aacatggaaa 5160 atatggcttg actgcagccg aatgtgcgga agccgaactc gtcttggttc gtgttgctca 5220 aagggaagcg ttcccagacg agttgcatcg cctatatgca aacagaccag tccagccgac 5280 gagtgagctg aagggactgc taccctacct tgacgactta gcgattctac gcgcatctgg 5340 gagaatcgat gcggccctgt gtttaccgta tggggcgaga aggcccatca ttctcacgca 5400 taggcaccca ctgacggatt tgatcgttcg tcactaccat gtaaggatga aacatcagaa 5460 tgtaaacgca actatcaccg agatccgcat gaagttctgg ataacgaaag tgcgccgggt 5520 actgcagcgg gttatccgga actgtggtgt gtgcaaactc cagcgtgcaa ccccaacccc 5580 acctttaatg ggtccgttgc ctgaagatcg cctaaaaccg ggttgatggc catttgagta 5640 cactggttta gactatttcg gaccactact tgtgacagtg ggtcgacgcc aagagaagag 5700 gtgggtggct ctattcacct gtttaagaac tagagcaatt cacctggagc tagcccacga 5760 tttgtcaaca gactcctgca tacttgcgat gagccgccga ggaccagtaa cgaagctgcg 5820 tagtgacaat gggaagaatt tcatcggcgc agacagggaa gccaaacggt ttaccgacgt 5880 ttttgaagca gacaagatac aggacgagct tgctacccat tgtgtggagt ggatttttaa 5940 ctgccctctc aaccccgcgg agggaggagt ctgggagagg atggtccgat gcgtaaaacg 6000 ggtattggct catacagtaa aggatgtggc cccgaaggaa cacgtattgc aaagcttgct 6060 cattgaggct gagaacattg tgaactcaag gccgctaacc catatgccaa tttcagtcga 6120 tcaagaggca cctttgacgc ccaacgacct actgaagggg acgtccaaca ttcctgatac 6180 acccgcggat agtgaagagt tgcctaaacc gtgtgaaacc aggaagcaat ggcgcatggc 6240 acgactactg agggaccggt tctggaagcg gtgggttcat gaatacctac cgacccttgt 6300 gcgtcgcgag aaatggtgcg cgcgcaccca gccaatcaga aaggatgatg tcgtctacat 6360 ctgtgaccct gcaacaagga gcaagggtat agtcgaggag gtatacaccg gagcagatgg 6420 gatacctcgg cgagcagcat aagagtcgcc gatggaaata gggttcacag ggtaatacgt 6480 cccgtctcca gactagcggt cctagatgtg atgcagtcgc ttcatggggg cgggga 6536 // ID Penelope-15_HM repbase; DNA; INV; 1516 BP. XX AC . XX DT 16-SEP-2009 (Rel. 14.09, Created) DT 16-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-1516 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1939-1939 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 189..1229 FT /product="Penelope-15_HM_1p" FT /translation="MEKIKKHFVKIFKSNDLLISIQCNMKIVNYLDVTFDL FT NNNSFQPYRKCDNELNYVHSDSNHPPSIIKRLPRTVELRLSATSVNESVFY FT NAIPPYEEALKKCGYNCKLKYQPQVTLTKNKNRKRKVIWYNPPFSQNVETK FT IGSRLLALIDLHFPVNHKFHKIFNRNSIKVSYSCMQNVKSIISSHNHKILH FT SDTNLNARTCNCITKTACPLNNQCLLSNIVYQATVIDENPNPKEKEKIYFG FT ISETPFKLRYANHLKSFNAVKYRNDTELSKEIWSLKEKNIKHTIKWKIIKR FT CKPIKTTSKICNLCLHEKFIILYHKENNLLNKKDEILSKCRHSNKFLLSSF FT DTGD" XX SQ Sequence 1516 BP; 577 A; 241 C; 205 G; 486 T; 7 other; tacaaatatn cgtggatata taaaaaaaat ggaatctttg atnttaaaat tgacgcttat 60 atatatgtgc ntaagtatgt gnattagtag gaatatntat atatatntac atatattata 120 atagagatga ttttggtcta tatcgtgatg acgggctagc gatatttaaa aacaaaaatg 180 gccagcaaat ggagaaaatc aaaaaacatt ttgtaaaaat ttttaaaagt aacgatcttc 240 ttatttccat ccaatgcaat atgaaaatag ttaactatct cgacgtaaca tttgatctca 300 ataacaattc ttttcaacca taccgtaaat gcgacaatga gctgaattat gttcactctg 360 attcaaacca cccacctagt attatcaaaa gactccctcg cactgttgaa ttgagattat 420 ctgccacgtc agtaaatgaa tctgtttttt ataatgctat cccaccttac gaagaagctt 480 taaaaaagtg tggctataat tgtaagctca aataccaacc tcaagtcacc ttgactaaaa 540 acaaaaaccg aaagcgaaaa gtcatatggt acaaccctcc atttagtcag aatgtggaaa 600 caaaaatagg cagtcgttta cttgcattaa ttgacctgca ctttccagta aaccataaat 660 ttcacaaaat atttaacaga aattcaatca aagttagtta tagctgtatg caaaacgtaa 720 agtctataat tagttcacat aaccacaaga ttttacacag tgatacaaat ttgaatgcaa 780 gaacttgcaa ctgcattaca aagactgctt gcccattgaa caatcagtgt ttattaagta 840 atattgttta tcaagctact gtgattgatg aaaatcctaa tcctaaagaa aaagaaaaaa 900 tatatttcgg tatcagcgag acaccattca aacttagata cgcaaatcat ttaaaatctt 960 ttaatgctgt gaaatacaga aatgataccg agctttctaa ggagatttgg agtttaaaag 1020 aaaaaaacat aaagcacaca ataaagtgga aaattataaa acgctgtaaa cccattaaaa 1080 caacatcaaa aatttgtaac ctctgccttc atgagaaatt tataatttta tatcacaaag 1140 aaaacaacct gctcaataaa aaagacgaaa tattatctaa gtgtcgacat tccaacaaat 1200 ttctactgtc ctcttttgat accggagact aaatagttac atcttttatt acgtcagaag 1260 acgttctacc gtaattttta attcatttgt aaaagatttt gtttttaaac ggttttttct 1320 aacggttttt ttatgtattt aatagctgat gattgccgtg tatggcacga aactttaagt 1380 actattaaca atgttgtttt tcatttaaac aaaatttaaa tatttattgc tctattatag 1440 naatattgag cactgtttta cgttcaagag tttttgtatt attttaatat aatttaatac 1500 ataaacaacc aatcaa 1516 // ID HAEIII-ECOR1_TC repbase; DNA; INV; 622 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Trypanosoma cruzi EcoR1 repeat region/HAEIII intergenic repeated DE element, consensus. XX KW EcoR1 repeat region; HAEIII-ECOR1_TC; HaeIII repeat region. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Souza T.R., Santos R.M. and Franco da Silveira J.; RT "Direct Submission to Genbank."; RL Direct Submission to Genbank (02-MAY-2002)Microbiologia RL Imunologia e Parasitologia, Universidade Federal de Sao Paulo - RL UNIFESP/EPM, Rua Botucatu 862, Sao Paulo, SP 04023 - 062, Brasil. XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Trypanosoma cruzi EcoR1 repeat region/HAEIII intergenic repeated RT element, consensus."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [2] (Consensus) XX SQ Sequence 622 BP; 98 A; 176 C; 165 G; 182 T; 1 other; ccgcgtgaga gtgagggaga gagagccgtg caccgcccgc acacacactc gtccttctct 60 cactctctgt gtgtgtcccc ttgcatggac tctcgccctc acctcacccc acgaagcaca 120 cacataatgt gcccattgta tatattgcat gcacacccat gcaggcccgt tggtgctctc 180 ttttgcacga cacgcaccgc tgccgtcctg cctggctgca tgggcggagc accgtaactg 240 tttctgtttt gtttttattg cattgagtgg gcgacctctc tctctctctc cctgtgtgtg 300 tgttttgctt tacgggagca atgagagtgt gtgtgcgctg gtcgtcccac gcttcggctt 360 tttgttttgt tttttattgt tttgtggttg agtgcaccca ttccgcgggc tgcgtgtgtg 420 tgtgtgtgat gatatgagta atttgcgtgt gtgctatctg cttctctctg ccgttgcttt 480 ctttggactc tcttgtgccg cacagctgct ctgcaagtat aggcgctgcc aaacacacaa 540 gcacatgcac acgccatgac gcacacgctg tggagcggcg tgatcgctga gaacgcacgc 600 acgcacarga ttgtggctgc gg 622 // ID Copia-8_AA-LTR repbase; DNA; INV; 247 BP. XX AC AAGE02018254; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_AA_; KW Copia-8_AA-I; Copia-8_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018254; Positions 21423 21177. XX SQ Sequence 247 BP; 59 A; 62 C; 50 G; 76 T; 0 other; tgttggtggg ggcgatattt tcacccgccg cctctgtgtc aacttctcga tcattgaaga 60 tcgatctcac acttgtgagc acgagagaca ataaatacaa tacttttata ttctatgaca 120 gtcagtgcta aatacaacac gttcatgtaa agtgtcccgg ttctattatt tacatccgac 180 gtttcccagt cttcccggta ttcgattcgg tccgccgcat agcttgttcc gtcgggagat 240 tccaaca 247 // ID Gypsy-5_DPu-I repbase; DNA; INV; 5548 BP. XX AC scaffold_64; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_DPu_; KW Gypsy-5_DPu-LTR; Gypsy-5_DPu-I. XX NM Gypsy-5_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5548 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 725-725 (2010). XX DR Genome; scaffold_64; Positions 348725 343178. XX CC Positions [4182-4676] - Integrase core CC 'CTGAT' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1773..5219 FT /product="Gypsy-5_DPu-I_1p" FT /translation="MARITIGSVQSRFRDRRTPTISVAILNANHQIIATID FT DVTPDPGAEVTVGGSDVLLALGLTESDLSSSSFDLVMANKSTPLLSIGQLD FT IALRYGSRIATVTVVICPEITGMLISWIDCIVLDILHEQYPQPISRSQPHV FT SQLTVDHRAPTATPPICDFLRGVYIPNDPSEEQCAEIKSAIANHFTDVFDQ FT SEELRCMEGPEMVIQLKADAIPYYVNGARPIAFADRPEVKQTLDDMETAGI FT IVPVTEATDWAAPLVVLRKPNGKLRICIDHTRLNRHVVRPTHPTRTPRDAV FT AEIDSEAMFYSSFDATNGYFQIPLHAASQHLTTFMTPWGRYKFLRASMGLC FT SSGDEYNRRADAAFGTLQNTVRVVDDILRFDRSFPSHVEGVCAILQAARAA FT KIALNADKFKFAQKKMVWAGYEIQQGGVTIDPSKLQAIAKFPRPTNITELR FT SFLGLVEQLAGFSTEVAAAKGPLRPLLSTRNAYEWTEDNERSFEAVKTALL FT SPPILAHFDPGRETVIQVDASRTKGMGYALLQKHDDHWKLIDANSRWCTPT FT ESRYAIVELELAAAEWAIRKSKLYLLGLPSFTLMIDHQALVSILDNYTLDA FT VENPKLQRLKERLSPYIFKTVWRKGKEHAIPDALSRSPVADPTSDDEASAG FT DLISSAHQHLIRRIRAISGGTGETDGDIDDTPMMDHLRDLTLEEIRETAAA FT YAEYADLITAIESGFPKSRECTPAHIRQFWNIRQNLSVDQGIILFGCRILI FT PRAARRSILQKLHAAHQGIIRTKRRAQQTVFWPGITNEIVTIIESCQTCQE FT RLSSQPQEPLMRDPLPTRVFEDVSTDLFQSGQLHVLVYVDRLSGWPVIHRW FT RHDPSAREVVQAVVENFVDLGVPVRLRSDGGPQFDSGAFQSALKRWGVRWS FT NSTPYYPQSNGHAEAAVSAMKELVEKISPSGDLSSKEFLQGVLEFRNTPRA FT NGVSPAEMVFGHQTRSIIPAHRTAFAEKWKAVMDARDRQAEIDAAVKTRYD FT ASTKPLSRLPLGAQVRIQDPKSKKWSHVGVVVGLGRYRDYRIKFASGSVLW FT RNRRFLRPIFSKEDGEIGLDNVVNSGAGFSDGDILMGGSSDVSKPEDGTHH FT HFTTTPRSRTTEVSPGKAQIRPQFAVATASGRRKSHLIVRH" XX SQ Sequence 5548 BP; 1326 A; 1607 C; 1389 G; 1226 T; 0 other; tggcgcagtt gtctctaaat actgaaccta cggtactgta attgatattt ttatatgata 60 tttttcatat cctgtgacgt gtcacattcg gcgttgctac atctcggcct acgagcgctg 120 aaaacatcgt tgttgactgt ggttctcgtg gcaggtcggc cattttcttg gctcgttaac 180 cacgccgttc ctctccttct cgcggtttta cgacgcatct gtgtctaaaa ctccccacca 240 ttgatgtaca ctgccacact cagttgaact tggcaattca cgatcgcatc cagtgccgtt 300 gggacacgat cttaacttct tagcttttta tcccctcgtc gcgtagtcac cacactcccc 360 atttccggta tcgcccatcg cgtgggtgag cgccattgtt cgggtttatc cgcgcggcag 420 cgccatctgt gttgggggac gtgatcgtga caaatacgcc acggcaacca tcttgacttt 480 gttctcccgt cgcggtgtcg tggtttcgtg gtcatcgcgc ttcccacttg cagcttcggc 540 caacgcgtgg tttggcggcc gttgtttcag ttcatacgtg tgtcagcgcc atctgtgttc 600 tctccagttg tgtgattcat gtcatcgagc cccaaatact ttttacgttc atcaaaacgt 660 cgcaactcag tacgacacga tccatcttcc accccagttc gaaccggacc aacccctacc 720 atgacatcgg tggcggatgc gttggacgcg gcaaacgccg cgtcggcaac cgccacggcg 780 gctgctgcta ggatcgacgc tatggaacaa tccctcacca atcagacggt tcagttaacg 840 acgatcaccc agcaactggc tgcgttggtg gcggccggca ttggtggcgg tggcggcggc 900 ggcggcggtg gcggtggcgg tggcggtggc ggtggcagac tcacaccccc ggcagcgcca 960 ttaatacctc aacggcgccg tttggaccca tctggtatgg acaaactaca tggcgatgtc 1020 tccatcccat tactacgttc gtggcggaac agatggaacg attttgccga cctcaatcaa 1080 taagccacct acccggcaaa tgagcaaatg gcggcgttcc gtatggccct agacccgtcc 1140 atgcagcagg ttgtggaggt agcactcggg atcctcccca cacacccaac taccccggac 1200 caggacctag accaaattct ggcgtacgta cgcgcgaagc ggaacatcgc actggatcgt 1260 gtagccttcg aagaacggcg tcagggcccg tcagagtcat tcgacgattt ttatatttct 1320 ctcaaacgtc tggcggaggc cgcggacctt tgcggaacat gtgcggacgc ccgcatggcg 1380 accagaatca tggcgggaat tcgcgatgcg gacactaaga aaaaactgct ggctattagc 1440 ccgtttccca cgacccaggc catcgtcaat atttgccgca gtgaggaatc agcgaaatca 1500 aacgaacgcg ctctgagtgg ccaatccggc atttcgttca tcaaacaccg acaaaattct 1560 cgcccggaca gcgggccatc ttgtggcgca tgcggtcgaa ccgcccacac caacggaaat 1620 ccttgcccgg caatcgggaa gacgtgcaat tcatgtgggg gcccagacca cttttcacca 1680 cggtgcccga aacgggacaa atttaagccc agcaactttg gcggcggcgg cagtggcggc 1740 aatagctcgt ttggatctgg cggaaaaaga aaatggcgcg catcaccatt ggcagcgtgc 1800 aatcgcgttt ccgagacaga cgaactccga caatctcagt agcaatcctg aacgctaacc 1860 atcaaatcat cgccaccatc gacgatgtca cacccgatcc cggtgcggaa gttacggtcg 1920 gcggctccga cgtcttgttg gcccttggct tgacagagtc ggatttatct tcttcgtcgt 1980 tcgacttggt aatggccaat aaatctacac ccctcttatc tattggtcaa ctggatattg 2040 ccttgcgata tggctcccgg atcgccaccg taacggtggt catatgtccc gaaatcacgg 2100 gcatgttgat cagctggatc gactgtatcg tcctcgacat attgcacgag caatacccgc 2160 aacctatttc ccgttcccaa ccgcacgtat cgcagctcac tgtggatcat cgcgcgccaa 2220 cggcaacgcc acctatatgt gattttctcc gcggtgtgta catcccaaat gaccccagtg 2280 aggaacagtg cgccgaaatc aaatcggcca tcgctaacca tttcaccgat gtgttcgatc 2340 aatcggaaga gttacgctgt atggagggcc cagagatggt cattcagctg aaggcggacg 2400 ccatcccgta ctacgttaat ggcgcccgtc caatcgcatt tgcagaccgc cccgaagtga 2460 aacagaccct ggatgacatg gagactgccg gcattatagt acccgtcacg gaagcaacgg 2520 actgggcggc accgttggtg gtgctccgaa aaccgaatgg taagctccgt atctgtatag 2580 accacaccag actcaataga catgttgtcc gtcctacgca cccaacccgc acgccgaggg 2640 atgcagtggc ggaaatcgac agtgaagcta tgttctacag cagctttgat gccaccaatg 2700 gatactttca gatccccctg cacgcagcaa gccagcatct caccacgttt atgacgccct 2760 gggggcgtta taagttcctg cgtgcatcca tgggcctctg ttcatcgggc gacgaatata 2820 atcggcgggc agacgctgct ttcggcacac ttcaaaacac ggtcagagtg gtggacgaca 2880 tcctacgttt cgaccggtca ttcccgtcgc atgtcgaggg cgtatgcgcg attttacagg 2940 ccgcgcgagc agcaaaaatc gcactcaatg cggacaaatt taagttcgcg caaaagaaga 3000 tggtgtgggc cggttatgaa atccagcagg gaggcgtaac aatagacccc agcaaattac 3060 aggccatcgc caaatttccc cgaccgacca atatcacgga gctccggtcc tttctagggt 3120 tggttgagca gcttgcgggt ttttcaacgg aagtcgcggc tgcaaaagga cccctccgcc 3180 ctcttttgag cactcgaaac gcgtacgaat ggacagaaga caatgaacgt tcgttcgagg 3240 cggtgaaaac agcgcttttg tcacccccca tactggccca ttttgatccc ggccgggaaa 3300 ccgtcattca agtcgacgcg tcccgaacca aaggtatggg atacgcctta ctgcaaaaac 3360 acgacgatca ttggaaactg atcgacgcca actcgaggtg gtgcaccccc acggaatcac 3420 gctatgctat cgtagagctg gagctagctg cggcagagtg ggctatccgc aaaagcaaac 3480 tatatctctt aggtttgccg tcgttcactc ttatgatcga ccaccaggcg ctcgtgtcca 3540 tactcgacaa ttatacgctg gatgccgttg aaaatccaaa acttcaaaga ttgaaggagc 3600 gcttgtctcc ctacatcttc aaaactgtat ggagaaaagg caaggagcac gccattccgg 3660 acgcgctttc ccgatcaccg gtggcggacc ccacatcgga cgacgaggca tccgcaggag 3720 atttaatatc atccgcacac caacacttga ttcggcgaat cagagccatc agcgggggaa 3780 cgggcgaaac ggacggcgac atcgacgaca caccgatgat ggaccaccta cgagacctga 3840 cactcgaaga aattcgcgaa acggcagcag cctacgccga atacgcggac ctgatcacgg 3900 ccattgagtc tggtttccca aaaagccggg aatgtacccc ggcccacatc cgtcaatttt 3960 ggaatattcg ccaaaatcta tctgtggacc agggcatcat attgttcggc tgccggattc 4020 ttatacctcg agcggcacga cgatcaatcc tgcaaaagct acacgccgcc catcagggca 4080 taatccgtac aaaaagacgt gcccaacaga ccgtgttctg gcccggaatc acgaacgaga 4140 tcgtgaccat cattgagagc tgccaaactt gccaagaacg tctatccagc cagccgcaag 4200 aacccctgat gagagacccc ctcccgacac gcgtgtttga agacgtgtcc acggatctat 4260 ttcaatcggg ccagctccac gttttagtgt acgtggacag attgtcgggc tggccagtca 4320 tacaccgctg gagacatgat ccgtcagcac gcgaggtcgt tcaggccgtc gtcgaaaatt 4380 tcgtggatct gggcgtccct gttcgcctcc gatccgacgg tggaccgcaa ttcgactcgg 4440 gagctttcca atccgcgctc aaaagatggg gcgttcggtg gagtaattcc accccatatt 4500 acccacaaag caacggccat gctgaggccg cggtcagcgc aatgaaagaa ttggtggaaa 4560 aaatctcccc atccggcgac ctatcttcca aggagtttct tcaaggtgtg ctggagtttc 4620 gtaacacccc acgagcaaac ggagtatcgc cggccgagat ggtttttggt catcaaaccc 4680 gatcgattat cccagcccat cgaacggcat ttgccgaaaa atggaaagcc gtaatggacg 4740 cccgggatcg acaagccgaa atcgacgccg cagtaaaaac ccgctacgac gccagcacca 4800 agcccctatc acggctcccg cttggggccc aggtacgcat acaggacccg aaatccaaaa 4860 aatggagcca tgtaggcgta gtagtgggac ttggccgtta tcgcgactac cgcattaaat 4920 tcgctagcgg gagtgttcta tggagaaatc gtcgatttct ccggcccatc tttagcaagg 4980 aggacggtga gatcggtctc gacaacgtcg tcaacagtgg tgccggcttc tctgatggcg 5040 acattctcat gggaggtagc agcgacgtat ccaaaccaga ggatggcaca catcaccact 5100 tcacaacgac accccgaagc agaacgacgg aagtgagtcc aggaaaagcc caaatccgcc 5160 cccagttcgc cgtagcaacc gcatccggaa gaagaaaatc acatttgatt gttagacact 5220 aaaggattgt atccaatgta tcccgtaggc aacacccgcc agttctattt accccctcct 5280 tgtccgtctt aaccatcaat gtctatcgca ctcattaatt cacctttcgc tttacatcgt 5340 tttttttatt ccccacgctg tgtcgattcc atcgtcaaat atcaccaagt ttccaattac 5400 ctatttagct gtgccattgc aacacactta tgtcgtcaga cgttagtcat cctagttcac 5460 aatcatcccc catatgtcat ctatttgttc gtgtttgtag ccgttccata tatgtttgtt 5520 tatgcgtatg taacggctcg ggaagagt 5548 // ID hAT-15_SM repbase; DNA; INV; 2264 BP. XX AC . XX DT 01-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-15_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2264 RA Jurka J., Bao W. and Tempel S.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 65-65 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 250..1881 FT /product="hAT-15_SM_1p" FT /translation="MSTAKKSRLYNEEWLFKYGVIEADNKSMCLICKESIT FT VRSYNVKRHFETMHQNLISLKEAEKKEYFSQQLKQYKSQTSAFKSFLLPKN FT NITCASFQLSHCIARHGKNLSEGEFFKTALLECSESLFADFEEKDCIIKRI FT KELQASRNTVKDRILALDEDTSTQLICDLNSTEMYSISLDESTDITGQAYL FT AVIIRFKSGNIIKEELIKLMNLTGKTTGAEIMKEFTLNMTEMNIDLKKIVS FT VTTDGAPSMVGKNIGFINLLKPKIEHPLIEFHCIIHQEVLCAKRGLKPFDN FT VISIVTSVVNYISTQALNKREFSKLLEEVDFQYSGLLMYNNIRWLSYGQVL FT NRFVNLLDEIKIFLHEKDNIYPELANEEWLNNLMFLCDLTQHFNELNKKLQ FT GRGQIALTMYENIKSFMIKIDIFINDLNSKTMKYFPCLKKYFETSVNLKTN FT AKAIKQYLSCLEEIKIGFSERFNQFKQLETTIRFVISPHIIEYKSLELNFF FT NWLNIDNLEMELVEFQNSFIWRNKFIELNVELEKKAHEIENKSIEEKI" XX SQ Sequence 2264 BP; 892 A; 290 C; 334 G; 746 T; 2 other; ttgatatttt tctgaaaata agtcttttta tattactaaa ttttttatta cttatataaa 60 tacaggaaac tcttataata taagacttat attaataaat tttctagcat ataattgttt 120 actttctttt aataattgtt gtaaatgtat cattttaaaa atttttttcc gatttaacat 180 taactcaaaa aagcaggtga ataattttaa actrtaaatt ataaatatat taatccttaa 240 gatatcacca tgtcaacagc taagaaatca agattataca atgaagaatg gcttttcaag 300 tatggagtta ttgaagcaga caataaaagc atgtgtctca tatgcaaaga aagtattact 360 gttcgtagtt acaacgtaaa gcgacatttt gaaacgatgc atcaaaatct catttcattg 420 aaagaagcag agaagaaaga atatttttca caacaattaa agcaatataa atcccaaaca 480 agtgctttta aatctttttt attgcctaaa aacaacatca catgtgctag ctttcaactg 540 tcacattgta ttgcacgaca tggaaaaaat ctttctgaag gagagttttt taaaacagca 600 ttattggaat gtagcgaatc tttatttgct gattttgagg aaaaagattg cattattaaa 660 agaataaaag aattacaagc cagtcgaaac acagttaaag atcgtatatt agctttggac 720 gaagatacct cgactcaatt aatttgtgat ttaaacagca ccgaaatgta ctcaatttct 780 ttggacgaga gtacagatat aacaggtcaa gcatatcttg cagttattat tagatttaaa 840 tctggaaata ttataaagga agaattgatc aaattgatga atttgacagg aaaaactacc 900 ggtgctgaaa ttatgaagga gtttacgtta aacatgactg aaatgaatat tgatttaaag 960 aagattgttt ctgtaacgac tgatggtgcc ccaagcatgg ttggaaaaaa cataggattt 1020 ataaacctac tgaagccaaa aattgaacat cctttaattg aatttcattg catcatccat 1080 caggaagtac tatgtgcaaa aagggggtta aaaccatttg ataatgtgat atcaattgta 1140 acaagtgttg taaattatat atctacacaa gctttaaata agcgagaatt ttcaaaatta 1200 ttagaagagg tagattttca atattcagga cttctaatgt acaataacat tcgatggctt 1260 agttatggac aagttttaaa tcgmtttgtt aatttactcg atgaaataaa aatcttttta 1320 catgaaaaag ataatatata tccagaattg gcaaatgagg aatggttaaa taacttaatg 1380 ttcttgtgtg atcttacaca acattttaat gaactaaata agaaacttca aggacgtggt 1440 caaattgcat taacaatgta tgaaaatata aaatcattta tgataaaaat cgatatattc 1500 attaatgatt taaattcgaa aacaatgaaa tattttccat gtttaaaaaa atattttgaa 1560 acttcagtta atttaaaaac caatgctaaa gcaattaaac agtatttatc gtgtttagaa 1620 gaaataaaga taggtttttc agaaaggttt aatcaattta aacaactgga aacaactatt 1680 agatttgtta tatctccaca cataattgaa tataaatctt tagaattgaa tttctttaat 1740 tggttaaata tcgataattt agaaatggaa ctagtagagt tccaaaacag ttttatatgg 1800 agaaataaat ttatagaatt aaacgtcgaa ttagagaaaa aagcacatga aattgagaat 1860 aaatcaatag aggaaaaaat ataatattac aggaatggaa ctccttgcca agctcatttg 1920 agggaatgaa aaggtttgca actgcgcttt taactatgtt cggatcaaca tatgcatgcg 1980 agcagttatt ttcatcattg aattttatta aatcaggact acgaaatcga cttagtaaag 2040 aaatgactgc tgcatgcgta agaatcaaaa ccactaacta tgcacctcgc atagaaaaat 2100 tgtcgtcaaa taaacaacaa cagatttccc attaaatgat gtattcttcc tgaaattttg 2160 cttgtaaatt ttattgttaa taaaacttgt aagcctttat ttattattgt tttaattact 2220 caacatatgt gacccaccac atggatttaa taaaaaaata tcaa 2264 // ID Copia-7_DPu-I repbase; DNA; INV; 5240 BP. XX AC scaffold_25; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_DPu_; KW Copia-7_DPu-LTR; Copia-7_DPu-I. XX NM Copia-7_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5240 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 677-677 (2010). XX DR Genome; scaffold_25; Positions 205833 211072. XX CC Positions [2244-2540] - Integrase core CC 'GAAAT' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 3240..5180 FT /product="Copia-7_DPu-I_2p" FT /translation="MDVDKDQPSVDQTPENNEHQTNETETPPAGANLNEIT FT IPLDDSNHDVNDENLPRKSTRVNEVPSDDPIHQRDVDCAPQQTRRSTRSPK FT YSERYIQWQQSLAKIANLSSMSAKVQDANNHSPLLEPSSYLEAISCADSKF FT WIPAIFEEYDSLVQNGTWTLCPLPPDRKAIPGKWVMTFKPGFKTTAPRYKA FT RFVIKGYSQVFGLDYTDTYAPVAKNYSLRLILSIAAAKNLEMIQLDVKTAF FT LYGILDEEIYMQQPEGFVVPGREQEVCRLNKSIYGLKQASRVWNIKFNEFL FT IKFGLKRSQADPCVYYRHLRPGETDEELTIFILYVDDGLILSNIQSALTDI FT VEFLGKEFEVRSLPADRFIGIDMNRNRSLGTIHLSQPEYVKKILERFNMSS FT CNPLAIPADPCVKLSPQMSPKNEEEKEEMANIPFMECIGSVMHLTHLTRPD FT IAYAVGQVSRYSQNPGLEHWKALKRILAYLKKTINFGLLFGGGSSELCGYC FT DSDYAGDLESRKSTSGAVFLLYNGAVSWFSRRQTCVALSTTEAEFISAAES FT TKEGIWLKRLHLELGAIESATPLRCDNQGAIALIHDPVFHQRTKHMDVRFF FT FVRDAQQEGRINISYIETESQLADIFTKALAVPRFEKLRNDLNIRELPE" XX SQ Sequence 5240 BP; 1596 A; 1249 C; 1068 G; 1327 T; 0 other; ggttatgggc ccaggtacat tgtgctgcat ccaaatttag aatggcagag catgctgtac 60 cagataattc catgagagca gtgtcccacg tacccaagtt tgatgggacg aaccatcgtg 120 aatggaatta tgagattgac ctgtgttttc agaatctgga cattgctgac gtggtgttag 180 gactagaact atgtccagat gaggtaaact ttcacagaaa cattttttat tttctttctc 240 gcagacaatc ctcattccag agcacatggt tgctcatcgt tcacatatgc atttaagttg 300 tatgctttta tcctgcaagt gtggaccaat ctcacatgtg agagcgtcta ctgcatacat 360 tgctcacgca atccttttct ttggaaagga ttatttcctt tttattcttt tactattcat 420 tgctcacgca atctaaggag aatcattcat gattgattac ctcatattca ctctaacatt 480 gctcacgcaa tcccttttta ttgggaaagg attctttcct tttatttttt tttattattc 540 attgctcacg caattgaagg agaaacattc atgattgatt atctcttatt tactcttaca 600 ttgcttacgc agatatattg gtttagaaca taggttcatt agactcccct cattgcctaa 660 gcaattgatt ttttttttat gttattactt aactaccaat gtctacctaa gataaccaac 720 agcactcatt tatttttcct tattttttta tacaggacta tgatgatgat ggagatatca 780 cgaatgagac ggagataaga caatggaaaa agaaggacgt gctagctcgc aattgcatta 840 tggcaactat tacaagggag atgaagcaaa acctgtacac tcccggtttg aatgcagccc 900 agatgtggac aaaactcaat ctgcattatc agctacacac agaagagcat ctacacctac 960 tgtggcaaaa ttactatgac ttctcataca ctgcaggtga tgacatgcgt actcacatac 1020 aaaagctatc caacactgct gacaaactga aagagcgaga tcagcccctc agcgacatcc 1080 aactagtgtc aaaagcttta gcgacactgc cagaaaccta tcgcattgta agatcagtat 1140 gggctgccgt tccagcagca gatcgaacac tcgaccactt acttcaacgt ctcctatgtg 1200 aagaaagcgt ggttaagtcc tatcaaagga cagaagcccc aactgaagaa gcatttgccg 1260 ccggtggtgg tggaagaggt cgaggaagag gtcgaggaag agaatatcta catggaacgc 1320 gaggaggcta tatcgacaaa aggtcaaagg gccagtatgg ggccgattac gactcacgtc 1380 cacgatgtgg tttttgcact aaaccaggac atgaagtgaa aaactgctac aagaagcacg 1440 ggtatccagg ccccaaaggc gacaacacac ctggtcaagc gctcatgtcg tcatctaaat 1500 ctgatccaag aagcgtgcat gccttcttcg ctgactctgg tgctacaaag cacatgtgtg 1560 atcaaaaagt tttctttgtc acattcacac caattgaacc cggaaatcgg accgtatcag 1620 gtaaatatct tgcacgtact acttattgtt ttctttaaat atgtaatgaa atgtccgttg 1680 tctttctcag gcatcggtgg taccaaacta gatgtactag gatccggtag catctcaatc 1740 tcagttaatg tgaatcagaa ggtgaatgca aggattctaa cggaagttct ctacgttccc 1800 ggccttggcg taaacctctt ctcaattgga gctgcaactg ccagcgggct aatagcagaa 1860 ttcaaagatg acaaggtata attttttttt taattacaaa taaagacata taattgaccc 1920 agctattatt atccaatctt aaggtcttat tcactagaga tgaagagata gtcctcactg 1980 gtgaacgtaa ggagaacacg ctgtatcaac ttaatctgaa gcctgaaatc caacagaatc 2040 agattgaagc tgctcatttg tctctcgata cagcgctcaa tgccgatctc cgatcatcaa 2100 ttaccgtttg gcacgaacgc ctgggacata ttgggtacca cacactgaag aagatgattc 2160 aacaagacct agtgattggc ttgaacatct ctggggaact cgacatccca acgacactat 2220 gcgcaggttg tgaattagga aagttcacta gatcaccact caaaatcggc aggaatagag 2280 ccagtagaat tggtgagctc atccactcag atgtatgggg accaatcgca acgccaagcc 2340 ttggaggcgc gcgctactat gtaacattta aggacgactt cagcggatac ctaaccgtgt 2400 acttcatgaa aaagaagtca gaagtccctt cccttcttcg tctgtacgca gcgatgttat 2460 taaacgaaac cggcaactac atcctcacta ttcgatcaga caacggtcga gccgagtaca 2520 tcaataatga gaatagaggt tagttcaact caactaataa aatatgtata tgtatcactg 2580 aaagtttctt tttgaacgtt atagcctggt ttgaaaaatg tggcatccgc cacgagacaa 2640 gtgctcccca cacgcctgag cagaacggcg tagccgagcg aaccaatcga acgctactaa 2700 attccgtacg ctgcatgctt atctcaagtg gccttccacc tacactatgg gctgaggccg 2760 tcacctatac gacatacatt cgcaacagag tcctctctag aacaatcaaa attactccat 2820 tcgagagctg gaatggtaga aaaccgaaca tgtcggactt acgcattttt ggaaccagag 2880 cttttgtacg aattcccgac acagcaacaa aactgggagc tagaagccaa gaaggtgtat 2940 tcgttggacg atgcaatact cagaacgcat tccgaatcta catcccgaaa acaaggaaaa 3000 ttgtagtcag caaagacgtg aagattgatg aacaaattct atatcgtgac acgaacaatt 3060 catcgccact aacggtaaaa ttattatttt ttttttttct ctctcttttg gagatggatc 3120 acaagttttt ttttttctcg cagtcaatat atatttatat tttttttttt taaaaaaaaa 3180 aaaaaaaaaa aaaaaaaaaa aaggatgaac tagatatcgt tgataccgac gatcacgcaa 3240 tggacgtgga taaagatcaa ccatccgttg atcaaacacc ggaaaacaac gagcatcaaa 3300 cgaatgagac tgaaactccc cctgctggtg caaatctcaa cgaaatcacc attcctctcg 3360 acgactcaaa tcacgatgtt aatgacgaaa atcttcccag gaaatctact cgcgtcaatg 3420 aagtcccttc tgatgaccca atacaccaaa gggacgttga ttgtgccccc caacaaacgc 3480 gcagatcaac tcgttcaccg aaatacagcg agaggtacat tcaatggcaa cagtctctag 3540 ccaaaatagc aaacctctct agcatgtcag ctaaagtcca agacgcgaac aatcactccc 3600 cactactgga gccctcgagt tatctcgaag ccatatcctg tgctgactcc aagttttgga 3660 tacccgccat atttgaagag tacgactcac tcgtacaaaa tggcacctgg acactctgcc 3720 cacttccacc agaccggaag gctatcccag gcaaatgggt gatgacattc aaacctggat 3780 tcaagaccac agctcctaga tataaagccc gattcgtcat caaagggtac tcgcaagtgt 3840 tcggactgga ctacaccgac acctacgcac ccgtagccaa gaactattca ctacgtctta 3900 tcctgtccat cgccgcagca aaaaacctag agatgatcca acttgatgtg aaaacagcgt 3960 tcttgtatgg catactagac gaagaaatct acatgcagca gcccgaaggt ttcgtagtcc 4020 caggtaggga acaagaagtt tgccgcctta ataaaagcat ttatggtctg aaacaagctt 4080 cgagggtctg gaacatcaag ttcaacgaat tcctcatcaa atttggattg aaaagaagtc 4140 aagcggaccc ctgcgtatat taccgtcacc tccgtccggg ggagaccgat gaagaactta 4200 ccatattcat cctatacgtt gatgacggac tcatcttaag caacatccaa tcagccctca 4260 cagacatagt ggaatttctg ggcaaagagt ttgaagtacg atccctcccc gcagatcgct 4320 tcataggaat cgacatgaat cgcaaccgat ctctaggaac cattcatctc tcccaaccgg 4380 agtatgtcaa gaaaatccta gaacgattca acatgagcag ctgtaaccct cttgcaatcc 4440 ctgcggaccc ttgtgtcaaa ctatcgccac aaatgagtcc aaaaaacgaa gaggagaagg 4500 aagaaatggc caacattccc ttcatggaat gcattggatc agtaatgcat ctcacacatc 4560 tcacaaggcc tgatatcgct tacgcggttg gccaggtctc cagatactca cagaatcctg 4620 gcctggaaca ctggaaggcc ttgaagcgca ttctagctta cctcaaaaag acaatcaact 4680 tcggactact attcggcggt gggagcagcg agttatgtgg ttactgtgac tcggactatg 4740 ctggcgacct ggaaagtaga aaatctacgt caggagccgt cttccttctc tacaatggcg 4800 ccgtctcatg gttcagtcga cgccaaacgt gcgtagccct ttctacaaca gaagcagaat 4860 tcatctcagc ggcggaaagt acaaaagaag gaatctggct caaacgcctc cacctagaac 4920 taggagccat cgaatcagca acacccttgc ggtgtgacaa tcaaggtgca atcgcactaa 4980 tccacgatcc tgtctttcac cagcgcacga aacacatgga tgtgcgtttc ttcttcgtac 5040 gagatgctca gcaggaagga cggatcaaca tctcctacat cgagactgaa tcccaacttg 5100 cggatatttt cacaaaggcc ctcgctgtcc caagattcga aaaattgcgg aacgatctca 5160 acattcgtga gctaccagag tagtgcactc gagggacgca gttctttgcc ttagttgctg 5220 cacgcttgac ttgagggacg 5240 // ID Crack-6_BF repbase; DNA; INV; 3322 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-6_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-6_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3322 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3322 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 811-811 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 20..3016 FT /product="Crack-6_BF_2p" FT /translation="MDHQSYCNLANSHEYWYCNQCLLPCFSDSFFNSSHDS FT TMDSSGEDNVDVCPSSCHMVPPTKGLTMGHLNICSLYNKLDQLRVFMTSNN FT IDVMTLSETHLDDTISDAELHIAGYYLYRLDRNRSGGGVAIYVSEAFTHSR FT RSDLQQPGLEALFCQINLPCTKPIIAGSVYRPPSAPVEFYTLLDNSLETLS FT LTSPNSELYLLGDFNVDLTQPRKPPSKALTNLTEKYQLRQLINEPTRVHQY FT SSTLIDHIYCSDMHYVNTSGVVQCTISDHYAVFCTRKAARRHSGVKYVSSR FT KFTKFDEQNYLADLSEIDWSPLYHADSVDDAWTFFKSSFTTVSDLHAPFIT FT KRTRDKQPEWLSPNIRKQMLVRNDLKAKARKTGSDVDWECYRAKRNMVNKM FT VKQAKAKYCQNKLEENISDSKKLWGTIGEILPRKSSVVTKTLTWGGQQLVG FT LTNIVKGFNEFFATAGCRLAQCFGAYTHTPSCPDLYKDLESTFKFTPLNDQ FT DVLCHLKALPISKAPGLDNIHPRLLKAAAPIITIPLTYVYNLSLSTGTIPT FT EWKTARVTPIHKGGDQADPNNFRPISVLPILMKIFEREVHKQFLAYLHAHN FT ILSPSQSGFRPGHSTATTLLDVSDYILHNIDNGNLVGTVFLDLKKAFDTVD FT HNLLLTKLTWIGLRGIELAWFQSYLSNRHQRVSLSGHTSDSVPINIGVPQG FT SILGPLLFNIFINDLPESITNCKVSLYADDTAIFCASNNPNHIESMLNAEL FT SQLSTWFHNNRLTLNISKTKWMLMGSSKRINNCHDLEIKIDDICLEKVTSY FT KYLGVILDSCLTYSNHVDMMHSKASQRIGLLRRLRPYLGTNVANMLYKAIV FT LPIVEYCDVIWDNSSSTLKQRLQILQNRAARIILRRDPRANIEELHHTLQW FT RYLQDRRTQHLCIMVYKCLHGLAPSYLINTFNYNNQIHSYNTRQAQQLHRP FT KYTSRTGQRTFAFRAVSIYNSLKPTTTSQTTLNSFKRALQSDL*" XX SQ Sequence 3322 BP; 999 A; 727 C; 632 G; 964 T; 0 other; tcacatcaaa tgtatagaca tggatcatca atcctactgt aacttagcca acagccatga 60 gtattggtat tgtaatcagt gtttattgcc atgtttcagt gattcattct ttaattcatc 120 acatgatagc acaatggaca gctctggtga ggacaatgtg gatgtttgtc ccagcagttg 180 tcacatggta cctcccacta agggattgac tatggggcat ctaaatattt gtagcctgta 240 caataagtta gaccagctga gggtatttat gacttcaaac aacattgatg tgatgacact 300 tagtgagact cacctagatg ataccatctc tgacgctgag ttacatatag cagggtacta 360 cttgtacaga cttgaccgta acaggtccgg aggaggtgta gcaatctatg tttcggaggc 420 tttcactcac tcgagaagat ctgacctaca acaaccagga cttgaagccc tgttttgtca 480 gattaattta ccctgcacca aacctatcat tgctggttct gtatataggc ccccctccgc 540 tcctgtggaa ttctacactt tacttgacaa ttccctggag accttgagcc tgacttctcc 600 taattcggaa ctatatttac tgggagattt taacgtggat ttaacccaac ctagaaaacc 660 tccctccaaa gccctgacaa atcttactga aaagtatcaa ctccggcagt tgatcaacga 720 accgacaaga gtgcatcagt actcttccac cctcattgat catatctatt gtagtgatat 780 gcattatgtc aacacatcag gagtggtgca gtgtaccatc tctgaccact acgctgtgtt 840 ttgcacacga aaggccgctc gacggcactc aggcgtaaag tatgtgtcgt ctcgcaaatt 900 tacaaagttt gatgaacaaa actatctggc cgaccttagt gagatcgatt ggagtccatt 960 gtaccatgct gatagtgtgg atgatgcttg gacgttcttt aaatcatcat ttaccacagt 1020 ttctgaccta catgctccct ttataactaa aagaacaagg gataaacaac ctgaatggct 1080 ctcgcctaat atcaggaaac agatgctagt cagaaatgat ttgaaagcca aggcacgcaa 1140 aactggatct gatgtagatt gggagtgcta tagggctaaa cgaaacatgg taaataaaat 1200 ggttaagcaa gcaaaagcca aatactgtca gaataagctg gaggaaaata tctcggactc 1260 caagaaactc tggggaacta ttggggagat cttacctcgc aagagcagtg tagtcactaa 1320 gactctaacc tggggtggac aacaacttgt tggcctgacc aacatagtaa agggtttcaa 1380 tgagtttttt gctacagctg ggtgtaggct agcgcagtgc tttggcgcat atacacatac 1440 accctcatgt ccagatctct ataaagacct ggaaagcact tttaagttca ccccactaaa 1500 cgaccaggat gtattgtgcc acctaaaagc cttgccaatt agtaaggctc ctggcttgga 1560 caatattcat cctagacttc tcaaagctgc agcccctatt ataactattc cgttgaccta 1620 tgtctacaac ctctctctca gtactggaac tataccaact gaatggaaga cagccagggt 1680 gacccccatc cacaaaggtg gtgatcaggc tgaccctaac aactttagac caatctctgt 1740 gctaccgata ctgatgaaaa tatttgagag ggaagttcat aaacaattcc tagcatatct 1800 ccatgcacat aatattcttt ctcccagtca gtcagggttt agaccaggcc attcaactgc 1860 cacaactctg ttagacgtta gtgattatat actgcataat attgacaacg gtaatctagt 1920 aggcacagta tttcttgacc tgaaaaaggc tttcgatact gttgaccaca atctcctttt 1980 gacgaaacta acatggattg gtctacgtgg tattgaactg gcttggtttc aaagctacct 2040 ttcaaataga catcaaagag tctcattaag tggtcacaca tccgatagtg tacctattaa 2100 tattggagtg ccccaagggt ccatcttggg accccttctc ttcaatattt ttattaacga 2160 cttaccagag tcaatcacca actgtaaagt ttccctctat gcggacgaca cagcaatttt 2220 ctgtgctagt aacaacccaa accatattga atccatgtta aatgctgagc tttctcaact 2280 gtctacttgg tttcacaata atcggttaac actcaatatt agtaagacca agtggatgtt 2340 gatgggatct tccaaaagaa taaacaattg tcatgactta gaaatcaaaa ttgatgatat 2400 ttgtctggag aaagtaacat catacaaata tttgggtgta atccttgatt catgtcttac 2460 ctatagtaac catgtagata tgatgcatag taaagcatct cagcgcattg gtttgcttag 2520 acgcctacga ccttatcttg gtacaaatgt tgctaacatg ctgtataagg ccatcgtttt 2580 accaatagta gaatactgcg atgtgatctg ggacaacagc agctctacac tcaaacaacg 2640 cctacagatt cttcagaaca gggccgccag gatcatactg aggcgggatc ccagagctaa 2700 catcgaagag ctgcatcaca cactgcagtg gagatatcta caggaccgga gaactcaaca 2760 cttgtgcatt atggtgtaca aatgtttgca tggactagca cctagctatc taatcaatac 2820 ttttaactat aacaatcaaa tacacagtta taacaccagg caggctcaac agttacacag 2880 accaaagtac acatctcgaa caggccaacg aacgtttgct ttcagagcgg tttctatcta 2940 caattccctt aaaccaacaa caacttcaca aaccactctg aattcattca agcgagcttt 3000 acaatctgac ctctgacccc tctgaccaac gacctcgaat ctggtttgaa aattagcttg 3060 atttcgttta tttatgtact gttatgttct attcatatga ttattgttgt gatgagtttt 3120 gtaattaatg actttgagtt gttattattt gggctaatcg catttaagtt attatccata 3180 ttttgattta gtcacgtttt atgttatatg attattgtta tctttatgtt tatgtccatt 3240 atgttcattt gggctccctt ggaaatcagt tttatcaaaa ctgtagggac taccctgaaa 3300 gattaataaa taaataaata aa 3322 // ID Mariner-28_SM repbase; DNA; INV; 1660 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-28_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1660 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1877-1877 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 271..1389 FT /product="Mariner-28_SM_1p" FT /translation="MHRKSYSIKEKLKVLELLRQNNNNVGKTARDSGITNK FT MLRDWKKQEDKICEVGSKSTTFRMGSGRTAFFPELEEKLKIWLTTERKTNK FT RLVTYNTLREQAIKIASSLSINNFIGSNRWIENFMKRNDFSVRKITSVGQE FT DNRPVDETREEIHKYFEIFKLKSTNICPINIYNMDETPMYVDMMNSRTISF FT KGEKNTEVLTTGHQKTRFTVVLTICLAGKILKSMVVLKGLKNPPKCQTPTN FT IKVFTSKSGTMDKNIMKTWINTCLKDTGPFNSSERSVLLMDNYGSHKDTGV FT LAFLEKNNFEPVFIPPRTTSYLQPLDVLINSIFKAKMKKKWEEWFLNGEKF FT IPRVDIVKGLHGKRSSRLFLKVLWKFKKQM" XX SQ Sequence 1660 BP; 614 A; 210 C; 288 G; 548 T; 0 other; gaaaataaga cccccgaaaa taggaccatg gtcttatttt cgatttttta ggaaattatt 60 tctatgtatt ttttatttta aacattttta ttaaaattac cttctttgtt ttcgtgcaga 120 gaaaatttta ttgttaatgg aaattaagtt atttaaataa aattcattaa aattgattta 180 tatttgtatt tatttatatt ttgcgggtat atcatttttt taaaagctat ttaagactta 240 ttaattttgt agaaattata tttaaatccc atgcatagaa agtcatatag cataaaagaa 300 aagctcaagg tacttgaact tttaagacaa aacaataata atgttgggaa aacagcaaga 360 gattctggta taacaaataa gatgctgaga gactggaaaa aacaagagga taagatatgt 420 gaagtaggaa gcaaatcaac aaccttcaga atgggatctg ggagaacagc attttttcct 480 gagctagaag aaaagctcaa gatctggtta actacagagc gtaagactaa caaacgccta 540 gttacttaca atactctacg tgaacaagcg attaaaatag cctcttctct ctcgataaat 600 aattttattg gaagtaacag atggattgag aattttatga agagaaacga tttctctgtt 660 aggaaaatta ctagcgttgg gcaagaagat aatcgtccag ttgatgaaac aagagaagaa 720 attcataagt atttcgagat tttcaaacta aaatccacaa atatatgtcc gataaatata 780 tataatatgg atgaaacacc gatgtatgtg gacatgatga actctagaac aatatcgttt 840 aaaggtgaga aaaatacaga agtattaacg acagggcatc agaaaacacg atttactgtt 900 gtattaacaa tctgtcttgc tggaaaaatt cttaaatcga tggtagtatt gaaaggatta 960 aaaaatccgc ctaaatgcca aactccaaca aatataaagg tctttacatc aaagtcagga 1020 actatggata aaaatattat gaagacgtgg ataaatacat gcttaaaaga tacagggcct 1080 tttaatagtt ctgaaagatc ggtattgctt atggataact atgggtctca taaagatacg 1140 ggtgttttag cgtttcttga gaaaaataat ttcgagccgg tttttattcc tcctcgaact 1200 acttcttatc ttcagccgct tgacgtattg ataaattcaa tctttaaagc taagatgaaa 1260 aagaaatggg aagaatggtt cttaaatggg gaaaaattta taccaagggt ggatatcgtc 1320 aaaggcctgc atgggaaaag atcttctcgt ttatttctga aagtattatg gaaattcaag 1380 aaacagatgt agttaaatcc tttcgactct ctggtatatc tcaaaatggt ataaatttga 1440 atgaagaact tctaaatgcc agactaaaag atatcttata ctcttcaaat gtggaaaact 1500 tggatttaat tgatgacgat ggcgatattg attctatata aattttagaa taaacaaatt 1560 tatattaatt tattttaatt ggttttattt ttaattttat taaatttttg tattttttga 1620 aatgcaaaga caatggtctt attttggggg tcttattttc 1660 // ID Gypsy-8_TCa-I repbase; DNA; INV; 4664 BP. XX AC chrUn_2; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_TCa_; KW Gypsy-8_TCa-LTR; Gypsy-8_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4664 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_2; Positions 25842 30505. XX CC 'AACG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1364..2173,2177..4618) FT /product="Gypsy-8_TCa-I_1p" FT /translation="MGTVKITLKIDAVQSEIIVDVVSEGVQNVPVLVGRNF FT TELTNVGILKDDVSLRFFTLPPMETRSVQAAKVKLFVAKETIIPPMHYGHM FT PVRSDIQNGEIFVDYSLRLEPEHEYEIPSVVLQLRENACVKLPVINFSKNE FT IRFKKNTTIARGRICREEKYVPTILRISKTPQPELPLDQLDVGNVRPQTKQ FT HLIDMLSEYRDCFALTVNELGCAKNVEMEIRLEEEKPFTFRPYRLAPSEVE FT KVNEMITELIQAEIIQESESNYASPIVLVKKNGEKRLCIDYRKLNSMTVKD FT SHPLPRISEQIDRLQGAKYFSSLDLKSGYYQIPISENSRHYTSFVTPSGQY FT EYLRMPFGLTNAPRVFQRFMNNLLRPVSKIAAVYLDDVLLHSNTEGQALCD FT LREVLDVLRAEGLTLNFQKCAFLKETVHFLGFEVSDGIIRPGLDKIQAVKN FT FSPPKNVKQIRQFIGLTGYFRHFVKNYALIARPLTNLTRKGVNWKWDTEEE FT LAFERLKEILTSRPVLSIYDPTAVTELHTDASSLGVAGILLQYQTDGRLHP FT IAYYSRQTNEHERHYHSFELETLAVVESVKKFRIYLLDLEFTIVTDCNELK FT ATSNKSQLIPRIARWWLQLLEFRFKVKYRAGTQMSHVDALSRNPNDSKTSK FT DVLALDKADWVLAGQLTDPKIKELHNILSKPPSTEYKQNVHKNYALREERV FT YRNTTKGLQWVVPKGMRQQVVRAAHDERGHFAAEKTLSRLSDCYWFPRMRD FT YVSDYIACCIPCIMKKKPSGKQEGFLHPIPKPTKPFDTIHLDHLGPFPRSK FT RGNVHIIACIDACTKFLFMRPVKSTKTVFVVNFLNELYVTYGKPRVVITDQ FT GSCFTAAKFREHCDQNQVTHVKVAVATPRANGQVERLNRSILSVLMTGTLE FT DNLWDQNVPMAQFSINNAANASTGRTPSELMMGYKPRHGTDSYLRDEVQKI FT PTIVDDLIKLRLEASEKVAQVQKRQKELFDRKRKPPREYKVGDVVVVRKGE FT AATGESRKLMAPFSQPMTVKAVLPNDRYLVTDMPNSHRTRRKAIYQRVIAV FT DRMKPSTKPGGVSDESDGPNEDDQGEP" XX SQ Sequence 4664 BP; 1453 A; 967 C; 1078 G; 1166 T; 0 other; atttttcaga agtgggatag acctacacag ctcaatttaa attgcgtata aatttttcat 60 tcaataataa atgaacgata aaatatataa atttaattaa ttaaattttg ttcaaatgtg 120 aggggttgtg tgtttgtagc aacgcatgac ttatctaact agcggcttta cccatttgcg 180 aaatgattaa aacaatttat tggcgtatgt gttctttcgg ctaacagcga ttttataatg 240 gcacccacca gaggacggaa aagacaggct gaatctcgtg cagagagccc agtcccagtt 300 cgtgctggac aacccgatga gaatttggtg gcactcgtga cccctgttca aagcccagac 360 accaccggta gtgtgggtgt ggaagtactc cgacggttaa ctgacgttat gtaacagtaa 420 ctgggggtga tgcaatccct tattttgacc ccagcaatac cgaatttaac agcgaaagat 480 ggtgtaacaa agtcgatgag tgtagagctg tattccactg gaccgaagaa actacagttt 540 atttcgctat caccaagctt aagggcatgg ctgaagtttg gtatcgaagc ctaccgacaa 600 tgatgaggac ctgggatgag tggaaaattc aattgcaaac agcttttcct gcacagacag 660 attattattc gatgttgaag cgaatgatgg ccaggaaaaa gaagattgat gaagactaca 720 cgacttatta ctacagcaaa ctggcacttt taaacgagct agagataacg gggaagaaca 780 gcgtttcatg cattattggc ggtataactg acgcaatcgt aaaaactggt gcaaaagcag 840 gccaatatca aacgcccgaa tctctgctgc aatacctatg cagtgtaacg gagaatcaaa 900 cgtctacaac ccaagctggt aagggccgac atttctcaaa tccaaggttc agaccaaaag 960 catcatttgg caaaaataat gctaagtctt caagagagct gatttgtttc aaatgccaga 1020 aacccggcca tacagctaac gtctgttcgg cacggaaccg agaaatgcgc tgtacacttt 1080 gtcgttctca tgggcatgta gcaaaggatt gtcggaataa gactacgaat aaaccgagtg 1140 atcagaaaac cgtagcattg atcggtaatc cggtattggg cgacccaggc aaccgaaagt 1200 tccataagga tgttttaatg gacaacttac tcgtgcgtac atcgacttgg gaagctcgtg 1260 taccaccatt accaataagg aaatccaacg attaaaaatt accaaagtgg atactacgca 1320 gatttccact ttgcgggggt atggcaacgg gaccgtcact acaatgggta ccgtgaaaat 1380 tactttaaaa atagatgccg tgcaatccga aataatcgtt gatgttgtct cagaaggtgt 1440 tcagaacgtt cctgttctgg ttgggcgcaa ctttactgag ctgactaatg tgggcatatt 1500 gaaagacgat gtgagtctgc gtttctttac tctccctccg atggaaacgc gttcagtaca 1560 agctgcgaag gtgaaattat ttgtggctaa ggaaactata attcctccca tgcattacgg 1620 gcacatgcca gttcgtagcg acattcaaaa cggagagatt tttgttgact atagcctccg 1680 actggaacct gaacacgagt atgaaatacc aagcgtagtt ttacaactac gagaaaacgc 1740 ttgtgtaaag ctccctgtta taaatttctc caaaaacgaa attcgtttta agaagaatac 1800 gacgatagcc agaggacgaa tttgtcgtga ggaaaaatat gtgccaacca ttttgagaat 1860 ttccaaaact cctcagccag agttgccgtt ggaccagtta gatgtgggca atgtcagacc 1920 gcaaaccaaa caacatttga tagatatgct gtccgaatac cgggattgtt tcgcacttac 1980 tgtaaatgag ttgggatgcg ccaaaaacgt tgagatggaa attcgtctag aagaagaaaa 2040 accttttaca tttagaccct accggctagc acctagcgag gttgaaaaag ttaatgaaat 2100 gattactgaa ctaatacaag ctgagattat tcaggaatcg gaatcgaatt acgcttcacc 2160 cattgtgctt gtctaaaaga aaaatggaga gaaacgactt tgtatcgact accgtaagtt 2220 gaattcgatg acggtcaaag atagtcatcc tcttcctaga atcagtgaac aaatcgatcg 2280 gttgcaaggt gctaagtact tttctagtct ggatttgaaa tctgggtatt atcaaatacc 2340 tatcagcgag aattcccgcc attatacctc ttttgtgaca ccaagcggac agtacgagta 2400 cttgcgtatg ccatttggat tgacaaatgc acctcgagtt ttccaacgtt ttatgaacaa 2460 tctgcttcgc cctgtttcca aaattgctgc tgtgtatcta gatgacgtgt tattgcattc 2520 aaataccgaa ggacaagcct tatgtgattt gagagaagtg ttggatgttt taagagctga 2580 aggcttgact ttaaacttcc aaaagtgtgc ctttctcaaa gaaaccgttc actttctcgg 2640 atttgaagta agcgacggaa taatacgccc cgggcttgat aagattcagg cagtaaaaaa 2700 tttctcccca ccgaaaaatg tcaagcagat tagacagttt attggactaa ctgggtactt 2760 tcgacatttc gtcaagaatt atgcactcat tgctagacca ttaacaaact taacacggaa 2820 aggtgtgaat tggaaatggg acacggaaga ggaacttgcg tttgagaggc ttaaagaaat 2880 tttgacgtcg agaccagtgt tatctattta tgaccccaca gctgtcacag agttacacac 2940 agacgcaagt tcgttgggtg tcgctggtat tttgctacaa tatcagactg acggcagact 3000 acacccaata gcctattaca gcaggcaaac aaacgagcat gagcgacact accattcgtt 3060 cgagcttgaa actttggcag tcgttgaaag tgtcaaaaag tttaggattt acttgctcga 3120 ccttgagttt acgatcgtta ctgactgtaa tgagctgaaa gcaacgtcga acaagtccca 3180 acttattcca agaatcgcca ggtggtggct gcaactccta gagtttagat ttaaagtaaa 3240 atacagagca ggcactcaaa tgagtcatgt tgacgcctta agtcgcaatc ccaatgattc 3300 caaaacatct aaagacgtct tggctttaga taaagcggat tgggtgcttg caggacagtt 3360 gaccgatcct aaaatcaagg aactacacaa tattttgtct aaaccgccat ctaccgaata 3420 caagcaaaat gttcataaga actatgcact ccgtgaagaa agggtatatc gaaacacgac 3480 gaaaggatta cagtgggtgg taccaaaagg tatgagacag caagtggtta gggccgctca 3540 tgacgaacgg ggacactttg ccgcagaaaa aacgttaagc cgactctccg attgttactg 3600 gtttcccaga atgcgggact acgtctccga ttatattgca tgctgcatcc cgtgcataat 3660 gaaaaagaag ccgtcaggga aacaagaggg ttttcttcac cctatcccaa aaccaaccaa 3720 gcctttcgat acgatacatt tggaccatct ggggccattt ccaagaagca aaagaggaaa 3780 cgtgcacatc attgcgtgca ttgatgcatg cacaaaattt ctttttatgc gtcccgtgaa 3840 gtcaacgaag acagtttttg tcgtcaactt cctgaacgag ctgtatgtga cctacgggaa 3900 gccaagggtg gtaattaccg accagggaag ctgtttcact gcagccaagt ttagggaaca 3960 ctgtgatcaa aatcaagtca cacatgtaaa ggtcgcggta gcaactccca gagctaatgg 4020 ccaagtagaa cgacttaacc gaagcatcct gtcggtgctg atgacgggga ccttggaaga 4080 caacctatgg gatcaaaatg ttccaatggc tcaattttct atcaataacg cggcaaatgc 4140 ttcgaccgga agaaccccaa gcgaattgat gatggggtat aaaccacgac acggtactga 4200 ctcgtatttg agagatgagg tgcagaagat tccaacgatt gtggacgatc ttattaaact 4260 gcggttagaa gcctctgaaa aggtggctca ggtacagaaa cgtcagaagg aactctttga 4320 cagaaaaagg aaaccaccca gagagtataa agtgggagat gtggtagtcg tccggaaagg 4380 ggaagctgca acgggtgaga gccggaaact gatggctcca tttagccagc ctatgactgt 4440 aaaagcggta ttgccaaacg atcggtacct agtcacggac atgcccaatt cacataggac 4500 ccgtaggaag gcaatctatc agcgtgtcat cgccgtggac aggatgaaac cgtcgactaa 4560 acctggagga gtttcagacg agtccgatgg tcctaacgaa gacgatcaag gtgaacctta 4620 aaccgaagtc gtctcgggtc gagccgatta tcaggttggc cgaa 4664 // ID DNA3-6_AP repbase; DNA; INV; 136 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-6_AP. XX NM DNA3-6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-136 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1947-1947 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 136 BP; 38 A; 33 C; 30 G; 34 T; 1 other; aagcctcgtt cacacggcga cagttgcatg acaccagtgt cgcgatattc aacganttag 60 atatacggca ctaggtgtca cgacacacaa tatcattagt gtcatactca ttagtgtcac 120 cgtgtgaacg aggctt 136 // ID LLRP1 repbase; DNA; INV; 817 BP. XX AC M91591; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE L.loa interspersed repetitive sequence. XX KW Transposable Element; LLRP1; Repetitive sequence; KW Interspersed repeat. XX OS Loa loa OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Loa. XX RN [1] RP 1-817 RA Egwang G.T., Akue P.J., Ajuh M.P. and Pinder M.; RT "Cloning and characterization of a Loa loa-specific repetitive RT DNA."; RL Mol. Biochem. Parasitol 56(2), 189-196 (1992). XX DR GenBank; M91591; Positions 1 817. XX SQ Sequence 817 BP; 288 A; 124 C; 130 G; 275 T; 0 other; accgaataaa taactggagt gacttcttgt ctaggaatat tttccaatct aatttcactt 60 ccacatgctt cttatttact ttatttgcac gcataatgag tctgtgaaga aaacccttcg 120 ctatcctcat gaaaaagcat ttacatgtag atcaaattct cgaaattgga acggagacaa 180 cgagtttcac ttcgattgaa acaagcagca tataaaagtg tcattgtcgt ctattgcgaa 240 ttgtcatcta gcctggggtt tattaatggt ttgctaaatg gttataaata tcgttggatg 300 actacagcag tgatcaattt gatgaggttg ttgtgtaaat acgttgtccc tcgttatccc 360 ttctgactgt cgaacaaaca taaatatgct attctgccat tctgattata ccgttaaagg 420 agaaaatcag agaatcattt acataaacct attattacta ttattattaa accaaatatt 480 ttgatattcc aaaaagacta acaataacga tgaaagttga atttataact tcggtatgtt 540 aagtcaaaag taaacaacct ttagaagcaa tttaatgtta gatatcgagt tataattata 600 ttggaataga ttgttcaaat attgataaga tcaaacaaag attatcactg caaattaatc 660 aaatatacag ttgatttttc attttcggaa ctttttaatg aataatgcca gaaagcaaga 720 gttcactgga attgatttca agattactca gttaacttac taataatcaa atgatctata 780 attagaatga tgaacagggt cttatgatag cgtaagt 817 // ID SMAR33 repbase; DNA; INV; 1317 BP. XX AC . XX DT 22-JAN-2008 (Rel. 13.01, Created) DT 22-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR33. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1317 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(1), 21-21 (2008). XX DR [1] (Consensus) XX CC Present on >1000 copies in the genome. The youngest copies are CC ~4% divergent from consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 199..1227 FT /product="SMAR33_1p" FT /translation="MSDEKIVIRALLRHYWKKGLSTRAAVEEICAVEGEGT FT VKKSAAAEWFKRFNDGDLSLDDKPRCGRPSVLNENALVDALNEDKHASTRS FT LATSLGVCNKTVHNHLQHLHYTLKRPRQDPYDLTDAQAKKRVEICQQLLKN FT PTDDRFWRRIVTCDEKWIYFVNNNRLKQWVKHGEEPEPVIRQDRFGKKVML FT SAWWNYQGVLHYEFIPDGHAVNAELYCEQLDRVYNILKVKYPALVNRNRVL FT MQFDNAPCHRARLTQQKIDELSGVEVLPHPPYSPDVAPSDYGLFRSMQHFL FT RGRKFTTLSEVEQACQQFFASKDQNWYFNQIRLLVERWQLVIDNGGLYFDE FT " XX SQ Sequence 1317 BP; 398 A; 242 C; 280 G; 397 T; 0 other; ggtcgtgtaa aaagttcccg ggcagcagca gcaagtgaat ttacaaagca ctttgcatac 60 atggaagtta taagttaata aactcttttt acacgaccta gtatattacc attgcttgaa 120 ttcatttaat tttgtactgg tgtttaaatt taaattttac tctatattta aatttaaatt 180 ctgtcggaaa tcgttgaaat gtcagatgaa aaaatcgtta ttcgagcttt gttacgtcat 240 tactggaaaa aaggattatc cacaagagca gcagttgagg aaatttgtgc agttgaaggt 300 gagggtacgg ttaagaagtc agcagctgcg gaatggttca aacgttttaa tgatggggat 360 cttagcctgg atgataaacc acgatgtggg agaccatcag tattgaatga aaatgctcta 420 gttgatgcat tgaatgaaga taaacatgca agcacgcgtt ctttagccac gagtcttggg 480 gtctgcaaca aaactgtcca caatcatctg cagcaccttc attacacgct gaaacggcct 540 cgtcaagatc catatgatct cactgatgca caagctaaaa aacgagttga gatatgtcag 600 cagctgctta aaaatcctac agatgatcgt ttttggagac gcattgttac ctgcgatgag 660 aaatggattt attttgttaa taacaatcga ttaaaacaat gggttaaaca tggagaggaa 720 cctgaaccag tgatccgcca agaccgtttt ggtaagaaag tgatgctgtc agcctggtgg 780 aactaccaag gtgttctgca ttacgagttt attcctgacg gtcatgctgt caatgcggag 840 ctttactgtg aacagctgga tcgtgtatac aacattttaa aagttaaata cccagcatta 900 gtcaatcgaa atcgagttct catgcaattt gacaatgccc cctgccatcg cgctcgtcta 960 actcagcaga agattgatga attgtctggt gttgaagttt taccccatcc tccatacagc 1020 ccagatgttg ctccatctga ttacggattg ttccgttcaa tgcaacattt cttacgagga 1080 cgaaaattta caactttatc tgaggttgaa caggcttgtc agcaattctt tgccagtaaa 1140 gatcaaaatt ggtattttaa tcagattcgg ttgcttgtgg agagatggca gcttgttata 1200 gacaatggag gactttactt tgacgaataa aacggttggt taatcatgtt tgctttcatt 1260 gtcattctat atgttgaaat tgtaatccaa aactgcccgg gaacttttta cacaacc 1317 // ID GLT_SM repbase; DNA; INV; 4213 BP. XX AC . XX DT 15-AUG-2009 (Rel. 14.08, Created) DT 15-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Repetitive DNA from Schmidtea mediterranea: consensus. XX KW LTR Retrotransposon; Transposable Element; GLT_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-4213 RA Jurka J.; RT "Repetitive DNA from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1912-1912 (2009). XX DR [1] (Consensus) XX CC 4bp TSD. Includes remnants of PiggyBac sequences and some CC similarities to Gypsy sequences. 96% identical to consensus. High CC copy number (>30,000). CC Preliminary classification: Gypsy LTR. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 4213 BP; 1153 A; 701 C; 763 G; 1587 T; 9 other; tggtgggtta cagggtttca aatcttgctt atcctcctgg cgaccctagt aacgccatac 60 aaaatgatac tgatcgccac ttggcacccg ttgctaggca acgttggtaa tgtgttcaag 120 tggtggattg cccgcaaaaa ttgagtacca ttggctagta ctatagttgc tcggacaacc 180 ctcgggttca cgagctgtcg atagaagacc tgtgctcgct ttgctctgtc cctcgtaact 240 attcaggatt agttcttaat tgactgtgga acgaattggc gatgttggca tacggttata 300 ttaaaataac aattcaacta agtttaaaac tattctattc gatctaaaca cattatcatt 360 tactgcttga tgttttggag ttgggctgca cgacgtcggg cacgttcctt acgtcccggt 420 cgacggttgt tggttgtagt gatggattgt tctgctggct tggtggttgt ctgtggcagt 480 tgggttgggg cgaagatgag cgggtaagtt accgcttcac cttctcggca taccgctagg 540 gcgacttcga gtccccgaat ctgcgtgcgc agactctggt tgagttgccg ttcttcttcc 600 cagcctttct gggccgctgt gagtcgatgg tagattgctt ttgtctttgc gggatccgtc 660 catgtygctt tcatatgttg catttctcgc gttaattgga taatctgggc gagtaattct 720 gtgtttgctt gtctcgcgtg gtccagttcc gtttgcaaga ttcgtaagga agacatcgct 780 gtaattaaac agattatttt tatttattat tgctcaagtt aaagagtcat ctatttcatt 840 taaacaaaat tgcaaataat tccaattaca caaaacaaaa gaacacacaa attgctaata 900 ttttaaatca tccactacat aaattgtcat tatggagtca ttcctcggga cagtatcgct 960 tcaggcttcg tcggagagcc tccactgtgt agggtcttcg ggtatctcgg actgggcgag 1020 gtggataggc gatggcgatc cgcccggcga ggatgtcttc taccacgttt gcaaggtaca 1080 tcctcgcctt aaccacctgc agatctactt gctcctgcac cctttcccac tgggtgagga 1140 ggttctgtgc atttgtgtag gtttctgctg ctggrtattg tctttctggg atatcgcagg 1200 gctccttctc tacgtctaat ayyctccaag cagtcggata ctgccatgat tgatgtatcg 1260 gatctgctga catcgtggtc gattgagttg ttgactgact ctgctttcct acacttcgtc 1320 tggtgttttg ttcgtgtggc atatctaaaa cgctctatta ttatctctat taactttatc 1380 atgttaaatt attaaacaaa tagggcttaa gtcactgtcg agtgggtcat ctgatttaat 1440 cttgttgagt tctagtggct ttctatctga ttggactgct ttcgaaacgg gcgaactatg 1500 ttggtgtgat tacgtttaca tgaggtaagg ctattgatcc actcatttta tttaattaag 1560 caagatgatg tcaagaaatt gtcgagttga ttttaaggat agcggctacg tcgatggggg 1620 tagcgtcgaa tgtaagtagg gattttaaag atttggtacc tgtattctat tttattcgtt 1680 tacttaaatt ttcattttta gattttccaa tataacattt aatcattaaa cctagcaact 1740 tttatcgtca catcctatga tataaattca tggattttct aaatccaata tcaacgtcaa 1800 ttttgtttac caaatgttat aaycaaatga tcgtcggaag gcgagagcgg atgtcgattc 1860 gctgaagcgt cgacgtaygt gtggtgtaac ttccggcgga tatcgcgatt tgaacggttg 1920 aaaggaagta agattatata gttgcatccg gtgatttaca atatttcatt ttagaattat 1980 cgattggcga attcgtcgaa aatctattcg ggtaagttat tctgcaaatt aatttgaata 2040 tcttaacgcc attttttaag ggaatcagta tattacgttc gaaatcttga cggactttat 2100 gctagtggaa tattttctct tatggatttt tatgcgttcg ataattttgt gagttaacga 2160 ttttgtatgt ttttaaaata aaagtttttt ctcattttcg aaacgttgat ttttataatg 2220 atgttaattt aatttaaaca ataattcata tatcttgttt ttcgtctcta aacttgtttt 2280 aatttttaga aaaccttatc ataaacttat aaatcatttc agtttaggtt tgggaatgat 2340 tgaattaact tagttcggtt tcgttactca gttttcgcat atcttttatt acgctaattc 2400 attcaacctt tacttcaayt ctataaaact atattaatat cattgcttca aataatttat 2460 tcaaccttta ttttaattcc gtgaacctac cttaatatta acatcattgc taatttattc 2520 taccttgatt ttaattctac gaaactattt taacattaac atcatactag taacttcatg 2580 aatctagttt gatctttact aagggaacta ttttactttt attcgaatcg tctaacttta 2640 attactgaat gacttggttg atcggcatcg agttttaatt caattaakgt atttttatta 2700 accgaggaac tatttaacca ataaactaat atatgaaaac tttgattcta atctataact 2760 acgttaactt taattgtttc aattatttaa ccataaagtg ggtatattta cgcaaaattc 2820 ctttgaattt ttccaaaagt gaaacaaatt cagtttaatt cagaagttat ttgtctttaa 2880 ttattaatca atcatttaat ctaatcgagt taataaatta tttattgaat aatttttaca 2940 gttaatttag tataccaatg attagtttat tacacattat aggtttcaat tagttatatc 3000 aacgatttta atttatttta tccccgtacc ctttcccttt aatagacctt aggcgcattt 3060 tgtgttcata ttatatttac tgagtgccca ctggactact cggtgtgrtg attttaagcc 3120 gtatgtaata taattttagg tcgattttac agaattgaga aacgaggaag cagccagaga 3180 cttcttcctt atttctcgct tcagctatag ggataaatat tgacgttacg accgctgcgt 3240 tgcgcacgct gagtcttcac gggtgcatat cattggcgat tttccacgct cttacatgca 3300 tctgcactat gttgatttta ctatcttgcg tgatgtgtcg gctgtgggcc ggaaaaattt 3360 cgacatttaa aatttataaa gacatataaa acaagttata tatattaata atatttatta 3420 attataaaac aaactaatta cccattttta accattccat aaattgaaaa aacatcatag 3480 cataaaatag aacataacat aacataaatg cctctctccc cacgtcgccg ccaagcagag 3540 ctctatagag atgctatgga cgtgacgtgg aaagagggtt ccaaccaaaa tatttctcac 3600 ctagataata actgtaaatt cgaaatagct tcaaacaatt aagattttca aaaatagtta 3660 aaacaattct caaagtggtt ttcaagattt gaaaatatgg ttttttcgaa atggttttta 3720 ataagtttga aaaatggtta agttaaatgt gaaaatggat cagtatatat acctaatttt 3780 atggtgtgtg aaatttgatg cagtaaaaca aaattcgacc ttgttctttt ctagcaaatt 3840 ctattattta tttttattat ttttcgattt atttttataa ttttttattt ataattttta 3900 ttgaattatt gctaattcaa atttttcgat ttaaatttca ctttaatgaa ctttaattga 3960 ttttttcaat ttcgtcgaat ttgttgtgtt ttcttgttgt atttgtgtag gtatcgtgcg 4020 gatcttggtg cattttgtta atattttaat ttatttttat gtttattatt ttatatttaa 4080 ttcttttatt tctcagattt tatatggttt acttgatgtt tcagtgggtt tatttaaatt 4140 gcgtgttatt tttgcgtgtg tgtttctgat ccatttaaac atttttgaag gccgtttacg 4200 ttgtggccca aca 4213 // ID Gypsy-22_DYa-I repbase; DNA; INV; 4420 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_DYa_; KW Gypsy-22_DYa-LTR; Gypsy-22_DYa-I. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4420 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 3241163 3236744. XX CC Positions [1493-1996] - Reverse transcriptase CC Positions [3109-3582] - Integrase core CC 'TATG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 101..2359 FT /product="Gypsy-22_DYa-I_2p" FT /translation="MHATRAPASDEKALHVTLPKFCADSAGADPSAWCTTV FT DLIFADNALVGSALVLALSKALEGSASQWLSQICFAGITWSQFKELRFVGI FT ETTAAILMNVLNGRPKPGESFAQYGSRIVTLLLSKWKAKDLEEIAVSVAVA FT HMAQIDNNLLRWVFTTNVFTRNELQQQLQAYAFKKRNNEDDFAPEKKKSRM FT QSQTKCHFCAKVGHKFAECRARMEGTSNTKGRSHSGSNTSGSRDRSNIRCF FT KCDEMGHVASVCPKGNNKYIEKRVDLCKLSEPSGLFFQSGEPFPFFFDSGA FT ECSLVKEKLSNKLAGKRSNNLVILRGIGDNIVKSHLQILSTVQMSKFNLEI FT LFHVVMDNYLKHDIMIGQDILKLGFGVMIESDKFSIYKSKIINAVQNDAPE FT NTLEMLFDTNTDLSDNDKSKLRNILEKYADSFVIGIPNKRVTTGALEIQRR FT PYRLSADEKLLVREKIEQMLESNVIRPSSSPFASPMLLVKKKDGSDRLCID FT FRELNLNTISDKFPLPLISDQIDRLRGGKYFSILDMASGFYQIPIHPDSIE FT KTAFVTTEGQYEFLAMPFGLKNASSIFQRAIVQALGDLAYSYAVVYIDDVL FT VVANTKEEAYERLNTVLKALSEAGFSFNIKKCNFLCTGIEYLGFHIKDGEI FT KPNPRKIEALSVLSQPTSTTQLRQFIGLASYFRQFVPRFSEKMKPLYQLTS FT KLNAFVWKEEHEKIRQDNIKVLTCDPVYSTLSFQLNCILMPVQMAMGLCSC FT IK" FT CDS 3010..4053 FT /product="Gypsy-22_DYa-I_1p" FT /translation="MTKYVRKVVENCMTCKLSKPSSGKMQIEMHPIPKIDI FT PWHTVHVDISGKLSGKNDLKEYIIVQIDAFTKFVHLYHTLNLDSENCIKAI FT KSSMSFFGVPSRIIADQGRSFASNKFREFCSTNRIKLHLIATGASRANGQV FT ERVMSTLKTMLTAVETSQRSWQDALAEIQLAMNCTTNRVTKASALELLIGR FT EARPFGLLPVDVEESVVDREQMRAQAKENMQKNAQYDKQRFDKNKAKIVPY FT KEGDHVLLKSEERHQTKLDPKFKGPFQVINVLDGDRYELKSLVNKRTYKYS FT HEWLRALPDRRLTTECNGDTQEGEKLKDVTSERASVSQEGKDLINGTNEMA FT CVQKE" XX SQ Sequence 4420 BP; 1458 A; 763 C; 1037 G; 1162 T; 0 other; tcagaagtgg gatcgccttc gccacaagag cccagcgctg atcagtatat ggctgcgttg 60 gaaacacaaa accgtaacct catggaaata ataaaaaaca atgcacgcca cgagagcacc 120 ggcgtcagat gaaaaggctt tgcacgtcac actgccaaaa ttctgcgctg atagcgctgg 180 agcggatcca tctgcatggt gcaccacggt ggacttaatc tttgcagata atgcgcttgt 240 aggcagtgcg ctcgtattag cgttgagcaa agcgctagaa ggcagtgcgt cgcaatggct 300 gtcgcagata tgcttcgctg gaatcacatg gtcgcagttc aaggaactgc gattcgttgg 360 gattgagacg acggctgcca ttctcatgaa cgttttgaat ggacgtccaa aacctggaga 420 gagctttgcc cagtatggaa gtcgcattgt caccttgttg ctgtctaagt ggaaggccaa 480 ggatctggag gagattgcgg tatcggtagc ggtggctcac atggcacaaa tcgataataa 540 tttgttgcgt tgggtgttta cgacaaatgt gtttacgcgc aatgagcttc aacaacagct 600 gcaagcttac gccttcaaga aacgtaacaa cgaagatgat tttgccccag aaaagaagaa 660 gtcgaggatg cagtcgcaga ctaagtgcca tttttgtgca aaagttggcc acaaattcgc 720 cgaatgccgc gctcgaatgg aaggtacatc aaacaccaaa ggaagaagcc acagcggaag 780 caacacgtct gggtcaagag atcgttcgaa cataagatgt tttaaatgcg acgaaatggg 840 acacgtggct tctgtatgtc ccaaaggcaa caacaaatac atcgaaaagc gagttgactt 900 atgcaaatta agcgagccaa gtggattatt ctttcaatcg ggtgagccct ttccattctt 960 cttcgattcg ggagccgaat gttctttggt aaaagaaaaa ttaagcaaca aattagctgg 1020 caagagaagt aataatctag ttattttaag aggaattggt gataatatag ttaagagcca 1080 cttgcaaata ctgtcaacag tccaaatgtc aaaatttaat ttagaaattt tgtttcatgt 1140 tgtaatggat aactatttaa agcacgacat aatgattggc caagacatat taaagcttgg 1200 tttcggtgta atgattgaat ctgataaatt tagtatttac aaatccaaaa ttattaatgc 1260 cgttcaaaat gatgcacctg agaacacctt ggaaatgctt tttgacacca atacagattt 1320 aagtgataat gataaaagta aattaagaaa tatattggaa aagtatgccg atagttttgt 1380 aattgggata ccaaataagc gtgtaaccac aggagcattg gaaattcaaa ggcgcccata 1440 tcgactcagt gcagatgaga aactgttggt cagagaaaag atagaacaaa tgcttgaatc 1500 taatgtaatt cgtcccagca gttctccatt tgcaagtcca atgttattag ttaagaagaa 1560 ggacggcagt gataggttgt gtatcgattt tcgtgagcta aacttaaata cgatttctga 1620 taagtttccg ttgccactga tttcggatca aatagatcgt ttgcggggtg gtaaatattt 1680 ttccatctta gacatggcca gcggatttta tcaaataccg attcatccag attctataga 1740 aaaaacagca tttgtgacta cagagggtca gtatgagttt cttgccatgc catttggtct 1800 aaaaaatgcg tcgtctattt ttcaacgtgc cattgttcaa gctctggggg acttagccta 1860 ttcttacgca gttgtttaca tagacgatgt gctagtagtt gcgaatacaa aggaggaagc 1920 atatgagagg ttgaatacag tgttaaaagc attgtcagag gcagggttct cgtttaatat 1980 aaaaaaatgt aattttcttt gtacaggaat agagtaccta ggattccata tcaaggatgg 2040 agaaattaaa ccaaatcctc gtaagataga agctctttct gttttatctc agccaacctc 2100 gacaacacag ttaaggcagt ttataggtct ggcatcgtat ttcagacaat ttgttccacg 2160 attctcagag aagatgaaac cactgtatca gttaacctct aagctgaatg cttttgtttg 2220 gaaggaggaa cacgagaaga tacggcagga taatatcaaa gtgcttacat gtgaccctgt 2280 atattcgacc ctgagcttcc aattgaattg catactgatg ccagtgcaga tggctatggg 2340 gctatgctca tgcataaaat agaagggaaa cacagagtag tcgaatatta tagcaaatgc 2400 acgtcaatat cagaagccaa atatcactcg tatgagttag agacgctagc tgtctataat 2460 gcagtaagac attttagaca ttatttgcac ggtagaaaat ttgttgtgta tactgattgc 2520 aactcattga aagctagtcg tacaaaggct gagttgacac caagagtaca ccgctggtgg 2580 gcgtacttac aggcatttga ttttattgta gaatatcgca agggcagcca aatggctcac 2640 gtggattttt tatcacgaaa tccgattccc acagtttcaa atgccataag caaagtagat 2700 gaagttcgtg ttgatctggc gacaataaca gataattggc tattagccga acagcaacga 2760 gatgagggca taaataaaat cttggctcaa ttgaataatg aggacatggc ggcaaatatc 2820 gataacacat atgaggttcg aactggatta ttatatcgca aaatacagcg taatggcaag 2880 actcgttgcc taccagtaat ccctaaagcg tttaggtggt ccgtcattaa ccacgtacac 2940 gaagctgtaa tgcatttggg gtggcagaaa actttagaca aagtctacga gaaatattag 3000 tttgaacaca tgaccaaata tgtaagaaaa gttgttgaaa attgcatgac atgcaaactg 3060 tccaagcctt catcaggtaa aatgcaaatt gaaatgcacc caattcccaa gattgacata 3120 ccttggcata ccgtacacgt agatataagt gggaagctta gtggaaagaa cgatctaaag 3180 gaatatatta tagttcagat agatgcattc actaagtttg ttcacttgta tcatactcta 3240 aatcttgact cggaaaattg tattaaagct ataaagtcaa gtatgtcttt ctttggagtt 3300 cccagtcgta ttatagcgga tcagggcaga agttttgcca gcaacaagtt ccgtgagttt 3360 tgttcaacaa acaggattaa gttgcactta atagctacag gtgccagtag agcaaatggg 3420 caggttgaaa gggtgatgag cactcttaag accatgctca ctgcggtgga aacgagtcag 3480 cgatcatggc aagatgcatt ggcagaaatc caactagcca tgaattgcac cacaaatcga 3540 gtcactaagg caagtgcctt ggaattgctg ataggaagag aagctaggcc ttttgggttg 3600 ttgccagtcg atgtagaaga aagtgtagtc gatagagaac agatgagggc acaagccaaa 3660 gaaaacatgc aaaagaatgc acagtatgat aaacaaaggt ttgataaaaa caaggcaaaa 3720 atagttccgt ataaggaagg agatcatgtg ttactgaaaa gcgaggagcg gcatcaaacg 3780 aagcttgacc ccaaatttaa gggcccattt caggtaatta atgtactaga tggtgatcga 3840 tacgagttga agtcgttagt taataagcga acatacaaat attcgcatga atggctaaga 3900 gctctaccag acagacgatt gacaacagaa tgtaatggtg acacccaaga gggagaaaag 3960 ttgaaagatg tcactagtga acgtgcatct gtgtcccaag aggggaaaga tttgataaac 4020 ggcacgaatg aaatggcgtg cgttcaaaaa gagtaatgac cgcactataa cagctatctc 4080 gcagggacga gaagtgttat ggcgggggtc aagagattgg gtgtgagaga agacacgcca 4140 atgttgatat gtgaaagtaa agttatggaa aatatgaata gtttgaagtt ttgatacgta 4200 aattggagat gtctttataa ttaagtttgt aaaagaaaaa tcaatcatga gatgagttgt 4260 caatcaatgt ttgtgtaagc ggctctataa gttaataaaa gtaagagaga aatacattga 4320 agtttgttaa ataaattgga attaagttga tgaaatattg tgagtttgac ttgaatggtg 4380 gattagacac acgaggacgt gtgataggtc aggaaggccg 4420 // ID Gypsy-23-I_NVi repbase; DNA; INV; 9054 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-23-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-9054 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 783-783 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 707..2440 FT /product="Gypsy-23-I_NVi_1p" FT /translation="RRREQVSLTRRSGLRWLEVRQGHREVNRVHVSQSSRG FT SSKSGGSRDGRESYESEVRIINKQRRRKRAQESRERAVLRTRRTADRLLYY FT SMDEDEQLTYAFGARGEGELRTDRRAPRVAETIKRSEPSRKGTPTVLEWDN FT YLPRRYSESESESFRSRGSAREVRRPIERVVKPEYREDSRDRRERTNPSRR FT GDDLSNQKLQTVKILRSWGISKFAGEEKSEDPEEFLEQIDECRDGSGISDA FT ALLAAIPCCLTKKASRWYRTARAEIFSWPEFCKKFRNHFMKFYDREDLMDD FT LQRRTQAEGEKITSYLASFKYIISRFRHPPSEREQVELAYRKLHPKYREAI FT GDKIIETLDDIERYGRIYERKKDLDLRYLPPPAAEKMRVPGAAYTGSRKPK FT VAAVQEEEEGVAGVESAAGGKKGGKGKGKNKAGNKGGAQASPPEVSAESES FT APTPSPLSYAAAVQNSAKGGVAHANQANSQQAGNQLPNSNNAPNRDSNAKA FT KAVASGGAVGAGGEAKRQYLLNASGANKPNVAEGFTSARINEFAGACYTCQ FT GVGHRASECPEVVCFACRRKGHVSRQCPNK*" FT CDS join(2525..6040,5877..6551) FT /product="Gypsy-23-I_NVi_2p" FT /translation="MSQVRSFIWKSGKREERGTEINPVSRNSIAFTCERDR FT TSSESSVAVEEPSVVTSESDVDSRESGSVAKIHPRQVCWGVSGKTIKHLDE FT KAVCQTPAERTLPEVQEKSAAELQQSLKQRGLGQLKASEAFGRSVPWVNGL FT PAWSEGEFSRDAIEAVQSRAAKRKEVTFEENSKDSSDGDEAAGYAKLGEIA FT RVIELIEEREVPAEPAQVKQNLAGRSKSVGEIQVEKKTESAKSADGPFSEL FT LEKEQTLAGELEREIDESEVVVAGLTADNRNYVFVSLGGKRILALLDPGAT FT LSMVSPKVAEHYRGRLLPINMRVRTATGKLSKVQGVMRVKVDVDQNVREID FT FKVVPDLDHDAILGMDFCEKYDVDTRHGRKLWRVCEGPWHAFASATRNKET FT LVFAECAGISEVEPDQREEIQRVVDRVLANQPEGAGLTWMAEHHIVVTDSR FT PIRHRWRRRSPKTMQLMVEEVERLFFEGFIERSAGDYTNAPVMVRKADGTF FT RFCIDFRDLNKITRRDAYPMKSMDSILDRLRRARYLSKIDLRQAYYQIPLA FT KESRKYTAFALPGSGLWQFTRMPFGLVNAPMTLQRLLDALFGPEYEPHVFS FT YFDDLVIATETFEEHVEWTERVFSKLKEAGLVVNKKKCQFFCEKIQYLGFV FT LDRDGLRPDPEKVAPVLDYPAPKNVKQLRRFLGMVGWYSRFISRDSELKIP FT LVKLLRKGQAWQWGDEQQESFDAFKKALTEAPVLARPDFNKPFTLQTDASS FT HSIAAVLTQEGEDGEHPILYVSRVLTPAERNYTVTEKEALAVVWAIKKLRP FT YLEGYKFRVITDHSALKWLQNLRDPTGRLARWALELQQWDFEIEYRRGALN FT HLPDALSRAFEPEEEVASFGEIKDPEYLKLLIEVEKFPLKYSNWRVEDGRL FT YRFRKEPMLDPIIDRDEGWRLVVPVEQRERVLTDAHCTPSTGHLGVEKTYD FT RVAREYYWKGVYHDVYNFVNECETCRMYKVSQKGRQGLLGKRIVERPWVVV FT AADLMEFPPSKLRNKYLVVFQDLFTRWVEVKQIRKADGRSVARAFEELVLF FT RWETPEYFLSDNGKEFDNQYLAGVLKEYGVKHVTTPPYHPQANPVERSNRT FT LNTMIASFVKERPPELGCACARVSSCGEHSRTVVSQRLACVSELRQTPATG FT GEFAQRSGAEKGCKKDHQNWDVHVHEFRHAVNTAVQSSLNVSPAFLNYGRH FT PRPVASLRREVEQRKGVEKVNPEDWANRMKRLDALRDLVSQHIDQAQEKQA FT REYNKGRRVVTFQVGSWVDRRTHPLSGAAKKFSAKLVPKWEGPFVIEEVLG FT PTVYKLGPAPGQDASRRIAKADIIAATYTEQSSSEGSEPRLIFFIVFRGTR FT CRGRCRDSVATAAPTKRKREFCWRSHERASRSS*" FT CDS join(6311..8185,8110..8892) FT /product="Gypsy-23-I_NVi_3p" FT /translation="AGTCTRAGCESEDSEGRYHCGDIYRAEQLGRQRTSPD FT FLYSFQGNEMPWKMPRLCGHSSPDEEEERVLLEEPRESEPQQLGQAGAETS FT VIPQRSAEESVIMMQPGAALPAPDTRETGQEASVSELEPPTAVTVLAALNQ FT ESELEYEECGEGEASVSADTVAGAFEPAAAEVVASVAAVTAEETAKGNPSW FT ADEVEGQPSQEEPQAIQEVTECASQAEQAQHGEMRPPIKIPPLPPIVSAVT FT FNPNFMLPDEPERYAWLQPSDASVLFQRSLAERHGAIPKSWGTLRRPRTIL FT PLQKPTGRAAEKETTPERMETEEEPEDRSXVEXESEKAKKELEYLQAGTGS FT STSNGSTPGLRSPVKPMQVVTPTAQRSGAPIPRMIPIDAGQAALVSWDEDW FT DNEGQQPPARIAAQVPARVEEEEPEPPRRKSRKRKKAYNPRRQLTGDDYFP FT LHLESREASPEPGAPVRPGQSREEGAFAGAPLRVVKKTSRPSSPSVGKFRC FT PGEAAWRRYNFPLSRLPPRRGGVLFRDGQPVPVVPAIDPPPYCCFNCWDRN FT HAVRSCPEPERRDYCANCGRHGVRIEECPRCAEEFKRRERQRQPEDRSGSP FT PRKLTSRSRRDELERERKSVRDVSEKSKADFEKSERRIREREEECKRRQRE FT ELEKAAERRHQEMRREEERRERDERWRAEEERRQREEEQNRRYVEQRKAER FT RAEEARQEAEALRLEEQRRVENNRLREEAARLLEQREEQALRAELLRQLNE FT RKQSEMQRVAAVRVEEEARRAFEQRRLREQQEELLRTEELRRQIREAELRT FT AEELRRIETQRQALEARLLQPVRGVVPEREGRVLGPPRVPHLPAEPAQTRD FT PVRDALALLEGLQGVTNETKDAVLRRVYGQPQ*" XX SQ Sequence 9054 BP; 2364 A; 1939 C; 3022 G; 1725 T; 4 other; taaatggcgt acaccatggg gctcgacgga aattagtgta aactggaaaa tcgaaaaatt 60 gtacgcgaag aatttaagcc ggaagcgtgt gtttgaaaga agtatgtgtg tgtgagaaga 120 gtgtgcgttc gcgtttaggg taagagatga gtgcacgaaa ttcggtcatg gccggtagag 180 gcaggggaaa cgttgagcac gcaatgatgc gggagctcat gcgagatcaa gtgttgcgtg 240 ggatcaagct gaaagtagca tggcttaccc cagctgtgtt agtagctgag ttagaggaga 300 ggaggcttag caccaggggc acggtagagg tcttgcagaa tcgtctgacg tacgctgtga 360 ttcgtgccaa aagaccagag ctggtggtgc cttgattccc gaacgctcag ccgcaaaaaa 420 cagggactag atttatcgcg agttaccgtg cgggagatgc tctcgagatt tcgcggttct 480 gagaccgagt atacttacta cacgccgcag atggtaagag atggcgttct tccgacggct 540 ccagagttgc gaaacgagca gctcaacgtg agagctgccg caacgccgag aatataggca 600 ccaagggcac aagcgctcaa attgcaagtg ccgagagaac aggtgctgag atctccggag 660 agtcagagag gtcgtggcat gcgacaaacg actcctgtag tcttagcggc ggcgggagca 720 agtaagctta acgagacgtt ctggtttacg ctggctggag gttcgccagg ggcaccggga 780 agtaaaccgc gttcacgtta gtcagagctc gcggggcagc agcaaatcag gcgggtcgcg 840 tgacgggaga gagtcgtacg agagtgaggt taggattatc aataagcagc ggaggcgtaa 900 gcgggcgcaa gagagccgtg agcgcgctgt cttgcgaact cgccgaacag ctgatcgtct 960 gctgtattat tcgatggacg aggatgagca gctgacttac gcgttcggtg ctaggggcga 1020 aggagagttg cgaacggatc gccgtgctcc tcgagtggct gagacgatta aaaggagtga 1080 gccgtcgcga aaagggacgc cgactgtgct cgagtgggat aattatttgc cacgccgtta 1140 ttcggaaagt gagagcgagt ctttccgtag ccgcggaagt gcgcgtgagg tgaggcgacc 1200 gattgagcga gtagttaagc cggaatatcg agaggatagc cgagataggc gggagagaac 1260 gaacccgtcg cggcgcggag atgacctgtc gaatcaaaaa ttgcaaaccg tcaaaatttt 1320 gagaagctgg ggaatttcca agttcgcggg cgaggaaaaa tcggaggatc cggaggagtt 1380 tctcgagcaa attgatgagt gccgagacgg atcaggaatc tccgacgcgg cgttgttggc 1440 cgcgattcca tgttgcctga cgaagaaggc gtcgaggtgg tataggacag cgcgtgcgga 1500 aattttctcg tggccggaat tttgtaagaa atttaggaat cacttcatga aattctatga 1560 ccgcgaggat ctaatggacg atttgcagcg ccggacgcaa gcggaggggg agaaaatcac 1620 gagttattta gctagtttca agtatataat ttcgcgattt cgtcatccgc cgtcagagcg 1680 tgagcaggta gagctggcgt accgtaaact ccatccgaag taccgtgaag cgattgggga 1740 taaaataatc gagacgctcg acgatatcga gaggtatggc cggatttacg agagaaagaa 1800 ggatttggac ttgcgttacc ttccgccgcc agcggctgag aagatgcgag tgcctggcgc 1860 ggcgtacacg ggctcgagaa agcctaaggt agccgcagtc caggaggaag aagagggagt 1920 tgctggggtt gagagtgcgg ccggaggaaa gaagggtggc aagggaaagg ggaaaaataa 1980 ggctggaaat aagggaggcg cgcaggcgtc gccaccggag gtatcagctg agtctgaaag 2040 cgcgccgacg ccgtcgcctc tctcatatgc cgctgcagtt cagaactcgg cgaaaggcgg 2100 agtggcccac gctaatcagg cgaactcaca gcaagccgga aatcagctgc cgaatagcaa 2160 caacgcgccg aacagggact cgaacgctaa ggcgaaagct gtagctagtg gaggtgcggt 2220 tggcgcgggc ggagaggcca aaaggcagta tctcctgaac gcaagcggcg cgaacaaacc 2280 gaacgtggcg gaaggattta cgagtgcgcg tattaacgaa tttgcggggg cgtgttacac 2340 ctgccaggga gtaggtcatc gcgcgtcaga gtgcccagaa gttgtctgct ttgcgtgccg 2400 gaggaaggga cacgtgagcc gacagtgtcc gaataagtag acgcagggga ctttcgtaga 2460 gaataacttc tggcccgcga gaacaatgcc aagggtgtaa caggcccggg gttacggtta 2520 ggacatgtcc caagtgcgct cctttatttg gaaatctggg aaacgcgaag agcggggcac 2580 agagataaat cctgtgtccc gcaattcgat cgcgtttaca tgcgagcgag atcggacgag 2640 tagtgagagt agtgtagccg ttgaagagcc tagtgtagtc acgagtgaga gtgatgtaga 2700 ttcgagagag tcgggctctg tcgcgaaaat acacccgcga caagtctgct ggggagtgtc 2760 ggggaagacg ataaagcacc ttgacgagaa agctgtatgt cagactcctg cagaaagaac 2820 gttgccggaa gtgcaggaga agtctgcggc tgagcttcaa cagagcttga agcagcgcgg 2880 gttgggacag ttgaaagcga gtgaagcttt cggccgctcg gtgccgtggg ttaacggctt 2940 gcctgcgtgg agcgaaggtg agttctcgcg cgatgcaatc gaggcagttc aatctcgcgc 3000 cgcgaaacga aaagaggtga cttttgagga gaattcaaaa gattcctctg acggtgacga 3060 ggcggccggt tacgcgaagc tgggagaaat cgcgcgagta atcgagttga tcgaggagag 3120 agaggttcct gctgagcctg cgcaggtgaa gcagaaccta gccggaagga gtaagagtgt 3180 aggggaaatt caagttgaga agaaaacgga gtcagccaaa tcagctgatg ggccgttttc 3240 tgaactcctt gagaaggagc agacgcttgc cggagaatta gagagagaga tagatgagag 3300 cgaggtagtg gtagcggggc tgacagctga caatcgaaat tatgtgtttg tcagtcttgg 3360 cggcaagaga attttagcat tgttagatcc aggagcgacg ctctcgatgg tctctccgaa 3420 agtagcggag cactatcgag gtcgtctgct gccgattaac atgcgggttc gaactgcgac 3480 cgggaaattg agcaaggtgc aaggggtcat gcgtgtgaag gtagatgtag atcagaacgt 3540 gagggaaatt gatttcaagg tagtgccgga tttagatcac gacgcgattt tagggatgga 3600 tttctgcgaa aaatacgacg tggacacgag gcacggccgg aagctgtgga gagtctgcga 3660 agggccgtgg cacgcgttcg cgtcagcgac tagaaataaa gagacgctag ttttcgcaga 3720 gtgtgcgggt atttcggaag tcgaaccaga tcagcgcgag gagattcaga gagtcgttga 3780 tagggttttg gcgaatcagc cggagggcgc gggtctgacg tggatggcgg agcatcatat 3840 cgtcgtcacg gattcgcggc cgattcggca tcgctggaga cggcgctcgc cgaaaaccat 3900 gcaattgatg gtcgaggaag tagagcgttt gtttttcgag gggtttatcg agagatctgc 3960 gggagattac actaacgcgc cggtcatggt tcggaaggcc gacggtacgt tccggttttg 4020 catagatttt cgcgatctaa ataagattac gcgaagagat gcgtatccga tgaagagcat 4080 ggattcgata ttagaccgtc taagaagggc gcgctattta tcgaaaattg atttacgtca 4140 ggcgtattat caaattccgc tcgctaagga aagccggaag tatacggcgt tcgcgttgcc 4200 aggctcgggg ctgtggcaat tcacgcgaat gccgttcggt cttgtcaacg cgccgatgac 4260 gctgcagcga ttgctcgacg cgttgttcgg tccagaatac gagccgcacg tgttcagtta 4320 tttcgatgac ttagtgatcg ctaccgagac gtttgaggag cacgtagagt ggactgagag 4380 ggtattcagt aagctgaagg aggcggggtt agtcgttaac aagaagaagt gtcagttttt 4440 ctgtgagaaa atccagtacc tagggttcgt gctcgatcga gatggactaa ggccggatcc 4500 ggagaaggta gcgccggtgc tggattatcc tgcgccgaaa aacgtgaaac agttgcgtag 4560 attcttgggg atggtagggt ggtattcgcg ctttatcagc cgggattcag agttaaaaat 4620 tccgctggtg aagctgttga gaaaagggca agcgtggcag tggggtgatg agcagcagga 4680 gtcttttgac gcatttaaaa aggcattgac ggaagctccg gtgctcgcga ggccggactt 4740 taataagcct tttactctgc agaccgacgc gagctcgcac tcgatagccg ccgttctcac 4800 gcaggaaggt gaggacggag agcacccgat tctctacgtg agtcgagtgc tcacgccggc 4860 ggagagaaat tacactgtaa ccgagaaaga agcgttggcc gtagtttggg cgattaaaaa 4920 gcttcgtccg tatttggagg gctataaatt tcgcgtgatt acggatcact ccgctctaaa 4980 atggcttcag aatttacgcg atccgacggg taggttagcg cgttgggctt tagaattgca 5040 gcaatgggat tttgaaatcg aataccgtag gggcgcgttg aaccacttgc cggatgcgtt 5100 gtcgagagcg tttgagccgg aggaagaggt ggcgagtttt ggggaaatta aagatccgga 5160 gtatcttaag ctgcttatag aggtagaaaa atttccgctt aaatactcga attggcgtgt 5220 cgaggacgga aggctttata ggtttcgaaa agagcccatg ctcgatccga taattgatcg 5280 ggatgagggt tggcgattag tagtgccagt agagcagcgt gagcgcgtcc ttacggatgc 5340 tcactgcaca ccgtcgacgg gtcatttggg cgtggagaag acgtacgatc gggtagcgcg 5400 cgaatattat tggaagggag tgtaccacga tgtgtataat tttgtgaacg agtgtgagac 5460 gtgccggatg tacaaggtgt ctcagaaggg aagacagggc ctgctcggca agagaatagt 5520 cgagaggcct tgggtggtgg tagcagcgga cttaatggaa tttccaccga gcaaattgcg 5580 aaataaatac ttggtcgtat tccaagattt attcacaagg tgggtggaag ttaagcagat 5640 ccgtaaagcg gacggaaggt cagtggctcg tgcgttcgag gaacttgtgt tgtttagatg 5700 ggagacgcca gagtatttct tgtcggataa tgggaaagag ttcgacaacc agtatttagc 5760 cggagtgttg aaggagtatg gtgtgaagca cgtcacgacg ccaccgtatc atcctcaagc 5820 gaaccccgtt gaacggagca atcgcacttt gaatacgatg atagcgagtt tcgtgaaaga 5880 aagaccacca gaattgggat gtgcatgtgc acgagtttcg tcatgcggtg aacacagccg 5940 tacagtcgtc tctcaacgtc tcgcctgcgt ttctgaatta cggcagacac ccgcgaccgg 6000 tggcgagttt gcgcagagaa gtggagcaga gaaagggtgt tgagaaagta aatccggagg 6060 attgggcgaa tcgaatgaaa cgtcttgatg cgctgcgcga cttggtttca cagcacatcg 6120 atcaagcgca agaaaaacaa gcgagagagt acaacaaagg ccggagggta gtgacctttc 6180 aggtgggaag ttgggtggat cggcgcactc atccactttc gggcgcagcg aaaaagttct 6240 ctgcaaagct agtgccgaag tgggaagggc cgttcgtgat agaggaagtg cttggcccga 6300 cggtgtataa gctgggacct gcaccagggc aggatgcgag tcggaggata gcgaaggccg 6360 atatcattgc ggcgacatat accgagcaga gcagctcgga aggcagcgaa cctcgcctga 6420 ttttctttat agttttcagg ggaacgagat gccgtggaag atgccgcgac tctgtggcca 6480 cagcagcccc gacgaagagg aagagagagt tctgctggag gagccacgag agagcgagcc 6540 gcagcagcta ggacaagccg gagcggagac ctcagtaatt ccgcagagat cagccgagga 6600 gagcgttatt atgatgcaac ccggagctgc tcttcctgca ccggacactc gtgaaactgg 6660 ccaggaggcc agtgtaagcg agctggagcc accgacggca gtaaccgtcc tggcagctct 6720 aaaccaggag agcgagttgg agtatgaaga gtgtggtgag ggggaggcct ctgtatcggc 6780 tgatacagtg gcaggagcct ttgagccagc agctgcggag gttgtggcgt cagttgctgc 6840 cgtgactgcg gaggagacgg caaaagggaa tccttcctgg gccgatgagg tggagggcca 6900 gcccagtcag gaagagcccc aagccattca ggaggtcact gagtgcgcga gccaggcgga 6960 gcaggcgcag cacggggaga tgagaccacc gattaagatt ccgcccctgc ctccaatcgt 7020 ttcggctgtg acgttcaacc cgaacttcat gctgccggac gagccggagc ggtatgcgtg 7080 gttacaaccc tctgatgcgt ctgttttatt tcagaggagc ctggccgagc gacatggagc 7140 tatcccgaaa agctggggca cccttcgccg cccccgaacg attctgccgc tgcagaagcc 7200 aacggggaga gctgcagaga aagagacgac gccggagcgt atggagactg aggaagagcc 7260 ggaagatcga agcraggtrg agwtagagag cgaraaggcg aagaaagaat tagagtacct 7320 ccaggcggga actggcagct cgacgagcaa cgggtcgaca ccggggctgc gctctccagt 7380 caaacctatg caggtcgtga cgcccacggc gcagagaagc ggtgcgccca ttccccggat 7440 gatcccgata gacgctgggc aggcggctct agtcagctgg gatgaagact gggataacga 7500 gggccagcag ccaccggcaa gaatagccgc tcaagtgcca gcgagagtag aggaagagga 7560 gccagagccg cctcgtagga agagtcgcaa gaggaagaag gcttacaatc cacgtcgcca 7620 gttgactggg gacgactact tcccgttaca cttggagtca cgggaagctt cgccggagcc 7680 cggagcccca gtgagaccgg gtcagagccg agaagaaggc gcattcgcag gcgcaccact 7740 gagagtagtg aagaagacga gtaggccctc ttcaccgtct gttggaaaat tccgatgccc 7800 aggagaagca gcgtggcgga gatataactt ccccttgagc cgtctaccac cgcgaagagg 7860 aggagtcttg ttcagggatg gccagccggt gccagtagtg ccagccatcg accctccgcc 7920 ttactgctgc tttaactgct gggatcgaaa ccacgcagtt cggagctgcc cagagccgga 7980 gagacgcgac tactgtgcca actgcggtcg gcatggagtt aggattgagg agtgccctcg 8040 ttgtgcggag gagtttaaga gaagagagcg tcagagacag ccggaagata ggagtggttc 8100 tcctcctaga aagctgactt cgagaagtcg gagagacgaa ttagagagag agaggaagag 8160 tgtaagagac gtcagcgaga agagttagag aaggcagcag agagaagaca tcaagagatg 8220 agaagagaag aagaaagacg ggagagagat gaaagatgga gagctgaaga ggaacgtcgc 8280 cagagagaag aagagcagaa ccgcagatat gtggagcagc ggaaggcaga gcgacgcgca 8340 gaagaagctc gacaggaagc tgaagctttg agattggagg agcagcggag agtagagaac 8400 aaccgtctgc gagaggaagc cgctcgtctc ctggaacagc gagaagaaca ggccctgaga 8460 gctgagcttc tgcgccagct aaatgagaga aagcagtcag agatgcagcg agtagcagca 8520 gtgcgagtgg aggaagaagc gagaagggcg tttgagcaac gacggctgag ggaacagcag 8580 gaagagcttc tgagaaccga agagttacgg cgacagattc gggaggcaga actccgaaca 8640 gcggaagagc tgagaaggat tgaaacccag cgtcaagcac tagaggctcg actcctgcag 8700 ccagtccggg gagtcgtgcc ggagagagaa ggaagagtgt tgggaccacc aagagtgccc 8760 cacctgccag cagaaccagc ccaaacaaga gatccagtga gagatgccct ggcgctgctg 8820 gaggggcttc aaggggtcac caatgagact aaggacgcgg tgcttcgccg ggtctacggt 8880 caaccccagt agacgcagga gaagagggat gagaccagcc ggaggggacc agccggaagc 8940 acagagagca gagcaaaggc ccgcaagggc cggtaagtac ctttcttgca tttatagccg 9000 aaggacgatg gtagacggag taatcaggga ttactccggt ctagtgaggg ggat 9054 // ID BEL-21_CQ-LTR repbase; DNA; INV; 323 BP. XX AC AAWU01039820; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-21_CQ_; KW BEL-21_CQ-I; BEL-21_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-323 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 196-196 (2011). XX DR Genome; AAWU01039820; Positions 9336 9658. XX SQ Sequence 323 BP; 83 A; 82 C; 61 G; 97 T; 0 other; tgatcgcgcg agagatcaat tcacaatttt accagcctaa atgtgatcca caaataaatg 60 taattttgtc cctatctgtc cctggccatc tctgatgaga gccagtggtg atcctcgctc 120 tttttgaact ctccccggtt cgcatcccat gtactgtccc ctttagagtt tagttagaaa 180 taaactgccg actacgccag acggttacgc gttttaaaat aaacgttttt ctgttcgcgc 240 aataaaattg ttttagttcg cgtagtttaa aaacggtgtt ttactgcgtc cgaaatctcc 300 cgaaccgacc acacgcgcga aca 323 // ID BEL-76_AA-I repbase; DNA; INV; 5930 BP. XX AC supercont1.33; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-76_AA_; KW BEL-76_AA-LTR; BEL-76_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5930 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.33; Positions 3614609 3608680. XX CC Positions [4977-5561] - Integrase core CC 'CTTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 693..5930 FT /product="BEL-76_AA-I_1p" FT /translation="MERNLEVLVDRRDMLVDKLMRMNDSLDAESVSIHLLK FT LFEENLRRNADEFEKAYCDISLLLPKEQRAEHRQEYTNFEQLHNEVYVKLQ FT GRIAQTELTNKLAELPNASAVAAAPNPVYVQAAASVPHLQAPFPTFNGDPQ FT NWYSFKNLFQSIMSRYTQETPAMKILHLRNALTGEAKDKIDQDIVNNNDYD FT AAWRILEDAYEDKRLILDTHIDAILDCPKVAKENRGKSITKLVEICAKHID FT ALAGHGYPVEGLAELVLVNIVYKKLDKETQEQWELKIGVGELPDFVEFMEF FT LRERGRVLQRTSRFQQHPMQQPSAPPSKQRQAIGQKPQPPLKSFVQTTTEV FT CTCCKEEHPIFRCSTFKNMTLSERKSVVTKGNLCFNCLKAKHRVSECQSES FT RCKVQGCGRKHHSLLHSSDTRATSVPEQSDEVKQAQSTIPTSNQATVTEQQ FT NVAVSLCANVDRSKRQVLLSTAVVMLVGHGDTIIRCRALLDSGSDSNLITE FT KLARKLKLKMDHVNLPVCGLNDIQTQVEYQLSTKFISCINSFASSILDFLV FT VKRITSSLPVIEVDTRSWPLPPGLKLADPQFHTPGEIDLIIGNEIFFDLIK FT EGRHKIGNNSLTLTETELGWVVGGSVPTRKAKPCPRVCQLNRYEEELHKTM FT SKFWEIESVHPESELTAAEAAVEEHFTKTHTRDDKGRYVVRLPFNNQQGQL FT GDSYNNARKGLDKLMMALSKNPSKHDEYSSFLMEYLTLGHMKEVEPINDGG FT YYIPHHAVYKMSSSTTKTRVVFDASAKTTSGLSLNDTLSVGPTVQNDLLSI FT ILKFCTHQVVLTADIPKMYRQVRLHTEDCKYQRILWLDDDSKLKVYELQTV FT TYGVASSPHHATRALVQLASDEGKDFPLASKVIIEDSYIDDFLTGGSSTAE FT VIQIYTELTQLLHRGGFGVHKFCSNSAEVLQAIPVELHEKQVHFEDADINN FT SIKTLGLIWNPMEDYFAFRVPRPVDMNWTKRIVLSEISKLFDPAGFLGPIV FT TTCKLVMQDVWRQGVPWDELLPDELSYRWRLIREQMPAINEMKKQRCLISK FT DAVSIQLHGFSDASMRAYGGVLYIRSVDRDGTINVNLVASKSRVAPLKMQT FT IPKLEMCGAKLLAELTQKVVSSMQVPFDDVILHCDSKIVLCWLKKSPLTLK FT QFVSNRVATTVELTRGYTWRYIPTGHNPADVISRGALPENLLHNELWWSGA FT PELWESQLPEDLSEPFDESELPEVKTNKVLAVVKSEPPINLTRCSNYRRML FT RAWVYIERFINLVVHKVSSTSNITADEMVRSEKNMLLVLQREVFGDLLKLL FT KEGSTKRHPLSNLAPFVGDDGLIRVGGRLKYSVIPYEGKHQVLLPERHHIT FT TILIRKLHEEHFHVGQNGLLAIVRERYWSLRAKAVIKRVTSKCQVCAKQNP FT RPGSQFMGNLPESRVNPSPPFSKVGIDYAGPFMLKTGGRGSRLAKAYVVVF FT VCMAVKAIHFELVSNLSTDNFIAALQRFSSRRGLPTDIHSDNGTCFVGANH FT ELAALRDLFNDQQHQRKLDEFCSLKGIRWHFIPPRSPHFGGIWEAGVKSMK FT HHMKRVVGETKLTYEEMITFLAQTEAILNSRPLCPMSEDPNDMAVLTPSHF FT LINRSAVAFPEPSYEDEKVGRLSRWQHVQAMQQHLWSRWSREYLHHLQSRQ FT KWHDGVKEFKVGSLVLLIDENLPPQQWRRGRIIATHPGDDGAVRVVTVKTS FT SNSFKRAITKVALLPSVEPEASTGGE" XX SQ Sequence 5930 BP; 1645 A; 1414 C; 1504 G; 1367 T; 0 other; ttttttggtc cttcgagccg gatcttcgga gaatttggag tggttcccgg tttccgtcgg 60 aacgccatta gcccggtgct aatttcggat aagtgtgaag tgaactttgg tggagcttag 120 ccaccagctg ttccggacag ccgccatctt ggtgaaattg gtgaacagtt tgtcccaact 180 ggcgaagaaa aatcggtcgg cgttgcgcca ccaagtgtgt gcccgtggca cggaagaagc 240 tcgccatttt ggcaaaacgg agttgtgtgc gtgtgctcgt agcactgaag aaagtggagg 300 ccatcttggc gaacgaaaaa tgttgccagc tgcctgcgtg tgtgtccgtg acaacacgcg 360 tcagtgaaaa agtgtgtgtg tgcccgtggc actgaagcaa tcgtggtagt catcctgcga 420 acgaatcggt tcaaccggtt gaaccgaagc cattttgaca gttaagcctg tcatgtgtga 480 aagtggtttg aaaacaaaca aaacaaaagt ggccatagaa gaggcaaaga aaataattac 540 gaatgagtga ttctgagacg ccccgtggac cgttctcaaa attcatcccg agaagaaaca 600 ttctggacgc tttatcggga ccttaaagca gtgtagtgaa aactatagcg gaacagaaag 660 cagaaaaaca agcccaagaa aggagtcgca tcatggagag aaatttggaa gtgctggtcg 720 atcggcgaga catgctcgtc gataaactca tgcggatgaa tgattcattg gacgctgaat 780 cggtcagtat tcatttgttg aagctgtttg aggagaatct acgccgaaat gctgacgagt 840 ttgaaaaagc gtattgcgac atttctcttc tgctgccaaa agagcaaaga gcagagcatc 900 gacaagagta caccaacttc gagcagctcc acaacgaggt ttatgtcaag cttcaaggga 960 ggatcgctca gacggagttg acgaacaagc tggcggaact gccgaacgca tcagcggtcg 1020 ctgcggcccc aaaccccgtt tacgtccaag ccgcagcatc agttccccat ctgcaggctc 1080 cgtttccaac gtttaatggt gatccgcaga actggtacag ttttaagaat ctgttccaaa 1140 gcatcatgtc gcggtataca caggagaccc ctgccatgaa aatattacac ctacgaaacg 1200 ccctcaccgg cgaagcgaag gataagattg accaggacat tgtaaataac aatgactacg 1260 atgctgcctg gaggatcttg gaagacgcat acgaggacaa gcgattaatt ttggacaccc 1320 acatcgatgc catcctggat tgcccaaaag ttgcaaaaga gaatcgcggc aaatccatca 1380 ccaagctggt cgaaatttgt gcaaaacaca tcgatgcatt ggctggacac ggctatccag 1440 ttgaaggtct ggccgagctg gtgttggtga atatcgtgta caagaagctc gacaaagaaa 1500 cgcaggaaca atgggagctg aaaattggag tcggagagtt gcccgatttc gttgaattta 1560 tggaattcct acgggagcgt ggtcgtgttc tgcagcgcac aagccggttc cagcagcatc 1620 caatgcagca accgtcagct cccccaagta agcaacgtca agcaattgga cagaagccgc 1680 aacccccact caagtcgttc gttcaaacca caacggaagt ttgcacctgc tgcaaagagg 1740 agcacccaat cttccggtgc tcaaccttca aaaacatgac cttgtcggag cgcaagtccg 1800 ttgttactaa agggaattta tgcttcaatt gtctgaaagc gaaacatcga gtgagtgaat 1860 gccaatcgga atcccgatgc aaggtgcagg gttgtggtcg caaacaccac agtcttctgc 1920 attccagcga tacccgtgct acctcagttc cagaacaaag tgatgaagtg aaacaagccc 1980 agtcaacgat tccaaccagc aatcaagcaa ccgttactga gcagcaaaat gttgcggtgt 2040 ccctttgtgc taacgtcgat aggagcaagc gccaagtact actctcaacg gcggtagtga 2100 tgctagttgg acacggcgat accatcatca gatgtcgtgc cttgctagac tcaggatctg 2160 atagcaacct aatcaccgag aagctagcac gcaaattgaa gctgaaaatg gatcacgtga 2220 atctaccagt ttgtggtcta aacgacatac aaacccaagt tgagtatcag ttgtccacga 2280 agttcatttc ctgcattaat tcttttgcct catccatcct agatttcctg gtcgtaaaac 2340 gaatcacatc gagcttgccc gtgattgaag ttgacacccg atcctggccc ttaccaccag 2400 ggctgaagct agctgaccct cagttccaca cccctggaga gattgacctt attatcggga 2460 acgaaatttt cttcgatctg attaaagaag gtcgccataa gatcggcaac aactcgctta 2520 cgctaactga gacggaattg ggatgggtcg ttggcggatc ggtgccgacc aggaaagcca 2580 aaccatgccc gcgtgtgtgc caactaaacc gttatgaaga agagctgcac aaaacgatgt 2640 cgaaattttg ggaaatagaa tccgtccatc cggagtccga gctgacagca gctgaggctg 2700 ccgttgaaga acacttcact aaaacccata ctcgtgatga caaagggcgg tatgtcgtcc 2760 gacttccgtt caacaaccaa cagggtcaac taggcgactc ttacaacaat gcccggaagg 2820 gtttggacaa actcatgatg gcgctgtcca aaaatccatc gaagcacgac gaatattcct 2880 catttttgat ggaatacctc acattaggac acatgaagga agtcgagccc atcaacgacg 2940 gtggctacta tattccgcac catgcagtct ataagatgtc cagctcgact actaaaacaa 3000 gagtagtatt cgatgcatca gcgaagacca cgtcaggttt atcgttgaat gacacgctat 3060 ctgttgggcc cactgtgcaa aacgacctac tctcaatcat attgaaattc tgtacacacc 3120 aagtggtgct cacagcagat atcccaaaaa tgtaccggca ggttagactg cacacggagg 3180 actgcaagta tcaacggatc ctatggctgg atgacgacag caagctaaaa gtgtatgagt 3240 tacaaacggt aacgtacggt gttgccagct caccacacca cgcaacaagg gcactggtac 3300 aattggcatc ggatgaaggc aaggatttcc cactggcgtc gaaggtgatc atcgaggata 3360 gttacattga cgacttttta actggtggtt catcgacggc tgaagttatt caaatctaca 3420 cggagctaac gcagttgcta catcgaggtg gattcggcgt acacaagttt tgttccaata 3480 gcgccgaagt actgcaagcg attccagtcg agctgcacga gaaacaagtt cacttcgaag 3540 atgctgatat aaataactcc atcaagacac ttggactaat atggaacccg atggaggatt 3600 atttcgcctt ccgtgtcccc cggccggttg acatgaactg gaccaagcga attgtgttat 3660 cggagatttc caagctcttc gatcctgctg gatttcttgg ccccatcgtc acaacatgca 3720 aactggttat gcaggatgtt tggcgtcaag gtgtaccgtg ggacgaacta ttaccagacg 3780 agctttcgta caggtggagg cttatccgtg aacagatgcc agcaatcaac gaaatgaaaa 3840 agcagcgatg tttaatctcc aaagatgcag tgtccattca actccatggt ttttcggacg 3900 catcgatgcg tgcctacggc ggcgtgctgt acatccgaag tgtcgacagg gacggcacca 3960 ttaatgtgaa tttggttgcc agcaaatcac gcgttgcgcc tctcaaaatg cagacgattc 4020 ccaaactcga aatgtgcggc gcaaaattgt tggcagagct cacccagaaa gtagtatcat 4080 cgatgcaagt tccatttgat gatgtgatcc tccactgtga ctcaaaaatt gttttatgct 4140 ggctcaaaaa atctccgttg acactgaagc agtttgtgtc gaaccgcgtt gctacaacgg 4200 tagagcttac tcgaggatac acgtggcggt atattccaac cggccacaac cctgctgatg 4260 ttatatcaag gggcgcgctt ccggaaaatc tcctgcacaa cgaactgtgg tggagtggtg 4320 ctcctgaatt atgggaatcc cagcttccag aagatttgtc ggaaccgttc gatgagtcag 4380 agcttccaga ggtcaagacg aacaaagtac tggccgttgt gaaatcagaa ccaccgatca 4440 atctgactag gtgcagcaac taccgaagga tgctgagagc gtgggtgtac atcgagcgtt 4500 ttatcaacct ggtggtgcat aaagtttctt caacatcgaa tattaccgcc gacgaaatgg 4560 ttcgttcaga gaagaacatg ctattggtac tccagcggga ggtatttggc gatcttttga 4620 agctgctcaa ggaagggtct accaagcgac accctctatc caatctggct ccattcgtgg 4680 gtgatgacgg cctgatacga gttggaggcc gtttgaagta ttccgttatt ccctacgaag 4740 gtaagcatca ggtgttactg cctgaacgac atcacattac gacgatcctg atacgaaagc 4800 tgcacgaaga gcatttccat gttggccaaa acggactgct ggccatcgtc cgtgagcgat 4860 attggtcgtt acgagcaaaa gcggtgatca aacgagttac ttcgaagtgt caagtctgtg 4920 cgaagcaaaa tccaaggcct ggtagtcagt tcatggggaa cctaccggaa tcgcgagtca 4980 atccctcacc gccattctcc aaggtgggta tcgattatgc tggaccgttt atgctgaaaa 5040 caggaggaag aggttcaaga ttagcaaaag cgtacgtggt cgtttttgtg tgtatggcgg 5100 taaaggcaat ccatttcgaa ttagtgtcta acctttcgac ggacaacttt atcgccgctt 5160 tgcaacgatt ttcaagccgt cgtggactgc caactgacat ccattctgat aatggcactt 5220 gcttcgtggg tgccaatcat gagctcgctg cattgagaga tttattcaac gatcaacaac 5280 accaacgcaa gctggatgaa ttctgcagtc tcaagggaat cagatggcat ttcattcccc 5340 cgaggagccc tcacttcggg ggcatttggg aagcaggagt gaagtccatg aaacaccata 5400 tgaagcgagt cgtaggcgaa accaaactca cgtacgagga gatgatcaca ttcctggcgc 5460 agacggaggc catccttaac tcacggcctt tgtgtccgat gtccgaagac cccaacgata 5520 tggctgtgct tacaccatcg cattttttga ttaaccggtc cgctgttgct tttccggagc 5580 catcgtatga agatgagaag gtaggacgtc tcagcagatg gcagcatgtg caagccatgc 5640 aacagcactt gtggagtaga tggtcccggg aatatctcca ccatctacag agtcgtcaga 5700 agtggcacga tggagtgaag gagttcaagg taggttcgtt ggtcctgctg atcgacgaga 5760 atctgccccc acagcagtgg cgacgaggtc gaataatcgc tacacacccc ggcgatgatg 5820 gtgctgtcag agtggtgaca gtaaagacgt cgagcaacag tttcaagcga gcaatcacca 5880 aggttgcttt gttaccttct gttgagcctg aagcctcaac ggggggagag 5930 // ID TTAA18B_AP repbase; DNA; INV; 576 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA18B_AP. XX NM TTAA18B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-576 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2083-2083 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 576 BP; 191 A; 88 C; 94 G; 203 T; 0 other; ggggtatgca taccgaccgt ttctaaggag ttcaatatag gcaattttct atttgtgtgc 60 atgcgggtcg tgtaaatgtg tgcgtagtaa gtgatcatta gtgtgagtga ctgtgtataa 120 aataaatagc ggagcgcacc cgcacgtgca cgcacatgtc agtaacgcag cattttacac 180 actaattatt acaaatgtga attttagtga aacactgaaa cttagtttcg tatttccaat 240 aatatagctt actgagtgac attacagtat gaacacctat tatacaaatt gaagagaaga 300 agtttatcta ggtttgtata tacggagtcg attttcgata ttttattttc aatgtccaat 360 atttaactct aaaacttgtg atattttact ttttgaattt gaattattga gtttagtaat 420 taaaatatcg aaaattgact tttatacaga catagattaa cttctcttct acaatattga 480 taataggtgt atttactctc aaaccgctca ataaaccata ttattggaaa tacaaaaata 540 cattttcata gaatttttca atactattca tacccc 576 // ID I-12_AAe repbase; DNA; INV; 5409 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-12_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5409 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1367-1367 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 186..1547 FT /product="I-12_AAe_1p" FT /translation="METDIKDPPDLANPEHSPRARAYPVTAKPPYVVFFRQ FT IEKPLDITTISNDLHKHYRSIQQTRKITMRKIRVVVSERNDANDIVKNRLF FT KQYRVYIPSEEVEIDGVISDSTLSGQYLLENGVGKFRNPNAPSVNVLEVNQ FT LVVPRKEGEDVVYDPSYSFRITFEGTALPDFVEIDHVLIPVRLYVPKVLMC FT TNCKRYGHSAKMCDNKARCGKCAEIHDESTCSFVSPLCIRCNAEHDPSPKS FT CPSYQDHVNMTRRKLLEKSRLSFAELLSTADQETVVLDNLFSPLADLPADD FT LDDDENGNCSYTYVYPKRKKKTANIKRRSENMPPQNQAPQIALTSDQLTSQ FT QSSPNTSKQNKTQQFYRLSQKVPQQTVGATFSIHDKKQFPPLPATQDKPAG FT SSPDQSILSIPQLIRMMCAVFRVSNVWMSVVESLMPLFSLIWEKIIAHMPL FT LAFIAANDA" FT CDS 1540..5232 FT /product="I-12_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MMPSATPQQLSLLQWNAHSILPKIANFRSLINLNQVQ FT IFAISETWLTEEKDFYVPQFNILRSDRDQPYGGVLLGLKHGIEFKRVEIEQ FT PSPIEIVAATINVTSTELTIASIYIPPTINLRYTDLKRIVDTLPAPLMLLG FT DFNSKGCAWGEQIDDSRAKIIYDLTDEFDLVILNTGEITRIACPTQSCSRL FT DLSICSSRIALNCSWKVIDDPSGSDHLPIIVSLQSNTPESASTTNYDLTKH FT IDWNRYRGQLDSIVTEINHSEDPIETHAFLIDSIKQSAIASQTKPIPNGGN FT VRKNPTIWWDWECSKMLTKRSDAFKTFRSSGSTEDFLKYKKAEAESKRLFK FT IKKRGYWKKLIESFDKDTALGSLWAIAKKMRNSSTPRENLNSYSEEWIEQF FT SRKICPQFAPNSKPEIQNSSLGDGNCDLTKDFSLEELESALCLAENNAPGM FT DGIKFALLKNMPLNAKKLLLNLYNAFFKFNVCPSTWYELKVLPILKPGKSA FT SEHESYRPISLLVCDRKAFEKMILVRIEHWAEKQKVLSPSQYGFRRARGTR FT ECLGKFSTAIQLAFAKKENLTAVFLDISGAFDSIRIDLLYQKLVSIGLPRQ FT MSDFLFTLLKFKTMHFIINGCIRETRIGYVGVPQGSCLSPLLYNIYVNDVD FT SCITEDCELIQFADDAVIWISEKDEEIARTHIQSSIRNLERWSADNGLTFS FT SSKTEMMIFSKKHKLPEMKLMLDQNEIKAVESHKYLGIWFDRKCLWGGQIR FT DIVRKCQKRINFLKTICSTWWGAHPSDMITLYKTTILSVMEYGSITFCSAS FT KTHILKLERIQYRAIRICLGSMMSTHTSSLEIMAGIKPLKLRYEELNLRFV FT LKNISSDNQFKKDLAILKNLKPQNYLVSYFNTISEMNATTSTHSCSSHLKI FT DDFLFVPQIDFSLHSELENSSEICRKSLANSLFRNKSENVPPENMFFTDGS FT KSYETVGFGLHSETVSLSYKLESPASIFCAESTAIYKACTLIDELPIGLYM FT ICSDSMSTLKALSSRVLDKNSNSTVLMVKQILRKLHSNGYTIRFLWVPAHC FT GIHGNESADDLAKTGALTGEVFERNIQYDEYLPTIKSYMQTRWQQKWTEDE FT LGRFCYSILPNVSNKPWFIEYDFSRNLIKNMSRLIANHFCLSSHLFRIGIK FT QSNICDCGDGYEDFDHILWHCQRYNEHRRKMLDELQKHGHATPCSIRDILG FT TLNNQILICVNNFINEAKIVL" XX SQ Sequence 5409 BP; 1748 A; 1116 C; 1063 G; 1482 T; 0 other; cattccacgt ttagcttttg acgagaacgg tcgtgttggc aaagcagtgt tcgattgtgg 60 tatagaaagt gaagcctttt gtttcctccg tcgagtgatc gttgaacaaa gctaagtacc 120 attgtttctt tcctatcgag aatttgtagc gtgtgtgagc cagtgaccgc aagtccgttg 180 cggctatgga gacggacata aaagatcctc ctgacttggc aaatccagaa cattcaccga 240 gagccagagc ataccctgtt acagctaagc ctccatatgt agtgttcttc cggcaaatcg 300 aaaaaccatt agacatcaca acaatctcca atgacctcca taagcattac cgctctatac 360 agcagactcg taaaattacc atgaggaaaa ttcgggtagt cgtgtcagaa cgtaacgatg 420 ccaacgacat agtaaagaat cgcttattca agcagtatag agtgtacatc cctagcgaag 480 aagttgagat agacggtgtt atctccgatt ccaccttgag cggccagtac ctattagaaa 540 atggagtcgg aaagttcaga aatcccaatg ctccatcggt gaatgttctg gaggtcaacc 600 aactagtcgt ccctcgcaaa gaaggtgaag atgtcgtata tgatccgagt tactcttttc 660 gcattacttt tgaaggaacg gcccttccag attttgttga aatcgatcac gtgttgatcc 720 ccgttcgatt atatgttcct aaagtgttga tgtgtacgaa ttgcaaacga tatggtcact 780 cggctaaaat gtgcgataac aaagcacgtt gtggcaaatg tgctgaaatt catgacgaat 840 caacgtgctc tttcgtgtct cctttgtgca ttcggtgcaa tgcggagcat gatccatccc 900 cgaagagttg tccctcttat caagaccacg ttaacatgac ccggcgaaaa ttattggaaa 960 agtccagact ttcattcgct gaactgcttt ccaccgcgga tcaagaaaca gttgtcttag 1020 acaatttgtt ctctccactt gcagaccttc cagccgatga tctcgatgat gacgagaatg 1080 gcaactgcag ctacacctac gtctatccaa aacgaaagaa aaagaccgct aatatcaaaa 1140 gaagatccga aaacatgcct ccccaaaacc aagctcctca gatcgcgtta acatcagatc 1200 agttaacctc acaacagtcg tctccaaaca cgtctaagca aaacaaaaca caacagttct 1260 atagactctc tcaaaaagtc cctcagcaaa ccgtcggcgc gacttttagc attcacgaca 1320 agaaacagtt tcctccgctc ccagctacac aagataaacc tgcaggtagc agtccagatc 1380 agtccatttt gtccatacca cagttgattc gtatgatgtg cgcagtattc cgtgtctcca 1440 atgtgtggat gtccgttgtt gaaagcctga tgcccctgtt cagcttaatc tgggaaaaga 1500 taatcgccca tatgcccctc cttgctttca ttgctgctaa tgatgcctag tgctactccg 1560 caacaactaa gcttacttca atggaatgct catagtattc tcccaaaaat agcaaacttc 1620 agatccttaa tcaatttaaa tcaagttcaa atattcgcaa tatcagagac atggttgaca 1680 gaagagaaag atttttatgt accacagttc aatatactta gaagcgacag agaccagccg 1740 tatggcggag tcttgcttgg tttgaagcac gggattgaat ttaaacgagt tgagattgaa 1800 caaccctccc ctatcgagat agtggccgct actattaatg tcacatcaac agagttaacc 1860 attgcctcca tctatattcc tcccactatc aacttaagat acacagatct taagcgtatt 1920 gtagatacac ttcctgcgcc tctgatgctc ttaggtgact ttaattcgaa aggatgcgca 1980 tggggtgagc aaatagatga tagtagagca aagattattt acgatttaac tgatgaattt 2040 gatcttgtaa ttttgaacac tggtgaaata actagaatag cgtgtcccac tcagtcatgt 2100 agtagactag atttgtcaat ttgttcatca agaatagcgt taaactgttc atggaaagta 2160 atcgatgatc catcaggaag cgatcatctt ccgataatcg tatcacttca atctaacact 2220 ccagaatcag cttcaacaac taactatgat ttaaccaaac acatagactg gaataggtat 2280 agaggacaat tagattcgat cgtcaccgaa attaatcact ccgaggatcc gatagagaca 2340 cacgcatttt tgatcgattc cataaaacaa agtgcaattg cgagtcaaac gaaaccaatc 2400 cctaacggag gcaatgttcg gaagaatcct acaatatggt gggattggga atgctccaaa 2460 atgttgacaa aacggtcaga tgcattcaaa acgtttagat catcggggag taccgaggac 2520 tttcttaaat ataaaaaagc cgaggcggaa tctaaaagat tattcaagat taaaaaacgt 2580 gggtactgga aaaaactcat tgaatcgttt gataaagaca ccgctcttgg gtcattgtgg 2640 gctattgcca aaaaaatgcg aaacagtagt acacctaggg aaaatttaaa tagttactct 2700 gaagaatgga tcgaacagtt cagcagaaaa atttgccctc aattcgcccc aaattcgaag 2760 cccgagattc agaactcttc attaggagat ggcaactgtg atttaacgaa ggatttcagt 2820 ttggaagaac tggaatcagc attgtgtctg gcagagaata acgctccggg aatggatggt 2880 atcaagtttg ctttactgaa gaacatgcca ctcaacgcga aaaaactttt attgaatctt 2940 tacaatgcct ttttcaaatt taatgtttgt cctagtacgt ggtacgaact aaaagtcctc 3000 cctatattga aaccaggaaa aagtgcttcc gaacatgaat cttatcggcc aatttcactt 3060 ttagtatgcg acagaaaagc ctttgaaaaa atgatcttgg ttcgtataga acattgggca 3120 gagaaacaaa aagtgttatc tccttctcag tacggtttta gacgtgctcg tggaacgcgt 3180 gaatgcctgg gaaaattttc tactgcgatt caactagcat ttgccaaaaa agaaaatttg 3240 accgctgtgt ttttggacat ttcaggagcc tttgattcta tacgtattga tcttctatac 3300 caaaaactgg tttccatagg actcccaagg cagatgagcg actttctttt tacgctgttg 3360 aagtttaaaa ctatgcactt cataataaac ggatgtatta gagaaacccg gattggatac 3420 gttggagtcc ctcaaggctc atgcttgtcc cccctactct ataatatata cgtgaacgat 3480 gtcgattctt gtatcacgga ggattgtgag ctgatacagt ttgctgatga tgctgtaata 3540 tggatttccg aaaaagacga agaaatagca agaactcaca ttcaaagctc aatccgaaat 3600 cttgagagat ggtcagctga caatggttta accttttcca gtagtaaaac cgaaatgatg 3660 atattctcga aaaagcataa gctgccagaa atgaaattaa tgttggatca aaacgagatc 3720 aaagcggtag aatcacacaa atatttggga atttggtttg atcggaaatg cttatggggt 3780 ggtcaaatta gagatattgt tagaaaatgc cagaaaagga taaacttctt gaaaactatc 3840 tgcagtacgt ggtggggagc ccacccttca gatatgatca cgctgtacaa aacaaccata 3900 ttgtccgtga tggaatatgg tagcattaca ttctgcagcg cttccaaaac acatattctc 3960 aaattagaac gcattcaata tcgagctata aggatctgct taggaagtat gatgtcaacc 4020 catacttcat ctttggaaat catggctgga attaaacctt tgaagctaag gtatgaggaa 4080 ttgaatttaa gatttgttct gaaaaacata tcctcagaca accaatttaa aaaggacctt 4140 gccattttga aaaatctcaa gccacagaat tacctcgttt cgtacttcaa caccatcagt 4200 gagatgaatg ctacaacaag cacacattca tgctcatcac atctcaaaat agatgacttc 4260 ctatttgtac ctcaaattga cttctcttta cattcagagc tagagaacag ttctgaaata 4320 tgccgcaaaa gcttggctaa tagtttattc cggaacaagt ccgaaaacgt accacctgag 4380 aacatgtttt tcaccgatgg atcgaaatct tacgaaaccg taggttttgg attgcacagc 4440 gaaaccgtta gcttgagtta caaacttgaa tctcctgcat cgatcttctg cgcagaatca 4500 actgcaattt acaaagcatg caccctaata gatgaacttc ctataggact ctatatgatc 4560 tgctccgata gtatgagtac acttaaagcg ctttcttcgc gtgttcttga caaaaactca 4620 aacagtactg tgttgatggt gaaacaaatt ctgagaaagc tacactctaa cggatacaca 4680 atacggtttt tgtgggtccc ggcccattgt ggtattcatg gcaatgagtc cgcagatgat 4740 cttgcaaaaa caggagcact aacaggagag gtcttcgaac gtaatatcca atatgacgaa 4800 tacttaccaa caatcaagag ctatatgcaa acaagatggc agcagaaatg gaccgaagat 4860 gaattaggca gattttgtta ctcaatatta ccaaatgtca gcaataagcc gtggtttatc 4920 gaatacgatt ttagtagaaa cctaattaaa aatatgagtc gattgattgc taatcatttt 4980 tgcctttctt ctcatctttt ccgcatagga attaaacagt caaacatttg cgattgcggg 5040 gacggatatg aggacttcga tcatattctt tggcattgcc aaagatataa tgagcatcga 5100 aggaaaatgc tagacgagtt gcagaaacac ggacacgcga caccctgcag tattagggat 5160 attctgggca cattgaacaa ccagattctg atttgtgtga ataactttat aaacgaagcg 5220 aaaatagttt tatagagaac cacagaaaaa aaaacgaaac tcttcaatga taatagttaa 5280 atgctaacac cttcattttt tatatttata tacattttaa ttttaatatt tttgtaatcg 5340 ctactcggcg ccgttatgct gtaattgctt ttgtgcctaa taaaaaaaaa aaaaaaaaaa 5400 aaaaaaaaa 5409 // ID CR1-69_HM repbase; DNA; INV; 3958 BP. XX AC . XX DT 25-DEC-2008 (Rel. 13.12, Created) DT 25-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-69_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3958 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1896-1896 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(47..745,749..3769) FT /product="CR1-69_HM_1p" FT /translation="NLIKILIANMVNVQSTEFNVYKADQDKLIKSLIKTID FT ELKNCNLLLTERLNVLEKAKVVPSTSISWASVVSGNMPPKRSTEQLHLMNA FT VVAEAKEREKRESNVVVFGIEASKKADLLSAREEEKNSILHMMNSINAKIE FT IKQIIKLKSKSDKESPFVIVLNNRNDRNSFLKAAKSLKNSTEFNKVFINPD FT LTVAERYKAKLLREECKQKNLENSEPSLFYYGIRNEKVIKISKQSFYNCKN FT QKYFSHGSLTSLVIPKSLQKNHASSISKQRITAVNNKSIKKVICSKRSNFI FT NSYNHLTTNSFNNFTVYLFNARSLANKLNDFFLYVESNKPALICVCETFFN FT DKLPNSMVCPKNYSIYRKDRVRIGGGVAIYCRNDVKSAIVSITQTDKDIDI FT ICVDLIFGSSKLRLITCYRPPSYSLDDYIYLESMISIISNLCESVPQFVIV FT GDFNLPKIDWVNYVSPIEKCHNLFLTLVNNLGMYQYVLEPTRENNILDLVL FT SNIISLLSNISVECPFSTSDHNSVFFYINTFNYYQPETNSESFYDFKNTDI FT IAFNSYLSKISWDFDFSFVFTTEEYWNIFSNHLRKGIDRFVPIRTIYHNKK FT SRSYPKYIYKMLIRKSFLWKRWSITKTTLHKKAFIQYASKCKKAISTHHKN FT VELKLVKENNLSKFFSYVNKNLNNPKKIYPLKTNSNFTTTNVEIAEVFNKH FT FGSVFTIDDGILPKINTFVSEHIKMNFVDFSAKIVYQKLKNLKPSTSFGPD FT GIPNILLKQLAHIICIPLSYIFESSFTSSSLPKQWLQATVSPIFKKGSTSD FT ANNYRPISLTCTSCRVMESIVSSSIADYLNINNLITPNQHGFLSKRSTCTN FT LLESTNDWNKALDSNLITDVVYIDFQKAFDSVPHPKLLKKLAMYGIIDNLL FT HWISAFLSNRYQRVLVGNSLSNPTCILSGVPQGSVLGPTLFLLFINDLPSI FT VKDLNCSLMLYADDIKLYSSFKDADYSHDLAVAIYKVYLWSKKWQLKIANN FT KCFTHRIAPTSFCKDINYNYQLGDHNLTWSTNPKDLGVTMDSYLNYSNHIS FT NIVRASNIRGYLILKCFKSRDPRVIVKAYTTYIRPILEYCSPVWSPNRTKM FT NNTVERVQRRFTKRISNLNNVSYSSRLSILGLELLEIRRLKQDMITCYKIL FT NNLVCLNKSSFFLFNNYIYTRGHNFKLRKLKCKLDVRKHSFSLRVVNIWNN FT LPSDVVNAKNIKSFKIKINSIDFEKYCSG*" XX SQ Sequence 3958 BP; 1413 A; 683 C; 562 G; 1300 T; 0 other; tacaagttac aatttgtgtt tgtgtttaca agttcgagta aaataaaatt taattaaaat 60 attaattgca aacatggtaa atgttcaatc aactgagttt aatgtatata aagcagatca 120 agataaatta ataaaatcac taatcaaaac aatcgacgag ttaaaaaatt gtaatttact 180 tttaaccgag agactaaatg ttctagagaa agctaaagtt gtaccctcca cgtctatatc 240 ttgggctagt gttgttagtg gaaatatgcc accgaaaagg tcaaccgaac aacttcattt 300 aatgaacgct gttgttgccg aggcaaaaga aagagaaaaa cgggaaagta acgttgtagt 360 gtttggtatc gaagcctcaa aaaaagcaga tttattaagt gctagagaag aagaaaaaaa 420 ttctattctc cacatgatga actctattaa tgcaaagatc gaaattaaac aaataataaa 480 attaaaatca aaatccgaca aagaatcacc ttttgttatt gttcttaaca accgaaatga 540 tagaaattcc tttctaaaag ctgcaaaaag tttaaaaaat tcaaccgagt ttaataaagt 600 gtttatcaat cccgatttaa ctgtagcaga gcgatacaaa gctaagttat tacgcgagga 660 gtgcaaacaa aaaaatttag aaaattcaga accctcactt ttttattatg gcatacgaaa 720 tgaaaaagtc ataaaaatta gcaaatagca atcattttat aattgcaaaa accaaaaata 780 tttcagtcac ggctcattaa cttctttagt tattccaaag tcattacaaa aaaatcatgc 840 ctcaagcatt agtaaacaac gtattactgc tgtaaacaac aaaagcatta aaaaggttat 900 ctgttcaaaa cgtagcaact ttataaactc atacaatcac ttaaccacaa attcgttcaa 960 caattttaca gtttatcttt ttaacgcaag aagtcttgcc aacaaactta acgatttttt 1020 tttatacgtt gagtcaaaca aacctgcctt aatttgtgtt tgcgaaacgt ttttcaatga 1080 taaactcccg aattccatgg tctgtccaaa aaattattcc atttatcgta aagaccgggt 1140 acgaattggc ggtggtgtag ctatatactg cagaaatgac gttaaatccg ccatagtaag 1200 tattactcaa actgacaaag atattgacat catttgtgtt gatttaattt ttgggtctag 1260 taaactccgc ttaattactt gttaccgtcc accgtcctat tctcttgatg attatatcta 1320 tcttgaatca atgatctcga taattagcaa tctctgtgaa tctgttcctc aatttgtcat 1380 tgtcggagat ttcaacttac cgaaaattga ttgggttaac tatgtttcgc caattgagaa 1440 atgtcataat ctttttttaa ctttggtaaa taatcttggt atgtaccaat atgtactgga 1500 gcccactcga gaaaataata ttcttgatct tgttctctca aatatcattt ccctattaag 1560 taatatttcc gttgagtgtc catttagtac aagcgaccat aactcagtct ttttttatat 1620 aaacaccttt aattattatc agcccgaaac aaactctgaa tctttttacg acttcaaaaa 1680 tacagatata attgctttta attcttattt atcaaaaata agttgggact tcgatttttc 1740 ttttgttttt acaactgaag agtattggaa catcttttct aatcatcttc gtaaaggtat 1800 tgatcgtttt gttccaatca gaacaatata ccataacaaa aaaagtcgtt cttatccaaa 1860 atatatttat aaaatgctaa ttcgcaaatc ttttctatgg aaaagatggt caatcacaaa 1920 gaccaccttg cacaaaaagg catttatcca atatgcatca aaatgtaaaa aagccatatc 1980 cacacaccac aaaaatgtgg aactaaaact ggtcaaagaa aataacttaa gtaagttttt 2040 tagttatgta aataaaaatc taaacaaccc caaaaaaata tatcctctaa aaacaaatag 2100 caactttact actaccaatg ttgagatcgc ggaagtgttt aacaaacact ttggaagtgt 2160 ttttaccata gatgacggaa ttttgcctaa aataaatacc tttgttagtg aacatattaa 2220 gatgaatttc gtagattttt cagcaaagat agtttatcaa aaacttaaaa atctaaaacc 2280 tagcacatca tttggaccag acggtatacc aaatatactt ttaaaacagt tagcacatat 2340 aatatgtatt ccgctttcat atatatttga atcaagtttt acgtcaagct cgttaccaaa 2400 acagtggctt caagctacgg tttcccctat ttttaaaaaa ggatctactt cagatgctaa 2460 caattataga cctatctctc tcacatgtac tagctgtcgt gtcatggaaa gtattgttag 2520 ttcaagtatc gctgattatc ttaatataaa taatctaata acacctaatc aacacggctt 2580 tttatccaaa agatctacgt gcacaaatct tctagagtca actaatgact ggaataaagc 2640 tttagacagc aatttaatta ctgacgtggt atacattgac tttcaaaaag catttgactc 2700 tgtacctcat ccaaaacttc tgaagaaact tgcaatgtac ggcataatag ataatctgct 2760 tcattggata tcagcatttc tttctaacag atatcaaaga gttctggttg gaaattcact 2820 atctaatcca acctgcatcc taagtggagt tccacagggt agcgtcttgg gtccaacgtt 2880 attcttgtta tttattaatg atctaccctc tatagtaaaa gatcttaatt gctctctaat 2940 gttatatgct gacgatatca agttatatag ttcatttaag gatgccgatt acagtcacga 3000 tcttgctgta gccatttata aagtttatct ctggtcaaaa aaatggcaat taaaaatagc 3060 caataataag tgctttactc atagaatagc acctacttca ttttgtaaag atataaatta 3120 caactatcaa ttaggcgatc ataatttaac ttggtctaca aacccaaaag atcttggtgt 3180 gaccatggat tcttatttga attacagtaa tcatatttca aatatcgtcc gtgcatcaaa 3240 cattcgagga tatctaatcc ttaaatgttt taaaagtcgt gatccacgtg ttatcgttaa 3300 agcctacact acttatataa gaccgatctt agaatattgc tctccggttt ggtcgccaaa 3360 tcgcactaaa atgaacaata cagtggaaag agtacaaaga cgttttacca agagaatttc 3420 taacctaaat aatgtttcat attcaagtag acttagtatt cttggtttgg aactactaga 3480 aattcgtcgc cttaagcaag acatgattac atgttacaaa attctgaaca acttagtttg 3540 cctaaataaa tcaagtttct ttttatttaa caactatatt tacactcgag gacataattt 3600 taaattacgc aaactaaaat gtaaactcga tgtccgtaaa cactctttct cactcagggt 3660 tgttaatatt tggaataact tgccgtctga cgtcgtgaat gccaaaaata taaagagctt 3720 caaaattaaa attaactcta tcgactttga aaaatactgt tccggctaac tggtttttgt 3780 tttataacta tggtcttata tataattgta atatatattt taaattaact agcatttgta 3840 ttatgtgatt gtagggcacg ttgtcagagt ccttcatatt ttatggacct gcgtgtcctt 3900 tagaacatgt atcactgttc taataaattt taatctaatc taatctaatc taatctaa 3958 // ID BEL-89_AA-I repbase; DNA; INV; 5581 BP. XX AC supercont1.336; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-89_AA_; KW BEL-89_AA-LTR; BEL-89_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5581 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.336; Positions 1050016 1055596. XX CC Positions [4590-5174] - Integrase core CC 'TACAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 210..5579 FT /product="BEL-89_AA-I_1p" FT /translation="MSENEFRGFDENEMASEGTGKVASQIPAVTDPAIAAQ FT LKSLNRQRNLAMNKVLRIYENVNVRLPIDSANLKVYASKLQSAYDEYSKYH FT SEIIAIVPDDAVSAQEDEYIRFEDYYQHTSVAVESRLMRQPANQVAPPAQV FT IVHQQPLKAPIPTFDGEYTKWPKFKAMFQDVMAQSRDSDAIKLYHLDKALV FT GAAAGVLDAKTINEGNYTQAWTLLSERYENKRVIVETHIRGLLSLKKMATE FT SHKELRSLLDECINHVESLEYLKQEVTGVSELMIVYLLTAALDKSTRKQWE FT QTLKPGELPQYKSTMAFLKSQCQVLERCESAYPQITTKPVPKQQSSIPKVT FT SQRSHAAMTSSENFIEKCDFCAAPHRNYQCNKLNSLTSAEKFEKVRSAGIC FT FNCLRKGHRSNACPSSKSCQKCNKRHHTQLHDDESKPKPESTTSNVATEPK FT PEGQGQSSTAPVPLAIPSSEANVTTTCHVNNAHAPKTVLLLTAVILVTDSN FT NQIHQCRALLDSGSQANFITESMANTLGMEKKRANVPISGINNVRSLARDK FT VEVQFQSRCSDFRASLECLVTPKVTGTIPTTDIDVTSWVLPDGIQLADPLF FT FKTDKVDMLIGAELFFALMTPGLIRLDDDLPELRNSHLGWLVTGAYRPVSN FT DEAIQYSHVASLDSVEGMIRRFWETEEVPNAATISSEEQQCEDHFSSTYSR FT DNTGRFVVRLPLKENADQIDSCRNLALRRFFMLENRLQRNPELKDQYVEFI FT REYLHLGHCREVDIANDTPNLKPYYLPHHAVLRPGSSSTKCRVVFDASAKS FT APTNLSLNEALLVGPAVQSDILSIMIRFRHNRYVFTADIKKMYRQIVVHPD FT DTHKQRIFWREHPNEALKILELMTVTYGTASAPFQATRSLIQLANDEAENF FT PVAAAIIKFDCYVDDVLSGTATIDEAIEAQRQLKEMLARGGFPIHKWCSNA FT PQLLALIPEDERESLKPLADRHVNEVIKVLGLLWEPTSDELLIAECSKPEN FT ERDQPATKRIIYSEVAKHFDPLGLFSPSILVAKLLVQRLWQCKLDWDEPVD FT ARTQLEWNELKDALPKLLLIKIPRQVTFHGAIYYELHGFADASNVAYGACV FT YIQSTLEDGSVKSRLLISKSKVAPNHPLTIPRKELCAALLLVRLVCKVLPA FT LTIPIRRVNLYSDSEIVLSWLKKHPYQLQTFVCNRINEIQVNSEGFSWNYV FT RSQHNPADIVSRGLLPSELMVSELFWTGGEYINSPVVSTGAANEIPDGELP FT ELKANLVSMPALIEEPLEIFETCSSFRRLQRTIAWVLRFCNNTRKPKEDRI FT TSSHLNVQELRSSMIVIVRVIQHVELGDEILRLKTKTPCKRIGNLNPILTD FT DGVLRVGGRLKHSNLPIESKHQLILPNASPVTRCLIRDMHQELLHVGPAGL FT LSAIRRRFWLLRARSTIRQVTGSCVKCFRANPTDITQLMGDLPKQRVTPSP FT AFNITGVDYAGPILVKQGTHRAKVVKAYIAVFVCMATKAIHLELVSDLTTD FT AFLAALQRFVSRRGIVSEMHSDNATNFHGTNNELHKLYEMFRNQPDVDRIL FT HFCHVKEIEWHFIPPDSPEFGGLWEAAVKCTKTHLKRVIGDKTLNFEEMST FT ILCEIEAVLNSRPLFAISGDPADPEVITPAHFLIGRPMTAIPEPSYQDLNI FT GRLDRWQHLQLLREQFWRAWSRDYLSSLQPRKKNWNTSANVRPGMIVLLKD FT KNRPPLQWKLGRITSVHPGHDGLVRVVEVFSEGKTFTRSISKLSILPIEEN FT QGQPDQREKVEPRRFNPGD" XX SQ Sequence 5581 BP; 1568 A; 1507 C; 1323 G; 1183 T; 0 other; aatggtccga atcgaaccgg attggacttc ccgagtgcgt gccgccccga aaccgattcc 60 cgaaacgtga tcctgctgag gcagaacagt gcaaaaagtg aattcctgct gcggcagaaa 120 agtgcgaaat tcctgctgag gcagaacagt gctaaaaaga agattcctgc tgaggcagaa 180 cagtgagaaa aaaaaaggac gaaaacaaaa tgtcggagaa cgaattccgt ggtttcgacg 240 aaaacgaaat ggcttccgaa ggaactggca aagttgccag ccaaataccc gccgtaacag 300 atcccgcaat agcagctcag ctcaaatcgc taaatcgcca gcgcaatctg gcaatgaaca 360 aggttcttcg catctacgaa aacgtcaacg tccggcttcc gatcgattca gctaacttga 420 aagtgtacgc atcgaaacta caatctgcgt atgacgaata ctcgaaatac cacagcgaaa 480 ttatcgccat agtcccggac gatgcggtga gtgcgcagga ggacgagtac attcgattcg 540 aggattatta tcaacacact tccgtcgcag tcgaatctcg cctcatgcgg caacctgcca 600 atcaggtggc ccctccagcg caggtcatcg ttcaccagca gccattgaaa gctccgatcc 660 ccacttttga cggtgagtac accaagtggc caaaattcaa agccatgttt caagacgtaa 720 tggcacaatc ccgcgactcg gatgcgataa agctctacca cctcgacaag gcgctggtgg 780 gagcagcggc aggagtattg gatgcgaaga ccataaacga gggcaattac acgcaagcat 840 ggacacttct atccgaacgc tacgagaaca aacgagtgat tgttgagact cacatccgtg 900 gtctactttc actcaagaaa atggcaaccg aatcccacaa agagctgcga tcccttctag 960 atgagtgcat caaccacgtc gaaagtctcg agtacctaaa gcaggaagta actggtgtgt 1020 ccgaactgat gatcgtctac ctactaactg cagctctgga taaatcgacc cgaaagcaat 1080 gggagcagac tctgaagcca ggcgagttgc cacaatacaa atccacgatg gccttcttga 1140 agtcgcagtg tcaagtactg gagaggtgcg aatctgcata cccgcagata acaacgaaac 1200 ccgttccgaa acagcagagt tcgataccga aggtcaccag tcagcggtca catgcagcaa 1260 tgacaagctc cgagaatttc atcgagaagt gcgatttctg cgcagccccg catcgaaatt 1320 atcagtgcaa caagttgaat tcgctcacca gcgctgaaaa gttcgagaag gttcgatcgg 1380 caggcatttg tttcaattgc ctccgaaagg gtcatcgttc caatgcttgc ccatcttcga 1440 aatcctgcca gaagtgtaac aagcgacacc acacacaact gcacgatgat gaatcaaaac 1500 cgaaaccgga atccacgacc agcaacgtag caactgaacc gaaaccagaa ggtcaaggac 1560 aatcgtctac agcgccagtc cctctggcaa tcccatcgtc agaagcaaac gtcacgacaa 1620 cttgccacgt gaacaacgcc cacgctccga agacagtgct actgctaaca gcggtgatcc 1680 ttgtcaccga cagtaacaac caaatccacc agtgtcgagc cctactcgat agtggatcgc 1740 aagcgaactt catcaccgaa agcatggcca acacccttgg catggagaag aagcgagcca 1800 acgtaccgat ttccggcatc aacaacgtca gaagtctagc acgagacaag gtggaagtgc 1860 aattccaatc tcgatgcagc gactttcgtg cctcgctgga gtgcctggtg actcccaaag 1920 tcactggaac catcccgaca accgacatcg atgttaccag ctgggtgcta ccagacggta 1980 ttcagttggc cgatcctttg ttcttcaaaa cggacaaggt tgatatgctg atcggcgcag 2040 agttattttt tgcgctaatg actccaggac tcatcagact cgacgacgat ttgccagaac 2100 tccgcaactc acacctggga tggctggtca ctggtgccta ccggccagta agcaacgatg 2160 aagcgatcca gtattcccat gtagcatctc tcgattctgt cgaaggaatg attcgacggt 2220 tctgggagac cgaagaagtt ccaaacgcag ccaccatatc atccgaggag cagcaatgtg 2280 aagaccattt ttcgtccacc tactcacgag acaacacagg acgattcgtg gttcgattac 2340 cgcttaaaga gaacgccgac cagatagaca gctgccgaaa cctagcattg cgccgttttt 2400 tcatgctaga aaatcggctc cagcgaaatc ccgaactgaa agaccagtat gtggaattca 2460 tacgagagta tctgcatctt ggtcattgcc gagaagtgga tatagccaac gacactccga 2520 acttgaagcc atactacttg ccacaccacg cagtactacg cccaggcagc tcgtcaacca 2580 agtgtcgagt tgtcttcgac gcaagcgcca aatctgctcc gacgaatctg tccctcaacg 2640 aagccctgct ggtgggacca gcagtccaaa gcgacatcct ctcgatcatg attcgattcc 2700 gtcataatag atacgtcttc acggcggaca taaagaaaat gtatcgccaa atcgttgttc 2760 atcccgacga cacgcataag cagcgcattt tttggaggga gcatccgaat gaagccttga 2820 aaatactcga actgatgacc gttacgtacg gtacagcttc cgcacccttt caagcaactc 2880 gaagcctgat acaactcgcc aacgacgaag ctgaaaattt tcctgtagca gcggcaataa 2940 tcaaattcga ctgttacgtt gacgacgttt tgtcagggac tgctacgatc gacgaagcca 3000 tcgaagcaca gcgccaactg aaggagatgt tagcacgtgg tgggttccca atccacaaat 3060 ggtgctccaa cgcccctcaa ctgctagcac tcatcccaga ggacgaacga gaatcgttga 3120 agccgcttgc agatcgtcat gtcaacgaag tgatcaaagt tctcggcctg ctgtgggagc 3180 cgacctccga cgagcttttg attgctgaat gttccaagcc agaaaatgaa cgagaccaac 3240 ccgccacgaa gagaatcatt tattcagagg ttgccaaaca cttcgaccca ctaggactat 3300 tttcaccctc cattctggtt gcaaagctgc ttgtacaacg cttgtggcaa tgcaaactgg 3360 attgggacga acctgtggac gcaagaactc aactcgaatg gaacgaactc aaggatgcgc 3420 tccccaaact gctgctaatc aaaattcccc gccaagtgac gttccatggc gcaatctact 3480 acgagctaca tggatttgcg gacgcttcca atgtggcata cggagcgtgc gtgtacatcc 3540 aaagcacact ggaagacgga tcggtcaagt ctcgattact aatcagcaag tccaaagtgg 3600 caccgaatca cccactcaca ataccacgca aggaattatg tgctgcactt ttgctcgtta 3660 ggctggtgtg caaagtgcta cccgccctca ccattccgat tcgaagagtt aatttgtatt 3720 ccgacagcga gatcgtctta tcgtggctaa agaagcatcc ttatcagcta cagacgttcg 3780 tatgcaaccg catcaacgag attcaggtca actcggaagg gttttcatgg aactacgttc 3840 gctcacaaca caacccagca gacatcgttt ctcgtgggct acttcctagc gagctcatgg 3900 taagcgagct gttttggaca ggcggtgagt atatcaattc acctgtcgtc agtaccggag 3960 cggcaaacga aattcctgac ggcgaacttc cggaattgaa agccaacctg gtatcaatgc 4020 cggcgctcat cgaagagccg ctggaaatct tcgagacctg cagttctttt cgtcgactcc 4080 aacgaacgat cgcatgggtt ttgcggttct gtaacaacac ccgcaaaccg aaggaggatc 4140 gcatcaccag tagtcatctc aacgttcaag aacttcgaag ctcgatgatc gttattgtaa 4200 gagtaattca gcacgtcgag cttggagacg aaatccttcg attgaaaacg aaaacgcctt 4260 gcaaacgaat tggaaatctt aatccgattc tcaccgacga cggcgtgcta agggtcggcg 4320 gacgactcaa gcactccaac ttgccaatcg aatcgaagca ccaactgatt ctcccgaatg 4380 caagcccggt aacacgctgc ttgatccgtg atatgcacca agagcttctt cacgttggac 4440 ctgctggact tctttccgca atcagacgtc gtttttggtt gctacgcgcc cgatcaacaa 4500 ttcgacaggt aactggatcg tgtgtgaaat gctttcgcgc aaatccaacc gacattacgc 4560 agctcatggg ggacctacca aagcaacgtg taacaccgtc tcccgccttc aacatcaccg 4620 gcgtggacta tgccggtccg attttggtca agcagggtac tcatcgagcc aaggtggtca 4680 aagcttatat agctgtgttc gtttgcatgg ccacgaaagc catacacttg gaactggtgt 4740 ccgacctgac gaccgacgct ttccttgctg cgctccaacg ctttgtgagc cgtaggggaa 4800 tagtatcgga gatgcattcc gataatgcaa caaattttca tggcacgaac aacgagctcc 4860 ataagctgta cgagatgttc cgaaaccagc ccgatgtcga tcgaatcctg catttttgcc 4920 acgtgaagga aatcgagtgg cacttcattc cgccagattc gcccgagttc ggcggcttgt 4980 gggaggctgc agtgaagtgc acgaagacac acctgaaacg agtgataggc gataagacgt 5040 taaatttcga agaaatgtct acgatactct gcgaaattga agcagttctc aactcgaggc 5100 cgctttttgc aatatcggga gatccagctg atcccgaagt aattactcca gctcatttcc 5160 tgataggacg ccctatgact gcgatacccg agccatcata ccaggacctc aatatcggcc 5220 ggcttgaccg ctggcagcat cttcaactgc ttcgagagca gttttggagg gcctggtccc 5280 gtgattacct gagcagccta caacccagga agaagaattg gaacacctcc gccaacgttc 5340 gaccaggaat gatcgtcctg cttaaggaca agaatcgacc acctctacag tggaagctcg 5400 gccgcatcac atctgtccat cccggtcacg atggtctggt cagagttgtg gaagttttca 5460 gcgaaggtaa gacttttacg cgatcgattt cgaaactgtc aatcttgcca atcgaggaaa 5520 accaaggcca acctgatcaa cgggagaaag ttgaaccgcg acgcttcaac ccgggggatg 5580 a 5581 // ID Gypsy-602_AA-I repbase; DNA; INV; 4508 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-602_AA_; KW Gypsy-602_AA-LTR; Ty3_gypsy_Ele77; Gypsy-602_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4508 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3459-3923] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 3240..4496 FT /product="Gypsy-602_AA-I_3p" FT /translation="MYSDRIVVPKKLQQKVLGQLHRGHPGVERMRSLARNF FT VYWPNIDDHITALVRTRQECASVAKAETKTKLESWPIPEKPWQRVHADFAG FT PINDTYFLLVVDSFSKWPEIIPTKRITTAATISSLRKIFGRFGMPEVLVTD FT NGPQLTSDTFEEFCEANGIMHLKTAPFHPQSNGQAERFVDTFKRTVKKIQA FT GGEGLDEALDIFLTCYRSTPCRSAPGGKSPAEILIGRPLRTSLELLRPPSK FT FTKATNNKQDRQFNEKHGAKEKSFDVQDKVYAQVHQGNNWSWIAGEIVERV FT GRVMYNVWLPERQRLIRSHSNQLRKRYNDVNQTPGEVEPSIPLDILLGAWG FT LNQPEDSLAASVEPPPGEPEGFDQMQREFLRELFEPASQPRRARMPTRATD FT LDEALPRRSSRQRHTPVRYEPYQLY" FT CDS join(186..1610,1614..3188) FT /product="Gypsy-602_AA-I_1p" FT /translation="MPTPLPLGPAAQAAGSNADQPFKETILQLLNNQHTLM FT TRMAEQMANIQGNVQNTSRNELVLDSLASNITEFAYDLESGCSFDAWFSRY FT ADLFEKDASKLDDDAKVRLLLRKLNPAAHERYTSFILPKLSREFSFDETVA FT KLKTIFGSPVSTFHRRYQCLQTAKDENEDFISYSCKVNRACVDFKLQELKE FT DQFKCLIFVCGLKSPKDADIRMRLLSKINETQDITLEKVVEECKSLINLKK FT DTVLIGSQSASTAGAAATHAVRANSPNGNRRKKDRFGGSKSDTPKTPCWSC FT GGMHFSSQCSFKSHKCRDCGRTGHKEGYCSCFASKPSSKKNKGKQQNKNHA FT AKIVTVKNVNRSRRYVETAINGVPVDLQLDSGSDITIISKQNWMKVGAPQT FT SQPDCHVQTASGDRLGIEAMFRASYTIGGTQKEGNCYVCCADLSLNVLGSD FT LMDEFGLWDVPFSSFCKLVSSPQPNQQVLEKEKFPDVFTNRMGLCTKTQVH FT LTLKPDAHPVFKPKRPVSYNMEAVVEDELKRLESSGIITPVTYADWAAPIV FT VVRKPDRTVRICADFSTGLNSALESNSYPLPLPEDIFNRMAQCTMFSHIDL FT SDAYLQVEVDEESKKLVTINTHKGLYRFNRLSPGVKSAPGAFQQIMDAMLS FT GIPCTCPYLDDILIGGRNAEEHKRNLCLVLQRLQEYGFTVKLEKCRFFMRQ FT VKYLGQLLDSEGTRPDPDKVKAIVNMPPPHDVSTLRSYLGAVNYYGKYIRE FT MRTLRQPLDELLKEGSSFQWSDACQRSFDRFKEILQSPLMLTHYNPRLEIV FT VSADASNVGIGARIAHRFPDGQEKAIYHASRSLTPAESRYSQIEKEALGLV FT YAVTKFHRMIYGRQFVLQTDHKPLLAIFGSKRGIPPYTANRLQRWALTMLL FT YDFRIEYISTDHFGHADILSRLINSHVKPDEDFIIATIEVETVICNIVCQS FT IEHLPISYKTIAAETSQDRTLQKVMEFVQKGWPSDAKSLSGTPEV" XX SQ Sequence 4508 BP; 1193 A; 1223 C; 1099 G; 991 T; 2 other; ttttaattcg tttattgtat ttagaagtac cgttaataaa tcatatttgt ttgttacgtt 60 aatcactctc ggttcactct cacatcgtaa caaagtggcg acgagtcggt aaaaggtcgc 120 accgattgtt tttctcgcga aaatcgtcgt aaaaacgtcg cgtcgtttgc agctttgtat 180 tcgccatgcc aacgccattg ccactcggtc cagcagcgca ggcagctggt tccaatgcgg 240 atcaaccatt caaggaaacc atcctgcagc tgctcaacaa tcagcatacc ttgatgaccc 300 gaatggcaga gcagatggcg aatatacaag gcaacgtcca aaacaccagt cggaacgagc 360 tggtcctaga ttcccttgcg agcaatatca ccgagtttgc ttacgatcta gagagcggtt 420 gctctttcga cgcatggttc tccagatatg cggacctttt cgaaaaagac gcttccaaac 480 tggacgacga cgcaaaggtg cgactgctgc ttcgcaaact caatcccgcc gcacacgaaa 540 ggtacacttc gttcattctc cccaagctgt cgagggaatt ttccttcgac gaaacggttg 600 ccaaactaaa aaccatcttt ggtagccctg tttccacctt ccatcgccgg taccaatgcc 660 tgcagacggc gaaggatgag aacgaagatt tcatctccta ctcctgcaaa gtcaaccgtg 720 cttgtgtcga cttcaaacta caggagctca aagaggacca gttcaagtgc cttattttcg 780 tgtgcggatt gaagtcaccc aaggacgctg acattcggat gcgcctgcta tcgaagatca 840 acgaaacgca ggacataacg ctggagaagg tagtggagga atgcaaaagt ttaatcaact 900 tgaagaaaga caccgtactc atcggcagtc agtctgcctc caccgccggt gctgctgcaa 960 ctcatgctgt tcgtgccaat tctcccaacg gcaaccgccg taagaaggac aggtttggtg 1020 gcagcaaatc cgacacgccg aagacgccgt gttggtcatg cggcggtatg catttctcca 1080 gccagtgcag tttcaaaagc cacaaatgcc gcgactgtgg ccgcaccggc cataaagaag 1140 gctattgttc ctgtttcgca tccaagccaa gctccaagaa gaacaagggg aagcagcaaa 1200 acaaaaacca cgcagccaag atcgtgaccg tgaaaaatgt gaatcgcagc cggcgttacg 1260 tggagacggc aatcaacggt gttccagtcg atttgcaact ggactctggc tcggacataa 1320 cgatcatctc taagcagaac tggatgaagg tcggagctcc tcaaacgtct cagccggatt 1380 gtcatgtgca gacggcttcc ggtgacagac tgggcatcga ggccatgttt cgagcatcgt 1440 acactatcgg cgggactcaa aaggaaggta attgctacgt ttgttgtgct gacctttctc 1500 tgaatgtttt aggctcagac ctcatggatg agtttgggct atgggacgtt ccgttttcgt 1560 ccttctgcaa gctggtcagc agcccgcaac caaatcagca agtgctcgaa mtgaaggaaa 1620 agttccccga tgttttcacc aaccgtatgg gactgtgcac caaaacacag gtgcacctca 1680 cgctcaagcc ggacgctcat cccgtattca aacccaagcg gccagtttct tataacatgg 1740 aggccgttgt tgaggacgag ttgaagcgcc tagagagttc gggcatcatc acaccagtca 1800 catacgcgga ttgggcagca ccaatcgtcg tagtccgcaa gcccgatcgc actgtccgca 1860 tttgtgcgga cttttccacc gggctaaaca gcgcgctcga atcgaacagc tacccactcc 1920 cgctcccgga ggacattttc aaccggatgg cgcaatgcac gatgttcagc cacatagatc 1980 tgtcggacgc gtatcttcag gttgaggtcg acgaagaaag caagaaactt gtcaccatca 2040 acacccataa agggctctac cgtttcaatc ggctttcccc cggcgtcaaa agtgcacctg 2100 gcgctttcca gcagattatg gacgccatgc tgagtggtat cccgtgcaca tgtccatatc 2160 tcgatgatat cctgatcggt ggacgaaacg ccgaggaaca taaacggaac ctgtgccttg 2220 ttctacaacg cctacaagag tacggtttca cggtcaagct cgagaagtgt agatttttca 2280 tgcgccaagt caagtacctg ggtcaacttc tggattcgga gggcacccgc cccgatccgg 2340 acaaggtgaa ggcaatcgtc aacatgcctc ctccacacga tgtctccaca ctgcgatcgt 2400 acctgggtgc ggtaaactat tatggaaaat acatccgaga aatgcgcaca ttacgacagc 2460 cgctcgacga gctgctgaag gagggttcca gcttccagtg gtccgatgct tgccaacgtt 2520 cctttgaccg ctttaaggaa attcttcaat caccgctcat gctgacgcac tacaatcctc 2580 gcttggaaat agttgtgtcg gcagatgcgt cgaatgtggg cataggcgcc cgcatagcac 2640 accggtttcc tgacggacaa gagaaggcca tctaccacgc atcccgaagc ttaactcctg 2700 ccgaatcccg ctacagccag atagaaaagg aggcccttgg tctggtgtac gcggtcacca 2760 aattccatag gatgatctac ggaaggcaat ttgttctcca aaccgatcac aaaccgctgc 2820 tagcgatttt cggttcgaag cgtggtattc cgccgtacac agccaaccgc cttcagaggt 2880 gggccctcac catgctccta tatgacttcc ggatcgagta catctcaaca gaccacttcg 2940 ggcacgcaga cattctctct cgcctaatca actcccatgt taagccagac gaggatttta 3000 tcattgctac gatcgaggtc gaaacagtca tctgcaacat cgtctgccaa tccatcgagc 3060 atctcccgat ctcgtacaag acgatagccg ccgagaccag ccaggatcgg acgttgcaga 3120 aagtcatgga gtttgtgcag aagggttggc cgagcgacgc aaaatctctc tccggaacac 3180 cggaggttma tcaattcttt gcgcgtcgtg agtcgttgta cgtggcccag aaagttctca 3240 tgtacagcga tcggattgtg gtaccgaaga agctacagca gaaagtactc ggacaactac 3300 acagaggtca tcccggcgtt gaacgcatgc gatcgttggc acggaatttc gtgtattggc 3360 ccaacatcga tgaccacatc accgctttgg tacgcactcg ccaagaatgt gcctcggtcg 3420 ccaaagctga aactaaaacg aaactggaat cgtggccaat ccccgaaaaa ccttggcaaa 3480 gagtgcacgc agatttcgct ggccctatca acgacacgta cttcctgcta gtagtagatt 3540 cattttcaaa gtggccggaa atcattccga ctaaacgtat cactaccgct gctacaattt 3600 ccagtcttcg gaaaattttt gggaggttcg gaatgccgga ggttctggta accgacaatg 3660 ggccacagct aaccagcgat actttcgaag agttctgtga agcaaatggg attatgcatc 3720 tcaaaacagc accgtttcac ccgcaaagca acggtcaggc ggaacgcttc gttgacacct 3780 ttaaaaggac cgtcaagaaa atccaagcgg gaggggaagg cttggacgaa gcactcgaca 3840 tcttcctcac ctgctaccga tcaactccct gtcggagtgc acctggtggg aaatcgcccg 3900 ctgaaatcct tattggtcgt ccactgcgca cgtctctgga actccttcgt ccgcctagca 3960 agttcaccaa ggctaccaac aacaagcaag atcgtcagtt caacgagaaa catggtgcca 4020 aagaaaagag cttcgatgtt caagacaagg tgtacgctca agtgcaccaa ggcaacaatt 4080 ggagctggat tgctggagaa attgtggagc gtgtcggacg agtgatgtac aatgtgtggc 4140 ttccggagcg acaacgtcta atccgctctc acagtaatca acttcgtaaa cgctacaacg 4200 acgtcaatca aacaccaggt gaagtcgaac catccatacc gttggatata ctgctaggcg 4260 catggggtct caatcaacct gaagactccc ttgctgctag tgtagaacca ccgccaggag 4320 aaccggaagg attcgaccaa atgcaacgag agttcttacg agagcttttt gaaccagcaa 4380 gtcaaccacg tcgagctcgg atgccaacaa gagcaactga tttagatgaa gcgctcccac 4440 gtcgctcttc aagacagcgg catacaccgg tgcgctacga accgtaccag ctctactaaa 4500 aggggagg 4508 // ID Dbuski1cons repbase; DNA; INV; 492 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mauritiana DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dbuski1cons. XX OS Drosophila busckii OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. XX RN [1] RP 1-492 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones that show less than eight percent divergence. CC Dbusck1cons. XX SQ Sequence 492 BP; 161 A; 126 C; 118 G; 87 T; 0 other; tgggtaccgc acgagctgaa accaagagac attgaaaggc gattatgcat tgctgaacaa 60 ctgcttgcaa gacaacaaag aaagggtttt ttgcaccgaa ttgtgacagg ggatgaaaag 120 tggatccatt acaacaacga aacccggcgc aaatcttggg gtaagcccgg tcacaaagca 180 gtatccactc cgaaacccaa tttccatgga accaaggtta tgctctgtgt ttggtgggac 240 cagctaggcc cgatccacta cgaattgctg aaacagggcc agactatcaa cggggagctc 300 taccgacaac aattgagccg tctgagccgg gcactcaaag aaaaaaggcc acaattcgaa 360 gaaaggcacg acaaagtcat tcttcagcaa gacaatgcaa gaccacgcac cagcagagtg 420 gtcaaagatt acctcaacga gctgaaatgg gagattttgc cccacccgcc atacacgcca 480 gacttagcac ca 492 // ID Gypsy-1_NG-I repbase; DNA; INV; 11818 BP. XX AC ADAO01190636; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from parasitoid wasps: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_NG_; KW Gypsy-1_NG-LTR; Gypsy-1_NG-I. XX OS Nasonia giraulti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-11818 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from parasitoid wasps."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ADAO01190636; Positions 290 12107. XX CC Positions [7897-8373] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3768..7505 FT /product="Gypsy-1_NG-I_4p" FT /translation="MLKRRGKFYADSGADISVIKIEELAPGYPIDTDKIVK FT IQGVTEGTAFTLGQAEIQINGLPCSVQVVKNDFPIENAGIIGWDVIDKHNG FT CVDALSKSLKLGATHLPFEADEKITIPPRVKMAISAPLKNHDVKVGWVPLY FT DLHPDLLFGNFVSENRNGRVHGECINVSDSEITISAPMVELLECEEIANPL FT YQADEDGSAENVATFTASLRRLFGTENPSEKYREVSNHNRELQSNAEKRRA FT RVDKIMSLADLEGCSEAEKNIIREIVDEFAGVFGLEGEPLPATHLLQHKIM FT LKSNRPIRNHRFRFPPAIKEHMIRELAKLREQNIVVPSNSNYSSSLWIVPK FT KPDASGNKRFRLVTDFRGLNEETEGSCHPLPFTSDILEHLATANFITVMDL FT KQGYHQIEMHPDSAHLTAFYAPDGNHGNQLLQFNRMAMGLKEATITFTRAM FT SLAMKGLQGDEVEIYLDDLMVFGGTFDEHNKRLRRVLKRLLEANMTVEPKK FT CQFLKKEAQVLGHIVGGGLIKTDPAKTHAMANYPTPTDAKKLKQALGLFSY FT YRRFIPDFSKIAYPLFKLLQKDIKFQWGKAEKTAFENLRKLMAEEPVLKAP FT VLSEPFIVTTDASDYALGAILSQGKIGVDHPCAYASRCLKGSELRYPTYDK FT ELLAVVFAKEQFKPYLYGRRWTLVTDHEPLKHYHTSKKPDLRFNRLKAALN FT GYDFDVVYRPGIKNSNADALSRNPVLAEGEVNPDLPRYELYELADQQVKEN FT PDEEAGAPPGRVFRARAVKKQNQKAGKKERVSESSSSDASNRGLSKKPRQD FT NALIYERNECVAVRNEFNGYYICRALHDVHQGDEKIAVQWFTETKNLKHVY FT APDYRDAVWFDSILTSASLKRLDRNHYLLGAAERTRIDAILREAIEFIADA FT PAPKPPEISAVAAEFSDSDQSMKSVASALSSSSGFSLLPPELRPVAKYVWP FT SERKTRGSEKTKMTVSRKSRELGNSSSEESIRISPSGKIANVIDEPQIFSF FT PQRVFSRSPSPPQSPQTRVTPEPPVMTKSPLRPTAAEFQMPASTQASQPRD FT SSNAPRAESALAEPPGPKSGSTWPWSGNKTKKKLRVNIEGKDFDGKGGVRI FT ECWELNSSVAPQDKIIPARVENSQQRTRARELSARKDNAMAISKSPVSSEK FT ETAAGNASAPSGKNSSKNGRNESGKKAPETGSLTPAVVSPPKSAACMTHPA FT MIPSPPSSVGPNEDFAGILSKITSKIGRFAPTCENV" FT CDS 9468..10799 FT /product="Gypsy-1_NG-I_5p" FT /translation="MRKLESFAVVLENNSAKNLANINSLDVEMNIKTTLLQ FT FKADTELINTAILFAVKGLIHPKILAPETIAQAAKSVEDSLSHAKFPLSPG FT DFSVIPIMEISRVAILFSNGYLIYHIAIPLVDVEEFNLFKASPIPTVQSIL FT NSAHTAAYIWPVHTYFAISTSNLTYLPIPAEEVRKLRKLQDTLIVVNPEPV FT REISDSVSCEIKVAAGRKIESPEVCDIRLKRLRESFWLRLHRANTWVYSAR FT ERENIYIQCPKKELVTAQISGAGVLELREGCAAHTASARLVASHIITKQLE FT TLSLGALQFNMSKIWAHVNESSAAAADLKQAIAIGTEHRADNAHGSDLDNL FT KAGESLREITARARDIAYRKKTEFELKDLSGHTGILSWVSWSSLSIVLICV FT VAVWCFMRYRAQQPMRDLARQQRDLMQLEAARSFAQLSASRRAAEAANAQ" FT CDS join(902..2461,2465..3703) FT /product="Gypsy-1_NG-I_2p" FT /translation="MLTRRKTFKTKSRPASRESLDRENIKAVKRINKIIDL FT HTNNPRVARKALARTLSRDALSSDEANAPAREQPANLLTGAYRQQQGNDFP FT SRTEDESGEFERQVAPSPVDSVDGARAVPTTQRKPTSGLDWIGLNQKLAAW FT NLKTDMRPMSAPNAGTPLPTAPNDGFKRKSQKVLLPMPSNPAGGKFERSKQ FT GAFARFSRAGANQHNLASTLNPLPPFMFPVASVPNTMGQLNKNTNAFANEL FT QRGAPERQARSTGAYGPALVVKETRRLSNASERPVSRTASVASTVIGSACT FT ARSSQGGTDFAALSEHMESIEAKLSAKIDSLAEKIDSRAGPSHAKNLNVHE FT AIREEVGKQVDILREIRESSLASTMPIQTPPSRKLTSQEPRRQEPVHLNSS FT GKTPARKESAGLDRRPTKNRDPKTAQLIELAESSSSDSAADTESDIEAVTE FT TRRNGGTRAEKLVHKKTAQQIRAKAEAAARTAVREVSSGTTGQNKNKRHAY FT AGGFYPSDDDPESDPSDKDGKGKKDKPGNRKQDAARNPARRQPRRGDNRPP FT SDDPDDDPSDGSDESESDEDDGDKGPETPRIKKNKNEIRSGTSKGEKNLLP FT PKVSFNQAIGSLPDFDGDPDNLSLFSRAVRNVLKSHPAHEDYILMYVAKKL FT TGEKTKGYRTRIGSYTSVEALLLDLTLHFANVSVADKIMAEIRAVKQGSAE FT PAGDYGLRVERLINRYSTIIESAPDLSKAERRLRERLAQEDALEQFLLGLK FT APLDHLVRGKNPTSLKAATTTAIGFEAKQSGRGALSTSRGPENGQAQVRRA FT KAEQARPNNGARGAADSSPKPQPVSGDGPEKYCPYCNKYNHSLSECKTIAR FT HVELQLIKNPNYSNDDNQGGGDNNRNNGASNRGSNNNRKYNKKDSQNNNKK FT GKRDRSSDSNGRNQESGEQADKNNKNNLN" XX SQ Sequence 11818 BP; 3567 A; 3154 C; 2880 G; 2214 T; 3 other; cccgccgtcc gtaacattct ggcatccctc gcgggtctcc gacacagtaa acgtaaagtg 60 cggagaatat taaaactatt cgttgagttt accaacgaat agatcgtttt tacaaaaatt 120 tcggtcgccg aggctcggaa taataattag tccgggctaa aatcatcaag gccaacagcc 180 agccgcagga ctccgtccac gagatagacg ggggcaaata aagtctagtg cactggtgag 240 acgccctcag ctccagcagt tcgagcaaga agttgcccag gaaatcgaaa aactccttca 300 gagcaaccgg gcaaccacat cttcaccagc gcacaggtca ctgatcgttc tagcaccagg 360 aaggccagag aacgcagcga gccattgcct ccgctacgtg atcgaactcc ggtgcggcgg 420 tgaaccagcc gtagcagcag cctaaaaata cgcgccgctc tacaaaaaca aacagaaaac 480 acgaaaatca acgcacgtcg tgcgcctcaa agtcgacctt cggccaacaa agcccgcctg 540 ggccaacgat agccgtcagc gcggaccaga cgctgacgtg cgaagcagcc gcaggaagca 600 gacagcgcgc gcaaccagac tccggcgcga gcagcagccc agctccacaa gccagctctg 660 ccagcccagg tcttcgccca gcagcaacac ttcacgatga atccgagcgt tcttctaatg 720 taagtttttc ttgccaatta ttcttgagca gcttggctac acgggagtac gcgtcgcgat 780 ttattcgaga aaataaaaaa aaaaagtgtt tcgtttctct cgcgagcgcc gcaatacatt 840 tcgcgtcgag taaaatcttc agaataagaa tttttctttt tgtctttctc tctcgtgaac 900 gatgttaacc agaagaaaaa cattcaagac gaaatctcgc ccagcgagcc gcgaaagcct 960 agaccgcgaa aacattaaag ctgtaaagcg aataaataaa attattgatt tacatactaa 1020 taatccgcgg gtcgcacgca aagctctcgc gcgcacactc agccgcgacg ctctctcttc 1080 cgacgaggcg aacgcgcccg cccgggagca accagcgaac ttgctgaccg gggcgtaccg 1140 gcagcagcaa gggaacgatt ttcccagtcg cacggaggac gaaagcggcg aattcgagcg 1200 acaagtcgcg cctagccccg tagactcggt ggacggagca cgtgcggtcc cgacgacaca 1260 aagaaaacca accagcggcc tcgattggat cgggttaaat caaaaactgg cggcatggaa 1320 tctaaagacc gacatgcgcc ccatgagcgc ccctaacgcg ggcacgccgc tccccaccgc 1380 gccgaacgac ggatttaagc gaaagtcgca aaaggttttg ctgcctatgc cgagcaatcc 1440 agccggcgga aaatttgagc gcagtaagca aggagctttt gcgcgattct cgcgagctgg 1500 cgctaaccag cataacctcg ccagcacact aaacccgctt cccccgttca tgtttccggt 1560 cgcatcggtc ccaaatacga tgggacagtt aaataaaaac acaaacgcgt tcgcgaacga 1620 attgcaaagg ggcgcgcccg agcgacaagc aaggagcact ggtgcgtacg gtccagctct 1680 tgttgtgaaa gagacacggc ggctttccaa cgcatccgag cgcccagtat cgcgtaccgc 1740 cagcgtagcg agcacagtaa tcgggagcgc gtgcaccgcc agaagcagcc agggtggcac 1800 ggatttcgca gccctgtccg agcacatgga gtctatcgag gcaaagctgt cagcaaagat 1860 cgacagcttg gcggaaaaga tcgatagtag agccgggccc agccacgcga aaaatcttaa 1920 cgtgcacgaa gcgataaggg aagaagtcgg aaagcaagtc gacatccttc gcgaaattcg 1980 cgaatcaagc ttagcaagca ccatgccgat tcaaacgccg ccaagtcgaa agctgacgag 2040 tcaggagccg cggcgccaag agcccgtaca cctgaattcc tcgggcaaaa caccggcgcg 2100 caaagaatcg gcaggtctcg acaggcggcc gacgaaaaat cgagacccca aaaccgcgca 2160 gctcattgaa ctcgcagaat cctcgagcag cgacagcgcg gcagacaccg agtctgacat 2220 cgaagcggtc acggaaacgc ggcgaaacgg cgggactcgc gccgaaaagc tcgtgcataa 2280 aaagacggcg cagcaaatac gggccaaggc cgaagcagca gcccgaaccg cagtgcgcga 2340 ggtgtcaagc ggtacgacgg gccagaataa aaataaacgt cacgcatacg ctggcggctt 2400 ctacccgtcg gacgacgatc cggaatcaga tccgtctgat aaagacggca agggaaaaaa 2460 gncggacaag ccgggtaatc ggaagcagga cgctgccaga aacccggcac gtcggcagcc 2520 gcgccgcggc gataatcggc caccgtcaga tgaccctgac gatgatccgt cggatggcag 2580 cgacgagagc gagagcgacg aggacgacgg cgacaagggt cccgaaaccc cgcgcataaa 2640 gaaaaacaaa aatgaaatta ggagcggcac gagtaagggc gagaaaaatc tgctcccccc 2700 gaaagtgtca tttaaccagg cgatcgggag tctgcccgat tttgacggag acccagacaa 2760 tctcagcctg ttcagccgag cagtgcgcaa cgtgttaaaa tcacacccgg cgcacgaaga 2820 ctacatccta atgtatgtcg caaaaaagct cacaggagaa aaaaccaagg gctaccgcac 2880 gcgcatcgga agttacactt ccgtggaagc cctactgctc gacttgaccc tacacttcgc 2940 aaatgtgagc gtagcggaca agataatggc agagattcgc gctgtcaagc aaggcagtgc 3000 ggagccagca ggagattacg gtctgcgcgt cgagagacta atcaatcgtt actcgacgat 3060 tatcgaatcg gcgccagacc tatccaaagc ggaaaggcga ctgcgcgagc gcctagctca 3120 agaagatgca ctagagcaat ttttgctcgg cctcaaggca ccgctagacc acctggtgcg 3180 aggtaaaaac ccgaccagcc tgaaagccgc gacaaccacg gcaatcggtt tcgaggctaa 3240 acagagcggt cgaggagccc tgagcacctc gcggggcccc gaaaatgggc aggcgcaggt 3300 gcgtcgtgcg aaggctgagc aagctcggcc aaacaacggc gcgcgaggcg ctgcagatag 3360 ctcgccaaag ccacaaccgg tctcgggcga tggccccgag aagtattgtc cgtactgcaa 3420 taagtataac cattcgctta gcgaatgtaa gacaatagcc cgacatgtcg agttgcaatt 3480 gatcaaaaat ccaaattact ccaacgatga taatcaaggt ggcggcgaca ataatcgcaa 3540 caacggcgct agcaatcgcg gcagtaataa taaccgcaag tataacaaga aagactccca 3600 aaacaacaac aaaaaaggca agcgtgaccg ctctagcgac agcaatggtc gtaaccaaga 3660 gagcggagag caagccgaca aaaataacaa aaataattta aactaaaagc ataaaattca 3720 tcgtcacggg tacgcaaccg ccgatcgtag ctgtagattg cccgcaaatg ttaaaacgca 3780 gaggcaaatt ctacgccgat tcgggagcgg atatctcagt aataaaaatc gaggaattgg 3840 caccgggcta tcctatcgac actgataaga tagtgaaaat tcagggagtg accgaaggca 3900 cagccttcac acttggccaa gcagaaatcc aaataaatgg attaccctgc agtgtacaag 3960 tggtaaaaaa tgatttcccg atagaaaatg caggaatcat cggctgggac gtaatcgaca 4020 agcacaatgg ctgtgtcgat gcgctcagca aaagcctcaa gctaggtgcg acccacctac 4080 ctttcgaggc tgacgaaaaa atcacaatcc ctccgcgtgt caaaatggcc atcagcgcac 4140 ctctaaaaaa ccacgacgta aaagtcgggt gggtcccact gtacgacctt caccccgacc 4200 tattattcgg caatttcgtt tccgaaaatc ggaacggccg tgtgcacggt gagtgcataa 4260 acgtcagtga ctctgaaata acgatatctg ccccgatggt cgagttactc gaatgcgaag 4320 aaatcgctaa cccgctatac caggccgatg aggacggctc agccgaaaat gtcgccacat 4380 tcactgccag cttaagacgc ttattcggca cagaaaaccc aagtgaaaaa tatcgcgagg 4440 tatctaacca caaccgcgag ctgcagtcga acgccgaaaa aaggcgcgcc agagtcgaca 4500 aaattatgag tttagcagac ctcgagggat gcagcgaagc ggagaaaaac ataatccgcg 4560 aaatcgttga cgaattcgcc ggtgtgttcg ggctcgaagg ggaacctctt cccgcgaccc 4620 acctgttaca gcacaaaata atgctgaaat caaacaggcc tataaggaac caccgctttc 4680 gcttcccgcc tgctataaag gagcacatga ttcgcgagct agcaaagcta cgcgagcaaa 4740 acatcgtggt gccttctaac tctaattact cgtcgtcgct gtggatagtc ccgaaaaaac 4800 cagatgccag cggaaacaaa cgattccggc tagtaactga ttttcgagga ctcaacgagg 4860 aaacagaggg aagttgccac cctctcccgt tcacaagcga cattttggag catctcgcga 4920 cggctaattt tataaccgtc atggacctca aacaagggta ccaccagatc gagatgcatc 4980 ccgactcggc gcaccttaca gcgttttacg ctcccgacgg caaccatggc aaccagctgt 5040 tgcaattcaa caggatggcg atgggcctga aggaagcaac tataacgttt actcgcgcca 5100 tgtcattagc catgaagggc ttacagggcg acgaagtcga aatctacctt gatgacctca 5160 tggtattcgg cgggacattc gacgagcaca ataagcgtct gcgacgagtg ctcaagcggc 5220 tactcgaagc aaatatgact gtcgagccaa agaaatgcca atttctgaag aaagaggccc 5280 aagtgctcgg gcatatagtc gggggcggtc tgatcaaaac agacccggcg aaaactcacg 5340 ccatggccaa ttacccgacg cccaccgacg ccaaaaagct aaaacaagct ctcggcttgt 5400 ttagctacta caggcgattc atacccgatt tttcaaaaat cgcatatccc ctattcaaac 5460 tcctacaaaa agatataaaa ttccaatggg ggaaagcaga aaaaactgca ttcgaaaatc 5520 tccgaaaatt aatggcggag gagccggttt tgaaagcacc agtcttatca gaacctttta 5580 tcgtgacaac ggatgcgagc gactacgccc ttggcgcaat tttaagccaa ggaaaaatag 5640 gcgtggacca cccgtgcgca tacgcatcgc gctgcttaaa gggcagcgag ctacgttatc 5700 cgacgtacga taaagagctt cttgccgttg tttttgctaa agagcagttt aagccttatt 5760 tatacggcag aagatggacc ttggtaacgg accacgaacc attgaagcat tatcatacgt 5820 ccaaaaagcc agacctacgc tttaataggc tcaaggctgc gctaaacggc tatgacttcg 5880 acgtcgtgta tcggccgggc attaaaaact cgaacgccga cgcgctctct cgcaatccgg 5940 tactagcaga aggcgaagta aacccggatt tgccgcgcta cgagctctac gagctcgctg 6000 accagcaggt aaaggaaaac cctgacgaag aagcaggtgc ccctccggga agagtcttcc 6060 gtgcacgcgc cgtaaaaaag caaaaccaaa aggcgggaaa gaaagaacgc gtgtctgaat 6120 caagctctag cgacgcatca aatcgcggtt taagcaaaaa acctcggcaa gataacgcac 6180 taatttacga gcgcaacgag tgcgtggcgg ttagaaacga gttcaacgga tattacatct 6240 gtcgcgcgtt acacgatgtt caccaaggcg acgaaaaaat cgctgttcaa tggtttacgg 6300 aaaccaaaaa tctaaaacac gtgtacgcgc cggattaccg cgacgcggtt tggtttgatt 6360 ccatattaac cagcgcatcc ttgaaacggc tcgacagaaa tcattatctg ctgggcgccg 6420 cggaacgtac gcgcatagac gcaatcttac gcgaagccat tgaatttatc gccgacgcac 6480 cagctccgaa gcctccggaa atttcggcag ttgcagccga attttcggat tccgaccaat 6540 caatgaagtc ggtagcatca gcgctttcta gctcttcggg attttcgtta ctccctccgg 6600 agttgcgccc ggttgcaaaa tacgtgtggc cctccgagcg aaaaactcgc gggtcggaaa 6660 aaacaaaaat gacagtatcg cgaaaatcgc gcgaattggg caattcaagc agcgaggaaa 6720 gcatacggat ctcgccgtcg ggaaaaatcg caaacgtgat cgacgagccg cagatcttct 6780 ctttccctca aagggttttt tcgcgaagcc cgagccctcc gcagagccca cagacgcgcg 6840 tcacgcctga accacccgta atgactaaaa gccctttgcg tcctacggct gccgaatttc 6900 aaatgccggc gtccactcaa gccagccaac cgcgcgatag cagcaacgcg ccgcgtgctg 6960 agagtgcgct agctgagccg ccgggaccca agtcaggctc aacgtggcca tggtcaggta 7020 ataaaaccaa gaaaaaacta agggtaaata tcgaagggaa agacttcgat ggaaaaggcg 7080 gcgtccgcat cgaatgttgg gaattaaact caagcgtcgc gccacaggat aaaataatac 7140 ctgcacgagt ggaaaactcc cagcagcgta cacgcgcgcg cgagctgtcc gcaaggaaag 7200 acaacgccat ggcgataagc aaaagcccag tcagctcgga aaaagagact gctgcgggta 7260 acgcgagtgc gcctagcgga aaaaacagct ccaagaacgg tcgcaacgag agtggcaaaa 7320 aagcacccga gacgggcagc cttacgccag ccgtcgtatc accgccaaaa tcagccgcgt 7380 gcatgacaca cccggcaatg attccgtccc ccccctcgtc agtgggtcca aacgaggatt 7440 ttgcgggaat actatcaaaa atcacttcga agatcggccg cttcgcgccg acttgcgaaa 7500 atgtctaaaa accttaaaaa acgttatatt aaaggaaaat atccgcagca tagctctgat 7560 tagagatctg gcgatgctca cggtatccga gtggacaaac ttcgtcgagt tattcgacaa 7620 tatttttgcc ggagttgcat taattgcaat tctgtacaaa aataatttgc cagccccccc 7680 cccccagtct ctgagcgctt taatctcatt aaagagtatc acgaggcaac aatgggaggg 7740 catcggggca aaaataaaac atacagcaaa atcgccaatg atttttattg gcgaaatatg 7800 cgccccgatg taaagcaatt tgtcgctcgc tgcccaacgt gccagagcaa taagctggtc 7860 cgaataaaaa ctaggctacc catgcttatc agcaacacgc catcgatgcc atttgctcat 7920 atcgccatag atttctatgg gccgttggaa cgttccaaac acgggaacag atatattttg 7980 tcggcgcagg atatgctcac aaaatatata gtcctaacgc cagcaaggca tgcgaatgcg 8040 gacgaggtcg cgcgaattct cacggagaaa attatttgtg tttttggacc acctgccgca 8100 ttggtctcgg accaaggtag ccattttcaa aataaaattc tggaggaatt cgctaggatc 8160 ttcaaaataa acaaattttg cactacggca tatcatccgc aggcaaacgg ctcgatcgag 8220 cgtatgcacc atacgctgac cgagtacctg cggaagtatg taagacgtgc cgatacttgg 8280 gatgaatgga cggccgtatg ccagcatgca tacaactgta ccgagcacga gagcactcgg 8340 tactcgccgc acgagctgct cttcggcttc aagccaagaa cgccatccag cttcccccga 8400 gtcagcgacg atatgtccta taacaattat ttgaccgaga tgacaaacaa tctcacggcc 8460 ctgcaaacga cggcggccat gaacttggtg caatccaagt atcggtcaaa acactactac 8520 gatcgtaaat taaactcgaa gcattttcgg gaaggcgaaa tcgtctttct gataaatgag 8580 cccaagaaaa acaaatacct taaagaatac cgcgggccct tcgagataat tgcgatcaac 8640 cgcaagacta ataacgtgac gttgcaaaac gacgagataa caaaagtcgt gcacgtcaac 8700 aaaatcgaac ggccgagcga gctagcaaga aacgcagact tatcggattc cgaccgtgca 8760 agttaggacg tttttttttc ctgttttttt gcgcgccccc gggcgtagcg caaaaaaaaa 8820 aacaacgtcc taacctcaag atttaaaaat gaataaatga aagacactcc cagcgcgtca 8880 aaagccagct gctcatttca gcaacataga tataaggtgc ttttaccatt aattgcaggc 8940 tatgcgcagc ctgcgtgcta gggatcgagg aaactcccca ggtccacagc ccagggcgtt 9000 tcgtgcagga gttaaaacag aacttggggt tgatcaccga gaagattgca cccctctcaa 9060 catccagtac taactggaag ttaatagaaa aaatcgacct gaacgaattc ttccaagcga 9120 gtaaagtttt ggtcgatcgc gtgtcaaccg ccgctcgagc atgccatccg cgttgcgatg 9180 ccgccgggtt gatcaaagaa gccgaatccg tcaccgcgca ggctgaacgc gttataaaat 9240 tgatacaaac ggatagcagc ggacaccaag ccccacggga gcgccgcgcc atactaccct 9300 ttattggctc actccacaaa tggctatacg gtacactaac ggaagccgat gaagctgaga 9360 ttcaagcggc agtgcaaagg atcgcagaag atactcgtct gacagccgcg ttattggcga 9420 atcagaccga aatagttgaa cacgagttat cacacttgaa aaagaggatg cgcaaacttg 9480 aatcatttgc agtagtgttg gaaaataatt cggccaaaaa cctcgccaac attaatagcc 9540 tggatgtcga aatgaacatt aagacaactc tattgcagtt caaagccgac accgagctga 9600 ttaatacagc aatactattt gcagtcaagg gtctgattca cccaaaaatc ctagcgccgg 9660 agacgatcgc ccaagcggcg aaatcggtag aggattcact ttctcacgcg aaatttccat 9720 tatctccggg cgatttttcc gtcattccga taatggaaat atcgagagtg gcaatcctat 9780 tctccaacgg atacttaatt tatcacattg caataccgct ggtagatgtc gaggaattta 9840 atctctttaa ggcatctccg atacccacag tgcaaagcat ccttaatagt gctcacacgg 9900 ccgcttacat atggccagtg cacacctatt ttgcgataag cacgtcaaat ctcacctacc 9960 ttccgatccc tgcagaagaa gtgcgcaaat tgcgcaaatt gcaggacaca ctgattgtgg 10020 tcaacccgga accagtgcgc gagatcagtg acagcgtatc gtgcgagatc aaagtcgccg 10080 cgggccgtaa gatagaaagc ccggaggtat gcgacatccg cttaaaaaga ttacgcgaat 10140 cattctggct gcgcctacac cgagctaaca cctgggtgta ttcagccaga gaacgggaaa 10200 atatttacat acagtgccca aaaaaggagc ttgtcaccgc acagataagc ggcgcaggag 10260 tcctcgaact ccgcgagggg tgcgcagcgc acacggcaag tgcgcgcctg gtcgcctctc 10320 atataataac aaagcagcta gagacgctat ctctcggcgc cttgcaattc aacatgtcca 10380 aaatatgggc acatgtaaac gagtcttctg ccgcagcggc agatctcaag caagccatcg 10440 ctatcggcac cgagcatcgc gccgataacg cgcacggctc ggacctcgac aacttgaaag 10500 caggcgagag cttgcgcgag attacggcaa gagcacgtga cattgcctac aggaagaaaa 10560 cagagttcga actcaaggac ctaagcggac acaccggcat tttgagctgg gtatcctggt 10620 cctcgttgag catcgtactc atctgcgtcg tcgcagtatg gtgcttcatg cgctatcgtg 10680 ctcagcaacc catgcgcgac cttgcgcgcc agcagcgcga cctcatgcaa ctggaggccg 10740 cacgcagctt tgcgcagctg tcagccagta gaagggccgc cgaagccgcg aacgcgcaat 10800 agcttccctt ctcccccctt cagggccctc gtgtaaaaac aaaaaaaaaa gaacgcgcga 10860 aacggccagg gacaaagaaa cgcgcgccac ttttccgaaa acaataaatt aaaattttaa 10920 tacaaaatat aatacccaaa taccggtaac gccgcgtcgc gcgaagcaag cacgaacgga 10980 gctacggagc tctggagttt tatcgcgcgg gccgataacg cgtgcctcta tttctataag 11040 aataatctaa tcgccaaaat agccaagctg ccgctatggc aagaaaaata tccaagcgat 11100 atgatgcgct gcataacgcg caattttgtt atatatacat atatgcatgt aggttataat 11160 aagaaaaata acaaaatata cacatttata tgccatatcc tacaaattag aatgttatag 11220 cttataagga aataccgtaa gttgcgaatg caagtacgcc atgaggctta taattatatt 11280 aataaatgta aacacactat gttatagctt ataaggcaat accgtaagtg gcgaatgcaa 11340 gtacgccatg aggcttataa ctatattaag aaatgtaaac atactaatat taaaaaaaaa 11400 aaaaaaaaaa aaaaanatat gtatcgagct ccgcctctaa aaaaaaaanc tctgtgagac 11460 agcagcggct gtcgaaagac aacatccgcg ccgactcaca cacacacaca cacacacata 11520 cgcacacata tcaacgctta acagattttt tttaagagaa aaacaaagag aactgacatt 11580 acggagtcag caggatttag aaacccaatg cgcttctaat gcgctgtgct caagcactgg 11640 catcccagaa aatagaaaaa aaaagaagta aaaacatgta cacatacata taccctcagg 11700 cgaggagcgc gcgacagcgc gccccgtcgc cttacacact caagcaacgc acgcacggct 11760 ccgccggcga ccctgttttt tgagggcgcc gaggacgtca cccgcgcgcc caagggcg 11818 // ID SINE1b1_Cis repbase; DNA; INV; 303 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE SINE Non-LTR Retrotransposon from Ciona savignyi. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; L1-99ext_Cis; KW SINE1b1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-303 RA Smit A.F.; RT "SINE1b1_Cis - SINE Non-LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC Ci000003. XX SQ Sequence 303 BP; 69 A; 76 C; 77 G; 79 T; 2 other; ttgtccgacc gattagtctg gtggtttagt gcgncgcctt tcagccgaaa ggttgcaggt 60 tcgaaactgg tcgctagcta gtcggttgtg tccttgggca aggcacttaa cggacattgc 120 ctgaacccag cggattaatg ggttctacca aattgaagga acgtntgaat catacacaac 180 acactgcaat agctccggta acccgacggt gggcgcgagg tgatctgacg attgcccgtg 240 tgttaacccc cttggttttc ccattcacgg ggataaacat gaatatccta tcctatccta 300 tcc 303 // ID DNA4-10_AP repbase; DNA; INV; 160 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-10_AP. XX NM DNA4-10_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-160 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1959-1959 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 160 BP; 42 A; 36 C; 43 G; 39 T; 0 other; ctccttttca aaagagaaca taccgctttc gcgcatgtca aatgattcga ttggtgcgtc 60 gagaaccacg tgatcacgct aacaatggga aattggtggg ggatgcgcgc cgccggacat 120 aaattggtcg ccgaccggta tgttctcttt tgaaaaagag 160 // ID Gypsy-19_OD-LTR repbase; DNA; INV; 330 BP. XX AC CABV01002282; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_OD_; KW Gypsy-19_OD-I; Gypsy-19_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-330 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002282; Positions 5 334. XX SQ Sequence 330 BP; 88 A; 68 C; 86 G; 88 T; 0 other; tgactcagaa caaacggtcg cgcgcggcgt gggcgaaaag tttcgcttgc gcacagctcc 60 gctcagactc actcttcgca aaatccgttt taggacacgg atgtatttat atttttatga 120 ccttgacgct gaataaattg gatagagaat cacctaattt attcgagatt atttattgca 180 gggtagttga tgttaatgcg cgagctgtgt gggtttaaga gaaggtccga gagaagcccc 240 tggcgagttc ccagatttgc attcaagact tgaatgtaga tcggagactt gcctagggaa 300 gctcgaggta caggctccag taacctaaca 330 // ID hAT-73_HM repbase; DNA; INV; 5185 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-73_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5185 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 413-413 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2676..4562 FT /product="hAT-73_HM_1p" FT /translation="FLIYYVKVLIFYNNILLYLKVDCARITGSNFINRKET FT GFTGRELIKCCANTVREKCCKILENCNFFSILSDGSQARKTGKEKELVLVR FT TERNGIPIYMVTELLEMSQFGGSDSNSVTAGINSVFESDNSFFKISTEDYI FT KKLVSATADSANVNFGIKNGALTQLEKSRKWLLKIHCINHRIELAIKDTFK FT DVTEFIKIEEFYLSNYYLLRNSGKLKSEVEAAAKAIGITFYILPKISGTRF FT VGHRKRALSSLLETWPAFLSAYENYASDPKNGKTVGKVKGLLKLFKCQSFL FT VQVGLYLDCLEVIIPASKIFEKNELLPHEIPTTIQRTLIEIDDKIKSIGEE FT DEFYDSYVIRYLVTDHNEIEGSFQRAGNKRKHPKNRSYINVTMEMNKFDQE FT KTIQKMHEIKRVIFPKLHELLSSRFQDYSREFSIYRSMKFIDPIHWIAEDK FT ESGVTQIKLVSDHFEVPLINAGFQRDAALREWKKLKLLVTTEYESHPVRSL FT WNVIFKHKIKEFPNICLLVSLIMSISGSNSEVERTFSIVTNILADKRLSMN FT HDTLADCVIILGNKSLWSKDEYDNIIDLSLLKYMKKRRKSDVGDFNKLRTA FT NIQDIQTCSDADDVDSEDDMLADYLKELNI*" XX SQ Sequence 5185 BP; 1894 A; 695 C; 786 G; 1803 T; 7 other; caggccttgt taagaagcgt tacgcagctc gcattttgcc tgcgaaaagg cgatttgcct 60 acgcgttcag cagttatgcg ttacgcaaaa acttttatct tttagaatat ttaaattatc 120 tttttatatc gtttatgtga cgttcattca gaagtattaa agtcgtaagt aaaagaattt 180 aaccgcaatt gtaaactaat agattttttg ttaaagaaac aatttgttta ataatttttt 240 ttttttgttt ttaaaaatta cttaaagaaa ataaagtgtt ttaagtttaa acttttcatc 300 atatttaatt ttaaatatta gatttcataa gtttataaac tttattcaca agtaatgttt 360 attttatatg taatgtattt tgtttattta tttataattt taaaacaatt tttttttttc 420 gtccctttaa gaaagtactt atgatgagct gtaaagtaaa tcatttcctc atcatcaact 480 gtcgagaccc cgatgcgatt tcacgagttg tatgggaaaa ggtttgttgc ccccctccct 540 tcattttaag agaggggagt taacatttga ctaacagtta aatatttttt ttgagacttt 600 tgaaggattt actctcctta tttcattcct aaaatcgcgc ctggctggac ctgtcggtaa 660 tatgtcagaa aatttagttg atcgaatttt tttttttttt kaatatttag tatttagaag 720 taaatgattt ttttagtgtt gaagattctt ttttgcatac atttctatta tcattaattt 780 tttagattga tcatagaaag aacgtagatt agaattagtt tatgagcagt agatgaatac 840 ttactagtca agttctttaa agaaaattgt tgatagattt tttgctttgt ttaagcactg 900 attaaaactg taaagtagta tgaagctata atcatattat ataaatgatt gaaacttgta 960 actaccatta gagaaaaata aataaaaaat cgttttattt cttataattt taagcaaaaa 1020 tgccgtgaga taacattaat cgtcaattta taggtcaatt tctttaaaaa aaaaaaactt 1080 ttaaggaaga agtggtaatt ggtagtggta gaggttcgtg tttttcctcc tctgaacagg 1140 ggaagagtac gtgtcttccc cccccacctc cattgtatcg tcgggttaag atgcattgca 1200 ctgctaactc cggcactccg ggttaggttg caaacgatag tccagattct ggtacttgag 1260 tatttcaact aaaattaact tttatcaatg gtatttatgg aaaggttaac aaaaatatat 1320 ttaaaacaaa tacttaaaaa aaaacggtaa tatttagcaa aatcagttgt ttcctattta 1380 taaaaatctt ttctctgtaa ttacgatgat attaggatgt tgttttggtg atgagagctg 1440 ttacattttt cacaactaaa ttggtactta gttaataact atcgggagag aaaaattttt 1500 ttttttccat tcaggaaaaa ataaagatat atatttaatt aaattttaaa gtacaatagt 1560 tattacactt agttaggctg cctatttttt aatgcgcatt aaatttgttt ctgctttttt 1620 gtaaaagata ccggaaactt gtttataata aagcgaaata aacttctagc taatgttagt 1680 tattttaatt tccttacgac atattaattt cctttataag gagtttgata tttactaagg 1740 atttcaacaa atcattcaac aattctaaac ttttttaaaa atacataatc aaatggataa 1800 cattttgaat aaaaacagag tcactttgaa aacttttgaa aaatggaatc tttccgaagt 1860 tttttgtata gaaactgctg tagaaaatgg aatttcatat gttgtgaaaa tattttgtaa 1920 tgtatgtgct cttcatcaag agaatatact ggccaatacc aaaggagcaa taagaacagt 1980 cgctttacga tacataaatg ggacacgcta cgttaaaaaa gatagcgttt ttcgacattt 2040 acaaagtgct gtccacagaa atgcaatccg attgtcgaat gaagacctaa atgaaaacaa 2100 agaagcaaaa gcaaacgaca aagagcaaca ggttaatagt cagactttat aattatttta 2160 tatttaaact ataattttaa acaaagatgt taaaattttg aattagttca ctttttataa 2220 ataaaaatca ttttcaagtt tatatacgaa ataattattt actacatgca acaaagcctc 2280 tataaaataa agtaatcaaa ctgtctactt tccatggaat tcctcattat tttcatatat 2340 ttaatataat tatgataact ttttttacaa ctcatttatg tccttttaag aaattttatc 2400 tactatttct ataaggtgga aaccaacaaa ctcgacatta gttcgtgtga aacaaaattg 2460 acagagaaag aaaaatctac aaataattct tacaaaaact tggtaaaaat agcttatgaa 2520 atggcttgcc atccaactat gcctcatgca catttttcta ttctcgtaag ttttagttaa 2580 tattattgtt aaactttttt tttttttttt gcaaataaat gttaattttt taaaaatgaa 2640 aaacaagttc aaataagttt aagctttaat cgtagttcct aatttattac gtgaaagtat 2700 taatttttta taataatatt ttattatatc ttaaggttga ttgtgcacgc atcacaggat 2760 caaacttcat aaatcgtaaa gaaacaggat ttactggcag agaacttatc aaatgttgtg 2820 ccaatactgt ccgtgagaaa tgctgtaaaa ttctcgaaaa ttgcaacttc ttctcgatac 2880 ttagcgacgg aagtcaagca cgaaaaactg gaaaagaaaa agaacttgtt ttggtccgta 2940 cagagcgcaa tggaatcccg atatatatgg ttaccgagtt gttggaaatg tcacaatttg 3000 gtggaagcga ttcaaactca gttactgctg gtattaacag tgtctttgag tckgataatt 3060 catttttcaa aatatctact gaagattata ttaaaaaact agtttctgcc acagctgaca 3120 gtgcaaacgt taattttggt ataaaaaatg gtgccctaac acagctagaa aaatctcgca 3180 aatggttatt gaaaatacat tgtatcaacc acagaatcga attggctata aaagacacat 3240 ttaaggatgt aacggaattc ataaagatcg aagaatttta tttgtcaaat tattacctac 3300 taagaaattc tggaaaatta aaaagtgaag tcgaggctgc ggcaaaggct ataggaataa 3360 ctttttacat tttaccaaaa atatctggta caagatttgt cgggcacagg aaaagagccc 3420 tttctagtct tctcgagaca tggccagcct ttctttcagc gtatgaaaat tatgcaagtg 3480 acccaaaaaa tggcaaaaca gttggaaaag taaaaggtct attaaaatta tttaaatgtc 3540 aatcatttct tgttcaagta ggtttatatt tagactgttt ggaagttatt atcccagctt 3600 caaaaatttt cgaaaaaaat gagttattac ctcacgaaat tccgacaact attcaacgaa 3660 ctttaataga aatagacgat aaaataaaat caattgggga agaagatgag ttttatgaca 3720 gctatgtaat tcgctatttg gtcactgatc acaacgaaat tgaaggttcc tttcaaagag 3780 ctggtaataa aagaaaacat cctaaaaatc gttcttacat taatgtaaca atggagatga 3840 ataagtttga tcaagaaaaa acaattcaaa aaatgcatga aattaaaaga gtgatttttc 3900 ctaaacttca tgaattactt tcaagtagat ttcaagatta ttctcgtgaa ttttccattt 3960 acagaagtat gaaatttatt gatccaatac attggatagc tgaagataaa gaatcgggag 4020 ttacgcaaat caagttagtt agtgaccatt tcgaagtccc attaataaat gctggttttc 4080 aaagagatgc tgctttacga gaatggaaaa aacttaaact tttagtcaca accgagtacg 4140 agtctcatcc ggtaagaagt ttatggaatg taatttttaa acataaaatt aaagaatttc 4200 ctaatatatg tcttctagtt tctcttatta tgtctatatc cggatcgaac tcggaagtcg 4260 agagaacttt tagtattgtt acaaatatat tagcagataa acgcctatca atgaaccacg 4320 acacacttgc tgattgtgtc ataattctcg gcaacaaaag tctttggtcg aaagatgaat 4380 acgataatat tattgactta tcgttgttga aatatatgaa aaaacgaaga aaatctgatg 4440 ttggtgattt taacaaactt agaacagcaa atattcaaga tatacaaaca tgcagcgatg 4500 ctgatgacgt agacagcgaa gatgatatgc tagcagacta tcttaaggaa ttaaatatat 4560 aaattcattt taaatatata aatcattttt tgtcatagta atttaaaaaa tgcacgattt 4620 attttgatgt aaatatttta ttaataagat tyattattaa taatattttg tttattgtta 4680 attcaattcg aataacgtaa agttttaatg acgtacgttt aatttatgaa cataacattt 4740 atgatcacga atatgttaat cagtaactga taaaactctt tattttttaa aaacaatatt 4800 cataacaaag ataaaaaaaa gtttattaac agttaaaaaa aaagaaacaa aaaccaactg 4860 taaagttata aaaagataaa ctaatatcat gttacgttgt atgaaagaaa atattttttt 4920 aaaaataaaa tttagcaaat gataaaaacg atttttcaaa taggaactta tattttattt 4980 gtaacttatg atatagtgar aaaaaaraay tgtctcaatg atttttaacg cgttaagaaa 5040 ttaactttaa ttatgatgcg gtacttgaat agacgctagt gacgcaattt aacttctaac 5100 aaaacaaaat aaatttgctg agttgagtta ctttgaaatg cgttcagcgt tttcgttacg 5160 cagcraaatc tcgtaacaag gcctg 5185 // ID Mariner-1_AAe repbase; DNA; INV; 3500 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Mariner/Tc1 DNA transposon from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3500 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1286-1286 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. TA TSDs. XX FH Key Location/Qualifiers FT CDS join(484..1059,1086..2114) FT /product="Mariner-1_AAe_1p" FT /translation="MEAKKENIQRALESVAGGTSLRAAAKMFNIRVTTLYD FT RKKAVNRRDAAGKPTVLSPSEELEIVGWLEDMAAAGHPICEEILKINVAQY FT AKQLGKSEAFRSGIPSRGWVLRFLKRHPTISKRTANPISKRRIVTPDEVRL FT WFQEVERYLAKHDYLDILNHPDRIFNMDESAFEMAPNPRKKKVFAKRGIRI FT VIHLFLFVAGTKNVQSIQGNSNRESYTVLMTGSASGVLVAPLVLFPYKERL FT PRDVSQSIPKGWGVGRTESGWMKSEAFLDCLRNVFHPWLLKNNVKLPVIVF FT VDGHRSHATYDTVNFCKSHGIILICLPPNSTHFIQPLDVSFFKPMKAAWDT FT ALVKWRFDNGGAMISKPDFAPLLKQAIDSMNGIEQTLINGFRKCGLHPWNS FT EAIDYASFPDPSAVHQSKQPNNIVSTTSTEPLNQSIPSTIFLRELNRRLSP FT EVLETFEDQRRNLMWTGELEASHLFNLWRTIKDEVDGPPEYFEIDPALVGL FT DITLDGEGSAVLNECAPNDVDCKKDGSGYLFIFVVRKSN" XX SQ Sequence 3500 BP; 1160 A; 613 C; 703 G; 1024 T; 0 other; ctcaaattcc gaacagactc aaattccgaa caccttgtcg taattgttat atttataatg 60 gatatagtat aataatatta agatattttg cattggggta atttccatac agtaagaatg 120 attgtaacga gatttttatg catttacagc gcctactttt aaaatagtaa gcatttgtaa 180 aaaagtgaca gcgcacaagt tatgaaaaag tgcacctgct agcaatttat gctcaaaccg 240 gcttcgttac tatttcagtt atttcagtgt tgtttcgcga ttttgtttga tttgtttcgt 300 actacaagat aactaaaggt aagaataatt gattgttgaa caacttaatg ccaattttgg 360 gcaaaaatcc tacaaaaatc aataaaatat gtagtgttcg taatatgaat catgttctga 420 agcttgattc aaatgtcgaa catagtgtaa gaatctttat ttcaggtgct tgcatcgtgt 480 gaaatggaag ctaaaaaaga aaacattcaa cgagcgttgg aatctgtagc tggcggtaca 540 tctctccgag cggcagctaa gatgttcaac atccgtgtaa caacgctgta tgatcggaaa 600 aaagcagtca atcgtagaga tgccgctgga aaacccactg tattatcacc aagtgaggag 660 ttggagattg ttggctggtt ggaggacatg gccgcagcag gtcatccgat atgcgaagag 720 atcctcaaaa tcaatgtggc ccagtacgct aagcagttgg gaaaatcgga agcattccga 780 tctggaatac cctcacgtgg ctgggttttg cgctttctaa aaaggcatcc aacgatttcg 840 aagcgaactg caaatcccat ttcgaaaagg cggattgtca cgccggatga agttcgttta 900 tggttccagg aagttgagcg gtatctcgca aagcacgatt acctggacat tctgaaccat 960 cctgaccgca ttttcaatat ggatgaaagt gctttcgaaa tggccccaaa tcctcgcaag 1020 aagaaagttt ttgctaagcg aggtatacgt attgttatat aatatattat tcaccggcta 1080 tttaacatct ctttctcttt gttgcaggta cgaaaaacgt acagtccatt caaggaaatt 1140 ccaaccgtga gtcatacacg gtactgatga ctggatctgc ttctggagta cttgtggccc 1200 ctttagttct atttccgtac aaagaaaggc tgccgcgaga tgtttcccag agtatcccaa 1260 aaggatgggg tgtcgggcgc actgagagtg gttggatgaa gtcggaagcg tttctagact 1320 gcttgaggaa cgttttccat ccttggttgc tcaaaaataa tgtgaagctt cccgtcattg 1380 tatttgtcga tggtcaccgt tcacatgcga cctacgatac agttaatttt tgtaaaagtc 1440 atggtatcat tttgatctgt ttgcctccga attctaccca tttcatccaa ccacttgacg 1500 tttcattttt taaaccaatg aaagctgctt gggacactgc actggtcaag tggcgattcg 1560 acaacggagg agcaatgatt tcaaagccgg actttgctcc tcttttgaaa caagctattg 1620 attcgatgaa cggcattgaa caaacattga tcaatggatt cagaaaatgt ggcctgcatc 1680 cgtggaactc cgaggccatt gattatgcat cttttcctga tccatctgca gttcatcaaa 1740 gcaaacagcc aaacaatatt gtatctacga cctctacgga accattaaac cagagcatac 1800 cttcaacaat tttccttcgt gaactaaatc gtcggctttc acctgaagtt ttggaaactt 1860 tcgaagacca gcggcgtaat ctcatgtgga ctggggagct tgaagcatcg catctcttta 1920 atttatggag aaccatcaaa gatgaagtag acgggccacc ggagtacttt gaaatagatc 1980 cagcattagt tggcctcgat attacccttg atggagaagg aagcgctgtt ttgaatgaat 2040 gtgctcctaa tgatgttgat tgtaagaaag acgggagtgg ttatctgttc atttttgtag 2100 taaggaaatc aaattgatat tttcagatca aaatgaagat gcgaactttt ttgagcatga 2160 acgccagacc tgtgaagtta tacaattgga gaatgacgaa catatgtttg aagagttgga 2220 ttttgagcgt aattgaaaga aattaaaatt aaatgttttt gtttttaatg taaacaatac 2280 tttaattaaa ggtgatcgct ctgatacggt agagaacatt tctctattac tgggctgttc 2340 cgcagaaaaa gacagggctg tagagcgtaa ctcagaagct caatcttcta tcaagaaaag 2400 gaaacgatgc gcaccagcag ttgcgacaag tgaggcttgg gaagagtttt ataaagaaga 2460 acaagaaata aaaaaacaaa ttcaacttgc aaaacaaaaa cgacaagaag aacggaaaca 2520 aaaatcattg gaaaaggaag aacagaagaa aataaaagcc gcgttgaggg tactgaagca 2580 gcatgaaaga gatgaactga aagaacagaa gaagaaagaa agagatgcag taagaattga 2640 aaaagctaaa gaacgcctgg agaaaatgaa gaacaaaatg aagaaacaag caaaaactat 2700 aaattaagac aatgcaaaaa ctatagacaa tttaaacttt actgtaattt catctccacg 2760 ataactgttt cattattatt tcatttacta ttgtagttat aattcgactt aataaatata 2820 tgtatgatac actgagcgca aaatcaacat atttttcttc aatgcactct ccatgttaaa 2880 catttttaaa accatttttg agtgaagggt ttgatttcaa ttattgtatt gctggtcacc 2940 tcgatagttg agcaaaaaaa aaaaattgtc ggtgattgta gcacagtcct ttgctacata 3000 ttttagcatg tgtcatctta ttggatctcc aaatagtcga gccaattttg ctgaataatg 3060 gtgcgtctgg ttgttggaat cccggcataa tcaaattcag tctgtaacat aattaaaaat 3120 taaatcttat ttatttcctg ttcaacaacc tggaacactc aaacaaactt ttccttttcc 3180 atttgaaaat atgtgagtga gaaacagcca tattgaaaga agcaaaagga tttttccaag 3240 aacaagccta taatattcga ttgattttca acgactcgtt taaataaggt atttttcaaa 3300 tgttaatcgc atcaatgatt taaattgtac agcataaaat caatagtgca aattattata 3360 ttatttcata tattctgtag tttgtagctc aaatgagcaa aattatgcgt cattcgatca 3420 caattaaaac aaaaataatg aaacaaattt attcaattca ccatgttcgg aatttgagac 3480 aaaagtgttc ggaatttgag 3500 // ID Gypsy-42_AA-I repbase; DNA; INV; 5243 BP. XX AC supercont1.331; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_AA_; KW Gypsy-42_AA-LTR; Gypsy-42_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5243 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.331; Positions 575082 569840. XX CC 'AAACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 920..5125 FT /product="Gypsy-42_AA-I_1p" FT /translation="MPLSTNTEPFLPGTIPFAQYLEQLEWMFQHNKFTPDD FT YKSSFLAVCGTEVFSRLKLLFPGRDFKDLTYNEITESLKKHYDKTDSDVIH FT SFKFWTRRQSQHEKAVDFVLDVKNLAEQCNFGDFKDRAIRDVLVIGTYDRQ FT LQNRLFDEEDLTAIKAEKIIVNKELASDRTNSLQRDDEPRMGVIARLGRRS FT PDKIGYRERSRSDSRNRSVRFRNGGFNRYNREYGNRNRSPSKSPYFCSYCK FT KRGHTKKYCFRLKDKKDSKSTPNVNFVGSPKPSTSDSSGLFKRLATDLEEN FT SDEEDFACLMIPSVNKINQPCYVEAVVENKSFNMEIDCGSAESVISEELFL FT RSFKNYCLERCNKRLVVIDGKRLQVLGKVDVLVQLNGLKQRLPLIVLRCDN FT DFVPLMGRSWLDVFFGTWRQAFTQTMTRIHSLQNEELVDIKSKFPSVFDKD FT MSRPIKGYVGDLVLKDDKPIFRKAYDVPLRLKQKVLNCLDDLEKDGIIEPI FT EASEWASPVVVVIKKDQGIRLVIDCKATINKVIVSNSYPLPLIQDIFSTLS FT GAKYFCSLDLAGAYTQLLLSERSKKFMVINTIKGLYRYNRLPQGAASSAAI FT FQKVMDQVLHGLDNVVCYLDDVLICGKTLKECKEKLYLVLERLAKANIKVK FT MEKCKFFVNELPYLGHVITDKGLLPCPDKVRTIREAKAPENVTELKAFLGM FT VTYYAKFIPNLSSRINCLYSLLKKDVKYVWSTECDKAFNDCKYYLLKPKLL FT EYFDPEMPVVIVTDACSYGLGGVMAHVVNGEERPVCFTSFSLNSAQKKYPI FT LHLEALAIVSTVKKFHKYLYGKTFTIYTDHKPLIGIFGKEGKTSISVTRLQ FT RYLRELAIYDYEIIYRPSNKMGNADFCSRFPLPDAVPANLDVEFVKSLNFT FT NEFPINYNEIAQATKTDNFTKQLMKYIDKGWPERFDKQFKNAYAIHQEMEI FT VDGCILFQDRVMIPDDMKTKILKLLHMNHSGITKIKQLARRCVFWFGLDSD FT IEKFVKSCNICNEMTAVTRPAKYSQWIPTNKPFSRVHADFFHFDKKIFLLV FT VDSHTKWVELEHMPHGTNTKNVIKIFLNIFARFGLPDVLVTDGGPPFNSDL FT FVNFFEKQGIIVMKSPPYHPESNGQAERMVRLVKDVFKKFIRDAEMRKLDI FT NEQIAFFLINYRNTCLGANDKFPSEKLLSYKPKVLLDLIHPNNNYKKKLTN FT SVNESQRNDKRKDDAPDPFACLRVGDLLYYRNNNKTDIRKWLPVKFIKKTS FT SNILQVSLGGRVVQAHKRQLKLTDSRRKTSNRFVFHGESVASIPLEQPSTS FT DRSIVPSHPHQADMADTTKRSNKRRREDDEGDCVGVISSDSESDFYGFAAD FT SFIFGTHQDPEFLTHIGNDSIRRSGRRLKKKRKEDYVYY" XX SQ Sequence 5243 BP; 1667 A; 824 C; 1101 G; 1651 T; 0 other; gtgtcgacaa ggattgaact ggattttttt ttaaccgcgt gtgaaagtga aaatttcagt 60 gcagatcaaa gttttcgggt ttttgtgaac cgtcaagtgc gccggagttt gttcggattt 120 aagatttgtc cgtttgacga aagttgaaag gccacagtac gtaaaaaaaa attgatcaaa 180 agtaagaaat cattattagc tgaagcatat attgaagcgc ttttgtttct gctagtggtt 240 ttcaagcctt tttgcagaac gtttgtcagc ttctgtgaaa atcgatattc cagctaaaac 300 ctgtgatttt gttttgccgt gaagttttga gcacatttgt aaaaagtttt tatagagaaa 360 tctagcattc gcgtcaccag taacgccatt ttgtgaagta atatttgcag ttggttattt 420 cgagttgctg atcttttatt tatctgctat tcataccatt gttttttttt tctccatctt 480 ttattttgtg ctcaggcaca gagttctggc gtcagtggag atcacaccgt gattagtgag 540 aaagccaata gtgagagatc tttcacctca gtagatcttc agtgagagct atttttggaa 600 gcaaatatca ctctgtttgt tttgaaaaaa ggttatttag tgtagaaagg tcattgactc 660 gcttgtattt tggtcagaag atcgtacgtt gacctttgca cctactacat tgctctattg 720 ggctgttgct gtgaagaata cgctgctgct gtgaaaaata cgctattact gtggaggaaa 780 cgttgctgct gttaagaata cgctgttgtt gctgtgctgc tggtgccatt gctgctgttg 840 tggatatttt gcaccgctga ttgacgctgt tgttgctgag ttcgagtgat cgatctattt 900 ggattacatt ggtcgaaaga tgcctctatc taccaatacc gaaccgtttt tgccaggaac 960 cattcctttc gctcaatatt tagagcaact tgaatggatg tttcaacata acaaattcac 1020 gcctgatgat tataaatcat catttttggc ggtttgcgga acggaagttt tttcccgttt 1080 gaaactactg tttccaggtc gtgatttcaa agaccttaca tataatgaaa tcaccgagag 1140 tttaaagaag cattacgaca aaacagattc tgatgtcatt catagcttca aattttggac 1200 gcgcaggcag agccaacacg aaaaggcggt tgattttgtt ctcgatgtaa agaatttggc 1260 agagcagtgt aacttcggtg attttaaaga tagagcgatt agggacgtgt tagtaatagg 1320 aacatacgac cgtcaattac aaaatcgact atttgatgag gaggacctta ctgctatcaa 1380 ggcggaaaaa attattgtta acaaagaatt ggcttcagat agaacgaatt cattacaaag 1440 ggacgatgaa ccgcgaatgg gtgttatagc caggctcggg cgtagatcgc ctgataaaat 1500 aggctataga gaaagaagta gaagtgacag tagaaatcgt tcggttcgat ttaggaatgg 1560 aggtttcaat aggtacaatc gtgaatacgg taacaggaat agaagtccca gtaaatcgcc 1620 atatttctgt tcctattgta aaaagagagg ccacactaaa aagtattgtt tccgtctcaa 1680 agacaaaaaa gatagtaaga gtacaccaaa tgtaaatttt gtcggttctc ctaagcctag 1740 tacctccgat tcttctggcc tttttaagag acttgctaca gatttagagg aaaactctga 1800 tgaagaggat tttgcatgtc tgatgatacc ttcggtcaat aagataaatc aaccttgtta 1860 tgttgaagca gtagtagaaa acaaatcatt taatatggaa attgattgcg gttcagcaga 1920 aagtgtaatc tcagaagagc tgttcctacg aagttttaaa aattattgct tggaaaggtg 1980 caacaaacgg ttggtggtta tcgatggcaa aagattacag gttttgggaa aagtggatgt 2040 attggtacag ctgaacggtt tgaagcaacg tctgcctttg attgtattgc gatgtgataa 2100 cgatttcgta cctctaatgg ggagaagctg gctggacgtc ttttttggta cctggaggca 2160 agcatttacc caaacaatga cccgtatcca ctcgctgcag aatgaggaat tagtagacat 2220 taaaagtaag tttccttctg tttttgataa agatatgtct cgtccaataa aaggttatgt 2280 tggtgatttg gtgctgaaag atgataagcc tatttttcgt aaagcgtacg atgttccttt 2340 aaggttgaag cagaaggttt taaattgttt ggatgattta gaaaaggatg gaattattga 2400 gccaattgaa gccagtgaat gggcctcacc cgtagtggtg gtaattaaaa aagatcaggg 2460 tattcgtcta gtcattgact gcaaggcaac aataaacaaa gtaattgttt caaattctta 2520 ccctctacca ttgatacagg atattttttc aactctcagt ggagctaaat atttttgctc 2580 tttagattta gcaggtgctt acacacaact tctgttgtca gaaaggtcga agaaattcat 2640 ggtgattaac accataaaag gtttatatcg ttacaaccgt ttaccacagg gagctgcttc 2700 aagcgcagcc atatttcaaa aagtaatgga ccaagtactt catggtttag ataatgttgt 2760 ttgctatttg gatgacgttt taatttgtgg gaaaactttg aaagaatgta aagaaaaact 2820 ttatttagtt ttggagagat tagctaaagc taatataaaa gttaaaatgg aaaaatgcaa 2880 attttttgtt aatgagttgc cttatttagg acatgtaata actgacaaag gattattacc 2940 ctgtcctgat aaagtgagaa ctattcgcga agcgaaagct ccggaaaatg ttacagagtt 3000 aaaggctttt ttgggaatgg tgacgtatta tgctaagttc attccaaatt tatcttcccg 3060 aatcaattgt ctctatagtc tgcttaaaaa agatgttaaa tatgtttgga gtacggagtg 3120 tgacaaagca tttaatgatt gtaaatatta tcttttgaag cctaaacttt tggagtattt 3180 tgatcctgaa atgcctgtag taatagtcac tgatgcttgc tcctatggtt taggaggtgt 3240 aatggcacat gttgtaaatg gcgaagaacg tcctgtttgt tttacgtcct tttcattgaa 3300 tagcgcacaa aagaaatacc caattttaca tcttgaagct ttggctattg tgagcaccgt 3360 aaaaaaattt cataaatatc tctacggaaa aacatttact atttatacag atcataaacc 3420 tttgattgga attttcggaa aagaggggaa aacttctatt tctgtaacaa ggttacagcg 3480 ttatttgaga gaacttgcta tttacgatta cgagataatt tatcgtccat caaataaaat 3540 ggggaacgcc gacttttgtt ccagatttcc acttccagat gctgttccgg caaatttaga 3600 tgtggagttc gttaaaagtc ttaattttac aaatgaattc ccgatcaatt acaacgaaat 3660 agcccaggca acaaaaactg acaattttac aaaacaactt atgaaatata ttgacaaggg 3720 ttggccagaa cgttttgata aacagtttaa aaatgcatat gctatccatc aggaaatgga 3780 aatagtggat ggttgcattt tatttcaaga tcgggttatg attcctgatg atatgaaaac 3840 aaaaatttta aaattactcc acatgaatca ttcaggtatc actaaaataa agcaactggc 3900 acgcagatgt gttttctggt ttggtttgga ctcagatatt gaaaaatttg tgaaatcttg 3960 caatatatgc aatgagatga cagctgttac aagacctgcc aaatattctc aatggattcc 4020 aaccaataaa ccattttcac gcgtccatgc tgattttttt cattttgaca agaaaatttt 4080 tctgttggta gtggatagtc atactaaatg ggtagaattg gaacacatgc ctcatggaac 4140 aaataccaag aatgttatta agatttttct taatattttc gccagatttg gtttaccaga 4200 tgtcctggta acagacggag gtccgccatt caattccgac ctgtttgtaa atttttttga 4260 aaaacaaggg ataattgtca tgaaaagccc tccttaccat cctgaaagta acggacaggc 4320 agaaagaatg gtgcgtttgg taaaggatgt atttaaaaaa tttattaggg acgcagaaat 4380 gaggaaatta gatataaatg aacaaatagc attctttctc atcaattaca gaaatacgtg 4440 ccttggcgca aatgacaaat ttccatctga aaagcttctt tcatataaac caaaagtttt 4500 attagacctg attcatccga ataataatta taagaaaaaa ttgactaatt cagtgaatga 4560 atctcaaaga aatgataaaa gaaaggatga cgcacccgat ccttttgctt gtttaagagt 4620 tggagatctg ttatactata gaaataataa caagactgac ataaggaaat ggttacctgt 4680 aaaatttata aaaaagactt cttcaaatat tctacaggtt tctcttggtg gaagggtggt 4740 ccaagcgcat aagcgtcagt tgaagttgac agactctcgc cggaaaacgt caaatcggtt 4800 tgttttccac ggagagagcg ttgcatcgat accactagaa caaccaagta ctagcgacag 4860 aagcattgta ccgagtcatc cacatcaagc cgacatggct gatacaacca agagatcgaa 4920 caagagaaga agagaagacg atgaaggaga ttgcgttggt gttattagtt ctgattcaga 4980 atctgatttt tatgggtttg cggctgattc atttattttt ggcactcatc aagatccaga 5040 gtttttaaca catattggaa atgattccat tcgaagatca ggaagaagat tgaagaaaaa 5100 acgaaaagaa gattatgtat attattaatg attcctaatt tgtgtttaaa taatgaattg 5160 tatgttccat ctgaattgta gcataaagaa attgctttcg agtttttaat cagtgatatt 5220 tttggcttaa aggggtaagg agt 5243 // ID Mariner-7_SM repbase; DNA; INV; 1954 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-7_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1954 RA Jurka J., Bao W. and Tempel S.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 151-151 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 661..1608 FT /product="Mariner-7_SM_1p" FT /translation="MENGEDFYKFGQLLGFSKSSIKTIIYKYKNFAIVSKG FT RRGGREKRISEENGNKLLDYVENNFYASLEEMRNFLMQRNCRASKGTICSY FT LRNQLISYKKVKPSIAARNSDRVLEMRMQYVTRFQEEAWHDKKLIYVDECG FT FNTWTSQTFGRKKKGKRIYSTVPSNRGPNMSLALAIGVKGPIHNKLIVGSF FT KKETFQIFLNELTSKLNGDGYHIINDNASIHGGALSTNNFIEYLPPYSPFL FT NPIESVFSKIKLSIRKTIARKNGFSSMSFPSRFDVINEALEIELRKEEYKN FT LKKYFFHIRRFFPNCCKKIEIFGD" XX SQ Sequence 1954 BP; 788 A; 262 C; 299 G; 605 T; 0 other; tacacctgat tcaattcatt aaaaactcct ataataaaat gaaaaaatca ataaagttat 60 aatataggaa catacatact tacccgtgtc tgcatgtaat ttgttcaact gaataaaaac 120 ttatattata ggaaggaagg aaattgaaga aaattttttt tttaaacata aacacataat 180 acatagtaaa taatgcatgc aaactttcaa aatgtagaaa tgttcaattc attaaaaact 240 ctatataaaa ctatttttag taacatttta gacaattttt tataaatggc gaataaaatt 300 ttctatgaaa actcggatga ttctacattg agtttccgag aaaatgatga aataaattgg 360 taagatcttt catttaaatt caatcaattt attattttta cagcttttca gatatttcag 420 aaatatctgt tgcaacttta cctttttgcg gagaagaaga attaaaaagg caagatttat 480 cttaaagtta aaataaacta atttttttac agcttttgca atttatctga atgtatcgaa 540 ccaggaccct caaatgtgga attgagaaaa aaaaggaaaa ggtacgtttt aataacttaa 600 gtctaaaaat aatattttta tagaattaca gaagtagaca aaattaggat tatcaatgcc 660 atggaaaatg gcgaagattt ttataaattt ggacaattat taggattttc aaagtcatct 720 ataaaaacaa taatttacaa atataaaaat tttgccattg tttcaaaagg aagaagaggg 780 ggccgtgaaa aacgaatttc ggaagaaaac ggaaataaat tattagatta tgttgaaaat 840 aacttttatg caagcctcga ggaaatgcga aatttcttga tgcaaagaaa ttgtagagca 900 tccaagggca caatttgttc atatttaagg aatcaactaa tatcatataa aaaagtgaag 960 ccaagtatcg ccgcacgaaa ctcggatcgc gtactggaaa tgcgaatgca gtatgtgacg 1020 cgattccaag aagaagcttg gcacgacaaa aaacttatat atgtcgacga atgtggcttc 1080 aacacttgga caagccaaac atttggaagg aaaaagaaag ggaaaagaat atattcgaca 1140 gttccaagta atcgtggacc aaatatgtca ctggctttgg caattggtgt aaaagggcca 1200 attcacaaca aattgattgt aggatcattc aaaaaggaaa catttcagat tttcttgaac 1260 gaacttacgt ccaaattgaa tggagatgga tatcacatta ttaacgataa tgcatccatt 1320 cacggaggag cattatcaac aaacaacttt atagaatatt taccaccgta cagtccgttc 1380 ctgaatccaa tcgaaagtgt cttcagcaaa attaagttaa gcatacggaa gactatcgca 1440 aggaaaaatg gattcagctc aatgtccttc cctagtagat tcgatgtaat caatgaggct 1500 ctggaaatcg agctaagaaa agaagaatat aaaaatctta aaaaatattt ttttcacatt 1560 cggcgatttt ttccaaattg ctgcaagaag attgaaatat ttggagattg aattgtattt 1620 taataaaatt tatttgtcaa ttaatctgac ttacatttat agtataataa ttccaataaa 1680 ttacataaat tggcataaat tcctaaaaaa aacctaaaaa aattaattaa aactgaatta 1740 aaaataaatg atgattaaaa gtattattta agagcttgaa taatgatctc ataactttac 1800 gagatcacaa tccaagcctt aaaatatact ttattaaaaa atttacctgt tagccatttt 1860 aaaaatggct aattcataaa attaatctta attgtaattg gatgttcatt aaaacttcta 1920 ttatagaagt ttttaatgaa ttgaatcagg tgta 1954 // ID PIVE repbase; DNA; INV; 5953 BP. XX AC . XX DT 19-FEB-2008 (Rel. 13.02, Created) DT 04-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Integrated virus-like element: consensus. XX KW DNA Virus; Integrated Virus; PIVE. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RA Rebrikov D.V., Bulina M.E., Bogdanova E.A., Vagner L.L. RA and Lukyanov S.A.; RT "Complete genome sequence of a novel extrachromosomal virus-like RT element identified in planarian Girardia tigrina."; RL BMC Genomics 3(1), 15-23 (2002). XX RN [2] RP 1-5953 RA Jurka J.; RT "Virus-like element present in multiple copies in Schmidtea RT mediterranea genome."; RL Repbase Reports 8(2), 166-166 (2008). XX DR [2] (Consensus) XX CC There are over 100 remnants of this DNA virus interspersed in the CC genome. The reconstructed copy contains at least three ORFs CC homologous to virus proteins. PIVE is distantly related to CC another planarian virus called PEVE (ref. 2), which appears not CC to integrate in the genome. PIVE has 88 bp terminal inverted CC repeats (TIRs). Current data do not account for its relative CC successful integration to the genome. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 218..1156 FT /product="PIVE_1p" FT /note="Parvo non-structural protein NS1." FT /translation="MDQLQLYNELSIFFESFIRNSTIYINSDHRGKIDSIL FT SKYXGKIPDKDKEYIQNCLAIYTVLSQRAGKTKTIILEGTSNAGKTLFTAI FT CRGTWNHGFFTGAQSNSEFCFDDCVGKEIIIGEEIYCDFKNVNLIKLMCEG FT NIFFKVSRKNDSKSALPRCPCIISNNHPIWKYASEHELALRNRSLYIKVDK FT PYVFNSNLDVFSNVKILNDVKFDFFNKFLKFFYLRLLLLLLLLLLLLLLLL FT LLLLLLHKKFNXTNKTYINNYNFRXTPKXTRIEXLQVEHVKXVNTVFGRES FT STHRLIGMDASSSKIXYKLLN" FT CDS 1660..1172 FT /product="PIVE_3p" FT /note="Replicase." FT /translation="MESLIQKFGINTKPNFLDFTIPKINLYDWKSHVFVGE FT SGIGKTNFALSHFQNPVHVKHLQMLNRITTNXESCDGIVFDDLYFNKTKFE FT EFLCLIDCAFENAVNIKYSVAMIPRGLPRIFCINDISLFYPEYISDSCMDA FT VKRRLKIHYFHNKLFKKYIAEKVK" FT CDS 2788..5673 FT /product="PIVE_2p" FT /note="Distantly similar to ORF1L from PEVE (see FT ref. 2)." FT /translation="MKMHFEKQEHAGLAERDRLNAINATAQIFNVFATTTF FT GKVQGVLRQAKMTATFKSNPYQQLLLSNSFKSLSSDLFKPLKINQTFKNNI FT LNYTVSNIIRPGMKGFMKKIGGVKQNINDLAAVAGFTIGFYISAYIRSFYA FT EPLELYVKDHLKPYDNKYLIKAFNEISNHDGYVDITNIFLNDLXLDDFRLF FT KEKDFEEIYQPNRXKDFKDLFNLFDKQVAEISIYNGIKMLEKALFTHEKEG FT DEIGDGIYXLIKXDDDMKRLLDDNVGLFEQLIENSKNFKNQNFLGLINQYE FT RHWGPDVGKPGDLHRYVNFYTNMWYHLHYSQGTLHDPKHLPPGYEGKQPLA FT EIISRFPYXVVRKKRNVSYKAHTRLELNKKFDDYLKLNXXQYHTGPKINTK FT GLPDLHYRRLSAIQLNMLRYYYHNNHHKSATHEEWKKIVDEKIKKYDTHTE FT SXELDIDDRNKLHHWYISEHLLQTHEDGVTKYRGTLPAINAFNNLFSXINM FT TTHAPNTTTAPVVPPSAIPGSGGGNGILRSQQGDFRPSLDNNSLTFTSTNC FT GASCIETTATGEWYNFPVEYYYPHTMYGNPDFWAAMEXYLWFKPVAMEFEM FT FNLQVVGETSPQNFSGLQNQVELLVFRDEANIYGPCAAPPQMAKAFKDIFL FT NEASLHNGYNINNQLSKLTPITLPKAGWIYYPETCPEVEIVNCNTSETVNY FT SYQASKVQHPRYIREFWDQHFDLTIAKDXDHIKWHRFDYFNCKKIKPFSYA FT NALGRRTITTDKRVMPFPSKIYCPKWKRKWDXKLKRQHYEYVFGEMTLHKK FT LLPTDKQLLIDEWYSTKREAVEEYXGHCHQEDVSEHNPMKMMFFKLNTMIG FT PGGNKINLTCTFNWKIKYTFELFGKLPTTNIRQIYKVTDPLLQGTQTWQPT FT TKWLTPSIPYCTPYFFYIRDEGTKEGEDELFPMIKHNAARFKYPLVVQFGM FT HAGLPGV" XX SQ Sequence 5953 BP; 2124 A; 854 C; 907 G; 2002 T; 66 other; ataagacccc cctatacccc cctttcccaa gtctatwtta aaagctcacg gccsacacca 60 kattcttcga gggatgaatt caaaacattt tatatancac agttrcagtt gattccccaa 120 gataatatag tcgcagtaac tgtaactgtg tawataataa tatttaaaat gagtcataca 180 gttagaatca ataaaataaa atttcttccc ttattttatg gaccaattac aattgtacaa 240 tgaattatcc atcttttttg agagtttcat tcgtaattct actatctata ttaatagtga 300 tcatcgggga aaaattgatt caattctytc taagtatwct ggtaaaattc ctgataaaga 360 caaagaatat attcaaaatt gtctcgcaat ttatactgtg ttatctcaga gggcaggaaa 420 aaccaaaacc attattctag aaggaacttc aaatgctgga aaaacattgt ttacagcaat 480 atgtcgcggt acatggaacc atggcttttt tactggtgct cagtctaact ctgaattttg 540 ttttgatgat tgtgttggta aagagattat cattggagaa gaaatctact gtgacttcaa 600 aaatgttaat ttaataaagt taatgtgtga aggtaatata tttttcaagg tgtcaaggaa 660 aaatgatagt aaatccgctt taccaagatg tccttgcatc atctccaaca accatccaat 720 atggaaatat gccagcgaac acgaactagc actaagaaat cgatctttat atataaaggt 780 tgataaacct tatgtgttta atagtaattt agatgtgttt tctaatgtaa aaattttaaa 840 cgatgttaaa tttgattttt ttaataaatt tttaaagttt ttttacttaa gattattatt 900 attattatta ttattattat tattattatt attattatta ttattattat tattattaca 960 taaaaaattc aacawaacta ataaaacata tattaacaat tacaattttc gacakacacc 1020 taaatasaca cgcattgaas tgttacaagt tgaacacgtt aaattwgtaa atactgtttt 1080 cggtcgagaa tcatctacac atcggttgat tggtatggat gcatcatcgt ctaaaataaa 1140 wtayaaatta ttaaattaat tataattatt actttacttt ttcagcaata tattttttaa 1200 atagtttatt atgaaaataa tgaattttca atcgtctttt cackgcatcc atgcagctat 1260 cagaaatata ttctgggtaa aaaagtgata tatcatttat acaaaaaatt cgaggtaatc 1320 ctctgggtat cattgccaca gagtatttta tattcacagc attttcaaac gcacaatcaa 1380 ttaaacataa gaattcttca aatttggttt tattaaaata taaatcgtca aaaacaattc 1440 catcacagct ttcttkattt gttgttattc ggttcagcat ttgcaaatgc ttcacatgaa 1500 ctggattttg aaaatgagac aatgcaaaat tagttttgcc aattccactt tcaccaacaa 1560 atacatgact tttccaatca tataaattaa tttttggaat agtaaaatcc aaaaaattyg 1620 gtttmgtatt tataccgaat ttctgtatta gactttccat aattgttttt ttctttgtta 1680 aaaaatgaaa tggttcattt ttttctaaaa tttttattgc ttcattgaaa tttggtgcat 1740 ttaatgccag tgaaaagtra tctgattatt tttttttata gtaaatttac ttggatttcc 1800 gattctgatt ckgattcctc crtcgactct ggtatcgggc tttgttgcat attcgatggc 1860 gtggggtctg tcagccccgc tgactcctga aacaaatggg acattgagtt tatgcctaat 1920 ggcataaact gacatgttag cataaaccta taaaytaata aataaatatt aatttaaaat 1980 atataaatta cttcgagata gacctgcata tggtggaatc cagcatcacc cttctcttcc 2040 tgtcccacca cgaaacgggt gaaggaagwg tcattgaatg ttttctgtaa attgtcttcc 2100 aattctctat acttgccaat agatatgttg aaagtgaagc tgttaaatct tctagctcct 2160 gtataattgc gtttctcatc atattgattg atttgttgtt gaatattgtc cgttgtagac 2220 aacgaactat ctccattagg agaattgcrt cttgatattc gttcagtatg aatctgtaac 2280 tccccgccat ccaatcgagg gcttttacat atgcgttcac tcggtgtacc agctcgcata 2340 ttggacatat gtgccggaag cacatcggaa caatcctcgt ttgtgcttct tttgttgcga 2400 atattgttgt ttgtggacga attcctattc gatatagatt ctcgatttct tccagcagca 2460 cttgaagttc tgcgcctcct atgaactgac gaaattgagg mgtcaggttc attagattca 2520 ttgaatatat gggatttytc aggaaataga tccataagtc agttttatct tgttctagta 2580 aaatagctaa attttaaata taaaatttcc aaataaaatt rtcaacatta ctatcrattt 2640 tatcaaatat tacaaataat tggttggcgt ggtttattaa acaaaaataa tgttaataat 2700 aaagatgtta aacttaatat aactgatatt gaattaataa aaaggttttc tgaraatatt 2760 aaagtggcac aggatttacg attgtttatg aagatgcatt ttgaaaaaca agaacatgct 2820 ggattagctg aaagggatag gttaaatgca attaatgcta cagctcaaat atttaatgta 2880 tttgcaacta caacgtttgg taaagtacaa ggtgttttac gacargctaa aatgacagca 2940 acttttaaat ctaatccnta tcaacaatta ttgttatcaa attctttcaa atcattaagc 3000 agtgacttat ttaaaccgtt aaaaattaat caaacattta aaaataatat attaaattat 3060 acagttagta ayattatacg accgggtatg aaagggttta tgaagaaaat tggtggtgta 3120 aaacaaaaca taaatgatct tgctgctgtc gctggwttta ctattggrtt ttatataagc 3180 gcatatatta gaagttttta tgctgaaccg ttggaattat atgttaaaga tcatttaaaa 3240 ccatatgaca ataaatattt aataaaagcw tttaatgaaa tatcaaatca cgatggatat 3300 gtagatataa caaatatttt tcttaatgat ttaaawcttg atgattttag attatttaaa 3360 gaaaaagatt ttgaagaaat atatcaacca aatagamaaa aggattttaa agatttattt 3420 aatttatttg ataaacaagt agcagaaata tctatatata atggtattaa aatgttagaa 3480 aaggcattgt ttacacatga aaaagaaggc gatgaaattg gtgatggkat atatgawttg 3540 attaaaaamg atgatgatat gaagcggtta ttagatgaca atgttggttt atttgaacaa 3600 ttaattgaga atagtaaaaa ctttaaaaat caaaattttt taggtttaat taatcaatat 3660 gaaagacatt ggggtcctga tgtagggaaa ccaggagatt tacatagata tgttaatttt 3720 tatactaata tgtggtatca tttacattay tctcaaggwa cattacatga tccaaaacat 3780 ttacccccag gatatgaagg caaacaacca ctagcagaaa taatatctag atttccatat 3840 gawgttgtta gaaaaaaaag aaatgttagt tataaagctc atacaagatt agaattaaat 3900 aaaaaatttg atgattattt raaattaaat tnaaawcaat atcatactgg tccaaaaatt 3960 aatactaaag gattaccaga tctgcattac agaaggttaa gtgctataca actaaatatg 4020 ctgcgttatt actatcacaa taaccaccac aaatctgcaa cacatgaaga atggaaaaag 4080 atagttgatg aaaaaattaa aaaatatgac acacatacag aatcacwgga attagatata 4140 gatgatagaa ataaattaca tcaytggtat ataagtgaac atttattaca aacacacgaa 4200 gatggggtta caaaatatag aggcacactt ccagctataa atgcgtttaa caacttattt 4260 tcggrcatta atatgacaac tcatgcccct aatacaacta cggcgcccgt ggtaccaccc 4320 tctgctatcc ccggcagtgg gggaggtaat ggaatactta gatctcaaca aggagatttc 4380 agaccatctt tggacaacaa ctcccttacc ttcaccagca ccaattgcgg agcttcttgc 4440 attgaaacca ccgccacagg agaatggtat aatttcccgg tcgagtatta ttaccctcat 4500 acgatgtatg gtaatccgga tttytgggca gcaatggaga ratatctttg gtttaaacca 4560 gtcgctatgg aatttgaaat gtttaattta caagttgtgg gtgaaacttc gccacaaaac 4620 ttctccggac tacaaaacca agtagartta ttagtttttc gagatgaagc aaatatatat 4680 ggtccatgtg ctgctcctcc acaaatggcw aaagcattta aagatatatt tytgaatgaa 4740 gcatctttgc ataacggata taacattaat aatcaattat ctaaattaac tccaataaca 4800 ttacctaaag ctggttggat ttattatcca gaaacatgtc cagaagtaga aatagttaat 4860 tgtaatacat ctgaaacagt taattattca tatcaagcat caaaagttca acatccaaga 4920 tatattcgag aattctggga tcaacatttt gatttaacaa twgcaaaaga taawgatcat 4980 attaaatggc ataggtttga ttattttaat tgtaaaaaaa ttaaaccatt ttcttatgca 5040 aatgctcttg gaagaagaac tataacaact gataaaagag ttatgccatt tccwtcaaar 5100 atatattgtc cnaaatggaa acgaaaatgg gatcmtaaac ttaaacgwca acattatgaa 5160 tatgtgtttg gtgaaatgac tttacataaa aaattactac caacagataa acaattgctt 5220 attgatgaat ggtattccac caaaagagaa gctgttgaag aatatakagg tcattgccat 5280 caagaagatg tttctgaaca taaccctatg aaaatgatgt tttttaaatt aaatactatg 5340 attggacctg gtggtaataa aattaattta acatgtacat ttaattggaa aattaaatat 5400 acatttgaat tattcgggaa attacctaca acaaatatta gacaaatata taaagttact 5460 gatcctttgt tacaaggaac tcaaacttgg caacctacta ctaaatggtt aacaccttca 5520 attccttatt gtactccata ttttttttat attagagatg aaggtacaaa agaaggtgaa 5580 gatgaattat ttccaatgat caaacacaat gctgcaagat ttaaatatcc tttagttgta 5640 caatttggta tgcatgctgg tttacctggt gtrtaattta ttaaatatat gtatttattg 5700 ttgtttgatt aattttaaat tttattatat agattaactg tatgactcat tttaaatatt 5760 attattaaca cagttacagt taactgcgac tatattaatc ttggggaatc aactgtaact 5820 gtgttatata aaatgttttr atgttttaaa ttcatcccac gaagamactg gtgtcggccg 5880 tgagctttta aaatagactt gggaaagggg ggtatagggg gggtcttatt aatatagcca 5940 tggtatataa tat 5953 // ID Gypsy-7_AA-LTR repbase; DNA; INV; 205 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_AA_; KW Gypsy-7_AA-I; Gypsy-7_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-205 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 984-984 (2011). XX DR [2] (Consensus) XX SQ Sequence 205 BP; 57 A; 35 C; 35 G; 76 T; 2 other; tgagattgtt aattttcatg taacgtttta tttcagtgca tttcawtttg atctagatca 60 cggatctaga tgtaagaagt ttttccctca gttttgcacc atgattggtc cgtatcaaat 120 gcacattatt tgttcatgtt aaaaataaag taattaacgg ttcgaagtaw atcgcgtcgt 180 ttccattaca tccgggaaat tccca 205 // ID BEL-194_AA-LTR repbase; DNA; INV; 296 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-194_AA_; KW BEL-194_AA-I; BEL-194_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-296 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 880-880 (2011). XX DR [2] (Consensus) XX SQ Sequence 296 BP; 101 A; 53 C; 54 G; 84 T; 4 other; tgtggatact ggcatcactt ctggtcacga cactgtgacg gctggaacag ccgtttatgt 60 acctgtcaak ctgacaggtc ckaaagggaa agacaaaaag aattgaaaag ctttatctat 120 ctatcaacca ttgttaaagt agaacgtgaa tttgatgacg ctaaggtgaa catttataaa 180 gcacaatgca cmagtaactg aactaaatct aatttcctta cagctaaatt acgagctaaa 240 atctgtaaaa tctaaaawta gttgttcctt acctaaattg ttagtggttt catgca 296 // ID BEL-209_AA-I repbase; DNA; INV; 6903 BP. XX AC AAGE02017349; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-209_AA_; KW BEL-209_AA-LTR; BEL-209_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6903 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017349; Positions 41646 34744. XX CC Positions [5956-6513] - Integrase core CC 'GACGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 28..6903 FT /product="BEL-209_AA-I_1p" FT /translation="MSSQANVRQTRSQTRAQQANANPPISGDDGDRMSDRS FT SEDSFIPSIVDSGRGESRECAGCDRPNNAEKYMVECQKCNSWYHFSCANVN FT TVTVRSTSFVCTACMRSEISVALGSARSQISVISSSSSARAARMARELQRL FT DEEKKVMEELSRERIERERALNERELQERMEREKQFIARKHELLNRQDDGE FT GRSVRSMRSSQRSTQRTEDWVKQTASEAVGPSSTPVDAEDTSLGYPQGVHP FT SSTPLKIVDAVFPTAVDDDLKDTGSEVKSIPEALGSLPIADASEEPIEGIG FT KMDDAGVRNQPGLPTVDVKPYSDLLKLDEVVPAGVFSKVKRVNPSTYQRWS FT VETGELRQQNAKVIQQQQQMEVEQRNMHELVRKLQFDIGTGRKREQDLQSQ FT LKSLQIHHAEELRVIHNSETGLQNQLRQLQCENASLRNQIASLNGELQQIS FT TSREKLQVLLKEREDGITSKESQIRTLQMEVVEAAEQMRNLEADLREQLNR FT HKHECDTLELQRAELENEIQGLRTTEQQLQKEMEACLQREREAIRERNTAE FT QEYWDLHDVVQQFVNRNQHWASEGDCSPHLPPPPAAWLDQAANEDCLNPLP FT PPPPHLSNSAYNCLPVMSGGGIPLVPPFAIGHVGPSAHQIAARQVVTKELP FT VFSGDPIDWPLFISSYQHSTEACGYSNAENLLRLQRSLKGSAKESVSSFLL FT HPSTVPQVMSTLQQLYGRPEQIVNNMIAKVRATPPPKPDRLETLVSFGLAV FT QNLCGHLKAVGLERHLANPILLQELVDKLPATVKFNWALHQEQVPEVDLNV FT FSAYMAKISSAASSVTQLTAVSQKVKDERIRPKDRSFVNTHVSADPPKTSR FT EEEPRTATERTRGKQKDGAAGTIGSKKCAICNVDNHQIENCASFKALDLDG FT KWKAVKVNKLCGRCLTSHANWPCKGEICGINSCPKRHHRLLHFDPPETAKT FT NSAVVTVHRQISSSTLFRILPVTLFGKNGQFDTYAFLDDGSSVTLVERSIA FT EALGVQGKVETLRIEWTGGVNKTITGAEVVTMEISETGGSKRYRLSEVYTV FT DNLGLPQQTTDYAELATRFAHLDKLPVKSFRSAVPGILIGQSNCHLLATLK FT LREGRLNEPIATKTRIGWAVCGSLRRSQAVTMQTQLHMYAEPSTVDLHEYV FT RRFFEVESLGVAVVPEVKGVEEERAYKILDETTRRTSTGKFEIGLLWKHDY FT IEFPESKPMAERRFKCLEKRLSRNPELYNSVRQQIADFTAKGYIHEATVEE FT IEGFDLRRTWYMPIGVVVNEKKPGKVRVIWDAAAKVDGVSLNSLLLKGPDL FT LTSLLSVLFAYREREVAVSADVKEMFLQMLVREQDRSALLFPYRDSPELPM FT TTMVSDVAIFGAACSPAHSQYVKNLNATEQEAELPRGAEAVKKRHYVDDYV FT DSFDTAEEAFEVAKEVIEVHKRAGFHIRNWMSSDRSVLEKLDEANVKPSKA FT MLPDKDVGCERVLGMAWIQQKDEFVFSLQFCEKVRILLDGDAIPTKREMLR FT LVMSLYDPLGLVASFVIHGKILIQEVWRTETDWDSNIPGEIATRWMEWLTV FT LKSMTGLRIPRCYFSGYEPSSYNNLELHVFVDASAQAYAAVAYFRIVDRGQ FT IRVALVSSKTKVAPLRGLSIPRLELMAALLGARLRRTIENNHRLKVTKTCF FT WSDSSTVCSWIKSDSRRYRQFVAFRVDEILSLSNIDEWKWISTRINVADEA FT TKWGKGPSCNVDSRWFRGPDFLYEREEGWSMKPEEEELEDESDGELRAAVV FT CSHLVVQPVVDTERFSKFERLRNSTAYVYHFINNLRSSWIESAGTIGVTSK FT ELQIAENFLWRMAQSEAFPEEVATLKREQDEKNSLRKRLPTSSELVKLSPF FT VDDHGVLRVESRAANAQVLAYDTKFPIILPRKHRITELLLDSYHRKYGHAN FT DETVVNEVRQKFHIPRLRVEVRQIRKRCMWCRVYKATPVAPKMGPLPAVRL FT EPFVRPFTYVGIDIFGPYSVKLGRSSVKRWVCLFTCLTIRAVHLEVVASLT FT TDACKKAIRRFIARRGSPQEIYSDNGTNFVGASRELQQETSRIHTELGSTF FT TNAQTQWKFNPPAAPHMGGCWERMVRAVKSALGSIPIVRKLDDESFATVLA FT EAESMVNSRPLTFIPLETADQESLTPNHFLLLSSNGVREPEKFPTDEVKAL FT RSSWNLVKHTLDNFWRRWVVEYLPTIIRRTKWFQDVRPITIGDLVLVVDEN FT IRNRWLRGRVIGTIPGKDGVARRAEVKTSAGILNRPCTKLAVLNVEGSGDA FT QPEAKATRGGG" XX SQ Sequence 6903 BP; 1887 A; 1555 C; 1972 G; 1489 T; 0 other; tttctcaaag aatttatccg gagtaggatg agttcgcaag caaatgttcg ccaaacccgc 60 tcgcagacaa gggctcagca ggcgaatgcg aatcccccca tctcgggcga cgacggcgat 120 cggatgtcag atcgatcatc ggaagattct ttcatccctt caatcgttga tagcggcagg 180 ggtgagtctc gggaatgtgc tggttgtgac cggcccaata acgcagagaa gtatatggtg 240 gaatgccaga agtgtaacag ctggtatcac ttctcttgtg cgaacgttaa cactgttaca 300 gtacgttcca ctagcttcgt ttgtactgct tgtatgcgaa gcgagatttc ggtcgcgttg 360 gggtcggcaa ggagccaaat cagtgtaatt tctagctcat ctagtgctcg ggcggctaga 420 atggctcgag aactgcagcg tctcgacgag gagaagaaag tgatggaaga attaagtcga 480 gagaggatcg agagggaacg ggccctgaat gaacgggagc ttcaggaaag aatggagcgt 540 gaaaagcagt tcatcgctcg aaaacacgag ctgctgaatc gacaggacga tggagaaggg 600 agaagtgtac gcagtatgcg gagcagccag aggagtacgc aacggacgga agattgggtg 660 aagcagacgg cgagtgaagc tgtcgggccg agttcgacgc ccgtggatgc ggaggataca 720 agtctgggat atccgcaggg agtacatccc tcttcaacgc cactgaagat tgttgatgcc 780 gtgtttccta cggccgttga cgacgacttg aaagatacgg gatcagaggt aaaatctatc 840 ccagaagctt tgggaagctt gcctattgcc gacgcgtcag aggaaccgat tgaaggaatc 900 ggtaaaatgg atgacgcggg cgtacgaaac cagccaggtt tgcctaccgt cgatgttaaa 960 ccgtattcag atctactgaa gctcgatgaa gtagtgccgg ccggtgtgtt ttcgaaggta 1020 aaacgtgtga atccaagtac ataccagcgg tggagtgtcg aaaccggcga actacgacag 1080 caaaatgcta aggttatcca gcaacaacag caaatggaag tggaacagcg gaacatgcat 1140 gagttggtgc ggaaactcca gttcgacatc ggtacgggaa ggaaacgaga gcaggatctg 1200 cagagtcaac tcaaaagtct gcagatacat cacgccgagg agctgagagt aatccacaat 1260 tcggaaactg gtctgcagaa tcagctcagg caactacaat gtgaaaatgc ttccctgagg 1320 aaccagattg catctctaaa tggagagttg caacaaatca gcacatcgag agaaaagttg 1380 caagttctgc ttaaggaacg cgaggacggt ataacatcta aggaatcgca gatacgaacc 1440 ttacagatgg aggtcgtcga ggcagcagaa cagatgcgta atttggaggc ggatttgcgg 1500 gagcagctga accgacacaa acacgaatgc gacactcttg aacttcaacg ggcagagtta 1560 gagaacgaaa ttcaggggtt gcgtacaacg gagcaacagt tgcagaagga gatggaagca 1620 tgccttcaac gtgagcgaga agccatccgc gagcgaaata cagccgagca agaatactgg 1680 gatttacacg atgtggtaca gcagttcgtg aatcggaacc agcactgggc atcggagggc 1740 gactgttctc ctcatttacc tcctcctccg gcggcatggc tagaccaagc tgctaatgaa 1800 gattgcttaa atcctcttcc tcctccgcct cctcatcttt caaattcggc ttacaattgt 1860 ctgcccgtga tgtctggtgg ggggattcct ctagttccac ctttcgcaat aggccatgta 1920 ggaccatccg ctcaccaaat agctgctagg caggtggtta ccaaagagct gccagtcttc 1980 tctggcgatc cgatcgattg gccacttttc atcagcagtt atcagcattc tacggaagct 2040 tgcggatact ccaatgctga aaatcttcta cgtttgcagc gaagtttgaa aggaagtgcg 2100 aaggagtcag taagcagctt tttactccac ccgtcgacgg ttcctcaggt catgtccact 2160 ctgcaacaac tgtacgggcg gccggaacaa atcgtgaata acatgatcgc caaggttcgg 2220 gcaactcccc ctccgaaacc ggaccggttg gagacactgg tcagctttgg actggcggtt 2280 cagaatctgt gtggtcacct gaaagcagtc gggttggaga ggcatcttgc caacccgatt 2340 ctgctccaag agctggtgga caaattacca gcaacagtga aattcaactg ggcgcttcac 2400 caagaacagg ttccagaggt ggatttgaac gtgtttagtg catacatggc gaaaatatct 2460 tcagcggcaa gcagtgtaac acagctgaca gccgtttcgc agaaagttaa agacgaacga 2520 attcgcccga aggatagatc gtttgtcaac acgcacgttt cagcggatcc accgaaaacg 2580 agtcgagaag aagagccgag aaccgctacc gaaagaacca gaggaaagca gaaagatgga 2640 gcagccggca cgatcggtag caagaagtgt gcgatttgca acgtcgacaa ccaccaaatc 2700 gaaaactgtg catcgttcaa agcgttggat ttggatggta aatggaaggc ggtgaaagtg 2760 aacaaacttt gtggtcgctg tctcacctcg catgcgaact ggccgtgcaa aggggaaatt 2820 tgtgggatca acagctgccc caaacgacat caccgcttgc tccactttga tccaccagaa 2880 acagcgaaaa ctaatagcgc ggttgtaacc gttcatcgac agatatcgtc atcaacgctc 2940 ttccgcatcc ttccggttac cctgttcggt aaaaatggac agttcgacac ctacgcgttc 3000 ctggacgatg gatcgtcggt aacactcgta gagcggtcga tcgcagaagc tcttggtgtc 3060 caaggaaagg tggaaacgct acgaatcgaa tggactggag gcgttaacaa aactatcacc 3120 ggagcagaag tcgtaacgat ggagatatcc gagaccggtg gaagcaagcg ctacaggctt 3180 tcggaagtgt acactgtcga taacctcggt ttaccgcaac agaccacgga ctacgcagag 3240 ctggcaacac gattcgcaca tctcgacaaa ctaccggtga aaagcttcag gtccgcagta 3300 ccaggaatcc tgatagggca gagcaattgt catctgcttg ctacattgaa gctgcgggaa 3360 gggcgattga acgagccgat tgcaacaaaa acgaggatcg gatgggcggt atgcggtagt 3420 ctacggaggt cacaggcggt cacgatgcaa acgcagctcc acatgtacgc agagccgagt 3480 accgttgacc ttcacgagta cgttcggcgc tttttcgaag tcgagagttt gggagttgct 3540 gtcgtgcccg aggtgaaagg agtcgaggaa gaacgagcct acaaaatact ggatgaaact 3600 acgcgccgta ccagtactgg gaagttcgag ataggattac tttggaagca tgactatatc 3660 gagtttccgg agagtaagcc gatggcggaa cgacgcttca agtgtctcga gaagcgttta 3720 tcccgaaacc ccgagctgta taacagtgtg cgacagcaaa tagcagactt cacagccaag 3780 gggtacatac atgaagcgac ggtggaagaa atcgaaggat ttgatctgcg gcgtacttgg 3840 tatatgccga ttggagtcgt cgtcaatgag aagaagccag ggaaagttcg agttatctgg 3900 gacgcagcgg cgaaggtcga cggggtatct ttaaactcct tgttgcttaa gggcccggat 3960 cttctgacgt cgttattatc tgtcctgttc gcatatcggg aacgcgaagt ggcagtgtca 4020 gcagatgtca aagaaatgtt tctacagatg ctagttcgag agcaggaccg cagtgcctta 4080 ctgttcccat accgagattc cccagaactt ccgatgacta ccatggtgtc tgatgtggcg 4140 atatttggag ctgcctgctc tccagcccat tcacagtacg tgaaaaacct gaacgcgacc 4200 gaacaagaag cagaactgcc ccgaggagca gaggcagtga aaaagcgaca ttacgtggat 4260 gactacgtgg acagcttcga tacggctgag gaagcattcg aggttgcaaa ggaagtgatc 4320 gaagtccaca agcgagccgg attccatatt cgcaactgga tgtctagcga caggagcgta 4380 ttggagaaac tggatgaagc gaatgtgaag ccgtcgaaag ctatgctacc ggataaggac 4440 gttggatgcg agcgagtgtt gggaatggcg tggatacaac agaaggacga atttgtattt 4500 tcgttgcagt tttgcgagaa ggtgcgaatc ctgctggacg gcgatgcaat accaacgaaa 4560 agagaaatgc tgcgtctggt tatgagcctc tacgatcctc tgggtttggt agcgtccttt 4620 gtcatccatg ggaaaatctt aatccaagaa gtctggcgaa cggagacgga ctgggatagc 4680 aacattccag gagagatcgc tacacgctgg atggaatggt tgactgttct aaaaagcatg 4740 actggtttgc gtattcctcg gtgctacttt tcaggatacg aaccgagcag ctacaacaat 4800 ctcgagctgc atgtgttcgt ggacgcaagc gcccaggctt acgcagcggt cgcatatttc 4860 cgcatcgtgg atcgaggaca gatcagggtt gcgcttgttt catcgaaaac gaaggttgca 4920 ccactccgag gactttcgat tccacggtta gagttaatgg ctgcgttact tggagctcgc 4980 ctacggagaa cgattgaaaa caaccatagg ctgaaggtga cgaaaacatg tttctggagt 5040 gactcgtcga cggtttgctc gtggatcaag tcggatagtc gccggtatcg ccagtttgtg 5100 gcattccgag tcgacgaaat attgagtctt tcgaacatcg atgaatggaa atggatttcg 5160 actaggatca atgtagcaga tgaagcaacg aaatggggaa aaggtccttc ttgcaacgtg 5220 gatagccgat ggtttcgggg cccagatttt ctgtatgaac gcgaggaagg atggtcaatg 5280 aaacccgaag aagaagaact tgaggatgag agcgacggag aacttcgagc tgcggtagtt 5340 tgcagtcatc tagtcgtgca accggttgtc gacacggaaa ggttttcgaa atttgaacgg 5400 ttgcggaata gcacggcata cgtttaccac ttcattaaca atctgcgaag ctcatggatt 5460 gagtcagctg gtacgattgg tgtgaccagc aaggaactac agatagcgga aaacttcttg 5520 tggaggatgg cacaatccga agcgtttcct gaagaagtgg caactttgaa gcgtgaacag 5580 gatgaaaaga attcgttaag gaagcgacta cctacatcaa gcgagcttgt gaaactttca 5640 ccattcgtgg atgatcatgg tgtacttcga gtagagagta gagcagcaaa tgcgcaagta 5700 ttagcctacg atacgaagtt tccgattatc ctgccaagga agcatcgtat tacggagttg 5760 ctgctggact cctatcatcg taaatacggt catgccaacg acgagaccgt agtcaacgag 5820 gtgcgacaga aattccacat tccacgcctg cgagtggagg tgcgtcagat cagaaaacgc 5880 tgcatgtggt gccgtgtgta caaagcaacc ccagttgctc ctaaaatggg accccttcca 5940 gcggtgcgat tggaaccgtt tgtgcgcccg ttcacctacg tgggcataga catctttggt 6000 ccgtattcag tgaagctcgg acgaagctcg gttaagcggt gggtttgcct attcacttgc 6060 cttaccataa gagccgttca tctggaggtg gttgccagtt tgaccaccga tgcttgcaag 6120 aaggcgattc gacggttcat cgcgcggcgt ggatcccctc aggaaatata ttccgacaac 6180 gggacaaact ttgtgggagc tagtcgagaa ctacagcagg aaaccagcag aattcacact 6240 gagttgggca gtacctttac taatgcccag actcagtgga agttcaatcc tccagcagcg 6300 cctcacatgg ggggttgctg ggagagaatg gtgcgtgccg tgaaatctgc acttgggtcc 6360 attccgatcg tgcggaaact ggatgacgaa tcatttgcaa ctgtgttggc ggaagcggaa 6420 agtatggtga actcgaggcc actgacgttc atcccgttgg agacagccga ccaggaatcc 6480 ttaacgccaa accatttcct gttgctcagt tcgaacggcg tacgggaacc ggagaaattt 6540 cccacggacg aagtcaaggc attgaggagt agttggaacc tggttaagca cactctggac 6600 aacttttggc gtcgttgggt ggtggaatat ctgccgacga ttatacgacg gacgaaatgg 6660 ttccaggatg tacgaccaat caccatcggg gatttggtac tagtggtgga cgagaatatc 6720 cggaatcgat ggttgcgagg acgagtgatc ggcacgattc caggaaagga cggagtggcg 6780 cgtcgagcag aagtgaagac ctcagcgggg atcttgaaca ggccatgtac gaaattggct 6840 gtgctgaacg tagaaggatc tggtgacgcc caaccggaag ctaaggcgac acgcggggga 6900 gga 6903 // ID Copia1-I_Dpse repbase; DNA; INV; 3027 BP. XX AC Unknown_group_825; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 16-JUN-2009 (Rel. 14.05, Last updated, Version 2) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1_Dpse; KW Copia1-LTR_Dpse; Copia1-I_Dpse. XX NM Copia1-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3027 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1019-1019 (2009). XX DR Genome; Unknown_group_825; Positions 13083 10057. XX CC Positions [5291-5821] - Integrase core CC LTRs are 93% similar to each other. The original virus is a CC duplicate. XX FH Key Location/Qualifiers FT CDS 1257..2867 FT /product="Copia1-I_Dpse_1p" FT /translation="MKIKNVSVNTNCDTCNKAKICALPFPQKAERVTKSVL FT ELIHTDVCGPMNVKSLAGNRYFVTFIDDYSRKIHVYFMRAKNQVFEKFKMY FT KSYVECQTEKTIKALRSDNGTEYINNEFTNFLNACGIKRELTVPYTPQQNG FT VAERANRTIVEMARSMLINAILKEFLWAEAVQTAVYLRNRCPTRALNGCTP FT FEAWKKRKPSVNHLKIFGSRAFALEKTKKGKFVAKGKKYIFFGYSFAAKAY FT RLYDQEKQMIVERRDVKIVKGEFVKELNDIYENGYQTEDLAKIIIRFESNE FT EPVQVQTAVETEEETISNIDNSDDDEEGNFVSASDPEQSRSSDEEDIPQIR FT GPGRPKIIRTGQPGRPKKQYNVPNNLSCSDDIETPRTVAEALQSHDAEEWK FT KSMRKEFDALVSNNTWTPVNLPPGKKAIGSKWVFRIKRNQNGDIERFKSRL FT VAQGCGQQFGVNYWETYSPVIRYETIRMLFAIAAEKQLHMHQVDISNAYLN FT GKLQKVAYMKQPVGFIDEQNPHKVLKLQKAIYGLKQSGRV*" XX SQ Sequence 3027 BP; 1065 A; 562 C; 702 G; 698 T; 0 other; tgcgtagtgt attgatcaca acggatttgt ggacatatgt aagtggtaaa gccgaaaggc 60 cacaggagga agcagcagga acacaggaac acttggcaca gtgggacaag gctgatcaaa 120 aggcgttggc atgtattata ttaaatgtaa aagcaacgca gttgatgcac ataaagtcgt 180 gcacaacgtc gaagtaggcg tggcaaaaat tacgcgatgt acatgtgtca gtgggaccag 240 tgcgaaaggt acagttatac caaaaacttt tgcggcaaaa tatgcaaaac ggtgataacg 300 tggtacaata tgtaaattct ttcgtggaaa ttactgaaaa gttagccgag ctaaataccg 360 acattcaaga cgagctaaaa gtaataatgt tgttgtcgag cttgcccagc ggttgggaaa 420 atttcgtcgt ggcgatagaa acacgcgaca gtctgccgtc gttcgaaacg gtaaaagtaa 480 aaatacttga agaaggagca cgtaaagacg aaagagacga tcgtgagctt gctgtacagg 540 cagtgtacgc acacacacag actcaaagaa acggtgcgcg ttcgaagaat tacggcgcca 600 gtcgaaaaga cagcaaacga gacggccaga aatccaacga gcaacgcgag tttcgaggaa 660 aatgctacaa atgtaaacaa agcgggcacc gcgcatccga gtgcaaaagc aacaaagata 720 atacgaaaga gtgcgggatg aaaaatcaaa ctgtttaatg cacagctgcg ttgttaagca 780 gcgaaaaaat tcttggtgta tcgacagtgg ggcaacatcg catatgtgtt gtaataaaaa 840 tttgtttgaa acctttaata agaaaaacac aaccattatg ttggctgcag aaaaggatat 900 tgagtcccta ggtattggaa ctgtaaaaat taaatcaaat gaaacaaata tagaattgcg 960 aaacgtgttg tatgtgccgg atttaaaaat gaattttttg tcggtgagca aagcagccga 1020 gttcgggaat gtcactacat tcggaaattc agaagcttta gtgcgaaata aacatggaca 1080 tataatgttg cgagcggtac aagaaaacaa tttatacgtg tatggagtgg aaaaaaccac 1140 tgatgcagtg catttgatat ccgatatatc tcttgcaaaa aaatggcata gccgctacgg 1200 gcatttaaat tttcaaaatt taaagaatct ttgtgaagaa agaacgtgta atcgggatga 1260 aaattaaaaa cgtatccgta aacacaaatt gcgacacatg caataaggct aaaatttgtg 1320 cattgccgtt cccacaaaag gcagaacgtg ttacgaaaag tgttctcgaa ttgatccata 1380 ccgacgtatg cggacctatg aatgttaaat ctctggctgg aaaccgttat tttgttacat 1440 ttatagacga ttactcgcga aaaattcacg tatattttat gcgcgccaaa aatcaagttt 1500 ttgagaaatt taaaatgtat aaaagttatg ttgagtgcca aaccgaaaaa acaatcaaag 1560 ccttgcgaag cgataacggt actgagtaca tcaacaatga atttacaaat tttttaaatg 1620 cgtgcggaat aaaaagagaa ctaacggtgc cttacacacc tcagcaaaac ggcgttgctg 1680 agcgcgcaaa tcgtacaatc gttgaaatgg ctaggagtat gttaattaat gcaattttaa 1740 aagaattttt gtgggctgaa gccgttcaaa ctgcagtata tttgagaaat agatgtccga 1800 ccagagcgtt aaatggatgc acccccttcg aggcttggaa gaagcgcaag ccgtcggtaa 1860 accatctgaa aatatttggt tcacgcgcat ttgcactgga aaaaaccaaa aaaggcaaat 1920 tcgttgcaaa aggaaagaag tacatttttt ttggctactc ttttgccgca aaagcatatc 1980 gattatatga ccaagaaaag caaatgatag tggaacgccg agacgtcaaa attgtgaagg 2040 gtgagttcgt caaggaatta aatgatattt acgaaaatgg atatcagact gaggatcttg 2100 caaaaatcat catacgcttt gagtcaaacg aagaacctgt gcaggtacaa actgctgttg 2160 agaccgaaga agaaacgatt agcaacattg acaacagcga tgatgatgag gaggggaact 2220 tcgtaagtgc aagcgatcca gaacagagcc gcagctcaga tgaagaagac atcccacaaa 2280 tacgtggccc aggaaggcca aaaatcattc gtactggtca acctggcaga ccaaagaagc 2340 agtacaatgt ccccaacaat cttagctgct ccgatgacat tgagactcca cggaccgttg 2400 cagaggcgtt gcaaagccac gatgcagaag aatggaaaaa atctatgcga aaggaattcg 2460 atgcactcgt gtcaaataat acatggactc ctgtcaatct tccgcccggt aaaaaggcga 2520 taggttcaaa atgggttttc agaatcaaac gcaaccaaaa cggtgacatt gaaaggttta 2580 aatcgagact tgtggcacaa ggatgtggcc agcagtttgg cgtaaactac tgggagacat 2640 attctccagt gatacgttat gagacaatcc gaatgttatt tgcgattgct gcagaaaagc 2700 agttacacat gcatcaggtg gatatctcta acgcgtatct caacggtaaa ctccaaaaag 2760 tcgcctacat gaagcaaccc gtaggcttta tagatgagca gaacccccac aaggtactca 2820 aacttcaaaa ggctatctat gggctgaagc agtctggacg tgtctagaat gacacattga 2880 acgaggtgct ggttaacatg ggctttaaaa aaagtcaaca cgagccatgt ctatacgtca 2940 aacagcatca acaaggcttc agttatatag cagtttacgt cgacgacctg ataatcatct 3000 gtacaaaaga gatggacgtc agcgcga 3027 // ID LDT1 repbase; DNA; INV; 5677 BP. XX AC AF081103; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 25-JUL-2005 (Rel. 10.08, Last updated, Version 3) XX DE Lymantria dispar non-LTR retrotransposon LDT1-4 putative DE gag-related protein and putative endonuclease/reverse DE transcriptase genes, complete cds. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; LDT1. XX NM LDT1. XX OS Lymantria dispar OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Noctuoidea; Lymantriidae; Lymantria. XX RN [1] RP 1-5677 RA Garner K.J. and Slavicek J.M.; RT "Identification of a non-LTR retrotransposon from the gypsy RT moth."; RL Insect Mol Biol 8(2), 231-242 (1999). XX DR GenBank; AF081103; Positions 1 5677. XX SQ Sequence 5677 BP; 1508 A; 1744 C; 1226 G; 1199 T; 0 other; agactgaatg ctagtatata tacaccgcct tttgctgtct attcaatcat tccgcgtcgg 60 agactcgtag cgcacggacg tgcgcctcgc tgctctccat ttcgccgcgc tcaccccgca 120 cccgccccct gcggtcgtgc gttctcgctc cgtctcgctc gctctcgcag cgaaccacgc 180 gtcgcgtcaa gccgccatct tgttcaacgt ctgtggccgt tcagtggccc gctccgggtg 240 gccgtgtata cggttgtctg tggcagtttc gctgtttgga cgcaggtgtt gccggatccc 300 aactgcacga gtcgactttt tcaagacgtc gagtgcagat ttctaccgtt ggtagataaa 360 aagtcaacct attttggccc cctcagggta gaaaaccata ttgtttggtc cccggcggaa 420 acctcacggt gtcaacgaag ccgggtgagg caaggccccg atttgcctgc cgagcccctc 480 cagggcagag ctaacaatcg gaaggggtta gagcatccgg ccttggggga tctcctagtc 540 ggacctaatc tacggctttt caacattttt ctgcgaaggc ctaacggccc gagcagctag 600 gaacctatcg cacattaaga ttggcgaagg attacaagct tagggcacac taattaagta 660 ctcatcctcc cgacctgcac ccggttggga gggacacgca ccacacacaa cccagggtat 720 cccacccagg gcgctaggtt ttttacatcc ctcttagcta tcgttttttt ccgttagccc 780 acgatggcgg aaaaaaccat cactatagac gcccttctgg caattccaga aatcaaagct 840 tatatcgctc aacatttcgg agcgataacg atccttcctc cgcaacccaa acccgctcga 900 tccaccgccc agccgctcga accggaacta ccggccgtcg gtacggcgat ggaaacggag 960 ccggtcttgt cctcctcaaa ctcgtcttcc tcctcctcct ccgagtcgga agattcgtcc 1020 aatgagggct cattcattga ggtccgctct cgcaaacgga gcaaaaggac caaaaacacc 1080 aaagaggccc cctctaagac cccaaaaatc aagcagactg actcggcaac aacatctgcc 1140 gcagccccgc ttgcaacgac cgccacggcc gctccggccc ccccagtcgt ctcacaggca 1200 gccctaaccg ctgccccacc acctacggac agcaataccg tcgccccacc gacggtcaaa 1260 aaatcactcg ctccgccgcc cctatatatt agggacaagg acaagtggct agccatctcc 1320 gcatggctaa acactaacag gatttcgtat aaatctgcga aatccacccc ccaaggtata 1380 cgagtggacc ttcactcgtc agccgaccac cgatctgtat caaaaatgct agggaatcaa 1440 cagattccct accatacgta caccctccct gaagccaaac ttctgcgtgc tgtaatacgc 1500 aacgtaccga gggaaataga gaccaaagag attctagaat ctcttaaaac ccaggatcta 1560 ccggttgtag aagtacaccg gatgatccgg ggtaggggtc gttatcccct caacatgata 1620 ctcgtctgcc ttacaaataa cgcagagggt aaaggcattt ttaaaattaa gaccatctgc 1680 ggtctatctg gggtctcggt tgagccccca cacaaaaatg gcaacctggc acaatgccac 1740 aagtgccaac tgtatggaca atcgtccaaa aactgctttg cccgtcctag gtgcgttaag 1800 tgcctaggtg accatcacac ctctcagtgt gaacggccaa aggacatctc actctgcaaa 1860 gagccaccag cgtgtgttct ctgtggagaa tacggacacc cggctaacta ccgcggctgc 1920 ccaagagcgc cgaggaggtt agtccgccag cccaacacaa acggcaaggc cttgtactat 1980 aacaagacct tcgtcccagc ccccttacca acccacaatg cgtgggctcg ccccctgcta 2040 aacagcaagg aggctttccc aaccctgccc aacaggcagc ccgctgcccc acccgcaaca 2100 tctcaactca aaccaccaac aaccgggcca aaccagccca ctggtggtaa tcccaagccc 2160 cctcagggcg cccctattcc tatcgtcgtc gctaaaagcg cgcccccaaa agccccccaa 2220 ggactagatc cagatctagc cctggtggcc aacttcgcag ctcaaatcaa tttcacggaa 2280 atccgcgaaa ttgcagccga attaagaagg aacgagggaa acaacctcgc cctccttaat 2340 accgcaatta aattttcccc cacgttagag cgtctaggct cgcttaaatt caaataaaat 2400 ggataggtac actccaggca gagttaaacc cagacacctc aaaataggga cattcaatgc 2460 caatagccta acccaacaga aggatgaagt aggcacattc cttcgggaac accagctcga 2520 catcttgcta gtacaagaga cgttccttaa accatttaac aaggacccta gagtcgccaa 2580 ctataatata gttcgcaacg ataggacaac ctcaccaatg ggcggaacgc ttatctacta 2640 caaaaggtcg ctgcattgca cacctgtgga tccgccccct ctgatgtaca ttgaagcttc 2700 aatttgcagg ctagccctct caggccatca gcctatcacc cttgtatccg catatctctc 2760 tcccaataga gaacattcga aacagttggt tgaaaaagat ctaaaagcct tgctagaatt 2820 gggaggcgcc gtcatcattg gcggtgacct taacgccaag aacacctctt ggagctgtca 2880 aagttccaat aagaggggca agactttaga gctattcgct gacagactgg gatttgatgt 2940 catagcgccc atagagccga cacacttccc ctttaacgtt tcgcacaggc cggacatact 3000 cgatatcttt cttctcaaaa acataaacct tcgtttatgt tcaatagaag ttcaacacga 3060 gttagactca gatcaccgcc cggtgactct ggaattagcc tcccgcaccc ccggcccctc 3120 aggccacaat aacgaaagga ccaagatagc gactgactgg gcacggctag agaaaaatct 3180 caagtcaacc tcgtcggtcc accttgatca aattcccaaa gacattacgt ctaaggaaga 3240 gacctctcag gcgatcgact cgctcacaag tcacgtgcag tcgataatgg acgactgctc 3300 gcggccggta ccagatacga gcgatcgtaa atggcgcctg ccagacgacg tccgtgactt 3360 gttgagacgg aaaaacgccg cgacacgcgc ttatgaccgc taccctacgg aggacaaccg 3420 tacccgccta cggtcactcc agagggaggt taaacaagca attaaaaaat tgagacagag 3480 caaatgggac gccatgatcg aggagctaga accctctcac acagcctttt ggcagctttc 3540 ccgctcactc aaaaaagaca tcgtgtccac tctccctcct ctcaccagac ccaaccagcc 3600 tccggcattc atggacgatg aaaagtcaga atgcctggcc gactgcttag agcttcagtg 3660 tacgccgagt acactacacc ccgacgccgc gaatcaccgc ttcatcgccg acgaagtgga 3720 acgtcgtaac actctctcgc gctcgttagg aaacgacagc cccgatcagc caatcccagc 3780 agtgaccacc gatgaggtcg aatcgttagt caggggtctc aaaaccagga aagccccagg 3840 ttcggacgga atctcaaata aagtcattaa actattcccc gcccacttga taatcctgtt 3900 gtgctgcatc ttcaatgcag cactaaataa taacatcttt cccgcgcaat ggaaagaggc 3960 cgtggtaata ggaatccaca aacccggcaa gcgccgtacc gaacctagta gttaccgccc 4020 gattagtcta ctaagaggtc taggtaagat ctacgaacga atagttctca gtagactacg 4080 tacattcgcc gaggctaaca acctcgttcc cgacgcacag tttggcttcc gagctaaaca 4140 cagctgcgta caacaagttc accgtttggt agaatacaca agtagtcagt tcatcatgaa 4200 cagatatacg ggtgtattat tcctagacgt agcgaaagca ttcgacaagg tttggcatga 4260 aggcctaatc tacaaactct atctcctcaa gatccctcac tacttaattc acattataca 4320 tgactactta agtcatcgca gctttcgcta cagggtggaa gggacgttgt cgacctccca 4380 ccctatcaag gccggcgtcc ctcaggggtc ggtcctctcc ccatttctct tcacgctata 4440 cactagcgat atgcccaaat tcaaccgggt acataaagcg ctttacgctg atgacaccgc 4500 actcttttgt gctggtagat ccccgacaac agtagccagg actctccaaa cagcggttac 4560 cgccctagcc aattggttta ggaattggag attagaaatc aaccccgaga agagtcaggc 4620 ggtcatgttc actagacgaa ctgcccgttc ctactctatc gatagcattc ccccccttaa 4680 aattttcaac aagccggtca cttggaccag gcaggccaaa tacttgggtg tgacccttga 4740 cgatcgactc agcttcgccc ctcacataaa gaaagtccgc gcacgggcag cctttgtgat 4800 gggccgtctc cactgtctac tcaactctag gtcgcgtatg cccctgagta gcaaagtccg 4860 actttacacc acctgcatac ggcctatcat gacctatgcc agcgtggtgt tcgcgcatgt 4920 taagccacac aggatccaca ggctacagac actacaaaat cgtttcatgc gccgagctac 4980 gggagcaccg tggttcatac gaaacgtcga tctccacatc gacctcgaac tgccgaccat 5040 caagcagttt atgaaacggt gctctcagag ctactttgac tctgccatga cacaccccaa 5100 ccaactagtg gttggggcgg cttcctatag accctccaaa atctcctcca tccgacgtcc 5160 tcgtcacgtc ctcgacgaag aggacgacat tatcacactc gctcaacaaa gagcctacac 5220 ggcccgcgag gccttaacaa ataaatatcc gcggttccgc cctcgccgtc gaggtgcgaa 5280 accgcgccgc gcaaaagtca ctaccaaaaa cgtgggcacg tccactttgc ctccagcgac 5340 gccccaagcc gaggtccgtt ccccacaggg ggagcccttg ggcgatgtcg cttagcatac 5400 ttttctccgg gttgagagcc ctcagcgctc acccatagtc cgcgtcaagc tgtcaaggcc 5460 atctgcggcc agcagctgta cacacaaaaa aaaaaaaaaa aaagtctatt cattcagtcg 5520 acatacgata tcagctcgca cactcgtacg gtcgtcacta gttctattat caatttattg 5580 tacataattt atgaaatgta aataaatcgt cagtcttaga aataactctc tacgttcttc 5640 ttcgccttct tctcacatct tcattcgcca accacaa 5677 // ID hAT-46_SM repbase; DNA; INV; 2608 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-46_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2608 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1849-1849 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 492..2294 FT /product="hAT-46_SM_1p" FT /translation="MSDKKKRKLKDENRQFNNNWENDYFFIQVKNNAVCLV FT CRETITAKTYNIRRHYEKHTELSVLSGDERKAKLYLLKANLKSQQNVFQKQ FT NQQSNDIVSTSLKISQIIAKKLKPFSDGEYIKECLIAAAEEISPEKVHIYK FT QVSLSHQTVARRVDDISNEISLNLNEHTKQFVYYSLALDETTDIKDTSQLA FT IFIRGVDNKLKVTEELLDLVSLKDTTTGKDIKDAVLKCAEDRQLDFKKLIG FT ITTDGAPSMIGKNIGAVSLICKHIESLGQNTSFLDLFICHCFLHIENLCAQ FT SLNMSHVMSVVINIINKIKNNSLRHRQFQEYLRELESEYCDLIYYAKIRWL FT SRGKCLLRFWNLREEIRIFMNENGEDVPQLYDERWLLDLCFLIDITTKLNE FT LNQNLQGENKLITNCYQDIKTFIAKLKYFQNQLKSNNATHFPHLNDFEMEN FT KPISEYSHNIKDLLEEFEKRFAHLEKFEQMFNIFNCPFNIDVNSAPDYLQL FT ELIDLQSNSEMKFRFEDINIVDFYGKYIQEDKFPNLKKLAMCIVAANGTTY FT LCESFFSRLKLAKTKDRNRLTDANLTNQLRCATTKLEVDIKKISNKINKQV FT SH*" XX SQ Sequence 2608 BP; 964 A; 379 C; 432 G; 833 T; 0 other; cagccatgtc caacgtgcgg cccgataaca aaaaaaggaa atcacacaaa ttaatattta 60 aattttaata atcgataata caaataattt ttattacgtg gtcgaatgac gttttataaa 120 actcctatat cttaatatat aaaaatggaa tcttatatgt atgtgggtat atacacaaag 180 aagaaagcag gagagcggga gagcgtgcgc gcgcagcgcg cgagtttgca tgtgtagaat 240 aaaagtcaga cgcgtgatta tgattatttt cgttaccgct gtcgtttagt ttagtcttgt 300 ttaagccact gttaaatgaa gtgctttctt gttttaaagt taaaaatttt aaacgtataa 360 atttattgtt ttaaaaaggt tgtgaaaata aaagtaagta atattttatt ttaatttgca 420 gtcctacaaa aatttattcg tttctttaat tattccagaa taaataaaat tgtccataat 480 attgacgtgc gatgtcggat aagaagaaaa gaaaattaaa agatgaaaac agacaattta 540 ataataattg ggaaaatgac tactttttca tacaagtaaa aaataatgct gtgtgcttag 600 tttgtcgtga aacgattact gctaaaactt ataacatacg aagacattat gaaaaacata 660 cggaactttc tgtactttct ggtgatgaac gcaaggccaa attatactta ctgaaagcta 720 acttgaagtc tcaacaaaat gtgtttcaaa aacaaaatca acaatcaaac gacattgtca 780 gcacaagttt aaaaatttct caaattattg ccaaaaaatt gaaacctttc tcggatggag 840 aatacataaa agaatgttta attgctgcag ctgaagaaat tagcccagaa aaggttcata 900 tttataaaca agtaagtctc tctcatcaaa ctgttgcacg tagagttgat gatatttcca 960 atgaaatttc cttaaacctt aatgaacata ccaaacaatt tgtttactac tccttagctt 1020 tggatgaaac aaccgatatc aaggatacat cacaacttgc tatatttatt cgcggagttg 1080 ataataagct gaaagtaaca gaagaactcc ttgatcttgt gagtctgaag gatacaacaa 1140 cgggcaaaga tattaaagat gctgttctaa aatgcgcgga agatcgacaa ttagatttta 1200 aaaaacttat tggtattaca acagatggag caccatcaat gataggaaaa aacataggag 1260 cagttagttt aatttgcaaa catattgaaa gcttgggaca aaatacatca tttttggatt 1320 tatttatttg tcattgtttt ttacatatag aaaatctatg tgctcagtca ttaaatatgt 1380 ctcacgtgat gtcagttgtc attaatatta ttaacaaaat aaaaaataat tcattaagac 1440 accgccagtt tcaagaatac cttcgcgaac tggagtccga gtactgtgac ctaatatatt 1500 acgccaaaat acgatggttg agtcgtggaa aatgtttatt aagattttgg aacctcaggg 1560 aagaaattag aattttcatg aatgagaacg gcgaagacgt tccacaatta tacgatgaaa 1620 gatggttgtt ggatttgtgt ttcttgatag atatcacaac taaattaaat gaactaaatc 1680 agaatcttca aggagaaaat aaattaatta cgaattgcta tcaagacata aagacattca 1740 ttgctaaatt aaaatatttt caaaatcaac tgaaatcgaa taatgccacc catttccctc 1800 acttaaatga tttcgaaatg gaaaataaac ccatttccga atattcgcat aatattaaag 1860 atttgcttga ggaatttgaa aaaagatttg ctcacctaga aaaatttgag caaatgttca 1920 atatttttaa ttgtccattt aatattgatg tgaattcagc tccagattat ttacaactgg 1980 aacttatcga tctgcagtcc aactcagaaa tgaaatttag gtttgaggac atcaatatag 2040 tcgactttta tggaaaatat atccaggaag ataaatttcc taatttgaaa aaattagcta 2100 tgtgtattgt ggcagcaaac ggcacaacat atctctgtga gtcatttttt tctagattaa 2160 aattagccaa aaccaaagac cgtaacaggc tcaccgacgc aaatttgacg aatcagttac 2220 gttgtgccac aacaaaatta gaagttgata ttaaaaaaat ttcaaataaa ataaataaac 2280 aagtttcaca ttaaatatct gttgtaattt actgaaatac ctatatatta ttgtgtatat 2340 ttgtgtacat gctagtttgt gtgtacttat taaaaaaatt tcaaattaaa taaataaaca 2400 agtttcatat taaatatctg ttgtaattta ctaaaatacc tatatattat tgtgtatatc 2460 tgtgtacatg ctaatttgtg tgtacgtatc tcatttgagc gcgagtgtgc gctacaattg 2520 aaattttttt gttcaggtga gtggcggccc agtgccttgt acattcctaa attttggccc 2580 gaagagtaat ttgggttgga catggctg 2608 // ID CR1-22_BF repbase; DNA; INV; 4932 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-22_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-22_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4932 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4932 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1593-1593 (2009). XX DR [2] (Consensus) XX SQ Sequence 4932 BP; 1450 A; 1271 C; 1072 G; 1139 T; 0 other; aagacccctc gttagcgggg tgaagcacgc ggacgcacgg tcaggctagc cccatctggg 60 agcggtttct atagtaaccg ccaagtgtct ggtttttccc gccaaacact aagtctgaag 120 atgcctaccg tgtctaaatt caccaggcag cgcagcgtta aggttacatt tctccaggtc 180 gaagatgccc agtttagtga tgccattgaa ctgttgaaaa aacatggggt aaacccactg 240 accgacatcg tgggaataca acggagaaca aggggagtat gtgagatcac cttcaagtcc 300 caatcatccc tcgccaaagt gagccctcga cttacatctg acagtgcagt tgatgtcgag 360 ttgtacggct ccgggttgac agtgataaca gcctttggca tacctgttga aatggatgat 420 aactttatcc gccatcggct cagagagttt ggtactgtag aggacagccg tttccttacc 480 tatgcgaacc aaggtttccc tgaaatcctg accggtacta ggcagtatcg tatgaagatc 540 acaaagcaca tcccgaattc tatccgtatc ggcagtgaac ttgtgacctt taggtacaac 600 ggacagccga ggatctgtca ccgctgtggc agtgacgagc attttgtcgc cgcttgtaca 660 gccgttaagt gcacccgctg ttttgagatt ggtcatttag ccaccgattg tgaccaagac 720 atcaaatgta acatctgtgg aggggaaggt cactctgcaa ggtcgtgcca actaagcttt 780 gcaaacaggc tgaacgtcac cactagctgg accaagatca cccctacaca gccgaacgac 840 aagagtgtaa agaaaccagt tacacctggt gaaaacgagt ataccagtca gagttcagta 900 caggacaaca caccgacagt ccctaaggtg gtggtgagta cggctgagcc ccaaaacgga 960 gtggtgagta cgcctgagcc cccaaacgga gtggtgagta cgcctgagtc cccaaacggg 1020 gtggtgagta cgcctgagtc ccaagacggg ggtaaagagc cgccaggcgg tgagactgac 1080 atggaggatg cagaggactt ggtcaccagc caaactgacg gcctccttct caaattagac 1140 gactcagggg atatcagcgc cgaatacgaa acagacgaag agagtgtcaa caaacgacct 1200 ctcccttcaa gccagagcga ctccgatgac tcggacacct ctgcaagacg aaccggaaag 1260 gcaactggta aaaagataaa gaaagacgca gatgcaagtc ctagcgagtc tgatgactcg 1320 gacacctctg caagacgaaa cggcagggca gttgaaaaaa agtctaaaaa agatgcagac 1380 gagaagggga caactaaaac gctcccttct cctggtaaag ccttagtgtc ttacaagacc 1440 aaagttgtcc cttcatcttc cttacggaag aaaggtagga agggtcctaa gtaactgaca 1500 ttatgttcga caatcctttg acaacgattt ggttatttct cctcgtcgct ttgtcattgc 1560 gaataggtac agacattgta acaaccggat ccacctatct taacccacct atggatccca 1620 cacaacaaaa tatcagacac cctgatactg cttcagacat gacggaactg ttgctgaacc 1680 caaactcgaa aaacttactg accatggtac atctcaatgc tcgaagttta ctacccaaga 1740 ttgaagaact cagatctttg gtatccacga tgaagttcag tgttctatgc gtaactgaga 1800 cttggctaag tagtgaaatt gttgaccgtg atatcgaact ggaagggtac caggtgtacc 1860 ggaaagacag gaacagacat ggaggtggcg tcatgatgta tgtcagtgat tctctgtaca 1920 cgacaagacg tggtgatctt gagaatgacg agatagaatc tatctggtgt gatgtgacca 1980 aagccaatac caatctctta atcaattgtt catatcgccc gccatcagca gatgacacat 2040 tctttgacct attcgaaact cagattcaga aagcaacgga ccaagcacat gctgttaaag 2100 tgatcctcgg agatttcaat gcaaagaaca gcaactggct taacagcaac acaactgaca 2160 acccgggacg tcaacttgac aacattttca tgaaccatgg tctagaacag gtgcttcacg 2220 aaccgacccg tgggaggaac ttgttggacc tgattgtcac ctcccatcca gtaatgtgtc 2280 accaaacggg caccttagct ccccttggtg actctgatca tcttgcaaca gcaactgcgc 2340 tcaacctaaa ggcaaaccac cacagcggaa aacgcctcgt ctggttgtat agcaaggcga 2400 acattgaaac ccttcaccaa cacattgcag ccgccccctg ggatatcaac cacgtttttg 2460 actccatgga cgacatctgg gattcatggt acaacatgtt cattgccata tgcaaacaac 2520 acatccctca caagtttgtc tctaccacta ggacagcaaa gccctggata cattctaaca 2580 aagagatcaa aacagctatc cgtagaaaac accgcctaca ctccagagcg aaacggataa 2640 atactgaggt aacgtgggcc acctacagaa gacaaagaaa tctagtgact gcactcacaa 2700 gaagggccga gtcagcttac attgaggaac tcgtgaatga tgtggagact ggcaacacca 2760 ggcgattctt cacatatgct aaatctgctc tgggaaatac ctcaacgggt atcccagctc 2820 ttaaagtcgg ttcagccata ctggagactc cggaagaaaa ggctaatgcc ctgaacaatt 2880 tcttcataaa ccagactgat cttcccgcta ggaatgaccc agctccaaca ttccacccga 2940 caactgtccc gggaacagtc ttggattctg ttcagctgtc agttgaggaa gtacgacaac 3000 aactaagttc tctgaaagtc ggaaagtcat gtggacccga cgaaatatcc cccaggctct 3060 tacgtctagt agcagatccg atctcagctc ctctaacctg cctgtataac aaatctctcc 3120 aactcgggca ggtaccttcg gaatggaaaa aagctaacgt aacccccatt cacaaagccg 3180 gtagccgcca cctctcaaac aactacaggc ctatatcact actgagcgca gtgtcaaagg 3240 ttatggaaac cctggtgaac aagcgactaa ttgcccatat aaatccgata ctgactgacc 3300 accaaagcgg atttagacca cttgacaaca ccgcactgca gctgtgccga ctaatggagg 3360 aatggaccga cgccatggat agaggacaaa ttgtaggctg cgtatttctt gacctccgaa 3420 aggcctttga caaggtctgg cacgcaggac ttcttgctaa attaaaagca tacgggatta 3480 atggccccat gctcaactgg ttcaccagct atctctccga tagacgacaa cgtgtcgtca 3540 ttcagggagt aggctccgaa tggaagtctc cattagccgg agtgccgcaa ggatccgtcc 3600 tcggaccgac actttttata ctttacatca acgatctggc cactagcctt acacagtgtc 3660 aaacaaacct atttgccgac gatacatcac tgtcttcttg ccaccgctct atcgacagcg 3720 tagtcgcatc actaaacagt gagttgagct ctgtgtcaac ctggctttca aaatggaaac 3780 tagaggctaa tatagacaag tgcaaagtca tgtttattac gtctcgcgcc cttccacgat 3840 cgattcctcc tgtgatccta ggaggaactg ttttacaggt tgtcaccagt cataaacacc 3900 taggcgtcac tgtcacaaat accctgtcct ggtcaacaca cgtagagatc atctcaacaa 3960 aggctaggcg atcatccgga ctactatgtg ctctacggaa gaagattccg aagaatctac 4020 ttctcaggct atacactatg atcacaagac caagtctaga gtatgctgac attgtctggg 4080 ccggccttac caaacgtgac caaaagattc tggaatctgt ccagtaccaa accaccagga 4140 ttatcagtgg ccacttcgga ctaccgtacc cttcatacga acgtctctac tccgaactat 4200 ccctaccgtc actacagtac cgacgcaagt tccacacggc agtaaccatg tacaaactgt 4260 tgaatggacg ttgcccccca cacctgcaaa gtctgcttcc tcatgcacga gcgtctgcta 4320 tcgactcccg ctacccccta agaaacagtg aacatttgac tactcctgca tacaagacta 4380 cacgatctca gagaacattt gttagcagag caatttccct ctggaactct ctccccagta 4440 gtacccgtac agctagcacc atcagctcct tcaaaaatag actgcgcacg tcgcctgata 4500 acaaatactc tgtaaacgct tgataaagta tatcatagtt tcctaatagt cattgtatta 4560 ctgatagtag ctattgatgg caacaactac tgttcctact gaacgatctg cactgtagta 4620 cgtatacatt atagcgttca aacgtgttaa tcgctgacta actagcctta tatagataac 4680 cgagtaatat catttgtatc atagactgac attctagatc gtacgactgc tagtcaagct 4740 gtaaacacgg atcacctctg tagtcaaagt tacatgtatc gttaatactg ttgtatactg 4800 tatactattg tataacgtat attgtaatgt taccagggct agcccttgat aatagcctca 4860 ggctagttgg gtagccctgg ctgtatttct catgattata cagccgaata aatcaaatca 4920 aatcaaatca aa 4932 // ID CR1-6_HM repbase; DNA; INV; 4084 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4084 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1834-1834 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(76..414,475..2328,2273..3547) FT /product="CR1-6_HM_1p" FT /translation="MKVKKIFEDILSVKNVQIERAHRTGNKKLKMPRTIVI FT KLLDFRDKVAILSKSSKLKGKNIYINEDFCFETTQIRKNLREKMKIERAAG FT KFAFISYDKLIIRDWVAKKSKNLSSTVLKMEENVIFKSSNENDIIGDNDTD FT SNYFNHKMLESQYYSVSESSQYLGQTKNTFSILNINIRSLNKNFENLKILL FT DEIKHDFKIICLTETWCKCGETNYEFELVNYTSIHQPREVNAGGGVSIFIH FT NSVNYILRNDLCVNETDYESLCVELVNNKEKNIIINTTYRPPSGNLKKFKT FT HLKTFLNKITNSRKHIYLVGDFNINLINYASNNNAKNFINTLLQYNFIPTI FT NKSTRITNRSSTLLDNIITNNFHNNPLYTGIIQTDLSDHFPTFLATNNILX FT NNISTKSTIFRRQINENSLKQFKNCLKNCVDWNLILQSNXANNAYDLFLXQ FT FCEQYEIAFPEIKKVINTKSLXNPWMSKGLLKSSKKKEKLYXKYLKKKTYK FT NEITYKNYKNIFEKTKKRSKKLFYDQLLXKTNGNTKKTWNLIKDIIGKNKR FT KKNNLPKNLLIDGELVYDKSIIAEELNNXFLNIGPNLAAKIPTNSXPFDSY FT LKTYDKLMDETNLKPSELRTAFSSLKSKKSEGFDKISVDVVKSVFDIIEPS FT LFHVFDLSLKSGIVPDKLKIARITPIFKSGDKSNILNYRPISVLPCFSKLL FT ERIMYNRLYNYLTENNMLYCKQFGFKKKQFNLRTICCIVNNLVLKKNNSTX FT HAVVELFNQISNAFNNDCFTLGVFVDLSKAFDTVNHDILIKKLEXYGVKNK FT NLLWFEDYPTNRKQFLYYNQNNTIKHSITCGVPQGSILGPLLFILYINDLY FT LASNLLXLILFADDSNLFYSHRDIKALFKIVNEELCKVNEWFISNKLSLNV FT EKTKFMLFHKPNKSENIPLKLPDLIINKTYIKRESNINFLGVILDENLSWN FT SHINSIEKKISKNIXMMYKAKPFLNIXSLKKLYFSFIHSYLSYCNIAWAST FT NYTKLKKLYSKQKHACRIVFGANRNVPCEPLLSELGALNVYKLNIHQVLLF FT MFKTKNELSPKIFQSYFNEIXHKYPTKFSDNNFVVPKYNLKLTSYSIQYRG FT PYLWKKFPXIVNKKKTLRDISLEQFKNESKRLLLFMDLNKFDFKCFF*" XX SQ Sequence 4084 BP; 1659 A; 537 C; 506 G; 1356 T; 26 other; gggaattgga agattaacaa tttaagagta gatggaattg atgaaaatga aggtgaaagc 60 tgggaagaaa gcgaaatgaa ggtaaaaaaa atttttgaag atatattatc agttaaaaat 120 gtccaaattg aacgggccca tagaactgga aacaaaaaat taaaaatgcc gaggacaata 180 gtgataaaac ttttagattt tagagataaa gtcgcgatct tgagcaagtc gagtaagcta 240 aaagggaaaa acatttatat aaacgaagac ttctgttttg aaacaacaca aatcagaaaa 300 aatttgaggg agaaaatgaa aattgaacga gccgctggta agtttgcctt catatcttat 360 gataaattga taattcgcga ttgggttgca aagaaaagca aaaatttatc ttcataatat 420 ttggcattaa ttaacgcaat tttattttta ttattactat gcatatttaa ttaaactgtt 480 ttaaaaatgg aggaaaatgt catatttaaa tcttctaatg aaaatgacat catcggtgat 540 aatgacaccg actctaatta ttttaatcac aaaatgctag aaagtcaata ctactcagtt 600 tctgaatctt cgcaatatct aggacaaact aaaaacactt tttcaatttt aaatatcaat 660 attcgaagtt taaataaaaa ctttgaaaat ttaaaaattt tattggatga aattaaacac 720 gattttaaaa ttatatgttt gacggaaact tggtgtaaat gcggtgaaac aaattatgaa 780 tttgaactag ttaattacac atcaattcac caaccgcgtg aagttaatgc gggtgggggc 840 gttagtattt ttatccataa ttcagttaat tatattttac gtaacgatct ctgcgtcaat 900 gaaacagatt atgaatcatt gtgcgtcgaa ttagtaaata ataaagaaaa aaacataatt 960 ataaatacta catatagacc accatctgga aacttaaaga aatttaaaac tcatcttaaa 1020 acatttctaa acaaaattac aaattcgaga aaacatattt atttggtggg ggattttaac 1080 attaacctga tcaactatgc ttcaaataat aacgctaaga attttataaa tactcttcta 1140 caatataact ttatcccaac aataaacaaa tcaacaagaa taacaaatag gtcttctact 1200 ctacttgata acatcataac taacaacttt cataataacc ctctttacac aggtataaty 1260 cagaccgatt tatcggatca ttttccwaca ttcttagcta caaataacat tttamttaac 1320 aatatttcca caaaatccac gatatttcgg cgacagatca atgaaaactc cttaaaacaa 1380 tttaaaaatt gtttaaaaaa ttgtgttgat tggaatctaa tattgcaatc aaatgakgct 1440 aacaatgcat atgacttgtt tcttgmtcaa ttttgcgaac aatatgaaat agcgtttcca 1500 gaaataaaaa aagtcataaa taccaagtca cttcwaaatc cctggatgtc taaaggctta 1560 ctaaaatctt caaaaaagaa agaaaaayta tacgwaaaat atcttaaaaa aaaaacttat 1620 aaaaatgaaa taacttacaa aaattataaa aatatatttg aaaaaaccaa aaaacgctca 1680 aaaaaactat tttatgatca attattaraa aaaaccaacg gaaacactaa aaaaacgtgg 1740 aatttaatta aggatataat tggaaaaaat aaacgtaaaa aaaacaatct gccaaaaaat 1800 cttttaattg atggagaatt agtttatgat aaatcaatta tagctgaaga actaaacaac 1860 twttttctta atattggccc gaatttggct gctaaaattc caactaattc caytccattt 1920 gattcatact taaaaactta cgataaactt atggatgaaa ctaatttaaa accgagtgag 1980 ttgagaactg catttagcag tctcaaaagt aaaaaaagtg aaggttttga taaaattagt 2040 gttgatgttg tcaaatcagt atttgatatc attgaacctt cwttatttca cgttttcgat 2100 ctttcattaa aatcaggtat tgttcctgat aaattaaaaa ttgcccgtat aacaccaatt 2160 tttaaatctg gagataagtc aaatatttta aattacagac ctatatcagt cttaccttgt 2220 ttttcaaaat tgctagaacg aataatgtat aatagacttt ataattatyt aactgagaac 2280 aatatgttgt attgtaaaca atttggtttt aaaaaaaaac aattcaactg amcatgcagt 2340 agttgaactt tttaatcaaa tatctaatgc ctttaataat gactgtttta cattaggagt 2400 tttcgttgat ctgtcaaaag cttttgatac cgtaaatcac gatattctaa taaaaaaact 2460 agaaawctat ggagtaaaaa ataaaaactt actttggttc gaagattatc cgacaaacag 2520 raagcaattt ttatactata atcaaaataa tacaataaaa cattccataa cttgcggggt 2580 tcctcagggt tcaatyttag gaccattatt atttatattg tacataaatg acttatactt 2640 agcatcaaat cttttaracc taattttgtt tgctgatgay tcaaatcttt tttattctca 2700 cagagatatt aaagcacttt ttaaaatagt taatgaagag ttatgtaaag taaatgaatg 2760 gtttataagt aataaacttt cattaaatgt ggaaaaaaca aaatttatgc ttttccataa 2820 accaaataaa tcagaaaata ttcctctaaa attaccagat cttattatta ataaaactta 2880 tattaaaagg gaatcwaaca taaacttttt aggagtaata ttggatgaaa atttatcatg 2940 gaattctcat ataaatagta ttgaaaaaaa aatctcaaaa aatattkcaa tgatgtataa 3000 agctaaacct tttctaaata takcatctct aaaaaaatta tatttctcat ttattcacag 3060 ttatctgtct tattgtaata ttgcgtgggc tagtactaat tatacaaaac tmaaaaaact 3120 ctacagtaaa caaaaacatg catgtagaat tgtatttgga gccaacagaa atgttccttg 3180 cgaacctctt ttaagcgaac tcggtgcttt aaatgtatat aaacttaata tacaccaagt 3240 tttactgttt atgtttaaaa caaaaaatga attatctcct aaaatatttc aatcttattt 3300 taatgaaata amtcataaat atccaacaaa gttttcagat aacaattttg ttgttccaaa 3360 atataattta aaattaactt cctattcaat tcaataccgc ggaccttatc tgtggaaaaa 3420 atttccagaw attgttaaca aaaagaaaac tcttcgagac atctccttag aacaatttaa 3480 aaatgaatca aaacgattat tattattcat ggatctcaat aaattcgact ttaaatgttt 3540 cttttaaagt aaactamaat aaaaaaaaat aaaaaaaaac agattcaatt atatcaatga 3600 tatatgataa tatgaataag aaaagaaaga aaaaggaaaa aaaaaaaaaa aaaaaaaaaa 3660 aaaagaataa gaaatccaag tgatgatgtc acaagtgatg atgtcataag tgatgatgtt 3720 acatattcat ttagtgttaa ttatgagatt ttattttttc aatttctttt atattcaatt 3780 tttttttttt ttttttttta tattaactta tttcttaaac attttagcaa ctgttaagca 3840 tcttaacgct tcttaattaa ttcttatttt ttattttttt tattatatta gttattttct 3900 atttcatttc taattgtttc ttttatattt atatttgtat gttacggtta tgtagaattt 3960 ttaatggggc tcggcgataa ggcgaaatgt gccttcttct tgctccagcc atcaatttat 4020 ataaatgtat atgcaatgta aacatttttt aacggcgaaa taaattacac taaaaaaaaa 4080 aaaa 4084 // ID LOA-3_CQ repbase; DNA; INV; 4725 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4725 RA Kojima K.K. and Jurka J.; RT "LOA non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 150-150 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 115..888 FT /product="LOA-3_CQ_1p" FT /translation="MNHFKDHDFLINYKFGQTTLRRKLPNKENTNENAEXM FT GNPNDNQRRNDQETAKTPENQGEVCVSGPSGIQKPLVNGKSSNYDHQHQKR FT SGSFANQNNSQLSNGNMASKYSTQYKKRSGSFVNQNYKQKYNTNKHSNNYP FT KXQNCSGSSANQDNKQPRSENKRAGTYMNPHGFWKSTTPIDHDQKATGQPR FT HKKYPQAMLDRKADGELRARNLSRPQKSLDRPEIPSGSGATLDQSLSPPSN FT ITRSSGPSKDRKKDLKQ" FT CDS 960..4631 FT /product="LOA-3_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MSKSIKFIQINLHHARGASGALSQKCLKENVSVALIQ FT EPWAHKGKVLGLDTQNSTIIYNPLNDKPRTAILIKRDLNYVPITEFILKDI FT VAIKLEIPTAQGKTEAYIASAYFPGDENEVPPPGVAAFIQFCKQNNKQFII FT GCDANSHHTVWGSTDINNRGEYLYEYISANNIDLCNKGNQPTFITKTRREV FT LDLTLCSPFISPRVRKWQVSDEESLSDHRQIEFEYDAGEIITVSYRNPRKT FT NWELYKNKVKSIQIDKIEDKNHLEKAAKNITDVIISAFNESCPVRKQKSNR FT DVPWWNDALGKLRKKARRLFKQSKKTKQWEQYRKALTAYNKELRKSQRKSW FT RNNCEKIEKTPDVAKLQKALSKEHSNGLGSLKKPDGKFTENSAEVLEMMME FT THFPGSMIVSSDSGEMEEDTNSGLEWTIGRNTNLPSTIFTRSKVAWAMNSF FT QPFKSPGLDGIFPALLQYGGNKLLESLTEIFKASFKLGYIPKIWSKTRIVF FT IPKVGKKDKTAPKSFRPISLTSTMLKIMEKIIDEYIKSNILKTFPLSKAQF FT AYQQNKSTITALDTLVTKLEKNLEAKEIALTAFLDIEGAFDNASYSSIKKC FT MEKRGFDPCIIQWIMTMLNNREVFAELGMSTITVKTTRGCPQGGVLSPLLW FT SLVVDDLLNKLTTEGFEVIGFADDVVILTRGKFAHVLSERMQYALNLTSQW FT CANQGLSINPAKTILVPFTRKRKFTLKTLSLNGSNLKYSTTAKYLGVILDE FT KLNWNSHLEQVINKATAALWVSRNTFGKQWGLKPRMIHWIYTAIIRPRITY FT AALIWWPKTKQLTAQNKLQKLQHLISTSITGASRNTPTKALNAALHILPLH FT QFVQMEAAKSVLRLRRTNNISEYNSVEHLQILAELEINQGVFEITDWMESK FT LDLDVPFKVVETNRLEWESGGPTIPPGSIIFYTDGSKMGNKTGAGITGPGL FT NISIPMGQWTTVFLAEIYAIVECTAICLKRNYRFAKICIFSDSQAALNALK FT SPTCQSRLVWECRTLLKQLASRNRVHLYWVPGHRGIDGNEIADLLARRGSD FT GHFIGPEPFCGVSKSTLNMEFTQLEKKLIQLNWVKAHDMRQSKMFIVPSKQ FT KFAQLMNMNKKYLRIYIGLITGHCPSRYHLKKIGLSPIDICRICNCETETS FT KHLICECSAVSAKRRRIFNKGIISPNDVWLENPGTVVDFILEILPNWGTSQ FT CQPMTVTLNGNVSS" XX SQ Sequence 4725 BP; 1733 A; 973 C; 932 G; 1083 T; 4 other; atctacatcg aaagccaaaa cgatggtatc aacaccaaca attggagagt catcgaaagg 60 aagatcatct acgaaaagca cgtcgaatgg ttgtttacgg ttgatgaggc gtccatgaac 120 cactttaagg atcatgactt tctcatcaac tacaagttcg ggcaaacaac tctgcggagg 180 aaactgccga acaaagaaaa taccaacgaa aatgcggaaa wtatgggcaa tcccaacgac 240 aaccaacggc gcaatgacca agaaaccgct aaaaccccgg agaaccaagg tgaggtttgc 300 gtatcggggc ctagtgggat ccaaaaacct ctggtcaatg gtaagtcatc aaactacgat 360 catcaacatc aaaaacgctc tgggtccttt gcgaaccaga ataattcaca actaagcaat 420 ggaaacatgg cttcaaaata ttctacacaa tacaaaaaac gttctgggtc ctttgtgaac 480 cagaactaca aacaaaaata taacaccaat aaacactcaa ataattaccc aaaaaamcaa 540 aattgttcag ggtcctctgc gaaccaggac aacaaacaac caagaagcga aaataagcgt 600 gcagggacat atatgaaccc acacggtttc tggaagtcca caactcccat cgatcatgac 660 cagaaagcta ctgggcaacc aagacataag aaatacccac aagctatgct ggaccgcaag 720 gccgatggag agctgagggc cagaaacctg tctaggcccc aaaaaagcct ggatcgccca 780 gaaatcccct caggctctgg cgccacactg gatcaatccc tatcgccgcc atcgaatatt 840 acccgatctt ccgggccttc taaggaccgg aagaaggacc tgaaacaata ataacaaaac 900 ctcagtacaa aamaaaaaaa acagaacgaa aaatgccgaa taaatgaaag caatacaaaa 960 tgagcaaaag tattaaattt attcagataa acctccatca tgcgcgggga gcatctggag 1020 cgctcagtca aaaatgctta aaagaaaatg tgagtgtagc attaatccaa gaaccatggg 1080 cgcacaaagg gaaggtwctt ggattagaca ctcaaaatag tacaattatt tacaacccac 1140 taaacgacaa accaagaaca gcaattctga ttaaaaggga cttaaactat gtaccgatta 1200 cagaattcat acttaaagac attgttgcga tcaaactgga aattccaaca gcacagggaa 1260 aaactgaagc ttacatagct tcagcatact tccctggtga cgaaaatgaa gttcctcccc 1320 caggtgtcgc agcattcata caattttgca aacaaaacaa taaacagttc atcattgggt 1380 gtgacgcaaa ctctcaccat acggtctggg gaagcaccga cattaacaac aggggtgaat 1440 acctttatga atacatatct gcgaataaca tcgatttgtg taacaaagga aatcaaccta 1500 catttattac aaaaactaga cgagaagttc ttgatctgac gctctgtagt cctttcatct 1560 caccacgagt tcggaaatgg caggtgtcag atgaagaatc tctgtcggac cacagacaaa 1620 tagagtttga atatgatgct ggggaaataa tcacagtttc atatagaaac cccaggaaaa 1680 ccaactggga attatacaaa aataaagtaa aatcaatcca aatcgataag attgaagaca 1740 aaaatcacct agaaaaagcc gctaaaaaca taacagatgt gattatatct gcgttcaatg 1800 aaagctgtcc cgttagaaag caaaaatcga atcgagatgt tccatggtgg aacgatgctc 1860 tcgggaaact caggaaaaaa gctcgtcgat tatttaaaca atccaaaaaa acaaaacagt 1920 gggagcaata ccgaaaagcc ttaactgcct acaacaaaga acttagaaaa tctcaaagga 1980 aaagctggag gaataattgc gaaaaaattg agaaaacccc ggacgttgca aaactccaaa 2040 aagctctttc caaagaacac agtaatggtc tgggtagctt aaagaaacca gatgggaagt 2100 ttactgaaaa ttccgcagaa gtgttggaaa tgatgatgga aactcatttt cctggttcta 2160 tgatagtctc gagtgatagt ggagaaatgg aagaggatac gaacagtggg ctagaatgga 2220 ccattggtag aaataccaac cttccaagca ccattttcac ccggtctaaa gtagcttggg 2280 caatgaattc ctttcaaccg ttcaaatcac cgggattaga cggaattttt ccagctctcc 2340 tgcaatatgg agggaacaaa ttacttgagt ccctcacaga aatttttaaa gctagtttca 2400 aactaggata cattccaaaa atctggagta aaactcgtat tgtgtttatt ccaaaagttg 2460 gaaaaaaaga taaaacagct ccaaaatctt tcagaccaat aagtctaaca tcaaccatgc 2520 ttaaaatcat ggaaaagata atagatgaat acataaaatc aaacatctta aaaacctttc 2580 cactaagtaa ggcacagttt gcgtatcaac agaataagtc aacaattaca gcattggata 2640 ctcttgttac aaaactggag aaaaatctag aggctaaaga aattgccctc acagcttttc 2700 ttgacattga aggagctttt gacaatgcat catacagctc catcaaaaaa tgtatggaaa 2760 agagagggtt tgatccctgc ataatacaat ggatcatgac catgttgaac aatcgtgaag 2820 tatttgccga actaggaatg tcaacaatta ctgttaagac cactaggggt tgtccacaag 2880 gaggtgtact gtcaccgttg ctgtggtcac tagttgtgga tgatcttctc aacaaattaa 2940 caacagaagg ctttgaagta attggctttg ctgatgatgt ggtcattttg actcgcggaa 3000 aatttgccca cgtcttgtca gagagaatgc aatatgcttt aaacctcacg tcacaatggt 3060 gtgcaaatca agggctcagc attaaccctg cgaaaactat tttagttccg tttacgcgta 3120 agagaaaatt caccttgaaa acactatcgc taaacggctc aaacctaaaa tattcaacga 3180 cagcaaaata tcttggagta atccttgacg aaaaattgaa ctggaactct cacttggagc 3240 aagttataaa taaggcaacg gctgctcttt gggtgagcag aaataccttt ggaaagcaat 3300 gggggctaaa accgagaatg attcattgga tctatacggc aataataaga cccagaataa 3360 catacgctgc tcttatatgg tggccaaaga caaaacaatt gacagctcaa aataaattac 3420 aaaaactaca gcatctaata agtacatcta ttacaggtgc atcgcggaat acaccaacaa 3480 aggccttaaa tgccgctctt cacattcttc ccctgcatca attcgtgcaa atggaagctg 3540 cgaagagtgt tttaaggcta agaagaacaa acaatatatc agaatacaac tccgttgaac 3600 atttgcaaat cttagccgaa ttggaaataa accaaggagt attcgagatt acagactgga 3660 tggagagtaa gttggaccta gacgtaccgt tcaaagtagt tgaaacaaat cgcctagagt 3720 gggagtcagg agggccaact attcctccag gatcgatcat attttacacc gatggatcta 3780 aaatgggaaa caaaacgggt gctggaataa ctggtcctgg actgaatatt tcaataccta 3840 tgggacaatg gacaacagta tttttagcgg aaatatacgc tattgtggaa tgtacagcaa 3900 tctgtttaaa acgaaattac agatttgcaa aaatatgtat cttctcagac agtcaagctg 3960 ccttgaatgc attaaaatcg ccaacatgcc agtccaggct tgtttgggaa tgtagaacac 4020 ttttgaaaca attagcatcc agaaaccggg tacatctgta ctgggtcccg ggtcaccgag 4080 gaatagatgg taatgaaata gcagaccttt tagcaagacg aggttcggac gggcatttca 4140 taggcccaga accattctgc ggagtctcga aaagcacact taatatggaa ttcacacaat 4200 tggagaaaaa gttgattcag ttgaactggg tgaaagcaca cgatatgcgt cagtcgaaaa 4260 tgtttatcgt cccctccaaa caaaaattcg ctcaactcat gaatatgaat aaaaaatatc 4320 tcagaatata catcggtctc ataacaggac actgtccgtc tagatatcac ctgaaaaaaa 4380 ttgggctcag tccaattgat atctgtcgga tttgtaactg tgagactgaa acatcaaaac 4440 atttgatttg cgaatgcagc gctgtttctg caaaaagaag acggatcttc aataagggca 4500 taataagtcc aaatgatgtc tggctagaaa accccggcac agtagttgat ttcatcctag 4560 aaatcttgcc aaactggggt acatcgcagt gtcagccaat gaccgtaacc ctaaatggta 4620 acgtgtcatc ctgaaatttg cgataacaaa atgggtatac cgcaatagat caactcaatg 4680 gtcgcagtgg ttctaaaccc aacaaaaaaa aaaaaaaaaa aaaaa 4725 // ID Sola1-2_BM repbase; DNA; INV; 3302 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.07, Created) DT 07-JUN-2010 (Rel. 15.07, Last updated, Version 2) XX DE DNA transposon - consensus sequence. XX KW Sola; DNA transposon; Transposable Element; Sola1-2_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3302 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 946-946 (2010). XX DR [1] (Consensus) XX CC >96% identical to consensus. XX FH Key Location/Qualifiers FT CDS 949..2151 FT /product="Sola1-2_BM_1p" FT /translation="LTHYFKEDGRGKCIPPNKTSNAIKESVKSFIKRLPAI FT PSHYCRKDSTKLYLPVEFKNIKNLYRCYKEDLISKGVDVVSEKVFRQIFTT FT DFNIGFHVSKKDKCIKCLRFEGQKPEDCAEFQDHLEEKKASKKRLECHRQL FT GKENPSILCTSFDLQKVLNTPHGNNMLLFYSRKYAVYNLCFYESITRNGFC FT FIWGESEAKRGANEIATIIQKYIENVDSRGIITSLILYSDSCPGQNKNKIV FT LATIHNALLQSNNLQAIQMNYLLPGHTEMSVDSIHSTIEQSVRNITVWAPS FT QWATISQLARKEPAPYNVENLTHEDFLNFDDMSEKYFKGNLVGKISKIRTA FT TFKKSHPNQMKVKFSMKDNASEEIIQIIAKPRAINRRYNSSLPITKAKYKQ FT PGPKEIV" XX SQ Sequence 3302 BP; 1114 A; 522 C; 540 G; 1125 T; 1 other; cgaggggtga aaggctattt ggtttaagcg acatctaggg tgatattgag ttatatacat 60 atacggaatc agcgaatttt tctccgtcga atgatgccgg atttattaaa atcagagcac 120 ttttttggcg tttgaaacaa tactattttt gggttgtttc caaagtggca gtgtattatc 180 caacgttgat tttgaatttt tttttagtaa tgtacttaaa ccatacaatg atattaagag 240 acgagacata aaccagttac cttttctcag aaaaataatc caataagagt taaagcgtgt 300 ttcctgtttc aattttttat taagcagtaa cttaaatcat tttgacttct accaattaca 360 tggtgtgatg tctcttaaac catttgtccg tcacccaaat acgacatgga gtgtcgctta 420 agctctttca gctgcatcta ttttccatgc agggtgctac ttaaatcaag tagttctttt 480 gtttcagata ttttgcttag ccgttaaaac attttggtcc tattatttta ttaaatgcat 540 gtcgcttaat acagacaata aaatgaagta atgcaatatg aaacattttt tcggtataaa 600 taaaaatacg ggtcggtctt tcaatcatta aattttaata tgatacattt cattcgtgtc 660 gcatgatacc atatttatta catctcccgg gagttaaaaa tcgtgatctt tcaaccggac 720 tgccgcgcgc aggcgcttca gtcgaactcc agagcgcatc gaggagtgac gtgacgcgca 780 gtcgccgcca gtgtttacgt aaaacgtctg tctttgaaat cgaatcgtaa gtactttaat 840 ttcattattt acgtaaaatt ttcaattgag atgaaatgta ataaagttat ttgtttttat 900 tatttttatt agtctttgat tatgttactt cttctattaa gtaggtagtt aactcattat 960 tttaaagaag atggaagagg aaaatgcatt ccccccaaca aaacatctaa cgcaatcaaa 1020 gaatcagtaa aatcttttat caaaaggtta cctgcaatac catcgcacta ctgccgaaag 1080 gatagcacaa aactatatct gcctgttgaa tttaaaaata taaaaaattt ataccgctgt 1140 tataaagaag acctcattag taaaggagtc gatgttgtaa gtgaaaaagt attccgtcaa 1200 atattcacta ctgattttaa cattgggttt catgtgtcta agaaggataa atgcataaaa 1260 tgcctacgtt ttgaaggaca gaagccagag gattgtgccg aatttcaaga tcatttagag 1320 gaaaagaagg catcaaagaa gcggttagaa tgtcatcgac agttaggtaa agaaaatcct 1380 tccattctgt gcacatcttt tgacctccag aaagttctca atacgcctca tggcaataac 1440 atgttactat tttattctag aaagtatgct gtttataatt tatgttttta tgaaagcatc 1500 acaaggaatg gtttttgttt tatctgggga gaatctgaag caaagagggg cgcaaatgaa 1560 attgctacta ttatacagaa atacatagag aatgtggatt ctcgaggtat cattacatct 1620 ttgatcttgt attcagattc ttgccctgga caaaacaaaa ataaaatagt ccttgcaaca 1680 atacataatg ctttgttgca atcgaacaat ttacaagcaa ttcaaatgaa ttaccttctc 1740 cctggccaca cggaaatgtc ggtagacagc atacactcta caatcgaaca atccgtaaga 1800 aatattactg tatgggcccc ttcgcaatgg gctacaataa gccagctagc aagaaaagaa 1860 ccagcacctt ataatgttga aaatctgaca cacgaggatt tcttaaattt tgatgatatg 1920 tcagaaaaat attttaaggg taaccttgta ggtaaaatta gtaaaatccg tacagcaaca 1980 tttaaaaaat ctcatccaaa tcaaatgaaa gtaaaatttt caatgaaaga taatgcatcc 2040 gaagagatta tacaaattat tgcgaaacct agagctatca atcgtagata taattcaagt 2100 ttgccgatta ccaaagccaa atacaaacag cctggaccta aagaaattgt gtgacactgg 2160 ggttataccc ggagttcaga acaatatcgg ttgtgattaa ttatactctt agtaagtttt 2220 atgacagccg atctcaagta atactgaata aataatacct aattatagtt gattttccta 2280 cattcagact gaatttactt gattaatatc acatcagtaa aagtaataat actttataat 2340 gtacaaaata ctaatacgta ttaatactta actgatgtga taatcaaatc agtctgaatg 2400 taggaaaatc gtgtataatg aattaattat agtcgatttt cctacactct gattgatttt 2460 acttgattat tatacgaaat atnaatactt tgactgatgt tgttaattaa tatgttgtat 2520 attaatcaag taaaatcagt ctgatgtagg aaaatcaact ataattatgt attaataata 2580 attgattatt atacgaaata ttaatagttt aactgatgtt gttgattaag actaaaaaat 2640 aaataaatat aatcgatgtt cccgcattga gaccgatcta cttaattatt tttctaaata 2700 aatactttaa ctgttgttgt gttttgtgca tttctttatt tattcaatat aagcacctga 2760 gattttcagt gattgcgata tatttaggtt gttttaacag ccactgatag tattagacgt 2820 aaaaaaatgt aacatcaaac ggtaaatttt caatattata tgtcacataa gtaaattcgc 2880 ctgtcagtca ttttaatcta aattggtagt tgagataaat agcttgctac tattaatttt 2940 gggctaagtc acttcatgct ttttggctta ctctgaaaat atattgcagc gtcgcttatt 3000 ttttttccct gtagccaatt gtggttatga agtcacttaa atctttattc catactctga 3060 aaaataaggg cttaaggcaa tttgtattaa gcgacgttta tctttgtaat taaaggtccg 3120 ttttattcgg tattagcatc aacaacttga tgtttttaat tgaatttcaa ttaagtttac 3180 tccattcaaa ttgctgaaga agtttttttg ttatgatttt ttttccaaaa ttttaaactt 3240 taaatggctc tcctgtaaag ttgcaaaaat gtcgcttaaa ccaaatagcc tttcacccct 3300 cg 3302 // ID Gypsy-179_AA-LTR repbase; DNA; INV; 163 BP. XX AC supercont1.143; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-179_AA_; KW Gypsy-179_AA-I; Gypsy-179_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.143; Positions 1040073 1039911. XX SQ Sequence 163 BP; 40 A; 34 C; 37 G; 52 T; 0 other; tgtgataatg tggctaagac tgttgtatat ctctgtatgc gtgaggctca tataagcttg 60 accataccta cttcatagga gaggcgagta gtcttactcg gtccaccatc gtgcataggt 120 cacattgcgt atcggtctat ttcactgtaa gttagttacc aca 163 // ID Gypsy-103_AA-LTR repbase; DNA; INV; 207 BP. XX AC supercont1.322; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-103_AA_; KW Gypsy-103_AA-I; Gypsy-103_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.322; Positions 905452 905658. XX SQ Sequence 207 BP; 62 A; 34 C; 56 G; 55 T; 0 other; tgtagcatag ggcaactatt gggatcgata ccatacctta atatgtagag tagcagtgca 60 gagatcgagc gacgaagagt gtagagtagt agaactgggg acggcaacga ataaaggtgt 120 gcctcgcggc gcggagtatt tcaaaagaat ttataattat tcgaagtgtt gtgatttgtt 180 cctctatccg aagattccgt tactaca 207 // ID Kiri-32_AAe repbase; DNA; INV; 4318 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-32_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4318 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 727-727 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >96% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 270..1034 FT /product="Kiri-32_AAe_1p" FT /translation="MADDRRTLRGQSTTKLPGSAADDMSVSDLAKLMKSQF FT ATYQQSTNEEIRRLEHSFQSEMNKLRSDITSELENMRKESLKTTTDILNTV FT KTNKTECLAAVDRSMRANDLIVSGVPFVVGESLPDHFIKWCQSLGYTDETI FT PLVDIKRLARGSPSSGTVYLILVQFAITIQRNEFYARYLRSRKLSLTDIGF FT SSNQRIFVNENLEPAARELRSKALQMKKRGKLAGVYTRMGIVYVKKTANDQ FT EKPILSENDLXALL" FT CDS 1313..4156 FT /product="Kiri-32_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MASGSTRDTSLNWCLPGTVMKSALIANKLSICSLNSQ FT SICARKLCKLDELRQIAQSSAIDVICVTETWLNNKTDDSLLAIDGYNIIRN FT DREGRLGGGILIYIKQGIHYRLLDKSIRRSGVSYCEYAAVEIVLKSEKLFL FT VAMYNPPELNCVDTIDYLLSNYGTNYENLFFAGDFNTNLLNHNLTRTSQLM FT ALLSVHNMFSLGSEPTFFHNNGYSQLDLMLFNSKARVLRFNQIDVPNFSNH FT DMIFASLDFDIVHTPKKIEYRNYNAIDVNGLLDEFNSLDWHSYYRINDPNS FT LLDFFNSNILELHNKYVPVRSLFLKNQQTVWFNDIISKAIVDRDLAYKRWK FT ISKSSSDLKAFKTLRNKVTLKVRQTKGAYYERNLDTKLPSKELWKRLKNIG FT VGKGTKSSENEFESNXINANFANNFSKSNCILDTNNATTSVTIRNQFYFDQ FT INEIDVINALYHMKSNATGLDNLPIKFLKSINPLIVRPVTHLFNSIVTTSV FT FPDEWKKTKILPLKKKTHLNSLQNLRPISILSALSKVFERIIKNQVCHYIN FT QKNLLIKQQSGYREKHSTKTAMLKVFDDIGLVLDKGRPVVLLLLDFSKAFD FT SISHNVLCQKLTTRFGFSTHAKNLIKSYLSGRTQTVLNNGIFSNYLPISSG FT VPQGSILGPILFSLYINDLPEVLKHCKIHIFADDVQLYFGCMENTAIAISK FT KINEDLSNIHAWSIRNKLLLNADKTCALFLSHSQSNNILKPTLKLNNTVIA FT YLEEGVSLGITIQPNFKFDKFIYKQCGKIYASLRTLYAVSSFLSTDIKLKL FT FKSLVLPHFIACDFLLTESSMYAESRLQVALNACVRFVFSLNKYDRVSHLQ FT YHLIGCPFNKFSQLRCCLFLFKLARTKAPEYLYTKLKPLRSSRARKYAIPR FT HNTSLYGNSFFIRGISFWNCLPNDITLENSISAFKRRCTEHFNN" XX SQ Sequence 4318 BP; 1366 A; 816 C; 782 G; 1350 T; 4 other; tcgtgacaga aatggtgtac acgtaaaagt atagtgcaga cgcttatctt cgaaaccacg 60 gttattccca cgtataaatc ggttttcact gtgttccgta aattagagat atagttactg 120 cattttactg tgaagtttgt caagttgctg atcagtttac tgccgaaatt gagaaaacaa 180 atttgtgatt tctacaatat ttcgtgggaa gccataacgc aaacagcatc caaagtttcc 240 gttaaacggg cggctggtat cgagcaatca tggcagacga tcgtcgtact cttcgaggtc 300 aatctacgac caaacttcct ggctccgctg ccgatgatat gagtgtgtct gatctggcta 360 aattgatgaa atcgcagttt gctacctatc aacaatctac taatgaagag attcgacgat 420 tggaacactc atttcaatcc gaaatgaaca aactgcgttc ggacatcact tcggaacttg 480 agaacatgcg caaggaatcc ctgaaaacca caactgatat actgaacacc gtcaaaacga 540 ataaaactga gtgcctagct gccgtcgacc gaagcatgcg tgctaacgat ctcatcgtaa 600 gcggtgtacc gttcgtagta ggagagagtc tgccagatca ttttatcaag tggtgtcaat 660 ctctgggata tacagatgaa acgattccac tggtcgatat caaaaggctc gcaagaggtt 720 caccttcaag cggaacagtt tacctaatac tagttcagtt tgcaatcacc atacaacgaa 780 acgaatttta tgcccgttat ctacgctcaa gaaagctttc gctcacggac atcgggttct 840 catctaatca gcgtatattt gtgaacgaga atttggaacc agcggcgagg gagcttcgtt 900 caaaagcctt gcaaatgaaa aagagaggaa aactagctgg agtctatacg aggatgggga 960 ttgtttacgt caagaaaaca gccaacgacc aagagaagcc tatactatcg gagaatgatc 1020 tacamgctct actgtgaacc atcttcccgt gtctcccctc ctaaaagtta aaccactttg 1080 ttgctgctgc tgttgttgct gttggggttg ctggatgcca tttggatgct gctgcagtag 1140 atgctggatt gatgctatca ttcatcgagt gtggtttgtt aaacttatta atatacgcgt 1200 cgaataatta tgaatatgtt tgaatttttc tatgtaattt tgttagtttt tagtcaattg 1260 cgtgtgctct ttacagattt cctcttaatt gctttgttct tatcttatcc ttatggctag 1320 tggttctact agagatacct ctttgaattg gtgtttgcca ggaaccgtca tgaaaagtgc 1380 ccttattgcc aataaacttt caatttgtag ccttaatagt cagagcatct gtgctcgcaa 1440 gttatgcaaa ctagacgaat tacgacaaat agcccaatca agtgctattg atgtaatatg 1500 tgtgactgag acctggttga ataacaaaac cgacgattca ctactggcga ttgatggtta 1560 taacatcatt aggaatgata gagaaggaag gctagggggt ggcatcttga tatacataaa 1620 gcaaggaatt cattaccgat tgttggacaa atccatccgt cgttcaggtg tgtcgtattg 1680 tgaatacgct gccgtagaaa ttgtccttaa aagtgaaaag ttatttctcg ttgckatgta 1740 caatccacca gagctcaatt gtgttgatac cattgattat ttgttgtcaa attatggcac 1800 aaattatgaa aacttatttt ttgcaggtga tttcaataca aatttgttga atcataattt 1860 gaccagaact tcacaattaa tggcgttgct gagcgtccac aatatgttca gtttaggctc 1920 cgaacctact ttttttcata ataacggata ttcgcagttg gaccttatgt tattcaattc 1980 aaaggcacga gttcttaggt ttaaccaaat agatgttcca aatttttcca atcacgatat 2040 gatttttgca tcacttgatt ttgacattgt tcacactcct aaaaaaattg aatatcgtaa 2100 ctataatgcc atcgatgtca atggtctttt agatgaattc aactccttag actggcatag 2160 ttattatcgt ataaatgatc caaattccct gcttgatttt tttaattcca acattcttga 2220 attgcataat aagtacgttc ctgtacgttc attgtttctg aaaaatcaac aaactgtttg 2280 gtttaacgac attataagca aagccattgt agatcgcgat ttggcttata agcggtggaa 2340 aatctctaaa agctcatctg atctgaaagc atttaaaact ctccgtaaca aggtgacatt 2400 gaaagttcgc cagacaaaag gagcatacta tgaaagaaat ttagatacga aacttccctc 2460 taaagagtta tggaaaagat tgaaaaacat cggtgttggt aagggaacta agagttcaga 2520 aaatgaattt gaatccaatg wgattaatgc aaattttgct aataatttct ctaagagtaa 2580 ttgtattttg gatacaaata atgctactac cagcgtgacc atcagaaacc aattctattt 2640 cgaccaaata aatgaaattg atgtaatcaa tgctctttat catatgaaat caaatgccac 2700 tggattggat aatctaccca taaaattttt gaaatcaatc aatccactta tagttagacc 2760 tgttacacat ttattcaatt ccattgttac aaccagcgtt tttcccgatg aatggaaaaa 2820 aactaaaatc ttacccttga agaaaaaaac tcatctcaat agcttgcaga acctacgtcc 2880 cataagcatt ctttcagcgt tgtcgaaagt ttttgaacgc ataattaaga atcaagtctg 2940 ccattatatc aaccagaaga atttgctaat taaacagcaa tccgggtatc gggaaaaaca 3000 tagcacgaaa accgctatgt tgaaggtttt tgatgacata ggattagttt tggataaagg 3060 aagaccagtt gttcttctgc tcttagattt ttcgaaagct ttcgattcca tttctcataa 3120 tgtattatgt caaaaactca caactagatt tggattttct actcacgcta agaatctcat 3180 caaatcatat ctgagtggcc gaactcaaac ggtcctgaac aatggaatat tttccaatta 3240 tcttccaata tcatcgggcg taccacaagg ctccattcta gggccaattc tattttctct 3300 ctacatcaac gatctgccag aagtcttaaa acactgtaag atacatattt ttgcggatga 3360 tgttcagctt tacttcgggt gtatggaaaa tactgccata gctatctcca agaaaatcaa 3420 tgaagatttg tccaatatac atgcatggtc tattagaaac aagcttttgc tgaatgccga 3480 taaaacttgt gctttattcc taagccattc tcaatctaat aatatactga aaccaactct 3540 gaagctgaac aatactgtca ttgcgtattt ggaagaaggc gtaagcttgg gtattacgat 3600 tcaaccaaat ttcaaatttg ataaatttat ttacaaacaa tgtgggaaaa tatatgcgag 3660 tttacgaact ctatatgcag tttcctcttt cttaagtacg gacattaaac tcaaactctt 3720 caaaagttta gttctacctc atttcattgc ctgtgatttc cttttaaccg aatcgtcaat 3780 gtatgctgaa tcaaggttgc aagtagcttt gaatgcttgt gtaagatttg tgttcagttt 3840 gaataaatat gatcgagttt cccatcttca atatcatttg ataggctgtc catttaataa 3900 attctctcag ctaagatgtt gcctattctt attcaaactt gcgagaacaa aagctcctga 3960 atatctttat acaaaactaa agcccttacg tagttcccgt gcaagaaaat acgctatccc 4020 acgtcacaac acgtccctat acggcaactc ctttttcata agaggtattt ctttctggaa 4080 ttgtttaccg aacgatataa ccctagaaaa ctcaatttca gcttttaaga ggagatgtac 4140 agagcacttt aacaattaaa ttagatttaa gcgatgaatt attgaattgt aatcgaatta 4200 gttttgatat gttgatgcac ctttctgtgc ccctttctwc aaaaggtaac aattcaaaga 4260 gacaatagtc ttatgttact taatgagaat aaataaataa aataaataaa ataaaaaa 4318 // ID Gypsy-101_AA-I repbase; DNA; INV; 5743 BP. XX AC supercont1.55; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-101_AA_; KW Gypsy-101_AA-LTR; Gypsy-101_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5743 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.55; Positions 886151 880409. XX CC Positions [4180-4656] - Integrase core CC 'ATCT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 389..1876 FT /product="Gypsy-101_AA-I_2p" FT /translation="MNTHFSIFSPLQSVRNEVMQEFVRSVSNLRMGIDQEN FT NNSRTVDNQTQRSSKTAGEQVPGPSTSDREESDVGAVGLPESSSRRVLKSY FT GKAIEKDVGSISGAIAAALYPFALQSNHAGGSRNEGSHDSNRSEKFIRKIS FT LNKQSKSRPNPLAETSDSVAESDSQSSASSDPPVERYREKRHSRRTQTKSK FT PVSEWNIKYDGKDHGQSLMKFIKEVEFYAKSENVSKRELFRSAIYLFRDQA FT KSWFMSGVENEDFANWDELVTELKREFLSPDHDHVNEIKAISRKQGPKERF FT SDYLSELQKIFNSLTKPMSERKKFEIVYRNLRSDYKGHAVASNIDNLADLK FT RFGRQLDSTYWYKYAQNIQENNASRNKAQVNELVRETKYQPKSSDENSKKS FT FKSRNFYRSRKEGPSEEDQPANNPSRNNPKNSKSEAKKPEEPRAPQSGYDT FT YVPPQEGICFNCRGYGHHHTQCNQPRYKFCLRCGLHQVETKNCPYCAKNFN FT " FT CDS 1819..5040 FT /product="Gypsy-101_AA-I_1p" FT /translation="MWFTPSRNQKLPILCKKLQLDYLRGQVSPKVPKNPSF FT PTLQDNLHNSGYNRVQSSDYSAPEEAQLEELFVRVENDDRPFVQISVFDTP FT IIGLLDSGAHRSILGVGSLKLVKTFKLKLFPASIDLVTASGQKLEVVGYVN FT LPMCFNGQTKIMATLVVPNLKRRLLLGTDFWRAFGIVPTVHSAAVEELDET FT LEEPSLSNDQLQELESIKSQFKVAIDGQLDTTPLITHRIELTEEAKQLAPV FT RINPFPTSPKRQDQINKELDSMLEAGIIERSYSNWALRLVPVDKPDETVRL FT CLDARKLNERTLRDSYPLPHADRILSRLGPCKYISTIDLSKAFLQIPLHPK FT SKKFTAFSVLGRGLFQFTRMPFGLVNSPATLSRLMDRVLGAGELEPSVFVY FT LDDIIVLSETFEEHLTLLRDVAARLNRANLSINIQKSKFCVSELPYLGYIL FT TNKGLKPNPDRVEAIINIERPNSIRALRRFLGMCNYYRRFIAQYSEIVRPL FT TDLLKNKPKSIRWNEFAESSFTKIKELLINAPLLANPDFNQPFSIHCDASD FT TAIAGVLTQEREGIEQPIAYFSRKLTGPEQRYFATEKEALAVLKSVEKFRC FT YVEGSKFTVITDASALTYILRSSWRTSSRLCRWSIELQRHDMIIKHRRGVD FT NVVPDTLSRSVEVLVLNQKGSDWYTNLLKNVHENPEKYKDFRVDNGVLKKL FT VSTQGDALDYHFEWKVCVPKDMRENVLVEEHDDALHLGTDKTMARIKKKYY FT WPNIANDVRLHIRKCSTCLESKPSNRAQNPVVGSPRLAAKPFQIVALDFIQ FT SLPRSKTGKAHLLVIMDVFSKFCLLFPVRKISAPQVCEILENNWFRRYSTP FT EIIISDNASTFLSGNFKDLLQKFHVQHWTNPRHHSQANPVERLNRTINACI FT RTYARSNQTLWDTRVPEIEYALNTTPHGATGFSPYRILFGHEIVGRGDEHR FT VDRDLRELSDTERLEKKLQIDENIHSLVSKNLKKQFEKNSHNYNTRHKTFA FT PVYTVGQKVYKRNFRQSSAPHSYNAKLGPCYLPCTVVARVGTNTYELADET FT GKSIGVFSAADLKPGEC" XX SQ Sequence 5743 BP; 1765 A; 1291 C; 1244 G; 1443 T; 0 other; aaattggcgc ccaacaacaa aacattaaaa tagcattcgt tcaaaaaatt gaattttcaa 60 aaaagaaccc cgtgttgagg ggggcaaaac aaattttgtt ttattagagc ttagagatcg 120 attgaaagaa gagaaggatt taaataattt ggatgttgat tttcttaggt gcaatcaaac 180 ggtagatgca gagattaaag tgatagatgc taacgtagaa gaaattcgta gatatctgac 240 gaacgaagaa cgttttgaag gtttcaaaga cagcttgaag tctcgacttg tgcattattt 300 tgcgcgagtt cgccgagctc aagagcacgc tgagacggac gaagacctga ctgatttcga 360 taagctcctc tgtgcaatac gagggcttat gaatacacat ttttcgatat tttcgccgtt 420 acaatcggtg agaaatgagg ttatgcagga gtttgtacga tcggtttcga atttgagaat 480 gggaatcgat caggagaaca ataattctag gaccgtagac aatcaaactc agagaagctc 540 gaagaccgcc ggagaacagg tgccgggtcc gagtacttca gatagagagg agagtgatgt 600 aggagcagta ggtcttccgg aatcttctag ccgtcgcgtt ttgaaatctt acggaaaggc 660 aatcgaaaaa gacgtcgggt caatttcggg agcaatcgcc gctgcgttgt acccattcgc 720 gctacagtca aatcatgctg gcggttctcg taatgagggc agtcatgatt ccaaccgttc 780 ggaaaagttt attcgtaaaa tctctctcaa taaacaatca aagtctaggc caaatccact 840 ggctgagaca tcggattcag ttgccgagtc cgattcgcaa agttcggcgt cgtcggaccc 900 tcccgttgaa aggtatcgag agaaaagaca tagtcgtcgc acacagacaa aaagtaaacc 960 ggtttcagag tggaacatca aatatgatgg aaaagaccac ggtcaatcac tcatgaaatt 1020 cataaaagaa gtcgagtttt atgccaagtc agaaaatgtg tccaagcgag aattgttccg 1080 gtccgcgatt tatttgttta gggaccaggc caagtcttgg tttatgtccg gcgtggaaaa 1140 tgaagacttc gcgaactggg atgaactcgt gacggagctc aaacgtgagt tccttagtcc 1200 cgatcacgat cacgtgaacg agattaaagc catatcgcgg aaacaagggc cgaaagaacg 1260 gttttcagac tatttgtcgg aactacagaa gattttcaac tctctcacga aaccgatgtc 1320 tgagagaaaa aaattcgaga tcgtgtatcg caatttaaga tcagattata agggccacgc 1380 ggttgcgtcg aatatcgaca atttagccga cctgaaaagg tttggccgac aattagactc 1440 cacctattgg tacaaatacg cccagaacat ccaagaaaat aacgcgtcgc gaaataaagc 1500 acaggttaat gagctggtcc gtgagaccaa gtaccaacca aaatcttcgg atgaaaattc 1560 gaaaaaatct ttcaagtctc ggaacttcta tcgatcgcga aaggaaggac ctagtgaaga 1620 agaccaaccc gcgaataatc cgtcgcgcaa taatccaaag aactcgaaat ctgaggctaa 1680 gaaaccggaa gaaccaagag caccacagtc gggctacgat acgtatgttc cccctcagga 1740 gggcatatgc ttcaattgcc gaggctacgg gcatcaccat actcagtgta accagccacg 1800 ttacaagttt tgtctacgat gtggtttaca ccaagtagaa accaaaaatt gcccatattg 1860 tgcaaaaaac ttcaactaga ctacctgcga gggcaggtca gtccaaaagt acccaaaaat 1920 ccctctttcc caacgctaca agacaatctc cataattcgg gctataatcg agtacaatca 1980 tcggattata gtgcccctga agaagcccaa cttgaggaac tgttcgtgcg tgtagaaaat 2040 gacgatcgtc cgtttgttca aatctccgta tttgacactc ctattattgg attgctcgat 2100 agcggagccc atcgaagcat attaggtgtg ggatcgttga agctcgttaa aacttttaaa 2160 ttaaaattat ttccggctag tattgaccta gtaacggcca gtggtcaaaa actggaagtc 2220 gtcggctatg tcaatctgcc catgtgcttc aacgggcaga ccaaaataat ggctacttta 2280 gttgtaccaa acttgaagag aaggctattg ttaggtacag acttctggag agcttttggg 2340 attgttccta cagtccactc agcggcagta gaggaactgg atgagacgtt ggaggaacct 2400 tcattgtcaa acgaccagct tcaagaatta gaatctatca aaagccaatt caaagttgcc 2460 attgatggcc aattagacac caccccattg atcacccatc gcatcgaact gaccgaggag 2520 gccaagcagt tggcaccggt tcgaataaat ccgttcccta cgtctccgaa acgtcaggac 2580 cagatcaata aagagctgga cagtatgcta gaagcgggca tcattgaacg ctcctacagc 2640 aactgggctc taaggctagt cccagtagac aaacccgacg agacagttcg actttgccta 2700 gatgcgcgaa aattaaatga acgcaccctt cgagattcct atccacttcc acatgcggat 2760 aggattctta gtcgattagg tccctgcaag tacatctcga caatcgactt gtcgaaggca 2820 tttttgcaaa taccactcca tccaaaatcg aaaaagttca ctgcgttttc cgtcctagga 2880 cgagggctct ttcagtttac ccgaatgcca tttggtttgg tcaacagtcc ggcaacacta 2940 tctaggctga tggaccgggt cctgggtgcg ggtgaacttg aaccaagcgt cttcgtctat 3000 ctagacgaca ttatcgtcct aagcgagacg ttcgaggaac acctcacgtt gcttagagac 3060 gtagctgcca ggttgaatcg agccaacctc tccattaaca tacaaaaatc aaaattttgc 3120 gtctcggaac taccttactt aggatatatt ttaactaaca aaggtctgaa acccaatcca 3180 gatcgagtag aggcgataat caatatagaa cgtcctaaca gcatacgtgc cttgcggcgt 3240 ttcttgggca tgtgtaacta ttatcggcga tttattgccc aatatagtga aattgttcgc 3300 ccgctaaccg atttactaaa aaacaaacca aaatcgatcc gatggaacga gtttgcagag 3360 tcatcattca ccaaaatcaa ggaactgctg ataaacgcac cattgttagc taatcctgat 3420 tttaaccaac ctttctccat tcactgtgac gcaagtgaca cggccatcgc cggtgtcttg 3480 acccaagagc gagagggtat tgaacaacca atcgcttatt tttcccgaaa attaactggc 3540 ccagaacaaa gatatttcgc cacggaaaag gaagctttgg ctgtcctaaa atccgtagaa 3600 aaattccgat gctacgtgga aggatcaaag ttcacggtca taaccgatgc ttctgctctc 3660 acatacatct tacgcagcag ttggcgcacg tcatctaggc tctgtagatg gagcatagaa 3720 ctccaacgtc acgacatgat catcaagcac cgacgaggtg tggataacgt ggtacctgat 3780 accctgtcaa gatcagttga agttctagtg ctcaatcaaa agggttcaga ttggtacacc 3840 aacttgctca aaaatgtcca tgaaaatccg gaaaagtaca aggacttccg agtggataac 3900 ggcgttctaa agaagctcgt gtctacccag ggagatgccc ttgattacca cttcgaatgg 3960 aaggtctgcg taccaaagga tatgcgggag aatgtcctag ttgaagagca tgacgacgct 4020 ctccatctag ggacagataa aaccatggct cgaatcaaaa agaaatatta ctggccaaac 4080 attgcaaatg atgtgcgtct gcacattcgt aaatgttcaa cctgtcttga gagtaaaccc 4140 tccaatcgag cacagaatcc ggtggttggt tctcctagac tggccgcaaa accgttccag 4200 attgttgctc ttgattttat acaatcatta cctcgtagta aaaccggaaa agcacattta 4260 ctggttataa tggatgtatt ttccaaattt tgcttgcttt tcccggtgcg taaaatcagt 4320 gctccacaag tgtgcgaaat tctagaaaat aactggttcc gccggtactc cacaccagaa 4380 attatcatat cagataatgc ttccaccttc ttgtcaggga acttcaaaga cctgttacaa 4440 aagtttcacg tccagcactg gactaaccca cggcaccaca gccaagcgaa tccggttgaa 4500 cggctcaacc gcactataaa cgcctgcatt agaacgtatg ctaggtcgaa ccaaacacta 4560 tgggacacca gagtgcccga aattgaatat gcgttaaaca ctactcccca tggagccacg 4620 ggatttagtc cctatcgaat cttattcggc cacgaaattg tgggtcgagg cgacgaacat 4680 cgcgtagatc gagatttgag agaactttcg gatacagaaa gactggagaa gaaactgcaa 4740 atcgatgaaa atattcattc attagtcagc aaaaacctta aaaaacaatt tgagaaaaat 4800 tcacacaatt acaatactcg ccacaaaacg tttgctcctg tgtacaccgt cggtcagaaa 4860 gtgtacaaga gaaacttccg gcaatcttcc gcaccacaca gctacaatgc gaagctaggt 4920 ccctgctatc tcccgtgcac cgtcgttgct cgcgttggca ccaacaccta cgaactagcc 4980 gacgaaactg gtaaatccat cggtgttttc tctgcggctg atttgaagcc cggtgaatgc 5040 tgaaaaacaa gtaaaaaaaa ttcaataaac tcaacatctt cgatacgtat caagcaaaac 5100 ttccttgtgt ttagccatgg ccaggctgga catccgttac acctgtaaca aaaacaaaat 5160 agtatattag tcctacaatc aaaccacaat gaaacactgt tcggtgacaa agatttttct 5220 cagaaaacaa cctcgaatgg cttgcttcat tgaatgcatt tcttcaatgg ctgtaagaaa 5280 tgtgaattga atggtacctc agatgatgat tgattaacag ctattgtttc atgcgcgcac 5340 ctttttctct gatgcattca atggaaagac ccattcagat cagatttcac tgtaaatagc 5400 agtttaaatt tacttacctg ttctagtaac tgcagttgaa aacttttgca actcgtccac 5460 atcgtccatt tgttgttttc gttctcacgt ttgcgatagt ccaatccgtc acaccgtttg 5520 acagttcttg tgtctttgtg ttggtgagtg actacctggc atatctttcc aaatgagtga 5580 aagagtataa cgcataatgc ccagtgctgc cagatctatg tgcaacacat cttgttttca 5640 aaacggaaaa ttaaaatgtt gttgattttt atgtaatgtt tcacctcttg ctcagtcaaa 5700 atgaatgagt ttcattcatt ttgaactggg aggacggggt aaa 5743 // ID Gypsy-70_AA-I repbase; DNA; INV; 5332 BP. XX AC supercont1.22; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-70_AA_; KW Gypsy-70_AA-LTR; Gypsy-70_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5332 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.22; Positions 3998182 3992851. XX CC Positions [4170-4640] - Integrase core CC 'CTTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(975..3257,3261..5168) FT /product="Gypsy-70_AA-I_1p" FT /translation="MSKVNVTTTIEPYRKGTPFGEWVERLEFYFSLNNVSP FT GDKKAYFITLSGPVIYHELKLLFPTSNLTDVSYDDMITKLRTRLDKTESDI FT IQRLKFNNRVQQPDETVEDFVLSVKLQAEFCSFGDFKDLAIRDRILAGVRE FT KSLRERLLNEDKLTLAIAERIIATWELAGKNAKNLNNNDDDQYGRIASLAP FT ASKVGTSMRKLLAVYNGGEMESSPRVPVRDRLGYRPYSRRVEDTTRYKKQS FT WRETEKFKPGQWRQKPSYADMICNFCGAKGHIKRKCFKLKNMRRDTVNMLN FT DYEPEASEDGHIANLLSRMRTEESDSDADDRDSGDLTCMMVSSINRIRDPC FT LVELMVEGKMLQMEVDCGSSVSVIGKDHYFSMFSKPLEKCNKQLIVVNGEK FT LKIEGEAIVSVKFNGTEAKLKFLVLNTSHKFIPLFGRTWLDVFFSKWRHFF FT SSTLTINNMIESDNNAEVENIKRVYKDVFIKDFSTPIKGFEAELVLKSDVP FT IFKKAYDVPYRLRDKVVDYLNKLEKEKVITPIKTSEWASPVVIVMKKNNEI FT RLVIDCKVSINKVIIPNTYPLPTAQDLFAKLAECKIFCALDLEGAYTQLAL FT SDKSKHFMTINTIKGLYTYNRLPQGASSSASIFQNVMDQILRDIDHVHCYL FT DDVLIAGKNIDECKAKLKLVLERLREANIKVNWEKCKFFVTHLDYLGHVLS FT ENGLLPCQSKVATIQKAKIPKNTTELKSFLGMINYYNKFIPHLSAKLSYLY FT NLLKNGVKFIDNNCNKAFEESKQALLSAQFLEFYDPKKPIIVVSDASGYGL FT GGVIAHIVDNVEKPICFTSFTLNDAQKSYPILHLEALALVCTIKKFHKYLY FT GQEFTVYTDHKPLVGIFGKKGQHSIYVTRLQRYILELSIYDFTIQYRPSSQ FT MGNADFCSRFPLEQSVPDEFDIEHIRSINFSNDFPIDFKVVAKVTKSDEFL FT QNIVFYMKNGWPENLKKCYVDVFSNQHDLEVVDECLLYQDRVIIPQVMQTD FT VLKLLHANHAGMVKMKQLARRSVYWFGINKDIEKFVPLCEECNSMAIINKP FT AEKFQWIPTSRPFSRIHIDFFHFNHHTFLLIVDSFSKWIEIEWMKGGTDCS FT KVLKRLVAFFARFGLPDVLVSDGGPPFNSHTFVSFLRKQGINVLKSPPYNP FT SSNGQAERSVRTVKEVLKKFLLEPEMLKLEMEDQLNLFLINYRNSCLTNDG FT EFPSEKIFSFNPKTIMDLLNPKKQYKNHLKPQPSDDRTVSNDTTSTNTKLH FT DSIDDLMAGDVVWYKNNNPHHRARWVKATFLKRFSKNIFQVQTGSVRLMAH FT RNQLRVASEGMRLRPNLTLSFSNRDGVLPVSSENEDPRGFAESVPDASEKS FT RKRNRPKALLDLTPELRRSKRKRTAKADSDFVYH" XX SQ Sequence 5332 BP; 1715 A; 891 C; 1140 G; 1586 T; 0 other; gtttggcgac gaagataaat tttcctctgt agtttttcct caagtggtcg aagattcata 60 gtggcagcca cacaagtgaa cttagtgtgg atataatcac gtttttcatc tggaagacga 120 cggcccggcg ctgcaggtga taaagatcat ccaaaggtca gccacaattg agcatctaca 180 tgattttgga ataacacacg aattggtaaa atctaagcca catcggtgga tgcggtaagt 240 taaggtgctt aagtgagtat aattgtatgg attgcacgat ataagtgaaa tttagtaatt 300 gctagaattc caatcaattt cccatatttt ccgccattgt tttaattggc atcacccttt 360 tgttcttaat tacaatatat ctggcagcca ttctcttaca gcaacgaaaa ggccgtttcg 420 attcgcaggt gcgaacaaga caacaatcta ggatcgaaca aaacattgtt ttgtttgaca 480 gcttttatca aaggaatctt ttgatgttta ttcttttatt tgcaaagttc taagagtgtt 540 ttatcttctc aaattaaact ctagattcga taagtgagtt gttggcgata ctgtggatcg 600 aaggttttcg aacaaggcgt ttcgaagtgg ctatccaaca cggcctggaa cgtgtggaac 660 agtggtggca caagcgaaga tttttaacga atccgaccgg taactgcaac tccgaatcta 720 tcagcaggtt agcaggaagt ggccaacttt tcggattaaa ccggacgtga cagtgcgttt 780 tggaagctgg caaggatcca tcacaattcg tcgttccgtt tgccctaccg ttggttgggg 840 tgagtttgga ctgtttcggt gagtcttgcc tttattcttt tttatattgt gctgatttgg 900 gaaattgttc agtgtttgca gagttatatt gccattgttt tttctttcaa catttactat 960 ttctctggtt gagtatgtcc aaagtgaacg ttaccacgac tattgagcca taccgcaaag 1020 gcacgccatt tggcgaatgg gtcgagcggt tggaatttta tttttcattg aataatgtct 1080 cacctggaga caaaaaagcc tactttataa cccttagtgg ccctgtgata taccatgaat 1140 tgaagctttt attccctacc agtaatttga ccgacgtgtc atatgatgac atgattacta 1200 aattgcgtac acggttggat aaaacggaat ctgacatcat tcagaggttg aagtttaaca 1260 accgtgtaca gcaacctgat gaaacggtcg aggactttgt tctctctgta aaattgcaag 1320 cagagttttg ttctttcggg gattttaaag accttgccat aagggatcgg attcttgcgg 1380 gagtccgaga aaaatccctt cgtgaaagac tgctaaatga agacaaactc acacttgcga 1440 tagcagaaag gataattgct acatgggaat tggccgggaa aaatgccaaa aatttaaaca 1500 ataatgacga cgaccagtat ggtcgaattg cttctctggc accagcttca aaagtaggaa 1560 ctagcatgag aaaactttta gctgtttaca atggcggaga aatggaaagc tctcctaggg 1620 ttcctgttag ggacagactt ggttatagac cgtattccag gagagttgaa gataccacac 1680 gatacaaaaa acagagctgg agagaaacag aaaaatttaa gccaggacaa tggaggcaga 1740 aaccaagtta cgcagatatg atctgcaact tttgtggagc aaagggacac atcaagagga 1800 aatgttttaa actaaaaaac atgagacggg acacagtcaa catgctgaac gattatgagc 1860 cggaggccag tgaagacgga catattgcga accttcttag ccggatgcgg acggaggagt 1920 cggatagtga cgccgatgat agagactcag gtgatttgac ttgcatgatg gtatcttcca 1980 ttaataggat tagagacccc tgtcttgtag agctaatggt tgagggcaaa atgttgcaaa 2040 tggaggttga ttgcggatct tccgtatcgg tgattggtaa agaccattat ttttctatgt 2100 ttagtaaacc tttagagaaa tgcaataaac agttgattgt agtaaatgga gaaaaactta 2160 aaattgaagg tgaagcgata gtttcggtaa agttcaatgg aactgaagca aaattgaaat 2220 ttttggtttt gaataccagt cataaattta tacctttatt tggaaggaca tggctggatg 2280 tatttttctc taaatggaga catttttttt caagcacatt gactattaac aacatgattg 2340 aaagcgataa taatgcggag gttgaaaata ttaaacgtgt atacaaagat gtttttataa 2400 aagatttttc gacccctatc aaagggtttg aagccgaatt ggtactaaaa tctgacgttc 2460 ccatttttaa aaaagcctat gatgttccat acaggttaag agacaaagtt gtggattatt 2520 tgaacaaact agaaaaggaa aaagtgatta caccaatcaa gaccagtgag tgggcgtctc 2580 ccgtggtcat agtaatgaaa aaaaataatg aaattagatt ggttatagat tgtaaggttt 2640 caataaataa agtaattatt cctaatactt atcctcttcc tacagcacaa gatttgtttg 2700 ctaagctggc agaatgtaag atattctgtg cattagattt ggagggagca tacactcaat 2760 tggctttatc ggataagtct aaacatttta tgaccataaa cacaatcaag ggactatata 2820 cttacaatag attaccacaa ggggcttctt cgagcgcatc gattttccaa aatgttatgg 2880 atcaaatttt gagagatatt gatcatgttc attgctattt agatgacgtc ctcattgcag 2940 gaaaaaatat agatgagtgt aaggcaaaat taaaacttgt gcttgagaga ttgcgagaag 3000 cgaacataaa ggttaactgg gaaaaatgta agttttttgt tacacattta gattatttag 3060 gacacgtatt gagcgagaat ggtttactgc cttgccaaag caaagttgca acaatacaaa 3120 aggccaaaat tccaaagaac acgacagaac taaaatcatt tcttggaatg ataaattatt 3180 ataacaaatt cattccccat ttgtcagcaa aactttcata tctttataat ttattaaaaa 3240 acggtgtaaa atttatttga gacaacaatt gtaataaagc gtttgaagaa agtaagcaag 3300 ctttgttgag tgcccagttt ttagagtttt acgatccaaa gaagccaata attgttgtat 3360 ctgatgcatc gggatatggt ttagggggcg taattgctca tattgtcgat aatgtagaga 3420 aacctatttg tttcacatcg tttacattga acgacgctca aaaatcgtat cccattctac 3480 atttagaggc attagctttg gtttgtacca tcaaaaaatt ccacaaatat ttgtatgggc 3540 aagagtttac tgtttataca gaccataagc ctttagtagg tatttttggc aagaaaggtc 3600 aacattcaat atacgtaaca aggcttcaac ggtacatttt agaattatcc atttatgatt 3660 tcactattca atacaggccg tcatcccaaa tgggaaacgc ggatttttgt tcgagatttc 3720 ctttagagca gtcagtacct gatgaatttg atattgaaca tataaggagc attaatttta 3780 gcaacgattt tcctatcgat tttaaagtgg ttgcgaaggt gacaaaatca gatgaattct 3840 tgcagaatat tgtattctac atgaaaaatg gttggccaga aaatttaaaa aaatgctatg 3900 tcgatgtttt ttcgaaccaa catgatttgg aggtggttga tgagtgtttg ttataccagg 3960 atagggtaat aataccacaa gtgatgcaaa ctgatgtttt gaaactgtta catgccaatc 4020 atgcaggcat ggttaaaatg aaacaacttg ctagaagatc agtgtactgg tttggaatta 4080 ataaggatat agaaaagttt gtaccactat gtgaggagtg taatagtatg gcaattatca 4140 acaaacctgc ggagaaattt caatggatac caacaagcag accgtttagt agaatacaca 4200 tagatttttt tcacttcaac caccacacat ttttacttat agttgacagt ttttcaaagt 4260 ggatagagat tgaatggatg aaaggtggta ctgattgcag taaggtttta aaaaggttgg 4320 tagcattttt tgctcgcttc ggattaccgg atgttttggt atccgatgga ggcccacctt 4380 tcaactccca tacctttgtg agctttttga ggaagcaggg cattaatgtg cttaaaagtc 4440 ccccatacaa tccctctagt aacggacagg ctgagcgttc agtgcggaca gtaaaagaag 4500 ttttgaaaaa gtttcttttg gaacctgaga tgttaaagtt ggaaatggag gatcaattga 4560 atttgttttt aataaactat aggaacagtt gcttgacgaa tgacggcgag tttccgtcag 4620 agaaaatatt ttcttttaat ccaaagacaa taatggactt acttaatcca aaaaaacaat 4680 acaagaatca tctaaaacca cagccaagcg atgatagaac tgtgtcaaat gacaccacaa 4740 gtactaatac taagttgcat gactctatag atgatctgat ggcgggtgat gtggtttggt 4800 acaaaaataa caatccacat caccgcgcac gctgggttaa agccacattt ctaaaacgtt 4860 tttcgaaaaa tatcttccag gtgcaaactg gaagcgttcg cctaatggcc catcgaaacc 4920 agctaagagt agcttctgaa gggatgcgtt taaggccgaa ccttacatta tccttttcaa 4980 atcgtgatgg tgtgctgcca gttagcagtg agaacgaaga ccctagaggt tttgcggagt 5040 cggtcccaga cgcatcggag aagagccgaa aacgaaatcg tccaaaggca ttactcgatc 5100 tcacacccga gctaagaaga tcgaaaagaa aacgcacggc caaagccgac agtgattttg 5160 tttatcattg atcaatgcgc ataaagtgtg atctgaattt tcaaaatgaa ttttctgaat 5220 agttagtctt aagtttaaac tttggattgt tttattagtt taagtcaata gcatgcatag 5280 aattctaaat ggaattttct gaaatatatc cctttcctta aaggggaaga ac 5332 // ID BEL-169_AA-I repbase; DNA; INV; 6294 BP. XX AC supercont1.326; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-169_AA_; KW BEL-169_AA-LTR; BEL-169_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6294 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.326; Positions 460053 466346. XX CC Positions [5351-5908] - Integrase core CC 'ACGCA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 17..6292 FT /product="BEL-169_AA-I_1p" FT /translation="MVNTRARMQNLTSPNRQASTCAVCDRPDSADNMIQCL FT KCSIWWHYSCAGVTDSIANRDWMCAKCLPAPAESTSNRSSASRKARLELSL FT RRLEEEKEFMKKQMALDLEKQYADQKYQLLEESLAEEDDIGNRSVRDRIDD FT IEKRSRSEQVSEWVQQQSLLAPDKVVNPQPLIESGLARGNPTRPLVLVSDE FT IGAEGGNGFNPPSQLDAPSKQQGDTVLPILNSDEEVVRLLQEQLNIYRENP FT SAHRMQQVRDQFERCFGRGLIDSVQQHQPTGAQRTMVNLQPDVSKGAIRKT FT SGPVQSTKLQHAAPHVNQASCDPVVTPLHRSTRPSGVASLPNRTTTKPLPA FT SSVGATARNPSARVNEGTEQDQPQPCFDGIHAVPHRTRDLFHPMDAADSPE FT RAYFTGTAHGLVNNPTLSASRNNAFAGQYPAYQAGNIQLPSTSNTGQIDGA FT LIAPPVFRPTPEQISARHVVPRDLPEFSGDPEDWPLFHSSFNNSTNACGYS FT HAENLARLQRCLKGDALKAVRYYLLSPESVPDVLKTLQTLFGRPEIIVNKL FT IRNVRDCPIPKAERLETLIDFGMTVRNLTQHLISAGQQAHLSNPVLLQELI FT EKLPANIKLQWAQHLIYYPVPSLQTFSDFMSGVVESVAKVVPYGGGQSRSD FT RPRAREKGFVHTHSETLSPNIVTIQRSSPSEKSCFVCSKLGHRVKECERFK FT GSSIEDRWKIVRSSKLCRSCLSPHGRRACKNLSFCGINGCQYRHHPLLHSP FT STTIDQPTKESHNYHHCGQSVLFRIVQVTLYGVSGSLDVFALLDEGSSTTL FT IERSLTEELDVYGPTVPLCLTWTGDMSRSERDSKMVTFDVSEVEKQKRYPI FT KNARTVSSLNLPAQTLRFKELQKQYVHLKGLPIRSYENAKPRLLIGLRDIS FT LAVPRIIKDGNNGPIATKTLLGWTLYGCFSGASKPEYLNFHACECEGLNEL FT VRTYINGECNDMQPARQMESEAEKRARYLLEQTTVRVGDRFQTGLLWKHDE FT VKMPESYPMALRRLECLERRMNRDLVLKENVHRQVAEYQEKGYAHRATIEE FT LRLADPRRIWYLPLGAVINPKKPEKVRLIWDAAAKVNGISLNSVLLPGPDL FT LTPLPAVLFRFRQFPVAVSSDIREMFHRILVTEADRHSQRFLWRNCPDQKP FT QIFLMDVITFGATSSPVSAQFVKNKNALEFVDRCPRAVEGILKSHYVDDYL FT DSFETVGEAKSVSSEVRAIHLKGGFELHNWSSNNKEVLENLEVAATKAEKH FT LTSNRDTNTERILGMLWNPEEDKLGFSTKFNDEIAMLIENGFRPTKRQILK FT CVMSLFDPLGLLACHLVHGKVLMQDVWRTGISWDEPVSDEIMRHWERWIRL FT FSNVQNLRLPRCYFQNASSALYNSLQVHVFVDASEVAYSAAVYFRIIGADG FT IVQCSLVAAKTKVAPLKYVTIPRLELMAATLGTTLLSFVIDGHTIQIKKRF FT LWSDSKTVLSWIHSDHRKYRPFVAARIGQILTTTNETEWRWVPSKLNVADE FT ATKWGKGPCFEANCRWFVGPDFLYTPETEWPKPNQVKVATTEEIRPCFLHH FT ESLPTPLIEFQRFSKWSRLLRAVAYMLRAVSIWRASGMEAKRGMRSVSKSL FT VRLSHDNLVDAEMLIWKQVQMEAYPDEESLLSDNCSIPKDKRANQLKSSSL FT YALSPVMDEKGIIRVDGRIGNAKQVPMTVKYPIILPRRHHATLLLVDHYHR FT EFLHGNNETVCNEMRQRFYVPQLRVLIRKVSKQCQRCKTVKARPEIPRMGL FT LPAARLSPFVRPFSFVGLDYFGPLLVKVGRSNAKRWVALFTCLTIRAVHVE FT VAYDLSTASCISCINRFVSRRGAPLEIHSDNGRNFVGAANILKEQIKRIEF FT EVATTFTTTNTKWVFIPPLAPHMGGSWERMVRAIKTALFSLPQDRKMDDEA FT LQTMLVKAEAIINSRPLTYLPLDSDEQEAITPNHFLLGSSSGTKQPAVPIT FT DNRNLYKSWELIEQQLDIFWKRWVREYLPTLTRRTKWFEEVKPIEPGSLVM FT VVDDAKRNGWIRGRVLEVVAGQDGRIRQALVQTSAGVFRRPVSKLAVLDLK FT DSGKVASDTHLYGRG" XX SQ Sequence 6294 BP; 1784 A; 1473 C; 1547 G; 1490 T; 0 other; aaatcttcaa aaaatcatgg taaacacccg tgcaagaatg cagaacttaa cctcacccaa 60 cagacaagct tccacttgtg cggtttgcga taggccagac agtgccgata acatgattca 120 gtgcctaaaa tgcagtatct ggtggcacta ctcgtgtgca ggagtgaccg actcgatcgc 180 gaaccgtgat tggatgtgcg cgaaatgcct gcccgctccc gcagagtcca cgtcaaacag 240 atcgtcagcc agccgtaaag ctcgactaga gttgagtctt cgtcgtttgg aggaggaaaa 300 agagttcatg aagaagcaaa tggctctgga tttagagaag caatacgccg atcaaaaata 360 tcagttactg gaagaatcgt tggcggagga agatgacatc ggaaatcgca gtgtacgaga 420 tcgaatagac gatattgaga aacggagccg ttcagaacaa gtttccgagt gggttcagca 480 gcaatcacta cttgcaccgg ataaagtagt aaatccacaa cccctaatcg aatcaggttt 540 ggcaagaggt aatccgacta gaccactagt gttggtgagc gacgagattg gcgcagaggg 600 aggaaacgga ttcaatccac cgtcccagct ggacgcaccg tcgaaacagc agggtgatac 660 agtattgcca atattgaaca gtgatgagga ggttgtgcgg ttgcttcaag aacaactaaa 720 tatttatcgt gaaaatccgt cggcgcatcg aatgcaacag gttcgagacc agtttgagcg 780 atgtttcggc cggggactga ttgactcagt tcaacaacat caacctactg gggcccagcg 840 aacgatggtc aacttgcaac cagatgtatc gaaaggagcg atacgcaaga cgtcgggacc 900 agtccaatct accaaacttc aacatgcagc tcctcatgtg aaccaagcat catgcgatcc 960 agttgtaaca cctctgcata ggtctaccag gccatcagga gtggcgtcgt tacctaacag 1020 aacaacaacc aagcctctac cagctagcag tgtaggagcc actgcaagga atccgagtgc 1080 aagagtaaac gaaggtacgg agcaagatca gccacaacca tgcttcgatg gcatacacgc 1140 agttcctcac aggacaagag atttattcca tccgatggac gcagctgact caccagaaag 1200 agcctacttc actggtacgg cgcatggcct agtaaacaac ccaacgctat cggcttcacg 1260 gaacaatgcc tttgctggtc agtatccggc ttatcaggca ggtaatattc agcttccctc 1320 aacatctaat accggacaaa tagatggcgc tttgatagca ccaccagtat tccggcctac 1380 tccggagcag atatccgcta ggcatgtagt gccacgagac cttccggaat tttcgggtga 1440 cccggaagat tggccactat ttcatagcag tttcaacaat tccacgaatg catgcggata 1500 cagccatgca gagaaccttg cgcgattaca acgttgcctg aagggtgacg ctctcaaagc 1560 cgttcgatat tatctgcttt cgccagaatc tgttccggat gtattgaaga cgttacaaac 1620 tctttttggt cgaccggaaa tcattgttaa taaattaatc cgtaatgttc gagactgtcc 1680 gataccgaaa gcagagcggt tagaaaccct catcgatttt ggaatgacag tgcgcaattt 1740 aactcaacat ttgatttctg ctggtcagca agcacacctg tcaaatcctg tacttctgca 1800 ggagctcata gaaaagctac cagcgaacat caagctccag tgggcccaac acctgattta 1860 ctatcctgtt ccatccctgc agacattcag cgattttatg tcaggagtag ttgaatcggt 1920 tgccaaggta gtcccttacg gaggaggtca gagtaggtca gatcgcccaa gagctagaga 1980 aaaaggattc gttcatactc attctgaaac gcttagcccc aatatcgtca ccatccagag 2040 aagttcacca tccgagaagt cttgctttgt ttgcagtaaa ttgggccatc gcgtaaagga 2100 gtgcgaaaga ttcaaaggga gtagtatcga agatcgatgg aagatcgtgc gatcatcgaa 2160 attatgtaga agttgcttaa gtccccacgg tcgacgggcc tgtaaaaatt tgagcttctg 2220 tggtatcaac ggttgtcagt accgccatca tccattgctt cactctcctt cgacaactat 2280 cgatcaaccg acgaaagaaa gtcacaacta tcatcactgt gggcagtcgg tacttttccg 2340 gatcgttcaa gttactttgt acggagtttc tggaagtttg gacgtatttg cattattgga 2400 tgaaggatct tccaccacac taatcgagcg cagtcttaca gaagaactag atgtgtacgg 2460 accaactgtc ccactatgct taacgtggac cggggacatg tcgcgctcag aacgtgactc 2520 gaaaatggtt acgtttgatg tgtcagaggt tgaaaaacag aagcgttatc caattaaaaa 2580 tgctcggacc gtgtcatctc tcaatctacc ggcccagacg cttcgcttca aagaactcca 2640 gaagcagtac gtacacttga agggcttgcc gatacgaagc tatgaaaatg ccaagccaag 2700 gttgctgatc ggtctccgag atatctcgtt ggcggttcca agaatcatca aggacggcaa 2760 taatggtcca attgcaacaa aaacattatt aggttggaca ttgtatggat gcttttccgg 2820 tgctagcaaa ccggagtacc taaatttcca cgcttgcgaa tgtgaaggct tgaacgagtt 2880 ggtgcggaca tacattaacg gtgagtgcaa tgatatgcaa ccagcacgac agatggagtc 2940 cgaagccgaa aagagagcac gttatctttt agagcaaacc acagttcgag tgggcgaccg 3000 ctttcagaca ggcctgctct ggaagcacga cgaagtcaag atgccagaaa gttaccccat 3060 ggctcttcga aggctagaat gcctggagcg aaggatgaac cgggatctgg tcctgaaaga 3120 aaacgtgcat cgacaagtag ctgaatacca ggaaaagggg tacgctcacc gggcaactat 3180 cgaagaactt cgcttggctg acccaaggcg tatctggtat cttccgttgg gggctgtcat 3240 aaacccgaag aagccagaaa aggttagatt gatttgggac gctgcagcaa aggtaaatgg 3300 aatctcgctg aattccgttc ttcttcccgg tcccgatctg cttacaccat tgcccgctgt 3360 tttgttccgc tttcgccagt tccccgtggc agtgtcgagt gacatcaggg aaatgtttca 3420 tcgaatacta gtcaccgaag ccgaccgcca ttctcaacgt tttttgtggc gaaattgtcc 3480 agatcaaaaa ccacaaatct tcttgatgga cgttataaca ttcggggcaa ctagttcccc 3540 tgtgtctgcg cagttcgtaa agaacaaaaa tgctttggaa tttgttgaca ggtgtcctcg 3600 agctgttgaa ggtattctga aaagccatta tgttgacgac tacctcgaca gcttcgagac 3660 ggtcggtgaa gctaaatcgg tcagcagtga agtcagagca atccatctga aaggtggttt 3720 cgaactccat aattggtcat caaataacaa ggaggttctg gaaaatctcg aagtagcagc 3780 tactaaggca gaaaaacacc tgacatcaaa cagagacacc aatacagagc gtattctagg 3840 aatgctttgg aatcccgaag aagacaagct tggattttca acgaagttca acgacgagat 3900 tgcgatgctg atcgaaaacg gtttcagacc aaccaagcga cagatattga aatgcgtgat 3960 gagcttgttt gaccccctcg gtttactagc ctgccatctg gttcacggta aagtgctcat 4020 gcaggacgtc tggcgtacag gcattagctg ggatgaaccc gtgagtgacg agattatgag 4080 acattgggaa aggtggatta gactgttttc caacgtgcag aacttacgcc tccccaggtg 4140 ttacttccag aacgctagtt ctgccctgta caattcacta caagtccatg tatttgtgga 4200 cgccagcgaa gtcgcctatt ccgcagctgt gtactttcga atcatcggag cagatgggat 4260 cgtacagtgt tctttagtag ccgctaaaac caaggtcgct ccattgaagt acgttacaat 4320 tccacggttg gagctgatgg cggctacttt gggaacgact ctgttgtctt tcgtaattga 4380 tggccatacg atccaaatca agaaacgatt cctgtggtcc gattctaaaa cggtattatc 4440 ctggattcat tccgaccacc gtaaatatcg cccgtttgta gccgctagaa tcggacaaat 4500 actaacaaca acgaacgaaa ccgagtggag atgggtccct agtaaattaa atgtagccga 4560 tgaagccaca aaatggggga aaggcccatg ttttgaagct aattgtcgct ggtttgttgg 4620 gccagatttc ctctacactc cagagaccga atggccaaag cctaaccaag tgaaggtagc 4680 tacaacggag gaaattcgac catgttttct acatcacgaa tctctgccta ctccactgat 4740 tgagttccaa aggttctcaa agtggagtcg ccttctaaga gcggtagcat atatgcttcg 4800 agccgtttca atctggagag caagcggaat ggaggcaaaa cgtggcatgc gatccgtgtc 4860 gaaatcgctt gttcgactat cgcacgacaa tcttgtggat gcggagatgc tgatttggaa 4920 acaagtacag atggaagctt atccagatga agaatcgttg ctgtcggata actgttcgat 4980 acccaaagac aaacgggcaa atcagttgaa gtctagctca ctgtatgcac tttctcctgt 5040 tatggatgag aaagggatta ttcgggtaga tgggcgcata ggaaacgcca aacaagtgcc 5100 aatgaccgtc aaatatccaa tcattctccc aaggcgtcat catgcaacac tccttcttgt 5160 ggatcattac catcgggagt ttcttcatgg aaacaacgaa acagtgtgca acgaaatgcg 5220 tcaacgattc tacgtgcccc agcttcgagt actgattcgc aaggtatcca agcagtgtca 5280 gcggtgcaaa acagtgaaag cacgtcctga aattccaaga atgggactcc taccagcagc 5340 gagactatct ccgttcgtgc ggccatttag cttcgtcggt ttggattatt ttggaccgct 5400 tttggtgaag gttgggcgtt cgaatgcaaa aaggtgggta gccctcttca catgcctcac 5460 catacgagca gtacacgtag aggtggccta cgacctatct acggcgtctt gcatttcatg 5520 tatcaatcgc ttcgtaagtc gccgtggagc gccactggag attcattccg acaacggaag 5580 aaattttgtt ggagctgcaa acattctgaa ggaacagata aagcggatag aattcgaagt 5640 tgctactacc ttcactacga cgaacacaaa gtgggttttc atcccaccat tagctcccca 5700 catgggtggc tcctgggagc gcatggttcg tgcgatcaaa actgctctct ttagtcttcc 5760 tcaggatcgc aaaatggacg acgaggcctt gcagacgatg ttggttaaag ccgaggccat 5820 aatcaactcc aggccgttaa cctacttacc acttgattcg gacgagcagg aagccataac 5880 tccaaaccac tttctactag ggagttcaag tggtaccaaa cagccagcag tcccaatcac 5940 ggataacagg aacctctaca agtcctggga gttgatcgaa caacaactgg acatcttctg 6000 gaaaagatgg gtacgagagt accttccaac gttgactcga cgaacaaaat ggtttgaaga 6060 agtcaaacca atcgaaccag gaagtttggt gatggtcgtc gacgatgcca agagaaatgg 6120 gtggatacga ggacgtgtat tggaggtagt tgcaggacag gatggtcgca ttcgacaagc 6180 tctagtacaa acgtccgctg gagtattccg tcgtcccgtt tcaaaattgg ctgtgctgga 6240 cttgaaggat tccggtaaag ttgcgtcaga tacccatctt tacgggaggg ggaa 6294 // ID Gypsy-8_DWil-I repbase; DNA; INV; 5298 BP. XX AC scaffold_180700; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_DWil_; KW Gypsy-8_DWil-LTR; Gypsy-8_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5298 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180700; Positions 3505940 3511237. XX CC Positions [2779-3321] - Reverse transcriptase CC Positions [4336-4812] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 1156..5244 FT /product="Gypsy-8_DWil-I_1p" FT /translation="MLHDEILQLTVSELKEKLKENNLSIKGNKAELQVRLI FT SFFESGVEEASFYEDSLERSLQVVNDQQEVQSIMAFTLRDIQDSLTVFDGS FT QQQDVDNWLADFENCAETVKWNDLQKFIYSRQLLKGAAKLFVSSEDGLTDW FT ISLKSALEREFSEKLSAKQVHKLLENRKKKQSESLIEYFYEMKSLAKKSTL FT DEESVIEYIIEGIPDSKQNKTVLYQARNFRELRENMKMYEKISTEKESKWS FT RASKFEKKQQDIAINEPRCYKCGGRNHIAKNCKESEFKCFKCNKPGHKANQ FT CSMQGKMFEKDSHKVQQLTQKLYNRVFKDVRFGNEVISAFFDSGSDISTIS FT ESAYKRVYPVPLSSDVKELLGIGSKKIYTKGSFELDTALDGVPIKMCFHVV FT RDRDTLYDGVIGNDVFEVVNATMGKQGLVFYGNTRSFGGNQVITRYSNPQD FT PPPVRDVSPIEELQNTFGQILTTRVENFSSEIELSHLSIELKSKVLEIISD FT YSPVKPANCPVKMKIILTDEIPVYQKPRRLSYEDQKRVDQQVKEWLEEGII FT RHSVSEYSSPIVLVPKKDGEKRLCCDYRRVNQKIIRDNFPTAVIDDVLHKL FT QEGRVFTTLDLCNGYFHVPVEENSKKLTSFVTQNGQFEFNFVPFGINNSAA FT VFTRYIFAMLRPLINEGILMLYMDDIIIPAKNESEGIERLKRVISLAEESG FT LKIKWKKCQFLQRKVNFLGYIIENGTIKPSKEKTSAVENFPVPRDVKGVQR FT YLGLTSYFRRFMKDFATIARPLTNLLRADVPFKMGIEELASFEQLKAGLSN FT PPVLRLFNPRSVTEVHCDASMYGYGAILLQKDSEDQQYHPVEYMSRRTTPA FT EEKYHSYELEVLAIIQALKKWRIYVMGIKIKIVTDCNAFAMTIKKRDVPLR FT VARWAIFLQDFNFEIEHRSGVKMKHVDALSRVYCLLSGDSLKSKIQFAQRK FT DEWISTILKVLEKGAHADYYVQYGILYKDPVKELIVIPTSMEQEIILVAHR FT QGHFGVKKTVDLVEREYYIPSLWSKVEVVVRSCMECIVSESKQGRKEGFLN FT VIDKGDEPLVTFHVDHVGPMELTKKRYNHILVVVDAFSKYVWLYPTKSTGS FT EEVVERLQKQSELFGNPKRIVSDRGSAFTSHIFKEYCESQKIQHLVIATGV FT PRGNGQVERVNRIVIALLTKLCASDPGAWYKHVGRVQQFINSSPPRSTKIS FT PFKILTGTEMRTSYDLELKSMLEEELLVELQNNKSEIRDTAKRNISKMQEE FT NRKTFNKCRKSASEYEVDELVAVKRTQFGSGLKLKGKYLGPYRVVRKMRHG FT RYAVEKVGDGEGPKGTTTVAEYMKKWCPSFGANVEVGWPNVGSRYGRGDFE FT " XX SQ Sequence 5298 BP; 1844 A; 830 C; 1325 G; 1299 T; 0 other; acttgggggc tcgtccagga accgtttggc ggtaggacta ttcatcaatc atcaaattgt 60 tgcagtctga gatccgttat tgtacatcag tcattaaatt gttgtacaaa aaacgttaaa 120 tccatgggaa aaattgtgaa attcagtcaa ttcttggaat aagtcgtaca caagagagag 180 tgagacagca gcagagattc ttggcaaacg aaaaaaaaaa aaaaaaagca catcgagcgt 240 ccactgaaaa tttttggcgc catgtggtaa gaaaaacaag aaagaaatcc ataagagcga 300 cagagacggc aacacggcaa cgaaccagag agacaacgac atgatcgtgc gaagctgaac 360 gaggcaccaa caagacgaga cctaagagag taggcaacga ggcactgagt gtgcaaacaa 420 cgaggcacaa agagtgtgcg aacgagtcga gagagcaaca ttcacaccga cggcggcatc 480 atccccagaa gtgtgcgtga cgtgatgaga agagcaggaa aacgaagagg caacaagaga 540 aaaacctaag agaggaggca acgaggcaca gagagtgtga acaacgaggt acagactgtg 600 cgaacggcga ggcacagaga gtgcaaaacc gagacgagag agcgacgctc acaccgacgg 660 cggcatcaac cccaaaagtg cgtgtgacgc gacgatcgag agcagagcag tgacgaggca 720 gcaagacaca aagagtgttc aagagagcag cactcacaca gacgacggca tcaaccatcg 780 caagtggaag tagcaaaacg tagagtgcgt tattgaggcc gacaagacac aaagagtatc 840 cagagagcag cattcataca aacgacgaca aagaatttcc agagagcaac gctcacaccg 900 acggcggcat caaccccaaa agtgtgtgtg acgcgacgat cgagtacaga ggcgtacgca 960 gcgattttgg cttacgagag ggacggcaac agagtcgaga tacaagagca agtattaact 1020 gagcaagaaa ggaccataac caaagagtaa aacatttgta taacattgta gattgatatc 1080 aattaactta actcactaac gttagcttaa gatacatgat caaaatttgt tatataattt 1140 gtttgaattt aaattatgct acacgacgaa atcttgcaac tcacggtatc tgagcttaaa 1200 gaaaagttga aagaaaataa cttgagtatt aagggaaaca aggcagagtt acaagtaaga 1260 ctaatatctt ttttcgagtc aggagttgag gaagcatcct tctacgagga ctcactagag 1320 cgtagcttac aagtagtaaa cgatcaacaa gaggtacaga gtataatggc gtttacacta 1380 agagatatac aagattcgtt aactgtattc gatggcagcc agcagcaaga tgtcgacaat 1440 tggcttgcag attttgagaa ctgtgcagag acagtcaaat ggaacgattt acagaaattc 1500 atttatagta gacaattgtt aaaaggggct gcgaaattgt ttgtcagtag cgaagacggt 1560 cttaccgatt ggattagtct aaagtcagct ttagagagag aattttcgga gaagttgtcc 1620 gcaaaacaag tacataagct actcgagaat agaaagaaaa aacagagtga aagtttaatc 1680 gaatattttt atgaaatgaa aagtttggcg aaaaagagca cattagacga agaaagtgtt 1740 atagaatata taatagaggg tataccagat tcgaagcaaa acaagactgt gttgtatcaa 1800 gccaggaatt ttagagaatt acgagaaaac atgaaaatgt acgagaaaat atctactgag 1860 aaagagtcaa aatggtctag agcatcaaaa ttcgagaaga agcagcaaga tattgctata 1920 aatgagccaa gatgctacaa gtgcggagga agaaatcaca tagcgaaaaa ctgtaaagag 1980 agcgagttta agtgttttaa atgtaacaag ccaggtcata aggcgaatca gtgtagcatg 2040 cagggaaaaa tgttcgaaaa agatagtcat aaagtgcagc aacttactca gaaattatat 2100 aatcgggtat ttaaagatgt tcgttttggc aatgaggtta tatcagcgtt ttttgattca 2160 ggtagtgata tctcgactat aagtgagagc gcttacaaac gagtttaccc agttcccttg 2220 agttcggatg tgaaagagtt attgggaata ggaagtaaga aaatctatac caaaggatct 2280 tttgagttag atacggcttt agatggtgtt cctattaaga tgtgtttcca tgtagtgaga 2340 gatagagata cattatatga tggggtaatc ggaaatgacg tatttgaggt cgtaaatgct 2400 acaatgggaa aacaaggatt agtgttttac gggaatactc gatcatttgg aggaaatcaa 2460 gttattactc gttacagtaa tccacaagat ccacccccag tacgcgatgt aagtcccata 2520 gaagagttgc aaaacacgtt cgggcagata ttgactacga gagtcgaaaa tttttcgtca 2580 gagatagagc tttctcattt gagcattgag ttgaagtcta aagtcctaga aatcatttcg 2640 gattatagtc cggtgaaacc agctaactgt ccagtgaaaa tgaagattat acttactgat 2700 gaaataccgg tatatcagaa accaagacgc ttatcgtatg aagatcagaa aagagtcgat 2760 caacaggtca aagagtggtt agaagaagga ataatccgcc atagtgtatc tgagtattct 2820 tctccaattg tattagtacc gaagaaagat ggagaaaaga gattatgttg cgattataga 2880 agagtaaatc agaaaattat aagagataat tttccgacgg cagtgattga tgatgtatta 2940 cataagctac aagaaggtag agtgtttaca actcttgatt tgtgtaatgg ctatttccat 3000 gtgccagttg aagaaaactc aaagaaactt acttcgttcg ttactcaaaa tgggcaattc 3060 gaattcaatt ttgtaccgtt cggcattaat aattcagcag ctgtgtttac tagatatatt 3120 tttgcaatgt tgagaccgtt gattaatgaa ggcatattga tgttgtacat ggatgatatt 3180 attattccag caaaaaatga gtctgagggt attgagagat tgaaaagagt tataagttta 3240 gcagaagagt cgggattaaa gataaagtgg aaaaaatgtc aatttcttca gcgtaaggta 3300 aattttttgg gatacatcat agaaaatggc accatcaaac cgtcaaaaga gaagacaagt 3360 gccgttgaga attttccagt tcctcgggat gtgaaaggag tccaacgtta tttgggtctt 3420 acttcatatt ttcgcaggtt tatgaaggat ttcgctacga tagctaggcc gttgacaaat 3480 ttgttgagag ctgacgtacc ttttaaaatg ggcatagaag agctagcttc ttttgaacag 3540 ttaaaagcgg gattgagtaa tccaccggtt ttgcgtcttt ttaatcctag aagtgtgaca 3600 gaagttcatt gtgacgcgag tatgtatggg tatggcgcga ttttacttca aaaagattcg 3660 gaggatcaac aataccatcc ggtagagtac atgagcagac gaactacccc agctgaggaa 3720 aaatatcact cgtacgagtt agaggtactg gctataattc aagcattaaa aaagtggaga 3780 atatatgtga tgggcatcaa gataaagata gtaaccgatt gtaatgcttt tgcgatgacg 3840 ataaagaaac gtgacgttcc gctaagggtg gcccgatggg ctatattctt gcaagacttt 3900 aatttcgaaa ttgagcatag atcaggggta aagatgaaac atgtcgacgc attgagtagg 3960 gtatattgtt tactgtcggg agattctttg aagagtaaaa tacaatttgc acagagaaaa 4020 gatgaatgga tcagcacgat tctgaaagtg ttagaaaagg gagcgcacgc cgactattac 4080 gtgcagtacg gaatattgta taaagatccc gttaaggagc ttattgtaat acctactagc 4140 atggagcaag agataatatt ggttgctcat cgacaagggc acttcggagt taaaaaaacg 4200 gttgatctag tggaaagaga gtattatata cctagtctgt ggagcaaagt agaggtagta 4260 gtgaggtcgt gtatggaatg tatcgtgagc gaatcgaaac agggtcgaaa ggaaggtttc 4320 ttgaatgtta ttgataaagg agatgagccc ttagtgactt ttcatgtaga ccatgtcggg 4380 ccgatggagt taaccaagaa acgatataac cacatattgg tagtagtcga tgctttttcg 4440 aaatacgtat ggttatatcc gacgaaaagt actggttcgg aagaagttgt tgagagattg 4500 cagaaacagt cagagttgtt cggtaatcca aagaggatcg ttagtgatcg aggtagcgct 4560 tttacgtctc acatttttaa agagtattgc gagtcgcaaa agatacaaca tttagtgatt 4620 gcgacaggag ttccgagagg aaacgggcaa gtagagcgag ttaatcggat tgttatagca 4680 ttgttgacga aactgtgtgc cagtgatccc ggagcatggt ataagcacgt aggtagggtg 4740 caacagttta taaactcgtc accacctaga agcacgaaaa tatcaccctt taagatatta 4800 actggtactg agatgaggac tagctacgat ttagaactaa agagtatgtt agaagaagag 4860 ttattagttg agttacaaaa caataagtcc gaaatacgtg acacagcgaa gaggaacata 4920 agtaaaatgc aagaagagaa ccgtaaaacg tttaataaat gtcgaaaatc tgcaagtgaa 4980 tatgaggttg acgagttggt ggccgtcaag cggacgcagt ttggttctgg ccttaagctc 5040 aagggcaaat atttgggacc ttatcgggtt gttcggaaaa tgaggcatgg tcgctatgcg 5100 gttgagaaag taggagatgg agaaggtccg aagggtacta ccaccgtggc agagtatatg 5160 aaaaaatggt gtccatcatt cggggcgaat gtggaggtcg gatggccgaa tgtaggatcg 5220 cggtatggta gaggtgattt cgagtgagaa cgggtattga tgtactatcg attaaaaacg 5280 atatacgata gtgaggag 5298 // ID L1_Ele3B_AAe repbase; DNA; INV; 4345 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele3B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4345 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1400-1400 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >98% CC identity. CC The consensus is ~79% identical to L1_Ele3. XX FH Key Location/Qualifiers FT CDS 36..1094 FT /product="L1_Ele3B_AAe_1p" FT /translation="MRRENTFRIDYSCFPVKPSFEKVHGFCRSVLGLKKED FT VERLQCHKGEQCAFVKVSDLALAQKIVDEHDAHHEVDLNGKKHKLRITMED FT GSVEVKIHDLPENVSEEKIVDFLCGFGEVISIRELTWGEGYEFAGIPLGIW FT SARMLIQKNIDSWVTIDGQQAYIVYKGQLQSCKHCKEQAHIGISCVQNKKL FT LVQKSYANVAKQVVSARPPPKKSTGAKPPRQKPTGPNHPVPPSVTSDAFPE FT LPKPSSQPEQPASTSRIDLTTSPRPQTQRAHGSTSSLRTPQIEESPKASIV FT LVDCFKKPTSAMRSQSKSGNGNETDDSSTSTNSRRSARGRPPGKKPRREDG FT DDEQDEDYHP" FT CDS 1098..4277 FT /product="L1_Ele3B_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MALSSYNISSINIGTITNPTKLNALRNFISSQSLDIV FT CLQEVENDQLSLPGFVVYTNVDHTRRGTAVAVKEHIKVSHVEKSLDSRLLA FT LRVQDTTICNIYAPSGSAQRAAREEFFNGTLAYYLRHQTPHVILAGDFNCV FT LRACDSTSPNTSPALKTAVQQLQLHDVWEKLRPRDPGFTYVCRNAQSRLDR FT IYVSNGLRENLRTAHTHVCSFTDHKALTVRLCLPQLGHEPGRGFWSLRPHL FT LTDENVAEFQYKWQYWTRQRRNYNSWIDWWLTYVKPKIKSFFKWKSRIVFD FT EFHDQHQRLYAELRLAYDGYYQHPEMLPTINRLKGEMLALQRNFSHMFMRI FT NETYVAGEPMSIFQLGERRRRKTVITQLRTGENEIVDEPQAIEANLLNFFS FT SLYSEEVANNNEVGFACERVIPLNDPVNEACTSEITTAEILSAIRASAPRK FT SPGCDGIPREFYLRMFDVIHRELNLLLNEALAGNLPPAFVEGIIVLVKKKG FT SDNTARSYRPISLLNSDYKLLSRILKNRLESVMKAHGVLSDGQKCSNSERN FT IFQATLALKDRIASLRHHRRAGKLISFDLENAFDRVRHSFLFETMRLLGFN FT QELIALLSRIASRSSSRLLINGHLSRPFEIQRSVRQGDPLSMHLFVLYLHP FT LVCRLEQVCGDDLLVAYADDISVIVTSTAQIEAMNELFTRFELVAGAKLNL FT RKTVAINVGFCEGNEIDAHWLQTANTVKILGVVFANSIRLMTTLNWTAMMG FT KFSQQMWLQSLRTLTLQQKVIMLNTFGTSKLWYLSSVLPPLGVHTAKITST FT MGAFLWRGIVARVPIMQLARSKEHGGLNLHLPALKAKSLAINRHHQEIDSL FT PYYRSFLLHAHPRPAIPIDHPDIKIILSNYSQIPHLIQQNPSADLIHRHFI FT QQTELPKVERNNPACNWPRVWRNIADKKLTSTQRTHLYLLVNGKTEHRKPL FT FAMQRVADENCTYCGNQTTETLQHKFSTCARVGPAWTSLQQRLAGLINGWR FT RLTFEDLARPALAGLGRPTRVKILRSFINYICFVNSCNDRIDVNALNFHLD FT LEF" XX SQ Sequence 4345 BP; 1186 A; 1166 C; 1031 G; 962 T; 0 other; atttcttttt gtgccgcgtc gcgtgtgtgg tcgcgatgag gcgagaaaac acttttcgaa 60 tagactattc gtgcttccca gtgaagccgt ctttcgaaaa agtgcacggc ttctgccgtt 120 cagtgctcgg gctgaagaaa gaagacgtcg aaagactaca gtgccacaag ggtgaacaat 180 gtgcgttcgt caaggtcagt gacctggcgc tcgcacaaaa aatcgtagac gaacatgatg 240 cccaccatga agtggatctg aacgggaaga aacacaaact tcgtatcacc atggaagatg 300 gtagtgtaga ggtgaagatt cacgacttgc ccgaaaatgt gtccgaagag aagatcgtcg 360 atttcttgtg cgggttcggc gaagtgatct ctattcgaga gttaacgtgg ggagagggct 420 acgagtttgc cggtatacca ctcggcatat ggtcggcccg tatgttaata caaaaaaaca 480 ttgactcgtg ggtcaccatc gacgggcagc aggcatatat tgtctacaaa gggcagctgc 540 agtcttgcaa acactgcaaa gaacaggcac acattggcat ttcttgtgtc caaaacaaga 600 aattgctggt gcaaaagagc tacgcaaacg tagcgaagca agttgtatcg gcacgaccac 660 caccgaaaaa gtcaactggc gcgaagccgc caagacaaaa gccgaccggg ccgaatcatc 720 ccgttccccc atcagtaacg tcggatgcct tccccgagct cccaaaacct tcgagccagc 780 ccgaacagcc tgcctcgaca tctaggatcg atttgacgac gtccccccgc ccgcaaaccc 840 aacgcgcgca tggttcgacg tcatcgctcc gaacaccaca aatcgaagag tcaccaaaag 900 ccagcattgt tttggttgac tgcttcaaaa agccgacgag tgcgatgcga tcgcagagca 960 agagcggcaa tggcaacgaa accgacgatt cttccacttc cacgaacagc agacgaagcg 1020 cgcgaggtcg accacccggc aaaaagcccc gtcgggaaga tggtgacgac gagcaggatg 1080 aggactatca cccataaatg gctctctctt cctataatat ctcgtccatc aacatcggca 1140 cgatcaccaa ccccacgaaa ctaaacgcgc tacgtaactt catcagcagc cagagtctcg 1200 atattgtgtg tctgcaagag gtggaaaacg accagctctc cttgcctggc ttcgtcgttt 1260 acaccaatgt agaccatacg agaagaggta cggccgttgc ggtgaaggaa cacattaaag 1320 tctctcacgt cgagaagagt ttggatagtc gactgcttgc gctgcgagtg caagacacaa 1380 ccatctgcaa tatttacgct ccctccggct ctgcgcaacg ggctgctcgg gaggagttct 1440 ttaatggaac tctcgcctat tatctccgtc accaaacccc acacgtaatt ctcgctggcg 1500 acttcaattg cgtattgcgc gcatgcgact cgactagccc caatacaagc cctgctctaa 1560 agacagccgt gcaacagcta cagctgcatg atgtgtggga aaaactgcgc ccacgagacc 1620 ctggcttcac ctacgtctgc cggaacgcgc aatcgcgact cgaccgcatt tacgtcagta 1680 atgggttgcg agaaaatctg cgaactgcgc acactcacgt gtgttcgttc acggaccaca 1740 aagcgctaac cgttcgacta tgccttcccc agctcggaca tgagcctggg cgtggattct 1800 ggtctctgcg gcctcacctt ctgaccgacg aaaacgttgc ggagttccag tataagtggc 1860 aatattggac ccggcagcgt aggaactaca actcatggat cgattggtgg ctcacgtacg 1920 ttaaaccgaa aattaaaagc tttttcaagt ggaaatctcg aatcgttttc gacgaattcc 1980 acgatcagca tcagcgttta tacgctgagc tacggctggc gtacgatggg tattaccaac 2040 atccagaaat gctacccaca attaaccggc tcaaggggga aatgttggct ctgcaaagaa 2100 acttttccca catgttcatg cgcatcaatg agacctacgt ggcgggtgaa ccgatgtcca 2160 tctttcagtt gggggagagg cgacgtagga agaccgtcat cacacagctg cgaacgggag 2220 aaaacgaaat cgtcgacgaa ccacaagcga tcgaggcaaa tttgctaaat ttcttctcta 2280 gcctctactc ggaagaagta gcaaacaaca acgaggttgg gtttgcctgt gaacgtgtca 2340 tcccactgaa cgacccagtg aacgaagcat gcacaagcga gattaccacg gcagagattt 2400 tgtctgcaat tagagcgagc gcaccgagga aatccccggg ttgcgatggc atcccacgag 2460 aattttatct ccgcatgttc gacgtcatcc accgagagtt gaaccttttg ctcaacgaag 2520 cactcgccgg caatctcccg cccgcgtttg tggagggcat tatcgtcctc gtgaaaaaga 2580 aaggaagcga caacacggcc cgatcatacc ggcctatctc gctgctcaac agcgactaca 2640 agctgctatc acgcattctc aaaaacaggc tcgagagtgt gatgaaagca catggcgttt 2700 tgagcgacgg acagaaatgc tcgaactcgg agcgtaatat ctttcaagcc actctcgctc 2760 ttaaagatcg aatagcaagc ctacgtcacc accggcgcgc cggtaagctc atcagcttcg 2820 atcttgagaa cgctttcgat cgggtccgtc actcttttct cttcgaaacc atgcgcttgc 2880 tcgggtttaa ccaggaactc atcgctctgc tctctcgtat cgccagccgg tcatcctctc 2940 ggctgctcat caatgggcat ctctctcgcc cgtttgagat acaacgttcg gtccggcaag 3000 gggacccctt gtcaatgcac ctcttcgtgc tgtacctcca tccactggtg tgtaggctcg 3060 agcaagtgtg tggcgacgat ctcctggtgg cgtatgcgga cgacatcagc gtcatcgtaa 3120 catcgacggc gcaaatcgaa gcgatgaatg aattattcac tcgtttcgaa ttagtcgccg 3180 gggcaaaatt aaacctacgg aaaacggttg ccataaatgt tgggttttgc gaaggcaacg 3240 aaattgatgc ccattggctg caaacagcca acactgtgaa aattttgggt gttgttttcg 3300 caaactcaat acgtctaatg acaaccctta actggaccgc gatgatggga aaattttcgc 3360 agcaaatgtg gctgcaatcc ttgcgcacgc tcacgttaca gcagaaggtc atcatgctga 3420 acacctttgg tacctcaaag ctgtggtacc tttcgtcggt gctaccacca ttaggagtgc 3480 acacggcgaa aattacctcc acgatgggtg cgttcctgtg gagaggaata gtcgcccgcg 3540 tcccgatcat gcagctagct cgcagcaaag agcacggtgg actgaatctg catttgccag 3600 ccttgaaggc gaaatctctc gccatcaaca gacaccatca agagatcgat tcccttcctt 3660 actacagatc ctttcttctc cacgctcatc cccgcccagc aattcccata gaccatcccg 3720 acattaaaat aatcctttca aattattccc aaattccaca cctcattcaa caaaacccct 3780 ccgccgatct catccatcgg catttcattc agcaaacaga attgcccaag gtggaacgca 3840 acaatccagc atgcaactgg ccacgcgtgt ggcgaaatat agcagacaag aagctaacat 3900 caacgcagcg cactcatctc tatctgctag taaatggcaa aactgaacat cgcaaaccgt 3960 tattcgcgat gcagcgagtg gcagacgaaa attgcacgta ttgtgggaac cagacgactg 4020 aaactctcca acacaaattc agtacctgtg ctcgtgtcgg cccggcatgg acgagtctgc 4080 agcagaggct tgcgggactg ataaatggat ggagacgact cacttttgaa gacctggcga 4140 gacctgctct ggcaggatta ggacgtccaa cgagagtgaa aatattgcgc tcatttatta 4200 attacatctg ctttgttaat tcatgtaacg atagaattga tgttaacgct ttgaattttc 4260 acttagattt agaattttaa aactgtattc ccaatgaatt gaccacaata aaacctattt 4320 ttaaaccaaa aaaaaaaaaa aaaaa 4345 // ID TTAA20_AP repbase; DNA; INV; 663 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA20_AP. XX NM TTAA20_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-663 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2088-2088 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 663 BP; 219 A; 112 C; 108 G; 223 T; 1 other; ggcccctgca aatgtgtcat ttttaaggag ttgtgtttag gcaattattt cgtttctgcg 60 attactgatc gtcttagtgt gtgtagtaaa tgatcattag tgtgtgtata ctcccctacc 120 atcacccgcg gaccgcatca atttacaata caagaattat atcacccgcg acccgcatca 180 atttacaata cattgttatt tcttatctgg tttgtggtcg cacgcacgca catgcacaag 240 tcatataatg tctatatcaa aatattcaaa accaatntat cgaaatattc aaaaccaatt 300 tatggacgtg aaatagccgt cgcactgaca aaaattaagg tacttctagt ggttcgattt 360 aaaaaatgta aagatgtttg tattggtaaa caagtttact ttgcatcggt cggagcactt 420 tttcaatttt agcaaattct tgacgagaat aaatatttta cattttataa atttataaat 480 ttttagtatt gaatatattc aaaaatataa aaaaataaaa aatatgctcc gaccgatgca 540 aagtagactt gttgaccaat taaaacatct ttacattttt taaatcgaac cactagaagt 600 actttaattt ttgtcagtcg ctaaggtcgt tttgatgaaa aaataatagg gtttgcaggg 660 gcc 663 // ID TransibN4_DP repbase; DNA; INV; 400 BP. XX AC AADE01000519; XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 13-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE TransibN4_DP is a nonautonomous DNA transposon - a fossilized DE copy. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; TransibN4_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-400 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR GenBank; AADE01000519; Positions 45354 45753. XX CC TransibN4_DP belongs to the TRANSIB family of DNA transposons. CC This element is characterized by the CACTG target site CC duplications. CC TransibN4_DP has imperfect 36-bp terminal inverted repeats (6 CC mismatches). XX SQ Sequence 400 BP; 140 A; 64 C; 82 G; 114 T; 0 other; cactatggga aattgtgaca ctttttgtgg ccaaaaaagt ctttaaaata ataggtttta 60 agccgataaa caatacatac aagaaattgt aacagagcgc gccactgttt agtgaacagc 120 tgacaaaaaa cagctgtttt ctagcagtgc taaagcagcg gttgtacaat agattcaaat 180 cagagaggta ttaaacatgt atctgtgacg ttgaactgta gaactgtggg ccttatatgc 240 ggtgcaattg ttaacgagta aaattttgtg acgaaaatta gtatttgaag tagcccaaaa 300 aagtgggttc accccaaaag aatttcctct tgcaaaaaat gtcgtttcac ctggaaatga 360 ttaattattg ccacagaaag tgttgaaatt tcccatagtg 400 // ID CR1-51_HM repbase; DNA; INV; 4322 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-51_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4322 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1879-1879 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 73..834 FT /product="CR1-51_HM_1p" FT /translation="METSFKNIEKLIIKKLEEQKICILQETEKLLKEQEKS FT FASIMSANLKILTERIEKIEIYVTNNKSNIMNIEKDLNDVKTTLNFQETNL FT IDKIKQIRKCYDNEVNMLTKKTIDLENRSRRNNLRIDGVKEKAGETWTECE FT DTVKDIFKNQLKINSEVVVERAHRVGKTKDSKIPRTIVLKLLNYQDKNKIL FT NAVKNLKGTGVFINEDFAKETIESRKKLWEEVKRLRGEGKYAIIKYDKIFC FT REFKNPIPNFKS*" FT CDS join(893..3958,3943..4173) FT /product="CR1-51_HM_2p" FT /translation="MAGISDFESMRFNFNEINDVNIDPDINYSYEALTDCS FT YHFPSDLEGFLFKNVYDKCLNKIMILHVNIRSLNNNFEKLLNLLEETKNRF FT NIVCLTETWISNNIDNTSNFYIPHFKLISFKRQVNKRGGGVLIYVNENIVY FT YVRNDLSASDSDKEILTIEIMNNQSKNILLSCCYRPPDGVSENLSIFFEQS FT IFKKGIKENKKNFIIGDLNMNCFLYNEDNKIKNFYDSFFEAGAIPLINRPT FT RVTKNSTSLIDNIITTDIFNNNIQTGILKSDITDHFPIFLTINSEIKNKIN FT KIKIRIFNHTNIKLFKNQLSLLHWKHINFNDNADKIYDNFYETFYSVYDAN FT FPIIEKTITPKYKNNPWITKGFKKSSKIKQKLYIKYLKTKSLDNEKIYKNY FT KYLFEKVRKNLKKNYYTRLLDKFKNNTKRNWQIMNEITGRQKKCSGSLPQM FT VIVDNKRICEPKAIAHEFNKFFTDIGPKLANKIPYTKATFNDFLVQMDNCI FT SSNELYSELTFYEFEKAFKSLKKNKANGADDINGNIVIECFEQLKDILFKV FT YGASIHQGVFPDQLKIAKITPILKEGDQTNISNYRPISVLCTFSKILERIM FT YNRLYNYLLSNNLLYANQFGFKKNNSTEHAIIQLVREISNSFEKSKFTLGV FT FIDLSKAFDTVDHEILIQKLKYYGINYKVIKWFRSYLSNRKQFVFSNSCYP FT NEFLNVSCGVPQGSILGPLLFLIYINDLNKASNLMSIMFADDTNLFLSDND FT IYKLFSSMNDELKNISKWFICNKLTLNSNKTKWILFHPIAKKPYLPQNLPE FT IFIDNTVIKRHYVTRFLGVFLDENITWKAHIDYISTKISKSIGILYKSRTY FT LCKKDLTQLYYTFIHTYLNYANIAWGSTEKSKLKCLYRRQKHAIRLINFAD FT RYTHSKPFFIEMKILNIYELNVFNVLCFMYLWKNDLSLPIFKVLFCEKPIN FT KYTLRNNKFLHKPFCRTKFNQYGIAYRAPYLWNKVVLPNFNIFYTFPIFKH FT KLKNLILFSDDIIRFFHNKVLLIVIMYLYLELFNCKIRFMLIVKYVICNDS FT FCFTFLFVLLFICNDSFCFTFYLFVLLFTFTVYFINRLYTL*" XX SQ Sequence 4322 BP; 1716 A; 626 C; 588 G; 1392 T; 0 other; attttttttc aagcagcgtg ttgaagcgaa cagacgcgtt ttttctacag cgtgttaaat 60 tttaaattaa acatggaaac ctcatttaaa aatatagaaa aattaattat caagaagttg 120 gaggaacaaa agatatgtat attacaagaa acagaaaagt tgttgaaaga acaagaaaaa 180 agcttcgcgt cgataatgag tgcaaacttg aaaatattaa cggaaagaat agagaaaatt 240 gaaatatatg taacaaataa taaatccaac attatgaata tcgaaaaaga tctgaacgat 300 gtcaaaacaa cattaaactt ccaagaaaca aaccttatag ataaaataaa acaaattaga 360 aagtgttatg acaacgaagt aaatatgttg accaaaaaaa caatcgatct cgaaaacaga 420 tctcgacgaa acaatctgcg aatagatgga gtaaaagaaa aagcaggtga aacctggact 480 gaatgcgagg atacagttaa agatattttt aaaaaccaac taaaaataaa tagcgaagtt 540 gttgtagaaa gagcacatag agtaggcaaa acaaaagata gtaaaatacc tagaactata 600 gttctgaaac ttttaaatta tcaagacaag aacaagattc ttaacgctgt taaaaacttg 660 aaaggtactg gtgtattcat caacgaagat ttcgcaaagg aaactatcga gagtcgcaaa 720 aagttgtggg aagaagtcaa acgtctacgc ggtgaaggaa agtacgccat tatcaagtat 780 gataaaattt tttgtagaga atttaaaaat cccatcccta actttaagtc ctaaaaagaa 840 gcctttctaa aagtaaatgc tcaaataaaa ttatatttaa aaaaaaaaaa gcatggctgg 900 aatcagtgac tttgaatcca tgcgttttaa ctttaatgaa ataaatgatg taaatataga 960 ccccgatata aactattcct acgaggcgct tacggattgc tcatatcact ttccaagcga 1020 tttagaaggg tttcttttta aaaatgttta tgacaaatgt ttaaataaaa ttatgatact 1080 acacgtaaac attcgaagtc ttaataacaa ttttgaaaaa cttttaaact tgttagagga 1140 aacaaaaaat cgttttaaca tagtttgttt aactgaaacc tggatttcaa ataacataga 1200 taatacttcg aacttttata ttcctcattt taaattaatt tcgtttaaaa gacaagtaaa 1260 taaacgcggc ggaggagtcc ttatatatgt gaatgaaaat attgtttatt acgttaggaa 1320 tgatttaagt gcttctgata gcgataaaga aattttgact attgaaatta tgaacaatca 1380 atctaaaaac atattattaa gctgttgtta tcgcccacct gacggcgtga gcgagaactt 1440 gagcattttt tttgaacaaa gtattttcaa aaaaggtatt aaagaaaaca aaaagaactt 1500 cattattgga gacctaaata tgaattgttt tctttataat gaagataaca aaattaaaaa 1560 cttttacgac tccttttttg aggcgggagc aattccttta ataaacaggc caactagagt 1620 aacaaaaaac tcgacgtctt tgattgataa tattattaca acagatatat tcaacaataa 1680 tattcaaaca ggtatcctaa aatctgatat aactgaccat tttccaatat tcttgacaat 1740 taattctgaa attaaaaata aaataaacaa aattaaaatc cgcattttta accatacaaa 1800 tataaagttg tttaaaaacc aactatcgtt actgcactgg aagcatatta acttcaatga 1860 caatgcagac aaaatttatg ataactttta tgaaaccttt tactccgttt atgacgcaaa 1920 ctttcctatt attgaaaaga caataactcc aaaatataaa aataacccct ggataacaaa 1980 aggatttaaa aaatcatcca agattaaaca gaagctgtat ataaaatatc ttaaaacaaa 2040 atcattagat aatgagaaaa tatataaaaa ttacaaatat ctctttgaaa aagttcgtaa 2100 aaatttgaaa aaaaattatt acacaagact tctagataaa ttcaaaaata acacaaaacg 2160 caattggcaa ataatgaacg aaattactgg cagacaaaaa aaatgctcag gttctcttcc 2220 ccagatggtt atagtggata acaaacgcat atgcgaacca aaagctatag ctcatgagtt 2280 taataaattc ttcactgaca ttggtcctaa actagcaaat aaaattcctt ataccaaagc 2340 aacatttaac gattttctag tacagatgga taattgcatt agttccaacg aattatattc 2400 tgagctaact ttttatgaat tcgaaaaagc attcaaatcc cttaaaaaaa acaaagcaaa 2460 cggagcagat gacataaacg gtaatatagt tatagaatgt tttgaacaat taaaagatat 2520 actttttaag gtttatggag catctattca ccaaggagtt tttccggatc aattaaaaat 2580 tgctaaaatt accccaattt taaaagaggg agatcaaaca aatatcagta attatcgtcc 2640 tatctccgtc ctctgcacat tctcaaaaat actagaacgc attatgtata atagattata 2700 caattatctc ctttctaata atttgttata cgccaaccag tttggtttta aaaaaaataa 2760 ttcgacggaa catgcaatta tccaacttgt acgtgaaatt tcaaattctt ttgaaaaatc 2820 taaatttaca ttaggtgttt ttatcgacct atcgaaggcg ttcgatactg tagatcatga 2880 aattttgata caaaaactga aatactatgg aataaattac aaagttataa aatggttccg 2940 aagttattta tctaaccgta aacaatttgt ttttagtaat agttgttatc caaatgagtt 3000 cctaaatgtt tcatgtggcg ttccccaagg ttccattttg ggaccactgt tgttcttgat 3060 ttatataaac gacctaaata aagcctcaaa tttaatgagt ataatgtttg ctgatgacac 3120 caacttattt ctttccgata atgatattta caaactcttt tctagtatga acgatgaact 3180 taaaaacata tcaaaatggt ttatatgcaa taagctaact cttaacagta acaaaacaaa 3240 atggattctc ttccatccaa tcgcaaaaaa accttattta ccccaaaatt taccagaaat 3300 ctttattgat aatactgtga taaaaagaca ctatgtcaca aggtttttgg gtgtttttct 3360 tgatgaaaac atcacatgga aagcgcatat tgactatatt agcacaaaaa tttctaaaag 3420 cattggaatt ttatataaat cgagaacata tctatgtaaa aaagatttaa cccaactata 3480 ctacacattt attcatacct atttaaatta tgcaaatatt gcttggggaa gcacagaaaa 3540 aagcaaatta aaatgtcttt accgccgtca gaagcatgcg atccgtttaa ttaattttgc 3600 ggatcgatac actcattcca aacctttttt tattgaaatg aaaattctta atatttacga 3660 acttaatgtg tttaatgttt tatgctttat gtatttgtgg aaaaatgact tatccctacc 3720 tatctttaaa gttctctttt gtgaaaaacc aataaataaa tatactctta gaaataataa 3780 atttttgcat aaaccttttt gtcgaacaaa gtttaaccag tatggtattg cttatcgtgc 3840 gccatatctt tggaataaag ttgttttgcc aaactttaat atattttaca cttttccaat 3900 ttttaaacat aaactcaaaa atttaatttt attttccgat gacataataa ggttcttcta 3960 attgtaatta tgtaccttta tttagaactt tttaattgta aaatacgttt tatgttaatt 4020 gtaaaatacg ttatatgtaa cgactctttt tgttttactt ttctttttgt tttacttttt 4080 atatgtaacg actctttttg ttttactttt tatctttttg ttttactatt tacttttact 4140 gtttatttta taaatagatt atatacattg taaataatta tgctaaatgc ttgtaaaagg 4200 ttctgatgat aagatcagta cgatcttctt tcagaaacct tgtttgtatt tgttaaagta 4260 ttgtattacg acaaatgtaa acttaaaatg taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4320 aa 4322 // ID Polinton-10_NVi repbase; DNA; INV; 5601 BP. XX AC . XX DT 02-JUL-2009 (Rel. 14.07, Created) DT 02-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-10_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5601 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1552-1552 (2009). XX DR [1] (Consensus) XX CC Both ends are incomplete. XX FH Key Location/Qualifiers FT CDS join(704..2566,2406..3716,3571..4425) FT /product="Polinton-10_NVi_1p" FT /translation="QTIKWLWGSLRHAYNGDADVEDIQRWSFLTSKALEGA FT EEISSGPALTREEKQKCQTILRHLHTIQDKLLDLAKRGGDLRVDRSADPPK FT VKWDDFQSAFKTRIRSGVITNLGHLDIVSFFNDVLPLFEEKIKVALSVYDA FT VKVNTELAAEYAIQKNDSESLEVKYFNTKNAPIHQTTDLTKWFEENVQRPV FT QTGMEEFQEKDSGWTLRQILSLTVNVNKFKPMKGSSYIELPEVIKKKQACV FT NVQNRDNECFKWAILSALHPITTHAERVSHYKDFKDELSFKGIEFPLQPKD FT ISKFERQNNISVNLYILKKRGERYEVSPCHVTAEKKEKHVNLLLIQDYYVN FT EEEEEEEEEERPLPKFHYVWIKTLSRLVYSQLSQHREKKHICDRCLHFFRS FT EDNLKEHEVDCVQVNKCKVKLPTEKEKILKFENFKHGERVPYVIYADFECL FT LKPTQDKYAYQEHQAYSIGYYLKSSFNDNHSGYRSYRQEGEEDGQTPAQWF FT VAELEAVKFKMEEVYGKPKPLVLTEAEEQAFSLATSCHICKKPFSKQQTKV FT RDHCHLTGRYRGAGHQGCNLNFKDSRFIPVIFHNLTGYDSHFIIKEIATAA FT NFKGRVNLIPENKDLLHEVYRWKTSRTPASSRSSSTTSPDTTHTLSSRRSP FT PLQTSRDVSTSSLRTRISFTKFIDGSEISFRFLDSLRFMASSLEKLASYLD FT HLDIAKRVFEEDGYSSEQMELLKRKGVFSYDYVSSFDKLKETKLPSQTDFF FT SQLTESGIEDADYAHAQNVWQRFNIASLGEYSDLYLKTDVLLLADVFENFR FT STCLEAYGLDPAHYYTTPGLTWDAMLKHTKVELELLTDIDMLLFVERGIRG FT GVSQCCNRYGKANNQYMSEGYNPLEESKYLMYYDVNNLYGWAMTQALPYRG FT FQWVNNMDTRAISQIDDNDQYGYILEVDLEYPEXLHDAHRDLPLCPEHRAA FT PGSKQQKLMTTLHNKERYVIHYRALKQALSHGLVLRKIHRTLRFEQKPWLK FT PYVDLNTEKRKQAKNEFEKLFFKLLINAVYGKTMENERKRVDVKLVNKWEG FT RTSTPRRGSKRRTSLRSSSSSFSSTQSTARPWRTSAREWTSSSSTSGKVAE FT AIXLARTEIKIRKPIYVGLSVLDLSKINVYEFHYSYMRERYGVGCKLLYTD FT TDSLVYEVRGTDVYEAMKQDIARFDTCDYPEDNKFGMPRQNKKVLGVMKDE FT CAGSVMVEFLGLRSKMYCVRIEDPIKKAKGVKGTVVKNRIDREDYHRCLFD FT REILVRQQQNIRSRKHVVTTEKQEKVALSPHDDKRHLVFGETDTLPWGHYS FT LRACEAALAEGAKICPCNEEESASKRPRLE*" XX SQ Sequence 5601 BP; 1571 A; 1314 C; 1357 G; 1356 T; 3 other; tgaggtacca ccattcttgt ttacctctcc tcaacttcct gcttttattt ataatatatt 60 acatatattt ttatttattt tagaagtcac tctttcctga tcttccttag atgtatccac 120 tatgtcctga ctatctcgat accgctgtaa taaaaaattc ttcatggaat gaacatttcg 180 tagggcgcaa acaggattcc agcatagagg atgatggttg cacagcgtgt acacttcctt 240 ccgaaggttg gttaaagctg gacatccgaa gtcttctaag ttgtaaataa tactgtaaac 300 aaaaagatat gttttagtat tttattatta tttaaatagt aataattata tacttacctg 360 tcgtctggta acactttcct cagtagcaat aagtccttgt agcggatctc ccccgcgctc 420 cagggtatcc cgtggaagtt cctctccaac cagcgatttc tgcaccgact ccccgatgtt 480 agactggacc acttacaata tggttcgaat aagaatatgg aaggagtcgc atcttcttcg 540 agagcaagga tcgcaacctc cttgaagatg aactcattgg atgatgtctt gaagccttga 600 acgtccacca cgtactccat ctcgaagtct actaatcctc tcgagcacta gcgcatctta 660 actaaatact aatttaaact tttattacag atggcgacca tagcaaacca tcaagtggtt 720 gtggggatct ctacgtcatg cttacaatgg tgatgctgat gtagaggata tacagaggtg 780 gtcattccta acctctaaag ctctggaagg agctgaggag atttcaagtg gaccagcctt 840 aaccagagag gagaagcaaa agtgtcaaac aatcttgaga catttgcata ccattcaaga 900 caagctcctt gatctggcga agagaggtgg tgatctacgc gttgatcgat cagctgatcc 960 ccccaaagtt aaatgggatg attttcaatc agctttcaag acacgaatca gatcaggtgt 1020 tattaccaac ctcggccatc ttgacatcgt ttcattcttc aacgatgttc tccccctctt 1080 cgaggaaaag atcaaggtgg cactctctgt gtacgacgct gtcaaggtta atactgagct 1140 tgcagccgag tacgccattc agaaaaatga ttcggagagt cttgaagtga agtacttcaa 1200 tacaaagaac gctcccattc atcagactac cgacttgacg aagtggttcg aagaaaacgt 1260 gcagcgacct gtccagacag gaatggagga attccaggag aaagattccg gttggactct 1320 tcgtcaaatc ctgagcttga cggtaaacgt aaacaagttc aaacccatga agggaagctc 1380 atacatcgaa ctaccggaag ttattaagaa aaagcaagct tgtgtgaatg tgcagaaccg 1440 ggataacgag tgcttcaagt gggccatctt atccgcatta catcctataa ctactcatgc 1500 agaacgcgta tctcattata aggatttcaa ggatgaactc agttttaaag gtattgagtt 1560 cccattgcaa cccaaggaca tttccaagtt tgagaggcag aacaacatct cggtgaatct 1620 ttacatcctg aagaagaggg gtgagagata tgaagtgtca ccatgccatg tgactgcaga 1680 gaagaaggag aaacatgtga atctgcttct aatccaggac tactacgtga acgaagagga 1740 ggaggaggag gaggaggagg agcgccctct acccaagttt cactatgtct ggatcaagac 1800 tctctcccgc ttggtgtata gtcaattgtc tcaacatcga gagaagaaac atatttgtga 1860 tcgttgctta cattttttcc gctcagagga taacttgaag gagcacgaag ttgactgtgt 1920 tcaagtcaac aagtgcaagg ttaaattacc gactgagaag gagaaaattc tcaaatttga 1980 gaatttcaag catggtgagc gtgttcccta cgtcatctat gctgactttg agtgtctctt 2040 aaagcctacg caagataagt atgcttacca agagcatcaa gcttacagca taggctacta 2100 tttgaagagt agcttcaatg acaaccactc tggttatcga agctacagac aagaggggga 2160 ggaggatgga caaacccctg ctcagtggtt tgtagctgaa cttgaagctg tgaaattcaa 2220 gatggaggag gtttacggta aacctaaacc gttggtgctc acagaggctg aggaacaggc 2280 gttctctctg gctacctcct gccacatctg caagaagcca ttctccaaac agcagacgaa 2340 ggtgcgcgac cactgccacc taactggacg ttaccgtgga gctggccatc agggatgcaa 2400 tctgaacttc aaggactccc gcttcatccc ggtcatcttc cacaacctca ccggatacga 2460 ctcacacttt atcatcaagg agatcgccac cgctgcaaac ttcaagggac gtgtcaacct 2520 catccctgag aacaaggatc tccttcacga agtttatcga tggaagtgag atcagcttcc 2580 gctttctgga ctccctccgc ttcatggcat cctcccttga gaagctagct tcttacctcg 2640 atcatctcga tatcgctaag cgagtattcg aggaagatgg ctactcgagc gagcagatgg 2700 agcttctcaa acgtaaggga gtattctcct acgactacgt ctccagcttt gacaagctaa 2760 aggagacaaa gctcccctct caaaccgact tcttcagtca gctgactgaa agcggtatag 2820 aggacgccga ctatgcgcat gcgcagaatg tctggcagcg cttcaatatc gcgagccttg 2880 gcgagtactc cgatctctac cttaagacgg atgtactcct gctcgcagac gtgttcgaga 2940 acttccgctc gacatgtctc gaagcgtacg gtctcgaccc cgctcattac tacaccaccc 3000 ccggcctcac ctgggatgct atgcttaagc ataccaaggt cgagctagaa ctcctcacgg 3060 acatagacat gctcttgttc gttgagagag gtatccgggg tggtgtaagc cagtgttgca 3120 atcgctatgg aaaagcgaac aaccaatata tgagcgaggg ctacaaccca ctggaagagt 3180 cgaagtactt aatgtactat gacgtcaata acctctatgg ttgggctatg actcaggccc 3240 taccatatag aggcttccag tgggtcaaca acatggacac tcgtgccatc tcccagatcg 3300 atgacaacga tcagtatggm tacatcctgg aggtcgactt ggagtacccg gagawcctcc 3360 acgatgcgca ccgagacctg ccactctgcc ccgaacatcg agctgcgccg ggctcgaaac 3420 agcagaagct catgaccacc ctgcacaaca aggagcggta tgttatacac taccgcgcct 3480 taaagcaggc tctgagtcat gggctggtcc tacgcaagat ccacaggacc ctgcgtttcg 3540 agcagaagcc atggctcaag ccctatgtag acctcaacac cgagaagagg aagcaagcga 3600 agaacgagtt tgagaagctc ttcttcaagc ttctcatcaa cgcagtctac ggcaagacca 3660 tggagaacga gcgcaagaga gtggacgtca agctcgtcaa caagtgggaa ggtcgctgag 3720 gcgatcmagc tcgccaggac ggagatcaag atccgcaagc ccatctatgt gggtctctcc 3780 gtccttgatc tctcaaagat caatgtgtac gagtttcact actcgtacat gcgcgagcgc 3840 tacggtgtgg ggtgcaaact cctatacacc gacaccgata gcctcgtcta cgaggtacga 3900 ggtaccgacg tctacgaagc gatgaagcaa gacatcgcga gattcgacac ctgcgactac 3960 cctgaggaca acaagttcgg gatgccacgc cagaacaaga aggttctagg cgtgatgaag 4020 gatgagtgcg ctggcagcgt gatggtggag tttctgggcc tgcgcagtaa gatgtactgc 4080 gtccgcatcg aggatccaat caagaaggcg aagggcgtca agggcaccgt ggtgaagaac 4140 aggatcgacc gcgaggacta tcatcgctgc cttttcgatc gggagatcct cgtccgccag 4200 cagcagaaca tccgctcaag gaagcacgtc gtcaccacgg agaagcagga gaaggtcgcg 4260 ctcagccccc acgacgacaa gcgccatctg gtgttcggcg agaccgacac cctgccgtgg 4320 gggcactaca gtctccgggc atgcgaggcg gcattggcag agggcgctaa gatctgcccc 4380 tgcaacgagg aggagtcggc aagcaagaga ccgaggttgg aataggtaag tcgaaaatct 4440 gctttaccaa ctatacttcg atactaacct taccttttgt ttcagatgct gaatgagtag 4500 gaacaacaag ttggtgcttg ttcgagctct ctgcatgctc cgacctgctt gcagcttcct 4560 aattggtcaa gcggcggcgg cggctcgagt agtcgtagtg gggagaccta tacataaggc 4620 tgcgaggcag actgcagaga tcattctcac cagtcgactg acgagtgaac ataaccaggt 4680 attctcctcg tagaagaagc atcctcgtag aagattatcc tcgtagaaga cgcatcctcg 4740 tagaagatca tccttgtaga agacgtatcc tcgtagaaga tggatctatc agcattgaac 4800 aaggtggctg ctggaggctt cctgccgacg aagaaggttg tggagttgga gaaggatcat 4860 tcctacttgg taacagcttt gaaaacagtg aaaactagat atggtccgaa gagacgagta 4920 tcaactgctc tcaacaagga tgaggcctta ttcaatcagc tgagtgaagc tgcgaataag 4980 tttaagccta acctatcttg gagataacgg tgtgcagttc ggcaacgctt aagtgggtga 5040 agaaacggag aggtagttac aagttggttg tacctcacgg gtgggggcgt ggttggggcc 5100 cccaccacac cgaacaacaa gttgttaaag cttcaacctg ctgaatgctc tctccaaacg 5160 gttaaaagct agtgaagcag actgtggaga tcatttattg ctgaaatgga attcatgagc 5220 tggtctctct tggtgaatag ctatccactg ttaaaactga aagaagatta tacttacgtg 5280 ataacgtcgt tcaaacgtgt tagaactaag tatggtccaa aaatcttcac agttctcgat 5340 gagaagtttt cattgttgct tccgcatcat ctgtctacat gtttgttgaa tgatgatgtt 5400 ttgcttgatg aaatgatatc agttgcaaat caacgtaagt tgactctaac ttttaaggct 5460 gatagaactc tatctgcata tgagatgttc tggagcatag ttaatgatga agatctattc 5520 aacaagatgc taccacttgc aaaaagggtt agattaaacc tcaagctgta ttgaatggga 5580 aatcaatgga gaggtaaatt c 5601 // ID mTA_Ele45 repbase; DNA; INV; 1531 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; Nonautonomous; mTA_Ele45. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1531 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1531 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~99% identical to consensus. This consensus CC is ~98% identical to the original sequence in [1]. TA TSDs. TIRs CC are 24 bp long. XX SQ Sequence 1531 BP; 461 A; 299 C; 269 G; 502 T; 0 other; ggggaacctg gtccaattcg gaccctgttc taattcggac catcaagcat ttctcaatac 60 tggcgctact gtaaagatcg tgggtaccgt ttttctgctc accgtttggc ttagatctaa 120 cacaataaat ttgacgcctt tcagttatta cgtttgcttt ttatattagt ttattgtttt 180 gaagtgctca ggtgtactgt cggttggcgt tatttccaag cttctataag cgacaataat 240 tctccaatct ctctgtgtta ttttataatc gaatcaccct agcctaagct ttcatctgac 300 catatggttt tgatatgtct atgcaccgtt tgtgctcaac gagtttgaaa atctcaaaat 360 gtgtgttggt ccaatccgga ccttttgata tggttcaata cggacctctt gattttcaac 420 gtgtactcag tctattctaa gagctccacc ccgtgaaaat cttcccgact cgatcgaaaa 480 tcacagaagt tgtaatattg attaatagtg cctttgtttc tgttgcggtc aagaaaatgg 540 cttcgcaaaa aatctaagaa atgaggtaga ttaccaaaaa ctaaaggaat caataaatca 600 gctacttgaa aagcatctcc atcaatacca cagccaagct ttatccaaaa tagtttggca 660 atacgctata aaaaacgttt acgagtacta tataagaaca atacagtcaa ctatccctcg 720 tccgatttac tgtgtttttg ttcacaagct cgaaggaaag cgcagtccct tcaatatagt 780 gtactttaat ccataaatat aagtaggtac ttctcaaact tgaatttccg agacttcgag 840 tctgtttgtt gtctgtttca gtttcccata caagttcacc ttttcaaccc gatatttcat 900 agcaaagttt caatgtgtgg aaaattttac aatattcaac ttcaaaatta ctttttcgaa 960 tgtattttgt caaaataaat aaaaaaaagc tctaaaattt gaaaatatgt gtacttcatc 1020 tcgatagctc ccatacacga tagtctgttc aaaatcgagt caggaagagt tgactgtacc 1080 caattttgaa caagtttccg atatattttt taatgaaatt catatttcac ttgagcttgt 1140 tctgactcga attagtttga aatttctcac acattgagtt tattttttat gcttgcgctt 1200 gcagaaaaag ttcaataagc tttaatacca aacaaatctg agttatattt tgttactaac 1260 attgcaaata acgatgaaat atagggtggt ccgaattaga acatgggggg tccgaattgg 1320 aacagggttg gtccaattcg gaccttttgg agaagttggt tcaaacatgt ctagccatgt 1380 cctggtagga gctatcacaa aactctctga gagaaaaatg tgcgcgctga attgcgcttt 1440 cacgaaaaca taaaactttt catgccgttc atcgtgctga aaagatatgg ccaaaaaact 1500 gttactaggt ccgaattgga ccatgttccc c 1531 // ID BEL-79_AA-I repbase; DNA; INV; 2422 BP. XX AC supercont1.172; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-79_AA_; KW BEL-79_AA-LTR; BEL-79_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2422 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.172; Positions 668639 666218. XX CC 'GTTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 294..2420 FT /product="BEL-79_AA-I_1p" FT /translation="MSLSRTPPRRTDNMDNVRMLTKKRGHIKAKVTRISNF FT FDAAGEEERIIPLPVLRVHAKNLQVFYEEFNNIHDAIIAFVDEENVDFQEE FT KYVEFEELYNETLIKVEMMIEAIEKENQITLPIPVTSSSSAANHSSGQPII FT VNTQSYRIPLPTFDGRYEAWPRFKAMFQDMMQRSNDSDAVKLYHLETSLRG FT DAEGVIDIETLQNNNYARAWEILEERFGNQRVIIESHILGLLSMKKISKRS FT SKELRNLVDECSRHVDNLIKLDQQLTGMSQLFVVTLLARVLDDQTRELWEA FT SFNQSKLPEYEQMIDFLKQRCVILERCENSTPSSASKIPNPKVLSRSESYK FT SHVITTASEYACDVCSGQHQNCKCPAFQKMSIEQRQYKLKSANLCFNCLRK FT GHRSAACRSDKSCGKCSKKHHTLIHYEQRKSVFRPPSKQVFLMTAMVILRS FT ESGRTHQVRALLDSGSQVNLLSESVVKKLNLPKHPTNVPVVGVGGLRSQIH FT HQVAVEVTSKTSNFTTNIDCLVTPKITGTVPSVNVEVNSWRIPSGIELADE FT SFNYPSELEMLIGAEHFFEVLKQGQIKLADHLPTLYETQFGWVVAGSYKEI FT EKDPPVCCNMAVSDGMQTCIEQGKFAEPALMASAEEDNEGQFQRSYLCHED FT GRFVVQLPFRDSVDQLESSRSLALKRFLRLEKGLHRKSCWQGDNSEDLQPV FT LEFDSNGGQ" XX SQ Sequence 2422 BP; 690 A; 567 C; 581 G; 584 T; 0 other; ttggtccttc gaaccggatc tggactggag tactgctgct aagtaagtaa agcgttttgc 60 gtggctgaca acgcaaaaga gcgttcgcgt ggctttgaca acgcgaaaaa cccatagttt 120 gcttcgttgg tattccgttt tgcgtgactc actgcgactt gcgtgaaagt cgtttttccg 180 tcacgtgtcg tagttcatcg gaaggatatc ctgtgcgtgt gttccacctt cacgattgaa 240 gctcgaacgt gacccttgta cctgtccaac aagtcgaagc tttcgtgtga aagatgtcgt 300 taagccgtac cccgccccgg aggacagaca atatggacaa tgtgaggatg ctgacgaaaa 360 agcgaggcca tatcaaggcg aaggtcactc ggatctccaa cttctttgat gcagcaggag 420 aagaagaaag aataataccc cttcccgtcc tgcgagtgca tgcgaaaaat ctccaggtgt 480 tctatgaaga attcaacaac atccatgacg ccatcattgc cttcgtcgac gaagaaaatg 540 tcgatttcca agaagaaaag tacgttgagt ttgaagaact ctacaatgaa accctgatca 600 aggtggaaat gatgatagaa gcgattgaga aagagaacca aatcaccttg cctatacccg 660 tgacatcgtc atcgtccgca gcgaatcact cgtcgggcca accaataatt gtgaacactc 720 aatcctatcg cattcctcta ccgacctttg atggtcgtta cgaagcatgg ccacgcttca 780 aagccatgtt tcaagacatg atgcaacgat ccaacgattc cgatgccgtg aagctgtacc 840 atttggaaac ctcgttgaga ggcgacgctg aaggtgtcat cgacatagaa acgttgcaga 900 ataacaatta cgcaagagca tgggagattc tggaagaacg ttttggcaac cagcgagtga 960 tcatcgagtc ccacatactt ggcctcctga gtatgaaaaa gatttccaag aggtcttcaa 1020 aggaactgcg gaatctggta gatgaatgtt cccgccacgt cgacaatctc atcaagctgg 1080 atcagcagtt aaccggaatg tctcagctat tcgtggttac attgttggct cgtgttttgg 1140 acgaccaaac ccgtgagctc tgggaagctt ctttcaacca gtcgaaactc ccagagtacg 1200 aacagatgat cgatttcctg aagcaacgct gtgtgatttt agaacggtgt gaaaattcta 1260 ctccaagctc agcttccaag attccgaatc cgaaggtcct ttccagaagt gaatcctaca 1320 agtcacatgt cataacaacg gcttcagaat atgcgtgcga cgtttgttcc ggccagcacc 1380 agaattgcaa atgcccagct ttccagaaga tgagcatcga gcagcgccaa tacaaactga 1440 aatcggccaa tttgtgcttc aactgcctaa ggaagggcca ccgcagcgca gcatgccgta 1500 gtgacaaatc ctgtggaaag tgttccaaaa agcaccatac cttaatacat tacgagcaac 1560 gaaaaagtgt gtttcgtcca ccaagcaagc aagtatttct catgacagcg atggtcattc 1620 tacgttccga aagcggccgt acccatcaag ttcgagcatt actggattca gggtcgcagg 1680 tcaaccttct gtcggaatct gtggttaaga agttaaacct tcccaaacat ccaaccaacg 1740 ttccagtagt cggcgtaggt ggtctgcgat cccagatcca tcatcaagta gcagtggaag 1800 ttacatccaa gaccagcaat tttactacta acatagactg tctcgttaca ccgaagataa 1860 ctggtactgt tccgtccgtg aatgttgaag taaattcttg gcgtattccc tccggtatcg 1920 aactagctga tgaatcgttc aactatccaa gcgaattgga aatgctgata ggtgcggaac 1980 atttcttcga agttctgaaa caaggccaaa tcaagctggc ggaccacctg cctactttgt 2040 atgaaactca attcggatgg gtagttgcag gttcttataa ggaaatcgaa aaggatcctc 2100 cagtgtgctg caacatggcc gtatccgatg gaatgcaaac atgcatagaa caaggaaaat 2160 ttgctgagcc agcattgatg gcgagtgcgg aagaagacaa cgaaggacag tttcaacgat 2220 cctacctttg tcatgaagat gggcgattcg tggtccagct gccttttcgt gattcagtgg 2280 atcagctgga gagctccaga tcattggcat tgaagcgatt tctgcggtta gagaaaggat 2340 tgcatcgtaa gagttgctgg caaggcgaca acagcgaaga tctacaacca gtgttggaat 2400 tcgattccaa cggggggcag ta 2422 // ID LSU-rRNA_Mfr repbase; DNA; INV; 3565 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE rRNA from Cnidaria. XX KW rRNA; Pseudogene; LSU-rRNA_Mfr. XX OS Cnidaria OC Eukaryota; Metazoa. XX RN [1] RP 1-3565 RA Smit A.F.; RT "LSU-rRNA_Mfr - rRNA from Cnidaria."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC gi|15208488|gb|AY026375.1| Montastraea franksi 28S large subunit CC ribosomal RNA gene, partial sequence. XX SQ Sequence 3565 BP; 868 A; 832 C; 1069 G; 796 T; 0 other; aacgcagcca gctgcgataa gtagtgtgaa ttgcagaatt cagtgaatca tcgaatcttt 60 gaacgcaaat ggcgctcttg ggttctccca ggagcatgtc tgtctgagtg tcggatatca 120 tccatcgcgt gcacgcttgc cgtgcctgcg gcgttgaggc gtcacggccg acaccactgg 180 ccgtgtccct cgaaagacag agagactcgc tgtggctcgt gggtttccct gcagcacacc 240 ggcggaggct aaaagtcttg cttgtccttc gcgtgttggc cgggtccccg ccgcggcagt 300 cgcagaggcg aaccgacgct ggtcggtgca ctttcttgac ctcagatcag gcaaggctac 360 ccgctgaatt taagcatatt aataagcgga ggaaaagaaa ctaacaagga ttcccccagt 420 aacggcgagt gaagcgggaa gagctcaaat ttgaaatctc cgatgcttgc atcggcgagt 480 tgtagttgcg agaagcactt tctaggcgga tcggtcgtgc ctaagttgct tggaacagca 540 cgtcgcagag ggtgacaacc ccgtctgtgg cgcgaccggc cgctgacgat gtgctttcga 600 agagtcgggt tgtttgggaa tgcagcccaa aatgggtggt aaactccatc taaagctaaa 660 tattggcgtg agaccgatag cgaacaagta ccgcgaggga aagatgaaaa gaactttgaa 720 aagagagtta aaaagtacgt gaaaccgttg aaagggaaac gaatggactc agcaatgcgc 780 cctttgagat tcagccggtg ggtgggatcg gcctcgggtt tgcggatctg aatggaccgc 840 ggcccggtgc tcgtttctcc tggccgtcgc acttctcttg ggcgcgcgtc aacgtcggtt 900 gggactgggt ctgaaaggtg ccgggaaggt aggtgtgggc cttcgggtcc gttctgctac 960 agcccggttc tctcatggct cgggcccgac cgaggcgttg cagggcatgc ctcttgtggc 1020 tagggtcccc tccttccggc cggctgtcga atgtggtgga ctgcttgcag tgctcgacga 1080 acgctgccgg tcgttggggc ggtgatctcg taccatgccc ttaggatgtt ggcggtcata 1140 tgggtccatc cgacccgtct tgaaacacgg accaaggagt ctaacatgtg cgcgagtctt 1200 agggtgagtg aaaccccgag gcgcaatgaa agtgaaggcg gctccggccg ctgaggtgag 1260 atccgtggct cctcggggcc gcggcgcatc atcgaccgac ctattctact cttaggaagg 1320 tttgagtaag agcgtgtctg ttgggacccg aaagatggtg aactatgcct gaatagggtg 1380 aagccagagg aaactctggt ggaggctcgt agcgattctg acgtgcaaat cgatcgtcaa 1440 atttgggtat aggggcgaaa gactaatcga actgtctagt agctggttcc ctccgaagtt 1500 tccctcagga tagctggaac tcagtacgca gttttatcag gtaaagcgaa tgattagagg 1560 ccttagggtt aaaacaacct taacctattc tcaaacttta aattggtaag atgtccgact 1620 tgctcgactg aagccggact ttcgaatgcg agtacctagt gggccatttt tggtaagcag 1680 aactggcgat gcgggatgaa ccgaacgctg agttaaggtg ccaaagtcga cgctcatcag 1740 accccacaaa aggtgttggt tgctatagac agcaggacgg tggccatgga agtcggaacc 1800 cgctaaggag tgtgtaacaa ctcacctgcc gaagcaacta gccctgaaaa tggatggcgc 1860 tcaagcgtcg cacctatact cggccgtcgg ggcgaatgcc aagccccgac gagtaggagg 1920 gcgcggtggt cgtgacgcag cctttggcgc gagcctgggt gaaacggcct ccggtgcaga 1980 tcttggtggt agtagcaaat attcaaatga gaactttgaa gaccgaagtg gagaaaggtt 2040 ccatgtgaac agcagttgga catgggttag tcgatcctaa gagatagggt aattccgtgt 2100 caaagcgccc gatcctgggc cgcctatcga aagggaatcg ggttaatatt cccgaaccgg 2160 aacacggata ttgccacctg gtggcaggtg cggcaacgca accgaacccg gagacgccgg 2220 cgggagcccc ggaaagagtt ctcttttctt tttaacaggc tttcaccctg aaatcagatt 2280 gtctggagat agggtttaat gcctggtaaa gcaccacact tcttgtggtg tccggtgcgt 2340 tctcgacggc ccttgaaaat ccgggggaga caatgatttt cgtgtccggt cgtactcata 2400 accgcagcag gtctccaagg tgagcagcct ctggttgata gaacaatgta ggtaagggaa 2460 gtcggcaaaa tagatccgta acttcgggaa aaggattggc tctaagcgtt gggtctctcg 2520 ggctgagact tgaagcgggt ggagccggcc cggactggcc gaggctgacc tccgcgagcc 2580 ctcaaaagcg aacgggggaa ggccaaggtc ggaccgggaa ggtaccgccc gtggattggc 2640 ccagctatgg ccgtcaggtc aaatcgggag gcgacgaaca acgaacttag aactggcagt 2700 gactagggga atccgactgt ttaattaaaa caaagcattg cgatggccgg aaacggtgtt 2760 gacgcaatgt gatttctgcc cagtgctctg aatgtcaaag tgaagaaatt caaccaagcg 2820 cgggtaaacg gcgggagtaa ctatgactct cttaaggtag ccaaatgcct cgtcatctaa 2880 ttagtgacgc gcatgaatgg attaacgaga ttcccactgt ccctatctac tatctagcga 2940 aaccgcagca aagggaacgg actttgtaaa atcagcgggg aaagaagacc ctgttgagct 3000 tgactctagt ctgaccttgt gaaaagacat gagaggtgta gaataagtgg gagcagttcc 3060 tgcgccggtg aaataccact actcttatcg tttttttact tattggatgg agcggaggcg 3120 aaccgcaagg ttcactttct ggacttaagc cgcccctcgt gggaggcgat ccgagtccaa 3180 gacaccgtca ggttgggagt ttggctgggg cggcacatct gtcaaatgat aacgcaggtg 3240 tcctaaggtg agctcaatga gaacagaaat ctcatgtaga acaaaagggt aaaagctcac 3300 ttgattttga ttttcagtat gaatacaaac cgtgaaagcg tggcctatcg atcctttagt 3360 ctttaggagt tttaagctag aggtgtcaga aaagttacca cagggataac tggcttgtgg 3420 cagccaagcg ttcatagcga cgttgctttt tgatccttcg atgtcggctc ttcctatcat 3480 tgcgaagcag aattcgccaa gtgatggatt gttcacccac caatagggaa cgtgagctgg 3540 gtttagaccg tcgtgagaca ggtaa 3565 // ID BEL-164_AA-LTR repbase; DNA; INV; 477 BP. XX AC AAGE02018006; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-164_AA_; KW BEL-164_AA-I; BEL-164_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-477 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018006; Positions 11701 11225. XX SQ Sequence 477 BP; 196 A; 71 C; 89 G; 121 T; 0 other; tgtgacgacg agtacccctc ggctggctac caaaagtagt ttccacaggg tgttccgttg 60 cgccagtaga gctgtgacag gagaaatgtc accatgaaga atgagagatg acgaagcgaa 120 agcaatagaa attgaacaag aaaagaaagt agctgttaaa tttataaggt tgttaaaacg 180 gtaaaaagta gtaaaatttg tgaaattgaa gcctaaaacc taaagtgagt aaaatataat 240 tagtagttaa ttaaattgaa tcctaattag tttccaaaac ttagatacta cagagcaaat 300 tgccaattga aaacccactt aaactaaagg aaaaactaag gagaaaactg aaaaattgta 360 agtctaaatt tgttatccta actaatgaaa tacctaaaat aaatctaaat ttgcagctaa 420 agctgattcc tacaaactaa tagttttagt acgggctgct aagaaacagt ttcaaca 477 // ID BEL-140_AA-LTR repbase; DNA; INV; 352 BP. XX AC supercont1.258; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-140_AA_; KW BEL-140_AA-I; BEL-140_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-352 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.258; Positions 965353 965704. XX SQ Sequence 352 BP; 95 A; 87 C; 71 G; 99 T; 0 other; tgtgtacgcc aataattaat gcatttatgt tgcgagtgcc accctagcgc gtggtcactc 60 attctctgtt gtgatccacc actattgcaa acagcagata taggcattga taccgatcgc 120 ttcccgatcg ttatcgctct aaccacctct acggggtaga gatagcagaa actgaaatac 180 cccaacagtt agaaatatat ccccgccgcg atcggtactc tacgccgtaa taaattacgt 240 tgtgtatttt tcccgacgaa ataaagtgta gttttgtgca ttcgataaaa cccgtgtttt 300 cgaaagtgat tatatcaccg cggaccttcc ggacgttcta tgtgcgcaaa ca 352 // ID Gypsy-19-LTR_NVi repbase; DNA; INV; 1290 BP. XX AC . XX DT 16-APR-2009 (Rel. 14.04, Created) DT 16-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-19-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1290 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 776-776 (2009). XX DR [1] (Consensus) XX SQ Sequence 1290 BP; 346 A; 377 C; 314 G; 252 T; 1 other; tgtgacgaga cacgagtctc gcacatgtaa atacgtgtga aggcggccgc aatagacgcg 60 gcatgcgcag cgcgcgcgcg caacgaacaa aaggcgaatc cgcacgacgc ggagcgagcg 120 cgcgcgcgca acatccgcaa ggcgagtccg cacgagcgcg gagcgcggca gccggcgagc 180 gactcagcga cgacgcgcca ggcggcgcga gcggcgtacg cggtgcgccg tgaacgcacc 240 catacatatc cgcgccgcgc gcctttataa ggcggatcgc gcggcacgat gggcagttct 300 actcgagctc cactccaggt agcatcgtgc gcacacgtta ctacagggaa cagcgagaga 360 tagatctgcg ccgacgaaat cgtctcgtga gcacacgtga cgatttggta gacgaaccga 420 aacccgaaaa ctccgaattg ccgtcgtgag cacacgaggc aacccggaac acccgagcga 480 catcgagcct cgtctacgtg agcacacgtg agcgataccc ggtgccaccg taaaaatcct 540 aacctacatt ccgacacatt gctgcctccc tctgccactg tgagcccaca gggttaaggg 600 agtaacaata rtaccgccgg cgattcgccc ggtcgtgcgc tcacgtcctg gtgaaacaaa 660 ggtaaattcc taaccaacaa agccagcctt acgccgacat ctcacgactc gctttatctc 720 tgtgcgcaca caggataaag agaggaacga gacgagcgac gaagatttgt gaagccgtgc 780 gcatcacgac tccacaaatc cgagggataa cctcaaaacc ccgttacact atcccggccg 840 ttactgtgcg cacacagagc ggtcgagatc gttgttacgc gacacattcg gatcggtgta 900 gccgtgcgca cacgactcca ccgtaccgaa tgaattccta acctcaaaat ccgccaccga 960 tccgtcatac cgccaccgta ccgaaacacc gtagccgcga acaccgaatc attcttgtaa 1020 cgaaatcgcg tagatattct tgtaaattgt taaccgcgtt agtttaaaca attctatttg 1080 taaaggcgaa atatttttaa caagacgcat aattgatttc tgtttcagcc gggatttaac 1140 aactgattcc aaattgctat tctttactta agaaatatat tttgttatta ttaagaatat 1200 aaaaacagac tcttttcttg cccctttccc ctatccagat cctggaaccg actgactatc 1260 cagtctcttg gggaaagtta aaccgttaca 1290 // ID DEC2 repbase; DNA; INV; 465 BP. XX AC AJ132475; XX DT 24-APR-2000 (Rel. 5.03, Created) DT 24-APR-2000 (Rel. 5.03, Last updated, Version 1) XX DE DEC2 is a putative nonautonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DEC1; DEC2; KW MITE; TIR; nonautonomous DNA transposon. XX OS Tenebrio molitor OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tenebrio. XX RN [1] RP 1-465 RA Braquart C., Royer V. and Bouhin H.; RT "DEC: a new miniature inverted-repeat transposable element from RT the genome of the beetle Tenebrio molitor."; RL Insect Mol. Biol 8(4), 571-574 (1999). XX DR GenBank; AJ132475; Positions 1 465. XX SQ Sequence 465 BP; 153 A; 62 C; 62 G; 188 T; 0 other; tacactcacc ggcacaaaaa gcgactcatt atgatttctt taataaataa tcttgtgctt 60 attttttttt tttataatag ttaaattgag atattttcaa agatcgggag tgctatggtc 120 accatggcaa ccgaattgtt ttgtttaaat aaattattga aaagtttgcg ctacttccat 180 gttgtgtttt ggtcggttgt tttacgtgtt aaaattttta atttgacaat acgagatact 240 aattcaaaat actgttcaaa ttttgacaat tttttgtaac ataaaatttt tttctaaaaa 300 ataacttatc atttttcatt aaaatcttat ttcctgtgtt ttaatgtttt gtttgtcgga 360 tattttttgg caaagaataa cttaaatagc acctactaat acaacatgat accaaatgaa 420 ttacttaaaa ttcaaaatga gtagactctt caaacgatca ttgtg 465 // ID DNA8-8_CQ repbase; DNA; INV; 1915 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE DNA transposon from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1915 RA Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 85-85 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. 8 bp TSD. Putative hAT element. XX SQ Sequence 1915 BP; 598 A; 351 C; 375 G; 591 T; 0 other; gggtggtcca aatccggact tttttggggc taccccctga aatcaaagat tgacccatca 60 ctaggctaaa ttccaaattt gagctcattc tgaccacggg aacccctccc tccaatcgct 120 taaagtttgt atgggaaaaa tcgtcaaaat gtatggagaa aagcaactgt tttacttttt 180 tacctgtgga aggcgccata attatccgat tcttaccatt tctcaaatgt agaaccttca 240 ttaaatttag aacaactttc ccgaagacac catattttta ggattttttc ccgcgaagtt 300 attagcgccc aaaactgacc ctttttgcgc ggccagctgt aaggggctac ctaacaacga 360 tgtttatttc caattcgtac acgcacgtgc tctctctcag gttcaaactc tctcacgagc 420 ttgctcgcct gcctgcctgc ctgtttgcct taccaacaag cgcgcgctgt ttctctgctt 480 ggacttttgt cgtcgtcgtc gttttcgttt gctatccgct gcgctgcgtt cctggcatgt 540 ttattataat tttgaactaa aataccaggt ttgttttgag atataaaatt taatgtaata 600 tgtgatcaac aaaagatatc atattataag agcgtgtgag agagaataca aatttgatga 660 attagttcat ccatgaatcg tcatctgttc tgatccaaac ctaattgcaa acgttttgga 720 ttttaattta tctgcttttg ttacaagaaa taccatgcag ttttttttag ataaaaaagg 780 aaggaatttt ctttagattt tttttaaact gaatccaaat gagtatctta aaggaaggcc 840 gcaactgcaa gatctgaagg tagttaacga ttgtgcagaa agagatgttt cctgaatcga 900 aaaatttaca ggtaaactga caaaaaatga taaacagttg caatatttgt tgcaggtggt 960 ctaggaacac agaaagaaat ttccagaaag aataaaaaaa taattttaga gagtatgaat 1020 gaaaatttgc agcaataatt gtaaaattgc cttgatttgt tctgatttgt actaaagtat 1080 atttttgtta aagcaaacta cctaaaggaa caacatttgt attacctgaa gatttttttt 1140 aaataattga atcattatca tgtttcttta gcatatacga aaaactgcgt gtaatgcttg 1200 gaaaaacaaa caaaccctgt tctcatgaca tttgctgtaa ttttttaatc gagcctaagc 1260 tgtcttttct ttaaactatt tttttcagcg cagttttttt tatttttgac ataatttaag 1320 aatttatatg ctaaatatat ctcaaaacaa tcctgctatt ttagttgaaa attataataa 1380 acatgccaga aacgcagcgc agcggatagc aaacgaaaac gacgacgacg acaaaagtcc 1440 aagcagagaa acagcgcgcg cttgtaggta aacaggcagg caggcaggcg agcaagctcg 1500 tgagagagtt tgaacctgag agagagcacg tgcgtgtacg aattggaaat aaacatcgtt 1560 gttaggtagc cccttacagc tggccgcgca aaaagggtca gttttgggcg ctaataactt 1620 cgcgggaaaa aatcctaaaa atatggtgtc ttcgggaaag ttgttctaaa tttaatgaag 1680 gttctacatt tgagaaatgg taagaatcgg ataattatgg cgccttccac aggtaaaaaa 1740 gtaaaacagt tgcttttctc catacatttt gacgattttt cccatacaaa ctttaagcga 1800 ttggagggag gggttcccgt ggtcagaatg agctcaaatt tggaatttag cctagtgatg 1860 ggtcaatctt tgatttcagg gggtagcccc aaaaaagtcc ggatttggac caccc 1915 // ID Gyp1_Cis_I repbase; DNA; INV; 6033 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gyp1_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6033 RA Smit A.F.; RT "Gyp1_Cis_I - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000388, Ci000091, Ci000430 Both gag (bp 421-1587) and pol CC (1533-4757) closest to the products of the Gypsy retrotransposon CC CsRn1 in the trematode Clonorchis (46 identity, 62% similarity). XX SQ Sequence 6033 BP; 1794 A; 1237 C; 1242 G; 1748 T; 12 other; tatttggtga ccctgtttct gagcaaattt tgaggtggca ttgatgattg gaaagccgga 60 aaaagaagaa tttcaactac attgctttgc tccttaaagt ttcgaggtac atttctgcat 120 ccactgtaaa cgtcgcattc gccatcaaga acaaacgttg gacatttccg ttctattgtt 180 cgtggaggtg tcaagttcac acaacggatt atatacgcga gatcaacgga tactttgaag 240 gattctagat tctacagacg acgaccggat ctactacgac gggaacgaga tcaacttcac 300 cggacaattc aattaagttc tanatttatt atttggtttt gctatagtta ntctgtttat 360 atagatttgg tacggtttaa ttattnttat tattttttaa aacccacgga gtccgttaaa 420 atgagaaaat tctttggatt tagcggtagg tctgttaaaa aggagcgaaa tctgcctgtc 480 cccgaaaata ataatcatgg ttatgatgcg ggttatgacg atgattatca ttggtcggaa 540 gattataacc tacaagacgt tagtctggtg ggggacaaca tttcgttatc atcaccgacg 600 cccaacgtca aacaaccaac ctattatcaa aaaactccat ttaatcgtac gattgattcc 660 caacaacaac atcatcgttc tggtccttta ttgagcgacg ttttaaatag tctaacggac 720 tatttaaaga aatcgtctgt aaacttacct gaattcaccg aaagtatgcc cgagtcatgg 780 tttgattctg cggaaagtat ttttcgtcgc tttgatgtgg tttcggaaga tgataagttt 840 gaattcgtta aagaatcact aaggccccga catctccgac acgttgaaca cttgttggct 900 aaacgagata gaattttgcc ttacacgtcc ctcaaagaag cgctcatcag taactattct 960 gtttctaaga ctgataaaat catagcactg atgaacttcg gtccgcttgg tgatcagaaa 1020 ccatcggaga tgctgaaaga tatgttaaga ctgacaggtc cttacccgga tgaaaatacg 1080 acaatatggt tgcgtcaact gtttttagaa cgtatgccaa aaaacgtacg tagcacgctc 1140 aaagcattta cgtcggattc gctaggtgcg atgtccagcc gagcagatag tataattgac 1200 gacagtcgac attctgcggc attttcttct tccggggtag caaatcctac attgaacgat 1260 ttgcattcaa ttaacttagc agttgttgaa aaccttagag tacttaacaa taagctagtg 1320 aacagaccgt caaattcagc ctccaagaac agcggatatc aggacaatac cgtctacata 1380 gatgacgacc gagacgaaga acacccaccg gggcaatgga gcaatcgcgg acgaggttat 1440 tggagaagac caaaaaggag aaaacttaaa gttatagatt ccaaggagga ccctcaaaaa 1500 gatcaaaaaa actcccaggc aagacgttgg atatggcgta tagcgtcttg tgtagcactt 1560 ctaaatcgcc atctttactg tttataacgg atttgaccac tacgactaag tatcttattg 1620 atacgggagc ctctttgagt ctcataccac ctataagaga cgaccggaca agacgaaata 1680 tcggacgaga cttaattgct tctaatggat cagccctagc aacatacggt gagcgatttg 1740 ttaaactgga ctttggccat gacgaaaaat atccctggat atttactatt gccgaggtgg 1800 aattgcctat attgggtgcc gactttcttg agcattttaa tctgactgta gatttaaagg 1860 caagaatagt atctaatgct cgcaccaata aacggataaa aggcgaaaat tccggcataa 1920 atacgtcaat ttcggctaat ttcgtttcac caattgctgg aaatggttat gaagcgatat 1980 tgcaagaatt ttcggacgtc acgaaaccta aacagaaaat tcccgaggta aaacacaagg 2040 taactcataa gatcataaca aaaggaaacc cagttttctg tagacctcga cgtttatctc 2100 cggaaatgag caagattgcg aaggatgaat tcatgcacat gcttgagctg ggaattgtga 2160 gaagatccga tagcaactgg gcatcacctt tattaatggt tcccaaaaag gatcacactt 2220 ggaggttatg tggagactat cgacgattga acgccataac cgtcccggac aggtatccaa 2280 tccctcacat tcacgacttt gtgaattccc tgcatggggt gcagatattt tcaaaaatcg 2340 atttaatgaa agcgtactac catattcctg ttgaccctga ggatatacac aagaccgcaa 2400 tttcgacgcc ttttggtgct tttgaatttc tcaagatgcc tttcggtttg cgttgcgcgg 2460 cgaattcgtt ccagagattc atggacgaca ttctcagagg gttggatttt tgttactgct 2520 atttagacga cgttctcata gcaagttcaa gcgaacagga acacaaagag catctcagga 2580 tattctttca gcgcttgaga gacaatggag ttgtggtgaa ccctgcgaaa tgtgagtttg 2640 gtgtgacgac catgaatttt ttaggacacc ggattacgac ggacggaata gaacctctgg 2700 aagaaaagat tcaagttatt aaagactttc ctcaacccag cactatgaaa cagctacgac 2760 gtttcctggg tatggtaaat ttctacttcc gttttattcc taattgtgct gagatagcta 2820 aaccattgaa ccagcttctc acaccnaaga aaaatggaaa attcgctatt ccttgggaca 2880 aagatcaaga ngaagcgttt aacaggatga agattaaact tgccagagcg gctatgttaa 2940 catttcctgc accaaatgcc caaacagcgc ttgtggtgga tgcttcaatg atcgcagctg 3000 gaggtgttct acagcaaaga atcgacgatg cgtggcaacc actagccttc ttttctaaag 3060 ctttcgacgt acgccaggta aaatattcgg cgtttgatag agaattgctt gcagcttatc 3120 ttgcagttaa acactttaga tatttcctcg aaggacggga atttnctatt atgactgacc 3180 acaagccact tcttagtgcg tttcacactc cacgtgaaaa tgctaccgga cgccaagcca 3240 gacatttggc ttatgtttct gagtatacgt cggacattca atatctccct ggcactcgaa 3300 atatcgttgc tgatgcatta tccaggattg aagtgaacag catttttcaa gctaaggatt 3360 tattcaatta tcgtgagatg gcagcagctc agaagaatga tgcttccata gaaactttaa 3420 agcaatcgaa aaagagttct ttgaagttgg cgaaccgaag attggacgga tatgatgtct 3480 cgctattgtg cgatatttct acaggtaaac caagacccgt tgttcctgtt tctcttcaga 3540 aaaaggtatt tcagaagatt cattctttgt cacatccggg aataaggagc actacgaaac 3600 tcattcgtga acgttttgtg tggacaagca tgaacactga tatacggaac tggtgtcgac 3660 tctgtgaagc ctgccaaaga tcaaagatca ttcgacacaa tcaagcacct ttagccagat 3720 ttaaattgcc ggttgcaaga ttctatcatg tgcatttaga tatcgttgga cctttaccac 3780 cgtctagcgg ttatacttat ttactcactt gtgtagatcg ttttacnaga tggccagaag 3840 taatctgcct gaccgacatc aaggctgaga ccatcagtaa tgcatttttg ctacactggg 3900 taagccgttt tggcacgcct ggcattataa cnacagacag gggaaagaat tttacatcca 3960 atttgtttca caaactggcg gaatttttag gggctaagct tcaacacacg tgtgcgtaca 4020 gaccttgcgc gaacggccaa gttgaacgat tacatcggca gctaaagact tctcttaaag 4080 cccaggagag tcctcaagat tggtatgcta atcttggttt ggttctttta ggaatcagaa 4140 actctatcaa agaagacatc ggatattcat cagcagaact tgtttatgga actagcctnc 4200 ggttaccggg tgaattcgtt gacacggata atcttgagaa tgatttagac tcccacgaat 4260 atgtgagccg ttttaagact ttcatgagga atcttaaagc catnccaccc cgtataccag 4320 gaccccgagc tgtgtattta gacgacaaac ttcagtcttg cagccatgta tatgtcagac 4380 atgacgcggt tagaagacca ctgcaacgac catatgatgg gccgtttaaa gttttaaaaa 4440 gggatgctaa attcttcact ctagactata atgggatacc taatactgtg acagtagatc 4500 gactaaaagc ggcgaatgtt ctcattcctg tctcggagtc tactgatagt aatcccattt 4560 caccttcttt agaggttgag ataagtccga gcgaaattat taatcccgat ttcgatatat 4620 atactgatat atctgatcat gctgattacc acgaatctga cttagacaga actattgcgc 4680 cagcagatga gccacggaca actcgatttg gacggacaat ttttcgacca agacatttac 4740 aggattatga cctttaagtg ttagatctga cctctatacg cttctttatt ctacattatt 4800 taagcattac tttcgccatt agaaccttcg tgtcagtctt catacagatt tggagaagtc 4860 aatcacaggt taactacccg tttaattcta caagaatggt ggttaagcaa tcaatccata 4920 gacgatatat tccttgaatg tcaactgcaa gtatatcgct gttctctatt taataagctg 4980 gaagactgtt tcccagttga agtgctatca gccaagatgg ataattcccg agctgcaata 5040 actgttggaa atactgaaat tgtatgtgag caaataaatg ccacattacg ccacagtttc 5100 caatataaat cctacttctc catttgtcct ctcgttagct ttaccgatag taacggatca 5160 ttacgaatcg gccaagtctt gaaaaacgat gtggtctatg aaggtgtgag attagttgag 5220 cactacaccc gacaggacat ttacatttcg tgtaaacgat aagttctact tgtatgataa 5280 ctatacttta gaacatgctg atgtccatgt tcggcataaa atttcctctc taactccgat 5340 tgaagaacca gttcggacga tttggtttcg cttgcgaaaa aaattccctt atcgtcgatg 5400 ggattggagc atttcggcgc aatccttggt tcaataaatc aatcgcaagt tacaacggag 5460 aacatcaatt aactcctgaa tgaggatgca atgagacagc agatgcagat gcccatacat 5520 tcgttggaaa gtctatttga tcactttgcc aaacacgcac tcttgatcga tgctacanna 5580 atttcaaacc ctggtagtaa acacgatttt ctcggcattg cttactgctg ccctgaagtg 5640 gagacttgtt ttgaccatgt acacgatcat gaagtagata ttggttgtgc tcccaaagtt 5700 tacctatggc gatttttacc caaaataatc atgattagaa tagacggggt atagcaggtt 5760 agattcgacc cttaccaatg gacttccgct agacccaaat ttaagaacat tttacggtca 5820 accgtgggga catttttaca tcttttgtcg ccacaaatac gtgataacca ttttattatt 5880 attatttttt tatttttttt gtaccattcc ttgagttcga cactttttca gttcatttct 5940 tcgttttctg acgtaatatc ttttacgaag tatggccctt tttcgttcgt tattctaaac 6000 taaaatgacc atacctctga ggggggaagt gta 6033 // ID Sola2-5_HM repbase; DNA; INV; 5087 BP. XX AC . XX DT 11-FEB-2009 (Rel. 14.02, Created) DT 11-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola2 DNA transposons from Hydra magnipapillata, consensus. XX KW Sola; DNA transposon; Transposable Element; Sola2; Sola2-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5087 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC TIR is 700-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1563..3869 FT /product="Sola2-5_HM_1p" FT /translation="VVFVKSTWIQQKYLCSKGKFVVPGEKLCPQCRHKLAS FT SEESEENELEIKLTEDNMEKEIILESTRSLLNSTLCELEVSPLKVHLPKTS FT NKPSLGKRKIKQVEEAVIKKLATALDVTESDLVAPNNYEVLKEIETKAGDL FT DFLVECMKEKLKVSNRRQQLQILTLTPKSWSIRKAAKEFSVSKHKIQNAKL FT LRDQKGIIAYPNVVERHRISKDVIELVKLFYCDDEYSRQMPGKKDCVSVGK FT KKYMSKKLIMCNLKELYVAYKIKYPDHKIGFSKFCSLRPKWCILAGPKGTH FT SVCVCTIHQNVKLMLSAVGLETSYHEIIEKIVCSRKSKVCMIHRCNSCPGI FT QAAQIYFQQYLTQNDDPKEQYDSDENEQSVDFKQWTTTDRTELLTIQLPLN FT EFIELLCEKLDKITSHSFIAKSQSSYLKHLKETISIDEAIILADFAENYTF FT VVQDEIQSYHWSKSQCSLHPVVIYYKKVKLEISSFCVISDDLSHDVGFINE FT VMHKTINHIKTHLCPAITKIHYFSDGCAGQYKNCKHFYNLCHHAQDFSVQC FT IWNFFATSHGKSPCDGIGGTVKRLVSTASLQSPTTGQILSSQAMFEYCQKS FT ISGITFVYVTAEEMELVRIKLADRLLIATTIPGTRSFHQFIPSSHSIIKMK FT RVSDDDDFALTFDFLNKQKYEINKIENVMMSQYLFCKYDDFYWLGMVSEVD FT KENDDFMVKFMHPNYPSRSYYWPKRDDNCWVPRMNVISLVKTPSTSSGRQY FT HITKEDETLFEHLLSCQ*" XX SQ Sequence 5087 BP; 1815 A; 773 C; 811 G; 1687 T; 1 other; gggtaattcc atgtaaaact gggacaaaaa atctaaattt ttcagtcacc ctctcagatt 60 tctttcaaat tttacccagt ggtacctatc aatgtaacat gaaaaattgt aaaatttgag 120 ctcccaaaat caaacggttc aaaagttatg acattttgaa attcagccta aaacgttcct 180 ttttttgcca agtccgccat tatggatttt gtagttgtac tttttaatgt ataatttaag 240 atctagttga agaatgaggc taaaaatggg tacctgacca tttttgaggg tgctgaattc 300 aaaaatgtaa agcaaatagt gaaatttaaa ttttatgatg gttatttttt gattttaaaa 360 tggtcagtcc aattttttgt tctgttacca tgggttgcgc tttcatcaac tcccggagtt 420 gcccaagtct catggattca attttttttt caaatgaagt aggtgatcct aggctatctt 480 tataaaaaag atggttttgg tttcttaaag cattataaag atatatcatt ttgaaattta 540 tacaaaaaaa attccttatt tggactcaaa ttctctttca tagatttttt agctatagtt 600 taggtgtaaa taatttaagt tttagtcgaa gaaactgact gaaaattggt atctgaccat 660 ttttgaagat gctgaatcca aagtagaaaa aaaattaacc taatttaaaa tgttagataa 720 ctgttttttt ttttttttaa aatggtaaaa aagcacatta caataaaaag aggtaaggtt 780 ttcacaagtt tcctgatcac aagactctca tgggttcagt tgtgatgtgt aaaaagagtt 840 tattgaggta ggtttttata tataaataaa attttgtgaa gtgaattgtc atttaaaaaa 900 atagttttgc aatgcctgct gtgatgtata aatatagatt tgtgatgtca gatatgtgtg 960 tataaatata gttttaaaac accagttaat aaccataaac ttcaataaaa aaaattattt 1020 taattttaga ttaaataatt agagaattta gtatggtaat tcagtttctt agtcatggca 1080 aacagccaac agtgctccat tggaatcaag ctagaaagtg aatgtcatct tgcaacatat 1140 acatcgttgc ttggaatcga accttttgaa gatattcctg aatatgaaag agaaatatta 1200 atgtggagga ctgggctctc aataatcaat gtaaaaccaa caacttgtct acatcataaa 1260 catgtttatt tgaagcgtta tgcaactaaa ttaacacggt gctgcaatcc atttaatact 1320 cataacaaag ttataaaagg tatttatagt aatcacgaat ttatagtaaa acttttagtt 1380 ttgtgaaaat gttgtgagtt acaataatga aattcttggc agcaaaattc tattttagct 1440 atatttaacc ttacttttag ctgacttctg aaagtgcaca attaaatgcc atacaamttt 1500 aaacttttga tagttttatt aaaaatttta tacatattaa aagagttctt cccttttttt 1560 aggtagtctt cgtgaaatca acttggattc agcaaaagta cttatgcagt aaaggcaaat 1620 tcgtagtgcc aggtgaaaaa ctttgtccac aatgcagaca taaattagca tcaagtgaag 1680 aatcagaaga gaacgagttg gagataaaat taactgaaga taatatggaa aaagaaatta 1740 tactggaatc aacaagatca ttgctcaaca gtactttgtg tgaacttgaa gtgtctccgc 1800 taaaagttca tttaccaaaa acctctaaca agccttcact tggaaagaga aagataaagc 1860 aagttgaaga agcagttatt aagaaacttg caactgctct ggatgtcacc gaatccgatc 1920 ttgttgcacc taacaactat gaagttctaa aggaaattga aaccaaagct ggcgatctag 1980 acttccttgt tgaatgcatg aaagaaaaac ttaaagtttc taacagaaga cagcaacttc 2040 aaattttgac attgacaccg aaatcttggt ctatcagaaa agcagcaaaa gagttttcag 2100 tttccaaaca caaaattcaa aatgcaaagc ttttacgtga ccaaaaaggc atcattgcat 2160 accccaatgt ggttgaacgg catcgaatca gtaaagatgt catagaactt gtaaagcttt 2220 tttactgtga cgatgagtat tcaaggcaaa tgccaggtaa aaaagactgt gtaagtgttg 2280 ggaaaaaaaa atacatgtca aaaaaattga ttatgtgtaa tctaaaggag ctctatgtgg 2340 catataagat taagtaccca gaccataaaa ttggattttc taaattttgc agtcttagac 2400 ctaaatggtg catccttgct ggtccaaaag gtacacattc tgtttgtgtg tgtacaatac 2460 accaaaatgt taaattaatg ttgagtgctg taggccttga aacatcttac catgaaatca 2520 ttgaaaagat tgtctgcagc agaaaatcaa aagtttgcat gattcaccga tgcaatagtt 2580 gtcctggcat acaagctgcc caaatttatt tccagcagta tctcacacaa aatgatgatc 2640 caaaagagca atatgacagt gacgaaaatg aacagtcagt ggatttcaaa cagtggacaa 2700 cgacagatag aactgaacta ttaacaatac aacttccatt aaatgaattc atagagctcc 2760 tgtgtgaaaa acttgataaa attacttctc attcatttat agcaaaatca caatctagct 2820 atctaaagca tctcaaagaa acaattagta ttgatgaggc tattatactg gcagattttg 2880 cagagaatta cacttttgta gtacaagacg agatacagag ttaccattgg agcaagagtc 2940 aatgttctct tcatccagtt gtcatctact acaagaaagt gaaattggaa atatcttctt 3000 tttgtgtaat ttcagatgat ttgagtcatg atgttggatt tatcaatgaa gttatgcata 3060 aaaccattaa tcatataaaa actcatcttt gccccgcaat tacaaaaatt cattattttt 3120 ctgatggctg tgcagggcag tataaaaact gcaaacattt ttacaattta tgtcaccatg 3180 ctcaagattt ttctgttcag tgcatctgga atttctttgc aacaagccat ggcaaatctc 3240 catgtgatgg aataggtggc actgttaaaa ggcttgtttc aactgctagc ttgcagagtc 3300 caacaacagg tcaaatacta tcttctcaag caatgtttga atactgccaa aagtcaatca 3360 gtgggattac gtttgtgtac gtcactgctg aagaaatgga gctagtaaga atcaaacttg 3420 ctgatagact attaatagca accactattc ctgggacaag aagtttccat caatttatac 3480 cttcttctca ttcaattata aaaatgaaaa gggtatccga tgatgatgat tttgctctaa 3540 catttgactt cctcaataaa caaaaatacg aaattaataa aattgagaat gttatgatgt 3600 cacaatacct attttgcaaa tacgacgatt tttactggct tggaatggtt tctgaagttg 3660 ataaagagaa tgacgatttt atggtaaagt ttatgcatcc aaattatcca agtcggtcgt 3720 actactggcc aaagcgggat gataattgct gggtgccaag aatgaatgtc atttctcttg 3780 ttaaaacacc gtcaacatct tctgggcgtc agtaccacat cacaaaagaa gacgaaactc 3840 tatttgaaca tcttttatct tgtcaatgaa agttgatgaa tattttactt tcattattta 3900 aattatataa acttcaattt tagtcatatt tggtttattc cttttccttt ttatttttga 3960 gcttttttca ataaagcaga ttttttacta tttttcaaaa aggtttgcat tttagtttct 4020 tattttttaa cacaacattt gagcttatac cctattatta gattcataaa gtaataatgt 4080 tacgttaaat tattataata ataataaata aagaaattat attttcaatg tgttgtatag 4140 taaattatat ttccatgtat acttttgaaa atattgatta taaatatgca agttatgcaa 4200 agcattataa tttttttatt ttcttttaga aacctcttca aaacttttta ttcacaatga 4260 gagtaagacc tcttatttca caataagagt aaggcaactc cagaaataag tgaaaggtaa 4320 accatggtta agaaataaaa aaagtttaac tgttgatatt ttaaaattga aaaattattc 4380 tcaaatacta ggaatagggt tcatttatct ttatctttgt attcaatacc tttaaaattg 4440 gtcaggtacc aattttcagt catatttctc cactagaact ttaattattc acacaaaaag 4500 tatcgctata aaatctataa tggagaatag aggctaaaaa atgatttttt tttgtctaaa 4560 tttcaaaatg acatatctta ataattcttg aagaaaccaa aaccatcttt tttataacaa 4620 tagcctagta tcacctactt tgtttgaaaa aaaaattgaa tccacgagac ttgggcaact 4680 ccgggagttg atgaaagcgc aacccatggt aacagaacaa aaagttggac tgaccatttt 4740 aaaatcaaaa aataactatc ataaaattta aatttcacta tttgctttac atttttgaat 4800 tcagcaccct caaaaatagt caggtactca tttttagcca cattcttcca ctagatttta 4860 aattattcat taaaaagtac aactacaaaa tccataatgg cggacttggc aaaaaaagga 4920 acgttttagg ctgaatttca aaacgtcata acttttgaac catttggatt tgggagctca 4980 aattttgcat tttttcatgt ttcattaata atgtaccact gggtaaaatt taaagaaaat 5040 ccaagaggtg actgaaatga ctgtcccagt tttacatgga atgaccc 5087 // ID Harbinger2-2_HM repbase; DNA; INV; 3341 BP. XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 09-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE A family of autonomous Harbinger transposons - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-1_NV; KW Harbinger2-2_HM. XX NM Harbinger2-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3341 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1222-1222 (2010). XX DR [1] (Consensus) XX CC Harbinger2-2_HM belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-2_HM is a consensus sequence of a family of CC autonomous Harbinger transposons that were active in the hydra CC genome in a last few million years.The consensus sequences was CC derived from several copies of that are ~99% identical to it. CC The Harbinger2-2_HM transposon is characterized by 30-bp CC imperfect (6 mismatches) terminal inverted repeats, 3-bp target CC site duplications (TNA), and it encodes two proteins: (i) the CC 359-aa Harbinger2-2_HM1p transposase and (ii) the 218-aa CC Harbinger2-2_HM2p Myb-like DNA-binding protein. XX FH Key Location/Qualifiers FT CDS 442..1518 FT /product="Harbinger2-2_HM_1p" FT /note="Harbinger TPase." FT /translation="MTALNSLKTARDSLLIAYSEDIIDDCEFALLYQLSYS FT RDIYPHWDYNKFNLSLLDDAQCWTDLRFRKTDLPHLLNIFRLPDVIKCTQG FT TICRGMEALCIMLKRLAFPCRYTDLANTFGRNPTEICLIFNTVIDHVYNKL FT SHKLLLWDQPMLQSNNLRQFADYIHGKGAPLDNCFGFIDGTVRRIARPKTN FT QRIVYNGHKRVHALKFQSIVVPNGMIANLAGPFEGKKHDSTMLCESGLLQQ FT LQQFAWHDGRPLCLYGDPAYPIGVHLLAPYRSLNITPDQHAFNKAMSAHRV FT SVEWVFGLMTNYFKFIDFKQSQKLGSSPIGKVYIVCSLIQNAHTCLYGNIV FT SDYFGLEAPSLQEYFQ" FT CDS 2390..1737 FT /product="Harbinger2-2_HM_2p" FT /note="Myb-like DNA binding protein." FT /translation="MQWNIEKDIMMMREVAALGVLIQKPGSKERGQLWQQV FT SDSLNKNGFYVTSRGVRDRLSNIMKKHRAQANKEKKLSGEGGKEITEYDIL FT VEELIEVSDDTDAQKDEKSQEKKNAVDEDRVKAIDIRNTAMERYGETRKRK FT ALNNEDKSPKTSRRSSNDTLIFLREKMEADKENRRIEREERAEARALAQQQ FT QNNIQNMFNHMLAQQTEILKMLLEKRNL" XX SQ Sequence 3341 BP; 1158 A; 486 C; 461 G; 1236 T; 0 other; aggctgattt agaaactgag acgcggacgt attgtagaac gccgttgtaa aaccgcggtc 60 tcgttgtctc attttatttt taatctgatt tagaaaagtt acaaaccgga acgtggactt 120 aggatttcga ttattcaaaa aaaaggagta tcgaatgttt ttgtttaaaa aaaatgcaac 180 gttgtataaa aataaaatga aataaatttt atcaaatttt ttgttttatt ttaattagtg 240 tttttttttt taaatatata tatatatata tatatatata tatatatata tatatatata 300 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 360 tatatatata tatatatata ttattagtaa tagtaaaata aaaaaagtaa taattaaaat 420 aaaaaagtat caagttgaaa tatgactgcg ttgaacagcc ttaaaactgc aagagactca 480 ttgttaattg cttactccga ggacataata gatgattgtg aatttgcact gctctatcaa 540 ctaagctatt caagagatat atatcctcac tgggactaca ataagtttaa cctatcgttg 600 ctggatgatg cacagtgttg gactgatcta cgttttcgaa aaactgactt accacatttg 660 ctaaatattt tcagattacc tgatgtaata aaatgcactc aaggaacaat ctgtagagga 720 atggaagcct tatgcattat gcttaaaaga ttagcatttc catgtcgcta tactgatttg 780 gcaaatacat ttggtagaaa ccctacagaa atttgtctga tatttaacac tgtaattgat 840 catgtttaca ataagcttag tcataaacta ttactttggg atcaacctat gttacaatca 900 aacaatctac gacagtttgc tgattatatc catggaaaag gggcacccct agataattgt 960 tttggtttta tcgatggaac tgttcgtaga attgctaggc ctaaaacaaa tcaaagaatt 1020 gtctataatg gtcataagag agttcacgca ctaaaatttc aaagtatagt agttccaaat 1080 ggtatgatag caaacttagc tggtccgttt gagggtaaaa aacatgatag cactatgtta 1140 tgcgaatctg gtttattgca acagttgcag cagtttgcat ggcatgatgg gcgtccttta 1200 tgtttatatg gtgatccagc ctatccaatt ggagtacacc tcttagcacc atacagaagt 1260 ttgaatatta ccccagacca acatgcattt aacaaagcta tgagtgcaca tcgtgttagt 1320 gttgagtggg tatttggttt aatgacaaat tattttaaat ttatagattt taaacagtct 1380 caaaaacttg gctcaagtcc cattggaaaa gtgtatattg tatgttcatt aatacaaaat 1440 gcacacactt gtctttatgg gaacattgtt tcagattatt ttggtttgga agctccctca 1500 ctacaagagt attttcaata attcaatatc gataagaaat tataatgata aacttattaa 1560 aaagtatgcc tgttgtaaat cttatagatt attgaaaata ttgctaaact ttcacgattc 1620 tttaaaatat aaaaagatag tcacacaaag aattatttat tcccaaaaaa atgctaacat 1680 aacaattaag ttcaaaagaa aaaatttagt ttattaaaaa taaaaacttc tttttacaaa 1740 tttctttttt ctaataacat ttttaatatt tctgtttgtt gagctaacat atgattaaac 1800 atattttgaa tattattttg ttgttgctga gcaagagctc ttgcttcagc tctttcttct 1860 cgctcaatgc gtctattttc tttatctgct tccatttttt cacgcaaaaa tatcaatgta 1920 tcatttgatg atctgcgaga tgtctttgga gacttgtctt cattattaag ggcttttcgt 1980 ttccttgttt ccccataacg ctccatagca gtatttctta tatctatagc tttcacacga 2040 tcttcatcta ctgcattttt tttttcttga gatttttcat ctttctgagc atctgtgtca 2100 tcactgactt caattaattc ttcaaccaaa atatcatatt cagttatttc ttttccacct 2160 tcaccagaaa gttttttttc cttgttcgcc tgagcacgat gctttttcat aatattagat 2220 aatcgatctc gtactcctct gcttgtaaca tagaaaccgt ttttgtttaa tgaatcagaa 2280 acctgctgcc acagctgccc acgctcttta cttccaggct tttgaattaa gacacctaaa 2340 gctgccactt ccctcatcat cattatatct ttttcgatat tccattgcat acatgttcta 2400 aataaaatca attaattaat atatatgtta attttgagtt aagtcctgat tataaattgt 2460 tgacagatta atgtaaacat tgataactta ttttccagag aaaaaaattc atagtcaaca 2520 tgttataaat aaaaatactt gagtaaattt gtcaacaact tgatatttac aaattgtatt 2580 tgtaccaaat ataatttata taaattatat atttattatt gatatataat tctataattt 2640 atatatataa tgcaatatat aattatattt gtgtaaaata ttatcacaat tttattaata 2700 tcaatgaaac aaatgattac taagtaaagt tttaatattt ttagttgtga catatctaac 2760 aaaaatatta ataacattac aaaaaaatga gtttcactgg tataaaataa ctcactgtca 2820 tttaaaatat tttactgtca taatatatta aaaatacttt aatagtattt caaatactat 2880 ttaagagttg ttgactcgta gaaacgctaa ttgtgatcac aactagtgaa atatattatt 2940 aaaattatat tataatttta gtaatatatt tcactagatg taatcacaat tagtgtcaac 3000 aactataatt tataagaaat aaatctcaga actaatagtt aaattaataa cagattgaat 3060 aaagttatat attgaaaaat cttacgaagt aatttacctt ttttggcttg aatttgatga 3120 agaacatatt gcatttggac aaacatctgg atttgaattc ggcgaagtac aagtatcatc 3180 catctttaat ttgacgtttt gaagtagacg cggaaatgtt tgttttgagg ttgagaacgc 3240 gacttcaaaa agtcaagacg tgaaaaatta ttccgcacca gtcttgtttt gggatttccc 3300 tagctttttt cacgttcccg tcttgctttc taaatcagct t 3341 // ID Kolobok-17_HM repbase; DNA; INV; 2762 BP. XX AC . XX DT 20-JAN-2009 (Rel. 14.02, Created) DT 20-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2762 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 426-426 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 427..2202 FT /product="Kolobok-17_HM_1p" FT /translation="MANKKRIIRPRKRKFNGNQHNNSNNTNKQFDTNTGSS FT TSSKKLKNNLEKTLFDXDEFIFFMHFKQMKSCFEKYATCNICGTKLSMLHD FT HSRHLGFSLFFQIYCSGCAFKDTFFSSPVVKNNKPGINTSEINVRSVMAFR FT EIGRGRDAMLTFTSIMNMPPPLSKPSFDCINSKLYDAYKSVAEQSMKNAAM FT EVRRILKPDSNMNDVIDCGISIDGTWQRRGYSSLNGVVVATSHDNNKVLDT FT VTLSKFCKGCQIWEGKKGTLKYEQWKVNHSCQVNHTKSAGAMESAGATEIF FT CSSVNKYNIRYSKYLGDGDTTSFSTVIAQKPYGDDLVPIKLECIGHYQKRT FT GNRLRLKRKELRRVKLSDGKGISGKGRLTDKVINILQNYVGMAIRQNSNSL FT LEMRNSVIASLHHCTNFTSEEYRHMFCTKGEKSWCKWQSDRATGHVTYKKK FT VNLPVAIMEKIKPVFQDLANTDMLAKCLHGMTQNGNEAFNQLIWNRSPKTI FT FTSKVVVEMAVYSATITYNDGFAKLSSVFDALNIKSGSYFEIGASKKNSIR FT LKNMNKKMSDETKKKRKKLRSVRRNYFGQKLDKETAKDYCAGGF*" XX SQ Sequence 2762 BP; 987 A; 386 C; 450 G; 936 T; 3 other; ggtggtatcc agccattatt ttggtgaaaa ataaaaaaaa aatgaaaatt ttgaaaaaat 60 gttattcata caattgaaca ttgtagaatt caatagaaaa aatatttttt caaaaataat 120 gttagaatat caagaatatt cttctcaaac acatcaaggt atttttatat cacatagcaa 180 cgaccatagc aaccgtttta gcctataagt tagtctctga aaattagaca taccaagcct 240 gtaactattc tttttttaat gttttaagtt tatattggtc taaatgtcat aaatctaagt 300 tacattgcta tttttaacgg tttaacatcg tgttgttgcc tttttgaatt cagatgaaat 360 agtgctttac actttwaatt tttatatctt acctattaca gttaaaaagt tgttgaaaag 420 tatattatgg ccaacaaaaa aagaataata agaccaagga aaagaaagtt taacggcaat 480 cagcataata attccaataa taccaataag caatttgata caaacacagg aagttctaca 540 agtagtaaaa agctaaaaaa taatcttgaa aaaacattat ttgatracga cgaatttata 600 ttttttatgc attttaaaca aatgaaaagt tgttttgaaa agtatgctac ttgtaatata 660 tgtggtacaa agctttccat gttgcatgat cattcaagac acttgggttt ttctttgttt 720 tttcaaattt attgcagtgg ttgtgctttt aaagatacat ttttttcttc tcctgttgtc 780 aaaaacaata aacctggtat aaatactagt gaaattaatg tacgcagtgt tatggcattc 840 agggagattg gcagaggcag agatgcaatg ttaactttta cctccataat gaacatgcca 900 ccacctttat caaaacctag ttttgactgt attaatagca aactttacga tgcttataaa 960 tcagttgctg agcaaagtat gaaaaatgct gctatggaag ttaggagaat actgaaacct 1020 gactctaata tgaatgatgt tattgattgt ggtatttcta ttgatggtac ctggcagcgg 1080 agaggatatt cttctttaaa cggtgttgtg gtagcaacgt ctcacgacaa caacaaagta 1140 ttggatactg ttacattatc taagttttgt aaaggatgtc aaatttggga aggaaaaaaa 1200 gggacactca aatatgagca gtggaaagtt aatcattcct gccaagtaaa ccacactaaa 1260 tcagctggtg cgatggagtc agctggtgca acagaaatat tttgctcttc agtaaataaa 1320 tataacattc gctattctaa atatctcgga gatggagaca ctacctcrtt tagcacagtt 1380 attgcacaaa aaccttatgg tgatgacctt gttccaataa aactagaatg tattgggcat 1440 taccagaagc gtacagggaa tcgactgcgt cttaaaagaa aagaattacg acgggtaaaa 1500 ctatctgacg gaaaaggtat atctggtaaa ggtcgtctca cggataaagt aattaacatc 1560 ttgcaaaatt atgttggaat ggctattcga caaaattcaa atagtctttt ggaaatgagg 1620 aactctgtaa tagcttcact tcatcattgc actaatttta cctctgaaga atatcgtcat 1680 atgttttgca ctaagggtga aaaaagctgg tgtaaatggc agtctgatag agcaactggt 1740 catgttacat acaagaaaaa ggtgaattta cccgttgcca ttatggaaaa gatcaagcca 1800 gtatttcaag atctagcaaa tactgatatg cttgcaaaat gtttgcatgg catgactcaa 1860 aatggaaatg aagcgtttaa tcaactaatt tggaaccgat cccctaaaac tatattcacc 1920 tctaaagttg ttgttgaaat ggctgtttac tctgcaacca ttacttataa tgatggtttt 1980 gctaagttat cttctgtatt tgatgctctt aatattaaat ctggtagtta ctttgaaatt 2040 ggtgcaagta aaaaaaactc tatacggcta aaaaatatga acaaaaaaat gtcagatgaa 2100 acaaaaaaaa agagaaaaaa attaagatct gttagaagaa actattttgg acaaaaatta 2160 gataaagaaa ctgcaaaaga ttattgtgct ggtggttttt aatgatttat ctgagtagta 2220 tacatgattt ataattttaa aattgatttt tctcaatttt tgttttttgt acgggtagct 2280 gtagcaaaac aatgatatct caataaaaaa acattatttt agcttcaaat tttcaggata 2340 tgttcattat tgatgttttt agtgcctgaa cctgaaaaaa acatttaaaa aaattaagct 2400 taacctacaa ttaccatatc taattagtgt tttatcgcta aaaatagttt tttatatcat 2460 taagtctgcc atttttgatt gtttaaattt ttttcaaaaa tttttggttc aggcactagc 2520 tacactagtg tactagcttt ctgcaaaatt taaaccttta atattgattt attcttaaga 2580 tatttatgtt cttgtgacgt taacattttc accttttttt gctgagtcag caattatttt 2640 ttattaatta aaaaaaaaaa ttgcattacg ttttagattt ttatctatta tgctagtaat 2700 tcatcatagg tatttattat gtgtaaaata taaaaaaata gaactaatgg ctggatacca 2760 cc 2762 // ID Cre-1_BM repbase; DNA; INV; 3812 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.1, Created) DT 14-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Cre-1_BM non-LTR retrotransposon - consensus. XX KW CRE; Non-LTR Retrotransposon; Transposable Element; Cre-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3812 RA Kapitonov V.V. and Jurka J.; RT "First examples of CRE non-LTR retrotransposons in animals."; RL Repbase Reports 9(10), 2157-2157 (2009). XX DR [1] (Consensus) XX CC Cre-1_BM is a family of non-LTR retrotransposons that belong to CC the CRE clade. The silkworm genome contains >100 copies of CC Cre-1_BM that are ~98% identical to the consensus sequence. Most CC copies of Cre-1_BM are 5' truncated. The silkworm genome contains CC several families of Cre non-LTR retrotransposons. XX FH Key Location/Qualifiers FT CDS 894..3770 FT /product="Cre-1_BM_1p" FT /note="RT and RLE domains." FT /translation="MSHSQPLPPNPNTFPAGLTQCDECLRYFKGNRGLKIH FT TGRIHKAESVPNTLCPGSSRDNRVSVVPLWKKLSLYKNNSPVVKRVPRGAR FT VMVAEELKKNIDQVLHDNNVTSWEDLLSFAYRILHVDKQSNNSHSLTKKIK FT DNCKNNSNKTLLDDGVNYDKQQRPFNKVAKIEGKVSDGDLKGASQLLFSSD FT SLAPNNADTLLALKSKHPAPLTPVSVPPPQSSTTLPRVSLDDVLTTITSFK FT NGSAGGLDGLSPQHFKDLINVSAGDVGRSLLKSIVSLVNLMLAGSVPDSIA FT DILYGANLCALMKRDGGIRPIAVGGTLRRIVSKLCCKEVLPILSGMFRPIQ FT LGVGTKGGCEAAVHAARTFLKGGGAEVLVKVDLKNAFNCIDRNAFLKEVKE FT HIPSIYPYLYQCYGNPSKLVYKENLVESATGCQQGDPLGPAIFSLGIHPVL FT RELKSSLNLWYLDDGTLGGDISTVLKDLEYLKSALASLGLELNFAKCELFI FT NASCCSVPVSDVFAKFNTAAPNIKVVDENSLMLLGAPILDGSIDAYIDEQI FT DKFSQCSNNLFEINSHMALFIIRHCLFVPKFTYVLRCSFLWKHNNKLNSLD FT EMIKGVLCKIINCNLGDLSWEQCVLPVRFGGIGVRKASAVALPAFLASAHA FT SCLIVGNVLNPCMGSVEIVGVSEAGSAWQSLCQADFPTLPHLQRLWDTPLC FT QRIQQRLLQSYDNSVDIARFRAVTEKESSYWLQAYPSSNTGTLLDNNALSL FT AIGLRLGAHINVPHFCVCGEYVNELGHHGLSCQKSAGRIPRHASLNDMIRR FT ALTTVNVPAVLEPIGILRSDGKRPDGVTLIPWSVGRCLVWDATCVDTVAPC FT HLDGTSSRPGSAAASAEITKKRKYSGLGSSYSFLPFAVETMGPWGPEAKXF FT LTEISKRLKEVTHDAKAGWYFAQRVSLAVQRGNVASILGTVPPSSALEDLF FT YCVG" XX SQ Sequence 3812 BP; 1031 A; 701 C; 850 G; 1229 T; 1 other; tggcaacatt gcccgcgtaa acatcgctcg tacataaatt agcttttaaa aattgtaaac 60 ctgtgatttt gtaaatataa tcactaaaat tcattgtacc tgtcctaaat atacagtaca 120 agtgttttcg gtgacagaag ataaatataa gggtgtttcg taagaaagaa acgtttcgtc 180 aaccttacgt acaaatcagc tgattaacag ctgtgtgttg tttacgttac aactagtgaa 240 gtgaagtgtt tacgatatca atgcggcaga agaatctgaa cttggcagca atttgggtgt 300 tttgtagcta tcaagtgtcc ccgtaaatca cacatccatc ttcgtaataa cggggtggct 360 tgtgtgcagc attacatttg actacaagat accgtcgatc aagaaaaaaa tattattgtc 420 aaaggaatac tttaccaaga aatatcaata tcataggaac attcacttca cgtgcaagca 480 atacgcctgg tctagcggtg tgaactatga aatcaagtac gcagtgaagg agttaccgcg 540 ttcgagccca tggttatatc gtcaaaatgt cgttttctat ttttatattt ttttattgtt 600 ttttatttat ttaaagaata attaagtcaa tattgatgta aaattttaca aaattagttt 660 aaattttatt tttttaattt gcaaaattga ttttattgac atgattttta gttattttgg 720 attattattg ttatcattca aatatatttt attggtattg tttttaattt gacatattta 780 aatggaattg ttatagcttt attactatta tcattttcta attttacttt tctacaatag 840 ttattaactt agttagctac atgttaaaga ttttataata tgtttaaaat aatatgtctc 900 actcacaacc gctaccgcca aatccgaata cttttcctgc tggtcttact caatgtgacg 960 aatgtttacg gtattttaaa ggtaataggg gtctaaaaat tcatacaggt agaatccaca 1020 aagctgagtc agttccaaat accctatgtc ctggttcatc gagggataat agggtttccg 1080 tggtcccgtt atggaagaaa ctgagccttt ataaaaacaa ttcgccggtt gttaaacgag 1140 ttcctcgtgg agccagggtt atggtggccg aggaacttaa aaaaaacatt gatcaagttt 1200 tgcatgacaa taatgtcaca agttgggaag atcttttatc atttgcatat agaattttgc 1260 atgtagataa acagagcaat aactcccatt cacttactaa aaaaattaag gataattgta 1320 agaataattc caataagact ttgttggacg atggggtaaa ttatgacaag caacagcgtc 1380 catttaataa agttgcaaaa attgagggga aagtgagtga tggagattta aagggtgcct 1440 cacaactatt attttcatca gattctctag ctccgaataa tgccgacact cttctggcgt 1500 tgaagtcgaa acatcctgca cctttaacac ctgttagtgt gccgccacca cagtcttcca 1560 caactttgcc tcgtgtgtct cttgatgacg ttctcaccac aattacatct tttaaaaacg 1620 gttcagcagg cggactggat ggtctatccc cgcagcattt caaggattta atcaatgttt 1680 ctgctggtga tgtcgggcgg tctttactta aaagtatagt ctcgcttgtt aatttaatgc 1740 tggcgggaag tgtacctgac agtatcgctg atattttgta tggagcgaat ttgtgtgctt 1800 tgatgaagag agatggcggt attaggccta tcgctgttgg cggtacactg cgccgcattg 1860 tatccaagct ttgctgcaaa gaggttctgc cgatattgag tgggatgttc cggccgatac 1920 agttgggcgt cggtacaaag ggcggctgtg aagctgcggt acatgcggct cgcacctttc 1980 ttaagggggg tggtgccgaa gtattggtga aggtggattt aaaaaatgca tttaattgta 2040 tcgataggaa tgcttttctc aaagaagtga aggaacacat accatctatc tacccttacc 2100 tttatcaatg ctatggtaat ccatctaagc tggtttataa ggagaacctt gtggaatcag 2160 cgacgggttg tcaacaaggc gatccactcg gtccggcgat ttttagtttg ggaatacatc 2220 cagttttgag ggagctgaag tccagcctaa atttatggta tctggatgac ggaacattgg 2280 gaggggacat ttccacggtc ttaaaggatt tagaatatct aaaatctgct ttggccagtt 2340 tgggtttaga gctcaatttt gctaagtgtg agctttttat caatgcctct tgttgttccg 2400 taccagtttc tgacgttttt gcaaagttta ataccgcagc gcccaacata aaagttgttg 2460 atgagaattc tctgatgttg ctgggtgctc ccattttgga tggatcgatc gacgcgtata 2520 ttgatgagca gattgataaa ttttctcagt gttccaacaa tttatttgaa attaactctc 2580 atatggcttt gttcatcata agacactgtc tattcgtacc taaatttaca tatgtgttaa 2640 gatgttcttt cttatggaaa cataataata aacttaattc tttggatgaa atgataaaag 2700 gtgttctgtg caaaataatt aattgtaatt tgggtgacct ctcttgggag caatgtgtcc 2760 tgccagtacg ttttggtggt atcggcgtac gtaaagcatc agctgttgcg ttgccggcct 2820 ttcttgcttc tgcacacgcc tcctgtctca tagtcggtaa tgttctcaac ccttgtatgg 2880 gttccgtaga gattgtgggt gtgtcggagg ctggatcggc ctggcagtct ttgtgtcagg 2940 cggattttcc tactttaccg catttgcagc gtctttggga cactccgctt tgtcaacgga 3000 tacagcaacg gttactacag tcatatgata acagcgtgga cattgctcgt tttcgcgccg 3060 ttaccgaaaa ggaatcaagt tactggcttc aagcctatcc atcgtccaat acaggcacgt 3120 tgttggacaa caacgctctc tcgctagcta ttggcttacg gttgggggcc catattaacg 3180 taccacattt ctgtgtctgt ggcgagtacg tcaatgagct tggacatcac ggactctcgt 3240 gccagaaaag cgctgggcgc attccaagac acgcgtcttt aaatgatatg ataagaaggg 3300 cgctcactac agtcaatgtt ccggctgtct tggaacccat cggaattttg aggtcagatg 3360 ggaaaaggcc cgatggagtc acgctgattc catggtcagt tggtaggtgc ctggtatggg 3420 atgctacttg tgtggatact gtggctccct gtcatttaga tggtacttcg tcccgccccg 3480 gttcagctgc agcttcagca gaaattacaa aaaagcggaa gtattcgggt ttggggtctt 3540 cttattcttt tttgcctttc gcggttgaga cgatgggccc ttggggccct gaagcaaaga 3600 awtttttaac agaaatctcc aaacgcttga aggaggttac acatgatgct aaggctggct 3660 ggtactttgc acagagagtg agtctggctg tccaacgagg caacgtagcc agtatcttgg 3720 ggactgtgcc tccttcaagc gcgttggaag atctgttcta ctgtgtgggt tgagttatgt 3780 taaattattt tgatttgact ttttgttaat aa 3812 // ID I-1_BM repbase; DNA; INV; 5377 BP. XX AC . XX DT 27-JUL-2009 (Rel. 14.07, Created) DT 27-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Silkworm Nimb non-LTR retrotransposon - consensus sequence. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; I group; KW I-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5377 RA Kapitonov V.V. and Jurka J.; RT "Nimb - a novel clade of animal non-LTR retrotransposons."; RL Repbase Reports 9(7), 1536-1536 (2009). XX DR [1] (Consensus) XX CC Nimb is novel clade of I-like non-LTR retrotransposons. It CC includes families of retrotransposons present in fish, molluscs, CC sea squirts, sea urchins and insects: I-1_DR, I-3_DR, I-5_DR, CC nimbus, I-3_AC, I-4_AC, I-1_CI, I-1_SP, I-1_AA, I-1_BM. I-1_CI is CC a family of tunicate Nimb non-LTR retrotransposon. The consensus CC sequence was derived from multiple alignment of several copies CC ~98% identical to each other. The 3' terminus is composed of the CC (TAAA)n microsatellite. XX FH Key Location/Qualifiers FT CDS 156..1472 FT /product="I-1_BM_1p" FT /note="ORF1." FT /translation="MDKPTPTVSYANVTSKTSYPKKDQAIIINSIDGIMIR FT DYLLALSAVTKSTSIRFISRISNSRICVYLDSKQTADELVDVIKILKVKDH FT VLEIRPLITRNKRIVLSNVCPIIPHDILEEKFNEMGIKVLSPITFMKVGIP FT DPGFSHIMSFRRQVYVSPEDEKRLPESMQINYDETNYWIYITNETMKCFVC FT NGMGHLAKYCPQSSQQVLGTKGPNNLTQDNSTTSNDTTVESNKLPSQSLNS FT TINLESEINQIYPPTEISKPQVPTNLLPHNKFDIYKGVKRIHPSTSSEPSQ FT PEINQAEPTLDDSNIIVSDYTDETDSNWSLDEELKKTPKSLKKKKKTLDDR FT SEEQVWKDIKTELDQSETQPHFPLTLDQFISFLDNSRSKQNLQELIKDYTD FT DIPDLIDMCTFLRTKMNLNRNLKNRCTRLVKKLELIKLLETSNKP" FT CDS 1532..5308 FT /product="I-1_BM_2p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="MKILQWNVNGFHRRLENIQRILNDIDPLCICIQETNF FT KNNYCAKLKNYIHFFKNRSNSSIASGGVATYVKSHTLPKEVDIISPLEVIA FT IRIEYPLAITICNVYLPNSCPLSENDLNNLKNQLPTPFVLLGDFNSHHFYW FT GSDHCDLRGKIISDWLDVDENMILLNTNQPTHFNNSNGHTSSIDLAFASRS FT ISTYFDWYALDDLYDSDHYPIMITLKIDLSPANENAPQHFKYQMADWPNFR FT SDIEIKINQLSKVTPESLTRINDLVDEFNQILLGSAEKHIPKTTGKIKKKQ FT VPWWNDECDKSLKESKRAYRLYKKEKRKAEYYQKSPNRLDDLQNLQDSLKI FT DYKRKRAKRFNARRIFKESRKKSWESFITSINSYTPPGIVWQKINAINGQK FT SKKQTIVLKNNDRQLSGDPKNNSELLASAFAANSSSNNYSQEFLSFKNQIE FT ENNNVFDFLDNSNSINQNITIHELYFYLHQLKHSSPGPDNISNTLIKNLPL FT VALERLLDIYNFIWLNQVFPDKWREAIVIPIPKPGKLPTLTSSYRPISLTC FT NLCKLLEKIVSNRLRWYLEQNQLISAVQFGFRQYKSTLDHLAYLENEILTS FT FAIKNKVVAVSLDLEKAYEMVWKHRVLLLLEQMSIKGNTLAFVRNFLENRR FT IRVKIDGIVSDPINTENGLPQGSVVSVILFLVSINSVTEVIERPVKGCLFA FT DDLTLVCSGNSIKPTQTLLQNTLDKLSEWCRHTGFKFSDTKTEFIVLAKRK FT KKETVSLTINNKQIKEVRHLKILGLIFDQKLNWVDHIKKLKSDCYNRLNVI FT KILSGSSWGSDSTCIKNTYKALIRQKIDYGSIIYDSASSNILKTLESIHNT FT SLRLSLGAFRTSPINSILVEAEEMPLAFRRKELCLSYAINHITNNKDSFIL FT KDPVPSDIEFRLQPTLSSPLRLRIKKYLAEINLDFPPVFPRKHHYYPPWHQ FT SNFTINMEIAQNNPRDTTSDYIYKDLFYEIKNNHLDFKMYYTDGSVCEGRS FT GCAIVSENYSCKNRLQDYTSILTCEAVAILECLNIIKTNTNHKKYIVFTDS FT MSTLMALSNNHNKNPLIILIKEILTGLIDNNYLIKLIWIPSHQGISGNERA FT DTEAKRATKLAQCNYLTSNIQDLKKHIKWKIKTLWNSYWVKDNKSALYDIR FT KSVFEKRNFQINNRKDQVVLNRLRIGHCNITHIHLITKEEIKKCSRCDSNI FT SIKHLLTECPMFNAERHTSALPMCLFDCLNNNVKCTLKFLKIIDYHRLI" XX SQ Sequence 5377 BP; 2002 A; 1006 C; 874 G; 1495 T; 0 other; cagttgcctt gcggtccccg tacggttcac acgtatcggt cgacgcggcg cggttaacga 60 gtaattacga atataatatc actaccgagt gttctgtttt aacaccgttt gtcaagttag 120 gggtactcct ggcaacggga gccccttctg acactatgga taaacctacc cccaccgtaa 180 gttatgccaa tgtcacatct aaaaccagct acccgaagaa agatcaagct attattataa 240 actcaatcga cggaatcatg attcgagact acttgttagc cctttccgca gtgaccaaat 300 caacgtcaat ccgcttcatt tctcgcattt ccaactcgag aatctgcgtg tacttggact 360 caaaacaaac agctgacgaa ttggttgacg tcattaaaat ccttaaagtg aaagaccacg 420 tattggagat tcggccgctg ataacaagaa acaaacgtat cgttttatcc aacgtttgcc 480 ccataattcc gcacgatatc ttagaagaaa aatttaatga aatgggtatt aaagtattgt 540 ctcccataac attcatgaaa gtgggcatcc cggaccctgg tttttcacac attatgagct 600 ttaggagaca agtatacgtt tcccctgagg atgaaaaaag attaccagaa tcgatgcaaa 660 ttaactatga tgaaaccaat tattggattt atattacaaa cgaaacaatg aaatgctttg 720 tatgtaatgg aatgggacac ttggcaaaat actgtccgca aagcagtcaa caggttcttg 780 gtaccaaagg ccctaacaac ctcacgcaag acaattcaac tacttccaat gacactacag 840 tcgaatccaa caagctaccg tcacagtctc tcaattctac aattaatctt gaatctgaaa 900 taaaccaaat ctacccccct accgaaattt ccaaaccgca agtaccaaca aacttactac 960 cacacaataa atttgacata tacaaaggtg ttaagcggat tcatccatct acctcttcag 1020 aaccatcaca accggaaata aatcaagctg aacctacttt agatgacagt aatattatag 1080 tcagtgatta cacggatgaa accgacagca attggtctct ggacgaagaa ctaaagaaga 1140 cccctaagtc actcaagaag aagaagaaaa ctctcgatga cagaagtgaa gaacaagttt 1200 ggaaggatat taaaacagag ctggatcagt cagaaacaca acctcacttc ccactcacct 1260 tagaccagtt tattagtttt ttagataatt cccgtagcaa acaaaatctt caagaactta 1320 tcaaagacta tacggatgat atcccggatc taattgatat gtgcacgttt ctccgcacta 1380 aaatgaattt aaataggaac ctcaaaaata gatgtacaag attagttaag aaactagaac 1440 ttatcaaact gttagaaacc agtaacaagc catagtctct tttattacaa tttgcttaaa 1500 aataaggtaa gtgatttagt tatttgacag aatgaaaatt ttacagtgga atgttaacgg 1560 ttttcataga cgattagaga atatacaacg tatattaaat gatattgacc ctctttgtat 1620 ttgtatccaa gaaactaatt ttaaaaataa ctactgtgct aagttgaaga attatattca 1680 ttttttcaaa aatcgatcaa atagcagtat agctagtggt ggagtagcta cttatgtaaa 1740 atcacacaca ctgcccaaag aggtcgatat tattagccca ctagaagtga tagcaataag 1800 aatagaatat cctttggcta ttacaatatg taacgtctat ttgcccaata gctgcccctt 1860 aagtgaaaat gacctgaata acctaaaaaa tcagcttcca acaccttttg tattattggg 1920 tgactttaat agccaccatt tttattgggg ttcagatcac tgtgatctga ggggtaaaat 1980 tatatctgac tggttagatg tagacgaaaa tatgattctc cttaatacta atcagcccac 2040 acactttaat aatagcaatg gtcacaccag cagtatcgat ttagcctttg cttcacgaag 2100 tatatcaact tactttgact ggtatgcatt agacgatcta tacgacagtg atcactaccc 2160 tattatgatt acattaaaga tagatttaag ccccgctaac gaaaacgccc cgcaacactt 2220 caaatatcag atggcagact ggcctaactt tcgttctgat atagaaatta aaattaacca 2280 actaagtaaa gttacacccg aatctttgac cagaattaat gacttagttg atgaatttaa 2340 tcaaatactt ctcggttcag cggaaaaaca tatacctaaa acaacaggga aaattaaaaa 2400 gaaacaagtg ccatggtgga acgacgagtg tgataaatca cttaaagaat ctaaaagagc 2460 atatagactt tataaaaaag aaaaaagaaa agccgaatat taccaaaaaa gtccaaatcg 2520 tttggacgat ctccaaaatc ttcaagattc acttaagata gattacaaaa gaaaaagagc 2580 caagagattc aatgctagaa gaatatttaa agaaagcaga aagaaaagtt gggaaagttt 2640 tatcacttcc attaatagtt acactccgcc agggattgta tggcaaaaaa taaacgctat 2700 taatggacaa aaaagcaaaa aacaaacaat agtcttaaaa aacaatgata gacagttaag 2760 tggtgatcca aaaaataact cggaattatt agcgtctgct tttgctgcaa actctagctc 2820 taacaattac agccaagaat ttctatcttt taaaaaccaa atagaagaga ataataatgt 2880 attcgatttt ttggataata gtaactctat caatcaaaat attacaattc atgaactgta 2940 tttctattta caccaattaa aacactctag ccctggacca gacaacatat ctaacacctt 3000 aatcaaaaac cttccactcg tggctctaga acgactcctt gacatttaca acttcatttg 3060 gctcaatcaa gtctttcctg ataagtggcg agaagccatt gttataccga ttccgaaacc 3120 ggggaaatta ccaaccttaa cttccagtta cagacctata tcgctcactt gcaatttgtg 3180 taaattactc gaaaaaatcg taagtaatag attgagatgg tatttagaac agaatcaact 3240 aatatcagca gttcaatttg gttttagaca atataagtca acattagatc atcttgcgta 3300 tttggaaaac gaaatattaa catcctttgc aattaaaaat aaggttgtgg cggtttccct 3360 ggatttggaa aaggcttatg aaatggtgtg gaaacacaga gtcttgttac tcctggaaca 3420 gatgagtatt aaaggtaaca ccctagcctt cgttagaaat tttttagaaa acaggaggat 3480 tagggttaag atagatggca tcgtttcaga tcctatcaat accgagaacg gtctaccgca 3540 gggatctgtg gtgagcgtaa tacttttttt agtttcaatt aatagcgtaa cagaggttat 3600 agaaagacca gtgaagggat gtttatttgc ggatgactta acccttgtat gcagtgggaa 3660 cagcattaaa ccaacgcaaa cactattgca aaatacttta gataaactaa gtgaatggtg 3720 tcgacatact ggatttaaat tttcagatac gaaaactgaa tttatcgttt tagcaaaaag 3780 gaaaaagaaa gaaacagtaa gcttaacaat taataataaa caaattaaag aggtccgtca 3840 tctaaaaata ttaggtttaa tctttgatca aaaactgaac tgggtcgacc acataaaaaa 3900 actaaaatca gattgttata atagactcaa cgtcatcaaa attctatcag gaagtagttg 3960 gggctcagat agcacatgca ttaagaatac atacaaagca ttgataagac aaaaaattga 4020 ctatggctct attatttatg actctgcatc ttccaatatt ttaaaaaccc tagaaagtat 4080 acacaacact agtttaagat taagtctagg agccttcaga acaagcccaa taaacagcat 4140 tctagttgaa gcagaggaaa tgccactagc ttttaggaga aaagagttat gcctgtcata 4200 tgccattaac catataacaa acaacaaaga ctcatttata ctaaaggacc ctgtaccatc 4260 ggatatagaa tttaggttac aaccaaccct ttcatctcct ttgaggttaa gaattaaaaa 4320 atacctcgca gaaataaatt tagattttcc cccggtgttt ccacgtaaac accactacta 4380 tcctccgtgg caccaatcaa atttcacaat aaatatggaa atagctcaaa ataatccccg 4440 agacaccaca tcagattata tctacaaaga tcttttctac gaaattaaaa ataatcactt 4500 agacttcaag atgtactaca cggacggctc agtctgtgaa ggtagatctg gttgcgctat 4560 tgtatctgaa aattatagtt gcaagaatag actccaagac tatacctcta tattgacatg 4620 cgaagcagtt gctatactag aatgcttaaa cattattaag acaaacacaa atcacaaaaa 4680 atatatagtt ttcactgatt caatgtctac cctcatggcc ctttcaaata atcacaacaa 4740 aaaccccctt attattctga tcaaggaaat tctaaccggg cttatagata acaattattt 4800 aataaagcta atctggattc ctagtcacca aggcatatct ggaaatgaaa gagctgacac 4860 agaagctaag agagccacaa agttagccca atgtaattac cttacttcta atatacagga 4920 tcttaaaaaa cacattaaat ggaagattaa aacactctgg aatagctatt gggtaaaaga 4980 taataaaagt gctttatatg atataagaaa atccgttttt gaaaaaagga attttcagat 5040 taataataga aaagatcaag tagttttaaa tagattaaga ataggtcact gtaacattac 5100 tcacatccat cttattacaa aagaagaaat aaagaaatgt agtcgatgtg atagcaatat 5160 atctattaag catttactaa ctgagtgccc aatgtttaat gcagaaagac atacttctgc 5220 cttgcctatg tgtttgtttg actgtttaaa taataatgta aaatgtactt taaaattcct 5280 gaaaattata gattaccata gattaattta actataagct tatagtggtc gttaatgacc 5340 attgttgtta aaaagcgacc ttaaataaat aaataaa 5377 // ID BEL-132_AA-I repbase; DNA; INV; 6461 BP. XX AC supercont1.254; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-132_AA_; KW BEL-132_AA-LTR; BEL-132_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6461 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.254; Positions 1001151 994691. XX CC 'CTAGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 82..6216 FT /product="BEL-132_AA-I_1p" FT /translation="MPRTTRVALKANAEMPQPTCEKCKNPYHVAEMVACER FT CHRWYHCTCVEVAVGFNGRWTCKGCIFTVTLSDASISGKSGSTSRSTRVQL FT QLMRLEEEKKAHEKLIQEQLAAKAALDKKYLDEKYVLLIAEAEDEEAGSRR FT SHRSRKSRDSHVQEWIQGLDEAAKGHPELINVDDIFPPVSLSTGMPMPSGG FT MLYKYTGAVRKIVSDPLLNDRTTVGDKPPPSRPAIKWVADNVGIEGIGSAV FT PIRSSTPIRSIGTVQSAISTQPLYIQPVLSQPVYTQSVFPQPTQAASILPK FT LVPRTPLPTFLFPEPISKPPPRVSFPPVSEPPLPPSSGRPPVDMNIESFSS FT NSTQQQANGPRSVVSTSSQPPPVQSSQEPPVRTTEQQSNVPLSSTQASFHH FT PPFQQACTQSHPLRSMTKNHYDEQSQPPSVLRASISYTPSPPLRLPNPEEL FT VPQSSVHQPILPNQSLPHQAEPYITQQQQMPNQQAMWGQFQQLSARQVVPR FT ELPSFTGNPDEWPLFVSSFRNSTAMCGYSHAENLMRLQRCLKGRALEAVRS FT NLLLPSSVPKIMETLETLFGSPERLVQSLLNKVRSVPTPKAERLETLVNFG FT LVVQNLVGHLQAANQLAHLTNPTLLQELVDKLPLHLRLDWALFKRNAGPVD FT LGTFCAYMSVITSAASDVAHFTDLEGSRACGSEKQRKEKAFINAHVSPESR FT KPEQQVKKVDNKERPCYVCKSVQHRIKDCNKFKSLTVGDRLKAVETHQLCA FT SCLVPHGKWSCKSTRICGVGDCDKKHHPALHPSHTAVTKPESAGVKSKSDV FT VVNIHHPMQAATIFRIIPVILYGKEAKISTFAFLDEGSSSTLIDREVADQL FT NLGGESQSLCLTWTGKVSRTEDSRLVGLRISGGESNERFHLDGVSTVRQLD FT LPVQTLQYGELSRRYPHLAGLPVKDYAGAVPRILIGLDNIKLSLPLKVREG FT RINGPVAAKTRLGWTIFGSAGTAKIDSLSPVLHICKSSDESNLHELVKSYF FT ATENLGVSVTSGPEAEEDRRAKEILQQTTVKRPDGHFETGLLWRNDVVELP FT SSYNMAERRLICLERKLRKHPDLRASLEQQISEYQSKGYAHKATPQELEES FT DPHRTWYLPLGVVTNPRKPGKVRIIWDAAAKVNGVSLNDMLLKGPDLLRSL FT PAVLCRFRQRQVAIAGDIREMYHQLKIRKEDRQAQRFLYRSDPLKKPEVFV FT MDVATFGSTCSPCSANFVKNYNAEEWKEELPEAAEAVVENHYVDDYLDSRD FT TEEEMIKLASKVRRVQDAAGFELRNWRSNSEKVLQTLGEGATDSIKDFSFD FT KESQVERVLGMAWLPEEDVFVYSVKLPDHTGTVTKRSILRFVMSFFDPLGL FT ISNLLVHGKVIIQDLWRAKVGWDEDIPEEVFEDWKRWIEQLSQLSGTRIPR FT CYFPGYEPDSFKSLQLHVFADASETAYACVAYFRIIDRGQPRCALIASKAK FT VAPLKPLSIPKLELQAAIIGSRLAKSIAEYHTIPVSRRFFWCDSTNVISWI FT NSDARRYRQYVAVRIGEILEDAEPEEWRWLSTKINVADEATKWGKGPNIHP FT DSRWVTGPEFLYRPENEWVQRREPLPNTEEELRTVYFHRIALIQPFIKYDR FT FSKWERMLRSTAYVYSFKNRLREHAKPDAAKTGVKMKQEDLVQAERTLWRL FT AQAEEYADEIAVLQKAKETSGAEQLELEKSSTIRSLSPFLDEFGVVRMQGR FT TEASPLASYDSKFPVILPKKHRVTELLVDWYHRRFGHHNSETVVNEIRQRY FT HISSLRTVVRRVAKKYQWCIVYKSRPVVPRMAPLPEVRVTPFVRPFSLVGI FT DYFGPYAIKIGRSQVKRWVALFTCLVIRAVHLEVVTSLSTESCKLALRRFI FT ARRGAPTKIYTDHGTNFVGASRELATQLAAMHKELAETFTDTNTRMYFIPP FT SSPHMGGAWERMVSAVKVAMTSINNSRTPSEEVLQTVLCDAESMVNSRPLT FT YVPLETSDQEALTPNHFILLSSSGVKQPEKAPATEGEALRNGWNLCRYVLD FT QFWARWIREYLPDLTRRTK" XX SQ Sequence 6461 BP; 1703 A; 1557 C; 1787 G; 1414 T; 0 other; ttctctaaag aactctactc gaagtgagga agagttgccg aaggtggata gtagattgcg 60 gaggtcagtt agaagaggac gatgcccaga actacgcgtg ttgcgttgaa agccaatgca 120 gagatgccac agcccacttg tgagaagtgt aaaaacccct atcacgtagc agagatggta 180 gcttgcgaac gatgtcatcg ttggtaccat tgtacgtgtg tggaggtggc tgtggggttc 240 aacggaaggt ggacgtgtaa ggggtgtatc ttcacggtga ccctaagcga tgcttcgata 300 tccggcaaga gcggcagcac ctctcgatcg acacgggttc agctgcagct gatgcggcta 360 gaggaggaaa agaaggccca tgagaaacta atccaagagc agttggcagc gaaagccgcc 420 ctggacaaga agtacctcga cgaaaagtat gtgctgctca tcgctgaagc agaggatgaa 480 gaggcgggaa gtcgtcgaag tcatcgcagt cgaaagagtc gagatagcca tgttcaggag 540 tggattcagg gactcgacga agctgctaaa ggccacccag aactgataaa cgtcgacgac 600 atttttccac cggtttccct tagcaccgga atgcccatgc ctagcggagg aatgctgtat 660 aagtatacag gtgctgtgag gaaaatcgta tcagatccac tactgaacga tagaacgacc 720 gttggagaca aaccacctcc atcgagacct gctatcaagt gggtggcaga caacgttgga 780 atagagggta ttggttcagc agttccaatt agatcatcga ctccaattag atcgattgga 840 acggtacaaa gcgcgattag cactcaaccg ctgtacatac agccagtgtt atcacaaccc 900 gtgtatacac aatcggtatt tccacaaccc acacaagcag cttctatact accgaagcta 960 gtaccacgaa ctcctctacc aacattcctg ttcccggaac caatctcaaa accaccacca 1020 cgtgtttcat tcccgcctgt gagcgagcca ccgttaccac cttcaagtgg tagaccgcct 1080 gtggatatga atatcgaatc attttcatcg aattcaacac aacaacaagc aaatggtcct 1140 cgtagtgtgg tgtcgacgtc gtcgcagcct ccaccagtgc agtcgtcgca agagcctccg 1200 gtacgcacaa ccgaacagca atcgaatgtg ccactatcat caacgcaggc gtcgttccac 1260 catccaccgt ttcaacaggc gtgcacacag tcgcacccac tgagatcgat gacgaaaaat 1320 cattatgatg agcaatcgca gcctccatct gtgttgcgtg catcaatatc ttacaccccc 1380 tcgcctccac tgcgtcttcc caacccggag gagctagtac cccaatcatc ggtacatcag 1440 ccaatattac caaatcaatc gctaccgcat caggcggagc catatatcac ccagcagcag 1500 cagatgccaa atcagcaggc gatgtgggga caattccagc agttgtccgc cagacaggtc 1560 gtgcctaggg agcttccaag tttcactggg aatcccgatg aatggccact ttttgtaagc 1620 agttttcgga attctactgc tatgtgtggc tattcgcacg cagagaattt gatgagactg 1680 caaagatgcc taaaaggcag agcgttagaa gctgttcgga gcaaccttct gctaccttct 1740 tcggtcccga aaattatgga gactttagag acattgttcg ggagcccaga gcgattggtc 1800 cagtcgctgc tcaacaaggt acgcagtgta cccactccaa aagcggaacg gctcgaaacc 1860 ctcgtgaatt tcggcctcgt cgttcagaac ctagttgggc atctgcaggc cgcgaatcag 1920 ctagcccact tgactaatcc tacccttctg caagagttag tggataagct gccactacat 1980 ctccggttgg actgggcgtt gttcaagagg aatgcggggc cggtggactt gggaacgttc 2040 tgcgcctata tgagtgtcat tacatcggcc gcaagtgatg tggcacattt cacggacttg 2100 gaaggatcca gagcatgcgg aagcgagaag cagcgtaaag agaaggcgtt catcaatgca 2160 cacgtgtccc cggagtcgag aaagcccgag cagcaggtaa aaaaggtgga caacaaggag 2220 cgaccatgct atgtttgcaa aagcgtgcag caccggatca aggactgcaa caaatttaaa 2280 tcgttaacgg tgggggatcg tctgaaagcc gtcgaaacac accagttgtg tgcgagttgc 2340 ttggtgcctc atgggaaatg gtcttgcaag tcaacgcgca tctgtggggt cggagattgc 2400 gacaagaagc atcatccagc tctacatcca agtcatacag ccgtcactaa gccggaaagc 2460 gctggtgtga agtcgaaatc ggacgtcgtc gtcaacattc accatccaat gcaagctgct 2520 acgatattcc gtataatccc ggtgattttg tacggcaagg aagcaaaaat atccacgttt 2580 gccttccttg acgaaggttc atcgtcgaca ctcattgacc gagaggtggc agaccaactg 2640 aacctaggag gagaatcgca gtcgctgtgc ttgacttgga cggggaaggt atcccgtacc 2700 gaagactcca ggctagtggg tctcaggatt tcaggtggag aaagtaacga acgtttccat 2760 ttggacggtg taagcactgt gcggcaactg gatcttcccg tacagacact acagtatggg 2820 gaactgtccc gtcgttatcc acatttggcc ggactgccgg tgaaggatta cgcgggagct 2880 gtgccacgaa tcctaattgg gctggacaac atcaagctgt cgttgcctct caaggtacgt 2940 gaaggaagga ttaacggtcc ggtggccgcg aaaacaagac tcggatggac gatcttcgga 3000 agcgccggta cagcgaaaat agactcattg tctccagttc tgcacatctg caagagctcc 3060 gatgaaagta atctacacga gctggtcaag agttactttg caacggagaa cctcggcgtg 3120 tccgtgactt ctgggccaga ggcagaagaa gatcgacgag cgaaggagat tctacagcaa 3180 acgactgtta agcgtcccga cggtcacttt gagaccggat tgctgtggcg gaacgatgtt 3240 gtagaacttc cgtcaagcta caatatggcc gagcgacgtt taatatgcct cgagcgaaag 3300 ctgcggaagc atccggattt acgggcaagt ttagaacagc agatttccga gtatcagagt 3360 aaaggctatg ctcataaagc aacaccgcag gagctggaag aaagcgaccc tcaccgaact 3420 tggtaccttc ctctcggagt ggtcaccaac ccacgcaagc ccggaaaggt tcgtatcata 3480 tgggacgcag cggccaaggt gaatggagtg tcgctgaacg acatgctgtt gaagggaccg 3540 gaccttctta ggtcgctgcc agcagtattg tgccgctttc gtcaacgtca ggtggcaatc 3600 gcaggtgaca ttcgcgagat gtatcaccag ctgaagataa ggaaggaaga tcgacaggcc 3660 cagcggtttt tgtaccgtag tgacccattg aaaaagcctg aagtcttcgt catggatgtg 3720 gctacgtttg ggtcgacttg ctcaccctgt tcggcgaatt tcgtaaagaa ttataacgca 3780 gaggagtgga aagaagagct tccagaggca gcggaggcgg tagtagaaaa ccactacgtg 3840 gatgactacc tagatagtcg cgatacggag gaggaaatga taaaactagc ttcgaaagtg 3900 cgaagggtgc aagatgcggc aggtttcgag ctgcgcaatt ggcggtcaaa ctcggagaag 3960 gtattgcaga ctctggggga aggtgctacg gattcgataa aggatttcag cttcgacaag 4020 gaaagccagg tggagcgtgt tcttggaatg gcatggttac cggaagaaga cgtttttgta 4080 tattcggtaa agctaccgga ccacaccggc accgtcacga agcgaagtat tctgcggttc 4140 gtcatgagct ttttcgatcc gcttggacta atttcgaacc tgctcgtcca cggcaaggtg 4200 attatacaag acctttggag ggctaaagtt ggatgggacg aggatatccc agaagaagtg 4260 ttcgaagatt ggaaacggtg gatcgagcag ctctctcagt taagcggcac tcgaattcca 4320 cgctgctatt ttcccggata cgagccggac agctttaaat cgttgcagct ccatgtattc 4380 gcggatgcaa gtgaaactgc ctacgcttgt gtcgcctatt tccggatcat tgatcgtggt 4440 cagccacgat gtgctttgat agcctcgaaa gccaaagtcg ctccgctgaa gccgttgtcg 4500 atccctaaat tggaactgca ggcggcgatt attggaagtc gattagcaaa gtcgattgct 4560 gagtaccaca caattcccgt gagtcgcagg tttttctggt gcgactctac aaacgtgatt 4620 tcctggatca actcggacgc caggaggtat cgccagtatg tagcggtacg catcggagag 4680 atccttgaag atgcagaacc agaagaatgg cgttggttat ccaccaaaat taacgtggcg 4740 gacgaggcca cgaaatgggg aaagggcccc aatattcacc cggacagtcg ttgggttacc 4800 ggaccggaat tcctataccg gccggagaat gagtgggtac agcggagaga gcctttgccg 4860 aacactgaag aagaactgcg taccgtttac ttccatcgaa ttgcactgat tcagccattc 4920 atcaagtatg accgattctc gaagtgggaa cgaatgctcc gatctactgc gtacgtgtac 4980 tctttcaaga accgtctacg ggagcatgct aaaccagatg ctgctaagac tggagtaaag 5040 atgaagcagg aggacctagt gcaggctgag agaacgttgt ggcgactcgc tcaagcagag 5100 gagtacgccg atgaaattgc ggtcctgcag aaagcgaagg aaacttcggg agcagagcag 5160 cttgaattgg agaaaagcag cacaattaga agtctgtcgc cattcctgga tgaatttgga 5220 gtggtgcgca tgcaaggtcg gacggaagca tctccgttgg cttcctacga ttcgaaattt 5280 cccgttatct tgccgaagaa acaccgtgta accgaacttc tggtggattg gtaccaccga 5340 cgatttggtc atcataacag cgaaactgtg gttaacgaaa ttcgtcagcg ataccacatt 5400 tcatctctac ggacggtggt gcgaagagtg gcgaagaagt accagtggtg tattgtgtat 5460 aagtctcgtc cggtggtacc aaggatggca cctcttccag aagttcgcgt aactccattc 5520 gtccggccat tctcgctggt cggcatcgac tattttgggc catacgctat caagatcggc 5580 cgcagtcaag tgaagcggtg ggtggcactc ttcacgtgcc tagtcataag agcagttcac 5640 ttggaggtcg tcacatcgct ctctacagaa tcgtgtaagt tggccctgcg gcgattcatc 5700 gcgcgccggg gtgcaccgac gaagatctac acggatcatg gaaccaattt tgttggtgcc 5760 agccgggagt tggccactca gctggctgcg atgcataaag agttggcgga gaccttcacg 5820 gataccaaca cgcgaatgta cttcattccg ccgtcctccc cgcacatggg cggtgcgtgg 5880 gaaaggatgg taagtgcggt taaagtagca atgacgtcga tcaacaattc ccgtacgcca 5940 tcagaagaag ttttgcagac agttctgtgt gatgcggagt cgatggtcaa ctccagacca 6000 ttgacgtacg tgccgttgga gacatcggac caagaggcct tgactccaaa tcatttcatc 6060 ttactcagtt ctagtggagt caagcagccg gagaaagctc cggccacgga aggtgaggct 6120 ctaaggaacg gatggaacct gtgtcgctac gttcttgatc agttttgggc aaggtggatc 6180 cgggaatatc ttccggacct gactcggcgt accaagtgac atggagaggt gaaacccatc 6240 gaggaaggag atgtagtttt catcgttggt gaagcaaatc ggaaccaatg gatccgaggc 6300 agagttctaa aggtcctccc cggaaaggat ggtcgcatac ggttagtaga cgtacaaact 6360 acagcagggg ttctacgccg tcccgtggcg aagatcgccg tgctcgacgt gcttccggct 6420 ggtaatcctg ctggatcgga gcagaattac ggaaaggggg a 6461 // ID PFH76 repbase; DNA; INV; 124 BP. XX AC U11817; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Fasciola hepatica clone PFH76 genomic repeat sequence. XX KW PFH76; Repetitive element. XX OS Fasciola hepatica OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Echinostomida; Echinostomata; Echinostomatoidea; Fasciolidae; OC Fasciola. XX RN [1] RP 1-124 RA Kaplan M.R., Dame B.J., Reddy G.R. and Courtney H.C.; RT "A repetitive DNA probe for the sensitive detection of Fasciola RT hepatica infected snails."; RL Unpublished. XX RN [2] RP 1-124 RA Kaplan M.R.; RT "PFH76."; RL Direct Submission to Genbank (05-JUL-1994)Kaplan R. M., RL University of Florida, Infectious Diseases, 471 Mowry Road, RL Gainesville, FL 32611-0880, USA. XX DR GenBank; U11817; Positions 1 124. XX SQ Sequence 124 BP; 28 A; 34 C; 23 G; 39 T; 0 other; gatcaattca cctatttccg ctagtcctac tggaatttct cttgtaccaa tgtgtttctc 60 aggccgtgat taccctattg acaaagaatc agcgtgcccg taggacgccg tttaagccca 120 cttt 124 // ID Gypsy-63_CQ-I repbase; DNA; INV; 4954 BP. XX AC AAWU01018550; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-63_CQ_; KW Gypsy-63_CQ-LTR; Gypsy-63_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4954 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 505-505 (2011). XX DR GenBank; AAWU01018550; Positions 13796 18749. XX CC Positions [2145-2621] - Integrase core CC 'CCCT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 350..1975 FT /product="Gypsy-63_CQ-I_1p" FT /translation="MKFPHARHLSNEEVDYELRLRNKEEETRQNLNSKRDI FT LRKLFQIDRNEKYNYPDPPYQWEDEMEFVKSKVDGLLLIAQGKIESETRSK FT LVHYYYRIQRAIAQTQEGQKLQKALLTPVSRLLQSVSSEGNFTQSEPDTDT FT DNKGKPNTNPNPNPNPNPNPNPNPNPNPNPNPNPNPNPDVNPNPTVEELWE FT VIKRLQTELEISREENRRATESDSSESVRVPRPVSPPRHPPRPRPGFESSV FT SSRNDNFRERYHRNHQRRIEHWKISFTGEPNSLSVEDFLFRLKQIAKREEV FT TDQKLLRDVHMVLDGVAANWFFTYADEVKTWEDFEKRIRLRFGNPNQDQGI FT RQALQNRKQQRNEPFAAFVDEIGRLNKMLSKPLSKRRQFEIIWDNMRSYYY FT TKLSTTTVVDLDHLKSLNRGIDAADSHLQQILAKQQQHVHHVDYEEEEVGA FT EEMEFVDAIQPHSGPSGFRAQGNRNRNFRTFNPPREPSQSFNTPQEPSNQQ FT PAARQQVFPNSSSSCDHCKQTGHNWRVCREPKSIFRRTRRNYQEGT" FT CDS 1596..3008 FT /product="Gypsy-63_CQ-I_2p" FT /translation="MQPILTYSKYWPSNSSMFITLTTKRKKLEQKRWSLST FT RFNHTPGRVGFELKGTETGTSERSTHHENQASRSTRRRNLRTSSQQLGNKF FT SQTHRLAAITVNKLDTTGESAGNRSQYSEEREEIIKKEHEVAHFGAEKTLH FT SLKRHYTWPGMSAQVKKFCRECFKCQTSKAPNLNTTPPMGAQKEFVEHPWQ FT FITLDFVGPLPVSGRGRHTCLLVVTDVFSKFVLVQPFREAKASSVVEFLKN FT MVFLLFGVPEIALTDNGTQFTSKLFRQLLEEFNVTHWLTPSYHPQVNNTER FT VNRVITTAIRATLKQHKNWADNLQEIACAIRSAVHDSTKHTPYFIVFGREM FT VSDGQEYRRMRDLPVAEEQKDGEEQKRRKQLLKDVRNQLVKAYERHRKTYN FT LRSNACCPTYTAGETVLKKTFELSDKAKGFSAKLAPKYEPAVVRRVVGKHC FT YELEDQKGKRLGVFYANHLKKFHLPK" XX SQ Sequence 4954 BP; 1543 A; 995 C; 1072 G; 1344 T; 0 other; aattggcgcc caacgttttg ggtcttcttt cattcttttt tctattgaga gttcattaga 60 acgagtttcg tgggttctta gagcaaatat tcctttattg ggtttctatt ggtggatcga 120 ttactccttc ctgtttttgg ttgttgtatc actatttatt tattcattta tttatttatt 180 ctatttattt atttatttat ttatttaatt atttatttat ttatttattt atttaattat 240 ttattgttag tttgataatt tatgtacacg ataaacttac gttattgagc acataacata 300 caattttgct catcacattt ttgatactgt gaagttaaaa agaaatacga tgaaattccc 360 acacgcgaga catttgtcaa atgaggaagt ggactacgaa ctcagactga ggaacaaaga 420 ggaagaaaca cgacagaatc ttaattcgaa gcgtgatata ttgcgaaaac tattccaaat 480 tgatcgaaat gagaaatata attatcctga tccaccttat caatgggaag acgaaatgga 540 atttgtgaaa tcgaaggtag atggactgtt gttgatcgca caagggaaga tcgaaagtga 600 aactcgatca aaattagttc attattacta ccggatacag agggcaatag cacaaacaca 660 agaaggacag aaattgcaga aagcactttt gacaccagtc tcgaggctac tccaatcggt 720 ctcttcggaa ggaaatttca ctcaaagtga accggatacg gatacagata ataaaggaaa 780 gccaaacaca aacccgaatc caaaccccaa tccaaatcca aatccaaacc ctaatcccaa 840 tcccaatccc aatcccaacc caaacccaaa tccaaatcca gacgtcaacc caaacccaac 900 cgtcgaggaa ttgtgggaag taatcaaacg tcttcagaca gagctggaaa tatctagaga 960 ggagaaccga agagcaacag aaagtgactc tagcgagtca gttcgggtac cgcgcccagt 1020 ttcccccccg agacatccgc caagaccaag accaggcttc gagtcatctg taagttccag 1080 gaacgacaac ttcagagaac gttaccaccg aaaccatcag agaaggattg aacattggaa 1140 aatttcgttt acaggagagc ccaattcact gtcggttgaa gattttctgt tcagactaaa 1200 acaaattgcg aaaagggaag aagtaactga tcaaaaactt ctacgggatg ttcacatggt 1260 acttgacgga gtagcagcaa actggttttt cacatacgcc gacgaagtca agacttggga 1320 agattttgaa aagcgaatca gactgcgatt tggcaaccct aatcaggatc aaggaattcg 1380 tcaagcacta cagaaccgga aacaacaaag gaatgagcct tttgcagctt ttgtagacga 1440 gattggacgt ttaaataaga tgctatctaa acctctgtcg aagagacggc aattcgaaat 1500 catttgggac aacatgcgat cgtattacta tacaaaactg tctactacta cggttgtgga 1560 cttggatcac ctgaaaagct tgaaccgagg aatcgatgca gccgattctc acctacagca 1620 aatactggcc aagcaacagc agcatgttca tcacgttgac tacgaagagg aagaagttgg 1680 agcagaagag atggagtttg tcgacgcgat tcaaccacac tccgggccga gtgggtttcg 1740 agctcaaggg aacagaaaca ggaacttccg aacgttcaac ccaccacgag aaccaagcca 1800 gtcgttcaac acgccgcagg aaccttcgaa ccagcagcca gcagctcggc aacaagtttt 1860 cccaaactca tcgtctagct gcgatcactg taaacaaact ggacacaact ggcgagtctg 1920 cagggaaccg aagtcaatat tcagaagaac gagaagaaat tatcaagaag gaacatgaag 1980 ttgcacactt tggagcagaa aagaccttgc acagcttgaa acgccattac acttggccag 2040 gtatgagcgc acaggtgaaa aagttctgcc gcgaatgttt caagtgtcaa acatcgaaag 2100 cgcccaactt gaacacaacg ccaccgatgg gagcacagaa agagtttgtg gaacatccct 2160 ggcaatttat aacactggac ttcgtgggac cacttccggt gtcaggtaga ggtcgacaca 2220 cgtgtctgtt ggttgtcacg gacgtattca gcaaatttgt gctggtgcaa ccgttcaggg 2280 aagcaaaagc cagctcggtg gtcgaatttc tcaagaatat ggtattctta ctctttggtg 2340 ttccggaaat agcgttaacg gacaacggaa cacaatttac ctcgaagcta ttccgacaac 2400 ttcttgaaga attcaacgta acccactggc tcaccccgtc gtaccaccca caggtgaata 2460 acacggaacg cgtgaatcgg gtgataacaa cggcgattcg agcaactttg aagcaacaca 2520 agaattgggc tgataatctg caggaaattg cttgcgcaat ccggagtgcg gttcacgact 2580 cgactaaaca cacaccgtac tttatcgtct ttggtcgaga gatggtatcg gatggtcaag 2640 aatatcgaag aatgcgtgat ttaccggttg cagaagaaca aaaggatgga gaagaacaaa 2700 aacgaagaaa acaacttttg aaagatgtca gaaatcaatt agtcaaagcc tacgaacgac 2760 acaggaagac atacaacctt cgctctaatg cctgctgccc cacctacacc gcaggtgaaa 2820 cagtgttgaa aaaaacgttt gaactttccg acaaagccaa agggttcagt gcgaagttag 2880 cgccaaaata cgagccagca gtagtcagaa gggttgttgg gaaacattgc tatgagcttg 2940 aagatcaaaa aggcaaacga ctaggagttt tctacgcgaa ccatcttaaa aagttccatt 3000 taccgaagta gacttttata cacgttttta cagctatgaa ctccctttta agggtgacca 3060 gatttgaagt ttcgaacaaa ataccaattg ggtaaagcat tcaggttaga cgtgaccttt 3120 ggagggttca tggtgaggat cagaagctaa cagagtactt cgtgaactcg tctggaagtt 3180 gtgaatgagt gggttgaagc gatggtgaag agagcggcaa ctgaactaag acgatcgatt 3240 gagcagtaga aatcgaagaa agggagctgt tcgttgagtt cgctagtatc agggctgagc 3300 atagacgatg acctgttaaa gtaagccact taattgtgtg atacgaatgt gagcgagtct 3360 ccgcgatgat tgtaactcct agtacttatg aatgagaacg gtgatgatct ttagttctcg 3420 aaacgctagt taagttgcac gtctcaacac cccaagtcgc tgaaactcat tcacaaacaa 3480 ctcccaaacg aagctcagtc ctcaggttgg caataacatt cggctttgcc ttacccaacc 3540 agccatgaat ctgtccataa aaggggggga agtggaagaa gaagacaccc cacacgccca 3600 caaggtggca acacttgtgg agatgtaaat tttgtacata gtagttagta attgtttacg 3660 ttagtcgcgc ctatggccct acgttaacct aatgttaatt tttgttgaat ttatgtgagt 3720 agtttgttta aaactcacaa tattgtttgt ttttcggcat aattgtatga attcgttttt 3780 ttggttgttt ttccgtgacg tcatcctcat ccatcgatag cgtcaagatt ttgtttggtt 3840 ataattatcc atcgctaaac caacatgttc taccaaccga gccggatcag ctaaaaatac 3900 agaacacaga gcattagtca aagtcattag gaagtaaaat taggaacgtt acttacaatt 3960 tttccgttgg ggaccacaac cagtagtaaa ttaaacaaaa tccaccgatt agtccttctg 4020 caatcaatta tgagtccaat ctgcatccca ggagcgtggt tgttttgatc gtcaccaggt 4080 gcgaagcatt tgttcttgca gccaaacacc tgggaaacga agacaaaaga tctgctttta 4140 gcgaaacata gactagatac ataaaattta gtcacacttc ccgcaaataa cacgaatttt 4200 gataatccgg ctccgcaaca cacgttttcg agtaaataca attttcaact aaatctgtct 4260 gacagttgtt tgttatggtt cctttttcgt tggtaaccgt gagtgaatga gttacctagg 4320 tggtgagtga cgacttgcga cctgagtgca tgcgacgagg ggtgaatgag tgtttgtaat 4380 tacatgtttt ctcacaattt ttttttacga agaattgttg atgaatttta ctatgctttt 4440 tatgtgtgta cgatacgttt tcgggattta tcctggagtg tttcggcaca tgtagagcgg 4500 tttttcgttg tgggaaattt tttcgtcttt gtgttctcgg aattcattct gaggtgaaga 4560 caatcaggac ccacaagtgt aataaagtgg aaagcagagc gttctgattt tttcggaagt 4620 tacttctgga gtaggacgtc tttgcaattc actagttgtt gtggaagtag gagttttgct 4680 tctctggaat tctgatcaaa tttcaggtga tagcgttttt attgtgattg ttattcttcg 4740 gaagttttat gtatgaatga gtttaccaac gaaataaagt ccaagttttg tagttccaca 4800 atttattttc ctcaaatatt tttgtagtct cagcaataat ttcaagttct ttaattttca 4860 ccatttctgc catagccaca gtagtttcaa tagtttgccc tacgaaaatt tggttgtttt 4920 acaaccaaat tttcgtaaat ttagtagggg atga 4954 // ID CR1-52_AAe repbase; DNA; INV; 5580 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-52_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5580 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1139-1139 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 2409..5492 FT /product="CR1-52_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEFANIIFLTFGQPAITGRPGPACGQGKGVFQITLNG FT KYTSALNNSAADEISDSSLSTFQQPQSTNQRESSEIAVYFQNVRGVRTKVD FT ELFLATCDCDYDVIILVETGLDECINSLQLFGTSYNVYRCDRCALNSRKSR FT FGGVLIAVAHQYISSMIDLRNGKTLEQICVSTVIKGQRINFLAVYIPPDRS FT QDVSSVEAHIASVQELCGGSSFNDTIIVCGDYNQPRLSWTRTIEGICYDSS FT TSLPPAAAALVDGFDFLNLMQANVEHNYLGRVLDLVYSSPDKDIKVTRAVS FT PLLPIDAHHPPLEVAVAVPCLRQSLTSHCETRRLNYRRIDFEALSEYLSNV FT DWTVMFRNIDDVDDISMHFCDVLCTWLNENLPLVRSPAKPPWGNSELRRLK FT RKRNTTQRLLRKHRSHANQLAFKLASDAYRQMNAVLYKSYVTRTQFGLRKD FT PRSFWSFINSKRKCSSIPSNVFLDDSESSSVQESCELFAKHFASVFSCHIT FT SQSEANEAASNIPVDAVEMDIFEISSDMIVAASKKLKNSYSPGPDGVPAII FT FRRCAGSLASPLSHIFNKSFEQKKFPAVWKQSLMFPVHKKGDKRNVRNYRG FT ITSLSAGSKLFEIIVSNAILNATKSYISPNQHGFMPGRSVTTNLIEFTSTC FT ISQIEGNAQVDAVYTDLKAAFDLIDHRILLSKLSRLGASERFTMWLESYLS FT NRVLRVKLESCVSGMFSSSSGVPQGSNLGPLLFVIFFNDVTLLLGSKCVLV FT YADDLKIYLVVKTREDCIRLQELLDTFVEWCRLNRLVVSVAKCLVITFHRS FT KCPIAFDYCIDGNVLERVDQVSDLGVVLDAKLTFNHHITSIITKATRQLGF FT ITKVSRDFTDPHCLKALYCALVRPILENAAVVWTPYHLSWIIRIERVQRRF FT IRIALRDLPWRDPQNLPPYGERCRLLGLDTLQRRRQFQQATFVANILTGEI FT DASALLSRIEFRAIGRQLRSSSLLMTRFHRTSYGFHEPLTSCVRTFSLVDN FT LFDFGERPSSFRRRLERSRMFY" FT CDS join(314..652,618..1127,1040..1639) FT /product="CR1-52_AAe_1p" FT /translation="RRLMSMEKNCAKCNQTISGIDYVACRGYCGCTFHMQC FT SGVSRALMNYFTTHRKNLFWMCNDCAALFENSHLRSITKEVDENSPLASLT FT EAIGNLQKEIKQLIKTSFIWHITGGSKPASSGISPAVRRWPQIETVRAAKR FT PRGPDFAQTVSECRTGSKQVGQNVISVPTTAKPETKFWLYLSRIRPDVTNE FT EMSAMVRANLELSQDPEVVKLVAKGADITNMSFISFKIGLDPALKPIALDP FT LSWPEGLMFREFEDYSVPKFRKPSITNLTPTLTKTPEVPMEPGLFRSKISE FT TIHYKPDSNVNENSRSSNGAINDNSRSFNFDVRESESHLKSLSHHSFLDVS FT PTPGCTAACIMEDPILPSTVVPLLPAINSRPGRVSGCGDGFFQAALSGKYN FT LSLCNTTSDALTVFSTQPASSPPSTPLMSNSTSKTLGRINSSIMEALKPSA FT AVEPFQPAITSRPTVAASHTNAGQKYNSHLKRSQMLII" XX SQ Sequence 5580 BP; 1574 A; 1194 C; 1153 G; 1659 T; 0 other; tctggcatcc ctgccgagta acttaattga acttttacta tcgatttttt tttcgccgtt 60 ttaatgtgtt aacagcgtgt ttaatttttt tattctttga ctactaattg atattgtgaa 120 gtccgtatac ggtagtcatt tagttgttgt gatttctgta ctgatttgat aattattcgc 180 tagttttgtg atcaccgttt gttccgtcca tcgtaataca tacgtgcgca tcaaaacaat 240 taccgtataa attcgagcaa attttgtcac cgttgtgctg ctgcttacgt tgtaacgttt 300 gttgcctact tgaaggcgat taatgagcat ggagaaaaat tgcgccaagt gcaaccaaac 360 cataagtggt attgattacg tcgcttgtcg tggatactgt ggttgcacat tccacatgca 420 atgttcaggt gtctcacgtg cacttatgaa ctatttcacg acgcaccgca aaaatctttt 480 ctggatgtgc aatgattgtg ccgctttatt cgagaactcg catcttcgct caattacgaa 540 ggaggttgac gaaaattcac cgctggcgtc actcacggaa gcaatcggaa atctacaaaa 600 agaaataaaa cagttaatca aaaccagctt catctggcat atcaccggcg gttaggcgtt 660 ggccacagat tgaaactgtt cgagcagcca aacggcctcg tgggccagat tttgcacaaa 720 cggtatcaga gtgtcgtact ggctctaaac aagttgggca gaatgtaatt tctgtcccta 780 ctactgcaaa accagagact aagttctggc tctacttgtc tcgcattcgt cctgatgtta 840 ctaatgagga aatgtcagca atggtacgag ccaacctgga actgtcccaa gatccggaag 900 tggtgaaatt ggtggccaaa ggagcagata tcaccaatat gagcttcatt tccttcaaaa 960 ttggcctgga tccagctttg aagccgatcg ctctagatcc attatcctgg ccggaaggtc 1020 tcatgtttcg tgagtttgag gattattccg ttccaaaatt tcggaaacca tccattacaa 1080 acctgactcc aacgttaacg aaaactccag aagttccaat ggagccataa atgacaactc 1140 aaggagtttt aatttcgacg tgcgtgaaag cgaatctcat ctgaaaagcc tgtctcatca 1200 ttctttcctc gatgtttcac cgacaccggg atgcactgcg gcatgtatta tggaagaccc 1260 tatcctaccc agcacggtcg tgcctttgct gcctgcgatc aacagtcgtc ccggacgcgt 1320 gtctgggtgt ggagatgggt tcttccaagc cgctctctca ggcaagtaca atctcagttt 1380 gtgcaataca acttctgatg ctttgaccgt ttttagcacg caaccggcat catcaccgcc 1440 atcaacacca ttaatgtcca actctacctc taaaacactg ggacgcatca actctagtat 1500 aatggaagcc ctgaagccct ccgccgcagt cgagccattc cagccagcga tcaccagccg 1560 tcccacagtg gccgcatccc atacaaatgc tggccaaaag tacaattctc acttaaaacg 1620 ttcacagatg cttattattt gacaaaacat ggtttttaga ggtaatttga tacaaatagc 1680 cgctgaacca gaaaatcgat atttccggta cattagtgaa ttagtttaaa atagcacatt 1740 ttttcaattc aataaatgaa atccgtattc aggttacaat taaatgaagc gtcattattt 1800 cgatgctaat ttgaagttca atatcgtggg caagctcgag gtggttttta aatctattgg 1860 ggagcatata atggtaaaaa atgcgatatt tataacaagt atgatggtct aatgggctaa 1920 ttcgtagtgt aactaaaaaa tatttgttta tatcttggcg caaaaaagcg caagtgaagc 1980 catagtacat cgaatactat ggaatatttg ccacacattg aggggtaaat gatcgtttta 2040 gagcgtttcc cagatatctt gcactacttt tgaaaaaatg ccttattttt gaaggagctc 2100 tatattatac acttcttgac attctcaaat ggtaaacatc taaaatatgg ttcgaaatat 2160 cttcaaaaca aagcctaagg tatattcttt tgaatgagac cggttgagaa tgatttggtt 2220 atgatttagt acagaaactg caatttctac agaacgaact tcacgaaatt tgaaaaaatt 2280 aaaagggtaa atttcagcta acatctctac aggtcacatg ctgtgaaaaa tcgaaatata 2340 taccgatcga tctgaaactt tgggaaattg ttattcaata ctgaagaaat caagaaaaaa 2400 tattcgagat ggaatttgcc aacatcattt ttttgacttt tggccagcca gcgatcaccg 2460 gccgtcccgg tcctgcgtgc ggacaaggca aaggggtctt ccaaatcaca cttaacggca 2520 agtacacatc tgcattgaac aattcagctg ctgatgagat ctccgattcc agcttgtcca 2580 cgtttcaaca gccgcaatca acaaatcagc gggaatctag cgaaattgca gtatacttcc 2640 aaaacgtacg tggagtaagg acgaaagtcg atgaattgtt tcttgctacc tgtgactgcg 2700 attatgatgt cataattttg gtggagactg gccttgatga atgcatcaat tcattgcagc 2760 ttttcggtac ttcttacaat gtgtatcgtt gtgatcgatg tgctttgaac agtcgtaaat 2820 cgcgttttgg gggtgttctt attgctgttg ctcatcaata catcagctcg atgatcgatt 2880 taagaaatgg caagacattg gagcaaatat gcgtatcgac cgtcatcaaa gggcaaagaa 2940 tcaatttttt ggctgtctac atccctcccg acagaagcca agatgtttcg tcagtagaag 3000 cgcatatcgc ttctgtacaa gagctttgcg gcggatcgtc tttcaacgat acaataattg 3060 tatgtgggga ttacaatcaa ccacgcttat cctggacaag aacaatagaa ggaatatgtt 3120 acgatagttc cacttcgctg cctccagcgg ctgctgctct ggtcgatggc ttcgatttcc 3180 ttaatctaat gcaagcaaat gtagaacata attatcttgg tagagtgctt gacctcgtgt 3240 actcctcgcc tgataaagat atcaaggtaa caagagcggt ttctcctttg ttgcccattg 3300 atgcacatca tcctccactg gaagttgcgg ttgctgttcc gtgtttgaga cagtcactca 3360 cctctcattg tgaaactcga cgtttaaact atcgtcgaat tgactttgaa gctttatcag 3420 agtatttatc gaacgttgat tggactgtta tgttccgtaa tattgatgat gtggatgaca 3480 tctccatgca tttttgtgat gtcttgtgca catggctcaa tgagaacctc cctctagtaa 3540 gatctccagc aaaaccacct tggggcaact cggagttacg cagattgaaa cgtaaacgga 3600 atacaacaca aagattattg cgaaagcatc gatctcacgc taaccaattg gcgttcaaac 3660 tagctagtga cgcataccgt cagatgaacg ctgttctcta taagtcttat gtcacacgca 3720 cacaatttgg cttgcgaaag gatccgagaa gcttttggag ctttattaac tcaaaacgta 3780 aatgttcatc tatcccatct aacgtttttc ttgacgattc ggaatcatca tcagtgcaag 3840 agtcctgtga attgttcgcc aaacactttg catctgtttt ttcctgtcac ataacatctc 3900 aatcggaagc aaatgaagct gcatctaata ttcctgttga cgctgttgaa atggatattt 3960 ttgaaatttc ttccgacatg atcgtggctg cttcaaaaaa gctgaaaaac tcctactcgc 4020 ctgggccaga tggtgtacca gccattattt ttcgtcgttg cgctggttct ttggcttccc 4080 cgctttctca tattttcaat aaatccttcg agcaaaagaa attccctgct gtgtggaagc 4140 aatcgctaat gtttcctgtg cacaagaaag gagataaaag gaatgtaaga aactatagag 4200 gcattacaag tctatctgca ggatcgaagc tttttgaaat aattgtgagc aatgccatcc 4260 ttaatgctac caagagctac atttcaccca atcaacacgg atttatgccc ggccgttccg 4320 ttactaccaa tctcatagaa tttaccagca cctgtatatc gcagattgag ggaaatgcac 4380 aagtggatgc agtgtacacg gatttgaaag ctgcctttga tttaattgat caccgaatac 4440 tgctgtctaa gctgtcccgc ctaggtgcat ctgagcgttt tacgatgtgg ttggaaagtt 4500 acctttcgaa tcgagttctg cgagtcaaac ttgaatcgtg cgtttccggt atgttcagca 4560 gcagttcagg cgtcccacaa ggcagcaatt tgggcccttt gctatttgta atatttttca 4620 atgatgttac cctactgcta ggatcaaaat gtgtgctggt atacgccgat gatttaaaga 4680 tttatctggt agtcaagaca agagaagatt gtatccgact tcaagaacta ctggatactt 4740 tcgttgaatg gtgtaggtta aatcggctcg ttgttagcgt cgctaaatgc ttagtgatca 4800 cgttccatcg atcgaagtgt ccaatcgctt tcgattactg tattgacggt aatgtcctcg 4860 agcgcgttga tcaagtcagc gatttgggcg ttgttctaga cgcaaaacta acattcaacc 4920 atcacatcac gtcaatcata actaaagcaa cacgacaatt gggctttatt acgaaagttt 4980 ctagggattt tactgatcct cactgcctca aagcactata ttgcgctttg gttcgaccaa 5040 tccttgaaaa cgctgctgta gtgtggactc cttaccatct ctcgtggatc atcaggattg 5100 aacgtgttca acgtaggttt atccgcattg cgctcagaga tctaccctgg agagacccgc 5160 aaaatctccc gccctatggc gagcgttgtc ggcttctagg actcgataca ctgcaacgac 5220 gccgacagtt ccagcaagct acatttgttg ctaatatttt gaccggtgag attgacgctt 5280 cagcacttct atcccgcatt gaatttcgag ccattggtag acagctgcgg tcctcttctt 5340 tattgatgac cagatttcac cgaacatcgt atggttttca tgaaccgctt acctcatgtg 5400 tccgaacatt ctctttagtg gataatttgt ttgattttgg agaaaggcct agttccttca 5460 gaagaaggct ggaaagatca agaatgtttt attaatagta attaggtttt agtcttaagg 5520 aaattcatgg agactactgt cagatgaaaa agataataat aaataaataa ataaataaat 5580 // ID Gypsy-17_SI-LTR repbase; DNA; INV; 222 BP. XX AC AEAQ01023932; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_SI_; KW Gypsy-17_SI-I; Gypsy-17_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023932; Positions 5424 5203. XX SQ Sequence 222 BP; 64 A; 51 C; 60 G; 47 T; 0 other; tgtaaggaaa cgccccacga cccttggaac agcgtgtcac cttcgggcgt cagtcccaag 60 gacgcttgga ggggatgccg ctgcttggct gcgcgaacga atgaaggagc agtgatgaga 120 gtcgtcgagt gacacacgag agacagagat tcttgtccgc tacagaaaag aaataaatag 180 ctaacctttc attaaaagta aacgtgcgtt cattttctta ca 222 // ID Gypsy-29_CQ-I repbase; DNA; INV; 4879 BP. XX AC AAWU01023199; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_CQ_; KW Gypsy-29_CQ-LTR; Gypsy-29_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4879 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 437-437 (2011). XX DR Genome; AAWU01023199; Positions 43017 38139. XX CC Positions [2262-2795] - Reverse transcriptase CC Positions [3891-4352] - Integrase core CC 'GTTGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 861..4856 FT /product="Gypsy-29_CQ-I_1p" FT /translation="MDKFDIPPFKFKQMPPNEVREEWTRYKRNFEYMALAN FT AVTNKTRLKHIFLARAGVDVQDVFSSIPDADVEERLGVDPFRIAINKLDEY FT FAPKQHEAYQRFLFWSQKMKDGESLDKFMLRAMDLASKCNFGNTQQEARDI FT SVIDKVIQMAPDDLREKLLQKEHLSVDSVTKTVNAYQSVKFQAGQMAGHSN FT QPKPFQEVNVVNNDAAGAVGECSRCGYPGHFHYDERCPARDRTCKVCGNVG FT HFGIKCRSRKARKRKFNGPTGQRKRVKIERVRAVDTEEEGKSTDFIFTIGD FT GDELIWVTLGGIVVQMLIDSGCEKNIVDSGTWDKLKNQGVSVSDMRTDSRV FT NFRAYGHTDPLKVSRVFEASIAVGDNGERAKTMATFYVVEGGQQPLLGRAT FT AKVLGVLVLGLPSTRPPELYQMSCEQKRPFPKIKGVKVVIPIDPTVSPVAQ FT HVRRPPLALLDKIECKIDSLLASDIIEPVEEYSPWVSPLVAVLKDNGEIRL FT CVDMRRANLAIKRESHLMPTFEDFLPRLKEARVFSRLDIKDAFHQLELDES FT CRQITTFISHKGMFRYKRLMFGLSNASEVFQKIVEQMLAGCPNALNYIDDI FT LVYGKNEEEHDAALAKVMDKLKAKNVLLNHKKCVFRVSEVVFLGHHISGTG FT IRPAEEKLSALKSFRAPQTVEELRSFLGLVTYVGRFLPDLATATVALRKLT FT HAGEKFRWEQEHENAFQKVKRLICDLKTLRYFDNKLRTRVIADASPVGLGA FT VLVQFTDYRNDSEVRVIGYASKSLSPTEKRYCQTEKEALALVWAVERFSVY FT LLGRHFELETDHKPLEIIFSPSSSPCPRIERWVLRLQAYRFAVKYRKGSSN FT IADPFSRLCTLDEGVDFDSDSKFLVLAIAESAAIDTNELEKASQGDGELAA FT VRECVRSGVWNQPDAKPYERFQGELGVLGESLLVRGSKLVVPRALRPRMLD FT LGHEGHPGETVMKRRLRDRVWWPNMDGDIVKHVTGCEGCRLIGLPSKPEPM FT CRRELPTKPWVDVAIDFLGPLPSGEYLLVIVDYYSRYKEIEVMKHITAEET FT TKRLHRIFTRLGFPVTITLDNGRQFVSTHFDHYCKQHGISLNYSTPYWPQE FT NGLVERQNRSLIKRLQISHALGRDWKEDLAGYLLMYYTTPHSTTGKTPTEL FT CYGRTIRSKIPSLIDVESTPNDEDFRERDRLLKQKGKELEDKKRHARPSDL FT QPGDTVLMKNVLPGNKLTTAYSPKEHVVLSKEGGRVTVQDKESRKVYKRNV FT THLKRVPVEAPSESSVAGSVEAPQEDPVPPVELRPSPVAEEAQADNHFRNR FT REVKRPTRFRDYIVSHLVDD" XX SQ Sequence 4879 BP; 1363 A; 1129 C; 1375 G; 1012 T; 0 other; aattggcgac tgtggatggg ttgcggaata catcacgcga ttggtaagat tttgttcttt 60 atcgttgaat gttttttctt ccagcgtcaa attatgcggg gcggagcccc tccgtatgct 120 ttgacctgtt acacacacac acacagtgat gccaaacggg tccggcaccg gaaggagtgg 180 tcgaagagag tgaaaactca cacgccgtgg cttcgagcca ggaaaattgg tcaaaggcct 240 tgggccggcg acgagacttc gagtcagaaa cgaaaagact ggactcggaa tcgagtagag 300 gcttcgagcc aggttacaac aaaggccctg agccagatgt tggaaaggcc tagagccagg 360 gaaagttggt taaatccaca aaaaaaaaaa gaagaagaaa gaaattgagc tgagttgtca 420 aaacaacagg ccacccaggc tgacagacat ttctctgtga cagcattcgg aaaagtctgc 480 gattaaaagt tctatcgaca acttggcaac ccagccaact gtcaagctga aaacagacaa 540 cgaaacgaga accagatgta aacattgtgt aatcgaaaca attttgtgtt gttacagata 600 atacgtgatt tcgagttacc agatggctta ctgaaacaag ggtttcgggt aaaatcagac 660 tgaccggtaa gaaaaacaac atttgagttg tgtaaactgt aaatacatga gtaaatgaca 720 ttgtaggacg aaagcagaaa aagcgaagtc aactgagtgc gctgtcaact agtggagcga 780 tatcgattag cccaggtaat ttttacagtc tgacgaatgc atgagaaaaa gctgagaaag 840 ttgtgatttc agcaagcacc atggacaagt ttgacatccc gccgtttaag ttcaaacaaa 900 tgccgccaaa tgaagtgcgc gaggagtgga ctcgctacaa acgaaacttc gagtacatgg 960 ctctggcgaa cgcagtgacg aataagacca ggctgaagca tatcttcctc gcccgagctg 1020 gagtcgacgt ccaagacgtg ttcagcagca ttccagacgc tgacgtcgaa gagcgcctgg 1080 gcgtcgaccc gttcagaatc gctatcaaca agctggacga gtacttcgca cctaagcaac 1140 acgaagccta ccagagattc ttgttctggt cacagaaaat gaaggacgga gaatcactgg 1200 acaagtttat gctgcgcgct atggacttgg cttcgaagtg taacttcggg aatacccaac 1260 aggaagcacg agacataagc gtaatcgaca aagttatcca aatggcccct gacgacctgc 1320 gcgaaaagct tctgcagaag gagcacctga gcgttgattc ggtgaccaag accgttaacg 1380 cttatcagtc cgtcaaattc caagcgggac agatggcagg tcactcgaac cagcccaaac 1440 catttcaaga ggtgaacgtg gtgaacaatg acgcggccgg cgcggtcggc gaatgctcgc 1500 ggtgcggcta tccaggtcat tttcactacg atgaaagatg tccggcgcgg gaccggacct 1560 gcaaggtctg cggaaatgtt ggccatttcg gaatcaaatg ccgttcgaga aaagctcgaa 1620 agcgcaagtt caatggacca actggacaac ggaagcgagt aaaaatcgag cgggtgcgag 1680 cggtggacac cgaggaagaa ggtaaaagca ctgattttat tttcacgatc ggtgatggtg 1740 atgagctcat ctgggtgaca cttggaggca tcgttgtaca aatgctgatc gactcgggct 1800 gcgagaaaaa cattgttgac agcggaacgt gggacaagtt gaaaaaccaa ggagtttcag 1860 tatcggacat gcgcacggac tcccgagtaa acttccgggc atacggacac acggatcccc 1920 tcaaggtcag ccgggtattc gaggcatcca tcgcggtggg ggacaacggt gaacgtgcca 1980 agacgatggc aactttctac gtggtggagg gcggtcaaca acctttattg ggccgcgcca 2040 cagcgaaagt gttgggtgtg ctggtgttgg gattacctag cactcgtccc ccggagctgt 2100 accagatgag ctgcgagcaa aagcggccgt ttccaaaaat aaaaggcgtc aaggtagtca 2160 taccaattga cccgacggta tcacccgtcg cgcaacatgt ccgacgccca cccctggcgt 2220 tgcttgataa aattgaatgt aagattgatt ctttgctggc ctcggacatt attgaacctg 2280 ttgaagagta cagtccgtgg gtgtcgccgc tggttgccgt gctgaaggac aacggagaga 2340 ttcgactctg cgtagacatg cgaagagcta atctcgcaat caagcgcgag tcgcatctga 2400 tgcccacatt cgaagatttt cttccaaggc tgaaggaagc acgcgtgttc agccgtttgg 2460 acatcaaaga tgccttccat cagctggagc tggacgagtc atgccgtcaa attacgacgt 2520 tcatctctca caaggggatg ttccgataca aacggttgat gttcggactt tcgaatgcat 2580 ctgaggtttt ccaaaagatc gtggaacaga tgttggcagg atgtccgaat gccttgaact 2640 acattgacga cattttggtc tacggcaaga atgaggagga acacgacgcg gcgttggcga 2700 aggtgatgga caaactgaag gcgaaaaatg tgctgctcaa ccacaagaag tgcgttttca 2760 gggtatccga ggttgtgttc ttgggtcacc acatttcagg aacgggtatt cggccagcag 2820 aggagaaatt aagcgcgttg aaatcgtttc gagccccgca gaccgtcgaa gagctccgga 2880 gttttctggg cctcgttacg tacgttggcc gtttccttcc ggatctggcc acagctacgg 2940 ttgctttaag aaagctcacg cacgccgggg agaagttccg ctgggagcag gaacacgaaa 3000 atgccttcca gaaggtgaag cggttgattt gtgatctgaa gacgctgcga tactttgaca 3060 acaagctacg aacccgtgtg atcgctgacg catctcccgt gggactggga gcagtacttg 3120 ttcaattcac ggactacaga aatgactcgg aggtgcgggt gatcggttac gctagtaaaa 3180 gcctcagtcc aaccgaaaag aggtactgcc aaacggagaa agaagccctc gcattagtgt 3240 gggcagttga gcgtttttcg gtgtacttgc ttggacgtca tttcgaacta gaaacggatc 3300 acaagccgct ggagattatt ttctctccca gttcgagtcc gtgccctaga atcgagcgtt 3360 gggtcctgag actgcaagca taccggtttg cagttaagta tcggaagggc agcagtaaca 3420 tcgcagatcc gttctcgagg ctgtgcacgc tggatgaagg cgtggacttc gacagcgaca 3480 gtaagtttct ggtgttggca atcgctgagt cagcagctat cgatacgaac gaactggaga 3540 aagcctcaca gggagatgga gagctggcag cagtccgcga gtgcgtccgt agcggcgtat 3600 ggaaccaacc tgatgctaag ccgtacgaga gattccaggg ggagctcggc gtgctgggag 3660 aatcactgct ggttcgaggg tcaaagctag ttgttccgcg ggcgttgcga ccgaggatgc 3720 tcgacctcgg acatgaaggt catccgggag agacagtcat gaagcgtcgg ctaagagata 3780 gggtgtggtg gcctaacatg gacggtgaca tcgttaaaca cgtaacaggt tgcgagggat 3840 gcagactgat cggcctgccg agtaagcccg aaccgatgtg ccgcagggag cttccgacga 3900 aaccatgggt ggacgtcgca atcgacttcc tgggtccgtt gccgtctgga gaatacctgc 3960 tagtgatcgt ggattattac agccggtata aggagatcga ggtaatgaag cacataaccg 4020 ctgaggaaac tacgaaacgg ctgcacagga tattcactcg tctgggtttt ccggtcacta 4080 tcactctgga caatggcaga caatttgtga gcacacattt cgatcattac tgcaagcaac 4140 atggaatatc actgaactac tcaactcctt actggccaca agagaacggc ctcgtcgagc 4200 gccagaatcg atccctgatc aagcgactgc agatcagcca cgcgttggga cgcgactgga 4260 aagaggattt ggcgggctat ttgttgatgt actacacgac tccacactca acaacgggga 4320 agacgccgac tgagctgtgc tacggcagaa ccatcagatc gaaaattccc agcttgatcg 4380 acgtggagtc gactcctaat gacgaggact tccgggaaag ggaccgtttg ctgaaacaaa 4440 aagggaaaga attggaggac aagaaacgcc acgccagacc atctgatcta caaccgggag 4500 acacagtcct aatgaaaaat gtacttccgg gcaacaagct gacaacggcc tacagtccaa 4560 aggaacatgt cgttttgagc aaggagggcg gtcgcgtcac cgtgcaagac aaggagtcac 4620 ggaaggtata caaacggaac gtcacacatt tgaagcgagt gccagtcgag gcaccgagcg 4680 aatcttcggt ggcgggaagt gtggaagcgc cccaggaaga tccagtgccc ccagtcgaac 4740 tcagaccatc accggtagcg gaggaagctc aagcagataa tcatttcagg aaccgacgcg 4800 aagtgaaacg accaactaga ttcagggact acattgtttc tcacctagtt gacgattaga 4860 gtctagaaag aaaaggaga 4879 // ID CR1-12_BF repbase; DNA; INV; 3430 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-12_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-12_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3430 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3430 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1583-1583 (2009). XX DR [2] (Consensus) XX SQ Sequence 3430 BP; 1117 A; 826 C; 638 G; 849 T; 0 other; atggccccaa actctgactt taagtttact cctggcatac taaacttact gagtttccac 60 aaacaccaga aagacagacc caactcacct agaaacccac tcaagtcaat ttctgtatct 120 atagccctcc ttgttatgtt aacacagtca ggggacgttc atccaaatcc aggcccctac 180 aaacccaaat ttccttgtct cttgtgtggc aaagctgcca agtggaatca aagagctgta 240 gcatgtgatg aatgccatgg ttggtatcat gttgattgta tgtccatgtc tactgcaaac 300 tacaatgttc tggcagaatg gatatgctgt caatgcggaa tgcccaactt ctcaagttca 360 ctctttaata atagtgactt cgagctgtct aacagtttcg aaagtttgtc ttcgatacat 420 tcacaaacca actcttcttg caattcctgc cctgatggga atcatatcaa cccgtcaaac 480 ccgggaattg ccaagtcacc cttggccgct tcatcgcctc ttaaggctaa agccaagcca 540 agtagacgca agccccccaa acagaaatta aaaacatgtg ttataaattt ccagggcata 600 cgcaacaaaa ctgctgatct agcagcatgc ttagaccagc attctccaga tattatacta 660 ggctcagaaa cacatctaaa tgactcagtt ggcagtagtg aacttttccc cccagactat 720 accgtcataa ggaaagacag agaactgggg agaaagggag ggggtgttct ccttgcctac 780 aagaacgact tgatagttac acatagatta gatctagaca ctgagtgtga aacggtatgg 840 gcaaccttgg agatccaagg agccaaaccg attacaattg gttcttgtta cagatcgcat 900 acttatgcaa aagacattga ctttgtagat aaactgagag cctcattaaa caagattaaa 960 ccacaaggtg gccaagtatg gctagggggg gatctcaatt tccctggtat agactggaac 1020 aacttaagtg tgagcccaag cgctccctat agtgcacttt ctaaacattt tttagagtta 1080 acacatgact ttgggctcga acaggttgtg gatcaaccaa cccgtctcaa taacaccctt 1140 gatcttttct taactaataa ttcatcactt gttgaaactt gttctctgat cccaggtata 1200 ggggaccatg atggcatccc agttattact gttaatgtta aacctaagaa aatcaaacaa 1260 agccccaggc gtgtttcact ttggaaaaaa gctgatgaaa ccgccataag aactgagcta 1320 caggattaca gtaataattt agctaaacgg aacattgaca actgcacaac agaagagctt 1380 tataatgaat ttgtggacaa aatcaagata gtgatggaca aatatgtccc atccaaaact 1440 attaacaata ataatttttc accttggatt aacaacaagg ttaggaaaat acataagaaa 1500 aaacagcgag cctacaactc ttatcgtaaa aacccaacaa ccgagaactt gactaaattc 1560 cgcactattc gcaaacgagc aaaaaagact accagacaaa actaccgtaa atatgtaaat 1620 tctatatgtt ctgattcccc caagaagttt tggtcgcaca tcaaacactt aaaacaggat 1680 acaactggga tccctagtct taagcagaaa ggaaagctag aaacagataa cagaaacaaa 1740 gctaacatcc ttaacaacca attcagctct gtctttacac gagaggatga acacatgccc 1800 cacctccccc gtagttctat ccccactatg cctgactttt ccatcaatgt aaacggagtc 1860 accaaactcc tgaaagatct gaatccacac aaagcaagtg gaccagacgg cctaccaaca 1920 aggatcctca aactagcagc tgctgagtta gcccctgctc tgactattat ttgtcagaaa 1980 tcacttgaaa ctggccaaat cccttcccca tggctacatg ccaatatctc tcccatcttt 2040 aaaaaggggg acaggactga tcctgcaaat taccgtcctg tttctctcac atgtgtttgt 2100 agtaaggtta tggagcacat agtgcaatca cagatgatgg atcattttga caaatactct 2160 atattatctg acaagcagca tgggtttagg aaaaagcgtt catgtgaaac ccaacttatt 2220 ttaactacaa ctgacttagc ccattcttta gataccaggt cgcaaacaga tatgattata 2280 accgacttcg caaaagcttt tgacaaagta cctcatgatc gcttactcct aaaactccaa 2340 aattatggta ttaggggtaa ggtacttcac tggatctcta atttcctcaa aaatcgtaaa 2400 caaagagtgg tagttggggg agagcactca gagtgggcag aggtagtctc tggagtgccc 2460 cagggtactg tgctcggacc actactgttc ttagtgtaca ttaatgactt agccgacaac 2520 ctcaactcta acatccgtct ctttgcggat gactgcgtca tttacaggga gataaacagt 2580 gacagagatc acaccctctt acaagaggac atcaatacat tagatacatg gcaaaacaca 2640 tggcagatga aattacaccc ggacaaatgt tttgttatgc gttataccca caaacgtaaa 2700 cctaaactct acgactacaa actaggaagt catgttctag ccgagactaa caaccacaaa 2760 tacttaggag tcacactcaa caaccagctc tcttggtcac aacacataca gtacagtgct 2820 tccaaggcta acaagacgtt aggctttgtg agacgcaatc tttataactg ccccaaatca 2880 gtcaaaacaa atgcatatac atcccttgtt agaccgcatt tagagtattc cagcgctgcc 2940 tgggatccat accacaaaga gcacattgct aagttagagt ccgttcaaaa aagagcggcc 3000 agatttgtca caaattcgta tcatagttat gatagtgtca ccaaacttgt ctctgatttg 3060 gggtgggaca ctttacgcaa cagaaggaca gcaaatagat taacaatttt acagaaagca 3120 agacaccata aagtagccct accggtcgaa actcatctaa agcctgtacc gcgccaatcg 3180 cgacactcaa acccaaattc ttacaaggct ctaccttttc acaaagactg ctacaaacat 3240 tccttcttcc cacaaaccgt gagagattgg aactctctac cgtacgaggt cacggagatc 3300 acagacgctc caaagttcaa ggaggcagtt ctccgacgcc tcaggggaca ttaaggcgca 3360 gcactccccc tggacgcttt gccccaactg aggggcgttg tccagtaccc aatcaagatc 3420 aagatcaaga 3430 // ID Gypsy-29-I_NVi repbase; DNA; INV; 5327 BP. XX AC . XX DT 12-MAY-2009 (Rel. 14.05, Created) DT 12-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-29-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5327 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 994-994 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 782..1957 FT /product="Gypsy-29-I_NVi_1p" FT /translation="CCAPRRASAPTIARVPLAPLPPRVSREGATRHPSAVR FT RAHSVPRLGSRTSQRLAARRALNFXHSETTLRKPGQSSVGPQSXXXGANXS FT XXXPXPPPMPXXGVLPXPRPSXEXTRCEXPSYWXANPALWLAQVDAALEAS FT GVRSXRHKFNMVVARLPQDVAQELSDIILHPPAANRYXALRVAXLQRLXIS FT ADTQLHQVLNEVRLEDRTPSQLLRHMRRLANNAISDEAMRVKWLDLLPSNT FT SRLCRVLPATSLDELAHLADLAMVPEPQVHAVAARQPTPSSSGVSSGTASP FT QPESDMATLQATLAQLLTTSQRQSKLLETLVRNNTSNNXSNNISNSSSNSR FT GRARSRSRPRRTAAPAAASDDMCWYHRKFGQQATQCKQPCSFTPGPGN*" FT CDS 1816..5325 FT /product="Gypsy-29-I_NVi_2p" FT /translation="SALSLTPPSYRSASGRQRRHVLVSPEIRPAGHTXQAA FT LLLHPGPGKLKAPPPLQAEAVGAVTERRLHITDQSTKLRFLVDTGSAVSLL FT PLTHFKETRRRSPLTLSAANASAIGTYGTHSLTLDLALAKPLNWKFIVAAD FT VSDPILGADFLAHFGLAVDLHHRQLLDADHSHHTTGALLPARVFSVAVNIT FT ADVAEGVFADLLRDYQDLATPGSTTISLPDLSALHHIVTNGPPAAARPRRL FT HGERLEAARAGFRALLEMGIVRLSDSSWASPLQLVKKADGSYRITGDYRQL FT NSRTVPDRYPLPVIEDLLLELQGDTFSVIDLKKAFYQVPIAPEDAHKTAIT FT TPFGLFEFTRSSMGLRNAAQSLQRAMDHLLRDLPFARAYLDDIVVASWGRE FT QHLEHLKVLFATLRKARIKINQDKCVLAKESVTYLGYRISKDGCSPPPGKV FT EAIQAFPQPADTSQLRRFLGLVNYYRRCIPGAARLLAPLNDLLKLLPQCKK FT PSPLAWTKEAQDAFEKTKQALADAVTTTFLRREAPLRLYTDASDVAIGAAL FT EQXEREDIWRPLGFFSRKLSDTERRYSTYDRELLAAFASVKHFSSILEGRP FT FTLFTDHKPLSLAAQQPPEKASPRQSRQLDYIFQFPIKLAYVKGPNNIAAD FT ALQMPFRESTPSRCRLAWTSPPSPSSKRATKTCRTCSTPETENFSPSTSRV FT PPCIAQVXEKXXXRYIPKELRKTVFEALHGLSHPGVRSTTRTIAQKYFWPH FT MRKEIARWARACEPCQHSKVHRHNRAALGDFAAPDARFDHLHLDLIKLPSC FT SGXQYCLTIVDRFSRWPHAIPLPDQQADTVARAFLEHWISSFGTPLTITTD FT QGPQFEARLFAALANIIGAARVHTTPYHPQANGLVERMHRTLKAALMCCAP FT TPWPQALPAVLLGLRTTFKEDLQASPAEMLFGTTLRVPGDFFVSASHPDAN FT APAFVAELRGLIGRLRAAPGSRHLPPRTPFFHGDLRTCTHVFRRVDTVRTT FT LQPPYTGPHRVLRRLDDQRYVVEVNGEAKTLSTSSLKPAYLEVADQPPTNA FT PLPARASPAAAAPAPPPAAPSAQAAPPAAPSAQAATPSAPEARPLAQGPPA FT GPSTQAHPAPTPSTQPQPPVSPPVPAPSTSPPPPPSASPSLNRPRKTVSFR FT TQDTSGTGGGV" XX SQ Sequence 5327 BP; 1024 A; 1941 C; 1406 G; 907 T; 49 other; tactctcggc ggccagccca tccttctctc tcggcgtacc gcgcggagcc gagaaagaat 60 tttttttact gattcgactt acggcacgtc gcagcccatt tacagctcgc ggcggccact 120 tttatttttc tttacttctt gcatcgagca gcgagcgcag tcgcgcgcgg tggcacgcgc 180 gaccccggtg aatctcgccg ccggtgaagt gaccctcaat cgagcaccgt agcaacatca 240 ggcgagcacc agcggcacct cgtcgcacgg cactaccagc agcagggagt ctgcctccag 300 cccctctaca gctggagtgc gcgcggagcg gccccgtcgg tcgtctccct gcctgcgagc 360 acaccacgcg gcgagagcag ccgcaccacc gaggacgagg cacagcacca gaacatcgcg 420 gtgacacgca ccaacgcagc cggggtccca gccaggaggt gccttcaaca tcggcggaac 480 aacaccggcc gagcaacgca gtccagcagc agcgcggcgt ccacatctcg ccatcatcat 540 cggcagcgct cggcggccag cgtcaccttc acaccaggtt tttwtacgct ccgcaacagg 600 tatagttctt tcgcggcctt gagtcgctct ttcttttcaa gtgccgcgct atacctgttg 660 cgtgccgccg gcgccgacgc cgcgctccac gcagtcggcc atcttgccga ctcgcgtcgt 720 ctcccgcgct ttctcgcctt ccgccccggc cctcccgcgc gagcggccat tttcccggta 780 gtgttgcgcg ccgcgtagag ccagcgcgcc gaccatcgcg cgcgttccgc tggctccgct 840 accaccacgc gtgtcgcgcg agggcgccac ccgccatcct tcggcggtac gccgcgcaca 900 cagcgtgccg cgtctcggtt ctcgtacttc tcagcgcctt gccgcgcgcc gcgcgctcaa 960 ttttttwcac tccgagacaa cgctgcgcaa accgggccag tcctccgtgg ggcctcagag 1020 cwcgcmrrck ggmgcgaatg rttcggrcts yrtaccasct cctccgccca tgccmrccgr 1080 cggggtrttg ccgtwtcckc gaccgtctwm cgarkycacg cgctgcgagw twccatcata 1140 ytggartgct aatcccgctt tgtggctcgc acaggtcgat gcagckctcg aggccagcgg 1200 agtacgcagc gakcgccaca aattcaacat ggtggtggct cgcctyccyc aggacgtcgc 1260 ccaggagctg tcggacatca tcttgcatcc gccagccgcc aaccgctack grgccctccg 1320 agtcgcgstg ctgcagcgcc tgkccatctc cgctgacacc cagcttcacc aggtgctgaa 1380 cgaggtgcgc ctggaggacc gcacaccatc gcagctgctg cgccacatga ggcgcctcgc 1440 caacaacgcc atctccgacg aggccatgag ggtcaaatgg ctggacctcc tgccatcgaa 1500 caccagccgt ctgtgccgag tcctgccagc cacctcactc gacgagctgg cccacctggc 1560 tgacctcgcc atggttccag agccgcaagt ccacgctgtc gcagctcgcc aaccaacacc 1620 gtcctcgagt ggcgtttcgt caggcacggc atcgccccag ccagaatcgg acatggccac 1680 gctgcaggct acgctggcgc agctcctcac cacctcgcag aggcagagca agctgctcga 1740 gaccctcgtg cgcaacaaca ccagcaacaa camcagcaay aacatcagca atagcagcag 1800 caacagcaga ggtagagcgc gctctcgctc acgcccccgt cgtaccgcag cgccagcggc 1860 cgccagcgac gacatgtgct ggtatcaccg gaaattcggc cagcaggcca cacartgcaa 1920 gcagccctgc tccttcaccc cgggcccggg aaactaaagg cgccgccgcc cctgcaagcc 1980 gaagcggtag gcgcagtcac ggagagacgc ttgcacatca ccgaccagtc aacaaagctc 2040 aggttcctgg tggacaccgg ctcggccgtc tcactgctgc ctctcacaca cttcaaggag 2100 acgagaagac gtagccctct caccctcagt gcagccaacg cctcagcgat tggtacctac 2160 ggcactcact cgctgacgct ggacctcgct ctggccaaac cactcaactg gaagttcatc 2220 gtagccgcag atgtcagtga tcccatccta ggcgcggatt ttctggcaca tttcggcctc 2280 gctgtcgacc tccatcacag acagctcctc gatgctgacc actcacacca taccaccggt 2340 gcactccttc cagctcgagt cttctccgtc gcagtcaaca tcacggccga cgtcgctgag 2400 ggtgtgtttg ctgacctgtt gagagactat caagatctag ctactccagg cagcaccacc 2460 atctcactac cggatctctc cgccctccac cacatcgtga cgaacggacc accagctgca 2520 gcacggccac gccggctcca cggggagcgc ctcgaggcgg cgcgtgccgg atttcgagcg 2580 cttttggaga tgggcatcgt ccgattatca gacagctcct gggcgagccc tctacagctg 2640 gtcaagaaag ccgatggctc ataccgcatc acgggcgact atcgccagct caacagtcgc 2700 accgtcccgg acagataccc gctcccagtt atagaggatc ttttactgga actacaaggc 2760 gacaccttct cggtcataga cttgaagaag gccttttatc aagtccctat cgctcccgaa 2820 gacgctcaca agacggcaat tacaaccccc tttgggctct tcgaattcac ccgctcttcg 2880 atggggctcc gcaacgctgc acaatctctc caacgcgcaa tggatcatct tctgcgcgac 2940 ctgccgttcg cacgagcata tctggacgac atagtcgttg catcctgggg ccgagagcag 3000 catttggagc acctcaaggt cctcttcgcc acgctacgca aggcacgcat caagatcaat 3060 caggacaagt gcgtcctggc caaggagtca gtcacctatc tcgggtaccg catctctaaa 3120 gatggctgta gtccaccacc aggcaaggtc gaggcaatcc aggcttttcc gcagccggct 3180 gacacttcac aacttcgtcg tttcttgggg ctggtcaact actacaggcg atgcatccct 3240 ggagctgcac gcctcctcgc acccctcaac gatctgctga agctcttgcc gcagtgcaag 3300 aaaccgtcgc ctctcgcctg gacgaaggaa gctcaagatg cattcgagaa gacgaagcag 3360 gcgctcgctg acgcagtcac caccaccttt ctgcgtcgcg aggcccctct acggctctac 3420 accgacgcct ccgacgtcgc catcggtgct gccctggagc agyacgagag ggaagacatc 3480 tggcgcccgc taggcttctt ttccagaaag ctctcggaca ccgagaggcg ctacagcacc 3540 tacgatagag agcttctcgc cgcattcgcc tcggttaagc acttcagcag catcctcgag 3600 gggcgccctt tcaccctgtt cacggaccac aagccgctgt ccctcgcagc tcagcagcct 3660 ccggaaaaag catcgcctcg ccagtccaga cagctggact acatattcca gttcccgatc 3720 aagctggcgt acgtcaaggg ccccaacaac atcgccgcag acgccttgca gatgccyttt 3780 cgagagtcaa caccatcaag atgccggctt gcctggacct caccaccctc gcccagcagc 3840 aagcgagcga ccaagacctg ccgcacttgc tcgacgccgg agacggaaaa cttcagcccc 3900 tcaacatcga gggttcctcc ctgtattgcg caggtgrcgg aaaagktawt msgccggtac 3960 atccccaagg agctgaggaa gaccgtcttc gaggcactgc atgggctctc ccacccaggc 4020 gtccgttcca cgacgaggac gatcgcccag aaatacttct ggccacacat gcggaaggag 4080 attgcgcgct gggcgcgcgc ttgcgagcca tgccagcact ccaaggtcca ccggcacaac 4140 cgagcagctc tgggagactt cgcggccccg gatgcgcgtt tcgaccacct gcacctggac 4200 ctyatcaagc tcccgtcgtg ctcgggtttm cagtactgcc tcaccattgt cgacaggttc 4260 tccaggtggc cgcacgccat ccctcttccg gaccagcagg cagacacggt cgcgcgagcc 4320 ttcctcgagc actggatcag ctcgttcggg acgccgctca ccatcaccac ggaccagggc 4380 ccgcaattcg aggcgcgcct tttcgccgcc ctcgccaaca tcatcggcgc agcccgggtg 4440 cacacaacac cttaccaccc gcaggccaat gggctggtgg aaaggatgca tcgcactctc 4500 aaggcggccc tcatgtgctg cgcaccaaca ccgtggccgc aggcccttcc agccgtcctg 4560 ctgggcctca gaacgacctt caaagaggat cttcaggctt ctccggccga gatgctcttc 4620 ggcacaactc tccgagtccc aggagacttc tttgtttcag ctagtcatcc agacgcgaac 4680 gcgccggctt tcgtcgccga gttgcgtggc ctcatcggcc ggttgcgggc agccccaggg 4740 tccaggcacc ttccgcctcg cacgccgttc ttccacggcg acttgcgcac ctgcacgcac 4800 gtcttcagga gggtcgacac ggtccgcacg acgctgcagc cgccctacac ggggccacac 4860 agggtgctgc gacgcctcga cgaccagcga tacgtggtcg aggtgaacgg cgaggccaag 4920 accctctcga ccagctccct caagccggcg tacctggagg tagccgatca gccgcctacc 4980 aacgccccgc tccctgcgcg ggcctcacca gctgcagcgg ctccggctcc tccacctgcc 5040 gcgccatcgg cgcaggcagc tccaccagcc gcgccatcgg cgcaggcagc aacaccatca 5100 gcgcccgagg cacggccact cgcacagggt cctccagctg ggccatccac gcaggcgcat 5160 ccagcgccca cgccgtccac ccagccgcaa ccacctgtgt cgccgcctgt cccggcgcct 5220 tcaacatcgc cgccgccgcc gccctccgcc agtccatccc tcaacaggcc aaggaaaacg 5280 gtgtcgttcc ggacgcagga cacgtccggc actggcgggg gagtagc 5327 // ID L1-32_AAe repbase; DNA; INV; 4525 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-32_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4525 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1385-1385 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 160..1278 FT /product="L1-32_AAe_1p" FT /translation="MSVRRENTFRIDYANVPRKPSFEELHDFVATTLGLQY FT EQVVQLQPSRALGCAFVKVVDLELARRIVAEHDNKHETEVDGKIYKLRITL FT EDGAVEVKLTDLSEDITNEQVAEFLSAYGEVLTITNQVWDSKYRFAGLPTG FT ARIARMMVKRNIKSYVTIDGQTTNVTYFGQLQTCKYCSEFVHNGISCVQNK FT KLLVQKTYANVAKEPAEKNTAPRXSVSKPKPTFAKLFGPKPGEAAQQKVSQ FT KTKRAGTSENAFAVPKPTKPNLAEPSTKKAGASETVFAAPLSSKANPDKPL FT ATAHALSKPNPLDQTTALVTPPVLSQNLLTTTRMVTRQAASDGNETDISTA FT STNSKRRHGRPPGKKLRHDDDTNEESDNLI" FT CDS 1281..4454 FT /product="L1-32_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MAFMSYNLASINITTITNTTKLNALRTFLRTMDIDIA FT FLQEVENDQLHLPGYSVVCNVDHARRGTAIALKEHIRFTHVEKSLDGRLIA FT LRVQNTTFCNVYAPSGTALRAERERFFTNTLAYYLRHHTEHTLIAGDFNCV FT LRQCDATGHNPSPALLTAVQQLQLIDVWQKICPNTPGHTYITHNSSSRLDR FT MYVSQSLGSHLRTAVTHVCSFSDHKALTVRICLPNLGHEPGRGFWSLRPHL FT LTPENIEEFQYRWQFWTRQRRNFPSWMQWWLSFAKPKIKSFFRWKSKAAFD FT DFHREYQRLYVELRSAYDGYYRNPAMLSTINRVKAQMLTLQRNFSQDFMRI FT SERYISGEPISMFQIGERRRRKTTIAHLRDEQNQPIDDAHAIEQHMFDFYR FT TLYAAEQTDVVADELFLCERVVPDNDPANVACMGEITPADIIMAIKSSSPN FT KSPGGDSLPREFYLKTFNVIWRELTLVMNEALAGNFPAEFVDGIVVLVKKK FT GNDETAHSYRPISLLNCDYKILSRILKQRLEIVMRNHRVLSAGQKCSSFGR FT NIFQATLALKDRMAQLIKRKQRGLLASFDLRHAFDLVDRSFLFRNMCALGF FT NPNFVRLLSKIGELSSSRLLLNGRLSGAFPIERSVRQGDPLSMHLFVLYLE FT PLIKRLEETCGPDLVVAYADDVSVIVTCEARLERIRELFHRFGRVSGARLS FT LAKSTSVSVGYTDDAPLKVPWMRCENTVRILGVTFANSIRLMNKLNWDALV FT GNFARQXYLHSQRALTLQQKVTLLNVFITSKIWYLASLLTPAAVHTAKLTA FT AMGSFLWRGIPARVPMLQLSRNVEGGGLKLQLPTLKCKALLINRHIREIDS FT TPFYSSFISHAIPNPPADCPCLKQILSNFPLLPPQIQQNVSVGAIHNFYIS FT QTDEPKVSRDYPAASWPQIWRNIASKQLSSGNRSLLFLLVNGKLNHRKLMH FT RTNRADGENCLHCTEPCETLEHKLGTCVRVEAAWRLLQRKISTLLNGWRRL FT SIEDLLKPSLRNVARWKKTKILKMFASYVVFIMDNNNVIDVDSLEFEIDNA FT V" XX SQ Sequence 4525 BP; 1261 A; 1120 C; 1027 G; 1115 T; 2 other; tcagttagcg ctcaacttcc aagcagagca gttgtgtttc tcgctggaag ccacacgcaa 60 gctattgcga cttaggtcga tttttttttc tttctttgag gtgttatagt acacgtgcag 120 tttcgcgttt gcgagtgtag tgtcgcggat atcgccgcga tgagtgttcg acgcgaaaac 180 acgtttcgca tcgattatgc gaacgtgcca aggaagccgt cattcgaaga gctacatgat 240 tttgtagcca ccacattggg actccaatac gaacaagttg ttcaactcca gcctagcaga 300 gcactcgggt gtgccttcgt gaaggtcgtc gacctggagc tagcgcggag aatcgtagca 360 gagcacgaca acaaacatga gacggaggta gatggcaaaa tctacaagct tcgtattacg 420 ctcgaggatg gtgcggtgga agtaaagctg accgatctgt ccgaggacat cacgaacgag 480 caggtggctg agttcctcag tgcctacggc gaagttctca ccattaccaa tcaagtatgg 540 gacagcaagt atcgcttcgc tggtctacca accggcgctc ggatagcaag aatgatggtg 600 aaacggaaca ttaaaagtta cgtcacaatc gatggtcaaa ccacgaacgt gacttatttc 660 ggacagttac agacctgtaa gtactgctcc gaattcgtgc acaacggtat ttcttgtgtg 720 caaaacaaga agttactggt tcaaaaaacc tacgccaatg tagcgaaaga acctgcggaa 780 aaaaacaccg caccaagatt kagcgtctca aaaccgaaac ctacatttgc gaagctcttc 840 ggaccgaagc ctggggaggc tgcccagcaa aaagtatctc aaaagacgaa aagggctggt 900 acgtcggaga atgctttcgc tgtgccgaag cctaccaagc cgaaccttgc tgaaccatcg 960 accaaaaaag cgggagcatc cgaaactgtg tttgctgctc cgctttcttc taaggcgaac 1020 cccgacaaac cactggctac tgcccatgcg ttgtccaaac cgaacccgct cgaccagacc 1080 actgctctcg taactcctcc ggttctctca cagaaccttc taacgacaac tcgtatggta 1140 acacgtcagg ccgccagcga tggtaacgaa actgatattt caacagcttc gacgaacagc 1200 aaacgccggc atgggcgacc accgggcaag aagctgcgac acgatgacga cacgaacgag 1260 gagagcgata atttgatcta atggcgttca tgagttacaa tttagcatca atcaacatca 1320 ccacaatcac aaatacgacc aaattaaacg cactccgaac ctttctccga acaatggaca 1380 tcgatattgc ctttctccaa gaagttgaaa atgatcagct acacttgcct ggctatagcg 1440 ttgtatgtaa tgtggatcat gcaagaagag gaacggcaat agcgctcaaa gagcacattc 1500 gtttcaccca cgtcgaaaag agcttagatg gacgtttgat tgcccttcgg gtgcaaaaca 1560 caacattttg taatgtatac gctccgtctg gaactgctct acgggctgag cgggagcggt 1620 tcttcacaaa cactctagct tactatctcc gccaccacac agaacatacg ttgattgcgg 1680 gtgactttaa ctgtgttctg cggcagtgtg atgcgacggg acacaatccc agccccgctc 1740 ttctaacagc tgtacagcag ttacagttga ttgacgtgtg gcaaaaaatt tgcccaaaca 1800 cacctggaca cacatacatc actcacaatt cgtcttctcg gctagaccgc atgtatgtta 1860 gccagagctt aggtagtcat ctgcgaacag cggtcactca tgtctgctcc ttttccgacc 1920 acaaggcatt aactgtgcgt atatgcctac ccaatcttgg tcatgagcct ggtcgtgggt 1980 tttggtccct ccgaccgcat ctcctcacac cggagaatat tgaagaattc caataccgct 2040 ggcaattttg gacgcgccaa cggcgtaatt ttccttcttg gatgcagtgg tggttatcat 2100 tcgctaagcc caagataaaa tccttctttc gttggaaatc taaggccgcc ttcgatgatt 2160 tccatcgcga ataccaacga ctgtacgtgg aacttcgatc agcatacgat ggctactatc 2220 ggaatccggc gatgttatca acgataaatc gagtcaaagc gcaaatgtta accttgcaac 2280 gaaatttctc ccaagatttc atgcgaatta gcgagcgata tatttccggt gaaccgattt 2340 caatgttcca gatcggcgaa cgaagaagga ggaaaacaac catcgcccac ttgcgagatg 2400 aacaaaatca gcccattgat gacgcgcatg cgatcgagca acacatgttc gatttctacc 2460 gtacgctcta tgctgctgaa caaactgacg tggttgcaga cgaactattc ctatgcgaga 2520 gagttgtccc agataacgat ccggcaaacg tggcctgcat gggagaaata acaccagccg 2580 atatcatcat ggccatcaaa tccagcagcc cgaacaaatc ccccggaggt gattcgctcc 2640 cacgtgagtt ctaccttaaa acattcaacg tcatctggag agagttaacg ttggtcatga 2700 atgaagctct tgctggaaat tttccagctg aattcgtcga tggaatcgta gtactggtga 2760 agaaaaaagg gaacgatgaa acggcccact cgtacagacc gatatcctta ctcaactgcg 2820 actataagat attgtctcgc atactcaaac agcgcctgga aatagtaatg agaaaccatc 2880 gtgtgctcag cgcaggccaa aagtgctcca gcttcggacg aaacatcttc caggccacgc 2940 tggccttgaa agaccgcatg gcacagctta tcaaacgtaa acaacggggt ctactagcat 3000 cgtttgatct gcgacatgcg ttcgatcttg ttgatcgctc tttcctcttt cgcaacatgt 3060 gcgcactcgg ctttaacccg aactttgttc gtcttctcag caaaatcgga gagctatcat 3120 cttctcgcct gctcttaaat ggacgtctgt cgggagcatt tcccatcgag agatcagtgc 3180 gtcagggaga tccgctctcg atgcacttgt tcgtgctcta cctggaacca ctcatcaaga 3240 ggcttgagga aacctgcggg cccgacttgg tggtggcata cgctgacgat gtgtcggtga 3300 tagttacatg cgaggccaga ttggagcgaa ttcgagagct atttcatcgt tttggacgag 3360 tgtcgggtgc taggctaagt ctagcaaagt caacgtctgt atcagtgggt tacactgacg 3420 atgcgccact caaagtgcct tggatgcgtt gtgagaacac cgttcgaatt ctaggagtca 3480 cctttgccaa ctccattcgg cttatgaaca agttgaattg ggatgcgctc gttggcaatt 3540 ttgctcgtca amtatatctg cactcacaac gggcgctcac gctacaacag aaagtgactc 3600 tactaaatgt gtttatcaca tcgaaaatat ggtacctcgc atcgttgctc acgccggctg 3660 ctgtgcacac agcaaaactg actgctgcga tggggtcttt tctgtggcgt ggtattccag 3720 ccagggtacc aatgctgcaa ttgtcccgca atgtagaagg aggtggattg aaattgcaac 3780 tgccaactct aaaatgcaag gctttgctga tcaaccgaca tatccgtgaa atcgactcca 3840 ctccttttta tagttccttt atttcccatg caattcccaa ccctcccgca gattgtcctt 3900 gtctaaagca aatcctatcc aattttcctt tacttcctcc tcaaattcag caaaacgttt 3960 cagtcggcgc catccacaac ttctatatca gccaaaccga tgaaccgaaa gtttcacgtg 4020 actatcctgc agccagctgg ccgcaaatct ggcgaaatat tgcatccaag cagttgtcct 4080 cggggaatcg aagtttgctg ttccttctcg taaacgggaa gcttaatcat cgaaaactca 4140 tgcatcgaac caacagagcc gacggtgaaa actgcttaca ttgtaccgag ccgtgcgaaa 4200 cattagagca taaattgggt acgtgcgttc gtgtagaggc agcttggaga ctactacagc 4260 ggaagatatc tactttgctc aatggatggc gtagactttc tatagaggat ttgttgaaac 4320 cttccctaag aaatgtagca agatggaaga aaactaaaat acttaaaatg tttgcaagtt 4380 acgtcgtctt tattatggac aataataatg taatagatgt agattcacta gaatttgaaa 4440 tcgataatgc agtataagaa attattttgt aattactttt tactctttga tcaataaaca 4500 aattttatgt aaaaaaaaaa aaaaa 4525 // ID Gypsy7-I_AP repbase; DNA; INV; 4472 BP. XX AC Contig49402; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7AP; KW Gypsy7-I_AP; Gypsy7-LTR_AP. XX NM Gypsy7-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4472 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 449-449 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [3456-3827] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1896..4469 FT /product="Gypsy7-I_AP_1p" FT /translation="MLRAGICRPSNSAWASPLLLVGKKDGSFRPCGDYRRL FT NAVTKPDRYPLPHIHDFSSNLYGKTIFSKLDLVRAYHQIPVAHGDIPKTAV FT TTPFGLYEFPVMCFGLKNAAQTFQRLVNEILRGLDFVYAYIDDVLIASHSE FT SEHETHLRSVIERFKKFGVAINVNKCVFGVQQLIFLRHLINGEGCSPLQER FT VDDVHNWPLPLSKKALQRFLGSDNYYHRFIPNAAQLQAPLYDLVSSVKQRD FT GKLQWTKDTRDAFENSRNALASTVHLAHPRPDANLRISTDASNTAIVAVLE FT QFNNNHWEPLDFFSKKLTPTQARYSTFDRELLAAYLALRHFLHLVEGRCTV FT LLTDHKPLTHMLTVKTDKYSDRQLRHISFIAQFVQQVEYVQGDKNVVPDAL FT SRIETVTCHAQLPDFATLSMDQAEDPELQGLLNGTLASSLRLEARDTGSGP FT VYFDVSAPGRLRTYVPATLRRRIFKILHNQAHPGIRATLALIKERHSWVDM FT DCQIRNWVRHCTTCQRTKVQRHVIAPVMPFVTPKRRFGHIHIDLVGPLPSS FT DGHEYLLTAIDRYTRWPEAYPLTNMSAHAVADKLVSQWFSRFGTPDVVTTD FT QGRQFESELFAALSQTYGFRHSRTSPYHPQANGLIERLHRPLKAALTAHNN FT PQWIKSLPTVLLALRSIVKPELGCSPAEMVYGTTLRLLGEIFHAVQPEPHA FT PALVRILKESMSLLRPTTGTDHSKRSIFVPEELQTATHVFLRVDSTRKPLQ FT PRYDGPFAVLDRNNKNFKLQLHNRTSWISIGRFKPAFLLREDPVADHSYAS FT AHVESQSQSTSTLHEVSTAEHPYYSTSIKSRIHSILRKPGTERLKKQVRFF FT FPRGR" XX SQ Sequence 4472 BP; 1121 A; 1275 C; 976 G; 1100 T; 0 other; cgacctcgtg ttttatattc gtatttattg taatttaagt tatttctaca aacttttgta 60 gctgtattac ctctttgtat ataaataaac atttcgctac gtgaattatt agtcactact 120 tgtgcgttct tatttaataa aatttaattt aggcactgtt tggtttcccg accgagcaat 180 acactaaaac tggtaacctc gagtgtattt ttcgacacac gttatgactt ctagtagcga 240 gtaccgtgat ctactctcgc ttgagaacgg ctcgctcgcg agtgatacgt cgaatccgcc 300 taccgtaaac ccgcccgttt cgaacgatag tttttccgtc gcgagcgtcg cacaagtgcg 360 tttgccttcg ttttggcgac actcgcctcg cgaatggttt ttacacgcag aagccgtttt 420 ttcgacccac cgtctacgcg cggatgtctc tcgagcgaac cacgtggttg ccgctttgga 480 cgaagagggc gttagggcca ttagtgacct catcggcccg gacgtttgct acgaaagcct 540 caagcaacga ctcatcaccg tcttcgctgt tcctcagtca gcacgttttc gttgtttcgt 600 tcaacccggt ggttgggaga tcggcgccca actcaactgc tgtgagatat gcgcaacatt 660 ttgccagacg gcatcggtga agatgccctc aagcagtttt ggctgcacaa attcagctgt 720 acaacgtcaa aacacagtaa ccaacactct tatcaaaagc atttttttta cataatcttt 780 gataactctt catgctgaat ttttctatca aaaaatcttt ggaatctgac tattattcca 840 tgagttatca cgttgtgaac acagagcaat tacacagtgt taagtctgaa ataatttact 900 aaatgaatta tcctgcaaaa taatgaaaaa tacagttgct gtgttttgac gttgtacagc 960 tgaattgccc cccgcgacta gtgccataat cgtcggccag gatgggtcgc tagaggatct 1020 cgccgcacgt gccgaccggg ttatagaggc cggtacctct tacgacctct atgccggcaa 1080 cagcgccgat gcgtcaaccg accgattccg cgcgatggag agcgcgatcg cggcactcac 1140 cacacaagtc gcgagtttgg taacactgca gtccgccgcc aagccttatc acagccgctc 1200 gttttcacat tcgagatctc gctccagaaa caacaacaat ctgtgttttt atcgcgaccg 1260 ttacggcgcg gacgcaaaaa ttgcaaacct ccgtacactt tcaaaccggc ggaaagctga 1320 taggggcggc ggaggcagat cagtctcccg ccgctcataa cagacgacgc gttttcgtca 1380 ctgacgcgcg ttccggtaga cgtttcctca ttgactcggg agcggacatc tctgtcctcc 1440 caccgtctgt tggtcaacga ccgatctccg gaattatcct caccgccgcg aatggacgaa 1500 agtcacttaa cctcgatctg ggactcacca ggccatttac gtggacgttc gagattgcag 1560 atgtcaacag aggcatcatc ggtgcagatt tcctccacta tttcggtcta ctagtcgatg 1620 ttcgccgaaa ccgtctagtc gatctgaact ccggactcgc atcaaaaatc gctactgtaa 1680 gtacggtccc atcaactatt tgcgtgatcg cgcaaccgca tacgtggacc aaattagtcg 1740 ccgagttccc cgaaattacg cgcaaatcgc cggttcccgc ctcgttcacg cataacgtcg 1800 agcacgtctt atcgactacc ggaccacccg tatttacgcg tccacgtcgc ctcgctcccg 1860 atcgtctcgt catagcgcgg aaagagttcg actttatgct gcgtgcaggt atttgccgac 1920 cgtcaaacag cgcttgggcc agtccgttac tgcttgtcgg taaaaaagac ggctcttttc 1980 gtccatgtgg cgattaccga cgtttaaacg ccgttactaa accggatcgt taccctcttc 2040 cgcacataca cgacttttcc agtaaccttt acggcaaaac aattttttct aaactggacc 2100 tagttcgtgc ataccatcaa attccagtcg cgcacgggga catcccgaaa actgctgtga 2160 caacaccttt tggtctatac gaattcccgg tcatgtgctt cggcctaaaa aatgcagcac 2220 aaacctttca gcgtttagtc aacgaaattc ttcgaggctt ggacttcgtg tatgcctaca 2280 ttgacgacgt gctaatcgca tcacattccg aatccgagca tgaaacgcat ctacgctccg 2340 tcatcgagcg atttaaaaaa ttcggtgtgg ccatcaacgt caacaaatgc gtgttcggag 2400 tccaacagtt gatttttctt aggcacctca ttaacggcga aggatgcagc ccgctgcaag 2460 agcgagtcga cgacgttcat aattggccac ttccgctgtc aaagaaagct cttcaacgtt 2520 tcctcggatc ggataactat taccacagat tcattccgaa cgcagcgcag ttacaagccc 2580 cgctgtacga cctcgtttcg tcggttaaac aaagagacgg aaaattgcag tggacgaaag 2640 acacacgcga tgcattcgaa aattcgcgca acgccctcgc gagtacggtt catttagctc 2700 accccaggcc ggacgccaac cttcgaattt ccacggacgc atcgaacacg gccattgtcg 2760 cagttttgga gcagtttaac aacaaccatt gggaacctct agattttttc tcaaagaagt 2820 tgacgcccac gcaggcccga tacagcactt tcgaccgcga acttctagcc gcgtatctgg 2880 cactccgtca cttcctacat ttggttgagg gcaggtgcac cgtactgttg accgaccata 2940 agccgctcac acacatgtta accgtcaaaa ccgacaagta cagtgacagg cagctaagac 3000 acatcagttt tatcgcgcag ttcgtacaac aggtcgaata cgtccaaggc gacaaaaatg 3060 tggttccaga cgccctttct cgaatcgaga ccgtcacttg tcacgcgcaa ctacccgatt 3120 tcgcgacgct gtctatggat caagccgaag atccagaatt acaaggtcta ctcaacggca 3180 cactcgccag ctctttgcgt ttggaggcac gtgacacagg ttcaggtccc gtctacttcg 3240 atgtatcagc acccggcaga ttgcgtacgt acgtacccgc gacacttcga cgccgcattt 3300 tcaaaatact gcacaaccaa gctcaccctg gcatcagagc cacccttgcc ttgatcaaag 3360 aacgacacag ttgggtggac atggattgtc aaattcgcaa ttgggttagg cattgcacaa 3420 cttgtcaacg gaccaaggtg caacgtcacg taatcgcacc ggtaatgcct ttcgtcacac 3480 ccaaacgtcg atttggtcat atccatattg atctggtcgg gccgttacct tcatccgacg 3540 gacacgagta cttactcaca gccatagaca ggtacacgcg ttggccagaa gcatacccac 3600 taaccaatat gtcagctcac gccgtcgccg acaaactagt atcgcagtgg ttttcgcgtt 3660 ttggtacacc cgacgtcgtt acaacggacc agggaaggca atttgagtct gagttatttg 3720 cagccctcag ccaaacttat ggtttccgcc atagtcgaac ttcgccttac cacccgcaag 3780 caaacggcct aatcgagcga ttacaccgac cacttaaagc agcgttgacc gcccacaaca 3840 atccgcaatg gatcaaaagc ttgccgacgg tgcttctagc actccgtagc attgttaaac 3900 cggaactggg gtgttcgccg gccgaaatgg tgtatggtac aacattacgc ttgctcggag 3960 aaatttttca cgccgttcaa cccgaaccac acgccccggc tttggtccgc attcttaagg 4020 agtcgatgtc attactccgc cccaccacag gtacagatca ttcgaaacgc tcgattttcg 4080 tcccggagga actgcaaacc gctacacacg tttttcttcg cgttgattcc acacgaaagc 4140 cgcttcaacc gcgatacgac ggccctttcg ccgtgttaga ccgcaacaac aaaaatttca 4200 agctgcaact tcataaccga acaagttgga tctcgatcgg ccgcttcaag cccgcttttc 4260 tactccgaga agaccccgtc gccgatcact catacgcctc cgctcacgtc gagtcacaat 4320 cgcaatcaac atcgacactt cacgaagtct caaccgccga gcatccgtac tactcaacca 4380 gcatcaagtc gcggattcac tccatacttc gcaaacctgg aactgagcgc ctaaagaaac 4440 aggttcgttt ctttttccca agggggaggt aa 4472 // ID Gypsy-6-I_HM repbase; DNA; INV; 8080 BP. XX AC . XX DT 25-DEC-2008 (Rel. 13.12, Created) DT 24-APR-2009 (Rel. 14.05, Last updated, Version 2) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-6-I_HM; TSD 4-bp. XX NM Gypsy-6-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-8080 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1978-1978 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. This sequence was derived from sequence data CC generated by TIGR, J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 166..7473 FT /product="Gypsy-6-I_HM_1p" FT /translation="MTRLRSGKICPFVEPTHRLIAGNNSHTSLNNSDSDSA FT DSLELSDTNPFKTQNSRSPEEKNNMSLEQILERLTLKPTQEIHPKIFKGTE FT TEDVTEWLASFGRIAQHNQWTEMRKLLAFPLYLEGSALVYYETLPAHIRND FT FNQIQEHFRQYYNNANIAWNRKMELFGLLQDADLASYITTLDKLSQQLGVD FT DETKLNLFIKGLKPHLRNALKLKQPVDYQAAVAFAKLQDTVGSDSTITRLE FT AKLDALLNSPQHIAAATYREHPILNQRIGELQTEIASLRRNNNRPSQHERF FT RPNGRNLRTSDGQVICNQCLKVGHTQRSCSEKIAPLSGRNHYQHASPQPHH FT WNHDTQNPVSFRNNSYPNRQSRHNEPRNSPHLDSRIPHRHTNTLESDPEEE FT PGSEIWGKVYKTKVLILMDTGARCSVLDETIFENFRNQVNRPLPYYGTLRT FT ANGAPLHVLGKTVCPINIGGYIFEHEMLIAKNLTHPVILGYDFMSQFNVKL FT DCGNHRVTFEGMFDIEMISKPIDNKFSTGIIVDDTPIRRIPQIDKFDVKWL FT PAPEPLLKENPLINQPTNAQFECKIQRENLTPKSDAPRTPDEAKEFQPLLS FT DRRPYPAVSIDYQDRFNNTKPNSNPVDLSITPICVTKEIKAVPKCETILTI FT AHALKTKGDCLFIPETNLFEDQNVHVFPTITHGVSNQVKLHVVNPSSDYVT FT IGKNKVIGVVISLPTKISEEYNAGLPDGLIDLLESKTNAAALNQEERLQLL FT TLLKNNQDVFGQGLHDIGQTDVIQHCIDTGNAQPIRQRPYRLAEAQRKSLD FT KHVEEMLANDIIEPSVSPWSSPVVMVPKKDGNFRVCIDYRKLNNVTKKDTY FT PLPRIDESLDMLHGAKYFSSLDLLSGYYQVRLDDESKEKTAFTSHKGLYQF FT KVLPFGLSNAPSTFQRMMNYILRDHLYKHCMLYLDDILIYSKTYKEHLEHI FT LMITEAVRQAGLKLNLLKCTFCTTEVKFLGHYINADGIKPDPENIKAVKDY FT PIPKNIKDVRAFIGLCSYYRRFIKGFSGIVLPLNNLLKKDTSFKWQIDQQK FT SFDQLKEALTTTPILVFPDFKRPFIIYSDASGEALGYVLAQCQGDSEKVIL FT YGGRSFTNTEIKYSTTEREALAVVVAIQKCRPYIYGQRFKIVTDHNSLKWL FT MSIKDPTGRLARWSLMLQSYDFEIEYRPGKMNGNADGLSRRSYDTVATLTV FT PGRPIDQVVKEQRNDPYYADLVSYLANKELPEDLKEAKKVLALEGNHYLDE FT NGLLFHHESKTSRNLTQLVVPQSLRNELITWAHDEPCGGHFGVTKTYEKIR FT TNYYWIGMYNDIQTWVKSCTTCAQRKRNPTTSKAPLLPIPVEGPWDVIAAD FT CLGPFPPSLKGNRYIVVFGCLFTKYVEAFAVPTIAASSIAEIFVDNIVFKH FT GAPRRFLTDRGSNFTSKLVKEVCEILNVKKSFTTAYHPQTDGFVERINGIL FT AQSLSMYVASNQRDWDVHLPAAVYAYNTSISASTGETPFRLTYGRDALLPH FT DTTLLPRKDPTDNTDLLIQRFVSDLRLLRNLAKENIQKAQTKMKLHYDQTT FT KSYPFQVGHKVWVYTPITKKGLTKKLTSFWHGPFRLIEKTSPVTFKVENMN FT NKELPTPIHVSRFKQWFGYEEKPTTDLALNTEVLTEATETMDEFCFEETNH FT KIQEEVNLAPEIEHQVDRMENLDSDIYKMEKIVKKRTRGGKIQYLIKWEGY FT PSSQNTWEPEENIFDPQVIQKYINSKKTSKSVHISAITTTNLCIRKKKNQT FT NCRVMPKVTLRVNKLWKLWCLALLPTFITGLFIGEVFDCTKVKPIGIYQLP FT EVTSCNHNMHTLNDSVKTFMADVYAYRPQTTTITLFHCYAEKVTLTCQSNF FT LNQKSKDLTSKRIPVTAGECLLAMNTKISPYGALQVDNLNTWRTVATDQFH FT CAWMRTKTQEYTHFFMKTYNGQITGRAVTLEQYVTKTLCYHALRRCVPSEW FT PESIIVWGNDNHDKEVMKKLGTYPIERIGDFILIRSLKVGGAIQLAEQKDY FT VLTLDNGMLLKDPKFPDDVFKQYKSTAANYSQKLANDPATAILEAHITIAL FT MTQKMNMISTWEQMCFIQTEISRIHRWMIAQFPTTSAEWVHQSQGVTVESA FT GDALLLSECINYTTYAIQYNRKIGNFCFEHFPITLPSSNITYFLEVSDRKL FT IRTSPRIPCKLRPKHTYLQDPKLNIMYEISAYGRVKIVETQIDHVLPSPNN FT PIPRIRGYNKDFLVEKPQRLSPYTVLQLISGSHETLQSLKTISDTNGGDVL FT TGIGTALGSALQATASGGSQIIKAFGGAIKDSLNGVSDLDEKLVRSIGDAS FT SSVLTAAGGAVKDVGEGAGSFFQKFLGGISGSILWAAILLIVIYLILNKPN FT LNYSLPCLRCFQIPLVNETVVQPRQDQSTTMSPRVKGRHPRNCSACKVVHR FT E*" XX SQ Sequence 8080 BP; 2639 A; 1650 C; 1594 G; 2196 T; 1 other; tttggaggca ctgcccggat acgagtggat tccatctctg ctcgggccga tcaattaaaa 60 actgtgtaag aaaacattaa cgtttgttca ttttattatt tgttttactt gcattcaaac 120 gaagatatta ttttaaactg taataacttt gctgtatata tatatatgac ccgtttacgc 180 tcgggaaaaa tatgtccgtt tgttgaacca acacatcgtt tgattgctgg caataattct 240 catacctcgc ttaacaactc cgactccgac tccgccgatt ccttagagct atccgatacc 300 aatccgttta aaacacagaa tagccgttcc cctgaagaga agaacaatat gtcgcttgag 360 caaatattag aaagattaac attgaagcct actcaagaaa tacatccaaa aatatttaaa 420 ggtacagaaa cagaagatgt taccgaatgg ttagccagtt ttggacgcat tgcccaacat 480 aaccaatgga ccgagatgag aaaactactt gcgttcccgt tatacttgga aggtagtgcg 540 ttggtttatt acgaaaccct tcccgcgcac attaggaacg attttaatca aattcaagaa 600 cattttcgtc agtattataa taatgcaaac atcgcatgga accgtaagat ggagttattc 660 ggcctattac aggatgctga tcttgcttcc tatattacca cgctagataa attaagtcag 720 cagcttgggg ttgatgatga gactaagctg aatttgttta tcaagggact gaagccacat 780 ttaaggaatg ctctaaaact caagcaaccg gtggattatc aggctgcagt agctttcgca 840 aaactacagg acaccgttgg atcagactcg actataactc gtctggaagc caagttagat 900 gccctactaa attcccctca acatattgca gctgccacct accgagagca tcctatccta 960 aatcagcgga ttggtgaatt acagacggaa attgcctcat taaggcggaa caacaaccgt 1020 cctagccaac atgaaagatt ccggccaaac ggacgtaatc tccgtaccag tgacggtcag 1080 gttatatgca accaatgcct gaaagtagga cacacccaaa ggtcgtgttc cgaaaaaatc 1140 gccccattgt cagggagaaa tcactatcaa catgcatcgc cgcaaccgca ccactggaac 1200 catgacacac aaaacccggt ttcattccgt aataattctt atccaaatag acaatcccga 1260 cacaatgaac caaggaacag tccacatttg gatagccgaa ttccccatcg acacactaat 1320 acactcgaat cggatccgga agaggaacct ggttccgaaa tctgggggaa ggtatacaag 1380 actaaagtgc ttattcttat ggacacaggt gctaggtgca gtgtattaga cgaaaccatc 1440 ttcgaaaatt ttaggaatca agttaacaga ccacttccct actatggtac cctaagaacc 1500 gcaaatggag cgccacttca tgtccttgga aaaaccgtgt gccccattaa tattggagga 1560 tatatatttg agcatgaaat gcttatcgct aaaaatttga ctcatccagt gattttaggc 1620 tacgacttca tgagccagtt taacgtcaaa ttggactgcg ggaatcatcg agttaccttc 1680 gagggaatgt ttgatataga aatgataagc aaacccatcg acaacaaatt ctccactgga 1740 ataattgtgg atgatacacc aatccgccgt ataccgcaaa ttgataagtt cgatgtcaag 1800 tggttgcctg cgcctgaacc gttgctaaaa gaaaatccgc ttattaacca acccacgaat 1860 gcccagtttg agtgtaaaat acaaagagaa aatctgacgc cgaagtccga tgcgccacgt 1920 acaccggatg aagctaaaga gttccagccg ctactcagcg accgccgccc atatcctgca 1980 gtcagcatag actatcaaga tagattcaac aacaccaaac ctaattcaaa tcccgtagac 2040 ctttctataa ctccgatatg cgtaacaaag gaaataaagg ctgttcccaa gtgtgaaacc 2100 atcctcacca tagcacacgc tttaaaaact aaaggtgact gtttatttat accagagaca 2160 aatctgttcg aggatcaaaa tgtccatgtt tttcccacca taacccatgg tgtatctaat 2220 caagtcaagc tgcatgtcgt taacccctca tcggattatg taactattgg taaaaataaa 2280 gtaattgggg ttgtaatttc attacctacc aaaatctcgg aagagtataa cgctgggttg 2340 ccggacggct tgatagactt gttggaatca aagactaacg ccgctgcgtt gaatcaagaa 2400 gaacgcctcc agctgttgac cttactaaag aataaccaag atgtttttgg tcaaggtttg 2460 cacgacattg ggcaaactga tgtaatacag cactgtattg acacgggcaa tgctcagccg 2520 attagacaaa gaccttaccg tcttgcggaa gctcagagaa aatcactgga caagcacgtt 2580 gaggagatgc tagcaaatga tatcattgaa ccaagtgtca gtccttggtc tagtccggta 2640 gtaatggtac cgaagaaaga tgggaatttc agggtgtgta ttgattatag gaaactaaac 2700 aacgtaacaa aaaaggatac ctatccactc ccacgaatag atgaaagtct cgacatgcta 2760 cacggggcga aatatttttc ttcactagat cttctcagcg gatattatca ggtgcggctc 2820 gacgacgaat ctaaagagaa aaccgcattt acatcgcata aaggattgta tcagtttaaa 2880 gttcttccct ttggtctgtc gaatgcacca tcaacatttc aacgaatgat gaactacata 2940 ctccgtgacc atttatacaa acattgcatg ctttatttag acgatatctt gatatattca 3000 aaaacgtata aagagcacct ygaacatata ctgatgataa ctgaagccgt tagacaagct 3060 ggtttaaaat taaacctact gaaatgcact ttttgcacaa ctgaagtgaa attcttagga 3120 cattatatta atgcagatgg tattaaacca gaccctgaaa atatcaaagc agtaaaggat 3180 tatccaatac ctaaaaatat aaaagatgtt cgagcgttta ttggcttatg tagctattat 3240 cgaagattta ttaaaggatt ctcaggtatt gttttgccgt tgaataactt gttaaaaaag 3300 gacactagtt tcaaatggca gattgatcag caaaagagtt ttgatcagct caaggaagcc 3360 ctgacaacga cacccatttt ggtttttccc gacttcaaga gaccttttat tatatattca 3420 gacgcaagtg gcgaggcatt agggtatgtt ctagcgcaat gtcaagggga ttccgaaaaa 3480 gttattctgt atggtgggcg tagttttacg aacacagaaa ttaaatattc cacaactgaa 3540 cgggaagcgc tggccgtcgt cgttgcaata cagaaatgta ggccgtacat atatggacaa 3600 aggtttaaaa ttgtaaccga ccataattca ttaaagtggc taatgtcgat caaagaccca 3660 accggtcgcc ttgcaaggtg gagcttaatg ttacaaagtt atgactttga aattgaatat 3720 agaccaggaa aaatgaatgg taatgctgac ggactatccc gacggagtta cgacacagtt 3780 gcaacattga cagtaccggg tagaccaatt gaccaggttg taaaagaaca acgtaacgat 3840 ccctattacg ctgatttagt atcatatcta gccaataaag agctacccga agacttaaaa 3900 gaagcaaaga aagtgttggc attagagggt aaccactact tagatgagaa tggtttacta 3960 ttccatcatg aatcaaaaac ctcaagaaac cttactcagc tagttgtacc acaatcttta 4020 agaaatgagt taataacctg ggcccacgac gaaccatgtg gaggacactt cggggtgaca 4080 aaaacctatg agaaaattag gactaattac tattggattg gtatgtataa cgacatacaa 4140 acctgggtaa agtcatgtac cacttgcgct caacgaaaac gaaatccgac tactagcaaa 4200 gcacccctac taccaattcc tgtcgaagga ccatgggacg tcattgcagc cgattgtttg 4260 ggacctttcc ctccttcttt aaaaggaaac cgctatatag ttgtgtttgg ttgcttattt 4320 acaaagtatg tggaggcttt tgcggtcccc accattgcag catcgtctat tgcagagata 4380 tttgtagaca acattgtgtt caagcatgga gcaccaaggc gatttctaac cgatagaggt 4440 agtaatttta cgagcaaact tgtgaaagaa gtatgcgaaa tattaaatgt aaagaagtct 4500 tttacaaccg cttatcaccc ccaaactgac ggttttgtgg aacgcattaa tgggattttg 4560 gctcaaagtc tttccatgta tgttgcctct aaccaacgag attgggatgt ccatcttcca 4620 gctgctgtat atgcatataa taccagtata tctgctagta ctggagaaac tccttttagg 4680 ttaacatatg gcagagacgc tctactaccc catgatacta ccctactacc aagaaaggac 4740 ccaacagaca acactgactt acttattcaa cgatttgtgt ctgacttaag attattacgt 4800 aacctcgcga aagaaaatat tcaaaaagca cagacaaaaa tgaaactaca ttacgatcaa 4860 acaactaaat catatccgtt tcaggtcgga cataaagtat gggtatatac accaatcaca 4920 aaaaagggtt taacaaaaaa acttacaagt ttctggcacg gtccgtttcg gttaattgaa 4980 aaaacatcac ccgtgacgtt caaagtggaa aatatgaata ataaagaact gcctacgcca 5040 attcatgtgt ctcgttttaa acaatggttt ggttatgaag aaaaaccaac tacagaccta 5100 gctctaaaca ctgaagttct caccgaagcc acagaaacga tggacgaatt ctgttttgag 5160 gaaactaacc ataaaataca ggaagaagtt aatctcgccc ctgaaattga acatcaagtt 5220 gacagaatgg agaatcttga ttccgatata tataaaatgg agaaaattgt taaaaaaaga 5280 acaagaggag gaaaaataca atatctaatt aaatgggaag gttatccatc cagccaaaat 5340 acttgggagc ccgaagagaa catttttgat cctcaagtaa tccagaagta tattaatagt 5400 aaaaagacta gcaaaagtgt acatatctca gctataacca cgacaaatct gtgcatacgt 5460 aagaagaaaa atcaaactaa ttgtagggtc atgccgaaag taaccctcag agtaaacaaa 5520 ttgtggaaat tatggtgtct ggctctttta ccaactttca ttactggctt atttattggc 5580 gaagtatttg actgcactaa agtgaaacct attggaattt accaactacc agaagttacg 5640 tcttgtaatc acaatatgca tacgctaaat gactcggtga aaacgtttat ggcagatgtc 5700 tatgcttata gacctcagac aaccacaata actttatttc attgttatgc cgaaaaagtc 5760 acacttactt gccagtcaaa ttttctcaac caaaaatcta aagacctaac atccaaacga 5820 atacccgtaa cagcaggaga atgtctttta gccatgaata ctaaaatatc gccttacgga 5880 gcattacaag tggataacct caatacatgg agaacggtag cgacagatca atttcattgt 5940 gcctggatgc gcacaaaaac acaggagtat acccattttt tcatgaagac atataacgga 6000 caaatcaccg gacgagcggt aaccctagaa caatacgtca ctaagacttt atgctatcat 6060 gccctgcgtc gttgcgttcc cagtgaatgg ccagaatcta taattgtatg gggaaatgac 6120 aaccacgata aagaagtcat gaagaaatta ggtacttatc caattgaacg tataggtgat 6180 ttcattctaa tcaggagtct caaggtaggt ggagctatac agctagcaga gcaaaaagac 6240 tacgtgctta ccctggataa cggcatgctt ctaaaagacc ccaagttccc agatgatgtc 6300 tttaaacaat ataaatcaac cgctgcaaat tactcacaaa aattagcaaa tgatccagct 6360 acagcaattc tcgaagccca cattacgata gccctcatga ctcaaaagat gaatatgata 6420 agtacttggg aacagatgtg ttttatccaa acggagatct ctcgcataca tcgatggatg 6480 attgcccagt ttcctactac atcagcagag tgggtgcacc aatcccaagg agtaactgtg 6540 gaatcagctg gagatgcttt actattatcg gaatgcatta actacacgac ttacgccata 6600 cagtataatc ggaagattgg caatttttgt tttgaacatt ttccaattac cttgccaagt 6660 tcaaacataa cttattttct agaggttagt gatcgaaaat taataaggac aagtccccgt 6720 attccttgta aacttagacc aaagcacacg tacttacagg accctaaatt aaatatcatg 6780 tacgaaatat cggcatacgg acgtgttaaa attgtggaga cccagataga ccacgtgctt 6840 ccctcgccaa acaacccgat tccccgtatt cgaggataca acaaagattt tcttgtcgag 6900 aaacctcaac gattatcacc gtacacagtc ttacaattaa tctcagggtc acatgaaaca 6960 ctgcagtcac ttaaaactat tagtgacacc aacggtggtg atgtcttaac tggtattggc 7020 acagctttag gaagtgcctt acaagccacc gcaagtggag gaagccagat cattaaagcc 7080 tttgggggtg ctataaagga ttcattgaat ggagtttcag acttagatga gaagctagtc 7140 cggtcaattg gagacgcttc ctcatcggtg ttaacagccg ccggtggggc tgtgaaagat 7200 gttggagaag gtgcaggatc cttctttcag aaattccttg gtggtataag cggatccatt 7260 ctgtgggcag caatactttt aatagtaata tatttaatct tgaacaagcc taatttaaat 7320 tattccctgc cttgcttaag atgttttcaa ataccattgg ttaatgaaac tgtagtacag 7380 ccacgccagg atcagtccac gacaatgtct cctagagtaa aaggacgtca cccacgtaat 7440 tgctcagctt gtaaggtagt tcatagggag taaaaagaat accggtgaac ggtaatggca 7500 ggaattaaat cttgtataat tcctggatgg agatgtgaga cgcctccaac caccacgagt 7560 ttcctgaaaa atgcaatttt tctgtttttc ttgcgctgtg ttgctgtttg caacaatgct 7620 gtttagcaaa aaatttttac gataattaca atgaactttt atagtgctgt ttggaacaaa 7680 cgatgagctt gatgtgcaat gatgctgttt tcaataattt taatgatgct gtttcgcaat 7740 gatttttatg atgctgtttt attacgatga atttttatgt tgctgatgtg caatgatgct 7800 gtttcgcaat gatttttatg atgctgtttt attacgatga atttttaagt tgctgatgtg 7860 caatgatgct gtttcgcaat gatttttatg atgctgtttt attacgatga atttttaagt 7920 tgctgatgtg caatgatgct gtttcgtttt ccgagctgtt gattttcctt aacgagatat 7980 tggcgattac ccgtttgcta ctcacttgag attatactat tgttgctgtt ttgatgacac 8040 ttttaagtcc gaggacggac ttttcgttcg tcgtgagtaa 8080 // ID P-4_AP repbase; DNA; INV; 4074 BP. XX AC Contig40297; XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 4) XX DE P-like DNA transposon. XX KW P; DNA transposon; Transposable Element; P-4_AP. XX NM P-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4074 RA Jurka J.; RT "P-like DNA transposons from pea aphid."; RL Repbase Reports 9(8), 1800-1800 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 2565..3383 FT /product="P-4_AP_2p" FT /translation="MFEIAAKLFTENLNFKYILTYYFSQDHIELLFGRIRQ FT RYGANNNPNVIQFKTAIKQILLKNAITCSTNNNCNTFDEDIISTIFPFKWN FT KNKNNVLPLAVAGDDNEDDKMEILNTCDLLNNTNSSYEDCKKNIIYYITGY FT AIKKNIFILDCNSCKQSLIKPTSDHNYNHSADYSKFIDFKNNGGLISPSES FT SFKVVYQTEIFLLSLTNNLQTLNIPNLDLKITSLVNKKFALDKNIFKYLEC FT NNVFTLDRPHKLVLIALLIQKYLSIRLPTFIW" FT CDS join(385..648,652..1482,1486..2538) FT /product="P-4_AP_1p" FT /translation="MNLNNNNNNNKIHVLCYYFVIIYEQILHIKHTVFYFI FT NINCDTCIRLTVSTVVIIFFKYILHITMVYYCCAWGCSNKRVPNSGIHFFA FT VYLKKTDINETYKLIISKIVSFPLKNKERTENWLKNINRKGFIPSEYSRLC FT SKHFLPSDFKPYNKENGRLRLNDNAIPSVFNSTTEEVHCQSIIENYEVEVE FT VPLDLSMSKLDEELPVALQRSVLVSDIITTPRKKTTVLFPELQITPKRPSC FT RTPKRGRPQNSIVTPKTKKNKTKLKTLQKKVLRLKKRINNLKELLQDIKEK FT GLIEKNGFNVILEDFDGMSREIFNNQLKNKNKNAKGRRYSDELKKFALTLN FT YYSPKAYKYCRLVCISKILFIKYIILNTIFFSSTIFKLPHPTAIRSWTSSV FT NGEPGFFSEVFLFLKTLEIENKECNLVFDAMCIKKQVIWDKQAHKFTGYCD FT YGDNLTVESSETAATEALVFMLVSLNGKWKLPIGYFLQNKTNAVRQAELIK FT TALTLSHQSGLNVRGITCDGAFTNFSTLKILGCEFGEGFDNIKSWFSHPID FT GSQIFFIPDACHMIKLARNTLGNCLVIESNTGHIKWCYFEYLHEIQSNLSL FT KFANKITGVHLNWKSNKMKVKLAVQLLSSSTAKAIQYLKDNNYKQFKDSDE FT TIVFCQRLDQLFDFLNSRNPFSKGYKSPIFKSNLKFLKSKIIPFINYLYSL FT KYKNKLLFTTNKKHLF" XX SQ Sequence 4074 BP; 1584 A; 531 C; 535 G; 1424 T; 0 other; cacagatata taaaaatact agatagccga cttctatagt acatacgtat aaaataatgt 60 ctccaacaaa atggccgcca atcgttatct aagaggtgac tgtttgcaaa tttaactgat 120 atgataagtg ataagatact attttggagg gctataatat tttaaatttt gttgagacgt 180 aagacttaag aaaatttttt ttccgaatta tatttatctg gaactatttg gttagagcta 240 aaattcggaa ccttttatta tttataataa taaatttaaa tttaaatatt atattttatt 300 gtatttgcat gtaattacat agaacaatat ggtaatcgcg agttatttat aactatttag 360 catctcaaac gcataaacaa cttgatgaat cttaataata ataataataa taataaaata 420 catgtattat gttattattt tgtaattata tacgagcaaa tattacatat taagcatact 480 gttttttatt ttataaatat aaattgtgat acatgtataa gacttacagt aagtacagtt 540 gttattatat tttttaagta catactacat attactatgg tttattattg ctgtgcttgg 600 ggctgttcaa ataaaagagt gcctaacagt ggcattcatt tttttgcgta agtatattta 660 aaaaaaactg acatcaatga aacttacaaa ctaattattt ccaaaattgt cagctttcct 720 ttaaaaaata aagaacgtac tgaaaattgg ttaaaaaata ttaataggaa agggtttatt 780 ccatctgagt atagtaggtt gtgtagtaag cattttctcc ctagtgattt taaaccttat 840 aataaggaaa atggtcgttt acgtttaaat gataatgcaa taccatcagt atttaattca 900 acaactgaag aagttcattg ccagagtata atagaaaatt atgaagttga agtagaagtt 960 cctcttgatt tgagtatgtc taagttagat gaagaacttc cagtcgcact tcagagatct 1020 gtattggtca gtgatattat aactacacct agaaaaaaaa caacagtatt atttccggaa 1080 cttcaaatta cacctaaaag accatcatgt agaacaccaa aaagaggtag accccaaaac 1140 tcaattgtca ctccaaaaac caaaaaaaac aaaacaaagc ttaaaacatt acaaaaaaaa 1200 gtacttagac ttaaaaaaag aattaataac ttaaaagaac ttttacaaga cataaaagag 1260 aaaggtttaa ttgaaaaaaa tggttttaat gtcatattag aagactttga tggcatgtct 1320 agggaaatct tcaataatca gttgaaaaac aaaaacaaaa atgctaaagg tcgaagatac 1380 agcgatgagc ttaaaaaatt tgcacttact ttgaattatt actctccaaa agcttacaag 1440 tactgtcggt tagtatgcat ttcaaaaatt ctctttatta aataatatat tattttgaac 1500 acaatttttt tttctagcac aatatttaaa cttcctcatc ctacagcaat aagatcttgg 1560 acttcatctg taaatgggga acccggtttt ttctctgaag tttttctttt tttaaaaact 1620 ctagaaatag aaaataaaga atgtaatctt gttttcgatg caatgtgcat aaaaaagcaa 1680 gtcatctggg acaaacaagc acataaattt acaggatact gtgattatgg tgataatctt 1740 acagttgaaa gtagtgaaac agcagccact gaggcattag tttttatgct cgtaagttta 1800 aatggtaagt ggaaactacc aattggatat ttcttacaaa ataaaactaa tgctgtgcga 1860 caagcagaat taattaaaac agcacttaca ctgtctcacc aatcaggact aaatgtaaga 1920 ggtatcacct gtgatggggc atttacaaat ttttcgacat tgaaaatttt gggatgtgaa 1980 tttggtgaag gatttgataa tatcaaatct tggtttagtc atcctatcga tggatcacaa 2040 atttttttta taccagatgc atgccatatg attaagttgg cacggaatac gttaggaaat 2100 tgtttagtta tagaatctaa tactggtcat attaaatggt gttattttga atatttacac 2160 gagattcaat caaatttatc tttaaaattt gcaaataaaa taactggtgt acatttgaat 2220 tggaaaagta acaaaatgaa agtgaaattg gcagtccaac ttctaagctc atcaacagct 2280 aaagcaattc aatacttaaa agataacaat tataaacagt ttaaggatag tgatgaaact 2340 attgtttttt gtcaaagatt agatcagtta tttgattttt taaattccag gaatccattt 2400 tcaaaaggtt ataaaagccc tatttttaaa tcaaatctaa aatttttaaa atctaaaata 2460 attcctttta ttaactattt atacagtctt aagtataaaa acaaactgct tttcactaca 2520 aacaaaaaac atttatttta gggtttgcaa ttggtgtaaa atcaatgttt gaaatagctg 2580 caaaactgtt tacagaaaat ttaaacttta agtacatcct tacatattat ttttcacaag 2640 atcatattga attgttattt ggacgtatac gtcaaagata tggagcaaat aacaatccca 2700 atgtgatcca atttaaaact gcaataaaac aaatattatt aaaaaatgct attacatgtt 2760 ctactaacaa caactgcaat acttttgatg aagatataat atccacaatt ttcccattta 2820 aatggaataa aaataaaaat aatgtactac ctttagcagt agctggagat gataacgaag 2880 atgataaaat ggagatttta aatacatgtg atttattgaa taatacaaat tctagttatg 2940 aggactgcaa aaaaaatatt atatattata ttactggcta tgcaattaaa aaaaatattt 3000 tcatattaga ttgtaacagt tgtaaacaat cacttataaa acctacaagt gaccataact 3060 ataatcattc agcagattat tctaaattta tagattttaa aaataatggt ggtctaatat 3120 ctccctcaga aagttcattt aaagttgtct atcaaactga aatatttctt ttaagtttaa 3180 ctaataattt acaaacacta aatataccaa accttgactt aaaaataact tcccttgtta 3240 ataaaaaatt tgcattggac aaaaatatat ttaaatattt agaatgtaat aatgttttta 3300 cattagatag accccataaa ttggttttaa ttgcattgct tatacaaaaa tatttaagca 3360 taagattacc tacattcatt tggtaaaatg tattctacag atattttaaa cccaataagt 3420 aaccgtcata aattgacgaa acaaatttta tttatgaatc aataatattg ttgtatctat 3480 acctacgaga tgtagtttaa aatatctatt atttctttag ttatgtatta taatttaatt 3540 ttagtaaacc caaggaatgt acttatgttg aaaaatctaa atttaaaatg tattttaatt 3600 ttattaagat taaataattt actttatatt attaatttac tgattaaata aagcacttgg 3660 taattttttt ttttaattgt tatactttat tagaaattat atccctcggt atactggtat 3720 ttttattaca ataaattatt ttttgactca cacatgtatc tttgaagctg taccaatctt 3780 caagaacaca agcaagataa ttgatgaaaa tttttcttgc ataaaccata tataaaaact 3840 taactccact caaaatctgt ataaaataat aattgttttt aaagtttatt aagtataaac 3900 aatataatat tttaattata agtacaatag tatattgtat aacactgttc ttcaattgct 3960 tatcatcaat aatataaaac ataaacattc acctcttaga taaggatggc ggccatttta 4020 ctggcgacaa taataatatc gaagtcggct atctagtatt tttatatatc tgtg 4074 // ID BEL-2_AA-LTR repbase; DNA; INV; 395 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_AA_; KW BEL-2_AA-I; BEL-2_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-395 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 854-854 (2011). XX DR [2] (Consensus) XX SQ Sequence 395 BP; 120 A; 89 C; 73 G; 113 T; 0 other; tgtctgagat tattaattgt tattttttac ctttcttgaa tttacttttg aaaatttgaa 60 tttgccaata cagactaata ctaaataaaa gtgtatatct agataaacga tctaacgatc 120 acattgccag atgaacagga aatagatgag gcctctcgta ggtcccagtt accactacac 180 acgaataaaa tactgtaccc aatagctttg taagttagtt gaaatatagc agtcgatcgg 240 aaaggaacac tttctcgcgt aggccagtcg tgagccgaaa catccgaagt tccctaaagt 300 ccggtcttcc acagtcgcgt taaaattctc tatcgcttcc gagatccgcg agttctggca 360 agcgccaccg tccgccgaat atcttcccgt gaaca 395 // ID Copia-95_AA-I repbase; DNA; INV; 3451 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-95_AA_; KW Copia-95_AA-LTR; Ty1_copia_Ele173; Copia-95_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3451 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [916-1419] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 577..2094 FT /product="Copia-95_AA-I_1p" FT /translation="MGKKVVFVNDSVLFLNFGEVIARGTVVDGLYCMDFSH FT DSIVDPSALIGTKQVSMEKLHERLGHLSFSGIERLLKNDMVEGINRSLSGE FT DTQKVCESCLAGKQSSKRFEHRELPRSSRPLEIIHSDVCGPMEVKTYNGYR FT YFVTFTDDYTHFTLAYLMARKREVIEKFQEFEALVEGHFGRSISKLRCDNG FT GEYIGNEFKDFCKENGIQMIFTVPYTPQQNGVSERINRTLMEKVRAMLHGS FT GMPKEMWGEALHCAAYITNRCPTNGLVENKTPFEMFFDKRPNLENLRIFGC FT TAYAQIPKEKRQKLDSKTKRLVFVGYANNGYRLWNKETRSIESSRNVIFDE FT ERLYEDQSDRLSNGDDLVSPEDLETVQLIPVSRCKEDESVEKNAAMEETIH FT EQSNVSEENTLTEVSLESDYESTFEEQNSEDEPEMQQVRRSERERKVPARY FT TDYKMAKIASSFSEVNPIPQTIEELKQRDDWEQWETALDSEMGSLKENQTW FT TIADGDVPKE" FT CDS 2226..3452 FT /product="Copia-95_AA-I_2p" FT /translation="MESVRTILSMANEMKLLIHQMDVKTAFLNGKLDEVIY FT MQLPKDEVGKGQTVLLQKSLYGLKQASRSWNKRFDAAITNLGFTQLKCDSC FT VYKNKQKGLILILYVDDLLIVGANLDEIDWIKSELGKLFHMKDLSEVKHFL FT GMDITRDMKTQTLKIAQSGYTARILRRFGMFDCKPVGTPLEANNKWTKNEG FT NETQHPYKELLGCLQYLAITTRPDICAAVSALSKYQSRPSDAHWSGLKRIL FT RYLRGTMDTCIVYQHQNNANVLIGYADADFANDEDNRKSISGYAFQVYGNL FT VAWATKKQQTVSLSSCEAELIALSCAVKEGLWLTNLLKELDVDSIPLIIME FT DNIPCIRYAEEPRSHQRMKHLDIKYMFIRDLIKKRELKLKLLASKEQPADA FT FTKGLPKSQQLRGSV" XX SQ Sequence 3451 BP; 1102 A; 640 C; 889 G; 819 T; 1 other; aattcgacaa cgtcaagggc gctctgacgg tattgtcgaa tgaggaactc tgtcgtaaac 60 ccatttatga catcaagcgg atgttgctgg acgcagagct tgccagtgga gcggtacaag 120 gaccgtcggg ggcatcaagc agtattgcgc tgaaaacgga gaagaagaac gaaatcgaat 180 gtttcgggtg cggtggagtc ggccattata agaaccgttg cccgaagaag aagaagccaa 240 ccaagaagag gaccggcggc cacgccctga tggccggcag aaggatcgag agagcgagag 300 cgaaaagagc agcgaatgtg cgattcgtcg tggactccgg ggccaccgag cacatggtca 360 acgacgaacg gccgctggag aacgtgaagg agctggacga gcccgctgca attagtacag 420 caaaatccgg ccaggtgctt tgggccacaa agtcgggaaa aatgagattg acttccataa 480 ttggggtgaa taaaatgaaa ccgattacat tgtataacgt cttatttatt cctggtctgg 540 attcgaatct cctttcggtt aagaaaggca aatgaaatgg ggaagaaagt cgtttttgta 600 aacgacagtg tcttattcct gaactttggt gaggttatcg ctagggggac ggtagtagat 660 ggactttact gcatggattt ctcccacgat tcgattgttg acccatcagc gttgattgga 720 acgaagcaag taagtatgga gaagcttcat gagcgattgg gtcacttgag cttttcgggg 780 atagaaaggc tgcttaagaa cgacatggtt gagggaatca accgtagttt gtctggagaa 840 gatactcaaa aggtttgcga aagctgttta gcggggaagc aaagcagcaa acgatttgag 900 caccgggaac ttccaagatc aagcaggccg cttgaaatta tacattccga tgtttgcggt 960 cccatggaag taaaaacgta taatggctac cggtattttg ttacatttac cgatgattat 1020 acgcacttta ctcttgctta tttgatggca cgaaaaagag aagttatcga gaaatttcaa 1080 gaattcgaag ctttggttga aggacacttt gggcgaagta tatccaagct acggtgcgat 1140 aatggtggtg aatacattgg aaatgaattc aaagacttct gtaaagaaaa cggaattcaa 1200 atgattttca cagtaccata cacaccacaa caaaatggag ttagcgaaag gatcaaccga 1260 actttgatgg agaaggttcg tgcaatgtta cacggaagcg gaatgccaaa ggaaatgtgg 1320 ggagaagcat tgcattgtgc tgcttacatc acgaatagat gtcccaccaa tggacttgta 1380 gaaaacaaaa ctccatttga aatgttcttc gacaaacggc ctaatttgga aaatttgagg 1440 atctttggct gtacggccta cgctcagatt ccgaaggaaa aacggcaaaa gttagactcg 1500 aaaaccaaac gccttgtttt tgtgggttac gcaaataacg gatacaggct ttggaacaaa 1560 gaaacccgat caattgagtc ctcaaggaat gtgatattcg atgaagagcg gctctacgag 1620 gatcagtctg atcgtttgtc taatggagat gatctggttt ccccagaaga tcttgaaaca 1680 gttcagctaa ttcctgtatc acgctgcaaa gaggacgaat cagttgagaa gaatgcggcc 1740 atggaagaga cgattcatga acagtccaat gtcagtgaag agaatacgct tacggaagta 1800 tcattagaaa gtgactatga aagcactttt gaagagcaaa attcggagga tgaaccggaa 1860 atgcagcaag tgcggcgaag tgagcgggag agaaaggtac cggctaggta tacggattac 1920 aaaatggcaa aaatcgcttc atcgttttct gaagtaaatc cgattcctca aaccattgag 1980 gaactgaagc agcgcgatga ttgggagcag tgggaaactg cactagacag cgaaatgggg 2040 tcactgaagg aaaaccaaac ctggactatc gcggacggag atgtaccgaa ggagtgaaac 2100 caattcagtc taaatgggtt ttcaamgtca aagaagatgg acgctataag gcacggttag 2160 tggccaaggg ctgttcacag cgtcctggat tcgactatag cgaaaccttt tctcctgttg 2220 cgagaatgga gagcgttaga acgattctat ccatggctaa tgagatgaag ttgttaatcc 2280 atcaaatgga tgttaagacg gcatttctta acggaaagtt agatgaggtt atctacatgc 2340 aactaccgaa ggacgaagtt ggaaaaggtc aaactgttct tctgcagaaa agcctgtatg 2400 gcttaaagca ggctagcaga agttggaaca aacgttttga cgcggcgatt acaaatctcg 2460 gatttaccca gctgaaatgt gactcctgtg tgtacaaaaa caagcaaaag ggattaattt 2520 tgattttgta cgtagatgac ctactgattg taggagcaaa tctcgatgaa attgactgga 2580 taaaatccga acttggaaag ttattccata tgaaggattt atcagaagtc aaacatttcc 2640 tcggaatgga cataaccaga gatatgaaaa cccaaacgtt aaaaatcgct caatctggct 2700 acactgcacg gattttgaga cgatttggaa tgtttgattg caaacctgtg gggacaccac 2760 tagaagcaaa caacaagtgg acgaagaatg aaggaaacga aacacaacac ccatacaaag 2820 aactattggg ttgcttgcaa tacctagcaa tcactacacg gcccgatatt tgtgctgcgg 2880 taagcgcatt gagtaaatat cagagccgtc catctgatgc ccattggagc ggtttaaaac 2940 gcatactacg ttacctacgt gggacaatgg atacatgtat agtgtatcag catcaaaaca 3000 acgccaatgt gttgattgga tatgccgatg cggattttgc aaatgatgaa gataatcgca 3060 aatctatatc tggatacgct ttccaggttt atggaaattt ggttgcatgg gcaacaaaga 3120 agcaacaaac ggtaagtttg tcgtcttgtg aagccgaatt gatagcatta tcatgtgctg 3180 taaaagaggg tttgtggtta acaaaccttt taaaggaatt agatgtggat agcatcccat 3240 tgattattat ggaggacaac attccttgca tcagatatgc ggaggaacct aggagccacc 3300 aaaggatgaa gcacttggac atcaagtata tgttcatccg tgatttaatc aagaagcgcg 3360 aactgaagct gaagctcctg gcatccaagg agcaacctgc agatgcattc actaaagggt 3420 tgccgaaaag tcaacaattg aggggaagtg t 3451 // ID Gypsy-54_AA-LTR repbase; DNA; INV; 417 BP. XX AC AAGE02021233; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-54_AA_; KW Gypsy-54_AA-I; Gypsy-54_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-417 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021233; Positions 26306 26722. XX SQ Sequence 417 BP; 135 A; 66 C; 113 G; 103 T; 0 other; tgtagagaag tgatgaagaa gtaacaaatg acattgttat gagaaaaaaa aatccgaaac 60 gacttcatca taaattagtc tcgattcgac attcatgtca tatccctcag aatcgggcaa 120 cacaaactgc aagaaaacag agcatgtatt tgtgtgggtg gaatggtggt tcgattgatg 180 ggataatgtg tgtgaatgag aaaagagtga aagtggagaa gtgtgtgtga gactcagtat 240 tcgctcgggt aacagcggta tccgtttagg attgtgagat cattctttcg actaatgtgc 300 tagagtacag ttgtaccaaa gtttagtgcg cgaagtgaaa aaaaaaaatg gactcggaca 360 gcgtgtgctc cgaatccggt ggctctccag cgcgagaagt tggctcggaa cacgaca 417 // ID TTAA12_AP repbase; DNA; INV; 590 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 0) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA12_AP. XX NM TTAA12_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-590 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2077-2077 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 590 BP; 191 A; 90 C; 98 G; 211 T; 0 other; gggggctgga gcgtaatcaa ttctaaggag taaaaactag gcattttata tttctgttta 60 tctgggtctt gtgtatgtgt tgaaagtaaa tgaacattag tgtacgtaat tcgtagtact 120 gatcacgttt tataggggca tcatagtgac cgggatgtac gtctgtaatt ttacctacgc 180 gcatgcacat gtcaccatat attgctttat aaaatgacaa acattttttt tcaaaaaata 240 ctacctctga cttcaatcac tacgctcagt aggatgttcc gattttaatt ttgaaagcag 300 atttgaattc tcctctaaat ttactatgct ttaactgtcg tacatttttc atattctata 360 tcaatgtatt taaatttgaa ttatgaatat gatctatttt tactcttttt gttagtaaat 420 ttctaagtaa aaaaaaataa aaatcagaaa aatgaaattg tcaaagcata gataatttag 480 tgacaaattc aaatttgctt tcaaaattaa tatcggaaca ttatattggg cgtagtgatt 540 gaagtcgtga gctatgtttt cgaccgaaaa tgaatcacgc tccagccccc 590 // ID hAT-55_HM repbase; DNA; INV; 3716 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-55_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3716 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2043-2043 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 419..3058 FT /product="hAT-55_HM_1p" FT /translation="MFGNKKKKSGWENKKQRETKENERKKQANSILSFVQQ FT SCSSNISQATSNIDEPTIKKMCFDQSTINSINECNMDKSSKFDEYSKASTS FT NLEESQEAAISDLVKDPTEVFISTVDGSERTICETSEDLEVPLLTVNEEDP FT AYEVSSDPGEWKFITDNMRQFLVKKGPVQIDNFAFPVNNKGRQFSPSLYNR FT KLPNGEHVVRSWLIYSQSQDSVFCFSCKLFSKNRNNWAVNGSSDWAHLSRD FT LKSHESSGIHFDSIFMWNELRLRLKTLNTIDAAQQRALNIEKNKWQEILKR FT IIAILKFLASQSLAIRGTSDKMYDRNNGNFLKLFELLATFDQTSANHIQTI FT TQRQTKVHYLSNNIQNELIQLISKNIKEYIVQQIKISKYYSIIVDCTPDQS FT HVEQMSIIIRFVFQDKNRFEIREHFLGFIPVVDTSGKGLTEKILKELEDNQ FT IPIEDMRGQGYDNGANMKGKRLGVQSRILQLNPRSVFVPCACHSLNLVVND FT AASASGEVASFFLIIQQIYVFFSSSPSRWDILKQHVKITLKPVCSTRWESR FT ISAIIPLRFQLEHIYDALVNIHLDENRDNQTKHEAQSLAKKVSCFKFICSL FT IVWYDILSKVNIVSKIMQSPNLNLLNCNNALKEVLVFLNNYRQDETFVKVV FT DEAKKIAEILMIDSKFPDVSTIRIRKKKGHFDYESNDLPIIDSQQNYKVNF FT FFYILDTAINSIDERFEQLKLHVEKFQILYNIYKLKSENRTNILKNCMDLH FT LLLTDNDKRDIDGLELCEEIINLSAVLPSQMEPLEILNYLHENNITTIFPN FT LFICLRILLTLPVSVASGERSFSKLKIIKNYLRSSMSQERLVGLAMMSIES FT EICDSLNYDDLINDFAELKTRRVNFK*" XX SQ Sequence 3716 BP; 1341 A; 564 C; 602 G; 1209 T; 0 other; cagggccgtg ccgtggcatg gagcaatgga gcaaccgctc caggcgcctg agccacgaag 60 gcgcctttca aatatttcac gtttgttcat ttacgtctaa gaatattatt tttctgatat 120 gtaagcaact ctttatgttt tttccttttt ttttttttta ctgataataa agtgtagtgt 180 aatgaataat gatattttca taacaaaaag gcgcctatta gaaatttatc tttattattg 240 tttgtgtaat tagcgaaaaa agttcgaacc tgaacgttga aatttggcgt aaagttcgaa 300 cgttcgaact tgaacatgtt cgagctagca gccctagtta taacagtata cctacaattg 360 tatttgtagt ttatattagg ttttaacaaa tttaattaaa tcataataat taaaaaccat 420 gtttggaaat aaaaagaaaa agtctggttg ggaaaataaa aaacaaagag aaacaaaaga 480 aaatgaacgt aagaagcaag caaattctat attgtcattt gttcaacaat cgtgctcatc 540 aaatataagc caagctacaa gcaatattga tgaaccaaca attaaaaaaa tgtgtttcga 600 tcaaagtaca atcaacagca taaacgaatg caatatggat aaatcgtcaa aatttgatga 660 atattccaaa gcctcaacat ctaatttaga agagagtcag gaagctgcta tttcagatct 720 tgtcaaagat cctacagaag tatttatatc tactgtagat ggaagcgagc gtaccatttg 780 tgaaacatca gaagatctcg aagttccact attgacagtg aatgaagagg acccggcgta 840 tgaagtgtca tcagatcccg gtgaatggaa gtttatcacg gacaacatga gacaattttt 900 ggtaaaaaaa ggtccagtgc aaatagacaa ttttgccttt cctgtaaaca acaaaggtag 960 acaattttca ccttctttat acaatcgaaa attgccaaac ggagaacatg ttgtgcgatc 1020 ctggttaatt tattcacaat ctcaagattc ggtattttgt tttagttgta aattattttc 1080 taaaaatcgc aataactggg ctgtcaatgg tagttccgat tgggcacatt tatcacggga 1140 tcttaaaagt catgaatcat cgggaattca tttcgatagt atttttatgt ggaatgagtt 1200 aaggttacgc ttaaaaacat taaacaccat agatgctgct caacaacgcg cccttaatat 1260 tgaaaaaaat aagtggcaag agattcttaa aagaataatt gcgatattaa aatttttagc 1320 ttcacaatca ttagcaatac gaggtacaag tgataaaatg tatgatagaa ataacggcaa 1380 ttttttaaaa ttatttgaat tactggctac atttgatcaa acgtctgcaa atcatataca 1440 aactattaca caaaggcaaa ccaaagtaca ttatttgagt aataatattc agaatgagct 1500 catacagtta atttcaaaaa atattaaaga gtatattgtt caacaaatta aaattagtaa 1560 atactattca ataatagtag attgcacccc agatcagagt catgttgaac aaatgtccat 1620 tattataaga tttgtattcc aggacaaaaa caggttcgaa attcgcgagc attttttagg 1680 attcattccc gtagtggata cttctggtaa aggcctaact gaaaaaattc ttaaagaact 1740 agaagataat caaattccta ttgaagatat gagaggccag gggtacgata atggtgcgaa 1800 catgaaagga aaaagattgg gtgttcaaag cagaatatta caattaaatc cacgttcagt 1860 ttttgttcct tgtgcatgtc attccctcaa tttagttgtt aacgatgcag cttctgcttc 1920 aggtgaagta gctagctttt ttcttattat tcagcaaatt tatgtatttt tttcgtcatc 1980 tcctagtcgg tgggacattt taaagcagca tgtgaaaatt accttaaaac cagtttgttc 2040 tactagatgg gaaagtagaa tttctgctat tattcctttg agatttcaac ttgaacatat 2100 ttatgatgct ttagtcaata tccatttaga tgaaaatcgg gacaatcaaa cgaaacatga 2160 agcccagtca ttagcaaaaa aagtttcttg ttttaaattt atttgtagcc tgattgtgtg 2220 gtacgacatt ttatcaaaag taaatattgt atcaaaaata atgcagtctc caaatttaaa 2280 tttattaaat tgcaataatg ctttgaaaga agttttggtt tttttaaaca attaccgtca 2340 ggatgaaacc tttgtaaaag tcgtcgatga agcaaagaaa atagccgaaa ttcttatgat 2400 tgatagtaaa ttcccagatg tttcaacgat tcgaattcgt aaaaagaaag gccattttga 2460 ttacgaatcc aatgacctgc ctattataga ttcacaacaa aattataaag tcaacttctt 2520 cttctacatt ctggacaccg caattaattc aatcgatgaa agatttgaac agttaaaatt 2580 acatgtagaa aaatttcaaa tattatacaa tatttataaa ctaaaatccg aaaatagaac 2640 aaatatatta aaaaattgca tggacttaca tttattatta acagataatg ataaaagaga 2700 tattgacggt ttagaacttt gtgaagaaat tataaattta tctgcagtgt taccctctca 2760 aatggaacca ctagaaatac tgaactattt gcacgaaaat aatataacaa caatatttcc 2820 aaatttgttc atctgtctac gtattctgtt aactttacct gtatcagttg cttctgggga 2880 acgcagtttc tcgaaattaa aaataataaa aaactattta agatcgtcaa tgagtcaaga 2940 gagattagta ggtctagcta tgatgtcgat tgaaagcgaa atatgtgatt cgttaaatta 3000 tgacgattta attaacgatt ttgctgaatt aaaaacgagg cgagtaaatt ttaaataaaa 3060 atgctctttc aaataagata atttttttcg ctttttgtcc ttaaatcgtt gttaaagcta 3120 tcaaaatcga ttgaatttgt tacttggctt tctattgatc gaataattag aaatgaaatc 3180 tcttctgact tactgcaaat aatttattat tattctcaat tataattagt ccaaatatgg 3240 aatataaagg attaaataaa ttattgattt atttaatccg cgattattat atacatatta 3300 tacatatttt gttgattttg taaatgtatc ctaatataaa aattgaatat aaaaacgatg 3360 aatcaagggt ttttattttt cttgtcatta atgccaactc aacgatctat tcctggcggc 3420 ctttcattaa cgtgtgcatt tgcccaattt ttgagagaat ggtacactcc acagaaaata 3480 tttcaagaat agttgtagaa ctctcgaagt tctacaaatt ttattagaaa tatttttctg 3540 taaagtgcac tgttattcca aaaatcaagc aaatacaacc gcgttaattg aaggctacct 3600 gtattctact ttagaaaaat ttgaatatgt tggtcaccta aaaaattaat ctacccataa 3660 aaggcgcctt ataagtatgc ctgctccagg cgccaaaata ccaccgcacg gcgctg 3716 // ID BEL-216_AA-LTR repbase; DNA; INV; 345 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-216_AA_; KW BEL-216_AA-I; BEL-216_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-345 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 890-890 (2011). XX DR [1] (Consensus) XX SQ Sequence 345 BP; 102 A; 92 C; 56 G; 95 T; 0 other; tgtttgcgcc aatcgccgat agccatcagc ttataggaca caaatccatt tagctttgct 60 tcgacacata aatctcgacg atatccatta cataccacta caggatcact aactccttca 120 ttagtgtccg agcagataaa ctatagttct actgttcttt ctaatgtaaa ttataagtct 180 atcccatacc gtatcgatcg cttcacggat cgaagatagg gccgaccgca ggtagcgatc 240 cgatgtccag aaaacatcgc acccggtcag taggcaatcc aaataattgt atctaatcag 300 tctgcaataa accacacact cgattcgatc gtttcctttc tcaca 345 // ID Mariner-3_BM repbase; DNA; INV; 1280 BP. XX AC . XX DT 26-APR-2010 (Rel. 15.07, Created) DT 26-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-3_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1280 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 938-938 (2010). XX DR [1] (Consensus) XX CC >93% identical to consensus. The protein is coded by 2 CC overlapping ORFs. XX FH Key Location/Qualifiers FT CDS join(189..542,509..1204) FT /product="Mariner-3_BM_1p" FT /translation="MDTSKIRVIFEYEFRRGTNAAETARNINVAFGEGTAN FT ERTVRFWFKRFRDGNFDLKNEPRGRPPTHVNNNELKEMVEADPSQTTQELA FT AWFNVTLPTILTHLRQINKIKNMKNGCLMINKKYEKWVPHDLTDLQKETRV FT ETCVALLNRYRNEGILDRIVTCDEKWILYDNRKRKMQWLTPGQTPQQCPKA FT KLTNKKVMVTVWWSQHGVIHYSFLRSGQAITADVYCAELRTMIAKLAVKQP FT RLMNRSSPLLLHDNARPHTARETVLTLQELQLETIRHPPYSPDLAPTDYHF FT FRDLDNFLRDKKFSSQEAVQNAFTQFVESRSPEFYRKGINDLPIRWQQCID FT NNGRYFD" XX SQ Sequence 1280 BP; 416 A; 240 C; 256 G; 368 T; 0 other; ggggttaaag tatgaaattg ccgttttatt gtaataggta cttcgaattg acatagaacc 60 ttcatggttt gagatttttg agtgaaactt ttttcaaatg gtacctatga ggttcttctt 120 ttaaaaaatg tgcaacattc gattatcgac tttcagttgt ttttttttat tcagcttaga 180 caagaaagat ggatacttcg aaaattcgag tgatttttga atacgagttc cgacgcggaa 240 ctaacgctgc agaaacagct cgcaatatca atgttgcgtt tggagagggg actgctaatg 300 aacgcaccgt gcgattttgg tttaaacgct ttcgtgatgg aaattttgat ttgaagaacg 360 aaccacgtgg aagaccgccc acacatgtga ataacaatga attgaaagag atggtggaag 420 ccgatccgag ccaaactacc caggaattag cggcatggtt taacgttacc ttaccaacaa 480 tattgactca tttgcgtcaa atcaataaaa taaaaaatat gaaaaatggg tgcctcatga 540 tttgactgat ctgcagaaag aaacgcgtgt tgaaacttgt gttgccttgt tgaatcgata 600 cagaaatgaa ggaatattgg atcgaattgt gacatgtgat gaaaagtgga ttctttacga 660 taaccgtaag cgaaaaatgc aatggctgac cccaggtcaa acgccgcaac agtgtcctaa 720 agcaaagctt accaataaaa aggtaatggt aactgtttgg tggtctcagc atggtgttat 780 tcactatagc tttctccgat ctggtcaagc aataacggca gatgtctact gtgccgaact 840 ccgaacaatg atagcaaaac ttgcagtgaa acagccccga ctcatgaatc gatcttcacc 900 attattgctc catgataacg cgagacctca tacagcacga gaaaccgttt taactctaca 960 ggaactgcaa ttagaaacca ttcgtcaccc tccgtattcg ccagaccttg ctccaacgga 1020 ctaccatttt tttcgtgatt tggacaattt tctacgtgat aaaaagtttt cttcccagga 1080 ggcagtacaa aatgctttca cacagtttgt agaatctaga tcaccagagt tctatcgcaa 1140 aggcataaat gaccttccta ttagatggca gcaatgtata gataataatg gtagatattt 1200 tgattaaata aatatgttaa atgaaaaaaa aaaaaatttc aatttttcag tacaaatcgg 1260 caatttcata ctttaacccc 1280 // ID Gypsy-125_AA-I repbase; DNA; INV; 7830 BP. XX AC AAGE02022342; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-125_AA_; KW Gypsy-125_AA-LTR; Gypsy-125_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7830 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022342; Positions 54372 46543. XX CC Positions [6603-7070] - Integrase core CC 'CTGTG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2422..3888 FT /product="Gypsy-125_AA-I_1p" FT /translation="MARQLAAEKSKIKLLKKKLKWSRNENKLLIDRLNQQP FT ATDSTSGSFGFGEGNRFSSTRNMTETVQDKEREESCKDDDSNPSVENPEVT FT EHISAGIIQENDYGTHRIGTAEDGHEEQPFLSSDDESRIRIARDTTKERQE FT EVRSSVVRTVTGQDESRFLSSMNQLSVASINVPECKPAVDGDIHRQTFEQW FT KDLLVDSMNLAGIDDERTMFTVFKVKAGIKLLEIFRNTKSQNSDSDPQAKP FT FSNAMDRLKSYFGSGSDILLMRRKLALMAQQANETDLSFVTRVGSTARLCG FT YDEGKEFEEVAATIAEHARDREVRATALKSLSKKGCFTDLVDKVREIESIR FT LNEEYVMRKQGKVSHATVAAVAAPYQHRNKFPERYQTRPKPYKRPVQKPKT FT EWRPSHYLNSNQSNTKCWRCYSSFHSPHDCFAKGKICNNCGNKGHISRACH FT AAVERGRDQRMARSREETPQRIAHVGKSEEDTVDNDKVSDNVET" FT CDS 4437..7640 FT /product="Gypsy-125_AA-I_2p" FT /translation="MNCVFLIDSGAQVNTFTERMFHKLIETPEFSKEVYNI FT RDSTDRPLKAYATDKTIEVLATFDAYLYITEDRPILLEKFYVVKESRALLS FT RATATRYSVLLLGLKVPIHPLQQTNETIRCIERISVVEKNKVYPKFNIPPV FT KIFYDKSKPPCRNVFSNIPLAVKPLVEKRLEELVSANIIEPIVEGMDTSFC FT SSMLIVPKGKDDIRLVVDLRGPNRYIYRTPFAIPTLEKILAELNGAKWFST FT IDLSNAFFHMELDEESRHLTNFYTEFGMYRCVRLPFGLCNAPDLFQEALQR FT KVLAGCKGCKNYLDDVLVFGATKAEHDENLAEVLSCLENHNVKLNESKCSF FT GTQAAEFLGFTLTPEGWQINDEKVEAVKNFRKPTTCSEVKSFLGLITFVDK FT FVPHRATKTEYLRTLASSDMFYWSDNEEKEFNYFKEEALTCIKRLGYFNAE FT DRTELFVDASPIGLGAVLTQFNADGDPRIIACASKVLTVAEQKYPQTQKEA FT LAVVWAVEKFSFYLLAKFFVIRTDAEANQFIFSSNHRLGKRAISRAESWAL FT RLQPYDFSIERVPGNQNIADALSRLIPASQIDKPFDEDEDSHYLYALDAGC FT MQLTWNEIETASENDNELALVRQAVISKKWPAELRSYEAQRKTLHTLGSLV FT FKDDRTILPVSLRSKALQLAHQGHVGEVATKRIMRLHFWWPRMSAEVSRFV FT RSCETCAQLSKRNPPVPLASRDLPNGPWEILQIDFLSVPSFGTGELLVVID FT TYSRYLCVVEMKSLNASSTNSALCEVFHTWGCPIILQSDNGPPFQSTEFIQ FT FWQNKGVEVRKSIPLSPQSNGCVERQNQGIIKALAASRLDGLNWRIALQSY FT VHRHNTVVPHSRLCVTPFEMMVGWRYRGTFPSLWSPTCSGSPDKSELEERD FT AEAKLASKTYADSVRGAKVSDIQVGDIVLLAQSKKSKTDPTFSLERYTVIA FT REGAKVVVMSAAGIQYSRNIREVKKAPGSYAFNDPHSTDEFTRASGRDADD FT GNEPNGEGPSDNTGHAVLMDGTTGSMPGDQNSESSSARNLRERTAINRPKR FT FDDRYIYRVFC" XX SQ Sequence 7830 BP; 2572 A; 1388 C; 1771 G; 2099 T; 0 other; tggcgcagtc ggcgggtaag taaaaattaa gtgcattgtt ctaaaggtaa agctcttgaa 60 taaaaactga taattgtgag tgagtttaca cgtttcacac aacggtttca gtaagagttt 120 gaataaaaac tgataattgt gagtgagttt acacgtttca cacaacggtt tcagtaagag 180 ttactgtaaa ttttattgta gtgagacagt tcacacaaaa cgaatgtgag tgagttcaca 240 aatgtttcat ataatggttc cagtaagaga gtttactgga aattttgttg tagtgagaga 300 gttcacacaa aacgaatgtg agtgagttca caaaatgatc atataatagt tacagtaaga 360 gagtttactg gaaactttct tgtagtgaga gagttgacac aaaacgaatg tgagtgagtt 420 cacaaatgtt tcatataatg gttccagtaa gagagtttac tggaaatttt gttgtagtga 480 gagagttcac acaaaacgaa tgtgagtgag ttcacaaaat gatcatataa tagttacagt 540 aagagagttt actggaaact tttttgtagt gagacagttc acacaaaacg aatgtgagtg 600 agttcacaaa tgtttcattt aatagttaca gtaagagagt ttactggaaa tttttttgta 660 gtgagacagt tcacacaaaa cgaatgtgag tgagttcaca aatttatcat ataatagtta 720 cagtaagaga gtttactgga aattttgttg tagtgagaca gttcacacaa aacgaatgtg 780 agtgagttca caaatgtttc atataatggt tccagtaaga gagtttactg gaaattttgt 840 tgtagtgaga gagttcacac aaaacgaatg tgagtgagtt cacagattaa tcatataata 900 gttacagtaa gagagtttac tggaaacttt tttgtagtga gacagttcac acaaaacgaa 960 tgtgagtgag ttcacaaatg tttcatataa tggttccagt aagagagttt actggaaatt 1020 ttgttgtagt gagagagttc acacaaaacg aatgtgagtg agttcacaga tttatcatat 1080 aatagttaca gtaagagagt ttactggaaa cttttttgta gtgagacagt tcacacaaaa 1140 cgaatgtgag tgagttcaca aatgtttcat ataatggttc cagtaagaga gtttactgga 1200 aattttgttg tagtgagaga gttcacacaa aacgaatgtg agtgagttca cagatttatc 1260 atataatagt tacagtaaga gagtttactg gaaacttttt tgttgtgaga cagttcacac 1320 aaaacgaatg tgagtgagtt cacaaatgtt tcatataatg gttccaggaa gagagtttac 1380 tggaaatttt gttgtagtga gagagttgac acaaaacgaa tgtgagtgag ttcacaaaat 1440 taccatataa tagttacagt aagagagttt acaggaaact ttcttgtagt gagacagttc 1500 acacaaaacg aatgtgagtg agttcacaaa tgtttcatat aatggttcca gtaagagagt 1560 ttactggaaa ttttgttgta gtgagagagt tcacacaaaa cgaatgtgag tgagttcaca 1620 agatgatcat ataatagtta cagtaagaga gtttactgga aactttcttg tagtgagaga 1680 gttcacacaa aacgaatgtg agttcacaaa tgtttcatac aatggttcca gtaagagagt 1740 ttactggaaa ttttgttgta gtgagagagt tcacacaata cgaatgtgag tgagttcaca 1800 aaattatcat ataatagtta cagtaagaga gtttactgaa aattttattg tagtaggaga 1860 gttcacacaa aacgaatgtg agtgggttca caaaattaac atattatagt ttcagtaaga 1920 gaatttactg gaaattttgt tgtagtgaga gctcacacta aacaaaagtg agggagttta 1980 caaaattaac atataatagt tccagtaaga gagttattag aaattttgtt gtattgagag 2040 agttcacaca aaacaaaagt gagtgagttc aaacatgttt catataatgg ttccagtaag 2100 agagtatgct ggaaatatca ttgtagtgag agagttcacc caaatgagtc acataatggt 2160 tttagtaaga gtttgctgga aatattgttg tcgtgggaga gttttcgcga aactaaaatt 2220 tcagtaagtt tcttgtagct cccttgcttc acaagaaaaa aaaaactaga aaccagtttt 2280 cttaaaagta tctattctga gaatataatt atgaaaataa ttatgaaaat gggtacttga 2340 gcttatattc ttttctgtat cttcttcatt ctagaggatc caccaacaga aaagtcaaag 2400 aaagcaaaca aactccaagc aatggctcgc cagttagcgg ccgagaaatc gaaaattaag 2460 ctgctgaaaa agaagctaaa gtggagtcgg aacgaaaaca aactgctaat cgatcgttta 2520 aatcaacaac cagctacaga ctcgacgagc ggaagttttg gattcggaga gggtaacaga 2580 tttagcagca cgcgaaacat gacagaaacg gttcaggaca aagaacggga ggaatcatgt 2640 aaggatgacg attcgaatcc atcagtcgaa aacccggaag ttacggagca catctcggcc 2700 ggaataattc aggagaatga ttacggaaca caccgaattg gcactgcaga ggacggtcat 2760 gaggaacaac cttttttgtc cagtgacgac gaatctcgta tcagaattgc gagagacaca 2820 acaaaggaga gacaggagga ggttcgatca agcgttgtgc gtaccgtgac aggccaggat 2880 gagtctaggt ttctttcgtc gatgaatcaa ctttcggtgg cttcgataaa tgtacccgaa 2940 tgcaaaccag cagttgacgg agatatccat cggcaaacgt tcgagcaatg gaaggatctc 3000 cttgtggact ccatgaatct agcgggcatt gatgacgaac gtacaatgtt cactgtattc 3060 aaggtgaaag cggggattaa attgttggag atattcagga acacaaaatc ccaaaacagc 3120 gattccgatc cacaagctaa acccttttcc aacgcaatgg accggctgaa atcatacttt 3180 ggttcggggt cggatatttt gctgatgcga aggaaattag cattgatggc ccagcaagca 3240 aacgaaacag atctgtcttt cgttacgaga gttggatcta cagctcggct atgtggatat 3300 gacgaaggaa aagagtttga ggaggtcgca gctaccattg cagaacatgc tcgtgataga 3360 gaagtacgag caacagccct gaaatcgtta agtaagaaag gatgctttac ggaccttgtg 3420 gataaagtca gggagataga atcgatccga ttgaatgagg aatatgttat gcgaaaacaa 3480 ggtaaagtaa gccatgccac ggttgctgca gttgcagcac cttatcagca tcgaaataaa 3540 ttcccggagc gttatcagac acgaccaaaa ccttacaagc gaccagtaca gaaaccaaaa 3600 acggaatgga gaccatctca ctatctcaac agcaaccaat caaacacgaa atgttggaga 3660 tgctacagtt cgttccactc tcctcatgac tgctttgcca aagggaagat ttgcaacaat 3720 tgcggcaaca aaggacacat tagtcgggcc tgtcacgctg cagttgagcg aggtcgtgat 3780 caacgaatgg ccaggagtcg tgaagaaacg ccccagcgaa ttgcccatgt tggaaaatcg 3840 gaagaagaca cagttgacaa cgataaagtg agtgataatg ttgaaactta gttatacact 3900 gtaaacgatg acattttcac tcgcaaacat cacttgaact acggagaata tgattttttc 3960 ttcaaccttc ttattatatt gacatgaatt cgaactgaac ttaatatgat aaatgttttt 4020 ttttatcaag catataaaat cgctacggaa aaaaacttca taaaataaaa tgatattcga 4080 acaattaatt gaaagattta ttattaataa caagaaaatg tagtgaaatg tcatcataat 4140 tgaaatagag acaattggta acatgtatta tttgtcaggg tttaacgatc aggtcgacta 4200 agcatggagc catggacaca gtcaaaagtt cgcttcggca ttggttggaa atagatacta 4260 tatacagtat tggaaacgaa gatgttaaaa tccctcgtat gagcgctgag ttagtgcaac 4320 gagtaagtaa tattgatgct ttaagtatgc gaggattttt agtaattcgt tctcattaaa 4380 tacagactgc gttaagcgat ttcggcgacg gatacatcac agttacggta gctggtatga 4440 attgtgtttt tctcatagac tcaggtgctc aagttaacac attcaccgaa cgcatgttcc 4500 acaagttaat tgagacccct gagttcagta aggaagtcta taatatcagg gattccacag 4560 accgaccact caaagcatac gcaacagaca aaacgatcga agtacttgca acgtttgatg 4620 cgtaccttta cattaccgaa gatagaccaa tacttctaga gaagttttac gttgtcaagg 4680 aaagtcgagc acttctcagt agagcgacag caacgagata cagcgtattg ctacttggtc 4740 tgaaggtacc tatccatcca ttgcaacaga ccaacgaaac aattcgttgc atcgaaagaa 4800 tttctgtggt ggagaagaat aaggtatatc ccaaatttaa tatccctcca gtgaaaatat 4860 tctacgataa atcgaaacct ccgtgcagga atgtattctc aaacattcct ctagcggtaa 4920 agcccttggt tgagaaaagg ttggaagaat tggtatcggc caatatcatt gaacccattg 4980 tcgagggcat ggacacatca ttttgttcgt caatgttgat cgtcccgaag ggtaaggacg 5040 acatacggct tgtcgtcgac ttaagagggc caaatcgata tatctaccgt acaccgtttg 5100 caattccaac tttggagaaa atactggcgg agctgaatgg cgcaaaatgg ttttcaacca 5160 tcgatttgtc gaatgctttt tttcacatgg agcttgacga agaatccaga catctcacta 5220 atttctatac ggaatttggt atgtaccgtt gtgtgagatt accctttgga ctatgcaatg 5280 ctcccgatct tttccaagag gcgcttcaac gtaaagtact ggctggctgc aaaggatgta 5340 aaaattatct cgacgatgtg cttgtcttcg gagccaccaa agctgaacac gacgagaatt 5400 tggctgaagt gctttcgtgt ttggaaaatc ataacgtcaa actaaacgag agcaaatgtt 5460 cgtttggtac tcaggcagct gaatttctgg gattcacgtt aactccagaa ggttggcaaa 5520 tcaacgatga aaaagtagag gcagttaaaa attttcgaaa acctacgaca tgctccgagg 5580 taaagagctt cttgggattg ataacctttg tcgataagtt tgtccctcat cgtgcaacaa 5640 agacagaata cctccgtact ttggcctcat ctgatatgtt ttattggtca gacaatgaag 5700 aaaaagagtt caactacttc aaggaagaag ctctcacatg catcaagaga ttgggatatt 5760 tcaacgcaga agaccgaaca gaactatttg tcgacgcttc cccgattgga ctgggtgcag 5820 tcttgaccca atttaatgct gatggggatc caaggattat cgcctgtgca tccaaagtat 5880 tgacagtagc cgaacaaaaa tacccgcaaa cgcaaaaaga agcgttggca gttgtatggg 5940 cagtagaaaa attttcattc tacttgctgg ccaaattctt tgtcattcgt acggatgcgg 6000 aagccaacca gtttattttc agctcaaatc acagattagg caaacgtgca atatccagag 6060 cggagagttg ggcattacgt cttcaaccat acgacttctc gattgaacgt gttccaggga 6120 atcagaatat tgccgatgcc ctttcaaggt tgatcccggc ttcacaaata gacaaaccat 6180 ttgacgagga tgaagacagt cactatctct atgccctaga tgcaggctgc atgcaactga 6240 cgtggaatga aatcgaaacc gcttcagaaa atgacaacga gttggcatta gtcagacaag 6300 ctgtaatctc aaagaagtgg ccagcggagc tgcgtagtta cgaagcgcag agaaaaacct 6360 tacataccct tggttcgttg gtattcaaag atgatcgaac gattcttcca gtgtcactcc 6420 gaagcaaagc ccttcaatta gcgcaccaag gtcacgttgg agaagttgct actaaacgta 6480 tcatgcgctt gcacttttgg tggccacgga tgtcggccga ggtatcacga tttgttagaa 6540 gctgtgagac atgtgcgcaa ttatcaaagc gaaatcctcc tgtgccgctg gccagcagag 6600 atttacctaa tggaccatgg gagattttgc aaattgattt tctttccgta ccatcttttg 6660 gaaccggaga acttctcgtc gtaatcgaca catattccag atatctgtgc gttgtggaga 6720 tgaaaagcct caatgcaagc agcactaaca gtgctctctg tgaagttttt catacctggg 6780 gatgccctat cattttgcag agtgataatg gcccaccctt tcaaagtact gaatttatcc 6840 agttttggca aaacaaaggg gttgaagtac gaaaatccat accactgagt ccccaatcga 6900 acgggtgcgt tgaaaggcag aaccaaggca tcatcaaggc tttggctgcc tccaggcttg 6960 acgggttaaa ttggagaatt gccttgcaaa gttatgtcca tcggcacaat acggttgttc 7020 cccactcgag actatgcgta acacctttcg agatgatggt tggatggaga tatcgtggga 7080 ctttccctag tctgtggtcc cctacttgta gcggaagtcc tgacaagagt gaattagaag 7140 agagggacgc agaagctaaa ctggccagca aaacgtacgc tgactctgtt cgtggagcta 7200 aagtttctga catccaagtt ggcgatattg tgcttctagc acaatctaaa aaaagcaaaa 7260 cagatccgac gttttcactg gaacgttata cggtgatcgc cagagaaggt gcaaaggtgg 7320 tcgtcatgag cgcagctgga atacagtatt cgcgaaacat acgggaggtg aaaaaggctc 7380 ctggaagtta tgcattcaat gatccgcatt caaccgacga attcacgaga gcttctggaa 7440 gagatgctga cgacggtaat gaacctaatg gcgaaggacc atctgataac accggtcatg 7500 cagttcttat ggatggaaca acgggatcta tgcctggaga tcaaaattct gaatcttctt 7560 cagctaggaa tcttcgagaa agaaccgcta tcaaccgtcc taaacgcttc gatgaccgct 7620 atatttaccg cgttttttgt taaagatgaa ctctgaaaat acaagaagaa atgatcgttt 7680 tatgattaaa gataagagaa actgaaaatc agcttataaa taaacaagtg aattgggaaa 7740 ctaggaataa aacgccaagt tttttgttta tcatgatact atcacaactc gctgctaccg 7800 atcgcaaaac agagtagaga agtgggcgaa 7830 // ID Gypsy-221_AA-LTR repbase; DNA; INV; 1167 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-221_AA_; KW Gypsy-221_AA-I; Gypsy-221_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1167 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1046-1046 (2011). XX DR [2] (Consensus) XX SQ Sequence 1167 BP; 294 A; 273 C; 286 G; 192 T; 122 other; tgttaccgtt ttgggtactt taccctakga cccaactccg agcamatgag agaaaaccat 60 tctgctatta gtcgtakaka gagggwactc aatctgttcg attgtacaca atattgcgta 120 aaagtcgtct tgtatgccgg ggccggttta aagctccgat aakaaagawa aktcatggtc 180 cactcccamm kccaactgtk ggtttaaaga aggawawggc aawtgtaacg ckcataacac 240 akgwmacacm gmatacacaw amgsatactg gagtgggagg akcgaagtgg gcaatggaat 300 aacgattgga agaatkgaag ggaggagcaa wkgagatggg gaawggagts tgsgaagggg 360 gaaggkgggg tawggwwwkg ggtmtgccac gagggaaaac tggatcwggc cagtcttttc 420 cacttgagac ttsaagcwgt csagacgtac agtaaagaaa cacaagtgtc ccagcgactt 480 agggaacwta ttagaagcac cagtggtgat atagaccctg gwaskgtwta aawgmagtgg 540 aagwkktgwg accacgccag ggaacccgtg agagtcaggt aacctaacca gtgaakaaww 600 aagcggcmtw kswgwaccct aaaasctccc cacccctacg cmattcccwt taggacccat 660 tawggtwtcg gctagggsck tcwgtggkck ccawacccwc cwkccattcc ggaggwcaca 720 ccccgtggct atacactggg aagcmcgtgg tttgakmcgc twgtaggacc mgcgwasccc 780 gwmgccccak cgccggcttg ctmasccacg tgggaacatm ckgcsatcgg kktgccscam 840 gcccggaata gcagcmccac ggagcccagc aaaccatcca gcccaggmga cmacgascga 900 tcccatagca ktgtwmgccc cgckggatcc accmkgwmgm ccctccmstg ccggccggcc 960 ccctttacac atccacaccc acacacwsac acacmgtaag tgmaataaag agaaktgtag 1020 ttaaaaswga agtcgawagt kkttggactt ctgatcagcg ctgagagccg accctgtaga 1080 agaagagttt gggaacccct ggtggctagg cctgtaaggt ctcgtcccca gcgagcgtct 1140 cgggtttaac ccwgtgagaa tataaca 1167 // ID LOA-8_AAe repbase; DNA; INV; 5846 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5846 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1418-1418 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 692..2086 FT /product="LOA-8_AAe_1p" FT /translation="MSFITNVKGSNCSQPEEMETECILGVEELMENQLLRE FT SGSEDQSCSSSILDTPKRVDQIDSESDDDINVTIRPAVADKIDRASLENEE FT GVKIKWRKNLTNGQRNKMKKLLKSGLSMEEARMKVLSKDAPETPNVGKRSR FT DGDQGGSGEKPEAKRTKSHLCPKERAGLTNYDNRATSRKQNDRSVNEAERT FT KEVNERPRSSKADSNSRGHSFRDVASLVKVGIILNGFPSRQMSTAQLDAVQ FT DALLLKIEEQRHATVKPKFTKCSYKQGHLILACKDQATALWLKDVTRSLTP FT WENAELLALDEKDIPRPELFHAFFPVSANFSDERLKGLIESQNDGINTNGW FT NILNRTNANKHAEWKLYLDEESLRTLADRNFVLNFRFGETQLRKVKTQGRS FT QDDGNTNGNEPELNVEQETTPNDLKTETSGAKIEALTEAVNYKRSQLEVHD FT RPIAPPKGKQERITGKNEAN" FT CDS 2076..5732 FT /product="LOA-8_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MKLIKFLQVNLHHAKGASAVLSRRFKKEGLDMALLQE FT PWSNKGKILGIQIQKCKLFYDENQSSPRTAILVNNTMKCYPITEFIKRDIV FT AVVVEVPTSRGKTEIIVASAYFPGDVPEVPSPEITSFIRYCREVNKSFIIG FT CDANAHHTVWGSTDINGRGECLLEFLSSNGIDICNKGNKPTFLNAIREEVL FT DLTLCNSAISEKIRNWHVSDETSLSDHKHIVFEWGGGDLSPTAYRNPRKTN FT WGHYANHLKSDTFTNEISIKTIQQLESFSQEVNEVILKAYNNSCPIRQSSS FT SRDVPWWNSKLESLRKTSRKLFNKAKRTSDWAQYRRALTEYNNELRKSKRR FT TWIHMCESIDNTPIVARLQKTLAKNHTSELGNLKRQDGVYTKTCNETLDLM FT MDTHFPGAASGEDIQYTGVQGGSMQTIPEVCSIENPTPDKADKIFTFARVE FT NAVRSFQPYKSAGVDGIFPALLQNGEAVLIPSLMKIFKASLRLNYIPSEWR FT LVRVIFIPKPGKRDKTNPKSFRPISLSSVLLKTMEKVLKDYVNSTYMQTYP FT LSKYQYAYQAGKSTVTALHTLVDKVERSLSVKEIALCSFLDIEGAFDNTSY FT SSIANAMKKRNFNTCIVDWIHTMLAKREITSELGGSSVTIRATKGCPQGGV FT LSPLLWSLVVDDLLRSLEAKGFEVVGFADDIVIVVRGKFDNIISERMQQAL FT KYTQSWCIKEGLSINPSKIVIVPFTKKRKVNLKSFKIGEVEIQVSSQVKYL FT GVILDAKLSWNAHLDSTINKAINALWLCSKTVGRKWGLRPKMIMWIYTSII FT RPKITYASLVWWPKTREATAGMKLAKLQRLACIAITGAMRSTPSKALDAIL FT NLLPLHEYVQLEAERSALRLKRSKNVLPGDLIGHLSILQHFKRGPVMSMNG FT DWMRPEDNYDFLYKVSEMTRTSWDVGGPTVREGSIIFFTDGSKIGTNTGAG FT IFGPGVNISVAMGHWPTVFQAEIFAILECVNVCLKRKYRYANICIFSDSQA FT ALNALSAYKCTSKLVWECILSLRQLCESNSVNLYWVPGHCGIEGNEKADDL FT AKRGSNSQFVGPEPFCGIPNCAVKMELKCWEEQKVMTNWMDVKKCTQSKRF FT ITPNANKTKKILELNKRALCTYTGLVTGHCPSKYHLKNIGQVQNDICRFCN FT IESETSEHLLCSCGALYRRRSKFLDSGCLQPSEIWSADPRKVIGFINHIIP FT DWERVMVSS" XX SQ Sequence 5846 BP; 1912 A; 1062 C; 1307 G; 1565 T; 0 other; tctggcaaca ctgtttcaag tggacaacaa cgtcttcctc gagctcgttt taattttgca 60 tttcttagtg aaatacggca tattaaattg tgagtgatag tgcgttattt aggtcataac 120 tgccggtgaa ggaaaagtcg ttgtacaggt gagaaacagt agtgaattgt gatattttcc 180 tccgtgaatt gtgtgtgcga tgcaatgggg aaaattaaac aataaaagaa acgccaaaca 240 tgataagctc cagctgcaag agtagtcttg ttggggtgag cttttaaaag cttgcatttc 300 ttgtatgcaa tattgtatgt aaacttgcag atgcaggttt ggttaaactt gtatcggatc 360 agtgatgcta gtttcattga aacgcaatgt gaaagaaagg caaaatattg ttctgtgtta 420 ttattattat tttttttttt ttttttatta ttattattat tactattatt tattattatt 480 atttttatct ttcctactta aatacttatg caattcgcaa tactcattaa gttgacttaa 540 attaataaat tatcaaatag acgaacaaaa taaacattaa cgacagtctt atactttaac 600 aattttgttt actttggaat tttcggtttc agttacatac atacgttaca ctcagctaat 660 ttatccgcaa tagcaatctt ccggaatcag catgagcttc attacgaatg tcaagggttc 720 caactgttct caaccggaag agatggaaac ggaatgtatc ctaggtgttg aggaattaat 780 ggagaatcaa ttgttgagag aatctggctc agaggaccaa tcgtgttcct cttccattct 840 cgatacccct aaacgtgtcg atcaaatcga tagtgaaagt gatgacgata taaatgtcac 900 tatcaggcct gcagtagcag ataagataga tagagcatca ttggaaaatg aagaaggcgt 960 taaaattaaa tggcgaaaaa atctgactaa tggtcaacgg aacaaaatga agaaactact 1020 caaatctggg ctatccatgg aggaagctcg gatgaaagta ctttcaaaag atgctcctga 1080 gactccaaat gttggtaaac gatcaagaga tggcgatcaa ggtggtagtg gcgaaaagcc 1140 tgaggccaaa aggacgaaat ctcatctctg tccgaaggaa cgtgctggat tgaccaacta 1200 tgataaccgg gctacgtcta gaaagcagaa tgaccggagt gtcaatgaag ctgaacgaac 1260 taaggaagtt aatgaacgtc ctagaagttc aaaggctgac tcgaactcaa gaggtcactc 1320 tttcagggac gtggcgagcc tggttaaagt tgggataatc ttgaacgggt tccctagcag 1380 gcaaatgtcg acagctcaac tcgatgctgt acaagacgct ctgcttctaa aaattgaaga 1440 gcaaagacac gcaacagtga aaccgaagtt tacaaaatgt agctacaaac aagggcattt 1500 gatccttgct tgcaaggatc aagcaacggc gttgtggctt aaagatgtta ctcggtcact 1560 cactccctgg gaaaatgcgg agctacttgc tttggatgag aaggatattc ctcgcccgga 1620 gctgttccat gcattttttc ccgtgagtgc gaacttcagt gatgaaagac taaaagggct 1680 catagagagc cagaacgatg ggattaatac aaatggctgg aatattctta atcgaacaaa 1740 tgcgaacaaa catgcagaat ggaagttgta cttggacgaa gaatcgctaa gaacgttagc 1800 tgatcgcaac tttgtcctta actttcgttt tggcgaaacg cagttgagga aagtcaaaac 1860 ccaaggccgt agccaagacg atggcaatac gaacgggaat gagccggaac taaatgtaga 1920 gcaggaaaca acgcccaacg acttgaaaac agaaacaagt ggagccaaaa tagaggcctt 1980 aactgaagcg gtaaactaca aaagatccca actggaagta cacgataggc ctatcgctcc 2040 cccaaaagga aaacaggagc gtataactgg gaaaaatgaa gctaattaag ttcctgcagg 2100 tgaacctcca tcatgctaaa ggcgcttctg ccgtattaag ccggaggttc aaaaaggagg 2160 gactggacat ggccctacta caagagccat ggtccaacaa aggaaaaatt cttggaattc 2220 aaatacaaaa atgtaagttg ttttatgacg agaatcaaag ttcaccgaga actgcgattc 2280 ttgttaataa tacaatgaaa tgctacccta ttacagagtt catcaaacgt gacatagtgg 2340 cggttgtggt tgaggtacca acctccaggg gaaagacgga aatcatcgtg gcttcagcgt 2400 atttccctgg cgatgttcct gaggtacctt ctcctgaaat tacatctttc attcgatact 2460 gccgggaagt caacaagtcg ttcatcatcg gctgcgatgc caacgctcat cacactgtgt 2520 ggggtagcac cgacatcaac ggtcgaggtg agtgtctatt agaattccta tcatcgaatg 2580 gtattgatat atgcaacaag ggtaataaac caacatttct aaatgcaatc agagaagagg 2640 ttttagattt gacgttgtgt aactctgcaa tttctgagaa aattcggaat tggcatgtct 2700 cagatgaaac ttcattatcg gatcataaac acattgtttt tgagtggggt ggaggcgact 2760 tatcaccaac agcatatcgt aatcctagaa aaacgaactg gggtcattat gctaatcact 2820 tgaaatctga cacttttaca aatgaaatta gtattaagac aatacaacaa cttgaatcat 2880 tttcgcaaga ggtaaacgaa gtgattctaa aagcatataa taatagttgt cctatcaggc 2940 aatcgtcctc tagtagggac gtaccatggt ggaactcaaa attggaaagc cttagaaaaa 3000 catctcggaa attgtttaat aaagcaaaac gaacctcaga ttgggctcag tatagaagag 3060 ctctaactga atacaataac gaattaagga aatcaaagag aagaacttgg attcatatgt 3120 gcgagagtat agataataca ccaattgttg ctagattaca gaaaactctt gctaaaaacc 3180 atacttctga acttggcaac cttaaacgcc aagatggggt gtatactaaa acttgtaatg 3240 agacacttga tttgatgatg gatactcatt ttccgggagc ggcttcaggc gaggatatac 3300 agtatactgg agtgcaagga ggctctatgc agaccatacc ggaggtctgc agtatagaaa 3360 atccaacacc agataaagct gataaaatct tcacgtttgc aagagtggaa aatgcagtga 3420 gatctttcca gccttataaa tctgcaggtg tggacggaat atttccagca ctacttcaaa 3480 acggagaagc ggttctgatt ccgtctctta tgaagatttt caaggcaagt ttaagattga 3540 actatattcc atcagaatgg aggttggtaa gagtcatatt catacctaag cctgggaaac 3600 gagataaaac gaacccaaaa tcttttagac ccattagttt gtcatcagtt ctgttgaaaa 3660 ctatggaaaa agtattgaag gattacgtaa actcaactta catgcagaca tatccattgt 3720 ctaaatatca gtatgcgtat caagctggaa agtcgacagt cacagcgctt catacgctgg 3780 tagataaagt cgaaagatct ctttcagtga aggaaatcgc actctgttct ttcttggaca 3840 ttgaaggtgc ttttgacaac acttcatatt cttcaatagc aaatgctatg aaaaaacgaa 3900 atttcaacac atgcattgtc gactggattc atactatgct agcaaaaaga gaaatcactt 3960 cagaactggg tggttcgtct gtaaccataa gggcgacgaa aggatgtccg caaggtggag 4020 tactttcgcc acttttatgg tctttggttg tagacgacct tctcagaagc ttagaagcta 4080 aaggtttcga ggttgtaggc tttgcggatg atatagtcat cgtagtaaga ggaaagtttg 4140 acaacataat atcggaaaga atgcagcagg ccctaaagta tacccaatca tggtgtataa 4200 aggagggtct aagcattaac ccgtcaaaaa tcgtgattgt cccattcact aaaaagagga 4260 aagtcaactt aaaatctttt aagattggag aagttgaaat tcaagttagc agtcaggtaa 4320 aatacttggg agtaatctta gatgccaagc ttagctggaa tgcgcatctt gactctacga 4380 ttaataaagc gatcaacgca ttatggttat gctccaaaac tgtcgggagg aagtggggct 4440 tgaggccaaa aatgattatg tggatttaca catctattat acgaccaaaa attacctacg 4500 cttcattagt gtggtggcct aaaaccaggg aggctactgc aggaatgaaa ctagctaagc 4560 ttcaaagact tgcgtgcatt gctataacgg gagcaatgcg cagcactcca tcgaaagcgt 4620 tagatgctat tctcaatctg ctgcctttgc acgaatatgt gcaattagaa gcggaaagaa 4680 gtgctctaag gcttaagagg tctaaaaatg ttttgccagg tgatcttatt ggccacttga 4740 gcattttgca acactttaaa agaggaccgg tgatgagtat gaacggagac tggatgagac 4800 ctgaggacaa ctatgatttt ctctacaagg taagcgaaat gacgcgtaca agttgggatg 4860 tcggaggtcc cacggttcgt gaaggctcaa tcatattctt tactgacggc tcaaaaattg 4920 gaacaaatac aggggctggg atctttggac ccggagtaaa tatttcagtt gcaatgggac 4980 attggccaac agtgtttcaa gctgagattt ttgcaatact tgaatgtgtg aatgtctgtc 5040 tgaaaagaaa atacagatat gcaaatatat gtattttctc tgatagtcaa gcagctctaa 5100 acgcattaag tgcatataaa tgtacatcaa aactcgtctg ggaatgtatt ctctcgttgc 5160 gacagttgtg tgaatcaaac tcggtaaatt tgtactgggt tccaggacac tgtggcattg 5220 aagggaatga aaaggcagat gatcttgcaa aacggggctc aaattcacag tttgttggcc 5280 cggaaccatt ctgtggtata ccaaactgtg cagtaaaaat ggaacttaag tgctgggaag 5340 aacaaaaggt gatgaccaac tggatggatg tcaaaaagtg tacccagtct aaaagattta 5400 taactccaaa cgcaaataaa actaaaaaga ttttagagct caacaagagg gctctttgta 5460 catacactgg cctagtaact gggcactgtc cgagcaaata tcatttgaag aacattggcc 5520 aggttcagaa tgatatttgt cgcttttgta atattgaaag tgaaacctcg gaacatctgc 5580 tttgcagttg tggtgcatta tacaggcgca gatcaaagtt tctcgatagc ggctgtttac 5640 agcccagtga gatctggtct gctgatccga gaaaggtgat tggtttcata aaccatatca 5700 tacctgattg ggaacgtgtc atggtaagca gctgatcgtc taatcaatgg tgttcggcta 5760 gcgtgacatg tacagtaaac tgggacatac tacaatagtt ctaaatattg gacgcagtag 5820 tttcaacccc caacaaaaaa aaaaaa 5846 // ID Gypsy-2_RP-LTR repbase; DNA; INV; 203 BP. XX AC ACPB02032414; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_RP_; KW Gypsy-2_RP-I; Gypsy-2_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-203 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02032414; Positions 1766 1564. XX SQ Sequence 203 BP; 57 A; 22 C; 45 G; 79 T; 0 other; tgagatgaaa taagagtgaa tttggcgggc atcttggaaa tagggagggt aaccagaagt 60 aaacctcctt gtgtggacta tatatattgt ttataatgta gtgtactatg tattgttcta 120 atgtagtgtt tctatgtatt tctatgtact atgtattgtt ctttgttgat taaagcctac 180 atttaaggtc tagtgtcatt aca 203 // ID BEL-15_CQ-LTR repbase; DNA; INV; 444 BP. XX AC AAWU01030696; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_CQ_; KW BEL-15_CQ-I; BEL-15_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-444 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 184-184 (2011). XX DR Genome; AAWU01030696; Positions 23062 23505. XX SQ Sequence 444 BP; 138 A; 88 C; 88 G; 130 T; 0 other; tgttagtacc aatctgcgct gacgataccc cccgttgggg ctagagccac tggctataca 60 tgagctttag tttctggtag ggagaagaac gtcagtgtga cttgtcaggg tcgtttgtta 120 tcacagtgca atttcaccta atcgaacgca catacgcaat cgaaacagca gtagaattac 180 taatctaaag gtatgaatta tttgcatttt tcttgaacta gctaattaga attaaaattg 240 aatttataga ttctacctag gattgcgcga atcgaagagt tctttctgtt tagtggacga 300 acttaaaaag ggcactaaat gtaaatatga aatttaccgt catgaagaca cagcaactaa 360 taaatctccc tttttagctt tgagctgaac cacagtacga actaagtctg cttctttgat 420 tggcccgaac aaacaccgct aaca 444 // ID DNA8-68_AP repbase; DNA; INV; 764 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-68_AP. XX NM DNA8-68_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-764 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2003-2003 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 764 BP; 226 A; 101 C; 140 G; 297 T; 0 other; cagtgctcga attggggggt gcgcaggggg acggagctcc cttactattt ttattttttt 60 tttgcgtacc cctacttttt ttacttggac ggggggacgg tacaaactaa tgtccgaaat 120 aatttttatt aaaatgtgga tttcaattga tgtcaattat ataacaagtt ttactatttt 180 tcatatcaaa attatttttg atattagttt attaattaat tataatatca gcctgtgttt 240 tgcctatata gtcagtagtt cacagatcat agatcatata caagtatatt atctaaggca 300 cagatactga tgtgtgctag tactatctta aatcgtgttc cgaatgaaat gttatcagtt 360 tcagttatgg ttatcattta tcggtgtttt tgaattttta aatttttaaa tttttttata 420 cttatcaaaa tatgtggtct cttttgaatc aaaaacaata attattaatt ttgttatcta 480 tttagtatat tgaataaaca ggtagattat tattaatttt tttttctact gtaacaatgt 540 aagatgtaga tataagatat tgctatcaaa atatattata cctatttgga attcaaaatg 600 tgttgctgaa aagtggtatt caaaatgtat gtagataagg caggagcgtg gggggagggg 660 gcaaggcccc cacgggtctg tgttgagtaa atgcgccctt ggaacgcagt ttttaaatcg 720 ttagattgag ctcctctact taatttttcc caattcgagc actg 764 // ID RTE_Ele2B_AAe repbase; DNA; INV; 3399 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An RTE non-LTR retrotransposon from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW RTE_Ele2B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3399 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1442-1442 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. The consensus is ~87% identical to RTE_Ele2 and ~78% CC identical to RTE_Ele5. XX FH Key Location/Qualifiers FT CDS 336..3353 FT /product="RTE_Ele2B_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="GSISNFLPTTKNGERRNNERYFRQPTWQRNKDYDWKL FT GTWNVRTLNEPGRVSLLARELQKVGVSVAAIQEVRWPRTGEREFRAVDPVA FT NTAFKYNIYHSGSDKAEHGVGFIVIGKQMKRVMRWKPINERICVLRIRGKF FT FNYSLINIYAPTNDKPDDTKDAFYDCLDKTYGECPKHDVKVVIGDANAQVG FT REDFFRPIIGTESLHSATNDNGLRLITFAAARGMAISSTYFARKDIRKHTW FT MHPNGELCNQIDHVLVDGRHFSDVIDVRTFRGPNIDSDHYLVVSKIRARLS FT TVANTRTQRPLRFNIQRLSTEGVADEYRRKLDERIRGYNVGENLNNLWESI FT HGAMSTAAREVIGTAQRRPRNGWFDEECQNVTDEKNVARSRMLVSGTRQNR FT ERYKEARAAEKRIHRRKKRQYEEDIITEAQNSIEQNDMRRFYETVNGVRRK FT TAPSPVMCNDREGNLLTDKILVAARWKEHFEQLLNGEDTSASENRLIISND FT GQAVEPPTLDEVRRAIKELKNCKAAGKDELPAELFKHGSEQLYEMLHRTLL FT RIWEEEELPASWLEGLICPLYKKGHRLECANYRGITLLNSAYKIMSRVLFN FT RLRPLEESFVGEYQAGFREGRSTTDQMFTLRQILDKFREYNLQTHHLFIDF FT KAAYDSVKRNELWQIMVEHGFPAKLIRLIRATLDGSKSSVRVADETSTSFV FT TLDGLKQGDALSNLLFNIALEGAIRRAGVQRSGTIVTRSHMLLGFADDIDI FT IGIDRRAVEEAFVPFQRETARIGLTVNTSKTKYMIAGRQRGSNRDVGSEVV FT LGGEKFEVVEEFVYLGTLVTCDNDVTREVKRRLAAANRAFYGLRNQLKSRS FT LQTKTKFALYKTLILPVALYGHESWTLKEVDRKAFGVFERKVLRTILGGKQ FT ENGIWRRRMNHELYQVYKEVDIIKLIKHGRLRWAGHVVRMPEERQAKIIFS FT REPGRGRRLRGRPRTRWLFAVEEDLRALNVQGDWKRLAQDRAQWRRIIHSA FT " XX SQ Sequence 3399 BP; 970 A; 731 C; 947 G; 751 T; 0 other; tgggctgtga aatggtgagc cgacaactag gaaggagcgt ccaacatagc tctggtcctc 60 acaagtccct acctcacgct tccacgggtc aaatgatgac aaagaccgcc agctaagggt 120 tgcgtactta gctggtagtg cagcctgggc actgttgtcc ttctgacatc agctagagtg 180 aggaggtgcg tctcgagcgt ctgtccacca aggaggtgcg gctcaaacag cgtctgttct 240 ggcatccagc ggctgagtat gaaatgctgt atcacgtcag ctacacctaa ggtggcagcc 300 ccaccatcgt gatgtaggta tcgcgaccct ggtaaggtag catatcgaat ttcttaccta 360 ccacgaaaaa tggagaaagg agaaataacg aacgatattt tcggcaaccg acctggcaac 420 gaaataagga ctatgattgg aaactcggta cctggaatgt caggacgtta aatgaacccg 480 gacgagtgag ccttttggct cgtgaattgc agaaggttgg agtgagcgtg gctgctattc 540 aggaagtaag atggcctaga actggagaac gtgaattcag ggcggtggat cccgtcgcca 600 acacagcttt caaatacaac atctaccata gtggcagcga taaggcagag catggcgtcg 660 gattcatagt gatcgggaag cagatgaagc gtgttatgcg gtggaaaccg attaatgaac 720 gaatctgcgt attgaggata cggggcaaat tcttcaacta cagtctgatc aacatctatg 780 cgccgacaaa cgataaaccc gacgacacga aggatgcgtt ttatgattgt ctcgacaaga 840 cctacggaga gtgcccaaaa catgacgtaa aagttgttat cggggatgcc aacgctcagg 900 tcggaagaga ggattttttc cgccctataa ttggtacgga gagccttcac tccgccacca 960 atgacaacgg cctaaggcta ataacttttg ctgcagctag agggatggcc atcagcagca 1020 cctactttgc acgcaaagac atccgaaagc acacctggat gcacccaaat ggcgagctct 1080 gcaaccaaat agaccacgtt ttggtggacg gtcggcattt ctcggatgtc atcgatgtga 1140 ggacctttag aggcccaaac atcgactctg atcactatct cgttgttagt aaaattcgtg 1200 cacggttgtc taccgtggcg aacacaagaa cacagcgacc gttgcgtttc aatatccagc 1260 gcttatcaac agaaggtgta gcagacgagt accgccggaa gcttgatgag cggataaggg 1320 gatacaacgt aggcgaaaac ctcaacaatc tgtgggagtc tatccacggt gcgatgagca 1380 cagcagcgcg agaggtgata ggtactgctc agagacgacc cagaaacggt tggtttgatg 1440 aggagtgcca gaatgtgacg gatgagaaga atgttgccag aagtcggatg ttagtgtctg 1500 gtacccggca gaatagagag cggtacaagg aagctagagc agccgagaaa cgaattcacc 1560 gcagaaagaa aaggcagtat gaagaagaca tcattactga ggcgcaaaac agcatcgaac 1620 aaaacgatat gcggagattt tacgaaactg tcaatggcgt acggagaaaa actgcgccgt 1680 ctcccgtcat gtgcaatgac cgcgaaggta acttgctgac agacaaaatc ttggtggctg 1740 ccaggtggaa agaacacttc gagcaattgt tgaatggaga ggacacgagc gcgtctgaga 1800 acagattaat catcagcaac gatggacaag ctgtggagcc cccgacgcta gatgaggtta 1860 gaagagcgat taaggagttg aagaactgta aggctgctgg aaaagacgag ctcccggccg 1920 aacttttcaa gcacggtagt gagcagctgt acgaaatgct gcaccgtact ctgttaagga 1980 tatgggagga agaagaattg cctgctagct ggctggaggg cctcatctgc cctctgtata 2040 agaagggcca cagattggag tgcgccaatt accgaggaat aacgctcctc aattcggcgt 2100 acaaaataat gtcacgtgtt ctgttcaaca gattgagacc gctggaggag tccttcgtcg 2160 gcgaatacca agctggtttt cgtgagggcc gatcaacgac ggatcaaatg tttaccttgc 2220 gacaaatcct tgataagttc cgggaataca acttgcagac acatcatctg ttcattgatt 2280 tcaaggcggc gtacgattca gtaaagagaa atgagttatg gcagataatg gtagaacacg 2340 gttttccggc gaaactgatt agactgattc gtgcaacgtt ggatggatcg aaatcaagtg 2400 tgcgggttgc ggatgagact tcaacctcct tcgtaacctt agatggattg aagcaaggag 2460 atgcgctttc gaatcttttg ttcaacattg ctctcgaagg agctataagg agagcgggcg 2520 tgcaaagaag cggtaccatc gtcacccggt cgcatatgct cctgggtttt gcggacgata 2580 tcgatataat cggaattgat cgtcgagccg tagaagaggc attcgtgcct tttcaaaggg 2640 agacagcgag gattggactt accgtcaaca ccagcaaaac taagtacatg atcgctggta 2700 gacagcgtgg ttccaatcgt gacgttggta gtgaagtggt gctaggtggt gaaaaatttg 2760 aagtagtgga agaatttgtg tatcttggta ctttagtgac ttgcgataat gatgttaccc 2820 gcgaggtgaa aaggcgtctt gcagctgcga atagggcttt ttacgggctc cgtaaccagc 2880 ttaagtcccg tagcctgcag acgaaaacaa aattcgcgct atataagact cttatccttc 2940 cggttgccct atacggccat gaatcctgga cgttaaaaga ggtcgaccgg aaagcgtttg 3000 gagtttttga gcgtaaagtg ctgcgaacaa tactcggcgg taaacaagaa aatggcatct 3060 ggcggcgtcg catgaatcat gagttgtacc aagtatataa agaagtggat attatcaagc 3120 tcataaaaca cggcaggctg cgttgggctg gtcacgtggt acgaatgccg gaagaacgac 3180 aagcaaaaat aatattcagt agagaacccg gacgaggccg tcggcttcgt ggtaggccgc 3240 gcacacgttg gctttttgca gttgaagagg acttaagggc acttaacgtt cagggcgact 3300 ggaagcgatt ggctcaggac cgagcccagt ggagaagaat tatccattcg gcgtagattc 3360 aacgtagcgg attgtagccc atcaagtatc aagtaagta 3399 // ID LINE1_BM repbase; DNA; INV; 5158 BP. XX AC D26009; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE LINE1 repeat family; retrotransposon. XX KW L1; Non-LTR Retrotransposon; Transposable Element; BMLINE1; KW LINE 1 repetitive sequence; LINE1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5158 RA Ogura T., Okano K., Tsuchida K., Miyajima N., Tanaka H., RA Takada N., Izumi S. and Maekawa H.; RT "A defective type of non-LTR retroposon is dispersed throughout RT the genome of the silkworm, Bombyx mori."; RL Unpublished (1993). XX DR GenBank; D26009; Positions 121 5278. XX SQ Sequence 5158 BP; 1181 A; 1584 C; 1189 G; 1204 T; 0 other; attaggagga ggaatctcac acccccgatt actctcttta gacaaaccat accctgggcc 60 ccattccgat agaggcaccc gtcacgcgcg gacgtgctca tcgaccactc gatcgcccgg 120 cctttgttcg cccggctttt gttagcccgg ccattgttcg cgcggcctgt gttcgtggta 180 caccgccgac caagtgtttt cttggagttt tattgtttgg tttcctttgt tattctgtgt 240 cgtggtacat ctgccgacaa actgtcttcc tggaagtgtt taattgtttt gtttcttttt 300 gttatttctg ttttgttgtg atttttcttt gtcctgcaga atctgtgcca tcgcacctgt 360 gtcatagacg ctcaggtcga gaagttggcc tcgccgcaag gcagagtgtt ttctatgacc 420 ccagggtagc cccacctggc acgatagaca tcatggacgc tgtatttgcg gaatttctcc 480 gcttcgctac ccaaagctca cttccgaagt ttcaggccct tcaaggccga tcacactgcg 540 agccctctct cgtggactcc accgagctcg cttgctcctg cgtcgcctgt acctgcgtgc 600 aaagcttctg catcgagcac cgagctcggc gggctgcgtc cgcgtcgcta tactggcgag 660 taaaactgct gcgtcgtcgc gtggtcaccg ttccagcagc gcgatcatcc gcggcctccg 720 tcgcgccgtc caaaacacct actcgtaggt cacctgcgcc cgctcctcgt cctccgactc 780 tgactcggag atggaggtcg acctcgctcc cgcctcatcg acggatggat tcaccctagt 840 gcaaaagggt aagaagcgtg cgcggagtct cgagctcccg cggccgctaa aattagcaaa 900 gccgcgaacg cgtcgcgccc ccgccctcag actcccgttg cgcctccagc ccgtggccac 960 tccgtcgccg cgtccggtgg cacaaaataa agcccagacc cctcccccgg taatccttca 1020 agagaaggca gcttgggaac gagtttccct ggccttaagg ccaaaaatat caattttacg 1080 aatgcccgta acctcgcgaa cggcattcaa attaaggttc gaacagacgt acgtatccat 1140 agggcctctc ttcttacctc cgtaaggagc gtataagttt ccacacatat acgctccagg 1200 aggagcgcga acttcgtgcc gttatacgtg gcatccctaa agagttggat agacgagctt 1260 cggtcaaggc gaccttctcg aacaaggcta ccggttaact ccgtacaccg catgcacaca 1320 ggccgcggaa gggagccata taatatggtt ctcgtcgcct ccagcctacc cccgagggta 1380 agcaaatatt caacatccga acggtctgta gcctttccgg aatcgcagtc gaaaccccac 1440 acaagaaagg cacacctagc agtgccataa ctgcaattat acgggcattc ttcccgtaac 1500 tgtcacgcgc gctcccgatg cgctcaagtg cttaggcgat cacgcacggc cctatgcact 1560 cgcgatcaaa aaaccgcgac aagaaccgcc tagctgcgtc ctgtgtcgaa cacagggtca 1620 cccgcaaatt accgggatgc cccgagcccc gaaaataaat cgccgcgtcg ccgcaaaacc 1680 gccgtccgag cttccgcgcc cagacatcaa agcctcggca ccctctgtgt cgcaggctaa 1740 gccagcgttc gttccggcac cggtgcccag tgtctcggcc tgggcgaaac cgctgccgta 1800 catgaacacg gctacaactc cctcctccgc gattcgtccc gcccccgcga ctcgtccctc 1860 tcccgcgatt gcctccgaac cgtgtccgat caatctcgct ttagtgatcg acttctttca 1920 gtcgatcaac tttgagcgcg ttaacgcttt ggcgacgcca ttcgcactgc caccacatca 1980 caacacttta tcgccgttgt gcaagaatac gccgacgtat acgcgtcatt aaatacgtac 2040 gtcctcccct cactccgccg gtaatcaatg gcgtatataa gtagaataaa gccctatccg 2100 taacgatagg attttttaac gttacggtct cgcaaatcag cgtgatcagg tttctgactt 2160 tttgcgtgac catcaaattg atatcttttt agtgcaggag accctactta agcccgcgcg 2220 ccgtgaccct aaaatcgcca actataacat ggtcaggaac gacaggctct ctgctcgtgg 2280 tggtggtacc gtcatttact atagaagagc cctgcattgc gtcccactcg atcctcccgc 2340 gctcgctaat atcgaagcat cagtgtgccg aatctcactg acggacacgc gccgatcgta 2400 tcgcgtccgt ttatcttcca ccggataaga tcgtctaagc agtgatatcg aggcgctgct 2460 cggcataggg agctctgtca ttctggcggc gacctaatgt aacacatagt gaactcacac 2520 acacaacccg atgcagcgct gacgcgttag tcgataatct cgccttcgat atcatcgctc 2580 cgctaatccc gactcactac ccgctaaata tcgcgcatcg ccggatatac tcgacatagc 2640 gttattaaaa aacgtaactc tgcgcttaca ctcgatcgaa gtagtttcag agttagattc 2700 agaccaccgt cccgtcgtta tgaagctcgg tcgcgctccc gattccgttc ccgtcacgag 2760 gactgtggtg gattggcaac acgtgggcat cagcctggct gaatctgatc caccatcgta 2820 ccgtttagcc ggactctacc cgtctcctca ggataccgtg aagccataga catcttaacg 2880 tcacacatca cctcgacatt agataggtca tcgaaacaag ttgtagcgga ggacttcctt 2940 caccgcttca aattgtccga cgatattagg gaactcctta gagctaagaa cgctcgatac 3000 gcgcctacga caggtatcct accgcggaaa atcgtattcg aatgcgtgcc ctacaacgcg 3060 acgtaaagtc tcgcatcgcc gaagtccgag atgccagatg gtctgatttt cttagaagga 3120 ctcgcgccct cccaaaggtc ttactaccgc ttagctcgta ctcttcaaat cggatacggt 3180 agtaactatg cccccctcgt aggcccctca ggccgactca cggcgttcga tgatgacgaa 3240 aaagcagagc tgctagccga tacattgcaa acccagtgca cgcccagcac tcaatccgtg 3300 gaccctgtgc atgtagaatt agtagacagt gaggtagaac gagagcctcc ttgcaccctc 3360 tgatgtgtta ccacccgtca ccccgatgga agttaaagac ttgatcaaag acctacgtcc 3420 tcgcaaggct cccggttccg acggtatatc caaccgcgtt attaaacttc tacccgtccc 3480 actcatcgtg atgttggcat ctattttcaa tgccgctatg gcaaactgta tctttcccgc 3540 ggcgtggaaa gaagcggacg ttatcggcat acataaaccc ggtaaaccaa aaaatcatcc 3600 gacgagctac cgccgattag cctcctcatg tctctaggca aactgtatga gcgtctgctc 3660 tacaaacgct cagagacttc cgtctcatcc aagggcattc tcatcggtga acaattcggg 3720 ttccgcacaa atcactcatg cgttcaacag gtgcaccgcc tcacggagca cattcttgta 3780 gggcttaatc gaccaaaacc gttatacacg ggagctctct tcttcgacgt cgcgaaaagc 3840 gttcgacaaa gtctggcaca acggtttgat tttcaaacta ttcaacatgg gtgtgccgga 3900 tagtctcgtg ctcatcatac gggacttctt gtcgaaccgc tcttttcgat atcgagtcga 3960 gggaacccgt tcctccccac gacctctcac agctggagtc ccgcaaggct ctgtcctctc 4020 accctcctat ttagcttatt cgttaacgat attccccggt cgccgccgac ccatttagct 4080 ttattcgccg acgacacgac tgtttactat tccagtagaa acaagtccct aatcgcgaag 4140 aagcttcaga gcgcagccta gccctaggac agtggttccg aaaatggcgc atagacatca 4200 acccagcgaa aagtactgcg gtgctctttc agaggggaag ctccacacgg atttcctccc 4260 gtattaggag gaggaatctc acacccccga ttactctcgt tagtcaatcc ataccctggg 4320 ccaggaaggt caagtacctg ggcgttaccc tggatgcatc gatgacattc cgcccgcata 4380 taaaatcggt ccgtgaccgt gccgcgttta ttctcggtag actctacccc atgatttgta 4440 agcggagtaa aatgtccttc ggaacaaggt aacactttac aaaacttgca taaggcccgt 4500 catgacttac gcgagtgtgg tgtcgctcac gcggcccgca cgcacatgga cacccttcaa 4560 tctctacaat cccgcttttg cgggttggcc gtcggagctc ccgtggttcg tgaggaacgt 4620 tgacctacac gacgacctgg cctcgaatct atacagaaat acatgaagtc agcgtcggaa 4680 cggtacttcg ataaggctat gcgtcatgat aatcgcctta tcgtagccgc cgctgactac 4740 tccccgaatc ctgatcatgc aggagccagt caccgtcgac gccctagaca cgtccttacg 4800 gatccatcag atccaataac cttcgcacta gacgccttcg ctctaggagc aggcttaggg 4860 acctcggtaa ccgtactcgt cgaactcgac aaagagttcg ccgtgcaacc taacccatga 4920 atcagctcgc tgagtttctc gccggatctt ctcagcgggt cgcgattccg atccggtact 4980 agattcattc gcgaagcagc tgctcttgag ctgttaggtc tccttcggag gcgctcgggt 5040 agctgttagc aaatcccacc cctcctggct gagcctttgc tcgcccacct gtcctggtga 5100 aactggaaag gcctccgggc caccagtaat ccttcaatca taaaaaaaaa aaaaaaaa 5158 // ID hATN-2_SM repbase; DNA; INV; 492 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 28-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hATN-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-492 RA Jurka J.; RT "Non-autonomous hAT-type DNA transposon from Schmidtea RT mediterranea."; RL Repbase Reports 8(2), 159-159 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 492 BP; 175 A; 71 C; 82 G; 164 T; 0 other; tctagagcag agcttcccaa ccagtgtgcc gcggcccact ggtgtgccgt gaaacatttg 60 cgatgcgccg taggataatt gtggattttt taaaattttc ttacaaatta tctatgaagt 120 gataaatatt gttcgactaa tattattact aatattaaat aattaacata ttattatttt 180 tttcaattaa gtataataag ttgaaaattc agtaaagtaa aaagaaagta ggctctgcaa 240 gaaatataac gagtgaatga atagtcacat gtcgatattc ttttatttag tattcagttt 300 agtttcaagt atagaaccta atataagcaa gctttgttct caacatcaag gtcagatatc 360 acattaagct gctttataaa tgtaatacat aatgtatgtc caaaaaaatt attaaaaata 420 gtgtgccgct atatttttac aaaatagtta ccgtgccgta gactgaaaaa cgttgggaag 480 cactgctcta ga 492 // ID L1_Ele28 repbase; DNA; INV; 4770 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele28. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4770 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4770 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. There are 18 sequences with >96% identity CC to the consensus. The consensus is ~100% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 151..1494 FT /product="L1_Ele28_1p" FT /translation="MSSPKSRENTFRVDFSNLPKRMSYEETHNFVHNTLGL FT SLDAVVRLQVNHALNCAQVKCRDLKTAQETVIKHNAKHDVEINSTKYKVKL FT IMDDGGIEVKIHDLSENVTSEQICEFLKHYGDVIAVKELMWGDNFIYKGVS FT TGVRVAKMVLRKHIKSYVVVQGEQTLISYSGQPATCKHCANLVHPGMTCVQ FT NKKLITQKTDLTARLTRAREERSSYADIAGSSPILMPAQNNTGFRSAEALM FT PEFVKLGGMRKDQEATNTAKENSKLSVHSNIEGEVAGASTSFSTAAAAAGV FT EAAADAAEGDADVEEAVGGQLFSQSEELRTEIIIPMNVDLVEADHDTEVDS FT STVSGNTENAETLSQPQILNVYSQKRVPILATDTTKVSKTSVHQSSEKHFK FT VPAGICDGAMEVSESELEMRAHKEFEFSSDASCHDDGFVVKRSRHRSKKAR FT TTH" FT CDS 1499..4690 FT /product="L1_Ele28_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSVLNTFKSYNIASININAISNENKVNSLRSMIRLLD FT LDIVLLQEVENSRLSFPGFQVYSNVNETRRGTAIALKMHIPVSNVQRSLDS FT RIISVKINNSVTICNVYAPSGVQNYHSREDMFNQFVPYYLQSASEYVVFGG FT DFNCVTANKDATGVNNKSAALRTLIQNMNMRDTWDCLSREIEFSFIRSNCK FT SRLDRIYVSANLVEQLRSASFHVNCFSDHKVYRIRLCLPDLGRAAGNGYWA FT MRAHVLTEENVEVFENKWNYWCRQRRNYNSWIEWWIDYAKPRIKGFFKKKT FT NDSFREFHAKNEFLYARSKTAYEELYQHPNKMSEVNRLKGKMLLLQREFSK FT KFVRINDQCLEEESMSSYQLGDRIKKKSSSHITSIVDQQRQINGSAAVEEH FT VYQYFRDLYSASETQEYQDATTTTKIPENSQENLRIMDEVTTEEIFRAIKS FT SASKKSPGVDGLPKEFFVAAFNVIHRQLNLIVNEAMQGNIPQKFVEGVVVL FT CKKKGGENSIKGYRPLTMLNYDYKLVSRILKSRVASVIDKNNLLTHSQKCS FT NAKKNIFEALHAIKDRIAEIRHKKKRARVVSLDLDHAFDRVEHCFLFRVMR FT DMGFHERLIELLRKIMQNSRSRVLVNGHLSPEFQIGRSVRQGDPISMYLFV FT IYLHPLLEKLHMICNDPLELVVAYADDITVVIVDDSKLEQIKRTFHEFGQY FT SGAVLSLSKTIALNIGDLSRPQSVVNWPSVSDSVKILGITFFNDHKVMIEF FT NWKETIRKMTQLMWMFKPRLIVLHQKIRLLNVYVTSKLWFVASVLSIRNQD FT VAKVTKHIGTFIWGRKPRVAMNQLIQPVSKGGLNLHLPMYKCKALLANRTL FT QVREVIPFGNRLITEATNGNNPVFPDMYPCIKVVWRVVQDLPQQVMENICS FT SAIERHLTQELPIPKIISENPAVRWPSVFQNIHSKHMSAIEKTTMYLFVNQ FT KLPHAQLLYKIGQTTSPNCATCRLTVESQEHKLANCPKVARLWSYLCSKLR FT LHLNTRIEYSIMKTPEFRDVSRNSRIAVLKLFVKYVVYILEGSIPLNVQEL FT EHILNLN" XX SQ Sequence 4770 BP; 1565 A; 895 C; 1073 G; 1237 T; 0 other; cagttcgcgt tgaaacgctg agatgatcgg acgtgcgcgt tataatctca agcattttca 60 cgtggtcaaa taaagttagt gctctgaacc gtattctttt cgtcaggctg acgacacaaa 120 atcaaaggca ttttcaatgc acaattaacg atgtcttcgc caaaatcgcg ggaaaacact 180 ttccgggtgg atttttctaa tctgcccaaa cgaatgtcgt atgaagaaac gcacaatttt 240 gtgcacaata cattgggtct cagcttggac gctgtagtac gactacaagt aaaccacgct 300 ctcaactgtg ctcaagtgaa atgcagagat ctcaaaactg cacaagaaac tgtgattaaa 360 cacaatgcga aacacgatgt agagatcaac agtacgaagt acaaagtgaa attgatcatg 420 gatgatgggg gaattgaagt aaagatccac gatctttcag aaaatgtcac atccgagcaa 480 atatgtgaat tcttgaaaca ctatggcgac gttattgcgg tcaaagagct catgtgggga 540 gacaacttta tctacaaggg agtctctacc ggtgtacgtg tggctaaaat ggttcttcgt 600 aaacacatta agtcgtacgt ggtagttcag ggagaacaaa cactaatctc ctattcaggg 660 cagcctgcca cgtgtaagca ttgtgcaaat cttgtgcatc caggtatgac ctgtgtacaa 720 aacaaaaaac ttattacaca gaagactgac cttactgcaa ggttgacacg ggctcgcgaa 780 gagcgatcat cgtatgctga cattgctgga agttctccaa tcctgatgcc agcacagaac 840 aacactggat tcagatctgc cgaagctctc atgcctgagt ttgtgaagct gggtggtatg 900 aggaaagacc aggaagctac aaataccgcc aaagaaaatt ccaagctatc cgtacattcg 960 aatatcgagg gtgaagttgc gggagcgtct acttcgttct ctactgctgc tgctgctgct 1020 ggtgttgagg ctgctgctga tgctgctgag ggagatgctg atgttgagga ggctgtagga 1080 ggtcaactct tttctcagtc cgaggagctc agaactgaga taataattcc aatgaacgtg 1140 gatcttgttg aggcagacca tgacaccgaa gttgactcgt caactgtctc aggcaacact 1200 gagaatgccg aaactctttc tcagccacaa atactgaacg tctactccca gaaaagagta 1260 cctatactcg ctacagacac aaccaaggtt tccaaaacca gtgtgcacca gtcgtcggag 1320 aagcacttca aagtcccggc aggaatctgc gatggtgcga tggaagtttc agagagcgaa 1380 ttggagatga gagctcacaa agagtttgag ttctcatcgg atgcgtcgtg tcatgatgat 1440 ggattcgtcg tgaaaagaag tagacaccgt tccaagaaag ccagaaccac acactaagat 1500 gagtgttctc aacacattca aaagctacaa tattgctagc attaatatta atgccatatc 1560 caatgaaaat aaggtaaatt cgctacgatc aatgatacga ctattggatt tagacatagt 1620 tttattacaa gaagtggaga acagtcgctt aagcttccct ggcttccaag tgtacagcaa 1680 cgtaaacgaa acgcggcgag gaacagctat agcactaaag atgcacatac ctgtatctaa 1740 tgtgcaacga agccttgata gcagaataat aagtgtgaaa atcaataact ctgttactat 1800 ctgcaatgtt tatgcaccaa gtggtgtaca aaattatcat agcagagaag atatgttcaa 1860 ccagtttgtt ccgtattact tacaaagtgc atcggagtat gttgtgtttg gaggtgactt 1920 taattgtgta actgcaaata aggatgcaac aggagtaaac aataagagtg cagcgttgcg 1980 tacgctaatt cagaacatga acatgcgaga tacatgggat tgtttgagtc gagagatcga 2040 attcagtttc ataagatcga actgtaagtc acgattagac agaatttatg tgtctgcaaa 2100 tctagtcgag caattgagga gtgcttcttt ccatgtaaac tgcttttcag atcacaaagt 2160 gtatagaata agactgtgct taccagacct gggtagagct gctgggaatg gttattgggc 2220 tatgcgtgcg catgtgttga ctgaagaaaa cgtcgaagtg tttgaaaaca agtggaatta 2280 ctggtgtcgt caaagaagaa actataacag ttggattgag tggtggatag attatgcaaa 2340 accaagaatc aaggggtttt tcaagaagaa aaccaatgac tccttcagag aatttcatgc 2400 caaaaatgaa ttcttatacg cacgttcgaa aactgcttac gaagaattat atcaacatcc 2460 gaacaaaatg agtgaagtga accggttgaa aggcaaaatg ctacttctgc aacgagagtt 2520 ttcgaagaaa tttgtgcgta ttaatgatca gtgccttgaa gaagaaagta tgtcgtccta 2580 ccagttggga gacagaatca agaaaaaatc aagttcgcac ataacatcca tcgtagatca 2640 acagcgacaa attaatgggt cagctgcagt tgaagaacat gtatatcagt actttcggga 2700 tttgtattct gcaagtgaaa cgcaagaata tcaagatgca actacaacaa ccaaaattcc 2760 tgagaattct caagaaaacc taagaatcat ggacgaggtg acaactgagg agatttttag 2820 agcaataaaa tccagtgctt ctaaaaagtc tcctggagtg gacggattgc ccaaagagtt 2880 cttcgtagct gcgttcaacg ttattcaccg tcaattaaac ctgattgtga atgaagcaat 2940 gcaaggaaac attccacaga agtttgttga aggtgtcgtc gtgctgtgta aaaagaaagg 3000 tggtgaaaac agcattaaag gatatcgtcc actcacgatg ctgaactatg attataaatt 3060 agtcagccgt attctcaaaa gtcgagttgc aagtgtaatc gataaaaaca acctactaac 3120 acattcgcaa aagtgttcta acgctaagaa aaacattttt gaagctctac atgcgattaa 3180 ggatcgtatt gcagagatac gtcataaaaa gaagagagca agagttgtat cactggatct 3240 cgaccatgcg ttcgacagag tcgaacattg tttcttgttt cgtgtaatga gagatatggg 3300 tttccacgaa agactaatag agcttctccg aaagatcatg cagaactccc gctcacgtgt 3360 tctagtaaat ggtcatctct ctcctgagtt ccaaattggt cgttcagtga gacaaggaga 3420 ccctataagc atgtatctgt ttgtgatata cttacatccg ttgctcgaaa aattacacat 3480 gatctgtaat gatccgcttg agttagtggt ggcctacgcc gacgatatta cggtagtgat 3540 agtggatgat tcaaaacttg aacagatcaa acgaactttc cacgaatttg gacagtactc 3600 gggagcagta ttgagtctct cgaaaacgat tgcacttaac atcggagatt tatcacgacc 3660 tcagtcagtt gtgaactggc cgtctgtgag tgatagtgtt aaaattctgg ggatcacttt 3720 tttcaacgat cacaaagtta tgattgagtt caactggaag gaaactatac gtaaaatgac 3780 gcagttaatg tggatgttca agccgcgact catagtgtta caccagaaga taaggctgct 3840 gaatgtgtat gtaacgtcca agttatggtt tgtggcttca gtgctcagta taaggaatca 3900 ggatgtggcg aaagtaacga agcacatcgg gacctttatt tgggggcgaa aaccgagagt 3960 agcgatgaat caactaatcc aacccgtttc aaaaggaggg ttgaatctac atctacctat 4020 gtacaagtgc aaggctcttc tggcgaatcg aacgttgcaa gtaagagaag tgataccgtt 4080 cggaaataga ttgattactg aagcaacaaa cggaaataac cctgtgttcc cagatatgta 4140 tccgtgcata aaagtggtgt ggagagtggt ccaagatctt ccacagcaag tgatggaaaa 4200 tatctgcagc agcgcgattg agagacatct tacacaagag ttaccaatcc ctaagattat 4260 ttcagaaaac ccagcagtta gatggccctc agtgttccaa aatattcata gtaaacatat 4320 gtcagcaata gaaaaaacta cgatgtactt atttgttaat caaaaacttc ctcatgcaca 4380 actactgtat aaaattggtc aaaccaccag tcctaattgc gctacatgta gattaacagt 4440 agagagccaa gaacataaat tggcaaattg tcccaaagtt gctagattgt ggagctatct 4500 gtgctcaaaa ttgagattgc atttgaacac aagaatagaa tattcaataa tgaaaacacc 4560 agaatttagg gatgtaagca gaaacagcag aattgcagtt ctaaaattat tcgttaaata 4620 cgtagtttat attttagaag gctcaatccc tctgaatgtg caagagctgg aacatatttt 4680 aaatcttaac tagtgttgag aataggcggt gcgctttgta atgtgtatgt gaacgtaaac 4740 agtgaataaa tgtgtttaaa aaaaaaaaaa 4770 // ID Gypsy-24_DWil-I repbase; DNA; INV; 6078 BP. XX AC scaffold_181148; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_DWil_; KW Gypsy-24_DWil-LTR; Gypsy-24_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6078 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181148; Positions 4237305 4243382. XX CC Positions [2108-2608] - Reverse transcriptase CC Positions [3710-4039] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1667..4039 FT /product="Gypsy-24_DWil-I_1p" FT /translation="MMTLRTYDRIWPTNKPKVYNYDPWFSDYQHNPIQILG FT VFDVSIHYNGRNIDNLLVIVTKSGCSDLLGINWFDPLGISIKGIHSVGSDV FT QIAGILNKFEHLFSTELGCYTGPPVILNIDATVPPVRLPPCRIPYALKGLV FT EEELDRQCKQGILKPVEYSDWATPIVPVIKKDGSIRICGDYKSTLNKAVKP FT HCHQIPAINTLLASMEGGSIFAKIDLAQAYQQLVVDENSSLLQTISTHKGA FT FKVTRLQFGISSAPGIFQSCIENVVQNISGVLPYFDDIVIIGKTETELATR FT LQELFIRFDKAGLRLQKDKCQFGVPFVEFLGFKIDSTGIRPCADKVSAIKD FT ALNFYHSFLPHKATVAEPLHRLLDKNATWRWDKQCEKAFSTLKELIGSDDV FT LVHYDGSLPLVLACDASPYGLGAVLSHKMPNGAEKPIAFYSRTLSKAERNY FT AQIDREAIALVAGVKKFHNYVYGRTFTLITDHRPLLGIFTTSKAIPNIISN FT QMLRRSLFLSTYTFELVHRSGLKMGNADFLSRCPLSAASDAVSTDDVLMIE FT LAATPIISAEAIAAQTSKDPLLAKVSNWVLRGWPDMKLEQSDIAYPYYCRR FT EQLSTNKGVLIWGNRAVVPPKSRATILAALHAAHPGIVKMKALARSYVWWP FT NIDAEIELIVKKCIPCQQNRNDHSEAPVHHWESAKRPWSRLHIDFAGPFQG FT KTFLLVVDSYSKWLEVAVVSYTSTAATIKVLRQLFATHGLPDQLVSDNGTA FT FTSEEFKAFLHNNLIRHITSAPFHPATNGQAEITGTVYTF" FT CDS 4407..6071 FT /product="Gypsy-24_DWil-I_2p" FT /translation="MPNGAEKPIAFYSRTLSKAERNYAQIDREAIALVAGV FT KKFHNYVYGRTFTLITDHRPLLGIFTTSKAIPNIISNQMLRRSLFLSAYTF FT ELVHRSGLKMGNADFLSRCPLSAASDAVSTDDVLMIELAATPIISAEAIAA FT QTSKDPLLAKVSNWVLRGWPDMKLEQSDIAYPYYCRREQLSTNKGVLIWGN FT RAVVPPKSRATILAALHAAHPGIVKMKALARSYVWWPNIDAEIELIVKKCI FT PCQQNRNDHSEAPVHHWESAKRPWSRLHIDFAGPFQGKTFLLVVDSYSKWL FT EVAVVSYTSTAATIKVLRQLFATHGLPDQLVSDNGTAFTSEEFKAFLHNNL FT IRHITSAPFHPATNGQAERMVQTTKNYLKKLSPNNNMDLSLARFLFNQHIT FT PHATTNRSPAQLLLNRELKSYFDKLQPQEISNGAEPFINQSKCFDTGKSVW FT VRNYAPGPKWIEAYISEQTGPVSYKVRLTDNRIIKRHYNQIRSRVEQTDVN FT ISHHAPSETNDNTEDPASSNHSRPESPIQDDPQSLHASSMPRRSARMRRPT FT QFYESSGC" XX SQ Sequence 6078 BP; 1852 A; 1322 C; 1277 G; 1627 T; 0 other; aactggcgac gaggatggga tgctaattat catccgaaat aaacttggaa ttattgacgc 60 aaaatatcaa aaaattgtca cgcacaaaag catacatata cgtttacata tacatgctgt 120 ggtgaaattt tgcaaacgtt tacacacaca tttgcatggc agagtcgctt gctggtgaat 180 ttggagagat tagattggcc ggcgtttttg aggaaaagaa ggaagtaatt gattctacga 240 gtgctaaata tactcaacgg aacccaatta tgacgacagt tggaagtttg gaaccgtttg 300 atatgggaca gccaaataaa tggggctcat atttggagcg atttaaattg ttcctgcttg 360 ccaatgatgt caaagacgag ggtcgcaaaa aagcatcttt cctcacattg gtcggagcac 420 cagtctacga tcttttgaca tcattggcgt ttccaaacca ggtcagtaca ttaccgttag 480 accaaattga gaagattcta actgatcatc tttgtccacg tccatcggaa atcgcggcat 540 tttatcattt tcacaaacgt gatcaacatc cggaagtgga ggagtacaaa gtttatggaa 600 agtgagacca cttggaaaag tgaattacgc agcaatgccg tgaaaaaccc catcaaatat 660 tttagtattt tttttgcgga tgaactgggg cagaaaatat gagaggaaac gaacttatat 720 gccattcaaa cgaagagcgt agaattgaaa tgcactgaaa tgtttgttgg catacttctt 780 taccttgcag ttgtgaaaat accagcttat agaatggcgt gggtggattt caaactagcg 840 gccgttgcaa atgcattaca gatttgaaaa aattaaacag ttttttccat ttgaacgaca 900 attctaatca gcctcataag ggaacacctg aatatgacaa gttatatcca tcatcactcc 960 ttttgacaaa gaattcatcc tcattatcac caactccatt atgaccacct ccgttatcac 1020 cagccccatt atcaccagct ccatcttcac cagctccttc tacgggctct aagtcaaacc 1080 cccctaaaag agcttatgtt gcagacccag caaagtcact acgacttgat ggttataatc 1140 attggccgaa atggggaaca agaggaagat gtagagtatg cagatcggga ttttcaagcg 1200 gtaaatgtag caagtgtaat gtgcacttgt gctccaatcc ccaaaaaaat tgttttgtgt 1260 cctatcatac ataaatatta ataaacggca tttgaaaaaa aaaaagaatt tgacttcaaa 1320 tatataacac aattgcccaa tgttgcatat acgcaacatc ctgtagctct tgaaccagga 1380 caccaaggac attcaaattt ttttcagctg atattaaggt caagtctagc ttatatgtca 1440 attcaaaaaa attttagcca cgcccaattt aattcgggca tgagatctta agcgtgtctg 1500 ccggagtaat gactcgtcga acgatgttca tccagtaaag ggcaaaaatt cacgtgggaa 1560 atcggtaaac ctcatatcta atctggtgag cctgaagaaa cgtatcacgg tacagtttaa 1620 tggaaaatcg tgtaattttg aagtggattc tggatctgac gtaactatga tgacgttgcg 1680 gacttatgat cgtatctggc caacaaacaa gcctaaggtt tacaactacg atccgtggtt 1740 tagtgactac caacacaacc caatccagat actaggagtt tttgatgtat caattcatta 1800 caacggtcgt aatattgata atctactagt catcgtcact aaaagcggtt gtagcgacct 1860 tctgggtatt aactggttcg acccactagg catcagtatc aagggcatac attcagtggg 1920 cagtgacgta cagatcgcag gcattctgaa taagttcgaa cacttattct caacagaact 1980 tggttgttac actggtccgc cagtcatatt aaacattgat gctacagtgc caccagtgcg 2040 actacctcca tgccgaattc catatgctct aaagggacta gtcgaagagg agctggatcg 2100 ccaatgcaag caagggattc ttaaacctgt ggagtattca gattgggcca ctcctattgt 2160 accagttatc aagaaggacg gttcaattcg tatttgcggc gactataaat caactttaaa 2220 caaagcggtt aagccgcact gccatcaaat accggcgatc aacactttgc tggcatctat 2280 ggaaggtgga tcgatttttg ccaaaattga tctggctcaa gcttatcagc agttagtagt 2340 cgacgagaac tcgtctttgc tgcaaacaat cagtacacat aagggtgcat tcaaagttac 2400 tcgattgcaa ttcggcattt catcagcacc aggcattttt caaagctgta ttgagaatgt 2460 ggtacagaat atttccggag tcttaccata ttttgatgac attgtaataa ttggaaaaac 2520 ggaaacggaa ttggcaacta gattacagga actgtttata cgttttgaca aagccggact 2580 gcgactacaa aaggacaaat gccaattcgg agtaccattc gttgagtttt taggtttcaa 2640 aattgactca actggtataa gaccatgtgc cgataaagtc tcggcaatca aagacgcact 2700 aaatttttat cactcatttt tgccacataa agcaacagtg gcagaaccct tacatcgtct 2760 tttggataaa aatgcaactt ggaggtggga caaacagtgt gagaaggctt tttctacact 2820 taaggaactt attgggtcag acgacgtact agtgcattat gatggttcat tgccactagt 2880 gctagcttgt gatgcgtcac catatggact cggtgcagtt ttaagtcaca aaatgccgaa 2940 tggagcggag aaacccatcg ctttttactc ccgtacctta tccaaagctg agcgaaatta 3000 tgctcagatt gatcgagaag ctattgcctt ggtagcagga gtaaagaagt tccacaacta 3060 cgtctacggt cgtactttca cactaatcac cgatcatcga cctctgttgg gtatttttac 3120 cacatcgaaa gcaatcccaa acattatttc aaatcagatg ctacgtcgat ctttgttcct 3180 gtcaacatat actttcgagc tagtacaccg ctccggatta aaaatgggca atgcggattt 3240 tctaagtcgt tgtccactgt cagcagcatc ggacgctgtt agcaccgacg atgtgcttat 3300 gatcgaatta gcagcaacgc caatcataag tgctgaggcg attgccgctc aaacatcgaa 3360 ggacccctta ttggcgaaag tatcaaactg ggtgttaagg ggatggcctg acatgaaatt 3420 ggagcaatca gatatagcat atccatacta ttgccgtcgg gaacagctct ctacaaataa 3480 aggagttttg atttggggaa atcgcgccgt agttccaccg aaatccagag caacaatttt 3540 agctgctcta catgctgctc atccaggcat tgtaaaaatg aaggcgttag cgcgtagcta 3600 tgtttggtgg ccaaacattg acgctgagat tgagcttatt gttaagaaat gtattccatg 3660 ccagcagaat cgtaacgatc attcagaggc accggtgcat cattgggagt cagccaagcg 3720 gccgtggtcc cgtctacaca tcgactttgc aggacctttt caaggaaaaa cctttcttct 3780 ggtcgtggat tcttactcaa aatggctaga ggttgcggtt gttagctaca cttctacagc 3840 ggctacaatc aaagttctca ggcaactgtt tgctacgcat gggttaccag atcagttagt 3900 gtcagacaat ggcacagcat ttacctccga agagttcaag gcattcctcc acaataatct 3960 aattcgacac attacatcag caccgtttca cccagcaaca aacggccaag cagagattac 4020 aggaactgtt tatacgtttt gacaaagccg gactgcgact acaaaaggac aaatgccaat 4080 tcggagtacc attcgttgag tttttaggtt tcaaaattga ctcaactggt ataagaccat 4140 gtgccgataa agtctcggca atcaaagacg cactaaattt ttatcactca tttttgccac 4200 ataaagcaac agtggcagaa cccttacatc gtcttttgga taaaaatgca acttggaggt 4260 gggacaaaca gtgtgagaag gctttttcta cacttaagga acttattggg tcagacgacg 4320 tactagtgca ttatgatggt tcattgccac tagtgctagc ttgtgatgcg tcaccatatg 4380 gactcggtgc agttttaagt cacaaaatgc cgaatggagc ggagaaaccc atcgcttttt 4440 actcccgtac cttatccaaa gctgagcgaa attatgctca gattgatcga gaagctattg 4500 ccttggtagc aggagtaaag aagttccaca actacgtcta cggtcgtact ttcacactaa 4560 tcaccgatca tcgacctctg ttgggtattt ttaccacatc gaaagcaatc ccaaacatta 4620 tttcaaatca gatgctacgt cgatctttgt tcctgtcagc atatactttc gagctagtac 4680 accgctctgg attaaaaatg ggcaatgcgg attttctaag tcgttgtcca ctgtcagcag 4740 catcggacgc tgttagcacc gacgatgtgc ttatgatcga attagcagca acgccaatca 4800 taagtgctga ggcgattgcc gctcaaacat cgaaggaccc cttattggcg aaagtatcaa 4860 actgggtgtt aaggggatgg cctgacatga aattggagca atcagatata gcatatccat 4920 actattgccg tcgggaacag ctctctacaa ataaaggagt tttgatttgg ggaaatcgcg 4980 ccgtagttcc accgaaatcc agagcaacaa ttttagctgc tctacatgct gctcatccag 5040 gcattgtaaa aatgaaggcg ttagcgcgta gctatgtttg gtggccaaac attgacgctg 5100 agattgagct tattgttaag aaatgtattc catgccagca gaatcgtaac gatcattcag 5160 aggcaccggt gcatcattgg gagtcagcca agcggccgtg gtcccgtcta cacatcgact 5220 ttgcaggacc ttttcaagga aaaacctttc ttctggtcgt ggattcttac tcaaaatggc 5280 tagaggttgc ggttgttagc tacacttcta cagcggctac aatcaaagtt ctcaggcaac 5340 tgtttgctac gcatgggtta ccagatcagt tagtgtcaga caatggcaca gcatttacct 5400 ccgaagagtt caaggcattc ctccacaata atctgattcg acacattaca tcagcaccgt 5460 ttcacccagc aacaaacggc caagcagagc gcatggtgca gaccaccaaa aactacctta 5520 aaaaactatc tccgaacaat aatatggatc taagccttgc tcgatttttg tttaatcaac 5580 acataactcc acatgcaacg acaaatcgat cgccagcaca gctattgtta aatcgggagt 5640 taaagtccta ctttgataaa ctacagccac aagaaatttc taatggagca gagccattta 5700 ttaatcaatc gaaatgtttt gatacaggca aatccgtttg ggtacgaaac tatgcaccgg 5760 gtccaaaatg gatagaagcc tacatatccg agcaaacagg accagtgtct tacaaagttc 5820 gtttgacaga caatcgcatc ataaagcgtc attataacca aattcgaagc cgagttgaac 5880 aaacagatgt aaacatatca catcacgcac catcagaaac taatgacaac acggaagatc 5940 ctgcttccag taaccactct aggccagaga gtcctattca agatgatcca cagtcattgc 6000 atgcctcatc gatgccacga cgttctgctc gcatgagacg acccactcaa ttctatgagt 6060 catctgggtg ttaagggg 6078 // ID Gypsy-12_DWil-LTR repbase; DNA; INV; 285 BP. XX AC scaffold_180716; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_DWil_; KW Gypsy-12_DWil-I; Gypsy-12_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-285 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180716; Positions 21168 20884. XX SQ Sequence 285 BP; 95 A; 44 C; 49 G; 97 T; 0 other; tgtaagattc caattattta aagtctgtta aatactcatg catatatcga ttgttagtaa 60 taagtatcga ttgttagtaa taagtatcga ttgttagaaa taagtatcga ttgttagtaa 120 taagtatcga ttgttagcga taagtcctcg ataggttgag agacggcgtc tctcttttct 180 tccggcgacg caacgagcaa gttgcatctg caaaaaaccc aaagatttaa tccgaccaat 240 acaaactaat ctttatcatt tttacttaat aaataggtgt ttaca 285 // ID BEL-54_CQ-I repbase; DNA; INV; 4075 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-54_CQ_; KW BEL-54_CQ-LTR; BEL-54_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4075 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 261-261 (2011). XX DR [2] (Consensus) XX CC 'CCATG' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 834..2336 FT /product="BEL-54_CQ-I_1p" FT /translation="MAPNLRELEKQQSHLRRTLEAIQQFVNAYDATRDADQ FT VDVRLELLDETFNQFRFVRIKIELLTEEDDLDVEVVEGETEEERAEREADA FT NKKRKEKNVKVLMEAEDFYCAVKAKLNKKRGPLETVPPVLVPSAVDARPAV FT GLSHVKLPDINLPIYTGELSEWIVFRDTYRSLIHNNAQLSDFDKFTYLRSS FT LLGEALLEIAGIDVSAVNYDVAWTTLEQRYDNKKLIVKAHLDALIAVESMK FT QESYAALNQLIGSFDKHLMMLKKIGQDTDIWSTLLVHMVCSRLDGNTLRLW FT ETHHKSKEVPKFQDQMKYLRGHCLVLQSVAPSKPNAEEEKQRRLSVSHAVT FT HSVNKCPFCNELFHSPFHCLLFLKKTVDERMEAARRRNLCLNCLRAGHSTR FT SCTRGSCHHCHQYHNSLLHINAVPERPSASQALTRPTPEQQQYQQQQPQYQ FT SMGQQQPQIQPSIQNQSSNTAQSNTHSQNTQQLSTTEQCTNHNTTTLSISS FT HNRKN" XX SQ Sequence 4075 BP; 981 A; 1035 C; 1058 G; 1000 T; 1 other; ttttggtcct tcggcatccg gattgcgaac gttttttgga ccatccactc gttcgcgacc 60 tgtgacccgt ggagaagaaa acggattcgt gaagtgcact tacgtgcgtg ctgtgcgaaa 120 gaacttttac tgtgcgtgct gaaaagtgcc aaaaactctg tgttgtgacc tgtgagctgc 180 tgtacgagag gtcggtgacc tgaacggacg gttcagtgac cccggccatc gctgtgttgt 240 tcctttgtgc gattggaacc gtacaacatc atcgccatcg cggatcggga cttggcgcgc 300 tgctgattcg gtgcgatggt ggacttctgc aggtgttacg actacaattg gagccgctga 360 agatcggcca tcgtgaagtg gtcgttcctg aacaccgtcg tgctgcgcgc tggaagctgt 420 tcctgatcct ggacctgctg ttcgagattt gctgtttctg ttgggaagta gttgcatgat 480 acggttggtt gtgggcttgc atgtggacat ttctgtgttg ctgagtgtgt ttggagaaat 540 aaaacgaaat taaacagatc tgcatgtgag tttcttccac tctgtggaca tttcctgagc 600 cctggttctt ccttctgttg gacggttggt cttggattcc gctggtgttc cgccgttggt 660 tgtctgctcg gttggttccc ctcgctgctg tctgtctggt cggtcgctgt caattcgcgt 720 gggcgttgag tgagctgtga tttggagttc tgtgcagtgc agtcagtgat ctacctgtga 780 agcgggttag ctgtgatcag tggtgtgatt tgtagtcgcc gtgagtgttc gacatggcgc 840 ctaatttacg cgagttggaa aagcagcaga gtcacctgcg gagaacccta gaagccattc 900 agcagtttgt gaatgcgtac gatgcaacga gagatgctga ccaggtcgac gtccgtctag 960 aactcttgga cgagacgttc aaccagttcc gttttgtgcg aatcaagatt gagctgctga 1020 ccgaagaaga tgacttggat gtggaagtgg tggaaggtga aacagaagaa gaacgtgcgg 1080 agagggaggc tgatgcgaat aagaagcgga aggagaagaa cgtgaaggta ctgatggaag 1140 ctgaagattt ctactgtgct gtgaaggcga agctgaacaa gaagcgcggt ccgctggaga 1200 ccgtcccacc ggtcctcgtg ccatctgctg ttgatgcgag gcctgctgtt ggtctgtctc 1260 atgtcaaact tccggacatc aacctaccga tttatactgg tgagctgagt gagtggatcg 1320 ttttccgtga cacgtatcgt agcttgattc acaacaacgc ccagctgtcg gattttgaca 1380 agtttacgta tctacgctcg tcgctgctcg gtgaagcgct tctggagatc gctggcatcg 1440 acgtttcagc cgttaactac gatgtggcct ggacaaccct ggagcaacgc tacgacaaca 1500 agaagctgat cgtcaaggct catctagatg cgttgattgc ggtggaatct atgaagcagg 1560 agagctacgc ggcgctgaat cagcttatcg gatcgttcga caagcacctg atgatgctga 1620 agaagattgg gcaggacacg gatatctgga gtactctgct ggtacacatg gtgtgctcgc 1680 gcttagatgg caacacgctg cgtctttggg aaactcatca taaatccaaa gaagttccaa 1740 agttccaaga tcagatgaag tacttgcgcg gtcattgctt agtactgcaa tcggttgcgc 1800 catccaagcc caatgctgaa gaggagaaac aacgacgtct gtcagtgagt catgcggtga 1860 ctcactcggt gaacaagtgc ccattctgta atgaactgtt tcattcacct tttcactgtc 1920 tgctgttcct gaagaaaacc gtcgatgaaa gaatggaagc tgctaggaga cggaatctgt 1980 gcctgaactg cttgcgtgct ggacactcta ctcggtcctg cacgagagga tcgtgtcatc 2040 actgtcatca gtaccataac tcactgttgc acatcaacgc tgtaccagag agaccctccg 2100 cctcgcaagc actgaccaga ccaacacctg agcagcagca gtatcaacaa cagcagccgc 2160 agtatcagtc aatgggccag cagcaaccac aaatccaacc ttcaatacag aaccagtctt 2220 caaacactgc tcagtcgaac acacactcac agaacactca gcaactgtcc accacagaac 2280 aatgcacaaa ccacaatact acaacactgt caatcagttc ccacaaccgt aaaaattaaa 2340 ttttactttc aaccgttcaa atttgcatcc gtgatctgcg cggaaacaca cgcctagtca 2400 gagcactgct cgactcgtgt tcgcaatatt cgtttatgtc ctccgcatgc tgcaagaagc 2460 ttgatcttct ttgcactcca gattacctga ccgtactcgg aattggcggg tcgtctgtcg 2520 tctcacggca gctagtatca gcgaacgtac aaccccgttc gtccgctctg ccgcagttta 2580 accgcgagat ggatttttac gtgcttccgg agttgacatc cgctttgcca aatcagaaca 2640 tcaacactac tgcgtgggag tttccgaaca acatcatcct agctgaccca cacttcaacg 2700 aacctggaga agtggagctg atcattggtg ctgaacatta tttcgatctc ttacgagatg 2760 gtcgatcacg catcgccgaa gatggccctg tactgcagaa tactgttttc ggttgggtcg 2820 tatcaggacg tgtaccaagc agatcaacaa gcgcacacgt tgcagtcgcg cagctgaact 2880 ccgcacctac caaccgtagc gactccgtgc cttggcggcc ggcgtgccga ccgggacacc 2940 ctcaagaact tgatgcctgc tcaacgttcg gctgcacgtc aaagcattgc aaccggatcc 3000 gtggaactgc tgagaacaaa ctgtcgatat taagcagtgc gcacaactgt gcgtaaccgt 3060 cgtcagtgaa gatctgatct tgcacttgac tgtccaaact tgaaaaacaa catttcgtct 3120 gtacgactcc cctgagtcga tccgttttac tgtctgggct ccagtgagcc aaattcgagc 3180 cttccgtccg tcaacaatca acctttgcat gaagtccgtt atccgtaaac atctgtccgt 3240 ctttggaatg caatccagta ccgcactgta ccgttctgtc ccgtcgcgca aaagcgctca 3300 tttgcatgaa aacaatcaac tataaactac attttgtcgc cgtagaagcc aagaggacac 3360 gtgtgcacct gcgtgcacag cgagacgaat tctagaccct cgtctcccca gctcagtaag 3420 cgtgatcatc tgcgagtctg atcaactcgt ttcggaggtc gaagcagccc tgagaaccag 3480 cagcacaatg tggtcctgca cgtgcacaaa tgtyggtcgt cccagcgcaa gagcgctcga 3540 ggttagttag cctccccaag gacatccgtc aaagctccag acagtcaaca caacatcgtt 3600 gtcgagaagc agtcagttcc tgaaaacatc attagcggct tgtggccgca accgtttcat 3660 ctggctacaa ctgtggccaa gccaagctgt cgccgtctgt tgcagtacaa gttgcacctg 3720 tctgtttctg ctgcgccggt acaagctgtt catccgagtg gccatcgttc gaggatccag 3780 cactaaagca ctcggggtgg cgctgtcgca atcggacggg ttggactgcc gcccctaaga 3840 agcccaacgt taagtatcct gtaaatgtat gttgtcgttt tagactctga gtcactcttt 3900 ttggcaggta cttaccgtct gcggtcatcg ctggatcccc agagtcagaa ctgcactcca 3960 agcaacatgc aaaacagaag cagtgataag tggagagata acagatatca gagaagagaa 4020 gaagagagca gggtcgtagt ctgtttaaaa ccttaggttt tcaaggcggc cggca 4075 // ID ERE1_EH repbase; DNA; INV; 7160 BP. XX AC . XX DT 25-JUN-2008 (Rel. 13.1, Created) DT 25-JUN-2008 (Rel. 13.1, Last updated, Version 1) XX DE Repetitive element from Entamoeba histolytica - consensus DE sequence. XX KW Nonautonomous; Eh_ERE1; ERE1_EH. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-7160 RA Lorenzi H.A. and Caler E.; RT "GenBank accession number EU099442."; RL Direct Submission to Genbank (09-AUG-2007). XX RN [2] RP 1-7160 RA Lorenzi H., Thiagarajan M., Haas B., Wortman J., Hall N. RA and Caler E.; RT "Genome wide survey and discovery of repetitive elements in three RT Entamoeba species."; RL Repbase Reports 8(10), 1683-1683 (2008). XX DR [2] (Consensus) XX CC Positions [1..2252 ; 4971..7160] Inverted repeats. XX FH Key Location/Qualifiers FT CDS 4640..3534 FT /product="ERE1_EH_1p" FT /translation="MEELINKNNQLIKEVVETLPNIQQSIDDTTKNYQYYI FT GSIKNEVFEIKENETYSEAIKRGEGIFDRIDETERRRKIVIEKSQETINQI FT EIMKNKMSRIIEECNNDEETLSKTRNELIDKIISIEEMIMKNKIFRKPEER FT DKITTEFNKKKEEWETYYSDYLERKKRKKKKERKKKQEEERRKEEERKKQE FT GERLRIMKGMNTKEEMIQLEEWTKRKVGNILFDSDIDNWNKNTSVFDQRIM FT NKEHIIIIIEDEEGNKFGGYVNSKIDKVDDCINDSKSFVFSLESNGRMKGM FT MKFDIKEPQYAFYLYNQSDDCLFEFGYCDIFVYKENNKTESYCKQYSFEYE FT GISNALCGKQFPNHFTPKRIIVIEMK" XX SQ Sequence 7160 BP; 2610 A; 701 C; 552 G; 3297 T; 0 other; aaataaataa gaaagaatga ctctttgttg tattttcaaa ttaattaata attaatatta 60 ataaattaat gaaatgaata ttaatgagaa taatttaatt taaacaaaat gaaataaaaa 120 gaatgataaa taattaatat ttcattgaat aaaatattat ttaactttca tatttcatat 180 attaattatt tcatttatta ttcattctat ttaaatcatt gtttatttaa aatataaatt 240 atcatatttt atattaaatt attgaaaata aatttaaata aaaattcatt tactttcttt 300 aagtttttaa attacttcat caaacataca gtagaaatag aatcatatca tctttctgat 360 gatgttcaat cattatacat acttattatc tttttaatta tttcttcttt atgatgaata 420 ttgctgataa gttattttct agttcttctt ggtcaacaaa ttcaatatct tcttttacaa 480 acacaaatat tttggtaatt ttgttttcta tttatttctt tcacttcttt cttgttcttc 540 ttcttgttgt ttttaggaag aataatttat ttagtttcag ttctttcaat ttcccaatca 600 ttataattaa tttcttctat gtatttcatt cttttatttc tttgcatttc tttattttta 660 tttatttggg tatgcttttc ttgtttaatg ttttgatctt ttttatctct ttcttcagta 720 atactatgat aatcattttc tcttttcaac ttcttctaca ttaattgatg ttaataatat 780 tatcattgtc atcactctta caattccatt ttatctttgt gtattatcat aacaagatga 840 actttttgat tcattaaatt acattggaaa tttattattt tccagttatt ccattaaaaa 900 tttccaacaa tttttgttgt tatttatcta cttttattct tttcttcatg aacttcttta 960 tgaatagaat aataacattt aatcaacacg atacaaacat gtctctaaat gtctgatatt 1020 ggaaaaacat tctatattat tttatccatt gttttattat tttgtgcaaa tcctggttga 1080 aattcaaaac aatattaatt atttaaagtt cttttttttc ttttttaata tatcaacact 1140 atttattcat atatattctt ttcattcttc tgtgtcttgt aacacagacg tatcaataat 1200 aaatatatct catttatcac tttctcatta tgttcatttt gtttctttag tttctgattt 1260 tgcgttatta ttttctttca aaacattctt cttcattttg aatttacaaa atgtattttc 1320 aatcacttgt atttcaattg ataagtaatt taatttgttt aatattcatt gattgtattt 1380 ttagttgata gtcgttttaa ataacaatta aacatgtcaa ttatttatca atttgaagaa 1440 ttgaagttat ttattattaa aaaacatttc atttgaaata aataattaca attctacatt 1500 tttttaatat tcttcacatt atattttatt taaatgaaaa acaacaaaat attttatcga 1560 atgttacata aacattattt tattttaaga taattttatt aatattgttt ttatttaatt 1620 ttacattaaa aatgtatttt tgaaataaaa ggaaatagaa aatatttaaa aataaatgaa 1680 taaatttatc gataatttaa cataacagtt tgtttttgtt tattggtttg aaatacaaac 1740 atcattaaat aaaaataaat ttgtaatgtt atgattgaat aaaaaatgaa taaataattt 1800 tacatatttt tgaaatatat ttattattat aatttttaat ttccattaaa caattatttt 1860 tttttattta gtgacaatga ttatattttt attcatggat tattaattag aatgaatgat 1920 gataatgttg ttaaataatc aaaataaaag aatgaaatat gattaaatta tttattattt 1980 catcttcaat aaacaaaaag aaatattaac aatattgata ttttctcata ttaaatattt 2040 taatattaaa atattgatta tttcttctat tatgttgtta tttaatgtaa ttttatttga 2100 ttttcatttg attcttttat tcattgtgta tttataataa ttatttcatt tcttttctct 2160 tttatttctt attcttattt tattatctta tttttgttta tcatttatta tctttattaa 2220 attaatttga ttgaattaat gttgataaaa caaagtaata ataaaaataa taacaacaaa 2280 taaataaaac aaagataaat aattgatatt atttattttc tattattctt tattcatatg 2340 agaataatca tctctttata ctttaatatt gtttattctt catttattat ttagttttga 2400 attaatgaga atgttgattt tctattcatt attcttttat ttaaattgaa atttcttttt 2460 aattttcatc attttaatta ttctcattga attatcattt atttatttct tattttatga 2520 atgaatattt cattattatt atattattat ttataatttc atttattatt aatatcattt 2580 atatcttttt attttcttat tattatttca tattatttta tttcattctt tctattatta 2640 tttattatat ttcttattaa attaattcat catattcaat tattatttga tatttcatta 2700 tttgattttg tattaatgtt tgtgtgttgt ttctgatttt cattcttcat ttctcattca 2760 ttttcattta ataaaataga agaataaaaa taatgaaaga agataaataa aataagaata 2820 aaataagata aaatcaatga tattgttatt atattattat gttattttag ttttgatttt 2880 atatttatta ttttattttt atcttttata atcatcattc atttcttctt tttgtcattt 2940 cctcttttct aaacatcact caatttattc tatttatttc catttcattc tattttcttt 3000 attcttgttg ttattttatt tagttcatct tttatcattt gtcatttctt gtttattcat 3060 gtttgttgta tattttatta tttctttatt ttatgttgtg ttattcttat ttatctttct 3120 tattatgttt tgtttattat tgtttaaata gaaaataaga ataaataaag aataagaaat 3180 aataacataa atatgttgat gtattttatc atttctttat ttgtttatta ttcttatttc 3240 tattatctta ttcatttgat tcataatgaa ttaattcaat gaaataatac aaataaattg 3300 tttatttatt tcattttatt tctattttat tcttatttct tttcttattt attaaaaaca 3360 ccctcattgg gtgtgaatat ttcaaatgaa ataaaataaa tcaacattga attatgtcaa 3420 tttttaattc accatttcat tcattggcag atgaatgaaa tggctgttca tattcttcat 3480 tgatttgttt tttattcttt tttattcatt ttcaattaat caagattgaa tcatttcatt 3540 tcaattacta tgattcgttt tggtgtgaaa tgatttggaa attgttttcc acaaagtgca 3600 tttgatattc cttcatattc aaatgaatat tgtttacaat atgattctgt tttattattt 3660 tctttataaa caaaaatatc acaatatcca aattcaaaca aacaatcatc tgattgatta 3720 tataaataaa atgcatattg tggttcttta atatcaaatt tcatcattcc tttcattctt 3780 ccatttgatt ctaatgaaaa tacaaatgat tttgaatcat ttatacaatc atcaacttta 3840 tcaattttag aattaacata tcctccaaat ttatttcctt cttcatcttc aataataatt 3900 attatatgtt ctttattcat tattctttga tcaaatactg atgtattttt attccaatta 3960 tcaatatctg aatcaaatag tatatttcct acttttcttt ttgtccattc ttctagttgt 4020 atcatttctt cctttgtgtt cattcctttc attattctca atctttctcc ttcttgtttc 4080 tttctttctt cttcttttct tctctcttct tcttgtttct tttttctttc tttcttcttc 4140 tttctcttct tcctttctaa ataatcactg taatatgttt cccattcttc tttcttctta 4200 ttaaattctg ttgtgatttt atctctttct tctggttttc tgaatatttt gtttttcatt 4260 atcatttctt caattgaaat gattttatct attaattcat tccgtgtttt acttagtgtt 4320 tcttcatcat tattacattc ttctatgatt cttgacattt tattcttcat tatttctatt 4380 tggttgattg tttcttgact tttttctatc actatttttc ttcttctttc tgtttcatca 4440 atcctatcaa atattccttc tcctctttta attgcttctg aatatgtttc attttctttt 4500 atttcaaata cttcattttt aatacttcct atataatatt gataattctt agtagtatca 4560 tcaattgatt gttgaatatt tggaagtgtt tctactactt ctttaattaa ttgattattc 4620 ttgtttatta actcttccat tttaatttaa tttgatgttt ataagttcaa aatatggatg 4680 tttaaatagt taatccacgt tattaacaat caagtcaatt attattttta tcatcatttt 4740 attttaaata aatgttattt ttatcatttt tacaataaca ataaatatat aataatatta 4800 ataatataat aataataaaa ttataatatt gttattttct ataattagtt tattttattt 4860 ttagtatttt ttcatttttt tttaaaaaaa attttaaaaa aatattgaag agaagaaaat 4920 tcaattctgt tttattttta ttatttcatc gtttgtgatt tgaattttca tgttttatca 4980 acattaattc aatcaaatta aattaataaa gataataaat gataaacaaa aataagataa 5040 taaaataaga ataagaaata aaagagaaaa gaaatgaaat aattattata aatacacaat 5100 gaataaaaga aatgaaatga aaatcaaata aaattatatt aaataacaac ataatagaag 5160 aaataatcaa tattttaaat aataaaatat ttaatatgag aaaatatcaa tattgttaat 5220 atttcttttt gtttattgaa gatgaaataa taaataattt aatcatattt cattctttta 5280 ttttgattat ttaacaacat tatcatcatt cattctaatt aataattcat gaataaaaat 5340 ataatcattg tgactaaata aaaaaaaatt aattgtttaa tggaaattaa aaattataat 5400 aataaatata tttcaaaaat tttgtaaaat tatttaatta tttttattca atcataacat 5460 tacaaattta tttttattta atgatgtttg tatttcaaac caataaacaa aaacaaactg 5520 tcatgttaaa ttatcgataa atttattcgt ttatttttaa atattttcta tttcctttta 5580 tttcaaaaat acatttttta tgtaaaatta aataaaaaca atattaataa aattatctta 5640 aataaataat aaaataatgt ttatgtaaca ttcgataaaa tattttattt tttatatttg 5700 ttgtttttca tttaaataaa atataatgtg aagaatattt aaaaaatgta gaattgtagt 5760 tatttatttc aaatgaaaat gaaatgtttt taataataaa taacttcatt cttcaaattg 5820 ataaataatt gacatgttta attgttattt aaaacgacta tcaactaaca acaatcaatg 5880 aatattaaac aaattaaatt acttatcctt tgaaatataa gtgattgaaa agtacatttt 5940 gtaaattcaa aatgaagaag aatgttttga aagaaaataa tacgcaaaat cagaaactaa 6000 agaaacaaaa tgacataatg agaaagtgat aaatgagata tatttattat tattctctgt 6060 attacaagac acagaagaat gaaaagaatg tatatgaata aatagttgat tatattaaaa 6120 aagaacaaaa agaattttaa ataattaata ttgttttgaa tttcaaccaa ggttttcata 6180 aaataatata acaatggata aaataatatt gaatattttt ccaatattag acattcagag 6240 acatgtttgt atcgtgttga tcaaatgtta ttattctatt cataaagaag ttcatgaaaa 6300 aagaagaaaa caaaagtaga taaatatcaa caaaaatttt gaatttttaa tggaataact 6360 ggaaaatttc tttcctgttt ttgtctttct tgtttttcgt tgttgtgtgt gctggttctt 6420 tgtgataaat gaaaaagaga tattgattat catagtatta ctgaagaaaa agataaaaaa 6480 gaacaaaaca ttaaacaaga aaggaataac caaataaata aaaataatga aattgcaaag 6540 aaataaaaga atgaaatata tagaaaaatt aactactgtg atttggaaat tgaaagaact 6600 gaaactaaat aaattattct ttctaaaaat taaaaacaag aaatgaaaga aataaatagc 6660 aaaaaattac caaaacattt gtgtctgtaa aagaagatat tgaatttatt gactaagaag 6720 aactagaaaa tattttgtca gtactactca tcataaagaa gaaacaatta aaaggataat 6780 aagtatgtat aatgataaac aatattagaa agataatgat tctatttcta ccatatactt 6840 gatgaaatat taaaaacatt aaataaacta aatgaatttt tatttaaatt tattttgaat 6900 aatttaatat aaaatatgat agtttatatt ttaaataaac aataatttaa atagaatgaa 6960 taataaatga attaattaat gtaagaaata tgaatgttaa ataatatttt attcattgaa 7020 atattaatta tttatcattc ttttcatttc attttattta aattgaatta ttctcattaa 7080 tattcatttc attaatttat taatattgat tatttattaa tttgaaaata caacaaagaa 7140 tcattctttc ttatttattt 7160 // ID BEL-181_AA-I repbase; DNA; INV; 6367 BP. XX AC supercont1.4; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-181_AA_; KW BEL-181_AA-LTR; BEL-181_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6367 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.4; Positions 2631139 2637505. XX CC Positions [5418-5975] - Integrase core CC 'GAGAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1115..4915 FT /product="BEL-181_AA-I_1p" FT /translation="MSQRFPIPVAQPQEESLHQPPVVWRFSLPAPAWQPPH FT SSGFQHAPEPLISHQTRFESGVPDHHQSSTSELVVPRLCQPNPVESPQLDP FT VYSVRRVSSAEQQPQARDAQVPAWGPTPQQLAARQVMAKELPIFSGNPEDW FT PLFISSYHNTTQACGYSQAENLARLQRCLKGHALESVRSRLLLPQAVPHVI FT ATLETLYGRPELLIHTLLQKVRGVPAPKHDRLDTLIGFGMAVQNLCDHLEA FT GGQEAHLNNPMLLFELMEKLPANLKLDWSLYKQRCGDVNLRVFAQYMSVLV FT RAATDVTLHFGPRPLQQKWDARTEKSGKDKNFCGAHSEENLATFRKEESVD FT KESSSFSTACLVCKDTDHRVKECAQFAKKSLDERWKAVQQFGLCRNCLGAH FT GRRPCKVFKRCGVDGCQMRHHTLLHFRQEENEDKATTNHHFAGKTTLFRIV FT PVTLFGNNRAISVYAFLDDGSERTLVEEEVVRELGVTGKHIPLCLQWTANV FT KRTESDSQRVALQISGANGTKHELIDVRTVSRLELPRQSMNHSKLAEAFPY FT LRGLPIQDYDQALPRILIGTDNAHVTATLKLREGQPGEPVAAKTRLGWTIY FT GMNSTDKVEHRAQTFHICDSRDEDTLHDMLVQFFSVESLGIATEACPESDE FT VQRAKKILQATTKRVGKRFETGLLWKFDNIEFPDSYPMAVRRLQCLERRIQ FT SDPVIGESIKRQLKEYQTKGYIHKASEEELKEADPRRVWYLPLGVALNPKK FT PSKVRIFCDAAAKVDGLSLNAMLLKGPDLLNTLFGVLFGFREKRIAICADL FT MEMFHQIQIRKEDRHAQRLLWREDPTKTPEVYLMDVATFGATCSPCSAQFV FT KNTNATEHAKEYPAAADAIHRKHYVDDYLDSADNVEEAIKIAEEVRHVHSL FT GGFHLRNWLSNSKEVLTRVGENGQATEKNLQMDEGSSAERVLGLFWRPEPD FT DFTFSTELVKHVEQPTKRQALSIVMSPFDPLGFLSFFLIHGKILIQDVWRA FT KTLWDQTIPEKLWERWLQWTEYFQRLDQISIPRCYFPRHSVQDFISLQLHV FT FVDASEDAFACVAYFRAEFKDDIEVALVAGKSKVAPLKALSIPRLELMAAV FT IGARLKKTIINEHSLRIDQAIIWSDSKTVLAWINSDHRSYRQFVACRIGEI FT LSKTRVSQWRWIPSKENVADDATKWGKGPSLSPSSRWFRGPDFLYLSEKHW FT PVDESGTLRTSEELRACMMHREVVLHHHVKWERFSQWKRLCRAIAYVHRYV FT QNLRRR" FT CDS 4932..6365 FT /product="BEL-181_AA-I_2p" FT /translation="MIGPLTQTELASAETTIWRWVQSEEFPDEVAILSKKH FT GQYPERLMKLDRTSKLRKLSPFMDETGVIRMESRIAAATFASFDTRFPIVL FT PKEHLVTGLLIGWYHRKYLHGNGETVVNEVRQRFHVPSLRMVVRKEKKKCV FT WCKIKLAVPAIPRMAPLPAARLKAFERPFSYIGVDYFGPIAVRVNRTSAKR FT WVALFTCLTIRAVHLEIVHSLSAESCKMAFRRFVARRGSPVEVYSDRGTNF FT VGSSNELQQEMNAIDQQLAETFTNAHTKWIFNPPAAPHFGGAWERLVRSVK FT TALSAMYTSRVPNEETLATLMAEAEGVVNSRPLTFIPLEEEQQEALTPNHF FT LLLSSNGVVQTTKTLADSREACRNQWNLCRVMVDAFWRRWVREYLPDLTRR FT TKWFDDVAAIQPGDLVIIVNDSVRNGWTRGRVTEILKGMDGRTRQAVVKTA FT AGLVRRPIAKLARLDILKGKTEPDIPEQLTGRG" XX SQ Sequence 6367 BP; 1814 A; 1460 C; 1642 G; 1451 T; 0 other; aaatctaaaa gatatttgcc attatgccgg taagtacgcg cagtgcttct ggaagaaaca 60 cccgatgcca ggcatgtaac gagtcggata cgtcgcggat ggtttcttgc agtcattgcg 120 ttctatggtg gcactacgag tgcgtgagcg tgaatgagtc catcgccgaa ccggatcgca 180 cattcgtttg ccccaggtgt cagaggccgt caccaatctt tccatctaat ccagatcctc 240 tatcggcaaa tagcaataaa aggctcacca gtttgtctgg aatttcatcc acctccagcg 300 tgaaagctcg gagggctcgc ctccagctgg agaaactcga ggcacagaaa gctctgatgg 360 agaaacgttt ggagcaagca cgtcgagaac agcaaatcag gcatgagcag gagaaattga 420 tgcaggaagc tgagatggag aaagtacgtt tgcagatgga ggaatccatc ctggaggaaa 480 cattccgggc tagggaggaa gagttgctgg acgaagagca agacgaaaaa agctgtacgt 540 ctgaacagag cagtatcagc aaagtacggc agtggcaaca ccagcaaggt ttctctgcac 600 aagggacgac gatcacagaa ccgacgccga gtggcacggc ggctactgaa gtgaaacaag 660 cagccgctat tgggttagaa cgtaatatta tagcggccga cggtacgtca tttaatcagt 720 attccgataa tctagctgta agaagaatca ttgataggac aggcagtata gtagggccta 780 gcgtagagtt aaatgcacct tctcgcacag aagttctggg cgcacagtca ttagaaatgc 840 gtcatcaaat ttacctgaaa agttaacaca aagtaaggaa aattgtctcc gaaaaagtac 900 gtacggtttg cctgtcaaac aacattatcc tcttcctccg gaaaacgtag acctgtctcc 960 taaatctctt aatcccatcc atcctactct cggtaataat caatcgggta cagtctccgg 1020 caacctaagt gataacgtta atgtgaacac caatgatcca gtgtaccaaa ccggtatagg 1080 acgaagtacc gctcagtttc cagtacatag tgatatgtct caacgatttc cgattcctgt 1140 agctcaaccg caggaagaat ctctgcatca accaccggtc gtatggcgat ttagcctacc 1200 agcgccggct tggcaaccgc cgcactcatc tggatttcaa catgcacctg aacctctgat 1260 atcgcaccaa acgcgtttcg aatccggagt cccagatcat catcaatcat cgacttcgga 1320 attagtcgta ccgaggttgt gtcaaccgaa tcctgttgaa tcaccgcaac tcgatccagt 1380 gtattccgta aggcgggtgt cttcagcaga acagcaacct caggcaagag acgcacaagt 1440 accagcatgg ggaccaacac cacagcagct agctgcacgg caagtgatgg cgaaggagtt 1500 gccgatcttt tctggcaatc ccgaggactg gccactattc atcagctctt accacaatac 1560 cacacaagct tgcgggtatt cacaggctga gaatcttgca aggttgcagc gatgtctaaa 1620 agggcatgca cttgaatcgg tacgcagtcg tcttctgctt ccacaggctg ttccacacgt 1680 cattgctaca ttggaaacgt tgtacggtcg cccggaacta ctgatccata cacttttgca 1740 gaaagttcga ggagttccag caccgaaaca cgacaggttg gatacgttga taggatttgg 1800 aatggcagtt cagaatcttt gtgaccacct ggaagccgga ggtcaagaag cacatcttaa 1860 taacccaatg ctgctcttcg aattgatgga aaagctgccc gcgaacctga aattggattg 1920 gtcactatat aagcagcgat gtggggatgt aaatcttcgc gtattcgctc agtatatgtc 1980 tgtcctggtg cgtgcagcga ctgatgtgac attacacttt ggtcctcggc cattgcagca 2040 aaaatgggat gcgagaaccg aaaaaagtgg caaggataaa aacttctgtg gtgcccactc 2100 tgaagagaat cttgctactt tcagaaaaga agaatcggtg gacaaagaat caagctcgtt 2160 tagcacggcc tgtctggtat gcaaagacac cgatcatcga gtgaaggagt gcgcccagtt 2220 cgccaaaaaa agcttggatg aacgatggaa ggctgtacag caatttggat tatgccgcaa 2280 ttgcttgggc gcgcatggaa ggcgaccgtg caaggtcttt aagcgatgtg gcgttgatgg 2340 atgtcagatg cgacatcata cgctattgca ctttaggcaa gaagaaaatg aagacaaagc 2400 caccaccaac catcatttcg cggggaagac aaccctgttc cgaattgttc cggtaacgct 2460 gtttgggaat aatcgggcaa tctcagtgta tgcgtttcta gacgatggat cggaacgaac 2520 tctggtggaa gaagaggtgg ttcgtgaact gggagttaca ggcaagcaca ttccactatg 2580 cctgcaatgg acggcgaatg ttaagaggac ggaaagcgat tcacaacgcg tagcgttgca 2640 aatctctggt gctaatggaa cgaaacacga gttaatagac gtacgaacgg tcagcagatt 2700 ggagttgcct agacagtcaa tgaaccattc aaaacttgct gaagcgttcc cgtacttacg 2760 aggattgcct atacaagact acgaccaagc actaccgcgt atcttgattg gaaccgataa 2820 cgcacatgtt accgcaactc tcaaactacg agaaggacaa ccgggagaac cagtggcagc 2880 aaagacacga ttaggatgga ccatctacgg gatgaactcg accgataagg tggaacatcg 2940 tgcacaaaca ttccatatct gtgacagtcg ggatgaagat acgcttcacg atatgctggt 3000 gcagttcttt agcgtagaaa gcttgggaat agccacagaa gcgtgtcccg aatcagatga 3060 agttcagcgg gccaagaaga ttcttcaagc gacaaccaaa cgggtgggca agcggttcga 3120 gacagggcta ctctggaagt tcgacaacat agaattccca gacagctatc cgatggcggt 3180 caggcgacta caatgtttgg agcgacggat tcagagcgat ccagtgatag gagaaagcat 3240 aaagcggcag ctgaaggaat accaaacaaa aggatacatt cacaaagcat ccgaagaaga 3300 actgaaggaa gcggatcctc ggcgtgtgtg gtatttaccc ctcggcgtgg cgttaaatcc 3360 gaaaaaacca tctaaagttc gtatcttctg cgatgcagca gcaaaggtag acggtctttc 3420 gttaaacgca atgcttctga aaggaccaga tctacttaac acattgtttg gagttctctt 3480 cggtttccgt gaaaagcgta ttgccatttg cgctgattta atggaaatgt ttcatcaaat 3540 ccaaatacga aaggaagatc ggcacgccca acggctgctt tggcgtgagg acccaacaaa 3600 aactccggag gtctacctga tggatgttgc cacattcggc gccacatgtt ccccttgctc 3660 cgcacaattt gttaagaaca cgaatgccac ggaacatgcc aaggagtacc cagctgcagc 3720 agatgccatt catcgcaaac actacgtgga tgattatctg gacagtgcgg acaacgttga 3780 agaggcgatt aagattgcag aggaggtgcg acacgtgcac tcacttggcg gtttccattt 3840 gcggaattgg ctgtcaaact ccaaagaggt tctcacacgg gtcggggaga acggacaagc 3900 tactgagaag aatttacaaa tggatgaagg aagctcggct gaacgtgttc ttgggttgtt 3960 ttggaggccg gagcctgatg atttcacatt ctccacggaa ttagtgaaac atgtggagca 4020 gccgactaaa cgtcaagcat taagcatagt tatgagcccg ttcgatcctt taggattcct 4080 atcgtttttt ctgatacatg gtaaaatcct aattcaagac gtgtggcgag ctaaaaccct 4140 gtgggatcaa acgataccgg agaaactttg ggaacggtgg ttgcagtgga cagagtattt 4200 tcaacgactg gatcaaatta gtattccacg ctgctacttt ccacgccatt cggtccagga 4260 ctttatatcg ctacagctac atgtctttgt agatgcgagt gaggatgcct ttgcgtgtgt 4320 ggcttacttc cgggcagagt ttaaagatga catcgaggta gcactagtag ccggaaaatc 4380 caaagtggcc ccgctcaaag ctttatctat accgagatta gaactgatgg ccgcggtgat 4440 cggagcccgt ctaaaaaaga ccataatcaa cgaacactca ttgcgcatcg accaagccat 4500 tatatggagt gattctaaaa cggtactcgc atggatcaat tctgatcatc gaagctaccg 4560 tcaattcgtc gcctgtcgta taggagagat actctcgaaa acccgtgtaa gtcagtggcg 4620 ctggatacct tcaaaggaaa acgtggcgga cgatgcaaca aaatggggca aagggccaag 4680 tctatcccca agcagtcgtt ggttcagagg gccagatttt ttgtaccttt ccgagaaaca 4740 ttggccagtt gatgaatctg gaacgctgcg gaccagtgaa gaactccgag cctgtatgat 4800 gcatcgtgaa gtggtgctac atcatcatgt gaaatgggag cgtttttcgc agtggaaacg 4860 cttgtgccgt gcgatagcat atgtgcatcg ctacgtacaa aacttgcgac ggaggtgaat 4920 aaaacagaca gatgattgga ccgctgactc aaaccgaatt ggcatcggcc gagacaacta 4980 tctggcgctg ggttcaaagt gaggagttcc cggacgaagt agccattctt tccaaaaaac 5040 atggacaata tccggagcga ctgatgaaac tggatcgaac tagcaaactt cgtaaactgt 5100 cgccgttcat ggacgagact ggtgtaatac gcatggaaag tcgaattgcg gcagcaactt 5160 tcgcgtcttt cgacactcgg tttccaattg tcttgccgaa agagcatctt gtgactggac 5220 tgttgattgg atggtatcac cgaaaatatc ttcatgggaa tggagagacc gtcgtcaacg 5280 aggttaggca acgcttccat gtgcctagtc ttcgtatggt agttcgcaaa gagaagaaga 5340 agtgtgtatg gtgcaagata aagctagcgg tcccagcgat tcctcggatg gctccacttc 5400 ctgcagccag actaaaggca ttcgagcgtc cattttcata tattggagtg gattattttg 5460 ggccgattgc cgtacgtgtg aatcgaacaa gtgccaaaag atgggtagct ttgttcactt 5520 gtcttaccat cagagccgtt cacttggaaa tcgtccattc actctccgca gaatcctgca 5580 aaatggcttt tcgaagattt gtagcacgaa gaggatcacc ggtagaggtc tacagcgatc 5640 gaggaaccaa ctttgtggga tccagtaacg agttgcaaca ggagatgaac gctattgatc 5700 aacagcttgc cgaaaccttc accaacgcac acaccaaatg gatattcaac ccgccagccg 5760 ctcctcactt tggcggagca tgggagcgct tagtaagatc tgtgaagaca gcgctatcgg 5820 ctatgtacac atccagagtt ccaaacgaag agacgctggc aactttgatg gcagaagctg 5880 agggagtagt aaactcgcga ccacttactt tcattccgtt ggaggaggag caacaggagg 5940 cattaactcc taaccatttt ctcctgctta gttccaacgg agtagtccaa actacaaaaa 6000 cactagcgga ctctagggaa gcctgtcgaa accagtggaa cctatgtaga gttatggttg 6060 acgcattctg gcgaagatgg gtccgagaat acctgccgga tttgacacgt cgcaccaagt 6120 ggttcgacga tgtagccgct attcaaccag gtgatttggt gatcatcgtg aatgatagcg 6180 tacgcaacgg atggacccga ggccgtgtta ccgaaatact taaaggaatg gacggtcgca 6240 ctcgtcaagc tgtggtgaag acagctgcgg gactagttcg acgaccgatt gctaagttgg 6300 cgcgactaga tatactgaag ggtaagactg aaccggacat accggagcag cttacgggtc 6360 ggggaga 6367 // ID ORGANDY_BM repbase; DNA; INV; 548 BP. XX AC AB091367; XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Bombyx mori transposon Organdy. XX KW MITE; ORGANDY_BM; miniature inverted repeat transposable element; KW terminal inverted repeat. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Komoto N., Sezutsu H., Yukuhiro K., Banno Y. and Fujii H.; RT "Mutations of the silkworm molybdenum cofactor sulfurase gene, RT og, cause translucent larval skin."; RL Insect Biochem. Mol. Biol 33(4), 417-427 (2003). XX DR Genbank; AB091367; Positions 1 548. XX CC A MITE (miniature inverted repeat transposable; element) CC inserted in the silkworm og gene (molybdenum; co-factor CC sulfurase) in a mutant, ogt. XX SQ Sequence 548 BP; 144 A; 127 C; 120 G; 157 T; 0 other; tgccggaacc acactgcgct attcgctatt cgttatgcga gctattcacg cgcatacggc 60 gaagtgtgga gagtatattc gccaatattc gctttattcg tacgcataac acataaggac 120 gaatagcaaa tacgcgtatc cgtcgacttg tcggtttttt gacacacttc aaaggacacg 180 aataagccga atatgcgcat cgactgccca cacttcggta tgtcattcgc tcgcttgtga 240 gttgtagcgg tcgtgtatcg acgtgtctta tgacttgtaa tcatccacca gcgttttctt 300 ttcggtcttt cagtaacttt ttacaacaag tattggtaag aaacatttag gaaataataa 360 attgcggcaa ctctcaacac gtccatcgct ccacgcaaac aagacacata tttgccttat 420 ttggcataat tcagcgcaca agttcgaagt gtggatggca tgtttgttgt ttgcgcgctt 480 cacagatatt ttggcgaata cggcgtagtc acataacgaa tggcgcataa cgcagtgtgg 540 tcccagca 548 // ID Gypsy7-I_Dpse repbase; DNA; INV; 7039 BP. XX AC Unknown_singleton_20; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7_Dpse; KW Gypsy7-LTR_Dpse; Gypsy7-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-7039 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1063-1063 (2009). XX DR Genome; Unknown_singleton_20; Positions 17528 10490. XX CC Positions [6222-6731] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2407..3873 FT /product="Gypsy7-I_Dpse_1p" FT /translation="MGLDRSDDEGASASAICCRLCKVTMAPEDVYRTPCQH FT EFHKTCMRTYLKTQTTCPVCNAVCSAPTAGQPPLVPGKTRQQTKSVLAGGS FT KQSQAGSEGQASISSQASTSNQAANSNQVSREITAAIRGMQNELLIHLTEK FT MSQRISTTVEARLASNAQIPQPSRGSLNMNASLDQLLEIPEGNQGAVYTPP FT LRTSLSRSAASDLAQRPDKVGHIMNNWKLRFSGASDSISVDSFIYRVEALT FT HQTLEGNFMLLCSNANTLFEGKASDFYWRYHKTAGSVRWMDLCEALRKQFR FT DSRTDVDIRQMIRDRKQREKETFEVFFDAIQKLTDRLEQPLSHKTLVEIVR FT RNLRLEIQHEILNQKIDSIEALRDICRRRECFMEEVRRTHGYQKPAPFRKQ FT ISELVDEALGEDAEEFSESEFGEEIAALSLVCWNCRKEGHRYHDCVAKRKI FT FCFGCGAPNTYKPSCKKCQKNTKEKTAESRRLTSVPQRGSAKQD" FT CDS 3792..7037 FT /product="Gypsy7-I_Dpse_2p" FT /translation="MSKKHQGEDGRVSSADQCSAEGVGETGLDVASGETQG FT EPSRDFGEAEPPIVGWHINLIEGKKLAPPVEIVKSRRLTRNARRIKAHFKS FT LKCINEIRKKEGDKRPYAEVKLLDRTVVGLLDTGAGMSCVGGQLAKEVVCS FT KKPFKLVSSNAFTADGKKHQIVGKLTTKIEYRGESKIIELFIIPNLSQDLY FT LGIDFWSLFELLPSEWKISEICESSDVHTLTSAQEAALNQVISLFPSFAVS FT GLGRTDMLCHTIDVGNAKPVKQRHFSVSPAVEKSLYGEVDRMLRLGVIEES FT GSAWSSPVVLVQKPGKARLCLDSRKVNAVTKKDAYPLPQIDGILSRLPKAE FT YVTSLDLKDAFWQIPLEVASRDKTAFTIPGRPLYQYKVMPFGLCNASQTMS FT RLMDKVIPAALRNQVFVYLDDLLVISDTFPSHLEVLRSVAEHIRHAGLTIN FT IEKSRFCRRSVKYLGHVIGEGGIKTDPEKISAITKFPLPRTLRALRSFLGM FT AGWYRKFIHDFASLSAPLTDLLNQKKKFVMSQDGIEAFEKLKETLCKAPVL FT RSPDFGKPFYIHCDASNTGVGGVLVQKTEEGDEMPIAFVSKKLNKAQRNYT FT VTELECLAALVCIKKFRAYVEGHQFTVITDHASLKWLMSQNVLHSRLARWA FT LKLQGFRFSIEHRKGTQNIVPDALSRVHQDEIAAIDKSNGLLVDLDSPHFR FT AVGYLELVERVKSNQERLPDVKVVDNLVYRRAEHANGDLLHDSFAWKLWVP FT KELISEILKQSHDDPLASHGGIHKTLERVRRYYFWPGLVNDVKAYVNNCSE FT CKTTKAPNFTLRPPMGKPAESQRFFQRLYIDFLGPYPRSRSGHVAIFIVLD FT HYSKFVFLKAVKKLTADVIIKYLREDLFHTFGVPETIVSDNGSQFRAEKFQ FT SLLKSHRISHTLTAVHAPQANASERVNRSVIAAIRAYVRPDQKDWDENLSS FT ICCALRSSIHSGVGATPYYMVFGQQMITSGGTYSLLRSLEMLEDRAAAFTR FT EDSLELVRKKAREVMDKQNARNEKQYNLRSREIVYQVGQEVYRRNFKQSNF FT QAGYNAKLGHAFVKARVRRKLGNSL" XX SQ Sequence 7039 BP; 2039 A; 1449 C; 1714 G; 1837 T; 0 other; cccgtttggg taagttgtgg aataaaaaaa gacagcctag cgaaagagta taaaggtgtt 60 ctgtgggtgc aaatttccat aggtcttaaa ttggaggagg ctcgtcttcc acgaccgcca 120 gctgaaattc gtagaaattc gccggagctt tgggtcgtcg ccttggagca gttcggcgag 180 ccagagagga gaggccaact agtgcccggc agacggcagc cgcgtcggct tgttcatcgc 240 tccgcgtatg cgtccatacc ccgtgcgcag gggaggaggc aaagaaaaaa aaaaagtgtt 300 cttcagtggc ggcaaatagt ggcagatcga gttagaaaat ttctcataga aggccagccc 360 agcccaaaat aacaataaca aaaaaaaaaa aaagaatcaa atataagaaa caaataacgg 420 atcgtagtca gatgctctaa gcatcaatac ccgagttatt ttattctccc tctcatttag 480 gcttctttag ctctctatcc ctatctatct ctctccacct aactttgcgc tgcgggcgag 540 cccccacata ccggtgcccg ataagcgccg ctaatctgca cataggaatg caagactgag 600 cgcagaggaa tgtaaggcgg cgtgagggac gggaagaaaa gagacagaca atcaccgtta 660 aaattaactc gatctgcctg agccactcgg tcgtcatctc atttttttgt gtgtgcgaat 720 tgtgtgtgtt cttctttttt tttttttata cttttcaggg gcccaaatcg gaagcgcttc 780 gtcgatggag gttcctgtaa gtcgtttaat tgaataaata aaatatgtat atgaaagttc 840 gaaaaatata taaaaatatg aaagttcgcc acgggagcgc acccaattag tgggttgaaa 900 agagagggca tcagaaagtt cacctcttga ttcttttccc tacgcctctc gtacattaca 960 ccttctctgt tggtggatcg ctatgcccag caaatgctca cataaaattt tcttacattt 1020 cttttctgtt cactcctatt caaatcatct gttctctctc atatctagcc taagcttctt 1080 atttatttac cgtactttac tcaaattatt taaaattaag tcgtagccaa tactcgggga 1140 agggagaagc aaatgagaac attaaattac tggggatcac taattttagt cataagaaat 1200 aaataaacgt ttataatgtc tgagtagaag caatagtgtt tgatctcctg cagcaagccc 1260 ctgtaggtct cgcaaaatat aaggggtctt gtaacaaatt atgtggtatc gggcacagca 1320 agaccaaggt agggtgggcg aacatcactc ctcatgaaca gctgaggaat tcttttcgtt 1380 gtccgttcac cagtgcagtt attgtctttc gttttaggtg ttgtgctttc cgtccggttt 1440 catatggctt attgacacgc attaaatgta aaataataat agggtaggcc tcagcttagg 1500 agcaaataaa caagaacacc acactcgtgg ccgacgagcc agtagccgga aaattgacag 1560 cgtgccgctt caagctgaga ggaccatctc gcctacatcg tggttgcacg ttacatttat 1620 tattattatt ggcgcccaac gtggggcctc actgttaggt gggaattatt acaatttgta 1680 cttaggggtg tggttcacga acaatgacct aaaacaaacc ttcacttttt aaattttcct 1740 tccatcgact gttcgatgtc tttattgaag acaaagaaac ccaaatcatt tccgctttca 1800 ttcagttctc ttacggatct ttggaatacc gatagaaaac taagccccag ggtatgtttc 1860 cgggccacga ttctaagggt tctgagtgag tttgacttat ttctgtacaa atagcctgat 1920 tgtcgtgatt aataggaata ctactagaat agctttaaga agctatctgt tgtccatgtc 1980 atttcagtct gcgctgtaag tgctactcaa ctttatgcct gagggaccaa gggaaaactg 2040 aaccgtcttg gattcactgg tgatgcattt agctaaagtg cactcctgac cacggatcat 2100 ttcattaagt gcctcagttt gctttgtctt atctttgttt ggcaggttgg atcactgctt 2160 ctataagcgc tgttggacgc aggtgcgcag aatttagaat taccactgca aacggcctag 2220 gtgttctaac ttgccagaaa gtattttcct ctttagctgt acttggtgaa ataattcgga 2280 cagtttaatt aatattcaaa tttcatttca cttttaaatt tccttattac atatatataa 2340 gaaaacatac ataaccttta cgagttaagc agatagtttt tttagtaaat acctataaga 2400 agaaagatgg gcttggatcg gtctgatgat gaaggggcaa gcgcctcagc aatctgttgc 2460 aggctgtgca aagtcacaat ggcccccgaa gatgtttata gaactccttg tcagcatgaa 2520 ttccataaaa catgcatgcg aacatacctt aagacccaaa caacctgtcc agtttgcaat 2580 gcggtttgta gtgccccaac ggctggtcag cctcctctgg tgcctggcaa gactagacag 2640 cagacaaagt cagttctcgc gggaggcagt aagcaatccc aagcagggtc ggagggtcaa 2700 gcgtctattt caagtcaagc gtctacttca aatcaagcgg ctaattcaaa tcaagtgtct 2760 agggagatca cggcggctat taggggtatg cagaatgaat tacttatcca cctgacggag 2820 aagatgtctc aaaggataag taccactgta gaagctcgtc tagcttctaa tgcgcaaatt 2880 ccgcagccgt caagagggag cctgaatatg aacgcttctt tggaccaatt gctagaaatt 2940 cccgagggga accagggcgc agtttacaca cctccactcc ggacttcgct ctctagatca 3000 gcagcatctg atttagctca gagacctgac aaggtgggtc atataatgaa taattggaag 3060 cttcgctttt cgggagcctc agactccata agcgtggatt catttatata ccgagtagag 3120 gcccttaccc atcaaacgtt ggaagggaac tttatgctgt tgtgcagcaa tgcaaacact 3180 ctgtttgaag ggaaagcatc tgatttttat tggaggtatc acaagacagc tggctcagtg 3240 cggtggatgg atctgtgtga agcgttgagg aagcagttca gagattcccg gacggacgta 3300 gacattagac agatgattcg cgataggaaa caaagagaaa aagagacttt cgaggtattt 3360 tttgacgcaa tccaaaagct cactgacagg ttagagcagc ctctttctca taaaactttg 3420 gtcgagatag tacgccgaaa cttgcggctg gaaatacaac atgaaatcct gaatcaaaaa 3480 atagactcaa tagaagcact acgggatata tgtcggaggc gagaatgttt tatggaagag 3540 gtgcgtcgga ctcatggcta tcaaaaacct gctccatttc gaaagcaaat ctcggagcta 3600 gtggacgaag ccttgggaga ggatgcggaa gaattctcgg aatcggagtt tggggaggag 3660 atagcagctt tatccttggt gtgttggaat tgtcgaaaag aagggcaccg ttaccacgac 3720 tgcgtagcta aacggaaaat attctgcttt ggctgcggag cgccaaacac gtataagcca 3780 tcttgcaaaa aatgtcaaaa aaacaccaag gagaagacgg cagagtctcg tcggctgacc 3840 agtgttccgc agagggggtc ggcgaaacag gactagatgt agcaagtggc gaaacgcaag 3900 gagagccttc tcgggatttt ggggaggcgg aaccaccgat agtgggatgg catataaatt 3960 tgatagaggg taagaagtta gcacctccag tggaaattgt taagtcaagg agactaacta 4020 gaaatgcacg taggatcaag gcccatttta aatctctcaa atgtattaat gagataagga 4080 aaaaggaggg agataagagg ccttatgccg aagtcaagtt actggataga accgtagtag 4140 gattgctaga taccggcgcc ggaatgagtt gcgtgggtgg tcaattggca aaggaggtag 4200 tatgcagtaa gaaaccattt aaattggtgt cgtccaatgc ttttacggct gatggaaaga 4260 agcatcaaat agtaggaaag ttaacaacta agatagagta ccgcggggag tctaaaatca 4320 tagagctatt cataattcct aatttaagtc aagatttata tttaggtatt gatttttggt 4380 ccctttttga attattaccg tccgaatgga agatttccga aatttgcgag agttctgatg 4440 ttcatacact tacgtctgct caggaggcgg cgttaaacca ggtgatttcc ctatttccat 4500 ccttcgctgt gtcgggtctg ggaaggacag atatgctttg tcataccatc gatgtgggga 4560 atgcgaagcc cgtaaagcag agacatttct cagtctcacc ggctgttgag aaatcgctgt 4620 acggggaagt tgataggatg ctgagattag gggtgattga ggagtcagga agcgcttggt 4680 cttctccggt ggttctggtc caaaagcctg ggaaagctcg tctttgtcta gacagtcgaa 4740 aagtaaacgc ggtgacaaaa aaagatgcgt acccgttgcc ccagattgat ggcatcttga 4800 gtagattgcc caaggcagaa tatgtaacta gcctggactt gaaagatgca ttttggcaaa 4860 tacctttgga ggtggcgtcg agagacaaga ctgcgttcac cataccaggg aggccgttgt 4920 accaatacaa ggtaatgccg tttggtttat gcaacgcctc acaaaccatg tccagactca 4980 tggataaggt aatacccgcc gctctccgaa atcaagtttt tgtctactta gatgacctgc 5040 tggtcatttc agacacattt ccctcgcatc ttgaggtact gcgttcggtg gcagaacaca 5100 tccgtcacgc agggctgacc ataaatatag agaagagcag attctgtagg agatccgtaa 5160 aatatctagg tcacgtaatt ggcgaaggcg gcattaaaac agacccggaa aagatttccg 5220 ccattactaa gtttccgctc ccaagaactc tccgagccct aaggagcttc cttggaatgg 5280 ccgggtggta taggaaattt atacatgatt ttgcttcact atccgcccct ctaacagact 5340 tgctaaatca gaaaaagaaa ttcgtcatgt cccaggatgg aatagaggcc tttgaaaaat 5400 tgaaagaaac gctgtgcaag gctccagttt taagaagtcc tgactttgga aagcccttct 5460 atatacattg cgacgcgagc aatacgggag tagggggtgt actagtccaa aagacggaag 5520 aaggggatga gatgccgata gctttcgtgt ctaaaaagct gaataaagcc cagcgcaatt 5580 atacagtgac cgagctagag tgcctggcgg ccttagtgtg cattaaaaag ttcagagctt 5640 atgtcgaagg tcatcaattc accgtgataa cggatcatgc ctcactgaag tggttgatgt 5700 cccaaaatgt tctgcactcg aggttagctc gatgggcact aaaattgcaa ggtttccgat 5760 ttagcatcga gcaccgaaag ggaacccaga acatagtccc agatgcattg tcaagggttc 5820 accaggatga gatcgcagca atagacaaga gtaatgggct tctggtggat ctagattctc 5880 ctcacttcag agcggtaggg tatcttgaat tggtggagcg ggtaaagagc aaccaggaac 5940 gattgccaga cgtaaaagtg gtcgacaact tagtttaccg aagagctgag catgcgaatg 6000 gcgacttact tcatgactcg tttgcttgga aattatgggt gcctaaggaa ttgatcagcg 6060 agatcttaaa acagtcacat gatgatccgt tagcctcgca tgggggtatt cacaagacct 6120 tggagcgcgt ccgtaggtac tacttttggc caggattggt taacgacgtc aaagcttatg 6180 tgaataattg cagtgagtgt aagaccacaa aggcgccaaa ctttaccttg cggccaccca 6240 tgggaaagcc ggccgaatca caaagattct ttcagcgact atacattgat tttcttggtc 6300 catatcctag gtcacgaagt ggtcacgtcg ctatttttat cgtgttagac cattattcta 6360 aatttgtttt cctcaaagcc gttaagaagc tgacagctga tgtaattatc aaatacttgc 6420 gggaggatct ctttcatact ttcggggtgc ccgaaaccat agtgtccgac aatggatccc 6480 aattcagggc tgaaaaattc caaagccttt taaaaagcca ccgtatctca cataccttga 6540 cagcggtaca cgcgccacag gcaaatgcct ccgagagagt caatcgctca gtgattgccg 6600 ccatcagagc gtatgtgcgt ccggatcaaa aagattggga cgagaattta agcagcattt 6660 gttgcgcgct gagatcgtcc atccactcgg gagtaggcgc tacgccatac tacatggtgt 6720 ttggtcagca gatgatcact tctggtggca cttactcttt gctgagatca ctagagatgt 6780 tggaagacag ggcggcagct tttacaagag aggattcctt ggagttggtc aggaagaaag 6840 cacgtgaagt aatggataag caaaatgcac gtaatgaaaa gcagtataat ctaagatcaa 6900 gagagatagt ataccaggtt gggcaggaag tttatagaag gaattttaaa caaagcaact 6960 tccaagctgg atacaacgct aagcttgggc acgccttcgt gaaggctaga gttcggagaa 7020 agctaggaaa ttcacttta 7039 // ID Sola2-3_NVi repbase; DNA; INV; 4122 BP. XX AC . XX DT 16-FEB-2009 (Rel. 14.02, Created) DT 16-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola2 DNA transposons from Nasonia vitripennis. XX KW Sola; DNA transposon; Transposable Element; Sola2; Sola2-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4122 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(911..1210,1264..3204) FT /product="Sola2-3_NVi_1p" FT /translation="VHSLSSLQDHLVSTFQKYIICRFKDYTYQISIEILST FT VLFRTSLNSPSHDYSQQSFSGLRPTLLFRTFLTSPSHLYSQQSFSGLVSTV FT LLRTSLNSPFQDSHQSSSRLHSTVLLRTSLNSPSQDYSHQSFSRLLSTVLL FT RTSLDSPSQDWSQQSFSRLLSTVLLRTNLISFLNLTISIVLNMSNIRSDRC FT CNPYDLEGHKGKDLRRISIGVKRSFPSFPDNAKICASCRKIKHPRMDEGLF FT DSSSFMNLTLNPHKAEQNIPESPGVSSSSSSRMEANVKSQREIELEDLLNG FT LKAKFSTLDINDPLRLTILTVVPDAWSLNKTSREFNCSRQLAKKARDLKAS FT KGVLADTTAKNGRPLPNNTVVQIDNFYNSDEHSRIMPGIKDVVSVKNDDGR FT HLSQKRLLLSDLRSLYDTYSKLCPEYPVSFSKFAQLRPKHCILAGASGTHS FT VCVCTIHQNCKLMIDSVNLNKLADSDMVLHDYKDCLRQIVCQNSDANCFLG FT ECIKCPGINEFDKHLKELLERENIHHVQFSVWTTTDRATLETQIRSSSEFV FT DELCEKLIKLKPHSFIAKQQSRYYQEKKENLEEREFLVVLDFSENYKYVAQ FT EASQGFHFNNSQCTVFPIVCYYKNGLKIEHKSFIFLSNSTLHDTAAVYTVQ FT KLLDPELKKINSELSKVIYFSDGAKQHFGVRAEWHFHATAHGKGASDVLYS FT KEKLYELVFFVNQMTPLLHSRNLSTGLRKIPKIFMHYLTMTKITTK*" XX SQ Sequence 4122 BP; 1359 A; 780 C; 700 G; 1283 T; 0 other; gaggcattct ttcccaacgt tacacccctg ctgcccactt tttatttcat gtaaaaattt 60 tcaaaaaaat gttaaatttt gtagaaaatt ctcagctttc gaatggcgag tggcaaaatc 120 accagtgtta atcggattat cctgaaaatt caaaaaaacc gtaaattaag gatggaaaat 180 ttattgtaat ggcatcctac taattctgaa cttgctatac gagtattatc caataggtaa 240 cagcctcttt tcactataaa atacctcgac gaaccgttca aatggtctca aattggttct 300 ggaaaaattt agatcctgaa aatagaaaaa aaatgacaaa aagttgttct ttctgtgtag 360 ttatgacaat cttggtgtca gaaagaggta ttagacaacc tttagttaac ttacaaattt 420 gagaacacgt tggaaggttc tcaacatcac tccttctttt gccaaaaagc caaaataata 480 atgcgaagat tcatttaaaa aatagaaaag ataatgacta tttttgtgat gataattcat 540 cagtcggata attttgatac aagaatgaaa ctttgttacc aaaaagtatg atataactaa 600 cgaaatatta ttacacttaa atctcacaca tatcttaatc cccaacgtaa caagacaagt 660 ctaatatttt atcggattac atggacacga ataaaaaatc gtgagtacat gaatgctaaa 720 tcttcaagat gaaatgttca atagaatatc cgggtatgat ttgaatttcc tgattacaca 780 agtctggttg acttaactgg ttctgctctg aaaccatagg tgtatactgc aaaacatcat 840 tctgaataat tttatttgaa attgattgtt gagagatttg gccaatgtga aatcatactt 900 cattttttaa gttcattcgt tgagttcttt gcaggaccat ctcgtcagta ctttccagaa 960 atatataatc tgtcgtttca aggactacac ttatcagatc tctatagaaa ttctgtcaac 1020 agtccttttc aggactagtc tcaacagtcc ttctcacgac tactctcaac agtccttctc 1080 aggactacgc ccaacactcc ttttcaggac ttttctcacg agtccgtctc acctctactc 1140 tcaacagtcc ttctcaggac tagtctcaac agtccttctc aggactagtc tcaacagtcc 1200 ttttcaggac tagtctcaac agtccttttc aggactagtc tcaacagtcc ttctcaggac 1260 tagtctcatc agtcctcctc acgactacac tcaacagtcc ttctcaggac tagtctcaac 1320 agtccttctc aggactactc tcatcagtcc ttttcacgac tactctcaac agtccttctc 1380 aggactagtc tcgacagtcc ttctcaggac tggtctcaac agtccttctc acgactactc 1440 tcaacggtcc ttctcaggac taatctcatt agctttctga atctcacaat ctcaatagtt 1500 ttgaacatgt caaatattcg cagtgatagg tgttgtaatc cctacgattt ggaaggacat 1560 aaaggaaaag atcttcggcg tatttcgatt ggggttaaac gaagttttcc aagctttcca 1620 gataatgcta aaatatgtgc ttcgtgtagg aaaattaaac atccccgaat ggatgaaggt 1680 ttgtttgact catcaagttt tatgaatctg acgctgaacc ctcataaagc tgaacaaaat 1740 attcctgaat ctccaggggt atcctcatca tcttctagtc gaatggaagc taacgtcaag 1800 tcacaaagag aaatcgagtt ggaagatctg ttgaatggat tgaaagcaaa gttttctact 1860 cttgatatca atgatccttt gagattaacg attttgacag tagttccaga tgcatggagt 1920 cttaataaaa ctagtcgaga atttaactgt tcaagacagt tagcgaagaa agctagagat 1980 ttaaaagctt caaaaggcgt tttggcagat acaactgcta agaatggcag gccattgcca 2040 aacaatactg tggtacaaat agataatttt tacaatagtg acgaacatag cagaatcatg 2100 ccgggtataa aagatgtggt ttctgtaaaa aatgatgatg gtagacattt gtcacagaaa 2160 cgccttctcc tttcagattt aagaagtctt tatgatacct acagtaaatt atgtcctgaa 2220 tatccagtta gtttcagtaa atttgctcag ctccgaccta agcattgtat actcgctggt 2280 gcgagtggta cgcactcagt gtgtgtctgt acgatacatc aaaattgcaa attgatgatt 2340 gactctgtta atttgaacaa gctcgcagat tctgatatgg ttttacatga ttacaaagat 2400 tgtttacgtc aaatcgtgtg ccaaaattct gatgcaaatt gttttctcgg tgaatgcata 2460 aagtgtcctg gcataaatga gttcgataag catttgaaag aacttttgga gagggaaaat 2520 atccatcatg tgcaatttag tgtatggaca actactgaca gagcaactct agaaacacaa 2580 atccgttcgt catctgaatt cgttgatgaa ttatgtgaaa aattaatcaa actaaaacct 2640 cattctttca tagcaaaaca acagtctcga tactatcagg agaagaaaga gaatttagaa 2700 gaaagagaat ttctcgtagt gttagacttt tctgaaaatt acaaatatgt tgcacaagag 2760 gcatcgcaag gattccactt caataattcc caatgcacag tctttcctat cgtgtgttac 2820 tacaaaaatg ggttgaaaat tgagcacaaa agtttcattt tcttgtcaaa tagtacactc 2880 catgacacag ctgcagtata tactgtacag aagttgctgg accctgaatt gaaaaaaatc 2940 aattctgagt tgagtaaagt tatctacttt agcgatggag ctaaacagca ttttggtgtc 3000 cgagcagaat ggcactttca tgctactgca cacggtaaag gggcttctga tgtgctgtat 3060 tcaaaagaga agctgtacga gctagtcttc tttgtaaacc aaatgacgcc attattacat 3120 tcgagaaact tatcgactgg gctcagaaaa attccaaaaa tattcatgca ctatcttaca 3180 atgacaaaga tcacaacaaa atgacacgat ttttgaacaa aagatttgat gctgctccac 3240 cagttccaga aattctaaag aaacactgtt tcatacccat taataacaca gaaatgatga 3300 tcaaaagata ctctgaagat ggcgctacaa cgacattttc atatcaagat ttcaaacaat 3360 agagtataat agctcaggag atattagaat cataacatag aaaagatata ttgtgaccaa 3420 cagttttaaa aaatgactac taataacact cagtttcatt aatctatttc aactctcagg 3480 cctgtttctg atgaactaca atgcaacaaa tttatgaatt tgactatgaa tcacatcaat 3540 tcattcatat ttcatagtgg agacatttct gaaaaattta tccgactgat gaattatcat 3600 cacaaaaata gtcattatct tttctatttt tcaaatgaat cttcgcatta ttattttggc 3660 tttttggcaa aaggaggagt gatgttgaga accttccaac gtgttctcaa atttgtaagt 3720 taactaaagg ttgtctaata cctctttctg acaccaagat tgtcataact acacagaaag 3780 aacaacttat tgtcattttt tttctatttt caggatccaa atttttccag aaccaatttg 3840 agaccatttg aacggttcgt cgaggtattt tatagtgaaa agaggctgtt acctattgga 3900 taatactcgt atagcaactt cagaattagt aggatgccat tacaataaat tttccatcct 3960 taatttacgg tttttttgaa ttttcaggat aatccgatta acactggtga ttttgtcact 4020 cgccattcga aagctgagaa ttttctacaa acattttttt gaaaattttt agatgaaata 4080 aaaagtgggc agcaggggtg taacgttggg aaagaatgcc tc 4122 // ID Gypsy-93_AA-LTR repbase; DNA; INV; 191 BP. XX AC supercont1.368; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-93_AA_; KW Gypsy-93_AA-I; Gypsy-93_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-191 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.368; Positions 470247 470057. XX SQ Sequence 191 BP; 41 A; 49 C; 40 G; 61 T; 0 other; tgtggtagaa tgctcaatga atattagttc cttcttgcgc acttcgtatt cacccagctg 60 ctgctggtat gaatgctatg cctaaatcac cgcctcagtt tatccccaac cttctactgg 120 tgtgtttcgc tagtcccttg cagtgcttcg cgcgtggtgt gaaatctata agaatcaagg 180 tcgttaccac a 191 // ID Tx1-8_CQ repbase; DNA; INV; 4715 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4715 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 640-640 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 149..1333 FT /product="Tx1-8_CQ_1p" FT /translation="METGVRKNTLKLCFQQGSKIPSNLEVLKFVTGTLELG FT AADLHSVYKDENDGSFYIKLMDEPTFTEYCGRLEELYMFRYDDDSRSPVTL FT VVASRIFRYVRIFNLPPEIDDKTIAQVLGQFGTIRQHVRERYSPECNLNIF FT NGVRGVHMEIAKEIPAGLFIGHFRARIYYDGLKNRCFFCKQEGHVKSNCPK FT LANSSSGSGGSRSYSTVTAQGRPVNAPYLSVPPMTPVMERLVKAVQEPVPS FT AEGDKSATSPPAGGQGLSGPATPPTEATGTPAVIEEPQVSAETNQQQELEN FT MDTGDEGGHQGVGSGTGDEGGIGDHNRGAGTGKGGGLLGVKRPAEPSSDPE FT ANSSEGPEDSTGQLPFKEVSGGKKNKRSKKGKKDQSKPLSTISTRSLLKPA FT K" FT CDS 1396..4650 FT /product="Tx1-8_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MFLRSICTININAINSLLKKSLLKDFIWNSDIDIVFL FT QEVAFEDFGFLPSHTAIVNISTDNKSTAILVRKNINFDNVLINPNGRLSSI FT LIDDINFINVYAPSGSNYRNERNSFFTFDIIPHLSFTKKNIIVGDFNCILL FT PSDSNSSNKNICSGLQKLVTSLNLHDIEHKINQKTNFTFIRGDSKSRLDRF FT YAPLELINNVRSINTSAVAFSDHHAVILKYKVPDNIPLQTRGRGYWKINPS FT LIMNEEITADFKTLFENWKKQNMYINNFNFWWNDSTKSRIMKYFKSQSFLF FT NQQIHREKNFFYTCLNEIIEKQSVGENVYNELCLVRSKLLEIEQNRLNNLS FT LKTSSGSILTDERLSLFQISSRIKNQTENYSFKLRINGEITNEHCRLKQFI FT TSHFNSIFAADNSDQNNDNVLSHVHKKLSVQDQMSLSKPIDKDELIFVLKE FT VAKKKTPGPDGLTYEFYSHFSDILLDDLVILFNSYLVDGVNPSRDFKKGII FT SLIPKKGCDQYEISNKRPISLLNSDYKLFTKILANRLKPLMKKIIGPGQTA FT CNIDKSCVDNLKTLRNIIIRAGQTKRFKGLLLSLDLEKAFDKVNHSFLWDV FT LKKFDFPEPFINCLKNLYKEATSKILFNGFLTDSIEIKSSVRQGCPLSMAL FT FVLYIEPLIRLISDNVRGCFIADTFIKVIAYADDLNIFVLNDHEFDTVLEM FT INYFSIYSKIKLNVNKSHFMRLNNCVSGPRLIPEKDALKILGVYFENSFSR FT TVKLNYDSVIQKLKHCLKLHSSRHLNLIQKIWILNTFILSKIWYLAQVFPP FT NKVHIAQIYSSCSMFLWNNHFFTVTRNQMFLDVSDGGLGLVDVDLKSKALF FT IKNILFSYPDADDPMDRFMMAQCQNTSLTKATREWLSEVSRIALYQDLITT FT KSIYRFLQSEKIFVPKVVQENPDLDWKLIWENISSNFLTSNSREALFLVMN FT DVICTKSKQFRNNVRGVENNICDFCNSIDTTEHRIKYCTGSVIVWIWLKNV FT LNVKLKIMIDDPEEILATDIDSKDSRLKAALFLIGEVISFNLKHYGTPSLG FT KFEQSLRDTRWNNKKLVKTQFCNFLNIL" XX SQ Sequence 4715 BP; 1563 A; 825 C; 933 G; 1393 T; 1 other; ccgaactcat tctcggacga tacgttgagc cgaacagtcg atagtggcaa acgctcccga 60 aaaaaaaaag caaatttctc gcgcgcggcg cggccgtggt agtagtgtgt ggtggattag 120 cggaatctat tgttaaatag ataaagaaat ggaaaccgga gttcgaaaga acacgctgaa 180 gctctgcttc cagcaaggct cgaagatacc ctcgaatctc gaggttctga agtttgttac 240 cggaacactg gaactgggag cggccgatct ccattccgtc tacaaagacg agaatgatgg 300 ttcgttctac atcaaactca tggatgaacc aacctttacc gagtactgtg gtaggctcga 360 ggagctgtac atgttcaggt acgacgacga ttcaaggtca cctgttaccc tcgtggtggc 420 cagtcgaatc ttccgctacg tacgcatttt caatttgccg ccggaaatag acgacaagac 480 catcgcgcaa gttctcgggc agtttggcac catccgtcaa cacgtgcgag agcgttactc 540 gcccgaatgt aacctcaaca tcttcaatgg cgtccgtggc gtgcatatgg agattgcgaa 600 ggagatcccg gccggcctgt tcatcggaca cttccgggcc aggatatact acgacggact 660 caagaatcga tgcttctttt gcaagcagga gggacatgtg aaatcgaatt gtcccaaact 720 ggccaacagt tcgtcaggca gcggcggatc acggtcctat agcaccgtga ccgcccaggg 780 gaggcctgtt aacgcaccgt acctcagcgt gccgccgatg acacccgtca tggaacgact 840 cgtgaaagca gttcaggaac cagtcccaag cgctgaagga gacaaatcgg cgaccagccc 900 accggcagga ggacaaggtt tgtccggccc ggcgactccg ccgacggaag caacaggtac 960 accagcagtc atcgaagagc cacaagtaag tgcggaaact aaccagcagc aggagttgga 1020 gaacatggac actggggatg aggggggcca tcagggtgtg ggttcgggta ccggggatga 1080 ggggggtatt ggggaccaca accggggtgc gggtactggg aaaggggggg gtctactggg 1140 tgttaaacgc ccggcggagc cgtcatcgga tccagaggcc aacagcagcg agggaccaga 1200 ggacagcacc ggacagcttc cgttcaagga agtcagcgga ggaaagaaaa acaaaagatc 1260 caaaaaaggt aaaaaagatc agtcaaaacc tctctcaacg atctccacgc ggtcgttgtt 1320 gaaaccagcg aagtaagcgt ccataacctc aaccaagcac gcgttgtctc gcggagtagg 1380 tagtagtgta ggagaatgtt tcttcgtagt atatgtacca tcaatattaa cgcaatcaat 1440 tctttgctaa aaaagtctct cttgaaagat tttatttgga attcagatat cgatattgtt 1500 ttcttacaag aagttgcctt tgaagatttc ggatttcttc cttctcacac agcgatcgta 1560 aacattagca ctgataacaa aagcacagcc atcctcgtta ggaaaaatat aaattttgac 1620 aatgttctca tcaatccgaa tggtcggtta tcttctattt taatcgatga tattaatttt 1680 attaatgtct acgcaccttc tgggtccaat tatcgtaatg aacggaactc tttttttact 1740 tttgatataa tccctcatct atcgtttaca aaaaagaaca tcattgtggg tgattttaat 1800 tgcatcctgt taccatctga ctcgaacagt tcaaacaaaa acatctgctc tggattgcaa 1860 aaattagtaa catctttgaa tttacatgac atagaacata aaattaatca aaaaactaat 1920 ttcactttta taaggggtga ttctaaatcg agacttgatc ggttttatgc acctttagaa 1980 ttaataaata atgttagatc gatcaatact tctgctgtgg cattttctga tcaccacgca 2040 gtaattttga aatataaagt tccagataat attcctttac aaacacgagg tagaggttat 2100 tggaaaataa atcctagttt aattatgaat gaagagatta cagctgattt caaaactttg 2160 tttgagaatt ggaaaaagca aaacatgtac ataaataatt ttaatttttg gtggaacgat 2220 tcaacgaaat ctagaataat gaaatacttc aaaagtcaaa gttttctttt taatcagcaa 2280 atacatagag aaaaaaactt tttttacact tgtttaaatg aaataattga aaaacaatcc 2340 gtcggagaaa atgtatataa tgaattatgc ttagttagat caaaattatt agaaattgaa 2400 caaaatagat taaataattt aagtttaaaa acaagttctg gatctatttt aactgatgaa 2460 cgacttagtt tatttcaaat ttcatctaga attaagaatc aaactgaaaa ttatagtttt 2520 aagcttagaa ttaacggcga gattacgaat gaacattgca gattaaaaca atttattact 2580 tcacatttca attcaatttt tgcagctgat aacagtgatc aaaataatga taatgttctc 2640 agtcatgttc acaaaaaact ttcagtccaa gatcaaatga gtctttctaa accgattgat 2700 aaagatgaat taatttttgt tctcaaagaa gtcgctaaaa agaaaacgcc agggccagat 2760 ggtcttacat atgaatttta ctctcacttt tcagatattt tattagatga tttagttatt 2820 ttatttaata gttatcttgt agatggcgtc aatccatcta gagatttcaa aaaaggtata 2880 atatctttga ttccaaaaaa aggttgtgac caatacgaaa tttctaataa gcgtccaatc 2940 agcttgttaa atagtgatta taagcttttt acaaaaattt tagcaaatcg tttaaaacca 3000 ttgatgaaaa agataattgg gcccggtcaa actgcatgta acatagataa atcatgtgtt 3060 gataatctta aaactcttcg taacatcata attcgtgcag gacaaaccaa aaggtttaaa 3120 ggacttttgc tgagtttgga tttagaaaaa gctttcgata aagtaaacca ttctttcttg 3180 tgggacgtgc taaagaaatt tgattttccg gaaccattca tcaactgttt aaaaaattta 3240 tacaaagaag cgacatcgaa aatactcttt aatggttttt taaccgactc catcgaaatt 3300 aaatcatccg tacgtcaagg ttgtccatta agtatggccc ttttcgtatt gtacatcgag 3360 ccattgatac gtttgattag tgataatgtt agaggttgtt tcattgctga tacctttatt 3420 aaggtaatag catatgcaga tgatctcaac attttcgttc tcaatgatca tgagttcgat 3480 acggtactgg agatgatcaa ttattttagt atatactcta aaataaaact gaacgttaac 3540 aaatcgcact ttatgcggtt aaataactgt gtctccggtc cgcgtttaat cccggaaaag 3600 gacgcattaa aaatactagg agtttatttt gaaaattctt tttctcgaac tgttaaatta 3660 aattatgatt ctgttataca aaaattaaaa cattgtttaa aattgcattc ttcgcgtcat 3720 ttgaatttga tacaaaaaat ttggatatta aatacattta tactttcaaa gatttggtat 3780 ttggctcaag tttttccgcc aaataaagtg catattgctc aaatatattc aagttgttcg 3840 atgtttctat ggaataacca cttttttact gtaacgcgta atcagatgtt cttagatgta 3900 agtgatggag gtttaggatt agtagatgta gatttgaaat cgaaagcttt atttataaaa 3960 aatattttat tctcttatcc tgatgctgat gaccctatgg accgtttcat gatggcacaa 4020 tgtcaaaata cttctctaac aaaagcaaca cgagaatggc tgagcgaagt ttcgcgtatt 4080 gctctatatc aagatttaat aacaactaag tcaatttata gatttctaca atcagaaaaa 4140 atatttgtac caaaagtggt tcaagaaaac cctgatcttg attggaagct tatttgggaa 4200 aacattagta gtaatttttt gacatcaaat tcaagagagg ctcttttctt agtaatgaat 4260 gacgtcattt gtacaaaatc taagcagttt agaaataacg ttagaggtgt agaaaacaat 4320 atttgtgatt tttgtaattc tatcgatact acggagcata ggataaaata ttgtacagga 4380 tcagtaattg tttggatttg gctaaagaat gtactcaatg taaaattaaa aattatgatc 4440 gatgatcctg aagaaatatt agcaacggat attgatagta aggatagtag attgaaggca 4500 gcactttttc taataggcga ggtaattagt tttaacttaa agcattatgg tacaccaagt 4560 ttaggtaaat ttgagcaatc tttgcgagat acacgatgga acaacaaaaa acttgttaaa 4620 acacaatttt gtaattttct aaacatactt tgaggttatg gacagaktgt atattttgga 4680 ttttcaataa agaacagtta atttgaaaaa aaaaa 4715 // ID Copia-9_DPu-I repbase; DNA; INV; 5221 BP. XX AC scaffold_317; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_DPu_; KW Copia-9_DPu-LTR; Copia-9_DPu-I. XX NM Copia-9_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 681-681 (2010). XX DR Genome; scaffold_317; Positions 81125 86345. XX CC Positions [2171-2764] - Integrase core CC 'CCCAG' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1092..2027 FT /product="Copia-9_DPu-I_2p" FT /translation="MAEALLSSKKIDPAQPSTSELARDGKEGEGRSTRTCW FT ECGDTNHIRANCRDYKRKLKRKRDEEEEDRDKKRRRWEREDDDRSRQDKDN FT NQWRNRRERDDRERRGRDYGNNNDERRRRDQKDSKKENNGRRRGYSYSAAT FT SDSRSDKPTEWFADSGATQHMTGNRDLLVNFVPTGSECWMVNGIGESKLAV FT AGQGDVNVVATANGRNVKGTMRGVLFVPGLMINHSSIQSSFIYTVIYPGLG FT INLYSIGSATDEGIEVHFANNTVTFAHKKVIIMEGKRLGKEALYHLNIRAE FT QYYPRTEQALKGARVEPLSI" FT CDS 2090..5059 FT /product="Copia-9_DPu-I_1p" FT /translation="MGSVKGLALFKDQLNPSDHCRGCLQGKMCRAPFMSTR FT IKTTSIGQVIHSDVCGPMQVGTQNGERYFVAFKDDFSGWIEITLLKYKSEV FT PDTFKKFKAKLETETGEKARSFVSEYHIPPLIFSHLFQQVKILRSDGGGEY FT RGNEFKDWLASTGITHQVTPPYTPQLNGVAERANRTIVESARSQMYGRKVP FT MELWGLAMQCAAYVKNRTVSSVNNITPFELWFKKRPDISHLKVFGCLVFIH FT VPDEKRKKLDPKAVEAMMVGYVEGSTSCYQIWDPRARKLVISRDVTFEEQS FT MMELSEIQTATEEKDYYSILPTADDHKHGTGANRMDQEEVAPRAEQPVHQG FT EEEVRPIDQPQEDETTQTQGEGNHHQDEQSDELSYGADPDQDEQIDEPFYG FT FDLAMGGVRRSERIRQARRMPTPADKMRQLRGYPANVLTDENESVGPSTKV FT NFHVAFILNHLQSFFSATKVYGKQELTYRQALSSPDAAKWKGAIKDEYLSL FT IKNGTWTLAPLPPNRKAIKCKWVFDIKQGYEGVDERYKARLVALGCSQIPG FT LDFDQTFAPVVRLSTFRLFLAMAAAGNLEILQIDVKTAFLYGRLEEEIYMH FT QPEGFAVPGREKEVCRLIKSLYGLKQAPRVWNTELNDAILQYGLIRSEEDQ FT CVYYRLQGEEWVAVLFFVDDGFICGTSKEIVKKFSDHLKRKFEIRTLPAGR FT FLGMTIRRDRENGKLSISQPDFVDAVLKKVKMEGCNPVGTPAEPGLPLTAS FT MSPQQQKEKEEMKNIPYKEAVGALLYLSTTTRPDISYAVGQVAKFSQNPGI FT QHWKAVKRIIRYIAGTRTYGILYSQEKGEMVTGYTDADYGGDLEDRTSTSG FT CVFLCSSGSISWFSRKQECTSLSTTESEFVAASEAAKEATWIRSLLKEIQR FT RGQEPIPLLCDNQGAIRCAHNPELHRKMKHIDIRFRFVKRAQESGVIDARY FT VNSQDQIADILTKPLPLPKFQYLRKKLGIIEVNTM" XX SQ Sequence 5221 BP; 1630 A; 1159 C; 1315 G; 1117 T; 0 other; ggttatgggc ccagttatgt agaaactcaa gttgcataat ttttttttgt tgttgtatct 60 cctttaccca catccagtga aaaatggctg agacacgcca agatggtaga ccctcctaca 120 cgggagtgcg gttcgacgga accaactttg actcatggca gtttggtgtt aagctagcca 180 tacaaagtga agagctctgg aggattgtca acggaactga actcaggcca gagcgaatac 240 cagtaagtca acaccaatta atcttgtgtg taccatatac ttttattcta tagagtagca 300 cacacacgtg gagtgatgtg tgtgtggtga tatattgatg ggaatgtgaa atgggcgtat 360 gccgtacaaa acgaggagtt tgagctggcc atggacttct gccctccaca cgaggtgtgg 420 gagatgaggt attgcgtttc ctcacacagg aggtgtgaga tgaagaaatg tttttcctca 480 cacaggatat gtgagatgaa gaagtgtttt tccacacgtg atgtgtgaga tgagaaagcg 540 ctgtaccaca ctgggagtgt gagataagaa aagtttttcc acactgtgag tgtgatggcc 600 acattactta agcacctttt tctctctccc ccacagaatc cagaagatgt aaacgtgatc 660 caaaatgcag cagccatcag tgactgggag aaaaggaaca ccttggccat gcgaatcatt 720 tattcgtcta tccaggagga gagatcacgg ccgttgatgt catgtacgtt agcacaggaa 780 atgtggacaa agatggaaac aatgtggaca gaatttgcag ccgacctcgc ccctctgctg 840 tggagtcagt tttacggcgt caagtttcta cctggacaaa cagtcatgga gtttatgtca 900 gaggtcgaac acattgtctc ccgcctgagg gcaattgacg gaatagttct tctagacaat 960 caaattattg ccaaaatcac gatgtccctg ccaccaaaaa tgaagctcat cttcaaacca 1020 gcctgggaaa gtacggccgc tgctgagagg acgttgagaa atcttacgtc aagcagtgtg 1080 aggaagagaa gatggctgaa gcactcctta gcagcaagaa gatagatccg gctcaaccat 1140 caacatctga acttgctcga gatggcaaag aaggagaagg gagaagtacc cgtacgtgct 1200 gggagtgtgg agacacaaat catatcagag ccaactgccg ggactacaaa aggaagctaa 1260 agagaaaaag agatgaagaa gaagaagatc gtgacaagaa aaggagaagg tgggaaagag 1320 aagacgatga ccgttccaga caagacaaag ataataatca atggagaaac agacgtgagc 1380 gcgacgatcg tgagagaaga ggtcgtgatt atggcaacaa caacgacgag aggagacggc 1440 gtgaccagaa ggacagcaag aaagaaaaca atggaaggcg gagaggatac agctactcag 1500 cagctacatc agacagtcga tcagacaagc ccacggaatg gtttgctgat tccggagcaa 1560 ctcaacacat gacgggcaat cgggatcttc tagtcaactt cgtaccaacc ggatcagagt 1620 gctggatggt aaacgggatt ggggaatcaa aactagccgt tgctggacaa ggagatgtga 1680 acgtggtggc aactgcaaat ggaaggaatg ttaaaggaac catgagaggt gtcctgtttg 1740 tgccaggtct gatgatcaac cattcttcaa ttcaatcgtc atttatttat actgtcattt 1800 atccaggtct ggggatcaac ctttactcaa ttggatcagc cacagatgaa gggatcgaag 1860 tacattttgc caacaacacc gttactttcg ctcataagaa ggtcatcatc atggagggaa 1920 agcggttagg aaaggaagcc ctctatcatc taaatatcag agcagaacag tactatccga 1980 ggaccgagca ggcgctaaaa ggcgcccgag tggagccact gtccatctga caccaaaggc 2040 ttggacacct caacaacaaa tctatactca agatggcctc catgggaaga tgggaagtgt 2100 gaagggtctc gcactcttca aggaccaact taatccatcc gatcactgcc gagggtgcct 2160 tcaaggcaag atgtgcagag ctccgttcat gtcaacacgc atcaaaacta caagtatagg 2220 acaagtcatc cactcagacg tgtgcggccc tatgcaagtg ggcacgcaaa acggagagcg 2280 ctatttcgta gccttcaaag atgacttctc tggatggatt gaaatcacac tactaaagta 2340 caagtctgaa gtcccagaca ccttcaagaa gttcaaagca aaactagaga cagagacggg 2400 cgagaaggca aggtcttttg tgtccgagta ccacatacct cctcttattt tttctcatct 2460 tttccaacag gtaaaaattt taagatcgga cggaggagga gaatacagag gaaatgagtt 2520 caaggactgg ttggcaagta ccggaatcac tcatcaagtt accccgccgt acactcctca 2580 actaaatgga gtagccgagc gagcaaacag aacgatcgtc gagtcggcca ggagccaaat 2640 gtatggaaga aaagtaccca tggagctgtg gggcctagca atgcagtgtg cggcatacgt 2700 gaagaaccgg acagtatcaa gcgtcaacaa tataactcca tttgagctct ggttcaagaa 2760 gagaccggac atatcccact tgaaggtatt cggctgccta gtattcatcc atgttcctga 2820 cgagaagaga aagaagctag acccaaaggc tgtagaggcc atgatggttg gatacgtcga 2880 aggatcaaca tcatgctatc agatatggga tccaagagca agaaagctgg tgattagtag 2940 ggatgttacg ttcgaggagc agtccatgat ggagttatct gaaatccaaa cagctactga 3000 agagaaggac tattattcaa ttctcccgac agctgacgat cacaaacatg gaacgggagc 3060 taatcggatg gatcaagagg aagtggcgcc gcgagctgag caacctgtcc atcaaggaga 3120 agaggaagtt cggcctatag atcaacctca ggaggatgaa actacacaaa ctcaaggaga 3180 ggggaaccat caccaagacg agcagagtga tgaactgtca tatggtgctg atccagatca 3240 agacgaacaa attgatgaac cgttctatgg ctttgatcta gctatgggcg gggtccgtcg 3300 atcggagaga atccgtcaag caagaagaat gccaacacca gccgataaga tgagacaact 3360 ccgtggatac ccagcgaacg tactaacaga tgaaaatgaa tctgttggcc cttcaacgaa 3420 ggtaaacttc catgttgcat ttatcctaaa tcatcttcaa tcattctttt ctgcaacgaa 3480 ggtctatggc aaacaagaac ttacctaccg ccaagctctt tcatcgccag atgcagcaaa 3540 gtggaagggg gcaatcaaag acgaatactt atccctaatc aagaacggaa catggacact 3600 tgccccgctc ccccccaatc gtaaagcaat aaagtgcaag tgggtatttg acataaaaca 3660 gggctacgaa ggagttgatg agagatacaa ggcgcgcctg gtagcactag gatgctcgca 3720 aatacctgga cttgatttcg accaaacttt cgccccagta gtgagactat caacattccg 3780 tctattcctg gctatggctg ccgcaggaaa cctagaaata ctgcaaatag atgtgaagac 3840 ggcctttctg tatggacgac tggaagagga gatctacatg caccagccgg aaggttttgc 3900 tgtaccagga cgtgagaagg aagtttgccg cctaataaaa agcctctatg ggctaaaaca 3960 ggccccacgg gtctggaaca ctgaattgaa cgacgccatc ctgcaatacg ggcttatcag 4020 atcggaagag gaccagtgtg tgtactaccg ccttcaaggg gaggaatggg tagcagtact 4080 tttcttcgtt gatgatggtt tcatctgcgg cacgtccaaa gaaatcgtga agaagttctc 4140 tgaccacctc aaaagaaaat ttgagatccg cactctaccc gcaggacgtt tcctaggaat 4200 gacgatcaga cgagacagag aaaatggaaa gctgagcatc tcacaacccg acttcgtaga 4260 cgccgtacta aaaaaggtta agatggaagg atgcaacccc gttggaacgc cagccgaacc 4320 aggcctgcca ctcacggcaa gcatgtcacc acaacaacaa aaagaaaagg aagaaatgaa 4380 aaatattcca tacaaggagg cagtaggtgc gctactctat ctatccacaa ccacaaggcc 4440 cgatatttcc tatgccgtgg gccaggtggc taagttcagt cagaaccccg ggatacaaca 4500 ctggaaggcg gtaaaaagaa tcatccgata catagctggc actcgaacct acggcatcct 4560 ctactcccaa gagaaaggcg agatggtgac aggctatacg gatgcagact acggtgggga 4620 cctagaagac aggacctcaa catctggatg tgtattccta tgcagcagcg gatcaatctc 4680 ctggtttagc cgcaaacagg aatgcacctc actgtctaca acggagtcgg agttcgtagc 4740 ggcgagtgaa gcagccaaag aagccacttg gattaggtca ctactcaagg agatccagag 4800 aagaggacag gaaccgatcc cgctgttgtg cgacaaccaa ggtgccattc ggtgcgcgca 4860 caaccctgag ctacacagaa aaatgaaaca cattgacatc cgattccggt ttgtcaaacg 4920 agcacaggag agcggggtga ttgatgcccg gtatgtcaat tcacaagacc aaatagccga 4980 cattctcacg aagcctctgc cacttccaaa attccaatac ctcaggaaga aactgggaat 5040 aattgaagtg aatactatgt aagagcaaaa gtgtatgcga ctgtcgccaa attcctttca 5100 cttattttgt tggtgagaac cccgacaatc ccttcctgta tttgatacag taaagttgtt 5160 tatttccctc taaattttat atgccccagt attgaatcca atgtctcagt ttgaggggag 5220 g 5221 // ID BEL-49_CQ-I repbase; DNA; INV; 3658 BP. XX AC AAWU01014131; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-49_CQ_; KW BEL-49_CQ-LTR; BEL-49_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3658 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 251-251 (2011). XX DR GenBank; AAWU01014131; Positions 51491 47834. XX CC 'GATTG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 673..3420 FT /product="BEL-49_CQ-I_1p" FT /translation="MTTKADKKLADLLQKRERLVATREVVEKFVAAYDHEK FT HAIQIPVRLETLNRTYKEFLSVQSEIEQLDKPEAFKAHLAVRCEFEDTFCE FT AKGFLLSKADHHTAHADTSLNSSFTHSQGPSTFHHRLPKIDLPKFNGDESR FT WISFRDNFVSMIHINEDIPVVNKLQYLLQSLEDPARKQFETVEIQADKYAP FT TWEELLKKYDNKRSLRKTLFRRLYELPSMEGESAQHLNTLVDDFQRHVKAL FT EKLGEPINQWDTPLVFILSDKLDYATLRAWEQDSSKKEVVTYEELIEFLNN FT HVRMLKSFASDLQHRQTGNVKVAGASQKKSQSIKFIANAATSEIRTYTPQC FT PSCSKQHLLYECPAFTKLAVIQRRELVTQRSLCWNCFRQNHQARACRSKFS FT CRICQAKHHTLLHDPGAPDHTQDAAVASNSTAPQHNPPTTIGTFGLANSSE FT TSLPIVAKQDTVLLETVNLLITDHHGRELLVRALLDSASMSNFVSNNLADL FT LNIRRKNVDVTVAGLGESAKKIDQQITTTVKSLTSQFSTTLDFLIMKPPMV FT NLPRSAVDIAHWNMPDVRLADPHFHRPGAIDILIGGEFYHELHTGQRKPIG FT DGLPLLVETHFGWTVTGKIPTKSDGASTLCCFSTTARTLLVAPQPILELET FT AKPNASHPTKQSSSTPTQTHNGRYVALLPRSKNPQVTHGNSRESATHRVST FT HASRRKKVPSTKPAKKARLSHMRKDNNTKPHCCFGYPLASSFMQADLRSCF FT GQTSPAVNGSGNEMIERQTEMQLTDKSPMRTCRAVPSKSLTFFNQPHLRKL FT AEAESCPAPRQKKFYKQHVYKGEVSTAGEGLQKAEFAESSKIGRAVRNMLN FT AKLECEQVSTVPFAPSPPVRSTGSFAVPDDARESSQPPPRAAETPKTGRAL FT RNCSMRIRNANW" XX SQ Sequence 3658 BP; 1041 A; 975 C; 871 G; 771 T; 0 other; tttggtcctt cgaaccggat aggaggacga cgtcgaacct gggacctgta gtgcggacgg 60 ttagcgctgt agcggtgttg gtaagtggtc aatggaactt gaccacaaaa ggtgaggcac 120 tcgatctagt actactaacg ttgacgggtt cccacatttg cactaatttg gctggttgca 180 aatggtggac aaattgcatg acaaaatcgc atgtttggca aagtgaaagc ataaggcaaa 240 agagcataat tctacgaatt agcgacaaac gcagctcaaa ttctacgcgc tgtgctcttg 300 attcacttag agaggcacag agagcgaaaa gtgggcaagt aaagcaaaag tgggcaagta 360 ctcaacgaat gcacttacag tgacgcaaac aaaagtgggc acgccaagtt ttggtggttg 420 tagaaatttc aacaaaaaat ctgtctctag gaggaaactc gagactttgc gtttgggtgg 480 tggcgtttag cggattcgca gatcgcgaaa agggcacacc acagctggaa acacgtttcg 540 atttgtgtcc ctttcggtcg ttttcgttcg cgtacgcgta gagtaccacg cacgcgcgac 600 ggtaagtggt attccgccgc aatcgtttcg cgtttcgttt cgtttggtga gtgagtagag 660 tggagtgttg agatgacaac caaggcagat aaaaagctgg ctgatctgct gcagaagcga 720 gagcgacttg tggctacacg agaggtggtt gagaaatttg tggctgcata cgatcacgaa 780 aagcatgcta tccaaatacc ggtgcgacta gagacgttga acaggaccta caaggaattc 840 ctttccgtgc agtcagaaat cgagcagctg gacaaaccgg aagccttcaa agcacatcta 900 gcggtgcggt gcgaattcga ggatactttt tgtgaggcca aaggtttcct gctctccaag 960 gcggatcacc acacagcgca tgcggatact tcgttgaact ccagctttac tcattcacaa 1020 ggaccgtcaa cattccatca caggctgccc aagattgatc tacccaagtt caatggagac 1080 gaatcaagat ggatttcatt ccgagacaac tttgtttcca tgattcacat caacgaggac 1140 atccccgtag tcaacaagct acaatacctg ctacaatcgt tagaggatcc cgccagaaaa 1200 caattcgaga ctgttgagat tcaagcagat aagtacgcgc cgacctggga ggaactctta 1260 aagaagtacg acaacaaacg ctcgctgaga aaaaccctat tccgaagact gtacgagttg 1320 ccatcaatgg aaggcgaatc agcacaacat ttgaacacgt tggttgacga cttccagcga 1380 catgtcaaag ccctggagaa gcttggagag ccgatcaacc agtgggatac accgctcgta 1440 tttattctca gcgacaaact ggattacgct accctacgtg cttgggaaca ggactcgagc 1500 aagaaggagg tggtgaccta cgaggagttg atagagttcc tgaacaacca cgttcgcatg 1560 ctgaagtctt tcgcaagtga cttgcagcac cgtcaaaccg gcaatgtcaa ggtggccggc 1620 gctagtcaga agaaatccca atcaattaag ttcatcgcaa acgcagccac atcggaaata 1680 aggacgtaca caccgcagtg tccgtcatgc tcgaagcagc acctgttata cgaatgtcca 1740 gcatttacta agcttgcagt gatccagcgt cgcgagctgg tcacccaacg aagcctgtgc 1800 tggaattgct tccgccaaaa tcaccaagcg cgtgcctgca gatcgaagtt ctcctgcaga 1860 atatgccaag cgaagcacca cacgctgctg cacgacccag gtgcaccgga ccacactcaa 1920 gacgcagctg tggcttccaa ttccacagct cctcaacaca acccgcctac aaccattgga 1980 acctttggat tagcaaattc ctctgaaacc agccttccaa tcgtagcgaa gcaagatacg 2040 gttcttttgg aaacagtaaa ccttctaatc accgatcacc acgggagaga actcctagtc 2100 agagcgcttc tagattcggc atcgatgtcc aacttcgtct cgaacaacct ggcggacttg 2160 ctcaacattc gcagaaagaa cgtggatgtc accgtagctg gactaggaga atctgccaag 2220 aagattgacc aacaaatcac caccacagtc aagtcgctaa caagccaatt ctccaccacc 2280 ctcgatttct tgatcatgaa accgcccatg gtcaacctcc caagaagcgc agtagacatc 2340 gctcattgga acatgcccga tgtccgacta gctgatcctc acttccacag gccaggagcg 2400 atcgacatcc tgattggtgg agaattctac catgaacttc acaccggaca acgcaagcca 2460 atcggtgacg gacttccgtt acttgtcgaa actcatttcg gatggacagt aactggcaag 2520 attcccacca agtccgacgg agcgtccacg ttgtgctgct tctccaccac cgctcgaact 2580 ttactagtag ctcctcaacc cattttggag ctcgaaactg ccaaaccaaa cgcaagccat 2640 cctaccaaac aaagttcctc cactcccacg caaactcaca acggaagata cgtcgcactt 2700 cttcctcggt cgaaaaatcc gcaagtcacc cacggcaact cccgcgagag tgccacgcat 2760 cgcgtttcta cccacgcaag ccgccgtaaa aaagtcccat ctacgaaacc cgccaagaaa 2820 gctcgcctca gccacatgcg taaagacaac aataccaagc cccactgttg tttcggatac 2880 ccgctagcca gctcgttcat gcaagcagat ctacgatcgt gcttcggtca aacatctcca 2940 gccgtcaacg gttcaggaaa cgagatgatc gaacggcaga ccgaaatgca gctgaccgac 3000 aaatcgccaa tgcgcacatg ccgtgccgtt ccgtcaaaat cgctcacgtt cttcaatcaa 3060 ccacacctcc gcaagttggc cgaagctgaa agctgtccgg cgccaaggca gaagaagttc 3120 tacaagcagc acgtctacaa aggagaagta tcaacggcag gagaaggctt gcagaaagcc 3180 gaatttgcag aatcttcaaa gatcggtcga gctgtccgaa acatgctcaa tgcgaaattg 3240 gaatgtgaac aggtgagtac cgttccattc gctccatctc cacccgttcg ttctaccggg 3300 tcttttgctg ttccagatga tgcacgtgag tcaagtcaac caccaccaag agctgcagaa 3360 actccaaaga ccggtagagc actccgaaac tgctcgatgc gcataaggaa tgcgaactgg 3420 tgagcaccgt tccattgact ctaacccaaa cgttcggtct accgggtctt tgttgtttca 3480 gatgatgcca cgtgattcaa gcaggtcggt tcgtcaataa tgaaccaccc acgtcctgaa 3540 ggaatgtcat caatggatcg tgtcatggcg acacgaagtc ttcccgaacg ttgccgaaga 3600 taaggagaga ggataagaga gttttgaaac agccacggct gtctcaaggt ggccggaa 3658 // ID R2E_NLo repbase; DNA; INV; 1550 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Nasonia DE longicornisi. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2E_NLo. XX OS Nasonia longicornis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1550 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 3..1433 FT /product="R2E_NLo_1p" FT /note="carboxyl terminal end of ORF." FT /translation="KFSXVDSRSQSXRRRXYLXKCGLXVNAXKCMTVSMLT FT VPREKKTVIDPKMKFRCLGRTLPAVSRSSEWKYLGIPFTPEGKWTGNPLQS FT LMTSTEILTKAPLRPQQRLFGLRCVVIPSLYHLLVLGSANISLLRKMDKRI FT RTIVKKWLDLPNDTPSAYFHAAVQHGGLGIPSLRWMVPVQRLKRLKNLPYY FT RPDQPEYYHLSKEIKTTLNRLKDKNVQLVTRLDVDKRMAELLYKAVDGSAL FT KGSNAVPSQHQWVREGTNFLKGTDFIGLCKLRINALPLRCRTARGRPKERL FT CRAGCHNQETLNHVLQHCPRTHEMRIKRHDAVVNYIAKSLNEQGFEVDCEP FT HIQGPAGLRKPDIVVKKEDAAIVVDGIIAAEQADXKRVHNEKRKKYXDLKP FT IIKEKYGVEKVEFTXATLXARGLWSSDXAXDLXRFSVIKSKDLKVISTRVL FT VGGMAIFHDFNRRTSRSXGRRRGQEPPRQGEG" XX SQ Sequence 1550 BP; 496 A; 295 C; 419 G; 321 T; 19 other; gcaagttctc ctnagtggac tccaggagtc aatcancaag gcggcggnag tacctatnna 60 aatgtggttt ganagtgaat gctgnaaagt gcatgacggt gtccatgctt acagtaccaa 120 gagagaagaa gacggtgata gatccnaaga tgaagtttag atgtctggga aggacccttc 180 cggcagttag caggtcgagt gaatggaaat atttaggaat cccgttcacg ccggagggta 240 aatggacagg taatcctctc cagagcttaa tgacatcgac ggagatactc acgaaggcgc 300 ctctgaggcc tcagcaacgg ttgtttggct taaggtgtgt cgtcatacct agtttatacc 360 atcttttagt cctagggagt gcaaatatca gcctacttcg gaaaatggat aagagaatta 420 ggacgattgt aaagaagtgg cttgatctcc caaacgacac gccttctgcc tacttccatg 480 ctgcagttca gcatggtgga cttggaatac cttccctcag atggatggtc ccggtgcaga 540 ggttgaagag gctcaagaac ctgccatact acagaccaga tcagccagaa tattatcact 600 tatctaagga aattaaaact accctcaaca ggctaaagga taagaacgtc caactcgtga 660 ccaggctaga cgtggacaaa agaatggctg aacttctgta caaagcagtg gatggaagtg 720 cactgaaggg ctctaatgcg gttccatcac aacatcagtg ggtccgagaa gggactaact 780 ttctgaaggg gacagatttt ataggtctgt gtaaactgag aatcaacgct ctaccactta 840 gatgtaggac ggccagaggc aggccaaaag aacggctctg cagagcaggc tgccataacc 900 aggaaacact gaatcatgtt ttgcagcact gcccgaggac gcacgagatg aggattaaga 960 ggcacgatgc cgtggtgaac tacatcgcaa agtccctcaa tgagcagggc ttcgaagttg 1020 attgtgaacc acacatacaa ggaccagcag gtttgcgaaa gccagacatc gtagtgaaaa 1080 aggaagatgc agcaatcgta gtggatggga taatagcggc ggagcaagct gatntgaaga 1140 gagtgcataa tgagaagaga aagaagtatg nagacctgaa gccgatcatc aaggagaagt 1200 atggagtgga gaaggtggag tttacgtntg caacattgtn tgcaagagga ctttggagta 1260 gcgattntgc atnggacctg ntaagattca gtgtaatcaa gtcaaaggat ttgaaggtga 1320 tctcgacgcg agtacttgta ggagggatgg ccatctttca cgacttcaac aggagaacat 1380 caaggagcnc aggnaggaga agaggacaag aaccgccaag acaaggagaa ggttgaagaa 1440 gagcaagagg aagctaacgg ccggatgcct gggttacacg ataaccattc ggatcaggca 1500 taatgtntat tnatgatata gattaaaaaa aaaaaaaaaa aaaaaaaaaa 1550 // ID hAT-N8_AP repbase; DNA; INV; 865 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N8_AP. XX NM hAT-N8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-865 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2108-2108 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 865 BP; 272 A; 131 C; 119 G; 343 T; 0 other; gggagcggat gtttatgcat ttatcatatt ttttaataac atctaaatta atgcggagta 60 agatagaaat ttgtttcatc acctaaagaa ttcaaattta ccaatttaat tttgcatatt 120 tttgcatatt tgtccgattt ttacttgatt gatcatattt ttatattatg catcatgctt 180 ttagtacata tttgaggttt ttcgtaatat actaatatag taatatatta acggactcgt 240 cacgcgtaat taatagttca aaatatcccg acggcggcgg gaatggccgg ttttcacccg 300 ccgcggcggg atgtatgggt ttattcaaga catttcgatt actgtgcctt atcgatattg 360 tcccgcgaca aaaataaata taaaataccg ttgtcgccgg agtcggtatc gactatcgaa 420 caatttcgct aattgcagtt gtgttctacg tttctatatt gtataaaatt gtatacctat 480 ttatttaaaa atgccgaaaa ttaattccaa atcgtcgaga tttcataatt ttgttttaaa 540 cattgattat ccagtgcaat gcccaattac tagatatgtt ttcaaatcat ttcatgttca 600 cctagttagt acttggtttc taatatctac ccatgtatat tattttcaga ttaacaaaca 660 attcaccaac agttcttttt aaagttttaa ttttatgaat aattatttta tacctatata 720 tttttaattt ttaataaaca aaataaattt aaataaaaat tttattggtg catattttta 780 ataaattcta agcatatttt tatacttttt agagcttaat tgcatgcata tttttaaatt 840 ttttggagca taaacatccg ctccc 865 // ID Gypsy13-NVi_LTR repbase; DNA; INV; 215 BP. XX AC NW_001814827; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 20-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13-NV; KW Gypsy13-NVi_I; Gypsy13-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-215 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1210-1210 (2007). XX DR Genome; NW_001814827; Positions 18446 18232. XX SQ Sequence 215 BP; 59 A; 40 C; 53 G; 63 T; 0 other; tgtaggatat cgaagtaagt acgtagagat agattgtaag gttggtatgt tggcgcccct 60 gagcgagcgt acgtgagcag cgagcgctgc ggcgcagcgc tcagattaag tcgaaccgtc 120 ttgtcgacta cacatgtatt tgtgtgatct caataaagta ttaagttatt ctaaatacac 180 atgtgtcact agatatttcc tatcacgtta ttaca 215 // ID Gypsy-21_OD-I repbase; DNA; INV; 13363 BP. XX AC CABV01001265; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_OD_; KW Gypsy-21_OD-LTR; Gypsy-21_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-13363 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001265; Positions 15392 2030. XX CC Positions [3155-3631] - Reverse transcriptase CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 151..1323 FT /product="Gypsy-21_OD-I_1p" FT /translation="MNRDIKKLFTSLDGHRHHSEEKLHKLSFQDSDHVARR FT QVILAWKESGHAALRKELSKTSPKNEFLTQIPRLSWLWGENDGEIPDLNIQ FT SSIPSSITEFADWQRDKRYRMVPELHNARLNANRTEFIDDNNGGVTKNQAE FT EAYKTHFFPGKENETLPTIMNTRDNDGKTICDYAIEAVRIVGRKLILHVET FT NSERPARNGRFLNLNDYVIKCIKQNCPGDSEANVLLNKIMMIKSFATGNHQ FT KVKASNCFDVLSQVLSFFIRDHRRDGRFPNGTKISEGDGYTEYYYDERDTW FT TPFSEIMSIMMVFMSTDYKIWEKMFKDLAAKRNQDYRTLEMRDIIEDKDYF FT FSLFDKQTGATEHLELKNLELAVESDKSQVKSTYENIKQLALDQAIQK" FT CDS 4254..6158 FT /product="Gypsy-21_OD-I_3p" FT /translation="MSFNIDLLYNSCKTKEIELADYLSRDEGNLKKCSGDC FT GVCVSINKHGCLNVDETIGPEEADLKRFQSAFTVQWVEPRVSYGRWEDIFN FT FENKDEYEIKRDNAFYHRAYSFNRNEEEINAVVTRNRNRSFKKDEKVSIIK FT AFEKRTLTEIVESKDILRMAQGYDRTIKSILDCKLGKGPPPGKGETGAEPL FT RQKSFLTEDGILCRNQMFVEGNIVKPIVLPDFLLELICHKMHTERGCSSIN FT ALKTAVKRDFWCKLIDKTAESVVRGCRNCQYLRKSPPIQKDMKEHDDMSPK FT RLGDTIYIDVITRNSMGKKIHGDTTLKFYVASEAISSLTKVYPIGTGANNS FT ETGVRIVLKILDDFSTGPPGVMLKRILMDGSSVNVKMSKDLEFEEMNVRFF FT TAEKCSKSKNYIAPLDSRIQKLSPVLNELIREAKSPAKIATVASARYNSVI FT GTMGLCPFEIWNNRGMLTGKPLSIEIETLVSMIKKSRKLSREAKDRNLRMG FT RVRMPIHFRPYTKGDLYDSDLESPIKLGDILLIEGDAAKNNLHPFFEVIAT FT KEIPGGIDWENNLAATRKVGVRLLKPYVWNLDAIRAIVDGDLVKDKTDRQE FT IRDKMIKVNVSLLLPECMQSRSEVYEFELWPFERRKI" FT CDS 2309..4261 FT /product="Gypsy-21_OD-I_2p" FT /translation="MPYVVSFELKFGDVTIVFCRARVHDQSSSTCLLGQSD FT LPYNKVSMISSSEIKGWKGETRMLCKIQGKLLEDIWPTEECLALREELFSN FT LPRGKLQPLERTPTIDERVYNSSVIVNMKPTNSFDPDAIRAHARHLEKLHQ FT ENREKNSWKECKVNGRSFDEKFVPDGVFKNVEDPKGLNDRIWNVIERSKVL FT YEGTQGHVVQGEFETFGEIRDGDSSPKFSPGHYGKNQESWVTEETVKKLDQ FT ELAEGVLTILPRGMSPKHMVSFFAVAKKNPTTGVVEMSAGNIRVVADCARS FT GINNDTEHLSRPTDSIRNVLQRVAPYTKTGLVACIDITSMFYCFSMNKALW FT PHFCVLHPIIGLCVYKKLPMGWVSSPLLAREMINRMLYQHRDYVSIYVDDI FT VITGENEEDFVSNLTNVFATLSFYNLRLKGKKTEILSKDILLLGRRVKNGM FT IMQSDHVVNKIKVKTPENLRTLKNIREFLGLINYIAEGMPYKTEICREIQE FT LTQGKDRKSTDVIIWTEKLRESFERIQKTVNKKLVSLFPLEKSLPTFLVVD FT SSNKGTGAFLYQKSKEGETRLIKLYSKKRPDASRNTQWSSCLLELHGILNA FT CIFFRWEINEVSEPVTVITDSISCEKLFKRMQRREGFSFRFENKRTFNKAD FT VL" XX SQ Sequence 13363 BP; 4376 A; 2680 C; 3135 G; 3172 T; 0 other; aagcagggtt ttcgggagaa ttttatacgt ggtgactgaa agttaagcgt aaagcttact 60 ttacggatcc attctccgaa cctttgtgag tacccttttg aataaacctc gagcggagtc 120 tcactatcgc tacgtacgtt tacttaaggt atgaatcgag atattaagaa gctttttaca 180 agtcttgatg gccatcgcca tcacagtgaa gagaagcttc acaagctctc gtttcaagac 240 agcgaccacg tcgcccgtcg tcaagtcata ttggcttgga aagaaagcgg tcacgccgct 300 cttcgcaagg aattgtccaa gacaagcccg aaaaatgagt tcttaactca gatacctcgt 360 ttgagctggt tgtggggtga aaatgatggc gagatcccgg atttgaacat acaatcttca 420 attccatcgt ctatcacgga atttgctgat tggcaaaggg ataaacgtta ccgaatggtc 480 ccggagttac ataacgctcg tttgaacgcg aatcgtaccg aatttatcga cgataataac 540 ggaggagtga ctaaaaatca ggccgaagag gcttacaaga cacacttctt ccccggcaag 600 gagaatgaaa cgttgcccac aattatgaat actcgggata atgacggtaa aacaatctgt 660 gattacgcca tcgaggcggt caggattgtt ggcagaaagt tgattttaca cgttgaaaca 720 aactctgaaa gaccagctag aaacggacgt ttcctaaact tgaatgacta cgttataaag 780 tgcataaagc agaattgtcc gggtgactcc gaagccaacg tattgctcaa taaaatcatg 840 atgattaaat cgtttgcaac aggcaatcat cagaaggtaa aggcttctaa ctgttttgac 900 gtgctgtcac aagttttgag tttctttatt cgggatcata gaagagacgg tcgttttcca 960 aatgggacca aaatatctga aggtgatggt tacaccgagt attattatga cgaacgagat 1020 acatggaccc catttagtga gataatgtcg ataatgatgg ttttcatgag cacagactat 1080 aaaatctggg agaaaatgtt caaagatttg gctgctaaaa ggaaccaaga ttaccgaacg 1140 ttggaaatgc gcgatataat agaggataag gattatttct tctctctatt tgataagcaa 1200 acgggtgcaa cagaacatct ggagttaaaa aaccttgaat tagcagtaga atccgataaa 1260 tcgcaggtga aatcaacgta cgagaacata aagcagttgg ctttggatca agcgattcaa 1320 aaatagtaaa acagggtaaa taaaaatgaa ttcaattgtg tccaggccga agtaggctcc 1380 gatcgcgaga ctgaggagga agcgctacgt ttcaaccctc aaaatacggg ggggaagaag 1440 tttattatga ataaatttca aaacagccaa aaacgtagcc agttgacttc agaagaagtc 1500 aaaagagacc cgaatctgat tgtagccaag ggctctagaa tggtgtataa aagcaccaat 1560 tttgtgttac cccaaaaaga ttttgttgac atcaacgaaa ctggttataa gaaaatcgca 1620 atgattttcc cacgaccacc gccgaaacaa aacggagtca aggaaattat tagaacgcac 1680 gttaatcgag gaggacggag agtgaatcca aaatttgcaa atgggaaacg tgcgatcgcg 1740 ttcgagttag aaactgtcgg agaacaggaa cgttctccgg tcttatatca gatgatgaat 1800 gacgtcgaac ggttttgttt agagcaagag ggcgaagaag atgaagcgaa tttcttgggg 1860 agcgacgaca acaacgaaga gaatggatgg aatggctacg acaatgcggg aataggtaaa 1920 tgtttttctt tagaatttaa taaatctgaa acgttcgtta acaaaaattc cagagtaagg 1980 aacaagtggc tagaagcaag acaagaagta gggaaaacgc ccgttcagaa taaggtattt 2040 gctgtaaact ttgacttagg agtccaaccc aatggcgaag actcattgta cccacaaatt 2100 ggatctgtta ataccaaaaa taaattgata ggtaggaaaa gtttgaaaac cccgttggag 2160 actgttgttc gattccgggg cgagtgtctc acttatttct aatagttttt tgaaaaattt 2220 gcaaaacatt gcagaagaag gtgatctttt aatagaagat gcggacgcag ggtttgccaa 2280 aacagcaggc ggagggtctt tgaaattcat gccgtacgtt gtgagttttg agttaaagtt 2340 cggtgatgta acgatcgttt tttgtcgcgc tagagtacat gaccaaagtt ctagcacttg 2400 tttgttaggc caaagcgatt tgccttataa taaagtaagc atgataagtt cgagcgaaat 2460 aaaaggttgg aaaggagaaa ccagaatgct gtgtaaaatc caggggaagc ttctggaaga 2520 tatttggccg acggaagaat gtttagcgct tcgagaagag ttgtttagta atcttccgag 2580 aggcaagttg caaccgctag aacgtacacc aacgatcgat gaaagggttt acaacagtag 2640 cgttattgtg aacatgaaac ccacgaacag tttcgatccg gatgcaatta gggcacatgc 2700 aaggcattta gagaaactcc accaagagaa cagggagaaa aattcttgga aagaatgcaa 2760 agtaaacgga cgttctttcg acgagaaatt tgtaccagac ggggtgttta aaaatgttga 2820 agatccaaaa ggcctaaacg acagaatatg gaatgtgatt gagcgttcaa aagtacttta 2880 tgaaggtaca cagggccacg tcgttcaagg ggaatttgaa acgttcggag aaatcaggga 2940 tggggacagt tctccaaagt tttcacccgg tcactatggg aaaaatcaag aaagttgggt 3000 tacggaggaa accgtaaaaa agctggatca agaactggcc gaaggagtgc tgaccattct 3060 tccacgggga atgtcaccga aacacatggt gtcatttttc gctgtagcta agaaaaaccc 3120 aacaacgggc gttgtggaaa tgagtgcagg gaacataaga gtagtagcgg attgcgctcg 3180 ttcaggtata aataatgata ctgaacacct gtcacgacca accgattcaa tacgtaatgt 3240 gctgcagcgg gtggcacctt acacgaaaac aggtttggtc gcatgcattg acataacatc 3300 aatgttctat tgcttctcaa tgaacaaagc cctttggccc catttctgtg tactacatcc 3360 gataatcggg ttatgcgttt ataaaaaatt accgatggga tgggtttcat cgccactttt 3420 agcaagggaa atgataaatc gtatgcttta tcaacaccga gattacgtct ctatttatgt 3480 agatgatatt gtgataacgg gcgaaaacga ggaagatttt gtatccaatt taacgaacgt 3540 tttcgctact ctatcttttt ataatttacg cctaaaggga aagaagactg aaatactttc 3600 caaagacatt ttacttcttg gtagacgagt taaaaatgga atgattatgc agtctgacca 3660 tgtggttaat aaaatcaaag ttaaaacacc agaaaatcta cggacgttga agaatattag 3720 agaatttttg ggtttaataa actacatcgc ggaaggcatg ccctacaaaa ccgaaatttg 3780 tagagaaatt caagaattga cgcaaggcaa agataggaaa tctacggacg ttataatttg 3840 gacggagaaa ttgagggaat cttttgaacg aatacaaaaa acggtgaaca aaaagttagt 3900 atcactgttt ccgttagaaa aatctctgcc aacctttctc gtggttgact ctagcaataa 3960 aggaaccggc gcgtttctgt accagaaatc gaaagaaggc gaaacgagat taataaaatt 4020 atactcaaag aaacggccgg acgcatcgag gaacactcag tggagtagtt gtttgcttga 4080 gcttcatggc atattaaacg cctgcatttt cttcaggtgg gaaataaacg aagtctcgga 4140 acccgtgact gtgataactg attcaattag ttgtgaaaaa cttttcaaac gaatgcaaag 4200 aagggaagga tttagcttcc gatttgaaaa taaacgaacg tttaataaag ctgatgtcct 4260 ttaacattga tctactgtat aacagttgta aaacgaaaga aatcgagcta gcggactatc 4320 tatccagaga cgaaggtaac ctaaagaaat gttctgggga ttgtggggtc tgcgttagca 4380 ttaataaaca cggctgtttg aatgtggacg aaacaatagg tccggaggaa gcggatttaa 4440 aacggtttca gtctgcgttt actgtccaat gggtcgaacc tagagtgagt tacggcagat 4500 gggaagacat tttcaatttt gaaaacaagg atgagtacga gataaaacga gacaacgcct 4560 tttaccatcg tgcgtattcg tttaatcgca acgaagagga aataaatgca gtggttaccc 4620 gtaatcgcaa ccgttcgttc aaaaaggatg aaaaagtatc aattataaaa gcttttgaga 4680 aacggacgtt gacggagatc gtggagtcga aagatatatt gagaatggcg cagggttacg 4740 accggacaat caagtcgatt ttggattgta aactaggtaa gggtccaccg ccaggtaaag 4800 gagaaacggg cgcggaaccg ttgcgccaaa aatcattttt aaccgaagat ggcatactat 4860 gccgtaatca gatgtttgtc gaagggaaca tagttaaacc gatcgttttg ccagactttt 4920 tgttagaact aatttgccac aagatgcaca ccgaacgcgg atgctcgtct ataaacgccc 4980 taaaaacggc cgttaaaagg gacttctggt gtaaattgat cgataaaacc gctgagtcag 5040 tcgtccgcgg ttgccgtaat tgtcaatatt tgcgaaaatc cccaccgata cagaaggata 5100 tgaaagagca cgacgacatg tcgccaaaaa ggttagggga cacaatttat attgacgtga 5160 ttacgagaaa ttcgatggga aagaaaattc atggtgacac caccttgaag ttctatgtgg 5220 cttctgaagc gattagctca cttacaaaag tttacccgat agggactggt gcgaataact 5280 cggaaacggg cgttagaatt gttttaaaaa tactagacga tttttcaaca ggcccgccag 5340 gcgttatgct aaaacgtatc ctgatggacg ggtcgagcgt gaatgtcaaa atgagtaagg 5400 atttggaatt cgaagaaatg aacgtgcggt tttttacagc ggaaaaatgc agcaaaagta 5460 aaaattatat agcaccgtta gattctcgaa ttcagaagtt atcccccgtt ttaaacgagc 5520 ttattagaga agctaaatcg ccagcaaaga ttgctacggt ggcatccgca cggtataact 5580 ctgtgattgg cacaatggga ttatgtccgt tcgaaatatg gaacaacagg ggaatgctta 5640 caggaaaacc gctcagtatt gagatcgaaa cgctcgttag tatgattaag aaatcaagga 5700 agctttcacg tgaagctaag gaccggaatc taaggatggg ccgggtcaga atgccgatcc 5760 attttagacc atatactaaa ggagatctgt acgattcaga tctagaatcg cccataaagt 5820 taggtgatat ccttctgatt gagggagatg cggccaaaaa taatctacat cctttctttg 5880 aagtaatcgc tacaaaggag atcccgggcg gaattgactg ggagaataat ttggcagcaa 5940 caaggaaagt cggagtgaga cttttaaaac cgtacgtttg gaatttggat gcgattcgag 6000 ccattgttga cggagattta gttaaagaca aaaccgatag gcaagaaatt cgagacaaaa 6060 tgatcaaagt aaacgtaagt ttattgttgc cagaatgtat gcaaagtagg agtgaagtat 6120 acgaatttga attgtggccg tttgagagaa gaaagattta aaaatgatga aatgcactaa 6180 atgatagcaa ggaagagaac gcgcgtttaa gcccagtcag aatgtcgatc gaccggggca 6240 gctggggaag cattgacggt agtaataacg gcaggtcgag cagttgggac gggtcgagcg 6300 ccagcgagag gcaaggcgga aacctaaaaa caaaatttat ttaaaaccgc tcgaatagaa 6360 aagatttcgg gaagccagga aaaccgaccg ttaattactt acattggcat tttccccgca 6420 aacgttcgta tagcagacgg caaggtcgtg gaggctcgta acgatccgac gggacatgca 6480 gaccaaaagc agaagcgttc cacatagaag agcgagtctg aaaaacaatt tagctacaaa 6540 acactcgtgt tcttttgaat taataaaaac gtgcgtttca tataacaatt tttttttctt 6600 caattactta catcataaaa gtttggtcga atgtagctcc agaatcaggc acagccagat 6660 agctgacaga acgtcgtcgg cggtaatgag agcgaggaaa tggaatctac agtaaccggt 6720 cgtttgagtt taaaatttct aatcaaacct gaaaagcatg agcagcccgt atcagacctc 6780 tgagcgcatg tagcaagccg gtgagcgaaa acatagcgta aataaaaaga accccaaaga 6840 aaaaggcgac ggccgaccca gcaaagctaa agcaaaaaat gcggaaacaa ggtcgcctgg 6900 caggagatgt ctctggcaaa ttcgtgaagg tggaaggagt agaaggtgga atctcgataa 6960 gcggaagtga agtagcggac gagtcggtac tcgacatgtg aaagaaaaag gtaaacggtc 7020 gttttacggg aaaattctcg agggcgaatt atgacgattc ctcagatttg aaggcgagtg 7080 cgatcgggcg accacacggc gccacctgga gttttaaata cgaggcgtgg agagtcctcg 7140 ccgacttccg catggtgata atctccggac aatcgacacc agggcgaata agcaacgcaa 7200 ggccacgatg gtaggagcca aggtaggaaa gatactggtg aagcaatttc acaaagctct 7260 taatgaaatc ggcctcgtga tgataattca acggtgaaac gttcgggggg aaaaagctga 7320 aatcaaactt gcagaaaatc ggagagccag agattttcag ggactaaaat aattttattt 7380 accaatattt taacgttcgt ttatcttacg ttgatcacaa acgccaagta cacgtgaatg 7440 tctacacaag gatcaccata aaggtgggcg ttctggtggt attgaacccg gcgctcgacg 7500 agctcggagg gcaagtagac gctctcgtta atattcgtca gcgacatgat taaatcactc 7560 gggtaatctg gtataagccg tggcggaaca ctatccagac agtaactaaa acgaacgttt 7620 aaaataaaga tttaaccagg gcaaacccaa gggaagtttt gaacaagcgc tcggcgcgag 7680 caacgtcctc gtcgactagt agatcaggag tggcctcgcc gtcactatca gtttcgactt 7740 cgcaaattcc ttgtagttcg gaaaattgat tcctcatcac gcgccgagaa attggagtac 7800 gatcgtaaat aggcttgcgt aaaaacggtt gactagcaga ctgaaaaaac gatcgttaat 7860 ttagttcgga taattctaaa acggcagttt tacaaacctt aggtttcatt ttgggttccg 7920 gcaaacccct gatcgaaaaa ctaaaggagt cgttaatctc cggacatcgc gagtcaccca 7980 ggcgagggga tgaaaaagca gcgggagagc caaatttagc cggatcctga acctcgttcg 8040 tttgatcccc agcggtgaat aaacccgcgt ctaaaaataa gaaaaaccga cagttgaaag 8100 atttaagact aaccaggaaa tcgcattttc aactgcgaaa ggaaacgtcg aaattcgacg 8160 tcgattctac aaataacgaa cgtttagtta gaggtggaaa tttacttaaa attgagatta 8220 gtagcttttc taatcggagg agcggtcggg ctgtaaggat tcaaggctac gctggaatcg 8280 ttcctctcgc gtttaacgcc acaagtaacg cgggcggacg caagttcagc cgaagcttca 8340 acacggccag gcatcgcttt aacagaattc gcggccctct tttgctggtc atgatcagtt 8400 tcctcatcgg gtcgttttaa cccagagcga agcggcattc taggtaaatt taaatattat 8460 acaaaaaaaa acgaccgttt gaaaaagtct atttcaaata aatttggccc cccatctctc 8520 tcgcagcaga tttgcatatg caaaaacaca gccagcgaga gcgaaaataa ctgattgaaa 8580 tcaaggaaga gagtttcaaa aagcgcgcgc tcacgagaac gggtaaaaaa ctcaccggtt 8640 ggtcgaacgg aacaatggta ctgatcgaac aacgtttact cgttcgaaga gaaaatatat 8700 tcactcaatt agagatcaaa attaaattaa aggatgaaat acgaccgttg aacatatttt 8760 aatattttgt tatccacaca attgaaataa cctcctgtaa ctgaaacggc cgttcaagag 8820 aaacaatatt ccgcggaaaa tttgacacta gtttttagat catgcgacgg ttccgcactt 8880 tcgaggatcg cgcctcatcg gacgaggaag ggccaccgcg taaagctaga aagctcgagg 8940 atttcgttat agcaaaccgc cagcaagctt atcacttctc caacccgaga atcctccaac 9000 ctgaggcgga aggcctgaac ttccgacagt tactagcgga acgacgttcg aaaacacgtc 9060 caaaacgtaa gcaacgctcg gtttaccagc tatttatcct caacaggcta gtcaacgaca 9120 cccgatctgg ggcgttgcca atcgaaagac cgagacatgg taaaattaag ttgagagagc 9180 ctaatttatt caatttaggt gagatttggg aaagagaaaa tggaaaaaat tcgctaggcg 9240 agacaggaag ggttaacagt tcaggttttc acggtacgtc tcgtagaagg cgagggtgca 9300 ttccacgtag ccacattcac ggctgaactg tcggttaccc gcgtcggtct gaatggcgcg 9360 ctcgggggat tccatgacag ccaaaggaga acctaaattt tagatttaca aacgttcgtt 9420 aaaataaata tttattcttt ttaaagaaat ttcctacctt ccacataatc accatgaaac 9480 ggcttgttgt gatgctggaa atcgtgttgc ggggtcgttg gaagcaaacc aacggtcatg 9540 tactgaggaa cccttttaat aaaaatttaa ttatcgatcc aaaacgttcg ttataccgta 9600 aattcaatgc gtaaagcaat cgagttccga aaaacacctc caaaatccca acaggaataa 9660 acctccgatg ttctattaga atttagatta ggaagtttcg gtttgtaaaa taaacttacg 9720 gttgaaagcg tatacatctt tggtcggttt aggacaattt ataccaccaa cccaaacgga 9780 acgttcctga ccgagaggtc gcaaaatttc catctgcttg acatagcggt tgagtaacgg 9840 gtaagcccca aaaagcacga catcttgtcg gcgccgggtc tgagtataaa cccagtaaac 9900 caactgcaaa gtgcagaaca tcatttcgtc ctaaaatata ataaaataca gtcaaacggt 9960 cgttaaccac taaaagttga aatcaaggaa gacatctggg cggatgacga gcctcaaaac 10020 cgcgaactgg tcacaggaat tcaccgagtg gctcaacaaa gcacaaacaa gtacgtaccg 10080 acccacagta atagtgctca gctgcggaag tataaaaata cttcgccaaa gccaagaaag 10140 agtcagcgct gaagcgggcc cagactcgag gatcgagacg cgtttcgacg acctctgaaa 10200 ataccgagtt aataatttta aaaaaaagaa acgttcgttc aaaaataatt aataaaattt 10260 agtaaaactc acgtccgagg agaccagcat catcgccaac ctgtcggaga tggatgccta 10320 tgcgcccaac agtctgcaac aggaactgat aagcagcccg accagcagaa ccagtgtaag 10380 ctagatcctc ggtagtgcat cggaaggcga taaacgcacg gcgttcgagc tgctcctccg 10440 aaatatccac aggatccgtc tcgcggcgct tgacatgaaa cggggcacgg tggtttggca 10500 taccattagc aatgggtgca gtccccaatt tctgaccatg aacctgagca tcatagctca 10560 agccgttttt caaaagaagc tagatttggt taataattat tctcatccta aaataacgat 10620 cgttttataa tatctaacct tgcggaccat aaaagcatag atgacgagag ctagctcacc 10680 gccaagccca taatgagtaa ggcagtaccg attgcggttg cttagctcat tcacgcgata 10740 ggtctcaaat gagtattctg ggatgagctc gaaaagtgaa actagtcgcc tagtcggtag 10800 agttccccac cggcaaaaat cagcaagagc cgctcgtttg caaaacactc gggcgtcaga 10860 gtcaatcgcg agcatgccag acgccatctt gcggttagcg tcttcgacaa agacgtctac 10920 ggctttaaag aactcctaga taatttagtt aggactagtt tttcaaaacg aacgttattt 10980 cagctaaatt cacttacccg gaaaaaccga tgcacgctaa tctcggtcac aaaactgcct 11040 gaaggaacac tcatggcgaa ctcgcaccgt cgggacatcc caattttgag gtttttgaac 11100 tcgccagaaa ctaaaaccaa gccagcagcg aaatcttgag cagtttgtcg agttcggaaa 11160 actaaaaaac ggtcgtttta ataaataaga gttacaaatt tcgttaaaaa aaaaacacat 11220 tactgaggtt caaatacctt cattgaactg cttgatgttg ttccacttcg agttgaaacc 11280 gcagtcgtaa tcaggcttgt ccccgccctc aatcggaata agagaattcc gctggtcggg 11340 atcggaaaag agatgctcag cctctccaat gttcccgata ttacgcccgt aagaacgctc 11400 gcgagaccaa gactgctggt tcagaaagga aagcacgtcg aaatcgtgct cgattattcg 11460 ttcgatgttc ctaaaatata aatatttaat tcagggatgc aagaaaacgg acggttacgc 11520 caccgcttaa aatcaacgaa gagaacacca aaaattaggg aaaaacggat ctgtaaaagc 11580 caactttagg gaaattagca acttggaatt ataggaaaat gttttattaa aagttgtatt 11640 aacaaatatc tgctacttga aagtttaccg ttcgttaatc cattttaaag ttaataggat 11700 ttccgttaaa ataaacgatt cagtaatgcc gtttaacgct aagccgagta caccgttcga 11760 cattttcgcg gagaatctgg atcatgaact gaaccagccg accgagaatg actttcatca 11820 ggccaacgat gagatggagc agggttacag agtcccaact ccaccaccgg tggaagaccc 11880 cgtaccaaca ccacctgctc cgattgctcc gcaagcaccg atcgttcaac aggcagaaaa 11940 tagaccgaaa cgagcgaaaa cgagcttcct cgaggactcg gatgacgacg atgtcgccac 12000 atctgaccgg cagcaccgcg aattggccgc cgcagttgaa agcgacgacg cgagacgcgt 12060 cagggagaag ctcgaattgg gcgtcccacc tcacctgtgt ttcgatctga tgggccgcct 12120 cgcgattata aagcaaaatt atgggtaact aaaggcgttg ctataaaagt atttaacttt 12180 cgtaaaataa aggtcaaacg ccgttgtgac atggggcaaa caattgcgag taatgctcca 12240 aaatatctcc gcaattaaca ggccgatctc agttgcagaa aatgaggtaa aaattcaaat 12300 aaaaacgatc gtttcgtact taccgaacaa tataagacac ctggtggcta ccgaagtttt 12360 ccattgactt ctgcatgcct tcgagcggcg aaagaactgt tcactatgtc cctgactgct 12420 cctccggaaa tcggccgtct gcttctgtct ctgactatcc agtggtcgcc tcttgaacct 12480 cgcctctctg gtattcgagt aaaagtgact caactagaaa cggtcgacgc ggtaaaaagg 12540 tttattttgg aaaactcgat cgagctacga acgagtgaac tggtagaaca agaatgtgct 12600 gctttaaata gcgtttttca aatggcgtac ggcaccgctg aggtaataaa aacgagataa 12660 aaccgaccgt tttcctttaa taaaatcaaa gcgtgacgaa acgataaaaa aaataatgaa 12720 aggcagaaca ccgcgcgttt ctctaaatag gaatttctaa atgataaacg accgttctca 12780 gccagttttt ttttattgac tcatcaataa tctaggcacc agatctgccg gacgaaaacg 12840 cgtcgaaagc aacatggatg atttacggca cgcggctgca gcgcggattc aaacgggcag 12900 aagaaaaact caaggagttt acggatggaa tagccgaaag tgctcgatgg acgttcgacg 12960 aactccagcg ttacttggag aaaaatagcg ctttgaaaac tgtggcgatg atatcgcaat 13020 atcagcacgt caataaagaa accgaacgtt tgggcaacaa cgtagtggcc ctgcaaccga 13080 atggaagtct aaccaccttc atggttaagg acgggcgagc attccctaca tctcgcaagg 13140 taaatacggc aagataaacg atcgttcaaa aaaagtatta cagccaggat gggtcaacgg 13200 cttcagagtc gtcagcaacg gaagacacgg actccaactc cgccagaacg gcccctcagc 13260 aggacagttc cagcgaaatc atcgccagga gggcagacga ccaacttggg aggacatata 13320 agttaaaagg tagacctcca gacaaccgtt ccaaacttag aat 13363 // ID Hoana1 repbase; DNA; INV; 3520 BP. XX AC . XX DT 20-SEP-2009 (Rel. 14.09, Created) DT 20-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana1 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoana1. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-3520 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 667..2490 FT /product="Hoana1_1p" FT /translation="MEETNEIIERKIASGAYTLSGQRKGRSSLWNILSEIV FT NEADENVPGFVFCRTCAAVLKYKNGQTSNLHRHKCCKNLKKTVTFAQISAE FT DKTLATKICANWVAEDCRPFTVVKGGGFKKMVKFLIQIGARYGENIDVDDL FT IPDPTTVSRNVTKAAHEKKKELSEEIEEVVRSGGASATIDLWTDNYVQRNF FT LGVTFHYIYNQKFHDVVIGMKSMDFQKTTSQNVYTKLKSVFDQFGVDNIER FT IKFVTDRGANIIKALENNTRINCSSHLLSNALEKSFSEAVDLSENLEACRK FT LTKFFKKANMQHLLKTTLKSHCQTRWNSNYRMVKSILENWADINQILTAKN FT ESNRLLKINLTTVKSLVDLCENIEIVFKKLQETQSPSLCYVIPSITKLKQI FT LRPNDDEAENIESLKKYLVKNIENIWMENLSIYHRVAFFLYPPANNLQPSD FT TIEIINFVVSLIGSNSVQTNLNGVGSPSSTVSFDSFNIIFNEFDNVVDPQP FT SSSSDAFFFQDLLEKTTRKKIETSYEEVLRYSRELIDMSHNFNLLEWWENN FT KKIFPNLYKVALQVHSIPASSAAAERSFSLAGNIVTEKRNRIAPGGVDSII FT FLNSIQRNDL" XX SQ Sequence 3520 BP; 1200 A; 576 C; 638 G; 1106 T; 0 other; cagagaactg cacgaatcac attttttttg ttcatactcg agtaatgata ggttactctg 60 attgatttta aaatttgatt catgtgaaca gagcgatccg aatgcttcgt gtcaatgatt 120 tggaacaatt ttgaatcact atcgcatcaa ttgggaatga tttgttttga tttgttcaga 180 agcaaagcaa tacccgaaaa tgatgtaaaa tcatacacat atgattgatt ctgattcgcc 240 tgctgagctt aatgagcttt aatctggagc cttttgaggc tgtatctgga gcctatatga 300 ggcttaatat atttataagg ttacgtggaa atttggttaa aagcaacgcc ataaataggg 360 taatcaaaat ttggtcaaga ggataataaa ataataaata aatattaaac aattggcttt 420 gtaacataat ctttctcggt gtggagcaac ctgacttata gctatgcgcg tttgaaaatg 480 tcgccagtat gttgggctga tttttaattc caccttattt cattattaat caattaattt 540 aaaaccatgt ttcgagacta ccaaatcttt aattgaaaca aaataatcga tttttttgta 600 ccatataaaa atttgtgcca tcactacttg ttactcatta ctttcaaata gtattctcta 660 ataaaaatgg aagaaactaa cgaaataatt gaaagaaaaa ttgcgagcgg agcttatacc 720 ctcagtggcc aacgaaaagg acgtagttcc ctgtggaaca tattatccga aatagtaaat 780 gaagcagacg aaaatgtgcc aggatttgtt ttttgccgta cctgtgcagc tgtgcttaag 840 tacaaaaacg ggcaaacatc taatttgcac aggcataagt gctgcaagaa tttaaaaaaa 900 acagtaactt ttgcacaaat atcagctgaa gataagactt tggccacaaa aatctgtgca 960 aattgggtgg cagaagactg tcggccattt actgtagtca aaggaggagg gtttaagaag 1020 atggttaaat tcttaataca gattggagcg agatacgggg agaatattga tgtggacgat 1080 cttattcccg acccaacaac agtgtcacga aatgtaacga aggcagcaca cgagaaaaaa 1140 aaggagcttt cagaggaaat tgaagaagtt gtgcgcagtg gaggagcatc cgcgactatt 1200 gacttgtgga ctgacaacta cgtccaacga aattttttgg gagttacctt ccattacatt 1260 tacaaccaaa aattccatga tgttgtcatt ggtatgaagt caatggattt ccaaaaaacc 1320 acaagccaaa atgtgtacac aaaacttaaa tcggttttcg accagttcgg ggtcgacaac 1380 atagagcgca tcaaatttgt aaccgatcga ggggcaaata ttataaaggc tctcgaaaat 1440 aatacaagaa ttaattgtag cagccacctg ttgtctaatg cactagaaaa gtcattttcg 1500 gaagctgtgg atctatccga aaatttagaa gcatgcagaa aactgacaaa attttttaaa 1560 aaagcaaaca tgcaacattt attgaaaacc accttaaaaa gccactgcca gacaaggtgg 1620 aactctaact atagaatggt aaaatctatt ttggaaaatt gggctgatat taaccaaatt 1680 ttaacagcga aaaatgagtc gaacagactg cttaagatta atttgaccac tgtcaagtcc 1740 ttggtagatc tttgcgaaaa cattgaaatt gtttttaaaa agctgcaaga aacacaatcc 1800 ccgtcacttt gctatgtaat accgtctatt acaaaattaa aacaaatttt acgcccaaat 1860 gatgatgaag ctgagaacat agaaagctta aaaaaatact tggtaaaaaa tattgaaaac 1920 atttggatgg aaaatttatc aatttaccat agagtagctt tttttttgta cccaccagct 1980 aataatttac aaccttcgga tacaattgaa attattaatt ttgttgtatc attaatcgga 2040 tcaaactcag ttcaaacgaa ccttaacgga gtgggaagtc cttcatccac agtttcattt 2100 gattcattta atataatatt caacgaattt gacaatgttg tcgatcctca gcccagttcc 2160 tcaagtgatg cttttttttt tcaagactta ttggagaaga caactcgtaa aaagatagaa 2220 acttcctatg aagaggtact tcgatattca agagaattaa ttgatatgtc gcataatttt 2280 aatttattgg agtggtggga aaacaataaa aaaatatttc ccaacttgta caaagttgcg 2340 cttcaagttc actctatacc agcaagcagt gcagcagcag aaagatcatt ctcattagca 2400 gggaatatag ttacagagaa acgtaacaga attgcgccag gcggtgttga cagcatcatt 2460 tttttaaatt ccattcaaag aaatgattta taatgtcttg ttattaaaac gcacacatat 2520 attttaaaaa tgattttttt gtgttattat ttcattattt gtatctatta tattatctat 2580 tattattatc tattatatta tccgatcaat cagttatatg atagctaaag gatatagtcg 2640 gccgatcctt atgaaattgg gcagatcgga ttattttgcc agattctata gagtgtccca 2700 tcgttctaac ttaaaaatca acaaagttat tgcattttcg atcaatcagt tatatggcag 2760 ctataggata tagttgaccg attccgaaag aagcatgagt gcaaagtttg aagacgatag 2820 ctttaaaact gagaaactag tttgcgtaga aacaggctgc cagacggcaa acggaaattc 2880 ctatattgac tcaggaggtg atcctgatta agaatatata tactttatag ggtcggagat 2940 gtctccttca cttgatcctc caatttgatc aattttttct tgggtgtact gcggaaactg 3000 agggtgaaag gtcgtatgac agatctggat cgttatcagc attttggtga tttttgaccc 3060 ctttatgcat tttattgtga agcattgtga agaaatatta tgataatatg aaacgtcatt 3120 atttcgctga atccctgtcc ccttagttga tagagaagct gaaattctgt ctattctatt 3180 tgtatatcaa aaaaattaaa ataaacatta ctacaagtct ctggtttcaa tcatacattt 3240 ttactcgagt atacctcaat gatttaaatc atctgcctgg atcgtagcct acttcaacaa 3300 tcattttgta tcaaagtcag tcaattatac ctgacgattt ttttgaagcg atttgattca 3360 aaccgagcta aactatacga tacaaatcac atcaactgat ttaaagttga cttgtacttg 3420 aatcgatttg aaatgttgtg acatatgatg tgatttgatt tgatctgttt ttgttaaaga 3480 tgattttatc atactcaatc atactcgttg cagttctctg 3520 // ID Hoana6 repbase; DNA; INV; 2594 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana6 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoana6. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-2594 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 407..1891 FT /product="Hoana6_1p" FT /translation="MHCNKVYKTSGNTSNLFDHMKRLHPGISMEEQQKRAN FT TIVPFLFNQYDRDSSRKKLIDNALAVLVSADMQPFSIVEDSGFREFVRVLD FT PRYTLPSRKTLQHVYMKNIFEDLKTKLFTILDRVNSCAITADLWTSKANES FT YITATCHFITKDFVLRSVVLATKPLLDDSNHSAQNIASSLRGICDEWNIFD FT KIAAITTDNANSMIKACEFLQKRHLPCVAHTINLVVQSCLAIDCLQDILLK FT CKRIVTYFKSSSIALSKFKASQETDIKYSLIQEVPTRWNSAYHMIERILLT FT NEAISKVLLSTSKAPTPFTADEVAILKDIVQLLSPFDNATKQVSSSTSVTA FT SLIVPIVCGLLHTLDSFRLKLNSFEGIEALNCLVDQVKKRLLGYEKRTFPK FT MSTLLDPRFKKQGFRSPFNADDGLHALENELAALKTPTEKESDCQQPLFEF FT VVKNISTLGRTNRVDAIVELQQYMRAAHSPQQTDPLKFWKVISLQICSLI" XX SQ Sequence 2594 BP; 833 A; 489 C; 480 G; 792 T; 0 other; tagagctggg tcggccactc acgatagtat cgatatatcg aatatttggg tttgcgaacc 60 actatcgatg tttgagtgcc gatagtaaac gatagtggca agcttgcaca tctttacttt 120 tggcgaacaa ctgcggagat ctacacgtga aagttaatat tggtgttggc ggagaaaatg 180 gacaaatttt tgcaaggcga atacgataaa ggtaggtatc taaactagtg catatctgag 240 tatgttttgg actctttctc aataactaat ttctgatata taacttttta caggcataag 300 tgcaaagagc aagtcggagg aggacgaaaa ccaggtgaaa cgaataggga agtccaaagt 360 gtggaaccac tttaaaaagt cccaagatgg caagacggca agttgcatgc actgtaataa 420 agtttacaaa acgagcggaa acacgtcaaa cttgtttgac catatgaaac gactccaccc 480 gggaatttca atggaagagc aacaaaagcg cgcgaatacg attgtaccct ttttgtttaa 540 ccagtacgat cgagactcgt cacgtaaaaa acttattgac aacgctttgg ctgttcttgt 600 gtcggcagat atgcaacctt tttcaattgt agaggattcc ggatttcgcg agtttgtcag 660 agttcttgat ccacgataca cacttccttc gcgaaagact ctacaacatg tatacatgaa 720 gaatatattt gaagacttga aaacaaaact ttttacaata ttggaccgcg ttaacagttg 780 cgccattact gcagaccttt ggacttcaaa agccaacgag tcgtatatta ctgcaacttg 840 ccactttata acaaaagact tcgttttgcg ctcagttgta ttggccacaa agccgttatt 900 ggatgactct aatcattcgg ctcaaaacat tgcttcttca ttgcgtggca tttgtgacga 960 atggaacatt tttgacaaaa tagcagctat tacaacggac aacgctaact ctatgatcaa 1020 agcttgtgag tttttacaaa aaagacatct accctgtgtt gctcacacca taaacctggt 1080 tgtgcaaagc tgtttggcga tcgattgctt acaagacatt ttacttaaat gcaagcgaat 1140 tgtcacatac tttaaaagca gctcaattgc acttagtaag tttaaagcgt cgcaagaaac 1200 ggacattaaa tatagcttga tacaagaagt gcccacaaga tggaacagtg cctatcacat 1260 gatagaaaga atactactta caaatgaggc catctcgaaa gtacttttaa gtacgtctaa 1320 agcaccaaca ccattcactg cggacgaagt cgcaatatta aaagatattg ttcagttact 1380 atctccgttt gataacgcta caaaacaggt atcttccagt acttcagtta cagcatcgct 1440 gatagttcca attgtttgtg gattattgca cactttggac agcttcagat tgaagttaaa 1500 ttcatttgaa ggcattgaag ctctaaattg tttagttgat caagtaaaaa aacgactttt 1560 gggctatgaa aagcgcacct ttccaaaaat gtcaactctt ctggatccaa gatttaaaaa 1620 gcaaggtttt cgttcgccct ttaacgccga tgatggcctt catgcattag aaaacgagct 1680 agctgcctta aaaacaccaa cagaaaagga atcagactgc cagcagcctc tttttgaatt 1740 tgttgttaaa aatatttcta ctttggggag gacaaaccgg gtcgatgcca ttgttgaact 1800 tcaacagtac atgcgagcgg cacattcacc tcagcaaaca gatcctttaa agttttggaa 1860 ggtaatttct ttacaaatct gcagtttaat ttgaacttat tatctttttt taatttcagg 1920 gcgctccaga tgattctctt cttaaaacta cagctgaacg ccttttttgt gtgcaagcat 1980 catcgactga gtctgagagg agttttagta aaacgggaca aattatttct gctaggagag 2040 catcactaaa ggccaaaaac gtggatatat tatcctttct caaaataatt cacaaaatct 2100 tcttatgatc aatcaaataa tttctaaata ggtacactga cgactaatat tgtaacatga 2160 gttcaatgct atagcctcaa cggttagtat tttctgttct aagttatacc tgctaaaacc 2220 ttagactagg ttatagcatg tgtaactcaa aacagaagat acaaaccgtt gaggctatag 2280 cattgaactc atattacaat attagtcgtc agtgtacaaa tttagaatat attaacctat 2340 tcataagaat cttttgtgaa ttatttcatt tcttcctttg ttataagcga cttcatatga 2400 ttatgagatt ttgttatgtt ttgttattcg aataaagaat ggcctttaag aagttttttt 2460 tattatttac tctaaatgca acttaaactt gatttataat ttttttagtt cgatactatc 2520 gatactatcg aatatttgta tgaaaaacct cgaaaacctc gaatatgcca actatcgatg 2580 ttttcccagc tcta 2594 // ID R1_Ele9 repbase; DNA; INV; 5848 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-sequence-specific R1 clade non-LTR retrotransposon family DE from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele9. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5848 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5848 RA Kojima K.K. and Jurka J.; RT "Non-sequence-specific families of R1 clade non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (05-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 13 CC sequences with >91% identity, and ~99% identical to the original CC sequence in [1]. This family shows no sequence specificity. XX FH Key Location/Qualifiers FT CDS 694..2097 FT /product="R1_Ele9_1p" FT /translation="MENINNIGEGAANPFARSGLVRSPTLQTQQAHQQHEG FT EQEQINVQQQQQFSMWSKVPKLPKVVTAKKFVEELHEYVDKRSNVHKDIKL FT LVTKIQGALGAAVKEWKTLEARADTAEKELAVTKSALEAMRASASRKVETS FT TVTGEAPVSNKNVSSVQTTPFFTPKRARASPEDVRPGGPKKHKDTPGTGVY FT PTTEEATEENSQTPWQVVGKKKEKTSKKNPSENRVSIKGRNKGEALILKAS FT DDSYMEVLRAMRSNPDLKELGEDVQKVRRTLNGEMILELRKESKASSSFYK FT ELAEKAMGDKVEVRAVCPEATLECKNMDEISTEEDLRLAMQQQCALGNVPM FT SIRIRKGPRGMQVASVRLPVDAAKKALKTGKIKVGWSVCPISVSERYQLVA FT CFKCLGFGHIARFCNGPDRSKLCRRCGEEGHKAQDCQKSPKCLICANNGDN FT KHVTGGPRCPAFKQASANMSQWR" FT CDS 2163..5054 FT /product="R1_Ele9_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MKCDVAIISEPYRIPPTDGNWVADTSKTAAIWTVGRY FT PFQEVVECAEEGFVIAKVDGVYICSCYAPPRWPLEEFNRMLDRLTEELTDR FT RPFVIAGDFNAWAIEWGSRLTNARGCSLLEALAKLDVHLANEGTISTFRRE FT GRESIIDLTFCSPGLARHMNWRVCEEYTGSDHQIIRYRIGGRFQPVYCESQ FT TDVRRWKMADFKKDVFVEALRLEHITLNLSADELTAALTRACDVTMPRVGK FT PANGRRPAYWWNSTIADLRGSCLRARRRMQRACNNAEREERRLIYRAARSA FT LKSEIRLSKKACLQELCDKANANPWGDAYRVAMARLKGPTMPPEKCPERMK FT VIIDGLFPQHDATTWPPTPYGVQDGDYEESRVTNEELIGVAKALDTKKAPG FT PDGIPNLALKTAILENPDMFRTTLQNCIDDGSFPDSWKRQKLVLLPKPGKP FT LGESSAYRPICLLDTSSKLLERIILNRLTKHAENENGLSNMQFGFRKGRST FT VDAIQTVVETMEAAQKQKRRGNRYCAVITLDVKNAFNCASWKAIADSLHSM FT RIPEYICRILKSYFQNRLLIYETDQGVKSIQITAGVPQGSILGPTLWNIMY FT DGVLKLNLPRGVKIVGFADDVVLLVVGESREEVEVLATETIETVEDWMRDK FT KLAIAHHKTEVVMISNRKVVQQAKITVGDCIIDSKREVKHLGVMIDDRLNF FT NKHIDYVCEKAAKAIGALSRIMANDSAIKSSKKRLLASVTTSIIRYAGPAW FT VTALKTERNRSRLSSTYRLMAMRVSSSYRTISSEAVCVIAGMIPISLLLEE FT DRECYTDRHTRGVRERERADTIRKWQHQWDQATTGRWTHRLIPSLSMWIRR FT PYGEVNFHLTQFLSGHGCFRQYLHRFGHTSSASCPECSNTEETAEHIFFAC FT PRFGLEREEMKAILGEDVNVDNVIQRMCNDADKWNAVNRIVTQIMSALQRK FT WRVEQRHEA" XX SQ Sequence 5848 BP; 1717 A; 1328 C; 1636 G; 1167 T; 0 other; cggtgggagc ttgggatttt ttcctgttcc aagcgacaca cattcataca tccgtatgaa 60 ataccacctt tggcacttcc tccgggaggt ggccgagctt ctggtccatc agaacagaag 120 ctcaagcaga gtctgcctag ggtgtggtgg gggttcaaca gtgggctctg ttgaatctct 180 acaaaaaaac cacatatctg caagtagttc tgaacaagcg acctggtacc gctttcaaag 240 tgccttagcc ccctggagtg ccaaccggca catcaggatg gatgccagta aaatcctgat 300 tatggtatac tggtcacggc gttcaaatag gacaaggcaa ggattttgga ttggacagcg 360 atattgggct gacagtcgtg tcattctact ccccgagtaa ggaagggtga caccaacttg 420 aaacggcgag taggctcatg gcgcgactgc ctccgtcccg ttaaaacctt ggcaggcctc 480 tggatacgtt cagaacccgt ccatcttaag tgatgagtac taggctcgga tgaccattcc 540 cgacctacgc cgatccggct ctgaacacga tcctcataag ggatcgtgtc aaccctagca 600 tggcctccct gctcgcatag ataaccatgg gatcatgaaa ggcgactatg tacggcggtt 660 gacgatagcg gtctggaggg gggaccctga agaatggaga atataaataa catcggagaa 720 ggcgcggcaa atccgtttgc cagaagcggg ttggtaagat caccaacgct tcaaacccaa 780 caagcgcatc agcagcacga gggcgagcag gagcaaatta acgtgcagca acagcagcag 840 tttagtatgt ggtcgaaagt accaaaacta ccgaaggtag ttacagcgaa gaaatttgtg 900 gaggagctgc acgagtatgt cgacaaaaga agcaatgtgc acaaagacat caaactgttg 960 gtgacgaaga tccaaggagc ccttggagcg gccgtcaaag aatggaaaac cctggaggcg 1020 agggctgata cagccgaaaa agagcttgcg gtgaccaagt ccgcgttaga agcaatgcga 1080 gcctctgcat cgcgaaaagt ggaaacgagt actgtaacgg gggaagcccc agtgtcgaac 1140 aaaaatgtca gtagcgtgca aacaacgccc ttcttcacgc cgaagagggc aagagcgtca 1200 ccagaagatg ttagaccagg tggcccaaaa aagcacaagg acacccccgg tactggggta 1260 tatccaacaa ccgaggaagc tactgaggaa aacagccaaa ctccgtggca agtggtcgga 1320 aagaagaaag agaagacgag taaaaagaac ccatccgaaa atcgtgtgtc cattaagggg 1380 cggaacaagg gcgaagcgct cattcttaaa gcaagtgacg attcgtacat ggaggttttg 1440 cgtgctatgc ggtcgaatcc agatctcaaa gagctaggag aagacgttca aaaagtcagg 1500 cgtaccctga atggtgagat gatcctcgag ttaagaaagg agtcaaaggc aagcagctcc 1560 ttctataaag agctagccga aaaagccatg ggggataagg tggaagtaag agctgtgtgt 1620 ccggaagcga ctctcgaatg taaaaatatg gacgagattt caacggagga agatctgagg 1680 ttagccatgc agcagcagtg cgcattgggt aacgttccaa tgtcaattcg cattagaaaa 1740 ggccccagag ggatgcaagt agcctcggtc aggctgcctg ttgacgcagc gaaaaaggcc 1800 ctcaaaaccg gcaagatcaa agtcggctgg tcggtttgtc cgattagtgt gtctgagcgt 1860 tatcagctgg tagcgtgctt taagtgtctc gggtttggtc atattgcacg gttctgcaat 1920 gggccagaca gaagcaaact gtgtagaagg tgtggagaag aaggccataa ggcgcaagac 1980 tgccagaaat caccgaaatg tctgatctgt gccaacaacg gagacaacaa acacgtcaca 2040 ggaggcccaa gatgtccggc ctttaaacaa gcgtctgcaa acatgtcaca gtggagatag 2100 cgcaaataaa cttgaaccac tgtgaggttg ctcagcattt gctggggcag tccgtagcgg 2160 aaatgaagtg tgacgtagct atcatctctg agccatatcg aatcccacct acagatggga 2220 attgggtagc agatacaagt aaaacagcgg caatttggac ggttggaaga tacccctttc 2280 aggaagtggt ggaatgcgca gaagaaggct tcgtaatcgc caaagttgat ggtgtgtaca 2340 tctgtagttg ctacgcacct ccgcgatggc cgttagaaga attcaaccgt atgctagaca 2400 gattaacgga agagcttacc gatcgtagac cattcgtcat agccggcgac ttcaatgctt 2460 gggcgatcga gtggggaagc cgcctcacta acgcaagagg atgcagtctt ttagaagcat 2520 tagcaaagct agatgttcat ctagccaacg aaggtaccat cagcacattt cgtcgggaag 2580 gcagagagtc catcatcgat ctcactttct gcagcccggg gctagcaaga cacatgaact 2640 ggagagtgtg cgaagagtat accggtagcg accatcagat tatccggtac cgcatcgggg 2700 gcaggttcca gccagtatat tgcgaaagtc agacagacgt acgaaggtgg aaaatggcgg 2760 acttcaagaa ggacgtgttc gtcgaagcgc ttagacttga acacatcact ttgaacctca 2820 gtgcagacga gttaacggca gcactaactc gtgcgtgcga tgtaaccatg ccgagagttg 2880 ggaaaccagc taacggtcgc cgtccagcct actggtggaa ctcgaccatt gccgatctac 2940 gtgggagctg cctacgagcc agaagaagga tgcagcgggc ttgcaacaac gctgaaagag 3000 aggaacggag gttaatctac agagcagcaa gatccgcact taaaagcgaa atcaggctta 3060 gcaaaaaagc ttgtcttcaa gaactctgcg acaaagcaaa cgcgaatccg tggggcgacg 3120 cgtacagagt agccatggcg aggttaaaag gaccaaccat gccaccggaa aagtgcccgg 3180 agagaatgaa agttattatt gacggactgt tcccgcagca cgatgccaca acctggccgc 3240 ccactcctta cggagtgcag gatggagact acgaagaatc tcgagtaaca aatgaggagt 3300 tgatcggtgt ggcgaaagcc ttggacacaa agaaggcccc aggaccagac ggaatcccaa 3360 atctggctct gaaaacagcg atcctcgaga accccgatat gttcaggacc acactacaga 3420 actgcattga cgatggcagc tttccagact cctggaagcg acaaaagttg gtgttgctgc 3480 caaagccggg taaaccgctt ggcgaatctt cagcgtatag gccgatatgc ttgctggata 3540 cgtccagcaa actgctcgag agaataatcc tgaacaggct tacgaagcat gcggagaacg 3600 agaacgggct gtcgaatatg cagttcggct tccggaaagg aaggtctact gtggacgcca 3660 tccaaactgt tgtggaaacc atggaggcag ctcagaagca gaaaaggaga ggaaaccgtt 3720 attgtgcggt gatcacacta gatgtgaaga acgctttcaa ctgcgctagc tggaaagcaa 3780 tcgccgattc gttgcatagc atgagaatac ctgagtacat atgccggatt ctgaagagct 3840 attttcagaa ccggttactg atatacgaaa cagaccaagg agtaaaaagt attcaaatta 3900 cggcgggcgt tcctcagggc tctattctcg gcccgactct gtggaatatt atgtacgatg 3960 gtgtcttgaa attgaatctt cccaggggcg tgaagattgt tggcttcgcg gacgacgtgg 4020 tgctattggt agtcggtgaa tcaagagagg aggtggaggt actcgccacc gagacgatcg 4080 agactgtaga agattggatg cgggacaaaa agctggcgat agctcaccac aaaactgaag 4140 tggttatgat aagcaatcgt aaggttgtgc agcaggcaaa aatcacggtt ggtgattgca 4200 tcatcgactc gaagcgagaa gttaagcatc tgggggtgat gattgatgat cgacttaact 4260 tcaacaaaca tatcgactat gtatgtgaaa aagcagcgaa ggcgataggg gcgttatccc 4320 ggatcatggc aaatgactct gcgattaaaa gcagcaagaa gcggctgcta gcgagcgtca 4380 caacatcgat catcagatat gcaggtccag catgggtgac tgcactgaag accgaaagga 4440 atcgatcgcg tctgagcagt acgtataggt tgatggctat gcgggtttca agttcgtacc 4500 gcaccatttc atcggaggca gtctgtgtga tagcagggat gatacctatc agccttcttc 4560 tcgaggaaga tcgcgagtgc tacacggacc gccacacaag aggagtccga gaaagagaac 4620 gtgctgacac catcagaaaa tggcaacatc agtgggatca agctacgact ggcagatgga 4680 cacatcgtct tatcccaagt ctgtcgatgt ggataagaag accatatggt gaagtaaatt 4740 ttcacctgac gcagtttttg tctggacatg gttgtttcag acaatatctg cacagattcg 4800 gtcacacaag ttcggcatcc tgtcccgaat gcagcaacac ggaggaaact gcggaacata 4860 tattttttgc atgcccgcgg ttcggcctgg aacgagagga gatgaaagcc attcttggtg 4920 aagatgtaaa cgttgacaat gtcatccaac ggatgtgcaa cgacgcggat aaatggaatg 4980 cggtgaacag gatagtaacg cagataatgt cggcactaca gcggaagtgg agagtagagc 5040 agcgacatga agcgtaaaga tcagtgcttg gcttcgggtc gtcggagcgc caacgaaccg 5100 gaagccattc tccaaccgga atcgtcggac cgacttcggc acttgaatgc cagcctgacg 5160 aagaaagaag aagaagaaga agaaattagt gcggtatggc cttgggtcgt cggggcgccg 5220 atgaaccgaa agttatcccc caaccggaat cgtcggaccg acctcggcac ctgacaagcc 5280 agccgacgaa gaatgaagag gagcttcggg tatgtagggc acccacagtg cggacgtcat 5340 ccccaaaccg gaactgtcgg accacctctg caacccggag acgaagactg cgcgtgaaat 5400 ttcccctgcg atatccgccg gtagacgggg ccccacagtg cggcagtttt cccccaccgg 5460 aactagtgga caacctccgt cgccgggggg agacaggagg accaaagaga gcgagaggga 5520 gtcagcctaa gaaagcaaga aacagaagcg gagagcttaa agtatggcgg cgttaaaaat 5580 ttgaagcccc cgtttcccga agtaatacca tgaggtagtt ccgggggaca cataggcttc 5640 tttccaaaag ccgattatat gtggtgtgat tgagtttttt cccctttttt tcctctttca 5700 caacaactca agcgagccgg agtagttata gaagacatac gaatgagaac cacagagcca 5760 aatcccaatc ggcgccaaga cagccgtgga tggtgaatgt ttctagcttt tgctagaagt 5820 gcgagaaatc tataaaaaaa aataaaaa 5848 // ID Gypsy-22-I_NVi repbase; DNA; INV; 6972 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-22-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6972 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 781-781 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 661..1947 FT /product="Gypsy-22-I_NVi_1p" FT /translation="MAVAAGKTHNRRSLAILEEIQKLQNEHNPDNAEKELE FT WKNAVVYELATLDKKQSTLEERVEKLETNEDLDSLGEPHIRTSFLNTSKEH FT KQFETIERIDALALTLEKIKKESDNNKIDIDNLQQSHDAMKKRLATVESDY FT DALSDQHTKLQSDFNDIRTLIETKASQTELCLRMQKIIEAQARESTRLEKL FT IKNNNGGSSESTTDITKRVKIATPKFKGLASERPMKFLSELDNYIKLVKPE FT DHELKYVISQALEGDAHNWWYLLQAEVDSFETFSTRFRERYWNNQIQRTNG FT RKVEFGQYSPGPRSRVSYATYMFGLARELELNLEEHELVKKIAEHFDRDIR FT RTVTGHEIKSVNKFLEILADYDNDDTNRNRANAQNNQNAQNKQNNQNHKPT FT XPPAGNNESKNKAVSLTSQQATIPVRKRRGRHTI*" FT CDS join(2186..2650,2634..5480) FT /product="Gypsy-22-I_NVi_2p" FT /translation="MENPRNELLHDINPDGREVEXSHCPEIKIFIENIEVD FT ALVDTGSEVTAISEEFYHQNYETLKLCPTLPICGKVIKGATGSKSARLKIQ FT LRLNTKFEKLSEKIIFLVIPKLVKPCIIGFDTIKLLKMLIDTVHEQILFSV FT KNXDGVITYVDKKNAATKKMPRNEFTSLTIQDAGGYTEKGYHTDLNNLYED FT YENNYENPYELTIEEIDGKINSCENLNEAQRNILKNLIIKRKEVFEKKPGL FT LTTYEHELIITDNTPWFCRPHPIPIADRQDALDEFDRMLKLNIIRRSNSVY FT INPIQIVRKRDGSVRPCLNAQKLNMVLQNNYEGXPXIDEVLQRCADRPITS FT VTDFTASYWQIRLAENSRKYTAFKVDSHVYEFNVVPFGIKTSGSALIRGLA FT EATWDFNEFLVLFVDDLMIKSRTFEEHIEHLDKLYERLIEHNLRLNFKKSK FT FFCEEIEFLGHYISAKGIRPDPEKIRTIRNFKRPKNLKQLQSFIGFINFYS FT KFTKHFAESLIPLLNLMRKNVLFHWTTEHQEAFEKIKTLFDDNIILKYADP FT KKPYILTTDASDFALAAVLSQLNDEGEEEVVCFVSRTLKGSECSYFTTEKE FT MLALVWSLNKLDTYLRGAIEIRVRTDHEALTFLRTCKFNNQRLQRWNLAIQ FT DFNLNPTHIPGKQNAVADYLSRMDESEKSKAESQSEIIISAIVQQKPSREL FT INICKNIEKSQRRDPQTRTLIDELVNNDEKTCKRYELIDHVLYKKAKERSK FT IVIPNSVLRDFINEVHAIYGHIGGGGGRKLNKMLCEHFYASGLRRKIAKIL FT KTCDSCQRTKHLTKKYREMSQPILTQKPGELLSIDFYGPLPTSAGGVKYLL FT TSIDVFSKYVVFYPLRRATAHAVIKKLTEDYIPTYGKPEAIICDHGTQFTS FT DKFVNALKNENIKLVYSSIRHPQSNIVERVHRELGRFFRTFVSDKHKGWSR FT YVKIIQNIINETHHETTNYTPTELHFGRKPMRVWEKYLYIPPEKDEITNEK FT RISLACEQAHRIALKRAAKKNAHAEHFELDIGNKVMIKALNVSDAENGKIA FT KFFDIYQGPYEVQKKVGIDTYLLIDINTKKERGKFHINNLKPYFED*" FT CDS 5806..6588 FT /product="Gypsy-22-I_NVi_3p" FT /translation="MEATPTTSEPQVPKPRKRESSSQTEPMAICDQLTQVN FT EAMLAEGYAECWTXKYKISCLXTLMARGSNFMPGNKRFAKGWFYQTPPGWG FT KRDQKTGTEATPPDPATATKVLIPRRYIPARPRPTKPSAAPRIVTVVHLPP FT LDDAGRANLMLPALAPPQQQQQLPPTATPAPASPQQPPKKEEKKAPCPPPR FT QQTRPVALKRHLVNHHERYEPQARNEGWREEGRRLLEEKRAPRAAPSTSAA FT TSAPSTSTATSRRHSTEPP*" XX SQ Sequence 6972 BP; 2473 A; 1564 C; 1426 G; 1470 T; 39 other; ttggcgcccg aacagggacc cgggtttgtt ataatacggg agattgacta agggagatca 60 ttgaaacatt ttttmttttt actttggaga cttgaaaact ccaccacgct caatgatccc 120 ccttcttcag ttatcgctcg ccgagcaaca aatacacaag cgcctcgata atctctgcga 180 ggcgatctgg garctaacca agcgcagcac gcggaactac gaagccctcc aaaatgttga 240 caggaacgtc aaaacgtgtc ataagaatgt gttaaatctc gatcgtgatc tacgcgtcct 300 ggaaaaggga ttcgaagaat tcctmgaaga agaaactgaa acactcaaac tgatcgagaa 360 aatatcggcg cgaacggaca taaaattccg aatcctgaac tcacgactcg actcgctcga 420 ccgggaagtc tcgaaaataa ccgacgggat gagtaagtta acgattcacg agccgacgtg 480 cagcttaatg taaagcgcgg taacgtttcc cccatccccc ccatctatgt ggcaatctca 540 agcatcaaaa aaaaattttt ttattaatcc ttagataacc aaattttttc tcctttcttt 600 cgattacagc ctataggccc tttctctagc tttgtctrcr gctaaaaaaa aaaataaaaa 660 atggctgtag ccgcgggtaa aacccacaac aggcgctctt tagcgatcct agaagaaatc 720 caaaagttac aaaatgaaca taatccagac aatgcggaga aagagctaga gtggaaaaat 780 gctgtagtat acgagcttgc cacattagac aaaaaacaaa gcacactcga agaacgagta 840 gaaaaattag aaactaacga agacttggac tcactcggag agccacatat acgtacgagt 900 tttttaaaca cgagtaaaga acacaaacag ttcgaaacta tcgagcgcat agacgctctt 960 gccttaacgc tagaaaaaat caaaaaagag agtgacaata acaagattga catagataac 1020 ttgcaacaaa gccatgacgc gatgaaaaag aggctagcaa ccgtcgagtc ggactatgat 1080 gccctctctg accaacatac taaattacaa tcagatttta atgatatacg cacactaatt 1140 gaaactaaag catcccagac ggagctatgt ctgcgtatgc aaaaaattat cgaggcacag 1200 gcaagagagt ctacacgact cgaaaaactt ataaaaaata ataacggtgg ctcttccgaa 1260 tctacgactg atattactaa acgcgtgaaa atagctacgc cgaaattcaa aggtctagca 1320 agcgaaagac ccatgaagtt cttgtcagaa ctagacaatt acattaaatt ggtcaaaccg 1380 gaagaccacg agctaaaata cgtmatatct caagccttgg aaggcgacgc gcataattgg 1440 tggtatttac tccaagcaga ggtggatagt ttcgagacgt tttcgacgcg cttcagggaa 1500 cgctactgga acaatcaaat tcaaagaacg aatggtcgta aagtagaatt cggacagtac 1560 agccctggrc ctcgaagccg tgtcagctat gccacntata tgttcggtct agcccgcgag 1620 ctagaactaa atctagaaga gcacgagctc gtgaaaaaga tcgcggagca cttcgaccga 1680 gacatacgrc gtacggtaac cgggcacgag ataaaatcag taaacaaatt tctcgagata 1740 ctygccgact acgataacga tgacacaaat cgaaatcgag ccaacgcgca aaataatcag 1800 aacgcgcaaa acaaacaaaa taaccagaat cataaaccca ccyacccacc cgcgggtaat 1860 aacgagtcga aaaacaaagc ggtcagccta acaagccaac aggcaacaat cccggtcaga 1920 aaaagacgtg gacgccacac aatatraacg ttacgcagac agcagacgac acawccaaca 1980 agggatcagg gtcgcaaaac aaaggtggga acaaaaaaca gggaaataat aaaccctctc 2040 agaaagatrc ctacgacatc gacgtcttgg attgttttca agacataact aaacaaaatt 2100 cgggaaacat gcagtagctg cgtccaaggc ggatgcactt ccttctctag ggaagaacga 2160 cgcagctgct caaatcgcgt ccatcatgga aaacccgcgc aacgaattat tgcacgayat 2220 taacccggac ggtcgcgaag tggaartatc ccactgccca gaaataaaaa tatttataga 2280 aaatatagag gttgacgcac tcgtggatac gggaagcgag gtaacggcca tttctgaaga 2340 attttatcat caaaattacg aaactcttaa gctttgtcca acattaccta tttgcggtaa 2400 agtcataaaa ggcgctacgg gtagcaaatc ggctcggctt aaaatacaat tgcgtttaaa 2460 tacgaaattc gaaaaactgt ctgaaaaaat catattctta gttataccma aactagtcaa 2520 gccttgtata atcgggttcg acacaatcaa actattaaaa atgttaattg acacggtaca 2580 cgaacaaatt ctcttttcgg ttaaaaacra cgacggagta ataacgtacg tagacaaaaa 2640 aaatgccgcg taacgarttc acgtcgctaa caattcaaga cgccggtgga tacaccgaaa 2700 aagggtacca cacggatctc aataatcttt acgaagatta tgaaaacaat tatgaaaatc 2760 catacgaact cacaatcgag gagatcgatg gaaaaatwaa ttcttgtgaa aacctcaatg 2820 aggcacaaag aaatatatta aaaaatctaa ttattaaaag aaaagaagta tttgaaaaga 2880 aacccggtct tttaacaacc tatgagcatg aattgattat tacagacaac acaccttggt 2940 tttgccgacc acacccaata cccatcgctg acagacaaga cgcactagat gagtttgaca 3000 ggatgttaaa attaaacata ataagacgct cgaatagcgt atatataaac ccgatacaga 3060 tcgtaaggaa acgcgacgga tcggtaaggc cgtgcctaaa cgcacaaaaa ttaaacatgg 3120 ttctccaaaa taattacgag ggacyaccga statcgacga ggtactacaa cgatgcgctg 3180 ataggccaat aacaagcgta acagatttca cggctagtta ctggcaaatc agactagcag 3240 aaaattctcg aaaatacacc gcgttcaaag tcgatagtca cgtctacgaa tttaacgttg 3300 taccgttcgg tattaaaaca agyggatccg cgttaattcg agggctggcg gaagccacgt 3360 gggatttcaa tgaatttctg gtactatttg tagatgatct aatgatcaaa tcgcgtacgt 3420 tcgaagaaca catcgaacat ctcgayaaat tatatgaaag gctaatcgaa cataatttac 3480 gattgaattt caaaaaatct aaatttttct gtgaagagat agaatttctc gggcattata 3540 tttccgcaaa aggaatacga ccggatccgg agaaaatacg tacaattcgt aatttcaaaa 3600 gaccgaaaaa tttaaaacag ctacaaagct ttattggatt cataaatttc tattcaaaat 3660 ttacaaaaca cttcgctgaa agtctaatac cattattaaa tctaatgaga aaaaatgtac 3720 tattccattg gacaaccgaa catcaagaag cgttcgaaaa aataaaaacg ctttttgatg 3780 ataacataat tttaaaatac gcggatccaa aaaagccgta catactgacg acagacgcgt 3840 cagacttcgc gctcgccgcg gtcctctctc aactaaacga tgagggagag gaagaagtag 3900 tatgctttgt aagcagaaca ctcaagggta gcgaatgctc atatttcact acagaaaaag 3960 aaatgctagc cctcgtgtgg tcacttaaca agctcgatac atatttaagg ggggcaatcg 4020 aaattagagt ccgtacggat cacgaggctc taacattcct aagaacttgc aagttcaaca 4080 accagagatt acagaggtgg aacctcgcga tccaggactt taatttaaat ccaacgcaca 4140 ttccagggaa acaaaatgca gtagcagact accttagccg catggacgaa agcgaaaaaa 4200 gtaaggcaga aagccaatct gaaatcataa tttccgctat cgtccagcaa aaaccaagcc 4260 gcgaactcat taatatatgt aaaaacatag aaaagagcca gaggcgggat cctcaaacgc 4320 gtacactaat cgacgagctt gtaaacaacg acgaaaaaac ttgtaaacgg tacgagctaa 4380 tagatcatgt attgtataaa aaagcgaaag aacgatcaaa aattgtaata ccaaatagcg 4440 tgttaagaga ttttataaac gaggtacacg caatatacgg ccacataggg gggggggggg 4500 ggcgaaaatt aaataaaatg ctatgcgagc atttctacgc atcaggctta cgcagaaaga 4560 tagcaaaaat tttaaaaaca tgtgacagtt gccaacgcac taaacattta actaaaaaat 4620 acagggaaat gtcacagcct atcttgacgc aaaagcctgg tgaattactg tcaatagact 4680 tttacggtcc acttcctacc tcggcggggg gagtaaaata cttattaaca agtatagatg 4740 ttttcagtaa atacgttgtg ttttaccctc tccgtcgagc aacagctcac gcggtaatta 4800 aaaaactgac agaggattat ataccgacgt acggtaaacc agaagcaata atatgcgacc 4860 acgggacgca atttacgagc gacaaatttg taaacgcgtt aaaaaacgaa aatattaaac 4920 ttgtatactc atctatccga cacccwcaaa gcaatattgt agaaagggta catagagagt 4980 taggacgatt ctttcgtacr tttgtaagcg acaaacataa aggttggtcg cgatacgtga 5040 aaataataca aaatattata aatgaaacgc accacgaaac aacgaactay acaccgacgg 5100 aattacattt cggccgaaaa cccatgcgag tatgggaaaa atacttgtat ataccgcccg 5160 aaaaagatga aataactaac gaaaaacgaa tatcgcttgc atgcgagcaa gcacatagaa 5220 tagctctcaa acgcgcggcg aaaaagaacg ctcacgcgga gcactttgaa ttagatatag 5280 gaaayaaagt aatgataaaa gcgttaaacg tatccgacgc ggaaaacggg aaaatagcta 5340 aattttttga tatttaccaa ggaccatacg aggtccagaa aaaagtcgga atagacacat 5400 acctacttat agatattaat acgaaaaaag agcgaggaaa gtttcacata aataatctta 5460 agccgtattt cgaagattga atacacaatt cataaaaaga aagaaaatca acaaaagaaa 5520 atttattttt taatatacat agaaccgtga attgtgaatt caaacaaatt taytaaaaca 5580 attcgaaaay ataaaaaact caaaaaattc aagaaacagt aaaatcacga awargagaga 5640 gagagagaat aacacsgcgt acgcgacaaa acgccataca catctcgcga cctacgcgac 5700 ctracaaaaa acaaaaacag cggccgcagc cagccgccga attccaaaaa gcctaagcgc 5760 cgaaaaaaga gctgcaatga gctcagttga cactaatcag tcgaaatgga agctacwcca 5820 acaaccagcg agccccaggt gccgaagccg aggaagcggg agtcgtcrtc gcagacagag 5880 ccgatggcga tctgcgacca gctaacgcag gtgaacgagg cgatgctagc tgaaggctac 5940 gcagagtgct ggacacwaaa gtataagata tcatgtctcy gcactctcat ggcccgcggc 6000 agcaatttca tgcccggcaa caagcggttt gcgaagggct ggttctacca gacaccaccy 6060 ggctggggga agcgagatca aaaaaccggg accgaggcaa ctccaccaga cccggccaca 6120 gcaacgaagg tgctgatccc gcgtcgatat atcccagcca ggccaagacc aaccaagccc 6180 tctgccgcgc cacgcatcgt caccgtggtg cacctgcccc cgctggacga cgccggcaga 6240 gcaaacctga tgctcccagc gctcgcacca ccgcagcaac agcagcagct accaccaaca 6300 gccaccccgg cacccgcatc accacagcag cctccgaaaa aagaggagaa gaaggcgccc 6360 tgcccccctc cccgccagca gacccggccg gtcgcgctaa aacgccacct ggtcaaccac 6420 catgagcgat acgagcccca ggctcgtaac gaaggctggc gcgaggaagg caggcgccta 6480 ctcgaagaga agagggcacc aagggcagcc ccctccacgt cagctgcgac ctcagccccc 6540 tctacctcga ccgcaacctc tcgccgccac agcaccgagc cgccatgaag ccaccgacga 6600 tcgcggagat actccgcagg gtgtacgcag cggcggagat cggcgaggcc accgtcgccg 6660 agtacgagga gttcgccgaa gacggcataa gattttattg ttttttatta cttgtctttt 6720 catttgacga tggcatcgat gccgattctt tttttgttac ctgttaaaag cactgacgct 6780 acgtttcagt ttaattataa aaaagagttt tttatgtttt tttaattaac tttaagaaaa 6840 taaattttgt acttttttat aagcacggac tatgttctct gttgctgctc tcttgtccgc 6900 cttttccttt aaataaaaaa ttgtttttca acagacgcag tgccgtctca aaaaattttt 6960 attggggggc at 6972 // ID BEL-31_AA-LTR repbase; DNA; INV; 425 BP. XX AC supercont1.241; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-31_AA_; KW BEL-31_AA-I; BEL-31_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-425 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.241; Positions 1402541 1402117. XX SQ Sequence 425 BP; 147 A; 87 C; 79 G; 112 T; 0 other; tgttgctgcc agcactgccc cgcagtgtaa ccataactga accgctgaac agtgataaca 60 gggagaattc aacggctagg agaaagatat cattgtcaat ccgtagacaa agcaggttgc 120 gaatccatgc gagagctttc acaatcatat ctatcgagct taaatctaac taccttatcc 180 taagtgtatc acagtgttat cacagtcggt tattaatcac aatagtaccg tgaacacaag 240 ggaaacacat ctaaaatcac ataaattagg ctccaagacc gataattgtg agtatggaaa 300 cttattataa aactaaaact aatgagaaaa taaattttca gtttaaagct gtcgtcacac 360 cctaaatctt ggagttttgt gagctgctat taagaatcgg caagagccca tttgttctct 420 gaaca 425 // ID DNA-7_AAe repbase; DNA; INV; 432 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-432 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1261-1261 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. 10-14 bp TIRs. Terminal 4 bp can CC be CC TSDs. XX SQ Sequence 432 BP; 149 A; 75 C; 76 G; 131 T; 1 other; aattgggtcc taaaattttt aatgaaatct tgtttttatt caataacacg aaagagcatc 60 ttactgcaac tactttagca atttttcccg ctcaaataac ggctatatca tgtttaaact 120 ttaatttaaa aattgggtcc ataaatgaac cttgacactt aagatcatgt ttgacgttcg 180 cttagtcgac aaaaacacca caggggtttt agttcgacca ctggggttgt tcctatctga 240 catttcgtaa gggacacgga aaacaaaata cacccaaaat ttgagtttaa gccaaaggat 300 gtgacaaaat ctaaaaaaaa gttttttggg cttaaaccaa cggaaaacat tagaaaattg 360 agtaaacatg tgtttttggc ctaaacttaa gcgtttggca ctaaaattgg gacagggctt 420 taggacccwa tt 432 // ID CR1-75_HM repbase; DNA; INV; 3733 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-75_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3733 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1902-1902 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2..715 FT /product="CR1-75_HM_1p" FT /translation="VGFLEAENKNQRKEIDFLKNNNNANITYQNNIHNADV FT WAGFVSKEKDLAAKEKNAALFAKISKENNEKNRILNNMIISGLTESVNLDD FT DNKAIDEILTILKVSSDDVKVKKRLKKKNASPPTPISCEIENQFNSVSSRP FT PLVLIEFKNYEKRQSALANARELKNKFGLTKVYVRPDQTENERIALSKLHV FT ECRIRNNALPNSFVDLQGRQLNWGTRDGKKGFWSVRSGTIRWIERKE*" FT CDS 516..3671 FT /product="CR1-75_HM_2p" FT /translation="MFDQIKLKMKELLSLNFMLNVELETMLFQILLWICKG FT GNSIGVQEMVKKDFGLCDLAQFDGSSAKNRTPNPNTYSPLSFISSYSLSDL FT KCLYLNPTSLDNKWDEFQARIASLDYPHIVAVTETWFSATSSTNLENYTVY FT LRNRTIRGGGVAIYVRKDIHSFECEFSDELISEQIWCQILIGNDILLIGCI FT YRQPFTKYESNLQINKSIKRARALIDDGKFTSVLIVGDFNYSDIRWSDLGG FT KCKKGKQSSKIFLKTIDDCFLSQFVTKPTFINNTLDLVLSEDPDRIHNITI FT GPPLGCTQKNYFHNSLLWDFKLKTSTKSIRHHKVNYIYDKGDYNAISEGLK FT AINWPNLFTNKSADESFELFKNIYISLVEHHIPKYNMSFNFKKRDPKWFNS FT TIKAATNEKFRLFMKLRCSSSKFKDSIRVLYNKQNRLVKKLVADAVLKYES FT EVISACKSCPKKFYSYINSQKACKSEIRSLFDHDGVQLVDNMSIANCLNNH FT FFKAFSLPETHSKAPDFPRRTEIVCSPVSSSLFNCCNIEKLLSALDSSKGT FT GNDGIHPRVLKMCSKEFSVPLSLLFIKSFDSGQVPSGWKLANITPIFKKGQ FT RTDPGNYRPISLTSAIGKVMEKIMRDVMTEHLVKHNLLSCHQHGFVKSRSC FT VTNLLEVMDIITKALSDNCAALLILLDFAKAFDRVSHLLLYHKLAGYGFGN FT NILAWLKDFLENRKQRVVLGNFSSEWKDVHSGVPQGSVIGPLLFLIFINDM FT PELVKNNCKLFADDSQLVAKIKTVTDLALVQNDIDMICNWSRLWCMELSIN FT KCKVLKFRGGSTDINIPLTMLDKNGSQIPIEDVSVERTLGVLIHNKLKWSD FT QISDATLKANSILGILKRTFKKWNPTMFVKLYTAYVRPILEYCAPIWCPFL FT LKDIKKLESVQRRATKLVPQFRSLNYKKRLAYLGLSSLAERRTRGDVIQFF FT KIYKKLNIVNWQQQINRASALLCDGPAGHLRGAKHKIDRELSKNSQRHNFF FT RNRVVSSWNALPASVVAANSVDRFKNNYDKWNRSISRRSTIVGSR*" XX SQ Sequence 3733 BP; 1255 A; 649 C; 654 G; 1175 T; 0 other; agttggtttc ttagaagcag aaaataaaaa tcaacgaaag gaaattgatt ttcttaaaaa 60 taataacaat gcaaatatca cttatcaaaa taatattcac aatgctgacg tctgggcagg 120 ttttgtaagt aaagaaaagg accttgctgc caaagagaaa aatgctgctt tgtttgccaa 180 aatttcaaaa gaaaacaacg aaaaaaatag aattttaaat aatatgatca taagtggtct 240 tactgagtca gtaaatttgg atgatgataa taaagcgatt gatgaaattc ttaccattct 300 taaagtatct tctgacgatg taaaagtcaa aaaacgactt aaaaagaaaa atgcctctcc 360 tcctactcct atctcttgtg agatcgaaaa tcaatttaat tctgtgtctt cacgtcctcc 420 attggtttta attgaattca aaaactatga aaaacgtcaa tcagctttgg ctaatgcaag 480 ggaactaaaa aataagtttg gtttaacaaa agtttatgtt cgaccagatc aaactgaaaa 540 tgaaagaatt gctctctcta aacttcatgt tgaatgtaga attagaaaca atgctcttcc 600 aaattctttt gtggatttgc aagggaggca actcaattgg ggtacaagag atggtaaaaa 660 aggattttgg tctgtgcgat ctggcacaat tcgatggatc gagcgcaaag aatagaacac 720 caaatcctaa cacttacagt cctctatctt ttatatctag ctattcttta tctgacctca 780 aatgtcttta tcttaatcca acatctttgg ataataaatg ggatgaattc caagctagaa 840 ttgcgtcact ggattaccca cacatcgttg cagtcactga aacatggttc tccgccacat 900 cttcaacaaa tcttgaaaat tacactgtat atcttcgtaa caggacaatt cgtggaggtg 960 gcgtggctat ttatgttaga aaagacatac attcatttga atgtgagttc agtgatgaac 1020 taattagcga gcaaatatgg tgtcagattt taattggaaa tgacattttg ttaattggtt 1080 gtatttatcg ccaaccgttc actaaatatg agtctaattt acaaatcaac aagtcaatca 1140 aacgtgctag agctctaatt gatgatggta aatttacaag tgttctcatt gttggtgatt 1200 ttaattatag tgatatcaga tggagtgatt tgggtggcaa atgtaaaaag ggtaaacagt 1260 ctagtaaaat atttttaaaa actatagatg attgcttcct atcacagttt gtcaccaaac 1320 caacatttat caataacaca cttgacctgg tgctgtctga agaccctgat cgtattcata 1380 atattacaat tggtcctccg cttggttgta ctcagaaaaa ctatttccac aattcactct 1440 tatgggattt taaacttaaa acatcgacca aatcaatcag acatcacaaa gttaattata 1500 tctatgacaa aggtgactat aatgccatta gtgaaggcct aaaagcgatc aattggccga 1560 acttattcac aaataaatca gcagatgaaa gctttgaatt atttaaaaac atttacattt 1620 cattggttga acatcacatt cccaaatata atatgtcttt taattttaaa aaacgtgatc 1680 caaaatggtt taattctact attaaagcgg caactaatga aaaatttaga ctatttatga 1740 aattacgctg ttcttcaagc aaatttaaag atagtattcg agtcttgtac aacaaacaaa 1800 atcgtctggt gaaaaagctt gttgctgatg ctgtactaaa gtatgagagt gaagtcattt 1860 ccgcttgtaa gtcctgtcca aaaaagtttt attcttatat aaacagccaa aaggcttgta 1920 aaagtgaaat tcgatctctt tttgaccacg atggagtgca attagtagat aatatgtcta 1980 ttgcaaattg cttaaataat cactttttta aagctttttc attacctgaa actcactcca 2040 aagctcctga ttttcctcga agaaccgaaa tagtttgctc accagtttct agctccttgt 2100 tcaattgttg caatattgaa aaacttttat ctgctcttga ctcttcaaaa ggcactggta 2160 atgatggaat ccatcctaga gtgctaaaaa tgtgctctaa agaattcagt gtgcctctct 2220 ctctcctgtt tatcaaatcg tttgattctg gacaagtacc gtctggttgg aaactggcta 2280 atataacgcc aatatttaaa aagggtcagc gtactgatcc tggaaattac aggcctattt 2340 ccctcacctc agcaattgga aaagtaatgg agaaaataat gcgtgatgtc atgactgaac 2400 acctagttaa gcataatcta ttatcttgcc atcaacacgg ttttgtcaaa tctagatcgt 2460 gtgtcacaaa tcttcttgaa gtaatggata ttattaccaa agctttaagt gacaactgtg 2520 ctgcattgct cattttgtta gattttgcca aagcttttga tcgtgtatca catcttctgc 2580 tttatcacaa actggctggc tatggttttg gcaataatat tcttgcctgg ctcaaagact 2640 ttcttgaaaa taggaaacag cgggttgttc taggaaattt ctcttccgaa tggaaggatg 2700 tccatagtgg tgtacctcaa ggatcagtca taggtcctct tctttttttg atctttatta 2760 acgacatgcc agaattagtc aaaaataatt gcaaactctt tgctgatgat agccagttgg 2820 ttgccaaaat aaaaacagtt acggacttgg ctttagttca aaatgatatt gatatgatat 2880 gtaattggtc aagactttgg tgcatggaac ttagcataaa taaatgcaag gttctaaagt 2940 tccgtggtgg ttccacagac atcaatatcc ccttgacaat gttagataaa aatggcagtc 3000 aaattcctat tgaagatgta tccgttgaaa gaacattggg tgttctgata cataataaac 3060 tgaaatggtc agatcagatt agtgatgcca ctctcaaagc taattctata cttgggatac 3120 tcaaaagaac ttttaaaaaa tggaatccaa ctatgttcgt gaagctctat acagcatatg 3180 taagaccgat attggagtac tgtgcaccaa tttggtgccc ttttcttcta aaggacatta 3240 aaaagttaga gtcggtccaa cgaagagcta ccaaactcgt tccacaattt cgcagcttga 3300 attacaagaa acgattagca tatcttggtc tcagctcatt agctgaacgt cgaacaagag 3360 gagatgtaat tcaatttttt aaaatttata aaaaattaaa tattgttaac tggcaacagc 3420 agattaacag agcaagtgca ctcctatgtg atggtcctgc aggtcatctt cgtggtgcca 3480 agcataaaat tgatcgagaa ttatcgaaaa actctcaaag acataacttc tttagaaacc 3540 gagtggtgtc atcatggaac gctttgcctg cttcggtagt tgctgcaaat tcagttgatc 3600 gtttcaaaaa caattatgac aaatggaatc gttctatctc acgtcggtct actatagtcg 3660 ggtctcgtta gacactcggc tcatagaagt caaacttcta ttgtagaaat actataataa 3720 taataataat aat 3733 // ID Harbinger-N18_BF repbase; DNA; INV; 1379 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N18_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N18_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1379 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1379 RA Kapitonov V. and Jurka J.; RT "Harbinger-N18_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 811-811 (2008). XX DR [2] (Consensus) XX SQ Sequence 1379 BP; 281 A; 359 C; 356 G; 383 T; 0 other; ggctagggtc acaattggcc cggtgcccgt cagggatgta gccccggccg ggctctgaag 60 tcgggcggac accggctgtt taccggctgt cgtggtcaca atcaaacgtt gccttcgcgc 120 tgccccatgg gggtcccgga gtgttccggc gggcaccccg tcatttttct ttaaaaaatt 180 tccaccttcc taggttcgtc tccggcagac acccgtcaga tgttctgacg gtgtcctgta 240 gtcacccctg cggggtccga cagggtaacg tttgtctctc acagcccggt ggggcagttg 300 tagggtccct gccgcatgcc ctgcgaacaa cggcagggaa ccaggcaggc tcctggtggg 360 cactcgtcat atttaagggg tataaaaaca agcattgcct gcattcactt aaggtagtac 420 atcattgaac ttggagtact ggtgcggtac gtcggggtcc gttcactgcg gcacggtttt 480 tttcgccaac ttttcctcgc actcgcactg acgtgcggca ctgttacgtt tctaaactaa 540 actaaataaa catgttattg gaatttttaa tgtcggcgga tgacttgcca aactgttgag 600 gcctatattt ttatatttgt aaaacgaaaa aaaatggctt atgggtccga atgactgcaa 660 gtttggcgtt ccgtcggcca attgcagagg aaacgtcctt cttcagccaa acatctttct 720 tcagcttgtt tctatagtct ggggttgtaa tatcgtaaaa aacatagaag ctttctatga 780 aggaggctat ctgttcgtcc tgcttctcag tgatgcagcc tattatgtgg tctgttgcgt 840 ggtgtgttgt cctatccccc tctgttcagc ccttgtctgc agaagtttct gttgctgaat 900 tcttccgcct gcctagtggg ctaagggctg tttcccctgg tcttctcctt tcccttggta 960 tagcgctatg cacttaactc cactgtaata cgagaactgc actgtgatac gtgtaagttc 1020 acggcttttg tcctttcacg tgatctgtga cgtcacatat ttgacccggt gttacgaatt 1080 acacactttt attccgtaaa cctggtcttt gacgtaacga atttattccg acagactgcc 1140 ggcaaccgga ggggtccctc tcggtgttgt cacgattagg tcgcggggtg gttgtacgag 1200 cgggtccctg gttcatttta actcagactt aaattgaacc agggtcccgg ccgggcctca 1260 aaataccccg gcagccgcga ggtgtcccac agggcaacgc aggggtcacg cccgaaagtg 1320 ccaaaaaaca gcccgtgaat tgcccggcag ggcccctttt tcgaattgtg acctaagcc 1379 // ID SINE-3B_CQ repbase; DNA; INV; 341 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE SINE from Culex quinquefasciatus - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE-3_CQ; KW SINE-3B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-341 RA Kojima K.K. and Jurka J.; RT "SINEs from the southern house mosquito."; RL Repbase Reports 11(1), 622-622 (2011). XX DR [2] (Consensus) XX CC >99% identity to consensus. Putative SINE. ~76% identical to CC SINE-3_CQ. XX SQ Sequence 341 BP; 82 A; 81 C; 92 G; 86 T; 0 other; tcccatctgc tgcaaccttc catcggatga ggaagtaaaa tgtcggtccc ggccttggtt 60 gttaggccgt taagtcattc caggtgtagg agtcgtctcc atgccataag tacaaacaac 120 acaccaaacc aagcctactc cggtggaatc gctggcggcg gttggactcg caatccgaag 180 gtcgtcagtt caaacactgg ggtggaaggt tccttggagt agaaagaggt ttgggtgctc 240 tccccattca agccttcgga ctcctaggtt cgagcagaaa cttgcaatag agaccacaaa 300 agacccgggg gtcgttaatg tggatggttt gatttttttt t 341 // ID DNA8-57_AP repbase; DNA; INV; 220 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-57_AP. XX NM DNA8-57_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-220 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1991-1991 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 220 BP; 75 A; 33 C; 38 G; 74 T; 0 other; catgactaaa taaacaggca tgatcgctag attactgcac gcatatttta tacatatatt 60 atacttcccc agacttggaa gaaacactaa tatcatagaa aaaaatgtta ttcacattac 120 aattatatag gaattatact gtttttgcca tgtctgggga aaatgcgcag tgataggaaa 180 aattggtttt ttggtgatca tgcctgttta tttagtcatg 220 // ID CR1-4_TCa repbase; DNA; INV; 4055 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.04, Created) DT 04-APR-2009 (Rel. 14.04, Last updated, Version 3) XX DE CR1-type retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW CR1-4_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4055 RA Jurka J.; RT "CR1-type retrotransposons from Tribolium castaneum."; RL Repbase Reports 9(4), 738-738 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 360..1100 FT /product="CR1-4_TCa_1p" FT /translation="MAEFNCCHCFKSVSRDTPIVVCDGCSLNIHCRCLKIS FT ENEIAFISQSRSPNIKLFCNRCNVTITAISEIKKLVNDIKTSFDERLSELE FT SLIINNNTSQAINREEIIAESVDRSARACNVIMYNVKPVANKQDVDVVNDV FT LEVIDPSLVIGPENVFRVGKTVGDKPRLLKLQFKTQQMARLCLKKKSSLLQ FT HPQFAHITISDDKTPGQLKHLNNLRDELKRRLDVGEKDLTIKYVNHVPKII FT SHQKN*" FT CDS 1118..3880 FT /product="CR1-4_TCa_2p" FT /translation="MALLYTNIASLSSKFNDLLLEVTEFRPSIIALTETWL FT NPSISDDAVHIENYSIYRDDRLSQKGGGVCFYVDKIKISKFFKITQICISV FT KPYDSLWLKFEGNEQTFVIACIYRPPTGASFNNHENDKNLFNTIQSTINQY FT PNLIIMGDFNFPCIDWSAKNTTGQTIIEKQFSELLLENNLSQIVTEPTRFR FT SQQRPSLLDLVILSDVNLVSSLNVSNPVGTSDHSKIEIELQIMIYSEPNAR FT IKTTSIVDYTRIDESLLTYDWNGLDSLNAEEQWKHFKSILQTCITTNSRLF FT TNRLRKDKPWINQHMLDKIKYKAKLWKKFKRKPTEDNFSAYKRFSNQLKSD FT LQIARCNYENSILNKPKLFYSHVRKFISSRVSVPLVRNASGLLCSDHTETA FT NTLADSFAATYNLNINNNIPSFMPTSTFSGLSYINFTPDLVKSQLCAIKLN FT TAPGPDQIPPKLLRTCAASLSGPLAKMMTKSFLQGTLPYEWLQATITPLYK FT SGDKLDPANYRPVSLTSTCCKIMEKVIVKELLTYMRKNNLIPEHQHGFLPG FT RSVVTNLLSCTNLWTKMLDTNQPVDIIYLDFSKAFDKVPHNLLLAKLESYG FT VTGNLLDWIRAFLSNRNYCVKVCGKFSYSKPVLSGVPQGSVLGPILFLLYT FT ADLLFSLDSYSAYADDIKLFANPLVTDLQDQLNLIFQWSEKWQIPLNIAKC FT CVLQCGHNNPKYLYHINGTLLSVKTSTKDLGVIVNDKLGWSEQCLASVGRA FT RKMFYLLKHVFPNPSVSFISKVYITYIRPHLEFAIPVWRPYLLKDIDLLER FT TQHLVTRWTHCLRQVSYEQRLEILKIPDLRARQNRADAIQIFRLTHGLFPG FT VTRNFLQIQYHDRLRGHSYKLKKEPFKTTVRKNYLTNRAFDIWNGLPDNVV FT AAVTVSSFKNQYDTLQ*" XX SQ Sequence 4055 BP; 1325 A; 745 C; 684 G; 1301 T; 0 other; gtctctgcgt gctcttgcct ttgggattca ttttcagtgg ttgattcttg gtgttgattg 60 ttgaatttga tatctgggtg tatgtggttg ttgttgtttc ctctggaaat tactattttt 120 gcgatttcgg taagttattc ttgtttaatt cgttaatatc cgtcattaga tacgcataat 180 tatatcttat cgcggcagta taagatagcg ctcgcccctc tatagcagca gcagctggta 240 tcgcggcggc gcggcgcagt gaaaatttag tatttatcgc tctgtcgcag agtgagcgtc 300 tggtgtccaa attttgtcta cattaatttc attaatttca gtttaaatca ctaaatttaa 360 tggctgaatt taattgctgc cactgtttta aaagcgtaag tcgtgataca cctatcgtgg 420 tgtgtgacgg ctgttcgcta aatattcact gtcgttgcct caaaataagc gaaaatgaaa 480 tcgcctttat ttctcagtct cgctctccaa atatcaaact gttttgtaat agatgcaacg 540 taactatcac agcaatttca gaaattaaaa aactcgtaaa tgacataaaa acatcatttg 600 atgagcgtct tagcgaactc gaatctctta ttattaataa caatacgtct caagctataa 660 accgcgaaga aattatcgcc gaatctgtgg acaggtcggc tcgtgcgtgc aatgttatta 720 tgtacaatgt taaacctgtc gctaataaac aagacgtgga tgtggtaaat gatgtcctcg 780 aggtcattga tccatcactt gtgattggtc cggaaaatgt ttttcgcgta ggaaaaaccg 840 tcggtgataa acctcgctta ttaaaattgc aatttaaaac tcaacaaatg gcacgtctat 900 gcttgaagaa aaaatcgtct ttgctgcagc acccacaatt tgctcacatt actattagtg 960 atgataaaac cccgggtcaa ctaaagcatt taaacaattt gcgtgatgaa ttaaagcgaa 1020 gacttgatgt tggcgaaaag gacttgacaa ttaaatacgt taaccacgta ccaaaaatta 1080 tttcacatca aaaaaactaa atatttacca ttgttcaatg gctcttctgt atacgaatat 1140 cgcatcttta tcttcaaaat ttaatgattt gttattagaa gtaactgaat ttagaccttc 1200 tattatcgca ctcacggaga cttggcttaa tccgtctatc tctgatgatg ctgtacacat 1260 tgaaaattac tctatttaca gagatgatcg cctatctcaa aaaggtggcg gtgtgtgttt 1320 ttatgtagat aaaattaaaa tttcaaaatt ttttaaaatt actcaaattt gcatatctgt 1380 taaaccttat gattctctat ggttaaaatt tgaaggaaat gaacaaacct ttgttatagc 1440 atgtatttac agaccaccaa caggagcgtc ttttaacaat cacgaaaatg ataaaaatct 1500 atttaatacc atacaatcaa ctattaatca gtatcctaac ttaattatca tgggagattt 1560 caattttcca tgtatagact ggtctgctaa aaacacaaca ggtcaaacaa tcatcgaaaa 1620 acaattttct gaattgctct tggagaataa tttgtcacaa atagtcactg aacctacacg 1680 atttagatcc caacaacgtc cttcattact tgatttggta attttgtctg acgttaacct 1740 tgtatcgtct ttaaacgttt cgaatcctgt aggtacatct gatcattcaa aaatcgaaat 1800 tgagttacaa attatgatat attcagagcc aaatgcaaga attaaaacaa catcaattgt 1860 tgactacact cgcatagacg aaagtctctt gacttacgac tggaatggtt tagattcact 1920 aaatgcggaa gaacaatgga aacattttaa atcgatttta caaacttgca ttacaacaaa 1980 ttccagactg tttacaaacc gactgagaaa agataaaccg tggataaatc aacacatgct 2040 tgacaaaata aaatacaaag ctaaattatg gaaaaaattt aaaaggaagc ctactgaaga 2100 taatttttct gcttataaaa gattttctaa ccaactaaaa tctgatttac aaatagccag 2160 gtgtaattat gaaaattcta tattaaataa accaaaacta ttttattccc atgtacgtaa 2220 atttatctct agtcgtgtgt cggtcccttt agtacgaaat gcttcaggtc tactctgctc 2280 cgatcataca gaaactgcaa acactttagc ggacagtttt gctgctacct ataatttaaa 2340 cattaataat aatataccgt catttatgcc tacaagtaca tttagtgggt tgtcttatat 2400 taattttact ccggatctgg taaaaagcca actctgtgcc attaaattaa atacagctcc 2460 aggtccggat cagatacctc ccaaactcct gagaacttgt gctgcttcgt tgtcgggccc 2520 acttgccaaa atgatgacaa aatcattctt acaaggcact ttgccctacg agtggttgca 2580 agcaacaatc acgcctttgt acaagtcagg agataaactt gatcctgcta actatagacc 2640 tgtaagtctc acgtcgacat gctgtaaaat tatggaaaaa gttatagtca aggaactatt 2700 aacttacatg agaaaaaata atttaatacc tgagcatcag cacggctttt taccaggtag 2760 gtcagttgtc acaaatttgc tatcctgcac aaatctatgg acaaaaatgt tagacactaa 2820 ccaacctgta gacataattt accttgattt tagtaaagcc tttgataaag tccctcacaa 2880 tctcttgtta gcaaaactgg agtcctatgg tgtaacagga aatttgcttg attggatccg 2940 agcgtttcta tcaaatcgaa actattgtgt aaaagtctgt ggtaaatttt cgtactctaa 3000 accagtatta agtggggtcc cacagggatc agttcttgga ccaattttat tcttgttata 3060 tacagcagat ttattgtttt cactggacag ctattcagca tatgcggatg atattaagtt 3120 atttgctaac ccacttgtca cggacttgca agaccaactt aatttaatct tccaatggag 3180 cgagaagtgg caaataccac taaatattgc aaaatgctgt gtacttcagt gtggccacaa 3240 caacccaaag tacctttacc atataaatgg cacactcctg agtgtaaaaa cctcaaccaa 3300 agatctaggg gtcattgtaa atgataagtt aggatggtca gagcaatgtc tagcatcagt 3360 gggtcgagcg cgtaaaatgt tttatttgct aaagcatgtt tttccaaatc cttcagtcag 3420 ttttatttca aaggtgtaca tcacttacat aaggccgcat ttggaatttg ccattccagt 3480 ctggagacct tatttgttaa aggatattga tttgctggaa agaactcagc acctggtaac 3540 tagatggaca cactgtttaa gacaagtttc atatgaacaa cgtcttgaaa tactcaaaat 3600 accagatcta cgtgctcgtc aaaacagggc agatgcaatt caaatttttc ggcttactca 3660 tggtttattt cctggagtaa cacgcaattt cttgcaaatt caatatcatg acagacttag 3720 aggtcattct tataagctga aaaaagaacc atttaagact accgttagaa agaattattt 3780 gacaaatcgt gcatttgata tatggaatgg cttgcctgat aacgttgttg cagctgttac 3840 tgtttcttcc ttcaagaatc aatatgacac attacaataa ttaacattaa cctttattaa 3900 ttaactaatt taactgtaga ttaaaattgt atttcatttt tcattattca atggtgtttt 3960 atttaacacg aaaccgtatt tcttgatatt ttgctataaa ggtgttaacc tcagcaaaga 4020 atctatgtta ataataataa taataataat aataa 4055 // ID Gypsy-9_CQ-LTR repbase; DNA; INV; 133 BP. XX AC AAWU01032098; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_CQ_; KW Gypsy-9_CQ-I; Gypsy-9_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 398-398 (2011). XX DR Genome; AAWU01032098; Positions 69494 69626. XX SQ Sequence 133 BP; 54 A; 39 C; 17 G; 23 T; 0 other; tgttgcacac cctggactaa cgaagcaacc cattcaagca atcgaagcaa ccatcaatca 60 atcaagcgat cattccactt ctgacactcg aacaaacaag aagcaataaa atcagctctc 120 acgaaataca aca 133 // ID TTAA16_AP repbase; DNA; INV; 569 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA16_AP. XX NM TTAA16_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-569 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2082-2082 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 569 BP; 170 A; 100 C; 107 G; 192 T; 0 other; ggggcctcgc catcgcattg atttaaggag aatcatttag gcaatttcta atctcgttgt 60 cgccacccga gatctatgtg ttagtcgtaa atgattatta gtgtgtgtgg cgtccatgcg 120 ggtcaattgt agcggtacgg agggctctca cacgcactcg cacgcacatg tcagtatgta 180 atattgtgat catgaaaaaa tgtttatttt tcactcaaac acacgattct aattccaaca 240 atacgtcgca acgattaact cgatttgcat tctgacaaaa tattcttttt ttccacaaca 300 tctacgaggc atcggtcgag gtacattttt aaaattgtat ttagttaatt aaatatttaa 360 gttggaaatt ctgaattttt ttgaattttt tacaattttt attgcatttt aaattacaaa 420 aattaaaaat gtggctcgac cgtttccttg tatatattac ggacaatccg aatatttttt 480 cagaataaaa atcgtgataa ttgtcggcac gtaccgttgg aattagagac agtagttttt 540 gacaaaaaaa gacgtggtag cgaggcccc 569 // ID Copia19-NVi_I repbase; DNA; INV; 2679 BP. XX AC AAZX01010888; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia19-NV; KW Copia19-NVi_LTR; internal portion; Copia19-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2679 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1150-1150 (2007). XX DR Genome; AAZX01010888; Positions 5707 8385. XX CC Positions [45-557] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 33..1667 FT /product="Copia19-NV_I_2p" FT /translation="MCSLSSPTPSDVSGKFFHSDVCGPMPVDSLGSARYFV FT SFKDDFSGYRYVYFLKHKSDVYDRFKEFERLIYNKFGHSMKVLHTDNGGEY FT CNALMKNLLEEKGIKLETTAPYTPQQNGRSKGDNRTIVESVSTMLLRSQAP FT QYLWAEAVSTAVYVLIMTVTKRKPDTTPWECWTKRKPDYTHLKIFGCLAFE FT HVPKALRKKFAKKANKVIFVGYEKESRNYRLYNYNTGKMSVSRNLTFDESW FT FIEAIKPAEIVIFHAPIPDNGPSEENFKATDNIGGTEASESIHEAANEQEK FT SPSKQPQQQARTQRPASGNTNKKIDLCKVQKPDCMPELSSCRQLRIRENIK FT RSAKYEANFIKIHEPDSYQGAMAGPNAAHWKVAIHEELLTHDKNKTWTLVA FT RPNICTVIDSKWFFRVQEAKAGKENRLKARLCARGFRQKYGINYQETFAPV FT VRYDALRVMLAIAAHRDLEIIQFDVKTAFLHGQLEEDVMEIPEGLSDFYNI FT SDKNLVCKLNKSLYGLSKRQGVGTLPLKHLYLNTVLTVVTRRVQFLLDV" XX SQ Sequence 2679 BP; 869 A; 487 C; 608 G; 715 T; 0 other; ataggttatg gacccagagt acgcaaagtc gcatgtgctc cctttcaagc ccaacaccga 60 gcgacgtgag tgggaaattt ttccacagcg atgtctgcgg accgatgccg gtggactctc 120 tgggaagcgc taggtatttt gtatcgttca aagacgattt ttctggctat agatacgtgt 180 attttttaaa acataagtcc gatgtgtacg atagatttaa agaatttgaa cgtttgattt 240 ataacaaatt tggtcattcc atgaaggtcc ttcacacaga taatggcggg gaatactgca 300 acgctctcat gaaaaatctt ttggaggaga aaggtatcaa gctggagact acagcaccat 360 acacacccca gcaaaatgga agaagcaaag gggataaccg aacgatcgtt gagagtgtct 420 ccacaatgct ccttcgcagc caagcaccgc agtatctctg ggccgaggca gtcagtacgg 480 cagtttacgt attaatcatg acagttacaa aaagaaaacc agacaccaca ccatgggagt 540 gttggactaa gaggaaacca gattatacac acttaaaaat ttttggctgt cttgcttttg 600 aacatgtacc caaggctctt agaaagaaat ttgcaaagaa agcaaataag gttatattcg 660 taggatacga gaaagagtct agaaattatc gcctttataa ttataacaca ggaaaaatgt 720 cagttagtag aaatttaacg ttcgacgaga gttggtttat tgaagccata aaaccagctg 780 aaatcgtcat ttttcatgcg ccaataccag ataatgggcc atcggaagaa aattttaaag 840 ctactgacaa tataggcggg actgaggctt cggaatcaat tcacgaagca gctaatgaac 900 aggaaaaatc accttccaag cagcctcaac aacaagcccg cactcaacga ccagcgagtg 960 gaaatacaaa taaaaaaatt gacttgtgca aagtccagaa gccggattgt atgccagagc 1020 tcagttcgtg tcgccaactt agaattcgtg agaacatcaa aagatcagct aagtacgaag 1080 ctaattttat taaaattcat gagcctgatt cttatcaggg agcaatggcc ggtcccaatg 1140 ctgcacactg gaaagtagca attcatgaag aattattgac tcacgacaaa aataagacgt 1200 ggactctggt agcaagaccc aacatctgta cagtcatcga ctcgaagtgg tttttcaggg 1260 tgcaggaggc taaggctgga aaagaaaatc ggctcaaagc tcgactctgt gccagaggtt 1320 tccggcagaa atacggaatt aactaccaag aaacattcgc acctgtggta agatatgatg 1380 ccctacgcgt catgcttgcc attgcagccc atcgcgacct tgaaattatt caattcgacg 1440 tcaagacggc atttcttcat ggtcagctgg aagaggatgt tatggaaatt cccgaaggat 1500 taagtgattt ttataacatt agtgataaga atttagtatg taagcttaat aaatcgcttt 1560 atggcttaag caagcgtcaa ggtgttggca cactaccttt aaagcatttg tatctgaata 1620 cggttttgac tgttgtaacg cggagagtgc aatttttgtt ggacgtataa gtgatgttgt 1680 agtgtacata acattatttg tagatgatgg tctaatttta acaaaagatt gtaatgtaat 1740 tgacaaagtt gtaaagattt taaaaaatag gtttgaaatt atggtgtgtg agttagatat 1800 ttttgttggt atgagagttg tgcgtgatag agtgagaaaa accatattcc ttcatcaaac 1860 cgagtatgct tacaaagtcc taaagcgttt tgacatgttg gacgccaaaa cctcatgtac 1920 cccagttgaa aagggtgttg atctcgtttt catgaaggaa catgagtcaa atcatgaaaa 1980 attgccatat agggaactaa taggttcttt aatgtttctt tttaccgtgt cgcgtcccga 2040 tatcgcgtat gacgtgaacc ttatgagtag atatttagac aattttaata gaaatcattg 2100 ggaagcggcg aaaagaattt taagatatgt aaaaggcact ttaaattacg gaattttgta 2160 caaaagcagt gggagcatgt taaaattaga agcttattgc gactcggact acgcaggcga 2220 ccaggatgga aggaggtcaa cctctggttt catttttaaa tattgtaatg gtccagtctc 2280 gtggtgtgct cagagacagc gtaccgtgtc cttgagtagt acagaagcgg agtatatatc 2340 cgcatctagc gagacaagag aagccatgtg gctgagacag ttgcagtacg atgttggatc 2400 tccatgtgct ggcgcaacca cattatatat tgataaccaa agtaccatcc aattgatgaa 2460 aaatcctgta tttcacaggc gcacaaaaca tatagaggtc catcatcatt atgttcgaca 2520 aaaatacgag gcgggcgtta taaaagtaaa gtatataccg agtgaattgc aactcgctga 2580 cattttcacg aaagcgctaa cgcgagaaca tcatgagcga atgtatgagg gcataggact 2640 ttgtagtatc tttgtatgaa aatgcttaaa tagcgggag 2679 // ID BEL-213_AA-I repbase; DNA; INV; 6698 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-213_AA_; KW BEL-213_AA-LTR; BEL-213_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6698 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 883-883 (2011). XX DR [1] (Consensus) XX CC Positions [5644-6222] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 1297..6609 FT /product="BEL-213_AA-I_1p" FT /translation="MPPITRKGSTLRQLRTQLEEVTASFDDIAVFIQRFDE FT HTTASQVEVRLEKIDDLWEKFSETLVELKSHEDFEDEEKYYQKLRMNISEQ FT YYDGKSFLKDKAKSLHEPTDLEQSVREPSMLGVLDHVRLPQIKLQTFCGEI FT DDWLSFRDLFTSLIHWRTDLPEVEKFHYLKGCLQGEPKSLIDSSKITKANY FT QIAWDTLCKRYNNNKLLKKKQVQSLFKLPTLTKESVAELRTLVEGFDRIVQ FT TLDQIVEPADYKNLLLVNLLTMRLDPYTRRAWEECAASKDQDTLEDITEFL FT RKRIQVLESLPPRPADPKFHQQPPSSKPKPPAVKTSYNSVQSIPGRCVACK FT DSHLLYQCSAFQQLPVADRDSLLKSNGLCRNCFRFGHQARDCQSKFSCRIC FT RKRHHTLLCFRAEKDNATTVVSVAGGNHPSTSQDSHGSMGPNSTHVANMAA FT TDVSACNVAQKVSSTVLLATAVVVVEDDIGVRFNARALLDSGSESNFVSER FT LTQRMKVSRNKVDVSVLGIGQGTTKVKHSVQMVLRSRVSDFSREMAFLVLP FT KVTTNLPTTSINSEGWKIPDGIQLADPAFFQSKEVDIVLGIEAFFEFFETG FT RRIPIGNHLPTLTESVFGWVVCGGCSTSKVSQQINCNFSATTELERLMERF FT WSCEEVEFSNSYSPEEKLCEDIFQRGVQRGSDGRYTVPLPKNEDIIQNLGE FT SRNIAYRRLLGTERRLARDDKLREQYVAFMNEYLTLGHMRKVGSNEEETVK FT RCFLPHHPVVKEASTTTKVRVVFDASCRTSTGVSLNDALLVGPVVQEDLRS FT IILRSRTKQILLVSDVEKMFRQILTTPEDRHLQSILYRFSPNEEVTVFELN FT TVTYGTKPAPFLATRTLQQLAADEELNFPLASRAIREDVYMDDVITGTDDV FT DTAICLRNQLSAMMERGGFMLRKWSCNIPAVLRGLPAENLAIRDNTGVNLD FT PDQAVKTLGLTWMPMTDTLRFQFDIPPPEQDVPFTKRRILSLIATLFDPLG FT LIGATTVSAKIFMQQLWTLQDENGNRLEWDSPVPQMVGEAWRKFHEKLPLL FT NEVRVNRCVIIPNAVSVELHCFSDASERAYGGCIYVRSQDLEGTVLVRLLS FT SRSKVAPLQNQSIPRLELCGALLVSQLFEKVRNSTRLSVETHFWTDSTCVI FT RWIAAPPNTWTTFVANRVAKIQAITNGWNWRHVAGVDNPADLVSRGISAED FT IVHNELWWGGPHWLKQGQEAWPNGTTEDRIVGEEERRRTTIACTMSPVTEF FT NEFYLSKFGSFTELIRRTAYWLRLMKYLQNPKENRFEGVFLTTAELRTSQH FT VIIRNVQREAFAQERKALMDGQAVSSKSPLRWFNPYIDEEGLIRVGGRLKN FT SSELEGTKHPAVIPARHRLTKMIMKHFHETLLHAGPQLLLGVVRLQFWPLG FT GRNVARQIVHQCMRCYRAKPTTVQQFMGELPSSRVTISRPFTKTGVDYFGP FT VYIRLAPRRTAVKAYVAVFVCFCTKAVHLELVTDLSTECFLLALRRFVARR FT GKPSDMYSDNGTNFVGAKNKLTELLSLLKNREHHERITSELGQNAIQWHFN FT PPSAPHFGGLWEAAVRSSKNHLLKVIGESVVSAEDFSTLLVQVEACLNSRP FT MTPMSDDPDDLEPLTPAHFLIGSSLQALPDIEVEAFPSNRLNRFQLIQQKL FT QLFWSRWRREYLNLLQARTKRWKPAIVVEKGRLVVIKDDNVPPIRWKMGRI FT VAVHPGDDNVVRVVTLKTATGELKRPVEKICILPMPNVDELDESDANSSRN FT " XX SQ Sequence 6698 BP; 1845 A; 1577 C; 1586 G; 1687 T; 3 other; tttggtcctt cgaaccggat ttgcttgccc agtccggttg gaagtaggat tatcgccttg 60 ttgaagccac tgaagccatt tcgtcgccat tgaaaccgcg caagggcgcc atcgagaagg 120 gaacacaatt gctctattgg acttcgacat tgtgaccawt tgctttgttg gagacgcgcc 180 attccgaacc attcaatgaa ggatcctcgc caatattcgc catcacgata ggatccaata 240 ctcaagccac agtgaaggcg caccatattg agacgattgg agaatcagtg catgcaacga 300 ttacacccgg attacatcgc tcgataccag gactgcccag gaataccgaa ccggccggaa 360 tcactgcttt gtcaggtaat tatacataag ttcggtatat gtctggccga agcaaccttg 420 catgacacta acgccatcat ttcgacacac ttcatttgca tgcgctgagg aaaccggcaa 480 tccgctggtg ttgatcattc atccgaggag ctggaaggtt atcatcgagc atctatttca 540 accgcttgaa tttctgtgga cgaaaaggaa cgacgaacga ttgcggcgga tagcagtaga 600 agagggagcg tcgaaatact ccctcgtagg tgagtgcact ttattgtata tgtaccgccg 660 aagcaaggcg tgcagaaata cactctggta atttgaattc ctgtatcctc ttcatttatt 720 actgcgcaat ccctgcaaga gaactgctcg ttagagcaag tctgctggaa ttttcccaaa 780 ttgcatcctc ctcttttgga cacggttggt ggagttgtta gtcctccagg ttagttttac 840 ggtatatgtc ctaccgaagc aagacctacg gatattacac acttcaattt cctcttcaat 900 tgctactgta cattacccac atttggagcc caagtttgtg acgatcaatt attggaaccc 960 atcgctgatt ggcatttcgt gttttggaat tagtaccgct gttggtacct ccaggtaggt 1020 tgccgcagta tatgccaagc cgaagccaag gttttgtgga tattacaccc caacttctcc 1080 tcttctctcg caaaatctga aactgagccg ccgctgatga tagtttgatg tgacgtcatt 1140 gtgctttcga gacaattcgc tgaattttgg aacattttta gaacagccac ctgttctgtc 1200 aggtcagtga aataggacag tatatgtccg ccgaagcaaa actacacgcg gcatactaca 1260 tctctgtttt tccctaccgc caatcgacat cacatcatgc cacccattac cagaaaagga 1320 tccaccctga ggcagttgag gactcagctg gaggaagtta cggcttcctt tgacgacatc 1380 gcggttttta tccagaggtt cgacgagcat actaccgctt ctcaagtcga agttcgtttg 1440 gagaaaattg atgatttgtg ggagaaattc agtgagacat tagttgaact gaaatctcat 1500 gaggattttg aagatgagga gaagtattac cagaaattac gaatgaatat cagcgaacag 1560 tattatgatg ggaagtcctt tcttaaggac aaggccaaat cactccatga gccaacggat 1620 ttggagcagt cagtccgtga accttcaatg ttgggtgttc tcgaccatgt tcggttaccg 1680 caaatcaaac tgcaaacgtt ttgtggtgaa atcgatgact ggttgagctt ccgggacttg 1740 ttcacctctt tgattcattg gagaaccgac ttgccagagg tggagaagtt ccattatcta 1800 aagggatgtc ttcaaggcga gccgaaaagt ctgatcgact cgtcgaagat cacgaaggcm 1860 aattaccaaa tcgcttggga cactctgtgc aagcggtata acaacaataa attgctgaag 1920 aagaagcagg ttcagtcctt attcaagctt ccgaccctaa ccaaggaatc cgtggccgaa 1980 ttacgaactt tagtggaagg tttcgatcga atcgtccaaa cgctcgatca aatcgtggaa 2040 ccagcggatt ataaaaacct tttgctggta aatctcctta cwatgcgact agatccatac 2100 acccgtcgtg cttgggaaga atgtgcagct tccaaggacc aggatactct ggaggatatt 2160 accgaattcc ttcgaaagcg aattcaagtt ctggaatccc ttccgccaag gccggcggac 2220 ccaaagtttc accaacaacc accctcgtca aaacccaaac cgccagcagt gaaaaccagc 2280 tataattcgg tccaatcaat cccaggaaga tgcgtggctt gcaaggatag ccaccttcta 2340 taccaatgtt ccgcattcca acaattaccc gtggcggaca gagattcact gctgaaatca 2400 aatggacttt gccgaaattg ctttcgcttt ggacatcagg caagagattg tcaatccaag 2460 ttctcttgtc ggatttgcag aaagcgtcac cacactctgt tatgcttcag agcggagaag 2520 gacaatgcta caacggtagt atcggttgct gggggcaacc acccttcaac ctcccaagat 2580 tcccatggct ctatgggtcc caattctacc catgtggcca atatggcggc cacggacgtt 2640 tcggcatgca atgttgctca aaaggtttca tcaacggttt tgctagcaac agcggtagtc 2700 gttgtcgagg atgacatcgg cgtcaggttc aatgctcgag ccctattgga ctccgggtcg 2760 gagagcaact ttgtgtcgga acgattaacc caacggatga aagtctccag gaataaggtc 2820 gacgtttcgg ttctcggcat agggcaggga accaccaagg tgaaacattc ggtgcaaatg 2880 gttctacggt cgcgagtttc ggatttctcg cgagaaatgg cttttctggt attacccaag 2940 gtaactacca atctacccac aacctcaatc aattcggaag ggtggaaaat accagacgga 3000 atccaattgg cggacccagc ctttttccaa tccaaggaag tcgacattgt tcttggcatc 3060 gaggctttct ttgagttttt cgagacagga agaagaattc caatcggaaa tcacctacca 3120 accctcacgg aatcggtatt cggttgggta gtctgcggtg gctgctcaac ttcgaaggta 3180 tcccaacaaa tcaactgtaa tttctcagca accacggaac tggaaaggtt gatggaaagg 3240 ttctggtcat gcgaagaggt ggaattctca aattcctatt ccccggaaga gaaattgtgt 3300 gaagatatat tccaacgtgg agttcagcga ggttcagatg gtcggtacac tgttccatta 3360 cccaaaaatg aagatattat ccaaaatctg ggcgaatcac ggaatatcgc ctatcgacgt 3420 cttctcggga ccgagagaag attggcaagg gacgataaac tacgggaaca gtacgtcgca 3480 ttcatgaacg agtacctaac attgggacac atgcggaagg tcggaagcaa tgaagaggaa 3540 accgtgaaac ggtgttttct accccatcac ccggtggtta aggaagctag taccaccact 3600 aaggtcagag tagtcttcga cgcttcctgc agaacatcaa ctggagtatc cctcaacgat 3660 gcactactgg tagggccagt tgtgcaggaa gatttgcggt ccatcattct ccgaagtcgg 3720 acaaaacaga ttctccttgt atcagacgtg gagaaaatgt ttcgacagat cctaactaca 3780 ccagaggacc gacacttgca gtcaatacta tatcgttttt caccgaacga ggaagttacg 3840 gtgtttgagt tgaataccgt gacttacgga accaagcccg caccctttct ggcaacccgg 3900 actcttcaac agttggcagc tgacgaagag ctcaattttc ctcttgcgtc aagagcaatt 3960 cgtgaagacg tctatatgga tgacgttatc accggcacgg acgacgttga tactgccatt 4020 tgtctaagaa accaactttc ggcaatgatg gaacgtggtg gctttatgtt aagaaagtgg 4080 tcctgtaaca ttccagcagt tttgcgtgga ttacccgctg aaaacctagc gattcgggac 4140 aataccggag tcaacttgga tcccgatcaa gccgtcaaaa ccttagggct gacttggatg 4200 cccatgaccg acaccctccg atttcagttt gacattcctc caccagaaca ggatgttcca 4260 ttcactaaac gtcgtatttt atcactcatc gctacattgt tcgatcccct tggattaatt 4320 ggagccacaa ctgtgtctgc caagattttc atgcagcaac tctggacgct gcaagacgaa 4380 aacggcaacc gattggagtg ggatagcccg gtacctcaaa tggtgggtga ggcttggcgg 4440 aaattccacg agaaacttcc acttttgaat gaagtccgcg tcaatcgttg tgttatcatt 4500 cccaatgctg tttcggtgga gttacactgc ttctcggatg catcagaaag ggcttatggt 4560 ggctgcatct acgtgagaag tcaggatttg gaaggaacag tcttggtaag actattgtcg 4620 tccagatcca aggtagcccc tctccaaaat cagtcaatcc cacgacttga gctttgcggt 4680 gccctcctgg tgtcacaact atttgaaaag gtccggaatt ctacgagatt gtcagttgaa 4740 acgcactttt ggaccgattc aacttgtgta atacgatgga tagccgcacc tcccaatact 4800 tggactacat ttgtggccaa tagagtggca aagatccaag caataacaaa tggatggaat 4860 tggagacacg tagctggagt agacaatcca gcggaccttg tatctcgggg aatatctgca 4920 gaggatattg tccataatga actctggtgg ggaggcccac attggctgaa gcagggacaa 4980 gaagcgtggc ccaacggaac cacagaagat cgcatagtag gagaggaaga gaggcgtcgt 5040 acgacgattg cctgtacgat gtcacccgtt accgaattca acgagttcta cctcagtaaa 5100 tttggatcgt ttacggagct tatccgaaga acagcatact ggctgcgact catgaaatat 5160 cttcagaatc cgaaggaaaa tcgttttgaa ggcgttttcc tgactacagc agagttaagg 5220 acatctcagc acgtaataat ccggaacgtc caacgagaag ctttcgccca ggagagaaaa 5280 gcgttaatgg atggacaagc agtttcgagc aagtcaccat tgagatggtt caacccgtac 5340 atcgatgagg aagggcttat cagagtgggc ggaagattga agaattcatc cgaacttgaa 5400 ggcaccaaac atcctgcggt aatacccgct cgtcatcgtt taaccaaaat gattatgaag 5460 catttccacg agacgcttct acatgctggc ccacaactac tattgggagt agtccgacta 5520 cagttctggc ctcttggtgg cagaaatgtt gctaggcaaa tagtacatca atgcatgagg 5580 tgctatcgag caaaaccaac gacagtccag cagtttatgg gtgaactacc gtcctcacga 5640 gtaaccattt cccgaccatt cactaaaaca ggagtagatt actttggacc tgtctacatc 5700 cgactggctc cgaggcgcac cgccgtgaag gcttatgtag cggtttttgt ttgtttctgt 5760 accaaggcgg tacacttgga gcttgtcacc gacctttcga ctgagtgttt cctgctagcc 5820 ctacgacgat ttgtggcccg ccgtggaaag ccgtccgata tgtattccga taatggcacg 5880 aatttcgtcg gcgctaaaaa caaattaact gaacttctca gcttgctcaa aaatcgggaa 5940 caccatgaaa gaatcacatc tgaacttgga cagaacgcta ttcaatggca tttcaatcct 6000 cccagtgcac cacacttcgg aggactctgg gaggcagcag tacgatcgtc caagaatcat 6060 ttattgaagg tgattggaga gtcagtagtt tcagcggagg atttctctac acttcttgtg 6120 caggtcgaag cttgtctgaa ttcccgccct atgacaccaa tgtccgatga tccagacgac 6180 ttggaaccat taactccagc gcatttccta attggctcat cattacaggc tctgccagat 6240 attgaagtag aagctttccc atcaaatcga ttaaatcggt tccagttgat tcaacagaag 6300 ttacagctat tttggagtcg ttggcgccga gaatacctca atctcctgca ggcgagaaca 6360 aaacgctgga agcctgcgat agtcgtagaa aagggtagac ttgttgtgat caaagatgac 6420 aacgttcctc ccatcagatg gaagatgggc agaatagttg ctgtacaccc tggtgacgac 6480 aatgttgtaa gagtcgtgac cttaaaaaca gccacaggtg aactgaaacg tccagtagag 6540 aagatatgta ttttgccaat gccaaatgtt gatgaactag atgaatctga tgcaaattcg 6600 tcacgaaact agctccaaat cctttccaac tcccatcccg tagaagaggt tttcttgttt 6660 tctttcagaa attcagcaaa tttctgggtg ggtgagaa 6698 // ID BEL-14_DWil-LTR repbase; DNA; INV; 493 BP. XX AC scaffold_181096; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_DWil_; KW BEL-14_DWil-I; BEL-14_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-493 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181096; Positions 411096 411588. XX SQ Sequence 493 BP; 163 A; 85 C; 82 G; 163 T; 0 other; tgttggtgtt tagattatat aaaataagct aacattaaga caattataat atagaacaaa 60 tggaaatgta tttcaattaa gtatgtaacc cgattataga tatcgatggc tcatcagtca 120 taagtagatc tgatatggtc agcagtaaac aataaagtta attaagctgc tatggttagc 180 tattatcaat aaagacgtac tttgctgacc taacactatg tcgaactatg cactggtgag 240 ctcgctattt tgtactataa aagaaatgca ttattcttag taagatcaga taccaattgt 300 agctacttcg ggtgtgcatc gctgtgtctt tggcccccat ctctaccttt gtaatctcac 360 tccgaggtcc acccaattta tgtgttggtt gcgacaacct ttagtcactg gtttatgtgc 420 tagtgcctat atatatatat aaataaataa acatttaata acttcaagaa acactaacgt 480 ctttcataga aca 493 // ID BEL-13_DPu-I repbase; DNA; INV; 8378 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-13_DP_; KW BEL-13_DPu-LTR; BEL-13_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-8378 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [7426-7989] - Integrase core CC 'GGTTG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1538..3625 FT /product="BEL-13_DPu-I_2p" FT /translation="MDTSANMATAQLPVITEESASVAEQEQSTVVDPKTMD FT RSQLVRQRGTIKSRHTRLLTTLHSGMTFNIEGQFKDEARKLRTTLVQEYEN FT VELCHNLVLAACATQEEFNKEFCWFETFTEAHRKMLADINAYLGEGLATSL FT KPAGSIRSRSSRASSRNSAQSKEDKIQEKARKVAETELRLKQAQEEARLRE FT IEEEKIWRLEEDKRKEELKVEAERTQRQLRNELEGHHLESQILIQQLEVER FT EGARPVTPIEAQENFSSKSTSVATPVFSVQQRPESSVPAKGALAKIKGFFS FT ALTPARTPSSHGPAEEGRPPRKPRRSLDANWIQAPPGWNGSSASLAAPLAS FT SEKKKTSWPQPSTPEIITVPPSPTAPAPPPPRCEASDQNQFLRPPFSGSNE FT NWERTIEQDDGRYAETLGNKAVSQSVSATFLNPLAGAFEPRAGFFSLPGSS FT MSVVQPRSFGDRPPGGGFAPTQTSSTSAGPPTTCVAAVSFPASQPSRSTAL FT PPSSYFYSLNLPLSRDSSRFSAAPITTYTTPFGRTTMAQAGNPLITRQPSH FT LDPSVPPTSLVARHEPTEPPTSVAGGFSEGLREEVSSPDGWIYDIERHRRT FT AMPGFRSLSRLPKIELPKFAGDPLKWPMFIQTFGAQVDRCCCDDSERQAFL FT RNCLSTDIQMQLGECLLHPGLYQRCLQELHRKFGNPRVIATACSRDF" FT CDS 3777..5954 FT /product="BEL-13_DPu-I_3p" FT /translation="MLRSRWAEYSYTILERLPDLTDLDNWLDNCAMAEYCV FT RAGSDRASIPSAHLKDRQEQKGGGKVLKPKVFTNSAPGRAPAEKKCAQCDK FT PHPTRDCRRFREGSLEERAQLVKEKKLCFKCLESGHALKDCTVEKKCGIDG FT CKSGHHRLMHGAPRVFQSAMKKNDSDGKTIKFEDSASSKPPFNGMNVSSVR FT QKLTLLPIVPVVIRSPSNKTYNTFALLDCASEVTLLCEDVRKCLNLSGPTE FT LVEIGSWHAQDPKFESYRVSFTVESTDGQSEFKISDAYSVPRLNLSKRPIQ FT LDKVIEKWPHLAVVPLSCVSPGEVKLLIGMDHSDPHDVLEYRKDPFRQNAP FT KAILTAFGWAIQGNIGPAVNENQGLRCCAVSMKAPQDDLLGVVNQFLLHES FT YGPSPEAKPAVGKDVLGSRKLLQETTRYDAEHGRYETGVLWARDDVVLPNN FT YQGTLERFKKTQRRLDSNANLASVVHKEMEQNFNLGFAERLSEEQLKALPK FT GRVWFLPWHPVAHPHKQGKWRVVFDAAAEFHGMSLNDRLLKGEVPHVNLIG FT ALLRLREFAVAICGDITKMFHQVRFRKEDAFIYLFLYAPKGESVPVACRMK FT VHIFGSVCSPAVCAYVLHRAAEDAYAEDAPFAVQEVEIQFYVDNWITSFHT FT EEEAITGAAKLTRALAKGGLGLCKSFCAGGTARPMHFGSKCGPGWITYRKN FT VGNVAGLCSRLLCVESDGKCEPKHAS" FT CDS 5884..8376 FT /product="BEL-13_DPu-I_1p" FT /translation="MSLDFAADCFVLKATGNVSRSTHREILRATATNFDPL FT GCLAPVLITAKMILQKVCKAKLGWDDLLDQELSEEWQQWAGSLTAVNDLRI FT PRCYCPFPHHEDATDLIMFSDASERAFGAVGYLRFELPDGKVKVAFVLAKA FT KVAPVKYVSMPRLELCGCLMSARMASVLLKEIRIKIRRVVLFTDSTTNLRW FT FNSGSCRFTPYVANRVGEVLESFDATHWHYVPTLQNPADDLSRGVPASELT FT ADHRFFTGPEFLLDPPTMWPAFPDLQAPLGSAPDPEAKEAPPAFAGTTKVS FT ACGIETLLSSLIPWSRVKRTVALVLRWPAKFKERKRRGEGKPEPEVIAVLP FT PPRIKKFTIKKYVAVAREVEMEEPSPEELATVVSRLVGQAQRTAYALELKA FT LNRKVALDAASPLAKVSPYLDQGLLRVGGRIVNAPVPFDVRHPIVLPTKAT FT VTERIVRAIHIKGKGHTSSWRTLSELQREYWIPAPRRLIRRVVDRCTLCRR FT FAANAGSPLMAALPYARLQVFQPPFACTGIDYFGPIEVTLFRRVVKRWGCL FT FTCLTTRAIHLEMAYALDADAFICAYENFRVARGTPKIIYADNGTNFVGGK FT NELAEALERLNQSQIYNHLALEGVEWRFNPPAAPHFGGSWERLVGSAKRAL FT ERVLHLQSFTDQTLTSALKQVEHLINSRPLTYVSVDPSAPEPLTPYHLLLG FT RANPSIPPDVFSTADLSCRKRWRIAHAIADQFWRRWMAEYMPDLIERRKWL FT KRERNLRIGDIVLVIDEKTPRGLWPLGLVTEVFTGVDGVVRSASVRHNGTE FT LHRPAVKLCLLEPEPEKEEDASAAGRRAG" XX SQ Sequence 8378 BP; 1936 A; 2240 C; 2271 G; 1931 T; 0 other; tttggtgcat cgaccgggaa cggtttctcg tttgtttaag cgagacgccc accccctttt 60 ccgacatttc gcgtggtggc cacgctttcg tcacgtgtcg ttcgaagttg tatccgtctg 120 cagccgtgcc atccgccatc cctggttgct gcaatcgtgc agcagtccca gcggtcgtgt 180 tttggagccc cggaaggaag aaaattcgga agattgtttc gtgacaaggc acggccggtg 240 ttcatcgttg cttttgctgt ttatcgcccg ctagttatcg tcccagctgt ttcctgctag 300 gtcgttattc ccgaagttct ctggcgtttc agccctttgc atctcggtat attctttaat 360 ttttcgccag tccgccattt gttggctgcg tctgcccgcc agtccgccat ttgttggctg 420 cgtctgcccg ccagtccacc tttgtcattt tggttttgtg tctgccgtca gttccgaaat 480 tgtgttggct gcgtctgccc gccagtccgc cattttgttt ggctgcgtct gccagccagt 540 ccgccattgt tcatttttgt gtgtaaagtc ccgcgtggca gtcatcgttt aagttttctc 600 agcgtgccga ttgtgttgca tcctgtcgtc agccgccggc tattttccat tgtgtcgtca 660 actggccatc agcccatcgc cgtatcgaga aacatctacg atcatctagt tttctcatca 720 ccggtttgtc gccgcgtcga acacgtgaat tccgccgccc gtccaccact gctggagctg 780 tttttcggcc gtaaaccgtt ctccgcccac cactgctgac tacgggctac gtcatccatc 840 tgtatttcgc cacgtcgctt actgtttgct ggtcattatt tgaatcgtcg ttgtccatca 900 gtgctgcatc gtttgccttt attcatcgtc ccggcttagg aagggcgttt cgcctttcca 960 gcttcaaact gccgcgagga gcgtcgaaaa gggccgagca gaggctcgct ctaaaggaga 1020 agcgaaagat tttcgtcgtg ccccgtacac caagctaaac gatcagcaca gtggtagacc 1080 accgccacct cgcgatcaaa cagtttcgtt cccgttggaa gcagtcaacc gtccgtgccc 1140 agctaggcgc gagaaagaag cgaaacggga agcggtgtat gtaccagaag acgaaccacg 1200 tgccagctgc agccaaccca tcgtcagctc ccgccaagtg gtcgcctttc ctccccgagc 1260 cttaaatcag ccgtttgtgc aattaagaga cgctttcgta aagccccgcc aagttgaacg 1320 agtggttaaa attgtagcca cagagggcgc caccactgcg ttttcgtcga ttttacccgt 1380 ccatcaaaat caggagccac agcagcgctc cgttaaagtt tccagtcgtc ccgcgttcct 1440 gtagttccgt gtttattctt ggtttcaagt gtctcttccg tttttcgaat aaaggcagcc 1500 ggtcgtttga attccctagc ccgcagctta gtgcagcatg gacacctccg ccaacatggc 1560 caccgcccag ctcccagtca tcactgaaga atcggctagc gtagccgaac aggagcagtc 1620 gacagtcgtg gatcctaaaa cgatggatag gagccagctc gtccgtcaaa gaggaaccat 1680 caagtctcgg cacacccgcc tgctgactac tctccacagc ggtatgactt ttaatataga 1740 agggcagttt aaagacgagg cgcgaaaact ccggacgacc ttggtgcagg agtacgagaa 1800 cgtggagctg tgtcacaacc tggtgctggc agcgtgtgca acacaggaag aatttaataa 1860 ggagttttgc tggtttgaga cgttcacgga agctcaccga aaaatgctcg ccgatatcaa 1920 cgcttacctt ggcgaaggtt tagccacgag tcttaagcca gcgggttcga tccggtcgcg 1980 gtcgtcaaga gcgtcgtcaa gaaattcagc acagtcgaaa gaagacaaaa ttcaagaaaa 2040 agctcggaag gttgcggaaa cggagttgcg gttgaagcag gctcaagaag aagcgcggct 2100 gagggaaatt gaagaagaaa aaatctggcg cttagaagaa gacaagcgaa aggaagagct 2160 aaaggttgaa gcggagcgca cccagcgtca gttaaggaac gagcttgaag gtcaccattt 2220 agaaagccag atcctgatcc agcagcttga ggtggagcgg gaaggtgcta ggccagtgac 2280 acccatcgag gcgcaggaaa atttttcatc caaatcaact tcagtagcca ctccagtttt 2340 ctcggttcaa cagcggcccg agagttcagt tcccgccaag ggtgcgctgg caaaaatcaa 2400 gggttttttt tcggcactca cacccgcccg gacgccgtct tcgcacgggc cggcagagga 2460 aggccgtccc cccagaaagc caagaaggag cctggacgcc aactggatcc aggccccacc 2520 tggttggaat ggttcgtccg cgtcgttagc agctccgtta gcgtcgtctg aaaagaagaa 2580 aacgtcgtgg ccgcagcctt caactccgga aatcatcacg gtgccaccca gtccaactgc 2640 tcctgcgcca ccaccgccaa gatgtgaggc aagtgatcaa aaccagtttc ttcgtccgcc 2700 gtttagtggt tcaaatgaga actgggagag gaccatagag caggatgatg gccggtatgc 2760 tgaaacattg ggcaacaaag cagtgtcgca aagtgtatcg gctacatttt taaatccgtt 2820 ggccggtgcg ttcgagccac gggccggctt tttctcactc cccggaagct caatgtccgt 2880 cgttcaacct cggtcatttg gcgaccgccc gccaggagga gggttcgcgc ctacacaaac 2940 cagctccacc agcgccggac caccaactac gtgtgtagcg gcagtcagtt ttccagcgtc 3000 acagcccagc cggtctacgg ctctgccgcc cagctcatat ttttattcgt taaatcttcc 3060 attgtctcgg gattcgagtc gtttttcagc agcgcctatc acaacgtata cgacgccatt 3120 tggaaggacg acaatggcgc aagccggaaa tccgctcata acaagacagc cgtctcatct 3180 ggacccttcg gtgccaccta cgtcgttagt tgctagacac gagccgacgg agccgccaac 3240 gtcagttgca ggcggattca gtgaggggct acgtgaagaa gtgtccagcc cagacggatg 3300 gatctatgac atcgagcggc acagaagaac ggccatgccc gggtttcggt ccctctcacg 3360 cctgccgaaa attgagctac cgaaattcgc tggagaccca ttaaagtggc cgatgtttat 3420 tcaaacgttc ggcgctcaag tggaccggtg ttgttgtgac gattccgaga ggcaagcgtt 3480 cctgcgaaat tgcctttcga ccgacatcca gatgcagctg ggagagtgcc tactacaccc 3540 gggactttat caacggtgcc tgcaggagct tcatagaaag tttggaaacc cgcgggtgat 3600 tgctacggcg tgttcaagag acttttgaac tctcgtcttt taacgacggc gattttaagg 3660 ccctgcagaa attctccgcg gctttacgtt cgtgcgtcgc gactctccgc tttggaggat 3720 atggtgttga gttacagagt catgcgactc tcgcacacgt gaacaagctg ccacatatgc 3780 tccgtagtcg ttgggctgaa tacagctaca ctatactgga acgccttcct gacctaaccg 3840 acctggacaa ctggctggac aattgtgcaa tggcggaata ctgtgtgcgg gctggcagcg 3900 atcgggcttc tattcctagc gctcatctta aggaccgcca ggaacagaag ggggggggga 3960 aggtgttaaa gccaaaggtg ttcacaaatt cagcaccggg aagagctcca gcggagaaga 4020 aatgcgccca gtgtgacaag ccgcacccta ctagagattg cagacgtttc cgcgaaggct 4080 cattagaaga aagagcgcag ttggttaagg agaagaaact gtgttttaag tgtctggagt 4140 cagggcacgc cctaaaagat tgcacagtgg agaagaagtg tggaattgat gggtgtaaga 4200 gcggacacca tcgtttaatg cacggtgccc ctcgcgtttt ccaatccgca atgaagaaaa 4260 acgattccga tggaaagacg attaaatttg aagattcagc gtccagcaag ccgccattta 4320 atggaatgaa tgtctcaagc gttcgtcaaa aattgactct gcttcccatc gtgcccgtag 4380 tcatccgctc accaagcaac aaaacttaca acactttcgc cctgttagat tgcgccagtg 4440 aagtaacgct cttgtgtgag gacgttagaa agtgtctaaa tctgtcggga ccaacggagc 4500 tggtggaaat cggctcctgg catgctcaag atccgaagtt tgagtcgtac agagtttcgt 4560 tcactgtgga atcaacggac ggacaaagtg agttcaaaat ctctgatgcg tattccgtgc 4620 cgcgtcttaa cctcagcaag aggccaatcc agctggacaa agtgatagag aagtggccgc 4680 atttggccgt cgtgccgttg tcttgtgtta gtcccggaga agtcaagctc cttatcggta 4740 tggaccatag tgacccgcat gacgtgctgg aatacagaaa ggatccgttt cgtcaaaatg 4800 caccgaaggc gattctcacc gcgtttggct gggccattca aggtaacatt ggcccagcgg 4860 tgaacgaaaa ccaaggcctg aggtgttgtg ccgtgtcgat gaaagcgccc caagacgatc 4920 tgctcggcgt agtgaatcag tttctactgc atgagtcgta tggccccagt ccggaagcaa 4980 aaccagccgt cgggaaggat gtcttaggct cgcggaagct gcttcaagaa accacccggt 5040 acgatgctga gcacgggcgt tacgagacag gagtgctgtg ggccagagat gacgtcgtgt 5100 tgcccaataa ttatcagggc actcttgagc gtttcaaaaa gacccaaagg agacttgaca 5160 gcaacgcgaa tttggccagc gtcgtgcaca aagaaatgga gcagaacttc aacttaggct 5220 ttgcggaaag actatcggaa gagcagctga aggcgttgcc gaaagggcga gtttggtttc 5280 ttccgtggca tccagtggct catcctcata agcaaggaaa atggagagtg gtgtttgacg 5340 cagcggccga gttccatgga atgtcactca acgaccgatt gctgaaagga gaggtgcccc 5400 atgtcaatct catcggtgcg ctgcttcgtc tccgtgagtt tgcggttgcc atttgcggtg 5460 atatcacgaa gatgttccat caagtgcggt tccgtaaaga ggatgccttc atctacctct 5520 ttttatacgc gcccaaaggt gaatcggtac cagtggcgtg tcgcatgaaa gtgcatattt 5580 ttgggtctgt ttgttcaccc gccgtttgtg cttatgttct gcaccgtgcg gctgaagacg 5640 cctacgccga agatgctccg tttgctgtcc aagaagtgga aatccaattc tacgtagaca 5700 actggatcac ctcattccat acggaagaag aggccatcac gggagctgca aaattgacga 5760 gagcgttggc gaaaggaggt ttggggctct gcaagtcgtt ctgtgctggc ggcactgccc 5820 ggccaatgca tttcggctct aaatgtggac ctggatggat tacctacaga aagaacgttg 5880 ggaatgtcgc tggactttgc agccgactgc tttgtgttga aagcgacggg aaatgtgagc 5940 cgaagcacgc atcgtgaaat tttacgcgcc acggccacta atttcgaccc tttgggctgt 6000 cttgccccgg ttttgatcac cgccaagatg atcctgcaga aggtctgcaa agcaaagctc 6060 ggctgggacg atctgctgga ccaggagctg tccgaagaat ggcaacaatg ggccggctcc 6120 ctcacagctg tgaacgatct acgtattcct cgatgttatt gcccatttcc tcatcatgaa 6180 gacgctacgg acctgattat gttttccgat gcatcagaaa gagcatttgg cgccgttggc 6240 tatctacgtt ttgaactgcc ggacggaaag gtgaaggtcg catttgtatt ggccaaggcg 6300 aaagtggcgc ccgtgaaata tgtgtcgatg ccgcgactgg agctgtgtgg ctgcttgatg 6360 tcggccagga tggcctcagt ccttctgaaa gagattcgca tcaaaatccg tcgcgtcgtt 6420 ttatttaccg actccacaac aaatcttcgc tggtttaatt caggaagctg ccgtttcacg 6480 ccgtacgtcg ccaatcgagt cggcgaagtg ttggaatcgt tcgacgccac ccactggcac 6540 tatgtgccta cgcttcagaa tccggcggat gatctaagtc gaggagttcc agcatccgaa 6600 ttaacagccg atcatcgttt cttcaccgga ccggaatttt tattggatcc gccgactatg 6660 tggcccgctt ttcctgattt acaagcccca cttggctccg ccccggatcc agaagcgaag 6720 gaggcgccgc ctgctttcgc cggcaccacc aaagtatccg cgtgtggtat cgaaacgctt 6780 ttgtcgtcgc ttataccatg gagccgagtg aaaaggacgg tggcgctagt tcttcgttgg 6840 ccagctaaat tcaaggagcg taagcggcgc ggtgaaggca agccagagcc agaagttatc 6900 gctgtcttac cacccccccg cataaagaaa ttcaccataa aaaagtacgt cgccgtggcc 6960 agagaagttg aaatggagga gccgtcgcca gaagagctgg ccacggtggt tagccgtctc 7020 gtcggtcagg cgcagagaac ggcttacgcc ctggagctaa aggccctaaa ccgtaaggtt 7080 gctttagatg ctgcttctcc gcttgccaag gtcagccctt atttagacca ggggctgttg 7140 cgcgtcggag gacgaattgt aaatgctcca gtcccatttg acgtccgcca cccgatagtg 7200 ctgccaacga aggccaccgt caccgaaaga attgttcgag ccattcacat caagggtaaa 7260 ggacacactt catcatggag gacgctgtcg gagctgcagc gtgaatattg gattcctgca 7320 ccacgccgcc ttatccggcg cgtagttgac cgctgcaccc tctgtcggcg tttcgcagca 7380 aatgccggat cgccgcttat ggccgcgctt ccgtatgccc gccttcaagt ttttcagccg 7440 ccatttgcct gcaccggtat cgactacttc ggaccaattg aggtgacact cttccgacgc 7500 gtcgtcaaga gatggggatg tttattcacc tgcctcacga cacgcgctat tcatctggag 7560 atggcctacg cgttagacgc tgacgccttt atttgcgcct acgaaaactt ccgagtggcg 7620 cgaggcaccc caaagatcat ctatgctgac aacgggacga actttgtagg cggcaaaaac 7680 gagttagcag aggcgctgga gcgcctaaat cagtctcaaa tttacaatca tttagcactg 7740 gaaggtgtag agtggcgttt taatccgcca gccgctccgc attttggtgg aagctgggag 7800 cgacttgtcg gatcggccaa gcgggcgttg gagcgtgtgc tgcacctaca atcattcacc 7860 gaccagacgt taacgtcagc actgaagcag gtggagcacc ttatcaacag ccgccctctc 7920 acttacgtca gcgtcgaccc ttccgccccg gagccgctga caccgtatca tctgttactc 7980 ggtcgcgcaa atccaagcat tccgcccgac gtcttttcta cagcggatct ctcgtgccgg 8040 aagcgttgga gaatcgccca tgccatcgct gaccaatttt ggagacgctg gatggccgag 8100 tacatgccgg acttgataga aaggcgcaag tggctgaaaa gggagcgcaa cttgcgcata 8160 ggcgacatcg tcctggtcat tgacgaaaag actccgcgag gcctttggcc gttgggtttg 8220 gtgacagaag tattcacggg agtcgatgga gtggtccgct ccgccagcgt gcgtcacaac 8280 ggaaccgagc tacaccgccc agccgtcaaa ctttgcctcc tggaacccga acccgaaaaa 8340 gaagaagatg cgtccgctgc cggacgcagg gccggcga 8378 // ID CR1-50_BF repbase; DNA; INV; 1536 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-50_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-50_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1536 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1536 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1621-1621 (2009). XX DR [2] (Consensus) XX SQ Sequence 1536 BP; 482 A; 326 C; 362 G; 366 T; 0 other; atgacttttt cgcaagtgta ttcacggaag aggacctgtc aaacattcct accataaaca 60 acctacatga cacccagtta ttggaaggtt tagagatcac tgaggaagta gtcctcaaga 120 aattggccga gcttaaccca gcaaaatctg caggctccga taacttgcat ccacggtttc 180 tcaaggaact ggctcatgac cttgctacgc ctctcacaat tctttttaag aaatccattg 240 catcttctga gttacctgaa gggtggaaac aggcgcatgt tacacccatt cacaagaagg 300 gcagtaaatc atctccgggg aactacagac ctgtaagtct cacatccgta gtaggaaaaa 360 tgctagaatc catcattagg gacaaagttg tagaacacat gtcttctcac aactatttca 420 ctgacgccca acacgggttt gttcctggac gttcatgtat gactcaattg ttagtgacca 480 tggaacagtg gacaaaacta ctacaggcag gagatccagt agatgtcatc tatctggatt 540 ttcgtaaagc atttgataca gtacctcata cacggttact acagaaactg gagagatatg 600 gggtggtggg tgatctacca gaggcaatat ctagtgaagt aaaaatcttc gctgatgaca 660 gcaagatttt tcgaccagtc aaacacaaaa gagatcagga ggcattgcaa caggatctag 720 tggcggtaga tcagtggtca cagcactggc agctacgttt caacgtgagc aagtgcaagg 780 ttctacacct ggggaggaca aatcagaaaa tcacatacac tcttggtgga cagaatatag 840 aggaaaccgt ggaagagaag gacttaggtg tatctataga caacaatctc gcattccaca 900 tgcacactgc aaaagcagcg aacaagggta acatgatgct gggactcata aatagagcct 960 tttacaacat tgatgaacaa acaatcccga ttcttttcaa aacgatggtg agaccccatt 1020 tagaatacgg taatattatt tgggggccac acttctcctt ggacaaacag aggctagaga 1080 gagttcagcg cagagcaacc aaaatggtgc ctgctctcaa ggacattcct tacaaggaca 1140 ggttaaaaaa gctgaggcta ccatccctgg agtaccggcg taggagagga gatatgattc 1200 aagtttttaa gattatggct gggaaagaga gactacagtc agatctgttt ttcgtggaag 1260 ctaagggttc ttcaactaga ggccatagtt acaagttgaa gatcccactt gcaaaaacta 1320 gaatcagggg gcatgtcttc agttcgaggg tcgtaaagga ctggaactct ttaccagaaa 1380 ggatagtcat gagcgagtca gtgaaccagt ttaaatccaa cttggatcga cactgggaac 1440 acctgtggta cgtccatgag ggatgcgtct agagcagtcc actacaggca gaggcctttc 1500 ttgactgaag atggtatcca ggtatccagg tatcca 1536 // ID BEL-33_AA-LTR repbase; DNA; INV; 552 BP. XX AC supercont1.344; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-33_AA_; KW BEL-33_AA-I; BEL-33_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-552 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.344; Positions 1155649 1156200. XX SQ Sequence 552 BP; 205 A; 86 C; 107 G; 154 T; 0 other; tgtaagcacc acccactgtg aagtatatat acctgttaaa gagtaacctt gcgcctattg 60 tcaaaattag ggagaaggaa agagatgatg atgaaaggta aataaaaaca aagaaaaata 120 cctcgttgag caattataaa agttagctgc aagttattat ataaaatttc ttcaatttaa 180 tgcatttgga aagtgaagta taaggaaaat agctatcgta agtagtatct ctccaaatat 240 agctagtttt gctgaacaat tgtgtactta gcaggtatat tacaccggag caaagtctgt 300 acgtcgacgg aaaattatta ttaaaactaa cctgaaacta agctgaattt cggcacaatt 360 agtacagagg ttgtcgggaa aaggtgagca cacgcaagga ttgcaaggaa actgttcatg 420 taagcactgt tatactttac cgatgtacaa aactatgacc tgatactaaa ttctaataaa 480 atttcagttt taagctgccc aaaacagctg ctttaaaaag gtatcgttca ttacttgcga 540 ggagttcgaa ca 552 // ID BEL-649_AA-I repbase; DNA; INV; 6193 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-649_AA_; KW BEL-649_AA-LTR; Pao_Bel_Ele107; BEL-649_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6193 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5225-5785] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1577..6136 FT /product="BEL-649_AA-I_1p" FT /translation="MRDVTLMQELVCKLPPAIKLDWARRTKKLTKVTLTTF FT SDWIFSIAEDASIVANPRKPSSYEHEARSKRQGKAFVNAHVDSSSAIASDQ FT MAAGNRMHNASNQKVFLEESISCPLCKGSCRTAAKCKRFLDLSYEARWAAV FT REFRLCRRCLRKHGGNCNSKVCGVNGCAFKHNSLLHRNLVAPSVSAVSANQ FT SKTKEEHSVNTHQVNAGLELFRYLPVTLFGPQAKVECYAFLDDGSELTLLD FT EQIANELELVGNPLPLCLKWTGGTHRFEEKSRRVDVGISGINGKKFDLSGV FT RTVEALELPYQSLNVDVLKSKYQHLSDVPVESYNRVRPRILIGVKHANVSL FT VRRCREGKDGEPIAVKTHLGWTIFGGRSNHEPKADGSYSYHICACNHPNDE FT LLHREVKQYFSLDSLGIMAPVNSTMSRDDERAMALLNSFTRVVGNKYESGL FT LWRFDDIRLPDSRPMALQRLKCLQRRLANDANLKLAYDEKLADYLAKGYIR FT KVTAEELVQKKDRIWYLPTFPVINPNKPGKIRIVWDAAAMAHGVSLNSVLL FT TGPDMLSSLVNVLNQFREYRYGICGDIREMFLQIGIRPEDQFCQLFLWNDS FT INDGEPNTFMVSVMIFGARSSPTTAQFVKNQNARRFEVEFPLAVETIEKKH FT YVDDMLASTETEIEAIELAKSVKHVHAQGGFDIRNWVSNSPKVLEALDESA FT TGEKSLNFTAEIATEKVLGMWWNTSTDCFTFKISWSRFEAALIDGNRAPTK FT RELLRILMSIFDPLGLVSHFLMILKVTLQEVWRMGLKWDDQIEGRQLESWK FT AWISLLPKLEELQIPRCFRRVTTVGEHNQIQMHTFVDAGDNGMAAVVYLRF FT EEHGYVECALVGAKTRVAPLKYLSTPRSELQAAVIGTRLADTISKSLSLPV FT SQRFFWCDSRNVLCWLRSDHRRYSQYVAARVSEILDTSDVTEWNWIKSEWN FT VADDGTKWTGQVQLKTSDRWFQGPAFLQQPEREWPASPHSMETTCEEMKSN FT ILAHVHSKDAKAVVEAAHYSSWRRLTKVTAYVQRFVVNLRAKQQGLPLRRG FT LLSSEELRLAQLYHFRQAQSDVFSEEMKILCQAKEAGNNLRTQVESSSRIY FT KMSPFLDEDSVLRMNSRASKCRFLFPDEKYPIILPDNHPVTNLLLADFHER FT YHHRNYATVANEVRRRYCIPRLRQTLRRMRCSCMWCRNRDAKPAPPEMAEL FT PKARLAAFSRPFSHVGVDYFGPIEVIVGRKVEKRWGVILTCLTIRALHIEV FT ANTLSTSSCMMVLNNFIARRGTPVCFYSDRGTNFVGASKELREALRLLDSH FT EIAKEFTTPTTSWQFNPPASPHMGGSWERLVQSVKRNLTEVLKCKRPTDEE FT LRSALTQIESVLNSRPLTEVPVDNESEPALTPNHFLLGSSDGSKPMTLYDD FT NVQVVRRGWQVSQMIANQFWRRWLTNYLPVITRRTKWFKKVKPIAVGDVVV FT ITDPELPRNCWPKGRVIGTVEKGGQVRRATVQTSKGVYERPAVKLAVLDVW FT SESG" XX SQ Sequence 6193 BP; 1782 A; 1344 C; 1635 G; 1431 T; 1 other; acttttaata tcgtttattg gaagaggtgc gaaatgtcca atccggtcga ccatagtacc 60 cctacgacta gcagaggtgc tgaaggtaag caaaaacccg aaaagacacg tctctcccga 120 aagcagaaat caaataaaga ccccccaaac gaccccaatc cccaatacaa gccttctgtc 180 gcacaagaca aagcgcgagt aggtggacct aagtcgccaa tttcctcccg gattccgagc 240 ctcaacgaag ggaagaagtt gcatcatgtg tgcggatgta gatcaaagcc ggatggttca 300 atgcgttact tgtagatgtt ggtgccactt tgattgtgcg ggagttcctc aggacatcga 360 tcaatgcgaa tggagttgcc tgaaaagcga aactgcgaag gcaaacccgc gtaagcagaa 420 gacgtccaaa gcgaaaaccc gaagcgtaat gcagacatcg aagccaacga aaactactac 480 caagaagaag agttctggaa aaccaaagac ggtacaggat tcaacttcca aggacctccg 540 aacagatccg aaatccgcgg gcggtgaaaa accgacggat ctggatgcta tgctgttggc 600 tgaaatcgag aagaaggaaa cggtgttggg cagagcaaaa tcgatgaaat ctggatcatc 660 ttgcaaatca gcattatcgc tgaaactgca gatgcagcaa gttctggctg aagaaacatt 720 gatgcttgag gagatgcggc ggagacgtga attcatgaag aaaaagttcg agctgatgga 780 ggaattcgcc gaagtacgaa gctgccctgt cccaggtgct aaagtgaccg acccgctaac 840 gaaggtaaaa ggttggttac agaagcaggt cccatcagac agtgaatcgg aagatgctac 900 tgaggagagt gatacagatg gtagtggcga tgatacggat gagacctgct caaaccacga 960 atccgaggtt gatgaagatc atgtaaaata atctgataat atagaacagg aaagcgaggg 1020 tgatgaagtg agcgccacgg atagcgatgc gacggcaaga gaaaatctat atgcgcacga 1080 ggcgagattc gttccgggag aacgttcaac tcccgttcat gcgaagttac ccggtacttc 1140 aacgcgtgtg aatcattctc gtatcagcaa cgctctcaca cgtgagcagg tggcggcccg 1200 ccaggtagtt ccatgtagcc ttccaaagtt cagcgggaac cccgaggaat ggcccatgtt 1260 tatttcgacg tttgatagta ctacttcgat gtgcggctat agggacgagg aaaatatgat 1320 tcgtctccgg aattgcttga aggatgaagc ctttgcagct gtacgaagtt tcctcatgca 1380 cccatcgatg gtcccgaaag cgataagcgt attaaagctt agattcggcc aaccacacat 1440 gataatcagt actttacgag ataaggtcct agctatgcca ccagttagag ccgattcaat 1500 cgmgaagctg gtcgactatg ctctggccgt gcagaatttg tgttctacga tcgatgcctg 1560 tggacggaaa gagtacatga gagacgtaac cctaatgcaa gagctagtgt gtaagctccc 1620 accggctatt aaattggact gggcaagacg cacaaagaaa ttgacgaagg taactctaac 1680 caccttctct gattggatct tttcgatagc agaagatgca agcatcgtgg caaacccccg 1740 aaagccaagc agctacgaac atgaagcacg aagtaagcgc caaggaaagg cgttcgtaaa 1800 cgctcacgtt gattcttcgt cagcaatcgc gagtgatcag atggcagctg gaaaccggat 1860 gcacaacgct tccaaccaga aagtgttcct ggaggagtcg attagctgtc ctctatgcaa 1920 gggaagttgt cggacggccg cgaaatgtaa acgattcctc gatctttcgt atgaagcaag 1980 gtgggctgcg gtacgcgaat tcagattatg tcgtcgatgc ttgcgtaagc acggagggaa 2040 ctgcaatagc aaagtgtgcg gtgtcaacgg ttgtgcattt aagcacaatt cacttctaca 2100 taggaatctt gttgcaccga gtgtatcggc agttagtgca aatcaatcaa aaactaaaga 2160 agagcacagt gtgaatacgc accaagtgaa cgctggcctg gagcttttcc gctatctccc 2220 cgtgacattg tttggaccgc aagcgaaggt tgaatgctac gcctttctcg acgatggatc 2280 ggagttaacg ttgttggatg aacaaatagc caatgagttg gagctggttg gcaatccact 2340 gccattatgt ttaaagtgga caggaggaac tcatcgattc gaggaaaagt cgagacgtgt 2400 ggatgttgga atctccggta tcaatggtaa aaagtttgac ttgagcggag tgcgaacggt 2460 tgaggcactt gagttaccgt atcaaagttt gaacgttgat gtcctaaaga gcaagtatca 2520 gcacctgagt gatgtcccag tcgaatcgta taatcgagtg cgtcccagga ttctcatcgg 2580 agtcaagcat gccaatgttt cgttggtgag acgatgtcgt gaaggaaagg atggcgaacc 2640 catcgctgtg aagactcatc taggctggac gatcttcgga ggtcggtcaa atcacgaacc 2700 gaaggctgac ggaagttata gttaccatat atgtgcatgc aatcacccta acgatgagct 2760 acttcaccga gaagtcaaac agtatttcag cttagatagt ttgggaataa tggcaccggt 2820 aaactcaacg atgtctagag acgatgaaag ggcgatggca ttattgaatt cattcacgcg 2880 agtggtcgga aacaaatacg aatctgggct tctatggcgt ttcgatgata tccgtttacc 2940 tgatagtcgg ccgatggctc ttcaacgtct gaagtgcctg caaagacgtt tggctaacga 3000 tgccaatctg aaactcgcct atgatgaaaa actggctgat tatctggcga aaggctatat 3060 taggaaggta acagcagaag agttggtgca gaaaaaggac cgaatctggt atctcccgac 3120 attcccagtg atcaatccca acaagccagg taagatcaga attgtatggg atgcggcagc 3180 gatggcacat ggagttagtc tgaactcggt actgcttact ggtccagata tgcttagctc 3240 gctggtaaac gttctgaatc aatttaggga gtatcgctat ggtatttgcg gagacatcag 3300 ggagatgttc ctccagattg gtattcgccc cgaagaccaa ttttgtcaac ttttcctgtg 3360 gaacgatagc atcaacgacg gagaacctaa cacattcatg gtctccgtaa tgatcttcgg 3420 agctcgttca tcgccgacga ctgcccaatt cgtaaagaat caaaatgcac ggaggttcga 3480 agtagagttc ccgcttgcgg ttgagaccat agagaaaaag cattatgtgg acgatatgct 3540 tgccagcacg gagacggaga tagaagctat agagttagct aaatcagtaa aacacgtcca 3600 cgcgcaagga ggctttgata tcaggaattg ggtatcgaac tcccccaagg tattggaagc 3660 tttagatgaa agtgccacag gagaaaagtc tttgaatttc acggccgaaa tagccaccga 3720 aaaggtattg ggaatgtggt ggaatacatc aaccgattgc tttactttca aaataagctg 3780 gtctcgattc gaagcagcgc tgattgatgg gaatcgagcc cccaccaagc gagagctact 3840 gcgcattctt atgagcattt ttgatccgtt aggacttgta tctcattttc taatgatact 3900 caaggttacg ctccaggaag tgtggcgtat ggggctgaaa tgggacgacc agatagaagg 3960 aaggcagctc gagagttgga aggcttggat aagtttacta ccgaagctgg aggaattaca 4020 gataccccga tgtttccgtc gagttacaac cgttggagag cataatcaga tccagatgca 4080 cacctttgtt gacgcaggtg ataatggcat ggcggctgtc gtatacctac ggttcgaaga 4140 gcacggatat gtagaatgtg cattagtcgg agcgaagaca cgtgtagctc ctctgaaata 4200 cctgtcaact ccaaggtcag aacttcaggc cgctgttatc ggcacgagac tagcggacac 4260 tatatccaag tctttatcgc tcccagtatc tcaacgcttc ttttggtgtg actctcgtaa 4320 cgttctgtgt tggctacgct ctgaccaccg acgctatagc cagtatgttg cagcgagagt 4380 aagcgaaatt ctagatactt cggacgtcac cgaatggaac tggatcaaat cggagtggaa 4440 cgtcgctgat gatggcacca aatggacggg tcaggtgcag ctaaaaactt ctgatcgctg 4500 gttccaagga ccagctttcc tgcagcaacc ggagcgagag tggccagcgt cgccacatag 4560 tatggaaaca acgtgtgaag agatgaagtc gaatattcta gcgcacgttc acagtaaaga 4620 tgcgaaggca gttgttgaag cagctcatta ttccagctgg aggcgattga ctaaagttac 4680 ggcatacgtt cagcgattcg tggttaacct acgagccaag caacaaggat tacctttacg 4740 gcgaggacta ttgtcaagtg aagaactacg attggctcag ttataccatt ttcgtcaagc 4800 gcaatcagat gtattcagcg aggagatgaa gatattatgt caagctaagg aagcgggaaa 4860 taatctgcga acacaagtgg aaagcagcag tcggatctac aaaatgagtc cattcttgga 4920 tgaagatagt gtcctacgaa tgaacagcag ggcttctaag tgccgcttcc tttttccaga 4980 tgaaaaatac ccgatcatac ttcctgataa tcacccggta accaaccttc tgcttgcgga 5040 tttccacgaa agatatcatc atcggaatta tgctacagtg gctaacgaag ttcgtcgacg 5100 atattgtatt ccaaggcttc gccaaacgct ccgtaggatg agatgcagct gtatgtggtg 5160 tcgcaatcgt gatgcaaaac ctgctccacc agaaatggcc gaacttccaa aagcaagact 5220 ggctgcgttt agccggccgt tttcccacgt aggcgtggat tattttgggc caatcgaagt 5280 tattgtgggt cgtaaagtcg agaagcgttg gggggtcata ctgacgtgtc tgacaattcg 5340 ggctcttcat attgaggtcg ctaacactct tagcacaagc tcatgcatga tggtgttgaa 5400 caatttcatc gcacgtcggg gaactcctgt ttgcttctat agcgataggg ggacaaattt 5460 tgttggtgcg tccaaagaac ttcgtgaagc tcttcgcctt ttggacagtc acgaaattgc 5520 aaaggagttc acaacaccaa caacgtcttg gcaattcaat ccaccggcca gcccccatat 5580 gggaggcagc tgggagaggt tggttcaatc agttaaaagg aatctaacgg aagtgctcaa 5640 gtgtaagcga ccaacagacg aggaactgcg tagcgcgtta acacagatag aatctgtact 5700 gaacagtcgg ccgttgactg aagtacccgt ggacaacgaa tcagagccgg cgcttacccc 5760 taaccatttt ttattggggt catcggatgg atcgaaacca atgacactct acgacgataa 5820 cgtccaggtg gtccgacgtg gatggcaagt gtcacagatg atagcaaacc aattttggag 5880 acgctggctt actaactacc ttcccgtaat cacaaggcga actaagtggt tcaaaaaggt 5940 aaaaccaata gcggtaggtg acgtcgtcgt cattacagat ccggagttac cgaggaattg 6000 ctggccaaag ggccgagtta ttggtacagt agagaaaggc ggacaggtac gtagagcaac 6060 agttcaaaca agcaaagggg tttacgagcg tccggctgtg aaactggcgg tacttgatgt 6120 ttggagcgaa agtgggtaag ccaaatcttg gagttggcat accggggggg actgttacaa 6180 cccctcggtg ttg 6193 // ID Gypsy-5_DGri-LTR repbase; DNA; INV; 167 BP. XX AC scaffold_4666; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_DGri_; KW Gypsy-5_DGri-I; Gypsy-5_DGri-LTR. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-167 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_4666; Positions 620 786. XX SQ Sequence 167 BP; 55 A; 30 C; 24 G; 58 T; 0 other; tgactctgca tcaaaagctg aaaatatgta gtctcgaagt catttcgaga aattgacgtt 60 ttaagtttct gtttccaaat ccaaacagat gtatcttcgc caataattgt cagaagttag 120 aatttctttc aacattatga agcccttttt atataagtta acatcca 167 // ID DNA5-1_AAe repbase; DNA; INV; 2724 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA5-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2724 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1279-1279 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. 5-bp TSDs. TIRs are ~900 bp long. XX SQ Sequence 2724 BP; 926 A; 481 C; 453 G; 864 T; 0 other; cacggtgctc caattaccgt tattatcaaa taaaatatac catccaaacg gcagtcggca 60 gttggttcct gacgttgttc tgcacctctg agctgaattt gaaaaaaatc cttaggcaga 120 attttgagtt acgccctttt gaagtttata tgacaaaaat cacatgatgt atgccacttt 180 aacttgctta aacaaagaga ttattataaa attttgtagt tttgtttttt tcatctgatt 240 taagacactg aaaaatactt ttttcttaaa tttatctaga ctgacatttg aaaagggcca 300 atgctatgtt tagttattac taatcagcca atactcgaaa catgatatct accaatcaac 360 tcctgatagt gattttaaga aatggtccaa agcattcaaa tatgtttgaa caattcttgg 420 acaggatatc tagatacgcc cttttgaagt gtaaggaagg aatattgcat ggtgaatttc 480 gttccatcta taatcaacgg ttaggtgcat acataatcat tacacctttg cggttgttct 540 tatttcatta aatttagcaa gttgcagaaa atttacacgt acacatattt tctagactga 600 cattagaaaa gggccaaagt ataattgaaa tacatacaaa tcaatatagc taaacaatga 660 ccatatcaaa gttgatcgta acactcgaac ttcacatagc tgtatgtata tcgtatatct 720 gtattctatt caattctgtt taattctttt ctattctagt ctgttctgtt ctattcaatt 780 ctactctact taaatttatt ttattctgat ccaccctact tgttcttata catattttta 840 caaatagcca ctgttgtaat caacaactgc gtgcaaatca tttgaacggt acactttcga 900 agtttgtacg actttaacac ttcagtcgtc gcgttgttgt attttgtaca acagtggtga 960 aaaaacctcg cttatcgtac acagcatcag cgtggtggtt ctgaaggtag caaaccgcgc 1020 gacgactgaa aggtgaaagc gctgacaggt ctgtgaaaaa gcgcttcgca cagactccga 1080 cacctgctca ttatgtcagc ctagccctcg accagtcgca ttgcagaagt atacatcaaa 1140 catggttatt tgatatttcg aagatttcgg cagactcagc tcctcgcagc tctcgacaaa 1200 cttgttcata ttaccttcaa aactgggatc cagctcgaat gaaaaacatt taatacaaaa 1260 tgcttcaatt ctcgaaaact atcaaatgcc acaatacgtt ttcacttttg gaacgtttaa 1320 ttccggatta caaaacggcg taagtaagaa tttcaacttt cgcaacccaa ccgctctttc 1380 accgcctttg tatgcagaga caaagtgact taaccacgaa tcaaaaaaac tacttgagcc 1440 gattttacac tggtagaaaa acgtcgaaag taactgaatg aaacggctta cggagaaccg 1500 tttttgaacg cgagctgatt gtatgcaaaa tttaacgatg tacacggcga aaaaaattca 1560 ttttaaaaca gttttaaaag acgggatgtg attcttctta attaattttc tcaaaaccgt 1620 atgaaagtta cttacgtcga tttgaaaaag taatcgagaa ttccttaaaa tacataaaaa 1680 attaacatat ataagttaaa acccttattt caacttacag tgaaataacc atttgaagct 1740 tcattttctt agagcatata agaagcacgg atgaggagtc agaatctgtg catagctatt 1800 tttcacagac ctgggagcgc tataaaacca tacaaacatc gaaggtgtac cgttcaaatc 1860 ttttgggcgc agttgttgat taatacagtg tctatttgta aaaatatgta taagaacaag 1920 tagggtagat cagaatgaaa taaatttaaa tagagtagaa ttgcacacaa tagaacacac 1980 tagaatataa aagaattgaa tagaatacag acatataata catatacaga tatgtgaagt 2040 tcgagtgttt catcaacttt gatatagttt gttcagctat attgatttgt atatatttca 2100 attatacttt ggcccttttc taatgtcagt ctagaaaata tgtgtatgtg taaactttat 2160 gcaacttgat gaattgaatg aaataagaac aaccgcaaag gtgaaatgat tgtgtatgca 2220 cccaaccgtt gattatagat ggaacgaaat tcaccatgca atattctttc catacacttc 2280 aaaagggcgt atctagatat cctgtccaag aattgtttaa acatatttga atgctttgga 2340 caatttccta aaatcactat caggcataag attggtagat atcatgtttc gagtattggc 2400 tgattagtaa taactaaaca tagcattggc ccttttcaaa tgtcggtcta gaaaaatttt 2460 agaaaaaaga ttttttcaat atcttaaacc agatgaaaaa aaaaacaaaa ctacaaaatt 2520 ttattctgtt tgtttaaaca agttaaagtg gtatacatca tgtgatttct gtcatataaa 2580 cttcaaaagg gcgtaactca aaattctgcc taaggatttt tttcaaattc ggcccagagg 2640 tgcagaacaa tgtcaggaat caactgccga ctgccgtttg gatggtatat tttatttgat 2700 aataacggta attgaagcac cgtg 2724 // ID Gypsy-38_DWil-LTR repbase; DNA; INV; 802 BP. XX AC scaffold_181152; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_DWil_; KW Gypsy-38_DWil-I; Gypsy-38_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-802 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181152; Positions 1981 2782. XX SQ Sequence 802 BP; 323 A; 122 C; 160 G; 197 T; 0 other; tggggatggt atggtaacgc tgtaaggcca gacctattcc cgaagcagct gatcgacgag 60 gtggcaatga tagagaagac gaggtggcaa ggcagagttg aaaacgagga gtctagagag 120 aacgacgaac gacgaacgac gtgtgtgtaa aatgagtcaa tacataattt tgtcaaaacc 180 tattgttccg ttattaatta cctaatttaa taaataatct acttaatata acaactattc 240 taattcttgt gggcaccaat cccacaaata tgggggctca accttactta aacgaagaaa 300 aatataacgg ttcaacataa ctaaaaaatg tgaagtctta agctaaatga tgtgaaaaaa 360 tagcaagaag tgctaaaacg acgaataaga tggacaaatt tatgtaaaag tattaattaa 420 agaaaattaa ttaaagtgag atctgaaaga caaaaaacaa aagtgtttta acgcagttct 480 aagaatttgt caaacgttta aaatacacac atataagagt gacgctacca atacagacat 540 acaaaagtga acatggcaaa gctcgcagat ttcggtgctg taaatgaaaa actgcgtacg 600 aaaagagaca gaacattgca ttgcttcaca aatatgtatt cgaaagggat ggagacagaa 660 ataacagaaa acggttacgc gaatttcaag gttacgacta cgtaagaaca gataaagagt 720 atgcaattaa aatgtcatat attgccgatg aactttctga taatgactta gctgtaatat 780 gtaatatgct acatttacac ca 802 // ID BEL-62_CQ-I repbase; DNA; INV; 2580 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-62_CQ_; KW BEL-62_CQ-LTR; BEL-62_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2580 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 277-277 (2011). XX DR [2] (Consensus) XX CC 'ATACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 65..2578 FT /product="BEL-62_CQ-I_1p" FT /translation="MQETYQNCGVCLGDQRDGSMIACQSCREWFHHQCVGV FT FSTIVNGFWTCVLCLNLPSHERFAKLLKFGTAPTRTNSPVIASGSSPAVVT FT TAITTSSSPLSMQPSVVGSLTTCRDQGEQHVAGRSYSADESSEDWLSKKIN FT VMKNLLITPVCSSKAARTFVSNSFASTTTGAVNGYSVESCTPSFNSQNQLL FT YGSLQSQQLPPPHHGRSQQQQVEPQQSRQQSQQDGPQPKQAWLQLQPPLVP FT TPQQLLARQVMPKDLPKFYGNPEDWPLFSSAFTNTTDACGYSNVENLARLQ FT EALQGKALEQVKGRLLFPALVPQVMSTLYLLYGRPELIIQTFLDKVRQTPA FT PTLDRMETLISFGLTVQNLCDHIETSGQVSHLQNPILLRELIEKVPAQQRL FT EWAFYKQQFRTVDLRTFAGFMSMLVAAATEVTFPSARKHTRNESEEQENDQ FT ELCSSRSAGRAHRKRSSREGREQPENVKQSTCWACSGSEHKVKDCEMFKRW FT DSHSRRKLIEYRDLCRACFGKHGRRPCNAKHLRRGQYQLQKKQKTPEQETD FT GEVGLIVHHTCQNSTLFRILPVKLSWNGNTVETFAFLDDGSDMTLVEQSIA FT DRLGIDDGEPVPLCLSWTNKVTRKEPKSQRIRLEIAEKGRSEKYTLKDSRT FT VASLDLKTQTLDFGELAQKFPYLRGLPVSSYQAATPKILIGTNNANLTATQ FT NLREGQIREPLASKTRLGWTIHGYAEDEQKHKGYSFHVSGHREDHISNKQK FT HKHTERFDTSTNKKRLTPETSHKGANTDGLFKNQQFLAQSDGLLQRVGDHQ FT QRPKTVFGLLGHAYCSPKDVGKAQSTAEPEVHYGSG" XX SQ Sequence 2580 BP; 750 A; 638 C; 635 G; 557 T; 0 other; ttctcaaaga tttaacctca aaccgaaggt ttgcttttgt ttggtatacg acattctcga 60 agagatgcag gagacgtatc agaactgcgg agtctgcctc ggagaccaac gtgatggcag 120 tatgatcgcc tgtcagtcgt gtcgagaatg gtttcaccac caatgcgttg gagtattttc 180 aactatagtc aacggtttct ggacgtgtgt actgtgtctg aatctaccct cacacgaacg 240 atttgctaag ctcctgaagt ttggcactgc acctacccgg acaaactcgc cggtaatagc 300 atctggatct tcaccagcag tggtaactac agctatcacc acctcatcct cgccgctttc 360 gatgcagcct tcagtcgtgg ggtcgttaac tacgtgtcgt gaccagggag agcaacacgt 420 tgcaggtcgt agctatagtg cagatgaaag ttctgaagac tggttgagta agaagatcaa 480 tgtgatgaaa aatctattga tcactccagt ttgctcatct aaagctgcca ggacgtttgt 540 atcaaacagt tttgcgtcaa caactacggg agctgtcaat ggatactccg tcgagtcgtg 600 cacgccatcg tttaacagtc aaaatcaact attatacgga tcactgcaat cgcagcaact 660 gccgccacca caccacggac ggtcacagca gcaacaggtt gagccacagc aaagcaggca 720 gcagtcacaa caggatggac cgcagccaaa acaagcttgg ctgcagttgc agccacccct 780 ggttccaact ccacaacagc tattagccag gcaggtcatg ccaaaggacc ttcccaagtt 840 ttatggaaac cctgaagatt ggccactctt ctcgtcagcg ttcactaaca ccaccgacgc 900 ctgtggatat tctaatgtgg agaacttggc acggcttcag gaggcattgc aagggaaagc 960 ccttgaacag gtgaagggcc gactgctctt cccagcgctc gtgcctcagg tgatgtccac 1020 tctttatcta ctatacggaa ggcccgagct gatcatccag accttcctgg acaaggttcg 1080 ccaaactcct gctcccacgt tagacagaat ggaaacgttg atttcattcg gattgactgt 1140 gcagaacctg tgtgaccata tagaaacatc aggccaagtg tcacatctgc aaaatccaat 1200 cctgttgcga gaactcatcg aaaaagttcc agcacaacag cgtctggaat gggccttcta 1260 caaacaacag tttaggaccg tcgatctgcg cacgtttgct ggcttcatgt ccatgttagt 1320 agctgcagca acggaagtca cgtttccttc agctaggaag cacacacgaa atgagagcga 1380 agagcaagaa aatgaccagg aactctgcag ctcacgctca gctggccgag cacaccgaaa 1440 acggtcgagc agagaaggac gtgagcagcc ggaaaatgtc aagcagagta catgttgggc 1500 gtgtagcgga agcgaacata aagtgaaaga ttgcgaaatg tttaagcggt gggattctca 1560 tagcaggaga aagttgatcg aatatcgcga cctgtgtcga gcgtgttttg gaaaacacgg 1620 acgacgacca tgcaatgcaa aacatttgcg cagaggccag tatcaactcc agaagaagca 1680 gaaaacaccg gaacaggaaa cagatggcga agttggtttg attgtacatc acacttgcca 1740 gaattccacg ttgtttcgga ttcttcctgt taagctttcg tggaatggaa ataccgttga 1800 aacatttgcg tttttggacg acggctcaga tatgacactg gttgagcagt ccattgcaga 1860 tcgtttaggc atcgatgatg gcgaaccagt tcccctttgc ctatcctgga ctaacaaggt 1920 cactcgaaaa gagccgaagt cacaacgcat ccgcctagaa attgccgaaa agggaagaag 1980 cgaaaagtat accttgaaag actctagaac agttgccagc ttggacctga agactcaaac 2040 actggatttc ggagaattag cacaaaagtt cccgtacctt cgaggactac cggttagtag 2100 ctaccaagcg gcaacaccaa agattcttat tggcacaaat aatgcaaatt tgaccgccac 2160 acaaaacctt cgcgaagggc agatcaggga gccccttgct agcaaaacac gtctgggctg 2220 gactatacac ggatatgccg aagatgaaca gaagcacaaa ggctactcat ttcacgtttc 2280 cgggcatcga gaagaccaca tttcgaacaa gcagaaacac aaacatacgg agagatttga 2340 cacttcgacg aacaagaaga gacttacgcc agagacatcg cacaagggcg caaatacgga 2400 tggactcttc aagaaccagc agttcctggc gcagtcagat ggtcttctgc aacgggtcgg 2460 ggatcaccaa caaagaccta agactgtatt cggactgcta ggtcacgcgt actgcagccc 2520 caaggatgtt ggtaaagcac agtcaacagc ggagccggaa gtgcattacg ggtcggggaa 2580 // ID BEL-645_AA-LTR repbase; DNA; INV; 636 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-645_AA_; KW Pao_Bel_Ele219; BEL-645_AA-I; BEL-645_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-636 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 636 BP; 226 A; 93 C; 146 G; 171 T; 0 other; tggcgaacaa acctgtcgat gtgtagagag ggtaagtaaa taaagaccat aggtaaatag 60 aagaatttcg ccatttcgaa actagaggaa aagcacatta tttgcgaggg agtttatttt 120 caaaatagat ttccggggat taaatagtgt ccaggctgga taaatcgacg gttagagtgt 180 aaaggggaat ccgtgagtac aacctgaaaa aggaaagatg aaatgagtga gaaatcaaat 240 gtaattatgt acttacctgc aggctcggtg aaaccaaact ccgtccgcca cgaaaacccg 300 tgctaagact gtatgttttt cggtgattta gactgtaagt agaattgaaa tcagtgaatt 360 aagaaaattt tcagtattat agtttcatct gtgatgagat tcaacagcaa aaataccaat 420 tgaaatccgc tgaaacgtta gggtagttgg accatagttg gacaccaata ccacggatcg 480 gaagaaattg aacggcaaga cgaaaccaat ttattgtgag tagacgtaga ttagatcagt 540 agaaattaat taaatgaaat ttaataaatt ttagtttgag cgttgctgaa aaccagtcag 600 ctgcttttgg aaaattttgg tttcccttcg ggaaca 636 // ID Penelope-10_HM repbase; DNA; INV; 5412 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5412 RA Bao W. and Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 448-448 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1855..3687 FT /product="Penelope-10_HM_1p" FT /translation="MRQYNKEIIQAKSNFSHLFYNKLVTNISKHVLTDDEK FT ELLILGLNFIPSITNTSLNMHITAFERFKTTLLQQLYFGTEEIMRHPFHKK FT SNWSPPMTKNKTIINYLNLVECDITALIFSQLPKDKKNLTPKLIHALNSLM FT FNPTIIIKKADKGGGICILDKTDYEEKISLLLSDKNTYKLLSEDPTKSIAR FT DVNNLIDYMFLHHMIDKKTSEFLRPKTPHRTPLFYGLPKIHKIGTPLRPIV FT SGCDGPIDNLSFFITEFIQPVAEQLPAYFKDTTHFLKLLQSHVLETNNYIL FT VTADVISLYTNIPHREGIDAVKFYLKRAPPFNGPIKRPPIAFIDIILETIL FT THSNFQFSTDHFLQLTGTTMGTRMAPPYANLFMGRIDEKIIDQFSMWISFY FT KRFIDDIFFIFHGPPEILTQVFEFMNAIHPTIKFTFNNSTTSIHFMDLTIN FT KKLDGQLETTIYKKATDTTALLHYNSFHPPHQKSNLIYSQALRYNKNISNN FT YNLIKELKQLAQILVIRGYPIQSINKGFKQALRHTQHELINNKYKEEREKE FT KILPMVTTFGKVGKLINKIVKKHWNIVEQDTDLKNIWPTPPIASFKNIKSI FT KKFLIQTAHKKT*" XX SQ Sequence 5412 BP; 1617 A; 853 C; 774 G; 2167 T; 1 other; tatttaaggc tattgttgtt ttgttgatta aaaaaaattt ttttttaatg aggtattctg 60 atttctgttt atctttggat tatttatgct attaatacct atctggtttt agatcttaaa 120 ttgaatttat ttatattttt atgtttcgat tagaatttta atttattgtg ctttcatgtt 180 ttctttcggt ggtatggtaa gtgatttatt tagktatttt tatttatttt ttacagttta 240 tctacacttt tcatatcata tttttaaaaa aacattagtg ttttacaaat tttgggttgg 300 tggggttttg gctatttgta gtgagtcctt ttttactgca gtatggataa ggcgtaagtc 360 gcggggtaaa gcacaagtgg cgatgacgtt tgtctgacgg gataaactag ttataagtgc 420 tgatcctcat cgaaacgcct gtaataatag acgcgactct gaggagatgt ttattacttt 480 atcactacat aattatgttt acacattagc attaatgttc tttttccatt ctaaggcctt 540 acattgctat ccatattttt tgtttttgtt ttttatattt tttttttgtt ttttattttt 600 aagttttttg tttatcaatt ttttggttta aaaaaaattt tattttggtt atatttgttg 660 tatatcttcg tacgtactag ttttttatat gatacaattt tttgttattg ttattgtgag 720 tactgcttgg atgtgttgtc tgtcttactt gttttgaaat tttcctctat taatctttat 780 ttttgtcttg acaactgtca aaatatgctt cgttcgataa ataattcttc tttattctaa 840 tgtacttcat ttcttccctc gggtgttcac tgctcgtgtg tgaccattcg gcgaatttta 900 tatgccaaag tttttaaata atattgggcc tagcccgtaa gggttagtgt tacggtcatc 960 gagcacgact attttccata tcacaactac tgtcaccgcg aaagaggatg tttgtctctg 1020 gaggatgcct gggctcgctt ttcagtcata agtcagttat aaatccatgc ataggtgttt 1080 aatccatcat ggttttctgg agtatgactg ggttggcttg gtgtagttca gagatatatc 1140 ttttacgctg gttactaaca acacaagact ctactgaaca acataaacgg aacagactta 1200 cacattacac ctatataccc ttatgcacat acagcctgtt tacatcgttt gcatgctttt 1260 tgttctttta ttagataata tatctttgtt tgaactcatc cttatttaaa taatattttc 1320 ttttaataga ttggtatgta tatacagatt tgtatgtttt gtatatgtgc ttatcgtgtg 1380 tgattgattg ttcattttat tattcatata taaaaatata tataaaagcg attcttaaaa 1440 actcacaact tttgatatgc tttgttgatg tataaatttt cgttgttggt ttggttttat 1500 cattaaaaat taaattgttt gttagtttat tgttctagca tctaataatt tatattatat 1560 gcatcttttt taccattaag tttatcgttt aatggtgtct atcattcagt tcggatccac 1620 aaactttaat actgtgtttg ctgtctcaat gctgtttttg actttaaatt atcttcacta 1680 gttttctatt gtctcttaca tgtctacttg tctcttacct gttattacag gtatccttga 1740 taaaatattt ttctctagga tcagcaaaga aagctgttta ctccgtctaa cgaaaaaaga 1800 aattttacac catttactct ggctaaatta ttttcgtcaa caccacacag caacatgagg 1860 caatataata aagaaatcat tcaggctaag tcaaatttct cacatctatt ttacaataaa 1920 ctagttacta acatatcaaa acacgttcta acagacgatg aaaaagagct cttaattttg 1980 ggtttaaatt ttattccttc aattacaaat acgtcactta atatgcacat aacagctttt 2040 gaaagattta aaacaaccct tctccaacaa ctctactttg ggaccgaaga aataatgagg 2100 catccattcc ataaaaaatc aaattggtct ccccctatga caaaaaacaa aactattata 2160 aattatctta acttggtaga atgtgatata accgcactta ttttctcaca actaccgaaa 2220 gataaaaaaa atttaactcc taaacttatt cacgcactaa actcccttat gtttaatccc 2280 acaattatca taaaaaaggc cgacaaagga gggggtattt gtattttaga taaaactgac 2340 tatgaagaaa agatctctct acttttatca gataaaaata catataaact actctcagaa 2400 gacccaacaa aaagtatagc gagagatgtt aacaatctta ttgactatat gtttcttcat 2460 catatgattg ataaaaaaac atcagagttt ctacgaccaa aaacacccca tcgtacaccc 2520 ctgttttatg gccttcccaa gattcataag ataggcactc ccttaagacc aattgtttct 2580 ggttgtgacg gaccaatcga taacttatcg ttctttatca cagaatttat tcagccagtt 2640 gctgaacaat taccagcata ctttaaagac acaacgcatt tccttaaact ccttcaatct 2700 catgtactag aaaccaacaa ttacattctt gtcacggctg atgttatatc tctgtataca 2760 aacattcctc atcgcgaggg tattgacgct gtaaaatttt atttaaagag agcaccgcca 2820 tttaacggac ccattaaacg cccacctatt gcttttattg acattatatt agaaacaatc 2880 ctaacccaca gcaactttca attttcgact gatcattttt tacaattaac aggtactacc 2940 atggggacac gtatggctcc accctatgct aatctattta tgggaagaat tgatgagaaa 3000 ataattgacc aattttcaat gtggatttct ttttataaac gttttataga tgatatattt 3060 tttatttttc atgggcctcc ggagatacta acacaagttt tcgaatttat gaatgccatc 3120 cacccaacga tcaaatttac tttcaataat tctaccactt ctattcattt catggattta 3180 acaattaaca aaaagctaga tggacagtta gaaacaacaa tatacaaaaa ggcaactgat 3240 acaacagcac tattgcatta taattcattc catccccctc atcaaaaaag caatttaata 3300 tatagccaag cgttaagata caataagaat atatccaaca attacaacct aataaaagag 3360 ttaaaacaat tagcacaaat tttagttata cgaggctacc ctattcaatc tattaacaaa 3420 ggttttaaac aagcactgcg acatacacaa cacgaactta tcaacaacaa atataaagaa 3480 gaaagagaga aagagaaaat tttaccgatg gttacaacat ttggaaaagt tggaaaactc 3540 ataaataaaa tagttaaaaa gcactggaat attgttgaac aggatactga tcttaaaaat 3600 atatggccaa ctccacctat tgcgagtttc aaaaacatca aatcgatcaa gaaatttcta 3660 atccagacgg cgcataaaaa aacataaata aaaaatctat tttttcttta ctaagtttca 3720 tttcctaatt gttactattt acatactatt attggtttac cttatagttt tatttataat 3780 ttactagttt tttaataaaa ttttatcagg cgtttgttat ggtctcaaaa ttacattaaa 3840 atcggaccgt agagaaacct ttacttctcg gatattttga tttagaaatg gtctcaaagt 3900 tacttgtaat tcggaccgta gaggaaactt tacttttcga ttatttcgat ttagaaatgg 3960 tctcaaagtt acttgtaatt cggaccgtag agaaaacttt acttttttgc aatatttttt 4020 cccgcttcgt atatcgtttt tatatcgcga cttatatgac tgttacatcg cccaaaataa 4080 atttttaacc actcatttat tttaaattta tctggctttg ttatagtatt tagtttttct 4140 tattgcaata ttcattaata tattatttat ctgctgattt tcatattatt actgtttatt 4200 atgaattttt atgatttatt atcttgcatt gattactcaa ctttatttga attttcatat 4260 gcacaatttt catcttttat tattattcag aggaattttt ttatatcctt catttaatgt 4320 ctacatttat gccggttact ttattaaggt ctttgtgcga cgattttttc gcaaccacac 4380 attgttctat cattgttatg attctaggtt tcgcgggttt ttcattttcc ttcacttttg 4440 attggttatc acatttcgaa cgaggtatct atgacagcgt tagagtttat ttaaggctat 4500 tgttgttttg ttgattaaaa aaattttttt tttaatgagg tattctgatt tctgtttatc 4560 tttggattat ttatgctatt aatacctatc tggttttaga tcttaaattg aatttattta 4620 tatttttatg tttcgattag aattttaatt tattgtgctt tcatgttttc tttcggtggt 4680 atggtaagtg atttatttag gtatttttat ttatttttta cagtttatct acacttttca 4740 tatcatattt ttaaaaaaac attagtgttt tacaaatttt gggttggtgg ggttttggct 4800 atttgtagtg agtccttttt tactgcagta tggataaggc gtaagtcgcg gggtaaagca 4860 caagtggcga tgacgtttgt ctgacgggat aaactagtta taagtgctga tcctcatcga 4920 aacgcctgta ataatagacg cgactctgag gagatgttta ttactttatc actacataat 4980 tatgtttaca cattagcatt aatgttcttt ttccattcta aggccttaca ttgctatcca 5040 tattttttgt ttttgttttt tatatttttt ttttgttttt tatttttaag ttttttgttt 5100 atcaattttt tggtttaaaa aaaattttat tttggttata tttgttgtat atcttcgtac 5160 gtactagttt tttatatgat acaatttttt gttattgtta ttgtgagtac tgcttggatg 5220 tgttgtctgt cttacttgtt ttgaaatttt cctctattaa tctttatttt tgtcttgaca 5280 actgtcaaaa tatgcttcgt tcgataaata attcttcttt attctaatgt acttcatttc 5340 ttccctcggg tgttcactgc tcgtgtgtga ccattcggcg aattttatat gccaaagttt 5400 ttaaataaaa tt 5412 // ID Gypsy-207_AA-LTR repbase; DNA; INV; 1751 BP. XX AC supercont1.1553; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-207_AA_; KW Gypsy-207_AA-I; Gypsy-207_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1751 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1553; Positions 27873 26123. XX SQ Sequence 1751 BP; 477 A; 399 C; 386 G; 489 T; 0 other; tgtaaccagt tgttacaatt tttacttgct ggctccaccg caagtggata ttcccttttt 60 cccgaaataa cattacgccc tttccaaaag taagtctccc actaacacta gtgtatggga 120 caggggcggc actcagtggc ttgatgagtg ttcgagctct gcaccactca caacattaca 180 tagtgtgggt agcggctttt gagcattgta aaacggctct ccgcggtgta acataagttc 240 gcccggatat gtgctattcg ctcatagggg gagcacaaat cgctattggg gtaaaccgcc 300 atgttatgag cataactgtc aaaaatcgtt cgtgactttt aagcacgaac gattttagat 360 agttcggctc cactgataaa ttttaagttc ttataaatag tttgaaactg ctaagaaaga 420 tctcttttgc ctggaaaatc tgaaagtgat aagaagtgcg cgttgttagt tgaaagtttt 480 ttacaaaggt tagaagtaaa gtaataagaa aagtggaagt gttaataatt agtgcttttg 540 tttctttgta gataggtggt gattgttagt tggacaatag acaattgtta gtattgttag 600 tctattagtc aatagtcaaa gctgcttaaa tagtgtatta gcagtattag tatggttagt 660 aacaaaaagt gatcattgtt tcctaaagct aatgtgaata aaaccaatta tagagaacac 720 gctaaaaaca tacgaaacag tgtgaaacaa cagttatatt aagacaaaca caagaaaaag 780 gtaatttaat gttttgtgaa tagttgctag tttgttagag gctaaaatat gcgacgtcac 840 agggtactaa ttttgtgtgc aggtccacga agcgggcctt tttcttctcc ttttgagttt 900 ttcttcggca gacgagttcc tggaaagtgc tagtgacacg gagtgaacgg aagtggccag 960 aatcgaagcg caaccaggcc gagacagccc agacgccatc gctgtccgag aattcgggtc 1020 gttaccgtca acgaacgacc gttccgttac accaccgcca tcttccaagc cccagtgcct 1080 ccgtagtcca ccgccattgc ctccgtcgcc attccatcaa gcgaccagtt ccccgtcggg 1140 actcttgttc ttcgccgcca tcaacgtaca ccgccggtgg cccgctattg ccatcgggtc 1200 tcgtccggat tgtgtcaccg ccaatcgacc gccttcgtcc accattgaat ccatggaagg 1260 tccagtcaac cgccgatgaa atccgttaaa gccacctcca agaaccgcat gccaccgaac 1320 ccgccaccaa cgccaccagg accgtgatcc gtttatgcag caagcacgca agtacccctg 1380 tgacatgacg tcagcaataa attgtaatcg ttattttgaa actttgcctc agacgctatt 1440 agttcaaaga gcatagccct ggttaattaa tatttccact tcttttcgta tttccgcttt 1500 gtcgggttca aaaggaaccg aatttgcttc cactttcgga tttccctagc agtttcacta 1560 tttcgaattt aaaaaccctt tggtgtcgaa cattgcagtt tagttcaaac gacacgtgtg 1620 aagccggtgg gctgtagtcc atttcgagac gttccagctc ttagtgagtg cctctcaact 1680 gtgtgggtgt tgagggaaca agccgataag cgaccctgag gattccttta acccttagct 1740 agcaggtaac a 1751 // ID BEL-29_AA-LTR repbase; DNA; INV; 634 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-29_AA_; KW BEL-29_AA-I; BEL-29_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-634 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 1591833 1591200. XX SQ Sequence 634 BP; 201 A; 114 C; 127 G; 192 T; 0 other; tgttgctacg accgctcaac cggcaacccg gctaccgtag gtcagcagaa tccaacgacg 60 gacagcgtgg ctactacagc caagcgatag atgtgtaagg agatgtctga tgaataggtg 120 gacaaaacaa aacaactagc tattacgaag caaaattgaa cgggttgaat ttatgtgctg 180 aagtagttta ttagttttac tacactattt gctgtttttt ctttcaaagt ttataccgta 240 taattcgtaa gttatattga atttattacc aattatgccc aatatctaat cgatctattg 300 tgaatagtga aatcaggcta atatcgttgg ccacctctaa agttcgagca cagtacacct 360 actgaactca gtgcaaattg gtaggttaaa gttggaatac tagaagagct gaagtctgat 420 tatattttct atagccatcc tagcgagcct aagagtggaa tctcgcgcat taagcttgaa 480 tcagtggata gatcgaaaaa agcgcactat tatgtatgta gaaggtaaaa ttacaagctc 540 atttcttata ttaatatcac tctaatttct agcttgaagc gttccattaa taaatttgct 600 ctcaagaagt gcggcgtttt gttatgcagt aaca 634 // ID R1_SC repbase; DNA; INV; 3550 BP. XX AC L00945; XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Sciara coprophila retrotransposable element R1 reverse DE transcriptase gene, 3' end. XX KW Non-LTR Retrotransposon; Transposable Element; R1_SC; KW Retrotransposable element R1; SCR1; reverse transcriptase. XX OS Bradysia coprophila OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Sciaroidea; OC Sciaridae; Bradysia. XX RN [1] RP 1-3550 RA Burke D.W., Eickbush G.D., Xiong Y., Jakubczak L.J. RA and Eickbush H.T.; RT "Sequence relationship of retrotransposable elements R1 and R2 RT within and between divergent insect species."; RL Mol. Biol. Evol 10, 163-185 (1993). XX DR GenBank; L00945; Positions 1 3550. XX SQ Sequence 3550 BP; 788 A; 679 C; 1128 G; 955 T; 0 other; ttgcgtgggg aggtaagggg actgccgtct agcttcaggg tggttactag tagaatagtt 60 aacgatggta ttggaatcag tgcgattgtg ataaacgacc ctgaagctga tgtgcttgtc 120 attgaggact gtaccgatga gtacggtgtg tgtgtgttga tcaaaggtgc cacatgtagc 180 atgtacgttg tgtccgtgta ttgtagattt ggtactgccc taggaccata tctgcaatac 240 atggaaaatg tgcgtgtgaa gtgtggtaat acctatatga tcatggggat ggacgcaaat 300 gccgtttccc ccctctggtt cagtaagggt gagaacttgg gaaggggaag gctgaatgaa 360 gcgaatggtc tgctgctcga ggaatggata ctcgaaggca gaatgattgt tatcaatgaa 420 ccctcggagt ggtacacgtt tagtgggccg aatggctcta gcgacattga cgtgacactg 480 gtaaacgaag ctggcggtag gttcgggtat gagtggtctg tgcagcctga atggggtgtt 540 agtgaccata atctgatccg tattcgagtt agcttggacg gactagtggc tgatgcttct 600 ccatcaccac agtctgccag gtggcagacc cgtgatacgg attggggtga atacatggga 660 gatgtgaaag ctaaagcaga tgtgtttggc ctggcgcaat atgaaaacgt gagtgtcgat 720 gagaaggtcg acttactcac agagtggata tacggtgcga acgactggaa catgcggagg 780 cataccgcgg tgcgaacttt ccagaatgag tggtggtctg tcgaacttgc ggagaagagg 840 agtgagctgc gacggcgcag gcatgccttt caacgcattc gtaatgcggg tgccgcaagt 900 ctggcagatc ggctacaggc gtttagggat tgtaagattg agtacaagcg catgttgtgt 960 gaagccaagc ttcggtgctg gcaggagttt gtggctagcg aatccaatga aaacccctgg 1020 ggacgagttt ttaagctctg tcggggtagg aggaagcctg tcgatgtctg tagcgttaaa 1080 gtcgacggtg tgtatactga tacgtgggag ggtagtgtaa atgcgatgat gaacgttttc 1140 tttcctgcct cgatcgatga tgcgagtgag attgaccggc tgaaggcgat tgccagaccc 1200 ttaccacctg atctggagat ggatgaagta tccgactccg tgagaaggtg taaggtgagg 1260 aaaagcccgg ggccggatgg gattgtggga gagatggttc gtgctgtctg gggggcaatc 1320 ccggaatata tgttctgtct gtacaagcag tgcttgctgg agagttactt tccgcagaaa 1380 tggaaaatag cgagcttggt catcttactc aagctcttgg accggataag atctgaccca 1440 ggctcctatc gacctatctg tttgttagat aacttgggca aggtgcttga gggtataatg 1500 gttaagaggc tggatcaaaa gttgatggat gtggaggttt caccgtatca gttcgcgttc 1560 acttacggca agtctactga agatgcatgg agatgcgttc agcggcatgt tgaatgctcg 1620 gagatgaagt acgttattgg gctgaatatc gactttcaag gtgcgtttga caatcttggc 1680 tggctttcca tgcttctgaa gttagatgag gctcagagta atgagtttgg tttgtggatg 1740 agttactttg gtggtcgtaa ggtctactat gttggtaaga ctggcatcgt tcggaaagat 1800 gtgactcgtg ggtgtcctca gggatctaag agtggaccgg cgatgtggaa gcttgtaatg 1860 aatgaactct tgctggcact cgtggcagct ggattcttca ttgtagcttt tgccgacgac 1920 ggtaccattg ttattggggc taatagtcga tcggcgttgg aggagctggg cacgcggtgc 1980 ttgcagttgt gtcatgagtg ggggaagagg gtgtgtgttc cggtgtcggc gggcaagaca 2040 acgtgtatcc ttatgaaggg tcacttgtcg gccaaccggc ctccgtgcat acggctgaac 2100 ggcacctcca tctcctacaa gagtgaggtg aagcacctgg gaatcttcgt tgctgagcgt 2160 atgaatttta ggccgcactt cgtttacctc cggggtaaga ttcttggtct tgttgggtgt 2220 ttgcgtcgtg tgatgaggaa gtcgtggggc ctggggcgta gggcaacttg tattctgtac 2280 aaggggctct tcgtggcgtg tatgtcgtat ggtgcaagcg tgtggttccg gactctgcgc 2340 ttttcgtatg cgtctatctt gctgaataga tgtcaaaggc tagtcctgta tgcatctcta 2400 aacgtgtgtc ggacggtttc cacagagcga atgcaagttc tccacggaga actaccatgg 2460 gatttggagg caacccgtag aggcttactc tccgaattcc gtaaaggcat tacgccggtg 2520 gatggcgatc cgatcacgga tgaagagatg ctagggttga gtggttctca gttcaaggaa 2580 ctgttgatgg agaggttgct ggatgtgtgg cagggcaggt gggatgtctc ggaaaaggga 2640 cgcctcactc atgagtttat tccgagtgtc cgatttgttc gtgagaatga gtggatggcc 2700 tttggactgt gtcttggtta tgtattaacg ggacacggta gcatgaatgg ttttctgcac 2760 aaacgtggtc tgtcaaatac gccggtatgc atgtgtggag cacctaacga ggacgtcaag 2820 cacctactcg gggagtgccc gctctacgag gatcttcgtg atctgaatgg ttgtgggtta 2880 ctcattcgta acggatcgct tgacgttagc ggggcactca gcgagattgg agcgtttgaa 2940 aagttgaatc agtttgcggt atcgctattc ggcaggaggt cgaggttgat gagagggatg 3000 aggatccgtg aatgatgcat gtgtacgccg ttatgtcttg aatgtggttg attgtgggga 3060 gaatgaccgg cgcaggatga gttcagctgg ttgaacggct aaggccaatt ccagactgtc 3120 tctcggagag tgcggactcc gcgcaattcg agctagtcga tcggagctgg aagttcatgc 3180 ttccatccga agtggtcttc gataccacct accctacccg agggttaaat cggtaccacg 3240 ggtgttggga gcccaccagg attgcactat cctcctggtc ccaactatgg ttgaggagcc 3300 tatccttggc tgaaacgtgg tggctgtggt tcaagtgcgt ttaatacgct ggggtactgc 3360 tcccgggtga attgcagtaa gggttgtgat aggcccttgc cctcctttgg ctatctggtg 3420 catgaaacac cggcatataa aggtagtcat ttgacgcgct gattgagcgt atctcaatct 3480 gctcctctta gagaggacga gatgttatgc cttcgggtgg gtaattctcg ttaatataaa 3540 gatattattc 3550 // ID BEL-67_CQ-LTR repbase; DNA; INV; 167 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-67_CQ_; KW BEL-67_CQ-I; BEL-67_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-167 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 288-288 (2011). XX DR [2] (Consensus) XX SQ Sequence 167 BP; 44 A; 46 C; 25 G; 52 T; 0 other; tgactcagat tatgaattct atttataatt acaccttttt tctactgctc aggtatttta 60 taacctgccc caccctgcaa atacacgatt tcacgcaagt ccagtgtctt attgccccgg 120 cggttataaa attggaaacc ctcaagctcg ttgccccaac ctggaca 167 // ID SLACS repbase; DNA; INV; 6920 BP. XX AC X17078; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Trypanosoma brucei DNA for retrotransposable element SLACS. XX KW CRE; Non-LTR Retrotransposon; Transposable Element; KW unidentified reading frame; retrotransposable element; SLACS_TB; KW SLACS. XX NM SLACS_TB. XX OS Trypanosoma brucei OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma. XX RN [1] RA Aksoy S., Williams S., Chang S. and Richards F.F.; RT "SLACS retrotransposon from Trypanosoma brucei gambiense is RT similar to mammalian LINEs."; RL Nucleic Acids Res 18(4), 785-792 (1990). XX DR Genbank; X17078; Positions 1 6920. XX CC ORF has low similarity to some reverse transcriptases (from CC Bombyx mori and Anopheles gambiae, amongst others). Genbank CC entry indicates a target site duplication, and 187 bp repeat. XX SQ Sequence 6920 BP; 2109 A; 1704 C; 1860 G; 1247 T; 0 other; aactaacgct attattagaa cagtttctgt actatattgg tatgagaagc tcccagtagg 60 aattatccgt acttggggtc aatattcggg aagagaaaga agtaagaaat cgctgcgttt 120 tatgatatcg ataggaaagg aaggaaaatc ctcaaaacca caaaaagtct tgttttgggg 180 gttcgaaccc ggacctccaa aacacaaaca caataggaga ggagtgttgc cagttgggct 240 atttcgacaa atcggagaaa aaggagaaaa tttatgctta gatgaaatat accaaatcgt 300 acacatcgaa gagaaaaaac ataggtgtac aaacggtgca cacgtagaaa aaagtagcgg 360 aattgataga tctcaggtaa ataaacccat tacagaaatc agcggatcag tttctttctt 420 gggacagaat atcgggaggt ttgatgtcaa aaagtcaccc gaaaatgaat attttgaaaa 480 aatcaactgt ggtcaaatcg acataacgac cacaaaaaat tgtcggctac gcagactttc 540 gtagcggatt tgaaagggat cgaccagccc aatgcatgag taacaacggt tactgcgaca 600 aagaggtcac aaaaaaatta ggtgctgccg aacctaaaat tttttcaaaa aatggaaatt 660 ttgaaaaaat caactggtca atcgacataa cgaccacaaa aaattgtcgg ctagcggaag 720 gctttcgtag cgcatttgaa agctcgacca gcccaatgca tgagtaacaa cggttactgc 780 gacaaagagg tcacaaaaaa attaggtgct gccgaaccta aaattttttc aaaaaatgga 840 aattttgaaa aaattaaccg tggtcaaatc gacatgacga ccacaaaaaa ttgtcgtacg 900 gcagactttc gtagcgcatt tgaaaggctc gaccagccca atgcatgagt aacaacggtt 960 actgcgacaa agaggtcaca aaaaaattag gtgctgccga acctaaaatt ttttcaaaaa 1020 atggaaattt tgaaaaaatt aaccgtggtc aaatcgacat gacgaccaca aaaaaattgt 1080 cgtacgcaag acctttcgta gcgcatttga aagctcgacc agcccaatgc atgagtaaca 1140 acggttactc cgacaaagag gtcataagaa aatagggcgt tggaaaagaa aaatgctcaa 1200 aaaataagga aaaagagcaa atttcgcagg ccgaaaaaaa cgacacaaat ggtttctaca 1260 cagacttccg tagcgcattt acacggaatg gacgagtaga attaggcaat aacaacggtt 1320 attgtcataa cgaagcgaca agaaatatag agagggggaa aaagccgaaa acatcaaaaa 1380 ttgatgaaaa aaggcaaatt tccaacattg cagctggtcg aaaaagtgat cactacgaag 1440 atccgagtag tgcattttca cgaatttcaa tagaccattg caatgctcaa ataaaaattg 1500 gtcgcaaaaa aatggttaga aacttgaggt catctgaacc tcaaaaaatt tcaaggttcc 1560 caaggcacca tggggaagga aaaaaggcgt cttctccggc tttgaacctg cagtggccag 1620 gacagaagca ggtagcgtcg gcagttgcgc ccgaaatgag aaaaagggcg cccccgaaaa 1680 acaacaacac aagcatccac cggaactgta ggaaacacct taggaaagaa ggtgattggt 1740 ggatagtaga gggcgggcaa aaccacaaga aagccgccca ttcccctctc aaaacgaaac 1800 cggtaaataa ggaggggaac cggaagaagt acggcagacc accgcgtgag gtagaaggga 1860 aatggcttac ctccctcatt gcggcaacga cagaggccgt attacgccaa ctggggaagg 1920 gaaaatcccc aacagcccac acgaagtcga accagagtag agtcccgctg cagcactctg 1980 agaaaacggc taagggcatc aaatacgcga cgccccaacc aaaaaagaaa gcacatgagc 2040 agcggaaaga gctcccccat tggcccccaa ccaagaggag ccacggcgct aacaaagggc 2100 agggagcacc agtaagagcc cctgcgaaga cacaagggaa gggggaagaa caaccccaca 2160 caatgcgaac atgggcacag gtggcagcac cgaagaaaca aaaggtgaca agtaaaccac 2220 ccatggctca aaaaaaaaag gcacaggggg aaaagaaagg ggcagcggcc aaccccttcc 2280 actgggagct acaggtggag cagcttctca aggatgcgga acagatacgc gaaagcatgt 2340 acattcgctt tctccacgtt cggcagagtt ggtggagcac atgcttcaaa gcccacatgg 2400 agttccactg cccagtgtgt ggtttcgcgc acccggaaga aaccataaca gtaacacact 2460 gcaggcagca acacccagga gggccgcctg attccctaca ccctgacaac aacagggaat 2520 caggtgcagt gtcgcaggtc cctcctgctc actttcggcg gtggtggcta tcctctcaca 2580 tatggaggaa gagaaaaatt tccaccctat tatcgccgaa gctaccagtc cccgagcaag 2640 gagactaccc aaacgctcgt tcaggcattg ggactggagc tccccacagc ccccatcact 2700 gcgatagaag cgctggccca ccatgaccag atgatccggc agctgttgga ggcaccgcaa 2760 cgacggcacg acaatgcggg cattgtgggg taaatgagct aatggaaggg ccttccaacc 2820 ccggggaaaa tttggggata atctccgtaa accccaacac atgcgagtcg atcgagctga 2880 cgaatcagat cctcgcgaag acatatacga cctcctggac agatatggag gtctatgcac 2940 gggggaacgg aagaatatgc accgcctacc ccgacgttga tcacccactc caagacgttc 3000 ggggaaatgt gtcatcgata ttggaacgtg gggaacgaag cggtgggttt gaaacgcatc 3060 ctccacgaga tcctgttacc ccacgggaac ggcacggcaa catacaaact cgtggcgccg 3120 tgatcgcacc gacaccgttc cacgtggtag cggcaatacc tcaacagacc aggaaacgtc 3180 gctgggatat cctggatggt atggtccgcc gcaccgtgag ccagagcact gtcgacccaa 3240 aaactgtggt catgtgcgtg taccgtcgtg aagaagaaga aacatacgac gtactagacg 3300 aggaggagca agacgacgac ctcctcggta tccccaaccc cacgccacgg cgattgagga 3360 tatcacaagc cggtccacag caaaactcat gggtggacac acggggcagg agggcatacg 3420 gctcacagga ggaacaggat gagaggacat cgacagatca ggtgagtatt ttctcccacg 3480 acgaaacacg ggagttatca tcaccactgg agtgtcctat cgtaggatgc accgcgagtt 3540 tcgtgggccc acgcagatgg gagaaggcca aatcccatat atacggggtc cactcgctgg 3600 aagaggtccg cgaaatcccg aggggggagc ttatatgtaa ggggatagta agatgcgaga 3660 cttgtgccac gctcctccct acgtcggaca gagcgaaaca ggcacaccgc gacgattgca 3720 gaccctatct cccgcggaaa gaaaacatcc gccgaaaaag ggccgctgaa agagaagcga 3780 cagaggcgag cgcacagcaa ggaatagcgc tacgcctcga gcggcagggc ccgtacataa 3840 ctccccgcga catagaggag cccaccaaca cgacgacgga aagttggtgg agggagaagg 3900 tagctacgaa acgctacctt cacagaaagg agtggccgca gtggcttgac atctgccgca 3960 cggtcctcct cggatactcc gcgtcatcac aaggcgagcg gcaccaacgc caagtgatgc 4020 tccttgatct ggtccggaat catctccaca cgcgcacagc caggcgcgag caacagcagc 4080 aacgtggaaa ggataaccag gaagaggagg accgccagaa gaaggaggag aaatccctgc 4140 gaaacgcgtg gaaaccctgt gcctcctcag tgcgacaggg agggcagccc agctcctcgc 4200 agccgaaaag gctcaaccgg tggagtacag ccccgaaatg gctcaaacaa tcggggaact 4260 gtacccgcag gaggatatcc atgatattcc ccggcccacc ggtggaacaa ccaggggtcg 4320 tgtcagtcga cgctgaggaa gtagcgaaaa ctatcgctag gcgactgaca cggggcgcgg 4380 cgccagggtt agatgggtgg acgcgagaac tattataccc actcaccctg gaccccgcgc 4440 taaagatgga gattgccgcc gttgtaaagg acatcataaa cgccgatgtc tcgatggagg 4500 tgggacgccg cctccaagca acgagcctaa cggtacttcg gaagccgaat gggaagtacc 4560 gaccgattgg agctgagagc gtgtgggcga agctcgcatc ccacatagcg atctcccggg 4620 tgatgaagac agccgaaaag aaattctccg ggatccaatt cggagtggga ggccacatcg 4680 aggaagccat tgcaaagatt agaaaagact ttgcaactaa aggcagcctt gccatgctgg 4740 atggtcggaa cgcgtataat gccatcagca ggcgagccat cctcgaggcc gtgtacggtg 4800 acagcacgtg gtccccacta tggcgcctcg tcagcctcct ccttggaacc acaggggagg 4860 taggattcta cgagaatggc aaattatgcc atacgtggga atcgacgagg ggtgtaagac 4920 aggggatggt acttggcccc ctgctattct ccatcggcac cttggcgaca cttcgccgac 4980 tgcagcagac cttcccggag gctcagttta ccgcgtacct ggacgacgtg acggtagcgg 5040 cacccccgga agagctgaaa aatgtctgcg cagccaccgc tgaagcaatg gaagcactcg 5100 gaatcgtcaa caatgcagac aaaaccgagg tcctcgaact gactggggac acaggctttg 5160 ggacagcggt gaagcgtgtg cgcgagttct tggagcgtac gtggccggat ccaatgagcg 5220 aggagattcg ggagggggtg gagaagaagg cgatggaaac agaccgcctc ttcaaggcaa 5280 tcgtggagct acccctctac aacaggacac gatggaggat tctggcgatg tcggcaatgc 5340 caaggatcac attcctgttg cggaaccacg atatgcaaca cacacaccgg gtggcttcct 5400 ggttcgatga gaggaccacc caggtaatgg agcatattct cgggcaaccc atgaccgaaa 5460 gggcccggaa tatagcggcg ctgcccgtaa gcatgggcgg ctgtggaatt aggcggatgg 5520 cccaagtggc agagtacgcc caccagtgcg ccggagagaa aggtctccag cagaggaaga 5580 cggaggaggc tgaccaaaga cagcaagacg acctctacgc cacccttggg ggtgctgatc 5640 gtcaagtctt tacagccaat accgccgccg gagctggcag gcccctcacg gatgctcagg 5700 tgaggctgga cgatgccact ttcggagtgt acctgcggga acgttactgt agggtactac 5760 cggagggggt caaatgccta tgtggtgaag acgcgagcaa tcaccacatc cacactggca 5820 ccaaagtgca caataaaccc aggcagatgc gacacgacat cattaacagc gtgttcgcaa 5880 acggccttcg cctctgtggg ttccagtgcg cgacggaacc acgcctaaat gaggtgagca 5940 agaggaggcc ggacatcctc attgcggggt tggatacgta cgcggtgacg gacatcacgg 6000 tgacgtatcc agggcgcgtg accgtcggaa acaccgccca aggtcagcgc tcagtagctg 6060 cggcagatcc aatgaaagcc gcattggtcg cgttccagga aaaggagcgc aagtacagct 6120 actgggcgat acaaaatgga ctggccttcg caccatttgt tatgcttaca aacggtgcta 6180 ttttcggcaa aagtcgtgac tggcttcgcc gcgtcctccg gggccaggac caccgactta 6240 cggtaaccac cgcattcgac gggataactg cggatgtggt ggcagccgtc ctccgcggga 6300 atgttcacgt ttacagtgcg gcacaagccc ggggagagac acttcggtag ttccagatca 6360 ctgggattac caatatccag atgtagagta gtaatagcaa taaaataaaa acaaccccct 6420 gaagaaaggg aaggtaatta gctaccaaat cattgccaac agggatccct ctccaccaat 6480 cgaccgagta ggtctctttt ttcggttgtg cgggctctcc cataagcccg atggagaaaa 6540 tctctttcca tatagggcaa taaaataata ataaatagat aggattatcc ggtccattaa 6600 agaccacgta acctgaaaaa ggttacactg catgttccgt gaaaatcgga tgaggtttcg 6660 gagatcaaca aaggtgatca cgtttaactc ggaggtcggg gcagttaaaa aaaaaaaaaa 6720 aaaaaaaaaa aaaaaaaaaa aattattaga acagtttctg tactatattg gtatgagaag 6780 ctcccagtag gagctgggcc aacacacgca ttgtgctgtt ggttcctgcc gcatactgcg 6840 ggaatctgga aggtggggtc ggatgacctc cactcttttt atttttttta tttttttcat 6900 ttatttattt ttttttgatc 6920 // ID BEL-626_AA-I repbase; DNA; INV; 6909 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-626_AA_; KW BEL-626_AA-LTR; Pao_Bel_Ele210; BEL-626_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6909 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5950-6519] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 48..4223 FT /product="BEL-626_AA-I_1p" FT /translation="MPRKTRSQTKSKSKVSAANWQRDNDTRATNDIPDVNN FT ALSAAVSDLVHPEENISLARMPQVRFAAAGVDVECDDRYDCVVCDQPNNAD FT QCMVQCRKCTNSYHFDCAQRSSGLKGNKDTFLCTSCVPRNTGPTTSSITGL FT SDTSSTRRARLARELELLEEARKLEEDAQREKLERERQINEKAAREKIERD FT KEYLARKRELLNQQDEADEVASLRSNHSIRSSTRRVEDWVVKQTTSNLGSA FT VAEDTSTSADPMHHVAANPIDNRTQYRSSTPVRDVLPGNKRFTIKHEPFDV FT DLISAARTTGSITIGDNELDEAIGGSLPADSNRVSSVISLVDLKPLQNMLE FT EQEAIVSNHGTRPEAPKSLALPYSQWQSETGYLRKTKELEQRHQKENEIRR FT KRELELVNQIRQLQLQKEADIQQRNEYEADVKKQLQLFNDRQNAMDLQRQR FT DLEQRDAEINRLRETEHSLKIQLHSLSLALQEKDVSRMQNRNAGEVVPNAV FT PAGQDRFQQIICPHPEELIVNSTRFPNKESTVNPNLVSATPASPHEINETP FT QPQPYLDHQFLNTSLPIGGSINDNLLPHTHYDHQTLNISTPRGGPSTSFNN FT IPTTQNAETYAYHQFEPAMPSNIRGNCVQQDVFRPSPEHLAARQVLSKELP FT TFTGDPVDWPLFHSSYHHSTQACGYSDFENLLRLQRALKGAAKDAVSSFLL FT HPSTVPQVMSTLQTLFGRPEQIVHNLIEKVRATPPPKPDRLETLVTFGLAV FT QNLCGHLKAVGLNWHLSNPSLLQELVDKLPANVKFNWAMYQQQFPVVDLNV FT FGEYMTRVTSATSSITLTNMQQKIVKDDRAKAKEKAFVNAHTTGEKDSEQQ FT INEDGTHLRTEQRPDLKMEQTKKCPGCKVEGHQASECEKFKALSLDDRWNL FT VKNNKLCRRCLTSHLRWPCKAEACGLNGCQKLHHRLLHYDPATELKQPERS FT TNATVTIHQSITTSMLFRVLPVTLYGKFGEVNTFAFLDDGSSVTLVEEEIA FT RALGLDGPRESLCIQWTGGINKGIADARMVQMEISAQGSEKRYKIGEAYTV FT DKLGLPRQSLDFEEISERFKYLRKLPVKSFQSAVPGILIGIDNVHLLATLK FT LREGRIQEPIASKTRIGWAVYGRIQGGELMQHRQLHICKRPSLTELHDYVQ FT EFFSLENLGIAAAPALESAEEERARKILVESTVRTNCGKFETGLLWKNDYV FT ELPDSRPMAERRLKCLEKRLQRNSTLYEVVRNQIMDYKLKGYVHEATKDEI FT ESFDPRRTWYLPLGVVTNPKKPGKVRLVWDAAAQVNGTSLNDELLTGPDLL FT TPLLAVMFQFREREVAISADIMEMFLQILIRPEDRSALLFLWRDSPEQPIR FT VMVSNVAIFGATCSPVQSQFVKNLNASVV" XX SQ Sequence 6909 BP; 2090 A; 1511 C; 1674 G; 1585 T; 49 other; agagctccga ataacgtgaa caaaactcaa gaatttaata atacatcatg cctagaaaga 60 cgcgctcgca aacaaagagc aaatcgaagg tgtcggcagc aaattggcaa cgagacaacg 120 atacacgtgc tacaaacgat attcccgatg tcaacaatgc attgagtgct gcagtgagtg 180 acctagttca cccagaggaa aatatatccc tggcccgtat gcctcaagtg agatttgcag 240 cagcgggcgt tgatgttgaa tgcgatgatc gttatgactg cgtcgtttgt gatcagccaa 300 ataatgctga ccaatgcatg gtacaatgtc gaaagtgtac caattcgtac catttcgatt 360 gtgctcagag aagtagcgga ctgaagggta ataaagatac gtttctatgc acgtcgtgtg 420 ttccccggaa cactggccca acaacaagtt cgataaccgg actgtcagat acttcgagta 480 cgcggagagc tagattggcc cgtgaactcg aattgttgga agaagcaagg aagttagaag 540 aagatgctca gcgggagaag ttagaacgcg agagacaaat taacgagaag gcagctaggg 600 agaagatcga gcgggacaag gagtacctgg cgagaaaacg agagcttctg aaccagcaag 660 acgaggcaga tgaagttgca agcttgagga gtaaccattc tattcggtcg tcgacaagaa 720 gagttgaaga ttgggtcgtc aaacagacga caagcaatct tggttctgct gtagccgaag 780 acacgtcgac atctgcagat ccaatgcatc atgtagccgc aaatccgata gataatcgaa 840 ctcagtatcg tagttcaact cccgttcgag acgtactccc aggaaacaag cgcttcacta 900 ttaaacacga gccctttgac gttgatctta tttccgctgc ccgaacgacc ggtagtatca 960 cgatcggtga taacgaactc gacgaagcaa ttggaggaag tttaccggcg gatagcaacc 1020 gagtttcgtc tgtcatttca ttagtggacc ttaaaccact tcagaacatg ctagaagagc 1080 aggaagcaat cgtttcaaac cacgggacta ggccggaggc tccgaaaagt ttggccttgc 1140 catacagcca atggcaaagt gaaactggtt atctgcgcaa aaccaaagaa ctcgaacaac 1200 gtcatcaaaa agagaatgaa attcggcgta aacgagagtt ggaactagtc aatcagatca 1260 gacagttgca actacaaaaa gaggcggaca tacagcagcg taacgagtac gaggctgatg 1320 tgaaaaaaca actgcaactg tttaatgaca gacagaacgc tatggactta caacgacaac 1380 gtgatctgga gcagagggat gccgaaataa atcgtctgcg agaaacggaa cattcattaa 1440 agatacagct tcactcacta agcctggctc tgcaagaaaa ggacgtctcc agaatgcaga 1500 acaggaatgc cggggaagtg gttccaaacg cggtgccagc gggccaggat cgattccaac 1560 aaattatctg tccccatcca gaagaactca tcgtcaactc aacacgcttt cctaacaaag 1620 aatctacagt gaatcccaat ctggtaagtg ccaccccagc ttctcctcat gaaataaatg 1680 aaacccctca accacaacct tatctcgacc accaatttct taacacctcg ctaccaattg 1740 gtggttcaat aaacgataat ctgttgccac atacccatta tgaccatcaa accttgaata 1800 tttcgacacc tcgtggaggc ccttcgacct catttaacaa catcccaacc acccaaaatg 1860 cagaaactta cgcatatcac cagttcgagc cagcaatgcc cagcaacata cgtgggaact 1920 gtgtacagca ggatgttttc cgcccatcgc cggaacatct tgccgccaga caagtacttt 1980 ccaaagaatt gcccacattc actggggacc ccgtcgactg gccacttttc catagcagct 2040 accaccactc tacgcaagct tgtggctact ccgactttga aaatcttcta agactacaac 2100 gcgccttgaa gggagcagct aaggatgcag ttagcagttt tttgctgcat ccttcaacag 2160 tccctcaagt tatgtctacc ttgcaaacct tatttggtag accagaacaa attgtccata 2220 acctgatcga gaaggtaaga gcgactccac cccccaaacc cgatcgatta gagacattag 2280 ttacattcgg actggcggtg caaaatctgt gcggtcacct gaaagccgtt ggactaaact 2340 ggcatttgtc taatccatcg ttgttgcagg agctagtcga taagttacca gcaaatgtga 2400 aattcaactg ggcaatgtac caacagcaat ttccagttgt tgatctcaac gtattcggag 2460 aatatatgac acgagtgaca tcagcaacca gtagcataac tttgactaac atgcagcaaa 2520 aaatcgtcaa ggatgaccgt gctaaggcga aagagaaagc attcgttaat gcacacacta 2580 caggagagaa agattcggag cagcaaatca acgaagacgg tacgcacttg cgaacggaac 2640 agcgtccaga tttgaagatg gagcagacaa agaaatgtcc agggtgcaaa gtggaaggcc 2700 atcaagcaag tgaatgtgaa aagtttaaag ccctcagcct agacgatcgt tggaatctcg 2760 tgaagaacaa taaactgtgt cgaagatgtt tgacttctca tttacgatgg ccgtgcaaag 2820 ctgaagcctg tggtctgaat ggttgccaaa aattacatca ccgtctgctg cattacgatc 2880 ctgcaacgga attaaagcag ccggaacgta gtactaatgc aaccgtcacg atacatcaaa 2940 gcatcactac gtctatgctc ttccgagtgc ttcccgtaac actgtatggg aaattcggag 3000 aggttaatac gtttgctttc ctagacgacg gatcttcagt gacactggtt gaagaggaaa 3060 tcgcacgagc actaggacta gacggaccaa gggaatcgtt gtgcatccaa tggacaggtg 3120 gtatcaacaa aggaattgca gatgctcgca tggtscagat ggaaatctct gcgcaaggga 3180 gcgaaaaacg ctacaaaatt ggagaggctt acactgttga caaactagga cttccacgac 3240 aatctttgga tttcgaagaa atatccgagc gctttaagta tctcaggaag ttacccgtta 3300 aaagttttca atcagctgta cccggaatac tcataggaat cgacaacgtt catcttcttg 3360 ctacactcaa gttgcgtgaa ggacgtatac aagagccgat tgcgtcaaaa acacgaattg 3420 gttgggcagt atacggaaga attcagggag gagagctgat gcagcaccgt caattgcaca 3480 tttgcaagcg accttcgctc accgaactcc atgactacgt tcaagaattt ttctcattag 3540 aaaatttggg catcgcagca gccccggcct tagaaagcgc ggaagaagaa cgagcccgaa 3600 aaatattggt ggagtctaca gtgcgtacca actgtggcaa attcgaaaca ggactactgt 3660 ggaaaaatga ttatgtggaa cttccggaca gcagacctat ggctgaaaga cgtctgaaat 3720 gcttggagaa gcgacttcag agaaattcaa cgctatacga agttgttcgt aatcaaataa 3780 tggactataa gcttaaggga tatgtgcatg aagctactaa agacgagatt gaaagcttcg 3840 acccacgacg cacgtggtat ctgcctcttg gtgtcgttac aaacccaaaa aagcctggaa 3900 aagtccgcct ggtttgggat gcagcggccc aggtaaacgg cacttcttta aacgacgaac 3960 tcctgacagg tccagacctc ctaactcctt tgttagcagt gatgtttcaa ttccgagagc 4020 gtgaggtagc aatatcggcc gacattatgg aaatgtttct acagatcctt attcggcctg 4080 aagatcgtag cgctcttctt ttcttatgga gagattctcc tgagcagccc ataagagtaa 4140 tggtttccaa cgtagcgata tttggagcga cttgctcccc agtgcaatca caatttgtaa 4200 aaaatttaaa cgcatcggta gtatgaagaa aactacccaa gagcttcgga agccataaaa 4260 aacaaacact atgtcgacga ctatcttgat agtgtcgaca cggctgacga ggcagttcaa 4320 ctagctaatg acgtaacgac tgttcatggg aaagcagact tcttcattcg gaactggaga 4380 tcaaacagac gcgaagttct tgaacggata ggcgaagtta atacagtggc aactaaacaa 4440 ttcactgctg gtaaagaaat aaaccttgag cgaatcctag gaatgatatg gctgcccgac 4500 gaagatgttt tcgccttcaa cttctgttta cgaggtgatg tccaacgtct tatgaatggt 4560 gaagtaattc ccactaaacg agaagtatta ggcgttgtaa tgagtctata cgacccgctt 4620 ggaattgtgg ctaccttcat agttcacgga aaagtaattg ttcaagatac ctggagagag 4680 agcatcggtt gggatgataa aataccatcc sgaatatttc agaagtggaa gcggtggctg 4740 aacttgatgg cagtgatctc taaagtgaga atcagtcgtt gttattttcc gggctatgac 4800 ccgatcagct acaattcgtt gcagttgcac atttttgttg atgcgmktac agaagcctmc 4860 gctgcagtsg cwtaccttcg kgktgtagac gaaggagaaa ttagatgttc gcwagtctct 4920 tccaaaacta aagcagcgcm tcttcaagct ttatmaatcc cgcgtctwga kttgatggca 4980 gccttgatcg gggctcgttt aagaaaaaca atcaaagaca acaacactcc ctaaaaatca 5040 gtcgtacaat tctctggagc gattccacaa cggtgatggc ctggataaaa tctgactctc 5100 ggcgttatcg tcaattcgtg gcatttagaa tcaacgagat actcagtttg acttcggtag 5160 atgaatggag gtggatagga acgagagcta acgtcgccga cgaggcaaca aagtggggta 5220 aagggccttc ggtttaccct gatagccgat ggtatcgtgg acccacattt ctgtatgaac 5280 ctggtaatga ctggcgggat gagcaatgtg tggaaactac cgaagaactt cgmccatcat 5340 ttgtatgcgc acatgtcckc gcagaaccaw tgattggttt tgcgmgattt tccaagtttg 5400 aacgattgaa sagaactgta gcctatgctc awagatacat ggacwwcttg acgtcgagtt 5460 gtcactggtg ttccacgcga aactggawsc aatmtaggac aagmggagct gcagaaggmt 5520 gaggtaacgt takkgatact tgcacagtta gaagcctatc ctggcgaagt sgcagcgttg 5580 aagmasaaca aggaaagggg aacgaaattg wcaattgagw gmtcgagswa gwtagcaaat 5640 ctcccaccma tgatkgwtgg tcaggwtgtt ctccgagtwg atggtcgtct ggatgctgma 5700 ggctgttcka catttgatgm gaagtatcca gttatacttc cgagggaaca ccgcctgaca 5760 gaacttgtta taaactggta tmacgataag taccggcacg ccaatgatga gactatactg 5820 aatgagattc ggcaaaagtt ttttgtatcg aaacttcggg caagcctacg aaagacgaag 5880 tcgcggtgca tgtggtgccg agttcacaag agtatcccgg tgccgccgaa aatgggacca 5940 ctacctaaag ttcgtttaac accatttgta cgagcattca cattcgttgg tatagattac 6000 ttcggcccat atttagtcaa ggttggccgc agtgtagcca aacgctgggg agtagttttc 6060 acttgtctga ctatcagagc aatacacatt gaggttgcct gtagtttatc ggccgattct 6120 tgcaaaaaag cgattcggag attcatcgca agacgtggag cgcctcaaga aatttattcc 6180 gataacggaa ctaatttcat cggcgttagc agggagcttg caaaggaact cgcagagatm 6240 aattcagagc tcggcaattc tataacggac acttacacaa agtggaggtt caatccgccc 6300 tcagcaccac acatgggagg atgctgggag agaatggtga ggtccatcaa aacggcactt 6360 gaagtcatac caatcgaccg caagctagac gacgaatcat tagcaacgat gtttttggaa 6420 gccgaaagaa tgatcaactc gcggccattg acgtttgtat ccctggcgac gtccgacgat 6480 gaagcaatta ctccgaacca tttcctgctg cttagttcaa gtggagttca acagcctgta 6540 aaactgccgg tggacgaaaa acaagctctg cgaagtagtt gggatatgat tcaacgcacg 6600 ttggatcgga tgtggcgacg ttggatcaca gagtaccttc caacgattgc aagaagaaca 6660 aaatggttca aaggtgttcg awcaatcaaa gagggtgatt tggtggttat tgctgatgag 6720 aaggtcagaa atagatggtt gaggggtcgg gtggcgcgtg tatatccagg taaagacgga 6780 acaatacgga aggctgatgt aatgactact ggaggtatac taagtcgagc tgttgctaag 6840 cttgcgctat tagacgtgga accgaaggat gacgccggat cggaagttca ggcgacatga 6900 gggggagga 6909 // ID STREP_CD repbase; DNA; INV; 1507 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Chironomus dilutus subtelomeric repeat, consensus. XX KW SAT; Satellite; Simple Repeat; STREP_CD; subtelomeric repeat. XX OS Chironomus dilutus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RA Rosen M. and Edstrom J.; RT "DNA structures common for chironomid telomeres terminating with RT complex repeats."; RL Insect Mol. Biol 9(3), 341-347 (2000). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Chironomus dilutus subtelomeric repeat, consensus."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [2] (Consensus) XX SQ Sequence 1507 BP; 600 A; 250 C; 218 G; 439 T; 0 other; aattctcaaa actgagcgag ctagagcaaa aaaaccactt tccatacaaa atttcccaaa 60 aaaaaaatct cgaaaaaatt ggaaaaatcg tcctagctcc tccatttctc aaccattttg 120 ggaggtttat atatcgatgg atagctctca ttcgtagcta tccaacggtg caaaaaaatt 180 aaaaaaattc gttaaactga gcgagctaga gcaaaaaaac caaaaatgca caccccccca 240 tcaccaaaaa atcgtcataa ctcctcaaat tttcaacttt ttgaagaggt ttatatatcg 300 ttggatagat ctcattaaaa ctcaactttc acccatacaa accaccactt tattctcaaa 360 gctgagagcg ctagagcaaa aaaactaaaa atatccattc acttagctaa gtttcataat 420 tattataaat ttaacggttt tttgaaattt gattttttaa cggtttttta aaattcaaat 480 gtccatgaga aaatgacatg agtgacattg gattcatcgt ttcttgaaaa tttttatttc 540 cttaattttg tctttattaa gagcatttaa aggaggaact cgatataagg aaggaagaag 600 aacatgttag gaagatattc ctacataatt ggatcattaa aagtgagtaa aaccatcaaa 660 acaaaattca tcaaaatttt tagcatggaa aggccccgcg aatttgccgc cgccaggtcg 720 ccccgatatt ttggctatgt ccgcggttag gtaaaaaagt tgaaaaactc attaaataat 780 gagttaaatt aattacaaat actcaataag tatccgcaaa catggcaaga tttctttaaa 840 ttccacaaat cttgtcaaat aacaatacaa attgctcaaa aatttaattt tctcaaaatt 900 tttacaaaat aatcaataaa aaaggtaagt aaattaaatc agaatgtcat caaaagaacc 960 agcattaaaa ataaggagga tagaacacaa catatctgag aaaatattaa aagaaagtgc 1020 atgaaaatta aaatttctga aaaaaaaatt cagtccttgt atgaaaaaat tattttcacg 1080 cgcaaatttt tcattttatt aatttaagct gttaaacagc tttatcttta tttcagcata 1140 ctaaaaaacg attcagagga caaaacaatg ctgaggaaat agaagaacaa caatcgaagt 1200 atgtattgag gtcaaaaaat ttcatttttt attttggcat ataaataaac atttttattt 1260 tgccaggagt tcgctgtaca actaaaagta gtaaaagtgt caattaaggc actggaataa 1320 aagagtagga agatcacaac aataaggaaa aaagacatta caaaattctc acaagttctg 1380 cagctttaat tttcttatca atcgaaattt atttggattg caaagtgtta gatgacggag 1440 aaagcctcga tggctatcaa ctgctccaac caaaatattc cgagtcccag aaaggcatgt 1500 tttgcca 1507 // ID Gypsy14-LTR_Dya repbase; DNA; INV; 417 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14_Dya; KW Gypsy14-I_Dya; Gypsy14-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-417 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1095-1095 (2009). XX DR Genome; chr3R; Positions 6836115 6836531. XX SQ Sequence 417 BP; 173 A; 70 C; 90 G; 84 T; 0 other; tgtgggagag atatttccac atattcgaat atgaacattg acgacgctct ggacaacgaa 60 aacgcaaaga caataattca aggaaaacag aagttgaagt cgacgatcaa cataattcaa 120 ggaaaataga agtcgacgtc gacgatcgca atacgcggaa aaagagaagt tgaaccaaaa 180 ggaaactacg acgtatatca acgattttca gaaggaagcg actgatccca atatgagcgg 240 cgagaacgca gacgacgggg gcagtttgtg acgagagccg agagaagacg gaagtattcg 300 gcgagagaag acttaaacga cgtgaagact attatctaat attaaaatag atgttattaa 360 ttaaaactat acttatcata ataataaaac atcaatatta ataaataaaa ccccaca 417 // ID BEL-95_AA-LTR repbase; DNA; INV; 717 BP. XX AC supercont1.291; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-95_AA_; KW BEL-95_AA-I; BEL-95_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-717 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.291; Positions 20539 19823. XX SQ Sequence 717 BP; 198 A; 164 C; 169 G; 186 T; 0 other; tgtagggaag tagtatcgca cccccaagcc cattcgaact acatcactgg ataagacaaa 60 ccaactgcga tcagcctgaa ccacacaata gctgcagatc agccaggaag gctatagata 120 gcagtggaaa gagaccagac aagctgatgg taaacaacca ttcgtggcta cctgccttgc 180 tggatgacca ctgccatagc ttgcgagata gctgatgata gcatgtgaga tcgttggaga 240 gctgtccatt tgatatcgca gtggtggata tgcttcagga gatgcctctc tcatgttacg 300 actctcatct ccgaattcga tcatttgaga tccctacagc aggtttagga atgacctttt 360 ggtgtcgtag aaatcagttt tagattgaac cttccctccg atagcaggcg cgccggttag 420 aggcctacgc acggccgtat ctgaatgtta cgtgtttaag tttaagaaat cgatgttaga 480 ataagtccaa taaatgtagt ccgttgtagt tttgtgtgtt ttttagaaca aatatagtgt 540 tttaattgta ccgcgagttc tttattgcca aggaagaaga atgctcagtg ttcgcttaaa 600 ggaagccttc taccgcaaag gataattgtt gcccacccaa tccaaccaca tacactggtc 660 gagtgctgag gagccatccc acccagtcga aggaggtgat tccccggttc cccaaca 717 // ID BEL-179_AA-LTR repbase; DNA; INV; 723 BP. XX AC supercont1.1; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-179_AA_; KW BEL-179_AA-I; BEL-179_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-723 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1; Positions 522028 521306. XX SQ Sequence 723 BP; 188 A; 178 C; 163 G; 194 T; 0 other; tgtttgcaac caacgattta caccctcctc agactacatc cctgacgtgt atgttaactg 60 cgggcaaggc tatagcagga actattgtac cctgacctac ctctacctaa atgcaacaat 120 ccatccactg gcccgattgt gacgaccgct aaacagtagc catccagtac cagcagttga 180 gaatctgata cacagcgtga tcatcactga ctgtgctaga tctatcacca tcatcaatca 240 tcgattggac gagcgataga cgagctccga taacggtgat aagagagtga gatcctcgct 300 ctcggtttac cgaatctacc accggctagg gcagcatctg gattgatgat gatcgtcagt 360 cgaattgggt cgccgtcgag aacgtacact gggcaccgag cccatacgca agttttgttt 420 taatctgtct agccataagc gaaatataga tataagttta gatgatgttt tttgtccaaa 480 tataaggtgt tttaattaaa gtgtttgttc taatgtaaac tcgcgtgttg aaatgtgcat 540 gtgataaggc ctttggacta gatgatcgca agctgggttg atcgttaggc cgtaggatcg 600 gtctcctctg gatcccctct atcccctaca atcaatcggt cgatgtcgaa cctggctccc 660 cactgcaacc cgttactcct ctgctatcga agttctggag gattgaacag ccaggtgcgt 720 aca 723 // ID Gypsy14-LTR_Dpse repbase; DNA; INV; 1179 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14_Dpse; KW Gypsy14-I_Dpse; Gypsy14-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1094-1094 (2009). XX DR Genome; Unknown_singleton_87; Positions 6297 5119. XX SQ Sequence 1179 BP; 454 A; 225 C; 266 G; 234 T; 0 other; tgggacccta ccaagttgtt ggaatcaaac ataacaatgc gtatgatgtt cagaaagtcg 60 gaaactgcga tggcccaaag aatacctcat cgtgcgcgga gtacatgaag ccatggtcca 120 ttggcaacga cgacgacgac gacgacgacg acgacgacga cgacgcattc gggtcgaatg 180 cttaatcaga atggccgaat gtgggttggt gtaccacaca aaataccaaa tgaagataag 240 atccctagac gacgagcaac gatgatccca tcactaaaaa gaacgacgaa caacgattat 300 gccatcacga agaagacgac gagcaagaag agacaagcag agaaataaaa cgacggcggt 360 accgaaaaga cgcaaaaacg acgacgcccg cagagaagaa gaataaaacg acgcagaata 420 aagaatacat ttatcctaaa tttatcaaac attaaataca aattaattat taaacataaa 480 gtaaataaaa accccacatt ttgggcgctc ctccgggaat gaattaaaag ggaatgaatt 540 cttctattga agaagaaacg aaaatataaa tttttttccc ccacaatgcc gtatttaaac 600 gatttaatat gtgaaaatct acgctcgagc aagaaagagt gcatagcggc actgcataaa 660 ttcgtgttcg aggccgaggg agacagaatc aaccgaaaag gattacgcga attttctggc 720 ttcctctacg acgaaaatga tgagaggtac aaagcaaaga aggaatacgc agtaaataat 780 ataacacata gtgatcttgt ggctatatgc aatgtgttgg gaatcagtta tactagtgag 840 gaccttttct tgcatgtgtt ttgcaatttg cgttcgatga ccctgctagc gcatgatgaa 900 acaagcaacg acgaaactga ggatgaagag gatgatgatg aaaaagaaaa ggatagcaac 960 aattcgtttg agacggcaga ccatgccaac atagaaaaca ttgcagggaa aaacgaagac 1020 acagaacaaa gaagacacgc tacaacatac aaacaaatag tggagagaga tatgacgaca 1080 gcattcaaag tttcgacggc actaatacta tttccagtcg aagtgtggat aacagaatac 1140 gaggaacagg cgacgctaat gaaaatggaa tgatttcca 1179 // ID DNA8-9_CQ repbase; DNA; INV; 882 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-882 RA Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 86-86 (2011). XX DR [2] (Consensus) XX CC ~92% identical to consensus. 8 bpp TSD. Putative hAT element. XX SQ Sequence 882 BP; 326 A; 136 C; 140 G; 278 T; 2 other; tagaggtggg caaaaaaaga gcgagccgct caaagagccg gttcactgaa aagagcgaac 60 gaaccgtggc tcncaaaaaa agaaccgcgg ttctttttta aacttcggtc ttttggaaga 120 accgtttcaa gatggcatga tttttgttat aaatataatt tattgaattt ccaaaaaaat 180 gtagtactaa atttgttttt gagaattgaa aatgcaattt caaatatttc aatgcttata 240 aaaattaatc ccaaaagtaa atgattttct aaacgaactg aaaaaaactt ttcattgtac 300 aactgattgt atattaaagt attaccggaa tacaaatttg agcatttcaa attgcataag 360 ttgtaaaata ttaccaaaaa tctactgaat agtttagaga ttttggaaaa aaaatccttt 420 tgtccgatta aactttagcg tttatagact ttatcgatta aatcagaatg tagaaaagag 480 ntcacagaat attttctcca aataaaatat cagcattagt tcaattcttg cagaaataaa 540 cgcaatttta aaattaatag cagtttttcc agcttcccta gatttaagtt tggatgccaa 600 tcaattactc gaatgtttag gttcgagtag aaatcttgac atagagaccg ccaagaccta 660 tggttttcaa gggcatcttc tcgttttcag ttttaatttt ttttatcaaa taatttataa 720 aagtccaatg tgaaataact tataaaatca cacgattctt aaaaaaaacc ttttcatcca 780 aattttgaaa aagagcgaaa gagccgttca aaagagcggc tctttttaat gagcgaacga 840 aaatgagcgg ctcctaaaaa agagcggttt tgcccacctc ta 882 // ID hAT-26_SM repbase; DNA; INV; 3040 BP. XX AC . XX DT 13-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-26_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3040 RA Bao W. and Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 75-75 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(608..811,754..1161,954..1538,1604..2047) FT /product="hAT-26_SM_1p" FT /translation="MQRKISKRSQESDSRCEFNKQWEDELLFITSSSGKPF FT CIVCESTLSRNKNMILRDTTRHIIRLLWKKKKHDLKRHYTTHYQTIMEEKM FT KLILGSELRKEYVTKKKEEIRKRQNIFVKVSSEDLAMTEASYEIGFSIGKK FT RNLFLMKKKLLNLVSKXLSTVIMFSKXKQTKLLYQGRQLQDVSRNSLRMYF FT RNWNTLYILVPSFLHWQKKKPFSDEEEIVKPCLQIXIYGNNVFEXKADEIA FT ISRQTVTRRIEELSQDVFQKLEYLVHSCTFFSLALDESTDICDVAQLNIFI FT SGIDXNFNVFEELVSLESFHGKTRGLDIFEKVESCLENLKIDFNKLISVCT FT XGAPSMIGKVNGATTLLENFLKRPILKYHCIIXQEALMLXNYKYTLRXVAS FT CEIFEIVRFWKLKNAGFYFLEQENQLPDERTLLFDKKWLLDLAFLIDVTSH FT LNDLNLKLQGKNKLFPNLVNSVSSFKMKLKLFISQLENKDLSQFPHLKEQS FT EFAADKSSLTKYIEKIKILQESFESRFHDFTKEEDCMLAFINPFSLSEQNV FT " XX SQ Sequence 3040 BP; 1086 A; 467 C; 501 G; 970 T; 16 other; ccaggggtgg ccaaaccatg gcgcatgcgc caaacgtggc gcattgaaag atttcatgtg 60 gcgtatcctt gcgcttactt atttttttaa aaatagttat cataactgaa cccatctgaa 120 tattttaaca ccaaagtgat cacagtttaa ttcgtataac cttctgtagt atacaactag 180 gtaagtttac aaaatacaaa tgtttttatc ggtgttacca tgmgtattac tatttaaatg 240 attgtaaacc tckcgcacta gcaacacaga atatgtttga agtcacgttc gaatgagcgg 300 tgagtgtttt atattgctag cagttctgcg tgagccatcg tggcgtggag tttgaaggag 360 tgttttattt agtatactag cgtagttttt gtattgagag tgattaaaag acagattcaa 420 gattaaatat ataacatttg tcaaataaga gcgccatttc cttttcttgg ctcaaatata 480 taatttaaat tcaagaataa ttgatattct gtgtaataag acgtacgtac ttttaatttg 540 ttttattctt tttaaatttt ttttagttaa aaacttttat atacttaaat tcctgttgtt 600 cattaccatg caaaggaaaa taagcaaaag aagccaagaa agtgacagta ggtgcgaatt 660 caataaacaa tgggaagatg agttgctgtt tataacaagt tcatcaggca aaccattttg 720 tattgtttgt gaaagtactc tttcacgtaa taaaaacatg atcttaagag acactacacg 780 acacattatc agactattat ggaagaaaaa atgaagctta ttcttggatc cgagttacga 840 aaagaatatg taactaagaa aaaggaagaa atcagaaaaa gacaaaatat atttgttaaa 900 gtaagtagtg aagacttggc aatgacagaa gcgtcatatg aaattggctt tagcattggc 960 aaaaaaagaa acctttttct gatgaagaag aaattgttaa accttgtctc caaatawtta 1020 tctacggtaa taatgttttc gaaanaaaag cagacgaaat tgctatatca aggcagacag 1080 ttacaagacg tatcgaggaa ctctctcagg atgtatttca gaaactggaa taccttgtac 1140 attcttgtac cttcttttct ctagctttgg atgaatcaac ggacatatgt gatgttgcac 1200 agttaaatat ttttataagt ggaattgatr ataattttaa tgtctttgaa gaacttgtca 1260 gtttggagtc attccatggt aaaacaagag gattagacat atttgaaaaa gtagaatcat 1320 gcytrgaaaa tctaaaaatm gattttaaca aacttatcag tgtgtgtact gamggagcac 1380 catcaatgat aggtaaagtc aatggcgcta caactttgct tgaaaatttt ttaaaacgtc 1440 ctatactgaa atatcactgc attatacwyc aagaggcact catgttgyaa aactataaat 1500 atacattacg twatgtagcc agttgtgaaa tttttgaata aaaaaagagc aaaagmtata 1560 gtgagatgag attttaarga cttatccttc atacaggttc tgaattgtta gattttggaa 1620 attaaaaaat gcyggttttt attttttgga acaggaaaac caactacctg atgaaagaac 1680 cctattattt gacaaaaaat ggctattaga tttggcattt ttaattgatg ttactagcca 1740 tctcaatgat ctaaatttaa aactacaagg aaaaaataaa ttatttccta acttagtaaa 1800 tagcgtcagt tcatttaaaa tgaaattaaa actattcatt tcccagttag aaaacaagga 1860 tttaagtcaa tttccacatt tgaaagagca aagtgaattt gctgcggata aaagcagttt 1920 gactaagtat attgaaaaaa tcaagatatt gcaagagtca tttgaaagtc gttttcatga 1980 ttttactaaa gaagaagact gcatgctcgc atttataaac ccattttccc ttagtgaaca 2040 aaacgtataa tgcagatgcc tagtaacata caaatggaac tgattgattt aaaaacaaat 2100 tcattcttga aaaatgaaat ttgatgagct tttaaaagat caaaatgcgt ctgatataat 2160 taatttctgg cattcattac caaatgaaca tttcccagag ctaaaaaaag tatgcaaaaa 2220 aatacatatg tcgctttgga acgacataca gatgtgaaca agtattttcg tctatgaaat 2280 tgattaaaag caaaacaaga tcgcgaataa ctgatctaaa tttgaaaaac tctgctactt 2340 tcggtgacaa atttaacacc aaacatcaaa aaactagcta aatccaagca aacgcaaaag 2400 tttcaataaa gatagtaatg tttttgtgct gtaataaata ttgtttttta tacaattata 2460 aaaccattta aaaatgttga ttatgtcaat gagatatttt atttaaaaac cccattaatg 2520 ttataaatta tatttcatga ccttttctag tgagtagaga aaacagacaa ctccacgtta 2580 catgcagatc tcgctctcgg ttcttaaatc gctcagagct ttctttctca ttcatcgcat 2640 caaacaaccg tggtccagaa ttatctttat taagagaggc ttcccggcag ctgtagcatc 2700 accactgggg tccactacag caatcatttt ctcgtacaag agattatttc tactattgta 2760 gattgatctc caataaacat gtcttttgaa atagcgtaaa ttaattgatt tgcatgtttg 2820 agttatttcc tttccctact taagcaaacc agaactaccc ctattttaca ttttgagtta 2880 ttccttttcc ttagttaaac aaagcagatc taaccctttc attattatac aacccaacgt 2940 gaaagtaaaa agataaaaaa tgtgcttgag ttatatggcg catataaact cccctgtgaa 3000 aacattggcg catggtatgc ttaaggttgg ccaccactgg 3040 // ID Gypsy-115_AA-LTR repbase; DNA; INV; 258 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-115_AA_; KW Gypsy-115_AA-I; Gypsy-115_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-258 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 473009 473266. XX SQ Sequence 258 BP; 68 A; 49 C; 80 G; 61 T; 0 other; tgtaacgatg tatggtttca tcgagtcgcc aacggtcaag ttgaactacg tacgctttca 60 gtgacgggta tcaggaatga cagcttgaaa gaagaatgta ttggtacatg catgacgaaa 120 ccgtgtgtgc gtgggtttac tcgggtacgg caaaggtgct gccgaacacg agcgtcattc 180 gtggtacagc agtgtagcag aacagacggc tgagctcgag tgtgaagcgt aacgagggct 240 aattattcgg ctggtaca 258 // ID L1BM repbase; DNA; INV; 3187 BP. XX AC AB002279; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 16-DEC-2009 (Rel. 15.01, Last updated, Version 2) XX DE Bombyx mori L1Bm DNA, non-LTR retrotransposon. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Repeat sequence; L1Bm. XX NM L1BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3187 RA Ichimura S.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (27-MAR-1997). Sachiko RL Ichimura, National Institute of Radiological Sciences, Division RL of Biology and Oncology; 4-9-1, Anagawa, Inage-ku, Chiba-shi, RL Chiba 263, Japan (E-mail:ichimura@nirs.go.jp., Tel:043-251-2111, RL Fax:043-255-6497). XX DR GenBank; AB002279; Positions 1 3187. XX SQ Sequence 3187 BP; 802 A; 951 C; 702 G; 732 T; 0 other; cgcaagcttg catgcctgca gtccccacgg tctcggcgtg gaaaaagccg ctgccgtata 60 cgaaggcggg aacggaaacc gcacccccgc ccccgctcgt gacacgcccc gcgcccgcgt 120 agttccgcgt cggaactcgt attcaaaaat cattcgaatt ttcgaaatat tcatctttct 180 tccgtccccc gcaaccctct gccgtcaacg acttcgcgct cgtgcgtgac ttcgtcaccg 240 tgtctaagct gattaaaaaa agcaactgaa agtcgataat cgaatgttgc acatttaaaa 300 cggtcaactt cgaccgtctg cgatcgttcg cagacgcgat aacgtaggtc ggtaaccccc 360 aaaaagatcc tcataggttg tccacaagtc caccatttga aaaaagtttc actcaaacca 420 gaacagcggc tcgcggccgc tttcgatcac atggacgtct acgagtccgt gtcgcgtacg 480 tgaaggtcct atatcagttc gaagtaccta ttaaattaaa acggaaattt catactttaa 540 ttataaaatc ttaccggtaa ccaatggcgc aaaaaggtag agaaaaaccg tattccctta 600 cccctattat tttgtgcgtt tactaggtag gtattcaagg cgtaattttg caatttaatt 660 cgttagcatt ctataatgct aacggactcg cgcggcaacg gattccaaat ttttgaattc 720 ctccgcgata atcttgtaga tattctatta gtgcaggaaa cctgtctgaa gccctcgcgt 780 cgcgacccga aagtcgcgaa ttacgtcatg gttaggaatg acagactcac cgcctccaaa 840 ggcgggactg ccatttacta taggcgggcc ctgcacgttg tccctctcga tactccctcg 900 ctctcacata tcgaggcgtc agtgtgccgt atctcgctga cgggacacca gccgatcgtc 960 atcgcatccg tttatctccc cccggacaag ccccttctga gcagtgacat cgagtcactg 1020 ttcggcatgg gagactctgt catcctggca ggcgatttaa attgccacca cactaggtgg 1080 aactgccatc gtacaaacgt taacggtagg cgtctcgacg cgtttataga cgacctcacc 1140 tttgaaatag tcggtccccc aactccaaca tgttattcgt ataacatcgc gctccgtccg 1200 agcactatag acctggcatt gcttaggaac gtaactctgc gcttgcgttc catcgaaggc 1260 aatgtcagag ctcgactcag accaccgacc tgtcgttatg cagctcggtc gccctcacaa 1320 cccagtcact gttacgagga ccatggtgga ttggaataag ctgggcacgt gcctagccga 1380 cgccgctccg ccaatcctcc cttacggccc ggattcgaat ccatcccccg aggacaccgt 1440 cgaatccata aacatcatta ccgatcacat ctcttccgcg atcattagat cttctaaaga 1500 agtcgatgtg gaggacagct tccaaccgca tcagactgtc ccccgatctt aggaatctct 1560 taagagttag gaacgcggca atctgggcct acgatcgtct tcccacgcat tcaaaccgga 1620 ttcagatgcg tcgtctacaa cgcgaagtcc actcccgctt aagcgacgcg cgtaacgata 1680 attggcatag ttatttagaa caactcgcgc cctcccacca agcatactgg cgactagcta 1740 ggactcttaa atccgaaact accgctacta tgcctcccct cttacgccct tcaggccaac 1800 caccggcatt cgatgacgat tacaaagctg agctgctggc ctaactgctg caagagcagt 1860 gcaccaccag ccctcgacac gcggaccccg aacacaccga gttagtcgac agggaggtcg 1920 agcgcagagc ttccctgccg ccctcggacg cgttaccccc cattaccacg gacgaagtta 1980 gagacgcgat ccacaacctc caacctagga aggcaccagg ctccgacggc atccacaacc 2040 gcgcgctaaa aattttgcca gtccaactga tagcaatgtt ggctacaatt ttaaatgccg 2100 ctatgacgca ctgcatcttt cccgcggtgt ggaaagaagc ggacggtatc ggtatacata 2160 agccgggcaa accgagaaac gaaacttcta gttaccgtcc gattagtctc ctctcgacga 2220 taggaaaaat ttacgaacgt ctccttagga aacgcctctg ggattttgtt taccgcgaac 2280 aaaattctca ttgacgaaca gttcggattc cgctcaaaac actcgtgcgt acaacaagtg 2340 caccgcctca cgagcacatt ctgataggac taaataggcg taaacaaatc cgaccggcgc 2400 ctcttcttcg acatcgcgag gcgttcgaca aagtctggca caacggttta atttacaaac 2460 tgtacaacat gggagtgcca gacagactcg tgctcatcat acgagacttc ttgtcgaacc 2520 gttcgtttcg atatcgagta gagggaactc gttctcgtcc ccgtcaactg actgccggag 2580 tcccgcaagc tccgcgctgt ccccgttatt atttagtttg tatatcaatg atataccccg 2640 gtctccggag acccatctag cgctcttcgc cgatgacacg gctatctact actcgtgtag 2700 gaagatgtcg ctgcttcatc ggcgactcca gatcgcagta gccaccatgg gacagtggtt 2760 ccggaagtgg cgaatagaca tcaagcccac gaaaagcgca gcggtgctct tcaaaagggg 2820 tcgccctccg aacatcactt cgagcatccc cactccgtag taggcgcgca aatacctccg 2880 ccgttagtcc catcactctc tttggccagc ccataccgtg ggtctcgaag gttaaatatc 2940 taggcgtcac cctcgacaga gggatgacat tccgtcccca tataaaaacg gtacgcgacc 3000 gcgccgcgtt tattctagga cgactctatc caatgctttg tagtcgaagc aaactgtccc 3060 tccgcaataa ggtaactctc tacaaaactt gtatacgccc cgtcatgacg tatgcaagcg 3120 tagtgttcgc tcactcagcc cgcacccact tgaaatccct tcaggttatt caatcacgat 3180 tctgcag 3187 // ID CR1-71_HM repbase; DNA; INV; 4253 BP. XX AC . XX DT 28-DEC-2008 (Rel. 13.12, Created) DT 28-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-71_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4253 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1898-1898 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 229..1029 FT /product="CR1-71_HM_1p" FT /translation="MVTQKQLNDALENADKIAKEELKQALYVIKTLTARID FT ELTNRIDLQDIEIKSLKDSKSSEKPLLSKLVEQVGKPGTLANVAVVSALNK FT HNKDVTNREKRIVVFGLPESVKTQPSEITKDDTAEFKKVMASLSKSVNVVR FT IERFKKKKNFDSSKPAPLMIEVDSVFIRNDVLASAKQLASGTFKDIFIRPD FT RTAAEQSEFIKLNLERKAANSDLDKLGKLDKPFRFVIRSDKLRCIDVSQEV FT EINGRKKHPFIKWKDAQAARIAKTQ*" FT CDS 935..3985 FT /product="CR1-71_HM_2p" FT /translation="MSRRKLKSMDVKSILSSNGKMHKQPELLKHNESNRLT FT ANTYLNALNTKIKSAKSNPKNKLLKCFYTNATSLNPTKMNELSALCDINMP FT DLIFISETWFTETSATQLNNYSLVKKNRIGHGGGVAIYIKRDLVAHEVSDC FT HLRETLNNSNSEQIWCEIKIVNEMILIGCVYRPPLSHIDDINKTIISAKQA FT VDRKAYSCMLLAGDFNFPDIKWHNDERIELLSGQNSAASVFLVTLANHNLE FT QLVDFSTFEASNGNAKNVLDLIITDSSTRVINLSSSMPLGDLSQGHRILSW FT DYVVLSKNRTTFSNKKYDFNKGDYKNFGKKIMETNWQQLFENKNTNECYEL FT FCNKYDKLSKQFIPLKKVHSTRNAPWMNKEVLAMIKKKQQLGNYLSMTNWG FT STTLICEYKKLRSQVQKACTHRVRVFEAQLASDKNNPKRMYAYAKAQQNVH FT VSIGAISDAKGETLTEAIHIANRLNEHFKSVFVDDSINDQLPVFERRHNQE FT DLGDVIISFEATLAYLNGLNPNKSIGVDNISPKVLKECAAQMTYPLTLIYN FT KALSEGSTPLAWKQSHVTPLFKKGSRLDAANYRPVSITSVPCKVMEKIIKE FT QITKYLEKTSCISNNQHGFMSKKGCTSNLLESVDYITKALSKRNFVDIAFL FT DFAKAFDKVSHRRLIHKLKAYGINGNVGKWIESFLTSRKQRTVLGNHSSDW FT TDVLSGVPQGSVLGPTLFIIYINDLTDNLKSVHKIYADDTKLLQEIRPEFH FT DADCLILQNDLNIISEWSKEWLMELNVAKCKVMHLGHGNINHEYVMNDGNT FT SLIIAATDIERDLGIFLSNDLKWNQHIQVATRRANMILGLLKKTFRSRDIK FT LWGKLYTTYVRPHLEFAVPVWCPYLKGDIKEIEKIQHKATKIPHDITSLPY FT AERCRRLNFTSLETRRRRGDLIQQFKLENGFEIINWHNPPLRRSPSNVLTR FT DFTYNNARHNFFTNRIVNDWNYLPLSCKKAPSINAFKSRIDKYFFPTAANS FT ASFTEDELHV*" XX SQ Sequence 4253 BP; 1569 A; 692 C; 717 G; 1274 T; 1 other; atacggaaaa actattgtta tatgtaaaca cataaacacg tattacaaac tttttattcc 60 ctgtgtaata agatatttac attttttttt ttttttttac taatatataa acttcacgga 120 tataaattat ttctgaggat ttatcgagtt aaaagcaatt catttataac tttaaattcg 180 tttataacct ataaacctta aaatcgcttc cactgttaat tgctaataat ggtaacgcaa 240 aaacagctta atgatgctct tgaaaacgct gataaaattg ccaaagaaga gctgaaacaa 300 gctctttatg taattaaaac acttaccgcc agaattgacg aacttactaa tagaattgat 360 ctgcaggata ttgaaataaa gtctttaaaa gatagcaaaa gtagtgaaaa acctttgtta 420 tctaaattag ttgaacaagt cggcaaaccc ggtacattag caaacgtagc tgtagtgagc 480 gccctcaata agcataacaa ggacgtgact aacagagaaa agagaattgt agtgtttggt 540 ctaccagaat ctgttaaaac tcaaccaagc gagataacca aagatgatac cgcggaattc 600 aagaaggtta tggcatcttt aagtaaatca gtcaatgtag ttagaattga acgtttcaaa 660 aaaaaaaaga actttgattc tagtaaaccc gctcctctta tgattgaagt tgattcagta 720 tttatcagaa atgacgttct agcgtcggct aaacaactgg catcgggtac ttttaaagat 780 atatttatac gccccgaccg cacagcagca gagcaaagcg aatttattaa attaaactta 840 gaaagaaaag cagctaattc tgaccttgac aagcttggaa aacttgacaa accctttaga 900 tttgtcatcc gcagcgataa attgcgatgc attgatgtct cgcaggaagt tgaaatcaat 960 ggacgtaaaa agcatccttt catcaaatgg aaagatgcac aagcagccag aattgctaaa 1020 acacaatgaa tcaaatcgtt taactgcgaa tacttattta aatgcattaa atacaaaaat 1080 aaaatctgct aaatctaacc ctaaaaataa acttttaaaa tgtttttata ctaacgcaac 1140 ttctcttaat ccgactaaaa tgaacgagtt atctgcactc tgtgatataa acatgcctga 1200 tttaatcttt atttctgaga catggtttac tgaaacatct gccactcaat taaataacta 1260 ttcactagtg aaaaaaaata gaataggtca cggtggaggt gttgctatct acattaagcg 1320 tgatctagtt gcgcatgagg tatctgattg tcatctaaga gagactttaa acaacagcaa 1380 ctccgagcaa atatggtgcg aaattaaaat cgttaacgaa atgattctta ttggctgtgt 1440 ctatagacca cctttatccc atatagatga tataaataaa acaattattt cagctaagca 1500 agctgttgat agaaaagcgt atagttgtat gttattagca ggtgatttca actttccgga 1560 tattaaatgg cataatgacg aaagaattga attactaagc ggtcaaaata gtgctgcaag 1620 tgtttttctt gtaactttgg ctaatcataa tctagaacaa ttagttgatt tctcaacctt 1680 cgaagcatct aatggaaatg caaaaaatgt tttggattta attattacag actcttcaac 1740 tagagtaatc aatttatcta gctctatgcc gttaggtgat ttgtcacaag gtcatcgcat 1800 attgagttgg gattatgtcg tcctttctaa aaataggaca actttctcaa acaaaaaata 1860 tgattttaat aagggtgact ataaaaactt cggtaaaaag attatggaaa ctaactggca 1920 gcaattattt gaaaataaaa acactaacga atgctatgaa cttttctgca ataaatatga 1980 taaactcagc aagcaattca tcccgttaaa aaaagtccat tctactcgta atgcaccgtg 2040 gatgaataag gaagtattag ctatgataaa gaaaaaacaa caactgggca attacttatc 2100 tatgacaaac tggggatcaa cgaccctcat ttgtgaatat aagaaactaa gaagccaagt 2160 tcagaaagca tgtacacatc gtgttcgagt atttgaagcc caactggcat cagataaaaa 2220 taatccaaaa agaatgtatg cttatgctaa agcgcaacag aacgtwcatg tatctattgg 2280 cgctatatca gatgcaaaag gtgaaacgtt aacagaggca atacatatcg caaacagact 2340 taacgaacac tttaaatcag tttttgttga tgatagcata aatgatcagt taccggtttt 2400 tgagagaaga cataatcaag aagacttagg cgatgtaata ataagttttg aagcaacgct 2460 agcctatttg aatggtttga acccgaacaa atctattggt gttgataaca ttagtccaaa 2520 ggtacttaaa gaatgtgcag ctcaaatgac ttaccctcta acattaattt acaataaagc 2580 actgtctgaa ggctcaacac ctttagcctg gaagcagtca catgttacac cgttgtttaa 2640 aaaagggagt cgtcttgatg ccgctaatta cagaccagtg tctatcactt cagttccatg 2700 taaagtaatg gaaaagatta tcaaggaaca gataactaaa tatcttgaaa aaacgagctg 2760 catctcaaat aatcaacatg gatttatgtc caaaaaggga tgcacgtcta acttattgga 2820 aagtgtcgac tacattacaa aagcgttaag caaaagaaat tttgtcgata ttgcattttt 2880 ggactttgca aaagcgtttg ataaagtttc tcatcgtaga ctaatacata aattgaaagc 2940 atatgggatc aatggtaatg ttggcaaatg gattgagtca tttttgacta gcagaaaaca 3000 acgaacagtt ttaggtaatc actctagcga ctggaccgat gttttaagcg gagttccaca 3060 agggtctgta cttggtccaa cattattcat catctacata aatgacttaa cggataattt 3120 aaaatcagtt cataaaattt atgctgatga cactaaactg ttgcaagaaa taagacctga 3180 gtttcacgat gctgattgtc tgattcttca aaatgattta aacattatct cagagtggtc 3240 aaaagaatgg cttatggaat taaatgttgc aaagtgtaaa gttatgcacc ttggtcatgg 3300 aaatataaat catgaatatg taatgaatga tggtaataca tctcttatta tagcggcaac 3360 ggatattgaa agagacctgg gtatcttttt atctaacgat ctaaaatgga atcaacacat 3420 acaggttgcg acacgtagag caaatatgat acttggccta ctaaaaaaaa catttagatc 3480 aagagatatt aagctttggg gtaaactata tactacatat gttaggccgc acctggaatt 3540 tgctgttccg gtatggtgtc cttacttaaa gggcgacatc aaagaaattg aaaaaattca 3600 gcacaaagca acaaaaattc ctcatgatat aacaagttta ccttatgctg aaaggtgtcg 3660 gcgtctcaat tttacttctt tggaaactcg caggcgacgc ggcgacttga tccaacagtt 3720 taaacttgaa aacggttttg aaattattaa ttggcataat ccacctcttc gcagatcacc 3780 atccaacgta ctaacgcgtg atttcactta caataatgct cgacataatt tcttcacaaa 3840 tcgtattgtg aatgattgga attatttacc gttgtcatgc aaaaaggctc cgtcaattaa 3900 cgcattcaaa agtagaattg acaagtattt ttttcctaca gctgcaaaca gtgcatcttt 3960 tactgaagat gagctgcacg tttaatttaa ttaattcgtg ctttagcagc tacaaatatt 4020 gactttttag gtacatttgt cataaacgta aactgtaatt aaatgaaagc aaaaaaaaaa 4080 aataaaaaaa ataaaaaaat aaaataataa taataataat aataataata aataaaaaaa 4140 aaaaaaaaaa aaaatgcttt tgttttttaa tctgtttatt ggattgtttt tttatttatt 4200 tgtcatgtaa tctataaaga cttatttaaa tatattacta ttactattac tat 4253 // ID Crack-23_AAe repbase; DNA; INV; 4615 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-23_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4615 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1239-1239 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 748..1515 FT /product="Crack-23_AAe_1p" FT /translation="MNSESRISLTELPEIFSSMFEEYSTRLVDKIVTGQEF FT LSDKFDDIICQLQKLKTELKVLKAENDYLKQSLKVLNEQTNTVSSSLHQQE FT VQIDTRLRKEISSNAIFLGVPRTSNENTTELVLNVCKTLGLNLDRSEIVSC FT ERITSTLEGNNPIKVTFKNTHQKERIMIAKRQFGRLTASMVHGKRWQTGWN FT QTIIVRDDLSPLSMELFRTLKKHQALLNIRYVWPGRNGVIFCKHSEGSKPI FT SVKSRDDLNKLLNRK" FT CDS 1508..4459 FT /product="Crack-23_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="TENKIQLQKICRIDSNLATIENNQSKNVYLESFDELI FT LPTSFEGLKLLQINARGLNRFEKFDLLSECVRNLPVVVDVLLVGETWIKEE FT RSRFYNIRGYSCVHSCRTTSAGGLAVFVKEEIEYEITANSIDLGCHYIGLK FT LLTKPHAVYIHGFYRPPDYEFSRLITRIESAIAVLDTKSIQFVLGDMNLPI FT NISDSRGIQTYLQLLAAYNMVVTNTIVTRPSSNNILDHVVADADNSHKITN FT FTVDCNFSDHRYIFTLFHTKASKATKVYTKTITNYRRLDEQFQNYLNSFDF FT RTLSPNDRLVAISENYTNLRQIHSSSRSVEVRIKNNCCPWFNLEIWELSKA FT SRILYQRWKRNRQDQQLKVLLDLSNKKLAECKKRVKSAYYRKILSTDNPKN FT LWKGINELIGKKSVKAKQHVLRINGIEITDPEKVGDAFNDFFSTVGQQQAR FT NLASNGDINMFNTMDRCNLSLFLRPTSKAEVFSIISNLDATKATGIDGFPI FT AALKQHSTHLSGILADCINDSISIGVYPECLKQSLVFPVFKGGDPMNPTNY FT RPISVLPAVNKIFEKVIYIRFSSFLETTGMLYEHQFGFRQGSSTEVAILEL FT VDEISTAVDKKSCAGSIFLDLSKAFDTINHEMLLKKLEAYGIRGVPLNIIR FT SYLFNRYQQVVVNGMKGKACTVSCGVPQGSNLGPLFFLIYLNDISKLPLKG FT KPRLFADDTAISYKGPNAERVVDEMTQDMELVTAYLENNLLALNLKKTKMM FT IFSSLKCDKEICPDLVINGTLIEKVTEYEYLGVYIDERLSWDFHIRTTVSK FT CSSLCGILRKLSKFVPAHVLMKIYYSFIHSRYQYGIAAWGSCNKVYIKPLQ FT IQQNRCIKSILSLPYLFPTRDLYELPQHKVPPILGLHILQVGTSMYKIINE FT QNIHHNWSFASAFHQHRTRYAHHLQRPGFRTEIGRRRFANVGPSIYNQIPE FT EIKTAPTQFTFRRKLKQYILNNCRNLVIN" XX SQ Sequence 4615 BP; 1488 A; 1001 C; 872 G; 1253 T; 1 other; gtctggcagc cctgctgtat tacagctgtt gtgatctgtg ctccgctttg aactagaaat 60 ttgatctgaa atcactagat ttaaccaact gccacctcat cgcctgattc caatgctgct 120 gttgcaccac tcgtgataat tgctattccg atgcgagtag aaccgcctga gaaatcacgc 180 ataaaattca accatcgcca ccgctgttgc tgctagtgtt ctggttctga cagcttgaaa 240 accaccgccc accgccgcca ctgcccgcaa atcccgaata agcccacagt tgacccaccg 300 taaatcgcca ccaaacaacc actcctaccg ccaccccact gcaagtatcg ctccatcgtt 360 attcatctct cctgctattg ctttgttgaa atagtamcgc tgccaaatta cgctaattca 420 tcacgatttg tactatgcga caataaacga gttcttcatc ggttgtttta ttctgccgcc 480 cgtcccccac cacccccaca acggtcgctg actgcatcaa tactactgca ttcgctgttg 540 acgctgacga gtaaatcttc ctgcctgacg tcactacacc tgctcagttg ataaacaccc 600 cactacaaag gtcgaccgtc aatcatcgtg tgtaagtacc tacaccattg tacagtacta 660 cagcgcaact agtgtagggg cgtaaacaca atagggctca cttcagaccc agcagattca 720 gcgccatctg ttggcgaaca ctcgaacatg aactcagaat cacgaattag cttaacagag 780 ttgccagaaa tattttcttc aatgtttgaa gaatactcaa ctcggcttgt cgataaaatc 840 gttacgggtc aagagttcct atccgacaag tttgatgata taatttgtca attgcaaaaa 900 ttgaagacag aacttaaagt tctgaaagca gaaaacgatt acctcaaaca atcgttgaaa 960 gtcttgaatg agcaaacaaa tacggtttca agttcattgc atcaacagga agttcaaatt 1020 gatacacgat tgcggaaaga aatttcttcc aatgccatat tcctgggagt tccgcgaacc 1080 tcgaatgaaa atacaacaga acttgtattg aacgtctgca aaacattggg gctcaattta 1140 gacaggagtg aaattgtgtc atgcgaaaga atcactagta ctttagaagg caacaatcct 1200 ataaaagtca ctttcaaaaa tacgcatcag aaagagagaa ttatgatagc taaaagacaa 1260 tttggtcgtc taacagcatc aatggtccat ggcaagcgtt ggcaaaccgg ctggaatcaa 1320 acaattatag ttagagatga tttgtcacca ttatcaatgg aattatttcg tacgctcaaa 1380 aaacatcaag cattgttaaa cattcgttac gtgtggcctg ggcgcaatgg tgtcattttc 1440 tgtaaacact ctgaggggtc caaaccaata tcagttaaat ccagagatga cctgaacaaa 1500 cttttgaaca gaaaataaaa ttcagctcca aaagatatgc agaattgata gcaatttagc 1560 aaccatagag aataatcaat cgaaaaacgt ttacttggaa tcattcgatg aactgatttt 1620 gccaacctcc ttcgagggat taaagttatt acaaattaat gctcgaggat tgaaccgttt 1680 tgaaaaattt gacctgctga gtgaatgtgt gcgaaatttg cctgtggttg tggacgtttt 1740 gcttgtcgga gaaacttgga ttaaagaaga gagatccaga ttttacaata ttcgagggta 1800 cagctgcgtt cattcttgcc gtacaacatc agctggcggg ttggccgttt tcgttaaaga 1860 agaaattgag tacgaaataa ctgccaattc aatagatctt ggatgtcatt atattggatt 1920 gaaactattg acgaaacctc atgcagtcta catccatgga ttttaccgtc ctccagatta 1980 tgaattcagc aggttaatca cccgaatcga aagtgccata gccgttttag atacaaaatc 2040 tattcaattc gtgttagggg acatgaattt gccaatcaac atctctgatt ctagaggaat 2100 ccagacttat cttcagcttc tcgctgcata taatatggtt gtaacaaaca cgattgttac 2160 gcgacccagc agcaataata tactggacca tgtagttgcc gatgcagaca actcccataa 2220 aattaccaac tttaccgtgg attgtaactt cagtgatcat agatatatct tcactttgtt 2280 ccacactaaa gcttccaagg ctaccaaagt gtataccaaa actattacta actacagacg 2340 attggacgaa cagtttcaaa attatcttaa ctcgttcgac tttagaacac tgtctccgaa 2400 cgatcgactg gtagcgatct ctgaaaatta cacgaatctt cgtcaaattc attcaagtag 2460 tagatctgtt gaagtgcgta tcaaaaataa ctgctgtccc tggtttaacc tggaaatatg 2520 ggaactaagc aaagcatcca gaatactata ccaaagatgg aaacgcaatc gacaggatca 2580 gcaactaaaa gttcttcttg atctttcgaa caaaaaactg gctgaatgca agaaacgagt 2640 aaagtctgcg tattatcgaa aaatcttatc gaccgataac ccaaaaaatc tctggaaggg 2700 aatcaacgag ctgattggaa aaaagtctgt gaaagccaaa caacatgttc ttcgaatcaa 2760 cggaattgaa atcacagacc ctgagaaagt tggtgatgct tttaatgact ttttttcgac 2820 ggtcgggcaa caacaagctc gcaacttggc atcaaacggc gatataaata tgttcaatac 2880 tatggacaga tgcaatctct ctctattttt gcgacctacg tcaaaggccg aggtattttc 2940 aatcattagt aatttggatg ccacaaaggc aaccggaatt gatggatttc caatagctgc 3000 gctgaagcaa cactctacgc acctttccgg catattagca gattgtatca acgacagtat 3060 cagtattggt gtctatcctg aatgtctgaa gcagtcgcta gtgtttcctg tttttaaagg 3120 aggagatccc atgaacccaa ctaactatag gcccatctca gttctacccg cggtcaacaa 3180 gatatttgaa aaggtgatct acatacgatt ctcaagtttc cttgagacca ccggaatgct 3240 gtatgaacat cagtttggat tcagacaagg atcatcaacc gaagttgcaa ttctggagct 3300 ggtagacgaa atttcaactg ctgtggacaa aaagagttgt gctggatcca tattcttgga 3360 tttatctaag gcattcgaca caatcaatca tgagatgttg ctgaaaaagt tggaagctta 3420 cggcattcgt ggagttcctc tcaatattat cagaagctat ttgttcaatc ggtatcaaca 3480 agtcgtcgtc aatggaatga agggcaaagc ctgcacagta tcttgcggtg taccacaagg 3540 gagcaattta ggtcccttat tttttctcat ttatctaaac gatatatcaa aactaccact 3600 aaaaggtaaa ccaagactct ttgctgacga tactgccatc tcatacaaag gtccaaacgc 3660 ggaaagagtc gtagatgaga tgacacaaga tatggagctt gtcacggcat atttggaaaa 3720 caatctgcta gccctaaacc tgaagaaaac caaaatgatg attttcagct cactcaaatg 3780 tgacaaagag atttgtccag atctggtgat taatggtaca ctaattgaaa aagtgacaga 3840 atatgagtat ctgggggtat acatcgatga gcgccttagc tgggacttcc acattcgaac 3900 aactgtctcc aaatgttcat cactctgtgg aatcctacgg aaactctcca agtttgttcc 3960 agcccatgtt ctgatgaaaa tctactactc tttcatccat agccgttacc aatacggtat 4020 cgcagcttgg ggttcttgca ataaggttta tatcaaacca ttacaaattc aacaaaatcg 4080 ctgtataaag tcaatcctaa gcttaccgta cctttttcca acacgtgacc tctacgagct 4140 acctcaacat aaagtgccac caatactagg gcttcacatt cttcaagttg gaacttcaat 4200 gtacaaaatc atcaacgagc agaatattca ccataactgg tctttcgcaa gtgcgtttca 4260 tcaacaccga accagatatg ctcatcatct tcaacgaccg ggattcagaa cggagatagg 4320 gcgacgtcga tttgccaatg tagggccgag tatatataat caaatcccag aagagataaa 4380 aacagctccc acgcaattta cttttagacg gaaactaaaa caatacattc ttaataattg 4440 cagaaatttg gtaattaact agtgataaga gtcatctgaa ttacaaacta atttactatt 4500 agacaggtag gatactgaac aacacctttt aaagaacaat agttcagtag ggaatgttca 4560 aaattattgt atcgtatttt ttatcataaa taaatgaaat gaaatgaaat gaaat 4615 // ID DNA-TA-11_CQ repbase; DNA; INV; 951 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-11_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-951 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 61-61 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. TA TSDs. 17-bp TIRs. XX SQ Sequence 951 BP; 329 A; 146 C; 154 G; 322 T; 0 other; ggggaaggtg gggcaagacg accatatggg gcaagaggaa caatcgctcc tacggccgta 60 atttgtacaa ttttgattat ttccagtatg aggaattgtt gctagcaatg caattagctg 120 attctactac cacataaccg ccaaaatgac gtaaacgcca cggggcataa gatttaatga 180 agtttttttc gaaacctttg ttttcttata atatttggaa agtacaaaat aaggcttagg 240 gttcgtttta aggctcattt tatcaaaatg ctatttttcc tagatcagta gtgtccctac 300 aaatgacatg cacctattac aaagtatgat ttaacttttg gttatttttg ttgagagctt 360 ttaaaaaatc ttgttcaggt ggggcaagtg taccatatgg atttttagta tggaaaaaat 420 tacgaattgc tgcaacaaca tattttattg ggaaataaat acatgaaagt acttaaaaac 480 tgataaacaa tcgttaaaaa aaattgtaca tacaaagtat agtgatatta tgaaaattta 540 ctatttatca tcgaagtagt attttttttc gtaaaaacga taaaattttt agtaaaatat 600 tattatttaa tctaaaaatg gaagaaacca ttcaaataca ttctaatctg atgtatctaa 660 gtgataacag ttcaattgtt agcaaattac catatttttt catgcattgt tcctcttgcc 720 ccaacgggtt gctcgtcttg ccccactagt tgagtagaac gtacggaaaa tcaaaaattt 780 taaaatcaat ttttcacatt aaaaaacagg attttttaaa aacttgttta aacaaagtct 840 tagtcaagac ctagaataag atgattataa taaaatccga cagatttttt aactttttaa 900 tgggttataa cgagcatttc cttagcttgt tacacttgcc ccactttccc c 951 // ID Gypsy-24_OD-LTR repbase; DNA; INV; 198 BP. XX AC CABV01002164; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_OD_; KW Gypsy-24_OD-I; Gypsy-24_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-198 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002164; Positions 9104 9301. XX SQ Sequence 198 BP; 41 A; 58 C; 32 G; 67 T; 0 other; tgtaaggatc gccaatgctc atttttggcc gtttccgcat tcgcgtctcg ccgccatgtt 60 ttcaggctgc aaagccggtt tctcttcgcg cctgttctca actcgctatt cacttctact 120 cacgttaata taatgaaata agctccgatt ttccttctcg cgttttactc acaattcaag 180 ctatcgcaaa tccttaca 198 // ID hAT-4B_AP repbase; DNA; INV; 3322 BP. XX AC Contig44876; XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-4B_AP. XX NM hAT-4B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3322 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1366-1366 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 624..2606 FT /product="hAT-4B_AP_1p" FT /translation="MSRKLRADEQIFLIGSPSTQILGSKLPSIGQVLSVFF FT YNVRTVKLNTRESASLAVRECNIFWEKARIPVRAVQHCIDKLLNVYEEWRA FT LQKNSQKVGDSFKLKETEFINKLNNLFDIAHANALNIMKIENDKQFLINQR FT LPGRIGCLGGIDKKLLESENKAKKRKCIEIEKLTNRQLQLSNYNATEIQNQ FT LSDMSISSPSSSSDSEYTNCTRIPMTLSEKKRGRTNFITPKLVAALDRCKL FT SVRDSVYVIQATAEALGNNVDSLVINKSSIHRCRDAVRVERANKIKTNFQM FT SVPSYITVHWDGKILPTLNVRESGIDRLPIVLTSNNLDLLIGIPKLEKSTG FT KEQANAIYYALENWGVTDVVQALCCDTTASNTGRLNGACVLLEQKLERDLL FT YLPCRHHIYEIILRGVFESVLPHTTTSPDIPLFKKFRNNWNKIDITNITSG FT IDDLECCTALEDVRNDILHFCRAMLQNKCYRDDYKELLELSITFLNGDLDN FT KFKIRPPGAMHQARWMSRAIYCLKIYIFRNQYSLSSSEKNSIRDICVFIVR FT FYLKAWFSCTTAARAPANDLNFIKCLKQYENEHSKISKAAITKISNHLWYL FT TEETAALAFFDDTLSNEIKRLMVQSLNNDGLIHPTKRLIVSFQDLGETFSG FT ILYYRFCYVIT*" XX SQ Sequence 3322 BP; 1210 A; 464 C; 518 G; 1130 T; 0 other; tagggtgtcc cttatttgac tataaaattt ttttttttgg atttttgttg gtctcaccca 60 cttaatttgt gtatatacgt gtaaaaatga actacaaaaa gtttcaagtc aatatattaa 120 ggttaacccg tgccgacttg aatttgaaat ttgagtatat taattcattc ataatttaat 180 gatatatttt acatttttat atttaactca attacaataa ggaaaattaa aatttgttac 240 ttaatatgtt ttaaatatta taaaattgtc tataaattac gttcttacat acttcttctt 300 atttattata atttacaaga tatattgact tttgagtatg aaagtacttt tttctcaaaa 360 ttatattatg aatttttata aataaatatt gcacacaatt aaagaaatca tcagattatt 420 tgaacagtac aaacatcgtt atgttgaata ttatcgtatt ttttctagat aagaatgata 480 cggattacga tccacaagtc taatagattt ttattaataa tatcgtgtag tcgacagtcg 540 tgttcgttac ggtgcgtaat tagttgtgta cgtgtacgtc gtatgttgtt tcgtttttga 600 ttttggtaat tgttttgcaa aatatgtcta gaaaattacg agctgatgaa caaatatttt 660 tgattgggtc accatctacc caaatactag gctcaaaact cccgtcgatt ggacaagtgc 720 tttcagtttt tttttataat gtgagaacag taaaacttaa tacgagagaa agtgcttcgt 780 tagctgttcg tgaatgcaat attttttggg aaaaagctcg aattcctgtc cgagcagttc 840 aacattgtat tgataaactt ttaaatgtgt acgaagaatg gcgtgctcta caaaaaaatt 900 cacaaaaggt aggtgattca tttaaattaa aagaaacaga attcataaac aaacttaaca 960 atttatttga tattgcacat gccaatgctt taaatataat gaaaattgaa aatgacaaac 1020 agtttttaat taatcaaaga cttcctggtc gaattggttg tcttggtggt atagacaaaa 1080 agttacttga aagtgaaaat aaagcaaaaa aacgtaaatg tattgaaata gaaaagctaa 1140 caaatagaca acttcaatta tcaaattata atgcaactga aattcaaaat caattatctg 1200 atatgtctat aagttcaccg tcgtcatcgt cagattctga atacacaaat tgcacccgta 1260 taccaatgac actatcagaa aaaaaaaggg gccgcacaaa tttcattact ccaaagcttg 1320 ttgctgcatt agatcgatgt aaattaagtg tacgggactc agtttatgtt attcaagcaa 1380 cagctgaagc attgggtaac aatgttgata gtttggttat aaataaatcg tctatacatc 1440 gatgtcgtga tgctgtacga gttgaacgcg ccaataaaat taaaactaat tttcaaatgt 1500 ctgttcctag ttatataact gtacactggg atggaaaaat actaccgacg ttaaatgtta 1560 gagagtcagg tattgataga ttaccaattg ttttaaccag taataatcta gatttgctta 1620 ttggtatccc aaagttggaa aaatcaacgg gaaaagaaca agctaatgct atatattatg 1680 cattagaaaa ttggggcgta acagatgttg ttcaagctct atgctgtgat accaccgctt 1740 caaatacagg gcgtttaaat ggtgcatgcg ttttattgga gcaaaaattg gaaagagact 1800 tactttacct accatgtcgc caccatatat acgaaataat tttacgtggc gtttttgaat 1860 ctgttcttcc tcatacaacg acaagtcctg atattccact attcaaaaag tttcgaaaca 1920 attggaataa aatagatatt acaaatataa cttctggtat agacgacctt gagtgttgca 1980 ctgctttaga agatgtacga aatgatattc tccacttttg tcgagctatg ttgcaaaata 2040 aatgttacag agatgactat aaagaattac ttgagctatc tataactttt ttgaatggtg 2100 atttggacaa caaatttaaa atacgtccac ctggagcaat gcatcaagct cgctggatga 2160 gtagagctat ttactgttta aaaatatata ttttccgaaa tcaatactca ttatcatctt 2220 ctgaaaaaaa ttctattcgt gatatttgtg tttttattgt tcgattttat ttaaaagctt 2280 ggttcagctg taccacagca gctagagcac cggcaaacga tttaaatttt ataaaatgtt 2340 taaaacaata tgaaaatgaa cactctaaaa tttcaaaagc cgctataaca aaaattagta 2400 atcatttatg gtacttaaca gaagagacag cagcattagc tttctttgat gacacattat 2460 cgaacgaaat caaaagatta atggtacagt cacttaacaa tgatggactt atccatccaa 2520 ctaaacgtct tattgtttca tttcaagatc ttggcgaaac tttttcaggt atattatact 2580 atcgtttttg ttatgtaata acgtaattca taaggttatt aatagaattt ttttttttgt 2640 agaaaaaaca ctggcctctt ttatatcaaa gaacagtatg caattttttt ctcgatttaa 2700 tattaatacg gactttttga atgatgatcc aagcacatgg gacactcaaa ttagttattt 2760 acacgggaaa gaaattgcat gctcgctaaa tgtagtaaat gatacagcgg aaagagcagt 2820 taaacttatg gaagattttc acggaaactt aaccaaagat gataagaagt ctgagttgct 2880 attacaatgt atacaagaac accgaagact atatcctgat tgtaaaaagg agacattgaa 2940 aaaaaatttt aattaataaa aatgtataat aatattttaa gaaaatagta ctttcacact 3000 cgaaaatcaa tatatcttgt aaattataat aaataagaag aagtatgtaa gaacgtaatt 3060 tatagacaat tttattatat ttaaaacata ttaagtaaca aattttaatt ttccttattg 3120 taattgagtt atatataaaa attttaaata tagcattaaa ttatgaatga attaatatac 3180 tcaaatttca aattcaagtc ggcacgggtt aaccttaata tattgacttg aaaatttttg 3240 tagttcattt ttacacgtat atacacaaat taagtgggtg agaccggcaa aaatgaaaaa 3300 aaaaaattta agggacaccc ta 3322 // ID DNA2-1_TCa repbase; DNA; INV; 1445 BP. XX AC . XX DT 22-MAR-2009 (Rel. 14.03, Created) DT 21-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1445 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 663-663 (2009). XX DR [1] (Consensus) XX CC TSD is TA, often in multiple copies. Unclassified (possible CC non-autonomous Tc1/Mariner). CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1445 BP; 502 A; 230 C; 222 G; 491 T; 0 other; cagggtgttc cgtctaaacc ccacattaga agtattggta gttctaataa tgatagtcat 60 ctgaaaattt gtatacaacc ataatctctt aggtcctgct caatgaaaat attttcaaga 120 tggcgacact tccggttata ccggaagtcg ctatcaactt tcttatttta aatggaaagc 180 tatatttttt aatgcgtttt tggattacta gagttatttt aaggtagttt tcataagctt 240 tccctatacc taaatttagc cgtttacgaa atatttaggg tttttaattt ttttttggat 300 ttgcaccttc cagttcaaaa atcgataact ctgccatttt taaaaatttg gaaatatttt 360 ttagctcatt tgaaagagga agcctagatc tttcctttgg tgttttcaaa tttctaaaaa 420 agaataacaa accccaaatg tttaaagtta agttttcaga actttaagca cccataactc 480 aattttttca aatgaaatgc cacaagtttt atttcattaa atcgtagaga atttaattct 540 gcatcgatct gtattagcat tccctatact tatgtttaat agttttcaag ttacaataac 600 tttttgttgg gattgagaaa ttcattagcc ataggccttt tctcataggt tgccatggaa 660 acgagagaaa aaatgtgaga aattaaagtt ttttaaacca acattcaatg aatttatttt 720 gttttcacag gacggacaac acaattttga acagagtttg tcgtttgttt aatcagaaat 780 gtttttatgg caaactatgc aaaaatagct gtggttagta agaaattttc tacaatgtca 840 aaaaaaacca acataacttg aaaattatca cagataggta tagggaatgc taatacagat 900 cgatgcagaa aaaaattctc ctccatctag taaaataaaa attgggacat cccatttgaa 960 aaaattaagt tatgggtact tgaagttgtc caaacttaac cttaaaattt ttgggtttat 1020 taacttcttt tccaattttg aacacactgg agaacagata taggctttct cttttaattc 1080 tccaaatatg atttcgaaat ctctaaaact aagagagtta ccgatttttt aagtgacagg 1140 tgcaaatcca aaaattttca aaaaatctta aatatctcga aaacggttaa acttaggtat 1200 agggaaagct gataaaaacg gccttaaaat aactcaacta atccaaaaac gcagtgaaaa 1260 acatagcttt ccatttaaaa taagaaagtt gatagcgact tccggtataa ccggaagtgc 1320 cgccatcttg aaaatatttt cattgagcag aactcaacgg attatggctg agtaccaact 1380 ttcaaataaa tatctttatt agaactttca atacttctaa tgtggggttt agacggaaca 1440 ccctg 1445 // ID TCRP3 repbase; DNA; INV; 145 BP. XX AC M63895; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.cruzi ribosomal intergenic spacer repeat. XX KW Ribosomal intergenic spacer; TCRP3; spacer repetitive sequence. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-145 RA Novak M.E., Mello P.M., Gomes B.H., Galindo I., Guevara P., RA Ramirez L.J. and Franco da Silveira J.; RT "Repetitive sequences in the ribosomal intergenic spacer of RT Trypanosoma cruzi."; RL Unpublished (1991). XX DR GenBank; M63895; Positions 6 150. XX SQ Sequence 145 BP; 38 A; 41 C; 46 G; 20 T; 0 other; gtcggagcag ggacagcaga aagggagcaa tggtgcgggg tgctggcggc gcatccaccc 60 agcgacgtct accgccgcgg cttcttcagg acaggagccc gtaaagcagc taacatctgg 120 aacctctccc agtggaaaca aaaat 145 // ID Chapaev3-1_NVi repbase; DNA; INV; 3561 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-1_NVi is an autonomous DNA transposon - imperfect DE consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3561 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 59-59 (2008). XX DR [1] (Consensus) XX CC Chapaev3-1_NVi belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-1_NVi is a young family of wasp Chapaev3 transposons: CC genomic copies of Chapae3-1_NVi elements are ~98% identical to CC their consensus sequence, which was derived from multiple CC alignment of two Chapaev3-1_NVi elements. The CC transposase-encoding region is corrupted by mutations accumulated CC in the genomic copies. Chapaev3-1_NVi contains 14-bp TIRs and CC imperfect ~70-bp subterminal inverted repeats. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX SQ Sequence 3561 BP; 1160 A; 645 C; 725 G; 1031 T; 0 other; cacaggccca taaaaaaaat gaacttgtat gtccatcgac aaaacccatg tactcctatg 60 tatttttgcg cgctgagtcc gaatctggca ttcgtttaac ctcagcacgt tgggttcaag 120 ggcatttgaa gatcaaattg cagaaatcag caataaccaa tacaagacat ctatctgtta 180 agttttgggg ggcgctgaat ccaaatccgg tatttgtttt tctccatgag gtaagtttga 240 aggtcagttc aaggtcaaaa ctgtgtactc taggaaaaat gaatttttga cagcagagtc 300 tgaaagagcg ggaaatttta cgtccagata ttcatttttt tttaatataa ccctcaaggt 360 catatgaagg tcacttaatc ttgttggatg gtggattcgt ataacaatta gataatggat 420 tttcatgggg caaactacta tgtcgttatg ttttcagggt cgccgaatcc aaatccggtg 480 tccgtttgtc tccatcaggc cgaattggag gtcgtttgaa ggtcaaatcc tcgacaatta 540 gtgaaatcga gtcaagttat catcctgaca tattttttgg gccgctgaat ccaaatataa 600 agaccgctta gtacctcagg gtcaaagtca aggtcatccc aggtcaaata tacatttttt 660 tttactaaaa aaggaaattt tgaccacaga ttcaggaaaa atggacattt ttacatctag 720 ctgtgatctt tacttatttt ttttatcaca ctcccgaaat aatcgtgtgc cgcatcccag 780 ggtacctcca cgtggtgacg gcggaaaacg gttttttttc gctttttgtt ttccagcaaa 840 accctggggt gaaggcatga tgccgtcatg gtcggcacac gaagaggtac gatgcagggt 900 ttctgatgcc cagtatcggc gattggtcag gatgagcaag caggtccgca atcgtcgtca 960 tcatgcgact gcagctctac agtggttgga aacttggtct ctgtagagta caacaataca 1020 agaatacgct ctaattctta tagttgtaag caaaaagtga attccttttg ctatgtatgt 1080 ggcaaatact ttttgaaaaa gttgaatctt cggcaattta cagatgattt gaaattagta 1140 tacgatgagt gtttcggttg taaagtcact gatagtgagc aagattggct gccacaattg 1200 atatgtaata gctgtaggct gatgtttagc cggtacaaaa agagcaaaac ttcattaaaa 1260 tttgtcaaac ctatggtttg gagtgagcca gaaaatgtag aaaattgtta tttttgtatg 1320 acaaaaacta aaggttttaa ctcatctaat gtccacaaaa ttgtgtacgc gagtgtgtct 1380 agcgtggtga agccgtttat agctccaaca gaagaaagag atacagatgt agaaagtgac 1440 ctttcttttg cacttactga attagaggtt tctagttcag atgaagccga gagtgaaaga 1500 gaagaaacgg acgagagtaa tgacgagtac ttgccatctg gatcacaaac aaagataaaa 1560 ggagaatttg atcagaaaga acttaatgac ctagttagag agttaggctt atcaaaggag 1620 ggatcagagt cacttgcttc gaggttgaag gaaaaaaatt tactgacgaa aggtactaaa 1680 tgttcttact atcgaaatag agaaaaagac ttcagaaaat atttctctgc ggagaatgat 1740 ctcgtatatt gcaacaatgt taagggtctg atggatgaat ttaagaagga tgtttataaa 1800 tctgaggaat ggagattatt tctggattct tctaagcgaa gcctgaaggc tgtactatta 1860 cataacacta ataaatattc tccaatacct gtggcccatt cggttaccct gaaagaagat 1920 tacaaaaata ttgagttgat attaaataaa atacagtaca aggaacacaa ctggctaatc 1980 tctggtgatc tcaagattct taccatcatg ctaggtcagc aatcaggctt tacaaagttt 2040 ccatatttct tatgcatgtg ggatagcagg actcgtaaga atcattatgt aaaagcagaa 2100 tggcctgcta gatcaacgtt cgaaccccgt aaggataaca tcattaacaa gccattagta 2160 gagccatcca aggtccttct tcctcctctc catattaaac ttggtttaat aaaacaattc 2220 gttaaagcgt tgaataaaga aggtcaatgt ttctcgtacc tgggaacaaa atttgcagga 2280 gtaactgacg ctaaattgaa agtaggtata tttgacgatc cccaaatccg cacatagtta 2340 aaggataaag cattcataga caagatggac gataaagaaa aagcagcatg gattagcttc 2400 aaagatgtag tggaaaactt attagagaat cacaagagcg agaactacaa aaaactagtt 2460 gaagacctat ttaaaaatta caatagttta ggttgcttga tgaattacaa gctacacttt 2520 cttcattcac acttagacta cttttcacaa aatcttggag attatagcga agaacaaggg 2580 gaacggtttc atcaagatat taaagtaatg gaacagcgat accagggccg gtgggacata 2640 aatatgatag cggacttttg ttggacattg aaaagagaac atattatcca cggaaagaaa 2700 agaaagcgga atcccctaca ccgatcattc gaagacaaaa gaacaagaaa aaaaagaaaa 2760 tcatgagttt tgtaatacat gatacccatg tctccgtgtg tggctgtttg tcatacgacg 2820 gttggtcagt ctcactattt ttgacaaata ctaatttatt gcacatcatt cattaagatg 2880 taaaactttc catttttcct gaatctgtgg tcaaaatttc cttttttagt aaaaaaaatg 2940 tatatttggc cttgggatga ccttgacttt gaccccgagg tactaagcgg tctttatatt 3000 tggattcagc ggcccaaaaa atatgttaga atgataactt gactcaattt cactaattgt 3060 cgagggacct tcaaacgacc tccaattcgg cctgatggag acaaacggac accggatttg 3120 gattcggcga ccccgaaaac ataactatat agtagtttgc cgcatgaaaa cccattatct 3180 tattgtcata cgaatcctcc atccaacaag attaagtgac cttcatatga ccttgtggat 3240 tatattaaat aaaaattgaa aatctatctg gacgtaaaat ttcccgctct ttcagaatct 3300 gcagtcaaaa attcattctt cctagagtac acagctttga ccttgaactg accttcaaac 3360 ttacctcatg gagaaaaaca aataccggat ttggattcaa cgccccccaa aacttaacag 3420 atagatgtct tgtattggtt attgctgatt tttgcaatta gaccttcaaa tgcccttgaa 3480 cccgacgtgc tgaggttaaa cgaatgccag attcggactc agcgcgctgg acatacaaat 3540 tcattttttt tgtggcctgt g 3561 // ID Gypsy-3_IS-LTR repbase; DNA; INV; 195 BP. XX AC ABJB010044547; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_IS_; KW Gypsy-3_IS-I; Gypsy-3_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010044547; Positions 6822 6628. XX SQ Sequence 195 BP; 46 A; 59 C; 48 G; 42 T; 0 other; tgttgcgacc ggaagtggca accctggcca gagccgccag cctacttccg gaaagaggaa 60 caacggtcgt ccgccgcgac gccatcatgc cgactagggt cggacgtctg taacctgtac 120 cctaatgtat tcacgtttct catctcctat aaagaacccg agttacaaag cgttgtgttt 180 cctgacgcgc taaca 195 // ID Neptune_Hyd repbase; DNA; INV; 5055 BP. XX AC . XX DT 29-DEC-2006 (Rel. 11.12, Created) DT 29-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Neptune_Hyd is a Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; Neptune_Hyd. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5055 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Neptune_Hyd is a Penelope-like element (PLE) from the freshwater CC hydra, Hydra magnipapillata. It belongs to the Neptune group of CC PLEs. It has a very small ORF1 with no discernible motifs, and CC its ORF2 contains a region of homology to reverse transcriptases. CC Consensus sequence was assembled from GenBank trace archives. The CC element may be active, its copies are 99% identical. There are CC TG/TA microsatellite repeats in the 3' UTR. At the 5' and 3' CC flanks, there are tandem repeat units of 1.5 kb and 55 bp, CC respectively. XX FH Key Location/Qualifiers FT CDS 603..962 FT /product="Neptune_Hyd_1p" FT /translation="MFKIMNGDCFYPMTNWPLDIQIIFWNYVTSDLETFKL FT TIFFLGNGCSPFVLFTFLILGFKRDPSKFNKRIYQIKWITKNLESHNSRWY FT YFDIHFNQMRYLNLQPLPYQKRNRDTIFDDI*" FT CDS 1491..3356 FT /product="Neptune_Hyd_2p" FT /translation="MRTLFPPGENFRHDRDFKRWVDSEKVRMLTALTDKMI FT TNISDHVLTNEEKEALGKGLKFIPNSQKVGREVYEASVEKFKRKLRLKMFF FT KDRKEQKRSVFYKKSSWVPPETENENILEYFGKIDRDLEEMWNREIPRKRN FT NLKPEVWRALKNLQNNRDIIIKKTDKGGGLCIMNKTTYEEKVNELLMDRTV FT YRELQTDQTNNIVRGITDMSNYMLKLHKINEQTAEFLLPHKPCRPPLFYGL FT PKIHKPNIPLRPIVSACSGPTDNLSEYLTRFIQPLVESLPAYIKDSIHFLN FT MLQNIRLQSSEFIFVTADVTSLYTNIPHRDGIDAVVHYLRLIPLQDRPPDC FT PSPGIIGNLIEEILSNSTFSFGDKHYHQLTGTSMGTRVAPCYANLFMGRLD FT EQITNQFPNHITFYRRFIDDIFFIFQGPIEVLDQICNFMNTIHPTIKFTFN FT HSATSINFLDILLYKNEQGNIHTTIYRKPTDTMGLLHYDSHHPTHTIPGTI FT YSQALRYCTIITELNNLQKELKYLTMTLVLRDYPLHIINNNIKKALHKTQH FT ELINNRRAQENTERLTIVTPYTPIGQQINSIILSNWNMIDPNLQLQDIFPH FT KPLSVHTNLKSIKDLLIHTKHQQ*" XX SQ Sequence 5055 BP; 1821 A; 1025 C; 721 G; 1488 T; 0 other; cgactacaaa tcacatttac gctggacaca atttagtagc acaacaaata atgacttgat 60 ttttgtttca gggctataca tcaactttac acgtaatact aacaacagtt gagtgacaca 120 ataaataatg gttaccactt aacttcgcta tttgaccacc acaattgacc actagtgttt 180 caaaatatgt tcttataatg tattgtctca aaaacacatt tatatgttac tgttcttata 240 tttaacctcg tgttataatg attcaattat atattcctca taataatgta ttctgtttcc 300 attcacataa ccctgattta ttcttattta atctagaaga cttactatgc tttatttaat 360 tatactatgt ccgtcaccac atacaaacac tcataaaaaa tagcacctaa taaatatctt 420 ttcataatac actcatttct ttatcatgcc catcccccca tccactgctg cctaaatgct 480 caaagactgt aaacatactt aacatattat aggaatcgtt gtttgtagag agacacctac 540 agaaggcttt agaacacact acaggcattg caagccctcg aatcaacctg gaacgcgaga 600 agatgttcaa gataatgaat ggagattgct tctatccgat gaccaactgg ccactcgata 660 tccaaattat attctggaac tatgtcacct cagacctgga aacctttaaa ctgacaatat 720 ttttcctagg gaatggttgc tcgccatttg ttctattcac atttctcatc ttagggttca 780 aaagagaccc atctaaattt aataaacgaa tttatcaaat caaatggatt acaaaaaact 840 tagaatcaca caacagcaga tggtactact tcgacattca cttcaatcaa atgagatacc 900 tcaacctaca acctctccct taccagaaac gaaaccgtga cactattttt gacgatatct 960 gaaagacgaa gagcggacat gaatctttat aaacaataca cttaacgaaa taatataaca 1020 cgaacctgta attttttaga cgatttatat gcttatttaa atatgtaaat atcaatatat 1080 gcgaatactg actcatttat ttactaatta atccttattt ataatgccac tcatcacagg 1140 ataacacaac acgcgttgca aagcatataa cacactactg tatttatcaa caatcagtgc 1200 tttatgcata tacatttctc tctcggccaa catctcctat aatcttgtat aagcattcat 1260 atgtaattaa taactattat tcacaatttt actcatacta aaataccacc atttgggcac 1320 tcacaaaaca aaggacaacc acataccaat aattgccata ccacttatat atagtaaaga 1380 aacagtaaaa tacatgcaat acaaaaatga tacacgctat tacttacctt taggacaacg 1440 aacgaaaatt cactccatca aatgagagaa gagcaattgg tccattcagc atgagaaccc 1500 tttttccacc gggcgaaaac tttagacatg acagggactt taaacgatgg gtggacagcg 1560 aaaaggtaag aatgctaaca gccctgactg acaaaatgat aacaaacata tccgaccacg 1620 tactgacaaa tgaagaaaag gaagcacttg gaaaaggact gaaatttata cccaattcac 1680 agaaagtagg aagagaagta tacgaggcta gtgtggagaa attcaaaaga aaactacgac 1740 tcaagatgtt cttcaaggat agaaaggagc agaagagatc cgtattctac aagaaatcca 1800 gctgggtacc tccagaaaca gagaatgaaa acatactcga atattttggc aagatagaca 1860 gagatctaga agagatgtgg aacagagaga taccgagaaa aagaaacaac ttgaaaccag 1920 aggtatggcg agcactgaaa aacctacaaa acaaccgaga cattattatc aaaaaaactg 1980 acaaaggggg agggctctgc atcatgaaca aaacaacata cgaagagaaa gtgaatgaac 2040 tattaatgga cagaacagta tacagagagc tccagactga ccaaacgaat aatattgtga 2100 gaggaattac tgacatgtca aactacatgc ttaaacttca taaaattaac gaacagactg 2160 cagagttcct tcttccccac aaaccatgta gaccaccact attttacgga cttccaaaaa 2220 ttcataaacc taacattcca ttacgaccaa tagtcagcgc atgcagcgga cctacggaca 2280 acctatcaga atacctgaca agattcatac aaccactggt cgaatcacta ccagcctaca 2340 ttaaggacag tatacacttt ctcaacatgt tacaaaatat acgacttcaa tcttcagaat 2400 tcatttttgt gacggctgac gtgacatcac tctacaccaa catcccacac agagatggca 2460 tagacgcagt agtacattac ttgaggctca taccattaca agatagacca cctgactgcc 2520 catcaccagg tatcataggc aacctaatag aagaaatact atcaaacagt actttctcct 2580 ttggtgacaa acactaccat caactgacag gtacatcaat gggcacacgg gttgcgccat 2640 gctatgcgaa tttattcatg ggaaggcttg acgaacaaat cactaaccaa tttccgaacc 2700 acattacttt ctacagaaga ttcatcgacg acatattctt catttttcaa ggacccatag 2760 aagtgttaga tcagatctgc aacttcatga acaccatcca tccaacaatc aaattcactt 2820 tcaaccactc agcaacatca ataaactttc tagacatact actgtacaaa aatgaacaag 2880 gtaacattca cacgaccatc tatcgaaaac caactgacac aatgggacta cttcactatg 2940 actcacatca tcccacacac accattccgg gcaccatata cagccaagct ctcagatact 3000 gcactatcat aacagaactt aacaacctac aaaaagaact taaatactta actatgacac 3060 tagtactaag agactatcct ctgcacatca ttaacaataa cattaagaaa gcgcttcata 3120 aaacgcaaca tgaacttatc aacaacaggc gcgcgcagga aaatacagaa agactaacaa 3180 tagttactcc atatactccc attggacaac aaatcaacag catcattctt agcaactgga 3240 acatgattga ccctaactta caactacagg acatatttcc acacaaacca ctatcagtac 3300 atacaaactt aaaatctatt aaagatttac taatacacac taaacaccaa caataacaac 3360 tgcacaaaag aatattagct tttattgcaa cctctctatc gacaacatca tccaaatgga 3420 aactcaaaac ttcttaagaa cttaaatttc cggttcatca agactctaca aactaacgca 3480 aatttatcat cggcattttt ggcctcgctc tcctcagtct cttattcagc ataaaagcca 3540 tttctaaaac cgcatgctat tctgtttgaa ctctgataca gagcggtgag tttaaaattc 3600 aatttattta cattctctgg atatttagac ggattaagat tcttatcaat cttgacattt 3660 cctgtatatc tttgtttatt tataattcta aatttaactt ttataacaac gacgagttaa 3720 tattcatcat caagaacatc ttacatcttc gtcacgattt aaacccaaaa gaaaatattt 3780 actttcagaa tggtaagcaa acatgttttt gttacttaac tttcttttat tctattcaca 3840 attatatcta ttgataaagt ttatgtacca acatttattt ataaggggac aaatgtctat 3900 gttttttctt aattagctaa atgctactaa gtttttaact atagtaaaag tttataagta 3960 tgtaccaata tgactatcac tttatgttgt atgtctatgg tacagaataa atgtcataag 4020 atatctgtgt aaatgtttac attaacattc ctaaacagat atacaaaaac cctggtcaat 4080 attgaccaag aaaagaacat attaaggtta agacatacca ccccattata aaatactggt 4140 gttttacaaa ttctgagcac gtgctctatt ttagcaaagt agtgtccaca gggtgtcgac 4200 cattggtcag gcgccagacc ttcgaaactc cctgatgaca actaatggtc ctgatgagaa 4260 tgtcatcttg acactacatg ggtaattgga tattaaaaca cctatatttt gatcatctac 4320 acttcaatac acttatgctt atcattctaa tatttgtact gatacttcta gaagctatat 4380 ttgctgattt atgtctataa ggctacactt tactaagatt agatcagtaa aacattactc 4440 caacacctat tcaacacaaa aaaatgtgtg aaatagctat tgtaatgtat ataatacata 4500 taaatgattt agggatactt aggagaagat atgtatatat aatcataggg gtatatgtat 4560 gtatgtatat gggtgtatgt gtatatgtat gtatgtgtat atgtatgtat gtatatgtat 4620 atatatatat atgtgtatat gtatatatgt gtatgtgtgt atgtgtatgt gtgtatatgt 4680 gtaagcatgc acatgtgtga gtacatacat acatactcat atgtgtttgc attaatgcac 4740 atataatctt gtttatcagt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtatttacat 4800 gtgtgctata atgtatttat aggggtattt gtattttctt taatgtattt gcatttgtat 4860 atgtatttgt atatatgctt taatatgata ctactagaat catacatatt taatatacaa 4920 ctccagctaa ccataaccat tacaactact aactactact gtcaccgtac acactctaaa 4980 attttgacta tcatgtacaa tctactttaa cactcctgtc agggttattt aaataatatc 5040 cttgccatct gatat 5055 // ID Crack-19_BF repbase; DNA; INV; 2158 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-19_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-19_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2158 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2158 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 824-824 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 3..1976 FT /product="Crack-19_BF_2p" FT /translation="TDRVKNLMGQRDAARRKAIKTKDTKDWELYRSLRNQT FT TSMIKKAKKTHFESAITDAAEDSSLMWKIINTFTGKTKGKCQVRKIRRSDE FT SCISEPSEMAEEFNNYFSSCAADLARDIPVSEEDPLRHVPEATSTFRLRPV FT EETEVLNELLRLKPKKATGVDKIPSRLLKDSAPIIVKPLAHIFNLSITSGE FT VPDDWKLAKLSPIYKSGNKDSVSNYRPVSVLNVASKVMEKMVHNQVSHFVN FT STGQLTAHQSGFRKHHSTGTAVQKVVEDIQSAQNNKKVTVALFLDLRKAFD FT TVNHQILLGKLQKMGFDNGAINWFRSYLTDRFQHVDIQNQQSTQKRVTCGV FT PQGSVLGPLLFSLYVNDLPQVVRKCKIHMYADDTVLYCSASLSKECEETVS FT EDMKLVANWFIVNRLSLHPDKTKAMLFGSPQKLRHAGNTVTVTDGFNSFEQ FT VKVYTYLGVTMDSSLNWSAHTERMVKKLLAGLSALSRAKPYVTKEILLVMC FT RTLLYTHLDYCDTAWLLSLFHCNKVRSQQLTKLLNRAARIITGRSLKDRVP FT TETLLTEAGMVPLVERVKLNTLVTVHKAVRHKAPAYMTQMFKWLSPPVLRV FT RTRTATKQLYNFDPHMLDTPKAHVETFKGSLQYLGPVLWNSLAADQRKHTS FT TSAFKNGL*" XX SQ Sequence 2158 BP; 652 A; 492 C; 483 G; 531 T; 0 other; ttacagacag agtaaagaac ctaatgggcc aacgtgatgc tgcaaggcgc aaagccatca 60 agacgaagga cacgaaggac tgggaactct accggtctct caggaaccaa acaacatcaa 120 tgattaagaa ggcgaagaaa actcattttg aaagtgccat caccgacgct gcagaagatt 180 ccagtctaat gtggaaaatt atcaacactt tcacaggaaa aacaaaaggc aaatgtcaag 240 tccggaaaat acgacgatct gacgagagtt gtatatctga accttccgaa atggctgagg 300 agtttaacaa ctacttctcg tcatgtgcag cagacttggc aagggatatc ccagtgtccg 360 aggaggatcc ccttcgtcat gttcctgagg caacatcgac tttccgcctg cggccagttg 420 aagaaacaga ggtcctaaac gagcttctta ggctcaagcc aaagaaagcc acaggagttg 480 acaagattcc atcaaggctc cttaaggatt ctgcgcctat aattgtcaaa ccgttagcac 540 acatctttaa cctgtctata acatcaggtg aagttcccga cgactggaaa ctggctaaat 600 tatcgcccat atacaaatca gggaacaaag acagtgtttc caactaccgt cctgtctctg 660 tcctaaatgt ggcctccaaa gtgatggaga aaatggttca caaccaggtc tcacactttg 720 tgaacagtac tggccagctc acggcccacc aaagtggttt cagaaaacac cacagcacag 780 gtacagcagt gcaaaaggtg gtggaggaca tccagtcagc acaaaacaac aagaaggtaa 840 ctgtcgcgct gttccttgac ctaaggaaag cgttcgacac tgtcaaccac caaatcctcc 900 ttggtaaact tcagaagatg ggttttgaca acggggcgat aaattggttc agatcttacc 960 tcacagaccg tttccagcac gtagatattc agaaccagca gtctacgcaa aaacgtgtca 1020 cgtgtggtgt cccgcaaggg agcgtgttag gtcctttgtt attcagctta tatgtgaatg 1080 atctaccaca ggttgtgcgt aagtgtaaaa tccatatgta cgcagacgac actgtgcttt 1140 attgctcggc tagcctgtct aaggaatgtg aggaaactgt gtcagaggac atgaaactgg 1200 tagctaactg gtttattgta aatagattat cacttcaccc tgacaaaact aaagccatgt 1260 tatttggatc acctcagaaa ctgcgtcacg caggaaacac tgtcactgta actgatgggt 1320 ttaactcatt tgagcaggta aaagtgtata cttacctagg tgtcaccatg gactcctctc 1380 taaactggtc agcacacaca gaaaggatgg taaagaagct ccttgctggc cttagtgcct 1440 taagccgggc taagccttat gttactaagg agatcctgct agtcatgtgt cgaaccttgt 1500 tatatactca cctggattat tgtgatactg catggttact gtcacttttt cattgtaaca 1560 aagtaagatc acaacagttg actaaactgc taaatcgggc agccaggatt attactgggc 1620 gatcactcaa agatcgtgta cctacagaaa ccctgcttac agaggctggt atggtacccc 1680 tggttgaaag ggtgaaattg aacacgttag tgactgtaca caaggcagtt cgccacaagg 1740 cacctgccta catgacacaa atgttcaaat ggctgtcgcc cccagtgctt agagttcgta 1800 ctcgcactgc tacgaaacag ctctacaact tcgaccctca catgctggac actccaaaag 1860 cacatgtgga aaccttcaag ggcagcctgc agtacctagg gcctgtcctg tggaatagtc 1920 tcgctgctga tcagcggaaa cacacgtcaa cgtcagcatt taaaaacgga ctgtagatga 1980 ctgcagatgt gcctatactg agactgtttt gtacatgtaa atgtttgcaa cgttatttgt 2040 atattgttgg gtttaaactg tgtcatgtaa tttttaatgt gtgtccaggg atacctgaaa 2100 atcagatcaa tggatctgag gtattccctg gtaaaataaa taaacttgaa acttgaaa 2158 // ID Poseidon-2_HM repbase; DNA; INV; 1946 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Poseidon-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-1946 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata (Poseidon RT group)."; RL Repbase Reports 8(12), 2085-2085 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 248..1795 FT /product="Poseidon-2_HM_1p" FT /translation="MYGLIKAHKPEKSYPMRVVVSTIGTPSYGISNYLVKI FT IQPVLNENPTRLKNSKXFINKXKSWXIDKDEIQVSFDIVNLYPSIPLKEAT FT LILLDQLNKSVSYKNSTKLTLTETKQLIELCLFRCYFLWNDEIHELENSGP FT IGLSFMVVLAESFLQYHEEKAIKMAMTIIPLIDIKSFHRYVDDSHARFSNL FT NQAEQFQTILNKQHPSIKYTIEVENENKILNFLDITVINNTKGKYEFKVYR FT KDAITNIQIKPHSNHDPKILKAIFKGYIHRAYSICSENHLKDEINFLIQVF FT TENGYNEKINCXFINIYTHIYIYKKDISDQVRKKRFANKNEIPSNSNNLPT FT ISLPWIPIISPKLRKIFRKAGYRTVFKSNANLKTLLTSKNKSKLPSNSHPG FT TYLIKCKCSKVYVGETKLQIRTRIQQHQKFLTEGKLNQSALALHKINCNED FT IKWDKVXTLKVEDKKFERKVREALEIQKHQCSPTYGGMNLDNGQYVKTNFW FT TSFFSYLGKRSHYKGDVKI*" XX SQ Sequence 1946 BP; 796 A; 284 C; 258 G; 596 T; 12 other; agcttttaaa gaaattaaag aaaaacattg atatatatcc gtttgataaa ggaacaggtt 60 ttgtaagaat agaacacgat aaagctattg aaaaaattcg agaacaaatt ggcccaacaa 120 raattagtga agatcccact ctagttatgc aactaaaatt acttaccttt caaaacttaa 180 taaaaaacat cgtttttcaa aagatgaata tgaaagtatt tatccaagtg atcctatccc 240 acctcgtatg tatggtctaa ttaaagctca taaacctgaa aaatcttatc cgatgagagt 300 agttgtatca actattggca cacctagtta tggaatttcg aattacttag ttaaaataat 360 acaaccagtc ttaaacgaaa ayccwactag attaaaaaat tcaaaagamt ttattaacaa 420 arctaagtcg tggtwaattg acaaagatga aatacaggta tcctttgata ttgtaaactt 480 atacccatca ataccattaa aagaagcaac tttaatactt ttagatcaat taaataaaag 540 cgtttcttac aaaaactcaa ctaaacttac tttaactgaa acaaaacaac taatagaact 600 ttgtttgttt cgttgttatt tyttgtggaa cgatgagatt catgaattag agaattcagg 660 cccaataggw ttgtctttta tggttgtttt ggctgaatct tttttacaat atcatgaaga 720 aaaagcaata aaaatggcaa tgacaataat tcctttaatt gatattaaat cctttcatag 780 atatgtagac gatagccatg ctagattttc taacttaaat caagcggaac aattccaaac 840 aattcttaac aaacaacatc cttctataaa atatacaatt gaagttgaaa acgaaaataa 900 aatacttaac tttctkgata taaccgttat taataataca aaaggaaaat atgagtttaa 960 agtttacaga aaagatgcaa taactaatat tcaaattaaa cctcattcaa atcatgatcc 1020 aaaaatttta aaagcaatat tcaaaggtta tatacacaga gcatactcaa tatgcagtga 1080 aaatcattta aaagatgaga taaatttttt aattcaagtg tttactgaaa atgggtacaa 1140 tgaaaaaatt aattgcmgct tcataaatat atatacacac atatatatat ataaaaaaga 1200 tatttctgat caggttcgaa aaaaacgttt tgcgaacaaa aacgaaattc catcaaactc 1260 taataacctt cctacaatct ctttaccatg gataccaata atatctccca aacttagaaa 1320 aatattcaga aaagctggtt acagaactgt ttttaaatca aatgctaacc taaaaacgct 1380 attaacgtca aaaaataaat ctaaattacc aagcaacagc cacccaggaa cctatttaat 1440 taagtgcaaa tgctccaaag tatatgttgg tgaaacaaaa cttcaaatac gaacaaggat 1500 tcaacaacac caaaaatttt tgactgaagg caagttaaac cagtctgcat tagctttgca 1560 taaaataaat tgtaatgarg acattaaatg ggataaagta awaacactta aagttgagga 1620 taaaaaattt gaaagaaaag tccgtgaggc cttagagata caaaaacatc aatgttctcc 1680 aacgtacggt ggtatgaatt tggacaatgg acaatatgta aaaacaaatt tttggacttc 1740 atttttttca tacttaggta aaagaagcca ttataaaggt gacgtcaaaa tttaaaaatt 1800 tttttgttgt tatttaacgg tcacaatttt tgtaattata ttatattata tgctgatgat 1860 gctggtgcct ataatccagc gaaaatttca aataaattat ttagtattga gagaattcgt 1920 attatctgat gttttaaatt agttta 1946 // ID Copia-19_CQ-LTR repbase; DNA; INV; 184 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_CQ_; KW Copia-19_CQ-I; Copia-19_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-184 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 354-354 (2011). XX DR [2] (Consensus) XX SQ Sequence 184 BP; 62 A; 44 C; 27 G; 51 T; 0 other; tgttggcaca atcgtgctag caactacaaa aagtccataa gacgattcgt ttcagtgcag 60 aatttattta cacacacaga cacacatacc aatgtatata acacaataaa ttcattcgaa 120 aagtagttct ctccggacac ggacgtgttc tttatttcaa tccgctgctc gtaaaactcc 180 taca 184 // ID CR1-100_AAe repbase; DNA; INV; 4815 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-100_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4815 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1188-1188 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 23 sequences with >95% CC identity. Closely related to T1 and Q. XX FH Key Location/Qualifiers FT CDS 460..1287 FT /product="CR1-100_AAe_1p" FT /translation="MTAACDRCAKSIKRGEEVIICMGFCDHVVHLRCANID FT KNIIKAINDSSNLNWMCDECVKLMKLTRFRNAISSVGNTINELTRNQQAAH FT DELKRELMKHSEQIAQLSNQFNSVTPLHPASINRRASKRRRMENDPQVTKP FT LLGGTKATDSVSIATVPQPTALFWIYLSRLHPSVKPDAVEKLTKESLQCNS FT AKAIPLIKQGTDVNSLNFISFKVGIDPEYRTAALDPSSWPSGILFREFENT FT NERNYWMPEPSTPSILVTSDSEVTPHQAAIDTADC" FT CDS 1330..4704 FT /product="CR1-100_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="METSSQIGAVQPIAAIVHRSRRGPAVESVEGVLQPAF FT PGKYSNSIEQTLPDRLPASSTLTMSSVPIASYSYRSSCSSTSNPGRSTLST FT MEAPGPPSSVAPSSVVEETLQQCRSCHPSRPGPESSVGEGVFQATCDGKYL FT SNNGVSCPDDSSIPRAATNARNDASAPTNILMYYQNVGGINSSLAEYQLAF FT SDGCYDIYALTETWLNGNTSTRQLFDDSYYVYRQDRSSSNSDKCSGGGVLL FT AVRSNYKSHVIDPPGSSMVEQVWIAISTVDAMLYLCVVYIPPDRVNDEIXI FT EKHLDSLNWVVSQMGPRDKLVILGDYNLSAIFWQRNFHGYLFPVATRSSIS FT LASRKLLDAYSTARLRQMNDVENENNRILDLCFFIEEIQNDCVVMQAPSPL FT VKMCRHHPPVLTKIAIRPQLRFRDTTEGVNYDFSKADFSGMNNFLANVDWD FT EALHDSEANLAASTLSGILLYAIDQFVPVKLRRDTPKPAWSNTGLNHLKRM FT KQSALRRHSKYRTDSTRATYMEVNAEYKKLNNRLYNSFQNRLQNRLKTNPK FT SFWRYVNEQRKEVGLPSTMTDGLNEFNSTMDIANLFRTQFSNVFINEHLHP FT QDIANATTNVPRLPASVEQFTITDNMVISAGKELKSSTGFGPDGIPSLVLK FT RCMNSLASPLAKLFNTSLSTNVFPDCWKHSYVFPVFKKGCRQSVSNYRGIA FT ALSSTSKLFELIVLRKLTQSYSHYISPNQHGFMSRRSTTTNLTCFTSFVIR FT QIESGHQVDAIHTDLSAAFDKMNHQIAIAKFDKLGLNDNMLLWLKSYLTGR FT SMSVKIGEHVSMPFSVWSGVPQGSHLGPFLFLLYMNDVNFILDCLKLSYAD FT DIKLYCTINKPQDSEFLQHQLEIFAEWCNINRMSLNVSKCSVISFGRRRTL FT LQFNYGLAGVELQRVTTVKDLGVLLDTKLTFKDHVAYIVSKASAQLGFLFR FT FSKKFTDVYCLKALYCSIVRPILEYSSVVWSPFYRNEIQRIEAVQRKFVRF FT ALRRLRWRDPLNLPSYESRCKLIDLDLLEPRRNVAKACFISDLLQGSIDSP FT LLLSSLDINTRRRNLRFHPFLNIPSARTNYGLHEPMRSMSRVFNSCYHVFN FT FNVSRGTNKCNFRQFLC" XX SQ Sequence 4815 BP; 1321 A; 1172 C; 991 G; 1330 T; 1 other; gaatcactgg catcactgct acttgtacgt aatgcaaatt gtattcggtc tcgtatttaa 60 gcttcgtttt atttgttaac cacatcctaa aatcgccgtt tctgttcccc gtgtactgta 120 tacgttcagt gaatcgagcg tttatgagtc aacgtctgtg attagtgcct ttgtgatata 180 atcgctaatt gcgctaagtc gtacttcgtt ttgctgataa acaataccaa tactgctctc 240 cacgctgaat taaaccaaag cgacgcaccc cggtggatag cgtaaactac aattgcaagc 300 tgcgcagcta gtgtgctttc acctactcga cttgcacgca agttaaattc gtagtcttcg 360 aagtagtggt gcacatacac actcgcatcc gtgcattgga cttacgctag cataaaatat 420 caagtgtgtg ctagaataag ggccaatcat ctcgtcgaaa tgaccgctgc atgtgaccgc 480 tgcgcaaaat cgatcaagcg tggcgaagaa gtgattattt gcatgggatt ctgtgatcat 540 gtcgttcatt tgcgatgcgc aaacatcgac aaaaatatca tcaaagccat taatgattca 600 tctaatttga attggatgtg tgatgaatgc gtgaagttga tgaagttaac gcgcttccga 660 aatgccatct cttccgtcgg aaacacgata aatgagctca ccagaaatca acaagctgct 720 cacgatgaac tgaaacggga attaatgaag catagtgaac aaatcgctca gctttcgaat 780 caattcaact cggtcactcc attacatccc gcttctatca atcgtcgcgc atcaaagcgc 840 cgccggatgg aaaatgatcc tcaagttacc aagcctcttc ttggtggtac aaaggcgact 900 gactctgtca gtattgcaac cgtgccacag ccaactgctt tgttctggat ttacctctct 960 cgactgcatc ctagcgttaa accagatgca gttgaaaaat taacgaaaga gagcctgcaa 1020 tgcaattcag caaaagctat tcctctaatc aagcagggaa cagatgtgaa ttcgttgaat 1080 ttcatttcat tcaaagtcgg cattgatcct gaatatcgaa cagcagccct cgacccttcc 1140 tcatggccaa gcggaattct cttccgcgaa ttcgaaaaca ccaacgaacg gaactattgg 1200 atgcccgaac caagtactcc atcgatcctt gttacatcag attccgaagt aactccccat 1260 caagccgcca ttgacactgc tgactgctga ccacaaaata ataaaccacg acgcaccgta 1320 gaaagcacaa tggagacctc ttctcagatc ggcgcagtcc agccaattgc agccatcgtt 1380 catcggagtc gtcgtggtcc tgctgtcgaa tctgtggaag gggttctcca gcctgctttt 1440 ccaggcaagt actctaactc aatagaacaa actcttcctg atcgtctccc cgcttccagt 1500 acattaacga tgtcctctgt acccatcgca tcgtattcgt accgttcatc atgctcatca 1560 acatctaacc cgggacgctc gacactaagc actatggaag cccctggtcc acctagctca 1620 gtcgcgccat cgagtgtcgt cgaagaaacg ttgcagcaat gtcgttcgtg ccatccaagt 1680 cgtcccggcc ctgagtctag tgttggtgaa ggggtcttcc aagccacctg cgatggcaag 1740 tatctatcca ataatggtgt gtcttgtcct gacgattctt caatccctag ggccgcaacc 1800 aatgcgagaa atgatgcgtc ggctcctacc aatattctaa tgtactatca aaatgttggc 1860 gggatcaaca gctctcttgc cgagtatcag ttggccttca gtgacggttg ctacgacatc 1920 tacgccctta ccgaaacctg gcttaacgga aatacttcta ctagacaact gtttgacgac 1980 tcttattacg tctatcgaca agataggtca tcctcgaata gtgataaatg ctcaggaggt 2040 ggagttttgc tagccgttcg ttcaaactac aaatcccatg tgatagatcc tcctggaagt 2100 tcgatggttg agcaagtttg gatcgctatc agcactgttg atgcaatgct atatttatgc 2160 gtggtctata ttccaccaga tcgtgtaaat gacgaaattk tgatcgaaaa acatctcgac 2220 tccctgaatt gggtggtctc acaaatggga ccaagagaca aactcgttat tctgggtgac 2280 tacaacttaa gtgcgatctt ttggcaacgg aattttcacg gttatctttt tcccgtcgca 2340 acccgatcct cgattagcct ggcttcgcgg aaacttcttg acgcgtatag taccgctaga 2400 ctccgccaga tgaatgatgt ggaaaacgaa aataatcgca tactggatct gtgctttttc 2460 attgaggaaa tacaaaacga ctgtgtggta atgcaggctc cgtccccact cgtaaaaatg 2520 tgtaggcacc atccacctgt gctgacgaaa attgcgataa gaccacaact acgcttccgt 2580 gatacaactg aaggtgtcaa ctacgacttc agcaaggccg atttttccgg aatgaataat 2640 tttcttgcga atgttgactg ggacgaagcg ctccacgatt ctgaagctaa tctcgctgcg 2700 tcgaccttgt ctggtatact gctgtacgct attgaccagt tcgttccagt caagctcagg 2760 cgcgatacgc ccaagccagc gtggtcgaat accggactta accatctgaa aagaatgaaa 2820 caatcagctc ttcggcgaca tagcaagtac cgtacagact ctacgcgagc aacatacatg 2880 gaagtgaatg ccgaatacaa gaaactcaat aatcgtctgt acaactcttt ccaaaaccgt 2940 ttgcaaaacc gcctcaaaac taaccctaaa agtttctggc gttatgtaaa tgagcaaaga 3000 aaggaggtcg gattgccctc tacgatgacc gatggcctga acgaattcaa ttccaccatg 3060 gacattgcaa atctttttcg cacccagttc agcaacgttt tcatcaatga gcatctccac 3120 cctcaagaca tagccaatgc caccacaaat gttcctcgac tacctgcatc agtagagcag 3180 ttcactatca ctgacaacat ggtcatctcg gctggaaagg aactgaaatc gtctactgga 3240 tttggccctg acggcatacc ttctctagtc ctcaaacgct gtatgaattc actagcatcg 3300 ccgctggcaa aattattcaa cacatctcta agtactaacg ttttcccgga ctgctggaaa 3360 cactcctatg tttttccagt gtttaaaaaa ggatgtagac agagcgtttc gaattatcgt 3420 ggaattgctg cactgagttc aacatctaaa ttgtttgagt taattgttct tcgcaaacta 3480 actcagagct actctcatta catatcgcca aatcagcatg gctttatgtc tagacgttca 3540 acgacaacca atttgacttg ctttacctca ttcgtgatac gccaaataga atctggtcac 3600 caagtagacg cgatacacac cgatttatcg gcagcgtttg ataagatgaa ccatcaaatt 3660 gcaattgcga aatttgacaa actgggcttg aatgacaaca tgcttctctg gctgaaatcc 3720 tacctgactg gacgaagcat gtccgtcaaa attggcgaac atgtttcaat gcctttctcg 3780 gtctggtctg gcgtacctca aggcagccac ctcgggcctt tcttgttttt gctttacatg 3840 aacgatgtca actttattct ggactgccta aaactttctt acgctgatga tataaagctt 3900 tactgcacaa tcaataaacc gcaagattct gagtttttgc aacatcagtt ggaaatattc 3960 gcagaatggt gtaatatcaa caggatgtcc ctaaatgtgt ccaagtgctc agtgatttca 4020 tttggccgca gacgcacact tctacagttt aattatggtt tggctggtgt cgaactgcaa 4080 cgtgtaacta cggtgaagga cttaggcgtt ctattggaca ctaagctgac gtttaaggat 4140 catgttgctt acatcgtctc gaaagcttct gcgcagctgg gttttctgtt ccgcttcagc 4200 aaaaagttca ccgatgtgta ttgcctgaaa gctttatact gctcaattgt acgccctatc 4260 ctggaatact catcggttgt atggtcacca ttctacagga acgagattca acgaattgag 4320 gccgttcaac gcaaattcgt gcgcttcgcc cttcgccgac ttaggtggag agatccactg 4380 aacttgccaa gctatgaaag ccgctgcaaa cttattgatc tggaccttct tgaacctaga 4440 cgcaatgtag caaaggcgtg cttcatctct gaccttttgc aaggatcaat agacagccct 4500 ctgctgctca gttcattgga catcaacact cgtcgccgaa atctccggtt tcatccattt 4560 ctgaacattc cttccgccag aacaaattat ggccttcacg aaccaatgcg tagcatgtcc 4620 cgtgtgttta atagttgtta tcatgtgttt aattttaatg tgtcgcgtgg aacaaataag 4680 tgtaatttta gacaattttt atgttaattc atgtgaaatg tttgtttaga ttttttgtgc 4740 ttttagactt aagatcaatt gtcattgggg tgggaacttc acctgttgac taaagcagta 4800 aataaataaa taaat 4815 // ID DNA8-11_CQ repbase; DNA; INV; 1308 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-11_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1308 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 88-88 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% identity. CC 8-bp TSD. ~110-bp TIRs. XX SQ Sequence 1308 BP; 478 A; 193 C; 191 G; 446 T; 0 other; tagagcgtcc aatttcccgg ggttacaaaa ttcccgggaa acgggaaatt ttcaacaaat 60 ttcccgggaa atcccgggaa ttcccgggaa atttgaaatt taacgaaaat tattctaatc 120 ctgtttctga ttaatctttt gcaacgaaat tgtatagaac agcaacttta atggtcaaaa 180 tgagtgtgag gatcaattaa tggcttgact gcttgtaaaa aatcatgcaa ctttgagaaa 240 atatataaac tttctaattt ttaaatcttt caaatcgtcc caaatgaaag aaaatattat 300 tagtattttt gttgagaagg atttttttaa atcaaagtgt ttggacagtg aaaatctatg 360 ctccacaacc ataaaattgc ttttaaattt gtctcagtga ccgttaaaat aaaaaaaaaa 420 aacaaaaaat tcaacttgtg aataaataaa taatcattga atttacctta aaaatctaat 480 ttctagcgct ttttaacttg aaacactatt ttttcaactt gaaaatttga tacgaagttg 540 ttttttaatg gtttttcatg gttttatgat atgattttag ttatatggta ttcagcctaa 600 taaatcaatt agattaaaat aattttgagt atttaaaagt ggttgaaatt caacgatatt 660 ttttcatata taaccctctt acgcccttgg tagctcacgt gcttcataaa tttataactt 720 caatctgtaa tttctcgaat acttagaaac aaattcttct cacacaaacc tttttttaaa 780 atttaattat atcagttcaa tttccatcca acatgttcaa ggtaaaaaaa agtcaaacaa 840 atctcaatgt tgatcttggc cggaagatgt aaatatctta ttttcaaatg atattacaaa 900 aaaaaaacat taaaaaaggt tgtcaaaaat aaaatagttt atattcattc cataacagcg 960 taagttttgt tgtccaacat ataagcaaac aatattccta gtggtgaatg cttcagaaat 1020 aaatattatc tgaattaaac cttgacatga aatttacaga caagctgatt agttcaatat 1080 gaacagtaga tggaaataaa taaaatcaaa tcaggttctc caattattat tttttgtata 1140 atttcaaaac attggtgcca aaaaggtagg aaaatgtata cttctgcttg tatttcggga 1200 attcccggga aatttacaaa tttcccggga aacgggaaat atttttttcc gggaaatccc 1260 gggaattccc gggaattttt ttcccgggac gggaaattgg acgctcta 1308 // ID Copia-26_DPu-LTR repbase; DNA; INV; 222 BP. XX AC scaffold_38; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: long terminal DE repeat. XX KW LTR Retrotransposon; Transposable Element; Copia-26_DPu-LTR. XX NM Copia-26_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 716-716 (2010). XX DR Genome; scaffold_38; Positions 247163 246942. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 222 BP; 56 A; 56 C; 33 G; 77 T; 0 other; tggtgttata atcgtcaact acagaaacca ccactagatg gtgcagctct cttgtcattt 60 atgttccctt ccttgtcagc agacgaagta tcttgccttc ttcattctcg tgttcctgcc 120 aagacaagtg tgtctttcct cctgtgctcc attcagttca cttaagtaca ataaacgctt 180 ttctgtttct aactaaaagg tatttaatta caaaatccca ca 222 // ID Mariner-27_HM repbase; DNA; INV; 4145 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-27_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4145 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1961-1961 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1184..3403 FT /product="Mariner-27_HM_1p" FT /translation="MGKTTKITKKNNKIKKKTASSFNSKLNQKVTKVKRFQ FT WSNSNMKNAIAAVQNNLMSQREAATQFSIPRSTLGKYLKGDSVLGVKPGPS FT PILSTNLELKLVEYAATRAALGFGFGKRQFKKYASDLAIKHNLKFKHSTPS FT EKWWRSFNQRHKNLVLRKPEGTSSVRHQCMSIPKVAKYFYSPHEVLRDTHA FT LLKPSLIWNMDESGLQMDFKPPKIIAKRGTKHLQSRTSGKRETITVIAAVN FT AAGGLVPPHLIAKGKTARSLRSFNTADAPSGSNWSFSETGWTKQGIAFLWF FT TNTFLPNIGRDRPQILIVDGHDSHNFVELLSVAIENNIHIVEMPAHCSHWL FT QPLDRTVFGPLKTYYNQTCHELMNEYPGVTIDKSTFCGLFKKAWDQALTAD FT NIISGFRSCGIYPYNPSAVPSTAYLPNSVYSIQQLLDSNDLLESSLHPIKT FT DTKITRLEYAIEEEVDFEVELVNKPEXMSECKIDNNLINLDNNTASLIWDT FT HLTHEQFTTFNFCIQNGYDIPDPLYKLYKEFKLKQISCTNAILEIDDSVPM FT PNESIVTIPFLNETIVQQIPVPNETTGQSSFFSNETTNDCNQMPIGAIVQP FT SLTPIMLCTNHSNNDSDNDILPYPKIYAIKSKENRKKVQKYFVLTSQEAMQ FT SKVNDEEKKKEKVAAAEEKKLKKQKENKAQQKQEKEIQKLSKKANSVKMRK FT XSSIPKILEXNISSDNQNICIKVPDNAIISIDNNXSELM*" XX SQ Sequence 4145 BP; 1462 A; 602 C; 604 G; 1466 T; 11 other; gggcaaagtg ccctaataat ggccgggtct tattaagagc caaatttggg tccgctttta 60 ttgcatttat ttatttggcg gaacatttat ctagttgttg attatacttt tctgtttctt 120 taatatatct atgttttaat ctaactttta taaatgtttt tttcagtctt gattttaagg 180 aaatcttatt tttctcaatg ttaatttaag taaatatttt atgccgtaaa taatmatttt 240 tggttttttc atgcaacttt gagacacatt taacgcaata ttctttcagt tacggctttt 300 tggtttattt tgatctgttg tgtactttaa tattattatg taaactgttt tgttcgtcaa 360 tatacgatag gcagaagcat caatataacc taaactccca gatttcaaat aggtctcgat 420 aatagccact catttgacca aataagagcc acggctctta tttgtaatat cgtattatta 480 tgtggctatt attaggacaa tagtttaatt tattgtgtat atagtttaaa tacaaattgt 540 gtcacaataa attaaactta tatgtcctta ttataagtcc taataactta tatgtcctaa 600 ttatatcctt attataattt tacttagtgt ttatacacgg taaagtaaat actgattata 660 gttaatataa ggcttatatt aactatatcc agtatttact ttaccatgta taaatactaa 720 gtataattag tatttacttt acagtattta ttataacggt ttattatgtt tgtaagtatg 780 taattttatt ttttataatg tttgtaaata tttaaagtaa tggtttatat atgtatttta 840 taatttatat aatacatata aacaaaaatt attattatta atatttacta gctacataac 900 aatttaaaat atatatattt tgtatattta atttaatgct cactaagtat accaatgaaa 960 ttctacctaa atcatatcta attatgttgt atttaaccat tgtacttatg tatgctattt 1020 ctgttatatt gatgtataga tatgtatgat aagcatactt tatgcttgta tatgtatttg 1080 tttgtgacat ctgtatatgt tatgataata tatttagatt agatttatgt aaggtcaact 1140 attgtagtta gtatactcta tcttttagtt aatttgagta attatgggaa aaacaacgaa 1200 aatcacaaaa aagaataaca aaattaaaaa gaaaactgcc tctagtttta attcaaagtt 1260 gaaccagaaa gtgacaaaag taaagagatt tcagtggtcc aacagcaaca tgaaaaatgc 1320 aattgcagct gttcaaaata atttaatgtc gcaacgagaa gctgcaacgc aatttagtat 1380 tcctcggtca acccttggta aatatcttaa aggagattca gttcttggtg tcaaaccygg 1440 acctagtcca atattatcaa cgaatttaga gytaaaacta gttgaatatg cagcaacaag 1500 agctgcactt ggatttggat tcggaaaaag acaatttaaa aaatatgctt cagatttagc 1560 cataaaacac aatttaaagt ttaaacattc aactccttca gaaaaatggt ggcgctcatt 1620 taatcagcga cataaaaatt tagttttgcg aaagcctgaa ggtacttcat cagttcgcca 1680 tcaatgcatg tctataccaa aagtagctaa atatttctat tcaccgcatg aagttttgag 1740 ggacactcat gcccttttaa agccttcatt aatttggaat atggatgagt ctgggctaca 1800 gatggatttt aaacctccaa aaataattgc aaaaagaggt acaaaacacc ttcaatctcg 1860 cacatctgga aaacgtgaaa ccataaccgt tattgctgct gtaaatgctg caggtggttt 1920 agtacctcca catttgatag caaaaggtaa aacggctaga tctcttcgtt cttttaatac 1980 tgcagatgca ccatcaggct caaattggag ttttagtgag acaggttgga caaagcaagg 2040 aattgcattt ctttggttca ctaatacttt cttgcctaat ataggtagag ataggccaca 2100 aattctcata gtagatggtc aygattctca taactttgtt gagttattat ctgttgcaat 2160 cgaaaataac attcatattg tggaaatgcc agcacattgt tcccattggc ttcagcctct 2220 agatcgcaca gtatttggtc cacttaaaac atattataat caaacatgtc atgaattaat 2280 gaatgaatat cctggagtta ctattgacaa gtcaactttt tgtgggcttt ttaagaaagc 2340 ctgggatcaa gctctcacag ctgataatat tatttcaggg tttcgatcat gtggtattta 2400 tccatataat ccgtctgctg ttccaagcac agcttacttg cctaattctg tctattccat 2460 acaacagcta cttgactcaa atgatctgct tgaatcaagt ctgcatccaa tcaaaactga 2520 cactaaaata acaagattag aatatgcaat agaagaggaa gtcgattttg aagtagaatt 2580 agttaataaa cctgaaarca tgtctgaatg taaaatagat aataatttga taaaccttga 2640 caataatact gcttctttga tatgggatac ccatttaact catgagcagt ttacaacttt 2700 taatttctgt attcaaaatg gctacgacat cccagatcct ttgtataaat tatataaaga 2760 attcaaactg aaacagattt catgtacaaa tgccatttta gaaatagatg actctgttcc 2820 aatgccaaat gaatccatag ttactattcc atttttaaat gaaaccattg ttcagcaaat 2880 tcctgttcca aatgaaacaa ctggrcagtc tagttttttt tccaatgaaa ccactaatga 2940 ttgtaatcag atgccaattg gagccattgt tcagcctagt ctgacaccaa ttatgttgtg 3000 tacaaaccat tcaaataatg atagtgacaa tgatattttg ccttatccaa agatatacgc 3060 tatcaagtca aaagaaaatc gaaaaaaggt tcaaaaatat tttgtattaa cttctcaaga 3120 agcaatgcaa tctaaagtaa acgatgagga gaaaaaaaaa gagaaagtcg cagcagctga 3180 agagaaaaaa ttgaaaaagc aaaaagaaaa taaagcacaa caaaaacagg agaaagagat 3240 tcaaaagtta tctaaaaaag ctaactcagt taaaatgaga aaaktcagtt caattccaaa 3300 gatattggaa ktcaatatct cttcagataa tcaaaatatc tgtattaaag taccagataa 3360 tgcaattatc tccatcgata ataataytag tgagctaatg taaatagatt attatcaacg 3420 gttcataata aatatattgt gcaaacttct tctcgttatt ktaatactat caaaacttct 3480 aatatattat tgtaaagctt attgtccata tattgaaaat tgcattatta tatgcaagtt 3540 gaaatcttat aacagctatg tgctaaagct attttttatt taaaatttgt tttatgcttg 3600 attttctata aaagtaatta aaaagtttct taatttattt tgtatttcct ttttgataga 3660 ctgtcttaat ttgaataaca aataaaaact gtttaagttt ttttgtttat tatcaaccaa 3720 aataatattt ttaagcgaag tatagaatat tttactttta aaaaaaaaaa tcttggttta 3780 tagttctata aagtagatcc tagttcatta ttaggcttta atcgtctgtc tgaataatac 3840 ttttcaatat ataagcattc ttagatatat cgattttagg taatgtaagt ggctcataat 3900 aggtaccgtt tcttaatttg ctggctatta tttggtctac ccggccaata attggtacaa 3960 atttaaaatc tcagtttaaa tagatttttc tctgaattaa aaatgtttta aaattaaaaa 4020 aggtttgtat aaggtgtagg cgttggtcat taataaaaca gtaaaaaaaa aattcaccgt 4080 tagtttaaga catcctycac gacaatttat aaaaaatttg tggccattaa ttggtcactt 4140 tgccc 4145 // ID BEL-3_CQ-LTR repbase; DNA; INV; 797 BP. XX AC AAWU01000122; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_CQ_; KW BEL-3_CQ-I; BEL-3_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-797 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 160-160 (2011). XX DR GenBank; AAWU01000122; Positions 85248 84452. XX SQ Sequence 797 BP; 314 A; 140 C; 160 G; 183 T; 0 other; tgtagctgac ggtgtcagat ttagttgaag ttaaattaac cgaaagagtt tcaattatta 60 taaaaaaact tactcgcgct gtctgtgacg aaacaatcac gaaaatcaag aaattcccaa 120 gcggtgtcgg gtaatttaga agaaagattt atttattaca tttcctgtaa gaaacacgaa 180 atccaagaaa accattatcg acaggaaagg cagctaggtt tctaggaaca tcagaaaacc 240 cttttcagaa gtacaaatca gaaggttaaa attcaaacga agacataacc atatatcctg 300 caagtttatc agatcctaga atcagaaaca cgaaaagacg gtattgtccg cacagagctt 360 gagaacaagc atccgaagca tgttcgcgca cctgaataca aagtagcacg taaagagtga 420 aagggacgat cgagcaaacg gattaatcga tcaaagcaga gagaaagtag aaattgagcg 480 aacgacgcca ctagaataga gaggacaaaa catagaacga actagaccac cgcgtaagcg 540 agatagacaa gctatgcgcg tatggaagaa aaggacaaaa gtccaggaga gaggtttaga 600 acaagagaga aatcccttga ttgagagcat tgaaaaattg tactaaattt aaggaactat 660 ataatgtaac cgaaactttt taataaacag tcacttttta atcaccagtt tgttagttta 720 caccctcgaa cttacgccgg gttaatttac taataatccg aaaataatgt tattgaagta 780 atgcactgca cattaca 797 // ID Gypsy8-LTR_Dya repbase; DNA; INV; 337 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8_Dya; KW Gypsy8-I_Dya; Gypsy8-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-337 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1071-1071 (2009). XX DR Genome; chrU; Positions 820325 819989. XX SQ Sequence 337 BP; 118 A; 94 C; 67 G; 58 T; 0 other; tgacggcaca agcatgcctt cacctccccc gaaaccccca cgggcagcga cctacatatg 60 agacactcaa cccggacaca caccgcagca aagtcatcaa cccacactgg atcagcagag 120 tcaggattcc caagaatcca accgaagcac tggaaggggc atccgggtga aaagtgccga 180 cgcaaggggt ttcgcagaac aaaagaattg tacgctaggc taagaggaaa atttattcaa 240 aaaataaaat cattctgtca ccgaacttcg aacgagtcac tttaaatact taaaatcctc 300 cttggcttaa agataaaaca gaacagatcc tcgacca 337 // ID I_Ele11 repbase; DNA; INV; 6046 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele11. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6046 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6046 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 14 sequences with >96% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 43..849 FT /product="I_Ele11_1p" FT /translation="MQCFNCWLFGHTKMRCQAEKAACGTCSGDHXIAENRE FT CSNNTFCKTCNXEDHKISSRACPQWQXENAIQKVKVDQGIPYPAARRIVEQ FT NRNGRSFXAVVGPATTESNQQINGRADQHASALAAKDAEIAELRXALAIRS FT APLAVANNEIEQLKSIVAEQAKQIQLLTEQVSVFLKAVMPAASFPNLTEPX FT KTATXAVLPVSNNTNTASSASSTISDXPINTPSSNPPSNKDIPGFFTTSSE FT KSSFTTNTKTAPYQSFPPKRKRNCKRHS" FT CDS 1115..5671 FT /product="I_Ele11_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MPQPASPDRLRHRAQNNSPSTSLDSQGANVPDVLPQP FT ESLGRLRRLEGNMDEELLFHFVHCPARTPGSQSPVSAAVSHSGTGGHSLAL FT GEDLEEGNQASEPTSLDDEGSEGSQAIGTRRVLPATPMDYAGSQVASTSSV FT EDLGHPGNPNPGGKSHXAVVSSSDALLAPXSTVVAFTLDDVGSQGLQALGT FT HRALPVTPKDNAGSEATSKHRSIQALIPGNPTHRLSPLATPFFPAVGTSLH FT LVGPSSGPAICRALSCSCTESDTEIPSDRHSTAIESPPLIVSSPSAYTVGR FT NVVHPCTNTTPNQTLSHPTPALPEPARFCLQWNMNGFFNNLADLEILTSSD FT PPWILALQEVNRVTLEQLNRSLGGRYVWNLKRGSNFRHSVALGVLKSIPSA FT FFNIDSQLPAIGVKVQGSTTMSVACVYLPCGNIPNLRGEVEKIIQQLPEPR FT MVVGDLNAHHPAWGGSRSDSRGNTLLNLFEDSDLTILNDGSPTFFNSRYST FT AIDVTAVTRSEIGRFSWCVNSDPHGSDHHPLQISLAVEAPVTTRRPRWLYD FT QADWTAYNNSISVSLRSRHPSNMVEFTAVLTEAAAVSIPKTSSKPGRKALR FT WWSPEIKKVIKARRKALRAMKRMAPDHPNRDVIVDRYHIARNTCRQAIRDA FT KRACWEEFLDSINATQSTADLWAKVNALNGKRIVTPPTLDIDGNSTADPEI FT IADGLGKYFANLAAFGTYDPVFVRQTGATANDINSFTVPEDNAQLPINQPF FT SFRELQYALSRSRGKSAGPDDVGYPLVKNLEGHGKLILLELINQLWTDDSY FT PPEWRESFVVPIAKPSNQARDPANYRPIALTCCLSKVVERMVNRRLTHHLQ FT QRGLLDHRQHAFRPGYGTNTYFAALGEALQEAKSLEHHTEIVSLDISKAFN FT RVWAPLVLEKLASWGFTGHVLHFVRNFLTNRTFRVVIGNTKSDSFREETGV FT PQGSVISVTLFLIMMNGVFQNLPNGVKIFVYADDIVIVVSGPTIAATRRKA FT QAAVTRVAKWASSVGFTMSASKSVRCHVCSSGHRVSGPPININNQAIPIRK FT TAKILGVTIDRGLTFKPHFEGVRANCRSRLNLLRSLSRPHRSNNRNIRFRV FT ATAIIDSRLLYGLEITYLAMNNLIDALSPIYNRYIRIVSGLLPSTPADAAC FT VEAGLLPFRFLVIITLCTKAAAIAERTAGNRRTRLMEEADHQLENATGSRL FT PPIAKTHWCGERDWRSSNLKFDDRIKQSFKAGDSSIRLRKTVAEILRKDYS FT AHQRRYTDGSLSVRGVGIGITDENLATSLSLPGQCSIFSAEAAAILYAATV FT PTTSPIVIITDSASCFTALQSETPRHPWIQGIIKKAPCDVTLMWVPGHSGV FT PGNATADFLAGTGPSGPRYSTRVPLADIRRWVKSTIRQTWQAEWANSRGAY FT LRKIKRSTDAWTDLNSMKDQTIISRLRSGHTRMSHNYGGNPFHRSCEICDT FT GNTVEHFICNCPAFDSPRQMYGISGSIREALQDDLSSMAALISFIKDAGLY FT FKI" XX SQ Sequence 6046 BP; 1557 A; 1767 C; 1343 G; 1358 T; 21 other; gggtwcatwc gctgcaggac tcggccttat tatccgagtc saatgcagtg cttcaactgt 60 tggctcttcg gccacacaaa aatgcgctgc caggccgaaa aagctgcctg tgggacttgt 120 tcaggagatc acttkatcgc mgagaacagg gagtgtagca acaatacttt ctgcaaaact 180 tgtaacagkg aggatcataa aatatcgagt cgtgcctgcc cccagtggca gtkcgagaat 240 gcgattcaga aggttaaggt cgaccagggc atcccctacc ctgccgctcg cagaatagtt 300 gagcagaacc gmaatggaag atctttckcc gctgtagttg gtcctgctac macggagtca 360 aatcaacaaa tcaacggtag agcggatcaa catgcatccg cccttgccgc aaaagatgct 420 gaaattgccg aacttcgagm agcgcttgct atccgctctg cwcccctggc agtagccaac 480 aatgaaatcg aacagctaaa atccatcgtc gctgaacagg cgaaacaaat tcagttactt 540 actgagcagg tttctgtttt tcttaaggct gtcatgcccg cggcaagctt tccaaacctt 600 acagaaccca wtaaaactgc cacacmcgca gtactacctg tttccaacaa taccaacact 660 gcatcctctg cktcatcaac matctcggat mttccgatta acaccccttc cagcaacccc 720 ccatcgaata aagatatccc cggattcttc accacaagca gtgagaaatc tagcttcacc 780 acaaacacca aaaccgcccc gtaccagagc ttccctccca agcggaagcg gaactgtaaa 840 cgacatagct aaaacctctg gtaaacgccc aatcagctct gtctcacgaa cggaagcgct 900 gttccaacag cacaaaaaaa ccaaaaagaa ggtgcccgaa ggtccaacga ccattcctaa 960 gaaacggtaa gtccatcatc cctcttcctc tattttctca ccgccacccc atccacccaa 1020 atctgacacc cccaacactg ccttccaccc cattatcgca agtggtagtc cacagtcaag 1080 cactgatcgt ccgagcaccg gcgatctgga agccatgccc caaccagcat cgccggaccg 1140 actccgacat cgtgctcaaa acaacagtcc atcaacctct ctcgatagtc agggcgccaa 1200 cgtaccggac gttttacccc aaccggaatc gttgggtcga ctccggcgtc ttgaaggtaa 1260 tatggatgaa gagctcctgt tccactttgt tcactgccca gctagaaccc ctggtagcca 1320 gagccccgtc agtgcggccg tttcccactc cggaactggc ggacattctt tggcgctagg 1380 ggaagacctg gaagagggaa accaggcttc cgagcctacc tccctggacg acgagggaag 1440 tgaaggttca caagcaatcg ggacacggag ggttttgccc gccaccccaa tggactacgc 1500 gggaagtcaa gtcgcatcta cctcctccgt agaggatctt gggcaccccg ggaaccccaa 1560 cccgggtggt aagtcccacw ccgccgtcgt aagctcttct gatgctctct tagcaccccw 1620 ctccaccgtg gtcgctttca ctctggacga cgtgggaagt caaggcctgc aagctctcgg 1680 gacgcatcgg gctctacccg ttaccccgaa ggacaacgcg ggaagtgaag ctacctccaa 1740 acacagatcc atccaagctc taattcccgg caacccaacc caccgattat cgccccttgc 1800 aacccccttt tttcctgccg tcgggacatc cctgcatttg gtgggcccat cttccgggcc 1860 tgccatctgt agagctctgt cttgttcgtg caccgagagc gataccgaaa taccatccga 1920 tcgccattca actgcaatag aatctccgcc gctcattgta agctcaccct cagcatacac 1980 cgttggtagg aatgtggtcc atccatgcac taacaccact cccaatcaaa ccctaagtca 2040 tcccacccca gctttaccgg aaccagctag gttttgcctg cagtggaaca tgaacgggtt 2100 ctttaacaat cttgcagacc ttgagatcct cacaagcagc gatccccctt ggatmctggc 2160 cctccaagaa gtcaaccggg tcacactcga acaactgaac cgctcactcg gcggtagata 2220 tgtatggaac cttaagcgcg gaagcaactt ccgccattcc gtcgccctcg gtgtacttaa 2280 gtcgattccg tctgcctttt tcaatatcga ctctcaactt ccagctatcg gggtcaaagt 2340 tcagggaagc acaacaatgt ctgtggcatg tgtataccta ccatgcggaa atattcctaa 2400 cctgcggggc gaggtagaaa aaattattca acaactgccc gaacccagaa tggtcgttgg 2460 tgacttaaac gcccaccatc cggcctgggg tggatcccgg tcagactcta ggggcaatac 2520 cctactaaat ctcttcgaag atagtgatct tactatccta aacgatggtt ccccaacttt 2580 cttcaatagt cgctattcga cagcaatcga tgttacggct gtcacccggt cggagatcgg 2640 acggttcagc tggtgcgtca attcggatcc acatggtagc gatcaccatc ctcttcaaat 2700 ctcgttagcc gtggaggctc ccgtcacaac caggcggccc cgctggctgt acgatcaggc 2760 agattggact gcttataata acagcatcag tgtatcactt cgctctcgcc acccgagtaa 2820 tatggtagaa ttcactgccg ttctcacgga agcggctgcc gtttcgattc cgaaaacaag 2880 cagcaagcca ggacgaaaag ctcttcggtg gtggtctcca gaaatcaaaa aagtgatcaa 2940 ggcacgtaga aaggctcttc gcgccatgaa gagaatggca cctgatcatc caaaccgaga 3000 tgtaattgtt gatcggtatc acatcgcacg caacacttgt cggcaagcca tccgagacgc 3060 caaacgagcc tgttgggagg agttcttgga tagcatcaac gcaacccaat caaccgctga 3120 cctatgggcc aaggtcaacg ccctcaacgg aaaaagaatc gtcactcccc ccaccctcga 3180 tattgatgga aacagtactg cggatccaga aatcattgcc gacggcctgg gaaagtattt 3240 cgccaatctc gcagcctttg ggacgtatga tccggtcttc gtccgccaaa ccggtgcgac 3300 tgccaacgac ataaacagct tcactgtacc tgaggacaat gcacagcttc ccattaatca 3360 acctttctca tttagggagt tgcagtatgc tttgagtcgc agtaggggga aatccgcagg 3420 cccagacgat gttggctacc cattggtgaa aaatcttgaa ggtcatggta agcttatcct 3480 acttgagctg ataaatcagc tgtggactga tgactcctac ccgccggaat ggcgagagag 3540 cttcgtcgtc ccaatcgcga aacccagcaa tcaagctcga gatccagcaa actatcgtcc 3600 aatcgcactc acctgttgcc tctctaaggt agtagagcga atggtaaacc gtcggcttac 3660 acatcatctc cagcaacgcg gcctgctgga tcaccggcag cacgcctttc ggcccggtta 3720 cggtactaat acctacttcg cggccctcgg cgaggctctg caggaggcga aatcwttgga 3780 gcaccatacc gaaatagttt ctttagatat atcaaaggct ttcaacaggg tttgggcacc 3840 tctcgtcctg gaaaagttgg ccagctgggg tttcaccggc cacgttttgc acttcgtaag 3900 aaacttcctc accaatcgta ccttcagggt ggtcatcgga aacaccaaat ccgattcatt 3960 cagagaagag accggcgttc cgcaaggatc cgtcatatcg gtgactctgt tcctaataat 4020 gatgaacggc gtcttccaaa accttccaaa cggcgtcaaa atattcgtct acgcggacga 4080 catcgtaatt gtagtatctg gacctacaat cgcggctaca cgaaggaaag cccaagctgc 4140 cgttaccaga gtagccaagt gggcgtcctc cgtcggcttc actatgtcgg cttctaaaag 4200 tgtccgatgc catgtgtgct catccgggca cagggttagt gggcctccga ttaacatcaa 4260 caaccaagcc atccccatcc gcaaaacggc aaaaatcctc ggtgtcacta tcgaccgagg 4320 cctgaccttc aagccccact ttgaaggagt cagagccaac tgccgaagtc gtctaaattt 4380 attgagatcc ttatcgagac ctcatcgcag caacaaccgg aatatccgat tccgtgtagc 4440 cactgccatc atcgatagtc gcctgctgta cggattggaa atcacgtatc tggctatgaa 4500 caacctcatc gacgcgctat ccccgatcta caaccgatac attcgaattg tctcggggct 4560 tcttccttcc actccggctg acgctgcttg tgttgaagcc gggcttcttc cattccgctt 4620 cctcgtcatt ataaccctct gcacgaaagc agccgctatt gccgagagga ccgcgggaaa 4680 ccgcagaaca cgcctcatgg aagaagcaga ccaccagcta gaaaacgcca ccggttcgcg 4740 tctcccacca attgccaaaa cgcactggtg tggcgaacga gactggcgca gctcaaacct 4800 taaattcgac gacagaataa aacaaagttt taaggcaggt gattcatcca tccgattgcg 4860 aaaaaccgtc gccgagatcc ttcggaaaga ctattcagca caccagcgac gatacactga 4920 cggatccctc tccgtacgag gtgttgggat aggtataaca gatgaaaatc tagcaaccag 4980 cctcagcctc cccggacagt gctctatctt ctccgccgaa gcagcggcta ttctatatgc 5040 cgccactgta cctactacca gccctatagt catcatcacc gactcagcga gctgcttcac 5100 agcgctccaa tctgaaaccc ctcgtcaccc gtggatccag gggataataa aaaaggcccc 5160 atgcgacgtt accctaatgt gggttccagg acatagtggt gttcctggta acgcaaccgc 5220 agatttccta gcaggaaccg gcccttctgg cccccgttac agcactaggg taccccttgc 5280 tgatattcga aggtgggtga aatcgaccat acgccagaca tggcaagcag aatgggcgaa 5340 ctcacgaggg gcctacctac gaaaaattaa acgaagcacc gacgcttgga cagacctgaa 5400 ctcgatgaag gaccaaacaa taatctcccg gttgagatcc ggccacacaa ggatgtccca 5460 caattacgga ggaaaccctt tccatcggtc ctgcgagatc tgtgacacgg gtaacaccgt 5520 tgagcatttc atctgcaact gcccagcctt tgatagcccc agacagatgt atgggatttc 5580 tggtagcata cgagaagccc tgcaggacga cctctcttca atggcggcgc ttatctcctt 5640 cattaaggac gccggattat actttaagat ttagatatat cgtccatgta caattcacgt 5700 gatcgtaacc cactagctgt tgttgtccca tcataagcat gttctgcaat agttttaaac 5760 ttgtccgtcc atttgttcgc tttgaatgtt aggcacataa gaagtggctc cttattgtac 5820 ccccagcttt atgttgtgtt aatttgtggc tttttgtcaa ttagttttat ctaaatctca 5880 ctcgatgtaa taactatgtg tccctagact agttacaata tttttgttgt ttttctcagg 5940 ctggcccttc aggcttgtcc tgattttttt taacctgact ggggatgaac ctgcctacgg 6000 cagaaaatcc cttcaataaa aagtaattca attcaattca attcaa 6046 // ID Gypsy-72_CQ-LTR repbase; DNA; INV; 1614 BP. XX AC AAWU01040934; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-72_CQ_; KW Gypsy-72_CQ-I; Gypsy-72_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1614 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 524-524 (2011). XX DR Genome; AAWU01040934; Positions 18286 16673. XX SQ Sequence 1614 BP; 448 A; 369 C; 362 G; 435 T; 0 other; tgtaaccgat gggttccaat tcgatcatta tctgcccaaa tatgaaatat tttgtttttt 60 ttatagacta agtacattac atctcttttc gctataaaat ttgatttgct tgcgtaccga 120 aatcgcagcc cacttcacta tgataaaggt tcatttccaa tgcatgagag tgagttgagc 180 aacagatctt agatttaaaa gggcagttca ttattaccaa taatcactaa ttggttggta 240 aatcatgatt cgactagaaa atcatgtatt aaactaacat tagaattaag gcctcttagc 300 taatggttag gataactaac aaagacgcgc cgctctcatg cacaaccact aggtgtagga 360 agttaggttg ttgcggcgtt ccagtatggc gagattggga aattcaaact ttgcgtttgg 420 acttcgcaac gcataagatg accagcagaa tctatgggag agagagaagt ggacccgctg 480 ggacagacta cctcggagag cggaggatat aaaagggccc cgaagcccta gactcgtcct 540 cttttgctat ctaatagccg acactagtac gatccagatt gtggttcctc gaacgagtta 600 attagccgcg tgtgcgagtt gaattaacaa aaaagttaaa atcaggcgtt aagttgaacg 660 gtaatcacga aatccacccg agtggatagt gtgttgggaa ggtcagcggg gagtgtccga 720 gtacacgtaa agcgcgcgtt tcgacccgga tgatcacctg gtgagagtta gagggacgat 780 cctgtcgcgt atgcctgatt taaaataagt ttaagcggcc tcccacacca aataattgga 840 tcatttggat ccaccccgaa gaactaaagt gcgcgagtga taccgaaaat ctagtgtctc 900 cgttcgaggc cgttaagacc atccatcaac atatggtccc acaatcctgg aaacccttgt 960 tctcactagt gagaatgccc cgccgaccat agccacggtc gtcgacgaaa aacccaggcc 1020 cggcaggcgt ggacgccgcc gttatccgcc atcttgcgac ggacgagtcg ctgatcgagg 1080 tatgtctgtt cccccttgac ttgcttgctc tctctacaat accctcatcc cggccttcag 1140 ctatccggtc gtgtatcgcg acctgcagaa gggccccaac cgcgggacgg ccccaacgtg 1200 catgccatcg tgggattgtt cgcgctccgt tttgttccag ctaaaataaa cccctaaaac 1260 ccatcccaaa cgtgttaaat tctctttttt tgaatcttta atttagatta gttttagttt 1320 taagtgcaaa accgtcgaaa aatcactaag gtttctgctt aaaagtaaag taaagtgcgt 1380 tgaataaacc gtgcgtgtgc gatcgttaat tttgcctttt atatttgctg cacacgagag 1440 agaagacgat tgtgaggttg ataatgatcg ggtaagtgcc aaattatcgt tcgttcccca 1500 agaattgccc tgagaagtag ctagccgtgc aaattagctc ctgttgctca taattagaac 1560 cattttcccc aaaaactcta tttcctacaa agtactagct agtctgtcgt taca 1614 // ID EnSpm-4N1_HM repbase; DNA; INV; 5836 BP. XX AC . XX DT 02-JAN-2009 (Rel. 14.02, Created) DT 02-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-4N1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5836 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 377-377 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 5836 BP; 1975 A; 780 C; 874 G; 2205 T; 2 other; cccagccaac attgacagac gggtaccgta tggtttttgt ctgggcttac gggaacggga 60 ttttgacggg taagtgtttt ggtttccaga cgggtcccat atggtaataa aatcaagcat 120 atttgctcgt tagggtttta tctggttttt atcagggccg aataacctgg ggatttgacg 180 ggtttcccat aaggttccca aaaggtttaa aaaaaatcat aacaaaactt actcgatggt 240 tgccattttt aattcgtatt gtgcgtgaaa taagattaaa aatttaactt ttatgcatta 300 cattttttga atacattata tttacattgt aattaaaaag aatttttttt tgatgtgttt 360 tgtcactgta aaactcagtt aaaaagacgt tgttttaaaa tttcatttat ttattatcat 420 tataaaagaa tttttatata tacatatata tatatacatg tatactattt tttaaatgaa 480 tttacaaagt tcaacatatt tgttgaacat atgttccaga aactattaaa catgtagtaa 540 atgttccatt gagacactgt ttgcattttt aaaatgatca aaatataaag ttttgtttct 600 gctagtcact agcagaaaca aaactttata ttttgatcat ttttaaaatg attttaaaaa 660 tgcaaacagt gcaaactagc agctagtgat ataattagat aataaatggt ttgagtttaa 720 tcaaaatatt agtttagtca gaaacattta aaaagagcaa acagacgaat agaatgtaat 780 ggataggtta tatttaggga attaaaagct tttaaatcgt ttttacagat ccactctttt 840 ccattgattt ttctatattt tatgcaagca aaatgactga tgttaacgaa gtttacccac 900 cgtcagatat aaataaagat ataattagat gtttaaatgc aaaaatatgg ctttattgtt 960 atgcatcaac ctcatttttt aatcattgat attttttgat tttgatttaa aattgtacac 1020 ttgcaatata aacaataatt tcataatata cataataata acttatactt acaataccta 1080 tacttgtaaa tacttgtaat ataatactta tgtttataat actgataaaa atttttattg 1140 atataaactt ttgcattcaa actttagtta taaaacttta atttaactta ataactttta 1200 cataaataac aataactaaa aaaagatgtt agttattata tagattataa ccatatgctt 1260 atttattata attgagttac agtaagttaa acaatataaa atgttaagtt ccaggcaata 1320 ggtcttatat gatatgtatc ttatgtatat cttgtatttt gcagcatgtt aaagaatatt 1380 tctgcatggt gcactctgac tgttgacaat atttgcatat tgatatgagt aaaagtgact 1440 tcaacaacca gaatgtgttt ttgataaata ttaataataa caggttagta aatataatta 1500 tagtaataat ggcaataact ttttattatt agaattagtg agagacttat gtataataga 1560 gctacattaa ctttggggga caaatataat aacttgagta aatattaccg ttgtttataa 1620 cctcgattag agtaatgtaa gtaaatatta tttttatcta aagaatattc ggcaaaagtc 1680 agtatttcct aagtattaaa tattataaaa aaacatgtgt ttaaaaaaaa atttttaaat 1740 gtatcataaa ataaaaaaat aagtatttga gtataaaaaa aaaacacatt tattattata 1800 atgtatatac atcacattaa ctatgttgtc tataaagtag gggttgcagc tgatatttca 1860 gcttcagcta taaactcatt agcttcagat taatgctgaa gctgatcaag tcagtttaat 1920 catcttgata gctgatagta aaagagaata atactccata aatattttat ataaaatata 1980 tttttatata ggaatttatt caatgaaaat ctaaaatagt gacaaaattt aaacatacat 2040 gtaaatttta aataaaattt gccaaaagtt gttgctattt ataaatgttg ccatcagagc 2100 ctaggcctag gacttacaat attaggggac ccaaattatg tttttagaaa tttgtttaaa 2160 aagtggtgcc caaaaatcta attccatttt taattatctt aaggggacca actatctttt 2220 gttcaaaaag agtacacttt gcaagtattt cataggtaaa aaaaaaagga agataaaacg 2280 ataaaaaact ttattttttt ttttaaaaca tatttttcgt cagagtggat tttttttgct 2340 agatgaggat gttcgtctaa taatgtagct gcatccgaac atattaaatt tcatattaaa 2400 tttgactgtt aagacggcat ataatgtttt tacagaaagt tgcaacttct cactgctcca 2460 ataatcattc cttagtgaaa ataccttttc aacagcagca ttagtgcctg gacaacataa 2520 agaaatttct gttaaaactt ttaaattttc atgtgggata tgttctactc taaaatgccg 2580 aaagatttct acccatcttt tttcaagttc tattttctca ttgttccatt tcttcaattt 2640 ttcttcattt aaaaataaat ttactcttcc aatttcttca aaaagaaaat tgtcattaat 2700 aacaatatct tttacttcct ttattacaaa ttcgactgct gaagaaactt ttttccatta 2760 aattttacta tttataaggg tccatttaaa acaattcaat gctctaaagt tcattatcca 2820 ttcttcgata tattcatagc aaattttata aaaattttta acttctttta aaattttttt 2880 ttttttttcc gggattggaa tcttccaaag cggtcagttt ataggcttta atatagcctt 2940 taatattgta tatagtagtg tatgtatata tttttttatt taaatttgcc tttaatttgt 3000 ttttttatat ttttctgcaa aaaaattttt ctgtcacttt atccttctga acttatgtga 3060 ttaagttttt gttttctctt acatattttc tcactcatct tatctcatcc gttcattttc 3120 tctttttttt tttcattttt cttagcgtta ctcttttata tttttgtgtt aattccaact 3180 ttttaaatag tattgttcct tatacttctt tttttattac ttgtattaat gtagttttct 3240 ttctttctta ttatttttta ttttatttgt ttattttatg aaggtgtcgc tccattgact 3300 tgtttttaat atgagtgaca ccgaacataa atttgtaaat tataattaag tttgtaattt 3360 ttgattttat ggttaattaa ccataagcgc atgcgtagta tgtacagtaa ggtttttgtt 3420 atccttttat ttaattaaat aaatggataa aaaactttgc tgtacatact acgcatgcct 3480 caagactgtc aagcatggcg tacatactgc acatcaaaga tcagcctggg gattacatat 3540 gcaaatgttg gatccatctc ttttcacaaa tgtaattttg acattgttag aaaaagaggt 3600 ctgtagtatc tatctatctc tccaggttca taatacggcc tctcaattgt gttgaaaatg 3660 gcagccatat gtgcgttcaa aatttaaatc actgtcaaac ttgtatctgg tgtttgccct 3720 gtcctctgac gtgctggaaa cggctagcaa tatgtgcgca tgaaattcaa atcactgtca 3780 aaataatttt aaatatttaa ttaataaaat tgaaatgaaa taatataaaa aaaaatgttt 3840 tagaaataca aatgagtaca aaattataca aatgagtact ttcagagtac aggatttgaa 3900 aaaagtacgg tttgtgaaaa aagagtaaaa aaactcagta aatgagaata gcaggtcacc 3960 ttaattgtct acataatttt ttggtagatt tcaaagttct tgcaaaagat ggttttatct 4020 atgaaaatat tgagtttaaa atcaatcttg ctgttatcat atgtgatgca ccagctagag 4080 catttgtaaa gtgtattaaa ggacacaatg gttataactg atgtgaatgt tgtgcttaga 4140 aaggagagtg gtgtggcaag ataattctcc tacatatttc actaatttta agcactgatt 4200 cagatttttt gtcattaaaa gcattctgaa catcttagtg gtgtgttaaa ttgaattaaa 4260 ttttgacatg gtgacaaagt ttcctttaga cttaatgcat ctagtatgct taggggtaat 4320 gcgaagattt ctcaacttat ggttaaattg cccaaatgtt gtaaattgtc tcagattaca 4380 gtcaatactt tatctggcgg attgttccag atacgtcatt acattccaag aaatttttat 4440 agaaaatcaa ggtcttaatt agatattaaa aaatggaaaa caactgagct tagactattt 4500 atagtawaca ctggaccagt tgtttaaagg gaatggttta gccaaaaatc tattcaattt 4560 tttagatttt tcagttgtag ttctactttc ataatgtttc attttgttga aaaaatatgt 4620 taattttgct tttcagctac ttatttttat atgtgaactt tatcataagg atcaactagt 4680 gtacaatgtc cattcattag cacatatagg aaataataca gttaaatttt aggtacttga 4740 tagatgtccc tctttcaaat acaaaacttt ttcaggtcaa ttaaaaaatg tttgtctagg 4800 gggctcattt gcaaatttag gggccctttt acaaatgtaa ggattttttt ttttttttga 4860 tttttattta gattttgttt atctattgga aaagttttta ggaattttgg ggcccttatg 4920 acaggttttt ttttggaatg ggggacttgt atctagtagc cacggcactg tatataaggt 4980 atatacaata tatatcaaat tatatacaat ataaataaaa aaatatattt atttaataat 5040 tttatataat ttatttatca gtaactaaag gaatgttatc aatttattat atgaatcata 5100 agcatgtgct tatctattac aagtaccttt gatggtgaaa attacatttt ttatctgaag 5160 agagttggga gaaatggtaa gagttttata acatattaaa attaatgtat gtgttaccaa 5220 taaaatttaa aactaacagt taaacttaat tttttacagt tcatattgaa acgagatatt 5280 gtgtaaagag gttttcaaag aagtcgtttt aacaacaatg gaaaagcttc cgattcccga 5340 gataaacgtg aaagcatttg atacatgatt aaagaatcgt tcaaaagttt gcttaatttt 5400 gacagtgtaa ttttaattac tatttttttt gkaaataaac caatcaattt atatataaat 5460 ttgctttttc tgtaaatttt aaacatttaa acgtagtttt gcaagatttg caatgggtaa 5520 acctcgtggg accctcacgg gatacccggc gggcataccc atatgggccc cagaagaaaa 5580 ccgcggccat gatccggcgg gttcccgatg ggtttcccat atgggtccca tttgcaaacc 5640 caaatgggtc ccttacggtt ttaaagtacg ggatttgact gggacccata cgggataccc 5700 ggcgggctaa cccatatggg tcccatatgg aaaccgcggt catgatccgg cgggataccg 5760 ttggtcttcc catacgggac ccagttgacg acccgtatgg gccccgtacg gtacccgtat 5820 gtcaatgttg gctggg 5836 // ID Gypsy-141_AA-LTR repbase; DNA; INV; 183 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-141_AA_; KW Gypsy-141_AA-I; Gypsy-141_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-183 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1016-1016 (2011). XX DR [2] (Consensus) XX SQ Sequence 183 BP; 57 A; 36 C; 35 G; 55 T; 0 other; tgtagagatt gatattatat agcaagctat gttttaattt agtacttagc aacgatgaat 60 aaacaagcta ggtgaggatt cctactctta ccgtactggt gaacggtcac acagaaatac 120 tgtctttcat tccggagaat tggccaggat cttgatccct aagaattacc cttgaaccct 180 aca 183 // ID P-1_Hrobusta repbase; DNA; INV; 5838 BP. XX AC . XX DT 10-MAY-2011 (Rel. 16.05, Created) DT 10-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW P; DNA transposon; Transposable Element; P-1_Hrobusta. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-5838 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5838 BP; 2157 A; 770 C; 792 G; 2119 T; 0 other; catggatata tataatatgg atatggcaga agcgttatat atacatacat ttttcatata 60 tatatataac atcaaggtaa atctaacaaa tttgatgcat agaatagcaa taattgttta 120 aaagtgaatt taagttaaag gaagtaatca gtttttaacc gatcatcgtc tgtcattttc 180 ttatttgttt aattatttta tatacattaa gtttaaccat gcccatgtgc tcaatatcca 240 aatgtcatgg gtcttataaa gatatcggcg taactttaca taggtatgta taattaatat 300 aataatagta aaattttaac ataaaatagt taagcttttt ttacaataat aaaaccaaac 360 cagactacca aaagatgaca gtcaaagaat acaatgggaa gggaatataa aaaatcatat 420 tacaaatttt tgtagtagaa aaaatatctt catctgcagc aggcattttt tacaagaatg 480 cttttacaaa gtaggttcta aaatgttgct caaaaaaaca tcagttccta cagtatttcc 540 aaaggttttg ttctttttgt aactaataaa aaatttctaa tttaaatagt ttaaaaaaat 600 taaactaaca ttaagattaa cattaaaata aagaaataac taattattgg tgcatcatga 660 aaaatgaatc actaattaat taattgcaag acactaacat attatagtta gtatattaat 720 tagatatttg ttcctgcaaa tcacataatc aagtatattg aatatttatc attttcagat 780 tttatcaaat gatcaagcag atactctatc tgaaatgcaa tcaccaataa tgatggagaa 840 aaaatcaact gatgaaccag tatgtttgtt taaatataaa atttacttag attaatatat 900 tactgattgc atgcgctaca attaatattt atttttttta ttttatgtca ataagaagat 960 acacatatgt aagtacatgt tcaaggtatt tatttcacaa agtttatgat gaatctacca 1020 cacatacttt gatgcataca ttcattcatg tgtaatattc aaacttcaga tagaaaatgt 1080 tgcatcttta gaaaatgttg cagttccatc aacatcaatt gttaaattgg tttgtatgta 1140 tgatggttca tttaataaat aaaaaatagt gaaaaatgta atttagtatt ttaacacatg 1200 tttctataac tataaccaga attataaatg attgttaaaa tacttaatat aagttgttga 1260 aaggaaaatg aaaaatttta cagactggta ttgatgcatt gacacctaga agcttgaaaa 1320 ggaaagctaa aaaattagat atatcattaa aaactaccag aaaagttcta tataactaca 1380 acaaaaaaat taaaagatta aaagtcaaac ttgaagtata tagaaatgaa attaagatgt 1440 tgaagaaata caacgatgaa cttacctcaa cattactttc atattcaggt atgctaatta 1500 ttaaacttta aaaataattt ttagcaacaa acctttacgt atgccacttt ttttaatttt 1560 taatattgaa ttggcatttc agatattcca acaaaacttt tcatgaaacc gaacaaaacc 1620 tatactgaag aacaaaagcg ttttgcattg acattacatt tgtacggtgg aaaaagctac 1680 cagtttttga aatgtaaagg attacatctg ccacataaac gtacattagc acggtaagtt 1740 attgtttaat attatatata taattacaaa ttaaaaattt aagtatttaa atgtattaag 1800 gtggttagaa tctgtagatg gaggaccagg cctcaatcgt tctatgttga atttcctgaa 1860 gaaaaaacac gaaacagatg caaaacaatt tacacactgt tctttgatga tagatagtat 1920 gtcaattcgg gaacaactga tatatttaaa acataaaggt gtctattccg gttatgaaca 1980 ttttggtgga gactacacat ccaatgaatt ggcaacacag gccctggttt taatgatagt 2040 tggtacgtca ggttcatgga agtttcccct tgcttacttt ttgttcaaaa gcatgtcagc 2100 aactacgcaa caagttcttg tattgaacgc acttgaagct ttggaaaaga ttggattgaa 2160 agtataattc ttgttaaaac tttttacaat ttctgttttg ttttttagat tttttaaagt 2220 tattttatca atgttacctg aatgttatta ttgttcaaat acaggtagtg acgctggtga 2280 tggatggctt acgaacaaac attgttatgt gcataaaact tggatgtgtc tttgatttgc 2340 ctacctttgt caaaccatct tttaaattac catcaactaa tagagaaatc tatgtaatat 2400 ttgatgcctg tcacatgttg aaacttgtcc gtaacacatg ggctgataaa aaaattttag 2460 aatcaaatgg tggtttaatt agatgggaat acatatgcaa attacattct gcacagtaca 2520 cgactggctt caaattggct aacaaattaa ctgaccaaca tgttaattat cataatcaga 2580 aaatgaaggt accatttagt ataattcttc ctttctgtac atattcaaca atatcttttg 2640 agtatgttca tgtattaaaa aaataaatca ccatacaaat agattttata attataaaaa 2700 tgacttacat aatttcaagt taaattagtg gctcacatat tgagtaattc ggtggccaaa 2760 gccttaagat gttcaagtaa ctgtgaagaa tttagagatt gtggagcaac tgtacaattc 2820 ctggaaaaaa tggatatatt gtttgatata atgatataat gaatagtaag aacatttaca 2880 gtaaatattt cagacgtggt atatcaatga catcatttaa tgacatagca tcaaacctaa 2940 tgtcactgaa agagtatttg ttatcactaa aaactgtcga tggaaaattt atatatcaaa 3000 ccaataggta aacatatatc ttacagtgtt tgatgaaaat gtactaacca attgttatat 3060 ttttaaaact caaggaatac atgtgttctt ggaatggttg tcaacataga aaccataata 3120 aatttagcca catccttgct tcaaacacaa aaatatgttc tgacatacaa attcagccaa 3180 gaccatcttg agttattttt taatttcatt cgaagtgtag gttagaaaat gttcaaaagt 3240 ggttatttaa caaaagtgac ttcaatttta tatatttaaa tttcatgtat ttagggagaa 3300 gcaatgataa tccaaatgtc cagcaactgc aatctatttt caaaaaagtg tactatagat 3360 gtggtattgt ccctggaaaa acaggttatt caaatatgtt ctatttatac agttgcttta 3420 aatgaggcta tacatatgtt gttttataat tttttaaggg aatgtacagg aacaaatgat 3480 tgttgaatca gaatttaaag ttccagagga tcaatttaaa gatatttttg aagagacgta 3540 tttagattca caaataatga catctaatac catgtccact gtcattgaaa atccatctaa 3600 caccatgtcc actatcatcg acagtgcagt agcgtatatt gctggatggg ttgttagaaa 3660 agttgccaaa acaatcgatt gtaacaagtg tagatttaca cttattgata taaaaggaca 3720 agctaccatg ttacaccaac agcttctgat actaaaagac aatggtggtc tagttttccc 3780 atcctgtggt gttgttcagg tcagaaaata atttttttcc atgtttattt ttatttgcat 3840 gctggaaaaa tattgatctt taatttcttc atttttatat aacatattgc ctaatatata 3900 ttttttgact aattcatatt tgtcaatcag atttgccgtg tcaccgaaag acatttgaaa 3960 atggtgacag actggtctaa aatgaataga acaattgttg tatcaagtgt tttgagtgag 4020 ttggcaaata aagatctatt aaaccttaaa gatcatatca ttgaaagtgc agataatata 4080 aacaatcatt ttttttcttt attgagatta ataacaaaaa catatataga tgttcgctct 4140 tttcatttgg ccaaaaacaa tagtcagttg aaaattcgtt caaaatatca taaactaatt 4200 cactttaagg gccagtgaaa ttttttattt atacttacag agaatgattt atttattttt 4260 tattttaatt tctattaaac atctggtttt tgtgactttt ttgcatttct aaatctctct 4320 tttttaatat ttttttgtta atttttaaac atttaaatat ttcttaattg aaatctgtga 4380 ttattgttat taattttaaa acgcttttac tgagagtcat attattatat tggtacatct 4440 agtcaacttc acctattaat tgaacatttt tcacagcaac attaaacaat ttccagcatt 4500 tcttgtcatt ttttaatctt aatttctttt ttatttaacc ccttttttaa aaacataaat 4560 aggtatgggc atttattaat ttataattta tcagctatta tacaaaagtt tcaatgcttt 4620 cagataaatg taagacgtta tttttatttt taggaccatt tagatattga taagtttaaa 4680 tctaagtatt gtcaaaagaa tatgatgtat tatgtatttt acaaattttt tatctgcaaa 4740 tttaacaaac atttatttca ttaaagatgt aaatattttt gtaaaatgtt ttgaaaataa 4800 ttttctcaat aataattatt ttttattacg ctgaacttca ctgcacccca acagtatttg 4860 agcataataa attagcaata ataattttat acatatacat attttttata ttatgaaata 4920 taataatgtt taattatact aaatagttga ttgattttaa ttaaattcca tgtcgatcag 4980 tttcaactaa gttcattata ttaatagtat ttgatgcagg gtcagtaatt tttttgtttt 5040 tgccatttac gttttttaaa atttatgtct tcatatatca cttagtccat ttattttctt 5100 tttttcaatg aacaacttaa caatttattg gaacatattg tgccttgatt tttagcacca 5160 ctacatgtac atataaaata acggttagcc tctagcttct agcatcatcc aaaagatgtt 5220 tcattttaga aaaattgaac aaaatgtata aatgtcaata ttaaaataat tatgaacatc 5280 catttatttg ccaaaatcct gaatcataat gactaatttg tcattaaatt ctgaattaaa 5340 cctttatttt ataacatttg ttcagatttt aaaaaaattt ataatttgat gacattgaag 5400 ataactgcac ttgttattaa tccaggtgga tgaatgtatg tatcaaagtg tatgtgtagt 5460 agattcatcc aaaagatgtt ttatttccag gacaattgag caaaatgtat gaatgtcaaa 5520 tattgaaata aatatgaaca tccttacatt ttgccataaa ttttccggaa tcaaaatcta 5580 aaatcatttt attttcaaca aacaacctct atttaataac taagtcggta gagacgaaaa 5640 aagcgactta tttaaatgtt atgatataaa aaaacgaaaa cttcgtgtag gcgattaaat 5700 ttaaatttta aaacagtaaa acgaaatgtc aactacaatg aagaaatgtt tacgctcaaa 5760 ttatgttctc tattccatag aactaaaaaa aattattagc cgggaattaa ttcttgtgcc 5820 atatccacta tatccatg 5838 // ID CR1-89_AAe repbase; DNA; INV; 4416 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-89_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4416 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1177-1177 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 302..1123 FT /product="CR1-89_AAe_1p" FT /translation="MDKKHCGECSLEINELEPIRCGFCDTYFHISQQCCGF FT NNRANRDLFSNGKAMFICPKCRDILNGRSVCSFLKESLGPQSSSPMDLNLL FT SNQVQKLSSLVESLSKQVENIVNERQAAIAVSTPSWPKTGVKRRRGNNGQT FT VQATVERGTSTIDLSDLSVPFIVPPPQPPKFWLYLSGFQPLIKVEDVQKIV FT ARCLNVFDPFDVVRLVAKDADIAKLTFVSFKIGLNPDHRELALSASTWPDG FT LLFREFDDQSNKRRPGRVPVETPFDSVMNANQP" FT CDS 1087..4332 FT /product="CR1-89_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="DPFRQRYECESTIGRYFEDGGDQPVPTRGEYLILTSD FT DPAPNFYPLQFTDGDGVAFNNPPLDERNHVDRGPHAIADVGRDIHTTRHLE FT DGGGASTSVRGEYLSINSFGNGSSSVQPFQPSQHTRQMHVYFQNVRGLRTK FT IDELFVAVNGCDYDVIVLVETNLDATVTSAQLFGENYAVYRNDRNTNSSHK FT KSGGGVLIAIHRKLHSSSVQCVVDDLELVLARIRTAESSLFICAGYIPPEM FT RSNIVFVKYFSDAIRDSLDCACDDDSIIVCGDFNQANLVWEQSEPFYVTVE FT PSSVGPASAGLLDNMAMLNLQQYCRVSNPWNHTLDLVFANDDSCVVTEAAV FT ALVKIDRPHPPLEIALFSTDSEPLFNHEDVPQLDFKRIDFTLLHGFLARLD FT WTEILTCSDVNDAVRIFSDIVMNWLSSNVPKMRTPAKPPWGNARLRSLKRA FT KNSCQRHYRKQRTPLHKRRFQDASDAYKSLNEHLYAQYVDRMQDGLRRYPK FT RFWNFVNSKRKQTDGVPASVSFVDEVASTAPEKCELYAKHFSSVFNADCAT FT VNDAMRASSTVPENVCDIGLPLVTDQVLRKAAKKMKSSITPGPDGIPVIVF FT KKCIDVLTVPLCYIFNLSLQQKQFPAKWKKSFMFPVYKTGDKSDVRNYRGI FT TSLCAGSKLFETVINDYLFAKTKAFISSDQHGFYPGRSVTTNLLRFTSSCI FT NQLEDGKQVDAVYTDLKAAFDKIDHRILLHKLARMGASTAMVEWLESYLCD FT RKLCVKIGSYTSAWFKNLSGVPQGSNLGPLLFALYFNDVAALLGVGCKLIY FT ADDLKIYVVVESLADCERLQSLLNTFVRWCELNRMTLSINKCSVITFHRSA FT VPLRMDYTIGDTTIQRVSHIRDLGVLLDSRLTFNDHRSEIIDRANRQLGFV FT MRTTKNFTNVHCLKALYTSLVRSILESSSIVWCPYQANWIDRIERIQKRFV FT RYALRLLPWRDRSNLPPYADRCCLLGLETLEQRRITQQAVTAAKVLNGEID FT CPNLLAKLALNAPNRIQRRSLERMLAPQFHRSLFGYNEPLSSVMRAFNDVA FT YLFDFGMSSTVFRRKILDSMRARFV" XX SQ Sequence 4416 BP; 1159 A; 1038 C; 994 G; 1225 T; 0 other; tctggcatca ctgctgtata ttgttgattg tgttttttat actgatattt tttattgatt 60 tttactgttt ttttcgtgtt aaatctccgt tcgcgtttgt gatctgtaag tgttacatcc 120 tggagaacgt taggatcttg gagtaccccg attattgcaa tcctgagaga ttccggaacc 180 tggttttcat cgtagttctt cggttcacta cattgctttt ctacgaccgt cagcaccagc 240 tacaaacccc tatagaccac ttcgtttgat actatcacca actgaaagtg tacaaagcag 300 aatggataaa aaacattgcg gggaatgcag cctggaaatc aatgagctcg agccgatacg 360 atgtggcttc tgtgatacat acttccacat aagccagcaa tgttgcggtt tcaacaatcg 420 cgcaaacaga gacctgtttt cgaatgggaa ggcaatgttt atctgtccca aatgccggga 480 tatattgaac ggacgtagtg tttgctcgtt tctcaaagag tcacttggcc ctcaatcatc 540 gtcacctatg gatctaaatt tgctttcaaa tcaagtgcag aagctctcta gtctggttga 600 atctttgagt aagcaagtcg agaacatcgt taacgagcgc caggcagcta tagcggttag 660 tacacccagc tggccaaaaa ccggtgttaa acgacgtcgc gggaataatg gccagacagt 720 acaggccacc gtcgagcgcg gaacgagcac aattgacttg tcggacttgt cggttccatt 780 catcgtgccc cctccgcaac ccccgaaatt ctggctctac ctatcgggat tccaaccgtt 840 gattaaggta gaagatgtgc agaaaattgt tgctcgctgc ttgaatgttt tcgacccctt 900 cgatgttgtt cgtctggttg ccaaggatgc agatatcgca aagctgactt tcgtctcgtt 960 caaaatcgga ttgaatcctg accaccgtga actggctcta agcgcttcaa cctggccgga 1020 cggacttctg ttcagggaat tcgatgatca atctaataag cgccgtcctg gaagggttcc 1080 tgttgagacc cctttcgaca gcgttatgaa tgcgaatcaa ccataggacg ttattttgag 1140 gacggtgggg accaaccggt ccccaccaga ggcgagtatc ttattttaac ttctgacgat 1200 cccgcaccta atttctaccc gttgcagttt actgacggtg acggggtagc ttttaataat 1260 ccgccacttg acgaacgcaa ccacgttgat cgaggcccac atgcaatagc cgacgttggg 1320 agagatatcc acacaacgcg tcatttggag gacggcggag gtgcgagcac ctccgtcaga 1380 ggcgagtact tgtctatcaa tagtttcggt aatgggtcat cgtctgtgca accgtttcag 1440 ccctcgcagc acactcgtca aatgcacgtg tatttccaaa acgtcagagg tttgcgcaca 1500 aagatcgacg agctgtttgt agcagttaat ggatgcgact acgatgtcat tgttcttgtg 1560 gagaccaatc tcgatgcaac tgtgacgtct gcccaactgt ttggtgaaaa ctatgctgtc 1620 tacaggaatg atcgcaatac aaatagcagc cacaagaagt ccggaggagg tgtccttatc 1680 gctattcatc gcaaacttca ctcttcttca gttcagtgtg tcgtggatga tcttgagctc 1740 gtactggccc gtatccgaac agcagaatcg tctctgttca tctgcgctgg atacattcct 1800 ccagaaatgc gttcaaacat tgtgtttgta aaatacttct ccgatgctat cagggacagc 1860 cttgactgcg catgcgatga tgattcaatt attgtttgtg gcgattttaa tcaagctaac 1920 ctggtgtggg aacaaagtga gccgttctat gttactgttg aacccagcag tgttggaccg 1980 gcgagtgcag gtttacttga caacatggca atgttaaact tgcagcagta ttgccgagta 2040 tcaaatccct ggaatcatac cctggattta gttttcgcca acgacgactc atgcgttgta 2100 actgaagccg ctgtcgctct ggtcaaaatt gatcgcccgc atcctccatt agaaattgcg 2160 cttttttcta ctgatagcga gcccttattc aatcacgaag atgttcctca attggacttc 2220 aaacgcatcg actttacatt gctacacggc tttttagcgc gtctagactg gaccgaaatt 2280 ctcacctgct ctgatgtgaa cgatgctgta agaatttttt ccgatattgt gatgaattgg 2340 ctttcgtcaa acgttccaaa gatgcgtaca cctgcaaagc caccatgggg gaatgctcga 2400 ctccgttcgc taaagcgcgc gaaaaactca tgtcaacgcc actaccgcaa gcagcggacg 2460 ccacttcata agcggcgttt tcaagatgca agtgatgcat acaaatcgct caacgaacat 2520 ctttacgcgc agtatgttga tagaatgcaa gacggcctcc gtcgatatcc aaaacgtttt 2580 tggaatttcg ttaactctaa gcgcaagcaa accgatggtg tccctgcctc ggtatctttt 2640 gtcgacgaag tagcatcgac tgcaccggaa aaatgcgaac tctacgctaa gcatttttca 2700 agtgttttta atgctgattg tgcaactgtt aatgatgcga tgcgtgctag ctccacagtg 2760 ccagaaaacg tatgcgacat tggtttgcca ctcgttactg atcaagtcct acgaaaagct 2820 gcgaagaaaa tgaaatcgtc tatcactcct gggcccgatg ggattcctgt tatcgtcttc 2880 aagaaatgca tcgacgtcct aaccgttccg ttgtgttaca tcttcaacct ctcgctacag 2940 caaaagcaat ttcctgctaa gtggaaaaaa tccttcatgt tcccagtcta caaaaccggt 3000 gacaaaagcg atgttcggaa ctaccgagga ataacttcat tatgcgctgg ctccaaactg 3060 tttgagacag tcatcaatga ttaccttttt gccaaaacaa aagcgttcat ctccagcgac 3120 cagcatggtt tttaccctgg caggtcagta acaacaaatc ttcttcgttt tacctcaagt 3180 tgcatcaacc agctagagga tgggaaacaa gtagatgccg tttacaccga tttgaaggcc 3240 gcctttgaca aaatcgatca tcgtatccta ttgcacaaat tagctaggat gggtgcgtcg 3300 accgctatgg tagagtggtt ggaatcgtac ctctgcgaca ggaagctctg cgtcaagatt 3360 ggatcgtata cttccgcttg gtttaaaaac ttatctggcg taccgcaagg gagtaatctt 3420 ggaccgcttc tcttcgcact ctacttcaac gatgttgcgg ctctactggg tgtcgggtgt 3480 aagttgatat atgccgatga cctgaaaata tatgtcgtcg ttgaaagcct agcagactgc 3540 gaacggctgc agtcgttact gaacactttc gtccgttggt gtgaactaaa tcgcatgacg 3600 ctcagcatca ataagtgttc cgtgatcacc ttccaccgct ccgcagtacc acttcgtatg 3660 gactacacaa ttggcgatac aacaattcag cgtgttagcc acataagaga tttgggtgta 3720 ttactcgact ctagactcac cttcaacgat catcgttcgg aaatcattga tcgagcaaat 3780 cgacagctgg ggtttgttat gcggacgacc aaaaatttta ctaatgtcca ttgtttgaaa 3840 gcactctaca cctcgcttgt tcgttcaatt ttggaatcct cttccattgt atggtgtccg 3900 tatcaggcga actggattga caggattgaa cgtattcaaa aacggttcgt gcgctatgct 3960 ctacggttgc ttccgtggag ggaccgttca aatcttccac catacgctga ccggtgttgc 4020 ttgcttgggt tggaaaccct tgaacaacgt cgaatcaccc aacaagctgt tacagccgcc 4080 aaggtattga acggagagat agactgtccg aatctcctag caaagctagc tctgaacgct 4140 ccaaacagga ttcaacgcag gagtctggaa aggatgcttg ctccacagtt ccaccgttcg 4200 cttttcgggt acaacgaacc attgtcttcc gtaatgagag cattcaacga tgtagcgtac 4260 ctatttgatt ttggaatgtc gtcaacagtg ttcagacgaa aaattttaga ttcgatgaga 4320 gctagatttg tttaaattag tacagtaaca ttcattgaga cttatgtcag atgaattagg 4380 aaaataaata ataaaatata aaaaataata ataaaa 4416 // ID BEL-207_AA-I repbase; DNA; INV; 6257 BP. XX AC AAGE02025367; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-207_AA_; KW BEL-207_AA-LTR; BEL-207_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6257 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025367; Positions 100838 107094. XX CC Positions [4949-5533] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 576..3812 FT /product="BEL-207_AA-I_2p" FT /translation="MPDKKSEKKVADLQIVRKGLLVTCNGVEQFVKNFDED FT RDSCQIPVRLESLDRVYRDFLKVQGEIEKYDAAEHLDEHLCERAAFETRYC FT VAKGFLLSKRAVDLNQTMFDATVHHPPAAHGNFHLRLPQIDLPKFSGDFSK FT WLSFRDTYTSMVHSNADIPTVAKLQYLIQSLEGEARKPFESVDIEADNYAA FT VWDALLKRYDNRRFLKKQLFRSIYDLPTIKKESADNLHELVDEFQRHVKAL FT AKLNEPVEHWDTPLVNLLSYKLDSSTLRAWEEKTSHQEDVQYNDLVEFLYQ FT RVRIIKSVASDMSQRSQPVQAKVAGSNSSQRSQFSSKFVASAVSSDTKSNV FT PSCFACSERHFLFQCQTFAKMPVSQRRELISQRKLCWNCFRTGHQARSCTS FT KFSCRTCRDRHHTLLHDPSQVKSSSVPAATSKSHEAMSVPSTFATPGPSST FT SQQVSMSVQSQVSTVLLQTVALHVIDVHGKSFEARALLDSGSMSNFMSTRL FT ANLLAIPQSEADVSVAGIGQSHCKLKRAVNATVRSRIGTFSTKLQFLVIDH FT PTANLPTIHVKISSWNLPKVDLADPQFHVPNKIDLVIGGEVYWDLHTGKKR FT SLGPGLPYLIETHFGWTLCGSTSQDSSGSVACQLSTADALLDTTLQRFWEI FT ETIPKQPVHSNSEKACEEFYAATTTRDEVGRYIVRLPRTDNPDIASGESDS FT IAERRFLGLERRLDRDANVKSAYHQFMEEYERLGHMKRLDEPVNNAIPHCY FT LPHHPVFKMSSTTTKTRVVFDASCKTTSGHSLNDILLVGPVVQDDLLSLVI FT RFCFHRIALSGDIEKMYRQVRLHDDDQPYQRIKWRSDASEPIASYQLQTVT FT YGTASAPYLATKTLQRLASDAGNNFPSAAEPVSKDFYVDDLLSGADDVDSA FT IRIRREVSAMLDSAGLPMKKWASNSVEVLEDIPPEDRSVQPWQDLQDPQSV FT STLGLIWEPGSDIFRFKVQLPLPASVLTKRKIMSYIAQIFDPLGLVGPTVT FT KAKLFMQRLWALKTPNSERYEWDQPLPPKLQQEWKEFHTTIDLLREVRIPR FT FASVVDAVSIQLHFFADASDKA" FT CDS 4319..5923 FT /product="BEL-207_AA-I_1p" FT /translation="MTSVQITFAEELFSRYSKYSKLRRTIAWSLRYLQGLR FT DRAIARRRNPTSLNESPKLSTHCQPLNSDELHSADRILCRLAQRESFPEEH FT SDLSTGVLVSKSSPLKWLKAYVDEFGLIRVGGRLNNAELPENTKHPIVLSA FT KHPLARLLTMHCHKTLLHAGPQLILASLRQRFWILGGRSLARNVYHQCITC FT FRSKPKLIQQAIADLPASRVSPTRPFSVCGVDYCGPLFVKSPVRKRGPTKV FT YVAIFICFSTRAVHVELVSDLTSAAFLAALRRLVARRGKISELHSDNATTF FT KGASHALHRVYQMLKLDDTDRNRILSWCADNEIRWKFIPPRAPHFGGLWEA FT AVKSAKTHLLKEIGNTSLSYEDMLTLLAQVEMCLNSRPLTPISSEPSDLEA FT LTPGHFLVGSNLQSVPEATITDVSENRLDHFEQTQRHLQRIWARWYPEYLQ FT QLQSRAVQGCKAPSGIEIGQLVVVKEDCLPPAQWPLGKIVKVHPGKDGIVR FT VVTLKTASSDNVMRPVARIALLPIRTESSPSDTPVGEHQ" XX SQ Sequence 6257 BP; 1520 A; 1771 C; 1504 G; 1462 T; 0 other; tttggtcctt cgagccggat cgaaggatac cccggttaga aggtacggtg tgtgctaaag 60 tgcaagtttt ctggtccgag tgaccacctg aaagtcggca aggaggccgt gaaatttcgt 120 cattggtggc tagtgattgc gccaagtgcg gtgtggaatt ctgtgcgcgc cagattccat 180 ttgttcgacc aagtgaatag ccatcattcc attttgtggc tggtcattgt acgatccccg 240 tggggaaaga atagtgtgtg gattggcgcc tagaaggtat tgtgactgag cgtggcttgg 300 gtagaggcaa agtgtgctgt ttttaggctg gtcgaattgg ttgaagtgcg gcaaaaatta 360 gaagtgtggt tgcaatttaa ttgcgtaaaa tactccttgt tggcggcaag ggagtggtga 420 tagggagcca cagtgaccca agtcacagtg cttgcatcga tccgcgtcgt aatccgcgta 480 atttcgtccg cgtcgggaag gagaaaatct ttgcgtccgc gtcgggaagg agcaaaaccg 540 ttttcgtttc cgcgatcgtt tagatcggtg caaaaatgcc ggacaagaaa agtgagaaga 600 aggtggccga tttgcaaatc gttcgaaagg gcctgttagt gacttgtaac ggcgtcgaac 660 agttcgtgaa aaacttcgac gaagaccgtg atagttgcca gattccggtt cggcttgaat 720 cactagaccg agtgtaccgg gactttttga aagtgcaagg tgaaattgag aagtacgatg 780 cggctgaaca tttagacgag catttgtgtg aaagagccgc cttcgaaacc cgttattgtg 840 tggcgaaagg ttttctgctg tccaaacgag cagtggacct caaccaaact atgttcgatg 900 cgacggttca ccacccacca gctgcgcatg gcaacttcca cctccggttg ccacagatag 960 atcttcccaa gttttccggg gatttctcga agtggctgtc ctttcgagac acctacacat 1020 cgatggtgca ctcgaatgct gatattccca ccgttgcgaa actgcagtac ctgatccagt 1080 ccctcgaagg cgaggcacgc aagccgttcg agagcgtaga catcgaggcc gacaactacg 1140 ctgccgtgtg ggacgcccta ctgaaaaggt atgataaccg acgttttctt aaaaagcagc 1200 ttttccgtag catttacgac ctcccgacga tcaagaagga gtccgctgac aacctccatg 1260 agcttgttga cgagtttcag cgacatgtga aggccctcgc taagctgaat gagcccgtcg 1320 agcattggga tacgcctcta gtgaatcttt tgtcctacaa gctcgattcc tcaactctcc 1380 gtgcctggga ggaaaagaca agccatcaag aagacgtcca gtacaacgat ctcgtcgagt 1440 ttctgtacca gcgggtacga atcatcaaat ccgtagcatc cgatatgtcg cagcgttccc 1500 aaccggttca agcaaaggtg gccggcagta attcatccca gagaagccag ttttcctcca 1560 agttcgtcgc tagtgcagtt tcatccgata ccaagtcgaa cgtcccatcc tgtttcgctt 1620 gttccgagcg gcatttcctg ttccagtgcc aaacttttgc caagatgccc gtcagccaga 1680 ggagagagct gatttcccaa agaaagctgt gttggaactg tttccgaacc ggtcaccagg 1740 ccagaagttg cacatcaaaa ttctcctgcc gaacatgtcg cgatcggcac catacgttgc 1800 tccacgatcc atcccaagtc aagagttcat cagtcccagc tgcaacatcg aaatcccatg 1860 aagcgatgtc tgttccatcc acgtttgcta ctcccggacc gtcttccaca tcccaacaag 1920 taagcatgtc ggtccaatcc caggtcagca ccgttctact acagacagtc gctcttcacg 1980 tcatcgatgt tcatggcaag tctttcgaag ccagagctct actagattcc gggtcgatgt 2040 cgaacttcat gtcgacaagg ttggccaatc tgcttgctat tccccagtcc gaagccgatg 2100 tgtccgttgc aggaatcggc caatcccatt gtaagttgaa gcgagcagta aacgccacag 2160 tccggtctcg aattggaaca ttctccacca aactccaatt cctcgtcatc gatcatccca 2220 ctgccaattt gcccaccatc catgtcaaga tttcgtcgtg gaaccttccg aaagtcgatt 2280 tagccgatcc gcagttccat gttccaaaca agatcgacct tgtcatcggc ggtgaagtat 2340 actgggatct acacaccggt aaaaagcgtt ctttaggtcc aggtctaccc tatctcatcg 2400 agacgcactt tggttggacc ttgtgtggtt cgacatccca agattccagt ggttcggtag 2460 cctgccagtt gtctactgcc gatgccctgt tagacaccac ccttcagagg ttttgggaga 2520 tcgagacgat tcccaaacaa ccagtgcact ccaattccga aaaggcttgc gaagagttct 2580 acgctgcaac cacgaccaga gatgaagtag ggcgatatat tgttcgcctc cccagaaccg 2640 acaacccgga tatcgcctca ggagagtcag attctattgc cgaacgccga ttccttggtt 2700 tagaacgccg cctcgatcga gatgccaacg ttaaatccgc ctaccaccag ttcatggagg 2760 aatacgagcg tctcgggcac atgaaacggc tcgacgaacc ggtcaacaat gcaatccctc 2820 attgctatct gccgcaccac ccggtcttca agatgtcaag caccaccacg aagacaaggg 2880 tagtatttga tgcctcgtgt aaaacgacat cgggtcattc cctcaacgac atcctccttg 2940 ttgggccggt cgtgcaggac gacctgctgt cactcgtcat tcgtttctgc ttccaccgca 3000 tcgccctgag tggcgatatc gagaagatgt accgtcaagt tcgcctccac gatgacgatc 3060 aaccctacca gcgaataaaa tggcgaagtg atgcgtcgga acccatagct tcgtaccagt 3120 tgcaaaccgt aacttacggt acggcctccg ccccctacct cgccaccaaa actctccaaa 3180 gattggcgag tgatgcgggc aataattttc cctccgctgc agagccagtc tccaaggatt 3240 tttatgtcga cgatttgctg tctggcgccg acgacgtcga tagtgccatc cgtattcgac 3300 gtgaagtttc cgctatgctc gactctgcag gattaccaat gaagaagtgg gcttcgaact 3360 ccgtcgaagt ccttgaagac atccctcccg aagaccgatc agtgcagcca tggcaagacc 3420 tccaagatcc gcagtccgtt agcacacttg ggctcatttg ggaaccggga agcgacatat 3480 tccgattcaa ggttcagtta cctctcccag cttccgttct caccaagcgt aagattatgt 3540 cctacatagc tcaaatcttc gacccactag ggctagtagg accaaccgta acgaaggcga 3600 agcttttcat gcaacgcttg tgggcattga aaactccaaa cagcgagcgt tacgaatggg 3660 accaaccgct acctccgaag ctgcagcaag agtggaaaga gttccacact acaatcgatc 3720 tcctccgtga agtgagaata ccacgatttg catctgtcgt cgacgcagtc agcatccagc 3780 tccatttctt cgctgatgcg tctgataaag cctaggaacg tgttgctacg ttcgcgccca 3840 atctgctcaa gaagtgtccg tgaaactcct agtgtcgaag tcgaaacttt cgcccctttc 3900 agctcgccac actatagcaa gactggagct gtgcgcagcc catctatcca cgcagctgta 3960 caagaaggtc gcttcgtcac tcagaaacgt cccgcctgcc tacttttggt cagattccac 4020 caccgtgatc cagtggctcc gatcatcgcc cggacgttgg aaaacgttcg ttgcgaaccg 4080 cgtgagccaa atccaatcaa gcacccccgt cgacaagtgg aaccacgtag caggttcgga 4140 caaccctgca gatgatattt ctcgaggatt ggacccgtcc gatcttctca ccaaagccag 4200 atggtggtct ggcccatcgt ggctgaagtt ttctccagat cattggccta ctggagcact 4260 tcccgctgtt gaaccgccag aagtcgccca agagattcgc aaggctcccg tagtcgccat 4320 gacctccgtc caaatcacgt tcgccgaaga gttgttctcc agatattcca agtattccaa 4380 gctacgccgt actattgcat ggtccttacg ctatctccaa ggtctccgag accgtgcgat 4440 cgcccgccgg cgtaatccaa caagcctgaa tgaatcgcca aagctatcaa cgcactgtca 4500 gccactcaac tccgatgaac tccattccgc cgatcgaatc ctttgtcgtt tggcacagcg 4560 ggaaagtttt cccgaagagc actccgatct ttccactggc gtgctcgtgt ccaagtccag 4620 tccgttgaag tggttgaagg cctacgtcga cgagttcgga ttgattcgcg tcggtggacg 4680 cctcaacaat gccgagctac ccgaaaacac caagcatccc atcgttctta gcgccaagca 4740 tccgttggcc agattgctca ccatgcactg ccataaaact ctgctccacg ctggcccgca 4800 gctcatactg gcaagccttc gccaaaggtt ctggatcctc ggtggacgaa gcctagctag 4860 gaacgtctac caccagtgca tcacctgctt tcgaagcaaa ccgaaattga ttcagcaagc 4920 catcgccgat cttccagctt cccgcgtttc tccgacgcgt cctttctccg tttgtggcgt 4980 agattactgc ggcccactgt ttgtcaagtc accagttagg aagcgtggtc caacaaaggt 5040 gtatgtggcc atcttcattt gtttttccac acgggccgtc cacgtagagc tcgtgagcga 5100 tttgacatca gctgcgtttc ttgccgcact gcgccgcctg gttgctcgaa ggggtaagat 5160 ttccgagctc cactccgata atgccaccac tttcaaggga gcatcgcatg cccttcatcg 5220 agtgtaccag atgctgaagc tggatgacac ggaccgaaac cgaatcctct catggtgcgc 5280 agacaacgag atccgctgga agttcatccc accgcgcgcc cctcattttg gaggtctctg 5340 ggaggctgcg gtgaagtccg caaagaccca cctcctgaag gaaatcggca atactagttt 5400 gtcatacgag gacatgctca ctctactcgc acaggtggaa atgtgcctca attcacgccc 5460 acttacacca atatcgtcgg aaccatcaga cttggaagcc ctgacacctg gtcacttcct 5520 ggtgggatcc aatctccaat cagtacccga agcaacaatc accgatgtct ccgaaaaccg 5580 gctcgaccat ttcgagcaaa cccagcgcca tctccaaaga atctgggcac gatggtaccc 5640 agaataccta cagcagttgc agtccagggc tgttcaaggc tgcaaagctc caagtggcat 5700 cgaaattgga caactcgtcg tggtcaaaga agactgcctc ccgcccgctc agtggcccct 5760 cggcaaaatc gtcaaggtcc acccagggaa ggacggcatc gtcagagtgg tcacgttgaa 5820 gactgcttct tcagacaacg tcatgcggcc cgttgcacgg attgccctgc tgcccatcag 5880 aaccgaatca agtccgtccg acacccccgt cggagaacat cagtagagca gtactgaact 5940 agctcgaggc gcatatcatt ttaaatcagg taattaacgt tccaatcacc ctaatccacc 6000 cgaatgatct acctggtctt tttttgtttt cggacgtcaa caacgaggtc aacgcactct 6060 atttctgtct actgtctgca catcccaaga ccatatcagc caagttcgat cttcggccct 6120 cgcccaatcg gtcttctgat cgacgatgtc aacagagagt cgaactgatc ctctccaatc 6180 tcgaacgagc agatgcagaa gtgttagtgt ataagacaga agtttgttga aataggctat 6240 ttcaaaggtg gccggaa 6257 // ID Copia-21_DPu-I repbase; DNA; INV; 2635 BP. XX AC scaffold_57; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_DPu_; KW Copia-21_DPu-LTR; Copia-21_DPu-I. XX NM Copia-21_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-2635 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 705-705 (2010). XX DR Genome; scaffold_57; Positions 325774 328408. XX CC 'CAACA' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 36..2429 FT /product="Copia-21_DPu-I_1p" FT /translation="MTNSPTREVSHVIKFNGKNFPLWKFGCWLKLEQHDLV FT HIVNGNETLPEQVYIFFFLQITILSIFMNLLFVQELNDEGEVTNYMTISDW FT MRRDVLARNYLIATIEPQQQRTLINCRTAFEMWTRLSAQHIQTSAENQHVL FT QHRFYEYQYQPSHDVMSHITEIETLASQLSDVGAPMSDIQIMTKILCTLPP FT SYRNFATIWDIVPVNERTMPLLTSRLLKEESNTLRWSSGQQDAADTAFFAQ FT NYPNTYANQDHSFISNSTCFIARRSQDWFADSGATQHMSDQREFFKEFTAV FT KPNTWFVKGIGGAQLQVHDQGSIEFTALVDGTKPTIKIETVLFVPDLGVNL FT LSIAAVTEVGISVHFIESNVSFNQNDTVVMIGERIGRTFYHLAITVDPPCD FT WACFTTPAPPSIDVWHQRLAHTSIKEIRKMASLQTVNCLILPIEEVSRIWN FT RHFDTFLRNFGLIPSESDPCLYHRHHKEEFTMVIIWVDDGLVCSNSSKAIS FT DIINYLANHFEMRSSEANHFVGLSIFRNRKEKTLYLSQPDYTEKILQRFHM FT DGCHPVSLPATPGVFLNKEENLKKTIQVPFKEAIGSLMYLMLSSRPDIAFA FT LNQASQFCENPQAAHWAAVKKIFSYLQGTKNYGLRYGPSLVAPVGYSDSDY FT AGDINTRQSTSGFIFLLNEGPIAWSSRRQNCVALSTTEAEYVAACEAAKES FT VWLRRLLLEIIPDWKQPLPLLCDNISSIELTRSPKFHQRTKHIDVRFHFIR FT AQQEAKEIDVKYTPTTEQLADPLTKPLPNPRFSILRKAIGVVLVPDL" XX SQ Sequence 2635 BP; 782 A; 672 C; 514 G; 667 T; 0 other; ggttatgggc ccagtgcaca ccttccaaag acaagatgac aaattcccca actagagaag 60 tcagccatgt gatcaaattc aacggcaaga actttcccct ctggaaattt ggatgttggc 120 tcaagcttga acaacatgac ctcgtgcaca ttgtaaatgg aaatgaaaca cttccagaac 180 aagtatacat tttttttttt ttacaaatta caatactgtc gatattcatg aacctccttt 240 tcgttcagga actgaatgat gaaggagaag taactaacta catgacgata agcgactgga 300 tgagaagaga tgtcctcgca cgcaattatc tcattgcaac gatcgagcct caacagcaaa 360 gaacgcttat caactgcaga actgctttcg agatgtggac tcgtctttct gcacagcaca 420 ttcaaacttc tgcggagaat cagcacgtgc tacaacacag attctacgag tatcagtacc 480 aaccaagcca tgatgtgatg tcccacatca ctgaaatcga gaccttggca tcacaattga 540 gtgacgttgg tgcgcccatg tctgatattc agatcatgac aaaaatatta tgcacgctcc 600 caccgagcta tcgaaacttt gcaaccatat gggacattgt accagtaaac gaacgcacga 660 tgccactact cacatcgcgg ttgctgaaag aagaatccaa cactttgcgc tggtcaagtg 720 gtcaacaaga cgcggccgat actgcattct tcgcccaaaa ttatccaaac acctacgcca 780 atcaagacca cagctttata tccaactcca cctgcttcat agcgcgacgc tctcaagatt 840 ggtttgccga ttccggcgcg acgcagcata tgtccgatca acgagaattc ttcaaagaat 900 ttacagccgt gaaacccaac acgtggttcg tgaagggtat aggcggtgct caactccaag 960 tacacgatca aggcagtatt gaattcacgg cactcgtaga tggaaccaaa ccgacaatca 1020 aaatcgaaac ggtcctattt gtgcccgatc ttggagtcaa tttgctctcc atagccgcag 1080 taacagaagt cggcatttca gtacacttca ttgaatccaa cgtgagcttc aatcaaaatg 1140 ataccgtcgt gatgatcggt gagcgcattg gcagaacatt ttaccacttg gccatcacag 1200 ttgacccacc ttgtgactgg gcatgcttca cgacacctgc cccgccatct atcgacgttt 1260 ggcaccaacg actagcgcac acaagcatca aggagatacg caagatggcg tctctacaaa 1320 cggtaaattg tcttattcta cccattgaag aggtatcacg catatggaat cgacactttg 1380 atacttttct ccgcaatttt ggtctcatcc caagtgaatc tgatccctgt ctctaccatc 1440 gccaccacaa ggaggagttc accatggtca tcatatgggt ggacgatggc ctggtttgta 1500 gtaacagcag caaagcaata tctgatataa ttaattatct tgcaaaccac ttcgaaatga 1560 gatcatctga ggccaaccac tttgtaggcc tatcgatctt cagaaaccga aaagaaaaaa 1620 cgctctacct gtcacagccg gactacactg aaaaaatctt gcaacgtttc cacatggacg 1680 gatgccatcc tgttagtcta ccagccactc ctggagtttt cctgaataaa gaagaaaatc 1740 tcaagaaaac gatccaagtt cccttcaaag aggccattgg atcactcatg tatttgatgc 1800 tgtcatcgcg cccagacatc gcttttgctt tgaatcaagc ttcacaattc tgtgaaaacc 1860 cacaagcggc acactgggca gccgttaaga aaatcttctc gtatctacaa ggcacaaaaa 1920 actacggtct acgctatggt ccctcattgg tcgctcctgt tggttactcg gattcagatt 1980 atgctggaga catcaacacc cgacaatcaa cttcaggttt catctttctt ctgaacgagg 2040 gccccatagc ctggagcagc cgtcgtcaga actgtgttgc cctctccacc actgaggcag 2100 agtatgtggc tgcatgtgaa gcagccaagg aaagtgtgtg gttgcggcgt cttctacttg 2160 aaataatccc tgattggaaa caaccactac ctttgctctg tgataacatc tcttcaattg 2220 aactcaccag gagccccaaa ttccaccaga ggaccaaaca catcgacgtc cgtttccatt 2280 ttatcagagc ccagcaagaa gcaaaagaaa tcgacgtgaa gtacactcca actactgaac 2340 agctcgcaga tcctctcacc aagccattac cgaatcctcg cttctctatt ttacgcaagg 2400 cgattggtgt agtgctagtc cccgatttat aatatttttc tttttagaga gagagctaac 2460 agaccatcgg agaaacgctg atatttttat ttttcatcac gaagaagttc aaacatttat 2520 tttatttttc catataacct tctctgtctt tctctcctct taaaaaaaaa aaagggacaa 2580 caacaacaaa tctctagatt taaccattgc acctgtacgt atgtttgagg aggag 2635 // ID Tx1-8_BF repbase; DNA; INV; 5526 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-8_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-8_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5526 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5526 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 845-845 (2009). XX DR [2] (Consensus) XX CC ORF1 is corrupted by mutations due to a low quality of the CC consensus sequence. XX FH Key Location/Qualifiers FT CDS 1372..5136 FT /product="Tx1-8_BF_2p" FT /note="endonuclease and RT." FT /translation="MSKVECRISSYNCNGLGDFHKRMGLFTWLKDKPYHLY FT CLQETHTTVRDEKRWQNEWGGQIYFSHGTSNQRGVAILIKSSASVQVHQVK FT TDESGRWIVLDIEVDNLHFCLANLYAPNDDCPSFFKELEEAIDTLEISNEH FT LVVVGDFNTVQNSAMDRSGARLRNYHPNALEAISELKGKFDLYDVWRFRNP FT NVVRYTWRRGLYASRLDYFLISFSLLNRVTKCSIADKLRSDHNLITLSFVT FT ADFPRGPGYWQFNQSLLDDKLFLVQTKQIMSEFFENNVNTANPQVVWDAAK FT CFFRGHCIKFSSWRKKQYLMKEKELIDEINMLQSQIDSAASPPPAQLEELN FT HKQKLLESLYNERLNGILLRSKSRWMELGERCTKYFINLIHRNYTRKNIQR FT LKRPSGEVTCNPKDILSDQVNFYSALYSFEDIPMPLSDVNCDGFFPEDYDK FT RLSDEQQQLCEGLVSEEELKNAIYSFQAGKSPGLDGIPVEVYKNFFDVFKK FT PMLTCFNYSFVNGYLSETQRKGLISLLLKKDSKGIDKDPTAMGNWRPLTLL FT GCDTRILSKCISLRIKHVITDIIGEDQTGFIKDRYIGDNIRRLLDTIEHYD FT QENKPGLIFVADFKKAFDSLRWDFMFKCLEFFNFGPQLIKWVKVLYKKTAS FT CVINNGYISDPFNLYRGVRQGCPLSPYLFLVAVEVLAIKIRSNHSIRGLRI FT YEKTTKISQYADDSNFPFEPKLESFYALLSDLERFSNISGLSLNVDKCKIL FT RLGPLKLSNFRLPTHLPIQWVDGDVDLLGINIPVDLNLITNVNFEPRMERL FT DRLLRPMKNKYLSLYGKIVIINTLVVPQFTNLFQVLPSPNDSFFKEYEKKI FT FSFIWDDGPERVARKVLYNEFENGGLNLKNLRAFNTSIKASWVPKLYSHPE FT WFSSWVTVFHPQLSHSLFPFYQLSSGKNLCLKGFFSEVFMAWFKFQPTTPA FT NIRQQVICLNSNIIIDGKTVCMTSFLNRNIYFINDLLSPNGNFMSYDEFSI FT NYPNACDQHRYLQLISAIPGKWKKILSTEKRKDLVCLPFQKNYKWLRNIKI FT NKSMYIYFLTSMNEVSVAHNTRLSWFYYFDKDIPWREVFTNLYRCTIDPGT FT RYFQYRLINKFLPCNRILHIWKLVDSNSCSFCHDDVESYIHVFWECPHVVP FT FWNEVESWLQVQTSIDCNLNPFIIIFGDTCQDTPPLKNLIILLGKVFIFRC FT RRLKTINFQAFKKLISTFEKTERLIASRRGKLEKHRGKWGTLCLI" XX SQ Sequence 5526 BP; 1715 A; 964 C; 1076 G; 1771 T; 0 other; acccccctcc catcaaaggc cgtgagtaac tcaattaacg ctttggagca gtactcccgc 60 agaaatagtc tgcgcattcg tggcattcct gaggtcgaca aggaaacggg agagatgtgt 120 gtacacaagg tagtgacttt ctgcagagtg aaactgggcc tggatttaca accacagtgc 180 atcgaccgag cgcaccgagt cggcactagg aaggagaacg cttcgcggct gatgctagtc 240 aagtttgtat cctggcagga ccggaaccgc gtgtttcgcg cacggagtaa actcaaaggt 300 aaacgggatg aacacgacag gcccctcctg gtcttagctg atcttaccag agagaatttg 360 acgatcttct cagcggcgtt tactgccaag aagaacggtc gtatcaagga cgcatgggtg 420 gatgccaact gtagaatcat ggtaactctg gctgataaaa ctacaaagag tatcgattct 480 gtcgaagaac tcacttaagt ttgttattga ctttttttgt caaattgccg ttttacgaaa 540 tccgccagtc tttctcgatc tccacgggaa gaatcctcac ctgtctaaaa cacgagtttc 600 tagctttgtt atgctaatga gatggattct tcccctgttg gactctgttg tgatgtacaa 660 gcgggtgact gttttacatg cgatatattc ccatcctatc aagtacatat atatattgtg 720 aaataaagaa atcatttgaa acatggatct tcaacttact atcaattgac ttgttcggtt 780 aacatgtaag cctagggtag cgctgtcctc tttatggcaa atgggtcagc caaggaaact 840 tgttgttctt tgttattttt tgtgttgttt ttattgtcac agagcctcca ttaaatcatg 900 caaagattca gccacagaaa ttgttattaa gacgaaatat gacgtgaact cagtctttga 960 tttaagtttc aataggaagg atatgactta cttatttgat cagtttgatt actttctgtt 1020 cacaattagt ttatgttttg taacagaaaa ttatgttgcg tatcattttg tcaatctgat 1080 tacatacttt atggaactat catttggtaa acaccgatgc actacttatg tttacgtatt 1140 tcttagtttg catttgagtt cacttgaatt atggtacgaa ttgttccgtt attttgttat 1200 cattatttat attctgattc aattatgctt taccatttta tgtttatgtt tgacggttcc 1260 cctagtttat tttacttata gttatgtacc agcagtatca ccagagagaa tcctgtgccc 1320 cgatgacctg acccaagctt cctctccgag acgtttaaac tcccagagaa tatgtctaag 1380 gtagagtgtc gaatttctag ttacaattgt aacgggctgg gggatttcca taagcgaatg 1440 ggactattta cctggttaaa agacaaacct taccacttat attgtctaca agaaacacat 1500 acaactgtaa gagatgaaaa aagatggcag aacgaatggg gtggacaaat atacttctcc 1560 catggtactt caaatcaaag aggggtagct attttgataa agagtagcgc cagcgttcag 1620 gttcatcagg ttaaaacgga tgaaagtgga cggtggattg tgttagacat cgaggtagac 1680 aatttacatt tttgtcttgc gaacttatat gctccaaatg atgactgtcc gtcctttttt 1740 aaggaattag aagaagctat agatactctt gagattagta atgaacattt agttgtggtg 1800 ggagatttta atacagtgca gaattcagcc atggatagat caggtgctcg acttcgaaat 1860 tatcacccta acgcactaga agctatttcg gagctaaaag gaaaattcga cttgtatgat 1920 gtgtggcgtt ttagaaaccc taatgttgtt cgatatacgt ggcgtcgggg gctctatgct 1980 agtagactag attattttct gatatccttt tcattattaa acagagttac taagtgttct 2040 attgcagaca aattaagatc agatcacaat ctaatcacct tgtcttttgt tacggcggac 2100 ttcccgcgag ggcctggcta ctggcaattt aatcaatccc ttcttgatga caaacttttt 2160 cttgtacaga ccaaacaaat tatgtcagaa tttttcgaaa ataatgtcaa caccgcaaat 2220 cctcaagttg tttgggatgc ggcgaagtgc ttttttcgtg gtcactgtat taaatttagt 2280 agctggagaa agaaacaata cctgatgaaa gaaaaagaac ttattgacga aattaatatg 2340 ttgcagagtc aaattgatag tgccgcctct cccccacccg cccaactaga agaattaaat 2400 cataaacaaa agttattaga atctttatat aatgaacgtc tgaatggtat cttgttaagg 2460 tctaaatcac gttggatgga gctgggggaa aggtgcacca aatattttat aaatctaatc 2520 caccgtaatt atacaagaaa aaatatacaa cgactcaaaa ggccctcggg tgaggtaaca 2580 tgtaacccaa aagacattct ttctgatcag gttaattttt attctgcact ttactctttt 2640 gaagatattc caatgcctct ttctgatgta aattgtgatg gttttttccc cgaggactac 2700 gataaacgtt tatcagatga gcaacagcaa ctttgtgaag gtctagtaag tgaagaagaa 2760 ttgaaaaatg caatatattc atttcaggct gggaaatcac cagggcttga cggtatacct 2820 gtagaagttt ataagaattt ttttgatgtc tttaaaaagc ctatgttaac ttgttttaat 2880 tattcgtttg ttaatggata tctgtcagaa actcaaagaa aaggattgat ttcattgcta 2940 ctcaaaaaag acagtaaagg catagacaag gacccaaccg ccatgggaaa ttggcgcccg 3000 cttacccttt taggttgtga cacgagaata ctttcaaaat gcatatctct tagaatcaaa 3060 catgtaatca cggacataat tggagaggat cagactgggt ttattaagga taggtatata 3120 ggcgacaaca ttagaagatt attagataca atagagcatt acgatcaaga gaacaaacct 3180 gggttaatat ttgtagctga tttcaagaaa gcattcgatt cgttgaggtg ggatttcatg 3240 tttaaatgtt tagaattctt taactttggt ccacaactaa ttaaatgggt aaaggtttta 3300 tataaaaaaa cagcgagttg cgttataaac aatggttaca tttcagaccc cttcaattta 3360 tatcgcggag ttcgccaggg atgtccattg tccccttatc tttttctcgt ggcggtggag 3420 gtgcttgcta ttaagatcag gtctaatcat tccatacggg gtttaaggat atacgaaaag 3480 actacaaaga tatcacaata cgccgacgat tcaaattttc cgttcgaacc aaagttagaa 3540 tcattttatg ccttgttgtc tgatttagag cgtttttcca acatctcagg tctttcttta 3600 aatgtagata aatgcaaaat attacgatta ggccccttga agttatctaa ttttcgtttg 3660 cccacccatc taccgataca gtgggttgac ggggatgtcg acctactggg catcaatatc 3720 cctgttgatt taaacttaat cacaaacgtt aatttcgaac cccgtatgga aagactagat 3780 agattattac gccctatgaa gaataaatat ttgtccctgt atggtaaaat tgttattatt 3840 aacaccttgg tggttcccca gtttacgaat ttatttcaag tgttaccatc ccctaacgat 3900 tcctttttca aagaatatga aaagaagata ttttctttta tttgggacga cggtccggaa 3960 cgagttgcac ggaaggtatt atacaatgaa tttgaaaacg gaggacttaa tttaaaaaac 4020 ttacgtgcat tcaacacttc tataaaagct tcatgggtgc ctaaacttta ttctcatccc 4080 gagtggttct cctcttgggt aacagtcttc caccctcaac ttagtcatag tttatttccc 4140 ttttaccagc tgtcttcggg gaaaaacctt tgtttaaaag gatttttttc tgaagtgttt 4200 atggcctggt tcaaatttca gcccacgacg ccggcaaata tacgacagca agtaatttgt 4260 ttaaactcta atattatcat tgatggtaaa actgtttgta tgacttcttt tctgaataga 4320 aatatctatt tcataaatga cttgctatcc ccgaatggta attttatgtc ttatgatgag 4380 ttcagtataa actacccaaa tgcttgtgac caacatagat atctacaact tatttcagct 4440 attcccggta aatggaaaaa aatacttagt actgagaagc ggaaagacct tgtgtgccta 4500 ccctttcaaa aaaattataa atggttacga aacataaaga tcaacaaaag catgtatatc 4560 tactttctta cgtccatgaa cgaggtcagt gttgcacata atactagact ttcttggttt 4620 tattatttcg ataaagatat accatggcgg gaagtattta caaatttata tagatgcact 4680 attgatcccg gtactagata ttttcaatat cgtttaatca acaaattttt accttgtaac 4740 agaattttac atatatggaa attggtagat tctaactcct gttctttctg tcacgatgat 4800 gtagaatcct atattcatgt tttctgggag tgtccgcatg tggtaccttt ttggaatgag 4860 gtagagagtt ggctacaagt acaaacttct atagactgta accttaaccc ctttataatc 4920 atatttggag atacatgtca agatacgccc cccctaaaaa atttgattat tttgttaggc 4980 aaagttttca ttttcaggtg tagacgttta aaaacaataa acttccaggc ctttaaaaaa 5040 ctgatatcta cttttgaaaa aacggaacgt cttattgcat ctcggagagg gaagctggag 5100 aagcatcggg gtaagtgggg aaccttgtgt ttaatttgat tttgttacaa taataatatg 5160 ataagatggt acgatgtaat tacgtaataa tgctaaatga tgataatatt ataataattg 5220 tttgaaatgt tgacatgtat atgatgaacc aatgttcaca taatgaggta atactatgat 5280 aatgaggtct tatgatataa tgtgttacaa tattatggta aactgatacg atgtgatgat 5340 ttgataacta aaatgaataa cgagtataat gacataataa cagctacatg atgtgatctg 5400 taataatgta cataatgtgg ctgttagaca tttagcttga tatgtaaggg atggaggctc 5460 tgcgttgttc tgtattatta ttcatcattg acaataaaat aaaaaaataa aaaaaaaaaa 5520 aaaaaa 5526 // ID Copia-18_DPu-I repbase; DNA; INV; 4454 BP. XX AC scaffold_128; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_DPu_; KW Copia-18_DPu-LTR; Copia-18_DPu-I. XX NM Copia-18_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4454 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 699-699 (2010). XX DR Genome; scaffold_128; Positions 326443 321990. XX CC Positions [1743-2273] - Integrase core CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1161..4286 FT /product="Copia-18_DPu-I_1p" FT /translation="MPQSVRACHILSRLRLHSTHVRSEIILTNFRPIAELE FT RPIDGIGGAKLYARGIGDIAVSRWANGIKIDGWLRDVLYVPQLTVNLVSIG FT CITENGYMVVFSKDSAKIMQKDDTVMEGSRIGKTLYRLDISAKSPPITSLI FT ASSSSATLNVWHERLAHVSHDIVKKMATSNHVTGMDVEPVKEDRNLYCNGC FT NLGKMHKLPFSTSIRITTRVGELIHTDLVGPMHISSPNGAKYYVVFKDDYS FT RYKVVYFLKLKSECPDLFKLFSKKIFCETGKRIATLRSDNGGEFMSHEFQS FT WLTEERIKQEMSAPHTPEQNGKAERDHRTTVEAARSQIHGKNLPLTLWAEA FT VNHSVYTLNRTLTKQRQVTPYEMWHGTKPDISHLRIFGSVAYALIADAERR FT ILDPKAIKGLYVGESEQQKASRIFIKETGRTIICRHVKIYETEIVDTTTSN FT KRDEESGADMSKSENASTTSPPENITQLTKDTTVSIPQPIRQSTRQRIPKK FT LWPVESYAALMSSDETMTNHPMFIFYEPKSFKEAMTSTERDLWKKAADEEI FT RSHQENNTWTVMPLPPNRVSIPSGWNFKIKTDKDGQPKRRKARFFAKGYRQ FT IKGIDFQESFAPVVRYDSLRVLMAIAAMQDLELVQLDVATAFLNGDIDEEI FT YITQPEGYIIPGRETEVCKLNKSLYGIRQASRIWNLKLNSVLIAAGLRQSN FT ADPCVYFRTDNEETVIVAVWVDDGIIAGNRMATIDKIVNTLKTSFKMTHGP FT AEHFVGLVIQRDRANKQIVLSAPQYVEKILAKFQMNTCHPISTPTEKGTPR FT LYALPPSPNDSEIKASFPYREAVGSIMYAAITIRPDISFIAGQLAQHCENP FT SPVHWKGAKRVLRYLAGTRDHGICFGGKRTNPTGLYGYSDADYAGDPNSRR FT STSGFLFTLNGGPITWSSRRQPIVALSTMEAEYIAASDACREATWIRLILS FT ELGAHQNNPTTIWCDNESAISLAKNPESHKRSKHIDVRYHHIREQIKKGVV FT NLSYVNTKNQLADILTKGLDLQSQEKLMKEIGIVKP" XX SQ Sequence 4454 BP; 1446 A; 1090 C; 992 G; 926 T; 0 other; ggttatgggc ccagcgccag ccttagagaa aagacaatta ctatggaagg aattttaact 60 cttgacgcca tcaggcacat aaaacgattt gatgggacac tattccaaaa ctggaaacat 120 tccatggaaa ttatgtttga attcaaagac atcaaagaga ttgtcgaggt aagcccaact 180 accactaagt caaaagccaa attttgttag tcgcagactt aaccattcct cctatcatcc 240 cacagggaga actatgtcct gaaccagcct accaagaagt tgaaaatgaa agagtgtgca 300 ctaacgagga tgaaatttat acatggcgta agagagactg ttatgccaga ctgttaatat 360 tcaacagcac tgatgatgta cgacaaaaag ctctgttcaa ctgcagaaca tcccatgaga 420 tgtggggtag actgaacacc caatacttgc agagggctgc tgataacaaa cacctgttac 480 acagagaatt cttgaatctc aggtagatta tcttcagtta agctataacg aaatgtttga 540 tttcaacctg tcctattaat ttacagttat gtcgatggtg acgacatcat gaaccatgta 600 acagctctag aatctatggc agcacaactc aatgaccttg gcgtcagtgt gaccgatcat 660 gacatcataa caaaaattgt ttgtagtcta ccaacgcgtt tcgacaacct tgtatcctca 720 tgggatggaa tgcaggatca agagaaaaca ctagatgccc ttcgagcacg ccttgtttca 780 gaagaacgga aatttactct tcgaaaagca caagccggcg gatcgagcat agaaactgca 840 actgcatcac ccaataccgc attttttggt caagggatgg gcaatcacgg acgctcacga 900 ccagatatca aatacggaag aggtggaggg agcttcagag gaagtcaacg gcaatcctca 960 gcccaaggaa gatctaagga ccaagtggca agagacaacg ctttctgctc atattgccat 1020 aaaaccaggc actacgcatt cgaatgtcgt aaacgaattg aagacgaagg ggcgagccaa 1080 ggaaacaaca agaaagccaa cctggcagat tcgagcgggg gaaaagaaag cgaccgagac 1140 ttctccttcg tatcgacgga atgcctcaga gcgtcagagc ctgccacatt ctatctcgac 1200 tccggttgca ctcaacacat gtccgatcag agatcattct caccaacttt cgaccaattg 1260 cggaactgga gcggcccatc gatggaatcg gtggcgctaa actgtacgct cgcggcattg 1320 gcgacattgc agtaagccga tgggcaaatg gaattaagat tgacgggtgg ctcagagatg 1380 tactgtacgt cccccaacta actgtgaacc tagtctcaat cggttgcatc acggaaaatg 1440 ggtatatggt cgtcttctca aaagactctg ccaagatcat gcaaaaagat gacacagtga 1500 tggaaggttc aagaatcgga aaaaccctgt atagactcga tatatcggcc aaaagtccac 1560 ccatcacgag tttaatagca agttcaagct ctgcaacact caacgtatgg cacgaacgac 1620 tcgcccatgt aagccacgac atcgtcaaaa agatggcgac cagcaatcac gtcaccggaa 1680 tggatgtcga acccgtgaaa gaagatcgaa acctgtactg taatggatgc aatctgggaa 1740 aaatgcacaa gctgcctttc tccacgagca tacgcatcac tacccgagtc ggggaactca 1800 ttcacacgga cttggtaggt ccgatgcaca tatcgtcacc gaatggagcc aaatactacg 1860 tagtattcaa agacgactac agtcgctaca aagtagtata ctttctcaaa ttaaagtccg 1920 aatgtcccga tttattcaaa ttgttctcta aaaagatctt ttgtgaaacg ggaaagcgga 1980 ttgctacact acgctccgac aatggcgggg aattcatgag ccatgaattc caaagctggc 2040 tgacagaaga aagaattaaa caagaaatga gcgcaccaca cacccccgaa cagaacggaa 2100 aggcggaacg agaccaccga accactgtcg aagcggcgcg aagccaaatc cacgggaaga 2160 atctaccact cacactatgg gcagaggctg taaaccactc agtctacacc ctaaaccgaa 2220 ctctgacgaa acaacggcag gtgacgccat acgaaatgtg gcacggaaca aaaccagaca 2280 tatcacacct taggatattt ggttccgtcg catacgccct catcgcagat gccgaaagac 2340 ggatactgga ccctaaggcc atcaaaggac tttatgtcgg tgaaagcgaa cagcaaaagg 2400 ccagtcgcat attcatcaag gagacaggca gaaccatcat ttgtcgccac gtcaagattt 2460 acgaaaccga aatcgtcgac acaaccacct ccaacaaaag ggatgaagaa tccggcgctg 2520 atatgtcaaa aagcgagaac gcgagcacca catcaccacc agaaaatatc acccaattga 2580 cgaaagatac aacagtatcc atcccacaac ccatccgaca gtccacgcga caacgtattc 2640 caaagaaatt atggcccgtg gagtcgtacg cagcactgat gtcgtctgat gaaacgatga 2700 cgaaccaccc gatgttcatc ttttatgaac caaaatcatt taaagaagcc atgacttcaa 2760 cagaacgtga tttgtggaag aaagcggccg acgaagaaat ccgatctcat caagagaaca 2820 acacatggac tgttatgcct ctcccaccca acagagtgtc aatccccagt gggtggaatt 2880 tcaagataaa aaccgacaaa gacggccaac caaaacgaag aaaagcccga ttttttgcaa 2940 aaggataccg ccagatcaag ggcatcgatt ttcaagaatc gtttgcccca gtagtacgct 3000 acgattcact tcgagttttg atggccatcg ccgctatgca ggatctcgaa ctcgttcaac 3060 ttgacgtcgc cacagcattt ttgaatgggg acattgatga ggagatctac atcacgcaac 3120 ccgaaggata catcatcccc ggtcgtgaaa cagaagtatg caagctcaac aaatcactat 3180 atggaattcg gcaggcgtca cggatatgga atttaaaact aaactcagta ctcatcgcag 3240 ctggcctacg acaaagtaac gctgacccgt gcgtatactt ccgcacagac aacgaagaaa 3300 ccgtcattgt tgcagtatgg gtagacgacg gcatcattgc aggaaatcgt atggccacca 3360 tagacaaaat cgtgaacact ttgaaaacca gtttcaaaat gacacacggc ccagcagaac 3420 acttcgtcgg cctcgttatt caacgcgacc gagccaacaa acagattgtt ctatcggcac 3480 cacagtatgt agagaagata cttgccaagt tccaaatgaa cacatgccat ccgatctcaa 3540 ccccgaccga aaaaggaaca ccaagactgt atgctctacc acctagcccc aacgattccg 3600 aaataaaagc ttcattcccc taccgagaag cagtggggag cataatgtat gcggcgatta 3660 ccattcgtcc tgacatttca ttcatcgctg ggcaactcgc ccaacattgt gaaaacccga 3720 gccccgtcca ttggaaagga gcaaagcggg tactacggta ccttgccggg acccgggatc 3780 acgggatatg cttcggcggt aaaagaacga acccaaccgg actctatgga tactctgacg 3840 cagactacgc tggcgatcca aacagcagga gatcaacatc tggattctta ttcacgttaa 3900 atggaggacc gatcacctgg tcaagtcgac gccaaccaat cgtagcgctg tctacgatgg 3960 aagcagagta tatcgctgca agcgacgcat gcagggaagc aacgtggata agactgatct 4020 tgagtgaact aggtgctcac caaaacaacc ccacaacgat atggtgcgac aatgagagcg 4080 cgatatcact agcgaaaaat ccagagtcgc acaaacggtc gaaacatatc gatgtacgat 4140 accatcatat ccgcgagcaa atcaagaaag gagtcgtcaa cctttcatac gtgaatacga 4200 agaaccaact tgcagacata cttacaaaag ggttagatct gcagagtcag gagaagctga 4260 tgaaggagat tggaatcgtg aaaccttagc gcgctgattg aaacagcggc aattcatttt 4320 gtgaaattag ttcaagtgta ctatgcggac gttgcacaca ttcgtgatga cattttttcc 4380 ttccacatgc tatgttgaaa ttcagaataa ttatctcggt ttaagttcgc tatggacggc 4440 tgattgagga gaag 4454 // ID CR1-14_HM repbase; DNA; INV; 3464 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3464 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1842-1842 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 50..3142 FT /product="CR1-14_HM_1p" FT /translation="MDSTYRNDFEKISNLFQTNTFNLDNESDPDINYFNEI FT NGSKFECSYFYPDKIKSIFKNNDNRDYFNIIHINIRSLKKNFDNFLNMICQ FT TENFFNVICLTETWCTEEDFKSNSNLHLTGFNTVFLERNVNKRGGGVLFYI FT KENLAYNIRSDMSVSDDNIEILTVEIINKKSKNILLSCCYRPPTGRTEYLS FT NYLVHNIIEKTSQEKKKNYILGDFNLDCFQYNEKKYITKFYNHLFETGTIP FT IINKPTRITKSSSSLIDNVLTNDYFNITLKKGIIKTDVSDHFPIFFCLGAD FT HKTNTNGKITLKKRIYSNNNLNLFKEQLLHQDWSYINFNEDINIIYKSFFN FT IFYKIYEANFPIREIVLNAKDISCPWITKGIKKSSKIKQKLYIKYLKNKSD FT KHKNNYTVYKNLFEKLRKKAKQIYYSDLLIKHKHNSRRVWQIMKEITSKHK FT TNTNTMPNAIKFENKLITDSNEIAAEFNNFFSSIGHNLSNKIPYVDNNSVD FT EFISSPNSTINFSNLTYNEFEIAFKSLKRNKAIGPDDINGNIVIDSFNEIK FT DILFKVFKASITQGSFPNSLKIAKVTPIFKTGDHTNITNYRPISVLPVFSK FT ILERIMYNRIYTYLIDNKLLYKNQFGFQRNSSTEHAILQVTRSIADSFKNS FT QFTLGIFIDLSKAFDTVNHHILIKKLESYGIKDKTLNWFKSYLSNRKQFVY FT NKESSSQIIKNITCGVPQGSILGPLLFLLYINDLYKASTLTTVNFADDTNL FT FQSHENIEILFNNMNNELKRISYWFRKNKLSLNTDKTKFILFHPTSKKKRI FT QNILPDLFIDNTIIKREKFTKFLGVIIDENLSWSQQIDSISMKVAKNIGVL FT YKARYILNKKQLTQLYYSFIHCHINYANIIWGSAHKTKLKSLYRHQKHAAR FT IINFGNRFSSSTPLLIEMKALNIYELNIFNILCFMFKCKENLSPSVFINLY FT NLKPKNKYELRDSKNIQEPFCKTKIDQFCISYRGPFLWNKIVLPNFDFSFK FT WSYSSFKKKTKEIIFSTQNIFIYF*" XX SQ Sequence 3464 BP; 1397 A; 518 C; 407 G; 1142 T; 0 other; gtgtatataa aaaccttcaa atattttcaa aataacaaaa acatacaaaa tggattcaac 60 ttatcggaat gattttgaaa aaatatcaaa tttatttcaa acaaatactt ttaatttaga 120 caatgaatct gatccagata ttaattattt taatgagatt aatggatcaa aattcgaatg 180 ctcttacttt tatccagata aaattaaaag catatttaaa aataatgata atcgtgatta 240 ttttaatata atccacatta atatcagaag tttaaaaaaa aattttgata actttttaaa 300 tatgatttgt caaacggaaa atttttttaa cgttatttgc ttaactgaaa cttggtgtac 360 agaggaagat tttaaatcaa actctaatct ccatcttaca ggttttaata cagtcttctt 420 agagcgtaat gttaataagc gtggaggagg agttcttttt tacataaaag aaaaccttgc 480 gtataatatt cgaagcgata tgagtgtttc tgatgataat attgagattt taacggttga 540 aataataaat aaaaaatcta aaaatatttt actaagctgc tgttatcggc caccaaccgg 600 aagaactgag tacctaagca attatctagt tcataatata atagaaaaaa ctagccaaga 660 aaagaaaaaa aattatatac tgggggattt caaccttgac tgcttccagt ataatgaaaa 720 aaaatatata acaaagtttt ataatcattt atttgaaacg ggaacaatac ccattataaa 780 taaacctact agaattacaa agtcttcatc ctccttaatt gacaacgttc ttaccaatga 840 ctatttcaat attacactca aaaaaggtat tataaaaact gatgtttcgg atcactttcc 900 aatttttttc tgcttaggcg ctgaccacaa aacaaatacg aatggtaaaa taacattaaa 960 aaaacgaatt tatagcaaca ataacttaaa tttatttaaa gagcaacttt tacaccaaga 1020 ctggagctat ataaacttta atgaggatat aaatatcatc tacaaatctt tttttaatat 1080 attttacaaa atatatgaag caaattttcc tattcgagaa atagttctta acgccaaaga 1140 catatcttgt ccttggataa ctaaaggaat aaaaaaatca tcaaaaataa aacaaaaact 1200 ctatattaaa tatcttaaaa ataaatccga caaacataaa aataactaca cagtttacaa 1260 aaatctattt gaaaaactcc gaaaaaaagc caaacaaatt tactactctg acttgcttat 1320 aaaacacaag cataattcaa ggcgggtgtg gcaaataatg aaagaaatta ctagcaaaca 1380 taagacaaac acaaatacga tgccgaatgc tataaaattt gaaaataagt taattaccga 1440 ttcaaatgaa atcgcagccg aattcaataa ctttttttcg tcaataggac ataacctatc 1500 aaacaagatt ccttatgtag acaacaactc tgttgatgaa tttatatcat caccaaattc 1560 aactataaat ttttcaaatc taacttataa tgaattcgaa attgcattta aatcattaaa 1620 aaggaataaa gcaatagggc ctgatgatat aaatggaaat attgtaatcg actcttttaa 1680 tgagattaaa gatatcctct tcaaagtctt caaagcatct attactcaag gatcgtttcc 1740 aaatagtttg aaaatagcca aagtaactcc aatctttaaa acaggtgacc ataccaacat 1800 aacaaactat cgtcctatct cagttctacc ggtcttttca aaaatattag aaagaataat 1860 gtataataga atttacactt atcttattga taataaactc ctttataaaa atcaatttgg 1920 ttttcagaga aacagttcta ctgaacacgc tattctccaa gtaacacgca gtattgccga 1980 ttcatttaag aattctcaat tcacactagg catattcatt gacttgtcaa aagcttttga 2040 tactgttaat catcatattt taataaaaaa gctagagagc tatggtataa aagataaaac 2100 cctaaattgg ttcaaaagct atctaagtaa ccgcaaacaa tttgtttata acaaagaatc 2160 ttcatcgcaa ataataaaaa atataacatg cggtgtcccc caagggtcca tacttggacc 2220 actattattt ctattataca taaatgatct ttacaaagct tctaccctaa cgaccgtaaa 2280 ttttgcagat gatacgaatt tatttcagtc gcacgaaaac atagaaatac tttttaataa 2340 tatgaataat gaattaaaaa gaatatcata ttggtttaga aaaaataaat tgtccttgaa 2400 tacggataaa acaaagttta tactttttca cccaacctct aaaaaaaaga gaatccaaaa 2460 tatcttacct gacctattca ttgacaatac aatcataaaa agagaaaaat ttacaaaatt 2520 tttaggagtc ataatagatg aaaacttatc ttggagtcag caaattgata gtatttcaat 2580 gaaagttgct aaaaacattg gagttctata caaagctcga tatatactaa ataaaaagca 2640 actaactcaa ctatattatt catttattca ttgccatata aactatgcca atattatatg 2700 gggaagcgct cacaaaacca aacttaaatc tctctatcgg catcagaaac acgcagctcg 2760 cataattaat tttggaaatc gtttttcaag ctcaacacca cttcttatag aaatgaaagc 2820 tttaaatata tacgaactaa atatttttaa tattttatgt tttatgttta aatgtaaaga 2880 aaatttatct ccaagtgtat ttataaatct ctataatttg aaaccaaaaa acaaatatga 2940 acttcgtgat agtaaaaata ttcaagagcc gttttgcaaa acaaaaattg accagttttg 3000 tatttcttat cgtggtccat ttctttggaa taaaatagtt ttacctaact ttgatttttc 3060 ttttaaatgg tcttattcct cctttaaaaa gaaaacaaaa gaaataatct tttcaacaca 3120 aaatatattt atttattttt agtatttcaa gttatttgat atttttgtcg actgtgtgat 3180 gaatattatt taaacaatat atgctgaata ctttatatta taatattgtt caatattgag 3240 aatgttatat atccatatta taaaatatta tgttaaaact tgcttaacat ctgaagaata 3300 ttctaacgct tatgtaattt ttaatatttg taaattttct accttttaat tcgttatata 3360 gattgtaaag ggcttcatga taagatcatg tgatcttctg gaagtcctac cagtaatata 3420 ttcaatgtaa atttaccttg ttcatatttt tctggaaaaa aaaa 3464 // ID CR1-109_AAe repbase; DNA; INV; 2999 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-109_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2999 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1197-1197 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >97% CC identity. The consensus is 5'-truncated. XX FH Key Location/Qualifiers FT CDS 1..2892 FT /product="CR1-109_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="AAPASSNILMYYQNVGGINSSLAEYNRAVSDSCYDVY FT AFSETWLNANTLSNQLFDSSYIVYRQDRSASNSNKRTGGGVLFAVRSNFKS FT RVLNSPNGSSIEQLWVAITTVDATLYLCVIYIPPDRVNDESLIEKHTDSLD FT WVVSQLKPRDSIMIIGDFNLSGISWQYDXSGSLFPVASRSTISLTSRKLLD FT AYSTACLRQTVGIMNENSRVLDLCFVSEELRDASTVMQAPSPLVKTCRHHP FT PILLKMEICPQKRFHDVSESVFYDFRNADFNGMNDFFENVDWDDVLRDSDA FT NLAASTISAILLYGIDQYVPVKVKREPSKPAWSNSELKNLKRVKKAALRRH FT SKFRTDSTRARYMQANTEYKQLNDCLYNAHQDHLQSILKANPRSFWKHVNE FT QRKESGLPPTMSDGLHEADSKEDIADMFRTQFSNVFTNEHVDPDIVNDAIR FT DVPRLSTFVQQIIITDDMVASASKELKSSTGSGPDGIPSLVLKRCAETIAA FT PLARVFNLSLASGVFPNCWKQSYVFPVFKKGSKQLVSNYRGIAALSATSKL FT FEVIILQKLVQSYAHYISPDQHGFMPKRSTTTNLACFTSFILRLMEAGQQV FT DAIYTDLSAAFDKMNHQIAVAKFDKLGMGNSMLLWLQSYLTGRNMSVKIGD FT HVSSPFRVWSGVPQGSHLGPFLFLLYMNDVNFKLKCLKLSYADDLKLYWTI FT KQPEDAVFLQHELEAFAEWCQINRMSLNVSKCSVISFSRKHAPFHFGYFLA FT GTQLERVSTVKDLGILLDSKMTFKDHVAYAVSKASSQLGFLFRYAKKFRDV FT YCLKALYCSIVRPTLEYSSIIWSPYYQNGIKRVEAIQRKFVRFALRHLRWR FT DPLNLPSYGSRCQLIKLESLVSRRNVAKACFVGDLLQGNIDCSTLLSCLDI FT NVRRRNLRTHSFINIPSARTNYGLQEPMRSMSRVFNMCYHVFDFNVSRETN FT KSSYRRVLC" XX SQ Sequence 2999 BP; 798 A; 727 C; 645 G; 827 T; 2 other; gctgcaccag cttcatccaa catcctaatg tactaccaaa acgttggcgg tatcaatagc 60 tcgctcgccg aatacaaccg ggccgttagt gatagttgct atgacgtcta cgctttctcc 120 gaaacctggc tcaacgctaa tacattgtcg aatcagctct tcgacagctc gtacatcgtg 180 taccgtcaag accggtcggc ttccaatagc aacaaacgta ccggaggcgg cgttttgttc 240 gctgttcgat cgaatttcaa atctcgtgtg ctaaattctc ctaacggatc atcgatcgag 300 caattgtggg ttgctattac cactgtcgac gcaacgctct acctgtgtgt gatttacatc 360 ccccctgatc gggtgaacga tgagtccctg attgaaaagc atactgactc gctggattgg 420 gtggtttctc aattgaaacc aagagacagc attatgatta tcggcgattt caacttgagt 480 ggtatttcct ggcagtacga ctsttccggt tctctctttc cggttgcctc tcgatcaacg 540 atcagcctga cgtcccgaaa gctactagac gcgtacagca ccgcctgcct ccgacagact 600 gtcggcatta tgaatgaaaa ctcccgagtg ctagacctgt gcttcgtcag cgaggaatta 660 cgagatgcta gtacggtaat gcaagcccca tcgccacttg ttaaaacgtg caggcatcat 720 cctcccatac tgctgaaaat ggagatatgt ccgcaaaaac gcttccacga cgtttccgaa 780 agtgttttct acgatttccg taacgctgac ttcaacggaa tgaacgattt cttcgaaaat 840 gttgactggg acgacgttct tcgagattct gatgcaaatc tggctgcctc gacaatctct 900 gctatcttgt tgtacggcat agatcagtac gttcccgtca aggtcaaacg tgaaccttcg 960 aaaccagcct ggtctaattc cgaactgaaa aatctcaaga gggtgaagaa agcagctctc 1020 agacgtcata gtaagtttcg tacagattct acgagagcac ggtacatgca agcgaatact 1080 gaatataagc agctgaatga ttgtctgtac aacgctcatc aggaccattt gcaaagcatc 1140 ctcaaagcaa accctagaag cttctggaaa cacgtcaatg aacaacggaa agaatctgga 1200 ttgcccccta cgatgtccga cggcttacat gaggccgatt ccaaagaaga cattgctgat 1260 atgttccgca cgcaatttag taacgtcttc accaatgaac acgtcgatcc agacattgtt 1320 aatgatgcaa ttagagatgt tccccggctt tccaccttcg tgcagcaaat cataatcact 1380 gacgatatgg ttgcatcagc aagcaaggaa ttgaaatcgt ccacaggatc cggtccggat 1440 ggcattcctt ctcttgttct caaacgttgt gcagaaacaa ttgcagcccc gctagcaagg 1500 gtttttaacc tatccctggc ttctggtgtg tttccgaatt gctggaaaca gtcgtacgtt 1560 tttccagttt tcaaaaaagg tagcaaacag ttagtctcca actatcgtgg aatagctgcg 1620 ctgagcgcta cttcgaagct gttcgaggtc attattctgc aaaaattggt ccaaagctat 1680 gcacattaca tatcgcctga tcaacacgga ttcatgccca aacgttcgac gacaactaac 1740 ctggcttgct tcacttcgtt cattttacgc ctaatggagg ctggtcaaca agtcgacgcc 1800 atttacacgg atctctccgc tgcatttgac aagatgaacc atcaaatcgc agtggccaag 1860 tttgataagc ttggtatggg aaatagcatg cttctatggc tccaatcgta cctaaccggt 1920 cggaatatgt ccgtcaaaat tggagatcac gtttcgtcac cctttagagt ctggtccgga 1980 gttcctcagg gtagtcacct cggtccgttt cttttcctgt tatacatgaa cgatgttaat 2040 ttcaagctta agtgcctgaa gctttcatac gccgatgact tgaagctgta ttggacgata 2100 aagcaaccgg aagacgcagt ttttctacaa catgagttgg aagccttcgc tgaatggtgt 2160 caaataaatc gaatgtcgtt gaacgtgtca aaatgttccg ttatatcatt cagccgcaaa 2220 catgcacctt tccatttcgg atattttcta gcaggcacgc aacttgagcg cgtatcaaca 2280 gtcaaagact tgggcattct cctggactca aaaatgacat tcaaagacca cgttgcatat 2340 gctgtgtcca aagcttcatc tcaactggga ttcctgttcc ggtatgccaa aaagtttcgg 2400 gatgtgtact gcctaaaggc tttatactgt tcgatagtgc gccctactct ggaatattcg 2460 tcaataatct ggtctccgta ctatcagaat ggaatcaaac gtgttgaggc tattcaacgc 2520 aaattcgtcc gctttgctct acgccacctc agatggagag atccgcttaa cctgcccagc 2580 tatggtagcc gatgtcagct aatcaaattg gagtcactgg tgtcgagacg caacgtagcc 2640 aaagcctgct ttgttggaga tctcctgcaa ggcaacatcg actgctccac actgctcagc 2700 tgcttggaca tcaacgtacg acgccgcaat ctwcgaactc attcgttcat caacatacct 2760 tctgccagaa caaactacgg tttacaggaa ccgatgcgta gcatgtcccg agtattcaat 2820 atgtgttatc atgtttttga ttttaatgtg tcgcgcgaaa caaataagtc tagttaccgt 2880 cgagttttgt gctaattttt gtaatcttgg tttgaatatt ttagtgtttg tattttgttt 2940 aagaaacaag ggtcattggg gtaatgtttt acctgttgac taataaataa ataaataaa 2999 // ID Gypsy-3_TCa-I repbase; DNA; INV; 4213 BP. XX AC ChLG4; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_TCa_; KW Gypsy-3_TCa-LTR; Gypsy-3_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4213 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG4; Positions 3682049 3677837. XX CC 'CTTTG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 81..2345 FT /product="Gypsy-3_TCa-I_2p" FT /translation="MHFRHKWPSWSKNMRISWHKHRPRVQASDTPSKQAPT FT SSIVFSNSIPIPEKFSFEKEDWQVWITHYERFRTATKLDKSTQAEQINSLL FT LHMGAKVTKFMEAKQKQETDFPSYKALKEFFDNEFKDTPNTIYARAKFNRR FT DQREGEDAQTYIADVISLAKTCNYRDLEEELIRDRLVVGIRDEKLSENLQM FT NDKLTLKTAIDKITQVERIRTENRELRVREETVNRVARDKAKKFNHKSVKY FT EGKQRRSKDGKQQSEHKFKGCIRCGNSNTHNRMECPANGQTCHKCNFKGHF FT ASRCKTKNVNAVESETSISDDTDDSYDSSYDCREIININRVKLDKPWEAVA FT EIFNTNIVFKIDTGADETIISSKEFKDKLQGKVKLQRTKTTLLGPGKGNQR FT KLEIKGEIVVPLKWENKIENVRCFVVDTPDNLLGRPALQKLDMIKWTGKEL FT VASDSVIRHVTSRKADEKFQHKMMVKYPEVFKGLGTLKNFEYTIEVKSDAQ FT PWTLNTPRRIPLPMMDVVKKELDKMIREDVIEAINEPTEWCAPMVIAAKKN FT GKIRVCSDYTELNKCVNRELYQLPSVDETISKLKDAKWFTKLDFSSGFWQL FT KLSPQSRKYTCFLTPFGRYVYKRIPFGITSAPEVFQKTISGVIGKLKMEHV FT HVHADDILITGRNEADHDTNVNKVLTLLSDHGLTLNKSKCEFKTNKTLYLG FT YLISENGVQADPKSVEAINNYPAPSSVTEVRTFLGMVNYVSKFIVNTSEKN FT PIIT" FT CDS 2296..4140 FT /product="Gypsy-3_TCa-I_1p" FT /translation="MSQNLSLILRKKTQSLRELLHKDKQFVWTDKHQADFE FT KLKSEIASSRVLAKFSTDKKSRVSADASSFGLGAVLEQLQSDGNWKPTYFC FT SRTLEPCEQAYAQIEKEALAITWACERLENFLLGAHFEIHTDHKPLTVILA FT TKELNKLSNRLQRFRMRLLKFNYTIEYVPGKTFFVPDALSRAPIQGRIRHQ FT DPLLEDIQNLYINFVMREVVTEICSTEEIKSYQNKDPTYSKLKEYVKNGWP FT QKHKSDPLVSKFFSHQQDLSIAHDIVCYKDRVVIPGPLQKKCLYVLHDGHF FT GINKCLDRAKATVWWPGITRELKEVVGKCEICIKLRRPTVDPMLPSEVPSR FT PWQTIGADLAEYHESTYLVVQDYYSKYPEIRKLRTTKAKVVIETFKEMFAR FT HGIPEVVRSDNGKQFDCHEYRTFSRKYGFKLITSSPHYQQSNGQAESAVKL FT LKTILRKNEDPYLALLTYRNTPTKCGLSPAQLLFGRNLRDRLPRLPSGLKP FT ATIDHDTVRQTLINNQAEQKSNYDKRHRVTKEEKNLKEGDRVWIINMQKEG FT EVQRRCEEPRSYLIETDGGCVRRNRRHLQPLPDRNNEPEHDPEQPDESEEP FT KKRPIRRPKRYQDYYCY" XX SQ Sequence 4213 BP; 1513 A; 851 C; 872 G; 977 T; 0 other; tggtgtcagt gttttaaaat ccgaatcccg ctcgtgttac aaattgtgag tgtaaaaaac 60 caccacaact atcacgatgg atgcacttca ggcacaagtg gccgagttgg agcaaaaata 120 tgcggatctc atggcacaag cacaggcccc gggttcaggc gagtgacaca ccgtcaaagc 180 aggctccaac gagtagcatc gtcttcagca actccattcc gattccggag aaattttcat 240 tcgaaaaaga agattggcaa gtgtggatta cacattatga acgttttaga acagcaacaa 300 aactagacaa aagcacacaa gcagagcaaa ttaatagttt attgttgcat atgggggcga 360 aagttacaaa gtttatggag gctaagcaga aacaggaaac cgattttcct agttacaaag 420 cgctgaaaga attctttgat aacgagttca aagacactcc taatacaatc tacgcacgag 480 ctaagtttaa cagacgtgac caaagggaag gtgaggacgc ccaaacgtac atcgctgacg 540 tcataagctt ggccaaaaca tgcaattaca gagatttgga ggaggaactc ataagagatc 600 gattagttgt tggaattcgt gatgaaaaat tgtcggaaaa tctacaaatg aatgacaaat 660 tgacacttaa aacagcaatt gacaaaataa ctcaagtgga gaggattaga accgaaaaca 720 gagaactaag agtgagagag gaaacggtaa acagagtagc tagagacaaa gccaagaaat 780 ttaaccataa atccgtgaaa tatgaaggaa aacagcgcag gagcaaagat ggaaagcaac 840 aaagtgaaca taaatttaaa ggttgtataa ggtgtggaaa ttcgaacaca cacaacagga 900 tggaatgccc agccaatgga caaacgtgcc ataaatgtaa ctttaaaggc cattttgcat 960 ccagatgtaa aacaaagaac gtaaacgcag tggaatctga aacctcaata tcagatgata 1020 ctgatgatag ctatgattcg agctatgatt gtagggagat tatcaatatt aatcgagtaa 1080 agttggacaa accttgggaa gcggtagcgg aaatttttaa tacaaacatt gtattcaaaa 1140 tagataccgg agccgatgaa accataatat ctagcaaaga attcaaagat aaacttcagg 1200 ggaaagtaaa gttacaaaga acaaaaacaa cactgttggg acccggaaaa ggaaaccaga 1260 gaaaattgga aataaaggga gagatagtag ttcctcttaa atgggaaaat aagatagaaa 1320 atgttagatg tttcgtcgtt gatacaccag ataacttact gggaagacca gccttacaaa 1380 aactagacat gatcaaatgg actggcaaag aactcgttgc gagtgacagt gttattagac 1440 atgtaactag cagaaaagct gatgaaaaat tccaacataa gatgatggtg aaatatccag 1500 aggtgttcaa aggcttggga actcttaaga attttgagta cacaattgaa gtaaaatcag 1560 acgcacaacc ttggacctta aacacgccta gaagaattcc actccccatg atggatgtag 1620 taaaaaagga gctggacaag atgattcgcg aagacgttat agaagccatt aatgaaccta 1680 ctgagtggtg tgcacctatg gttattgcag cgaaaaagaa cggaaaaatt cgcgtgtgtt 1740 cagactacac cgagctaaac aagtgcgtta atcgcgaact gtaccaatta ccgtcggtcg 1800 atgagacaat ttcgaaactg aaagacgcga agtggtttac caagttagac ttttccagtg 1860 gcttttggca gcttaaactt tcaccgcaat cgcgcaaata tacctgtttt ctcacaccat 1920 ttggcaggta tgtctacaaa cgcattccat ttgggatcac ctcagcacca gaagttttcc 1980 aaaaaaccat cagcggcgta atcggcaaat taaaaatgga acacgtccat gttcatgctg 2040 atgacatcct catcacaggc cgcaacgaag ctgaccatga cacaaacgtg aacaaagtgt 2100 taacactctt atcggatcac ggcttgactt taaacaaatc caaatgtgaa tttaaaacga 2160 acaagacctt atatttagga tacttaatat ccgaaaatgg tgttcaagcc gatcccaaaa 2220 gcgttgaagc cattaataat tatcccgcgc catcgtcggt aaccgaagtt agaactttcc 2280 tcggcatggt taactatgtc tcaaaattta tcgttaatac ttcggaaaaa aacccaatca 2340 ttacgtgaac tactgcacaa ggataaacag tttgtatgga cagataaaca tcaagcggat 2400 tttgaaaaac ttaaaagtga aatcgcatca tcgcgcgtgt tagccaagtt tagcacggac 2460 aaaaaatcgc gagtgtcggc agacgcatcg tcttttggtt taggtgccgt tttagaacag 2520 ttacaaagtg acggaaattg gaaacctact tatttttgtt cgcgtaccct cgaaccatgc 2580 gagcaagcct atgctcaaat tgaaaaggaa gctctcgcca ttacatgggc gtgtgaaaga 2640 ctcgaaaatt tcttactggg cgctcatttt gaaattcata ccgaccataa accgttaaca 2700 gtaattttgg ccaccaaaga gctcaacaaa ctatccaaca ggctacaaag atttcgcatg 2760 cggttattga aatttaacta tactatagaa tacgttccag gtaaaacgtt tttcgttccc 2820 gatgcattat ccagagcacc aatccaaggc agaattcgac accaagaccc acttctagaa 2880 gacattcaaa atctctacat caatttcgtg atgcgtgagg tcgttacaga aatttgttca 2940 acggaagaaa tcaagtctta ccaaaacaag gacccaacgt acagcaaact caaagaatac 3000 gtcaaaaatg gttggccaca gaaacacaaa agcgacccgt tagtcagtaa attcttctct 3060 catcaacaag acttaagcat cgctcatgac attgtttgct acaaagaccg agtcgtcatt 3120 cctggtcctc ttcagaagaa gtgcctctat gtgctccatg acggacactt cggtattaat 3180 aaatgtttgg accgggccaa agctaccgtg tggtggccag gtatcacaag agagcttaaa 3240 gaagtcgtcg ggaaatgtga aatttgcatc aaactacgca gaccaacggt agatcccatg 3300 ttaccatccg aagttccatc tagaccgtgg caaacaattg gtgccgacct cgcagaatat 3360 catgagtcaa cttacttagt ggtacaggac tattattcca agtatcctga aatcagaaaa 3420 cttcgcacca ccaaagccaa agttgttatc gagaccttca aagaaatgtt tgctagacat 3480 ggtatacccg aagttgtccg ctccgacaat ggcaaacaat ttgattgtca cgaatacaga 3540 actttcagca gaaaatatgg atttaagtta attaccagta gtccccatta ccaacagtca 3600 aatggacaag ccgaaagtgc tgtgaagctg ctcaaaacga tattgaggaa aaatgaagat 3660 ccatatcttg cacttcttac ctaccgcaat acacctacta aatgtggatt gtcacctgcc 3720 caactgctct ttggcagaaa ccttcgagac agactgccaa ggttaccttc agggttaaaa 3780 cctgcaacta tcgaccatga cactgtcaga caaacgttaa tcaacaatca agcagaacag 3840 aagtccaact acgacaaacg acaccgagta acaaaagaag agaaaaatct aaaagaaggt 3900 gatcgcgtat ggatcataaa tatgcaaaag gagggagaag tacaacgcag atgtgaagaa 3960 ccaaggtcat atttgatcga gactgacggc ggttgcgtca gacgaaatcg aagacatctt 4020 cagcccttac cagacagaaa taatgaaccc gaacatgacc cagaacaacc agacgaatca 4080 gaggaaccaa agaaaagacc aatcagaaga cccaaaagat accaagatta ttattgttat 4140 tgaaatatgt atattttgat acctcaggta gttagtttca tatttagctt agttttgtat 4200 ttaaaagggg gga 4213 // ID hATx-2B_SM repbase; DNA; INV; 2885 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-2B_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2885 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1854-1854 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 202..831 FT /product="hATx-2B_SM_2p" FT /translation="MSKEDENYSNIWKCFEKKDNVTAICNNCRKEILCKGG FT STSGLHRHLQHIHLKKRKANDDRDLEKLRKTRAPLTLDKFIQTNRPTVGEI FT LAKLAAKDGLTFSQISKSEFIHNSLKAQGFTISAQPRDIMKLIYDYYLNIK FT TKTIQKIKKMKLENKKASISIDEWTSIKTRRYLKAQGFTISAQPRDIMKLI FT FDYSRQVYPNQSSYSRRNIG" FT CDS 944..2455 FT /product="hATx-2B_SM_1p" FT /translation="MKLIYDYYLNIKTKTIQKIKKMKLENKKASISIDEWT FT SIKTRRYLNIHIFYSDGDSDNLGLITLLGSYTSEKLIQLVKEKLELFELDY FT EKDIVATTTDGASVMIKYGRLSPSESQLCYNHAIHLAVTSVFYTKKELLND FT FSENEELEDEIDENNDSEEDINDDDIIIIGADQFLLTSDPEFNILDIIAKI FT RKIVLLFKNSPVKNSILQQYIMDAEKKELSLLIDCRIRWNSLEVMVDRFLR FT VIDPIQKTLKDLNMSHLWGSEETKKAKCILNSLSPIKIVVEALSRKDANLL FT TGEGALKFLFNALQMDNSVLSLKLMNELKFQLTKRRNKPIVSLLKFLQDPK FT NLLEKDDKFFYSTSKCDIIKVAKNYMQKLFNNSDQNEMDFEQNDPYLESDI FT TIVKENNAYSNQNNLKKMLEESMQAILKTSQPAANQFTTLMREFNLYETTG FT ELTTNLRNLKDAMMTIKPTSTQNERNFSISGNIVSKKRSRLKDSSIDCLCF FT LKHHFMKK" XX SQ Sequence 2885 BP; 1149 A; 401 C; 438 G; 897 T; 0 other; cgggaattcc ctcaaaaaaa tttcccgaat cccgggaatt ccctaattga agaaaggtaa 60 ttccccgatc gacaataaaa agagtataaa agcagagaaa atcatacaga atttattaat 120 tttttgaata atttcaatta aagttcgatt ccagtttaca attagtagca taatttaaaa 180 acatttattt agccctttaa aatgagcaaa gaagacgaaa actatagtaa tatttggaag 240 tgttttgaga agaaggacaa tgtaactgct atatgtaata attgtagaaa ggagatactt 300 tgcaaaggtg gttctacaag tggcttacat cgtcatttgc aacatattca tttaaaaaaa 360 cgaaaagcaa atgatgatcg ggatttggaa aaattaagaa aaactagagc accacttact 420 ctcgacaagt ttatccaaac caatcgtcct acagtaggag aaatattggc taagttagca 480 gctaaagatg ggttgacttt ttcccaaata agcaaaagtg agtttattca taatagtttg 540 aaagctcaag gatttactat atcagcccaa ccaagagata taatgaagtt gatttacgat 600 tactacctca acattaaaac caaaaccatt caaaaaataa aaaaaatgaa gttggaaaat 660 aaaaaagcaa gtatttcaat cgatgagtgg accagtataa aaactcgtag atatttgaaa 720 gctcaaggat ttactatatc agcccaacca agagatataa tgaagttgat tttcgattac 780 tctcgacaag tttatccaaa ccaatcgtcc tacagtagga gaaatattgg ctaagttagc 840 agctaaagat gggttgactt tttcccaaat aagcaaaagt gagtttattc ataatagttt 900 gaaagctcaa ggatttacta tatcagccca accaagagat ataatgaagt tgatttacga 960 ttactacctc aacattaaaa ccaaaaccat tcaaaaaata aaaaaaatga agttggaaaa 1020 taaaaaagca agtatttcaa tcgatgagtg gaccagtata aaaactcgta gatatttgaa 1080 catacatatc ttctatagtg atggcgactc agataattta ggtttaataa cactactggg 1140 atcttataca tcggaaaaac taattcaatt ggttaaagaa aaactagaat tatttgaatt 1200 ggattatgaa aaagatattg tggcaacaac aactgatggg gcaagtgtga tgataaaata 1260 tggacgattg tcaccttccg aatctcaact ttgctacaat catgccattc atttagcggt 1320 cacatccgta ttctacacca aaaaggaatt actcaatgat ttttctgaaa acgaagagct 1380 tgaggatgaa attgatgaaa acaatgattc tgaggaagat ataaatgatg atgatattat 1440 tatcattggg gctgatcagt ttttgctaac ttctgatcct gaattcaata tactggacat 1500 aatcgcgaaa ataagaaaaa ttgtattgct ttttaaaaat tcaccagtaa agaattccat 1560 tttacagcaa tacataatgg atgcagaaaa gaaagagctt tctttactta tcgactgtcg 1620 tatacggtgg aacagtcttg aagtgatggt agatcgtttt ttgcgggtta tcgatccaat 1680 acaaaaaact ttaaaagatt taaatatgag ccatctttgg ggatcagagg aaacaaagaa 1740 agccaaatgt atattaaact ctttgagtcc tataaaaatt gttgtcgaag ctttaagtcg 1800 caaagatgct aatcttctta ctggtgaagg agcattgaaa tttttattta atgcacttca 1860 aatggataat tcagtgctca gcttaaaact tatgaatgaa cttaaatttc aattaaccaa 1920 aagaagaaat aaaccaatag tttcgttact aaaatttttg caagacccaa agaatttact 1980 agaaaaagat gataaatttt tttactcaac atcaaagtgc gatatcataa aagtggctaa 2040 aaattatatg caaaaattat tcaacaattc agatcaaaat gaaatggatt ttgaacaaaa 2100 tgacccttat ttggaatctg atattaccat tgttaaggaa aataatgctt acagtaatca 2160 aaataattta aaaaagatgt tagaagagtc aatgcaagca attttgaaaa catcacagcc 2220 agccgcaaat cagtttacaa ctttaatgag agaatttaac ttatatgaga cgactggaga 2280 gctgaccacc aatcttagaa atttaaagga tgcaatgatg acaattaaac ctacgtctac 2340 acaaaatgag cgtaattttt ctatttcagg taatatagta tcaaaaaaaa gaagcagatt 2400 gaaagattca tcaattgact gcttatgctt tttaaagcac cattttatga aaaaataaat 2460 ttttaaaatt attaattttg aattcaattt tgtaattata ttttttatgt agcactataa 2520 ttttatagat gtttaatact cttataattt aagattgaat ttttttatgt tatgtttatt 2580 tttatttttt aaacctaaaa tttatatttt gttacgataa aaaaaactca attgaataaa 2640 atcagcgcat ataattaata cattaaaata ctatatctca aacttttttc tgtatttcat 2700 atattaataa atttattgaa aaaaaattat attgatactt gttataaata taaaatatat 2760 aataactcta taaaaaatat aataatttta acttgtaaat acaataattt taacttgtaa 2820 atacaataat tttaataatc ccgggaattc tcgggattcc cgggatctga ggaaaaaaat 2880 tcccg 2885 // ID LIN14_SM repbase; DNA; INV; 4622 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Non-LTR retrotransposon; consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN14_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-4622 RA Jurka J.; RT "Non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1908-1908 (2009). XX DR [1] (Consensus) XX CC The 5' and 3' termini are approximate. ~93% identical to CC consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 668..1288 FT /product="LIN14_SM_1p" FT /translation="MEIDSKLIENAKNDKISFDESKNSISKSAVNNQITIE FT STXPTWKVNKSINKFKKINKNSNFLSRTTSEKYVRILTKNERQDLFKTFPY FT FIKLLKLKESPNKSKTINKELSFIAKSQNPLNIVDFNYLFYRNNNICVSIK FT FKNKSRTSQVANKHAQGIFIVGSSVVKTYNCKSKFVIAPLSIRKFLFALSS FT KFAETAGIKNEKQRGCK" FT CDS join(1324..2559,2563..4266) FT /product="LIN14_SM_2p" FT /translation="MLPDYLIKSEIKYCDGDFLMLDLLNNYDDSDQSSCVS FT SSELSRSQLRDLYVSFPELLELCVVENVSKNPAKLINLINELKLQKTFPEQ FT ISFVEGKFLINNQLSNSCDLQTGYVSDIMFNLGLLNFVDISKEHCEFESLL FT NESNSSTKVIDIIFYLNYLYLKNSLKYKINKTYGEYVQNYIRNCLLMYSVS FT NVESCSSFDLENIDILGAGMINLIPSSSYFNSSCKERILRFKQWKLSEEID FT KIHNIISSGTITKVNNFFDLLDKSFPNKLNQLKILENISTDINQWKSDLNN FT VDEVKAKRLVYFAVRDKKKFLLANKFSYRIPQYCRKICTDKMSTCIDKIID FT SVVQAVNVNTNNQQLDDPKADIEECVIKTTLDMSEDKEKTLANSLLVDIDE FT NVSSSIHTRSRKVPLSSKRQVPASVSISVPISEETDVECAKVEYIKPRDPD FT VWFTDDDIDQYLDRNICNPSFAHLQCFIVSILCSNLKEKIVSIPETVLKAE FT VIFCPINLNNTHWILFVYCKSLLESYFIDPILANKNLIKNXKATLKVNIAL FT NKIFELQVVPSCDTFQSLRYQENNFDCGPFICAYAILISKGLNTFPDSLID FT EIRQEVHEFKIDNSIKTPKTMGGGLTRNNLTEINFKKATAMTLKNYKRNTS FT NIINIKPQFQKSCVSSLLSKWFKSNTNILVIPLELTYNLIAQNNFYIIENV FT NFRQFKDIKHVISVIPNENYWCLFFYSISFRTCNIFDFRKTVISDRLIEIG FT GNISDYLNSFFLNMKIKFKSDHGISHRLFISDDPTFHASAFIVLLEKLILE FT RNFTHDQIYKHLDEIKTIECAEISLNLNLINGKITDNILTKYFNNLGLSSD FT FIFLGITMCTAILDDCTHYLNEHLNYECLQNAAVVFAIFAPPKSRETLIVI FT DYNTDEHYFLDPTTLDVSLNYIFIRKSLVTKINEIKNSHGRALRAGKCPHE FT VRGSGLLSRILTCAFIMHMMTLLRI" XX SQ Sequence 4622 BP; 1723 A; 678 C; 663 G; 1553 T; 5 other; aaccttgcgt aatagccttt aacaangcgt cctcaagttt cgtcaaattc attgcataac 60 agttcaccag atcaaatttg aaatttaatt ctaaagaatc tgttggaaat attttttcga 120 atgctaattt ttcgaatatg ccggatagca tgctgccact ttcggcgaag atgaaggagc 180 atgaagaata catcaatcag atgaattaca agttagattt tatattttct aaattaaatg 240 tttctatgat aatgatattt taaataattt naaatcagta aactttaata aatttgtaag 300 taattattct actaataaaa atgctttaca aaataagaat gttaataatc tacattcttt 360 ttataattct gagaaggttg ctgataaatc ttgccaatgt gactttaatt attttctcgt 420 agaagattct tcgattccca aaaaagtaga tccttcatta agcggagtga aatttaaatt 480 accatctctt tcattaccta cctttaataa gatttctttt cgaagtagtc attttcctga 540 gaatcattct tctactaatc ttgatgattt gagaatagtt gttgttgaca ataaaagaga 600 tgtttttgtt aataatttta acgataaata taattcagtt aatttaaata aaaatgatga 660 aataaatatg gagattgatt ctaagttaat tgaaaatgct aagaatgata aaatttcatt 720 tgatgaatct aaaaattcga tctcaaagtc cgctgtaaat aatcaaatca ccattgaatc 780 aaccaancct acctggaaag ttaataagtc tataaataaa tttaaaaaga ttaataaaaa 840 ctcaaatttc ctttctagaa ctacttctga aaaatatgtt agaattttga ctaaaaacga 900 gcgacaagat ttatttaaaa ctttcccgta ttttattaaa ttacttaaat tgaaagagtc 960 gccaaataaa tctaaaacta ttaacaaaga gttatctttt attgcaaaat cacaaaatcc 1020 tttaaacatt gttgatttta attatttatt ctatcgaaac aataatattt gtgtttcgat 1080 taaatttaaa aataaaagta gaacttctca agttgctaat aagcatgcac aaggaatatt 1140 tattgtggga tcaagtgttg tgaaaactta taattgtaaa tcaaagttcg taatagcgcc 1200 attatcaatt aggaaatttc tgtttgcttt atcgagcaaa tttgcagaaa cagctggaat 1260 taaaaatgaa aagcaaaggg ggtgcaaata aacaatgtgt taaaattact aaaaagacag 1320 aatatgcttc ctgactacct gataaaatca gaaatcaaat attgcgatgg agattttctt 1380 atgttagatc ttttgaataa ctatgatgac agtgatcaaa gttcatgtgt aagtagttcg 1440 gaattatccc gttctcaact ccgtgatcta tatgtttcat ttccagaact tctggaatta 1500 tgtgttgtag aaaatgtatc taaaaatcct gcaaaattaa ttaatttaat taatgaatta 1560 aaattacaaa aaacgtttcc tgaacaaatc tcatttgttg aaggtaaatt tctaattaac 1620 aatcaattaa gtaattcttg tgatttacaa actggatatg tttctgatat aatgtttaat 1680 cttggccttt tgaattttgt tgatatatct aaagaacact gtgaatttga atctcttttg 1740 aatgaatcaa attcttctac gaaagtgatt gatataatat tctatctaaa ttatctatat 1800 ttgaaaaaca gcttaaaata taaaataaac aaaacatacg gcgagtacgt ccaaaactat 1860 attcgtaatt gtttacttat gtactctgtt agcaacgtag agtcttgttc ttcgtttgat 1920 ttagaaaata ttgatatttt gggggccggg atgatcaact tgattccttc ttcttcctat 1980 ttcaattcta gttgtaagga gagaattcta agatttaagc aatggaaatt gtcagaagaa 2040 atagataaaa ttcataatat catttcatct ggaacaataa cgaaagtgaa caatttcttt 2100 gatttgttag ataaatcttt tcctaacaaa cttaatcaac tgaaaatact cgagaatata 2160 agtacagata tcaatcaatg gaaatctgat ttaaataatg ttgacgaagt aaaagcaaag 2220 cgtcttgtat attttgcagt tcgggacaaa aagaaatttt tactcgccaa caaattttca 2280 taccgtatcc ctcaatattg tcgtaaaata tgtacagata aaatgagcac atgcattgat 2340 aaaataattg attcagtggt acaagctgtg aatgtaaata caaataatca acaactcgat 2400 gatccaaaag ctgatattga agaatgtgtt attaagacta ctttagatat gtcagaagac 2460 aaagagaaaa cgttagcaaa tagtcttcta gtagacattg atgaaaatgt atcttcaagt 2520 atacatacca gaagcaggaa agtgccgcta agttccaaat gaaggcaagt gcctgcatca 2580 gtctcaatat ctgtgccaat ctctgaagag actgatgtag aatgcgctaa agtagaatat 2640 attaagcctc gtgatccgga tgtctggttc actgacgacg atattgatca atatcttgat 2700 cggaacatct gtaatccaag ctttgcacat ttacagtgct ttattgtcag tattctatgt 2760 tcaaacctta aggagaaaat cgtctcaatt ccagagacag tgctcaaagc tgaggttatc 2820 ttttgcccaa taaatttaaa taatacacat tggatcttat ttgtctactg taaatcattg 2880 ttggaatcat actttattga tccaatttta gcaaataaaa acctgataaa aaatnaaaaa 2940 gcaacgctta aggtcaacat tgctttaaac aaaatctttg aattacaagt agttccatct 3000 tgcgatactt tccaaagtct tcgatatcag gagaacaatt ttgattgtgg tcctttcatt 3060 tgcgcttacg caattcttat aagtaaaggg ctgaacactt ttcctgatag ccttattgat 3120 gaaataagac aagaggtgca tgaattcaaa atcgataact caattaaaac accaaaaact 3180 atgggtggag gtctcactag aaataatctg acagaaataa actttaaaaa ggcaactgca 3240 atgacactta aaaattataa aagaaacacc agtaacatta ttaatattaa accacaattt 3300 caaaaatcat gtgtctcatc tttattgagt aagtggttta aaagcaatac aaatatactc 3360 gttatccccc ttgaattaac atacaattta atagcgcaaa ataattttta tattattgaa 3420 aatgtgaact ttcgccaatt caaagatatt aaacatgtca ttagtgtgat tcctaatgaa 3480 aactactggt gtctattttt ctactcaatt agttttagga catgtaacat ttttgacttt 3540 agaaaaaccg ttatctctga tcggctaatt gaaataggag gaaatatttc agattaccta 3600 aacagttttt ttctaaatat gaaaataaaa tttaaaagtg atcatggtat ttctcaccgg 3660 ctcttcattt cagatgaccc aacgttccat gcctctgctt ttattgtttt attagagaaa 3720 ctgatattag aacgaaattt tacgcatgat caaatatata aacatcttga tgaaataaaa 3780 actatagaat gtgccgaaat ttctctaaat ttaaatttaa taaatggaaa aataaccgat 3840 aatattttaa ccaaatactt taataattta ggcttatcgt cagactttat ttttttgggt 3900 atcacaatgt gcactgctat tcttgatgat tgtacccatt atttaaatga acatttaaat 3960 tatgaatgcc ttcaaaatgc agcggttgta tttgccattt tcgctcctcc aaagtctcga 4020 gaaacactaa ttgtaataga ttataacact gacgagcatt atttcttgga tcctacnacc 4080 ctagatgtca gtctaaatta tatatttata cgtaaatctc tagtgaccaa aataaatgaa 4140 ataaaaaatt cacatggtcg tgcacttaga gctggcaaat gcccacatga ggttcgtggt 4200 tcaggactgc ttagtagaat tcttacatgt gctttcatca tgcacatgat gactctcttg 4260 agaatataaa tttgagagaa ataggaattg ttatcaactc tttacttccg attgtttctg 4320 agatcaacaa caaagataac gaaaaaacct taaataaaat caaagcaaaa gaaacgatta 4380 aatttagttt acaacagcgg aaaaataaaa tctttgaatt attaactatt ttgaaaaatg 4440 ctgatgttaa tctgattatt gaaagtattc ttcaacaatt tcctcattta aataatctca 4500 ctgaaaataa atttcatgaa ccttacctgg gtagcaaaca aaactcaaat aaaataagta 4560 aatataatag tagatccgaa tttctgatta acatgaaatt aactatttat aaaataataa 4620 at 4622 // ID GilM repbase; DNA; INV; 5242 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Giardia intestinalis non-LTR retrotransposon GilM, a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; GilM; endonuclease; KW reverse transcriptase. XX OS Giardia intestinalis OC Eukaryota; Diplomonadida; Hexamitidae; Giardiinae; Giardia. XX RN [1] RA Burke D.W., Malik S.H., Rich M.S. and Eickbush H.T.; RT "Ancient Lineages of Non-LTR Retrotransposons in the Primitive RT Eukaryote, Giardia lamblia."; RL Mol. Biol. Evol 19(5), 619-630 (2002). XX RN [2] RA Gentles A. and Jurka J.; RT "GilM non-LTR retrotransposon."; RL Direct Submission to Repbase Update (OCT-2004). XX DR [2] (Consensus) XX SQ Sequence 5242 BP; 758 A; 1990 C; 1501 G; 993 T; 0 other; ccatgcgttc cgccctttcc gcgccttcgg gaaccacgga gatcggggct tcctcaggcc 60 cagagcctcc caaccacgct ccaaccgact cccacaggcc cctacctagg ggcactcccg 120 atcctatgac gctgggtgac ggcaacgatg cctcgacgac cgccatggcc cggccttcct 180 cttgcgagcc cctctgcggc tcccacgggc ccccatcccc ccagcctggg cctggcaaca 240 gccgttccgc cacggacaac tctcccaatg agtcttctcc tgtccatggc ttacagggct 300 ccggggcctc ggtggcccgg gacgcctctc ctccctgcct gtcgaacgac cttctgccct 360 gtggccagga gcccctcccg attcccgacg acggcctccc atacgacctg ggccacgaca 420 actccccgtc catcacggac gagcaggagg cccggggcgc agacctgcct gctcctgggg 480 gccgtacgac tgtctgcgag cgctgcggag aggtgctgcc tgactctgcg gcgatcgatg 540 tccacatcgg gatgcaccat ccgcctcccg gttcaccttc tccgtcgcgc agcccttctc 600 cgctgtcggg gtctgcccct ccacctgcag ccgccgagcc ctctccctcg ccaacgccgg 660 accctgtgtt cccgtatgca tgcagcatgt gccgggcccg atacaaaacg gagagggggc 720 tgaaggccca tatcgccagg ctcggccact acctcccgct cgacgccaca gtccccgagc 780 gcatcggggt gcctagtggc agcccgtccc ctgagctgct tcgcctcctc actgacgtat 840 tcacagagat ggcggggcgc cttcctcctg tagaggccct agcgcgcttc ctctcgacgt 900 gccgccgcat ggagggctct ggacggctcc ccccggcgca gtcgcttctc cggaaggggc 960 tgctcggccg ggcatggcag gcgctgttga gcgagacgtc ctctgccgcc agggtccccg 1020 agccccaggg caaggagcgg gaagcgattg tccatgagct ccacccgcgc cccatgcctg 1080 tctctctgcc ccccgtccat cacgtgctcg gcccgtcgcc caagataacg gccaaggcgc 1140 tgctgaagga gctgcaggcg atgaggcctg tcgcggccgg accctctggg ctggggaagc 1200 ctcacctcct gcacctttgt ggggccgccg gggctgcgga gctcttcacc tctgtcctga 1260 cgaccctctt ctccagcagg aactgggccc agctccagcc cctgtgcgag ttcaggctga 1320 agcttctgcc caagtctggc ggccgatggc gccctatcgc agtgcaggag acgcttctgg 1380 tcgccttcca ccgcttgctc ctgcgtcgga ctcccgcgct ccgcaagctc ccggcgtggc 1440 agctggcctt tgagcacctc gcccagatga aggcgatccg cgcggccgag gagctgaaga 1500 ggacacacca cctgctcacg gtggatgtgc ggaatgcctt caacagcgtc ccgcattccg 1560 tcatcctctt tgcgctccgc cgggcagggg tggtccagcc gaccgtcgcg tacattgagt 1620 cgttcctcgc ggctcggcac tcctcggacc tccccgcggt cccagcgggc gtgccacagg 1680 gggacccgct gtccatggcg atgttctgcc agagccttgt ctggccggtg gagacgtacc 1740 tcggccagta caaggtcgtt gcatacgccg acgacctcgt cattgcgtca gaggaggcga 1800 ttcccataga taccgtgaag agtgacgcgc agtctgccct ctcccgcata gggctcacgg 1860 tcgagctctc gaagtgctcc tcgacccagg ccggggcgat ctccttcatg ggcacccgcg 1920 tgctcaagca ctcctccttc aacctcgcgc agacatcggc ccgccggctc cacgagcatc 1980 tcgccgtcct ccgtgcttcg ggtctctcac tccacgaccg cctcaggctc ctgtctgcct 2040 gcgtcgtccc tgcagttaac tatggacccc ttgttgacga ctacccgggc ccgtccccct 2100 atgccgacgt cgatgcgcag atagtggagg aggtcgcgac ccttctggag atccccgaac 2160 cccttgcgaa gacccttgct ctgacgcccc gcgcgaagta cggcctcgga ctggtgctgc 2220 cccatcacta ctacgacgag atgcacaggc agcgccagga catgaaggcg ggcgtcttcc 2280 gcgagctgag gaagaagcgc ctgcaggaca cggccgcgct ccgatccttc ctgccgctcg 2340 cattgctggg ctgcgcaccc ctggacaaca cgcaggtcct gttcataggg gactgcttgg 2400 cggggaggta ccagcggggc cggccgatgg gcacgtgctg ccactgcaag cagcccttcc 2460 ttcccaggca ccatctggtg tgcaaggcca ttaacgggat tcacgtggcg cggcacgaca 2520 agattctgga cgccctgctc gcgtgctccc gcggtcgcgc tgggtctgtt gtgcgcaatc 2580 ccacgatccc agtggatcac ctccagcctg accttgtcat cggtgggggc ttcggggact 2640 tggtggtcac tgtcccgtgg aggctggagc ggtcctatgc cctgaaggcc gccaagtatc 2700 gccccctcgt gctccagggg cgggcggccc acatcctccc cgtcgtggtc ggtgctgacg 2760 gcgtcctgca ccacctgtct gcggccggac tcgcgttcgc cggcgtggac cttgcgcgct 2820 tcatgcagga ggcggcgcag gtcatcctct ggcactacag gctgtcggcc ctcctgtacg 2880 ctgggctgcg agtggagagg ccggtgcacc gccccgcccc tgcagtatcc ctgccggaac 2940 cggctcccaa cgaggctcgg gctactcctc ctactcccct cacccctgcc atgtggagcc 3000 ctggcacgtc gccctccgta gagatcatgg ccgatctcag ccagcatcct gatgcggcag 3060 tccctcctcc ctggggtcct gctgaggtct cctcctcgat cacctggggc accgatcacc 3120 ccgcacagcc ggatgcaaac gaacgcccag accccttcat gtgctcaagg gccccctccc 3180 cgcctccact ggatgacgac agccccgacg atgcaggagc acccgagccg gtgcacaagg 3240 ccctgccgtc cttcttcaag cgcgtcggtc ctcccaagcc gcatgcccct gatggcttct 3300 accccttcaa acgcatcgac acggggcgct agtggtcatg cttgggcacc ctgcccgctg 3360 tcggctgccc cacagcgcga acggacccct gggcctgcgt gccgagctgt ctcccctggt 3420 actcctcgct cgcacccgag gcctcttgcc cctatcattc gggtgtgcat cacgtccaga 3480 cacggtgcga tagatgccta gccgcgatgc catacacact cgaccaggcc gggccaggcc 3540 ggacgccccc ctggatggcc agacacccgc cctgcctacg tttgggcact tcctcgatgg 3600 gactcgtcat ctgcccacgc cgcagcccgg tgttcgtctg tctgtcctcc cgggcagcgg 3660 cggtccttgg gtgctgagtc tggcgagctg gtgctggggg aagaggctag ccccgtgtgt 3720 cattcgtacc atcgcaccgc gccgtgctga tccccccatg gaactgcccg ccagtggccg 3780 cctccggtcc ggggcggcac agccctggct cgggccatcc tctgctgcag cgccccggtg 3840 ccccctggag aggatggccg ggtgctatgt gctactgagc cctcgggtgc atcgcttccc 3900 cactgccccc cctcatgtac ccactctggg cgccccggta cagggaacct ggtcactctg 3960 gccctggcct ccaggtgctt gccctccaat gggccatccc tcatgcgcct gtgtttgcct 4020 ggcccattct cccctgctcc tctctacctc tcctgctcgg tctccgctga gtcgcggtga 4080 cggctgttga cagtcccggc cgggccccca ggtttcgctc catacctggg ccacatgcct 4140 cccaccagcg ctggcaccac atgggttgct tgtcacagat gggactggca cgcacagaag 4200 aatatggctc cacggcgctc tgccggtctg tcctgagctg gcgtcatcag cctggacccg 4260 cccttagggg gcctcctcca tcacccgtgc ctccccacac cacctcctaa gccgtcagtc 4320 tgctctctca ggcgtgacct tctgtcggcc tcctgggccg tatgggacca ccgccccgct 4380 gcccccgcca tcttggccag acgcctccgg tgagccacca ttctggaata gcacgacacg 4440 tgtggatggg ctagctatgg cgcacggtcc gcaccctccc ttctcacgga ctcccgggga 4500 cagtccggac gcgtctccct tgtcccccga atcggtggcc ccggccgatc cccatcgtgc 4560 atccttatat ggggcactgc acacacctgt ccacgctgtc cccttgggcc ggccacagct 4620 caccccgtgt gtcgatgggc cagaatttgg gctccccaaa cccctccacc cccaagctgt 4680 ttcctcgccg gttctctggg ctcgcgcttc cctttttagt gcaacagcat tatggtgcct 4740 cgcgtatgtg cgcggtgcag tgggccggtc tgtaggcctg cctttgactg gaatcctgga 4800 tccctgggtt tctggcccga cgcgtcgtct tctacccccc aaacaggcct gcactcttct 4860 tggtccacgg gctatttgtc tctttgtttc cgcggcccag ggcctgctgg ttgcgccggg 4920 gcgacgctca tggtggattt tattttttcc aaaatccaga atttgggcct gaagaccgcc 4980 cccgcaggcc tggaggatgc ggggaatcgg tccacccaca ccatgcgctc gagcaaaggc 5040 caagcaccca cgccatcacg tctcatccag gactcctgcg ccgccacgga ggaggatcgg 5100 caccttcgcc atcgatcctc caggatgccc cctaccactc cccctcacat ggcacgcctc 5160 cctcgggcct gtatgtctgc atgccgctat gccctaacac cgcagccgcc ctctccgcgg 5220 atccactaac acacagcctg ca 5242 // ID Chapaev-N5_AAe repbase; DNA; INV; 2588 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 23-DEC-2010 (Rel. 16.03, Last updated, Version -1) XX DE A non-autonomous Chapaev DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; nonautonomous; KW Chapaev-N5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2588 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 835-835 (2011). XX DR [2] (Consensus) XX CC >94% identical to consensus. 4-bp TSDs. TIRs are ~140 bp long CC and ~80% identical to those of Chapaev-1_AA. XX SQ Sequence 2588 BP; 820 A; 465 C; 458 G; 844 T; 1 other; cacggtgtgt ttcgtagagc aatttcaaaa caaaaaattc ggcatgtctg aaattttgac 60 acaagaaagc catgattctc ccctttcatt tgcaaccaaa acgaaaataa tcggtcgggg 120 ggtctagaac attttttttt ttcaattttt ttacggagtt tgggatataa cttaacctta 180 gtaaggacct tgggtcgttt tcgacccatt tcaaaattca aatcattgta acttttgatt 240 aaaataacct agcaatttga cgttttctga caattaggta ttttttgtgg gaacatttat 300 ttttgcggaa acaaaagttt tgaatggccc cttagggagc ttccataaaa taaaagttac 360 aaattgtgac ttagggtcgt tttcgacccg aaaattcaaa cagctcgaaa aaatcagtgt 420 gttgaccgaa tttagttctg ttgggctgaa atgaaaggca cacatgtcga ttttcggaaa 480 cccatccgga gatccggaat tgcgtcactg tggccaccgg gaacctgcta catctgaaaa 540 aaaagtccct ttagagaccc ttactttgac aacctgtagt ttcgtaacta aacaagtaat 600 ctgaaccgtc ctcatattgt tgaataggta tttacgtgga ctgtaaatag agattctcaa 660 atcatttagt tattcatacc cggttccgga aacccggaac atccgaaaag taagtttcca 720 tcgcagtttt gtgtatgtaa ttcacatcgg tactattcgg ttgatggata ttttgtcata 780 aaatttctct gtaataagga accaattacc ggtgcacatc ccttgaaaaa ctcggtccag 840 tatggtccat gcggaaccgg ttcctggaga cccggaaggt agccaatctg gacaattcga 900 tgaaattact cggattgatg ccaaaaaacc aaatgaattg tgtccaactc ataacgattt 960 attatgtttg aatgtatttt gccaaaagaa tgaattgatg gttattctgg tccttttgga 1020 gcaccgaaat gaccaccgga aaacccgtaa tatgggacca ctaagtttta gcaccaaaac 1080 taggcatgcg acggctcaaa cttcatgatt ttgaatccct gtgactcttc taattcaatt 1140 atagcgccaa ctcgaacgtc ttttgatgga agattagcat caaaagtcgt tcgagctggc 1200 gcttttataa gttttgaaca caacaaagct atacgaaaca tagtacttga gatctgctgg 1260 atataaatag caaacaagtc tgtatgcact aaagatcacc gtttttggct ccttttttgg 1320 tgtaatcgga gacttttatg attcacctga tatttttgtt aaagcttcgt atgatttcgt 1380 tattaagcta gcaaaatcgt aaagttactg agtaacattt tcaaagctga ttgaatgtat 1440 ttgttttatg aattactata gaattccgta caactctaat aaaactatgc taaaactgaa 1500 cattacagca cttattgaga tctctaagca aaatgttacg atcacacgat caatctttca 1560 caggtagatt tgttatgaat gaaacctttc gatgtttgta tgaattcatt tgcacttttt 1620 ctaccatcaa tttaaaaagg aagtgtatat tcgtcagcat tattattaat attcttatct 1680 ttatttacga gattttcagc cctagactgg ctgctaatct gagaatattt gtcagcaatt 1740 ctgtttgttt taccatcgca aacaaaagca atgttctgtc aattctcaca acaaacacat 1800 ggacaatagt gtcaacattg cattgctatt gcaattgggt tggaatattg gatttatgaa 1860 atgttatatt gcgtctgtca ttggcacaaa tgtgaacttg tcaattttct agaatctatg 1920 acataaggtc gaaagggcaa aagatcgaag aggaataaag aggggataaa aggtcaaaaa 1980 tcttctttca aaaatgattt aatttcccat aatgcaaatc actttcaacc tttggtcctt 2040 ttgacctttt gcattttcga ccatttgtct tgtcttcctt ttgtctttcg acttattgtc 2100 ctttcgacct tttgtctttc gatcgcttgt tcctaaacca agtcagaaag atgaatattg 2160 ctcttgagtt acaaataaaa attgaaaaga ggttgtttga catacttagt tttaaaagca 2220 atgcaaattt caacaatgga tgaaagcaaa agagaggcaa ctcgaaaatt taaatttcgg 2280 cctcggcctg taaattttta ggtctagagc tgtctagaat ataatctaca agatattgac 2340 actagatttt tgcaacattt tcatctaaat ccgtttsaac agttcacctc accaaaattt 2400 attaatttac ggtgcattta aaatgatatt acaaacaaca aaatttcaaa aaaaaaattg 2460 cactagaccc cccgatggat tattttcatt ttagcagcaa atgaaagagg aaggtcttgt 2520 ctttcattgg tccaaatttg agacatgccg atttttttgt tataaagtta catgaaaaag 2580 acaccgtg 2588 // ID I-69_AAe repbase; DNA; INV; 7131 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-69_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7131 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1340-1340 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 2924..7060 FT /product="I-69_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MVLQPLRIMDQAHDIEHLSTHTTTDLAMPSEPTTVVH FT PYCEVTVPREVACNHNAQPSKPLTSNWNSTLPEFVKSNSTAYDRFRHITTH FT PSTSRKPHRTKTDSPSARPSMSTLTRDVPDLSDEPLAAVDMDPCQTWASTP FT PSLEPHSSANHTFESRFSDCYRISCSRGPSPESRASSNETSINENSKKETL FT FLQWNVRGFWCNQAELIKLVDCELPLAIGLQEMMTRSLTNTIKNRYNWILA FT DRHYNQGGGAAALGILNEIPHQLHQMSSSIPTCVARLNAPYNLTIVSIYVP FT PNAEDEEVVSTLDNLITNHQPPFVIGGDFNAAHEAWGSTKSSKRGWLLLEW FT FVNHQLVVLNNGEPTFISSAHGTTSAIDLTVASQSLASKLYWSIEKDTYGS FT DHFPIEIRLQSNSPKPLCRKRWLYKDADWAAFEQKILENFSSEEPLPIEQL FT GEKIIDAANSSIPKTTGVYRGKSQIWWTEEVQQRIKARRKSLRKLKKLSNN FT DPQLEAARKEFQEARSLARKAIYKAKKESWDSFCQSFNPNTPSDILWNNFH FT RLNGQRKTVQRGLTIDGKHVQDPTQIAEHFADYFYSTSSAKEHQSTEPTQP FT IPARVENSNLDSDFTLQELIRAIDAAKGHSTGCDNVGYPMIRRLPLAGKLA FT MLRSYNSVWSAGRFPECWKQGLVVPIPKPGANLQNADGYRPITLLSCIGKI FT YERMVNHRLMTYLEENQVLNDHQHAFRAGRGTSSYFAELREIIANAEQSGS FT HVEFAILDIRKAYDQTWRPNILKQIEKLNIGKCMRSCIMNFLENRRFLVSY FT GGITSTERIQESGVPQGSVLAVTLFLLAINSVFDAVPRNIRTLVYADDIIL FT ISISKHLPRVRSSLKKAVEAVDIWAKSVNFSLSAPKSCILHCCKKRRHRWR FT NSRLHIKIDNESIPEVNVARFLGVWINRRGEFSAHGAKIKESLQNRIQYLK FT AIAPKADRATIGRIAEATCISKLFYGIELFGLDLCNTFQTTYNQIVRITSG FT AVQSSPVMTLVVEAGELPLKLRMSEILIRRLCRLEEKSTSHPYPHLREAAN FT SALFAETGEQIPNIAKLHRDTIRPWHNKTIKVDWTIKHGFKKGQNSMNATA FT LVHELLHTKYDKHDKFYTDGSKCDNLVGLGVVGRGITVEKSLPFQCSVYSA FT EAAALHTAVCLADRSPTVILSDSASCLTALNKGKSKHPFIQAVEREALNKD FT VTFCWIPGHSGISGNEDADHAAHRGRISHLAQVEIPAIDVINWAHHLFQLA FT FQAAWESHRPTFLKRCKTTTKKWNDREDRYEQRVLTRLRIGHTRLTKEHLY FT NKLVSKNCDVCHTELSVEHIVSNCIKYDNIREELGINSDLKIALGNNKHEE FT VKILKFFKKCKLYDKL" FT CDS join(377..1729,1726..2961) FT /product="I-69_AAe_1p" FT /translation="MTGSSSGPPGGPRHEPPGSSRQPYWMRSADDIGTMVL FT LLRRRSSRTNDDSTKSNADRNEQLPLPNPFIVGTSIELAIGSKVDATREGR FT GTRYLLRTSSKSVFNKLTKITELTDGTQVEIISHPTLNTVQGTVYDPDSKD FT MSEEEILGYLSSQGVHAVRRVKKRVNNVQQNTPLLVLSFHGTQLPKFVYFG FT LLRIPVKVYYPYPMICFNCGAYGHSKKNCQNCDICLHCSQSHEQNEGEKCN FT NAPHCLHCNKGHAITSRDCPKYKSEQKIIQIKVDQGVSFPEARRIYADELK FT KETFASVVQDRISNEMAVKDQMITALQKQVAVLTKELADLKTAIRSRAHSQ FT SPAPRNTKTSNARISASQPSSVQNPSASNGSKADSPSKNANARLSRKDKCF FT ISPPAKKSNCNDANTYDLRNRSKSGKRSMDISPTADSNLGKRFSGQQTRND FT LISPNTHMMDDEQKIDFSTTDHAMYLELIMEEQSLDGIDYSETPMENDFGH FT RLEPVQSNILLQSERFEGEQSETPISLPKQAASSSGQMVWNVNSLPADYQA FT YNLTASEIRTHLLNSVKLNHVFNQHVRCVITLSSARWDKYPAPTDSPSARP FT SMSTLTRDVPDFSDEPLAAVDMDLCHSRASIISFQKFLSATHTTPVSLFRN FT LKNIQRPSGILGTTPTIIQSIVLQPLRIIGQAHNIEHLSTHTTTDLAMPAE FT PTTVVQPYVTNCGVTLTRGGAXNHNARPSKPLINNWNSTLPESVKSNSTAF FT DRFRRITTHPSTSRRPHRTTSDSPSARPSMSTLTRDVPDLSDEPLAAVDMD FT LCHSRASIISFQKFLSATHTTPVSLFRNLKNIQRPFGILGIRGTIKPEYGS FT TTSTNNGPST" XX SQ Sequence 7131 BP; 2276 A; 1698 C; 1452 G; 1704 T; 1 other; cattcgagtg ttcgggatat tgttgttata caacacgcgc gagtcactct actttgcaac 60 cctcttttag attataaata cttcgttaaa atttaccaaa cttcggttat acaagaagtg 120 tcaccagaga tttcattata aacttgaaat tggtggaaaa acagtgatat agtgccgaat 180 aacacagaaa tagtggtcat atcacgcggc gcatgaacaa taccaccaca aagactatac 240 acagtgtgca gtgaaaaggt gaatcaaccc gagtggtgga ttgtactaat caacgtacaa 300 tctcgatatc gcacgtactt ttgaaacgga gaaaaaaaag caactttgaa acaaactgtt 360 gctaggctcc ggtcttatga ccggaagttc ttcaggccct cctggggggc ccaggcatga 420 accaccagga agcagcagac aaccctattg gatgaggagt gcggacgaca taggtactat 480 ggtgttgctg ctacggcgca gatcaagcag aacaaacgat gacagcacga aaagcaatgc 540 agatcggaat gagcaattgc ctcttccgaa cccctttatt gtcggaacat ccattgagct 600 ggcaatcgga tcgaaagttg atgctacgcg tgaagggcga ggaacgcggt atcttctccg 660 cacctcctct aagtctgtat tcaataaact gacgaagata acagagctca ccgatggaac 720 tcaagttgag atcatttcgc atcccacatt gaatacagta caaggcacag tgtacgatcc 780 agattcgaag gacatgagcg aagaagagat tttaggatat cttagttcgc aaggagtaca 840 cgcagtccgg agagttaaga aacgtgttaa caatgttcaa caaaatacac ctctgctagt 900 tctgtcgttt catggcactc agctacccaa gttcgtgtat tttgggcttc tacgtatacc 960 ggtaaaggtc tattacccat acccaatgat ctgcttcaac tgcggtgcat atggtcattc 1020 caagaagaat tgccaaaatt gcgatatttg cctgcattgc tctcagtcgc atgagcaaaa 1080 tgaaggggaa aaatgtaata acgctccaca ttgtttacac tgcaataaag gacatgcaat 1140 aacttcacga gattgtccga aatacaagtc tgagcaaaag ataattcaga tcaaagtcga 1200 tcaaggtgta tcgttccctg aagccagaag aatttacgca gacgaactta agaaagaaac 1260 atttgccagt gttgtccaag atcgtattag caacgagatg gccgtgaaag atcaaatgat 1320 cacagctttg caaaaacaag tagcggtcct caccaaagaa ctcgcagacc tcaaaaccgc 1380 tatcaggtct agagctcata gccaatctcc ggctccaagg aacaccaaaa catcaaatgc 1440 tcgaatttca gcatctcaac catcatcagt tcaaaatccc tccgcttcga atggatcaaa 1500 agctgattca ccctctaaaa atgcaaacgc tagactatca cgaaaagata agtgcttcat 1560 ctccccacca gcaaagaaat ctaattgcaa cgacgccaac acttatgacc ttcgaaaccg 1620 tagcaaaagt ggaaaacgtt ctatggatat atcccctacc gctgatagca acctgggtaa 1680 acgcttttca ggtcaacaga ccagaaatga cctcatcagt cctaacacat gatggatgac 1740 gaacaaaaaa tagacttttc taccacagac cacgcaatgt acctggaact gataatggaa 1800 gaacaatcat tagacggcat agactatagc gaaaccccca tggaaaatga ttttggacac 1860 cgattggaac cagtacaaag caacatactt ctgcaatcag agcggtttga aggagagcaa 1920 agcgaaacgc ctatttcgct tccgaaacag gcagcgagtt cgtctggaca gatggtttgg 1980 aacgtcaaca gtcttcccgc tgattatcaa gcttacaact taactgcttc tgaaataaga 2040 acccatctat tgaactccgt taagctgaac catgtattta atcaacacgt tcgatgcgtc 2100 attactctgt catctgctcg ctgggataaa tacccggcac ctactgattc gccaagcgcg 2160 aggccttcca tgtcgaccct gaccagggat gttccggatt tttccgatga gcccctggcg 2220 gcagtcgaca tggacctctg ccactctcgg gcaagtatca tttcatttca aaagttttta 2280 tctgcaacac acactacccc cgtgtcgctt ttcagaaatc tgaaaaatat tcaacgtcca 2340 tctggcatct tgggtacaac acccacaata atccagagta ttgttctaca acctctacga 2400 ataattggcc aagcgcataa catcgaacat ctttcaaccc atacaactac cgaccttgcg 2460 atgcccgccg aaccgacaac ggttgttcaa ccctatgtta ctaattgtgg agtaacgctt 2520 acgcggggag gggccagmaa tcataacgcc cgtccatcca agccgttgat caacaactgg 2580 aactcaactc tacctgaatc ggtgaaatcg aactctactg ccttcgatcg gtttcgtcgc 2640 ataaccacgc atccatctac aagccggaga ccacatcgaa caacatctga ttcgccaagc 2700 gcgaggcctt ccatgtcgac cctgaccagg gatgttccgg atttatccga tgagcctctg 2760 gcggcagtcg acatggacct ctgccactct cgggcaagta tcatttcatt tcaaaagttt 2820 ttatctgcaa cacacactac ccccgtgtcg cttttcagaa atctgaaaaa tattcaacgt 2880 ccatttggca tcttgggtat aagaggcaca atcaaaccag agtatggttc tacaacctct 2940 acgaataatg gaccaagcac atgacatcga acatctttca acccatacaa ctaccgacct 3000 tgcgatgccc tccgaaccga caacggttgt tcatccctat tgtgaagtaa cggtaccgcg 3060 ggaagtggcc tgcaatcata acgcccaacc atccaagccg ttgaccagca actggaactc 3120 aactctacct gaattcgtga aatcaaactc tactgcctac gatcggtttc gtcacataac 3180 cacgcatcca tctacaagcc ggaaaccaca tcggactaaa actgattcgc caagtgcgag 3240 gccttccatg tcgaccctga ccagggatgt tccggattta tccgatgagc ctctggcggc 3300 agtcgacatg gacccttgcc aaacctgggc aagtactcct ccatctttag aacctcatag 3360 ttctgcaaac catactttcg agtcacgttt ttcagattgt taccgaatat cttgcagtcg 3420 tggaccctca ccagaaagca gagcatcttc aaacgaaacg agcatcaacg aaaactcgaa 3480 gaaggaaacg ctgtttttgc aatggaatgt tcgtggcttc tggtgcaatc aagcagagct 3540 aataaagctt gtagactgcg aactacctct tgcaatagga ctgcaagaaa tgatgactag 3600 atcgctaacc aatactatta aaaatcggta caactggata cttgctgatc gtcattataa 3660 tcaaggaggc ggagctgcag cgcttggaat actgaatgaa attcctcacc aattgcatca 3720 aatgagctcg tccatcccta cctgcgtcgc cagattgaat gcaccgtaca acctcactat 3780 tgtctcaata tacgttccac ctaatgcaga ggatgaagaa gtagtcagca cgctggacaa 3840 tctcataaca aatcaccaac caccattcgt tattggagga gactttaatg cagcgcatga 3900 ggcttggggc agtacaaaat catcaaagcg agggtggtta ctgttggagt ggttcgtcaa 3960 ccatcagctt gtcgtcttaa acaatggaga acctacattt attagctcag cgcatggcac 4020 cacctcagcc atagacctta ctgtggcttc tcaaagtctg gcaagcaaac tctactggtc 4080 gatagagaaa gacacttacg gaagcgacca cttccccatc gaaatacgtc tccaaagcaa 4140 ctccccgaaa ccattatgcc gtaagcgatg gctttacaaa gatgctgact gggctgcttt 4200 tgaacaaaaa attctggaga atttttcaag tgaagagcct ttaccgatag aacaactagg 4260 agaaaaaata attgacgccg ccaattcttc tatcccgaaa acaacaggag tctatagagg 4320 taaatcccaa atctggtgga cggaggaagt acaacagaga attaaagcac gtcgaaaatc 4380 tctcagaaag ctgaagaaac tttcaaacaa tgacccacaa ctagaagcag ctcgcaaaga 4440 attccaggaa gcgcgatcgt tagctagaaa agcaatatac aaagctaaga aagagtcatg 4500 ggatagcttt tgtcaatcgt tcaatccaaa tacaccttcg gatatcctat ggaataactt 4560 ccatcgccta aacggccaaa ggaaaacagt gcaaagaggc ctaacgatcg atgggaaaca 4620 cgtgcaagac ccaactcaaa tagctgagca ctttgcagac tacttctatt caacatcttc 4680 agcaaaagaa caccaatcaa ctgaacccac tcagcccatt ccggcacgcg tagaaaattc 4740 aaacctggac agcgacttca ccttacagga gcttatccgc gccatagacg ccgcaaaagg 4800 acattctacg ggttgtgata atgtaggtta ccctatgata cgacgccttc cactagcagg 4860 gaagttagca atgcttcgaa gttataattc cgtctggtca gctggacgat ttccagaatg 4920 ctggaaacaa ggattagttg ttcccattcc taagccaggg gcaaatctgc aaaatgccga 4980 tggctaccga ccaataacac tgcttagctg tatagggaag atctatgagc gaatggtcaa 5040 tcataggtta atgacctatt tggaagagaa ccaggtttta aatgatcatc aacacgcttt 5100 tagagccggc cgtggtactt cttcctattt tgcagaacta agggaaatta tagcaaatgc 5160 tgagcaatcc ggctcacatg tcgaatttgc aattctagac attagaaaag cgtacgacca 5220 aacctggagg ccgaacattc tgaaacagat tgagaagctg aatattggca aatgcatgcg 5280 cagctgtata atgaacttcc tcgaaaacag acggtttctt gttagttatg gtggaattac 5340 atcaacagag cgtattcaag aaagcggggt tcctcaaggc tcagtactcg ctgtaacctt 5400 atttttacta gccatcaatt cagttttcga tgctgtaccc agaaatatcc gcacacttgt 5460 gtacgcagat gacatcatat tgatatcgat atcaaagcac cttccaaggg ttcgaagctc 5520 cctgaaaaag gcagtagaag ccgtagacat atgggcaaaa agtgtgaact tctctctgtc 5580 agctccaaaa tcctgtattc tacactgttg taaaaaacgt agacatagat ggcgtaactc 5640 tcgtctgcat atcaaaattg acaatgaaag tatccctgaa gtgaacgttg ctcgatttct 5700 cggtgtatgg atcaatcgac ggggtgaatt ctccgctcat ggagcgaaaa tcaaggaatc 5760 cctgcaaaat cgcatacaat acttgaaagc tatagcgcca aaagctgaca gagcaacgat 5820 cgggcgaatt gctgaggcca cctgcatttc caagcttttt tacggaatag agctatttgg 5880 tttggatcta tgtaacacgt tccaaacgac atataaccaa attgtgagaa tcacctccgg 5940 agcagtacag tcatctcccg ttatgactct agtcgtcgaa gctggtgaac taccgcttaa 6000 attgcgaatg tctgaaattc ttatccgaag actttgtaga ttagaagaaa aatcgacatc 6060 tcatccctat ccacatcttc gagaagcagc aaatagtgca ctctttgcag aaacagggga 6120 acaaattccc aacatagcca aattacaccg tgatacaatc agaccatggc acaacaaaac 6180 catcaaagtt gattggacta tcaagcatgg gtttaaaaaa ggacagaact ccatgaacgc 6240 aaccgctttg gtgcatgaac ttctgcacac aaaatatgac aaacatgata aattctatac 6300 ggatggctca aaatgcgaca atttggttgg tctcggagtt gtgggaagag gcataactgt 6360 tgaaaaaagt ttaccctttc aatgtagcgt atattccgcc gaagcagccg cgttacacac 6420 cgccgtctgt ttagctgatc gctcgccaac tgttatttta tccgactccg caagttgttt 6480 gacagctttg aataaaggaa aatcaaagca tccgtttata caagcagttg aaagagaagc 6540 tctcaataag gacgttactt tctgctggat tccaggacat tccggtatta gcgggaacga 6600 agatgctgat catgcggccc atcgaggaag aatatcccat ctagcacaag ttgagattcc 6660 tgccattgat gttatcaact gggcccacca tctttttcag cttgcgttcc aagcagcctg 6720 ggaatcacac cgcccaacgt tcttaaaacg atgtaaaaca acaaccaaga agtggaacga 6780 tagagaagat cgatatgagc aacgcgtctt gactcgactt agaatcggtc acactaggtt 6840 gacaaaagag catttgtata acaaattagt ttcgaaaaat tgtgacgttt gccacactga 6900 actttcagtc gaacatatag tatcaaattg cataaaatac gacaatatcc gtgaagaact 6960 aggtatcaat agtgacttaa aaatagcatt aggaaacaac aagcatgaag aagtcaagat 7020 actgaaattc ttcaaaaaat gtaaattata tgataagttg taaacacaat atcttctaaa 7080 gaggtgaatg aaccagttgg tttaaaacct ctataaataa acaaaaaaaa a 7131 // ID BEL-10_CQ-I repbase; DNA; INV; 4394 BP. XX AC AAWU01000654; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_CQ_; KW BEL-10_CQ-LTR; BEL-10_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4394 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 173-173 (2011). XX DR GenBank; AAWU01000654; Positions 77405 73012. XX CC 'ATACT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 47..2155 FT /product="BEL-10_CQ-I_1p" FT /translation="MESITAKRNSLFAKVKRELETAERVKTQNPSLSEVRE FT RLVRLEAQGNSFFDVQDQFEDGTSATTLENLVSVFDYRKEFEDRYYAAKAI FT YAALDEGSGASDRSFAEPANNLKTAVAALLETQRALLSNQAAHAAQLNQIQ FT QGNQPPVGQPAQAQPDPFINVRLPPINIPKYAGVRKEWLSFKDLFVSTIHS FT KESLRPSQKLQYLKSLLEGDAASQISSFGISDDNYPLAWDKLLRKYDQKKY FT TVFALVKEFLDQPVVSDATAGNLQKLVTTSDEVVRQLDVLGEQYQSRDPWL FT IQLLLEKIDEETQALWAQKLVTLENPSLTDFLKFLEDRCDAIETCSSFTRK FT CTLSGAGAKKEIRKQPAKPAEKKVQSYVATPQPCPKCSKEHLLFLCEAFKA FT SSVADRRELVQKSRLCFNCLRTSHTAKTCSSKHTCKTDGCNQRHHTLLCQH FT GGRQAATAGLPGQQIAAMPVQQPAMEAQPTVPSFKAEVTSDPAVKITVLPT FT ALVKLRGKNGVFHTARAMIDSCSGASLISEACMTRLGISRSNARFPVTGVA FT GTQAGITRGMALLEIAPRFSDEVVMKTQAYVLEVLAPPTPCQSFKPKQLEL FT LRGLQLADPEYYKAGPVDVILGAELFLPILQAGQVTDEDGLPVAQKSSLGW FT LVAGKFGGETVLQTNLVSFTVQLDVNIDQTLRKFLGNRRSACGQGTHCRRE FT ARC" FT CDS 2064..4394 FT /product="BEL-10_CQ-I_2p" FT /translation="MSTSTRRCASFWETEEVPAVKVLTADEKRAVDIFNTT FT TRRNDEGRFVVRLPFDDSKPALGDSLGAALRRLHAMERKFLSNPPFKQAYR FT DFMSEYQQLGHMELIPQSEVDKHPSECFYLPHHAVQKEDSSTTKLRVVFDG FT SCRTSSGVSLNDRLLIGPNNNEDLYDVYCRFRTYPVVFVSDIEKMYRQVRV FT DKQDTDFQRIVWRDDPDQPVQHYRLTTVTYGTSSAAYLATESLRQAARDNA FT EKHPVAADRILKGFYVDDLMSGADTVEEAQELAGQITTILGEAGFVLRKWS FT SNAPELLENIADSRQGPVPVEFTNVTASVKALGIHWSPTSDWFEFKVNLDI FT NSPNTKRQLLSDASRLFDPFGWLAPVIVKIKIIFQLLWLYDLLWDDPLPPL FT PEADWNTIKQTLHLLERIRVPRRIPAFNGKLQLLGFSDASESAYSAVVYGR FT SADRNGKIHIVLICAKTKVAPIQQICVPKLELNGSWLLAALMKRVSLALCH FT LELQHFAFTDSAIVLEWLSAHPRKWKTFVANRTSAILDFLPRSMWHHVSSK FT DNPADCASRGISPLELLNHPLWWDSPYWAHSEQSFLEHCAAPAPKAPPLEE FT RQIRTFCLEIGSPRSNFTLERYLLDRFSSLRRCTRTLATIRRFLFNLARPK FT VERLTGPLTPAELKVAGLQLVRLAQHEAFAKEIACLRKSTEVSNKSKLRPL FT FPFIDLDETLRVGGRLQNADIPYDMRHPPILPQQHTWITMASCGPSRCAEE FT QRSTRDRSKSWFYYRLVEATASRWGG" XX SQ Sequence 4394 BP; 1028 A; 1342 C; 1220 G; 804 T; 0 other; tttatggtcc ttcgcgaacc gaatagtaga accgtgcgat ttcccgatgg aatcgatcac 60 cgcgaaacgg aattctctct ttgcgaaagt gaaacgcgaa ctcgaaaccg cggaacgagt 120 gaagacgcaa aacccctcac tctctgaggt ccgggagcgt ctggtgcgcc tcgaggcgca 180 ggggaacagt ttcttcgacg tccaagacca gttcgaggac ggaacctcag cgaccacctt 240 ggaaaaccta gtctcggtct tcgactaccg gaaggagttt gaagatcggt actatgcagc 300 gaaggcgatc tacgctgccc tcgacgaagg ttccggcgcc agcgatcgat cgttcgctga 360 gccggcgaac aacctgaaaa ctgcggtagc ggcgctgttg gaaacgcaac gagcgctcct 420 gtccaaccaa gcggcccacg cagcccagct gaaccagatt cagcagggaa accaacctcc 480 agtaggacaa ccagctcaag cgcaaccgga tcccttcatc aacgtccggc ttcctccaat 540 caacatcccg aagtacgcgg gagtccggaa agaatggctt tcgttcaagg acctgttcgt 600 cagcaccatc cacagcaagg agtcgctgcg gccgtcacag aagctgcagt acttgaagtc 660 gctgctggag ggagacgcag catcccaaat cagctcattc ggcatctcgg atgacaacta 720 cccgctggcg tgggacaagc tgctgaggaa gtacgatcag aagaagtaca ccgtgttcgc 780 gcttgtgaag gagttcctgg accaacctgt cgtgtcagac gcaaccgctg gcaacctcca 840 gaagctggtc acgacctcgg acgaggtagt tcggcagctc gacgtcttgg gcgagcagta 900 ccagtcacgg gacccctggc tgatccagct cctcctggag aagattgacg aggagacgca 960 ggcgctctgg gcgcaaaaac tcgtcacctt ggagaacccg agcctcacgg acttcctgaa 1020 gttccttgag gaccggtgcg acgccatcga aacctgctca tcattcacac gaaagtgcac 1080 cttgagtgga gcgggggcga agaaggagat acgaaagcag ccggccaagc cagctgagaa 1140 gaaggtgcag agctacgtcg caacaccgca gccctgccca aagtgttcca aagagcactt 1200 gctgttcctg tgcgaagcgt tcaaggcgtc cagcgtcgcc gaccgtcggg agttggtgca 1260 gaagtccaga ctttgcttca actgcctgag gacgtcgcac acggcaaaaa cctgttcttc 1320 gaagcacacc tgcaagactg atggatgcaa ccagcgtcat catacgctcc tctgtcaaca 1380 cggcggcaga caggcagcta ccgcagggct tccgggacaa cagatagcag cgatgccggt 1440 ccagcaaccg gccatggaag ctcaacccac ggttccgagt ttcaaggcgg aggtcaccag 1500 tgatccagct gtcaagatca ccgttcttcc cactgcgctc gtcaagctgc gcggtaagaa 1560 tggtgtcttc cacactgcta gggcgatgat agactcgtgc tctggcgcgt cgctgataag 1620 cgaggcctgc atgacgcgac tcgggatcag ccggagcaac gcgcggtttc ctgtaaccgg 1680 agttgccggg acacaagcag gaatcacccg gggaatggcg ctgctggaga ttgcaccccg 1740 tttcagcgac gaagtggtga tgaagacgca ggcgtacgtg ctcgaagtgt tggctccccc 1800 cacaccgtgt caaagcttca agcccaagca attggaacta ctccgagggc tccagctagc 1860 cgatccggag tactacaaag cgggaccggt agacgtcatc ctaggcgccg agctgttcct 1920 gccgattcta caagcgggcc aggttacaga cgaggacggt ctacccgtgg cccagaagtc 1980 gtcgctcggc tggctcgtcg ccggcaagtt cggcggtgaa acagtgctgc aaacaaatct 2040 ggtgtccttc acggtgcagc tggatgtcaa catcgaccag acgctgcgca agtttttggg 2100 aaaccgaaga agtgcctgcg gtcaaggtac tcactgccga cgagaagcgc gctgttgaca 2160 tcttcaacac aaccaccagg cgcaacgacg agggccgatt cgtggttcgg ctccccttcg 2220 acgactccaa gccggcattg ggcgactcgc tcggtgcggc actgagaagg ctgcacgcga 2280 tggagcggaa gttcctctcg aatccgccgt tcaagcaagc ctaccgtgac ttcatgtccg 2340 agtaccagca actgggacac atggagctca tcccgcaatc cgaagtcgac aagcacccca 2400 gcgagtgctt ctatttgcct caccacgcag tccagaagga ggacagttca acaacgaagt 2460 tgcgtgtcgt cttcgatggc tcgtgcagaa ccagttctgg cgtgtcgttg aacgaccggt 2520 tgctgattgg tccaaacaac aacgaagacc tgtacgacgt ctactgtcgc ttccgcacct 2580 accccgtggt cttcgtgagc gacatcgaga agatgtacag gcaagtacgg gtggacaagc 2640 aggacacgga cttccagcgg atcgtctggc gcgatgatcc cgaccaaccg gtgcaacact 2700 accgtctgac gacggtgacg tacggcactt caagcgcagc ctacttagcg accgagtctc 2760 tccgacaagc ggcccgagac aacgctgaga agcaccccgt agcagctgac aggatcctca 2820 aagggttcta cgtggacgac ctgatgtccg gcgccgacac ggtcgaagaa gctcaagagc 2880 tagctggcca gatcacaacc atcctgggcg aagctggatt cgtcctccga aagtggtcct 2940 ccaacgcacc cgagctgctg gagaacatcg cagacagccg gcagggacca gtccctgttg 3000 agttcacgaa cgtgacagcg tcggtcaaag cacttggaat tcactggtct ccaactagtg 3060 actggttcga gttcaaggtc aacctggaca tcaacagccc gaacacgaag cgtcaactcc 3120 tctccgacgc atccaggcta ttcgacccgt tcggatggct cgctccagtg atcgtcaaga 3180 tcaaaatcat cttccaactg ctgtggctgt acgacttgct gtgggacgac ccgctgccac 3240 cccttcccga agcagattgg aacaccatca agcaaacgct tcatctgctg gagcggattc 3300 gcgtgccccg ccgaattcca gccttcaacg gcaagttgca gctgctcgga ttctcggacg 3360 cctcggagtc ggcgtactct gcggttgtct acggccggtc tgcggaccgc aacgggaaaa 3420 tccacatcgt gctgatttgc gctaaaacca aggttgcgcc aattcagcag atctgcgtcc 3480 caaagctcga actcaacggt tcctggctgc tggccgccct gatgaagcga gtctccctcg 3540 ctttgtgcca cctcgagctg caacactttg ccttcacgga ctcggcaatc gtactggaat 3600 ggttgtcggc acacccccgc aagtggaaga ctttcgtcgc caatcgaacg tcagcgatcc 3660 tggacttcct gcccaggagc atgtggcacc atgtgtcatc gaaggacaac ccggccgact 3720 gtgcgtcgcg agggatctcg ccgctcgagc tgctcaacca tccgctgtgg tgggactccc 3780 cctactgggc ccacagcgag cagtcgttct tggaacactg cgcagcacca gcaccgaagg 3840 caccacccct cgaagaacga caaattcgca cattctgtct ggagattggc tccccgcgta 3900 gcaacttcac actagagcgc taccttctcg atcgattctc ctcgctacgc cgctgcactc 3960 ggacgctggc gacgatccga cggttcctgt tcaacctggc acgaccgaag gtggagcgct 4020 taaccggacc gctgacaccg gcggaactca aggtagctgg tctacaactc gtacgactag 4080 cacaacacga agcatttgcg aaggagattg cctgcctgcg caaatcgacc gaggtatcca 4140 acaaaagcaa gctgcgcccg ctctttccgt tcatcgacct ggacgaaaca cttcgtgtcg 4200 gaggaagact gcagaacgcc gacatcccct acgacatgag acacccgccc atcctgccgc 4260 aacagcacac ctggatcaca atggcatcgt gcggtccgtc acgctgcgca gaggaacaac 4320 ggagtaccag agaccgatcc aaaagttggt tctactaccg actggttgag gctactgcct 4380 caaggtgggg agga 4394 // ID BEL-165_AA-LTR repbase; DNA; INV; 475 BP. XX AC AAGE02018869; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-165_AA_; KW BEL-165_AA-I; BEL-165_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-475 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018869; Positions 128929 128455. XX SQ Sequence 475 BP; 170 A; 79 C; 81 G; 145 T; 0 other; tgttcgccgc aacagcccat tgtagtggac acccctgttg atagagatca tgagccacta 60 cctatgccca ctggatcgtt tgacgtcggc aaaacttcaa aatcaatgac tcgataaaag 120 tgaaggattg tttatcaact gtgcgatagt cggtgataat tcgatttatt gattgatagt 180 aaaaattatt tttggattta ttattatatg caaaggtgaa ttataaaact tgtacactta 240 gaactagatt aaaacttaca tgctaattaa aacttacagg ctacagttta aatcgtttaa 300 aacttagttg tgtgtgagat aaaaacatag tacggagact gaattgtaag taaaaattga 360 atttatcatt aaacatgtcc caaataaaaa aaactatatt ctagcttaag ctgatcatac 420 gcaaatttga gttaaagatc ttgctggaag aactcctacc gaatctccca caaca 475 // ID Gypsy-34_DPu-I repbase; DNA; INV; 5798 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_DP_; KW Gypsy-34_DPu-LTR; Gypsy-34_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5798 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [4293-4784] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 448..1566 FT /product="Gypsy-34_DPu-I_1p" FT /translation="MSERSKQPAVPFQTVTRATTARRAGLSSASSSSGGSV FT AVSQPSPTRPSRISARLSPAPMATNDDLIQALTAFTQAAANDRVLQQQKHD FT DLLQALRTQQDENTALRNLVTAGRAVVNRPSTAVVDSIPKFEGRMDEDVQG FT FINHIDRVATSEDWTDAHRLQVGIRRLVKTALLWHVQTGHTHADWASWSVA FT LVTNFSRRLSFADWNRLIQDRVQQPNESGMEYALDKFRLCRLSPTPLAEQD FT AIPFLINGLAKWEHVAAMTAAAPANIPAFIARIQQLEQLGVSARSDVAPNR FT QMNIPPPQPPDLAAAFNNFSEKLVAELATKLESLTVSRSVGRGRGGPPGAA FT RPPMECWLCHNTGHKARYCPTRPENTSAGR" FT CDS 2367..4850 FT /product="Gypsy-34_DPu-I_2p" FT /translation="MDVKEADSPVCCTATVNESKPTPRPSWVDKLRFGENL FT SEEEKINLLSVIKRRWRCFPSEDGQIGRTDQAEHLIDTGDAKPVRSAPYRV FT SRYEREIIVDEVAEMLRSGTIRPSSSPWAAPVVLVRKKDGSHRFCIDYRRL FT NSVTRRDVYPLPLIDDVFDRLSGAKFFSSLDLASGYWQVPVAEKDQCKTAF FT ITPDGLYEFRRLPFGLNNAPSTFQRLMDKVLSRLKWHMCLVYLDDVLVFGK FT NFEEHQERLELVLMALEKAGLTLNVEKCVFATSRVEHLGHVIDGNGIRPHP FT DKVQALVNFKTNDVKSLRGFLGLASYFRRFIPDFAAVASPLYGLLKKNAPG FT LVGEAGGGQTETVQRLTSAPVLAHFDEKLDVVVQTDASNLGLGAVLLQDDG FT AGPRPVAFVSRTLADAESRYHANELECLALVWALKKFRSYVYGRRFSVETD FT SSAVKWLCTKKELSGKFARWILSLQEYDFEVRHLKGTNNVVADALSRNPDG FT ELPECETDHVICVLQSNRPSGYAPHELAFLQQVDGQLRKIFIDLRYPNPGK FT NPHEFVVHRKVLYKKNFGPAGRKFLLVVPSVLRRKILKSCHDDPHAGHMGR FT EKTFARVSERYWWPKMFASVRKYVSSCMYCQLHKHAAGQTVGRLQPIPPPA FT HAFEMLGIDHLGPLQATDSGNKHVIVAIDYLTKWVEVAAVPDTSTDHVIPF FT IQDNIIHRHGHPKRLVSDQGPAFSSKEFDARMAEWRIDHIPATAEHPQTNG FT LVERLNRTAALAIAAYLNTKHTDWDVRLQGAALAVNTARQSATEITPFELV FT YGRLPVMAQENKFPWPPERPEPYAVF" XX SQ Sequence 5798 BP; 1560 A; 1245 C; 1498 G; 1485 T; 10 other; attggtgtca gtgatgagat gtggccattg aattgtctca tttctttttg agatagtcac 60 aaaagttccc acgccgattt tttttttgtg tatatttggt ccctccgcct tcatattgat 120 ttttcttttt ttttgtctta ttattattat cgccattttt ttttctcttc cctcccaccc 180 tcttcgttaa acgcccccgt tcctgtttgt ccgttcctct tttttgtgat tgttgccgcg 240 cgtttattgc ctttttattc cgcgggcacc agccattttt ttagagtgtt aaaaggcgaa 300 attctttttt tttgtgtggt gaaagagacg aaaggaagct cacgtggtag acacgcggtc 360 aaggtaccgg gccattggtg aagttaattc gtctcgagtt ttttttgtgt gtgcggaagt 420 ttattttttt tagtacgaag ctgtcgtatg tctgaacgtt cgaagcaacc agcggtccct 480 ttccagacag tcacacgagc cacaacagca aggcgcgcag gtttatcatc agcatcaagt 540 tcgtcaggag ggagtgtagc tgtcagtcaa ccgtcaccaa caagaccatc tcgtatttcg 600 gccagattga gcccagcacc gatggcgacc aacgacgacc tcatacaggc actgacggcc 660 ttcactcaag cagcagcgaa cgatagagtg ctacaacaac agaagcacga cgatttactg 720 caggcgcttc ggactcaaca agatgagaac acggctttac ggaatctggt aactgctgga 780 cgtgcagtgg taaatagacc ttctactgca gtagttgata gtatacctaa gtttgaagga 840 cgtatggatg aggacgtcca agggtttatt aatcatatag atagagtagc aacttcagaa 900 gattggactg atgctcacag attgcaagtg ggaatccgac gtttggtaaa aacggcgctc 960 ctgtggcatg tacagaccgg ccacacgcac gctgattggg cctcatggtc ggtggctttg 1020 gttactaatt tttctagacg attgtctttt gcagactgga atcgactaat tcaagacaga 1080 gttcaacagc caaacgaatc gggaatggaa tacgcgctgg ataagtttcg gctatgccgc 1140 ctctcaccaa ctcctctggc ggaacaagac gccattcctt tcctgatcaa cggcctggct 1200 aaatgggaac acgttgccgc catgacagca gcagcaccag caaatatccc agccttcata 1260 gctcgaatcc agcagctgga acagttaggt gtgtcagcga gatcagatgt ggcacccaat 1320 cgtcaaatga atattccgcc acctcagccc cctgatttgg cagcagcgtt caacaacttc 1380 agcgagaagc tggtggcaga gttggcgacc aaattggaga gcttgacagt aagtagaagt 1440 gtgggaagag gccgaggagg tccacctggg gcagcaaggc ctccgatgga gtgttggctc 1500 tgtcacaaca cgggccataa agctcgttat tgtccgacac ggccggaaaa cacaagcgct 1560 ggtcgttaga ggcaaggcca ttgttgacca gctacgcttc tttaccttgc cgcccttgtc 1620 tccctttaat taaagtaatc attgataaaa tcggtgaagt ggtcgcactc gtagactcgg 1680 gggcctccgg tagcgccatt aagtttggta cggccagtaa gctggggaag atggcgctca 1740 gaccgggaga aacaaggtat aagttacgag gtgtagacaa taagattgtg ccggtagatt 1800 cattttgttc tttaattatt gggtggggag gtatcaagac gaaattatcg gaagtagcag 1860 taattaaatc atcgccattt tgtttaattt taggagtaga ttggatagta agcagtaaaa 1920 caaaccttgt tgtaaaggga gggagacttg ttttagaagg gaagggtacg gatattaata 1980 gggggaaaga aattaggtaa aaggaaaagg atgaagaagg gaaaatttat gtgtgtcaga 2040 cgaattgatt gaaatgtatg aagccgagtc cgcaccgaaa cgttctcgtg gtggactcgg 2100 aattaaatta atcgattcag cgtttatccc aggcgactcc atgatgttcg tcgccgcaaa 2160 agtaaaaaga aaattcaccg gaaaaggagc agtgaagctg aatcaatgcg ctcatccggg 2220 caaggagtgg atggtgccgt cgacaatagt gaacatagaa aggcaaacta aaaattccga 2280 tcgtcaacct acagtccagt cctcttcaat tcaaacgacg cgacctgatc acaacagtgg 2340 acatcgacct agaagsggag atcgtgatgg atgtgaagga ggcagattct ccggtttgct 2400 gtacggcgac ggtgaacgag tcaaagccaa caccacggcc atcatgggtg gacaagctga 2460 ggttcggtga aaatttatcg gaggaggaga aaatcaacct tttgtccgtc atcaagcgac 2520 gctggcgatg ttttcccagc gaagatggac aaatcggaag gacggatcag gccgaacact 2580 taatcgacac cggcgacgcc aaaccagtcc ggtcagcacc ataccgggtc tcaaggtacg 2640 aaagggaaat aatcgtggac gaggtggccg aaatgttgag gtcaggaacg attcggccgt 2700 cgtcgagccc ttgggcagcc ccggtcgtat tggtaagaaa aaaagatggt agccaccgct 2760 tttgtataga ttatagaaga ctaaattcag taactaggag agacgtgtac ccgttgccac 2820 taattgatga cgtttttgat cgcctatcgg gcgctaaatt tttttcaagt ctcgatttag 2880 ctagtggcta ttggcaggtt ccagtggcag aaaaagatca gtgcaagacc gcattcatta 2940 caccagatgg actttacgaa tttcgacgcc tcccttttgg actgaataac gcgccatcga 3000 ccttccagag attgatggat aaagtgctgt cgcgtttgaa atggcacatg tgtctcgtct 3060 atttggatga tgtgctggtt tttggcaaaa attttgaaga acatcaagag cgattggagc 3120 ttgtgctgat ggcactggaa aaagcaggtt tgactctaaa cgtagaaaaa tgtgtgtttg 3180 caacgtcaag agttgagcat ttgggtcacg tgatcgacgg gaatggtatc cggccgcacc 3240 cggacaaagt tcaagccctt gttaacttta aaacgaatga cgtcaagtct ctgcgaggct 3300 tcctaggcct agcttcttat ttcaggcgct tcatcccaga ttttgcggct gttgccagcc 3360 ctctgtatgg cttgctcaag aagaatgcgc ctggattggt cggagaagca ggaggcggcc 3420 aaacagaaac tgtgcagcgt ttgacgtccg cacccgtgct ggcccatttc gatgagaaac 3480 tggatgtagt cgttcaaact gacgcaagca acttgggatt aggagccgtg ctgttacaag 3540 atgatggcgc tgggccacgg ccagtggcat tcgtaagtag aacgttggcn gacgcggaat 3600 cgaggtatca tgcgaacgag ctagaatgtc ttgcgctcgt ctgggcgctg aaaaagtttc 3660 gttcatacgt ttatggtcga cggttttcgg tcgagacgga cagctcagca gtaaaatggc 3720 tgtgtacaaa gaaagagtta agtggtaaat ttgcaaggtg gattctttca ttgcaggagt 3780 atgattttga ggtacgccat ttaaaaggta caaataatgt tgtggcggat gctctctctc 3840 gcaatccgga cggagaattg ccggaatgcg agacagatca tgtaatttgt gttttgcaaa 3900 gtaataggcc ttcgggctat gctccacatg aattggcatt cctccagcag gtggacggcc 3960 aattacggaa aatttttata gatttaagat acccaaaccc cggtaaaaat ccccacgagt 4020 ttgttgttca tagaaaagtc ttatataaaa agaattttgg tcccgctggt agaaaatttc 4080 tccttgtggt cccctccgtt cttcgtagaa aaattctcaa aagttgtcat gatgatcctc 4140 atgctgggca tatgggaagg gaaaagactt tcgcaagggt tagtgagcgc tattggtggc 4200 ctaaaatgtt tgccagtgta agaaaatatg tctcttcttg tatgtactgt caattgcaca 4260 agcatgccgc tggtcaaacc gtcggaagac tgcaacccat tccaccgcca gctcacgcat 4320 tcgaaatgtt gggcattgat catctgggtc ctctacaagc gactgattcc ggaaacaagc 4380 atgtgattgt ggcgatcgac tatctgacaa agtgggtgga agtggcagcc gttccagaca 4440 cgtcaaccga ccacgtcatt ccatttattc aggataacat catccatcgc cacggtcatc 4500 cgaaaaggct cgtaagtgat caaggtcccg cnttttcatc caaggagttc gacgccagaa 4560 tggcggagtg gagaatcgac catattccgg ctacagcaga acatccgcag acgaatgggc 4620 ttgtcgagag gctgaacagg acagcagctt tggcgattgc agcatacttg aacacgaaac 4680 atacggattg ggatgtgagg ctacaaggag cagcgttggc agtcaacaca gcgaggcagt 4740 cagcaaccga aatcacaccg ttcgagcttg tctatggccg acttccagtg atggcacagg 4800 aaaacaagtt cccgtggcca ccagaacgtc cagagccgta cgcggtattc scgaaacgag 4860 tggmggaact gcgagaagct gcaagactca gaatcatcga gaaacaacag aaggtgaaag 4920 agcggtggac cgtagtcgcc gagtgacgca agaattacgt gccggagaac ttgtgctagt 4980 gagacgcaac ttaactaaga aaggaaagac aaagaagttt ctsccgaaat ttgttggccc 5040 ttwtcaagtg gttaaaaaag tgtgtgagac gacttatttg gtggaggact tgcccgcaag 5100 gagaaaaaaa actaagtttc ggaggttcaa tgctcatgtg tgtcaaatcc gcmgctttca 5160 cccamgwgag gatgtcgagt gggagcaagg ggacgattcc gaagaagaag aagaaagttc 5220 tgacacagag gaggaaggag aaggcattaa cgacacggaa gagccggaag agaatttgcc 5280 gtcagttgcc gctacagaga cagtggggga ggagcaacag ccagtccagc caagagagcc 5340 ggaaagaaca agggccggca gacagactcg agcaccacgg tggcacaacg aatacgaacg 5400 gcattgacca cggcttgtat tgtgttttgt ttttttcttt ttctgtgtgt gttgttgttg 5460 tttttttttt tcctcgcttt cgtccgcatt tattcactct cattgttcat tcatgtgtct 5520 ttcacctcac cttcatatcc aaactccatc atcattccat ttgaaaacga agccatccgt 5580 ccatttccgt ccagttcttt attcctttct tctctttcac aattcttttt tgtctatact 5640 attcattttt cttccagtgc cactaaaaag ctgccacagg gtagggtttg atttggtgtt 5700 ttgtttttta tcatgtgttg aaatgttttg tctctcgaaa tgtgaagttg gatatatata 5760 tatgtgtttc gaatcgtcaa aagtcaggaa aggccgaa 5798 // ID Copia-138_AA-LTR repbase; DNA; INV; 149 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-138_AA_; KW Copia-138_AA-I; Ty1_copia_Ele71; Copia-138_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-149 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 149 BP; 50 A; 28 C; 24 G; 47 T; 0 other; tgattctgaa gaagctcaac ttagtagcag ttgatctaca ttaagaaagc ttacatcgta 60 gtagcagttg atttgttctg acaacctata taagctcagt tcaattgtaa aatgttcatt 120 cctcaataaa ccttcctaga agctagaca 149 // ID DRP_EG repbase; DNA; INV; 496 BP. XX AC X67153; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE E.granulosus EgDRep repetitive DNA element. XX KW DRP_EG; Repetitive DNA element. XX OS Echinococcus granulosus OC Eukaryota; Metazoa; Platyhelminthes; Cestoda; Eucestoda; OC Cyclophyllidea; Taeniidae; Echinococcus. XX RN [1] RP 1-496 RA Marin M.; RT "DRP_EG."; RL Direct Submission to Genbank (01-JUL-1992)M. Marin, Facultad de RL Ciencias, Sec Bioquimica y Biol Molecular, Tristan Narvaja 1674, RL 11200 Montevideo, URUGUAY. XX DR GenBank; X67153; Positions 1 496. XX SQ Sequence 496 BP; 158 A; 103 C; 106 G; 129 T; 0 other; caaatttgat ggtatttgag ctgtcttact tagtatcttc acgaatccta cgtagcgtcc 60 cgagactccc tgaagacgaa agaagacaaa atactcgggc agaatatata ttcagcgcag 120 agaaggtgtc tgggctattg catgcccaaa aggccagccc acccagcgag gtacaagggg 180 gctaattcaa agtacaaaag tggcgcctca ccagaaccga gggataaacc gctgtcgtat 240 tacacttatg cgttttatga tggggtaatc tgaatcagat ttttgaaacg cgctgaagtt 300 ctgcgaacta taattagggc taatctctta gtaatatatc atcccatgtc ttcttcagtc 360 ctaaagagcc aaagtaaggt actaaaaata taatacaata cacaaaagaa gctgcaagac 420 tgctggtccg gttggaagga cgttgtcttt tgttcacatg ttatgtttaa caagcaatca 480 aagccttaca tcaaat 496 // ID BEL-57_AA-I repbase; DNA; INV; 5726 BP. XX AC supercont1.17; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-57_AA_; KW BEL-57_AA-LTR; BEL-57_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5726 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.17; Positions 3348461 3354186. XX CC Positions [4759-5328] - Integrase core CC 'CTTGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 403..5700 FT /product="BEL-57_AA-I_1p" FT /translation="MPKKKVLQTTAANQQTPSVKKDKKQTNLEETMAQAEQ FT LSVLVHRRGLAKGKLTRLFNYLFPDEEEAPQLSEAQVRHYVKKVEAVQREY FT TEVHERILALVSEDNREDHDSHYVQFDDLQDVISVNLEEQLAKVSLNPATS FT RATFANTQAQAPILVHQPLKIPIPTFDGRYESWPKFKVMFKDLVDNTPDPP FT AVKLYHLDKALTGSASGIIDAKTISEGNYIHAWEILEERYENKRHSIDKHI FT HGLLHLKKMSKGTYDELRNLLDECSKHVESLKFLEQEFLGVSELILVHVLA FT GALDREVRRRWEHTVKHGELPTYNEMLEFLKEECFTMERCDGPGTKPASSQ FT AKPSAAATKFSQKSFAAVTTASDPKCDFCGKSHLNFACPEFKALSVPQRLV FT KVKECNACFNCLKRGHRGSSCPSEKTCSRCKRRHHSLLHAEEKPKPSVVED FT PQSKPSVLQSPKNEEPPATPTEEATSASCCNERVPSHQVFLLTAVVDIIDK FT NQKTHPCRVLLDSASQVNLISKQMVESLGLNTSPSNVVVAGVNDTKSHALS FT SSLVHIRSRYSRFSANVKCLMADKVTSDLPSSTVSVREWDIPAGVHLADPK FT FYQSGKVDLLLGNQLFLKLLMPGEVQLADNLPCLRETQLGWVVGGVCGDAM FT YDSVVHSHSLTLEELNRTVQRFWEVEDVSSANVPTEADECELHFQATHRRD FT DTGRYVVQLPLKENVSELSDSRTMALKRFYILERRFAQDPDLKQQYSEFLT FT EYERLGHCEEVKESEDPEGVKWYLPHHAVLRPSNTTTKCRVVFDASAKVKG FT LSLNDVMMIGSLNQCALDEIALCFRIPCYVLTTDVAKMYRQVLVDKNHRRL FT QRIFFRLDPSSPLRVLELKTVTYGTASAPFLATRALLQLANEDGHKFPLAA FT EIVKTCFYVDNALFGFDDIGEALEAQSQLIQLLEGGGFHLHKWASNCSELM FT ASIPEAQREELVSIADMGVNDVIKTLGLLWNPSTDELVFVAKPSPVGDNPT FT KRQVLSLVASMFDPLGLAAPVIVVGKMLMKNIWCEKIDWDEQLTEELKKCW FT NYFLKCLSDVKKIKIPRQVVAPGATAFEIHGFGDASLEAYGACVYIRSIVP FT GSPPVVKILTAKSKIVPKSVLTIPRKELLAALLLHRLVIKVIATLRMSFRQ FT VVLWSDNQIVLAWLKKKPEQLDVFVRNRVCEITSNDNDFEWRYVNTADNPA FT DIVSRGCSAEELTVSDLWWNGPTFLRAENYQMDNPAPLLDDEVPELKATVV FT TSTMVELEMPHVFEKYESFRKLQRVVAYVLRFCGNCKKKKENREVSTFPTI FT PEMQAALKVIVRVVQRHEFAEEIFKLKSGESPTKLKNLNPYFDDDVLRVGG FT RLRHSNLSYTSKHPWILPNRNVVVDSLIRCVHRENLHIGPAALLATLRRQF FT WILHGRSAVRKITRSCVRCFKVNPRTADQFMGDLPSSRCDRAPAFQRVGLD FT FAGPFLIKQTGRKAAPVKGYVCIFICMVTKGIHLEAVENLTSDAFIGALQR FT FVSRRGVPEVLFSDNGTNFVGAKKELHELFNLFKEQATKRKIFEFCQPREI FT RWKMIPPGSPHMGGLWEAGVKSTKSVLKKTCNTASLTMMEFSTLLCQIEAL FT LNSRPLYAHSEDALDPEPLTPGHFMVDRPLTAIPEPTYEEIPVNRLSRWQY FT VQLLRGNFWNRWYREYLVELQVRSKWTKKTANIRPGMVVMIKEDNLPPQVW FT KFGVVDKVFPGADGFTRVVDLRTRSGIQRRPIHKLAPLPILDNSSA" XX SQ Sequence 5726 BP; 1530 A; 1238 C; 1537 G; 1421 T; 0 other; tgaattggtc cttcgagccg gatgttcgat ggatgttaag tgaagtggaa tttcggaaga 60 attggtattg aaggccgatg aagtggaaaa aatgtggttc ggccgaatgt gaaaatgtga 120 tccggccgaa tgcgaaaatt ggtgatcggc caccgaaaaa ccaaaaagtg aaaaaaggac 180 gccatcttgg caagtgtgca ggagcgaaca cgtcgccatt gcaatcaagt gttggatgag 240 cgtaaaagac gccatcttga aaagcgaaaa agtgaagtgg ttgtgaagtg aatttgggcc 300 aggagaacac ctttccgccc caagcaccgt gattatagac ttggctgttt tccgtgcttg 360 gttcgggcat aaggctcgaa atgttccggt gattctctgg ccatgccgaa gaagaaagtt 420 ttgcaaacga cagcagctaa ccagcaaact cctagcgtga agaaggataa gaagcaaacc 480 aacttggagg aaacgatggc gcaggcagag cagcttagcg ttctcgttca tcgtcgcggt 540 ttggcaaaag gtaagctcac aagattattc aattacttat tccctgatga agaagaagca 600 ccccagttaa gtgaggcaca ggttcgccac tatgttaaga aagtggaggc agtgcaaagg 660 gaatacaccg aggtccatga gcgaatcctc gcgttagtct ctgaagataa cagagaggac 720 catgattcgc actatgtgca gtttgatgat ctccaggatg taatatccgt gaacctcgaa 780 gagcagttgg ctaaagtttc attgaaccct gccacgtcaa gagctacttt tgccaatacc 840 caagctcaag cccccattct tgttcatcag cctctcaaga tcccaatccc cacttttgac 900 ggccgctatg aaagctggcc taagtttaaa gtaatgttca aggaccttgt ggataatact 960 cccgatcccc cagctgtaaa gctataccat cttgataaag ccttaacggg gagcgcatct 1020 ggtataatcg atgccaagac cataagtgag ggaaactaca ttcatgcttg ggaaattctt 1080 gaggaaaggt atgagaacaa acgccattcc atcgataagc acattcatgg tttgttgcat 1140 ctcaaaaaga tgtccaaggg aacctacgac gagttgagga atttgttgga tgaatgctcg 1200 aagcacgtcg aaagtttgaa gttcttggaa caggagttcc taggagtttc ggagctgatt 1260 ttggtgcatg tgctggcagg ggctttggat agagaggttc gtcgacgatg ggaacataca 1320 gttaagcacg gtgagctgcc cacgtacaat gaaatgctgg agttcttgaa ggaggagtgc 1380 ttcactatgg agaggtgtga tggtcctggg acgaagcctg catcaagtca agcaaaacct 1440 tccgctgctg ccaccaagtt tagccagaag tcctttgcag cagtcacgac agcttcggat 1500 ccaaaatgtg atttttgtgg aaagagtcac ctcaactttg cgtgtcctga gttcaaggcg 1560 ctttccgttc cgcagcgatt ggtcaaggtc aaggagtgca atgcgtgctt caattgttta 1620 aagcgtggac atcgtggtag cagttgcccg tctgaaaaga cttgtagccg atgcaagcgt 1680 cgtcaccata gtttgctgca cgcagaggaa aaaccgaagc caagtgtcgt tgaagatccg 1740 caatcgaagc cgtctgtcct ccaatcgccg aagaatgaag aaccaccagc aactccgaca 1800 gaagaagcga cgtccgctag ctgctgtaat gaaagggtgc catctcacca agtgtttctg 1860 ctcacagcgg tggtggatat tatcgacaaa aaccaaaaaa cccacccctg tcgagtttta 1920 ttagacagcg cctcgcaagt caatcttatc tctaagcaaa tggttgagtc ccttggattg 1980 aacacgagcc cgtctaacgt cgtcgtggct ggagttaatg acaccaagag ccacgcatta 2040 agcagcagtt tggttcacat ccggtcaagg tattcaagat tcagcgccaa cgtgaagtgc 2100 ttgatggctg ataaagtcac gtcggatttg ccttcttcaa cggttagtgt acgagaatgg 2160 gatattccag ctggtgtcca tcttgctgac ccgaagttct accaatctgg aaaggtggat 2220 ttgttattgg gcaaccagtt attcttgaag ctactgatgc caggagaggt gcaattagct 2280 gataaccttc cgtgtctgcg cgagacccaa ctgggctggg tcgtaggtgg tgtttgcggc 2340 gacgcaatgt atgattccgt ggttcattcg cattcactca cactggaaga gttgaatcgc 2400 accgtacaaa ggttttggga agttgaggat gtttcaagcg ccaatgtgcc cacggaggct 2460 gatgaatgcg agctgcattt ccaagcgacc catcgccgtg atgacacggg acgatatgtt 2520 gttcagctac ccttgaagga gaacgtttcc gaattgagtg attccaggac gatggccctc 2580 aagcggttct atatactgga aagaaggttc gctcaagatc cagatttgaa acaacagtat 2640 tcggagtttt tgactgagta cgagcgtctt gggcactgcg aagaagttaa ggaaagtgaa 2700 gatcctgaag gtgttaagtg gtacctacca caccacgccg tcttgcgtcc gtcgaatact 2760 accacaaaat gccgagtggt atttgatgca tcggcgaagg tgaagggact atcattaaac 2820 gatgtgatga tgatcggatc gttaaatcag tgtgctctgg acgagattgc cctttgtttc 2880 agaattcctt gctatgtcct gacgacggac gtggcaaaaa tgtaccgtca ggtactagtt 2940 gacaagaacc atcgccgcct gcaaagaata ttcttccggt tggatccgtc gagtccactc 3000 agagtattag aattgaagac tgtgacatac ggtacggctt cggcgccttt cttggcaacg 3060 agggcgttat tgcagttggc aaatgaagat gggcataagt ttccactggc agcagaaatc 3120 gtcaaaactt gcttctatgt ggataatgca ctttttggat tcgatgatat tggtgaagcg 3180 cttgaagccc agtcgcaact gattcaactc ttggaaggag ggggattcca tttgcacaaa 3240 tgggcctcaa actgttcgga gctgatggct tccataccgg aggcacaacg tgaggagttg 3300 gtcagcatag cagatatggg agtgaacgac gtgatcaaaa cccttggtct tctgtggaat 3360 ccgtctaccg atgaattagt cttcgttgcg aagccgtctc ctgtcggcga taacccgacg 3420 aagcgtcaag tgctgtcgtt ggtggcaagt atgttcgatc cgctggggtt ggctgcgcca 3480 gtgattgtcg tcgggaagat gctgatgaag aatatatggt gtgagaaaat tgactgggat 3540 gaacaattaa ctgaagaatt gaaaaagtgt tggaactact tcctaaagtg cttaagtgat 3600 gtgaagaaga tcaagattcc tcgtcaagtg gtggcacctg gagctaccgc ttttgaaata 3660 catgggttcg gagatgcttc tctggaagcg tatggagcct gtgtatacat tagatccatc 3720 gttccgggaa gcccaccagt ggtgaagatt ttgactgcaa agtcgaagat agtgccgaaa 3780 tcagtgctaa caattccgcg taaggagctt ctggcagcgt tattgctcca tcgactagtg 3840 ataaaagtga tagccacatt aaggatgtca ttccggcaag tggttctatg gtcggacaac 3900 caaattgtgc ttgcgtggct aaagaagaaa ccggaacagt tggacgtttt tgtaagaaat 3960 agagtgtgtg agattacctc taatgacaat gactttgagt ggagatatgt aaacaccgca 4020 gacaatcctg ccgatatagt gtcccgcgga tgttccgctg aagaattgac tgttagtgat 4080 ctttggtgga acggtcctac gtttttacga gccgaaaact accaaatgga caatccggct 4140 ccgttgctgg atgacgaagt tcctgagctc aaggctaccg ttgtcacaag cacgatggtg 4200 gagcttgaaa tgccgcatgt cttcgagaag tacgagagtt ttcggaagct gcagcgagta 4260 gtggcctatg tgctgcgctt ctgcgggaac tgcaagaaga agaaggagaa tcgtgaagta 4320 tcaactttcc ctactatacc cgagatgcaa gcagctttga aggtaatagt acgtgtcgtg 4380 cagcgtcacg aatttgctga agaaatcttc aagttgaagt caggtgagtc ccctacgaag 4440 cttaaaaatt tgaatccata tttcgatgac gacgtcctgc gtgtgggtgg ccgcctacgc 4500 cactcaaatc tatcgtacac gagcaaacat ccatggattc tccccaaccg aaatgtggtg 4560 gtggatagtc ttatcaggtg tgtccaccgc gaaaatctgc acatcggccc agcagcattg 4620 ctagccacct tgcgaagaca attttggatc ctgcacggtc gctcagcggt gcgcaagatt 4680 acacgaagtt gtgttcgctg cttcaaggta aatcccagga cagctgatca gttcatgggg 4740 gatcttccat caagtcgctg tgatcgagct ccagcattcc aaagggtagg attggatttc 4800 gccggtcctt tcctgatcaa gcaaactggt aggaaggcag cgccggtgaa gggatacgtt 4860 tgtattttca tctgcatggt gacgaagggg atccaccttg aggcggttga aaatttgacg 4920 tcggacgctt tcatcggtgc cttgcaaaga ttcgtgtccc gtcgtggcgt cccggaagtg 4980 ttgttttctg acaacgggac caattttgtg ggcgccaaaa aggagttgca cgagctgttc 5040 aacctgttta aggagcaagc aacaaaaaga aaaatttttg agttctgcca accacgagag 5100 attcgttgga aaatgattcc gccaggctcc ccccacatgg gtgggttatg ggaggccggt 5160 gtaaaaagca caaaatctgt actgaagaaa acctgcaata ccgcttcatt gacaatgatg 5220 gaattttcga cgcttttgtg tcaaattgag gccctcctga attcgcggcc gctttatgct 5280 cactcggaag atgcgttgga tccagaaccg ctaactccgg ggcattttat ggtggatcga 5340 ccgttgacag ccataccgga accaacatat gaggagatac cagtcaacag gttgtcccgg 5400 tggcaatatg tccagcttct tcgaggtaat ttttggaatc gttggtatcg agagtatctg 5460 gtggagcttc aagtacgtag taagtggacc aagaagactg ccaacatacg ccctggaatg 5520 gtggtgatga taaaggagga taacttacca ccgcaagttt ggaagtttgg agtcgtcgat 5580 aaggttttcc ccggggcaga tggattcacc cgcgtcgtcg atcttcggac tcgatctggg 5640 atacaaaggc ggccaataca caagctggcc ccgcttccaa tcctggataa ttcttctgct 5700 tgattgagtt actgccgggg ggagga 5726 // ID CR1-46_BF repbase; DNA; INV; 3563 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-46_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-46_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3563 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3563 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1617-1617 (2009). XX DR [2] (Consensus) XX SQ Sequence 3563 BP; 975 A; 772 C; 640 G; 1176 T; 0 other; gatagatata gatcagtata gggcagtgat tggcaggttt actgccggct taccaaagct 60 gtatatactc aagtctacct taactatttg tccacaatgc attcttatat ttatttccct 120 tatcatgata attctcgcgg gtgatattca tcccaaccca gggccaccct ttcgcaagga 180 aattaatttc atgcatatta atgttaacag cctggtagct gggtcaaaaa ttgacgaact 240 gtccgctatt gtaattagat ttaaattaga tgtcgtcgct atttctgaat catggttggg 300 agattctgtt aattcttctg acatattgtt aaatgggttt caggccccca ttcgccgcga 360 tagaaacagg cacggcggcg gtgtgctcat ttatgtatct gataagatcg cctacaagag 420 acgcttagat ctggagcctt cacctgtcga atgtatttgg cttgaactgt ccacaggatg 480 ctctcgcatc atattttctg tttactaccg tccacctgga caggactcta gcactgctaa 540 tgaattcatt gacttattta tcgactctgt ctatgcagct aggtcctcca attacgacgc 600 tatgataata acaggtgatt ttaatgccaa acacaatgca tggtgggctc ctgatccaat 660 taccccagtc ggcatgaaac tatttcaagc atcaactatg ttaaacctca cccaggtgat 720 cagtgagccg acatgtgacc tgtcacgctc gccgtctttg attgacttac tgtttactga 780 tgtgggacag cttgttaaac aaaccattgt tttgtctcca ctctcagcat gtcaccactg 840 tccgataatt accaccttta atctatcttt aaagtgccca aagccttata ctcgtactgt 900 atgggactat tctaagattg accttaactt actagaagat ttctcatttg atcccaaatg 960 gaatgatatt ttctattgta ataatgtcga tgatgcatgt gacaagctcg ttgaacttat 1020 atttgaagct aaatccaaat gtgttcctca caaagttata gtggtaagac ccagagacaa 1080 accttggatg tctcctagaa ttagatcgtt aatgcggcaa cgagacaaat tacatagaaa 1140 agccaaagta tcaaatagtc ctacattatg gtctttatac cgcaatgtta gaaacaagct 1200 tgtacgtgaa atctctctct ctaagtctaa ctactacaac cgtctcgttg actcattgac 1260 gtcttctcct tgttccagta agaagtggtg gcacattgtc aaaatgttct tttgtagcaa 1320 agcgagctct accattcccc ccttaaagtc gggtcactct tatgtaactg attcttgtga 1380 aaaggcaaga atgttcaatg attttttctc tgctcaggcc tctgtggatg acaccaaggc 1440 aagccttcca caaattgatt accttaccga cgttcgtttg tcagaatgtt ttactacacc 1500 tgctgaggtt gaactttacg catcgtcttt agatatatct aaagcatgtg gctacgataa 1560 cgtcgacaac cgttttttaa aacttatatg tccccttata tctgacaaaa tcgcttacgt 1620 tttcaacata tctctttgcc atggtgtctt tccagaagcc tggaaacggg caaatgtggt 1680 tcctattttc aaaaagggag atcccgatga tgtatctaat taccgtcccg tttctttgct 1740 acctagcttg tccaaaatct tggaaaaaat tgtatttaaa cacttgtata accatctgat 1800 gtcccaaaat ttactttatt ctctccagtc cggctttatc cgcggagatt ctactgttaa 1860 tcagctcatt tgtataactg acaaaatttt tgatgctcta gactcaaaca gagaagtcag 1920 agcggtttac ctcgactttt caagagcatt cgataaagtc tggcataaag gccttatttt 1980 taagttgcaa agaaacgggg ttgacggttc aatcttaaat tggttttaca gttatctctc 2040 tagtagagag caaagagttg ttattgatgg acaatgttca gattggtgtt atgttggcgc 2100 aggtgtacct cagggttcgg tgctaggccc cctactgttc cttgtgtata ttaacgacat 2160 ggtggatggt ttagtcactt gccctttttt atttgcggac gacagctcct tattggatat 2220 cgtggaaaat cccaactcaa cagccattcg actcaatagt gatctcatga agatttcatc 2280 ttggtcggca agatggctta tggagctaaa tcccttaaaa actgaagaaa tgcttttttc 2340 caagaaatta aatccccctt accatccacc tcttttcctc aacaactgtg ttattaagtc 2400 cgttaaatct cataagcaca taggaagttt tttaacaacc accatgtctt gggaaatgca 2460 aatttcaaat atgatttcca agacatctaa acgagtgtct gcacttaaca aattgaaatt 2520 tagattacct cgaacagtac ttgatactat ttataagtcc tttatccgtc ctcttcttga 2580 gtatgccgat atcatatggc atggatgtac tttatgtgat tcccaacgtt tagaacgtgt 2640 tcagtacgaa tgtgcactta cagtctcagg ggcagtaaga ggttcttctt actcttcgct 2700 ccttgctgaa cttggttggg aaagactttc tgatcggcga cacgttcact ccttggtcat 2760 gttttacaag attgttaacg gtcatactcg ccagtatctt aaggacttga tccctcccgt 2820 agtttctgtt tttacttctc ataatcttcg caacaaacac aacctacggg taccgatttg 2880 cactacaagt cgataccaaa agtcttttat tccttatgca acacaacact ggaatggtct 2940 tgacactgct gttcgttcac ttagcttttc actgtataaa aaacatttaa tgaaattagt 3000 acgcccctcc atcaataaac attttagcta cggtcctcgg tacacctgcg cactccttac 3060 acgccttcgc ataggtacct gcagtctaaa ccaaagttta tttatccgaa atttagcttc 3120 tagtccagcc tgtaactgtg ggtgtcgatg tgaaggtgtt gtgcattttt tgttatactg 3180 ccctacctat caacaataca gaaccatttt tttcgacaat cttcagaacc tgctaggcaa 3240 ttctcttgat ttcaatagtt tatccgaatc agcccgtatc catttacttt taaggggttc 3300 aacttttctg tcatatcata caaactgcag tatcttaaag ttaacccagt tatatattgt 3360 acataccaaa cgttttgcaa ctaccttgta acctgactta tatgtaaata gctgtgaatt 3420 gtttgtttcg atattgtcac tttatttgtt cttgtcagtc atcctttttt gagaggtgat 3480 gtgaatatta gctttctgct agagtgtatt acctctctgt cctgtctgta ctttgttgtt 3540 caataataaa aataaaaaaa aaa 3563 // ID BEL-238_AA-I repbase; DNA; INV; 6888 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-238_AA_; KW BEL-238_AA-LTR; BEL-238_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6888 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 931-931 (2011). XX DR [1] (Consensus) XX CC Positions [5939-6499] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 555..3551 FT /product="BEL-238_AA-I_2p" FT /translation="MPTRGYLKSGAKGDKGAKKVQPVTGDQCASGGDGGNP FT EVVIIEETVEGHTCKTCEGADTEEMVQCDKCDGWFHFSCVGVSEEVADKSW FT SCTNCVTAKWIQRTKTALEDSTIQRNQNLDRRSTRSQPVGDQPVGNIATST FT RINPVIAPSCPQKPASVVHAEGRPADLTKMDQEKQQSVFPADDCLGVGSVN FT REAAKALSDISVSSSQRSAVQRAKLQLMRLEEERQFQKQQEERRRAAEERA FT AQEYREFLDKKYRLLEEVVSDRSSRSHSSTSSRVNDWVQKASQVQQTEPEA FT NGLRIENLQVPTTSLVPAQSSGQFGQHRIRFESTAQTNSVNNVPTVQQFNQ FT QQSRSAEQPKQQSRSAEQLFHSRFTSNQGFLATQVSQFAPPTRDPAQMNSA FT PAPASQPFNSTSFEGPNLAPPACSTGVNHQNYRGSICASQGTYREEYQDEF FT QLSRSQVAARQAVPRDLPTFTGNPEEWPIFLSMFNRTTMMCGFTEEENLVR FT LQKSLKGKAYEAVKSRLMFPGNVAGILGTLKMLFGQPEVIIQSLIGKISSL FT PSIREEKLESLVEFAVHVQNFCATVDACGMQEYMYNVSLLHQLVSKLPPSL FT KLSWAQHRLTVPTVTLATFSTWIYSLAEAASVVTFASSAQPEKSSRTDARG FT PKKGNAYLNTHSDSSLSEENEKVSPKEYSAGKPAQMKQVCPICKGTCRSAD FT KCKRFLELCRDSRWAAIREFGFCRTCLRMHKGSCNAKPCGKNGCTFRHHEL FT LHNDLKEKSQNPTASTSYESSPPDSPLPSSSGCNSHQTATCSVLFRYLPVI FT LYGQTDVVETYAFLDEGSELTLLDQDLADKLELDGAERPLCLRWTGGTERC FT EPNSRAYNLQISGIREGSQRFDLNDVRTVKDLMLPQQSLDMAQLSEKYPHL FT RGLPIDSYRDVRPRILIGTKHAHLGLVMKNREGEFGQPIAVKSRLGWTICG FT GGSGRGASLNYYSFHVCQCNTSSDDDLHQAMKNYFSLDSLGVTK" FT CDS 4040..6853 FT /product="BEL-238_AA-I_1p" FT /translation="MFHQVLIRAEDQLSQCFYWMNDRGEAEVYAMQVMTFG FT ACCSPSTAQFVKNTNAERFANEYPSAHHAITKCHYVDDMLVSVDTEEQAIE FT LAKNVKYVHEQGGFEIRNWISNSRKVKTALQGEDTNEKSLDLFSELATEKV FT LGMWWNTREDIFTYKVGWNRYDPALLGGQRRPTKREVLRVLMTIFDPLGLI FT SNFLSYLKILLQQIWRSGVQWDEEIDSEAYDKWLVWLKVLPRVEQVRIPRC FT YNSQDLMNETDEVQLHTMVDASENGIAAVCYLRFVKHESIRCSIVSAKTRV FT APLKFISIPRLELEAAVIGARLARSVEASLTIEIHRKLFWSDSRDVLCWIN FT SDHRRYSQYVGHRISEVLEISEAHEWRWVPGKLNSADDATKWNTLPELSSE FT DRWFKGSEFLWCTEDKWPKSPDRKSTTENELRPSLLLHHVLPEPVICVTNF FT STWERLCKVIAYVHRFASNCRKKGMESLITTGPLTANELFMSERTLIRLAQ FT SESYPDEVMLLRKPSDLSSEVIPKTSTLYQLTPWLDSNGILRMRTRIAACD FT YATDDAKNPIILARKHPTTSLIIDHYHRKYHHLNHETVINEIRQKFRIPHL FT RTCYKQVRSNCQMCKNQHAVPQAPYMADLPPSRLAAFTRPFTHVGIDCFGP FT IEVVVGRRVEKRWGMLVTCLTVRAIHVEVLHSLSSSSCIMALRNFMARRGT FT PQTVHSDRGTNFVGANRELLQTSEAINEDEFMKEFANRGIEWVFNPPLSPH FT MGGSWERLIRTIKNNLKVVCSSRRLSDEVLRNLLAEVENIINSRPLSHVPI FT DEDSAPALTPNHFLLGSSNGTKPLSNLDDSGEALRQNWTTSQILANQFWRR FT WISDYLPEITRRTKWFKRSKPISVGDVVIIVDPKMPRNCWPMGKIIATKIS FT MDGQVRSATVRTANGVYDRPATKLAVLDVRCAEK" XX SQ Sequence 6888 BP; 1972 A; 1630 C; 1652 G; 1632 T; 2 other; aggattagtt gaattatgtg cttattattg gatttgtatt agattgcatt acaaaaccag 60 atcactccaa ggcagaaacc aaccgtgcat ctacttggat tacctacgta agagcattgg 120 tgagaagtga gtttgctaaa cctatagatt gcctaaacta actactaaat tatacagcag 180 cagcacccaa atacgagttg aaatccaccc aaattacgaa taggaactgc acgaaaggaa 240 ctaaacgtga gtagaagaag caatattccc tattaagaaa ctacagcata agcaatcccc 300 catcagttcg gatatatcga acaggggttt ccatcaccgg cttcgcggtg cactatataa 360 aagaaatgaa attgtagaaa tgccaaacta atagttcgta tgtattcaat accattcata 420 ggaaaattat attctcccat tcatcggatc acgtgtttca ttcgtttccg gtacggagaa 480 ttacgatttt ggaagttccg ttgaatcaag attcaacagc atataatttt cgtttattga 540 acgaaacgag agtaatgcct actcgcggtt accttaagtc gggtgcgaag ggggataagg 600 gtgcaaagaa ggttcagcca gtgactggtg atcaatgtgc tagtggtggt gatggtggta 660 atcctgaagt ggtgatcata gaggaaactg tggaaggaca tacgtgtaag acgtgcgaag 720 gagcggacac cgaagagatg gttcaatgcg ataaatgcga tggatggttt catttctcgt 780 gtgtgggagt atcggaagaa gttgctgaca agagctggag ttgcactaac tgcgtcaccg 840 cgaaatggat ccagcgaact aagacagctt tggaagacag taccatacag aggaaccaga 900 acctagatcg taggtcgact aggagccagc cggtcggaga tcagccggtc ggaaacatcg 960 caacaagcac ccgcatcaat ccagtgattg ctccgtcgtg tccccaaaag ccagcctcgg 1020 tagtgcatgc tgagggtaga ccagccgatc ttacaaagat ggatcaagag aagcaacagt 1080 cggtatttcc agccgacgat tgtctaggtg ttggttctgt gaatcgagaa gcagcaaagg 1140 cactttctga catttccgtt tcctcatcac aaagatcagc cgtgcagcgt gcgaaactgc 1200 agttgatgcg actggaagaa gaacgacagt tccagaaaca gcaagaagaa cgtcgtcgtg 1260 ctgctgaaga aagagcggcg caggaatatc gagagtttct tgacaagaag taccgtcttc 1320 tggaagaagt cgttagtgat aggagttcga ggagccacag ctcaacatca agtcgtgtca 1380 acgattgggt acagaaggcc agtcaagtgc agcagactga accagaagca aatgggcttc 1440 gcatcgagaa tctacaggtt cccacgacca gcttggtacc agcccaatcg tctggacagt 1500 tcggacaaca tcgaattcga ttcgaatcaa ctgcccagac taattctgtc aataacgtgc 1560 cgactgtcca acagttcaac caacagcagt cccgttcagc agagcaaccg aaacagcagt 1620 cccgttcagc agagcaactg tttcactcca gattcaccag caatcagggc tttctggcaa 1680 cacaagtaag ccaattcgcg ccaccgacta gagatccagc tcaaatgaat tccgcgccgg 1740 caccagcgtc gcagccattc aactctacat ccttcgaagg gccgaatttg gcgccaccgg 1800 cgtgttctac cggtgtcaat catcaaaact atcggggaag catctgtgct tcccaaggaa 1860 catatcgaga agagtatcag gacgaatttc aactttctcg ctctcaagtg gcagctagac 1920 aagcagttcc gagagattta ccaacgttta ctggtaaccc ggaagaatgg ccaatctttc 1980 tatcaatgtt taatcgtacg actatgatgt gcggattcac tgaagaggaa aaccttgttc 2040 gactgcaaaa gagtttaaaa ggcaaggcct atgaagccgt aaagagccgc ctcatgttcc 2100 ctgggaatgt agccggtatt ctcggaaccc tgaaaatgct gtttggacaa cccgaagtta 2160 ttattcaatc tctgatcggg aaaattagtt ctctaccttc gattcgagaa gaaaagctgg 2220 agtcattagt cgaatttgct gtacatgtgc aaaacttctg cgctacagta gatgcctgcg 2280 gaatgcagga gtatatgtac aacgtctctc tacttcatca gctggtcagc aagctacccc 2340 cttcgctcaa actcagttgg gcgcagcacc gtctaacggt gccgacggtc accttagcga 2400 ccttcagtac gtggatctac tctctagcgg aagcggcgag cgtcgtcacc tttgcatcat 2460 cagcgcaacc cgaaaagtct tctcgaaccg acgcacgtgg acctaagaaa gggaacgcgt 2520 acctcaatac acattcagat tcatctttat ccgaagaaaa cgaaaaggtt tctccgaagg 2580 aatactctgc aggaaagccg gcgcagatga agcaagtttg cccaatttgt aaagggacct 2640 gtaggtctgc tgacaagtgt aagcgcttct tggagctttg cagggattca agatgggccg 2700 ctataaggga atttggattc tgccgcacct gtctacggat gcacaaagga agctgcaatg 2760 ccaagccgtg cggcaaaaac ggatgtacct ttcgacatca cgagttgttg cacaacgact 2820 taaaagaaaa gtcccaaaac ccaacagcct ccaccagtta cgaaagctca ccacctgatt 2880 caccccttcc tagttcatct ggctgcaata gtcatcagac agctacctgt tccgttctgt 2940 ttcgatacct tcctgtaatt ttgtatggtc aaacggacgt agtggagact tacgctttcc 3000 tggatgaggg ttcagaactt actttactcg atcaagacct cgctgataaa cttgagttgg 3060 atggtgccga acgtccatta tgtctccgat ggactggagg aactgaacga tgtgaaccta 3120 actctagagc ttacaatctt caaatttccg gaatcagaga aggaagtcaa cgattcgatc 3180 tgaacgacgt acgaacagtg aaggatctga tgttgcctca acaatcgttg gacatggctc 3240 aattgtcgga aaagtatccg caccttcgtg gtttgcccat tgattcgtac cgtgatgttc 3300 gaccgcgtat tctcatagga acaaagcatg cccatcttgg cctcgtcatg aaaaaccgtg 3360 agggagaatt cggacagccg attgccgtta agtctcgact tggatggacc atctgcggag 3420 gaggatctgg acgtggagca agtttgaact actacagctt tcacgtctgt cagtgtaaca 3480 cttcttcaga cgatgatctt caccaggcaa tgaaaaacta cttttcgctc gatagccttg 3540 gtgtaacgaa awccgataaa ctgctcctac ccgtcgaaga tcagcgtgcc ctgtcaatgc 3600 ttcagtctct taccaatcgt aaggatgacc gttacgaatc cggtttgcta tggcgttacg 3660 acgatacacg tctgccggat agccgatcta tggcattgcg ccgatttcaa tgtctgaaaa 3720 aacggatgga gagagacacg cagctgcagg aagttctgaa atctaaaatt gtggaatatt 3780 tagcgaaagg ctacattgga aactcagtga ggaggaaatc attcagcagg tccctcgccg 3840 ttggtattta ccagtatttc ccgtaacaaa tccaaataag cctggaaaag tacgtttagt 3900 atgggacgca gcagcaagcg cttacggtac gtcmctcaac tctgcgcttt tgaaaggacc 3960 ggacctactt tgttcccttt tcacgatcct gcttcgattc cgagaacgcc gtatcggact 4020 aaccggtgat attcgtgaga tgtttcatca agtgctgatt cgagcagaag accagttgag 4080 tcagtgtttc tactggatga atgatcgtgg tgaagccgaa gtatatgcta tgcaagtaat 4140 gacgttcgga gcatgttgct ctcccagcac tgcacagttc gtgaagaaca ctaacgccga 4200 gcggtttgca aacgaatatc catctgcaca tcatgccatc acaaaatgtc attacgtcga 4260 cgacatgctc gttagcgtag atacggaaga gcaggccata gagttagcta agaacgtgaa 4320 gtacgtccac gaacagggtg gttttgagat ccggaactgg atcagcaact cgcgaaaagt 4380 taagacagcg ctacaaggag aggataccaa tgaaaaatct ttggacctgt tctctgagct 4440 ggccactgaa aaggtgttgg gcatgtggtg gaacacgaga gaagacattt tcacctataa 4500 ggttggatgg aaccgctacg atccagctct attaggaggt caacggcggc caacgaagcg 4560 ggaagttctt agggtattga tgacaatttt tgatccgctg ggattgattt ccaacttcct 4620 gtcataccta aagattctgc ttcaacagat ctggaggtcc ggtgtacaat gggatgaaga 4680 aattgatagt gaagcatacg ataaatggct agtctggcta aaagtgcttc cacgtgtgga 4740 acaagtgcgc attccacgat gctacaactc tcaggacctt atgaacgaaa ccgacgaagt 4800 ccagctccat accatggtgg atgccagcga gaacggaata gccgcagtat gttatctacg 4860 ttttgtcaaa cacgaatcta ttcgctgctc catcgtttcg gcgaaaacca gagttgctcc 4920 actcaagttc atatctatac caagactgga actcgaagct gctgtgattg gtgcccgtct 4980 agcacgatcc gtcgaagcct ctctgaccat cgagattcac agaaagctgt tttggtccga 5040 ttccagggat gtcctctgct ggatcaactc tgaccaccgt cgctatagcc aatatgtagg 5100 ccatagaata agtgaagtcc tcgaaatttc tgaagcacat gaatggcgat gggttccagg 5160 taaactaaac tccgctgacg atgctacgaa gtggaatact ctacccgagt tgtcatccga 5220 agataggtgg ttcaagggct ccgaattcct ttggtgcacc gaagacaaat ggccaaagtc 5280 tccagatcgc aaaagcacca ccgaaaatga gctgcgtcct tcactcctgc tacaccacgt 5340 tttgccggaa ccagttatct gcgtcacgaa tttctcgaca tgggaaagac tgtgcaaggt 5400 catcgcatac gtccatcgat ttgcttcaaa ctgtcgtaaa aaaggcatgg aaagcttgat 5460 taccacaggt cctcttacgg ccaacgagtt attcatgtcc gagcgtacac tgatacgact 5520 cgctcaaagt gaatcttacc ccgacgaagt aatgctactt cgcaagccat ccgatctatc 5580 ttcagaggtc attccgaaaa ccagtacact gtatcaacta acaccatggt tggatagcaa 5640 cggaatatta cggatgcgaa cccgcatcgc tgcctgcgac tacgccacgg acgatgccaa 5700 gaacccaatc attctcgcaa gaaaacatcc cactactagc ctgatcatag accattatca 5760 ccgaaaatac catcacctca atcatgaaac ggtgattaat gaaattcggc agaaattccg 5820 tattcctcat ctacgtactt gttacaaaca agtgcgtagt aactgccaga tgtgcaaaaa 5880 ccagcacgcg gtaccacaag caccatacat ggccgacctt ccaccctccc gcttagcagc 5940 ttttacacga ccgttcactc atgttggcat cgactgtttc ggaccgatcg aagtcgttgt 6000 cggaagacga gtggagaaac gctggggaat gctggttaca tgcttgaccg ttcgagcgat 6060 ccatgtagaa gtattacatt cgctcagctc gagttcatgc atcatggcgc tccgaaattt 6120 catggcacga cgtggaaccc cccaaacagt ccacagtgac cgaggaacaa actttgtggg 6180 cgcgaatcga gagttgctgc aaacgagcga agctatcaac gaagacgagt tcatgaaaga 6240 gtttgccaat cgtgggattg agtgggtttt taatccgcct ctctcaccgc acatgggcgg 6300 aagttgggaa cggttgatac gcaccatcaa gaacaaccta aaagtagtgt gttcttcgag 6360 gaggctgtct gacgaagtgc tccgaaattt gctagcagag gttgagaaca taataaattc 6420 acgcccgcta tctcacgttc cgatagacga agactccgcc ccggcgctga caccaaatca 6480 ctttttactt ggttcttcaa atggaacaaa gccactcagt aatcttgatg atagtggtga 6540 agcgcttcga cagaactgga caacatcaca gattctggca aaccaatttt ggagacgttg 6600 gatctctgat tatcttccag agatcacccg gagaacaaag tggtttaaac gatctaaacc 6660 aatttctgtc ggagatgtag ttatcatagt ggacccaaag atgccacgca actgttggcc 6720 gatgggaaag attattgcaa ctaagatcag catggatgga caagttcggt cggcaactgt 6780 gagaacggca aatggagtgt atgatcgccc cgctacaaag ctggcggttt tggacgtacg 6840 gtgcgccgaa aagtaagcca acttgagagt tggcatacct ggggggag 6888 // ID Proto2-1_SK repbase; DNA; INV; 4384 BP. XX AC . XX DT 10-JUL-2009 (Rel. 14.07, Created) DT 10-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Hemichordate Proto2-1_SK autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-1_SK. XX OS Saccoglossus kowalevskii OC Eukaryota; Metazoa; Hemichordata; Enteropneusta; Harrimaniidae; OC Saccoglossus. XX RN [1] RP 1-4384 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1556-1556 (2009). XX DR [1] (Consensus) XX CC Proto2-1_SK is a very young family of non-LTR retrotransposons. CC It belongs to a novel clade of metazoan non-LTR retrotransposons CC called Proto2. This clade includes families of non-LTR CC retrotransposons present in the hydra (from Proto2-1_HM to CC Proto2-5_HM), annelid (from Proto2-1_CS1 to Proto2-8_CS1), CC hemichordate (Proto2-1_SK) and amphioxus (Proto2-1_BF) genomes. A CC model Proto2 non-LTR retrotransposon is 4.3 kb long, contains two CC ORFs; its 3' terminus is composed of microsatellites; usually CC there is no target site duplications generated upon CC retrotransposition of Proto2 elements. ORF1 codes for a protein CC conserved in elements from all species mentioned above. ORF2 CC codes for a protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 173..1480 FT /product="Proto2-1_SK_1p" FT /note="ORF1." FT /translation="MVLRCNDCTCGVLSKQVTISIGDLQLCGKCTDKRPDG FT LTANNDVECVQEYEDDGHKAIVVNELLCFLSNKMDSMANDMLIKLCVDNFS FT DHDIEIAKRSLYEHCGSSTRFIRRQGHNKKVNNLQDTITLFHEADIHTLPW FT FVARDLSKLPPIDFNHVDVSGMVRDLKFLRQEINILHDSLSQSTNCALSVA FT SLERNVCELRSDIANIKAKVASNDTCSSTVPKSRDMSLSEDNGCVSHKVSP FT EKEMAGDPGIADGLIDNVLIGETHPSIKSYANAVITPSTGENVKQTRLRSK FT SLNAISQPTILESQPGGPTRQPIKSSGLLDEYGFTTVTRRRKKKPPVFTIG FT EGNCNSDLHVVIRSQLPAKIFVTRLAPDTTESSIVKHIKKTVNIDAKCEKL FT RSKFDTYASFVITIPRDTVSSVMQPTSWPAGVLVRKYYQSRS" FT CDS 1480..4314 FT /product="Proto2-1_SK_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MNSLTISSFNSRGIKSSVNDVRSLCNRCDICLIQEHW FT LCTSELGILSRIHPDFNSCGESAMDEFNNLRVGRPFGGIGILWRKSLDHVV FT SIKRFDDPRVLGISIECNNNTSILIICIYMPTDNSDNFMEYLNYLAKVKSI FT IHNSNTCNSLIIGDWNANVTSPFGRELINFCNENNLVLSDYVHLHTHGEPF FT TYYSESHGTTSWLDHCVSTINAHDGISSINISHDVISSDHFPLTVRLNIDK FT LPTIETRRLCGVACNTDNVSVNWDKISSSQLNDYREMSDRLLKEVTIPIES FT ISCKNANCHVASHRDDLHTFYCDIITALNSSTRNIQHTPRNKSVNLVPGWN FT EAVKEVYDASRDAFKRWVSFGKPRYGPIYELMRKTRAQFKRVFRACRRDED FT RIKADKIATHHCNKDFSKFWKEVNSDKVNQFISCCVDGQSTEVDIADMWKD FT HFSSILNSVKNCESRDSVEASLGNIMFHPSMIIQVAEVKEYIMRLSGRKAV FT GTDNISPEHLKHASDSLYVLLSLCLTGMLVHGYLPNELLETIIVPIIKDRS FT GDITSRDNYRPIAVATSMSKLLELILLKRLEDFITTTDFQFGFKSNHSTDM FT CVYVVKEIIDFYKSHNSPVFLCFMDASKAFDRVNHWSLFKKLIERHVPLFI FT VRLLMYWYNSQQFYVKWGSVMSDSFGVSNGVRQGGILSPFLFNLYVDDLSI FT ELGNISVGLKINSMFINHMFYADDLILVSPSIKGLNFLVKKCENYANEFDI FT LFNFKKSVCMYNLPSQCGIKRVPNVYLNGKPLKFVSQYCYLGYHFTQDGKD FT SICMRKQIRGLFARANFLLRRFSKCSLPVKCTLFRSYCSNLYCSYLWSDFT FT VADFNRIRVAYNNALRIICNVSRSFNSSITETFARLHVDDFQVKLRKMCYS FT FSQRILLSKNSVLINLLSSDIFFKSRIWQMWRGRFYLRHTV" XX SQ Sequence 4384 BP; 1260 A; 738 C; 882 G; 1504 T; 0 other; tgttactaca ggttcaattt ctacactcta tctccgtgaa tgtcatgtgc gtctgctcgt 60 gtgctttaat agtgtcattt ttattctgaa gcgtgaaaac cgagcataaa ctttgtcgca 120 ccgagagact gacggcaatt gtaccaggcg tggtatgtaa acaggtgcaa cgatggtgct 180 ccgttgtaat gattgtacat gtggtgtact ttctaaacaa gtaactataa gtattgggga 240 tctgcagtta tgtggtaagt gcactgataa aagaccagat gggttaacgg caaataatga 300 tgttgaatgc gttcaagaat atgaagatga tggtcataaa gcgattgtag ttaacgagct 360 gttgtgcttt ttgagcaaca agatggattc catggctaat gatatgctaa ttaagctatg 420 cgtagacaat tttagtgatc acgatattga gatagccaaa cgctctctgt atgaacattg 480 tggtagttca acccgtttta tcagaagaca aggacataac aagaaggtga acaatttaca 540 agacacaata actttatttc atgaagctga tattcataca cttccatggt ttgttgctag 600 ggatctctct aagcttcctc cgattgattt taatcatgtt gatgtttctg gaatggtacg 660 cgatttgaaa ttcttgcggc aagaaataaa tattttgcat gattcgttgt cacaaagtac 720 caattgtgcc ttgagtgtgg ctagtttaga acggaatgta tgtgagcttc gtagcgatat 780 cgccaatatt aaagctaaag ttgcatcgaa tgacacatgt tccagcactg tacctaaatc 840 acgtgatatg tcattgtctg aagataatgg atgtgtatct cacaaagttt cgcctgaaaa 900 agagatggca ggtgacccgg gtattgctga tggtttgatt gacaatgtat tgattgggga 960 aacgcacccc agtattaagt cgtatgcaaa tgccgttata acaccttcaa caggtgagaa 1020 tgttaagcaa actcgattgc gtagtaaatc actaaatgca atttctcaac cgacgatctt 1080 ggagtcgcaa cctggtggtc ctactcgaca gccaatcaaa tcatctggtt tactggatga 1140 atatggattt accactgtta ctaggagacg taaaaagaaa cccccggtgt ttacaattgg 1200 tgaaggtaat tgtaattctg atctacacgt cgttattcga tctcagcttc ctgctaagat 1260 atttgttacg agacttgcac cggatactac agaatcttca attgtgaagc atattaaaaa 1320 aactgtgaat attgacgcta aatgtgagaa acttagaagt aaattcgata cctatgcttc 1380 ttttgtaatt acgattccac gtgacactgt gtcatctgta atgcagccca catcgtggcc 1440 tgcgggcgtt cttgttcgaa aatactacca atctcggtca tgaattctct aacaatatca 1500 tcttttaaca gtcgtggaat taaatcgtcc gtaaatgacg tgagatcttt gtgcaatcgt 1560 tgtgacattt gtctcattca ggagcactgg ttgtgtacct ctgaacttgg aatattatcg 1620 cgtatacatc ccgattttaa ttcgtgtggg gagtctgcta tggacgaatt taacaatctt 1680 agagttgggc gtccgtttgg tggtatcggt attttgtgga ggaagagcct agatcatgtg 1740 gtgtctataa agcgtttcga tgatcctcgg gttcttggca tctctattga gtgtaacaat 1800 aatacatcta ttttaattat ttgtatttac atgcccacag acaattcgga caacttcatg 1860 gaatatctta actaccttgc aaaagttaaa tcaattatac acaattcaaa tacgtgtaat 1920 tcattaataa taggtgattg gaatgccaat gtaacttctc cattcgggcg tgaattaatt 1980 aatttctgca acgaaaataa ccttgtgttg tcggactatg tgcatttgca tacacatgga 2040 gaaccattta catattacag tgaatcccat ggcactactt cgtggttgga tcattgcgta 2100 tcaacaataa atgcacacga tggtatttcg tcaataaata tttctcatga cgtcatctca 2160 tctgatcact ttcctttgac tgttcgtctg aacattgaca aacttcctac aatagaaact 2220 cgtcgactgt gtggtgtagc ttgtaatact gacaatgtat ctgtgaattg ggataagata 2280 tcttccagtc aacttaatga ctatcgtgaa atgtcggaca gattattaaa agaagtgacc 2340 atacccattg aatctattag ctgtaaaaat gctaattgcc atgttgctag tcatcgtgac 2400 gacttgcata cattttattg tgatataata actgcactta attcttccac tcgtaacatt 2460 caacatactc cacgaaataa atctgttaat ctggttcctg gctggaacga agctgtaaag 2520 gaagtttatg atgcatccag ggacgctttt aaacggtggg tgtcatttgg taagccccgt 2580 tacggtccaa tctatgaatt aatgcgcaag acccgggctc aatttaagcg tgtatttcgt 2640 gcatgtcgta gggatgaaga ccgcattaaa gctgataaaa ttgctacaca tcattgtaat 2700 aaagattttt ctaagttttg gaaagaagta aactctgata aagtaaacca attcatttct 2760 tgttgtgtag atggtcagtc tactgaagtg gatattgctg atatgtggaa agaccatttc 2820 tctagtattc taaattctgt aaaaaattgc gaatctcgtg attcagttga ggccagttta 2880 ggtaatatta tgtttcaccc atctatgatt attcaagtag ccgaggtaaa agaatacatt 2940 atgcgattat cgggacggaa ggcggttggg actgacaata tttctcccga acacctaaaa 3000 cacgccagtg atagtttata tgtactcttg tcgttgtgct tgacaggcat gcttgttcat 3060 ggttatttgc caaatgaact gttagaaact atcattgtcc caattataaa agatcgctcg 3120 ggcgatatca cttcccgtga caattatcgt ccgattgctg ttgctacctc tatgtcaaag 3180 ttgttagaat taatacttct gaaacggttg gaggatttta taactactac tgattttcaa 3240 tttggtttca agtcgaacca ttccacggac atgtgcgtct atgttgttaa ggaaattatt 3300 gatttttata aatctcacaa tagtcctgtg tttttatgtt tcatggatgc gtcgaaggcc 3360 tttgatcgtg tcaatcattg gtcccttttt aaaaagctta ttgaacgtca tgttccatta 3420 tttattgtaa gacttttgat gtattggtat aattctcagc aattctatgt taaatggggt 3480 tctgttatgt cagactcgtt tggagtttct aatggggtta gacagggggg tattttatct 3540 ccctttcttt ttaacttgta tgttgatgat ctgagtattg agcttgggaa tatttctgtt 3600 ggacttaaaa ttaacagtat gttcatcaac catatgtttt atgctgacga ccttattttg 3660 gtttctcctt ctattaaggg tttgaatttt cttgtaaaga agtgtgagaa ttatgccaat 3720 gaatttgata ttttattcaa ttttaagaaa tctgtctgta tgtataattt accatcccag 3780 tgtggcatta aacgagttcc caatgtttat cttaatggca agccactaaa atttgttagt 3840 cagtattgtt atttaggata tcattttacc caggacggaa aagacagtat ttgtatgaga 3900 aaacaaatac gtggattgtt tgctcgagcc aactttttat tacgtagatt tagtaaatgt 3960 tctcttcctg ttaagtgtac gctatttcgt agttattgtt caaatttgta ttgtagttac 4020 ctatggtcgg attttactgt ggcagatttt aatcgtattc gtgttgcgta taacaacgct 4080 cttagaatta tatgtaatgt ttctcgttct tttaatagta gtataacaga gactttcgcc 4140 agactacatg ttgatgattt tcaggttaaa cttagaaaaa tgtgctacag tttttcgcaa 4200 cgtattttgt tgagtaagaa ctctgtttta atcaatttat tatctagtga tatttttttt 4260 aaatcacgta tatggcaaat gtggcgtggt cgattttatt taaggcatac tgtgtaattt 4320 gttaatattt ttttttctta ctttttatat gggcttgtcc tgaaataaaa gataataata 4380 ataa 4384 // ID Mariner-2_SM repbase; DNA; INV; 1879 BP. XX AC . XX DT 08-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA transposon: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1879 RA Jurka J., Bao W. and Tempel S.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 146-146 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 644..1609 FT /product="Mariner-2_SM_1p" FT /translation="MENGEDYFHFGKLLGIARNSIKTIIYKYRKFGKLSQG FT KIGGSISKISEENGNKLLDFVEKNVYASLEEMKNFLISENLSVSTNTISRF FT LENHLITTKQIRPSVAERNSDRVLDLRMEYTTRYLEEAWMDSQLIYIDECG FT FNTWTSQKLGRNKKGERLYATLPTNRGRNMSLALAIGVNGPVHHKLVVGSY FT KKDTYQIFINEITVKLNGTGFRLIHDNASIHGGAISLNNVIEHLPPYSPFL FT NPIESVFSKIKFNIRKSISRAGGLSSMNFQSRFDFLKDALDNELVKEDYRD FT LTNYFWHIRRFFPKCIKKIHIFGDYLLLLT" XX SQ Sequence 1879 BP; 732 A; 282 C; 290 G; 575 T; 0 other; tacaccagct gtcactgaga taaatcacct ataatataaa agaaaatcaa taaatttata 60 gtatataaac acaccaactc accttgaggt tgcttgtaaa atcgtcaatg agagaaaccc 120 taaactatag agacacaaaa agtgatagag aaataaaata atgcacatta actcaaccta 180 taataaaaca tgcatgctaa cttttaacat gtagccatcc gaagttgaaa aaacccctat 240 aataaattaa tttagacatt tttgagatat ttttagaaaa aaaaatggcg gaaaaatttt 300 taaaatttaa tgctaaggaa acttatccaa atttttatga aaaagatgaa acttcttcgt 360 aagatttatt tgaaaattga aataaattac atattttcat aggctttcag acatttctgc 420 agaaacaatt ccatttcgcg acgaagaaga aatcacattg tacgatttat ttgaaaattg 480 aaataaatta aatatttcca taggctttca gaagttgcta ggccctcaac aacatctgag 540 actcgaacaa aaaagcataa actgtaaatt ttatttttaa gttataaaaa attattcttc 600 ttacagaatt tcggaaatcg ataaaatacg aattatcaat gccatggaaa atggtgaaga 660 ctatttccat tttggaaaat tattgggaat agctagaaat tccataaaaa caataattta 720 taaataccgg aaatttggaa aattgtcaca aggcaagatt ggtggcagta tatccaaaat 780 ttcggaagaa aatggcaaca aactacttga ttttgtggag aaaaatgttt atgcttcatt 840 agaggaaatg aaaaactttt taatttccga aaatttaagt gtttccacaa acactatttc 900 ccgttttctt gaaaatcatc tcataacgac caagcaaata agaccatcgg tcgctgaaag 960 aaattcagat cgcgtcttgg atctcagaat ggaatacacg acgcgatatt tagaagaggc 1020 atggatggat tctcaattaa tatatattga tgaatgtggc tttaatactt ggacaagcca 1080 gaaacttgga agaaataaaa aaggtgagcg attatacgcc acacttccaa cgaaccgagg 1140 gcgaaatatg tccttggctc tcgcaattgg agtcaatgga ccagtccatc ataaattagt 1200 agtaggatcc tataaaaagg atacctacca gatcttcatc aacgaaataa ctgtaaaatt 1260 gaatggaact ggatttcgtt taatccacga caatgcttca attcatggtg gtgcaatatc 1320 tctaaataat gttattgaac atttaccacc atatagtcct tttctgaacc cgattgaaag 1380 tgtgttcagt aaaatcaaat tcaatattcg gaaatctata tcaagagcag gtggattgag 1440 ttcaatgaat ttccaatcca gattcgattt tctgaaggac gcactggata acgagctggt 1500 gaaagaagac tacagagatt taactaatta cttttggcat attaggagat tttttccgaa 1560 atgtataaaa aaaatacata tttttggaga ctatttattg ttacttactt gatttcctaa 1620 atattattta tagtaatata taaaataaat gatgattaaa agttatattt aagagcttga 1680 attatgatca tatttcctta tgaaatcata attcaactct taaaatttaa ctttaagtta 1740 tataacaata taaaaaacaa cttacttagc agccatttaa aaaatggctg aaatgaattc 1800 acaaaaaata aattccgcca agttgattta tgtatcaact tgaattatag cggaatttat 1860 ctcagtgaca gctggtgta 1879 // ID Tx1-2_CQ repbase; DNA; INV; 4715 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4715 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 634-634 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 153..1346 FT /product="Tx1-2_CQ_1p" FT /translation="MANSVRKNTAKLVFGPGIKVPSYLDVLKFSMDTLKLS FT AADIHSIYKDENGGQFYVKLIDEETFSAFIAESDEHYQFKFEDGTTARVAV FT EQASRVFRYVRIFNLPPEIDDATIQYVLGQYGTIRQHVRERFPNELKVNIF FT TGVRGVHMEVAKEIPAHLYIGHFRARIFYDGLKNKCFFCKAEGHVKQNCPK FT MAAISTGSGSGSGSSGGSGSSAPSLGRPAKPIIDFGQPIAPAMTLLNQKKN FT DDTQNPITTTSTQAVPSASTTTTVAVPSASTTTSPSVSTPAPISSTTVAPS FT ASTTTTKQTVDTNTNDGVLDALRPGGVSEEHIDKADTDAMDAEISELDRQT FT LLKRTKSSSPVSSEGDAGKLGGKGKGGKGKGGPTGEGDGFIPVPNGKRNKK FT KKTT" FT CDS 1411..4656 FT /product="Tx1-2_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MATTTFSRKLATININAISTNIKLGLLKDFVWNNDLD FT IVFLQEVSFENFKFLASHFAFVNISVDNKGTAVLIRKNIEFDNMLMNPNGR FT ILSVTIDKVHFINVYGHSGSQYRHQRDVLFSNDIIPHLSAAHPNVILGDFN FT CILLNSDSNGTVKNLCQGLKDLTSNLDLIDIGNSIHRNTIYTFFRGDSASR FT IDRIYGPKDFLEKVLSFDTLTTAFSDHRAIVLKFNVDSNSNVTSTGRGFWK FT INPSFLHDEDIKREFVLEIQNTRQRFAYNNNFCRWWIEHFKPKAKSFFKTR FT SFELNKFITSSKSFLYGCLREISCKQSNNENVEKELSFVKSKLMNIEQNRL FT SNLKLKIQPENIIEDEKLSTFQMSKYAAKNSIMKLLINGNLTSEKTKLQLA FT LENHFSLLFEGDKTFYPGVNNIFNHVKNRLTLEQSNNLIKPIDLNELQEAL FT QQASKKKSPGPDGLTYEFYCTFFDEIKNDLLKLFNLFLMGEIPNTAFVEGI FT VTLIPKKEQSFEISDKRPISMLNCDYKLFAKIIMNRLQPLMDSLIGPGQAA FT GISEKSCITNLKLLRNLIIKAKKSKSLKVLVASFDLEKAFDRVDHHFLWLT FT LEKFGFPIQFVNLIRNLYKNASSKILFNGFLTNNIFIKSSVRQGCPLSMAL FT FIIYIEPLIRMLYDDVRGCLIDNSFLKVIAYADDINVVIRNNHEFDKTLEL FT INYFSIFAKIRLNVGKSQYMRLNNCLSGPHVLKEVSSLRILGIDLSENFNT FT IIENNYSRIILNIKFLISIHHKRKLNIYQKSWILNQIILSKLWYIAQVFPP FT NNSNIAEIRTICRNFIFKGVGLYKVKFDQLYLDVDQGGLSLVDVESKCKAL FT FVKNILFPSQDGIKDSFMLSQQTNNTLSRNSREWLSLAEEISNDTDLNTSK FT KIYRLLISKQNIKIKIATEFPELPWQNYWQNIQSNFISSEEKHSLFIMLND FT LIPTKAKLFRHKVRDTNTNLCEYCNKLDTSKHRIKLCSSSRLVWNSVKTTI FT IRKLGLDIVDPEELLGKTFKNEKEKLGLWLVVEAIAYNLKNFGNGTVADFE FT CKVREKRWNNKKFYDRVFGKWICYI" XX SQ Sequence 4715 BP; 1597 A; 904 C; 894 G; 1318 T; 2 other; cagtctcgga cgtgagctcg tacgaacaag acgtatttcc atcgacgctc tcgagaataa 60 gtccggtttc gaacaatagt cctccgcggc tattgtttcg tacgcgtttt tctttgttag 120 ctaaaagcgg aatcttcccc ggaagagtaa aaatggcgaa cagtgtacga aagaacactg 180 cgaagttggt cttcggacca ggaatcaagg taccgtcgta cctggacgtg ctgaagttct 240 cgatggacac cctgaagctg tcagcagcgg atatccacag catttacaag gacgagaatg 300 gcggccagtt ttacgtcaag ctgattgacg aggagacttt ctcggcgttc atcgccgaat 360 cggacgaaca ctaccagttc aagttcgagg acggaacaac ggcaagggtg gcggtagagc 420 aggcatcacg cgtcttccgg tacgtccgga tcttcaatct gccaccggag attgacgacg 480 ccacgatcca gtatgtgctg ggtcagtacg ggaccatacg gcagcacgtg cgcgaacgtt 540 ttcccaacga gctgaaggtc aacattttca ccggtgtacg aggggtgcac atggaggtgg 600 cgaaggaaat tcctgcacac ctttacatag gacatttccg agcccgcatc ttctatgatg 660 ggctcaaaaa caagtgcttt ttctgcaagg cagaagggca cgtcaagcag aattgcccca 720 aaatggcggc gatctcaaca ggcagcggta gtggcagcgg cagcagcgga ggcagcggca 780 gctcggcgcc gtcccttggt aggcctgcaa aaccgatcat cgatttcggc cagccgatcg 840 ctcctgcgat gacgttgctt aaccagaaaa agaacgacga cacacaaaac ccgatcacca 900 cgacatcaac gcaagcagta cctagcgcct ccaccactac aacggtagcg gtaccgagcg 960 cttccacgac aacatcacca agcgtctcca cwccagcacc aatatcatcg acaacggtag 1020 caccatctgc aagcaccaca acaaccaagc aaacagtgga cactaacacc aacgacggcg 1080 tgctggacgc tttgcggccg ggcggcgtga gcgaagaaca catcgacaaa gcagacaccg 1140 acgcgatgga cgccgagatc agcgaactcg atcgccagac gttgctgaag cgaacgaaga 1200 gttcctctcc ggtctcttct gagggtgatg cgggtaagct gggtgggaag ggaaaggggg 1260 gtaagggaaa gggtgggcca actggagagg gggatgggtt cattccggtt ccaaatggta 1320 aacgcaacaa gaagaagaag acaacctaaa tcatccatcg ttccatcgtt ttatcgtacc 1380 atcaataact aaaaaaaaaa catcttaata atggccacaa caacattctc tagaaaacta 1440 gccacaatta acatcaacgc tattagcact aacatcaagc taggtttatt gaaggacttc 1500 gtgtggaata atgatcttga tattgttttt cttcaggaag tgtcgtttga aaactttaaa 1560 tttcttgctt cccattttgc ttttgttaat atcagtgtcg acaacaaagg aactgctgtg 1620 ttgattagaa aaaatatcga gtttgataac atgttaatga acccaaatgg tcgaattctg 1680 tcggtcacaa tcgataaagt tcactttatc aacgtttatg gtcattccgg atcacaatac 1740 cgacaccaac gtgacgttct gttttcaaat gatatcatcc cacacctatc tgctgcccat 1800 cccaacgtaa ttctaggcga ttttaattgt attcttttaa attcggattc aaacggaacc 1860 gtgaaaaatt tatgccaagg actgaaagac ctaacatcwa atcttgattt aatcgatatc 1920 ggaaattcaa ttcaccgtaa tacgatttat acatttttca gaggagattc tgcatccagg 1980 attgatagaa tttatggacc aaaagatttt ttggaaaaag ttttaagttt cgatacatta 2040 acaactgcct tctctgatca ccgtgcaata gttttgaaat ttaatgttga ctcaaattct 2100 aacgtcactt ccaccggtag aggtttctgg aaaataaatc cctctttttt gcatgatgag 2160 gatatcaaac gtgaatttgt tttagaaatt caaaacacca gacaaagatt tgcatataac 2220 aataattttt gtcgatggtg gattgaacat tttaaaccaa aagcaaaatc attttttaaa 2280 acaagatctt ttgaattgaa taaatttata acatctagta aaagcttcct ttacggttgt 2340 cttagagaaa tatcatgcaa acaaagcaac aatgaaaatg ttgagaagga actatctttt 2400 gtaaaatcca aattgatgaa tattgaacaa aatcgacttt caaatttaaa attaaaaatc 2460 caaccagaaa atataattga agacgagaaa ctaagtacat ttcaaatgtc taaatacgct 2520 gcaaaaaact ctattatgaa acttttaatt aatggtaacc ttacttctga gaaaactaaa 2580 cttcaattag ctttagaaaa tcattttagc ttattattcg aaggcgataa aactttttat 2640 ccaggtgtaa ataatatttt taatcacgtg aaaaaccgtt taactcttga acaaagcaat 2700 aatttaataa aaccaattga tctcaacgaa ctacaagagg cactacaaca agcttctaaa 2760 aagaaaagtc caggaccaga tggactgaca tacgaatttt actgtacttt ctttgacgaa 2820 ataaaaaatg accttttgaa actttttaat ctatttttga tgggggagat tcctaatacg 2880 gctttcgttg aagggatagt aacattaatt cctaaaaaag aacaatcctt tgaaatttct 2940 gacaaacgtc cgattagcat gttgaactgt gattacaaac tttttgctaa aatcattatg 3000 aatcgattac aaccccttat ggattcttta attggaccag gtcaagctgc aggtatttct 3060 gaaaaatctt gtatcacaaa tctaaaactg cttagaaatc taattatcaa agcaaaaaaa 3120 tctaaatcat taaaagtact tgttgcaagt ttcgatttag aaaaggcttt cgatagggtt 3180 gatcatcatt ttctttggct aaccttagaa aaatttggtt tcccaattca atttgttaac 3240 ttaattcgta atctgtataa aaatgcttcc tctaagatac tatttaacgg ttttctcact 3300 aataacattt tcataaaatc atctgttcgc caagggtgcc ctttgagcat ggctttattc 3360 atcatttaca tagaaccact tattcgcatg ctttacgatg acgttagagg ttgtctaata 3420 gataatagct ttttgaaagt tattgcctat gcagatgaca ttaatgtcgt tattcgcaat 3480 aatcacgaat ttgataagac cttggagctc atcaactatt tcagcatttt tgctaaaatt 3540 aggttgaatg ttggaaagtc tcaatacatg agactaaaca attgcctctc gggtccacat 3600 gttctcaaag aggtctcatc gctcagaata ctaggaatag atttatctga aaattttaac 3660 actatcattg aaaacaatta ctctagaatt atacttaaca ttaagttttt aatttcgatt 3720 catcataaaa gaaaactaaa catttaccaa aaatcatgga ttttaaatca aattatttta 3780 tctaaattat ggtacatagc acaagttttc cctcctaata attcaaatat cgctgaaatc 3840 agaacgatct gcagaaattt tatttttaaa ggtgttggat tatacaaagt taaatttgat 3900 cagttatact tagacgttga tcaaggagga ctatctctcg ttgatgttga aagtaaatgt 3960 aaagcactat ttgtaaaaaa catattattt cctagtcaag acggtatcaa agattcattc 4020 atgttatcac agcaaactaa caacacgctt agtcggaact cacgtgagtg gctaagtttg 4080 gcagaagaaa tatcaaacga tactgattta aacacaagca aaaaaatcta ccgtttgttg 4140 attagtaagc aaaatattaa aatcaaaatc gcaaccgagt tccctgaatt gccgtggcaa 4200 aattattggc agaatataca atcgaatttc atttcctctg aagaaaaaca ttcgctattc 4260 attatgctca atgatcttat tccaacaaaa gcaaaacttt ttcgccataa ggttagggat 4320 actaatacaa atttgtgtga atattgtaat aagttagata catcaaagca tcgtataaag 4380 ttatgtagca gcagtagatt agtttggaat agcgtaaaaa caactataat tagaaaacta 4440 ggcttagaca ttgtagaccc tgaggagttg ctagggaaaa cgtttaaaaa tgaaaaagaa 4500 aaattaggtt tgtggttagt agttgaagca attgcctaca atttgaaaaa ctttggaaat 4560 ggtacagtag cagattttga atgtaaagtt agggagaaac gatggaacaa caagaaattt 4620 tacgacagag tttttggaaa atggatttgt tatatttgaa gatttgaaga cctataaata 4680 aacagccttt aaaccggaaa aaaaaaaaaa aaaaa 4715 // ID Tx1-3_CQ repbase; DNA; INV; 4816 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4816 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 635-635 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 130..1143 FT /product="Tx1-3_CQ_1p" FT /translation="MDKTRKNTLKMVFGNAAKVPAHVDVLRFTANVLKIPA FT TDVHSIYRDENDRCFYVKFLDEPTFVEFTGRIEEQYRFVHSDGQVAVVKLE FT VASRLFRYVRIFNLPPEIEEKDISAVLSQFGTIRQHVRERYPTEYGYTVYS FT GVRGVHMEIAKEIPPNLYIGHFKARLYYEGLKNRCFYCKLEGHIKVNCPKI FT ASLKSSTPAGSFSAIVAGAIAANATDLTLPTDMTTLTPQNKPPIQPRDPDR FT GISPTPEEAGDDSEMDDDDEDEREEESGADESPDLGGDKHRKPRSSSRNVL FT EQLENRSRSRSLIRPEGTGGNGGKNGKEGKGGKGGKGGKGRGRPRK" FT CDS 1377..4643 FT /product="Tx1-3_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MPLVRNIATANLNAINSDVKKSLLKDFIWNNDLDIIF FT IQELSFENFAFLNSHTAIVNISANNKGTGILLRNNLQYSDIILNENGRISS FT VCVDSINXINIYAHSGSNQKKERDTLFNEDILIHLAHGKENVIVGDFNCIL FT LAEDMNGAVKNYCKGLQNLISGLELKDVEQTILKSRVNFTFXRGTSKSRLD FT RFYASNEFINQISCIKTLPLSFSDHHGVLVKLNLKDNAPNIFIGRGFWKLN FT SFFLNDNDILTQFTAMYNGLRNRNSFINLCYWWNEVMKTKSKSFFKKISFE FT SNQQTIREKSFYYRCLNELFEKQKEGENVCSEMKIVKSKLMEIEQNKLRFY FT RYTFRTSNLIENEKLSIYQITSRIKNSTPSKLISLRINNKVTSKTSELKSM FT IFDHFKNIFCKRAESGSDNDDILEHITKTLNDAERDALSKPIELQELKDAI FT FNASNNSSPGPDGLNYDFYKTFFDVLNTDLLNLFNGYLVNNEYPPGHFSAG FT IISLIPKKGDPHDLNNRRPISMLNTDYKIFTKILWNRLQPMMKKLIGPGQS FT ACISETSCVNVLRTLRNVLIKSKQTKHFKNLLLSIDLEKAFDSVDHEVLWK FT ILKKFGLPDSFIRCISRLYQNATSKVLFNGFFTDSFKIECSVRQGCPLSMA FT LFCLYIEPLIRMLFESVKGCLIGDCFVRVIAYADDISLLIRNDHEFDKALE FT LINYFSIYAKIKINTVKSQFLRFNNCSSGPHQIKEVDNLKILGIIFHRNFD FT TTISENYKDLINKLKYTISLHHRRKLNLFQKAFILNTYVLSKLWYIAQIFP FT PNNKHIAEIRRICFKFLWQGYFYSVAKNELYLPIYKGGLALEDVEAKCKSL FT FIKNILFFKTNSGSSVDEFMIAQSNNRAVTRNAREWINLAIELKPRTVLNT FT SKLFYNYFIDQMSIKPRMQTEMQNLQWDLIFENLDKKFISSDTKTYLFSLL FT NELIPTRSKMFRHGIAGIDSPNCILCGNLDTITHRIKSCDKSTVIWNWIKN FT LVIVRIRINTRDPEELIALQFNSKSYKKNAGLWLVCEAIRFNLMNYGLDGM FT GCLEKFKKEIRDARWNNKAVFAKYFKNVLNIF" XX SQ Sequence 4816 BP; 1671 A; 850 C; 928 G; 1363 T; 4 other; cagttatcgc tcggacgttg atacgacaac acacgtttct acgagcactc ttcaacacaa 60 tagtcttttg tgttcggcac acgccgtgtg tccttaacct caactaaagg aatttactga 120 aaagcaaaga tggacaaaac ccgtaagaat acgctgaaaa tggttttcgg aaacgcagcg 180 aaagttccag cccacgtgga tgtgctacgg ttcacagcga acgttttgaa gattccggcc 240 acggatgtcc actcgatcta ccgcgacgag aatgatcgct gtttttacgt aaagttcttg 300 gatgagccga ccttcgtgga atttaccgga agaatagagg agcagtaccg tttcgtgcat 360 agcgatggcc aagtggccgt ggtgaaactg gaagtggcca gccggctgtt caggtacgtg 420 aggattttta acttgccgcc agaaatcgaa gagaaagaca tttcggccgt tctgtcccaa 480 tttggaacca ttcggcaaca tgtccgggaa cgctacccaa cggaatacgg ctacacggta 540 tacagtggtg tgcgtggcgt ccacatggag atcgccaagg agataccgcc taacctctac 600 atcggtcact tcaaagcacg cttgtactat gaaggtttga agaatagatg tttctattgc 660 aaactggaag gacacatcaa ggtcaactgc ccgaagatag cgagcttgaa gtcttcgaca 720 ccagccggat cgttcagtgc catcgtcgca ggggcgatcg cagccaacgc aactgatctc 780 acactaccaa ccgacatgac cacgctgaca ccacaaaaca agccaccaat acaaccgcgc 840 gatcccgatc gcggaatttc gccgacgccg gaagaagcag gcgacgacag cgagatggac 900 gacgacgatg aagatgagcg agaggaggaa tccggcgctg acgaatcgcc cgatctcggc 960 ggcgacaagc atcgtaagcc ccgaagtagt tcacgcaacg tactcgagca gttggagaac 1020 cgwtctcgct ctcgcagcct gatccggcct gaaggtactg ggggaaacgg ggggaagaat 1080 gggaaagagg ggaagggtgg aaagggggga aaggggggga agggaagggg tcgaccacgc 1140 aagtagatta agtgatgatg gagaaggagg atagttcgtg cgaaaaagga ggagaatggt 1200 agttggtatg atctgcaact gtgtaagaga agacgaatgt atgtgtgtgt gtgtgggtaa 1260 aaaaacgtaa aaaaaacaaa gaggaggtaa agatgttcaa aatattgtgc gtacaatccg 1320 gttccgatga tttgctttca attttatcaa actttttcga atttcaatag ccacttatgc 1380 cactagttcg taatattgct accgctaatc taaatgctat caattccgat gtgaagaaat 1440 cactacttaa agattttata tggaacaacg acttagacat tatttttatt caagaactat 1500 catttgaaaa ttttgcattc ctgaattcgc atacagcaat tgtaaatatt agcgctaata 1560 ataagggaac tggaattctt ttacgaaata acttacaata ttctgatata attttaaatg 1620 aaaacggaag aatatcatcc gtttgcgttg actctattaa ctwtataaat atttatgcac 1680 atagtggaag caaccagaaa aaggaaagag acacgttatt taatgaagac attttgattc 1740 atcttgcgca tggcaaggaa aatgtcatag ttggtgattt taattgtata cttttggctg 1800 aagacatgaa tggagcagtt aagaattatt gcaaaggact gcaaaattta atctctggac 1860 tagaattgaa agacgttgaa caaaccatat tgaaatcacg agttaacttt acatttwtta 1920 gaggtacatc caaatcacgc ttagacagat tttatgcatc gaatgaattt attaatcaaa 1980 tatcttgtat taaaacgtta ccattaagct tttctgatca ccatggcgta cttgttaaac 2040 tcaatttaaa agacaatgct cctaatattt ttataggtcg tggtttctgg aaactaaatt 2100 ccttttttct caatgacaat gatattttga cgcaattcac ggcaatgtac aatggactaa 2160 gaaatcggaa ttcttttata aacttatgct attggtggaa tgaagtaatg aaaacaaaat 2220 caaaaagttt cttcaaaaaa attagctttg aaagtaatca acaaacaatt cgagagaaaa 2280 gcttttacta tcgttgtctc aatgaacttt ttgaaaagca aaaagaagga gagaatgttt 2340 gttctgaaat gaaaattgtt aaatcaaaat taatggaaat agagcaaaat aaattacgat 2400 tttatcgcta tacttttcga actagtaatt taattgaaaa tgagaaacta agtatttatc 2460 aaataacatc tagaataaaa aattcaacgc catcaaagct tatcagtcta cgtataaata 2520 acaaagttac ttcaaaaaca tcagaactaa aatcaatgat ctttgaccat tttaagaaca 2580 ttttttgtaa acgagctgaa agtggatcag ataatgatga tattttggag catatcacaa 2640 agacgctgaa cgatgccgaa cgggatgccc tttccaaacc aatagaatta caagaactta 2700 aagatgctat tttcaatgct tctaataatt cttcaccagg acccgatgga ttaaactatg 2760 atttttataa gactttcttt gatgttttaa atacagacct gttgaactta tttaatggtt 2820 atcttgtaaa taacgaatac ccaccaggac atttctcagc cggaataatt tcactcatcc 2880 ctaaaaaagg tgatcctcat gaccttaaca ataggagacc cataagcatg cttaatacwg 2940 actataaaat ctttactaaa attctttgga atagattaca acccatgatg aagaaactga 3000 ttggccctgg acagtcagct tgtatctcag agacatcctg tgttaacgtt cttagaacat 3060 tgcgaaatgt attgatcaaa tcaaaacaga ccaaacattt taaaaatctt ttacttagta 3120 tcgatctgga aaaagctttt gacagcgtgg atcatgaagt tctgtggaaa atattgaaaa 3180 agtttggact tcctgatagt tttattcgct gcattagtcg tctctaccaa aatgcaacgt 3240 ctaaagtttt attcaacggt ttttttacag attctttcaa aattgaatgc tcagttcggc 3300 aaggatgccc attgagtatg gctcttttct gtttatacat tgaacccttg attcgtatgc 3360 tttttgagtc ggttaaaggt tgtttgatag gcgattgctt tgtaagagta atagcttacg 3420 cagatgacat tagcttactc attagaaacg atcatgagtt tgataaggcc ctagaactga 3480 ttaactactt tagcatttat gctaaaatca aaatcaacac cgttaaatct cagtttctga 3540 gatttaacaa ctgtagttca gggccacacc agataaaaga agttgataat ttgaaaattc 3600 taggcataat attccatcgt aattttgaca caacaatttc agaaaattat aaagacctaa 3660 tcaacaaact taaatataca atttcactac atcatcgtcg taaattaaac ctttttcaaa 3720 aagcatttat tttaaataca tacgtccttt ctaaactatg gtacatcgct caaatttttc 3780 ctccaaataa caaacatata gctgagataa gaagaatatg ttttaagttt ctttggcaag 3840 gttactttta tagcgttgca aagaatgaat tgtaccttcc gatttataaa ggaggtcttg 3900 cattggaaga cgtagaagca aaatgtaaaa gtttatttat taaaaatatc ttatttttca 3960 aaacgaattc tggttcctcg gttgatgagt ttatgattgc gcaaagtaac aaccgtgctg 4020 taacgagaaa tgcgcgagaa tggattaatc ttgctattga attaaaacct cgaacagtgc 4080 tgaacacgag taaattattt tataattatt ttattgatca aatgagtatc aagccacgta 4140 tgcaaacaga aatgcaaaat cttcaatggg acttaatttt tgaaaatctt gataaaaaat 4200 ttatatcatc tgatacaaaa acataccttt tctcactttt aaatgaactt attccaacca 4260 gaagtaaaat gtttcgtcat ggaatagctg gaatagattc tccaaactgt atattatgtg 4320 gtaatcttga cacaataact catcgaatta aatcatgcga taaatctact gtgatttgga 4380 actggataaa aaaccttgta atcgttcgga ttagaataaa caccagggat cctgaagaat 4440 tgatagcatt acagtttaat tcaaaaagct acaaaaagaa tgcgggtctg tggctggtct 4500 gtgaagcaat tagatttaat ctaatgaatt acggattaga tggcatgggc tgcctagaga 4560 aattcaaaaa agaaatacgc gacgctcgat ggaataacaa agcagttttt gcaaagtact 4620 ttaaaaatgt tttgaacatc ttttaaaatt tgaacaaacc ccacacataa agaaattaaa 4680 gatcatccaa tgattgtata tctagacgat tataaacatc gaaaaagcac aggatcccga 4740 aactagaaaa aacaaattgt aaactaccgg aactatgtta aataaaacct ctttaaatcg 4800 gaaaaaaaaa aaaaaa 4816 // ID Gypsy-57_AA-LTR repbase; DNA; INV; 207 BP. XX AC AAGE02020632; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-57_AA_; KW Gypsy-57_AA-I; Gypsy-57_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020632; Positions 20982 21188. XX SQ Sequence 207 BP; 66 A; 26 C; 61 G; 54 T; 0 other; tgtagggtcg taccctcggt tgaatcctaa atataaataa agctgtttta gttcagtgta 60 ggatgatgga tggtagagag gagagagggg aagaagatca gtgtcagcaa gggaagttgg 120 atgtagcggt tgttgagcaa aatatatact gtgtatagaa ttcgtgagaa acgtgaaagt 180 gtacttctac gagtaatccg aacccca 207 // ID Gypsy-32_CQ-LTR repbase; DNA; INV; 215 BP. XX AC AAWU01031813; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_CQ_; KW Gypsy-32_CQ-I; Gypsy-32_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 444-444 (2011). XX DR Genome; AAWU01031813; Positions 6514 6728. XX SQ Sequence 215 BP; 57 A; 59 C; 47 G; 52 T; 0 other; tgttgggtga gcaccctgca ccacactctg cgacagggtc gcccggacgt caacagctgt 60 caacaccagc cagcgagcgg aagcgagctt ggctgtcatt actcgcccga cagccgacaa 120 ctacaccacg cggaaatatt ttaatttgta gttttaccgt aataaaagta attagtttgt 180 tgtatagtac cacctcgcgt ttcactatta caaca 215 // ID HECTOR repbase; DNA; INV; 638 BP. XX AC U17152; XX DT 03-SEP-1998 (Rel. 3.08, Created) DT 03-SEP-1998 (Rel. 3.08, Last updated, Version 1) XX DE Transposase from DNA transposon Hector element, partial cds. XX KW hAT; DNA transposon; Transposable Element; DNA transposon HECTOR; KW hAT superfamily; HECTOR; transposase. XX OS Musca vetustissima OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Muscoidea; Muscidae; Musca. XX RN [1] RP 1-638 RA Warren D.W., Atkinson W.P. and O'Brochta A.D.; RT "The Australian bushfly Musca vetustissima contains a sequence RT related to transposons of the hobo, Ac and Tam3 family."; RL Gene 154(1), 133-134 (1995). XX RN [2] RP 1-638 RA Warren D.W.; RT "HECTOR."; RL Direct Submission to Genbank (11-NOV-1994)William D. Warren, RL Molecular Genetic Resource Service, CAMBIA, GPO Box 3200, RL Canberra ACT 2601, Australia. XX DR GenBank; U17152; Positions 1 638. XX SQ Sequence 638 BP; 225 A; 102 C; 133 G; 178 T; 0 other; caatgggtcg tcgaggattg tagacctttt tcagctgtca atggatccgg ctttgaaaag 60 ctagtacagc atttcataaa tcttggtgct aagtatgggg aaaatattga cattaaagat 120 ttgcttccca gccaaagaac aatatcgcgg aatatccaaa agacggctga agaaacaaaa 180 aatggaatta aaaatgaaat tgcagaagtt gtgcgaggtg gtggtgcgtc cgctacaata 240 gatatgtgga cggataacta tgtgaagagg aattttttag gtgtcacatt acattaccaa 300 tcaaaatcaa aatttgttga tatggtttta ggtatgaaat cgatggattt ccaaagactt 360 aaatctttat ttaatgagtt tggtgtgcag gatctgggac caataaagtt tgtaactgac 420 agaggatcaa acatattgaa agccctggaa caaaacacac gactgaactg tagcagtcat 480 ctattttcaa atgttcttga aaagtctttc gaagatactg cagaactcaa agaattatta 540 aaggcatgcg caaaattagt taaatatttt aagaaagcta atttgcagca caaattgcct 600 acatcattaa aaagtccatg cccgacccga tggaactc 638 // ID BEL-233_AA-LTR repbase; DNA; INV; 607 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-233_AA_; KW BEL-233_AA-I; BEL-233_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-607 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 922-922 (2011). XX DR [1] (Consensus) XX SQ Sequence 607 BP; 259 A; 76 C; 115 G; 152 T; 5 other; tgtcaacacc gctgggcaat aataacgggg agccggaaag tgatgaacgc aaaacgtcag 60 ggcaaagtac aagaagagga atcatacaaa aataagaaaa aacaacaaac cagtcgagtt 120 taaaggatcg tgaatttatt aaaattatat tttccaaagt twatctactt tctaacagtt 180 aaaaaatcaa atttgwagtg ggaataacct gtaagtaaaa taagaaacaa aacctgaaaa 240 gaaaatatta taaaattaac tcaaattatt gaaatttagm cgattgttac tgaggaagta 300 gagttcggaa aaacaawcta aaatcaagcg gacaattaat ggatttgaag taaattagta 360 tctgaaatcg taagtaaaag aagataattg aaatgttaat gtaaattaat gaatttgtta 420 caaataggtg aagaaatcag tcgaaaagta gaggggacgc agagcacgag gattagaaag 480 gaaactactt tgtaagttat gaaaatgaaa ataatttkac tgtttactta ccttgaataa 540 accattacag tttgagctga ccgataaatt ggaatctgct tcaagatcag ttcccgaagt 600 ccgaaca 607 // ID Gypsy-15_DPu-I repbase; DNA; INV; 4671 BP. XX AC scaffold_27; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_DPu_; KW Gypsy-15_DPu-LTR; Gypsy-15_DPu-I. XX NM Gypsy-15_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4671 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 745-745 (2010). XX DR Genome; scaffold_27; Positions 607857 603187. XX CC Positions [3568-4026] - Integrase core CC 'ACAAG' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS join(382..2574,2578..4560) FT /product="Gypsy-15_DPu-I_1p" FT /translation="MAFKPYGKPPPFDLEEYKDSFDLWHKKWTIFLSLSTI FT DSALDEDQRDVYKAHTLLSCLSTDTLQAVLSMGLTDAQLDNHTVVIDNLRA FT RCNAGRNRHVWRQQFSAKKQGTQQSADDWLCELRDLARKCEFQTDCCARCE FT PTRILGQLIFGVESDEVRVKLLEQGDTLTLDAALTILRTAEASNKQSINLK FT TGDAAAIQGATSTYRRFKQTSGRPPPRDKTGKQPTCDNKPASIKFAGCGNC FT GSKSQCKPLTACPAQGRKCNKCGRSNHYAQVCRSTTPKQQSIYVEPPPPAV FT GAVKTSDLVAVSITPKPTANGERATVVIHALPDTGADIDAIPESLYMQSFS FT SVSLRKGIQPVTAIGNPIVNVGVFSAFIEWTTNDKNSCSLRTDVHVLRELK FT QPVLSKKSQQALGMLPSGYPHVRVGSVAQTPPNASQQQADLRQLVSEQPKI FT FDGTCRPMTGPPCHFQLVENARPVAMRGSRPVSVPLLPKLKAELDALESAG FT IVRRVSKPTEWVHPSLVVMKKNGDIRLVIDFRELNECIIRPNFETATPFQA FT VRTIPPGMRFFTVIDALKGYHQVPLDDPSIDLTTFSTPFGRYQYLRLPFGV FT THAGDDYCRRVSEIFDDLPNCRRIVEDVLIFSATYDEHVEAVRTVFARAAA FT HNVSINTAKIVFAQPAVTFGGYVVEENGFRPDPALRRAISEFPVPKSITDV FT RSFFGLCQQVGHFSDQLAAALDPLSPLLKAGAFEWTTRHDEAFCAARKLLS FT TVHDLAFYDPKRPTSLHVDASRLNGLGFLLKQLDDSKKWRVVQAGSRFISS FT AESRYAMIELECLSAAWAMHKCRQFLEGLPSFELVTDHKPLIPILNSYAMD FT KLDNPRILRLRLKMQRYSFVARWVPGKQNADADALSRAPVDHVTASDELGE FT GLPSFPAKIAMMSLIGAELSTADPTLDPVLEKIKYAASIDPIMLKLRNQIV FT AGFPNDKCNIDIDLRPYWCVKDRLTIDESDDMIVLGPRVVIPQSLRAAILR FT DLAAMHQGATKTRQRARLSVYWPSIDNDIVNATRNCDECSKHLPSLPPEPF FT RPRPPATRPFEQIHADLGEVNGRHFLVIVDSFSGWPHVVAFRDKKTSARAI FT IGHIRTFFSSVGAPVAFWSDNGPQFGAAEFRRFLADWGITPLTSSPHYAQS FT NGRAEAEINTMKTLIRGSWTSGAFDEAKFAKSILLFRNAPRSGGASPAQLV FT FNRPVRDCLPAHRRSFAPEWQKAADTLEKRARRSKELQIAHYNKRTRPLAA FT FVVGNHVLIQHPVSKQWATPGIIVEVGRHRDYLIKTPAGRIFRRNRRFLRP FT RIPTFPATGSAPTATPIQPEPAPAPAPEPVQPPVQHQQQQDVQPPADQHPP FT TNGPAPPVLRRSTRPRQPRRHFFPTDWTQ" XX SQ Sequence 4671 BP; 1126 A; 1370 C; 1139 G; 1036 T; 0 other; tggcgcagtt ggttcgtgtt ctttacctga cttacgtttc tgtgtttgta cattttccct 60 gtgaaagtct ggcttacccg gctctaattt taagcgtgcc gttgtgggtg tgaaacgacg 120 tccagtcaac ccaccgcacg accacgtttt gtcgctacta gttgactcac gttttgtccg 180 tcccttttag ctgaatcgta tcggcagaaa tcagtgacat cctcgtgtgt accgcgtttt 240 ctatagctcg tgtcaattta tcccatcttg ctggcctatt tagtgtcttc tgccgtagtg 300 tatcacgaaa tttcccgagt tcccaacctg gagaggtata gcgagctaga catcggctgg 360 ctttttggaa attacgtgaa catggctttt aagccatacg gtaaaccccc acctttcgac 420 ctggaagaat acaaagattc tttcgatctg tggcacaaga aatggacaat ttttctcagc 480 ctgtcgacga ttgattcagc actggacgaa gatcagcggg acgtgtacaa ggctcatact 540 ctgctatcgt gtttatccac cgacacactt caagccgtcc tttccatggg ccttaccgac 600 gcgcagctcg acaatcacac ggtcgtcatc gacaatctcc gtgcacggtg caacgccggc 660 cgcaaccgcc acgtgtggcg ccagcagttt tccgcaaaga agcaaggcac ccagcaatca 720 gctgacgatt ggttgtgcga gttacgtgac ttggcccgaa agtgtgaatt ccaaaccgat 780 tgttgtgccc gttgtgaacc cacccgtata ttaggccaac tcatattcgg cgtggaaagt 840 gacgaagttc gtgtaaagct ccttgagcaa ggtgacacgt taacgctaga tgcagccctg 900 acaattttac ggacagctga agcctccaac aagcagtcaa tcaacctgaa gaccggagac 960 gcagcagcga tccaaggtgc aacttcaacg tacaggcgat tcaaacaaac ttcaggccgt 1020 ccaccacctc gtgacaaaac gggcaagcag cccacatgtg acaacaagcc ggccagcatc 1080 aagtttgcgg gctgcggaaa ttgcggctcg aaatcgcagt gtaaaccctt gacggcctgt 1140 ccagcccaag gcagaaaatg caacaaatgt ggtcgatcaa atcattatgc tcaagtgtgc 1200 cgcagtacca ccccgaagca gcagagcatc tatgttgagc caccgcctcc ggcagtaggg 1260 gcagtgaaaa cctcagatct agtggcagta tcgataacac caaaacccac ggcaaacggc 1320 gagcgcgcca cagtggtgat tcatgctctt cccgatacag gcgctgacat cgatgccatc 1380 ccggaatcgc tctacatgca gagcttcagt agcgtgtcgc tgcggaaagg catccagcca 1440 gtcacagcca tcggaaaccc aatcgtcaac gtcggcgttt tcagtgcgtt tattgagtgg 1500 acaacgaatg acaaaaattc ctgctcatta cgcaccgacg ttcacgtgtt gcgtgagttg 1560 aagcagccag tgttatcaaa gaaaagccag caggcgctag ggatgcttcc atccggttac 1620 ccgcacgtcc gtgtcgggtc agtggcgcaa acgccaccta acgccagcca acaacaagcc 1680 gacctccgcc agcttgtgag cgagcagcca aaaatttttg acggaacatg ccgaccgatg 1740 actggaccac cctgccattt ccagttagta gaaaatgcca gaccggtggc catgcgaggc 1800 tcccgtccag tgtcagtccc actgctgccg aagctgaaag ccgagctgga cgctctcgag 1860 tccgccggca tagtgcgccg agtgtcgaag ccgactgaat gggttcaccc ttcattggtg 1920 gtaatgaaga aaaacggtga catcaggctc gttattgact ttcgtgaact taacgagtgt 1980 ataatccggc caaatttcga aacagcaaca ccctttcagg ccgtgcgcac cattccgccc 2040 ggaatgagat tttttactgt tattgacgcg ctgaaaggat atcatcaggt gccgctggac 2100 gatccatcga tcgatttgac aacgttttcc acgccttttg gacggtacca atacctccgg 2160 cttccgttcg gagtgacaca cgccggcgac gactattgcc gacgtgtgtc agagattttt 2220 gacgatctcc caaactgtag gcgcatcgtc gaagacgttc tcatcttctc cgccacctac 2280 gacgagcacg tcgaagcagt gcgaacagtg ttcgcccggg cagctgccca caacgtgtcg 2340 ataaatacgg caaaaatagt gttcgcccaa cccgcagtga ctttcggcgg ttacgtcgtc 2400 gaagaaaacg gttttcggcc agatccggcg ctcagaagag caatcagtga attcccagtg 2460 cccaaatcaa tcaccgacgt ccgttctttc tttgggctgt gtcagcaagt tggacatttt 2520 tccgaccagt tagccgccgc tttggaccct ctatctccac tgctcaaagc aggataagct 2580 ttcgagtgga caacgcggca tgacgaagcg ttctgtgcgg caagaaagct cctgtccaca 2640 gtccacgatt tggcatttta cgacccaaaa cggccgacca gtctccacgt cgacgcgtcg 2700 cgtctaaatg gactcgggtt cttactcaag caactcgacg actcaaaaaa atggcgtgtc 2760 gtacaagcag gttcgcgatt catctcaagc gccgagtccc gctacgcaat gattgaactc 2820 gagtgcctaa gtgctgcgtg ggcaatgcat aaatgccgtc agtttttaga aggactcccg 2880 tctttcgaat tggtgacgga tcacaaaccg ttaatcccaa tcctaaacag ttacgcgatg 2940 gataagttag ataatccccg catcctgcgt ttgcgtctga aaatgcagcg ttattcattc 3000 gtcgcccgct gggtgcccgg caagcagaac gccgacgccg acgccctctc acgggcccca 3060 gttgatcacg tgaccgctag tgatgaactg ggagaaggat tgccgtcttt ccctgcaaaa 3120 atcgccatga tgagcctcat cggagccgaa ttgtcgacgg ccgacccaac tctcgaccct 3180 gtgctcgaaa aaattaagta cgcggcatcc atcgacccca tcatgctcaa gctaagaaat 3240 caaatagtcg caggcttccc gaacgacaag tgcaatatag acattgactt gcgtccatat 3300 tggtgtgtga aagaccggct gacaatcgac gaatcggacg acatgatcgt ccttggaccc 3360 cgtgtagtga ttccacaatc tctccgagca gccatattgc gtgatttggc ggcgatgcac 3420 caaggcgcaa caaaaacgcg tcagcgagcc cgactctccg tgtattggcc tagcatcgac 3480 aatgatattg tgaacgccac cagaaattgt gacgagtgca gcaaacatct cccgtcgtta 3540 ccacccgaac cgtttcgtcc gcgcccaccg gcaactcgcc cttttgaaca aatccacgcg 3600 gacctagggg aagtcaacgg ccgccatttc ctcgtcatag tggacagttt tagtggttgg 3660 cctcacgtgg tcgcttttcg cgacaagaaa acttccgctc gcgccatcat cggtcacatc 3720 cgtaccttct tctccagtgt aggagccccc gtcgccttct ggtccgacaa tgggccccaa 3780 ttcggcgcag cagagttccg gcgcttccta gctgattggg gaattacccc actcacgtcg 3840 tcgccacatt acgcccagtc aaacggtcgc gcggaagccg agatcaacac catgaagacc 3900 ctcattcgtg gctcatggac atcgggcgct tttgacgaag cgaaattcgc gaaaagcatc 3960 ttgcttttcc gcaacgcgcc acgctcaggt ggtgcctcac cagcgcagtt agtgtttaac 4020 cgcccggtgc gtgattgttt gccggctcac cgtcgctcgt tcgcgcccga atggcaaaaa 4080 gcagctgaca cgctggaaaa acgggcgaga cgatcaaagg agctgcagat tgcccattac 4140 aacaaacgaa ctcgaccatt ggccgcgttt gtcgttggca accatgtcct gatccagcat 4200 cccgttagta agcagtgggc aacccccgga ataatcgtgg aagttggccg gcaccgagac 4260 tacctcatca agactccggc cggaagaatc ttccgacgca atcggcgttt tctccgtccg 4320 cgcatcccaa ctttcccggc caccggaagc gcgccaactg ctactccaat ccagcctgaa 4380 ccggctccag cacccgcgcc ggaaccggtc cagccaccag tccaacatca acaacaacaa 4440 gacgtccagc cgccggccga ccagcatcca ccaacgaacg gcccagcacc gcccgtccta 4500 cgtcggtcaa cgcgtccccg ccagccgaga cgtcacttct ttccaacaga ttggacacaa 4560 tagccttgtt tacaacttgt attctcccta tgtgtcccgt ctatctctct ttttgtttgt 4620 taggagcaaa gttaacactg gtttagttat ctttgctcgt tgaaaggggc g 4671 // ID hATm-39_HM repbase; DNA; INV; 4141 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-39_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4141 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1933-1933 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1492..3453 FT /product="hATm-39_HM_1p" FT /translation="MKFLRDQKARRKMTLGKKDKLYNRRVEKREQRRLKHP FT QLVHSVEQFSQRDDSTTELQFSSETSDTSTSMDSDCSFDFEQPGPSNDTSQ FT LAKNFVQDIAMTSVSKNISSRDLLHVCTDLVVNSGGNVENFLLSQSTIWRA FT QKKTIHQNAAQHKREVKKAAEKALFPIIAHFDGKIIEDFTNGKKEKRDRFV FT VSVNIDGDIKLLGIPAMEHGTGAAQYEALNNVLEDYGICDDVKGLCFDTTA FT SNTGRHSGTNTRFSQRQESILLELACRRHIYELHLKHFWEQITSGKTAAPE FT NLMFKRFQTNWNCIKETVDPSNLIRFDINSIGNTFLATQIEETIQFCKDAL FT NTDIVPRGDYRELLELTLMYLRPEIFFKVRAPGCVSHARFMSKAIYYLKIQ FT ILSTHLPYDLTVSQRKELKAMAEFISIFYSVWFLRTSLPSAAPFQDVKAYW FT QMAQYKEHVEKHDADSERVLNGIKSTMMSMKRHTWYLDETLVPLALLDPDL FT SNAERENVAKTLFSIPVPDHFQHSEKSNLIKELNFKQEVPPGLARLIGENS FT WFIFSLLNLTRMEDKLWLNSPAPMWCYVDQFKTFSNFVSKLEVVNDSSERA FT VKLVLEFVNNVHNEDDRQELLLAVQQRRDQLRGSGTKEQLQLAYKAICKKR FT KKIT*" XX SQ Sequence 4141 BP; 1402 A; 688 C; 779 G; 1272 T; 0 other; ttagggtgat tcatatttga cgaattttcg aaaatcaagt tggtcctacc tggaatggtt 60 gaccaccaat ataaaaataa ttttaagccc aaaaatttta atgtcacgtg actttaaagg 120 tccccgccgg tcaaatttgt gtacaaaaac gtatcaaact ttaggggtct ggcggggacc 180 tttaaaatca actcagaaag ccaacttttc ttttatatgg tatattttgg actatttatt 240 tataatagag taggaccaaa atgaaatatg gtggtattta ttccacatag tgtggagaaa 300 gtgacgaaaa aaacgaattt ttacataaaa tgacgcacca ctttgtataa ctgacgcttg 360 tttattcttc atatttaggc cgaattttaa aaattaatgt taggactaag cattaataat 420 gtttatatag aaggggcatg tcaatagcta tccgcccccc tcaccctatt ttcggagatg 480 tcttccctcc ccccccccca tactttgttc gttggagacg ggaggggggc aagcaaattt 540 atttaaattt tattgtaaat catattattt aaaatcactt ttttttcttt agaaaaaaag 600 acaatataaa tattaaatta tgataacatt ttgatgcctg aatattttgg aatttttatt 660 agtttccgtg gtaattttat aaactaacaa attgtcgtat ttgcggaaga catattgtcg 720 taattttaaa ttctctgctc tgtgttctgt gtatctggtg tatccataac attctataaa 780 aggtatacta tctgcggtta tatacaaaat tgttattgta aaatacatgt catttaatca 840 aagaatgaat atgaattgtt ttctttaatc aaattactgt ttagaaacca agatgccaag 900 atcaacaaga cttaaaacaa aggcttatct ttttgaagaa gaggatctta tagaaaacct 960 accagaaata atgttttcaa caaataaaga cattattctc tactttcagt ttagaagatc 1020 tggagacaaa ataactccaa caaaaaagct gattgcttgt accgttggtc atcaaacagc 1080 aaagtgttct ggtcttggta gctgtcctga aaactgcatt atgtttgcag ttaagaaacc 1140 ttggatagac ggaggatacg aggatgggat actcactgac ccatttataa ggtaggttaa 1200 acaaaattta attcatattt aatggacagg ttaccataaa atacactaca aaaattttgg 1260 ataatataat attttctaaa ccagttatat attattatag gaagcatatc actgatctga 1320 tcaacagctg ggagagttta aagaaaagta agtgtaagaa ctcgaaagaa gcaggaaaag 1380 ctagacaaga tttcaaaaaa aaaaggaaat aaggttttct ggattgctaa accaaatatc 1440 caagatattc taaagaagac aagccttaat aaggtctcgt atcgtactga tatgaaattt 1500 ttgagagatc aaaaggccag acgtaaaatg actctgggta aaaaagacaa gctatataac 1560 cggagggttg agaagagaga acagagaaga ttgaaacatc cacagttagt acatagtgtt 1620 gaacagtttt ctcagagaga tgattcgact acagagcttc agttttcttc tgaaactagt 1680 gatacgagta ccagcatgga ctcagattgt tcttttgatt tcgaacagcc aggtccaagt 1740 aatgatacgt cgcagttagc caagaatttt gtacaagata ttgccatgac ttcagtttcc 1800 aaaaatattt cctctagaga tttgttgcat gtgtgtacag atttggttgt taattcaggt 1860 ggaaatgttg aaaacttttt actgtctcaa tcaactattt ggcgagcaca aaagaaaacc 1920 atccatcaaa atgctgcaca gcacaaaagg gaggttaaaa aagctgctga gaaagcactt 1980 ttccccataa tagctcactt tgatggaaaa attatagaag atttcactaa tgggaaaaaa 2040 gaaaaaagag acaggtttgt tgtttcagtt aatattgatg gagacattaa gcttcttgga 2100 attccagcta tggagcatgg aactggagct gctcagtatg aagctttgaa taatgtactg 2160 gaagattatg gaatatgtga cgatgtgaag ggattatgtt ttgacacaac tgcctcaaat 2220 actggaagac attcaggaac caatactagg ttcagtcaaa gacaggaatc tattttgtta 2280 gagcttgcct gcaggaggca tatttatgag cttcatctta aacatttctg ggagcaaatt 2340 acctcaggca aaacagcagc tccagaaaat ttaatgttta aacggtttca gacaaactgg 2400 aattgcatca aggaaaccgt tgatccttcc aatttaataa ggtttgatat caattccatc 2460 ggcaatactt tcttggctac ccagattgag gaaactatcc agttttgcaa ggatgcatta 2520 aacactgaca ttgttcccag aggggattat agggagttat tggaacttac cctgatgtat 2580 ctgcgtcctg aaatattttt taaagttcga gctccaggct gtgtttctca tgctaggttc 2640 atgtcaaaag ccatttacta cttgaaaatt cagatattga gtacgcatct gccctatgat 2700 ttgactgttt cccaaaggaa agagttaaag gcaatggcag aatttatctc gattttctat 2760 tctgtctggt ttctgagaac ctcattacca tcagctgcac catttcaaga tgtaaaagct 2820 tattggcaaa tggcgcagta caaagaacat gtagagaaac acgacgctga ttccgagaga 2880 gttctaaacg gaattaaaag cacaatgatg tcaatgaaga gacacacctg gtacttagat 2940 gaaactttag ttccacttgc tctcctagac cctgaccttt ctaatgcgga gagagaaaac 3000 gttgcaaaaa cgttattctc cataccagtt cctgatcatt ttcagcattc agagaagtca 3060 aacttgatca aagagttaaa ttttaaacaa gaagtgccac caggacttgc tcgattaatc 3120 ggagagaact cttggttcat cttcagtttg ctaaacctga caagaatgga ggacaaactt 3180 tggttaaact ctcctgctcc tatgtggtgc tatgttgacc agttcaaaac attctccaat 3240 tttgtaagta aacttgaagt agtgaacgat agttctgaaa gggctgtaaa acttgtatta 3300 gagtttgtga ataatgttca caacgaggat gatcgtcaag aacttttact agctgtgcag 3360 caaagacgag atcagctccg cgggtcagga accaaggaac agttgcagct agcatataaa 3420 gcaatatgca agaagagaaa aaaaataaca tgatttatat attatattta actattgcat 3480 tatattttat tatgtttaca tatcacgatt ttaatattta tattgccaag tgaagtattt 3540 ttttttaaga aaataaagtg attttaaata atatgattta caataaaatt taaataaatt 3600 cgcttgcccc ccctcccgtc tccaacgaac aaagtatggg gggggggagg gaggggggag 3660 acatctccga aaatagggtg aggggggcgg atagctattg acatgcccct tctatataaa 3720 cattattaat gcttagtcct aacattattt ttcaaaattc ggcctaaata tgaagaataa 3780 acaagcgtca gttatacgta gtggtgcgtc attttatgta aaaattcgtt tttttcgtca 3840 ctttctccac actatgtgga ataaatacca ccatatttca ttttggtcct actctattat 3900 aaataaatag tccaaaatat accatataaa agaaaagttg gctttctgag ttgattttaa 3960 aggtccccgc cagaccccta aagtttgata cgtttttgta cacaaatttg accggcgggg 4020 acctttaaag tcacgtgaca ttaaaatttt tgggcttaaa attattttta tattggtggt 4080 caaccattcc aggtaggacc aacttgattt tcgaaaattc gtcaaatatg aatcacccta 4140 a 4141 // ID Gypsy-1-LTR_DD repbase; DNA; INV; 388 BP. XX AC AAFI02000250; XX DT 26-FEB-2009 (Rel. 14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE An LTR portion of the Gypsy LTR retrotransposon from DE Dictyostelium discoideum. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-LTR_DD. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-388 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Dictyostelium discoideum."; RL Repbase Reports 9(2), 629-629 (2009). XX DR [1] (Consensus) XX SQ Sequence 388 BP; 159 A; 41 C; 45 G; 143 T; 0 other; tgtaataatc gcgaacaatt aagcgacata aattaatata taaagttttt aaccctggta 60 aaatacaaat atagaaataa atttgtaact gaaaagacgt taacaaatac aagattctta 120 ctttattctc agaaatttta ccaatggttt ctttagtaaa attaatttta gtataaatag 180 aaaataaaat aaaaattaat tcattgttgt tttattccaa attgtgcaca aacgaaagct 240 ttattatatt atataagaga tataatatta tttgttattt attatttatt attaattatt 300 tctgtgtgtt caagggtcac acaagtatat acgtatatac aaattataaa gagtatccca 360 taggaattgg aatacgacgt tcattaca 388 // ID Gypsy-30_OD-LTR repbase; DNA; INV; 197 BP. XX AC CABV01002978; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_OD_; KW Gypsy-30_OD-I; Gypsy-30_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002978; Positions 12212 12408. XX SQ Sequence 197 BP; 54 A; 50 C; 40 G; 53 T; 0 other; tgtgaggttc tgagaaccct taggctgaag attttatctg aggccacttc tatgcgcgca 60 gcgcacactc accagtcgtc ttgtaaataa aactcgtgca acacctttct cgagttattg 120 tcttttcatt caccacgcaa gagaatacac ttagtgggaa aactatcagc aagccgcagg 180 atttgcagac ccttaca 197 // ID Kolobok-4_HM repbase; DNA; INV; 2782 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2782 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2062-2062 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 418..2247 FT /product="Kolobok-4_HM_1p" FT /translation="MGGSKRKQLSSRIYNNKKAKFRGNRYTKIPSSETKTL FT SFVESATHSASSKKLNIADRDFQDFQSTDFNFVMNFSLLENVISLLKCPNL FT CEDNVFLMFDSSKKYGLSIGLKICCNSCNWKTEFFSSATFCKKNFLEQQEP FT SKSKCSGQKPFDINTRVIVAFREMGKGLSSLEKFCGIMNMKPPMNKKSYNK FT TLHNLLDVYQNLVDQNMKNAADELIPAGETSRDIMCSFDGSWQKRGFTSNN FT GVVSAISVESGKCIDFQIETKTCKLCSIWKLKKYTHPVEYEKFQSSHFQKC FT KISHTGSSSAMESSGVLKLFHRSEKKNNLRYTTYLGDGDSSSYTSVVAAKP FT YGHNVEIKKAECIGHIQKRVGTRLRNLKKSSKEILSDGKKLGGAGRLTENV FT INTLQNYYGKAIRQNIGSLYAMKKSVAAVLFHCSESYDNETRHQFCLRTKD FT SWCKFQADKITGKKTYKENICIPAAVRDFIKPIFIDLGSDVLLEKCLHGKT FT QNPNEALNQLIWKRCPKDIFVERTALCVGVASAVLYFNNGLQFLETLFDKL FT EITVGCNLHNFCLINDSKRISKAEKQCTSIVKLRRKKLRAIQKGFCDQNET FT LEGAVYGSGEF*" XX SQ Sequence 2782 BP; 985 A; 382 C; 458 G; 955 T; 2 other; ggtggtagta caccaataaa agtgaaaaat tttttttttt tttactttca ttatttaacc 60 aatataatca tcgcagattt cgaattttta gttgatttaa ctaaataaaa aaagcttcac 120 atgttaaaac ctgtttataa tgtatcggta taaaataatt tttattttcc ccctagcaac 180 gctttgttta ttccgttgtt atgcgcttca gaaacaagtt aaaaccataa aaaagccata 240 aaatacttac ctagccaact attttaagtg caaagttgta gactgcaacg acttcttgtt 300 aattagttca gyatgatggc tttattgacg taattcgctt cttgtttcaa gctgcattca 360 aactttatta tttatgctgt agtatatctt ttatcttttt acaataaaaa taataatatg 420 ggtggatcaa agagaaaaca attatcaagc agaatataca ataataaaaa agcaaaattt 480 cgtggaaaca gatatacaaa gattccttct tcagaaacga aaactttatc ttttgttgaa 540 tctgctacac attctgcttc ttcaaaaaaa ttaaatattg ctgatagaga ttttcaagat 600 tttcaaagta cagattttaa ctttgttatg aactttagtc tgttggaaaa cgttatatcg 660 ttacttaaat gcccaaattt gtgtgaagat aatgtttttt taatgtttga ttcgtcaaaa 720 aagtatggtt tgagtattgg tttaaaaata tgttgtaaca gttgcaattg gaaaactgag 780 tttttttcat ctgctacatt ttgtaaaaaa aattttcttg aacaacaaga gcctagtaaa 840 agcaaatgta gtgggcaaaa accatttgat attaacacac gtgtcattgt ggcttttcgt 900 gaaatgggta aagggttatc gtcattagaa aaattttgtg gaataatgaa tatgaagcca 960 cctatgaaca aaaaaagcta caataaaact ttacataact tgttagatgt ttatcaaaat 1020 ttagttgacc agaacatgaa aaatgctgct gatgaattaa tccctgcagg tgaaacatca 1080 agagacatta tgtgtagctt tgatggttca tggcaaaagc gtggttttac ttcaaataac 1140 ggagttgttt ctgcaatttc agttgaaagt ggtaaatgta ttgactttca aatagaaaca 1200 aaaacatgta aattatgttc aatatggaag ttaaaaaaat acacacatcc agttgaatat 1260 gaaaagttcc agtcatcaca tttccaaaag tgtaaaatta gtcatactgg atcatcatcg 1320 gctatggaat caagtggtgt tttaaaactt tttcatcgtt ctgaaaaaaa aaacaattta 1380 aggtatacca catatcttgg tgacggagat agtagttctt ataccagtgt tgttgctgct 1440 aaaccttatg gacataatgt agaaattaaa aaagcagagt gcattggtca tattcaaaaa 1500 cgagttggca cgcgtttaag aaacttaaaa aaaagcagca aagaaatttt aagtgatggt 1560 aaaaaactag gaggagcagg tcgtttgaca gaaaatgtta ttaacacatt gcaaaattac 1620 tatggcaaag caatcagaca aaatattggt agtttatatg caatgaaaaa aagtgtagca 1680 gctgttcttt ttcactgttc tgaaagttat gacaatgaaa cacgccatca attttgtttg 1740 cgcaccaagg attcttggtg caaattccag gctgataaaa taactggaaa aaaaacatac 1800 aaagaaaata tttgtattcc tgctgctgtg agggatttta taaaaccaat atttatagat 1860 cttggtagtg atgttctttt agaaaagtgt ttgcacggca aaacacagaa tcctaatgag 1920 gcgctgaatc aattgatttg gaagcgrtgt cccaaagata tatttgtaga aagaactgct 1980 ttatgtgttg gagtagcatc agctgtccta tactttaata atggactaca atttttagag 2040 acgttatttg ataagcttga aataacagta ggttgcaatc tacataactt ttgtcttata 2100 aatgactcta agcgcatttc taaagcagaa aaacaatgca caagtatagt gaaattgaga 2160 cgcaagaaac taagagctat acaaaaaggt ttttgtgatc aaaacgaaac attagaaggt 2220 gcagtttatg gaagtgggga gttttgaaca attttttttt gtttttattt ttaaattgag 2280 tttttctcaa aatcatgttt tatactctgg cgacggagat atttttcaaa ccgataaccg 2340 atttgatctg aaatttggca cacttgtttg taacacatgt atttgcaacc tctactagaa 2400 ttatgtttac agctaattgt gttctacttt tgtgagtgat tttgtgttct caaataacac 2460 caaaacttcc aaatatgact aaaatgcata tattttgact ttttttggtc aaaatccaaa 2520 acttttagtg tatgttgtta ttgcataagt aaaggatcac aagccaaaat ttcagttcaa 2580 atttatttac ggttgctgaa aaaaccctgt cgccagttat tcatttttag gcttttgtct 2640 gtcaataatt atgcgcataa ttaatttttt taacttttta aatttttact tatattttaa 2700 tcatataata tatgttatgc actattaata attgaaaaaa aagttaaatt gcaagatttt 2760 tttttaatgg tatactacca cc 2782 // ID BEL-210_AA-I repbase; DNA; INV; 5551 BP. XX AC AAGE02017420; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-210_AA_; KW BEL-210_AA-LTR; BEL-210_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5551 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017420; Positions 13131 7581. XX CC Positions [4598-5182] - Integrase core CC 'CATGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(221..4042,4046..5551) FT /product="BEL-210_AA-I_1p" FT /translation="MSEEQTPQSTRNITSKNLLDALNEESTSYEPIEVQKL FT SKTRAQKKKAMAQRLEKAIDRRDMAKEKMIRIHEGMTKPDKNIHWLNLQLE FT NLRRCYDEVEKTYLEICDVVPRDQREDFKVQNIQIEEMYDQLYVNIQSEIS FT EWKAREERGKLSALAPAFNPPQQPAPVNNLPPHLHVPLPTFDGNLENWYSF FT KCMFQTIMGRYPNESPAIKLYHLKNALIGSAAGKIDQDVINNNDYSAAWKM FT LEDAYEDERLIIDTHIDALLSLPKMTNENGEELRKLIDTCIKHVDALKNRE FT LPVEGLAEMILINLIAKRLDKETRKLWESQLSQEELPSYVEMIDYLRESVR FT ILQKMTGYADQRTSSTAKQKTKSDQKIQPARNFVQTKEVCHCCNGDHLIYK FT CGQFKELNVSSRYAKVKQAGLCFNCLRRGHRTVDCNSDKFCKSCKRKHHSL FT LHDEKSSSAHKSDGVPSDQTAKPEEHQEVAVQAGSVNCATSLMLKQQVLLS FT TAEVLVAGSGNVSLPCRVLLDSGSDSNLISEAFAKQLDLSMESINLPISGL FT NNAETRVKYKLRTKISSRVNPFNAILDFLVVPTITSNLPMIKVDIRSWSIP FT TSVDLADPSFHVPKEIQMIIGAELFFTLVKNGRIKLADGNPMLVETDLGWI FT VSGPVKGHPSSPQGSICHLNLREEQINRTLVKFWELETVQEASPLTSREQA FT IEKHFQLTHFRDDSGRYIVRLPFNDNKSQLGDSLNIARNRFDRLLRSFCDE FT KKKTRYTEFMSEYQALGHMVEVFDNPVDFYFLPHHAVYKESSTTTKIRVVF FT DASAKTTSGHSLNDALDVGPTVQKDLITILLRFCCFPVVLTADIPKMYRQV FT QIHEDDRKYQRILWLNSKNEIGTFELTTVTYGCSSAPYLATRALMQLAKDE FT ASELPVAAKVIEENSYIDDFLTGGNTDEEVIEIYEQLTEMLRRGGFGVHKF FT CTNSEIIRNRIPTELQEMQANFEDSDINSAIKTLGLIWNQHEDYFRFNVAP FT LDETVPTKRIVLSAVGQLFDPCGYLGPVITMAKMLMQDLWRLKLEWDDVLP FT DEQMKMWNDFRGQLPMVNNLQKKRCVVTGDAKGVELHGFADASLRAYGAVL FT YTRCISPDGTVEVNLVCSKSRVAPLKPMTIPRLELCGALLLARLVDKTISA FT MKIPFKSVTLHLDSQVVLCWLEKSPLALNQFVSNRVAEILELTQSYNWQSV FT RSEENPADLISRGVFPAELLTMEKWWKSSHVLWEHTPSYNEGRIHLNDNEL FT PELKPAVVVAATVKKPSIDMTRLSNFRRLQRAWAFVLRFINNVRTKTRDTS FT PLRAQEMADALHAVMKITQKDEFQDLFHALQSNDKKRNRYSGLAPFVGTDG FT LIRVGGRLKYSSIPYDGKHQILLPEKHHVTVTLIRQLHEDNFHVGQRGLLS FT IVRERFWPVNAKMLIKRIISKCYVCSRNNPHPVTQYMGNLPNYRITPAPVF FT SNTGVDYAGPIYLKEAGRKTVVYKAYICVFVCMATKAIHLEVVSSLTAGNF FT IAALQRFISRRGIVANLYSDNGTTFVGANHELAALRQLFEDQATQRKLNDF FT CVSKGIQWHFIPPRSPHFGGIWEAGVKSAKYHLKRVVGETKLTFEEMSTFL FT TQCEAILNSRPLIPISNDPNDVEVLTPSHFLIQRPALSIPEPSYEEVKIGR FT LSRWQHVQLMKEHFWKRWSTEYLHQLQSRPKWNSGVTAITIGAMVVLKDEN FT IPPHQWCIGRIVATHPGADGIVRVVTIKTPTSEFRRAVTKICLLPSVEPGD FT STGGE" XX SQ Sequence 5551 BP; 1590 A; 1269 C; 1375 G; 1317 T; 0 other; ttttggacca ttcgaaccag ataagtggtt ttcgatggtt ttacgttggt ttctggtgtt 60 atcctgacga aaaagtgaag tttttcgcga tttccaccgt gtggaaaaat ccggcgttcc 120 gccattcgac gcggcgtaca gtagtgtgtg tgtgctcgcg aaccgcattg gaacgtgttc 180 gatacacgtc atcctcgaaa aagtgttgtg aacggtacca atgagtgaag aacagacccc 240 acaatcaact cggaacataa cctccaaaaa tcttctcgat gcgttgaatg aggaatcaac 300 cagctacgaa cccattgaag tgcaaaagtt gtcaaaaaca agagctcaaa agaaaaaagc 360 catggcgcag cggttggaga aggcgattga tcgacgtgac atggctaaag aaaagatgat 420 tcggatccac gaaggcatga cgaaacccga taagaacatt cactggctga acttgcagtt 480 ggagaatctt cgacgctgtt atgatgaagt ggagaaaacg tatttggaaa tctgtgatgt 540 cgttcctcgg gaccagcgag aagatttcaa ggttcaaaac atccagattg aagaaatgta 600 tgatcagctg tacgtgaaca tccagtcgga aatttcggaa tggaaggcaa gggaggaacg 660 aggcaagctg agtgcattgg cacccgcatt caatccgcct cagcaaccgg cgcctgtgaa 720 taatttgcca cctcatctgc acgtcccgtt accaactttc gatggaaatc tggaaaattg 780 gtactctttc aagtgcatgt tccagaccat catgggtaga tacccgaacg aatctccggc 840 tatcaaattg tatcatttga aaaacgccct tattggaagt gccgccggta agatcgatca 900 ggatgtcatt aacaacaatg actattctgc cgcctggaaa atgcttgaag atgcgtacga 960 ggatgaacga ctcataatcg atacgcacat cgatgctttg ctgagtctgc caaagatgac 1020 gaacgagaac ggggaagaat taaggaagct aatcgatacc tgcatcaaac atgtcgacgc 1080 acttaaaaac cgcgaactac cggtggaagg attggctgag atgattctaa tcaacctgat 1140 cgccaagcgt ttggataagg agacgcggaa gttgtgggag tcacaattgt cccaagagga 1200 gttgccatcg tacgtcgaaa tgatcgatta tcttcgcgaa agtgtccgaa ttttgcaaaa 1260 gatgacagga tatgccgatc aacgaacatc gagcaccgca aagcagaaga caaaatcaga 1320 ccagaagatt caaccagcaa ggaactttgt gcaaaccaag gaagtatgcc actgttgtaa 1380 tggtgaccac cttatctaca aatgtggtca attcaaggag ctcaatgtaa gcagcaggta 1440 tgccaaggtg aagcaagcgg gactttgctt caattgtttg cggcgcggtc atcgtacagt 1500 tgattgcaat tccgacaagt tttgtaagag ttgcaaaagg aagcaccata gtcttctaca 1560 cgacgagaaa tcgagttccg ctcataagtc cgacggagtc ccaagcgatc aaacggcgaa 1620 acctgaagaa caccaagaag ttgctgtaca agctgggtca gtgaactgtg ccacgtcgct 1680 gatgctgaaa caacaagtgc tcctttcgac ggccgaagtg cttgtagctg ggtccggaaa 1740 cgtcagcttg ccgtgccgtg tcctgttgga ttcgggttct gattcgaatc tcatttccga 1800 ggcgtttgcc aagcaattgg atctatccat ggaaagcatc aacttaccga tcagcgggct 1860 caacaacgct gaaacccgag ttaagtacaa attgcgtacc aagatcagct cacgcgttaa 1920 tccgtttaac gctattctag acttcctggt ggttccaacg attacttcca accttccgat 1980 gatcaaggtt gacatccggt cttggtctat tccgacgagt gtggatttgg cagatccatc 2040 attccatgtc cctaaggaaa tccagatgat catcggggca gaacttttct ttaccctcgt 2100 gaagaacggc cgcatcaagc ttgcagatgg gaatccaatg ttagtggaga ccgatctggg 2160 ttggatcgtc agtggtcccg tgaaggggca cccaagcagt ccgcaaggaa gcatttgtca 2220 cctcaatcta cgtgaagagc agattaaccg cactctggtg aaattctggg aattggaaac 2280 tgttcaggaa gcatcgccac tgacatcaag ggagcaagcc attgagaagc atttccagct 2340 aacacatttt cgtgacgatt caggtcgata cattgtgaga ttgccattca acgataacaa 2400 gagtcaactc ggcgactctt taaacatcgc ccgaaaccga ttcgatcgtc tactgcgatc 2460 gttttgcgac gaaaagaaga agactcgcta tacggagttc atgtctgaat accaagcatt 2520 aggccacatg gtggaagtgt tcgacaatcc agtcgacttc tattttttgc cccatcatgc 2580 tgtgtacaaa gagtctagca cgactacaaa aatccgtgta gtttttgacg cttcggcgaa 2640 gaccacgtct ggtcattcgt tgaacgatgc tttggatgta ggaccgacag tgcagaaaga 2700 cttgatcacg attctcttgc gcttttgctg ttttccggtt gtattaactg cagacatccc 2760 gaagatgtac cgccaggtac aaatccatga ggacgaccgg aaataccaac gaatcctttg 2820 gctcaactcg aaaaacgaaa tagggacctt cgaactgacg actgttacgt acggttgttc 2880 tagcgcgcct tacctggcga cacgtgctct aatgcaattg gcgaaagacg aagcttcgga 2940 gttgcctgtt gcagcaaaag tgatcgaaga aaacagttac attgatgatt ttctcaccgg 3000 aggaaacaca gatgaagaag taatcgaaat ctacgagcag ttgacggaga tgctacgacg 3060 aggtggattt ggagtccaca aattctgcac caacagtgaa atcattcgga accgtattcc 3120 gacagagctc caagaaatgc aagccaactt cgaagatagt gacatcaaca gtgcaatcaa 3180 gacactgggc ctcatctgga accaacatga agattatttc cgtttcaacg ttgcaccgct 3240 tgacgaaaca gtgccgacca aaagaattgt tctatctgca gtcggccagc tcttcgaccc 3300 gtgcggttac cttggtccag tcatcacaat ggcaaagatg cttatgcagg atttatggcg 3360 gctgaaactg gaatgggatg atgtattacc ggatgaacaa atgaaaatgt ggaacgactt 3420 tcgaggacag ctaccaatgg tgaataattt gcagaaaaaa cgatgtgttg ttacgggcga 3480 cgctaaagga gtggagcttc acggatttgc agacgcgtcc cttcgcgcat acggggcagt 3540 attgtacacg aggtgcatct ctcccgatgg aaccgtcgaa gtcaacctgg tatgcagtaa 3600 atcgcgcgtg gcgcccctca aaccaatgac gattccacgc ctggagttgt gtggagcgct 3660 gcttcttgct cgactggtgg ataaaacgat atcagcaatg aaaatcccgt tcaagagtgt 3720 tacacttcat cttgattcgc aagtagtgtt atgctggctg gaaaaatcac cgcttgctct 3780 gaatcaattc gtttcgaacc gtgttgccga gatactcgag ttgacgcaat cctataattg 3840 gcaatcagta cgatcagagg aaaatccagc agatttgatt tcgcgaggtg tgtttcctgc 3900 agagcttttg acgatggaga aatggtggaa atcatctcac gttttgtggg aacatactcc 3960 tagctacaac gaaggcagaa tccacttgaa cgacaatgaa cttcctgaac tcaaaccagc 4020 ggtagtggtc gctgctactg tgtagaagaa gccgtcgatt gacatgacca gattaagcaa 4080 ctttcgtcga ctacaaagag cgtgggcgtt tgtattgagg ttcatcaaca acgtccgcac 4140 caaaactcgt gatacttcgc cactcagggc gcaagaaatg gcagatgcac tgcatgcggt 4200 catgaagata acgcagaagg atgaatttca ggatttgttc catgctttgc aaagcaatga 4260 caagaaacgg aacaggtaca gtggactggc tccattcgta ggcacggatg gcctaatcag 4320 agttggtggc agattgaaat attcatcgat cccgtatgac ggaaagcacc agatcctgtt 4380 gcccgaaaaa caccatgtga ctgtcacgct tatccgccaa ttacatgaag acaatttcca 4440 cgtcggacag cgcggattgc tttcgattgt ccgtgaacga ttttggcctg taaacgcaaa 4500 gatgttaata aagagaatca tttcgaaatg ttacgtgtgc tctcgcaaca acccacaccc 4560 agtgactcaa tatatgggaa atctgccgaa ttacagaata accccggcgc cagtattctc 4620 taacaccgga gtcgattatg cgggcccaat atacctcaag gaagcaggaa ggaagacagt 4680 agtgtacaag gcgtatattt gcgtttttgt atgtatggcc actaaggcca tccatctaga 4740 ggtggtttcg agcttaacgg ctgggaattt catagctgct ctacagcggt tcattagcag 4800 gcgtggaata gttgctaatc tttattccga taatggcact acgttcgtag gcgctaatca 4860 tgagctggcg gctttacgcc aactattcga ggatcaagca acccaaagga agctgaatga 4920 tttctgtgtc tccaaaggta ttcaatggca cttcatacct ccgcgcagcc cccacttcgg 4980 tggcatctgg gaggctggcg tgaagtcagc gaaatatcat ctgaaacgag ttgttggcga 5040 gacaaaacta acttttgaag aaatgtcgac attcctgacg caatgtgagg ccatcctgaa 5100 cagccggcct ctaattccga tttccaacga ccccaatgac gttgaagtgc tgacgccatc 5160 acacttcctt atccaaagac cagctttgag cattccggag ccatcatacg aagaggtgaa 5220 gattggccga ctcagtcgtt ggcaacatgt ccagctcatg aaggaacact tttggaagcg 5280 ttggtcgact gagtacctgc atcagctgca atcacgaccc aaatggaaca gcggagtgac 5340 ggcaataacg atcggcgcga tggttgtgct gaaggacgag aacattccgc cacatcagtg 5400 gtgtattgga cgcatcgtgg caacccatcc aggcgctgat gggattgtac gagtggtaac 5460 catcaagact ccaacatcag aattccgaag agctgtaacg aagatttgtc tgcttccttc 5520 ggttgagcca ggggactcaa cggggggaga a 5551 // ID Gypsy-14_SI-LTR repbase; DNA; INV; 209 BP. XX AC AEAQ01023414; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_SI_; KW Gypsy-14_SI-I; Gypsy-14_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023414; Positions 466 258. XX SQ Sequence 209 BP; 66 A; 36 C; 51 G; 56 T; 0 other; tgtactcgct tgttacgggc ttgagcttag gtatgtaacg gtatgtaagg gttataacaa 60 aaaagcaggc gaaaaaggaa gcaggcagtt gttaaccgac agttgaatat gacggtcgat 120 cgagcgtgcg ttcccgaaat cttgtaacca tgagatagcg aaatacattt atattaattg 180 aagaagagtg tctccctttt aaccctaca 209 // ID Copia-6_DPu-I repbase; DNA; INV; 4372 BP. XX AC scaffold_71; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_DPu_; KW Copia-6_DPu-LTR; Copia-6_DPu-I. XX NM Copia-6_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 675-675 (2010). XX DR Genome; scaffold_71; Positions 404276 408647. XX CC 'TATTT' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 744..1973 FT /product="Copia-6_DPu-I_2p" FT /translation="MYPVDTVAERDTWKKIVGRKLRMNVDHKLNLPTPMEN FT LNLRIIHLSTNHHGSGDYAFQCSISSHSSPKDDNWLVDSGASQHMSDQRWA FT FVNYQRVKPGCWPVNGIRENRKPLQVHGYGTILISSLVEGCWHDGVLQEVL FT HVPKLGANLFSVRSAACNGFGVNFVGEKVEVVKNGKTVAVGASTSNNLYRL FT NVVSGHVATKLLRPQESFAVLARPVLHSIQLWHQRLGHVCISTIKKMAVSK FT MADGLMISEDNQDYFCEGCLFGKQHRLPFPIGRTRGTHVGHIVHSDVCGPI FT SVPAIGGSTYFLTLKDDFSGYCIIHFLKRKSEVMSLFQNFVQRIKVEFGHS FT VGTLRSDNGGEYVGKAFEEWLLENGIRHETTVAYTPEQWSCRESESYHPGV FT GSKHDSFCRPPFETMG" FT CDS 2272..3771 FT /product="Copia-6_DPu-I_1p" FT /translation="MPGGPPCGISDLIADIFHQQTQSNDSQPPFNLLSSPK FT TNRIFDLVPVVCDDDEAGLVSSESATDPVFVPEQIADVFVPEQVADSVHVP FT EQVADPVFVPEPVAEPVFVPEPVAEPVFVAEPVAEPIVRPILQPGVRPRRA FT GLPDQDLIIPGPRIRTPVVRWIEESTTPPYAGLAAVGAIEPTTLEEALASP FT QADQWKLAMEDEMRSLEQNETWTLTKLPAGRQPIQNRWVFKLKLDGDGAVR FT RYKARLVAKGFTQRPGIDFEETFSPVVKHDSLRAVLAVAAERDLNMLQLDV FT KTAFLNGDLNEELYMTQPTGFVASGREGEVCKLNRSIYGLKQASRAWNIKF FT HGFLIKFGFIRSSADPCVYIKKEEDCLTIIAIWVDDGLICGSDIKRLDSVV FT EYLSQNFEMTCEPVDCFVGIQIVRDRGRRTIHLSQENYIARLLEKFNLSDC FT HSRIVPADPFTRLSKNNGQDTSSEEEDLARLYREAVGALIYAVTCTRLDIA FT WAVS" XX SQ Sequence 4372 BP; 1215 A; 944 C; 1080 G; 1133 T; 0 other; ggttatggga ccagaatttc cataccaggt acaaattcct ttttaaataa tcgttgaaat 60 ggcaactcgt caactgacat ctaaggatgt ggctcatata ccaaagttta acggaactaa 120 cttcacaaat tggaagttcc aattgttcat ggtattagag cactttaaag ttttaaaaat 180 tgttttggga gaagagaaga agccagtacc cacagtctca agtgctggtg gcaacaacag 240 cagcaacgat gcagacattg aaatctgggt tgacaaagat caaagcgcaa gaattgccat 300 ctgtggcaca ttggaagtca aatggcagag ctccctcgtc aattgtaaat catctaatga 360 tatgtggaag agactggcta gccagtatga acaagctact gaagaaaacc agtaccagct 420 gcttcaacgt ttctttgact acaaatttca gaagcagcac tcggtcatgg atcacatcac 480 tgctatagaa accatagcag ctcaactaag tgacattggt tcagaaagaa ccgaagctga 540 gatcatcact aaaattatct gcactctccc tccaagtttt cgttctgtca ggtctgcctg 600 ggaaaatgtg gatgattcta agaaaacatt gcagctgttg accacaagat tattgaaaga 660 agaaagttac aatcaagagt tcgagggtgt tcattcagat gcggccttct ttgctaaaga 720 gccaggtttc aagatgaaga agaatgtatc ctgtggatac tgtggcagaa agagacacgt 780 ggaagaaaat tgttggaaga aagctgagga tgaacgtgga tcacaagcta aatttgccca 840 ctcctatgga aaacctgaac cttcgcatca ttcatctcag cacaaaccat catgggtctg 900 gtgattatgc ctttcaatgt tcaatatcta gtcactctag tccaaaggat gacaattggc 960 tggtggactc cggagcttcg cagcatatgt ccgatcagag atgggcattt gtcaactatc 1020 aacgcgtgaa acctggatgc tggcctgtaa atggcatcag ggagaaccgg aagcctctgc 1080 aggtgcatgg ctatggaacc attctaattt catccctagt tgaaggatgc tggcatgatg 1140 gagtcctcca agaagttctt cacgttccta agctgggtgc gaacctcttc agcgtccgtt 1200 cagcagcctg caatggcttt ggagtcaact tcgttggaga aaaggttgaa gtggtaaaga 1260 atgggaaaac ggtagcagta ggagccagta ccagcaacaa cctttatcga cttaacgtgg 1320 tatcaggaca tgttgctact aagctgctgc gaccccaaga atcttttgct gtcctggcga 1380 gacctgttct gcattctatt caactatggc atcaaaggct tggtcacgtg tgcatttcaa 1440 caataaagaa gatggcggtt tcaaaaatgg ctgatggtct catgatcagc gaagataacc 1500 aagattactt ctgtgaaggg tgtttatttg gcaaacaaca tcgtcttcct tttcccattg 1560 gacggaccag aggtacacac gtaggtcata ttgtgcactc tgatgtctgt gggccaattt 1620 ctgtcccagc tatcggagga tcgacttatt ttctcactct taaagacgat tttagtggat 1680 actgcataat ccatttcctc aaacggaaat ccgaagtgat gtctcttttc cagaattttg 1740 tccagcgcat taaagttgaa tttggccact cagtgggtac actgcgaagc gacaacggag 1800 gagaatatgt tggaaaggcg tttgaagaat ggttactaga gaacggtatt cgccatgaga 1860 cgacagtcgc ctacacacct gaacaatgga gttgccgaga gagtgaatcg taccatcctg 1920 gagtcggctc gaagcatgat tcattttgcc ggcctccctt tgaaactatg ggctgaggca 1980 tgcaatacgg ctgtctatct catgaatcga gtctcaacaa agtccatcat cggcaagacc 2040 ccctacgaaa tttggaaagg tgtaaagcca aatttatcac acatccgggt gtttggatga 2100 actgtctacg tccatgtccc gaaggagaag cgaaacaaat tagaaccgaa gtccatcaaa 2160 tgctgccatg ttgggtattg tgaattgcag aaaggattcc gagcgtggga ccaaggaact 2220 ggaaaggtgc ttatcagtcg tgatgtcatt tttcaagaat tggaaaaagg aatgcctggt 2280 gggccaccat gtggaatctc ggatctgatt gctgacatct tccaccagca gactcaatca 2340 aatgatagtc aaccaccctt caacttactg tcgtcaccca aaacaaacag aatttttgat 2400 ctcgttcctg ttgtatgcga tgatgatgag gctggcctag tctctagtga atcagcaact 2460 gatcctgtct ttgttcctga gcaaatagct gatgtcttcg ttcctgaaca agtagctgat 2520 tctgtccacg tgcctgagca agtagctgat cctgtcttcg ttcctgagcc agttgctgaa 2580 cctgtttttg ttcctgagcc agttgctgaa cctgtttttg ttgctgagcc agttgctgag 2640 cctatcgtaa gacccatact tcagccgggc gtgagaccaa ggagagcggg gcttccagac 2700 caggatttaa tcattcctgg gccaaggatc aggactccag ttgtcagatg gatagaagag 2760 tcaacaactc ctccgtatgc tggcttggct gcagtgggtg ccatcgagcc gaccacctta 2820 gaggaagcac tggcgtcgcc acaagctgat cagtggaagc tggccatgga ggatgaaatg 2880 cgatctttgg agcagaatga aacatggact ctcaccaagc ttcccgcggg caggcaaccc 2940 attcagaacc gatgggtctt taaactcaag ctggatggag atggagcagt tcggcgctac 3000 aaggcacggc tggtggcgaa gggattcacg caacgacctg gtattgactt cgaggaaacc 3060 ttctccccag tcgtgaagca cgactctttg cgtgctgtgc tcgctgtggc agccgaaagg 3120 gaccttaaca tgttgcagct tgatgtgaag actgcgtttt tgaatggcga cttgaacgaa 3180 gaactataca tgacacagcc aactggtttt gttgcatccg gaagagaagg agaggtctgc 3240 aaactcaaca gaagtatata cggattgaag caggcatccc gagcctggaa tattaaattt 3300 catggttttc tcatcaagtt cggcttcata cggagcagtg cagacccctg tgtgtacatc 3360 aagaaagaag aagactgcct tacaatcatt gcaatttggg tggatgacgg cctgatttgc 3420 ggcagcgata tcaagagact tgacagtgtc gtggagtatc tatcacagaa tttcgagatg 3480 acgtgtgagc cggtggactg ctttgtgggc attcaaatcg tcagagacag agggagaaga 3540 accattcatc tgtcacaaga aaattacatt gcccgtctgc tagagaagtt caacctttca 3600 gactgtcact ctcgcatagt tccagcagac ccatttactc gtctttcgaa gaacaatggg 3660 caagatactt catccgaaga agaggacctg gctcgtcttt atcgtgaggc ggttggggcg 3720 cttatttatg ccgtcacctg taccagattg gatattgctt gggcagtcag ctaggtggct 3780 caattctctt cacgtcccac tagagcgcac tgggaggctg tgaaacggat actcgcctac 3840 ctgaaaggaa ctcagaccca tggagttact tatggagaca catcagccgg tgaaggagtt 3900 ctacaagcct acagtgatgc tgattttgct gccaatgtgg atgatcgtcg ctcaacgaca 3960 ggcgttgtgt tgatgttgaa cggtggccct gtatcctgga aaagtaaacg tcagagctgt 4020 gtctcattgt ctactactga gtcggagtac gtcgcggcag ccgcagcagc caaagaaatt 4080 gtgtggatga ggcgtctact tcaagatctg ggatgtaatc aactcaagcc tacttattta 4140 ttttgtgata atcaaagtgc tattaagctt gttcgtaacc ctcaatttca ccaacgcacc 4200 aaacatattg atgtaaagtt tcattttatt cgtgatctgc aggaagacaa agtaattgat 4260 gttgtgtatg taaactctga aggacaattg gctgatttgt taacaaaggg actagatggt 4320 ccaaggtttc gcaaattacg agaagagatt ggaatctcgg tttgagtggg tg 4372 // ID Kiri-35_AAe repbase; DNA; INV; 4829 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-35_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4829 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 730-730 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >96% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 275..1081 FT /product="Kiri-35_AAe_1p" FT /translation="MSNHESNKPLTRSTSTSSSSLFTKGDVNKQSKSDQDV FT ESLNDLWNKIRKMFADSKSDIEAKIDSCKEDLEQKIGNIEQQLSVLRTECK FT AEIKNVSDSVTVVRDDLELTRRNVHRSGTINELIVSGIPYTSDENLLEIFL FT NISKALSYNSSEVPMVYLKRLSKLPIKAGSAPPILCQFSLRGVRDEFYARY FT LRLRTLNLCHVGFNNQNRVYINENLAREDREIRTQAIKLKKQGRIQQAFTR FT NGIVFIRVKNGDDAVPYYTLEQLFASVN" FT CDS join(1864..3054,3017..4684) FT /product="Kiri-35_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MVDNIVNNASASTLIPKALFHAVLSNDKINICHVNVQ FT SLCARRFSKFDELKSNFFGSKIDVVCMTETWLSDVITNTMISMEGFNLLRN FT DRNRNGGGICIYFRPNLSCKVIKKSTSNLSNVTEFLLVEVSGAGDPFLLGV FT YYNPPEVDCSDTLKEHFEELTVKYNRTFFIGDFNTDLLKSTPRSRRLNDTI FT STMSYTCVNKEPTFFYSSGCSLLDLLITDSPDVVSNVNQVSMPFVSKHDLI FT FASLNLNKPRQEPIVYRDYKNFNANLLSVAFQRMEWNQLLSITDSDILIEA FT LNNNLKFLHDNFIPIRKRIEKVNPWFNREIEIAIIARDIAYSNWKQSRSEQ FT EFGHFKNLRNRVNLLIRDAKRRYDRRKFQSNLPSKHLWANIKKTRNFQASF FT IRLKKLGISKPRSFDCSLSYNENDINQYFSENFTVDDAPRPFIHYQTRSHG FT FSFRPVEEFEIINAVHEIKSNATGLDDIPIIFIKIMLPLILPYIKHLFNII FT ISSSKFPRGWKTVKIIPIRKKANNDEINNLRPISILCALSKVLEKILKVQI FT SSFISTMNFLHPLQSGFRQNHGTNTALLKVHDDISSVIDKRGIAILLLIDF FT AKAFDRVSHRRLLLKLSEYFRFSVEANKLMHSYLAGRTQAVFHNGKLSSFR FT NIESGVPQGSILGPLLFSLFINDLPSTLKYCSIHLFADDVQIYLCERDSSN FT FCNLANKINHDLHNLYHWSTRNLLPINSSKTKAILISRGRSLNTMPDLYLN FT GEKLNYVNSVNNLGIIFNSNLNWDDQICAQCRKIYGSLKTLSLTTKHFDKS FT TKLKLFKALIFPHFIYGDFIYSNASWSSVDKLRLALNACVRYIFNLTRFSR FT VSHLQKELIGCNFSNFYRYRSCVTLFKLMKLKKPDYLFVKLQPMRSDRNRN FT FLIPQHQSAYYSQSLFARGVAYWNQLPLSIKSSVSISSFKRDLMQYLTL" XX SQ Sequence 4829 BP; 1497 A; 874 C; 871 G; 1585 T; 2 other; gtttctgaag ggatgtgtgc tgtatagtaa agcgcttagt ggttgcagtc aaattgacct 60 taatcgctgt gctaagttca actaatttca tgctgagtgc tgctaatgag ctattagtgt 120 cagtgttata cattgaagtt gacattctgc agttctgaat cawaawatct tcggaaaaaa 180 gaaaactaca agtgctatac cccatatagt atttggattg cccttcagga acgtttatca 240 gccgtcaaca ttggtggacg tcaaacaaaa cgaaatgtca aaccacgaat ccaacaagcc 300 actgacacga tccacatcta cctcatcttc ttcgttgttt acgaaaggcg acgtaaacaa 360 acaatccaaa agtgatcagg atgtggaatc tctcaacgat ctctggaaca agattcgtaa 420 aatgtttgcg gattctaaat cagacattga ggcgaaaatt gattcctgca aggaggattt 480 ggagcagaaa atcggtaaca tcgagcaaca gctgtctgtt ctgaggacgg aatgtaaagc 540 tgaaattaag aacgtttccg attcggttac cgttgtgcgt gacgacttgg agctcacaag 600 aaggaacgtt catcgctctg gtacaatcaa cgagttgatt gtgtctggta tcccctacac 660 ctccgatgag aatctcctgg aaatatttct caacatctct aaggctctat cgtataactc 720 atctgaagta ccgatggtat acctgaaacg tctctccaag ttacccatca aagctggttc 780 tgcccctcct attctatgcc aattctcatt gcgtggtgtg cgtgatgagt tctacgcaag 840 gtatcttcgg ctgagaacac tgaatttatg tcatgttggc ttcaacaatc agaatcgggt 900 atacatcaat gaaaatcttg cgcgagaaga tcgtgaaatt agaactcaag cgatcaagct 960 taaaaagcag ggacgtattc agcaagcttt caccagaaat ggaatcgttt tcatccgagt 1020 gaagaacggt gatgatgctg taccgtacta cacgctggag cagctcttcg catcagtcaa 1080 ttaacctatc caataagttt ctcttctcct tacattattt ccatgaatcc tttccttgtt 1140 tttccgtttg catccattcc ctcctgaaag ctacacattt ttttttcata taatttacct 1200 ttccttgata aatcataatt tcctaccgtg ctttcctatg tttccgttcc ttagatatcc 1260 gtgaattctt ccttcctaaa agtcaataca atgtgcaatg ctgatatcga tcacgagtct 1320 attgatgacc acgatgatga tgacgatgac tatgacgcta ttaactacaa cgaccacgac 1380 gaaaacgact gtgacgacga taacatggac tgcgtataat gatgatgacg atgatgacta 1440 tgacgaccac gacaaaagtt gatggctgat gatgatcatt tttggatggc gtgtggatat 1500 cggatgaatc aaggattgtg ttgctgctgc tgttgttgtt gttgtttgct gtgatggtta 1560 ctgctactgt tgatactgct actgttgttc atttgatttt tttttgcttt tgtttacttt 1620 ttgctcattt gaaaatgatt cagctaccta atttgaattt aacttatctt tgctgttcat 1680 tctttttagt actaagtcaa tttgattgca ataactagtt catgactttc atgttcagaa 1740 atttaatgtt agatattatt attagttgta ggttaagctt tctcatcatt aaaaccaatt 1800 tattattgtt gaattcactg cataggtctc ttttaggttt aacgatatgt ttgatatttg 1860 atcatggttg ataacattgt aaacaacgcc tctgcatcca cgttgattcc taaggcactg 1920 ttccatgctg ttctgagcaa tgacaagata aatatatgtc acgtcaacgt ccaaagtctt 1980 tgtgctcgac gattcagtaa attcgatgag ttgaaaagta acttctttgg tagcaaaata 2040 gacgtagtct gtatgacgga gacctggctt agtgatgtta taactaatac aatgatatcc 2100 atggaaggtt tcaacttgct acggaacgat cgcaatcgaa acggaggtgg tatttgtata 2160 tattttaggc caaatttatc ttgtaaagtt attaagaaat ctacatcgaa tttatcaaac 2220 gtaacagagt ttttgttagt tgaagtaagt ggtgctggtg atccattttt gttaggtgtg 2280 tactataatc caccagaagt tgattgctca gatactctga aagagcattt cgaagagctt 2340 acggttaaat ataatcggac tttcttcata ggagacttca atactgatct tctaaagagt 2400 actcctagat cgcgaagatt aaatgacact atctctacaa tgtcttacac gtgtgtgaac 2460 aaagaaccaa ccttcttcta ttctagtgga tgttctctac tcgatcttct cataacggat 2520 tctcctgacg tagtttctaa tgtcaatcaa gtttcaatgc catttgtttc taagcatgat 2580 ctgatttttg cttccctaaa tttaaacaaa cctagacaag agcccatagt ttaccgagac 2640 tataaaaatt ttaacgccaa tctcttaagt gttgcatttc aaagaatgga gtggaatcaa 2700 ttgttaagta taacagattc cgatattctc atcgaggctc tcaacaataa tttgaaattt 2760 ttgcatgaca actttatacc cattagaaaa agaattgaaa aagtcaaccc gtggttcaat 2820 cgtgaaattg aaattgcaat tatagccaga gatattgcct attctaactg gaaacaaagt 2880 cgtagtgaac aagagtttgg gcatttcaaa aatcttagaa atagagttaa tttgttgata 2940 agggatgcca agcgccgtta tgaccgcaga aaatttcaat caaacttacc tagtaagcat 3000 ctctgggcca atataaaaaa aactaggaat ttccaagcct cgttcattcg attgtagttt 3060 gagctacaat gagaatgata tcaaccaata tttttctgaa aatttcacag tcgatgatgc 3120 tcctagacct tttattcact atcagactag atcacatggg ttttctttcc gaccggttga 3180 agaatttgaa attattaatg ctgtacatga gatcaagtcg aatgctaccg gtttagatga 3240 tattcctatt atattcatta agattatgtt gccccttatt ttaccatata ttaagcattt 3300 atttaatatt ataatttctt catcaaaatt tcctcgtgga tggaaaacgg ttaaaataat 3360 tcctatccgg aaaaaagcta ataatgatga aattaataac ctcagaccta tcagtattct 3420 gtgtgcgctt tccaaagtat tagaaaaaat attaaaagtt caaatatctt cttttattag 3480 taccatgaat ttccttcatc ccttgcaatc aggatttcgt cagaatcatg gtacaaatac 3540 tgcacttctt aaagttcatg acgacatttc ttcagtgata gacaaaaggg gcatagcaat 3600 tctattgctt atagatttcg cgaaagcatt tgaccgcgta tctcatcgaa ggcttttgct 3660 gaagctaagc gaatatttcc gattttctgt agaggcaaac aaattgatgc attcttattt 3720 ggctggtcgt acacaggcag tatttcataa cggaaaattg tcaagcttca gaaatataga 3780 atctggtgtg ccccaagggt caattctagg gccccttttg ttctcattat ttattaatga 3840 cttgccgtcg acattgaaat attgctccat acatctgttc gcagatgatg tccaaatcta 3900 tttatgcgaa cgcgacagtt cgaatttctg caatttagcc aataaaataa atcatgatct 3960 tcacaatctt tatcattggt ccacacggaa tctgctgcct ataaattcgt cgaaaactaa 4020 agccattttg ataagcaggg gaagaagctt gaatactatg cctgatttat atcttaatgg 4080 tgaaaaatta aattatgtta atagtgtgaa taatctgggc ataatattca attcgaattt 4140 aaattgggat gatcaaatat gtgctcaatg tagaaaaatt tatggtagtt tgaaaacatt 4200 gtctttgaca accaaacatt tcgataaatc gaccaaactg aaactattta aggctttaat 4260 cttccctcat tttatctatg gtgattttat ttattcaaat gcttcatggt cttcagtcga 4320 taagttgcga cttgctttga acgcatgcgt tcggtatatt tttaacctaa caagattttc 4380 cagagtatct catttacaaa aagaattaat tggttgtaat ttttctaatt tttataggta 4440 tagatcatgc gttacactgt tcaaattaat gaagttgaaa aaaccagatt atttgtttgt 4500 taagttacaa ccaatgcgaa gtgacagaaa tagaaatttt cttatacctc aacatcagtc 4560 tgcctattat agccaatcgc tgtttgcacg aggcgttgcg tattggaacc aattacccct 4620 tagtattaaa tctagtgtgt ccatatctag ctttaaaaga gaccttatgc agtatttgac 4680 attataaatt ggtaaaattg tgggatagct caataaatat gaaagcataa ttgaagcaat 4740 tgaatcacta ggtgtaacga taaaaaaggc cgtagcctta tgttacatga ataatactga 4800 ataaataaat aaataaataa ataaataaa 4829 // ID BEL-168_AA-LTR repbase; DNA; INV; 642 BP. XX AC AAGE02024719; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-168_AA_; KW BEL-168_AA-I; BEL-168_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-642 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024719; Positions 27305 26664. XX SQ Sequence 642 BP; 209 A; 99 C; 136 G; 198 T; 0 other; tgttacggca tctcacaccg atcccgtacg gttacaccac gattgaatac ccctcgggag 60 atggcgatag atgacagttc aggttgtcat tgagcgtgta atgagagatg tcaaagggaa 120 gatgagggaa atgagataaa atcaaatagg aaggctcgga attgataaac atctttctct 180 taaactgaat taaagctgat taaaggattt attttgctat attttgtaca acagtaagtt 240 gaatatattt cttttctgaa ttgcttataa ttaatgttaa aaacctcatt taggtggtat 300 ttgctgtaaa acaaccgttc tgaatttgta atttgagact agtgtgatag gttgaggtta 360 gataaccgtg agtatgctga aatagagtta aacatacgag ttgtataatg atttaactat 420 tatttatagc ccatttggct caatcgaaag atcttctgat cgttgggtca ctagaagata 480 gattgttggg acattaggag cttagggaga gagaccgatg taagtaagaa tgctctactg 540 ttcagtataa ttattaataa aaatttacag cttttagctc acgcgagtta cacaaaacca 600 actgtgtttg ctcaaaggac cttgaaactc ccacgcccaa ca 642 // ID Cre-2_BM repbase; DNA; INV; 3534 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.1, Created) DT 14-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Cre-2_BM non-LTR retrotransposon - consensus. XX KW CRE; Non-LTR Retrotransposon; Transposable Element; Cre-2_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3534 RA Kapitonov V.V. and Jurka J.; RT "First examples of CRE non-LTR retrotransposons in animals."; RL Repbase Reports 9(10), 2160-2160 (2009). XX DR [1] (Consensus) XX CC Cre-2_BM is a family of non-LTR retrotransposons that belong to CC the CRE clade. The silkworm genome contains >100 copies of CC Cre-2_BM that are ~99% identical to the consensus sequence. Most CC copies of Cre-2_BM are 5' truncated. The silkworm genome contains CC several families of Cre non-LTR retrotransposons. XX FH Key Location/Qualifiers FT CDS 496..3333 FT /product="Cre-2_BM_1p" FT /note="RT and RLE domains." FT /translation="MNSPTRATCPHCPGSTRLFVVPRGLNVHISHMHKDSQ FT IQPSILPRVPCAVTADAHSQTQSVIHTLDDLASYKNKIPVLKHIPKGARNL FT VAGKLRVIMDGCINNNGVADWVSLLSFSYTTLRIPDRGDPRSLTAKVKENA FT NNNSPYFLEDFKHKPRSIVKRIEAKVHEGDLRGAVRLLVSEDSLAPFNDET FT LSALRDKHPTPFRPLSLPPAPDSSCPFLTVNSEDVADAISSFYSGSAPGLD FT GLRPNHLKELISPSAGENGVRLLHSITDLCNFILRGTVNVQVCPYLYGGSL FT CALTKKDGGVRPIAVGSVLRRLAAKLGCRSVRNEMASYLQPHQLGFGTALG FT CEAAIHATRAFAMSRESCNSAIVKLDIRNAFNSLERDIILNEIKEKIPALY FT PFLYQCYSSSSKLFFMESSIDSEVGAQQGDPLGPLIFSLAIHSIIKTVKSP FT LNLWYLDDGTIGGPPDSVLDDIRLLFPKLRDVGLEVNTGKCEVYFSTEVSA FT SVVNDFQELVPGIKVLNKSNFILLGTPIFQDGVPSSLEIKRQLLSSLRGNL FT LSLSSHVALVLLRCCFSMPRLTYLVRTCPTWLFTQDVAQLDALLRDTLEAI FT LNVSLDDRQWCQAVLPIRHGGLGVRQMERTGLAAFLASCYGVVDFVAKLLG FT TNGDGSTIPFASEALAAWGLLCPNEALPVNRSSQRCWDDVLCKLTLSTLLG FT DAAGVELARLKAVSSPESGAWLQALPSPQLGTLLDNNSLRVATALRLGCSI FT CEPHRCVCGSMVDAAGLHGLSCVRSAGRFPRHHALNDIIRRALVSANIPCV FT LEPPGLSRSDGKRPDGLTLIPWEKGRCLLWDATCSCTYAACHLSGTRQCAG FT FAAEASAKTKHAKYDALKSTYLFVPVAVETTGVWSSEAKKFIAAIGHRLRG FT QGHDPRSGSYLVQRLSIAIQRGNAASVMGTFGPGAIQSGLFD" XX SQ Sequence 3534 BP; 837 A; 686 C; 855 G; 1155 T; 1 other; ttttatttta ttttttaaga ttttagtgtg gacgtcattt tattgtaacg ttatagatat 60 aaaaaaaata tattggataa ttgatwctat ttaattatta tttgtggaat tagttttatt 120 ttattagcct gtatgacatt ttatgtaatc ttgatgttaa ttaatattct ttgtgctatc 180 gcacctttgt ggatttgtta gataggtatt caaataattt ttttgtatgt tgtttttata 240 atttaatttt gttttattat gaatgaatgt acttatcgtt tttttttttt ttttttttaa 300 atacttttta attattatta ttttttattt atttactttt tacttttgca aatgccaaaa 360 ttatttacct acttttgtct tattttttgt gcaattgacg aaactgctgg tggataatat 420 taaatgataa attgaatttt gttgttatta atgttatata acatatttta taattattta 480 tatcgatttt aaataatgaa tagtcccaca agggcaacat gccctcattg tccgggttcc 540 acacgccttt ttgtggtgcc ccggggtcta aatgtgcata tttctcatat gcataaagac 600 agccagatac agccgtcgat actgcctcgg gtgccgtgtg ctgtaactgc tgatgctcat 660 tcacaaaccc agagtgttat ccatactctg gacgatttgg catcttacaa aaataaaatt 720 cctgtgctta agcacatccc taagggggcc aggaatttag tcgcaggaaa gctcagagtg 780 attatggacg gctgcataaa caataacggt gtggcggact gggtttcctt gttgtcattt 840 tcttacacga ctctgagaat tccagatcga ggtgaccccc gttcgcttac tgcaaaggtt 900 aaggagaatg caaacaataa tagcccgtat tttttagagg acttcaaaca caagccacga 960 tccattgtca aaagaatcga agccaaggtt catgagggtg atttgcgtgg agctgtccgt 1020 cttttagtgt cggaggattc tttagcacct tttaacgatg agaccttaag cgcattacgt 1080 gataaacatc ctactccttt tcgtccctta tcattgccgc cagcgcctga ttctagttgc 1140 ccttttttga ctgtcaactc ggaggatgtt gcagacgcaa ttagttcctt ttatagcggc 1200 tctgcacctg gccttgatgg tctacgtcca aaccatttaa aggagctcat ttccccttcc 1260 gccggtgaaa atggtgttcg tctattgcac tcaattacag acttatgcaa tttcatacta 1320 agaggtacgg ttaatgtaca ggtgtgtcct tatttgtacg gaggtagctt atgcgctcta 1380 actaagaagg acggcggtgt gagaccaatt gcggtggggt ctgtgctgcg tcgtctcgct 1440 gccaagcttg ggtgtcggtc tgtaaggaat gaaatggcct cgtacttgca gccccaccag 1500 ctgggcttcg ggactgctct tgggtgtgag gccgccattc atgctacccg tgcatttgct 1560 atgagcaggg agagctgtaa cagtgcaatt gtaaagcttg atataaggaa tgcttttaat 1620 agcttagaga gagatataat tttaaatgaa atcaaggaaa agattccggc tttataccca 1680 ttcttgtatc agtgttacag ctcttccagc aagttattct ttatggaatc ttccattgac 1740 tcagaggtag gtgcccagca gggcgatcct cttggcccat taatttttag tcttgctatc 1800 cacagcatta taaagaccgt caagtcacct ttaaacctat ggtatttaga tgacggcaca 1860 ataggtggtc ccccagattc cgtgctagac gacatcagac ttttatttcc gaagttgcga 1920 gatgtcggct tggaggttaa tactgggaaa tgtgaggtat acttttctac tgaagtatca 1980 gcaagcgtag ttaatgattt tcaggagttg gttccaggta tcaaagtttt gaataaaagt 2040 aatttcattt tgcttggtac tccaatattt caggatggag ttccaagttc tctggaaata 2100 aaaaggcagt tattgtcgtc attgcgcggc aacctattga gcttatcatc tcacgtggct 2160 ctcgtcttgc ttcgttgttg cttttcgatg cctcgtttga cgtatttggt gagaacttgt 2220 ccaacgtggc ttttcactca agatgtggcc cagcttgatg ctttgttgag ggatacactg 2280 gaggctattt taaacgtaag cttggatgac agacaatggt gccaggctgt tttacctata 2340 cgtcatggcg gcctgggggt gcgacagatg gagcgtactg ggctcgctgc tttcctggct 2400 tcgtgttatg gggttgtaga ctttgttgct aagttactgg gtacgaatgg tgatggatct 2460 acgattccgt ttgcgagcga ggctttggcg gcttggggcc ttttatgtcc gaatgaggcg 2520 ctcccggtga acaggagctc gcagaggtgt tgggatgatg tgctgtgcaa gctgactttg 2580 tcaactttgc taggagacgc cgctggagtt gaattagcgc gcctgaaagc ggtctccagt 2640 ccggaatctg gtgcttggtt gcaagcattg ccttcaccac agctggggac acttcttgat 2700 aataattcat tgagagtggc cacagctttg agactcggtt gcagcatttg cgagccccat 2760 cgatgtgtct gtggtagtat ggtggatgcc gccgggctgc atggactgag ttgtgttaga 2820 tcggctggcc gcttcccgag acaccatgca cttaatgata ttattcgacg tgctttggtc 2880 tccgcaaata tcccgtgtgt gttggaacca ccaggcctca gcaggtcgga tggtaagagg 2940 ccggatggtt tgactctgat tccttgggaa aagggacgct gcttgttatg ggacgcaacc 3000 tgtagctgta cgtacgcggc gtgtcattta tctgggaccc gacaatgtgc gggctttgct 3060 gctgaggcgt cggctaaaac gaagcatgcc aagtacgatg ctctgaaatc tacatacctg 3120 tttgttccag tggcagtgga aacgacgggt gtgtggagtt cggaggccaa aaagtttatt 3180 gccgcaattg gccaccgcct cagagggcaa ggccacgatc ctcggtctgg gtcgtacctg 3240 gttcagagat tatccattgc catacagcgt ggcaatgctg ctagcgtaat gggtactttc 3300 gggccgggtg cgatccagag tggcttgttt gactagccga tgcctacgcg cgcattgctg 3360 ttagcgtgga cctttatgtt acatatgacg acgcttgcag ctgaaagtaa gtattgtact 3420 ttttgacgaa gcatgttgct gataacttta tttctgtaat ctgacgaagc atgttgcgga 3480 taataacatg tacttttagg ttaggaatat tgtgttcttt ttttttataa taaa 3534 // ID DNAX-7B_AP repbase; DNA; INV; 279 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-7B_AP. XX NM DNAX-7B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-279 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2060-2060 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD TATA or TA. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 279 BP; 58 A; 86 C; 83 G; 51 T; 1 other; ctccggccac tctaagagcg cttctgggcg gcgaccaaaa ttcgtccgtt ctcgcgcatt 60 cccaccacta aactctttaa aacactggcc gccgtcaacg cgagccgccg acaccgtcgc 120 cgtcgctgct cccactcacc tacgccaccg ccgaccgnga actgagtggg ggtatttacg 180 gcggcaacta ggtgatagag tgggagaaat ttgggtacgg aaccgcggag ctcggtcagt 240 tggatgcgcg aaggagtagc gctcttagag tggccggag 279 // ID R2B_NVi repbase; DNA; INV; 5188 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 08-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Nasonia DE vitripennis. XX KW R2; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; R2a_NV; R2B_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RA Burke D.W., Malik S.H., Jones P.J. and Eickbush H.T.; RT "The domain structure and retrotransposition mechanism of R2 RT elements are conserved throughout arthropods."; RL Mol. Biol. Evol 16(4), 502-511 (1999). XX RN [2] RP 1-5188 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [2] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. A partial sequence of this family was previously CC registered as R2a_NV. XX FH Key Location/Qualifiers FT CDS 656..4492 FT /product="R2B_NVi_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="TFAPTHPMVRSGPCRKTKRPGSDYRESLIMDSGNNVA FT SEPRGAVDVTSAAPIGAELNAEPCEGRNQRREAALSAQTRRRNXARRARNA FT QQADEPGDDEEIETHGPLTIRTXEPMEIVAIAKNPQACPKCLQGGTQLLCM FT GSWELSRHINKEHPSVDVTWVCGACQRRCTTLRSWSCHVLHCKGRQEPKDL FT PFKCEHCSLSFDSQIGLSQHERHVHPEVRNDKRAAEANKPKGKSGRRPSIW FT SDEDLLLIRELESEYHGARNINEKIAEHFPDRTGRQVSDARRRKDYAALRG FT RGGPQGPAEGVEAIEEVDEGEIPEGEELVATDGAALESGPPENGGSAPAEQ FT VNAPALESSSQQDRECSPAVGSDEQIEDSSDDDEFSDALGEISLPEPLSVE FT RTTISPPPRDDWKGPMRWEICNASEEAGSYANWVTGLQELVRNNALSEIGL FT DSLYDQLIQIMRHPSDDNEQDRLQLNARGPPRRGHRKNRRRRRLTAADRKR FT FAFARCQDLWNNNPKKLAELVIANDLSILQRRQAPGRTETQTLYNELWGRV FT GPNIEAPRRTEDPIPVSRIFTPITPQEIMGRIRRIKNDSAAGPDGVTKDDL FT RGRGVSIALSKLFNSILLAGYYPKAWRENRTTLLPKPEKDPADVKNWRPIT FT ISSMVSRVYSGLLDQRVRAVIKQCDRQKGFTEENGCFSNIQLLDDAVSNAK FT KAGGVITILDVSKAFDTVPHAVIQGCLEKKGIPETVAAYISSMYRDCSTAI FT RTRSGDVKIGMKRGVKQGDPLSPLIFNLVLEPLLERLQETSGVEIEGMNLS FT CAAFADDIVCFANTAPEAGRQLRMVADYLGRLDMSLSVSKCIAVEYVPHRK FT TWYTKNPGLEVNGNAVPSISPSETFKYLGAKVSPWKGLLEGFESDAFREVI FT SRVQRLPLKPMQKVDLLQMYIFPRYTYGLITSPPAKAVLKTIDRIIRTRIK FT EILHLPESVSSSFLYTPRKQGGLGLLEVEKMVLIAALRNGLRARQSHDPVT FT RAAMNSNAADDRLKSYADALRLHWPLTTKELDTYKYQLRLSYAQKWAEQKW FT QGQGVEEFAQDPVGNSWLQRYDLLPASRYIDAIKLRTNTYPTRALMKIIDG FT RVDSSCRKCQGSSETLGHILGRCRYTKDKRISRHNEIKDLLKARLAKNHQV FT MDEPQITVRGQRFKPDLVVKTNEGRVHVIDVTVRYEHRTYLDEGRTEKIGK FT YRQILSTLRRDLHSNAEEVIPIVIGSRGAIPRETRKALSKLGIGKSDWLTI FT SLIALRSSLEIVNAFMDD" XX SQ Sequence 5188 BP; 1416 A; 1302 C; 1443 G; 1024 T; 3 other; gactagacta tgggttcagt cagtcccaaa tagccgatcc tggcgcgtcc ggcagtaatg 60 ccacgtatga gtcggttacc catctctaaa cgcgtagagg tggggagcta aaggccaggc 120 ggtttacccg acgtcgaatt tctccaggtc tgtgtcagtc gacggaataa aggtactaca 180 acatctacta tctatcggga tcggaagacg ccttacagcg ttttccgatt tttgctcttt 240 gagcattttt cttcaaattg cgataaccga cccgatcacg cggggctttg acaaagcaat 300 gcgtggtcgg taagatggtt gcaatctttt ccacctcgtt tcttttacgg aacgaaagca 360 atgcgtgtgg ggaacgttaa aaactccctt catgcatccc aggatttatc ctgcttactg 420 caaagcaatg cgtgtggagc gactttacca cgagtcgctc caccgcaaag caatgcgtat 480 cgcgcaaaag caatgcgtgt gggggacttg tcaaagatcc cccgccgcaa agcaatgcgt 540 gtcggcacca cgtagagcaa agcgtgtagg cagactttgt caaaagtagt tctgccgcaa 600 agcaatgcgt gtggagatct tcgccggtga aagcaatacg tgtgggcgaa cttagacctt 660 cgcgcccaca catcccatgg tgaggagcgg cccatgtcga aagactaagc gccccggtag 720 tgactaccgt gaaagtctaa taatggacag cggaaataac gttgcctcgg agccgagggg 780 agctgtggat gtgacctcag cagctccaat cggggcggag ttaaacgccg aaccttgcga 840 aggtcgcaac caaaggaggg aggctgcctt aagtgctcaa acacgccggc gaaatncggc 900 ccgccgagct cggaatgccc aacaggctga cgagcccggc gatgatgagg aaatagagac 960 acacgggcct ctaactatcc ggacgncgga gccgatggag attgtcgcaa tagcgaaaaa 1020 cccacaggcc tgtcccaaat gcctgcaggg aggtacccaa cttctctgca tgggcagctg 1080 ggaactaagc aggcacatta ataaagaaca tccgtcagtc gacgtgacct gggtgtgcgg 1140 tgcttgtcaa aggcgctgca caacgctcag gtcgtggagc tgtcatgttc tncactgtaa 1200 ggggcgacaa gaaccaaagg atctgccgtt caaatgtgag cattgcagct tgtcgtttga 1260 ctcgcaaatc ggactctctc agcacgagag gcatgtccat ccagaggtgc gaaacgataa 1320 gcgcgcggca gaggccaata agccaaaggg caagagtggc cgtaggcctt ctatatggtc 1380 cgacgaagac ttgctgctca tccgggaatt agagagcgaa taccacggag ctcgaaatat 1440 caatgaaaaa atagctgaac atttcccgga tagaacaggc agacaggtgt cggacgcccg 1500 gaggcgtaag gattatgccg cgctccgagg gagaggaggc ccgcaaggcc cagcagaagg 1560 agtcgaggcc atcgaagagg tagacgaagg cgaaatccct gagggggaag agctggtcgc 1620 caccgatggc gctgcgttgg aaagcggccc cccggagaat ggagggagcg cacccgcaga 1680 acaagtcaat gcgcccgcgc tggaaagcag ttctcagcaa gatcgagagt gcagcccggc 1740 agtggggtcc gatgaacaaa tcgaggacag cagtgacgac gacgaattca gcgacgcatt 1800 aggagaaata tcactcccag aacctctctc ggttgaacgc acaacaatct caccacctcc 1860 tcgagatgac tggaaaggcc caatgaggtg ggagatttgc aatgcgagcg aagaagccgg 1920 aagttacgcg aactgggtga ctggactgca ggagctagtc aggaacaatg cgctgagtga 1980 aataggacta gactccctgt atgaccagct catccagatt atgcggcacc cttccgatga 2040 caacgaacag gatcgccttc aattgaacgc tagaggcccc ccacgaaggg gccaccgcaa 2100 gaaccgacgg cgccgtcgtc tcacggccgc tgatcgaaag cggtttgcct ttgccaggtg 2160 ccaagatctt tggaacaaca acccaaagaa gctagccgag ttagtgattg ccaatgacct 2220 gtccattctc caaaggcgcc aagcgccagg tagaacggag acacagactc tgtacaacga 2280 gctgtggggg agggtcggac ctaatatcga agcgccaagg cgcaccgaag acccgatacc 2340 cgtatcgagg atcttcactc cgatcactcc ccaagagata atgggcagaa tcaggcgaat 2400 caaaaacgac tcggcagcgg gtcctgacgg ggtaacgaag gacgacctga gaggaagagg 2460 agtcagcata gccctctcca agctgttcaa ctcgatcctg ctagcgggtt actacccaaa 2520 ggcatggaga gagaacagaa caacccttct gccgaagcca gaaaaagatc ctgctgacgt 2580 taagaactgg cggcccatta ccatcagctc aatggttagt cgagtctact caggcttgct 2640 tgaccagcga gtgagggccg tcattaagca gtgtgatcgg cagaaaggat tcacggagga 2700 aaatggctgt ttcagcaaca tacagttgtt ggatgacgcc gtatcgaacg caaagaaagc 2760 gggcggtgtc attactatct tggatgtttc gaaagcattc gacactgtcc cgcatgccgt 2820 gatccaaggg tgcttggaga aaaaaggaat ccccgaaacc gtggccgcct atatctcgag 2880 catgtatcgc gactgctcca ctgcaatccg aacgaggagc ggggacgtaa agattggaat 2940 gaagagagga gtcaagcagg gggatcccct gtcacctctc attttcaatc tggttctcga 3000 acctctatta gaacgattgc aagagacaag tggagtggaa atcgaaggca tgaatctctc 3060 gtgcgcggct ttcgcagacg acatagtatg ttttgcgaat acagcccccg aggcgggaag 3120 gcagctacgg atggtggcgg attatctggg ccgactcgat atgagtcttt cagtgtcaaa 3180 gtgtatagct gtagagtatg tcccccacag gaagacctgg tacactaaaa acccaggcct 3240 cgaggtgaac ggtaatgccg ttccgagcat ctcacctagt gagacgttca agtacctcgg 3300 ggcaaaggtc tctccctgga aggggctgct cgaaggcttc gaatctgacg cgttcaggga 3360 agtcatatcc cgcgtccaaa gactgccgtt gaagcccatg caaaaggtgg accttctaca 3420 gatgtatatc tttccgaggt acacctatgg gttgataaca tcgcctccgg cgaaggcagt 3480 cttaaagact atcgaccgga tcataagaac gagaatcaag gagatcctcc acctgccaga 3540 atcggtaagt agcagttttc tctacacgcc gaggaagcag ggtggattgg ggctccttga 3600 agtggagaag atggtgctga tagccgctct tcggaacggc ttgagagccc gtcaatccca 3660 cgatccggtc acacgcgcgg ccatgaactc gaacgcagcc gacgatcggc taaaatctta 3720 cgccgatgct ctaagactac actggccact aacaaccaag gagctagata cttataagta 3780 tcagcttcgc ctgagctatg cccagaaatg ggctgagcaa aaatggcaag gccagggggt 3840 cgaggagttc gcacaagatc ccgtcggcaa ctcatggctg cagcgctatg atcttctgcc 3900 cgcgtcaagg tacatcgatg ctatcaagct acgaacaaac acgtacccga cgcgagcact 3960 aatgaagatc atagatggac gtgttgatag ctcttgccga aagtgccaag gcagtagcga 4020 gacccttggt catatacttg gcagatgccg gtatactaag gataagcgaa taagccggca 4080 caatgaaatc aaagacctcc tcaaggctcg tctagccaaa aaccatcagg ttatggacga 4140 gccgcagata acggtccgag gccagaggtt taaacccgat ctcgtcgtga aaacgaatga 4200 gggaagggtg cacgtaatcg acgtaactgt ccgctacgag cacagaacct atctggatga 4260 gggccgtact gagaaaattg gcaaatatcg ccaaattctc agcacgcttc ggcgagatct 4320 gcactcgaac gccgaggagg tcattccaat tgtaatcggg tcgagaggtg caattccaag 4380 ggaaacgcgg aaagccctct cgaaactcgg aattggcaag agtgattggc ttacgatctc 4440 actaatagcg ctgcgtagct cgctagagat cgtcaacgcc ttcatggatg actgacctga 4500 acaaaacgtg ttgtcttgtc ttgtctaaaa ctatttattc gaaataaggg gaggctaact 4560 gcctgcaagt tgaacgcgaa agttagacct tcccacctaa agcccaaaag tgatcgggga 4620 atgaatccgc gggtgacccc agagttgggt aaacccttga aacgttggag aagcggaaga 4680 gagtcccgcc accgagcatc gagtgctgcg gcgcccgaat gaaaccgatc gcggatggtg 4740 caagtcgtag gacggggcac gacctaagcc tctgtcacgg cggcgaagcc aggaatcacc 4800 atgcaaaggt gtgaactggg gcggatacct ccacggggtt tccctgggca tcgcgcgagc 4860 gatggccaaa gtccgctttc tcagctacaa aacaaaaatg gtatgagact tcgttaacac 4920 taatttttcc gagcctagca ggctcccttg acaacgctta tgaatctgga aaaggacaca 4980 aagtggaaaa agcgctgatg gtggacaaaa gtcagttgag acttgatatc agttgttttg 5040 actaagaatt ttattatcgt tgacttttaa atattttatt attgactgtt aatatactga 5100 cttgggacca agtcatctct gttacccggt accggttcct gtcatcaaac cggaaagtcc 5160 gtcccacgta atgtggtaga cgcaggag 5188 // ID HERMIT repbase; DNA; INV; 2716 BP. XX AC U22467; XX DT 15-SEP-2005 (Rel. 10.09, Created) DT 15-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Lucilia cuprina transposon Hermit, complete cds. XX KW hAT; DNA transposon; Transposable Element; HERMIT. XX OS Lucilia cuprina OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Oestroidea; Calliphoridae; Luciliinae; Lucilia. XX RN [1] RP 1-2716 RA Coates C.J., Johnson K.N., Perkins H.D., Howells A.J., RA O'Brochta D.A. and Atkinson P.W.; RT "The hermit transposable element of the Australian sheep blowfly, RT Lucilia cuprina, belongs to the hAT family of transposable RT elements."; RL Unpublished (2005). XX DR EMBL/GenBank/DDBJ; U22467; Positions 1 2716. XX FH Key Location/Qualifiers FT CDS 345..875 FT /product="HERMIT_1p" FT /translation="MNLEIIKSIENGVFRLSKKAHSRSKIWEFFCQIEKED FT GKIIDGLVICEKCNRILKYNGKQTSNLIRHNCYVARSRNVALKQVNSADKE FT NYLLACSQWVVEDCRPFSIVDGKGFKKMVQSLLNIGTKYGNHIDIADMVPN FT SRTISRRISDIADDKRSKINEELQMAITSGTASITTDL" FT CDS 965..1882 FT /product="HERMIT_2p" FT /translation="MDFESSTANNVLTKLNSILKNFEVNSMENITFVTNRG FT SNIVKALKGQNRINCANHLLNNVLDSGFNSTSELTPFFGACKKLVKYFKKT FT SLQHILSTSLKNYCVTRWNSHLNLFQSISNNYFKIQEILASINEMQRISDI FT NLALLNALVAIFEKFDIMSKKLQGVNYVTINYVYLTINTLKTICEASDSDI FT EIIKTLKINILREMETKYVANLTILHYAACFLYPPTNSCLDERQLNEVKDF FT CVSELMKCVPSSESSVSSLNFNEPPKDFQLFFSTFMNSINPSMAESISDEV FT NRYSYLKINFDVKF" XX SQ Sequence 2716 BP; 997 A; 392 C; 406 G; 921 T; 0 other; cagagatgtg catgagatga ttcactattg attcctattc gtattatctt tgcttttgtt 60 ttgcttctaa tcatcagcat atcttaatag cattgcttgc attgtttgtt cttttacaat 120 tgattgtcat ttactcaatc acttttaaag tgccgttagc aataacaaat caaaaacaat 180 aattgctgtc ttatttttat gtaaagtatt atgtaaagta ttcaaagtat attcgtaaaa 240 gttcaatgaa aagtgagtac attttacaaa agcagtgttt tttttttatt tgaataaatt 300 ttcgtacgag gtttaaaaac attattttgt gtttttaaga tataatgaat ttggaaataa 360 taaaaagtat tgaaaatggt gtttttcgtc tttcaaaaaa agcgcatagc agaagcaaaa 420 tttgggaatt tttttgtcaa attgaaaaag aagatggtaa aattattgat ggactggtga 480 tatgtgagaa gtgtaatcgc atactaaaat ataatggaaa gcagacgtct aatctcatcc 540 gccacaattg ttatgttgca cgaagccgca acgttgcgtt aaaacaagtt aacagtgcag 600 ataaggaaaa ttatttgtta gcgtgttcgc aatgggttgt agaggactgt cgcccatttt 660 caattgtaga cggcaaaggt tttaaaaaaa tggtacaatc gcttctcaat attgggacaa 720 aatatggaaa tcacatagac atagccgaca tggtgcccaa ttccagaacc atatcacgcc 780 gaataagtga cattgcagat gacaaaagat ctaaaataaa tgaggaattg caaatggcaa 840 taacgagcgg aactgcatcc attactacag atctctgaat tttgtgaaaa gacaattttt 900 atgtgtaaca tttcatttaa taaaagattt aaagttaaaa gaaattgttt tagatgtcaa 960 atcaatggat ttcgagagta gcacagcaaa taatgtgcta acaaaattaa attccatttt 1020 gaagaacttc gaagtaaata gcatggaaaa tataacgttt gttactaata ggggatcaaa 1080 cattgtaaag gctcttaaag gtcaaaatcg gataaactgc gcaaatcact tgttgaataa 1140 cgttttggat tcgggattta acagcacaag tgaactaacg ccattttttg gggcttgtaa 1200 aaagttggtc aagtatttta aaaagacctc tttgcaacat atactatcaa cttctcttaa 1260 aaactattgt gttacccggt ggaattcaca tctaaattta tttcaatcaa tttcaaataa 1320 ttattttaaa attcaagaaa ttcttgcttc aattaatgaa atgcagagaa tttctgatat 1380 caatttagcc cttcttaatg cacttgtagc catttttgaa aaatttgata ttatgagtaa 1440 aaaattacaa ggtgtaaatt acgtcacaat taactacgtt tatttaacca tcaataccct 1500 taaaactatt tgcgaagcaa gcgattcgga catcgaaata ataaagacat taaaaataaa 1560 tattttaaga gaaatggaaa ctaagtacgt tgctaattta actatacttc attatgcggc 1620 atgtttcctg tatccaccaa ccaattcctg cctagatgaa agacaactaa atgaagtcaa 1680 agatttttgt gtttccgaat taatgaaatg tgttccgtcg tctgaaagtt cagtcagttc 1740 tttaaatttt aatgaaccac caaaagattt ccaactattt ttctcaactt ttatgaactc 1800 aattaatccg tccatggcag aaagtatatc agatgaagta aatcgttaca gttatcttaa 1860 aataaacttt gatgttaaat tttaaagtgt tgcaattgtg ggaaagtcat agttcagagt 1920 atcctcgatt atacaagttt gcccaaaaaa tactggcgat acctgcaagc agtgccgcat 1980 ctgaacgagt tttttctgca gcaggtaaca taataactga aaagcgtaac cgtattggcc 2040 caaaaactgt gaataattta ctttttttga attcactttg taaatatgag taataacatt 2100 attgccattt ataatttatt aaaaaaaata gttttaggat tgaactgaat tgcattaact 2160 tttttatttc acaacctttt aaatatagtt aaaatatgct caataaattt tttgtaagat 2220 aatactatgt aatacctata atacattaat aaatgaaata caaatatgtt tatttctaac 2280 gaaatataac atttagtttg atttattata tcttaatttc agaaaataac tttattttaa 2340 atctatttct tatataaaaa tataacatta taattttgaa taatattata tctttataaa 2400 taaaataaaa taagaaaatg caagaaccat gctcctaaat acaatcgtaa ataaattaaa 2460 acatactcac tcaaagaagc agaattttac tcaaaagcat tagcatattc acaacaatca 2520 tatatattca atgctttttg cgaatattat gaatatccag aaactgcaag caattatatg 2580 catttagtaa ctaaactctt gttgtagtga atcatgcaaa ggcacctttc gtttaagtga 2640 tgtatactga tgtctcagtc ttcacttttg caaaataatt aatatgtcgc ttgaatgcca 2700 ttcatgcaca tctctg 2716 // ID Gypsy-186_AA-LTR repbase; DNA; INV; 311 BP. XX AC supercont1.123; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-186_AA_; KW Gypsy-186_AA-I; Gypsy-186_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-311 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.123; Positions 2368797 2369107. XX SQ Sequence 311 BP; 99 A; 57 C; 53 G; 102 T; 0 other; tgatatgtcc tcgaaaatgt ttcaaacatt tctagagtgc atttgctata aagtaaacat 60 tattaatttg tacatgaatt tagtataatc gaataagagt gcatttgcta taaaaacctt 120 tatagtaaat gccttcgaaa taaatcaccc gatatctgtc gggcctcttg gattgtagac 180 accgaacgat aaggacgttt tcccgtgcaa gctccgaaag tctcttcaac cgatctttaa 240 tagtttagaa gtcttactga gtgtgaaccc cctctggtga tttaattgtt ttaaagtata 300 aaacccgtac a 311 // ID Mariner-1_HM repbase; DNA; INV; 3270 BP. XX AC . XX DT 16-MAR-2008 (Rel. 13.03, Created) DT 01-APR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3270 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 218-218 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(619..912,987..2030,1985..2749) FT /product="Mariner-1_HM_1p" FT /translation="MGKSFRVKPTSKSRCNVLRDVIVMKSKQLEEAVEWCQ FT INNKRGWAAINSGRFPLIKDLRTINKRIDGVIVTGNEKEYCSIFTKDEEES FT LVCYIRNKNRLRKLNLYCLVFLYCYFNNLFRCLQGLNKSELTRIMLNVLGV FT RKQACRNLRGRQITKLSPNAQRALQTKKLSKSFWKRWDAKYQKYITKKRQG FT TVSINCALNCTKEMACNHLNELAEELIKCNIFIDANQLETGVWQGDIDTTR FT IYNHDETPQFINYGVDGTPNGLVYAGRGESCQKMIRENRECVTIHPFVSFG FT GDIAMCHIIFKGKGISAHMAPKEAVKRIPNLLISTNDSGSQDHSTLLSAYQ FT MFDHYLEKNNIKRPVVVLSDGHSSRFDSDVLTFLRSKNIRLFITPPDTTGV FT TQLLDQINQKLHSEYRSAKSVLYSLFMTINREGFMNKYLQSYGQSGHQNKQ FT SILAELWPEWASKQTIINAGKKVGISSGGLSVKWMQEDKFLRAELCINENT FT ENNQLSTALTISSPKNVRYGSARYWKHKYESAMEAYQTISDKSISLEEIPN FT LLPVTKVTPKINKENVRVTQVHGSMEGKKILETVSSIKEDKAKKEKANKER FT KSKQQQQIEAFYKCKEECTCGKNVCAATKLRECSCCHNVLKSTCSKASCRG FT ETGLKPNMIKPFCEIGPKKCLQFESDEDNVEDFIPDFIQNTTYHSLFADFG FT L" XX SQ Sequence 3270 BP; 1163 A; 473 C; 562 G; 1068 T; 4 other; tacaggtaaa gggatctcag aatgttaggt gttttttaag attctgctaa actccaaata 60 tttgagcata tactttgttt ttacttttga tagttaaaaa aatactcctg ctagattgcc 120 aaaaagtttt agtagttttt taatacagtt tttatatgtc tttaaataaa tatttaggat 180 atgctttttt cattagggac ctcagaatgt caggttaaat aataggtagt attgttkcca 240 atattgctta gataatcatg aatttctgca tatattattt attttyaatg ctgaatatat 300 ttttaaatgt ttggttcact tatcttttat acttttttca ttccaccaca caatgctagg 360 ttaagggacc tcaaaacgct aggtaaatga aagttaattt atatataaaa aaaagttatt 420 tgcattcaat ttcttctcga cgttatkaaa tgaatattct gcttatagtt tcatcacgaa 480 acctctttca aaacacactt tagtatgttt tctttgtttg tttttgaaca aataaataaa 540 ttaaactaaa gtgcgttttc taaagtatac aacttaagtg tgttttgata ttcctttgca 600 gtagagaata aataagaaat gggaaagtcc tttagagtta aaccaacaag caaatcaaga 660 tgcaatgttc taagagatgt catagtgatg aaaagtaagc agttagagga agcagttgag 720 tggtgccaaa ttaacaataa acgtggttgg gctgcaatta attcaggaag atttccacta 780 attaaagatc ttagaacaat taacaaacga attgatggtg tcattgtaac tggaaacgag 840 aaggaatatt gttcaatttt tacaaaagat gaagaagaat cgttagtctg ttatataaga 900 aacaagaata ggtaatatgt acacataatt gaaaaataat ttttttttaa taaactgtta 960 taaaatcctg gttttcatat tgttaattaa gaaaactaaa cttatactgt ttagttttct 1020 tgtattgtta tttcaataat ttatttagat gcctccaagg tttaaacaag tcagagttga 1080 caaggataat gttgaatgtt ttgggagtaa gaaagcaagc ctgtagaaac ctccgtggac 1140 gtcagattac taagctatca ccaaatgctc aacgagcact ccaaacaaag aagcttagca 1200 agtctttttg gaagcgttgg gatgcaaaat atcaaaagta cattaccaaa aagagacaag 1260 gaactgtatc tataaattgt gcccttaact gcacaaaaga aatggcatgt aaccacctca 1320 acgagttagc tgaagaactt ataaaatgta atattttcat tgatgcaaat caattagaaa 1380 caggtgtatg gcagggagat atagatacca ctagaattta taatcatgat gaaacccctc 1440 agtttataaa ctatggagtt gatggtactc caaatggttt agtttatgct gggagagggg 1500 aaagttgtca aaagatgata agagaaaata gagaatgcgt aactatacat ccttttgttt 1560 cctttggagg agatatagca atgtgccaca tcatattcaa gggaaaaggg attagtgccc 1620 acatggctcc aaaagaagca gttaaacgaa ttccaaatct tctaatctcc acaaatgaca 1680 gtggcagtca agatcacagt acacttctaa gtgcatatca aatgtttgac cactatttag 1740 aaaaaaacaa tataaaacgg cctgttgttg tcctatctga tggtcattct tcaagatttg 1800 acagtgatgt attgacattt ttaagaagca aaaatattag actttttatc actccacctg 1860 acacaactgg cgttacccag ctccttgatc aaatcaatca gaaattgcac tctgagtaca 1920 gaagtgctaa atctgtactt tattctttgt ttatgacaat caacagagaa ggtttcatga 1980 ataaatactt gcagagttat ggccagagtg ggcatcaaaa caaacaatca taaatgcagg 2040 taaaaaagtt ggcatttcaa gtggaggact cagtgttaag tggatgcagg aagataagtt 2100 tttacgagct gagctgtgta ttaatgaaaa taccgaaaat aatcaattat caacagcact 2160 aacaatttca tctccaaaaa atgttcgcta tggctcagca agatattgga aacataaata 2220 cgaaagtgcc atggaagctt atcaaacaat atctgataaa tcaatatcac ttgaagaaat 2280 acctaacttg ttgcctgtaa ccaaagtaac acctaaaata aacaaggaaa atgttagagt 2340 tacccaggtt cacggctcaa tggaggggaa aaaaatattg gaaacggttt cttcaatcaa 2400 ggaagacaaa gccaaaaagg agaaagctaa taaagaaagg aaatccaagc aacaacaaca 2460 aattgaggca ttctataaat gtaaggaaga atgtacctgt ggaaagaatg tttgtgcagc 2520 aactaagctt cgagaatgct cgtgttgcca taatgtttta aaatctacat gtagtaaagc 2580 atcatgtcga ggtgaaacag gcttaaaacc aaatatgatc aaaccatttt gtgaaatagg 2640 acccaaaaaa tgccttcagt ttgagtctga tgaagataac gttgaagact tcattcctga 2700 ttttatccag aatacaacct atcattctct ttttgctgat tttggattat agtttaccag 2760 gatcttattt ttatattttt tgttcataaa ttttttttaa gtttgttaat gttatattaa 2820 aaaaaaatta tcaaaagtaa tcactgaagt ttgttatata gtttttgatt tggtacctca 2880 aaacgttagg tgcagcacct caaaatgcta ggtattttaa aakgcaattt taagaaaatt 2940 ggatttattc tgtgcacgca tctttttagc actgtttttg gagtcttaag gcaacatttc 3000 tgataaggtt ctgcaaactt atttttagaa gggcttgaaa tatttaaaat tatttttaaa 3060 aaaagtgtta atagcttgta gacttttttc aacgtcttta gaatattttt taagtattac 3120 atgtaatatt tttttattta tgacttaaca ttctttattc agattattca acgctttcag 3180 agtttattta aaaaaattga gttctatgtt ggtttatttc aaacctgaca ttctgaggtc 3240 ccactgcatt ctgaggtcct tttacctgta 3270 // ID BEL-137_AA-LTR repbase; DNA; INV; 757 BP. XX AC supercont1.286; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-137_AA_; KW BEL-137_AA-I; BEL-137_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-757 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.286; Positions 1185903 1185147. XX SQ Sequence 757 BP; 274 A; 109 C; 179 G; 195 T; 0 other; tgttgcgacg cgttttggac cagcctaaga aatgaaccgg tctagtcgcg agcataaccg 60 caacatgaca gttggcaaag ttgtagtggt aggcgaagga gagagatggc gaacaaacct 120 gtcgatgtgt agatagtaaa gggtgaaggt aaataaaaac ataagggcca taagtaaata 180 gaggaatttc gccatttcga aactagataa aaagtttatc gttcagtgag tggagtttat 240 ttgcttaaaa tataaaatat ccgggtggta aatagtgcgg acgctggaaa tcaacggtta 300 gagtttgaac agggaatccg tgagtacaac ctgaaaaaag aaaagatgaa aattagtgag 360 aaattaaatg taaactactt acctgcaggc ttggtaaaac caaaaattgt cgacgaaaac 420 ccgtgctaag actgtagttc tgcaattata taaactgtaa gtagaattga aatcagtgga 480 ttgagaaaat tagcaatatt atagttttac gtgtgatgtt attcaacagc aaaaacacca 540 tttgaaatcc ggtgaaactt agggtagttg gaccatagtt ggacaccaat accacggatc 600 ggaagaaatc gaacggcaag acggaaccaa tttattgtga gtaggcgtag attagatcag 660 ttgaaaataa ttaattgaaa tttaataaat tatagtttga gcattgctga aaaccagtca 720 gctgcttctg gaaaattttg gtttcccttc gggaaca 757 // ID hAT-4_AP repbase; DNA; INV; 3428 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-4_AP. XX NM hAT-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3428 RA Jurka J. and Baney O.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1369-1369 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 3428 BP; 1217 A; 470 C; 544 G; 1065 T; 132 other; cagtcttacc ccctaanttg gttccaatat atataaaaat tatgtgcgca aaatttcatg 60 tcgataggtt aagtttaacc cgtgcccccg ggcccttaaa gttcgaaaac actattttat 120 tcaattaacc acggtttttt tacgtanttc attgatatct cgttgtaact cattctatat 180 aaaaatacgt gtangacttt atttatagta aattacattt tctaaaacta ttgtactata 240 catttttttg attaacgtaa tatttattat atatagttga caatgaaaca taaaagtgta 300 taaataacgt aacgatatta aacgaaatta taacgtatct aacatattac taaatatcta 360 acgtcattat attnttacat tttgnaactg aactaattta ttatctgaga taagcaaaca 420 gcactataac aatttaataa tttgatttga ttatctttta atcttttant gtttattcga 480 agctgtttta aagatagacg acaccttaaa actgatagca aatgtttaaa catatttgta 540 tctaatatga cggtcgatta gtacactgcg gaccgcgatc gatatccana caaaatattg 600 cacatatttt tttcatcatg gcttcatcgt ctacttcaaa tattattctt cgtgagcaan 660 ataaaatttt tttaattggt agtanttgca atcaaattat cggtagtaaa ttaccntcaa 720 aaagacaggc gttacgtgta cttttttata atatgcgtga agtaaanctn aacttacacg 780 aaagtgcgaa acttgttatc caagaaatag ttgtattttg gcaaaaagct cgcataccaa 840 tangacaaga atacgataac attaagaagc tggaatcgtt gtacgaagaa tggagaantt 900 tacaaaaaca tgcaacaaga aaaactgaaa taaataagaa aaaacaagaa tgttttgcga 960 acgagttaga tgatctnttc gatattgcac acgcggatgc attggatatn attaaaatng 1020 angaagacag acaattttta ctatctcaaa gagaaaaagg ccgaattggn tgtatgcttg 1080 gtanagataa aaatttacgg aaaactgaag aaagagtagt cactcggtta gaagcagtta 1140 caaaacggaa aaagagagct tatagagaaa ttgaactagc aagtaagact gtaaatttnt 1200 atcgaaccat tttaaattat ttataaattt aatttatttt anaacgtatt tttgacagcg 1260 aaataatttg ttctacgagt tccgatagcg attctgatac tgacgaaaat aattcgagcg 1320 aatcgttnga tattgataac gcgtccgaat tcctagatct tcaaacgttt caaaanagag 1380 gtaagaaaga ttttatgacg ccgaaacttg caatcgcatt ngacaaatgt aaaataagtg 1440 atcgggatgc ggttcatata ttaacagcta ctgcngaagc nttnggtata gatgtaaatg 1500 anttaattat aaatcggncg tcaattaaac gaatncgcga gcgtttacgt aaagatagag 1560 cagatcaact tcgaaaagaa tttaatacat cagaagtagg tccagtcgtt gtncactggg 1620 acggaaaact nctnccagat ttaacnggaa aagaanntgt ngatagncta ccagtnattg 1680 tttcagctnn agatnnagaa caactnctng gtgttccnaa tcttatanct ggnactggag 1740 aagancaagc tgatgcagtg tttcaaacac ttgaagantg gggtntnatc gataaagtnc 1800 aagctttatg ttgtgataca acggcatcga acacgggtcg nttaaanggn gcntgtgtnt 1860 tattagaaca antattagaa cgcgatattt tatacttgcc ntgccggcat catatatacg 1920 aaatngtnct aagaagtgtt ttcgaagtaa aatttgcggn tacgtctggt ccagatgttc 1980 cnatttttaa acgattccaa caancctgga nnaaaatnga tacnaaaaat tataanactg 2040 gattagaaga tgtaattgtc agtgaaaaat taaatgacgt anctaatgtt atgttattat 2100 tttntataga ccaattanga aaatctcaca acagagacga ttacaaagaa cttctggaat 2160 tggccgtnat ttttctnggt ggnanncctc ctaacggtat atcttttnag tatccnggtg 2220 cttttcacca cgccagatgg atggcnaaag cnatttattg tttaaaaata tatatatttc 2280 gnaaacaatt taaattaana gaannagaag aaaacgcant tcgngatatt tgtatttttn 2340 ttgttcgatt atatattaaa gcatggttta gtgcacctgc tgctaccgaa gcaccattac 2400 aagacttacn ntttttaaaa aatctgtgat tatgaaaata tagataaaga natttctgaa 2460 tcagctataa aaaaattntg tggacacttg tggtatctag cgccngaaac agtagcattg 2520 gcattttttg atactaattt gacgatcgaa acgaaaatna aaatggtnga ttcataaaac 2580 taaanaattt aacttccgaa attaataaac gaatcattnt ttcgccgaac gaagtaactc 2640 aaattataaa aaaagaaata natgattttg tntacgtaga atctacgtca tttttttcac 2700 gatttggaat atcaacatna ttcttagann nncacccggn nanttggaan gaaaatgaag 2760 actntcaaaa aggnattgaa attntaaata catttcgtgt tataaacgat gcggcagaaa 2820 gaggngtnaa gctgatggag gaatacaatg ananantnac gaaagacgaa gancaaaaac 2880 aatntntnct acaagttgtc gacgactatc gcagaaaata ccccgaccgt aaaaaaaata 2940 cgttttaaan ccgtttataa ttatagatat tttatttgta cgctataaaa tataaatttt 3000 aaacaaatta attactccat cgaaccaatg aggctatttt ttaataacaa ttaaatgtat 3060 attatntgta gtacttacgt gttgtaatca aatattgttt aaatacatat tatttataat 3120 anttttatac gttttatact taattaacga ttatatntaa aaatataata cgttaatcaa 3180 ataaantacg ggagtacaaa attattgtaa atanaatttc ctataantaa tgtctcgcac 3240 attttatcgt atgaatttgt atttcaagat atgattggtt acatttaaaa aaaccgaaaa 3300 tttttgtaat taactatttt tcaaacttta gagcttcggn ggcacgggtt aaatttaacc 3360 gatccgtctg aaactttttt gtacgcttat tttttanaag attggaatag atttaggggg 3420 taagactg 3428 // ID BEL-3_DWil-LTR repbase; DNA; INV; 332 BP. XX AC scaffold_177548; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_DWil_; KW BEL-3_DWil-I; BEL-3_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-332 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_177548; Positions 947 616. XX SQ Sequence 332 BP; 102 A; 56 C; 62 G; 112 T; 0 other; tgttgaggaa ccaattgtca tttaacttac attgctatac gcagcaaggc aaggactcat 60 gaatgatctt gcacatctca caatgctttt gtttttgtat tttctcttaa gttttctaga 120 atatactggc gccagtctgg tatgctctgt ctctgtcact ctttataaac ctatatgtcc 180 tacgtagttg cgtaagaatt gaaaatttag taataagcaa gcaaacgaac gatcaagatg 240 tgttctggca gtggcagaca cagttcagta acattgaagt aaaaataaag ttaagttctt 300 atgagatatt aaacacagtg gtttattgtt ca 332 // ID Gypsy-20_CQ-LTR repbase; DNA; INV; 102 BP. XX AC AAWU01028573; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_CQ_; KW Gypsy-20_CQ-I; Gypsy-20_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-102 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 420-420 (2011). XX DR Genome; AAWU01028573; Positions 3745 3846. XX SQ Sequence 102 BP; 36 A; 28 C; 11 G; 27 T; 0 other; tgttatacac tgttatacac atacacacac ataaatgtca tccgtcacaa taaacagtca 60 gtccagtacc aactatcatg gcgtcatgtt cacttaccaa ca 102 // ID CR1-30_HM repbase; DNA; INV; 3987 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-30_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3987 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1858-1858 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(185..862,866..3517) FT /product="CR1-30_HM_1p" FT /translation="MPPKNSDINMLLQEMQEKIKQLEKKLDEKDVLIKNLE FT GRVNELENKLDQDKSKDLNNWSEVVKRKVKTNQGQNDIINTVLHEEKERNK FT RKNNIIIFGVKLANHDNIDKKKCDEDMVNNIFDSINADKDKIKKVIRLKTR FT DTSKIPPVIIELDNASDKISVLKAAYINRNKINEIYFNSDMTESERDLIKQ FT LRSEVKLLNASLNQKDEYYYTIRNFKVVKLIKKPKQQSSINLYQPAVMAVL FT STHNPVSNYVKSEVLLCKSIPKYHLCCRYTNATSLNNKFDLFLLEIAVYNP FT DIFIVTETWFNSSSAINVSGYNVFYNNRSKSNSTKSHGGGVAIYVKNKYIA FT YTPYDLASEEMTVEQVWCVIKTENENILIGSIYRPPDATELTNQKINKIIN FT IAKKSVDCNKYSGLLLAGDLNYPNIKWNENDENIQTVLKNDRQAQIFLDNL FT NDNFLTQNVTKPTFNSGTLSEGNTLDLILTDAPERISIIGYNPPLSNLNNA FT HQILYWNYILKKPTTSKSSYGKSKFNYKAGNYQEMNNSFHSIRWSHIFSNK FT NVDECYKIFVNKYQELSDCYIPKKVHKHIALKPYIDKQIKTEIRHKTTLWN FT KLISNKSKSPILVQQYKIQSKLVKNMVRLKRIEYEKKISETSKRNPKLFYS FT YVNSNKKVKGGIASMSNEYNNIVTNRESIANILNNSFNSVCVNEDKHNIPN FT IPQKTEIILEVDINNKITCHTIEKKLKKLDVYKAIGVDGISPYVLNKCHST FT LCEPLHLIYIKSLTEKKLPELWKLANITPIFKKGNTNIASNYRPISLTSIP FT CKVLESLIRDEIMDYLINNSLLNENQHGFRKNKSCTTNLLETLDIITDALE FT NGHSVDMLYLDFEKAFDKVPHSRLITKLQSYGIVRNYLGWIENFLTNRKQR FT VVLGDTVSDWLPVISGVPQGSVLGPLLFLIFINDLPDQVKNPIKLYADDSK FT IINIIKNPNDSISLQNDINNIMAWSSIWLTKFNYEKCKVMHIGKKNPNATF FT IMTDIDCNYNYTLCNTKAERDLGIMVQSDLKWHSQVDSATSKANRMLGTIK FT RSFKYLDVNMVKMLYTTFIRPHLEFAVQAWSPFLKQDIDMTRLRKSKKEQP FT N*" XX SQ Sequence 3987 BP; 1641 A; 592 C; 613 G; 1140 T; 1 other; atwtatacgg aaaaaacaca aaaaaaactt tacgttaaat ttgtgttaat aagtccctgt 60 gtaataaggt ttacaagttc acgacccaga ggtaacccac tgttttaggg tggtagccct 120 cttatcagag aaattgaagt tggtgtattg acgagagttg atttggtttg gtttcactct 180 taaaatgcca ccaaaaaatt cagacataaa tatgttacta caagaaatgc aagaaaaaat 240 aaagcaattg gaaaaaaaac tagacgagaa agatgttttg ataaaaaatc ttgaaggaag 300 ggtaaatgaa ttagaaaata aattagacca agataaatca aaagatttaa ataattggag 360 tgaagtcgta aaaagaaaag ttaaaacaaa tcaaggccaa aatgatatta taaacacggt 420 tctacatgag gaaaaagaac gtaataaaag aaaaaataat attattattt ttggtgttaa 480 gttagcaaat catgataaca ttgacaagaa aaaatgtgat gaagatatgg taaacaatat 540 ttttgactca ataaatgcag acaaggataa aataaaaaaa gtcattcgac tcaaaacaag 600 agacacatca aaaataccac cagttatcat tgagttagac aatgcgtcag acaaaataag 660 tgtattaaaa gctgcataca ttaacagaaa caagataaat gaaatctatt ttaactcaga 720 tatgacagaa tccgaaagag atctcattaa acagctacgt agtgaagtta aattgttaaa 780 tgctagccta aatcaaaaag atgaatacta ctacactata agaaatttta aagttgtgaa 840 actcataaaa aaaccaaagc aatagcaatc ctcaattaat ttgtatcaac ctgcagtaat 900 ggctgtcctt tctacacata atcctgtatc taattatgtt aaatcagaag tattattatg 960 caaatcaata cccaaatacc atctatgttg tagatataca aatgcgactt cacttaacaa 1020 taagttcgat ttgtttttgt tagaaatagc agtgtataat cctgacatat ttattgtaac 1080 tgaaacttgg tttaactcat caagtgccat aaatgtaagc gggtataatg ttttttacaa 1140 caatcgatct aaatctaatt caactaaatc tcatggcgga ggtgtagcta tttatgttaa 1200 aaataaatac attgcttata ctccatatga cctagcatct gaagaaatga cagttgaaca 1260 agtctggtgt gtcattaaaa cagaaaatga gaatatctta attggtagca tatatcgtcc 1320 accagatgct acagaattga caaatcaaaa aataaataaa ataataaata tagctaaaaa 1380 aagtgttgat tgcaacaaat attcaggtct actcttagcc ggagacctaa attacccaaa 1440 tataaaatgg aatgagaatg atgagaatat ccaaacagtg ttaaaaaatg atagacaagc 1500 acaaattttt ttagacaatt taaatgataa ctttttgact caaaacgtca caaaaccaac 1560 ctttaattca gggacactgt cagaaggcaa taccttagat ttgattttaa ccgatgcacc 1620 agagcgtatt tcgatcatag gctataatcc tccgttaagc aatttaaata atgcacatca 1680 gatattgtat tggaattaca tcttaaaaaa accaacaaca tcaaaatctt cctatgggaa 1740 atcaaagttt aactacaaag ccggaaatta tcaagaaatg aacaattcat tccatagtat 1800 aagatggtca catattttct ctaataaaaa cgttgatgaa tgctacaaaa tttttgtaaa 1860 taaatatcaa gaattatctg actgttatat acctaaaaaa gtgcataaac acatagcttt 1920 aaaaccatat attgacaaac aaattaagac tgaaatcaga cataaaacaa ccttatggaa 1980 caaattaata tcaaataaat ccaaatcacc cattttggtt cagcaataca aaatacaaag 2040 taaattagtt aaaaacatgg tgaggttaaa acgtattgaa tatgaaaaaa aaatcagtga 2100 aacatcaaaa agaaatccca agttatttta tagttatgta aattctaaca aaaaagttaa 2160 aggaggaata gcttcaatgt ctaatgaata taataacatt gtaacaaata gagaatctat 2220 agcaaatatt ttaaacaata gtttcaattc agtttgtgtt aacgaagata aacataacat 2280 accaaatata ccgcagaaga ctgaaattat tctagaagta gatataaata ataaaataac 2340 ttgccatact attgaaaaaa aattaaaaaa attagatgtt tacaaagcta tcggagtaga 2400 tgggataagc ccatatgttt taaataaatg tcatagtact ctctgtgaac cattacattt 2460 aatatatata aaatcgttga ctgagaaaaa attacctgaa ctctggaaat tagcaaatat 2520 aacaccaatt tttaaaaaag gtaatacgaa cattgcatct aattacaggc caatctcatt 2580 gacttctatc ccatgtaaag ttttggaatc actcattaga gatgaaatta tggattatct 2640 aataaataac agcttattaa atgaaaatca gcatgggttc agaaaaaata aaagctgcac 2700 aacaaaccta ttggaaactc ttgatatcat aacagatgca ctagaaaatg gccactcagt 2760 tgatatgttg tatctagact ttgaaaaggc ttttgataaa gtaccacact cgcgtttgat 2820 cactaaactt caatcatatg gtattgtgag aaattatctg ggatggatag agaacttttt 2880 aactaacaga aaacagagag tggtccttgg cgacactgtc tcagattggc taccagttat 2940 tagtggtgta ccacaggggt cagttttagg accactgctg tttttgattt ttattaatga 3000 tttgccagat caagtcaaaa accctattaa actttacgct gatgatagta aaataataaa 3060 tattataaaa aatcctaatg attcaataag ccttcaaaat gatataaata atataatggc 3120 atggtcatct atttggctca caaaatttaa ttatgaaaaa tgcaaagtta tgcatattgg 3180 gaaaaaaaat ccaaatgcta catttataat gactgatatt gattgcaatt acaattatac 3240 attgtgtaac acaaaagctg aacgtgattt gggtataatg gttcaatccg atttaaaatg 3300 gcactcacaa gttgattcag caaccagcaa agcaaacaga atgttaggta ccataaagag 3360 aagctttaaa tatttagatg taaacatggt aaaaatgctc tatacaacgt tcatacgtcc 3420 acatcttgag tttgcggtcc aggcatggtc tccattttta aaacaagaca tagacatgac 3480 aagattgaga aaatccaaaa aagagcaacc aaactagcac ctacaattag gaaattatca 3540 tatgaggcta ggttgaatat tatgcaatta acgtcactgg aagaacgtag actacgtgga 3600 gatctgattc aacaatataa aattcagcat ggccacgata aaataaattg ggttgctaac 3660 gattgtggaa atgcattgga ggagggtcca tgtctgcgac aacacaaatt caaaatgagg 3720 aagcctttta gcagaaatac cttcaggtat cattttttcc aaaatcgagt tgtaaatgcg 3780 tggaacagct taccatctga aataatagaa gctacttcaa taaataactt caaaaacaaa 3840 ctagacagag ctggattttg cagaaactat ttgttaaaac acaaacaaaa gacaaaatga 3900 gctgttacag tgcaatatct tatgttattg cacttcgcaa ctgtttaaaa cagttgttac 3960 agttatacct tattattatt atattat 3987 // ID Gypsy-18_DWil-LTR repbase; DNA; INV; 2208 BP. XX AC scaffold_181039; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_DWil_; KW Gypsy-18_DWil-I; Gypsy-18_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2208 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181039; Positions 238478 240685. XX SQ Sequence 2208 BP; 710 A; 389 C; 387 G; 722 T; 0 other; tgtagcgtgc gctgcacttt caaattttca aatttttaaa cttaacgtct ttacgcgcca 60 ctttgccaat agcgatcaat caataaataa aaatattggt atcgctcgat cgataaacgc 120 ataatagggg gatagcgata aatgatctat cgtttttctt gaggtggcaa cgcgcttcat 180 tttttttttt actaggcttc attttcattt ttaactctga ggcgaaaagt gtgaaaaaaa 240 aagctgtagt ccaaatcttt ggttataaat catagcaggt atgtaggtta attagaattt 300 ttaccccgcc gccatctgca tcttactcta tatcctagat tggaaacctt actttttctg 360 gccaatattc gtgatagctt tcgtttggga agattatttt ctggaaaacc ggcgagtttt 420 caaataccca tttctggtgc cggagcacgt gcatccgaat tttcggttgt cgttgccttc 480 tggaattggc attcggacat agagaagaca ccgccaacta aacaccggca gtgccacaac 540 gattctaagt taaggcttac ggtgcggcca aagtgtctca tttcccgaaa ggtgtcctaa 600 ccttcggtga accggatttt tgtatggcaa ggttttaaag gcagtggcct acctaacctt 660 ctgacgttgc tgacccagac aaagtttgtc tttagccaaa ttctggatac tgcttgctgt 720 tgccctattt agcgcattcc gtgtgaccac ctcgaaagct cctacaaatt tttttttgaa 780 attaacctcg tttggaacgg aatctttagc tcccgatatc ttcatttctg atcttatttt 840 taaattttaa ttcctaatca agttaaacaa aaaaaaaaaa gaaagagaat gtaaataagt 900 ccaatattga atcaattaaa agtctcgtta aggttatatt aaaaatttca agtcgagagt 960 aacgtaaaat agagtaaatg taaaagccaa gtgagtgaac ggagtttaat ggtggtcata 1020 aagcgcgaag gagaaatcat catataaaca ccctcccaca gtgcatcgat ggaaggcctt 1080 tcagcgaata aactatgggg taagtgccta gtgtattaag tagtatccat tcggcgacct 1140 tcaaacaagt ttttgcgaac tatggaggat ggtagtcaat tgatactctg acttgatgat 1200 gtaatttctg tttcagcgtt ttgcgatcaa ccagatagtt aaattaaaaa aaaaaaaatt 1260 gatttccgta aataatgttt gacttgaaga gcaatcaaaa gcaaagagtg aatttgtgcg 1320 acttgaaaat ttaacataac tttaaagaat ttaaaacaaa aaaaaaaaaa agataaaaga 1380 aaacttgaaa aagatgaata attacaaaaa aaaaaatgtt aactgaattc ctttaatatt 1440 ccttctatat ttttttttat tctataatct tctacaataa ttaataatgt ataactagta 1500 agctctcttt tatactgcta gttttagtcc ctcttattca atagtttaat taactttgag 1560 ttccatcgtt ttggcaactc tgaaagttca gaatctcaaa atgttgagtc atactagctc 1620 agctgtttaa aattagtctg taaatatttc tttaaaattt attgtataca acaagaaatg 1680 cttctatctg aaattataat gaatttgaat atgattgctt aagctttata attaattttc 1740 ctagtgacat gggtacatga gttgatatct atcagttttt tttgggaatc tttacaaatg 1800 ccaataaaga ttcatggttg ggagggtata gaactgataa ggtccatcta catcttgacc 1860 ccagtaacca ctagcttgtg aacaaaagga tacaccttca tttctttcgc ctttttatta 1920 acatgaattc cttgctgttt tcgtaagtat tttctgtcag tcattctatt ttcatttaaa 1980 gttagtttca gtagagatcc taataaaatt aaattttata attaaaggta gagattcagc 2040 aaacaacccc cccaccccgc aagagctacg gaaccagagg atctcgatct gctgcaattc 2100 acctaccgcc ttctttttgg ttatcgatag ctttatcgtc gctgtgatag gagaaccgat 2160 tatataaaat tcatggcgcg caacgagaaa ccgaatagga aggttaca 2208 // ID Gypsy-139_AA-I repbase; DNA; INV; 6284 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-139_AA_; KW Gypsy-139_AA-LTR; Gypsy-139_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6284 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1011-1011 (2011). XX DR [2] (Consensus) XX CC Positions [5196-5675] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1138..2439 FT /product="Gypsy-139_AA-I_1p" FT /translation="MEKQVHKLLEELEAIILNFKKAPSRRYLKQTLVSKGK FT RAREIYEDILYLVEDFPQTQQKLLLSEARNLFSKIKITIDTRLDTATGRLI FT KLKTVTHLVVFFNSLYKKHEKMAVPKVDLKLGTSIIPIYDGNQEGLGAFID FT SVRLFEDSVNTDFNTATADQKAAALATLIRFIKTRLTGKARAAVGDNSPTI FT NDIIDKLKDKCGLATSPDVYISKLNHIKQTGDINKFTTEVENLTLLLERSL FT VTDNIPLDVASKMATKQGVEALCSGVRNNDTKLILKAGQFSTLSAAILKAT FT ENEPIATTHINQIMFTNRHGYKKHYGNSNSNRQGGHENYNRNRGNFSNRGN FT YSNFSNRGNFSDYRGQGNYQRNFQNPRFQRGRGSYRGNTQSRIFYANTDNE FT QNAPPTPTQNRQNNRTQQPNIQPNTQQQPNFLGQLQRYTQ" FT CDS 2481..6020 FT /product="Gypsy-139_AA-I_2p" FT /translation="MSNSTCTFMVDSGADVSIFKLNKVRNNQLLHQDQKCM FT ISGISGNKIETLASTTTRLTFQNSFTLEHEFQLVDEQFPIFTDGILGRDFL FT AKFRCTIDYESWTLNFRHNNEEVNIPIEDNLNEGIILPERSEVIRNLNYLN FT ITEDMLVCSQEIQPGIFCGNTLASPSAQYVKFINTTDKPALIRPFTPMMIP FT LKNFIVAKIDKNNNVPTRVEKIKQKISVSNLPLTAMKPFETLIEEYNDIFA FT LGEDNLTTNNFYTQNIELSDSVPVYIPNYRSIHAHNEEIESQVKKMLDNKI FT IEHSVSAYNSPILLVPKKSSTNEKKWRLVIDFRQLNKKILADRFPLPRIDT FT ILDQLGRAKYFSTLDLMAGFHQIPLDSNSRRFTAFSTTSGHYAFTRLPFGL FT NISPNSFQRMMTIAMAGLTPECAFVYIDDIVVIGCSINHHINNLKRVFDRL FT RRYNLKLNPEKCQFFKTEVIYLGHKITDQGILPDDSKFEVVKNYPKPTNAD FT DIRRFVAFCNYYRKFIPKFAQIAKPLNNLLKKNTTFSWNEDCQTAFETLRN FT KLLSSSILKYPDFSKTFTLTTDASNIACGAVLSQDYDGNDLPIAFASKAFS FT NAESKKPTIMQEMIAIHWAIDHFKPYLYGRKFIIKTDHRPLVYLFGMKEPT FT KKLTQMRLDLEDYDFEIHYIKGRDNVGADALSRIVTSEEIKNNNILMVHTR FT AMTKKQQGHDKNHIVSPKEKIDHLLVYETESPAEIRKLQKMSISMQQNMIV FT IKIMNQNMKKTLITLEQDIIHENESQTLEFALSEMNEILRNKKIAMSTNNE FT ILTHIPLQHFKRIALDVIRNFELILYRPPKELTNSEEIDNILHDFHMTPTG FT GHIGQNRLYLKLRDLYYWKGMRTTIIKFVKACKLCKINKVTRRTKEASVVT FT TTPAKPFEIISIDTVGPLPRTIKNNRYCITLQCDLTKYIIIIPLENKEADT FT IARALVQNFILNYGTFLELRSDQGTEFNNEVLKKISKILNFKQNFSTAYHP FT QSIGALERNHRCLNEYLRNFSNVHHNDWDEWTKFYEFSYNTTIHTDTEYTP FT FELVFGRKAKLPQDMRKFSTEPIYNLDEYYSELRFKLQKSSEIARKNLIEQ FT KTKRTNELNKNLNPINVAIGDLVYITNENRKKLEPHYIGPFKIKHIDDMNC FT TIVNINNKKENKIHKNRIIKM" XX SQ Sequence 6284 BP; 2472 A; 1153 C; 1043 G; 1616 T; 0 other; tggcgaccgg actaatcgga acctacaagg ccctggagga cgcatagaag ccaacggaag 60 caccggaaca gcaaatcgca gagcacgccc cggagggaac cttaacacgg gcgatcccgt 120 ataagcaacg atatcaacgg acaacagacc aaaagcagcc tgttagggtg atagagcagc 180 tttctccgtt aatccgccaa tccgtctcca gaaatacgct aattatcgct aaagcaaatg 240 aaaaacggat aaaaacatac gaaatacaaa aggaagatgg gaagttcaga atcccgagag 300 gaaatagtgg tccgcagcag cagcagcagc agcagcaaac aggacaacag caacagtcat 360 taccggtggc atcaggaatt ggtccagtac acgtaatcgc tacatgcatg gtgattatta 420 tcgctgcact attattgcga gttatctggc aaattctaat gcgagagatt gaaaaccgca 480 gccgtcgttc tcctacagtg gcaaccattg tgtaaaaaaa aaaaaaaaac actcaagaac 540 atcagcaaca gaaacgtgat tatgtgagaa aaaaaaaact acgtcgtgtg aaaataatgg 600 ataaactcag tctttcccta ttcatgacag cagcaatttt agaactagtg acattctata 660 gcctttttat gtggctaaca aactattatc taagtgaata agagaacaga cgaaaatggt 720 aaaacaaaga gtgtccaatg tgcaggcata tcatagcctg acacttagtg cagtaagtga 780 aaaaagagaa catttaaaaa aggtttttcg taatgaaaaa gaaaaacaac cgaaacgaga 840 taaaggacaa tcagaagcga aaaaatcaaa ctctatgaaa tcaaaacaac ctattcaaat 900 cgtaaaaatt gaaaacgaga ctcaacaaaa catcgtacag tggggaaaac aagttaaggt 960 ggacgtagaa aaagtaaagg cactggctgc caaactcaaa accactatga agaaaccagc 1020 cacaaggatt cgttaccaag gctgagtaac agcacaactc atcataagca acaacaacga 1080 gcgtagagga gagcgcctcc gccgtagtac cacaggtaat taccttaaaa ttccttaatg 1140 gaaaaacaag tgcataaact tcttgaagaa ttagaagcaa tcattttaaa ttttaaaaaa 1200 gctccttcaa ggagatattt aaaacaaaca ttagtctcaa aaggaaaaag agctagggaa 1260 atttacgagg atatcttata tcttgtagaa gattttccac aaacacagca aaaattgcta 1320 ttatcagaag ctaggaatct ttttagtaaa ataaaaatta caattgatac gcgcctggac 1380 actgcgacag gccgcttaat aaaactaaaa actgtaactc atttggtcgt attttttaat 1440 agtttatata aaaagcacga aaaaatggca gttccaaaag ttgatttaaa attgggaaca 1500 tcaattatac caatttatga tggaaatcaa gaaggattag gagcgtttat cgactcagtc 1560 cgactttttg aagattccgt taatacggat ttcaatacgg caactgctga ccaaaaagca 1620 gcagctctag caacactgat aagatttata aaaacaagac ttacaggaaa ggctagagca 1680 gccgttgggg ataactcacc aacaataaac gatattattg acaagttgaa ggataaatgt 1740 ggtttagcca catctccgga cgtttacata tcaaaattaa atcatataaa gcaaactggt 1800 gatattaaca aattcaccac ggaagttgaa aatctaactc ttcttctcga aaggtcacta 1860 gtaaccgata acattccatt agatgttgct agcaaaatgg ctacaaagca gggcgtcgaa 1920 gctctctgct caggcgttcg taataacgac accaagctta ttttaaaagc cggacaattt 1980 tctacactgt ccgcagcaat attaaaagca acagaaaatg aacctatcgc aacaacacac 2040 ataaatcaga tcatgtttac taaccgccat ggttacaaaa aacattacgg aaactccaat 2100 agcaatcgtc aaggaggcca cgaaaactac aatagaaatc gcggaaattt ctccaatcgc 2160 ggaaattact ccaattttag taatcgcggg aatttttccg actaccgtgg tcaaggaaat 2220 taccaacgta atttccaaaa cccacgtttc caacgcggta gaggatcgta tcgtggtaac 2280 acgcaatcac gtatcttcta cgcaaatacc gataatgaac aaaatgcacc accaacacca 2340 acacagaata gacaaaataa ccgaactcaa caaccaaata tacaaccaaa cacacaacaa 2400 caacctaatt ttttaggaca acttcaaaga tatacacaat aaatacatca gcttcaaact 2460 tcattgaggc acgtgttaga atgtctaatt caacatgcac ctttatggtt gattctggag 2520 ctgacgtttc tatatttaaa ctgaacaaag ttcgaaataa tcaactatta caccaagatc 2580 aaaaatgcat gatctcaggt atatctggaa ataagattga aacccttgca tcaacaacaa 2640 ctcggcttac atttcagaat tcatttacat tagaacacga attccaatta gtagacgaac 2700 aatttccaat attcacagat ggcatactag gccgagactt tttagcgaaa tttcgatgta 2760 ccatagatta tgaatcatgg acattaaatt ttagacacaa caatgaagaa gtaaatatcc 2820 caattgaaga caatttaaat gaaggaatca tactccccga acgaagtgaa gttataagaa 2880 atttaaatta cttaaatata actgaagaca tgttggtatg ttctcaagag attcagccag 2940 gaatcttctg cggaaacaca ttagcatcac cttcagcaca atatgtaaaa tttataaaca 3000 caactgataa gcccgcatta atcagaccat tcacacctat gatgatacct ttaaaaaatt 3060 tcatcgtagc aaaaattgat aaaaataata acgttcctac acgagttgaa aaaattaagc 3120 agaaaataag tgtttcaaat ttacctctaa ctgccatgaa accttttgaa acccttatcg 3180 aagaatataa tgatattttt gcattagggg aagataattt gactacaaac aatttctata 3240 cgcaaaatat cgaattaagt gacagtgtgc ctgtttacat tccaaattac aggagtattc 3300 atgcacacaa tgaagaaatt gaatcccaag ttaaaaaaat gctagacaac aaaattatcg 3360 aacattctgt atcggcttac aattcaccaa ttttattggt acctaaaaaa tcgagcacaa 3420 atgaaaaaaa atggaggtta gtaatagatt tcagacaatt aaacaaaaag attttagctg 3480 accgatttcc gcttcccagg attgatacca tattggatca gttagggaga gccaaatatt 3540 tcagcacatt ggatttaatg gcaggattcc atcaaatccc tcttgattct aattccagaa 3600 ggttcaccgc attctcgaca acctctggac attatgcttt tacgcgttta ccatttggac 3660 taaatataag tccaaatagt tttcaacgca tgatgacgat tgccatggca ggtttgacac 3720 ctgaatgtgc attcgtttat atagatgata tagtggtaat cggatgttcc atcaatcacc 3780 acataaataa tttgaaaaga gtctttgaca gattgagacg ttataattta aaactcaatc 3840 cagaaaagtg ccaatttttt aagacagaag ttatatattt aggacacaaa attacagatc 3900 aaggtatact accagacgat tcaaagtttg aagttgttaa aaattatccc aagccaacaa 3960 atgcagacga cattcgtcga tttgttgcct tttgtaatta ttataggaaa tttatcccaa 4020 aattcgcaca aattgctaaa cctttaaata accttctaaa gaaaaataca acattttctt 4080 ggaatgaaga ttgtcaaaca gcttttgaaa ctcttcgtaa caaattacta tcctcatcaa 4140 ttttgaaata tcctgatttt tcaaaaacct tcacattgac aacagatgca tccaacattg 4200 cttgtggagc agtattgtct caagattatg acggaaatga tttgccaatt gcttttgcga 4260 gtaaagcatt ttctaacgct gaatcaaaaa aacctacaat tatgcaagaa atgattgcta 4320 ttcattgggc aatagatcat ttcaaaccat atctgtacgg aagaaaattt attatcaaaa 4380 cagatcatcg ccctctagtt tatctttttg gtatgaaaga acctaccaaa aagctaacgc 4440 agatgcggtt agatcttgaa gattatgact tcgaaataca ttatataaaa ggaagggata 4500 atgttggtgc agatgcatta tccagaatag taacatcgga ggagataaaa aacaataata 4560 ttcttatggt acatactaga gctatgacca agaaacaaca aggacatgac aaaaaccata 4620 ttgtttcacc aaaagagaag attgatcacc ttcttgtgta tgaaacagaa tcgccggcag 4680 aaataagaaa attacaaaaa atgagtataa gcatgcaaca gaatatgatt gtaataaaaa 4740 taatgaacca aaatatgaag aaaacattaa ttacacttga acaagatata attcacgaaa 4800 atgaaagtca gacgctagag tttgctcttt cagagatgaa tgaaatatta agaaataaaa 4860 agatagcaat gtcgacaaat aatgaaatct tgacacatat accactacaa catttcaaaa 4920 ggattgcttt agacgtcatc cgtaatttcg aactaatact ttaccgacct ccaaaagaat 4980 tgacgaattc tgaggagatc gacaacatat tacacgattt ccacatgact ccaacaggtg 5040 gtcatatagg acaaaaccgg ctatacctta aactcagaga cctttactat tggaaaggca 5100 tgagaaccac aattataaag tttgttaagg cctgcaaact ttgtaaaatt aacaaagtaa 5160 cgagacgaac gaaagaagca tcagtggtaa cgacaacacc tgccaagcca ttcgaaatca 5220 tttcgattga cacggtaggt ccgttgccac gaacaattaa aaataatcgg tactgcatta 5280 ctttgcaatg cgatttaact aaatatatta taattatacc gctagaaaat aaggaagctg 5340 ataccatagc acgcgctttg gtgcagaatt ttattttgaa ttatgggaca tttttagagt 5400 taaggtctga tcaaggtaca gagtttaata acgaagtcct caagaaaatt agtaaaattc 5460 tcaattttaa gcaaaatttt tcaacagcct accatccaca gtcaataggt gctttagaga 5520 ggaatcatag gtgcctgaac gaatacctaa ggaatttctc caatgttcac cataatgact 5580 gggatgaatg gacaaaattt tatgaatttt cgtataacac cactatacat actgataccg 5640 aatacacgcc atttgagctt gtgtttggta ggaaagccaa acttccacaa gatatgagaa 5700 aattttcaac tgagccaata tataatttag acgaatacta ttcagaatta agattcaaac 5760 ttcaaaagtc aagtgaaata gcccggaaaa atttgattga acaaaagaca aaaagaacta 5820 acgaacttaa caaaaactta aacccaatca acgttgcaat aggagattta gtctatataa 5880 caaatgagaa cagaaaaaaa ttagaaccgc attatatagg accattcaaa attaaacata 5940 tagatgacat gaattgtaca atagtaaata ttaataataa aaaagaaaac aaaattcaca 6000 aaaaccgaat tataaaaatg taaaaatcat ttactttaaa cccaagaatg cgaaccccta 6060 tgtttttccc atatatatat gctaaataat aagacgctat acaaaaaaaa aacttctaaa 6120 tattttttta ttagttcttt tgataatgta ggcataggaa taaaaaaaaa tctctaacaa 6180 aaacaaaaat atataaaaaa ttgtaataat aaaaatatgt taaagtcttt tggagtgata 6240 gctagtcaga atgtttaaca acattctcct ttaaaggggg gagg 6284 // ID AACOPIA1_I repbase; DNA; INV; 4110 BP. XX AC AF134899; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 30-MAY-2000 (Rel. 5.04, Last updated, Version 1) XX DE AACOPIA1_I is an internal portion of AACOPIA1, a copia-like LTR DE retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; AACOPIA1; KW AACOPIA1_I; AACOPIA1_LTR; MOSCOPIA; endogenous retrovirus; KW internal portion. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4110 RA Tu Z. and Hill J.J.; RT "MosquI, a novel family of mosquito retrotransposons distantly RT related to the Drosophila I factors, may consist of elements of RT more than one origin."; RL Mol. Biol. Evol 16(12), 1675-1686 (1999). XX RN [2] RP 3916-4110 RA Tu Z.; RT "Mosqcopia, a novel family of LTR retrotransposons in Aedes RT aegypti are associated with genes and other transposable RT elements."; RL Unpublished. XX RN [3] RP 1-4110 RA Tu Z. and Hill J.J.; RT "AACOPIA1_I."; RL Direct Submission to Genbank (16-MAR-1999)Department of RL Entomology, The University of Arizona, Forbes 410, Tucson, AZ RL 85721, USA. XX DR GenBank; AF134899; Positions 4186 8295. XX SQ Sequence 4110 BP; 1108 A; 893 C; 1189 G; 920 T; 0 other; ggtgatgggc ccagcgcaag gcccccgcgg gatttgtgaa agtgaagaaa agttttggcc 60 ggccgagctg agccgccatt gctggtgttc cgctttggaa gcgaagagtg gaaaagaaca 120 tcgcgaaggc cccatcgtgc gggtgtgtta gtgagtttac cggccgagtg agaaaaagtg 180 cacgagtgca aaaatggccg acgcgaaatt ttcgatcccg aaacttaacg ggacgaattg 240 ggcaacgtgg aagttgcgag tggaaaactt gttggcccgt gacgatttat gggatgttgt 300 ggtcgaggaa gttcccgacg agtttgaccg tgacgatgac tggaaaatcg cgaatcgcaa 360 ggcgaaggcg acgctggtgt tactgttgga agacagtcag ctcgccatag tgaagaagtg 420 tgtgttcgct cacgatgtgt ttggtgcgtt aaaggcgtac cacgagaaga gtacccgttc 480 ggtgcgcgtg tccttgctca aaaagttatg tgcgatcaat ctttcggagc gcggtgactt 540 agaacaacat ctgttcgaag ttgacgattt gtttgaccgt ctcgacgccg cgggcacaac 600 tctggacgca gatacgaaaa tctgtttgtt gttgaggagt ctcccaccct cctttgacgg 660 gctagtcaca gctcttgatt cgcggtcaca agatgacatt acgctcgaag tggttaagtc 720 caaattgatg gacgaattcc agcgccggct ggaacgcgac ggtcatccgg tgaagaaaga 780 aaaggcaatg aaaactgccg taacgaagac gggtgaaacg cgtgtgtgct tttattgcaa 840 gaagccaggg cacttgcaac gcaactgccg caaacttctc gagacgaaga aagaagaaaa 900 caacagctcg tcgtccggca cgaagccgaa gaaaagtgac agtgtaaaag cgaaagccgt 960 acacagtgat acgagaggaa ttgctttcgt tgtgaatggt gaaaatgctc gatcctggat 1020 tattgacagt ggtgccagtg cgcacatgac gtgcgataaa tcgtttttca ccacattcga 1080 agaatcttgc ggagggtaca ttaccctcgc tgacggtaag aaaacgcaga ttctcggtga 1140 gggtgccggt gtgctccatg gcatagatgg tgacgatgaa gtgatcagga tcgacataag 1200 tggtgtaaag tacgtcccgg gactgtcgac gaacctaatt tccgttgaga aactagctca 1260 gaagaagctt gatgtgagtt ttaatagtga tggttgccgt attatcgatc cgaaggggaa 1320 cgtcgttgca acaggagttc gttgcggcgg cctgtatcat ttgcggcagg cagagtcttc 1380 gctgcaagct gcaggtgggc aacatcacga gaactgccaa cacttgtggc accgtcgcct 1440 cggccaccgc gattgggcgg ctacggaacg catcaacaaa gaggagttag caacgggaat 1500 gaaggtaggt gattgcggtc ttagacttgt atgtgagtgc tgccttgaag gcaaagcggc 1560 tagagctccg tttccgagca ttgtcgaaag gaagtcgacc aggattctcg atatcatcca 1620 cactgatctc tgtgggccca tgaaaaccac cacgccgagt gggaatagat tcgtcatgca 1680 cctcattgac gattacagtc gtttcacagt aacttacctg ttgaaacaca agtcggaggc 1740 ggcccaaaat attatcgact ttgtgaagtg gaccgagaat ctttttggtc ggaagccctc 1800 tgtaatccga tctgacggtg gaggggagtt tgataataag cttttgcggg atttctatcg 1860 tgctaacgga ataaagccac agttcacgac accctatact cctcaatcca acggtgtagc 1920 ggagaggaaa aacaggtcca tcaccgaaat ggcaacctgt atgcttttgg actctggact 1980 taataagcgc ttctggggcg aagctgtttt gaccgctacg tacatacaaa accgtatacc 2040 gtctagatca gtacctaaga ctccatttga gatgtggtgg ggacgaaagc cggaccttgg 2100 ccacctaagg gtgttcggca gtccagcgta cgttcatgtc cccgacgtga aacgcagcaa 2160 gatggatccg aaagcgaagc ggctcatttt cgttggttac tcaattgagc ataaaggtta 2220 taggtttgtg gacactgaga ctgattgcat caccatcagc agggacgccc gtttcattga 2280 gcaagagaat ggaacgtcgt ccgttgagat tccagcttcg gagaacggga caagtaagaa 2340 gcaagcgaat ggtgagatca atccgaatcc gttcaaggag gagaccgaca ccgaggagat 2400 tagtgaagaa gaagagttta gcaccccacg tgctgaatca agtggagcct cgggtccagt 2460 ccgtagatcg gcacgggaga accgaacaat tccgaaacat ctagaggact atctgttgga 2520 gtacgccgta ggaatagcgg catgcgcggt tgaagaaccg gataatcatc tggaagctct 2580 cgaaagtgcg gagtggcgca ctgcgatgaa ggacgagatg gactcgcatc agcggaacgg 2640 tacatgggag ctaatacatc tgccgccagg acggaaaccc gtcggttcca aatggatttt 2700 taaagtcaag cggaaccagg aagaacaagt agtgaagttc aaggcaaggg tcgtagcaca 2760 aggatacagt cagaaggatg gcatagactt tgaccaagtt tttgcaccag tcacccggca 2820 agctacactt cgtctgttcc tcactatcgc cgccaagcag aagctcatcg ttcagcacct 2880 ggatattagg actgcatacc tcaacggtgt gctcgaggag gaggtttata tgcggcagcc 2940 tcccggtttt acagtcaagg gtaaggagga gtacgtctgc cggcttagga ggagcattta 3000 tgggctccgt cagtcagccc gctgctggca taagaaactg aatgaagttc tcaccaagta 3060 tggattcaag tcatcggcgg ccgaccagtg cctgtatacg aagaatacgg atggagtgaa 3120 agtgtttttg atagtccacg ttgatgacat tctagtggcg tcagcagaag aagcgaatgt 3180 gaagagggaa ttcgaaaacc taggccgaga gttcgagttg acatgtctgg gcgaaatccg 3240 tcattttttg ggagtggaag tcttacgtga agatggcgtg ttcaagatca ggctgaagca 3300 gtttatcgat aagctgatta tcaagcacgg aatggagaat gcgaaaacta cgagatcacc 3360 aatggacatc ggttttttga aagacggcgc aaacagtgaa ccgtttgagg acgttaccct 3420 gtatcgaagc ttagtcggag gaatgctgta cttgtcagtg gttgcgagac cggatattgc 3480 agccagtaca gcgatattgg gaaggaagtt ttcggagcca agtcaagccg attggaccgc 3540 tgcaaagcga ttactacgct acctgaaagc aactcgtcac tattttcttc gattgggcgg 3600 tgctgcagaa gatccactgg ttggatacag tgatgccgat tgggcaggtg atcccgtcag 3660 ccggcgatca acatcgggat tcgtgttctt gttcgctggt ggtacagttt cgtgggctag 3720 tcgccgccaa acttgcgtaa cgctctcttc catggaagca gagtatgtcg cgctagccga 3780 agcgtgccaa gagaccatct ggttgaggca gcttcttcgc gacttcggag agcctcagtt 3840 acagccaact acgatgaaag aagacaacca gggctgtatt gccttcatta aaacggagag 3900 gtcaagcaag cgatccaagc acattaacac caaggaacgt tttgtgcaag aactgtgtga 3960 gaagaacgaa attgtgctcg agtattgtcc tacggagata atgatagctg atgtaatgac 4020 aaaaccgttg ggaccacaga aacatggaga tttcgtcgtg aaactgggac tggaggatcc 4080 gcagtgaagc aggtaaccgc tgaggaggag 4110 // ID Kolobok-7_HM repbase; DNA; INV; 2756 BP. XX AC . XX DT 26-DEC-2008 (Rel. 13.12, Created) DT 26-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2756 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2065-2065 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 432..2198 FT /product="Kolobok-7_HM_1p" FT /translation="MSSGKSSKQNFSRKRKKKPPVNKYSKKCLSNISSEKV FT TSSISKKRLIMNQTNLNTISEVIKQEDDYFLFVNFSVLKSVFQPLLVCPEC FT GHKKITLSDNRKKRKGFAHNLELLCESCEWYKSVYTSKECNTQNHGQGPKI FT FEANARAVIGFREIGKGFTAMETFSQCMNMCSLSKTPFSNLNDNEIYTAYE FT KAAFNSMKKAAEEAQDKTLSIPTKQRVTIDGSWQKRGFTSLNGVVTAVVDD FT KIIDTKAFSKHCKGCSMWKKHRGSPAYNRWKADHICHINHTKSSGAMEAAG FT ALEIFSRSVKKYNLIYHEYLGDGDTSSFKEVKESNPYKNYNIQPIKLECIG FT HVQKRLGTRLRNIVQKYKRTEKPLHGKGNLTNSAINSMQNYYGIAIRNNTD FT NIYAMKKAVGAVLWHSTNFSDNNIRHSMCPRNEYSWCKWQLDKLKGTNTHT FT NKINLPIFIHNIIKPIFKDLSDDDLLRKCLHGQTQNSNEAFHSILWTRCPK FT NVFVNRRTFELCINSAIVHYNDGGDGVKSVLSYFGLLGSVTIPKLVLKDQK FT RIVNNRRKSTSICKKQRKKLRSLYKGYLDTEKTDSYSSGSF*" XX SQ Sequence 2756 BP; 1005 A; 381 C; 440 G; 930 T; 0 other; ggtggtaggc ctatgaaaaa tagagatttt tgatcaaaaa ttgaaaaaat aggaaatata 60 ttgtatttat gtaaaaattt atcctctatc tgctgatata tgacttgttt aggtttgatt 120 aaattaaaag tgttttaaat aacatatttt gttagctagt ttttttgttt accttagcaa 180 cgtcatagca acgaaaaatt aagaccttat tttacccaaa ttagatggcc aaaatgacaa 240 atcttgtgaa gcttcttttt ccaatataat atagatctgc ggtgtttttg gtcatgcttt 300 aaagttacat tttcttataa taattgccat attaaactgt ttatattagc tgtggtataa 360 aatcattatt tatgaatttt tcaactctta atttaaatca atcaatttgt acttgtgtat 420 aatcgtaata aatgtcttct ggaaaaagca gcaagcaaaa tttttcacga aaaagaaaga 480 agaagcctcc tgtaaataaa tactcaaaaa aatgtctttc aaatatttca tcagaaaaag 540 ttacatctag cataagtaaa aagaggctaa ttatgaatca aactaatctc aatacaatca 600 gtgaagtaat aaaacaagaa gatgattact ttttattcgt aaatttctct gtactcaaaa 660 gtgtatttca acctttgtta gtatgtcctg aatgtggcca taaaaaaata actctcagtg 720 ataacagaaa gaagagaaag ggatttgccc ataatttaga acttttatgt gagtcatgcg 780 aatggtataa atctgtttat acatcaaaag aatgtaatac acaaaaccat ggtcaaggtc 840 ctaaaatctt tgaagcaaat gcaagggcag taattggttt tcgtgaaatc ggtaaaggtt 900 ttactgcaat ggaaacattt tctcaatgca tgaacatgtg ttctttatct aaaacaccat 960 tctctaacct taatgacaat gaaatttata cagcatatga gaaagctgct tttaatagta 1020 tgaaaaaagc agctgaagaa gctcaagaca aaacattaag tattccaaca aaacaaagag 1080 taacaattga tggctcttgg caaaagcggg gctttacatc actaaatggt gttgtaactg 1140 ctgtcgtcga cgacaaaatt atagatacta aagctttttc taagcattgt aaaggttgtt 1200 caatgtggaa aaaacataga ggatcaccag cctacaacag atggaaagca gatcacatat 1260 gtcacattaa ccatacaaag tcatctggtg caatggaggc ggctggcgca ttggaaatct 1320 tttctcgatc agttaaaaag tacaatttga tttatcatga atacctaggg gatggtgata 1380 cttcatcttt taaagaagtt aaagaatcta atccatataa aaattataat attcaaccaa 1440 taaaattgga gtgcattggg catgtgcaga agcgacttgg tacacgcttg cgaaatatcg 1500 tacagaagta taaaagaaca gagaaacccc tacatggaaa aggaaatttg actaatagtg 1560 caattaactc aatgcagaat tattatggta ttgcaattag aaacaacact gacaacattt 1620 atgcaatgaa aaaagcagta ggagcagttt tgtggcacag cacaaatttc agtgataata 1680 acattcgtca ttctatgtgt cctaggaatg aatattcctg gtgcaagtgg caactagaca 1740 aacttaaagg tacaaataca cacactaaca aaattaactt acctatcttt attcataata 1800 taataaaacc tatttttaaa gatttatctg atgacgattt gctaagaaaa tgtttacatg 1860 gccaaacgca aaattcaaat gaagcatttc attcaatttt atggactcgt tgtcctaaaa 1920 acgtttttgt aaatagaaga acatttgaac tttgtataaa ctctgcaata gttcattata 1980 atgatggtgg agatggggta aaatctgttt taagttactt cggattgtta ggatcagtga 2040 ctattccaaa actggtttta aaggaccaaa aacgaattgt aaacaaccgt cgtaaatcaa 2100 cttcaatatg caaaaagcaa agaaaaaaac ttagatctct ttataaggga tacttagata 2160 ctgaaaaaac agattcatat agctctggta gcttttaaga agtgtttata gacataaatt 2220 ttgattttta gtttttgatt tgatttttct cattttgaga tttttagttt ttttttaaat 2280 tttaaaacat gatatctcca gtttaaatta agcagttaag ctgaaatttt caggatatgt 2340 tcttcataca tgtagaattc atttgtcata agatttttaa attttctttt tttggggcgc 2400 tacaatgtgg ttttatctgt caaagttttt tcataaaaat atatattcaa gtttaaaagt 2460 caatataatc tgaaccagga aggcagatta aaaaatccta tgacaaatga attttataca 2520 tatatagaac taccatacca aatttggagt tttgatctat tgtcactttt gagaaatagt 2580 gttttgaagt tttcgttttt ttcacacttt ttaataaatt tttgaggcaa gcgaggcact 2640 tttttatata tttttttgat tttatgttta gtctcataat tattggtaat aaatgcttag 2700 ttttattgta aattttattt tgatgcttaa aaaaagtttt tcataggcct accacc 2756 // ID Gypsy12-NVi_LTR repbase; DNA; INV; 1188 BP. XX AC AAZX01024254; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy12-NV; KW Gypsy12-NVi_I; Gypsy12-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1188 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1143-1143 (2007). XX DR Genome; AAZX01024254; Positions 5429 4242. XX SQ Sequence 1188 BP; 441 A; 221 C; 257 G; 269 T; 0 other; tgatgtatgt accataatta gtacaactgc cgtattattt aggcagacta tataagcgcg 60 tcgtacgcag tatgtaaaca caaggctcag cagtcgctcg caggcgttcc aagcgagaac 120 atctcttctt attgtaacac cctaataaag cttacctata ttctaatata tcaactgtat 180 ggttacatta ccgtcctaca ccttccatgc gatatacaac actggcgacg aggatataaa 240 aggattacgc gtgtacagtg cagtgacggt ataaagcgaa atcgcatcac gtaaaaattt 300 ttattttgta ttcgatcacg gaatgcgtat tcgcgaagtt tacgttgaac tatacgtatt 360 cgaaagtgca agcagtattc gtgtagagtg aaagtattcg ttgtaaatga gcgacggaca 420 gcaaacaaaa gtattggtcg gctgggtata caatctacgt aaagatcaag tcgaagaaga 480 attaaaaaaa cgtaacgtcg acttcgatga aaactcgaat ctaaacgagt tacgtaaatt 540 attagcgcag acgctaagat cagaaaaaaa tttacaagac gagcaagtag accagaaaga 600 ggaaaaccca agccacgaaa acgtacacga agaaacagaa acagaaagct cacactccga 660 tacagagtca gaaatgtcgg acacaccgaa actagaattc cgtctagata aaggagactg 720 ggaaacgttc gcagatcgat tagaagtata tttcttggcg aaaggtactg tagacaataa 780 aaaagtagct accgcgctta cgaaattcga tgaagaggct ttcaagctac tcaaaagcct 840 atgtgcacca aaaaaaccaa tagaattaac atatgaagag ttaacgaaaa tgatgaaaga 900 gcaacttaac ccggccccat ctgaagtaat ggagcgttgc aattttaata gagcgaagca 960 agaacaaacg gaatcgatag cggatttcgc agcgagatta aaaaaattag cattgaattg 1020 caacttcaaa gatcaactcc aaacagcatt gagggatcaa ttggtttgtg gattgaagga 1080 tcatgaaaca agagttgaac tttttaagaa gacgagttta acattcgaca cggcattcac 1140 agaagcagtg gccagagaaa aggcagtgca aaatgcggag ggatcgca 1188 // ID CR1-117_AAe repbase; DNA; INV; 5344 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-117_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5344 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1205-1205 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 18 sequences with >93% CC identity. XX FH Key Location/Qualifiers FT CDS 296..2251 FT /product="CR1-117_AAe_1p" FT /note="PHD zinc finger at the N-terminus." FT /translation="MTKKCDGAPCFGGSTRSNKIECRNCRKSFHAKCANLN FT GSQLKAIRDLPGVFXFCTVCRSIENRVDQQVHSSILDLVLHRINSLTRLVY FT AQIDATRSLCRSLGNEHTHVSVPRMSFLDKSSIVDVVKTNFENALDQMQFE FT FSRDFNDLVAENTHRNATDKPHTDDSSIVLNDNNKRDRTSSSDLSPNSEKR FT MRVDVPNSSCENSNTVALPLTASQQCSAEIVTVSTSQANVILPNETVLPSA FT NPPMPSPGAVAIAASYELPAVVGTAAANESANTVNNNAAITASAPANAVNA FT ATNYQATTVNTPANTYHASCTRAPTIATQATKTTQAINXYQATFTNQAIIS FT NHAVNTNQAFIASQATPAAFVSTTQNDFARPPAVTXLVTYAPSHQARFYPL FT IHDSCSNAARMHSVGTQTSPNAVSRRSSLEYIRRDNTTTISTSSGNTTVAS FT NHTVPATMFSPTMVNIADGPAQSHQSHPLPTRVNQLRNTAVISRQNGPVSP FT TLSIANREDPRKWFYVSPFQPHETAQNISDYVSMKANCRIDQVICQKLVSE FT NRSNGPPLTFISFKLSVPEDIVXTIRSDKFWPEGIVIKPFLYQRPPVNRLS FT APYPSTSVPRTPITRIAIQRPELFPSARQLPKRYQMSSPLISRTQTIHPFQ FT TLV" FT CDS 2014..5157 FT /product="CR1-117_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="IXQILARRNSYQAFFIPTPSCQSVISAIPINQRPSHS FT YHQNSNTEAGIVPICSPTPKTLSNELPVNFQNANNSSISNASLRSSLGVKG FT LSVFYQNVRGLRTKLDSFLVASTSSSYDVILLTETWLNDVISDRQLFGPEY FT TVYRSDRSTSNSSKQLGGGVLIAVKQQFNSTRMMSNDYVHIEQLWVKITLP FT EYSLYLCCFYLPPDLKHDLGILNAHITSIEQIYSSLKERDLLVICGDYNLS FT CIDWRYDKVDADSLFAFPIESNNSVPSYANHLIDSFDSLNLYQMNEHRNSN FT CRILDLIFMSDKAKDAASVHRALYPLLGEDKHHPSLEITFDSVCIKASPDP FT FDPTALNFNKVDYELLAVSLGDIDWSEILSSSSVNDAVQRYTTKIIDVLQA FT TIPKCKPKPSLPWITPKLLHLKSRRNAALRELRKNRCYVTRKRFHFVSDVY FT RKENNRQYQKYIRRSEENLRRNPKSFWSFVNSKRKDSGLPVSMHYNTQEAS FT SAKEKCNLFAKRFSSVFTADCLSECDINTATSDVPADISCLNNLHISAEDI FT IKASKRLKPSSSPGPDGIPTCILKKCIQQLSLPLHHIFNLSISSGVFPAQW FT KSSFMFPVFKKGDRQDMINYRGITSLCSSSKLLELIVNESLLHNFKSYIST FT KQHGFFPRRSVTSNLVEFTSFVLRKMDQGTQVDAVYTDLKAAFDCLNHDIL FT IAKLRKLGIGGILLDWFDSYLRDRALQVKIGSCCSSPFAATSGVPQGSNLG FT PTLFSLFFNDVTRKIPHTCRSLYADDLKIFNAVRSPDDAQALQQFLTLFED FT WCSRNKMSLSVDKCQVITFSRKLNNVNAVYRVNGVSLMRVTSIIDLGVILD FT TKLNFVEHMTSTICKANRQLGFISKVCQSFRDPYCFRSLYCSLVRSILEFS FT SVVWSPSYVTWNLRLESVQRRFIRRALSHLPWNDPIVLPPYENRCLLLNLE FT TLSARRLKAEALFAAKIILGEIDSPSILSDFHFSAPTRILRSSSFFALPFR FT RTNYGKSDPIRKMCVSFNRFSDCFDFTCNSSMFKSRMDRIRV" XX SQ Sequence 5344 BP; 1514 A; 1259 C; 988 G; 1578 T; 5 other; gtcgaagttg tcgaagtttt tagccgattg ttttttcgct tcaagttcaa tatcaatatt 60 agcccagaaa ttcgctttta ataattcgtt aaagcatcaa aagtgcatct ctctacgttg 120 ataatcaaca ttactgtgtt atttgtggat tttgtgactc tgcagggcat ataacgccga 180 aataaatttt gccaacggct tcgttgtgta gaagctgtta gtggccaacc gccattctgt 240 agtgccttct cactgatcgt gaaaccagtt gaaacgcatt agttgtttgt caacaatgac 300 gaagaagtgt gatggtgctc cctgttttgg aggatcgacc agaagcaata aaatcgaatg 360 tcgaaattgc cgtaaatcat tccatgcgaa atgcgccaat ctcaatggaa gccaattaaa 420 agcaattcgt gatttgcccg gagtgttctg kttttgtacc gtctgtcgta gtatagagaa 480 tcgagtcgac cagcaagtgc atagttctat acttgatctc gttttgcatc gcatcaattc 540 gctcacacga ctagtgtatg cccaaatcga tgccacccgt tctttgtgtc gttcacttgg 600 gaacgaacac actcatgtaa gcgtacccag gatgtcgttt ttggacaaat catcaattgt 660 tgatgtcgtg aaaaccaatt ttgagaacgc cctggatcaa atgcagttcg agttctcgag 720 ggatttcaac gatttggtcg ccgagaacac acacagaaat gctacggata aaccgcatac 780 cgacgattct tcaattgtgt tgaatgataa caacaagaga gatcgtacat ccagctcgga 840 tttatctcca aattccgaaa aaagaatgcg agttgatgtt ccaaatagtt catgtgaaaa 900 ttcaaatact gtagctctcc ctcttacagc tagccaacaa tgctcagctg agatcgtcac 960 cgtatccacg tcgcaagcta acgtcatttt accgaatgaa actgttttgc catctgccaa 1020 tccgcctatg ccttctcctg gtgctgttgc tattgctgct agttacgaac tccccgctgt 1080 tgttggaacc gctgctgcca acgaatctgc caataccgtc aataataatg cagccatcac 1140 cgccagtgct cctgccaacg ccgtcaatgc cgccacaaat tatcaagcca ccaccgtcaa 1200 tactcctgcc aacacctacc atgccagttg tacacgtgcc cccactatcg ccacccaagc 1260 taccaaaacc actcaagcca taaacgscta tcaagccacc tttaccaacc aagccattat 1320 ctcaaatcat gccgtaaata ccaatcaagc cttcatcgcc agtcaagcca ccccagccgc 1380 ctttgttagc acaactcaaa acgattttgc tcgcccacct gccgtcactw gcctggtgac 1440 ctacgctcca tctcatcaag ctcgctttta tccgcttatt catgacagtt gctctaacgc 1500 tgcaaggatg cactctgttg gtacacaaac ctcaccaaac gctgtgtctc gccggagcag 1560 tttggaatat atccgtcgtg ataacaccac aaccatctca acaagctcag gtaatactac 1620 agttgcctct aatcataccg ttccagccac gatgttttct ccaaccatgg tcaacattgc 1680 tgatggaccc gcccaatcac atcagtctca tccgcttcca acaagagtta atcagctacg 1740 caacactgcc gttatcagta gacagaatgg cccagtgtca cctacattat cgattgctaa 1800 ccgtgaagat cccaggaaat ggttctatgt ctcgcctttc caaccacatg aaacagctca 1860 aaatatttcc gactatgtat ccatgaaagc aaattgtcgc atagatcaag tgatatgtca 1920 gaaactagtc agtgaaaacc gttcaaatgg accacctctc acatttattt cctttaagtt 1980 aagcgtgcct gaggatatag ttmctactat tagatckgac aaattttggc cagaaggaat 2040 agttatcaag ccttttttat accaacgccc tcctgtcaat cggttatcag cgccataccc 2100 atcaaccagc gtccctcgca ctcctatcac cagaatagca atacagaggc cggaattgtt 2160 cccatctgct cgccaactcc caaaacgtta tcaaatgagc tccccgttaa tttccagaac 2220 gcaaacaatt catccatttc aaacgctagt ttaaggtctt ctttaggagt caaaggactc 2280 agtgtgtttt atcaaaatgt tagaggtctt cggaccaaat tagactcctt tctagttgct 2340 agtacctcct catcatatga cgtcatattg ttaactgaga cttggttaaa tgacgtcatc 2400 tctgatcgac aattgtttgg cccggaatat acagtgtacc gaagtgatcg atcgacaagc 2460 aatagctcaa agcaactcgg tggtggagta cttattgctg taaaacagca gtttaactca 2520 actcgtatga tgtcaaacga ctacgttcat attgaacaat tgtgggtcaa aataacgcta 2580 cctgaatact ctttatattt atgttgcttt tatctgcctc cggatttgaa acatgatttg 2640 ggcatcttaa atgctcatat aacatcgatt gagcagatat attcctctct caaggaacgc 2700 gatctattgg tcatatgtgg tgactataat ctctcgtgca ttgattggcg ttacgacaaa 2760 gttgatgcag attctctttt cgcttttcca atcgagtcca ataactctgt accttcctac 2820 gccaaccatc taatagacag ttttgactct ttgaacttat accaaatgaa tgagcatcgt 2880 aatagtaact gtaggatttt agacctaatt tttatgagtg ataaagctaa ggacgcagcc 2940 tctgtgcatc gcgcattata ccctcttctt ggtgaagaca agcaccatcc gtctttggaa 3000 ataacttttg attcggtgtg tattaaggct tctcctgacc ctttcgatcc cactgctctg 3060 aatttcaaca aagttgatta tgaactgtta gctgttagtc taggagatat agattggtca 3120 gagattctta gttcttcaag tgtgaatgat gccgttcaac gctacacaac taagataata 3180 gacgttctgc aagcaactat tccaaaatgt aagcccaagc cttctcttcc gtggattact 3240 ccaaaactgc tccatttaaa atctcgaagg aacgctgctt taagggagct tcgtaaaaat 3300 aggtgctacg tgacgcgtaa acggtttcat ttcgtaagcg acgtttacag aaaagaaaac 3360 aaccgccaat accaaaaata catccggcgt tcggaagaga atctccgacg taacccaaaa 3420 agtttctgga gtttcgtcaa cagcaaacga aaggatagcg gactaccagt gtctatgcac 3480 tataacaccc aagaagcatc cagcgcaaaa gaaaaatgca acttgtttgc taaacgtttt 3540 tctagtgttt ttactgctga ttgtctgtct gaatgtgaca ttaatactgc aaccagtgat 3600 gttcctgccg acatttcgtg cttaaacaat ttacacataa gcgcagagga cattattaaa 3660 gcttccaaac gattaaaacc tagttcctct cctggcccag atggtatacc aacttgcatc 3720 ctgaaaaaat gtatccagca attgtcactt ccgctccatc atatcttcaa cctgtctata 3780 tcctcaggag tttttcctgc tcagtggaaa tcgtcgttta tgtttccagt tttcaaaaaa 3840 ggggatcgcc aagacatgat caactatcgt ggaataactt cgttatgttc ttcttcgaag 3900 ctgcttgaat tgattgtaaa tgaatccttg ctacacaatt tcaaaagcta cattagcaca 3960 aaacaacatg gcttttttcc acgtcgatcc gtcacatcaa atttggtcga gttcacttca 4020 tttgtactcc gcaaaatgga tcaaggtaca caggttgacg ctgtgtacac ggacctcaaa 4080 gctgctttcg attgtctcaa ccacgatatt cttatagcaa agctaagaaa actcggcata 4140 ggtggcatac tattagactg gttcgattct tacttacgag atagggcgct gcaagtaaaa 4200 attggctcct gctgctcatc cccctttgcc gctacatcgg gtgtgccaca aggcagtaac 4260 cttggcccca cactgttctc gttgtttttc aacgacgtca caaggaaaat acctcacacc 4320 tgccggtcat tatacgcgga tgatttaaag atattcaacg ctgtccgttc tcccgatgac 4380 gcgcaagcgc ttcaacagtt tcttacgctt tttgaggatt ggtgtagccg aaacaaaatg 4440 tcattaagtg ttgataaatg tcaggtcata acattttccc gcaaactgaa caatgttaat 4500 gctgtctaca gagtaaatgg agtttctctg atgcgtgtga cgtcaatcat cgatttggga 4560 gttatactgg acaccaagtt aaattttgtt gagcacatga cttcaactat ttgtaaagct 4620 aaccgccagc tcgggttcat ttcaaaagta tgtcagtcgt ttcgtgatcc atattgtttt 4680 cgttcattat attgttcatt agttcgttca attcttgagt tttcttctgt tgtgtggtcg 4740 cctagttatg ttacatggaa tttacgtttg gaatctgttc aacgtcgctt cattcgccgt 4800 gccctatccc atcttccatg gaatgatcca attgttctgc ctccctatga aaaccgatgc 4860 ttattgctca acctcgaaac actttcggca cgtcgcttaa aagctgaggc cttgtttgcg 4920 gctaaaatta ttctaggtga aattgattca cctagtatac ttagtgactt ccatttctca 4980 gctccaactc gtatacttcg ctcgtcatcc ttctttgctc ttccattccg acgtacaaac 5040 tatggtaaaa gtgaccctat acgtaaaatg tgtgtctcat tcaatagatt ctccgattgt 5100 ttcgacttta cttgtaatag ctccatgttt aagagtagga tggataggat tagggtttag 5160 aataattaga ttaaggatca ttgttcaatt aggcataaga ttgtccgttg aatgtatgta 5220 tattgttgtg atgttgttta aaaagataca ggttttttaa tcagctttgc tggtttttcc 5280 tgtcacaaaa gaaaacgaat acaacaaata aataaataaa taaataaata aataaataaa 5340 taaa 5344 // ID DNA8-45_AP repbase; DNA; INV; 581 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-45_AP. XX NM DNA8-45_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-581 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1975-1975 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 581 BP; 200 A; 56 C; 58 G; 266 T; 1 other; tagggctcgg atttatatgt aaatacatat tttctctgcg acatgataaa ctaatttcgg 60 caagtgataa atttgtttca caatattttt gataaaacga gctatatttg attttacata 120 tttttacata ttttgtcata aatacatgtt tttacatatt tcacaattat tcaatttttt 180 ctcacatatt tatacaatta tactgtttta atgtcttttc ttcagantta gaataattga 240 aaggttatca tgactatata ttatatcgac gatcttttga attcgaaaat ttaaaagtgc 300 aggttattat taattgtaat caagataatt agataactat agataaatat gcatttttta 360 ttacatttta ttttattatt aattattatt aagattttac gacaatatcg gcgatctttt 420 ggattcgaaa atttaaaaat gaatgttaat taatgtttgt aatcaatata attagataaa 480 tatacatttt ttattacatt ttattatatt attaattatt attttttttt tcgataatac 540 atattttgaa ttctttagca catatttatg tacatatttt t 581 // ID hAT-66_HM repbase; DNA; INV; 3572 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-66_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3572 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2054-2054 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 757..2691 FT /product="hAT-66_HM_1p" FT /translation="MSSKIKFKSTVWKYFEVSKKDNHFCICLLCKALLSRG FT GNCPKTFTTSAIRRHLKNKHFAEYKIAEKESDKKKIETSTASTSSDSELKK FT QLTLLESLNKKKLWDVNDQRAIKIHNHIGEMMALDIQPYSIVEDLGFRKLI FT EELCPNYQIPSRRYFSENIIPQIYNKLFSSIKSNISAAIHISLTTDIWTAN FT SSNVAFISMTAHWLSNEFSQHRAVLRVMHFPESHTGKNISEYLHKGLVSFE FT IPHSKIHIVLRDNAANMVAGVRDSGFRSMPCFIHTIQLCIHDSILSQKSVK FT DILACCRRLATHFHHSPTAAAKLEAIQEQLGTNKHRLIQDITTRWNSSYDM FT IERMLDQKVPLATYTADHPTQSTLNSFQWDLLDKVLHILEPFKTLTINFSK FT RESYLSDVIPSIMALKEFFNQALVCEAFLTINTLVEHLSGSVDKRLGPYLH FT DKYLCLSTFCDPRYKLTYQKEDKDKIKNWINEQMQEMQQSNLELESSDSDF FT DEPSVHNTNEVIAKPAYPKFNECFQKVSKLITIDECDSNFVNTENSKRLKR FT SNSNIQMINKEIEEYLKLPLVKEKESPLLWWKNCGNSFKNLKKITQKFLSA FT PPSSVESERLFSTGGIIYEPNRNRLSPESGEKLMFIHYNLRTTNDN*" XX SQ Sequence 3572 BP; 1329 A; 491 C; 520 G; 1232 T; 0 other; tagggatgca ccgaacccgg tttttggccg aacaccgaat ccgaacgttc ggccatgaaa 60 gttcggccga acaccgaacc gaataccgaa cttttgcttt ttttaaatat aaacattggt 120 ttacgtcaaa aacattttga actcgaagtg aaaactatat ttttaaaaat ttaatgtgtt 180 ttttaaaaca ttagatttat tttgaaaata tgtgaagaaa ctttaaataa ctatcaaaag 240 taaatttatt attagatgat tgtttgtagt cactacacta caatgtagtg taaagaagta 300 aaaatgttaa gaagttttta tttgaaaaat gaaaaattaa atttatgagt atgacttatt 360 aataatgagt ataacttatt aataatgagt ataaataatt aacaattata atataataaa 420 attatattgt aatttataaa ttaaacttct taaaataata ttttacgtta aggatttatt 480 ttaatagttg ttaattgtta atgttaaggc ttaaagttaa gctatttagc taacttttat 540 atttgtagac tatactaaaa gcttagctta tatgttataa catttaagct aaatttgtac 600 tctattctac aaatataaaa gttacctaat aaaaataaag gtataaaggt ataaggtatt 660 aaaacctagt atctggcatg ttttaagatt atattttttt cttaaacttt atttatttat 720 tagtatatac ttttaattat ttagaagcat aaaaaaatgt cttcaaaaat aaaatttaaa 780 agtactgttt ggaaatattt tgaagtttca aagaaagata atcatttttg tatttgccta 840 ctatgcaaag cattactttc tagaggtggg aattgtccta aaacattcac cactagtgca 900 attagacgtc atttaaaaaa caagcacttt gcagaatata aaatagctga aaaagagtct 960 gacaaaaaaa aaattgaaac atctacagcc tctacatcaa gtgattctga attaaaaaaa 1020 caattaacat tactggagag tcttaataaa aaaaagctgt gggatgtaaa tgatcaaaga 1080 gctatcaaaa ttcacaatca cattggagaa atgatggcgt tggacattca accatattcg 1140 attgtagaag atcttggttt tagaaaatta attgaagagc tgtgtcctaa ttatcaaata 1200 cctagtagga gatattttag tgaaaacatt attccacaaa tatataataa actatttagt 1260 tctatcaaga gtaatatatc tgctgccatt catatctctc ttacaactga catttggaca 1320 gcaaatagtt caaatgttgc cttcataagc atgacagcac actggctcag taatgaattt 1380 tcccagcaca gagcagtact cagagttatg cattttccag aaagtcacac aggaaaaaac 1440 ataagtgagt atttacataa aggtttagta agttttgaaa ttccacactc aaaaattcat 1500 attgttttac gagataatgc ggccaatatg gtagctggtg taagagacag tggttttaga 1560 tcaatgccat gttttataca tacaatacag ttatgtatac atgactcaat tctgtcgcaa 1620 aaatctgtta aggatatttt agcttgttgc agacgtctgg caactcattt tcaccattcc 1680 ccaacagcag cagcaaaatt agaagcaatt caggaacaac ttggtacaaa taagcacagg 1740 ttaatccaag atattactac aagatggaac agttcttatg atatgattga aaggatgctt 1800 gatcaaaaag ttcctttggc aacttataca gctgatcatc ctacccaatc aacactaaat 1860 tcatttcaat gggacttatt agataaagtg ttgcatattt tagaaccatt taaaacttta 1920 accattaatt tcagtaaacg tgaatcatat ttgtcagatg ttattccttc aattatggct 1980 ctgaaagaat ttttcaatca agcattagta tgtgaggctt ttttaacaat aaatacttta 2040 gttgaacatt taagtgggtc agtagacaaa agattgggtc catatttaca tgataaatat 2100 ttatgtcttt ccactttttg tgaccctaga tataaattga catatcaaaa ggaagacaag 2160 gataaaataa aaaattggat aaatgaacag atgcaagaaa tgcaacaaag taatttagaa 2220 cttgaatcct ctgattctga ttttgacgaa ccttcagtac ataacacaaa tgaagtaatt 2280 gcaaaaccag cttatccaaa attcaatgaa tgttttcaga aagtttctaa actgataact 2340 atagatgaat gtgattccaa ttttgtaaat actgaaaata gcaaaaggtt aaaaagatca 2400 aattctaata tacaaatgat aaacaaggag attgaagagt atttaaagct acctcttgtc 2460 aaggaaaaag aaagtcctct cttgtggtgg aaaaattgtg gaaactcttt taaaaattta 2520 aagaaaatta cccaaaagtt tctttcagca ccaccatctt ctgttgaaag tgaaagactt 2580 tttagtacag gtggcatcat ttacgaaccc aatagaaaca gactatcacc agaaagtggc 2640 gaaaaattaa tgtttattca ttataacttg aggacaacaa atgataattg attttcactc 2700 atgtttttaa tgttttattt tttcaataat cgattatttt gttaggtgaa agtttaaaat 2760 tgttgatttg tctagtatta atactattgc atgtttagtt atataataat aacataatag 2820 tatatgttat atagttatat agtaataact acataataaa ttggagcatt tttatgtttt 2880 tgatcaccac aataacattt ttccctaagt ttattccttt caaataattt tctcagttca 2940 actagttgag tagccaaatt acgagcaact ttctttataa agattttttt ttaataacac 3000 agcagcaata ggcgccactt aaaacggacg tttctatctg ttattttaag tttattttat 3060 attattacat ttttgcgatt tttccaaaat aaaaaatttg cttcttgcat gaattttata 3120 acttaagtta agtttaatta ttattattat tgtatataaa aggataaaat taactaaaat 3180 attgctcaat tgaaaggatt tttcatttta cacttatact agtactattt tttctaaatg 3240 atgtaaacta gttttacgta atacattcgg ttttaagtaa tacatgcgat caaattaaat 3300 ttattctaaa ttcaaaaaac ttgatgtaaa tctaacgtac tcatgtgttc tattgaaaaa 3360 catattctta cctactatgc gttgtagata agaatcgggt tttcaataaa aagccatatc 3420 tattggtact gtagctattt aggctgagtt atgatgattt gatgatatta atttccaaat 3480 tagcattttt agcaaatgtt cggccgaaca tttgtcaagg ttcggccgaa ccgaacgttc 3540 ggcaaaacag ccgaacgttc ggtgcatccc ta 3572 // ID Chapaev-1_AA repbase; DNA; INV; 3370 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 30-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon - a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3370 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 774-774 (2007). XX DR [1] (Consensus) XX CC Chapaev-1_AA is a very young family of DNA transposons. The CC genome contains two copies of Chapaev-1_AA that are 0.5% CC divergent from each other. Chapaev-1_AA belongs to the Chapaev CC superfamily. Hallmarks of the Chapaev transposons are 4-bp CC target-site duplications, terminal inverted repeats with the CC conserved '5-CAC and GTG-3' termini, and the Chapaev transposase. CC The Chapaev transposase is characterized by the conserved CC D-x(60-80)-D-x(220-290)-E catalytic triad. Chapaev transposons CC populate genomes of different animals, including sea urchin CC Strongylocentrotus purpuratus, amphioxus Branchiostoma floridae, CC starlet sea anemone Nematostella vectensis, sea hare mollusc CC Aplysia californica, mosquitoes Aedes aegypti and Culex pipiens, CC and nematode Caenorhabditis elegans. The N-terminal portion of CC Chapaev transposase in Chapaev-1_ACa, Chapaev-2_ACa, CC Chapaev-3_ACa, Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, CC Chapaev-2_NV, Chapaev-3_NV, and Chapaev-1_SP is similar to the CC N-terminal portion of RAG1 (100-370 aa in the human RAG1). It CC includes a novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS join(705..2336,2403..2726) FT /product="Chapaev-1_AAp" FT /note="transposase." FT /translation="MKPKNHEQNQQTVCVLCLSKNKLQKLSEAAISRIRSF FT ILPDIVAKAWYYPKKICTHFSFVLYECNSTENLQSKINLHVYKYDQRVSTR FT SGALCQCEICIAARVTLPNLIDPQKKSRKRGRPAVLEESKRARFTKLCTKC FT FSEMSRGKQHICTIVQRQNNVLQIVQGDRPQAGDAIITAYLRHKQPDEDGT FT ISLANNRGRPTRCALNAPTPSSQKGLSVDDLCLIKKNCNLSTRNTIKLSSY FT LRGMDIQIPSKLRKSLTQENQKLKNFYSVAHFQFETTKASQENDLEALPKP FT VIYCNDVKGFINYVEKERNICQKDVFYKIGADGGGGSFKITLSIVDKERTT FT KKGEKDTGEKKLFLLAVAPGMSELYCNISKVWTTLLQLDNLEAVVAGDLKI FT INLLLGIMGHSSSYPCPYCITKKPVLSQVCKDSRTLGQILENYTQWMDQGA FT DGKRAKYHYNCVQKPLLCGSPETTTVTLCPPPTLHITLGILNAIFKAVEVA FT DPVTAARWVTVSACQRHSKFGFTGRNCHKLLAKRSVLSSDGPLGKFRSVLD FT GFAELFKACLQETLEVGYLQKIEDFATSWLDAELPNSSKFHILRFHVPEFC FT SSTQSGLGKYGEQACETVHFDFGITWEKLKVPESSECYDERLLRSVVLYNS FT SHV" XX SQ Sequence 3370 BP; 1131 A; 622 C; 654 G; 963 T; 0 other; cacggtgtct ttgtggggtc actttattac aaaaaaatag gtatgactga aatttaaaac 60 taaaaacatt tagactttgc tcgttctttt gcgcctaaaa tcaaaaaaat cgatcggggg 120 gtccagaaca ttttttttta ttttgattga ggtatttatg gatacatagt tctgtatacc 180 aacgtgcgtt tgtatatatg tgtatgtatg tatgaaggta tatatgtatg catacatgtt 240 tgtatgtatg tatgttatgt gtgcatgtat ttgtaagtac atgaataata aatatcgtag 300 caaaattatc tagttctatg cgtcatattg caaaacacct tttgtcttat gagaacaaaa 360 actaacagta ttttactgcc gttctacgca tatttgtatc gcagttcata aggtttacat 420 agaacatggg acaactaggc gcagaagggc attttattag ttgaaacata ggacccggtg 480 caagcaatgt ttacacaaaa tgaattcata aaacatccag tgaacgttga ttccctgtgt 540 ttaatgggaa cccatattca ctggatgttc atgttttgat tcatttattt ctaaacacta 600 cctgctgcgc atgacatttc ccgatgagaa gaaaaaagag taaatcgatc aaattagcca 660 gtgcagtgcg aaaactgaaa tcgacccctt ggtttgttgg taaaatgaag ccgaaaaatc 720 acgaacaaaa tcaacaaact gtttgcgttt tgtgtttaag taaaaacaaa ttacaaaagc 780 tttcggaagc agcgatttct cgtattagga gcttcattct tccagatatt gttgctaaag 840 catggtacta tccgaagaaa atttgtactc atttttcttt tgtgctgtac gagtgtaatt 900 ctacggaaaa tcttcaatcg aagattaacc tgcacgtgta caaatatgat caacgagttt 960 ccacaagaag tggcgcgctt tgccaatgcg aaatatgtat agccgcacga gtcacgttgc 1020 caaatttgat cgacccacaa aaaaaatcca gaaagagagg aagaccagct gttttggagg 1080 aatcgaaaag agcgcgtttt actaagcttt gtacaaaatg tttttccgaa atgtctagag 1140 gaaagcagca tatctgcaca attgttcagc ggcaaaacaa tgtgctacaa attgtacaag 1200 gggatcggcc tcaagccggc gatgccatca taacggcgta tctgaggcat aaacaacctg 1260 atgaagacgg gacaatatct ttagccaata atcgaggacg accaacgcgc tgcgccttaa 1320 atgcaccaac gccatcttcc cagaaagggc tttcggtcga tgatctttgt ctcatcaaaa 1380 agaactgcaa tttgagcacg agaaacacga taaaactaag cagctaccta cgaggaatgg 1440 acattcaaat cccttcaaag ttacgtaaaa gtttgactca agaaaatcag aaattgaaaa 1500 atttttattc tgtcgcccat tttcaattcg aaaccaccaa agcaagtcag gagaatgatc 1560 tggaagcttt gccaaaaccg gtcatttatt gtaatgatgt gaaaggattc atcaattacg 1620 tggaaaaaga gagaaacatt tgccaaaagg acgtattcta caaaatagga gctgatggag 1680 gaggtggatc gttcaaaata accttgtcaa tcgtcgacaa ggaacggact actaaaaagg 1740 gagaaaagga cactggagag aaaaaattgt ttttgctggc tgtcgccccg ggtatgtcag 1800 aattgtactg caacatttcg aaggtttgga caacgcttct tcagttggac aatttagaag 1860 cagttgtagc gggagacctc aaaattatta accttctttt aggtattatg ggtcacagtt 1920 catcctatcc atgtccatac tgcattacaa aaaagcctgt attgtcacaa gtatgtaaag 1980 attcgcggac cttaggacag atccttgaaa actatactca atggatggac caaggggcag 2040 acggtaagcg ggcaaaatac cactacaact gcgtgcagaa accgttgttg tgtggatccc 2100 ctgagacaac aacggtaacg ctctgtccac cccctaccct tcacataaca ctaggaatac 2160 tgaatgccat tttcaaagct gtggaagtgg ccgatccggt aactgcagca cgatgggtga 2220 cggttagtgc ctgccaacgg cacagtaagt tcggtttcac gggaagaaat tgtcataagc 2280 tgctagctaa gcgaagtgtt ttaagttcag atggtccact cggaaaattt cgttcggtaa 2340 gataggtaaa atattaattg atgcgttttt aataagtgta ctttctgttc taatttgcgt 2400 aggtccttga cggctttgca gagttgttca aagcctgcct tcaagagaca ctcgaagttg 2460 gatatcttca aaaaattgaa gatttcgcca caagttggct agatgccgaa ctaccgaaca 2520 gctcaaagtt ccacatattg agattccatg tcccagaatt ctgcagctca acacaatctg 2580 gacttggaaa atatggagaa caggcttgtg aaacggtaca ttttgatttt ggaatcactt 2640 gggagaaatt gaaagttccg gaatcatcgg aatgttatga cgaacgttta ctacgttccg 2700 tagtgcttta caatagcagt cacgtataat ttccaaagca tattcgaata attactttgt 2760 atatattaaa tatgaatcta aaaataaaaa ccaatttatt gccaggcgag tacggattct 2820 tttaaaatta aaatgcgata ctttcacaaa aataaaatat tgttttcgaa ttgggaaaac 2880 ttgctttaaa ataggtaaat aacgagcgta catgaatatt aattttctac aataacaagt 2940 agaatttgta aaagaagcat gtgaacctcg taaaaatgtc aatcctcagt atgctcagta 3000 aactaaaata tttactaaca tacaaacata catacattca ttcattcata tatatataca 3060 tccataaata catacattca tatatacata catacaaaca tacatatatg catacatgca 3120 tgcatacaaa cagacatata catttacata tgtacataca cacgtagcta cgcaaatatg 3180 tacatatgtg catatctcct gtaatgcatt gcaaatggaa aaatttcaaa aaaaaatttt 3240 actctagacc ccccgaccga ttattttcgt tttggccgca aatgaacggg aaagacttag 3300 acttttatgt ggcgaagttt cagacttacc tatttttttg taataaaatt gttctacgaa 3360 acacaccgtg 3370 // ID I-1_NVi repbase; DNA; INV; 5957 BP. XX AC AAZX01003261.1; XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE I-1_NVi is a non-LTR retrotransposon from the I clade. XX KW I; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; RNaseH; I clade; I-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5957 RA Bao W. and Jurka J.; RT "I-type non-LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1394-1394 (2009). XX DR EMBL/GenBank/DDBJ; AAZX01003261.1; Positions 14204 8248. XX FH Key Location/Qualifiers FT CDS 273..1877 FT /product="I-1_NVi_1p" FT /translation="MTDESDGEDHIMQTSPIKTFPEERTKKVDDLRKRKTT FT EQSQYKPEERNSTKRAHITQQDNNKDKTSQETNAQQHGNDQTDVLLYTEDI FT PPPYKIIIVPTTETKTSENTSESSTKSIRLEPIHIAKTIIPILPNNSIDEI FT KKSGKNRVTVITHNRKVANKILTLEILKEKKLTPFIPSSYIYRRGVIKHVP FT TDITENEIKNNIKLTSPVANLTIQSVKRFYRYNKSESGTTSTPTTTIMITF FT KGQVMPQYADLYRIKHSVEAYIPTVKLCANCYRHGHTKAFCKSKTRCIYCG FT ETDHDSTLCTMKNETLKCINCTENHLPTDRNCKIRIREQQIITMATKRNIS FT VQEAKQNYNELILNKKRIRFSEFPELSIATTPDELDYASQIYNTPRRRLFT FT DTLKTSRPVNSRPLTNNSPKKLHNNVINKENNQLTYQHKNILISPNGDYKL FT NYTATKQNQGNSPNNNNYLTTTPNLSTLASPLLTLNEKGFLLDKKISLTQN FT DIVNYLNNDPNFYENILNILQENSYQNSQNTYIENNSL*" FT CDS join(1890..3185,3062..3598,3647..3985,3915..4379, FT 4383..4928,4870..5808) FT /product="I-1_NVi_2p" FT /translation="VNIDLKEIKILQWNCHSLKSKLPIFRNSIEEYDIIAL FT QETWLDELSNPKIPNYSIIRKDRKNSYPSGGVAIIIRNNIPFKIIENIYYS FT EKTMETIAISIPLIQHNNDTNEENSLLIASVYRNPAKQTSINEWNSLFNSI FT KNYKNIYIGGDFNAHHFSWGSSKICNSGEFLVESLLDNNLHILNDGTHTYY FT TSKTITLPNNSVVQKIISSPLDLTLISTNLLHKTTWKIQEDNMYSDHYPIL FT CTISLEIETIPFNSTHKISHKNVNWKLFSTLLKNYFEKNHQTIENMTIDEQ FT YNFLINLCTTSIIQLNTKNNSKKTYENQALFSGDIFNTSPTELISQYHPSL FT VKTFNKRPIKAPWWNQVCTELARERRLASFKLLKNPTQENLDDLKQIEKST FT RKKLNKIKKIVLGNMLMKSLRLIQTPKLYGTSYKAIKTKIYKKKIKQNKKD FT SFREHVNEIFTLDTDSKTIWNFIQSYKNITNIKPEYAPNDIEILENINQFI FT EEFTSPAQSNNTDDDQYSIEPNNNNPYQELDAPFSPIELNLAIKEVKKQSS FT PGLDKVDYEMIKNTPDCFKELLLKLIHFYYTNCNSPISWNKFLIILLPKDS FT KKNSDPSHWPHVYKKNDLIPDTQNGFRKAKSCHHALAYILTDIHLATHKNL FT YTLCVLIDIKSAFDNVCPKILDKILKDLGIPYKTRSFIYKLISNKQLFFKI FT GNKLQGPYKKKWSSSRKCIESHFIETNYKDHIKKNGVPQGSVLSPILYNLY FT VSLLKKILPNDIFLVQYADDNLIYCIHQDIPTAIEKLQQALVTFNAYFENL FT KLQISPQKTKFIIFSNTNMQTNGNHSIEFNDAVIKESKSVKYLGMILDNTL FT SWNEHAEHIIAKSTKQLNILKFLRGTWGGHPKVLKQIYLSLIRSSLEYGLF FT LTHIKNSNRRSKIQKIQNQAIRLALGYRRSTPINVMHLESKIPYFNDRIRY FT LSDNFILKLIAHDTPLKQKIENLSITIANNDRLNIKNNFPIIRSYNEIKSL FT VQNKIQRHLHPIPYDYTYEAIVEQPDINVTSGLHIKKSANPPQEFKDNHSV FT HNLHKNQLIHPKNLKITTQYTIFTDASKMKNQEYTGLAMFMASNNQAYQFK FT IHTEASIFTAEGLAIFLSLNNIINSNFKRAYIFTYSLSILSALKDYTPNKS FT RNTSHIILDIKTLLNKCSQENIKVTLIWIPAHINIKYNETVDSLAKNAIKD FT GTNTIYLLPFSDFGDKIKQKLILNTQQLITDQGTYKGTHYFDHYYKDNEAK FT PWFTKINLPREHITTINRIRPNHYNLSASLFRKNIIQSPTCECGYESENID FT HILWDCPRFSNQRYSLINKLNHYHSKKKKNTPKDTTSLLKNPNEKPARIIT FT EYLKRCNLQI*" XX SQ Sequence 5957 BP; 2503 A; 1249 C; 738 G; 1467 T; 0 other; ccggcgacgg acgggagcca ctggcgcgat acaacgtgtt tctaaaactt tcttcatttc 60 tttatttatt tgatgccata tctttcctaa aactaaaaac acaactactt ataaaaataa 120 aaaactcgtt agaatagtta caactttaaa ctcactagta taatcagcaa taactactaa 180 tatcggaaca ataatttcat aacctaaaac tgagttgaaa ctacccagcg aggatacagc 240 cctaggggct tattctcaac aaacacctaa ggatgacaga cgaatctgat ggagaagacc 300 atataatgca aacatcacct ataaaaacct tcccagaaga gagaacaaaa aaagtggatg 360 atctacgaaa aagaaagacc acagaacagt cacaatataa gccagaagag agaaatagta 420 caaaaagagc acatattaca caacaagata acaacaagga caaaacgtct caagaaacaa 480 atgcacaaca gcatggaaat gatcagacag acgtcctatt atacacagaa gatattcctc 540 ctccatataa gatcatcata gtacctacaa cagaaacgaa gacctctgaa aatacctcag 600 agagctctac taagtcaata agactagaac ccatccatat agctaaaaca atcataccta 660 ttcttccaaa caattccata gatgaaataa aaaaatctgg gaaaaataga gttaccgtga 720 tcacacataa cagaaaggta gccaacaaga tactaacact tgaaattcta aaagagaaaa 780 aactcacccc atttatacca agctcataca tatacaggcg aggagtaatc aaacacgttc 840 caacagatat tacagaaaat gaaatcaaaa acaatattaa acttacatca ccagttgcaa 900 atcttacaat acaaagtgtc aagagatttt acagatacaa caaatctgag tctggtacaa 960 catcaacgcc aacaaccact attatgatca ccttcaaagg acaagttatg ccacaatacg 1020 ccgatctata cagaataaaa cactcagttg aagcttacat ccctacagta aaactctgcg 1080 ccaactgcta tcgacatgga cataccaagg ccttttgcaa gtcgaaaact agatgtattt 1140 actgcggcga aactgatcat gactccacac tatgcaccat gaaaaacgaa acccttaaat 1200 gcataaattg cactgaaaac cacctcccaa ccgacagaaa ctgcaaaatt cgcataagag 1260 aacaacagat cattactatg gcaacaaaaa gaaatatttc tgttcaggag gctaaacaaa 1320 actacaatga actaatatta aacaagaaaa gaataagatt ctcagaattc ccagagctta 1380 gcatcgccac aactccagat gaactcgact atgcaagtca aatttataac actccaagaa 1440 gaagactatt cacagataca cttaaaacat ccagaccagt taattctaga cctcttacta 1500 ataactcgcc aaaaaagctt cacaataatg taattaacaa agaaaacaac caactcacat 1560 atcaacacaa aaatatttta atctccccaa atggagatta caaactaaac tatactgcta 1620 ccaaacaaaa tcaaggaaac tccccaaaca ataacaacta tctaaccaca actccaaatc 1680 tttcaacact agcatctcct cttctcacgc taaatgaaaa aggattcctc ctagataaaa 1740 aaataagctt aactcaaaat gatatagtca attacctcaa taatgaccca aatttctacg 1800 aaaatatttt aaacatatta caagaaaata gctatcagaa tagccagaat acttatattg 1860 agaacaacag tctataatat aatatctgag tcaatattga tctaaaagag ataaaaattc 1920 ttcaatggaa ttgtcatagt ctcaagagca aactaccaat ttttagaaac tcaatagaag 1980 aatatgatat cattgcattg caagaaactt ggttagacga attatctaac ccgaaaattc 2040 ccaactacag tataataaga aaagatagga aaaatagcta cccatctggt ggagtcgcca 2100 taatcattag aaataatatc ccatttaaaa taatagaaaa catatactac agtgagaaaa 2160 caatggagac aatagctata tcaatccctt taatacagca caacaatgat accaatgaag 2220 aaaactctct cttaattgcg tccgtatacc gtaaccctgc aaaacaaaca tccataaacg 2280 aatggaacag cttatttaat agcattaaaa actataaaaa tatctacata ggtggagatt 2340 tcaacgctca ccatttttca tggggctcga gcaaaatatg caactcagga gagttcctag 2400 tagaatcatt attagacaac aacttacata ttctcaatga tggtacacat acatattaca 2460 ctagtaaaac aataactcta cccaacaact ctgttgttca gaagataatc tcatctcctt 2520 tagacttaac gttaatatcc acaaatttac tgcacaaaac aacttggaag attcaagaag 2580 acaacatgta tagtgatcat tatccaatat tatgcaccat ctctttggaa atagagacga 2640 ttcctttcaa ctctactcac aaaataagtc acaaaaatgt caattggaaa ctattctcta 2700 cactccttaa aaactacttc gaaaaaaatc accaaacaat agaaaacatg accatagatg 2760 aacaatataa ctttctcatc aacttatgca ctacatctat tatacaacta aacactaaaa 2820 acaatagcaa gaagacctat gaaaaccaag cacttttctc aggtgatatt tttaacacct 2880 ctccaacaga gcttatatca caatatcatc cgagcctggt aaaaactttc aacaaacgtc 2940 caatcaaagc cccatggtgg aaccaggtgt gtacagagtt agcaagagaa agaagacttg 3000 caagcttcaa attattaaaa aaccctacac aagaaaatct agatgacctt aaacaaattg 3060 aaaaatctac aagaaaaaaa ttaaacaaaa taaaaaagat agttttaggg aacatgttaa 3120 tgaaatcttt acgcttgata cagactccaa aactatatgg aacttcatac aaagctataa 3180 aaacataact aacattaaac cagaatacgc cccaaatgac atagaaatac tggaaaatat 3240 aaatcaattc atagaagaat ttacctcacc agcacaatct aacaatacag atgacgacca 3300 atacagtata gaacctaaca ataacaatcc atatcaagaa ctagatgcac cattttctcc 3360 aattgagtta aacctagcta taaaggaagt taagaaacaa tcttctccag gacttgataa 3420 agttgattat gaaatgatca aaaatactcc ggactgtttc aaagagctat tactaaaact 3480 aatacatttc tactatacaa actgtaatag tccaatcagc tggaataaat tcctaataat 3540 actactacca aaagacagca agaaaaattc agacccatct cattggcctc atgtttatta 3600 aaattaactg aaaaaatgat caacacacgc atacactttt tcgtagaaaa aaaatgattt 3660 gattcctgac acgcaaaatg gatttaggaa agctaaatcc tgccatcatg cattagcata 3720 tatcctcaca gacatccacc tagctacgca caaaaatctt tatacactct gcgtactaat 3780 cgacatcaaa tcagcctttg ataatgtatg tccaaaaatt ttggacaaaa tactaaaaga 3840 cctaggaata ccatataaaa caagatcctt tatatacaag cttatttcaa ataaacaact 3900 attcttcaaa ataggaaaca aactacaagg accatataaa aaaaaatgga gttcctcaag 3960 gaagtgtatt gagtcccatt ttatataatt tatacgtaag cttactcaaa aaaatcttgc 4020 caaacgacat atttttagta cagtatgctg acgacaacct tatctactgt attcaccaag 4080 atatacccac agctatcgag aaacttcagc aagccctagt aacattcaat gcatattttg 4140 aaaaccttaa actccaaata tcacctcaaa aaacaaaatt cattatcttc tcaaacacta 4200 acatgcagac aaatggaaac cactcgatcg aattcaatga tgctgtaata aaagaatcta 4260 aatcagtaaa atatctgggt atgatattag ataatactct ttcctggaat gaacatgccg 4320 aacatattat agccaaatcc acgaaacaac ttaatattct caaattccta cgtggtacat 4380 agtggggagg tcaccctaaa gtattgaagc agatttatct ttcactaatt agaagctctc 4440 tagaatatgg actattttta actcacataa agaactcaaa tcgaagaagt aaaattcaaa 4500 aaatccaaaa tcaagcaatt cgattggctt taggctacag aagatccaca cccatcaatg 4560 ttatgcactt agaatcgaaa ataccatact tcaatgaccg tatacgatac ctttcagaca 4620 actttattct gaaactaata gctcatgata caccgctaaa acaaaaaata gaaaatctta 4680 gtattacaat agctaacaat gacagactta atattaaaaa taattttcct ataattagat 4740 catataatga aataaaatct ttagtccaaa acaaaattca aagacatctt catccaattc 4800 cctatgatta cacatatgaa gcaatagtgg aacaaccaga tattaacgta acttctggac 4860 tacacataaa aaaatcagct aatccacccc aagaatttaa agataaccac tcagtacaca 4920 atcttcactg atgcatcgaa aatgaaaaat caagaataca caggactggc aatgtttatg 4980 gcatcaaata atcaagccta ccagttcaaa atacatacag aagcatcaat ctttacagct 5040 gaaggtttag caatttttct gtcgttaaac aacatcataa actccaattt caaaagagct 5100 tacatcttta cttattcact gagtatcttg agtgcactga aagattatac gccaaataaa 5160 tctaggaaca cttctcatat aatacttgac attaaaacct tgctaaacaa atgttcacaa 5220 gaaaatataa aagtcacttt aatttggatt ccagcacata tcaacataaa gtataatgaa 5280 acagtagatt cacttgcaaa aaatgctata aaagatggaa ctaacactat ctaccttcta 5340 cctttctcag actttggtga taagattaaa caaaaactca tcttaaacac tcaacagctc 5400 ataacagatc aaggaaccta taaaggcaca cactatttcg atcactatta caaagacaac 5460 gaagcaaaac catggttcac aaagataaac ctaccaagag aacatataac caccatcaac 5520 cgtataagac caaatcatta caatctgtcg gcaagtctat ttaggaaaaa cataattcaa 5580 tctccaacat gtgaatgtgg ctatgaatct gaaaacattg accatatact atgggattgt 5640 cccagatttt caaatcaaag atactcactt atcaacaaac tcaaccacta ccactcaaaa 5700 aagaagaaga atacaccaaa agatacaaca tcactgctaa aaaatccaaa tgaaaaacca 5760 gcaagaataa ttactgaata cttgaaacgc tgcaatctac aaatttgaac tctaacttga 5820 atttaaattc aaaccaagtt tatgcaagac aatccaaatc tacattacca tcagtcaaac 5880 atgcaaggtg gcacatcgga gtcgcgtccg ctcttggaca gcgatccaca gccaataaca 5940 aaaagaagaa gaagaag 5957 // ID Shinagawa-7_AAe repbase; DNA; INV; 1937 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1937 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 844-844 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. ~8-bp TSDs. TIRs are ~120 bp long CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. Sequence 1405-1659 is ~75% identical to CC FEILAI-1B_AAe; indicating it is a composite transposon. XX SQ Sequence 1937 BP; 562 A; 367 C; 361 G; 640 T; 7 other; ggttctccga cattacccgg aaaaccatta cccggaatgc aatttacccg gaagaccatt 60 acccggaaag ccatttaccg gaatggacca ttncccggaa aaccattnac cggaatgaac 120 catttaccgg aaaaccatta cccggaagga ccatttctcg acttntttca ttaacgtcct 180 tttaaccgtt ttacggtcat tcaggtcagt atgttgactt aaaaatattt aaaatgacca 240 ccaacgataa ggttcttagt tcaaaataca taagcaccag agaggagtct aagatatctt 300 catacttgat catgtcaata tgcaccatct tcatatattt gcatttcata aatgattcac 360 ccttctttta aaaataagct gttctttcaa cttatctttt cagcttgtct tggctgaact 420 ggcttaattt ggctacaact atgtaacttt acgtttaata ttcgaccgtc gattctccaa 480 gtgtccaaat gttaataaat aagcaatgat tagaattttt aatacttttc gccgagtgtc 540 aattcgccga atgtcgtttc cgcaaacgtc aatcaaccga ttgtaaatcc cccgtatatc 600 cctttttcta gaatgaccca tttcgggtca tctgatattc acccttcttt atttggttgg 660 cggttctttc gagttccacc gatctcggca tttttggcaa aatgtgttta gtcgaaaatg 720 accaaatatc ttattatttt gattgcttat tgttctttct tgtttattac actggcattg 780 aattcggggt acagacattc agagttatga cattcggaga aaaatagcac aatcaatcaa 840 antcaaagga aaagtagatc tattgaggtt ccaaagatgg acgttacaat ggcgaatagt 900 gtccctactc tcaatcacca aggcactcct agttttcgtc ttgatttttt tagctttccc 960 caaaccataa agctgacatg agaattgaac tgttgtatgt tgcttccctc gcaagataag 1020 tttttttttt atattttgat acaatattga aattggattc tgatctgttt taacgtaagg 1080 gaataagaaa accatcataa cnccaaacat ttttaaatgc tgagatgttc catttactca 1140 aatgcaacct accttatttg ttaggctttt ttcgatttcg atttataatg atggtacctt 1200 ccaagtggtc tcggacataa aagaaaatta ataattcttt tcattctttc tgtccttgcc 1260 tgtaatgttt agttttatag cctaaatttg aagacatagt tttccgatct atatcaggcg 1320 attcttttct aattaggtcg ccaacaaaat tacatgttca atcaaatttt ggtatcgtga 1380 tattcgttca tagtttaagg cttacgttgc ggcaaagtga agttattcag aaagatcaag 1440 ctgaggttgg tagaattgaa taccgccggt tcaaaattat ttctggntga caaatttgac 1500 aacgtttcag agctagagta cttacatact tgtcaaacta tgaacgaatg caaaaatggt 1560 cggttgacat aggctctcag ctantaactg tagaaatgct cctagaacag ctgagaagca 1620 ggttttgtcc cagttgagtc gtaacgccgg caagaagaac aacatatttg gtccatagct 1680 gtttggagat gtcttttgaa atagtgttta tggaaaccga gggagtcgag aatgtccatg 1740 gttagcgtga cataatttgt gctgcgcatc aatgaaaggt cgtcgaaaat ggaagatttg 1800 gctttccggg taatggtgca ttccggtaaa tggtacattc cggtaaatgg tttttccggg 1860 taatggtaca ttccggtaaa tggtcttccg gtaaattgca ttccgggtaa tggtattccg 1920 ggtaatggta tagaatc 1937 // ID BEL-82_AA-I repbase; DNA; INV; 6167 BP. XX AC AAGE02026656; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-82_AA_; KW BEL-82_AA-LTR; BEL-82_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6167 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026656; Positions 18625 12459. XX CC 'GTGAC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 317..1669 FT /product="BEL-82_AA-I_2p" FT /translation="MLAVPEKESKKGGKSTAGSKTSRKSKKTVGEKSVTSS FT VRARMAMELKVLEEQQRIQEEELAAEKEFKDLRLKHEQELQEKQRAFEAQK FT LADEKAFLERKMSEEQEYRKQQMAIRKQSLEEKVKLVRQMSERGSSATSGI FT TSVPDSREKVENWLKISGQQTAGKTLESTSAVRPAVNTEEESRGLYNGATS FT AIPKTNVGTPFVNNSDLPIRTVFVPTQTGTFPVVRQQLAGMNLEEDGRQAD FT GNIGFRESNVRAQTETRPQGMSNNRQEECWSNNRPNTGVPMAGGGRRVAFD FT VEDFDGPTNRQLAARQVMGKDLPPFSGRPEEWPLWFSNSQRSTTTCGFSDD FT ENLIRLQRCLKGEALDTVRSKLLCPSSVPHVIRTLEMRYGRPGTLIRIMTE FT RIRQLPSPRINDLNSIIEFGLAVDSLVEHLRNSGEQSHLDIHPYFTIWLLS FT CRSTIA" FT CDS 2347..5499 FT /product="BEL-82_AA-I_1p" FT /translation="MEPLGLQWTGNIKRNESKSRRVTMEVSGAGSIKRFLL FT SDARTVGSLLLPSQSMDYGNLAEKYPYLRGLPLQDYSNVSPKILIGLDNLK FT LTVPQKIREGGWKDPIAAKSRLGWSIYGCSQAPTMSVICGFHFGGWTDPDQ FT ELNELVRDFFTLESAGVTGPPRLLESEEDKRSRMLLETTTRRVPTGFETGL FT LWKTNDVRFPDSRGMASRRLRALERKLVNKPELHDNVCLQVRQYVEKGYAH FT VATKEELTQTPLDEAWYLPLGVHQNPKKPNKVRLTWDGKATVQGVSLNSAL FT LKGPDMLSSLPGVLSHFRLFRYALTGDIKEMFHRIRIREEDRQFQRFLWRN FT DVSQDPVVYVMDVATFGSTCSPCSAQYVKNRNASEFSGEYPRAAYAINHYH FT YVDDYLDSYGTREEAIKVGKEVKMIHSKGGFELRNFLSNDAEIAARVGAES FT EEIDKSLSLDRPEGVESVLGMRWNPGSDCFTYDLTMRSNLADIVKEGHIPT FT KREVLRVVMSLFDPLGLIAFFLVHGKILMQDIWSSRAHWDDQINEDLNLRW FT IEWSSLLPQLNTVQIPRCYFPKAEADVYSTLQVHVFVDASEKAYSCAVYFR FT VLGPDGPLVSLVGAKSKVAPLKMLTIPRLELQAAVLGTRLLNSVISMHGIP FT VSQRFLWSDSACVLSWLQSEQRRYHQYVGFRISEILSSTEVNEWRWIPSEL FT NVADDATKWGSGPKIDTSSRWFVGPEFLQLPEENWPRSCSNRWTTEEELRS FT CNVHVNRPELVDVARFSRWERLHRTMAYVQRFIHNLRRSLRGECLEKGALM FT QQELKKADEILWQQAQRRFYGVEIGLLQETSGTPAAQHAAVPKSSSIHKLW FT PFMDQNGVVRKRSRLAAASWIADEVKYPVILPREHPISFLLTDWFHRRFRH FT ANRETVVNEMRQQFEIPKLRSLVAKVSRSCMKCRIRRAMPHSPPMAPLPEV FT RLTPYVRPFTFVGVDYFGPVFVKTGRSNAKRWIALFTCLTIRAVHMEVVHS FT LTTESCVMAVRRFVSRRGSPAEIYSDNGDELPWSRESTEARNRGA" XX SQ Sequence 6167 BP; 1657 A; 1406 C; 1703 G; 1401 T; 0 other; aaacctttag aaatttatca cactggagcg ttccagagca gcaatggcag gagccggcat 60 ggatgattca caaagcgaaa ggagctgtgc cacttgcaat cgaccggatt cagtcgatga 120 tatggttgct tgtgattcat gccagaaatg gtttcattag tcgtgcgcga aagtggatgc 180 tggcgtcaag aatcgtgagt ggtattgttc catctgtata cccctgctga gagctaaatc 240 tgctgaggtg tcggagcaac tccaaggtgt ttctgctggt caagccaagt ccgatgtggt 300 acagaacgga cagtcgatgc tggcagtacc ggagaaggag tctaagaagg gcggtaagtc 360 tacagcaggt agtaagacgt ctaggaagtc aaagaagaca gttggtgaaa aaagcgttac 420 atccagtgtg cgtgcacgta tggctatgga gctgaaggtc ctggaggaac agcaacgtat 480 ccaggaagag gagttagcgg cggagaagga gtttaaagat ttgcggttga agcatgagca 540 ggaattgcaa gagaagcaaa gggcatttga ggcacagaag cttgcagacg aaaaggcatt 600 tttggaacgg aagatgtccg aagaacagga gtataggaag caacagatgg caattaggaa 660 gcaatcgttg gaggagaagg tgaaactggt gcggcaaatg tcagaacgtg gcagtagtgc 720 tacgagcggc ataacaagtg ttccggattc gagggagaaa gtagagaatt ggctgaagat 780 atcaggacag cagacagcgg gaaaaacttt ggaatccacg tcagcagttc gtccagctgt 840 aaatactgaa gaagaatcca gaggactata caatggcgcc acatcagcta ttcccaagac 900 caacgttggc actcccttcg tgaacaacag tgacttgccg ataagaactg tttttgttcc 960 gacgcaaact ggaacttttc ctgttgtaag acaacagttg gccggtatga acctggagga 1020 agatggcagg caggcggatg ggaacatcgg gttcagggaa tctaacgttc gtgcgcaaac 1080 agaaactcgt ccacagggta tgtctaacaa ccgtcaagag gagtgttggt ccaataatag 1140 acctaatact ggcgttccaa tggccggcgg tgggcgtcgt gtagctttcg atgtcgagga 1200 ttttgacgga ccaacaaatc ggcaattagc ggcgcgtcag gtcatgggta aggatctccc 1260 accattttcg ggtagaccgg aggagtggcc actatggttc agcaactctc agcgttcgac 1320 cactacctgc ggcttctcag atgacgaaaa tcttatcaga ttgcaacgct gcttaaaagg 1380 ggaagcactc gacactgtga ggagtaaatt gctttgccct agcagtgtac ctcacgtgat 1440 taggacgctg gaaatgcgtt atggtcgacc aggtacgctg attcgtataa tgacggaacg 1500 tattcgccag ttaccatcgc ccaggattaa cgatctgaac agtatcatcg agttcggatt 1560 agcagtcgac agtctggtgg aacacttaag gaattcagga gagcagtccc atctggacat 1620 ccatccctac ttcacgatct ggttgctaag ctgccggtcg actatcgcat gaagtgggcg 1680 gcgtataaaa gttctctacg tcatgcaaac tttactgatt tcggctgctt tatgtccaca 1740 atggtagagt tagcctacga agttgcagat gactttgttc ctgtaaaatt caacaagaca 1800 gttaatcagt cgaagcccaa ggagcgagcg tttctgcagg cacactctga atcttctgca 1860 cctgccagta aagatgtccc gagtagcagt attaaagaac gaccggcaaa gaaaacctgc 1920 gttgcgtgca aggcggaagg acacaaaatc gtggattgtt ccagattcaa gcagatgaac 1980 atcgatgagc gattcaaagt ggtgcagtgg ttatgcagga cgtgcctaaa tcagcatgga 2040 aggtggccgt gtaagtcgtg gaaaggttgc gaaatacgag attgccggat gcggcatcat 2100 acccttttgc accccaccaa cgggacaaca aaaaaaaaaa ccccaccaac gggacaacaa 2160 acgtggctgt tttaaccaac catttgggag aattccaatc gctgggtggt ccactcttca 2220 gaatcattcc agtcactcta tacggaaacg gctgccaggt ccaacgtctt cgcattcatc 2280 gatgaaggat ctcagctgac tcttctagag aataccgtcg ccgagaagct tggagtcgaa 2340 ggtcctatgg aaccacttgg cctacaatgg accggaaata tcaagcggaa cgagtcaaaa 2400 tcccggcgag tcactatgga ggtttctggc gctggatcaa tcaaaaggtt cctactttct 2460 gacgctcgga ctgtgggtag tctgctgctt ccatcgcaat caatggacta cggaaacctg 2520 gcggagaaat atccgtacct gcgaggatta ccacttcaag actattcgaa cgtatcgcct 2580 aagatcctca tcggtcttga caatttgaaa ctcactgtgc cacagaagat tagggaaggt 2640 ggttggaagg acccaatcgc ggcaaaaagc cggcttggat ggagcatcta cggatgttcg 2700 caggcgccaa cgatgtcagt aatatgtggg tttcattttg gaggatggac cgatccggat 2760 caagagctca acgaactagt tcgcgacttc ttcaccttgg agagtgcagg cgtcacaggt 2820 ccaccgcgcc tgttagaatc agaggaggac aaaagaagca gaatgctgtt ggaaacaacg 2880 actcgcagag tcccaaccgg attcgaaacc ggcctcttat ggaagaccaa cgatgtgcga 2940 tttccggata gccgaggcat ggcttctcgc agacttcgtg cactggaaag aaaactcgtc 3000 aacaaacccg agttacacga taatgtctgt cttcaagttc gacagtatgt cgagaagggc 3060 tatgctcatg tggccaccaa agaggagtta acgcagacgc ccctggatga agcgtggtat 3120 ctgccattag gtgtgcatca gaacccaaag aagccgaaca aggttcgtct gacgtgggat 3180 ggaaaagcaa ctgttcaagg agtgtcgctc aattcggctc ttttgaaggg accggatatg 3240 ctgtcctcac taccaggagt actcagccac tttcgcctgt tccgctatgc attgacaggc 3300 gacatcaagg agatgttcca ccgcatacgt atccgtgaag aagatcgcca attccagagg 3360 tttctgtgga gaaatgatgt cagccaggat cccgtcgtgt acgtgatgga tgttgcgacg 3420 ttcgggtcca cctgctcccc ttgttcagcc caatacgtta aaaaccgcaa cgctagcgag 3480 ttttcaggag agtatccacg ggctgcgtac gcaatcaacc attaccatta cgtggatgac 3540 tacttagaca gctacggaac acgcgaggaa gcgatcaagg tcggaaagga ggtgaagatg 3600 atacatagca aaggcggttt cgagctgcgc aacttcctct cgaatgacgc cgagatagct 3660 gcaagagtag gagcagaatc cgaggagatc gacaagtcct tgtcgctgga caggccagaa 3720 ggtgttgagt cggtactagg aatgaggtgg aatcccggaa gtgactgttt cacgtacgat 3780 ctaaccatgc gtagcaatct ggcggatatt gtcaaggagg gtcatatacc gactaagcga 3840 gaagtccttc gagtggttat gagccttttt gatcctcttg ggctgattgc gttcttcctg 3900 gtgcacggga aaatcctcat gcaggacata tggtcttctc gagcacattg ggatgaccag 3960 atcaacgagg acttgaactt gcggtggatc gaatggagta gtcttctgcc tcagcttaac 4020 acagtccaga ttccccgttg ttactttccc aaggctgaag cggatgtgta ttccacactt 4080 caagtacacg tattcgtgga cgctagcgag aaggcttatt catgtgcggt ttatttccgc 4140 gttctaggac cagacggccc tttagtctca ctcgttggcg cgaaatccaa agtggcccca 4200 ctaaaaatgt tgacgattcc acgcctggaa cttcaagccg ctgtcctggg aactcgcctc 4260 ctgaacagtg ttatttccat gcacggcatt ccagtttcgc aacgattttt gtggtcagat 4320 tcggcgtgtg tactgtcatg gcttcaatcg gaacaacgcc ggtaccatca gtatgttgga 4380 ttccgtatca gcgaaatact ttcgtccacc gaagtaaacg agtggaggtg gataccttct 4440 gaattgaatg tagcagacga cgccactaaa tggggttccg ggccgaagat tgatacaagt 4500 agccgatggt tcgtagggcc ggagtttctc caactgccgg aagaaaactg gccacgtagc 4560 tgtagcaacc gttggactac cgaagaagaa cttcgctcct gcaatgtgca tgtcaacagg 4620 cctgaactag tggacgtagc ccgtttcagc cgatgggaac gacttcatcg tacgatggca 4680 tacgtccagc gattcataca caaccttcgg cgatctcttc gtggtgaatg tttggaaaaa 4740 ggtgctttga tgcaacaaga gctgaagaaa gcggacgaaa ttctctggca gcaggcgcag 4800 cgaagattct acggtgtcga aattggactt ctgcaggaaa ctagtggtac accggcagct 4860 caacacgctg ctgtgccgaa gtcaagcagt atccataaac tgtggccgtt catggaccag 4920 aacggagtgg tacgcaagcg aagtcgtctt gcggcagctt cttggatagc ggacgaagtt 4980 aaataccctg tgattctgcc aagagagcac ccgattagct tccttctaac ggattggttc 5040 caccgtcgct tccgccatgc caaccgggaa acagtcgtca acgagatgcg gcaacagttt 5100 gaaatcccaa agctgcgatc actagtagca aaggtatccc gaagttgtat gaagtgtcgt 5160 atccgccgag ccatgcctca ttcgccgcca atggcaccgt tacccgaagt gcgactcacg 5220 ccctacgtac gtccattcac ctttgttggc gtagattact ttggacctgt atttgtgaaa 5280 acgggacgta gcaatgccaa acggtggata gcattattca catgcctaac tatacgggcc 5340 gtgcatatgg aggtggtgca tagtttgaca acagaatctt gtgtcatggc agtccgacgg 5400 tttgtctcca ggcgaggttc tccggcggaa atatattcgg acaacggtga cgaacttcca 5460 tggagcagag aatcaactga agcgagaaat cgaggagcgt aatcatcgtc tagcaacagt 5520 attcaccaat tcttgcacac gctggatgtt caatcctcct ggtgccccac atatgggagg 5580 ggtgtgggaa aggatggtcc gttccgtaaa ggctgcgatc gggactattc tggagactca 5640 acgtcgaccg gatgacgaat tattggagac ggtgatactt gaagcggagg caatgatcaa 5700 tagtcgtccg ctgacatata ttccactcga tttcgcggat caggaagccc taaccccgaa 5760 ccatttcctg cttggaagct ccagcggggt gaaacagtta ccggtcgagc ctgtagacta 5820 ccggtcaact ctgagaagtg gatggaagct cgcgcagtat ctggtggacg gtttttggaa 5880 gcgttggcta aaagagtatc tgccagtgat ttcgcggcgt tccaaatggt ttgccgaagt 5940 gagagagttg aagaaaggag atttggtgtt cgtagtggac ggaatgatca ggaaccaatg 6000 gctaagagga aaggtagaac atgttgtgtg cggtacagat ggacgagtac gtcaagcatg 6060 ggtgcagacg gcgaccggag tacttcgcag gccggtggtg aagctggcct taatcgacgt 6120 catggaggaa cgtaaaccac ccctcgagtg ctttacgggc gggggat 6167 // ID Tc1-1N_TCa repbase; DNA; INV; 1379 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 21-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Tc1-1N_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1379 RA Jurka J.; RT "Mariner/Tc elements from insects."; RL Repbase Reports 9(3), 676-676 (2009). XX DR [1] (Consensus) XX CC Putative non-autonomous. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1379 BP; 432 A; 259 C; 266 G; 422 T; 0 other; cactgctcaa caattgaagt ggatattgag agaaattcta aattttggta attcattgta 60 tattaaataa caaaaaacaa ttacaaaaaa taaatttata ctcgttcaca ctcaaaatta 120 aaaactgaag aaaattgtcc gtaaaaaacc agcactagaa gtaaaacaca aaatcgaata 180 aaacgtaacg aaacttaaaa gttctcaaat ttaaatttta catcatccct acaaaaacct 240 tgtaccgtgt aggacctctc ctggctgtta gcagggctct tactcaattg gagagactca 300 ttatcaaatt gttcataaaa tcttgaggta tcacttccca aattccttgc aatgcattag 360 ccagctcttg taaagtgtca ggagactgct ggagctggtt taaatgccta gacatttcgg 420 cccaaacatg ttcgatgcag ttcatgtcag gcgaacaggg tggccactcc attcgtgtaa 480 tgccatggtg tcctatgcgc tcattcacaa atctggcgcg acgagggcgg gcattatcgt 540 caataaatac gaatcgtttg cctattgcac caaaaaatgg cacaattatc ggctctatca 600 ctctgtctcg atagcggaca gcagtcattg ttttattaat taagaccaat aggtctgtgc 660 ggctattaaa acatatgcct ccccatacgc aaactacccg ccaccaaaaa ggcaagtgtg 720 ggaggaataa tgcaagtctt gattaatgaa gcaatgactg ctgagcgttc tcgagataga 780 gtaatagagc cgataattgt acaatttttt ggtgcaatag gcgaaagatt cgtgtttatt 840 gatgataatg cccgccctca tcgcgccaga attttgaata agcgcataga acatcatggc 900 attacacgaa tggagtggcc accctgttcg cctgacatga actgcattga acatgtttgg 960 gccgaaatgt ctaggtgttt aaaccagctc cagcagcctc ttgaaacgtt acaagaactg 1020 gctaatggat tgtaagaaat ttgggatcga tacctcaaga ttttattaat aatttaataa 1080 tgagtctctt caatcgagta agagccctgc taagagccag gggaggacct acacggtact 1140 aggtttttgt agggacggtg taaaatttga ctttgagaac ttttaagttt cgttaggttt 1200 tattcatttt ttgttttact tctagttctg gtttttcacg gacaattttt ttcagttttt 1260 aattttgagt gtgaacgagt ataaatttat ttattgtgac tgttttttgt tatttaatat 1320 acaatgaatt accaaaattt agaatttctc tcaatatcca cttcaattgt tgagcagtg 1379 // ID Helitron-3_NVi repbase; DNA; INV; 5522 BP. XX AC . XX DT 14-APR-2009 (Rel. 14.04, Created) DT 14-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Helitron DNA transposon - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5522 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 764-764 (2009). XX DR [1] (Consensus) XX CC The consensus may be incomplete at both ends. XX FH Key Location/Qualifiers FT CDS join(3..776,780..1013,1017..3104,3080..3733) FT /product="Helitron-3_NVi_1p" FT /translation="NTSYPQELKDLLFQNNSKSINFKKNIRGYNNLFAFAS FT FGSKLVYFNNPGPQVLKICGQVYHNSYSLHPNENEDPKYGQLYIIDNEIAN FT KYRANNYKIDKPDSNILEFLCTFLSLNNPYAKAYKMMYEIENEENINAHNN FT NITPPEIVMSLTRDNHTKNKSYFATCNEVAVVYIGNYGEPPFDRDIRIYPK FT TEKPQQINILSKHLDPMTYPLIFPFGHPGWQPYIKCTKKEYKNNNISTLQF FT YSYRLSIRGDFNPYLNLNLSQQFIVVVWTRIEGCRLYYLRTHQAQLRTEMY FT KGLMDYVSNRANIDNSRIGKMVILPSSYIGSPKSIQQNYLDSMTLQVDGKP FT DIFITMTCNPNWKEIQENLLPYESTIDRPDLISRVFHEKVKILKTELLKNN FT IFGKVKGYVYVIEFQKRGLPHIHMLLWLHTNDKFRDTDQIDSIISAEIPNE FT NEYPRLYNIVKNNMIHGPCGSQNKNCVCMNEDKTRCTKNFPKPYVNQTHYN FT PDGYPSYRRRKNNNKITFSKHRIADNSFVVPYNPYLLLLLECHINVEICST FT IKAIKYLFKYFHKGPDTALVKFANEHNPGNEDNVNENNDTYKYDEISQHLS FT TRYVCAPEAIYRIWEFLLHEQSYVIKRLAVHEEDEQYVYYQEGCEHEMIDK FT NVHTTLTSWFKLNKEDKNATQYLYHEIPYHYVFDKKTKKWNKRKKFIKPLI FT SRMYFVNPNKREAFFLRLLLLHVRGAKSFADIRTVDGIIYDKYGNACISRG FT IISTDEEWNKCLAEAAVFKFPKSMCELFAYILVFQRPINSLELYDKYKCHF FT MDPKLPKIQAEQRALKNIESILLLHGLRLNDFHISSIENNYDDEEYNNELQ FT VCKDGENLETLINSLSTDQKKIYDLIINATINSIGNKYYFIDGLGGSGKSY FT LHNVMIKYLISMNVPFITMAWTGIAANLLINGKTVHSIFKLPLNINEQTTC FT NICPNSKYGXHIKNVKIIIWDEISMVSKHAFEAIDKCFRDICKNDLPFAGK FT VIVTSGDFRQTLPIVRHRNXTKIKMIKQNKNQNDLMTEIFGHKIDPNDSTL FT KNKVILSPLNSNVLKINESIVERIDGDHYIYYSDDRIQSDADDKLTNTIPT FT EFLNSLTPNGLPPHKLILKKGAIISCLRNLDIDGGLCNGTRLIIKDMKQYV FT LTAEIITGKYSGKQVLLPRIDLAPSLDEIPFGMVRRQFPIRLSFAMTVNKS FT QGQSFDKVGLYLSTPVFSHGQLYVALSRTTSKEKLKIFK*" XX SQ Sequence 5522 BP; 2159 A; 788 C; 752 G; 1795 T; 28 other; ataacacatc atatcctcaa gaattgaagg atctattatt tcaaaataat tcaaaaagta 60 ttaattttaa gaaaaatata agaggttaca ataatttatt tgcatttgcc agttttgggt 120 ctaaattagt atactttaac aatccggggc cacaagtatt aaagatatgt ggacaggtat 180 atcataattc atattcatta catccgaatg aaaacgaaga cccgaaatat ggccaattgt 240 atattattga caatgaaatt gcaaataaat atagagcaaa taactataaa attgataaac 300 ccgattcaaa catattagaa tttttatgta cttttttgtc tttaaacaat ccttatgcaa 360 aagcatataa aatgatgtat gaaatagaaa atgaagaaaa tattaatgct cataataata 420 atataacacc tcctgaaata gtaatgtcat taacacgtga taatcataca aagaataaat 480 catatttcgc aacatgtaat gaagttgctg ttgtatatat aggaaattat ggcgaacctc 540 cgtttgaccg tgatatacgt atttatccaa aaactgaaaa accacaacaa ataaatattc 600 tcagtaaaca tcttgaccca atgacatatc ctttgatatt tccttttggt catccaggat 660 ggcaacctta tataaaatgt acaaaaaaag aatataaaaa taataatata tctacactac 720 agttttattc atatagatta agcatacgtg gtgatttcaa tccttatttg aatttgtaaa 780 atttatcaca acaattcatt gttgttgttt ggacaaggat cgaaggatgt cgattatatt 840 acttacgtac tcaccaagca caattaagaa ctgaaatgta taaaggatta atggattacg 900 tttccaatcg ggcaaatatt gataattctc gaattggaaa aatggtaata ttaccatcat 960 cttatatagg tagtcctaaa tcaatacaac aaaattattt agattcaatg acttaactac 1020 aagtcgacgg aaaaccagat atttttataa caatgacatg taatccaaat tggaaggaaa 1080 ttcaagagaa tttattaccg tatgaaagca caatagatcg tcctgattta atctcaagag 1140 tcttccatga aaaagttaaa attttaaaaa ccgaactact aaaaaataat atttttggta 1200 aagttaaagg ttatgtttac gtaatagaat ttcaaaaaag gggattacca catattcata 1260 tgttattatg gttgcatacc aatgataaat ttcgagatac tgatcaaatt gattctatca 1320 taagtgctga aataccaaat gaaaatgaat atccaagatt atacaatata gtaaaaaata 1380 atatgataca cggtccctgt ggatcacaaa ataaaaattg tgtttgtatg aatgaagata 1440 aaacacgttg tactaaaaat tttcctaaac cttatgtcaa tcaaacccac tataatcctg 1500 atggttaccc ttcttaccga cgaagaaaaa ataataataa aattacgttt tcaaaacata 1560 ggattgcaga taatagtttt gtcgtaccgt ataatccata tttattatta cttttagaat 1620 gtcatattaa tgtagaaata tgtagtacga ttaaagctat aaaatatttg tttaaatatt 1680 ttcataaagg tcctgataca gctcttgtga aatttgcaaa tgaacataat ccaggtaatg 1740 aagacaatgt aaacgaaaat aacgatacat ataaatatga tgaaatatca caacatctct 1800 ccaccagata tgtatgtgca cctgaagcta tatatcgcat atgggaattt ttactacatg 1860 agcaatctta tgtaattaaa agattagctg ttcatgaaga agatgaacaa tatgtttatt 1920 atcaagaagg atgtgaacac gaaatgattg ataaaaatgt ccatactaca ttgacatctt 1980 ggtttaaatt aaataaagaa gataaaaatg caacgcaata tttatatcac gaaattccgt 2040 accattatgt ttttgataaa aaaacaaaaa aatggaataa aagaaaaaaa tttataaaac 2100 ctctcattag tagaatgtat tttgtaaatc caaataagcg tgaagcattt ttcctacgac 2160 ttttattatt acatgtacgt ggtgcaaaat catttgcgga tatacgtact gtagatggaa 2220 ttatttatga taaatatgga aatgcttgta tatcaagagg tattattagt actgatgaag 2280 aatggaataa atgtttggca gaagctgcag tattcaagtt tccaaaatcc atgtgtgaat 2340 tatttgcata cattcttgta tttcaaagac ctattaattc gttagaatta tacgataaat 2400 ataaatgtca ctttatggac ccgaaattac caaaaattca agcagaacaa agagctttga 2460 aaaacataga aagtatttta ttattacatg ggttgagatt aaatgatttc catatatcta 2520 gtattgaaaa taattacgat gatgaagaat ataataacga attacaagta tgtaaagatg 2580 gtgaaaattt agaaacatta attaattctt tgtctaccga tcaaaaaaaa atttatgatc 2640 tgatcattaa tgctacaatt aacagtatag gaaataaata ttattttatc gatggtctcg 2700 gtggaagtgg taaaagttat ttacataacg tgatgattaa atatttaatt tcaatgaatg 2760 ttccgttcat tacaatggcc tggactggta ttgctgctaa tttattaatt aatggaaaaa 2820 ctgtccatag tattttcaaa ttaccattaa atattaatga acaaacaacg tgtaatattt 2880 gtcctaattc aaaatatggt raacatatta aaaatgtaaa aattataatt tgggatgaaa 2940 tatcaatggt ttcaaaacat gcttttgaag caattgataa atgttttcgt gatatatgta 3000 aaaatgacct tccttttgca gggaaagtta ttgtaacatc tggtgacttt agacaaaccc 3060 tgcctatagt acggcataga aacaraacaa aaatcaaaat gatttaatga cagaaatttt 3120 tggacataaa attgatccta atgattcgac attaaaaaat aaagtgatat tgtcaccgtt 3180 aaattctaat gtactaaaaa taaatgaaag tatagttgaa agaatagatg gagaccacta 3240 tatatactat agtgatgatc gaatccaaag tgacgcagat gacaaattga caaacacgat 3300 acctacagaa tttttgaatt cattaacacc taatggattg ccaccacata aattaatatt 3360 aaaaaaaggt gcaataatta gttgcttaag aaatttagac attgatggtg gtctttgtaa 3420 tggtacaaga ttgataatta aagatatgaa acaatatgtt cttacagcag aaattattac 3480 tggaaaatat tccggtaaac aagtattatt gcctcgtatc gatttagctc catcactaga 3540 tgaaatacca tttggtatgg ttagaaggca atttccaata agacttagct ttgctatgac 3600 agtaaataaa tctcaaggcc aatcgtttga caaagttggt ttatatttat ccactccagt 3660 atttagtcac ggtcagttat atgttgcatt atcaagaaca acatcaaaag agaaattaaa 3720 aatttttaaa tgatgaatct gcatacaaaa ataataaaaa tattacaaat aacgaatgta 3780 aaaaacgaaa attaaacgac gaaatgaaaa tttacacaaa aaacatagtc tattctgaaa 3840 tattcagaaa ttaattatat taattataaa acattactaa aatatgaagt atcttttgat 3900 tgattgagta atcgctacta atacaacaat actaaagtat aaattatata ttgatttgct 3960 gagaactctt cactattagt acattactga aatctgcgtc atgtctcmay taattattat 4020 ctccagtaca ttactcaaat ttaaattatt ttttgattgg tcgaataatc gccactaatt 4080 atacattacc aaaatttaaa taatatattg attggctgag ttatctccac taattgtaca 4140 ttactaaaat ataaatttst attaawtatc grtkttrtgy ctaattatac taaattatat 4200 twaagtttgc caaagtntgc catagttttc cataatattt caaattrtty aaattcagaa 4260 atttagtaat ctttrgcata ccttggtata ctttggcaca ctctkataca cttnattaca 4320 tattgacaca catgggtttt tcataaattg argacttata aaattaaatc taaatatcat 4380 atgtcatatt gataattgta aataaaatgt gaattyaaca taattatatt aaaattagaa 4440 atgtaaaata ttggcaaata atcgccacag aaactatatt caaaatccaa atgttarata 4500 tacttaataa tatattgtat aartattaga taagtaatca ttaagtttat tttgtagcga 4560 tatattaaca ctattataca catctctaaa taatttgcaa tgcaggcttt attattaaaa 4620 tgttacattt aataattaat gayagtctcc accctgtgaa acgcagagag caaatttaca 4680 actaatatta atataaataa tcatttacaa tagtatcatt tccaggcttt tttacaaaaa 4740 caggcatgaa acaagatcat atctgcacaa atgattattc gcagcatccc ttttatccgg 4800 accaaatctg cagattacgt cttgtgatat gcgtgaatac atagagcagt cgtcatttca 4860 cattcacaaa cagatatctt taaaaaatca gatcatatct gcacaratga ttattcgcag 4920 catccctttt atccggacma aatctgcaga ttacgtctag tgatatgcgt aaatacatag 4980 agcagtcgtc atttctcatt cacaaacaga tatctttaaa aaaccagatc atatctgcac 5040 agatgattat tcgcagcatc tcttttatcc ggacaaaatc tgcagattac gtcttgtgat 5100 atgcgtgaat acatagagca gtcgtcattt ctcattcaca aacrgatatc tttaaaaaac 5160 cagatcatat ctgcacagat gattattcgc agcatccctt tcatctggac caaatctgca 5220 gattacgtct tgtgatatgc rtgatagagc agtygtcatt tcacattcac aaacagatat 5280 ctttaaaaaa ccagatcgta tctgcacaga tgattattcg cagcatccct ttcatctrga 5340 ccgaatctca agatcactac taatgatata tgtaaaacat acatgtatct atgtggcgca 5400 tatgatttaa aaaataattc agattataaa tacaaataaa catatttgcc atatttctct 5460 aaaaagatta tcatttacag catccgacac atctggacca atcttactgt gtttttgata 5520 ta 5522 // ID Gypsy-251_AA-I repbase; DNA; INV; 4483 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-251_AA_; KW Gypsy-251_AA-LTR; Gypsy-251_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4483 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1103-1103 (2011). XX DR [1] (Consensus) XX CC Positions [3342-3812] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 921..4286 FT /product="Gypsy-251_AA-I_1p" FT /translation="MEHNIPRSGSNRKRAFEDRINQLSAAMEDLKMSLAGE FT TDSEDDQSVEEPSDEEQQSNYMNNVLLGKSDDSKPAFVELNVNGGNLLMEC FT DTGACATICSFNTYKERFSKCSLMKDRRNFFVISGDTVSVVGKIIVRVKLQ FT ERVLTLPLLVIKSPKNFVSLLGRDWLNAIWPHWRSIFSLNSLHEAKRDQWA FT HRTVKQLKSDFASVFDDDLTEPIKNVIVDVRIDDQAKPIVHKPYTVAFKHR FT EMVSSHLDDLESKGILEKVEYAEWASPIVVVVKPNKKDIRICMDGSKTINP FT HIITHHYPLPVIDELITNKSGAKKFALIDLRGAYQQLVVSEASKKLMVINT FT HKGLYAYRRLPFGVKPAATIFQSVMDKILQGIPNVQAYIDDILIWAKTDEE FT LLASIKVVLNRLKEHNVKINAEKCKWFVSHVKYLGHILSEAGVSPNPEKVR FT AITAVPEPKSKTQLKTFLGMITFYTKFVPKLSLILSPLYDLLKKDTKWCWD FT AKCKNVFELSKTAICSAQVLTHFDPLKPITVTCDASDDGISGILSHKINNR FT EMPVFFISRRLSNAEKKYPILHREALAIVFAMEKFYKYVLGQKVSIITDHK FT PLLGIFNNKKGGPSVIATRLQRYFLRLSIFDFKLSHVSGRENQIADCLSRL FT PIDQDMSLADLEESRRSSFNCLNYLIDDQKININSKIIAEKSMEDTVLSKV FT LNYVQNGWPNSIKDRLVKHFFAKRHELDVESGCLIFGERIVIPNALKIPSL FT QLLHANHRGIQKMKQIARKYLYWDGCGTDIENYVGSCKQCQILGIDRTPRV FT YGNWPITKKPFERVHVDFFHKFNRTFLILVDAYSRWIEIRRMSKTNADNVA FT HELDSIFTIFGFATTIVSDNGPPFNSFGFKKFCEARNIEHILCPPYHPASN FT GLAERAVQTTKAVLGKLIRENDSCSSLQIDHEISKFLFHHHQTPTTEDNII FT PNERIFSFIPRSQITGIRKQKSIFGEVVQSEEKNDLKANNMVIYTYKSNGR FT AYSTEAKVVKQLSNMTYVIDIKGENRKVHRNQLKKILQKPFVLKNSNKNEQ FT EQKLHDKPSVATTVKSKKERKSSKKTTKSVSNQVLRRSARIYKSKYKNLNV FT GQLLKKKKQKNNNM" XX SQ Sequence 4483 BP; 1597 A; 740 C; 939 G; 1207 T; 0 other; attctggcga cgaaggttaa aaattgataa gaacatagtt gggtgttcaa ttcgcgagct 60 aaagcgagta ttttccgcga tttcggtggt gtcagtagta gcgtggttgt aagatgtcgt 120 ttgctgaaga aagtggcagt aaagtgatgt tggtgcaatg gagaagagaa caaaaggaag 180 atttcctgtg tatgtgccgg ggttaaatgt aagctcgtac ttgcaaactg tggatatatt 240 ttttgcgttg aataaaactc ctaatgacga gaaagcgcta gagttcatca caagtgtggg 300 ccaagaaacg gccaatcgta ttattggaag cttcaaaccg gataaaattg tgaataaaac 360 atatgcgcaa atagtagaaa agttcaaaat tttgcatgaa gagaataaaa acgtgtttgc 420 cgaacggcac cgtttaataa cacgtaagca agaagaaggc gaatcactag atgattttgc 480 tatagatttg cagaacattg tggaacattg tagtgttagt gctgaaacag aggctacact 540 agtacagtcg gtgttcgttg cgggtataag aaatgataaa acgcgagagt cgatgttacg 600 agatgcagat cacagcttaa atctagccaa acttctggaa aaggctaaga cgatcgagat 660 agcggctcag gagtcgcgaa aaatgtctaa gcaatgtgtt gaacatgtga actatgtggg 720 accgatgagc tcagcgcgca tcaagaatgt agtgaaaccg ggaaaatttg acgatagtgc 780 atggggtgga aaaacgcaga aggaaatttc aagctacaga gggaatcaaa aagtcagttc 840 agacacggtg tgctataact gctacaacaa aggacatttg tcctaccact gcacgtttcc 900 aaaggccaag aaaccgcgtt atggagcaca acataccaag gagtggttcg aataggaaac 960 gagccttcga agatcgaatt aatcagctgt ctgctgcgat ggaagatttg aagatgagct 1020 tagcaggtga aactgatagc gaagatgacc aatcagttga agaaccgtca gatgaagagc 1080 aacaatctaa ctacatgaac aacgtgttgt taggtaagtc agatgactca aaaccagctt 1140 ttgtggaatt gaatgtaaac gggggaaatt tgttgatgga gtgcgatacg ggcgcatgcg 1200 cgactatttg ctcttttaat acatataaag aaaggtttag taagtgttcg ctaatgaagg 1260 acagacgaaa cttttttgta atctccggcg atactgtgag tgtagtgggt aaaattatag 1320 ttcgagtaaa actgcaggag agagttttaa cattaccact cttggtgatc aaatctccca 1380 aaaattttgt atctctcttg ggaagagatt ggcttaatgc aatttggcca cactggagaa 1440 gtattttttc gctcaattct ctccatgagg caaaaagaga tcagtgggca caccgcacag 1500 taaagcaatt gaaatcggat tttgcatctg tgttcgatga tgatcttacc gaaccaatta 1560 agaatgtaat agtggacgtt aggatcgatg atcaggccaa accaattgtt cataagccat 1620 acactgtagc attcaaacac cgtgaaatgg tgagttcaca tttagatgat ttggagtcta 1680 aaggcattct tgagaaggta gagtatgcgg aatgggcatc gcctatagta gtcgtagtaa 1740 aaccaaataa aaaagatatt agaatttgta tggatggatc caaaacgata aacccccaca 1800 ttattacaca tcattaccct ctcccggtta tagatgagtt gattacgaat aagagcggag 1860 caaaaaaatt tgctcttatc gatttgagag gagcatacca acagctggta gtatcagaag 1920 cttctaagaa gctaatggta ataaacactc ataaaggttt gtatgcgtat cgtagattac 1980 cgttcggagt aaaaccagca gccacaattt ttcaatcggt catggataaa attctacaag 2040 gcatccccaa cgttcaggct tacatcgatg acatactgat ttgggcaaag acggacgaag 2100 aattattagc ttccatcaaa gtcgtactga ataggctgaa agagcataat gttaaaataa 2160 atgccgaaaa atgcaaatgg tttgtttccc atgtgaaata tttgggtcac attctgtcgg 2220 aggcgggagt atcgccaaat ccggagaaag tgagggcgat tacggccgtg ccagagccaa 2280 aatcaaaaac acaactcaaa acatttctcg gcatgattac attctatacc aaatttgttc 2340 ctaaattgag cttaattctt tcacctttgt acgacctgtt aaaaaaagat actaaatggt 2400 gttgggacgc aaaatgtaag aacgtttttg aactgagcaa aacagctata tgtagtgctc 2460 aagtacttac acatttcgat ccgttgaagc ccatcacagt gacatgcgat gcaagtgacg 2520 atggaatctc tggaatcctc agccacaaaa tcaacaacag ggaaatgcct gtatttttta 2580 tttctcgtcg attgtctaat gctgaaaaga aatatccaat attgcatagg gaagcattgg 2640 caattgtttt tgcaatggaa aaattttata aatacgtttt aggtcaaaaa gtatcaataa 2700 ttacggatca caaaccttta ttgggtatat tcaataataa aaagggagga ccttctgtta 2760 ttgccactag actacaaagg tatttcctac ggctatctat attcgatttc aaattgtcac 2820 acgtttcggg tagagaaaat caaatagctg attgcttgtc aagattgcca atagaccaag 2880 atatgagttt ggcggattta gaagaaagtc ggcgaagctc attcaattgt ttgaattact 2940 taattgatga tcagaaaatc aatatcaact ctaaaattat tgcggaaaag tcaatggaag 3000 acacggttct ctcaaaagtt ttaaattacg tccaaaatgg gtggccaaat agcataaaag 3060 atagattagt gaaacatttt ttcgcaaagc ggcatgaact agatgtagaa tccggatgtc 3120 ttatttttgg cgaacggata gtaatcccaa acgcacttaa aataccatct cttcaattgt 3180 tacacgctaa tcatcgtgga atccaaaaga tgaaacaaat tgctcgcaag tacttgtatt 3240 gggatgggtg tggaactgat atagaaaact atgttggatc ttgcaaacaa tgccaaatct 3300 taggaattga cagaacacca cgagtatacg gaaattggcc tataactaag aaaccttttg 3360 agcgagtaca cgtagatttt tttcataaat ttaatcgaac ttttttgata ctagtcgatg 3420 cttattcacg ttggatagaa attcgtagga tgtccaaaac aaacgcagat aacgtcgcgc 3480 atgagctcga cagcattttc accatttttg gttttgcaac tactattgta agtgataatg 3540 ggcctccgtt caatagtttc ggcttcaaaa agttttgcga agcgcgaaat attgaacata 3600 tattatgtcc tccatatcac ccagcgagca atggattagc agaaagggca gtacaaacca 3660 caaaagctgt tttggggaag ctaataagag agaacgattc ttgttcatca ttacagatcg 3720 atcatgaaat cagcaaattc ctgttccacc atcatcaaac acctacgaca gaggataata 3780 taatacccaa tgaacgtatt ttttctttta ttccacgctc tcaaataaca ggaattagaa 3840 agcaaaaatc aatattcggt gaagtagtac aatctgagga aaaaaacgat ttaaaagcta 3900 ataacatggt tatttacaca tacaaatcaa acggcagagc atatagtacg gaagcaaaag 3960 tcgtcaaaca gttatccaat atgacatatg taattgatat aaaaggggaa aatcgtaaag 4020 ttcatagaaa ccaattgaaa aaaattcttc aaaaaccatt tgttttgaaa aatagtaata 4080 aaaatgaaca agaacaaaaa ttgcatgaca aaccctcagt agcaacaact gtaaaatcaa 4140 aaaaagaacg gaaatcgagc aaaaaaacaa caaaatctgt ttcaaatcaa gtattacgta 4200 ggtcagcaag gatttacaaa tcaaaataca aaaacttaaa tgttggacag ttgttgaaga 4260 aaaagaagca aaaaaataat aatatgtaat tataagcaaa ctcttatgat tttttttttg 4320 catgtattct aattagtcta aatgaataag aaaatgaact ttataaattt taagttaaac 4380 aatgaaataa cgtgaattgc aatatattgt cataatgcaa atgctattga tgaaatgtag 4440 aaacaaacat tgtaaatagg ttatattaat ttaaaggggg aga 4483 // ID DNA-TA-3_AAe repbase; DNA; INV; 458 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-458 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1272-1272 (2011). XX DR [2] (Consensus) XX CC ~96% identical to consensus. TA TSDs. The sequence shows a CC similarity to hATm-2_AA 1440-1867. XX SQ Sequence 458 BP; 164 A; 88 C; 69 G; 137 T; 0 other; tacaggtcgg actcgattat ccggagtatc gatttttttt tcactccgga taatcgaatc 60 ctccggataa tcgaatcact aagaaaaaaa ttgaaatctt cgataaaaga acttaaatat 120 tatctttttt cgttgtttta tttatatgat gcggtggcgt agccagaaat tatttctagg 180 aacagtaggg gtctttacaa gaaaaaaatt tttttgacca gcatacacaa aaaaacactt 240 tttcaaaccc cccttttctt aaatacgttc aaaactgcaa aaccattcat attgtatatt 300 gcaataccaa attatctcag atttatgcat aaaaattgaa aaacaagaac atcaaaaaac 360 aaattccgga taatcgagtc taaaattccg gataatcgaa tcccggataa tcgagtcccc 420 ggataatcga gtctccggat aatcgagtcc gacctgta 458 // ID Gypsy-13_IS-I repbase; DNA; INV; 3733 BP. XX AC ABJB010321873; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_IS_; KW Gypsy-13_IS-LTR; Gypsy-13_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-3733 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010321873; Positions 3962 230. XX CC Positions [2434-2847] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 679..2283 FT /product="Gypsy-13_IS-I_1p" FT /translation="MPSFDDQRDDLDAYLKRFEVLAMGQEWPRGKWATALS FT LCLVGEALKVFGRMSPDDSLDYEKVKMALLQLFRFTTEGYHERFRGSSPNN FT GKTRSQYAARLEGLFDRWVEMGLCPKTYGDLRDLVIAEQFINGCHLKLAVF FT LKERSCNALPLMAAAADKFMEAHRQDNLAHFKEDQRQSNNNLKNDLPSSKQ FT ETKIKCFLCGRVGHRATDCVSRTEQRHLVCQSCQRSGHDAKACFQRKGGKT FT QLSSFMVPESGSRGCECEGSHFHNDETHGVEHTNEALVSVVCDKNEQQPKR FT RMPVEQGLLDEQPTKVLRDTGCNTVIVRTKLVGPQHLTGKQRKGVLVDGTC FT RLLPEAIVRIPSPYFTGEVVAKCMDEPLYDVIISNVQGAKEVLDLDPNGRS FT GDEDYAANILQLDKNILAALKSHVPATPNPTPSLGIEDVTKGFLEREQLKD FT TSLDICRGRIADTHVTRGGNTYSFYLSEGVLFSRCLLTNGKEFRQVVVPRT FT LRHHILSLAHDIAMAGHQGVKRTSDSLRISLLARSLSRC" XX SQ Sequence 3733 BP; 1082 A; 831 C; 955 G; 865 T; 0 other; acactgaact gcaagcctgt ctcaagagtg aacaggcaaa gccatcccaa caaccggttg 60 gctgatcacc atcggtgcgc tggaagctgg agccaaacca gaccagcttc caccaccggt 120 caaggcgcct aaacctagtg caagccagtg ccaagctcgg caagtcgaac ttgggacgta 180 cgcccttttc ttttatttgc taataagttt cctttttttt cgtggattcg gaaagtagaa 240 caaaaactgg gtggaaatat atttctttga tatcaaaaat cagacggacc tctcagtttc 300 ctttcttttt accataaaga acctgacaat aactgtccgt ccgcgacagg acactgatga 360 tcatccgtct ggtttttggt acaacattta aggtatggat ttaaaggagt tgatggccct 420 gggtgaaaag ctggggatca caggaacaga attacgaaca tgctttgatc aggaacgagc 480 tctgcaaaga gaggacagag tggccgaacg agaagctctc cgacagaaag cggaactgga 540 ggaaagaact gaaagaacgc tccagcttaa gcttaggttg attgaggccg aggggtcggc 600 aagtgacagg catgatgcgg gcggtggtga aacccccaaa gagttactgc gtcacagtat 660 ctcagtcccc ataagctaat gccgtcgttt gacgaccaaa gagatgactt agatgcgtac 720 ctgaaaagat ttgaggtttt agccatggga caggagtggc caagaggaaa atgggccact 780 gcattaagcc tctgcttggt gggtgaggcc ctaaaggtat tcggaaggat gtccccggac 840 gactccttag attatgagaa agtaaagatg gcacttttac agctgtttcg ctttacaacc 900 gaaggatatc acgaacgatt ccgaggaagc tctccaaata acggcaagac acgcagccag 960 tacgccgcga ggctagaggg attattcgat cgctgggtgg agatgggcct ttgtcccaaa 1020 acgtacggcg atctgcggga ccttgtgatc gccgaacagt tcatcaacgg ctgtcatttg 1080 aagctggctg tgttcctcaa ggagcggagt tgcaatgcac ttcccttgat ggccgcggca 1140 gcggataagt tcatggaagc gcacaggcaa gacaatctgg ctcacttcaa ggaggaccaa 1200 agacagagta ataataattt aaagaatgac ctgccaagta gcaagcaaga aacaaaaatt 1260 aagtgtttct tgtgcggcag agttggtcac agagcgaccg attgcgtgtc cagaacagaa 1320 cagcgacacc tagtatgtca aagttgtcag agatcgggac acgatgccaa agcatgcttt 1380 cagcggaagg gcggcaaaac tcagctctca agctttatgg tacctgaatc tggaagtaga 1440 ggatgcgagt gtgaaggaag tcacttccac aatgatgaaa cgcacggtgt ggagcatacc 1500 aacgaggccc tcgtgagtgt ggtgtgcgat aaaaatgagc aacaaccaaa gagacgcatg 1560 ccagtagaac aaggactact ggacgaacaa cccactaagg tgttacgaga tactggctgc 1620 aacaccgtta tcgtacgcac aaagttggtt ggtcctcagc acctgacggg caaacaacgc 1680 aaaggggtcc tagtagacgg aacatgcagg ttgttgccag aagcaatcgt tagaattccg 1740 tctccatact ttactggaga ggtagtcgca aaatgtatgg acgagccact ttatgatgta 1800 attattagca acgttcaggg agccaaggag gtattagacc tcgatccaaa cgggagaagt 1860 ggagatgagg attatgcagc aaatatcttg caactggaca agaacatact cgccgcactg 1920 aagagccatg ttcctgctac tccaaatccc acacccagcc tcgggattga agatgttacg 1980 aaaggcttcc ttgaaagaga acagttgaaa gacacgtcat tagatatctg ccgcggaaga 2040 attgccgaca ctcacgtgac aagagggggc aacacctaca gtttttattt aagcgaggga 2100 gttctcttca gcagatgcct ccttacaaac gggaaggagt tccgccaagt ggtggtcccc 2160 agaacgttgc gccaccatat cctcagtctg gctcatgata tcgcaatggc aggccaccag 2220 ggtgtcaagc gtacgagtga cagtcttcga atctctttat tggcccggag tcttagtaga 2280 tgttaagcgg tttgtgcgtt cgtgtgatct gtgccagaga acgatcatta aaggaatggt 2340 gtcgaaagca cctttggggc gcatgcccat cattgagaca cctttcgaac gagctgctat 2400 cgacctcatt ggaccgctgt cgcccacttc taagaatggg aatcggttca tctcacccta 2460 gagaattttg cgacaggata cccagaggcc gtcgccctac cgtccactga ctcaaagaca 2520 gtagcggaga ggttaatcga aactttctcc agagtgggtc taccaagtga gattatttat 2580 gagaagggta cgagcttcat ctcggagttg gtgaatgagg tgacggagct gctcccgatc 2640 aagcacttat tgtctacccc ttgccacccc atgtgcaatg gtctggtcga gaacgtttac 2700 ggtacgctca gacaaacgcc gaagaagatg tgggaagaga aacatcagtc ttgggatgaa 2760 tacttggctc cggcgctctt cgcctaccgc gaagcaccac agtcgagtat aggtttttct 2820 ccttttgagt tgatttatgg aagacacgta cgaggcccat tgttggtttt gaaagaaatc 2880 tggactgaag atacaatcag cccggagctg aagacgacat attgctacgt cttagagcta 2940 cagagtaggt tagagcgaaa aattgaatta gctcaccaga aactggacga aacacgacag 3000 agtcaaaagc gaaattatga ccgaaacgcg agaggctgac aattgaaagt aggagatcgg 3060 gccctgatct tgctgcctac cagccataat aaattgttta tgcaatggaa aggtcccttt 3120 ctcgtggtcg aaaagaaaag caaggtcgac tacaagttaa atttaggtga tattcacaag 3180 gtttttcata taaacatgtt gaagcgggca cccccggaat cgccaactct taaaagcgca 3240 gacttctcga aagagtttgt cctaagaact gatgcctccg aacacagctt gaaggcggtg 3300 cttaggcaag aacacgaggg gacactgcac catttagcgc atgccagtaa gcgactcctt 3360 acaaaagaag aacttttccg cgattgtgag agagtgcctg gcactcgtat gggcagtgga 3420 aaattttcct ctctactttt acggtgggca ttgcatggtg caaacagacc atcagccgct 3480 tcattacttg agacagacca aagcagctca acagtcgttc actccactgg agtttgttgt 3540 tgcaagaata ttcgtttcga atacaacaca tttagcacac acacaaacaa aaaaaaaaac 3600 gcaggctgaa taccttggtt tgttgtgatt tttttgtact tggccaagga acacatcatt 3660 acgaattttg gtgttttatt tatttttttt tttttttggc tgaaagacgg aaaaaattta 3720 aaagggggat cat 3733 // ID Sat6_Cis2 repbase; DNA; INV; 120 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat6_Cis_; Sat6_Cis2. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-120 RA Smit A.F.; RT "Sat6_Cis2 - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000006; The 3' end of Baggins1_Cishas turned into a common CC satellite. XX SQ Sequence 120 BP; 30 A; 38 C; 20 G; 32 T; 0 other; gtcccagtgc taccatggac ccacccatac taatgtaatg tatcatggcc tacattcctg 60 gtcccagtgc taccatggac ccacccatac taatgtaatg tatcatggcc tacattcctg 120 // ID Gypsy-42_OD-LTR repbase; DNA; INV; 363 BP. XX AC CABV01001282; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_OD_; KW Gypsy-42_OD-I; Gypsy-42_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-363 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001282; Positions 1862 1500. XX SQ Sequence 363 BP; 99 A; 84 C; 89 G; 91 T; 0 other; tgcacatggc agaaaggtga ggtcagaggc ggcgccctgc cctcaaggag gaataaacct 60 acatcggcaa cacgagagga gctcagcgtg atttcgcgtc attcttacgc tgagcgtttt 120 aactcgaggt attaactgca gatttattag agactgatct acgctagaga aggtcgatta 180 ataaatctat tcaggtttac tataaacgac ggatccagct tccacaacgc ggcagcagaa 240 atcgacgcgc cgaagtcatc ttcgttgcct tcaccgcctt tctgctccgc gtacaattgc 300 tgttataata aaccagttaa atactctaaa cggatacgag ggtgttgttg tttgtgcagg 360 gca 363 // ID Copia-15_DPu-I repbase; DNA; INV; 4476 BP. XX AC scaffold_27; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_DPu_; KW Copia-15_DPu-LTR; Copia-15_DPu-I. XX NM Copia-15_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4476 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 693-693 (2010). XX DR Genome; scaffold_27; Positions 439478 443953. XX CC 'TGATC' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 344..1495 FT /product="Copia-15_DPu-I_1p" FT /translation="MIEVSQQRILIGCTSANQMWQALCNQHLEQASDNLYD FT LQARFYQYQYKEGNDRKTHIADIKEIAHHLGEVDKAIEERELITKIVCTLP FT APFRNFVSSWRHISVDRQTMVSLTSLLLQEEREIGRWTPKGDSSQEAAFHA FT KQSASSDGGNNQSTQHALYAHPPAGPSHHNNRYHPKRGGRSNNRHQHEKDR FT QTRQHDDQHSKSTMKCDYCTMDNHNTVDCNTRKRHERTDRKREQMAKRVKR FT EHRKKAIFDAVVDVPLEDQDYSLMSITPRFETRSNGDWFVDSGATQHMTDQ FT REWLTNFIDVPDSSRTVKGIGSSNYPVRGSRDVQVWIATTDGVKKPATIKG FT VLYVPGLVTNLFSIAAATDLGWMATFKGANVYFSSADEKKT" FT CDS join(2906..3346,3350..4435) FT /product="Copia-15_DPu-I_2p" FT /translation="MNCPEYEKWKDPIKEEYDALIENDTWEIVPCPPDRKP FT IECRWVFTIKPAPNGVPPKYKARLVAKGFSQRPGIDFDETYAAVVSHDTLR FT ALFSVIASQDLEMHQSDVRNAFLQPLLEEEMFMKQPEGFILPGREQVVCRM FT KKSLYGLKAPHVWGELSTSFLKDQGFESSTAYPCLFIRLRGEERTYLAIWV FT DDAIIASSQQSAIDALLTTMDKTFKIRAHPVTRFVGIDINRDRAARKIYLS FT QEDYITKIVGAFNMESCAPKDVPADPNVRLIKPINDQSLLTNVPYREAVGS FT LLYLALMTRPDTSFAVGLVSRYSEKHDQTHWNAVRRIISYLKGTSQLGICY FT NGSTSGPELLAYSDSDHAGCLDSKKSTTASVFLLHGGPIAWKSRHQRCISK FT STTVSEYIAVSETASDAVWIRRILPDLIPGWGKLPVRIFCDNQAAILLTAH FT QHQRQSTKMVDIHYHYVREQQKKKEIVLEYMKSADQLADILTKSLPVSRFT FT DLRNRLGIVQVQH" XX SQ Sequence 4476 BP; 1312 A; 1059 C; 1014 G; 1091 T; 0 other; ggttatgggc ccagaactga cgcaatagaa tactacaagc gctatgtcga actcaaaaga 60 aacagcacat attgctacat atgatggcat aaatttctct atgtggaagt taggactatg 120 ggtccttctc gaacaacaca acttgattgg cattgtccag ggagaagaac tacttcccga 180 tatggtataa aagtctaaat tagctttgtt caccttggac accatttatc tttcaactct 240 tttctctttc aacaagcaga attttgcttg caatcttgcc aacgcagatg ccattgcgag 300 ctggaaacaa agagattgta aggcaagggg ctatatcttg tccatgattg aagtatctca 360 gcaaaggatc ctcataggct gcacatcagc aaatcagatg tggcaggccc tatgtaatca 420 acatctcgag caagcctcag acaatctcta cgatcttcag gccagatttt accagtatca 480 atacaaggag ggcaacgata ggaaaactca tatcgccgac atcaaggaaa ttgctcatca 540 cctaggtgaa gttgacaagg ctatagaaga aagagagctt attacaaaga ttgtatgcac 600 tctcccagcc ccttttcgaa attttgtgtc atcttggaga cacatatctg tggatcgaca 660 gacaatggtc tctctcacgt ctcttctttt gcaagaagag agagaaatcg gtcgatggac 720 tcctaaagga gacagcagtc aagaggcggc ttttcacgca aaacaatcgg cctcgtctga 780 tggtgggaac aaccagtcta cacaacacgc tctttatgcc catccaccgg ctggtccatc 840 acaccacaac aatcgttacc accctaaaag aggaggacga agcaacaatc gccatcagca 900 tgaaaaggat cgtcagacca gacagcacga cgatcaacac agcaaatcaa cgatgaaatg 960 tgattactgc acaatggaca atcacaatac tgttgattgc aacacaagga agcgccatga 1020 acgcacagat cgtaagagag agcagatggc taaacgagtc aagagagaac atagaaaaaa 1080 ggcaatcttt gatgcagtcg tcgatgttcc tctggaggat caggattata gcctaatgtc 1140 cataactcca cgttttgaaa caagaagcaa tggagactgg tttgtagatt ccggtgccac 1200 tcaacacatg acggatcaga gagagtggct gaccaatttc atcgacgtac cagatagttc 1260 taggactgta aagggaatag gttcctcaaa ctatccggta aggggctctc gagacgttca 1320 agtttggatc gctacgacgg acggagtcaa gaaaccagcc acaatcaaag gagtgctgta 1380 tgttccgggt ctcgtaacaa atttgttctc gattgcagct gccacggatc ttggatggat 1440 ggccaccttc aaaggcgcaa atgtttactt ctcatctgca gatgaaaaaa aaacataatg 1500 ttcggaaaac gtgtcgggcg gacactatat catctcgcca ttaatacacg ctatgaagaa 1560 gaggctgaat caactattgc ccttccctcg tctatatcac caggcatttc aacttggcat 1620 cggcgattgg ctcatgtcag ttataacacg atcgttaaaa tggcctccag cggagtagtt 1680 gatggacttg atatagcaag tgcagtaatt ccttcggaat cgtgaactgg atgcgcatat 1740 ggaaaacatc aacgactaaa gtttccagtt gggcgagttc gagcgacgta cactggccag 1800 ctgatccatt cagatctttg tggcccaatg gaaaaagcca cactcctaat ggagctcggt 1860 atttcgtctt atttatagat gagtacagcg gttgaagatt catatttttc ctcaaacaca 1920 aatcggaggc ggcttccaag ttcatggaac tcatcaatat tttacgagga gaaactggca 1980 atcttgtcca aactctgaga actgatggat gtggcgagtg gtcaagaaac gattttactg 2040 agtagctgtt gcgaaaagga atccgtcacg agtctagtgc accgcacaca cctgaacaag 2100 atggagtctc agagataggt attcgcaccg tcacagaatg ggcgcgcagc tgtctttacg 2160 attctccagt cccatctgag ccgtggggag aggaagtccg cagtggaacc accgagctca 2220 ttaaggactg tcgacttcca ctttgtctgt gggctgaggc ggccaacttc acagtttact 2280 cattgaatcg ggttctctgc aaaacgtctt ttcctgtcac tccttttgaa aaataccaca 2340 acaagcgacc taatctctcc catctgcacg tttttggatc aattgcgttt gttcatgttc 2400 caaaagctga acgacgcaag cttgaccaaa agagtctacg cttcatcttc gttggttaca 2460 gttcaactca gaaggctatc gtttttggga gcctgtgaca agagccatca agatcagtag 2520 agatgcaacc ttcgatgagc atcaccgcct cgccgatgca acaaaggaga cgtcagctca 2580 tgttcaccct cctaacaata attcaattgt tcttgaaccc aatcaacaca ctattccttt 2640 agtcgagccg acatcaccgg atttgctggt taaaggagtc gatgatctga ctacagttaa 2700 aaaaaagcag ccatctcctt ctccagaaga aatcgatcct tcaaaagacg tcacgaatcc 2760 acttgaccag agagagccag aagacgatca acatctgcca tcgcgacgct ctttgagagg 2820 tagaattcca agaagagaat ggaaggcatg gtcggccaag tttggattta gtgcgcctta 2880 tgaaccctct agttacaagg atgccatgaa ttgtcccgag tacgaaaagt ggaaggatcc 2940 aattaaggaa gagtacgacg cgttgataga gaacgacact tgggaaattg ttccttgccc 3000 acctgacaga aaaccgattg agtgccgctg ggtttttaca atcaaaccag ccccaaacgg 3060 agtaccacca aaatacaagg cgagactagt ggccaaaggt ttcagccagc ggcctggaat 3120 cgattttgac gagacatacg ctgctgtggt ctcacacgac actttaagag ctttgttttc 3180 agttattgct tcacaagatt tggaaatgca ccaatcagat gtcagaaacg cctttcttca 3240 accccttctt gaagaggaaa tgtttatgaa acagccagag ggtttcattc tacctggaag 3300 agagcaagtc gtgtgccgaa tgaagaagag cttgtatggt ctcaagtagg ccccacatgt 3360 ctggggtgaa ctctccacca gtttccttaa ggatcagggc tttgaatcca gcactgctta 3420 cccatgcctc tttattcgct tgaggggcga ggagcgcaca tacttggcta tctgggtaga 3480 cgacgccatc attgccagca gccagcagtc agcaatcgac gctctcttga ccactatgga 3540 caaaaccttc aagataaggg ctcatccagt cacccgtttc gttggcattg acatcaacag 3600 ggaccgtgcg gcacggaaaa tctatctttc gcaggaagac tacattacca agatcgtcgg 3660 ggctttcaac atggagtcat gtgctccaaa ggacgttccg gccgacccaa atgtgcgtct 3720 catcaaacca ataaacgacc aaagtttact caccaatgtt ccttacagag aagccgtggg 3780 cagtcttctt tatctggcct taatgactag gccggatact tcttttgccg tcggtttggt 3840 ctcacgatat tcagaaaaac atgaccaaac ccattggaat gccgttcgcc gtattatctc 3900 ctatttgaag ggcacctcac aacttggcat ctgttacaat ggatctacat caggtccgga 3960 actactcgct tattcagatt cagatcatgc agggtgtctt gacagcaaga agtccacaac 4020 agcaagtgta ttccttctac atggaggacc aatcgcctgg aagagtcgtc atcagaggtg 4080 tatttcaaaa tccacgacag tttcggagta catagcagtt agcgaaacgg cttcagatgc 4140 tgtatggatt cgtcgcattc ttcccgatct cattccagga tgggggaagc tgcctgttcg 4200 aatcttttgt gacaaccaag cggcgatttt acttacggcc catcaacatc agcggcagag 4260 caccaagatg gttgacattc actatcacta cgtcagagag cagcagaaga aaaaagagat 4320 agtccttgag tacatgaaat cggctgatca actggcggac attctaacca agtcacttcc 4380 tgtgtcacgc tttaccgatc tccgtaatag gctgggtatt gttcaagttc aacactaggg 4440 attaccggaa acaggaaaaa cgcatctgag agagag 4476 // ID Gypsy-223_AA-I repbase; DNA; INV; 6965 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-223_AA_; KW Gypsy-223_AA-LTR; Gypsy-223_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6965 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1049-1049 (2011). XX DR [2] (Consensus) XX CC Positions [3211-3672] - Reverse transcriptase CC Positions [4693-5169] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2824..5391 FT /product="Gypsy-223_AA-I_1p" FT /translation="MDFWRIFGIYPAIASITSLSTESIPTLESSRECLTAE FT QREILEQTKKIFKVATGESLDTTHWCEHQIVLMDEFKNSKPVRLYPYPIAP FT KIQEGLFVELERLLDRGIIEESNSDWSLNIVPIRKQSGAIRLCLDARKLNE FT RTVRDAYPLPHPGRILSRLPRARYLSTIDLSEAFLQIPLARESRKYCSFSV FT QGKGMFQFTRLPFGLVNSPATLARLMNRVLGQGALEPYVFVYLDDIVIVTE FT TFEEHVRLLEEVARRLREANLSIKLEKSHFCLEEIPFLGYILSSQGLRVNP FT EKIRPIVEYERPTTITKLRRFLGMSNYYRRFIADYSRTTAALSELLKAKTK FT TLKWTPEAEEAFQSIKEKLITSPILTSPNFDHEFMVHTDASDYAIAGVLTQ FT KVDGQERVVEYFSKKLTTPERSYHATEKEGLAALLSIEHFRGYLEGSHFVL FT VTDSSALTFIMRSKWRTSSRLSRWSLALQNYDMTIVHRRGRDNVVPDALSR FT SVMVISKKSDSEWYNSLRKQVEESPEEYPDFRIHNGLLLKYAFNDDLADHR FT FDWKIFPSPETRKEIIKSNHDHMLHIGVDKTVGKIRQRFFWPNMYRDVREY FT VRKCTECKQIKPPNSSTTPLMGDMRDDSRPWQIIALDFIGPLPRSTSGKQY FT ILVTLDLFSKWVQLHAFSSISAKALSSVLVDHWFYRNSVPAIVLTDNATTF FT TSREFRDLMHRFEVQQWLTSRYHSQANPVERVNRVINTAIRSYARSNQRTW FT DSRLSEIEVVINSTIHSSTGYTPFRVAKGEEIVLKGTDHSRFESEEESTLR FT ERAVRVKEQNPQLYALVHKQLQKAHNISSKTYNLRTRRPAKPFAVGRAGKR FT QCK" XX SQ Sequence 6965 BP; 2003 A; 1561 C; 1547 G; 1848 T; 6 other; ctggcgccca acgataaaaa aaagaaacca tcagtggttc ctaggaggaa aaaatattct 60 gtaaggaata ttttttctca ctagcgttgg gggggttttc aagaattagc atttcattaa 120 aatccactct ggtcgctcta ggagcggaaa catataggtt aatctcgatg cacattggtg 180 aagcttgtac gcagaattcg tcttttgtaa aacaacatga gtagtcaaac caattagtca 240 ctttcactac caaactcttg ttgatcggtc tctcaaacac ttttgcaccg gctagaccct 300 acaagaaaat cgttccgtct attaaaattt gactttttgc tatccaccac actatcatcc 360 atctgctgcg tccgccgtca caatgtcgcc gcgcgtttcc gtttctgggc cggacttcag 420 cgtcctgtat cgcggaatga gagtagatca tttagaatcg ggagaactgg attatgaact 480 ttttcttcgc aacgtcttta tcgcggatga cgattcgcgg tgtaggcgtc gtaggagatt 540 gaaacagatt ctcaaggatg aacgagaagg tcatgaattc attatacact atgatcaaga 600 ccccgagata gatcttgagg cgtgtgacaa cattttcaaa ggaatcgaac gtaaactcaa 660 tcaaatcact tcctcgcaag caaaaaccgt tttcaaagcm cggttattac atttaggcca 720 tagactcgcg gtcataaaaa atcattcaat cggtgagctc mgagttcaag ctgcagaagc 780 ttttcgaagt gttctgaacc tcttttctga acacttctgg tcagaagatg tatttttcgc 840 agaatccgac caagatactg atgatgaaac tgggctaatg gatgaagcca ttggcggacc 900 tgccgaatct ccggtctccg ttcaaagaac taatcaaact acaggtacga ttcccaaaca 960 atcggagcaa gtaaaatacg tgaccgatga acggttcaat caagccatga cagatattgg 1020 tagcctattt aaggctttat ccgctcaaat cagtggactt agagaagaga ttagtaaaac 1080 atcttcccct tctcmattcc caaagctcaa tagtacgaat ccgtttattg aacctccgga 1140 agtaccagag gtagctccgg ctgcgcctcg agtaaattgg gccgatttcg agagtggtac 1200 gaggccctca tcaagacctg aacaaccaca aaagcggagt gtgggttttc cagagtctac 1260 cttggtagac ggggaagcta tgaatagctc tttccgtcgg aggattgacg agttatcgtt 1320 cactacttcg gaagcaggga atgcggcaaa tcaaaccgag caaactcgtt cagccccaat 1380 cattcaatat gtaccgtacc ctgtagctca gcatcgtaaa gttacgccgg tttcccagtg 1440 gaaaataaaa agatactccg gaaacgatca agggttggga ctgaacgact ttttatcaca 1500 tgttcaacag ttggccatct ctgaacatgt ctcagccgat gagttgttcg attcggcgat 1560 tcatttattc gagggtgcag cgcttagctg gtacacctca tgtcgaaatc gaaattctct 1620 tcttgggtgg gaacatctgg ttgaagaact caagaatgaa ttcagacacc cagatctgga 1680 ttccgtacta cgcacgaaaa tttatcaaac ccggcaacaa aaaggagaat cattccagca 1740 atattttttg caggtcgaga aactctttca ggccatgaat cakactatgc cagaaagtga 1800 gaaggttgaa gttcttaaaa cgaacatgcg gtatgattgc cgtaaggctc tagtggggaa 1860 aagtatccga tccttgaagg atcttataaa catcggcaaa gagctggatg ctactgactt 1920 ctcggcattt tcaaaggttt ttggaccatc caaaagggaa acctgtgcta ttaacacagg 1980 aaacagtggg actaccggtt ctaaaacgtw ctactcgcgg agtcagtctg ctcmatcggg 2040 aaaaatgaca aactttcctc cgaaagcttc taaacctaat gctaacagcg caaaaggtct 2100 gttttccaag cccaccaacc aaaaccaaaa ggttgagccg aaatcaggtg gaaaggaaat 2160 gaaaacatct gcctattcaa aagaaccgca accaggatca agtcagccca gtgctctttt 2220 gaaaatggtt tcaaattact ccccgcctga agagggccaa tgtttcaatt gcggtgaaga 2280 gcacgatctt tcggattgta cgattccacg tagggtgttc tgtgacgcct gtgccttcaa 2340 aggctttacc cgcaaaaact gtccattctg tctaaaaaac cagatgagga agtcctaaag 2400 tcgttggaac ttccacaagc gcaagatgaa gccgagttta tcaactggtt atccgagctg 2460 ggccgttttt acgaaactgg cacggatgaa acagttgaag aaacgactcc atctcaaata 2520 cacaaaatca ctctcaatta ctttcaagac gatcggcccc acgtcaatat taaaatattt 2580 gatgtgcaag tcactgctct ccttgattgt ggaagcaatt tgacctttat aaatcagtct 2640 ttattcgaca agttccgacg aattcaaatc cgagagcctg ctgatccagt cgagctacta 2700 acagcggatg gatcacaatt gcaagttatc ggagaagtcc ttgttccgta tacattcaat 2760 ggaaaaacca gggtcttgtc gactttagtt gcacctgagt tgaccaaaga atgtatttgt 2820 ggcatggatt tttggcgaat cttcggaatc taccccgcca tcgcttccat tacctctctt 2880 tcgacggagt ccataccgac tctagaatca tccagagaat gcctgaccgc tgagcaacga 2940 gaaatcttgg aacagaccaa gaagatcttc aaagttgcta caggagaatc attggacact 3000 actcactggt gtgagcatca aatcgtactc atggacgagt tcaagaactc taagcccgta 3060 cgattgtacc catatcccat cgccccgaaa atccaagaag gtttgttcgt tgagctagaa 3120 aggttacttg accgagggat tatagaggaa tcaaattcgg actggtccct taacattgtg 3180 cctatacgca aacaatccgg ggcaattcgt ctctgtctgg acgctcgcaa actcaacgag 3240 cgaaccgtac gtgacgctta tccccttccg caccctggac gaatactcag tcgccttcca 3300 cgagcgcgtt acttatcaac tatcgacctt tcagaggcgt tcctgcagat ccctctggcg 3360 cgagaatcac gcaaatattg ttcgttcagt gtgcaaggaa aggggatgtt ccagttcacc 3420 cgcctaccat ttggtttagt caacagcccg gctacgttgg ccaggttgat gaaccgggta 3480 ttaggtcagg gtgcactgga accgtatgtc tttgtctact tagacgacat cgtcatagtg 3540 acagagacat tcgaagaaca cgtacggttg ctcgaagagg ttgcaaggcg attgcgcgag 3600 gccaacttat caattaagtt ggagaagtct cacttttgtc tggaagaaat accgtttctt 3660 ggctatatct tgtcttccca aggattgcgg gtaaatcccg agaagattag gcctatagtc 3720 gaatatgaac gaccaacgac cataactaaa ctccgtcggt ttttaggtat gtctaactac 3780 tatcggagat tcatagccga ttatagccgg actaccgctg ctctttcaga gcttcttaaa 3840 gcaaagacga agactctgaa atggactcct gaagccgagg aagctttcca gagtataaaa 3900 gagaaattga ttacctcacc aatcttgact agcccgaatt ttgatcacga gtttatggtg 3960 cacaccgatg ctagtgatta tgctatagcg ggcgttctaa cgcaaaaggt ggatgggcaa 4020 gagcgagtgg tagagtattt ctccaaaaag ctcactactc ctgaacgatc gtatcacgcc 4080 acggagaaag aaggcctagc agctcttctc tccatagagc attttcgagg atatctagag 4140 ggcagtcatt ttgtcctcgt aacggattcc tcggcactaa cgtttataat gagatcgaaa 4200 tggagaacct catcgcgtct cagtcggtgg agcttggccc ttcagaacta cgatatgacc 4260 atcgtccatc gccgaggtag agacaatgtt gtcccggacg ccctgtcgag aagcgtgatg 4320 gtcatatcca agaaatccga ttctgaatgg tacaactcac ttaggaaaca ggtcgaggaa 4380 tctccagaag aataccctga ttttcgtatt cacaatgggc tacttttaaa atacgctttc 4440 aatgatgatc tggctgatca ccgatttgat tggaaaattt ttcctagccc ggagacacgc 4500 aaggaaatca taaaatcgaa tcatgaccat atgctccata ttggagtgga taagaccgtt 4560 gggaaaatcc ggcagagatt tttctggcca aatatgtatc gagatgtccg agaatacgtc 4620 cggaaatgta ccgaatgtaa acaaatcaaa ccacctaact cttccaccac accgcttatg 4680 ggtgatatgc gggatgacag tcgtccatgg caaataatcg cccttgactt cattggaccc 4740 ctcccacgct caaccagcgg taagcaatat attcttgtca cattagatct ttttagcaaa 4800 tgggttcaac ttcatgcttt ctcaagcatt tctgccaagg ctttgagctc agtattggta 4860 gaccactggt tttaccggaa ttccgtgcca gcaattgtgc tcacggacaa tgcaacgaca 4920 ttcacatccc gtgaatttcg cgatctaatg catcgattcg aggttcaaca gtggcttact 4980 tcgcgatacc actcgcaagc taaccccgtt gagcgcgtga atcgtgtaat caacaccgcc 5040 atccgatcat acgcccgaag caatcaaagg acgtgggact ctcgactgtc tgagatcgag 5100 gttgtgatta actccacgat ccactcgtcg acaggctata ccccgtttcg cgtagcaaag 5160 ggggaagaaa tcgtactcaa aggtaccgat cattctaggt ttgagagcga agaagaatcg 5220 accctgaggg aaagggccgt acgagtaaag gaacaaaatc cccagctgta tgcactggtg 5280 cataaacaac tccaaaaagc gcataacatt tcttctaaaa cctacaattt acgaacccgt 5340 aggcccgcta aaccattcgc tgtaggcagg gctggaaagc gtcaatgtaa atagaaagta 5400 tcgtgcgact tttacataga tgcttctctc gctgtcatcg gatggcgaga caatgacaga 5460 gatttttttc aaactctctt tcattttccc ttcgctacgg cttgtgacgg aattgtgaac 5520 ttgataattt ttagttcatt gtaatgtata attgttgttc atatcgctgg aattatagaa 5580 ggatattcac tcaacttatc aaacacattt acttcccaaa attttccaaa atgagaaagt 5640 aattttaaaa gcaaaacaac attaaaattg ctctcgtgaa agcccgttat cgcgacgtga 5700 cgaaatgagc gcatgaaata tagcttcgtc tagtgacagt gaggagactc gtttcgtcag 5760 tgttgcaagt cggtcatgaa acttaacagc cctggctgta ggtgacacgg ttttcaaaag 5820 gaatactaag ttgtccaacg ctagtgagtt ttataatgcc aaactcggcg ctcaatattt 5880 accttgcacg gtgattgcgc ggcacgggtc atcgtcatac gagctgatag atagccaggg 5940 ccgcaatatc ggcgtatggc cagccaatct tttaaaatcc ggtcattagc acacgtttgc 6000 gtgtggacga tcagttcgat gaaaagggga gtaaggtttc cagtctctgg aagtaaaatc 6060 tgaaatattc aatacgatta attcaaaaat tcacgacctt ctctggctca tttggaatcc 6120 gtaccttccg tcccgttacc tctgattctc ggatataagc tttccgcggg gctaatcacg 6180 acgatatatc ggcaaaaggg aattattgaa ggataggaga atagatgagt cagcttggta 6240 gctaggggga gacgtcctcc aataacgaaa aaaaaatcaa actcgcgcgc gctggtcttg 6300 ttcctaattg aacgagagcc aaattagaac tacggccaga ttttaaactg actgatccta 6360 tagctgatcg acagcaagtc ggaaactgtt atacgtttga attcgtatcc atattcattg 6420 ttgcattgtt acatagcatc atctttattg ttacccgtat gtacttcaga gtaaaaaaac 6480 cttttgtatg tcctcgtact cagtacataa tcagactact attgttcaag tcatacattt 6540 gatagccagt tttgttgatt gtcgaccgtt cggcagtatt gtttattact tcgttcgaat 6600 cagaactgtc ccgcgtgaca gtcggttccg gttttgtgat gtagaccacg gcataatcgc 6660 gatagcacgg aaagatagtg tggaatggta gagctttgtt tagatggcca caactagaag 6720 tgcaggggtg taggagaggt acaggcagag tggacagttg aaggttttga ccatatagtc 6780 cctcgacacc agcgttgtgg tcggaaaaga tgtgagtttt gttacggtta gccagtggtt 6840 tagcctgtta tgtttaaatg ttatgtttct gtttccagag ttttgttcag ttttctttaa 6900 cattaattct tttgttaagc aaaaaaaatg tatattagtt ttttgctacc ctgaccagag 6960 gggga 6965 // ID Harbinger1_DYa repbase; DNA; INV; 2324 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version 2) XX DE Harbinger-type sequence: consensus. XX KW Harbinger; DNA transposon; Transposable Element; Harbinger1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2324 RA Jurka J.; RT "DNA transposon families from fruit fly."; RL Repbase Reports 9(5), 937-937 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(299..418,523..798,780..1943) FT /product="Harbinger1_DYa_1p" FT /translation="MSISTFDYILNKINSRLMKNWPNFVIHPILPCQKLII FT TLRFFFGSIFRKYGSRLSKNASSSSFSFLLFNSNQFLRCSHLLLFFFLVDK FT LVDHHTSDENPGEAASKVNCDDASKLPTNLRGFIWSFKKSKLVKKKQTGEM FT RPRSVMAVSLLRNSRVSRFGNRNFFPFASELVADVLPFFFKQPSEIKIVLI FT TLMSKIIRNYRFLATGASFASLAYSFKIGRTTVSVVVKETVIALWEELQPL FT HMPQPTKEIISQTADKFWNLWNFPNCAGAIDGKHIRIKCPADSGSMFYNYK FT KYFSIVLQAVADANCKFIAIEVGGYGKQSDGGTFNASQLYMMLKARQFLPP FT DSCLPGTNTKMPYVFIGDEAYPLLGNLLRPYSRRDINANNEYFNSRLSRAR FT RCIECAFGIITSKWRLLWKPIETDPSFVDIIVKSICILHNTIIDLEGPYDI FT DMETQNFHIDTETHNDETFNHVNLRTGQLIRNAFRDFALTETPNSSQLLIN FT LYLLSVSNMFSISVQFFSTKILL*" XX SQ Sequence 2324 BP; 710 A; 449 C; 437 G; 728 T; 0 other; aggctgacgg cagagtgcgc gcgagaagcg aagcgtagcg agaagctatg agcaaagcga 60 gaagatgaca gagtgagagc gagacgagaa gcgagctaca ttgctctctt ttatttataa 120 atgtgaacga gatggattcg gacgaagaat ttttcgcatc aagcgcaact ataaacttta 180 tattaaaaaa aaaaccggcg tccaccccct taacaccaac agaaataaaa aaggggaatt 240 ccacaattta tacaatgatt tgcgaaacta cccagatagg ttctttaact atacgcgtat 300 gtctatttcg acttttgatt atatattaaa taaaattaat agtcgtttga tgaagaattg 360 gccaaatttt gttatacacc ctatactgcc ttgtcaaaaa ttaataataa cattgaggta 420 agtttttaat atatttattg tgtattatag tttgtcggta tcggaaatat cgtcaaaaga 480 aaaattaaaa attgtttttt gaatttccat tcggcataat aatttttttt cggatccatt 540 ttccgcaagt atggtagtag gctgtcaaaa aatgcctcgt cgtcgtcttt ctcttttttg 600 ttgttcaata gcaaccaatt cctccgctgc agtcaccttc ttttgttttt ttttttggtg 660 gacaaattgg ttgatcatca cacttctgat gaaaatcccg gtgaagctgc ctcgaaggta 720 aattgtgatg atgcgtcaaa acttccaacc aatttgcgcg gtttcatctg gtcttttaaa 780 aaaagcaaac tggtgaaatg aggccaagaa gtgtaatggc tgtaagcctc ctccgaaatt 840 ccagggtctc cagatttggg aataggaatt tttttccatt cgcttcggaa ctggtcgcgg 900 atgttcttcc attttttttt aaacagcctt ctgaaataaa aattgtatta attacactga 960 tgtcgaaaat tatccgtaat tacagatttt tagctactgg agcatctttc gcatcattgg 1020 catattcatt taaaattggg aggactactg tttcagttgt tgtgaaagag actgtaatag 1080 ctctttggga ggagctacaa ccgctacata tgcctcagcc cacgaaggaa ataattagcc 1140 agacggcaga caaattttgg aatctgtgga actttcccaa ttgcgctgga gcaattgatg 1200 gaaaacacat ccgtattaag tgtccagcgg attctgggtc tatgttctat aattataaaa 1260 agtacttttc gatagtactt caagcggtag cagatgccaa ttgcaaattt attgccatag 1320 aagttggagg ttacggaaaa caaagtgatg gtggaacatt caatgcatca caactgtata 1380 tgatgctaaa ggctcgacaa ttcttgccgc cagattcttg cttgcctgga acaaacacaa 1440 agatgccata tgtttttatt ggtgatgagg cttatccatt actcggaaat cttttaaggc 1500 catattcaag aagggatata aatgcaaaca acgaatattt taacagtaga ctttccagag 1560 ctcgaagatg cattgaatgt gcctttggta tcataacatc caaatggcgt cttctctgga 1620 agcctattga gacagatccc agttttgttg atataattgt gaagagtata tgtattttgc 1680 ataatacaat tattgacctg gaagggcctt atgatattga tatggaaaca caaaatttcc 1740 atattgacac cgaaactcat aacgacgaaa ctttcaatca tgttaattta agaacaggcc 1800 aattaattag aaacgccttt cgcgactttg cactcacaga aacaccgaat agttcacagt 1860 tattaataaa tctatactta ctttctgttt caaatatgtt ttccatctct gtccaatttt 1920 tttccacaaa gattctatta tgatagatct tgctcttttg atcccatagc gctgggcgta 1980 aaaatatttc gtcaattaat ttttcaacgt ttatctccat ttcgttttgt tttctcacaa 2040 atgccaccaa atgaaaacat taaaaacgca tgctttgaaa gcaaacggta aataaagcac 2100 agaatgcaac aaaacaatcg tacgctctga gcgagttgaa gcgataaagc aggttgcaaa 2160 ctgctctctc gctttgctct cgctttatcg cttgagctcg ctactcgcct cgctctcact 2220 ctgacatatt cgagcgattt cacttaaaaa tgcttgcaga ctttatcgct cgttctcgcc 2280 caatattatc gcatcgcttc gcgcgcgcac tctgccgtca gcct 2324 // ID CR1-3_BF repbase; DNA; INV; 3734 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-3_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3734 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3734 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1574-1574 (2009). XX DR [2] (Consensus) XX SQ Sequence 3734 BP; 1096 A; 914 C; 789 G; 929 T; 6 other; cgccgcctcc ggtggccggg gttccctccg ttgcgagtgt cggctccaat tcaccatgtc 60 actggctgtc tgcaggtcgg tagttctcct atttattctt actaatgctg ccaaggccac 120 agttgtcggc ctcagtccag gaaaggtaag tagagcgaag tcgttatgtt tactgtcgcc 180 gggtaacacc gggccgccat cttggtaccc tgtactcagc ctaaacccaa gcgaagccac 240 tccgctgagc ggcttttatt ttggcactac tgggctagca atgaaacaag cgagatgtct 300 aaaaaccaca gataaacttt gtttgatgta tttgtgctcc attcttatgg ctcaggctgc 360 cgacctagaa actaatcccg gccctcgccc ccccaaatat ccatgtggtt gctgcgggaa 420 agcggtaaca tttaaacata agggtgtttg ctgcgatagc tgtgataact ggttccacca 480 tgactgccaa gggctgagtt cattcatgta tccttacctt ggaaattcca atgtatcatg 540 gatatgtcta aactgtggcc tgcmaaattt gtcaacatcg ttctttgact cccacactag 600 tgtagaatct cctaatccat ttagcccact gagtgacagt agtccaggtt taccgcaagc 660 ctcgtcctcc ccgaagtctc cccacgtgag acggaggact ggtggacgac ccctacggct 720 tgtaaatgca aacttccagt caatcaaaaa taaaaaagtt gaacttgaaa ctttggttga 780 agaaacaaaa ccygatatyc ttgtcatcac agaaacatgg cttgacgata catgtaatat 840 atcagaattc tttccgaacc acctcaatat gtctgtgttc tttaaaaaya gagcagatga 900 cagccacggt ggtgtgctta tygcaatatc taatgacttc atttgcactc aagaacccca 960 acttgagaca gagtgtgaaa tggtgtgggt taagattcaa cttgtaggtg caaagtcttt 1020 gtacatatgc gcatattata gaccacatgt aggtgatgcc accagycttg aacaacttgg 1080 gcaatcaatg gataaaatat gcaaccaacg aaacaaccat gtgtggatcg ttggggactt 1140 taatcttcca ggttgggact ggactgatcc ccaacaacca gtcttaaagc catcctgtgc 1200 ctacccaggg ctacacagac agtttgtaga tctgctttgt gactacaata tgtcacaaat 1260 cgttgacaaa ccaactagga atgaaaacac cctagatcta gtactcatgt ctaacgagaa 1320 ctgtgttaat tgtgtacgta cactaccgcc tattggggac cacgacttag tgtttgtcga 1380 agctgatctc cacccgtgta aacacaaagc taaatcacgc aaagtctacc tatataaacg 1440 ctctaactgg gaaaagtttc gcgacgagat gactgacttc aagtcacaat tcttagagct 1500 tgcaaatact gatacaaatg tagatgagtt gtacaataaa tttacatcta aactatcctc 1560 ctgcgtggaa aaatatgtac cctcgaaaac gactaacgac agaaaacacc ttccctacat 1620 cactcctgac ctcaaacggc tcatgaggaa acgagaccgt tattacagga agacaatagg 1680 ccaaaacacc cccgaaagaa aagatagact aaaccagttc aagaaagaaa tcaataagaa 1740 aatgaaagac tgctactgga attatatcga agcagtagta ctggatattg atgttagtga 1800 tcatgatcag ggtaagtctt ctgccaagct aggaactaag aaattctggg gctttcttaa 1860 aagtatgaag tccgaaaggt ctggtgtcga ccacattaag agcaatggaa tcttggtgtc 1920 cgaaagcaaa gaaaaagcga acctccttaa ccaacagttc cagtctgtct ttacccgtga 1980 acccacagac gatcccttac ctgacatggg gcctagccct cacccctcta tgccagacat 2040 caccatagag gaaagtggtg tacacaaact tttaaagaac ttaaacccac ataaggcatc 2100 tggtccagac caggtccatg cttatgtact caaagaactt agtgatatca taagtcctat 2160 cttacaggtc atctttcaga aaagcctaga taccggcaca atgcctgaat cctggaaaga 2220 agccaacata tcaccaatct acaaaaaagg tgacaggtat aaagttgaaa attatcgacc 2280 aatttcgctt acatctatat gctgcaaact tatggaacat gttattgcta gtgctatgat 2340 gaaccactta gacaccaatg acatcttata tgacatgcaa catggtttca gacaaggcaa 2400 gtcttgtgag acacaacttc tctcactcat ggacgaccta gcagcaaata gaaacaatgg 2460 tattcagact gacattgtac tgatggactt tgccaaggcc tttgataagg ttcctcaccg 2520 ccgactctta cacaaacttg agcactatgg tgtcaggggt aaagccttaa actggataga 2580 gactttccta actggtagga cccaacgtgt cctactagaa ggggaatgct ctgactcagt 2640 accagtgact tcgggtgttc cccagggcac tgtcctgggg cccatcctct tcttggcata 2700 tattaatgac ctgcctaacc atgctgcata tgctaaggtt cgattattcg ccgatgactg 2760 catattgcag atggcagtac aagaaccaga agattgtgat aaattgcaac atgacataga 2820 tagcatttgt gattgggaaa ggacatggct catgcaattt aatccatcca agtgtgaagt 2880 tatgtccatc ccttcatcaa gaaaacccct caatcaccct tacacacttc acggcagcat 2940 aatgactaag gtaagtaaag caaagtacct tggcttaact atctctgcaa acctttcttg 3000 gaactgccat gtcgacaaca tcactgccaa agcaaaccga acactaggac tgctacgacg 3060 gaacctgcgg attgccagtg tggcggcaaa agaacgagcc tacatggcac tggtcagacc 3120 atctgttgag tatggtgctt cggtctggga cccccacact gccgaacagg tgagtagggt 3180 ggaggctgtg cagcgtagag ctgccaggta tgtatgtaat aactaccagc gtacggcctc 3240 cgtaacctca atgctcaaag accttggatg gcagccactg tcggaacgcc gcaaaatggc 3300 caggctgacc acaatgcaca aaatcctaca caacaccatc tccgtcccac acacctcgag 3360 tctcgtccca gcagctcggt gctcccgccg cactaatcat gctttcaagc tgcagaccat 3420 cgcgagtaag aacaactatt acaggctctc gtacttccct cgtaccatca aaaagtggaa 3480 tgaacttgag cctagtgtgg cggaggcgga gtccctctcc cagttcaaaa ctgagctggg 3540 acgggcttcg ctgcactgag ctgccccccg tccattgtac atagtgtaaa taaatgtgtt 3600 gtaatatcac ttgtcctgtc tgcacatgtc catgtccttt accctttgtt ttcacatgca 3660 cttacgtcat gcgcagctgc taataatcca cattgttgtg ggataagcag tactggagat 3720 gatgatgatg atga 3734 // ID Gypsy-2_IS-I repbase; DNA; INV; 4131 BP. XX AC ABJB010439064; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_IS_; KW Gypsy-2_IS-LTR; Gypsy-2_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4131 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010439064; Positions 6387 2257. XX CC Positions [3375-3701] - Integrase core CC 'AATC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1612..3510 FT /product="Gypsy-2_IS-I_1p" FT /translation="MHELGFIRPSSSPWSSALHMAPKKTGDWRPCGDYRSL FT NKCTLPDRYPIPHIQDFGGILQGAKVFSKVDLVKAFYQIPIEPADIPKTAV FT ITPFGLYEFLRMPFGLRNAAQTFQRFVDTVLRGLSFTFAYIDDILVFSNTP FT EEHADHLRQLFSRFQEYGLVVNPTKCEFGVNELEFLGHHVDSRGIQPLPEK FT VEVIRSFPQPPTAKKLREFLGLVNFYRRFIPQCAQVCEPLYDLLPATKVKA FT TPLRWTPTANQAFERVKNIIANVTLLSHPKTDAPLSIMVDASDGAVGAVLQ FT QHDGNNWQPLSFFSKKLTPTERRYSTFGRELLAIYLAIKHFRHCVEGRSFF FT VLTDHKPLTYALASNSTAHTPREIRQMAYISEFTTDIRFVKGTDNAAADAL FT SRVGVNLCSTDRPVVPYAQLAEAQRGNLELQRLRTTSTTLVFEDIRMPDCD FT GTVACEVSTGRPRPFVPADLRRTVFDCLHSLSHPGIRATQRLIVSRFIWPR FT INVDVRSWAKTCMQCQRSKVPRHTKAPLGAFSTPDTRFAHIHVDIVGPLPP FT SQENRYLLTCVDRFTRWPEAAPMKDITATTVARTLIEMWISRFGVPQTITT FT DKGRQFEATLFRTLSHFLEFNTSIRRPTIRSPTAG" XX SQ Sequence 4131 BP; 1001 A; 1316 C; 947 G; 867 T; 0 other; tacttattta caatatttta cactgtccta ctgccccgac cactcctaca tactggtgac 60 cttggaccga gccgcccccg accacgccca gctacgctgc cctggagccc tggttctttg 120 agtgagtcca gcttcgtgcc cccgtcatga gcctcggaga cgatggcacc tctcccgaag 180 ccccgaccac actgggggcg acgactctca acatcaagtt cccagagttc tggtcgtccg 240 actccgagct gtggttcctg accgcggaat caattttccg aaagagcagg atcacctcat 300 ctttgaccaa atttgactac gtcgtcgcgg ccctcccgca gtcgaccgca tccgtggttc 360 gcgacattct gcgttctcca cccgcagatc agccctacga gactctacgg aaagagttga 420 ttcgtcgcac cacagagtct gagcaacggc gtcttcagca gctgttgact gcggaggagc 480 tgggagaccg caaacctaca gaattattgc gacgcatgga gcacctcctt ggagacaaag 540 ccacgactat ggatacatcc atatttaaag aactttttct ccaacgtcta ccaagccaag 600 tccggatgat tttgagtaca tctgccaccg actcaatctc cacgcttgct cgtatggccg 660 accaaatcat ggacgtaggc tcccctacca tctcagcggt gcgtcaagag accccctcaa 720 cgtcgtgcac tactgtgacc gacatcggat cccagctcgc tcgactcctc agcttcaacg 780 atcacataga ggccaaggtg agcgagctca gtcgtgagat tcgtaacctc caatcacgac 840 cttcccgatc cttgaccgtc aatccacgaa ctaacgctgg ccggcaccac agtgaatcaa 900 gagctgggtc cccccttcca agagtatgct ggtaccacca gagatacgga tcccaagcca 960 gagcatgccg cccaccatgt aactactcgg gaaacgacat gggcacccgc tgaaggccgc 1020 cagcgtcgaa agcccaacta gtcaggccgg cttttctacg tcacggaacg caacacgaag 1080 gtccgcttcc tcgtggatac tggggctgag gtgagcgtgg tacctcctac gctggaggac 1140 cgccggtgcc gccaagagtc cccaccgctc caggctgtta acggatcccc gatcaaagcg 1200 tacggcgaaa ggtccctgac gttggacatc ggactccgcc ggacattccg ttggattttt 1260 atcatcgcgg atgttcggca ggcgatcata ggagcagact ttctgcggcg attcgcactg 1320 accgtggacc tgaggagaaa tcaacttctt gacaccgaga cgcaattgag cgtccaagta 1380 attgtgtcga agtccccccc tcttaaactt gctcttccac gaacagaaac gaattctcca 1440 tggcaaacca tcgccgacga atttcctgcc gtgtttcgcc cacctgctcc cgatcaagcc 1500 atccgacact ccgtcaccca tcacatcgtg acgaaagggg ccccggtttt cgccagacca 1560 cgtcgactcg cccccgaccg cctcaagata gccaagaacg aattccaaca catgcatgaa 1620 cttggtttca tcaggccatc ttcgagtccc tggtcttcgg cacttcacat ggccccgaag 1680 aaaaccgggg actggcgccc ctgtggcgac taccggtcat taaacaagtg tactctgccc 1740 gaccgttacc ccatccctca tattcaggac ttcggtggca tcctgcaggg cgcgaaggtt 1800 ttcagcaagg tggacctcgt gaaggcattc taccagatcc ctatagaacc ggcagacatc 1860 cccaagacgg ctgtcatcac gccgtttgga ttgtacgaat tcttgcgaat gccctttgga 1920 ctgcgcaacg ctgcgcaaac ttttcaacga ttcgtcgaca cagtcctccg agggttgtca 1980 ttcaccttcg cgtatatcga cgatatcctt gtattcagca acactccgga agagcacgcc 2040 gaccatctac gacaactctt ctcgcgcttc caagaatacg gacttgtcgt caatccgacc 2100 aagtgtgaat ttggcgtcaa cgaacttgaa tttcttggcc accatgttga tagccgtgga 2160 attcaaccat tacccgagaa agtcgaagtc atccgaagct ttccacaacc cccgacggcg 2220 aagaaacttc gggaatttct cggactcgtg aatttctatc gacggtttat accacagtgc 2280 gcccaagttt gcgagcccct ctacgaccta ctgccggcca ccaaagtaaa ggcaacgcca 2340 ctccgttgga cgccaaccgc caaccaagct ttcgagagag taaaaaatat cattgccaac 2400 gtcacccttc tcagccaccc caaaaccgac gcacccctct ccatcatggt cgacgcctcg 2460 gatggagctg ttggagcggt tttacaacaa cacgacggaa acaactggca accactttcc 2520 ttcttttcga agaagctgac cccgacagaa cggcgataca gcacctttgg acgagagctg 2580 ctggcaatat accttgccat caagcacttt cgccactgcg tggaaggtcg aagcttcttc 2640 gtcttgacag accataagcc gttgacctac gcgctggcct ccaacagcac ggcacacact 2700 ccgcgtgaga tacggcagat ggcatatatt tcggaattca ccaccgacat ccgcttcgtt 2760 aagggcacgg ataacgccgc agctgacgca ctctccagag ttggggtcaa cctctgctcc 2820 acggaccgac ctgtagttcc ttacgcacag ctggcggaag cacaacgagg caacctcgaa 2880 ctacaacgcc tacggacaac gtcaacgaca ctagtctttg aagacatacg catgcccgac 2940 tgtgacggca cagttgcgtg cgaagtttcg acaggtcgtc cccgcccctt cgtaccagca 3000 gacttgcggc gaacagtttt tgactgcttg cactccttgt cccatcccgg gatacgagca 3060 acccagagac tgatagtttc gcggtttata tggcctcgta tcaacgttga cgtgaggtcc 3120 tgggccaaaa cctgcatgca gtgccaacgc tcgaaagttc cccggcacac caaggcgccc 3180 ttgggagctt ttagcacacc ggacactcgc ttcgcccaca ttcacgttga tatcgtggga 3240 ccacttcctc catcacaaga aaaccgttat cttctaacct gcgtcgatcg cttcacgcgg 3300 tggccggaag cggcaccaat gaaagacatc accgcaacaa ccgtagctcg cacactgatt 3360 gaaatgtgga ttagtcgctt cggtgtccca cagaccatca caacggacaa aggacgtcag 3420 ttcgaagcga ctttatttcg aactctctcc cacttcctgg agttcaacac atccatacga 3480 cgtcctacca tccgatctcc aacggcaggg tagagcgttt tcatcgtcag ttgaaagcct 3540 ccatcaaagc gcaaaccaac agctcggcat ggacagatgc tcttccactt gtcttgttgg 3600 gaatcagaac ggccttcaaa tcggacatca attgcagtgc agctgaactc gtttacggca 3660 cgtcactccg cttgcctggc gaattttttt cttcctcagc caacatttca caggaaacag 3720 cagaaaacta cgtccagtgg ctgcgactac taatgaaaga cctccagccc acctgcccaa 3780 ggacgccctc cacaagacac acttttgtta gccgcgacct cgacacaagt aagcacgtgt 3840 tcgttcgaca cgatggcgta aagaagccct tgcagccccc atatgacggc ccttatctta 3900 tactgcgtcg agacaagaag aacatgacgc tcgacgtcaa cggtcgagaa gaagtagtgt 3960 ctctagacag agtcaagcct gctcacttgg cacccgattt ttactcatcc gtgaccacac 4020 gccttccaaa gccacgtcac gtaaggctcc acataaacac cagcctgcac acagaccctc 4080 tcgaagccag gaacgctcgc cagaccgctt cctcgctagg gggggagtac c 4131 // ID Gypsy-239_AA-I repbase; DNA; INV; 4100 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-239_AA_; KW Gypsy-239_AA-LTR; Gypsy-239_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4100 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1079-1079 (2011). XX DR [1] (Consensus) XX CC Positions [3190-3669] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1237..4092 FT /product="Gypsy-239_AA-I_1p" FT /translation="MKTIIEKSKINILKCPIMNGETNGAENKVKNINLVCD FT FKTIKNVGDIGKDHLSTLNNVLEMYYFNFNRPICPEINYEMKIMLKSGAPF FT HFKPRRLSQNDKIKVDDKVNELLKLGIIKESDSPFASPIVVIPKKDNDIRL FT CVDYRKLNKDTYRDNYPLPLIEDIYDKLKGKSVYTILDLKSGFHQIKISSE FT CTKYTSFVTPTGQYEYLRVPFGLCNAPAVFQRFVNKIFKSLIDEGKVVVYI FT DDILIASETLKEHFETLKQVLTILSRNLLELNLDKCKFCFNEIDYLGYSIN FT EYGRKPSKSHIEAISNFPLPKNSKDVQRFLGLTSYFRKFIKNFASIANPLY FT NLLKKDVEFVFGDKEIHAFELLRNKLITAPVLAIYDPKAETQLHCDASSFG FT FGAILLQKQNNGNYHPVSFFSKRTDEYEKKLHSFELETLAVVHAVKRFHVY FT LSGLKFKIFTDCNALAQTLAKKEINPKISRWAMFLENYNYTLEYRDGVKMQ FT HVDALSRFSMGEIEKKSDNRTLNENKEQNICLLDIEDIERNVILAQEQDEK FT IRNIKTHLESNTFPNFELRNGVLFRKDSEKFLLVVPKTMIDNIIRICHDKL FT GHIGIEKTIIEIKKFYWFSSMKKIVKKYINNCLSCIFYSPSDTKKEGFLKN FT IDKGQNPFQTIHMDHYGPIQLQSSKNFKYILVIVDAFTKYVKFYPTKTTNT FT KEVIDNLIIYMNHYSKPSRIITDRGSCFTSEQFKNFCMNQCIHHIKTASYT FT PEANGQAERINRTLTPMLAKLLDDTNLKWDNLLPNLEFIYNNTYNRSIKNF FT PSILLFGTFQNNLNNETNNIEIFVKNNQLSSRNYNLSEIRAKAKDNILKLQ FT EYNKEMVDRKRKPVQNYKEGDFVVLKTNVDNKLCKKFKGPYIIKKMLPNDR FT YLVTDIDGFQVSSLPFSSICSPNNMKKWLSDSSVLSNYDVDEDVDESG" XX SQ Sequence 4100 BP; 1517 A; 613 C; 724 G; 1245 T; 1 other; cgtattttac taaaatatca gaagtgggat atgaacacgt tcctgcggag tcattcacgg 60 tcggaagttg aagttatgga gaatctcgga agttttcttc cttcccaaga cgtcaataac 120 cagtccgata cgctgaacgc tgaagtagca aacactaata ccggaggaag tggagatgcc 180 cttactgtgg atcaaagatg ttgtactaca aaacgatcga taaagctaga cgaactgtcc 240 gcctttttgc cgttcttctc gggagcacca gacgaagacg ttgaattgtt catcaataca 300 gtggagaaga ctcgaaaaac gttcaatgta gaagaagatg taatgaaact gatgattttc 360 aagcaccttc gaagtaatgc gaaaatatgg ctatcgtcac aaccggatgt atttgaaatg 420 aattatcaag gtgtaatcaa tctaatcaag acaacgttca ttgtgtcctt cagcctgttt 480 gaaacaagaa agcaattaga atttcgaaag tggagtcctg aagaaacgtt tttggaatac 540 tttgttgcaa aaagaaacct tgctgtgccg ctgaaattac ctgaaggaga attgatcgaa 600 tatgccattg aaggaatccc cgatcgtcat ttgaaaaatc aagcaaaata gggattttcg 660 actgttgcaa gccttctcaa agcctttcaa gcgattcatc taccaaagca aaaattttca 720 tacgcaagcc cccctgtttg ttacaactgt tatttacctg gacatattgc cacacattgt 780 cggaaaccga gcagccatca atcaagaact tgtggtcgag atgctttccg taggaatcca 840 gttgagagaa aacgttttgg aaatgagaaa ataccaagga tcgcagctgt tcaaaatgat 900 ccagactttg ataacaagac ggttgacctc gaacatgacg gactggtaga tattcatttg 960 acagatttta aaattaattt gaaagcgtta tttgatacag gaagtccagt ttgtctgata 1020 cgccttagtt tagcaaaaag aagaaaaatt tatccttata atagcaaaaa cacattcaag 1080 ggtatcggag gatctggact aaagattgtg ggtagattag taactaatgt aaaaattcaa 1140 aatttaactt tcagtattga gtttttggtg gttcctgatt cggatatact ttcatatgac 1200 gttataattg ggcgagatgt tattcttaat cctggaatga aaactatcat agaaaagagc 1260 aaaattaata tacttaaatg tccaatmatg aatggcgaaa caaatggagc tgaaaataag 1320 gtaaaaaaca taaacttggt ctgtgatttt aaaactatca aaaatgtagg tgatattggt 1380 aaagatcatt tgtcaacgtt gaataatgtt ttggaaatgt actattttaa cttcaatcgt 1440 cctatctgtc ctgaaattaa ttacgagatg aaaatcatgc tgaaatctgg tgcaccattt 1500 cattttaaac caaggcgact ttctcaaaat gataaaatta aagtagatga taaagttaac 1560 gaattgctca aattgggaat aataaaagaa agcgattctc cttttgcaag tcccattgta 1620 gttattccga aaaaagacaa tgatataaga ttatgcgtag attaccgaaa attaaataaa 1680 gatacatatc gtgataatta tcctctacca ttaatagaag acatttatga taagcttaaa 1740 ggaaagtcag tgtatacaat tttagatttg aaatctggtt ttcatcaaat caaaatttct 1800 tctgaatgta ccaaatatac ttcttttgtt acgccaacag gtcaatatga atatttacgg 1860 gttccttttg gcctttgtaa tgcaccagct gtgtttcaaa gatttgttaa caaaattttt 1920 aaatcgctta ttgatgaagg aaaggtagtt gtttacattg acgatatttt aattgcttca 1980 gaaacattaa aagaacattt tgaaacatta aaacaagttt taacaatttt gagtagaaat 2040 ttgctggaat taaatttaga taaatgtaag ttttgtttca atgaaattga ttatttaggt 2100 tattcgatta atgaatatgg tagaaaacca agcaagtctc atattgaagc tatatctaat 2160 tttccactcc cgaaaaattc aaaagacgtt caaagatttt taggtttaac tagttacttt 2220 agaaaattta ttaaaaattt tgcatcaatt gctaaccctt tgtataactt attaaagaag 2280 gatgttgaat ttgtttttgg tgataaggaa attcatgctt ttgaactttt gagaaataaa 2340 ttgataacag ctccagtttt ggcaatctat gatcccaaag ctgaaacaca attacattgt 2400 gatgcaagct ctttcggttt tggtgcaatt ttattacaaa aacaaaataa tgggaattat 2460 catcccgtta gtttctttag taagagaact gatgagtatg aaaaaaagtt acatagcttt 2520 gagctagaaa cacttgctgt tgtgcatgct gtgaaacgat ttcatgtata tttgtccgga 2580 ttaaaattca aaatttttac tgattgtaat gctttggcac aaacattggc taaaaaagaa 2640 atcaatccta aaattagccg atgggcaatg ttcttggaaa attacaatta taccttagaa 2700 tacagagatg gagttaaaat gcagcatgta gacgctttaa gtagattttc gatgggtgag 2760 attgaaaaga aatcggataa ccgtacgtta aatgaaaata aagagcaaaa catttgtctg 2820 ttagatattg aggatattga aagaaatgtg attctggcac aagaacagga tgaaaagatt 2880 agaaacataa aaacacattt agaatcaaat acctttccga actttgaact gagaaatgga 2940 gtgttgttca gaaaagatag cgaaaagttt ttacttgtag taccaaagac aatgatagat 3000 aacataatac gaatttgcca tgataaattg ggccacatag gtattgagaa aacaataatt 3060 gaaatcaaaa aattttactg gttttcaagt atgaaaaaga tagtaaagaa atacataaat 3120 aattgtttga gttgtatatt ttacagtcca tctgacacta aaaaggaagg gttcttaaaa 3180 aatattgata agggtcaaaa tccgtttcaa acaatacata tggatcatta tggtccaatc 3240 cagctccagt catcaaaaaa tttcaaatat attcttgtaa ttgtagatgc ttttactaaa 3300 tatgttaagt tttatccaac caaaacaaca aacaccaaag aagtaataga taacttgata 3360 atatacatga accactacag taaaccatca agaattatta cagatagagg aagttgcttt 3420 acatcagaac aatttaaaaa cttttgcatg aatcaatgta ttcatcatat caaaacagcc 3480 tcatacacgc ccgaagctaa tggtcaggct gaaagaataa acagaaccct tactccaatg 3540 ttagcaaaat tattagatga tacaaatctt aaatgggata atttacttcc gaatttagaa 3600 tttatttata acaatacata caatagatca atcaaaaatt tccctagcat attattattt 3660 ggtacattcc agaacaattt gaataatgaa accaacaata tagaaatatt tgtaaagaac 3720 aatcagcttt cgtcgcgaaa ttacaatttg tcagaaattc gagcaaaagc aaaagataac 3780 atattaaaac ttcaggaata taataaggaa atggtggaca gaaaacgaaa gcccgttcaa 3840 aattataaag aaggtgattt tgttgtctta aaaactaatg ttgataataa actttgtaaa 3900 aagtttaaag gtccctatat tatcaaaaag atgttgccga atgatcgata ccttgttaca 3960 gatattgatg gttttcaagt atcaagtttg ccattcagct caatttgttc accgaataac 4020 atgaaaaaat ggctctcaga ttccagtgta ctctctaatt atgatgtcga tgaggacgtc 4080 gacgagtcag gatgaccgaa 4100 // ID Kiri-24_AAe repbase; DNA; INV; 3027 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-24_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3027 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 719-719 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 74..454 FT /product="Kiri-24_AAe_1p" FT /translation="MANNLIVNTSNNWNTQGIVMKSAFVPNKLSVCCMNSQ FT SICARKMSKLEELRKIANVSNVDVICISESWLNERISDDVIKIENYSVIRN FT DRVGRLGGGIIVYVKKQFNFKILDKXXLXLVKPNIFFSN" FT CDS 577..2898 FT /product="Kiri-24_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MGDFNTNILNPRCDKSKKFVSFMENFSFSSLGTIPTH FT FHRTGSSQIDLMITNKRESVLRFNQVDVPCMSNHDLLFASLDVDVLLSSAD FT PVSYRDYSRFNPSTAISIFNQISWDEFYQSNNPEILLHFFNEHVKTMFDTT FT IPVKTNVINRKCNPWFNAEVSRAIVDRDLAFRQWKTTRNDSDFEHFKRLRN FT YATSSILNAKQRYWNSKIAGSGSVKNMWKNLKSLNITRKNDGFLMNCSPDD FT INKHFSQSFXSPTRSFYTQNVNSSETFFRFNEIQDYHIINAIYEVKSNAVG FT LDDIPIKFIKMILPLIMKPLKHIFNSIIKSGXYPKIWKYSKIVPIKKKSTE FT NSLENLRPISILSAISKAFERLLKNQITCYLSGNNLLSQYQSGYRKGHSIK FT TAMLKVCDDIGLVLDKGGSVVLLLLDFSKAFDTISHVKLCNKLEQNFGFSY FT DAVRLIHTYLTDRNQTVFVNDCFSNYLPILSGVPQGSVLGPLLFSIYINDL FT PLRLNGCQIHLFADDVQIYVNVSGMTSDAAAYLINDNLSRISSWASENSLM FT LNARKTQACYITRQSRSHDKPTLSLNGIPISYADKVVSLGVTIQSNFEWDS FT FILKQCGKIYSVLRTLRYNTDFLKTDTKLKLFKSLMYPHFLFCDFVTTQLS FT ANILHKLKVALNCAVRFVYNLNRYAPVTHLHKSLIGCSFNNFPLVRSVFLL FT HTIVTKKCPAYLHRKLIRLRSPRGIKFSMPRSITSYYASSLLVRSIYNWNS FT LPAEFRSIVSSSHFKKSVLQQVN" XX SQ Sequence 3027 BP; 984 A; 517 C; 498 G; 1016 T; 12 other; attaaaactc gatwttmaag tgcgtaggct tktaaatgtt ttcttgactt gtaattgctt 60 ttcgttaact attatggcta ataatttgat tgtcaatact tctaataatt ggaatacaca 120 aggtattgtt atgaagtctg cctttgtacc taacaaatta tcggtttgtt gtatgaacag 180 ccaaagtatt tgtgcaagga aaatgtcaaa acttgaagaa ctgcgtaaaa tagccaatgt 240 ttctaatgtg gatgttatat gcatttctga atcttggcta aatgagagaa tatccgatga 300 tgtcatcaaa attgaaaatt attcagttat cagaaatgat cgcgtaggac gcctgggtgg 360 cggcataatt gtgtatgtta aaaaacagtt taatttcaag attctcgata aatmwasstt 420 gwttctggtg aaaccgaata tatttttctc gaactagtag ttaacaatca aaaacttctt 480 ttaggtttct tttacaatcc tcctgagttt gattgttcgc aactgctagc ccaaaaattt 540 tctaattttg aattccaatt cgaacatgtg ttgttaatgg gtgatttcaa tacgaacata 600 ctcaatcctc gatgtgacaa aagtaaaaaa tttgtttcat tcatggaaaa tttttcattt 660 tcatctcttg gaactatacc aacacacttt cataggacag gctcttctca aattgatctt 720 atgataacta ataaacgaga atcagttcta cgatttaacc aagttgatgt accatgcatg 780 tcgaaccatg atttactgtt cgcatcttta gatgtggatg ttctgctatc cagtgccgat 840 ccagtgagtt acagagacta cagcaggttc aatccttcca ctgctatatc tatatttaat 900 caaataagct gggatgaatt ttatcaatca aataatccag agattttatt gcattttttc 960 aatgagcacg tgaaaacaat gtttgatacc actatacccg tgaaaacaaa tgtgataaac 1020 cgtaaatgca atccctggtt caatgctgag gtatctagag caatagttga tcgtgactta 1080 gcttttcggc aatggaaaac aacaagaaat gacagcgatt ttgaacattt taaaaggcta 1140 cgaaattacg ctacttcatc aattctgaat gctaaacagc gttactggaa ttctaaaatt 1200 gctggctctg gcagtgttaa aaatatgtgg aagaacctaa aatctttgaa cattacaagg 1260 aaaaacgacg gttttctaat gaactgcagc cctgatgata ttaacaaaca tttttcccag 1320 agttttwgtt cacctacacg atcgttttat acccaaaatg ttaactcgag tgagacattt 1380 ttccgattca acgaaattca agactatcat attataaacg caatttatga agtgaagtcg 1440 aatgctgtag gtcttgatga tatacccata aaatttataa aaatgatact tccgctaatt 1500 atgaagccat taaagcatat tttcaatagt attatcaaaa gtggtwtmta tcctaaaatt 1560 tggaagtatt ctaaaatagt accmattaag aaaaaatcca cggaaaactc actggagaac 1620 cttagaccta tcagcatttt aagtgcaatt tcaaaggctt ttgaacggtt gttaaaaaat 1680 caaattactt gttatctatc tggtaacaat cttttatcgc aataccaatc aggttaccgc 1740 aaaggacata gcatcaaaac ggcaatgtta aaagtatgcg atgatatcgg tcttgtttta 1800 gataaaggag gatcagtagt tctcttgttg ctcgacttct ccaaagcctt cgacaccatt 1860 tctcatgtta aactttgtaa taagcttgaa cagaattttg gattttcata tgatgccgtt 1920 agacttattc atacatatct tacagatagg aatcagacag tttttgttaa tgattgtttt 1980 tctaattacc ttccaatttt atcaggagtt cctcaaggat ccgtgttagg tcctttatta 2040 ttttccattt atattaatga tctaccgttg aggcttaacg gatgccaaat tcacttattt 2100 gcagacgacg tccaaatata tgttaatgtg tctggaatga ctagtgatgc tgctgcatat 2160 ttgatcaatg ataatttatc aagaatatca tcatgggctt cagaaaattc gttgatgtta 2220 aatgctcgaa aaacacaagc gtgttacata acacgccaga gccgtagtca tgacaaaccg 2280 accttaagtt taaacggtat accaatcagt tacgctgaca aagttgttag tttgggcgta 2340 actattcaaa gtaactttga atgggacagt ttcattttga aacaatgtgg aaagatttat 2400 tcggttttac gcacattacg atataataca gattttctaa agactgatac aaaactaaaa 2460 ttgttcaaga gcttgatgta tccacatttc ttattttgtg attttgtcac cacccagcta 2520 tctgcaaaca tcctgcataa actaaaggta gctctcaact gcgcagtaag atttgtatat 2580 aacttaaaca gatatgctcc tgttactcac ctacacaaat cgttgattgg ttgttcattt 2640 aacaattttc ccttagttag atccgtgttt ttactgcaca caattgtaac caaaaaatgt 2700 cctgcatact tgcatcgtaa acttattcga ttaagaagcc ctagaggtat taaattttca 2760 atgccgcgat ctataacttc atattatgct agttcccttc ttgttagaag tatatacaat 2820 tggaattctt tgccggctga atttcggtca attgtttcct catcgcattt caagaaatca 2880 gttttacagc aagtcaatta aaaattacat ttgatctgtt tctaatagtt tgattttttt 2940 ttatggcgca cttcatgtta ccttagacaa actgaaacat aaaagacatg gtcttaagtt 3000 tatggatgag taaaataaat aaataaa 3027 // ID Gypsy-190_AA-LTR repbase; DNA; INV; 1223 BP. XX AC supercont1.100; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-190_AA_; KW Gypsy-190_AA-I; Gypsy-190_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1223 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.100; Positions 447517 446295. XX SQ Sequence 1223 BP; 374 A; 255 C; 260 G; 334 T; 0 other; tgtagcgagc caattagagc tcagggaaaa aaaataataa tttgaccaat cagagttcaa 60 ttccctctaa aaaccaacag ccaatcaaat ttgtttggat gagagagata gcaaattaga 120 agcccccgcg caactatata agttggggca aacacgcgcc catcatcagt tttaatttat 180 cagttggacc gaaaagtaaa aagcagagtg agctctcctc tcctatccca accacagcct 240 gcaaacctag gagcaaagcc ccatcgagga agccagatag tcgtacctac gacggcgtga 300 agcacgaggt agccacgatc cagtccgtgc tagcacctaa caagctgttt agcttctgtc 360 ccgacctttg ttcaaaggtg cgcgggtttc cgaaacatca aaaaagggct ttggctgctc 420 cagaggtttt gcacggtaaa ataaaccgca attttcattg cgattcccca atctgcgttt 480 ctaaatgcaa cgaaacaaat atttaaagag ttgttttccc ttccctagtg aaactccttt 540 gcaactgtgt ccaacaagtt tcaattgtac ctaaattgtg tgtgattagc ggtaaaaccg 600 atgtaagcag tgatcatgta gttaattgcc aatcatgtaa gtgatcaagt gctcatcctc 660 tcaacagcgc catccacggt ggtcggtttt ttttggaggc gcaccctagc cgccttcagt 720 tgaggttctc agccgtgatc caccagcatg atccagacct cttcggggga tctctgtggc 780 tgaagccgaa gaaaatgtgt gcacacaaac cagaaaactg atgaacttta gttttgcgcg 840 tgagttagta tgtaagcaga aaactagggt aagtcaaaaa tgaaaaaaaa aaaaatctct 900 agtacggccg ttatgaagaa gtgtagaagt aaaattttca aattgatttg attggtttta 960 ttagtaaatt gtattgaaat gattagtagt gttttttaat attgaatttt gattgatttt 1020 cctattgaaa ttctgtgcgt tcccacgact tgccgcagag ctgaaaccgg gcaacctttt 1080 gaaaaagtgg agttttgcat caggtcacgg cctgattggc gcttaagcaa aactttaagc 1140 aacggaatca atcagcattc gcgtcgaaat tagtttattt aaaatttgag tttttgtgtg 1200 atccaccaag cgaaaacgtt aca 1223 // ID Gypsy22-I_Dpse repbase; DNA; INV; 8650 BP. XX AC Unknown_group_236; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy22_Dpse; KW Gypsy22-LTR_Dpse; Gypsy22-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-8650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1122-1122 (2009). XX DR Genome; Unknown_group_236; Positions 7745 16394. XX CC Positions [130-636] - Reverse transcriptase CC Positions [1699-2175] - Integrase core CC LTRs are 85% similar to each other. XX FH Key Location/Qualifiers FT CDS 1..2544 FT /product="Gypsy22-I_Dpse_1p" FT /translation="MVPFPQTKWEMKLNFEHEKPFHCPPRRLSYSEKAQVQ FT KMIDDYTEKGYIRTSESEYVTPIVLVKKNSGELRLCVDYRTLNKGMIKDNY FT PTPLIDDLLDKLSGKRLFTKLDLKNGYFHVFMHKDSVKYTSFTTPMGQYEW FT LRMPFGLRNASAVFQRFTNKIFAEMVKEDKVIIYMDDIMVATKDSDSHMEI FT LQEVMRRLVENKLELRIDKCEFLQSEVKYLGYSISGNGIKPDTKGLQAVKG FT FPVPTKTNEVQSFLGLCSYFRRFVNDFSTKAKPLYDLVKKDRKFEFGIVEL FT ECFEQLKSNLLEAPILALYNPRDPTELHCDASSLGFGSILMQKKGDGRMHP FT VFYFSKRTTITEAKYHSFELETLAIIYALQRFRVYVQGIPFKIVTDCNALT FT MTLNKKELNPRIARWALELQNYDYKLEHRSGSRMQHVDALSRAVQILTVDT FT NTFEENLIICQNRDNKLMELRETLQKSEHKHYEMRNGIIYRKKDNNTILFC FT VPEEMEGHVLYKYLDELGHVGIDKMTDAISKSYWISNVRYKAKRHIENCLK FT CIAYSAKHGKEEGFLHNIPKGSGPFEVVHIDHFGPVDKGRANKHVLVVIDA FT FTKFVRLYTTKTTSTKEAVIAMKDYFRAYSRPRCIVSDRGSCFTSKEFEEF FT LEQSNVKHVKIATGSPQANGQVERVNRSLGPMIAKLVDPEKGLHWDMVVET FT VEHAMNNTVQRTINEHPSRMLFGVVQKGKVSDLLKDHLDELTEQRATRDIE FT KIRAVAGTHQEKIQSYNKQAADSKRREPHVYVDNDLVMVRNFDTHTGVSKK FT LIPKFKGPYKISKVLKNDRYLLEDVEGFQQSRIPYKGKKMKNDCNT" FT CDS join(3044..4699,4703..6742) FT /product="Gypsy22-I_Dpse_2p" FT /translation="MSDLVGKEEFSAAQLREWLESLNLPKGGSKAAMAARL FT NEIPVELRGQGPPAAETCENEREDEAAADQDSTKSERKEKNEKAPQAPGHN FT NNHGEYRAEVEMLKLQIELLKLQSEREKEGRGENTTPAANNDAGVMLLNAA FT KDMLPTYHGNISGNNDDVTTWIAQFTAVAKVNKLKDEKLLMLLMSKLKDKA FT LVWLHSSPEHMSLPIDQLLNVMEDTFHPKESKLLLRRKFESRSWARGEEFS FT MYFNAKVSLASRIVIDDEEFIDGVIEGIPDVGLRRQAHMQCFGAPYQLLKA FT FEKIMLPKKYGSTEGANTGASPTPIRCYNCNSLGHVAGECRKPKRERGACY FT GCGSMSHQVSHCDEKKYKVHNDYIYKFNIYISNYNNKCLSLDCLIDSGSPI FT SFMKLSCLEHQSENNLNKVEERSRLSSNKIKEDDLSLYTFNKDKKNREIEF FT ILTKNADYAYVGLNNSKLNIFGKIPSFVIINKNRINFELLVVADESMGYEA FT VLGRDFMNLCKFKIVSDICKEKESETKNECLEKNECLGENECLEKNECLGE FT NECLGNECLEKNEWKIENERLGEYECEVKDEGDKENKHEVESEGKVENECS FT AMVECLEDSDTCESVKTNFNELPSQYPDKENDWELKVGTDCSQELEGRLKY FT MFRTAYINAERPNLPQTKWEMKLNFEHEKPFHCPPRRLSYSEKAQLQKMID FT DYTEKGYIRTSESEYVSPIVLVKKNSGELRLCVDYRTLNKGMIKDNYPTPL FT IDDLLDKLSGKRLFTKLDLKNGYFHVFMHKDSVKYTSFTTPMGQYEWLRMP FT FGLRNASAVFQRFTKKIFADMVKEDKVIIYMDDIMVATKDSDSHMEILQEV FT MRRLVENKLELRIDKCEFLQSEVKYLGYSISGNGIKPDTKGLQAVKGFPVP FT TKTNEVQSFLGLCSYFRRFVKDFSTKAKPLYDLVKKDRKFEFGIVELECFE FT QLKSNLLEAPILALYNPRDPTELHCDASSLGFGSILMQKKGDGRMHPVFYF FT SKRTTITEAKYHSFELETLAIIYALQRFRVYVQGIPFKIVTDCNALTMTLN FT KKELNPRIARWALELQNYDYKLDHRSGSRMQHVDALSRAVQILTVDTNTFE FT ENLIICQNRDNKLMELRETLQKSEHKHYEMRNGIIYRKKDNNTILFCVPEE FT MEGHVLYKYHDELGHVGIDKMTDAISKSYWISNVRYKAKRHIENCLKCIAY FT SAKHGKEEGFYTTYQKGQDHSR" XX SQ Sequence 8650 BP; 3134 A; 1626 C; 2004 G; 1886 T; 0 other; atggtcccat tcccacaaac caagtgggaa atgaagttga atttcgaaca cgaaaaaccg 60 tttcattgcc cacctagaag actgtcgtat agcgaaaaag cccaagtaca gaaaatgata 120 gacgattaca cagaaaaagg ttacatcaga accagtgagt cagaatacgt tacgcctata 180 gtactggtaa aaaagaattc gggagagttg agattgtgcg tcgattatcg cacacttaat 240 aaaggaatga taaaggacaa ttacccaaca cctcttatag acgacctact agataaactg 300 tcagggaaaa gacttttcac caagttagac ctgaagaatg gatacttcca cgtatttatg 360 cataaagatt cagttaaata cacatcgttc acgaccccca tgggacaata cgaatggttg 420 aggatgccgt tcgggttacg aaatgcatcg gcggtgtttc aaagattcac aaacaaaatt 480 tttgcagaaa tggtaaagga agacaaagtt ataatataca tggacgatat tatggtagca 540 acaaaagata gtgatagtca catggaaatt ttacaagaag tgatgagacg attggtagag 600 aataaacttg aattaaggat tgataaatgc gaattcttac agtcagaagt taagtacctg 660 ggctattcca tatcgggtaa cggaattaaa ccagatacga agggattaca ggcagtaaaa 720 gggttcccag ttcccacgaa aacgaacgaa gtgcagagtt tcttagggtt atgttcttac 780 tttagacggt tcgtaaacga tttttctacg aaagctaagc ccctttatga tttggtcaaa 840 aaagacagga agtttgaatt cggtatcgta gaactagaat gtttcgaaca actcaaaagt 900 aatttgttgg aagcaccgat cttggcactt tacaatccca gagatccgac agaattacat 960 tgcgatgcaa gttcgttagg tttcgggtca attctaatgc agaagaaggg tgatggcaga 1020 atgcacccgg tgttctattt ctccaagcgg acaacaatta ccgaagcaaa ataccacagc 1080 tttgaactgg agacactagc gataatttat gcactacaac ggtttagagt atacgtgcaa 1140 ggcataccct tcaaaatagt gacagactgc aacgcgctaa ctatgactct aaacaaaaaa 1200 gaactcaatc cacgcattgc ccgctgggcg ttagagttac aaaattatga ctacaagtta 1260 gaacacaggt caggaagcag aatgcaacac gtagatgcgc taagcagagc cgttcagata 1320 ctcacggtag acacaaacac ctttgaagaa aatttaatta tatgccaaaa cagagataac 1380 aaactgatgg agctgagaga aacgttacag aaatcagaac acaaacatta tgagatgaga 1440 aacggaataa tatacaggaa aaaagataac aatactatcc tattctgcgt acccgaggaa 1500 atggaaggcc acgtactcta taagtacctc gatgaactag ggcacgtcgg tatagacaag 1560 atgaccgacg ctatatcgaa aagctactgg atctcaaacg taaggtataa agccaagcga 1620 cacattgaaa actgtctaaa atgcatagcg tactcagcaa aacacggaaa agaggaaggg 1680 tttttacaca acataccaaa agggtcagga ccattcgagg tagtacatat cgaccatttc 1740 gggccagtcg acaaaggcag ggccaacaaa catgttttag tagtaataga cgcgttcacc 1800 aaattcgtaa gactttatac gaccaaaacc actagcacaa aggaagcagt aattgcaatg 1860 aaagattatt tccgagctta cagtagaccc agatgcattg tgtcagacag aggaagctgt 1920 ttcacgtcaa aagagtttga agaattctta gaacagagca atgttaaaca cgtaaagata 1980 gcaacagggt cgccacaggc caacgggcag gtagaaaggg tgaacagaag tttgggaccc 2040 atgattgcaa agcttgttga cccggaaaaa ggattacatt gggatatggt agtggaaaca 2100 gtagaacacg cgatgaacaa cacagttcaa agaacaatta atgaacaccc gagcagaatg 2160 ttgtttgggg tcgtacaaaa aggaaaggtt tcagatttac taaaagatca tctagacgaa 2220 ctaaccgaac aacgggcaac aagggatata gaaaagatta gagcagtagc aggcactcat 2280 caggaaaaaa tacagtctta taacaaacaa gcagcagatt caaagcgaag ggagccacac 2340 gtgtacgtag ataatgatct agttatggtg agaaatttcg acactcatac aggggtgtct 2400 aaaaagctta tcccgaagtt caaaggacct tataagatat caaaggtgtt aaaaaatgat 2460 aggtatttgc tagaagatgt agagggattt caacaatcaa ggatcccgta taaaggaaaa 2520 aaaatgaaaa atgattgtaa cacgtaaact tagaatgtta agagtaaacc acacccaact 2580 gtaataaatc aacctatcta gaaactaact taaatttaag gagttcagga gcactccagt 2640 caggatggcc gaattgtagc aaggtaataa ttagtgaggc attcaactag tatttaaaat 2700 agcatacagc gcgcattggt atttaccgca ccatcgatac actcggcggg aaataccaat 2760 agtgagatat cggataagaa acatcgataa gagagtaagt tcactttgta aatggcgatt 2820 gtgagagatg gacgcgcagc taataaaaaa gagttatatc aaatcgccgg gcttgtgttt 2880 ttatacctag acagttcact cacgcaccaa tattggtcac tacaattccc gggttagacg 2940 gcctacaatt tcagaagtgg gatacgatca atcgccgcca gtgaaaaata gcgaaaattt 3000 ttcttttctc gtccgtcgaa gacgccatcg ctgaaaaaaa aacatgtcgg acttggtcgg 3060 caaggaggag ttctcggccg cccagctgcg cgaatggctg gagtccctca atctaccaaa 3120 aggtggaagc aaagctgcca tggcagcgag acttaacgaa ataccagtag agctgcgggg 3180 acagggacca ccggcagccg aaacatgcga gaacgagagg gaagatgagg cagccgcaga 3240 tcaagactcg acgaagagcg aacgcaaaga gaagaacgaa aaagcaccgc aagcgcctgg 3300 gcacaacaac aaccacggcg aatatcgagc agaggttgaa atgttaaaac tgcaaatcga 3360 gctgctgaag ctgcagagcg agagggagaa ggaagggaga ggagagaaca caacaccagc 3420 cgccaacaat gacgctggcg tcatgttgct aaatgccgcc aaagacatgt tgccaacata 3480 tcatggcaac atttctggaa acaacgatga cgtcacaact tggatcgcgc agtttacggc 3540 cgtcgcaaaa gtgaacaaac tgaaggatga gaagctacta atgctgctaa tgtcgaagct 3600 taaggacaaa gcattggtgt ggctgcattc gagtccggag cacatgtcgc tgccaatcga 3660 tcagttgttg aacgtcatgg aagacacttt ccatcccaag gaaagcaaac tgttgctccg 3720 tcgcaagttc gagtcccgtt catgggcacg tggcgaggag ttctcgatgt acttcaacgc 3780 caaagtgtcg ctggcatccc gcatagtcat tgacgacgag gagtttattg acggagtcat 3840 cgaaggtatc ccagatgtag gcctgcgtag gcaagcccac atgcagtgct ttggcgctcc 3900 atatcaacta ctcaaggcgt tcgagaagat catgctgcca aagaagtacg gttcaactga 3960 aggggcaaac actggagcct cgccaacacc cattcgctgc tacaactgca actctttggg 4020 ccacgtggca ggggagtgcc gcaagcccaa gcgcgaaaga ggagcgtgct acggatgcgg 4080 cagcatgagt caccaggtgt cacactgtga cgaaaagaag tacaaggtac ataacgatta 4140 tatatacaaa tttaatatat acattagcaa ttacaataac aaatgcttgt ccttagattg 4200 cctcatcgac tcagggagcc caataagttt tatgaagtta tcgtgtctag agcaccagtc 4260 ggaaaataac ctaaataaag tggaagagcg ctcacgatta agttcgaaca aaatcaaaga 4320 agatgattta agtttatata cgtttaacaa agacaagaaa aacagagaaa ttgaatttat 4380 tttgactaaa aatgcggatt acgcttatgt tggactaaac aatagcaaac ttaacatctt 4440 tggtaaaata cccagttttg taattattaa caaaaatcga atcaactttg aattactagt 4500 agtggcagac gagtcgatgg gctatgaggc tgtgttaggt agggatttta tgaatttatg 4560 caaatttaaa attgtaagtg acatttgcaa ggagaaggag agtgagacaa aaaatgagtg 4620 tttagaaaaa aatgagtgtt taggagaaaa tgagtgttta gaaaaaaatg agtgtttagg 4680 agaaaatgag tgtttaggat aaaatgagtg tttagaaaaa aatgagtgga aaatagagaa 4740 tgagcgttta ggagaatatg agtgtgaagt aaaagatgag ggtgataaag agaataagca 4800 tgaagtagaa agtgaaggca aggtagaaaa tgagtgttct gcaatggttg agtgtctgga 4860 agacagcgat acctgcgaga gcgtaaaaac caatttcaac gaactaccaa gccagtatcc 4920 agacaaagaa aacgactggg agttgaaagt agggacagat tgtagccaag aactcgaagg 4980 acgacttaaa tacatgttta ggacagcgta tataaacgca gaaagaccaa acttaccaca 5040 aaccaagtgg gaaatgaagt tgaatttcga acacgaaaaa ccgtttcatt gcccacctag 5100 aagactgtca tatagcgaaa aagcccaact acagaaaatg atagacgatt acacagaaaa 5160 aggttacatc agaaccagtg agtcagaata cgtttcgcct atagtactgg taaaaaagaa 5220 ttcgggagag ttgagattgt gcgtcgatta tcgcacactt aataaaggaa tgataaagga 5280 caattaccca acacctctta tagacgacct actagataaa ctgtcaggga aaagactttt 5340 caccaagtta gacctgaaga atggatactt ccacgtattt atgcataaag attcagttaa 5400 atacacatcg ttcacgaccc ccatgggaca atacgaatgg ttgaggatgc cgttcgggtt 5460 acgaaatgca tcggcggtgt ttcaaagatt cacaaaaaaa atttttgcag acatggtaaa 5520 ggaagacaaa gttataatat acatggacga tattatggta gcaacaaaag atagtgatag 5580 tcacatggaa attttacaag aagtgatgag acgattggta gagaataaac ttgaattaag 5640 gattgataaa tgcgaattct tacagtcaga agttaagtac ctgggctatt ccatatcggg 5700 taacggaatt aaaccagata cgaagggatt acaggcagta aaagggttcc cagtccccac 5760 gaaaacgaac gaagtgcaga gtttcttagg gttatgttct tactttagac ggttcgtaaa 5820 agatttttct acgaaagcta agccccttta tgatttggtc aaaaaagaca ggaagtttga 5880 attcggtatc gtagaactag aatgtttcga acaactcaaa agtaatttgt tggaagcacc 5940 gatcttggca ctttacaatc ccagagatcc gacagaatta cattgcgatg caagttcgtt 6000 aggtttcggg tcaattctaa tgcagaagaa gggtgatggc agaatgcacc cggtgtttta 6060 tttctccaag cggacaacaa ttaccgaagc aaaataccac agctttgaac tggagacact 6120 agcgataatt tatgcactac aacggtttag agtatacgtg caaggcatac ccttcaaaat 6180 agtgacagac tgcaacgcgc taactatgac tctaaacaaa aaagaactca atccacgcat 6240 tgcccgctgg gcgttagagt tacaaaatta tgactacaag ttagatcaca ggtcaggaag 6300 cagaatgcaa cacgtagatg cgctaagcag agccgttcag atactcacgg tagacacaaa 6360 cacctttgaa gaaaatttaa ttatatgcca aaacagagat aacaaactga tggagctgag 6420 agaaacgtta cagaaatcag aacacaaaca ttatgagatg agaaacggaa taatatacag 6480 gaaaaaagat aacaatacta tcctattctg cgtacccgag gaaatggaag gccacgtact 6540 ctataagtac cacgatgaac tagggcacgt cggtatagac aagatgaccg acgctatatc 6600 gaaaagctac tggatctcaa acgtaaggta taaagccaag cgacacattg aaaactgtct 6660 aaaatgcata gcgtattcag caaaacacgg aaaagaggaa gggttttaca caacatacca 6720 aaagggtcag gaccattcga ggtagtacat atcgaccatt tcgggccagt cgacaaaggc 6780 agggccaaca aacatgtttt agtagtaata gacgagttca ccaaattcgt aagactttat 6840 acgaccaaaa ccactagcac aaaggaagca gtaattgcaa tgaaagatta tttccgagct 6900 tacagtagac ccagatgcat tgtgtcagac agaggaagct gtttcacgtc aaaagagttt 6960 gaagaattct tagaacagag caatgttaaa cacgtaaaga tagcaaaagg tcgccacagg 7020 ccaacgggca ggtagaaagg gtgaacagaa gtttgggacc catgatgcaa aagcttgtga 7080 cccggaaaaa ggattacatt gggatatggt agtggaaaca gtagaacacg caatgaacaa 7140 cacaattcaa agaacaatta atgaacaccc gagcagaatg ttgtttgggg tcgtacaaaa 7200 aggaaaggtt tcagatttac taaaagatca tctagacgaa ctaaccgaac agcggacaac 7260 aagggatata gaaaagatta gagcagtagc aggcactcat caggaaaaaa tacagtctta 7320 taacaaacaa gcaacaaatt caaagcgaag ggagccacac gtgtacgtag ataatgatct 7380 agttatggtg agaaatttcg acactcatac aggggtgtct aaaaagctta tcccgaagtt 7440 caaaggacct tataagatat caaaagtgtt aaaaaatgat aggtattttc tagaagatgt 7500 agaggggttt caacaatcaa ggatcccgta taaaggagtt tgggcggtgg caaatatgaa 7560 gccgtggatt gaaaatggta ataatgcaaa tgtcacaggg aatcaaaatg aagaaaatga 7620 ttgtaacacg taaacttaga atgttaagag taaaccacac ccaactgtaa taaatcaacc 7680 tatctagaaa ctaacttaaa tgtaaggagt tcaggagcac tccggtcagg atggccgaat 7740 tgtagcaagg taataattag tgaggcattc aactagtatt taaaatagca tacagcgcgc 7800 attggtattt accgcaccat cgatacactc ggcgggaaat accaatagtg agatattgga 7860 taagaaacat cgataagaga gtaagttcac tttgtaaaat ggcgattgtg agagacggac 7920 gcgcagctaa taaaaaagag ttacattaaa tcgccgggct tgtgttttta tacctataca 7980 gttcactcac gcaccgatat tggtcactac aattcccggg ttagacggcc tacaatttca 8040 gaagtgggat acgatcaatc gccgccagtg aaaaatagcg aaaatttttc ttttctcgtc 8100 cgtcgaagac gccatcgctg aaaaaaaaca tgtcggactt ggtcggcacg gaggagttct 8160 cggccgccca gctgcgcgaa tggctggagt ccctcaatct accaaaaggt ggaagcaaag 8220 ctgccatggc agcgagactt aacgaaatac cagtagagct ccggggacag ggaccaccgg 8280 cagccgaaac atgcgagaac gagagggaag atgaggcagc cgcagatcaa gactcgacga 8340 agagcgaacg caaagagaag aacgaaaaag caccgcaagc gcctgggcac aacaacaacc 8400 acggcgaata tcgagcagag gttgaaatgt taaaactgca aatcgagctg ctgaagctgc 8460 agagcgagag ggagaaggaa gggagaggag agaacacaac accagccgcc aacaatgacg 8520 ctggcgtcat gttgctaaat gccgccaaag acgtgttgcc aacatatcat ggcaacattt 8580 ctggaaacaa cgatgacgtc acaacttgga tcgcgcagtt taaggccatc accaaattta 8640 aaattgtaag 8650 // ID DNA8-5_CQ repbase; DNA; INV; 1461 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1461 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 82-82 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. 8bp TSDs. XX SQ Sequence 1461 BP; 489 A; 274 C; 266 G; 432 T; 0 other; cagggttgcg aatgagcgag cgctcacgga aggcttgctc gctccagctg gagcgtgagt 60 ttttatcagg tagctcgtgc tcgctcacgc tcaccggagc gttttgcgct cacgtgagct 120 ggtgagccaa agtaggtagt tgttttttcc tgttgaagca tctgcctgtt tgaaacgaag 180 tttctagaac tatctcagca caggcagcaa aaacaaacaa gaaagtggtc gaacaaacaa 240 aagtaggcaa tgaaatggat ttgatcagcg aaccgaaatt tacgttctat ttaaaatcta 300 gacatttttc gaacaaggaa caaaaaacta aattcataat gtaaacattc attgacgtga 360 agagatgcac tgtttgttca caaatcaaat ccatattgga ccatattgta ttgttgttat 420 gtacaaaaat attgacaatt tatgtatatt ttatttattt ttaaataatc ttttaaacag 480 ttacaatcat ccatgttcaa aaaatcatat ataacttgta attgaatatt ttcagaggtt 540 tggttgcgaa agtgttgaat aaaatccact tgttggaaga agagccaacc ttttcgtggt 600 gaaaaatatt gacaatttat ttgatttgag tcaaaagggt tatttcaacc cttaggaatt 660 ttgaggcata tatttaaaat agaactgttt caaaaatgtt accctatttt ctaaaatgtt 720 tgccaaataa agactaacct tatagaaatg attagccgtt tctaatcact gaatcactga 780 aagatttaat aaacggaata acttttttta atggtagctc attctgaccc gcaatctgga 840 aatgtagtca aaacactgat taggtacata aaaacagttt aatgttatat tttaatcaac 900 caagcattta accttgaatc gaacttacca aagcataaat cctcttgaat caattcattg 960 ttttgtaagt ttttctaagg tttacaggcc cttttattta ggcgtctata cccctccgcc 1020 cctctctgct acaataaggc ctttaaattc taaaatattt atacattcga ctgtatgtct 1080 aaaaaattat ttgacccttg cgaatccaac cttttattta aaaacatgat tgatgaaaat 1140 aagctattcc aattcatata tcataaaatt tgttcacaaa gtcatatttc cacagcccta 1200 ctgcacctta cgcaaaaaat ccatctagaa ttgaaaaaaa aactttctca aaaacgttaa 1260 atgattagcg acacgtccaa aatgagcgct ccgtgagcga aatatgagcg cgagcgcgag 1320 cgtggagcaa acgcgagctg aaaaaatcgg agcaaacgtg agcgcgggca aacgtgagct 1380 agctcgcgca gcgctcctcc tcacgaatga gcgaaaagtg agcgctcacg agcgcgtgag 1440 cgcgtgagga acgcaacact g 1461 // ID Gypsy15-NVi_I repbase; DNA; INV; 7609 BP. XX AC . XX DT 02-MAR-2008 (Rel. 13.03, Created) DT 02-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy15-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7609 RA Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 8(3), 247-247 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 420..2807 FT /product="Gypsy15-NVi_I_1p" FT /translation="MSEYSTSTMSLTGSYIDLSRSHVAIQDHVRSMDTNTL FT GQALDFLGLQDRGADEDRMRRLFRCLAGDYAPEDFRPSLTLLNSEELFRSR FT ATRRASYITNYIPQEVSGIFEEEERNANQISLRILHQRNTIRNPDVVIQEQ FT QAYQPLDCSNPSEIENEFSWLNPVTIGFGQPLHREQVRQPLATIFGGNPCD FT NTSTVGNDVVEESLENPDQIRIDANPPELADEPERTSNVDFLMYLASDGCT FT YNLPTRVVDDLLKKDDEIAQYKKIQDEACEYIRKLRMANTAQREPPACSTR FT LETSEFTRSKEDAPPPVETRNKHVDFNDTYTTRYFEPTTKNFSTSDLEASN FT MYQQPQRRSNQQSTPHNHTPGQLRPIQPPYTLPQSSYERNIFSQNRMAQQA FT QIPTCHQSQCYQTCDQHRINQPQNTGDSQQRSGNAWREGDQPIRPNRLVSE FT RGPSNLQTNTSGYYDSNLQNTLRENSADINDKIRKWNVKFGGTSTEDVEEF FT IRLVEGCRKLARPRISEEELLDALSHTTSLLSGRALKWARVERPKWHTWED FT FCEEARATYGMTDEERSQLIVEASSRTQGPQEDVNDYYVSMVMIFNRMPEA FT LPIRQRLDILYQNMNPELKLIFGRMEFETLNEFKSKAVSAEKRTRAKSRYK FT PPPTPDKSMLPGAAYNPQEKAKNKNQPAIAAICPTDAEEPPLWFKTWSSTN FT YPSNAKKDNNGRQNQKGSNKPTQGKKKTSAANQQAKPTASTEDKNKSNSKE FT GDKSSGKPTNPANKDKKELVCWNCKLPGYTKKTCPDCAGKAKGEQ" FT CDS 2771..5416 FT /product="Gypsy15-NVi_I_2p" FT /translation="MSRLCGKSKRGTVSEIRTDPEVNSGISASQSTEQLNP FT TAIAHCAATSSNAVLSDQRWWLQVQVGEFRVRALYDPGATQTCIGATSVQL FT ASACNAEIKPCDGIVACMADGKTSMVTGTVDLPFTIGGQTRTLTTLVVPDL FT REGCNLGADFVLQFDARLNPRERTLTVENSTEVINACEAVTDGTTRPVLAS FT IGLQDLHEGQKRMLEELLDRLIPKDDGPLGFTNLVEFDLEVETCRPIKQKF FT YPVSKKLEEMLFSQVRELLAADIIEPLFSDWASPVVMVKKGNSGKRRLCID FT FRKLNKVSKVSAYPLPFMDTILSKLQCARYISTTDLSNAYHQIKIKKESRP FT YTAFKVPGMGLFQWKRLPFGLAGAPACFQRMIDSLITPDLEPHCYSYLDDI FT IVCTETFEDHLKVLEIVLTKLAKARLTINREKSHFCQSEVRYLGVLVNRDG FT FRPDPDKIAPIIEYPTPKTLKQLRRFLGMSSWYRKFLKDFATIVEPINRLL FT KKGTSYVWGKDQERAFEQTKMLIASAPVLHRPDFNKTFCLQCDASGTGLGA FT VLTQVIEGEERVLSFASRTMTPAERNYSASERECLAVLWAIQKFRPYVEGY FT HFLVITDHSSLKWLSNLRNPTGRLAIWALELQEYDYTIEHRKGSMNVVPDA FT LSRLYEGDDAEETRPIIASITPEDETTDKWYTELKKEVTDNPLEFPGWKLL FT GGKLYTYRPDAFVDDVFEDENAWKLVLPKEKRAQVLNECHDEATSGHFGRK FT KTQERVALSYYWPSMKKDVTNYVRSCLICQQCKVEQRKPAGQMGSRNFSRP FT WETVAGDVMGPLPKSTHGYEYLLVFQDMFTRWVECIPIRKANAQTIIKNLN FT ERVFYGSERQKSSFPTTEQSSEIRLWINS" XX SQ Sequence 7609 BP; 2385 A; 1687 C; 1817 G; 1719 T; 1 other; gccgctccgg cgtataattg ctttttcgaa ttcgcgtaga aaataaattt tttgcatgat 60 agagggagtt agtaacattt aaattcacaa ataatttatt aaagtggaat tgtttttttt 120 ttcttacaaa ttagtaagtt tttttctctt tctcagattt taaaacttgt agcacaaaac 180 gtgcgtgatt acttgctctg tcaggcaatt tatcgcggtt caaaaacaaa tmattgttac 240 cgttatacaa atttaattga gtgattaaaa ttaaatcaca cgtattaatc gaaaaagcaa 300 acgtgtgttc cagtgtcgac aactaggtat taaattcact tcgagagtaa cttggattgt 360 tgaatcctga gttttgtttt tggttgtgct gaatttgact cggctctgct tctgattaaa 420 tgtcagaata ctcaacgagc acaatgtctt taactggctc gtacatagac ttaagcagat 480 cgcatgtagc gatacaagac cacgtaagat cgatggacac gaatacgtta ggtcaagcgt 540 tagatttctt aggtttgcag gatagggggg ctgatgaaga cagaatgcgt cgattgttta 600 ggtgcctagc aggtgactac gcgccagaag acttccgtcc gtcgctaacc ctgttaaatt 660 cggaggagtt atttaggtcg cgggcgacac gacgcgcgtc gtacataaca aattatatcc 720 ctcaagaagt cagcggcata tttgaagaag aagagcggaa cgcgaatcaa ataagtctaa 780 gaattttgca ccaaagaaat acaatccgta atcctgacgt tgtcattcag gagcaacagg 840 cgtatcagcc attggattgc agcaatccaa gcgaaatcga aaacgaattt agctggttga 900 acccagtaac aatagggttc ggtcaaccgt tgcacagaga acaggttcga cagccgttgg 960 ctacaatttt tggtggtaat ccgtgcgata atactagcac ggtagggaat gacgttgttg 1020 aggaaagcct cgaaaatcca gatcaaataa gaatagacgc gaatccccca gaattagcgg 1080 atgaaccaga gagaacatcg aacgtcgatt tccttatgta tttagccagc gacggctgca 1140 cgtataattt accgacaaga gtcgtcgacg acctattaaa gaaagacgac gaaatcgcac 1200 agtataagaa aatacaagat gaagcgtgcg aatatataag gaaattacgc atggctaaca 1260 ccgctcagag agaacctccg gcgtgcagta ctcgattaga aacatcagaa tttacacggt 1320 caaaagaaga tgcaccacca ccagtggaga ccaggaataa acatgtagat tttaatgata 1380 cctacactac acggtatttt gaaccaacga ccaagaattt ctcaacttcg gatttagaag 1440 ctagcaacat gtaccagcaa cctcagcgaa ggtcaaatca acagtcgaca ccacacaacc 1500 atacgcctgg ccaactacgg ccgattcagc ctccgtatac gttaccacag tcgagttacg 1560 agaggaatat cttttcacag aacagaatgg cacaacaggc acaaataccg acatgtcatc 1620 aaagccagtg ttaccagacg tgcgatcagc atagaataaa ccaaccacaa aatactggtg 1680 attcccagca aaggagtggg aatgcttgga gagaggggga ccagccaata cgtccgaatc 1740 gcctggtttc agaacgaggg ccaagcaacc tacagactaa tacgtctgga tattatgact 1800 ctaacctcca aaatacgttg agagaaaatt ccgccgatat taacgacaaa atacggaaat 1860 ggaatgtaaa attcggaggc acgtcaacag aagatgtaga agagtttatc agattagtgg 1920 aaggttgcag gaaactagct cgtccaagga tttcagagga agaattgtta gacgcgctat 1980 cacacaccac ttcgctacta tctggaagag cactgaaatg ggctcgagtt gaacgtccga 2040 aatggcacac gtgggaagat ttctgcgagg aggccagagc aacgtacgga atgacagatg 2100 aagagcgttc acaattaata gtggaagcta gctcacgtac gcaaggacca caagaagatg 2160 taaacgatta ctacgttagt atggtcatga tattcaaccg tatgcctgaa gctctgccaa 2220 ttagacagcg gctggatatt ctataccaga atatgaatcc agagttaaag cttatttttg 2280 gtcgaatgga atttgagact ctgaacgagt tcaagagtaa ggctgtgagc gctgaaaaac 2340 ggacgagggc taagtcacgc tataagccac cccctactcc agataagtct atgctgccag 2400 gagcggctta caatccacag gagaaggcaa agaataaaaa tcagccggcg atagcagcta 2460 tctgtccaac ggatgcagag gaaccacctc tttggttcaa gacttggtct agtactaatt 2520 acccctcaaa tgccaagaaa gacaacaacg gccgtcagaa ccaaaaaggg tcgaataagc 2580 ctacccaggg taagaagaaa acctctgcgg caaatcagca ggctaagccc accgcgtcga 2640 cagaagataa aaataaatct aactccaagg aaggtgataa gtcgtcggga aaaccaacca 2700 atccagcaaa taaagataaa aaagaactag tttgttggaa ttgtaaatta cctggctaca 2760 cgaagaagac atgtcccgac tgtgcgggaa aagcaaaagg ggaacagtaa gcgagatccg 2820 tactgacccc gaggttaatt cagggatctc ggcatcacag tcgactgaac aactaaatcc 2880 gactgcaata gcacactgtg cagcgaccag tagtaacgct gttttatccg accagcgctg 2940 gtggctgcag gtgcaggtag gtgaattcag ggtcagggcc ctttacgacc caggagccac 3000 tcaaacttgc atcggggcta cgtccgtaca actagccagt gcatgtaacg cagaaataaa 3060 accatgtgac ggaatagtcg cctgcatggc tgacggtaaa acctcaatgg tgactggaac 3120 agtcgacttg cccttcacga tcggtggaca aaccaggacg cttactacgc tagtagtacc 3180 tgacttacgt gaaggatgca atttaggcgc tgacttcgtt ctacaattcg acgccaggct 3240 taaccctcgt gaacgaacgt taacggttga aaattcaacc gaagtcataa acgcctgtga 3300 agcggtaaca gacggaacca ctcgtccggt cttagcttcg attggcctgc aggatttgca 3360 tgaaggccag aaacggatgt tagaagaatt actcgatcgg ttaattccta aagatgatgg 3420 gccgctcggc tttacgaact tggtggagtt cgatctagag gtagagacat gtcgtccgat 3480 caagcagaaa ttctatccgg tttcgaagaa actcgaagaa atgctctttt ctcaagtaag 3540 agagttacta gcagcagata tcatcgagcc tttgttcagc gactgggcta gtcccgtcgt 3600 gatggtgaaa aaggggaata gtggaaagcg tcgattgtgc atcgacttcc gcaaattaaa 3660 taaagtttcc aaagtaagcg cttaccccct acctttcatg gatactatct tgagtaagct 3720 tcaatgtgct cgttacatat ctacgacaga tttgagcaac gcttaccatc agataaaaat 3780 aaaaaaggaa agtcgaccgt atacagcatt caaggtcccg ggaatggggt tgttccaatg 3840 gaaacgtctc ccgttcggct tagctggtgc tcctgcctgc ttccagcgta tgatcgattc 3900 cttgattacg cccgatctcg aaccacactg ttacagctat ttggacgata ttatcgtctg 3960 taccgaaacg ttcgaagacc atttaaaagt cttagagatc gtgctgacga agctagccaa 4020 agcacgattg actataaatc gcgagaaaag tcatttttgt caatcggaag tgcggtatct 4080 tggcgtgctc gtcaacaggg acggttttcg tccggatccg gataaaatcg ctccgattat 4140 agagtacccg actccgaaga cattaaaaca actgcgcagg tttttgggaa tgtcgtcgtg 4200 gtacaggaag ttcctcaaag attttgcaac gatagtggaa cctataaatc gactgcttaa 4260 gaaaggaaca agctacgttt gggggaagga ccaagaacgc gcgttcgagc aaactaaaat 4320 gctgatagca tccgctccgg ttctccaccg accagacttt aataaaacct tctgtctaca 4380 atgcgacgct tctggcacag gccttggtgc tgttttaacg caggtgatag aaggggaaga 4440 acgtgtgtta tcgttcgcga gtcgcactat gactccggcg gaacggaatt actcagcgag 4500 cgagcgtgaa tgtttagcgg ttctctgggc gatacaaaaa ttccgtccgt acgttgaagg 4560 ttaccatttt ctggtgatta ccgatcatag cagtctcaag tggctaagta atttacgaaa 4620 tccaactgga cgtttagcca tatgggcact tgagttgcag gaatatgatt acacgatcga 4680 acatcgcaaa ggctcaatga atgtggttcc cgatgcgctg tcgagacttt acgaaggaga 4740 cgacgcagaa gagacacgtc cgattattgc gagtattaca ccagaggacg aaacaactga 4800 caaatggtac accgaattaa agaaggaagt aactgataac ccgctagaat ttcctggctg 4860 gaagttatta ggtggaaaat tatataccta ccggcctgac gcattcgtcg atgatgtttt 4920 cgaagatgag aatgcttgga aattagtact tccgaaagag aaaagagctc aagttttgaa 4980 cgagtgtcat gacgaagcga cgtcaggtca tttcggccgc aagaaaacgc aagaaagagt 5040 cgccctgtca tactattggc catcgatgaa aaaagatgtg acaaattacg tccgaagctg 5100 cttgatctgt caacaatgca aagttgaaca aagaaaacca gctggtcaga tgggcagtcg 5160 caacttctcg cgtccatggg aaacagttgc tggtgacgta atgggtccgc taccaaagtc 5220 tactcacggt tatgaatatt tactggtgtt tcaagacatg tttaccaggt gggtagaatg 5280 catacctata cgaaaggcca acgctcagac catcattaaa aacttgaatg aacgggtttt 5340 ctacggttcg gaacgccaaa agtcttcttt tccgacaacg gaacagagtt ccgaaataag 5400 gctatggata aattcctaga ggaacgggga gtacaccata cgtacgcacc accgtatcac 5460 cctcagccta acaccgtaga acgtgcgaac gctgatccgc gaatatatcg agggtcatca 5520 taaaaactgg gacgaacata tagcagagtt cgcattcagt atgaacacgg ctacacaaga 5580 ctcactccag acttctccgg cgatgttgaa ttttggtaga cagccgagcc accccaaatc 5640 tctgcttcga caagaggaga ttgaagctat agccaacgaa gatcagatcg ctatatctga 5700 atggaaggct aggatggaga agctgagaga actccaagaa gcggctaacg ataacgctca 5760 agaagcgcaa gcgcgccagg ccaaatattt caacagccgg caccgaaacg tggagtactc 5820 agtaggcgac gaagtctgga agaagaatag agtactgtca tcagcagcgg agggagttca 5880 gctaaattag ctccaccgtt tatcggcccc ttcaaaatca gccgtatact gagcccaggt 5940 atttacgaac tagttgaccg aaacggaacg gttgatggac cgactgctgt cgaatatttg 6000 aaaaagtatc acaatagaga agaaaacgac ggtgagccta tcgcagcagc aggcggctgt 6060 gaagttactg atcaagtaga cgacatcagg actacgtccg tcaacaatga agaaactgaa 6120 gaagcgaaag aatctaacgc aagtagcgag cgaccgaaaa agcgaggtag gccgaggaaa 6180 acgcggttac ttgtaaaacg caaaacacta ccggcgaaga ttgtaaaccc gacaaacacc 6240 gtaagcgtcg aggtagacaa agcgaaacgc ggaagaccga aaggttcgaa aaataaaccg 6300 cgtgacgttc cgaatagttt gtctcctagg aaaacgcgag cgcaaaaggc ggcgggtacg 6360 gttgacaatt aaaatcatgc gattagggag cttcattctt ctgtataccg tcccattctc 6420 cactcataat tcccttcttc ccacatagac ggctgaacgc aagatggacg acatcaggga 6480 gttagcagag gagattgtgc aacaaatcgt cgaagagtcg atgagtcaca cggtcgacga 6540 cacgccagca gtaccggcca acgtcccatc cggattagag gtgggaagcg aactgctcaa 6600 ccggtcgatc tttgctatcc cggaatcacc gaccgagatc gaactgagga agcaagctct 6660 gctgaaagaa ctgcaggtga tagaagacgc acagttcgct caggctctag tcagaagaca 6720 gcaagcgaga aaccgagcga cgcagctcag aaccgagctg atcgctgcgc tggaagaagc 6780 catcgctatt gggtcggaaa ttaaggaaaa gtatggccac cggccggatt tacctaaggg 6840 agacacaacc ctagtagaaa ccgtggcatt tctaaaggcc aaatgcaaag cgaagccgca 6900 accattgcca gcgccaccgg ttgtcccgaa gacggctcca acagccgtaa ggccagattc 6960 gaccagccat cagcccgaca cgacgtcgaa aaccacggcc gctccagcag caacagccgt 7020 aaagaaacgg ccgactctaa ccaacgaaca gttgaacaaa gagttcgacc gtaaccgtga 7080 actgaagcgt cgagcgaaaa ataggctgcc gacttacacc gaacctggac cgcaatgccc 7140 catttgcgga taccgaggta ttgctcgatt ggacgagtgc cccaatttca acgtccatgg 7200 aatggactta acgtacggcg acaagtacaa gaagaagtag aagaagaagc tcaggctaac 7260 ggccgaacca cttcatttcc tcctgtacct tacctcaatc ccgtgcaatg aagcattttg 7320 ttttttctta tttctcttct ttctctctta atgttttttt ttagaggcaa aagactgccg 7380 tactggccaa cggccgtatg taaaaagaga ggcgagagac taaaaatgaa ctattttttt 7440 ttttatttac ttctttcatt atatgtatta tctaattaat gatgataata aaaagtcaag 7500 ttttccttgt aaaaattgat atttattaca agttcttatt acccagttat acatgactga 7560 gtgcgagctg gtacacctgg cagggtacac agcgtcaggg ggagggtgt 7609 // ID REP-2_CQ repbase; DNA; INV; 1985 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A repeat family from Culex quinquefasciatus - consensus. XX KW Repetitive element; nonautonomous; REP-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1985 RA Kojima K.K. and Jurka J.; RT "Repeats from the southern house mosquito."; RL Repbase Reports 11(1), 605-605 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 18 sequences with >94% CC identity. No TIRs. XX SQ Sequence 1985 BP; 566 A; 456 C; 418 G; 540 T; 5 other; ttgttctgat accccaaaat gggaacggtt aacaaagggt tctgatcctc ttcgagtgac 60 cagttgacaa tagttgtttg cttttgtatt tcatgttgtt ggttgatgaa ctgtgttgaa 120 tctggaagga tcaattacaa gattggtatg ttttcgatgt tgtctatcgt caaaacataa 180 aacatttaca cctgcgttta ctttactcat taaggagtgg tggtccttaa ttctgamaaa 240 tcacaakggt ttagtggaga agagctttcc ttcaatggta ccaatcactt taactaaatc 300 aatgatttgg ctgaagctaa ggactcgagc aatgtctaga tgtgtatgtg cctaggataa 360 caaaaaaaaa ataactttgt catttttttt accccgactc cgggaagatt gtaacctttt 420 gaaaatcggt ttcaataaaa tcatactcga accacaccaa accagcttga twaaaggaag 480 attgtaaaat attgtcagta aatgtttaaa tgtgcagtat ttgttgttct gtgtttacac 540 tttgaacata tgatatttca tgtttattga gcgtccaggt tggtgtcttt cagatttagg 600 agttttgtta tttattaaat tatttattta tctatttgtt atctgttatt tattacctat 660 ttattattca aaattagcat gcaattaaat aattcaaata gtcgacwtta gggaaaaagg 720 ccttttgcat ctgagacacc aacccatatg aatwttgtta tccccacccc tgcgaacacc 780 atcatcacca ccgtacgtca gtgtcatcga gctgtcaccg atccctgtgg caaaacagga 840 aacagttctg gagggaacag ctaggcggcg cataaatacg ggcgcagacg atggtgtcgg 900 cctcttttgc tttttcggtc cggagtggtg aagaccagtg atcgccccgc aagttcgtcc 960 gtcaccacac gacacgggaa gaaagttcgc gtgtgtaatt gaaaaatcct aattaaaatt 1020 acacaaaaaa acattggtta cgaaacagtg aagtgccaca aacaaattaa aaccaataag 1080 cctaaagagt acggatcgga gtgtgacggc accacggacg gttagaagga caccacaaca 1140 cgttaaaaat tcctaaccga gttaggtccc tccggtcacc atcgtacgga tccgagcgag 1200 cgtggtgcaa gaacaagaac ctgcagtgaa gtgaagtgaa gtggtcggga ctggccgcac 1260 gaagcaccgc ggtgctgagt gccgagtggt acaacgtcaa atctgttggc cagcccccga 1320 gtcaaagact cccccagtgg tacctcgctc ggatcgacca cttccccccc ccctttcgcg 1380 cgtgcaccac cctccggacc cgcgaactcg accctccgct acaaataggg tccgcgatca 1440 cgtgctttgg cggtctcctt ctgagaccgc gccgacacca acctgtggcc ctctggggcg 1500 aagccaagaa gaagacgccc tctcggcggt ccaaccggac ccgccgttcg ttccctcctg 1560 ttccggaacc cgaccagcag cagcagccat gacagcgacg caacaagccg gccaccgagc 1620 atcaggatcc caacgataaa acgcatgcat gcaagtaccg tacacaaata cacgttaaaa 1680 tcggattccc ccgtctttcc cttaatttat tacccctaaa agttcttaaa attattgaaa 1740 gtcttaaaat ctaaataaaa agttaaaaat gtggtctcaa tcaagctgtg tttgtcctat 1800 ttatttaaaa tcctaaatat tagtctttgg ttgaaaatta gtgtggttca cgtgagtatg 1860 aaatgtttgt ctttgtgtga gaaatctccg ggtagatacg tgtgccgccc tatggagcaa 1920 tccccctaag ggactccatg tgctcgtctc atatatttaa ccccagaata gttaattact 1980 atcaa 1985 // ID CR1-85_AAe repbase; DNA; INV; 3646 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-85_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3646 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1173-1173 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 70..636 FT /product="CR1-85_AAe_1p" FT /translation="DRRSKGVSTNRLSPAVLSPVPNRFQFNRPSLKRSRER FT DEVAVXPSDRPSKVFRGTNQCTNASVRSADIPXDKFWVYLTKISPEATEDD FT ILNLARSCLQTEDIVAKALVPKGRPLSSLTFVSFKVGVNQDLKSRAMDPST FT WPSEIQFREFIDTSSSAXHFWTPRPRFDPGTLNIQEQQVTIAPTPNLNLQ" FT CDS 450..3584 FT /product="CR1-85_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="PGSEVKGNGPIHLAIRNSVSRVHRYQFKCPAFLDAEA FT EIRSRYVEHPRAAGNHCTNTESEFAIASLDEQLSFPSTSISMPHRIPEPSR FT ASTRSKSKPLTIYYQNVRGLRTKTNNLLLSLSACDFDIVVFTETWLHADIA FT SSELSCNYVIYRCDRNASTSCLQRGGGVLIAIKSELNCRAVHLVDSDDLEQ FT TAVRITLPLFSLFVCCIYIRPNSCPEKYKRHVDSVNQISTQAGPRDVIVCL FT GDYNLPNLVWHFDEEILAYLPINASTEQELALAEAMMSAGFYQVNDLLNVN FT GKLLDLVFVSDQSIIDLFESPYALLKVDAHHKPFILTVDACSRDEDSSATF FT TSNFDYDFNNCDYEELNARIASIDWQIVLNHVSIDDAVSAFYSTIHHVIQS FT TVPVRPRRNSKQFSQPWWTPQLRNLRNRLRKIRKRFLRHRNASNMRELKHA FT EEEYNLQQRNRFREYTDSLQENFRQNPSSFWSYINKRKSSSRVPVDVNYRS FT RVSCSPVATANLFADFFRSVHNDATPQLSQQYLDSLPAYNFRVPQVAFSNV FT EILEALSTLDASKGPGPDMLPPVFIKQCAHSLSLPVTSIFNRSLIEGVFPD FT AWKLASITPIHKGGNMNNVENYRPISILNCLAKVFESLIHGILYPVVQHAI FT SEYQHGFVRKRSTTTNLMSYTHSIINHLEKRNQVDAIYIDFAKAFDKVPHE FT LAIEKLSRLGLPFWVIQWLKSYLSSRKAFVKVHDARSNVFDIPSGVPQGSH FT LGPLIFILYINDLCETLNSSKLLYADDLKLFRVIKSLLDCCALQADIRTVS FT RWCEVNGMQMNVSKCKVITFTRRQSIISFDYSLDNTSLEKVRTIKDLGVTL FT DSKLRFNEHISVTTAKANAMLGFLRRNTSQFDNVYALKSLYCALVRSILEY FT GVQIWAPYHAVHIERIERVQKRFLRFALRGLPWNDPVNLPPYENRCALIGL FT QPLASRRVMLQRLFTFDVIRNNIDCSSLLENVRLHVPIRQLRNRQLLWIPG FT HSTLYGLNNPFDICCRLFNQVCNVFDFNISKLIYKNRIK" XX SQ Sequence 3646 BP; 986 A; 870 C; 765 G; 1021 T; 4 other; acgacgctta ccagaaactg gttgatgatc tgaagtccga gatcaaggag agtttgatca 60 tcgaattgag acaggagatc caagggggtt tcaacaaaca gactatctcc agctgtactt 120 tctcctgtac ccaatcgatt tcaattcaat cgtccatctc tgaagagatc acgcgagcga 180 gatgaggttg ctgttttkcc gtccgatcgt ccttcaaaag tttttcgagg tactaatcag 240 tgtacgaatg cctctgtgcg atctgctgac atcccgwtgg acaaattctg ggtttatttg 300 actaaaatct ctcccgaggc cactgaagac gacatcctca accttgcaag gagttgtttg 360 caaacggaag atatagtagc caaagcactg gttccaaaag ggcgcccgtt atcctcgttg 420 acgtttgtgt cgttcaaagt gggagttaac caggatctga agtcaagggc aatggaccca 480 tccacctggc catcagaaat tcagtttcgc gagttcatcg ataccagttc aagtgcccwg 540 catttttgga cgccgaggcc gagattcgat cccggtacgt tgaacatcca agagcagcag 600 gtaaccattg caccaacacc gaatctgaat ttgcaatagc ctcactagat gaacaacttt 660 catttccgtc aacgagcata tccatgcctc atcggattcc tgagccctct cgtgcttcta 720 ctcgctcaaa atctaagcca ttgacgatat attatcagaa cgttcgtggc ttaagaacta 780 aaacgaataa tctgcttctc tccttgtcgg cctgtgattt cgatatcgtt gttttcacag 840 aaacatggct tcacgctgat atagcttctt ccgaattatc ctgcaactat gtcatttacc 900 gttgcgatcg caatgcttcc accagctgtc tacaaagagg aggtggcgtg ctaattgcaa 960 ttaaatcgga gctaaactgc agagccgtgc atctagttga cagtgatgat ctagagcaaa 1020 ctgccgttcg gatcacctta ccactgttct ccttgtttgt ttgctgcatt tacattcgcc 1080 ccaatagttg cccagagaag tacaagaggc acgtcgactc agtcaaccag atttcgactc 1140 aagctggccc ccgcgacgtt atagtttgct taggagacta taatttgcca aatttagtct 1200 ggcatttcga cgaggagata ttggcatacc tgccgataaa tgcgtcaact gaacaagaac 1260 ttgctctsgc tgaagcaatg atgtccgccg gcttctatca agtgaacgat cttcttaacg 1320 tgaacggtaa gctacttgac ttggtatttg taagtgatca aagcatcatt gatctcttcg 1380 agtcaccata tgctttgcta aaagttgacg ctcaccataa gccgtttatt ttaaccgtgg 1440 acgcctgttc tcgcgacgaa gattcatcag caactttcac gtcaaatttc gactacgact 1500 tcaataactg tgactatgaa gagctcaatg cacgaatcgc ctccattgat tggcaaattg 1560 ttttgaacca tgtttccatc gatgatgctg tttccgcatt ctacagtact atccatcatg 1620 tcattcaatc gacagttccc gtgaggcctc gtcgaaattc taagcagttc agccaaccat 1680 ggtggacacc gcagcttcgc aacctgcgaa accgcttacg taaaataaga aaacgtttcc 1740 ttcgccaccg gaatgcttcg aacatgaggg agttgaagca tgctgaagag gagtacaatc 1800 tgcagcaacg taatcgtttc cgtgagtata ccgacagcct acaggagaat ttcaggcaaa 1860 acccttcgtc attttggtcc tacatcaaca aacgtaaaag ttcgagtaga gttcctgttg 1920 atgttaacta tcgctctcgt gtttcctgtt ctccagtcgc aactgcaaat ctatttgctg 1980 atttctttcg ttccgtgcat aatgatgcca cacctcagtt atcgcagcaa tacttagaca 2040 gtcttccggc atacaacttt cgcgttccgc aagtagcatt tagtaatgtt gaaattcttg 2100 aagcgctcag tacactagac gcatcgaagg gtccgggtcc ggatatgctt ccacccgtct 2160 ttattaagca atgcgcccat tcgctctctt tgcctgttac gtcgattttc aatcgctcgt 2220 tgattgaagg agtttttcct gatgcttgga agctcgcctc aattacccca atacataaag 2280 ggggaaatat gaacaacgtg gagaactaca gaccgatttc cattctaaat tgccttgcga 2340 aagtttttga aagcttaata cacggcatat tgtaccccgt agtacagcac gcgatatccg 2400 aatatcaaca cggctttgtc agaaaacgat caacgacgac aaatcttatg tcatatactc 2460 attctataat caaccatctg gagaaacgca atcaggtcga tgccatttac attgactttg 2520 ctaaagcatt cgacaaagtg ccgcatgaac tagctattga aaagctgagc cggcttggtt 2580 tgccattttg ggtgattcaa tggctgaagt cctatctctc atcgagaaaa gctttcgtca 2640 aagtgcacga cgcaagatca aatgtttttg atatcccctc tggagttcca cagggcagcc 2700 atcttgggcc actcattttc atcctgtata tcaacgatct gtgtgagacc ctcaactcaa 2760 gcaaactact gtacgccgat gacttgaagc tgttccgtgt catcaaatcg cttttggatt 2820 gctgcgcttt gcaagccgat attagaaccg tatctcgttg gtgtgaagta aatggaatgc 2880 agatgaatgt ttcgaagtgt aaggtgatta cattcacccg ccgtcagtct attatctcgt 2940 tcgattactc tctagataac acgtcgcttg aaaaggttcg aacgatcaag gaccttggag 3000 taacccttga tagcaagctt cgatttaacg aacacatctc ggtgacaaca gctaaagcca 3060 acgcgatgct cggtttcctg cgccggaaca catcgcagtt cgataacgtg tatgctctca 3120 aatcactgta ctgtgcttta gttcgaagca tactcgaata cggagtccag atatgggcac 3180 cgtaccacgc cgtgcatatt gaacgcatcg aaagagttca aaaacgcttc ctaaggtttg 3240 cattgcgtgg gctcccttgg aatgatccgg tcaatttacc gccctacgaa aatcggtgtg 3300 cactaatcgg tctgcagccg ttagcgagtc gtcgagtgat gcttcaacgc ctgtttacct 3360 tcgacgttat cagaaataac attgactgta gcagccttct ggagaatgtc agactacacg 3420 tgcccattcg tcagcttcga aaccgccaac tgttgtggat tcctggccat agcactcttt 3480 acggactgaa taatcctttt gatatatgtt gtcgactgtt taatcaagtg tgtaatgtgt 3540 tcgactttaa tattagtaaa ttaatttata agaataggat taagtgaatt ttaacagtct 3600 gtgcaacttc caagttgaag atgaagcaaa taaataaata aataaa 3646 // ID Copia-3_CQ-LTR repbase; DNA; INV; 176 BP. XX AC AAWU01023967; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_CQ_; KW Copia-3_CQ-I; Copia-3_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-176 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 322-322 (2011). XX DR Genome; AAWU01023967; Positions 10397 10572. XX SQ Sequence 176 BP; 55 A; 41 C; 32 G; 48 T; 0 other; tgtaggcgtt taactttcca tccccctggt caagcgcgtc atctgtcatc gacgtctgtg 60 ccatgtagtt tagttcaaag caaatgacat tacaaaaaca cacgcggttt agcaaaccag 120 aaataaagcc catttttcgt ttaaagttaa tcggagttta atcgcaaaaa ccagca 176 // ID Penelope-2_CQ repbase; DNA; INV; 2296 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Penelope-like element family from Culex quinquefasciatus - DE consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2296 RA Kojima K.K. and Jurka J.; RT "Penelope-like elements from the southern house mosquito."; RL Repbase Reports 11(1), 601-601 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >91% CC identity. Sequences 5-502 and 1773-2270 are terminal inverted CC repeats seen in several copies. XX FH Key Location/Qualifiers FT CDS 491..1792 FT /product="Penelope-2_CQ_1p" FT /note="reverse transcriptase and GIY-YIG FT endonuclease." FT /translation="CVCCLDMFCEIVEFCVNRSYFSFNGEYYIQKFGTAMG FT NPLSSPIADLVTEELIDHALEAVNFPIPHIKKYVDDMFLSLPADKIDYVKE FT VFNSQDNNIQFTVEVEQNRRLPFLDMTLVRQEDQTVRTEWYMKPIASGRFL FT NYHSHHPPHHKLNVAFNFAKRVKMLSTNLDHNTAANIIRKHLLINDYPKSL FT INRVISRTPQNQLFEEVNNNTVIDNSENDTAMDASTNNTPIFYSLTNIYGL FT TQKITKTLHKEYPNIQIAIKNTKTVASLLPLVKDKTPIQEQSNVVYAIPCN FT DCDACYIGITTTKLKTRMSGHKSNVKQLQELKAKGYTNTDAEISWAKEKSA FT LTSHVAAMDHSFGIDSVRIVDRSMKGANLPILESCHIKNTQHTVNKRTDTD FT NLHAAYAGILNEIKNIYTRKKNQNKINKNNNSRDNTHTNT" XX SQ Sequence 2296 BP; 755 A; 522 C; 463 G; 556 T; 0 other; aattataaat ttttaggtta ggttttggct ttcggtctgt ttggggaatc aaaacacttg 60 tatgttctct tcatccgacg tttcgacccg ttttgggcct ttttcaagga atctaggggt 120 agatttattg ggtaaggcac aaaaacataa aaaatgtaca cttatgctag gcttaccgaa 180 atgggagtgt ttgtcgttag gggatgctcc gtcaagcaca gaactgtctg gtagcggggt 240 aatctgtagt ttcagaaaac aaataaatat tcctaaatct acaaatattt ttctctccgt 300 gcactcacgt ttcagcgaaa ctggccgcta tccggtattc gctcgtcata cgggttcgtt 360 tcaacacgaa acgcgagaat ttttggggca cggataaatt cgacacactt tctaccattg 420 gcgcgcaaac tatggcaata aaactcattc tagtgtgtga gtgtgttaag gtgttggcat 480 tatgtgttag tgtgtgtgtt gtttggacat gttctgtgag atcgtagaat tttgcgtcaa 540 ccgtagctat ttttctttca acggagaata ctacatccaa aagtttggaa cagctatggg 600 caacccgctg tcatccccga ttgctgacct cgtaaccgag gagttgattg atcatgccct 660 cgaagctgtt aattttccaa tccctcacat caaaaagtat gttgacgaca tgtttctgtc 720 gctcccagct gataaaatcg actacgtgaa agaagttttc aacagtcagg acaacaacat 780 tcagttcacg gtcgaagttg agcaaaaccg acggctccca ttcttagaca tgactctggt 840 taggcaagaa gaccaaactg tccgcacaga atggtacatg aagccgatcg cgtcgggaag 900 gttcctcaac taccactcgc accacccccc acaccacaag ttgaacgtcg cgttcaattt 960 cgcaaagcgg gtaaagatgt tgtcaacaaa cttggatcac aacaccgctg caaacatcat 1020 ccgcaagcac ctgctgatca acgactaccc aaagtcgtta ataaaccgcg tcatttccag 1080 aacaccgcaa aatcagctgt ttgaggaggt aaacaacaac acagtgatag acaatagtga 1140 gaacgataca gctatggacg caagtacgaa taacacacca atattctact cgctcaccaa 1200 tatttatggt ctcacccaga aaataacaaa aacacttcac aaggaatacc caaacataca 1260 aatagccatc aaaaacacaa aaacagtagc ttccctccta ccactggtga aagacaaaac 1320 accaatacaa gaacaatcca acgttgtgta cgctatacca tgcaacgatt gtgatgcctg 1380 ctatattggg attacaacaa caaaactgaa aacaagaatg tctggacaca aatcaaatgt 1440 aaaacaacta caagaactga aagcgaaggg atacacaaac acagacgcag agatcagctg 1500 ggcaaaagag aagtcggcgt tgacgagtca cgtagcagct atggaccact cgttcgggat 1560 cgactccgta cgaattgtag atcggtcaat gaagggggcc aatctaccga tcctggagag 1620 ttgccacatt aaaaacacac aacatactgt caacaagcgg acagacacgg ataatctgca 1680 tgctgcatat gccgggattt tgaacgagat taagaacata tacacaagga agaaaaacca 1740 aaacaaaata aataaaaaca ataacagtag ggacaacaca cacactaaca cataatgcca 1800 acaccttaac acactcacac actagaatga gttttattgc catagtttgc gcgccaatgg 1860 tagaaagtgt gtcgaattta tccgtgcccc aaaaattctc gcgtttcgtg tggaaacgaa 1920 cccgtatgac gagcgaatac cggatagcgg ccagtttcgc tgaaacgtga gtgcacggag 1980 agaaaaatat ttgtagattt agaaatattt atttgttttc tgaaactaca gattaccccg 2040 ctaccagaca gttctgtgct tgacggagca tcccctaacg acaaacactc ccatttcggt 2100 aagcctagca taagtgtaca ttttttatgt ttttgtgcct tacccaataa atctacccct 2160 agattccttg aaaaaggccc aaaacgggtc gaaacgtcgg atgaagagaa catacaagtg 2220 ttttgattcc ccaaacagac cgaaagccaa aacctaacct aaaaatttat tgatacggtc 2280 gaataaacag caggat 2296 // ID Copia-7_CQ-I repbase; DNA; INV; 2617 BP. XX AC AAWU01003031; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_CQ_; KW Copia-7_CQ-LTR; Copia-7_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2617 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 329-329 (2011). XX DR Genome; AAWU01003031; Positions 10959 8343. XX CC 'AAAGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 88..2607 FT /product="Copia-7_CQ-I_1p" FT /translation="MDRIGVVKLNGTNYCSWKRKVEFLLIRDDLWRYVVGT FT KPVPRRAAVGSGSDGAAGADQAVLNAAEIEAWDNGDQRARATIGLLMEDTQ FT LPLIKNATTAKLCWNALKDHFETVTLTSKVALLKRLCGMQYSEGEDIQQHV FT QQMEEIFERLAMAGQELDEDLCVAMILRSLPSSFNTLTTALEARSDEELTL FT ELVKMKLVDEVAKSGTRGAGESVLKVDHQKRKMVLICFFCKKPGHKEKFCR FT AKNGESSQLEAEPDLPQAKTAHEDAKENFSFLTLDQKSGRDVWIIDSGATS FT HMCFNPDCFVSMDRSVKQDVYLADGKKTISEGVGSCKLIWESPEGKRNDVV FT LSDVMYVPKFETSIISVKKLTACGAKVDFDASGCRILKNNKVIGSAAMAGG FT LYQMRPVREALWRNGVSERKSRSARSIDPETHEWTISRHCHFMTHQEVRET FT EPSQGSIVLYPFSEVFPKECVADDVPVVLVPDDEHDGGEDPAGAVNDGEEN FT ADDPIEADSSDFEVEVSGDGWNDTDDVDDALSPEDVEIRTAGRQEELQDGR FT SVCRTTQGVLQCRDLVDPIQEQWEPCTLADVMECSDREKWLRAFKEEIQFL FT HENSVRFETGAMLPGDVYFLEKSCYGLEQAGRVWNQQISVVLRNLEYQPSE FT ADPCLFVRKKADNRSFDGSQNGRFWLDQNAYIRTITVRIGQEDAKLSRIPI FT VPGCPKRQQKEEESLPRKNDYVSALLYVAVNTQPDIAISAFTLGRKRSNRI FT ETGKTDAERTQRYPNATEELELGGTGRRSSGPSRKRGAVGDLPEVAIADEA FT NAERGRRPEPLRFLMVAAAEDLACEAVEGVEEFQTNAA" XX SQ Sequence 2617 BP; 622 A; 615 C; 852 G; 528 T; 0 other; ggttatgggc ctgcacctgt aaccaggcga cgaaacaaac atcaagaagt tttttcacga 60 agcaaatttt caagaccccc actgaagatg gaccggattg gagtggtgaa gctaaacgga 120 accaactact gcagctggaa gcggaaggtg gagttcctgc tcattcggga cgatttgtgg 180 cggtacgtag tcggaacgaa gccggttcca cggcgagcgg ctgtcggttc cggatcggat 240 ggtgctgctg gcgcggacca agccgtgctc aacgcggcgg agattgaagc ttgggacaac 300 ggcgaccaga gggcgcgggc cacgatcggg ctgctgatgg aggacaccca gttgccgctc 360 atcaagaatg cgacgacggc caagctctgc tggaacgccc tgaaggacca ctttgagacg 420 gtcaccctga cgtctaaggt ggctctgctg aagcgattgt gcgggatgca atactcggaa 480 ggagaggaca tccagcagca tgtccagcag atggaagaaa ttttcgagag gctggcgatg 540 gccggtcaag agttagacga ggacctgtgt gtggccatga tcctgcggag tctgccaagc 600 tcgttcaaca cgcttacgac ggctttggag gcgaggtccg acgaggaatt gactctggag 660 ctggtcaaaa tgaaactggt ggacgaggtg gccaaaagtg gaactcgcgg agccggcgaa 720 tctgtcctca aggttgacca ccagaagcgc aagatggtgc tgatctgttt tttctgcaag 780 aagccgggcc acaaggagaa gttttgtcga gcgaaaaacg gtgagtcgag tcagttggag 840 gccgagccgg acctgcccca agcgaagacg gctcacgagg acgccaagga gaatttttcg 900 ttcttgacgc tggatcagaa aagcggccgt gacgtgtgga ttatcgactc gggagctacg 960 tcccacatgt gcttcaatcc agattgtttc gtgtcgatgg atcggagcgt gaagcaggac 1020 gtttaccttg cggacggcaa gaagacgatc tcggagggtg ttggctcatg taagttgatc 1080 tgggagtctc cggagggcaa acgaaacgac gttgtcctgt ccgatgtgat gtacgtgccc 1140 aagttcgaga ccagcatcat ctctgtgaag aagctgactg cctgcggggc caaggtcgat 1200 ttcgatgcga gcggatgccg gattctcaag aacaacaagg tgattggatc cgcggcgatg 1260 gccggcgggc tctaccagat gcggccggtt cgtgaagcgc tgtggcggaa tggagtctcg 1320 gagcgcaaga gtcgctcggc tcgttccatc gatccggaga cgcacgagtg gaccatcagc 1380 cgacattgtc attttatgac acatcaggag gttcgggaaa ccgaaccttc gcagggaagt 1440 atcgttcttt atccattttc tgaagtcttc ccgaaagaat gtgtggctga tgacgtgccc 1500 gttgttctag tgcccgatga cgaacacgat ggcggtgagg atcctgccgg agcggtcaac 1560 gatggcgagg agaatgctga tgacccgatc gaggcagact ccagcgattt cgaggtggag 1620 gtttccggcg acggatggaa tgatacggac gacgttgatg acgcgttgtc gcccgaagac 1680 gtcgagatcc ggactgcagg cagacaggag gagttgcaag atgggcggag tgtttgtcgt 1740 acaactcaag gtgtcctgca atgtcgggac ctggttgacc ctatccagga gcaatgggag 1800 ccctgtacgt tagctgatgt gatggagtgc tcggatcgcg agaagtggtt gcgtgctttt 1860 aaggaggaga tccagttcct tcacgagaac tcggtcagat ttgagaccgg cgcgatgcta 1920 ccgggcgatg tctactttct tgagaaaagc tgttacgggc tggagcaggc gggacgcgtc 1980 tggaaccagc agatttctgt cgtgctgcga aacttggagt accagccgtc agaagcggac 2040 ccctgtttgt tcgtgaggaa gaaagccgac aatagatcgt tcgatggttc gcagaacgga 2100 cggttctggc tggaccaaaa cgcgtacatc cgtacgatta ctgtgaggat tggtcaagaa 2160 gatgccaagc tctcgcgtat tccgattgtt ccaggttgcc cgaaacgcca gcaaaaggag 2220 gaggaatcgc tgccccggaa gaacgactac gttagtgctt tgctgtacgt tgcggttaac 2280 acacagcctg atatcgcgat cagtgcgttc actctgggac gcaagaggag caaccggatt 2340 gagacgggta agaccgacgc tgagcggacg cagcggtatc cgaacgccac ggaagaattg 2400 gagctcggag gaacaggtcg acgatcatca ggaccaagcc gaaaacgtgg cgcagtcgga 2460 gacctgccag aagttgctat agcggatgaa gctaatgccg aacgtgggag acgaccagag 2520 ccgctccggt tcctgatggt cgctgctgct gaagatctag cttgtgaagc tgttgaagga 2580 gttgaagagt tccaaaccaa cgcagcttga ggaggag 2617 // ID Syrinx_DS repbase; DNA; INV; 3587 BP. XX AC . XX DT 17-FEB-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Non-LTR retrotransposon Syrinx_DS - a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; SYRINX_DS; reverse transcriptase; KW AP-endonuclease. XX OS Darwinula stevensoni OC Eukaryota; Metazoa; Arthropoda; Crustacea; Ostracoda; Podocopa; OC Podocopida; Darwinulocopina; Darwinuloidea; Darwinulidae; OC Darwinula. XX RN [1] RP 1-3587 RA Schoen I. . and Arkhipova I.R. .; RT "Two families of non-LTR retrotransposons, Syrinx and Daphne, RT from the Darwinulid ostracod, Darwinula stevensoni."; RL Gene 0, 0-0 - (2006). XX DR [1] (Consensus) XX CC Syrinx is a non-LTR retrotransposon from a Darwinulid ostracod, CC Darwinula stevensoni, and is a sister element to the jockey CC clade. CC The consensus sequence is assembled from sequences of 32 CC PCR-generated clones, which diverge from the consensus by CC 0.5-4.5%, and is 5' truncated. Syrinx codes for a protein CC containing the AP-endonuclease and reverse transcriptase domains. CC The 3' UTR of Syrinx is ~1kb in length, contains internal CC polymorphic oligo(A) stretches, and ends with an oligo(A) CC sequence. XX FH Key Location/Qualifiers FT CDS 1..2613 FT /product="SYRINX_DS_1p" FT /note="AP-endonuclease/reverse transcriptase." FT /translation="GTYCRGSAVYILSNLDSELISRSWQTDSEIIGVKILF FT SLTTLSVFHAYFPPNFPPSPSDLDFINQINGPSLLLGDLNAHSPDLSFCSE FT PNTTGNILSDFIASSSFSILNDDSPTHHCPGLGNSYRIDLALGNAQFLPFF FT HSFSIGEDVGSDHVPILINCNFSHEPPPSTSLPRFDFRNADWHSFQDVLET FT LLPHSLPLETVPQLEEAVTLITEAIQIAQDIAVPKTTPSHKPYNLSVETLN FT KIKERRKLRRLFYRNKDPTLKPLINKLNHEIKALIKIESDQYWCDFCEQLN FT SEEDPANFWKLFKRVGNPRNSKSRPLKFDGGTHCSDESKASVFASTLLEAM FT TEPPSSLGRRPPEVCLDLEKKICNFSLSDLSSKPEYLSPSQKALTYEITLD FT EIEAAIKVSPNKAPGPDNIFVNSIKHLSSRALHSLYLIYNACLRLGHVPLG FT WKSAIITMIPKPDKDLSDPSSFRPISLLSCLQKLLERILTSRLNDYLETNN FT LLSPSQSGFRKNLCTTDQLVRLHHDALVAVHSKMHLLALFFDVTKAFDKVW FT HAGLIFKLFFHFKIPLQLLRWTASYLTERSFRVRVGNSFSETRSPTAGVPQ FT GSVIAPLLFILYVNDLSSSIPRKLNVCTSQYADDTALWLSGRDVTALESRA FT QAALNALSYYCRNWRISMNPSKSSLLLFARDRKPHSVNVRLDGVLIPRSRF FT TRFLGVNFDSNLRWNEHVNVIRTKSIRKLNCLKILAGVNKCEPHVILKLYV FT TYLRPVLTYAFAAWANCPETVLDQLERIERAAIRYAFRLPTHFSNSYIYRI FT SGLTPLSRHASQLAYNYYNDPKRPGDIKEIPVRLNKKFTIGRFKHCKTYPY FT NKILERLGKTTNCSPPDP" XX SQ Sequence 3587 BP; 909 A; 1047 C; 645 G; 986 T; 0 other; gggacgtact gtaggggttc agcagtttac attctcagta acctggactc agaattaatt 60 agccgctctt ggcaaactga ctcggagata attggagtta aaattctttt ttccctaaca 120 actctctctg tcttccatgc ttattttcct cctaatttcc ctccttcccc ttccgatctt 180 gacttcatca accaaattaa cggcccttca cttctcctag gcgatctaaa cgcccactct 240 cctgatctct ctttctgctc ggaacccaat accactggta acatactcag cgacttcata 300 gcctccagca gcttctctat tttaaatgac gacagcccca cccaccactg tccaggacta 360 ggaaattcgt acagaattga cctagcactg ggcaatgcac agttcctccc ttttttccat 420 agcttctcca taggagaaga cgtagggagc gatcatgtgc ccatcctcat taactgtaac 480 ttctcccatg aacctcctcc ttcaactagc ctccctcgct ttgattttcg taacgccgac 540 tggcatagtt tccaggacgt gttagagacc ctcctccctc attctcttcc tttagagacc 600 gtcccccaac tcgaggaagc agtcactctc atcactgaag cgattcaaat cgcccaggac 660 attgccgtcc ccaagacgac tccctcccat aagccttaca atctctctgt agaaaccctc 720 aataaaatta aggagcggag gaaactccgt cgtttatttt atcgcaataa agaccccacc 780 ctgaagccat taatcaataa gcttaatcat gagattaaag ccctaattaa gatcgagagc 840 gatcagtact ggtgtgattt ttgcgagcaa ttaaatagcg aagaagatcc agctaatttt 900 tggaagttat ttaaaagggt tggcaaccct agaaacagta aaagtagacc ccttaaattc 960 gacggcggga ctcactgcag tgatgagagc aaagcctccg tttttgcatc tactctgctg 1020 gaagctatga ctgaaccccc cagctccctg gggcgtcgcc ccccagaggt ttgcctcgat 1080 ctcgaaaaga aaatctgtaa cttctctctc tcagacctct cctctaaacc tgagtatctc 1140 tctccctcac agaaagctct cacttacgaa ataacgctcg atgagatcga ggccgcgatt 1200 aaagtttccc ctaataaagc ccctggcccc gataatattt tcgttaatag cattaaacat 1260 ctctcctcgc gtgcccttca ctctctctat ctcatctaca acgcctgtct ccgcctagga 1320 cacgtcccgc tcgggtggaa gtccgcaata attactatga tccctaagcc cgataaggac 1380 ctttccgacc cttcttcctt tcgtcctatt agccttctca gctgtctcca aaaactctta 1440 gaaaggattc ttacctctcg ccttaatgat tacctggaaa caaataattt actctctccc 1500 tctcaatcgg gctttcgtaa gaatctttgt acaaccgatc agctcgttcg tctccaccac 1560 gatgcgttgg tggctgtcca cagtaaaatg catctcttgg ctcttttctt tgacgtgact 1620 aaagctttcg ataaggtctg gcatgcaggg ctaatcttta aacttttctt tcactttaaa 1680 attcccctgc agcttctcag atggacagcc tcctatctga ccgagagatc gttcagggta 1740 agggtcggga actctttctc cgagacccgc tcccccaccg cgggggttcc gcagggctca 1800 gtaatagcac cccttctttt cattctctac gttaacgacc tctcgagctc gatccctcgt 1860 aaacttaacg tttgtacgag ccaatacgcg gacgataccg ccctgtggct ctcgggcaga 1920 gatgtcactg ctctggagtc gcgagcccaa gcggcgctca acgctctctc ttactactgt 1980 cgcaactggc gtataagtat gaatccctcc aaatcctccc tcctcttgtt tgcacgcgac 2040 aggaaacctc actccgtaaa cgtccgtttg gacggtgtgc tcatcccacg ctcgcggttc 2100 acccgttttc ttggtgttaa ttttgacagt aatcttagat ggaacgaaca cgttaacgta 2160 atccgcacca agtcgatcag aaagcttaac tgcttgaaga ttttggccgg ggttaacaaa 2220 tgtgagcctc acgtaatttt gaagttatat gtaacgtacc tccgcccggt cttaacgtac 2280 gcgttcgctg cttgggcaaa ctgcccagag acagtcctcg accaactaga acgcatcgaa 2340 cgagctgcta taagatacgc cttccgcctt cccactcatt tttctaactc atatatatac 2400 agaatcagcg gactcacccc tctctctcgg cacgcctctc aactagctta taactattat 2460 aacgacccta aaaggccagg ggacatcaaa gaaattcccg tgagattaaa taagaaattc 2520 accatagggc gcttcaagca ctgtaaaact tatccttata ataaaatctt ggaacgttta 2580 ggaaaaacca ctaactgctc tccccctgac ccctgaacca tcccgctctc atcattcttg 2640 tgtcaacagt gctttttttc tggggcacct gaactgatga gggtgaaaaa atatctcttt 2700 tttctgtctc actctatttt cgctccactt ccttcctcct cctattcctt cgcgcagtgc 2760 tttttttctg ccgcggagga atatccagag cgcactatcg ctcaaagcca cttttcattt 2820 ccttccctca caccttcctt tcactttctc tctcttctcc cactcgctcg ctcctctctc 2880 cttccttttt ctgttgctct ttctaagcat caaaaaaaaa aaataaaaat tcagaccctc 2940 tcactctcac tttagcggcc aatccaatac ctcactcctc aaactcctct cgcatgactc 3000 caatgtctca aataacaccc tgcttcagct cttctcacca gggcgcctga taaggccctt 3060 ccggccagct ggcacctgga gacacctatt cttcctatgg ggcatctatc ctttaaacaa 3120 cttccgacag gatgtaaatc cagtaaattc accatgaaat aagttaaaac gtacctaaag 3180 aaagaaaatc gtaaactgct agggcgtaaa ttaaacttct tcagaaacaa gtggacctaa 3240 aacgtaatca gccccaaatc tactaaccaa ttcaatgata ggatcttctt gtttttgctt 3300 ctcattaaag gccgctctct gtcaaaaaag gaatatagaa ttaacagcta aaagtaataa 3360 cgggagagat ccaggcgtgg tggcgagcgt gcctgcacac tcgtcctcac gcctgagtca 3420 tgggtcacct tcccatggcg ttcacaacag cttctccagg gtattgcact ctgccagccc 3480 tggtgttggg aggatctcac cgtggtgggc gggagtcatg tggcggtcca ccactgacag 3540 cggggcccac cataccatta gctcctttta ttattaaaaa aaaaaaa 3587 // ID Gypsy-200_AA-LTR repbase; DNA; INV; 200 BP. XX AC supercont1.67; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-200_AA_; KW Gypsy-200_AA-I; Gypsy-200_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-200 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.67; Positions 208125 207926. XX SQ Sequence 200 BP; 63 A; 44 C; 42 G; 51 T; 0 other; tgtagcgtac cgcgtacgaa tctaatctag cgcgttcggt aaaacgatcc aacccctgtc 60 gttatacggc cgagaagccg atcgattatc aagtaggcta agaagcgtat cgaatataga 120 tccagcaaca gaagattctg tgatctggta gaaaccacac gtagtttagt tttctcttca 180 atataccgaa agtttaaaca 200 // ID Pifo_I repbase; DNA; INV; 8112 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.03, Created) DT 05-JAN-2009 (Rel. 14.03, Last updated, Version 1) XX DE Internal portion of Pifo retrovirus-like element. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Pifo_I. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-8112 RA Bartolome C., Bello X. and Maside X.; RT "Widespread evidence for horizontal transfer of transposable RT elements across Drosophila genomes."; RL Genome Biol 10(2), R22-R22 (2009). XX RN [2] RP 1-8112 RA Bartolome C., Bello X. and Maside X.; RT "Internal portion of Pifo retrovirus-like element."; RL Direct Submission to Repbase Update (05-JAN-2009). XX DR [2] (Consensus) XX CC Conceptual translation suggests that ORF1 and ORF2 overlap 55 CC nucleotides. There is a one-nucletide frameshift (+1) between CC both CC ORF. CC Positions [3271-3306] - Retroviral aspartyl protease CC Positions [4117-4671] - Reverse transcriptase (RNA-dependent DNA CC polymerase) CC Positions [5701-6204] - Integrase core domain. XX FH Key Location/Qualifiers FT CDS 2120..3202 FT /product="GAG" FT /note="Conceptual translation, with homology to GAG FT protein." FT /translation="MSKKLTQNIKKTAQSVLGVPTPVSRTIRPNTRSSGLP FT IVPEIIDHPLTPNMMDSGNASNNSAHSVRPGPPSLTPTVSGISSLSTTFKP FT KDIMAFVEHLPTFDGTPRYLDRFITSVEEILMLIRGADHTPYGLLTLRTIR FT NKIVDRADEALELANTPLVWDEIKGNLIRLYSSKKSEASLLSEMSSFSDRQ FT SLGQLFFGLSKIKSQLFSILKNTELNHNIVEAKRTVYNEVCLNAFITGLRE FT PLKTFIRLKAPQTIEQAYEQCQIEQSLYRTNNRGPNETFTKNRRDNQNTRD FT RRHDGRTNNDRNAPYPNNNYNQQPNSRETNQSSTANPFRTQSGRLNNIEDD FT TPDTNFQQAASVTHQGT*" FT CDS 3148..6534 FT /product="POL" FT /note="Conceptual translation, with homology to POL FT protein." FT /translation="HTRHKFSTGSLRNPSGYISPATHESSLPFIEINLPFS FT PHLKFLIDTGSTHSFIDPKHIESKDCVGLEKPLTLKTALNTFVLQKKFVIP FT FPSEFKTQGKMNLLPFRFHSFFDGLLGMDSLSFLKAKIDILNSQLKTSNTI FT LPIINYTNQTSNVFCITAHTKTTIPLPVQNDQGDFLYDTTYINKDLIISGG FT LYKSNNHLTTFEVANYRDSDQLLYLESPLKGIPYVKADFVELNNISSDPPK FT LPDLVNPLDNLKMEHLNPEEKQNLITLCKEFSDIFFNETVKLSFTNKITHS FT IPTTDNTPIHVRSYRYPYIHKEEVKKQINSMLNQDIIRSSYSPWSAPVWVV FT PKKAGPNGEQKWRLVIDYRKLNDKTISDRYPIPNINDILDRIGKAKYFTTL FT DLASGFHQIEMNPDDIAKTAFTVEGGHYEFIRMPFGLKNAPATFQRVMDSI FT LGDVIGLNCLVYLDDIIVFSASLQEHVIHLTSIFQKLRDANFKVQLNKSDF FT LRKETEFLGHIVTQEGIKPNPDKIAAIKNFPCPKNKKEIKSFLGLLGYYRK FT FIKDFAKITKPLTKQLKGNNKTITIDDEFTKTVDFCKILLTNDPILQYPDF FT TKPFILTTDASNFALGAVLSQGTLQNDKPVCFASRTLSDTEVNYSTIEKEM FT LAIIWAVQYYRPYLFGTKFTIVTDHKPLTWLMNFKQPNSKVIRWRLQLLEY FT DFEVVYKNGSQNVIADALSRSNANLNHNEIVPNSPYECPVSDKPLNDFNIQ FT LVIKLSQDTGYNTSTPSKHKLRREYYRPLLIRCLSFLPELTDDSEITRTIV FT DYHTKNNHRGIDETFLHLKRKIFFPYMKDKITKIIRTCETCLKLKYDRHPQ FT KIPFQITETPLKPLDIVHIDIYTINNNYNLTIIDKFSKFAAAYPIPNRNCI FT NVVKSLKHFISQFGIPKKLVYDQGAEFSSDMFSNFCSQYNIVLHVTSFQQS FT SSNSPVERLHSTFTEIYRIIIDTRKQLKLNTEHDETMSETLITYNSAIHSA FT TEHTPFELFTGRTHTFENNTKFSNEHDYLVKLNEFKQKIYPLIADKLSEKA FT LQRTLKINETRTPPATLEPETLVLRKENRRNKITPRFSLHKVLHDQGPTLV FT TTRNQKLHKSKIRRIILKKK*" FT CDS 6524..8110 FT /product="ENV" FT /note="Conceptual translation, with homology to ENV FT protein. ORF extends 202 residues into LTR." FT /translation="RKNKFYCRFTIPTIYAISIQPLTNNTPIAKIELGQAL FT IINRYKRVSHVINLEEFSKCIEQFHKTLQTFKYDDTLIDSLAILKSKLGQA FT QTKLNALTPLKRNKRGLINGLGSLVKAVTGNMDAEDAREIGKDIENIKIAL FT STVNTNYQTQDVFNNEILIRFENITNHINNEQVLITTFFENMQNKIYKEEN FT TLEKLQYINRINYNLELLLNHLNDIIESMLLAKINVIPRFILTEQEIYKIK FT MLLENQNITIKSEQHIYDLLKMNTLSYENNIIFTIQIPIFEKDNYKIARLI FT PLPINNKYFVMIPNYLIYKNNVNKYYLTQNCPKVDDTFICNKDAYINTPQN FT DTCTQQLLDSKNSSCDVQERGPVADVFEAEKGYIFAFNANNLKAQLKNGSE FT ITINGSAIIKYVNETIRINGIEYDNGVETTFEHLDLILPPLKEMTRNKTVE FT ILSLQNIHLEAIETSNKILTIHQTATQHAWTLYILLGFVAILTVTAWLLRR FT TKHVFHVQDHNHHLPIYAPAIPSLWPSLQTGEG" XX SQ Sequence 8112 BP; 2994 A; 1803 C; 1223 G; 2092 T; 0 other; ggcgcagccg gtaggatgct ttcctacatc tatacattaa cttacctgta agtaaagaaa 60 aatgtaaaaa acaaagcata agtgatagct caataagtaa agaatcttaa ataaaaaggg 120 aaaaatatat tctgctacag tgacaacaaa atgtaaacag tgagttgtga gctgcacacg 180 gtgacgtata agagcttggg taaaattaaa aagaaaaata tttaaacata gccaacagac 240 aacccaagta atatgcgcga aattgtttgc agcgaacaac gcattgggtc agcaaaataa 300 aaaaagaaag aatttttttt tatcgcataa catgttacaa cacaaatttt tttttttact 360 tctctgctat tgtgacgaca aaatataaac agtgaattgt gagctgcaca tgatgacgca 420 taagtaaagt ttaaaagaaa aattattaaa gcatagccaa caaacaaacc gagtaatgtg 480 cgctaaattg tgtgtagcaa acaacgcatt gggtcagcaa aataagtaaa aaatagccac 540 ggtataacaa atttaatagt ctctacctaa aattccacgg cctaaggtat cccctacaat 600 ggcaaataga atatcaaaca taaccatata ttacttaata aactatcttg caattatgtc 660 acaactcaca ttaaatcact gcaaatcaaa tactctatga tatgggtctt atgctatacg 720 gctcacgaaa caagcgtgtt tcgtgccgct gtggcatacc atccatgcat acatatatat 780 atacctaaat acacaatatg cacacaagct cgcgaaccgg atactctgtg atatgggctc 840 atgctatacg gctcatggta caagcttgtc ccatgatgct gtggcagacc acccatgcat 900 acatatatat atacctaaat acacaatatg cacacaagct tgcaaccaac gtatatacac 960 acatatacac caaattacgc acacaagcct gcgaaccgat actctatgat atgggtctat 1020 gctatacggc tcagggtaca agcttgtgcc ttgatgctgt ggcatatcac gcatatacgc 1080 atagatatcc aacgtatata cacatgtata caccaaatta cgcacacaag cctgcgaacc 1140 gatactctat gatatgggtc tatgctatac ggctcagggt acaagcttgt gccttgatgc 1200 tgtggcatat cacgcatata tgcatagata tccaacgtat atacacatat atacaccaaa 1260 taacgcacac aagcctgcga accgatactc tatgatatgg gtctatgcta tacggctcag 1320 ggtacaagct tgtgccttga tgctgtggca tatcacgcat atacgcatag atatccaacg 1380 tatatacaca tatatacaat aaatcacgca cacaagcttg cgaaccgata ctctgtgata 1440 tgggttctat gctatacggc tcatgataca agcttgtgtc atgacgcagt ggcataccac 1500 gcatatacac atagatattc aaacacagta tatgtacaca tgctacaacc aataaatacg 1560 cttatacata taaccataaa attactgcaa tacgaaaaaa caatgagaca ctttacccca 1620 acctacccta acaattaaag tttatacgct tccactcgag taccgaatgg ctaatgtaac 1680 acaataccaa tagtcacgcc gtcagcataa acaaacaatt tatactcttg cacaccaaat 1740 acaatatgtg ttatgcgatt atacttgtac atatatatac gaatacgtat atgcatagtg 1800 agtggccaag cctatcgagt caaaattctt tcttttttat aatcgtcgac attcacttat 1860 gctatgttcc gaatactata aactatgtat acaacaattt tataatacca actatattac 1920 agatattgaa tcacataaaa accatacaaa acccatatac attaagaaat ttatacgcac 1980 aaatacaata tatacatata caatacaata catacttata cacacttata aaaaattcca 2040 cattatatta aaacttgttc attattttct ttaatataaa ttccacatca gaaacaaaat 2100 ttttttttaa ttaggttaaa tgtccaagaa attaactcag aatatcaaga agaccgcaca 2160 gtctgtatta ggagtcccca ccccagttag cagaactata cgccccaata cccgttcttc 2220 tgggttgcct attgtccctg aaataattga tcaccctctt accccgaaca tgatggattc 2280 aggcaatgct tccaataatt cagctcattc cgttcgccct ggaccaccct ctttaacgcc 2340 caccgttagt ggaataagct ccctgtcaac cacatttaaa cctaaagata tcatggcttt 2400 tgtcgaacat ttaccaacgt ttgacggaac ccccagatat ttagacaggt tcattacaag 2460 cgtagaagaa attctcatgt taatcagagg ggcagatcac acgccatatg gtttgctcac 2520 tttgaggaca atcaggaaca aaatagtaga cagagccgac gaagcgttag aactagcgaa 2580 tacaccgtta gtttgggacg aaatcaaagg caatctcatt aggctctact ctagtaagaa 2640 gagcgaggct agtttgctga gtgaaatgag ttcattttca gaccgccaaa gtttgggaca 2700 attattcttt gggctttcaa aaatcaaaag ccaattgttc tccatactca aaaacactga 2760 acttaaccat aacatagttg aggcaaaaag aacagtgtat aatgaggtct gcctcaatgc 2820 attcatcacc ggacttaggg agccgctcaa gacattcatt cgtttgaagg cccctcagac 2880 aatagaacaa gcttacgaac aatgccaaat tgagcagtcc ttatacagaa ctaacaatcg 2940 cggacccaac gaaacgttca ctaaaaatag acgtgataat caaaacaccc gagaccgacg 3000 tcacgacgga cgaactaata acgatcgtaa cgcaccatac ccaaataata actacaatca 3060 acaacccaac tctagggaaa ccaatcaatc cagtacagca aaccctttta ggactcagtc 3120 tggtcgcctc aataatatag aggatgacac acccgacaca aattttcaac aggcagcctc 3180 cgtaacccat cagggtacat aagtccagca acccacgagt cctcactccc cttcattgag 3240 ataaatctcc cctttagtcc ccatttaaaa ttcctaatag ataccggttc tacgcattct 3300 ttcattgatc ctaaacatat agaatcaaaa gattgcgtgg gcctagaaaa acctttaacg 3360 ctcaaaacag ccctaaatac ctttgtactt caaaaaaaat ttgtcattcc ttttccatct 3420 gagtttaaga cccaggggaa aatgaatcta ttgccgttta gattccactc cttttttgat 3480 ggattattgg gcatggactc actttcattt ttaaaggcaa aaatagatat attaaattca 3540 caattaaaga catctaacac aattttgcct ataattaact ataccaacca gacctccaac 3600 gtgttttgca ttacagccca tactaaaacc acaataccct tgccggtaca aaatgatcaa 3660 ggggattttc tctatgacac gacttatatt aataaagacc ttatcatatc agggggcctt 3720 tacaaatcaa ataaccactt aacaaccttc gaagtggcca actacagaga ttctgaccaa 3780 ctcctatatc tcgaaagccc tcttaaaggc ataccatatg tcaaagctga ctttgtagag 3840 cttaataaca tatcttctga cccacccaag ctaccagact tagtaaaccc cttagataac 3900 ttaaaaatgg agcacctcaa cccggaagag aaacaaaacc taatcacatt atgtaaagaa 3960 ttctcagaca ttttttttaa cgaaacagta aaactctcat ttacaaataa aattacgcat 4020 tctatcccta ccacggacaa taccccgatt cacgtaaggt cctatagata tccgtacatt 4080 cacaaagagg aagtaaagaa acaaattaat tctatgctaa accaagatat aatcagatct 4140 agttactcgc cctggagtgc tccggtttgg gtagtcccca agaaagccgg gcccaatggg 4200 gaacaaaaat ggagattggt tatagattat cgaaaactca atgataaaac catctccgat 4260 aggtacccaa taccaaacat aaatgatata ctagatcgca tagggaaagc caaatacttt 4320 accaccctcg atttggcaag tgggtttcat caaattgaga tgaacccaga cgacattgcc 4380 aaaacggcct ttacagtaga aggagggcat tatgaattta taagaatgcc tttcggctta 4440 aaaaatgctc ccgctacttt ccaaagagta atggacagca ttttaggtga cgtaataggc 4500 ctcaattgtc tggtatactt ggacgatata atcgtttttt ctgcttccct ccaagaacat 4560 gtaatacatt taacttcaat atttcaaaaa cttagagatg caaactttaa agtacaatta 4620 aataagtccg attttcttag gaaagaaacc gaattcctcg gacacatcgt cacccaagaa 4680 ggaatcaaac caaatccaga caaaatagcg gctataaaga atttcccttg tcctaagaat 4740 aagaaagaaa ttaaatcttt cttggggtta ttagggtact ataggaaatt tataaaagac 4800 ttcgcaaaaa ttactaagcc tttaactaaa caactaaaag gtaacaacaa gactatcacc 4860 atagacgacg aattcacaaa gacggtcgat ttctgtaaaa ttttacttac caatgacccc 4920 atactccaat accctgactt tacaaaacct tttattttaa cgacagacgc aagtaatttc 4980 gcattgggag ctgtcctttc acaaggcact ctgcaaaatg ataagccagt ctgtttcgct 5040 agccgaactc tctcagacac agaagtaaac tactcgacaa tcgaaaaaga aatgttagct 5100 attatttggg ctgtccaata ctatagaccc tacctcttcg gaacaaaatt caccatagta 5160 acagaccaca aacctctaac atggcttatg aacttcaaac agcctaattc caaagtaatt 5220 cgttggagac tccaactctt agaatatgac tttgaggttg tctataaaaa tggctcacaa 5280 aacgtcattg ccgacgcgct tagccgctcg aacgctaatc tcaatcacaa cgagatagtc 5340 ccaaattcac cgtacgaatg ccctgtatca gacaaaccac tcaacgattt caatattcag 5400 ttggtcataa agttaagcca agatacaggt tacaatacct caactccttc taaacacaaa 5460 ctgcgacgag aatactatag acctcttttg ataaggtgcc tatcttttct tcccgaactc 5520 acagacgatt ctgaaattac ccggaccata gtcgactacc atacgaaaaa taatcataga 5580 ggcatcgacg agaccttcct acacctaaag aggaaaattt ttttcccata tatgaaggat 5640 aaaataacta agataattag aacttgtgaa acttgtctaa aactaaagta cgatagacac 5700 ccacagaaaa ttccatttca aattacagaa acccccctta aacccctcga catcgttcat 5760 attgatattt acactatcaa caacaactac aatttgacaa taatagacaa gttctctaaa 5820 tttgcagccg cctacccaat tcctaataga aactgtatta acgtggttaa atctttgaaa 5880 cattttataa gccaatttgg cataccaaaa aagctagttt acgatcaagg ggccgaattt 5940 tctagcgaca tgtttagcaa cttctgctca caatacaata tcgtcttaca tgtcacatcc 6000 tttcaacaat cctctagtaa ctcccctgta gaacgccttc actccacatt tacagaaatc 6060 taccgaataa ttatagacac taggaaacaa ctaaaactta atacggaaca tgacgagaca 6120 atgtcagaaa cactaattac atacaatagc gcaattcact cagccacaga acataccccg 6180 ttcgaactat ttaccggacg tacccacaca tttgaaaaca acactaaatt cagcaacgaa 6240 catgattacc tagttaaact gaacgaattt aaacaaaaaa tatatccact catcgcagac 6300 aaattatcgg aaaaggcatt acaaagaaca cttaaaatta atgaaacaag gacacccccc 6360 gctaccctag aacccgaaac attggtactc agaaaggaaa ataggaggaa taaaataaca 6420 cctaggttct cattacacaa agttttacac gaccaaggcc caaccctggt taccactagg 6480 aatcaaaaac tacacaaatc taaaatacga agaattatac taaagaaaaa ataaatttta 6540 ttgcaggttc accattccaa caatttatgc tatttctata caacccctga ctaataatac 6600 ccccatagca aaaatagaat tggggcaagc ccttattata aatagatata agagagttag 6660 ccatgttatt aacctggaag aatttagtaa atgtatcgaa caatttcata aaacactcca 6720 gacatttaaa tatgatgaca ccctaataga ttccctagcc atactaaaat ccaaactagg 6780 acaggcccaa acaaaactta atgcattaac cccactcaag agaaacaaac gcggattaat 6840 taatggatta ggcagtctcg tgaaagctgt cacgggcaat atggatgccg aagacgccag 6900 agaaataggt aaagacatcg aaaatattaa aatagccctt tcgaccgtta acacaaatta 6960 ccaaactcaa gatgtattca ataatgaaat tttaattaga ttcgagaaca ttacaaatca 7020 tattaataat gaacaagttc taataacaac attctttgaa aatatgcaaa ataaaattta 7080 taaagaagaa aataccttgg aaaaattaca gtatataaat aggattaact ataacttaga 7140 attactactt aatcacttga acgatataat tgaaagcatg ttattagcta aaataaatgt 7200 aatacctaga tttatattaa ccgaacaaga aatatataaa attaaaatgt tactggaaaa 7260 tcaaaatatt acgataaaaa gcgaacaaca catttacgat ttgttaaaaa tgaacacgtt 7320 aagctacgaa aataacataa tttttactat ccaaatacct atttttgaaa aagataatta 7380 caaaatagcc agattaattc ccttaccaat taataataaa tactttgtta tgattcctaa 7440 ctatttaata tataagaata atgttaataa gtactacttg actcaaaact gccccaaagt 7500 agacgacacg tttatatgca acaaagacgc atacatcaac acaccgcaaa atgacacctg 7560 cactcaacaa ttattggatt ccaagaacag ctcctgcgat gtgcaagaga gaggccctgt 7620 tgcagacgta ttcgaggcgg aaaaaggcta tattttcgca ttcaacgcca ataacctgaa 7680 ggctcagcta aaaaacggta gcgaaatcac tataaatggc tcagctatca tcaaatacgt 7740 gaacgaaact atacgcatca atggcataga gtacgacaac ggagttgaga caacctttga 7800 gcacttagac ctaatcctac ccccccttaa ggaaatgacc aggaataaaa ctgtggaaat 7860 actgagttta caaaacattc acctagaggc catagaaaca tcaaataaga ttctgactat 7920 ccaccaaacc gccacacaac acgcttggac cctttacatt ctactgggtt tcgtcgccat 7980 cctcactgtt actgcatggc tgcttcgacg cacaaaacac gtattccacg tacaggatca 8040 caatcaccac ctgcctattt acgctccagc aataccttcg ctatggccgt cgcttcaaac 8100 tggggaggga gg 8112 // ID CR1-55_AAe repbase; DNA; INV; 4462 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-55_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4462 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1142-1142 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 17..931 FT /product="CR1-55_AAe_1p" FT /translation="MAPAHELGGLLRSGRRRNNPVNSNTAIPSTPVMSPIP FT PDPATPRTQQAKSNAVAKLEQTVLFKPKKNQSAEKTKDDISAKFDPVAFAV FT KDVWLRDFGEVAVRCGSKDLALKMVSSASTVCAEKYIIEMQKPLKPRIKII FT GFSKDLNHEVLVDKLKQQNNLTTSFDLKVVRVTRNQKRKSNQMSAVIETDA FT TGFVTLMKLRRVYLGWERCRLVDATDALRCYNCSEFQHKASACTKAACCPK FT CAGGHKAEDCDSDYEKCINCHMENAKRTSKHDDLLDVSHAAWSLDCPINQK FT HLARARRRIDFSS" FT CDS 935..4387 FT /product="CR1-55_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSTTSAEKNLQSVISAGPGKLLTSICPAEVKQDTAPP FT LSGTEPXQQSLDIPHLTEYDRDHSTDNQISDAVSTMDFYLQLDDSAGPGKL FT LTSICPAEAKIDTAPFLSGIQPFQLCSNVNHSNERCSTEDKQFACATSTKD FT FNYQLEDSAGPGKFTTSICPAEAETDTAPPLSVTQLLQTPTTDLQHTAHVC FT PASMPKGIRIYYQNVRGLRTKIDNFFVAVSDNEYDVIVLTETWLXDVINSV FT QLFGQGYTVHRKDRCPDISGKSRGGGVLIAVKNTLRSSRCKIHSDPNLELL FT WINVESNDQVICLGVAYIPPEQSRDRCMIESYVSSISTVVSAGDFHTAYFL FT FGDFNLPDLCWLSTPAGYAYPEPSDSATPSNAFIDGMSLLNLKQLNVVRNN FT LDRTLDLLFVNDEALPMCDLTQPFEPLIPIDLLHPPLLATVRCSARSVFIE FT ATDDRLLDFSKANFEQLNEALLAIDWSRLYNERDVNVAVQLYTDTLTQMFH FT EFVPAPRPRQKPPWTNNHLRHLKRKRARALREYSNNRNSITKRRFTIASSE FT YKCYNRALYGRYVVRKQGVLKNNPKAFWAFVNEKRKETGLPSSMFLGSENA FT DTNEEMCNLFAEHFSSVFDCATINQDQVSCAVRDVPLNAMCMTIYEFDDED FT ILKAIKKLKPSTNPGPDGIPSVVLKKCSAAICAPLRSIFNKSISQGVFPER FT WKKSVMFPAFKKGDKRDISNYRGVTSLSAGSKLFEILVSNELFRHAKSYIS FT QDQHGFYPGRSTATNLVQFTSLCLKAMERGIQVDTIYTDLKAAFDRVNHHI FT LLAKLNRLGIATNVIKWMESYLTNRQLVVKLGSMESEPFSNYSGVPQGSNL FT GPLLFSLFFNDACFVVPQGCRLVYADDFKIFRSVRSSEDCRMLQVLVERFS FT SWCERNFLPISVKKCSVISFTRKKSPILWDYHMGNERVERTNVVKDLGVML FT DSELNFREHYSYIINKANRNLGFIFKISSEFRDPYCLRSLYFSLVRSILET FT AAVVWSPAHNVWIDRIERIQSKFIRYALRHLPWQNVIELPPYEARCRLLGM FT DTLERRRNNLKAIFVGKILLGEIDAPWILAQININVLPRPLRQRSFLRLQP FT HRTDYARNEPVNSMCVLFNLYHHMFDFNVSSRVFQNRVTNFYE" XX SQ Sequence 4462 BP; 1283 A; 1006 C; 980 G; 1187 T; 6 other; gtctatttct gtcccaatgg ctccagctca tgaacttggt ggtttgcttc gttctggtag 60 gcgccgaaat aatccagtaa attcaaatac tgctattccc agtacacccg tgatgtcacc 120 aataccgcca gaccctgcta ctcctcgcac gcagcaagca aaatcgaacg ccgtcgctaa 180 acttgaacaa actgtcctgt ttaagccaaa gaagaaccag tcagctgaaa aaacaaagga 240 cgacatatca gcgaaattcg atcctgtggc tttcgctgtg aaagatgttt ggctcagaga 300 ttttggggaa gtagccgtaa gatgcggttc aaaagatcta gcattgaaga tggtcagctc 360 tgcttctact gtttgcgctg aaaaatacat tattgaaatg caaaaaccgt tgaaaccaag 420 gattaaaata attggtttct caaaagacct gaatcatgaa gtactggtag acaaactgaa 480 acagcaaaac aatttgacga cttcgttcga tctgaaagtt gtgcgtgtaa cgagaaacca 540 aaagcggaaa tctaatcaga tgtcggctgt tatcgagact gatgcaactg gttttgttac 600 tttgatgaag ctccggcgcg tctatctggg atgggaaaga tgcagactgg tggatgctac 660 tgatgctctt cgctgttaca attgttcgga gttccaacat aaggcctcag cttgcacgaa 720 agctgcatgt tgtcccaaat gtgctggtgg acataaggca gaagattgtg attcggacta 780 cgaaaagtgt atcaattgcc atatggaaaa tgctaaacgt acatcgaagc acgacgatct 840 cctcgacgtc tcacacgctg cttggagttt ggattgtcca atcaatcaga aacatctagc 900 cagagcaaga cgaaggattg atttttcaag ctagcaatca acgacctcag cagaaaaaaa 960 tctgcagtct gttatctctg ctggtccagg taaattgttg accagtatat gtccagccga 1020 agtgaaacaa gatactgcac ctcctctcag tggaactgaa ccattmcagc aatctttgga 1080 catccctcat ctcaccgagt acgacagaga tcactccaca gacaatcaga tctccgatgc 1140 ggtgtcaacg atggattttt atctccagct tgatgactca gctggtccag gtaaattatt 1200 gaccagtata tgtccagccg aagcgaaaat agatactgcg ccctttctca gtggaattca 1260 accattccag ctctgttcga acgtgaatca ctccaacgaa agatgttcta cggaagacaa 1320 gcagtttgct tgtgcgactt caacgaagga ttttaactac cagcttgaag actcagctgg 1380 tccaggtaaa tttacgacca gtatatgtcc agccgaagcg gaaacagata ctgcgccacc 1440 tctcagcgtw actcaattgt tgcagactcc tacgaccgat ttgcaacaca ctgcccacgt 1500 atgtcctgca tcgatgccta aagggatccg aatatactat cagaacgtwa gagggcttag 1560 gacgaaaatt gacaactttt ttgttgcggt ttcggataac gaatatgacg tcattgtctt 1620 gacagaaact tggctgwatg atgtcataaa ctccgtacaa ctattcggtc aaggttacac 1680 agtgcatcgc aaagatcgtt gtccagatat ttccggaaaa tctcgtggtg gcggagtgct 1740 catagctgtg aaaaacacat taagatcgtc tcggtgtaag attcacagcg acccaaatct 1800 tgagctctta tggatcaatg tcgagagcaa tgatcaggtc atttgcctcg gcgtagcgta 1860 cattcctcca gaacaatcca gagatcgatg tatgattgaa agctatgtca gttccatttc 1920 gacggttgta tcagcaggtg atttccatac tgcttatttt ctcttcggtg attttaattt 1980 gcccgaccta tgttggcttt ccactcctgc gggctatgcg tatcctgaac cgtctgactc 2040 tgctactccc agcaatgcat tcatcgatgg tatgtccctg ctgaatctga aacaactgaa 2100 cgtagttaga aataatctgg accgaacact tgatctacta ttcgtgaatg atgaagctct 2160 accgatgtgt gatcttactc aaccatttga accattgatt ccaattgatt tgttgcaccc 2220 accacttctc gccactgtgc gctgttcagc tcgcagtgtg ttcatcgaag ccaccgacga 2280 caggttattg gacttctcga aagcgaattt tgaacagttg aatgaagctc tccttgcaat 2340 tgactggtca cgtttgtaca atgaaaggga tgttaatgtg gcggttcagc tttacactga 2400 tactcttacg caaatgttcc atgagtttgt acctgcacca cgccctcgtc agaagcctcc 2460 ttggactaac aaccatctcc gacacctgaa acgaaaacgc gcacgcgccc taagagaata 2520 ctctaacaac cggaactcaa ttacgaaaag gagatttacc atcgccagtt ccgaatacaa 2580 atgttataac cgagcactct acggtcgwta tgtggttcgg aaacagggag tcttgaaaaa 2640 caacccgaaa gctttttggg cktttgtcaa cgagaagcgc aaagagaccg gtttgccatc 2700 aagcatgttc ttgggaagcg aaaatgcaga cacgaatgag gaaatgtgca acttattcgc 2760 cgagcatttt tccagtgtat tcgattgcgc aacaatcaac caggatcaag ttagttgtgc 2820 tgtaagagac gtgccgttga atgcaatgtg tatgacgatc tacgagtttg atgatgaaga 2880 tattctgaag gctatcaaga agctgaaacc gtcgaccaac cctggaccgg atggaattcc 2940 ttcggtcgtt ctgaaaaaat gctctgctgc tatttgtgca ccactacggt cgattttcaa 3000 caaatcgatc tcgcaaggag tatttccgga acgctggaag aaatccgtta tgttccctgc 3060 tttcaaaaaa ggggacaaac gtgatatttc aaactaccgc ggcgtaacct cgctcagtgc 3120 tggttctaag ctgtttgaga tcctggtgag caacgaactg tttcgacatg cgaaatcata 3180 catttcgcaa gaccaacacg gcttctaccc aggcaggtcc actgctacta acttagtaca 3240 attcacttca ttgtgtctaa aagctatgga acgaggaatt caggttgaca cgatctacac 3300 cgacttaaaa gcagcatttg atagagtgaa tcaccacatc ctgctggcga agttaaatcg 3360 tctaggcatc gcaaccaacg tcatcaaatg gatggaatcg tacttgacaa atcgtcagtt 3420 agtagtcaaa ttaggttcaa tggaatcaga accgttcagc aactactctg gtgtgccgca 3480 gggtagtaac ctagggccat tactgttctc cttgtttttc aacgatgcct gctttgtcgt 3540 ccctcaaggc tgcagattag tatatgctga cgattttaaa atatttcggt cagtcaggtc 3600 ttcagaggac tgccgaatgc tgcaagtgtt agtcgagagg ttttcaagct ggtgcgaaag 3660 gaactttttg cctatcagcg tgaagaaatg ctcggtaatc tcgttcacca ggaagaaaag 3720 tccaattctc tgggattacc acatggggaa tgagcgagtg gaaaggacga acgttgtgaa 3780 ggaccttgga gtcatgctgg actctgagtt aaactttcgt gagcactaca gctacatcat 3840 caacaaagcc aaccggaatc ttggattcat atttaagatt tcgtctgagt ttcgagaccc 3900 atactgctta cggtcacttt atttcagtct agtccgttcc atactggaaa ccgcagcagt 3960 agtatggagc ccagcccaca atgtttggat cgacagaatt gaaagaatac agtcaaaatt 4020 catccgatac gcgctgagac atctgccctg gcagaatgta attgaacttc caccatacga 4080 agctcgatgc cggcttctag gaatggacac ccttgagagg aggagaaaca acttgaaagc 4140 catatttgta ggtaaaatac ttttgggaga gatcgatgca ccgtggattc ttgcccagat 4200 caatatcaat gtcctgcccc gcccactgag acagcgtagt ttccttcgtc ttcaaccaca 4260 ccgaactgac tacgcacgaa acgagccagt taactccatg tgtgtgttgt ttaatttgta 4320 ccatcatatg ttcgacttta atgttagttc tagagtattt caaaatcgag tgacaaattt 4380 ttatgaatag gtttaagtgt tcatggtaga ctaatgtgtc agatgaatgt aactctaata 4440 ataataataa taataataat aa 4462 // ID Sola1-2_AA repbase; DNA; INV; 2912 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-2_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2912 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(374..1018,1056..1604,1608..2498) FT /product="Sola1-2_AA_1p" FT /translation="MSALKLLQVYGSESEGESSDEEFIGFGDSDDQFPANN FT SKHTAMWNALSAINLRHPENHPSDSAEMESGTERDDLSGSDEEDRGSDGSF FT FQLEKRKRKQTKRQKLKEKRTKRRRLSHKLRIVKCGCKKKCDQIFSKDERV FT NIHSRFWKLEPNEQCNFIRENVHRSSVQTRRTNRTVANHPKKNFSYTFNLS FT LINGEKLKVCRKFFLAVIGYGENCGLFSCLSFRNIIYRCLEHDFDGNPNPT FT KRGKYERSKAKREAVKTHILSYNPSVSHYRREHAPHRFYLPSDLTEASMHQ FT AYQDSASPALRVSYQFYCSMISKMNISLVKLGHEQCEVCVTATLHQNSSGH FT KEEDRFVTVCSTCKSHVEHSRLKNLSRADYQNDGDQIRAKEIVFSVDLQKV FT NFFELSINKLHQTIRFFIVLQVIQLPRLEGLKSIVFTQRVLAFNETFAPVG FT EYAKTYPVVACVWNESTSGRSAGDIASCFGKVVLWCQEFEKITFWLDNCSS FT QNKNWNLFLYLTLLLNSEKVKSNEIILKYFESGHTFMAADSFHAAVEKGMR FT HGSPASTFPEFKDVVQKAKKNVVAMDMAATDFFELRMTVSQYLLNQCKPRP FT YIENIRKIKLKKGSYEMSYSGSLDDDVESTSCCIFSKKQMKTVTDDSFELE FT SILSRRKVPVGICPTRKQALLRVILPVIEEHQKKFWIDIPERQET*" XX SQ Sequence 2912 BP; 939 A; 556 C; 600 G; 817 T; 0 other; ctgcccctct ttgcacaaca gtcccatgtt ggaaaatggc aaactgagaa aatgacgatt 60 taaatgtaca aacaactttg cttcatctcg tgtctccact tgaattaaaa tgtgttatat 120 ttttaaataa tcactgtttg ttgagtgcac gaactaccct gagaattttt tattgaataa 180 aactggaact ctgtttaaaa atgtacaaat agtaaagtga gactgacatg cgaagatata 240 caatataata tgccaatata attgcatatc agtcccattg tgatcatcgt cttgtttgac 300 agctactgtt tggctaccgg ttgcttagtt cgctgttcgc tgttttcagg ctgtgaaatt 360 cagtaattat acaatgtccg cgttaaaatt attacaagta tacggaagtg aaagtgaggg 420 tgaatcctcc gatgaagaat tcatcggatt tggtgatagt gatgaccaat ttccggccaa 480 taattcaaaa cacaccgcga tgtggaatgc gctttccgct atcaacctga gacatccgga 540 aaatcatcca tcggatagcg cggagatgga atccgggaca gaacgtgacg atctatcagg 600 cagcgatgag gaagatcggg gatccgatgg aagctttttt caattagaaa aaaggaagcg 660 gaaacaaaca aaacggcaaa aattaaaaga aaagcgcaca aaacggcgac ggctcagcca 720 caagttgcgc attgtaaaat gcggatgcaa gaaaaagtgt gaccagattt tttccaaaga 780 cgaacgcgta aatatccaca gccgtttctg gaagctagaa cccaatgagc aatgcaattt 840 cattcgggaa aatgtgcacc gcagctcagt gcaaaccaga cgcacgaacc ggaccgtagc 900 aaatcatcct aagaaaaact tttcatatac tttcaacctc agtttgataa atggtgaaaa 960 attgaaagtc tgtcgaaagt tttttttggc tgtcattgga tacggagaaa actgcgggta 1020 agttcaattt aaatggcata ttttaaaacg aataattatt ctcttgttta tcttttagaa 1080 acatcattta tcgatgcctg gaacatgatt tcgatggaaa cccaaatccg accaaacgag 1140 ggaaatacga gagaagcaaa gcaaaacgtg aagccgttaa gacccacatt ttgtcttaca 1200 atccgtctgt ctcgcattat cgtcgggaac acgcgccgca tagattctac cttccatccg 1260 atttgacgga ggcttcaatg catcaagcat accaagattc agccagtccc gcattacgag 1320 tcagctacca gttttattgc agtatgatct cgaagatgaa catttcactt gtaaaactcg 1380 gtcatgaaca atgcgaggtt tgtgttactg caacgctaca ccaaaactca tcagggcaca 1440 aggaagagga tcgtttcgtt acagtgtgtt ctacttgtaa gtcgcatgta gaacattccc 1500 ggttgaaaaa tctttctcga gctgattacc aaaatgatgg ggatcaaatc cgggcaaaag 1560 aaatagtgtt ttcagtggac ctgcaaaagg taaatttctt tgaataattg tctataaata 1620 aattacatca aactattcgg ttcttcatcg ttttacaggt tatccagcta ccacgtcttg 1680 aaggcttaaa atcaattgtg ttcactcaac gtgtgttagc ctttaatgag actttcgcac 1740 ctgttggtga atatgctaaa acatatccgg tcgttgcatg tgtatggaac gaatcaactt 1800 cagggagatc tgcgggcgat attgcgagct gtttcggcaa agttgtattg tggtgccaag 1860 aattcgaaaa aatcactttc tggctggaca actgttccag tcagaataag aactggaact 1920 tattcctcta tttgacgttg ttgttgaatt ccgaaaaagt aaaatcaaac gaaataatcc 1980 tcaaatattt cgaatccggc cataccttta tggcagctga cagctttcac gctgccgtag 2040 aaaaaggtat gcgccatgga tcgccagcct cgacgtttcc agaattcaaa gacgtagtcc 2100 agaaagcgaa gaagaatgtt gttgcgatgg acatggcagc tactgatttc ttcgagttga 2160 gaatgacggt gtctcaatac ctattgaacc agtgtaagcc tcgaccgtat attgaaaata 2220 taaggaagat caaattaaaa aaaggcagct acgagatgag ctacagtggc tcattggatg 2280 atgatgtcga gtcgacatct tgctgtattt tttccaagaa gcagatgaag acggtaaccg 2340 atgatagttt tgagttggaa agcatcctat cgcgtcgaaa agttcctgtt ggaatatgcc 2400 ccacccgcaa gcaagcgctt ctcagggtaa ttttaccagt catcgaagaa caccaaaaga 2460 agttttggat cgatattcca gaacgccagg aaacataaat cgaagcataa aaattcttcg 2520 aagcagacgg gaagtgatta atctttaatt attttttgtt tcatattatc gaacatgctg 2580 ttattagaat taaaaataaa cactcatgaa tttcaaaaat atatctgaaa caacgcattg 2640 tcgcattatc ttattatcgt aatcatatgg catttgaaaa aaaaatatcc cgtatcattt 2700 cgtttatatt cagtgaatca ctaacaatgt gactgttatg cgattatttg caatatggga 2760 caacaccatt ggggtttgtt tttgtacaag tcaggaacaa agactactat tttatcattt 2820 tttcttgagc gtctaacgat gttaagttga tttatgcaaa aaacgcaaat tggtcaaaag 2880 tcaacatggg actgttatgc gaatatgggc ag 2912 // ID Kiri-10_CQ repbase; DNA; INV; 4384 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-10_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4384 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 129-129 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >99% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 58..840 FT /product="Kiri-10_CQ_1p" FT /translation="MSNKTKTRSASAVDGDGQTLKRSREDLTVADYPDEVE FT NITDLWRNMKKLLFNSNSRLEGKIDACKTSIDDLEEEFVALRKECGEQMEA FT VHHRVNSVEFGLQQTAEDVARLERASELIISGIPYQQTENLPQFFQNIATV FT LGYDQPPIVDLQRLAKLPIAAGSAPPILCQFALRNVRNDFYRRYMTNRDLT FT LRHIGFDADKRIFMNENLTKLAREIRSQALRMKKAGIFAQVFTRNGAVYVK FT RQGQPTSELVQCVQQLSLQK" FT CDS 1407..4241 FT /product="Kiri-10_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MFDSSNDDASANYLIPRAVMNACLVNDKLNICHINVQ FT SLCARRFSKFEELKLNFITSKVDIICLTETWLDDSITDATIEIQGYNLVRQ FT NRNRHGGGVCVYFKQNISCKILKKSLGAISTNGRYETDFLLLEMQFNGNRF FT VLAVYYNPPQSDCSYLLRTHIEQFCLKYENSFFIGDFNTDIRKNTRKTNDF FT IDTIASMSFSCVNFEPTFFYNDGCSLLDLLLTNSPELVLKFNQVSMPGISA FT HDLVFATLDFEKRVDLDGYWYRDYNNFNNHAFENALSRLSINEFLSIDDSD FT ILLNTLNNHMFILHEQAFPVKFKKSFYKSWFNPDIERAIINRDIAYNAYKR FT SKSPEHRTNYNRLRNVVNYLTKTAKRDHDRQRINLNVPVRQMWKNIKKLGV FT SKSKNNNIDSTHSAEEVNNYFASNYSERSSSSIHYSHSTDSFHFRQIHSYE FT IVDAVYSIKSNAAGLDNLPISFLKVVLPFLLPFFEHLFNTIVKTSNFPALW FT KRVKVIPIYKKSGSSEITNLRPISLLCTLSKVFEKVIKDQISHYITEMNFL FT HALQSGFRKNHSTETALMKVHDDIALSVDKKCVAVLLLIDFAKAFDRVSHV FT KLLNKLISLYNFSNAATKLLKSYLTERYQAVFLNGILSSFILCESGVPQGS FT ILGPLLFSLFINDLPNVLKYCSVHLFADDVQIYFCSDQNFNIPDICSKINF FT DLNQIHTWSESNLLSINPAKTKALLITKLKTKPDFPNLIMNGAQICFVNEA FT NNLGLIFRDNLEWDLQIKSQCRKIYISLKQLTLTTKHLDVNTKLKLFRSLI FT FPHFIYTDFIYSNANGMHIDRLRIALNACVRYVFNLNRMSRVSHLQKHLIG FT CSFSNFYKFRSCVVLHRIINTKKPTYLYEKLVPFRNTRTYSFLIPNHSSFY FT YSQSFFARGIVNWNSLPTTLKQNTSKYNFKQSLLANLNINH" XX SQ Sequence 4384 BP; 1361 A; 851 C; 759 G; 1413 T; 0 other; cgttgcttcc cactattctt ggcagtcaaa cacctggaaa cgtcaggttt gtttacgatg 60 tcaaacaaaa caaaaacacg ctctgcctct gcagtggacg gagatggcca gactctgaag 120 cgctcgagag aggatcttac tgttgcagat tatccggacg aagtggagaa cattaccgac 180 ctgtggcgca acatgaagaa gctgctgttt aattcgaact cacgtctgga aggaaaaatc 240 gatgcgtgca aaacgagtat tgatgatttg gaagaggaat tcgtcgctct caggaaggaa 300 tgtggtgagc aaatggaggc tgttcaccac cgggtgaact ccgtcgagtt cggattacaa 360 caaactgctg aagatgttgc taggctggaa cgggcgtcgg agctgataat ctctgggatt 420 ccttatcaac agaccgagaa ccttcctcag ttcttccaaa acattgctac tgtgctcggt 480 tatgaccaac ctccgattgt tgacctgcaa cgactggcca agttacccat tgcagccgga 540 tctgctccac caatactgtg ccaatttgca ctgcgaaacg tacgtaacga tttctaccgt 600 cgctacatga ccaaccggga cctgacgtta cgacacattg gttttgatgc tgacaaaagg 660 attttcatga acgagaacct cacgaagctg gcccgggaga ttcgcagtca agctctgagg 720 atgaaaaaag ctggaatctt tgctcaagtg ttcacaagga atggtgctgt gtacgtgaaa 780 cgccagggtc aaccaacatc ggagctggtg cagtgtgttc aacaattgtc gctgcaaaag 840 taaacccttt cctcaaagtt cttcatctcc ttcccaagct tatcctttga ttccattcct 900 tttctccttt gattccttcc cctcctgaaa gtcaacaaca tagaagataa accctttcca 960 aaaagttatt ctttatccct ccacgatcat cctttgattc cgatcccttt cgatcctgtg 1020 tctcctccat ccgtcctgaa agtcatgctg gttcgtgtcg atgatgcctt ctgctggact 1080 gtaacgctgt agatgctgat ttggttgctg aagttgctgt tatggaaatg ctgttggtgc 1140 tgttcgtgcc atgggatggc tgtggtcagg tcgagactgg aacatggtac aaatagaaac 1200 gtgtgtgctt gctaacttta ctgctgccga tatttgtttt gatcaaattc tgattttttt 1260 ttcctacatt atggctcgta caaaaaagtt ttctaaaacc tatttagtta gttttaagtt 1320 atggttctct ttattaagct atggttcaat tggtgtaggt ttggtgggcg gcttttgttt 1380 gaaatttagc gcgttcccca ctaataatgt ttgattcttc taatgacgac gcctctgcaa 1440 attacttgat tccaagagct gtaatgaatg cttgtttggt gaatgacaag ctgaatatat 1500 gccatattaa cgttcagagc ctctgtgctc gccgtttttc caaatttgaa gaactcaaac 1560 taaactttat tacaagcaaa gtggatatta tctgtttaac agaaacctgg ttggatgatt 1620 ccataactga tgcaaccatt gaaatacaag gttacaactt agttcgacaa aaccgtaatc 1680 ggcacggtgg gggtgtatgt gtatacttta agcaaaatat tagctgtaaa attcttaaga 1740 aatctcttgg tgcaatttct accaatggtc gttatgaaac tgattttctg ttacttgaaa 1800 tgcaatttaa tggaaatcgt tttgttttag ctgtgtacta caacccgcca caaagtgact 1860 gttcttacct acttcgcacc cacattgaac aattttgttt aaaatatgaa aattcatttt 1920 ttattggcga ttttaacact gatattcgaa aaaatactag aaaaactaat gattttatag 1980 atacaatcgc tagcatgtcg ttttcttgtg tcaattttga acctactttt ttctacaatg 2040 atggttgctc tcttctagat cttttattga caaattcacc agaattggta ctcaaattta 2100 atcaagtatc catgcccgga atatcggctc atgatctagt gtttgctact ttggattttg 2160 aaaaacgtgt tgacctagac ggttattggt atcgtgatta caataatttt aacaatcatg 2220 ctttcgaaaa tgcattatcg cgtcttagca ttaatgaatt tcttagcatc gatgattctg 2280 acattcttct caatactttg aacaatcata tgtttatctt acatgaacaa gcttttccag 2340 taaaattcaa gaagtcattt tataaatctt ggtttaaccc agatattgaa agagctataa 2400 tcaacagaga cattgcttat aatgcttaca aaagatcaaa atcaccagaa catcgcacca 2460 attacaaccg tcttagaaat gttgtgaatt atttaactaa aactgccaaa cgtgatcacg 2520 atcgacaacg tatcaactta aatgttccag ttagacaaat gtggaaaaac ataaaaaaat 2580 taggagtctc taaaagtaaa aacaataaca ttgatagcac ccattcagcg gaagaagtta 2640 acaattattt tgcgtctaat tatagtgaaa gaagttcgtc tagtattcac tattctcatt 2700 caacagatag tttccatttt aggcaaatcc acagttatga aatagtggat gcagtttact 2760 caatcaagtc taatgctgct ggtcttgaca atcttccgat ttcatttcta aaagttgttc 2820 ttccctttct actgccattt tttgaacatc tctttaatac tatcgtaaaa acatccaatt 2880 ttccagcact ttggaaacgt gttaaagtga ttccaatata caaaaagagt ggatcttcag 2940 agattaccaa tttaagacca ataagtttgc tatgcaccct ctcaaaagtt ttcgaaaaag 3000 taataaaaga ccaaatttca cattacatta ctgaaatgaa tttcttacat gccttgcaat 3060 ctgggttcag aaaaaatcat agcacagaaa cagcactaat gaaagttcac gatgatattg 3120 cgttatctgt tgataaaaaa tgtgtcgctg ttcttctgct gattgatttt gccaaagcat 3180 tcgatcgcgt ttctcatgtt aagcttttga ataaacttat ttctttgtac aatttctcaa 3240 atgctgctac gaaattgtta aaaagttact taacagaacg ttaccaggca gtttttttga 3300 acggtatttt atcatcattt atcctttgtg agtctggagt tccacagggc tctattcttg 3360 gaccattact attttctctg tttataaatg atttgccaaa cgttttaaaa tattgctcag 3420 tacatctttt tgcggatgat gtgcaaattt atttctgttc tgatcaaaac tttaacattc 3480 ctgatatttg ttcgaagata aattttgact tgaatcaaat tcatacttgg tctgagtcta 3540 atttattatc aataaatcct gctaaaacta aagcgttact aattactaaa cttaaaacga 3600 aacctgattt tcctaacttg atcatgaatg gcgcacaaat ctgtttcgtc aacgaagcaa 3660 acaatcttgg cctgattttc agagataatt tagaatggga tcttcaaatc aaatctcaat 3720 gccgaaaaat atacatatct ttgaagcaac ttactctaac aacgaaacat cttgatgtta 3780 atactaaact taagttattt agatcgctga tttttccaca ctttatctat accgatttta 3840 tttactctaa tgccaacggt atgcatattg ataggctacg tattgctttg aatgcttgtg 3900 taagatatgt tttcaactta aatagaatgt ctagggtttc gcatttacaa aaacacctta 3960 ttggatgtag tttttcaaac ttctataaat ttagatcatg tgttgtctta cacagaataa 4020 ttaacaccaa aaaacccaca tacctttatg aaaaattagt accgtttcgt aacaccagaa 4080 cttacagctt tctaataccc aaccatagct cattttatta cagtcaatcg ttttttgcta 4140 ggggcattgt aaattggaac agccttccga caacattgaa acaaaatact tccaaatata 4200 atttcaagca gagtttactc gccaacctca acattaacca ctagttaaat aagagaacca 4260 aataaattaa agttagttaa tttaaattga agcagaactc agcagaacac acactgtaac 4320 ataaaaaaga ctatgtctta agttactata tggaataaaa caaataaata aataaaaaaa 4380 aaaa 4384 // ID Mariner-38_HM repbase; DNA; INV; 3038 BP. XX AC . XX DT 10-SEP-2009 (Rel. 14.09, Created) DT 10-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-38_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3038 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 9(9), 1926-1926 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 810..2420 FT /product="Mariner-38_HM_1p" FT /translation="MVRNFKRKTNRGYTSGDVIMCAVKDVKINNISIRSAA FT KNHGINYRTLARYCKKIIITNPSHGSIADLIDKIENPTSLNVIGYKSRLIL FT PEKVEKELVTYLLRAADIYFGLTPTEVKKLAFQLAIKNNMKVPNSWVVKEQ FT AGKDWFTGFIKRHSNISIRQPEATSLSRATSFNKQNVASFFSNLQHCMQKY FT SFSPNDIWNMDETAVTTVQQPKKVIAKKGLKQVGAITSAERGTLVTLACTI FT NAIGNSIPPFFIFPRKNFKDHFLISAPTGSAGDANPSGWMKEENFVKYLKH FT FVGNTKCTKEKPCLLLLDNHETHLSIEGLDLAKNNGIVMLTFPPHCTHKLQ FT PLDKAVYGPLKNYINTAMTSWLVMNPGKTVTIYDIPKIVNIAFPKAVSPHN FT ILSGFSATGIYPLNPDVFTENDYCPNYVTDRPDPPCSQADSNKKDCESYSM FT QPEPSNKRFKESIQIPDNQVDITIEDPVVSNKTSNQNPITTLGFNNSIIPP FT ESIRPLPKAGPRLASKRGRKKRSTMILTNTPVKAWLANFY*" XX SQ Sequence 3038 BP; 1083 A; 490 C; 507 G; 957 T; 1 other; ggggagagcg gggttggttg tcacgcgggt tggttgtcac aaagtttatt acaggaaatg 60 taaataagat atagaaattt caaattatac aatggaaatg ttatatcagc gtgcagaaga 120 aaagttattc gtttttatag gtttttgata tgaaaaagta aaatttcgcc tactcagaaa 180 ttttctttta ccaaacgaaa attattttat tgtgttttga atgtagattt ataaagtact 240 ttattcattt gagtaactta atccgtaaaa aattatcatg aaattctcac gcttttaggt 300 aaatgtacag gtgtccagca acttactcag aaggggttgc ttgtcacaaa atatatgggg 360 atggttgtca cacttgcctc gactccatgc gctgcaaatt aaacgccgct tgcagaatat 420 ttagtggctt tgtttatttt ccgcattatt gaggaatatt ttaatgttta agatattatt 480 taaaaatcta aagtcaaagt gttatatttg tttaactaaa tatctgtaat ctaaattatt 540 tttatttcaa cgtgcataac tttagattcc gatccaatgt ttagtataat tatagaaaaa 600 ggattgcaga agttcaactg tttattgttt tgtttacaaa ctaaacaact agctgttgtt 660 tagtttgtaa aactgtctgt tgttttgttt gtgcattgtt tatatttagt gggttaaaca 720 ataaaaaaaa agtakgattt atatatatac gtgtatgtaa tgttttgttt acattatttg 780 caaatatttt tatttgctta gtaataaaca tggtacgaaa ttttaaaaga aaaactaata 840 gaggttatac ttctggagat gtcataatgt gtgctgttaa agatgtaaag attaataaca 900 tttctattag atcagcagca aaaaaccatg gcattaatta cagaacatta gcacgttatt 960 gtaaaaaaat tattattact aacccatcac atggttctat agcagatcta attgacaaaa 1020 ttgaaaaccc cacctccctg aatgttatag gatacaaaag tcgattgata ttgccagaaa 1080 aagtagaaaa agaactggtc acctacctgt tgcgagcagc agatatatat tttggtttaa 1140 ctcctaccga agtaaaaaag cttgctttcc agctagcaat taaaaataat atgaaggttc 1200 ccaattcttg ggttgtaaag gaacaagctg gtaaggattg gtttacagga tttataaagc 1260 gccattcaaa tatttccatt aggcaacctg aagctactag tttaagtcga gcgacaagtt 1320 tcaataagca gaatgttgca tcatttttta gcaatttgca gcactgtatg caaaaatact 1380 cgttctcacc aaatgacatt tggaacatgg acgaaactgc tgtaactact gtacagcaac 1440 ctaaaaaggt tattgcgaaa aaaggattaa aacaagtagg agccatcacg tcagctgaaa 1500 gaggaactct tgtaacatta gcatgcacaa taaatgctat tggaaatagt attccaccct 1560 tttttatttt tcctcgtaaa aactttaaag atcatttcct aatcagtgca ccaacaggaa 1620 gtgcaggaga tgcaaatcca agtggttgga tgaaagaaga aaactttgta aaatatctaa 1680 aacattttgt aggcaataca aaatgtacta aagaaaagcc ttgtttattg ttgcttgaca 1740 atcacgaaac tcatcttagt attgaaggat tagatttggc aaaaaataat ggaatagtaa 1800 tgcttacctt cccaccacat tgcacacata agttgcagcc cttggataag gcagtttatg 1860 gtcctctcaa aaactacata aatactgcta tgacatcatg gttagtaatg aatcctggaa 1920 aaactgttac aatatatgat attccaaaaa tagtaaacat tgcatttcca aaggctgtat 1980 ctccccacaa tattttaagt ggtttttctg caacaggtat ttacccgtta aatcctgatg 2040 tgtttactga aaatgattac tgtcctaact atgtaactga tcgccctgac ccaccttgtt 2100 cacaggctga ttcaaataaa aaagactgtg aatcttattc aatgcaacca gaaccatcaa 2160 ataaaagatt taaagaatct attcagatac cagataatca ggtagacata acaatagaag 2220 atccagttgt atcaaataaa acctctaatc agaacccaat cacaacatta ggttttaaca 2280 attcaatcat accacctgag agtattaggc ctctgcctaa agcaggcccg aggttagcat 2340 caaaaagagg taggaaaaag cggtcaacta tgattcttac taacacgcct gtcaaagcat 2400 ggcttgctaa tttttactaa tggaagtaaa aaagaattaa taaaatcaaa tcttgctacg 2460 aaaaaacctt ctatagtctt aaattcatct gaagatgaag agtatctgtg catctattgt 2520 tttgaatcat tttccaacag ccgatcaaga gaacagtgga ttcaatgtca ggtttgccaa 2580 aaatgggctc atgaagaatg cacaccaggc ttacaacagt ttatttgtga attgtgttct 2640 ttatccaata atttttaaaa aaccaaacct aaaaacattt atagtgataa actaaatttg 2700 ttaattgtgt tatatttaac ttagtaaaac ataaacgaaa ttaatttgta ctcgttttac 2760 attaaattct tatctaaact tgtcaaaatc caaatgtgac aatcaacccc gcaccatgtg 2820 acaacctacc ccgtgagcgg ggctagttgt cacacagaag tggtctttga aaaatgattt 2880 tacagcaata aatatttcac tcaatgtcat aaaaatatta taatagttac tggaaatagt 2940 taaactagct ttgtgcaaag tttggcattg attggatgaa aattaaaagc tgagcagcct 3000 aaaatgtaaa aagtgtgaca accaaccccg ctctcccc 3038 // ID ORTE-3_AAe repbase; DNA; INV; 6180 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-LTR retrotransposon family encoding cysteine protease from DE Aedes aegypti. XX KW Non-LTR Retrotransposon; Transposable Element; ORTE-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6180 RA Kojima K.K. and Jurka J.; RT "A lineage of non-LTR retrotransposons encoding an OTU cysteine RT protease from the yellow fever mosquito."; RL Repbase Reports 11(4), 1126-1126 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with 94-99% CC identity. This family encodes OTU superfamily cysteine protease CC upstream of apurinic-like endonuclease. It is positioned at the CC sister lineage of the lineage including RTE and RTEX in CC RTclass1. XX FH Key Location/Qualifiers FT CDS 981..5642 FT /product="ORTE-3_AAe_1p" FT /note="OTU cysteine protease, endonuclease and FT reverse transcriptase." FT /translation="MMTMTSNGIRRLGWFSRVLSLIKQRLFNSSHESNGLS FT KLILKQICAKLNGVNFRQEDVDELNSIILSKIEGWEEDLNKFVEFFDNIQD FT EGFDIAEMANHCLLLAVSQIFECEVRVSNDKDDCKIIGEGYKNSWFGVLEV FT GMESSESVEKVTRALKXSSKIELGNCGKGMQVNGMAGKGNEPLIVKEECLW FT HTTIGNWIIEIESKGEDDGKLYSGLMMNVIQKMNEGEIGEQEMMKVLEQID FT LFFEMYGEVLNRLEEEVMINGNALYEIWKIVSKLAVALNREIIVLMXGGEE FT IVLGVTERKRKFASFRLVAEFQEAGVRYYSIKSSKLIQEWKENMSIKPAID FT VFEKRAEEVSYNLRFKESNQEAVIKRIDGDGSCLIRAMLDQMGRIDQNYYM FT SDYNVMKLRNNIASYINNSRERFNVYLYEHIEEVGTFERVIEKIREKSYWL FT GEEAIITVAELFEKQVEVYTNQGLSVRIGVRKGNESALRIFYTGDHYDSVI FT SIAEEEAGEVEAEMINVNEAEGNEMQAGGESRAVKSNKVREVNESSSHSKK FT ITQHGKQSTNTTGDSNNHEQSKKQVEKQLSQDTGIKGMENKALKIVSWNVR FT GCRRKVKRDEIDDILVANDVMIAALQEVNTIGDDLSTNNYSWQVKDNHTNK FT ARGLAIVVRKHQRIRILQAKECRKGIMWIKLTINGVKLIMINVHAPNSNQK FT NFLSYLNEVVTRESDRKSLIVLGDFNAQIGSEDLQPEDRVWIGTKLGHRYC FT NDNGELFKYFLHLSKLKNVSTKIGKSIEITWRTKDKSSQIDHVLVPQYTSF FT RIRYIKGCWTDLKTDHVMLTIGLVFTNNPVLKKSNKETARINVDLLKIPEK FT QKCFREKLSSIPTKTDIQTDTLDKCYEEMASRLRIAACNTLKVSTVPNTPR FT RKYALDRLRDALKVSKKHPDMINYKYRVANRRHEYQIAVKEHREKEITDFF FT KNLNDYDAGVRVRKTYNYLKDFVKRKKQKKAMISSRVWEDILKTCEGPEVE FT FCAEHDYTPLPNPPTNDEIKSILFSACNGKSPGSDRVVAEYIKAADEETLE FT RIIEIIKAVYIRNDMPKEWKKSTQIPIPKKSGASEADHFRRISLCNVVYKI FT YAKWIAKQLRAYAGEPGLHQAAFTANRSTDDHIFVARRMMEEYWNSGQDLF FT ILALDISKAFDNVKLQSIGKVLIDLGVPSRLVDRVLSCVKDEVTRIMWENQ FT YSAEVKRGKGIKQGCPLSPLLFNYLMQDVLKKVVAEIPELKLMGINQLILP FT LILVFADDILIIARTKAELEKILHELELQLAKVGLEINRDKSQVLIRYPND FT KAQHPKEILLNNKIFKVSDSLAYLGICLTATLNRKLTCKQRCLSAVKSSRI FT IVEFCKTFKPSWELGKLMYKTVIAPSLLYGTKVSVMTKKSRLKIGNYEKLI FT LRNIYQYCKKPKDLKFNARKLLDGKTINRRIRVGRISYFGHVRRREKNHPL FT KLAYKLKFDKKKEGRPSLTWLKSLDQDFERYQNMNRSSWEELSKDRERIKK FT KLDEIYKNEESEISDGESSDGDQNSRYKHWKWKRRNEK" XX SQ Sequence 6180 BP; 2245 A; 841 C; 1573 G; 1509 T; 12 other; catacgttga gtgactaccg ttgagagcgg acgtgtctgt gcttgcgtgc tgttacagtg 60 gttcgagtga aaagatcgtt tctgcgattg ccgtgggagg aattgtgctg gggtttttgg 120 gtttccggaa gtgccgtggt ggagcagttg ggtggtaacc gtgctgggtt tccgacggat 180 tgggccggat cagtgatttg gccaamaaca cggctgggtg aaataggctt ttcggagagt 240 agtgagaaaa cggaaaagca gctgtcggtg gagtgcggag ggggtkgaag cctagtggag 300 gagtgcggag ggggtgaaag tgcggtgagg gttgcagtcg gtaaatcacg catttcagat 360 tgctgattgt ttgtgtcctg aggagaaggc taaattcctg taactggggc agctggcgtg 420 gttaaggggt tgttccgggg aagcggaaag ggagtgtgaa gtgtttggcc agaggaagtt 480 ggkggaaaaa aaaattggaa cgccgaagaa agaagaaagt tttgaggcga gagtggcggg 540 ggggtgctac ggtcgagagc ttcaacggaa gaaaacacga gtggtgtgag tgagaagacc 600 ttgatcggtt ttggcggtaa agctcgttgc aggggttcga aaaaaaaaaa aaagaaaatt 660 ggaagcgaga gtgagtgcca cgtagaagcg aagtggggaa gagtggcttc gcacactgtg 720 agaggaggaa gtggagcatc gaaataggca agcggctgag tagcgacaga gagagagtaa 780 gtgggaaagc gcatctgtag cggkgaggca gaaataagaa gcagggctgc tggtcgatcc 840 ggggcgaata ctgtggagag cacgaaggaa gagcccaacg cagcgagact gagtgcaccg 900 gtttcattgg tggcagtatt ggtgagtcat tcactttctc gttccaataa tttttccaag 960 tatctcgaag tttggtcctc atgatgacga tgacgtctaa cggaattagg aggttagggt 1020 ggttctcacg tgtattatct ttaataaagc agagattatt caacagtagc catgaatcga 1080 atgggctgag taaattgata ctaaagcaga tttgtgcgaa attgaacgga gttaatttta 1140 gacaagaaga tgtagatgaa ttgaattcaa ttattttatc taaaattgaa ggatgggagg 1200 aagatttaaa taagttcgtt gagttttttg ataatattca agatgaggga tttgatatcg 1260 cagagatggc gaaccattgc ttgcttttag ctgtttctca aatctttgaa tgtgaagtga 1320 gggtcagtaa cgataaggat gattgtaaaa ttattgggga agggtacaaa aatagctggt 1380 ttggagtgct tgaagttggt atggaatcga gcgaaagtgt tgaaaaggta actcgggctc 1440 taaaagmgtc aagtaagatc gagttaggta attgtggaaa aggcatgcaa gtaaatggaa 1500 tggcagggaa gggcaatgaa ccgttgatag tgaaggaaga atgtttgtgg cacactacaa 1560 taggaaattg gataattgag atagaatcga agggggaaga tgatggcaaa ttatacagtg 1620 gattgatgat gaatgttatt cagaaaatga atgaaggaga aattggggag caagagatga 1680 tgaaggtgtt agaacaaatt gatctgtttt ttgaaatgta tggcgaggtt ttgaatagac 1740 ttgaagaaga agtaatgata aatggaaacg cattatacga aatatggaaa attgtcagta 1800 agttagctgt agcattgaac agagaaatta tagttttaat grraggaggt gaagagatag 1860 tcctaggggt gactgaaagg aaacggaaat tcgcttcwtt tcggttggta gcagagtttc 1920 aggaagcggg agtccgttat tactcaatta aaagcagtaa gctaattcaa gaatggaagg 1980 aaaacatgag tattaagcca gcgatagacg tatttgagaa acgagctgag gaagtatcat 2040 acaatttaag attcaaggag agcaatcaag aggcagttat aaaaagaatt gatggagacg 2100 gcagttgcct gatcagagca atgctggatc aaatgggaag aatagatcaa aactactata 2160 tgtcagatta caacgtgatg aagttaagaa acaatattgc aagctatatc aacaacagta 2220 gggaaagatt caacgtttac ctgtatgagc atattgaaga ggtaggaaca tttgaacgag 2280 tgatagaaaa gataagagag aaatcatatt ggttaggaga ggaagcaata ataacagtcg 2340 ctgaattgtt tgagaagcaa gtggaagtgt acacgaacca gggcttgagt gtaaggattg 2400 gtgttagaaa aggcaatgaa tcagcattga gaatttttta tactggagat cattacgaca 2460 gtgtgataag tattgctgaa gaagaagcag gagaggtaga agcagaaatg ataaatgtaa 2520 atgaagcaga aggaaacgaa atgcaagcag gaggagaatc aagggcagtt aaaagtaata 2580 aggttagaga ggtcaacgaa tcaagttctc attcaaagaa aatcacacag catggaaagc 2640 aatcaaccaa cacaacaggt gattcaaaca accacgagca aagcaagaaa caagttgaga 2700 aacaattgag tcaagataca ggaattaaag gtatggaaaa caaggctctg aagatagtta 2760 gctggaatgt tagaggatgc agacgaaagg taaaaagaga cgagatcgat gacattttag 2820 ttgcaaatga cgtaatgatt gcagcacttc aagaagtcaa tacgattgga gatgatttga 2880 gtacaaataa ttatagttgg caggtaaagg ataatcatac gaacaaggca agaggattag 2940 caatcgttgt caggaaacat cagagaatca gaattcttca ggcaaaggaa tgcagaaaag 3000 gtattatgtg gattaagcta acaataaatg gagttaaact aataatgatc aacgtacatg 3060 caccaaatag taaccagaag aatttcctca gctatctcaa tgaagtagtg actagagaaa 3120 gtgaccgtaa atcacttatt gtactaggag attttaacgc tcagataggt tctgaagatt 3180 tacaaccaga agatagggta tggataggaa caaaactagg gcacaggtac tgtaacgata 3240 acggagaact attcaaatat tttttgcatc tgtcaaaact taagaatgtg tcgactaaaa 3300 tagggaaaag tatagaaatc acttggagaa ctaaggacaa atcaagccag attgatcatg 3360 ttttggtacc acagtacacc tctttcagaa taagatacat taaaggctgt tggacagatt 3420 tgaaaaccga ccacgtaatg ttaactatag ggctagtttt tactaataat ccagtcctaa 3480 agaagagtaa taaggaaacc gcaaggatca acgtagacct ccttaaaatt ccggaaaaac 3540 aaaaatgttt tagagaaaaa ctgtcatcga ttccaacgaa aacagatatt caaacagata 3600 cgttagacaa atgctacgag gaaatggcca gtagattgag aatagcagct tgcaatacgc 3660 ttaaagtatc tacagtaccc aatacaccac gcaggaaata tgcactggat aggttaagag 3720 atgcactaaa agtttcgaaa aagcatccgg atatgataaa ttataaatac agggtcgcaa 3780 atagaagaca tgaatatcag atagctgtaa aggagcatag agaaaaggaa attactgatt 3840 ttttcaagaa cctcaatgat tatgatgcag gagtgagagt aaggaagacg tataattact 3900 tgaaagactt cgtaaagagg aaaaagcaga aaaaagccat gattagctcg mgagtttggg 3960 aagatatact gaagacatgt gagggacctg aagtagagtt ttgtgctgaa catgattaca 4020 ctccattgcc taatcctcca acaaatgatg agattaaaag tattctgttc tcagcatgta 4080 atggaaaatc tccaggatca gatagagtgg tagccgaata cattaaagct gcagatgagg 4140 aaaccttaga gagaataata gaaattatca aagcagttta cataaggaat gatatgccga 4200 aggaatggaa gaaatcaaca cagataccaa tacctaagaa gtcaggagct tccgaagcag 4260 atcacttcag gagaatctcg ctgtgtaatg tagtgtataa gatatatgca aaatggatag 4320 caaagcagtt aagagcatat gcaggtgaac ccggattaca tcaggcagca ttcacggcaa 4380 acaggtcaac agacgaccat atatttgtgg ctaggagaat gatggaggaa tactggaact 4440 ccggtcagga tctctttatt ctagcgttag atattagtaa agcatttgat aatgtgaagc 4500 tacagtcgat aggaaaagtg ttgattgatt taggagttcc ttcgagatta gtagatagag 4560 ttctgagctg tgttaaagat gaggttacta gaattatgtg ggagaatcaa tattcagctg 4620 aggttaagag ggggaaagga attaaacaag gatgtccgtt atcaccatta ctctttaatt 4680 atttaatgca agatgttctc aaaaaagtag ttgcagaaat cccagaattg aaactaatgg 4740 gcattaatca attgattctt ccactaattc tagtgtttgc agatgacatt ctaatcatag 4800 ctagaacgaa ggcggagtta gagaaaatat tgcatgaatt agaactgcaa ctcgcaaaag 4860 taggccttga aattaatcgc gacaaaagtc aggttttgat tagatatcca aatgataaag 4920 ctcaacaccc aaaggagatt cttctaaata ataagatatt caaagtgagc gacagtttag 4980 cttacctagg aatatgtttg actgcgacgt taaatcgtaa acttacatgt aagcaaaggt 5040 gtttgagtgc agtgaaatca tcgagaataa tagtagaatt ttgcaaaact ttcaagccaa 5100 gctgggaatt aggaaagtta atgtataaga ccgtgatagc accgtcsttg ttatacggaa 5160 ctaaagtgtc tgttatgact aagaaaagca ggttaaaaat aggtaattat gagaaactta 5220 tactcaggaa tatttatcaa tattgtaaga aaccaaaaga cttgaaattc aatgctagaa 5280 aattattaga cggaaaaacc attaacagaa gaattagagt aggaagaata agctattttg 5340 gtcatgtaag acgwagagaa aaaaaccatc cactgaagtt agcatacaag ttgaaatttg 5400 acaaaaagaa agaaggcaga ccatccttaa catggctgaa atcattagat caggactttg 5460 aaaggtatca gaatatgaat agaagtagtt gggaggagct atcaaaagac agggaaagga 5520 ttaaaaagaa actagatgaa atttataaaa atgaggagag cgaaatttcc gatggggagt 5580 cttcagatgg ggaccaaaac tcgagataca aacattggaa atggaaaagg aggaacgaga 5640 aatgatgcga ttcagaacga ctggagttga agaagagagg ggagaagatc accatattgt 5700 gcattgccag aataaaggta ttgtaagtac aaatcaatgt tttaaatatg taaaasggaa 5760 gtcaaacaag tatacgttat actgggaagg tttaatctaa tggtaagtga aataattaca 5820 aaacataatc aatagcacca atcgtaactt tcatatatta tatatcaaac aggacaccca 5880 attgaagcaa tatggatgag gagagtggaa ttgccaactc cttatcgcta acatactgtt 5940 gatagaggga tggtccttaa accccactca ttgagacatc cttttgtatg atgaaatgac 6000 cggtagatcg gtgaccagta tctaggttat aatttttgga atgccttagg taaaccttaa 6060 ataatagcaa agagctgaaa caatcaactg gataacatca tataaaacgg acagtctact 6120 gtctaacggc tgtgccatgc aacaatgagt ggttaccctt acatacatac atacatacat 6180 // ID Polinton-4_NVi repbase; DNA; INV; 12608 BP. XX AC . XX DT 14-APR-2009 (Rel. 14.04, Created) DT 14-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-4_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-12608 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 794-794 (2009). XX DR [1] (Consensus) XX CC The consensus may be incomplete at both ends. XX FH Key Location/Qualifiers FT CDS 5021..3786 FT /product="Polinton-4_NVi_1p" FT /translation="MEEVLSIQSPVVFDESVAHYEIHAHQPYTLSNFNNSD FT EIRIGIQHQDLCVLPSRSSLHICGKLQKPDGTAIEGTRFVNNAICHLFEEI FT RYELNAIEIDRCKNVGLTSLMKGFVSFNPSQSSAIENAGWLDIAETQRLDD FT NGYFDVSIPLGMILGFAEDYRKIVVNAKHELILTRSNSDVNSIVQTQVVVA FT GANVYEDYQVEITKIEWLMPYVVLSDKHKIKLLNHLEKDRPITMSFRSWEL FT YEYPLLPSTSKHVWTVKTANQLEKPRFVILGFQTNRKGRKTANASRFDHCN FT ISNVTLFLNSQHYPYGNLNLNITNNQYALLYDMYANFQNAYYNKEVEPMLK FT KADYLSYAPLIVIDCSKQNESLKQASVDVRLEFEARANIPVGTSAYCLILH FT DRIVEYNPMSGDVKKIV*" FT CDS 7221..7916 FT /product="Polinton-4_NVi_3p" FT /translation="MDFQQQKIKLPVSNFDTLTEQQKVQKRHGELLPDNIR FT AVFCGPSNCGKTNTLLALITHPNGLKFENVYIYSKSLKQPKYKFLEKLLRP FT LPGIQYFPYSEHDTVVPPDDVLPNSIMVFDDVACERQDNVRAFFCMGRHKN FT VDCFYLCQSYAHIPKHLVRDNVNVLVVFRQDDVNLKHIYDDHVNPDMTYTN FT FKALCSACWKEGKHGFLFIDKERPINDGRYRKGFDCFATNI*" FT CDS 11961..10291 FT /product="Polinton-4_NVi_5p" FT /translation="REISIAKEKRRFPLRLCEFIGKTQRFRATAKRFWNKF FT NISNLGQYSDLYLKTDVLLLAEVFENFRTSCLKAYGLDPAHYYTAPGLTWD FT AMLKYTNIELELLTDIDKVMFIERGIRGGISQCCNRYSRANNIYMGTDYDA FT DQYSKYLIYYDVNNLYGWAMQQSLPYGGFKWMTETAISQTDFSSVDDESAI FT GYILEVDLEYPEQLHDDHRDLPLCPQHMKPPGSTYSKLMTTLHSKERYVLH FT YRNLKQALANGLQLQRIHRVLQFNQRPWLKAYIDLNSSMRKEAKNEFEKNL FT YKLMNNAVFGKTMENVRKRVDLKIVTKWDGRYGAEALISQPNFHSRSILDE FT NLTIIELTKTEVNMDKPIYVGLCVLDLSKTLYSFHYDYMKKQFGDDCKLLY FT TDTDSLIYELRNIDIYEVMKRDIEHFDTSDYSENNTFGMPVANKKVVGLMK FT DECNGRIMTEFVGLRSKMYSIRVEGQDKVKKVKGVKTAVVQNSITFDDYIQ FT CLQNMSTQTREQCIIRSHTHNVFSEKQVKLALSPHDDKRYLLRGETDTLPW FT GHYKIMQTD*" FT CDS join(7160..6288,6423..5950,5953..5216) FT /product="Polinton-4_NVi_2p" FT /translation="MYWDQLRAVLSCIMIIHKQQRRKGRGLVNSLINKLPV FT ELHLPGYQYCGPGTKLAKRLARGDPGINTLDRACKEHDIAYSQNRENVEAR FT NAADKVLAAKAWERVRAADAGVGEKVAALAIAGTMKAKSKFGMGLKRIKKK FT KPSISLAKVIKAASTSAVPSNNSHAVIRSALKAARQAVNKAGGKKNIRIPR FT MLSVPSKIGGFLPFLIPIFAGLSATGALAGGAAGIAKAVNDASAAKRQLDE FT SQRHNKTMEAIALGGRGLYLKPYKEGYGMYLKRGQGLRKKKKTSNCRNAHQ FT NNGSYCARWPRSLSETLQRGLWYVPETWARFKKKKKNFKLPQRALTDADLL FT KYARILKIPYFRGVYMRNALPASGPHYRESAIVNLDDKSGPGTHWVAYRKR FT GNHVVYFDSFGDLQPPQDLMEYWNVSRVKYNQEHYQDYNSYNCGHLCLKFL FT LNDIYISVGSASASVSLVRVVMEDSLTLSLSGTSAVLEAQYFPPLELSANK FT SYVLGLVELLTFNSIPNIDTGNNKFYVGGEVIILPTGSYEIEDIEKSLKEA FT LTPKGITLKLKPNNNTLRCMIKCNRSIDFQPDDSIGKLLGFTSRVLSPNTD FT YESDLPVTILKINALRVECNITSGAYINEHKVHTIHEFFPAVSPGYKIIEV FT PSPIIYLPVTVKTINNLQLHIVDQDGHLVNFRGEVITIRLHIKSV*" FT CDS join(7970..8926,8877..9356,9277..10239) FT /product="Polinton-4_NVi_4p" FT /translation="MADFRREKAVLQQIDKAREAIRRKHRLLKQGKASMEK FT ALGETFKPIVDPLEKLVRVKEEVKPDIKQELKKEKSFDDDVKDDEDNEDDT FT LLNETADTSFESAQEDSEDEKIEKDSKIKNKHSLSLKENRHEKDTEMNNKY FT LSALMQNRQRYLDHVFGIRQDGQILKIGDSPIEITRNNIIVLNRKYPKTEG FT LQELLFRLHPDEALIKSDDLQNYKNILEATNAHRKYYSYNQPIRQQNSKKY FT NNIIAPLFKVQSKTGGGLLPRYKVARKGTLMDYIYWDDPNELVDRLRLLMA FT ERLAGNNSHLNEIHSIIEELREAKIIYTKSIPSSKSCEKLKLYINRRREHV FT EVSSKMSVDVFGRQSNRSKGRRGPPGVGYKLTADGHFDIGSRRLCNVAEPL FT ELNDAVNLAVLQRSINSEIRSVHEITSDLRQSIDAVELAVQIAKDENAEKL FT RKIDLAISELYEILLKNSQNNNDTSKGETSGGTSVNCMRFCLKILRIIMTH FT QKARLVEELHKPARRNYQRRRFIMRGLDDTWQADLVEMQPYAQENKGYKYL FT LTVIDVFSKYAWAVPLKQKTGNEVAAAMKSILQQGRVPKNLHTDRGKEFYN FT SSFESLMKQYGINLYSTYSNLKASICERFNRTLKNKMWMQFSLRGNYKWLD FT ILPDLLTEYNNSKHRTIGMKPKDVSRENEAKVLKRFSHQESRKKAKFKVGD FT KVRVSKMKQVFEKGYTPNWSTEIFTISQVVPTYPVPTYKLKDYRDQPIAGG FT FYEQELLKAKYPDVYLVEKVLKKRGNKLYVKWLGFDNTHNSWINKSDL*" XX SQ Sequence 12608 BP; 3779 A; 2487 C; 2504 G; 3838 T; 0 other; acatgctgca tcacaggtca gacagcatct acgcaatgca taccagcagt agctagccgc 60 atggatcaag ttagacgtga gcacctttat ctgtacgcat cgacggcata ataagctcga 120 ctgcaaagtg tgacgagagc cgtgcgacgg gggtgattgc gtcagcggtt agctgcaggg 180 tggcttcgga cggcatactt cctgctgact ccaggaggct gctcaaggtg tgcatctgac 240 accgggagtc agctgctgtt gcaggctcca ggggcagcat atttcggtat actccgggta 300 gcttacgagg gatccgctca agctcagagg ctacgcatta cagaattttc tgggagcatg 360 ggatcaggtt gcatgtgttt ttccttgcca aacgtcagcc cggtgagcgg tagctgacag 420 tcgctgcagt atcgggttac ccagcattgt tttttcgcag gtccgtatat ccagcgtagt 480 atgtagccgc ggatattttg tcaagtggtg tcttattgtc actgctgcga atcactttcg 540 atcaagtcat ctttcgctag tgatattctt cttctacgaa tgcgcggctc ttttgaagac 600 gattcgcatg aacgtcgcag tgaatattat tattttctgc gataacgatg tatttttcac 660 tcacattgcg gctcttttaa gatgatctcc catgaacgtc gcagcgtata ttattatgtt 720 tatgcgtgtg ccttattttt ttcaccgctg tgaatttgcg cttcacgcag tcgaatcatc 780 tttcacaacg gatattatct ttctttgaaa tagtcttacg agcgcgcttg gatataagca 840 ttttgttatt attcaattgg atagttttac atttttctaa atagaggggg acccattaac 900 gggttgctat aaaagccagc ggactgacag aaaatgagca gttcgacttt tgagtaaata 960 ccttacacat cgtactatat caagcttact ttaagttgaa gtatacgaac tatggatttt 1020 gaagaattaa ataaagttgg tcaacacgcg tatctgccaa cgaaaaaaat ttctgaatta 1080 gaagtagaaa aaaagtttaa aataactaaa ataaaacagt ttaaaataac taaaataaaa 1140 cagttgaaaa caaaatttgg aaataaaatt tacgttgaat tagatgattg tttcggtatt 1200 tttgtaccat caagagtttc aaaatattta acagataatc aagattattt aaacaagtta 1260 ttagctcagt ctcaagaagg aaaaatttat ttacaatatt ttggtggtga tacaaataaa 1320 gtacaattcg gaaattttta aaataatcat aagagctata ctagtatata ctcatatgtc 1380 atctcgaacg cgaagagaga tgaaacggcg tggcgtgtcg aattaaaaaa gaggaacgca 1440 caacgctgca ctgcatattg tttatgaaaa cttatcctct accgtatact accataatac 1500 tgtctacgtg acccttcgag tatataaagc ctaaggatca tcagagagac catagttcca 1560 gctcatccac tcaagctaag acatcttgtc agttcttctg ttagctgcat tctctctcat 1620 cgtatccgag tttctacgtt tccatcgtat ttaacagtaa gtatcaagat attgttttat 1680 tttaaagatt tttgaatcta tcttatagta aacaaaaaat catgccgata ttagcgcagc 1740 ttgtttgctt taacgaaatg atactcggaa aaaaattatt atacatagat tttattatgt 1800 gcaatgtgat aaagcaagct gactccgctg cgtgcatcca tatttcattt taaaatcata 1860 gaaaacggat atctttttta gctgtcattt ttagtaaata ccttaaactt aattaatact 1920 tataaataat atttttattg caggaattag aaaaatgagc agcagcagtg gctatggaag 1980 taacagcaat gctagcgggg acagcggcgg ctcatccaac ggggagtata tttttatact 2040 tccaagagtt cgagcacctg caggaatgga ttaaccctac tcctgctgaa ctgcgctcta 2100 ttaatttggc tatagaccag tacgatgatg acacgtttga aattttcatg gtcatgagat 2160 ccataccaac gcatctgatg aaagtattca acatttttag ctttcaaaaa gtgtcaagtt 2220 taccaaataa cttctaaatt aaatatattc aataatgtag tgcaacgttt tgaaatgtta 2280 tatgtagtgc aacgttttga aatgttaagt gtagtgcaac gttttgaaat gttaagtgta 2340 gccttttgta gcaatataac cgctcttgga aagtattaca gtggttaaat aaaaataaaa 2400 gaaaaaacaa ataacaattt ttgtacactc gctggtagtg gattctcgtg gaatgtttgt 2460 gaaatgaagt agccttattg gttagttttt tttttgttaa gtcaaaccct ctgattggag 2520 cataattaaa aaaaaaaaaa aacctttccg taagaatccg ctactaccga gtgtacataa 2580 atcctatgat tattccttct cctaatgcac gtatttttat ttttaattat tcaattattt 2640 tgaaattttt taaaaaaatt ttgatgttaa aatttttaaa tttgggtttt ttaattgaaa 2700 ttatttttaa ttatagaggg attaggtaca ggtttataaa atagtagtca caattaaaat 2760 aaatacattt attttataga caacatctca agttttttca taagtactgg tacaataatt 2820 ttgtccgttc tttcaatcct taatttaaaa cgctctcgat ctcgggctgc ttgctcccac 2880 tctccacgac gtgccttttt atatgcatac acccatgtac acattaaatg aactgttgga 2940 ttttcttgaa aacgcacctc cttcttcatg ttgctattca ttttgagcat ttagtttaca 3000 ctgcacacaa tctcttttta aaggtggaac tccactctca attgaattgt gctctcgaca 3060 gcgtagacac tccaaaatct ctgagtcctg ctttaggtgc tctggcagct tatcccaagc 3120 tagctcgata ctattggcgg caaatgttat gaggaattct cgtggtaaat atgcaatatc 3180 tctcagagtc ataaatgata ggttctctgt gcgataaaat tctctgagag atctctcaaa 3240 agatggcgta ttgtcgtgat aaaacttttt cagcaacaat acatttctgt gtgcgcagtt 3300 tgccttgaac gctaacttat ggtgtgggca cagtggctca caaaatatca tatttcagcc 3360 tttgtaacga gggaaagccc atttcaccca tatcaactac gtgaaaattg aacttctcga 3420 gccattgctt cttctcggct cctttgacgt acacaacact cgcgttatgt agaacagttt 3480 tcaagacatt ttccagctcc tcgtaggcca gatcccccga actccatgat aatccgtgat 3540 aattgcgttc cagccagaga ttttcagact tgtacttggc tgtgagtcta cgccaaggtg 3600 tgggcggctt gaaaaaaaaa taccaatgtc ttgagttgag cattaagtgg aactatggca 3660 agttccttga cgacaaattc gttcacaggc tttctgaatc cttgtacatc gagcacgtac 3720 tccattttga ctgaatgcct accacaagct gcttctcaat ttataccatc ttctcccacc 3780 accctttaaa caattttttt cacatctcca ctcatagggt tgtattcgac tatgcgatcg 3840 tgtaggatga gacagtatgc agaggtgcca actggaatat ttgccctagc ctcaaattcg 3900 agacgtacat caactgatgc ttgcttaagt gactcatttt gttttgagca gtcaatgaca 3960 atgagagggg catacgacag atagtcggct tttttcagca ttggttctac ctctttgttg 4020 tagtaagcat tttgaaagtt ggcatacata tcgtagagta gagcatactg attattagta 4080 atattgagat tgagatttcc atagggatag tgttgagagt tgagaaaaag tgtaacgttg 4140 cttatattgc aatgatcaaa gcgactagca ttggctgttt ttcgtccttt gcgatttgtc 4200 tggaagccaa gaataacaaa gcgaggcttc tccagctgat tggcagtctt gacagtccac 4260 acgtgtttcg atgtactggg caacaatgga tactcgtata gctcccagct acgaaaactc 4320 atggtgatag gtctatcctt ttccagatga ttgagcaatt taattttatg tttatccgag 4380 agcaccacat atggcattag ccattcaatt ttggtgattt caacttggta atcctcgtaa 4440 acatttgcac cagccacgac tacttgcgtc tgcacaatag aattcacatc agaatttgat 4500 cttgtgagaa taagctcgtg tttggcattc acaacaattt tgcgataatc ttcggcaaaa 4560 cccaggatca taccaagtgg aattgatacg tcaaagtagc cattatcgtc aagccgttga 4620 gtctctgcaa tgtcgagcca gcctgcattt tcaatcgctg agctctgact agggttgaat 4680 gatacgaatc ctttcatgag acttgtcaag cccacattct tgcatcgatc aatctctatg 4740 gcgttgagct cgtaacgaat ttcctcaaat agatgacata tggcgttgtt tacaaatcgc 4800 gtgccctcga tagcagtgcc atctggcttt tgcaattttc cacatatatg tagtgaactg 4860 cgcgaaggca gcacacatag atcttgatgt tgtataccaa tgcgaatttc atcgctgtta 4920 ttgaaattcg ataaagtata gggctgatgc gcatgaatct cgtaatgcgc aacagattcg 4980 tcaaatacga caggtgactg tatgcttagc acctcttcca tttcgctgaa agcagcaaaa 5040 tattacacaa cttatttttt tccaaggaca actcttaaac ctagactttt tagaaactgt 5100 atgttttgac gtgttagcgc tttgagcact cgtcgacgac tgatcgctcg acggcatggc 5160 acgttactat tataaccttt accaacttta gtattgtaca caatacccat aatgctcaga 5220 ctgatttgat gtgtaatcta attgttatga cttcaccacg aaaattaact aaatgaccgt 5280 cttgatcgac gatgtggagc tgcaagttat ttatagtctt gacggtgact ggtaggtaaa 5340 taattggtga tggtacttcg ataatcttgt agcctggtga gacagctgga aagaattcgt 5400 gtatggtgtg taccttatgc tcatttatgt atgccccaga tgtaatatta cactcgacgc 5460 gtaaagcgtt gatcttgaga atggttactg gcaagtcgga ttcgtaatca gtatttggtg 5520 acaatacgcg actcgtgaaa cccaacaatt tgccgataga atcgtctggc tggaaatcaa 5580 tcgatcgatt acacttgatc atgcaacgaa gggtgttgtt attcggcttg agtttgaggg 5640 ttatgccttt tggtgtaagc gcctctttca gactcttctc aatgtcttca atctcgtagc 5700 tgcctgtagg caaaattatc acttctcctc ccacgtaaaa cttgttattg cctgtatcga 5760 tgtttggtat tgaattgaat gtcaacaact caacaagtcc gagaacgtaa cttttgttag 5820 ctgaaagttc aagcggtgga aaatattgag cttccaatac agctgaagtt ccagacaggc 5880 tcagtgttag tgaatcttcc atgactactc gaactaaact gactgaagcc gaagcagaac 5940 caacacttat atatcattca gaagaaattt taggcaaaga tgcccacagt tgtagctatt 6000 gtaatcttga taatgctcct gattgtactt gactctgctc acattccaat attccatgag 6060 atcctgaggt ggctgcaaat caccaaaact gtcaaaatac actacatggt tgcctcgttt 6120 tcgataggcg acccaatgcg tgcctggtcc gcttttatcg tcaagattga cgattgctga 6180 ttcgcgatag tgtggtccac tcgctggtag tgcatttcgc atgtaaacac cccgaaaata 6240 tggaattttc aggatgcggg catatttgag aaggtcagcg tcagttagtg cgcgttgcgg 6300 caatttgaag tttttttttt ttttcttaaa ccttgcccac gtttcaggta cataccatag 6360 ccctctttgt aaggtttcag atagagacct cggccaccga gcgcaatagc ttccattgtt 6420 ttgttatgtc tttggctttc gtcaagctgt cgtttagcgg cgctagcatc gttcactgct 6480 ttagcgatac cagcagcgcc acctgctaat gcacctgttg cactcaaacc ggcaaatatc 6540 ggaatgagaa atggtaggaa tcctccgatt ttggatggga ctgacagcat acgtggaata 6600 cgtatatttt tcttgccccc agccttgttt acagcttggc gagcagcctt cagagcgcta 6660 cgaatgactg cgtgagagtt gttgcttgga acagcagaag tggaagcagc tttgatcact 6720 ttagccagtg aaatagatgg cttcttcttt ttaatccgtt taaggcccat tccaaactta 6780 gattttgctt tcatggtgcc tgctatggcg agagcagcta ctttttcacc aactcctgcg 6840 tctgccgctc ggactcgctc ccaggctttt gcggccaaaa ccttatctgc agcgtttctt 6900 gcttcaacgt tttctcgatt ctgtgagtag gcaatatcgt gctccttgca agctctatca 6960 agagtattga ttccaggatc tccacgtgct agacgttttg ctagtttagt gccggggccg 7020 cagtactgat atcctgggag atgcaactcg actgggagct tattgataag tgaattcact 7080 aaaccacgac ctttgcgtcg ttgctgctta tgtatgatca tgatgcatga gagtacagct 7140 ctcaactgat cccaatacat tataaacgcg ctttttatag ctaacagcaa tagtcgctct 7200 caagctgacg aagcgttcac atggatttcc agcaacaaaa aatcaagcta ccggtgtcga 7260 atttcgatac tctaacagag cagcaaaaag tgcaaaaacg tcacggggaa ttactgcctg 7320 acaatattcg agcagtattt tgtggaccat caaattgtgg taaaaccaat acactgctcg 7380 ctctcataac gcatcccaat ggactcaagt ttgagaacgt gtacatctat tcaaagtcgc 7440 tgaaacagcc caagtacaaa tttctcgaga agcttttgcg acctctacct ggtatacagt 7500 acttccccta cagtgaacat gatacagttg tgccacctga cgacgtactc cctaattcaa 7560 tcatggtttt cgatgacgtc gcctgtgaga gacaagacaa cgtcagggct ttcttttgca 7620 tgggaagaca caaaaatgta gactgtttct atctctgtca gtcatacgct cacattccga 7680 aacatttggt gcgcgataac gtaaatgtgc tagtagtttt tcgtcaagat gacgtaaatc 7740 tcaaacacat atacgatgat catgtgaatc ccgatatgac atatacaaac ttcaaagctt 7800 tatgctcagc atgctggaaa gagggtaaac atggatttct ttttatcgac aaggagcgac 7860 ccatcaatga tggacgatat agaaaaggtt ttgattgctt cgccacaaat atataacgta 7920 tagcaggtgt gagtagttgc agtagactgt gattcatcaa agagtcaaca tggcagactt 7980 tcgacgtgaa aaggcagttc ttcagcaaat tgacaaagct agagaagcaa ttagacgcaa 8040 acatagatta ttgaagcaag gtaaagctag catggaaaaa gctctgggcg agacttttaa 8100 gcccatagta gatccactag agaaactggt gagagtgaaa gaagaggtta agccagatat 8160 taaacaagag ttaaaaaaag aaaaaagttt cgacgatgat gttaaagatg acgaagataa 8220 tgaggatgat acgctactga atgaaacagc tgatacttca tttgaaagtg ctcaagagga 8280 tagtgaggac gagaagattg agaaagacag taaaataaag aacaagcatt cactatcgtt 8340 aaaagaaaat cgtcatgaga aagacacgga aatgaacaat aagtatctgt cggcgttaat 8400 gcaaaatcgc caacgatatc tcgatcacgt gtttggaatt cgtcaagatg gtcaaatact 8460 caagattggt gattcaccta tagaaatcac cagaaataat ataattgttt tgaatagaaa 8520 atatccaaaa actgaaggac tgcaagagct gctttttaga ctacatccag atgaggcctt 8580 gattaagtca gatgatttgc aaaattataa aaatattctc gaagctacaa acgcacacag 8640 aaaatattac agttacaacc agcctatacg tcaacagaac agtaagaaat ataataacat 8700 cattgctcca cttttcaaag tgcaaagcaa gaccggtggt gggcttctac caagatataa 8760 agttgcaaga aaaggaactc tcatggacta catatattgg gatgatccca acgaactagt 8820 tgatcgtcta cgcttgctca tggctgaacg attagctgga aataatagtc atctgaacga 8880 aatccattcc atcatcgaag agttgcgaga agctaaaatt atatattaac cgtagacgtg 8940 agcacgttga agtcagtagc aagatgagcg tcgatgtgtt tggacgtcag tcgaatcgct 9000 ctaaaggtcg tcgtggtcca cctggtgttg gctacaaact aactgccgac ggacacttcg 9060 acattggaag cagaagattg tgtaacgtag cagaacccct agagctcaac gatgcagtga 9120 atttagcagt ccttcagcga tcaatcaatt cggaaatacg cagcgttcac gagatcacaa 9180 gtgatctgag acaatcgatt gatgctgtag aattagctgt acaaatcgca aaggatgaga 9240 atgctgaaaa actgagaaaa atagatttag ctataagtga attgtatgag attctgctta 9300 aaaattctca gaataataat gacacatcaa aaggcgagac tagtggagga acttcataaa 9360 ccagctcggc gtaattatca gcgtcgacgg tttataatgc gtggcttgga cgatacctgg 9420 caagctgatc tggtggagat gcagccttat gcacaagaga acaaaggcta caagtacctg 9480 ctgacggtta tcgatgtctt ctcaaagtac gcttgggccg tacctttaaa gcagaaaact 9540 ggcaacgaag ttgcagcagc aatgaagtct attctgcaac agggtcgtgt gccaaagaac 9600 ttgcataccg atcgaggcaa agaattttac aactcctcgt ttgaaagcct tatgaaacag 9660 tatggcatta atctctactc gacatatagc aatctcaaag cttcaatctg tgagcgcttt 9720 aatcgcacac tcaagaacaa gatgtggatg caatttagtt tacgaggtaa ttataaatgg 9780 ctcgatatct tacctgattt actgacggag tacaataaca gcaagcaccg caccatcggt 9840 atgaaaccta aagatgtcag cagagagaat gaagctaaag tccttaaacg tttctctcac 9900 caagaaagca gaaaaaaagc caaatttaag gttggagata aggtgcgagt gagcaaaatg 9960 aagcaagtat ttgaaaaggg ttatacaccc aactggtcga cagagatttt caccattagt 10020 caagttgtgc caacgtatcc agtgccaacg tacaagctga aggactatcg agatcaaccc 10080 atcgctggag gtttttacga acaggaattg cttaaagcca aatatccaga tgtttatctc 10140 gtggagaaag tattgaaaaa acgaggaaat aagctttatg taaaatggtt agggttcgat 10200 aatacgcaca atagttggat aaataaatca gatttataaa taaatgttta caaattgtct 10260 ttttattcta acattgtata caatttattc ttaatctgtc tgcataatct tataatggcc 10320 ccaaggtaaa gtatcagttt cacctcgtag caagtacctt ttatcatcgt gtgggcttaa 10380 agctaatttt acttgttttt cactaaatac attatgcgtg tgagaacgaa taatgcattg 10440 ttcacgagtc tgtgtagaca tgttctgtaa acactgtatg taatcgtcaa aagttattga 10500 attttgtact acagcagttt tcacgccttt tacttttttc accttgtcct gaccttccac 10560 tcgaatactg tacattttac tgcgcaaacc aacaaattca gtcattatgc gtccgttgca 10620 ctcatctttc attaagccaa ctactttctt gtttgccact ggcattccaa aggtattgtt 10680 ctccgaatag tccgaagtgt caaaatgttc aatatcccgc ttcataactt catatatatc 10740 tatattacgt aattcgtata ttaaactatc cgtgtctgta tacagcagct tacaatcatc 10800 gccgaactgc tttttcatat aatcatagtg aaagctatat aacgtttttg ataaatcaag 10860 tacacagagt ccaacataaa taggtttatc catattaact tcagttttag tcagttctat 10920 aattgtcaaa ttttcatcta gtatgctgcg actgtgaaaa ttaggttgac taataagagc 10980 ttcagcaccg tacctaccgt cccatttggt aactattttc aggtcaactc gctttctcac 11040 gttttccata gttttaccaa atactgcatt attcattaac ttgtaaagat ttttttcaaa 11100 ttcatttttt gcttcttttc tcatgctact attcaaatct atatacgctt ttagccatgg 11160 cctctgatta aactgtaaga ctctatgtat tcgttgaagt tgcaatccat tagccagagc 11220 ttgctttaaa tttcgataat gcaaaacata tcgctcttta gagtgtaaag tcgtcatgag 11280 cttcgagtac gttgaacctg gtggcttcat gtgctgagga catagaggca agtctctatg 11340 atcgtcgtga agctgctctg gatactccaa gtccacttca agaatgtagc caattgctga 11400 ttcatcatct acgctggaaa agtcggtttg actgattgct gtctctgtca tccacttaaa 11460 gcctccatat ggtagtgatt gttgcattgc ccatccatac agattattaa catcataata 11520 gattaaatat ttggaatatt gatcagcatc atagtcggta cccatgtaaa tgttattcgc 11580 tcgagaatag cggttacagc attgacttat tcctcctctt attccacgct caatgaacat 11640 tactttgtca atgtcagtga gtagctctag ttcaatattt gtatatttga gcattgcgtc 11700 ccaagttaag cctggtgctg tataatagtg tgctggatct aatccgtatg ctttcagaca 11760 gctggtgcga aaattttcaa aaacttcagc tagcagcagt acatctgttt tcaaatagag 11820 gtctgaatat tgacccaagt ttgaaatatt gaatttattc caaaatcttt tggcggtagc 11880 tcggaaacgt tgagtttttc caatgaattc acataatcgt aagggaaaac gccttttctc 11940 cttagcaatt gatatttctc ttcagtcagc tggttatctt tgttaaactc ttgcttgagg 12000 atattcacat cttcgaggta agatgacaac ttgtctagtg acatgccata aatcgaaaag 12060 agtcgataaa tttgaaattg atatcactac catcaatgtg tttcgtaaat gatataaatt 12120 tctccttatt tataggcagt aggttgactc taccaggtat acattgagca atatctctga 12180 ttaaaaagtg tgaatcatat ccagtcaggt tgtgaaagat cactggaaca gagcgagaat 12240 cttgatagtt caagttacac ttctcatgct cctcgatatc tgaaacacat ttaagtatag 12300 tacttgtcat cggggcgcta ttgttgttat tattattatt acgtgaaaat attaccttcc 12360 agtgagatgg cagtgatcac gtactctatc aagctccaga actttctcac aaatgtgaca 12420 agatgtagct cgctggaaag actgctcttg ctcccaagtc aatgtctcaa taggtatggg 12480 acaccaaaat actgtttcaa tgttttcagc aagttgtaaa agctcgtcaa caaaccattt 12540 tgcaggatta ggtcctctat agctacgata atacgagagt gaatcatcgt aactgcattt 12600 catataat 12608 // ID Gypsy-16_CQ-I repbase; DNA; INV; 4538 BP. XX AC AAWU01025728; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_CQ_; KW Gypsy-16_CQ-LTR; Gypsy-16_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4538 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 411-411 (2011). XX DR Genome; AAWU01025728; Positions 30168 34705. XX CC Positions [3450-3800] - Integrase core CC 'GGAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 168..4505 FT /product="Gypsy-16_CQ-I_1p" FT /translation="MDADQLKQLMDHQMKMVEEMIKGMSTANAQMVALAVT FT RATAAANVPAQSSQVPLPSPLVLEGDMEENMDFFEKSWNNYVKATRMDKWP FT ATDDQQKVSFLLSVIGEQARKKFFNFELTVDQSATPQNALTAIRAKVTPKR FT NIILDRLEFFSATQTSRETTDDFLTRLKTLAQMSKLGDQTNSLVTYKLVTA FT NKWPQIRSKMLAIPDITLDKAVDMCRAEEITARRSLELAMLPPESDVKKIG FT RKKVQSKAKQFRCKFCGDAHEFVKGACPAFGKKCNRCKGRNHFEKVCKQKS FT KPSRKSSKRVKEIKDDSSGTEEYSSHDDASSEDSEEEEYEIGKIFDDSDRG FT GSVCAELEVKLNKKWKKIVCELDTGANTSLVGYDYLVKQVKEDKPVLLPSK FT LRLQSFGGNPIKVLGQVKVPSRRKGRKYKVVLQVVEGSHRPLLSAKASRVL FT GFVRFCKAVSFGGPNPAFPPSDLLNIYRVEAQRLADSHKDLFNGYGKFDGE FT ISLEVDDSVAPSIQPPRRVPIALRGKMKHELEVMERDGIITKESAHTDWVS FT NILIVQRGDPNSPSVRVCLDPVPLNKALKRPHLQFVTLDEILPELGKARVF FT STVDAKKGFWHIVLDEKSSKLTTFWTPFGRYRWIRMPFGIAPAPEIFQMKL FT QGTIQGLDGVECIADDILVYGTGNTMEEALINHNERLKKLFRRLEEHHVKL FT NREKLKLCQTSVKFYGHVLTDQGLKPDESKISTIRKYPTPTNRKEVHRFVG FT MVNYLSKFIPNLSANLTNLRKLIVESQPWKWTSAEADEFDRVKSLVSDIGT FT LSYYDVSQPITIECDASCFGLGAAVYQNNGVIAYASRTLTDTERNYAQIEK FT ELLAILFACIRFDQLIVGNPQATVKTDHKPLITIFNKPLLSAPKRLQHMLL FT NLQRYRLSIVFVTGKENVVADALSRAPEPKDRLQEENKKWNIYKVFKELEN FT VEMSNFLSVSDSRLSEIMEETEKDPSMQQVIEFVQRGWPASADRVSENVKV FT YYGYREELSTQNGLVFRGDRILVPHILRRKLVDSCHVSHNGVEATLKLARA FT NLFWPGMSSQIKEVVARCNVCAKFAASQQNPLMKSHSIPVHPFQMISMDVF FT FAEYQGHKRNFLVTVDHYSDFFEVDLLKDLTPQSVIAACQQNFARHGTPQR FT VLTDNGTNFVNRQMVKFAKDWDFELVTFAPHHQQANGKSEAAVKIAKRLLM FT KANETGTDFWYALLHWRNIPNKIGSSPSARLFSRATRCGVPSSINNLLPKV FT VESVPEAIERNRKKIKFQYDKKARVLPELTTGSPVFVQLNPETSKVWTPGT FT IANRMNDRSYCVDVAGKEYRRSLVHIKPRQEHATTPVRSGTPVPESLPESG FT SNSAAADFPAITPSCQMDEFTGENSPPAAMPQSPPLMVSAPAATTRGAVRG FT TESKDNHVGTPGVTRPRRETRIPAKLKDFILH" XX SQ Sequence 4538 BP; 1211 A; 1184 C; 1233 G; 910 T; 0 other; tggtgtcaga agtgcgtgaa ctctgagaga catttccggc gtcgtctata tcgcggaaag 60 tgttgatttt ttcataaaaa agcgaaaaag ttttcgcgtc gaagttcgcg tcgttttttg 120 gttttcgtac cgccattttg ttttcgcgaa gtgaacatcc tcgtaaaatg gatgctgacc 180 agttgaagca gctgatggac catcagatga agatggtgga ggagatgatt aaaggaatgt 240 cgacggcgaa tgcccaaatg gtggccctcg ccgtaacccg agccacggcc gcagcgaacg 300 ttccagcgca atcatctcaa gttcctctgc cctcaccgct tgtgctcgag ggtgacatgg 360 aggagaacat ggattttttc gagaaaagct ggaacaatta cgtaaaggca acacggatgg 420 ataagtggcc agccaccgac gaccagcaga aggttagctt cctgttgtcg gtgatcggtg 480 agcaagcgag gaagaaattc ttcaacttcg agctcacggt agatcaaagc gccacaccac 540 aaaacgcttt gaccgcgatt cgagccaagg taactccgaa gcggaatatc atcctggacc 600 ggcttgagtt cttttccgcg acacaaacgt cacgggaaac aacggacgat tttttgaccc 660 gactcaagac gctggcacaa atgtccaagc tcggagatca aaccaacagt cttgtcacgt 720 acaaacttgt tactgccaac aaatggcccc agatccgctc aaagatgttg gccatcccag 780 acataaccct cgacaaagca gtggacatgt gccgggcgga agagatcacc gcaaggcgca 840 gcctggagct ggcgatgttg ccgccggaat ccgacgtaaa gaagatcggc aggaagaagg 900 tccagtccaa ggctaaacaa tttcggtgca aattctgcgg tgacgcccat gagttcgtga 960 aaggtgcttg tccggccttc ggcaaaaagt gcaatcgttg caagggccgc aaccatttcg 1020 agaaggtgtg caagcagaag tccaagccaa gccgaaagag ctcgaagcgc gtgaaggaga 1080 tcaaggatga cagcagcggg acggaggagt acagttctca cgacgacgcc tcatcggagg 1140 acagcgagga ggaagagtac gagatcggaa aaattttcga cgactccgac cgcggaggaa 1200 gtgtttgtgc cgagctggaa gtgaagttga acaaaaaatg gaagaaaatc gtatgtgaac 1260 tagacaccgg cgcaaacacg agccttgttg gatatgacta tctagtgaaa caagtgaaag 1320 aagataaacc agtgttgttg ccatcaaagt tgcgactgca gagcttcggt gggaatccca 1380 tcaaagttct gggtcaagtg aaagtgccaa gccgtcgaaa aggacgcaaa tacaaagtag 1440 tgctgcaagt ggttgaagga agccatcgac cactgctctc ggccaaggcc tcgagagtgc 1500 tgggattcgt gcgtttctgc aaagccgtat ccttcggtgg tccaaacccg gctttccccc 1560 cttcggattt gcttaacatc taccgcgtgg aagcccaaag actcgcagac agccacaaag 1620 acctgttcaa cggctacggc aagttcgacg gcgagatttc gctggaggtt gatgactcgg 1680 tagccccgtc gatccagccg ccacgccgag tccccatcgc attacgcggt aagatgaagc 1740 acgagttgga agtcatggag cgagatggaa tcatcaccaa ggagtcggcc cacacggact 1800 gggtcagcaa catactcatc gtgcagcgcg gtgatccaaa ctcgccgagc gttcgcgtgt 1860 gcctggaccc agtgccgttg aacaaagctc tgaagagacc gcacttgcag ttcgtcacac 1920 tggatgaaat ccttccggaa ctcggaaagg cccgggtttt ctcgacggtt gacgccaaaa 1980 aagggttttg gcacatcgtg ttggacgaaa agagcagcaa gctaacgaca ttctggacgc 2040 cttttgggcg ctatcgctgg atacgcatgc cctttggcat agcaccggct ccggagatat 2100 tccaaatgaa acttcaagga acaatccagg gtctggacgg cgtcgaatgt attgcggacg 2160 acattctcgt ctatgggacc ggaaacacga tggaggaggc actcatcaac cacaacgaac 2220 gcctcaagaa gctcttccgc agactcgagg agcaccacgt taagcttaac cgagaaaagc 2280 tgaagctgtg ccaaacgtcc gttaagttct acgggcatgt tctaaccgac cagggactga 2340 agccggacga gagcaagatt tcgacgattc gcaagtaccc aacgccgacc aacaggaagg 2400 aggtccacag gttcgtcggg atggtgaact atctcagcaa atttattccg aacctcagcg 2460 cgaatctcac caacctgcga aagcttatcg tcgaatcaca accctggaag tggacaagcg 2520 cagaagcgga cgagttcgat cgtgtgaagt cgctggtttc agacatcggc acgctctctt 2580 actacgacgt gagccagccg atcaccatcg aatgcgatgc aagttgtttc gggctgggtg 2640 ctgccgttta ccaaaacaac ggtgtcatcg cctacgcgtc gcgcacgctg acggacaccg 2700 aacggaatta cgcgcagatt gagaaggagc tgttggcgat cctgtttgct tgcatccgtt 2760 tcgaccaact catcgtaggc aacccgcaag caacagtgaa gacggaccac aagccgctga 2820 tcaccatctt taacaagccg ttgctctccg cgccgaaacg actccagcac atgctgctga 2880 acctgcagcg ttaccgtctc tcgattgtct ttgtcacggg caaagaaaat gtagtcgctg 2940 atgcactatc acgcgccccg gaaccaaagg atcgtttgca ggaggagaat aagaagtgga 3000 acatctacaa ggtgttcaag gaactcgaga acgtggagat gagtaacttc ctgagcgttt 3060 cggatagtcg attgagcgag ataatggaag agaccgagaa agacccgtcg atgcagcagg 3120 tcatcgaatt tgtgcagcgt ggatggccag catctgccga ccgagtctcg gaaaacgtta 3180 aagtttatta cggctaccgc gaggaacttt ccacgcagaa cggtcttgtc ttccgtggtg 3240 accgcatcct cgtcccgcac attctccgtc gcaagctggt ggacagctgt cacgtgagcc 3300 acaacggcgt cgaagccacc ctcaagctcg cccgtgctaa cctcttctgg ccgggaatga 3360 gctcccagat caaagaggtc gtagcacgct gcaacgtgtg tgccaaattc gccgcttctc 3420 agcagaatcc gctcatgaag agccacagca ttccggtcca cccgttccag atgatatcga 3480 tggacgtatt cttcgcggag tatcaaggac acaagcgaaa tttcctggtc acggtggatc 3540 actactccga cttcttcgag gtcgacttgc tgaaggacct tacgccgcaa tccgtcattg 3600 cggcctgtca gcaaaacttc gcgcggcacg gcacacccca aagagtgttg acggacaatg 3660 gcaccaactt cgtcaaccga caaatggtga aatttgcgaa agactgggat tttgaactcg 3720 tgactttcgc accacaccac cagcaggcga acgggaagtc cgaagctgca gtaaaaatcg 3780 ccaaacgctt gctgatgaag gccaacgaaa cagggacaga tttctggtac gcgctgctcc 3840 actggagaaa catccccaac aagattgggt caagcccatc tgcgcgattg ttttcccgtg 3900 ctacaaggtg tggcgttccc agctcgatca ataatctgct gccgaaggtg gtggaaagcg 3960 tcccggaagc tatcgagcgg aacagaaaga agataaagtt ccagtacgac aagaaggcgc 4020 gggttttacc tgagttgacg accggatctc cagtgttcgt ccagcttaat ccggaaacat 4080 ccaaggtttg gactcctggt acaatcgcca accggatgaa tgatcgcagc tactgtgtcg 4140 acgttgccgg caaggagtac cggcgcagcc tcgtccacat aaaaccacgt caagaacatg 4200 caacgacacc cgtccggagc ggaacacctg ttcctgaatc gctgccggaa agtggatcga 4260 attctgcagc agcagacttt cctgcgatca caccgagttg tcaaatggat gaatttactg 4320 gagagaactc gccgcctgct gcgatgccgc aatcgccgcc gctgatggtt tcagcgcctg 4380 ccgcgacaac acgaggagcg gtgcgtggaa cggagagcaa ggataaccac gtgggcactc 4440 caggagttac acgaccgaga cgagagacca ggattccagc gaagctgaag gattttattt 4500 tgcactagcg taacttttac ttttcactgt ggagagga 4538 // ID hAT-3_SM repbase; DNA; INV; 2569 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 08-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2569 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1032-1032 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 544..2259 FT /product="hAT-3_SM_1p" FT /translation="MLVCLICQQTVAVRKEFNVKRHYQTQHAEKFDKLNGQ FT LRKDKIQNLTAELSRQQTLFTKVNKDVECVTKASFEISEIIAKKLKPFSDG FT EFIKECIDAAVDCLCPEKKQLFSSISLSRMTVTRRIDDMADNIEAVLKEKA FT ENFDAYSICLDESTDMTDTAQLAIFVRGVDSDFNITEELLKLASMKGTTTG FT EDIFDEVKDALQENNLDLLKLTGITTDGAPAMVGKKNGLIGLFQTELDKSE FT NNTELIASHCIIHQENLCAKQLKMDHVMSVVVKTVNFIRARGLNHRQFKEF FT LADLEVEYCDVIFYSEVRWLSRGKVLQRFYNLRKEILVFMEMKEKFVPELL FT DASWILDLAFLVDITTHLNNLNTALQGKNQLVNNLYDHVKAFQAKLKLWEF FT QFSKGDTVHFPCLTVCKKTNEDLDKIKKYTEQISILRTEFSTRFSGFAAHE FT HDFKLFSDPFNFNAEAAPDSVQMELIELQCNSDLQRKHHEVSLIQFYKLYV FT SPVKFPSIRFHALKMVSLFPSTYICEQFFSKMKLTKSKNRCRLTDDNLSNQ FT LRVATSNIKANISELCHDKKCQMSH" XX SQ Sequence 2569 BP; 885 A; 419 C; 472 G; 793 T; 0 other; caggcatgct caacttacgg cccgccaact gtttttatcc ggccctttgg cagctagaat 60 gaatagccag taaacaggcc cttcgtttac caacaagaat attaccatta aaagtcttgc 120 tgtgattagt agaccatcaa aatcacatct gggttctatt tgcttgccaa ctgtaatgca 180 gcagtgtatc tgtagtaaat atttgatgtg ggccaaatac cacccattat gtttaactac 240 agtcttacaa ttagtgacta atatgaccca caatgtcatg taggttttct gtaactgtat 300 gcaagcagcc aatcaatgtt tcttgtctga ttatttttgt atgaagtatt ggctttattt 360 taaaatcttg ttctatcttg attaaaaatt aaattttact attgttatca agtattcagt 420 attaatttaa taaaatttac aaccactaag aatgaactca acaaaaaaga ggaaagtaga 480 cagtgaatgc cgtttattta atgaacggtg gactccagaa tattttttgt tgactgtaag 540 aatatgttag tttgcttaat ctgtcaacaa actgttgctg ttagaaaaga atttaatgta 600 aagcggcatt accagactca gcatgcagaa aaatttgaca agcttaatgg ccagctgaga 660 aaggataaaa ttcagaacct cacggcagaa ctctcaagac agcaaacatt atttacaaaa 720 gtgaataaag atgtggaatg tgtaaccaag gcgagttttg aaataagcga gatcattgct 780 aaaaaactga aaccattttc tgatggcgaa tttatcaaag aatgcattga tgcagctgtt 840 gattgtcttt gcccagaaaa gaaacaactt ttctcaagca tcagtctgtc tcgaatgaca 900 gtgacaagac gaatagatga tatggcggat aacatcgaag ccgtattgaa agaaaaagca 960 gagaacttcg atgcatattc aatttgtttg gatgaaagca ctgacatgac agacactgca 1020 caattagcta tttttgttcg tggagttgac agtgatttta acattacaga agaattgcta 1080 aaattagcta gtatgaaggg gacaacaact ggagaagata tttttgatga agttaaagac 1140 gctttgcaag aaaataatct tgatctttta aaactaactg ggattacaac agatggagct 1200 cctgcaatgg ttgggaaaaa gaatgggctt attggtttgt tccagacaga gctggacaaa 1260 tccgagaata atacagaact tattgcatcc cactgcatta tacatcagga aaatttatgt 1320 gcgaagcagt tgaagatgga ccatgtgatg tcagttgttg ttaaaactgt caattttatc 1380 agagcacgtg gattaaatca caggcaattt aaagaatttc ttgcagacct ggaggtagaa 1440 tactgtgatg tcattttcta cagtgaggtt cgctggctca gcagaggaaa ggttctgcag 1500 cgattctaca atcttcgtaa agaaatttta gttttcatgg aaatgaagga aaagtttgtt 1560 ccggagttat tggatgccag ttggatttta gatttagctt ttttggttga catcacaact 1620 catctgaata atcttaatac agcattacaa ggtaaaaatc aactagtaaa caatctttat 1680 gatcatgtca aagcatttca agccaaactc aaactatggg aatttcagtt ttcgaaaggt 1740 gacacagtgc attttccatg tctgactgta tgcaaaaaaa caaatgaaga tttggataaa 1800 attaagaagt atactgaaca aatttcaatt ctgagaacag agttcagcac aagattttca 1860 ggttttgcag ctcatgaaca tgatttcaaa ttattttcag atccattcaa ttttaatgca 1920 gaggctgcac cagactcagt tcaaatggaa ctgattgaac ttcagtgcaa cagtgacctg 1980 caaagaaaac atcatgaagt atctttgatt caattttaca aactttatgt ttccccagta 2040 aaatttccat caattcgatt tcatgcgctt aaaatggtat cactttttcc tagcacctat 2100 atatgcgaac agtttttttc aaaaatgaaa ctcactaaaa gcaaaaacag gtgcaggctg 2160 actgatgaca atctttcaaa tcaacttcgt gtggcaacat caaatattaa ggcaaacatt 2220 tctgaactgt gtcacgacaa aaagtgtcaa atgtcacact agaatagacg ggtggcgatg 2280 attaatgttt tagaatttta gatgtatggt agtattctgt aaaacgagat ttgctaaata 2340 attcaagtat tttaatatgc aaaagtaaag caaacatctt gaaatttaaa agaatagtaa 2400 ttttaagttt aactatatat gccgattatt taactttata atttatactt taatacgttt 2460 tgtttaaatg taaattttat tgaaactgta tccttaagtt tgttccggcc ctaacaacat 2520 cccaaaatta ataaccggcc ctcaggtcaa tacaagttga gcatgcctg 2569 // ID CR1-44_AAe repbase; DNA; INV; 4552 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-44_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4552 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1131-1131 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 329..1216 FT /product="CR1-44_AAe_1p" FT /translation="MDTKCYQCSTPIKTFEFLQCKLCDXAAHLKCVGLKRS FT NMDFMNEHKNVLWFCDKCVDHLNLLKANPPITSHEIVTGVSEAIKESLADL FT KFDLQKTKELTESIVDKLPTQENLGLNTTRSAWPSIKRTREHVKETPRSRP FT NDKLAVGTKSVEKEQIVVETVPKPTEKFWLYLSRIARHVTEADISELVKNC FT LQTDDPIDVRKLVRKDADLKQFAFISFKIGLDLKLKNTALDPAVWPKGIYF FT REFENMSTERDFWGPAKIPRMDDGTPHVTGTPLIVSSPANRQTPLHQQRNA FT HHKA" FT CDS 1216..4440 FT /product="CR1-44_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MMEAPKPPDTVEPSRSSLVPACSGRPSRSVPVFGTGE FT GVFQHACSGKFNQLGISSLPESILISSAIAATTASLQPTRSDNASVNSLST FT ATSCLRADSEDANVQSALHVNVITTSSTKDDILDIYYQNVGGLNSCLNDYL FT LASSDCCYDIIAFTETWLTQNTLSSQVLDHQYEVFRCDRGPHNSRKTKGGG FT VLLAIRQGIKAHLIEDDTWSSVEQLWLAIELSDHTLYLCVVYIPPDRTRDD FT FLIEAHTNSVSTIVDRASNSDEIVVIGDFNLPGIRWRSSDNGFLYPDACHS FT TLNVSTTKILDRYSTATLRQINHVVNENGRSLDLCFVSSQDVAPNIALAPA FT PFVKPVIHHPALHISFDISNKVESKVDSATIFNDFRNADYDSIQRIFSTLD FT WDELLDVNDVNTAAQTFSHVLWYAINRHVPKKTVSEGRNQWQTPELRRLKR FT QKRAALKKYSKKKTASLRNHYVRVNNIYKSLSRRCFHRYQIGVQRKLKSHP FT KSFWNYAKQQRHVNGLPSSMKWNETIASEPQVICDLFATKFSNIFSEEVLT FT DRQISLASANTQTTGLLMPDVATDDESILTAVRKLKASTCSGPDGIPSVFL FT KNCIDSLLAPLQRIFNLSLSTGVFPSCWKSAYMFPVHKKGDRKDIDNYRGI FT SALCAVSKLFELVVTDRLYFHCKQVLSPDQHGFIPGRSTASNLLCLTSYVT FT DSMLEHAQTDVIYLDLSAAFDKVNHQILLAKLAKLGIHGSLLCWFQSYLYD FT RRSKVMIDDSVSADFAMTSGIPQGSHLGPLMFLLYFNDVNNVLEGPRLSFA FT DDFKLYCKIRSDADAQFLQQQLTAFANWCNVNRMTVNPEKCSIITFSRSKN FT PMAYSYDLNGVLLSRVDQIKDLGVILDSKLTFKQHVSYIVDKASKSLGLMF FT RITKSFTDIYCLKTLYCALVRSTLEYCSTVWSPYYQNGVLRIESIQRRFVR FT FALRLLPWHDPLRLPSYENRCRLISMDTLSTRRDVAKAVLVSDILQGRVDC FT PYILQQINLYVQPRLLRNNALLRPRTSRTNYGMNSAITGLQRVFNRVATDF FT DFNVSRDLLRYRFSNSLR" XX SQ Sequence 4552 BP; 1257 A; 993 C; 959 G; 1340 T; 3 other; ctatcgcatt agtttttttt tcactctgtt tgtgttgact atttttcgtt tacattcgaa 60 cgcaaaattg acgggagtgt attaagcgta tcgtgtttac atctgttttt ttttatcaaa 120 ctatctcgaa tttgattttg tggaagctgc ctgcaataaa tcgtgtttta tttgcatcgt 180 tcagtggwca attgtttttg gaacgaattt cttcctgcca tcatacctga cgtcatcgtt 240 tcgtktttcc gcaatcagcg ccatctgtgg acgaatagtt gcaatcttta tcgatcattc 300 atcgttcgag tcgacgaaga gagtggaaat ggataccaaa tgctaccagt gctctacgcc 360 gatcaaaact ttcgagtttc tacaatgtaa attgtgcgat maagcagcgc atctgaaatg 420 cgtcggctta aaacgctcga atatggactt catgaatgag cacaaaaatg tcctttggtt 480 ttgtgataaa tgtgttgatc acctcaacct actaaaagcc aaccctccaa ttacttcaca 540 cgaaattgtg accggggttt ctgaagctat caaggagtct ctggcggatt tgaaatttga 600 tctccagaaa acgaaggaat tgactgaatc gattgtggat aaacttccta cgcaagaaaa 660 tcttggactt aatacaactc gttctgcgtg gccaagtatc aaacgaactc gcgaacatgt 720 aaaggaaacg cctagatctc gtcccaatga caagcttgca gtgggaacaa aatcggtcga 780 aaaggaacaa atcgttgtag aaacggtacc taagcccacc gaaaagttct ggctgtattt 840 gtcaaggatt gcccgccacg tcactgaagc tgatatttcg gagttggtga aaaactgttt 900 gcaaactgat gacccaattg atgtgaggaa gctggtacgc aaagatgcag atttgaaaca 960 atttgcgttt atttcgttta aaattggctt ggatttgaag ctgaaaaata ccgctttgga 1020 tcctgctgtt tggcccaagg gtatctactt ccgtgaattc gaaaatatgt ctacggaacg 1080 ggatttttgg ggaccagcaa aaattccaag aatggacgat ggaacacctc atgtcaccgg 1140 gacaccattg attgtgtcat ctcctgccaa tcgtcaaaca cctttacacc aacaacggaa 1200 cgcacatcac aaagcatgat ggaagccccc aagccacccg acacagtcga gcccagccgt 1260 tcttcgttag ttcctgcttg ttccggccgt ccaagtcgtt ccgttcctgt gttcggaacc 1320 ggcgaggggg tcttccagca tgcttgctca ggcaagttta accaactcgg aatatcatca 1380 ctgcctgaaa gtattttgat ttctagtgcg atcgctgcta caactgcttc actacagccg 1440 actaggagtg acaacgccag tgtgaattcg ctatcaacgg ctacttcatg tttacgcgcg 1500 gactctgagg atgcaaatgt tcagtcagca ttgcatgtca atgtcataac cacgagttca 1560 actaaagatg atattcttga catttattat cagaatgtcg gtggattgaa ttcttgcttg 1620 aacgactatt tgctggctag ctcggattgt tgttacgaca tcattgcgtt taccgaaacc 1680 tggttgactc aaaacactct ctcgagccaa gtcctagatc atcagtatga agttttccgt 1740 tgcgatcgtg ggccacataa cagcaggaaa actaaaggtg gtggagtcct ccttgctatt 1800 cgacaaggca taaaagctca cttaattgaa gacgatactt ggtcttcagt tgagcagttg 1860 tggttggcaa ttgaactatc ggatcacacc ctctatctgt gcgttgttta catccctccc 1920 gatagaacgc gtgacgactt cctaattgaa gcacatacta attccgtttc aactatcgtt 1980 gacagagcct cgaattctga cgaaattgtg gtaattggcg acttcaattt gcctggtatt 2040 cgctggcgat cttcggacaa tggtttcctt tacccagatg cttgtcattc aacgttgaac 2100 gtgagtacaa ccaagattct cgaccgttac agcaccgcta cacttcggca aattaatcat 2160 gtagtgaatg agaacggccg cagcctcgat ctatgttttg ttagttccca agacgtggct 2220 cctaatatag cattagctcc agcaccattt gtgaagcccg tcatccatca tccagcatta 2280 cacatcagct ttgatataag taacaaagtt gaaagtaagg ttgattcagc tactatcttt 2340 aacgacttca ggaatgcaga ctatgacagt attcaacgta ttttttcaac gctagattgg 2400 gacgagcttc tcgatgtaaa tgatgttaat actgcagcac aaacattttc acatgttttg 2460 tggtacgcga taaaccgaca tgtcccaaaa aagacagtct ctgaaggtcg taatcagtgg 2520 cagactccgg aactccgtag attgaagaga caaaagagag cagcgctaaa aaagtactcc 2580 aagaagaaga ccgcgtcatt gaggaaccat tacgtgcgag ttaacaacat ctataagagc 2640 ctaagcaggc gatgctttca tcgataccaa atcggagttc aacggaaact caaatctcac 2700 ccaaaatcgt tttggaacta tgccaaacag cagcgtcatg tgaatggtct accatcttcg 2760 atgaagtgga acgaaacgat tgcttccgag ccacaagtta tatgcgacct gtttgcaacg 2820 aagttttcta acattttctc tgaggaggtg ctcacagata gacagatatc gttagcatct 2880 gccaacacac agaccacagg acttctcatg ccggatgtgg ccaccgatga tgaatctatt 2940 ttaacagcag tacgtaaact gaaagcatca acttgctcag gacctgatgg aattccgtcg 3000 gtatttttga aaaactgcat agacagcttg cttgcacccc tgcaaaggat ttttaatctg 3060 tcactttcga ccggagtatt tccatcctgc tggaagagcg cctacatgtt tccagtacac 3120 aaaaaaggtg ataggaagga catcgataac taccgtggaa tttctgcgtt atgtgcagtc 3180 tctaaactct ttgagcttgt cgttacggat cggctctatt tccattgcaa gcaagtcttg 3240 agtccagacc agcatggctt tattccagga cgttcaactg cgtcaaactt gttatgttta 3300 acttcctacg tgactgacag tatgctcgaa catgcacaaa ctgatgtgat ttacctggat 3360 ctttctgctg ctttcgataa ggtcaaccac caaattctct tagcaaagct ggcaaagttg 3420 ggcatacacg gcagtttatt gtgttggttt caatcctatt tatatgatcg ccgttcaaaa 3480 gttatgatag atgattcagt ctccgctgat tttgcaatga catctggaat cccgcagggt 3540 agtcatctag ggccattgat gtttcttctg tatttcaatg atgtcaacaa tgtgctcgaa 3600 ggaccaagac tttcttttgc ggatgatttt aagctttatt gtaaaatccg ctctgacgct 3660 gatgctcagt ttctgcaaca acagctgact gcctttgcca attggtgcaa tgtgaatcgc 3720 atgactgtga atcctgaaaa atgttctata ataacgtttt ctcgttcgaa gaaccctatg 3780 gcctatagtt atgacctcaa tggcgtgctt ttgagtagag tagatcaaat caaagatcta 3840 ggtgtgattc tcgattctaa attgacattt aaacaacatg tatcatatat agtggacaaa 3900 gcttccaaat cacttggttt aatgtttcgt atcactaaat catttacgga catttactgc 3960 ttaaagacgc tatactgtgc attagttcgg tccactttgg agtactgttc aaccgtatgg 4020 agtccttact accaaaatgg agtcttgaga atcgagtcaa ttcaaagacg gtttgtccgt 4080 tttgctcttc gtctgttgcc ttggcatgac ccgcttcgtc tcccttctta tgagaaccgg 4140 tgtcgtttaa tcagtatgga caccctcagc actcgtagag atgtcgccaa ggctgtttta 4200 gtatccgaca ttcttcaggg ccgtgttgat tgcccttaca ttctgcagca gataaatctc 4260 tacgtccagc caagactgtt gcggaacaac gcgttgctgc gtcctcgcac ttccaggact 4320 aattacggaa tgaacagtgc aattacaggc ttgcaaagag tattcaatag ggtggcaact 4380 gattttgatt ttaatgtgtc ccgtgatttg cttcgctata ggttttctaa ctctttaagg 4440 tagacagtaa gattgtgtgt tttttttatt gttttaatgt taacgacaac aattaataca 4500 tcattggggc taggagcctg ttgatgttca aataaataaa taaataaata aa 4552 // ID REP-1_ED repbase; DNA; INV; 604 BP. XX AC . XX DT 30-JUL-2008 (Rel. 13.1, Created) DT 30-JUL-2008 (Rel. 13.1, Last updated, Version 1) XX DE Unclassified repetitive element from Entamoeba dispar - a DE consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; EdSINE1; KW Ed_SINE2; REP-1_ED. XX OS Entamoeba dispar OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-604 RA Lorenzi H. and Caler E.; RT "Genome wide survey and discovery of repetitive elements in three RT Entamoeba species."; RL Repbase Reports 8(10), 1687-1687 (2008). XX DR [1] (Consensus) XX CC Shares common 228- and 96-bp 5'-termini with EdSINE1 and CC R4-N1_ED, respectively. Most likely, REP-1_ED is a nonautonomous CC R4 non-LTR retrotransposon. XX SQ Sequence 604 BP; 248 A; 52 C; 71 G; 233 T; 0 other; tgattcgaat gtggtatgtc agatacacaa cacaaatcac taatatttcc atgtttcgaa 60 tcacctagtt atcttctggt tatgatgtgc ctttggtgag ataacatacc acataacact 120 aattggttat ttttattatg actgattatt atggaggtat tatcaataat tattaagatt 180 attatttatt attatttatt aatgattatt atttattatg atataagatt atttatttat 240 tatgatggta gtattattat ttattatgat acaatatatt tatcattgat agttttgagt 300 tgaatacatg atgtgttatc aataaagatt atttaaacaa tgttgatgat aaatgagata 360 aaccataatc atagtgttat tgatatgtca aacataaaat aacaatataa ataatgatga 420 attttaatat tcatttattt caaaataaaa atcagataac aacttaaatg atgaaatatt 480 taaattgaga atgattgaaa aaaacaataa atgaatagaa ataaaacatg aagaataaag 540 tgaaataatc aattctcaac atgatgaaaa cataatcata aataattcat atttaatttc 600 ttta 604 // ID ISL2EU-5_HM repbase; DNA; INV; 3885 BP. XX AC . XX DT 10-SEP-2009 (Rel. 14.09, Created) DT 10-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE A family of autonomous ISL2EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3885 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1916-1916 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 172..1746 FT /product="ISL2EU-5_HM_1p" FT /translation="MGGGDHCVVQNCNNDRRYPNKYVIKDHINSFNGSLQL FT KFWRCKDAKMLPKWNSKIGLFRSHFRVSLNSPVCSNHFKMGRPTNIYPYPT FT EYLNGYDNELKNRKSPLKRSSELNKKKFVAKLSDNHYQNEYNHDQNIINHD FT KNNDYHHQNENTQPMDIHYDHDYALPVVPVSVSLEKDLKILQNTPKRILKK FT PLLLKHRKRLLSKFSLIPQAEFTWNRLSHSDRLIKFYTGCTKKVFMFLVSK FT GKKKYENVCYYKGKPSQSFKFRSENKKKPGNKRTLTIEDEILLMCMKLRLD FT LAIQDLAFRFSISNSQCSSILTTWITYFGNELKPLLLWPTKEANLLYKARH FT FSGKLHNVEGIIDCTEQKIQKPSNAKAQYQTFSTYKSCNTLKKLVVTTKTG FT SFSFISKAYGGQASDRHITEDCNVINMFSEGSVCLADKGFNIQDLLLKKKV FT VLVCPPFKKKGLHFTRTKVLSAKEIARSRVHVERSIRRLKEFKICKNELSL FT TMLDLIDYIYIICGALSNLQPSIVKEFI*" FT CDS 3771..1876 FT /product="ISL2EU-5_HM_2p" FT /translation="MSDSSSDNEKIKPFMDEIKNWNVVKLKEFLQARSIPL FT GENPSKKKLLKNVYYANKLKLPLNKTVQEETEEIKSRSLQKLKLKSGVQLP FT HPFEIGNWVLEKEVVIKLLPTTNYLDIETYSFLTHSSKALKEGHSLYSSGH FT VTSVAFNSLSDAVKFCYLKGTVVPQTRINDEEYESWVCLNKNGSIFTAECN FT CVAGYGESCKHIFALLLFVEEHVRLGDNVTCTSVKQKWGTKTQKRKLHEPD FT IFQNISIKKVKSSLENSNLKLSRFLYDPRPYYLKQSEFTKEDWEAVAEATD FT GKCAVICLKPNIQSEYFMKKLPLNATKNKVLPPTIEHYVAQIECSYKNESI FT TFKCEKFLRTFSITPSQAELIKEATISQSKSLLWKSYRVGTISSSIAHAVI FT AKFDVDLKVKNKKAAFNLCAQILCYKNNAKSKSIQWGNMFESYARKRYIRE FT MKKTHKNFTCYQTGLIVSIKTPYLTASPDGITSCTCCGLGLLEIKCPWTFR FT QKKICEYAQSAESFFEKNSCTNTYSLKSSHQYRTQIQHQLFISGYPHADFY FT VCLASDQNCERIYSDPNYKKNEQKFQKFFTEYVFPELITKKLEKKEIVTTI FT LNDLIKKCINDSNSKQVNEKLTNDIINLDQQKI*" XX SQ Sequence 3885 BP; 1335 A; 590 C; 584 G; 1376 T; 0 other; cgccttttcg ctattaattt ttcaaatttc acgcgtaaag catgatggga taacggaggc 60 gcaaacaaac caaaacaatt tttaactaag taaattaaag aaaataaata aagaaaaact 120 aagtaaataa aacatataaa ttagaaaaaa taaaattata taaattataa aatgggtggg 180 ggggatcatt gcgtcgttca aaactgcaac aacgatagaa gatatccaaa taaatacgtt 240 atcaaagacc atattaacag ttttaatggt agtttgcaac taaaattttg gagatgcaag 300 gacgctaaga tgttgcctaa atggaattca aaaattggac tttttagaag tcattttaga 360 gtttctctta actcgccagt ctgttcaaat cattttaaaa tgggacgacc tactaatatt 420 tatccctacc ccacagaata tttaaatggt tatgataatg agctaaaaaa ccgaaaatct 480 ccattaaaaa gatcctcaga actcaacaag aaaaagtttg ttgctaagct tagcgataat 540 cattatcaaa atgaatataa tcacgatcaa aatataatta atcatgataa aaataatgat 600 tatcatcatc aaaatgaaaa tactcagcct atggatattc attatgatca cgattatgca 660 ttacctgttg ttcctgtttc tgtttcgtta gaaaaagact taaaaatact tcaaaatact 720 cccaaaagaa tactaaaaaa acctttgtta ttaaaacatc gtaaacgatt gctgagtaag 780 ttttctttga taccacaagc agagtttact tggaataggt tatcccattc ggatagattg 840 ataaagttct acactgggtg cacaaaaaaa gtatttatgt ttttagttag taaaggaaag 900 aaaaaatatg aaaacgtatg ttattacaag ggaaaacctt cacaatcttt taaatttaga 960 agcgagaata aaaaaaaacc agggaataaa cgcactttaa caattgagga tgaaattcta 1020 ctaatgtgca tgaagctccg actagacttg gctattcaag atttagcatt tagattttca 1080 atttctaatt ctcaatgctc aagtatatta actacctgga ttacttattt tggaaatgag 1140 ttaaaacctc ttcttctttg gccaacaaaa gaagcaaatt tgttatataa agcaaggcat 1200 ttttctggaa agttacacaa cgtagaaggt ataatagact gtactgagca aaaaattcag 1260 aaaccatcaa atgcgaaggc acagtatcaa acatttagta cttataaaag ttgtaacaca 1320 ttaaaaaaac tagttgtgac aacaaaaact ggttctttca gttttatctc caaagcttat 1380 ggtggccaag cttcagatag acacattaca gaggactgca atgttattaa tatgttttct 1440 gaaggatctg tttgcttggc tgataaaggt tttaatatcc aagatttgct cttaaaaaaa 1500 aaagtcgtat tggtttgtcc accttttaaa aaaaaaggac tacattttac acgaactaaa 1560 gtcctctcag ccaaagaaat tgctagatca cgagtccatg tggagcgatc aattcgacga 1620 cttaaagaat ttaaaatatg taaaaacgag ttatcactaa caatgctaga tctcattgat 1680 tacatatata ttatttgtgg agctttaagt aatttacaac cctcaattgt aaaggaattt 1740 atctgatgta aattaaatat gctgatacta ataaaaagat atctctattt tttcttttta 1800 tttttgtata aaaaaaaaca aaacatcaca aaatacttat aatataaatg ttttcatcaa 1860 attttcagca caatgtcata ttttttgttg atcaagattt atgatgtcgt ttgttaattt 1920 ttcattaact tgtttactgt ttgaatcatt aatgcatttc tttatgagat catttaaaat 1980 tgtagtaact atttcttttt tctcaagttt cttagttatc aattcaggga aaacatactc 2040 tgtaaaaaat ttctgaaatt tttgttcatt ttttttatag ttaggatcag aataaattct 2100 ttcacagttt tgatcacttg ctaaacatac atagaaatca gcatgaggat agccacttat 2160 aaacaactgg tgttgtattt gggtgcgata ttgatgacta cttttcagtg agtatgtgtt 2220 agtacaagaa tttttttcaa aaaaactttc agcactttga gcatattcac atattttttt 2280 ctgcctaaat gtccatggac atttaatttc aagcaaaccc aacccacaac aagtacaact 2340 tgttatacca tctgggcttg ctgtaagata aggtgttttg atggatacta ttaggccggt 2400 ttgataacat gtaaagtttt tatgagtttt tttcatttcc ctgatatatc tttttcgagc 2460 ataggattca aacatgttac cccattgaat tgatttactt ttagcattat ttttgtaaca 2520 taaaatttga gcacataaat taaatgctgc ctttttgttt ttcactttca agtcaacatc 2580 aaattttgct atgacagcat gagcaataga tgatgatatt gttccaactc tatatgattt 2640 ccaaagtaaa ctttttgatt gagatatcgt agcttctttt ataagttctg cttgagatgg 2700 tgttattgaa aacgtcctta aaaatttttc acatttaaat gttattgatt cattcttata 2760 agaacattct atctgagcta cataatgctc tatggtaggt ggtagcactt tgtttttagt 2820 tgcatttaaa ggtaattttt tcataaagta ttctgactgt atatttggct tgaggcaaat 2880 tacagcacat tttccatctg ttgcttctgc aactgcttcc caatcttcct ttgtaaactc 2940 tgactgcttt aaataataag gacgaggatc atataagaat cgactgagtt ttagattact 3000 attttctaag cttgatttga cttttttaat ggaaatattt tgaaatatat ctggctcatg 3060 caattttctc ttctgtgttt tagtccccca cttttgtttt actgatgtgc atgttacatt 3120 atctcctaat cgcacatgct cttcgacaaa caacaacaaa gcaaatatat gtttgcagct 3180 ttctccataa ccggctacac aattgcattc tgcagtgaat atacttccgt ttttattcaa 3240 acatacccat gattcatatt cttcatcatt tattcttgtt tgagggacta cagttccttt 3300 taaataacaa aattttacag catctgataa gctgttaaaa gcaactgagg ttacatgacc 3360 tgaagaataa agagaatgtc cttctttcaa ggctttagag ctgtgagtga gaaaagaata 3420 agtttcaatg tccaaatagt ttgtggttgg taataatttt attactactt ctttttctaa 3480 cacccaattt ccgatttcaa aaggatgggg aagctggaca cctgatttta attttaattt 3540 ttgaagagat ctggatttaa tttcttcagt ttcttcttgt acagttttat ttaatggtaa 3600 tttaagttta ttggcataat aaacgttttt taacaatttt ttttttgaag gattttcacc 3660 aagaggaata ctccttgctt gcaggaattc tttcagttta acaacattcc aatttttaat 3720 ttcatccata aatggtttaa ttttttcgtt atcagaggaa gaatcagaca ttgttacagt 3780 attgtagaaa acagtaatgt taaataattg ttttggtttg tttgcgcctc agttatccca 3840 taatgcttta tgcgtgaaat ttgaaaaatt aatagcgaaa aggcg 3885 // ID I_Ele4D_AAe repbase; DNA; INV; 5673 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele4D_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5673 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1373-1373 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >99% CC identity. CC The consensus is ~89% identical to I_Ele4C_AAe. XX FH Key Location/Qualifiers FT CDS 354..1655 FT /product="I_Ele4D_AAe_1p" FT /translation="METDGDVVSVVDGDSKGSSSTIKPIRIKTYPLTFMGP FT FVVFFRKKEKPINVLLISSEIYKIYKSVKEIKKISLDKLRVIFGSREDANA FT LLESKLFLNSYRVYAPCDSCEINGVIYDESLECDEILNHGSGIFKNKAILP FT VKILECVRLSKLLFSDKGSSYTHSNCIKITFEGSVLPDFVVVDNVKFHVRL FT FYPKIMHCDRCLLFGHTSHFCSNKQKCSKCGGIHSPSDCKKLSDICIHCGK FT KHNFLKECAVFIAHQKQFNLKIRNKNKLSYSEVIKTSNGFSSNNIFEPLTE FT NIGIQDSNEEHNFVYKPPIKRKRINKSINQNNNLNPQPSTSYEKNFPPINL FT SKSQNIPGFQKINHDFFGNKSDDIKKNNNNSQNHVHNDTGGTILNILEDII FT EFLGLNDFWKKLIKKCLPFVASLLEKLNSFGPLISTLFCS" FT CDS 1658..5350 FT /product="I_Ele4D_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MAQINNNNLNILQWNCRSIIPKIDRLKALIINNDIDL FT FCLNETWLVNTNVFRIPSFNIIRKDRNIAYGGVMIGIRQNIEFKFLNFSLD FT LPIEYVAISVKHNGLEFSILCLYIPPQARFSLQDLKTILNNIPAPFYILGD FT LNAHHLAWGSDISDGRGSLIMDLIDELNLNILNDGSFTRIAVPPVRHSCID FT LSLCSNSLSMKSTWKSINDPNGSDHLPIKIIVHFALNEPFHQEPFVPDLTK FT YVDWTKFSDLVFTALINFDYSLSPLQNYDKFSAILIQCLQRSQTRKLCLEP FT TRRRLNSFWWDHDCTVALKNKSEAFKKFRNSGTRDNYFLYRKSEAQFTRVI FT KYKKRNYWKTFIENLDSETSLTKLWSVARNLRNYNIPSTSVSEYSENWIDQ FT FASKICPDFAPTSITFKTHQLYNYYPDLCSEFSIEEMNLALSITKNTAPGI FT DNIKFIVLKNLPIDGKLHLLSLYNSFLFQNIFPLEWRSIKVVSLLKPDKNP FT SLVESRRPISLLSCLRKLMERMILNRLELWAEKNNIFSSSQYGFRKGRGTR FT DCVALLASHIELSFNKKQDVVSTFLDVSGAYDSVLIDLLYKKMNDCKIPII FT ISNFLCNLFSFKIMHFFHNGSSRMVRYSYFGLPQGSCLSPFLYNLFTRDII FT SIIPNGCYFIQFADDKVISIHGRSREVIRHYMQICLDNIYTWAHNNGFSFS FT VQKTKFILFSRKHSPINIDLYLNNQQIEQVIDYKYLGIWFDSKLKWNNHII FT YIQKICSKRINFLRLITGTWWGSHPNDMITLYKTTIRSVMEYGCFAFGSAA FT QIHFSKLEKIQYRCLRICLNLMNSTHTKSIEILAGIIPLKYRFQELNCKFL FT INCFSNEHPIIDTLKSLFEINPTNRILNSFIHCSTENIIPNLSVHFHEYSM FT NVHSFQPIVDLSLLEELKQIPCHAYPRYAPLLFKRKFIGVNNDQIFFTDGS FT LIENMAGFGVYNFQLAHFYKLESPCSIFTAELTALYFTCNLIKNYTPNIFV FT VCSDSLSCIQALNSISFNIKTHHIVLSIKGLLYDLYSRGFVIKFVWIPAHC FT NIYGNEQADLLAKLGVFRGPIYKRDINYSEYFTNLKKHSINDWQISWNTSD FT QGRYCYSIYPKVKVFPWFRHLPVGRNFICSFSRLMSNHYICSSHLYRMNIT FT DSNICECNKSYEDIDHIVFECCRFEAPREKLLENIISLGLDIPVSVRDILG FT NRSFLIMKILYEFLNEISYRV" XX SQ Sequence 5673 BP; 1817 A; 824 C; 876 G; 2156 T; 0 other; atcttcggta agtaggcctt gaccgggtta caggtttttt tttctgcgct tgtcattttt 60 tttctttttc aggtgcattt ttttccgaag ttttgcagtt tgaaggcgat tgctactacg 120 tttttgtgat ttgaagaaaa ggagaaagaa gtgggttgcg aagtcgaaga ttcaaattgc 180 aagactcttc acaagtgttc ccagcctttg acgtttttgt gattgctgtt gctggtgttg 240 ctgagtttgt atggtgtttg ttaagtatta attttatttt ttaattgttt attattgatt 300 ttactatttc ctggtgttga tttggttttc gttcaccccg tcatttttcc attatggaaa 360 ctgacgggga tgtagtttct gtagtagatg gggattcgaa agggtcttca tctacaatca 420 aacccattcg aataaaaaca tatcctttga catttatggg accttttgta gtgttttttc 480 ggaaaaaaga aaaacctatc aatgtccttt taatttcatc agaaatttac aaaatctaca 540 aatctgttaa ggaaatcaaa aagatttcct tagacaaatt gcgtgtcatt tttggatctc 600 gagaagacgc taatgcgcta ttagagtcca aattattttt gaattcctat cgagtttacg 660 ctccatgcga ctcgtgtgaa atcaatggtg tcatctatga tgaatcattg gaatgtgatg 720 agattttaaa ccatggttca ggaatattta aaaataaagc aattttgcca gttaaaattt 780 tggaatgtgt tcgtttgtcg aaattattat tttctgataa aggttcctca tacacacatt 840 caaattgcat caaaatcaca tttgaaggct ctgttcttcc tgattttgtt gtagttgata 900 atgtgaaatt tcacgttcgg ttgttctatc ctaaaattat gcattgcgat cgttgtcttc 960 tttttggaca cacgtcgcat ttttgttcca acaaacagaa atgttcaaaa tgtggtggaa 1020 ttcattctcc atcagattgt aaaaaacttt ctgatatttg tattcattgt ggcaaaaaac 1080 ataatttttt gaaggaatgt gcagttttca tagctcatca aaagcaattt aatttgaaaa 1140 ttaggaataa aaataaatta tcttattccg aagttattaa aacatctaat ggattttcgt 1200 ccaataatat ttttgaacca ttaactgaaa atattggtat tcaggattca aatgaggaac 1260 acaattttgt gtataaacct cctatcaaaa ggaaaagaat taataaatca attaatcaaa 1320 ataataattt gaatcctcaa ccatcaacat cttatgaaaa aaactttcct ccgattaatt 1380 tatcaaaatc tcaaaacatt cctggttttc agaaaattaa tcatgatttt tttggaaaca 1440 aaagtgatga tattaaaaag aataataata attctcaaaa tcatgttcat aatgatactg 1500 ggggtactat tttgaatatt ttagaagata taatagaatt tttgggatta aacgattttt 1560 ggaaaaaatt gattaaaaaa tgtttgccat ttgtagcaag tctacttgaa aaattgaatt 1620 cttttggacc cctcattagt accttgtttt gttcctaatg gctcaaatca ataataataa 1680 cttgaatatt ttacaatgga attgtagaag tatcattcca aaaattgata gattaaaagc 1740 tttaataatc aataatgata ttgatttatt ttgtttgaat gaaacatggt tagtgaacac 1800 taatgttttt agaattccat cttttaatat tatccgaaaa gatcgtaaca tagcatatgg 1860 gggtgtaatg attggaattc gtcagaacat cgaatttaaa tttttaaatt tttcattgga 1920 tttacctatt gaatatgttg ctatatctgt aaaacataat ggtctggaat tttcgatttt 1980 gtgtctgtat attccacccc aagcaagatt ttcattacaa gatttaaaaa caatattaaa 2040 taatattcct gctccatttt acatacttgg tgatcttaat gctcatcatt tagcttgggg 2100 tagtgacata tctgatggca gaggttcatt aattatggac cttattgatg aattgaattt 2160 aaatattctg aatgatggtt cattcactag gattgcagtt cctcctgttc gtcattcttg 2220 cattgactta tcactttgtt caaatagttt atccatgaaa tcaacttgga aatctattaa 2280 cgatccaaat ggtagtgatc atttacctat aaaaattatc gttcattttg ctctaaatga 2340 gccgtttcat caagaacctt ttgttcctga tttaacaaaa tatgtagatt ggacaaaatt 2400 ttcagattta gtttttacag ctctaatcaa tttcgattat tcactttctc ctcttcaaaa 2460 ctatgacaag ttttcagcca ttttgattca atgtttacaa agatctcaga ctagaaaact 2520 atgtttggaa ccaacaagaa gaagacttaa ttctttttgg tgggatcatg attgtactgt 2580 tgcacttaaa aataaatctg aggcttttaa aaaatttcga aattcaggta ctagggacaa 2640 ttatttcctt tatcgtaaat ctgaagctca gtttactcga gttatcaaat ataaaaaacg 2700 aaactattgg aaaactttta ttgaaaatct tgattctgaa acatcattaa ctaaattatg 2760 gtctgttgct cgtaatttaa ggaattataa tattccttct acatctgttt cggagtattc 2820 ggaaaattgg attgatcaat ttgcttcaaa aatttgccct gattttgctc ctacctcgat 2880 cacattcaaa actcatcaat tatataatta ttatcctgat ctttgtagcg aattttcaat 2940 tgaggaaatg aatttggcat tatctattac caaaaatact gctccaggta ttgataatat 3000 aaaatttatt gtgctaaaaa atttacctat tgatggtaaa ttgcatttac tatccctata 3060 taattcattt ttgtttcaga acatttttcc cttagagtgg cgttccataa aagtagttag 3120 tttacttaaa cctgataaaa atccttcact agtagaaagt agaagaccta ttagtttatt 3180 atcgtgtctt cgcaaactta tggaaagaat gattcttaat cgtcttgaat tatgggctga 3240 gaaaaataac atattttcat catctcaata tggatttcga aaaggtcgtg gaactagaga 3300 ttgtgtagct cttctagctt cacatattga actgtctttc aataaaaaac aagatgttgt 3360 tagtactttt cttgatgttt ctggtgcgta tgactccgtc ctaattgatt tgctttataa 3420 aaaaatgaat gattgtaaaa ttccaatcat tatttcaaat ttcttgtgca atttgttttc 3480 cttcaaaata atgcattttt ttcataatgg atcttccaga atggtccgtt atagttactt 3540 tggtttgcca cagggttctt gtttaagtcc atttttatac aatttattca ccagagacat 3600 catttccatt attccaaatg gatgttattt catacagttt gcggatgaca aggtaatttc 3660 tattcatggt cgtagtagag aagtaattcg tcattatatg caaatttgtt tggataatat 3720 ttatacatgg gctcataata atggtttcag cttttcagtt caaaaaacaa aattcatatt 3780 attttcacgg aaacattctc caataaatat tgatttgtat cttaataatc aacaaattga 3840 acaagttatt gattataaat atcttggtat atggtttgat tcgaaattaa agtggaataa 3900 tcatataata tatatccaaa aaatttgctc aaaaaggata aattttcttc gtttgataac 3960 tgggacatgg tggggttcac atcctaatga catgatcact ctttacaaaa caactattcg 4020 ttcagtaatg gaatatggtt gttttgcttt tggtagcgct gctcaaattc atttttctaa 4080 actagaaaaa atacaatatc gttgtttgag aatttgtcta aatttgatga attccactca 4140 tactaaatct attgaaattc ttgctggtat tattcccctc aagtatcgct ttcaggaatt 4200 aaattgcaaa tttttgataa attgtttttc aaatgaacat cctataattg atacattaaa 4260 atctttgttt gaaattaatc ctactaacag aatattgaat tcatttattc attgttctac 4320 agaaaatatt attccaaatt tatctgtaca ttttcatgaa tatagcatga atgttcattc 4380 ttttcaacca attgttgatt tatctttact tgaagaatta aaacaaattc cttgtcatgc 4440 ttatccccgt tatgccccat tgttattcaa acgtaaattt attggggtaa ataatgatca 4500 gatttttttc accgatggat ctttaattga aaatatggct gggtttggtg tatataactt 4560 tcaattggct catttttata aattagaatc tccttgttcc atttttacag ctgaattaac 4620 agctttatat tttacatgca atttgatcaa aaattataca cctaatatat ttgtggtgtg 4680 ttcagatagc ctaagctgta ttcaagcttt gaattccatt agcttcaata tcaaaaccca 4740 tcatattgtt ttatcaataa aagggttatt gtatgattta tattccagag gattcgttat 4800 taaatttgtt tggattccag ctcattgcaa tatttatggt aatgaacagg ctgatttatt 4860 ggcaaaactg ggtgtttttc gcggaccaat atataaacgc gatattaact attcggaata 4920 ttttactaat ttgaaaaaac attcaataaa tgattggcaa atttcttgga atacaagtga 4980 ccaaggtcga tattgttatt ccatttatcc gaaagttaaa gtatttccct ggttccgaca 5040 tcttcctgtt ggacgtaatt ttatttgttc cttttccaga ttaatgtcta atcattatat 5100 ttgtagtagt catttgtacc gcatgaatat cacagattca aatatttgtg aatgtaataa 5160 gtcatatgaa gacatagatc atattgtttt tgaatgttgt cgttttgaag cacctagaga 5220 aaaattactg gaaaatataa ttagtttggg tcttgatatt cctgtgtcag ttcgagatat 5280 tttgggaaat cgttcattcc taatcatgaa aattttatac gaatttttaa atgaaatttc 5340 ttatcgtgta tgatactgct cgtttttttt ctggttttac tttcagagac aacaattttg 5400 gcactcatcc catctaatcg attggctctt tggatacgtg cttgtgaaca ttactgatga 5460 acatactgat tgaacatgga agacttcggc tcagttatgg atcgattccg ggagagcctt 5520 tatttaatat atattatttt ataacgttat tcgaaaagat aaagaggttt tgtgcctttt 5580 tgagaagatt tcatttgaat atcactcaaa ggggcttttc cctctttcaa aattcataag 5640 ttaaataaat aaataaataa ataaataaat aaa 5673 // ID Sola1-6_AAe repbase; DNA; INV; 2859 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola1-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2859 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1290-1290 (2011). XX DR [2] (Consensus) XX CC 4-bp TSDs. XX FH Key Location/Qualifiers FT CDS join(371..1038,1099..1591,1662..2537) FT /product="Sola1-6_AAe_1p" FT /translation="MSALDYLKSYGTSDSSEDECRNLADGEPLDNDTGSED FT NFAGFDDKDGCDGNAGGDGSFLFPPKTYPEHWVSYLKSLEDTEYATGTXEM FT DSDGSYTAADEFAKKRNRTKKELKARYNAKKRRLLHQVKLVDCGCKLNCGI FT KISKEDRIRHNTVFWALDSREQTSYIRQSVQRVDVQRRRRYKYEEDDPKKF FT NSFVFHLAESNGSSVKVCRRFFLNTLGYGNNCGNIIYRCVEQDPPKTSQRG FT KYKRSTEKRDAIQKHIMSYGPTVSHYRRAHAPNRLYLPSDLSEKNMHQNFL FT DTHDITATYALYCRKMREMKISLVKLGKEECEVCVSALQHRKTSSHPETIS FT TDPCSTCTRYDKHLRIAKISRDAYRSDGEAVIEDTLVLAVDLQKVIQLPRL FT DGLKSIMFSQRLLAFNQTFAPVAKYAKSFPVMACTWDESTAGRSANDIAST FT FHKVILRCYKKQIILWLDNCASQNKNWGLFLHLILLVNSKYTQVQELTLKF FT FESGHTFMAADSYHAAVEKKMKSKPPVTFHDFKDVLLFAEKKVEVLEMQPQ FT DFFESQLQISQYTLNNLNPRPYMENIKQVVIRKGSFVLNYAESLDSDAVLH FT SAMLFSKKQLKMVTLPNFDLVAGLKFKQEPRGIEPERKTALLKVILPVIDD FT EKKEFWRTLAEKAILNDSDDDDDDEYDTI*" XX SQ Sequence 2859 BP; 881 A; 570 C; 612 G; 794 T; 2 other; ctgcccataa atgcatacca gtctcactag caaaacagcg cacctaagaa aaacgcgatg 60 gaagtcgtga acactttcat gctttattca ctccaattaa attgaaaatg cgatgttttc 120 gcacccgaat ttgcactcca atggtgcaat gaactattaa ttacagtatt acatacattt 180 tgtttattaa ttttctcaaa taattagtta ttatacggga aaatgtacgc gtgactgata 240 tgcgtttatt tgctcttgaa tgtacgagaa taaacgcata acagtatccg tgcctactat 300 gatgcatcgg tcttcacatg tgaattattg ttgttttgaa agtttggtgt ccgcatgcaa 360 agtcgcgaaa atgtctgctc tcgattattt gaagagttat ggtacttctg atagttccga 420 ggatgagtgc aggaatttgg ccgatggcga gccgttggac aatgacacgg gatctgaaga 480 caacttcgct ggcttcgatg ataaggacgg ctgtgacggc aatgctggcg gcgacggcag 540 ctttctgttt ccaccgaaga cgtaccctga gcattgggta agttatttga aatcgctgga 600 ggacactgaa tatgcgacgg ggactgawga aatggactca gacggaagtt atackgccgc 660 ggatgaattc gccaaaaaac gcaatagaac gaaaaaagaa ttgaaagcga gatacaacgc 720 caagaagcgc cgtttattgc accaagtaaa gcttgtcgat tgtgggtgta aattgaactg 780 cgggattaaa atctcgaagg aagaccgtat tcgtcataac acggtgttct gggcattgga 840 ttcacgagag caaaccagct atattcgcca gagcgtgcaa cgagtggatg ttcaaagacg 900 gcgtcgctac aagtacgagg aagacgatcc aaagaaattc aatagctttg tgtttcatct 960 ggctgaatcg aacggttctt cggtcaaagt gtgtcggaga ttttttctga acactctggg 1020 ttacggcaac aattgtgggt aagtatattt cccttacttg gcttataaac acaatttgag 1080 attttttttc ttgtttagca acataattta tcggtgtgtt gaacaagatc ccccaaaaac 1140 atcacagcgc ggaaagtata agcgaagcac ggagaagcgt gatgcaatcc aaaagcacat 1200 aatgtcgtac ggtccaacgg tttcccatta ccgcagagcg cacgcgccca accgattgta 1260 cctgccatct gatctatccg aaaaaaatat gcatcaaaac ttcctggaca cgcacgacat 1320 tacagcaacg tacgctcttt actgcagaaa aatgcgagaa atgaaaatct cgttagtaaa 1380 actgggaaag gaagaatgtg aagtttgtgt ttctgcactt cagcatcgca aaacctcatc 1440 ccaccctgaa acgatcagca cagacccttg ttccacatgt acacgctacg acaaacattt 1500 gcgaattgct aaaatctctc gtgatgccta ccgatcggac ggcgaagcag taattgagga 1560 taccttggta ttggcagtgg atctccaaaa ggtaacgtta aataaaaaca taagccagta 1620 gtattaaata aatctatttt tttataattt tcatctccta ggtcattcag ctgcccagac 1680 tcgatggtct caaatccatt atgttttctc aacggcttct ggcatttaat caaacattcg 1740 ctcccgtagc gaaatatgca aaatcgtttc cagttatggc gtgtacatgg gacgagtcta 1800 cggctggacg gtcggcgaac gatatagcaa gcacttttca taaagtaatc ctgcgatgct 1860 acaaaaagca gatcatcctt tggcttgaca attgtgcctc ccaaaataaa aattggggtt 1920 tgttcctaca tctgatactt ttggtcaact ccaaatatac tcaagtgcag gagcttaccc 1980 taaagttttt tgagtcggga catacattta tggcggcgga tagctaccat gcagctgttg 2040 agaaaaaaat gaaaagtaag ccaccggtaa ctttccacga cttcaaggat gtgttgcttt 2100 ttgccgagaa gaaggtagag gtacttgaaa tgcaaccgca ggattttttt gagtcacaac 2160 tgcaaatctc gcaatataca ctcaacaacc tgaatccccg gccatacatg gaaaatatta 2220 aacaggttgt gattcgaaaa ggaagcttcg tgttgaacta cgctgaatcg ttggactcgg 2280 atgcagtatt gcatagtgcc atgctgttct cgaaaaaaca gcttaaaatg gtaacattgc 2340 ccaatttcga tctggttgct ggtctaaaat ttaagcaaga accccgagga attgagccag 2400 aacggaaaac agctcttctg aaagtaattc tcccagtaat tgacgacgaa aagaaggaat 2460 tttggcgcac cttagcagag aaggcgattt tgaatgactc tgatgatgat gatgatgacg 2520 aatatgatac aatctagaaa ttcttcgaac ctttatataa tgaaatcgac gtaaatcaat 2580 aaaacttcct ttaaatttct ttgcttgtga actgcgacat taatctagta gcaattcttt 2640 ttttttcctt gtattaaata tatatcttga gtcgcgagac tgacatgcgt ttattagtaa 2700 cgtgggacag accaagttcg atgttgtttt taacatttct agaacaaacg tattattttc 2760 tcataatatc atgaaagtgc atgcaattta atgaagtttc ctgtaattaa ctaaaaattg 2820 acgaaaattc acgtgggact gttatgcatt tatgggcag 2859 // ID hATm-29_HM repbase; DNA; INV; 3389 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-29_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3389 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1923-1923 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(566..1165,1098..1655,1613..2998) FT /product="hATm-29_HM_1p" FT /translation="MNLRKNNQDWLIGYPESNKLSSFSRLPTKKHVLGRYS FT HLHFYSKGDTSARDIFKLVLGELKEVWEKAGVPIRSDNSCLSLLTKLFVEK FT VKIKKIDHASRDVKAGKEKIVRFCNELEQLCDISAVNAYEQLLVSRRPKWK FT EDWAFYEDQKSSRKYHMTGSIDLGVSKFLERQEDRLSIDFRRSSTEDQKSS FT AFEVSELFCQIFGEVQPKIKNRVPLKSVNYFVSEFEVVESADEEDQKDTEF FT TPLPLKKRKMPLTNSLEISAKKLSEVTASVADRCCLSVRQQLLFQSTIICK FT SGGKLDEISMSVSTVHRQRQNARENIVLSIKADWEINKPKKAILHWDSKLF FT HLDIAHNEERVAVLISGSLNGYLFIYLYKSYVYFLFLYSKKLTILCIFFIF FT VFKETNLTFICRPKLIGVPLIKDSTGKTQCDVAVKLAQDWNILENIVGICF FT DTTASNTGNKKGAATLIEIELKRPLLWLACRHHHNELHIKHAFTALRGGSK FT SPDEPIFKRFQAEFSRIDIDYSNFNFFEWPTDIKSEIFNQASLVLKWAYQC FT LEEKIFPREDYLELIELTIIYLGGKLSMDRIFKIHKPGAIHNARFMSHSIY FT ILKMELLSNKFIMSNNEQIMIHRMAKFISLFYSMQFLRSRISVFAPVDDFK FT FFVAMNWYQEEDSDIATAVISSINRHLWYLTEELVILSLFNEKLSEFTRTI FT MAKKLFSTPRPKTFLIGKPKFPTLNSSTFIYFLIGPRSWLLFDLLGLINNQ FT EWLQMNPQEWKILFQDYRTAENFVKQLEVTNDCAERGIKLIGDFRECAQNE FT EQQQFILQVVEQHRKQNPSYSKSKLILILYSIFYITTIFFINYYS*" XX SQ Sequence 3389 BP; 1203 A; 509 C; 549 G; 1127 T; 1 other; gggtgtttca tttccgacaa tttttcgaaa atggttacgt tacatgcata acggggtaaa 60 taatgggcca aaaaaaagtc tgtaaatttt tcaaaatttt aaaatatcat ctagaggtcg 120 ccgtcttgaa aaaatggcgt tttttactct tttttcaact ttaaatttaa attacttaaa 180 aaataattaa tttaatttta tgttacctgg tgtgtataat aaaggttatc ggataaacat 240 ttggccaaat tttcaaaaat attggttaat gtctagaggt cgccgaatta taaagtttta 300 ttttttcaac atattttttt gctaaaaaat tgcttaaatg ctttaaaaaa acttcacctt 360 aaataaaatg cataatttat gttttgagca taattgacca ctaaaagttg ttgcgtgatt 420 aaatagtttg caaaataaac atctggacgt atgtatataa atttcataaa cttataaaaa 480 aaattggttc ttatatcaat tagcaaaatt gaaattttaa aacaatttta aaactagttg 540 gataagcgaa tattttattg taacgatgaa cctaagaaaa aataatcaag attggttaat 600 tggttaccct gaatccaata aattatcaag tttcagtcgt ctaccaacaa aaaaacatgt 660 tttgggtaga tactctcatt tacattttta ttcaaaaggc gacacttctg caagagatat 720 ttttaaactt gttcttggtg agcttaagga agtttgggaa aaagctggag taccaattcg 780 atcagataac tcttgtcttt cattgcttac caaattgttt gtggaaaagg tgaaaataaa 840 aaaaattgat cacgcaagtc gagatgttaa agctggaaaa gaaaaaattg tcagattttg 900 taatgaactt gagcaacttt gcgacatttc agcagtaaat gcatatgaac agttattagt 960 atcacgtaga ccaaaatgga aagaggattg ggcattttat gaagatcaaa agagtagcag 1020 aaagtatcac atgacaggta gcatcgatct aggtgtttca aaatttttag aacgtcaaga 1080 agatagacta agcatagatt ttcggagaag ttcaaccgaa gatcaaaaat cgagtgcctt 1140 tgaagtcagt gaattatttt gtcagtgaat ttgaagtcgt tgaaagtgct gatgaagaag 1200 atcaaaaaga tacagagttt actcctttac cactaaagaa aagaaagatg cctttaacaa 1260 attcactaga gatatcagca aagaagctct cagaagtaac ggcatctgtg gcagatcgat 1320 gttgtttatc agttcgccaa caacttctat ttcaatccac tatcatctgc aaaagcggag 1380 gcaaacttga tgaaatttct atgtcagtat caactgtgca tcgtcaaaga caaaatgcgc 1440 gggagaatat tgttctgtca attaaagctg actgggagat aaacaaacct aaaaaggcca 1500 ttttgcactg ggattcgaag ctttttcatt tagacatagc tcataatgaa gaacgtgtag 1560 cagttcttat tagcggatct cttaacgggt acttatttat ttatttatat aaatcctatg 1620 tatatttttt atttttgtat tcaaagaaac taacttaact tttatttgca ggccgaaact 1680 tattggtgta cctttgatta aagattcgac aggtaaaact caatgtgatg tggccgtgaa 1740 acttgcgcaa gattggaaca ttttggaaaa cattgttggt atatgctttg atacaacagc 1800 aagcaacaca ggtaataaaa aaggagcagc aactttaatt gaaattgagt taaagagacc 1860 tctcctatgg ctagcatgtc gccatcatca taatgagcta cacatcaaac atgcatttac 1920 cgcattacga ggaggtagca aaagtccaga tgaacctatt tttaaacgtt tccaagccga 1980 attttctaga attgatattg attactcaaa ctttaatttt tttgaatggc ctactgatat 2040 taaatcagaa atatttaatc aagcaagttt agttctgaaa tgggcttacc aatgcctcga 2100 agagaaaata tttccgcgag aagattatct tgagttaatt gaattaacta ttatctacct 2160 tgggggaaaa ctatcaatgg acagaatatt caaaattcac aaaccaggtg caattcacaa 2220 tgcgagattt atgtcacatt ctatatatat tttaaaaatg gaactcttat ctaataaatt 2280 tattatgtcc aataatgagc aaattatgat ccaccgcatg gctaagttta ttagtctttt 2340 ttattcgatg caatttctcc gctcgaggat ttccgtattt gctccagttg atgacttcaa 2400 attttttgtt gcaatgaact ggtatcaaga agaagattca gatatcgcaa ccgctgtgat 2460 ttcgtcgatt aaccgtcacc tttggtattt aacagaagag cttgttattt tatctttatt 2520 taatgaaaaa ttgtcagaat ttaccagaac aataatggca aagaaacttt tttctacccc 2580 aagaccaaaa acatttttaa ttgggaaacc aaaatttccg acattaaatt cttcaacctt 2640 tatttatttt ttaattggac cacggtcgtg gcttcttttt gatttattag gactaataaa 2700 caaccaggaa tggctccaga tgaatccaca agagtggaag attctyttcc aagattatcg 2760 aactgcagaa aattttgtta aacaattaga ggtaacaaat gattgcgccg aaagaggaat 2820 caaacttatt ggtgatttcc gcgagtgtgc gcaaaatgaa gaacaacaac aatttatttt 2880 gcaagttgtt gaacaacaca gaaagcaaaa tccatcctat tcgaaatcaa aactaatttt 2940 aatactttat agtatttttt atattactac tatcttcttt attaattatt attcttagaa 3000 aagctgaagc ctaaaaatag cctagttgaa tttttttttt taagatttaa aaaaataaaa 3060 agttttttag ctttatattt taatacggcg acctctagac attgaccaat ttttctgaaa 3120 atttcaccaa atgttcatct aataaccttt gatatataca cccagtaacg taacataaaa 3180 ttaattattt tttaagttat tcaagtttaa agttgaaaaa agtgtaaaaa acgctatttt 3240 ttcaagatgg cgacctctag atgacatttt aaaattttga aaattttaca attttttaaa 3300 aattttttag actttttttt ggcacattat ttaccccgtt atgcatgtaa cgtaaccatt 3360 ttcgaaaaat tgtcggaaat gaaacaccc 3389 // ID DNA-9_CQ repbase; DNA; INV; 2039 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; DNA-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2039 RA Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 47-47 (2011). XX DR [2] (Consensus) XX CC ~92% identical to consensus. 3 bp TSD. XX SQ Sequence 2039 BP; 646 A; 269 C; 274 G; 849 T; 1 other; ggtctattcc cgtactgcag aattctgcaa ttctgcacaa aaacggcagc agagcgttcc 60 cgtacttttc tgcaacaaaa gtaacttttc tctcaggcag aacttctgca gtgagagagg 120 tagaagttgc agcaaaaccg ttcccgtact tttctgcaag ctctctcttc tctttcttgt 180 tgaaaagtta ctgcaatttg gtgtgatgga tgttgagtac acgcttacat aaaatttgca 240 agttgggtga aatcaagcat tttattatta gttgtaataa taaatgattt tggggtagtt 300 tttttttatg tctggtttta ttgagaacga tttttttatg ttaacgattt caggcttagt 360 tcattcatgt aaatgataaa gcgatttttt tagtgttata tttttacctg ataaaaaaga 420 aacttaagcc atagtattac agcacggcta gtacctcaac aaaaaaaatg tactataaaa 480 acaaattgtt ttaaaaactt ttaaaagaca catttctttt gagtttaata atttcaaata 540 aatatttgtt taggaataat ttttgatctt aaattgattt tgaatatact aaatatgtgt 600 gttaaaataa taggtaaatt taattggcaa ataataaatt gtacctttaa cggtgctacc 660 aatttgtatt cttatatgaa ccaaaaaaaa ttgatgtaat tttgtcgaaa ttaaaattct 720 gaaactctga aattctaatt cttaaattct taaattccta aattctcaaa ttcttaaatt 780 cttaaattcc taaattctta aatttcgcca tcttgaatca taaaaaacac acattttttg 840 gagtcatatt ttaggttaga aactcaaatt tagaggattt agagattgga aatttkgact 900 gtttctattt tatttatgtt tgagttttaa aatttttgat tacagaattt tttaatttta 960 cttatttttg aaattataat ttctttaatt ttctatctta ttatttttaa ctgtagcttt 1020 tgaatttttg gtgttctgaa ctcataaata ttcaaacttt tgttttttta tttcgatttt 1080 tttcaacctt tattttttta ttttgaattt ttaaaattaa gattcttttg aaattttaaa 1140 cattttgaaa tatttagttt ataattattt gaatttttaa gctttgaatt tgtgatttat 1200 tatttttttt tatttttctg atataattat ttgcattttt aaatttctca aatatctgat 1260 ttatgttttt ttgaaagttt ctcgatttga aaatattagt ttttttatat ttttaattat 1320 ggatttctta atttccagat ttcttaattt aagtgtttca aaattttgag tttttgattc 1380 atttttttaa atttgtcaat ttcgctgcgt caccgttgtt ctttcatgtg acatccagtc 1440 ataatttatt tatgttatca tgaataatct ttcaaacatt ctgcaattaa aacatgtgtt 1500 tttggtccct tctatataaa acctttgttt gtcttttttt aatcatattt attaaacgta 1560 cctgccactc tcatttgtaa aattgtttac ctaatgtaaa aaatttgagt tgtttctaat 1620 atataatttc tacctaagca taattaattt ctagttactt aataaataat acatccttga 1680 ttttttaaat aaaatgttgt gctgcataaa aaataatgtc cccgttctag aggtaaactt 1740 ctttcaatta ctttttttgt gaaccatttt tttaatattt tgaaataatt aaaaaacttt 1800 aatacccaat ttgagtaaat ccactcaact gtgtacaata ttgcagaaaa agtactggaa 1860 cgcgctaccc aggcaaaagt tttgccgctt ctctccttgc agaacttctg caaaagagag 1920 agatgaagag agcgagctga aaagtacggg aacgtgctgc aaaaaagtga cttatgctgg 1980 ctgcagaatt ctgcagaagc tgcagaaatt ctgcaattct gcaagtacgg gaatagacc 2039 // ID hATm-43_HM repbase; DNA; INV; 3466 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-43_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3466 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1937-1937 (2008). XX DR [1] (Consensus) XX CC This is a relatively old transposon (~17% divergence from CC consensus). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 3466 BP; 1171 A; 600 C; 622 G; 1070 T; 3 other; ttaggggtgt tcacgtaaac actggaaaaa aaaatttttt ctctgatctg aatgcacact 60 agtttatttt gtgccaattg gcattaaaat ttactgtgca aaagatctaa ttaaaatcaa 120 attaatgttt agaggttgct caatggcctt ttagttttac aataatagca ttacgcctta 180 aataattttc taattttatc ccatgttcaa tttagataaa atattaaaca aatttgatgt 240 accattgtct tttatgagct aacttctttg gtcatgtgac aagcagccat cttgtcatct 300 gatgtttttg attgaaaaac attttcatta attgtaattt tttgctgaac aactgaaata 360 tttaaagatt ttgaggttct tatttatttt aatcctataa aataataatt atagctgtta 420 tgccaatatc ttgtttttta gtgctaaata ttattgggtg acttaactta ctttagcgtt 480 tgtagttttg taatacttat gatttcatca cattcatgtg cagataataa taatttctaa 540 tttgaaaata tttattgaaa tactttgatc acaattatta ttaaattatg ttgcagctgc 600 ataaattaat atttaatttc tcaagttgag tttaatttgg tatcgagttt taattttagc 660 ttgttgctga aattatgagt gaaaagataa gttcttcaaa ttttcctatc ttgacacgta 720 atcgaactgc aatatggttg attggtcaac ctgagaagaa tctaccaaac aatgtccttc 780 caacaacggt tgatgtgtta agaacgtttt tccactatca caaagcattg aaacaaacag 840 ttccagccag tgcaaaagca acatctaatg gtttagctca catctggaaa aaggccagaa 900 ttccaacgac ttaccagcct catattgtat caaaagtgaa agcatgtgtt gatgaatata 960 atttaattaa aaagaataaa ggaagagcta gtaagattca cagattgcta gagagaaaga 1020 ttttagtatc aaattagatt tactgtttga catttcacac aaggatgctg aaacaagcta 1080 attaaaattg atgaagataa aatatttctg gaagaccaaa agaatgctcg tatgaaaatg 1140 aagatggctg gagaagacgt aaaattatct caacaagaaa agcgaacagc agagcgtcaa 1200 caagcagaaa taaaaagaaa gcagaatgag gaagagagaa gacaacctga agcaaatgct 1260 gtccttaccc ctagctgtag ccactttagt cttgattgtg atagtgatag cgaagcaaca 1320 acgaagtcag tgacgttcct gatgacaaaa tggttgatag agattatgaa atagaaattc 1380 cggtttatca caaaaaacta gtaacccaat caacttcttc agggactgct gaacaaagtc 1440 ctaaaaataa tccaaagcca caatgtattc tcaatgacaa ttcttgtctt ctccagatgt 1500 atcatccaca cttgatcgaa tcaatttatc tgaccaaaaa ttcactatat tggcagctgc 1560 aattgctaga gcaagtggag aggacctcca aagtacacca ttatcaagat caacagtccg 1620 ccgcaaacgc atcatgcacc gttcatcaac tgaaaatcat attcggacag aaatttatgt 1680 caagtggaga aacaacatat ggtcgtccac tgggatggta agatgatgag agacagcaca 1740 aactatgaaa atccaaagtc caatgttgat agaattgcta ttgatgtgac tggtcgtaat 1800 ctggagaaaa tactgggtat tgtaaagata tcatctggca ctggactggc acaagcaaaa 1860 gcaaccttcc agttactgac catatggaac gttgcaaatt atattgttgg tatgtgtttt 1920 gacacaacag cgtcaaacac gggttccaaa aatggtgcgt gcgtcttgct agagaagttg 1980 atggagaaaa atcttctcta ctttgcttgc cgtcaccata tgcatgaagt aattattggg 2040 gaaatatact cagttttgct tggacccagc agctgggccc aacatagctc tatttgaacg 2100 atttcagaag tgttggccaa gcataaatca agccaattat gctccattag atgacgccca 2160 tcttgctgaa aataccacag cttcagcaac ttcaagatca gaggctggct cctttctgca 2220 gcctttctta tcagatgact cttcgcactt accaagagaa gactacaaag agatgatcga 2280 gctgtgcctt ttaatatttg ggaaactgcc tgatcaagaa gagaaaaatt atcattttaa 2340 gatccctggt gcttatcacc gttatcagac tatgccacca agaagtatcg ctttcgcatg 2400 ccgggagctt atcacatggc tagatggatg gctaaagtca tctattgtat gaagatatac 2460 ctcttaagaa atgaattcaa gctgactacg aaagaacaga acaacttgtg cgaattttgt 2520 atcgttgctg ctcatatyta tgtaccagca tggattgcct gtccaattgc aagtgatgct 2580 ccagcaaatg acctgatgct ctttacaaga atttaaacaa tattctgaga tcaacaaggt 2640 catttcaaat gctgctatga agaagcttca aaatcacact tggtaccttg ggtcagaaat 2700 gattccactt tcattatttt ctgacagggt ttgtgacaca gaaaaaaagt tgattgttga 2760 agctatgatt tccaaagtgg agctgattgg agtgtcaggg gtataaagtg tccagcagca 2820 gaattaaaca agttggaaag tctagagaag caacttcatg acatgcttgt tacatcatca 2880 tcaactgctg ctttgcagtc acttggtctc gatgtaacca ttttatctgg aacaacgatc 2940 caaagacctg ggaggaaatt gctcagaatt tcgttatact aaagcaattt agttaaatct 3000 gtgaaagtta tcaatgacgc tgctgagcgt tctgttgctc ttatgagcac gtttaatcaa 3060 tcsatcacaa aaactgaatc tgaaatgcag aaactaatac aagttgttga ggacaacaga 3120 aaacgcattc cagattgccg caaaagcatc ttgatgacat atgccactgt tgctacagac 3180 taaacttttg atcttgtaaa tagtgcaaca tttgaatgtt atttgcatta tttaaggaat 3240 tttttgaaaa tgattaattg tttatcatag ctataactca agttttttam aatattgttg 3300 ttgcggcgca tgcttttaac acagtaaatt tttattttcc ttgagcaacc tctaaatatt 3360 atccaaaaca cctaagattt tgcacagagt ttttttcaat caaagaaaca atatttaaaa 3420 ctgtgcaaca tttaattcaa aaatattttt ttttggacac ccctaa 3466 // ID Ginger1-2_HM repbase; DNA; INV; 3064 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.01, Created) DT 02-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3064 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TSD is 4-bp long, TIR is 126-bp long. One intron is pos 317-482. XX FH Key Location/Qualifiers FT CDS join(161..316,483..2186) FT /product="Ginger1-2_HM_1p" FT /translation="MHSRFADIFSYLKYQIYPKEVLNKGDKANFRRQTKPF FT IVDNNELYHLNKAAKRKVIWSRDEQQNIIKFVHEGNDVSVESKALSGHTGI FT NNTTYHIQSRFFWYGMIKDITTYVSKCDQCQKSKNRKLQSKPLLQNISIPK FT GNMKQVGIDLTQLPEVNGCKYLVVLIDYFSKWVEAEALTDKTAKAVAFFLY FT KQICRHGCFEIQINDQGREFVNEISKELYSKTGTIQRITSAYHPQANGLVE FT RQNQTIKRILVKVLDEAALEWPYIIDSVLFSLRVRKHKSTGFSPFALLYQR FT EAVLPIDIDCNLIDFNDSNALSNDDFSNKKIVSDTFHAMNKMKNKIFDDAY FT KNIEKSQKRQKHDYDKRIAPNNEISINKKVLLRNSRRDDRKGGKLVKPWTG FT PYIVTSISSTNNCTLKNLKGQILKTKYNLTKLCLYNEKITNDEQEITFDEI FT NMQISNDKIGKYATIEMRTSISEKFGIGITKNIFIADINLLSRPKQIHKTI FT GDGNCFFRAISYIITGSECSYEIIREKVVKHMCTHINNEMTLYMNQSIITY FT LNESGMSDNTTWATDAEIIGCASFMGIDIKVYSKYGEKLKWLTYPCSLTLT FT QLSEHAIFLDNSTASHFNVVLL" XX SQ Sequence 3064 BP; 1197 A; 403 C; 457 G; 1007 T; 0 other; tgtagtatga gatttttact cccgtgtttt ttttactccc cagtaaaaat atcgtatgaa 60 gtttttaccg cggagtaaaa aaaagcatat gagattttta ctctcactat tttattttta 120 ctctcaagtt gttattagcg gaattgcata tcaataaaac atgcactcta gatttgcaga 180 catttttagt tacttaaaat accaaatata tccaaaagaa gttttaaata aaggcgataa 240 agccaatttt agaagacaaa ctaagccatt tatagttgat aataatgagt tatatcattt 300 aaataaagct gcaaaggtat gctattattt caaaatgttt taatagtttt gccattttgc 360 aactagaatc tagtcaccaa ataaacacga atatttacaa ataatttatt taataaataa 420 tgaaaattaa tttagttttg attctcatta tattgatggt ttacagcttc ttttttttat 480 agagaaaagt tatttggagt agagatgaac aacaaaacat aataaagttt gttcatgaag 540 gaaacgatgt ctctgttgag tctaaagctt tatctggtca tacaggtatt aataacacaa 600 cctatcacat tcaaagtaga tttttctggt atggaatgat aaaagatatt acaacttatg 660 tttccaaatg tgaccaatgc caaaagtcaa agaacagaaa acttcagtcg aaaccattgc 720 tgcaaaatat ttcaattcct aaaggcaata tgaagcaggt aggaatcgat ttaacacagc 780 taccagaagt taacggttgc aaatatctgg ttgtattaat cgactacttc agtaaatggg 840 ttgaagcaga agcacttacc gataaaactg ctaaagctgt ggcttttttt ttgtataaac 900 agatttgtcg acatggctgt tttgaaattc aaataaatga ccaaggtcgt gagtttgtaa 960 atgaaatttc taaagaactt tactcaaaaa caggaacaat tcaaagaatt acaagtgcat 1020 atcatcctca agctaatgga ttagtagaga gacaaaatca aacaattaaa cgtatattag 1080 taaaagtgct tgatgaggct gctttggaat ggccatacat tattgatagt gttttgtttt 1140 ctttaagagt tagaaagcac aaatctactg ggttttcacc gtttgcattg ctttaccaaa 1200 gggaagcagt tcttcctatt gatatagatt gcaatttaat tgactttaat gatagtaacg 1260 ctcttagcaa tgatgatttc tctaacaaaa aaatagtttc agataccttt catgcaatga 1320 ataaaatgaa gaataaaatt tttgacgatg cttataaaaa tattgaaaaa tctcaaaaaa 1380 gacaaaaaca tgattacgac aagagaattg caccaaataa tgaaattagc ataaataaaa 1440 aagtgttact aagaaatagc agacgtgatg atagaaaagg tggtaaactt gtaaagcctt 1500 ggactggacc atacattgta acttctattt ctagcacaaa taactgcact ttgaaaaatc 1560 taaagggtca aattttgaaa acaaagtaca acttgacaaa gttatgttta tataatgaaa 1620 aaataactaa tgatgaacaa gaaatcacat ttgatgaaat taatatgcaa atttctaatg 1680 ataagatcgg aaaatatgca actattgaaa tgagaacttc tatttctgaa aaatttggaa 1740 ttggtattac caaaaatatt tttattgcag atattaatct actttcacgt ccgaaacaaa 1800 ttcacaaaac cataggcgat ggtaattgct tttttagggc aatatcgtat atcattacag 1860 gcagtgaatg cagttatgaa attataagag aaaaagttgt taaacatatg tgtactcata 1920 ttaataatga aatgacttta tatatgaatc aatcaataat aacgtatcta aatgaaagtg 1980 gtatgagtga taacacaact tgggcaactg atgcagaaat aattgggtgt gcctcgttca 2040 tgggtattga tataaaagta tatagcaaat atggagagaa gttgaaatgg ctgacttacc 2100 catgtagttt aacacttaca cagcttagtg aacatgctat attccttgac aattcaactg 2160 caagtcactt taatgttgta ctgttgtaaa atattatcaa aaaaaaaaat aatttataaa 2220 ttatcttctt ttcataattt ttagacattt tttaaatata tttttatttt aagtctgtaa 2280 acaaatgttt taaaaacgta aaaataagcg aatttacaac ggtatgtagg agaaggaata 2340 aaaatgtttg tctttttaaa tttggtggca atactttttc atatgcctga ctgattccac 2400 attagtaaag cgaaagcaaa gaatccagaa ataaacctat ataacaatat gtgtttaatt 2460 aaaaaaaaaa tcttaaccct atctgataca tagggtaacc tacgataaat tttttttttt 2520 tttttgcaga tgtgaattgt taaataaaaa aagtcagaaa gaaataaaaa gctgcgattt 2580 tcagattctg caggttagca aggtaaactt gttgtttaga aaactccttt tctaacaagt 2640 ttgcaacaag taccatagcc taatagtaaa ataaaacaat tttggatatt ccccttcatt 2700 ttttaataaa acaagtacac attttgatca aacaagtact tgcatattct atgatcttta 2760 agtttaaaaa agattatctg taaaatttat attttcaccg tttatttaaa ataatataag 2820 aaattttact cccactaaaa tgtagaaaat aaattaagtt aaaaaagttt attaaaagct 2880 taacttttat aaaatatatt aattaagcac tatattaaca taacggtaaa aatatctaat 2940 gagagtaaaa atctcataaa gggagtaaaa atttcatacg cattttttac tccgcggcac 3000 aaatttcata taaaattttt ttaggggagt aaaaaaataa tgggagtaaa aatctcatac 3060 taca 3064 // ID hAT-1_HM repbase; DNA; INV; 2498 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2498 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1990-1990 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 414..2213 FT /product="hAT-1_HM_1p" FT /translation="MFKMENIKISNYLTNLDRGKRSGTCLACNKTVQWSKE FT RLAAHKRSTCQNASVEEKRLFSKRNYESSHLINDSQQLPSTSDSPMQRVHP FT PINEELKNDIDTKLANFFYRTGISFRLVESEVFKDFVKSLNPSYASVIPKA FT KALSGSLLDKQYTKCSIYVNEILNSETNLTLMSDGWTNIRGDHIVNFCIKA FT PEQKPFFYTSINTSGIIQNSSAVAAAILQVIEKIGSQKFTSFISDNAPVMK FT SAWRIIEEKYPHISASGCGAHGVNLLVKDIVSTIEATKTVKDAEKIIKYVK FT NHHIVKAKFDERRIAANISLSLSMPVSTRWFSYYKSMSSLQLSKYVLIKLV FT DEESPLLKEIQPKNTSAAVMALIKSNPFWDRLSKLVKSIEFPSNVIGKLES FT DTAPLSLVYDYFGQMYHSYMDDKDIQQKVQSRLDFLFTPCIGIAFILTPKN FT AAEGLYLNEDKTDFITATVEFAKKIKPEIAATAEDELIAYIGEISVLPERR FT KETIFKMNSRNYWNIIGRDKYPALYEIAKPINEMICSSATAERAWSTFRFI FT HSRLRNRLTNERVEKLVFLYTNSVLMDTNDKTDYILEEGAILNEIECQEIT FT E*" XX SQ Sequence 2498 BP; 839 A; 457 C; 452 G; 750 T; 0 other; gggttcggca aatgcatctg cgcagatctg cgcagacata atatgcgcat atctgcgcag 60 acgtcggcgc cgacgtctgc gaagacagtt taaaggatac gaaaaattta aattatgata 120 aaaaaattac ttcagacatt tgattcagat aatttaatta tctgaatcaa atgcctgaaa 180 ttaattattt ttttaaattt aatttcagac aatatttaat aaaataattt cagacatttt 240 gaaaaactcg taaaaagtag gtaattatct actttttacg agtttttcat tacttgtctg 300 tttctttttt gttcgagcta catataatca atcgatataa actttttaaa gtgtttctta 360 ctacaattta aagtgattta aaataaagaa atcgcgtatt ttcgtcataa caaatgttta 420 aaatggaaaa catcaaaatt tcgaattatt tgacaaatct tgatcgaggg aagcgttctg 480 gcacttgttt agcatgtaac aaaactgttc aatggtcgaa ggaacgcctt gcagcgcata 540 aaagatcaac atgtcaaaac gccagtgtcg aggaaaaaag attgttttct aagcgtaatt 600 atgaatcgtc acatctgatc aatgactctc agcaacttcc gtctacgtct gattcaccga 660 tgcagcgtgt acatccaccg attaatgaag agctcaaaaa tgatatcgac acgaagcttg 720 caaatttctt ttaccgtacg ggtatatcat ttcgtttagt ggaatcggag gtctttaagg 780 attttgtgaa gtcgctcaat ccatcgtacg catctgtcat cccgaaagct aaagccttat 840 caggatctct gctcgacaaa caatacacaa aatgttctat ttatgtaaat gaaattttga 900 attctgaaac aaacttaaca ctgatgtctg acggctggac gaacataaga ggcgatcaca 960 tagtgaattt ttgtattaaa gcgcccgagc agaagccttt tttttacacc tcgataaata 1020 catccggaat cattcaaaat tcatctgcag ttgctgcagc aatcttacaa gttattgaga 1080 agattggctc acaaaaattt accagtttca taagcgacaa cgctcctgta atgaaatcag 1140 cgtggaggat catcgaggag aaatatccac acatctcggc tagcggatgc ggagctcatg 1200 gagtcaactt gcttgtgaag gacatagttt ctacaattga ggccacaaaa acagtcaaag 1260 atgcagagaa aatcatcaaa tatgtgaaaa atcatcacat cgtgaaagca aagtttgatg 1320 aaagacgaat tgcagcaaat atttctcttt cattgtccat gcctgtatct acacgatggt 1380 ttagttatta taaatcaatg agcagtctcc agctatcaaa atacgttctt attaagttgg 1440 tagacgaaga aagtccatta ctgaaggaaa ttcaaccaaa gaatacatct gctgctgtta 1500 tggcgttgat aaaatctaac ccattttggg accgtctctc taagcttgtc aaaagcattg 1560 aatttccttc caacgtgatc ggaaagctag aaagtgatac agctccattg tctctcgtgt 1620 atgattactt tggtcaaatg taccattcat atatggacga taaagacata caacaaaaag 1680 tgcaatctcg cctcgatttt ctctttacac catgtattgg aattgctttc atattaacgc 1740 ccaaaaatgc cgcagaaggt ttatatttga atgaagataa aaccgatttt atcactgcta 1800 cagttgaatt cgccaaaaag atcaaacctg aaattgctgc cactgctgag gatgaattga 1860 ttgcttatat aggagaaatt tctgtgctgc ctgagagaag aaaggaaaca atattcaaga 1920 tgaactcgcg aaattattgg aacattatcg gtcgcgataa atatcctgct ctttatgaaa 1980 ttgctaagcc tattaacgaa atgatctgct catcggctac agcagaaaga gcttggtcga 2040 cctttcgatt tattcactcg cgactaagaa atcgtcttac gaatgagaga gttgaaaagc 2100 tagtattctt gtataccaat agcgtgctga tggacacaaa tgacaagact gattacattc 2160 ttgaagaagg tgcaattctc aatgaaattg aatgtcaaga aatcaccgaa taggcgccag 2220 cattatatat ttttattact gtaagaatct ttgaaaactc ctattgttac taatcactga 2280 atttataata aagttaaaag attacttaaa acatttttta agtttaaaac tttttttgtt 2340 gatttcaata aatcgtcagt aataataata tctgcgcaga tctgcgccga gagacgaacc 2400 ttagtatatt ttagcagtaa taataaattt tgtcggcgcc gacgtctgcg cagacaaatt 2460 acaatgtctg cgcagatctg cgccgacagt gcgaaccc 2498 // ID BEL-231_AA-LTR repbase; DNA; INV; 630 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-231_AA_; KW BEL-231_AA-I; BEL-231_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-630 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 918-918 (2011). XX DR [1] (Consensus) XX SQ Sequence 630 BP; 230 A; 112 C; 110 G; 178 T; 0 other; tgttgccgct gggaacactt ccacggctat gtattccgca gagttagcct caccggagta 60 aacgtcattc cgacagatca atctgtcaac tagcttagag gatctaatga atagattcag 120 agtcactgtg gatgagtgcg cagatttgcc attccgtgaa atattaacag aattattata 180 aaagttggat tgttattcct gaactctttg aattaaaact aaaagtagtg gtttgaaaca 240 gtaggtgaaa tgttaaaact taaaactata aaacagttaa aattactttt tattacagaa 300 cacaagtagt tacaaggcta aaacttaaag ctagagaggt taaaatccta caaggtacat 360 taaaagtact taataattaa tcatgcaaca tgcactagag tactaattac gttatcaatt 420 acagttgaat tgcgattata attattagag gcaaagcaga acagttattc tgtagttaaa 480 acaccgatcg tgtaagtagc atacacaaga caaagtaatc taaactaata aataaaattt 540 gcagtttaaa gcatcgccaa actacacaaa acagtgtttg tgctgctgaa aacctggtga 600 ccagctcccc atttctgctc cctgccaaca 630 // ID Gypsy-90_CQ-I repbase; DNA; INV; 8388 BP. XX AC AAWU01007292; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-90_CQ_; KW Gypsy-90_CQ-LTR; Gypsy-90_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8388 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 559-559 (2011). XX DR GenBank; AAWU01007292; Positions 99839 91452. XX CC Positions [5333-5812] - Integrase core CC 'ACGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 558..2942 FT /product="Gypsy-90_CQ-I_2p" FT /translation="MSFKFPYAKHLLNDEVDYELRVRSLDDALNLDTETKV FT EILQKHYQEDSQEKREYLSIYTIEQDYDFILSRVSSIELALREKKDPKLIS FT RLQYYYLRTKNSQITPQTKEMRDVLVQDICDLMKHVGVQLQPISEEVHQDE FT GAAALPVEQNQSGNLSVNQTAGRNTSVRSNLVVDTVTDLLGDLNGSNQNQH FT NLTFGDFGIDLSPVRVNPSYTGAIPKVPASKRLGIVPSREQLQAEMEQMRK FT DNKELRQTCRDLTDRLQNINLAELRKKAINPQNNSEVPSRQTEPPLNKSSS FT RTNQFPPSAQNQEPPPLPDNRQNPQFSQGLPLPPNPSNQTSQSYQWPRPMK FT WPNPQQNRGVPPPPNETHQLYQWPRSNPQQERGAPPQSSQTPQLYQWPQQE FT QGYRFGLSGQNNRFQPTGQVNSVKGNQTPSQRNQFQQQNAQNNLSQGLTLR FT LHRPPFGNEVSGRHGGYENYDHGDEDDDSEDDGDYCQGNFIPFERTRRRRT FT EQYDRRIEKWNISFSGEPRSTTLEDFIFKVKRLAMMNSISEGSMLSHIHLL FT LRGEASDWFFTYYEEDWDWLTFETKIRFRFGNPNENQSIRQRIADRKQQRG FT EKFSAFLSDIERLNKLLTKPLSERRMFRIIYENMQQRYRSKLLYFNIDSKD FT LLAALCQRIDANDPSLNPINTRHGVNNLEADQPETDSEEELNAIDRRNQRQ FT PENRSWRRDRPAEGYRGNPSQETTRLPLCWNCRQHGHFWRQCQAEKTTFCY FT VCGNAGTVAATCNVHPRRSAPAQAAQTTQGQGNPHPQSGQNPGN" FT CDS 2780..6205 FT /product="Gypsy-90_CQ-I_1p" FT /translation="MSSRKDYVLLRLWKCRNSCCDLQRPPKKERARTGCSD FT NTRSRKPTPSVWTKSGKLNSECRNGNSSIPTKNKVPSSTPFFDPFQNLLEI FT KIHTNLCPHVKIRVFETEFEALLDSGASVSVTNSRDVADRYGLKIFPSPIR FT ICTADHTQYSCVGYVNMPVTFRNVTKVVAVVIVPEITRSFILGINFWTTFG FT IKPMMQGETGLEEIAEITAGSQDVPEELFHFFIHPVEALPVIEQPPPDDSL FT DIPGIDLPEPSATTPETIKTEHDLTEAERGSLADVIRTFPCTTEGRLGRTA FT LLQHEIVLREDAKPRRMPLYRCSPSIQAEMDKEIERYRQMDAIEECTSEWA FT NPLVPVRKSNGKLRVCLDSRRINAMTKKDSYPMRDMKGIFHRLQSACYFSV FT IDLKDAYFQIPLKEDCRDFTAFRTSKGLFRFKVCPFGLTNAPFTMCRLMDK FT VIGFDLEPHVFVYLDDIVVATKDLSEHLRLLGIVAERLEKANLTISLDKSQ FT FCKKQVNYLGYLLTGQGVAIDSKRIDPILNYSRPKSVKDIRRLLGLAGFYQ FT RFISNYSKIVAPISDLLRKEKNKFQWTEAAEEGFQELKAALVSAPILANPD FT FAEPFVIESDASDNAVGAALIQKLDGEPRIIAYFSKKLSSTQKRYASVEKE FT CLGVLLAIEHFRHYVEGSRFKVVTDARSLLWLFTIGVESGNSKLLRWALKI FT QSYDIELEYRKGKQNITADCLSRSLDAISVAQVDPEYQDLVKQITMDPQAY FT TDFRVVDGSVYKFVKNKGKVEDSRFCWKVYPPKSEREEILRKTHGTAHLGF FT EKTLATLRERYFWPLMNTETKRFCQDCLVCQTSKATNVNTTAPLKVQRKIA FT EYPWQFVTMDYVGPLPASGKGRSTCLLVLTDVFSKFVLVQPFRQATADSLV FT PFVENMLFLLFGVPEVILTDNGTQFTSKLFQDLLSRYHVTHWRTPSYHPQI FT NDSERANRVITTAIRATIKKDHKDWANNIQTIANAIRNSVHDATKYTPYFV FT LFGRNMVSDGREYRCMRDTSIDSGSLNNAEREKLYADVKENLKKAFERHSK FT YYNLRSNANCPQYTVNEKVLKKNTELSDKGKGYCAKLAPKYVPALVKRVVG FT EHCYELEDEKGKRLGVFNCKYLKKFHQPPSRSVD" XX SQ Sequence 8388 BP; 2421 A; 1874 C; 2024 G; 2069 T; 0 other; ttttggcgcc caacgtaaaa ttcgttgaat ttcgtactga tttggagtaa aacttgatta 60 ttcgtggagt aattaatgtt cttttggagt tactgtgtta atttaattgc tgggcaagcg 120 tgagagtttt ttggagtact aatgatagaa agttacacat gctgtggtct aacgaaaaat 180 gagattaata aacgataaaa aatgatttac tgatgattta ttcaaggaaa actattacga 240 tttgtttcgg gaaatattaa ttttggagtt ctgatcgaaa agattcgaaa aagagattat 300 tttggaaaag attcgaaaaa gaatttattt tgggaaatta tttcggatac tgaggacatt 360 caggcatatt tatttgtgga agtcactatc ggtgttattt atttatttat ttatttattt 420 atgattattt attacagttt atgtggtatt tattaactgt cttgattaca gagtgcttgt 480 gtcatagtag tatttgattg cgagtttgtc cagtttcgtt ttagggagat ttatcacaat 540 aataaaattg attcaaaatg agtttcaaat ttccttacgc gaaacatctg cttaatgatg 600 aggtggacta tgagttaagg gtgagaagtc ttgatgatgc gcttaactta gacacagaga 660 ctaaagttga aattttgcaa aaacattacc aagaggacag tcaagagaaa cgcgaatatc 720 tgtcaattta cacgatcgaa caggactacg acttcattct ttcgcgagtg tcttcgattg 780 aactagccct acgagagaaa aaggatccaa aattgatctc acgtcttcag tattactatc 840 tgcgtactaa aaacagtcag attacgcctc aaacgaagga aatgagggat gttttggtac 900 aggatatttg tgatcttatg aagcacgttg gggttcagtt gcagccgatt agcgaagaag 960 tacaccagga tgagggagca gctgctcttc cggtggaaca gaaccagtcg gggaacctgt 1020 ctgtcaacca aactgctggc cggaatacca gcgtacgatc gaaccttgtc gtcgatacgg 1080 tgactgactt gcttggagac ttaaatggga gtaaccagaa tcagcacaac ttgacctttg 1140 gcgattttgg gatcgacctt agtccagttc gagtcaaccc aagttacact ggtgcgattc 1200 ccaaagttcc cgcgagtaaa cgtcttggta tcgttccatc cagggaacag ttacaagccg 1260 aaatggaaca aatgcgcaag gataacaagg aacttaggca gacgtgtagg gacttaaccg 1320 atcgcttgca gaatatcaac ttagctgagt tacggaagaa ggcaattaat cctcaaaaca 1380 actcggaagt accttcgcga caaaccgaac ctccgctgaa caaatcttca tcgagaacga 1440 accagtttcc accgtcggcg cagaaccagg aaccgccgcc attgccggac aatcgacaga 1500 acccgcagtt cagccaaggt ttaccactcc ctccaaatcc atcaaaccaa acctctcagt 1560 cgtaccagtg gccgagaccg atgaaatggc cgaaccctca acagaatcgc ggagttccac 1620 cacctccaaa tgaaactcac cagttgtacc agtggccgag gtcaaaccct cagcaagagc 1680 ggggagctcc accacaatcg agccaaaccc cccagttgta ccagtggcca caacaggagc 1740 agggttaccg gtttggtcta tcaggacaga ataaccgttt tcaaccaaca ggacaagtca 1800 actcggtaaa agggaaccaa acgccgtcgc agagaaatca atttcaacag cagaacgcgc 1860 agaacaacct gtcgcaggga ctgactttgc gcttgcatcg accgccgttt ggaaatgaag 1920 tttccggaag gcacggtggc tacgagaact atgaccacgg agatgaggat gatgactccg 1980 aagatgatgg cgattactgt caaggaaatt tcataccgtt tgaaaggacc agacgccgcc 2040 gaacggagca gtacgacaga cggattgaga aatggaacat ttctttttct ggggaaccgc 2100 gctcgaccac gctggaagac ttcatcttca aggtgaaaag actggccatg atgaactcga 2160 tctctgaggg tagtatgctt agtcacatcc atctactact aagaggcgag gcatcagatt 2220 ggttcttcac ctactacgag gaggattggg attggcttac cttcgaaacg aagattcgtt 2280 tccggttcgg taatccaaac gagaaccaga gtattcgtca acggatcgcg gaccgcaaac 2340 aacagagagg agaaaagttc tcagctttcc tgtcggacat cgagcgtttg aacaaacttc 2400 tgacaaaacc gttgtcggaa agaaggatgt tccgcataat ttatgagaat atgcaacaac 2460 gttaccgttc caagttatta tactttaaca tagacagcaa ggacctgctc gcagccttgt 2520 gtcaacgcat cgacgctaac gatccttctc tcaacccgat aaatacgcgt cacggtgtca 2580 acaatctcga ggcagatcaa ccggaaacgg atagcgaaga agagttgaac gccatcgata 2640 gacggaacca gcgacaaccg gagaaccggt catggagacg ggaccgtccc gctgaggggt 2700 acagaggcaa tccttcacag gaaacgacga gacttccgtt gtgttggaac tgcagacagc 2760 acggacattt ctggagacaa tgtcaagcag aaaagactac gttctgttac gtctgtggaa 2820 atgcaggaac agttgctgcg acctgcaacg tccacccaag aaggagcgcg cccgcacagg 2880 ctgctcagac aacacaaggt caaggaaacc cacaccctca gtctggacaa aatccgggaa 2940 actaaactcg gagtgccgca acgggaactc gagcattcca accaagaaca aagttcccag 3000 ttctacacct tttttcgacc cttttcagaa cctgctggag atcaaaatcc acaccaacct 3060 gtgtccacac gtgaagatcc gcgtgttcga aactgaattc gaggccctgt tagactcggg 3120 tgccagcgta agcgttacca attcgagaga cgtagctgat cgttacggtt tgaagatttt 3180 tccgtcaccg attcgaatct gtaccgctga ccacacacag tactcatgcg tggggtacgt 3240 caacatgccg gttacgtttc gaaacgtgac gaaagtcgta gcagtagtaa ttgtaccgga 3300 aatcacacga agcttcatcc taggcatcaa cttctggaca accttcggga ttaaaccgat 3360 gatgcagggt gaaacgggac tagaagagat cgctgagatc accgcaggaa gtcaggacgt 3420 tccggaagag ttgttccact tcttcataca tccggtcgaa gcacttccgg tcatcgagca 3480 accaccaccg gacgattccc ttgacattcc gggcatcgat ctgccagaac cgtcggcgac 3540 cacgccggaa acgatcaaga ccgagcatga cctgacggaa gcagagagag gaagtctggc 3600 cgacgtcatc aggacattcc cgtgtacaac ggaaggtcgt ttaggaagaa cagcgcttct 3660 gcagcacgag atcgtacttc gagaggacgc gaaaccccgc cgaatgccac tctaccgctg 3720 ctcaccttcc atccaagcgg aaatggacaa ggagatcgaa cgctacaggc agatggatgc 3780 catcgaggag tgcacgagtg agtgggcgaa cccattggtt cccgtccgca agtcgaatgg 3840 taagctaaga gtttgtttgg actcgaggag gatcaatgcc atgacgaaga aggattccta 3900 cccgatgcgc gacatgaaag ggatattcca tcggttgcag agtgcgtgct acttctccgt 3960 aatcgacctg aaagacgcgt acttccagat accgctcaaa gaagattgtc gcgacttcac 4020 ggcgttccgg acgtcaaagg gactgttccg ctttaaggtg tgtccattcg gactcacaaa 4080 cgctcctttc acgatgtgca gactgatgga taaggtcatt gggtttgatc tggagcccca 4140 tgtatttgtc tacttggacg acattgtggt ggcgaccaag gacctgtccg agcaccttcg 4200 actactggga atcgtggcgg aacgtttgga aaaggcgaac cttacgattt cgctcgacaa 4260 gtcgcagttt tgtaagaagc aggtgaacta tttggggtat ttgttaaccg gacagggcgt 4320 cgcgatcgac agcaagcgga ttgatccgat cctgaactac agccgcccta agagcgttaa 4380 ggacattcgc cgtctgctcg gcctggcggg cttttaccaa cggttcattt ccaattacag 4440 caagatcgtc gcgccaattt cggacctgct acggaaggag aagaacaagt tccagtggac 4500 ggaggccgcg gaagaaggat tccaagagtt gaaagccgca ctggtgtcgg caccgattct 4560 ggccaacccg gattttgccg agccgtttgt gatcgagtca gacgcatctg ataatgctgt 4620 cggagccgcg ctgatccaga agctggatgg cgagccacga atcatcgctt acttcagtaa 4680 gaagttaagc agcacccaga agcgatacgc cagcgtggag aaggaatgcc taggcgttct 4740 gctcgcgatc gagcacttcc ggcactacgt ggaaggaagc cgtttcaaag tcgtcacgga 4800 cgcgcgtagt ctcctctggc tgttcacgat aggtgtcgag tctggtaact ctaagctgct 4860 caggtgggct cttaagatcc aatcatacga tatcgagttg gagtatcgaa aggggaagca 4920 aaacatcacc gcagattgtt tgtctcgctc gttagatgcc atcagcgtag cgcaagtcga 4980 tccggagtac caggacctgg tcaagcagat cacgatggac ccgcaggcct acacggactt 5040 ccgagtagtc gacggcagcg tttacaagtt cgtcaagaac aaaggaaagg tcgaggatag 5100 tcgtttctgc tggaaagtct acccgccgaa gtcggaacga gaagagattc tccggaaaac 5160 tcacggaacc gctcacctcg ggttcgagaa gactctggcc acgcttcggg aaagatattt 5220 ctggccgttg atgaacaccg aaacgaaacg tttctgccag gattgtctcg tctgtcaaac 5280 cagcaaagcg accaacgtca ataccacagc tccactgaaa gtgcagcgga agattgcgga 5340 atacccgtgg caatttgtga cgatggacta tgttggaccg cttccggcat cggggaaagg 5400 gagaagcacc tgtctgctgg tactaacaga cgttttcagc aagttcgtac tagtacaacc 5460 atttcggcaa gccacggcgg attcgctcgt tccgttcgtt gagaatatgc tgtttttgct 5520 gttcggggtt ccggaagtaa tcctcacaga taacgggaca caattcacgt ccaaactgtt 5580 ccaagatctg ctgagtcgat atcacgtaac ccactggaga acgccgagtt accatccgca 5640 gatcaacgat tcggagagag ccaaccgggt gataacaacc gccatccggg ccacgatcaa 5700 aaaggaccac aaggattggg cgaacaacat ccagacgatc gccaacgcca tccgaaactc 5760 ggtgcatgac gcgacgaagt acacgcccta ctttgtcctg ttcgggcgta acatggtctc 5820 ggacggaaga gagtaccgtt gcatgaggga cacgtcgatt gatagtggga gtctgaacaa 5880 cgctgagagg gagaaactgt acgccgacgt caaggagaat ctcaagaagg ccttcgagag 5940 gcactcgaag tactacaacc tcaggtcaaa cgcgaattgt ccacagtaca cggtcaacga 6000 gaaggttctg aagaagaaca ccgaactttc cgacaagggt aagggttact gtgccaaact 6060 tgcgcctaaa tacgttcctg cgctggtcaa aagggtagtg ggcgaacact gctacgaact 6120 agaagacgaa aaaggtaaac gtctgggcgt tttcaactgc aagtacctta aaaagtttca 6180 ccaacctccc tcgcgttcgg tcgactaaca ccgcggtaga atcgatcagt ttcagttagg 6240 attttgcaat tcaggatttt caaatttaaa tttctcgagc tatgtaccct ctaaccgtta 6300 gagaggcaac caagcataaa ttatgctcca aaacactttt taggcatcaa caactacaac 6360 tacatgccac acctgtccag cttcagaaga gagtacagat gattcatgag cttttgcgga 6420 cgagatcgaa agaccaactc ccctagccaa catgaaaatc tatgcttgaa cctgctgact 6480 gtcgacaaaa tatgctaggc aaactatgac ccaaccagct atgtcccttt cacccaatcg 6540 aaagcaacaa gatctcataa aacctctaac cttccagtct cctcgatcaa agagttgagc 6600 gtcccacaat aacataaact actggcgaca aaaacgaaga actagagcat aaataccacg 6660 aacaccctcg aactagtcaa tcgcgatctc ggacatcgta aacagaccgt ccaacgagac 6720 aaaattagat ttaggatagg aataggatca aggatgggag caaataccct acctgatgta 6780 aataaaactt gttaattgtt agtcgcgcct aatcttataa aaactacatg tgagtactta 6840 tcgtagtgtt agtttcttct tggatctctg tagattatgt ttatgttgag tcccacgaag 6900 taatgttgcg tagcatgatc gatgatgtaa agttgtcttc aatttcgtcg tctggagcat 6960 attttcggga acttgagatg agtagagatg tagtcagcat aattcagcat cgcgcagcat 7020 aattttcgtc gtgacgtata ggttaggtta tccaattaga gttcgtcgca gcttccaatc 7080 agatgagaag ttgatgacgt cacaccaaat tcagccttca gcaacatatt tctgtcgtaa 7140 ttttcagcat tccagcaata aattttggcc ttccgcactt ggtttttagt tcagcagcat 7200 aattgattca gccttccgca taactattag taaatttccg tcagccttca gcacccagag 7260 ttgtttttgt cccacgatgc agctcctaca aaagaaaaca caaagcacat caatttttgg 7320 atttttttta gggaaagtta atactccccc cgtaattaat catccggtaa agttgttttg 7380 gaacaatcag gcacttttca ctactttttg gacacatttc agagcacatt ttgagacaaa 7440 accacacgcg agttaaaatt acggacagcc acgtttgttt acgttttcgt ttgacgtcct 7500 cggcgaagct ttttgacgaa gcttttggtt tgacaggtcc agttagttag gtagagtgaa 7560 attttttcag tgtttatttg ggaaaaccct tgtggcgaat ttcagttgtc ttgttgaatt 7620 tcgggtgacc agagtcgatt ctggggtgga aattcaatat agatatttta attaggctgc 7680 gttctgtcag agttaaggac aaatttgcct gaagtaaagc tttggggtgg gcctaaactc 7740 tgcagaacag atgatttcgg tgaattagga cgaagtttca tttgctcaac gtaatgattg 7800 aggtagattc tgagtcttaa tgagccagta gtatttcggg aacacaaagt ttggagattt 7860 taggttgtat tttcattaaa tatggaatat tatttattgg aaagtatatt ttagtttggg 7920 gtattgattg ggatttgagt gttaggatga atgaatgagg aatgtcaacc attagtcata 7980 gtcgaactgc tagtcctgga ctgttcaaac aagtagtttt ttctgggaga aatttagaga 8040 ttatagccgg taaagtaaat gaccgaaaaa taaagtatct gtgattaggt tatacaattc 8100 tgcactagca ttttgttgag gatttctgaa cgttcgctgg agaccggagt gaagacgttg 8160 atggtcagta acaagcattt tctgtgatgg atcatgactt ggaaatagag atggaggtta 8220 cttgtaaccg atgatcatat atgaccccga actgttgaag aatctttcgc aatttgtttt 8280 tagtgaaatg taaataaatg tgtagttgtt agttttcggt ttttcgtaag atttcccctt 8340 acgaaaattc agtttacact gaattttcgt aatttaagca tggagtga 8388 // ID Gypsy-8_RP-I repbase; DNA; INV; 2149 BP. XX AC ACPB02034718; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_RP_; KW Gypsy-8_RP-LTR; Gypsy-8_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-2149 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02034718; Positions 14475 16623. XX CC Positions [608-1087] - Integrase core CC 'AAAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 86..1927 FT /product="Gypsy-8_RP-I_1p" FT /translation="MLNLRDPSARLTKWALKLMEYNYQVIHRPGAQHKNAD FT ALSRSVGGVDSSPGEFPTESVLAQEQEKDEWCRNFALSRPNQVQKGPNGLL FT YFTDGESDSEKWRVAVPKALRAHILFMLHDTPWSGHPGMERTFSRVRRWFY FT WPNLRRDVENYCRNCVLCTERKTPSGLTAPLGEPFEATCPFEQVSLDIVGP FT LPLTKQGNRFLLTFIDNFSRYAEAVPLREQSAEATARAFVREIVLRHGTPR FT RLLTDQGTNFTSGVFKAMCRMLQIKKLQTTAYHPQCNGLVERFHRTLIDEI FT SFFVRRDGKDWDQWLPFALMAYRSTPHTSTGYSPHFLLYGKELEIPFECPL FT DLTVYSGGVEEYVRDWQKRMDMALQSAWATMKDAREKRAEYFNAGKSLREF FT NVGQRVYLFEPATPVGHARKFRRPWTGPHTVVHRKSPWDYQISLRKGGFVV FT VHANRLKAAAPDEDNPEGNGCNLEEPHNIGDPVGENDIEINRENMAVGGGS FT GDEDEFRQITIAREQPSGADTDDIEARAALDEDTVCPDVSDWEVRDFRDQT FT WAPDLNESDTLLIPRSPYGLRNAATIPPEVRINARPRRRGERSSGAESLHG FT GELASLADEGDTNYSS" XX SQ Sequence 2149 BP; 597 A; 471 C; 611 G; 470 T; 0 other; ttggtgtcag gtgtggggtc aattcagatg ctatttactg gggcgacagt tcaagttaat 60 tactgatcat tctgcgattc agtggatgct aaatctgagg gatccgtcgg cacgtctcac 120 gaagtgggca ctgaaattaa tggagtataa ttatcaggta atccaccggc ctggggcaca 180 acataaaaac gccgatgctc tcagtcgctc cgtaggaggg gtggactcaa gtccgggaga 240 gtttcccacg gaaagtgtat tggcccagga acaagagaaa gacgaatggt gtcgcaattt 300 tgccctctcc agacctaacc aagtgcagaa ggggccgaat gggttgttgt atttcacaga 360 tggggaaagc gacagcgaga aatggcgagt ggcagttccc aaagcattaa gagcccacat 420 cctgtttatg ctccacgata caccatggtc cgggcacccg gggatggaaa gaaccttttc 480 tagagtccgt agatggttct attggccaaa cttgcgtagg gatgtagaga attactgcag 540 aaattgtgtg ctatgtacgg agcgaaagac ccccagcgga ttaacggcac ctttaggaga 600 gcccttcgaa gcaacatgtc catttgagca agtgtcctta gacattgtgg gccctctccc 660 tctaaccaaa cagggcaatc gctttctctt gacattcatc gataacttta gcagatacgc 720 ggaggcagtg cccttaagag agcagagcgc tgaggcgacg gcacgagcct ttgtgcggga 780 aattgtactc agacatggaa cacccagaag gcttctaacc gatcaaggaa cgaattttac 840 gtcgggggta ttcaaagcca tgtgccgcat gctccaaata aagaagctgc aaactaccgc 900 ttatcacccc caatgtaacg gattagtaga acggtttcat agaacactaa tcgacgaaat 960 ctccttcttt gtgcgaaggg atggaaagga ttgggatcag tggttgcctt ttgctctgat 1020 ggcctatagg tcgacgccgc atacatctac cgggtatagc ccacattttc tgttatatgg 1080 gaaggaacta gaaattccgt ttgaatgccc cttggatttg accgtatact caggcggggt 1140 agaagaatac gttcgggact ggcaaaagag aatggacatg gcattgcaga gtgcctgggc 1200 aactatgaaa gacgcaaggg aaaagcgtgc cgagtatttt aatgcgggaa aatcgttaag 1260 ggaattcaac gtgggacaac gtgtttattt gttcgagccg gccaccccag tcggacatgc 1320 taggaagttt aggagaccat ggacgggccc acacacggta gtacatagaa aatctccgtg 1380 ggactaccaa atttccttaa ggaagggggg attcgtcgtt gtccacgcta atcgcctcaa 1440 agcggcagca cccgacgagg ataacccgga agggaatggg tgtaatttgg aggaacctca 1500 taatattggt gatccggtgg gcgaaaatga tatcgaaata aacagggaga atatggcggt 1560 cggcggggga tcaggggacg aggatgagtt ccgccaaata accatcgcga gggaacaacc 1620 cagcggggcg gacactgatg acattgaagc tagggctgca ctggacgagg ataccgtgtg 1680 tccggacgtc tcagattggg aggtccgcga ttttcgggat caaacgtggg caccagacct 1740 gaatgaatct gatactcttt taattccgcg gtcaccatat gggttaagaa atgccgcaac 1800 gataccgccg gaggtacgga tcaatgcaag gccgcgacgt cgcggagagc gatcatcggg 1860 ggcggaaagc ttgcacggag gggaactggc cagccttgcc gatgagggtg ataccaacta 1920 ttcctcttag tagagagcaa cccttccttt ttattccgtc tggctgaaat gtaaatttga 1980 aataggtagg caagaaagaa atttagagga agacttgtac caacagtggg aggaatcaac 2040 acgccccgta ggagacctga aacagaagag tatccccggc gtcatccgga gaagcaccgc 2100 taagaagatt cgaggggaac atgtttcact tttctaaaag gggaagtga 2149 // ID BEL-190_AA-LTR repbase; DNA; INV; 395 BP. XX AC supercont1.75; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-190_AA_; KW BEL-190_AA-I; BEL-190_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-395 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.75; Positions 193456 193062. XX SQ Sequence 395 BP; 144 A; 66 C; 66 G; 119 T; 0 other; tgttgcgacg actgcatcca tcgaagcgcc cctcgaacgc atagaaacca tccatgcgaa 60 tgagtgggga actgtcaaat caaataagaa ttgagaaaca gcttatcatt gttgattgat 120 gtggaagcag aatcatttat taagaagtga attataatct atattattac tatatcctta 180 ttaaaactaa atatttacag atagcgcctt gaattacaat atattaaata taaaaatttg 240 aatttgtttg cctagtaaga gaccaaatgt aagtaatatg aaatatgcca ttcgactatc 300 tctaactaat aaatttatag ctttgagcta atctcatcgg gaacattgac gagtttttgc 360 tgaaaagacc tccgaaaggc caatacgtct taaca 395 // ID CR1-21_HM repbase; DNA; INV; 4448 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-21_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4448 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1849-1849 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 118..930 FT /product="CR1-21_HM_1p" FT /translation="MTKSEEFVSIVMMREMLNLQKETIISFFRESILSTNL FT KFDIFQSSIQDIRRELDDLKSGLNFVGGVCEDKVKAVGEKINKVCEDLTNV FT RKMQGKISSDNTELKLKSVDNEDRNRRNNLRIDGIIETNEEKDWDITKEKI FT KQLFNVKLGIEKDIKIERAHRVGLAKLERERTIVLKLRDFEDKKVILENAI FT KLKGSNIFINEDYSFTTRKIRKELFEQAKLHRQNGHYAKVVYNKLIVHEFK FT ENIHKVVNLTHGNEAIERNAAVCLTHGNE*" FT CDS 999..4013 FT /product="CR1-21_HM_2p" FT /translation="MATHFENLKSFEENINIFQETQINLIDTPYIDPFDFK FT TTNDHCSFSVLHINIRSMQQNFEKLKEFLNIINYTFDVIALSETWHDFENS FT LELNSIYNLPFYKLISQPRKSGKKGGGLGLYVSKKHDFKIKNVLSFSNDNY FT ESLFVEIINKKIRNVLIGCVYRPPSGKVKTFETFIKNTIEKINKENKTFYV FT IGDINLDALTCSKFPKTKSFFDMLLKFNVLSAINKPTRVTKTSATAIDNIF FT INNYLEMELETGIFMTDISDHFPIFIKVRNLKTMSDYEPKVINLKKRNLKT FT SNTNELSIRLKQETWSNVYRCQDTNEAFNNFLDIFLDCFNKTCPLEGIIIK FT TKTVSNPWMDKSLIKCSKKKQKLYNKYLKNKNTDNEKNYKNYKYFYQNLIK FT KAKNKYYSNEIIKCKSDSKKTWSVINKIMGRKKINTTSLPKRININNTDIF FT CEKQISEEFNKYFINIGPNLANKINPTTDSFKNYLTPTNKAIMYDKEFVYK FT EFEEAFSSLKKKKAPGFDEITSDLVIFNKTSLSRPLIHILKLSITSGIFPD FT VLKLAKVIPIFKCNEHSDITNFRPISILSVFSKLFERVIYNRIYNHFTKNK FT LFYPNQFGFQKNLSTEHAIIELVNQITDGFNQNKFTLGVFIDLSKAFDTVD FT HCILLDKLKYYGIINKTYNWIKSYLTNRKQYVCNIQSGVLNVLCGVPQGSI FT LGPLLFLIYINDFCNASLKLNSVMFADDTNLFLTNSDIKNLYADMNIELIK FT VNNWFRANKLSLNSEKTKYILFHKKAQEENLPLKLPDLILNDNLIKKQASI FT RFLGVLVDEKLSWLPQIRYIQSKISNIIGLMMYRVRSYINKQSLKLIYFGL FT IHCFISYANISWASTQPSKLKKIYSLQKHACRIIYFKNKREHAKPIMKDMK FT MMDVFEINIYQHLIFMYQFNNNLSPTNFTNKFEININENYYLRANISNTYK FT LPQNLNKYTEYSIAYRGPNIWNRYQKGSKCMAKSLSSFKFLIKKEIFKI*" XX SQ Sequence 4448 BP; 1770 A; 617 C; 620 G; 1440 T; 1 other; ttttttggtc tttgagcatt gtggtgaacg gacgtgtttt tcttcgcagt aataaaaagt 60 atataaaata ttaataattc tttgtgtatt atttttacat aaaatatttc atctggaatg 120 actaaatcgg aagaatttgt atcgattgta atgatgaggg aaatgttaaa tcttcaaaaa 180 gaaacaatta tatctttttt tcgggaatca atattaagca caaacctaaa gtttgatatt 240 ttccaatcga gtattcaaga tattcgtcgc gaacttgacg atctaaaatc agggttgaat 300 tttgttggtg gagtatgcga agacaaggtt aaagctgtag gcgaaaaaat aaataaagtt 360 tgtgaagatt taactaatgt cagaaaaatg cagggaaaaa tatcctcgga caatacggaa 420 ctaaaactaa aaagcgtaga taatgaagat cgaaacaggc gcaataactt aagaatcgat 480 gggattattg aaacaaatga agaaaaggat tgggatataa ctaaagaaaa aataaaacag 540 ctttttaacg taaaacttgg aatcgagaaa gatattaaaa ttgaacgagc ccatcgagtt 600 gggctcgcta aattagaacg agagcgaacc attgtactaa agcttagaga ttttgaagat 660 aaaaaagtaa tcttagaaaa tgcaataaaa ttaaagggga gtaatatttt tataaatgaa 720 gattatagtt ttacaactcg taaaatacgc aaagaacttt ttgagcaggc gaagcttcat 780 cgtcaaaatg gtcattatgc taaagtcgtg tacaataaac ttattgttca tgaatttaaa 840 gaaaacatcc acaaagtagt taacttgacg catggaaatg aagcaatcga acgcaacgca 900 gctgtttgtt tgacgcacgg aaacgagtaa gctttttaaa agtttttggt ttaatttatt 960 gttatatttt tttaatcttt ttaaattatt cgttaaaaat ggctacgcat tttgaaaatt 1020 taaaatcttt tgaagaaaac ataaatattt ttcaagaaac tcaaattaac ttaattgaca 1080 caccatatat tgatcctttt gattttaaaa caacaaatga tcattgttct ttttcagttt 1140 tacatataaa cattagaagc atgcagcaaa attttgagaa actaaaagaa tttttaaata 1200 ttataaatta cacgtttgac gtcatagctc tttcagagac ttggcatgat tttgaaaatt 1260 ctcttgaatt aaattctatt tataatttgc cattttataa acttataagt caaccaagaa 1320 aaagtggaaa aaaaggtgga gggttagggc tttatgtctc aaaaaagcac gattttaaaa 1380 taaaaaatgt attaagcttc tcaaatgata attacgaaag tctatttgtt gaaatcataa 1440 acaaaaaaat tcgaaatgtt ttaattggat gcgtttatcg tccaccaagt ggtaaagtca 1500 aaacatttga aacttttatc aaaaatacaa ttgaaaaaat aaataaagaa aataaaactt 1560 tttacgttat tggagacata aaccttgatg ctctaacttg ttcaaaattt ccaaaaacaa 1620 aatccttttt tgatatgcta cttaaattta atgtcttgtc agcaattaat aagccaacac 1680 gagtgacaaa aacctcggca acggcaattg acaatatttt cattaataat tacctggaaa 1740 tggaacttga aaccgggata ttcatgactg atattagtga tcactttccg atatttataa 1800 aagtaagaaa tttaaaaacc atgtctgatt atgaaccaaa agttataaat ttgaaaaaac 1860 gcaatcttaa aacaagtaac actaatgaat tatcaatcag actaaaacaa gaaacgtggt 1920 ccaacgtgta tagatgtcag gacaccaacg aagctttcaa taatttttta gatatattcc 1980 tagattgctt taataaaacc tgtcctctag agggcataat aattaagaca aaaacagttt 2040 ctaatccgtg gatggataaa tcactgataa aatgctcaaa aaagaagcaa aaactctata 2100 ataaatatct taaaaataaa aacactgata acgagaaaaa ttacaaaaat tataaatatt 2160 tttaccaaaa tctgattaaa aaggcaaaaa ataaatacta tagtaatgaa atcattaaat 2220 gcaaatctga tagtaagaaa acgtggtccg tcattaacaa aataatggga cggaaaaaaa 2280 taaacacaac ttctttacca aaaagaatta atattaacaa tacagacata ttctgtgaaa 2340 aacaaatatc cgaagaattt aacaaatact ttataaatat tggtccaaat ctcgctaata 2400 aaataaatcc aacaactgat tcatttaaaa attatctaac acccacaaat aaagccatca 2460 tgtacgataa agaatttgta tataaagaat ttgaagaagc cttttcctct ttaaaaaaaa 2520 agaaagcacc tggatttgat gaaataacta gtgatttagt tatctttaat aaaaccagtt 2580 taagtagacc tctgatccat atactaaagc tctcaataac atctgggatt ttccctgatg 2640 tacttaaatt agcaaaagta atccctatat ttaaatgtaa tgaacattcg gacataacca 2700 actttagacc tatatcaata ctttctgttt tttcaaaact cttcgaacgt gtaatttata 2760 atagaatcta taatcacttt accaaaaaca aattatttta cccaaaccag tttggatttc 2820 aaaaaaayct ttcaacagag catgcaatta ttgaattagt taatcaaata accgatggtt 2880 ttaatcaaaa caaatttact ttaggagttt ttattgactt atcaaaagct tttgatacag 2940 ttgaccactg catcctatta gataaactaa aatactatgg tataattaat aaaacttaca 3000 attggattaa aagttatctt accaatagaa aacaatatgt gtgcaatata caatctggag 3060 tattaaatgt tttatgtgga gttccccaag gatctatcct tggtccactt ctgtttctaa 3120 tttatataaa tgatttttgc aatgcatctt taaaactaaa ttcagtcatg tttgctgacg 3180 atacaaatct atttcttact aacagtgata tcaagaatct atatgcagac atgaatatcg 3240 aactaattaa agtaaataat tggtttagag ctaacaaact ctcgcttaac tcagagaaaa 3300 caaaatatat attattccat aaaaaggcgc aagaagaaaa ccttcctctt aagttgcctg 3360 accttatcct aaatgataac ttaattaaaa aacaagccag cattagattc ttaggagttc 3420 ttgttgatga gaagctatcc tggcttcctc aaataagata tattcaatct aaaatcagta 3480 atataattgg tttaatgatg tatagagttc gctcatatat caataaacaa agtctaaaac 3540 taatttactt tggactaatt cattgcttta tcagctatgc aaacatatca tgggcaagta 3600 ctcagccatc aaaacttaaa aaaatctata gtctacaaaa acacgcatgc agaatcattt 3660 actttaaaaa taagcgggag catgcaaaac ctataatgaa agatatgaaa atgatggatg 3720 tttttgaaat aaatatctat cagcatttaa tttttatgta tcaatttaat aataatctct 3780 ctcctacaaa ctttactaac aagtttgaaa taaacataaa tgaaaactac tatctaagag 3840 caaatataag taacacatat aaactgcctc aaaaccttaa taaatatact gaatatagca 3900 ttgcatatcg ggggccaaat atatggaata gatatcaaaa aggatccaaa tgtatggcaa 3960 agtcattaag ttcatttaag tttctgataa aaaaagaaat ttttaaaatt tgattattat 4020 ttgtgcttaa aaggattaca aagtaatttt ctttataatc ttatttattt gtttttactt 4080 ttgatcagcg cttagcagtt ttattagccc attagggttt ttaatttaat attttaatga 4140 aatttttatt ttgtaatcca tattacgata tctaatgaaa atgtatatag ggggctctat 4200 gaaaagattg cgatgacgtt ttgtcatctt catcttcttt gagcccctgt ctgtttaact 4260 ctttatttta taaagatcat tgtaaaaaag tatagtttat tttatatact gtatatacaa 4320 aaacggtgct tgttgacaag attttactgt cttctttgtt atacggacat atttctttct 4380 ttcttgtata tatatgtgtt gattatattt gttaaacagc aaaataaatg taaaaaaaaa 4440 aaaaaaaa 4448 // ID MarinerN-1_AP repbase; DNA; INV; 244 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW MarinerN-1_AP. XX NM MarinerN-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-244 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2067-2067 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 244 BP; 55 A; 68 C; 71 G; 49 T; 1 other; ctatactccg ttcagattaa gattgaaccg gcggctcgac caaaacacgt ttttttcgcg 60 catttcggac attggtggga gcgtatgcac ggcggccagc accggcgccg acgatcgacg 120 acggccggcg atgactgtcg gccaatcaca gaacgcatag ccatcgcata gggggtcagg 180 cactgttcgg cggctcagtc tgcatgcgcg aagccggttc aatcttaacc tgaacngagt 240 atag 244 // ID Mariner-8_BM repbase; DNA; INV; 1654 BP. XX AC . XX DT 28-APR-2010 (Rel. 15.07, Created) DT 28-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-8_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1654 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 943-943 (2010). XX DR [1] (Consensus) XX CC >94% identical to consensus. XX FH Key Location/Qualifiers FT CDS 346..1374 FT /product="Mariner-8_BM_1p" FT /translation="METTPTEAAQIVALLQEGLSQRVVARRLHISQSCVSK FT ACKRFRETGSFIPRPRSGRRRCTSERDDRFIVSTSLRNRHLPGVDVQQELR FT DVRGVAASEWTVRRRLKQANLTPKRPVTGPKLTVAHRQARLQFARTHLDWE FT VEQWRQVLFSDESRMCLHGSDRRGRVYRRPGERFAQCCFAETVAYGGGSCM FT MWAGISFDGKTELVFVPGGGRGGGLTSDRYISDILLEHVVPYAGYTGDDFL FT LMHDNARCHTARVTTEYLEEVGIATLDWPALSPDLNPIEHVWDEPKRKVRS FT RTPAPSCLNELKSALIEEWEGIPQESIQKLIRSMKNRLRAVIRARGGNTKY FT " XX SQ Sequence 1654 BP; 426 A; 373 C; 395 G; 459 T; 1 other; cactcgcgag caaaagtttg gaatcactta cttgaaattg gttccgcgcg atctcgtgta 60 ctaaatattt ttttgataat cataaaaatg acatcgtttt aaaggttcaa ctcttaactt 120 taaactgata ccaaattcat taaaatcaag ccagtattta agaagctatc ccggattaag 180 tgaaggaata acgaaaaaga cgtgtttcaa ttgcttgata cgtcgcgagt tgcgacctat 240 tgacccctat aaaagcacct gttatcggat tttctttatc agtcgacttt cacacgttga 300 ccggtacaca tctccgctac ttttgaccct ttaccttgta cgacaatgga gacaacacct 360 acagaagcag cacaaatcgt agccctattg caagaagggc tcagccagcg ggttgtcgct 420 cgtcggctcc acataagcca gtcctgtgtt tcgaaagctt gcaagcgctt tcgggagact 480 ggtagcttta tcccgagacc aagatctgga cggcgccggt gcacatcgga gagggatgac 540 cgttttatcg tgtcaacctc tctccgaaat cggcatttac ctggtgtcga cgtccaacag 600 gagctccgag atgttcgtgg ggtagcagcc agcgagtgga cagttcgtcg acgactcaag 660 caagcgaatc tgactccaaa aaggcctgtc acaggcccga aactcacggt agctcaccga 720 caagcacgcc ttcaatttgc tcgcacccat cttgattggg aggttgagca atggaggcaa 780 gtcctgttct ccgacgagag caggatgtgt ttgcacggta gcgaccgaag aggtcgggtc 840 tacaggcggc ctggggagcg atttgcgcag tgctgtttcg ctgaaacagt ggcttatggg 900 ggcggctctt gtatgatgtg ggccggcatt tccttcgacg gtaaaaccga gcttgttttc 960 gtgcctggcg ggggacgagg aggcggtcta acttcggacc ggtacatttc cgatattctg 1020 ctggaacatg tcgttcccta tgcgggatat accggtgatg acttcctcct tatgcacgat 1080 aacgctcgtt gtcacactgc ccgtgtaaca actgaatacc tcgaagaagt cggtatcgcc 1140 acattggact ggcctgcgct cagccctgac ttgaatccta ttgagcacgt gtgggatgaa 1200 ccgaagagga aggttcgttc cagaactcct gctccttcat gtctgaatga gctgaaatcg 1260 gcgttgattg aggagtggga aggtatccca caagaatcaa ttcagaagct gatcaggtct 1320 atgaagaatc gtcttcgagc agttattagg gcgaggggag ggaacacaaa gtactgacac 1380 ttaattttaa ggctaaataa aanttttatt tattaatttt tgttgtttta ttcattcatt 1440 atttttccat atttccttgt tttcctacct ccttcactta atccgggata gctccttaac 1500 tactggctcg atttgaatga atttggtatc aatttaaagt tgacagttga acctttaaaa 1560 tgatgtcatt tttatgataa ttaaaaattt atttagtaag caagatcgcg cggaaccaat 1620 ttcaagtaag tgattccaaa cttttgctcg cgtg 1654 // ID Copia-4_SI-I repbase; DNA; INV; 4115 BP. XX AC AEAQ01007866; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_SI_; KW Copia-4_SI-LTR; Copia-4_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4115 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01007866; Positions 4572 458. XX CC Positions [436-933] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 400..2019 FT /product="Copia-4_SI-I_1p" FT /translation="MSKLIQKPHNIKVTKKPLELIYLDLCGLMPTNSIKGS FT RYMLVIVDDYSGMYFVYFLKHKSEAFDHFMEFQKKFENRLETKIKSIRTDN FT GREFINENFRKYLNESRISHQKTVPFNPQSNGKMERANRVLLDRARTILND FT NKLPSEFWAEAVATACHVSNITPRKGKDNTPFESFFGHKPSLEHLKVFGCV FT AFFYVPKQHRDKLELRGKIGIMVGYAKSRTGYHIYDIKNQKIVEERTAKFH FT ENTMGYTLLKTKESFDEQFKNFDVESLFKETNEELVEDIDEIHEHDVDARS FT DYDSDIIDDVDELPQSIDVNEKRREKPEGTTKAVMQSRLSEQQRNRKAKLL FT KYGVRRSKRIADKSNDKVPRNDSNIFETNKTSTEIVPSNFKDAKESREWIN FT WYKAMKDEIESLNYHNVWEVIDKVPGVKVIKSKWVYSIKNNPENKTKKYKA FT RLVAAGFNQIKYKDYEESYSPVVTIETWRLLLSVAAKRDMRVRFYDVKTAY FT LYGSIDETVYMTAPPGFKKTIGLDKICRLQKKYIWIATIWAQLV" XX SQ Sequence 4115 BP; 1546 A; 657 C; 849 G; 1063 T; 0 other; atagtagatt cgaaattgat gataaatatg gcaccaatgt atgtctgaaa gatactctgt 60 acgtaccgga attaagaaat aatttgcttt ccgtaagaaa atgagatgaa ataggctata 120 aaataacgtt cggtaatgag ttagcattaa tttacgatag cgatagaaaa attgtaggta 180 aaggaattaa agacggatca aagtatatta taaaaggcaa aactacaaat gacgtgtgtt 240 tcatgggcac aaagaataca gggaacatga gtaacaaaat gctatggcgc aagcgattga 300 cacttaagtt ctaaatatac taacagacta gttaaagaat cactcgttga aaatgcgcat 360 aatattccaa aaaatgaagt cgaatgcgag tcatgtagta tgtcaaaatt aatacaaaag 420 ccacataata ttaaagttac aaagaaacca ttagagttaa tatatcttga tttatgcgga 480 cttatgccaa cgaattcaat taaaggttct cgatatatgt tagtgatagt agacgattat 540 tcgggcatgt attttgtata cttccttaaa cataaaagtg aggcattcga tcattttatg 600 gaattccaga aaaaatttga aaatagacta gaaactaaaa ttaaaagtat taggaccgat 660 aatggtcgag aatttatcaa tgagaatttt cgcaagtact taaatgaatc aaggattagt 720 catcaaaaaa cagtaccatt taatccacag agcaacggaa aaatggaaag agcgaatcgc 780 gttttgttag acagagcacg caccatacta aacgataata aattgccttc tgaattttgg 840 gcagaagcag tggcaacagc gtgtcacgta tcgaatataa ctcctagaaa agggaaggat 900 aatactccgt ttgaatcatt ttttggccat aagccatcat tagaacactt gaaagtattt 960 ggttgcgttg cattttttta tgtgcctaaa cagcatcgtg ataagttaga actaagaggt 1020 aaaattggaa taatggtagg atacgctaaa agtcgtacgg gatatcacat ctacgatatt 1080 aaaaatcaaa agatagtcga agaacgaaca gccaagtttc atgaaaatac tatgggttat 1140 acgctactta aaactaaaga atcatttgat gaacaattta agaattttga tgttgagtca 1200 ttatttaaag aaacaaatga agaattagtt gaggacatag atgaaataca tgagcatgat 1260 gtagatgcga gatccgacta cgattcagat ataatagatg acgtagatga gctacctcaa 1320 agtatagacg taaacgaaaa acgaagggaa aagccagaag ggacaacgaa agcggtaatg 1380 caatctcgat tatctgagca acaacgcaat cgtaaggcaa aactattgaa atatggtgtc 1440 cgtagatcta agcgaatcgc tgataaaagt aatgacaagg tacctagaaa cgatagtaat 1500 atttttgaga ctaacaaaac aagtactgaa atagttccta gtaactttaa agatgcaaag 1560 gaaagtagag agtggataaa ttggtataaa gccatgaaag atgaaataga gtcattgaat 1620 tatcataatg tatgggaagt tattgacaaa gtaccagggg taaaagttat aaaaagtaaa 1680 tgggtctatt ctattaaaaa taatcctgaa aacaaaacta aaaaatataa ggcgcgatta 1740 gtggcagcag gatttaatca aataaaatat aaagattatg aagagtcata ttcccccgta 1800 gtaacaattg agacatggag attgctgttg agcgttgcag ctaaacgaga tatgcgtgtg 1860 agattctatg atgtaaagac agcgtacctg tacggatcca ttgacgaaac tgtatatatg 1920 acggcacccc caggattcaa aaaaacaatt ggtcttgata aaatatgtag attgcaaaaa 1980 aagtatatat ggattgccac aatctgggcg caattggttt aaacgactcc gtaaagaatt 2040 gttaaattta ggccttaaac aattagccag caataattgt attttcatac ataacagagg 2100 taatacatat attcctatgc gtgagcgtct atgtcgatct atctattatt gataacaaca 2160 caaaggcagg tgacatgctc gtagaaaata tacgtaaagt atttaaactt aacgaaacca 2220 caaataaagg tatattttta ggcatggaaa ttaaagagac cgaaaaagaa ataacgataa 2280 ctcatgaggg gtatattaaa acattgctag aaaaatacgg tatattacat tgtaaaccag 2340 taagtacgcc aatagtacca ggacaggata aagagcccga ttcacctgat gatattgttg 2400 agccaaaagg ctaccaagag ataataagag aattacttta tttaagtaac agatctaggc 2460 cagacattac attcgcgaca aattacttat cacaattcaa tgtacggcca gagaagcgtc 2520 actatgtaat ggccaaaaga atttcacgtt acttatcagg cattttgaat tataagctcc 2580 gttatagtcg aaaacaaggc agattgaaca cgagtagtga tgcaagtgga gaaatgggat 2640 ctaaaatgaa tgggactaaa atgaaatctt tttcgggtgg cgtcctccaa ttagaaaaat 2700 cgcttataat gtggaactgt cgtaaacaga aatgtgtagc ggattctacg tgtgaggcag 2760 aattgtttgc aattaacgat gtaattaaaa atgtaaaatg actaatagga ttattatcgg 2820 agctcggctt cgaaacactt tataaactac ctgtgtgtat agcaagcaat aatcaatcgg 2880 cgatagacgt actcaaagac gcgaagtcat cacgtagata acgccatgtt ttactaaagt 2940 cacaatacat taaagatgag attgccgagg ggcgtgttta tgtatcttaa tgtatgttaa 3000 tactgatttt atgaaagcgg attttttgac caaagctgta accaaagaaa aacttgtgtg 3060 gagttgtaaa gaactgaatc tatattaatt gttccaaaaa aaaatgataa agtttgtgtg 3120 tttatatatg tcggttattt tataaaatta tacaattcca cgcaggtggg gaaatgtgga 3180 gttttgcgta ctcattgtat taattttagt tagttcgttt ataagaggtt ggtagaccta 3240 gtacatttat gttatgcgag ttccgatctg actagattta taccctatgt tcatgtctca 3300 tggcgcgctg ttcttcccgc gtaccctctc ttctgctgta agccgccaga cgacacacac 3360 cttcgcccgt ttgtaataaa gctatcttat tttcatacgc atctatcgga gattctctca 3420 ttataaccca acaaagtggt agcagagcgt ggttcgtgca atcagaagct gctgagagca 3480 aaagccgaaa aataagaaaa ttcagcgacc gaggtaagaa gcggccacgt ggaaagaaag 3540 aaaatggata agatacctat gagtaatatc ccacaactca cggcgacaaa ttattacgta 3600 tgggccatga aagtagaagc cgtgttaggc ctaaaaaagg ttagatacgt tctcacgacc 3660 gatcgacctg taaacgacaa agaccgcgag agatgggaca gcgataacga cgtcgtgtcc 3720 atcataaagc taaccttgtc cgacgaccag gcaatgcagt tcgcggatga attccacgca 3780 aggcaactat ggctgaaaat aaaggaaacc tacgttggac gcctcgaaga ctcgaaaatc 3840 gacgccacag tcaagttacg tagcatcaca atgaaagaca aggaaacagc agcagactac 3900 gtggcacgtg cacaaggcct agcatcaaag tgcaaaggac tgaacgtcaa catcacagat 3960 cgcgagctcg tgtactacgt cgtgagagga ttaaatggca aattcagtcg aataaggaac 4020 acacttaaga cgcaacaaga caagagactc aacgacgtac tagaagactt acgagaggaa 4080 gaaagagaga taaatactca aaagaagagc aacga 4115 // ID DNA-1_Bf repbase; DNA; INV; 1581 BP. XX AC . XX DT 06-NOV-2008 (Rel. 13.11, Created) DT 06-NOV-2008 (Rel. 13.11, Last updated, Version 2) XX DE Putative DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-1_Bf. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1581 RA Jurka J.; RT "Non-autonomous DNA transposons from lancelet."; RL Repbase Reports 8(11), 1798-1798 (2008). XX DR [1] (Consensus) XX CC 7 bp TSD. XX SQ Sequence 1581 BP; 429 A; 338 C; 361 G; 451 T; 2 other; catgggcgat aggccatcga gtttttgttg aaatcgctgt tgctatcttt ttgtacgtat 60 atactcaaca cacactgtat acagtaatac cgtactgtgt actcagtcca ttgaatcaaa 120 taataaacaa gcatgctgag cttgaaagac gcactgtgta cgccgagctc cgtttgctgc 180 ggtcgttcta gatcacgggg atagttcgcg caaaggccga cgggagtaga acagtttcgc 240 agttggctcg aacgcgttcg cgaagccctg ctcgaaccga tgaaccgcca agagggctaa 300 cttcactaac tacataaaaa tgactgggta ggaaatgcta tccggcaaga gcgttaccaa 360 taggggtgac cagcaggaaa agtgagttag aaccacggtt ctatggtgga tctgggactt 420 ctcggctttt acgtgagaat agttttcaac gtggctggtt tgacaaaagg cgataggcca 480 tcgagtattt gttgaaatca gtattacatg tactttgtcg tactaaaatc agactctatg 540 aactcaaaca gtgtgtactc gttccacaga gattcacgag aatcaatcat acactgacta 600 ccamacacac agtgtactcg gtccactcaa atattcccga gaatcataca ttaacacact 660 ctgtgtcttg ttcagactct gaaagaattc cgagaatcat aacactaaca ttgacatcac 720 tgtgtactca gtccactgaa agattcccga gaatcataca ctacactatt acactataaa 780 acacgcggtg tgtacttggt tcactgaaac tttcggcggg aaattttctg agtctcgcgt 840 ctctctgatt ggctagcaat gatgtaatcg tcttgtgcag cagcgaccct ascggccaga 900 tatggtactg ctggtaacgc tgttccccgg aaatcggact ttacaaaatg ctgcatatct 960 tctttattat taggtgttgt gtcccaagta atcttacatg gctagtaagt aacacctaga 1020 taataagtag tatcaacgtg ttttatttca ttgcggcatt tcgccgaaaa caagggcttt 1080 tgaaacggcg tttcctacct attcattatt gttcctaaca ccaaagcgtt cggaagtcgg 1140 tgttcggcaa acttcgggtg atttcggcaa ccgttcgtat gtgtgattct ttggaatcga 1200 cagcgtgttt gtgtaaccgc taagtgtacg attctcggga atctttccgt ggtcgggaat 1260 ctttccgtgg accaagtaca cagtgtgtgt aagtgtacga ttctcgggaa tctttccgtg 1320 gaccgagtac acagtgtgtg taagtgtacg attctcggga atctttccgt ggaccgagta 1380 cacagtgtgt gtaagtgtat gacgattctc gggaatgttt cagtggaccg agtacacagt 1440 gtatgtttaa tatagtgtat gattctcggg aatcccatgg tggaacgagt aaactacaca 1500 ctttgagttc attagagtcg gatttttatt acagcaaact aatactgatt tcaacaaaaa 1560 ctcgatggcc taccgcctat g 1581 // ID P-9_HM repbase; DNA; INV; 3007 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3007 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 355-355 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 144..2630 FT /product="P-9_HM_1p" FT /translation="MGRKCCVTNCNGNYDQSSKEKTFRLPRIPEERKRWLA FT VIPRDNIPDKKDTVVCERHWPVGYVLTKDYGKERPRDPPSVFTCVKSSLIP FT SPSGCLRTTLRSHAESRNVQPDELSAFNLKDVIKSFNDLKTNIESYNFNGF FT DILFNTKTDNQVLIQSLEFFHNTGVPLFILKINNNFQYEAFHCGVKCVIKT FT LSQNRITILNRWSHIVEAVRFLNAYEISSKKNIIEQQLFSMGDKLVTDKKY FT SVETMIRAFEYFSMSRATYNRLREDFQLPSITTLTRLTSKVKSVHEDIFLQ FT KVFSNLSDIRHKNCILLLDEVYVKTMLQYHGGTVFGKAVNNPNVLANTVLS FT FMVVTLFGGPKFLCKMLPVREIDSNFLFEQTNFLLSAIKAAGGNVVCIVCD FT GNRVNQAFFKKFETDGVPWRTKDNIYLLYDFVHLLKNIRNNWITEKTQELE FT FYVNKEKKVAKWSHIVALHKLESNQMIKMSKLTDVAVFPKPIERQKVSTCL FT KVFCEETVCALKCHPDLKNVDDTISFLSIIIEFWRIVNVHSPYADIRMRDL FT NRAAIFSSGDFNLQKLLEFGNLAKQMCSSQKVRCKSFTKDTSKSLYHTCNG FT LVELSKFLLSTNHKYVLLGTFTTDPLEKQFGKLRQGSGGTYFITVQQILEK FT VGIIKTKLLLQLDNNGDSLDFTTSGHSCERCGFYMNEEKCLIFDNLPELEN FT ILSIDVKMSLIYIAGYVVRNDEDSDDSYFYYEKFRYFTDEINRGGLTVPGD FT FVCQWVIYCYIMFREVADNTCISSLCNLMMLISESYQLNMNRKHGVILSNI FT FFKNYCSLYSPRSEKEPKQKILKLSVK" XX SQ Sequence 3007 BP; 1067 A; 406 C; 488 G; 1046 T; 0 other; atggcgtact ttataacagg cctattcacc gtgcggatga cgtcaaaaac tggaggcgaa 60 cattttttaa accatatttt ctgttaagcg tcgaagctct tgttatttat ttatttagtt 120 aaaaatttaa atttttaata aacatgggtc gtaaatgctg tgtcacaaat tgtaatggaa 180 attatgatca aagttccaaa gaaaaaactt ttcgtcttcc tcgaatacca gaagaaagaa 240 aaaggtggtt ggcagttata ccaagagaca atataccaga taaaaaagac accgtagttt 300 gtgaaaggca ctggcctgtt ggatacgttt taacaaaaga ctatgggaaa gaaagacctc 360 gtgatcctcc atctgtattt acgtgtgtaa aatcaagttt aataccttca ccatctggtt 420 gtctcagaac tactttaaga tcacatgcag aatctagaaa tgttcaacct gacgaattgt 480 cagctttcaa tttaaaagat gttataaaat catttaatga tctgaaaacg aacatagaaa 540 gctacaactt taatggattt gatatattat ttaacactaa aacagacaat caagtattaa 600 ttcaatcatt agagttcttt cataacaccg gtgtgccatt atttattctg aaaataaata 660 ataattttca gtacgaagca tttcactgtg gagtcaaatg tgtaataaaa acactctcgc 720 agaacagaat taccatatta aatagatggt cgcatattgt agaagcagtt cgatttttaa 780 atgcttacga gataagcagt aaaaaaaata ttattgagca gcaactattt tcgatgggtg 840 ataaacttgt cacagataaa aaatattctg ttgaaacaat gattagagca tttgaatatt 900 tttcaatgtc acgtgctaca tataatcgtc tcagagaaga ttttcaactt cctagcatta 960 caacattaac aagactaaca tcaaaagtaa aatctgttca tgaagatata tttctacaaa 1020 aagtgttttc aaacttatct gacattagac ataaaaattg tattcttcta ttagatgaag 1080 tttatgttaa aactatgttg cagtaccatg ggggaacagt gtttggaaaa gcagttaaca 1140 atccaaatgt tcttgctaat actgttttga gttttatggt agttactttg tttggtggac 1200 ccaaattttt atgtaaaatg cttccagtac gtgaaattga ttcaaatttt ctttttgaac 1260 aaactaactt tcttctatct gccataaaag cagcaggtgg taatgttgta tgtatagttt 1320 gtgatggtaa cagggtaaat caggcctttt ttaagaaatt tgagacagac ggcgtacctt 1380 ggcgtacaaa agataacatt tatcttttat atgattttgt gcacctttta aaaaatattc 1440 gcaataattg gataactgaa aagactcaag aacttgagtt ttatgttaat aaggaaaaaa 1500 aagttgccaa atggtctcat attgtagcat tgcataaatt agaatcaaat caaatgatta 1560 aaatgtctaa acttactgat gttgctgtat tccccaaacc aattgaaaga caaaaagttt 1620 ctacttgtct aaaagtattt tgtgaagaaa ctgtctgtgc cttaaaatgt catccagact 1680 tgaaaaatgt tgatgatact atttcatttc tttcaataat aattgaattt tggagaattg 1740 ttaatgttca tagcccttat gcagatatac gtatgcgtga tctaaaccga gctgcaattt 1800 tttcttctgg tgattttaat cttcaaaagc tgttagaatt tggaaactta gctaaacaaa 1860 tgtgttcttc tcaaaaagtt cgttgtaaaa gtttcacaaa agatacttct aaaagtctct 1920 atcatacatg caatggtctt gttgagttgt caaagtttct cttgtctaca aatcataaat 1980 atgttttgct tggaacattt accactgatc ctttagaaaa acagtttggg aaactcaggc 2040 aaggatcagg tggtacatat tttattactg tacaacaaat attagagaaa gttggtatta 2100 taaagacaaa actacttttg caattagaca ataatggtga tagtcttgat tttactacat 2160 cagggcattc ttgtgaaaga tgtggttttt acatgaacga agagaaatgt ttgatatttg 2220 ataacttacc agaattagaa aatatattat ctatagatgt taaaatgagt ctgatttata 2280 tagctgggta tgttgtccga aatgatgaag attcagatga ctcatacttt tattatgaga 2340 aattcagata ttttactgac gaaatcaatc gtggaggatt aactgtacct ggtgattttg 2400 tatgtcaatg ggttatatat tgttatatta tgtttcgtga agtggctgat aacacatgca 2460 tttcttcttt atgcaatcta atgatgctta tttcagaatc atatcaactg aatatgaata 2520 gaaaacatgg tgttatactt tcgaacatat tctttaaaaa ctattgtagt ttgtattcac 2580 caagatcaga aaaagaacca aaacaaaaaa ttttgaaact aagtgttaaa taacattgtt 2640 atttaacact gagtttcaaa aaaaaatcca acataaaata ctgtttctgt tttaaaaaat 2700 ttaaataaaa ataatatgat gaagaatgat ggtgaaggta ttcaacttgg ttttacttgg 2760 tttttatggt tagtaaaata aatgataatt tgtaaaaatt tatacaaaat ttttaaattt 2820 atgtataaat ttatttatag atatatttgt aaaaaaatta tttagtattt ttttatttag 2880 tatttttttg attgaaaatt aaatttcaag tttaattcat gttattagtg ttttttctgt 2940 tcaaaaaact cgcctcagtt tatgacgtca tccgcacggt gaataggcct gttataaagt 3000 acgccat 3007 // ID Gypsy-604_AA-LTR repbase; DNA; INV; 216 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-604_AA_; KW Gypsy-604_AA-I; Gypsy-604_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-216 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 216 BP; 63 A; 33 C; 51 G; 69 T; 0 other; tgtaaacttg ccagtggagc gaaatcatat tttgcgttgt ttaaattgag agtgtatggt 60 caagagtgta gaagtcagtg tagagttgaa cgtaggctca tcattccatc ggtagcttag 120 gtcagttgta gaaaaagtac gaacacgatc ggtttataaa cacgttgtca accattattc 180 gaagtgtttt atttgaaatc cggatcctag tcttca 216 // ID Copia-22_DPu-LTR repbase; DNA; INV; 288 BP. XX AC scaffold_66; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_DPu_; KW Copia-22_DPu-LTR; Copia-22_DPu-I. XX NM Copia-22_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-288 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 708-708 (2010). XX DR Genome; scaffold_66; Positions 42296 42009. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 288 BP; 74 A; 65 C; 39 G; 110 T; 0 other; tgttgatgtt gaccgttatc tacacaacgc catctacccg tcatgtctca atgcaaaccc 60 acgcagacga tcgccttact ttctgtttaa gtacttgtca atcttataac gttttcttct 120 gtgttttctc tattacgttc atcgtacgta tgagattttg ttctctcgag tcctacttat 180 tcatcgttcg atactgaaag acgtgtactc tttactcatc ctaatctggt aatacaaatc 240 tcatattcaa gttatatatt gttcatatgt gtcaatcaaa tattaaca 288 // ID DNA8-100B_AP repbase; DNA; INV; 501 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA8-100B_AP. XX NM DNA8-100B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-501 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2037-2037 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 501 BP; 167 A; 78 C; 69 G; 187 T; 0 other; cagtgttcgg aacgttcact aaaaaaatga attcgttcac gttcagtgct taaaaaagga 60 actcgttcac gttcaaattc atattttttt caaatgaaca cgttcacgtt cacgttcttt 120 gaaaatatga atgcgttcat ttcgtcgttc atttaattta ttttatttta tttttatgta 180 tcgtttaaat acgaatacaa aatcgtggaa atcataatat tattgttgtt ttatgtccag 240 gcgtgcccac tggaatttca tgtattttat ttttataatt cgtaatatat tatatatcat 300 aatagtatta tcaatcaata tttattaaat aataaataaa tttgaacgtc ttaaaaatcg 360 ttcaaattca ttaaatatat aaaacgtcgt tcacgttcac gttctatttg aaaaaaacga 420 gtgacgttca cgtatcgttc actaaatatg aacgtgaacg cgtgaacgag cgttcatgaa 480 cgacgttctt tccaaacact g 501 // ID BEL-593_AA-LTR repbase; DNA; INV; 579 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-593_AA_; KW Pao_Bel_Ele54; BEL-593_AA-I; BEL-593_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-579 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 579 BP; 154 A; 104 C; 173 G; 148 T; 0 other; tgtctacgag cgacataagc ggcacggagt gatccagcca ctctgcctgg ttcaggtact 60 ccaccggcta ccaagcggat tgagtgccgt agctggtgat ggtgaaccgg ttcaatgggg 120 agctcaccac tacggtagca tctacggaga gggagagagg ttcgcttggc catggcggag 180 atggttgaaa gcgaaagtga tcgccatgta tatagtgaaa tgtaggtcag tagggcaggt 240 cagtgtcgaa accaaccact gtcgcgatcg gtcgaccggg ggcgaacccc gtaacatagc 300 ttgttgtaga ttttgttcaa tgtctagttt aataagttta aataaatgtt tagtactgta 360 aaattgtata gtgtagttga ataaagctaa tgtagtgtat ttaatgtaaa gcaatgtgtt 420 aaataaacgg tagtttgtga ctattgcttg tggtgtgctg catgttattt ctgaacggat 480 tggtcgttgg atgacgggtg ccaggaaagg acaacccaac caaggaagtg ggttcatcgt 540 agtagagaga accgagagcg gtcagggcgg tccctgaca 579 // ID Harbinger-N9_BF repbase; DNA; INV; 201 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N9_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N9_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-201 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-201 RA Kapitonov V. and Jurka J.; RT "Harbinger-N9_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 822-822 (2008). XX DR [2] (Consensus) XX CC This transposon contains 37-bp TIRs and is flanked by 3-bp TSDs. XX SQ Sequence 201 BP; 53 A; 45 C; 41 G; 62 T; 0 other; ggcttaggtc acatttccaa accggggccc ggccgggatg ttttaagaaa cgaaaaatta 60 aaatgtatac ctagaaatat acacatatca tgcccatgaa tcttattttg acattttgtg 120 tattttgatg tcttttatat cattcttttc gctcccgaaa gctgcccggc cgggccccgg 180 tttggaaatg tgacctaagc c 201 // ID Kolobok-3_Aplcal repbase; DNA; INV; 5081 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-3_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5081 BP; 1422 A; 1008 C; 1082 G; 1569 T; 0 other; ggcatgatgc gggggaaggc catgatcaaa tgacgtcatt ttttagtctt agatcgaaat 60 tgggctttta acggattcga gtactttgaa atgtctactt tcacaaaatg aaggtcttat 120 gtccctgtca gcatcagaac atgttcaaaa acgctcacaa agatgacacc cccccccccc 180 ctttcctgca cgcaaaaaaa tgacgtttta ccctaaaaca ccaagctggc ggaataataa 240 ataatatatt tgaggttgta gatggaaaaa gctgacgttg cacccaatag cactaacaga 300 gaatgcaaaa aagttgcgaa aacttccgct ctaaagaata cgttcataat tgcaaaaagc 360 atcccgcagt ttgatttttg ttgttgaaaa tctacggaaa attttgagtt gcgcgttgat 420 aaacatggca cgaaaacgat cagagaagta tcagagtcgc cgtcatacac caagagggcg 480 caaaggtgga agaaagcgac tagaaggtca acaggctgag aattttaatc tactttgtgt 540 gaaccagaga aacagcagtg tcgacaatga tgtgcgacgg catgctgagt gtgacaatac 600 caatgtagat gaaaagtaag tggaacgatg tatcacatcc aaagctctct ctctctctct 660 ctctctctct ctctctctct ctctctctct ctctctctct ctctctcccc cctctctctc 720 tctctctctc tctctctctc tctcacactc tctatctacc tatctatcta tctatctcta 780 tctatctacc tctctctcta tcgatctatc tctctctcta gattttctat gtgtgttagt 840 atatgtgtgt gtgagtatat ttgtgagtgt gtgcgtttgt gtgtgtgtgt gtgtgtgtgt 900 gtgtgtgtgt gtgtgtgttt gtctatctgt gtgtgcgtgt ctgtgtgtct gtctatgtgt 960 gtttgtgatg tgttttccga gggtcagtca ttcgttcttt catccttgct atttcccagt 1020 tgataatgta tacatgtaga aacagggttg atactgcaat tatctttttc cttccttgca 1080 attgaaagtg caaacgacag ctgtgaatta gagctttcaa tactatcgct tggtctcaat 1140 atctgagtga ctccttattg ttttattttc tcctgtctgc cttattttct tctgttccaa 1200 cataattttg cctgtttccg tataatacag attttgtaca gaactatctc tatagcttgt 1260 taactttcgt ttggccgtgt tttaaaaaat tctatctcat ttactaattt acctatgccc 1320 acgaaacttt gcatggcact tctgcaggcc agtggccaaa caattttttt gttcaaaaaa 1380 tcttgagcca gtaataaaaa aagtttttca ctttgggccc tatattaaaa accactctta 1440 ttgatgcata gacaggatca ctaaaataaa tgacttacag ttacagcaac ctaacataca 1500 cattgctcat tttgagacag cgacatcgcc ctgtccccac atttcagtgc aagcttgtag 1560 attttatccc ccaactgtct ggacgtctcc acgtgcttca cgtgagtctc gaaaattact 1620 ctcgccgtct cttgaatcgc gtttcttagg cgaacgttga actgtggtgt tcgagaaaat 1680 attatgacaa tttttatgct taaggagtat gctgtatatc ccttattata tttttgaccc 1740 gctttaggct tacatcaaat aaaacgtgtc ggaaccttag tttgcagtga cctttataaa 1800 tgggctttta gattctgttg gtcatccaac ttttagactc gtatgcaatg atctgagaag 1860 attaaactga gcgttttcat tttgttttgt ttcaacggcg gtcagacgaa agtgaacaag 1920 ttataatgcc ttacattgtt ttgttccagc ctcatacttc ctcccccagc agccttcgcg 1980 gatgctgatg caagcgtgtc tcaccaacga ggtgaaagta ctgtttcaac actgcccctt 2040 tttcaacaat catcctgctc gaaaaatggt tcgttggaca ttcttcctgt atgtgttgat 2100 gccctaacct caatgtctca acaatcacca cagccttctg ccacgacatt gactttctcg 2160 gacacagcac agtacgattc gaatatagct gggcctagta gtttaggtag tgggatcttc 2220 aggtatagtg atggaacatc aagtgaaagt gataacgaca gtagtgaaga ggaaattttg 2280 gatgaggatg aacgtgctcc tgatcccatt gccagaaaaa aaatttcact aatgaggaaa 2340 gttacaaaag taggcgacaa tgagggatat gcactttttc attggtcatt tctaactgaa 2400 attattgcac tgcttgcatg cccacgatgt aagagctgca gtttgtcagt ttgttctgat 2460 tacaaaggtg gacttatgtg gaggattata gtttcgtgct ctgtgtgtga ggaggacatt 2520 tttgtaggag gctcttcccc gggaggaaag caaaaagata taaccaagcg acttgtttta 2580 gctagtaaag agtgtgggct tggatatgaa ggactttgca attttttttc aattttaaac 2640 ataaacaagc cgttgcatca taaaacctac caggaaacat cccaaaagat acacgcagct 2700 gcaacccagg aagccgaaaa atgcatgaaa acggctgcga agattgtttc agaaacgtct 2760 acagttataa atcctggtca aacaagagtg ccagcaacta cgatttcatt tgatgggacc 2820 tggcataaaa ggggtcattc ttcccatttt ggtgttggtg tggttattga ctgtaagaca 2880 ggtttcgtct tagattacca agtcctcagc aactattgcc atggatgcga ggtagggtta 2940 aaatcaggag atgaacagta tttgctttgg aaaaacaagc atcaattaaa atgccagcaa 3000 aattttcaag gaagcgcaaa agcaatggag gctgaagcag cagtaacaat attcagaaga 3060 tcagtccagc accgtggtct tgtgtacagc aggatgctgt gtgacggaga tgccagatct 3120 caccagttga tcaacacaaa aggaatatac gactttgaag tcataaaaga agactgtatt 3180 aatcacatca gcaaacgtat gttcaacgcc ctagagaata ccaaaaacag caacaagaaa 3240 gaactgaaca gaaaactcac gaaaacaaag atagaaaaaa tcaccaacac atatgcaaca 3300 aacttaaagc agaatgcacc cgacacagag caaatgcaga gtgatgtcta tggtggaatt 3360 taccacatgc tcagcaccga cgacaaccca caacaccatc tctgccccac tggcatcagt 3420 tcatggtgcc acttccaacg agccttggca acgaaagaag agccgcgcaa gcacacacca 3480 accataacag aggatgttgc aaaatttgtg tggccagttg tggagagact gactcgacca 3540 gatgttttaa aacgatgtgc gtccatgcaa acgcaaaatg ccaacgagtg ctttaactct 3600 ttgatttggt cacgctgcac caaaactcga ttcgcgtcgt tgcggtctgt cgaaacggca 3660 acagctctct cagtcttggc gtttaactgt ggtccttctg ggctcttcgc cgttttggaa 3720 gccttgaaca ttccagtagg aggaagccac cacaagcacc aagccaggaa aaccggtaac 3780 aagctaaaga gtgccatgaa gtgcaggaaa cgcgcatcaa aatggggtcg caaagaccga 3840 aagagacagc gactccagct tgagaagaag agacagcaag cagagggtga tctgtatgaa 3900 gcaggggcct tcaatatgtg aaccgtcata gtggacgaga catcttagca tcatttttat 3960 gtcagtagat tcatgttcag tacaatttta tgtaaaggac catgtatcca tttatgtatt 4020 tttcaaattg cattgtgtat attattttat tatgtatgtg tatgtgctca tttagtagcg 4080 atgagtaaac gtgcgcgtgt gataagcaag taggtgtgtg tttgtgcgtg tgtgtgtgtg 4140 tgtgtgtgtg tgtgtgtgtg tgcgtgtgtg tgtgcgtgcg cgtgtgtgtg tgtgagtggg 4200 gggggggggg gtgttgtgca tgtgtgggca tatgtgtgcg tgtaatgtgt gcgtgagtgt 4260 gtgggttgtt tgtgtttgta tgtgcgtgtg ttagtgtgtg tgtgtgtgtg tatttgtgtg 4320 tatgagtgtg tgtgtgtgtg tgtgcgtgtg tgtgtcgcgt gcgtgaatat tgttcgtaca 4380 tgcatttatg taaagcgcct agagatgtgt ttacacggga gtgggcgctg tagaaatgat 4440 cttcattata attattatta tctaagccat tgagagtaat aaatgtgagc gtatgactat 4500 tttatttcaa tacgttttgc aagccaaaat tttgacgtcc attttctcaa attaaatttt 4560 ttgagggggt cgcaatcgga aaaaatctat cattcgaaaa gtatttaagg tatctaccta 4620 atttttggta cattgatagc tgaaacatta ttctagtgaa taaagctctg agtttttaaa 4680 aagaccaaaa ttaaatggtt ttattacaga ttttatcagc tgccactctc catctttttt 4740 gtggttgaaa tgtttcccta atttgaacct tgattctgaa aaactttctt ttcctacatc 4800 aaaaatgcca gttttatcca ctaaaacagc cacagtatgc tacaaaatcg cataaaatac 4860 ccatttggtg ccatttataa ccttatgata gattttttag aaatgtatac attttctgta 4920 tatgacattc caatatggca gccgttacca tagcaacaca aaattttttt caaaatttta 4980 cttctggccc aattctccac tctatttatg ttgatgctac catatttgta agcttagttt 5040 cttcttatta atgaatttca aaatttcccc cgcatcatgc c 5081 // ID Gypsy-148_AA-I repbase; DNA; INV; 7001 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-148_AA_; KW Gypsy-148_AA-LTR; Gypsy-148_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7001 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1021-1021 (2011). XX DR [2] (Consensus) XX CC Positions [4616-5128] - Integrase core CC 'AGGT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 452..2266 FT /product="Gypsy-148_AA-I_2p" FT /translation="MAKASNASIINAFDLSEEELDYELKIRNIFLEEPEPH FT KRERLRNAFKNESQAPVLTEAIYFLNEWPIAVRKLRIIEQGLKGDIHIRWI FT SQLRHWKERIERSKVTDLAQEEQKYDVLLAIRQLMNNYKETIKKLMNLSDS FT EEDKLKWDMEGLKLGQNSKPKTQFQEKFEGNIQFHSSLAPRAPKVNEMPYE FT NNAGKNTHKTVGDLFTNLPNIRDIRIGPETARSMMDEREALRLGTRPKDLN FT QRHKEVKIKGHKVEKHKKHDKSSKKRLPQSSSSSEESSSLDSLSSLLSSDS FT SSDAELLESSERGFSRNRRDDRFQHYRLDRWGIQFSGDAQGMDVADFVFQV FT NELIASERIPDDRFLDQAYILFSGEARRWYFTYKKKYKTWNSFAKHLKIRF FT GDPNKDRKLLQDIKDRKQKKGESFVAFCAEIEGMFERMTKQYSERKRLKVL FT RNNMRRWYKTKLTFYKIKSIAHLSMLCQQLDKDSGRIYSKNTQPFKKHVRN FT VEATSESSSSSSEDADVCAFERRNRQEQKERSKYFSQKTGDLSAGKEVLKE FT PTALCWNCRKYGHRWRDCKQPKVIFCHACGTPGVTFLTCPKSHLLPQKNSK FT NEVLEEN" FT CDS 2395..4467 FT /product="Gypsy-148_AA-I_1p" FT /translation="MALLDSGAEISVMNSLDIVNKYNFKLHKSKLKVDTAG FT DSSYECVGFVNLPITYNNITKVIRIYVVSEFSKKLILGINFWKAFDIVPAI FT KGESNQIRTTHLSEIQSNDSLEFSENHFSSENQCIALKFNTFVGALEETMD FT NEEDLTLDIPTLDFESVPIDTIETEHSLTEQDRAKLLDAIGQFEVSEENRI FT GKTALLEHEIKLIPGVELKASPMYRCSPYVQKFVDEEIERMKLMDVIEPCE FT SEYASPLLPVKKPNGKFRVCLDSRRVNSATRNDAYPMPHLHEILHRIEHAK FT YFSVIDLKEAYWQIPLTENSRGLTAFRTKMGLFRFKVMPFGLKGAPFTMAK FT LMDLALGCDLQPYVWVYLDDIIIATKTLSHHIQLIKEVAKRLKQANLTISL FT AKSKFCRKSVRYLGYIVSEQGISIDMEKVRPILDYQPPKTVKDIRRLLGLA FT NFYQKFIKNYSAITAPITELLKKENKKFLWTAEAEKALENLKEALISPPIL FT ANPDFNELFTIESDASDLAVGAVLTQIQNGEKRVIAYFSKKLNSTRRKYAA FT VEKECLAVLWAIEAFRNYVEGTHFKVITDARSLVWLSKVSAEKGSAKLCRW FT ALKLQQFDFSIEYRKGRDNITADCLSRSLNTISTKWEDLEHDSLKKKISEY FT PNEFKDFKVIDNNIYKYVKDDSKITDNRFNWKIVPKKIRDFN" XX SQ Sequence 7001 BP; 2592 A; 1085 C; 1368 G; 1956 T; 0 other; ttggcgccca actttcttaa agcttcgacc cactggaatt ggaattttga atatttgaat 60 tagaccaaca acacctatta ttttagttaa ttaaggaaaa gttatattag tttatgagat 120 atattgaaga attattgatt cgagcactta gattaagaaa tatttgaatt ggatatcgaa 180 ttaagcaaag ttttttcttt attattattt ttcttttcgt ttttttttct tttttttttt 240 tggtgaaagg tttttttttt ctttgtttta taggataagt agataaaaaa agatttttgt 300 ttctttcttg aattttgttt tatattttgc tattttctat atttttttct tcttctttta 360 aaatttttat tcggacgaaa ttgttttcct taaccatcaa cggggtaccg atttcctaaa 420 aaaaaaaaaa ataaacataa aaataataaa aatggcgaaa gctagtaatg caagcataat 480 aaatgcattt gacttaagtg aagaagaact agactacgaa cttaaaatca gaaacatatt 540 tctagaagaa ccagaaccac acaaacgaga aagattgagg aatgctttta aaaatgaaag 600 tcaagcacca gtgctgactg aggccattta tttcctaaat gagtggccca tagctgtaag 660 aaaactcagg attattgaac aaggactaaa aggagacatt cacattcggt ggatatctca 720 gcttcgccat tggaaagaac gcatagaaag atcaaaagta acagatttgg cacaagaaga 780 acaaaaatac gatgttttat tggccattcg acagctcatg aataattata aagaaacgat 840 taaaaagtta atgaacttga gcgactccga agaggataag ctaaaatggg atatggaagg 900 attaaaacta ggtcagaaca gtaaaccgaa aacacaattt caggagaaat ttgagggtaa 960 tatacaattt cacagtagtt tggctcctag ggctcctaaa gttaatgaaa tgccttatga 1020 aaataatgct ggtaaaaata ctcataaaac agtgggcgat ctttttacaa atctaccgaa 1080 cattagggat attagaattg gtccagaaac cgcgaggagc atgatggatg aaagggaggc 1140 gttgagatta ggaacaagac caaaagacct taatcagaga cataaggaag ttaaaatcaa 1200 aggacataaa gttgagaaac acaagaaaca tgataagagc agtaaaaaac ggttacctca 1260 aagttcgtca tcctcagaag agagttcttc cttagattcg ctgagtagtt tgttgtcctc 1320 agattctagc tcagatgcag aacttctaga aagttctgag cgaggattct ccaggaatcg 1380 tcgcgatgat cgttttcagc attatcgttt agatagatgg ggaattcagt ttagtggtga 1440 tgctcaagga atggatgtcg ctgattttgt attccaagta aatgaattga tagcttcgga 1500 acgaatccct gatgacaggt ttctagatca ggcatacatt cttttttccg gagaagcgcg 1560 acgttggtac ttcacctata agaagaaata caaaacatgg aatagttttg ctaagcactt 1620 gaaaatacgg tttggagatc cgaataaaga tagaaagtta ctgcaggata ttaaggacag 1680 gaaacagaaa aaaggggaat cttttgtcgc gttttgtgca gaaatcgaag gcatgtttga 1740 gcgaatgacg aagcaatatt ccgaaagaaa gaggctgaag gttctaagaa ataatatgcg 1800 taggtggtat aaaacaaaac ttacgtttta caaaattaaa agcattgctc atctcagtat 1860 gttatgccag cagcttgaca aagacagtgg taggatttat tccaagaata ctcaaccgtt 1920 taaaaaacat gttaggaatg tcgaagctac ttcagaatct tcatcgtcat cttcggaaga 1980 cgctgatgtt tgtgcttttg aacgcaggaa cagacaagaa caaaaagaaa gatctaaata 2040 tttttcacag aaaacaggtg acttgtcagc agggaaagaa gtgttgaaag aacctaccgc 2100 actgtgctgg aattgtcgaa aatatgggca ccgttggcga gactgcaaac agccaaaagt 2160 aattttttgt cacgcttgtg gaacgccagg ggtgacattt ttgacttgcc caaaaagcca 2220 tttgctacca caaaaaaact caaaaaacga ggttttggaa gagaactaag gggtatttct 2280 tttccaaata tcaaaaataa ccccaacaac ggggagattt ccaatgttta tgagttgatc 2340 gttgagccca gaaaatgccc tcatatcaaa attaaaatag tggactcaga aattatggca 2400 ctcttagatt cgggcgcgga aatatcagtt atgaattcct tggatattgt aaataaatat 2460 aattttaaac tacataaaag taagctaaaa gttgatacgg caggtgatag tagctacgaa 2520 tgcgtaggtt ttgtaaattt accgatcact tataataata tcaccaaagt tattcggatt 2580 tacgtggtct cagagttttc aaaaaagtta attcttggaa tcaatttttg gaaggccttt 2640 gacattgtgc ctgcaattaa aggcgaaagc aatcagattc gcactactca tttgtcagaa 2700 attcaatcaa atgattcatt ggaattttca gaaaatcact tttcgagtga gaatcaatgc 2760 attgccttaa aattcaacac ctttgtgggg gcactggaag aaaccatgga taatgaagag 2820 gacctaactc tcgatattcc gacattagat ttcgagtccg tgccaataga cactattgaa 2880 acggaacatt cgctcactga acaagaccga gcaaaattac tagacgcgat aggccaattt 2940 gaagtttctg aagaaaacag aatagggaaa acagctttgc tagaacatga gattaagtta 3000 atcccaggtg tagaattgaa ggcatctcca atgtacagat gctctccata cgtacaaaaa 3060 ttcgtagatg aggaaataga aaggatgaag ctaatggatg ttatcgaacc atgcgagtcc 3120 gaatacgcaa gtccgttgct tccagtaaaa aaacctaatg gtaaatttag agtatgcctc 3180 gattcgcgtc gagttaattc agcgacaaga aatgacgcat atcctatgcc tcatctacat 3240 gagattttgc acagaataga acacgccaaa tattttagtg tgatcgactt aaaggaagct 3300 tattggcaaa ttcctctaac cgagaactct cggggtctca cggcttttag gacgaaaatg 3360 ggccttttta gatttaaggt aatgcctttt ggattaaaag gagccccatt caccatggcg 3420 aaattaatgg acctagcatt aggctgcgat ttacaaccat atgtgtgggt ctatttagat 3480 gatatcatca tcgctaccaa aacacttagc catcatattc aactaataaa agaggtagca 3540 aaacgattga aacaagctaa tttaacgatt agcttggcta agtcgaaatt ttgcagaaaa 3600 agcgttagat accttggtta tattgtttct gagcagggaa tttccattga tatggaaaaa 3660 gtgcgaccga tattggatta tcagccgcca aagacggtaa aggatataag gcgactatta 3720 ggattagcga atttctatca gaaattcatt aaaaactata gcgcgattac agcgcctata 3780 acagagctat taaaaaaaga gaataaaaag tttttatgga ctgcagaggc agagaaagca 3840 ctggagaatc tcaaagaagc tcttataagt ccaccaattt tagctaaccc agatttcaat 3900 gaactattta cgatcgaatc tgacgcatct gatctggcag taggggcagt tttgacccaa 3960 attcaaaacg gggaaaaacg agttatagcc tatttcagta agaaactgaa ctctaccaga 4020 agaaaatatg cagctgttga aaaagagtgt ttggcggttt tgtgggctat tgaggcattt 4080 aggaattacg tagaaggaac acattttaaa gttataacag atgctcgtag tttagtatgg 4140 ctatctaagg taagtgcaga gaaaggatcg gctaaattgt gcaggtgggc tttaaaactg 4200 caacaattcg atttcagtat tgaatatcga aaagggcgcg acaatataac agctgattgc 4260 ctatcaagat ctctcaacac catttcaacg aaatgggagg acttggagca cgatagccta 4320 aagaaaaaaa tctcagaata cccaaatgag tttaaagatt ttaaagtaat cgataataac 4380 atttacaaat atgttaaaga tgactctaaa ataacagaca atcgatttaa ttggaaaatt 4440 gtgcctaaaa agatcagaga tttcaactaa taaaaactac ccatgaggaa gcacacctag 4500 gatttgaaaa aacgttagaa aaaattaaag aaaaatatgt ctggccttta atgtacacgg 4560 aagtgaaagc attttgtaat tcatgtttgg catgtaaaac gtctaaatct gataatcaaa 4620 atcatgtccc accaatggga aaacaaaaat tagcaacaca accttggcaa atttagctat 4680 tgattatgtt ggcccatttc caagggcgaa aaagacagga aatacatgcc tccttgtcat 4740 cactgacatt ttttctaaat ttgttataat acagccgtta aaagaagcca aagccaaaca 4800 actagttttc ttcctagaaa atatgatatt tttacttttc ggagtgcccg aaatagtttt 4860 atccgataat ggagcccaat ttttgtctaa ggacttcaaa agccttttgg aaaaatacag 4920 tgtgactcac tggttaacgc ctgtgtattt tccacaagta aataatgctg aaaggaccaa 4980 ccgtgtgatt acttcgtcga ttagggctct aataaaaaag gaacaggatc actgggacga 5040 aaatatttac aagattgcaa atgcgattaa taatgcgatt cattcatcca gtggattttc 5100 gccgtacttt ataaattttg gtaagaatca aataagttca ggtgaagaat atcaaaatct 5160 gagagactta aacaatgata ccacaccctc agggaaagag cattctgaat caatgaaaca 5220 aatatacgaa aaggttagaa ataatttaaa aatagcgtat gataaatatt caaaatatta 5280 taatttaaga actgggaaat taattgaatt tcgggatgga gacaaagtgt taaggaaaaa 5340 tcactttttg tcagataaat caaaagcatt caatgccaaa ttagctccaa aattttcgga 5400 ggcaataata agaaagaaag taggggaagt ttgttacatt ttagaagact taaatgggaa 5460 atcgttagga ctataccatg tatcccaatt gaaaaagtta taaaagtaaa aagtaaaaag 5520 gaaccagcta tgtacagaaa caaagcaagc aaattgcaaa tgctgatgca taaaaatacg 5580 tcaaatcgga aattgaacaa aaaaaaaaaa aagctaaacg agcaaataag gaaaacaaat 5640 cttaattatt tacaggattt ttacacatcg caacgtacta cgaaatataa aaaggagttt 5700 ttgaattcga attatcttaa aagagcataa tagtaaaaaa aaaaaaaaaa ataacaactc 5760 aagctgaaaa gcgaatgaaa ataaaaaaga ttacaagcta tgtacacaaa caaggttgca 5820 ggaaaataac tgaagcacaa aaatacgcaa agtcggtaat aggagttttg gtagggaagg 5880 acaaaaaatt aaaacgatga agaaaacgga ataaaaacta gcgacttacc aaaaaccaca 5940 atcattgcat gcctgaggat caagtgtcct acgcgtttga ttcctccaat gggtgtgcta 6000 aaataagaaa ttattttgta aatatgtaaa tagtaggttt aaataattat agccagagct 6060 ttgaatcttt tagcaaaaag acatcaaaag ttaaaaaaaa aaaatcaaaa caaaataaag 6120 gttggctcag cgttagattc acaagaatct agcaaccgag accatccaat acattggagt 6180 taattattca aaacaaattt agttttaggt caaagaatac actaacaagt tcaaaaggcc 6240 atgtatataa actaaatatt taaagtaata aaaaaataga atatttatta ttaaggaata 6300 cggtacttac gtttttggtg aagtaaacac aggttttcat ccacgatcat tatccacaat 6360 agcccggtac acacagttca ttggcattat agatttcata tccaactgct cgcagtaaca 6420 taccatcgag gacaatagca atgtttgatt tgatagccag caatagccta ggtacccaaa 6480 agcaataata caaaacccga gagatgtgtg cggtaataca cgtttttttc agagagttga 6540 tgttgtttcc atatcgtgat atggtgtaga atcggttaga gtagatacat tttttttaat 6600 aaccaatacg aatacgcatt attcaattcg tgattcgttt ttggattgtt tggaatatgt 6660 attcatagta gtttgggttc agtatccaat atgtgtattg gagtaaataa tttgacaaag 6720 gagtttccaa taagtctatt ggaggttttg ggatacaact cagttagtgt cgtgtaaatg 6780 atccggaaag cacggatctc gggtgcgcag tttcggttca aaatgtcggt ttctcgccaa 6840 aataagtttc ggttattagt ttactacgtc ggtatcgtca ttaagaatga attcggttag 6900 tagaaaagcg tttttatgtg ctagataaga taaagtgcta agtagataaa gtaagaaaaa 6960 aaaaattgcg aagcaatttt tttgggataa gtgggggatg a 7001 // ID Polinton-3_NVi repbase; DNA; INV; 12588 BP. XX AC . XX DT 14-APR-2009 (Rel. 14.04, Created) DT 14-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-12588 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 793-793 (2009). XX DR [1] (Consensus) XX CC The consensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS join(4443..3613,3664..3245) FT /product="Polinton-3_NVi_1p" FT /translation="MITHNGSGLVNKLINKLPIELHLPGYQYCGPGTKLSK FT RLARGDLGINPLDAACKEHDIAYSENPNNIQARNVADRVLAEKAWQRVTSS FT DANLREKAAGFAVSNIMKVKSKLGMGLSGKRRKMQKKRKLLALKNVIKSTA FT KSFIPSNDSHSSIKSALMVARKHIKNVGGKTNIKIPRVLPVPKVGGFLPAL FT IPIFAGLSAVGSLAGGAAGITQAINRANAAKQQIEEQKRHNKKIEMLALGK FT GLYLKPHKTGYGICLKQRKRKRMQQKRSQRKKTSLTAETYATKTFAKKKNV FT VDVIRLPHRPLTDYDLLKYARLLKIPNFRGVFMRNDLPANKSHYRETAIIN FT LDDKKGAGTHWVAYRKNGNNVFYFDSFGDLQPPEDLMKYLKIDKVKYNYKR FT YQEFDTFVLWSPLPKIFMQSTR*" FT CDS join(10800..10399,10456..8579,8661..7915) FT /product="Polinton-3_NVi_3p" FT /translation="MLKILIINVPSKKHPNRVTNYFPYEDELNMSGIEFPV FT SLKKIPKFEKQNNGISVNVFGLDEKSIVPLYLTSEKKENHVNLLLLTEVET FT DSLEENISHFVYIKNLSRLVSDNLARRKKKRFICDRCLNFSLVKKIKEEEV FT YMRPMLKFFFSEKNLAMHEEHCKLLNDCKVKLPTEGRDYVEFKNYGFKEKV FT PFTIYADCECLLKPLESKDGSESNTTPMQKHEIYSIGYYFKCAYNDTLSVY FT RSYRGVEAARWFSEELRIIEMSVTDLYKNVIPMTPLTSDEIISYNTATVCH FT ICEKPFLDGNVKVQDHCHLSGKYRGAAHAGCNLNYQDSRIIPVIFHNLSGY FT DAHFIISDVSNHFPGQVYLLPITKEKYISFTKSVENSDIKFKFIDSFRFMP FT SSLEKLSSYLEEYKIVDREFSSLEKEKINLLTRKGVLPYEYITSWDKLDET FT TLPDLDKFFSTLNDSGISMEDYEHAQNVWHSFNIQTLGEYSDLYMKTDILL FT LADVFENFREHSIKAYGLDPAHSTPGFSWDAMLKHSGIRLKLLTDIDMVLF FT IERGIRGGLSQCSHRYATANNKYMLQDYKTEEETSFLIYLDANNLYGYGES FT QYLPYDDFEWITDCENFDFFSVADNAPNGYILEVDLEYPYELHDEHNDLPF FT YPEHLTPPGSKQNKLLATLQHKKNYVIDYRALKQVIAHGLRLRKIHRVLKF FT KQSPWLKSYIDLNSSMRAQPKNEFEKYLFKLMKEKSERKASNKMGWKIWSP FT GIYIIPKFSKKRVNVKLLTKWDGRYGAQAYISYPNFHSSSIFNENLVAIQL FT RKTKVIINKPIYVGLVVLDLSKRLMYDFHYNYMRKMYKDNCKLLYTDTDSF FT IYAIKCDDFYEDMKSNIHRFDTSDYPADNIFNMPRVNKKIVGLMKDECNGK FT ILTEFVGLRSKMYATRINGQDCVKKIKGIKGSVVKKSIKFDDYVDCLRNTC FT IKHRKQHVIRSRLHNVETIRQNKIALSPYDDKRRIQDDNIQTLAWGHYTMK FT GT*" FT CDS join(4539..5234,5244..5660,5611..6228,6219..6638, FT 6642..7565) FT /product="Polinton-3_NVi_2p" FT /translation="MDLQEQSLKLPVINFDTLTESNPKRYKRHGNLLPDSI FT RAVFCGPSNCGKTNALLSLIIHPNGLKFENIYVYSKSLNQSKYKFLEKLIE FT PLEGIQYLPFGEHDLVVSPDEARANSIFVFDDIACEKQDNVKAFYCMGRHK FT NVDCFYLCQSYAQVPKHLVRDNVNLIILFRQDDMNLKHVYNDHVNTDMSYQ FT RFKDLCSSCWNSNHGFLVIDKDSKINDGRYRKGFDKFIINMEYSTVRLESD FT KMADFNQQIAVLREVEKARNAIKRKHTLLKSQKIDFEKAVNDSVKPIVDPL FT EKLIHVTETTNHMKEKSEPKVETGRKKKLKIEKVEKKDDIAEDSEGDIEVD FT TEDDIKFESALSESEEEDLEETIKNKKKVSLKKKILKRLLKIKKNEIEDEY FT IDLLLNGQMQTLDTVYGVRRMRNDKLKVGDSFIDFDDNFITIGRKKYKKRP FT GLIQLLFMKNPDQSLITAEDMKDYDEILNDTNAHRKYYLPHEAIRVQDSKK FT YTHYISKSMKEVKLKKGKGLSSISYKVARKNAYMDYIYWDDPNELVDRLRL FT LTAETAAGNQSHINEIQSIIEELREAEYIYIYLLICRVELSLSIKMSIDVF FT GRQLIRTDTVRGPPGQGFKLTADGQFDLENHKLCNVADATEANDVITLKVM FT KENVKLESEKIIHDFNQILNDKKLDTIGNDVSEIKKSIIELYDKIKVIEFF FT INENSSTTTTTTRAHFMIKMKESLVEEIHKPARRNYPRRSFKIRDIDETWQ FT ADLVEMLPYANENKGFKYMLTVIDIFSKFAWTVPIKQKSGKYVTKAMKSIL FT VKNRVPRNLQTDRGKEFYNKEFQQLMKNFKINLYSTYSNMKASICERFNRT FT LKSKMWKRFSLRGNYKWIDILQDITLQYNKSKHRTIGMKPIDVNSTHVKEI FT LTRLAAQKKLRQHKEAKFKVNDKVRVSKSKQLFEKGYTPNWSTEIFTVYKV FT SNTVPRTYHLKDYQDKPIAGGFYEEELLKTRYPDIYLVEKVLKKRGNKLYV FT KWLGFDNSHNSWINKNDV*" XX SQ Sequence 12588 BP; 4059 A; 2153 C; 2182 G; 4194 T; 0 other; agtagtatgg gctgcgcggc tacggcgcca ccgaataccg acagtcggta gccggcaaat 60 ctactttacc gacaggtaaa tctgtcatac cgacgcgggc agcctgcatt tgaatttctt 120 gccgatttac gacgccagct tgcattcgat gttggtaaga tatatttttg aaacatctga 180 tgacgtggta cggatgtgat ggcaatcccc ctcccgtgtt aaatatagct cttcaccaca 240 ttatacagtg tgccgagctc tccgcattgt atctatactc tcttctcccc gcagtccgat 300 gcgggctgca acgatctgcc ttaccgacgg gtgaatctgt cataccgacg tccgacgcgg 360 gcagccggaa tttaaattcg atgcatgtat ggaatttaaa aattcgttct tatctgtttt 420 accgacgtcc cacgcggcag cctgcattcg aatttcttac cgatttaatc ctagtattat 480 aatttattac tttatgaatc atttattatt tgttaaagta tacgtataga aagagtttaa 540 ataaatacat tatgaatttt ttctttcttt ttctatatca atgtagaatt aagacttatc 600 caacttataa gaattcggaa gagatgacag aaaggaataa taaataaatg tatgcaaaat 660 ataagattac tttttgttta ttaaaaaatt ataacttaat acaaacatta ttcttaacaa 720 tatgtatgat attatctcag aacttcttgt aaccttgctg ttctggcatg gaattcatgt 780 tgctggaata cttgctgaaa taatttttcc attctttgct tttatcagca ggcttagagc 840 aaggctcacc tcgatatgta ttacattgtt tgccatcttg tacttttcta gcgttcttat 900 cttcaaccag aggtgtatat gaaactttat ttcgactcat tttggagaaa gtcgtaacca 960 cactaagatc ttatggttat ttgtacagct ttatatagag aaaaaggtgg tggtggagga 1020 aaatttttta aattaaataa tttttttaac atctccactt ataggattat acttgacaat 1080 gcgatcatgt ataattaaac aatatgctgt tgtatcagcg ggtatatttt cttttgtttc 1140 aaactcgaga cgtacatcca cagctgcttg tttcaatgat tcattctgct tggaacagtc 1200 aataactata agcggtgcat atgtgatgaa ttcttccttg cttaaaagag gattcatatt 1260 ttcatgataa taagcacttt gaaattgtgc atacatatca tacaacattg caaattgatt 1320 attatcaata tcaagattaa gattattgta tggataattc tgcgagttaa gaaaaagttt 1380 tacattgcta atcttacaat gatcaaaatg gctacaattt ttactattat gtccttttct 1440 agcagtttga aatgctagga taataaagcg tggtttttcc aactggtttg atgttttgac 1500 ggtccataca tgctgagtag tacgtggaag tactggatat tcatacaatt cccaagaccg 1560 aaaactcata gcaatgggta tatctttttg aatgtagtta taaagttgta ttttcctttt 1620 gtctgataaa actacataag gcataagcca ctctatacga gatatggtta atttaaaatc 1680 atcatacgta gctctatttt caaccattat tgcctcctgc attcatatca ctccttgatc 1740 ttgtgagaat aagttcatgt ttagcattta caataatttt tcgatagtct tcagcaaaac 1800 cgaaaatcat tgatagaggt atagtgatgt caaaatttcc attgttatca gttattaccc 1860 ctcttctgaa acattagtcc atccagaatt ctcattatat ttattctgac ttggtgcgaa 1920 ggaaacataa cctttcattg tactagttag gccgacattt ttacatcgat caatttcaat 1980 cgcattcatt tcatatcgca tttcctcgaa cagatgacat atagcattgt tcacaaactt 2040 ggtatgttgc agcgcagtac catctggttt tgttaattta ccacaaatat gaagtaaact 2100 tttggaaggt agcaaacaaa tcttgatgtt gaatagaaat acgaatttca tcactattat 2160 tgaacgatgc tgatgtatag ggttgatgtg catgaatctc ataatgagct acagactcat 2220 aaaaaatgat tggtgattgt atatcaaggc ccccctccct ccattttttt caggtataca 2280 gcagcaaaca aaaacacaat ttattttttg ctccttttta cttttctgaa accgtttcct 2340 ctttttacct tgagacctag actccgtaga aactgaatgt tctgacgtgt taactgtttg 2400 acgattcgtt gtcgactgac tttctgacga catactggat tattaatata atttttttag 2460 gcttggtatt gaaaacgatt cccattatta aacattcttc aaatgtagac gaattgttat 2520 aacttctcca cgtagattta ccaaatcacc atcttgatct acgatccgca attgtaagtg 2580 atcaataatt ttaactgtta tcggtaaata tataatttgt tttggaactt caattatttt 2640 gtaacctggt ggaactgcag ggaaaaactc gtgaatggta tgaacttgat tttgattaat 2700 gtatgctcca gatgtgatat tacactctac tcgtagggca tttactttta gaatggctac 2760 aggcaagtca gagttgtgag ttatataggg tttcaacagt cgtggagtga aacctagaag 2820 tcgagctatc gagtctttgg ggcgaaagtc tacttcatgg ctacatgtaa tgacacttct 2880 cagtgtatta ttattaggtt taataattag cgtgatattt tttaggctga gaatttcctg 2940 cagagtacgt tcaatatctt ctatttcata gctaccagtt ggtaattcaa tgatttcttg 3000 tccaacataa aatttattgc ttttgaagtc cacattaggt atggagttga aagttagtaa 3060 ttcaaccaat ccgagaacgt acattttatc tgaatcgagt tcaattggtg gaaagtattg 3120 agtttccaga acagacgaag tgcctgacag gctcaaagtt agcgattcaa tcattttgaa 3180 tgtaagtaag atactgatag tttatcgatc taatcacttc ttatatagac gaaaattgtc 3240 tgtgtcaccg agttgattgc ataaaaattt taggcagagg tgaccacaaa acaaaagtat 3300 caaattcttg atatctttta taattatatt taactttatc aattttcaag tatttcataa 3360 gatcttcagg tggttgtaag tcaccaaaac tgtcgaagta gaaaacatta ttgccatttt 3420 tacgataagc tacccagtga gtaccagctc cttttttatc atccagattt ataattgcag 3480 tttctcgata atgactttta tttgcaggta aatcatttct catgaagact cctcgaaaat 3540 ttggaatttt caataagcgc gcgtatttca gtaaatcata gtcagtcaat ggacgatgtg 3600 gtaaacgtat tacgtcaacg acgttttttt tctttgcgaa cgtttttgtt gcatacgttt 3660 ccgcttacgt tgtttcaaac aaatgccata acctgtttta tgaggtttta agtaaagtcc 3720 ctttccgagt gctaacattt ctattttttt attatgtcgt ttttgttctt caatttgctg 3780 cttagcagca tttgccctat taatagcttg agtgatacct gcagctcctc cagcaagtga 3840 gcctacagca ctcaagccag caaaaatagg aattagagct ggaagaaaac cacctacctt 3900 aggcacaggt agtacacgtg gaatcttgat atttgttttt ccacctacat tttttatatg 3960 cttacgagct accattaaag ctgatttgat tgaactatgt gaatcattgc taggaatgaa 4020 tgattttgcc gtagacttga tcacattttt caatgctaac aattttcttt tcttttgcat 4080 ctttcttctc tttccactga gtcccatacc taactttgat tttactttca tgatgtttga 4140 aacagcaaaa ccagctgctt tttcacgtaa attagcgtca ctggaggtca ctcgttgcca 4200 cgctttttcc gctaaaactc tatcagcaac gtttcttgcc tgaatattgt taggattctc 4260 tgagtaagct atatcatgtt ctttacatgc agcgtctaga ggatttattc caagatctcc 4320 acgtgctaac ctcttagaca gtttcgtgcc tggaccacag tactgataac caggcaaatg 4380 taattctata ggtagcttat ttattaactt atttacaaga cccgatccgt tatgggtgat 4440 catcactctc ggaaagtttt ctgtaaactg acgtaataac tataaatgca tgcttttata 4500 cttatatcca gtcagttgga attgatcttt aaaagaagat ggatctgcaa gaacaatcac 4560 taaaattacc ggtaatcaat tttgatacac tcactgaatc aaatcctaaa cgttataaac 4620 gacatggaaa cttacttccg gatagcattc gagctgtttt ttgcggacca tctaattgtg 4680 gaaaaacaaa tgctttatta tctctcataa tacacccgaa tggtctcaaa tttgaaaata 4740 tatacgtata ttcaaagtct ctaaatcagt caaaatataa atttctcgaa aagcttattg 4800 aaccactaga aggtatacaa tacttacctt ttggcgagca cgaccttgtt gtatcacctg 4860 atgaagctcg agctaattct atatttgtct ttgacgatat tgcatgtgaa aaacaagata 4920 atgtcaaagc gttctactgc atgggcagac ataaaaatgt tgactgtttc tacttatgtc 4980 aatcgtatgc acaagtacca aaacatttag ttcgtgataa cgttaattta ataatacttt 5040 ttcgacaaga tgatatgaat ttgaaacatg tatataatga tcatgtgaat acagacatgt 5100 catatcaaag atttaaagat ttatgttcat catgttggaa tagcaatcac ggctttcttg 5160 taattgacaa agattcaaaa attaatgatg gccgttacag aaaaggtttc gataaattta 5220 taataaatat ggaataagtt tagtattcga ctgtgagact tgaaagtgac aagatggctg 5280 attttaatca acagattgct gtgctgcgag aagtagagaa ggctcgtaat gctataaaac 5340 gaaaacatac tttattgaaa agtcagaaaa tagattttga gaaagcagta aatgattctg 5400 ttaagcctat tgttgatcca ctggaaaagt tgatacatgt gactgagaca actaatcaca 5460 tgaaagagaa aagtgaacct aaagtagaaa ctgggagaaa gaaaaaatta aaaatagaaa 5520 aagtggagaa aaaagatgat attgcagaag atagtgaagg tgatattgaa gttgatactg 5580 aagatgatat taagtttgag tcagctttaa gtgagtctga agaagaagat cttgaagaga 5640 ctattaaaaa taaaaaaaaa tgagatagaa gacgagtaca tagacttatt gttaaatggt 5700 caaatgcaaa ctttagatac agtatatggt gtacgaagaa tgcgtaacga taaactcaaa 5760 gtcggtgata gtttcataga ttttgatgat aatttcatta caattggtcg aaaaaagtat 5820 aaaaagaggc ctggtctgat tcaattgttg ttcatgaaaa atcctgatca aagtttaatt 5880 acggctgagg atatgaaaga ttatgatgaa attttaaatg ataccaacgc acatagaaaa 5940 tattatcttc cacatgaagc tatccgtgtg caagacagta aaaagtatac tcattacata 6000 agcaagtcta tgaaggaagt caagcttaaa aaaggtaaag gcttatcatc catatcgtat 6060 aaagttgctc gtaaaaacgc atatatggac tacatttatt gggatgatcc aaacgagctt 6120 gttgatcgtt taagattgct tactgctgaa acagcagctg gaaatcagag tcatataaat 6180 gaaatacaat ccattatcga agaactccgt gaagctgaat atatttatta atatgtagag 6240 tagagctatc gctcagtatc aaaatgagta tcgatgtatt cggacgacaa ctgatccgta 6300 ctgacacagt tcgcggtcca ccaggtcagg gtttcaagct tactgcagac ggacaatttg 6360 atttggaaaa tcataaattg tgtaacgtag ctgatgctac ggaagcaaac gatgtgataa 6420 cgttaaaagt catgaaagaa aatgttaaac tggaatctga aaaaattatt catgatttta 6480 atcaaatttt aaacgataaa aagttagata caattgggaa tgatgtatct gaaataaaaa 6540 aatcaatcat cgagttatat gataaaataa aagtgattga attttttatc aatgaaaact 6600 catctactac taccactact actcgtgcac attttatgta aataaaaatg aaagaatcgt 6660 tagtggaaga aatccataag cctgcaaggc gtaactatcc tcgtcgctca tttaaaataa 6720 gagatattga tgaaacatgg caagcagatc tcgttgaaat gttaccgtac gcaaatgaaa 6780 ataaaggctt caagtatatg ctcactgtta tagatatttt ttccaaattt gcatggactg 6840 tgccaataaa gcaaaaaagt ggaaaatatg taactaaagc tatgaagtcg attctagtaa 6900 aaaatagagt accacgaaat ttgcagacag atcgtggtaa agaattttac aacaaggaat 6960 ttcaacaatt aatgaaaaat tttaaaataa acttgtattc gacttatagc aatatgaaag 7020 cttcgatatg tgaacgcttt aatcgtaccc ttaaaagtaa aatgtggaag cgttttagtt 7080 tgcgtggtaa ttataagtgg atagatatac tgcaagatat aaccttgcag tacaataaat 7140 ctaagcatcg gacaattggt atgaaaccta tagatgtaaa ttccacacat gttaaagaaa 7200 ttcttactcg attagctgct caaaaaaaat tacgtcaaca taaggaagct aaattcaaag 7260 taaatgataa agttcgagtt agtaaaagta agcaactgtt tgaaaaagga tatacaccta 7320 actggtcaac tgaaatcttt actgtatata aagtatcaaa tactgttcca cggacttacc 7380 atttgaagga ttatcaagat aaaccaatag caggtggatt ttatgaagaa gaacttctca 7440 agactagata tccagatatt tatttagttg aaaaagtttt aaagaaacgt ggaaataagt 7500 tatatgttaa atggcttggt tttgataact ctcataatag ttggataaac aaaaatgacg 7560 tgtaactatg gatataagta aaataaaaaa attaataaaa attttcttgc ttatttattt 7620 atttttatac caatttcttt acattttctt aagagtactg gaataagaat tgtatgtgtt 7680 ttctcaatcc tcattttgaa acgttctcga tcacaggcag cttgctccca ctctccacaa 7740 cgagccttat gataggcaaa tttccatgca tacattggat gaactgtgac atcgtccgaa 7800 aaatgtacgc gttttttctc ctctttgctt ctcacgcttc tgaaattaaa tgttatgcat 7860 gtttaaaaaa ctagaaaata aaaataatta aactgatctg ataaataatc atacttacgt 7920 acctttcatc gtataatgac cccatgctaa agtctgtata ttatcgtctt gaatgcgtct 7980 tttatcgtcg tatggactta gagcgatttt gttctgtctt atggtttcca cattatgcag 8040 tcgtgaacgt ataacgtgct gttttcgatg tttgatgcat gtgtttcgaa gacaatctac 8100 ataatcgtca aatttaatgc tcttttttac aacggaaccc ttgattccct tgattttttt 8160 cacacaatct tgaccattga tccgagtggc atacatttta cttctcaagc cgacaaactc 8220 agtaaggatc ttgccattgc actcgtcctt cataagacca acaatttttt tattgacacg 8280 aggcatgtta aatatattgt cagctggata atctgatgta tcaaacctat gtatgtttga 8340 tttcatgtct tcgtaaaaat catcgcactt tatagcgtat ataaaactgt cggtgtcagt 8400 gtacagaagt ttacaattgt ctttatacat ttttcgcata taattgtaat gaaaatcgta 8460 catgagtctc ttagaaagat ctaagactac aagtcccacg taaataggtt tattaattat 8520 aactttagtc ttgcgaagct gaatagcaac caaattttca ttaaaaatag aactgctatg 8580 aaaatttggg tatgatatat atgcctgggc tccatatctt ccatcccatt ttgttagaag 8640 ctttacgttc actcttttct ttcattaatt tgaacaaata tttttcaaat tcatttttag 8700 gttgtgctct catcgagcta ttcaggtcta tgtaagactt taaccatgga gattgcttga 8760 attttaaaac tcgatgtatt tttctcagac gaaggccatg agcaatgact tgtttcaaag 8820 ctctataatc aatgacataa tttttcttat gctgaagtgt agccaatagt ttgttttgct 8880 tagatccagg aggtgttaga tgctctgggt agaagggtaa atcattgtgc tcatcgtgca 8940 attcataagg atactctaaa tccacttcta aaatataacc gtttggagca ttatctgcta 9000 ctgagaaaaa atcgaaattc tcgcaatcag ttatccactc aaagtcatcg tatggaagat 9060 actgcgattc tccataaccg taaaggttgt tggcatctaa gtaaattaaa aaactggttt 9120 cttcttcagt cttataatcc tgcagcatgt atttattgtt agcagtcgca tatcgatgtg 9180 agcattgact taaacctcct cgaatacctc gctctatgaa taaaaccatg tcaatgtctg 9240 tcaatagttt taatctaata cctgaatgtt taagcatagc atcccaagaa aatcctggtg 9300 tagaatgtgc aggatctaag ccatatgcct ttatactatg ttctcgaaaa ttttcaaaaa 9360 catctgcgag taataatata tcagttttca tgtataaatc agaatattcg cccaatgtct 9420 gtatgttaaa cgaatgccat acattttgcg catgttcata gtcttccatg cttattccag 9480 aatcattgag tgtactgaaa aatttgtcta gatcaggtaa agtcgtttca tccaatttat 9540 cccacgaagt aatgtactca tatgggagta caccttttct cgtcagcaga ttaatttttt 9600 ctttttctaa gcttgaaaat tctctatcaa cgatcttata ttcttcaaga tatgaggata 9660 atttttcgag cgatgaaggc atgaaacgaa aagaatcaat gaatttgaat ttaatatcac 9720 tattctccac actcttcgtg aacgaaatat acttctcttt tgttatagga agtaaataga 9780 cctgcccagg aaaatgattt gatacgtctg agattataaa atgagcatca taaccactta 9840 agttgtgaaa tattacaggt ataattctag aatcttgata attgagattg caaccagcat 9900 gtgcagctcc acgatatttt cctgaaaggt gacaatgatc ttgaactttt acattaccgt 9960 ccaaaaatgg tttttcacaa atatgacaga cagtagcagt attataactg attatttcgt 10020 ccgacgtgag tggagtcata ggtatcacat ttttatataa atcggtgaca ctcatttcaa 10080 taattcgcag ctcttcagaa aaccagcgag cggcttcaac acccctgtaa gatctgtata 10140 cagatagagt atcattgtaa gcacacttaa aataatagcc aatgctataa atttcatgct 10200 tttgcatagg cgttgtattg ctctcactgc cgtctttact ttcaagtggt tttaatagac 10260 attcacaatc agcataaatt gtaaatggca ctttttcttt aaaaccatag tttttaaact 10320 ctacgtagtc tctgccctct gtagggagtt taactttaca atcgtttagg agtttacagt 10380 gttcttcatg cattgctaaa tttttttcac taaagaaaaa tttaagcatc ggtcgcatat 10440 aaacctcttc ttctttctac gtgcaagatt atcacttact aagcgtgata gattttttat 10500 atacacaaag tgagatatgt tctcttccaa agagtcagtc tccacttcag tgagtaaaag 10560 aagatttacg tggttttctt ttttctctga tgtcagatag agaggtacaa tactcttctc 10620 atcaagtcca aaaacattta ctgaaattcc attgttttgc ttctcaaact ttggaatttt 10680 tttcagtgaa actggaaatt caattccact catattaagt tcatcctcgt atggaaaata 10740 atttgttact cgatttggat gtttttttga tggaacattg attatcagaa tttttaacat 10800 ttatgcatgc ttctttttta actaaaaatt ctggcagagg tatatacgag ctaccagtac 10860 gtattggatt aaatacattc atattaattt gtatattgac aattgaatgt aagtccaacc 10920 agaatcttgc tcttcaaact cctctatttt agtgagaaga ggttccatga tgtgatcaat 10980 aaaccactgt ttaactatag atgctgcaaa aatttctgta ttttttgtat tgaaatgctt 11040 tgtttcaata cattctccct cttcatcacc atccacattt ttcacaattt ttttaaattt 11100 acatgataag actgaattaa ttttctatct caatatgctg aatattagct actatactgg 11160 tacgcatcct attattaaat gcactatcta catcaatcca tcttacacgg gcatttatct 11220 tattttgcac aagcccacca ccggattttg tcaatctctc atgtaatttt atgtaaaaac 11280 attcaatttt accaatcata tacatattga tcgtttttag agacagagaa gagtttgtct 11340 tttaaggcaa tacgtaattg tttaagacat aaatcacatt gacttttcca acgatttatc 11400 acaatcgtat cagtagtagg tctggataat aaaagctcct cagcttcctc cagatgatgc 11460 actaatgatc gacaattatc caaaagagta gccatgtcac ttgcttgttg acaactaaaa 11520 aattaagcat gctgctaaaa atatataacc aggggagatc aaagcattta taaatattaa 11580 agaaggggta aaatcataga gattttttga taaatcaact tagtatgaat ttaaataaaa 11640 tttgtctata tcaattaatg tttttgtatt tgttttttat atttgaattt ttgcattagc 11700 attttatagt gtgtgattta atgtactgat atgtgaatta ttatcgttaa gatatatatc 11760 gtgttactgt aagatacaaa tctcgacacg gcggtgccat gtacagtttt tatatcgtat 11820 aaataggctt tcttaaactt attcgataca gcgaaaatct agaaatatat tgtcataaga 11880 atagcataaa acgtttgcaa tataatgttg gtttgagtgg cgactcccct ccatacgctc 11940 cgcacatcat acatttgaat ttgcaggtca aacatagcgc gcgcctgacc aagagtagaa 12000 gtatatgtat ggctatactc ggcacgcata gaactcggca ctatgtgcgc gctgtggcga 12060 tgagagtata tctatatcta tactcagcgc gcagtatagc ccgcgacgga ctgctgcggc 12120 gagaagagag tataggtata cacggagagc tcgacactgt atgcgtacag cgtgcgctgt 12180 atagtaaaga gagtatatct atatcgatac tcagtgcgta catcggactg aaagctcgac 12240 tgccagcatg cgctgttggt atgagagaat atctatatct aactcagcgc gttgcagcgc 12300 gtatcggact gtgggaagaa gagagtatag atacgatgcg gagagctcgg caaactgtat 12360 aatgtggtga aaagctgtct ttagcacggg aggaggattg ccatcacata tgcaccgcgt 12420 catcagatgt tttaaaaata tatcttacca acatcgaatg caagctggcg tagtaaatcg 12480 tcaagaaatt caaaggcagg ctgcccgcgt cggtatgaca gatttacctg tcggtaaagt 12540 agatttgccg gcattcggtg gcgccgtagc cgcgcagccc ctactact 12588 // ID Gypsy3-LTR_AP repbase; DNA; INV; 105 BP. XX AC Contig4708; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3AP; KW Gypsy3-I_AP; Gypsy3-LTR_AP. XX NM Gypsy3-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-105 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 442-442 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 105 BP; 30 A; 16 C; 11 G; 48 T; 0 other; tgtagtggga tattccccga acggccacta ttattattat tattattatt attattatta 60 ttattattat tatcatcatt attatttatt tacgcctagc cgaca 105 // ID Proto2-5_CS1 repbase; DNA; INV; 4471 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-5_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-5_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4471 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1560-1560 (2009). XX DR [1] (Consensus) XX CC Proto2-5_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1_SK) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in Proto2 CC elements from all species mentioned above. ORF2 codes for a CC protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 222..1352 FT /product="Proto2-5_CS1_1p" FT /note="ORF1." FT /translation="MTGTSISTPPPEVKLVHSELLCFALTHAHNSTLTMLA FT ECLNEFYRPEEVLHARETLWRECHDLLKDARKPRRSQTPVDRQTMRPFVDD FT VCSWIGMLVNNQRCENMNVQFYALNLRRTPPCPPEEINIFSLAARVAALER FT TRKSCTQRDAPLAVQVHGQSTWDSIPVPEQPRNGEAPSREPPSTAGLHTSK FT PVAVTAESAGGAPWSTVVKRKTIKKRDEARKQLRDAAKDLRVVVGTEKGTV FT LKGCRPTKQLFVNRLERCSTDTVKKYMISKGVTPRDVHCTSKESWLNASFR FT LTVVATDMDRVFDAHFWPVGVRCREWLPNASNKRRASVSTCDDPQDDPPET FT HYSGMEDAEDHDNAADDAAADDAADDVDDNNRHG" FT CDS 1135..4380 FT /product="Proto2-5_CS1_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MPISGRSEYVVVNGCQMHQINVGPQFPPVMTHKMTPL FT RRITLEWKTPKIMITPLMTPPPMTPLMMWTTITAMDSCTKLRLITWNSRGH FT RMDRMEYIQTLMCRCDVLFIQEHWFLDADICKISEHVDNVEVFGVSGMDDS FT RLISGRPFGGCAIVYSKSLRCDVSIIKTESRRLCAAKLSLPDGNNFLLCNV FT YMPCDQLTNQRDVFLSTLCEIDSILSATELNGGIVGGDFNTEMSRLSSHHT FT RDLVDFCSSNGLDMCCDLDVCQVDYTYENEASRARSVLDHFIVSENISADV FT ALYECLHEGDNPSDHCPVMMHVNVNVEHLPTDRSPILRSGLPWRTVSPRVL FT GDYGAALHVALTSIVVPNAALQCDDWSCEQHTHDIQAYYDNIVKACLDAAE FT GTVGRKRKRRGNTPLRRPGWKDHVEPHRERAIFWHALWKSSGSPHDGHVAD FT IRRRTRARYRYAVRYVKRHEEQCRANKLAEALAKDRGRDIWNEVQKISGRK FT TNRVAMVDGMVDDRDIGELFARKTSEVFNAVAYDVDRMDSMLASLESDIRS FT RCARSQCYHNHSVSLADVFKAVKCLKAGKKDGTVDFSSDYIVNGPDSLSIH FT FSLLFSLILKHGRFPRDFALSTVIPLPKNKKKSLKCSSNYRGIALGSVIAK FT LFDVTILHSNSQVLRCSDLQYGFRKNHSTTQCTFVLNETIQYYLNGRSNVH FT VMLLDASKAFDRVEFVKLFEVLRAKGLCPLVCRILVNMYIVQKFRIRWNTC FT QSSWHRASNGVKQGGVISPVLFVNYIDELLIRLSRSGVGCYIGNVFCGSFG FT YADDITLLAPSIHALKTMLNVCSEYAHDFNLLFNTDKSKYIVFNGKRNHPS FT VNITWDGKTLNSSSSDLHLGNLFGPRANDHLITTAIKIFYCRFNVLFRTFH FT FANLDVKYHLFKPHCMSLYGSPLWDYSSTSCDRFYIAWRKSLRLLFKLKSR FT THCHLLPLIVKDHPVDVQLHRRFLKFYFKALFSFNTCTRTCAELALRRSES FT NVSRSLHFISTKYGINTHMLHVTTLPALLSRVRHEPSAGHLTYASALHDFI FT LLRDTHNSPTEHQHFSDIVDYICTM" XX SQ Sequence 4471 BP; 1187 A; 927 C; 1031 G; 1326 T; 0 other; agatataacg agcgaaagta cgaattgact gcgctccttg aatattacag ttcattacat 60 catttgtgag agcagaagga taatttttat accaacgtga ccgtaaagtg ttggacattc 120 ctttgggata tgtcgagtgc atccacggtc tcacataagg acgggagatc cggatttgtg 180 tttttctttt gttcagtgga actactcaca gagcaatcac catgacagga acgtcgattt 240 caaccccgcc ccctgaggtc aagctggttc acagtgagtt gttatgtttc gcgcttactc 300 atgcgcacaa cagtacgctt acaatgctag ctgagtgctt aaatgagttc taccgccccg 360 aagaagtgct ccatgccagg gaaacccttt ggcgagagtg ccatgacctg cttaaagatg 420 ctaggaagcc ccgtcggtcg caaactccgg tcgaccgtca aacaatgagg ccatttgtag 480 atgacgtatg cagctggatt ggcatgctag tcaataacca gagatgtgaa aatatgaatg 540 tacagtttta cgctctcaac ctacgcagaa cgccaccctg tcctccagaa gaaataaaca 600 ttttctcgct tgctgctcgt gttgcagcgt tggagaggac tcgcaaaagt tgtacccaac 660 gagatgctcc tctggctgtt caggttcatg gacaaagcac ctgggatagc attcctgtcc 720 ccgagcaacc tagaaacgga gaggccccgt cacgggaacc tccatctact gctggtttgc 780 atacttccaa gccagttgct gttactgctg agtctgctgg tggtgcacct tggtccactg 840 tggtgaagcg aaagacaatt aaaaagagag atgaagcgag aaagcagttg agagatgctg 900 cgaaagacct acgagtagtc gtgggaacag aaaaaggcac tgtgctaaag ggctgcaggc 960 caacaaagca gttgtttgtt aatcgtttgg agagatgctc aacggacaca gtcaagaagt 1020 acatgattag caagggagtc accccgagag atgtccactg tacatcgaag gagtcgtggc 1080 tgaatgcttc gttcagacta acagtcgttg caactgatat ggaccgtgtc tttgatgccc 1140 atttctggcc ggtcggagta cgttgtcgtg aatggctgcc aaatgcatca aataaacgtc 1200 gggcctcagt ttccacctgt gatgacccac aagatgaccc ccctgagacg cattactctg 1260 gaatggaaga cgccgaagat catgataacg ccgctgatga cgccgccgcc gatgacgccg 1320 ctgatgatgt ggacgacaat aaccgccatg gatagttgta caaaactgag actaattacg 1380 tggaattccc gtggtcatcg catggaccgt atggaataca tacaaacatt gatgtgtcgt 1440 tgtgatgttc tgttcatcca ggagcattgg ttcttagatg cggacatatg taaaatctct 1500 gaacatgtgg acaatgtgga ggtttttggt gtttccggaa tggacgactc tcggttgatt 1560 tccggtagac cgtttggtgg ttgcgcgatt gtttatagca agtctttacg ttgtgatgta 1620 tcaattatta aaaccgaatc caggcgattg tgcgccgcta agctctcact tcctgatggt 1680 aataattttc ttttatgtaa tgtgtatatg ccttgcgatc agctgactaa tcagcgcgat 1740 gttttcttat ccacgctgtg tgaaattgat tcaattttga gtgctactga actgaatggc 1800 ggtattgttg gtggagattt taacactgaa atgagtcgtt tatcgtctca ccacacacgt 1860 gatcttgtcg atttctgttc ttccaacggt ttagacatgt gttgtgatct cgatgtatgt 1920 caggtcgatt acacctacga gaatgaggcc agtagggcac gctctgtgtt agatcatttc 1980 attgtatctg agaatatttc tgctgatgtt gccttgtatg aatgtttgca tgaaggtgat 2040 aacccatctg atcattgtcc tgttatgatg catgttaatg taaacgttga acaccttccg 2100 accgaccgct ccccgatact gagatccggg cttccatggc gaactgtctc ccctcgggtt 2160 ttgggggatt acggagcagc gctgcatgtc gctttaacca gcattgttgt ccccaacgcc 2220 gcactgcaat gcgacgattg gagttgcgag caacatacgc atgatattca ggcatattat 2280 gataacattg taaaagcgtg tctcgatgcc gcagaaggca ctgttggccg taagcgcaag 2340 aggagaggca atactccact gaggcgtcct gggtggaagg accatgttga gccgcaccgc 2400 gaacgggcga ttttctggca tgcgttatgg aaaagcagcg gatccccaca tgatggccat 2460 gtagctgaca ttcgacggcg gacacgggca cgttatcgct acgctgttcg ctatgtcaag 2520 cggcatgaag agcagtgcag agccaacaag ttagctgaag ctttggcaaa ggacagggga 2580 agagatattt ggaacgaggt gcagaaaata tctggaagga aaaccaacag ggttgctatg 2640 gtagacggga tggtcgatga ccgggacata ggggagttgt ttgcgcgaaa aacatccgag 2700 gtgttcaatg ctgttgcata cgatgtcgat agaatggatt ccatgttagc ttcacttgaa 2760 agtgacatac gctcacgatg tgctcgttca cagtgctacc ataatcactc ggtatcactc 2820 gctgatgttt ttaaggctgt caagtgtttg aaagcaggaa agaaggacgg aactgttgat 2880 ttctcgtctg actatattgt taatggtcct gattctctat ctattcactt ttcgttgttg 2940 tttagcttaa ttcttaaaca cggtcgtttt cctcgtgatt ttgcactcag tactgtcatc 3000 ccattaccta agaataaaaa gaaatcatta aaatgttcat ctaattatcg aggaattgca 3060 ttaggtagtg taatagccaa attgtttgat gtcaccattc tccatagcaa tagtcaagta 3120 cttcgctgtt cagatttaca atatgggttt cgtaaaaatc attccacaac gcaatgtaca 3180 tttgtactaa atgaaacaat tcaatattat ctcaatggaa gaagcaatgt gcatgttatg 3240 ctgttggatg ccagcaaagc ttttgacaga gtggagtttg taaaattatt tgaggtcctt 3300 cgagcgaagg gattatgccc acttgtatgc agaatacttg taaatatgta tatagttcaa 3360 aaatttcgaa tccgatggaa tacttgccaa agtagctggc accgagccag caacggtgtc 3420 aaacaaggag gagtaatctc tccggtttta tttgtgaatt atattgatga attgttaatt 3480 agattaagtc gttcaggagt tggctgttat attggtaatg ttttttgtgg tagtttcggc 3540 tacgccgatg acatcacact gttagcaccg tcaattcatg ctctcaaaac tatgttgaat 3600 gtatgttctg aatatgcaca tgatttcaat ttattgttta ataccgataa aagtaaatat 3660 attgttttta atgggaaaag aaatcaccct tctgtaaata ttacttggga cggtaaaact 3720 ttgaacagta gttcatccga tttacatctt ggaaatttgt ttgggccacg tgccaatgat 3780 cacttaatca caactgctat caaaattttt tattgtcgct ttaatgtttt gtttcgtact 3840 tttcactttg ctaatcttga tgttaaatat cacttgttta aaccgcactg catgtcctta 3900 tacggctcgc cgctatggga ctattcaagc acttcttgtg atcgctttta tattgcctgg 3960 agaaaatcac tccgtctttt atttaaacta aagtcacgca cacattgtca cctcttacca 4020 cttattgtaa aagatcaccc tgttgatgtt cagttgcaca ggcgcttttt aaagttttat 4080 tttaaagcct tatttagttt taacacttgc actcgcactt gcgcagaact tgcacttcgc 4140 agaagcgaat caaacgtttc acggagtttg cactttatca gcacaaaata cggcattaat 4200 actcacatgt tacatgtcac tacactccca gcactgctct ctcgagttcg tcatgagccc 4260 agtgctggtc acctcactta tgcctcagca ctacacgatt ttatattatt aagagacacc 4320 cataattcgc caactgagca ccagcacttc tccgatattg tggattatat atgcaccatg 4380 tgattacatg taagagtgtc tatgattttt gttgattttt gttttgtttg tttgattttg 4440 attttgattt tgattttgat tttgattttg a 4471 // ID Gypsy-80_AA-LTR repbase; DNA; INV; 251 BP. XX AC supercont1.247; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-80_AA_; KW Gypsy-80_AA-I; Gypsy-80_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-251 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.247; Positions 1368984 1368734. XX SQ Sequence 251 BP; 82 A; 52 C; 47 G; 70 T; 0 other; tgtatattga aataacattt gactgaaaac atttatacgt gcgtatgcat attcagcaat 60 catgttaaga aaagaaagca ttacgcgcaa cactgccagc tacttgctag ctgttctgat 120 gtcaataaag agatcgaaag atcagttctg atcgtgaagt taatctgaac acaagttgtt 180 ctttttcaag tcaccctccg gaagaatacc cccgtaatca gtcacccttc ttgaacagta 240 gtgagcggac a 251 // ID BEL-116_AA-I repbase; DNA; INV; 6337 BP. XX AC AAGE02029143; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-116_AA_; KW BEL-116_AA-LTR; BEL-116_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6337 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029143; Positions 15761 22097. XX CC Positions [5197-5778] - Integrase core CC 'AGTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 817..6207 FT /product="BEL-116_AA-I_1p" FT /translation="MEGGSSNSAAGNARMREQLTTRRTTLLAQLNRAEQFV FT AKYDAGRDELEVPLRMENLDVLWASLEEVQAALEDLETSSEGKTANLEVRA FT SFEPKLFRIKANLKAKLPPPIIPQASRPDNPSRVPSTLSSLKLPTISLPEF FT DGDYRQWLTFHDTFQALIHDNDELPVIQKFHYLRAALKGEAAQLIESIAIS FT AANYPLAWDSLISRYSNEYLLKKRHLQDLMDVPRMKKETAAALHSTLDEFQ FT RHIKILKQLGEPTDAWSTLLEHLLCSRLHDDTIKAWEDHAASVDDQSYSCL FT VEFLEKRVRVLESISANHHGVQSTSASQPSGPNFRKPFFKMASHSVTENSF FT PKCHACDNRHLLVKCPRFMSMAVAEKLRLVNSKRLCVNCFRQDHFARDCSS FT NYTCRVCRKRHHSLLHLGFASGSNSTNPDPPSNQSHGNSSAAVASTSSQIS FT DHRRVQSNPAITKSNDVQKSNVARDAAPTAFLLTVVLKIVDVYGKEHYARA FT LLDSGSQPNLITDRLAQLLRLKRQRSNVQVQGIGEQPEYSNVSVTTEVRSR FT KGDFARNVTFLVLKKLTSSLPSCSVSVDHWKLPKDLFLADPGFNHSSDVDL FT ILGSQYFFDCFPTAARIQLSDTLPILVDSEFGWIVAGGTYLVPPSMEAVCC FT KTVTVSVDPLEECMEKFWKIEELPTRSTYSVEEKACEDHYVSTVSRTEEGR FT YVVRYPKRENFITLIGESKSTALRRFALLERRFFKNPELRESYESFMKEYL FT ALDHMRPVQEEKDATLSYYLPHHPVIKEESTTTKTRVVFDGSSKTSSGYSL FT NDALCVGPVVQDDLLTLVIRFRKYPVALVADIAKMYRQVLIHPEDAPLQRI FT VWGSSPSQPPSSFELQTVTYGLAPSSFLATRTLQQLAVDEGHAYPLGKPAL FT QKSFYVDDFIGGAESVPEAIQLREELTDLLAKGGFPIRKWTSNKLEVLQGL FT DADCIGTQSSVRFDPDETVKTLGICWEFERDQFRFHYHVSQNIIRATKRSI FT LSAISQLFDPLGLVAPIVVRGKMLMQELWLTACAWDDDVPDPLKQKWESFH FT HQLPKLSEFRIPRYAFQQDSLVQLHTFADASEAAYGACTYARSVDSEGRVR FT IQLLAAKTRVAPLKRLSLPKLELCAAVIAAQLHVHITKALEMNIACSFFWS FT DSTVTLQWLKSPPNTWKTFVANRVSEIQTTTHGARWNHVAGSQNPADLLSR FT GMIVEDFLERALWTTGPDWLSRSEDDWPISKQIDSAEANSERKPAISAVAR FT ADSKYNPLFSRYSQFSRLTRIVAYCMRFVENARSKSRTRPVAFIGPNPSLS FT LTVEQLSHARNKLIVLAQADSFQEEIKDLEQNRPLKRQSSIRLLHPFLDQK FT GVLRVGGRLRLAQRSYDFKHPALIPSFHPLAKLIASYFHTKLIHGGGRLTL FT AAMREEYWPVHGRRLVRSVIRNCIRCARANPVPVRQQIGQLPAARVTPSRP FT FAVTGVDYAGPVYLKAIHKRASPTKAYICLFVCFTTKAVHLELVSDLSTPA FT FLTALRRFVARRGRPSHIHSDNGKNFIGAKNELHQLYRMLANHKEVEKIQR FT VCAEDQITWHLNPPKAPHFGGLWEAAVKVAKSQLYRQLGHSHLSFEDMSTI FT LAEIEGAMNSRPLVPMTEDPDDCSAITPAHFLIGTSMHAIPDPDVSEINMT FT RLAHHQRLQHLFQQFWHHWRTEYLQELQRDTTRCQSNNEIRPGRLVVLVDE FT FQHPTRWPLARIEAVFPGADGLVRVVKLRTSKGIFTRPTTKICLLPTEAVN FT AEGTQQFNEIRDEDQPAENNEGIQKV" XX SQ Sequence 6337 BP; 1632 A; 1658 C; 1461 G; 1586 T; 0 other; ttttctggtg ccgtgaccag gatcgtgggc tgcaccattg ttattctttc gacccagcta 60 gtgagagctg tgacctagca gccatcatgt tacaaaaccc acgctacccc tagtgaaaat 120 cgccattgca gttgaagata gacgccgtcc ttgcatgggt attgtgcaga tgccatttgc 180 atgtcgctat tatcaccatt gttcctgatc cacgttgctg cgattgaaat atttcaaggc 240 ctatttgata taggtacaag gtaatgtacc tctccaacgt ttccaagcgc atctttaggt 300 gcttttagat gtgtttttcc ttttgcatga tttcttctgc atcggtgctg ctagcgtgat 360 ataccatcta gacgatcact cccaatacga aaccatcctc gccgaccatt cacacccgtc 420 ggcatacatt tgattcgtta agctgctggc cgtagttacc tgctacatgg tggcggccat 480 tcggaaacac atcgacgaac cattcccgtt tttgaagaca cctgcatgct atagctgagt 540 atatttgcat gccatctcag cgatcatccg cactcattct ggaataaatc accccaactt 600 ccctacactg gacccgtgtg aggctccaaa tctgttcctg cgctcctgat tttgattata 660 caaggctcga atagccttgg caaaggttag tacctgtcca agttgcctat ttgtctccat 720 ccaggtctac ctggtctcta tttgccaggc actttccgac ttccttgtcg atactgagca 780 ccgtcaacag catcgagtgt cgatccatcg agcaagatgg agggtggcag ttccaactcg 840 gccgcaggca acgcacggat gcgagagcag ctcaccaccc ggaggacgac attgttggcg 900 cagctaaacc gtgcggagca gtttgtcgcc aaatatgatg ccggacggga tgagctcgag 960 gtaccactac ggatggaaaa tctggacgtt ctgtgggcat cgctcgagga ggtgcaagct 1020 gcccttgaag acctggagac aagcagtgag ggtaagaccg ccaaccttga agtccgtgct 1080 tcgtttgaac cgaagctttt ccgaattaaa gccaatctaa aagccaaatt gcctccgcca 1140 atcattccac aagccagccg ccctgataac ccctctcgcg taccctccac cttgtctagt 1200 ttgaagttgc ctaccatttc gctcccagag tttgacggtg attaccgtca gtggctaacg 1260 tttcacgaca cgtttcaagc cctcattcac gacaacgacg agttgcccgt gatacaaaag 1320 tttcactact tgcgcgctgc gctgaaaggt gaagctgcgc aactgatcga atcaattgcg 1380 atcagtgctg caaattatcc tctagcatgg gactcattaa taagccggta ttctaacgaa 1440 tatttattga aaaagcgaca tctacaggac ctgatggacg ttccacgaat gaagaaagag 1500 actgcagcgg cgttgcactc caccttggat gaattccagc gacacatcaa gatccttaag 1560 caactgggcg agccaacgga tgcgtggagt acattgttgg agcacctgct gtgttcaaga 1620 cttcatgatg acaccattaa ggcctgggag gaccatgcag catccgtcga cgaccagagc 1680 tactcctgtt tggtcgagtt tctggagaaa cgggttcgag tgctcgaatc gatttccgcc 1740 aaccatcatg gtgttcaatc tacttccgct tcccagccga gcggaccgaa tttccgcaaa 1800 cctttcttca agatggcatc ccactccgta acggaaaatt cgtttcccaa gtgccatgct 1860 tgtgacaacc gtcatctctt agtgaaatgt cctcggttca tgtcaatggc cgtcgccgag 1920 aagctgcgtc tcgtgaattc caagcgcctt tgtgtaaatt gcttccgcca agatcacttc 1980 gcccgtgatt gctcatcgaa ttacacatgt cgagtgtgta gaaagcgtca ccattccctc 2040 ctgcatttgg ggtttgccag tggatccaat tcaactaatc cagatcctcc ctccaaccaa 2100 tcccatggta actccagtgc tgcagtagct tccaccagct cgcagatcag cgatcaccgc 2160 cgagtacagt ccaatccagc aatcacaaaa tcgaatgacg tccaaaagag caatgttgcc 2220 cgtgatgccg ctcccaccgc gttcctgctt acagtagtcc tgaagatcgt tgatgtatac 2280 ggaaaggaac actatgctcg tgctcttctc gacagtggtt cccaaccgaa tttgattacg 2340 gatcggttgg ctcagctact gagactaaaa cgacagaggt ccaacgtgca ggtgcaagga 2400 attggggaac agccagaata ctccaacgtg tccgtaacca ccgaagttcg ttcgaggaag 2460 ggtgactttg ctcgaaatgt tacgtttttg gttctcaaaa agttgacttc cagtcttcca 2520 agttgttccg tgtcggtgga ccattggaaa cttccaaagg atttgttctt ggcagatccg 2580 ggcttcaacc attccagtga tgtagatttg attctaggat cgcaatattt cttcgattgc 2640 ttccctacgg cagctagaat ccagctgtcg gacactctcc cgatattagt tgacagcgaa 2700 tttggttgga tcgtagcagg cggtacctat cttgttcctc catccatgga agcagtgtgt 2760 tgcaaaacag ttacggtttc agtggatccc ctcgaagaat gcatggagaa attctggaaa 2820 atcgaagagc tgcccaccag atccacatat tccgtagagg agaaggcatg cgaggaccac 2880 tacgtatcaa cagtttcccg aaccgaggaa ggacgttatg tggtgcgtta cccaaagcgc 2940 gagaacttta tcaccttgat cggggaatcc aaatccacgg ctcttagacg ttttgcactg 3000 cttgagcgac gctttttcaa aaatccggaa ttgagggaaa gttacgaatc cttcatgaag 3060 gaatatttag ccctggacca catgcgccca gttcaggaag agaaggacgc cacgctctct 3120 tattaccttc ctcatcatcc ggtcataaaa gaggagagca ccaccaccaa gacacgcgtc 3180 gtcttcgatg gatccagcaa aacctcatct ggttactcct tgaatgatgc gttgtgcgta 3240 ggtccagtag tccaggacga tctgctaact ctggtgatcc gttttcgaaa atatcctgtg 3300 gctcttgtgg cggacattgc gaaaatgtat cgtcaggtgc ttattcatcc cgaagatgca 3360 ccactccaaa gaattgtttg gggatccagt ccttcgcaac ctccatctag ttttgagctg 3420 cagacggtaa cttatggcct cgccccttca tccttcctag ccacccgcac cctgcagcaa 3480 ctcgcagtcg atgaaggcca tgcttatcca ctcggcaaac ctgctctcca aaaatccttc 3540 tatgtcgacg attttatagg cggagccgaa tctgttccgg aagctatcca gttacgtgag 3600 gagctgacag acctgctagc gaaaggagga tttccaattc gtaagtggac gtccaataaa 3660 ttagaagttc tgcagggact tgacgccgat tgcattggca ctcaatctag tgtccggttt 3720 gatccagatg agaccgttaa aacattgggc atttgttggg aattcgagcg agatcaattt 3780 cgattccatt accacgtgag ccagaatata attcgagcaa ctaaaagatc catcctttcc 3840 gcaatttctc aactatttga cccgttagga cttgtggctc caatcgtcgt gcggggtaaa 3900 atgttaatgc aggagctgtg gttaactgcc tgtgcctggg acgacgatgt tccggatcct 3960 ctgaagcaga aatgggagag cttccatcac caacttccga agctatcaga atttcgaatt 4020 ccacgctacg ccttccagca agattccctt gtgcagcttc acaccttcgc ggatgcatca 4080 gaagccgcct acggagcttg tacttatgca cggtctgtgg actcagaagg aagggtgaga 4140 atccagctgt tagctgcaaa aacccgggta gcacccctca aaaggttgtc actacccaaa 4200 cttgaactat gtgccgcagt tattgcagct caactccacg tccacatcac caaagcgctg 4260 gaaatgaata tagcgtgctc atttttttgg tctgattcca cggttacgct ccaatggctg 4320 aaatcccctc caaacacgtg gaagacattt gtagcaaaca gagtgtccga aatccagacg 4380 acaacccacg gagctcgatg gaaccacgta gctggctcac agaaccctgc tgacttactc 4440 tctcgtggaa tgattgttga agacttcttg gaacgtgcgc tttggacaac aggaccagat 4500 tggctatccc gctccgaaga tgattggcca atttccaagc agatcgactc cgccgaagca 4560 aatagtgaac gcaagccagc aatatccgca gtagctcgtg ctgattcaaa atataatccg 4620 ctattttccc gctattcgca gttctctcga ttgacccgca tagtagcata ctgcatgcga 4680 tttgtagaaa acgctcgatc gaagtcccga acacgtcctg tcgcttttat tggtcccaat 4740 cctagcctat cgctaaccgt tgagcaactt tcgcatgcta gaaacaaatt aattgttctc 4800 gcccaggcag attctttcca agaagagata aaagatctgg agcaaaatcg tccactgaaa 4860 aggcaatctt ctatacgtct tctgcatcca ttcctagacc aaaaaggtgt gttgagggta 4920 ggggggcggc ttagattggc acaacgttcc tacgatttca aacacccagc cttgattccc 4980 agtttccatc cccttgctaa actcatagca agctatttcc acacgaaatt gattcacggt 5040 ggcggccgat taacactggc agcaatgcga gaagaatatt ggccagttca tggtagacga 5100 ctggtgcgga gcgtcattcg caactgcatt aggtgtgccc gagcaaatcc cgtcccggtt 5160 cgtcagcaga ttggccaact tccagctgcc cgagtcaccc caagtagacc cttcgccgtg 5220 acaggggtag actacgctgg gcccgtttat ttgaaagcca tacacaaacg agcgtcgcca 5280 acgaaagcgt atatatgtct gttcgtttgt ttcacaacga aggcagttca tttagagctg 5340 gtgagcgatc tttcgacacc cgctttcctc actgctcttc gtcgcttcgt tgcacgtcgt 5400 ggccgaccat cccacataca ttcagacaac ggcaaaaact ttatcggagc gaagaatgaa 5460 ctgcaccagc tgtaccggat gttggctaac cacaaggagg tcgaaaaaat ccagcgagtc 5520 tgcgccgaag atcaaatcac ctggcatctg aacccgccca aagccccaca ctttggcggg 5580 ctctgggagg ctgccgtgaa ggtagcaaaa tcgcagttat accgtcaatt gggacattcg 5640 cacctgtcat ttgaggacat gtccacaata ctggcggaaa tagagggagc catgaattcc 5700 cgcccactcg taccgatgac tgaggatcct gacgactgtt ctgccataac gccagcgcat 5760 tttcttatcg ggacgagtat gcatgcgata cccgatccag acgtcagcga aatcaacatg 5820 actcgtctgg cacaccacca acgattgcag cacttattcc aacagttttg gcatcattgg 5880 agaacggagt atctccaaga actccaacgg gataccacta ggtgtcagtc caacaacgag 5940 atacgccctg gcagactggt tgtgctggtt gacgagttcc aacaccctac tcgttggcca 6000 cttgctcgca tagaagctgt ttttcctgga gctgatggtt tagttcgtgt agtcaagttg 6060 cgaacatcga agggcatctt tacacgacca actacaaaaa tatgcctttt acctacagaa 6120 gcagtcaacg cagaaggcac acaacaattc aacgaaatac gagatgaaga tcagccggcc 6180 gaaaacaacg aaggcatcca aaaggtttaa atgagagaaa tttgtaacgt agtgtgtaag 6240 caattatgaa aaatgaattc gtttgtaata gtttgtagcg ttagtattag ctgaagttat 6300 ttgatttgaa atttgttaat ttcaaaggtg gcggcga 6337 // ID Zator-N1B_CQ repbase; DNA; INV; 469 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A Zator DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Zator; DNA transposon; Transposable Element; Nonautonomous; KW Zator-N1_CQ; Zator-N1B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-469 RA Kojima K.K. and Jurka J.; RT "Zator DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 649-649 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >91% CC identity. 3-bp TSDs are usually TWA. ~150-bp TIRs. ~83% identical CC to Zator-N1_CQ. XX SQ Sequence 469 BP; 166 A; 85 C; 64 G; 154 T; 0 other; ggccgttgca aatatttttc aaagtttatg tcgccccccc ttcaaaattg gtccgaaaaa 60 tcagggggca aaaaaatatt tttccaaaaa acttcaaaat ttcaatgaaa atagaagtca 120 aatcaactga aaacaatcta aaatgcattt ttctgcattg ataatcatat ttagcatgtt 180 tgggctggat taaaaatatt ttgaattttt atgaaattcc aatgcacagc accgcaaaaa 240 cttttttttc gcaaaaaata aaattttcgt caatacttag atattttgga aactaatgat 300 tgcaaaacaa ctggacaggt gtataatgca ttttaaaaca cttttttcat tcaaatgttg 360 aaaccatggc tcgtaaattc aatttttaaa cttttttatt ttttcccccc ccccctcgac 420 tttggtcaga gtcgagggac ataaacttca aaaaatattt gcaacggcc 469 // ID Gypsy-37_NVi-LTR repbase; DNA; INV; 486 BP. XX AC . XX DT 01-JUL-2009 (Rel. 14.07, Created) DT 01-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-37_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-486 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1385-1385 (2009). XX DR [1] (Consensus) XX SQ Sequence 486 BP; 134 A; 140 C; 96 G; 116 T; 0 other; tgtcacgtgt gaatactcac acgtatatag cgttgctagt tataatgcaa tcgatgcata 60 agtcgccaaa tacgcgcata gcaacagctc ggactcaccc cgaagcccgt ttctttcgac 120 tcagcataac ctcccttcca cgatttcgga cgaccgcgcg ggtaggaatc acaccacccg 180 gggagtctat aaaagggggt gcatgcgggc gctcgctctc gattccgcca ctcacgatcc 240 agcactagac aggtagtttt gactcggaga cgacagatag gatcgtatac ttagaacgca 300 cattctccga gatcacttta acacgcgttt ctctctttca tacaatcagt ccattgtgaa 360 ttacgattaa taaacagaca cattcaccga taatctctcg tgttcattca ttaccccggt 420 attttcgcag caaccgtgag cttaacacct cacggagaac aaaaccatcg accctcacac 480 gtaaca 486 // ID TRAS9_SC repbase; DNA; INV; 1908 BP. XX AC AB046675; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 14-AUG-2009 (Rel. 14.09, Last updated, Version 3) XX DE Samia cynthia TRASSc9 gene, non-LTR retrotransposon, partial cds. XX KW R1; Non-LTR Retrotransposon; Transposable Element; KW endonuclease domain; reverse transcriptase domain; TRAS9_SC. XX NM TRAS9_SC. XX OS Samia cynthia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Saturniidae; Saturniinae; Attacini; Samia. XX RN [1] RA Kubo Y., Okazaki S., Anzai T. and Fujiwara H.; RT "Structural and phylogenetic analysis of TRAS, telomeric RT repeat-specific non-LTR retrotransposon families in Lepidopteran RT insects."; RL Mol. Biol. Evol 18(5), 848-857 (2001). XX DR Genbank; AB046675; Positions 1 1908. XX FH Key Location/Qualifiers FT CDS 307..1905 FT /product="TRAS9_SC_1p" FT /translation="MGLEILNTGTTPTFEVLRGDRLYTSCIDVTACSGSLL FT GRVEDWRVDRGLTTSDHNTITFSLRVERALTPLPSITTRKYNTKKANWTDF FT GRLFGAMLEENNISESIRKSRTPEDLEIAIQAYTSAIHEACSTTIPQARPW FT KGNPVPPWWNRQLEDLKREVLRRKRRIRNAAPLRKQFVIDEYLRAKLEYAA FT EARKAQTESWKEFCSTQKRESMWDKIYRVIRKSSRRQEDVLLRDQVGNTLS FT PQQSAELLAKSFYPDDLEASDDQYHKDLRTTVEVGQSGFFSEDDPLITSTE FT LDTVLRAQNPKKAPGPDGLTSDICTAAINCDRVVFLALANKCLALSHFPRA FT WKVAHVIILRKPGKDDYTSPKSYRPIGLLPVLGKIVEKLIIGRLQWHIMPA FT LNRRQYGFMPQSSTEDALYDLVHHIRTELQDKKSVLVISLDIEGAFDNAWW FT PALKLQLQERRIPRNLYKLVDSYLRDRKITVNYARATYEKGTTKGCVQGSI FT SGPTFWNIILDPLLQLLAREGIHAQAFADDVVLVFDG" XX SQ Sequence 1908 BP; 544 A; 474 C; 503 G; 387 T; 0 other; gggactgtta aagcggctat catcgttttc ggtgacctac tgaacgtcat tcatgaccct 60 cagctggtga ccgagaccga ggcggctgtc ttgctggaag ggggaggcct gaaacttgga 120 gtagtgtccg tctaccttga gggaaacaat gacatcgagc cctacctaca taggataaag 180 ctgacctgcg gaaaactcga caccggacac ctcatcgtgg caggggatgt aaatgcctgg 240 agccactggt gggggagcag ctcggaggac ggcagaggcg tagcgtacca ttccttcctg 300 aacgaaatgg gactggaaat cctaaacact ggaaccactc caacattcga ggtccttagg 360 ggtgacaggt tgtacacaag ttgcattgat gtaacagcgt gcagcgggtc actcctaggg 420 agggtcgagg actggagggt cgatcgagga ctgacaactt ccgaccataa tactattacg 480 ttttcactgc gcgtagagag ggcactgaca cctttgccct caatcaccac gcgcaagtat 540 aacacgaaga aagcaaattg gacggacttc ggtcgactct ttggtgccat gttggaggaa 600 aataacatct cggagtccat taggaaatcc agaaccccag aagatcttga gatcgcaata 660 caagcctaca ccagcgcaat ccatgaagcc tgttccacaa ccatcccaca ggcaagacca 720 tggaaaggaa atcccgttcc tccctggtgg aataggcaat tggaagacct caagagagaa 780 gttcttagaa ggaaacgaag aataaggaac gccgcgccgt tacgtaaaca atttgttatc 840 gatgagtacc tccgagctaa actggagtac gctgcggagg ccaggaaagc ccaaacagag 900 agttggaagg agttctgctc gacgcaaaag agagagagta tgtgggacaa aatctacagg 960 gtcatcagga agtcgtcaag gaggcaggag gacgtgctcc tgagagacca ggtaggtaac 1020 accttgtccc cccaacagtc ggcagaactt ctcgccaaat ctttctaccc agacgacttg 1080 gaagcttccg acgaccaata ccacaaggat ctcaggacaa cggtagaagt tgggcagtct 1140 gggtttttct cagaggacga cccccttata acatctacgg aactggacac agtgcttagg 1200 gctcaaaatc cgaaaaaagc accgggtcca gatggtctga cctcggatat atgtacggcg 1260 gcaatcaact gtgatcgggt ggtgttccta gcgctggcca acaagtgcct agcgctgtca 1320 catttccccc gagcatggaa ggtagcgcac gtcattatcc tgagaaaacc gggcaaagat 1380 gactatacca gccctaaatc ctatagacct ataggccttc taccagtcct agggaaaatt 1440 gtggaaaaac tgatcatagg tcgcctccaa tggcacatta tgccagcctt aaaccgtagg 1500 caatacggtt ttatgccgca aagcagcacc gaggatgccc tctatgactt agtacaccat 1560 attaggacag agctgcagga caaaaagtca gtgctcgtca tatcactgga catagaggga 1620 gccttcgaca acgcatggtg gccagctcta aaacttcaat tgcaggagag gaggattcct 1680 cgaaacctat acaagctggt ggactcgtac cttagagacc gcaagatcac ggtcaactat 1740 gcacgagcga catacgagaa gggtactacc aaaggttgtg tccaaggttc cataagtgga 1800 cccacctttt ggaacatcat actcgacccg ttgttgcaac tacttgcaag ggaagggatc 1860 cacgctcaag cttttgcaga cgacgtggtc ctggttttcg acggagac 1908 // ID Copia-2_TCa-I repbase; DNA; INV; 4112 BP. XX AC ChLG7; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_TCa_; KW Copia-2_TCa-LTR; Copia-2_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG7; Positions 6101252 6097141. XX CC Positions [1499-2032] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 158..3532 FT /product="Copia-2_TCa-I_1p" FT /translation="MADGINLSAINKFDGNNYKQWRFQLKCALKAKGVYSI FT AIGEIEKPSANRVEELNTWNKKDAIAMCVITSTMELSQITMIESCDTAKEI FT LDKLDAIYDIKTETNKMLIHEEFHQYKMCLNDSIAQHISKVENLARRIRES FT GDNISEVAIITKILGTLPAKYRNFRQAWLSLAEDKQTLSNLTSRLIDEERN FT LTTVETTENALVTTTRVVPNKKTIRGKPRITCYNCNKKGHISRECRAPRKT FT HVTQGHSGAFLIEDVNEIATKIQEDEAWILDSGASAHMTSRRDMFSTIQEV FT DEFSVKLGNGSELKVKGKGTIEIECWLENEWIKNKMTDVWYIPNLKRNLFS FT EGQITKKGMTITKENNKAKIVCDGVVKACAVRQSNNIYKLLIRCTTKTSEV FT NLTATRSLQTWHQRLGHINMKTIQEMAKRGLIDAHGTTDDEVNQICEGCRY FT GKQHKMPYHKIEKRNYKAGELIHTDVCGPMSVDSVGGCRFFILFKDDSTNF FT RTVYFVKHKSDALDCFKQFYYMCKNKFGHPIKTLRADNGGEYVNQEFHKFL FT KDRGIILETCAPYCHEQNGKAERENRTLVESARSMIFTKNLPLYLWAEAVN FT TAAYILNRTTNSINQEVTPYELWTNKTPDLKHIRIFGSDAYMHVPDNLRKK FT LEPKSKKLILVGYDNDSTNYRLFDRATRKIKIARNVTFGEDNELTLPRANK FT IVITDEIHDDERNLEHNNQNDQTKEPEEEPVPETEDRRGYNLRPREILKTP FT NNLDDYVINIVEGDTPNTYDEAITSSESEEWRNAIQEELDALKKNDTWNLV FT NLPVDKKAIGSKWVFKIKRSPNNKNIRYKARLCAKGFAQTEGVDYNETYAP FT TTRYDTIRILLAVAARENYQMIQFDVKTAFLHGELEENIYMQPPPGLSVPP FT NSVLKLKKSLYGLKQSPRCWNRKFNEFLELFGLKQCESDKCVYTGKFNEQL FT MILLLYVDDGLILAKDMDTLHKMGNKLKSAFEVTICEPEYFVGLEIKRQSD FT PKVGKNSIFIHLSNFIDRIIDRFNMNDARTCNTPADPNAILSQSDPNEPKN FT NDVLFPYREAIGCLMFAAISARPDIMYAVNVVSRYQNNPTIAHVNAVKRIF FT KYLKGTKELGNPLQQLR" XX SQ Sequence 4112 BP; 1450 A; 771 C; 896 G; 995 T; 0 other; ggttatgggc ccaggtcgca agggtagttt tagttcagtt ttgtctacag taacgttcga 60 gtgaaacacg cgttcgggcg attaaaaact ggttcgtgcg gtcaaaacca gtgtgtcggg 120 tgattcacag acaaaaacga gtgtcgtgaa aacaagaatg gcagacggaa taaatttgtc 180 agcaataaac aaattcgacg gaaacaacta caagcagtgg cggttccagc taaaatgcgc 240 cctaaaggca aagggggtgt attcaatagc aataggcgaa atagagaaac cgtcagcgaa 300 tagggttgaa gaactcaaca cctggaacaa aaaggacgcc atagcaatgt gcgtaataac 360 atctacgatg gaattgtctc aaataacgat gatagaaagc tgcgatacgg ctaaagaaat 420 tctcgacaag ttggatgcga tctacgatat aaagacggag acaaacaaga tgcttataca 480 cgaggagttt catcagtaca aaatgtgttt aaatgattcc attgcacaac acatttcaaa 540 agtcgagaat ctggcaagac gaataagaga atctggcgac aatataagcg aagtggccat 600 aattacaaaa attcttggta cgctgccggc gaaataccgc aattttagac aggcatggtt 660 gtctttagcg gaagacaaac aaacacttag caacctgaca tcaagactca tcgacgaaga 720 gcgtaattta acaactgtgg aaacaacaga aaacgcgttg gtcaccacta cacgtgtcgt 780 accaaacaag aaaacgataa gaggaaaacc acgcattacg tgctacaatt gtaataaaaa 840 agggcatatt tcaagagagt gtcgtgcccc acgaaaaacc catgtgactc aaggtcacag 900 tggagcattc ttaatcgaag atgtcaacga aattgctacc aaaatccaag aagatgaagc 960 atggatcctg gacagcggcg cctcggcgca tatgacgtca cgacgcgaca tgttttcaac 1020 aatacaagag gtcgatgagt tcagtgtgaa acttggcaat ggcagcgaac taaaagtaaa 1080 aggaaagggt acgattgaaa tcgaatgctg gctagaaaac gaatggatta aaaacaaaat 1140 gacagatgtt tggtatattc caaatctaaa gagaaacctc ttctcggagg gtcaaattac 1200 gaaaaaggga atgacaatta caaaagagaa taacaaagcc aaaatagttt gtgatggtgt 1260 cgtaaaagcc tgtgcagttc ggcagtcaaa taatatttat aaattactga tacgttgtac 1320 aacaaaaaca agcgaggtta atttgacagc gacaagaagt ctacaaactt ggcatcagcg 1380 cttaggacac atcaatatga aaacaatcca ggagatggcg aagagaggct taatcgacgc 1440 acatggaaca actgatgacg aggtcaatca aatttgcgag ggttgtcggt atggtaagca 1500 acacaaaatg ccatatcata aaatcgagaa aagaaattac aaagccggtg aattaattca 1560 cacagacgtg tgtggtccaa tgtcagtaga ttccgtcggc ggatgtcgat tctttatact 1620 tttcaaagac gattcgacga attttagaac cgtttatttc gtaaaacata aatcagatgc 1680 tctcgattgt ttcaaacagt tctattatat gtgcaaaaat aaatttggtc acccaattaa 1740 aaccttgcgt gcggacaatg gcggggaata cgtgaatcag gaatttcata agttcttaaa 1800 agatagaggc ataattctcg aaacctgcgc cccatattgt cacgaacaaa acggaaaagc 1860 tgaacgcgaa aatcggactc tagttgaaag cgcccgatct atgatcttta cgaagaattt 1920 accattatat ttgtgggcgg aagccgtaaa cacggcggca tatattttga atcgaacgac 1980 aaattcgatc aaccaagaag tgacgccata cgaactttgg acaaacaaga cacctgatct 2040 aaaacacata agaatctttg gttctgacgc gtacatgcac gtacctgata acttaagaaa 2100 gaaattagag ccgaaaagca agaaactgat ccttgtaggg tatgataatg actcgacaaa 2160 ttacagactg tttgatcgag caactcgaaa gattaaaatc gctagaaatg taacattcgg 2220 cgaagataat gaactgactt tacccagagc aaataagatt gtaatcacgg atgaaattca 2280 cgacgatgaa agaaatttag agcataataa ccaaaacgat caaactaaag aacccgagga 2340 agaacccgta ccagaaactg aagatcgaag agggtataat ctgagaccac gtgaaatttt 2400 aaaaacacca aacaatcttg atgattatgt gataaacata gttgagggcg ataccccaaa 2460 tacatatgac gaagcaatca ccagtagcga atccgaagag tggagaaatg caatacagga 2520 ggaactggac gctttgaaga agaacgatac ttggaatctt gtcaacctac cagttgataa 2580 aaaggcaatt ggatctaagt gggtgttcaa aattaaaaga tcccccaata ataaaaatat 2640 tcgttataag gcacgattat gtgcaaaagg ttttgcccag accgaaggcg tagattataa 2700 cgaaacgtat gcaccgacaa ctcgatacga cacgataaga attcttttgg ccgtcgccgc 2760 aagagaaaat tatcagatga tacagttcga cgtcaaaaca gcattccttc acggagaatt 2820 agaagaaaac atttacatgc aacctcctcc cgggttatct gtgccaccta attctgtgtt 2880 aaaattaaaa aagtcattgt acggacttaa gcaatcacct cgctgctgga acaggaaatt 2940 caacgagttc ctcgaacttt tcggacttaa acaatgtgaa agtgataaat gtgtttacac 3000 aggaaaattc aacgaacaat taatgatcct tctactgtat gtagatgatg gtctaatttt 3060 ggcaaaggac atggacactc tacacaaaat gggcaacaaa ttaaaaagtg cgttcgaagt 3120 gaccatttgt gaaccagaat attttgtggg actggaaatt aaaagacaga gtgatcctaa 3180 agttggtaag aattcgatct tcatacattt atcgaacttt atcgatcgta ttattgaccg 3240 ttttaatatg aacgatgcga gaacatgtaa tacaccagct gatccaaacg cgatattgtc 3300 gcaatccgat ccaaacgaac ccaagaacaa tgacgtgcta tttccttacc gagaagcaat 3360 tggatgttta atgttcgccg caatttccgc gcgtcccgat attatgtacg cggtaaatgt 3420 agtgagtcgt taccaaaaca atccaacaat cgctcacgta aacgccgtga aacggatctt 3480 caaatattta aaggggacta aggagcttgg gaatcctcta caacaactcc ggtaaactca 3540 tcggatactc ggatgccgat tatgcgaacg acatcgattc caggaagtcg acaagcggct 3600 tcgttttcaa actgggagac ggcgccatca cctggtgtag tcgtcgtcaa aggtgcaact 3660 ccctatctac aacggaggcc gagtacgtgg cagcttcgga agctaccaag gaagcagtgt 3720 ggataagtgg cctcttaagt gaagtgggtg aaaagtgcgg tggtgtcgct ttgtgcgttg 3780 acaaccaatc agcgattaag cttgttaaaa atccaatgta tcataaacgc actaagcata 3840 tcgatgtccg atatcacttt gtacgtgaga aatatgagaa tggagatatt gtcttgaaat 3900 atgtaccatc gactgaacaa gtcgccgatg tgtttactaa agcattatgt tatgcgaaat 3960 ttaatgtttt tgttttgaat ttgggaatgt tgttcattag tcccactgtg taattttttt 4020 ttgtttgttt cttgtatttt gttctgatgt actctaatta atgtaagttg tttaaaaaaa 4080 aatgtaacaa attccggcag aatacggggg ag 4112 // ID Crack-12_BF repbase; DNA; INV; 2426 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-12_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-12_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2426 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2426 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 817-817 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..2235 FT /product="Crack-12_BF_2p" FT /translation="QILILACAQAGREACGTRWCEYRSYKQYTQQSFIDGL FT KSVHWDTVLDCTDVTDAWDAFKDIFLNVADKFAPLCKRRVRDNNTVAPWMT FT DRVKNMMGRRDAARRKAIKTKEGKDWEVYRVIRNQTTSAVRKAKKIHFAEA FT VTEAKGDQSLMLKIINAFTGKSKGKCQVQNLARPDGTNISDPSEMSQEFNS FT YFATCATSLADSMPTSQKDPLRHIPEPSTEFSFESVDESTVLSELLRLKAK FT KATGLDNIPSKLLKDSAPVIVRPLTHIFNLSLATGEVPSDWKTAKITPIYK FT SGNRTNVANYRPVSVLSVTSKVLEKLVGNQVSRYMAQNNLLTVYQSGFRRN FT HSTASAVLKIVEDIRSANNSRQVTVALFLDLRKAFDTVNHAILLSKLKKLG FT FDRDATKWFTSYLSGRSQCTSLQSQCSEMAAVTCGVPQGSVLGPLLFCLYV FT NDMPQVLEKCKIHLYADDTAIYYSAGTMKECEEAVSQDMKRVSDWLIDNRL FT SLHHDKTKSMLFGVPQKLRHVGTTVQITDGVNIYEQVNTFKYLGITLDPSL FT QWSAHITNITKKIFSGLSAMRRAKPYVTKEILHTMCQTLLLSHLDYCGAAW FT LPSLVQGKKTQMLQLDRLLFRAARMITGYTLRDHVPVGKLLAEAGLVSVRN FT RAEKISLATVFNAVRGKAPLYISDMFRWKSPPTIRARTRTAVKMFADWDPH FT QLNCPVASLKCYEGSLKSFGPQLWNKLSLKHRKTLGLVKFIKEL*" XX SQ Sequence 2426 BP; 725 A; 529 C; 577 G; 595 T; 0 other; cagatcctta tcctggcctg tgcacaagca ggaagagagg catgcggtac acgctggtgt 60 gaatacaggt cttacaagca gtacacccaa caatctttta ttgatggact caagtcagtt 120 cactgggata cagtattaga ctgcactgat gtaacggatg cttgggatgc ttttaaggat 180 attttcctga atgtagctga caaattcgct ccgctgtgca agaggagagt gagggataac 240 aacactgttg ctccttggat gacggacagg gtaaaaaaca tgatggggcg acgcgacgca 300 gcaagacgca aagcaatcaa aaccaaggag ggaaaagact gggaggtcta ccgggtgata 360 aggaaccaga caacatcagc agtcagaaag gcaaagaaaa tccactttgc tgaagctgtg 420 acagaagcga aaggagatca aagcctcatg ttgaaaatta ttaacgcctt cacagggaaa 480 tccaaaggga agtgtcaagt tcagaacttg gcgagacctg acggtaccaa tatatcggac 540 cccagcgaaa tgtctcaaga gttcaatagc tacttcgcaa cttgtgctac aagccttgcc 600 gatagtatgc caacctccca gaaagatcct ctccgacaca taccggagcc tagtacagag 660 ttcagctttg agtcagttga cgaatcgaca gtgctaagtg aacttctcag attgaaagca 720 aagaaggcca ctgggctgga caatatcccc tcaaaactgc ttaaggactc cgcccctgtc 780 attgtcagac cactgacaca catttttaac ctatcacttg ccacgggaga agtccctagt 840 gactggaaga cggcaaagat cacacccatt tacaagtccg gtaatcgaac taatgtggcc 900 aactaccgtc ctgtctctgt actgagtgta acatccaaag tgttggaaaa actcgttggt 960 aaccaagtat cacgctacat ggcccagaac aaccttctca cggtatacca aagtgggttt 1020 cggagaaacc acagtactgc ctctgcagtt ttgaagattg tggaggatat taggtcggcc 1080 aataatagtc gccaggtcac agttgctctc ttcctagact tacgcaaagc atttgacact 1140 gtgaatcatg ccattctgtt gagtaagctc aagaaactgg gttttgatag ggatgcaaca 1200 aaatggttta catcatacct ctctggtcgc tcacaatgca ctagtctgca aagtcaatgt 1260 tcagagatgg ctgcagtcac ctgcggggta ccacagggga gcgtacttgg tcccttgctg 1320 ttttgtttat atgtcaacga catgccacag gttctagaaa aatgcaaaat acacctgtat 1380 gcagacgaca ccgcaatcta ttactcggcg ggcacgatga aagagtgtga agaggcagtt 1440 tcccaggata tgaagagagt gtcagactgg ttaattgata atagactttc tctccatcat 1500 gacaagacaa agtctatgct attcggggtt cctcagaagc tgagacatgt tgggacaaca 1560 gttcagatca cagacggtgt taacatttat gagcaagtta acactttcaa atatttgggc 1620 atcactctag acccatcgct ccagtggtca gctcacataa caaatatcac gaaaaaaata 1680 ttcagtggcc tcagtgcaat gagacgtgca aaaccctacg taacaaaaga aatattacat 1740 actatgtgtc aaactttgtt gttgtcgcac ttggactact gtggcgcggc gtggttgcca 1800 agccttgtac agggcaagaa aacacaaatg ctgcaactcg acaggctgct gtttagagct 1860 gcaaggatga ttacaggcta cacactccgg gatcatgtcc cagttggtaa actgttagcg 1920 gaggcggggt tggtgtcggt gagaaatcgg gcggaaaaaa tcagtctggc cacagttttc 1980 aatgctgtac gaggaaaagc ccctttgtac atatctgaca tgttcaggtg gaagtcacca 2040 ccaactataa gagctcgcac ccgcacggct gttaaaatgt tcgctgactg ggaccctcac 2100 cagttgaact gtcccgtagc aagcctgaag tgctacgagg gaagcttaaa atcttttggc 2160 ccccaactgt ggaacaaact gtcactgaag caccgcaaga ctctgggact ggttaagttt 2220 atcaaggaac tgtgaggtct agatggaaat gacaagtgaa attgtgagtt ttgatatgtc 2280 gttttgtggt tatgcactat tgtatgtgtt attgtaatat tgcaaagttt taaattgtat 2340 ttttgtttac ccagcattgc ctgaaaaaca ggttgttatg tacctgagtg ttactgctgg 2400 taaaataaat aaactgaaac tgaaac 2426 // ID BEL-52_CQ-LTR repbase; DNA; INV; 435 BP. XX AC AAWU01015848; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-52_CQ_; KW BEL-52_CQ-I; BEL-52_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-435 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 258-258 (2011). XX DR GenBank; AAWU01015848; Positions 28573 28139. XX SQ Sequence 435 BP; 139 A; 112 C; 80 G; 104 T; 0 other; tgttggagac caacccgatc acatcttccc agtagcttgt gctcacgggc atggctacca 60 accattttta tctaaacgaa aaattcttgt agatctaaaa caaaagcgac tagcgcggca 120 agaagtggct tcaagccgaa gaatgttata ttccccccat tccaatttga cccatcttac 180 gagcagtgcg aaaaaggaaa attatttgac tagaattaat aaaacttaaa ttagtgatac 240 gtctctttaa ataaagtact agttttacta gttttgtggt ttcaattaaa catcgcgagt 300 ttttgtgcta caccaaccgg gtccgacgat cactagtgcc aacccacggc aaaccctcca 360 acccgaaccg cagaaggacc tgcgattccc caaacgacca gccaatccgg agcgaccaca 420 acgagctatc gaaca 435 // ID CR1-21_CQ repbase; DNA; INV; 4070 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-21_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4070 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 25-25 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 133..3867 FT /product="CR1-21_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MSIPHSHPSAPIDSQPPRDVAGCPAAYVVGLGRTYQS FT IEVGPHSSDTTEPIADTVSSAQICSSHQRRPGHVFGHGSGANQASILGKYL FT RSPGTSSLPHVPVAFSDRYSSDVEFELTSRFLNGPSNDYTHLAEWAGACEL FT FTSDNVPVLEGCQCPGCFLVPFTQPPRDPGGQLAPQPVTFLCFNCIQKNHE FT ISVLHSPSSTRRHPSCPDPSCHALKDTGDFAAIVPSPEIADAQATQDPLPD FT ILSQQDISSGLQPVAFQQVRFYYQNVRSLNNKVDEFFLACTDCEYDVILLT FT ETWLDGSVKPLQLFGDLFAVYTTNRNPANSRRKVGGGVLIAVRKTLDSVVC FT VEAAADNLEQIFVTIHLKNQKMFIGCFYLPSEKRYEHQLVEQHLTCIETVR FT SISAPNDTIVVVGDYNQSHLVWKPLAGNRSFVDPLLTHTRGASERTTHEVL FT LDGIASNNMHQSNLVCNHQNRILDLVLINDGATHSAPEAALESLVSLDNYH FT PALTFEVSAALNNSYEDTLDPLGLNFRKTDFVGLQRSLGETDWSSVLTSID FT VDEAVAAFGQILRSNLTRHTPGFLPARKPAWSNERLRKLKQFRASALNRYS FT QRRCPETKRVFNAASSRYKAYNKRRYAEHIKRTQANLRRNPSQFWKFVKSK FT YKETGLPAVMTLGETVATTNIEKCELFARHFASVFRPPGTPPTSQQHLETV FT PCDLIDLDVFQITEDMLKRGIQKLKSSNSPGPDGIPPVIFKRCFNQITAPL FT LRVFNLSMSQCKFPEEWKRSVMFPVFKKGAKRSVENYRGITSLCSGSKLLE FT IVVGEVIMFNCRPFISSEQHGFTPGRSVTTNLMEFTNFCIENLAAGRQVDA FT VYTDLKAAFDRIDHDIALQKFAKLGFSARLCRWLESYLKNRIIQVKIGTSL FT SMEFSNGSGVAQGSNLGPIVFNTFFNDSNSLFNDNCNLSYADDFKIFLAID FT NLPDCYRLQSRLDKFANWCALNRLELSVSKCTTISFHRKTSREVFTFEYTL FT DGHALERLKVVKDLGALLDEPLSFRPQQAAVVDKANRQLGFLFRMTREFDD FT PLCLRSLYCALVRSHLESSAVIWAPYHQNWIDRIERIQKKFVWFACRRMPW FT TDPTRLPRYEARCNLIGLETLENRRTISKSIFAAKILTAQIDSPNLLNLLN FT TQISSRDLRNRPNFLARPRVRTEFHNNSPVRSMSAAFNEMYFLFEFHEPIP FT RFREKLRIEFQERTRTLLGAPRQDPVVGRLRIGRQ" XX SQ Sequence 4070 BP; 1024 A; 1079 C; 919 G; 1047 T; 1 other; cgacaacttg gggttcctca aaaaccgtcg cgtggagtga agtgccatca gtctgcgtgg 60 gagtacgggg gccaccgggc ccccaaacca gttaagttat tgcatgtaaa wgaatttcca 120 aaaagtcaag aaatgtccat cccccatagc catccatccg ctccaattga ttctcaaccg 180 ccgagggacg tcgcaggttg ccctgctgcc tacgttgttg gtctgggacg cacgtaccaa 240 agcattgagg ttggccccca ctcctccgac actactgagc caatcgccga caccgtttcc 300 agcgcccaga tatgttccag tcaccagaga cgtcccggcc atgtgttcgg tcatggaagt 360 ggggccaacc aagcctccat cttaggcaag tacttgcgtt ctcccggaac ttcttcgctg 420 cctcatgttc ccgtggcttt tagtgaccgc tactcatctg acgtggaatt cgagcttacg 480 agtcgttttc tgaatggacc atccaatgac tacacccatc tggccgaatg ggctggcgca 540 tgtgagctgt tcacctccga caatgtgccc gtactcgaag ggtgtcagtg ccccggctgc 600 ttcctagtgc cttttacgca accaccgagg gatcctgggg gccagctggc cccccagcca 660 gttacgtttt tatgttttaa ttgtattcaa aaaaatcacg aaatttccgt actccatagc 720 cccagttcca ctcgccgcca tccgagttgt ccggaccctt cgtgccatgc gctaaaggac 780 actggcgact tcgcggcgat cgttcctagc cccgagatag ccgatgcaca agcgacgcaa 840 gatcctctgc cggacatcct gtcgcaacaa gacattagca gtggtctgca gccggttgcc 900 tttcaacagg tccggttcta ttatcaaaac gttcgatcgc tgaataacaa agttgatgaa 960 ttcttcctgg cttgtacgga ctgcgaatac gacgttatct tgctcaccga gacgtggctc 1020 gacggtagcg tgaaaccact ccagttgttc ggcgacctgt ttgcagttta cacaacaaat 1080 cgaaatccag ctaacagccg tcgtaaagtt gggggaggtg ttctaattgc ggtcagaaaa 1140 acactggatt ccgtcgtctg cgtcgaggca gctgccgata atctggaaca aatttttgtg 1200 accattcatc tcaagaatca aaaaatgttc atcggctgct tctatcttcc ctcggaaaaa 1260 cgatatgagc atcaactggt agagcaacac ctgacctgca tcgaaactgt tcgctccatc 1320 tccgctccaa atgacactat agtggtagtg ggtgactaca accagtctca tctcgtttgg 1380 aagccgcttg ctgggaaccg ttcattcgtt gatcctttgt taacccacac gcggggtgca 1440 tcggaaagaa ctacgcatga agttctgctg gatggtattg cgagtaacaa catgcaccaa 1500 agcaatcttg tgtgtaatca ccaaaaccga attctggatt tagtactcat caacgacggt 1560 gcgacgcact ccgcccctga agctgctcta gaatcgctcg tttcactgga caattaccat 1620 cctgctctaa ccttcgaagt gtcggctgcg ctcaacaata gttacgaaga cactctcgac 1680 cctcttggat taaatttccg aaaaactgat ttcgttggcc tacaacggag tttaggagaa 1740 actgattggt cgtctgttct cacttccatc gacgttgacg aagccgtagc cgcctttggc 1800 cagatcctgc gaagtaatct tacacgccac acgcccgggt tcttgcctgc aaggaaacct 1860 gcttggagta atgaacgcct tcgcaagctt aagcaatttc gtgcctctgc acttaaccga 1920 tattctcaac gccgatgtcc tgaaaccaag agggtcttca atgccgcaag ttcccgttat 1980 aaagcttaca ataaacgccg gtacgccgaa cacattaaga ggacgcaggc gaatcttcga 2040 cgcaatccgt cgcagttctg gaagtttgtg aaatcgaagt acaaagagac tggacttcct 2100 gccgtgatga cgttaggaga aaccgtggct actaccaaca tcgagaaatg tgagctgttt 2160 gcgagacact ttgcttcagt tttccggccg cctggcactc caccgacttc acaacagcat 2220 ctagaaacag ttccatgcga tttgattgat cttgatgtgt ttcaaattac ggaggacatg 2280 ctgaaaagag gaatacagaa attgaaatcg tcgaattctc ctggccctga cggaatcccg 2340 ccggtgatct tcaagcgttg tttcaaccag attactgccc cactgctccg agttttcaac 2400 ctctccatgt cgcagtgcaa gtttcctgag gagtggaaaa gatccgtaat gtttccagtg 2460 tttaagaaag gtgcaaaacg gtcggtcgag aactacagag gaattacgtc gctgtgctct 2520 ggttcaaaac ttctcgagat cgtggtcggg gaagtcatca tgtttaactg ccgaccgttc 2580 atcagttctg agcagcatgg atttacacct ggtcgctccg tcactactaa tttgatggaa 2640 tttaccaatt tctgcattga aaatctggca gctgggcgtc aagtggatgc tgtctacacc 2700 gacttgaaag ccgcattcga tcgaattgac cacgacattg cccttcaaaa atttgcaaaa 2760 ctcggtttct ctgcacgtct ttgtcgctgg cttgagtctt atctcaaaaa tcggatcatt 2820 caggtgaaaa ttggaacatc gctgtcgatg gagttctcca acggttcagg agtggcacag 2880 ggtagcaacc tcggcccgat tgttttcaac actttcttca acgactccaa ctctctcttc 2940 aacgacaact gcaatctctc gtatgctgat gacttcaaga tcttcctcgc tattgacaac 3000 ttacctgatt gctacagact gcagtcccgc ctagacaaat ttgccaactg gtgtgcgttg 3060 aatcgtttgg agctgagcgt ctctaaatgc actacgatat cgtttcatcg caagaccagc 3120 cgggaggttt ttacatttga atacacactg gatggacacg cgctcgaacg tctgaaagta 3180 gtgaaagatc taggagctct actagacgaa cccttatctt ttcgcccaca acaagcagca 3240 gtcgtggaca aggccaacag acagctcggt tttctgttta gaatgacccg cgaattcgac 3300 gaccccctct gtttacgctc gttgtactgc gcgctagtcc gctcacacct ggaatcatcg 3360 gcagttatat gggctcccta ccaccagaac tggatcgatc gtatcgaaag gatccagaaa 3420 aagtttgtct ggtttgcctg taggagaatg ccatggacgg acccaactag actaccacgt 3480 tacgaggcac gctgtaacct gattggactc gagaccctgg aaaatcgtcg gacaatatcg 3540 aaatctatct tcgctgctaa aatactgact gcgcaaatcg actcccccaa tctgttaaac 3600 ttacttaata cccaaatttc ttctagggac ctgcgaaatc gtccaaactt cctcgcacga 3660 ccgcgtgtca ggaccgagtt ccacaacaac tcaccagttc gatcgatgtc ggcggcgttc 3720 aacgaaatgt acttcctgtt tgaatttcac gagcccattc caagatttcg cgaaaaacta 3780 cgcatcgagt tccaggagcg tacaagaact ttgcttggtg cgcctcgcca ggatcccgtc 3840 gttggccggt tacgcatagg acgacaatga cttgactaat ttttaaaatg tattgtaatt 3900 taccacgtat gttgtcatgt tatcaatctc gatgtaattt attttaaaag aagtgaggtt 3960 ttgtgccttt gtgagaatga ttctaatgtt tcaactcaca tcggcttttg ccctcgccaa 4020 ccaaaacatc catgtagacc cttggttcga tggattaata ataaataata 4070 // ID Gypsy-622_AA-LTR repbase; DNA; INV; 1266 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-622_AA_; KW Ty3_gypsy_Ele157; Gypsy-622_AA-I; Gypsy-622_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1266 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 1266 BP; 325 A; 317 C; 322 G; 302 T; 0 other; tggcagcagt ggccattgcc ggcacaccac catccaccag aaacccgctc ttgaccccac 60 cgttcgactt cggaggattc ccggggcaca cgtgaccgga agagggttgc gccgcgccat 120 tctgcgcgcc ctagacaggc aggccgccat taccgtacca gtatcgtcag cgtcagcaag 180 ttcataccgt gacatcaacc caaccgccaa gaaggaggga gacgaacgaa cgcacgtgcc 240 gcgtttgcgc gtaagtccgc ggatccagta aggaccaacg gaatggaccg ttgatcgcat 300 ccgcgatcgt gtacaggccg gagcagatgc caacgcagct ccgtgtccac cgctagtcca 360 cccccaccat ccgtaattcg cccgccactc gtaaatccat ggtagagtaa gtaccaggca 420 tgcatgtttg ttcgttgtgt cgtccatgcg catgagacag aacggcacaa atcgatagcc 480 cgtagccgtc tcgaacgtta tttgtcaatg attcattgag cacattatct acccaataca 540 ccgaaaccca cacccgtagc caccaaacac acacgcagga gaagagtccg tgcacggcaa 600 gcgccgtcga agagaaggaa tgaagtagga gagagtaaat gcacgttaga gtagaggaaa 660 gggaacagaa ggcctaggca aacatgtaga gggagagcgt tttgtaaata tatttatagt 720 aaaccgtatg cgttattcgt tacagctttg cgctgagctt ttatagctaa gttttgtgtg 780 caataaatgt tgtcagtggt tcccaatcta aattgtttga tgtagtcagt tgaagaaatt 840 agtgtcttgt gagggatttt catgtagttc gttttcgttt cgcttttctt gtagtttgtt 900 ttactgggta agtcggcccg aaatcgacct ttaaccattc tcaaattgga aacagtttgg 960 aatggagtga cgttcatact gggccagtga tgagaacacc ccgctccagt gattaaggtt 1020 tcttagaatt ctaggaattt gtatttggaa cttcagtggt attgatggcc tttagcctct 1080 accgagaagt acaacggcta atttaacgca gccagtcagc tttcttccca agttctttaa 1140 aattaacccc tcccttgaag ggtctcaggc cgggcaaacc tgcaaactgc tgggtggtct 1200 ccttttagga gtggcgctaa agccatccac ttacgaacgg gaagcgcgac gggctggcgg 1260 gttaca 1266 // ID R5 repbase; DNA; INV; 4814 BP. XX AC AY216701; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 21-JUL-2010 (Rel. 15.08, Last updated, Version 4) XX DE Girardia tigrina R5 retrotransposon, complete sequence. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; R5 non-LTR retrotransposon; GENIE_GT; R5. XX NM GENIE_GT; R5. XX OS Girardia tigrina OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Girardia. XX RN [1] RA Burke D.W., Singh D. and Eickbush H.T.; RT "R5 retrotransposons insert into a family of infrequently RT transcribed 28S rRNA genes of planaria."; RL Mol. Biol. Evol 20(8), 1260-1270 (2003). XX DR Genbank; AY216701; Positions 1 4814. XX FH Key Location/Qualifiers FT CDS 1124..4636 FT /product="R5_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="TTGRNLGQWSCYSRSIQQSNYSFKLSSTEVGELVEQS FT PAPLQSPQFSNNYNNLNINNNLYYSLNTFNQSNNLCCLVNIEFFPTQHLLG FT DIVNSGCINYMNNYNNFDNINLYINSNVLSYNNYNHSFLASPYTTNITEHA FT DINMHVQEVNMQQDNNTQHAITQQVSLQATSLQHTLDEMIVQFNTAVRLKK FT KHKVAKIFRGHNHRKDLPTLPAREQYKTKPKLAIREVLHRKTTATSSPSEN FT AIKAFFSSYSRPAELFTGQELLESSWFPVHPEDDFEFRIPGRDQIAKYIKF FT ASKSAAGLDWITYEDIKLGDPSGEILQPIFEYIVQNNICPSEGKASRTIMI FT PKPGKSDYSDPSSWRPITITSAVYRLLMKYLTWELYNWILLNQMLSRSQKS FT LGKFEGCHDHNAMLNMLIQDVRRQTNPSNPINKNKRLYIVFLDFTNAFGSV FT PLDTLMYVPQRFGLGTSALTLIKNLYLDNYTNVTCGESKIENVKLNKGVKQ FT GCPLSMLLFNIFINIIIRAIEAMPDVHGYPLGDMDIRILAYADDIALISDS FT HKDLQEMVYKAEYIGRILGLLFNPSKCALMDIPHDKKRTPPILVNGEMIKC FT VGKADPYKYLGTFRSWFRKLDIKELLQMMMDETKLITESNLHPHQKIHAYE FT TFIHSQLPFHLRHSRIPFSDFITNRKTNKTTNNSNDSEKSIQKAYDPESGQ FT LFLNTFALPSGCAKDFFYITKDAGGPQLTSGLDEYLIQSIMYIFRLLGSED FT PTLNSAIKHDLISHLNLKGFVNINFSQAISIFNSNFTDRTDHFSHLSRTEW FT ARLQLARKKLKSTLAIQTNVCLINGHLVLTLSLENNVLLIDSKEKGDVKKI FT HASLMGFLRLAHLIRLQKHGWSKLLFSATTHHEILNKRILNGHVPYKIWYF FT IHRARLGLLPTKLFSVSNLCRKCGGKKETMSHALVNCPMMQTLINERHDAL FT EISLVQILSSKFQGTVIRQKTYVNELRPDITMESDTQYYLVEVKCPFDTKM FT SFELRTQQTTDKYNIIIEILEDVHPGKEVRLVTFIVGTLGSWGPQNSDFLR FT DLGFSKDEIDQVKTRLMLQNINSSCEQWKRFVQYAPTITPGPIPDAESEDD FT QGTSDNGPTAATVQGPVIGDEEEELQIYDSGLDESSDDEPDPDDAELLFTI FT DIEQYLNSVITD" XX SQ Sequence 4814 BP; 1571 A; 1004 C; 902 G; 1337 T; 0 other; gtaggtaact atgactgcaa aataataatt ctacacctat tgttgataac tcatctcgtg 60 cgcaaacgga gcatgttatt tctaatcatt tcgtcacaca ggattcttct aattctgata 120 gtaatattat agatagagat aggaaccttg ttgatttaga tgcgtcaata acttctccta 180 ctattataca gccagaggat agtaagatat ctgaggatga ggacttcatc ttagtcaata 240 ggaaaaagag caaaaataag aaaaaatcta agaaaacaac tgaaaataaa aatgaaattc 300 ctattcaaaa gagtaaagat aagaaaaaga agtctaaaat taataccgaa aaactaactg 360 aaaatattac tacttctgaa ataccacttg aaattgctcc ttccatacct ttaccttcag 420 caagtacctc gggttctcaa caaccggcca atcctccaga agacgctact ctaagtgata 480 cggatctctt ccttacacag gatgatcccg atagtcttat tctttctgga agtactcaac 540 caacctttgt tgacctcaac ccttcacagc aatcggaact tccttcaaat actgacagcc 600 aaagatttga ggcgggtgaa acacccaaaa tcataacttc ttacagggat gaccttttct 660 actctacagt ccttcactac aactcagata caggttacgg tataagtgtt gacaatgggg 720 agcagaggtt tcgaattctt gctaggaatc ttgtcaggaa aaccaaggat aagttcccct 780 ctttatatgc tggacaagta attagacaca cagtcttctt caatcacttc aaccaggcat 840 actacgccaa taatataact gatagtaaag gtaatctaat tgagttttct gatgataagc 900 cttttcaaag tataccgact gacccaaaaa ctgaactaga gcaaattagg agagagagac 960 aacatctagt tgatagagct cttagacata atcagttacg ggaaacttat attttaaata 1020 aacttaataa taataatggg gggggtggcg aacatttgaa aaggaaaaag atcaaagtca 1080 atacggatga tgtctccagc aatgatggag acagaaaaca tagacgacag gaagaaacct 1140 cggacaatgg agctgctatt cccgctcaat ccaacaatca aattactcct ttaaactgag 1200 ttcgactgaa gtgggagagc tcgttgagca atctccagct ccccttcagt cgcctcagtt 1260 ctctaacaat tataacaatc taaatatcaa caacaactta tactatagtc tcaatacttt 1320 taatcagtct aataaccttt gctgtcttgt taatattgag tttttcccaa ctcaacacct 1380 tcttggtgat atagttaact cggggtgcat aaactatatg aataattata ataactttga 1440 taatattaat ttatatatta atagtaatgt attatcttac aataattata atcacagttt 1500 tctcgcttcc ccatatacta ctaacatcac agaacatgca gacataaaca tgcacgtgca 1560 agaagttaac atgcagcaag ataacaatac acaacatgct ataacacaac aagtctctct 1620 acaagcaaca tctctgcaac acacgttgga cgaaatgata gtccagttta acactgctgt 1680 caggttaaag aaaaagcaca aagttgcaaa aatctttagg ggacataatc atcgtaaaga 1740 ccttccaaca ttgcctgcta gggaacagta taaaactaaa ccgaaacttg caattagaga 1800 ggtacttcat cgaaaaacaa cagctacgtc ttccccttct gaaaatgcaa ttaaggcttt 1860 tttctcctcc tacagccgtc cagctgaact tttcactggt caggaacttc ttgaatcatc 1920 ttggttccca gtacacccgg aagatgactt tgagtttaga attccgggta gagaccaaat 1980 agcgaaatac atcaagtttg ctagtaaatc agctgctggt cttgactgga tcacgtacga 2040 ggatattaag ttaggcgatc cgtccgggga aattctccaa cccatttttg aatatatagt 2100 acaaaataac atatgcccat ccgaggggaa ggctagtagg accattatga ttcccaaacc 2160 gggaaaaagt gactattcag atccttcttc ttggcggccc attacaatta ccagcgcagt 2220 atacagactt ctcatgaaat atcttacatg ggagctgtat aactggattc ttcttaatca 2280 gatgctgtcc aggagccaaa agagtttagg gaagtttgag ggatgtcatg atcacaacgc 2340 aatgttgaac atgctcatcc aagacgttag gagacagacc aacccgtcta atccaatcaa 2400 taagaataag aggctataca tagtcttcct agactttacg aatgctttcg ggtcggttcc 2460 gttagatact ctcatgtatg tccctcaacg ctttggctta ggcacctctg ctttaacgct 2520 gattaaaaac ttatatctag ataactacac aaatgtaaca tgtggggaaa gcaaaataga 2580 aaacgtaaaa ttaaataaag gggttaagca aggctgccct ctatctatgc tgcttttcaa 2640 catttttatc aatattataa ttagggcaat agaagctatg ccagatgtcc atggataccc 2700 acttggagat atggacatcc ggatactggc atatgctgat gatattgctc taatatctga 2760 ctcccacaaa gacctgcagg aaatggtcta caaggcggaa tatatcggtc ggattcttgg 2820 actactcttc aacccgtcaa aatgtgcact tatggacatt ccgcacgaca agaagaggac 2880 gccgcctatc ctcgtcaacg gtgagatgat caagtgtgtt ggaaaggccg acccatacaa 2940 atatcttgga acctttagat cctggttccg gaagctggat ataaaggagc tcctccagat 3000 gatgatggat gagactaaac tcatcaccga gtcaaatcta catcctcacc aaaaaatcca 3060 cgcgtatgag accttcattc acagccagct cccatttcac cttagacaca gccgaattcc 3120 gttctcagac ttcataacaa acagaaaaac aaacaaaaca acaaacaatt caaacgactc 3180 agaaaaatct atacagaaag cctacgatcc ggaatcagga caattattcc tcaacacctt 3240 cgcccttcca agtggatgtg ctaaggattt cttttacatt acaaaagatg caggtggacc 3300 tcaactcaca agcggactgg atgagtactt aatccaatca attatgtaca tcttccgact 3360 attgggcagt gaggacccca ccttaaactc tgcaataaaa catgatctca tttcccactt 3420 aaatttaaag ggttttgtaa atattaattt ttctcaagcc atttcaatct ttaattcaaa 3480 ttttaccgac cgaaccgatc acttttcaca tcttagccgc actgaatggg caagacttca 3540 attagctcgg aaaaaattga agtcaacctt agccatccaa actaatgtct gtctgataaa 3600 tgggcatctt gtcttaactc tttcgctaga aaacaacgtt ctgttaattg atagtaaaga 3660 aaagggggat gtcaagaaga tccatgcatc cctcatgggg tttcttaggt tagctcacct 3720 tatcagactg caaaaacatg gatggtcaaa actgctcttc agtgcgacca ctcatcacga 3780 aatactaaat aagcgtatct tgaatggtca cgtcccttat aagatttggt actttattca 3840 tagggcgcgg ctggggttgt tgcctactaa actctttagt gttagtaacc tttgtaggaa 3900 gtgcgggggg aagaaagaga ccatgtcgca tgctttggtc aactgtccaa tgatgcagac 3960 cctcattaat gagagacatg atgctcttga aatctccctt gtacaaattc tttcttctaa 4020 atttcagggt acggttataa ggcaaaagac ctatgtcaac gagttaagac ccgatatcac 4080 aatggaatcg gatacccaat attatcttgt tgaggtaaaa tgcccctttg acacgaagat 4140 gagttttgaa ttgagaacac aacaaactac tgataaatac aacattatta ttgaaatatt 4200 agaagatgta caccctggga aggaggtgcg ccttgttacg tttattgtag gcaccttagg 4260 ctcatggggc ccgcagaact cggacttttt gagagatctg ggattctcca aagacgaaat 4320 cgaccaggtg aagacgcggc tgatgcttca gaatatcaat tcctcctgcg agcagtggaa 4380 aagatttgtg caatatgcac ccacaattac acctgggccg attccagacg cggagagcga 4440 ggacgatcag gggacgagcg acaatgggcc aacagctgct acagtgcaag gaccggtgat 4500 tggcgatgag gaggaggaac ttcaaatcta cgattccggc cttgacgagt ccagcgatga 4560 tgaacccgac ccagatgatg ctgaattact tttcacaatt gacatagaac aatatttgaa 4620 ttctgtgata acagactgat ccgtgtgttt gtgtcgtatg attgtttccg tgtgtgtcta 4680 tatttttctt ttttatactt tcaattacct cgttgtaatg ttataacttc atatggaata 4740 tatgtaattt agtttagttt agttagttta gtttagttta gtttagttta gttagtttag 4800 ttagtttagt tagt 4814 // ID P-20_HM repbase; DNA; INV; 3132 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-20_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3132 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 366-366 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 155..2701 FT /product="P-20_HM_1p" FT /translation="MVNKCAAPKCTSGYLSNKHKQIASFHFPTDQNLNQLW FT IRFVNRSDWTPTKHSVLCELHFEDKYILRGDKCTLNWLLKPVPSKYSCELL FT QKKSSLPVLKTSRSPPKLRIFQKDQLEIFKKNDTITNVHNLNEAVAPPGFQ FT FKRFEKGVVFYNIVFNEQTYFPTILESITVDVDLNVRLQYKGTPLPLPSWF FT VKGHNAKLNKISMLENFPSYMRNYTLENHDTILDEIKDRELYQPQGRPPYS FT AEMIRYALLLRYTSLQAYKLLLKKLPLPSISLLNKVQQGGIDSIKALKLLR FT EKGKISPDCIMIIDEMYLQKAAQYQGGEYVGADEEGNLYKGIVAFMIIGLK FT ESIPFIVQAIPEVTFSGNWLSQKIADNLKNLLGAGFTVRGIVTDNHSTNIN FT AFSSLVTMFKSESKYFIIHPQNHGKKTYLFYDSVHLVKNLRNNLLNGKKFV FT FPDFKYHNAYINIDCPSGYISWDDLFKIYHNDNDLKGKLRKAPKLTYEALH FT PGNNKQSVPLALAIIHETTIAASKSYYPNRKDVSSFLTIFNTWWTIANSKQ FT RFSPNIIGNAIVLGDNKMIFYQALADWINQWSLSPAFTLTTQTASAFVNTL FT RAQALLVEELLNDGYDYVMTARLQSDPIERRFSQYRQMSGGRFLVSLREVL FT NSERILSCCSLIKEHINFWEENLIPDIHPSLTILDELFENRIEEIMESSLD FT YNSQEVAITIAGYVAKKLSKRSKCENCKACLKGGDIDLENNSYLSLLSRGG FT LFVPSKQLAEFVCSVFAILDLIENEISSLSIPVIKSSTYLLHKYAPKSKFV FT CENHENWGITFSTKILVNVFFNNKQKLSKDSVSNNNLKTFKKRQRKK" XX SQ Sequence 3132 BP; 1151 A; 472 C; 469 G; 1039 T; 1 other; ggcctactat attcacggcc gtatatctta catttttgag ggggaaatta agaggccggt 60 tttgaaagtt ttagatgcaa aatacgctaa aatattattt ttaaaaatat tcttctttct 120 gaatttacta atttttaaag aatttatatt raaaatggtc aacaaatgtg ctgcaccaaa 180 atgtacatcg ggatacttaa gcaacaaaca caaacaaatt gcatcttttc attttccaac 240 agaccaaaac ttaaatcaac tttggatacg atttgttaat cgttcagatt ggactccaac 300 aaaacattct gttttatgtg aacttcattt tgaagataaa tatatactta gaggtgataa 360 atgcacactg aattggcttt taaaacctgt tccttctaag tattcttgtg aacttttgca 420 aaaaaaatca tctttgccag ttttaaaaac ttctcgaagt cctcctaaat tgagaatatt 480 ccaaaaagat cagttagaaa tatttaagaa aaatgataca attacaaatg tacataattt 540 gaatgaagca gtagcaccac cagggtttca atttaaacga tttgaaaaag gtgttgtatt 600 ttataatata gtatttaatg aacagacata ttttccaact atattagaat caataacagt 660 tgatgtagac ctaaatgtac gtttgcaata caaaggaaca ccgcttccat taccaagttg 720 gtttgttaaa ggacataatg caaaacttaa taaaattagt atgctagaaa attttccttc 780 ttatatgaga aactacacat tagaaaatca tgatactatt ttagatgaaa ttaaagatag 840 agagctttat cagccacaag gcaggccacc atactcagca gagatgattc gatatgcatt 900 acttttacga tatacatctt tacaggcata caaactacta cttaaaaagc tccctttacc 960 atcaatatct ttattaaata aagttcaaca aggaggtatc gattctataa aagctttaaa 1020 acttctacgt gaaaaaggaa aaatttctcc tgattgtatt atgattatag atgaaatgta 1080 tttacaaaaa gcagcacaat atcaaggagg agagtatgtt ggtgctgacg aagaaggtaa 1140 cttgtacaaa ggaattgtcg cattcatgat aataggacta aaagaatcta ttccttttat 1200 tgttcaagca attccagaag tcacattctc tggaaattgg ctatcacaaa aaatagctga 1260 caatcttaaa aatctattag gtgctgggtt cactgtgcgt ggtatagtta cagacaatca 1320 ttccactaac attaatgcat tctcatcatt agttacaatg ttcaagtctg aatcaaagta 1380 ttttattatt cacccacaga atcatggaaa gaagacatat ttgttttatg atagtgtgca 1440 tttagtcaaa aatcttagaa ataatttatt aaatggtaaa aaatttgttt ttccagactt 1500 taaataccat aatgcatata tcaatattga ttgtccatca ggatatatta gttgggatga 1560 tttatttaag atttaccata atgataatga tctgaaaggt aaacttagaa aagctccaaa 1620 gcttacatat gaggcattac atcctggaaa taataaacaa agtgtgccac ttgcgttagc 1680 aattatacat gaaactacaa ttgcagcttc aaaaagctac tatccaaatc gaaaagatgt 1740 ttcaagtttt ttaactattt ttaacacttg gtggactatt gctaattcta aacaaaggtt 1800 ttcaccaaat attattggca atgctattgt attaggagac aataaaatga ttttttacca 1860 agctctggct gattggatca atcagtggtc tctttcacct gcattcaccc tcacaactca 1920 aactgcatca gcatttgtaa ataccttacg tgctcaagca ctgcttgttg aggagctcct 1980 gaatgatgga tatgattacg taatgacagc tcgtttacaa agtgatccaa ttgagagacg 2040 tttctcacaa tacaggcaaa tgagtggggg aagatttctt gtaagtttaa gagaagtttt 2100 aaattcagaa agaattttgt catgttgttc tttaattaag gagcatataa acttctggga 2160 agaaaatcta ataccagata ttcatccttc cttgacaata cttgatgaac tgtttgaaaa 2220 tcgaatagaa gaaattatgg aaagttccct tgattataat agtcaagaag tggcaataac 2280 tatagcaggc tatgttgcaa aaaaattgtc aaaaaggtct aaatgtgaaa attgcaaagc 2340 atgtttgaaa ggtggagaca ttgatcttga aaataattct tatctatctc tcttgtcacg 2400 tggtggacta tttgttccat ccaaacagct tgcagaattt gtttgcagtg tatttgctat 2460 tttggattta attgagaatg aaatttcatc tctatcaatt ccagtcatta aatcatcaac 2520 atatctgtta cataaatacg caccaaaaag caaatttgtt tgtgaaaatc atgaaaattg 2580 gggaatcacg ttttcaacaa aaattcttgt taatgtattt tttaataaca aacaaaaact 2640 atcaaaggat tcggtttcca ataataactt aaaaacattt aaaaagaggc aaagaaaaaa 2700 gtgaaatcaa aattgccaat atgctagttg catagcattt gtttatatag tttaataaaa 2760 tttacctatt gtttgatttc tttgaactac ttcttgacat tttttttaaa gcgttaaaat 2820 atacataaat atatattata tataaatata atatacatag atataaatgt atattgattt 2880 gaataatttc tgtgacaatt ctccaacaaa tattacacaa atgtctttaa atctttgata 2940 tctattttat cttaaagagt atatatatta aaaacaactg ccacttttac attgaagaac 3000 tttaagttaa aaattaagaa taaagttgct ttttaaaaat ataatattct gcgcagccat 3060 tttttcagta aaaatcggcc tcttaatttc cccctcaaaa atgtaagata tacggccgta 3120 atatagtagg cc 3132 // ID Gypsy-216_AA-I repbase; DNA; INV; 1997 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-216_AA_; KW Gypsy-216_AA-LTR; Gypsy-216_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1997 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1035-1035 (2011). XX DR [1] (Consensus) XX CC Positions [963-1439] - Integrase core CC 'TTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 222..1958 FT /product="Gypsy-216_AA-I_1p" FT /translation="MVDPSNEITPKFNRTQLLLHPVMYHSRKTTPAESRLH FT SFELETLAIVYCLQRFRTYLFGLSFKVVTDCSSLKHTLQKKDVNPKIARWA FT LYLEQFDFEVIHRPGLRMQHADALSRVNVCLIEEDQCSSTLFENALYVSQL FT RDPEVAKLKRAVELGTLKDYEIRDAILYKVVGVKSLLYVPADMIPSVINKF FT HNDMGHFGVDKVCNLIKRTYWFPKMRDQVQNHAKACITCIAYNPRNKRYDG FT HLSTVEKPTVPFEILHVDHLGPLEKGKGKNEYILAVVDAFSKFIKLYATKT FT TKTSEVMKSLKNYFHVYSTPRVMISDRGSAFTSDAFGKFVEDHGFKHIKVA FT TACPKANGQVERYNKTMLPLISKLVEESGNSWDSVLSDAEFLLNNTVNRAT FT GCTPSTLLFGVDQRRRIVHDLTDYLQNMENPDERDLIKVRETAGEQMKKQS FT NYNKRTHDRHCRRNTNYEMGDLVMLRRVNVVGDRNKLKPKFRGPYEVKKVL FT NRNRYVVGDIEGYQVSGKRFEGIFDPLNMRLYQKVNCPKEEKVEDEESDVE FT YQDIEYLNEEEVNLGSDVEYEDVEYLDEAIEV" XX SQ Sequence 1997 BP; 641 A; 363 C; 468 G; 524 T; 1 other; gttttcagaa gtgggatcct cttggtcctg tcggaaacta cttcggacct cggaacgtgc 60 tggtgacgat acatccgacg ttggtggaga cgattttccg gtttgttcca tgacgatgcc 120 gataacgaga agtatggctg cccggcagac ggaaagcgct gtgaacgacg agaagtttta 180 cgatgtgaaa agcgaagaca aagctgataa gaagaagcca gatggtggat ccatccaacg 240 aaataacacc gaagttcaac agaacccaac ttttgctcca tccagtaatg tatcatagtc 300 gtaaaactac acctgccgag tccagattgc atagtttcga attagaaact ttggcaatcg 360 tatactgctt acagcgtttt cggacttatc tattcggact cagttttaag gtagtgaccg 420 attgctcttc attgaaacat accctgcaga agaaagacgt caatcccaaa atagcgaggt 480 gggcgttata tttggaacaa ttcgattttg aagttattca tcgacctggt ttgaggatgc 540 agcatgcgga cgcgctatca agggtaaacg tatgtctaat tgaggaagat caatgttctt 600 caaccctttt tgaaaatgcg ctgtatgtat cacagttgag agatcctgaa gtggctaaac 660 tcaaacgagc ggtcgaatta ggtactttga aggattacga aattcgtgat gcaattctct 720 acaaagtagt tggtgttaag tcgttactct acgttcctgc agacatgatt ccctcagtta 780 ttaacaaatt tcataacgat atgggccact ttggagttga caaggtgtgt aatttgatta 840 aacgcactta ttggtttccg aagatgcgag atcaagttca aaaccatgct aaagcttgta 900 tcacatgcat agcatacaat cctcgaaata aaagatacga tggacatctt tccacagttg 960 aaaaacctac ggttcctttc gaaatattgc acgtggatca cctgggacca cttgagaaag 1020 gcaagggaaa aaacgagtat attttagctg tcgtggatgc cttttcaaaa tttatcaagt 1080 tatatgcaac taagactacg aaaacatcag aagtgatgaa gtctttgaaa aattactttc 1140 atgtgtattc aacacccaga gttatgatct ccgacagagg gtctgccttc acttctgatg 1200 cgtttggaaa atttgttgaa gatcatggat tcaagcatat caaagtggct acagcgtgcc 1260 caaaggctaa cggacaagtt gaacgataca ataaaaccat gttgccactg atcagcaagc 1320 ttgtcgaaga atctggaaat tcctgggatt cagtgctaag tgatgccgaa tttctgttga 1380 acaacacggt gaaccgagca acgggatgta ccccgtctac tttactgttt ggtgtagatc 1440 aacggaggcg catcgtacac gatctgacag actacttgca aaacatggaa aatccagatg 1500 aacgagatct catcaaagtg agagaaacag ctggtgaaca aatgaagaaa caatcgaact 1560 ataataaacg tactcatgat cgtcactgca gaagaaacac taactatgaa atgggtgact 1620 tagtaatgtt gcgaagggtt aatgttgttg gggacagaaa taaactcaaa ccaaaatttc 1680 gtggaccgta tgaagttaag aaagtgctca atcggaatcg gtatgtkgtg ggagatatag 1740 agggatatca agtgtcgggt aaacgtttcg aaggtatatt tgatccgttg aatatgaggc 1800 tctaccagaa agtgaactgc ccaaaagagg aaaaagttga ggatgaggaa tctgacgtag 1860 aataccaaga tatagaatac ttgaatgagg aagaagtaaa tctagggtca gatgtcgagt 1920 atgaagacgt agaatattta gatgaggcca ttgaagtata agttttattt cagataatta 1980 gtgcaggatg gccgagc 1997 // ID Mariner-24_HM repbase; DNA; INV; 2498 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-24_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2498 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1958-1958 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 588..2276 FT /product="Mariner-24_HM_1p" FT /translation="MPRNYIKKKESRRKTYAPNLLQNAISKVKNKEMTSYA FT ASKEYGVPRSTIARHLLGIKCSKVGEGRPNRLSADAEKDLVSLLSAFSDFG FT FGLDKKQFMKLVQTYIELNGFQKKFVNGVAGDQWFCNFLNRWQKEISIRTP FT ELLTLTRALACNKTVIDAWFLLLKATLEKCDIMNRSAQILNCDESGFCTNP FT INKKIICKRGIKNPVSIAPGSGKEQFTVLATISAAGRNFPPFILFAGKNLY FT KEWMTGGPNGSLYGVTKNGWMETEVFTEYFNHLVLWLKDTPKPVLLIFDGH FT ISHISLETVKKAVENHITILCLPAHCSHLLQPLDVGVFRQVKHVWREILAN FT WYTESRLKVVGKSVFPSLLKKLYDTSFKPAHLVQSFAKTGIFPFDPSAIAS FT DKLNPSEIYEHPFSQKTNNYNNNVPQNIVLDTNVNDQNDTCENFSIESTNK FT YNSQKTIISVGKDLNNEKINQLSISNVDCSTGNKIIREIVKEQLMATKGDG FT RSYNKRPTKKIKRHYGGECLTSEDAFKRLNDEENECFKRKIKQQGKIQNKK FT TNLLEITNTKELKMK*" XX SQ Sequence 2498 BP; 921 A; 342 C; 392 G; 843 T; 0 other; gggtaaactg gggtaaagtg aactaggggt aaagtgaact atttttttaa actaaaaagt 60 tatcatgttt aaaaaacttt ttgttaagta atttagcatt atttttgctt aatacaacat 120 tttgaaattt tttaaataaa aaaattaata aaatatagcc tttgttttga tgtaatagcc 180 attttaaaaa gcgagtttgg cggaagtttt aaaattacga tctcaaactt tgctatgggt 240 tttagtaaat tcatgatttg ttgttattaa tgcgaacatc agttatataa tttatataca 300 ttttaaagat aatataatgc tgttaaaaaa aaattgtagt ttttatcata atataaaaaa 360 gctaagcttt tagtttgtaa atcaatttag ttcactttac cccttatttt tctgcctatt 420 gggggtaaag tgaactagtc tttttcttat gttttttcat caaatatttt tagtagaact 480 gaatatataa attgtaatta cttaactaaa ttgtttactc ttaaaataag tttttttttt 540 tttaaatatt cttaaaatta ttttcagaaa tataatatat tgaaataatg ccaagaaact 600 acattaagaa aaaagaatct cgaagaaaaa catatgcacc taatttgttg caaaatgcca 660 tttcgaaagt taaaaataaa gaaatgactt cgtatgctgc atcaaaagaa tatggtgttc 720 caagaagtac aattgcaaga catcttctag gcataaaatg ctcaaaagtt ggtgaagggc 780 gaccaaaccg attatctgct gatgctgaaa aagatcttgt ttcactttta tctgcatttt 840 ctgattttgg atttggatta gataaaaaac aatttatgaa acttgttcaa acgtatattg 900 aattaaacgg atttcagaaa aaatttgtta atggtgttgc tggtgatcag tggttttgta 960 attttttaaa tcgttggcag aaagaaatat ctataagaac tccagagctt ttgacattaa 1020 caagagcctt agcttgtaat aaaactgtaa ttgatgcttg gtttttatta cttaaagcaa 1080 cattggaaaa atgtgacatt atgaatagat ctgcccaaat actaaactgc gatgaaagtg 1140 ggttttgcac taacccaata aataaaaaaa ttatttgcaa aagaggtatt aaaaaccctg 1200 tatctattgc tcctggatca ggaaaagaac agtttactgt actagcaaca atatcagctg 1260 caggaagaaa ttttcctcca tttattttat ttgctgggaa gaatttgtat aaagagtgga 1320 tgactggtgg accaaatggt tccttatatg gggtgacaaa aaacggttgg atggaaacag 1380 aagttttcac tgaatacttt aaccatcttg ttttatggtt gaaagacaca cccaagcctg 1440 ttttgcttat ctttgatggt cacattagtc atatttcttt agaaactgtt aaaaaagcag 1500 ttgaaaatca tataactatt ttgtgtttgc cagcacactg ttcgcatttg cttcagccac 1560 ttgatgtagg agtctttcgc caagttaaac atgtatggcg tgaaattctg gccaattggt 1620 ataccgagtc aaggctgaag gtggttggga aatctgtttt tccatcttta ttaaaaaaac 1680 tttatgatac ctcctttaag ccagctcatt tagtacaatc ctttgccaaa actggaattt 1740 ttccctttga tccatctgct atagcatctg acaagttaaa tccttctgaa atctatgaac 1800 accctttttc tcagaaaaca aataactata ataataacgt ccctcaaaac atcgttttag 1860 acaccaatgt caatgatcag aacgatactt gtgaaaattt ttctattgaa tcaacaaata 1920 aatacaattc acaaaaaact ataatatcag taggaaaaga cttaaacaat gaaaaaataa 1980 atcaactatc aatatcaaat gtagattgtt ctactggaaa taaaatcatt agagaaatag 2040 tgaaagaaca gttaatggcc acaaagggtg atggtcgttc atacaacaaa cgacctacaa 2100 aaaaaataaa aagacactat ggaggtgagt gtcttacatc agaagatgca tttaaaagat 2160 taaacgatga agaaaacgaa tgctttaaaa gaaagatcaa gcagcaaggc aaaatacaaa 2220 ataaaaaaac taatttatta gaaataacaa acacaaaaga gttaaagatg aagtagttca 2280 ctttacccca atgtgggggt aaagtgaact agtggacatg ttttttttga atatatattt 2340 aatattgtaa aagtagcttt taatcttagt attttgtttt atattgtacc taatacttta 2400 aatggtataa attttgaaaa ctgcaaaatg ttttggcttt tattttgaaa atatttcctt 2460 taaatatcaa aaacagttca ctttacccca gtttaccc 2498 // ID Gypsy-1-LTR_LG repbase; DNA; INV; 566 BP. XX AC . XX DT 18-MAR-2009 (Rel. 14.03, Created) DT 18-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; 4-bp TSD; TG-TA; KW Gypsy-1-LTR_LG. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-566 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Lottia gigantea."; RL Repbase Reports 9(3), 728-728 (2009). XX DR [1] (Consensus) XX SQ Sequence 566 BP; 183 A; 92 C; 100 G; 191 T; 0 other; tgtaacaaaa ctgagatccg cccgtattca gatttttgtt agtaggttaa tgtgttattg 60 cctacggcgc tttggatcat gaaattggta tagaaattaa tagcccatga attaggactc 120 gtggtaggac tcgtggccgt tatataatgg tcattcatat tcaaaaaggt acaatttaat 180 ggctaatttt ttttaagtac caacattaaa agattaacag attatcattc ctgtcgatat 240 gacgtataca gtcgagagta taaatatgag aagactactc tctgtcttca gtacttaaca 300 ggaaacgtta ttactcataa agtcttatct ttggctacct ttaagactag taggcattcg 360 aaattattct cgactagaaa tacactttaa attattctct tgatgctatt ctcttgatgt 420 tattctattg aatgtcaact ttaagaacac aacgtgaagt ctgatattta tataaagata 480 tttgaaaact gttattcaac ttatttacgg acggggacca ccagcattgg atcgcaaaaa 540 ggggatatcg acaacacttt gtcata 566 // ID SMARN4 repbase; DNA; INV; 793 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of non-autonomous Mariner-type family of DE repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW SMARN4. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-793 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1892-1892 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 793 BP; 278 A; 112 C; 120 G; 283 T; 0 other; cacccaatcc tgtttttaca cggtcttttt tgcacgattt tgaaataaca cggttgaaga 60 aatataaaca atcctgattt tacacgattt gatttttttt acacggacaa aaattctact 120 tctaaaaaaa ttcacaaaaa attttatgaa atattttgtg atttggttaa aatgtacata 180 ggccagccat ggtgtgttaa caaaattaat aagtttttat tttttattta cttttatagt 240 tttttaattt gaggtgaatt atttacgagt tgaatacatc tctgttttcg aaaaacaaaa 300 aatatttaat cttgtatggc tcgctacaaa gagctgtaca gagaactgtc cagactagga 360 gcacaaccaa aaattacgga atttgctatt aaaataagta acactgcgtg aaactgtctc 420 tgataaccga ttagcactaa cagcatttcc tctgactaaa gtgatttaaa gccaacgcag 480 aaaaagagaa cgcggctact atctgaaacc gactcagaat aaaaataatt aatattaaaa 540 accattttta tgttcgtgaa tacaaatgtg attttttaaa tctgtattaa aaatttaata 600 gctatataat atatgtatat tagattaata aaatatgtat ggatatatgt actaaactac 660 aaaacttgat tttgattctt taattattag agtttttatt tgtggaacct atctcatgtt 720 tttgtactac tgtgtgtctt ttttacacgg tggtccgtgg aacgtatcta ccgtgtaaaa 780 acaggattgg gtg 793 // ID Transib-8_HM repbase; DNA; INV; 2975 BP. XX AC . XX DT 30-JAN-2008 (Rel. 13.01, Created) DT 30-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2975 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 8-8 (2008). XX DR [1] (Consensus) XX CC Transib-8_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome just a few CC million years ago (they are ~3% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of ~50 copies; it codes for a 636-aa Transib CC transposase. Like other Transib transposons, Transib-8_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 729..2636 FT /product="Transib-8_HMp" FT /note="Transib transposase." FT /translation="MKYFDIYAFIMHKPNNLQISDSGIVDKLIDAMNLDCN FT KQQLGTIITHLRDKWRKSYCSMARFEEKNKKWLQSHVPENLNVTPIPIGRP FT NKTNFDDYSTKTKKHKVSHLTEMYSASALQFASEIKRRKENPTEKTSKKDC FT RMSKDEALCLLLDARLTKASYQIIRNQAVEMGVDIYPPYNDVRDAKLLCYP FT ADISVSDFSAEIKLQALVDHTANRICKAQYDVLSTEGTPDRLSLRYKIGFD FT GATGQSIYKQISQGTVIRNLNNEQALFITCLVPLDLCGFSNGKQYPAWRNP FT RPSSTMYCRPLRFKFEKETAAVSKSELDLIRVEISNLQDTEVDINNKKIKI FT HHIIQITMIDGKVHTALSDVTNSGQCCTICGASPKAMNDIEKMVLKKGMAG FT SYSFGLSSLHAWIRFFECLLHIGYKITLKKWQARTAEDKSKVKALKQQIQN FT RFKDEMGLIVDVPKSGGSGTSNDGNTARRAFQNAEKFAEILNIDVELILNL FT HIILATISCGLDIDTDKFEAFCLKTARLYVHHYPWYYMPQSLHKILIHGSE FT IIKSLALPIGMYSEEAQEARNKDFQNYREHFARKSSRVKTNQDLLNRLLCT FT SDPFIASLRKTAGSHTKTKNLPAGVIDLLRSSTDDISL" XX SQ Sequence 2975 BP; 1051 A; 452 C; 494 G; 978 T; 0 other; cacagtgggc caggtggtgg aaaaaagtat aaaaaaaata ttctcaaaaa tgacattgtt 60 atataccatt ttatagctta aacatccggc ttttataaaa tgtatatatt tctgtataac 120 tatcatgccg tttgcatttt aagctgttaa taaaaaaggt attttttttt caagtttttt 180 ttttttttgt tgttttttaa tttaataata tttttcgtgt tagtaataag cataaaattg 240 gtattaatta ttaaaacttc atttttttat tgaataatta aataaaacta agtattatct 300 atcaaatgag ctattgcaaa attattttac acgtaagcta aatcaaataa attgatttct 360 aatgatggcg tatttaaagc acttttttta gtgaaaattt cacttaaata atgaaatact 420 tttgcgctcc aaggtgctcc agaaatttga gacttaaaat atcttgccca atatgtacat 480 tatcgatata tattttacct attgtcggcg ggtgtcagct gcgtctctat taatgatttt 540 tacaagtttc atctttgggg caggggggcc aacattctgg cttttctttg tgttcccagg 600 ctccaatcat cgcaattttt tagcaaatca tatttttatc acattaaaat aaacatataa 660 atatttggag tttgttttat ctccaaatgt ctcataaata gtatattttt tgtatttgta 720 ggctgaaaat gaaatatttt gatatctatg cattcattat gcataagcct aataatctac 780 aaatatctga ctctggcatt gtggacaaac ttattgatgc tatgaatttg gattgcaaca 840 agcagcaatt aggaactata ataacgcatc tacgagataa gtggcgcaag tcatattgca 900 gcatggctcg atttgaagaa aagaacaaaa aatggctgca aagccatgtt cctgaaaatt 960 taaatgtaac accaatacct attggaagac caaataagac aaactttgat gactactcta 1020 ctaaaaccaa aaaacataaa gttagtcact taactgaaat gtattctgct tctgcgttac 1080 aatttgcttc agaaattaaa cggagaaaag aaaacccaac tgaaaaaacc tcaaaaaaag 1140 attgtagaat gtctaaagat gaagctcttt gtcttctttt agatgctaga ttgacaaaag 1200 cttcctacca aattattcga aatcaagcag ttgaaatggg agttgatata tatcctccat 1260 ataatgatgt tcgggatgca aagctgttat gctatcctgc agatatatct gtctcagact 1320 tttcagctga aataaaactg caagctttgg tcgatcatac agcaaataga atatgcaagg 1380 cacagtatga tgtccttagc actgaaggca ctccagatag actttctctg agatacaaaa 1440 taggctttga tggtgccaca ggacagtcta tatataaaca aatatcgcaa ggaacagtaa 1500 taagaaattt aaacaatgag caggcacttt ttattacttg cctagtgcca ttggacttgt 1560 gtggtttttc aaatggaaaa cagtacccag cttggcgaaa tccacgacca tcatcaacaa 1620 tgtactgtag acctctgcgt tttaaatttg aaaaagaaac ggcagcagtt tctaaatcag 1680 aattggatct cattagggtt gagatttcaa acttgcaaga tactgaagtt gatataaata 1740 ataaaaaaat caaaatacat catataatac aaattactat gattgatgga aaagttcata 1800 ctgctctttc agatgttaca aattcaggtc aatgctgtac tatatgtggg gcaagtccta 1860 aagcaatgaa tgatattgaa aaaatggttt tgaaaaaggg tatggcaggt agttattcat 1920 ttggtttatc atccttacat gcttggatac gattttttga atgtttatta catattggat 1980 ataaaataac tttgaaaaag tggcaggcaa gaacagcaga agataaatct aaagtaaagg 2040 ctctgaaaca acaaattcaa aatagattta aagatgaaat gggcttgata gttgatgttc 2100 cgaaatctgg tggtagtgga acatcaaatg atggaaatac ggctcgaaga gcatttcaaa 2160 atgcggaaaa gtttgcagaa attttaaata ttgatgttga actgattcta aatttgcata 2220 ttatattggc aacaatttca tgtggtttag atattgacac tgataaattt gaggctttct 2280 gtttaaaaac agctcgactt tatgttcatc attatccttg gtactatatg ccacaatctc 2340 tccacaaaat tttaatccat ggttctgaaa ttatcaaaag tttagcatta cccattggta 2400 tgtattctga agaagctcaa gaagcacgca ataaagactt tcaaaactat cgtgaacatt 2460 ttgcaagaaa atcatctcgt gtgaaaacta atcaagattt gttaaatcga cttctgtgta 2520 catctgatcc atttattgca tctctgagaa agacagcagg aagtcacacc aaaacgaaaa 2580 acttaccagc tggagttatt gacttattga ggtcatccac agatgatatt agtttataaa 2640 tgtatctttt gtatctttat aattttggaa atgtctgtac ctatatgaaa gtgtgactgt 2700 aaattaaaac aattagttat gctcaatcct attagaatca tcaaattaat ttgcatacca 2760 attttagaaa catttggcta aaaataaccc ttaggaacta agcataaata tgtgttgcta 2820 tgcaaatggg gttgccttgg caaccacacc aaaaaacaat gctcatttaa attataatca 2880 tccaattaat ttatatacta atttttgggg cctatggatg aaaggaaaaa aagttatgac 2940 ttttttccaa aaatatggga ttctggccca ctgtg 2975 // ID Gypsy-38_CQ-I repbase; DNA; INV; 7860 BP. XX AC AAWU01014491; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_CQ_; KW Gypsy-38_CQ-LTR; Gypsy-38_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7860 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 455-455 (2011). XX DR GenBank; AAWU01014491; Positions 84087 91946. XX CC Positions [4667-5146] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 318..2288 FT /product="Gypsy-38_CQ-I_2p" FT /translation="MARLPCAEYLNAEQVDYELGIRNQLTPETDALDLVGK FT QTLLRTLFKKDVKNWQNYHSHYMIGQEIDGIKKHVALMYSALLKTGRTPKI FT EARLLHYWFRTKRCIAMSEEEKAQRREMVRELEGIMRKFEIEPPSPAQGQL FT KHWLNPGTIPDPGNEVGADGNPAGGSGTSGNGTPAPGSGYGTPRTPGNHNT FT GGNGGNGGGTGNEDGVMDRRLAHLERMLSGLTVTVTGMAEQQKSQLNQSQG FT GNGNALGNPTGSIQKEKPRLRVRRSGLFSPVAITDSSDDDGDFESRKRHKD FT TVKDSYQRGQAGKNRSDDDDLSSLLSYGSRGRRRHAERLDDRDLLYRIEKW FT RLRFSGDNRSMSIEDFLYKLDTISRREGVSQEQIFRHAHMFLDGTASDWFF FT TFVDEMDNWETFEKLVRIRFGNPNQEQGIRSRIQGRKQHRNEKFIEFATDI FT ERLNKQLSKPLSTHSKFQTLWQNMHSHYRTRIAPGTQIKSLKDLTEACQRI FT DAVDTSLNPSGEIAHQRMVNNVDVEESENDSEASADVNVVRTRQARDNRYT FT ARRREQPEQEGEREKVFRQQPPQHQRTERNQVNPSHLGQLRHNQSHGGNGE FT PSGVVKCWNCLEEGHIWRECSRPKVIFCYGCGNLGRTITCCDRCATRNFQF FT SSRNNQGN" FT CDS 2240..5533 FT /product="Gypsy-38_CQ-I_1p" FT /translation="MRDKKLPVQQPEQSGKLISGCKKGNPSNPQPELIPNA FT RTNPTEFDPLLRVHHIRVSVTKCPHIRVKIFDLEMEALLDSGAGISVINSP FT SLAEQYGLKIQPAAVKVSTADGTEYKCLGFLNIPFTYKNITRIIPTIIVPQ FT ISRSLILGADFWESFKIQPMIDVGNGPESIETLDIATETQLCFNIEPSGEL FT PKVDEKEPDDTLDIPAFDTPGEPAPESIETEHELTPEERAELIEVVKDFEF FT TAPGKLGRTHLIQHEIILKEEAKPRNQPVYKCSPFIQKEIDAEIERFKELD FT AIEECYSEWTNPLVPVRKSNGKIRVCLDSRRINAMTVKDAYPMQNMQDIFH FT RLGRAKYFSIIDLKDAYFQIPLKEESRNFTAFRTSKGLFRFKVCPFGLTNA FT PFTMCRLMNKVIGFDLQPQVFVYLDDIVIATEDLEQHLRLLRIVAERLRNG FT GLTISLEKSRFCRKQVMYLGYLLTERGVCIDQSRIQPILDYARPKSVKDIR FT RLMGLAGFYQRFIEGYSRITAPITDLLKKEKKKFTWSEEAEKSFNELKSVL FT TSAPILANPDFTKPFTIESDASDNAVGAALVQEQEGETRIIAYFSKKLSST FT QKRYSAVEKECLGVLLAIDNFRHYVEGTRFKVVTDARSLLWLFKIGAESGN FT SKLLRWALRIQSYDIQLEYRKGKSNITADCLSRSIESLAVFQIDPDYEELA FT AKIASSPTEYADYRVIDGRIMKYVKSSSRQQDPRFAWKQFPMPEERREIIQ FT QEHNKAHFGFEKTVAAIKQRFFWPKMNEQIRKFCRECLPCQTSKAGNVNVT FT PPMGSQKPVEYPWQFVTLDYIGPLPASGKNRCTYLLVATDVFSKFVLVQPF FT REAKAGPLADFVENMIFRLFGVPEIILTDNGSQFVSKIFRGLLESYHVTHW FT LTPAYHPQVNNTERVNRVITTAIRATLKKEHKHWADDIQAIAAAIRTAVHN FT STKYSPYFIVFGRDQVSDGREYSRIRDTHAPEATKETEVSAQRKKLFEDVK FT VNLAAAYQRHAKQYNLRSSSNCPKYAVGEKVLKKTFDLSDKGAGFCKKLAP FT KYEPCVVKKVLGSNTYELETDEGKRIGVYFADQLKKLKSAD" XX SQ Sequence 7860 BP; 2121 A; 1717 C; 1964 G; 2058 T; 0 other; tttggcgccc aacgagattt ggcacgtaaa ttttcttaag cgaattgatt tttttctggc 60 tattttttct tttgtttgat tttttttttg gttctcgcag tttgattaac ttttgcgtaa 120 tttttttctt gtaaataatt tgtagctttt aatttaattt tttgagtttt gaggcaattt 180 atttggtttc ataaattctt cccgtttttc ttattcataa tttgtagttt taataatttt 240 gtagttttag aaatttgtag atcttaagaa tatttttttt tgctgcacca ttccatttaa 300 ataataatat tttcaaaatg gctcggcttc cttgtgcaga atatttgaat gcagagcaag 360 ttgattatga gctagggatc cgcaaccaac tgacaccgga gactgacgct cttgatttgg 420 taggcaaaca aacacttttg agaacactgt tcaaaaaaga cgttaaaaac tggcaaaatt 480 accattctca ctacatgatt ggtcaagaaa ttgacggaat taagaaacat gtcgcgttga 540 tgtacagtgc gttgctcaaa actggtcgaa ccccaaaaat tgaggctcgg ttgttgcact 600 actggttccg aacgaaacgt tgtatagcta tgtctgagga ggagaaggct caaaggagag 660 agatggtaag ggaactagag gggattatga gaaaatttga aattgaacca ccttcaccgg 720 ctcagggaca gctcaagcat tggttgaatc ctgggacaat tccagatcca ggaaatgagg 780 ttggcgcaga cggtaacccc gccggcggtt caggaacgtc cggaaacggt actccagcac 840 caggaagcgg atacggtaca ccacgtacgc cgggaaacca caacaccggt ggaaatggag 900 gtaacggtgg aggaaccgga aatgaggatg gggtcatgga tagacgcctt gctcatctgg 960 aacggatgtt gtccggattg accgttacgg tcacgggtat ggccgagcag caaaaatctc 1020 agctaaacca atctcaaggc ggcaacggga atgccctcgg gaatccaacc ggatcgattc 1080 agaaggagaa accgagattg cgagtccgcc gcagtggttt gttttccccg gtcgcaatca 1140 cagactcgag tgatgacgac ggggactttg aatcgaggaa acgtcacaag gacacggtaa 1200 aagattctta tcaacgtggg caagcaggca agaaccgaag tgacgatgac gatctgtcct 1260 cgcttctcag ttacggttcg cgtgggcgga gacgacacgc tgagcggttg gatgaccggg 1320 atctgctgta ccggattgag aagtggaggt tgcggttctc aggggacaat aggtccatga 1380 gcatcgaaga ttttttgtat aaattggaca caatttccag gcgcgaggga gtttctcagg 1440 aacagatctt ccgacacgct cacatgtttc tggatgggac agcctcggat tggttcttca 1500 cgtttgtcga tgagatggac aattgggaga cattcgagaa gctggtcagg attcgatttg 1560 ggaatccaaa ccaggaacaa ggtattcgga gcagaatcca agggcggaaa caacaccgga 1620 acgagaagtt catcgagttt gctacggaca tcgaaaggct caacaagcag ttgtcaaagc 1680 cgctatcaac tcacagcaag ttccagacgc tctggcagaa catgcactcg cactaccgga 1740 caagaatcgc acccggaacg caaatcaagt ccctgaaaga tttgacagaa gcgtgtcaac 1800 ggatcgacgc tgtggacacg agcttgaatc cgtccggtga aatagctcac caaaggatgg 1860 tgaacaacgt ggacgttgaa gagagcgaga acgattctga ggcgtcggcg gatgtaaacg 1920 tggtacgaac ccgtcaggcg agggacaacc ggtacacggc taggcggcga gaacaacccg 1980 agcaagaagg agagcgagaa aaggtattcc ggcagcaacc tccgcagcat cagaggacgg 2040 aacggaatca ggtgaaccca agccacctag gacagctacg acacaaccaa agtcacggcg 2100 gaaacggtga accatccggc gttgtgaagt gctggaactg tctggaagag ggccacatct 2160 ggagagagtg ttcgcgtccc aaggtgatct tctgctacgg ttgcggaaac ctaggccgaa 2220 ctatcacgtg ttgcgataga tgcgcgacaa gaaacttcca gttcagcagc cggaacaatc 2280 agggaaacta atttcggggt gcaagaaagg gaatcctagc aacccccaac ctgaattgat 2340 tcccaacgca agaacaaatc ctactgaatt tgacccactt ttaagagtac accacatccg 2400 cgtgagtgtg accaaatgtc cacacatccg cgtgaagata ttcgacctcg agatggaagc 2460 gctgctggat tcgggcgctg gaatcagcgt gatcaactcg ccgagcttag ctgagcaata 2520 cggcctgaag atccagcccg cagcggtaaa agtcagcacg gcagatggca cggagtacaa 2580 atgtttaggc tttttgaaca ttccgtttac ctacaagaac atcacgagaa ttatccctac 2640 gattattgtc cctcagattt ccaggtcgct gattctaggt gccgactttt gggaaagctt 2700 taagatccag ccgatgatcg acgttggcaa cggaccagag tccatcgaaa cgctggacat 2760 cgcgaccgaa acgcaactct gttttaacat cgaaccgtct ggtgagcttc cgaaggtcga 2820 cgagaaagag ccagacgata ccctggacat tccagcgttc gacacgcctg gcgaaccagc 2880 tcccgagagc atcgagacgg aacacgagct gacgccggaa gagcgggcgg aactgatcga 2940 agtcgtcaag gatttcgagt tcaccgcgcc cgggaagcta ggacgcacac atttaatcca 3000 gcacgagatc atcctgaagg aggaagcaaa gccgaggaac cagcccgtct acaagtgttc 3060 gccgttcatc cagaaggaga tcgacgcgga gatcgagcgg ttcaaggagc tcgacgccat 3120 cgaggagtgt tactccgagt ggacgaaccc gttggtgccg gtccggaaat cgaacggcaa 3180 gatcagggtt tgtctcgatt cacggcggat caacgcgatg acggtcaagg acgcgtatcc 3240 gatgcaaaat atgcaggata tttttcaccg gctgggccgc gcgaagtact tttcaattat 3300 cgatttgaag gacgcttact tccagattcc gttgaaggag gagagtagga atttcacggc 3360 ttttcggaca tccaaggggt tgttccggtt taaggtctgc ccgtttggtc tgaccaacgc 3420 accgttcact atgtgccggc ttatgaacaa ggtcataggt ttcgaccttc agccgcaggt 3480 cttcgtctat ctcgacgaca tagtgatcgc gaccgaggat ctggaacagc atctgcggtt 3540 gctgcgaatc gtggccgaac ggttacgtaa cggcggtctc acgatctctc ttgagaagtc 3600 ccgtttctgc aggaagcagg ttatgtacct gggctatctg ctgacggaac gaggtgtgtg 3660 cattgaccag tcgcggatcc aaccgatcct ggattacgct cggccgaaga gcgtgaagga 3720 catccggcga ctcatgggcc tcgcgggctt ctaccagagg tttatcgagg ggtacagccg 3780 cattacggcg ccgatcacgg atcttttgaa gaaggaaaag aagaagttta cgtggtccga 3840 agaagcggag aaaagtttca acgaactgaa gtcggttctg acgtccgcgc cgattctggc 3900 caatccggac ttcaccaaac cgttcacgat cgaatctgac gcttcggaca acgccgtagg 3960 agccgcgttg gtccaggaac aagagggaga aacccgaatt atcgcgtact tcagcaaaaa 4020 gctcagcagt acgcaaaaac gatattcggc tgttgagaag gaatgtttgg gggtactctt 4080 agcgatcgac aacttccggc attacgtcga ggggacacgc ttcaaggtgg tcaccgacgc 4140 gcgcagcttg ttgtggttgt tcaagattgg ggcggagtcc ggcaactcga aacttcttcg 4200 ttgggccctc cgcattcagt cgtacgacat ccaacttgaa taccggaagg gcaagagcaa 4260 catcaccgcg gactgcctgt cgcgttccat cgagtctctg gccgtttttc agatcgatcc 4320 ggactacgag gaactcgctg ctaagatcgc gtccagcccg acggagtacg cagattaccg 4380 ggtgatcgac ggaaggatca tgaagtacgt caagagctcg tcgcggcaac aggacccacg 4440 gttcgcttgg aagcagtttc cgatgcccga ggagaggaga gagatcatcc agcaggagca 4500 caacaaggcg cacttcggat ttgagaagac ggtggcggcc atcaagcagc gatttttctg 4560 gccgaagatg aacgagcaga tccgcaagtt ctgtcgcgaa tgcctaccgt gccagaccag 4620 caaggccgga aacgttaatg tcacgccgcc catgggatcg cagaaacctg tcgagtaccc 4680 gtggcaattc gtgacgctcg attacatcgg cccgttgccc gcgtccggaa agaaccgctg 4740 cacctatctt ctcgttgcga ccgacgtgtt tagtaaattc gtcctcgtcc aaccattccg 4800 agaggccaaa gccggtcctc tagcggattt tgtcgaaaac atgatcttcc gactgttcgg 4860 cgtacccgag atcattctca cggacaacgg gtcccaattc gtctctaaga tcttcagggg 4920 tttgttggaa tcgtaccacg tgacgcactg gttgacacct gcctaccacc cgcaagtcaa 4980 caacacggag cgcgtgaaca gggtcatcac gacggcaata cgagccacgc tgaagaagga 5040 gcacaagcat tgggcggatg acatccaagc cattgcggcc gcgatccgga ccgctgtcca 5100 caactcgacc aagtacagtc cgtatttcat cgtgttcgga agagaccaag tttcagacgg 5160 aagggagtac tcgcggatca gagacacgca cgccccggag gcgaccaagg agacggaagt 5220 ctcagcgcag aggaagaagc tgttcgaaga cgtgaaggtg aacctggcgg ccgcgtatca 5280 gcgccacgcg aagcagtaca acctgcgatc tagctcgaac tgcccgaagt acgccgtagg 5340 ggaaaaggtc ctcaagaaga cgttcgacct gtccgacaaa ggagcagggt tttgtaagaa 5400 gctggcccct aagtacgagc cgtgcgtggt gaagaaggtt ctgggaagca acacctacga 5460 gttggaaact gacgagggaa aaagaattgg agtttacttc gcagaccagc tgaaaaagct 5520 gaaatctgct gactgatttt gacgaaattt gtaaatatta tgacccaaca agctatgtaa 5580 cttttatcac cattcaaaag tgaccacaag ttcgaaaatg aacgaatttt caaaaatacc 5640 attaatggga aatttaaaac cacgaaaatc caccgatttt gactcctgtc gaggcagatg 5700 atttgataat gggaggttaa ttcgatcaat gaaacccttg aacgcgggaa agcttttcat 5760 tccacaaata agattgagtt aggtcgatga ccggcccaga gtagtgaact cgattcatga 5820 ctgtttaacg ggagactaaa cgatgatatc gaatgagtcg tggcctgaat caggcgaaag 5880 aaaaaaaagc gagtaccaaa cgatcgtatt gaccgaacag catcgaaaat tcacgccaaa 5940 acgacggaat ttgtacaaat ttgtaaatat ttagacataa tttagttagt aaaagtttag 6000 ttagtagtgt acctacatta attctagttt tagtacttat tctattgaat tttgttagga 6060 aacttgaaaa ctattcacgg ttttgcacag gggacagtgt gcagctgttt gctgtagttt 6120 tttcgaagct tttccatctt gttatcatct ttttgtaaat attttccatc cttttccatt 6180 gtgtacatat tttccatctt gatgatattt tgttgcattt ccagttgata atttttccat 6240 tttgcatatt ttttcgatga atttcctcca gtagtggtca aggttgatgt tcacctaaaa 6300 ctgacacatt ctggcattag ttttttgtaa atatttagtg aattgtgcta attagcactc 6360 aaattacctt ttttcattga attttccctc gcagtaagtt tttccatcgg aggtaaatag 6420 ttgcttccat aatatcagca taatttctaa ttggtgttga ttatggtcga tgaagtagtt 6480 ttgtttacgt tgagaggttt gctttaccga tcttcttctc ctttgcagtt cttcttgatc 6540 gatcctcctt tgtatttagt cgtaggtaaa cattgcattt ttttccattc gcgccgcatg 6600 atcacgttgc agagtgagcg aaagctctac tttgccgaca agaatctgcg ctatttttag 6660 tttcttcagt tcttttgatc cagttcatcc aggttctatg gccgccttct gcagcctgca 6720 cctgaaaata aaatagaaaa aaaacacctt tttacaacca aacacaatta gcgacacatt 6780 caatttcact ttccttttcc gtttttccat cactagagca gttttcctag caccataaca 6840 ttttttccgc gacaaaaaca cagatttccc gacacacatt ttttttcgtt ccacgcactt 6900 tgtttacatc ccgttgactg gtagactggt aacagggtga acgtgaaaat gaacacttgt 6960 tcattcagga tggacgaggg gtacgtcaat gcagtttttt ttatgttttg ttgtttcatg 7020 aacagtttcg taattaatgc aaattaatgt tagagattgt tgtgaatttg agttttaaac 7080 ctcatggaaa cgtttttttg gggatttccc gaggtagttt cctgttgttt agaacagacc 7140 aatttttttt aagacaaatt tgatgatgtt ctgttttctt gggaaatttt cttctgaggt 7200 ttgaacactc gttaaatttg tcggttattt tttcacttaa ggtgatagat tgaatagagc 7260 gactctttgg ttttttggtg tttatcactg aggtagaagg tcgttgctga tttttttggg 7320 aatgttttta ggatgagatt ttttttatga ttgttagaat gtttgcgatg ccagctacca 7380 ggaaactttt caaagtcagt tctcaatatt ttttgcagta attcagttca agatcaacag 7440 atgatttatg ctattgtttg atattttttt gaagttgcgg ttttttttga tatcctttcc 7500 tcatgtaata tattttttag ctcttcttta ttcttgtaca tattttttgc cgaagttttg 7560 tagatattat tatagtttta tttcgatgaa ttttatatga agttttggat ttgcttagta 7620 gcttctgctg gagaccggag tctaaacacg ttgatggtct agtagcaggc gtagtttttg 7680 aaactacgtg ctggaaacag agagtgagcc actggcgatg accctatatg accccggaag 7740 cgaaatgtta ttcagagatt cagataaatt tcagcttcga aaatttgatt ggttttgggt 7800 aagaaatttt caatttcagt tccaaaccaa tcaaattttc gaaaatttag tagggaatga 7860 // ID Copia-38_CQ-I repbase; DNA; INV; 4348 BP. XX AC AAWU01006555; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-38_CQ_; KW Copia-38_CQ-LTR; Copia-38_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4348 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 379-379 (2011). XX DR GenBank; AAWU01006555; Positions 22496 26843. XX CC Positions [1481-2017] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 113..4330 FT /product="Copia-38_CQ-I_1p" FT /translation="MPNGDGSGAGAQGAVQPEARGQNYGAGGHPPIERLVG FT RDNWRTWRFAVRTYLEIEDLWDVIEPAQGAADAKKDRIARGKIIFFLDPSI FT FHHAENAKTAKELWTKLCGLYEDTGLTRQWALLHKLITTNLASSGSMEAYV FT NRMTTTANQLIGIGFPLDEKWIGMMLLAGLPLDYRPMVMGLENSGIAITGD FT AIKTKLLQEDDLPVEPAGGQAYVSKGRHKDNKQGKPQPSAAPSKSKGSKCH FT RCGKAGHFARDCTAKELVRNDRDGKKKAFCAVLSVLDSDDANDWYFDSGAV FT KHLTRNEALLENPRTTSGSIIAANNQAMKIVAEGTMTLQPTCYDDTIEVEN FT VELVPELAVNLLSVGKIVDQGHTVVFNKDGCEVIDADNSVFATGSRSNGLF FT KLDQIKPRALACSSRETAELWHKRLGHMCMKNLIKLKNGLVNGIKFEGNEC FT GRGCIVCAEGKQTRLPFSHAGTRAAGILDLVHSDICGPMEEKSLGGCRYYL FT SFTDDKTRKIFVYFLRTKSKEEVLQCFKDFHVTAERQTGRKLKVIRTDNGL FT EYTNRLFQDYLSGMGIRHQTTVEYTPEQNGLGERVNRTAVERARCMLFEAY FT LPKPFWAEAVAAAIYLINRSPTKGIEMTPEEAWSGRKPDLSHVRVFGTIAM FT AHVPKQKRRKWDKKAVECILTGYEEDTKAYRLYDKKTRSIINSRDVTFVSE FT GVVKRGVTAEPRASGPERTCVRLDFWEYVPADAVQEVVPPEAAEPQKEPDE FT EPEDAFESAESSDDDVSFAEDDPLPALPPQSSSEPPKEGLRLSGRERFLPG FT KYKDFKLGNKGLPVTKFADKTTTMTNAAARPEDPAAAPPSCSYQRSVADDG FT QQRLMLCDRGRPPQGKYSDNPDESPDTGIDTPQPLLDDPFTHQQALARDDA FT EHWKQAMKDEYEALISNGTWELCDLPEGRSAIKCKWVYKTKLDVNGDIDRY FT KARLVIKGYSQRKGVDYEETYAPVVRYSSLRYLFALAARLDMKVDQMDAIT FT AFLQGELSEEIYMEQPPCFVDGKKRSKVCRLKKALYGLKQSSRVWNKKLDA FT ALRKFDLVCTDYDPCVYTKVRGDKMLFVAVYVDDVLIYSNCDRWKTEIKAQ FT LSREFKMKDIGPAKYVLGIRVSRSKDEIALDQEKYVEAILSRFQMTDCKPV FT STPMNVSEKLTRDACPSTDQEKERMKSVPYQEAVGCLMYLAQSTRPDICYA FT VNILSRFNSNPGEKHWCGVKHLFRYLRGTSKYRLTYKKGGVSKIEGFSDAD FT WAADLEDRKSITGYVFTAQGGAVSWSCKRQQTVALSTCKAEYMALSVAVQE FT ALWWRRLRGRIESEEAITIHCDNQSAIAVARNGGYHPRTKHIDIRHHFIRD FT ALEKGEVVVDYVRSEEQTADGLTKPLCRTKMEICRYQLGLQHP" XX SQ Sequence 4348 BP; 1088 A; 1106 C; 1349 G; 805 T; 0 other; agaggttatg ggcccatagt gggttccgga acgctgaaga acttgaaaaa tctacctcaa 60 gaagtttaac tttcaactga agcttatgaa gaagtaggaa aatctagaga agatgcctaa 120 cggagacgga agcggtgctg gtgctcaagg tgctgtccaa cctgaggctc gtggccagaa 180 ctatggagcc gggggccatc cgccaatcga gcggctggtc ggacgtgaca actggcggac 240 gtggaggttc gcagttcgca cgtacctgga gatagaggac ctgtgggacg tgattgaacc 300 agctcaaggc gcggccgacg cgaagaagga ccggatcgct cgaggaaaaa ttattttctt 360 cctcgatccg tcgattttcc accacgcgga gaacgcgaag actgccaagg agttgtggac 420 gaagctgtgc ggcctgtacg aggacactgg actaacccgt cagtgggcgc tcctgcacaa 480 gctcatcacg acgaacctgg cgagcagtgg gtccatggaa gcgtacgtga accggatgac 540 gactacagcg aatcagctca tcggaatcgg cttcccgctg gacgagaaat ggatcggcat 600 gatgctgctg gcgggactac cgctggacta tcgtccgatg gtcatggggc tcgagaattc 660 cgggatcgca atcactggtg acgccatcaa aaccaaattg ctccaggaag acgacctgcc 720 cgttgaacca gctggaggac aagcgtacgt gtccaaggga cgccacaagg acaacaagca 780 gggtaagcct caaccgagtg ctgctccaag caaatcgaag ggatcgaagt gtcaccgctg 840 cggcaaagct ggccatttcg ccagggactg cacagccaag gaactagtgc gcaacgaccg 900 cgacggcaag aagaaagctt tctgtgccgt gctctcggtg ctggattcgg acgacgccaa 960 cgattggtac ttcgattcgg gagctgtgaa gcacttgacc aggaacgagg cgctgctgga 1020 gaacccccga acgacgtctg gaagcataat tgcggcgaac aaccaggcga tgaagatcgt 1080 tgcagaggga actatgacgt tgcagccgac gtgctacgac gacaccatcg aggtcgagaa 1140 cgttgaactg gtaccggagt tggcggtcaa tctactctcc gtcgggaaga tcgtggatca 1200 aggccatacc gtcgtgttca acaaagatgg atgtgaggtg atcgatgcag acaacagtgt 1260 cttcgcaact gggagcagat cgaacgggct gttcaagttg gaccagatca agccgagagc 1320 gctggcgtgc tcttcccgcg aaactgctga gctatggcac aagcgtttgg gacacatgtg 1380 catgaagaac ctgatcaagt tgaagaacgg tctcgtcaac ggaatcaagt tcgaaggcaa 1440 cgagtgtggc agaggctgca ttgtctgtgc agaaggtaag caaacgcgat tgccattcag 1500 ccacgcaggg acccgtgcgg ctggaatcct ggacctggtg cactccgata tctgcggccc 1560 gatggaagag aagtcgttgg gtggctgtcg ctactatctc agcttcaccg acgacaagac 1620 gcgcaagatt ttcgtttatt ttctgcgaac caagtccaaa gaagaggttc tgcagtgctt 1680 caaggatttc cacgtgactg ctgagcggca aacgggccgc aaactgaagg tgattcggac 1740 ggacaacggc ctggagtaca cgaatcggct gttccaggac tacctgtcgg ggatggggat 1800 ccgtcaccaa accaccgtgg agtacacccc tgagcagaac ggactcggag aaagggtcaa 1860 ccggactgcc gtggaacgcg ctcgatgcat gctgttcgaa gcttatctgc caaaaccgtt 1920 ttgggcagag gctgtggctg ctgcgatcta cctgatcaac aggtcgccga cgaagggcat 1980 cgagatgact ccggaggaag cgtggtcagg tcgcaagccg gacttgtcac acgtacgagt 2040 gttcggcacg atcgcgatgg cccacgtacc gaagcagaag cgtcgcaagt gggacaagaa 2100 agccgtggaa tgtatcctca ccggctacga agaggacacc aaggcgtacc gcttgtacga 2160 caagaagacg aggtcgatca tcaacagccg agacgtgacg ttcgtgtcgg aaggagtggt 2220 caaacgtgga gtgaccgctg aaccgagggc tagtggaccg gagcggacat gtgtgcggct 2280 ggacttttgg gagtacgtgc cagcggatgc cgttcaagag gttgttccac cggaagccgc 2340 ggaaccacaa aaggagcctg acgaagaacc tgaagatgca ttcgagtcgg ccgagagcag 2400 cgacgacgac gtgtcctttg ctgaagatga cccgttaccc gcgctcccgc cacaatcatc 2460 ttcagaacca cccaaagagg ggttgaggct cagtggtagg gagcgctttc ttccaggcaa 2520 gtataaagat tttaaacttg ggaacaaagg cctacccgtg acgaaatttg cagataagac 2580 gacgacgatg accaacgcag ccgcgaggcc cgaggatcct gccgctgcgc ccccgtcgtg 2640 ctcttatcaa agatcagttg ccgatgacgg tcaacaacga ttgatgcttt gcgatagggg 2700 gcgcccgccc caaggcaagt attctgacaa ccctgatgaa tcacccgata ctggtatcga 2760 cactccacag ccgcttctgg atgatccgtt tacgcatcaa caagccctgg ctcgggacga 2820 cgctgaacac tggaaacagg ccatgaagga tgagtacgaa gccctcatct cgaacggaac 2880 gtgggagctg tgcgatctac cggaaggacg ttcggcgatc aagtgcaagt gggtgtacaa 2940 gaccaagctg gacgtgaacg gggacatcga ccggtacaaa gcgaggctgg tcatcaaagg 3000 atactcgcag cgaaaggggg tagattacga agagacctac gccccggtcg ttcgctatag 3060 ttcgctgcga tatctcttcg cgctggctgc ccgtctggac atgaaggtcg accagatgga 3120 cgcaattacg gctttcctgc aaggagaact gtcggaggag atctacatgg agcaaccccc 3180 ttgcttcgtg gacggcaaga agcggtccaa ggtctgccga ctcaagaagg cgctgtacgg 3240 gctgaagcaa tccagtcgtg tctggaacaa gaagctggat gctgccctga ggaagttcga 3300 cctggtatgc accgattacg acccgtgtgt gtacaccaag gtgcgcggcg acaagatgct 3360 gttcgtcgct gtgtacgtgg atgatgtcct gatttactcg aactgtgacc gctggaagac 3420 ggagatcaag gcccaactca gccgcgagtt caagatgaag gacatcggac ctgcgaagta 3480 cgttctgggg atccgggtga gccggagcaa ggacgagatt gccctggatc aggagaagta 3540 cgtggaagcg atcctgagtc ggttccagat gacggactgc aagcccgtca gtacgcccat 3600 gaacgtgagc gagaagttga cgcgcgacgc gtgcccatcg actgatcaag agaaggagag 3660 gatgaagtcc gtgccgtacc aagaagccgt tgggtgcttg atgtacttgg cccagagcac 3720 cagaccggac atctgctacg ctgtcaacat actcagccgc ttcaactcga acccaggtga 3780 gaagcactgg tgtggtgtga agcatctgtt ccgctacctg cgcgggacct cgaagtatcg 3840 tctgacctac aagaaaggag gagtttcaaa gatcgaaggt ttttcggacg ctgactgggc 3900 tgctgatctg gaggaccgga agtcgatcac ggggtacgtg ttcaccgccc aaggtggagc 3960 cgtgtcctgg agctgcaaga ggcaacaaac cgttgcactc tcgacctgca aggccgaata 4020 catggcgttg tcggttgcgg tgcaggaggc gctctggtgg cgccggttgc ggggacgtat 4080 cgagtcggag gaggcgatta cgatccactg cgacaaccaa agtgcgatcg cggtcgcacg 4140 gaacggtggc tatcatccga ggacgaagca catcgacata cgacaccatt tcatccggga 4200 cgcgctggag aaaggcgagg ttgtcgtgga ctacgttcgg agcgaggaac aaacagcgga 4260 tggtctaacc aaaccgctgt gccgtacgaa gatggagatc tgcagatatc aacttggtct 4320 gcagcatccc taggttgagg aggagtgt 4348 // ID CR1-120_AAe repbase; DNA; INV; 4568 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-120_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4568 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1208-1208 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 109..1497 FT /product="CR1-120_AAe_1p" FT /translation="MPCASNNCTVSDDKHVWYCHGSCGKKFHAACVGAQRN FT HEQSILSYMLPLCYDCQKRFVLEANFEKFFNQQQETNKLNNLLMESNYKIS FT SNLNKFDISEGFESIEMLLNDLKSDFKAIAAKHNTTAEEIAKSSAQVNELA FT LRCKNGKQDDVAIKNHLTSLFDISMQATKNSIIEYVDVLTSDLTRELKNIC FT TEFEKVSSLIIDMASHCSEHNASHANSSSADIIDELKQISNAVNSLEKKST FT SSPPLESHPNLGTEMMSAQAENNSGWRLLGTQKVWKASWAEYDARQLRRLN FT QQKNADKARKRRKRNAVRMSNTNSYNSVRNNNNNNNGNNNDDRSRNKYKGI FT LNNNKTVFNKTNIGSKPTCSYYNYNNNNKCNSQQAATSSLPPDKDLLAAAK FT VKFSGHSTANQRGIKFQRGETLNPYPVDHQFPQTSSSSSPNWTLGPALRNS FT FSQHCSSCSCERSCFRPT" FT CDS 1476..4496 FT /product="CR1-120_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="TFVFSSDLTHSTEADHDEAGNVSIDIENSFKNDELPS FT ISSENRSTEILVYCQNFNRMKSASKMSIISKNILSSSFSVILGTETSWNDS FT IKNEEIFGSNYNVFRNDRDLVLTQRKSGGGVLVAISSKFNSELIASPKFKD FT FEHVWVKTEIIGETHIFASVYFPPDRATKSTYEVFFKAAEEIVSYFPPEYK FT IHIYGDFNQRNVDFIPDYDNESILLPVVGENETLQFIFEKIAFLGLNQINH FT VKNQQNCFLDLLLTNICEDFCVVESVSPLWKNEAFHTAIEYSIFVHSNLNP FT NDCSYEYVFDYNMANYDNIRYKINRVNWQSNLKEEVNIESAVQNFYKILSE FT VIQEEVPLKRKRRGFGSKNPVWFNKQIINLKNRKQKAHKTYKAHNSQANLE FT KYLSICDQLNLTISNALTDYNIKTENEVKSCPKNFFNYVKSKINSNNFPSK FT MTLDDKEGNNSEDICNLFATFFQETYTNFSDNDRDFGYFSYLPEISRDVGV FT NQIHVQDITTGLNDLDATKGSGPDGIPPVFIKKLAVELTTPLFWLFNMSLE FT SGEFPKEWKKSFLVPIYKSGKKSDIRNYRGIAIISCIPKLFESIINKCIFG FT QIKHRITNSQHGFFKGRSTTTNLLEFVDYSIMAMDKGNHVEALYTDFSKAF FT DRLDISMLIFKLSKIGIEKKLLKWIESYLTDRQQIVRFNDKKSKPIQVTSG FT VPQGSHLGPLLFILYVNDVSFLLKKIRILIYADDMKLFLEIMKDDDYETFH FT NEILLFNTWCFKSLLKLNVEKCNLITYSRKRNTPNLTIRLGNQQVKKCDKI FT RDLGVILDSKLTFVEHYNTITHKAGNMLNFIKRFGYHFQDPYTIKTLYVAY FT VRSILEYCSVVWSPYLKSHEERIESIQKQFLLYALRKLGWTVFPLPSYKAR FT CMLINIQTLKQRREYAMVAFVNDIVSQRIDSTKLLSKLNFYAPNRQLRHRN FT LFTLDRYRTNYAKFSPLNQMMSVYNEHCEMIDLSMPRTKLKSYFNSLGNLR FT I" XX SQ Sequence 4568 BP; 1616 A; 838 C; 795 G; 1319 T; 0 other; agtctttgta caggtcggac gtatttcata tttgctccgc ttcaaataaa tcgcgcgcgt 60 ttgttttttt tattttgttc ttttgtctaa ctgctgcgta agtgaatcat gccgtgcgcg 120 agtaataatt gtaccgtcag tgacgataaa catgtgtggt attgccatgg gtcgtgcgga 180 aaaaaattcc atgctgcgtg cgtcggcgct caaaggaacc atgagcagag tatcctgtca 240 tatatgcttc cgctctgtta tgactgtcag aagcgatttg tactagaagc gaactttgag 300 aagtttttca atcagcaaca agagaccaac aaactcaata atttattgat ggagtcgaat 360 tacaaaattt catccaattt aaataaattc gatattagcg aagggtttga gagcatcgaa 420 atgctattga acgacctgaa aagtgatttc aaagccatcg cagcaaaaca taacactaca 480 gctgaagaaa ttgcgaaaag ctctgctcaa gtgaatgaat tggcattgcg ttgcaaaaat 540 ggaaaacaag acgacgttgc cataaaaaac cacttgacat cactgttcga catatcaatg 600 caggccacca agaacagcat aatagaatat gttgatgtac tcacttccga tttgacacgg 660 gagcttaaga atatttgcac tgaatttgaa aaagtcagta gtttgattat tgatatggct 720 agccattgtt cagaacataa tgcgagccat gcaaattcat catcggcaga tatcatagat 780 gaattgaagc aaatttcaaa cgccgtaaat tctcttgaaa aaaagagtac atcttcaccc 840 ccattggaat cacacccaaa tttggggaca gagatgatgt ccgcgcaagc tgaaaacaat 900 tctggctggc gactgcttgg aacgcaaaaa gtttggaagg ctagttgggc agaatacgat 960 gcacgccaac ttcgccgctt gaatcaacaa aaaaacgcag ataaggcaag gaaacgtcgc 1020 aagcgaaacg cagtgcgaat gtcaaatacc aattcataca actcagtccg caacaacaac 1080 aacaacaaca acggaaacaa caatgacgac agatcccgga acaaatataa agggatactc 1140 aacaacaata aaacggtctt caataaaacc aacataggca gcaaacccac atgtagctat 1200 tacaactaca acaataacaa caaatgcaat agtcaacaag cagcaacaag ctctcttcct 1260 ccggataagg atcttttggc agcagcgaag gtgaaattct ctggtcactc tactgcaaat 1320 caacgaggta tcaagtttca acgaggagaa acactaaacc cgtacccggt ggatcaccag 1380 tttcctcaaa caagctcctc ttcttcccca aattggactc tcgggcccgc tctgagaaac 1440 tctttttctc agcattgttc gtcgtgttcc tgtgaacgtt cgtgttttcg tccgacctga 1500 cgcactctac agaggcagac catgacgaag ctggtaacgt ttcaatagat atagaaaata 1560 gttttaagaa tgacgaacta ccatcaattt cttcagaaaa tcgttctact gaaattctgg 1620 tatattgcca gaatttcaat cgcatgaaga gcgcatcaaa aatgagtata attagtaaaa 1680 atatattaag ctcctcgttt tcagttattt tgggcactga aacaagttgg aacgatagca 1740 ttaaaaacga agaaattttt ggtagcaact ataatgtatt taggaatgat cgggatttag 1800 ttttaaccca aaggaagtca ggtggtggag ttctagttgc aatttcttca aaattcaatt 1860 ctgagcttat tgcttcacca aaatttaagg attttgagca cgtttgggta aaaacagaga 1920 tcattggaga gactcatata ttcgcgtcag tttactttcc tcctgatcga gctactaagt 1980 cgacgtatga agtgttcttt aaagcagccg aagaaatcgt ttcatacttt cctcctgaat 2040 ataaaattca tatttatggt gatttcaatc aacgaaatgt agattttatt ccagattatg 2100 ataatgagag cattcttctt ccagttgttg gtgaaaatga aacattacaa tttatatttg 2160 agaaaattgc attcctagga ctcaaccaaa ttaatcatgt caaaaaccaa cagaattgtt 2220 tcttagattt acttctaacg aatatatgtg aggatttttg tgttgtggaa tctgtttctc 2280 cgttatggaa aaatgaagcg ttccacacag cgatcgaata ttccatattt gtacacagta 2340 accttaatcc caacgactgt agttatgagt atgtttttga ttacaatatg gctaattatg 2400 acaatataag gtacaaaata aatagagtaa actggcaatc caatctgaaa gaagaagtaa 2460 atattgaaag tgccgtacaa aacttctaca aaatattgtc ggaggtaatt caggaggaag 2520 ttccactcaa acgtaaaaga cgaggttttg gctcaaaaaa tccagtctgg ttcaataagc 2580 aaatcattaa tttgaaaaat cgcaaacaaa aagcacacaa aacctacaag gcacataata 2640 gtcaagcaaa tttagaaaag tatctgagta tttgtgatca actaaattta actatttcta 2700 atgcacttac agattataac ataaaaaccg aaaatgaagt caagtcatgt ccaaagaatt 2760 tcttcaacta tgtcaaatca aagattaatt caaacaattt cccatcaaaa atgacattag 2820 acgacaaaga aggaaataac tcagaagata tttgtaacct ttttgcaact ttttttcaag 2880 aaacttacac gaacttctct gataatgatc gtgattttgg atatttttca taccttccag 2940 agatttcaag ggatgttgga gtaaatcaaa ttcatgttca agacattacg acaggtctga 3000 atgatttaga tgctactaaa ggatcagggc cagatggaat cccaccagta tttataaaaa 3060 aattagcggt agaacttaca actccactat tctggctttt caatatgtca cttgaatctg 3120 gtgaatttcc aaaggaatgg aaaaagtctt tcttggtacc tatttacaaa tcaggcaaaa 3180 aatctgatat caggaactat cgtgggattg ccatcatttc atgcattcca aaactgttcg 3240 aatcaattat taacaaatgt atttttggtc aaattaaaca cagaataact aattcacaac 3300 atggcttctt taaaggccgt tcaactacta caaacctcct ggagtttgtt gattattcca 3360 taatggcaat ggataaaggt aaccacgtag aggctctcta cactgatttt agtaaagcat 3420 ttgatagact tgacatttca atgttaattt tcaaactgag taaaataggt atcgagaaaa 3480 agcttctcaa atggattgaa tcatatctaa ctgaccggca gcaaatagta agatttaatg 3540 acaaaaaatc caaaccaatt caagtcacat caggtgttcc tcaaggctcc cacttaggac 3600 ctctcctttt tattttatat gttaacgacg tatcttttct tcttaaaaaa atacggattc 3660 ttatatatgc agatgacatg aaattgtttt tagaaatcat gaaagacgac gattatgaaa 3720 cattccataa tgaaatactt ttatttaata cctggtgttt taaaagttta ttgaagttga 3780 atgttgaaaa atgtaatcta ataacttata gcagaaaacg aaacactccg aatttaacaa 3840 taaggttagg aaaccaacaa gttaaaaaat gtgataaaat tagagatcta ggagtaattt 3900 tggattccaa attaacgttc gttgaacatt acaacacaat aactcataaa gcgggcaata 3960 tgcttaactt cataaaacgt ttcggatatc attttcaaga cccttatacg atcaaaaccc 4020 tttacgttgc ttatgtgaga tccatattag agtattgtag tgttgtctgg tccccatatt 4080 tgaaatcaca tgaagaacga atagagtcga ttcaaaagca gtttttacta tacgcactcc 4140 gcaaattagg atggactgta tttcctcttc cgtcatataa agcacgatgc atgttgataa 4200 acattcagac attaaaacag cgacgagaat atgccatggt cgcatttgtt aacgatatcg 4260 tttcacaacg tattgactca acaaagctat tatctaaatt aaacttttat gctcctaatc 4320 gacaattgcg tcacagaaat ttgtttactt tagatcgtta tcgtacaaac tacgccaaat 4380 ttagtcctct aaatcaaatg atgtctgttt acaatgagca ctgtgaaatg attgatctta 4440 gtatgccccg gaccaaattg aaatcatatt ttaactcact aggaaatctt agaatataag 4500 aaactatgta actatgtagt ctacaaatga ttgacgaaat aaataaataa ataaataaaa 4560 gaaataaa 4568 // ID Sat4_Cis repbase; DNA; INV; 129 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat4_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-129 RA Smit A.F.; RT "Sat4_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000004. XX SQ Sequence 129 BP; 36 A; 53 C; 8 G; 32 T; 0 other; caactgaacc cctttcaact gaaccccttt acaactgaac ccctttccaa ctgaacccct 60 ttaccaactg aacccctttc caactgaacc cctttaccaa ctgaacccct ttccaactga 120 accccttta 129 // ID BEL-600_AA-I repbase; DNA; INV; 5938 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-600_AA_; KW BEL-600_AA-LTR; Pao_Bel_Ele215; BEL-600_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5938 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4981-5538] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1227..2498 FT /product="BEL-600_AA-I_2p" FT /translation="MLYGRPEQLIHALLGKVRKTEPPRADRLESFIPFGMA FT VQELCDHLEAAELHDHLVNPVLIQELVDKLPAATKREWVHYKRNALHVTLR FT TFAEFSSSIVSEASEVTLTLDLRQTTKPDRSKVKDKGFIHAHTGEPSIASP FT PMSPCRICKTAAHRIRNCEEFRRLNVSERVKLMERWKLCERCLNEHTGWCR FT FKITCNVGSCRQPHHPLVHRGSATTSQPCQGQLHMHLDGKLPVIFRIVPVT FT LCSGDKTYSTFAYLDEGSDMTLVEESIVRELGVRGILQPLTLQWTGNVKRR FT ESSSECLSLEVVGEAGNRLELKEVHTVNRLYLPKQRVDFQQISNQYSHLHG FT LNVPNHTGEEPKILIGLNNAYLLAPLESRVGRANEPIAVRSHLGWSIYGPR FT ETVIAAGFLGMHKGYTNQQLHEIMRGYFLN" FT CDS 2914..5937 FT /product="BEL-600_AA-I_1p" FT /translation="MLLVGPDLLAPLDAVVQKFRERPIAFGADIREMYHQF FT LITPEDKHVLRFLFRTSQQDKPDIYLMDVAIFGSACSPASAQYIKNLNASH FT YSEHYPDASNAIIRKHYVDDYMDCANTTEEAIERAKQVRYIHSQAGLEIRN FT WVSNSTQFLEAMGEPIKPERVSLSVGKEGACERVLGIIWRPYDDVITFSTQ FT MRDDLMTYMTSDIRPTKRIILSCIMSLFDPLGLLAPFTVHGKGLLQDIWRS FT GCDWDELVNEDCYVKWTRLKAVFPFINRIEIPRCYLRGAPPSAYDTLEAHV FT FMDASQDYYSCAVYFRITHEGAPKCALVSAKTKVGPLKPLSIPRMELLAAT FT LGVRLLDTVLANHDVKVTKRFLWTDSSTVLHWIAADPRKYKPFVAYRITEI FT LQGSRLDEWRYVRSKLNIADILTKWKTENFFDGEGPWFCGPNFLQHPEESW FT PTHNRPPANRVLEMRACHMYHTTMAFVPLINATRVSKWNVLVRTMCCVIRF FT VANCRKRMQRMPIETLVIPKLASLVRRSICSIDVPLKQEEYEVAEQVLWKQ FT AQLECYEDEVATLLRNQKLPREEWQRIPKSSKLYKKNPFLDEYGVLRMEGR FT IGAAEFVPFDTKFPILLPREHIITYRIIEFYHQRYGHANKETVYNEIRQRF FT SIPNLRAGLVLVMKNCQWCKVRKAQPLTPRMAPLPLERLTPFAKPFSYVGI FT DYFGPIDVSVGRRMEKRWIVLFTCLNMRAIHLEVAYRLNAESCIMAIRRFV FT VRRGPPIAIFTDNGTNFKAASKELQNQIQRIDAECANVFTNARTRWSFNPP FT SAPHMGGIWERMVRSVKEMMSALNERHRLNDEILLTVIAEAEEIVNSRPLV FT YQSQDEEAGEVITPNHFIRGSSSGLKDPSIAPTNAAEALRNAYHRSQIISD FT ELWKRWLIEYLPSLNARSKWLEESKPLVPGDLVFIVDDHGRNGWIRAKVEE FT VIHGKDGRVRRAKVRTSNGVFLRPVCKLALIEVRASSKSCQEAISRQGLRV FT GE" XX SQ Sequence 5938 BP; 1725 A; 1287 C; 1483 G; 1438 T; 5 other; atttctcaaa aattttaagt taacctcact ttgagatgga gagttcctgc ggaaagtgcg 60 ataatcccga caatgatgaa atggttgctt gcgattcctg cgcggtgtgg taccacctag 120 attgtgttgg agaatcaccg ggagtggcga ataggcagtt tgtgtgcccc aagtgcaccg 180 gcaagaaaga gaagagagcc aggaagggaa aggctgaagc caacgtgccc agggttgata 240 agtctctagt gccgaaacgt ccaacgtcaa ccgagtccgg tggtagtacc gatcaggtgc 300 ctacaaattt gatggtagcg tgtgatttgg agccaatcga atgtgaaagc cagataggaa 360 aaacttcggg gattgcgact gcgaacgttg aaggcgagat agaaacgact ggtgcgatag 420 gtggacattc gacggaatct tcaatgcaga gagaattacg tcgattagaa gaagaaacgc 480 agcaaagaga gaagcaaatg gaggaagaaa ggattattcg agagaagaag ttagaatggg 540 aagcgaaact ccagcggaag cagctagaac aagatcgtgc cctgcggcaa aagaagctgg 600 atcagcaaag agaaatgtat cgcattcagt tgcaggaaga agcagatttt cgacgccagc 660 aagaaaaact tttggaagat tttcgagagc agaagagagc agttcagcaa agcactatac 720 cgtctgacga taccgtgggc aawccaacga tagaaggtgc ccccatagaa cgcgaaacgg 780 ctagtagtgt ttttcacccc gtaccggaac tgccagacac acctaaacgt attgtcaagt 840 caacgatacc cgttgttaag gagcgaaacg aaacctatga acaacattgt tcaactcagg 900 taatacgcga ggagcatagt ttcgaatctg atgatgctgg gactgatgac gaaatagaat 960 cwacagtatc acgtaggtct ggaccgacta aatctcagat agcagctcgt caggttctat 1020 cgaagaaact cccgattttc agcggcaaag ttgaggaatg gcccctgttt tacagtagtt 1080 acgtgaattc cacagaagca tgcggcttca ccgaagttga aaatttagta cgactccaag 1140 agtgccttaa aggtccagca ttggaatcag tgcggagtag attgctgcta ccaaaagctg 1200 tcccgcaggt gattgctacg ttacgtatgc tttatggacg tccagagcag ctcatccacg 1260 cgttactggg taaagtgagg aaaacggagc cacccagggc agatcgtctc gaatcgttca 1320 ttccctttgg aatggctgta caggagcttt gtgatcatct ggaagcagca gaattgcatg 1380 atcatttagt gaaccccgta ttaattcagg aactggtcga taaattacca gctgcaacca 1440 aacgagaatg ggtacactac aagcgaaatg cgctgcatgt cacgttgcga acgtttgctg 1500 aattttcatc atcaattgtg tccgaagcaa gcgaggtgac gttaacgctg gatcttcgac 1560 aaacgacaaa accagacagg tctaaggtaa aagacaaggg gttcattcat gcacataccg 1620 gtgaaccttc catagcttca ccgccgatga gtccgtgtcg aatctgcaaa actgctgctc 1680 atcgtatacg taactgcgaa gaatttcggc ggttgaatgt aagcgaaagg gtgaaactga 1740 tggagcgctg gaaactgtgc gaacgatgct taaatgagca cacagggtgg tgccgtttca 1800 aaataacatg caatgttgga agctgtcgac aaccccatca tccattggta catcgtggtt 1860 ctgcgactac ctcccaaccc tgccagggac aactccatat gcatctggac ggaaaactac 1920 cagtaatatt cagaattgtc cccgtcacgt tatgcagcgg agataaaacc tattccacct 1980 ttgcctacct tgatgaaggt tcggacatga ccctcgttga agaaagtata gttcgtgaac 2040 taggagtacg tggaatcctt cagcccctaa cgttacaatg gacaggtaat gtcaaacgtc 2100 gagaatcctc ttcggaatgc ctatctctcg aggtagttgg cgaagctgga aatcgtttag 2160 aactgaagga agttcataca gtgaatagat tgtatttgcc taaacagcga gttgattttc 2220 aacaaatttc caaccaatat tcgcaccttc atggattaaa cgtgccgaat catactggag 2280 aagaaccgaa aatcttgatc ggattgaata acgcctacct tttagcgcca ttggaatcga 2340 gagttggccg tgccaatgaa ccaattgccg tccgatcaca tttaggctgg tccatatatg 2400 gaccaagaga aacagtgata gctgcgggat ttctcggaat gcataaaggt tatacgaatc 2460 aacagttgca cgagattatg cggggctatt tcttgaatgm agaaccgaaa tctacagtca 2520 catcgcttcc ggagactgcc gaagatcgca gagcacggga gatgctggag tcaactacaa 2580 caaaggttgg gaatcggtat caaaccggat tactgtggaa ggcggaagag gtgaatttgc 2640 sggacagtta tagcatggca taccggcgtc ttcaaggatt ggaacggcgt cttctaaaaa 2700 aatccggagc taagagaaca ggtaaatcag aaaatacaag aatatcagtc aaaacattat 2760 gcgcataagg cgactccaca agaattagca ttggcaackc cgggaagagt gtggtacttg 2820 ccactaagcg ttgtgacgaa ccccaaaaaa ccgggtaaaa ttcgactggt gtttgatgca 2880 gcggctcaag taaatggagt ttctttaaat tcgatgctgt tggtcggccc cgatcttctc 2940 gcgcccctcg atgctgtggt acaaaagttt cgcgaacggc caatcgcctt tggcgccgat 3000 attagagaaa tgtaccacca attcctgatt acacctgaag acaagcacgt gctgcgcttt 3060 cttttccgaa cgagccaaca ggataaacca gatatatatc taatggatgt ggccattttt 3120 ggttctgcat gctcgccagc atcagcacaa tatattaaaa acttaaatgc atcccattac 3180 tctgaacatt acccggatgc ttccaacgcc attattcgta aacattatgt tgatgattat 3240 atggattgtg ctaatactac agaggaagca atcgagcgtg cgaaacaggt ccgatacatc 3300 cacagtcaag ctggcctaga aattcggaac tgggtttcaa actcaacgca atttctcgaa 3360 gctatgggtg aaccaatcaa acccgaaaga gtttcactaa gtgtcggaaa agaaggggca 3420 tgtgaacgtg tactgggtat catttggcgc ccgtacgatg atgtgatcac attttctaca 3480 caaatgcgtg atgatctgat gacgtatatg acctccgata tccgacccac caagcgaatt 3540 attcttagtt gtataatgag cctgttcgac cctctagggc tcttggcacc cttcactgtt 3600 catggtaagg gtttacttca agatatatgg cgctcaggtt gcgattggga tgagttggtt 3660 aacgaggatt gctatgtaaa atggactcga ttgaaagccg tttttccatt tatcaatcga 3720 attgaaattc caagatgtta tctacgagga gctcctcctt ctgcgtacga cacattggaa 3780 gctcacgtat tcatggatgc cagtcaagac tactacagct gtgcagtgta ctttcgcatt 3840 actcacgaag gagctccaaa gtgcgcgttg gtgtcggcaa aaacgaaagt tggacctttg 3900 aagccgctat ccatcccgcg gatggaactt ctcgctgcga cacttggcgt gcgtttgcta 3960 gacacagtat tagctaatca cgatgtaaag gtgacgaagc gattcctttg gacagattcg 4020 tctacagttc tacactggat agctgcagat ccgaggaaat ataaaccgtt cgtggcctac 4080 cggattactg aaattttaca agggtctaga cttgatgaat ggcgttacgt gcgcagtaag 4140 ctaaacatcg cagacatctt aaccaagtgg aaaaccgaaa atttcttcga cggagaaggg 4200 ccatggttct gcggtccaaa ttttcttcag catccggaag aaagttggcc aacacacaat 4260 cgtcctccgg caaatcgtgt tctagaaatg cgagcttgtc atatgtacca cacaacaatg 4320 gcgtttgttc cactaatcaa tgctactcgt gtttccaagt ggaatgtgct ggtacgcaca 4380 atgtgttgcg taattcgatt cgtcgccaac tgccgaaaaa ggatgcaacg gatgccgata 4440 gaaacgttgg tgattccgaa gctcgccagt ctggttcgac gatcgatctg ttccatcgac 4500 gtgccactga agcaggaaga atacgaagtg gccgagcaag ttttgtggaa gcaggctcag 4560 ttggagtgct atgaagacga agtggccaca ttactgagaa atcaaaagct accacgcgaa 4620 gaatggcaac gaattccgaa gtccagcaag ctgtataaaa agaatccgtt tttagatgag 4680 tacggtgtac tccgcatgga aggcagaatc ggagcagccg aatttgttcc atttgacacc 4740 aagttcccca tcctactacc gagggagcat atcatcacct atcgcataat cgagttctat 4800 catcagcgtt acggccacgc taacaaggaa acggtataca acgaaatacg tcagcggttt 4860 tccattccaa atcttagggc tggcttagta ctcgttatga agaactgtca gtggtgcaaa 4920 gtgcgtaagg cgcagcctct tactcctcgc atggcaccac ttcccttgga gcgtctcact 4980 ccctttgcca agcctttcag ttatgtcgga atcgattatt ttggtcctat tgatgtgagt 5040 gttggtcgac gaatggaaaa gaggtggata gtcctattta cgtgtttgaa catgcgggct 5100 atccatcttg aggttgcata tcgtttaaac gctgaatcct gtattatggc gatacgtcga 5160 ttcgttgttc gaaggggacc accaattgca atttttaccg acaacggtac taattttaag 5220 gctgcgagca aggagctaca aaaccagatc cagcgaattg atgctgaatg tgccaatgtg 5280 ttcactaatg caagaacacg ttggagcttc aatcccccat ctgccccgca catgggggga 5340 atatgggagc gtatggtacg ctctgttaaa gagatgatgt ccgcactcaa cgaaaggcat 5400 cgattgaatg atgagatact cctgacggtc atcgcagaag cggaagagat agtcaattca 5460 agacctttgg tttatcagtc tcaagatgaa gaagcagggg aagttataac gccaaatcat 5520 ttcattcgtg gctcatcttc tggcttaaag gacccatcga tcgcaccaac caatgcagcg 5580 gaagccttgc gtaatgcgta tcatcgatcg cagatcataa gcgatgagtt atggaagcgt 5640 tggttaatcg agtatttacc atcccttaat gcccgatcaa aatggttaga ggaatcgaag 5700 ccattggttc ctggagatct tgtatttatt gtggacgacc atggacgcaa tggatggatc 5760 cgcgcgaagg ttgaagaagt gattcatggt aaggatggtc gcgttcgacg ggcaaaggtg 5820 cgcacctcga atggagtatt cctgagaccg gtgtgcaagc tggcgttaat agaagtgagg 5880 gcatctagta aatcttgcca ggaagcaatt tcacggcaag gtttacgggt cggggaat 5938 // ID Gypsy-262_AA-I repbase; DNA; INV; 4322 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-262_AA_; KW Gypsy-262_AA-LTR; Gypsy-262_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4322 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC 'TTTA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 99..1703 FT /product="Gypsy-262_AA-I_1p" FT /translation="MADPVVNLCSTFTRVGLERECKKLGLTTRGSKAEMAK FT RIVAHNTNTPANDDRQSTNDEVHDGSLGDGENDPRDDASSNDDENDDAADD FT DGSMSGASDDAGNHHNDDENDEVNSSSDAGRVFHTALHSTPKSKKTSSKAS FT RPYSFRDVEDSIESFGAEDGQDVKVWIRQFEEISAAAYWTDEQRLIMLRKK FT LSGTARRFVFTQRNVGTFAKLKRALIAEFAPLVRACDVHRQLAARKKKPTE FT PTRDYIYEMQRIALAIELDEASICEYIVDGVTDDEYHRSMLYEAHTIKQLK FT EKLLTYEKAQAKSVKKHRSDDRFSASQKRERREKGEEKNTKAKSEAKRHCF FT NCGDPTHVASDCPQKGDGPKCFSCNGFGHLSKECAKKGEEQSTKKKSAKVN FT TIEKVNKSQPAVSVKINGYEVPAIVDTGSEITLMRKDLWLDIEKSGTKLGR FT SDMKVRGYGGKVEIVCGEVVLDAKIEDEDFEIRFYVVPQNAIDMQVLIGMD FT FLDGVKLLDHPGWCACPKISAQSHGRRRCEVDPTDRRVR" XX SQ Sequence 4322 BP; 1216 A; 971 C; 1267 G; 857 T; 11 other; tgggggctca gcccggatgg agtttccgtg aaatctgcgc gaaaaaatag taaaatcggt 60 cgttgaacgt tgtgaaaaag ttgatccgaa aaagcaagat ggctgatccg gtggtgaact 120 tgtgttccac atttacgaga gtcggacttg agcgcgagtg taaaaagttg ggccttacga 180 cgagaggcag caaagcagag atggcgaaga gaattgttgc tcataacacc aatacacctg 240 caaacgatga tcgtcagagc accaacgacg aggtgcacga cggcagcctt ggtgacggtg 300 agaacgatcc cagagacgac gcttccagca acgacgacga aaacgacgac gctgctgacg 360 acgacggcag catgagtggt gcatccgacg acgcgggtaa tcaccacaac gacgacgaaa 420 acgacgaagt gaacagtagc agcgacgccg gaagagtttt ccacacagct ttgcactcga 480 caccgaaaag caagaagact tcgagcaaag cttcccgacc gtattctttc cgcgacgtcg 540 aggacagtat tgaatctttc ggtgctgaag acggccaaga cgtaaaagtg tggatccgcc 600 agtttgagga aatatcagct gctgcatatt ggaccgacga gcaaaggttg attatgctcc 660 ggaaaaagtt gtccggaacc gcgagacgat ttgtgttcac ccagcgcaat gtgggcacat 720 tcgcaaagtt gaaaagagct ctaattgccg agtttgcacc tctcgtgaga gcgtgcgacg 780 tgcatcggca gctagcagcg cgtaagaaga agccgaccga gccgaccaga gattacatct 840 atgagatgca acggattgct cttgctatcg agctcgacga agcgagcatc tgtgaataca 900 tcgtcgacgg cgtgacagat gacgagtacc atcgctcaat gttgtacgag gcacacacca 960 tcaaacagct gaaggaaaag ttgctcacat acgaaaaagc acaggcgaaa tcggtcaaga 1020 aacaccgatc cgacgatcga ttcagcgcca gccagaaacg agagcgccgt gagaaaggag 1080 aggagaaaaa cacgaaggcg aaatccgaag caaaacggca ctgcttcaac tgtggcgatc 1140 ctacgcatgt tgccagcgat tgtccgcaga aaggcgacgg cccaaaatgt ttttcgtgta 1200 acggtttcgg gcatctatcg aaagagtgtg ccaagaaagg cgaagagcaa agcacgaaaa 1260 agaagagtgc gaaagtaaac acgatcgaaa aagtgaacaa atctcaaccg gccgtcagtg 1320 tgaaaataaa cggctacgaa gtaccggcca ttgttgatac gggaagtgaa atcacattga 1380 tgcggaagga tttgtggttg gatattgaga agagtggcac caagctggga cgatccgata 1440 tgaaagtgcg cggatacggt gggaaggtgg aaatcgtttg cggagaggta gtgttggacg 1500 cgaaaatcga agatgaagat tttgaaattc gtttttacgt tgtgcctcaa aatgcaatcg 1560 acatgcaagt gctgatcggt atggactttc tcgatggtgt taaattactc gatcaccccg 1620 gatggtgtgc gtgtccgaaa atatccgccc agagccacgg aagaagaaga tgtgaagtgg 1680 atccgacgga tcgaagagta cgatgagaac gaactaacgg taccctatca gtatcgatcg 1740 gacgtgatga agattatcga gcagtacaaa ccggatacca ctgtgaccgc gaaaaatcct 1800 ctgtcgatta agctggcaga taacgacgta gtgcgtgaat gtccscgacg kcttgctccg 1860 ttggaaaagg agattgtgcg aaagcaaata ggcgagtggc tcgaaaatgg catcattcaa 1920 ccgtcggaaa gcgaatatgc cagtgcagtg gtagtagttc cgaagaagga cggttcgcgc 1980 cgcgtttgtg ttgactatcg cgaagtgaac aaaaaggtga tccgtgacaa tttcccgatg 2040 ccgaacgttg aagagcaaat tgatcagttg gcaaacgctc gagtgtacac tacgctggat 2100 ttgaaaaact cctactttca cgtgccggtg gaagagagca gccggaagta cactgccttc 2160 gtgactagtg aagggcagta cgagttcctg cgagctccgt tcggactgtg caccagcggc 2220 aacgcgttcg gacggttcat caccgacgtg ttcaaagact tgataatcga tggcacgatt 2280 agtggcgttc gtggacgatg tgattattcc gtccgaggcc gaagaagaag gactcgaagt 2340 gttgaagaaa gtgatgagtg ttgctgagaa ggccggattg cagttcaact ggaagaagtg 2400 cgcgttcctg caacgtcgag tcgaatacct cggctatacg gtttacgatg ggcaagtgga 2460 gccagctcca gcgaagatcg agaaggtgaa gcatttcccc cagccgatca acgtgaagca 2520 gctgcaacgt ttctacggat tggcaagcta cttcaggaag ttcgtcccgk gttttgcgga 2580 catcgcccga ccgctgtccg gcctgttgaa gaaagagagt gtttttgtct tcgacgagaa 2640 agctgtagcg tctttcgacc gcatcaagga tatcctggcc agctacccag tgctgcgaat 2700 tttccatcct gatgccgaga cggaggtcca caccgacgcc agcaaggaag cactggcagg 2760 tattctgatg caacgggcga cgatgacggt aagttccacc catgctacta ctacagccgt 2820 ctgacgactc sggccgagaa gaattatcac tcgtacgatt tggaagcttt ggcagtggta 2880 gaatctgtca aaaagttccg tgttatctgc tgggtcgtaa gttcaaaatc gtacggattg 2940 cgtggcgttc aaggattcat gccgaagaag aggctgaatc ctcgagttgc aaggtgggcg 3000 tggatcttgg cagagtttga ctacaatgtc gaacaccgac ctggagacaa gatgccccac 3060 gttgatgcgc tgtccagagc aaacgtgatg ctcatgtccg ctccgatggt cgcgaagatt 3120 cgtgctgctc aagacgacga cgagaggtaa aagcgatcaa aatggctctg gagcagggag 3180 gaaccgtcga gggctactct gtgtmgaatg gtgtcctgta cgaagaggat ggtggagacg 3240 aagtgtgtwc gttccggaag cgatggagat ggaggtggtt cgatcggcgc acgagaaagg 3300 acatttcggt gtgaagaaga tgaaggagag gatcgacgcs gactactaca tcccaggtct 3360 ggacgagaag atcaagcggc gcatcgckac gtgtgttgct tgtatcgttg gcgaaaagaa 3420 gcgaggaaag cctgaaggtg agctgcatcc gatcccgaaa ggtgatgttc cgttggatac 3480 tttccacgtt gaccatctag gtccaatgcc gtccaccagg aagtcgtaca actacatctt 3540 gaccgttatc gatgccttta ccaagttcgt tggctgttcc caacgaagtc gacgactgct 3600 gatgaggcga tcaagaagct tacgtataat taccgacgtc ttcggaaacc cgagamggat 3660 catctgtgat cgagggctgc gttcacatcg agacagttca agaagttctg cgatgatgag 3720 ggcatcgagc tgcacacgat cgccaccgga gtccctcgtg gaaacggcaa gtggaaaggg 3780 tgcaccgaat cataattccg atgttcacga agctgtctgt ggacaaaccg agaatggttc 3840 aagcacgtcg ctgatgtgca gaagtgtctc aacaacagct ggcaacgagc aatacagacg 3900 acccattcca gttgctgctt ggagtgaaaa tgcgaacgaa ggaggatgac gtcctacatg 3960 agttgctcaa caaggagatc caggaccagt ttgaggagga acgaagcgaa ctgcgacaga 4020 tcgcgaacga gaacatccag aagatccagc aggagaaccg gaagtattac aatctgcgac 4080 gaagccggca gaagggtaca ggaagggcga cgtcgttgca atcccgagga cccagttcgg 4140 gtcggccaga aagtgaagca ccggtacttt ggaccgtacg aaatagtcgg cgttttgccc 4200 aacgaccggt acgaggttcg gaagctcgac gagaagmgga aggaccgaag aagacaacga 4260 cggctggtga ccgtgtgaag ccgtgggcac ttccgggccg gaagtgawgt caggaaaggc 4320 cg 4322 // ID BEL-79_CQ-I repbase; DNA; INV; 5870 BP. XX AC AAWU01022182; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-79_CQ_; KW BEL-79_CQ-LTR; BEL-79_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5870 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 299-299 (2011). XX DR GenBank; AAWU01022182; Positions 4016 9885. XX CC Positions [4845-5429] - Integrase core CC 'CCCTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 567..5840 FT /product="BEL-79_CQ-I_1p" FT /translation="MAAEKKIAVTSYTRMIDSIKERGKKLLIFVRDFDSEK FT LDRSLLEDKQESVEEMRSLFHETEVKLYGVIKEDEVSAFQQTNEEVEDLLD FT EIRHLVRQRLREIDPPKPVSEQVKVSSSHQAHPKLPDIPLPHFSGNPEDWH FT LFKDTFTALIKRREGLADSEKLFYLRAAIQDGKARFIQSAEDTFVSLWKAL FT RQEYENKRVLVEKHIAQLFHLEPIASESGSDLRTLINEVSQSLRALANMNL FT KVESLSEQFMIHLVCSRLDPGTRKDFELQLTNNALPSWEKLFEYLQSRCRC FT LESIENETKDKSVPRGKTTPQERSAPRQSKAFPVQTAPPRSRSSNINVKCF FT VCKGAHYLNKCGQFLALSASDRSAKIKELGLCLNCFSNKHYLRECRSSSCR FT TCGGRHHTLLHLPHTAPTSNVPNQGQAPTPSQSMSSSNSVVGCDTSSIPSA FT KQFVLYTAVEVVEDAYGNSIKCRILLDCGSMHNIISSRIVNMLKLPKVKAN FT VCVAGVSGAPQLIKSKVAARIRSTTNPDCLYELEFLMMKKITTDIPIETFD FT VNPERFPAGVHLADPLFNRSRRIDILLGIQTFNDLFTGASFPLADDRTLWC FT KETTFGWVVGGAVTERRADETSTSTCGVVTNEMLSEQIKQFWDSEAVPEIR FT ALTADERAAEQSYSDTCRRLRDGRYEVGLPVKESIRELGDSRAMALSRFVQ FT MERRMHRDENLREQYVAFMKDYEDQEHMIEADEVPGGYFMPHHAVFNPAST FT TTKTRVVFDASAGTTSGSSLNDHLYVGPALQRKLYDIVIRFRFPQFVFTAD FT MTQMFRQIRIRSEDRKYQQIFWRRHPDELVKVYQLATVTYGTACAPFLATR FT TLAQVCKDEIERFPLAAKAGTDDVYMDDLLSGADTLEEALVLQKEFVAMMK FT SAGFDLHKWASNHPDLLKSVPNADNEQDAFFKDQKTTRTLGLTWQPSSDKF FT LTKLHEIRFNTGPPTKRSQYSDIAKLYDPIGLLGPVIFKAKVMMQKLWEVE FT VEWDDVLPDTEEWDTFRNQIAEMGEIEIPRNVIPHPNPLQIELHGFSDASM FT LGYGASVYARCVYSREYSTVRLLTSKSRIAPSKNAKQTIPRLELDGGLVLA FT RLISNLTHILSIPFTRIVCWMDSTVALSWIRTDPGRLDTYVSNRVIQIQEL FT TKNFEWKYVSSHENPADVLSRGLLPTEIRECPIWWQDLTYLDQNEPLWPSQ FT PPTIPVEQLPETRNPTVSLPITDPPAMFNLFEIESSFRRMQRTMATMLRFI FT DCIRKDDQRERRSGHLTIAELNRATAALTSIAQREAFPDDFQQLESGKVIH FT SRSKLITLCVFSDRSKFNVLRVGGRLRHSNLTVGQKHPMVLPSKHPFTDAV FT VRSYHEEFLHAPQQLLLSALRRRFWVQHGCSTVRKIIRRCVTCFKAKPVAM FT QQMMGDLPKSRLEGVCPFLNAGVDFAGPMYIRQHNKRSTVTYKAYVAVFLC FT FATRAIHLELVGDLTADAFIAALQRFVSRRGKPVKLFSDNGLNFVGSKNKL FT RDLYKLFRSQQLKGKLDDFCTKTSIDWRMIPPNAPHFGGLWEAGVRSAKYH FT LKRITGTANMNFEEFTTVLARIEAVLNSRPISPMSEDPTDATPLTPGHFLV FT GRPLTDIAEPDLSDRKETTLSRWQRQTHMVQHFWARWHQDYLTTLQNRNKW FT HKRYEIKPGQLVIIREDNVPTMQWKLGRIENVIPGPDGLVRVADVRAGKKV FT IRRAIAKLCLLPIEDNNAEQPEAEAADDGSNDQEEQED" XX SQ Sequence 5870 BP; 1448 A; 1584 C; 1626 G; 1212 T; 0 other; taaaagtggt ccttcgaacc ggattcgaag accgggtccg aaggacgtga aggacggtga 60 ccagtgacag tgaagtgacc acgaaaaacg gtggacacgg acgcgtgtgg gggaacaatt 120 gttcctgggc gttccgcgtt ggacattgtt tgtgcgaaca aaatggacgg tcggtccatg 180 tgattcacga tcgaagcaac gaagaacaat tggtgcgaga aattggacga ttgaagtgca 240 cggacaattt gtgctatttg aacacgaaaa gaactgtgtg cgacaaaact gcgagccaaa 300 aaagagtgca acaattctca gccgagggct gaaacagtgg ccttcgggca aaaagaacac 360 gtgcagttga gttgatcgtg actgtggggg tcattgagcc gcggtgcggt gcgcactgct 420 cgatgccagc tggatcaccc acgtgatcgc tgtatgtatg ctcttcatgc ctttcaggtc 480 ggtctaggtg gaatccaggt tagaggtcag gtaggtcgtg caagtgggtg cgttgtttcc 540 gggagaattt ttgggaatcg ctcaaaatgg cggccgagaa gaaaatcgcg gtgacgtcat 600 acacccgtat gattgactcc attaaggagc ggggtaaaaa actgttgatc tttgtgcgtg 660 attttgattc cgagaagctt gatcgctcac ttctcgaaga caagcaggaa tcggtcgaag 720 aaatgcgttc tttgttccac gagaccgagg ttaaattgta cggtgtcatt aaagaggatg 780 aggtttcggc attccaacaa accaacgagg aagtggagga cttgctcgac gagatccgcc 840 atcttgttcg acaaagactg cgagaaatcg atcctcccaa gccggtgagt gagcaggtca 900 aggtatcttc ctctcatcaa gcccatccca agttgcccga catccccctc cctcacttca 960 gcggaaatcc ggaagattgg cacctgttca aggacacgtt tactgcgcta atcaagcgtc 1020 gcgaaggtct cgccgacagc gagaagctat tctacctgag ggcggccatt caagatggca 1080 aagctcgctt catccagtct gcggaagaca cttttgtctc tctgtggaaa gctcttcgcc 1140 aagagtacga gaacaaacga gtcctggtag aaaagcacat tgcccaattg ttccacctgg 1200 agcccattgc tagcgaatct ggatcagatc tgcggaccct gatcaatgaa gttagccaat 1260 ctctaagagc ccttgcaaac atgaacctca aagtcgagtc gttgtccgag cagttcatga 1320 tccacctggt gtgttcgcgt ctcgatccag gtacacgcaa ggatttcgaa ctacagctga 1380 ccaacaacgc gttgccgtcc tgggaaaagc tgttcgaata tctccagtcc cggtgcaggt 1440 gtcttgagag catagagaac gagactaagg acaagtctgt accacgtggt aagaccaccc 1500 cccaagaaag atctgcccct cgccagtcca aggcctttcc ggtgcaaaca gctcctccaa 1560 ggtcgaggtc cagcaacatc aacgtgaagt gcttcgtgtg caagggtgcg cactatttga 1620 acaaatgtgg tcagttctta gccctgtctg cgtcggatcg ttctgccaag attaaggagt 1680 tgggcctttg tctaaactgc ttcagcaaca agcactacct gagagagtgc cgaagcagct 1740 cgtgtcgtac gtgtggtggt cgccaccata ctctgttgca tctgccccac actgccccaa 1800 cctccaacgt ccccaaccaa gggcaagcac caacaccatc acagagcatg tcgtcatcga 1860 actctgtcgt tgggtgtgac acgtcgtcga tcccttcagc gaagcaattt gtactgtaca 1920 ctgcggtgga ggttgttgag gacgcctatg gaaattcgat caagtgccgt attctgctcg 1980 attgtggctc gatgcacaat atcatctcgt cccgaatcgt caacatgctg aaactcccca 2040 aggtcaaggc caacgtttgt gtagctggtg tgtctggagc accacaattg atcaagagca 2100 aggtagcagc aaggattcgc tctaccacga accccgattg tttgtacgag ctggagtttc 2160 tcatgatgaa gaagatcaca acagacatcc cgatcgaaac ctttgatgtc aaccccgagc 2220 ggttcccagc aggtgtgcac ctggcagacc ctttgttcaa tcgatcacga cgaatcgata 2280 ttctgttggg catccaaacg ttcaatgatc tttttacggg cgcgtcgttt cctctggcag 2340 atgatcgtac actttggtgc aaggaaacga cctttggctg ggtcgtgggc ggagccgtga 2400 ccgagcgtcg agcagatgag acttccacga gcacgtgcgg agtcgtgacc aacgagatgc 2460 tgagcgagca gatcaagcag ttctgggact ccgaagccgt tccagaaatc cgtgcgctga 2520 ccgcggacga gcgggcagcc gagcagagct actcggacac ttgccggaga cttcgagacg 2580 gtcgatacga agttggactt ccggtgaagg agagcatccg agaacttggc gatagtcgtg 2640 cgatggcgct gagcagattt gtgcagatgg agcgaaggat gcaccgcgac gagaaccttc 2700 gtgagcaata cgtggcgttc atgaaggact acgaagacca agagcacatg atcgaggcgg 2760 acgaggtccc gggtggatac ttcatgcccc accacgcggt gttcaacccg gcgagcacca 2820 ctacaaagac gcgtgtcgtg ttcgacgcct cagccggtac aacatctggg agctccctga 2880 acgatcatct gtacgtcggt ccagccctgc agcggaagct ctacgacatc gtgatccgct 2940 tccgttttcc ccagttcgtg tttacggccg acatgaccca gatgttccga cagattcgga 3000 ttcgatcaga agaccggaag taccaacaaa ttttctggcg gaggcatcca gatgagctcg 3060 tgaaggtgta ccagctagct acggtgacgt acgggacagc ctgtgcacca ttcttggcaa 3120 ccagaacttt ggcgcaagta tgcaaagacg agatcgagcg ctttcccctg gcggcgaagg 3180 caggaactga cgacgtgtac atggacgact tgttgagtgg tgctgacacg ctggaagaag 3240 cgttggtgct gcagaaggag tttgtcgcga tgatgaagtc ggctggattt gacctgcaca 3300 agtgggcatc caaccacccg gacctgctga agtcagttcc gaacgccgac aacgagcagg 3360 acgcgttctt caaggaccag aagaccacgc gaacgttggg gttaacttgg caacccagca 3420 gcgacaagtt cctcaccaag ctgcacgaga ttcgcttcaa tactgggcca ccgacgaagc 3480 gatcccagta ctctgacatc gcgaaactgt acgacccgat cggactgctg ggtcctgtca 3540 tcttcaaggc gaaggtgatg atgcagaagc tgtgggaagt tgaggtcgag tgggacgatg 3600 tcttaccaga caccgaggag tgggatacct tccgcaacca gatcgccgag atgggtgaaa 3660 ttgaaattcc tagaaatgta atcccacacc caaatccgct ccagatagaa ctgcatggct 3720 tcagcgatgc atcgatgttg ggatacggag catctgtcta cgctcggtgc gtgtacagca 3780 gggagtacag cacagtcaga ttgctcacct caaagtctcg aatcgcaccc agcaaaaacg 3840 caaaacagac cattccgcgc ctagaacttg atggcggact tgtacttgca cgactaatct 3900 ctaacctaac gcacatattg agcattccct ttactagaat agtttgttgg atggattcca 3960 ccgtggcgct ttcctggatc cggaccgacc caggcagact cgacacctac gtcagcaacc 4020 gagtcattca gatccaagaa ctgaccaaga actttgaatg gaagtacgtc agctcgcacg 4080 agaaccccgc tgatgtactc tctcgcggct tactgccgac cgaaatccga gagtgtccga 4140 tctggtggca ggacttgacg tatttggacc agaacgagcc cctctggcca tcccaaccac 4200 ccaccatacc tgtagagcaa ctcccggaaa cgcgaaaccc aacagtaagc ctcccaatca 4260 ctgatcctcc tgctatgttc aacctgttcg agatcgagag cagcttccgg agaatgcagc 4320 gaacgatggc gacgatgctg cggttcatcg actgcatccg gaaggacgat caacgagagc 4380 gtcggagcgg acatctgacc atcgccgaac tcaaccgagc gacggcggcg ctgacgagca 4440 ttgctcaacg agaggcgttc cccgacgact ttcaacagct ggagtccgga aaggtaattc 4500 acagtagaag taaactaatc accctctgtg tgttctcaga ccgttctaag ttcaacgtgc 4560 tgcgagtagg cggacggctc cgtcactcca acctcaccgt gggtcagaaa caccccatgg 4620 tcttgccgtc gaagcacccg ttcaccgacg ccgtggtacg gtcctaccac gaggaatttc 4680 tgcacgcccc gcagcaactg ctgctgagcg cgttacggcg acgattctgg gtgcagcacg 4740 ggtgcagtac ggtgcgtaag atcattcgac ggtgcgtcac ttgcttcaag gcgaagccgg 4800 tggctatgca gcagatgatg ggcgacctgc cgaagtcccg tctcgaaggt gtgtgcccgt 4860 tcctcaacgc tggtgtggac ttcgctggcc cgatgtacat ccgccagcac aacaagcgtt 4920 cgacggtcac ctacaaggcg tacgtggcgg tcttcctctg cttcgctaca cgtgccatac 4980 atctggagct tgtgggagac ctgacggccg atgcgttcat cgccgccctg caacgtttcg 5040 tctcgaggag aggtaaaccg gtcaagcttt tttccgataa cggtcttaat ttcgtcggca 5100 gcaagaacaa gttgagggat ctgtacaagc tgtttcggtc ccagcaactc aagggcaagc 5160 tggacgactt ctgcaccaag acttcgatcg actggcgcat gatccctccc aatgctccgc 5220 acttcggagg actctgggag gccggcgtgc gctctgccaa gtaccacctc aagcggatca 5280 ccgggacagc caacatgaac ttcgaggagt tcacgaccgt tctcgctcgc atcgaggctg 5340 tgctgaactc gagaccgatc agcccgatgt cggaggaccc aaccgatgcc acacctctca 5400 cgcctggtca cttcctggtg ggacgaccgt tgaccgacat cgccgaacca gacctgtcgg 5460 accgcaaaga gacgaccctg tcgcgttggc agcggcagac gcacatggtg cagcactttt 5520 gggctcgctg gcaccaggac tacctcacaa ccttgcagaa ccgaaacaag tggcacaagc 5580 ggtacgagat caagcctggg cagctagtga tcatccgaga ggacaacgtg ccgacgatgc 5640 agtggaagct agggcgaatc gagaacgtca tcccgggccc tgacggccta gtgcgggtgg 5700 ctgacgtacg agctggcaag aaggtgattc gacgggcgat cgcgaagctg tgcctgttgc 5760 cgatcgagga caacaacgct gaacaacccg aagctgaggc tgcagacgac ggcagcaacg 5820 atcaggaaga acaagaagat tgaaattcgt ggatttcaat ggggggagca 5870 // ID piggyBac-N1_BF repbase; DNA; INV; 972 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-N1_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSDs; piggyBac-N1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-972 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-972 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-972 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-N1_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC It shares common termini with piggyBac-1_BF. XX SQ Sequence 972 BP; 317 A; 181 C; 175 G; 299 T; 0 other; ccctcaaaca cccgagtggg gtccatttgg accccaggcg tacattttga agtgccattt 60 gaacattttt gatctgaaga aaaaatcctt ccgtgacttt gtcagtcaat atgtcttata 120 ccttctccta agttttcgca aagattggta caataatgtg gaaattataa agaaagtttg 180 tgttcagtcc ctctggggtc caaacggacc ccagcaaaaa tgtgcagttt ttaaaacaaa 240 ctttggaggc taactcctta accacttaca gtatgactac caaactttca gacaataata 300 agaatgtaaa cctaaatcat cacaaaaatt tctgtgtcat caacatgtaa tgtgacgaca 360 ttatgacgtc atttgcctaa tcagggcggc catcttggat tttgaccaat gacgtcatga 420 aattagcata aattataaat tttaagtcat gaaaattaca ttgaatttca tagaatttaa 480 ttgttgtcag aaaacaggtg tagtttacca agaaaacctt gtttaaattg aaattgtcaa 540 attttggcaa aaatatgcct gttagaatat ggttgccatg gcaaccccca aaaatgataa 600 acttacttta ttgaccaaaa acttttgcaa acaatatttt gtgataaatt accaagtttc 660 gtagctttag ctctagccgt tcatgagtta tgcgacataa atgttgctga gggcctcaaa 720 attcccctat ctggtcggaa taggtttagt gcaaaacgtg gtccttctga acttttgcat 780 aaattatgat aatgagttgg aattagcaac accttaacat ctcaaaatct tcctcttaca 840 ttgacgaagc aatgtatata caatgatcgc cgataagaaa tggtactgag attttatgaa 900 caaaacattt tggggtccgt ttggacccca ctcggtcatt ttagtcgcaa aaaaaggtcg 960 ggtgcttgag gg 972 // ID Sola3-2N1_CB repbase; DNA; INV; 1917 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Caenorhabditis brenneri. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; Sola3; KW TTAA TSD; Sola3-2_CB; Sola3-2N1_CB. XX OS Caenorhabditis brenneri OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1917 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX SQ Sequence 1917 BP; 631 A; 332 C; 326 G; 628 T; 0 other; gaggcgctat ctgtaccagc tggaatgttc caggcacagt tgcaaaaagt taatccgttc 60 catatgagtt ttaaaccatg gaaatacaca tggaagtgat ttccaaaaag tttgctaaaa 120 ttcggatttt tcgattttgg ctgaaattat agccaaaaat caacttttca gatcgaaaat 180 tctaaaaatt tcagtttttg agataaagtt tccaaaattt taggggtgca tttgtacacc 240 ctggagcaca ttttgatcaa atttgggctt ctcgaaatga cggcaagtga tgaaaacaca 300 actgagatga cggttttttg ggtttttctc gaaaactttg aatacctggg gtaacttttg 360 acaatttttt cggattcccc ttgcaatttg gaacaaaaaa gtctcttgga ccaaaaaaag 420 ttagacggac taaattgggt ttttttcgat tttttcgaaa atcaccaaag ggtcccccct 480 tatgaaattt ttttttcgaa aaaaaatcaa aatttttttt tgccccaatt tacaaaattc 540 atgtttttat catcataaat aagccactca caatttttca gatgatacga atgcgttcca 600 gccgagtttt atcatgaaat accggttctt ttcaggaatc tcgatttttc acgaaaatct 660 tcaacagaag agtaactttt aacaactttt ttgaattccc catgcaattt ggagcaaaaa 720 agtctcttgg accataaaga ttcaggcggg ctaaattgag ttatgacgat ttttttgaga 780 aacaccattg tctgcaaatg tcggaaaact aacgtaccaa attgtggttg taaggaccaa 840 aatttttgtg cttaatgtta tttttttatt ttttcgagtc gaaaatgttg taatagggtc 900 gattaaaggt aaaaaatcga tagactaagt cgatcttgta tcattacctt caggctttaa 960 agttcaaaat gagcagttgg aacaaagaag aataactctg ttcgaacagg tccaaacaag 1020 ctcaaaaatt gtgagagcat tattcaccaa cttttcattt ttcataggag ctctagaaaa 1080 attttatttt tccgagtttc accactttca cctcatttta gaggtcctca aaaaaattgt 1140 cataactcaa tttagtccgc ctgaatcttt atggtccaag agactttttt gctccaaatt 1200 gcatggggaa ttcaaaaaaa ttgtcaaaag ttaccccttc tgttgaagat tttcgtgaaa 1260 aatcgagatt cctgaaaaga accggtattt catgttaaaa ctcggctaga acgcaatcgt 1320 atcatctgaa aaattgtgag tggcttattt atgatgataa aaacatgaat tttgtaaatt 1380 ggggcaaaaa aaattttgat ttttttttcg aaaaaaaaaa tttcataagg ggggaccctt 1440 tggtgatttt cgaaaaaatc gaaaaaaacc caatttagtc cgtctaactt tttttggtcc 1500 aagagacttt tttgctccaa attgcatggg gaatccgaaa aaattgtcaa aagttacccc 1560 aggtattcaa agttttcgag aaaaactcaa aaaactgttg tctcagaggt gtttttccca 1620 ctttccgata tttcaagaag cccaaatttg ttcaaaatgc gcccctatgc atccaaatgt 1680 acctctaaac gtttcagaac aaaatctcaa aaacgaaaat ttttagaatt ttcgaactga 1740 aaagtcgatt tttgactata atttcagcca aaatcgaaaa atcagacttt tgataaactt 1800 tttggaaatc acttccatgt atgtttccat ggttcaaaac tcttctggaa cggattaact 1860 ttttgcaact gtgcctgggc tctgtagaaa aagtccactt tttacagaga gggcctc 1917 // ID hAT-16_HM repbase; DNA; INV; 3529 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3529 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2005-2005 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 836..3397 FT /product="hAT-16_HM_1p" FT /translation="MAERKKPSGAQYRKRKVEQQREQEKQKGSLLKFICRQ FT DSTKEDSNGGQIPTDMEAEEISNVVDNDNVMMSETLSYSTVPVSHADNGQL FT NNSLEEHVQQASEEKTDNVIPQDPALWPLLINHNTRAIIVERGPQQIKDIS FT FPNDKNNRKFSVFHYLRKLPNGEELCRSWLLYSVLKDSVFCFCCKLFSMKA FT IGISSLTHTGSSDWKNMAAILSSHEKSPEHLQNYQKWKELHQRLQRDSTID FT AEILRKMKNEEKYWQQILKRLIALVRVLGEQNLAFRGTNETLYSANNGNFL FT KFVQYLAIFDPLMNEHLRKISNKELHTHYLGKDIQNELIQLLGNAIKKEII FT QTANAMKYFSIVLDGTPDCSHVEQMTIIIRFVKVDSLKKEFSIKEHFLGFV FT PLKKTTGAYMAETIIQQLEEMELPIDNLRGQGYDNGANMIGKNNGVQKKIL FT NINPRALFVPCSAHTLNLVVNDAANCCLMATSFFDIVQRVYVYFSSSTHRW FT VVFTSHQPTLTVKPLSETRWESRIDAIKPLRYELGKIYDALLEIADDTSLT FT GSSGSTARSDAKALANSVAKFKFVVSIVVWYNILFEINITSKQLQAKDLDI FT HAAVQQLQHTQAYLVDCRSDIGFARMLVDAAEIAKDLEIPPTFEAEPRLRR FT RKKQFAYEAEDEPVQDPKQNFKVNFFFAILDTAIRSVEERFEQMRTIESVF FT GFLYHIHGLQSKTSQEILECCMKLESALQHGDNRDLVASDLCGELQSIARR FT LSEETKSPQDVFRFILCQNLEDSLPNLCIALRILLTLPVSVASGERSFSKL FT KLIKTYIRSSMCQDRLVGLATLSIEHELADKLDLKDLVIDFAQKKARKVQF FT *" XX SQ Sequence 3529 BP; 1208 A; 586 C; 639 G; 1096 T; 0 other; cagggccggt gcaccctata ggcgaactag gcaatcgcct agggcgccaa agtttagggg 60 cgcctaaaaa ttgataagtt tttttatatg ctaacaaaaa acaacgcttt tgctcattga 120 gcaatatgta cgtacatttt accgaacgca atctaaatca taacaaatat ggcggcgtct 180 ttacaatcaa agtgattttc ataacaactt tcaatagtta gtcgactata ttctttatat 240 aatttatact gacaatatat atatatatat atatatatat atacatatag atgaatagtg 300 acattctata tactattata catttatatt aagttttaga cacttattaa gaaaagtaaa 360 atatttttta tatgaatata taataactag cttaaatcta gggctattat atattacgta 420 ttatattatt attatacttc accatacgta taactttgct atacaaaata aaatttgaat 480 tatttaaaca caatgcatct agcaggttaa attttttgtt gagtcgcttc tacaatatgt 540 cttgatctgc accagctaat tacacattga aatctcattc atatagctaa cttccatttc 600 cgtttaaaaa aacacaaaaa atactagcag gacgtataac tttgccaagt tgttttttgt 660 ataatacagt taaatgataa cctgttaaaa ccttacagag ctgttcagac ctgttggaaa 720 ctatagctag ctatatataa tttattcttg aagccaatca cagaagacct attaatgtat 780 tagttaaaca tcgcaacatt catttatttt attattttct tgttacattt cagatatggc 840 tgagagaaag aagccttcag gagctcaata tcgtaaacga aaagttgaac agcagaggga 900 acaagaaaaa cagaagggtt cattgctaaa gtttatatgc cgacaagact ctactaaaga 960 ggatagtaat ggtggacaga ttcccactga tatggaagct gaagaaatat ctaatgttgt 1020 tgataatgat aatgtgatga tgagtgaaac actttcttac tcaacagttc ccgtttccca 1080 tgcagataat ggccaattga ataattctct cgaggaacat gtacagcaag caagtgaaga 1140 aaaaacggat aatgttattc cacaggatcc agcactttgg cctctactaa ttaatcataa 1200 tacacgtgca atcattgtag aacgtggacc tcagcaaata aaagatatca gtttcccaaa 1260 tgataagaac aatcgaaagt tttcagtatt tcactactta agaaaacttc ctaatggaga 1320 agaattatgt cgcagctggc tcttgtattc agtattaaaa gactcagtgt tttgtttctg 1380 ttgcaaactg ttcagtatga aagcaattgg catatcatca ctaactcata caggttcaag 1440 tgactggaaa aacatggcgg caattctttc ttcccacgag aaaagtcctg aacatttgca 1500 gaactatcaa aagtggaaag aactacatca gcgattacag agggatagta caatagatgc 1560 agaaattttg cgcaagatga agaatgaaga aaagtattgg cagcaaatac taaagcgttt 1620 aatagcatta gtgagagttt tgggtgaaca gaatctagct tttcgcggta caaatgaaac 1680 attgtacagt gccaataacg ggaatttttt aaaatttgtg caatacttag ccatttttga 1740 tccattaatg aacgaacatc tacgaaagat atcaaataaa gagcttcata cacactatct 1800 tggcaaagat atacaaaatg aacttattca actgttagga aatgctatta agaaagaaat 1860 catacaaaca gcaaatgcaa tgaaatattt ttcaatagtt ctcgatggta ctcctgactg 1920 tagccatgtt gagcaaatga caataattat tcgttttgtt aaagttgatt cattgaaaaa 1980 ggagtttagt atcaaagaac attttttagg ctttgtacct ttaaagaaaa caactggagc 2040 ctatatggca gaaacgataa tacaacaact agaagaaatg gagttaccta ttgataattt 2100 acgtggccaa ggctacgata atggcgcaaa tatgataggc aaaaataatg gtgttcaaaa 2160 gaaaatatta aacattaatc ctcgagcatt gtttgttcct tgcagcgcgc atactcttaa 2220 tttggttgtc aatgatgctg caaattgttg tctaatggct acaagttttt ttgatattgt 2280 ccaacgtgtg tatgtatatt tttcgtcttc aacacatcgg tgggtagttt tcactagtca 2340 tcaacctacg ttaactgtta aacctctaag tgaaacaaga tgggaaagca gaatagatgc 2400 tataaagcct ttgcgatatg aacttggtaa aatatatgat gcactgctag aaatagctga 2460 tgatacttct ctaactggat cctcaggaag cactgcacgc agtgatgcta aagcacttgc 2520 gaatagtgtt gcaaagttta agttcgtggt ttcaattgtt gtatggtaca acatactttt 2580 tgaaatcaat ataacaagta agcagctcca agctaaagat ttggatattc atgctgctgt 2640 acaacagtta caacacacac aagcctattt ggttgactgt aggagtgaca taggttttgc 2700 aagaatgcta gtggatgctg ctgaaattgc taaggatttg gagataccac caacctttga 2760 ggcagaacca cgattacggc ggagaaaaaa gcaatttgca tatgaagctg aagatgagcc 2820 tgtgcaagat ccaaaacaaa atttcaaagt gaattttttc tttgctatac ttgacacagc 2880 aataaggtcg gttgaagaaa ggtttgaaca aatgagaaca attgaatctg tatttggttt 2940 cttatatcat attcatggtc tacagagtaa aacttcacaa gaaatattgg aatgttgcat 3000 gaaactagaa tctgctttac aacatggtga taacagagat ttagttgcat cagatttgtg 3060 tggcgaacta cagtctattg cacgacgact ttctgaagaa acaaagtctc ctcaagatgt 3120 ttttcgattt attctttgcc aaaacttgga agacagtcta cccaaccttt gtattgctct 3180 tcgtattttg ctgacattac ctgtttctgt cgctagtgga gagcgtagtt tttccaaact 3240 aaaattgata aagacataca ttcgatcatc catgtgccaa gacagattgg ttgggttggc 3300 aactctttcc attgagcacg aacttgcaga caagctggac ctgaaggacc ttgtgatcga 3360 ttttgcccag aaaaaggcac gcaaagtaca gttttgatat ttttaaataa ataaatactt 3420 acctggtgtt ttcactttat tttgcatttt attttgattt ttttacgtaa aaggggtgcg 3480 acctttgaag ttttgcctag ggtgtcataa cccctttgca ccggccctg 3529 // ID Sola2-2_AAe repbase; DNA; INV; 4680 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola2-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4680 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1300-1300 (2011). XX DR [2] (Consensus) XX CC >98% identical to consensus. 4 bp TSDs. TIRs are ~780 bp long. XX FH Key Location/Qualifiers FT CDS join(1411..2472,2429..3568) FT /product="Sola2-2_AAe_1p" FT /note="transposase." FT /translation="MDKVQYYEYNSLCCNPFGLDGHKSVRTNLRSISQGLI FT SKLHSNGVRWITEKMKICVKCSITGDQKALVTDLPEAMAFEEQVSPLSSLA FT SPKETPSSGESVGSINNVQKIQELAKFLEISFQVKSSRLDASSSNYRNASM FT DEMFNQILSKIKTWFPPNIVDVTEDFNSVLLNLNSAVTKAEPDKQIELLKL FT LPRNWSYAKVKAHFDVSQHVITESKKYNLGIQPLSKVGRPSHGSDVQDIVS FT NFYLRDDISRPFPGLKDTISIKLPNGVRQNVQKRLLLDPLDXLYKQYLETC FT KSEQESVSFTSFWKLKPKQCVYTKDSSAMNVCVCMIHENMKFMVDALKKTN FT CFEVHNTEKKTKKLIALKFIIQKKKLNTFLTSQMICPDSTGDCYLRSCEDC FT KLKKLDFVANRLDENNVEEVKYCFWIISPRCEIINKEENVNDFIENLKNLT FT EKFLVHQFKVDKQNXFIRAKKESLVENKEIMCQMDFAENYSCVIQDSIQSH FT YFVRPQVTIHPFVIYYKDKSSIKVLNFVVIADIKKHNTTSVYAFQTKLISR FT LKNKFPELEKIIYLSDGCGEQYKNKSNFKNVCNHENDFKIRAEWHFFPTSH FT GKGPCDGIGGNIKRMARDASIRKSAEINNAKQFFDWAVSQKVKDQFKKDWE FT FIYATENDYSEAEKLLQERFSNLVPIPGTKKYHSFIPKDERSIFASEFSDA FT HENQENTACFVLNTTKKRKSSGGSNQRQSSRLKK" XX SQ Sequence 4680 BP; 1677 A; 739 C; 767 G; 1492 T; 5 other; ggggattttt ttttattacc gtacgggttt gggccgaagg gtctcagatt ttcatgaaac 60 tttttccaca ggcagggctc atggatatat gaataaaaaa aaattgagaa aaattcaggg 120 tcgcctattt ttccggaaaa ctcaggtgga aattttttgt tttcccttga cactacttac 180 tttgaaaaat cataactcaa gaacgaagca tcgtagaaac aaagttttta tatgaaaatt 240 taagcaaatt ttctcaaaaa tccaaaaaaa atatgaactg gaaaaagttt tccacaaaat 300 tttccaccgt tgggaaaatt cgtaaagaaa agccggaaaa actatgcccg aactcgtgga 360 aaattttcaa aaaaatattt tcgagaaggt aattttataa gctttaatca ctgaaatttt 420 tggaatgcac tttttttcgt ttttgagtta tggccaattt tgtgaaaaat gtccagatgt 480 gccatataag acttwtcttt gaaaaatcat aactcaagaa cgaaacattg tagaaacaaa 540 gtttttatat gaaaatttaa gcaaattttc tcagaaatcc aaaaaaaata tgaactggaa 600 aaagttttcc acaaaatttt ccaccgttgg ggaaattcat aaagaaaagc tggaaaatct 660 atgcccgaac tcgcggaaat tttttaataa aatattttta agaaagaaac tttataaact 720 tcaattctgg taacttttag gatgtacttt tcttaagttc ctgatctatg gtcaattttg 780 taaaaaatgc aaagatttgc catacatgcc ttttcgttta aaaatcatga ctcaagactc 840 atcatgactt gtcgaaccgg attcaaatta tttcaaaaat atgaaatttt tgaaatggaa 900 tcagtttccc aggacatttt tcactgttta tgaaaacaaa taatgaaaac ccgattagct 960 attccagctt tcaaacaaag tatatttgaa aagagcgatc aataagatcc aatcctacaa 1020 tatttttcaa tacgctcatt tttcgttcct aaaacatgat caaatattgc taggatgagc 1080 tgtttgaacc atatatttac aaatttataa gttcataaac aaaatttctt aagaacgaaa 1140 ctacatagaa acaaagtttt cttcaatcag aatgtaggca attttttcct gttaggtaac 1200 tacaaaattg acagttaaat aaatgggtag acaatatgac aaataggtat ttaccaattc 1260 aagtttaaag cgttgaaatg gcttgagaat cataagttta ccaaaaacta acaaagagca 1320 gtgagaaagt gcaagtgttg tctatcagct gtatattcct cgttattawt ttttgtggaa 1380 agtgaaaaaa gttttcccga agctgttgaa atggacaaag ttcaatatta cgaatacaat 1440 tctttatgct gtaacccctt cggactggac ggccacaaat cagtaaggac aaatctacga 1500 tcgatcagcc aaggcctcat cagtaagctt cattcgaatg gagttcggtg gattacggaa 1560 aagatgaaaa tttgcgtgaa gtgctccatt accggtgacc agaaggctct cgtaacggat 1620 cttccagaag caatggcgtt tgaggaacaa gtttcaccac tatcttcgtt ggcgtcgcca 1680 aaggaaactc cttcctcggg tgagtcagtt ggttctataa ataatgtaca aaaaatacaa 1740 gagcttgcta aattcttgga aatttctttt caagtaaaat cttcacgttt ggatgcttct 1800 agtagtaact atcgcaatgc ttctatggac gaaatgttca atcaaatttt gagtaaaata 1860 aagacttggt ttccgcctaa tattgtcgat gttacagagg attttaattc cgttttatta 1920 aatttgaact cagctgttac aaaagccgaa cctgacaagc aaattgaact tctaaaatta 1980 cttcctagga attggtcata tgccaaagtt aaagctcatt ttgacgtgtc tcaacacgtt 2040 ataactgaat caaaaaaata caatttaggg attcaaccac tttcaaaagt aggacgaccc 2100 tcgcatggat cagatgtcca agacatagtt tcaaattttt acctaaggga tgatatcagt 2160 cggccatttc ctggcttgaa agataccata tctattaagt tacccaatgg agtgcgccaa 2220 aatgttcaaa aacgcctttt gcttgaccct ttggatastt tatataaaca atatttggaa 2280 acctgtaaga gtgaacagga atctgtctca ttcacatcgt tctggaaact taaaccaaaa 2340 caatgtgttt atacaaagga ttcatcggcc atgaatgttt gtgtttgcat gatacatgag 2400 aatatgaaat tcatggttga tgcattgaaa aaaactaatt gctttgaagt tcataataca 2460 gaaaaaaaaa cttaacacct ttttgactag tcaaatgata tgtccagata gtacaggtga 2520 ttgttacttg aggtcctgtg aggattgtaa attaaagaaa ttagattttg ttgccaatcg 2580 cttagatgaa aacaacgttg aagaagttaa gtattgtttt tggattattt ctcctcgttg 2640 tgaaattatc aacaaagagg aaaatgtaaa tgattttata gaaaacttga aaaatctcac 2700 agagaagttt cttgtgcatc aattcaaagt agataaacaa aataawttta taagagctaa 2760 aaaggaatca ctagttgaaa acaaagaaat aatgtgccag atggatttcg cggaaaatta 2820 ttcgtgcgtg atacaagatt ccattcaaag ccattacttt gtacgacctc aagtaacaat 2880 acatccattt gtaatttatt acaaagacaa atcttcgatc aaagtcttaa attttgttgt 2940 aattgctgac ataaaaaagc ataacacgac atcagtttat gcttttcaaa caaaactaat 3000 ttcaagattg aaaaataaat ttcctgagct tgaaaaaatc atttatcttt ctgatggttg 3060 cggagagcag tacaaaaata aatcaaattt caaaaatgta tgcaaccatg aaaatgattt 3120 taaaattaga gcagaatggc actttttccc aacctcacat ggaaaaggtc catgtgatgg 3180 tatcggtggc aatattaagc gaatggctag agatgctagt attagaaagt cagcggaaat 3240 taataacgcc aaacaatttt ttgactgggc tgtgtcgcaa aaggtgaaag accaatttaa 3300 aaaagattgg gaattcatct atgcaactga aaatgactat tcagaggctg aaaaattact 3360 tcaggagaga ttttctaacc ttgttccaat tcctggaaca aaaaaatatc attcatttat 3420 tccaaaagac gaacgaagca tttttgcgag tgaattctca gatgcacatg aaaaccaaga 3480 aaatacagca tgctttgttc ttaatactac aaagaaacga aaatcttctg gtggtagcaa 3540 ccaaaggcaa tcttcccggt tgaaaaaata gatagtttta agataaaatg twagttatgt 3600 actgaataat tgaataaata aaattaaaga aatatataaa aagaatctta tttgttccat 3660 agaaaagaaa gaatcccttg gaatggaaaa gaaacagttt tttcagcttt gcttaacggt 3720 gtaatgtttc atgaaaaata taatccaatc cgacttatac ttcattaaaa ccaatcccat 3780 atcgtttctt tcagatcttt tagaaaattt gcaattgctt tgataaaaaa aaccttgttt 3840 ttctgatgct tcgattgaaa tatgggtttt caaagaaaag gctaatatgg tcatttttca 3900 caaaattgac cataactcag gaacaaaaaa aaagtacatc ctaaaagtta ctagaattga 3960 agtttataaa gtttccttct caaaaatatt ttaataaaaa ttttccgcga gttcgggcat 4020 agattttcca gcttttcttt atgaatttcc ctaacggtgg aaaattttgt ggaaaacttt 4080 ttccagttca tttttttttt tggatttctg agaaaatttg cttaaatttt catataaaaa 4140 ctttgtttct acaatgtttc gttcttgagt tatgattttt caaagaaaag tcttatatgg 4200 cacatctgga catttttcac aaaattggcc ataactcaaa aacgaaaaaa aagtgcattc 4260 caaaaatttc agtgattaaa gcttataaaa ctaccttctc gaaaatattt ttttgaaaat 4320 tttccacgag ttcgggcata gtttttccgg cttttcttta cgagttttcc caacggtgga 4380 aaattttgtg gaaaactttt tccagttcat attttttttg gatttttgag aaaatttgct 4440 taaattttca tataaaaact ttgtttctac gatgcttcgt tcttgagtta tgatttttca 4500 aagtaagtag tgtcaaggga aaacaaaaaa tttccacctg agttttccgg aaaaataggc 4560 gaccctgaat ttttctcaat ttttttttat tcatatatcc atgagccctg cctgtggaaa 4620 aagtttcatg aaaatctgag acccttcggc ccaatttgta cgataataaa aaaaatcccc 4680 // ID BEL-71_AA-LTR repbase; DNA; INV; 678 BP. XX AC supercont1.155; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-71_AA_; KW BEL-71_AA-I; BEL-71_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-678 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.155; Positions 490446 489769. XX SQ Sequence 678 BP; 241 A; 131 C; 127 G; 179 T; 0 other; tgttggcgaa tcctcgccat cgcgccttca caaccatatt tgggacaccc tattgtcggg 60 tgaacgggac aggaatgaca gttttcactg tcatgcggtg agtcaggtga aagtacacgt 120 tccacaaact gatagttaaa cctaaattct tagcctaata gtctaaacct aagcagtaaa 180 attaagtgaa ttctaagcct atgctaaaag gtaaattgaa ctataaaatt tgttgaactt 240 atgaaactat atgggctatt tacatagtta tgcagaaacc tacaccctac atggtgtaac 300 ctaaaagtcg ttagagttac tatcgcctac cattacaact gaggtttaat agtacggact 360 aatacgacaa aatacgagga tagacatttg tacaacggtt gaagaccaac acgacacgag 420 acagactgaa ttgactacac ggagggaaac gaaatactga aatcgggact aaacgtaagt 480 tcacaataat gcttataatt atttaaaaca cagcaaacaa cataatgata cacaaactgg 540 attggaattt gataaaatta tgtaattttc tcagggaaat aataatctac cggacagtaa 600 ccaaaaccca ttctccggtc gtttcatttc cgggattatt attgcggaaa attccgagag 660 cgttggcaat ttccaaca 678 // ID Gypsy-2_AA-LTR repbase; DNA; INV; 254 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_AA_; KW Gypsy-2_AA-I; Gypsy-2_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-254 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 974-974 (2011). XX DR [2] (Consensus) XX SQ Sequence 254 BP; 80 A; 50 C; 37 G; 87 T; 0 other; tgttatatct tgaccttaaa ttacattaca taaaccttgc tactttttat acaaacctac 60 attcgtatat cttgatcgcg cataccttca aaatacaatg tctttgtttt tccttctgtg 120 cgatcagtta gaacataacg gtatttgctc gccattatga gcgcacgagg cgctgaagaa 180 atcattatta atttagaata ataaagtgat taaacaagta aaactattgt gagtactttc 240 actcggccgt caca 254 // ID Gypsy-21_DPu-LTR repbase; DNA; INV; 110 BP. XX AC scaffold_12; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_DP_; KW Gypsy-21_DPu-I; Gypsy-21_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-110 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_12; Positions 1361474 1361365. XX SQ Sequence 110 BP; 33 A; 35 C; 21 G; 21 T; 0 other; tgttgtaaag tctaccacta ggtggcctcg catcgtcctc tctcatgtcg agaccaccca 60 gacggcactg gtgaatacac agacacacac agactaccaa gactacaaca 110 // ID Copia-30_DPu-LTR repbase; DNA; INV; 252 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Copia-30_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-177 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 252 BP; 61 A; 49 C; 48 G; 94 T; 0 other; tgttggttgt agagtgcgat gtgcaatcac tccactcgta tgagaagatg cttcccattc 60 tcagtacccc gtattctcag ttgtctccct ctgctgtcaa tgtcagttta tcagttgaaa 120 tatgtttcag tattctcttc tcaataaagc agaggtaaag acttaaatct caagtggtat 180 atattttcct tctttccaac aggttatggg cccagttggc ttttgtaaat tgaagttgtt 240 ttgtatatat ca 252 // ID BEL-621_AA-I repbase; DNA; INV; 6151 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-621_AA_; KW BEL-621_AA-LTR; Pao_Bel_Ele9; BEL-621_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6151 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5217-5774] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1128..6149 FT /product="BEL-621_AA-I_1p" FT /translation="MTHHAHYDAPGNAPVSFEQLHTFQAPPNFSAAPPNEV FT QRRPSPEHLAARQVMPRDLPEFTGDPEEWPIFFSSFTNSTTACGFNNVENL FT ARLQRCLKGNALKSVRYYLLSPDSVPDVINTLQTLYGRPEVIINKLIRTVR FT ETSAPKSERLDTLIDFGMSVRNLTQHLIAAGQQAHLLNPALLQELVEKLPA FT NIKLQWAQHLSFYPEASLQNFSNFMSAIVESVSKVVVFAGGQSTVKNEKPR FT NKDKSFVHAHAESSTAGHAVKEKEEREKVCLFCGKVNHRLKDCLKFQQLQV FT EERWKNVQVLKICRSCLNPHGRRPCRNSNRCGIDGCQFRHHHLLHKNDGEA FT SKKSAGVSRVEDHAHHHHEQSILFRIIPITIHGATKSVTAFAFLDEGSSAT FT LVERSLVEELGVEGPNVPLCLKWTANMTRNEDESQIVSLEVSGIDQKNRFH FT LVNVRTVESLNLPTQTICFEELQKDHPHLRGLPVHDYKNGTPKILVGLRNL FT QLAVPLKIRKGKSGIIATKTRLGWCLYGSLDRERQSEDFNYHVCECDSEAR FT LEQLVKDYFNVEDCGVVRRSIMESDADARARKIMEDTTRRIGNRFETGLLW FT KTDFVELPDSFPMALRRLQCLERRMEKNPELKENIHRQIQEYQAKGFAHLA FT TEDELSRADPRRIWYLPLGAVFNPKKPGKVRLIWDAAAKVDGISLNSLLLP FT GPDLLVSLVSVQFLFRQYPVAVSGDIKEMFHQTRVIERDRPSQCFLFRSNP FT SVEPSVYIMDVLTFGATSSPTSAQFVKNQNAQEFGDKCPRAADAIIKRHYV FT DDYLDSFETIEEARTVSSKVKWIHAQGGFEIRNWSSNEKAVLEHLGDQPRK FT ETKDLSLKDDTERVLGMLWSTGEDILCFSTTFKDEIAGLIETETKPTKRQV FT LKCVMSLFDPLGLLACILVHGKIMMQNIWRSGIKWDDFIDEITYEAWRKWI FT GLLDEVNDIRIPRCYFQNAGAELYLSLEAHVFVDASEAAYAAMVYFRIVEQ FT DGTGKCALVTAKTKVAPLRYVSIPRMELMAALLGARLFLFVRENHSVKINR FT VVYWSDSEVTLAWIRSEHRKYRPFVACRVGEILLSTNVNDWRYVPSKQNVS FT DEATKWNDGARLSAESRWFNGPTFLQLPEKEWPKSKKNITATDEELRACYL FT HQEANVLRTPIEYIRFSNWNRLLRATAYIVRFVNLRNAVEKKSSETLTHEE FT LKAAEILLWRLVQSEVYPDEIAILFKNKTRKVGDRLELEKSSPLWSLTPML FT DDNGVLRIDSRITEAQRVAEDVKYPVILPRKHHVTSLIVNDYHRRSLHGNS FT ETVVNELRQRYYIAQLRTLVRKVEQQCQWCKVYKAKPSIPRMGPLPQARLS FT PGIRPFTYIGVDYFGPILVKVNRSVAKRWVCLITCLTVRAVHLELAHDLST FT KSCIACLRRFIGRRGSPKEIYSDNGRNFIGANRVLRDQIHLIEQDVATTFT FT NTETKWIFTPPSCPHMGGSWERMVRSVKKAMASLPQDNKLDDEGLQTVLVE FT AEAIVNSRPLTYLPLNSAEQEALTPNHFIMGSSSGIKQPTIAIQESASRIT FT ASWDLIQRRTNHFWRRWVLEYLPTLTKRTKWFNEVKPIKIGDLVIVIDETR FT RNGWLRGRVLEVTAGKDGRVRQAIVQTSGGLFRRPVSKLAVLDVVGCSEGD FT PHGEG" XX SQ Sequence 6151 BP; 1813 A; 1415 C; 1485 G; 1437 T; 1 other; aacttcaaga tttaatatcg acatggaaag acataccgct accccaaccc ctcgtccggg 60 gtacagttgt gtcgcttgta agcgaccgga ttcagcggac aatatcgttg cttgcgataa 120 atgcagtttg tggtggcatt atacatgcgc gggagttagc gattccgtta aggaacagga 180 atggacttgc aggaattgtc ttcctgttcc cacaatacca caatcagtgc gatcgactac 240 atcaagtcgc agagcgattc tccagatgaa acggctagcc gaacagcaag aattagagag 300 aaaacagttg gagttggaga agaagcaact ggaacttcag aagaagcact cgcaggagaa 360 atttgatctg gaggagacta tagcggaaga ggaagataat cgcagtgtca gaaaccgcat 420 actcgagatt gagtcacgag ataagcaggt atccacgtgg attaatcagc atgcccccgt 480 aatacaacaa caagtcgacc agtcttcaca tcccgcagca gcgcttactg cttttccacg 540 ttcgatgcaa tcctacacca acaaccaagc aaatgatgag caagatatac atggcgccgt 600 aggaggaaac gatccctcga tccaatacgg gcgacgtaaa aatcctactc cccacgaaat 660 ggagaaagcc gagcaagaag tgcaaactga attccagcag ctacagcaac agctgcgaac 720 aatgccaaca gcagtcggaa ccgacgctcg cccaaataca ggcgcttgaa acacagttgc 780 aaagattccg gattcaagtt caacaacgaa atcactctac tcctcttgga cttcctgcgc 840 gaagcacatt taaaggtgcg atccctaaag caaaggaagc aggacaaatc aacgaaatac 900 aacgatcgtt tgagaagtcg ctcccaacga cagtctcgct gaaccaatcg cggcagaatg 960 tattgccaaa tcgttcggca gataagctta tttcgttcga gcagacgcga acgtatcacg 1020 cgcactcaaa tccaatctta tctgagcggg tgacagacga acagccgcgt accctgcaag 1080 cctcatcaac tattttccct cctggatcga ttccatgccg accatcgatg acacaccatg 1140 cccactatga tgctcctggc aatgccccag tttccttcga acagttacat acgtttcagg 1200 cacctccgaa tttctcggcc gcaccaccaa atgaggttca gcgtcgaccg tcaccagaac 1260 acctcgcagc acggcaggtg atgccacgtg acttacctga attcacaggc gatccagaag 1320 aatggccgat tttcttcagc agctttacaa attccacgac tgcttgtggg ttcaacaacg 1380 ttgaaaatct tgcccgcttg caacgctgcc ttaaaggaaa cgccttaaag tccgtgcgat 1440 attacctttt gtcacccgac tctgtacccg acgttattaa tactctccaa acgctctatg 1500 gccgcccgga ggttattatc aacaagctga ttcgaactgt gcgcgagact tcagcaccaa 1560 agtccgagag actggacacg ttgattgatt tcggaatgtc cgttagaaac ttgacgcaac 1620 atcttattgc agcgggacag caagctcacc tactgaaccc ggctctacta caagaactgg 1680 ttgaaaaatt accagccaat attaaactcc agtgggccca gcatctgtca ttctaccctg 1740 aggctagcct gcagaatttc agcaacttta tgtcagcaat agtcgaatcg gtcagcaaag 1800 tggttgtatt tgccggcgga caaagcacag taaagaacga gaaaccgcgg aacaaagaca 1860 aaagttttgt tcatgctcac gccgagagct caacagctgg tcatgctgtg aaggaaaaag 1920 aagaacgcga gaaagtatgc ctattttgcg ggaaagtgaa tcatcgtctt aaagattgtt 1980 tgaagttcca acaacttcag gtagaagaac gatggaaaaa tgtccaggtc ttgaagattt 2040 gcagaagttg tcttaatcct cacggacgtc gcccatgccg aaattcgaat cgctgtggaa 2100 tcgacggttg tcaattccgt catcatcact tgttgcacaa aaacgatgga gaagcttcca 2160 agaagtcggc tggggtttca cgagtagaag atcacgcaca tcatcatcat gagcaatcca 2220 ttctttttcg catcattccc ataacaatcc atggggccac aaagtctgtt actgcgttcg 2280 ctttcttgga tgaaggttct tctgcaacac tcgttgaacg cagtttagtc gaagaacttg 2340 gggtggaagg accaaacgtg ccactttgcc tcaaatggac agctaatatg acaaggaatg 2400 aggatgagtc acagatagtg tcacttgaag tatccgggat agatcaaaag aatcgattcc 2460 atttggtgaa cgtacgtacg gtggaaagtt tgaatctgcc tactcagacg atatgttttg 2520 aggaactcca aaaggatcat ccacacctta ggggcctccc tgtccatgat tacaagaacg 2580 gcacccccaa gatactcgta ggccttcgaa acttgcagtt agcggtacca ttgaagatca 2640 ggaaaggaaa aagcggcata atagcgacca aaactcgttt gggatggtgc ctgtatggca 2700 gcctggatag agaacgacaa agtgaagatt tcaactatca cgtatgtgag tgtgattctg 2760 aagcaagact agaacagcta gtcaaagatt atttcaacgt tgaagactgt ggagtagtgc 2820 gtcgaagtat tatggaatcc gatgcggacg caagagcacg aaagatcatg gaggacacca 2880 cccgcaggat aggaaaccgt ttcgaaactg gtcttttatg gaagactgat ttcgtcgagc 2940 tgcctgatag ttttccgatg gctttgcgcc gtctgcagtg tctggagaga cgcatggaga 3000 agaatcccga actaaaagaa aacattcatc gtcaaattca ggaatatcaa gcgaagggtt 3060 ttgctcacct ggcaacggaa gacgaacttt ccagggcgga ccccagacgg atctggtacc 3120 tcccactagg agcagttttc aatccaaaga aacctggaaa agtgcggcta atctgggacg 3180 cagcagcaaa agtggatgga atctcgctga actcactatt actgccaggt ccagaccttc 3240 tcgtatcgct ggtatcggtt cagtttcttt ttcgtcaata cccagtagcc gtcagtggag 3300 acatcaagga aatgttccac cagacgagag ttattgagcg agatcggcca tctcaatgct 3360 ttctgttccg cagtaacccg tcagttgagc ctagcgtcta cattatggac gtgttgactt 3420 tcggagccac tagctctcct acgtcggctc aatttgtaaa aaatcaaaac gcacaggaat 3480 ttggagataa gtgtccaagg gcagcagatg ccataataaa aaggcactac gtcgatgact 3540 acctcgacag ttttgagaca atcgaggaag caaggacggt atcaagcaag gtaaaatgga 3600 ttcacgcaca aggaggattt gaaatccgaa attggtcgtc gaatgaaaaa gcggtattgg 3660 agcatcttgg tgaccaaccg aggaaagaaa cgaaggattt atcattgaag gacgatacag 3720 aacgtgttct aggaatgttg tggagtactg gagaagacat tctatgtttt tctacaacct 3780 tcaaggacga aattgcaggg ctaattgaaa cggaaacaaa accaaccaag cgtcaagtac 3840 tgaaatgtgt catgagcctg tttgatccgt tggggctgtt ggcgtgcatt ctagtacatg 3900 ggaagatcat gatgcagaac atttggcgca gtggcataaa atgggatgac tttatcgatg 3960 aaatcactta cgaagcgtgg aggaaatgga ttggactatt agatgaagtc aatgacattc 4020 ggattcctag gtgctacttt cagaatgcgg gggccgaatt gtacctttca ttagaggctc 4080 atgtttttgt cgacgctagc gaagcagctt atgctgctat ggtgtacttc cgtatagttg 4140 agcaagatgg cacagggaaa tgcgctttag tgacagctaa aacaaaggtt gccccgttgc 4200 gttatgtttc tattccacgg atggagctaa tggccgcact tttgggagcc cggttgtttc 4260 tgttcgtccg tgaaaatcac tccgtcaaaa tcaaccgagt agtctattgg tctgactcag 4320 aagtcaccct ggcttggatt cgatctgaac atagaaaata tcgcccgttc gtagcctgtc 4380 gcgttggaga aatcttgctc tcgaccaatg taaacgattg gcgatacgtt cccagcaaac 4440 agaatgtttc agatgaagct accaagtgga atgacggagc aaggcttagc gctgaaagcc 4500 gttggttcaa cggcccaaca tttctgcagt taccggaaaa ggaatggccg aaatcgaaga 4560 agaatatcac ggccacggat gaagaactga gagcatgtta cctacatcaa gaagccaatg 4620 ttctgcgcac accaatagaa tacattcgat tttccaattg gaatcgattg cttcgagcca 4680 cggcgtatat agttcgtttt gtgaatcttc gaaacgctgt ggaaaagaaa tctagcgaaa 4740 cattgactca cgaagaattg aaagcagcag aaattctact ttggagacta gtgcaatcag 4800 aagtctatcc ggatgagatt gcgattctct tcaaaaacaa gaccaggaaa gtcggcgaca 4860 ggttagagct tgagaagtca agccctcttt ggagtttaac tccaatgctt gatgacaacg 4920 gagttcttcg aatcgacagt cgcattacgg aagctcaaag agtagctgaa gacgtgaagt 4980 atccagttat actacccaga aaacatcatg ttaccagcct tatcgtgaat gactaccaca 5040 gaagatccct acacggaaac tctgaaacag tggtcaacga attacgtcag cgatactata 5100 ttgctcaatt gcgcacccta gtaagaaaag tagagcaaca gtgtcaatgg tgcaaggtgt 5160 acaaagctaa gccttcaatt ccaagaatgg gtcctctccc gcaagcacgt ttatctccag 5220 gaatacgccc attcacctat atcggcgtcg actattttgg gccaatcttg gtcaaagtga 5280 atcgatccgt agctaagcga tgggtgtgct tgattacgtg cctaacagtc cgtgccgtgc 5340 acttggagct ggcgcatgat ctttcgacga agtcctgcat tgcctgcctc cgtcgtttta 5400 ttggtcgcag gggttcaccg aaagaaatat attccgacaa tggacggaac tttattggag 5460 cgaatcgcgt tctaagagac caaattcatc tcatcgaaca agatgtggcg acgactttca 5520 ctaatacgga gacaaagtgg attttcactc ctccgtcctg tcctcacatg ggcggttcgt 5580 gggagcgtat ggtacgctca gtaaagaaag ccatggcaag ccttccgcaa gataacaaac 5640 tcgatgatga agggttacag accgtcttag tggaagccga ggccattgtt aactcgcgac 5700 cacttaccta cttaccgctc aactcggctg aacaggaagc actaacscca aaccacttca 5760 tcatgggaag ctcgtccgga ataaaacaac caacaatagc tatacaagaa tctgcgagca 5820 gaattaccgc gtcttgggat ttgatacagc gacgcactaa ccacttctgg agacgttggg 5880 ttctggaata tctgcctaca cttaccaaac gtacgaagtg gttcaacgaa gtgaaaccta 5940 ttaaaatcgg agatctggtg atcgtgattg atgagaccag aagaaatggc tggctacgtg 6000 gtcgtgttct agaggttaca gcgggcaaag atggtcgagt taggcaagcg atagttcaaa 6060 catctggtgg attgtttcga cgacctgttt ccaagttggc agtgctcgac gtagttggat 6120 gcagtgaggg ggatcctcac ggggaggggg a 6151 // ID PAO_LTR repbase; DNA; INV; 629 BP. XX AC L09635; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 12.07, Last updated, Version 1) XX DE Bombyx mori retrotransposable element Pao. XX KW LTR Retrotransposon; Transposable Element; KW Long terminal repeat (LTR); PAO; PAO_I; PAO_LTR; KW Repetitive element; retrotransposable element; KW reverse transcriptase. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-629 RA Xiong Y., Burke D.W. and Eickbush H.T.; RT "Pao,a retrotransposable element from Bombyx mori with a highly RT divergent reverse transcriptase domain and unusual long terminal RT repeat."; RL Unpublished (1993). XX DR GenBank; L09635; Positions 1 629. XX SQ Sequence 629 BP; 234 A; 130 C; 128 G; 137 T; 0 other; tgtgcaggac gaaatagggt tttttatagt ttaagaaaca aatagattta atttaaccgt 60 taataagtga ttacgtaata aatgtataat cttaagttat tttatagtca atatgttagt 120 gtatcagatc agagaatgta aatagaccaa taggaacgat ttgtggagcg ctttcactac 180 ggtaacatac agatcaaggg acccccgaaa gtggcgcgtc gatagccggt cgcgagatgg 240 ggtactgaaa taaatataaa aggatggcca tttataggca aaaacagttc tcagtggatc 300 aagacagaaa cagttcccag cgaatcttcg acaaagaaac atcagcaaga tagaaacagt 360 tctcagcaga tcatcaagac agaaacagtt cccagtgaat cttcgacaaa gaaacatcag 420 caagacagaa acagttacca gcagttctcc gatagtacag ttaccaccag tgaatcagct 480 accagtgaac cagacagtta cctgtgaaac ctcgctacag aagccgaaac ccatcgaaag 540 aaacagccgt cttctaccag cgcacaaaca gacagcaaaa accattatta cacgactcgg 600 aaacagccat acagaccggg tcgtgtaca 629 // ID Kolobok-5_HM repbase; DNA; INV; 2833 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2833 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2063-2063 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(637..1464,1428..1823,1802..1999,1896..2150) FT /product="Kolobok-5_HM_1p" FT /translation="RYNKLLYFKKQLLDNIGGCPECNSCIVFTDDNDKRSG FT FSHKLNFKCTKCKFQYCTYTSDDVKKSVPKQGRNYFDINLRMVIAFREIGK FT GHQALVDFARIANTSCMNNNAFNKLNSTIQNAYKLSAEQSTKAAAIKVKHK FT VKEEHIGSGKKLIXVSSDGSWQKRGHVSLNGVVTTISEGRCIDFEVFXKYC FT RGCXMWKRKQNHPGYTEWKANHICSLNHTASSGAMESAGAVKIFNRSVEKY FT DLIYKYYLGDGDSSXFNDVXSSNPYGKYQVSNRKSSIWKVSSIEPEKLECV FT GHIQKRIGTRLRNRRKDRHHGNQLTGKGKLTECASVNTIQNYFGMAIRQAA FT ANNSLSENEKVYQMKKNISAVLYHCTNFPDQFHRHVLCPVGPNSWCKWKKG FT KNSDSGNSYLHTKSQLFTYQKLNLPVWIYDIIKKDFXELSDDDLLKKCIHG FT QTQNXNEGLNNIIWSRCPKNIFVHKASCSKNAFMVKHKIXMKVLIILYGLD FT AQKIFLFTKLHVLELGVNSAVLYFNEGTAGVQRVMSHLNLENGVKFAVSSF FT CRKRPKTRCQT*" XX SQ Sequence 2833 BP; 964 A; 392 C; 493 G; 974 T; 10 other; ggtggttcac caccaaaaaa agctaaattt ttatcaaaag ttggtttttg agattttttt 60 tgttttatag taattcaaca ttgctgaaaa tgtttctgga ttctgttttc gtatatcttg 120 agtatttttt gaagtatttt gctttaatta gaccttatca gataacttta ccatagcaac 180 ggcccctagc aacgcaaact tatgtttata attgccctct ctctttgcct tgatttaagt 240 ttttttgtta gttacgcatt agtttagtag gccagcatgg tttgacaaaa gtttagaatg 300 taacagtcta tggaagttgc tatttttgta tgatttatct gaagccatct tgtgataatt 360 tattgttgta tttgaaaagc tttgtatttt tcttattatt tagtataact tatcagtttc 420 atattatata ttttaaaata ttaggatatt atatataatg ggaagcaaaa aaaaaaagag 480 ttcaaaaaga aaaagagcat tttgtggcaa ccagcggaca caaaacccct accctaaaag 540 aagacgaaaa cttgtctgag catgtttcat gtgttagtgc cagcaaagtc tcagtttaac 600 gaaagctgat aattatatta attcagaaaa ttataacgtt ataataaact tctctatttt 660 aaaaaacagt tacttgataa tattggcggt tgcccagaat gtaacagttg tattgttttt 720 acagatgaca acgataaacg aagtggattt tctcacaagt tgaattttaa atgtactaaa 780 tgtaaatttc agtattgtac atatacatct gacgatgtta aaaaatctgt acctaaacaa 840 ggcagaaatt attttgatat taaccttaga atggttattg cttttcgtga aattgggaaa 900 ggacaccaag ccttagtgga ctttgcccga attgcgaaca cttcatgtat gaataacaac 960 gcatttaaca aacttaatag cactattcaa aatgcatata aattatcagc agaacaatcc 1020 acgaaggcag ctgcaataaa agttaaacat aaagttaaag aggagcacat tggttcagga 1080 aaaaaactaa ttaragtttc ttcagatggy tcatggcaaa aacgcggtca cgtttcacta 1140 aatggtgtgg tgaccactat ttcagaaggg cgatgtattg actttgaagt ctttwgcaag 1200 tattgtaggg gatgtaraat gtggaaaaga aagcaaaatc atccaggtta tactgaatgg 1260 aaagctaatc atatttgttc cttaaatcac accgcatcgt caggggctat ggaaagtgct 1320 ggtgcggtaa agatttttaa tcgttctgta gaaaaatatg atttgattta taaatactat 1380 ttgggagatg gagattcctc tkcatttaat gatgttstta gcagtaatcc atatggaaag 1440 tatcaagtat cgaaccggaa aagttagagt gtgttggtca tatccagaag cgcattggta 1500 caagattacg taatcgaaga aaagatcgac atcatggtaa tcagttaact ggaaaaggaa 1560 aattgacaga atgcgcaagt gttaatacaa tacaaaacta ttttggtatg gctattagac 1620 aggctgcagc taataattcc ttaagtgaaa atgaaaaggt ataccaaatg aaaaaaaaca 1680 ttagcgcagt tctttaccat tgcactaatt ttcctgatca atttcatcgc cacgtgttat 1740 gtccagtggg tcctaatagc tggtgtaaat ggaagaaagg taaaaactca gattcaggta 1800 acagctattt acataccaaa agttaaattt accwgtatgg atwtatgata ttattaagaa 1860 agactttgaw gaattaagtg atgacgacct tttgaaaaaa tgcattcatg gtcaaacaca 1920 aaattstaat gaaggtctta ataatattat atggtctaga tgcccaaaaa atatttttgt 1980 tcacaaagct tcatgttctt gaattgggag tgaactcagc tgtattgtat tttaatgagg 2040 gaactgctgg tgttcagaga gtgatgagtc acctcaatct tgagaatggt gtaaaatttg 2100 cagtgtcctc attttgtaga aaaagaccaa aaacgcgttg ccaaacatga ataaaaaaga 2160 cacttttgaa aaaaaaagaa gaaggataca tttaaaatca attaacaaag gttggcaaga 2220 tgaagaagaa caactggaaa cctttaaacc atactactca tctggaagtt tttgatattt 2280 ttattagtta taaaatgttt attgttgttt ttcttcattc acttttttgt gcaattttta 2340 atttttgaaa tatcataact tttaaatgac taaagctttt tgcttcaaat tttcatgaca 2400 tgttctacgt gtaatgatgg agggcctgaa ccaaaaatat tgttggattt gataaattat 2460 ttacattttt ctatttcatt tcattagcca accccccaaa ttttgatgtt tttgccaaaa 2520 aaaatcaaaa atttcaactt ttactaagaa tatagcagtg attttggttc aggcccttcg 2580 ttttatgtgc atcaaactgt ttgcaaaata taaagatctg atttcaaatg gtttccaagt 2640 tatcgttttt caaaattttg ttcaaatttg actgattttg caatcttttt gacatatttt 2700 cgcgatttta ggaataaatt gttgtaaaat gacacttagg ttgtttttta ttttcatata 2760 ttatgctttt atacactttt taatttgaaa ttatgaaatt tggggatttt ttttttttta 2820 gtggtgaacc acc 2833 // ID Gypsy-88_AA-I repbase; DNA; INV; 5989 BP. XX AC supercont1.20; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-88_AA_; KW Gypsy-88_AA-LTR; Gypsy-88_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5989 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.20; Positions 2604081 2610069. XX CC 'CCACT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1744..3579 FT /product="Gypsy-88_AA-I_1p" FT /translation="MDSFRSIRPFDTKTDQSQLATEWRKWKRSLEYYLAAS FT GITGQREKRNQLLHLGGPDLQDIFDNLPGVHDIPHVAPDPPFYDVAITKLE FT AHFQPCRRRTYERHVFRQIAQQPGERFGDFVMKLRVQANRCDFDLEGTSVA FT ESMIIDQIAEKCVSSALRKKILERDRSLEEVVVIGKTMEDVELQCKEMSQK FT EKPATVMETVHKVNAQNYFARKISHQPPGYRTPGFSGDLRRQYAYPLSRFR FT SDDKANWNVQSNSRPSDQLPSDRRFAIDNRICFGCGRRGHLRGSPACIAKG FT AQCLKCRSFGHFAKWCTKRLNDASNETVPPTKRIKAVYEESTQDRSIDSKT FT ANKVDEDICYVMGANVFRFKVGGVGIEMTIDSGAAANLIDFNTWEKLCAKG FT ATLSFSTHADRSFKAYGSSVPLNIVGMFSALIEADESKVDAVFYVAKNGGQ FT SLLGDETAKKLKVLKIGYNVGSVQHSEMFPKIKGVLVEIPVDPNVRPVQQP FT YRRAPFALESKIAEKLQSLLDRDIIEKVDQPSAWVSPIVPVLKDNGEIRLC FT VDMRRVNRAVLRETHPLPIIEELFGGINGAIRFSKLDVKEAYHQVEIAERS FT REVTTFISKQGLFR" FT CDS 3701..5968 FT /product="Gypsy-88_AA-I_2p" FT /translation="MFGISCAPELFQKVMESIIAGLDGVVVYLDDLMVSGR FT TQKEHDSRLALLMERIQEYGILLNEEKCVFNASRIEFVGHELSSEGIRPME FT SKVAAIASFRTPANVSELRSFLGLITYVGRFIPFLADKTEVLRNLLRIGEK FT FNWRDEHSKAFEEIKTAICNSKCLGYYDPKDQCIVIADASPVGLGAVLLQQ FT NKAGQKRIISFASKALSDTERKYFQTEREALALVWAVEKFQLYLLGKAFKL FT VTDCKPLHFLFKERTKPCARIERWVLRLQTYDYEVIYEPGSSNLADVLSRL FT SVSEPTEFSVTDDSCIYQLALADIPNAITVQEVSQETLKDEVLQKVIESLA FT TGIWSESVKEFKVFASELCVVQDLLLRGDRLVIPRSLRNQTLEIAHESHPG FT MVAMKRRLRQKVWWPQLDKQVEDFVKKCKACTLVSTPNHPEPLIRTKMPED FT AWTDLAIDFVGPLPSGHNLLVIIDYFSRFTEIIIMKQITASLTVKALHETF FT CRFGMPASIKTDNGPQFISNELRDFCNQYGIDHRRTTPYWPQANGEVERIN FT RSIGKRLKISQETADSDWQWDLRNFVLMHNSTPHSTTGVAPSSLMFGRRLK FT DKLPGLMLKGNSVLEEIRDRDHIKKTKEAEYADKRRMAKPDELTVGDTVVL FT KRVQKDNKLSTTFDPEEYMVIDRRGSDCTLQSKESTRIIHRNVSHVKKLFS FT ATNNSMEQQEEPNISDTRDNELVLNSKPGSDVDKLRPRRESKKPLYLKDFE FT ISNVE" XX SQ Sequence 5989 BP; 1898 A; 1011 C; 1419 G; 1661 T; 0 other; ttggctgacg aggtggaaac gagatttttc tcgttcaatt tcagtgaaat actgcccgcg 60 caaggttttt tttatccgat ataatttttc ggagatttgt aagtgaatct gtggtcgatt 120 gctgaagtaa gtgttggaaa ttgaaaaatc ttctgaggtg catttttgtg tggaaaaggt 180 ttcaaaatgg cgaaagagag gcaataagag tggtgagggc atagcaccgg taacggcgtc 240 acaatgtttt tggttatgtt gtggatgatt ttgaattaag gtgaacgcaa acctggtgaa 300 caagaacgga gaaaatgaaa aggtgcattg aataatttgt acgaatgctg gggagtgatt 360 gtatttggat caattttagt gatggaggta tatggacgat agaaagaaaa aaaagtgaag 420 cttttcgtcg atgtgcagat gtcaaacaaa atggcgttgt tgagaagaac ttggtgttat 480 tttgaacgca gatattaatt ctatcgaaga acgaggtcac atagaaaaaa gaaaatggtg 540 gcggataaat ggagctgtgt ttaaaaaaat gcatttggcc caacgggtat aattagtgcg 600 tagatcgttt tgtgctgagt agtcatttcg tttggtgagt tttagtgaag gagatgaacg 660 gatgaaaaag aaaagcaatt atgaaagaat gagaaaggtt aacatcatat tcaaatggga 720 aatcgcagat ggtcaataag aaaacgttcg aaaggaattt atgttttacc gatgaaattt 780 attttattga gaacgtggtt aaagaacggt gacaagtaaa agaaactgtg gttatgtttt 840 tccgaaaagt gttaaatggc catgatacat gatcggtaga tctcagggca gattttatgt 900 ggcattaatg tgattgctgg ctgagaggaa caaatttatg aactgtggca tagagcatgt 960 ttacgtgtat gtttacgtgc atacattgta atgaactaca gagtttccct ggtaagtctt 1020 tatttttttt tcctctttcc atgtctaagg atgcagaagt attgacatga ttgtctacga 1080 ttattgacat aaagtgagta aaataataaa aacaatagac gacagttact gcaattttaa 1140 ggttaactga atgatttgtg agatttgagg tgaaatgagt cgcgaactgg attcacttct 1200 cagtttaatt gagtgttggt ttgatatgat ctaaagtgaa atgagtcgag aactggattc 1260 acttaggata ttttcaggtt ttaatctgat gtggaatgag tcgagaactg gatccacatc 1320 cttcaaaatg aaatatcgtt gtaagtatta tacagaatgt tgaaataagt ggaatggagt 1380 acatcgaaca agaagtgaaa tgagtcgaga actggattca cttcatatat agcaaatgca 1440 ttagaagagg ttgaatcaag aaatgtgttc attgaaaaga gatggatgat ttggaataaa 1500 tgtgttttaa cagtgaaata caataaaaca gaacagagat tgttttttag tctcttctga 1560 aatatctgct gagtagggtg ccctcttttg ctgtgttttt atgtattcat gttattcttt 1620 tgcaattgcc ttttcgcttt aacatttatt ttattggata taaaatgcac ccagaagata 1680 ttttaaaaat atttaaataa tatttaaaaa atatatatac tcaaataaaa cattttgaag 1740 cagatggatt cattcagatc gatcagaccg ttcgatacaa aaactgacca gtcgcaactc 1800 gcgactgaat ggagaaaatg gaaacgcagt ttggagtatt acttggctgc aagtggcatc 1860 actggtcaac gtgaaaaacg gaatcaactt cttcatcttg gaggtcccga tcttcaagat 1920 attttcgata atctacctgg agttcatgac attccacatg ttgccccgga tccaccattc 1980 tacgacgttg caattacgaa actcgaagcg cattttcaac catgtcgtag gcgtacatac 2040 gaaagacatg tcttcaggca gatcgcacag cagcccgggg aacgatttgg ggatttcgta 2100 atgaaattac gggttcaagc aaatcgttgt gattttgacc ttgaggggac ttcagtcgcg 2160 gaaagcatga tcatcgacca aattgcagag aaatgcgttt catctgcact ccggaagaaa 2220 attttggaaa gggaccggtc tttggaagag gttgtggtta ttgggaagac aatggaggac 2280 gtggagctac aatgcaaaga gatgagtcaa aaggaaaaac cggcaactgt tatggaaacc 2340 gtacataaag tcaacgctca aaattatttt gctcggaaga tatcacatca accacctggt 2400 tatcgtactc ctgggttcag tggagattta cgtcggcagt acgcatatcc actatctcga 2460 ttccgttcag atgacaaagc taattggaat gtgcaatcaa attcacgtcc ttccgatcaa 2520 cttccgagtg atcgacgatt cgctattgat aatagaatct gttttggatg cggacgtcga 2580 ggacacctgc gaggaagccc tgcttgcatt gcaaaaggcg cacagtgcct aaaatgtcga 2640 agttttggac attttgctaa atggtgtaca aagagattga acgatgcctc gaatgaaaca 2700 gttccaccta ctaagcggat caaagcagtg tacgaggaaa gcactcaaga cagatccatc 2760 gacagtaaaa ctgcaaataa agtcgatgaa gatatctgct atgttatggg tgccaacgtg 2820 ttcagattca aggtgggtgg agttggaatt gaaatgacta ttgactcagg agctgcagct 2880 aatctcatag acttcaatac ttgggaaaaa ctatgtgcta agggtgcgac attgtcattc 2940 agtacccatg cagatcgctc gtttaaggca tatggttcct ctgtacccct caacattgtc 3000 ggtatgtttt cggcgttaat tgaggcggat gaaagcaaag tggatgctgt attctatgtg 3060 gcgaaaaacg gtggccagag tcttctagga gatgaaacag ccaaaaagct aaaagttctg 3120 aaaataggat acaatgtggg ttccgtgcaa cattcggaaa tgtttccgaa aatcaaaggg 3180 gtattggtgg aaattccggt tgatcccaat gtgaggccag ttcaacaacc atacagacgt 3240 gcaccgtttg ctctggaatc aaaaattgct gagaaactgc aaagcttgtt ggatcgagat 3300 atcatcgaga aagttgatca accttccgca tgggtgtctc ccattgttcc agttctcaaa 3360 gacaatggag agattcggct gtgcgtagac atgcgccgag tgaacagagc tgtgctacga 3420 gaaacgcatc cattgccgat tatcgaggaa ctgtttggtg gaataaatgg agctattcga 3480 ttttcaaagc ttgatgtcaa ggaagcgtat caccaagtag aaatcgccga acgttcgcgg 3540 gaagtcacta ctttcatttc gaaacagggt ctgttcaggt aaatggccca tttctgtata 3600 cgataaatga attgatattt gaagttgttt tgtttcctgt tttttttttt ataaaaataa 3660 aattcaataa aatgcttttt gatacagatt taaacgacta atgttcggaa taagctgtgc 3720 tccagagtta ttccagaaag tgatggagtc tataattgct ggattggatg gagttgtcgt 3780 ctacctagat gatttgatgg tttcaggtcg aacgcaaaaa gagcacgata gcaggctggc 3840 tcttctaatg gagcgtattc aggagtatgg tattctttta aatgaggaaa aatgtgtttt 3900 caatgcgtct cgaatagagt tcgtaggaca cgaattgtct agtgaaggta ttcgtcccat 3960 ggaaagtaaa gtagccgcta ttgcttcttt tagaactccg gcaaatgtat cagagctgcg 4020 aagtttttta ggcttgatta catatgtcgg acggtttatt ccatttctgg cggataaaac 4080 tgaagtctta cgaaatttgt tgcgtatcgg ggaaaagttc aattggcgag atgagcattc 4140 aaaagccttt gaagagatca aaacagccat ctgcaactca aaatgtcttg gttactatga 4200 cccgaaagat caatgcattg ttatagcaga tgcaagtcca gtcggattgg gagccgtact 4260 tctacagcaa aataaagcag gtcaaaaacg tatcatttcg tttgcgagca aagctttgtc 4320 agacactgaa cgaaaatact tccaaacgga aagagaagcc ttggctctag tttgggctgt 4380 cgaaaaattc cagctgtatt tactcggtaa agcgttcaag ttggtaaccg attgcaaacc 4440 actacatttc ctgtttaagg aaagaaccaa accgtgtgca cgaattgaga gatgggtgtt 4500 gaggctccag acatacgatt atgaagtaat atacgaacca ggatcaagta acttggcaga 4560 tgttttgtca cgattgtccg tgtcggaacc aacagagttt agcgttactg acgatagttg 4620 tatctatcaa ctggccctag ctgatatccc caatgccatt acggttcaag aagtatcgca 4680 ggaaacatta aaagatgagg tgttacagaa ggttatcgag agtctagcaa caggaatatg 4740 gagcgaatca gttaaggagt tcaaagtatt tgcatccgag ctatgtgtcg tacaggattt 4800 attactccgg ggcgatagac ttgtaattcc aagatctttg aggaaccaga ctcttgaaat 4860 tgcccatgag tcgcatccag gaatggttgc aatgaaaagg agactgagac aaaaagtctg 4920 gtggcctcag ctagataaac aggttgaaga cttcgttaaa aaatgcaaag catgtacatt 4980 agtatcaact cctaatcatc ctgagccgct aattcgaacc aagatgcctg aggatgcttg 5040 gactgatctg gcaatcgatt ttgttggacc tctcccatcc ggacataatc ttctagtgat 5100 cattgactat ttcagtcggt ttacagagat tatcattatg aaacagatca ccgcaagttt 5160 gacggtaaag gcgttgcatg aaactttctg ccgtttcgga atgccggcgt ccatcaaaac 5220 tgacaacggc cctcagttca taagcaacga acttcgagat ttttgcaacc aatatggtat 5280 tgatcataga agaacaaccc cgtactggcc acaagcaaac ggagaagtcg agagaataaa 5340 tcgatcaatt ggcaagcgtt taaagatcag ccaagagacg gctgattctg attggcagtg 5400 ggacctgaga aattttgtct tgatgcataa ttcgacaccc cattcaacaa ctggcgttgc 5460 accatcgtcg ttaatgtttg gaagacgact aaaggacaag ttacccggtc tcatgctcaa 5520 agggaattcg gtactcgagg agattaggga ccgcgatcac attaagaaaa cgaaggaggc 5580 ggagtatgca gataagcggc gtatggcaaa gcctgatgaa ctaaccgtgg gagatacagt 5640 tgttttgaaa cgagtccaga aagacaacaa actctcaaca acgtttgatc cagaggaata 5700 tatggtcatt gatcgtaggg gctcagactg tacactacag tctaaagaat ctacaagaat 5760 catccaccga aacgtgtcac atgtaaagaa gctattctca gcaacgaata atagtatgga 5820 gcaacaggaa gaacccaaca ttagtgatac tagggacaac gagctagttt tgaattctaa 5880 gcctggatca gatgttgaca aactgcggcc acgaagagaa tcgaagaaac cgttatattt 5940 gaaagatttc gaaataagca atgtggaata aataaaagta agggaggaa 5989 // ID WORF_DMi repbase; DNA; INV; 4174 BP. XX AC AY144572; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 15-MAR-2011 (Rel. 9.06, Last updated, Version 2) XX DE Drosophila miranda non-LTR retrotransposon worf, complete DE sequence. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW non-LTR retrotransposon worf; WORF_DMi. XX NM WORF_DMi. XX OS Drosophila miranda OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RA Bachtrog D.; RT "Accumulation of Spock and Worf, Two Novel Non-LTR RT Retrotransposons, on the Neo-Y Chromosome of Drosophila RT miranda."; RL Mol. Biol. Evol 20(2), 173-181 (2003). XX DR Genbank; AY144572; Positions 1 4174. XX SQ Sequence 4174 BP; 1010 A; 1004 C; 752 G; 1408 T; 0 other; ttttcttttg tgtctctgct ttggtgagtt taccgctcgt tttctgcact tcgttggatc 60 atctgctttg aattattgcc gctgcagctg attgtctggc tcgctctgct gtctgctctc 120 ctactttacc tgcgatctca ctccgttctc ttattattcg tgcacagtga ttctactgct 180 gtcaataatg gcaatcgttt gtagtgcaaa gaaatgtgaa ttcggtggtg ttatcattgg 240 tgattcattt cttagctgct ggctgtgcga caattttgcc cacataaaat gtgctggtgg 300 tggaaatttt ggccggctga atgatctgat ctcgaaacgc atgggcctgt cttggtcatg 360 tctggcttgt cgggagattg aggccgaaat gcgcacattt atgagacaga ctcgaactgg 420 gtttctggat gtccggaaac aatttgtggc tctcaacgag aaatttcttg cccttgaatc 480 acagtttctt gggctcaaac tcttgagcga atcgcctaga cggaaaattc cgcataatga 540 taccaatctc ttgcaaccta atccgattgg tacgcctgcc tcccaccttc atccatcgga 600 agcatttccg tgtaggcatg ccacgccgtt gtctgtaaat ccgccaacgg tggctccgga 660 gttctttaca ccgagcaatg tgcttccttc gagcacccaa ctccccaaca gcatcaatcc 720 catccctgca gtttctgtgg ccgtcacttc tgacgcggtg agtgctccta gtgcgggtgt 780 gctggttgct gacgacgtct tgccaattcc tgctccaatt cctgctccaa ttcctgctgc 840 aattcctact ccaattcctg ctacagccat tgctgccaat tcctctgccc ttgtaccgag 900 atctctttta ggtgtggctc caccatctcg atctcgctcg ggacttcaac ttaaagcagt 960 ggtccctcgg aaagcaatat ttgtttctcg tcttattcct gaggctacaa cggaggatgt 1020 taaacaacat cttgccacta aacttaacac ttcgcctgtt gatatagttg tgactaaatt 1080 ttcatttaaa cataagcgca acatatcatc atttaaaatt cttctccctg attctttact 1140 atcctgttct ctagaaccgt caatatggcc tgagcataca attgtgcatg agttccttct 1200 caaagattcg aactcgaatc caagaattac cgaacatgct ccaaaaaact aatgtactat 1260 ttttccatgc actatcagaa cgttcgcagt ttgctgggaa agttgcgtca aattcatact 1320 aatagcgcgt cctttgattt cgatgccatc gcgtttaccg aaacctggct taactcctct 1380 gtcaatgatc acgaaatttt cattgatagt tacactattt atagaatgga ccgcccatct 1440 tttgcaggtg gggttctgat tgcagttaaa tctgttttct catctgagtt attcccattc 1500 aataacattc atggaattga atttgttgca gtcaaagttc gtgttggctc cgcatttttc 1560 tatttaacct gctcttacat tcctcccagg tctgatgctg agctttactt acaccacctt 1620 tcagcaatta ataatgttgt ttctacatta gggtgcaatg atcgaattat tgtcatgggg 1680 gactttaacc tcccatttct ttcttggctg ccttgtaatg acgctaacct gttgtttccc 1740 aattgccata atgactttat caatgggcta acggatattt cccttgtcca aattaatgcc 1800 gttaaaaaca ttagggacag acttcttgac ctcgtttttg ttaacgatgg ttctcttact 1860 actgtatcta gagcaagccc aatatctctc cctgaagatc cttaccatcc aactttgttg 1920 atatcactgg agtgtacaca atctggaggt gctgacagtt ccatggctcc ttgtcacata 1980 aagtgttttc gcaaaacaaa ctttattgat ttagatctac atctctcgcg tgttgattgg 2040 tcgtttcttt attcattacc gaacatagat gcaactgtta actcgtttta tagttctatt 2100 tactccgccc ttaatacgtt tgttccagat atcactgtac ctgtatcgtc caagcctccc 2160 tggttctcca agtatttgtc atacttaaaa aataataaat cccggctcta taaaaagtac 2220 cagaagtcgg gctccacgtt ggccttagct ctatactcct ccgctcgctc tctgttcctt 2280 gccgtcaata gtcagtgtta taatcattac ctttcacaat gtagtagtaa ttttcgtagt 2340 gatcctaaaa aattctattc gttcgttaat tctaagcgca agtcaaacgt ttttcctccg 2400 tcccttcatt accagaacaa aacagaagct tctgctgttg gtattgcaaa tttattcgct 2460 aactttttcc aaacaacgta ctcgtctcat atttacaacg catccactcc gtatccgtac 2520 cagctacctc aagctaacag tatttttctg ccctttttcg aggaaagcgt tgttcttgaa 2580 ggtttgtcat ctatggacat atcgttttct gcgggtccag ataaggtacc aagttgcatc 2640 ttaaaacact gtgcccagtc cctttgcaag cccttgacct ttctcttcaa cctctccttg 2700 gaacagtctt gtctcccagt aatttggaat gagtcctaca tcattccgct tcacaaaaaa 2760 ggttcaagat caaacattga aaactatcgt ggtatcgcaa agctttccgc catcccgaag 2820 cttcttgagt tcctggtcac ccggcaactg caacatcttt gttgcagctt gatatctccg 2880 tcgcaacacg gttttttcag acatcgttca acatcgacta accttcttga gttttctaac 2940 ctaatccacc gtggttttca aattggtttg cagacggatg tagtttttac ggacttcagc 3000 aaggcattcg attctgtgaa ccatgctctg cttattcaaa agctctcctt attagggttc 3060 ccaacgaatc ttctagattg gattttgtcc tatctctcta accgtactca acgtgttttg 3120 ttttctaacg tgttgtcaaa tactgttaat gttacttcag gtgtgccaca gggaagtcat 3180 ctgggcccgc ttctgtttat tttatttgtg aacgaccttc ctcaagttat aacatactct 3240 actacactaa tgtatgctga tgatgtcaaa atctgtcttt cttactctga ttggtatttg 3300 cacacacgcc ttcaacttga tctaagtgaa ctactattgt ggtgttcaac taatcttctt 3360 tttctgaacc tttccaaatg caaacttatg acattttacc gtcgcgctcc tcattttgtc 3420 tcatatgttc taggaaatca tgtccttgag cgaatttcga gttcaaatga cctcggagtc 3480 ctttttgatc ataagatgtg tttcaacacc catatagctg caactgtaaa taaagctaag 3540 ggtgttttag cgttcatcaa gcgttggtcc aaggagtttg acgacccgta cgttacgaaa 3600 caattgtaca tctcgttagt acgtcctata ttggagtatt gttcttgtgt gtggagcccg 3660 cagtataaag agcagcaggc tgttattgaa tccgtgcaaa agcaattttt aatttttgcc 3720 cttcggaact ttaactggga ctcgggtaga atcttgccac cctaccggtc taggctaaat 3780 cttattgacc tgccgtcgtt gcaccatcgc agaatatgca atggcgtaat gttcgtgcac 3840 aagctccttc ttgggactgt tgactcccaa actctcttgg gtcagattga cttggccgtt 3900 ccatccagac ctacccgtac ttttaggcct atccgtctac ccatatgtag gtctaattat 3960 gctgatcatg aaccttttag ggttttatgc cataattata actccctctg tctaacccta 4020 tcccctgaac tgtctcttaa actaattgca tgcaatattt ataatcattt aaatttagct 4080 aacttttaac ttatcttttt ctgcctttag taactaagta ccattattaa cagataattt 4140 gttaatttaa taaataaata aataaataaa taaa 4174 // ID BEL-121_AA-I repbase; DNA; INV; 6508 BP. XX AC supercont1.19; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-121_AA_; KW BEL-121_AA-LTR; BEL-121_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6508 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.19; Positions 1561038 1554531. XX CC Positions [5568-6125] - Integrase core CC 'CCTGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 62..3373 FT /product="BEL-121_AA-I_2p" FT /translation="MAAARYNCTCGEPIVGEEKMAQCSECVRPIHPACAMI FT GRSIGGDRVVCARCYRRDPPPPPSTTRRSTSSSTRAKLLLDLQRLEEEKCL FT QEKAAEEKVRREREYLAKKYELMQAELLEQSTASSRRSVSSIEKVQHWLLD FT KPTDNVESGIGQSEIPAGVSSRTGTVYKPIPTVPTTVAAVSTPKSGQHAQL FT SFGLNPSQPLSQDASQCQFSGTMGEQQVAHAPPENQTQNPLDWSLTWEFPE FT EPMPIPKRRIAVDLEEIQRKFCGVQLRPPNSSQHPVSRPPLVSDPVQREAQ FT HNVIHGIPTQQHETPATSPVSASGAGQVQAVLQPITTQYQQTSGLESAPTG FT QESSAKITLVSQTCSNINPIQIRPLQSNVTFDVPKLSSFISHTPFPLTEPC FT KPPQISQNNAFTFNPAHRSTPHVRPATAPSVRFADFPPVSSSSLPLSTGVQ FT TVQSNTVPIASSVAANSDPVVPNVFPQAFLPFDSTSAPAPVHQPWISSPTS FT QQLAARHIVPKELPFFSGDPVEWPLFLSCFQNTTQLCGFSHGENLMRLQRS FT LKGNALEAVRSLLLEPSSVPMILSTLQTLYGRPDLVINSLLQKVRSTPAPK FT PDKLESLISFGLACQNLCGHLRAAGQQAHFSNPALLQELVSKLPANIKLDW FT ALFKQKCPAVDLGTFGDYMAQLVVAASDVAPYQPTEEARSSSGKVKGKEKL FT YLNTHASGNTVDNVGRRNPPSNPGGKQNQPKPCPICGKQGHKVRDCDDFKK FT CNLEERWKRIQEHYLCRRCLIAHGKFPCKATSCGMEGCEERHHKLLHPGKP FT QSIVPAIQPTATETVNVHTALKVSTLFRIVPVTLYGTDRSVHTFAFLDEGS FT SSTLLDVRIARELGLKGEVHPLYLQWTSDVERCEDNSQLIRLEISGRGAKK FT RYVVAAAHTVEKLSLPQQSLPFDQLAQQFPHLRGLPIRGYCNAIPTILIGL FT DNTRLKIPLKVQEGKVGQPAAAKTRLGWTVYGPIPGECSSTQKCQLHLYKE FT ARNPDDVLHDLVKEFFSMEHVGVAVAPLLEGSDEMRSRKILEETTSRLPSG FT RFQTGLLWRYDRINFPDSKPMAESRLKSLEGRLWRKPELFENLKQQR" FT CDS 3649..5424 FT /product="BEL-121_AA-I_1p" FT /translation="MFHQFQIRPEDRQAQRFLFRSDPSKAPDIYVMDVATF FT GVTCSPSAAQFIKNKNAKEFESEFPEAAAAIVHNHYVDDYLDSRDTVDEAA FT DMALQVRTVHAKAGFHIRNWMSNSREVLLRVGDSAEGSAKNFKTHAATVSE FT RVLGMSWFPESDEFRFRGFFREQIIPLLHGDVVPTKRQLLQVVMSIFDPLG FT LVSLIVVHGKILVQKSWRANIGWDDKIDEDLLKLWRRWTELLGQLDTVRIP FT RCYFPGYLPESYESLQLHIFVDASEEAYVATAFFRIVDQSQVRCALVASKT FT KVAPLKLLSIPRLELQAALLGARLAKSVAENHTLRIKQRFFWGDSSTVLSW FT LRSDHRRYRQFVAYRVSEILDTSKVDEWHYVPSHLNVADDATKWKERLQIS FT NSHRWFCGPDFLYEPQNLWPKQEAQSTSTTEELRPIHVHCEVRKEQVVQFC FT RFSKWERCLRSVAYVHRFIDQLKRKRNKEESDVTAILTREELQRAEQTIIK FT LAQFEVFGDEVVTMQNNQSLPVDQQQRLESSSKLYKLSPFLDNQGVVRMEG FT RIDGFSDVEGRTSNTPLCYRRTTTLPSSSLTRTIDGTNIPTEKQQ" FT CDS 5433..6506 FT /product="BEL-121_AA-I_3p" FT /translation="MRQKYHLSEMRAAFRKIGKSCMWCKIYKATPAVPRMA FT PLPEARVTPRIRPFSFVGVDYFGPLLVVQGRREVKRWIALFTCLTIRAIHL FT EVVTSLSTECCKMALRRFIARRGAPSEIYSDRGTNFVGVSGELSKQVRAIN FT QELASTFTNTVTQWRFNPPAAPHMGGSWERMVRSVKCALASLSVERKPSEE FT VLVTLLVEAESMVNSRPLTYMPLQTSEHPALTPNCFLMLSTSGVNQLSTQL FT VDDRQALHTSWFLCQRLLDQFWTRWVKEYLPTITRRTKWFVDTRPVSAGDL FT VVIVDDRVRNGWIRGQVLRVFPGRDGRCRSANVQTATGVLRRPVAKLAVLD FT IGGNAREDTEQYGSG" XX SQ Sequence 6508 BP; 1673 A; 1694 C; 1682 G; 1459 T; 0 other; aatcttttag atttttaacc gcgaccgttt tcaacgacgc cacgaggagc cacaaagtgc 60 gatggcggcg gccagataca attgtacgtg cggcgaaccg atcgtcggtg aagagaagat 120 ggctcagtgt agtgagtgcg tacgtcccat tcatcccgcg tgtgcaatga tcggtcgatc 180 tatcggtggc gatcgcgtcg tgtgtgcgcg ttgttatcgg cgagaccctc cacctccccc 240 ttcgactaca aggcggtcga caagttcttc tactcgagcc aagctgttgc tcgatttaca 300 acggctcgaa gaagaaaagt gcttgcagga aaaggctgcc gaagaaaaag tgagacgtga 360 aagggaatat ctcgcgaaga aatatgagct catgcaggct gagctgctag agcaaagtac 420 agcaagcagt cgtcgatcgg tgtccagcat cgagaaggtt cagcattggc ttctggacaa 480 gcccaccgac aacgtagagt ccggcatcgg gcaaagtgaa atccccgccg gtgtatcgtc 540 tcggacaggc acggtctaca agccgatccc gacagtacca acgacagtcg ctgcagtttc 600 aactcctaag tcagggcagc atgcgcagct ttcttttggg ctcaatccga gccagccact 660 cagccaagat gcgtctcagt gtcagttctc gggcacgatg ggtgagcaac aagtcgccca 720 cgctccacca gaaaatcaaa cgcaaaaccc gttggattgg agtttgactt gggaattccc 780 tgaggaaccg atgcccattc caaagcgacg aatcgccgtg gacttggagg agatccagcg 840 aaagttctgt ggagtacagt tacgtccacc taattcaagc cagcatcctg taagtcgtcc 900 gccactggtc agtgatccgg tacaacgtga agcccagcat aatgtcatcc acgggattcc 960 cacacagcag catgaaacgc cggccacaag cccagtgtcg gccagtggcg ccggacaagt 1020 ccaagcggtg ttgcagccca tcacaaccca gtaccagcag acatccggcc tcgaatcagc 1080 accaactgga caggagtcgt cggcgaaaat tactttggtg agtcaaacgt gctctaatat 1140 taatccgata caaattagac cattacaaag caatgtgaca tttgacgtac caaaattgtc 1200 gtcatttatt tctcacactc ccttcccctt gaccgaacca tgtaaacccc cccaaatcag 1260 ccaaaacaat gcgtttacct tcaatccagc acatagaagc actcctcatg tccgccccgc 1320 gactgccccc tccgtgcgat ttgctgactt tccgcccgtt tcgtcgtcca gtttgccatt 1380 gtctactgga gtgcaaacag ttcaaagcaa tacggtgcct atagcgtcgt cggttgctgc 1440 caacagtgat ccagtggtgc ccaacgtgtt tccacaagcg tttcttcctt tcgactctac 1500 ctctgcacct gcgccagtcc atcaaccatg gataagttct ccgacatctc aacagctcgc 1560 tgctaggcac atcgttccca aggagcttcc tttcttttcg ggcgatcccg ttgaatggcc 1620 gctgttcctg agttgtttcc agaacaccac ccaattatgt gggttttccc atggagagaa 1680 cttgatgcgt ctgcaacgga gccttaaagg aaacgccctg gaagcagtga gaagtttact 1740 tctggagcca tcatcggttc ctatgatcct ctcaacactt cagacgctgt acgggcgtcc 1800 cgatctggtc atcaactctc tactgcagaa ggtacgttct actccagccc caaaaccgga 1860 caagttggag tctctgatat cgttcgggct agcctgccaa aacctttgcg gtcatctgcg 1920 cgcagctgga caacaggctc acttctcgaa tccggcattg ctgcaggagc tagtcagcaa 1980 attaccagcg aacatcaaac tggattgggc gcttttcaaa caaaagtgtc cagcggttga 2040 cctcggcaca tttggtgatt acatggcgca actggtggta gctgcaagtg acgtcgctcc 2100 ctatcaaccg actgaagaag ctcgttctag ttcgggaaaa gtaaagggaa aggagaagct 2160 gtatttgaac acccacgcat ctggcaatac ggtggacaac gttggtagac ggaatcctcc 2220 aagcaatcct ggaggtaagc agaaccaacc aaaaccgtgc ccaatttgtg gaaaacaagg 2280 acataaggtg cgagactgcg acgatttcaa aaagtgcaat ctggaggaac gttggaagcg 2340 tattcaagag cattacctgt gtagacggtg cctgatagca cacggaaagt tcccgtgcaa 2400 agcaacatcg tgcggaatgg aaggatgtga agagcgacac cacaagctat tgcatcctgg 2460 caagccacag tcgatagttc cagcaataca gcccacagcg accgaaactg ttaacgttca 2520 caccgctctg aaggtatcca cgctctttcg tatagtacca gtgacgctgt acggtaccga 2580 tagatccgtc cacacgtttg cgtttttgga tgaagggtcc tcgtctacat tgctcgacgt 2640 ccgtatcgcc agggagttgg ggttgaaagg tgaagtccac ccgctttact tgcaatggac 2700 cagtgatgtt gaacgttgtg aggacaattc tcagctaatt cgactcgaaa tctcgggtcg 2760 tggagccaag aaacggtacg tcgttgcagc agctcacacc gtggaaaaac tgtctcttcc 2820 tcaacaaagt ttgccgttcg atcaactcgc tcaacagttt ccgcacctcc gtggtctacc 2880 cattcgagga tactgcaacg ccattccaac aatcctgata ggtttggaca acacgcgcct 2940 gaagattccc ctcaaggtcc aggaaggcaa agtcggtcaa ccagcagcgg cgaaaacaag 3000 gctgggttgg acggtatacg gccccattcc tggtgaatgt tcctctactc agaagtgtca 3060 gcttcatctg tacaaagaag ctcgaaaccc tgatgatgtg ttgcacgacc tagtgaagga 3120 gtttttctcg atggagcatg ttggcgtcgc tgtagcacct ttgctggaag gttccgacga 3180 aatgcgatcc agaaagattc tcgaagaaac gacttcacgg ctgccatccg gacgattcca 3240 gacagggctg ctttggagat atgatcgcat caactttcca gacagtaagc cgatggcaga 3300 gagtcggctc aaatcgctgg agggtcgttt gtggcgcaaa cctgaactgt tcgagaacct 3360 gaagcagcag agataatcga gtacgtggaa aaagggtacg cgcacaaaat tacacaggag 3420 gagattgtca gctcggatcc ccagaaagtg tggtatcttc cattgggagt agttgttcat 3480 cccaaaaagc ctgggaaagt tcgcatagtt tgggatgcag cagcgaccgt ccaaggccaa 3540 tcccttaatt ccgccttgct tccgggacca gatctgttgt cctccctccc gtcggtgctc 3600 tccaaatacc gtcaacgcca ggtggcgatc tgtggcgata taagggagat gtttcaccag 3660 ttccagatca gacccgagga tcgacaggca caacggttcc tgtttagaag tgacccttca 3720 aaagcacctg acatctacgt gatggacgtg gccacttttg gtgtaacttg ttctccctct 3780 gctgcgcagt tcatcaaaaa taaaaatgcc aaggagtttg agtccgaatt tcccgaggcc 3840 gccgcagcaa tcgtccacaa tcactatgtg gacgactatt tggatagtcg ggataccgtt 3900 gatgaagcag cagacatggc gctgcaagtg agaacggtcc atgctaaggc cggatttcac 3960 attcggaact ggatgtccaa ctccagagag gtgcttttac gggtcgggga ttcagctgaa 4020 ggatctgcga agaacttcaa aacgcatgca gctacagttt cggaacgtgt actcgggatg 4080 agctggtttc cagaatcaga cgagtttaga ttccgtgggt ttttccgcga acaaataata 4140 cccctgcttc acggagatgt tgtcccaaca aaacgtcagc ttctccaagt ggtcatgagt 4200 atttttgatc cactaggtct ggtgtcgctt atcgtggttc acggaaaaat cctcgttcaa 4260 aaatcgtggc gagcaaacat cgggtgggac gataaaatcg acgaagatct actcaaactg 4320 tggcgtcggt ggacagaact actggggcag ctcgacacag ttcgcatacc acggtgctac 4380 ttcccagggt accttccgga aagctacgaa tctttgcagc tacatatttt tgtagatgcg 4440 agcgaagagg cgtacgtggc aaccgccttc ttcaggatcg ttgaccagtc ccaagttcgc 4500 tgcgctctgg tagcttcaaa gaccaaagtt gccccactga aattactatc gattccccgt 4560 cttgagctcc aggctgcgtt actaggagcc agattggcga agtccgtggc ggaaaaccac 4620 acactgcgaa tcaaacaacg cttcttctgg ggtgattcct ccactgtcct ttcttggcta 4680 cgctcggacc atcgaagata tcgacagttt gtagcgtacc gcgtatcgga gatcttggac 4740 acttcgaagg tcgacgaatg gcactacgtt ccctcgcacc taaacgtggc agatgacgcc 4800 actaagtgga aagaacgact ccagatcagc aacagtcatc gctggttctg tggaccagac 4860 ttcctatacg agcctcagaa cctttggcca aaacaggagg cgcaatctac ctcaactacg 4920 gaggaactga gaccaatcca cgtacattgt gaagtccgga aagaacaagt ggtacagttc 4980 tgtcgctttt ctaagtggga gcgttgtcta cgatcagtcg cctatgttca ccgattcatt 5040 gaccagctga agcggaagcg gaataaagaa gaatccgatg taactgccat ccttactcga 5100 gaagaacttc agcgagcgga gcagacgatc atcaagctgg cacagtttga agtttttgga 5160 gatgaggtgg tgacgatgca gaacaaccag tctctaccag tggatcaaca gcagcgcctc 5220 gaaagttcaa gcaaactgta taaactgtct ccatttttgg acaaccaagg cgtagttcga 5280 atggaaggac gaatcgatgg gttctctgat gtcgaaggta ggacttcaaa taccccactg 5340 tgctaccgaa ggaccactac gttacccagc tcatcattga ctcgtaccat cgacggtaca 5400 aacattccaa cggagaaaca gcagtgaatg aaatgcggca gaaataccac ttgtcggaga 5460 tgagagcagc atttcggaag atcggcaaat cttgcatgtg gtgcaagatc tacaaggcga 5520 cacccgcagt tcctagaatg gcaccactac cggaggcaag agtcactccc cgcatcaggc 5580 cgttcagctt cgttggagtg gattacttcg gcccgctgtt ggtcgtccaa ggtcgtcgtg 5640 aagtgaagcg atggatcgca ctctttacct gtctgacaat cagggcgatc catctcgaag 5700 tcgtcaccag tttgtcgacg gagtgctgca agatggcatt acgacgattc atagcacgga 5760 ggggggcacc ttcggagatc tacagtgatc gggggacaaa ctttgttgga gtcagcggcg 5820 aattaagtaa gcaagtaaga gcaatcaacc aagagctggc atcgacattc actaatacgg 5880 tcactcaatg gcgcttcaat cctccagctg caccacatat gggggggtct tgggagcgca 5940 tggttcggtc ggtgaagtgt gcattggctt cgctatcggt cgagcgcaaa cctagcgagg 6000 aagtcttggt cacgttgctg gtggaagctg agtcgatggt gaattcgagg ccgttgacct 6060 acatgccgct ccagacttcg gagcacccgg cacttactcc gaactgcttc ctcatgctga 6120 gtactagcgg ggtgaaccag ctttcaaccc aacttgtgga cgatagacag gctctacaca 6180 ccagctggtt cctgtgccag cgcttgctgg accaattctg gacgcgctgg gtgaaggagt 6240 atctgcccac tatcaccaga cgaaccaaat ggttcgtaga caccagaccg gtctctgccg 6300 gtgatctggt ggtcatcgtg gatgatagag tccgcaacgg atggataagg ggacaagttc 6360 tacgcgtgtt ccctggacga gatggcagat gtcgcagtgc aaacgtacag acagcaactg 6420 gtgtactacg gcgaccggtg gcaaaactag ccgtcctaga tattggtggt aatgctcgag 6480 aggacacgga gcaatacggg tcggggaa 6508 // ID SMAR14 repbase; DNA; INV; 1722 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR14. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1722 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1072-1072 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 204..1541 FT /product="SMAR14_1p" FT /translation="MSSKRCSYTTGFKLKVIEAAERIGNRGASREYGLSEA FT NVRRWRKNKNTLKECRSSTKASGRGRKLFFPDVEKKLVEYVHERRFSGSSI FT STVEVRLKALELVKEMYPGSIFKASARWCQLFMKRNDISIRRRTSIAQRLP FT DDVENKLLDYQKYIINLRKRHEYPISCIGNGDQTPLSFDIPSFQTLDFKGV FT KSVPIKTTGNEKNRFTVMLGAYGDGKKMPPYVIFKRKSLPKNIHWPAGMIV FT KAQENGWMNEELTKDWIKTIWCNDRKDEFRNSRRLLVLDAFRCHRMPCVKE FT ILAKNKTDLVIIPGGMTGQLQVMDVVCNRPFKQEMRKRWNEWFFSGAPSFT FT ASGNRRKVGLETICKWIQLSWDAVSEAIIIKGFKKCCLSNAMDGSEDDIIW FT QDDISDVGNEEDDIDDMENELMFGEVDESIEIENLMESEDENNGLCNEKNI FT L" XX SQ Sequence 1722 BP; 611 A; 233 C; 365 G; 513 T; 0 other; taccgtaatt atccgaatat aagacgcacc caatataaga cgcatcatta aaaattgact 60 gaattcaaga aaatataagt aacccaaata taagacgcat gtataattat taccttacat 120 taaatttatt attaaattta ttttttgttt aaaattaaaa tttcagttac ctaaattcaa 180 atttaaattt ttatttccct ataatgtctt caaaacgctg cagttatacg acaggtttca 240 aattgaaggt aattgaagca gccgaacgta tcggaaatag aggagcatct agagaatatg 300 gactcagtga agctaatgta aggcgttgga gaaaaaataa aaatacttta aaagagtgca 360 gatcctctac aaaggcctct ggcagaggta ggaaattatt ttttcctgat gtggagaaaa 420 aattagttga atatgttcat gaaaggcgat tttcaggttc ctcaatatcc actgtagaag 480 ttaggttgaa ggctttagag ttagtcaagg agatgtatcc tggttcaata tttaaagcct 540 cagcgcgctg gtgccagctt ttcatgaaaa ggaatgatat ttctataaga agaaggactt 600 caattgcaca gcgactgccg gatgatgtag aaaataagct gctcgactat cagaaatata 660 taataaattt aaggaaaagg catgaatatc caattagttg catagggaat ggggatcaga 720 ccccactaag ttttgatata ccttcatttc agacattgga tttcaaagga gtcaagtctg 780 tacctattaa aactacaggc aacgaaaaga atcggtttac agttatgttg ggagcttatg 840 gggatggcaa gaagatgcca ccttatgtaa tttttaaaag aaagtcttta ccaaagaaca 900 ttcattggcc agctggtatg attgttaaag cacaagaaaa tggatggatg aatgaggagt 960 tgactaagga ttggattaag actatttggt gtaatgatag aaaagatgaa tttcgaaatt 1020 caaggaggtt actggtatta gacgcattta gatgccaccg tatgccatgc gtaaaggaaa 1080 ttttggcaaa aaacaaaaca gatcttgtga ttattcctgg agggatgacc gggcaacttc 1140 aagttatgga tgtagtttgc aataggccat ttaagcaaga gatgcgaaag agatggaatg 1200 agtggttttt ttctggtgct ccatcattta ctgcttctgg aaacagaaga aaggttggtt 1260 tagaaactat ttgcaaatgg attcaactat catgggatgc agttagtgag gcaattatta 1320 tcaaagggtt caagaaatgt tgtttgagta atgctatgga tggatcagaa gacgatatta 1380 tttggcagga cgatatatct gatgtaggga atgaggaaga tgatattgat gatatggaaa 1440 atgaattaat gtttggagaa gtggacgagt ccattgagat tgagaatttg atggaaagtg 1500 aagatgaaaa taatggtctc tgtaatgaaa aaaacattct gtaactaaaa ataaacattt 1560 tgtattaatt tatactttat ttcccttgcc cataccggtt tttcagaaaa ttttctgagt 1620 aggaaaaaat atttcctgtt atacccatgt ataagacgca ccacgattat catccccaaa 1680 agtgatgaaa aaaaagcgtc ttatattcgg ataattacgg ta 1722 // ID Kiri-1_CQ repbase; DNA; INV; 4655 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4655 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 120-120 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >99% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 301..1125 FT /product="Kiri-1_CQ_1p" FT /translation="MSTKRTPPINPATPSGSSNNSNQNKRYRSSSAGSVSP FT GASPGPLTLEAVMKEINMQFTKTFDRIDTISNKIDTVKAELNEQLSSVSRD FT FNSFKAECADKFKFTDDAMCELESRIDGISQEIGGIENRNELIVSGVPHIP FT GENVAAYFKDMWKQVGLPENPAPLVDVRRLKTGTQGDGLILLQFALRNNRD FT DFYSCYLRKRDLKLQHLGMNSTRRFYVNENLAVPARKIKKAALELKKSGKL FT SSVYSKKGIVHVKRTADQQPGTAVHSENQLQQYS" FT CDS 1299..4148 FT /product="Kiri-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MPPLTNHTSANIPGVVMNCALHSGKLNICHGNTQSLC FT ARKFAKLDEVKDLLHNSKVTVACFTESWLTGQISDRSIAIPGFAVCRNDRM FT YQRGGGIVIYYRSHLSCRKVFCTQLSAESGNKTESLVVELRFGGETVLLVA FT VYNPPGNNCSDFLEEKLVEYASTYNNVLLIGDFNTDMSKPSNLQSQFQSVL FT DTFALVSVGEEPTFFHNRGSSQLDLFITSCSDRVLRFNQVGCPALSQHDLI FT FASLDFDANPVPRVNTYRDYVNFDSTSLVDAVSSVPWHNFYEVNDPNELAD FT FFNSRMKQIHDVCIPLRVIRCRNESNPWFSYEIRKSMLERDLAYSDWRRAP FT AESKLVARQRYNSLRNRTNTMIANAKVQYFNRFLDSRVPSNVLWKRLKSVG FT VGKEKSSSVCEFDPDEVNRAFLSSFIQNEQGSGLRSSAADSPYSLTFQPVE FT GWQIVNAVCDVKTNATGLDGLPIKFIKIVLPLVLQQITHLFNSIIATSIFP FT TCWKHAKIIPLRKKPHLSALTNLRPISILCALSKVFEKLLEQQMSSFITEN FT NLLTEHQAGFRKGQSIQTATSRVFDDLARTTDRRGSAVLLLLDFSKAFDTI FT PHNKLCTKLETQFNFGPSAVNLIRSYLENRRQTVFCGDLQSVSSIVDSGVP FT QGSIVGPLLFCCHVNDLPPVLRYCSIQLYADDVQLYIGRLGPCSRELIRMV FT NEDLDRIAVWSQRNQLFVNQAKSKALFVQGRRRNKAQLDLLPRITMSGQAI FT EWVDSAKNLGFIFQADLQWDGLIQQQCGKIYGSLRTLRSCTSAAPVSTRLK FT LFKSLILPHFLFGELLHVNPSAGALDRLRVALNCCVRYVYGLTRYDHVSHL FT QRNLVGCPLDNLYAHRSCLFLRKLLTTRSPPELYQKLQQFRGRRLLNLIIP FT ANRTLTYSSSLFVRGVVNWNMVPTELKHGVSQATFKRNCLEFWNRE" XX SQ Sequence 4655 BP; 1246 A; 1216 C; 1086 G; 1106 T; 1 other; aaagtttggc aacacttacg tcaatatgtg ttgcgagtaa aagctgcgta aacaaattac 60 gatcgtggag tgaaaccgcc ccgttcagca gtgaaaaccc tctgagtgaa atccggaacg 120 ttcgccgtac ctctgttggt gttatccagc tctgccgtgc cgcacttgat ccgcaccagc 180 caccggaatt tttggtttgt tttgatcgga cctgacatcg tcaacaagcc ggcgcgtttt 240 gttctaaaaa cgataagttt cgttcacacc aacaccacaa cgcagtgcaa gagtatcagc 300 atgtcaacaa agcgtacacc accaattaac cctgccacgc ctagcggcag cagcaacaac 360 tcaaatcaaa acaaacggta tcgcagcagt agtgctggtt cggtttcccc gggagcatcc 420 cccgggccgc tgacgctgga ggcagttatg aaggagatca acatgcagtt caccaagact 480 ttcgatagga tcgacaccat cagcaacaaa atcgacactg tgaaagcaga acttaacgag 540 cagctcagtt ctgtttcgcg tgacttcaac tcgttcaagg cggagtgtgc tgataagttc 600 aagttcaccg acgatgcaat gtgtgaactg gaatccagaa tcgatggcat ctcccaggaa 660 atcggcggaa ttgagaatcg gaacgagctc atcgtaagcg gtgtcccaca cattcccggg 720 gaaaatgttg cggcgtattt caaggatatg tggaaacaag tcggtcttcc tgaaaatcca 780 gcccccctgg tcgacgttag gcgactgaag actggaaccc agggtgacgg cctgatccta 840 ctgcagttcg cgcttcgaaa caaccgggac gacttctaca gctgctatct tcggaaacgt 900 gacctcaagc ttcaacatct aggcatgaac tcaactcgcc ggttttacgt caacgaaaat 960 cttgccgtcc ctgccagaaa gatcaagaaa gctgcgttgg agctcaagaa gtctgggaaa 1020 ctgtcgtcgg tctattcaaa gaagggaatc gtccacgtga aacgcacagc agatcaacaa 1080 cccggaactg ctgtccactc ggaaaaccaa cttcaacaat actcttagtt aagtacattg 1140 tgcctccgtc cacgtcttaa tttgtttaaa tgttttattt tgagaacttt tattgttttg 1200 tgatagaaga ttgtattttt gattgtattc taaattgtgc ctcgtttaac aaatttttga 1260 acggtagtag tagtatcagt taattgaatt gctccccgat gccaccctta accaaccaca 1320 cctcggcgaa catcccggga gttgttatga actgtgctct tcactcgggc aagctcaata 1380 tctgccatgg taacacacag agcctctgtg cgaggaaatt tgctaaacta gacgaggtga 1440 aggatttgct gcacaattcc aaggtaaccg ttgcttgctt caccgagtct tggttgactg 1500 gacagatctc agaccgtagc attgcaatcc ccgggtttgc cgtctgtcgc aacgatagaa 1560 tgtatcaacg cggtggtgga atcgtcatct actacagaag tcacctcagc tgcagaaaag 1620 tcttctgtac ccaactatcg gcggaatccg gcaacaaaac tgagagtctg gtggtagagc 1680 tgcgttttgg cggtgagacg gtccttcttg tagcagttta caaccctcct ggtaataatt 1740 gctctgattt tctcgaggaa aagttggtgg agtatgcctc aacctacaac aacgttctgc 1800 tgattggaga cttcaatact gacatgagca aaccgagcaa cctgcaatcc cagtttcaat 1860 ccgtccttga cacatttgcg ctcgtatctg tcggcgagga acctacattt ttccacaacc 1920 gtggcagctc tcagcttgac ctgttcatca ccagctgtag tgacagagtg ttgcgtttca 1980 accaagttgg ctgtcctgcc ctttcacagc acgatctcat ctttgcctcg cttgactttg 2040 atgcaaatcc cgtccctcgc gtcaacacct atcgtgatta cgtgaatttc gattctacca 2100 gcctggttga tgcggtttcg tccgttcctt ggcacaactt ctacgaggtc aacgacccaa 2160 atgagctggc tgattttttc aactcccgga tgaagcaaat tcacgacgtt tgcattccac 2220 ttcgtgttat cagatgccgc aatgagtcca acccctggtt cagctacgag atcagaaagt 2280 cgatgttaga acgtgatctg gcgtatagtg attggcgaag agcacctgca gaatctaaat 2340 tagtggctag gcagcggtat aactcactca ggaacaggac aaacacgatg atagcgaatg 2400 caaaagtgca atatttcaac cgttttctgg acagccgtgt tccatccaac gtgttatgga 2460 aacgcttgaa atctgttggc gtcggcaagg aaaagtcttc cagcgtgtgc gagttcgacc 2520 ctgacgaggt taatcgtgct ttcttgtcga gttttatcca aaacgagcag ggcagcggac 2580 tgagaagttc agcagcagat tctccgtaca gccttacttt ccaaccagtg gaaggctggc 2640 agatcgtcaa cgctgtgtgt gatgtgaaaa ccaacgccac tggtctagat ggactgccga 2700 tcaagtttat aaaaatcgtc ctccccctgg tcttgcagca aattactcac ctattcaaca 2760 gcatcatcgc aacctcaatc tttcctacct gctggaaaca cgctaaaatc ataccactga 2820 gaaagaaacc ccacctgagc gctttgacga atctacggcc cataagcatc ttgtgtgcgc 2880 tctcgaaggt gtttgaaaag cttctagaac agcaaatgtc gtcttttatc accgagaaca 2940 acctactcac ggagcaccag gcgggtttcc ggaaaggcca aagcatacaa accgcgactt 3000 cccgtgtgtt cgacgacttg gcacgcacga cagacaggag aggatctgcg gtgctgttgc 3060 tgctcgattt ttccaaggcc ttcgacacca taccccacaa caagctgtgt acgaagctcg 3120 aaacgcaatt caattttggt ccatccgcgg tgaaccttat cagatcgtat ttagaaaacc 3180 gacgccaaac cgttttctgt ggagatctac agtcagtaag cagcatcgta gattccgggg 3240 tacctcaagg ctcgattgta gggcctctgc ttttctgttg tcacgttaac gatctaccac 3300 ctgtgctcag atattgctcg attcaactct acgcggatga tgtgcaactg tacattggtc 3360 gactgggccc atgctcccga gaactcatca ggatggtcaa cgaggatctc gataggattg 3420 ctgtttggtc gcaacggaac cagctatttg tcaaccaagc gaagagcaaa gcgctgtttg 3480 tgcagggtcg tcgtagaaac aaagcgcaac tcgatctgct tcctcgtatc accatgagcg 3540 gccaagccat tgaatgggta gacagtgcga aaaatctggg tttcatcttc caagcggacc 3600 tgcaatggga cggactcatc cagcaacagt gtggaaaaat ctatggcagc ctacgcacac 3660 tccgaagctg cacatccgct gctccagtca gtacgcgtct taagctgttc aagtcgctga 3720 ttctgcctca ttttcttttc ggagagctgc tacatgtcaa ccctagtgct ggtgcattgg 3780 atcggttgcg tgttgcgcta aattgctgtg ttcgttacgt ttacggcctc acccgatatg 3840 atcatgtcag tcacctacaa cggaacctgg ttggttgccc tctggacaac ctgtacgcac 3900 accgctcgtg tctattcctg cgcaaactmc tcaccacgcg ctccccacca gaactctacc 3960 agaagctgca gcagttcaga ggccgccgcc tgttgaattt gatcattccc gccaacagaa 4020 ctttgaccta ctccagttcc ctgttcgttc gaggtgtggt caactggaac atggtgccaa 4080 cggaactcaa acacggggta tcgcaagcaa cattcaaaag aaactgttta gagttctgga 4140 acagggagta aaccaaacaa acgccgggaa acgagagcag cgtaagagca gcgcgcagcg 4200 tgtaaccact gtcgcgcccg gaggtcccgg agccccccta accacaagct ataggaagag 4260 tggcataggg acaaccctcc ggctcctccg gggtgcctgg aaaacacgac acgccagctg 4320 aagaatccac ggaggtatcc gagtgacccg gggcccccct aaccacaagt tacaagaaga 4380 gtggaatagg gacaaccccc cgggcactcg agtcccctgg atgaggtcgc ataaggggca 4440 aacccccatc ccctcggatg aagcatggac tacgcatgga caactcaacc cactactcgc 4500 aactcaagag aacaccgaga agaacgacat caaccaagga aaaccgttta gttaataagt 4560 gtaatattag ttagcaaaca cagatgatac cggaggtagc aattaaaaag gtgcaaacct 4620 tacgctacca gattaaataa ataaataaat aaaaa 4655 // ID Gypsy-23_IS-I repbase; DNA; INV; 4060 BP. XX AC ABJB010963117; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_IS_; KW Gypsy-23_IS-LTR; Gypsy-23_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4060 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010963117; Positions 1075 5134. XX CC Positions [1473-2012] - Reverse transcriptase CC Positions [3102-3485] - Integrase core CC 'ATATAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 94..1350 FT /product="Gypsy-23_IS-I_1p" FT /translation="MAEGEQAAAPAIRTGSIPYRQRDPPTFSGFGSDDVDE FT WLRTFKRVSVHNAWDNTLKLANVVTYLRDTAKVWFDNHEEQIASWDAFKTR FT LSDVFGCPVSRKENATQRLATRAQSTGESITTYVEDILALCKRVNPTMSEK FT EKVEHIMKGIAEDAFAVLVSRNPTTVAQLQEECQRWEGHRSRRIASPFTFS FT RLQNCVASSGPTLDRRHQEPIIRQIIKEEHRHIFALLPSCSHDIILGWHFL FT VTRRALIDCETKSLYLSKVTPGVEPLSYNRVSFRATSNTRISPFTAALIQL FT TGDLPDSDYLLEPNLEVFFQKGLALPFALTRSRSCISTVLVANTTNIYQIL FT PAGSCIATTTLHDVIYINAVSDTCLTSNAPLISDFMHDAHKLLRQTINANL FT PESQRDALHKLLALFLISLTFNLLP" FT CDS 1470..3485 FT /product="Gypsy-23_IS-I_2p" FT /translation="MLRKNIIQDSSSPWSSPVVLVKKKDNTWRFCVDYRRL FT NKITKKDVYPLPRIDDALDSLHAANYFSSIDMRSGYWQISADEKDREKTAF FT VTPDGLYEFRVMPFGLCNAPATFERMIDSLLRGLRWTTCLCYLDDVIIFSS FT NFDTHLSRLTAVLKIFRNAGLQLNSKKCVFGNTEIKILGHLVSPQGISPDP FT EKLTAVSQFPPPTNQKALRSFLGLCSYFRRFIRGFAEISAPLHHLLKKNVP FT FIWSQSEQRAFHALKTALTTQPVLAHFNPSLEVEIHTDASGYGLGAVLIQK FT HKGKEHVVAYASRALSKPEMNYAITEKECLAIIWAIAKFRPYVFGRPFKVV FT TDHHALCWLSNLKDPSGWLGRWALKLQEYDVSIFYKSGRKHEAPDCLSRNP FT IASDPSLPSTPINVIALTSSLDFAVEQRKEKSLRQVFQCSQKCPQTRKEKR FT LTDIYTLEHGILYRRNYDPQGAPLLYVVPKHLRGDILRTLHDDPTAGHIGF FT FKTYMRVRHTFYWQGMYRDIFKYVASCLPCQRRKRPTTAPTGRLHPLHPPH FT YPFECVGIDLLGPLPVTQSGNRWIIVAIDHLSRYAETSPLPTSNAEDVAAF FT LLKQVFVLHGPPRLLISDRGTVFLSEVVQQLLGLCGTVHKPTTAYHPQTNG FT LVERFNRTIADMISMYVQSDHQN" XX SQ Sequence 4060 BP; 1032 A; 1196 C; 820 G; 1012 T; 0 other; tacttggtgg aggtgcgggg gtccctcgta aaagccatct acagctctgc catcacgcat 60 cttcgaagtc atcgggtgct tacaccgcct tccatggcag aaggtgagca agcggcggct 120 ccggccatcc gtactggctc cataccctac cgtcagcgtg atccccccac cttctctggc 180 ttcggaagcg acgacgtcga cgaatggctt cgcacattca aacgagtcag tgtgcataac 240 gcttgggaca acacacttaa acttgccaat gtcgttacct acctccgcga cacggcgaaa 300 gtgtggtttg acaatcatga agaacagata gcgtcatggg atgcctttaa gacccggcta 360 tcggacgtct tcggctgtcc cgtgagccgc aaggagaacg ccacccagcg actcgcaacg 420 agagcccaat ctacaggtga gtccatcacc acctacgtcg aggacattct ggcactttgc 480 aaacgtgtga accccaccat gtctgagaag gaaaaggtcg aacacattat gaaaggaatt 540 gccgaagacg ccttcgcagt gttggtctca cgaaatccaa caaccgtggc ccaacttcag 600 gaagagtgcc aacgctggga aggccacaga agcaggcgca ttgcctcccc ctttaccttc 660 tctcggcttc agaactgcgt cgcgtcctcc ggacctaccc ttgacaggcg acatcaagaa 720 cccattattc ggcagatcat caaagaagag catcgtcaca ttttcgcatt actaccgagt 780 tgctcacacg acattatttt aggctggcac tttttagtca ctcgacgtgc gcttattgac 840 tgtgagacaa aatcgctata cctcagcaaa gttactcctg gcgttgaacc tctttcatac 900 aaccgtgttt cttttcgtgc aacaagcaac actcgaattt cacccttcac tgcagcgctt 960 attcagttaa ctggagacct tccagactct gattatttac ttgaaccgaa cttagaagtt 1020 ttctttcaga aaggtctcgc gctgcccttt gctctcacgc gatcgcgttc ttgtatatct 1080 actgttctcg tagcaaacac tacaaacatc taccaaatcc ttccagctgg ttcatgcatt 1140 gctacaacga cgcttcacga cgtcatatac atcaatgctg ttagcgatac ctgcttgacg 1200 tccaacgcac ctttgatttc cgactttatg catgatgccc acaaattgct tcgtcagacc 1260 attaacgcca atttgcctga gtcacaacgt gacgccttgc ataagctact ggctcttttc 1320 ttgatatctt tgactttcaa tctactcccc tgacacaaac gtcaattgtt caacatacca 1380 ttgcaacggg agaccaccag cctttacgca gtcgtccata tcgtgtttct gccaaagagc 1440 gcgaagttat acaaaatcat gtgagtggca tgttgcgcaa aaatatcatt caggattcat 1500 ccagtccctg gtcatctccg gtggtactgg ttaaaaagaa ggataacacc tggagattct 1560 gtgttgatta caggcgcctc aacaaaataa ccaagaaaga cgtttaccct cttccacgca 1620 tagatgacgc ccttgatagc ctacacgctg ccaactactt ttcttcgata gatatgcgtt 1680 cagggtattg gcaaatcagc gcagacgaaa aagatcgcga aaagacagct ttcgttactc 1740 ctgatggcct ttacgagttc cgtgttatgc ccttcggatt gtgtaacgct cctgctacct 1800 tcgagcgaat gatagattct ttgctccgag gcttgcgttg gacaacttgt ttatgctatt 1860 tagatgacgt tataatattc tcgtccaatt tcgacacaca cttgtctcgc ctgacggctg 1920 tgctgaaaat atttcgcaac gctggccttc agctgaactc taagaagtgc gtatttggaa 1980 acaccgaaat taagatcctc ggccacttgg tcagccccca gggcatatcc cctgatcccg 2040 agaaattaac cgcagtttcc cagttccccc caccaacgaa tcaaaaggct ttgcgctctt 2100 ttctcgggtt gtgctcctat ttccgacggt tcattcgtgg cttcgctgaa atatcggccc 2160 ccctgcacca ccttctgaaa aagaacgtcc ctttcatttg gtctcaaagt gaacaacgtg 2220 catttcatgc tctcaaaacc gctcttacaa cacaacccgt tttggcccat ttcaacccct 2280 ctcttgaagt ggaaatacac actgatgcta gtggatacgg tcttggtgct gtcctcattc 2340 aaaaacacaa aggcaaagaa catgtcgttg cctatgcgag ccgtgcttta tctaaacctg 2400 aaatgaatta tgcaataaca gaaaaagaat gtctcgcgat tatttgggca attgccaagt 2460 tcagaccata cgtatttgga cgcccattta aggtcgttac agatcaccat gccttatgct 2520 ggttatctaa ccttaaagat ccctcgggat ggctcggcag atgggccctt aaacttcaag 2580 aatacgatgt atctatattt tataaatctg gtagaaaaca tgaagcacct gattgcttgt 2640 ctagaaatcc gatagcttca gacccatcac tcccttctac gcccatcaat gtaatcgctt 2700 tgacctcaag cttagacttt gctgtagaac agcgaaaaga gaaatccctc cgccaagtat 2760 ttcagtgttc acagaagtgt cctcagacac gtaaggaaaa acgccttacc gatatctaca 2820 cattggaaca cggaatatta taccgccgga actatgaccc gcaaggtgcg ccactcctgt 2880 atgttgttcc taagcatctc cgtggcgaca tattgcgtac tctccatgat gaccctacag 2940 ctggacacat cgggttcttc aagacataca tgcgtgtccg acacactttc tactggcagg 3000 gcatgtaccg tgatatcttc aaatacgttg catcatgtct accatgtcaa cgccggaaac 3060 gacccaccac cgcgcccacc ggacgcctac atcctcttca ccctccccat tatccatttg 3120 aatgcgtggg gattgatctt ttaggtccac ttcctgttac gcaatcaggc aacaggtgga 3180 ttattgtcgc aattgaccac ttgtcgcgtt atgccgagac gtcaccattg cctacctcca 3240 atgctgaaga tgtcgccgct ttccttctga aacaagtttt tgtgctgcac ggacctcctc 3300 ggcttctcat cagcgaccga ggcacagtct tcctgtccga agtggttcaa cagttgttag 3360 gcctctgtgg aactgtacat aaacctacta cagcttacca tccccagacg aacggtctag 3420 tggaacgctt caatcgcaca atagcagaca tgatctccat gtacgtccag tccgatcatc 3480 agaactagga cagtgtactt cccttcgtca cctacgccta caacactgct ctccaagaca 3540 cacacggcta cactccattt tttcttctac acaatcgatc cccaaccaca cttctcgacg 3600 cgttacttcc ttacaaccat aatgatcagc tagaggacac tgtcgcccgc cttgtctgcc 3660 gtgccgaaga ggcgcgacag ctcgcccgcc tacgcacaat ggcttcacag cagcgccaat 3720 gtcaacgtta caacgagaca catcgacctg cgtcttacac cgccggtgac cttgtgtggc 3780 tctggacgcc acgtcggcaa cagggcaagg ccaccaagtt cctccaccgg tacgccggcc 3840 cataccgcat cactgccaag ctgtcggacc tcacctacca ggtacagctc ctgcacccaa 3900 cacctgaccg acggacacct tcgaaggaca ccgttcacat cacccgcatc aaaccctacc 3960 atcatcccaa tcaaaccctc gccctagaac acccctcttc tccaccaaac ttccccctcc 4020 actatgcggt cgagacgacc accaacaacg gcggcggtaa 4060 // ID L1-43_AAe repbase; DNA; INV; 4598 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-43_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4598 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1396-1396 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >93% CC identity. XX FH Key Location/Qualifiers FT CDS 135..1148 FT /product="L1-43_AAe_1p" FT /translation="MDSLRKQNFFIDIREIRKLRDENIFNFIAKNLKLTPS FT DVAAIQIDRIEGKVFVELQTQEAVQKVVEEYDNKYVLNYNGATHRMRLYIE FT DGGTDVKLHHLPPKMPEEWIVDYLSNFGEIISVKQEMCRSQFFSQTPSGVR FT IVRIRMTTPIPSFINIKGYSSHCTYSNQIQTCKHCNQEVHFGISCAENRAK FT LSTQNKPSSLYSRVLVGTTKEQAETAPDTATQKHSSTQQQISTQQQPPSHF FT KRPASPLNTESREKSVKTVTSETESTTEQPAPQATRPTKXTEMKSSVVSDR FT SRTTPTNSRSNSLTRATDKITMEISDDELLDMTRTRRSSKNRHDGK" FT CDS 1161..4559 FT /product="L1-43_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEDSIRIQHFSYTFATINLNGISNTTKINALRDFVYT FT LDLDIVALQEVVGDCISIPGYAMIMNIDETQRGTAIGVRNGIGVKQVHRSL FT DARVICIELGNDENMIKIVNLYAPSGSNNRSARELFFNQTVPQFLXNHPAH FT VIMLGDFNSVIESRDATGVSNMSPMLKRLCDSLNLTDVWKHFNRQIDYTFS FT RGRSMSRIDKIYVTQACLQNTRYCRHEANCFSDHKAVVSRIVLPLQTQPVE FT RPYWKLNNQLLNDENLNELRIKWNYWVRQRRNYASWVDWWAIYAKGKIASF FT FRWKNSVAVQTFRNESMTKGRKTKGRKDKRSKNKKEGTKGRKSFFKEGKIS FT HQANHFRPFVLSTFCLSTLLSFRRFVFRPFLS*TFRNTMGFYYTCLNRAFE FT QHRDLDCTTEINRIKATMLQKQKQFSESRKIPNVSFLGGENVSIFQIEDQE FT NRHRDTSISMLDVNHRRITDKMEINNCILEHYRNLFDEEPIEEDDMFYPRR FT AINAEHEGNSHLMDDFTENEIFTVIRESAPNKSPGKDGLTREFYVKAWDII FT KRELILMINEIKNGNVRHDLQDGVIVLVRKKGGDNSINALRPITLLNVDYK FT IFSRVMKQRLEPIATAVLSPHQKCSNGDRTIFEATCTIRDHIAASKQERTG FT GIVIACDLSNAFDRVSRSYLLSTIRGMRFNERFVDLLTDLSAHTNSQITIN FT GHTTECFPIKRSVRQGDPLSMLLFVLYLQPLLDALAEICNGPGECVIAYAD FT DITIIIRCSSKVRRLIDCFNRFGIAAGARLNMRKTEALNIGVPLPNWDGFI FT QKDVIKVLGVLFKSSIKETVARNWEAVYRSVNGLLWMHRPRLVNIKQKVVL FT INTYICSKLWYMSSILPMPNKYVAQFTKQIGYFLWNGVSAQRVKMIDVFRP FT RLQGGLGLHSPDLKTKALLTNRYLRLLGITPCMNRYMQMQQVPSDLLHLKI FT IKSVKEGIPRQIIDNPTASLIYNHLLKQLPAVSITTRHTRNWKIIWKNINH FT KNLTSDEKSAFFLLVHGKLNHNELFFRQERRDDADCQLCGLVETLEHKFCT FT CVGINNLWQSTISNIQVIAQQNNINFENLHYPTISCNRRIKGKILKIFIRY FT IMYHLEIPDNAKSCESLRNYFVMYV" XX SQ Sequence 4598 BP; 1547 A; 939 C; 954 G; 1156 T; 2 other; cagttatcgt caagctacca agctatacag acgtgagtca aagcgatcgc tattgtaaaa 60 taaaaaatca catcggtacg gtattggaaa ggggtttaaa gcttgatcct aggaccaagc 120 ccccaaccgc gactatggat tcgctgagga aacaaaactt cttcatcgac attcgtgaaa 180 ttcgcaagtt gagggacgaa aacatcttca atttcattgc gaaaaacctg aagctaacgc 240 catcagacgt tgccgcaata caaattgatc gaattgaggg caaagtattt gtggagctac 300 aaacacagga agcggttcaa aaggtagttg aggagtacga caacaaatac gtgctcaact 360 acaatggcgc tactcaccgt atgagactgt acatcgagga tggaggtacg gatgtaaagc 420 tgcatcatct gccaccgaag atgccggagg agtggattgt tgactaccta tccaacttcg 480 gagagatcat cagtgtcaag caggaaatgt gccgttcgca gttcttctcc caaacaccaa 540 gtggcgtacg tattgtacgt atacgcatga caacacctat tccttcgttc atcaacatca 600 aagggtacag tagccactgt acctacagca atcagattca gacctgcaag cactgtaacc 660 aagaagtcca tttcggaatc agctgtgcag aaaaccgagc aaaactgtca acacaaaaca 720 aaccttcatc actctactcc agagtgcttg ttggtaccac caaagagcaa gcagaaaccg 780 caccagatac agcaactcaa aaacactcgt cgacccagca gcaaatctca acacaacaac 840 aaccaccaag ccacttcaag cgtccggctt cgccgcttaa cacggagtct cgagagaaat 900 cagtgaaaac tgtaacctcg gagactgaat ctacgactga acaacctgcg ccccaagcaa 960 cccgaccaac gaaggkcacc gaaatgaaga gcagtgtggt gagcgaccgt tcgcgcacta 1020 cgcccaccaa cagcagatca aattctctca ccagagcaac cgacaagatc acgatggaaa 1080 tttcggatga tgagttactc gacatgacga gaacacgacg atcatcgaag aaccggcacg 1140 acggcaagta aaatatagtt atggaagact cgattagaat acagcatttt tcctacacat 1200 tcgcaacaat caatttgaac ggcataagca atacaaccaa aataaacgcg ttgagagatt 1260 tcgtatacac actagactta gacatagtag cactacaaga ggttgtcgga gattgtataa 1320 gtataccggg atatgctatg atcatgaata tcgatgaaac acagcgaggt acagcaatag 1380 gagttcgaaa cggtataggt gttaagcaag tacatcggag cttggatgca agggtcatat 1440 gcatcgaatt aggaaatgat gaaaatatga tcaaaattgt caatttgtac gcaccgtcgg 1500 gatccaacaa tagatcagcc cgcgaattat ttttcaatca aaccgttcct cagtttctgc 1560 waaatcatcc agcacacgtc ataatgttgg gtgattttaa ctccgtaatt gagtcacgag 1620 atgccacagg tgtctccaat atgagtccaa tgcttaaaag attatgtgat agtcttaatc 1680 tcacagacgt ttggaagcac ttcaatcgac aaatcgatta cacgttttcg cgagggcggt 1740 ctatgtctag aatcgataag atatacgtta cacaagcttg cttacagaat acacgatact 1800 gtaggcatga ggctaattgc ttcagtgacc ataaagccgt tgtatctcgt atagttctcc 1860 ctttgcaaac tcaaccagtc gagcgtccct actggaaact taacaaccag ttactgaatg 1920 atgaaaatct gaatgaattg agaatcaaat ggaattactg ggtacgtcag agaagaaact 1980 atgcatcgtg ggttgactgg tgggcaattt acgcaaaagg gaaaatagcc tcttttttcc 2040 gttggaaaaa ctcggtggct gtacagacat tcaggaatga atctatgaca aaaggtcgaa 2100 agacaaaagg tcgaaaggac aaaaggtcga agaacaaaaa agaagggaca aaaggtcgaa 2160 aatctttttt caaagaagga aaaatttccc accaagcaaa tcattttcga ccttttgtcc 2220 tttcgacttt ttgtctttcg acccttttgt cctttcgacg ttttgtcttt cgaccctttt 2280 tgtcctaaac cttcaggaat accatgggtt tttactacac atgcctaaat cgtgcttttg 2340 agcagcatag agatctagat tgtacaacgg aaatcaacag gatcaaagcc acaatgctgc 2400 aaaaacagaa acaatttagc gaatccagga aaataccaaa cgtatctttt ctgggcggtg 2460 agaatgtttc aatttttcaa attgaggacc aagaaaacag acatcgggac actagcatct 2520 caatgttgga tgtgaatcat cgacggatta cggataaaat ggaaatcaat aattgcattc 2580 tagagcacta tcgaaacctt ttcgacgaag aacccatcga agaggatgat atgttctatc 2640 caagacgagc aataaatgct gagcatgaag ggaacagcca ccttatggat gatttcacgg 2700 agaacgaaat tttcaccgtg ataagggagt ctgcaccaaa caaatcacca ggtaaagacg 2760 gtcttacgag agaattctat gtcaaggcat gggacatcat caaacgagag ctcattttga 2820 tgatcaacga aatcaaaaac ggtaacgtga ggcacgattt gcaggatgga gtaattgttc 2880 ttgtgcgcaa aaagggagga gataacagta taaacgctct aagaccgatt acacttttaa 2940 acgtggacta caaaatattt agcagagtaa tgaaacaacg gctagaaccg attgctacag 3000 cagtgctttc tccccaccag aagtgcagca atggagatcg tacaattttt gaagcaacat 3060 gtacgatcag agatcacatt gctgcatcaa agcaagaacg tactggaggt atagtcatag 3120 catgtgatct tagcaacgct tttgataggg tcagtcggtc ttatctttta tctaccattc 3180 gtgggatgag attcaacgaa cgctttgtgg atcttttgac tgacctatca gctcatacga 3240 attcacagat caccatcaat gggcatacta ctgaatgttt cccaataaag agatctgtca 3300 gacaagggga tccgctaagt atgttgctgt ttgtgttata tctacagccg cttctagatg 3360 ctttagcaga aatatgtaat ggtcctggtg aatgtgttat agcttatgca gatgatatta 3420 cgattattat tcggtgttca tcgaaagtac gtcgattaat agattgtttt aaccgattcg 3480 gaatagctgc tggcgcaaga ttgaatatga ggaaaacgga agcattaaat atcggtgtac 3540 cacttccaaa ctgggacgga ttcattcaaa aagatgttat aaaagttctg ggggtactat 3600 tcaaaagcag cattaaggaa actgttgctc gtaattggga agctgtatac cggagcgtaa 3660 atggtcttct ctggatgcat agaccgcgtt tggtgaacat aaagcagaaa gtcgttctaa 3720 taaatacgta tatatgttca aagctctggt atatgtcgtc gatccttcca atgccgaata 3780 aatacgtcgc acagtttact aagcaaatag gatattttct gtggaacgga gtatcagcac 3840 aacgggtcaa aatgattgac gtttttaggc cgaggctaca aggaggtttg ggattgcatt 3900 caccagatct caagacaaag gctctcttga ctaaccgata cttgagactt ctgggtataa 3960 caccctgcat gaaccgatac atgcaaatgc agcaagttcc ttcagatctc cttcatctaa 4020 aaattataaa atcagttaaa gaaggcatac cacgtcagat aatcgataat ccaacggcaa 4080 gtctcatcta caatcatttg ctgaagcagt tacccgctgt aagtatcact acgcgacata 4140 ctcggaactg gaaaataatt tggaagaaca tcaaccacaa aaatctgacg tctgatgaga 4200 agtccgcatt ctttcttcta gttcatggaa aattgaatca caatgaacta tttttccgtc 4260 aggagagaag agatgatgct gattgtcaac tatgtggact agtagaaact ttagagcata 4320 agttttgcac atgtgtcgga atcaataatt tgtggcaatc aaccatttca aacattcaag 4380 taatagctca acaaaataat attaattttg aaaacttgca ttaccccaca ataagttgta 4440 atagaagaat taagggcaaa attttaaaaa ttttcattag atacataatg taccatttag 4500 aaattcccga caatgctaaa agttgtgaat cacttagaaa ctattttgta atgtatgtat 4560 aaaacggtaa aataaaacac tttttagaca aaaaaaaa 4598 // ID Gypsy-34_OD-LTR repbase; DNA; INV; 364 BP. XX AC CABV01001480; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_OD_; KW Gypsy-34_OD-I; Gypsy-34_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-364 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001480; Positions 52595 52232. XX SQ Sequence 364 BP; 106 A; 91 C; 47 G; 120 T; 0 other; tgacacgaca aggcaaaaaa cgaaaatttt atgatttgta aatatcccta cttattcttc 60 cgaaataact gtattccttt aaccaattca ctatctattt tcgattttcc actacgacca 120 cgatacttgt tgattcgccc tactttctat gcgcttaact gactgacatc aatcatggga 180 gtgacgaccc actgattggt atcgaaccat tttacccgtg acaactgacc tgatttcaca 240 gctgaccaac tttaactgtc taccctcagt ctatttaagg cgactccgac tgagaaaaaa 300 cattcttttt cacaatacat cccacgacat atttttgact cgactctttt tattgcattt 360 atca 364 // ID Gypsy-55_CQ-I repbase; DNA; INV; 6845 BP. XX AC AAWU01036466; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-55_CQ_; KW Gypsy-55_CQ-LTR; Gypsy-55_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6845 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 489-489 (2011). XX DR Genome; AAWU01036466; Positions 8232 15076. XX CC Positions [4685-5161] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 468..2423 FT /product="Gypsy-55_CQ-I_2p" FT /translation="MRQELEMHYRSMDVKHLLVDEVEHELLIRQEEFSGGD FT TLDAKRRKLRGVLKAQKESNNFDVLLVSEKYCEGEYEQVVTKLDHIRMMLE FT NKKVKKDELPPYKTRLIHLLFRLVRLKESGFDTLGSVREAKKLYIQHFTEW FT SSDPGVVKDVTAGTLLNISKAHDKLNKQTGKESDSWSDVPTGGRNRTSTPK FT AKNKKKKQERSIASKKGKKEDPNPTPQLVDMVMRQVQEYLDRRLSKLRLSD FT HEDWVTDSEYDQLKKKKKSKKLKTLKKVVPLRSRTVMPKKTKPKFVSREDK FT PGRKRSSILFTSGDDTEDFEADSDTSSSSEETDFYRNSKPLKRRPRPVADW FT KLKYDGKDDGKLLNKFLAEVEFMAEAEHLSKKDLFSEAIHLFSGEVRNWYM FT EGKRNKEFRSWKELITELKLEYQPPDMDYYYEQQATLRRQRKSEKFSDYYY FT AIKEIFGYMSKPPPPGRKFDIVYRNLRSDYRNALVVVKEIKNLSTLKFWGR FT KLDAANWYLYRGKENETGQKSAHVNEVSERPSQKASFPPSNKGWKPKDMVG FT KLNWNQNQGSSSQPNQKFGKPQTKNPSQSAPSPAPEQSNSSRGDLEKRVAE FT YRVPDGSTCFNCRGKYHHFGSCLKQKEIFCFVCGFHDFKSENCPYCAKNGR FT KST" FT CDS 2546..5527 FT /product="Gypsy-55_CQ-I_1p" FT /translation="MGLRVVGLLDSGAQVSILGAGAEHLLRKLNLREFGSE FT TKLFSAGGDRLEVSGYVHLPMTFNGTTKLVTTLIAPSLKRQMVLGMNFWRQ FT FGIEPTVPNTSEMLVEEVMLEQQEEVTLNPEQLAELEEIKKEFKVFVEGEV FT LGTTPFINHKIEFLEEFQHADPVRLNPYPWSPEVQKHVNCELDKWIEAGVV FT ERSTSDWALLIVPVVKKSEDGEAQMKVRMCLDARKLNERTRRDAYPLPHQD FT RILGRLGKSRYLSTIDLSKAFWQVPLDQESRKYTAFRVFGRGLFQFTRLPF FT GLVNSPATLSRLMDQVLGYGELEPNVFVYLDDIVIANDTFEDHLRCLKEVV FT ARLKAANLSVNMEKSKFCVSELPYLGFILSSDGIRPNPDKVEAIVNFERPS FT SVRSLRRFLGMVNYYRRFIEGFSEVTAPLTDLLKGKPKVVQWNQAAEDAFI FT ELKQRLISAPILANPNFELPFTVQTDASDSAIAGVLTQEQDGAEHVIAYFS FT RKLTTPQRSWKAAEKEGLAAMEAIEKFRPYIEGTEFMLITDSSALSFIMNT FT KWKSSSKLSRWSMILQQYAMTVRHRKGSENIVPDALSRAVELVEVETRDDW FT YADLFKKVQKSPEDFVDYKIEEAKLYKLVSTPTDVLDYRFEWKLCLPTDLR FT RPVLKAEHDDSMHPGYEKTIQRVKVKYYWPRMAAQCKKYVQSCLVCRQCKP FT STVGKAPEMGNQRLTNKPFQILAIDFIQSLPRSKSGHCHLLVMLDLFSKWT FT VLVPLRKIEAKEVCRIVEDCWMRRFGTPEVMISDNATTFVGKEFQALLRRR FT GVQHWPNSRHHSQANPVERANRTINACLRTYMKEDQRVWDTKVMEVEEMMN FT TTVHASTGMTPYRILYGHEKATEGSQHRLERDDQELSMGERDACRRKLNEK FT VFKVVEDNLKKSYDKNLRTYNLRHKKFAPTYEVGQQVFKRNFQLSSAADRY FT NAKYGPVYVPCVVVARRGTSSYELADENGRNLGVFSAAD" XX SQ Sequence 6845 BP; 1978 A; 1343 C; 1796 G; 1728 T; 0 other; ataacaaatc gttaaagttg aacagaggat tagttaaaat tttattaaaa ttttcaaccc 60 gagtgggagg agagtacaca aacaagttaa attgttgtgt atgatccaaa ttagtagccc 120 tcactgagat tagcctacaa agaagtgtaa ttagactttt agtaagtggg agaatggtgg 180 catttgagct cgaaatttaa atgaaagctg ttatttttgg ataaaaccac aacataaact 240 attattaatc ttttgagagc tgaaggctaa atattagtac acaaaaaggt cctgaatggg 300 aaaataaaac acataaacac caacattgat atttttatta tttacttaca taaatgtgtt 360 agtttgtcat tgcgtttgcg tattttatct atttatttat ttattatttt aacttgttgc 420 ttgtcttttg ttagaattaa acaataaaag tacttaaata cgtcacaatg agacaggaac 480 tggaaatgca ctaccgttcc atggacgtga agcacctgtt ggttgatgaa gttgaacacg 540 agttgttgat tcgtcaggaa gagttctcag gtggtgatac gttggatgct aagagacgca 600 agctgcgcgg tgtcctaaaa gcccaaaaag aaagtaacaa ttttgatgtt ttgttggttt 660 cggaaaagta ctgtgaggga gagtacgagc aggtcgttac aaagttagat catattcgca 720 tgatgttaga gaacaagaaa gtgaagaaag acgaactccc accgtacaaa actagattaa 780 ttcatttgtt gttccgtctg gtacgtttga aggaatcggg ttttgacacg ttaggatcag 840 ttagagaagc caagaagctg tacatacaac attttacgga atggtcatct gatccaggtg 900 tggtgaaaga tgttacagcg ggaacgttgt taaatatcag caaggctcat gataagttga 960 acaagcaaac tggtaaggag tcggattctt ggagcgacgt gccgactgga gggagaaaca 1020 gaacaagtac gcccaaagcc aagaataaaa agaagaaaca ggaaaggtcg atcgcgtcaa 1080 aaaagggcaa gaaggaggat ccgaatccca cacctcagtt ggtggacatg gtgatgagac 1140 aggtgcaaga gtatcttgat cgtcgtttga gtaaactgag gctgagcgat cacgaagatt 1200 gggtgaccga cagcgaatat gatcagctta agaagaagaa gaaatctaag aagctaaaaa 1260 ctttgaagaa ggttgttcct cttaggtcaa ggactgtaat gccgaagaaa acgaagccga 1320 aatttgtgtc acgggaggat aaaccgggaa ggaagcgttc atcaatactg tttactagtg 1380 gagacgacac cgaggatttc gaagcagact ctgatacatc aagtagttcg gaggaaacag 1440 atttctaccg caacagcaaa cccttgaagc gtagacctcg tccagttgcc gattggaagc 1500 tgaagtacga tggtaaagat gatgggaaac tcctgaacaa gttccttgct gaggtggagt 1560 tcatggcgga agctgaacac ctcagtaaaa aggatctctt cagcgaggcc attcacttgt 1620 tctccggaga agtccggaac tggtacatgg agggaaaacg caacaaggag ttcaggagct 1680 ggaaagaatt gattaccgaa cttaaacttg agtatcagcc accggacatg gattactatt 1740 atgagcagca ggctacgttg agacgtcaga ggaaatcaga gaagttttct gattattact 1800 acgctatcaa ggagatcttt ggatacatgt caaagccacc gccacctggc cggaaattcg 1860 acattgtgta ccgcaatttg cgatcggatt accggaacgc tttggtggtt gtaaaggaga 1920 tcaaaaactt aagtacatta aagttttggg gacggaaatt ggatgcggcc aactggtatc 1980 tctaccgagg caaggaaaat gagactggac agaagtcggc acatgttaac gaggttagcg 2040 agagaccgtc tcaaaaggcg tcgtttccgc cgtccaacaa aggatggaag ccaaaggata 2100 tggtagggaa gcttaattgg aaccagaacc agggttcgtc ttcacagcca aatcaaaagt 2160 ttgggaaacc gcagaccaaa aatccatctc agagcgcgcc gtcgccagct ccagagcagt 2220 ccaacagcag cagaggtgat ctggagaagc gggtcgcaga gtaccgcgtt cctgacgggt 2280 cgacctgttt caattgtcgt ggtaaatacc atcattttgg gtcgtgtttg aagcagaaag 2340 aaattttctg ttttgtttgt ggatttcacg acttcaaatc cgaaaactgt ccgtactgtg 2400 caaaaaacgg ccgaaaatcg acgtaggagg tcgtcgaaca tcaggtcgaa aacctccaac 2460 tgtgtttcta cccgctgaca ctgattgcgc tgagctgatc gtagaagtcg atggggataa 2520 cagaccgttt gttgctgtag atgtaatggg tttgagggtg gtcggattat tggacagtgg 2580 agctcaagtt tcaatacttg gagcaggagc agaacacctg ctcagaaagc tgaatcttcg 2640 ggaatttggt tccgagacga agcttttctc tgcaggtggc gatcggctag aagtttcagg 2700 atatgtccac ctccccatga cattcaacgg cacaacaaaa ttagttacta ccttgattgc 2760 tccctcactt aagcggcaga tggttttggg tatgaatttc tggcgccagt ttggcataga 2820 acccacggtg cccaatacct cggagatgct agttgaggag gtgatgttgg agcagcagga 2880 agaagtaaca ctgaatccag aacagttggc agagttggag gagatcaaaa aggagtttaa 2940 ggtgttcgtc gaaggtgagg ttctgggaac gacacctttc atcaatcaca aaattgagtt 3000 tttggaagaa tttcagcatg ctgacccggt tcgactcaat ccgtatccgt ggtcgccgga 3060 ggtccagaag catgttaact gtgagttgga caaatggatc gaagccggag tagtagagag 3120 atcaacaagc gattgggcac tgctgatcgt gccagtcgta aagaagagtg aggacggaga 3180 ggcgcaaatg aaggtgcgga tgtgtctgga cgctcgcaag ctgaacgaaa ggactcggag 3240 ggacgcctac cctctaccac accaggaccg tattcttggg cgacttggta aatctcggta 3300 cttatccacc attgaccttt ctaaggcctt ttggcaagta ccgttagatc aagaatcgcg 3360 taagtacacg gcattccgag tgtttggtag agggctgttt cagttcaccc gactcccatt 3420 tggtttggtc aatagtcctg cgactttgtc caggctcatg gaccaggtct tgggatacgg 3480 tgaactggag ccgaatgtgt tcgtgtatct ggacgatatc gtcatcgcca acgacacatt 3540 cgaggatcac ctccggtgtt taaaggaagt ggtcgcacgt ctgaaggccg cgaatttgtc 3600 ggtcaacatg gagaagtcga agttttgcgt atcagaactt ccgtacctgg gcttcattct 3660 ctccagcgat ggcattcggc ccaatcccga caaggtggag gcgatcgtca atttcgaacg 3720 accgtcgtct gtccgttcac tccggagatt tttgggaatg gtaaattact atcgacgatt 3780 tattgaaggc tttagtgagg ttaccgcacc attaacggac ttgctgaagg gtaaaccgaa 3840 ggtggttcaa tggaaccaag ccgctgagga cgccttcatt gaattgaagc agcgattgat 3900 ctcggcgccg attctggcga acccgaattt cgaactgccg ttcacggttc agacggatgc 3960 aagcgacagt gccattgccg gagtgttgac ccaggagcag gatggagcag agcacgtgat 4020 tgcctacttc tctcgcaagc ttacaactcc acaacgttcg tggaaagctg cggaaaagga 4080 aggtctggcg gcgatggaag ctatcgagaa attccgaccg tacatcgagg gaaccgaatt 4140 tatgctcatc accgactcat ccgccctgtc gttcatcatg aacacaaaat ggaagtcatc 4200 gtcaaagctg agcaggtgga gtatgatctt gcagcaatac gcgatgacgg ttcggcaccg 4260 caagggatca gagaacatag ttcccgatgc tctttcgcgg gcagtagaac tagtagaagt 4320 ggaaactcgt gacgattggt atgctgacct gttcaagaag gtgcaaaaat ctcccgagga 4380 tttcgtcgac tacaaaatcg aagaagctaa gctgtacaag ttggtctcga cgcccacgga 4440 tgtgttagat tacaggttcg agtggaagct gtgtttacca acagacctgc gtcggccggt 4500 cttaaaggcc gaacatgatg actcgatgca cccaggatat gagaaaacga ttcagcgcgt 4560 aaaagtaaag tactactggc ctaggatggc ggctcagtgt aaaaagtacg ttcagtcctg 4620 tttggtgtgt cgacagtgca aaccatcgac tgtgggcaag gcacctgaga tgggaaacca 4680 gcgcctcacc aacaaaccct tccaaatcct agccattgac ttcattcaga gtcttcctcg 4740 gagcaagagt ggtcattgcc atttgttagt gatgttagat ctcttttcaa aatggaccgt 4800 acttgttccg ctacggaaga tcgaggcgaa ggaggtgtgc aggattgtgg aagattgttg 4860 gatgagacgt tttggtacgc ccgaggtgat gatctccgac aatgcgacaa cttttgtagg 4920 aaaagagttc caggcgttgc tccgtaggag gggagtacaa cattggccga actctaggca 4980 ccatagccag gccaacccgg tagaaagggc caaccgcacg atcaacgcgt gtttgcggac 5040 ctacatgaag gaagatcaac gtgtctggga cacgaaagtg atggaagtcg aagagatgat 5100 gaacacgacc gttcacgcat cgaccggaat gaccccgtac cgcattctgt acggccacga 5160 gaaggccacg gaaggaagcc agcatcgctt agaaagggat gaccaggagc tgtccatggg 5220 tgagagggat gcatgccggc gtaagctgaa tgaaaaggtg tttaaggtag tcgaggacaa 5280 cctgaaaaag agttacgaca agaacctacg aacctacaac ctaaggcata agaaatttgc 5340 gccaacttac gaggtgggtc agcaggtgtt taagcgtaat ttccagttgt cgtcggctgc 5400 ggatcggtat aacgccaagt acggtccagt gtatgttcct tgcgtcgtcg ttgcacggcg 5460 tgggaccagt tcttacgagt tagctgacga gaatggtagg aaccttgggg tgttttccgc 5520 agcggattag cgtcctgatg gttcggttgg ccagccggaa gtaacctaag aagtagaagc 5580 ataagtatcc agaagcagag gtctcatgaa ctcaaggtac cagagattcg tgattcactc 5640 gtggagctgt atgcagatgt caacagagcg gtcatcacga aaaggaggcc gtgagaaccc 5700 atgatggagt aaatttgtcc tcatgtggtg acgatctcgg ccagtggtat aaaagaccgt 5760 gttggtagtt gttggtagtt cactgttgtg gtcatcgagg ttagcccaca gcagcgatga 5820 gtcatttgct gaagattggt tgagcgatga taacagacta ccgtgagtag tgagatgaag 5880 cacgtgagat gaagaggcag cagtgagtga gctgattata ttatatccac ggcgctgaaa 5940 gagaggaaga gagttatgac gtcattgatt tgatagactg tccggacggg ttagaataat 6000 agcaggagat ctccttttgt agtttgtagc gtgaagaatt gtagtagtaa taagaagatg 6060 catgcatgaa gaagagtagc gtagatggtc gtgaactgtc caatactgac cgtttgttta 6120 aatttccaat aatcattccg cgtcttttgg tagagatctc ttaggtgaaa aatatgaaag 6180 cagcttttaa tgcgcagaaa gcttgaatcc atgaatagga gcttggccaa agctcgcacc 6240 taggaaaagc caggtaggcc acctggttgg atttcctgaa aatgattaga tttctttagc 6300 tcaacatcat ttagatacag ttagatctac ttatcgtcat aaaccatcac aacatcccca 6360 tctgataact caaccgtact atttctgttt tgtagactgg ttttaaattt gttaaattat 6420 ttcacttttc gtttgtaatt tcttgtcgta ctaccgttca cgtccgcgcg ttgagagaaa 6480 ctgaacgtac tctctttcgc ttacatttgg ttaggctgaa atgagaggta ctaaataatt 6540 tgttttgaga gcagtaaaac tcaccactgg tgttaatgag agtcatactc gcagtataaa 6600 gtcgcacgtg gccaattatg gcatatttca aacgaaattt agcttttagt tgccgagatt 6660 ttaatgtgtt tatgttgtct tatataattt attcaaacat tagctgctac gcggacggat 6720 gatttgtatt ttccgtatag ttctgctaca cttttcgtaa cttattgagg tttcattaca 6780 atttttccaa ggtttaaata attgcactgt ttcccagtac aattattaat tgaagtgggg 6840 gatag 6845 // ID Gypsy-53_AA-I repbase; DNA; INV; 5358 BP. XX AC AAGE02021081; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-53_AA_; KW Gypsy-53_AA-LTR; Gypsy-53_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5358 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021081; Positions 28216 22859. XX CC Positions [2627-3127] - Reverse transcriptase CC Positions [4262-4732] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1049..5266 FT /product="Gypsy-53_AA-I_1p" FT /translation="MSSSSIACTVEPYRRGNSFNDWYTRLKYFFKVNKVKD FT EDKMAYFITLSGPVIFEEIKLLYPTGNFEEIPFDDLINKLKNRLDKIDPDL FT VQRYKFSTRVQQPDETTEDFVLALKLQAEFCDFENFKEVAVLDRIIAGLRD FT KNFRLKLLSEEKLKLASAEKIIATWEVAKANAGTAETSRDSPNMVAAIGNY FT SGRESGMAMRKLAKVYGGINDYQREEVGGEASGSRGPVKNRLGYRPYGRTD FT QTGRGRQNSGSAGRRGEMYARQRPDYSQMVCNYCGIKGHIKKKCFKLKNLH FT RDAVNLVESYKPGPSANRHINELLSRMRTRDSEDDDSDTGDLECMLVASIN FT KISNPCLIQVKIEGKYLEMEIDCGSSVSVISKRQYFALFDKPLQSYSNQLI FT VVNGAKLKIAGETMVSVRINGLEKLMSLLVLDCENSFYPLFGRPWLDVFYS FT DWRHFFSNSMTINSLQDESSENVVDKLKKKYSKVFEKNFSSPIQGFKAELV FT MKDETPIFKKAYDVPYRLREQVADYLDKLENEKVITPIDTSQWASPIIIVM FT KKNSDIRLVIDCKVSLNKLLVPNTYPLPTAQDIFANLAGCKFFCALDLEGA FT YTQLELSKRSKRFVVINTMKGLYTYNRLPQGASSSASIFQQVMDKVLEGIE FT NVTVYLDDVLIAGKTREECLKKLVIVLDRLTKANIKVNFDKCKFFVTKLTH FT LGHLITDKGLLPCPDKILTIEKANAPKNESELKSFLGLINYYHKFIPHLSA FT KLHYLYNLLKNNTKFSWDGNCQKAFEESKAALVNANILEFFDANKPIVVIS FT DASSYGLGGLIAHVVDEMEKPISFTSFSLNSAQKKYPILHLEALALICTIK FT KFHKYLYGKHFTIYTDHKPLVGIFGKEGKHSIYVTRLQRFILDLSIYDFDI FT IYRPSHKLANADFCSRFPLQQGVPEELDTDVIKSINFGKHIPIDSKMIAEE FT TKKDDFLHKIKWFMQNGWPERVEKRFVDVFANQHDLELIDDCLLYNDRVIV FT PTIYQKQILKLLHGNHSGIVKMKQLARRMVYWFGINTDIENYVSKCDACNG FT MSISSKPKESSKWMPTNKPFSRIHIDFFFFEHHTYLLIVDSFSKWIEIEHM FT KQGTDCNKVLKKLVCFFARFGLPDVLVSDNGPPFNSHNFVNFLERQGIRVM FT KSPPYNPASNGQAERLVRTVKEVLKCFLLEPEVAILDLEDQINLFLFNFRN FT NNLSNDGRFPSEKIFCYKPKTVLDLINPKKHYKNNISEPQSDEETTTKNQC FT GRIPRRPNDAIDELMTGDEVWYKNHNPQIHARWIKATFIKKYSHNTFQIRI FT GSANVMAHRTQIKICKDTSWQRPNVLINLGGAESVSNEEQGASTVQQNESR FT DKPKELPRGSKKRKHPVSEDEATEPRRSKRERKTNKSDDYFYS" XX SQ Sequence 5358 BP; 1768 A; 901 C; 1155 G; 1534 T; 0 other; aagatataaa agtggatacg aggtagtagg ataaaacgcg tgcatagaaa gagaagttaa 60 cctttacgtg cgagtgttag ttttctccat cgctggcgcc caagggtgga aaaggtcggc 120 agagtccatt ttgacgtcgg tctggtgagg tcgaaacgct aagcgttctg tacgggcgac 180 aaattttttt cgtatcaccg aggaagccta atctaaagcg actgaacatc ttaggctgaa 240 gttttcgccg attcttcttg ggtaaggtaa agtgaagggt aaacgttacc tcgtgatttt 300 ctcttgattt tacagccaaa gtaaaacaaa aaagagcgct tcagtagtgc tatatggtgt 360 gaaagcattg tgtgacattt cctcataact cctcccctcc ccatcctgac taacaaagaa 420 agaaatagct gtgagtgaaa gaaacccatt gtatattgaa aggtgcgaat agggtggact 480 aaaatttgtg tgttcaccaa atctaagaag atttttttct cgtttttgaa acaacacttc 540 gtcagttcct tttaataaga gagtttcttt ctttaatacg cagaagccag ccagatttta 600 agtgctaaaa accgattgtc aagtgatttt atcacaaggc tcattggcgg tatcgacgac 660 aaaactggtg gacgaatcag gaggagccaa atcccggaaa ataactaatc ggcaccgctg 720 ctacgcccaa ttccggtcgg catttattcc ggttaaaacc aaggttggga ctttcttgtt 780 cggcaaaggt cgcaagcggg caaaggccag gatgcaattg gtgagtcaga catttttttt 840 cctttttcct cttttattat ttgttaatca ttacatcaaa agagcttttt gttcatttgt 900 acattcaatt tgatagttat tgttgctttt gtttcttttt caaatttcat tctttgttcg 960 gtagtccgga aacattgctc aattgactat ttattgcaaa tttcaatatt tagatttaca 1020 gacagtaatt tatttctgct aaatcaatat gagttcttca agcattgcct gcactgtaga 1080 gccttatcgt agaggcaaca gttttaatga ctggtacaca cgtctaaaat atttcttcaa 1140 agttaataaa gtgaaagatg aagataagat ggcctacttt ataactctta gtggaccagt 1200 catctttgag gagataaagc tcctttatcc cactggaaac tttgaggaaa ttccttttga 1260 tgatttaatc aataaattga aaaatagatt agataaaatt gatccagatt tagtacaacg 1320 ctataaattt agcaccaggg ttcaacagcc cgatgagact acagaagatt tcgtgctagc 1380 actcaagttg caggcggaat tttgtgactt cgaaaatttt aaggaggtag cagttctcga 1440 tcgcattatt gcgggactac gagacaaaaa ttttaggctg aaattgctta gtgaagaaaa 1500 gttgaaatta gcttcggctg aaaaaattat tgctacatgg gaggtcgcga aagcgaatgc 1560 gggaactgct gaaacttcta gggattctcc aaatatggtg gctgcaatag gaaattattc 1620 cggacgagag agtggaatgg ctatgaggaa gctcgcaaag gtctatggtg gaattaatga 1680 ttaccagaga gaagaggtag gaggtgaggc cagcggtagt cgtgggccag tcaagaatcg 1740 attagggtac aggccatatg gaaggacaga tcaaacaggc agaggaagac agaattctgg 1800 atcagcaggc aggagaggag aaatgtatgc aagacaacga ccagactatt ctcagatggt 1860 gtgcaattac tgtggcatta aaggccacat aaagaaaaaa tgcttcaaac taaaaaacct 1920 tcatcgagat gcggtaaatt tagtggagtc ctacaagcct gggccttcgg ccaacaggca 1980 catcaacgag ttgctcagca gaatgcggac gcgggattcg gaagatgatg acagcgatac 2040 aggtgatttg gaatgcatgt tggtggcgtc cataaataaa attagtaatc cttgtttaat 2100 acaagtgaaa attgagggta aataccttga aatggaaatc gactgtggtt cttccgtttc 2160 ggtaataagc aaaaggcagt actttgcatt atttgacaaa cctttacaaa gttatagcaa 2220 tcaattgatt gtcgttaatg gggctaagct taaaattgct ggagagacga tggtttcggt 2280 gagaattaat ggactagaaa agctcatgag tctgttggtt cttgattgtg aaaattcctt 2340 ctatcctttg ttcggccggc cttggctaga tgtcttttat agtgattgga ggcatttttt 2400 ttcaaactct atgactatta acagtttaca agatgagtcc agcgaaaatg ttgttgataa 2460 attgaaaaaa aagtatagca aggtgttcga aaaaaatttt tctagcccta ttcagggatt 2520 taaggcggaa cttgtgatga aagatgaaac gcccattttc aaaaaggcgt atgatgttcc 2580 gtacagactg agagaacaag tggcagatta tcttgataaa ctggaaaacg aaaaggttat 2640 aacaccgatt gatacaagcc aatgggcttc acctattatc atagtcatga agaaaaacag 2700 tgatatccgt ttggtaatag attgtaaagt atcactcaat aaattattgg ttcctaacac 2760 atatcctttg cctactgccc aagacatttt tgctaatttg gcaggttgta agtttttctg 2820 tgcccttgat ttggaaggag cttatactca gttggaactc tctaagagat ccaaacgatt 2880 tgtagtgatc aatacaatga aagggctgta cacttacaat agactgccac aaggggcttc 2940 atcaagcgca tccattttcc aacaggtgat ggacaaagtg ttagagggga ttgagaacgt 3000 tacagtttat ctggatgacg tattaattgc cgggaaaacc cgagaagagt gtttaaaaaa 3060 acttgtgatt gttttggata ggttaactaa agctaatata aaagtgaatt tcgataaatg 3120 caaatttttt gtgaccaaac tcacgcattt aggacatttg attacggaca aagggttatt 3180 gccttgccca gataaaattt tgacgataga gaaagctaat gctccgaaaa atgagtctga 3240 gcttaagtca ttcttaggtt tgataaatta ctatcacaaa tttattccgc atttatcagc 3300 taaattgcac tatttataca atttacttaa aaacaataca aaatttagct gggatggcaa 3360 ctgccaaaaa gccttcgaag aaagtaaagc tgccctggta aatgcaaaca ttttagaatt 3420 ttttgacgcc aataaaccaa ttgtagttat ttctgacgct tctagttatg gtttaggagg 3480 acttatagct catgttgtgg acgaaatgga aaaaccaatt agtttcacat ccttctcgtt 3540 aaactcggcc caaaagaaat atcctatatt acatttggaa gcattagctc taatatgtac 3600 tataaaaaaa tttcataaat atttatatgg aaaacatttt acaatctaca cggaccacaa 3660 gcctctagtt ggaatttttg gaaaagaggg taaacattct atttatgtaa ctaggctcca 3720 gagatttatt ttagatttgt ctatttatga tttcgatata atatacagac catcacataa 3780 actagcaaat gcagactttt gttctagatt ccctctacag cagggggtcc ccgaggagtt 3840 agacacagat gtgatcaaaa gcataaattt cggtaaacat atacctattg attcaaaaat 3900 gattgcagaa gaaactaaaa aggatgattt cttgcacaaa attaaatggt ttatgcaaaa 3960 cggttggcct gaaagagtgg aaaaacgttt tgtggatgta tttgcaaatc aacacgattt 4020 agaactgatc gatgattgtt tgctttataa cgacagagta attgtgccca caatttatca 4080 aaaacaaata ctaaagctgt tacatggcaa tcactctggt atagtgaaga tgaagcagtt 4140 agcacgaagg atggtttatt ggtttggcat taacacagac atagaaaatt atgtgagtaa 4200 atgtgatgca tgcaacggca tgtcaatttc gagcaaacca aaggagtctt ccaaatggat 4260 gcctacaaat aaaccattta gcagaatcca tatagatttc ttcttttttg agcaccatac 4320 ataccttttg atagtagata gcttctccaa atggattgag attgaacata tgaaacaggg 4380 aacggattgt aacaaagtgt tgaagaagtt agtgtgcttt ttcgcaaggt ttggtttgcc 4440 agatgtttta gtctccgata atggacctcc tttcaattcc cataattttg taaacttttt 4500 agaaaggcaa ggtataaggg ttatgaaaag tccaccttat aacccagcaa gtaatggaca 4560 agcggaaaga ttagttagaa cagttaaaga ggttctaaaa tgtttcttat tggagccaga 4620 agtggcaatt ttggacctgg aagaccagat aaatctgttt ttgttcaact ttagaaataa 4680 taacttatct aacgacggaa ggtttccttc tgagaagata ttttgctata agccgaaaac 4740 agtgttggac cttatcaatc caaagaaaca ttacaagaat aacatttcgg aacctcagtc 4800 ggatgaagag accactacaa agaatcagtg cgggagaatc ccgaggcgtc caaatgatgc 4860 aattgatgag ctgatgacag gggacgaggt gtggtataaa aaccacaatc ctcaaattca 4920 cgcaagatgg ataaaggcaa ccttcattaa aaagtattct cacaatactt tccagatacg 4980 cattggaagc gccaacgtaa tggctcatcg gacgcagatc aagatctgta aggatacatc 5040 ctggcaaagg ccaaacgtcc tgatcaactt aggtggagca gaaagtgtgt caaacgaaga 5100 gcagggggca tcaaccgtgc aacaaaatga atctcgagac aaacctaagg agcttcccag 5160 gggaagtaag aaaaggaaac atcctgtttc tgaagatgaa gcaacggagc ccagaagatc 5220 aaagcgagaa cgaaaaacaa acaaaagtga tgattatttc tattcatgat taagataccg 5280 taatttcgga ttaggttaag tgttgaatta gttttcattc tgaagttgtg caatacactt 5340 aaagggggaa ggaactgt 5358 // ID L1-44_AAe repbase; DNA; INV; 4400 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-44_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4400 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1397-1397 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 159..1142 FT /product="L1-44_AAe_1p" FT /translation="MSAKKRRENTFRIDYANVPKKPLSEEVHQFVGATLGL FT KREDIVRIQYSRNLGVAFIKAACLEVAQXVVDEHDNKHELTVDGKSYKVRL FT VMEDGAVEVKLFNLSEDVTNEKIARFLMGYGELFSIREEVWDEKHLFAGLP FT TGVRVVRMIVRKNIPSYVTIDCETTLVAYYGQQQSCRHCSESVHNGVSCVQ FT NKKLLLQKLATRSTSYADVAKNPKPSRTKVSAPKPATPSAVQAQPISSSPA FT PSTSGIMPPPAVPTQHDTGAENNPATTTQMRESEKDLWVRVTRRSGKKTDG FT NETDSSTSSRSSQKRPLGKKMRCDEDNDSRNTDMQL" FT CDS 1227..4325 FT /product="L1-44_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFIQTMQLDIVFIQEIENEHLTIPGFNIIANVDHSRR FT GTAIALRDHIRYSHVERSLDGRLVALRVHDTTLCCIYAPSGTVYRAQRERF FT FNDTIAYYLRHRTEHVFLAGDFNCVLRQCDATGYNMSPSLQATVRQLKLHD FT VWLKLRSRDAGHTYITSNSSSRLDRMYVSTSLCEHLRRVETHVCSFSDHMA FT VTLRLCLPHLGQPQGRGFWSLRPHLLNEENLAELQNRWQFWTRERRNYRSW FT MDWWLEHCKAKIVSFFRWKSKQAYDEFNHAHQRLYNQLRRAYEGYYRNPTM FT LTTINRLKSELLLLQRRFTHAFVRINEPMVAGEPLSVFQLGDRRRKRTTIT FT RIFDEQGGIIDDSQAIQHNMFHYFSELYSEPDREIEPIEEFQCDQVVPQND FT PTNVACMDEITTADIWLAIKSSASKKSPGPDGLPKEFYHRAFDIIHRELNL FT VLNEALASNFPSKFVDGVIVLVKKRGTGDTVTAYRPISLLNFDYKLLSRIL FT KTRLENVMKTHHILSDAQKCSNADNNIFQAILSLKDRIAQLAARKQAGKLV FT SYDLDHAFDRVRHTFLYENMRSLGINPELIALLSRISSLSSSRLLINGHLT FT AAFPIERSVRQGDPISMHLFVLYLHPLIASLERVCGTDLIVAYADDISVIA FT TSTRKIELMRDIFLRFGHVSGAILNLQKTTSIDVGFINDHPLQVDWLRTET FT SIKILGVLFANSIKRMVSLNWDLLVGKVAQQVYLHSMRTLTIQQKVILINT FT FITAKVWYMASVLPLYSLHTAKITATIGTFLWKGHPARVPMTQLARPKDKG FT GMKLQLPVFKSKSLLINRHLQEIGSTPFYNSFLAQANPPNPPTDLLCLKAI FT LQTRHSLPPHVQQNPSSDQIHRFYIDQTDLPRVEQRYPTVNWKRVWLNIAS FT KDLNSTQRSAYYLLVNEKIEHRQLWHTIRRVDTERCIHCNAASETLKHKFS FT ECRRVAAAWEVLNRTFQNLSNNRRRFSFEELLHPELNNMNKRQKTLIMKYF FT INYVTFINKCDNFVDVSELEFYLQVEL" XX SQ Sequence 4400 BP; 1279 A; 1102 C; 966 G; 1052 T; 1 other; cagtaagcgc tcaacttccg agccgatcag acgcactttt gcctccgtat cgtctgtgga 60 agtcaatcgc cgacttcgca tagatttctt catttttcgg ttaagtagtg atacatcgcc 120 gctagtttcg ctacgctttt gtgtgatcat cgtaatcgat gtccgcgaaa aaacgtcgcg 180 aaaacacgtt tcgcatcgat tatgcgaacg tgccaaagaa gcctttgtcc gaagaggtgc 240 accaatttgt tggtgcgacg ctcggcctga agcgagaaga tattgttcgg attcaataca 300 gccgcaacct gggtgtcgca ttcatcaaag ctgcttgtct cgaagtggca caaaawgttg 360 tcgacgaaca cgacaacaaa cacgagctca cggttgacgg aaaatcgtac aaggtacggc 420 tggtgatgga ggacggggcg gtagaggtga agctcttcaa cctctccgaa gatgtcacca 480 acgaaaaaat cgctagattt ctaatggggt acggagagct gttttcgata cgagaagagg 540 tgtgggacga gaaacatctc tttgctggat tgccaaccgg cgtccgagtc gttcgaatga 600 tcgttaggaa aaatattcct tcctacgtca ctatcgactg cgaaaccacc ctagtggcct 660 actacggcca gcaacagagc tgccgccatt gtagtgaatc tgtacacaat ggcgtttcat 720 gtgtacagaa taaaaaattg ctattgcaga agctagcaac acgcagcact tcgtatgcgg 780 acgtcgccaa aaatccgaaa ccttcacgaa cgaaggtcag cgcaccgaag ccagcaactc 840 cgagtgcagt ccaagcccag ccgatctcgt cgagcccggc accatccact agtggcataa 900 tgcctccgcc ggctgttcca acacaacacg acaccggagc ggagaataat ccagcaacaa 960 caacccaaat gcgagaaagc gagaaagatc tgtgggtcag agtaacacgc agatcgggga 1020 agaaaaccga tggcaacgaa accgattctt caacctccag taggtcctct caaaagcgcc 1080 cgctaggtaa aaagatgcgc tgcgacgaag acaacgactc tcgcaacact gacatgcagc 1140 tttaatcagt catcttcacc agctacaata ctgcaactat aaacataaac actatcacga 1200 acgaaaacaa gatcaacgcg cttcgaatgt ttatccaaac tatgcagctg gatatagttt 1260 tcatccaaga gatagaaaat gagcatctaa caattcccgg cttcaatata atcgcaaatg 1320 tggaccactc tagaagaggt accgcaatcg cactgagaga tcatattcgc tactcgcatg 1380 tggagcgaag tttggatggc cgacttgtcg ctctgcgcgt ccacgacaca acactttgct 1440 gtatttacgc tccgtccggc actgtttaca gagcacagag agagcgtttt ttcaatgata 1500 ccatagcgta ctacctacgc caccgtactg agcatgtatt tctagcaggc gatttcaact 1560 gcgtgctacg acagtgtgat gccacaggat acaacatgag cccttctctc caggccaccg 1620 tgcgacagct taaactgcat gatgtatggc ttaaactgcg ctcccgggac gctgggcata 1680 cgtacatcac atcaaactct tcctctcgtc tggatcgcat gtacgtcagc accagtttat 1740 gtgaacatct acgaagggtg gaaacccacg tttgttcatt tagcgaccat atggcagtga 1800 cactgcgttt atgtttgcca catcttggtc aacctcaagg gcgaggtttt tggtcccttc 1860 gtccccacct cctcaacgaa gaaaacctag ccgagctaca aaatcggtgg caattttgga 1920 ctcgcgaaag acggaactac cggtcctgga tggattggtg gttggaacac tgtaaggcga 1980 aaattgtcag cttcttccgt tggaaatcca aacaagcgta tgacgaattt aatcacgcgc 2040 accaacgtct ttacaatcaa ctgcgacgcg catacgaggg atactaccga aatccaacaa 2100 tgctgaccac catcaaccgc ctgaaaagtg agttacttct gctgcagcga cgctttaccc 2160 acgccttcgt acgtataaac gagccgatgg tggcgggaga acctctctcg gttttccaac 2220 tcggcgacag gcggcgaaag cgaacaacga taacacgtat cttcgacgag caaggaggaa 2280 tcattgatga ctcgcaagct atacaacaca acatgtttca ttatttctct gagttgtatt 2340 ccgagccaga tagagaaatt gaaccaatcg aagaatttca gtgtgaccag gtcgtaccac 2400 aaaatgatcc aacaaacgtt gcatgtatgg acgaaattac tacagcggac atatggctcg 2460 cgataaaatc aagtgcgtca aaaaaatctc ccgggccgga tggtttgccc aaggaattct 2520 atcaccgagc tttcgatatc atccaccgcg agctgaatct tgtgttgaac gaggcactgg 2580 catcgaactt tccatctaag tttgtcgacg gggtgatcgt acttgttaaa aagaggggta 2640 ctggcgacac tgtcaccgca tacaggccga tttccctttt aaattttgat tacaaacttc 2700 tctcgcgaat attgaaaaca cgccttgaga atgtaatgaa aactcaccac atactcagcg 2760 acgcacagaa gtgttcgaat gcagacaaca acattttcca agccatacta tcactaaagg 2820 acaggatcgc acagctggcg gcgcgcaaac aagcagggaa gctcgtgagt tacgatctcg 2880 atcacgcttt tgaccgagtg cgacacactt tcctgtacga gaatatgcgc tctcttggta 2940 tcaatccaga attaattgct ttgctctcga ggatctcatc tctctcctct tcccgcttac 3000 tgataaacgg acatttgacg gcggcgtttc ccatcgagcg ctcggttagg caaggggacc 3060 caatctccat gcatttgttt gtattatacc ttcacccctt gatagcctca ctagagcgag 3120 tatgcggaac ggacctgatt gtggcatacg ctgatgatat tagtgtgatt gccacttcga 3180 cacgcaagat cgaactgatg cgagatattt ttcttcgttt tgggcatgtt tctggtgcaa 3240 ttttgaatct gcaaaaaacc acatccatag atgtgggttt catcaatgac cacccattgc 3300 aagtggattg gttacgaact gaaaccagca tcaagatact gggagttcta tttgccaact 3360 caataaagcg aatggtaagc ctgaactggg acttgttggt cggaaaagtt gcgcagcaag 3420 tttatcttca ttcgatgcgc acacttacta tacagcagaa ggtaatactc atcaacacgt 3480 tcattactgc aaaagtttgg tacatggcgt ctgtgttgcc gctatacagc ctacacacag 3540 ccaaaattac agctaccatc ggaacctttt tgtggaaagg acacccagct cgtgtaccaa 3600 tgacgcagct tgcgcgacca aaagacaaag gcgggatgaa acttcaatta ccggttttca 3660 aatcgaaatc tcttttgatc aaccgacatc tgcaagagat cggctccact cccttttata 3720 attcctttct tgcacaagct aatcccccaa atccacctac agatctcctt tgcctgaaag 3780 caatcctgca aacgcgtcat agtttgcctc cccatgtcca acaaaatccc tcctccgatc 3840 aaatccaccg tttttatatt gatcaaaccg accttccgag agtggaacag agatatccaa 3900 cagtaaactg gaagcgtgta tggctgaaca tcgcaagcaa ggatttgaac tcaacccaac 3960 gtagtgcgta ttacctcttg gtcaacgaaa aaatcgagca ccggcagctg tggcacacca 4020 ttcgccgagt cgacaccgaa cgctgtatcc actgcaacgc agcgtcagag acactcaagc 4080 acaaattcag cgaatgccga cgcgtggcag ccgcgtggga ggttttgaat aggacgtttc 4140 agaacctttc gaacaatcga aggaggtttt cctttgagga gctgttacat cctgaactaa 4200 acaatatgaa taaacgacag aagacactaa taatgaaata tttcatcaat tacgtcacat 4260 ttatcaacaa gtgtgataat ttcgtggacg taagcgaatt agaattttat ttacaagttg 4320 aattgtgaac aaattgcact gtaattagtt ttaaacaaga tcaaccgaaa tacactatat 4380 tttatgtaaa aaaaaaaaaa 4400 // ID Copia-16_SI-I repbase; DNA; INV; 4004 BP. XX AC AEAQ01021270; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_SI_; KW Copia-16_SI-LTR; Copia-16_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4004 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01021270; Positions 7541 3538. XX CC Positions [1454-1951] - Integrase core CC 'CGTCC' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(53..1111,1115..2542) FT /product="Copia-16_SI-I_1p" FT /translation="MADTTKTTLAKLNRQNYFVWKYRMEILLMKEKLWSVV FT NEDKPVTGTDGRSAEQVAAWQQRDDQARGWIGLLVDDSQFCYIRNSTTAKK FT AWYALKAYHEKDTLSNKVTLIRRICCSKMEKSGNMDDHLTNLTGLFQKLTD FT LGEELSDSWIVGMINSLPRSYDTLVTALKTRLEADLTLSLVQSKLLAEHNR FT RKEAHGGDGGEAVLKTIVRSTTCFFCKKSGHIKRDCTQYKNWKEKQNKKTK FT TKEKQDKENKVPKLEKLNKVEQNQEFLFMVKAGSHAGWVIDSGATCHIASD FT ERYFVSLDKTVRNNLNVANCEKVSVKGKGTCVIPFVNVDGVESRATISEVM FT YAPDIDGNLLVKKLLDNGYTMNFVPGACEILYKDRKMGIADEHSNLFRLRK FT PIRACMVREHSSKCIHYRHRVFGHHDPEAMKLMCLKELTNDMVIEDCDIKT FT LCEVCAQGKLTRMKFPKQSSSKSTEPLDLIHTDVCGPMQTVSPSGMRYVLT FT FIDDFTRHTVIYLLKQKSEVEIKLKEYIEMVKTMFGRKPKVIRSDRGEEYV FT RKQVIGMLQNEGIRVQYIAPYTPQQNGVAERKNRTLIEMARCMLFEAGLPY FT SFWAEAVNTANYIQNCSPSQSVDKTPHELWTGSKPGTKHLAIFRIKRFVHI FT PSEERRKLNKVAAPMIFVGYDEHSKAYWCYDPLQKKMVISRDVRFTGKMYS FT ASKVDINLSKGKISSEDESANVEIDLNKCDPEPVIVQEEWNDIREWNNDSE FT YESANERTEQSDNEEGKAEQQQQLRVSQRVNKGVPPRRFIEEINAVSTQVT FT EPKTLTKVMASEQNDEWIKAMTVRWTR" XX SQ Sequence 4004 BP; 1345 A; 742 C; 1002 G; 915 T; 0 other; ggttatgggc ccaggttccg tgcgtgcata aagtgatttt gcatacgaca aaatggcgga 60 cacaacaaag acaactttag caaagttaaa cagacaaaat tacttcgtat ggaaatacag 120 gatggagata ttactcatga aagaaaaatt gtggagtgta gtgaatgaag acaaacctgt 180 aacagggacg gatggcagaa gtgcagaaca ggttgccgca tggcaacagc gagacgatca 240 agcgcgcgga tggattggac tcttggtcga cgacagtcaa ttttgctata taaggaactc 300 gacgacagca aagaaagcat ggtatgcgct caaggcatat catgagaagg acacgctgag 360 caataaagtg actctgattc ggcgtatttg ctgcagtaaa atggaaaaaa gtggcaacat 420 ggatgatcat ttgaccaatc tgacgggtct gtttcaaaaa ttaacagatc ttggtgaaga 480 gctttcggat agttggatcg tcggaatgat taacagccta ccaagaagtt acgacacgtt 540 ggtcacagcg ctcaagacaa gactggaagc agatctgact ttgagtttgg tgcaatcaaa 600 gttgctcgcg gaacacaatc gccgaaaaga agcccatggt ggcgatggcg gtgaagcggt 660 attgaagaca atagttagaa gcacaacatg ttttttctgc aagaaaagtg ggcacataaa 720 gcgtgattgc acacaataca aaaactggaa agaaaaacag aataagaaaa cgaagacaaa 780 agaaaagcaa gataaagaaa ataaggtgcc gaaactggag aagctcaata aagtagaaca 840 aaatcaggaa tttttgttca tggttaaggc tggcagtcat gcaggttggg ttattgattc 900 aggggctacg tgccatatag cgagcgatga acggtatttc gtgtcgcttg ataagacagt 960 gcgcaataat ctcaacgttg cgaactgtga aaaagtaagc gtaaaaggca aagggacatg 1020 cgtaatccca tttgtgaatg tagatggtgt tgaatcgcga gctacaatct cagaagtgat 1080 gtatgctccg gatattgatg gaaacttact ataggtaaag aaacttttag acaatggtta 1140 tacaatgaat tttgtgccgg gcgcgtgtga aatcttgtat aaagacagaa agatgggaat 1200 tgcggacgaa cattcaaatt tattccgatt acgcaaacca ataagggcgt gtatggtcag 1260 agaacacagt tcaaaatgca tccactatcg gcatcgagtt tttgggcacc atgatccaga 1320 agcaatgaag ttaatgtgtt tgaaagaact gaccaatgac atggtgattg aagactgtga 1380 cattaaaaca ctttgtgaag tgtgtgctca agggaaactg acacgcatga aattcccgaa 1440 acaatcatca agcaaatcga cagaaccgct cgatcttatc catacagacg tttgcggacc 1500 gatgcagaca gtatcaccaa gtgggatgcg ttacgtgctc acatttatcg acgactttac 1560 taggcataca gtaatttact tgcttaaaca aaagtcagaa gttgaaatta aactgaagga 1620 atatatagaa atggtgaaga ctatgtttgg gcgtaaacct aaagtgatca ggtcagatcg 1680 aggagaagaa tacgtcagaa aacaagtcat tggaatgctg caaaacgaag gaattcgagt 1740 gcagtatata gcgccataca cgccacagca gaacggcgtg gcggaacgta aaaaccggac 1800 gcttattgag atggcgaggt gtatgctttt tgaggccggg ctgccataca gtttttgggc 1860 agaggcggtg aatacagcca attatataca gaattgttca ccttcccagt cagttgacaa 1920 aacaccgcac gaactgtgga ctgggagtaa accaggtacg aagcatcttg caatttttag 1980 aattaaacgt tttgttcaca taccctcaga ggaacgacgc aaattgaaca aagtggcagc 2040 accaatgatc ttcgttgggt atgacgaaca ctcaaaagca tactggtgct atgatccgct 2100 tcaaaagaag atggtcatca gtcgagatgt gcgctttact ggcaaaatgt attcagcatc 2160 aaaggttgac atcaatttat ccaaagggaa aataagttca gaagacgaat cagccaatgt 2220 cgagatcgac ttaaacaagt gtgatccaga accagtgatc gttcaagagg aatggaacga 2280 tattcgtgag tggaacaacg attcagaata cgaatcagca aatgagagaa cagaacaatc 2340 agataatgaa gaaggcaaag cagagcagca acaacaatta cgcgtctcac agcgcgtaaa 2400 taagggcgtc cctcctcgtc gctttatcga ggaaatcaat gcagtaagca ctcaagtgac 2460 agaaccaaag acattgacca aagtcatggc tagcgaacag aatgacgaat ggatcaaggc 2520 catgacagtg agatggactc gctgatgagg aatgggacct gggagctgtg cgagctgcca 2580 agcgatcaaa aaccagtagg atgcaagtgg atttttaagt tgaaaacgaa tgcagacgga 2640 agcatcaaat gctataaagc gcggttaatc gcccaaggtt tttcgcaaaa atatgggaca 2700 gactacgatc aggtatttgc tcccgtggta aagcaaacga cattccgaat tcttttaact 2760 atagcagcgg aacgaggcat gctggtattt cactgggatg caaaaacagc tttcctaaat 2820 ggaaaactga aagaaaagat ctgtatgaaa cagccaccag gatatgagcg agaaagcgca 2880 ttggatcttg tctgcttgct catgaaaagc acggactcaa gcaggcagct aaatcatgga 2940 acaaggcgat tcatgatgcg ctagagaaga tcgggttcat ccaggatgat gcggatccgt 3000 gcatgtattc ggcaaaactg agtaatgagt ggtgctttat attcatttat gtggatgatg 3060 tggtcattgc aagtaaggag cttcaagtga ttgaatcagt gaagaatgcg ctctcttcgg 3120 aattcgagat gcaggacctt ggggagatcc ggcactattt gggattggag gtcaataaga 3180 atccaggagg atattaccaa atctatcaga caaactacat tcatcaagtg gcatcatcgt 3240 ttgggttaag tgaagccaag gcatcaaatg tgccaattga cccaaactac aacaaatcaa 3300 aggataaaac agaggttttg ctcaacaaca caaactatca acgtctgatc gggtgtctgc 3360 tatatatttc gatatgcact cgaccagaca tatcggcgag cgtgtcaatc ctggcacaaa 3420 aggtcagtag accaactcaa cgagactgga acgaattgaa gcaagttgtc aaatacctgg 3480 ttggtagttc caaattgaag ctgacattgt aaggcaacaa agcaggagtt tgtttttggc 3540 tacgctgatg caaattgggc taaaagcaag acagatcgga aatcaaacag tggctacata 3600 tttctggtaa atgggggaat tgtgagttgg gcatgtcgga agcagacgtg cgtggctcta 3660 tcaacaatcg aggcggaatt catcgctctg tccagtgctt gtcaagaagc tttacggcta 3720 catcgaatct tagatgacat gaagcaacca gttagtggac caatcactat atacgaggac 3780 aaccagagct gcctcaaact cattgaggaa gagaaactgt caaaccgcac taaacatatt 3840 gatactaagt atcattttgt gaaggacttc gtcaacagag gggaagtgca atgcaaatac 3900 tgtccaacag aggctatgct ggcggatcta cttgcaaagc ctctaccagc acatcggttc 3960 aaggcattgc gagaggattg tggtcttgtg tgattaaggg ggag 4004 // ID MuDR-1_DPu repbase; DNA; INV; 4160 BP. XX AC ACJG01002029; XX DT 27-FEB-2011 (Rel. 16.02, Created) DT 27-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE MuDR-type DNA transposon from Daphnia. XX KW MuDR; DNA transposon; Transposable Element; MuDR-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Direct Submission to Repbase Update (09-FEB-2011). XX RN [2] RP 1-4160 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX DR EMBL/GenBank/DDBJ; ACJG01002029; Positions 1156 5315. XX FH Key Location/Qualifiers FT CDS join(1048..1725,1721..2122,2159..2683,2747..3160) FT /product="MuDR-1_DPu_1p" FT /translation="MAITQDTNYNLQEIADDTKFIREIISYPDVAIFMYHP FT QLWQLFIGLLNRKDLPYITLSVDTTFEMTDGYVTVSVARFVEFNECPVFPL FT ATMIHERKLGETHRFFWQKVTQHFPQLVTASNIYIVSDEEEAIVGGVKEFM FT PHTDAYRCWNHIITNAKLKLKRLNITAKNEVSKYVDDIFFLLNQEEEKSYY FT KELSVINADPERWDPVIMFLKCIHSRLIFQTFKLGFGFKKYFMDHINSKVD FT RAGRWIIKKHNLSSVTTNISESFNCLMKRFMDWTDRPIDVLCVAFYRLAKT FT FVMEFIRSRYRKGNYTIRSSVSHLYNIRDNPPDPAMLEKVEKEREIFDRLR FT RVREEQKKVILLFVVVTGGPSYLFALFGLQSTAEVQIKEEGIEKQVIEDHE FT FTTWERAQHVVSNDLIKLDVKTHVFIVVGSAENRLVTLCPKITCTCPATTV FT CYHVLAAHLAIGDCHPSVTKKPKNSTRYRKRAKKTADKRSGNKAPRKLDID FT RPPVTSPSTPLPTKRKKSAKSSSNTAPDICTSPKKRKVSVSVVTRNGTHIP FT ESSTDFSDVTLPSETDSDDDPVLENSSSPFHDHRGRSDVIKRVSTESPKKC FT IGGQVVTNAMLEEILQMEETVTLTVYRIIPIDAIHINDISTESITGTAVAI FT IHRENDLEGDASTDASDSIPVIKY" XX SQ Sequence 4160 BP; 1339 A; 810 C; 838 G; 1173 T; 0 other; gtaatgtgca tgacaactct gcatgactgc atgataataa tctgcatgaa ttcgccgcat 60 gaaccgagct gtcatgcacg ttgcgatggc atagtaaact gcatgactat ttctgcatga 120 caccaaaaga aagcttttgt tttaatgggg ggggggggaa tggccatgca tacttaaagg 180 aagatgaaat gcttctggca agcgcccatt tgaaaaacaa ccaattgtta catgattggc 240 gcgccgattg ctttgttaat cttttagagt atgagttgga atcagcgtct attcattagt 300 tgtgacgtga ctgtccaagt gaaaagatat ggctttcaaa agcaaacggc aagtggctgc 360 tgaaagagtt attcgacttt gcgagaaact tttgaatacg gaaccagttg taaacgttat 420 aatcaatcca tgtgctggtg tagaatacgt gtacgagcca gctgaagaga aacaaataaa 480 agacggcaag tagaaaaaaa gttaactagt atagtaaaat aggaacacgt accattttta 540 ttttgtaagg tgcactaagt tagcaattta agtcatatat tttaagatcg aatctatatt 600 tttgtagact ggcgggcgga cggctggcat tggcgccaga atgcttctta taaccacaca 660 gtaaatgggg ctacattcaa aaaaattcct ttttactgta taatcggttc agaagtcaat 720 ggcactgctc caataactac gtcttttact cgggatgtgt tttaccatcc aacaattccg 780 catcgtgtat tgatagctta cagtggtgac gaaagaaatg cgaaagaaca cagccacggc 840 aacgcaaaaa ctgagaaaaa acgagcagaa aattttattc ctacaagaaa ttcaacaaaa 900 gaaaagttga agaagagttc tggctatccg atcaacgttt atagagagga atgtgctgac 960 ggaccagcag atttacaaga acaggctgtt gcgctacctc gcgatgtgat tcaggttcgc 1020 aattttcaaa aaaacgagaa gcgtaaaatg gccatcactc aggataccaa ctacaacctt 1080 caagaaattg cggatgacac caaatttatc agagagatta tatcgtatcc cgatgttgca 1140 atttttatgt accatcctca gttatggcaa ttgtttatag gactattaaa tcggaaagat 1200 ttaccctaca ttactttgtc ggttgataca acatttgaaa tgactgatgg atacgtcacg 1260 gtatcggtcg ctcgctttgt agaatttaat gaatgccccg ttttcccact tgctactatg 1320 atccacgaaa gaaagcttgg tgaaactcat cgttttttct ggcaaaaagt tactcaacac 1380 ttcccacaac ttgtcaccgc atctaacatt tatattgtgt cggacgaaga agaggcaatt 1440 gttggtggtg ttaaggaatt tatgccgcac actgacgcct atcgttgctg gaatcacatt 1500 ataacaaatg caaaattaaa attaaaacgc ctcaacataa ctgctaaaaa cgaggtatca 1560 aagtatgtgg atgacatatt tttcctccta aatcaagaag aggagaagtc ttactataaa 1620 gagctatctg tcataaatgc tgatccagaa cgatgggacc cggtaataat gtttttaaag 1680 tgtattcatt caagattaat atttcaaaca tttaaattag ggttttaaaa aatacttcat 1740 ggatcatatc aattcaaaag tcgatcgggc aggtcggtgg ataattaaaa aacataactt 1800 atcctccgtg acgacaaaca tttcagagtc ctttaattgc ctgatgaaga gattcatgga 1860 ctggacagat cgtccaatcg atgtcttgtg tgtagcattt tatcgtcttg ctaaaacgtt 1920 tgtaatggaa tttatccgaa gtcgatacag aaaaggtaac tacaccattc gctcgagtgt 1980 ttctcacctt tacaacattc gagataatcc accggatcct gcgatgctag agaaagtgga 2040 aaaagaacgt gaaattttcg atcgtcttcg tcgggtgaga gaagaacaga aaaaggtaat 2100 cttgttattt gttgttgtta cctaaagctt cgttcttaat gtacctgaaa cttgataagg 2160 aggtccatca tacttattcg cgctttttgg tttacagtcc actgcggaag ttcaaataaa 2220 agaagaagga attgaaaaac aagttatcga agaccacgag tttaccacct gggagcgcgc 2280 tcaacatgtt gtttctaacg acctaatcaa actggacgtc aaaacgcacg tttttatcgt 2340 tgttggctcg gctgaaaatc ggcttgtaac cctttgcccc aaaataactt gcacctgtcc 2400 ggctacaact gtgtgctacc atgtcttggc agcccattta gctatagggg actgtcatcc 2460 atcggtgact aagaagccta aaaattccac aagatacaga aaacgagcaa agaagaccgc 2520 agacaaaagg agtggcaata aagccccgag aaaactagac atcgacaggc caccagttac 2580 ttctccgtca actcctttac cgacgaaaag aaagaagtca gcaaaatctt cgtcgaatac 2640 agctcccgac atttgtacat cgccaaagaa aagaaaggtt agttagacta aaagatctta 2700 tgtagaattt gaacatcaat cttagtgaca tttatattaa ttctaggtgt ctgtcgtgac 2760 gcggaacggt acacatattc ccgaatcatc aactgacttc tcagatgtta ctttgccttc 2820 cgagacagat tccgatgacg acccagtttt agaaaactcc tcgtctccat tccatgatca 2880 tcggggtagg agtgatgtca taaaaagggt atcaactgaa tcgccaaaaa aatgtattgg 2940 tggccaggtg gtaacaaatg caatgctcga ggaaatattg caaatggaag agaccgttac 3000 cttaaccgtc tatcgcatta taccaatcga cgcaattcac atcaatgata tttcaacgga 3060 atctattacc ggtactgcgg ttgcaattat acatagggag aacgatttgg aaggcgatgc 3120 tagtactgat gcttccgact ctattccggt aattaagtat taagtaacta cgagaccata 3180 aataaatact cttttgattt cttacagaat gcagaagttc agtttctggc cgcttctctt 3240 gctacgtcgg aaatagatta tcgccagact tggcaattag cggcggaaga aatcaatgaa 3300 ataaaacgtt ctacaaaact tacaacgcgc cacatcaact ctgcactggc acttgtgcgt 3360 aagaattttc cggaaatcgg tggattattc aatgttcatc acgcaacgtc cactggaact 3420 taccctgtgc cgaaagaaaa acgttggata caaataattc ataccggaaa atgccattgg 3480 gtcctagctg tttccggttt cccagtgtgc agaaaccata atgtagccgt ggctacatga 3540 cagcttggga tttgaaagga gttccgaaga ggaaacggtg aaagcaattt ccagcttgtt 3600 gggttataca gattacagct tgtatgcccc aagttgtcaa aaacaagctg acaaaagcag 3660 ttgtggtgtg tttgcacttg cattcgccgt tgaagttgct ctaggagcag acccttcaac 3720 cattgtcttc gtaaaagaat ctctaatgaa ggatcactta aagggatgtt tacgagtctt 3780 agaattaaaa ccttttccaa aaacatcgac atccaaagta caaccaacca gacaaaaagt 3840 atggaaaata accagttaaa attaagttca ttgttgctat gttgttgctt ttttgtaacc 3900 ggttaagccc tttaacttta ctttattatg taaatccatt attttccttt cctgtttttt 3960 tttctttcat gaaaaatttt tcattttgtt ttgacttcat tttaacgaca cgatacaata 4020 caagatacaa caaaatacaa tacagtgatt gaatttcaat ataaaatatg ggggtaatct 4080 gcatgaatcc ttttcgattc atgcggcgaa ttcatgcaga ttattatcat gcagtcatgc 4140 agagttgtca tgcacattac 4160 // ID Gypsy-144_AA-LTR repbase; DNA; INV; 1174 BP. XX AC AAGE02024568; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-144_AA_; KW Gypsy-144_AA-I; Gypsy-144_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1174 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024568; Positions 28674 27501. XX SQ Sequence 1174 BP; 337 A; 266 C; 255 G; 316 T; 0 other; tgttaccatt ttggttcttt gccctatata gcctagcaca gcaaggagag tgagtggcaa 60 tactactcac catactactg ttcatgatga gtggtaaagc tttgcctgcg aatttatagc 120 acaaacaagt atatcgtttt tgttcaactt ctctgtgtcc cttataacta ttgaacataa 180 tttccaatta gaatatttgt acttcattca taagctaact aataagattg caataaaacc 240 cattaagtat ttgaggtgtt aggagattgt tatgactcca tataagaacg gtcattttga 300 atcctaggga ataagattgt taaaacactc acatgcacac tttacacgaa aaatgctgaa 360 ccaaaggaat ggattctgca gtgcctggat atatatgaga tatagttata taggtctgaa 420 tgggaggtag gaagggcttc acgaatccta tcagcgttct tttccactca agtgctcgaa 480 ggagtaacat cagcttaagc ctagaaaagt gaacaccaag tgcctttgat taaaaagttc 540 caagtggaaa ataagaacaa ctcttagtgg gatttagtgg agaatcaggt atgttaccct 600 gcctaggagc cattctcgtg accctattct aagcccagtt tatacggctt aacaggaccc 660 attaggtgtt gaccacgcat tggtttaagc tccaagcaag tgcggagcgg cgtgttcccc 720 ggttagtcat ccgccgttgt gagaatctca agctcttcct ccagtgacca gaagcgacta 780 ttgagccacg acctggccgt ttggtttagg gtaggcaatc tgtaaaacca ccagcgttcc 840 tcaaagccgc cacacgtgac gagcacgtgt ccatccattg gtccagctaa gccggccggc 900 caacgatcat cgcgccttaa cccacacaca cacacatcac acattcatag cgtaagtaaa 960 atttaaataa aaagtgattg aaagtatagg accgaaagag ttggttttgt gttctgatcc 1020 gccccgttcc atccgttgtc tagccgaccc tgtgtaaaga ggcgggtgta gcccacccca 1080 gacgactccc gtggcttgac caggagtcag gggattgcta ggtcaagacc cttcaggtct 1140 tgacattctg tccaggaatt gttcccaagt aaca 1174 // ID DNA8-21_AP repbase; DNA; INV; 799 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-21_AP. XX NM DNA8-21_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-799 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1763-1763 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 799 BP; 350 A; 68 C; 83 G; 298 T; 0 other; tagtgtttga aaatttcatg aaatttaaaa aaatgcgtga aatatttcaa ttatcattat 60 ttttgtagtt ttgtcttgta aaatattaaa taattaatct aaataaatta aatacctcat 120 tagtcattga cacacatttt tttcttagtt ttaacgtata ataatataat attgtatttt 180 tttgctgtac ctattatttg atgtaaaaat gtaaaataaa taaggaatag ataggtatat 240 tgtatatgta ccatgtatgg tatattgtat atgtaccatg tacgtggtaa cttatatagt 300 tatatgttaa aaatcagatc agagttttct tgatcgagaa tccttatgac atgtatttgt 360 ataattgtat ataaaaaaaa cataataaaa ataaaaatat aaaaatggaa tataaattta 420 aaaaaaaacg cttgggtata acgtttatca tataacgtgt tatcgtataa tgtaataatt 480 atgttatgtt atgaagatca ttgtttctta ttaatgttaa catgtattat aattatataa 540 aacaaaaaac ataataaata ttaaaaagaa acagtaaaag gaaaccagta cttaggtata 600 tattataaag atcatagttt tcttgctaac gattcacatg aaataatata aattataatt 660 ataatttata tataataaaa atataaaaca aaataatatt taaaaaataa aaatgaaata 720 attaataaaa atccaatatt tcacaaaaca ttgaaatatt tcacaactta gtgaaatatt 780 tctcatactt caaacacta 799 // ID TCB1 repbase; DNA; INV; 1616 BP. XX AC X54217; XX DT 15-SEP-2004 (Rel. 9.08, Created) DT 15-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Caenorhabditis briggsae TCb1(#5) repetitive element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Cb000450; DNA; KW TCB1; mariner/Tc1 superfamily; transposase. XX OS Caenorhabditis briggsae OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-811 RA Harris J.L., Prasad S. and Rose M.A.; RT "Isolation and sequence analysis of Caenorhabditis briggsae RT repetitive elements related to the Caenorhabditis elegans RT transposon Tc1."; RL J. Mol. Evol 30(4), 359-369 (1990). XX RN [2] RP 1-811 RA Harris L.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (19-NOV-1991). Harris L., RL Agriculture Canada, Plant Research Centre, Bld. 21, Central RL Experimental Farm, Ottawa, Ontario Canada K1A 0C6. XX RN [3] RP 1-1616 RA Pavlicek A. and Jurka J.; RT "TCB1 - a family of autonomous mariner/Tc1-like transposons."; RL Direct Submission to Repbase Update (AUG-2004). XX DR [2] (Consensus) XX CC The TCB1 family is young and copies are 99% identical to the CC consensus. CC TCB1 contains ~80bp-long TIRs and is flanked by TA target CC duplications. CC TCB1_ORF: 529-1347 (273 aa) CC MDRNILRACREDPRRTSTDIQLSVTSPNEPVPSRRTIRRRLQVAGLHGRRPVKKPLVSLKNRKARVEWAK CC QHLSWGPREWANHIWSDESKFNMFGTDGIQWIRRPIGSRYAPQYQCPTVKHGGGSVMVWGCFSDTSMGPL CC KRIVGTMDRYVYEDILENTMRPWARANLGRSWVFQQDNDPKHTSGHVANWFRRHRVNLLEWPSQSPDLNP CC IEHMWEELERRLKGVRASNANQKFAQLEAAWKSIPMTVVQTLLESMPRRCKAVIDAKGYPTKY. XX SQ Sequence 1616 BP; 478 A; 341 C; 318 G; 479 T; 0 other; cagtactggc cataaagaat gcgacaactt gttttttgaa gataactttt tgaaaactca 60 acttttcaat ccgaattttt ggtaagattt tttcagcact atcaaaaact tttaggctaa 120 gttggtttta atattctgct acaatttttt tgaaaattaa tttttttcga aaattcatga 180 aaatgacatt tttggagatc tatacaaaga agctcaagaa acaccggagg tcaaaaacaa 240 aataaaggta acaataaaaa gtgatttaat atttcgttgg gtatcctttc gcatcgataa 300 cagccttgca tctacgtggc atcgactcca ggagcgtctg aaccaccgtc atcgggatac 360 tcttccaagc agcttcgagt tgagcaaact tttgattggc attggatgct ctgactcctt 420 tgaggcggcg ttccagctcc tcccacatat gctcgatggg attcaagtct ggagattgac 480 ttggccattc taggaggttc acacggtgac gtctgaacca attggcgaca tgacccgaag 540 tatgcttcgg gtcattatcc tgttggaaca cccacgatcg gcccaaattt gctcttgccc 600 atggtctcat tgtgttctcc aggatgtctt cgtacacata tcgatccatg gttccaacga 660 ttctcttcaa tggtcccata gaagtgtcgg agaagcatcc ccaaaccatc acagatccac 720 ctccatgttt cacagttgga cattggtact gtggagcata cctggagcca atgggacgtc 780 gaatccactg aataccatca gttccgaaca tattgaactt cgattcatcg ctccagatgt 840 gatttgccca ctcacggggg ccccaggaca agtgctgttt agcccattca acgcgagctt 900 ttcggttttt caaactgacg agtggttttt tgactggtct tcgtccgtgc agtccagcaa 960 cttgcaaacg tcttctaata gttcttctcg atggtaccgg ttcatttgga gacgtcacag 1020 aaagttgaat atccgtagat gtgcgtctag gatcttctcg gcatgcgcgc aaaatgttcc 1080 gatccatatt tctggaagtg gttcggggtc ttcctggaga ttggcgatga acaacgccat 1140 tctaaattgt taaatttagt tgagaaagct gattgaaacc tcacatcatc tcgatttttc 1200 ttcagaatac ggtaaatttg gctggaagaa caactaaact gagttgcaag tatcttcgga 1260 tgcgcgccga gttcatgacc acgcacaatg ctttttcgct cgtccacaga gaaggatttt 1320 ttcccgaatg gtctaaccat atctgaaaaa caataaaaca ataactaaaa ctaattccaa 1380 ttagttgtct ttcaacttta tttaaaaaaa ccaaaaatca tgaaaaatat gcaaaatgaa 1440 aaaaaaaaag aaaacaattg aaaaaaaatt ttgtttgcaa gatgatgcaa tttccaagta 1500 cgcaagtttt cgtaactttc caatgtgtct acagtaaaaa attcgagttg aaaagttgag 1560 ttttcaaaaa gttatcgtca aaaaacaagt tgtcgcattc tttatggcca gtactg 1616 // ID CR1_Ele42 repbase; DNA; INV; 5865 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele42. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5865 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5865 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 10 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 605..1468 FT /product="CR1_Ele42_1p" FT /translation="MVCKRCQKKVSGDDGVVCRGYCGASFHAFCVNVDEPL FT RKQLGQYRNNVFWMCDGCANLFGNAHFRALMTGFDEKSSMVPSAIQSMQCE FT IEKLHASVKTLSAKVDGMPTTPTPFSTPNPWPAIHRINRFTKSAKRFRDTD FT GNPVNVEDGSMKMGTKTTNPCSTICLDSRQEDELIWIYLSAFHPNTTESQI FT SSFVTECLELTANAHLKVIKLVPKGKDLNTLNYVSFKIGLSVQFKEKAFSC FT ESWPENIRFRQFEDNRAKNLPRVISLSSTVQQGITVPPPMDSSSLDI" FT CDS 1510..5754 FT /product="CR1_Ele42_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MGTPSDPVTVEPNHHATSNASVFASCHLRRPGPVYGG FT LEGVSQPAFSGKYAYHNGHSLGDISNNSSSSENDDPPANLHQNSFSTSQFH FT LPGRNGEGLMEAPGPSDPVEQSTASVLHNRPGSVISCGAGVFQSVSSGCLT FT RHSHYGPKLINPSSSSFMPSSLTSSKAPGCTPVSLMEAPYPLVTVEPFLPA FT TSSHPGPVYECGDGVFQTTTAGKYSHNMNILPPVTSVASSCSSSTSSACGL FT PITDILDVSISAKWTPGRTLAASHKEAPIPLSVVEPPLPATSSRPGPTSEL FT GDGVFRTQISGKYDDFMSFPYPEPIVASSPSTIFAPSNAILSQQPTSIVRS FT CPDENIQSHNLDNRIPSSQPPLSANWTPGRILAASLEEAPIPLSVVEPPPP FT ATSSRPGPTSELGDGVFRTSTSGKYDLHPVFPEPESISNSSTSNLVLPPEA FT TSRNSTSAPASSDVLLYYQNVGGINTSLAEYQAAVSDGCYDVYALTETWLN FT GNTLTCQLFDASYSVYRQDRSSLSSNKSTGGGVLLAVRSNFKSRSLSPPNN FT ALVEQLWVAISTVDGTLFICVVYIPPDRVNDAALIDKHVESLSWITSQLGA FT RDKVIILGDFNMGSISWQRHSLGFYYPIASRSSISQTSRDLLDAYSTAGLK FT QAIGIENEHNRMLDLCFVSDQLLESCTIIQAPSPLVKLCRHHPPILMNFQI FT YPIRHFHNTAESISYDFNKANFNGMNAFLAHVDWDEILRDYDANLAASTVS FT GILLYAIDQFVPVKSRPEAAKPPWSNAHLKNLKRVKRAALRRHSKFRTDTT FT RASYMKANQDYQQLNDHLYNAYQNRMQRQLRTNPKSFWRYVNDQRKESGLP FT STMNDGSSVADSIGGIADMFRSQFSRVFTNEHLDSHDVALAIRNVPRLPGS FT GQLCIINDDMVIAASKEIKSSTGYGPDGIPSLIIKRCVDTLAKPLTAVFNL FT SLSTGVFPNCWKHSYVFPVFKKGCKSSVSNYRGIAALSSISKLFEVIVLRQ FT LVQSYAHYISEDQHGFMPKRSTTTNLTCFISFVTRQIESGQQVDAIYTDLS FT AAFDKMNHQIALAKFDKLGMNNTLLTWLRSYLTGRSMSVKIADHVSLPFPA FT WSGVPQGSHLGPFLFLLYMNDVNFVLKCLKLSYADDVKLYYTIKQPQDTVF FT LQRQLETFAEWCRLNRMLLNTSKCSVISFGRKQSMIPFAYALSGEQLQRES FT TIKDLGIMIDSKLTFKDHVSYIVSKASAQLGFVFRFAKKIKDIYCLKSLYC FT SIVRPILEYSSVVWSPYYQNEIQRIETVQRKFIRFALRHLRWRDPLNLPSY FT SSRCQLINLDSLEARRNVAKACFIGDLLQGNMDCPALLNMLDINTRRRNLR FT THSFFNLSVTRTNYSLHEPVRSMSRLFNQCYFVFDFNVSRERNKCNFRRVL FT C" XX SQ Sequence 5865 BP; 1516 A; 1467 C; 1219 G; 1662 T; 1 other; ggaattattt gaaatgttct caatatattg actcgaataa cccctcgagg agcttttgga 60 tctattcttt ggaataaccg tagacaaaat ggcgttgaaa ttcttgtcaa gttttcttgg 120 gagaaattct gtgggaaact ttggatatct ctttgattaa cgactgtaaa attcggttgg 180 atagttagtg aaaaattatc tggaggaagt cctaagatag tctttgttca aagtactgcc 240 tgagaaaaat ttctagaggt attaacgttg aaatgtcttc ggccctgtgt ggtttgtgta 300 tagtttgctt tatatatcgc acgtttcttt atcgctgtta attgtgtaat tggccgatta 360 taaataccca gaaaagtttg cgccgtctcg tattgctacc cgccgcgttt ttagcataaa 420 gttgtccgtt aagcttcgtt tgtttcgtga tatttcacgg attttgtata cactttgcta 480 caacaagcgg tggtgattgt ttttactcgc cttctcgcct acgtcatcgt ttctgccaag 540 atctgctttg ctaaacttac accgtatcag tcaaattgct tgtagcctac ctaaaggcgc 600 tatcatggtt tgcaagcgtt gccaaaagaa agtatccggc gatgatggtg tcgtctgtcg 660 tggctattgc ggtgcatcgt tccacgcttt ttgcgtaaac gttgatgaac ccctccgaaa 720 gcagctcggg caatatcgaa acaacgtttt ttggatgtgt gatggatgcg cgaacttatt 780 tggaaatgca cattttcgtg ctttgatgac cggatttgac gaaaaatctt ctatggtgcc 840 ttctgccata cagtctatgc aatgcgaaat tgaaaagctc cacgccagtg tgaagacctt 900 gtcggctaaa gtcgatggga tgccaactac tcccacacca ttctccacac ccaatccctg 960 gcccgcaatt catcgcataa accgcttcac aaaatcagct aaacgcttca gggacactga 1020 tgggaacccc gttaacgttg aagatggctc aatgaaaatg ggaacgaaaa cgacaaatcc 1080 ttgttcaacc atttgtcttg attcacgcca agaagatgag ttaatttgga tctacttgtc 1140 cgcgttccat ccgaatacaa ctgaaagcca aatttcctcg tttgtcacgg agtgcttaga 1200 actgacagcc aacgctcatc tcaaagtgat taaactggtt ccaaaaggaa aggatttgaa 1260 cacgcttaat tacgtgtcct ttaaaattgg gttgagcgtc caattcaagg aaaaagcttt 1320 ttcgtgtgaa tcgtggcctg aaaacatacg gtttcggcaa tttgaggata atcgagcaaa 1380 aaacttaccg agagttatca gcctatcttc aacggtacaa cagggaataa ctgttccgcc 1440 tcccatggat tcttcgagct tggacatcta gggatcagca tctcaccagg acgcacgcaa 1500 cgaagcacga tgggaactcc ttctgacccc gtaacagtcg agccaaatca ccacgccacc 1560 tcgaatgcct ccgtttttgc atcctgccat cttcgtcgtc ctggtcctgt ttacggcggt 1620 ttggaaggag tctcccagcc tgcattttca ggcaagtatg cgtatcacaa cggtcattca 1680 ctcggtgaca tttcaaacaa ttctagctct tccgaaaatg acgatccgcc ggcgaatttg 1740 caccagaatt ctttttcaac gtcccagttc catctaccag gccgcaatgg ggaaggcctt 1800 atggaagctc ccggcccgtc tgacccagtt gagcaaagca ccgcctccgt tcttcacaat 1860 cgtcccggct ctgtgattag ttgtggtgcg ggagtcttcc aatctgtttc ctcaggatgt 1920 ctcacccgtc actcccacta tggacctaag ctaatcaacc caagtagttc gtcattcatg 1980 ccttcatcgc taacttcatc caaagccccg ggatgcacgc ctgttagcct catggaagcc 2040 ccttatcccc tcgtcacggt cgagcccttc ctgccagcga cctccagcca tcccggtccc 2100 gtgtacgagt gtggagatgg ggtcttccaa accactactg caggcaagta ttcacacaat 2160 atgaacatac ttcctccggt aacatctgtc gcttccagct gttcatcttc aactagttca 2220 gcgtgtggtt tgcctatcac tgacattttg gacgtttcga tttccgcaaa gtggacaccg 2280 ggacgcacac ttgccgctag ccacaaggaa gcccctattc cactcagcgt agtcgagcct 2340 ccgctgccag cgaccagcag ccgtcccggt cctacgtctg agctgggaga cggggtcttc 2400 cgaactcaaa tttccggcaa gtacgatgat tttatgagct ttccatatcc tgaaccgatt 2460 gttgcttcta gtccatctac aatatttgct ccatccaacg ctattctgtc acaacaacct 2520 acgagcatag tgcgctcatg tcccgacgag aacattcagt cccataatct tgataatcga 2580 attccatcaa gccaacctcc tctttctgct aactggacac cgggacgcat acttgccgca 2640 agcctcgagg aagcccctat tccactcagc gtagtcgagc ccccgccgcc agcgaccagc 2700 agtcgtcccg gtcctacgtc tgagctggga gacggggtct tccgaacctc cacctccggc 2760 aagtacgatt tacatccggt atttcctgag cctgaaagta tctccaattc cagtacatca 2820 aatttggtgc tgccaccaga agcaacttct cgaaatagta catcagcacc agcttcgtcc 2880 gatgtcctgt tgtactatca aaacgttggc gggattaata cttccctcgc tgaataccaa 2940 gcggccgtga gtgatggttg ctacgacgtt tacgccctta ccgaaacctg gctcaacggt 3000 aatactctga catgtcaact gtttgacgcc tcatattccg tttatcgaca agaccgatca 3060 tctttaagca gcaacaaaag taccgggggt ggtgttttgc tagctgtccg ctctaatttt 3120 aaatcgcgtt cgcttagtcc tccgaacaat gctttggtgg agcaattatg ggtcgccatt 3180 tctactgtcg atggaacatt attcatctgc gttgtctaca tcccgccaga ccgcgttaac 3240 gatgcagcat taattgacaa acacgtcgag tcgcttagct ggataacttc tcaacttgga 3300 gctagggaca aagtgatcat cctgggtgac tttaacatgg gctcaatctc ttggcagcgt 3360 cattctcttg gcttctacta cccgattgcc tcaagatctt cgattagcca gacgtctcgt 3420 gatctgcttg atgcatacag cactgctgga ctcaagcaag ccattgggat tgaaaatgaa 3480 cacaatcgga tgctcgattt gtgttttgtt agcgaccagt tattggagag ctgcacgatt 3540 atacaagctc catcgcctct cgtcaaactc tgcagacatc atccccccat tctgatgaat 3600 ttccaaatat atccgattcg tcacttccac aacactgccg aaagcatctc ttacgatttc 3660 aacaaggcta atttcaacgg tatgaatgct tttcttgcgc acgttgattg ggacgagatt 3720 cttcgggact acgatgccaa tcttgcagca tcgacagtgt ctggaatttt gctgtatgct 3780 atcgaccagt tcgttcccgt waaatccaga cctgaagctg caaaaccacc ctggtctaat 3840 gctcatctca agaacctgaa aagagtgaag agggctgcac ttcgacgaca cagcaagttt 3900 cgtacggaca ccacgagagc tagctatatg aaggcgaacc aagactatca gcaactcaac 3960 gaccatttgt acaacgccta ccaaaaccgt atgcaacgtc aactaaggac taatcccaaa 4020 agtttttggc ggtacgtcaa tgaccaacgg aaagaatccg gattaccctc aacaatgaac 4080 gacggttcat cggtcgccga ctccatcggt gggattgccg acatgtttcg ttcacaattc 4140 agtcgtgttt tcaccaacga acacctcgat tcacatgatg ttgcccttgc tatcaggaac 4200 gttccacgtc ttccaggttc tgggcaactg tgcatcataa acgatgatat ggttatcgca 4260 gctagtaagg agattaaatc gtctacggga tatggaccgg atggtatacc atcacttatc 4320 ataaaacggt gtgtggacac gcttgcaaaa ccgttaactg cagtattcaa tctttccttg 4380 tcaaccggtg tcttcccaaa ctgctggaaa cattcatacg tattcccagt tttcaaaaaa 4440 ggctgcaaaa gttcggtttc aaattaccga ggtatagccg ctttgagctc aatctctaag 4500 ctatttgagg tgattgtgct acgtcagtta gttcagagct acgcacacta catatcagaa 4560 gaccagcacg ggttcatgcc caaacgttcg acgactacta acttgacatg tttcatttct 4620 ttcgtgactc gccaaattga aagtggccaa caagttgacg ccatttatac ggacctttcc 4680 gcggcatttg ataaaatgaa tcaccaaatc gccttagcta aattcgacaa acttggcatg 4740 aataatactt tactcacttg gcttcgatcc tatttaactg gtcgtagcat gtcggtaaaa 4800 attgcagatc atgtttcgtt gccctttcct gcttggtccg gcgttccaca aggcagtcac 4860 cttggcccat tcttgttcct tttgtatatg aacgatgtca atttcgtcct caaatgtttg 4920 aagttgtcat atgctgatga tgtaaaactt tactacacga taaagcagcc acaagatact 4980 gtgttcctgc aacgacaact cgaaacattt gctgaatggt gccgcttaaa tcgaatgttg 5040 ctgaacactt cgaaatgttc ggtaatttca tttgggcgca aacaatctat gatcccgttt 5100 gcctacgcct tgtcaggtga gcaactgcaa cgcgaatcga ctatcaagga tttgggaatt 5160 atgatagatt ccaagctgac ttttaaggac catgtttcat acattgtgtc gaaggcttct 5220 gcacagctag gtttcgtgtt tcgattcgcc aaaaaaatca aggatatcta ctgtctgaaa 5280 tcattatatt gctcaatagt acgccctatt ttagagtatt cttcggttgt ctggtcaccg 5340 tattatcaaa atgaaattca gcgcatcgaa actgttcaac gaaaattcat ccgtttcgct 5400 ttacgtcatc tcagatggag ggacccgttg aacctgccta gttatagcag ccgctgtcaa 5460 ttgattaacc tcgactctct cgaagcaagg cgcaacgttg ccaaggcctg ttttattggc 5520 gatcttctgc aaggcaacat ggattgccct gcgttgctca acatgctaga catcaataca 5580 cggcgccgca atctccgaac gcactcgttt ttcaaccttt ctgtcactcg tactaattat 5640 agccttcacg aaccagtccg aagcatgtcg cgtttattca atcagtgtta ttttgtattt 5700 gactttaatg tatcgcgtga aagaaataag tgtaatttta gacgtgtctt atgttagtat 5760 aaatttgact tcatcacttt atgtcttagt tttaagaatt ttgtcattgg ggtgaccatt 5820 ttacctgttg actaataata aataaataag taaataataa ataaa 5865 // ID CR1-2_BF repbase; DNA; INV; 4050 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-2_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4050 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4050 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1629-1629 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 318..3833 FT /product="CR1-2_BF_1p" FT /note="PHD finger, APE, and RT domains." FT /translation="MSLTISRSVVFLLLVLNSIQASTVGYGPSKVRRTGTP FT CLLFTGGNAPPSWYPVLSLNNNEPTPWLKINFGTTGILLKHAKILRTSDRL FT CLLYLCSVLMSQAVDLETNPGPRPPKYPCGSCGKAVTFKHKGVCCDRCDCW FT FHHDCQGLSSYMYPYLGNSNVSWICLNCGLPNLSTTFFDSFNIESDSNPFS FT PLSDSSPGMPLAASSPKSLTQNKARPTTRPIRLVNANFQSLRNKKLELETL FT VEETKPDILVITETWLDDSCNISEYFPTHLNMSVFYRNRPDDNHGGVLIAV FT TNEFICSQEPQLETDCEMVWVKIQLAGSKSLYVCAYYRPHVDDSNSLDNLE FT QSMNRICNKRNSHIWIVGDFNFPGWDWSDPQHPVLKPGCAYPGLHRQFLEL FT LSDQNLSQVVDRPTRNDNTLDLVLMSNDNCVNRINTLPPVGDHDIVFVEAD FT IHPRKIKAKPRKILQYKRTNWEKFRDEMEDFQTEFLDLAQGDASVDDLYNA FT FSSKLSQCVDKYVPSKMVGAKRHLPYVTPDLKRLMRKRDRYYRKTLGQTSH FT ERKDKLNQYKKEIKKKLKECYWQYIESIITDIPVPGDTSDNTNNPPAKQGT FT KKFWSFLKSIKSERSGVSHLRNNGTLVSDSKSKADLLNQQFQSVFTNEPSD FT DPLPDLGPSPHVSMEDIQIEVSGVDKLLGNLNIHKASGPDQVHAHVLKELS FT SVLSPILQVIFQRSLDTGIVPEAWKEANIAPVYKKGDRSKPENYRPISLTS FT ICCKIMEHIIASTMMTHFQANNILYDLQHGFSHGKSCETQLLSLTDDLAVN FT RNNGIQTDLVFMDFAKAFDKVPHRRLLHKLQYYGVRGNTLVWIQNFLLGRS FT QTVVLDGERSDPMPVTSGVPQGTVLGPILFLAYINDLPEHAANAKVRLFAD FT DCILQMPIHDSSDCNKLQEDIDNICLWEQRWLMEFNPSKCEVMTIPSSKTP FT ITHTYSLHNHPLNKVSTTKYLGVTISSNLTWGSHVDAITSKANKTLGMLRR FT CLRVASTAAKERAYKALVRPHLEFACSVWDPHTQDQVRKVEMVQRRAARYV FT TNNYQRDSCSVTALINQLGWQSLQSRRKNARLITMYKIIHNNISVPHASKL FT VLATRCSRRTNHAFKLQAIASKSNFYRLSFFPRTIQEWNELEPGVAEAGSL FT SQFKTELGRALLH" XX SQ Sequence 4050 BP; 1114 A; 1080 C; 864 G; 992 T; 0 other; cgaaagatgc gcatgcccgt cgccgccgca cgcttcacca tggctctccc tcttcctttc 60 ccctttttca ctagtaacta attctcccca gccccaattg tctcccccgg gtgtctctga 120 ccggttacaa acatctggaa gtgtgttggg tttgaaataa agcgtgacag atagtttagt 180 tctcaaaact caccgcccga ttctcagggg tgggagggcg agctcacgca cgtccatctt 240 tgacagcgac tcgccacctt cggtgcccgt tggggcccac tcacgtaagt gcaaaccctc 300 cagcgtctcc tgccgtcatg tcgctgacga tctccaggtc agttgtcttc ttattacttg 360 tacttaatag tattcaggcc tcgactgtcg gatatggccc aagtaaggta cgtaggactg 420 ggacgccatg tttgttgttt accggcggca acgcgccgcc atcctggtat ccagttctga 480 gtttgaataa caatgaaccc actccctggc tcaaaatcaa cttcggtacg actggcattc 540 tattgaaaca tgcaaaaatc cttagaacat ctgatagact ctgcctatta tatctctgct 600 ccgtcctcat gtcccaagcc gtcgacctgg aaacaaatcc cggtccccgg ccacctaagt 660 acccatgtgg gtcatgtgga aaggcagtga cctttaaaca caaaggagtt tgctgtgaca 720 ggtgtgactg ttggttccac catgattgcc agggtcttag ttcatacatg tacccatacc 780 taggtaactc caacgtgtcc tggatctgtt taaattgtgg cctacccaat ctttcgacca 840 cattttttga ctccttcaac atcgagtctg actcaaaccc attcagccct ctaagtgata 900 gttccccagg tatgccactg gctgcctcct ctcctaaatc cctgactcag aataaagcaa 960 gaccaactac caggcctata cgcctggtga atgctaactt ccagtcactc aggaataaaa 1020 agttggaact tgaaacattg gtagaggaaa ctaaacctga catcctagtt ataactgaga 1080 cctggctaga tgactcttgt aacatctcag agtactttcc cacccatttg aacatgtctg 1140 tgttctatag gaacagaccc gatgataatc atgggggtgt gctaatagct gtaactaatg 1200 aatttatatg ctctcaagag ccacaactag aaacagactg cgaaatggtc tgggttaaga 1260 ttcaacttgc tgggtccaaa tccttatatg tctgtgctta ctatagacca catgttgatg 1320 acagtaatag cctggataat cttgaacagt ctatgaaccg catatgtaat aaacggaaca 1380 gccatatatg gattgttgga gactttaact tccctggatg ggattggtct gatcctcagc 1440 acccagtcct caagcccggc tgtgcctacc cgggcctgca caggcagttt ctggagctgc 1500 tgagtgatca gaacctgtca caggttgttg ataggcccac taggaatgac aatactttgg 1560 atcttgttct catgtccaat gataattgtg taaacagaat taacacctta cctcctgtag 1620 gagaccacga catagtcttt gtagaggcag acatccatcc tcggaaaata aaagccaaac 1680 ctcgcaaaat cctacaatat aagcgtacta actgggagaa gttccgggat gagatggaag 1740 atttccaaac tgaattcctt gacttggccc agggggacgc ttctgttgat gatctgtata 1800 atgctttttc atccaaatta tctcaatgtg ttgacaaata tgttccctcc aaaatggtag 1860 gcgccaaacg ccacttacca tatgttaccc ctgacctcaa acggttaatg aggaagaggg 1920 acagatacta cagaaagacc ctaggtcaaa cttcacatga gagaaaagac aagttgaacc 1980 agtacaaaaa agaaatcaag aaaaagttaa aggaatgcta ctggcagtac attgaatcta 2040 ttataactga catacccgta ccaggagata ctagtgataa caccaataac ccccctgcaa 2100 agcaaggcac caagaagttc tggagttttc ttaaatccat taagtcagaa aggtctggtg 2160 tttcccatct taggaacaat ggcaccttgg tgtcagacag taagtctaaa gctgacctac 2220 tcaatcaaca gtttcaatct gttttcacca atgagccctc agatgacccc cttccagact 2280 taggacctag tcctcatgtc tctatggagg acatccaaat agaggtgagt ggtgttgaca 2340 aactgttggg gaatttaaac atccataaag cctctgggcc agatcaagtt catgcccatg 2400 tcctgaagga actcagctca gtcctcagtc ccatccttca ggtcatattc cagagaagcc 2460 tagacaccgg cattgtgccc gaggcttgga aagaggctaa catagcccca gtgtacaaga 2520 aaggtgaccg ctccaaacca gaaaactacc gcccaatctc actcacatcc atctgctgta 2580 agataatgga acacatcatt gctagcacca tgatgacaca ctttcaggct aacaacattt 2640 tgtatgacct ccagcacggt ttcagccatg gcaagtcatg tgagacccaa ttgctctcac 2700 taactgatga cttggccgtc aacaggaaca acggaatcca gacagacctc gtttttatgg 2760 acttcgctaa agcgtttgat aaagtcccac accgtagact gctacacaaa ctacagtact 2820 acggtgtccg agggaacacc cttgtctgga tacagaattt cctgctgggc cggtcccaaa 2880 cggttgtgct tgatggagag cgttcggacc ctatgccagt gacctcgggc gttccccagg 2940 gtacagtgct ggggcccatc ctcttcctag catacataaa cgatctccca gaacacgctg 3000 caaatgcaaa ggtgaggctt ttcgccgatg attgcattct gcagatgccc atccatgaca 3060 gctctgattg caataagcta caggaagaca tagataacat ctgcctctgg gaacaaaggt 3120 ggctgatgga gttcaatcca tctaagtgtg aggttatgac cataccatcg tccaaaacac 3180 ccatcaccca cacatacagc cttcacaacc atccactaaa caaggtaagt accaccaaat 3240 accttggggt caccatctct tccaacctca catggggcag ccacgtcgac gccataacat 3300 caaaggccaa caaaacactg ggcatgctca gacgctgcct gcgggttgcc agcactgcgg 3360 caaaggagag ggcctacaag gccctggtaa gaccccacct ggagtttgcc tgcagtgtgt 3420 gggatccgca cacccaagat caggtgagga aggtcgagat ggtccagcga cgagcggccc 3480 ggtatgtaac aaacaactac caacgagact catgctctgt cactgcttta atcaatcaac 3540 tgggctggca gtccttgcaa tccagaagga agaacgccag actcattacc atgtacaaaa 3600 tcatccataa caacatctct gtcccacatg catccaaact ggtcctggcg actcgatgct 3660 cccgccgcac caaccacgct ttcaagctgc aggccatcgc gagcaaatcc aacttctata 3720 ggctctcgtt cttccctcgt accatccagg agtggaatga acttgagcct ggtgtggcgg 3780 aggcggggtc tctctcccag ttcaaaactg aactggggag ggccctgctg cattgagtcg 3840 cccctacacg tctttgtata tatgtaaata acactaaccc accgcactca cttgtcttgt 3900 cccttgcacc catcagttgt ttatgtcctt ttgtgttttg cctttatttt ccctgtgttt 3960 taaggcacta tcacccgtaa ctaacaaaac atgcgcactg ctattaatcc ccgtcaaggg 4020 ggctgagcag tacaaaagaa gaagaagaag 4050 // ID Kiri-3_AAe repbase; DNA; INV; 4207 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 06-JAN-2011 (Rel. 16.02, Last updated, Version 3) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; L2_Ele4; KW Kiri-3_AAe. XX NM Kiritsubo-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4207 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4207 RA Kojima K.K. and Jurka J.; RT "A distinct group of non-LTR retrotransposons from the yellow RT fever mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as L2_Ele4. CC [2] Consensus update and re-classification. This consensus is CC generated from 20 sequences with >97% identity, and ~98% CC identical to the original sequence in [1]. This family does not CC belong to the L2 clade and is renamed as Kiri. It could CC constitute a new clade with other Kiri elements. XX FH Key Location/Qualifiers FT CDS 279..1046 FT /product="Kiri-3_AAe_1p" FT /translation="MCTGMTSNTMDSIDGQLKPGSHLSTLDGLLKKVLHLC FT EIECSTIRSKFQDSVSDIRNQISIVQNEITSLQTVYLQNVKKLAASVSAIE FT KDLEKSRKERIDRSNDLIVSGIPYVPNENLNDMVQNIAQHLGYENNNLPIV FT YATRLPKRTRKYRETTPVLIQFVTNLSREQFFRRYLAKRDLSLQHVGFNSR FT QRVYINENLSTVDRQIWRYACKLRNDGKLVRVHLKNGIVLVKRSNDNHSTP FT IHSMKQLFVFMKFES" FT CDS 1195..3645 FT /product="Kiri-3_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSDNMLNNVSSNEGIYHIPRAVMNAALRKDFLNICHM FT NVQSICSRQLSKFNEFTSCFHNGKVDLICLTETWLTNDISDDTIAVEGFKL FT IRNDRNYGRGGGVCLYFRSNLNCKKVSASDLFVGLGDACNTEFLFVEVSTG FT ADKFLLGIFYNPPRVDCCELIFQQLSDHILKYKNIVLIGDFNMNVKKDSPI FT VNRFCSVFDHFGFTCVGNEPTHYYSGGSSQIDLMFTNDPEFILNFNQVSGP FT GFSAHDIIFSSLNVARSCFSSAVFFRDYKHINFESLNHTLTTIDWSALYSI FT NDSNLALDYFNQVVLSLFDHFVPLRSSKLXRHNKWFNNDISRAMVERDLAY FT RVWKRTKGQQERDQYKRLRNRVTMLINQAKSVYISRSSELSRSNNDLWRRL FT KEINVKKSGAGMLFNNSCDEINNYFASNFTVDNYFPRLPPPNENGFRFVTT FT NEYEIDKMINSITSDAVGLDGVPLKFIKLILPFVITPITHVFNTIILTCNY FT PRAWKTAKILPIRKKSRGCDLKNLRPISILSALSKAFEKILKNQMLEFLNN FT FNLLTDVQSGFRAGHNTTSALLKVHDDVHRTIDRKGIAFLLLIDFSKAFDR FT VSHYQLLRKLSDQFNFSAPAVNLLQSYLTDRSQQVSIDANLSEIAHIVSGV FT PQGSVLGPLLFTLFINDLPSVLKYCSIHMFADDVQIYFHSCNLSSSEMAHL FT INEDLSNIYDWSCRNTLPINASKTKIMHISRARQSQSLPVISLGGETLSYV FT NHVSNLGVIFQHDLEWDSHINTQCGKIYGGLKHLKLTANMVSVEIKLRLFK FT SLILPHFLYGIELLQ" XX SQ Sequence 4207 BP; 1280 A; 771 C; 734 G; 1417 T; 5 other; tttagcgatg attttgtagt gctaactaat gtgatcgact gttgtttatt catattatgc 60 tgtttgaggt cccgatttat gtgctggtta tggagtttga tgattattgw tatcccagtg 120 ctctcttctg gtgctttaat attcttatca attcgactgt gaacgccatc atagcgacgt 180 ttctcacgtt catcattgtc tgctgagaga aaggtaactc acgggtttgt tctcgctctc 240 gccctttgta ctattatcac gcgcctaaag caattgatat gtgtactggc atgacctcga 300 atactatgga ttctatcgac ggccagctga agccgggctc ccatctttca acgttggacg 360 gtttgctgaa aaaagttctt catctttgtg aaatagagtg ctccaccatt agatctaaat 420 ttcaagatag cgtttctgac atcagaaatc aaatttcaat wgtgcaaaat gaaatcacgt 480 ctttacaaac tgtttattta caaaatgtga aaaaactagc ggcttctgtt tcagcaattg 540 aaaaagatct ggagaaatct aggaaagaaa gaatcgacag gtctaacgat cttattgttt 600 ctggtatccc atatgtgccc aatgaaaact taaatgatat ggttcaaaat attgcacaac 660 atctcggata tgaaaacaac aatcttccaa ttgtttatgc tactcgatta ccaaaaagaa 720 ctaggaaata tagggaaacc acacccgtgc tgattcaatt tgttaccaat ttatccagag 780 agcaattttt tcgacgttat ttagcaaagc gagatttatc actgcaacac gtaggcttca 840 attctagaca gcgtgtatat atcaatgaaa atctaagcac agtagatcga caaatttggc 900 gctatgcatg caaacttaga aatgatggaa agctggttag ggtacacttg aaaaatggta 960 ttgttctagt taaacgctct aatgataatc attcaactcc gatacattct atgaaacagt 1020 tatttgtgtt tatgaaattt gaatcttaaa tcatgaaatc atcgtttcag ccaccagctt 1080 accattcgaa aataataata attattaatt gtacttatat catgaattca tctgtaatcc 1140 ttctgttgat cctctttgcg ctctgcgttc cgcgctcata ctactattac cacgatgtca 1200 gataacatgc taaacaacgt cagttcgaat gaaggtattt atcatattcc tcgtgctgtc 1260 atgaatgcgg ctcttcgtaa agatttccta aatatttgcc acatgaatgt gcaaagtatt 1320 tgttcacggc aacttagtaa gttcaatgag ttcacttcct gttttcacaa cggaaaagtt 1380 gatttgattt gtttgactga gacatggttg actaacgata tttctgacga tactatcgct 1440 gtagaaggtt tcaaattgat tagaaatgat cgcaactatg gcagaggggg aggtgtttgt 1500 ctttatttta gaagtaatct aaactgcaaa aaagtttctg cttcagattt atttgtgggg 1560 ctgggtgatg cctgtaatac tgaatttttg tttgtcgaag tatcaacagg ggctgacaag 1620 tttttgcttg gaattttcta caatcctcca cgtgtcgact gctgtgaact tatttttcaa 1680 cagttgagtg atcatatctt aaagtacaaa aatattgttt taataggcga tttcaacatg 1740 aacgttaaga aggattctcc aatagtcaat cgtttttgta gtgtgtttga tcatttcggg 1800 tttacatgcg ttggaaatga gccaactcat tattattctg gcggaagttc tcaaatagat 1860 ttgatgttca caaacgatcc tgaatttatt ctcaatttca accaagtgtc cggacctgga 1920 ttctcggctc atgatataat attttcttcc ttaaacgtag ctcgatcatg tttcagtagc 1980 gcagtatttt ttcgtgatta caaacacata aattttgaat ctctaaatca cactttaact 2040 acaattgatt ggtcagctct ttactctatc aatgattcta atttagcttt agactatttc 2100 aatcaagtag ttttatcact ttttgatcat ttcgtacctc ttcgttctag taagcttktc 2160 agacataata aatggttcaa caatgacatt tcwagagcaa tggtagagcg agatttagcg 2220 tatcgtgttt ggaagcggac taaaggtcaa caagaaagag accaatacaa acgcttacga 2280 aatagagtaa caatgctaat caatcaagct aaatccgttt atatttctcg ttcatctgaa 2340 ttatctcgtt caaataatga cttgtggaga agactgaaag agataaatgt taaaaaatct 2400 ggcgctggta tgcttttcaa caactcttgt gatgagatca acaactactt tgcgtctaac 2460 ttcaccgtcg ataattattt tcctcgatta ccaccaccga acgaaaatgg tttcagattt 2520 gttactacta atgaatacga aatcgataaa atgatcaata gtattacctc ggatgctgtt 2580 ggattagatg gtgtacctct taaattcatt aaattgatct taccgtttgt gattacacca 2640 attactcatg tcttcaatac aatcattttg acttgtaact acccacgagc atggaaaaca 2700 gctaaaattt tgccaattcg taaaaaatct agaggctgcg atttaaaaaa tctaagacct 2760 ataagcattt taagcgcgct ttcaaaggct tttgaaaaaa tactgaaaaa tcaaatgctt 2820 gaatttttga ataattttaa tcttttaacg gatgttcagt caggctttcg tgcggggcac 2880 aatacaacat ctgctctttt aaaagtccat gatgatgttc atcgtacaat tgataggaaa 2940 ggaatcgctt ttttgctttt aattgatttt tctaaagcgt ttgaccgagt ctcccactat 3000 caactgcttc gcaaactttc ggatcaattc aacttctctg ctcccgccgt gaatcttctg 3060 cagtcctatc taacggatcg ctcgcaacag gtttcaattg atgcaaacct ttctgaaata 3120 gctcatattg tctcaggmgt cccgcaagga tcagttttag gccctttact gtttaccctt 3180 tttataaatg acctaccgtc tgtattgaaa tattgttcca tacacatgtt cgctgacgat 3240 gttcaaattt attttcactc ctgcaacctt tcttcgtcag aaatggctca ccttatcaat 3300 gaagatttat caaacatcta cgattggtca tgtcgcaata ctctacctat taatgcttct 3360 aaaactaaaa taatgcatat ttcaagggct cgtcagtccc agtcactacc ggtcatttcc 3420 ttaggaggtg aaactctttc ttatgtaaat catgtttcaa acctaggggt gatatttcaa 3480 catgatctag aatgggattc ccacattaat actcaatgtg gaaaaatcta tggaggttta 3540 aaacacctta agctcacggc taatatggtt tctgttgaaa ttaagcttag attgtttaaa 3600 tcacttattc ttccgcattt tttgtatggt atagagctat tacaataatg cttcagctcg 3660 agccctggac agattaagag ttgccctgaa ctgctgtgtt agatgggtct ataatttaac 3720 aagattttct agtgtttcac gactacagca caatttatta ggctgtaatt tttatgattt 3780 ttttaaacta cgttcttgca taacactctt taagataatc aaaacaacta caccgcatta 3840 tctcttcact aaactgcaac cattccgtag catgcgtact tgtaattttg ttttaccgca 3900 gtatgctaca tctcattatg gtggtacctt cttcgttcgt ggcgttgtgt tttggaacca 3960 actacctcta gaaataagat ctattgatcg attggatagg ttccgaacag agtgcagaga 4020 gtttttcaac agaaggaatt agtattagaa atatgaaaat agcaacgtat aagtgaacag 4080 aattcagaaa ctgttatgtt tagttttaag tgtttttgta actaaatgaa ttcggaatgc 4140 aataattgaa aaaggtgtaa ccttacattg caaattaata aatataaata aataaataaa 4200 taaataa 4207 // ID Gypsy-29_OD-LTR repbase; DNA; INV; 230 BP. XX AC CABV01003676; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_OD_; KW Gypsy-29_OD-I; Gypsy-29_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-230 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003676; Positions 6377 6606. XX SQ Sequence 230 BP; 64 A; 53 C; 42 G; 71 T; 0 other; tgctagactc tcgcccacga tatttatatc tgatcgaggg agactccggt ctcactcgcg 60 cagatatgat cattcgtatt ttctcctctt ccgagcaaca acgtttatta tttctacctt 120 ttctcttccg aatataaaat acagtacgat cttcaactac agtcgtcagt gatattatag 180 agagaagcag gcattcttca gtgggaaagg gattagcaaa gatccttaca 230 // ID BEL-47_AA-I repbase; DNA; INV; 6225 BP. XX AC supercont1.248; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-47_AA_; KW BEL-47_AA-LTR; BEL-47_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.248; Positions 1220400 1214176. XX CC Positions [5232-5810] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 902..4417 FT /product="BEL-47_AA-I_1p" FT /translation="MPGKSKTSFKMLQARLRGLQTSFDNMFSFMEDYTDDT FT RVTEVKVRLDRLDALWDRINENIEEVESHEESTSEPESFVKVRIDFENRFY FT QLKSFLLDKIRENSDETVLNPIIRSHDNTHSAPNPHVRLPQITLPKFSGKI FT DEWQTFRDLYTSLIHWQPDLPEIEKFHYLRSQLEGEALAVIDSLPLTRANY FT AVAWDLICTRYSNSKILRKRQVQALFELPDVKRESAAELHTLLDSYEKIVK FT SLDQATPQSTEFKDLLLLHLLSSRLDSATRRSWEEYSATKETDTVKDLTDF FT LQRRIQILEALPSRPVEQRMEAGSSKFPRKPNVVKVCNTTLQPSAPIKCVA FT CSDSHLLYQCAHFLKLPVSERDGLLRSHSLCRNCFRRGHQARECNSKFTCR FT QCKGKHHTLVCFRGKPNENRATHTNEPSTTPTATESNGEKTNTKVVNVATT FT STLTCSSATGSPTGVLLLTAIVILEDDQGHTVQARALLDSAAECNLITKRM FT RKQLVLKEAASMVQVVGIHGTSNKVHGRVTVTVRSRISNFSQGMEFYVLPK FT LSAQPGIASVDPSKWNLPVGVELADPHFLTSEPIDLVLGAEFFFDMFVSGQ FT RITMGDNLPLLVDSVFGWIATGRYPIDSPVTSVLCDVATSSQVERLIEKFW FT EYEEVGLESNMSPDEAKCEDYYSQTTRRQATGRYSVSLPRNGELICNLGDS FT RASAEKRFLQIERRLNRESSLRDQYVSFMEEYEALGHMSRVDHEGDGVERC FT YLPHHPVVKEGSTTTRVRVVFDASARTSTGLSLNDCLHSGPVIQRDLRSII FT LRSRFRPIMLVADIEKMFRQVDVCPEDRRFQSILWRPSPDQPLATYELKTV FT TYGTKPAPFLATRTLVQLVNDEAERFPLASATVKNDFYMDDAITGANDSVT FT AKDLRIQLVDMLQSGGFKLRKFASNCPSVLEGIPNEDLSIQAADGIYLDPD FT PMVKTLGLIWIPPTDEFRFHFKAPSLVDTPLTKQKIFSIIASLFDPLGFIG FT PVITQAKIIMQLSWQLLDDKGQKLAWDSPLPSKVEEEFRRFYQQIPLLNDL FT RIARLVVTPQATQTQLHIFSDASEKALGACVYVRTTDSSGRIKVALLLSKS FT RVAPLKTQTIPRLELRAATLAAEMYSQVMDSIETPIENFLLDGFRNCPSLV FT CCNPIDLDDLCCE" FT CDS 5163..6224 FT /product="BEL-47_AA-I_2p" FT /translation="MRCYRTKPATIRQFMAELPKPRVVPAKPFSTTGVDYF FT GPIYVRPGYRRTAVKAYVAVFVCFSTKAVHLELATDLSTACFIQALRRFIS FT RRGKCVQLFSDNGTNFVGAKNAMEKLLQNLRSKEHNEAVSKWCTDEGMQWS FT FIPPGAPHFGGLWESAVRSTKVHLLKVLGDIAVSYEDMTTLLAQVECCLNS FT RPLTPLSDDPEDLEALTPGHFLVSMSLQALPEENLTSVPPGRLQHRDFIQQ FT CLQFYWKRWKSEYLTQLQARTKWWQPPTEIRTGSLVVIRDDNQPPTRWRMA FT RIHAVHPGEDNVVRVVTLKTADGFIKRPVSKICILPVADPVEADVMDPQHT FT SETPAVSEGGR" XX SQ Sequence 6225 BP; 1719 A; 1473 C; 1493 G; 1540 T; 0 other; gccatgtgaa cagttgtgtc gaggaagaaa ttttactagc ttaactaaat ataatccact 60 taaacgttaa attcaataaa gttcagtccg gagtatggtg ttttagtgct aaatagtcta 120 gtgtttcttg tgctcaaccg aatcaactca agtgctaagt gaattggccc aattccaaaa 180 gcccaaggcg acccaggacc gaaaccccag gaatacttac cccactatcg accaacccgc 240 caaaaatccg gaacggcatc ctcagctatc gatcagttgg tccttcgaac cggattggtt 300 ggtcgaaggc accgaatcgg agaaacccat ccgccatcgt agaatctcca accaattcac 360 tgcacagcga ctcaagtacg ccatcttggc ttttggcgtc gatgcacatc acctttcatc 420 atccaatcac caaatcatcg gcatttaatt atccaaggca tatgttcaac ggaaaatcgg 480 tgagtgcaat tccaaggctt tatacccaaa aatccagtac tgctcgtggt cttcatgtgt 540 aataggttat cctcgacatc ggtactgttc cacgtcatcc cattcatcat tacaacgagg 600 tacatcacta cgactggcaa acaacttatc aatcagtgga actgaaaact gcatacccga 660 agaaactcca ttcagctccg cttaccttgg aggaaggtcg ccattctgat catcgagtgc 720 atctggtagc aaactccaaa cgtccggctt catccgctgc gtagcaaccg ctggctctgt 780 tcctgctcac ctctaatatc aatacaaggc tcatacataa gagccagcag gtaattagcg 840 ctccaacgtt actactgtcc caaattgtgc tactttggtc tcatttttcg aggaacacac 900 tatgcccgga aaatcgaaaa cgtcgttcaa gatgttacag gcgcggctgc gaggtctaca 960 aacgtccttc gacaacatgt tcagcttcat ggaggactat accgacgata cgcgagtaac 1020 cgaggtcaaa gtacgactgg accgattgga tgcactttgg gataggatca acgagaacat 1080 cgaagaggta gaatcccacg aagaatccac tagcgaacct gagtctttcg tgaaggttcg 1140 tatagacttc gagaatcgat tttaccagct caaatcgttt ctcttggata agatcaggga 1200 gaatagtgat gaaactgtgc tcaatccaat tattcgctct cacgataaca ctcattccgc 1260 acccaacccc cacgttaggc taccccaaat aacactaccc aaattcagtg gcaaaattga 1320 tgaatggcaa acctttcgcg acttatacac ttctctaata cattggcaac ccgacctccc 1380 cgagattgaa aaatttcatt acctccgcag tcaacttgaa ggtgaggcgt tggctgttat 1440 tgattcgctt ccactcacca gagccaatta tgctgtggca tgggatctaa tctgcactcg 1500 gtactcgaat tccaaaatcc ttcgtaagcg tcaggtacaa gccttgttcg aattgccaga 1560 cgtcaaaagg gaatcagcag cagaattgca caccttgttg gactcctacg agaagattgt 1620 gaagtcattg gaccaagcaa ccccccaatc cactgaattc aaggatcttt tgcttcttca 1680 cctactcagc tcgcgacttg atagtgcaac gaggaggagt tgggaggaat attctgctac 1740 caaggagacc gacaccgtaa aggatcttac ggacttcctg cagcgtcgta ttcaaatttt 1800 ggaggcactc cccagtcgac cggttgaaca acgaatggaa gctggttcat cgaaatttcc 1860 aaggaaaccc aatgttgtga aggtttgcaa caccactctt caaccatcag ctccaattaa 1920 atgcgtggct tgttcggata gtcatctgct ttatcagtgt gcacattttc tgaagctacc 1980 tgtttcggaa agggatggac tgctacggtc tcactcgctg tgtcgaaact gttttcgacg 2040 tggacatcag gctagagagt gtaactcaaa gttcacttgt cgacaatgca agggcaagca 2100 tcacacgctt gtgtgcttca ggggaaaacc gaacgagaat agggcgaccc acaccaacga 2160 acccagcact actcctacag caaccgaatc aaacggagag aaaaccaaca ccaaggtggt 2220 taacgtggca accaccagca cactaacttg cagttcagct actggatccc caacaggagt 2280 gttgttgctc acggcgatcg ttattctgga ggatgaccag ggtcataccg tacaagcgag 2340 agcgctgttg gatagcgctg cagagtgtaa tcttatcacc aagcgtatga ggaagcaatt 2400 ggtactcaag gaggcagcca gcatggtaca ggttgtaggc atccacggta cttctaacaa 2460 ggtgcatggg agggtcacgg tgactgttcg ttctcgaatt tcgaacttct cccaaggtat 2520 ggagttttat gttttgccca agttgtcagc ccagccagga atagcttccg tagacccaag 2580 taaatggaat ttaccagtag gagttgaatt agcggaccca cactttctca cgtctgaacc 2640 gattgacttg gtgctaggag cggaattctt tttcgatatg tttgtgtctg gccagcgtat 2700 aacgatgggt gacaatctgc ctctattagt tgactcagta ttcgggtgga tagctacagg 2760 tcgataccca atagatagcc cagtcacctc tgtgttgtgt gatgtagcga cttcgagtca 2820 ggtagagaga ttgattgaga aattttggga gtatgaggag gtcggtttgg agagcaatat 2880 gtcgcccgac gaagccaaat gcgaagatta ttattcccaa acaacccgaa ggcaagcaac 2940 tgggaggtat tcggtgtccc taccgaggaa cggagagtta atttgcaatc tgggagattc 3000 gagagcatca gcagaaaaga gattcctaca aatcgagcga cggttgaacc gagagtcatc 3060 gttacgcgat cagtatgtgt ctttcatgga ggagtatgag gcattagggc atatgtcacg 3120 agtcgatcac gagggggatg gagtggaacg atgctatctc ccacaccacc cagttgtaaa 3180 ggagggaagt accactacac gggtaagggt ggtgtttgat gcatcagcca gaacatcaac 3240 tggcctttcg ctaaatgact gtttgcattc tggtccggtc atccagagag atcttcgatc 3300 gattattctt cgtagtcgtt ttcgcccaat tatgttggta gccgacattg agaaaatgtt 3360 tcggcaagta gatgtttgcc cagaagacag gcggtttcaa tcaattcttt ggcgtccgtc 3420 accggatcaa cccttggcaa cttatgaatt gaaaacggtc acctatggaa ccaaaccagc 3480 cccatttctt gcgacaagga ctttggtaca acttgtaaat gatgaagccg aacgattccc 3540 acttgcatct gctacggtga aaaacgattt ttatatggat gatgccataa ctggggcgaa 3600 cgattctgtg accgccaagg atttacggat tcagttggtg gacatgcttc agagtggtgg 3660 atttaagctc cgaaaattcg cgtcgaattg cccatccgtt ttagaaggta tacccaatga 3720 agacctctca attcaagcag cggatggaat ttacctggac ccggacccca tggttaaaac 3780 gttaggactg atttggatcc ccccaacaga tgaattccga tttcatttca aggctccttc 3840 tctcgtagac actccactca ccaaacagaa aattttctcg atcatagcta gtttgtttga 3900 cccacttgga tttataggac ctgtaattac ccaagccaaa atcattatgc aactctcatg 3960 gcaattgttg gatgataagg ggcaaaaact agcatgggac tcaccattgc catcgaaggt 4020 agaagaggag tttaggagat tttaccaaca aattcctctg ctcaacgatt tacgcattgc 4080 aagattggta gtaacacctc aagcgacaca aacccaatta cacatatttt cggatgcttc 4140 ggaaaaggcc cttggagcgt gtgtatacgt gaggacgaca gattcatcag ggcgaatcaa 4200 ggtggctcta ctgttatcca aatcgcgggt agcaccgctc aaaactcaaa cgatcccgag 4260 gcttgagttg cgcgccgcaa ctttggctgc tgaaatgtat tcccaggtta tggattcgat 4320 cgaaactcct atcgaaaact ttcttctgga cggattcaga aattgtcctt cattggtgtg 4380 ctgcaatccc atcgacttgg acgacctttg ttgcgaatag tgtcgccaag atttaacggt 4440 tgactgaaaa ttgccactgg aatcatatcc ccggagagaa aaatccagca gatctcattt 4500 ctcgtgggat cctccctgaa gaaattttgg agaaccgttt gtggtgggac gtcaactggt 4560 tgcacacaga tatggaaaag tggccgaagc aaaaggtgtt caccgcaaat ggaatggcag 4620 aggagagacg acaagttgta ttatcatcaa gggtgtcgga ccctagtttc atcgaagagt 4680 acgtactacg atattcaacc tacactacaa tggttcgtca tgctgcttgg tgtcgtcgtt 4740 acctatacaa cctgcgcgtg aagaagggca atcgcaaggt cggaccattg accgtagaag 4800 aactactaca agccgaaacc aaaatattac agcgggttca gaaagatgtt ttcgatggag 4860 agttgaaggc ggtaaataag ggcgagagtg tatcgcgaca atcaccccta cgctggtatt 4920 gcccatttgt cgcgcacgat ggtttgttga gggtaggggg gagacttggg aagtcggatg 4980 agtgtgagaa tgcgaagcac ccaattgtgc tacctgcacg ccatacgcta acaaggctaa 5040 tacttcgtca ctatcattta aaactgttgc acgctggacc acaactcata ctgagcacag 5100 ttcgactctg attttggcct ctcgggggac gtaatttagc gagacagatt tgtcatgagt 5160 gtatgcgttg ctatcggacg aagccagcga ccattcgtca gtttatggcc gaactaccta 5220 aaccgagagt tgttccagca aaaccgtttt cgacgacagg ggtcgactac ttcggcccga 5280 tctatgtgcg acctgggtat cgaagaacag cagtaaaggc gtatgtggca gtgtttgtat 5340 gtttctccac gaaggcagta catctggagt tagccacaga tttatcgacg gcctgcttca 5400 tccaagcttt aaggaggttc atttctcgtc ggggaaagtg tgtccaattg ttctccgaca 5460 acggcacaaa ttttgtaggg gcaaagaacg ccatggaaaa gctcctacag aatctacgat 5520 ccaaggagca taacgaagca gtttcgaagt ggtgtacaga cgaaggaatg cagtggagtt 5580 tcataccccc cggcgccccc cactttggcg gcctgtggga gagtgccgtc cgatcaacaa 5640 aggtgcacct gctgaaggtt cttggcgata tagcggtatc gtatgaggat atgacgactc 5700 tcctagccca ggtggagtgt tgtttaaatt caaggcccct aacacctcta tctgatgacc 5760 cagaagattt ggaggcctta actcctggcc attttttggt ttccatgtcc ttgcaagcat 5820 taccagagga gaatttgacc agtgtaccac caggaaggct gcaacaccga gacttcattc 5880 aacagtgtct tcagttttat tggaaacggt ggaagtccga atatctcacc caactgcagg 5940 cacgtaccaa gtggtggcaa ccgccaacgg aaattaggac tggaagtctg gtagtgattc 6000 gcgacgataa tcaaccaccg acacgctgga gaatggcacg tattcatgcc gtgcatcccg 6060 gggaggacaa cgtagttaga gtcgtcacac tcaagacagc cgatgggttc atcaagagac 6120 cggtttccaa aatatgtatt ctgccggtag ctgatccggt ggaagcagat gtgatggatc 6180 cacagcatac ttctgaaact ccggccgttt cagagggggg gagga 6225 // ID LDRP2 repbase; DNA; INV; 76 BP. XX AC M21010; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE L.donovani highly repetitive DNA. XX KW LDRP2; Repetitive sequence. XX OS Leishmania donovani OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania donovani species complex. XX RN [1] RP 1-76 RA Ellis J. and Crampton J.; RT "Characterization of a simple, highly repetitive DNA sequence RT from the parasite Leishmania donovani."; RL Mol. Biochem. Parasitol 29(1), 9-17 (1988). XX DR GenBank; M21010; Positions 1 76. XX SQ Sequence 76 BP; 30 A; 24 C; 10 G; 12 T; 0 other; cccctaacat aacataataa ccctaacccc cccctctttg aaggaacgaa gaatcagtgg 60 agagaaaaac cctaac 76 // ID I_Ele19 repbase; DNA; INV; 6775 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele19. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6775 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6775 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (07-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 381..1724 FT /product="I_Ele19_1p" FT /translation="MTDSSAGPFGGAAGLRRNKPDWMMDPNDLGQVMVLIL FT RRKTDAADGQRVQDESLPDSIIVGTSIEIIIGKKAARALKATREGRGTRYL FT LRTSSTEIIEKLTKMTELTDGTKIEIVPHPTLNTVQGIVYDPDTINKDEKS FT ILEYLDSQGVHAVRRIQKRINGTLKNTPLLVLSFRGTILPEHVYFGLLRIQ FT VRIYYPTPMLCFNCGCYGHSRKFCQQSAICLRCSAPHHVSEGEQCTNPPNC FT LHCRTEHQVTSRECPKYREEEKIVRLKTDKGISFSEARRLCAQEAKSQTFS FT GIVQEQIQQELAAKDQLIATLQKQVAVLTKELSALKKMLRPATQSQSPAPR FT DRRPSLSNQKSSSLMAPPTQNSTQHSTDRPSRKDQPTTSPKAPRQEDRKTN FT KSSGGIQTRSRSGKRHMEVSPTEAAHSRGKRMPSQLDTNTTFVDIEMNNGP FT GPS" FT CDS 1708..6666 FT /product="I_Ele19_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDQGLHNKIRTVPYDYDSATIMDMKKEERNRPNTFDD FT SRNSMDSDKRPTTTTNTSSNTVVTKLKFQTITSEEYDRMHNLIVVPSTSRT FT SSYITTIDSPSARLTTPTLTEEVPEYSEEPSAAVGVVCPHLRASIPIHPIS FT VKQPNPTFIPINTGFDTTLNSTTYSVWPTPLHAKSSESSAKVDSRKISYGT FT KRNSGIFRSSSNSNFHHGSIHQHKTDLPARCTRLTATIDSPSARPTTLTLT FT GDVPEQSEEPPAAVSVVCPHPWASITSNFIFEEPNTASVPIIPGFDARSRS FT MESNSTSPTSVLQHSHGNVHQPAAIFDSPSARPSTSTLTREVPDPSEEPLA FT AVDVACLQLSASIASFSPKDQSTATSLPTNTDLNIEVNSREVTTATTVPLQ FT TSVEDSPAELDYSSYWNPIGTEFNNHIADLQSPLQTSTDSDSCVTETNENS FT PTFAIQWNICGLRSHLSELQILLAKYQPVIVCLQETNGDYRKLSPSCLGKD FT YQLLLGQCSAHGRQGTGIAIKNGTPFQRIWFQTKIQAVAVQLGAPIAITAV FT SVYLPPKDKEAAKHLGELLEELPKPIIVLGDLNAHHDAWGSRTVQSSSGAK FT QRGEAILELVVQNDMIVLNNGSHTRIDPATGTSQALDVTMCSTSHAAKFSW FT KTLLDFSGSDHLPILVDTLCKQNQPTRRTRWIFENADWPLYEQLTNNSIKP FT GSQFTVDEFTNRIIAAAEASIPKTSGRVGPKSVVWWNQDVELAVKARRKRL FT RALRRLKEGDSRKVEALKLFQEARSHCRKTIADAKQKSWDTFVEGINPDSP FT ASQVWNNINRLQGKRQRNVITLNLPTGHTNNGEEVANALADEYQLKSSDAN FT YPEKFRRKHKTEQRSIRKNQRPNLHKRYNVDLTMSELMWALDRRTGSSTGA FT DNVNYQLLQRLPFSGKVALLELFNQIWHSGTFPEEWKIGIVIPIPKPEADR FT SRPEGYRPITLLSCLGKLFERIINRRLVTELESTRKLDPRQYAFRSGKGVD FT SHLAKLESLITLHEDEHAEIVSLDISRAYDTTYRPGILRTLMNWRITGRMM FT NMLSSFLCDRFFQVAVNGALSTLRRADNGVPQGSILSVTLFLVAIQPMFDV FT IPSDAEILIYADDVILIVKGKNHVTIRRKLRRAVKAAVDWAASVGFTIAPT FT KSKLLHVCNLTHRKRGRAIKIDSIPIPQIRHLKVLGIIIDSKLKLSKHLAS FT VKQNCRKRMHILRILGYRLKRSSRTSLLRIGSALILSKVYFGIGLTSINFE FT ALQQTLEPIYNDVVRQASGAFKTSPTISIMAESGWLNFRLALIQRLCILVV FT GLIEKNEDAVAYPVVQRAKTLMLETTGWTLPNVCNVLRLSNREWYAPVPIF FT ENALRNNIKAGTNRAIVVPRFKELITSRYKIHDQIYTDGSKDRDQIGAGLT FT IGDRNYSFRLPGICSVFSAEAFALMNAVSMSQGHTNVIILTDSASCLDALR FT EGRSKHPWIQSIEKNIEGRNVTFCWIPGHSGIVGNEKADELAKQGCNQPSV FT DIPTPAQDAVRAVKEKIWSYWEAEWNQSRCHLRAIKPTAGKYLDRKCPSEQ FT RVLTRIRIGHTRLTHAHLMNNSNPPVCTYCGDRVTVQHIITECRGLEGSRK FT KCGINGSMAEILAYNAENEKAVIQFLKDCELFDKI" XX SQ Sequence 6775 BP; 2145 A; 1631 C; 1450 G; 1549 T; 0 other; catcgtgctc gggatactgt attacattat caagcgcgca tttgctctct ccaaatctac 60 aaatttccga atacatactg tgcgaaatcc ttcgttttgg ccgttacatg gtggtttcac 120 gcggattgtc cgtcggtaaa gtgaattcgt tgggaaattt accataaaag tgataaatta 180 gacaacttac cgatagcgta aatatcacgc ggtgcagcca ccgcttgtgg ctatacccac 240 acatagatcg tcgcgatagt aggtaccgaa agacttgtcc cgatcggtag gttgtgttaa 300 agcaaaaaca aaatcggtag tggatatttt tttggaatcc tggatctgaa ttctaagtgg 360 tgaacagtag tctaggacca atgacggata gttccgctgg ccccttcggg ggtgcagcag 420 gactgagaag gaacaaacca gattggatga tggatccaaa cgacctcgga caagtgatgg 480 tgttaatcct ccgtcgaaaa acggacgcag ctgacgggca acgagtacaa gatgaatccc 540 tacctgattc gatcatagta ggaacatcga ttgagatcat catcggtaag aaggcagcaa 600 gggcactcaa agcaacccgt gagggtcgtg ggacgcgcta tctgcttcgt accagctcta 660 ctgaaatcat tgaaaagctg acaaagatga ccgagctaac tgatggtaca aaaattgaaa 720 tagtgcccca ccccacactt aatacggtcc aagggatcgt gtacgaccct gacaccatta 780 acaaagacga gaagtccatt ttggagtact tggattccca aggcgtacat gctgtgcgga 840 ggatccaaaa acgtattaat ggcacattaa aaaatacccc tctactggtc ctatcattcc 900 gtggcacaat tctcccagag cacgtatatt tcgggttgct gcgaattcaa gtgcgaattt 960 attaccccac acccatgctc tgtttcaact gtggctgcta tgggcattcg agaaagttct 1020 gccaacaatc tgctatctgt ctgcgatgct cggcaccgca ccacgtatcc gagggagagc 1080 agtgcactaa cccacctaat tgtcttcact gcagaacaga acaccaggtc acgtcacggg 1140 agtgccccaa gtacagagag gaagaaaaaa ttgttcgcct caaaaccgat aaagggatct 1200 ctttctccga agcaaggcgc ctgtgtgccc aagaggcaaa aagtcaaacg ttctctggta 1260 ttgttcaaga gcaaattcaa caagagctgg cagctaagga tcaactaatc gctactctac 1320 aaaaacaagt tgcggtatta actaaggaac tctccgcctt gaagaaaatg ctaagacctg 1380 ccacacaaag tcaatcacca gctccacgcg atcggcgacc gtcattatcc aaccaaaaat 1440 cgtcttcgct aatggcacca cctacacaaa actctaccca acactccact gaccgcccgt 1500 cccgaaaaga tcagcccacc acatcaccga aagcgcctcg acaagaagat cgcaagacca 1560 acaaatctag tggtggaatt caaacacgta gcagaagtgg taaaaggcat atggaagttt 1620 cgcctactga agccgcccac agccggggta aacgaatgcc ttcgcaactc gataccaata 1680 ctacatttgt tgatatcgag atgaacaatg gaccagggcc ttcataacaa aatacgcacc 1740 gtcccttacg attacgactc cgctacgatt atggacatga agaaggaaga acgaaaccga 1800 ccaaacacgt ttgatgactc ccgcaattcc atggactccg ataaacgacc cacaactact 1860 acgaacacat cttctaatac cgtcgttacc aagttgaaat ttcaaaccat cacgtctgaa 1920 gaatacgaca gaatgcataa tctcatcgtc gtaccatcaa cgagcagaac gagcagctat 1980 attacaacaa ttgattcgcc aagtgcgagg ctcaccacgc caaccctgac cgaggaagtc 2040 ccggagtatt ccgaagagcc ctcggcggca gttggcgtgg tttgcccaca tctcagggca 2100 agtattccta tacatcctat atctgtgaaa cagcccaatc ccacttttat cccaatcaac 2160 acaggcttcg atactacact caattcgaca acgtacagtg tatggcctac gccattacac 2220 gcaaaaagct ccgagtcatc agcgaaagtg gactctcgaa agatatcgta tggaactaaa 2280 cgaaatagcg gcatattccg atcctcttcg aactcaaatt tccaccatgg ctccattcat 2340 caacacaaga cagacctacc agctagatgc acaagactca cagcaacaat cgattcgcca 2400 agcgcgaggc ctaccacgct gaccctgacc ggggatgttc cggagcaatc cgaagagcct 2460 ccggcggcag tcagcgtggt ctgcccacac ccatgggcaa gtattacttc aaattttata 2520 ttcgaagaac ccaatactgc atctgttcca attatcccag gctttgatgc tagatcgaga 2580 tcaatggaga gcaacagtac atcacctacc agcgtacttc aacatagcca cggcaatgtt 2640 catcaacctg cagccatctt tgattcgcca agcgcgaggc cttccacgtc gaccctgacc 2700 agggaggtgc cggatccatc cgaagagccc ctggcggcag tcgacgtggc ttgcctgcaa 2760 ctctcggcaa gtatagcttc tttttcgccc aaagatcagt caactgccac ttctcttcct 2820 accaacacag acctcaacat agaagtaaat tcaagggagg tcactacagc cacaaccgtt 2880 cccctgcaaa catcggttga agattctcca gcggaattgg actactcatc gtactggaat 2940 ccaatcggga ccgaattcaa caaccacatc gccgacttgc agtcaccgtt acaaacatca 3000 accgattcgg attcctgtgt taccgaaacg aacgaaaaca gcccaacgtt cgcaattcag 3060 tggaacatat gtgggctccg ctcccatcta agcgaactac agatcctttt agcaaaatac 3120 cagcctgtaa tagtgtgctt acaggaaacc aatggagact accgaaagct aagccctagc 3180 tgtctgggga aggattacca gcttctactt ggccagtgct ctgctcatgg aaggcaaggt 3240 actggaattg caattaaaaa cggcaccccc ttccaaagaa tatggtttca aaccaaaata 3300 caggccgttg cagttcaatt aggcgctcca atcgcaataa cagcagtttc ggtttatctt 3360 ccacctaaag acaaggaagc cgcgaaacat ttgggtgagc ttttggaaga acttccaaaa 3420 cctatcatcg tgttaggtga cttgaatgcg catcacgacg catggggtag tagaacagtt 3480 caatcatcat ctggagccaa acagagaggt gaagcaatat tagaactagt tgtgcaaaat 3540 gatatgattg ttcttaataa tggatctcac acccgcatcg acccagcaac agggacctct 3600 caagccttag acgtaacgat gtgcagcacc tcacatgcag caaaattttc ctggaaaacg 3660 ttactcgatt tttcaggtag cgaccatcta ccgatattgg ttgacacact ttgcaagcag 3720 aaccaaccca cacgccgaac ccgatggatt tttgaaaatg cagattggcc actctacgag 3780 cagctaacaa acaattcgat caaaccaggt tctcagttta cggtagacga atttacaaac 3840 agaataatcg ctgctgcaga ggcaagcatc cccaaaactt cagggagggt aggaccaaaa 3900 tctgttgtat ggtggaatca agatgtagaa ttagcagtaa aggccagacg aaaacgacta 3960 cgagctcttc gtcgcttgaa agaaggtgac tctagaaaag tcgaagctct taaattattt 4020 caagaagcac gatcacactg ccgcaaaact atagccgatg ctaaacaaaa aagctgggat 4080 acatttgttg agggcataaa ccctgattca cctgctagtc aagtttggaa caacatcaat 4140 aggctccagg gaaagcgaca acggaatgta ataaccctaa accttccaac agggcatacg 4200 aataacgggg aagaagtagc aaacgcccta gctgatgagt accaactgaa atcttcggac 4260 gcaaactatc ccgaaaaatt ccgtcggaaa cacaaaaccg agcaacggtc aatacgaaaa 4320 aaccaacgtc ccaacctaca taagcgatac aacgtagacc ttaccatgag cgaactgatg 4380 tgggctcttg atagacggac gggttcttca acgggagcgg ataacgtaaa ctatcaattg 4440 cttcaaaggt taccattttc tgggaaggtt gctcttttgg agttattcaa tcaaatatgg 4500 cacagtggca cttttcctga agaatggaaa ataggaatag taattccaat tcctaaacct 4560 gaagctgatc gcagccgacc agaaggatac cggccaataa ctttattgag ttgcttagga 4620 aagttgttcg aaagaataat caaccgtcgg cttgtaacag aattagagtc cacgagaaaa 4680 ttagacccac gacagtacgc tttccggtca ggcaagggag ttgactctca ccttgccaaa 4740 ttagaatcgc ttatcaccct ccatgaggac gaacatgcag aaatcgtatc tcttgacata 4800 tcaagagcct acgatactac atacagacct ggtatccttc gtactctgat gaattggaga 4860 ataacaggga ggatgatgaa catgctatct agttttctgt gtgacagatt ctttcaagtg 4920 gcagttaatg gagccctctc aacccttaga agagcagata atggagttcc acaaggatca 4980 atcctatctg tcacattgtt ccttgtagca attcaaccca tgttcgatgt tatacctagt 5040 gatgcagaaa tcctaatata tgcagatgac gtcattctaa tagtgaaggg taagaaccac 5100 gtcacaatac gtagaaagct tagaagagcg gttaaagctg cagtcgattg ggccgctagt 5160 gttgggttca ctatagcccc tacgaaatct aagctactgc atgtctgtaa cctaacacac 5220 cgtaaaagag gtagagcgat caagattgac agcattccaa tccctcaaat aaggcattta 5280 aaagttctcg gaatcataat cgattctaaa ctcaaacttt caaaacacct agcgtccgtc 5340 aagcaaaact gtcgtaaaag aatgcatatt ctacgtatac taggatatcg tctgaaaaga 5400 agtagcagaa ctagtttgtt aaggatagga tcagctctta ttctttcgaa agtctatttt 5460 ggaatcggac tcaccagcat aaattttgaa gcattgcaac aaactctgga acctatctac 5520 aacgatgtcg ttcgtcaagc atcaggagca ttcaaaacta gcccaacaat ttctataatg 5580 gcggaaagtg gttggctgaa ctttcgctta gctttgattc agcgtctgtg tatactggtt 5640 gttggcctca ttgagaaaaa tgaggatgca gttgcctatc cagttgtcca gagagctaaa 5700 acacttatgt tggaaacgac gggatggacg ttgccaaatg tgtgtaatgt gttaaggttg 5760 tcaaacagag agtggtatgc gccagtccca atatttgaaa atgcactgag gaacaacatc 5820 aaagcaggta caaaccgggc catcgtggta ccaaggttta aggaacttat cacaagtcgc 5880 tacaaaatcc acgatcaaat atacacggat ggctcaaaag atcgcgacca aattggagct 5940 ggactaacaa ttggtgatag aaactatagt tttcgtttac ccggcatatg cagcgtattc 6000 tccgccgaag ctttcgcctt gatgaacgct gtttcaatgt cacaaggaca cacaaatgtt 6060 atcatcctaa cagattctgc cagctgtctt gacgcgctac gcgaaggacg atctaaacac 6120 ccctggatac aatccatcga aaagaatata gagggacgaa atgtgacatt ctgctggata 6180 cctgggcatt ctggcatagt cgggaatgag aaagctgatg agctggcgaa acaaggttgt 6240 aatcaaccat cggtggatat accaacacct gcccaggatg ctgtacgagc ggtaaaagaa 6300 aaaatttggt catattggga agcagaatgg aaccagagcc gatgtcacct aagagccatc 6360 aaacccacag ctggaaaata cttggatcgt aaatgcccat ccgaacaacg cgtgctaact 6420 cgcatacgaa tagggcacac tcgcctcacc catgcacacc tgatgaacaa cagcaatcct 6480 cctgtatgca cgtattgtgg agatcgagtt acagtgcagc acataataac ggaatgcaga 6540 ggactcgaag gaagtaggaa aaaatgtggt ataaatggat cgatggctga aatactcgcg 6600 tataacgctg aaaatgaaaa agcagtgata caatttctca aagattgtga attgtttgat 6660 aaaatttaac aaaggttaat ctgataaaat ttgtacgaaa ttagtattaa acagacacga 6720 atgccactca gatatgtggt aaagtgtctc taataaagca taataataat aataa 6775 // ID MuDR-6_TV repbase; DNA; INV; 3583 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE MuDR DNA transposon from Trichomonas vaginalis. XX KW MuDR; DNA transposon; Transposable Element; MuDR-6_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-3583 RA Kapitonov V.V. and Jurka J.; RT "MuDR DNA transposons from protozoans."; RL Repbase Reports 8(12), 1816-1816 (2008). XX DR [1] (Consensus) XX CC The MuDR-6_TV consensus sequence was derived from multiple CC alignment of 20 copies less than 1% divergent from it. MuDR-6_TV CC copies are usually flanked by 9- or 10-bp TSDs. MuDR-6_TV CC contains imperfect 23-bp TIRs (5 mismatches), subterminal CC inverted repeats (pos. 132-704 and 2911-2339) and codes for a CC 482-aa MuDR transposase. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 876..2321 FT /product="MuDR-6_TVp" FT /note="MuDR transposase." FT /translation="MELEEETIFTVMDLCKFGVKQEIQAQYGYGERVRKER FT SQISFHCKYPKCPAHFNVAILDQNKYKLIKIVEEHDHSAPNDKSQYSSVFY FT RKYLEHYFQTDDNQRAIAQLNAFKDLEIPLTGPSIPEMYAQKPRAIKRFAE FT RIHALQTRYSPDDVVSLQIFVQTVQEESPDDLIAFEVSDNNMCFYYAPFEG FT KAYAHSIKTFHIDSTYKLLRSNIPLYAFTGKIKDFHIFPFLYFFCQPDTYQ FT NIQVCVCAYLRWVTIEPEYISMDCAPQIAQAIEESIPQAQILWCGVHVLRA FT ILRKSNKFSSQEKFEQFYNLMKDLVFKLDAEHEEEANAAYDQVLELLDTDP FT TASQYFNRQWRYNISRWIARDRREGDATNNIAEAHFRTLKHDYFPDRKNLR FT IDDVIIEIYTKVIPSFVIKLKIDQNPADDRVIRIMEKATNAEIDVKRQRKI FT ESISMLNRLLNAVSDDSIDPTKIYPTLKVLLQRTHFI" XX SQ Sequence 3583 BP; 1251 A; 581 C; 505 G; 1245 T; 1 other; ggggcggcga caaatttgcc cagccagtaa agtattcaat tttttttgca tttttttaag 60 ggtgcacacg agaggatcat gtaaacttca gaatttagct tcttaactta aatcatggaa 120 tgtctcaaag acaaaagtag catacagcat gtaaagctaa agaaagataa tcataattaa 180 taccttaatt tttaatatta cgcattttta ggtgcaaaat ttaaaaacat tctaatacaa 240 tagagcagtt tccaaaagta gcatacagca tgtaaagcta aagaaagata atcataatta 300 ataccttaat ttttaatatt acgcattttt aggtgcaaaa tttaaaaaca ttctaataca 360 atagagcagt ttccaaaagt agcatacagc atgtaaagct aaagaaagat aatcataatt 420 aataccttaa tttttaatat tacgcatttt taggtgcaaa atttaaaaac attctaatac 480 aatagagcag tttccaaaag tagcatacag catgcaaagc taaagaaaga taatcataat 540 taataccttt aaagtttaaa atcttgattt tttttaaaat caaatattta taactttaat 600 acatacagct taaaccttat acttgttttc tcttgtgaga aagcaatctt acaaagctaa 660 caatcctaaa aaataaagaa ttaaaattta ataatttaat ctaacaaaat ttctctcttt 720 atccttttgc actctttata tgcaactcaa atccatcata taatattcct ataatactct 780 tgcctcaaac aatgacgaac aaactgctat tctcatttaa ttacagacca aatctattca 840 aatttctgaa tttttaattt tcatttcaat ttcaaatgga attagaagaa gaaactatct 900 ttactgttat ggatctttgt aaatttggag ttaaacaaga aatacaagca cagtatggat 960 atggtgaacg tgtcaggaaa gaaagatctc agatttcatt ccactgcaaa tatccaaaat 1020 gtcctgcaca ttttaatgtt gcaatcttgg atcaaaataa atacaaacta atcaaaatag 1080 ttgaagaaca tgatcattcc gccccaaatg acaaaagtca atattcgtct gtcttttatc 1140 gcaaatattt agaacattac ttccaaacag atgacaatca aagagcgata gcacagctaa 1200 atgcattcaa agatcttgaa attcctctta caggtccgtc cattccagag atgtacgctc 1260 aaaaaccacg tgcaatcaaa cgatttgcag agagaataca tgctcttcaa actagatatt 1320 ctcctgacga tgttgtgtca ttgcaaatct ttgttcagac agtacaagaa gaatctccag 1380 atgatctcat tgcatttgaa gtctcagaca ataatatgtg tttctattac gcgccttttg 1440 aaggcaaagc ttatgcgcat agtatcaaga catttcatat agacagcacc tataaattgt 1500 tgcgatcaaa cattccattg tatgcattta ctggcaaaat caaagatttt cacattttcc 1560 cttttttata tttcttttgt caaccagaca cttatcaaaa cattcaagtt tgcgtgtgcg 1620 cctatctcag atgggtaaca attgagccag aatacattag catggactgt gctccacaga 1680 tagctcaagc aatagaagaa agcattccac aagctcaaat actttggtgc ggagtacatg 1740 ttcttcgtgc aattcttcgc aaatcaaata agttttcttc tcaagaaaaa tttgaacaat 1800 tctataattt gatgaaggat ttagtcttca aattagatgc agaacatgag gaagaagcaa 1860 atgctgccta tgatcaagtt ctggagttgc tcgacacaga tccaacagca tctcaatact 1920 ttaacagaca atggcgttac aacatatcaa gatggattgc tcgtgacagg cgtgaaggcg 1980 acgcaactaa caatattgcc gaagctcatt ttagaacact caaacatgac tattttcctg 2040 atcgaaaaaa tctcagaatt gatgatgtta ttatagaaat ttatacaaaa gtgataccat 2100 ctttcgttat caaactcaaa attgatcaaa atccagcaga tgaccgtgtc attaggatca 2160 tggaaaaagc aactaacgca gaaattgatg taaaaagaca aagaaaaatc gagagtattt 2220 caatgctcaa tagattgctc aatgctgtta gtgatgactc aatagatccc actaaaatat 2280 atcctacctt gaaagtatta ttacaaagaa ctcattttat ttaattttga ttttaatttt 2340 agattaaatt attaaatttt aattctttat tttttaggat tgttagcttt gtaagattgc 2400 tttctcacaa gagaaaacaa gtataaggtt taagctgtat gtattaaagt tataaatatt 2460 tgattttaaa aaaaatcaag attttaaact ttaaaggtat taattatgat tatctttctt 2520 tagctttgca tgctgtatgc tacttttgga aactgctcta ttgtattaga atgtttttaa 2580 attttgcacc taaaaatgcg taatattaaa aattaaggta ttaattatga ttatctttct 2640 ttagctttac atgctgtatg ctacttttgg aaactgctct attgtattag aatgttttta 2700 aattttgcac ctaaaaatgc gtaatattaa aaattaaggt attaattatg attatctttc 2760 tttagctttr catgctgtat gctacttttg gaaactgctc tattgtatta gaatgttttt 2820 aaattttgca cctaaaaatg cgtaatatta aaaattaagg tattaattat gattatcttt 2880 ctttagcttt acatgctgta tgctactttt ggaaactgct ctattgtatt agaatgtttt 2940 taaattttgc acctaaaaat gcgtaatatt aaaaattaag gtattaatta tgattatctt 3000 tctttagctt tacatgctgt atgctacttt tgtctttgag acattccatg atttaagtta 3060 agaagcttta aaaaaaaaag caaaaaaaat gtaataaaat aaaataaaaa aatttttagt 3120 atcgataatc tgaaatatgt tgtttactta agccgcgagc gaagcgagcg gcttagtttc 3180 gtaatttcgt acaatctttt ttcgaatctg ttcttttctt aagccgcaag cgaagcgagc 3240 ggcttagttt cgtaatttcg tacaatcttt tttcgaatct gttcttttct taagccgcga 3300 gcgaagcgag tggcttagtt tcgtaatttc atacaattat ttttcgaatt tgttcttttt 3360 gtcaaccgcg agcgaagcga gtggcttttt attttttaaa aatttttttt tctttcttac 3420 ttttttttcc cttacttaca aatttgatag taacttacca aatctctcta tcatataata 3480 ataaattaat ccattttttg tttcaaatct gacactttac atttctgaca tgtaaacctg 3540 tcacttttca cttcacctca ctgggcaagt ttgtgccccc ctc 3583 // ID Copia-4_DGri-I repbase; DNA; INV; 4120 BP. XX AC scaffold_5173; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_DGri_; KW Copia-4_DGri-LTR; Copia-4_DGri-I. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-4120 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_5173; Positions 5431 1312. XX CC Positions [1528-2061] - Integrase core CC 'TGAAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 166..4110 FT /product="Copia-4_DGri-I_1p" FT /translation="MSALYNIDKLDETNYDSWCVQMKSVLIHQELWSVTSG FT AMKCPSSDKTSEYSQWLSKNEKALAVITLSIRGSEVGHIKKCESASSAWII FT LKAVHQPTGPVRKVTLFKRLLFMRMNENENMQIYIANFVNTVEQLTEIGIV FT LQEEIYVILLLASLPKSYENLVVALESRDELPKFSTLKVKLIEECDRRKSE FT AQRESVNENNAQAFLSKAHYPKESAKKTDKVGKNRNKKIKCYNCGKIGHIS FT AECRSKANEGHAKGRDEKSFFTLLSAKNGERLLDKSAWCLDSGATSHMCAN FT QNMFNSFEQFTDTVTLAGEKSLEVVGRGVVMIKTESSTVKLTNVLYVPQLQ FT CNFISVSRAVKGGCTVNFKNNRACVCDENGKTLIIANHRSNLFLCEITKTI FT NTVFLSENGKSGSEDIIKWHKRYGHLNMQSLKELNNMKLVRGVNFNSTSEF FT DCKICLQSKCSAKPFNMSVSKSEDVLNLIHSDVCGPINICSNGGSRYFVTM FT IDDKSRYVHVYFIKNKSEVLNVFKNYKNMVECQTGKRIKVLRSDNGREYVN FT NEFDIFLKQNGIVRQLSVPHTPQQNGVAERMNRTLVEMARCMLLEAKLKES FT FWAEAVHTAAYLRNRSPTKAVLQSTPYELWTGRKPSVGHLRTFGCLAIALD FT KTQNRKFRPKGIEYILVGYADQAKAYRLYNSASGKMIESRDVYFVESTTES FT SAVDIPMLPDFNCSKVITEVQSNRSSSRGSSRASSRASSRASSRGSSKGTG FT EDNGNTDINKSEISKKSNRLERTSGHGWRRPDQMHKLNLVSSDAVVTPETV FT EEALESPQAIEWQKAMAQEFGALKVNGTWEPAELPSNHKAIGCRWVFALKR FT DKNGTIERYKARLVAKGCSQKQGVNYDETFSPVVRYATIRMVLAIAAELVL FT HVHQLDVSTAYLNGELQEDVFMKQPRMFEEQGVSVLKLRKSLYGLKQSGRM FT WNKKLNEMMTKIGFSACPSELCVYTRHTQNNINIIAVYVDDMMICSSDIGE FT LASIKESISRIFQVVDKGPIHHFLGMEINRKGIIGAITITQQQHVLQLLKK FT FGMEACRPASVPLASGYTTKCESQQCKTVNQAEYQSLIGSLMYLAVTSRPD FT ILHSVCKLSQRNTDPHQEHWAAAKQVLRYLRTTSSMGITYAKTGNPIKGYV FT DADWGGDNTDRKSYTGYVFILGGSAFSWESKKQSSVALSSTEAEYMGMSEA FT TKEAVYILKLCNEMNIYRNMKSIVIHGDNLSAMNLVKNPVYHARSKHIDIR FT YHFVREVFNKLMIQLEYCPTSQMMADILTRNLSKLKHENCNRLLGLKLIVN FT NVY" XX SQ Sequence 4120 BP; 1414 A; 724 C; 989 G; 993 T; 0 other; ggttatgggc ccaggcataa aagaaaaagt tgaatgaaac taaggttatg ctgaaaagcc 60 gcgtttaagg tcgagtgaca ttttttcatt agaattgcat aattttgtat tgctgtgtgc 120 tttaccaact aaagttttct gtgtgagaac aagtacgagc gaaaaatgag tgcgctgtac 180 aatatagaca agttggacga aacaaattat gattcgtggt gtgtccaaat gaaaagtgtg 240 ctaattcacc aagagctatg gtccgtcact tcgggagcta tgaaatgccc aagctcagac 300 aagacgagtg aatatagtca gtggttatca aaaaacgaaa aggctttagc ggtgattact 360 ctctcaataa gaggctctga ggttggtcac attaagaaat gtgaatcggc gtcgagtgcg 420 tggataattc taaaggcagt gcatcagccc accggtccag tgagaaaagt cacattgttc 480 aagagattac tcttcatgcg gatgaacgag aatgagaaca tgcaaatata tattgcaaat 540 tttgtgaaca ccgtggagca gttaacagaa attggcattg ttctgcaaga ggaaatatat 600 gttattttac tcttggcaag tctacctaaa tcatatgaaa atttggttgt agcattggaa 660 tcacgtgatg aattaccaaa gtttagtact ttgaaggtta aattgatcga agagtgtgat 720 agacgcaaga gcgaagcaca aagagagagt gtgaacgaaa ataatgcaca ggcattttta 780 agcaaagcac attaccctaa agagagtgca aaaaagacag acaaagtagg caagaaccga 840 aacaaaaaaa tcaagtgtta taattgcggt aaaattggcc acatttctgc agaatgtagg 900 tccaaggcga acgaaggtca tgcaaaagga agagacgaaa aatcattctt cacgttgcta 960 agtgcaaaaa acggtgaacg attgcttgat aagtcagcgt ggtgcttaga tagtggcgcc 1020 acatctcata tgtgcgcaaa tcaaaatatg tttaattcgt ttgaacaatt cactgataca 1080 gtcacacttg ctggtgaaaa atcactagaa gtagtcggac gaggagtcgt catgattaaa 1140 acggaatcgt caacagtaaa acttacaaac gttctttatg tgccacagtt acaatgcaat 1200 tttatttcgg tgagcagagc agtaaaagga ggctgcactg taaattttaa aaacaatcgt 1260 gcgtgcgtat gtgacgaaaa cggtaaaact ctgattattg cgaatcatag aagcaattta 1320 tttctgtgtg aaataacaaa aacaataaat acagtgttcc taagtgaaaa tggcaaaagt 1380 ggcagtgagg acattataaa atggcataag agatatggac atctgaacat gcaaagcctc 1440 aaagagttaa ataatatgaa attggtgcgt ggagtgaact tcaacagcac aagcgaattt 1500 gactgtaaga tatgcctaca gagtaagtgt tccgcgaagc catttaatat gtcggtcagc 1560 aaaagtgagg acgtgctaaa cctcatacat agtgatgttt gcggccctat aaacatatgt 1620 tcaaacggag gatcacgtta ctttgttaca atgattgatg ataaatcgcg atatgttcac 1680 gtatatttta tcaaaaataa atcagaagtt ctaaatgtgt ttaagaacta taagaacatg 1740 gtggaatgcc aaacgggaaa acggatcaaa gtactccgca gtgacaatgg tagagaatat 1800 gtaaacaacg agtttgacat atttctgaaa caaaatggga tcgtccgtca attgagtgtg 1860 cctcacacac cccagcaaaa tggagttgca gaaaggatga atcgtaccct cgtggagatg 1920 gcgcgatgta tgctgctaga ggccaaactt aaagaaagtt tctgggcaga agctgtacat 1980 acggcagctt atctacggaa tagatctcct acaaaagctg tgctccagag tacgccatat 2040 gagttatgga cagggcgaaa gccatcagtt ggccaccttc gtacatttgg gtgcttggcg 2100 atagcgttgg acaagacaca gaatcggaaa tttagaccga agggtataga atacatactg 2160 gttggatatg cagatcaagc taaagcatat cggctctaca acagcgccag tggtaaaatg 2220 atcgaaagcc gtgatgtata ttttgttgag tcaacaacag aatcaagtgc agttgatata 2280 cccatgttac ccgattttaa ttgcagcaag gtaattacag aagtgcagtc caatagaagc 2340 agcagcagag gcagcagcag agccagcagc agagccagca gcagagccag cagcagaggc 2400 agcagcaaag gcaccggcga agataacggt aacaccgata tcaataaaag tgaaatcagc 2460 aagaaaagta accgacttga aagaacaagt gggcacggat ggcgaagacc agaccagatg 2520 cacaagctga atcttgtgag cagtgatgcc gtcgtaacac cagaaactgt ggaagaagca 2580 ttggaaagtc cacaggcgat cgaatggcaa aaggccatgg cacaagaatt cggagcactg 2640 aaagtgaatg gaacttggga accagccgaa ttaccatcca atcataaggc gattggatgc 2700 agatgggtct ttgcactaaa acgtgacaaa aatggcacca tagaacgata caaggctagg 2760 ctagtagcaa agggctgcag ccagaaacaa ggagtgaact acgatgaaac gttttcgcca 2820 gtagtgcgat atgcgacaat aaggatggtg ttggccatag cagctgaact ggtgctacat 2880 gtacaccaac ttgatgtatc cacggcatat ctaaatggag aattgcagga ggatgttttc 2940 atgaagcaac cccgtatgtt tgaagaacaa ggagtgtctg ttcttaaact tcgaaagtcc 3000 ttatatgggc taaaacagtc tggacggatg tggaacaaaa agctgaacga gatgatgaca 3060 aaaattgggt tttcagcctg tccaagcgag ctgtgtgtat atactaggca tacacaaaac 3120 aatataaaca taattgcagt atatgtggat gatatgatga tatgttcatc agatattggt 3180 gagttggcca gtataaaaga aagtatatct cgaattttcc aggtggtcga caagggccct 3240 atacatcatt tccttggtat ggaaatcaat aggaagggca taataggggc gataacgatc 3300 acccagcagc agcatgtgtt gcagctgctt aagaagtttg gcatggaagc gtgtcggcca 3360 gcatcagtac cgttggcgtc cgggtacacc accaaatgtg agagccaaca atgcaagacg 3420 gtcaatcagg cagagtacca gtcactgatc ggatcattaa tgtatctggc agttacatcc 3480 agacctgaca ttttacattc agtatgtaag ctgtctcaac ggaatacgga tccccaccaa 3540 gagcactggg cagcagcaaa acaagtgctg cgatatctga ggacgacgtc ttcaatgggc 3600 ataacatatg ctaaaactgg aaatcccatc aaaggctacg ttgatgcgga ctggggtgga 3660 gataacacgg accgaaagtc gtacactgga tacgtattca ttcttggcgg tagcgcattc 3720 tcctgggaaa gcaagaagca gtcgtcagta gcactaagta gtactgaagc cgaatatatg 3780 ggaatgtcgg aagcaactaa agaagctgtt tatattctga aattatgtaa cgaaatgaac 3840 atatacagaa atatgaaaag tattgtaatt catggcgata atttgagcgc tatgaactta 3900 gttaaaaatc cagtttacca tgcaagaagc aagcatatag atattcgtta tcacttcgtg 3960 cgagaagttt ttaataagct tatgattcag ctggaatatt gtccaacgag tcaaatgatg 4020 gcagatatct taactaggaa tttgagcaag ttaaaacatg aaaattgtaa tagattgctt 4080 ggtctcaaat taattgtcaa taatgtgtat tgagggggag 4120 // ID hATx-5_SM repbase; DNA; INV; 2577 BP. XX AC . XX DT 23-JAN-2008 (Rel. 13.01, Created) DT 23-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-5_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2577 RA Jurka J.; RT "A distinct branch of hAT superfamily from freshwater planarian RT Schmidtea mediterranea."; RL Repbase Reports 8(1), 16-16 (2008). XX DR [1] (Consensus) XX CC Present in several hundred copies in the genome. >99% identical CC to consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 557..2407 FT /product="hATx-5_SM_1p" FT /translation="MSDVWKYYKKDGNDHGICNECFSKLKTKGGSTKGLWT FT HLKTTHPDVLKNSQATSTKRKNYSDLEPTKKSKSSHSIDSYFYKETLGEIL FT AKFAAKDGFSIHAILNSEGIREYIRLKKMEMPNSLSTVASLIYEFYEHKKG FT ETIQKLKKRLEIGGKYSISIDEWSDCQQRKYLNITLHEPEEDIVLGLKLVE FT GSCNAERTTELVTEFLDEFKLDLENDIVASSNDGAAVMVKYGKLNKNISQV FT CYNHAIHNSVVKVFYENKSTQVDSESGQEIGVVEDEDDTEYEESKGRDAFF FT NDLDSEPDYTEIQSISEVLNQTRKIVKFFKYSSVRKSIFQKHVLEQEGKNL FT NLILDCKTRWKSTDQMVARFIKLEKPINDTLIEIGREILPKEHFSILTVIS FT QILAPAAEVINELSKNNSNLLTSEGSIAYLFETLNENPHPLAAELVSELKD FT KLLARRNKKVVSLLYFLHNGTYAKRNKFLEYSSKEEVKSFAEQLHQKFSLN FT NKEKETSSSGSGSEDEAQPDKIMQRENKLQESINKVLNAKKVLNKVSIKNE FT MSQYECSGAKSANLEKLYESLLTIKSTSTASERIFSISGFICSKIRNRMST FT RMLNALVFLRYYFLKKIER" XX SQ Sequence 2577 BP; 968 A; 383 C; 451 G; 775 T; 0 other; tagggttttc aaaacgggat cccgggatcc cgaatcccgg gaatatagcc ttttttcaat 60 cccgggatcc cggaaaaaat tgattgggat cccgggattt tgttcctgtt tttttttggg 120 attacgaatc ttctgttgaa aatatctcca ctttttttga atataattca gtttcaaagg 180 aaaaattgaa gagttttttt tatgattcga cggaatgaat tgaagattcg gatatgaaaa 240 gctccagttt aaaaaaaccc agaaagttcc acgtttttct ctattgaatc ttcaagttaa 300 tgccgaaaag ctacaaaaat accttgcccc ccgtttttta ttctgaactt gctcttgttt 360 cttttgtata atttataatt gaaaaatttg tcgtgcattt tctggattga aaaaattatc 420 tgaactgtaa atctttgaca gtgattcaaa ccttgaattt caaagctagt atattttgat 480 aactatggct taaaatgatt attaaattga tgttttatta ttttatcgtg aaataattat 540 aatcattttt tagaaaatgt cagacgtctg gaaatactac aaaaaagatg gtaacgatca 600 tggaatttgt aatgaatgtt ttagcaaatt gaaaaccaaa ggaggttcaa caaaaggttt 660 atggactcat ttaaagacca cccatccaga cgtgttgaaa aacagccaag caacctcaac 720 aaaaaggaaa aattattcag atcttgaacc cacaaagaaa tcaaaaagca gtcattcaat 780 agattcgtat ttttacaaag aaacactcgg tgagattttg gcgaaatttg cagcgaaaga 840 tggtttttct atacacgcaa ttttgaattc cgagggtata cgagaataca tcagattgaa 900 aaaaatggaa atgcccaaca gtctatcaac tgtagcaagt ttaatctatg aattctatga 960 gcacaaaaaa ggtgaaacta ttcaaaaatt gaaaaagaga ctggaaattg gtggaaaata 1020 ttcaataagc atcgatgaat ggtcagattg tcaacaacgc aaatatttga atattacatt 1080 acacgagcct gaagaagaca ttgttttggg gttgaaactc gtggaaggat catgtaacgc 1140 cgaaaggaca actgaacttg tcacagaatt tttagatgaa ttcaaattgg atttggaaaa 1200 tgatattgta gcatccagca atgatggagc tgcagtaatg gtgaaatatg gaaaactcaa 1260 caaaaatatt tctcaagtct gctacaatca cgctattcat aattctgtgg taaaagtttt 1320 ctatgagaat aaatcaactc aggttgattc agaaagtggt caagaaattg gggtcgtaga 1380 agacgaagat gacacagaat atgaggaatc aaaaggcaga gatgcatttt tcaatgattt 1440 ggattcagaa ccagattaca cagagattca gagcattagt gaggtgttaa atcaaactag 1500 aaaaattgtc aagttcttta aatattcgtc tgttcgcaaa tcaatttttc aaaagcatgt 1560 attggaacaa gaaggcaaaa acttaaacct aattttggat tgtaaaacaa ggtggaaatc 1620 tacagatcaa atggttgcac gattcataaa gcttgagaaa cccatcaatg atacgctaat 1680 agaaattgga cgagaaattt tacccaaaga acatttcagt attctgacag taattagtca 1740 aatacttgca cctgctgctg aagtaattaa tgagctaagt aaaaacaaca gtaacttgtt 1800 aacttctgaa ggttcgattg catacttatt tgagaccctg aatgaaaatc cacatccgct 1860 tgctgcagag ctcgtctcag aattgaaaga caaactttta gcaagacgaa acaaaaaagt 1920 tgtgtctttg ttatatttcc ttcacaatgg tacatatgca aaaagaaata agttcttgga 1980 gtattcaagc aaagaagaag ttaaatcatt tgccgaacaa ttacaccaaa agttttcttt 2040 aaacaataaa gaaaaagaaa cctcaagctc gggctcaggc tcagaagacg aagcacaacc 2100 tgacaaaatt atgcagagag aaaataagct acaggaatca attaacaaag ttttgaatgc 2160 aaagaaagtt ttaaataaag tttctataaa aaatgagatg tcacaatatg aatgttctgg 2220 tgctaaatct gcaaatttgg aaaagttata tgaatcttta ttgacaataa aatcaacatc 2280 aacagcttct gaacgaattt tttccatatc tggattcatt tgctcgaaaa ttcgaaatcg 2340 aatgtcaaca agaatgctga acgctttagt atttttacga tattattttt taaagaaaat 2400 cgaaaggtag atgtttaaat aagcttaatt tacaataatt tatgtaaaat ctttagtttt 2460 gatataataa aaaaatattt ggtatataaa tttagaaaaa atataaaaat ttattttgaa 2520 aaaataaaaa tcccgggatc ccgaattttc cgaatcccgg gatttttgga aacccta 2577 // ID Gypsy-133_AA-I repbase; DNA; INV; 4200 BP. XX AC AAGE02025031; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-133_AA_; KW Gypsy-133_AA-LTR; Gypsy-133_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4200 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025031; Positions 12003 16202. XX CC Positions [1159-1581] - Reverse transcriptase CC Positions [2800-3267] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1069..3669 FT /product="Gypsy-133_AA-I_1p" FT /translation="MPPARNVYTHIPIAFKEATKQKLQELLSSGIIEEVTN FT DMDRSFCSSLLVVPKGKNDIRLVIDLRGPNRCIYRTPFKMPTFESIIMELH FT GTKWFSTIDLTSAFHHIELNEDSRHLTNFFAGEGLYRYCRLPFGLTNAPDI FT FQEVLQTIVLAGCKGVVNYLDDVMIFGRTKKEHDENLAKVLECFKNHNVKI FT NEAKCAFGKRSVDFLGFVVSDNGWKIENEKIAAIQNFRTPETVAEVKSFLG FT LINFVEKFIPQRADKTWRLRELAKSKNFFWNQALEDEFEYVRTTAWKTIET FT LGYFKREDRTELYVDASPHGLGAILVQFDEMSKPRVIACASKTLTDTERKY FT PHTQKEALAMVWGIERFSMYLMSMNFTVRTDAESNEFIFGGLHRIGRRAVT FT RAESWALRLQPYNFNVCSIPGEMNVADALSRLVKQTQSAEDFDDGADEKHL FT LHFIDAGAMEICWDEVELCSEEDQELKSVRASIKTNHWNNDLRRYESEAKD FT LTIFGNMVFKNDKIVLPDSLRMKAIHSAHQGHMGVGSTKRILRQHFWWPGM FT SKAVETFIKNCETCLLLSRKNPPVPLVSRELPNGPWEVLQIDFFTDKDFGF FT GEFLVIVDTYSRYLHVIEMRRIDAEATIEALNKIFVLWGQPLILQSDNGPP FT FQSERFVKTWENRGVKIRKSIPLSPQSNGAVERQNEGIKKILAASRLDNSN FT WKAALNDYVHMHNKIRPLTRLGVTPFELLVGWKYRGMFPSLWKNNGESEID FT RENVRELDASSKLASKKNADFRRGAKYSDITVGDKVVLAQFKRVKSDPTFG FT SERFTVIARDGAKLVVKSNRGVVYSRNVADAKRASERCDESNVTKSQGQDI FT LTSAETFITGNIYI" XX SQ Sequence 4200 BP; 1403 A; 750 C; 932 G; 1115 T; 0 other; ttccagggtg aatcagcgcg gtaataatcg taatggacga ggaggatatg gaaatgtaag 60 aggtaattca cgaaaccaac cggtttcgaa ttgctggcga tgtggaagcg ccttccatcg 120 ggcatcagtg tgcttcgcta ttgacaaaga atgtcgagtt tgtggtagag ttggccacat 180 tgcgcggacc tgtgtttctc gaggctcatc caatcaggcc agaggagtga aacgctttgc 240 tgatcatgat gaatcgggtg tcgacaagaa aatcgctgcc attgaggata ccaaaactga 300 accatgtgac gttaaggtac gagaagacga agatcttgag taaaattcaa ttcctcgtaa 360 gaaaatcttt ccaattattt cgtttacctt tgttttggaa cttttatatt ggatttaaaa 420 tactttagtt gtaatgaaca tgtgactgaa ctcgaataaa cttattaata aatgtatact 480 acattacaga aattgcattt gggatcaatt cagtcaaagg actctgcgaa tctctataca 540 aatcgtgccc acccacaaaa taagccactt aaatgtggaa tcaccagcga taattatgca 600 atgattgaag caaaagtagc aggagtaaga ttacagtttc tgatcgattc tggcgctcac 660 gttaatacca taaccaaaac gatatttgat gaaataatga ccaacaagaa ggcagcgtcc 720 aagatctttg ctttgcatta ttctactgac aagccgttga aggcatatgc gacaaatggt 780 cagattaaag taattgcaat ttttttcgct gagcttttca tatcagaaga aagaccagta 840 atggtagaaa agttttacgt tgtagacgag tcaagagcgc tgttaggctt caatacagca 900 atcaggtatt gcgttctaga tgttggtctg aatgtcccat tacgcccttg caggatgttg 960 gcaaaccaag ataacacaga cacaagttgt tacgctcaag agatctcagt cgcagccgaa 1020 tttcctaagt tcaatattcc agctgtttaa cttaagtacg ataaagaaat gcctccagct 1080 cgaaacgtgt acacgcatat accgattgcg tttaaagaag ctactaagca gaagcttcaa 1140 gaattattgt ccagcggcat cattgaagaa gtgactaatg atatggatcg ttcgttttgt 1200 tcttctctac tggtagtacc aaaaggaaag aacgatatac gtcttgtcat agatcttcga 1260 ggaccaaatc gttgtatcta ccgtacacca tttaaaatgc ccacctttga gtcaatcatt 1320 atggagctac atggtactaa gtggttttca acgatagatc tcacgagtgc gttccatcac 1380 attgaattaa acgaggactc acgtcacctc accaattttt tcgctggaga aggactctac 1440 agatactgta ggttaccttt tggattgaca aacgcacccg acattttcca agaagtgctt 1500 caaactatag ttcttgccgg atgcaaaggg gtcgttaatt acttagacga cgtgatgatt 1560 tttggacgaa caaagaaaga acacgacgaa aatttggcca aagtactgga atgtttcaaa 1620 aaccacaacg tcaaaatcaa tgaagcgaaa tgtgcattcg gtaaacgttc tgtcgatttt 1680 cttggattcg tagtatcaga taatggatgg aaaatagaaa atgaaaagat tgcagctata 1740 caaaacttta gaactccgga aacagtggcc gaggtcaaaa gcttccttgg tctgataaat 1800 tttgttgaaa aatttattcc acaaagagca gacaaaacat ggaggctacg tgagctagca 1860 aaatcgaaaa actttttttg gaatcaagcc ttggaagacg agtttgaata tgttcgaaca 1920 actgcatgga agaccatcga aaccttgggt tactttaaac gcgaagatcg aacagagctt 1980 tacgtggacg cctcgcccca tggactagga gccatcttag tgcagtttga cgaaatgtcg 2040 aagccaagag tgattgcttg tgcttccaaa actctgacgg atacagagag aaaatatccc 2100 catacgcaga aagaagcatt agccatggtg tggggaatcg aaaggttttc catgtacctt 2160 atgagcatga acttcacggt gagaactgat gccgaatcaa acgagtttat ttttggcgga 2220 ctacacagga tcggaagaag ggctgtaacc agagcagaat cttgggcgtt gcggctgcag 2280 ccctataatt tcaatgtatg cagtatacca ggagaaatga atgtggcaga tgcactttca 2340 agattggtca agcaaacaca atctgccgaa gattttgacg atggagctga cgaaaaacac 2400 ttgctccact tcattgatgc gggtgccatg gaaatctgct gggatgaagt agagctttgc 2460 tcagaggaag atcaagaact aaagagcgtc agggcatcaa ttaaaacgaa tcactggaat 2520 aatgatttga gaaggtatga atctgaggca aaagacctta cgattttcgg aaacatggtg 2580 tttaaaaatg acaagatcgt acttcctgac tctttgagaa tgaaggcaat tcattcggct 2640 catcagggtc acatgggagt tgggtctact aagagaatac tacgtcaaca tttttggtgg 2700 ccaggcatga gtaaagcagt agagacgttc atcaaaaact gcgaaacatg tctcttgtta 2760 tcgagaaaga atcctccggt acctctggtc agcagagagc ttcccaacgg tccatgggaa 2820 gttctacaga ttgatttttt caccgataag gatttcggtt tcggagaatt tcttgtgata 2880 gttgacacat attctcggta tctccatgtt atagaaatgc gacgaattga tgcagaagct 2940 actatcgaag cgcttaacaa aatattcgtc ctttggggtc aaccattaat acttcaaagt 3000 gacaatggcc ccccatttca gagcgagaga tttgtgaaga cctgggaaaa ccgaggagtg 3060 aaaattcgga aatctattcc cttaagtccg caatccaacg gagctgtgga gcgtcagaac 3120 gaaggtataa agaaaatatt ggctgcgtcg aggctagata atagcaattg gaaagctgcc 3180 ctgaacgact atgtccatat gcacaataag ataagaccat taactaggct tggggtcaca 3240 cccttcgagc ttctagtagg ttggaagtat cgaggaatgt ttcctagctt atggaagaac 3300 aacggtgaat cagaaattga tcgggaaaac gtcagagagc tagatgcatc ttccaagctt 3360 gctagtaaga agaacgcaga ttttcgaaga ggagccaagt attcagatat tactgttgga 3420 gacaaggtgg tgttggcaca attcaagcga gtcaaatcgg atccaacatt tgggtcagaa 3480 agatttactg taatagctag agatggagcc aaactggttg tcaaaagtaa tcgaggtgtt 3540 gtgtactcga gaaacgtagc ggatgccaaa agggctagcg agcgatgcga tgaatcgaat 3600 gtaaccaaat cccaaggtca agacattttg acgagtgcgg aaaccttcat aacaggtaat 3660 atttatatat aaaaacaata ataattatta ttgtttataa ttaatttgaa tgttttatgt 3720 ttgatcagaa atgccgttac cagattccaa gtatcaaaac catggaagga gtggaagctt 3780 ggctgtattt gacggtgcaa gtaaaaatgg taagattttt tttaataacg cgttattttt 3840 tttaaatgaa acatttatta tatacagata aacctaccga cgaagacgaa acagaggaaa 3900 gacctgacaa cggtgatgca ttaggttcac gcccaagacg caatagaaaa ataccggaca 3960 agctgaagga catggtttta tataacattt atgattagaa attattggtt attaaaaaaa 4020 aaatgttatg aaaatgtttt acagaaatgg ataattcaaa cgactcagta ctgaagaaat 4080 aaaaaaaaaa ttatatatat atcataagat tgataaataa atacataaca ataataaaaa 4140 aaaactgctc tgaaacagta ttacttcaat aaaagatttg agagtagagg tgaaatgaat 4200 // ID Gypsy-156_AA-I repbase; DNA; INV; 5873 BP. XX AC AAGE02024556; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-156_AA_; KW Gypsy-156_AA-LTR; Gypsy-156_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5873 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024556; Positions 19063 13191. XX CC Positions [4348-4824] - Integrase core CC 'CCTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 152..2044 FT /product="Gypsy-156_AA-I_3p" FT /translation="MDVEFDLLYKNLLLHHLEREEIEYELTIRGLPYTDTE FT TRAALMRRLKDKLKTDPNANIDITQCDFEIGSEFKQIDSKIKEIRDCLAHR FT NKFEGIRESLKTRLAHYFKRTKRLTELTDKDEDLSDIDGLIACIRGTFNDN FT FSIFAPLGQRDVIETLNQSISNLLITNENSSATVQQNKTSTPYEEDVEVLE FT GASSSRTRRLKSGIDHSGEKLLPESESNIMPLLLQQWMMRDTQIQNWVQQM FT VREQASEYFENAIRTNSRSVPKEKPSVNTVRKTTSHRSNNRSRRSSSESRK FT SSSTEAGWSSDEPRNSSRGSKRVQKSGKRRPVSDWKLKYDGSDNGLALMKF FT LREVEFYAKSEKMSSKELFQSAIHLFNGAAKTWFMTGFDNEDFTSWEELKE FT ELKKEFLSPDHDHSTEIRAIARKQAPRESFHDYFLELQKIFNSLTKPMTER FT RKFEIVFRNMRADYKGHAVSSEIDNLADLKKFGRRLDATYWFKYQNPSNES FT SNPRGKPSQVNDLNADARNKSKSKAGFTKQKSRTFHNSSTQPRGSDNEEKP FT SQKPPESGAKPKEQNPPIPSQYNQPAPKYCLNCRTKGHHTRDCDQPKQKSC FT QKCGFYNVDTASCPNCAKNELKTVAEGRQFNQN" FT CDS 2263..3261 FT /product="Gypsy-156_AA-I_1p" FT /translation="MGAMKLIRKCGLKIFPSEVSLRTASGEKLGVTGMVYL FT PITFNGKTKIVDTLIAPSLKRKILLGMNFWRAFEIRPTVDEVQVEETVVDE FT DSEASFPDSKEPEIELTEEQTARLNIVKNQFKIASEGILDTTDWISHRIEL FT TEEAKKLSPARVNPFPVSPKKQEQINNELDKMLECKIIEKSYSDWALRLVP FT VDKSDGTVRLCLDARKLNERTVRDSYPLPHADRILSRLGHANYISTIDLSK FT AFLQGPLHPRSRKYTAFSVLGRGLFQFTRMPFGLVNSPATLSRLMDRVLGG FT GELEPNVFVYLEDIIVISETFEEHLTLLLKWHPDLELPIYL" FT CDS 3418..5205 FT /product="Gypsy-156_AA-I_2p" FT /translation="MCNYYRRFLANYSRYTQPLTDLLRNKPKTVCWNDEAE FT ESFRRIKELLISSPILTNPNFNLPFAIHCDASDAAIAGVLTQTHDGIEKPI FT TLFSQKLTTTQQRYCATEKEGLAVLNSIEKFRCYVEGSKFTVYTDASALTY FT IMRSSWRTSSRLCRWSIELQKYEMTILHRRGVDNIIADALSRSVEELEACE FT EDNGWYADLKRKVQSDPEKYKDFRIDHGVLKKLVSTQDDLLDYRFSWKTCV FT PKSLRETKLVEEHDDKMHLGVEKTLALIKKKYFWPKMLEDVREYVRKCTIC FT RQNKPANRSQMPEPGQQRLTNRPFQIVALDFIQSLPRSKNGNTHLLVIMDI FT FSKWCLLTPVKKISAPLVTKILEESWFRRYSVPEFIITDNASTFLSSEFKA FT LLAKYQVQHWRNARYHSQANPVERLNRTINACIRSYVKDNQKVWDTKVSEI FT EHCLNNTPHSSTGFSPYRVLFGHEIVGTGEEHRMDREKEEVSEDGRLERKG FT KIDEQVYSTVKKNLQKCHEKNIKTYNLRSKQSAPVYVVGQRVFRRNFRQSS FT AADSYNAKLDALYVPCTILARVGTSSYELADDKGKPIGVFSVCDLKPEA" XX SQ Sequence 5873 BP; 1803 A; 1334 C; 1311 G; 1425 T; 0 other; atttggcgcc caactaaaat ttaaaattct aacattacca cccccctttg gtaagggcag 60 taaaacagtg tcctgtttta gttttctttt tcgtcctcca aacgaagaat cattagtggt 120 tgagaactgt taaattgatc gaaatttcac gatggatgtt gaattcgacc tactctacaa 180 gaatttatta ctacatcatc tcgaacggga agagatagag tatgaactta ctattcgtgg 240 tcttccttat acggatacag aaacgcgcgc ggctttaatg agacggttga aggataaact 300 caagaccgat ccgaacgcga atatagatat tactcagtgc gattttgaaa tcggttctga 360 attcaaacag attgactcga agattaaaga aattcgggac tgtctagcac ataggaacaa 420 attcgaagga atccgagaga gccttaaaac acgattggct cattatttca aacgtaccaa 480 acggttaaca gaattaactg ataaagatga agatctttcg gacatcgacg gtttgattgc 540 gtgcattcgc ggaacattta acgataattt ttcaatcttt gcgccactcg ggcaacgaga 600 tgttatagag acactgaacc agtctatttc gaatctcctt attacgaatg aaaactcgtc 660 ggctacggta caacaaaata agacgtctac cccatacgag gaagatgttg aagtgcttga 720 gggagcatca agctcgcgta cgcgtaggct caaatccggt attgatcatt cgggagaaaa 780 gctattacct gagtcagagt cgaacatcat gccattgctt ttgcagcaat ggatgatgcg 840 agacacacaa attcaaaatt gggtccagca gatggtgaga gaacaagctt cggagtattt 900 cgaaaacgcg atccggacaa actctcgctc cgtccccaaa gagaagccgt cggtgaatac 960 agttcgtaaa acaacctcac accggtcaaa taatcgttca cgccggtcaa gctcggaatc 1020 tagaaagtcc tccagtaccg aagctggttg gtcttccgat gaaccacgga attcgtcgcg 1080 cggaagcaaa cgtgtccaaa aatcgggaaa acgtcggcca gtctcagact ggaaacttaa 1140 atacgacggg tctgataacg gactcgccct aatgaaattc cttcgtgaag tagaatttta 1200 cgctaagtcc gaaaagatgt cgagcaaaga attatttcaa tccgcgattc acctgtttaa 1260 cggcgccgcg aaaacatggt ttatgacggg ctttgacaac gaggatttca cctcgtggga 1320 agagctgaaa gaggagctca agaaggaatt cttgagccct gaccacgatc attccaccga 1380 aattcgtgcg atcgcgagaa agcaggcacc acgcgagtcg ttccatgatt attttctaga 1440 gttacagaaa atatttaact cgttaactaa acctatgacc gaacgtcgga aatttgaaat 1500 agtgttccgt aacatgcgcg cggattataa aggacatgcg gtgtcctcgg aaatcgataa 1560 cttagccgat ctcaagaagt tcggtaggcg tttggatgcg acatactggt tcaaatacca 1620 aaatccatcc aacgagtcgt ccaatccacg cggtaagcca tctcaagtga atgatctcaa 1680 tgcggacgcg agaaacaaat cgaaatccaa agcagggttt acaaagcaaa aatcgcgaac 1740 tttccataat tcgtcaactc aacctcgcgg ctcagataat gaagaaaaac cgtcgcaaaa 1800 accgcccgaa agcggtgcga aaccgaagga gcaaaatcca ccaatcccct ctcagtacaa 1860 tcaaccagcc cctaaatact gtttgaattg tcgcacaaag ggtcatcaca ctcgtgactg 1920 tgatcagccc aaacaaaaaa gttgtcaaaa gtgtggtttc tataatgtag acacagcttc 1980 ttgtccaaac tgtgcaaaaa acgaattgaa gactgtcgcg gagggccgac agttcaatca 2040 gaactgaagt cccttgctaa ttacgacaat gtaacttatg ctctccagaa ccaagggttc 2100 gacagatttt cccccacaga ttatgtccac actcacaatt atcagattaa tgagacgatc 2160 attaaagtag aaaatgacga tagaccattt gttaagctgt ccgtcttcca aattcccctg 2220 gtaggcttgt tagacagtgg tgcccatctc accattctgg ggatgggagc tatgaaatta 2280 atacgcaaat gtggccttaa aatctttcca tccgaggtga gtcttcggac tgccagcggg 2340 gaaaaattgg gagtcactgg tatggtctat cttcctatta cctttaatgg aaaaaccaag 2400 atagtagaca cgctgatagc accttcactc aaaaggaaaa tccttttagg aatgaacttt 2460 tggcgtgcgt ttgaaattcg ccctaccgta gacgaagtgc aggttgaaga aacggttgtc 2520 gatgaagact cggaggcaag ttttccggac tccaaggaac ccgaaataga attaaccgaa 2580 gaacaaactg ctcggttgaa cattgtaaaa aatcaattta aaattgcatc ggagggtata 2640 ctggatacca ccgattggat aagccacaga atagaactca cagaggaagc caaaaagctt 2700 tcccctgcta gagtaaatcc gtttccagtt tctccaaaga aacaggaaca gatcaacaac 2760 gaattggaca aaatgctcga atgcaaaata attgagaaat cttacagcga ctgggcttta 2820 cgtctggttc ctgtagacaa atcagatgga acagtgcgcc tttgcctgga tgcacgcaag 2880 ctgaatgaac gtacggtcag ggactcttat cccctcccac acgccgaccg tatcttgagc 2940 agactaggac atgctaacta catatctact atagaccttt caaaggcttt tttacaaggt 3000 cctctgcatc ctaggtctcg caagtatacg gcattttccg tgttgggaag gggactgttc 3060 cagttcacgc gtatgccttt cggtttggtg aacagtcctg cgacactatc acgactgatg 3120 gatcgagtgt tgggcggtgg tgagctggaa ccaaacgttt tcgtctacct ggaggacatc 3180 atcgtcatta gcgaaacgtt tgaagagcac ctgaccctac ttctgaagtg gcatccagac 3240 ttagagctgc caatctatct ataaatctag aaaagtccca tttttgcgtc actgaagtta 3300 cttacttagg ctacattctg gatcgtgatg gtctgaggcc gaatccagat cgtgtggccg 3360 cggggtgtaa actatgagcg acctacgtct ttgcgggctc taaggaggtt tcttgggatg 3420 tgtaactatt atagacggtt tctagccaac tatagcaggt acacacagcc cctaactgac 3480 ttacttagaa acaagcctaa gacagtttgt tggaacgacg aagccgaaga gtcattccga 3540 aggataaaag aacttcttat cagttcccca atcctaacca atcccaactt caatctaccc 3600 tttgcgatac attgcgacgc gagtgatgcg gccatcgccg gtgtgcttac gcagacgcac 3660 gacgggatcg agaagcctat tacgctcttc tcgcaaaagc tgaccactac ccaacaaagg 3720 tattgcgcga ccgaaaagga gggccttgcg gtcctcaatt cgatcgagaa gttccgatgc 3780 tatgtggaag gtagcaaatt taccgtgtac acagatgcat ccgctctgac gtacataatg 3840 cgcagtagct ggcgaacgtc gtcgcgtcta tgtcgctgga gcatagagct acagaaatat 3900 gaaatgacta tcctgcaccg ccgaggtgtc gacaacatca tagccgatgc cttgtccagg 3960 tccgtcgagg aactggaggc ctgcgaagag gataacggct ggtatgcgga tcttaaaagg 4020 aaagtacaat ccgatccaga gaaatacaaa gattttagaa tagatcatgg agtcctcaag 4080 aagcttgttt caacacaaga cgaccttttg gattaccgtt tctcttggaa aacctgtgtt 4140 ccaaaatccc tccgagaaac aaaactcgtt gaagaacatg atgataaaat gcatcttgga 4200 gtcgaaaaga ctctggcgtt aataaagaaa aaatacttct ggccgaagat gcttgaagac 4260 gtgcgagagt acgtcagaaa gtgtactatt tgccggcaaa ataaaccagc aaatcggtct 4320 caaatgcccg aacctggtca acaacggttg actaataggc cttttcaaat tgtggccttg 4380 gactttattc agtccctccc ccgcagtaaa aacggtaata ctcacctact ggtaataatg 4440 gacattttct cgaaatggtg cttacttacc ccggtgaaga aaatctccgc tccactggta 4500 accaagattc tcgaagaatc ttggttcaga aggtattcag ttccagagtt cattataacc 4560 gataacgctt cgactttcct aagctcagag ttcaaagctc tgctagccaa ataccaggtc 4620 cagcattgga gaaatgcccg ataccatagc caggcaaatc ccgttgagcg tctaaatcgc 4680 acgatcaatg cttgcatcag atcgtacgtt aaagacaacc agaaagtttg ggacacgaaa 4740 gtgtccgaaa tagaacattg ccttaacaat actccccact cctctaccgg gttttcaccc 4800 taccgagtac ttttcggaca cgaaatagtt gggacaggtg aggaacacag gatggatagg 4860 gaaaaggaag aggtatccga ggacgggagg ttggaaagga agggaaaaat cgatgagcaa 4920 gtttattcca ctgtcaagaa aaacctacaa aaatgccatg aaaagaacat aaaaacttat 4980 aatctacgct ccaaacaatc tgctccggtc tacgtggtgg gccagagagt ctttcggcgg 5040 aacttccgac agtcatccgc agcggactcg tataacgcca aactggatgc cctatacgtt 5100 ccgtgtacca tcctggctcg agtgggaacg agctcgtacg agctagcaga cgataaggga 5160 aagcctattg gcgtattctc agtctgtgat ctgaagcccg aagcctaaaa gtaatcacca 5220 ttcattaaaa tcttcaagtg ggtcgtttta tttgagaaac actcaccttc cattcatacg 5280 tccggacgta aaaacaaaga aattttcctc ggtttgatcc ccgagaccta aaaagattga 5340 aaacaaaatg gtttagcact ctgtcgtgat tgtggtcgac tgaaaactac tcgtatcatc 5400 tcttacctgg ccaagatcca aatttcctcc tccaacagca tattctcacc tccagctgat 5460 ccctcaaaag tccaatggga attcgaccgg taaaagcttc tccgatcgct ggacacgcgc 5520 gcagtcagaa tttgctgctg gaagacccga atttagtcga acacgccgga aaactcgcgg 5580 aaaccgtgaa aacgcgtcgg aaaagacaac tcgcatgcac cgtcgttcga tcgcgcagta 5640 gtaaataaaa gttgacgatt tgacagttct caacacagta gtgttggtaa aattgagtta 5700 ccttttcttc atcgtgagtg cgatgaaggt gaatggaaat tcagagtggg ccctaggggt 5760 tcatatattt aacataaatc ggtaatcgat taatgttgta attaatatgt aaaaaccctt 5820 gttttgattt gaaatgtaat aaacagcatt tcagaagagg gacgatgggg caa 5873 // ID Crack-9_BF repbase; DNA; INV; 3514 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-9_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-9_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3514 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3514 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 814-814 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..3165 FT /product="Crack-9_BF_2p" FT /translation="MASCLYANLAVLAAIFLQFTNPVTELKCISSSQPNPC FT EVFCSRFNFTAISQADHVLSLLYPTGHITRIARSKKDIHVKTSVNLAHLCI FT HTLLLLSGDVACNPGPNQSEDNVVANVDWQPLFERIKSKNGIKMVHLNTRS FT FLPKLDEIRMAMTDNPIDILTLNETWLDSSIDDNELYIPNYTLYRKDRNRH FT GGGVACYVTDELQHKLIPELTELDIESVWVEVKHTVGKPIVIGSAYRPPNA FT TTQFFETLETAMTTATAISDEVFLLGDLNCDVLTKGACKKIDKICSTFQAK FT QLIDEPTRVTETSSTCIDIIITTSPEKVTESGVCSTGLSDHCFTYAVRKAK FT RPPGNPRTATVRSYRRFDESAFQEDLRNAPWSKVDNCSNVDDALSCFQTML FT TNVCDDHAPWVSIRIRGHEPPWITQEYLAMAHERDFYFDKAKKTKLTSMWE FT TAKKLRNKCNNMARYLKKQYYCNEIETKGKDSKSLWATLKTLLPSQAKQPV FT NVPKQQQKSIANKFNTYFTSIGSKLAAAFTGAYRAVLGPAQFSFKFTNIST FT EFTYKQLMSIPLGKSTGLDGVSSRLIRHAAPIIAGPLTHIYNLSLTTGEVP FT KDWKRAKVTPLFKDGDKTSCSNYRPISVLPTFMKIFEKAIHTQVYTYCKDH FT KILNSFQSGFRPAHSTTTTLVHVTDSIIENMEKGLVTGAIFLDLKKAFDTV FT CHKILLLKLRMYGVQDKELTWFCSYLTSRYQTTVVNGVQSDFLEVTVGVPQ FT GSVLGPLLFILYINDLPDYISHGQVVLYADDTALFYASKTVRDVNTCLNAD FT LRNIETWLHTNKLTLNVTKCKAMLFGTARRLQSSTEQLSTTLSGTSLEVVT FT SFKYLGVWLDPCLTWEVHIDKLCNAVSSRLGVLRRLIPCLPQRTLVMLFNA FT MVLPKMDYCDVIWGNCGKCLSDRLQKLQNRAARLILGLKYTHHVGVDELSA FT LGWKTLTSRRNLHLLQSMYKAVNNELPDYINSFQRTSDSHSYPTRHSQNLC FT LKLPLVRLECGKRKFSYRGITAWNSLPPSVKTCTSKSTFRTNCRAWLA*" XX SQ Sequence 3514 BP; 1065 A; 771 C; 691 G; 987 T; 0 other; atggcaagct gtttatatgc aaatctggcg gtacttgccg ccatcttcct ccagtttacc 60 aacccagtta cagagctgaa atgtatcagc agcagtcaac ccaacccttg tgaagtattc 120 tgttcaaggt tcaattttac tgcgatttca caagcagacc atgtactatc cttactgtat 180 ccgactggcc atataaccag gatagccagg agtaagaaag atatacatgt taaaaccagt 240 gttaacctgg cacacctttg tatccataca cttcttctac tgtcaggtga tgtggcctgt 300 aaccctgggc ccaaccagag tgaagacaac gtagtggcta atgtggattg gcaacctttg 360 tttgagagaa taaaatcaaa gaatggtatt aaaatggtac atctaaatac tcgtagtttc 420 ttgccaaaac ttgatgaaat tagaatggca atgactgaca atcctataga catattaacc 480 ctgaatgaaa cttggctaga ttctagtata gatgacaatg aactgtatat accgaactat 540 acactgtatc gtaaagacag aaaccggcat ggaggtggtg tggcgtgtta tgtgaccgat 600 gaattacaac acaaattgat acccgaactt accgagttag atattgaaag tgtttgggtg 660 gaagttaaac atactgtcgg caagcctatc gttattggct ccgcctaccg cccaccaaat 720 gcaacaacac agttttttga aacactagag acagccatga ccaccgccac tgctatttct 780 gatgaggttt tcttacttgg tgacctaaac tgtgatgtac ttacaaaagg agcctgtaag 840 aaaattgata agatttgtag cacctttcaa gcaaaacagc tcatagacga accgacacgt 900 gtgactgaga cttcgtccac atgtatcgac atcatcataa caacttcccc tgaaaaggtg 960 acagaatctg gcgtgtgttc tactgggctg agcgaccact gcttcaccta cgctgtacgg 1020 aaagccaaac gacccccggg caaccctagg acggcaacag tgagatccta caggcggttt 1080 gatgaatcag cttttcagga agatcttcgc aatgcaccct ggtctaaagt tgataactgt 1140 tcaaatgttg atgatgcgtt gagttgcttt cagactatgc tcaccaatgt ttgcgacgat 1200 catgccccgt gggtatctat acgtatccga ggccatgagc caccatggat aacacaagaa 1260 tacctagcca tggcccatga aagggacttc tacttcgata aagctaaaaa gacaaaacta 1320 acatctatgt gggaaacagc taagaaatta cgaaataagt gtaacaatat ggcacggtat 1380 ttgaaaaaac agtactactg taacgaaata gagactaaag ggaaagacag taaaagtctc 1440 tgggccactt taaaaacact cttaccgagt caagccaaac aacctgttaa tgtccccaaa 1500 caacaacaaa aaagtattgc aaacaaattt aatacttact ttacttcaat tggctctaaa 1560 cttgctgcag cgttcactgg tgcgtacaga gcagtcttag gacccgcaca gttttctttt 1620 aaattcacaa acatctccac tgagttcaca tacaagcagc tcatgagcat tcctttggga 1680 aaaagcactg gtctggatgg agtgagcagt aggctcatcc gacatgctgc cccaatcata 1740 gcaggacctc tcactcacat ttacaactta tccctaacta caggcgaagt tccaaaggac 1800 tggaagagag ccaaggttac acctctgttc aaagacgggg acaaaaccag ttgtagcaac 1860 tacagaccaa tttcagtcct gccaaccttt atgaaaattt ttgagaaagc catacataca 1920 caagtataca cttactgtaa ggaccacaaa atactgaaca gttttcagtc tggttttagg 1980 cccgcacact caaccactac aaccctggtt catgttactg actccatcat tgaaaatatg 2040 gagaagggtc ttgttaccgg agcaatcttt cttgatctca agaaagcttt tgatacggta 2100 tgccataaga tcctcctact caagctaaga atgtatgggg tacaagataa ggaactgaca 2160 tggttttgtt cttatctgac aagccgatac cagactactg tcgtaaatgg agtccagagt 2220 gactttcttg aagtcactgt tggtgtcccc caaggctcgg ttctcgggcc actcttgttt 2280 atactgtaca ttaatgattt gcctgattat atatcacacg gacaggttgt gttgtatgct 2340 gatgatactg cacttttcta tgcatctaaa acagtgaggg atgtgaatac ttgtctgaat 2400 gccgaccttc gtaatatcga aacatggtta cacacaaata aattgacact caatgttact 2460 aaatgtaaag caatgctctt cggcacagcg cgtaggctgc agtcaagtac agagcaactg 2520 tcaaccactc tttctggcac cagtttggaa gttgtcacta gctttaagta tctgggcgtt 2580 tggcttgatc cttgcttaac atgggaagtc catattgaca agttatgtaa tgcagtctcg 2640 tcaagactgg gagtcttacg gcgcctaatt ccttgtctac cacagcgcac tcttgttatg 2700 ttattcaacg ccatggtttt gcctaaaatg gactattgcg atgtgatttg gggtaattgt 2760 ggtaaatgtc tatccgatag actacaaaaa ctccagaacc gggcagctag gcttattctt 2820 ggtttaaaat acacccatca tgttggcgtt gatgagctat ctgccttggg ctggaaaacc 2880 ttaacatcca ggagaaactt gcacctgcta caatcaatgt acaaggcagt taacaatgaa 2940 cttcctgatt acatcaatag ttttcaaagg acttctgact ctcacagcta ccccacccga 3000 cacagtcaga acctatgcct caaactaccc cttgtacggt tagaatgtgg gaagcgcaag 3060 ttctcttata gaggtattac tgcatggaac agtcttccac catctgtaaa gacctgcacg 3120 tcaaaatcta cctttagaac caactgcagg gcttggctag cctgacctct gaccactcaa 3180 tgaaactgac aacttgatag tatttatttt gttactatta ttattgtttt ctttagtttc 3240 gtaccattat ttttcattca tcatttgtat tgatatcatc atcactagca gcatctattg 3300 ttatgattag tattattgtt tgcatcagtt tcattactat tgttatttaa tcatttcatt 3360 tattgctgtc tatttatgtt tgcctatgtt tgtgatgaat atgtttaatt tctttttatg 3420 taacacgcag ggctccactg aaaagcagtg caagagcact gagcttggac caccctggat 3480 aaagattaca gaaataaata aataaataaa taaa 3514 // ID Gypsy-20_AA-I repbase; DNA; INV; 4477 BP. XX AC supercont1.196; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_AA_; KW Gypsy-20_AA-LTR; Gypsy-20_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4477 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.196; Positions 276258 271782. XX CC Positions [1865-2293] - Reverse transcriptase CC Positions [3691-3948] - Integrase core CC 'GTAGG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 471..1622 FT /product="Gypsy-20_AA-I_2p" FT /translation="MEKWEIQPFKFKALPRNIVRNEWIKYKRNFEYIIAAS FT GETDKTRIKNLFLARAGPDLQEVFASIPGADVAEDKEEGVDPYAVAMQKLD FT AYFSPKQHETFERNVFWTLKPNPEETLEKFMLRCQDQANKCNFGSTAMESK FT AISVLILFAPSDLKEQLLHRDSLEIDEVAKIVGSYESVKHQARLMNFGQSG FT LVEPVDMATAGNNINKIRNIPIKECMRCGKRDHFANDQNCPARGKECIKCR FT RIGHFAARCRTPATKRRYTEEPKPASSKGFKRQRIHEIETDDERKDRRFIF FT NINDGGEWLKLGGVTLQLLIDSGCKKNIINEKAWECMKNNGAKIWNQDKNC FT TEVFLPYGKDAEPLCVLGKFDALISVEDDGRTIEQLVFINA" FT CDS 1655..3748 FT /product="Gypsy-20_AA-I_1p" FT /translation="MSLGVLVIGLPSTHGVNMIGSAQKRPFPKIKGVQVEI FT PVDNSVVPVCQHPRRPPFALLSKIEDKLTSLLISDIIEPVEEGYQWVSPLV FT TVIKDNGDLRLCVDMRRANVAILRERHIMPTIEDFLPRFTSAKYFSRVDIK FT EAFHQVELKGESRYITTFITHMGLFRYIRLMYGIVIAPEIFQRIMEQISSR FT CEHTVNFIDDILVFGDTEEEHDAELKLVLSILNERGILLNQDKCLFKVTSL FT DFLGHTVSSDGIKPSDSKIESLQRFRRPATTEEVRSFLGLVTYVGRFLPNL FT ATVTAPLRELTRSGHKFSWGKDQESSFVKLKDMIGNVQQLYFFDNKLRTRL FT VADASPVALGAVLIQFDGSTDVHPRPIAYASKSLTATERRYCQTEKEVLAL FT VRSVERFAMYLYGRTFELETDHKPLEAIFQPTSRPCARVERWVLRLQSFSF FT MVKYRSGSSNIADPLSRLVENRSSETFDAESKFMVLAVMESAAIDVQELED FT ATNTDPVLEKVKKSMRSGNWDKEEVKPFLPFKNELGFVGDLLVRGDKLVVP FT NKLKSRILDIAHEGHPGESLMKRRLCDRVWWPRMDSEAMMRVKSCEGCRLV FT GLPSRPEPMNRRPLPSGPWIDTAMDFLGPLPCGSYLLVIIDYFSRYKEVEI FT MSKITAKDTISKLDRIFTRLGYLRTITLDNAKQFVSTDLELYIKLHGIT" XX SQ Sequence 4477 BP; 1360 A; 917 C; 1063 G; 1137 T; 0 other; ttggcgacga ggtaactgga caggtaagaa tgatgttttt taatacgaac gaatgaatac 60 attgaattac atatttgagc tgtgctcttg caagctgcca gatttgaggt tatggctaca 120 atgttttttt ttgttgaagg tggcgttgat ttgtttattt ccaacaactg tcgaagacaa 180 aaaaaaaaaa aaattgtcgc tggaagcgat aggctactgg gtagctactg ctagaatagc 240 agtggattgc atttcgtcgc tggaagcgat aagctacata gtagcggctg tgacagcgga 300 ttaatcgtca ctaaatgtga taggctacaa tttagtagcg gctgtacagc ggatcataaa 360 tcggatcaca acaaaactgt ctgttaagat aaaggaaaat atgacgaggt aaataaaaat 420 taatgttatc gtcgatatcg ttttaggata caatcagaca aggttggatc atggaaaagt 480 gggaaatcca accctttaaa ttcaaggctt taccacggaa tattgtccgt aatgagtgga 540 ttaaatataa acggaacttt gagtacatta ttgcggcttc aggagaaacc gataaaactc 600 gcatcaaaaa ccttttcctc gccagagctg ggccggacct tcaagaagtg ttcgcatcca 660 ttccaggagc ggatgtggct gaagacaaag aagaaggtgt agatccgtat gcggtcgcta 720 tgcaaaaact agacgcctat ttctctccaa aacagcatga gacatttgaa cgcaacgttt 780 tctggacact caagccgaat cctgaagaga cacttgagaa atttatgttg cgttgtcagg 840 atcaagcgaa taaatgcaat tttggaagca cggcaatgga aagcaaggca ataagtgtcc 900 tcatcttatt cgccccgagt gatttgaagg aacaactgtt acatagagat tctctagaaa 960 tcgacgaggt cgctaagatt gttggatcgt atgaatccgt gaaacaccaa gctcggttga 1020 tgaattttgg gcagtctggg ctagtagaac ctgtcgacat ggccaccgcc ggaaataaca 1080 ttaacaaaat ccgaaatatc ccgatcaaag aatgcatgag gtgcggaaaa cgagatcact 1140 tcgccaacga ccaaaattgc cctgcacgag gcaaagagtg cattaaatgc aggcgaattg 1200 gtcatttcgc agcacgttgt cgtacacctg ctacaaaaag aagatacact gaggaaccca 1260 aaccagcctc tagcaaagga ttcaaacgtc agcgaataca tgaaatcgaa actgacgatg 1320 aaagaaagga tcgccgcttc atatttaata ttaacgatgg aggagagtgg ctcaagcttg 1380 gcggagtgac gctacaattg ctgatcgact ctggctgcaa gaaaaatatc attaatgaga 1440 aagcgtggga gtgtatgaag aataatggag caaagatttg gaatcaagat aagaactgca 1500 ccgaagtttt tcttccatac ggtaaagacg ctgaaccgtt gtgcgttctt ggaaagttcg 1560 acgcccttat atctgtagaa gatgatggaa gaaccatcga acagctagta tttatcaatg 1620 cttaatttac tcgagtttac ttaaaataac tgccatgagt ttgggtgtac tggtgatagg 1680 attaccgagt acacatggag taaacatgat tgggtcagcc cagaaaagac catttcccaa 1740 gattaaaggc gtccaagttg aaatcccggt ggataacagt gtcgttcccg tatgccaaca 1800 tcctcgccgg ccaccgtttg cccttttatc taaaatagaa gacaaattaa catcactgct 1860 tatcagtgat ataatcgaac ctgttgaaga aggatatcaa tgggtttcac cattggtgac 1920 tgtgataaaa gataatggcg atctcaggct gtgtgttgac atgcgtcgag cgaacgttgc 1980 aattttacgg gagcgccata ttatgcccac cattgaggat ttcctgccgc ggtttacatc 2040 cgcaaagtac ttcagtcgcg tggatattaa agaggcattc caccaagtgg aattgaaggg 2100 agagagtcgt tacattacga cgtttataac ccacatgggt ctctttcgat atataaggct 2160 gatgtatggc attgttatcg cgcctgaaat attccaacgt atcatggagc agatttcgag 2220 ccgctgtgag cataccgtca attttatcga cgacattctt gttttcggcg acacagagga 2280 agagcacgat gcggagctca aactagtatt gtctattttg aatgaacgcg gaattctctt 2340 gaaccaagac aaatgtttgt ttaaggtgac cagtctggac tttcttgggc atacggtttc 2400 gtcagatggc ataaaacctt cggatagtaa aattgaatcg cttcagcgat tccgtaggcc 2460 agcaaccact gaagaggtcc gaagtttcct cgggctggtt acgtatgtcg gtcggttcct 2520 cccaaacctg gcgacagtta cggctcccct tcgcgagttg actcgttcgg gccataagtt 2580 ttcgtggggc aaggatcaag agtcatcttt tgtgaaactt aaagacatga ttggcaacgt 2640 tcaacaactg tactttttcg acaacaagct tcgaacaagg ctagttgcag acgcatcacc 2700 agttgcgctt ggtgctgtct tgatccaatt cgacggttca actgatgttc atccgcgccc 2760 gattgcctac gccagtaaaa gtttgacagc aaccgaacgt agatattgcc agaccgagaa 2820 ggaagtgttg gccttggtaa ggtcagttga gaggtttgcg atgtacctgt atggcagaac 2880 gttcgaattg gaaacggacc ataaaccatt agaagcgatt tttcaaccaa cttctcgtcc 2940 ttgcgctaga gtggaaaggt gggtgctacg cctccaatcc ttttccttta tggtaaagta 3000 ccgaagtgga tcatcaaaca tagccgaccc attgtctcgt ctggtcgaaa atcggtcttc 3060 tgaaacattt gacgctgaaa gcaagttcat ggtgttggcg gttatggagt ctgcagcgat 3120 tgatgtccaa gagcttgaag atgcaactaa caccgatccg gtcttggaaa aagttaaaaa 3180 gagcatgcgt tccgggaact gggataaaga ggaagttaaa ccgtttttgc ctttcaaaaa 3240 tgaactggga tttgttggcg atttgttagt acgtggagac aaacttgtcg tcccgaacaa 3300 actgaagagt agaattcttg atattgctca tgaaggtcac ccgggtgagt ctctaatgaa 3360 aagacggctt tgtgatcgtg tttggtggcc aaggatggat agtgaggcta tgatgcgcgt 3420 aaaatcatgt gaaggttgtc gtttggttgg actgccgagt agacccgaac cgatgaatcg 3480 ccgtccacta cccagcggac cgtggattga caccgccatg gatttcctag gtccgttgcc 3540 ttgcggttcg tatctgttgg tgatcatcga ctattttagt cgttataaag aagtggaaat 3600 tatgtctaaa atcactgcta aggatacaat aagtaagctc gacaggattt tcactcgctt 3660 gggttacctt cgaactataa ccttggataa cgccaagcaa tttgtcagca cggatttgga 3720 actctacatc aagctccatg ggatcacttg aatcactcga caccgtattg gccccaagaa 3780 aatggtcttg ttgaaagaca aaaccgatct ctcgtaaaac gactacagat cagcgctgct 3840 ctcggtagag actggaaaca agacttgcat gattatctaa tcatgtatta cactacgcca 3900 cactcgacca ccggtaaaac accaactgaa cttatgtatg ggcacacaat tcgatcgaag 3960 ctaccagcga tcgaagacat cgaaacaatt ccacaaaatg ctgattttcg tgatagagac 4020 caagaattga aagaaaaagg gaagaaggca gaagatatcc gacgtcgagc taagcgatca 4080 tcaattgatt caggagacac tgtactcatg cagaatttat taccaggaaa caagctatct 4140 acaacctata atccaaagca atatatcgtt gtttcacggt ctggaccacg tgtgacggtg 4200 gaagaccctc aaaacggaaa gtcgtacgac aggaatgtcg cacatctcaa aaaggttgtt 4260 gagccagaac aagaactaaa cgaaacatct gacggctttc gttgtcagaa ccaggtgcag 4320 ctagactcat cctctgaaga agactttcgt ggatttgagc ccgaggaaat cacgccatca 4380 gaaacaatcg tagctagccg acagcgtaga tcaacgaaga agccgacgcg attcaacgat 4440 tacctgatgt aacatccatc atttaaacaa agggaga 4477 // ID DNA8-30_AP repbase; DNA; INV; 710 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-30_AP. XX NM DNA8-30_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-710 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1772-1772 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 710 BP; 256 A; 105 C; 112 G; 234 T; 3 other; caggggcgtc atttgggggg gggggggggg grtgcagggt gtacatttgc accaactttt 60 ttttaaaatg ccttctttgg cctgccaaca ataaatataa gcgccgatat acctaagtgt 120 tttatattat atattatatt ttatattttt tcagtattaa aaaaaaatgt attgtactaa 180 tgtttaattt ccaagaatgt ttgatgtgat tttgtggtcc cacaatatta taattgataa 240 aattataaca caaacattgt gaagaaggat accaataaaa taaatttctg gttttattaa 300 gctagctata ttatatattt tgagtatgca gggagaaaaa atgtgtaggt taggttcggt 360 taggtatttt attataaagg tgatttatac ccatacattt atttaccaag tgtaaatacm 420 actgtaaaat accactgtac actggtatcc acaatataaa ataaatacst acattaaaag 480 tacctacaca tagttcgtca aaaatagtaa caatattgtg atattttcat actaatataa 540 gtgacatatt aaataaatta tacaaatatt ctacagtttt tgtccccact ttaacccccc 600 ccccccaaaa aaaaaaaaaa aaataaaatt ttctatgtgg cctggtaatg atctgaatgg 660 cctgcaaaca ttttggcacc aacaaaaaaa aagttgaaat gacgcccctg 710 // ID L1-5_CQ repbase; DNA; INV; 4815 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4815 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 135-135 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 138..911 FT /product="L1-5_CQ_1p" FT /translation="MGRQNTIRIDYGDLPNKPGMPKVQKFCADKLGLKRGE FT VIRIQNSQTLGVTFVTVADLTLALKVCEEHGKAHELTGSDKKQHQVTITIE FT DGTVLVKLFDLSADVSNADVAKFLERYGEVRDVYEEQLGDDQEFAGAYTGI FT RVAKMVVRENIASWITINGEATKCEYYAQKATCKHCHDYLHVGVGCVQNKK FT LLVQKTFADAVKQPAKPQQPAMPQQPXNPQQPLNPQQPADXQQPLNPQQPS FT TRATAETKRREVEAKRR" FT CDS 1130..4372 FT /product="L1-5_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="EAKIQRRQRRRRRGRRGRGIIIFDFSSYNLATININN FT ITNATKVDALRSFIRTLELDIVFLQEVENERLAIPGFNVICNVDHARRGTA FT IALKEHIEYTHVEKSLDGRLIALRVQNTTLCNVYAPSGTALRAERERFFNG FT TLAYYLRHNTDHIVLAGDFNCVLRPCDSTSSNPSAALQTAVQQIRLHDVWL FT KLHPNTHAPTYVVHNAESRLDRVYVSAGLCDQLRTAATHVCSFTDHKALTV FT RICLPHLGGERGRGFWSLRPHLLTQENKEEFAVRWQYWTRQRNRYRTALDW FT WISFAKPKIKSFFKWKSNEVYRVHHARHQQLYAELREAYDAYYQNPAMLTT FT IHRVKAQMLTLQREFSQNFARVNETRVAGESLSTFQLGERRRKKTTITRLK FT NARDEVLQTQPEIEQHVLQYFTELYAADPARAPDIDAGLGCERIIPVDDDA FT NASLVEEITLEEIKLAIITSPKRKSPGGDGLTAEFLNWAFDVIYRELHAVM FT NELLVAPVPAEFSEGTIVLVKKSGRSAENISSYRPISLCNTDSRVFSRIMR FT QRLTRVLQAHNVLTDSQKCGERRSIFQATLALKDRIASLLARKVKGKLISF FT DLDHAFDRVSLPFLYRTMLSLGINPDFVSLLERHFSRAGSRILLNGFLSPR FT FDIKKSVRQGDPLSMVLFSLYMHPLLLALERICVGDLVVAYADDVSVVVTA FT TETIEAINQLFGRFEIAAGARLNRQKTVAIDVGHVEGRPLVVPWLRTVEKV FT KVLGIFFANSIRLMIKLNWDAQVTKVAQLIWMHSMRSLTLQQKVILLNTFI FT TSKVWYLASTLPPYQVHVAKLTATMGTYLFRGLPARIPMQQLARRREDGGL FT KLHLPAIKCKALCINRHLQEIDSLPYYKSFLDRAIPPPTSLPCLKLLCEQT FT PLLPAHIQNFPSSDRIHQHFLDGTERPRVETKQPNLNWRRLWKNISSRRLT FT SGERSALYLLVNEKTEHRRLMHVIQRADGPNCQHCAANEETLTHKFSECPR FT VAAAWAHTQRRLTAIDGGRQQFSFGELFRPVLNGIADGRRARXLKTFISYI FT VFVDXCATTAIDVAALDFELSLVG" XX SQ Sequence 4815 BP; 1223 A; 1426 C; 1264 G; 887 T; 15 other; cactcagcgt cgcacctctg ccgcgatcgg acgctaaaag ttttttctcc gccgggcaga 60 gttcgaccga aacagtgttt tttttctcta ttgtgtgctt ccaatcacgc gtcgcgaact 120 cgtcgcgagt gtgaaagatg ggaagacaaa atacgatccg catcgactat ggtgacctac 180 cgaacaaacc gggaatgccg aaggtgcaga aattctgcgc cgataagctc ggcctcaagc 240 gcggtgaagt gatccgcatc cagaacagcc aaacccttgg tgttactttc gtcacggtcg 300 ctgacctcac ccttgccctt aaggtgtgcg aggaacacgg caaagcccat gagctgaccg 360 gaagtgacaa gaagcaacac caagtgacca tcaccatcga ggacggaaca gtgttggtga 420 aactgttcga cctttcggcg gacgtgtcga acgccgacgt cgccaagttt ctcgagcgtt 480 acggcgaagt tcgtgatgtt tacgaggagc aactcggcga cgaccaggaa tttgccgggg 540 cctacaccgg tatccgcgta gccaagatgg tggttcggga gaacattgca tcctggatca 600 cgatcaacgg sgaggccaca aagtgcgaat attatgctca gaaggccacc tgtaaacact 660 gccatgacta cctccacgtc ggcgtgggtt gtgtgcaaaa caaaaaactg cttgtgcaaa 720 aaacttttgc cgacgccgtc aagcaaccgg cgaagccgca acaaccggcg atgccgcagc 780 aaccgckcaa cccgcagcaa ccgctcaacc cgcagcaacc ggccgaccmg cagcaaccgc 840 tcaacccgca gcaaccgtca acccgagcaa ccgccgaaac caaacggcgg gaagttgaag 900 ccaagcggcg gtaagcccaa gccgatccgg acaaccaccc ccmaaaaaaa ccaaccgacg 960 agggaaacgg ctcatctttt ttgatgcccc caccccagga cgtgcagcca tcgggcagtg 1020 ggcttacccc cgctggaggt gcaggattta aaaaatctgg ccatggcaaa tccgatggca 1080 atgacaccga ctcgtcgacc agcagcagaa aacttcgccc aaaaggtaag aagccaaaat 1140 acaacgaaga caacgacgac gacgacgagg aagacgagga agaggaatta taattttcga 1200 tttctcgtct tacaaccttg caactataaa tatcaacaac atcaccaacg ccacaaaggt 1260 cgacgccctg cgaagcttca tccgcacgct agaactagac atcgtttttc tgcaagaagt 1320 tgaaaacgag cgactcgcga tccccggttt caacgttatc tgcaacgtgg accacgccag 1380 gcgcggaacc gcgatcgctc taaaagagca catcgaatac acccacgtcg agaagagttt 1440 ggacggccgc ctcattgcgc tgcgtgtcca aaacacaacg ctctgcaatg tgtacgctcc 1500 gtcgggtacc gcactccgtg ccgagcgaga gcgatttttc aatggtacgc tcgcgtacta 1560 ccttcggcac aacaccgatc acatcgtcct cgccggcgat ttcaactgcg tgctacgacc 1620 gtgcgattcc acgagctcga acccgagcgc cgcgctacag acggccgtcc agcaaatacg 1680 actgcacgac gtgtggctga agctgcaccc caatacacac gcacccacgt acgtcgtaca 1740 caacgccgaa tccagactcg accgcgtgta tgtgagcgcg ggactgtgcg atcagctgag 1800 aactgccgcg acacatgtgt gttcattcac ggaccacaaa gcgctgaccg tgcgaatctg 1860 cctcccccac ctcggcggcg aacgcggccg tgggttttgg agtttgcgcc cccacctcct 1920 gacccaggaa aataaagaag agttcgccgt acggtggcag tactggacgc gccagcgaaa 1980 ccggtaccga actgcgctgg actggtggat ttcgttcgca aaaccaaaaa taaaatcctt 2040 tttcaagtgg aagtcgaacg aagtctaccg tgtccaccac gcgcggcacc agcagctgta 2100 cgcggaatta cgtgaggcgt acgacgccta ctaccagaat cccgccatgc tcaccaccat 2160 ccaccgcgtg aaggcgcaga tgctgacgtt gcagcgcgag ttctcacaga attttgcgcg 2220 cgtcaacgaa acccgcgtgg ccggcgagag cctgtccacg ttccagctgg gggaaagacg 2280 aaggaaaaag acaacgatca cgcgccttaa aaatgcgcgc gacgaagttc tgcaaacaca 2340 gcccgaaatt gagcagcacg tgctccagta cttcaccgag ctgtacgccg cagaccctgc 2400 gagagcacca gacatcgacg ccggcctcgg ttgcgagagg attatccccg ttgacgacga 2460 cgccaacgcg agtctcgtgg aagagatcac gctcgaggag atcaaattag cmataattac 2520 cagccccaaa cgaaaatcac cagggggcga tggtctaacg gccgagtttt tgaactgggc 2580 gttcgacgtc atctaccgcg aactgcacgc ggtgatgaat gaactgctgg tggcgcccgt 2640 accagcagaa ttctccgaag ggaccatcgt gctcgtgaaa aagagcggcc ggagcgcaga 2700 gaacatctcc tcgtatcgtc ccatcagtct gtgtaataca gacagccgag tcttcagccg 2760 tatcatgcgg caacgtttga ccagggtgct acaggcgcac aacgttctca ccgactcgca 2820 gaagtgcgga gagcgccgga gcatcttcca ggccactctc gcgctgaagg accgaatcgc 2880 cagtctgctg gcccgaaagg taaagggtaa gttgatctct tttgacctcg accacgcttt 2940 cgatcgagta tcactgccct tcctttaccg cacgatgctc tctctcggga tcaaccccga 3000 cttcgtgtct ctgctggaga gacatttctc ccgagctgga tcccgcattc ttcttaatgg 3060 gtttttgtcc ccacggttcg atataaaaaa atctgtgcgc cagggtgatc ccctgtcgat 3120 ggtgttgttc tccctataca tgcatcccct gctgctggcg ctagagcgta tctgtgttgg 3180 cgacctcgtc gttgcgtacg cagacgacgt atcggtggta gtcacagcga cagagacgat 3240 cgaggcaatc aatcaactat ttggtcgatt tgaaattgcc gcgggcgcgc gcctgaaccg 3300 gcaaaaaacc gttgcgatcg acgtggggca cgtcgaaggg agaccgctgg tcgttccctg 3360 gctgcgaacg gtcgagaagg tgaaagtgct tggcattttc ttcgccaact cgatccgact 3420 aatgatcaag ctgaactggg atgcgcaggt gacgaaggtg gcgcaattaa tctggatgca 3480 ctcgatgcgc agcctgacgc tccagcagaa ggtgattctc ctgaacacct tcatcacgtc 3540 caaggtgtgg tacttggcgt caactcttcc accgtaccag gtgcacgtgg cgaagctgac 3600 agcgacgatg gggacgtacc tgtttcgggg cctcccggca cgcataccga tgcagcagct 3660 ggcgcgacgc cgtgaagacg gcgggctgaa gctgcactta ccggcgataa aatgcaaagc 3720 actttgcatc aatcgccacc tccaagagat cgactccctt ccctactaca aatccttcct 3780 cgaccgagct attcctcccc cgacaagcct tccctgcctg aaactccttt gcgagcaaac 3840 gcctttgctc cccgcccaca tccaaaactt cccctccagc gatcgaatcc accagcactt 3900 ccttgatgga accgagcggc ccagggtcga aacgaagcag ccgaacctca actggaggcg 3960 tctgtggaaa aacatctcgt cccggcgact aacttccggc gaacgcagcg cgctgtattt 4020 gctggtgaac gagaagacgg agcaccgacg gctgatgcac gtgatccagc gtgcggacgg 4080 gccgaactgc cagcactgcg cggcgaacga ggaaacgctg actcacaagt tcagcgaatg 4140 cccccgcgtg gcagcggcct gggcgcacac acagcgacgg ctgacggcga tcgacggagg 4200 aaggcagcag ttttccttcg gcgagctstt caggccagtt ctgaatggaa twgcggatgg 4260 aagaagagcm agawtgctga aaacattcat cagctacatc gtctttgttg acaawtgtgc 4320 aacaactgca attgatgtag ctgctctgga tttcgagcta agcctggtcg gttgaaatgg 4380 tgtaaccact gtcgcgcccg gaggtcccgg ggcccccctg accacaagct acacgaaaag 4440 tggaacaggg aacaccctcc ggctcctccg gggcgcctgg taccgacacc ccaacgaaga 4500 atccacggag gcgctcggga gacccggggc ccccctaacc acaagtcttc ggaggagtgg 4560 aatagggaca accccccggg ctcctgagac ccctggacga ggtcgcacaa ggggtaaacc 4620 cccatcccct cggatgaacg caaggattas gccgagggac aacgatcgac cggawccacc 4680 gamgaatgac aacacaacag taagaktgta atctccwagc gtttaggatt atcaaatata 4740 ggttatcaaa ttcgctgttt aattttaagc aatcacaaat aaactatatt ttattaaaaa 4800 aaaaaaaaaa aaaaa 4815 // ID Polinton-2_TC repbase; DNA; INV; 16981 BP. XX AC . XX DT 15-MAR-2006 (Rel. 11.02, Created) DT 15-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Maverick; Tlr; Polinton-2_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-16981 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This transposon is characterized by 225-bp terminal inverted CC repeats and 6-bp target site duplications. The consensus sequence CC was built based on multiple alignment of several copies that are CC >90% identical to each other. It encodes a family B DNA CC polymerase (POLB-2_TC), retroviral integrase (INT-2_TC), ATPase CC (ATP-2_TC), cysteine protease (PRO-2_TC) and one unclassified CC protein (PY-2_TC) conserved in Polintons from different species. XX FH Key Location/Qualifiers FT CDS 1834..2235 FT /product="PRO-2_TCp" FT /translation="MVSEKKLPVKLPRHALTNIEIIKFARLLQIPNFRGVY FT MRNKLPKIIHKNETGIINLDEEKNSGTHWTAYVKNKNNILYFDSMGNLKPP FT TEIINYFKSDNDGNNITYNYDGYQKMGSYNCGQLCLKFLYRHTCKL" FT CDS 3082..4287 FT /product="PY-2_TCp" FT /translation="MFDIEKGIEWDEGVVGYKFQSHEAYTARYENTDEIRI FT PLQEDLCTLPCDSLILIEGQLVKTDATTNKTVPATETKFVNNGVAFLFSEI FT RYEVSGVTVDTNTKPGITTTMKNLASLNQSESLKLTMSGWDLNEEATKPID FT GYFQACIPLNRLLGFAEDYKNIMLNIPQELILIRSNSDVNALICATNEKAR FT VVIDKIAWLVPHIVPGLKEEVKLTKTIEKDKEIAVPYRSWELHSLPLFPNT FT TKCSWPVKISTKFETPRYIIFGLQSNREGQLDKNMSKFDRGDITNMRVFLN FT NERYPYENLNISYKKHHFGILYEMFTNFRSRYYYTDKKETHITPTQFFDSY FT PLIIIDCSMQKTGLQTQSIALRIEFDTDTGISEGTTAYALVISDRAFTYTP FT LTKSVKQV" FT CDS 8033..11146 FT /product="POLB-2_TCp" FT /translation="MKRAIVNVQNQDIHCFGWSVVAAVCLPMGPEHLPSSY FT PHWSTLFNFQGIEFPVKLDSIAKFESQNNVSINVYGLELLYVNNKVKFEIV FT GPLYFTNFKKPVHINLLLLSDNDGNQHYCTITNLPRLVSSQLSKHHHSKFF FT CDGCLQYFTTQALLDRHSSTDCGKLYARIPNNELITNRFGYNVPANVLEFQ FT NYHHKMTIPFIVYADFECLLKPMSTVAPNPNQSFSIKTFLHQPYSCAYYIK FT CSYNDQLSKFQSFRGQSSVVDFITALEIDLIEIYKKYLSIVQPMLPLTIQE FT EFDYLTATECHICQKPFQQNDIKVKDHCHLTGRYRSAAHSNCNLNFQIPNF FT IPIVFHNLTNYDSHLFIKELALEESEINILGKTKEKYITFSKKICVGEYLN FT SNNRIVKEHLHLRFIDSFQFLACSLEKLAAALDDAQCMEVKRYFPVDREFQ FT LIRQKGVFPYSFIDDFSKLDLTELPQKIDFYDKLRDEHISEDDYQRANTVW FT DTFHCQTLGEYSDLYLKSDILLLADVFENFRNMCLKHYELDPAHYVTAPSL FT SWNAMLKFTHVKLELLTDIDMLHFFKKGIRGGVSTCVKRAAQANNKFLPNY FT DPSKPTSYILYLDATNLYGWAMSEVLPLGGFTWLTQDEINSLSIMDLTDTT FT PEGYVLEVDISYPHHLHNSHNDMPFLPENLIPPNGKCAKLIPNLCAKTNYI FT IHYRNLKQALQNGLELTKIHRVLKFNQSSWLKPYIDLNTKLRNQTKNKFEK FT DLFKLMNNSVYGKTMENVDNRVDIKLATHWKKNGHKYGAETWIAKPQFKNC FT SIFTENLVAIEMNKVQVTYDKPIYVGFSILEISKTLMYDFFYNFLKVKYGN FT NVQLLYTDTDSLILEIYTENVYNDIKQNIDRFDTSNYPENNIHDIPKTVSI FT IGKMKDEYAGVPIVLLYGTGAKAYCVQTVNDLIKKAKGVSKHVIDKSLTLL FT QYRAIATNPSSSSVFCVMNVFKSYLHNMYTELRNKIALSNFDDKRYILGNS FT IDTLAWGHYDIDRDRNLDSLIRELQKYVEPTD" FT CDS 11237..11968 FT /product="ATP-2_TCp" FT /translation="MCKVFLFVFCKSTMKLVQQKDLLKISNHDYSVCVPYQ FT HFPKHNTLFNNSVKRGLVVGRSGCGKTNVILSLLLHPNRLRFTDIYLFSKT FT LQQPKYQFLKSVLAPLDKVGYYEYEDGAEIPPPKEIKPYSIIIFDDVVTCN FT QNIIRDYFCFGRHYNIDCIFISQTYSSIQKQLIRDNANFIIIFPQDVLNLK FT HIFNDHVGTDMSFETFQKMCSQCWQKPYGFVTIDKDAEQDNGRYRNGFDQF FT IKI" FT CDS 13634..14575 FT /product="INT-2_TCp" FT /translation="MKRSDLNIKRQVVNELHAPARVNFKRRRVIVKGLNDL FT LQLDLVEMIPYSRVNKGFKYILMVINVFSKFVWAVPLKNKSALSVVNAMSE FT ILKTRRNVPKNIQTDLGKEFYNKHFQQLMNKYKINHYSSYSNLKSSVVERV FT NRTIKGMMYKEFSVQGHYNWINILPDIIKKYNNTRHRTTGLKPISINKKNE FT RKVLDQVYSHLKTMDIYKPKFKVGDFVRISKHRHLFKKGYTPNWSNEVFTV FT SKIQNTNPRTYLLQDVSGETIRGGFYTEELQKVKYPNVYLVEKVLRKKGNR FT LFVKWSGIDKTHNSWINSSDIT" XX SQ Sequence 16981 BP; 6035 A; 2811 C; 2839 G; 5293 T; 3 other; agtagataac gaatgggggg acaacacaac acaaccttga ccttgagttt gaccccgccc 60 attttccctc tctttctttc aaggccactc gttctctctc ttactttttc aaggccaacc 120 cctcccactt cccccaccac ttcaaggcca aggcattatt ggaccttgat gttttcaagg 180 tcaaggtcat tcatgttact ccccactttt gcaaggccaa caacttccct ctttacttat 240 cattcaaggt cactgcagct aatatatata gaattcaata caaattcaaa aacagtatct 300 aataagtatt tctcaatatt aaagtagtaa aaaaaatgtg gtatcgtttt acttgtgaag 360 gtgtcataat agtggaagaa acgtgtttcc acaaccagtt aggacacata tctttggaag 420 gtggttggga acgtttcttt aatagagtgg ttgtgtgtga acagtgcagc actaaggaca 480 tttctattat taatccagat gggtggaata attctttaag tattgacatt acgtgtaatg 540 aagaagaatt taaagagtta cttagaaaat atggtttcct ttgtcataag tgtaaaactc 600 cgtgttacac agctagccaa gtggatccgg aaaatgccca aaaaattata gctatatacc 660 actaacttaa atcagcataa tggaagttga caatggtatg taatatgaga ttgtatgtac 720 ttgtagaaat taagttttaa atgttttcag atccttactg ctacatgctc ttttgcctgt 780 ctccaaagga ctgtagtaaa agaaaattct tttgtttaaa atgtgtgaag attatggcag 840 tgagatttca catttgtcat ctacatgtga cttcagtgtt aaactacaag tgtttttggt 900 gtaatacagt aataggacag aattctcagt tgtgtggaaa taatttaata aacaaaatga 960 ctgttaaaat atggtatttt agtagagagc gacaaatgat gttttatgat tctgatgact 1020 atgattctga cgactcatca agtagctgtg atagtggtat agaatgttag ttagttaata 1080 agagtagaaa ttaaatataa acaacttgta accttagatt aaattagtat ctagattgtt 1140 cttttgctgt atggataaaa gtaaataaat gaagttatat aataaacaga cttgtgattc 1200 tcaacatagt attgattcaa ctctgacaaa gagaggttgg ggtctaataa ataaattaat 1260 aaataaatta cccattgaat tacatatacc gggttatcag ttttgtggtc ctggaaccaa 1320 attacaagaa cgaattgcac ttggacatag tggtagaaat aaattagatc aggcttgtaa 1380 agaacacgat atatcgtacg ccaacgaaac taatttgcaa aaacgacacg aagcagataa 1440 gttattatta agtaaagcag ttgaacgttt aaaagcaaaa gacgcatcgt ttggggaaaa 1500 agcagctgca ttagcaattg ctggtataat gaaaggcaaa gtaaaattcg gaatgggagc 1560 taaaagtaaa aagggtaaaa aaaagaagac aaaaaaacaa cgagttctag ttgctcctaa 1620 atatggtggt tttttaccat tcctattacc actattaggt gctctaggcg ctttaggagg 1680 aggagctgca gggatagcga ctgctgtaaa taaagcttcg gtagataaaa aaatgttgga 1740 ggaaactatg cggcataata aagcaatgga aatgagtagc ggtaaaggtg taggaaaaac 1800 taaagggaaa ggtctttaca ttggtccata taaatggtat cagaaaaaaa actcccagta 1860 aaattaccac gtcacgccct cactaatatt gaaataataa agtttgctcg actattacaa 1920 attccaaatt ttagaggtgt atatatgcgt aataaattac cgaaaattat acataaaaat 1980 gaaacaggaa ttataaattt agatgaagaa aaaaatagtg gaactcattg gacagcatat 2040 gtaaaaaata aaaataatat attatacttt gatagtatgg gcaatttaaa acctccaaca 2100 gagattatca attatttcaa aagtgataat gatggtaata acattacata caattatgat 2160 ggatatcaga aaatgggttc atataattgt ggacaattat gtttaaagtt tttatataga 2220 cacacatgca aattataagt ttagtcgaat caaagatgct gtttgtatta acaggaaaat 2280 cagcgatact aagtgctgac tttaatccac cgattgatgt tagtgatggt gtctatgaac 2340 tgggtgttac aaattttgaa gtctacaaca gtataccgaa tattgatgaa gaaaataata 2400 aattcttttt cggggatgtc gaatttaaaa taccgaccgg ttgttatcaa ttaaccgata 2460 taaacaacta tttacaacat gtaattgaaa aacaattcag taatgacctc ttatctatta 2520 ctgcaaacaa taatacatta catacacata tcaaagcaac aaaagatgtt gactttacca 2580 aaccaaatac tatcggacca gtgttgggat ttaatagtca aatagttcca aaaaatattg 2640 gaaaagattc ggataatatt gcagacataa tgaaacttaa ttcgattatg attgaatgta 2700 atattacaat tggaagtttt aaaaatggag aacctgtaca cataatttat caattttttc 2760 caaacgttcc acctggattt aaaatagtac agtcaccaga tcatgtgatt tacctaccca 2820 ttagtgttaa aactatcaga aatattacac ttaaaataat agatcaagat gaaaagttgg 2880 tcaattttca acaagagact gtaacagttg gattacattt gcagaagaaa gaagaaaatg 2940 ggtattagtt ttagaactga ctcatataaa aagcaacaaa agtgtagaaa aacagttagt 3000 acgaaacgaa acgtgcaaca attaagtaat aaaaataaac tgtttctaaa atcattagga 3060 tttgaattat acacataaaa aatgttcgat attgagaagg gtattgagtg ggatgaaggt 3120 gtggtaggtt acaaatttca aagtcatgag gcatatacag ccagatatga aaacacggat 3180 gaaattcgaa ttcctttaca agaagatttg tgtacactac catgtgatag tttaatttta 3240 attgagggac aattggttaa aaccgacgct acaaccaata aaactgtacc agcaacagaa 3300 accaagtttg taaataatgg tgtcgctttt ttgtttagtg aaatacggta tgaagtaagt 3360 ggagtaactg tagatacaaa cacaaagcca ggtattacaa caactatgaa aaatctggca 3420 tcattaaatc aaagtgaaag tctgaaatta acaatgtctg gttgggatct aaatgaagaa 3480 gccactaaac ctatagatgg atattttcaa gcatgtatcc cattaaatcg actattagga 3540 tttgcagaag attataaaaa tataatgcta aacataccgc aagagttaat attaattcgt 3600 agtaatagtg atgtcaatgc attaatttgt gcaacaaatg agaaagcacg agttgtaatt 3660 gataaaattg catggctagt tccacatatt gtgcccggtt taaaggaaga ggttaaattg 3720 acaaaaacca ttgaaaaaga taaagaaatt gcagtaccgt acagaagctg ggaacttcat 3780 tcgctgccac tctttccaaa tactacaaag tgtagttggc ctgtgaaaat ctctacaaaa 3840 tttgaaacac ctcgttatat tatctttggc cttcaaagca atagagaagg acaacttgat 3900 aaaaatatga gtaaatttga tagaggcgat ataaccaata tgagagtatt tctaaataat 3960 gaacgatatc cgtatgaaaa tttaaatatt tcatataaaa aacaccattt cggtatatta 4020 tatgaaatgt ttacaaactt tagatcccga tactattata cagataaaaa agaaacacac 4080 attacaccta cacagttttt cgacagttat ccactaatca taattgattg ttcgatgcaa 4140 aaaactggtt tacaaacaca gtctattgca ctgagaattg aatttgatac agacaccggt 4200 ataagtgagg gaaccaccgc atatgcatta gtcattagtg accgagcctt tacatataca 4260 ccgttgacaa aatctgtaaa acaagtctaa attaatatta atagtaatgg agaacgaagt 4320 gattatagac gttcaaggtt acaaaattgg aaaaaagttt atagttaaag aactggctgc 4380 tttaagagga gacaaactcg cacactatgt ttttcaaccc cctttccctt cacaactttt 4440 gaatcctcaa gatgaaaaac aagtaaaatg gttaatgaag aattatcatt gtattgactg 4500 gaatatggga catattccat attggaaaca cgtacaagtt ataaacaatg tgttgaagga 4560 tgttgacaaa atttacgtaa aaggaagaga aaaagccgaa tttttgaaaa aatatacaca 4620 taaacaagtt attgaatttc cacaacaacc aacattacac gccgataaag ttaaatgcat 4680 gtaccatctg aacgataacg catattgcag tctacataat gtattttttt taaaacaaac 4740 atttcgttaa aagtgtacaa aacattgtgt taataatctc aactcatcag tctattggta 4800 atcatggatc aagagagttg gataacgtgt cgtaataatc caggacatcg cctgaaagta 4860 aaacatcttc gacgacattt acgaagatgt cttccacaac gtgaatattc ttggaggagt 4920 tttgattgcg gaaatgggaa ttattttgat tcatctttat cccataatac taataatgat 4980 tatcaacaat catcgagaag caactttgaa aatacaaact cttcaaaaaa tattttttgt 5040 aatgaaatgg aatattaatt gaattaataa ttaattgtag tatttataaa taaatgttat 5100 aaatttataa acatttgtat ttttttcata atatttacca ctcccttaca cattcaataa 5160 tccttataca ccacccaaaa aaaaattcat ttcatttata aaaaagagtt gattattgca 5220 gtggatataa gtgatttttg tcatttttca ataaattaaa caggttaacg atgtccttga 5280 aacaggcggt tacactccat cctgtctaca tagggcttct ttttacattt ttttgtatca 5340 atatgttaac cttgatggaa tacaccaatg tagttgcatt aagtgcatac aaagaagttg 5400 accttgacgg gagctttcaa caatttataa aattgcaggg agtgcatgca gaaattcatt 5460 ctgtgttgta caggtagtct gtgtacgaaa taatgtgttc aaattatttt gctaatttta 5520 aaccgtgtac tcatggtgac gagtgttctc agtgtcccgg tccctcaact aggacaactg 5580 taagtaacca gtcatcaaag ttattattta aagtattgaa gggaaaagtt gcaaatccct 5640 atcaaaaccg tagtggtttt aatagggata aacctagaat caatatcgtt tctgatgtaa 5700 ttattagacc tgctaattca aaattaatca aaaatgatga gacacaattt gaacaatttg 5760 tcaagtcgtc tttaagtcaa tatttgaggg aagattgtct tgaagaggat gtgattgaat 5820 gtagtcagtt aagtgtgaaa gagaagcaat tagctaatca cgctgttgct aatattgttc 5880 aaatcgttga tgaggatgat gatgattcta tggttactcc ttcatcacaa cctgatagtg 5940 aattatcgtt ttctttacca ttcacgcaat acctggctca acatggtatt gaaaatgatt 6000 attcggacga catctcgttg accttaatag accagtttat taacgaagac gaggcggcgt 6060 ccaatccggt tgtggatgaa ggaacttgtc ccatcaataa ttcggttacc gatgagttaa 6120 atgtatttga taatgatatg gtggtggtgg tggaagaacc cccaatctct cccagtccac 6180 cactaccacc acaaagcccc ttcgttgagc cagtgtttac ccgccccgtt acaccaacat 6240 ctcctgaagc ggcacccgcg gtattaactc aaccccctat gataatatcg caggacatca 6300 ttgacgcttt atcggagcct tgggttgagg tggaaaacat tatcaatggt aacgttgaac 6360 atccccctca acccaattca ccgattattg aggaaaatga tgttctgcca gagagattgg 6420 tggtggtaga ggccgaaaat aacatcattc ggtcaccaca accatcacca ccaccgtcac 6480 ccataatatt atcacagggc gtcaatgaca tgcaacaacc atctggatcc gaggttgatg 6540 tggaacttaa cattatcaca ccaccactcc ctcatcctcc tgcaacgttc gatccgatga 6600 gtattattac tcaaaatgtc attgacgcac tggcagagcc gtgggttgaa gagaggaggc 6660 agggtgaaga gatggatgtt gaaaatgaca tcattcaacc ctctcaattg tcacccattc 6720 tagggcaaat aaataatagg attgagaata gggtggaggg tggagacgtt gatgtcgaaa 6780 acgaagacaa tgacgacgac gacgacatca ttccacccac tccccctcac tcacgatcaa 6840 ggaggcgtcg gcgccctaga aagcctcgta gacgttcgcg aaagtcgcca ttaatattaa 6900 ttattaataa ttataatatt aatattaata ttaatgatgt taatattaat aattaatatt 6960 aataaatcct atttccatcc cacactcata tcaagtacat ttaattcgga cgcgattttc 7020 caaattttaa catcatacta cctacaacac gatggcaagc atgctgctcc tcctcgaggt 7080 cttgctggat acctatctct gaacacaaaa aaatatatat ttatatagta taaataaaag 7140 tcattcaaag atagtaacag aaggcagcaa atttataatt aaactttgtt agtaacctgt 7200 ttaatttatt gatttatatt ttcataagaa aatgcatcac tgtgaaaact gtaataaaca 7260 atttacaacc cgtcaatcac gactacgcca cgaaaagctc tattgtcaag tggtcaaaaa 7320 acaggaaaat ggaggaccgg ccgccaaaaa acagaaaata tgtcattcaa ctggtaagtt 7380 ttaactaata ggtactctat ttttcaatta tttttctttt ctagctagca acaattatat 7440 gtgcgttaca tgtaatcaag aaattgctcc aaagcttatt acggcccatg agagaagtag 7500 acagcataga aataattgtc aaagtactcc cttggaggat ggggtgtttg cactcagcag 7560 tgcatttaaa tcacgcatcg cttcatatag gatttgtgat gagggccatt atatcgattt 7620 aaatgaattt atggataaaa ttcacaataa attacttaga cttattgttt cacaaattga 7680 aaaatttagt acaatcaaaa taaatttgga attatttgga ttgtatacca ttgagtcaaa 7740 aaatatattt gatgtaaaat ctttcaatac tactaacaga attgtcactt taggcacgaa 7800 tattaggaat atgttgcatg actttcaaga agttattagt caaaaaatga tggatttttt 7860 ggaacgagat tcaggtagtt atatattgta aaaatcttta taatacaatt attttttttg 7920 cttctttgtt ttaggctgga cacttgtgaa aatcttacat ttagaattaa atgtaaacta 7980 ttacaatcca ttgaaagctt catcgtatat cccactaccg ccaagcgtga aaatgaagag 8040 ggcaatagtt aatgttcaaa atcaggatat tcattgtttt ggttggagtg tcgtagcagc 8100 agtgtgtcta cctatgggac cagaacattt gcccagttca tatccacatt ggtcgacact 8160 ttttaatttc caaggtattg aatttcctgt aaaactggat agtattgcaa aatttgaaag 8220 tcaaaataat gtctctatta atgtttacgg acttgaatta ctttatgtaa acaataaggt 8280 aaaatttgaa attgtgggac cattatattt tacaaacttt aagaaaccgg tacacataaa 8340 tttattattg ttgagtgata acgatggtaa tcaacattac tgtacaataa ctaatttgcc 8400 acgtttagta tcttcccaac tctcaaaaca tcatcattca aaatttttct gtgatggatg 8460 tctacaatat ttcacaaccc aagctttatt agatagacat tcttcgactg attgtggaaa 8520 actttatgct agaatcccaa ataatgaatt aataacaaat agatttggtt ataatgttcc 8580 cgctaatgta ttagaatttc aaaattatca tcataaaatg actatcccat ttattgttta 8640 tgctgatttc gaatgtttat tgaaacctat gagtacagtt gcaccaaatc caaatcaatc 8700 attttcgata aaaacttttc tccatcaacc atattcatgt gcatattaca ttaaatgttc 8760 atataacgat caattatcta aattccaaag ttttcgagga cagtcatctg tggttgactt 8820 tattacagca ttagaaatag atttgataga gatttataaa aagtatttaa gtattgtaca 8880 accaatgctt ccattaacta ttcaagaaga gtttgattat cttacagcaa cggaatgtca 8940 tatttgtcaa aaaccttttc aacagaatga tattaaagta aaagatcact gtcatttaac 9000 gggtcgttat aggtccgccg cgcacagtaa ttgtaacttg aattttcaaa ttccaaattt 9060 tattccgata gtatttcata atttaacaaa ttatgatagt cacttattca ttaaagagtt 9120 ggcccttgaa gaaagtgaaa ttaatattct tggtaaaact aaagagaaat atattacatt 9180 ttcgaaaaaa atttgtgtag gcgaatatct aaattcaaac aatcgaatag tcaaggagca 9240 cttacacttg cgttttatag atagttttca gtttttggca tgttcattag aaaaactggc 9300 agcagccttg gatgatgctc aatgtatgga agtaaaacga tactttcccg ttgacagaga 9360 atttcagtta attagacaga aaggggtttt tccatattca tttatcgatg atttttctaa 9420 attagatctc acggaattac cacaaaaaat tgatttttac gataaacttc gcgatgagca 9480 tatttctgaa gatgattatc aacgagctaa taccgtatgg gatacattcc attgtcagac 9540 tctaggcgaa tattctgatc tctatttaaa aagtgatatt ttactattag ctgatgtctt 9600 tgagaatttt cgaaacatgt gccttaagca ttatgaactt gatccagcac attacgtgac 9660 agcaccatct cttagttgga atgctatgct taaatttacc catgtaaaat tagaattatt 9720 gactgatata gatatgttgc attttttcaa aaaagggatt cgtgggggag ttagtacatg 9780 tgttaaacga gcggctcaag caaataataa atttcttcca aattatgacc cttcgaaacc 9840 gacatcatac attttatatc tcgatgcaac aaatctctat ggttgggcta tgagtgaagt 9900 actcccttta ggtggattta cttggttaac gcaagacgaa attaattcac ttagtattat 9960 ggatttgact gataccacac ctgaaggtta tgtccttgaa gtagatatct cttatcctca 10020 tcatttacac aactctcata atgatatgcc atttctacct gaaaatttaa ttccaccaaa 10080 cggaaaatgt gctaaactca ttccaaatct atgtgctaaa acaaactaca taatacacta 10140 tcgaaattta aaacaagccc tacaaaatgg actggaattg acaaaaattc atagagtact 10200 aaaattcaat caatcgtcat ggctcaaacc atatattgat ttaaatacga aattacgaaa 10260 tcagactaaa aataagtttg aaaaagattt atttaaactc atgaataata gtgtgtatgg 10320 aaaaaccatg gaaaatgtcg ataatcgggt agatattaaa ctggctacac attggaaaaa 10380 aaacggtcac aagtatggag cagagacatg gatagccaaa ccacaattta aaaattgttc 10440 tatttttact gaaaatttag ttgcaataga aatgaataaa gttcaagtaa cgtatgataa 10500 acctatttat gtaggattca gtatattgga aatttcaaaa actttaatgt atgacttttt 10560 ttataatttc cttaaagtta agtatggaaa taatgtacag cttttgtata cggatacgga 10620 ttcattaatt ttagaaattt atacggaaaa cgtatataat gatattaaac agaatatcga 10680 tcggtttgat acttcaaatt atcccgagaa taatattcat gatattccta aaactgtgtc 10740 tattatcggt aaaatgaaag atgagtatgc tggtgtacca attgtactac tttatggcac 10800 cggtgccaaa gcatattgtg tgcaaacagt gaatgattta attaaaaagg ctaaaggtgt 10860 aagtaaacat gtaattgata aatctctaac actgttacaa tatcgggcaa ttgcaacaaa 10920 tccttcatct tcatcagttt tctgtgttat gaacgttttc aaatcttatt tacataatat 10980 gtataccgaa ttaaggaata aaatagcact gtctaatttc gatgataaac gatacatttt 11040 aggtaattca attgatacat tagcctgggg acattacgat attgatagag atcgaaattt 11100 agattctcta attagagaat tgcaaaaata tgtggaaccc actgattgaa ttttacctta 11160 ttgtagatat gtaacgatat gtaaccattt atttatacta catatttatt attattatgt 11220 atttataatt gatgtaatgt gtaaagtatt tttatttgta ttctgtaagt caacaatgaa 11280 acttgtgcaa caaaaagatt tattaaaaat ttctaatcac gactattcag tttgtgtacc 11340 ctatcaacac tttcctaaac ataatacttt attcaataat agtgtaaaaa gaggactagt 11400 ggttggacgt tctggatgtg gaaaaacaaa tgtgatatta agtctcttgc tgcatccaaa 11460 tagattacgc ttcacagaca tttacttatt ttcaaaaacc ctacaacaac caaaatatca 11520 atttttaaaa tcggttctgg cacctctcga taaagtcggt tactatgaat atgaagatgg 11580 tgccgaaatt cctcctccta aagaaatcaa accctattcc ataataatat ttgatgatgt 11640 agtaacttgt aaccaaaata taatacgtga ttatttttgc tttggtaggc attataatat 11700 cgattgtata tttattagcc aaacgtattc ttccattcag aaacaattaa ttcgtgataa 11760 tgctaatttc ataattatct ttcctcagga tgtattaaat ttaaaacata tttttaatga 11820 tcacgttggt acagatatgt catttgaaac ctttcagaaa atgtgttcac agtgttggca 11880 aaaaccgtat ggatttgtta ctattgataa ggatgctgaa caagacaatg ggcgttatcg 11940 taacggtttt gatcagttca ttaaaatata aatgtaaatg tttgtcaaag atttatttat 12000 tagtcgacga tcgacatgga taaacgttta gctttagagc tggataaggc aagacgacac 12060 gttaaacaaa aagtacgatc attatcttca cacattgcaa gttcgcaaag acgtttcgca 12120 acccacattc aacctctaac cgatccaata aaaagtttaa tttctgaaat taaacatgaa 12180 cgcgaagttg tcccaaaaca ggaaaaaata tcaattcatc atgaggtcaa gcaaccggag 12240 gcgaagcgac tttttgagtc aacgactaca agccgtagta ctacatcgct cgatccacga 12300 ttcagtttaa tgcgacaaca tgcggctgct agcacaccca gagaggaggg agaatcactc 12360 aggaacgtga ccattggaaa gttgcaagaa acattaataa atttgtcaaa aaccgatgca 12420 tttcgccatt ttcttaaaca atggaccggt ttacctcgac agtacattga agatatgatt 12480 attgacacag aaaacaaatt tgatcatcaa tatggtgtga gacatggaac cgaggcaaat 12540 aaatttttta ttggcaattc tgaaattgat ttcaaagacg atcatgtaat cgttcaaaat 12600 gtagcttatc ctgatacacc tggattgtat gaattatgaa tgcagacaga aaaaactata 12660 aaccaaatgc acaaattctg gggaatgtag ggaacaaata taaaacaatt attcgacatt 12720 tctcgagaca gccactaccg tttcaggaac accgtgcatc tcagtcgccg ccattttaca 12780 gtgatactga agaaagcctt gatgccgatc aaggtcaatc gactcctgtc cagcaacgaa 12840 gccgcagtag tagtctttca attctcgtac caccagcacg taaacagtat ccgcgacttc 12900 gcaacatacc tgcaatacaa cgattgcagg ctcaaggtca tgaacaatca actccgacaa 12960 gtagtatcca accaatatca aaaagagtta agggtggtgg tattcttcgc acattggctt 13020 taacagataa aaatttagaa tttgttccct acaaaaatcc caacacactc gtaaatagac 13080 tacgtctttt attatcatcg acagtagctg gtaataataa tcatatgaat gaaatcaatc 13140 gcatcataac agaattgcgt gaatcgaaaa taattcgtta ataatttaga gacgacttgt 13200 atatattgca aacacattat attttatagt tagtatttct acatacttta ttcattcaat 13260 agacaagaat atgacattag ataaatttgg tgaacacatt agtgaacaca gaattcggac 13320 aactttacaa cgtgtcaaat ctttctcata taccacaatc tcactctttg ggacactgga 13380 tccagctaaa aaactttttg tattatttaa tattggtacg aattatattt tcccattaaa 13440 agaagcaacg attacgtctg taaatggtag tccatcgagc gggtggattg cattaattaa 13500 ttcgaaacaa actacgaatt taattggtca aattttacgt gaaggtgaca aactatcatt 13560 taaatatgga cccaaaacat cagaagctaa accgatatat atatagaaat agttttaaaa 13620 gtaccaatcc accatgaagc gtagtgattt aaatattaag cgtcaagttg ttaatgaact 13680 acatgcaccg gcgagagtga actttaaaag acgacgtgtg atagtgaaag gtttaaatga 13740 tttattgcaa ctcgatctgg ttgaaatgat accatattca cgcgtcaata aaggctttaa 13800 gtatatctta atggttatca atgttttttc taaatttgta tgggctgttc cattaaagaa 13860 taagtcggct ttaagtgttg taaatgcaat gagtgaaatt ttaaaaactc gacgtaacgt 13920 tcctaaaaac attcaaacag atctgggaaa ggagttttac aataaacatt ttcaacaatt 13980 aatgaataaa tacaaaatta atcactacag ttcttattcg aatttaaaaa gtagtgtcgt 14040 cgagcgcgtg aatcgaacaa tcaagggaat gatgtataaa gaattttcgg tgcaaggtca 14100 ctacaactgg ataaacattt tacctgacat aataaaaaaa tacaataata ccagacatag 14160 gactactggt ttaaaaccga tttcaataaa taagaaaaac gaacgcaaag ttttggatca 14220 agtttatagt catcttaaaa caatggacat ttataaaccc aaatttaaag ttggtgattt 14280 tgttcgaatt agtaaacatc gacatctttt taagaaaggg tacaccccaa attggagtaa 14340 tgaagtattt acagtttcga agattcaaaa tactaatcct agaacttatt tgctacaaga 14400 tgtttctgga gaaacaatac gaggaggttt ctacactgaa gaattacaaa aagttaaata 14460 tccgaatgta tatctcgttg agaaagtttt acgaaagaaa ggtaaccgtt tattcgtcaa 14520 gtggagtgga atagataaaa ctcataactc ttggattaat tcatcggata ttacatagat 14580 aattattatt attattataa attgtttttt ttttaataaa ataaactaaa aggaatcaat 14640 taaaacactt tatttcatca atattatagg tagtacatta aatatcctta aatctattat 14700 aaatcctcca tattattcta ccacttattt aattataaac taattaaatc aagacgcctc 14760 ttgaaacctt tagtttacct cgcctttggc gcgcgcccga cactgatcaa ggttaacaag 14820 tctgcatcac attacctatg cataagaaga cgcacataaa taatacatga atacaaatac 14880 tataaaaaat aaaactacaa caatgctatt tttagtaact aatttgaatt tttgaattgt 14940 agttttgtct gtggaaagca atttcacata tccacatttt ggagaatgtt ttttatgctc 15000 cataaacgca tcatcacttt tttcccatcg tgaaattatt actccacaaa caaaacactg 15060 ggtacaatca gatatttgaa tataaaacaa tcccgctttt gctaaactcc atggagatgg 15120 gtatgaatgt acccagttct tatatgataa cagtctttta tcaaaacact cgtataaact 15180 tttttcatat tgtccatcgt ctagacacat gattttttat ttattcattt attttaagaa 15240 actatcacta gcacaaaacg cacactactt cacacacata agtctttttc cgataatttc 15300 atcctactct tcggcaccct cgttaactgt aaatgaaata tacatttctt taatttcttt 15360 tcatcctcca ttttacatta tgttcactca cctaatattt catcatcaac acgttgaaac 15420 tcaaaacaat gtgcagatgt cgggtttcta gcatcttctt ttcccaaata tattaaatat 15480 agtggacgtg gatctcgtaa taactcatca attttctcct cagtcaaata atcacttatt 15540 cttctcggca gaaacactgc ataattgtcc aactcaacaa gcactgaatt tccgaacaga 15600 gactttgtgc gtctaatatt atgtatgcgg aatcttgtgt tcaactctag ctcaccagct 15660 tttaaaatgg gcttctgtcc gcatcgagcg attcgattca atacatccat gacccagcag 15720 cgaatactct tagacattaa cacagatata tgcacttttt tatattctta tttctcggag 15780 gattaaatca gtactgactg ctatcaaact tgtactaaat tctatatatt aactataccc 15840 ccacttttta accttgaatg aaaatgcacc aatattgtgg ccttgaaaaa gtgagataat 15900 gagaaaaagt gagatgaaaa catcaaagat aaagagggaa tgagtggtct tgaaacagtg 15960 agcgactctc tgcttgcagg tcacacttac ttgtatcagt aggttgaaat aggtgtacat 16020 aaaaacatgt agcttttttt tacaactttt atttttttat acaataatat aatatttaca 16080 ttttactacg actactacta ctacttagct aaacttattc tacactattt tctacgaaag 16140 aaccccggcc ctaaaaaaat tcacaataaa ttactgttga aaaagattca ataaaactta 16200 cttctttaat ttaaggcatt atgattcttt ttagttttta ggaagtcgag gtagctgggc 16260 tcattgaaat gatcccagtt ctcctcttct tctttcamca tcacttcaca tgaggccctt 16320 tgttttatga catgtttcct tttactyytc aggaagtcga ggtagctggg ctcattgaaa 16380 tgatcccagt tctcctcttc ttcttcttgc accatcactt cacttgaggc cctttgtttt 16440 tctttatcgg cccgatggca cttccaaatg tgcctttcga gccgaaacca aagaagaacg 16500 tggtttttgt cgtatggaca aattatcaca tcctctgaaa aaaaattatt tccaactaaa 16560 ctactaaaaa aatattaaac ttaccaggag tattataaat cgaactcttt cttatggctt 16620 ccatttttca agaatttctt gttatatgca gatatacaca agaactgtgt tacacaacac 16680 aacacattaa cttttataca gtttggggtg gaaaatgagt aaggacaatt gcgcgattca 16740 tcttgaatga gtgaggagtt gttggccttg caaaagtggg gagtagcatg aatgaccttg 16800 accttgaaaa catcaaggtc caataatgcc ttggccttga agtggtgggg gaagtgggag 16860 gggttggcct tgaaaaagta agagagagaa cgagtggcct tgaaagaaag agagggaaaa 16920 tgggcggggt caaactcaag gtcaaggttg tgttgtgttg tccccccatt cgttatctac 16980 t 16981 // ID RTE-6_PPac repbase; DNA; INV; 2937 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 2) XX DE A family of RTE non-LTR retrotransposons: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-6_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2937 RA Jurka J.; RT "RTE non-LTR retrotransposons from nematodes."; RL Repbase Reports 10(7), 1065-1065 (2010). XX DR [1] (Consensus) XX CC >99% identical to consensus. CC This sequence was derived from sequence data generated by Genome CC Sequencing Center at Washington University School of Medicine in CC St. Louis. XX FH Key Location/Qualifiers FT CDS 119..2911 FT /product="RTE-6_PPac_1p" FT /translation="MPLSTFRIGSLNTRTLKSDVRVAELEDSVCSVQFDVI FT ALQELRRCGQTSLTLAATGHLLVGYGIRSANGTGFLVNKTWASNCLFHKVN FT DRLSYLDLPAIGLRIINAYAPTSSHSDQEYENFLNDLATLVTSSTDRVHLP FT GGGRGHVVVAGDFNAKIGSRIAPDEEYVGNHGLGKRNERGQTLADFCCELN FT LYAHNTRFMKRETRKWTWESPNMATRNQIDFIMSRSRSNVQDVSVVGSLRF FT TTDHRLILARFKFETKGRKFYHKPRSLLQFDRRIFTAKIHATTVNFKTYDT FT LSKDISSIGKESMIPVQKVARFSAKTMHLFSERKKLRDNANTAQKRIEFSL FT TSKALRVSIHEDLKRKHLEVVRDAVRYGMSTRRAIQSSQIGRSQLTQLNKP FT DGSVATSKSEMSSVVQTFYNELYKRTLRPTRLPLRSNEPFLEFLPSEVTYA FT IRKLKENKSAGPDSITAEMLKSGIDIFVKPLCDLFNKWVIEEIVPNSLPDS FT VISLIFKKGSQLEISNYRPIALLNLVYKVFTSVIDTRIEGTLNSAQPHEQA FT GFRRNYSTTDHIFTISELISRSVEYNFPLYLVFVDYLKAFDSIEFGSLWKA FT LLNQGVHAKLICVLKDVYERGKVFVRLGKEKVRIHVERGVRQGDPLSPHLF FT NCVLEEAMKQVNWNQYGVNINGKRLHHLRYADDLVLISHDPKEVQNMLGDL FT CRESRKVGLVVNIQKTVAMSNRTLYPFTIDHQSLKYVTNFIYLGVRISFDQ FT DTMLEVNRRVGSAWKGFNRFYNFLTNRRVEMGFKERVYKMCIEPALIYGSE FT LWSLTKKVRNRLVVVQRSMMRKMAGITRMERRSNEWLSQKVPLPDVRIQAM FT LRKWNWARRLALTEDERWDKLIMEWTPIDRTRPIGRPKTRWRDELTELLGV FT NWQSICRRDPNAWNLAIIQQAQSLNIE" XX SQ Sequence 2937 BP; 888 A; 621 C; 667 G; 761 T; 0 other; attctctatt cttgcgcgct tgctgggcgg tcgcctccgc tctctcgtct cggcgagccg 60 ccgtctggat ccccttcgtg tcgcttggac aaccacgaag ggtagcctgt ccacccggat 120 gccactttcc accttccgca tcgggtctct gaatacgcga acgctcaagt cggatgtccg 180 tgttgctgag cttgaagatt ccgtatgttc cgtgcagttt gatgtcattg cattacaaga 240 gctaaggagg tgtggccaga catcacttac cctagcagca actggtcacc tattagtcgg 300 atacggtatt cgttccgcta atggaaccgg tttcttggtt aacaagacct gggcatcaaa 360 ctgtttattc cacaaagtga acgaccggct atcgtatctt gatctacctg caattggcct 420 gcgtattatc aacgcgtacg ctcccacttc aagccattcc gaccaagaat acgagaactt 480 tttgaatgat cttgcgactc tcgtgacttc ttccactgat cgcgttcatc taccgggcgg 540 tggtcgagga cacgttgttg ttgcgggaga tttcaatgcg aaaattggat caagaatagc 600 acctgacgag gaatatgttg gaaaccatgg attgggaaag agaaacgaac ggggtcaaac 660 ccttgccgat ttttgttgtg aattgaatct atatgcgcat aatactcgat tcatgaaacg 720 cgagacccga aaatggactt gggagtctcc gaatatggca actcgcaatc aaatcgattt 780 tatcatgtct cggagtcgaa gcaatgtcca agatgtttcg gtcgtcggaa gccttcgatt 840 tactactgat catcgattga ttcttgcaag attcaaattt gaaacaaagg gaaggaaatt 900 ttatcataaa cctcgttcgc tcttgcagtt tgatcggcga attttcactg cgaaaatcca 960 tgcaacaacg gtcaatttca aaacctacga cactctatcg aaagatatat catcgattgg 1020 caaggaaagt atgatcccag ttcagaaagt tgcaagattc tcggcaaaga cgatgcattt 1080 gttctccgaa cgaaagaaat taagggataa tgcaaatacg gcacagaaac gaatcgagtt 1140 ttctttgaca agcaaagcct tacgcgtatc cattcatgaa gatttgaaaa ggaaacactt 1200 ggaagttgta agagacgcag taagatatgg gatgagtact cgacgtgcaa tccaatcatc 1260 tcaaatcggt cgatcgcagc tcacccaact taacaaaccc gacgggtcag ttgctacttc 1320 aaagagcgaa atgagttctg tagtacaaac gttttacaat gagctttaca aaagaacatt 1380 gcgcccaaca cgattacctc tacgctcaaa tgaacccttt cttgaatttt tgccaagcga 1440 agtgacctat gcgattcgga aactaaagga aaataagtct gcggggccag attctataac 1500 agcagaaatg ctgaaatcgg gaatcgatat attcgtcaag cctctatgtg acctcttcaa 1560 taaatgggtc atcgaagaaa ttgttcccaa ttcgctacca gactccgtca tatcgctaat 1620 attcaaaaaa ggaagtcaat tggaaatatc aaactatagg ccaatcgccc ttttgaatct 1680 cgtgtacaaa gttttcacgt cagttattga tacacgaatt gaaggaactc tgaatagtgc 1740 gcaaccacac gagcaagcgg gattcaggag gaattactca accacggatc atatattcac 1800 aatatccgaa ttgatcagtc gctcagtcga gtacaatttt cccctttatc tcgttttcgt 1860 cgattatctc aaagcctttg actcgatcga gtttggatcg ctctggaagg cactactcaa 1920 tcaaggcgta catgcaaagc tcatatgtgt attgaaagat gtgtacgaac gaggaaaagt 1980 tttcgtgagg cttgggaaag agaaagtaag aatccatgtg gagagagggg ttcgccaagg 2040 cgatccatta tcgccccatt tattcaactg tgtactggaa gaggcgatga agcaagtaaa 2100 ctggaatcaa tatggagtga atatcaatgg gaagagattg caccatttac gatacgcaga 2160 tgatctcgtt ttgatctctc atgatcctaa ggaagttcaa aatatgctcg gagatctctg 2220 tcgagaaagc aggaaagttg gcctagttgt gaatatacag aagacggtag caatgtccaa 2280 tcgaacgtta tacccattca ctatcgatca tcaatctctg aaatacgtta caaatttcat 2340 atatctcgga gttcgtattt cattcgacca agatacgatg ttggaagtca acagaagagt 2400 tggaagcgct tggaaagggt ttaaccgatt ctataacttt ctcactaacc ggcgagttga 2460 aatgggattc aaagaaagag tgtataagat gtgcatcgaa cctgctttaa tctatggaag 2520 tgaattgtgg agcttgacaa agaaagtccg aaatagactg gtagtagtac agagatctat 2580 gatgcgcaaa atggcgggaa ttactagaat ggaaagaagg agcaatgaat ggctctcaca 2640 gaaagtcccc cttcctgatg taaggattca agcaatgttg cgaaagtgga actgggcaag 2700 gaggcttgca ttaactgaag acgaaagatg ggataagctt attatggaat ggacaccaat 2760 agaccgtact cgaccaatcg gaagaccaaa gacgcgatgg cgtgacgagc taacagaact 2820 acttggagtt aactggcaga gcatttgcag aagagaccct aatgcctgga acttggctat 2880 cattcaacag gcacaatccc tgaatattga ataaatctgt atctgtatct gtatcta 2937 // ID MLE1_TC repbase; DNA; INV; 893 BP. XX AC U57842; XX DT 28-AUG-1997 (Rel. 2.07, Created) DT 28-AUG-1997 (Rel. 2.07, Last updated, Version 1) XX DE Mariner-like transposon mle-1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MLE1_TC; TIRs; KW mariner; mle-1; transposase. XX OS Trichostrongylus colubriformis OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Strongylida; Trichostrongyloidea; Trichostrongylidae; OC Trichostrongylinae; Trichostrongylus. XX RN [1] RP 1-893 RA Wiley J.L., Riley G.L., Sangster C.N. and Weiss S.A.; RT "mle-1, a mariner-like transposable element in the nematode RT Trichostrongylus colubriformis."; RL Gene 188(2), 235-237 (1997). XX RN [2] RP 1-893 RA Weiss S.A., Wiley J.L., Riley G.L. and Sangster C.N.; RT "MLE1_TC."; RL Direct Submission to Genbank (09-MAY-1996)Anthony S. Weiss, RL Biochemistry, University of Sydney, Sydney, NSW 2006, Australia. XX DR GenBank; U57842; Positions 1 893. XX CC Representative member of mariner-like transposable elements; CC it is estimated that approximately 50 copies of MLE1 are in the CC Trichostrongylus colubriformis genome. CC repeat_region 1..27 CC /rpt_type=inverted CC CDS join(166..519,534..818) CC /note="mariner-like transposase CC repeat_region 867..893 CC /rpt_type=inverted. XX SQ Sequence 893 BP; 250 A; 231 C; 207 G; 205 T; 0 other; ggttgattca taataaatcg gacaaatggt agtgatgatt tctagagatc ataactcagt 60 aacgaattaa cgaaaatgga cgcgcgaggt accgttggaa tgggtatgaa aagctcttca 120 aagtaccgga aaataatcac tccaaagtaa attcattccg ccacaatgcc gcacgatttc 180 aagcggtcca gagaccacat ccgcaacgtc atactattta tatcgctatc tggccaggaa 240 cccgcggata tcggcagacg tctgatagaa gttcacaagg agcatgcccc cggcagaagt 300 acactgtagt tgtagcattc gaagtttgcg tcaggcgact attccatcga agacgaaaac 360 cgcggttggt ggagcgtaca cggcgtagag tactgggagc tgctcgatga aggtaccaca 420 gtatccgccg acgtctacgt tcgtcaatta cgagaattga aggccaatgt ccaaagttcg 480 ctacggcggc ggtcacatgt ctacttccag cacgacaacg tgtctacttc cacagaccgc 540 acttcgcaag atcgaccaag gccgagctgc tgtcgtacag ttggaccgta cttctatacc 600 caccgtactc cccagacctg gccccttcag atttccacct tttctcccac agtaaacgcc 660 acctggacgg ccaagacttc aaaactcgcg acgaggtcaa ggcggcactc ggcaacttct 720 tcgagctcca gcttccggcc ttctggagca agggcatcca tgatctgcct ataccgtggc 780 agaaggtcat cgataaagat ggcgtaaatt tcaagtgaat ccctgttgtc gtaaaaaagt 840 tgtacgacaa tgattaaaat tggtcaattt gtccgattta ttatgaatca acc 893 // ID Chapaev3-2_HR repbase; DNA; INV; 2339 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-2_HR is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-2_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-2339 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 54-54 (2008). XX DR [1] (Consensus) XX CC Chapaev3-2_HR belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-2_HR is a very young family of leech Chapaev3 CC transposons: genomic copies of Chapaev3-2_HR elements are ~99.6% CC identical to their consensus sequence, which was derived from CC multiple alignment of five Chapaev3-2_HR elements. Chapaev3-2_HR CC contains imperfect 18-bp terminal inverted repeats (3 mismatches) CC and encodes a 558-aa transposase. CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX FH Key Location/Qualifiers FT CDS 399..2072 FT /product="Chapaev3-2_HRp" FT /note="transposase." FT /translation="MKKSNRSCLNEPDNFCYICGQFTPRDQRKNLSNRIKI FT AYYHYFGVKVADQERTWAPHICCSVCYVGLTQWLNGKRKQIPFAIPMIWRE FT PRDHHSDCYFCMTNIQGHSKKTKASIVYPNCSSAIKPVPHSVEYPVPTPPT FT DTVLYSEEEQSGDEKVDVEYKPDYDKDKPHMITQGELSDLVRDLGLTKNKA FT ELLGSRLQQWNLLDRGTKISHFRDRHTEFAKFYNKEDNICYCVDIAGLMTK FT LDDEYDPVDWRLFIDSSKVSLKAVLLHNGNVKPSIPVAHAVGMKETYESMK FT TLLKVIKYTDHNWNISGDLKVVALLLGMQLGYTKHMCFLCLWNSRDDANHY FT LVKDWPARIDPVPGQFNILHESLVNPDKIFLPPLHIKLGIFKNFVKALPTD FT SKGFIYLKDKFKTTLTNAKIAAGVFTGPQIREVIRDPNFKLQLEPVELLAW FT EAFVALVQNFLGNHRSEDYVNLVENFIMAYQNMGCRMSLKIHFLHSHLSFF FT PANLGAVSDEQGERFHQEISVMEHRYQGRFDSNMMGDFCWFLQRESESQYK FT RKRSLSTNYFY" XX SQ Sequence 2339 BP; 765 A; 403 C; 408 G; 763 T; 0 other; cactagacaa caaattttaa ctttgtgtct gctacttgag caaaattata ttatccatat 60 agagtttttg atgctgattt caaatatgta atcagaattt ttgtagcaca tacagttttt 120 tagttatttg ttttttttat tttttttaaa ttttgctgta tttaatgttc aatttcaagt 180 ttttcatacc aaatttaaat atctaaatgt ttttctttgt tctattttat tatctttgtc 240 ttttattctt tgaatctatt ttgtttccct taaattttat tagtcatctc ttctttttat 300 tctttttttg ttcaattttg ttatataaat acactattcc agaactatat tataaatata 360 caattctaaa ttaattttaa atttatccag tgataaaaat gaagaaatct aaccgtagtt 420 gtttgaatga accagacaat ttttgttata tttgtgggca atttacacct cgtgatcagc 480 gaaaaaattt gtcaaacaga ataaagattg cctactacca ttactttgga gtaaaggtgg 540 cggatcaaga aagaacctgg gctccacata tttgctgcag tgtttgctac gtaggtctca 600 ctcaatggtt aaatgggaaa agaaagcaaa taccttttgc cattccaatg atatggcgtg 660 agccaagaga tcaccattca gactgctact tttgcatgac aaatatacaa ggacattcta 720 agaaaaccaa agcatcaatt gtgtacccta actgttcatc tgcaataaaa ccagttcccc 780 acagtgttga atatcctgtt ccaactcctc ctacagacac tgttctttat agcgaagaag 840 aacaaagtgg tgatgaaaag gttgatgttg aatacaaacc agattacgac aaagataaac 900 ctcacatgat cacgcaggga gaacttagtg atctggtaag ggaccttggg ttaactaaaa 960 ataaagcaga actcctgggg tctagattac agcaatggaa tttactggac agaggaacta 1020 aaatatccca ttttcgtgac cggcacaccg aatttgcaaa attttacaac aaagaggaca 1080 atatatgtta ctgtgtagac attgctggat taatgacaaa attggatgat gagtatgatc 1140 ctgtagattg gcgtctgttt attgattcta gtaaagtcag tctaaaagca gttctgctac 1200 ataacggaaa tgttaaacca tccattcctg tagcccatgc agtgggtatg aaagaaacat 1260 atgaatctat gaagacactt cttaaagtta taaagtacac tgatcacaac tggaatatca 1320 gtggggatct caaagttgtt gctcttcttc ttgggatgca attggggtac acgaaacaca 1380 tgtgcttttt gtgtctgtgg aacagtcgtg atgatgcaaa tcactatctc gtcaaagact 1440 ggcctgcgag aattgatccg gtacctggac aatttaacat tcttcacgaa tcgttggtga 1500 atcctgacaa aatcttcctt cctcctcttc atatcaaatt aggaatattt aaaaactttg 1560 tcaaggcgct acctactgat tcgaaaggat ttatttatct taaagataaa ttcaaaacca 1620 ctctcacaaa cgcaaagata gctgcaggag tttttactgg accacagata cgtgaagtta 1680 ttcgtgaccc aaatttcaaa ttacaactag agcctgtgga gttgctggct tgggaagcgt 1740 ttgtggcact ggttcagaat tttttaggaa accacagatc agaagactat gtgaacttgg 1800 tagaaaactt cataatggca taccaaaaca tgggttgtag aatgtcccta aaaatccatt 1860 tcctccattc ccaccttagc ttctttcctg ccaatttagg agctgtcagt gatgaacagg 1920 gagaaagatt ccatcaagaa atctctgtta tggaacaccg gtatcagggt cgttttgata 1980 gtaacatgat gggggatttc tgttggtttc tacagcgtga aagtgaaagt caatataagc 2040 gcaaaagatc attatcgaca aattattttt attaatttaa ctttgagaca ttacctatgt 2100 aataattttg atttaaataa ataaaaaaac aatcagatta tgaatatgtt ttggttttga 2160 gtattcttga gcacagctga aaatggacta atttagcacc taaaattcac aatcttttag 2220 ctgtaacttt ataacctgac gtcctatgac aaaaagaata gcagatttgt aatcagcata 2280 cccaaattac cctagaacaa ctaacaaagt ctgagtagca aaaaacttgt tgcatagtg 2339 // ID Chapaev3-1N1_AAe repbase; DNA; INV; 2706 BP. XX AC . XX DT 11-OCT-2010 (Rel. 15.1, Created) DT 11-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous Chapaev3 DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; Nonautonomous; KW otherMITEs_Ele22b; Chapaev3-1N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2706 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2706 RA Kojima K.K. and Jurka J.; RT "Chapaev3-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [1] Named as otherMITEs_Ele22b. CC [2] Consensus update and characterization as a non-autonomous CC Chapaev3. >98% identical to consensus. This consensus is ~99% CC identical to the original sequence in [1]. 3-bp TSDs; usually CC TWA. TIRs are ~1140 bp long. Both termini are ~90% identical to CC Chapaev3-1_AA. XX SQ Sequence 2706 BP; 806 A; 548 C; 546 G; 806 T; 0 other; cactgttcga caaaaaaagt cgatatgaat gcacagagac cggacacccc gggcttgaaa 60 gtataaaaat aaacaaaaat aatggcgttt tcagaatgtt cgtgtctgcc ccaggtcatt 120 tttaaaatat acttgaactt tttgcatttt ttatttaaaa atctaatcta atcaataaac 180 cgacacagtg ttttatattc aggacagggg aattcatgtg ccatataatg ttttcaactt 240 taaatacaat atgaagagtt gtaataagat ttgcagggag ccctcccccc ttttttgtac 300 cctccccatt tttaaagtat cgcaaatgat ggaatatttg atatacaggg ggttgtcaaa 360 ataactggga caggcaaaaa ttgggccaac tttggaatgc tgtaactttg acaaaaattg 420 accgatttca attctttaag aagtaatgga cgggtcaact aatctagttt tgaggtgcat 480 ccacggagat gaactatgac caccggatac cggtgataat ccggatttcc ggaagcatgt 540 cttatgcagt aaaattatga cgtgttttta gcaaaggtct cggctaaaaa atcaaaattt 600 tactacacat gaagatagaa gatctaattc tgaagcgatt ggtgcgccaa gtattaatat 660 tggtccagaa acaaccaata tatggccatt tccccggaat cggtgccggt agtggatccg 720 aattgggatc aaacaattta ttcactcaaa atatgtcgcg caatattgtt ttctccctag 780 cttatcataa aataccctat tataagttga aaatgagtct tgtacagatt tggccactca 840 tggcgccgcc cagtgccccg ggggaacctt gcatagggga catttcgatt ttgacaccaa 900 agcatatcat gcgacggctc attcttcatg tcttgtcgta aatagggcaa ctgtagactc 960 aaaatggata gttgactaca gtgactactt ccgggaccac cggatgtccc agagggaacc 1020 tgtaattagg gacaattgaa attgaactcc aatacatatc gtgcgacagt tcatttttca 1080 tgtctcgtgc taaatagggc tattgtagac ctaaagtgga taattgacta cagtggccaa 1140 ttttgggact accggaattt cccggaggaa cttgtcatcg gggacatttc tgtttttaca 1200 ccaaaacata tcatgcgacg gctctttctt catgccttgt cgtaaataaa ataactgtag 1260 actcaaaatg ggaatttaat tgaattggcc acttttggga ctaccgggtg tcccctaggg 1320 aacccgtgat tagggacaat tggaattgag ctccaatact catcgtgcaa cggctcattc 1380 ttcgtgttta gtcatgaata ggtcaacttt agaccgaaaa tggatattta actagaatgg 1440 ccactttggg gacctcctat agttccctaa ggaacctatc acttttggga ccaccggaat 1500 ttcccggagg aatctgtcat cgaagacatc acagcctccc tcagggatat ccggtggtca 1560 caaaagtggc cactgtagtc aaatatccac ttaaggtcta caatagccct atttagcacg 1620 agacatgaaa aatgaactgt cgcacgatat gtattggagt tcaatttcaa ttgtccctaa 1680 ttacaggttc cctcggggac atccggtggt cccggaagta gtcactgtag tcaactatcc 1740 attttgagtc tacagttgcc ctatttacga caagacatga agaatgagcc gtcgcatgat 1800 atgctttggt gtcaaaatcg aaatgtcccc tatgcaaggt tcccccgggg cactgggcgg 1860 cgccatgagt ggccaaatct gtacaagact cattttcaac ttataatagg gtattttatg 1920 ataagctagg gagaaaacaa tattgcgcga catattttga gtgaataaat tgtttgatcc 1980 caattcggat ccactaccgg caccgattcc ggggaaatgg ccatatattg gttgtttctg 2040 gaccaatatt aatacttggc gcaccaatcg cttcagaatt agatcttcta tcttcatgtg 2100 tagtaaaatt ttgatttttt agccgagacc tttgctaaaa acacgtcata attttactgc 2160 ataagacatg cttccggaaa tccggattat caccggtatc cggtggtcat agttcatctc 2220 cgtggatgca cctcaaaact agattagttg acccgtccat tacttcttaa agaattgaaa 2280 tcggtcaatt tttgtcaaag ttacagcatt ccaaagttgg cccaattttt gcctgtccca 2340 gttattttga caaccccctg tatatcaaat attccatcat ttgcgatact ttaaaaatgg 2400 ggagggtaca aaaaaggggg gagggctccc tgcaaatctt attacaactc ttcatattgt 2460 atttaaagtt gaaaacatta tatggcacat gaattcccct gtcctgaata taaaacactg 2520 tgtcggttta ttgattagat tagattttta aataaaaaat gcaaaaagtt caagtatatt 2580 ttaaaaatga cctggggcag acacgaacat tctgaaaacg ccattatttt tgtttatttt 2640 tatactttca agcccggggt gtccggtctc tgtgcattca tatcgacttt tttttgtcga 2700 acagtg 2706 // ID Gypsy2-I_Dmoj repbase; DNA; INV; 4763 BP. XX AC scaffold_6680; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_Dmoj; KW Gypsy2-LTR_Dmoj; Gypsy2-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-4763 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1032-1032 (2009). XX DR Genome; scaffold_6680; Positions 4261838 4266600. XX CC Positions [2494-3000] - Reverse transcriptase CC Positions [3681-4157] - Integrase core CC 'TAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 62..2476 FT /product="Gypsy2-I_Dmoj_2p" FT /translation="MKLSIDEVSVVQLKKWLAALGLSTQGCKTELVARLNG FT VPANLRGSAPEVEQGEEESDQTGVSQTVQMQQEEIDRNENILRELSEQIDA FT AKKQLKNFHTRQLDRDGNSKRAEISANGVDAEVEFEERATDARQKGASENC FT VINTTHNDEVMSMMFKLAKEIMVEFTGEESISNWIAQLNNVSQLYQLTDMQ FT KKLLCMAKLKGKAMQWIHDDSSRIVQPLGTLLDGLAAAFGSKISKAEMRRK FT FGARIWSVDEKFTQYYEEKIRLAREIRLDVDELLDGLIEGIPNAVLRAQAR FT IQCFEDPSKLVRAFAEVRLPLHRSTGGKGTTAVGPGLDNAMKETRCFNCNY FT KGHWARDCRKPKREKGTCYHCGAKDHMVAQCTKRKDDIGNNFVRFISIHFK FT SKDNASIIAECLIDSGSPISFIKKSCVPVGIKLKAHAENYFGLNESQLNVI FT GKTFVHIFKKFEKINFWLIVVPDESMRSEVILGRDFMEACNLYVDPIALRM FT ISVERAAGTLDKSLENTLLGNTSLDKSLENTLLGNTSLDKSLENTLLGNTS FT LDKSLENTFLGNTSLDKSLENTFLGTTSLDKSLENTLVNKENEELGKKLVN FT KVLENEELGKKLVNKVLENEELGKKLVNKVLENEELGKKLVSKVLENEELG FT KKLVSKVLENEELGEKLVSKLLENKELGRKLVNKVLGNKELEEKLLDKEEI FT DRSKECLEKSMEINSKSVFDDNFENSMLQINVVEGDNEKEIYIIGENISYD FT VRKKLIRLLKVEYVDAERPVKPKKKCEMNIRIEKAKPLAVHQEGYHTLKRK FT NYR" FT CDS 2479..3519 FT /product="Gypsy2-I_Dmoj_1p" FT /translation="MLDEYLRNGIIQPSCSEYASPIVLVKKKTGDLRMCVD FT FRKLNKIMIKDNYPLPLIDDLLDRLVGKSIFTKLDLKNGYFHVFVEKESVK FT YTSFVTPLGQFEFLRMPMGLKTAPQIFQRFVNDIFADLIKESKVIVYMDDI FT MIASKDVSEHLKILEEVLKRLVENKLELRLDKCEFLQNKVKYLGFVISEDG FT IRADDKGLEAVKNFPIPNKVQAVQSFLGLCSYFRRFIKDFSTLAKPLYDLL FT RKDRKFQFEEKELNCFLCLKEKLLEAPVLALYNHKDDVELHCDASAMGFGA FT VLLQKKEDGKLHPVFYFSKRTTETEAKYHSFELETLAIMYQKRWKNMYCTN FT ITMN" FT CDS 3483..4544 FT /product="Gypsy2-I_Dmoj_3p" FT /translation="MEEHVLYKYHNELGHVGRDKMLNAISKSYWFPNMKKK FT ILEHIENCLQCIAFSPRTGKAEGLLHSIPKGNKPFQIIHIDHYGPVDNGRS FT KKYIFAVVDGFTKFVRLFTTKTTSTREVVKALREYFRTYSKPECIISDRGS FT CFTSTEFDQFLEEAKVKHIRIATGSPQANGQVERINRSLGPMIAKLIEGDN FT GQHWDSIIERVEYVLNNTQHCSTKQLPSVMLFGIAQKGEISDSLGEQLEKI FT IGLDICSSQNLEDIREQAQHYQSAAQTRNEENFNNARKAAKVYKPGDYVMV FT KNFDNTVGISRKLIPKSKGPYVIDKVLKNDRYLIKDVDGFQLSRNPYQGVW FT SASNIRHWIKA" XX SQ Sequence 4763 BP; 1810 A; 619 C; 1079 G; 1255 T; 0 other; atctcagaag tgggataaag tacactaatt tttcaaatta aaaaaatgaa tttaagcata 60 aatgaaatta agcatagacg aagtatctgt ggtgcagtta aagaagtggc tagctgcatt 120 gggcttgtcg actcaagggt gcaagactga acttgtcgct cgcctcaatg gcgtgcccgc 180 aaatttgcgc ggcagcgccc ccgaagtgga acagggagag gaggagtcag atcaaactgg 240 agtatcgcag acagtgcaaa tgcaacagga ggaaatagac aggaatgaaa atattttgcg 300 agagctaagt gagcagatcg acgcagcaaa gaagcagcta aaaaattttc acacacgaca 360 attggacaga gacggcaata gcaagcgtgc cgaaattagc gcaaatggcg tcgatgcaga 420 agttgaattc gaagaacgcg cgactgacgc gcgccaaaag ggcgccagtg aaaattgtgt 480 aataaataca acgcacaatg atgaagtgat gtcaatgatg tttaaattag ctaaagaaat 540 tatggtggaa ttcactgggg aggagtctat tagcaactgg atagcgcaat tgaataatgt 600 atctcaatta tatcaactaa cggacatgca aaaaaagttg ctatgcatgg ccaaattaaa 660 agggaaagcc atgcaatgga tacatgatga ttcatcgagg attgtgcaac cgttgggcac 720 attattggat ggattggcag cagcattcgg aagcaagata tccaaggccg aaatgcgtcg 780 caagtttggg gcacgtatat ggagtgtcga tgagaagttc acccagtatt atgaagagaa 840 aatacgttta gcaagggaaa ttcgtttaga tgtagacgag ctactggatg gtctcattga 900 aggaatccca aatgcagttt tacgcgcgca ggccaggata cagtgctttg aagaccccag 960 caaattggta agggcttttg cagaggtgcg tttgccactg catcgatcta ctggaggaaa 1020 agggacaaca gcagtgggac cagggctcga taatgcaatg aaggagacgc gttgcttcaa 1080 ttgcaactat aagggacatt gggcaagaga ttgtcggaaa cctaagaggg agaagggtac 1140 gtgctaccac tgcggcgcta aggatcatat ggtggcacaa tgcacgaaga ggaaggatga 1200 tattggaaac aatttcgtaa ggtttatcag tatacatttt aagtccaaag ataatgcctc 1260 cattattgca gaatgcctca tagattcagg aagtcctatt tcgtttatta aaaaatcgtg 1320 tgtcccagtt ggaataaaac ttaaagcaca cgcagaaaat tactttgggc taaatgaaag 1380 ccagttaaat gtgattggaa aaacatttgt tcatattttc aagaagtttg agaaaataaa 1440 tttttggtta atagttgtcc ctgatgaatc tatgaggtct gaagtaattt tagggagaga 1500 tttcatggaa gcttgcaatt tatatgttga cccaatcgcc ttgagaatga tttcagtgga 1560 aagagcagct ggcacgttag ataaatcgtt agaaaataca ttgttaggaa acacatcgtt 1620 agataaatcg ttagaaaata cattgttagg aaacacatcg ttagataaat cgttagaaaa 1680 tacattgtta ggaaacacat cgttagataa atcgttagaa aatacattct taggaaacac 1740 atcattagat aaatcgttag aaaatacatt cttaggaacc acatcattag ataaatcgtt 1800 agaaaataca ttagtaaata aagaaaatga agaattagga aagaagttag taaataaagt 1860 gttagaaaat gaagaattag gaaagaagtt agtaaataaa gtgttagaaa atgaagaatt 1920 aggaaagaag ttagtaaata aagtgttaga aaatgaagaa ttaggaaaga agttagtaag 1980 taaagtgtta gaaaatgaag aattaggaaa gaagttagta agtaaagtgt tagaaaatga 2040 agaattagga gagaagttag taagtaaatt gttagaaaat aaagaattag gaaggaagtt 2100 agtaaataaa gtgttaggaa ataaagaatt agaagaaaaa ttattagaca aagaagagat 2160 cgataggtca aaagagtgtt tagaaaagtc aatggaaata aatagcaaaa gcgtttttga 2220 cgataatttt gaaaacagca tgcttcaaat taatgtggta gaaggagaca atgaaaaaga 2280 gatatatatt atcggtgaaa acattagtta tgatgtacgg aaaaaactaa ttagattgct 2340 gaaagtggaa tatgtagatg ctgaaaggcc agtgaaacct aaaaagaaat gcgaaatgaa 2400 tatacgaatt gagaaagcaa aacctttagc tgttcaccaa gaaggttatc atacactgaa 2460 aaggaaaaat tacagataat gttagacgaa taccttagaa atggcatcat acaacctagt 2520 tgctcagagt atgcttcacc aatagtatta gtaaagaaaa aaacaggaga tttaagaatg 2580 tgcgttgatt ttagaaaact caacaagatc atgatcaaag acaattatcc tttgccgttg 2640 attgatgatt tgttagacag actagtgggg aagtctattt tcacgaagct tgacctgaag 2700 aatggatatt ttcatgtttt tgtagagaaa gagtcagtaa aatacacatc gtttgttacg 2760 ccgttaggac agtttgagtt cttgagaatg cctatgggat taaaaacagc tccgcaaatt 2820 tttcagaggt tcgttaatga tatatttgca gatttaataa aggaaagcaa ggtaattgta 2880 tatatggatg atataatgat tgccagcaaa gatgtgtccg agcatcttaa gatactagaa 2940 gaggttttaa aacgattagt agagaacaaa ttagagttga ggctggataa atgtgaattt 3000 ttgcagaaca aggttaaata tttaggattt gtaatatcag aagatggcat tagagcagat 3060 gataagggat tagaagcggt aaagaatttc ccaataccta ataaagttca ggcggtgcaa 3120 agttttctcg ggttgtgctc atattttaga agatttataa aagacttttc aactctggca 3180 aaaccgttgt atgacttatt gcggaaagat agaaaatttc aatttgaaga aaaggaacta 3240 aattgttttt tatgcttgaa ggagaaattg ttagaggcac cagtattagc gttgtataat 3300 cataaagatg acgtagaatt gcattgtgat gccagtgcaa tgggttttgg agcagtgctt 3360 ttacaaaaga aagaagacgg aaagctgcat cctgtatttt atttttcaaa gcgaacgaca 3420 gaaacggaag caaaatatca tagttttgaa ctggaaactt tagccattat gtaccagaaa 3480 agatggaaga acatgtattg tacaaatatc acaatgaatt agggcacgtt ggaagggaca 3540 agatgcttaa tgcaataagt aaatcatatt ggtttcccaa tatgaaaaag aaaattttag 3600 aacatattga aaattgtttg caatgcatag cattctcgcc tagaacgggc aaagcagaag 3660 ggttgctgca tagcattcca aaggggaata aaccgttcca aataattcat attgaccatt 3720 atggaccagt ggataacggc agatcaaaga aatacatttt tgcagtagta gatgggttta 3780 caaagtttgt cagattattc actactaaaa caacaagcac tagggaagtt gtaaaggcat 3840 tgagagaata ttttagaaca tacagtaagc cagaatgtat tatatcagat agaggaagtt 3900 gttttacgtc aacggagttt gaccaatttt tagaagaagc aaaggtaaag catattagga 3960 ttgcaacggg atcgccacaa gcgaatggtc aggtggagcg aataaacaga agtttaggtc 4020 cgatgatagc taaactaata gaaggtgata atggacagca ttgggattca ataattgaga 4080 gggtggaata tgttttgaat aacactcaac attgtagcac caagcaattg ccaagcgtaa 4140 tgttattcgg aatagcgcaa aaaggggaaa ttagtgatag tttaggagag caattagaga 4200 aaataatagg attagatata tgcagttcac agaatttaga agatattaga gaacaagcac 4260 aacattatca atctgcagca caaacacgga atgaagagaa ctttaataat gctaggaaag 4320 ctgcaaaagt atacaagcca ggagattatg taatggttaa gaattttgac aacacggtgg 4380 gaatatcaag gaaattaata cccaaatcca aaggaccata tgtcatagat aaagtactga 4440 agaacgatag gtatttgata aaagacgttg acggtttcca attatccagg aatccttatc 4500 aaggtgtttg gagtgcaagc aatataagac attggataaa agcataagag agataataac 4560 ttatatttta gatattaatt tcataactag tgtgtaacat ttaattaata atttaatttt 4620 tataacatgt aaatagaata tctgtaagat tttttttttg tatcaaacat gtaaacacaa 4680 tattattttt aggaagttaa gtatcaaata ttatataaaa ttaacagaga tagagggcta 4740 tctcagagtc aggatggccg aat 4763 // ID BEL-58_CQ-I repbase; DNA; INV; 4857 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-58_CQ_; KW BEL-58_CQ-LTR; BEL-58_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4857 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 269-269 (2011). XX DR [2] (Consensus) XX CC 'CATAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 679..3570 FT /product="BEL-58_CQ-I_1p" FT /translation="MDELKDLLKQERQLILTINGVGDFVEAYKKAEHENQI FT SIRLDTLEEAMRKFFKVRRKIEAMIDDEDEDQVVGESKEARKKRLADLVVQ FT REKEYNKALRDVEERYFVVKAKLVALRPIKVEPTPGADLNETCFDRSISRI FT KLPDIKLPNFSGELKDWISFRDTYRSLIHSNVQLPDIDKFTYLRSALQGEA FT QLEILSVDFSAEGYDVAWKALEKKYDNHKLIVKAYLDALFDIEPLRKESCD FT GLSHLISEFETNLQMLKMLGEGTEAWSTILVHMLCTRLDHTTLRLWESHHN FT SKAVPKYEVLIEFLRSQCTVLQSIKANRPGEGDGRQNRSRVSTTHTSSQSQ FT RRCLFCEEAFHMPFSCSKLRNMSVSQRVEEVNRRRLCRNCLHAGHYADGCS FT RGSCTRCGGRHHTLLHYDTPAPAGARQERSSVPNTQGQSHEAGQQQQPTRQ FT QDQTQNISTQPTTSFYSVIQSTSRNTHPPPPTDFHNTTTLKAASLPAQNAP FT TLSRRVLMSTAVVRVEDQFGNFSFARTLLDSCSEFCYMSSNFSKKLKFRTT FT PDVLRVQGIGNGSALSLEAVRAKIQPRLATISRFSKEMQFHVLEKIANDLP FT VTPVDVSQMMLPCDIILADPHFGKPGPIDMIIGAEFFFDLLAAGRRKIVED FT GPTLQETVLGWIVSGKVPVSSPSVPCTATYFSSAVDLKECCELESSNVNST FT HSVAEFTREELLKKTTKQDAKGRSGTLNVRHAQKVRQFTDSAGVTVKKWTS FT HCPEQLKQLSKHLKDERSTLNTPHSVMHACVLCQQLWRVKSDRDQPSDESL FT QQLWRVRMSLKAVATRKVPRWIGFSNDCVENKIYGSERAYGASLYLRCTAL FT DGSVTFRQLMVEAKVAPLENPKREGQHSTVGDVTWQMVKASEKVTRPNTQL FT VDGVMRVCVRLSNISHCKWTLRSCELEDLLAFHPEKPRIWSNLKLSTRWSA FT PRRCCVNLS" XX SQ Sequence 4857 BP; 1155 A; 1267 C; 1355 G; 1080 T; 0 other; ttggtccttc gagccggatc gaagtgaccc ggcccgaagt ggaaggactt tttgtgtgcg 60 agcgtgtgta gtggcctgtt cgcggaacag tggccatcgc gattcggact gtgcgagtcc 120 gcggcagttg gcgccatcgc gcagtgattt gcggctttgt gctgcagttt tttatactgt 180 tctgcgcgtg gtgctgattg gaagaagcag ctggagcgat cgtggtctgc tgggagctgg 240 tttactgtga tctgcggacg gaagctgcgc tgctgccgga agctgtgtgt tcgagagtga 300 tcgtgtgtgt gtgcgctgga cgttgcccat ctaccggagg ttccaggagg gccaagcagc 360 gttccagagg aagaattggg tttgcacgaa ttgggttaga gtgaaggagc gtttttcgag 420 agaagttttt aataaagaag tttggatttt tcacgcgttt gcgtttattt ctggggttgt 480 ggtttgggtt attcctgcgc aagccctgag ttgttggata gttctggtct ctgtcggtcg 540 gtccactgct ggttctccac cttggctgtc tctccccact gtcaatcagg tcgcgtagaa 600 cgttgagtcg atcgtttcgg tgttcgcggt tgcagtgttc cgggttttag cagtcaacgt 660 gagtgaaggt ggtgaagcat ggacgagttg aaggatttgc tgaagcagga gcggcagttg 720 attctgacca tcaatggtgt aggagatttc gtggaggcgt acaagaaggc cgagcatgag 780 aaccagatct cgattcggct ggacaccctg gaggaggcaa tgaggaagtt cttcaaggtg 840 cgccgcaaga tcgaagccat gattgacgac gaggatgagg atcaagttgt tggagagtcg 900 aaggaggctc ggaagaagcg gttggctgat ctggtggttc aacgtgagaa ggagtacaac 960 aaggctcttc gagacgtcga ggagaggtac ttcgtggtaa aggcgaagct ggttgcgctg 1020 cgtccgatca aggttgagcc cactcctggt gctgacctga acgaaacctg cttcgatcga 1080 tcgatttcgc gcattaagtt gccggacatc aagctgccta actttagcgg cgagctgaag 1140 gattggatct cgtttcgtga cacgtacagg agcctcatcc actccaacgt gcagctgccg 1200 gacatcgaca agttcaccta cttgaggtcc gctttgcaag gagaagcaca gctggagatt 1260 ctgtcggtag atttctctgc agaaggctac gacgttgctt ggaaagcact ggagaaaaag 1320 tacgacaacc acaagctcat cgtgaaggcg tacctggacg cgctcttcga catcgagccg 1380 ctgcggaagg agagctgtga cggtctgtct cacctgatca gtgagttcga gacgaacctg 1440 cagatgttaa agatgctcgg tgaaggaacg gaagcctggt cgacgatcct ggtccacatg 1500 ctatgtacgc gtttggatca taccacgctg cgactgtggg agtcgcacca caactcgaaa 1560 gctgtaccga agtacgaagt gttgatcgag tttctgcgta gtcagtgtac ggtgctgcaa 1620 tctatcaagg ccaaccggcc cggcgaagga gacggacgac aaaaccgctc gagagtctcg 1680 accactcaca cgtcctcgca gtcacaacga cgctgcctgt tttgtgaaga ggcatttcac 1740 atgccgttca gttgcagcaa gctgaggaac atgtctgtgt cccaacgagt ggaggaagtg 1800 aatcgacggc gactgtgccg aaattgtttg catgctggtc attacgctga cggatgttct 1860 cgtggttcct gtacccgctg cggcggaaga catcacacgc tcctgcacta cgacacgccg 1920 gcccctgctg gcgctcgaca agaaagatcc tccgttccta atacgcaagg tcaatcccac 1980 gaagctggac aacaacaaca acccacgaga cagcaagacc agacacagaa catttccacc 2040 caaccaacca caagtttcta ctccgtgatc caatccactt ctcgaaacac tcacccacca 2100 ccacccacag actttcataa taccaccacc ctcaaagcgg catccctccc cgcacaaaat 2160 gcacccacac tttctcgccg agtcctcatg tccacggctg ttgttcgcgt tgaagatcag 2220 ttcgggaact tctcgttcgc tcggacactg ttagattcgt gttccgagtt ctgctacatg 2280 tctagcaact tttcgaagaa gctgaagttc cggacaacac cagatgtact gagggtacag 2340 ggcatcggaa acggctcagc gttgtcgctc gaggccgtac gcgcgaagat ccagccacgg 2400 ctggctacga tctcgaggtt ctcgaaggaa atgcagttcc acgtgctgga aaagatcgcc 2460 aacgatctgc ccgtcacacc ggttgacgtc agccagatga tgctcccctg tgacatcatt 2520 ctcgctgatc cgcactttgg gaaacctggt cccattgaca tgatcatcgg tgctgagttc 2580 tttttcgatt tgctggcggc tggtcgtcgt aagatcgtcg aggacggtcc gacgttgcaa 2640 gagactgtgc ttgggtggat agtctcaggg aaagttccag tgtcgtcacc cagcgtccca 2700 tgcacggcaa cctacttcag ctcagcagtt gatctgaagg aatgctgcga actggagtcg 2760 agtaacgtca acagcaccca ctctgtggcg gagttcaccc gcgaagagct gttgaagaag 2820 acgacgaaac aagatgcgaa gggaagatca ggcacgctga acgtccgaca cgcgcagaaa 2880 gtccggcagt tcaccgattc agctggtgtc acggtgaaga agtggacctc ccactgccct 2940 gagcagctca agcagctctc gaagcacctg aaggacgagc gcagcacgct caacacgcca 3000 cacagtgtca tgcacgcttg tgtcttgtgt caacagctgt ggagagtcaa gagcgaccgg 3060 gatcaaccgt cggacgaatc gttgcagcag ctgtggagag ttaggatgag tttgaaggcg 3120 gttgcgacga gaaaggtgcc gcgctggatt gggttcagca acgattgcgt cgagaacaag 3180 atctacgggt cagagcgagc ttacggtgcg agtctctacc tccgctgcac tgcgcttgac 3240 ggttccgtca cgtttcgaca gctcatggtc gaggccaagg tggctccgct cgagaaccca 3300 aagcgcgaag gtcaacattc cacggttgga gatgtcacct ggcagatggt caaggcgtcg 3360 gagaaggtta cgcgccccaa cacacaactg gtggacggcg tcatgcgagt ctgtgttcgt 3420 ctgtcgaaca tctcgcactg caaatggacg cttcgatcgt gtgagctcga agatctgctc 3480 gctttccatc cggaaaagcc aagaatttgg tcaaacctta aattaagcac tcgctggagt 3540 gcgccccgac gctgctgcgt taacctttct tgaagtttaa gttggtgttt gtagtctgta 3600 tgtgtgcatg tttgttgttg aagaaagctc gctggacgct gtccagtttc gctactcggt 3660 agacacctgt ctgtccgagc tcagcgcaca aggcgctcac ctggttatgg gattgcatgt 3720 caagtatagt cgctaatcac ctctatcatc ggtagcacac ccacccacgc taccacatct 3780 cactggagga agcctaggtt cctgccagtc ggtctcggct gctcgttcga cagtgggctt 3840 ggccagctca cgcgacgatc gacatcccga ccacgaagtt ttgaagtcga agtttcatcc 3900 ctagccagcg aagaagaaca acgcccccag gaattcaccc atgatctgcg gatcgagcat 3960 cagaattcac ctggtggggt aagttgttcc agtttacgca cgctacatac acccactaca 4020 tccaagtttt gttcgaggca caccaccaac ttaaccaccg acaccgacgt ccgaaagtcc 4080 accagcatga agcgcatcca caccggcgat cagcacaccg gcgatcagca caccgacgaa 4140 cgacacacca ccagcgtaac atctttatgg cctgcgggcc cagtatcaac gttacctggt 4200 tgggtaagtt gttccggttt tacatttttt tctgcacaca ctacatcgca tctctcccga 4260 ggaaatctgc aagaagaaca acgtcatcaa gatggcatcc acaacgaaga agaagaaaga 4320 aggcatcatc cgatgcaact gcattccgtg tgcattcagt ggtcgaccgt ttgtcccacc 4380 cgaagcagag catgttccgc tggagcataa ccccccaccc acaccgaaga agcgcagcgc 4440 gaggctgcac cagaagcacg gttcgcgatc gtggcacacg gccagcgatc tggcgaatga 4500 tgacgacgtg aagcagaggc tgcaaaggca gcaaaacaca ccaccacgac cgctactcgg 4560 cggcgacccc acgagaaggc tgcaccagca gcaacgaagc agcagatgcc gttccaggga 4620 gcagagatga agaattgcag cagagaatgc accgtcgtcg tccacgatgc aatccacgag 4680 aacagcaacc accccacaag aagttctgtt tacagcagaa gatgctaaga tcaagtgatc 4740 gtattgaaat tattgaatgt ttacgtttgt ttatttttga gttcaattag tattagtgtt 4800 atcgagtagg tagtaagttt tctttggttg aaaccactgg tttcaaggcg ggcggca 4857 // ID BEL-38_CQ-I repbase; DNA; INV; 5693 BP. XX AC AAWU01044769; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-38_CQ_; KW BEL-38_CQ-LTR; BEL-38_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5693 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 229-229 (2011). XX DR Genome; AAWU01044769; Positions 15984 21676. XX CC Positions [4715-5296] - Integrase core CC 'CAAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 383..5692 FT /product="BEL-38_CQ-I_1p" FT /translation="MVLHTPMKQPVNTDEVKSLIHQRGQVKGKVTRIKSAL FT DEAKKNPQRITKATLKVYEKKLEAHYQEYVLRHREVIKAVDKKEEQDDVLD FT VFDQLHTETLVLVEELMEMFNQPVAGPAPFQVGANGVAPQVIVQQQPLRAP FT IPTFDGQTENWPKFKAMFEDLVGRTRDSDAMKLHHLDKALVGDAAGLITAK FT MIQDNNYEQVWKEISEQFENQRVIVDTHIDGLLQLKPITKGNHKDLQALTK FT ACDRHVAGLQYQGLVVDRLSGLIINKLVVGCLDDSTRQQWERTQKQGELPD FT FNQTLQFLKTECQVLERSQNSRVLSSAKEQSTTKPSCPKPTQKSHTATSAK FT VAKSCLVCGADHRHFECSKLLAMTVPERNAKVSELKLCFNCLRAGHRLSSC FT SSNLKCQTCHKKHHTLLHNDSFSKAQSVRPIPSLQVTNPTTPQSESPNLPT FT STTLSQNLQQQIPLNSSCSSNHSQSLKTVMLLTALVLIENDGDPVPCRALL FT DSGSQVNFLSERIADKLKAPRESLYVPIAGVGGSKMYAREKVRVTVPSRYS FT AFAASIACLVVPKVTGIIPGSKIDVSSWSIPVGIQLADPEFNVPERIDMLI FT GASMFFSLLKSGQLHLSDDLPELRETHFGWVFSGEIENSVNHSHIANNASL FT DVLNQTISKFWEVEDLADPMPPPSDVDECEKLFVETHRRLPSGRFVVRLPF FT REDVSGLPDNRSLALRRFLLLERRLNREPSLKQQYAKFIDEYETLGHCREV FT DESKDVDQHRYYMPHHAVLRPTSSTTKLRVVFDASAKQGSTAKSLNEVLHV FT GKPVQSDLFSILLRFRKNPIAFTADIEKMYRQVLIDSSQTRFQRILWRSDS FT SKPIRVLELQTVTYGTAAAPFLSTRCLVQLCQDEREKFPLAAEVVRDDCYV FT DDILSGASSADEAVVVQRQIRKMLESGGFRVHKWSSNCQEVLSSVPESDRE FT KLVCLDQGNEVVKALGLTWCPQSDEFMFVVRLPKDVTEFTKRSIFSEIGRL FT FDPIGFLSPVIVVAKLYMQKLWLIELPWDAKVEEDLLASWLCFRRALPKVS FT EIRIPRYVISPSTVAIELHGFSDASVVAYGANIYVRCILSDGRAQLRLLCS FT KSRVCSPKKDLNIHRKELMACELLARMMVRVLETVKFKVSQVVLWTDSQVV FT LAWLKKSLAKLDVFVRNRVAKILELTAGFIWKYVRSKDNPADIVSRGMLPE FT ALMSKSEFWEGPFFLRIEVYDEEIPPEIPDDELPELKLGPIIATPVFNEDQ FT LPVFTRFSSFRKLQRVMAYVQRFIQNCRQKDPSCRVFMIHPTIPELRASLE FT LIVKIIQHEALSDEIHRVQNDEPCKKIAGLHPFYQDGVLRVGGRLQQSSLL FT FHSKHPYILPKHPVVDLLIRAYHEENLHVGPKSLIAALRTRFWLIDGRSSV FT RRITHRCVTCFQARPKVASQLMGNLPAYRVNQALPFEVTGVDYAGPVYVKE FT GRYKPKIVKAYISVFVCMVTRAVHLELVSDMSTETFLAALKRFVGRRGLPR FT EIHSDNGTNFRGAKTELHELYELLKSQVTVDEIAQFCQPREINWSFIPPEA FT PNFGGIWEAAVKRTKFHLKRTLKEAKLSFEEYATVLTQIEGILNSRPLYAT FT SSEPNDSEILTPGHILIGRPLTAVPEPSCEGVQINRLIRWQYMQRLRDEFW FT RKWHRDYLQTLQPRGKNRVKSANIKPGMIVLLEEKDAPVQVWKLGKITKTF FT PGKDDLVRTVEVQIGTVVYKRAIHKIAVLPIIDNANLVKVTEIPSLPGGS" XX SQ Sequence 5693 BP; 1454 A; 1369 C; 1455 G; 1415 T; 0 other; tggtccttcg agccggattt gtgtgaagtg ttcggaccta caaggtcctt cagaccggaa 60 ttgtgcgaaa agtgctgacc ttcatggggt tttaacccgt tattctgtga acgttcaatg 120 agagtcgcca accggaagta aaagtgatcc gtcaacccgg acatgtgaaa gtggccttta 180 gcggatgcga gaaagaactg acaagtgggt ctaaagagac caaagtgaag cacagccaaa 240 gtgttttaca aaccaaaaac tcgtgtgcca aaaaagttgg caaagtgcgt ccacgccgtg 300 ggcaaaaagt tgcaacaagc aaaaggttgc aaaagtgaac tctaactttg agacacaagt 360 gaagtgaagt gagagcgcca aaatggtgtt acacacgcca atgaagcagc ccgtgaatac 420 agatgaagtg aagtcgctga ttcatcagcg gggacaggtc aagggaaaag tgacgagaat 480 taagagtgcc cttgacgaag ccaaaaagaa cccgcagagg ataaccaagg cgacgctgaa 540 ggtgtacgag aagaagctgg aagcgcacta ccaggagtac gtgttgcgcc atcgtgaagt 600 gatcaaagcg gttgacaaga aggaggaaca agatgatgtg ctcgacgtgt ttgaccagct 660 tcacaccgaa acgctggtgc tggtcgaaga gctgatggag atgttcaacc agccggtcgc 720 tggaccggcc cccttccaag tcggtgccaa cggagtggcg cctcaagtga tcgtgcaaca 780 gcaacctctt cgagcaccga taccgacctt cgacggacag acggagaact ggccaaagtt 840 caaggccatg ttcgaggacc tcgtcggtcg cacccgcgat tccgacgcca tgaagctgca 900 tcaccttgac aaagccctgg tcggtgatgc agctgggttg atcaccgcaa agatgattca 960 agacaacaac tacgagcaag tttggaagga gatcagcgag cagttcgaga accaacgtgt 1020 catcgtggat acgcacatcg atggattgtt gcagctcaag ccaatcacca agggaaacca 1080 caaggacttg caggcgctca ccaaagcctg tgatcgccac gtagccggat tgcagtacca 1140 aggactcgtc gtcgacaggt tgtcaggact tatcattaac aagctagtgg tagggtgctt 1200 agatgatagt actagacagc agtgggaaag gacgcagaaa caaggtgagt tgccagactt 1260 taaccaaaca ttgcaatttt tgaaaaccga gtgccaagtc cttgaacgtt cccaaaattc 1320 gcgcgtcctg agctcagcaa aggagcaaag caccaccaag ccatcttgcc ccaaaccaac 1380 ccaaaaatcc catactgcca catctgcaaa ggtcgccaag tcatgtctgg tttgtggggc 1440 ggatcaccgt cactttgaat gttcgaagtt gctggcgatg acagttccgg agcgcaatgc 1500 caaagttagt gagttgaagt tatgttttaa ctgcctccga gctggtcatc gactttctag 1560 ttgctccagc aacctgaaat gtcagacttg ccacaagaaa caccacacct tgcttcataa 1620 cgactctttc tccaaagctc aaagcgtaag accaatccca agcctccaag taacgaaccc 1680 tacaacacct cagtctgaat ctcccaacct tcccacgtcc actacactga gtcagaacct 1740 gcagcagcaa atccctctca actcttcatg ttcatccaac cattcccaga gtttgaaaac 1800 tgtgatgttg ctgacggctc ttgtcctgat cgagaatgat ggagatcccg ttccttgccg 1860 tgcgcttctt gatagtggtt cgcaagtgaa ctttctctcc gagcggattg ccgacaagtt 1920 gaaggcccct cgtgagtcgc tgtacgttcc gatcgctggt gtaggtgggt caaagatgta 1980 tgctagggag aaagtcaggg ttaccgttcc ttcgagatac tcggccttcg ccgccagtat 2040 cgcatgtctg gtggttccaa aggtgactgg aatcatcccc ggatcgaaga ttgatgtttc 2100 ctcctggtcg atcccagtgg gaatccaact tgcggatcca gagttcaatg tccccgagag 2160 aattgatatg ttgattggtg cctccatgtt tttcagcctg cttaagtcag gacagctgca 2220 cttgtcagac gatctgccag agttgcgcga aactcatttc ggctgggtat tttccggcga 2280 aatcgagaat tctgtgaacc attcacacat cgcgaacaac gcttcgttgg atgtgctgaa 2340 tcaaacgatt tccaagtttt gggaggtcga ggatcttgcc gatccaatgc ccccaccctc 2400 tgatgttgat gaatgtgaga aattgttcgt ggaaacgcat cgtcgtttgc cgtcaggtcg 2460 atttgttgtt agattaccgt ttcgtgaaga tgtttcaggg ttgccagaca accgttcgct 2520 cgctcttcgt cggtttctgc tgttggagcg tcgtctcaac agagaaccaa gtctgaagca 2580 gcagtatgct aagttcattg atgagtatga aacccttggt cattgtcgtg aggttgatga 2640 gtccaaggat gtagaccagc atcggtacta catgccccac cacgccgtcc tgaggccgac 2700 gagctcgacc acgaaactgc gagtagtgtt cgacgcgtcg gccaagcaag gttctacggc 2760 gaagtcgctc aacgaggttc tccacgttgg aaagccggtc caaagtgatt tgttcagtat 2820 cctcctcaga tttcgtaaaa accccattgc attcactgcc gatattgaaa aaatgtatcg 2880 gcaagtgctt atcgactcgt ctcagacgag gttccagaga attctgtgga ggagtgactc 2940 ttcaaagccg attcgcgtgc tggagttaca gacggtaact tacggtacag cagctgcccc 3000 gttcctctcg accagatgtc tagttcagtt gtgtcaagat gaaagagaga agtttccgct 3060 agcagccgag gtggttcgtg atgattgtta tgtcgatgat atcctttccg gtgctagttc 3120 cgctgacgaa gctgtagtcg tccagcggca gattcgaaag atgcttgagt ctggaggttt 3180 tcgtgtccac aaatggagct cgaattgcca ggaagtgttg agtagtgttc ccgaatctga 3240 tcgcgagaaa cttgtgtgtc tcgatcaagg aaacgaggtc gtcaaggcac tgggtctgac 3300 atggtgtccc cagtctgacg aatttatgtt cgtggtgcgt cttccaaaag atgtcaccga 3360 attcaccaag cgaagtattt tctctgaaat cggcaggttg tttgatccca tcggatttct 3420 atcccccgtg attgttgttg cgaagcttta catgcagaag ttgtggttga ttgagttgcc 3480 atgggacgct aaggtcgaag aagatcttct ggcgtcgtgg ttgtgtttcc gtagagccct 3540 ccccaaggtt agtgaaattc ggattccacg ctatgttatc agcccgtcga ctgtcgcgat 3600 tgagcttcat ggattttccg acgcttccgt tgttgcttac ggtgcaaaca tttacgtccg 3660 gtgcatcttg tccgacggta gagctcaact tcgcctcctc tgcagcaaat caagagtttg 3720 ttcgcccaag aaagatctca acattcaccg caaagagttg atggcgtgtg aactgttggc 3780 tcgcatgatg gttagagttc tcgagactgt caagttcaag gtaagccaag tcgtactttg 3840 gactgacagt caggttgtcc tcgcgtggct taagaagtct ttggcgaagc tcgatgtgtt 3900 cgtccggaac agagttgcta agattcttga gctcaccgca ggatttatct ggaaatatgt 3960 caggtcaaag gacaatcccg ccgatattgt ttccagaggt atgctccccg aggctctcat 4020 gtcgaagtcc gagttctggg aaggcccatt cttcctgaga atcgaagtgt acgatgagga 4080 aatccccccg gaaattccag atgatgaact ccctgagttg aagcttggtc cgatcattgc 4140 aaccccagtg ttcaatgaag accagcttcc ggtgtttaca aggttcagtt cgttcagaaa 4200 gctgcagcgt gtgatggcat acgtgcagcg tttcatccaa aattgtcgtc agaaagatcc 4260 gtcttgtcga gtgttcatga ttcacccaac gatccctgag ttgcgcgctt cgctggaact 4320 gatcgtaaag attattcagc acgaagcact cagtgacgaa atccaccgtg ttcagaatga 4380 cgaaccgtgc aagaagattg caggtttgca cccgttctac caggacggtg tcttgagagt 4440 tggaggtcgg cttcagcaat cctcgctgtt gtttcactcc aaacacccct acattttgcc 4500 gaaacaccct gtagttgatc ttttgattcg cgcttaccat gaagaaaatt tgcatgtcgg 4560 tcccaagagt ctgatcgctg ctttgaggac aagattttgg ctgatcgacg gaaggtcgtc 4620 agttcgcaga atcacccatc gttgcgtcac gtgttttcag gctcgcccga aggttgctag 4680 tcagttgatg ggaaatcttc ctgcatatcg tgtcaaccaa gctctgccgt tcgaggtcac 4740 tggtgtggat tacgcaggtc cagtttatgt caaggaaggt cgatacaagc ccaagatcgt 4800 aaaagcttac atctcggtgt ttgtttgtat ggtcactcga gccgtacatt tggaactcgt 4860 ttcagatatg agtacagaga cttttctcgc agctctgaaa cgcttcgtag gtcgtcgagg 4920 acttccccga gaaatccact ccgataacgg aaccaatttc cgcggagcga aaaccgagct 4980 acatgagctc tatgaattgt tgaagtccca agtcacagtc gatgaaatcg ctcagttctg 5040 tcagcctcgt gaaattaatt ggtcgttcat tccccccgaa gcccccaact ttggtggtat 5100 ttgggaagcc gccgttaaga gaaccaaatt tcatttgaag agaactctca aggaagccaa 5160 gttgtccttt gaagagtacg ctacagttct gactcagata gaaggaattt taaattcacg 5220 tcctctctat gctacctctt cagaacctaa tgattcagag attctcaccc ccggacacat 5280 tttgattggt cgtcccctga ccgccgtacc tgaacccagt tgtgaaggtg ttcagataaa 5340 tcggctcatt cgctggcagt acatgcagcg attgcgcgat gagttttgga gaaagtggca 5400 tcgcgattat ttgcaaaccc ttcaaccccg cggtaagaat cgagtcaagt ccgccaacat 5460 caaaccaggt atgattgtct tgttagaaga aaaagatgcg cccgtgcagg tttggaagtt 5520 gggaaaaatc acaaaaacct tccccgggaa agatgatttg gtgcgtacgg tagaagtgca 5580 gatcggtaca gtagtgtaca agcgggcgat ccataagatt gctgttctgc caatcattga 5640 taatgcaaat ttggttaagg tcacagagat tccatctctg cccggcggga gta 5693 // ID SMAR15 repbase; DNA; INV; 2127 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR15. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2127 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1073-1073 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 318..1409 FT /product="SMAR15_1p" FT /translation="MEKERLTKKRKKIVLTIKQKLTLIERFEKGESTSKLS FT EEYGIGIQTVRDIVKQKNKLEAFARDCDSSAGPSKRKSMKTSTFEDLDAAM FT LIWFNQKRAEGIPISGQMCIEAAKTFHENLGIKESFNASSGWMTRFKQRHG FT IRQLTIQGERLSSNAEAADEFCVEFQEYLQRENLQPDQIYNADETGLYWKC FT LPTKTLASMKEKSAPGHKSSKERITVMCCGNASGTHKMKLLVIGKAKKPRS FT FKGTEIKNLPVDYYSQKEAWMDREIFEDWFKKKWVPEVQSFLKNKGLPQKA FT VLLLDNAPSHPHESILKTNDGLMISNFFHPNVTSLIQPMDQGVISSMKRLY FT RQKLLKTLVEEDDNLINFWKK" XX SQ Sequence 2127 BP; 761 A; 304 C; 405 G; 656 T; 1 other; tacagtgtac tctcgattat ccgttcatgg attatccggt tttcggatta ttcgtgcctg 60 aaaataaata ttttttaatt caaatttaat ttgattgatt tttttctatt acgattataa 120 gacgcattat ttaatttcga tcttgaatat gcgcgttgtt tgaaagaaat acgtataggg 180 acacacagaa acgtcncagt ttttttaatt tcttttgttt gattaattgt aatttctttc 240 tgtgtgatat tatttttctt gttatatgta atctcaaaaa ttcatatttt agctctacat 300 atctgtaccg tttaaaaatg gagaaagaac gattaacaaa aaaacgaaag aaaattgtac 360 tcacgataaa acagaaactt acattaattg agagattcga aaaaggagaa tccacgtcaa 420 aactttctga agaatatgga attggtattc aaactgttcg agatatagtc aaacaaaaaa 480 ataagttaga ggcgtttgct agagattgtg atagttctgc aggtccttcg aaacgaaaga 540 gtatgaaaac gtcgacattc gaagatttgg atgcagccat gctaatatgg ttcaatcaaa 600 aacgagcaga gggaatacca attagcggcc aaatgtgcat cgaagcagcg aagacatttc 660 atgagaatct aggaattaaa gaaagtttca acgcctcgtc aggctggatg acgaggttca 720 aacagcgaca cgggattcgt cagcttacga ttcagggaga acgtctcagt tcaaatgctg 780 aagctgctga cgaattctgt gtcgaatttc aagaatattt gcaaagagaa aatttgcaac 840 ctgatcaaat ttataatgca gatgaaactg gcttgtactg gaagtgctta ccaacaaaaa 900 cgctagcttc gatgaaagaa aagtccgctc ctggacataa atcgtctaaa gaaagaataa 960 ctgttatgtg ttgtggaaat gcatcaggaa ctcataaaat gaaacttttg gtgattggaa 1020 aagcaaaaaa accacgatca tttaagggaa ctgaaataaa aaatcttccc gtcgattatt 1080 atagtcagaa agaggcatgg atggacaggg aaattttcga agattggttc aaaaagaaat 1140 gggtcccaga ggtgcaatca tttttgaaaa ataagggatt gccacagaaa gcagttttgt 1200 tattagataa tgctccttct catccgcacg aaagcatatt gaagacaaac gatggactta 1260 tgatttcaaa ttttttccac cctaatgtta catcactgat tcaacccatg gaccaaggtg 1320 ttatatcatc aatgaaacgt ctttatcgtc aaaaacttct gaaaactctt gttgaggaag 1380 atgataatct gatcaatttt tggaaaaaat gacggtattg gatgctatac atgggatagc 1440 acaatcttgg tcaaaagtaa agccagtaac acttcttcga tcatggagaa agattcttcc 1500 tgacatagaa acagatttaa tggattcgga agagaatgaa gaaaattttg tatctgaaat 1560 gtgtggatta ttggaaaatt taaatttttt tgaagacgta gataaggaaa acatagagga 1620 gtggctcaat catgatttaa atgatccagg ctttgaacgc atggacgatc ctgatatcgt 1680 ttctcttgtt actcaaaagg aagaaaatga tagtaatact acaagcgaag aagataccag 1740 tggctacgtt agtcatgaaa cggcatgaat tgtattgaaa cattgctaga ttacgctgaa 1800 caaaggggaa ctgaatataa taatgtaata gctttaagaa tatttcgaag tgaaataaga 1860 aattcgttaa aaaaatctca aaaacaaatg aaaataacag atttttttaa tttgtgtaca 1920 tataaaagta taatttttta tatatatttt tatgttcgta tatatgataa aatattatat 1980 ttttacttat ttgtttgtgt tttcttgata aataatgaat ataaaaatag tgtgttaaaa 2040 atttgtgttg tgtatatgat attatatatc tgattatccg tgaccatcct ccccagcatt 2100 agcccggata atcgagagtg cactgta 2127 // ID Gypsy1-SM_I repbase; DNA; INV; 10299 BP. XX AC Contig17113; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1-SM_I; KW Interspersed repeat; LG_I; internal portion. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-10299 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-10299 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 757-757 (2007). XX DR Genome; Contig17113; Positions 1081 11379. XX CC Positions [6664-7143] - Integrase core CC 'CATC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 6679..8226 FT /product="Gypsy1-SM_I_1p" FT /translation="MELCAMDFLCKLPTTINGNKHILVFSDYFTKWAEGFP FT NKDETAETVAKLLVEKIIFKIGTPRKLLTDKGKNFMSETIKRVANMFNMHK FT INTTAYHPQTDGLVERFNGTLLSRLAVFVGKNQNDWDEYVSQCTYIHNITP FT NETTGYSPYFLMFLREPATPIDTVFEPRVSQYLDAPDYVTLMQERMQEVYK FT VANLKVKFRQEQYKDDYDKTSKPHTFKVGDKVRLMSPQVGKGKTTKLARPF FT KGPYEVVKVINENLHIQKHKNEDPIVVHVNRCKLAESEIIEGPLHRYNLRS FT QKGLMMSAIFLTIIGIINTTNVKEKYKTEFMGDRDILINISPHEIKLFNLK FT KKPIGDLSLESTDIKPKLMEKRERKTTHEFKRGDNVRFINLEPNTKYVLCA FT QLDTDYDISQIQCKKVKTIQTKKHKMTKKVIINTTPIIIQKINSSPNMNTT FT TTKVTTQTIVDEVTTKTYKIPIQLKEKLIKLPTEKPIVIKSWNELQAFQKQ FT TVKPLWPFYVISASILARAI" XX SQ Sequence 10299 BP; 4325 A; 1587 C; 1885 G; 2502 T; 0 other; tttggtgcct cgaaccggga ctttaaaata aattattgtt atcaacaatt tgggggatag 60 gaattgcgat cagagggtta gtgtagaaat atattgggaa aatatattta taataaataa 120 tattattgac cataatttaa aatcataatc tgtaatctaa aatagataat ccacctgaaa 180 taatttataa tctaagaaat aggatcgtat ataggatgaa taataatcga aataataata 240 attttattaa aatgttacca attgaacctt ataaaggaga acctgaaaaa atcgacgatt 300 ttttcactga aattactcag tatgcagacg cttgtggttg gacacctgat gaattaaaga 360 agagaattcc gttgtatttg aaaggatatg cgttacaggc atataaccat ttagagtgta 420 gaggaaacaa tcttgatcaa ctcatagaaa atttgagagc agagatagta atcgacagtg 480 atttacaaaa aataaatgta taaaaattta gaaatcgaaa tcaaggatca agagagtcag 540 ttggagagta cgtgtatacg ataacagaat tagcaaaaaa ggcatttggt gccggagagc 600 acactccaaa attaattgag gatcagtttt ggaatggaat ccagaaatat ttatatagag 660 cattaataat ggttgagtat agagatttga atgaattagt tcaaaaagca aaaagagtcg 720 aagcgataca aagagataga ttcaatgcat ctatagatgc agttgagatt caaaagccaa 780 cgatagcgaa acctaagttg gagaaagatt atgatgttcc atgggaagaa atgaaacagt 840 tatatttaaa ttttgtaaaa aatagacgaa aaaatagtcc gaactcgagc agaagtcaat 900 cacctgaaaa caggaaaaga ttcaacgcat tgaatttgac agaacagaac caaaagcgat 960 ttagagacga taatagagat agtcgttctg actataacag aaacaattct tatagaaaca 1020 atcgttattt taatagagac agtcgttctg actataacag aaatagtttt agagataatc 1080 gttataacaa aaatagtgga gacagttatt ctgggaataa tagagataat cgttccgaat 1140 accgccagta taacagaaat agttcaagag gtagacaatc gagtccatat agacaaaata 1200 aatatcgaga gacgcgagca agcccgtata atgatgatgg aagggataaa gtgaacttca 1260 atggacaaat caaatgttac aactgccaga gatatggtca tttagcaaaa gtatgtacta 1320 atcagcgaag tgagtcgcca aatagagtaa aatttaaagc gagcctgaag taaaacaaat 1380 aaatggagta gaattttata atacgagtaa tgagataaat acgttgaggg aggaattaaa 1440 agagtttaaa actcaatatg gccgacttat ggagagaaag tcggaggcgt tacttaacgg 1500 gtatgaatgt tctataaatg aagttgatat tgataatgag gaaactgaga gacaaataaa 1560 aatgaaagag gaattaaaag aagttttaaa aaaacgaccg tatgatgaca atgaaacatt 1620 gagagtaata aataaagttc aagtaaacag aattgatcct gaatatttaa cccgtttaat 1680 cccgttattg gtaaagagac aaacggaaac acgagtattt gaaatagaaa tagagctaaa 1740 atagatgatc aaaagttacg gagaatttgt gataaaaagt ccgaaatttt acgaaccttt 1800 aaaaataaga ttggatacta aggacgaata tttctttcaa atgctattat cagaaacaga 1860 agtccatgtg tgtacaatgg taaacaaaat acgacaactg gcaagagata aatggcctcg 1920 aaaaatagga ccaaatgatg aaataaaatt atattttaac gatatttagt tgaaagatgg 1980 cgaatacctg attgactatg gaataaatac aatagatgag aatactatat ctgcagcaat 2040 aaatagagaa actacctcaa gagcagcaac tccttttgtt gaactaaatg agaaaaacga 2100 ttatgatatg ttatttaaaa tgtggtatca accatccgaa gggagtacaa agtcaccact 2160 cgaaaaagag ataattttcc aaccattgga taataaatac ggtaacttgc aggagacgat 2220 agaaatgatc tccaagcaag caatgatgca agatgctgaa acaatgaaac aagatagagg 2280 acgacctcga aagtcagaaa tagcggagaa atcagatgat cgaaaaacgg cagagcgttt 2340 gagagagata gccaataaaa agagacacga gaatccgaaa gtagcagacg ataataacga 2400 cataattatg ataattgaac tactaaaagc aatgaaccta aaacgatatc cggaaacaaa 2460 accatcaaag ttaactgaaa atatggttca gaaaatcaaa tttatggaga aagaggtcag 2520 agagttgagt gcagaggatg taaatgagga agtggtttta aagatactct tcacagctgg 2580 tataaataga gacaaagtaa ttacggcaat ggcatatcta attgcgagat ataaaaagct 2640 cttaagatca aatccaaaag ggcaaataga tgaggtagca aaaacaattt atcagagatt 2700 aaaggctgat atgatgccat ggataaaatc gaaaggaggt tggataataa tgtatgatga 2760 gacattaaaa acaagtgaaa gcgagacgat agaaaatgat gataccgagg aaattgtgaa 2820 aaatagcaga ctcagaaaca gaggccatga gatttgtaga aaaaggggaa actattaaat 2880 gcggaaaggc agttgcaaag attaattatg aaggtattta tattagtagt aatgaaatta 2940 aagacgcaaa acaaaaaatc gataaatttg atgtcaaact atcttcgtat tttaataata 3000 aaattgatta tttttatcat catcaattag tccaattaga taaagtatat caagcaacga 3060 ttattaacga ctgcaaactc aatagagaga tattgagaac aaagatggct gtagcggtta 3120 caaacccaga tttgcggaaa aaggaacttt tgcgagagtt gtaggagaat tgttacagac 3180 ctttcaatgt aaaccggtta gcgtgtcact agcaacgaat aaccaatgta ccaacgagtt 3240 accagttatc tataaaggag aaacagtata tttacagccc ataaccagga tactaacaga 3300 tgaaacttat attccaagaa aaatcgataa atgtaccaat ctgcttgatc cattatatca 3360 actaaatgac gagatgtgga ttacgatgtc cgaaagaaaa taggcaacta aaccgtttaa 3420 actagaatta acagaactcg aaaagaagtt agaatttcaa gaaataaacg atatgaataa 3480 taatgttatg tatacacgag atgccataga aagtgcaaga aagcacatgt tattccctaa 3540 cgaaaaagag aaaatattat caataatggt gagtaaagtt atcgagggta gtgatggtgg 3600 tgattataac ttcgatgtgc tgctatcaaa ggaacatttt aaaaaggtgg tatataaagt 3660 actatacagt atatggggat attttgcagt tttggggaat atgttttcaa ctatattggg 3720 aatatactat acagtagcat tactcaaaat gatctgttca tcattagtat tattacgaca 3780 acttcgacaa gtatttggta actcatataa gatgttggaa tgtttatgcc cctttgtcgc 3840 aaaatatctg ataacaacga aacatgataa aaaaatacga ctaataaaaa ctctacgaat 3900 ggaagaggaa caattgatag aaaacaaaaa cgacgatcat ccaagcggat cagaaaatgt 3960 ggaacagcca caaggtggac tgtatgacaa tcaaaatcag caactaagag aattaagcag 4020 taattggaat acaataacca actatcattg taacaagaaa agtttacgga tgctacgggc 4080 tagaaccaaa tgagaggtta ttaaaactca atgaagccag accaagcatc caagtgtcga 4140 tcgataatag gatcataaaa gctctagtag ataccggagc agcaacctcc atgataaggg 4200 aggatcaatt atcagaaaat cagagaaaga gtataaagag cacaaacgat acgatgatat 4260 ctatgtcatc gcataccatg caactaagag gagatatcaa actgaccgtt cacttctcaa 4320 atcaaacagt taatcacgta ttcaaagtaa tgactgaatg tagaagtgaa tgcatcttag 4380 ggatagatat ccttcgcaag ttagataact gcgtcatcga tttaactgat ggtacaataa 4440 aaagaaaagc aaagtatcag ggtcacaatt tgcggaagta atagcattag aagccttcta 4500 attagaatat cactgcgatc atataattcc tgtcaaagtc aacgtcaaaa gtgaggacac 4560 ttttattttt tagccggata agtcaattac aaagaattat aatataatgt taaccgatga 4620 gtgtgttata ccaaaaaacg ggataatccc aataagaatt gctaattaca aaagaaaaat 4680 gtaaaaattc tcaaaggaac aagattggga aagttattca aaggtgaatt agaacaaaca 4740 ttggaaatgt gtgaaacgat gataatgaga gacgacagtg aacaaatggt cgaaccgagt 4800 aagataaata acgttaaatt ctatgataaa gtaaaaatta ataacaacta tttaaatgaa 4860 acccaacaat ttcaattaca aactttactc gatgaatata ttgacatatt tgcaaaacat 4920 gagttcgatt tgggagatac tgacatcatt aaacatgtaa tcgatacggg aaacgccaga 4980 ccaataaaac aacgacctta tggggtacct tataaactaa gggaagagat aaagcgacaa 5040 ataaaagagt tgaaaaaggt aggcgtaata aaacaaagtt tttcacaatg ggcatcaccc 5100 atcgtaccag ttagaaaaaa agatggtgaa attagaatgt gcattgacta gcgtaaatta 5160 aatgacgtaa caattaaaga ttcataccca ttcccgaaag cacaagagct atcgataaac 5220 tcagagacac taaatacttt acagttatcg acgcaagtaa agggtatatg caaattcaaa 5280 tggaagaggg ggatgcggaa aaaactgctt ttgtcattga agatggattg tacgaataca 5340 ccagactacc ttttgggtta actaatgcac ccgcaacatt tcaacgccta atgaatacaa 5400 tactggtaga tgtacaacat tgcgcagttt atatgaatga tatattaata gcatcaaaaa 5460 cctttgagga acatttaaaa gacatagcaa atgtgttaca gagactaaaa aggccaggat 5520 taaaatcaaa ccaacaaaat gtaaatgggc agaacaatca gcgctatttt taggccacat 5580 agtcacaata gagggcataa cacctaaccc agataaaatt gaagttgtta aaaattttcc 5640 agtcccaaca acgttacaac aaatacaagg atttttggga ttgaccgggt actaccgtaa 5700 attcgtacaa gattacgcaa aaatagcagc accaatgata gaactaacaa agggagttaa 5760 aacaaaggga gagagtaaag gagtattaat agtacaaaca atcaacgata aaataagcaa 5820 accacaaaag aaatggacat ctgagcatca aagggcattt gatcaactaa aagaaaaact 5880 aataacagca ccaatattac gttacccaga ttttaagaaa caattcatcg tcatgacgac 5940 gccagcagga aagccgtaac aagcaggaca agaagaagaa aagggtaaag attacgttat 6000 agcatacgct agtaaaacgt taaaaggagc acaacttaga tactcaacta tagaaaaaga 6060 atgttattcc atagtatttg cattaaaaca gtttaagcca tacatatatg gaacagaagt 6120 agtgataaga acagatcata aaccattaga aggcctttgg aaacataaag atacctcaag 6180 tagacttcta aaatgggcaa tgaaatcata tataaaccag gtagaataaa caaaaatgca 6240 gatgcactat cacacattcc tgagtaacga gcactagaag tttttgcatt aataacggaa 6300 aagccaatag acatatatga tgaacaagaa agagatccag aaatttctaa gattagagaa 6360 gaaattagaa aaggaataaa cgaaaagtat aaaattgcaa aaaaactagt cgttatcgag 6420 gattttagtg gaaaacagtt tatagtagta ccacaaaaat taaggcaaat tcttttgtta 6480 caatatcatg atggtgcttt agggggtcac ttatcgagta gaaaaacaac gagcagatta 6540 ttacaaaaat attattggga taacttaaaa aaagacgtta agagatggtg caatgcatgt 6600 acaatatgcg caactagaaa gaatacaggg acaaagacaa aagcgccctt acactcaatc 6660 ccagctacaa catcgccaat ggagttatgt gctatggatt ttttatgcaa actacctacc 6720 actattaatg ggaataaaca tatattagtt tttagcgact attttactaa atgggcagag 6780 ggattcccaa acaaagatga aacagcagaa acagtagcca aactacttgt agaaaaaatt 6840 atttttaaaa tcggtacacc tagaaaacta ttaacagata aaggaaaaaa ttttatgagt 6900 gaaacaatca aaagggtagc taatatgttt aacatgcaca aaattaatac aacagcttat 6960 caccctcaaa ctgatgggtt agtagaaaga tttaatggta cactcctcag tagattagcg 7020 gtttttgtgg gaaaaaatca aaacgattgg gacgaatacg tgagccaatg cacatacatc 7080 cataatatca caccaaacga gacaacaggg tattcaccat attttttaat gtttctaaga 7140 gaaccggcaa cacccattga tacagttttt gaaccacgag tatcacaata tttagacgca 7200 ccagattacg taaccttaat gcaagagcga atgcaagagg tttataaggt agctaattta 7260 aaagttaagt ttagacaaga acaatataaa gacgactacg ataaaacgag taaaccccat 7320 acttttaaag tgggagataa ggttagactt atgtctccac aagtaggcaa gggcaaaacg 7380 accaaactag caagaccatt taaaggccca tatgaagtag ttaaagtaat caacgaaaac 7440 ttacatatac aaaaacataa aaatgaggat cccatagtag tacatgtaaa tagatgcaaa 7500 ttagcggaat cagaaataat agagggacca ttacatagat ataatttacg atcccagaaa 7560 ggattaatga tgtcagcaat cttcttaaca atcataggaa taattaatac aactaatgtt 7620 aaagaaaagt acaaaaccga atttatggga gatagagata ttttaataaa tatttcacct 7680 cacgaaatta aattatttaa tctaaaaaaa aagccgatag gagacttatc cttagaatca 7740 acagatatca aaccaaaact tatggaaaaa agagagagaa aaaccactca cgagtttaaa 7800 aggggagaca acgttagatt tattaacttg gaacccaata ctaaatacgt tctatgcgcc 7860 caattggata cagactatga tatatcacaa atacaatgta aaaaagtaaa aaccatacag 7920 acaaaaaaac acaaaatgac aaagaaagta atcataaata caacaccgat aataatacaa 7980 aaaattaatt caagcccaaa catgaataca accaccacga aagtaacaac tcaaaccata 8040 gtagatgaag taaccacaaa aacctataaa ataccaatcc aactcaaaga aaagctaatt 8100 aaactaccta cggaaaaacc aattgtaata aaaagttgga atgaactaca agcatttcaa 8160 aaacaaacag taaaaccact atggcctttt tatgtaatat cagctagtat tttagcaaga 8220 gcaatatagg cagtagttag tagtgtacaa atctacaaaa gtacaagtgg aaaaggcagc 8280 ggtatctgca ccgactggtc agaaatgtac cagggcgttt tcacttggac aaatatatac 8340 caggtcaata cacaatgtcc actcttgttt aaaaaattat ttcaaattaa tctaatatga 8400 tcaataatat tttctaaatt tgtatatatt gatatctata acttaatagc gtataaacgc 8460 aaacctcata tgcgcaagag gattatttat ttactcttca ttttagaaca tatacaattc 8520 aacacgcaat tcagtcaatc taaccaatcc aaatcgcaat ccaatcaata caactgcata 8580 ataaacaaac attatggaag tttttataga aacttacaga ataaaatttg cacagacgga 8640 accccaatca ctatccgcag ctttccgaat caagtgtgca tcccaaacga catgaaagta 8700 gaaattgaag agagagagct gcatctaatc aaattcaacg aagctagaag agtgaaactc 8760 aaagagctta tcggagcatt cgacgaaatt gcaaatgtgg acaatctgca gctgaaggaa 8820 aaaaggtgcc ccatcgaata attttccata tgacaaaaac agaggtctct atttgtcagc 8880 aaagaatcca actaatcacc acgagcaatt gttcaataat gataaagact atcaaaaata 8940 ggaaaatgtc aaatgaattt aataacctta tgattcaaat gaacaattta ccatgcctca 9000 tggcaattga aattctctat ataccaacaa taccaaaccc tttcgtgaat ctggaatata 9060 tcatatcgag tctacacatt cctcaagagc ttgttagcaa atgtgtatcg ttcacacacc 9120 aagaattaac gacaaaggag agacaagaaa agataaaatt cgtagatgtt aaacttgttt 9180 gcgacattaa caacaatcag agaccagtag aagtatgtat cagagatttc aacgaaaccg 9240 ttctattcaa caagaaaatc acaccgagga caaaaatact ccattactat ccggaaaaaa 9300 caagagtaaa caaggaaaat attatttgta acgagggcga ataccaagca ttagtagata 9360 ttcgaaaact cctatatgga caaatagttg tgggatatga cataacaaac aaattaaaag 9420 ctctcacctt ggacatctac aatgtaatgg gagtcagaaa aatggctgaa gactctttta 9480 tgtgttcctt cgtaaagaat gtagcacccc acaaaattaa actatctaat ctggcatacc 9540 agctaatcag ataccatgac aagtggttca acgacaaaat tatcaatcga atactacaaa 9600 cagcagaacc attttcagca aaacgaaaca acaatacttg taaaagttat ttatggtatc 9660 acagagcaaa agtgggaata ggttgttaaa tgggattcct caaatcaaaa agaaatgaag 9720 agggagatca agaaaaaaaa gaaacaacaa caaatgcctt agaagacaat gatatggtag 9780 aagatatgta catcgaacct tcagaatggg agttaaagtt caagagaaac ggcactagaa 9840 aaacattcga aagcgaacta atggatagcg acgacgatat catattcgtc tcagaagaaa 9900 ggccacagaa caacaaaatc tccatcaaaa gcgaaccaca actcagagaa aacaacgaaa 9960 tgcaatgtga catggattcg gatgaagaat atttttcctt cgaagaagat aatcaagaat 10020 catcacaaag ctcgacaacc gaagatagtg atgaatatgg acccgaaccc gatataatac 10080 tagaagatcc tgattttgtg atcggggaaa accagacaga agtatcacat aaagaactgt 10140 gccgtcattt cggtagacca ataaaggacg gagacatatt gaacttggca ctcaaaaagg 10200 gattcagacg caccactatg aaaataaatg actgtgaaat caaactagtc cttagtgaca 10260 aaggtactga ggacattact ttaacgaccc agggaggaa 10299 // ID Copia-54_AA-LTR repbase; DNA; INV; 248 BP. XX AC supercont1.273; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-54_AA_; KW Copia-54_AA-I; Copia-54_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-248 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.273; Positions 1424290 1424537. XX SQ Sequence 248 BP; 69 A; 52 C; 48 G; 79 T; 0 other; tgtagaagta ctggcaacac ggcatcagtt gttttggtaa tttaagagta aaaactaccc 60 ataccacttt acagtgtatt ttgttctatt accactcgat ctacttcctg tgacataacc 120 aaaagaaaaa tcgaaaagaa taaacgtgtc ctaaaagtag tgcgttgtaa gtgttttgtt 180 ccgcagtccg attctcgata tttcgtccga ttccgttcgt tcgtgcgtgt ccgcctggta 240 aatgtcca 248 // ID Gypsy1-I_Dya repbase; DNA; INV; 4257 BP. XX AC chr2L; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_Dya; KW Gypsy1-LTR_Dya; Gypsy1-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4257 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1028-1028 (2009). XX DR Genome; chr2L; Positions 21285984 21281728. XX CC 'TATA' target site duplication CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS 124..2457 FT /product="Gypsy1-I_Dya_1p" FT /translation="MRVSEKEIKQEPSRIPISKGKQPYYTRSVAAADKQIK FT MSTPLESDCEISGNENTKEMKIVQELFATSGEQCGNARSNMQNLVATSGEQ FT GGNARNNMQNHVGTSGEQRGAAGLIESSSIDENTRTLKDLVQLLAIKIGAE FT ERNNGCSITAESFAKIIPEFDGESMPVKNWFENFEMNAAAYDLNIKQMYVQ FT PRAKMTNTAKLFLDSTYVHQYSEMRMLMETEFSRQYVCSADVHEQLRDRKK FT RKQESFHEYLLQMKKIASLGNIDVRSVIRYVVDGLNMRSDFRYSLYSCKSY FT KELQEQYEVFDRVVEKPFNPNEGKWTSSQKKQHEGQHDRKSHCFNCGSLDH FT HRKDCKSAVKCFSCNKEGHMSRDCPGSAAGVQVVRSSSRMKNATINNVAVE FT CLVDTGADVSILRKYIFNKIPNVTLERCASKLRGLGRKITCTVGYFLAEVA FT VDKEMTRHNFVVVEDEDIEYDALLGFDFVSKFDFSLSADGYKFSSPLGEKA FT CEEPNQMSIYNIINSDDELDVPPQFAKEVSLLINNYKAVDSVAEVPVRLRI FT VPDGEIIPFRQSPSRFAIAEENDVEKQINEWLDTGIIRPSTSNFASRLVLV FT NKKDGTRRICVDFRKLNSMVLRDCFPVPIIDDVLQKLQKARFFTVMDLEND FT FKNDSKKFTAFITKFGLYEFNRTPFGFRNSPAVFIRYVNHVFQELMSRDVL FT DLYMDDIIIHAETAEQCLEKIKLVFNRAGAFGLKIKWRKCHFLHTTTLRTE FT TYGLDRKRQLRSANFRCQKTSEQSKLF" FT CDS 2340..3515 FT /product="Gypsy1-I_Dya_2p" FT /translation="MSLLAHNNIEDGNIWPGQEKTAAVSKFPVPKNVRAVQ FT AFLGLTGFFRKFVLNYSQIARPLTDLVRKDVCFQMAHPQMNAFHSLKDALT FT KEPFLKLYIRAAKTELHTDASKDGFAAVLMQWFDGRLHPIQYWSKKTSEAE FT SRQHSYILEAKAVYLAVKKFRHYLLGSPFKLVTDCSAFKQTLQKADVPRAV FT TQWIVYLQDFEFAVEHRPGERLKHVDCLCRYPDRVMLVSSEITARIKVAQR FT KDEMLKAIMEILESRPYEGYKMKAGLVFKEVNGNDLLVIPKSMEKEVIVEA FT HSAGHFALQKTVQQLYWIPHLEQKAQQVVNNCVRCIIFNKKLGKKEGFLSC FT IDKGDQPLHTLHVDHLGPMDATKTVQVHICDGGFIYKVYMAVPNENYKL" XX SQ Sequence 4257 BP; 1306 A; 822 C; 1104 G; 1025 T; 0 other; tttgggggct cgtcctaacc tgaaaattgc aagtgacgcc gtcccgaaaa gtctcaacgt 60 tgccccgaga agtcttaaaa gacgaagaag aagagtttga cagacgccgg ttagcaacgg 120 ggaatgcgtg tgagtgagaa ggaaattaag caagagccgt cgcggattcc aatttcaaaa 180 ggaaaacagc catattacac aaggtcagtg gctgcagcag acaaacagat taaaatgagt 240 acgccgttgg aaagcgattg cgaaataagt ggaaatgaaa atacgaaaga aatgaaaatc 300 gtgcaagaac ttttcgcaac aagcggagag caatgtggga atgcaagaag caacatgcag 360 aaccttgtcg caacaagcgg agagcaaggt gggaatgcaa gaaacaacat gcagaatcat 420 gtcgggacaa gtggagagca acgtggcgca gctggtttaa tcgaaagcag ctcgatcgac 480 gaaaatacaa gaacgctgaa agatttagtg cagcttcttg ctattaagat tggagcggag 540 gagcgaaaca acggttgctc gattacggca gaaagctttg cgaagatcat tccagagttt 600 gacggagagt ccatgccggt taaaaattgg tttgaaaatt ttgagatgaa tgcggcagcc 660 tacgatctga acattaagca gatgtatgtt cagcctcgcg ccaaaatgac gaacacagcg 720 aagttgtttt tggactctac gtatgtacac caatactcag agatgcgtat gctgatggag 780 acagagttca gtcgacaata tgtgtgcagc gcggatgtac acgagcagct ccgcgaccgc 840 aagaaacgca agcaggaatc ttttcacgaa tacctactac agatgaaaaa gatcgcttca 900 cttggcaaca ttgatgtacg ttcagttatc cgctatgttg ttgacggctt gaacatgaga 960 agtgatttca gatactctct ctacagttgc aagtcttaca aagaactaca ggagcagtat 1020 gaggtgttcg accgtgtggt cgagaagcca tttaatccta acgagggaaa gtggacgtcg 1080 agccagaaaa agcagcacga gggacagcac gacaggaaga gtcactgctt taactgcgga 1140 tcattagatc atcatcgtaa ggactgcaag tctgctgtaa aatgcttcag ttgcaacaaa 1200 gagggtcaca tgtcaagaga ttgccctggg tcagctgctg gtgtccaagt cgtacgcagc 1260 tcaagtcgca tgaaaaatgc cactatcaac aatgtggcag tggagtgtct ggtagatacg 1320 ggagccgatg tatctatact gcgcaagtat atctttaaca agataccaaa tgttaccctg 1380 gagagatgtg cgtcgaagct tcgtgggctg ggaaggaaga ttacgtgcac agttggctac 1440 tttttggcgg aggttgcggt ggacaaggag atgactagac acaatttcgt cgtagtggag 1500 gatgaagaca tcgagtatga tgcgctgctc ggttttgatt ttgtatcgaa gtttgacttt 1560 tcgctgtcgg ccgacggata caagttttct tccccgctgg gggagaaagc ttgtgaggaa 1620 ccaaatcaga tgagtattta caacataata aattcagatg atgaattaga tgttcctccg 1680 cagttcgcta aagaggtgtc gctgctgatc aataactaca aggcggtgga ttcggtagcc 1740 gaagtcccag tgaggctaag aatcgttccg gatggtgaaa tcatcccatt tcgtcaatcc 1800 ccaagtagat ttgcgattgc cgaagagaat gacgtggaaa agcagatcaa cgaatggtta 1860 gatactggaa tcattcgccc atctacatct aactttgcta gccgtcttgt actggtgaac 1920 aaaaaagatg gcacacggcg tatttgcgtc gattttcgaa agctgaactc gatggttttg 1980 agagactgtt ttcccgttcc tataatcgac gacgttctgc aaaagctgca aaaagcacgc 2040 ttttttactg taatggactt ggagaatgat ttcaaaaatg atagcaaaaa attcacagcc 2100 ttcatcacca aattcggatt gtacgaattc aatagaaccc cgtttggatt tcggaactcg 2160 ccagctgttt ttattcgtta cgttaatcac gtgttccaag agctgatgag cagagatgta 2220 ctcgacttgt acatggatga tataattatc cacgccgaga ctgcagagca atgtttggaa 2280 aagattaaac tcgtgtttaa cagagctgga gcgtttggcc tgaagataaa gtggcgaaaa 2340 tgtcacttct tgcacacaac aacattgagg acggaaacat atggcctgga caggaaaaga 2400 cagctgcggt cagcaaattt ccggtgccaa aaaacgtcag agcagtccaa gcttttttag 2460 ggctgactgg atttttcaga aaatttgtat tgaattattc gcagattgca cgcccattaa 2520 ctgacctggt gcgaaaagat gtttgctttc aaatggccca tccgcagatg aatgcattcc 2580 atagtttgaa ggatgcgttg acaaaggaac cctttttaaa gttgtatatc agagcggcca 2640 aaactgaatt acacacggat gcatcaaaag atggatttgc agcggtttta atgcagtggt 2700 tcgatggcag actccatcca atacaatact ggagtaagaa gacgtcagaa gctgagtccc 2760 gccagcacag ttatatccta gaagcaaagg ccgtttatct agcggtgaag aaatttagac 2820 attatctgct gggatcgccg tttaagctgg ttacggattg ctctgctttt aaacaaaccc 2880 tacagaaagc ggatgtgccg agggcagtca cgcagtggat tgtttacctg caggatttcg 2940 agtttgcagt ggaacatcgc cctggggaac ggttgaagca tgtggactgc ctttgtcgct 3000 atcccgatcg tgtgatgctc gtgtcctctg agatcacagc tcgaatcaaa gtcgcacaaa 3060 gaaaagatga gatgctaaag gcaattatgg aaattctgga gagtcggcca tatgagggct 3120 acaagatgaa agctggactt gtcttcaagg aggtgaacgg caatgacctc cttgtgatcc 3180 cgaaatcgat ggagaaagaa gtaattgttg aagcccatag tgcaggtcat tttgcactac 3240 agaaaactgt tcaacaattg tattggattc ctcacttaga acagaaggct cagcaggtag 3300 tgaacaactg tgtgagatgc atcattttca acaagaaact tggcaagaag gaaggattcc 3360 taagctgtat tgataaaggt gatcaacccc ttcacacgct gcacgttgat catttgggcc 3420 ccatggatgc tacgaaaaca gtacaagtac atatttgcga tggtggattc atttacaaag 3480 tttatatggc tgttcccaac gaaaactaca agctgtgagg agacactgaa gaagttgaag 3540 atttggtcag aagtttttgg caaccccgtg aggatggttt cggacagagg ctccgctttc 3600 acagccaact tgtttgcgga acatatgaag gaaaacggaa tcgagcacgt ttggagtaca 3660 acaggggttc cacgtggaaa tggacaagtt gaaagggtga atcgtacaat tttgtcgatt 3720 atttcaaaga tgtcggcgga tgaaccagcc aagtggttta aggcagtgcc agacgtgcag 3780 agagcggtca attcacacgt acatgcatca accaagaagt caccatttga gctgctattc 3840 ggagtgcaaa tgaacaataa gcttcagctt ttggaggagg agatgtattt gaattttgac 3900 gaccaacgac agcaactgag aaaggatgca aagatcgaga ttcaacgcgc acaggaatct 3960 tataaaagga attttgacct taagcggaaa ccagaagtcg catataaagt aggagaccta 4020 gtcgctatac gccggacaca atttgttgcg ggaaagaagt tggctggaga atatatggga 4080 ccgtacgaag ttttaactgt caagaggaac ggcagatacg aagttcgaaa ggcagcaggt 4140 tttgaaggcc catgtaagac ttccacaagt tgtgattaca tgaagttgtg gaggtatgtc 4200 caggacaatg aggatgattg gtcatctggg acagatgaat agtcaggaat ggccgaa 4257 // ID Gypsy-67_AA-I repbase; DNA; INV; 4530 BP. XX AC supercont1.238; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-67_AA_; KW Gypsy-67_AA-LTR; Gypsy-67_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4530 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.238; Positions 143162 147691. XX CC 'CTATG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(210..1793,1797..4355) FT /product="Gypsy-67_AA-I_1p" FT /translation="MMAEGHEPKPGTSGTPGVIKMSMPMFGHMEPFVVGEK FT FVEYVSRLEQFFLVNEVPDNKKVPMLVTMAGPSLYSIASRICSPEDPCTKS FT YEQLIALLKKHLAPTVNVVAERYKFQKCEQSSSHTISEFIIDLKARSQSCE FT YNAFLQEALRDQFVAGVYNTNLRTKLLTEANLTFERACEIARSWEAAEQES FT KAMQGASKIAALNRRPGQKLFKPEKTKFIVQEKKKQASSPAKKSSEPSKTC FT FRCGRSHNPESCPAKQWTCFACGKVGHVSTMCRSSKQKPSNQNRVAEMSEV FT VENWKLNRLSSSSEPSEDVAAAERVNSIFVLGETESSRARIDVPAIMELEI FT EGVPVKFEVNTGACDSVISKNTYQDKLSHVKLRRTVKCFQSVSGQDIRPKG FT ELAVKVRNRSGDLVSLQLFVLQPERNRSITPLLGRSSLDILVPEWRNMVSF FT NFSSISAIQHSLKSEIQSKFPNLFAGNSNQAIEGFSADIVLKQNAAPIFHK FT PYTIPFKFRENVDQELDRMVESGILVPVRFSTASPIVITPKKDGSICICLD FT GKATLNRFISTEHYQLPLIDDVLASLSNFKVFCKIDLTGAYLQVKLSDAAQ FT ELCTINTHRGLFKYTRMPFGISSAPSLFQSIIDQILLGTVAVPYLDDIVVG FT GRTVEECKRNLFQVLDKLNQHNVQINWSKSTFFETEIEHLGFRLTADGIRP FT SSSKVEAICCAPAPKDVGQLQSFLGLLNYYHRFLPNLSTELRPLYDLLRKD FT QEFVWSSACQKAFEISKKLLVANDVLEPYDPSKPLILTTDASPYGTGAVLS FT HLVGQIEKPVCFASSTLTPAQVNYGQIHKEALAVMFGVSKFHKYLYGTEFT FT LVTDSSALKQIFNPNKGSSAVAVSRLHRWAVTLSNYSYRVVHRPGKSFSNA FT DALSRIPLPVENRIEHMSVAEHNLQNYVGIGEIRTQQHCDPILGKVIRFVK FT NGWPRVVETDLKPFAQINTNFEMEDNCLYFEDRVVIANSLQRRVLEKLHEN FT HDGVVRMKMIGRSYFWWKGFDKDVANFVKSCVTCQKTQKVPREVVESSWPP FT ANHAFERLHLDLFHFEGQTFLIVVDAYSKFIDVKLLRRTDANSVLEKLDEI FT YTYFGLPTEVCSVNGSPFNSSLFVEACKANNINVLKSPPYHPQSNGLAERG FT VQTIKTVLKKYLLDEKFKPMPIQRKLNRILFNYRNTPSTTTSRTPSSMMFS FT YVPRTLLNSSNPVKLDIQNEQKNVVKPSLRPEKSRSLVCSKTYSRGDKVMY FT RNHMDDYVRWIPAIVLEKISPHTYLINLNGNVRMVHANQIRMSDLSDKFHP FT SIPVICPIPESEQPTTKVEVPGSSTAIQQRPLNKKKSPKKKKSEKRKRSDS FT TSPKLRRSKPIRDKTCKR" XX SQ Sequence 4530 BP; 1282 A; 939 C; 1073 G; 1236 T; 0 other; aatttttggc gacgagtgaa acggaaaacg cgtctgtaat ccggaaagtg tcgtaaagtc 60 ggtcggtcga gtgcacacgt gagtggagga agtctaacct cacttttcag aacaacgtcg 120 gttcggaaca cgtggtgact gtagtcagtg cgatcatgcg tctgagatca gtgataagtg 180 tagtgtcggt gcggtgaaaa acgaacaaaa tgatggcaga aggacatgaa ccaaaaccgg 240 gtacgtcggg aactccggga gtgataaaaa tgtcgatgcc aatgttcgga cacatggaac 300 cattcgtggt tggcgaaaag ttcgtcgagt acgtcagtcg gttagaacag tttttcttgg 360 tgaatgaagt gcccgataat aagaaagtgc cgatgttagt gacaatggcg ggtccgagcc 420 tgtattcgat tgcgtcgaga atttgttctc cggaagaccc gtgtacgaaa tcttacgagc 480 aattgatcgc gttattgaaa aagcacctag ccccaaccgt caatgtggtg gctgagcgtt 540 ataaattcca aaagtgtgag cagtcttcaa gtcataccat ttcggagttc atcatcgacc 600 tcaaggcccg atcccagtcg tgtgaataca atgctttcct tcaagaagcc ttgcgtgatc 660 agttcgtggc cggagtgtat aatacgaatc ttcggacgaa gttgctgacg gaagctaacc 720 tcactttcga aagagcatgc gaaatcgctc gcagctggga agcagcggaa caggagtcga 780 aagcaatgca gggagcatcc aaaattgctg ccctcaaccg gcggccggga cagaagttgt 840 tcaagcctga aaagacgaag ttcatcgtcc aggagaagaa gaagcaagcc agtagcccag 900 cgaagaaatc atcggagccc tcgaagacat gtttccgttg tggtaggtcc cacaacccag 960 agtcgtgtcc agccaagcag tggacgtgct tcgcatgtgg caaggtcggc cacgtgtcca 1020 caatgtgtcg gtcgtcgaag caaaagccga gcaaccagaa tcgagtagca gaaatgtcgg 1080 aagtcgtcga aaactggaag ttgaaccggt tgtcctcatc gagcgagcct tcggaagatg 1140 tagctgcggc ggaacgtgtc aacagcattt tcgttcttgg ggagactgag tcaagtcgag 1200 cgcgaataga tgtgcctgcg attatggagc tagaaattga gggagtgcca gtgaaatttg 1260 aagttaatac aggagcgtgt gattcagtga tatcgaaaaa tacgtatcaa gataagcttt 1320 cccacgtcaa attgcgacgg actgtcaagt gctttcagtc agtgtcgggc caagacatcc 1380 gtccaaaggg agaactagcg gtcaaggtaa gaaatcggag tggtgattta gtgtcgcttc 1440 aactttttgt tcttcaaccg gagaggaata ggagcatcac ccctctactt ggtcgttcga 1500 gtctcgatat cctggttcct gaatggagaa acatggtgag tttcaatttt tcttctatca 1560 gtgctattca acacagtctt aagtctgaaa ttcagagcaa atttccaaat ttgtttgctg 1620 gcaactccaa ccaggcaatt gaaggtttca gtgccgatat tgttttgaag caaaatgcag 1680 ccccaatttt ccataaaccg tacactattc cgttcaaatt tcgcgaaaat gtagatcaag 1740 agttggatcg catggtggag agcggaattc ttgttccagt ccgtttctcc acgtaggcga 1800 gtcccattgt gatcactccg aaaaaggatg gctcaatttg tatctgtcta gatggaaaag 1860 caaccttgaa tcggttcata tccactgagc attatcagtt acctttgatt gatgatgtcc 1920 tcgctagtct atcaaatttc aaagtctttt gcaaaatcga tttaactggt gcatatttgc 1980 aagttaaatt gtccgacgca gcgcaggagt tgtgtacgat aaacacccat agaggattgt 2040 ttaagtacac ccggatgcct ttcggtatca gttcagcacc atcgttgttt cagtccatta 2100 tcgatcaaat tttactagga acagtagccg ttccgtattt agatgatatt gtagttggtg 2160 gcagaaccgt tgaagagtgt aaaaggaacc tcttccaggt tttagataag ttaaatcagc 2220 acaatgtgca aattaattgg tccaagtcaa catttttcga aaccgaaatt gaacaccttg 2280 gcttcaggct aactgctgat ggaattcgac ctagttcttc aaaggtcgaa gccatttgtt 2340 gtgcacctgc accaaaagat gtaggtcagt tacagtcgtt tttaggtctt ttgaactatt 2400 atcatcgatt tttgcctaat ttgtccaccg agttacgtcc gttgtacgat ctgttaagaa 2460 aagatcaaga attcgtgtgg tcgtccgcgt gtcagaaagc gttcgaaatc agtaaaaagc 2520 ttttggttgc taacgatgtg ttggaaccat atgatccgtc gaaaccgttg attctcacta 2580 ccgatgcgag tccgtatggt accggtgcgg ttttgtctca ccttgtcggt caaatcgaaa 2640 aaccagtgtg ttttgcttcg tctacattaa cccccgcgca ggttaattac ggtcaaatac 2700 acaaagaagc cttagcagtt atgttcggtg tgtcaaaatt ccataagtat ctgtatggta 2760 cggaatttac tctcgtgacc gatagctccg cgttaaaaca aattttcaat ccgaataaag 2820 gttcgtcggc ggttgcggtg tctcgattac accgttgggc agtaacacta tcgaattatt 2880 cgtatcgcgt tgtacatcgt cccggaaaaa gttttagcaa tgccgatgcg ttgtctcgta 2940 tacccttacc cgtcgaaaat cggattgagc acatgtcggt agctgaacat aacctacaaa 3000 actacgtcgg tatcggtgag attagaacac agcagcattg tgatcctatt ctcggtaagg 3060 ttattcgttt tgttaaaaat ggatggcctc gggttgttga aaccgatttg aaaccgttcg 3120 cgcagattaa cacaaatttc gaaatggagg acaattgcct ttacttcgaa gaccgcgtag 3180 tgatagcaaa ttccctccag cggcgcgtgc tcgagaagct tcatgagaat catgatggcg 3240 tcgtgcgaat gaaaatgatc ggtcggtctt acttctggtg gaaaggattc gataaagatg 3300 tcgcgaattt cgtaaaatcg tgtgtcacat gtcagaaaac tcaaaaagtg cccagagaag 3360 tagtcgagtc tagttggcct ccagcaaatc atgcttttga acgccttcat ttggatttgt 3420 ttcattttga aggtcaaacg tttttaattg tcgtcgatgc atattcgaaa tttattgatg 3480 ttaaacttct gcgaagaacg gatgccaata gtgtattgga gaaattggat gaaatataca 3540 catacttcgg tcttccaact gaagtgtgtt cagttaatgg ttcaccattt aattcgtctt 3600 tgtttgttga agcttgcaaa gcaaacaata tcaatgtgtt gaaatctcct ccataccatc 3660 cacagtccaa tggattagcg gagagaggag tacagacaat caagactgta ttgaagaagt 3720 atctgttgga cgagaaattc aaaccgatgc caattcagcg aaaacttaac cgcattctgt 3780 tcaactacag aaatactccg tctaccacta ccagtagaac gccctcgtct atgatgttct 3840 cttacgtgcc tcgcacactt ttgaattcaa gtaatccggt caagttggac attcaaaatg 3900 aacagaagaa tgtggttaaa ccttcgttgc gtcccgaaaa gtccaggagc cttgtatgtt 3960 ccaaaacgta ttctcgaggt gataaggtta tgtatcgcaa tcacatggat gattatgtcc 4020 gttggattcc agcgattgtt ttagagaaaa ttagtcctca cacttacttg ataaacctca 4080 acgggaatgt acgtatggtt catgccaatc agatccgtat gtctgatctg tcggacaaat 4140 tccacccttc catacccgta atatgtccta tccctgagtc cgagcaacca acgacaaagg 4200 tagaggtgcc tggttcctcg accgcgattc aacaaagacc gttgaataag aagaaatctc 4260 cgaaaaagaa gaaatcggaa aaaagaaaac gcagcgattc tacatcacca aaactccgtc 4320 gttccaaacc tattcgcgat aaaacttgta agcgctagtg tgggtttagg aaaacggatg 4380 ttttgtccct tcgtgtatcg aaagtatttg acagttagga attacaattt tttcaagtat 4440 gatagctgtt tcgcgttttc gatatcgttg gattacagtt tttttgtaca gaagaattag 4500 ctctacagtc tcatttaaag ctgggagaag 4530 // ID BEL-49_AA-I repbase; DNA; INV; 6150 BP. XX AC supercont1.247; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-49_AA_; KW BEL-49_AA-LTR; BEL-49_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6150 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.247; Positions 1365060 1358911. XX CC Positions [5199-5735] - Integrase core CC 'CTACA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 33..4751 FT /product="BEL-49_AA-I_1p" FT /translation="MPEENETSGSGDRNCTLCEDHDSVDDMVQCDNCDLWT FT HYSCAGVTEAVKESQWVCPKCTNTLQVPKNTKKVTSKKTGKKGSNKSDTGS FT DSGKVDAIVASFEESLAQLQQERSAKTKHLDEEKVLHEKRLQMEREILERK FT RLQKRQMMEKELAQEQEILEQQLRDEQEFLQRRQDLRNRFQQIKLDLSQRY FT EEPDDSDDDEDGANGAEKAKSWLQTTKEDPRGAYPKKGGPHETGFQSSQVS FT QLAGTRKQSQYLEFPGPTMQNEPGLKHAGSFVERSLPNQFQGITMPNEPGS FT KQAASFVERSQPEQFPGPTMLNEPWLKQAPNFIEHPLPNQLAADNASREWR FT ELLQRSDLTAEEEDVIRNIMSRRRPGGNVQGSFVNRGPTPEQLAARQAVSK FT HLPVFRGEPEVWPLFISCYEYTTAACGFTNTDNLKRLQDSLQGLAKEAVQS FT RLLIPDSVPEVIEDLRKLFGNPEKLLKTLVTKVRQAQAPRLDRLETFLYFG FT ITVKQLCDHLEAAGLHDHLSNPMLVQELVNKLPPEYKLDWVRFKRGKPGTP FT LRNFTDFINEIVSEVSEVADFTGAEQGVTGQPWKNKPKKKEFVNTHNTNSS FT RSGGPYSEMQPKPRKPCVVCKRNDHKVRYCDDFSKLSIVERLKIVDKFKLC FT QLCLNDHGKAQCTFKGRCNVPSCNKLHHSLLHRAVESIQLSVAQSNTHFQS FT FRSVIFRIVPITLYCRGRAVKTSAFLDEGASTTLVEDSIADKLNADGIPEP FT LEVIWTANMKRYEEGSRRIDLQISGRESSKRYELKEARTVNELVLPKQEVN FT FEEMVSRYRHLEGIPITDYGVEVPRILIGLDNLHLFAPLESKLGDIGEPIA FT VRSILGWCIYGPNQQRTSPKAYVHYHSIEPITNQHLHDLLRAQYTLEDPYE FT QTEAIPETEEIVRARTILEKTTKRIGDRFETGLIWKHDNVTFPDSYPMALK FT RMTALERRLAKEPRLQTEVHQQINAYVQKGCAHLATKEELNGVCSSNIWYL FT PLSVVLNPKKPNKVRLVWDAAATVGGVSLNSKLLKGPDLLTSLPSVISKFR FT ERPVAIGGDIREMYHQLRIRDSDKQAQRFLFRFNSSEPPGIYIMDVATFGA FT TCSPCSAQFIKNRNALEHSEQYPEAALAIVDRTYVDDYFDSLDSEEEALRR FT AKEVKFINAKAGFEVRNWSSNSSLVLCELGEEKSSGLVHFHQDKATENERV FT LGIIWNPRLDTLSFAVPSGAEISLDHSGKTTKRIVLSSVMSLFDPLGLLAP FT FTTQGKMLIQDLWRSGCDWDQVIDDDSYKKWRSWVSVLSKIADLEIPRSYF FT GMVQSKDFGHIQLHIFTDAGENAFGCAGYLRIVVDGVVKCSLAMARSKVAP FT LKQLTIPRLELQAAVLGARMSHQIKQNLSLQLHQTFFWTDSRTVMSWIASN FT QKRYKQFVSFRIGEILSLTKLSEWRWIPSKLNIADCLTKWKGEWDLDPEWF FT SGPSFLYQIPEMWPRQQLPPMNTRIELRAVHLFHDVILPDKLIDAAMFSKW FT NVLVRTLASVLRFVSNCRRKLRGEPIETLKPTVNQRSNIVVEMRAARTPLK FT QRISTS" FT CDS 5169..6149 FT /product="BEL-49_AA-I_2p" FT /translation="MASIPVQRLTPYKRPFTFVGIDYLGPVEVSVGRRAEK FT RWIVLFTCLVVRAVHLEVAYNLTAQSCTMAIRRFICRRGPAAEYFSDNGTN FT LKAASKEFVGRLQAIGSECAEEFTTARTKWHFNPPAAPHMGGIWERMVRTV FT KNVMAVLNDGRRLTDELLLTTLAETEDMVNSRPLVYAPQGSPQESLSPNHF FT LRGIAANEPHEVLPPTNPALALRDTYQRSVELSNNLWERWLKEYTPSLNHR FT RKWFTESSPLRKGDLVYVVDGNRRKAWIRGVVEEPIVSGDGRIRQAVIRTN FT YGLVRRATANLAVLEIEGSNAAPDSNSGTGLRAGD" XX SQ Sequence 6150 BP; 1828 A; 1402 C; 1545 G; 1375 T; 0 other; ttcttgaaga acaactctcg gataacctcg aaatgccgga ggagaacgaa acatcgggaa 60 gcggcgatcg caactgtacg ctttgcgagg atcacgatag cgtagatgac atggtccaat 120 gtgacaactg cgacctctgg actcactata gttgtgcggg cgtcaccgaa gccgtgaagg 180 agtcacagtg ggtgtgcccg aagtgtacga acactctgca ggtgccgaag aacaccaaga 240 aggttacctc caagaagact ggtaagaaag gtagcaataa gagcgatacc ggatccgatt 300 ccggaaaagt agatgctatt gtagctagtt ttgaggagtc actggcgcag ttgcagcaag 360 aaagaagtgc gaaaacaaaa catctcgacg aggaaaaggt tctgcatgaa aagcgtttgc 420 agatggagcg ggaaattctg gagcgaaagc gcttgcagaa gaggcaaatg atggagaaag 480 agctggcaca ggaacaagaa atcctggaac aacagctacg agatgagcaa gaatttcttc 540 aacgccgcca ggatcttcgc aaccggtttc aacaaatcaa gttggacctc tctcagcgtt 600 acgaggaacc cgacgactca gatgatgatg aagacggagc gaatggagcg gaaaaagcga 660 aaagttggtt gcaaacaacc aaagaagatc cccgaggagc atatccaaag aaaggcggac 720 cacatgaaac aggtttccaa agctcacaag tgagccagtt agctggtacc aggaagcaaa 780 gccaatacct ggaattccca ggacccacga tgcagaacga gccggggttg aaacatgctg 840 gcagcttcgt cgaacgttcg ctgccgaacc aatttcaagg catcacgatg ccaaatgagc 900 cggggtcgaa acaagctgca agtttcgtcg agcgttcgca gccggaacaa tttccaggac 960 ccacgatgct aaacgagccg tggttgaagc aagctcccaa cttcattgaa catccactgc 1020 caaatcaatt ggcagcagat aatgcatcac gggaatggcg tgaactactg caacgatcgg 1080 acctgaccgc agaagaagaa gacgtcatca gaaacattat gagccgcaga agaccaggag 1140 gtaatgttca aggcagcttt gtaaatcgag gacctacgcc ggaacagttg gcggctagac 1200 aagctgtatc caagcatcta cctgtatttc gtggggaacc tgaggtttgg ccgcttttca 1260 taagctgtta cgagtatacc acggcagcgt gcggctttac gaacacggac aatttgaaaa 1320 gactacaaga cagtctacag ggacttgcga aggaagcagt gcagagtcgc ctcctgatac 1380 cagattcagt tccagaagtg atagaggatc tgcgaaaact cttcggtaac ccagaaaagt 1440 tactgaaaac tctggtaacg aaagtgcgac aggcacaagc tccacgattg gatcgacttg 1500 agacgtttct atacttcgga atcacagtta aacaactgtg tgaccatctc gaggcagccg 1560 gtttgcacga tcatctgagt aatccaatgc tcgtgcagga gcttgttaac aaattaccac 1620 cagaatataa attagactgg gtccggttca aacgcggtaa gccaggaaca ccgttgagga 1680 actttaccga cttcatcaac gagatagtgt cggaggtgtc ggaggtagcc gactttacag 1740 gagcggaaca aggagtgact ggacaaccgt ggaagaataa gccgaagaag aaggagttcg 1800 tcaatacgca caatactaat tcgagccggt ctggtggacc gtattcggaa atgcaaccga 1860 aaccgagaaa gccgtgcgtc gtttgtaagc gaaacgacca taaggtgcga tattgtgatg 1920 acttttccaa actaagcatc gttgaacgct tgaaaatcgt cgacaaattc aaactttgcc 1980 agttgtgtct aaatgaccat ggtaaagcgc aatgcacatt caaaggccgt tgtaatgtgc 2040 caagttgtaa caaactccac cattctctac tgcatcgcgc ggtagaaagc atccaacttt 2100 ctgtagctca aagcaataca catttccagt cgtttcggtc ggtgattttt cgcatcgtac 2160 ctattacgct gtactgcaga ggtcgtgctg taaaaacatc agctttcctc gacgaaggcg 2220 cgtcaacaac actggtcgaa gactcgattg ctgataagct gaatgcggac ggtattccag 2280 agccgttgga agtcatttgg acagcaaata tgaagcggta tgaagaaggg tcacggagaa 2340 tcgatcttca aatctcagga cgcgaatctt cgaaacggta cgagctgaag gaggcacgta 2400 ctgtaaatga attagtgctg ccaaaacagg aagtaaactt cgaggagatg gtcagccgat 2460 atcgtcatct agagggaata ccaatcaccg attacggtgt ggaagttcct cgcatattaa 2520 tcggcttaga taatctacat ctcttcgccc ctctggaatc caagttggga gatatcggcg 2580 aacctatagc agttcgttct atactggggt ggtgcattta cggcccaaat caacagcgga 2640 cttcaccgaa agcgtacgtg cactaccact cgatagagcc aataactaat cagcacctgc 2700 acgatctgct tagggcgcag tacaccctag aggacccgta tgagcaaacc gaggcaatac 2760 cagaaaccga agaaatcgtg cgagccagaa ctattttaga aaaaacgact aaacgcattg 2820 gagaccggtt tgaaacgggg ttgatatgga aacacgacaa cgtaactttc ccagacagct 2880 acccgatggc gttgaaacgg atgacggcac ttgaacgacg actcgccaag gaacctcgac 2940 tccaaactga agttcatcaa cagatcaatg catacgttca aaaggggtgc gcacatcttg 3000 caactaaaga ggaattgaat ggtgtgtgca gctctaacat ctggtatctt ccgctaagtg 3060 tggttcttaa tccaaaaaag ccaaataaag tgcgcctggt atgggatgcc gcagcgactg 3120 tcggaggtgt ttctctaaat tctaaactgt tgaaaggccc agacctgctc acgtctctac 3180 catcggttat cagcaagttt cgcgaacgac cggtagcaat tggtggcgat atacgcgaaa 3240 tgtaccacca gttgcgcatc agggactcag ataagcaagc acagcgcttc ttatttcggt 3300 tcaattcatc agaaccacct ggtatttata tcatggacgt agctactttt ggggctacat 3360 gctccccatg ttcggcacaa tttattaaaa accgaaacgc tcttgaacac tcggaacagt 3420 accctgaagc agcattggca atcgtggatc ggacgtatgt ggacgactat tttgatagtc 3480 ttgactccga agaagaggcc ctacgccgag ctaaggaggt gaaattcatc aacgccaaag 3540 cggggtttga agtcagaaac tggtcttcaa attcgtcgtt agtactttgc gaactgggcg 3600 aagaaaaatc aagcggattg gtacattttc atcaagataa agcaaccgag aacgaaagag 3660 tattgggcat tatctggaat ccacgcttgg acactctgtc attcgctgtc ccatctggag 3720 cggaaatttc gttggatcac tctggcaaga ctacaaaaag gatcgttctt agttcggtta 3780 tgtccctatt cgacccgtta ggtctactag cgccatttac cacccaggga aaaatgctaa 3840 tccaagatct ttggaggtcc ggatgtgact gggatcaagt cattgacgac gattcgtata 3900 aaaagtggag atcatgggta agcgttttat ctaaaatcgc cgacttggaa atccctcgca 3960 gttactttgg tatggtgcag tcaaaggact tcggccacat ccaactgcac atatttaccg 4020 atgctggaga aaatgccttc gggtgtgctg gatatctacg gattgttgtc gatggagtag 4080 ttaagtgctc tttggcaatg gcacgatcga aagtagctcc tctgaaacaa cttacgatcc 4140 cccgcctgga actacaagcg gctgtattag gtgcaaggat gtcgcaccaa atcaagcaaa 4200 atttgtcgct tcaactacat caaacctttt tctggactga ttcccgaaca gttatgtcct 4260 ggatagcttc gaatcagaag cgttataaac aattcgttag ctttaggatt ggagaaatct 4320 taagtctgac gaagttgtcg gagtggagat ggataccttc gaagctgaac atagcggatt 4380 gtcttacgaa atggaagggt gaatgggatc tggatccgga gtggtttagt ggaccgagct 4440 tcctgtatca aattccagaa atgtggccac gtcagcaact gcctccgatg aatacaagga 4500 ttgaattacg agctgtgcac ttattccacg acgtaatatt gccggataag ttgatcgatg 4560 ctgccatgtt ttccaagtgg aatgttctcg tcagaacgtt agcaagcgtg ctacgttttg 4620 tttcaaattg tcggcgaaaa ttgcgaggag aacccatcga aacactcaaa ccgacggtga 4680 accaaagatc aaatatagta gtcgagatgc gagcggctcg aacaccattg aaacagcgaa 4740 tatcaacgag ctgaaacttt actgttcaga tcagcgcagt acgacgcttt tggtgatgaa 4800 ataaaaacgc ttcttaaaaa tcaagacctt gaaccatcca agtggataaa cctggaaaaa 4860 ggcagctcac tttacaagct cacaccactc ttagacagcg aaaaggtatt aagaatggaa 4920 ggacgctgtg ataaggctga aaacctgcca ttcgacatgc ggtttcctgt cattcttcca 4980 cgcgatggac cagtgaccga acttctagta cgccactatc acgaaacgta cggtcacgct 5040 ttcagggaaa ctgtaaaaat gagatcaaac agcgctttct catcatcaat ttgaacacag 5100 tggtggccaa aatcgagcga agctgtactt ggtgcaaggt gaacaagaac tcgccaagaa 5160 ttcccaggat ggcctccatt ccagtacaac gcctaacgcc ctacaagcga ccctttacct 5220 ttgtcgggat cgactacctt gggccagttg aagtttcagt aggccgacgt gcagaaaaaa 5280 ggtggatagt attatttact tgcctggtag ttcgggctgt ccacctcgaa gtggcctata 5340 atcttaccgc acaatcatgt acgatggcga ttcggcgctt catctgtcgg cggggtcccg 5400 cagcggaata tttttccgac aacggtacca accttaaggc tgccagtaaa gaattcgtcg 5460 gaaggctgca agcaatagga agtgaatgtg cagaggaatt cactaccgca cgaacgaagt 5520 ggcacttcaa tccacctgca gcccctcaca tggggggcat ttgggagagg atggtgagga 5580 ctgtaaaaaa cgtgatggca gtattgaatg acggacgaag attaacggac gaactgcttc 5640 tgaccaccct agctgaaact gaggatatgg tcaacagtcg ccctctagtt tacgcgccgc 5700 aaggttctcc ccaggaatca ctctcaccta accatttcct tcggggaatt gcagccaacg 5760 aaccgcatga agtactgcct ccaaccaacc ctgccctagc tcttcgtgat acatatcaac 5820 gttcagtaga gttgtccaat aatctgtggg aacgttggct gaaggagtat acaccaagct 5880 taaatcatcg cagaaagtgg ttcactgaat cttcgccgct gagaaaaggt gaccttgtct 5940 acgtggtcga tggaaatcga cgtaaagcat ggattagagg agtagtagaa gaaccaatag 6000 tttcgggcga tggaaggatc cgccaagcgg tcattagaac caattatgga cttgtgaggc 6060 gagcgactgc taatctggcg gtactagaga tcgagggaag taacgctgct ccggatagca 6120 attccggaac aggattacga gccggggata 6150 // ID Sola3-1_DPu repbase; DNA; INV; 5737 BP. XX AC ACJG01002741; XX DT 27-FEB-2011 (Rel. 16.02, Created) DT 27-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE Sola3-type DNA transposon from Daphnia. XX KW Sola; DNA transposon; Transposable Element; Sola3-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Direct Submission to Repbase Update (09-FEB-2011). XX RN [2] RP 1-5737 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX DR EMBL/GenBank/DDBJ; ACJG01002741; Positions 20125 25861. XX FH Key Location/Qualifiers FT CDS join(1515..2135,2083..3198,3521..4474) FT /product="Sola3-1_DPu_1p" FT /translation="MKTRVFQVYKEMFTFISNILCGEAAEISLAATVKEFR FT EKGVTKKSSKSVSHVMIKTMSSSVKKKTVELRTTRALLCSSLTRAEVKKMC FT DKNGNVTISKDGFRKGKCDMQELLSGKTLTPALRSVRRFRSVAVDSAITFI FT LSSENVSFFSWGTKRWKVDGVEKFFPAVSRKTTRANMYCKYIATQEIPEVN FT KVKRSSFFYDRRYPYSFTRLNVQVFFMIADTLTHSDAILRRAVDYVTGFLV FT NDSFAIIEKLITHFSGSATERQRLLVEMEVCKRYLTYGFNNGVVEKIPCTA FT HDIRYGLGEDRTAPENFCDSCRNLFCCFHYLKKKIDGSGFDAAQSAMKALI FT GCEEKAVLYMGHRMRVTNQQMSIQKIFDQMKDECVNNGKKLCVITLDYKMK FT LDPLYFREKTVDHYGKRGMSWHGSMVQYYTMENFEETSAPFLNKLYLDHIV FT DNENKQDKLAVISILEAVILAIRKHLPNIDQIILQSDNAGCYQNTMLMLLI FT PYLSYAHGIEISRIIHTETQDGKSVLDAHFARSMQRIISWCKEGNYVMCGL FT IVKRNEIIFSERSQLRHSCSSCCCSESEWRIATTDKGQGKFVCVLPSERRC FT TTVQQTAFDDDSEEDITLEEEESLPITQLDDAESINDTDQSTSDVSDVEAE FT GTELNPPVEPEKVEGKVTGITVLTDNALTKRSAKWKKKTISSAVTTVNFLN FT RHKDMKTYAMKRALQLQDAGDIVFRSISDNADVSTDSYLTTQEDYLDPIFR FT CGWAKRPKVGQMYGTKYLEDYRSDIKSLYNSGEADKKNKKSPAQMLEIFES FT RYPNEFCLPSENDIRTEINKLQTQKKSATRDNRPRGSKDDDEQTNFIKDLA FT LSNPSLKPTAAVTAMKARFAVVTLTDAQIKSKFSYYKAKANKNT" XX SQ Sequence 5737 BP; 1919 A; 1138 C; 1160 G; 1520 T; 0 other; gctgtacatc gataccattt ttttctcaat cattccgcgc agtcactgtc taaacaacaa 60 caacaaggtt gaaatgtcct cgcaagaaga ttattctgat tttgaatctg aaccagaaaa 120 tggggaaaag ttagatgacg agtgctgtga acgttatgca gctgtcattt ggagttctcc 180 aaagtttgtt gagaacgact ataactgtgt tgacattagg aaaatttcta aagacatgct 240 gttgaatgga gagctccacg gcccaattgt tgtaaacgta aacatcaatg ttggttgttt 300 acaagcagtg actacagaaa cgtacattgg aagagtattt gctactggag gtataaaata 360 tgcatatgtt atcccatgtc aatcgttaac atttttattc ttgtgtcaaa tagatgataa 420 aaaaatcctg aaatcccttc ttgaggcaga aattataagg atgaaaacac aaggcaaggc 480 taacactgaa atgtataagc aacttaatac acgccaagga atgaaattgt tgttttacaa 540 tatttgaatt atcgtcttta gtagttgcca aagtctgtga ctttcatttc ctgaatcgtt 600 ttgagggggt tgcgtgaaaa gaggcgacac acttcacgat attcagtagt aatgcgatgg 660 cgatcctatt ggaatttaat gttttgaaat tcagttcatg ttgtgttcca aagctcaatt 720 tcagaattac cttgtagtgg gtttcttggc tgactgtgca gagccgggcg cttgtgacgg 780 aagtcaaatg gatagatttc tgagaggatt tgctacggga gcttcactgg aatgccgtgt 840 acggatgccg gatatcatat tttcaccttt cataccaact aattaagaaa taaatattaa 900 tgtcacgtaa ttgttttatg aaataagatt tttccggtaa tgtaaactga tcacgcggag 960 ttggtgctta gaactttaca agatccatgg aagttggtcc cactgggcca accgataatt 1020 tttcttttgg ttttttcttt cgtttcttac ccttaccctt tcatttcctt tgttttacgt 1080 tacccatcct gtgacgttac acttgaccaa aaagttcaag ctaaagaggt atggtaataa 1140 aaaccaaaag taaagttagt tgccagaaat tcttatccaa gtggagtaga gaaatggcag 1200 gcctcttgtg taaatgtaaa tcagcagttt gcgatagttg caataattgc agacggtgca 1260 attgtgagtg tcaagatgca gtacaaaatg atgtgccgtc ttcaagtaaa cgacgaagaa 1320 atgaagtccc actggggcag cgtagacagg tgtgtgaaaa tgtgaatacc ggcacagatt 1380 tttccaaaaa tccgcgcggc atgtacacag cattaggtat ggatccaaat tctccaatta 1440 tacggcattt gcctgctaag gcaaagcgct cttgtgtaac gacatggatg gataactaac 1500 cacaagctaa atccatgaaa acgagagtat tccaagttta caaagaaatg tttactttca 1560 tttccaatat tttgtgtggc gaagcagcag aaatttcctt ggcggcaact gtaaaagaat 1620 tcagagagaa aggagttaca aaaaaatcaa gtaaatccgt ttcgcatgtc atgattaaga 1680 ctatgtcatc atcagtaaaa aagaaaacag tagaattgcg gacaaccaga gctttgcttt 1740 gctcttcact gacacgggcg gaagtaaaaa aaatgtgtga caagaatgga aatgtaacta 1800 taagtaaaga cggattccga aaggggaagt gcgatatgca agaactgtta tccgggaaga 1860 cacttacacc cgcgctacga agtgttcgcc gctttcgatc tgtcgctgtt gactcggcca 1920 tcactttcat actctcttcc gaaaatgttt ccttcttttc atggggaacg aaacgttgga 1980 aagtagatgg agttgaaaaa ttttttcctg ctgtatctcg aaaaaccaca agagcaaaca 2040 tgtattgcaa atacatcgca acgcaagaaa ttcctgaagt gaacaaggtt aaacgttcaa 2100 gtttttttta tgatcgcaga tacccttact cattctgatg caattttgag acgagctgta 2160 gattatgtta ctgggttttt ggtgaatgac agctttgcca taatagagaa gctcatcacc 2220 catttcagtg gtagcgctac agaacgacag cgtcttttgg ttgaaatgga agtttgcaaa 2280 cgctacttga catatggctt caacaatggt gttgtcgaaa agataccgtg tactgctcat 2340 gacatacgtt atggtctcgg cgaggatcga acggcacctg aaaacttttg tgattcttgt 2400 cgtaatcttt tctgttgctt ccattaccta aaaaaaaaaa ttgacggttc cggatttgac 2460 gcagcacaat cagctatgaa ggctttgatc ggctgcgaag aaaaggccgt gctttatatg 2520 ggtcatcgaa tgcgcgtcac caaccagcaa atgtcaattc aaaagatttt tgatcaaatg 2580 aaagacgaat gtgtgaataa cgggaaaaaa ttgtgtgtga tcactctcga ttacaaaatg 2640 aaacttgatc cactttattt cagagagaaa accgtggatc actacgggaa gagaggcatg 2700 agctggcatg gatcaatggt ccagtattac acaatggaga actttgaaga gacttcagct 2760 ccctttctca ataaactgta cctggatcac atcgtcgaca acgaaaataa acaagacaaa 2820 cttgccgtta tctctatttt ggaggctgta atattggcca tcagaaaaca tctaccgaac 2880 atcgatcaaa taattttaca gtccgacaat gcggggtgct accagaatac gatgttgatg 2940 ttgctgatac catacctatc ctatgctcat ggtatagaaa tttcacgcat tatccatacg 3000 gaaacgcagg acgggaaatc ggttctcgac gcacatttcg ctcgttctat gcaacgtata 3060 atatcatggt gtaaagaagg taattacgta atgtgtggcc taattgtgaa acgtaatgaa 3120 attattttct ctgaaaggtc acaattgcgt cactcctgct caagttgttg ttgctctgaa 3180 agcgaatggc ggattgcata actgcacctc cgaattagtt acgctcaaca gagaaaggct 3240 gcaaaaaatt gaagttgaat ttgctgcgct agatgggcaa ctaaagaagt acgttggtag 3300 ggcaaacgat gtgctatttt gctttcagga taatgtaact ggacgtcata tttcaaccca 3360 aaccatgccc gaaaacttta gcagccttcc tgatttttcg ttgcgcgtct ttcaacactc 3420 gggtaaagtt ttcccagtca tataaacata cttgacaaca tccctcccct tcttgtcact 3480 cactcataaa ctttgttttc ttacgtaact gtgcttctga actacagata aaggacaagg 3540 caaatttgtc tgtgtacttc catcggaacg aagatgtacc acagtacagc aaactgcctt 3600 tgacgacgac tcagaagaag acataacgct agaagaagag gaatctttgc ccatcacaca 3660 actagatgac gcagaatcaa taaacgacac ggatcaatct acttcagacg tcagcgatgt 3720 tgaagctgaa ggaacagaac taaatccccc cgttgaaccc gaaaaagtag aaggaaaagt 3780 aaccggtata actgtcttga ctgacaacgc cctcaccaaa agatcagcga aatggaaaaa 3840 gaaaactatt tcatctgcgg ttacaactgt caacttccta aatcgtcata aagacatgaa 3900 aacttatgct atgaaaagag cactacagtt acaagatgct ggtgatattg tctttcgctc 3960 aatatcagat aacgcggatg tttctacaga ttcctatctc acgactcaag aagattatct 4020 cgatcctata ttccgttgtg gctgggcaaa gagaccgaaa gtagggcaaa tgtacggtac 4080 gaagtacctg gaagattatc gaagtgacat taaaagcctc tacaattccg gagaagccga 4140 caagaagaac aaaaagagtc ctgcgcaaat gcttgaaatt tttgagtcgc gatatccgaa 4200 cgaattttgc ctgccatctg aaaatgatat tcgcaccgaa ataaacaaac tccaaacaca 4260 aaaaaaatca gcgacgagag ataaccggcc aagaggatcc aaagatgacg atgaacaaac 4320 aaacttcatt aaagatcttg ctctaagcaa cccatccctg aaacccaccg ctgctgttac 4380 agccatgaaa gctcgatttg ctgttgtcac tcttacggac gctcaaataa agagcaaatt 4440 ttcctactat aaggctaaag cgaataaaaa cacgtgaaac gtaccaacaa ttgcaacact 4500 tattgtccct ttcctaagaa ttacttttta tgatcggagg agtagttgag acagacgcgt 4560 tatcttgcct aattgcgcaa agagtatgtc tgtgctgttt gcaagacgtt tcaaataaca 4620 tggtaactat gaaagatgta aaccaggcaa ctggaatacc tgattgtaat agtcagaaga 4680 aattgcatgg cacttgcaga acctgggcca ttgccatcaa aagaaatccc agggaagtag 4740 agagctagtc aacaaaccta aagagataaa aattagattc tcacaacaag taagtaaaca 4800 tgtctgacag ttatgttacc agagactcca taccagtgac ttcataccag tgacgtctaa 4860 aacaaatccc agtgaagcag agagctagtc aacacacata gaaaaataaa agttagattc 4920 acgcaacaag taaacatgtc tgacagttat gttaccagag actccatacc agtgacttca 4980 taccagtgac gtctaaaaca aatcccagtg aagcagagag ctagtcaaca cacatagaaa 5040 aataaaagtt agattcacgc aacaagtaaa catgtctgac agttatgtta ccagagactc 5100 cataccagtg acttcatacc agtgacgtct aaaacaaatc ccagtgaagc agagagctag 5160 tcaacacata gaaaaataaa agttagattc acgcaacaag caatcatgtc tgacacttac 5220 tttaccagag actccatacc agtgacttca taccagtgac gtctaaaaca aatcccagtg 5280 aagtagagag ctagtcaaca aacctaaaga gataaaaatt agattctcac aacaagtaaa 5340 catgtctgac acttacttta ccagagactc cataccagtg acttcatacc agtgacgtct 5400 aaaacaaatc ccagtgaagc agagagctag tcaacacaca tagaaagata aaagttagat 5460 tcacgcaaca agtaaacatg tctgacagtt atcttaccag aaacttcata ccagtgactt 5520 cataccagtg acgtctaaaa gaaatccgta gtagtgagac agacctagtc aacatgcaag 5580 ataggaatta aggaatgtca tcataaaaca atgtaaacac caatatgtgc aacagtaatc 5640 agaataatct tcttgcgagg acatttcaac cttgttgttg ttgtttagac agtgactgcg 5700 cggaatgatt gagaaaagaa tggtatcgat gtacagc 5737 // ID BEL-63_AA-LTR repbase; DNA; INV; 648 BP. XX AC AAGE02019161; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-63_AA_; KW BEL-63_AA-I; BEL-63_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-648 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019161; Positions 53989 54636. XX SQ Sequence 648 BP; 240 A; 105 C; 115 G; 188 T; 0 other; tgtgacggcg aacaacccta gccttagcca gactgccctg ttgtgctttt tgtacctgaa 60 ttccctcccg ttatgtcagt tcgacagttc gagaaacagc gattgaactg agtcataagc 120 agagaagaag aaatggaatg atttggatta ttggaattct ttattatatt gaagttgaag 180 ctatcgaagt tgaatttgcg caatactagt agttgaagcg gacagagagc ctttcccgag 240 ataatagtaa gagttgtaga actaaaaaag aaaattatta ctactaatta aaatttacaa 300 tgtttacatt tacacagcat agattgattc tatcaacaca aacatttaaa caaagcttac 360 ggcttacaaa ttaaaaataa agctaagcta ccatagtgag taaatgtaac taatagttca 420 tagttaaaat tgttaaaaat aacatgaaat tatagtctac agctggaaga ttaaccctta 480 aagtaaatgc taggagagga atccgtgaat gagcaaaaac tgaaaatttg taagtaatct 540 actatatcta tgaacgcata tgtaatctaa caacaaaata tattccagtt aaagttgcct 600 tgatactaca gttctgtggt cccgcttcta aaaagttcag tccgaaca 648 // ID Gypsy-217_AA-LTR repbase; DNA; INV; 1038 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-217_AA_; KW Gypsy-217_AA-I; Gypsy-217_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1038 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1038-1038 (2011). XX DR [1] (Consensus) XX SQ Sequence 1038 BP; 254 A; 292 C; 274 G; 218 T; 0 other; tgttaccgta cggtaacctt ttgattaaaa tagatgtggg actgtaggtt aatcaaagat 60 gtaaagcaac ttaacgtgca aattgaacgg tgcgggcggc ttgctatcta acgctccaaa 120 cgctgcttga agaatgggtg gttgcgtaac agtggctgca ccacaaaacg gatgggggaa 180 aacgcagcat cgacccgggg agaatgaatg tgttggggct gatatgatgc gatgaggctg 240 agaagcggca gtatagaaga gactcccacc ctgaaaagag gtccgattta gagaatttga 300 ccgaatcttg tctcgtggcg tcaattacca gcgacatcag cccttagtct ccaagccagt 360 ggtgtatagt ggaacttcgg ccgaccgcct accgggttcg ccaacctata agctgaaccg 420 aagtcagttt tggtagtacc tcccgctcag taaccgaaga gattctctgg tgacgtccgg 480 ggagcacgtg gctctaaccc cagcagcgaa gttccaagcc cggtccttga ggcccatcag 540 tccagacact gtcatcgtca ggcccgccgg ccagcccctt gacaccggct attgtagaca 600 cccgcatata gatcggtgac cagagactgc ttcgaacgcc ttccacgaac accctaaact 660 gcccgccagc ggagacgatc ctgctagtta ccgcatgaag caaccgtgtt gaagataccc 720 tatcgcatcc gcttccctcg ggaagtccgc aacatagagc cctccagcgt cgagtaatcg 780 tggtgaaccc ccgccgccgt agacgtcgcc ctgcaccgaa ggttagcaat aaattgaagt 840 aagtgagaat tcattccatt tctgtaggac tcccacatcc gaaactctcc cctactgaac 900 acgccctggg acccgggaga cagaagacgg cctttggtgc ccaccccgga agcggccgtt 960 ggctcaagac cagctgcttg gggcttaggc cagtcctctt ctgggactga gcatttctcc 1020 cggggtcaac tcgtttca 1038 // ID Gypsy6-NVi_I repbase; DNA; INV; 5046 BP. XX AC AAZX01003764; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6-NVi; KW Gypsy6-NVi_I; Gypsy6-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5046 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1124-1124 (2007). XX DR Genome; AAZX01003764; Positions 5527 482. XX CC Positions [2236-2778] - Reverse transcriptase CC Positions [3877-4356] - Integrase core CC LTRs are 77% similar to each other. XX FH Key Location/Qualifiers FT CDS 138..2144 FT /product="Gypsy6-NVi_I_2p" FT /translation="MSDIDDFEKFVEDKQLHDIISGNMADKQLQEALAEIE FT RQKARRLEERLEADKKLKELQAALQQQRNQQDSQNSNNPSDTSTDSLASAI FT TSLSDVMKSQSETLSSFQGILKNITQVDQGQVMSAASAQSIIREFYGNEGP FT EKAQVWLNELKNTKTLYNWSDAIVLSVAKSRLKKGAWKWLMTRSTTVITFE FT AFLEAFSAVFTYKRSRSEKLKVMMARTQSRNEPIQDYFLDKVWLCEGLEFT FT VNDIRDEVAAGLWSRELSQHILGKSYLTSDEILRDLFRIEKIQNGRVQRIN FT EQREQRTTTQDSKYGGGTNNSTKSWRADRTSNGVGGGVSASGSFTSTARDG FT ATDKTQGNSKETRDAAVKCYNCNQIGHISRECTKPKREITCYLCKRPGHIA FT TRCTFKPNNSTDTSKLESVNLVNSVNTSNLGNKFIREVKIGSLKTTALIDM FT GATVCTIKSTAVLCAGYKVNRVRSVLEGFGGQKVESPGVVVESIQIDDLKP FT RELTFRIVSDTAQKEDVIIGQPFTEALDISYTRSGNKLKFFDTNIIETNDT FT SSVKIYSLDNVELESGEVKFINVNVSSHELVIPVINNTDKVNRINVNEKVG FT ESILSIEPVNEMEPRKELVKSDEIVTDENVTFEQKNELLQILNSNRTCIAK FT NLSEIGKTNLMTMDIGRYGY" FT CDS 2137..4974 FT /product="Gypsy6-NVi_I_1p" FT /translation="MDIEFEDDRVISSKPYKLNATDKRDLDNLVIEYKKAG FT IITETDSKHASPAFVVRKKDGSASMVIDYRKANRITKGVHFPIPNFDDLLE FT KLSNARIFITVDLYRGYLQMPLTERAKPLTAFITETTTGQFERAMLGLKCA FT PMYFAKLMEKVLGDARRHGIAFNFFDDIFIYAENWDQLMKNFKIVLQNLKD FT AGLTLNLKKYRFGMRKVEFLGYVLGDGELKPGDRKIVAIEQFPRPSDKHEV FT RRFVGLASFFRRFVPKFADKIKPLLLLLRKDAKFEWNREQEMSFINVKSIL FT VSKPVLKLYNQNAYRTELHTDASSVGIAGMLLQSDNEGEPLSLVYAVSRCT FT TEVESKYHSSRLELMAIAWSLQRLRNFIIALKFTVVTDCQSLVYLNAWKTK FT NAQISRWMSELSEYNLEIKHRRGELMQHVDALSRAPILIDPIINSINTITE FT VRTNDYEILVFQRSDPFILEIISILRKSEGERSKLEKSKMKDFILRGGLLY FT KKVIRNDKEIKLFVVPNAMRKSLVIRYHDLCNHFGVDKTVKRIEEYYYFPR FT LRRYVKIHIKNCLECIMSKRKAGPGEGELHPIPPGKRPFELIHIDHVGPFP FT TTSRGNIYILGIIDNLTKYVKFEPVKNVTTQVTVKKMEEFVNRFGAPDRIV FT SDRGTCFTSHLFQEFCLRHGIKHSLNSSRHPQANGLIERMNQTLIPALRIS FT EHNNLNYNWDRDVAIVERNINSTISTATGIEPYRALLGYLPRFNDGNLRDL FT TKHCETYTPPNEIQCKLRDRIIAEQAKYKNYYDKNRYKNVKYNIGDIVFMK FT RNAIASGESTKLQNPYGEPLVITEVYPNDIYRVKKLNDYGDRGYETTANVS FT QLKIWKNSESIHSDDLCDDGENSDHDIEYNNDCNDNNLDLENNIIDTPDNV FT DNQTENVNKNTLDNSIVGKNFELVERPKRVRKTPTYLRDYIV" XX SQ Sequence 5046 BP; 1832 A; 687 C; 1013 G; 1514 T; 0 other; taattaaaag ttgaattata ataattcaga agtgggatat tccaaaaatt taaggaagac 60 aagatttgaa cgaacagtga aggttgtgac aacggatttt gagttccgag aattgtggct 120 cgtcgaatgt tgtgttgatg tcggatatcg acgattttga gaaatttgta gaggataaac 180 aattacatga tataatatct ggcaatatgg cggacaagca gttacaagaa gctttagctg 240 aaattgagag acaaaaggca agaagattag aagaacgcct tgaagccgac aagaagttaa 300 aggaattaca agctgcacta cagcaacaaa ggaatcaaca ggattcacag aattctaata 360 atcccagtga cacatctact gactctttag cctcagctat tacgtctctc agtgatgtaa 420 tgaaaagtca gtcagaaact ctttcaagtt ttcaaggaat attgaaaaat attactcagg 480 ttgatcaagg tcaagttatg tctgcagcta gtgctcagag tattattcgt gaattttatg 540 gaaacgaagg tccagaaaag gcgcaagtat ggttaaacga gcttaagaat actaagactc 600 tatataattg gagtgacgct attgttttga gtgttgccaa atcgcgtcta aagaaaggag 660 cctggaaatg gctgatgaca agatcaacta cagtcataac atttgaagct ttcttagagg 720 ctttttcagc agtgtttacg tataaaagat cgcgtagtga aaaattaaaa gttatgatgg 780 cacgtacgca aagtcgtaat gaaccaatac aagactattt tcttgataaa gtgtggttat 840 gtgaaggact agaatttaca gtgaatgata tacgtgatga ggtagcagct ggcttgtggt 900 caagagaatt atcgcaacat attttaggta aaagttattt aacgagtgat gaaattctac 960 gagacttatt tcgcattgag aaaattcaga acggccgagt tcaaagaatc aacgagcagc 1020 gtgaacaacg aacgacaaca caggattcca agtacggagg aggaaccaat aattccacga 1080 aatcttggag agcggacaga acaagcaacg gcgtcggcgg cggcgtctca gcgtcgggga 1140 gcttcacgtc aacagcgaga gatggagcca ctgacaagac tcaagggaat tcgaaggaga 1200 cacgcgatgc tgcggtaaag tgttacaatt gtaatcaaat cggacatatt tcgcgtgaat 1260 gtacaaagcc gaaacgagag attacgtgtt atttatgtaa acgccccgga catattgcga 1320 cgcgatgtac tttcaagcca aataatagta ctgatacatc taaattagaa tcagtaaacc 1380 ttgtaaatag tgtaaatacg tctaacttag gaaataaatt catacgtgag gtaaaaatag 1440 gctcattgaa aactacagca cttatagata tgggcgcaac ggtgtgtact ataaagtcta 1500 cagctgtttt gtgtgcaggt tacaaagtaa atagagttcg ctcagtttta gaaggttttg 1560 gaggtcaaaa ggtagaatct ccgggtgtcg ttgtagaatc tatacagatc gatgatttaa 1620 aaccacgtga acttactttt agaatagtat cagatacagc acaaaaggaa gatgtcataa 1680 ttgggcaacc atttacggaa gcgcttgata ttagttatac tcgttctggt aataaattaa 1740 aattttttga taccaacata attgagacta atgatacatc atcggtaaaa atttattctt 1800 tagataatgt tgaattagaa tctggtgagg ttaagtttat aaatgttaat gtaagctctc 1860 atgaattagt aatacctgta ataaataata ctgataaggt taatcgaatt aacgttaatg 1920 aaaaggtcgg tgagtcaatt ttatcaattg aaccggtaaa cgagatggaa ccgcgtaaag 1980 aattagtaaa atcagacgaa attgtaacgg atgaaaacgt aacgtttgaa caaaagaacg 2040 agttgttaca aattttgaat tcaaatagaa cttgtattgc gaaaaattta agtgaaatag 2100 gtaaaacaaa tttaatgact atggatattg gacgatatgg atattgaatt tgaagatgat 2160 agagtaatta gctcaaaacc atacaaatta aatgctactg ataaaagaga tttagataat 2220 ttagtcattg agtacaaaaa ggctggaatt ataacagaaa ctgattcgaa gcatgctagt 2280 ccagcgtttg ttgttcgcaa aaaggatggt tcagctagta tggttataga ttatcgaaag 2340 gctaatagaa ttactaaggg tgtacatttt ccaataccta attttgacga tttgcttgaa 2400 aaattaagta atgctcgtat tttcataaca gttgatctct atcgtggata tttacaaatg 2460 cctttaacgg aaagggccaa acctttaact gcttttatta ctgaaacgac gactgggcaa 2520 tttgagagag ctatgcttgg tctaaaatgc gctccaatgt attttgctaa attaatggaa 2580 aaggtacttg gtgatgcgag aagacatgga atcgctttta acttctttga cgatatattt 2640 atttatgctg agaactggga tcaattaatg aaaaatttta aaatagtatt acaaaacctt 2700 aaagatgcgg gattaacttt aaatttaaag aagtatagat ttggaatgcg aaaagtagaa 2760 tttttagggt atgtattggg agacggtgaa cttaaaccgg gtgatcgcaa aattgtagct 2820 attgaacaat ttccacgacc aagtgataaa cacgaagtac gaagattcgt agggttggct 2880 agctttttca gacgttttgt acctaaattt gcagataaaa taaaaccact attactatta 2940 ttgagaaaag atgctaaatt tgagtggaat cgtgaacaag aaatgagttt tataaatgta 3000 aaatcaatat tagtttcaaa acccgtatta aaattgtata atcagaatgc ttatagaact 3060 gaattacata ctgatgcttc ttcagttgga attgctggaa tgttattaca atctgacaat 3120 gaaggagaac ctttaagttt agtgtatgct gtaagtaggt gtaccactga ggtagaatct 3180 aagtatcact ctagcagatt agagcttatg gccattgcat ggtctttgca acgattacga 3240 aattttatta ttgcgttaaa atttactgtt gtaactgatt gtcaaagtct tgtttattta 3300 aatgcatgga aaactaaaaa tgcgcaaatt tctagatgga tgagtgaatt atctgagtat 3360 aatttagaaa ttaagcacag acgtggagag ttaatgcaac atgttgatgc tttatctaga 3420 gcaccaattc taatcgaccc tattattaat tctattaata ctattactga agtaagaaca 3480 aatgactatg aaatccttgt atttcaacga tctgatccat ttattttaga aataatatcc 3540 attttacgaa aaagtgaggg agaacgtagt aaattagaaa agagtaagat gaaggatttt 3600 attctgcgag gtggtttgct atataagaag gtaattcgaa acgataagga aattaaatta 3660 tttgttgtac ctaatgcaat gcgaaaatca ttagttatac gatatcacga tctttgtaat 3720 cattttggag tcgataagac tgttaagcgt attgaagaat attattattt tcctcgattg 3780 agacgatatg taaaaataca cattaaaaat tgtttagagt gtattatgtc aaagcgaaaa 3840 gcaggtccag gtgaaggtga attgcatccg attccgccag gtaaacgacc atttgaactt 3900 attcatattg atcatgttgg cccatttcca actacttctc gaggaaatat ttatatactt 3960 ggtattattg acaatttaac taaatatgtt aagtttgaac ctgtgaaaaa tgtaaccacg 4020 caagttactg ttaaaaagat ggaggaattt gtaaatagat ttggagcacc ggatagaatc 4080 gtatctgatc gcggtacatg ttttacatca catttatttc aagagttttg tttaagacat 4140 ggaattaaac atagtcttaa ttctagtaga cacccacaag caaacggtct aattgaaaga 4200 atgaatcaaa cattaatacc agctttaaga attagtgaac ataataattt aaattataat 4260 tgggatcgcg atgtagcaat agttgaacga aatattaatt ccacaattag tactgcaact 4320 ggtatagagc catatagagc gttattagga tatttaccta gatttaatga tggtaatttg 4380 agagatttaa caaaacattg tgaaacttac actccgccta acgaaattca atgtaaactg 4440 cgtgacagaa tcatagctga acaagctaaa tataaaaatt attacgataa aaataggtat 4500 aagaatgtta aatataatat tggcgatata gtatttatga aaagaaatgc gattgcttca 4560 ggggagtcga ctaagcttca aaatccttat ggagaacctt tagttattac agaagtttac 4620 cctaatgata tatatagagt taagaaacta aacgactatg gagatcgtgg ctatgaaact 4680 actgcaaatg tatctcaatt aaaaatttgg aaaaattctg aaagtattca ttctgatgat 4740 ttatgtgacg atggagaaaa tagtgatcat gatattgaat ataataacga ttgtaatgat 4800 aataatcttg atttagaaaa taatattata gatacgcccg ataatgtaga caatcaaaca 4860 gaaaatgtaa acaaaaacac tttagataat agtattgttg gtaaaaactt tgaattggtt 4920 gaaagaccta aacgtgtacg aaaaacgccg acttatttaa gggattatat tgtataatta 4980 ttttattatt atatttaatg attttgtaaa tgtttgggga ccaaacatac gtttcggatg 5040 gccgat 5046 // ID MBSAT1_MB repbase; DNA; INV; 234 BP. XX AC AY136944; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Mamestra brassicae satellite MBSAT1 sequence. XX KW SAT; Satellite; Simple Repeat; MBSAT1_MB; satellite repeat. XX OS Mamestra brassicae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Noctuoidea; Noctuidae; Hadeninae; Mamestra. XX RN [1] RA Mandrioli M. and Manicardi C.G.; RT "Cytogenetic and molecular characterization of the MBSAT1 RT satellite DNA in the holocentric chromosomes of the cabbage moth RT Mamestra brassicae."; RL Unpublished. XX DR Genbank; AY136944; Positions 1 234. XX SQ Sequence 234 BP; 83 A; 38 C; 51 G; 62 T; 0 other; aaataaaatt tcaagcgcta gctagctcga taggtagatt ttgaatagct cgatcgtaag 60 ctaggggaaa attcgatagg ctagttagac agatccttag agatcgccgc atataaccca 120 atgcagaatt gggacaaaaa ttggaaattt ttaagggaaa ttttcctttt agacagacac 180 agacatgctc gatcgtatcg cgagatcgta gatagatagc tcgaaaaaat attt 234 // ID Copia-114_AA-LTR repbase; DNA; INV; 232 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-114_AA_; KW Ty1_copia_Ele29; Copia-114_AA-I; Copia-114_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-232 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 232 BP; 59 A; 58 C; 42 G; 72 T; 1 other; tgtaatccaa gtagcaatcg gctattctgg tcattccatc catgctgggt ggcaatgcct 60 gttgcatagc aaccaaccca acaaggtcgt agtaagctgc gacccatatt caaattgttc 120 ttcctctttc aattttgcac atcgtgcagt atagtcgcat ttwaataaag ctccttaatt 180 gttctcgaag tatccgcgtt tttcaattcc gagttccggt aacgaaatcc ca 232 // ID CR1_Ele3 repbase; DNA; INV; 4547 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4547 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4547 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 12 CC sequences with >95% identity, and ~97% identical to the original CC sequence. XX FH Key Location/Qualifiers FT CDS 751..1578 FT /product="CR1_Ele3_1p" FT /translation="MAASEVCHSCARGMTAVEVVCGGFCKATFHFSCANIT FT EDLYKEIRGKPAIFWMCNGCRELMSNARFKNALSSMNAAAIEVNDTYLKLI FT EEMKSEIKDSLIAEIRQEIKGGFNQLSPAVLSPLPRYFKFGHANSPKRRRD FT DEAVASDQPSKILRGTGPSTADIPSIHMDRTSDKFWVYLSKICPEVSETDI FT ERLAKDCLHTDEVVVKSLIPRGRSLSTLSFVSFKVGVSLELKPKAMDPVSW FT PQGIEFREFIEDEGRKTQHFWKPVLSVDSGPMTSS" FT CDS 1488..4493 FT /product="CR1_Ele3_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="ISRIHRGRRTKDSAFLEAGTVCRLRTNDQLVEPTESS FT NIPDVLPALSDRGLPRSQLTVYYQNVRGLRTKTNALLQQLLSCDFDVVIFT FT ETWLHTDISNAELTSNYIIYRCDRSSRTSVLRRGGGVLIAVKSELNCKAVH FT LVDCENLEQVAVHITLPHKSLFLCCIYVRPCSHPDIYIKHAESVQELLDKA FT TPFDTTIIVGDYNLPNLSWDYDDDVQGFLPRNASAEQELVVVDSLLSTGLK FT QINDLVNDNGKLLDLVFVSDTESMELFEPPSAMLKVDRHHKPLVLKLDSQS FT PPLAPASERSEYNFARCDFNSINERLATLDWMQLLNLPTVDAAVSAFYDNI FT NQIIRETVPLKTRRPTKLFNQPWWTPQLRNLRNRLRKARKRYFRCRSLENK FT CNVQLAEAEYASLHEIRFRDHIKNIQTGCKENPISFWSFVNARRRSSGIPV FT DVSYCDRSSSTDTTSANLFAEFFQSVFEITESPAPQQYVDQLPSFNIHLPQ FT PEFSLDIVYRALSSVDASKGPGPDRLSPVFIKNCAASLATPIMCIFNRSLQ FT EGVFPDVWKLASITPIFKSGNLHRVDNYRPISILNCLSKVFERLLHDMLYP FT AVQPMISEFQHGFMKKRSTLSNLMTFTSSVVENVEKRQQVDAVYVDLSKAF FT DKVPHDLAVEKLSRLGLPPWIVNWLKSYLSSRKAFVKIRDSSSNVFEITSG FT VPQGSHLGPLIFILFINDVCDHLRSSKLLFADDLKIYRVVKSTLDCCVLQA FT DIEKLLSWCQINGMQVNASKCKVITYTRGQSPIRFDYTMNHAPLERVYSIK FT DLGIILDSKLRFTEHISKTIAKASSMLGFMCRNAADFDDVYALKTLYCSLV FT RSVLEYGVQIWAPHHAVHELRIERIQKRFVRFALRRLPWTNPDSLPAYEHR FT CALIGLETLSKRRTSLQRLFIFDMINHHIDCNSVLQKINIHAPLRQLRNTN FT FLWTPAHRTAYGYNNPLDRSCRLFNGVYDLFDFNISRLVFKNRIKNI" XX SQ Sequence 4547 BP; 1252 A; 1091 C; 936 G; 1268 T; 0 other; ctggcaacac tgctgtgctg attagttgtg ttcttctgtt tgtgtcatcg attttaaact 60 tatttgacga aatctacacc cgaaattgct aaacgctgct gaattcatcc cacaagtgcc 120 cgttccgtac tgctcacagt ttgtttgcaa cacacagtca ggtaggataa gtgaaattgc 180 attttccgag caggcttgag tgtggtttga gtaagccgcc caccgccatt gcccccctat 240 aaaaaaatct cccacctcgc cgctctccac ctgcactgct attgcaacac tgcactgtgg 300 attgcgccac caaaacataa acaataacat ccagcatctg ttcactccgc aaagcctcgc 360 aatcaattga tattacgcct ccatgtaccc acctgtgtac cattcatgca aataaacaac 420 gccagcttct gctcatcttc tcattctgca atccgaaaaa cccatcagcg cttcgtccgc 480 aacatagtcg tccgcaattc cgcatcgccg tgttcgctgc caatcatcat cgcaaaattc 540 ttcgctgcat aaaaaaggta aacaaccgta tcatcacgtg catgagtttg tgtgagtaga 600 acgaaagcag acaacaatca acttcgcatt gtttgagatc cgtcaacact cccctgtcat 660 cgatcaagtc tgcgtagcca tctgtttgtc agtgcccaaa ctaacgttag agcatctcca 720 ccggtgttgt tttgcaacag caaatatcaa atggctgctt cagaagtttg tcatagctgc 780 gcgcgtggta tgactgccgt cgaagttgtt tgtggcggtt tttgcaaggc aactttccac 840 tttagctgcg caaatataac ggaagacctt tacaaggaaa ttcgtggtaa acccgctatt 900 ttctggatgt gcaatggctg tcgggaactg atgagcaacg ctcgttttaa aaacgcgtta 960 tcatctatga atgccgctgc aattgaagtg aatgatacct atctaaaact gattgaggaa 1020 atgaagagcg agattaaaga cagtttaata gccgagatca gacaggaaat taaaggtgga 1080 ttcaaccaat tatctcctgc ggtgctatct ccgcttcctc gatacttcaa gtttggtcat 1140 gccaattccc caaagagaag acgtgacgat gaggctgtgg cgtccgatca accctctaaa 1200 atccttcgtg gtacgggacc ttcgacagcc gacataccat caattcacat ggaccgaaca 1260 agcgacaagt tctgggtgta cttatcgaaa atatgtccag aagtatccga aactgacatt 1320 gaaagactcg ccaaagattg cttgcacacg gatgaagttg ttgttaagtc gctgataccg 1380 agaggccgct cactgtcgac gctgtctttc gtttcgttca aggttggagt cagcctcgaa 1440 ttgaaaccta aggccatgga tcctgtatca tggccacaag gaattgaatt tcgcgaattc 1500 atcgaggacg aaggacgaaa gactcagcat ttttggaagc cggtactgtc tgtagactca 1560 ggaccaatga ccagctcgta gaaccaaccg aatcaagcaa cataccagac gtattgcctg 1620 ccttgtccga tcgcggttta cctcggagtc aattaactgt ttattatcaa aacgtccggg 1680 gcttgagaac caagacaaat gcgctactcc agcaactcct ttcctgtgac ttcgacgtcg 1740 taatcttcac agagacgtgg cttcacaccg acatttccaa tgctgaattg acgagtaact 1800 atattattta ccgctgtgat cgaagctccc gcaccagtgt attgcgtcgc ggtggcggcg 1860 tgttaattgc agtgaaatcg gagctgaact gtaaggctgt acatcttgtc gattgtgaaa 1920 atctcgagca agttgctgtg catattacgt tgccacataa atcgctattc ctatgttgca 1980 tctacgttcg tccctgtagt catcccgaca tctacatcaa gcatgccgaa tccgtgcaag 2040 agcttcttga caaagcaact ccatttgata cgactattat tgtgggcgat tacaacctgc 2100 ctaatctttc ctgggattat gatgacgacg tacagggttt cctgcctcga aatgcctcag 2160 ccgaacaaga gctcgttgtg gtcgattcct tactgtcgac agggcttaag caaatcaacg 2220 atttagtcaa cgataacgga aagctgctcg acttggtttt tgttagtgac acagaatcga 2280 tggaattgtt cgaaccgccg tctgctatgc tgaaagtaga tcgtcatcac aagccactag 2340 ttttgaaact tgacagtcaa tctcctccat tggctcccgc atcagaacgc agtgagtata 2400 atttcgcaag gtgtgatttt aattccatta acgagagatt agcaaccttg gattggatgc 2460 agttacttaa cctcccgact gttgatgctg ccgtatctgc tttctacgac aacattaatc 2520 aaataattcg cgaaactgtt ccgttgaaga ctcgtaggcc tactaagctt ttcaatcaac 2580 catggtggac cccacaattg cgtaatctgc gaaaccgttt gcgtaaggct cggaaaaggt 2640 actttcgctg caggagtttg gaaaataagt gcaatgtaca gcttgcggag gccgagtatg 2700 caagcttaca tgaaattcga ttccgagacc atattaagaa tatccagact ggctgcaagg 2760 aaaatccgat atcattctgg tcttttgtca acgctcgtag aagatcttct ggcattccgg 2820 tcgatgtatc ctattgtgat cgaagttcct ccaccgatac cacttccgct aacctctttg 2880 ctgagttctt ccaaagcgta ttcgaaatca ctgaatcacc agcaccgcag caatatgtcg 2940 accagctacc aagcttcaac attcatctac ctcagccaga attcagtctt gacattgttt 3000 acagagcgct atcatcagtc gatgcatcga agggccctgg tccagatcgt ctttcacctg 3060 tttttatcaa aaattgtgca gcatctttag ccactcctat catgtgtatt ttcaaccgtt 3120 ctctccaaga aggcgttttc ccagatgtct ggaagcttgc atccatcaca ccaatattca 3180 agtccggaaa tctgcataga gtagacaact atcgaccaat ctcgatcctg aattgcttgt 3240 ccaaggtttt cgaaagactg ctgcacgaca tgttgtaccc agcagttcaa ccaatgatct 3300 cggaattcca acacggattc atgaaaaaac gttcgacact ttccaatttg atgactttta 3360 ccagctccgt tgttgaaaac gttgagaagc gccagcaagt agatgcggtt tatgtagatc 3420 tttccaaagc ctttgataag gttcctcacg accttgctgt agaaaagcta agccgccttg 3480 ggcttcctcc gtggatcgta aactggctga aatcgtattt atcttctagg aaggcttttg 3540 tcaagattcg tgattcttca tccaatgttt ttgagattac atctggtgtt ccacaaggaa 3600 gtcacctggg accgcttatt tttattctat ttataaatga tgtgtgtgac catttaagat 3660 cctctaaact gctatttgca gacgacttga aaatttaccg tgtggtaaaa tcaactctag 3720 attgttgtgt tcttcaagct gacatcgaaa agctattatc gtggtgtcag atcaacggaa 3780 tgcaggtgaa tgcctccaaa tgcaaggtaa tcacttatac tcgtggacag tcaccgataa 3840 gatttgatta cacgatgaat catgctccgc ttgaacgagt ttactctatt aaggatctgg 3900 gaatcattct agacagcaaa ttgagattta ctgaacacat ctcaaaaaca atcgctaagg 3960 caagttccat gctcgggttc atgtgccgca atgccgcaga cttcgacgac gtttacgctc 4020 tgaagacatt gtactgctcg ctagttcgaa gtgttctcga atacggagtt caaatatggg 4080 ctccacatca tgctgtccac gaattgcgta ttgaacgcat ccaaaaacgt tttgtacgat 4140 ttgctttacg gcggctccct tggaccaatc cagatagctt accagcatac gaacatagat 4200 gtgcgcttat cggactggaa acgctgtcaa aacgccgtac ttcgctgcag agattattca 4260 tatttgacat gataaatcat catattgatt gcaacagcgt gctccagaaa atcaatattc 4320 atgcccctct ccgacaactg cgcaacacta acttcttatg gactcctgca caccgaaccg 4380 cttatggata taataaccca ttggacagat cttgtagatt attcaacggc gtttacgatt 4440 tgttcgactt caatattagt agattagttt ttaaaaatag aattaagaat atttaagcag 4500 tctgtgtggc ttccgccaaa gatgttgaac taataaataa ataaata 4547 // ID Copia-2_DPu-I repbase; DNA; INV; 4510 BP. XX AC scaffold_41; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_DPu_; KW Copia-2_DPu-LTR; Copia-2_DPu-I. XX NM Copia-2_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4510 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 667-667 (2010). XX DR Genome; scaffold_41; Positions 662756 667265. XX CC Positions [1714-2097] - Integrase core CC 'GCTAG' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 2810..4318 FT /product="Copia-2_DPu-I_1p" FT /translation="MSSHIKNETWTLVPLPPGRECIPSGWDFKMKTDRSGL FT PCRRKARFFAKGYRQVKGIDYQESFAPVVRYDSLRVIISIAAHRDLELIQL FT DVTATFLNGVIDELVYIAQPEGYVVTGRESEVCRLNKGIYGICQASRIWNK FT TLHDALIDYGFTQSSADPCVYHQITNSHYLIIAVWVDDGLVAGSSMDVVNE FT VVRFLNGRFEITAIPADLFVGIVLTRDRPNRKIYMSIPQFIDKIMAKFNFS FT DAHPVSLPVLKGAPRLSRDSSPSSPADIAAMSTIPFREAVGCLMYAALTVR FT VDIAYKASQLAQHCKNPGMEHWKAATRVLRYLSGTRDHGLCFGGNGPTRQA FT LVGYWDADYAGDPNTRRSTSGYLFIFNGAAITWSSRKQPIVALSTMQSEYI FT AASDSTREAVWLRRLLDNLGESQMEPTCLRCDNESAIGLAYNSLAHKGSKD FT IEVRYHYIREQVADGTIDVEYVDTKNQLADVLTKAVDGETFSKCLSGFGLA FT RVPKELG" XX SQ Sequence 4510 BP; 1221 A; 1129 C; 1021 G; 1139 T; 0 other; ggttatgggc ccaggccacg ctaagctttg aacacattcc agaatggctc agaatttttc 60 actagatgtg atcaagcaca ctagaaagtt cgatggaaag gaatttacaa cttggaagca 120 caacatggag atgttgttca cattaaagaa tatgaaactc atgtggatgt aagttaccca 180 accctttttt tttacgttga atgagaattt gcacattcta ataccgttgc atctctttct 240 atacactcag ggtcaactca ctgaacctga agtagaattt gatgacgatg acccatccat 300 catgttaaac ggtgatgata ttgaagaatg gaaaatgagt gactgctatg ccaggtacct 360 catctttaat agctgcgatg agatcaggaa gcaagcccta ctcaacagca gaaccagcca 420 tgaaatgtgg actagattgg aaactcagta cttacaacgt gctgcagaca acaagcactt 480 gctacacaga gactttctga atctacggta agcaatttac tagcacactt gtgttaagca 540 gagactaaca attgtctttc taattctact cgcagaccca aacctaatga agacattatg 600 attcacataa cggccctaga gtccatggcg actgaactta atgatctagg agtgcacgtg 660 acagaacacg acttaatcac gaccatcatg tgtagtttgc cagccagatt tggattctta 720 tcttcctgtt gggacaacgt gccagacaac gaacgatcca tggatgctct tcgggcaaga 780 atcgtgtctg agcaacggcg aattgaactc cgaaggacag aagaggaagg ttttaaacct 840 ttaactaaca ctccagaggg gactgctctt caagcttacg gtggacaaaa agggcatttc 900 ccacgtgcgt ttcgaggaag tcgtggccga ggtggagcat cacgagatat catcaatcgc 960 gacacagcaa aatgcacgta ctgcggaaaa ccacgccact acgaatttga atgcaggact 1020 cgtattgcta acgaaggaga caaacaaaca acctcattca aaagacccaa caacgacggc 1080 gatgggagtg gtccctcacg gaaagcagat ttctccttca tctctgtatg ctacctaaca 1140 acaacagacc ctgatgctct ttaccttgat tccggagcca cccggcacat gagtggagta 1200 cactcctact tccaagagtt gagagacata ccaccaaaca gctggccgat caatggcatc 1260 ggaggtacgg tgctacatgc caaaggtgtt ggaacgataa agttgtcttc ccatgtacac 1320 ggtgagacga tcaatggaag cctgaaggat gtgctgtacg tccctggatt gggagtaaac 1380 ctcatataag tcgtgtgcct atctatcaac ggatattcag tatccttcca cggtctagag 1440 gcttgtatca tgcgtgacaa tactgttatc atgacagcat caaggtcggg tgaaactctc 1500 tacaaagtta acgctgtcgt ctctaccccg tcgaccatgg gccttgcagt cgcaacaact 1560 caagcaacgc tcaacacctg gcacgaacgt ctgggacacg tgaatcgtcg agctgttcta 1620 cgcatggcat caggcatcgg tgtcactggg atggacattt ctcctagatc taccactctc 1680 aacgaatgct gtcatgggtg tgaagtcgga aagatgcaca aacttccgtt caccaagagt 1740 caatcctcgt tctcagcaat tggggaatgt atcgtctccg atctagtcgg tcctttacaa 1800 gtagaatcca ttggaggtgc gcggtattac gtcatattca aagacgtctt tatcaagtac 1860 aagactgcct acttcctcaa acaaaagtca gagacagctg attgttttct cgcatacgtc 1920 aaaaaggtcc acaccgatac caatcaacgt gtcaaaatgc ttcgctgtga cggtggtacg 1980 gagtataaca accgttactt gaaaacctct ctagaaaatc ttggcatcaa acttcaaacc 2040 agtgcgccgt atacgccgga taaaacggga tcgctgagag agaccatcgc acaactgtag 2100 aatccgctcg cagccaaatc catgcccgtg gtattccact caaactatgg gctgaagcta 2160 taaactattc ggtctatgtc ttgaacagga ccttatcaaa tacgaaaagc gtaaccccat 2220 ttgagcactg gtatggagta cgacccgaca tctcgaacct tcgtattttt gggtcggtca 2280 gctatttttt agtcgctgat gatctacgcc agaaactgga tcccaaagct accaaaggtg 2340 cctatgttgg tgagagtgaa gagcaaaagg ctagtcgggt gtttgtggaa gctaccgggc 2400 ggacccacat cacacgccat gttcgagttt acgaaaatat cccatattgg tcaccggtga 2460 tagagcccgc ctcaagtgtt gcagaaccac tgccatcacc tctcattgat gacatcacac 2520 ctgagaccac catccatcct gcaacatcgg atccggctct cgggaccaaa actaagctcc 2580 gcgccgttca cctgcccgtc cggaaatccg ccagaggtct cataccgaag aaactttttc 2640 cgatggacat ggattgtgcc atggcggaca cacatccccg agatgcatct atgtactgtt 2700 tcatatcgat ggcgttgaag tcaagctctc tgtactacga accgaagaca ttccgtgaag 2760 ctgtggatgg ccctgaaggt acgctgtgga acaagcagcg gacaacgaaa tgtcctctca 2820 catcaagaat gagacttgga ctcttgtccc actgcctcca ggccgggaat gcattcccag 2880 cggttgggat ttcaaaatga agacggacag atcaggcctc ccgtgtcgcc gtaaagctcg 2940 atttttcgct aaagggtatc gacaagtcaa gggcattgac taccaggagt cctttgcacc 3000 ggtcgttcgc tacgactctc ttcgagtcat catctctatc gcagcccatc gtgatctgga 3060 actcatccag ctcgacgtga cagccacttt tctcaacggt gttatcgatg aactggtgta 3120 catcgctcaa cccgaaggct atgtcgtcac tggtcgtgaa agtgaagtgt gccgactgaa 3180 caaaggaatc tatggcatat gccaagcctc tcgcatctgg aacaaaacgc tacacgatgc 3240 cctcatcgac tatggtttca cgcagagctc tgcggaccct tgtgtgtatc atcagatcac 3300 taattcccac tatctaatca ttgccgtttg ggtcgatgat ggccttgtgg ccggcagttc 3360 catggatgtt gtcaacgaag tagtccgatt cctcaacggg cgatttgaaa ttactgccat 3420 tcctgctgac ctcttcgtcg gaatcgtcct cactcgggat cgacccaacc gaaaaatcta 3480 catgtcgatc cctcagttca tcgacaaaat tatggcaaag ttcaacttct ccgacgcaca 3540 tccagtttct ctcccagtac tcaaaggagc gcctcgtctc tctcgtgatt catcgccctc 3600 ttcacctgcc gacattgctg ccatgtctac catccctttc cgcgaagccg tggggtgtct 3660 catgtatgca gcgctcactg tacgtgtaga catagcttac aaggcaagcc aactagctca 3720 acactgcaaa aatccgggga tggaacactg gaaagctgca acgcgtgtgc tgagatacct 3780 atccggaaca cgcgatcatg gtctctgctt cggcggaaac ggacccacaa gacaggcact 3840 agtcgggtat tgggatgcgg attatgcagg ggatcctaac accaggcggt ccacttccgg 3900 ttatctgttc atcttcaacg gagcagccat aacttggtcg agtcggaagc aacccattgt 3960 tgcactgtcc accatgcaat ccgagtacat cgccgctagt gattctaccc gcgaagctgt 4020 gtggttgcgt cggcttctcg acaatctcgg tgaaagtcag atggaaccaa cttgtctccg 4080 ttgcgacaac gaaagtgcta tcggtctagc gtacaattct ttggcccata aaggttcgaa 4140 agacatagag gtgcgctacc attacatccg tgagcaggtt gctgatggaa ccatagatgt 4200 ggagtatgtc gacaccaaga atcaactagc tgatgtcctc acgaaagccg tagatggtga 4260 aactttttct aaatgtttga gtggatttgg cctagcgcga gttcctaaag aattgggcta 4320 acttgtgttt ctttcctcta attactctta tttgattttt gcgcattctt aattgtcttc 4380 gaagactgga gtgaaaacta tttttttttt gaacaattct tttttttatt attcatcctg 4440 tactgatcta tgtcatcaca ttaagttcaa tctgtatttg tgtttgttgt ctgtttcgaa 4500 tgagagagag 4510 // ID CR1-87_HM repbase; DNA; INV; 2441 BP. XX AC . XX DT 12-APR-2009 (Rel. 14.04, Created) DT 12-APR-2009 (Rel. 14.04, Last updated, Version 2) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-87_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2441 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(4), 734-734 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 158..2116 FT /product="CR1-87_HM_1p" FT /translation="MTSNLIKSSKRKQKLYIKFLKSKDKGDEKTYKAYKQF FT FQKNIRQAKIMYYSNLLNKNKLDIKKTWSIINEVTGREKKKFFSIPQKIFC FT DNKYITNESDIYHEFNKHFVNLGPSLASKIVMPNKKFDEFIGEKTLSCISN FT EKISYKEFNKALTELKRNKSCGYDDITSNVAIYVIECIRKPLFYLISSSFE FT NSTFPDKLKLAKIHPVFKNGDCSSVCNYRPISLLPVFSKIFERVIFNRIYN FT YMTLNTLLYKNQFGFQKYCSTEHAIIELTNKIFMFFDKGEHVLGVFIDLSK FT AFDTVDHGILLSKLKHYGIKGLIYKWVKSYLFNRKQFVVLEESGALDIVCG FT VPQGSILGPLLFLIYINDMNKASHKISSIMYADDTNLFYNNSDVKILFETM FT NTELESFNQWFIANKLSLNCEKTSYTLFHKIRQSTNLPLKLPKLSINKNEI FT NRVNHSKFLGVIFDENLSWKKHISLIEAKISSAISVLYKSRAFLDYKSRKM FT LYFSLVHCHLSYANIVWANTHKTKLLRLQSLQNHACKAINYLKRLENPSPV FT MKYMNVLNISKLNSLQILIFMYKYKNNLLPKVFSDIFTSAFSQKYNLRSNN FT NHNYHLSIVKSQVSKFSIISQGPSLWNQLKNNNIKNLKSTFSFKRQMKLHL FT LNF*" XX SQ Sequence 2441 BP; 951 A; 335 C; 326 G; 829 T; 0 other; ataatttaaa aaataaacta tttgaggagt cctggaaaaa tacagctaaa gatatcaggc 60 atttaataac tttacggcta cttttaaatg tatttttgat actgtctgtc caataataac 120 atctgaagta aaaaataaag aattaaaaaa tccatggatg acgagtaatc ttataaaatc 180 gtctaaaaga aagcaaaaac tttatataaa atttttgaaa tcaaaggata aaggtgatga 240 aaaaacgtac aaagcctata aacagttttt tcaaaaaaat attagacaag ctaaaataat 300 gtattattct aatctgctaa ataaaaataa attagacata aaaaaaacat ggtcgattat 360 aaatgaagta actggcagag aaaaaaagaa attctttagc atacctcaaa aaatattttg 420 tgataataaa tacattacaa atgaatcaga tatttaccat gaatttaaca aacactttgt 480 aaatcttggt cccagtctag cttcaaaaat agtgatgcca aataaaaagt ttgatgaatt 540 tattggagaa aaaaccctat cgtgtatatc aaatgaaaaa atatcctata aagaatttaa 600 caaagctctg actgaactaa agcgaaataa gtcatgtgga tatgacgaca ttactagtaa 660 tgtagcaata tatgtcattg aatgtattag aaaaccatta ttttatttaa tatcatcttc 720 ttttgaaaac agtacttttc ctgataaatt aaaattagct aaaattcatc ctgtttttaa 780 aaatggtgat tgttcttctg tttgcaacta ccgtcctata tcactactcc ctgtgttttc 840 aaaaatattt gaaagagtaa tttttaatag aatttataat tacatgacat taaatacact 900 tttatacaaa aatcaatttg gatttcagaa atattgctca actgaacatg ctattattga 960 acttactaat aaaatattta tgttttttga taaaggggaa catgtactgg gagtatttat 1020 cgatctctca aaggcattcg atactgttga tcacggaatt ttactttcaa agttaaaaca 1080 ttacggcatt aaaggcctaa tatataagtg ggttaaaagt tatcttttta atagaaaaca 1140 gtttgttgtc ctggaggagt caggagctct tgatattgtt tgcggagtcc ctcagggatc 1200 aatcttagga cctttattat ttttaatata tatcaatgat atgaataaag cttctcataa 1260 aatatcgtca attatgtatg cagacgatac aaatttattc tacaacaatt ctgatgtaaa 1320 aatattgttc gaaacaatga acactgaact agaaagcttt aaccaatggt ttatagctaa 1380 taaattatca ttaaattgtg aaaaaacaag ttatactctc ttccataaaa taagacaatc 1440 aactaatctt ccgttaaagt tgccaaaact gtcaatcaat aaaaatgaaa taaatcgagt 1500 aaatcattca aagtttttgg gagtaatatt tgatgagaac ctatcatgga aaaaacatat 1560 tagtctcatt gaagcaaaaa tatcttctgc tataagtgta ttatataaat ctagggcctt 1620 tcttgattat aaatcacgca aaatgcttta cttcagtctt gttcattgtc atctctccta 1680 cgctaatatt gtttgggcta acacacacaa aactaaatta ttaagattac agagcctaca 1740 aaaccacgca tgtaaagcaa ttaactattt aaagagatta gaaaaccctt cccctgtaat 1800 gaaatacatg aatgttttaa atatatcaaa acttaattcg cttcaaatat taatatttat 1860 gtataaatat aagaataact tgctgccaaa agtgttttct gatattttta catcagcatt 1920 ttcacaaaaa tacaatcttc ggtcaaataa caaccataac tatcacttga gcatagtaaa 1980 atcgcaagtg tcgaaattct caattattag ccaaggcccg tcattatgga atcaattaaa 2040 aaataacaat ataaaaaatt tgaaaagcac tttctctttt aaacgacaaa tgaaactgca 2100 tctccttaat ttctgacata tatgtaagta tgtttgtata tgtgtgtgtt tttatgggta 2160 tatgtatata tgagtatata tgtatatatg agtttttttt tattcttcac gtagtattta 2220 acagataaaa tatttgttgt aaattttcta aagccatgtg ggctttaaat gatattgtaa 2280 aggggctcta tgaaaagatt gtggtgacac cagtcatcct tatcttcttt gagcccctgt 2340 ctgttttata tatactcttt gtgtaatgta aaagtttcat tgtaatttta ttattttatt 2400 tttatataaa cagcgaaata tattaaaaaa aaaaaaaaaa a 2441 // ID Sola1-6_AP repbase; DNA; INV; 3804 BP. XX AC ABLF01029374.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-6_AP. XX NM Sola1-6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3804 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(726..770,789..998,1045..2910) FT /product="Sola1-6_AP_1p" FT /translation="MAYILCLNFGLKYVYIILFFTVGSNNDDVIGNNNIMV FT PFAFENSNEYMSNNNNNDSAESFSNNIENQNDFTESTETSGLNFKINFNKH FT FTHNIGSGCNVIVDNSEVNTGLENADESNVLAVRTRKKRSNPKNWSYNLNK FT KNKLNGKEYYGKKKTNGVWSYNIPKPAKVLKFVCNCKLSSKNTKLKCKKVK FT EEERLKIFEEFWKLSKNEKQLYIKNYTKIISTLRLRGTVTPSRRNSSIQYY FT LQTADDRVRVCKVLFLNTLGIGEKVCLKILSHKDSNNSISDNDIDSNFNPT FT EDTLNISNNQRSTKTTVTEKKSFLYKFLNSLPKLESHYCRANTSKLYLEPN FT WTSKTELYNFYCNDYCTTHNTTPMSSYVFCKALEEQNIGLFQPKKDACDIC FT TAFDTGNLSQEEKEKHDSMKKEARMEKERDKMSENEIFTMDLQSVLLCPKS FT NVSSLYYKTKLIVHNFTVYDLKRKNGFCFLWNESEGGLSSNEFSSIIIYFL FT QKYVINSNIGNLNSIILYSDGCTYQNRNSTLSNALLNLSINSNVTIIQKFL FT QRGHTQMEVDSMHSTIERKVRNRKINVPADYISICKSACKKSPYTVEYLTH FT DFFKSFSSLNFCKSIRPGNKKGDPVVTNLKAIKYEPDRQIKYKLRFTEEDW FT STLKLKNSKHIAVKFDDLPQLHNDRLKIKKEKFEHLQQLKLTMEKDYHAFF FT DNLPKE*" XX SQ Sequence 3804 BP; 1532 A; 461 C; 507 G; 1304 T; 0 other; cctatttagc aaacggcgta actccaaaaa attcaaattt cattttttca tatcattcaa 60 tagaaaaaaa aaatgcgcgt cgaatggtat cataatcata atatttaagt gtgttttctt 120 ccatgaaacc ggtgataatt tatacgcatt agtcgtaact ccgataaaca aataatataa 180 tttctataga aaccggcgta tctccatatt atcagtttac aacatcattc tgtttaaatt 240 tatgttaaaa atctaaaatg tatcgttgct aaaaccatta aatttattct gttggctaaa 300 acgtgaacaa gatgacatct agagcaaaaa aacttttaga aatggctaaa agtgttgctg 360 caccatacaa tatcatgcca agttctccaa agttgactcg gaaaaagtca agtgtgtaaa 420 ataaaaaact ttaattgtat tatataaata tgaaatttgt tttattttat tccataaata 480 cgtagtgtca attcaaaata ggtaggtatg tactttttat ttttcgttta ggtaatatta 540 gaaatgaaaa caaacaaaat aacagagctg ttgaaaacaa tttcgatatt gacacttata 600 ttagtgaaca gttagaaaat ttaaatgaag atgatcagtt aatttatact tcaaataagt 660 atgacatagt agagtttgtt gctagtgaag ctgttcaaga aactggtagt ttagattcag 720 gtttgatggc atacatttta tgtttaaatt ttggtttgaa gtatgtatat tagctactag 780 atttttaaat aatattattt tttactgtag gtagtaataa tgatgatgtt attggaaata 840 ataatattat ggttccattt gcatttgaaa attcaaatga atacatgagt aataataata 900 ataatgattc tgcagaaagt tttagtaata atatagaaaa tcaaaatgat tttaccgaga 960 gtacagaaac ctcaggtttg aattttaaaa ttaattttta actaacttct aatcgatagg 1020 tacttattat gtttagtata gtaaaacaaa cattttactc ataatatagg atcaggatgt 1080 aatgttattg tagacaattc agaagttaat actgggttag aaaatgctga tgagtccaat 1140 gtgttagcag tacgtacaag aaaaaaaaga tctaacccaa aaaattggag ttataactta 1200 aacaaaaaaa ataaattgaa tggcaaagaa tattatggta aaaaaaaaac gaatggtgta 1260 tggtcataca atatacctaa acctgcaaaa gttttgaaat ttgtatgcaa ctgtaaacta 1320 agctcaaaaa atactaaatt gaaatgtaaa aaagttaaag aagaggaacg gttgaaaata 1380 tttgaagaat tctggaaatt gtcaaaaaat gagaaacaac tatacattaa aaactataca 1440 aaaataattt caactttaag acttagaggt acagtaactc cttcaaggag aaattcatct 1500 atccaatact atttacaaac agctgatgat agggtaagag tttgtaaggt actattttta 1560 aatacactag gaattggtga aaaagtttgt ttaaaaattc tcagtcataa agacagcaat 1620 aattcaattt ctgataatga tattgatagt aattttaacc ccacagaaga cactttaaac 1680 atttcaaata atcaacgaag tactaaaacc actgtaacag agaagaaaag ttttctttac 1740 aaatttttaa actctttgcc caaattagaa agccattatt gtagagcaaa tacatccaaa 1800 ttatacttag aaccaaactg gacttctaaa actgaattgt ataattttta ttgtaatgat 1860 tattgtacca cccataatac cacaccaatg agcagctatg tattttgtaa agctttagaa 1920 gaacaaaata ttggattgtt tcaacctaaa aaagatgcct gtgatatttg cacagccttt 1980 gatacaggaa atttatccca ggaagaaaaa gaaaaacatg acagtatgaa aaaagaggca 2040 aggatggaaa aagaaaggga caaaatgtca gaaaatgaaa tctttactat ggatctgcaa 2100 tctgtcctct tatgtccaaa atcaaatgtt tcgtcgcttt attataaaac caaacttatt 2160 gtgcataatt ttacagttta tgatttaaaa agaaaaaatg ggttctgttt tctatggaat 2220 gaatctgaag gtgggttatc ttccaacgag tttagttcta ttataattta ttttttacaa 2280 aaatatgtta taaactcaaa tattggaaat ttgaattcta ttatactata tagtgatggt 2340 tgtacatatc aaaacagaaa ttctacttta tctaatgcat tacttaactt atcaatcaac 2400 tcaaatgtca ctataataca aaagttcctt caacgtggcc atacacaaat ggaagtcgat 2460 agcatgcatt ctacaattga aagaaaagta cgaaacagaa aaataaatgt accagcagac 2520 tatatttcaa tttgcaaatc agcttgtaaa aaaagtcctt ataccgtaga atatttaaca 2580 catgattttt ttaagtcatt ctcttcctta aatttctgta aatccattcg acctggaaat 2640 aagaaaggag atcctgttgt aacaaattta aaagctataa aatatgaacc tgatagacaa 2700 atcaaatata aattaagatt tactgaagaa gattggagta ctcttaaact taaaaatagc 2760 aaacacattg cagttaaatt tgatgatcta cctcaattac acaatgatag actcaagata 2820 aaaaaagaaa aatttgagca tcttcaacaa ctaaaattaa ctatggaaaa agactatcat 2880 gcgttttttg ataatttacc taaagagtaa agctattttg cacaacaatt gataatgatt 2940 tataattttt aatttgttat aattttactt agacatttta tttaaagagg aaaaaatgtt 3000 ttatttgttt attattttta gtttctaata ttaattaatt tgtataatat tgttattatt 3060 tatcaaggtt atttactttt attttatatt atattctata ccaagaatta gttcctacct 3120 aatgttgaac ttaaacagtt acttaattat taaaacagat actcaatttt attttataaa 3180 acttattata ttttataaca tttattttat atttataaat attgtatact gtgatgatta 3240 atattaatac tattcatcaa tattgattaa taattaatat tatttataat ttgttgttat 3300 ttaggttaag aagttgtaaa accttaaata attattttta ataaatacaa agtcaaaata 3360 aatcaagtaa taaatataat ttataatatt tttaaaataa attaacttac attgttctca 3420 aaattatttt tgaaaacaat aaaataaaat ttgtcttaac tataaaaata gaaattctag 3480 aagaaaaaga cgtaagtcca aaaaattaga attgatcgta tctccaagaa tgtaactatt 3540 atagaagaaa cagacgtaag tcgaaaaatt agaattgatc gtatctccaa gaatgtaact 3600 attatagaag aaacagacgt aagtccataa aaaatgttat ttattacata catttaatta 3660 tattacttct aaaataccaa ctactaatga agtaatacta acaaatattt gtttggtcaa 3720 aattagttaa ttggaattta ttaaacgttt ttttcctcta ctgtaaaatt tggtgttttg 3780 tagttacgcc gtttgctaaa tagt 3804 // ID Chapaev3-3_AA repbase; DNA; INV; 3631 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-3_AA is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-3_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3631 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 56-56 (2008). XX DR [1] (Consensus) XX CC Chapaev3-3_AA belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-3_AA is a very young family of mosquito Chapaev3 CC transposons: genomic copies of Chapae3-3_AA elements are ~99% CC identical to their consensus sequence, which was derived from CC multiple alignment of four Chapaev3-3_AA elements. Chapaev3-3_AA CC contains 13-bp TIRs, imperfect ~400-bp subterminal inverted CC repeats, and encodes a 552-aa transposase (2 exons). XX FH Key Location/Qualifiers FT CDS join(877..1084,1163..2610) FT /product="Chapaev3-3_AAp" FT /note="transposase." FT /translation="MARKCRNDPDSFCFICGQFITVKQKRKLTSFIKLCYE FT LYFKRKLIHQDKEWVPHVCCSTCARLLSAWFKGARHMHFAVPMIWREPTNH FT AKDCYFCLVTVQGISTKNKKMICYPNVRSVDQPTPHSNDLPIPVPPNFQGE FT EMDGVEKISDILTECSSVATYSSNVDVRSMSQADLNDLVRDLNLSKKQSEL FT LASRLKERNFLERDVQITFYRQRQKEFENFFCEENGLLFCQNVDGVMDTLG FT HSHVPSEWRLFIDASKSSLKAALLHNGNQYPTIPVAYSVTMKETYSNLKQV FT LNKIHYDDHCWPICGDLKIIALLLGMQLGFTKHSCFLCEWDSRDRKKHYVK FT KIWPKRQDFTPGNKNVIHSPLVDPLLVLLPPLHIKLGLMKNFVKAMDQSSS FT GFDYIKTKFPEISEVKKKEGVFIGPQIAKLMKDQEFDKTLSKAELRAWYAF FT KTVCRDFLGNRKAENYREKISELLKAYRSMGCNMSLKIHFLHSHLEFFPDN FT LGDVSDEHGERFHQDLKIFEKRYQGQNNTNMLADYLWTLQAGSSEIHHRKS FT YRKTF" XX SQ Sequence 3631 BP; 1188 A; 657 C; 654 G; 1132 T; 0 other; cacagcggaa catgattttg tctcaaagac caaattatgt gtctctgaag gtttttgatc 60 cgctgaatcc aaatctgact tccgatttgc tccagcacgt cacaattttt agatatagct 120 cgttatatgt agcaaaatta gcgatttcaa tgatttttga caatcaatct atgagtaggc 180 aaaacgtatt ttatgacaat caatgtcaaa attagttcat ttacatctaa tttatcgatt 240 catgtaaaaa agttgttcat tttaatataa ttttaaacag tttttagcat tttgtgtaaa 300 gtttgctttt ccgaaaaaca taatatgcaa gtcagccttt aaatgttgca tgcaagtaag 360 aaggtgaacg cgtgagtttt aaattgacta aaaaaattta aattacaaat gctagcgttg 420 tctcaaggat aaaaatgtgt gtctctagtg gaaccaccac ccctgaatct aaatctgtgg 480 ttgtaatgac tccagcacgt cacatttcat cgaaattact gtattttgat agagtaacaa 540 acgagaacaa tgtgaagtaa atctatattt ccgttcagct ttggaacttt tatattattt 600 attttattct attacgctaa catcagtcta ccttgataaa tgatctggta taataacatt 660 tttttacaat aattacttgt agtgtaatac atttcaatgg gacacaaaat cagcgcgttg 720 atttcaaatc agttgcatgc gactttgttc gtatgccttt gcatacaagt tgataggaat 780 tttgacagct ttacaagtcg cgcaaaattg gtgaaccgga aaatcgttaa ttgtccgttc 840 tttatcaaac tattttcggt ttcaaatccc accaaaatgg cacgtaagtg tcgcaacgat 900 ccagactcct tttgttttat ctgtggacag tttattaccg tgaagcaaaa gcgtaaactc 960 acgagtttca tcaaattatg ctacgaatta tatttcaaac gcaagttgat acaccaagac 1020 aaagaatggg ttccgcatgt ttgctgctct acatgtgcta gacttctttc cgcttggttc 1080 aaaggtatga ttttttcatt attatgaatg ttggttttgt atttatgcat tgaaaataac 1140 aaacacatat caacttgctt aggtgcaaga cacatgcatt ttgctgtacc tatgatctgg 1200 cgagagccta caaaccacgc caaagactgt tatttctgcc tcgtcactgt tcaaggaatt 1260 tcaacaaaga acaagaaaat gatctgttat ccaaacgtac ggtcagttga ccaaccaact 1320 ccacattcaa atgatttgcc gataccagtt ccacccaatt tccagggtga agaaatggat 1380 ggcgtagaaa aaatatccga catattaaca gaatgttctt ccgttgcaac ttactcttcc 1440 aacgtcgacg ttcgctcgat gagtcaagca gatttgaatg atttagtgag ggatttgaac 1500 ttgtccaaga aacaatcgga actactggct tctcgattga aagaaaggaa ttttttggaa 1560 cgcgatgttc agattacatt ctatcgacaa aggcaaaagg agtttgaaaa ctttttttgt 1620 gaggaaaatg gacttctctt ttgtcaaaat gtcgacggag ttatggatac cttaggacat 1680 tcgcatgtcc caagtgaatg gcgacttttc atagatgcgt ctaagtcaag tctcaaagca 1740 gcattacttc ataatggaaa ccaatatcct accattccag tggcctattc cgttactatg 1800 aaggaaacat acagtaatct caaacaagtt ctgaacaaaa tacactatga tgaccactgt 1860 tggccaattt gcggtgattt aaaaattatt gcgttacttc tcggcatgca gttgggcttt 1920 acaaagcata gctgttttct atgcgaatgg gacagcagag atcgaaagaa gcactatgtg 1980 aagaaaatct ggccaaaacg acaagatttc acaccaggca ataagaatgt catacattca 2040 ccattagttg atcctctact cgtcttgctg cctccgttgc atatcaaact cggattgatg 2100 aaaaatttcg taaaagcgat ggatcaatct agttctggtt ttgattacat aaaaacaaaa 2160 ttccctgaga ttagcgaggt gaagaaaaag gaaggagttt tcatcggtcc acaaattgcc 2220 aagttgatga aggatcaaga atttgacaaa actctcagca aagctgaatt gcgagcatgg 2280 tatgcattca aaacagtttg tcgagatttt ttgggaaacc ggaaagcgga aaactatcga 2340 gaaaaaataa gcgaactttt gaaggcgtat cggtcgatgg gttgcaatat gtctctcaaa 2400 attcatttcc tccactcaca ccttgagttt tttcctgata atcttggaga cgtgagcgac 2460 gagcacggtg aaagatttca ccaggattta aaaatttttg agaaacgcta tcaaggacaa 2520 aacaatacaa acatgctggc tgactacttg tggacactac aagctggttc ttctgaaatt 2580 caccatcgaa aatcatatcg taaaacattt tagatgtatt cataatagcc gatacatgta 2640 tcatgtacaa aaatataaag agaaagctcc aaagtaaata aaattttatt tgttcttcat 2700 tcatacaacc gaataaaaaa acactaatcg atacttcaga gaaactgttt cccaatcttg 2760 aaaaaaggga ctgcaaggtg atcttttcag cgagctttcc acatggtcag tggttcaccg 2820 tcactcggat gggaaatttt tgctttagag ggtatacctc tcccaacttg tacgcaatca 2880 atttcgcatg caactgcttt gactgcgctg attccgtttt tcatgaacat ttttcatcgc 2940 tgccggaatt attcaaacta tgattatatg atcgtaagta agttatgaat ttattccaaa 3000 ggtctacatt gttttggcta tgtaaagatg ttctttaaag gaagtagaaa atatattgag 3060 ttagtattaa tttcgtctgt ttactttatc aaaacaaagc tttttacaaa aaaaaaaatt 3120 acgtgctagg gaaaatgcaa tggcagattc tgattcagca atctttcttc tactatagac 3180 acgcattttt gtcgctgaga caacgccaac atataacatt ttcgatttat gatgtcaaag 3240 aagagccatg ctatcgccgt tcaacttgca tgcaacatct tcaggctagc ttgtaaggga 3300 tttatttcga taaagttgac ttaatctaaa atgctcaaaa cttttggaaa aaaaatttat 3360 aattcacaaa tttttacata agtagataga ataaatgttg attaactgat tttgacattg 3420 attgccatga aatacaattt gcatgcttaa tggctgtcaa ttaaatagcc cccaaaatgg 3480 taatattgtt atataaatcg agctatatct caaaattgtg acgtgctaga gcagatctga 3540 ggtaagattc ggattcagcg agccaaaaac tttcagagac acataaaatt atccttgaga 3600 caaaccaaag gctatttttt gttccgctgt g 3631 // ID Gypsy-13_CQ-I repbase; DNA; INV; 6765 BP. XX AC AAWU01032992; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_CQ_; KW Gypsy-13_CQ-LTR; Gypsy-13_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6765 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 405-405 (2011). XX DR Genome; AAWU01032992; Positions 8267 1503. XX CC Positions [5352-5828] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1221..2990 FT /product="Gypsy-13_CQ-I_2p" FT /translation="MAEEEEEIFLTSSETSATESSGTESETESSDSTGESV FT KSSSPVMNIKKMNPKQIKRLTISPRFMRKLSEVYASKSQEQKKRCEKDDPA FT KKKSPIPDPPPSPPKPKPDSPKSDPPNPDPQNPESSKSKPSKDKKNKKKED FT ESGKKNKKKSKSKDDKGGAPDLDLDKPPNLDDVKPDDSEKLLPPGFNIKSA FT SFENVNFSNNPLFYSMNTSGRANKKTLPISQWPIKYSGTDNGIGLNLFLRR FT VEFFAHSERVTKSELFESAHLLLVGPAQDWFVSKWPTFRNEDWDFFIHALR FT HQYLPNNIDHYIKVRSFSMSQNKNETFSNFLVRMEQFFLCRTTPLAEEDKF FT DIIWHTMKWHYRDRLALIKRKGMSIIELEDLCGRIDNCNEGLMNRLVQSFD FT NHNIHEIITEGPSTSYKLQEPHERRRNQQNQSQLVNQSREDRDNHAGQLTT FT RNNQTNQQNSNRHNQANQNTTRSSQSNQNHSNTGQSSSYNQNNYRNNQQQQ FT QNYQNTNRNQNTQERGRTQEEFYPENGWRSLNRDQILHYYKQPDGRICLNC FT RKFGHHFSTCYSRRNVFCCICGLPEFHYEECPFCEEKNRRREN" FT CDS 3047..4915 FT /product="Gypsy-13_CQ-I_1p" FT /translation="MIDLFQQIADEVDVGGECASDSEHSDREYERTFESVN FT EVCVDFSGDQRYFLKFNVLGLQLHGLLDSGSNVSIVGENFGNLTSFLEIQP FT LENEISISTASGDAMEVLGFVDVPFNVDRVTKILPCLVVPELDNRCILGMD FT FFKLFNITLSFSGCDRNNVCCIEQSTYFVNNTESSQNLNPDSHVLTAEEAE FT QLKVISDSFKVSKPGTLETCNVMEHTIELSDSTPIHLNPHPFSPAIQKKVY FT TEIDRLLKMEIIEPSRSDWALHVVPVQKESGEMRLCLDARKLNAKTIRDAY FT PLASTMRIISNLGKNKYFSVIDLKESFLQVMLSEESRRLCSFKIIGRGLYQ FT YKRLPFGLINSSATLSRILDKVLKEGMFEPYIFSYLDDIIVATETFEDHVK FT YLKIVSECLREANLSVNIDKCKFCLRKIKYLGFILSEEGYQPNPERVAAIS FT KFVRPQTPKEIRRFLGMAGYYRNFIPNFGGIAAPISDLLKGKPKKVQWNDQ FT AEHAFIKLKECLMSEPVLANPDWSKEFKIQTDASDLAIAGILTQEVEGKEH FT IIAYFSRKLRSCEKKYAATEKEGLAALESIDHFRTYVEGSHFELITDCSAV FT TYIRDSKWRPGSRLSRWSVQLQAMR" XX SQ Sequence 6765 BP; 2245 A; 1028 C; 1427 G; 2065 T; 0 other; ggaaatggta ctcctgctat tcaatagtac tccatcttgt ctcatgctcg cactttccct 60 ggaaatgggg aggtgacaaa atggcgccca acgttaataa aagtacagca gaatcaatta 120 ccagtgtctt ctgaacaatc agccattttg aaaaggattt ttcagcgttg ctttttgtga 180 aataaaagaa gattttgtac atttagtttg taatcttctt attcttcgct gttgtaaata 240 attttcccat tttttggact tttgtaaata atttccctag ctttagaaac ttgtaaataa 300 ttcactaatt ttagaatttt gtaaataata tactaatatt tagttataat tgtaaataat 360 ttgcaaatgg aattgtaaat aaccctgaac gtggaaaatt ttgtaaattt tgtacataat 420 tttgaactaa tcatatattc tctcctgatt ttgattttct aggattatgt gatcaattca 480 agagatcgcc aacgatggaa tagtgtacac acacaagaat tgatgaaaag tgataaattt 540 gaaacataaa actgtaaata gatcgttttg gttgggatta atttgatttt tctcccagat 600 gtagttgagg ttgaagtgat catgagaatt ctttgtaaat atttcaatac gagaagtgat 660 ggggggaaat gttgtatatt ttgtacattt gtatatatga ttttttttgg ctgtaaataa 720 atgacgaatt tgaacttgca tgaattttaa ctactttaag aagaaaacat aaaaaagaag 780 aagttttgca ggatgaaacc ggcctcgctg gagacttgag tccaagacga tgacggtcag 840 taataggatc ttgattggat ctcgaattcg gtcttgataa cctggaaaca gcgctgagtc 900 actgacgatg acgccctatg accccgagga gtaacgattt atcggaggtt gattcttgtc 960 tgactgattc ttaccaaaaa acattagaac ttttagaatt ttgcatgaat atttattatt 1020 ttaattctag taaatggtgg atttggaaat gaagcaggat gaaggccgat tctcaaagta 1080 tcgttcatgt aaacagtttt tgctagaaaa gtataatttg ttgtgtgagt gatttggaaa 1140 tctattcaat catttttatc ttatttattt atttattttt ttaaattgtg cttgtgaaag 1200 attaagtgaa aaaggacact atggctgaag aagaagagga aattttcttg acctcatctg 1260 agaccagtgc tacggaatca tcgggtactg agagtgaaac tgaaagtagt gatagtacgg 1320 gtgaaagcgt taaaagttct agtccggtta tgaatattaa gaaaatgaac cccaagcaga 1380 tcaaaaggtt gacaatttcg ccgagattta tgaggaagtt atctgaagtt tatgcaagca 1440 aatcacaaga gcagaaaaaa aggtgtgaaa aagatgatcc tgcaaagaag aagagtccca 1500 tccctgatcc tcccccgtcg ccaccaaaac ctaaacctga ttccccaaaa tctgatcctc 1560 caaatcctga tcctcaaaat cccgaatcat caaaatcaaa accatccaaa gataagaaga 1620 acaagaagaa ggaagatgag tcagggaaga agaataagaa gaaatctaaa tctaaagatg 1680 ataagggagg tgcacctgat ttagaccttg ataagccacc taatttagat gatgttaagc 1740 cggatgactc tgaaaaactt ttgccgcctg gatttaacat aaaatcagct tcttttgaga 1800 atgttaattt ttcgaataat ccgttgtttt acagcatgaa cacttctggt agggcgaata 1860 agaagacttt accaatatcg cagtggccca tcaaatacag tggcacagac aatggtatag 1920 ggttgaattt gtttttgcgc agagtagaat tttttgctca ttctgagcgg gtgactaaat 1980 cagagctgtt tgaatcagca catttgttac tggttgggcc agctcaggat tggtttgttt 2040 cgaagtggcc gacttttagg aatgaagatt gggatttttt cattcatgct ttgagacatc 2100 agtacctgcc gaacaacatt gatcattaca tcaaagtgag atcgttttct atgtctcaaa 2160 acaaaaatga aactttctca aactttttgg tcaggatgga gcagttcttt ttatgccgaa 2220 caacaccact agctgaggaa gacaaatttg atattatttg gcacacaatg aagtggcact 2280 acagggacag gttggcgctt atcaaaagga agggaatgag catcattgag ctagaagatc 2340 tatgtggcag aattgataat tgcaatgaag ggttgatgaa tcgtctggta cagtcgtttg 2400 ataatcataa tattcacgaa atcattacgg aaggaccatc tacgagttac aaactgcaag 2460 aacctcatga gcgtaggaga aatcaacaaa atcagagcca gttagttaat caaagtcgtg 2520 aggataggga caatcatgca ggtcaattaa caacaaggaa taatcaaaca aatcaacaaa 2580 atagcaaccg acacaatcaa gcaaaccaaa atacaacaag atctagtcag tctaaccaaa 2640 atcattcaaa tactggtcag tctagttcat acaatcaaaa caactataga aacaaccagc 2700 aacaacaaca aaattaccag aatacaaaca gaaatcaaaa tactcaagaa agaggaagaa 2760 cgcaagaaga gttctatcca gagaatggtt ggagatcgtt aaatagggat caaattttgc 2820 attattacaa acaacctgat ggtcgtattt gtttaaattg tcgtaagttt ggacatcatt 2880 ttagtacctg ttattctcgt cgcaatgttt tttgttgtat ttgtggtttg ccagaatttc 2940 attatgagga gtgtccgttc tgtgaagaaa aaaacaggcg aagggaaaat tgaaggaggt 3000 ggttttcccc tcccccgcaa atcctccagg tgggagaact gaaccgatga tagatttatt 3060 tcagcaaata gcagatgaag ttgatgtggg tggagagtgt gcatctgatt cagagcattc 3120 tgatagggaa tatgaaagga catttgaaag tgtgaatgaa gtttgtgttg attttagtgg 3180 cgatcagaga tattttctaa aattcaatgt tttagggttg caactgcatg gattgttaga 3240 cagtggtagt aatgtgtcaa ttgtaggaga aaattttgga aatcttacta gttttttgga 3300 gattcaaccg ttagagaatg agatttcaat ttcgacagca agtggtgatg cgatggaagt 3360 tttaggtttt gttgatgtac cttttaatgt ggatagagtt actaaaattt tgccttgttt 3420 agttgtaccg gagttagaca acaggtgcat acttgggatg gattttttta aactgtttaa 3480 tattactttg tcattttctg gttgtgatcg taataatgtg tgttgtattg agcagtcaac 3540 ttattttgtt aacaatactg aatctagtca aaatctaaat cccgattctc atgtcctcac 3600 tgctgaggaa gcagagcaat tgaaagtaat ctctgattca tttaaggtgt caaaaccagg 3660 gactttagaa acgtgtaatg ttatggaaca cacaattgaa cttagtgata gcactccaat 3720 tcatcttaat ccacacccat tttcacctgc gattcaaaag aaagtttaca cagaaattga 3780 tcgtttactt aagatggaaa ttatagagcc ttcaagatct gactgggctt tgcatgttgt 3840 tccagttcag aaagaaagcg gtgaaatgag actgtgtctt gatgctagaa agttaaatgc 3900 gaaaacaatt agggatgctt atccattggc aagtacaatg agaattatca gtaacttagg 3960 caaaaataaa tacttttctg tgatagattt aaaggagtcg tttctgcaag tgatgttgtc 4020 tgaggagtct aggagattat gtagtttcaa aataatagga agaggtttat accaatacaa 4080 acgattacca tttggactaa taaatagtag tgcaactttg tccaggattt tagataaagt 4140 tttgaaagag ggaatgttcg agccatatat tttttcctat ctcgatgaca tcattgttgc 4200 gacagagact tttgaagatc atgtgaagta tttaaaaatt gtgtccgagt gtttacgtga 4260 agcaaatctg tctgtgaata ttgataaatg taaattctgt ctgcgtaaaa tcaaatattt 4320 aggtttcatt ctttctgaag agggctatca acccaaccct gagagagtgg cagcgatctc 4380 caagtttgtt agaccacaaa ctccaaagga gattcgcaga tttttgggga tggcaggata 4440 ttatcgtaat tttattccaa attttggcgg gattgcagcc ccaatctcag atttgctcaa 4500 aggtaaaccc aagaaggttc agtggaacga tcaggctgaa catgctttta ttaaactaaa 4560 agaatgtctc atgtctgaac ctgttctcgc taatcccgat tggtcgaaag aatttaaaat 4620 tcaaactgat gcaagtgatc ttgccattgc aggtatttta acgcaagagg ttgaggggaa 4680 ggaacatatt attgcctatt tttccagaaa gttgaggtcg tgtgagaaaa agtatgctgc 4740 gacagaaaag gaaggattag ctgcacttga atcaattgat cattttagaa catatgttga 4800 gggttcacat tttgaattaa ttacagactg ttcagcggta acgtacattc gagattctaa 4860 gtggaggcca gggtcacggt tgtctcgctg gtcagtgcaa ctgcaagcca tgcgatgacc 4920 attaagcata ggaaagggaa agagaatatt gtctgtgata ctttgagccg agcagtatgt 4980 gctatcttca ctgatcacgt cagctggttc acaaattgaa acagaaaagt aactaacaac 5040 ccccgaaact acccaatttc aaaattgagg atggagattt acacaaattc atcgcagcaa 5100 gatcagactt tgaaggtagc cgggttgagt ggaagttagt tgttccacca gacaaagtca 5160 cacagttagt cgtagaacaa catgaacaac taatgcacct aggaactgat aaaacgttag 5220 aaagaatcaa attaaaatat tactggccaa aaatgaaatc agatgtgaaa cgaattttgt 5280 ctaaatgtgc taaatgtaaa caagcaaaat atccaactgt tgcaactgta ccgccgatgg 5340 gggagcataa aaatgctact cgtccttttc aaatgatcgc gttagattac ataagtggtt 5400 ttgttcgtag taaacatggt aacatggatt tgttagtttg tttagatgtt tttacaaagt 5460 ttgtcagatt atttcctgtg agaaagataa gtgtggaggg tttaactgaa attatagaaa 5520 aagagtggtt tttaaaatac ggttctccgc aagttgtgat atcggataat gctgtgacgt 5580 ttctaggcaa caaatttcaa gatttattac aaaaatatca tgttcatcat tttaaaaata 5640 gtcgaagaca ttgtcaaaac aaccctgtag aacgcgtcaa cagagttata ttagcttgta 5700 ttagaacgta ctgccaggaa gatcaccgac tgtgggatag tcgcttatca gaaatagaat 5760 ttgcaattaa caatacaaaa cattattcaa ccggatttac acctttcttc ttagcgcatg 5820 ggtttgagtc ggttctagat gggtgagatc atcatcaagg tggtcaaact tctgacccat 5880 cacctgaccg atttgtccag cagagaaggg cagtgcttgg tccgttgtat gagaaagtaa 5940 ttgaaaataa cagaaaacaa tttaataaat acaaaaagag ttacgatagc aaacataaaa 6000 cagtgccgca ggaatttaat gttggtcaaa aagtgtataa gaagaacttc aaaatttcaa 6060 gtgccccaga taattatgca gctaaattgg gcccagttta tgttccatgt aaagtaattg 6120 ggagaaaagg cgcttcatct tatgagctag ccgatgaaga aggaagaaat attggtattt 6180 ttgctgctca ggatttaata cccgcataga ttcgtactaa ctgattaggt ttcaatttta 6240 gtgtgttgtg attatttatt taagtgaaat tagtatgata tagtatgttt gatagtatac 6300 atgagggttg gccacccaat taatgagtga gattgtttga tagtgagagt gaatgaataa 6360 aaatgattga agggatttag tgttgatttt aaagtgtgtg aaagtttaat ttattatttt 6420 aaagtgttgc atgaacgact ttgcatgaat gggatgaatt caattttacc taggaatttt 6480 tgaaagagtg tgtgtaccaa gaatgtattt atcgaaacct aattgaatgt actgaatcca 6540 gaatccaatt tatgatcaat gaacaaatgg acttacaaga agaaaagaac tgtatcataa 6600 ttagacactg ggattgtttt aactgtgagt ttattaaatt ggattccgct tgtctgttgt 6660 gggagagtgg gttacgagga gtgtctagcg accacagcaa tactggtctt agatatcttc 6720 ctttggttga tactaatccc atatttctgt ctggtgtggg ggtat 6765 // ID CRMAR repbase; DNA; INV; 1320 BP. XX AC AY034623; XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 10-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Ceratitis rosa mariner-type transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; CRMAR. XX OS Ceratitis rosa OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Tephritoidea; Tephritidae; Ceratitis; Pterandrus. XX RN [1] RP 1-1320 RA Gomulski L.M., Torti C., Bonizzoni M., Moralli D., Raimondi E., RA Capy P., Gasperi G. and Malacrida A.R.; RT "A new basal subfamily of mariner elements in Ceratitis rosa and RT other tephritid flies."; RL J Mol Evol 53(6), 597-606 (2001). XX DR EMBL/GenBank/DDBJ; AY034623; Positions 1 1320. XX FH Key Location/Qualifiers FT CDS 114..1196 FT /product="CRMAR_1p" FT /translation="MERYTIQQRVKVIQTYYENGRSNQNAYRALRDFFGQF FT DRPNVRTIAKIVEKFEQIGSVEDVRTPVHARTARTAENIAAVRDSVAEEPS FT TSTRRRAQQLHLSRSSMMNIMHKDLHLHAYKVQLAQELKPLDHSKRREWAE FT WFQEMATVDDQFSKKIIFSDEAHLHLSGFVNKQNCRIWANENPRVIVEKPV FT HPQRVTVWCGLWAGGIIGPYFFQNEAGQAVTVNGVRYREMITNFLWPQLED FT MDVDDMWFQQDGATCHTANETMALLRNKFNGRVISRNGDVNWPPRSCDLTP FT LDFFLWGYLKEKVYVDKPATTQELKDEIIRHINGIETPLCLSVIENLDHRM FT EVCRRGRGAHLADILLHT" XX SQ Sequence 1320 BP; 395 A; 268 C; 290 G; 367 T; 0 other; aagggtggtt aaattgtaag ggccaatgtt gaatgtgaac cacacctaaa cgccaagttt 60 ttttccggaa ttaatttgac atttctctat ttcagactta ctcaatttga accatggaga 120 gatacacaat ccaacaacgt gttaaagtta tccagactta ttatgaaaat gggcgttcaa 180 atcaaaatgc atatcgtgca cttcgtgatt tttttggtca atttgatcgt ccaaatgtgc 240 gtacaatcgc aaaaatcgtg gaaaaattcg agcaaatcgg gtctgtagaa gatgtgagaa 300 caccagtaca tgctcgtaca gctcgtactg cagaaaatat tgctgctgtt cgcgatagtg 360 tggctgaaga gccgtccacc tcaactcgtc gtcgtgccca acaattgcac ctctcacgct 420 cgtcgatgat gaacattatg cataaagact tgcatttaca cgcttacaag gtgcaattgg 480 ctcaagaact aaagcctctt gaccattcca aacgtcgtga atgggcagaa tggttccaag 540 aaatggcaac agtggatgat caattttcga agaaaatcat cttcagtgat gaggcacatt 600 tgcacctcag tggattcgtc aataaacaga attgccgcat ttgggcgaat gagaatccaa 660 gagtgattgt cgaaaaacca gtgcacccac aaagagtgac tgtttggtgc ggcttatggg 720 ctggcggcat catcgggccg tattttttcc aaaatgaggc cggtcaggca gttactgtga 780 atggtgttcg ctatcgtgag atgataacga attttttatg gccacaattg gaagatatgg 840 atgtggacga tatgtggttt cagcaggacg gtgccacttg ccacacagct aacgaaacaa 900 tggctctttt gcgcaacaaa ttcaatggcc gtgttatctc acgtaatggc gatgtcaatt 960 ggcccccaag atcatgtgat ttgacaccgt tggacttttt tctctggggt tatttgaaag 1020 aaaaggtgta cgtcgataag ccagcaacaa ctcaagagct aaaggatgag ataattcggc 1080 acattaacgg catagaaact ccattatgcc tcagcgtcat cgaaaatttg gaccatcgta 1140 tggaggtgtg ccgccgaggt cgcggcgctc atttggcaga tattttgtta catacataat 1200 tgagtaatac caatatatca taataaaata aaatttcaat aatttcctaa atagtttgtg 1260 ttttattcaa aatcaacatc ggcccttgaa atttcggccc ttgaaaattt aaccaccctt 1320 // ID Gypsy-78_AA-I repbase; DNA; INV; 6590 BP. XX AC supercont1.245; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-78_AA_; KW Gypsy-78_AA-LTR; Gypsy-78_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6590 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.245; Positions 847815 841226. XX CC Positions [4641-5117] - Integrase core CC 'TAAC' target site duplication CC LTRs are 98% similar to each other. Includes an insertion of CC a non-autonomous DNA transposon at positions 2735-2934 CC (masked). XX FH Key Location/Qualifiers FT CDS 523..1854 FT /product="Gypsy-78_AA-I_1p" FT /translation="MIAHLRTKKKNIIMSIKQKSNRITVNPHYRRDTMANN FT GKYIRPIDPTIVEDDDTSSGDESANQPDDEHSSSPNSSSSSSDSSSSSDSD FT TDNEENQEELTCVSETDAPVALQIPSPVALSASETGSIENPPRDAVVQAIH FT EERIERMEKMLTSINETLSHFQPSSRPAVNETPDDSWNVSNQANTCANTGG FT SSSFNIRWDHLKPFPSGVAANKMWEEWNRYIENFELAVSLSNLYDPVKRTQ FT LLFLSIGSELQEVIKAVKLRPSLKTPDCYTTFVSNIKSYFRSMTDTAAEHE FT AFARMKQENGEPAVAFHARLMGKVRSCNYNVDDEDRFVRAQLLSGLRNREL FT VKQARTYGHDTNFIVQSAARSEAFEAETYQREGTGVFEVKRLHRVSPHERD FT NHKRSNAGNRAEGPPFKQHRNGVLNQRPQERRQRCTRCFLFSHRNGQCPA" FT CDS 2937..5600 FT /product="Gypsy-78_AA-I_2p" FT /translation="MFFTFFFSLDCNDYREAARNRLKDMESQGIIEKVTWA FT PNWISGMSAVAKGNNDFRLVVNMRAPNRAINREYYRLPLLDEMRVKLHGAR FT YFSKLDLSNAFYHLELSEESRDLTTFLAEDGMYRFTRLMFGVNCAPEIFQR FT EMVRILKDVDNIIIFIDDILIFAETVESLRATVSRVLQILRENNLTLNQAK FT CEFDQTKIKFVGHILDADGFHIDDEKIKSVRRFREPTTLSELRSFLGLASF FT LSPYLENFAKITSSLWTSTASKTWSWGPEQSESFEALKQQIIHCTISLGFF FT SEEDDTILYTDASPIALGAVLVQQGSNQPPRIISFASKALTPTEKKYAQNQ FT REALGAVWAVEHFSYFLLGRHFTLRTDAQGMTFILNRSREDSKRALTRADG FT WALRLSPYNYSVEYVRGIDNIADSSSRLYDGEDAPFDEETSPWEIASLETN FT SVEFLTEAEIRDATDQDDILLQIISALETGMWHKDLRKYQAIENDLAIQNG FT ILLKTGCAVVPKTLRTRALEVAHEGHPTIAKMKSIMRQRVWWPGMCSEITK FT WVSSCQTCCVNGKPERPPPMERVFAPKVAWESIAIDFNGPYTKFGGILILV FT IVDYRSRYIIARPVRSTKFECTQKVLDDVFEKEGYPKTIRSDNGPPFNGTD FT FAEYCKKRDISMTFSTPLFPQQNGLAESSMKLINRAMAAATANNTNYVEEL FT KKAINAHNAAAHSVTNVPPEEVMYGRKIKRGLPLVQHGKSNYDENLFERND FT REEKLASKFREDCRRGARQCRVKPGDEVVVERHNRSKGDSRFSPTKYTVIQ FT ERNGSLILNDREGKVTKRHVTQTKKVGQWRESHHLTTTPTSEKQSNIEKPT FT LEPFQRPSRERRTPAFLQDYVQVVQSDLNMNPERE" XX SQ Sequence 6590 BP; 2035 A; 1437 C; 1413 G; 1505 T; 200 other; atggcgatcc atagccaggc gttgctgctt ccttgcagta acgtaaactc agaacctaat 60 cacacttgat tacctatatc agcgttgaaa aggaattaac tagttggtaa aagctccttc 120 gctagaaagc aaatcgaagg ttaacaattt gtcattaaaa catggcggat tgtgaaaacg 180 tgcttggtaa aggcgaccta taatcagcgg atcacttcga tcgcgcattt taaagcggac 240 cataagcgtc gcgcacttgc aatcaaaaat aaatgaataa gcggacaaac agtcgcgcac 300 tttataatgc ataggatgac cgcgctccag agaagtggac aacgcatttg tcacgcataa 360 gagaagcggg tcagtttaat caataagcgg acgaaacgtc gtgcattaaa agagaagcgg 420 actaggtcgc gcattgtaga gcggactagg tcgcgcattg tagagcggac taggtcacgc 480 actgaaaagc ggactaagtc gcgcattata gagcggacca gaatgatcgc gcatcttagg 540 accaaaaaaa aaaacataat aatgagtata aagcaaaaat ctaacagaat tacggttaat 600 cctcattaca ggcgtgacac aatggcgaac aatggcaaat atatccgacc tatcgacccg 660 accatcgtcg aagatgatga cacgtcgtcc ggtgatgaat cagcaaacca gcccgacgat 720 gaacactctt caagcccgaa ctcttcaagc agcagtagtg acagcagtag cagctcggat 780 agcgacaccg acaatgagga gaatcaggag gaattgacat gtgtctcgga aacggatgct 840 ccagtagccc tgcaaattcc ttcgccagtc gctctttcgg cgtcagaaac aggttccatc 900 gaaaatcccc cccgtgatgc agtagtacaa gcaatacacg aggagcgcat cgagcgaatg 960 gaaaaaatgc tcacaagcat caacgaaaca ttgagtcatt ttcaacccag ctctcgaccg 1020 gcagtgaacg aaacccctga tgacagctgg aatgtatcca atcaagcaaa cacatgtgcc 1080 aataccggcg gaagctcctc cttcaacatt cgttgggacc atttaaaacc attcccgagc 1140 ggagtagccg ctaacaaaat gtgggaggag tggaaccgct acatcgagaa ctttgagtta 1200 gcggtttcac tcagcaacct atacgaccct gtcaaaagga cgcaactgtt atttctgtcc 1260 attggcagtg agcttcaaga agttatcaaa gcggtcaagc tgcgcccaag tttgaaaacc 1320 ccagattgtt acaccacttt tgtctcgaac ataaaaagct acttccgttc gatgacagat 1380 accgccgccg agcacgaggc ttttgcacga atgaagcagg agaacggaga accagccgtc 1440 gcttttcatg cacgattgat ggggaaagtg cgctcatgca actacaatgt tgacgacgaa 1500 gaccggtttg tgcgagcaca actactaagt ggactaagga atcgagaact ggttaaacaa 1560 gctcgaacct acggacatga tacaaacttc attgtccagt cggcagcacg gagcgaagcg 1620 tttgaggctg agacatacca gcgcgaagga actggtgtct tcgaagttaa aaggctacac 1680 cgagtttcac cacacgagcg agacaatcat aagcgatcaa atgcaggaaa ccgagcagaa 1740 ggaccaccat tcaagcagca tcgtaacggt gtgttgaacc aacgaccgca agaacgacgc 1800 cagcgttgta cgcgttgttt tctgttcagc caccggaacg gccaatgccc tgcgtagaat 1860 cgtaattgca acagatgtgg caaacgtgga catttcgtgg cagcttgccg acagaggcaa 1920 ataaatcaca tgcagtacga acgaagcgat acgtcaatgg acaaattgat gccccctggt 1980 gaggagaagt acgctgacga caaacaggta ctagaccatt aatgtatacc ttaaatgatt 2040 gaatttcatt tgttatttca aataaagcta atacctgcgt aatatgctag ttttctcata 2100 tcaatcgaaa tatccattct ccactactat aggaaatcaa cgcactctct ttgcaagacg 2160 tattgattga ttgttctgtt ggatcgtcca gtgctatacg tttcttgata gactctggtg 2220 ctgatgctaa cgtcattggc gggaaagatt gggagcggtt agaacgagag gccagactag 2280 gcgaagcaaa atttgaaata ctgagtggca gttctagcaa tagactgcac gcatatggag 2340 cgaaagatcc catgaccata gagtgtgttt tcaaagcaga aatcacgaaa gcaagttcag 2400 gtcccttgca aatagcaaca ccggccgtgt tccacgttgt acttaaggga acaaggtccc 2460 tacttggaag gtcaacagca agcgacatgg gactgctaca gattaacaac accatcaacc 2520 aatgcgagaa gaacgagatt ttccccaaga tgcccggagt taaagtacgt tttagtgtca 2580 ataaggacat tccgtcatca aaaagcgctt actataacgt gccagctgca tacaggtgag 2640 cataaacatt ttcaaaataa atgggtattt aaaaaaaaaa tactccggta ccaccaaaat 2700 taggagccac tgttttctca caataacgaa ccaaxxxxxx xxxxxxxxxx xxxxxxxxxx 2760 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2820 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2880 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxctatgt 2940 tttttacttt tttcttttct ctcgattgta atgattacag agaggcggca cgcaacagat 3000 tgaaagacat ggaatcacaa ggcataattg agaaggtcac atgggccccg aattggatca 3060 gcggaatgtc cgcggtggcc aaaggcaaca atgatttcag acttgtagtt aatatgcgcg 3120 cccctaaccg cgctattaac cgagaatatt acagacttcc tttgctcgac gaaatgagag 3180 taaaactcca cggtgcaagg tatttttcaa agctcgatct gagcaatgcc ttctaccatc 3240 tcgagctgtc tgaggaatcc cgagacttaa caacgttttt agccgaagat ggcatgtata 3300 ggtttacccg tctcatgttt ggcgtaaatt gtgcgcccga aatttttcag cgcgaaatgg 3360 ttcgcatcct gaaagatgtc gataacatta taatattcat cgacgatatc cttattttcg 3420 ctgagacggt ggagtctctt cgagcaacag tctctagggt gctacagatt ctgagagaaa 3480 ataacttaac tttgaatcaa gcaaagtgtg agttcgacca gaccaaaatc aagtttgttg 3540 gtcatatact cgatgcggat ggtttccaca tagacgatga gaagattaaa agcgttcgtc 3600 gctttagaga accgacaact ctgtcggaac ttcgaagctt cttaggctta gcatcgttcc 3660 taagtcccta tctcgaaaat tttgcgaaaa ttacaagttc actatggacc tcaaccgcgt 3720 ccaaaacatg gtcttgggga ccggaacaaa gcgaatcatt cgaagcactc aaacaacaaa 3780 tcatccattg tacaatttca ttgggatttt tctctgagga agacgatact atcctctaca 3840 ccgacgcctc gcccattgct ctgggtgctg ttttggtaca acagggttcg aaccaaccac 3900 cgaggattat aagcttcgcc tccaaagcac tcactcccac cgaaaagaag tatgcacaaa 3960 atcagcgtga agccttaggt gcagtatggg ccgtggaaca cttttcctac ttcctgctcg 4020 gtaggcactt cacgctccgt actgacgcac agggaatgac attcatcctc aaccggtctc 4080 gagaagattc caaaagggca ttgacacgtg ctgatggatg ggctctgagg ttaagcccgt 4140 acaactatag tgttgagtac gttcgtggca tagataatat cgcggactca tcatctcgat 4200 tgtacgatgg ggaagacgca cccttcgatg aggaaactag tccgtgggaa atagcgtcac 4260 ttgaaacgaa ttctgtggag ttcctaacgg aagcagaaat cagagacgca actgaccaag 4320 acgacatcct gctacaaatc atttcggccc ttgaaacagg catgtggcat aaggatctac 4380 gcaaatacca agcgatagaa aatgatctgg cgattcaaaa cggaattctc ctcaagactg 4440 gctgcgctgt cgtacccaag acacttcgaa cacgggcatt agaagttgct cacgagggac 4500 acccaacgat cgcaaaaatg aagagcataa tgcggcaacg ggtgtggtgg cctggtatgt 4560 gcagcgaaat caccaaatgg gtgagctctt gccaaacatg ttgtgtcaat ggtaaaccag 4620 aaagaccacc cccaatggag cgcgtatttg cgccaaaagt agcttgggaa tccatcgcaa 4680 tagacttcaa cggaccctac accaaattcg gcggcatctt gattctagtc atcgtcgact 4740 acagatcaag gtacatcatc gcacggcctg tgagatctac aaaatttgaa tgtacgcaga 4800 aagttctgga cgatgtgttt gaaaaagaag ggtacccaaa aacaataagg agcgataatg 4860 ggcctccatt taatggcaca gattttgccg agtactgtaa aaagcgagat atctcaatga 4920 cattttcaac acctcttttt ccgcagcaaa acgggcttgc cgagagctca atgaaactaa 4980 taaacagagc tatggctgct gcaactgcaa acaatactaa ttatgttgag gaattgaaga 5040 aagcaatcaa tgctcataac gcggctgcac atagtgttac taatgtacct cccgaagaag 5100 ttatgtatgg tcgcaaaatc aaacgtggat tgcctttagt acagcatggg aaatcaaact 5160 acgatgagaa ccttttcgaa cgtaacgatc gcgaggaaaa attggccagc aaattccgcg 5220 aagattgtag acgaggtgct cgtcaatgta gagtaaaacc cggtgacgaa gttgtggtcg 5280 aacgccataa tcgttccaaa ggagactccc gcttttctcc aacaaaatac actgtaatcc 5340 aagaacgaaa tggaagcctt attctaaacg acagggaagg aaaagttacc aaacgacacg 5400 tgacacaaac aaaaaaggtt ggccagtggc gagaatcgca tcatttgacc actaccccaa 5460 cttcagagaa gcaatcaaac atagagaaac caacactgga accgttccag aggccgtctc 5520 gagaaagaag aactccagca tttctccaag actacgtaca agtagtacag agtgatttaa 5580 acatgaaccc cgaaagggag taagtcgaat caattggtaa tgtaatcttt catactgttg 5640 cttagaagct aaaataaaat aagggtataa aaacatatta ctgaccttga ccttttcttc 5700 aatcactcca ccaactgtga aaacgtgaca atttcttttt atgggccttt cccatctttc 5760 tttttattca cagtcgagat cattgaaatc tagagattat tcttaaatcc cacagtaatt 5820 acggccgcaa cttctcccag ttgaagtgtc caattggcaa gagactgtga gcggaagtct 5880 cgctaggcgt ccttgacttc tgtgcttaac ttatcgtcga cgttatcaca ggatagttgt 5940 tcgctccaaa cactttttta ggaatcccac tggagtgagg gccgccagaa ccgcagctga 6000 agtgttcaat tgtgaattgg atcctggacc tgttgccgac gtaaaccaac atcgaatgac 6060 tgttacctcc cccacattcc tctgcaaaaa aaaaaaaata gctcttggat actgtttggc 6120 aatgcaaaaa gcatacatct atggtaatta gatacttacg atacaatatt acagccgaca 6180 atcgacctat aagcttttta ttaatctaat aaactcgatc ctcaaaacag cgaacgaaca 6240 acaaaacaaa tcaacagcaa taaaacaaat tagtttgaca gcaccaaatt tgataaaggg 6300 cgagattccg aaaactcaac cgcaggggtg aacagtgggc gcattctgtt ttttttgagc 6360 ataaatacac tgaaatattt tcgaaaatga aacgaaaaaa ttacccaaca catttattca 6420 aattcgcatt cattgtgtaa gaaacagtgt accgcgtcaa ttttattact ctgagagatc 6480 caatcaagcg aaaaattacg atttcaattt gaattactgt aggatttaca aaaaataatc 6540 tgtttatttt agagaagggg gtggatgtgt gggtgcgaaa gggttagtta 6590 // ID hAT-33_HM repbase; DNA; INV; 3017 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-33_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3017 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2022-2022 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 482..2596 FT /product="hAT-33_HM_1p" FT /translation="MDQTIFSNDWLNHPEYKEWLVKTDNNKVARCSSCCKT FT FNLSNMGVQAVKSHNEGKKHQLNVLAKSKSHFFKPVVKHSTITPTLPLAIG FT QKTLELVVTSNEETRSEIRWVLATVMNGLSNNASANLSKLFTSIFPDSKVA FT KGLTLGRTKIGYMINYGLAPYFKSLLLESIKNSPCYVVSFDESLNKVTQNC FT QMDIVVRFWNTLENQVNVRYLNSVFLGHSSAAHILKNFNKEIESLDFSKMI FT QVSMDGPNTNWKFFKELSIYRNECELNSLLDIGCCSLHVVHGAFKLGSEST FT SWNLKSILKGAYQILHDTPARRDDYNSVTGSNRYPLSFCSTRWIEDQIVAD FT RLIEIWDNIVKLIRFWEKLSKSKQPSSKSFFNVQKASNDFLTVSKLQFFSY FT VASLFKTYLTLYQTDQPMIPYMYADLRELITNILALFIKPSVIQSCKKGSD FT LKKIDLYKKENLLPRKDVIIGFAAEVSIKELLVKDNITLNEINQFRNECVL FT FLTKTMESLLKRIPLDSAIVANASCFEPINLKTLTIESNQNKLKRLLHYLV FT SLKIISATQSDNVMLQYSIFLRNDMKINIEKFLSFDKKTTRLDIFYFHQLD FT IAKHKELSYIIKMILTVSHGQAFVERGFSVNETILNENIQSNSIVSHRIIK FT DHMNSQNIQADTINITNALMLSVKSARQQYQISLAEQANLKKERESKPPKS FT NHIR*" XX SQ Sequence 3017 BP; 1155 A; 395 C; 441 G; 1023 T; 3 other; cagggttgcc actcaacctg gaaaacctgg aaatagcatg aatcccaggg aaaacctgga 60 aaactcaggg atttttttta tttatttaat attcagggaa aactcaggga aaaatttttt 120 ccaattttga aaataatttt ttattaaatt attaacaata atttaagtta tgataaattc 180 gtaatatata ttatcataat tttttatgag attttacatt tgatgttaat tttgtatggt 240 ttgcttattt tcaatattaa atttgaattt atattttata gttaagttat ttaattattt 300 taagtagaat tttcattatg ggcttaatta ggaaatacta aaatactgag taaagtttga 360 tttaataagt tatataaaaa tgttttatat ataaatttat aatctttcat cattaaatct 420 ataaatatat gtaatatggt tgtaagaaga atatattttt ttagagttat taataattaa 480 aatggatcaa acaatttttt caaatgattg gcttaatcac ccagagtata aagaatggct 540 tgttaagact gacaacaata aagttgctag atgtagcagc tgttgtaaaa cttttaattt 600 gtcaaacatg ggagttcagg cagttaagag tcataatgaa ggtaaaaagc accaacttaa 660 tgttttagct aaatccaaaa gtcacttttt caaacctgtt gtaaaacatt ctactattac 720 accaacacta cccctggcaa ttggtcaaaa aactttagaa cttgttgtta cttccaatga 780 agaaacaaga tctgaaataa gatgggttct tgcaactgtt atgaatggat tgtcaaacaa 840 tgcatctgct aatttgagca aattatttac tagtattttt ccagatagca aagtggcaaa 900 aggattaaca ttaggaagaa caaaaatagg atatatgatt aactatggat tagcaccata 960 ttttaaatca ttacttcttg aaagtatcaa aaattcgcca tgttatgttg tctcatttga 1020 tgaaagtcta aacaaagtta cacaaaattg tcaaatggat attgttgtta gattttggaa 1080 tactttagaa aatcaggtta atgtgcgata tttgaattca gtttttcttg gccattcttc 1140 tgctgcgcat attctaaaaa atttcaataa agaaatagaa agtctkgatt tttcaaaaat 1200 gatycaagta tctatggatg ggcctaatac aaattggaaa ttttttaaag aactttccat 1260 ataccgaaat gagtgcgaat taaattcttt acttgacatt gggtgttgta gtttgcatgt 1320 agtacatgga gcttttaaat taggttctga gtctacatca tggaatctaa aaagtattct 1380 taagggagca taccaaatac tccatgatac tccggctagg agggatgact ataacagtgt 1440 tactggctca aatcggtatc cattaagttt ctgttcaaca agatggattg aagaccaaat 1500 tgttgctgat cgcttaatcg aaatatggga caacattgta aaattaatta gattttggga 1560 aaaattatca aagtctaaac agccttcctc aaaaagtttt tttaatgttc aaaaagcatc 1620 aaatgacttc ctgacagttt caaagttaca attttttagt tatgttgcca gcttgtttaa 1680 aacctacttg acactctatc agactgatca accaatgatt ccatacatgt acgctgatct 1740 tagagaattg ataacaaata ttttagctct atttattaaa ccttcagtaa ttcaaagctg 1800 caaaaaaggt agcgacttaa aaaagataga tctctataaa aaagagaatt tgctaccaag 1860 gaaagatgtt attattggtt ttgcggctga agtttcaatt aaagaactcc ttgttaaaga 1920 taatataact cttaatgaga taaatcagtt tagaaatgaa tgtgttttat ttctcacaaa 1980 aaccatggaa agtttactta aacgcatccc actagactct gcaatcgtag caaatgcaag 2040 ttgtttcgaa ccaatcaatc tcaaaacatt aacaattgaa tctaatcaaa ataaactaaa 2100 aagattgtta cattatcttg tcagtttaaa aattatatct gcaacacaaa gtgataatgt 2160 catgcttcag tattcaatat ttttaagaaa tgatatgaaa ataaatatag aaaaatttct 2220 tagctttgat aaaaaaacca ctcgtttaga tatcttctat tttcatcaac tagatatagc 2280 caaacataag gaactttctt atattatcaa aatgatttta acggttagtc atggtcaagc 2340 ttttgtagaa agaggtttca gcgtaaacga aacaattcta aatgaaaata ttcagtctaa 2400 ctcaatagtc tctcatcgca ttataaaaga tcatatgaat tctcagaaca ttcaagctga 2460 cactattaat atcacaaatg cattaatgtt atcagtaaag tcagccagac aacagtacca 2520 aatcagttta gcagagcagg cgaatttaaa gaaagagcgt gaatcaaaac caccaaagag 2580 caatcatatt agatgaaatt aaagaagtag aaaaaaaaaa gaatttactt gtaacagcat 2640 gtagtacgtt aaacaaagat tttattcaat ctgttgaaga tgctgacaaa aataacgatt 2700 taacaatgct taaaaaagca actgctttaa aaagaaaatg cgacgaaaca gaaaaagaat 2760 caaaagagtt agattcagtt ttagaattgt ttaaagttag atttagtttg tgtttaattt 2820 tagaataaat ttaaaaaaat tccattattt tatgactttt agtatgttta gaatctattt 2880 aaattatata tatatatata tatatatata tatatatata tatatatata tattaaatct 2940 taaaytcagg gattttaaaa aactaatcat gtactttaga atactcagaa aaacttttaa 3000 aattatagtg aaccctg 3017 // ID CR1-54_AAe repbase; DNA; INV; 2808 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-54_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2808 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1141-1141 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. The consensus is likely 5'-truncated. XX FH Key Location/Qualifiers FT CDS 2..2749 FT /product="CR1-54_AAe_1p" FT /note="reverse transcriptase." FT /translation="WLRPDVLNSELCSNYTIFRCDRSEQTSSFQRGGGVLI FT AVKASFRCTSVSLNDCDSLEQVVVSVNLPMSTIYLCGIYIRPASPPEAYSM FT HATAIQSICNFSSETDAIIVVGDYNLPGLSWIYDDELNSYLPCNASTEAEM FT TFVEAVMATGLYQVNSIRNVNNRILDLAFVSDVCDIEVLEAPSYLLRMDAH FT HKPFVLRVQIWETRNNSSSDSNDEADFDFSLCDFESLNASLSTINWTNELA FT NKGTDEATAHFYDKLYDVLHSNVPRKRLRPKQKGKLPWWSPELRRLRNGVR FT KARKRYYRSKSPVARQNLHVIESRYNECKVFTFRNYINTIESNLKNNPNSF FT WSFVKSRKSNNRIPEQIRFQDRTSANVDESANLFADFFSSVNSTISPTLST FT ATRHSIPTHDITLSFLNFTEQDVKTALTSLNVKKGAGADRLPPSFWKECAE FT SMKLPISLIFNRSLADRKFPAVWKTASVVPIHKSGSINSVDNYRGISILCC FT IAKVFESMMHMVLYNATRHLISDSQHGFVKKRSTVSNLMCYTNFLSSEIER FT RQQVDAVYFDFSKAFDKVPHDLAIKKLDRMGLPDWITEWLRSYLSGRKAFV FT RIGDTQSRGFDITSGVPQGSVLGPLIFVLFINDLSYRLKSCKLFYADDLKI FT YRAITSVIDCYALQSDVNELLLWCSENGMQLNNSKCKCITFTRRLSSISFN FT YEINGSIIDRVTTINDLGVTIDSKLKFSKHVSVTTAKAFSVLGFIRRNSQA FT FQDIYTLKTLYCSLVRTILEYAACVWSPYYTTQVLHIEKVQRSFLRYALRS FT LPWNDPVNLPDYESRCMLLNMETLSFRRTKLKRMFLFDLITHNIDCSDLLA FT EVGFLAPSRYLRSRQLLAPRTHRTAYGQNSPLTSCIRSFSIVNDVFDFNQS FT RVEFKRRISSIR" XX SQ Sequence 2808 BP; 775 A; 654 C; 572 G; 806 T; 1 other; atggctccgt cctgacgtac tsaactccga actctgttcc aattacacaa tattccggtg 60 cgatcgcagt gaacagacaa gctccttcca acgtggtggt ggtgtactta ttgcagttaa 120 agcttctttt cgttgtacat cggtctcttt gaacgactgc gactcccttg aacaagttgt 180 tgtctcagtt aatcttccaa tgtcaacaat ctatctttgt ggcatttaca ttcgaccggc 240 ttcaccacct gaagcgtact cgatgcacgc aactgcaata caaagtattt gcaacttctc 300 ttccgaaacc gacgcaatca ttgtagtcgg cgactataat ctaccagggc tttcttggat 360 atatgatgac gagcttaata gctatctacc atgcaatgcg tcaactgaag ctgagatgac 420 ttttgttgaa gcggtgatgg ctacaggact gtaccaagtc aactcaatac gtaacgttaa 480 taatcgtatt cttgatttgg cttttgtcag tgatgtctgc gatattgaag tgttggaggc 540 accatcttat ctcttgagaa tggatgctca tcacaagcct ttcgttcttc gagtacaaat 600 atgggagaca cgcaacaatt catcttccga cagcaatgat gaagcggatt tcgacttctc 660 gctttgcgat tttgaatcgc taaatgctag tttatcgacg atcaactgga cgaacgaact 720 ggcaaacaaa ggcactgatg aagcaactgc tcacttttac gataagctgt acgacgtttt 780 gcatagcaat gtccctcgta agcgtttgcg tcctaaacag aagggcaaac tgccgtggtg 840 gtcacctgaa ctgagacgtc ttcgaaacgg tgttcgtaag gctcgcaagc gctattatcg 900 ctccaaatcg cctgttgcca gacaaaatct tcatgtgatt gaatctcgtt acaacgaatg 960 caaagtcttc acgttccgta attacatcaa tacaatcgaa tcaaatttga agaacaaccc 1020 gaatagtttc tggtcattcg tcaagagccg taaatcgaat aatcgcatac ctgagcaaat 1080 acgttttcaa gacaggacgt cagcgaatgt ggatgaatcg gccaatctat tcgctgattt 1140 tttcagcagt gtgaatagca ccatctcccc aactctctcc actgcaacgc gacattctat 1200 tcccacgcat gacataacgc tgtcattcct aaattttacc gaacaagacg tgaaaactgc 1260 actaacgagc cttaatgtta agaaaggcgc tggagctgat cggcttcccc cttcattttg 1320 gaaggagtgt gctgaatcta tgaagctgcc aataagccta attttcaacc gatctctcgc 1380 tgacagaaag tttcctgcag tttggaaaac ggcctcagta gtcccgattc ataaatctgg 1440 cagtatcaat tctgtcgaca actatcgtgg aatatcaatt ctttgttgca ttgccaaagt 1500 ttttgaaagc atgatgcaca tggtcctgta taatgccact cgacacctga tatccgactc 1560 ccaacacgga ttcgttaaga aacgatccac agtttcaaat cttatgtgct acaccaattt 1620 tctttcatcg gaaatcgaac gccgtcagca agtggacgct gtgtacttcg acttctcgaa 1680 ggcatttgac aaagtccccc acgatcttgc tatcaagaag ttagaccgaa tggggctgcc 1740 cgactggatc acagaatggc ttcgttccta cttgtccggc cgcaaagcct tcgtcaggat 1800 cggagacaca caatctcgtg gttttgatat cacctccggt gtcccccagg gcagcgttct 1860 aggcccactt atatttgtac tgttcattaa cgatctgtcc taccgtttga aatcgtgcaa 1920 gctcttttat gctgatgacc ttaagatcta cagagctatc acctcagtca tcgactgcta 1980 tgctctacaa tctgacgtca acgagctgct tctatggtgt tcagaaaatg gaatgcagtt 2040 gaacaacagc aaatgcaagt gcataacatt tacacgccgt ctttccagca tctctttcaa 2100 ctatgaaatc aacggtagta tcatcgaccg agtgacgacc attaacgatt tgggtgttac 2160 cattgacagt aaactgaaat tcagcaaaca cgtgagcgta accactgcta aggctttctc 2220 tgttcttggg ttcattcggc gtaactcaca agcgttccaa gatatctaca ccctaaagac 2280 attgtattgt tcattggtgc gaaccatact ggaatacgct gcctgtgtgt ggtcacctta 2340 ttacactact caggtgctcc acattgagaa agtacagcgt agctttcttc gttatgcact 2400 caggtcacta ccgtggaacg accctgtcaa tcttccggac tatgagagtc gctgcatgtt 2460 gctgaatatg gaaacactgt catttaggag aaccaaatta aagcgaatgt ttttatttga 2520 tttgattacg cacaatatcg attgtagcga tttattagcg gaagtagggt ttctcgctcc 2580 aagccgttat ttacgtagtc gccaactctt ggcacctaga acgcatagaa ctgcctatgg 2640 tcagaatagc ccacttacta gttgtattcg tagttttagt attgtgaatg atgtgttcga 2700 ttttaatcag tccagagttg agtttaaacg tagaataagt agtattcgat aagtaaatca 2760 gtctgtagga aatttatcca agacgttgca aataaataaa taaataaa 2808 // ID Gypsy-8_PPc-LTR repbase; DNA; INV; 338 BP. XX AC . XX DT 08-JUL-2010 (Rel. 15.07, Created) DT 08-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_PPc_; KW Gypsy-8_PPc-I; Gypsy-8_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-338 RA Jurka J.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1009-1009 (2010). XX DR [1] (Consensus) XX SQ Sequence 338 BP; 87 A; 96 C; 59 G; 96 T; 0 other; tgactagaga tactctcagt ctgttttgct ctttcttaat ctcaaatctc aatatctcac 60 tctcgtctca atgagagaaa atccgatagt aatcctattc tacgcaggaa cagaccgacg 120 atttgactgg agaaagcaaa gatgataatg aagatcgatg cctataaaaa cccccctctc 180 ccctcttctc gtcgtctctc cgagcggtct ctcgaagtcg tcactcgtcg actccgctct 240 ctctcaccga ccctctcact tgcatctctt tgctccctca gtggaagcga taaagtgttg 300 agcgtcaagt ctcagtgtac tgtacattgc aaataaca 338 // ID hAT-12_SM repbase; DNA; INV; 2678 BP. XX AC . XX DT 23-JAN-2008 (Rel. 13.01, Created) DT 23-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-12_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2678 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(1), 13-13 (2008). XX DR [1] (Consensus) XX CC Present in >1500 copies in the genome. The youngest copies are CC ~99% identical to consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 597..2393 FT /product="hAT-12_SM_1p" FT /translation="MSSRKRKIADEGRVFNEEWNSKYFFTENGGKPFCLIC FT HKSVAVMKEYNVKRHYEKEHERQYKDLTGEIRKNKFRTLKASLTAQQSIFR FT KQSIHNELVVHSSYIVAELVAKERRPFTDSEFVKRCLVAVAEKLCPETKTL FT FQDISLSARTCARRVEEIGTNLFEQLKCKAKSFDCYSLAMDESIDITDTAQ FT LIIFIRGIDGDFNVHEELASLCSLKGATTGNDLFIKVKETLNSLELGWEKL FT KCVTTDGGRNMCGSKTGVVGRICAELQNIACDNPMVFHCIIHQESLCCTIL FT SSMKDIMNTVISTVNYIRRHGLKHRQFKEFLKEIDSEFNDVIYYSAVRWLS FT RGAVLRRFFNLREEIDIFMTEQENTILQLSDQKWIMSLAFMVDITAYLNEL FT NLNLQGKGKLLADLFCDINAFEAKLCLLKKHMIEENLTHFICCKSVPADGW FT QTQKGTFVSVIEDLQTQFSTRFMDFHNKSSEIRLFLNPFNVDINDVPNELQ FT MEILQLQHNEILKNAFLLENIQHFYRCLPKQYEGLISFAKKMIVAFGSTYI FT CEQAFSAMSFRKNKFSSQLTDEHLHASIRICTSGLKADIDNLAKDKQPQKS FT H" XX SQ Sequence 2678 BP; 901 A; 414 C; 495 G; 868 T; 0 other; caggggtcgg caaccggcgg cccgcggacc gcatgcggcc caaaagtgtt tttttggcgg 60 cccataaatg attgcccata gtggtatacc attttggtat attatttgca aaataaattt 120 taaataaata ataaattata ttatattatg tacactcatt caaatatgtc tgcccgctca 180 attctaatat atagaagcta tatattaacc aagaatttga tgcatgtttt gaaacactta 240 taggagaagc ttcatgaata agtttggcct aagccctaat gtaatgccca ataatcgatt 300 gtcacaaaaa ttcattctac tgtataattc ttcttaccag acgatctgtc atctaatttc 360 atcatgcaac ttggcgcaag ccaattgtac gataatgagt cactgggtgg ctgaattctt 420 ggaaactaac tgttttggca tgcttttgaa accgcattgc ctggtgtctt gcatataaca 480 tacacgtgtg ttatgcacac attcaattgt gtggtttcat tattgtcgca gctttgtaat 540 tattattaat atcactatta ttgtgattat atatattttt caaataattt acaaaaatga 600 gttcaagaaa acgtaaaatt gcagatgaag gaagagtttt taacgaggaa tggaacagca 660 agtacttctt tacagaaaat ggtggtaagc ccttttgtct tatttgtcat aaaagtgttg 720 ctgtaatgaa agaatataat gtcaaacgcc attacgagaa agaacatgaa agacaataca 780 aggatttgac tggtgaaatt agaaaaaata aatttagaac attgaaagct tctttgactg 840 ctcaacaaag catatttagg aagcaaagta ttcataatga attggttgtt cattctagtt 900 atattgtagc ggaacttgtt gcgaaagaaa gaaggccttt tacagattca gaatttgtta 960 agcgatgttt agtagctgta gcggaaaaat tatgcccaga aaccaagacg ttatttcaag 1020 atattagcct atctgcgcga acttgtgcaa ggcgagtgga agaaatagga accaatttat 1080 tcgaacagct taaatgtaaa gcaaaatcat ttgattgtta ctctctagca atggatgaaa 1140 gtattgacat aactgacaca gcacagttaa taatatttat tcgaggcatt gatggtgact 1200 ttaatgttca tgaagaattg gctagtttat gtagcttaaa aggcgctaca accggcaatg 1260 atttgttcat aaaggtaaag gaaactctca attctttaga attgggatgg gaaaagttaa 1320 aatgtgttac aaccgatggt gggagaaata tgtgtggctc taaaactggt gtggtaggcc 1380 gaatttgtgc ggaattacag aatatagctt gtgataatcc aatggtgttt cattgcatca 1440 ttcatcagga atcactttgc tgtacaattt tatcatcaat gaaagacatt atgaacacgg 1500 tgatatcaac tgtgaattat attcgccgac atggtctgaa acaccgtcaa tttaaagaat 1560 tcctgaagga aatcgattcg gaattcaatg acgttattta ctactcagca gttagatggc 1620 taagtagagg tgctgttcta agaaggtttt ttaatttgcg tgaagaaata gatattttca 1680 tgacggaaca agaaaatact atcttacagc tatcggatca aaaatggatc atgtctcttg 1740 catttatggt agatattaca gcgtacctta atgaacttaa tttgaactta cagggcaaag 1800 gtaaattact cgcagatctg ttttgtgaca taaacgcttt tgaagccaaa ctatgtctgt 1860 tgaagaagca tatgattgaa gaaaatctaa cccattttat atgttgtaaa tctgtacctg 1920 cagatggatg gcaaacgcaa aagggcactt tcgttagtgt aattgaagat ttacaaactc 1980 aattttcaac aaggtttatg gattttcata acaaaagcag cgaaatacga ctgtttttga 2040 accctttcaa tgtcgacatc aatgacgtac caaacgaact gcaaatggaa attttgcaat 2100 tgcaacataa tgaaatatta aagaatgcat ttttgttgga aaatatccaa catttttatc 2160 gatgtttacc aaaacaatat gagggattaa tcagcttcgc gaagaaaatg atagtagcat 2220 ttggaagcac atatatttgt gaacaagcat tttctgccat gagttttaga aagaataaat 2280 tttcttctca actgactgat gaacatctgc atgcttctat tcggatttgc acatcaggtt 2340 taaaagccga tatagataat ttggctaagg acaaacaacc acaaaagtct cattagcaaa 2400 ggagatagag gcaagttatg catttactgt attttggttt tattaaaaca aatgtcattc 2460 ggtttctttt ttataaatca tgtttgtttc atatctctta ctttaaaagt tttaacttta 2520 aatatttctt gttttacaaa tgattgttat aataaaaatt tattagtaat ttttttacac 2580 ttcccttctt ctaaaatcta aaatataaca cttgcggccc ttttattaaa ctaaaagctg 2640 aatgtggccc ttatgtcttg aaaggttgcc gacccctg 2678 // ID Gypsy-14_DWil-I repbase; DNA; INV; 7978 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_DWil_; KW Gypsy-14_DWil-LTR; Gypsy-14_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-7978 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 5624243 5616266. XX CC Positions [6162-6623] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 4725..6899 FT /product="Gypsy-14_DWil-I_3p" FT /translation="MIDDVLRDLIGVCCYVYIDDIIIFSKSEEEHMTDIRR FT VFEKVSEANLRINLEKTTFMKNEVNFLGYTVSSEGIRTDEKKVAAVTAMKP FT PTNLKELKSFLGMTSYYRKFIRDYAKVAKPLTNLTRGELGQVKANQSRKVQ FT ITLGEIELKAFEDLKALLVSSEVLAFPDFNKPFNLTTDASNEALGGVLSQV FT QDGKDRPISFISRGLSQTEENYATNEKEMLAILWGLETLRSYLYGAEDIRV FT YTDHQPLTYALGNRNSNAKLKRWKARIEEYNCKLFYKPGKSNAVADALSRI FT TSHLNPLRLAPSAGQTLRLPLLRVNTPLNAFRNQIILVMGTDIDTSTEPHP FT GVQRHIVHINLTQDIAISRAALEGTLSKTHINGIKVPETAVSEMQPIIWEY FT FNNYKIRLAQEIVEDITSEDQIQEIIQREHRRAHRSYWENKQQILSRYYFP FT KMAARTKSYVSKCTTCGTQKYNRRPKKPHLQPTPIPRMPCEILHIDLFVIE FT KLWFLSCIDKFSKYAMLFPIENRSSIHLKEKLMDVLAYFSVPKLIVMDNER FT GLLCPIIINLLHELQIEIYRTPSQKSEVNGQVERFHSTFLEMYRCLRIEHP FT KLSAAQLAKLTVARYNESIHSVTKQKPANLFFNRTSEQELANNTNPRETLL FT ENIRGLLEHQQTTTNRAKNLKREKPLSYEVDQEIFVANKQIKGKHVKRFFQ FT EKIAQDNKVTVVTQTGKNLHKSHIK" XX SQ Sequence 7978 BP; 2705 A; 1823 C; 1677 G; 1773 T; 0 other; tgggaactcg taactggcgc ccgagcaggg acccgtaatt tttctagaag ccatgatagt 60 gctaaccgcc cttcgggggg aatctagaaa cattacgggt ctagacaata acaggaagag 120 ctccacagtt gaatgcaagc catgtcagta aaacgcatat taaagaaaaa taattaattg 180 ctcaggcaca aacctgacct atcccaaata tcaaaaaaaa acaaaacacg aaaataataa 240 tattggccac taggccaacg ggaaatgctt cacctgatta tggtgatatt tgaaaccaac 300 gcgtgcctcg gcttgcataa agctgctgac tcttcaagta accatcccga acgcgttttg 360 ctaaaaccgc attctcacgc cagcagagtc aggcaaggtg gaattagcaa aacgaaggga 420 aatttttgcg cctcttgcat ttttggcgtg aacatgtaaa taaaagaaac cggaagggaa 480 atgacacgct cccgtggcaa caggccttta aaaaaaaaaa aaaagagaag aaaaaattgt 540 aaaaaaaaaa gaaagatgac cacatgcagt gaggtggttc aatatggggg gacatgtcat 600 ctaattcctc tttccggtgt cttgcaataa gtgaatgact ggttgaaaga ccaagtcctc 660 cttcagagag ggcggaaaga gttcattctg aattaacgac taactggttg aaagaccaag 720 tcctccttcg gagagggcgg aaagagtcgt tttgattagc gaatgactgg ttgaaagacc 780 aagtcctcct tcagagaggg cggaaaaagt tcattctttg aatgactggt tgaaagacca 840 agtcctcctt cagagagggc ggaaagagtt cattcttgat tagcgaatga ctggttgaaa 900 gaccaagtcc tccttcagag agggcggaaa aagttcattc tttgaatgac tggttgaaag 960 accaagtcct ccttcagaga gggcggaaag agttcattct tgattaacga ataactggtt 1020 gaaagaccaa gtcctccttc agagagggcg gaaagagtcg tttgattggc tgaatgactg 1080 gttgaaaaac caagtcctcc tgcagagagg gcggaaaaag ttcattcttg actgaaaagc 1140 caaaccttcc ttctgagaaa ggaactgctg aaagtaaacg tatacgaatg gttgaaaacc 1200 caaaaactag taccccaact gaaaatatta aacttttgtt cttgattctg tcatttagtt 1260 tattactcag gttatcttgg ccgtctgcga gcaggagtgg ggtgcctcca cttgagtata 1320 ctcctcaggt tgggacaccc agctcgcctt ctatttttcc tcgtatcgga aaactgactc 1380 taaagaatgt gtcttcaacc tgtgtataat ggtgatcgat tcctcacatg gtggaggtta 1440 atccagatat tcctgaatga tcaactcagg aacgttaatc ggcttcggga acggaaaggg 1500 gattgtttcc tcatcttcat ccggaacagg cgcagtaatt cctgagctgg caaagtcatc 1560 ctcccatata ttgtaactct gatcaaagct gggggaccct cctctgatga cccgagggaa 1620 caattcagaa ctcaaatccg caactatgat ttcctcctcc atgctctcgt catcagaagg 1680 ggaatcgcaa taaattattc ccgtttgcac agttgggttg aagtttccgt ccgatcgctt 1740 catattcttg gctgatacta aattaaacct tacaccagca ctgaccacta tggtctttga 1800 agacgagaat atgcttagtc agacggttga ggtaaaagac acaaaaattc ctaaaaaaca 1860 acaatgcagg acagaatcaa cgcataaaag gggtgaaggg gaaaatatga aacacatgca 1920 caagcgcatt ttcaattatg cacgctcagc ttaaataacg cgaaaaccgg aaattgttac 1980 cccttccaag agtttgagaa atagtgacag tagtagagat tcaaatccag gtatacacat 2040 acacacttta ggagtcaccc actcataaaa acaagtatcg cgtcctacaa tagggaatat 2100 agaactatgt atttaaaaat tgaaaacagt tgctgtaaaa aaaatgaaat gagtgacaac 2160 gaaaacttga gcaattggca gccgcaggag atgccgcgta cgtcacattg gacaatgata 2220 ccatggtgaa tacgcaccag tagtcacccc tgtgacccat agcgaaccta gatcccgagt 2280 cgagtcaacc ggtcaagccg gaatactcaa tttaaaaacc tacgttccca atttgaatgc 2340 caacgcagca ttttagagat ggaagaaaaa gtaatattta agtatcatga tgaaatgggg 2400 catgtaggag tagacaaagt agccgacttg ataggtaaaa atgattggtt tcccagcatg 2460 agaaagaaaa ttgataagct tgtaaaaaaa ttgtctcaaa tgtataacgt tttcaaacaa 2520 atcaggaaaa cagaaggatt tcttcattcc atacccaaag gagataaccc gtctcaatat 2580 ttacatatac atagatcact atggaccgat acatagtaaa gatagcaaaa tgaaacacat 2640 attagtcgta atagatgcat gtaagttatc atataaagct ttgagactcc aaagcttgcg 2700 cagatcgcag gtgatatcca gctctacagt tatcactcga ggctttggtc ccacaaagct 2760 tgtatattgc agctgatagc agcttgagct cgataggctt gaagcccgca cccacgtgtg 2820 cggttatttg ctacaaagag tacgaaatca agtgaagcaa tcgaagcact ggaagaatac 2880 ttcaaagtat atagtaaacc caagaatata atatcagata gaggcagttg cttcacctcc 2940 aaaggtttcg atgattttat taaggaatgt aatgttaatc atataaaaaa agcgacagca 3000 tcaccacagg ctaatggaca agttgagaga gtcaataaag acctaggacc catgattgcc 3060 aagttagttg acagtgaaaa caataagcaa tggcatacag tattggataa ggtagagttt 3120 acaatgaata acacaagaaa cagatgcacc aatgaatatc caagtataat gctatttggt 3180 ataaaacaag aggtagcatg atagacagct taaaggaaaa tgtaatagaa gtgaatcaag 3240 tttaagaaaa tcgaaactta acagaaataa aggaaactac gcttcgacaa caggaaatca 3300 acagttcgag cgaatttatt cgcaacctaa acatggaggt gaggatacac ctcgtacatt 3360 acgctccgac aactttaaac accacctacc gggccgcttg aacgttgttg ataagatcca 3420 agagtacgaa atcgacatgc cagggaccag ccagggtggt aacgactatc ggcccccact 3480 ctccgataaa aatcataagc ccaatgacaa tccgttcagg ggaaattaca agtctggtaa 3540 agactacaaa cataacaact acaggggacg aaattcctac aagaaatcca attgggaaaa 3600 caattacggc cagaacaacg gacaccaaca aggtgggaat aaaaatgatc aaaggaaacc 3660 ggaaaatcaa acacacaacc aaaggcaagc cttgaatcat ctaacgatgg aggaatcgaa 3720 tatccccgaa aactccaggg gtgagtctaa acacttggat gatcaacgaa gcccaaaaaa 3780 ggctgaaaat tttatgaagc cagccgaacc ggagttccgt acctaaagta cgtcaccaaa 3840 gatggggaat gtctcacctt cctcatcgat actggggccg cggccaatta catccagaag 3900 cgacacgccc cccatggtca acaaatgaca cgaccttttt acaccgcttc cgttggcgga 3960 agcatgaaaa ttacacacta catagagggg caatttttta aacctttcgg gatcgaaaca 4020 accgtcattt tttatgtctt gcccaacctg gtcgactttg acgggatcat cggcgacgac 4080 acactacgca ggatacaagc cgtattggac cgctccacca atcaactcat catgccacaa 4140 ggcgtaaaaa tcccactatc tatgatgagc tcaaccgcag caaacatgat ggccaccccg 4200 aagtcgtata cccctcagct tgatgaaata atccattcct tcagccacat ttttggaccc 4260 ttagttgagt cagaaaccgt gacaacggag gtaagggcgg aaatccgcac gattgaccag 4320 gagcccgtct acacacacac ctaccctttc ccagcgccaa tgatgggaga ggttaagagg 4380 caaatcaaca agctactgga caacaaaacc atcaggccat caaagagccc atataattct 4440 cccttatggg tggtacctaa aaaaccaaaa ccaaatggcg agaagcagta tagattggta 4500 atcgacttca agaaattgaa tactgtgact atcgctgatc gataccccat tccggatatt 4560 aactccacgc tggctagcct cggcaaggcc aaattcttca ccactttgga tttgacttca 4620 ggttttcatc aggttcggat gcgcgaaaag gacataccga aaaccgcttt ttcagtccta 4680 aacgggaaat acgaattcct aagactcccg ccatctttca acgaatgatc gacgatgtcc 4740 ttcgagacct tatcggggta tgctgctacg tatacattga cgacatcata atattcagca 4800 agagcgagga agagcatatg accgacattc gaagggtctt tgagaaagtc tcggaggcca 4860 accttcggat caacttggaa aaaacaacct tcatgaaaaa cgaggtaaac ttcctgggat 4920 atacggtgtc aagtgaggga atacgcaccg atgagaagaa agtggcggcc gtcaccgcaa 4980 tgaagccccc aacaaacttg aaggagctca agagcttttt ggggatgacg tcatattaca 5040 ggaagttcat ccgtgattac gccaaagtag ccaaacccct caccaacctc acgagggggg 5100 aactggggca agtaaaagca aaccaatcga ggaaagtaca aatcaccttg ggtgaaattg 5160 aattgaaggc cttcgaggat cttaaggcgt tgctggtttc atctgaagtc ctggcttttc 5220 ctgatttcaa caagccattc aacctcacga ccgacgcatc caacgaggca ttaggcggtg 5280 tcctgtccca ggtacaagac ggaaaagatc ggccgatttc cttcatctct aggggattat 5340 cccaaacaga ggaaaactac gcaacaaatg aaaaggagat gctggcaata ctatggggac 5400 tggaaacgtt gcgctcgtat ttgtacggag cagaagacat cagggtatac accgatcacc 5460 agccgttgac ttacgcctta ggaaaccgaa actcgaacgc caaattaaaa agatggaagg 5520 cgaggatcga agaatataac tgcaaattgt tctataagcc aggaaaatct aatgctgttg 5580 ctgatgcctt atcacgcatc acctcccatc taaaccctct cagacttgca ccgtcagcag 5640 gacaaacact ccgcctccca ttactcaggg tcaacactcc actcaatgct ttccgaaacc 5700 agataatctt ggtcatgggg acagacatcg acacctctac cgaacctcac ccaggtgttc 5760 agcgacatat tgttcacata aatttaacac aagacattgc catatctaga gctgcgctgg 5820 aagggactct ttccaaaacc cacataaatg gaatcaaggt tccggaaacc gctgtttcag 5880 aaatgcaacc cataatctgg gaatatttca acaactacaa aatccgtcta gcgcaagaaa 5940 tagtggagga tataacttct gaagatcaga ttcaggaaat aatccaaagg gaacatcgaa 6000 gagcccatag gagttactgg gagaataaac aacaaatctt atcgaggtat tatttcccga 6060 aaatggcagc aagaacaaaa agctatgtgt caaaatgcac tacatgtgga acccaaaagt 6120 acaaccgacg accaaagaaa ccacacctac aaccgacacc gataccccgc atgccatgcg 6180 aaattctaca catcgatctc ttcgtaatcg agaaactatg gttcttgtcc tgtattgaca 6240 aattctccaa atatgcaatg ttgttcccta tcgagaacag gtcgtctata cacctcaagg 6300 agaaactcat ggacgtccta gcatacttct cggtgcccaa gctaatagtc atggacaacg 6360 aacggggact attatgcccc attatcatca acctcttgca cgaactacaa attgagattt 6420 atcgtacacc ttcacaaaaa agcgaagtaa atggacaagt agaacggttc cactcgacat 6480 tcttggaaat gtaccgatgt ctacgcatag agcacccgaa gttgagtgcc gcccaactag 6540 caaaactaac ggtcgctaga tacaacgaat ccatacactc cgtaaccaag cagaaaccgg 6600 ccaacctttt tttcaaccga acatcggagc aagaattagc caacaacaca aatcccaggg 6660 aaacgctctt ggaaaatatc agaggattgc tcgaacatca acaaacgact acaaataggg 6720 ccaagaactt aaaaagggag aaacctttgt cgtacgaggt cgaccaagaa attttcgtag 6780 ccaacaaaca gatcaagggt aaacacgtaa aaaggttttt ccaggaaaaa atcgctcagg 6840 acaataaggt cacggtagtc acacaaacag ggaaaaatct acacaaatca cacattaagt 6900 aaaaagaaaa aactgccccc tcattctgtg tctctcattg gcggagagct acaaggtctt 6960 taagtatgag tccccaatcg tgacgttgca acaaggaaag ggatggagaa ttgaaggaca 7020 ctcaaatctg gtacatgtaa tccctctcga ttcattcgcg ggctttgtcg aaaacttgtc 7080 atcagccctg gatcaatatc attctgggga cttgaagaca ttggccaccc tgaaaattaa 7140 ggatctaaac ataaaaatcg aggaactcac ggccccccac aggaggatta gacgagcaat 7200 cagatggtta ggctcagctt ggaaatgggt agcaggatca ccagacgcag ctgactggga 7260 ccagatccta caatcacagg actctctgat tagagaaaat aaccaacaat tcactatcaa 7320 taaacaattc ctatccggca ttcaagaagc attcgacaaa acaaacgaac ttgcaggagc 7380 cctgaacaca ataaacaagg ataacgactc cgaactatcc atcctgttga acaaaatctt 7440 gctacttgaa cggaagctgg atgagatcgt gcgagcatgc caactcgcaa aagttggggt 7500 gatcaactca aacctactgg atatagacga gttgcaggac attatcaaca ccacaccaag 7560 cttgccctat aacaacatca ttgaggccat cgaatttgcc aaacctacta tcttcgtaaa 7620 cgggaccttt gttatccaat tttccaacga atcggtgacc atcggtaaca aaacttccag 7680 aagccagacg gttacaagga tggtggctat accatccacc ttgtccgaaa tcgtaaacga 7740 aggacaccga ctaaacgtgg aatacattca cgacttgcac ataaacaatc tacgacgcct 7800 caacactcta acgaaaggaa ccaccatcac ggccggctgc accatcctcg tattcctcgc 7860 catcttcata ggttggattc tgaagaagac gctccagaaa ccagaaacaa cacccgaaat 7920 taagggatct ccgggacgtc gatttttaag gggggaggag ttagcacact catcctca 7978 // ID Copia-24_DPu-LTR repbase; DNA; INV; 208 BP. XX AC scaffold_34; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: long terminal DE repeat. XX KW LTR Retrotransposon; Transposable Element; Copia-24_DPu-LTR. XX NM Copia-24_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-208 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 712-712 (2010). XX DR Genome; scaffold_34; Positions 704481 704688. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 208 BP; 52 A; 46 C; 34 G; 76 T; 0 other; tgttgcgatt tatcgacctg gcaacccttc ttcttgtctt cacgtcgtct gcaccgtaaa 60 actgtctatc actgttctta cttgaaaact ctcgtgtctt atggatcctt ttgttctcgc 120 ttgatatatt tgtcgagtct catatcaagc aataataaag aaacaagtta atcagtgaaa 180 ctagtctacg aggtgtttac tttctaca 208 // ID Sola1-10_AP repbase; DNA; INV; 3836 BP. XX AC ABLF01041013.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 11-MAY-2010 (Rel. 15.12, Last updated, Version 3) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-10_AP. XX NM Sola1-10_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3836 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(959..1027,1006..2292,2241..3158) FT /product="Sola1-10_AP_1p" FT /translation="MKLVIVIVIVIFYCFRKMRDTKMEDERYKNVGAWLES FT QQPSISTTLNTVPYISNPETVSYNIINQLSDDSVTVETEKSDECVDNGSDE FT DPTFMINTIDRSSESLSCIVQVPVNSTVITENDTDFIIHEVVDYLIKEVCK FT KNKSRKRSCDPSSWESNIRKKRLASGMSYLSKKGDKIIQCKQLKPPCVNCK FT KKCSEKIDEHDRKKIFCTFWDGDKDINLKRQYITTCIEEKCVDRKRERNGN FT RAGVRQKSLVYSFEIENVNTGIKNKIDVCKTFYLNTLNISDMFVRTALKKK FT THGGMVSLDRRGYHAPPNKISDDIRDGVKKHIKSFPAIESHYSREKTVKKY FT LGNELNISKMYKLYVDNCKEHKVPTGNIANYWLYSEIFNNEFNLAFKEPAN FT DSCDICDEFIIKIKNSTIEEEACLQKNYESHLDEAKLRYAEKKKTNLKQRK FT LAVTKKPKKKDKLEAKETSCNKKTIMVDLQKCLPTPYLTNGQSFYLRKLWT FT LNLTIHDDTLNKATCVLWDETKGGRGGNEIASCVLKWALQLKETESNIEEL FT NIWSDNCAGQNRNFMIVTLYMWLLKKMPNLKIINHKFLLRGHTHMEVDSDH FT SIIERAKKKIEHSSIMTPWDWQKFIRSCKNKNPFEVINMELEDFLDFQCLY FT NSKNSPLIYRRKEVKISEIVHLRLEKEKEGLLMFKTDFAEKEFKTVSLNRT FT TRSSVWPEHLPQVTSEPKPISKLKYKDLQTALKWVPRIFHDFYTNMKYDDK FT IADYPE*" XX SQ Sequence 3836 BP; 1530 A; 434 C; 588 G; 1284 T; 0 other; ctgtaccgca gaagtcaact aacgcaagtc cattttttga cattttgagt tagtcatata 60 atcagaatgt gcgttttatg atatttttaa aggtattaaa acgtaagtcc tatgtacaat 120 aaatgcaaag ttacctagaa gtatactaac ttaagtccta aactatgata gaagtattga 180 aatgtaagtc cagcatgata gtttacttct gggtcaaaat agtaggacga aataatagtg 240 ttggagtata caaacgtaag tccctttacg atactttacc tctagcactg ataagaatgg 300 cacactgcta acatcacatg tttttcttat cgtaagtcca tttatcaatc agtaatcaac 360 ccgccaaatg taatattttt atcatattta tattttttaa tataaattat taattataaa 420 cgtcattatt gtttattttg ttagatgatt aagcataaat ttgttataaa ttatatttta 480 ttagctattt tatagtataa tattgtatat tttacgtgta agaatgcaat caagtcgtgc 540 aaaacgcatt tttaaactag ctacaaatga aaaaggtgtt catgatgtaa gtataaatat 600 taatttcatg aatagtgctt attatcaaaa aaggctctta ttaattattt ttataatatt 660 atatgaatta ttgtaatatt aattataggt atttaaagac acacatatta tgtatgactg 720 ctcaatacca actactgtta gcatggtaca tacaacaaat aatatcaata atggtaatga 780 tgttcttact ctcagtaatt taccagtata cagtataaca gtaagaaacc atgtttagta 840 ataaagaatc tttgtctttc atctacctaa aacaataaaa aaattaatat tactaatact 900 aatttcatat cataataata tttattatat cagaataaat aataaatatt gttatgatat 960 gaaattagta atagtaatag taatagtaat tttttattgt tttaggaaga tgagagatac 1020 aaaaatgtag gtgcctggtt agaatcacaa caacccagta tatctacaac tttgaatacg 1080 gtgccataca tttctaatcc ggaaactgta agttataata ttattaatca gctaagcgat 1140 gattcagtta cagttgaaac tgaaaaaagt gatgaatgtg ttgacaatgg ttctgatgaa 1200 gatccaacat ttatgatcaa tactattgat agatcaagtg aaagtctcag ttgtatagtt 1260 caggtaccag ttaattctac tgttattact gaaaatgata cagactttat tatacatgaa 1320 gtagtggatt atctaatcaa agaagtttgt aagaaaaata aaagccgtaa gaggtcatgt 1380 gatccttcat cttgggagag taatataaga aaaaagagat tggcttcagg aatgtcatat 1440 ttgtcaaaga aaggtgataa aattattcaa tgcaaacaat taaagcctcc ttgtgtaaac 1500 tgtaaaaaaa agtgcagtga aaaaatagat gaacatgata gaaaaaaaat attttgtaca 1560 ttttgggatg gagataagga tataaatttg aagagacaat acattaccac ttgcattgaa 1620 gaaaaatgtg tggatagaaa acgagaacgt aatggaaatc gtgctggtgt aagacaaaag 1680 agtttagttt attcatttga aattgaaaat gtgaacacag gaataaaaaa taaaattgat 1740 gtgtgcaaaa cgttttacct taacacattg aatatatcag acatgtttgt tagaactgct 1800 ttgaagaaga aaacacatgg tggtatggtt tctcttgata gaaggggtta tcatgcccct 1860 ccaaataaaa taagtgatga cataagagat ggtgtaaaaa aacatattaa gtcatttccg 1920 gcaattgaaa gccattatag cagagagaaa actgttaaga aatatttagg aaatgaacta 1980 aatatatcaa aaatgtataa actttatgtt gataactgta aagaacataa agtaccaacg 2040 ggtaatatag ctaattattg gttgtactct gagatattta ataatgagtt taacttagcg 2100 tttaaagagc ctgctaatga ttcttgtgac atttgtgatg aatttataat aaaaataaaa 2160 aattcaacca ttgaagaaga agcttgtcta caaaaaaatt atgaatccca tctagatgaa 2220 gcaaaactaa gatatgctga aaaaaaaaag acaaacttga agcaaaggaa actagctgta 2280 acaaaaaaac cataatggtt gatttacaaa aatgccttcc aactccatat ttaaccaatg 2340 gacagagctt ttatttgagg aagttatgga cacttaacct aacaatacac gatgatactc 2400 ttaacaaagc aacgtgtgtt ctttgggatg aaaccaaagg tggtcgaggt ggtaatgaaa 2460 ttgcatcgtg tgtgttgaag tgggccctac aattaaaaga aacagaatct aatattgaag 2520 aattaaatat atggtctgat aattgtgctg gacagaatag aaattttatg atagttactt 2580 tgtatatgtg gctgttgaaa aaaatgccaa atttaaaaat aataaaccac aaatttcttt 2640 tgagaggtca cacccacatg gaagtagaca gtgatcattc tattattgaa agagcaaaaa 2700 aaaaaattga acattcaagt ataatgacac catgggattg gcaaaaattt attaggtctt 2760 gcaaaaacaa aaaccctttt gaggtaataa atatggaact ggaagacttt ttggattttc 2820 aatgtctata caattctaaa aattctccat taatatacag aagaaaagaa gtaaagattt 2880 ctgaaattgt tcatttgcgt ttagagaaag aaaaggaagg actattaatg tttaaaacag 2940 acttcgctga aaaagaattt aaaactgtgt cattaaacag aacaactaga agttcagttt 3000 ggccagaaca tttaccacaa gtaacaagtg agcctaagcc tataagtaaa ttaaaataca 3060 aagatttaca aacagctctt aaatgggtgc caagaatatt tcatgacttt tatacaaaca 3120 tgaaatatga tgataaaatt gctgattatc cagaataaaa ctagtcagtt attataataa 3180 taagaactaa atagtcacta atcattacta tgttatttta taaaaaaaag tttcctattt 3240 atgttaaagt attataatat aaattttaat tttgttttag ttgtgttctt atgttattta 3300 attatattat gtattattat tatattgtta tgaaataatt atattttaat cagagcaaaa 3360 gtgattttag tttatatagt taacaattga atttaaaaaa aaaaaagtat tataaaatgt 3420 tttaaaatat ttcaataaaa gttaaatgtt attttataaa aaaaagtttc ctatttatgt 3480 tgaagtatta taatataaat tttaattttg atttagttgt gttcttatgt tatttaatta 3540 tattatattg ttatgaaata attatatttt aatcagagga aaagtgattt tagtttataa 3600 tgttaacaat tgaatttaaa aaaagtatta taaaatgtta tatataatgt tataaatata 3660 tttcaataaa agtaaatata ttgtattagt atttataaaa aaattttgat gaaatatgtt 3720 caaaaataaa gacacagaaa aacaatttta ccaaaaatcc atgttaaaaa ctttgttttg 3780 catctcctga aaaatctact ttttgtgact tgcgttagtt gacttctgcg gtacag 3836 // ID R1-1_BM repbase; DNA; INV; 2930 BP. XX AC . XX DT 25-APR-2010 (Rel. 15.04, Created) DT 25-APR-2010 (Rel. 15.04, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-1_BM. XX NM L1A_Mim; LTR6_MD; LTR86_MD. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-2930 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(4), 625-625 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. XX FH Key Location/Qualifiers FT CDS 39..2918 FT /product="R1-1_BM_1p" FT /translation="MTGVLRFLQGNLNHCARAQDLLFQSMAEGLTHLAVVA FT EPYRVPSSPDWAADLEGRVAIIRRCCVGAPPRFXVVERGRGFVAVLWAEVF FT VLGVYFSPNRTLAEXXVFXSELSRVVGRSHSRRXLVLGDLNAKSLXWGSSR FT TCPRGRAVEERLVGSGLLVLNRGAELACVRRLGGSVVDVTFATPDVANRVR FT GWAVMVGEETLSDHRYIRFGVAAPPAESIRGSSLLCGGGAGGPRWAQKRLN FT VERLCEAAVVQAWRLDSLGEPADVCEGVEHLREAMSRVCDAAMPRIRALAP FT KRRVHWWTEEIASQRRLCDVSRRAYQRYRRRRTRRDPDEEDRLYEVYRTAI FT RALRLAIGEAKEAAWNDLLASLDRDPWGRPYRLARNALRRWAPPATSTLPP FT ETLQRVVGGLFPDFTGTAFVPPVMTTARVADGGEPEDVSQAEFDSAVQRMR FT AKRTAPGPDGISSRAWALALTGDGLGPALRGLFSXCLREGRFPEPWKTGRL FT VLIPKEGRPRDEPSGYRPIVVLDEAGKLLERVVASRLVQHLESVGPDLAPN FT QYGFRRGRSTVDAVLRVRHLSDRACSEGGVLLAVSIDIANAFNTIPWSTIV FT ESLRFHRVPPSLRTLIEDYLSGRNVVFPERRGWGRKAVSCGVPQGSVLGPL FT LWDIGFDWVLRGASLRGVDVVCYADDTLVTARGADYRAAAILATAAVSTVV FT SRIRRLGLEVALRKSEAVCFHPVRRGPPPGANLIVGGVSIAVQPKLKYLGL FT VLDSRWRFDHHFGELVPKLLGMAGALARLLPNVGGCSAGVRRLYLGVVRSM FT ALYGAPVWSPALSARNAALLLRAQRALAVRVIRGYRTISREVACALAGSLP FT WDLEAEVAAAVYRRRTQSLSRGRTPGPSAVGRWRRAARHLAYESVVVLRGK FT CFARLFQGVVSREKLGEVKDLGSSLLPFFLSGGAGVRHXPVRYPPRPGIRK FT WIPQR" XX SQ Sequence 2930 BP; 426 A; 886 C; 1020 G; 580 T; 18 other; gctcggacgg gctgagagac atcgaatggc gccctgctat gacaggtgtc cttcgattcc 60 tgcaggggaa tctcaaccac tgcgccagag ctcaggacct tttgttccag agcatggcgg 120 aggggttgac ccatcttgcg gtggtcgccg agccgtatcg ggtcccttcg agccccgatt 180 gggcggccga tttggagggc cgagtggcna tcatccggcg ttgttgtgtg ggtgctccgc 240 ccaggttcgn cgttgtcgag agaggtcgcg gcttcgttgc cgtcctctgg gccgaggtat 300 tcgtgttggg agtgtacttc tccccaaaca ggacgctcgc cgagttngng gttttccnna 360 gcgagctcag ccgcgtcgtt gggaggtcgc actcccgacg gatnctcgtt ctnggngacc 420 tnaacgcnaa gtcattgnct tggggntcct cnaggacgtg ccccagaggt agggcggtgg 480 aggagcggct ggtcggaagc ggtctcctcg tcctcaatcg cggcgcggaa ctcgcgtgcg 540 tgcgacgttt gggcgggtcc gtggtcgacg ttacattcgc cacgcccgac gtggcgaatc 600 gcgtacgcgg ttgggccgtg atggtcggcg aggagacgct ctccgaccac cgctacattc 660 gtttcggtgt cgcagcgcct ccggcggagt ctatccgggg ctcttccttg ctctgcggtg 720 gcggcgcggg gggtcctcgt tgggcccaga agcgcctcaa cgtcgagagg ttgtgtgagg 780 cggccgtagt ccaggcatgg cgtcttgact cgcttggtga gccagcagac gtgtgcgagg 840 gggtggagca tctgcgcgag gcgatgtcgc gggtgtgcga cgccgctatg cctcgcatta 900 gagctctcgc tcctaaacgc agagtccact ggtggaccga ggagatcgcc agccagcgcc 960 ggctgtgcga cgtcagtcgt cgcgcatacc agcggtatcg acgacgaaga acgcgccggg 1020 accccgacga ggaggaccgc ctgtacgagg tgtacaggac ggcgataagg gccctgcgct 1080 tggctatcgg ggaggcgaag gaggccgcct ggaacgacct actggcctcg ctggaccgtg 1140 acccgtgggg gcggccctac aggctggcgc gcaatgcgct ccgccgttgg gccccccccg 1200 caaccagcac cctgccgccg gagacattgc agcgggtagt cgggggtctg tttccggatt 1260 ttaccgggac ggccttcgtc ccccctgtga tgacgacggc gcgggttgct gatggcgggg 1320 agcctgagga cgtctcccag gcggagttcg attcggctgt gcagaggatg cgggcnaagc 1380 gcacggcncc cggtcccgat gggatctcgt cccgagcgtg ggcgctcgcc ttgacgggcg 1440 acggcttggg gcctgccctc cgagggctgt tcagcnggtg cctccgtgag ggcaggttcc 1500 cagagccatg gaagactggt cggctcgtcc ttattcctaa ggagggccgg ccacgtgacg 1560 agccgagcgg gtaccgccca atcgtcgtgc tggacgaggc cggtaagctc ctcgagcgcg 1620 tcgtcgccag tcgcctcgtc cagcacctcg aaagcgttgg gcctgacctg gctcctaacc 1680 aatacggttt ccggagaggt cgctccaccg tggacgcggt cttgcgcgtc cgccacctct 1740 ccgatcgtgc gtgctccgag gggggcgtgt tgttggctgt gtcgattgat atcgccaacg 1800 ccttcaacac gatcccttgg agcacgatcg tggaatcgct ccggtttcac cgcgtccccc 1860 ctagtctccg caccctgata gaggattacc tctcagggcg aaacgtggtc ttccccgaga 1920 ggagggggtg gggacggaaa gcggtgtcgt gcggggtccc gcaggggtcg gtactgggac 1980 cactcctgtg ggacatcggt ttcgactggg tcctgcgcgg tgctagcctg cgtggcgtcg 2040 acgtagtgtg ctatgccgac gacacgctgg tgacggcccg cggagccgac tacagagccg 2100 cagcgatcct tgcgacggcg gcggtctcca ccgtcgttag tcgcattcgg agattaggtc 2160 ttgaggtggc cctccgcaag tccgaagcgg tgtgctttca cccggtccgg aggggacctc 2220 ctccgggagc gaatctcata gtcggcggag tatcgatcgc tgtccagccg aagctcaaat 2280 atttgggcct cgtgctggac agtcgatggc gcttcgacca ccactttggt gagttagtcc 2340 cgaagctgct ggggatggcg ggcgcgctag cccgtcttct ccccaacgtc ggtggttgca 2400 gcgccggcgt ccggcgtctg tacctggggg tcgtgcgcag tatggctttg tacggcgctc 2460 ccgtgtggtc gcccgcactc tccgcgcgca atgcggcttt gctgctacga gcgcagcggg 2520 cgctcgcggt gagggtcatc agggggtacc gtacgatctc ccgggaggtc gcctgcgccc 2580 tcgccggttc ccttccttgg gatctcgagg ccgaggtagc ggctgcggta taccggcgca 2640 gaacacagtc cctgagtcgg ggacggacgc ccggcccgtc ggctgtcggt cggtggaggc 2700 gtgctgcgcg tcatctggcg tacgagtcgg tagtcgttct gcggggcaag tgcttcgcgc 2760 gcctttttca aggcgtagtc tcccgtgaga aattggggga ggtgaaggac ctcggctctt 2820 ctcttctccc cttcttcctc tcaggaggcg ccggagtccg acatnacccg gtccgttacc 2880 cccctcgacc gggtatccgt aaatggattc cccagcgtta aaaaaaaaaa 2930 // ID BEL2-LTR_Dya repbase; DNA; INV; 310 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_Dya; KW BEL2-I_Dya; BEL2-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-310 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1015-1015 (2009). XX DR Genome; chrU; Positions 7629070 7629379. XX SQ Sequence 310 BP; 86 A; 87 C; 49 G; 88 T; 0 other; tgtttactcc agctcccgtg tgccagttcc acgcgtccct aaatggagaa gggaatccac 60 ccactctctt ttgaaaaagg gagtattctt ccataaggga ccgctggcca cacctttttt 120 tttttctcac ttactttttt tctacaagaa cacgaagaca cgattttgga gcaacctttt 180 ttatacacat ataaccactg aataaagact attttttgga gaaaccccaa ccctcgccgc 240 tgcgttttta attcaagtca gaacaatcca gttctcagcc gcaacaaccc ccagcagagc 300 cattactaca 310 // ID BEL-17_AA-LTR repbase; DNA; INV; 651 BP. XX AC supercont1.330; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-17_AA_; KW BEL-17_AA-I; BEL-17_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-651 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.330; Positions 296156 296806. XX SQ Sequence 651 BP; 234 A; 112 C; 107 G; 198 T; 0 other; tgttgcgtac accctagttc tgacgccgca cctaccattg aagttcgttg atcagcatac 60 ctgcaacagc attttacgcc gagctaagtt gctttcaatg tgcccggcat cgctatagaa 120 ttgtcaaaag gaatttaaaa gctcacatgc ggagttcaga ttaggtgtct tatcttggtt 180 gaagttattt cttaaaaact agtgaaatta gactacagtt agataaaact taaatactat 240 tgtgctgaaa tcgaaacctt atactataca tggtaaaata acccattata caagttgaac 300 ttaaaactaa agaaattatc taaatattat aggttagtta gaaaaagtgg ttggaaacta 360 catacacaga actaaatgaa ttgattaaac tctaaggtaa atagtgacta tatacagtta 420 gacgtcataa ctaatgattg tgcttaaaaa ctaggagttc tgttcttcac tgatttcgct 480 gcagaataat ttatacccaa gtcttaggaa acgtacaaac gtaagttccc tgaaaactaa 540 cccaaaccac aaaaaactta ttatgtacta tgttacagga aaattttaat tgacgagtca 600 ataaatacct caattaaagt tttggaaagt gtttccttcg ttgcataagc a 651 // ID REP-3_CQ repbase; DNA; INV; 442 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A repeat family from Culex quinquefasciatus - consensus. XX KW Repetitive element; nonautonomous; REP-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-442 RA Kojima K.K. and Jurka J.; RT "Repeats from the southern house mosquito."; RL Repbase Reports 11(1), 606-606 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. No TIRs. XX SQ Sequence 442 BP; 157 A; 64 C; 78 G; 143 T; 0 other; aaaatagtag tttatgcaac aagttgcaaa aagaggattt tttcagcacg agtcgtacat 60 ttatccaacg aggttcaccg agttggataa atacgaagag tgctgaaaaa atcaagtttt 120 gcaacgagtt ccatacaaca ttttttgcaa ttccgaaaaa cacccattga gtgaaatttt 180 aagtcaaatt ttcatgtatt ttgtcaataa atcgtttaaa tcaaaaaaat gttgaaaagt 240 gttacttttc gaaacaagtg ctgaaaagtt caacttttca gcacccattt cagtgctgaa 300 aagtagaact tttcagcatt tattttgaaa agtgttgcta ttcgattctg ttatttttgg 360 tacagaaaag taggctattt cgtcgttcaa gaatgacagg aaaagtaagt agtttcacga 420 cggaattgca aaaaatattt tt 442 // ID CR1-24_BF repbase; DNA; INV; 3062 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-24_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-24_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3062 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3062 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1595-1595 (2009). XX DR [2] (Consensus) XX SQ Sequence 3062 BP; 860 A; 815 C; 659 G; 728 T; 0 other; atgttccaag gtgtgattgc cgttgtactt acaacagtca tgtttacaat tggtagctca 60 aatgtccgta gcttgaatcc acacacggcc tctggtcact ataagacctc agaactcgaa 120 gagattctcc gtaccaacaa aatctctgta atgggcatca ctgaaacatg gctacaccat 180 ggtattctcc tcaacatccc atggtatagc acaatacacc gtaaggacag acagggcaaa 240 acaggaggag gcgtcgccat cctcattagt gacactctac ccagtaaacg tagagttgac 300 ttagaaacgt gtgattctaa ctgggaagac gtatgggtcg aggtcactgt ggcgggcaac 360 aaagttcttg tctcatgtgt ctacagacca ccagttgccc cggatagctt ctttgaccag 420 ttcgaaacct ccatctctaa agttgctacg gacaaaggca ctattgttat taccggggac 480 tttaattgtc accatacgga ctggggtgat acatcaactg acaataatgc acttaacctt 540 gtagatatca tagacaggta cggcctgtac cagtgtcaac accagccaac ccgatgtacc 600 cctacatgcc aaagtacact tgacctcatg atactgtctg acccgtctag aatcgacgac 660 atgtctaccc tagcacctgt cggaaattct gatcatgctg ttataacttg tagacttaac 720 ctatctattc ctactgtgca ccctactaga aaacatgttt ggtactataa ccgtgcggat 780 tttgatgtat ttaggtctga actaagtaag atcaattgga attcatgtat gcatggctct 840 attgatgaga gatggaatag ctggaaggct aagttcttaa atgttgcaaa aaacaccatc 900 cccaacaaga acatcaaggc tacagctcca cgtaaaccct gggtaacctc tgaattactg 960 gaggctatcc aacacaaaac cgaactgtac aaaactttta cgtcttcgcc atccaatgaa 1020 aattggaaaa gttacacaaa ggccaaaaac cgcctgacta aagaccttcg ccaagccgag 1080 gctggatact acgctgccgt tagcgagaga cttaaaaccg cggaaggtgc tcgtcagttc 1140 tggtctgttc tcaaacaggc aactgggaag ggcaagtctg caatcccggc tctctctgcg 1200 aacggctcag tggtggctaa ggacaaggac aaggctgaac tgctaaacga catcttcgtc 1260 aacgttacaa gtgaagcaac ccaccctgac tgtaccagac gacttcctcg gttcaccaat 1320 aaagaactaa acacaatcgt agtgtcggtg gaggaagtcg aacgcaccct acaaacactc 1380 cagtccaaca aggcccctgg tcctgacggg atttcaaaca gactgcttaa ggaggcagca 1440 cctatcatct gcgcctccct gtgcgaactg tttaattttt ccttgtcaac cggccagttc 1500 cctttagagt ggaaaaggtg caacatctca ccggtctaca agaagggtga tcgcactaat 1560 ccctccaact accggcccat cgccttactt tctacagttc ccaaggtctt ggagagactt 1620 gttcataatc acttgtacgc ttaccttatg aacaacaacc tgattaacgt caaccagtca 1680 ggcttcaaga aaggagacgg aacagtactg cagctcatgc ggcttgttga cgaatgggcc 1740 aagtccattg acgatcccaa cattgcctgc actgccgctg tcttcctgga tgtccgtcga 1800 gcattcgaca ccgtgtggca cgatggtctc acctacaagc tctcccgtta cggtgttaag 1860 ggagccctca acatgtgggt tgcagactac ctcacgggcc gacaacaacg agttgtgatt 1920 aacggcgtgg cgtcctcctg ggggcacact aaggcaggag tcccacaagg aagtatccta 1980 ggaccgctcc tattccttgt gtacattaat gacatcaaag acctaccttg ttcctcaaac 2040 atcaattgct tcgctgacga cacgtcgctt tccaagtccg gaccgacagc tcaggaagtg 2100 gccagcacaa caaacactga ccttcaacat gtctccacct ggttcgttga ctggggcctt 2160 gagctccatc cggacaagtg caaggtcatg tgtataaagt cccctcgaag taaagtacag 2220 ctacctacaa tctacatagc tggacagata gttgaacaag tgccgttcta tactcacctg 2280 ggcaccacca tccaccagac cttacgttgg actgaacacg tccagactac atctaacaag 2340 gccaggagaa ctctgggatt cctgtggaaa ctcagaggca agctatcccg ggaggcacta 2400 gagatggcgt acaacacgat ggttcgacca aaactggagt acgcctctgt actgctaggg 2460 gacctggctt cttcatccag caagatgttg gagcgagttc actaccaagc cgcctgcctt 2520 gtcaccgggg ccgctagacg aacaccatca agcgtcgtca tgcaagaact tggatgggac 2580 agcctagcaa ctagaagaca ttcccagtcc atggttctca tgtataaact ggtgaatggc 2640 ttagtccctc ctcacctgca accgctgata cccaccacaa gaggcgagca cagaaccaca 2700 cgcctacggc ttcgcaactc tacccatctc cacataccac gctgcaggac ccagacttac 2760 cagtcaagct tcattccgca cgcttcccgc ctctggaacg atcttcccca ggcaataaag 2820 caagctggta gtctcagatt gtttaaacga agactggcaa actggactga ctgattgata 2880 cccagattta tagattgtat tgtaatttta aattgtatgc aatgattgta ttttaacaat 2940 tgtacaaatt gatatgtatg tatgtattca tgtgtgggcc acgacaccag catatcgctg 3000 cctgtgtccc acgtgtattg tattgtttgc aattccaata aatcaaatca aatcaaatca 3060 aa 3062 // ID Gypsy-10_TCa-I repbase; DNA; INV; 5085 BP. XX AC singleUn_1004; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_TCa_; KW Gypsy-10_TCa-LTR; Gypsy-10_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-5085 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; singleUn_1004; Positions 12014 6930. XX CC Positions [2260-2685] - Reverse transcriptase CC Positions [3838-4311] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 184..4674 FT /product="Gypsy-10_TCa-I_1p" FT /translation="MSEEQRGDEEGITLTMDEDIDLEEAAPRTISATTQVT FT SETLDNPIARTLEQLLGSELKSQIEKAVKRQAALILGEITDTQAAGTEGAE FT VTLRNRIAEMGASAQQSPATGSDCSTVRSWKIGAEIVDYFDPDDADCNIER FT WLAKIDQLGQVHGWSELEKAGIMQSRLMGAAKAWFNHLGEYNLTWDSWKAR FT LRMAFPRRQDFAITLEELVQRQKLPGEVMTKYYHEKLALCDRVGISGENAV FT AVIIRGLPKELRANAYATRSTTPEALYNNFLVGLEFYEYRGATAKAERVTR FT RESTSSGQAMKRRSDQPVATTKAKSALIRCFNCQRYGTHKSRECPYERRER FT CRKCGRPGHEASTCNPQADSTAVKGANPQVRILQLGLENTYKKMGTACGAQ FT IRVYIDTGSEHNILAWSWAQGRNLNIKPGEWILQGFAGGQGVAIGEATFDI FT KVDEVTLSITALITKCDLGRIELILGQPAISTSGVALVVRGDKACVIRDED FT LFEVFKNLTLETDRPRPRILVKEDTVIPAGLSLQMGVTITDCEIGEIVSMA FT PVTFQSKDDMVAIPGGVLKCRDDNKILFTNLAPRTLVWKAGRLVTKAERCK FT EGVPVTKVMNIQQVSACFDLSSVTVGEIGDYYKQQLFDLLRSMSCCFSAND FT EDLGLTQLGSMSINLTSDIPVYYRPYRLSHSERAVVRQKVQSLLDGDVIRE FT SNSNYASPILLVPKKTGEMRLCIDYRALNAITVKDRYPLPLIADQLDRLAG FT NRYFTSLDLAQGYHQVPMHSDSIHKTAFVTPDGHYEYLRVPFGLANSPAVF FT QRIINQMLGNMRHDQVLAYMDDLLIPSVDVATGLELLRKVLELIRDAGVKL FT KLAKCSFLQEKIDYLGHEISVEGIRPGQRKVDAVLKFPEPSDVHSVRQFMG FT LASYFRKFIRNFALLAKPLTNLTKKDVEWRWGEEQQDAFQRLRHLLSERPV FT LAAYNSSFSTELHTDASKIGVGGILLQRQPDGELKPVFYFSRTTTPQEQVY FT HSYELETLAVVESIKRFRVYLLGTQFKVVTDCSAVRATFLKKDLVPRIARW FT WLAIQEYGMTVEYRPGVRMQHVDALSRNPVQISIVHADEADWFLTVQLQDP FT KAQELVTVLTTGVTPRDVKAVYKVKNGRLYRITPNGDRLYVPATARFTLAH FT KHHDEIGHPGYNRALTLMKETYWFPKMARFMLKYVGSCLRCAYGKGDYGKA FT EGRLHPIKRLPIPLDTVHVDHLGPFMRSSKGNSYLLMAIDGFTKFVWAKPT FT RTLRSTEAIEKLRDIFGVFGYPRRVITDRGLAFQSKAFGDFMAEKGIQHIQ FT NAIATPRANGQVERPNRTIEEALTCSADSENRWDDGLPEIVWGLNNTVNAS FT TRFSPAQLMFSHRRGVIANLADNVVQATLNSIENGETGANLVKHGEAVPHG FT EASTTPGTGCSANAQGGAGLSQHAGPSSDDSGYPVLAEALSEELQANRAEA FT ERNLKRTALQMKSRFDKRRKVATLYKVDDLVL" XX SQ Sequence 5085 BP; 1458 A; 986 C; 1444 G; 1197 T; 0 other; aatcagaagt gggattattt cacgtgctag ttggtgaaat ttgagtggcg taattgaaag 60 tgcggcgaaa ttagcttcaa ggacttgctc gggtgaaaaa cgtcagcgca aaaacggcga 120 taagaaaagt cgactgttac tttgggtgtg gcaagacgaa tttcggcgaa aagtgttcta 180 aaaatgtctg aggagcaaag aggcgacgaa gagggaatta ctttgacgat ggacgaagac 240 atcgacttgg aggaggcggc cccaaggacg atttcggcga caacccaggt gacgtcagag 300 accttggata acccaattgc gcgtaccctg gagcaattat tgggatcgga gttaaaatcc 360 caaatcgaaa aggcggtaaa aaggcaggcg gcactaattt taggcgaaat tactgatacc 420 caggcggcgg gtacagaggg ggcggaggta accttacgca ataggatagc tgagatgggc 480 gcgtcggcac agcaatcgcc ggcgacaggc tctgattgtt ctacagtgcg gagctggaaa 540 attggagccg aaattgtcga ctattttgac ccagatgatg ctgattgcaa tatcgagagg 600 tggctcgcaa aaattgacca gctggggcaa gttcacgggt ggtcagaatt ggaaaaggcg 660 ggcattatgc aatcacgatt gatgggggcg gcgaaagcat ggtttaatca tttaggcgaa 720 tacaatttga cctgggattc ttggaaggcg agactccgaa tggcgtttcc tcggcgacag 780 gactttgcca ttactctcga agaattagta cagcgtcaaa agttaccagg cgaggtaatg 840 acgaagtact atcacgagaa gttggcgcta tgtgacagag tcggcatttc tggcgagaat 900 gctgtcgctg tgatcataag ggggctgcca aaggagcttc gtgcaaatgc atacgcgact 960 agaagtacca ccccagaggc gctgtataac aactttcttg tgggactgga gttttatgaa 1020 tatcgtgggg cgacggcaaa ggctgaacgg gtgacacggc gtgagtcaac ctcctccggc 1080 caggcgatga aacggcgatc ggaccaacca gtggcaacaa caaaggcgaa gtccgcatta 1140 attcgctgtt tcaactgtca gcgatacggt acccacaaga gtcgagagtg tccttacgag 1200 aggagggaga gatgtcgaaa atgtgggaga ccaggacatg aggcgtcaac ctgtaacccc 1260 caagcagact ccacggcggt gaaaggcgcg aacccacagg tgagaatttt acaattaggg 1320 ctagagaaca cttataaaaa gatgggtacg gcatgcggcg cgcaaattcg tgtttatatt 1380 gacaccggga gtgagcacaa cattttggcg tggtcttggg cgcaagggcg taatttaaac 1440 ataaaaccag gcgaatggat actacagggg tttgcaggcg ggcaaggtgt agcgattggc 1500 gaggcgacat ttgacattaa ggttgatgag gtaacattga gtatcacggc gttgataacc 1560 aagtgtgatc tcgggcgaat cgagctcatc ttgggtcaac cagcaattag tacctccggc 1620 gtagcacttg tagtgagagg cgataaagca tgcgttatca gggacgaaga tcttttcgaa 1680 gtttttaaga acttaacctt ggagacggac cgaccaaggc cacgaatcct cgtgaaggag 1740 gatacagtta tcccggcggg gctgtcactc caaatgggcg ttacaataac tgattgtgaa 1800 ataggcgaaa tagtttcaat ggctcctgtg acgtttcaaa gcaaggacga catggtggcg 1860 atacccggtg gcgtgttaaa atgccgagac gacaacaaaa tcctcttcac aaatcttgca 1920 cccagaacac tggtttggaa ggctggacga ttagtgacaa aggcggagag gtgcaaagaa 1980 ggagttccgg tgacaaaagt catgaacatt cagcaggtga gcgcatgttt tgacttatct 2040 agtgtcactg taggcgaaat aggtgattat tataaacaac agctatttga cttattgcgt 2100 tctatgtcgt gttgtttttc ggcgaatgat gaggatctag gtttaactca gttagggtcc 2160 atgtcgataa atctaacgtc agacattcct gtgtattatc gtccatacag attgtcacat 2220 tcagaacgag cggtggttcg acaaaaggtg cagagtctgc ttgacggcga tgtcatcaga 2280 gaatctaatt caaactatgc gagtcctatt ttgttggtcc ctaagaaaac aggcgaaatg 2340 cgtctctgta ttgattatag ggcactaaat gcaattacgg tgaaagatag gtaccctttg 2400 ccactgattg cggatcagtt ggatcggttg gcgggaaacc gatactttac atccttggac 2460 ttggcgcaag gatatcatca ggtaccaatg cattctgatt caattcacaa aacggctttt 2520 gtaacacctg atggacatta tgaatacttg cgagttccat ttgggttggc aaattctccg 2580 gcggtatttc agagaattat caaccaaatg ttgggcaaca tgcgacatga tcaggtcttg 2640 gcgtacatgg atgatcttct gataccttca gtagatgtag cgacggggtt agagctactt 2700 cgaaaggtgt tggaactaat tagagatgca ggcgtgaaat tgaaattggc gaaatgcagt 2760 ttccttcaag agaagatcga ttatttaggc catgagatca gcgttgaagg gatacgacct 2820 ggtcaacgca aagtggatgc agttctaaag ttcccagaac cctcagacgt acatagtgtt 2880 agacagttta tgggtttagc tagttacttc agaaaattca ttcggaattt tgccctgttg 2940 gcgaaacctt tgacaaatct cacgaaaaag gacgtggagt ggcgatgggg agaagagcag 3000 caggacgctt tccagagact tcgtcacttg ttgtctgaga gaccggtttt ggcagcttac 3060 aacagcagtt tttcgacaga gttgcataca gatgcaagca aaattggcgt gggaggaatt 3120 ctgctacaga gacaaccaga tggtgagcta aagccagttt tctacttcag tagaactacc 3180 acacctcagg agcaggtata tcatagctat gagttagaaa ccttggcggt ggtagagtca 3240 attaaacgtt ttcgggtgta tttgctgggt acacaattta aagttgtaac tgattgttca 3300 gcggtaagag caactttctt aaagaaagac ttggtccctc gaatagctcg gtggtggcta 3360 gcaatccaag agtatggcat gacagtagaa tataggccgg gcgtacggat gcagcacgtt 3420 gatgctttaa gtcgaaatcc tgttcagatt tctatcgttc atgcagatga ggcggattgg 3480 tttcttacag ttcaattaca agatcccaaa gcgcaggagc tggtcactgt attgacaacc 3540 ggcgtaacac cgcgagatgt aaaggcggtg tacaaagtga aaaatgggcg tctgtataga 3600 ataacaccaa acggcgacag actttatgtg ccagccacgg cgaggtttac tttggctcac 3660 aaacaccacg atgaaatagg gcatcctggt tacaataggg cgttaacatt gatgaaagaa 3720 acatattggt ttcctaaaat ggcgcgattc atgttgaaat atgtaggatc ctgtttgcga 3780 tgtgcctatg gcaaaggtga ttatggtaag gcggaagggc gactgcatcc aataaaaagg 3840 ttacccattc ctttggatac agtacatgta gatcatttag gtcctttcat gagaagtagc 3900 aagggcaaca gttacttgct aatggcgata gacggattta caaaatttgt ctgggcgaaa 3960 ccaactcgaa cactgagatc tacagaggcg attgaaaaat tgagagatat ttttggggtg 4020 tttgggtatc cgcgaagggt aatcacagat cgtggattgg cgtttcagag caaggcgttt 4080 ggtgatttca tggcggaaaa agggattcag catattcaaa atgcgattgc tacccctagg 4140 gcgaatggtc aggttgaaag acccaaccga accattgaag aggcgttgac gtgcagtgca 4200 gattccgaaa atcggtggga cgatgggtta ccggagatag tttggggttt gaacaataca 4260 gttaatgcga gtactcgatt ttctccggcc cagttgatgt tctcccacag gcggggcgtt 4320 atagctaatt tggcggataa tgtggtacag gcgacattaa attcgatcga aaacggcgaa 4380 accggcgcga acctagtaaa acacggcgag gcggtaccac acggcgaggc gagcacaaca 4440 cctggtaccg ggtgcagtgc gaatgctcag ggcggagcag ggcttagtca gcatgcggga 4500 ccttcgagtg acgactcggg gtacccagtt cttgctgaag cactcagtga agagctacag 4560 gcaaataggg ccgaggcgga aaggaatctg aaaaggactg cactacaaat gaaatctagg 4620 ttcgataaaa ggcgaaaggt tgcaaccctg tataaagtag acgatttggt cttgtgacgt 4680 cagtcggcga ctggttgttt ggatccaggt actaatagaa agttggcgaa taagtacgac 4740 ggaccatata gagtgtcacg tcggcttggt aacgacaggt accagatcga ggcgattaaa 4800 ggtatgagag ggtacaaacg ttttaaggcg gttgtggcgg tggattcact ccggcgctat 4860 tgcagtactg ctcaggaggc gaatgacaag gcggatggcg aggaaagcga cagtggcgag 4920 gaaattgatc ggttagatct gatcgatctc ctcgagaatt aggctacgaa ttaatgagga 4980 cattaatttt gcaggaaggc cgaatgttac gaccaaaatc agccaggcgc tgattaagga 5040 cacggcggga atcgaggcgg cgcgatagac acaacaacca ggcga 5085 // ID Gypsy-19-I_NVi repbase; DNA; INV; 5914 BP. XX AC . XX DT 16-APR-2009 (Rel. 14.04, Created) DT 16-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-19-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5914 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 775-775 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 109..1179 FT /product="Gypsy-19-I_NVi_1p" FT /translation="MRVSWIYRLNKEQLISELRGKNLNCEGTVVELRQRLV FT XYVRKNPQDFVGKAEDPTGYDEEADLKQDLELERYELIQRTLEETRNAGRS FT STPTGDPPEIPPGENKVLDQIRKWNCHFEGRDVYSFLERVRELQASYGFTG FT EQLLQGAPELLRGEALQWRRNYASDCRTWEELEIRLRNFYLSSGERRNLTR FT QVAERIQKPNENIRSYCNALTTLMRRRGGYTLIEQLDNIYYNMKPELQLRV FT RREEITDVPQLIQRIEEYEDIAAKLREQERKTAINSASTSKNTTDYYNRAE FT CCWRCKQRGHSRADCKNVARKFCSVCGKDGVLTRDCHGFPDRSGNAPRDGS FT TVAGDRPKTADTS*" FT CDS join(984..4178,4182..4691) FT /product="Gypsy-19-I_NVi_2p" FT /translation="MLLALQTTRPFPRRLQKCSTEVLFRMRKGRRSHKRLP FT RIPGSFGKRAKGRLYSSGRPSQDGRYLVNVQINQDTVLALVDPGAVSSFIT FT PETAMIGQNNGWRHGKQETIATLADGSTTELTDYVEGKATTLGATFRNKFI FT IMKGLRHEMLLGMDALRKLNIRLIIAGKELQPNRSNTSVLSVSDVGIQEIS FT KDQQTRVDELIQYEKELFGXIQGPTPLIEYTIKVTESNPIKQRYWPRNPAM FT QQIIDEAVEEMLQEDVIEPSDSAWSSNIVLAKKKDGKRRFCINFQGINKIS FT IKDAHPLPQVHATLNKLRGAKYLTTIDLKNGYWQIPLSEESKKYTAFTVPG FT RGLMQFKVLPFGLHSAPAAFMRLMNAVITPDLEPNVFCYLDDIVIVTEDFE FT QHLQLISETFRRLRDAKLKPNWEKSHFCRKQLKYLGHVVNEEGLHTDPEKV FT RAITDLQPPTNLKELRRFSGLISWYRRFIPQVAKKTAPLNRLLKKKVKWEW FT GPDQQEAFERLKEDLVQAPVLACPDFAKTFVLQTDASNEGLGAVLTQEEEG FT KERVIAYASRSLNQAERNYSATEKECLAIKWGIWKFREYLEGYHFVVLTDH FT QSLKWLEKIDNPSGRLARWAMELAQWDYEVKYRRGKDNTVADALSRQPVAV FT CNIDIKTNCTWYTKKLTAVRQDPKTYPEYCIKNGALHRHILHTLDFNDTDA FT TEDWKICIPTEKRTEVLRECHDDLTAGHLGVAKTLQRLARRYYWPGMLQDG FT ARYVRSCINCQTYKAQQQTKAGTMHATNVEQPWEMVSIDLVGPQPRSNKGN FT IWLLVMQDRFTKWVELAALKKADSRAIIRHVKEKVFLRHGCPNTLVSDNGR FT QFVSREFEEFLKENGVNHRKTPPYSPQCNPVERTNRVVKTMIAQYVGKHQK FT KWDENLRELEFAYNTASSTATGYTPAYLNTGREFRTPGSLHQKVGSQHTTP FT LSSRIQHIKDAIELAKAQMARQFEKQQKHYNLRRREWKPNIGDKVVRKLVT FT LSSKADARNAKLAPKYSKPLTVKRQVSPVIFDLQDDRGRYVKHIHVKDLKP FT FTDAEQELMRQRRKTTVSRQTNAKKDGQPASQKRPRESHTDVGRDIGIRTR FT VSRHHRHRSPNLHPEPRGRGILSGTWGRSNNSRVSNRASFRQPYQQHHPKD FT STSPNKNKGLRSDESLGRSGRATTTLDHRHECPRVRSCSNFYEPSQPGEKE FT PEAKAQQAPGTKTTMAHYRRRNRE*" XX SQ Sequence 5914 BP; 1867 A; 1481 C; 1450 G; 1104 T; 12 other; tggcgctcgg acagggacct gattaagaat atttcgccta aaattttttt ttacgataaa 60 ttgtacacag aacatcgtct cgaactaaaa ccggggattt tttttacgat gagggttagt 120 tggatttacc gcctcaacaa ggagcagctt atctccgaac tacgggggaa aaacctcaac 180 tgcgagggta ccgtagtaga gcttcgccaa cgactagtty gatatgtccg aaagaatcca 240 caggatttcg tcgggaaagc agaagatcca actggatatg atgaagaagc cgatttgaaa 300 caagacctag aactcgaacg ctacgaatta attcaacgaa cgcttgaaga gacccgcaac 360 gccggacgct catccacccc gaccggagat ccaccggaaa taccaccagg agaaaacaaa 420 gttctcgacc agatacgcaa atggaactgt cactttgagg gaagagacgt ctactctttc 480 ttagaacggg tccgggaact acaggcatcg tacggattta ccggtgaaca gctactgcaa 540 ggagcccctg aactgctgcg aggagaagcc ctgcaatggc gacgaaatta cgccagcgac 600 tgtcgracct gggaggagct cgaaatcagg ttaagaaatt tttatttatc gagcggagaa 660 cgacggaatc taacaagaca agtcgccgaa aggattcaga aacccaacga gaacataagg 720 tcttactgca atgcattgac gaccctgatg agaagaaggg gaggatatac gttgatagag 780 caactcgaca atatatatta caatatgaaa cccgagttac aattgcgggt gcgcagggaa 840 gagataacag acgtacctca gctcatacag cgcatcgaag aatacgaaga tattgctgcc 900 aaactgcgcg aacaagaacg aaaaaccgcc atcaatagcg cctccacrag taaaaacacc 960 accgactatt ataacagagc cgaatgctgt tggcgttgca aacaacgcgg ccattcccgc 1020 gccgactgca aaaatgtagc acggaagttc tgttccgtat gcggaaagga cggcgttctc 1080 acaagagact gccacggatt cccggatcgt tcgggaaacg cgccaaggga cggctctaca 1140 gtagcgggag accgtcccaa gacggccgat acctcgtaaa cgtacaaata aaccaagata 1200 cggtgctggc attggttgac ccaggggccg tgagttcgtt cataacgcca gaaaccgcca 1260 tgatcggaca aaacaacggt tggcggcatg ggaagcaaga gacgatcgct acactggcag 1320 acggctcaac caccgagctg acagactatg tggaaggtaa ggcgacaacc ctcggcgcca 1380 ccttcaggaa caaatttatt atcatgaaag ggctacgcca tgagatgcta ttggggatgg 1440 acgcgttacg caaactgaac atccgcctga tcatagcrgg aaaggaatta caacccaacc 1500 ggtccaatac ctcggtactc tcagtcagcg acgtcggcat tcaggaaatc tcgaaagatc 1560 aacagacacg agtggacgaa ctcatccaat acgaaaaaga gctcttcgga artatacagg 1620 gccccacacc gctcatcgaa tacacaataa aggtaaccga atcaaatcca atcaaacaac 1680 gctactggcc acgcaacccc gccatgcaac aaataatcga cgaggctgtc gaagaaatgt 1740 tacaggaaga tgtgattgag ccatccgaca gcgcytggag ctccaacata gttctcgcaa 1800 agaagaagga tggaaagagg cgtttctgca tcaacttcca gggcatcaac aaaatcagca 1860 tcaaagatgc acacccgcta ccacaagttc acgcgacact aaacaaactc cggggagcaa 1920 agtacctaac cacgatcgac ctgaaaaatg gatactggca aataccgctg tcagaggaaa 1980 gcaagaagta caccgctttt accgttcccg gaagaggctt aatgcaattt aaggttcttc 2040 cattcggttt acactcggca ccagcagcat tcatgcggct gatgaacgca gtcatcacac 2100 cagatttaga accaaatgtg ttctgttacc tcgacgacat agtcattgtt acagaggact 2160 ttgaacagca cctacagctg attagcgaaa ccttccgccg gttacgggac gccaagctaa 2220 agccgaattg ggagaaaagc cacttttgca gaaaacagct gaaataccta ggtcatgtcg 2280 tcaacgagga ggggctacac acggacccag agaaagtaag ggcaatcacc gatctgcaac 2340 cgccaacgaa cctcaaggag ttacgccggt tttcaggatt aatatcctgg tacaggaggt 2400 ttattccgca agtggcaaag aaaacggccc cccttaacag gctattaaag aagaaagtga 2460 aatgggagtg gggtcccgac caacaagaag cattcgaaag gctcaaagaa gacctagtgc 2520 aagccccagt cctagcgtgc ccagatttcg ccaagacatt tgtgttacaa accgatgcca 2580 gcaacgaagg attaggggca gtcctaactc aagaagagga gggaaaagaa cgggtgatcg 2640 cctatgccag ccgatcatta aaccaggccg aacggaacta ctcagcaaca gaaaaagaat 2700 gtctcgccat aaaatgggga atttggaagt tccgagaata tctcgagggc tatcattttg 2760 tcgtcctgac agaccaccag tctcttaaat ggctcgagaa gattgacaat ccgtccggta 2820 gattagcacg atgggcgatg gaattagcac agtgggatta cgaggtcaaa taccgaagag 2880 gtaaggataa caccgtcgca gatgctctat ccagacaacc agtagctgta tgcaacatcg 2940 atatcaaaac gaactgcaca tggtacacaa aaaaattgac tgctgtccgg caggatccaa 3000 agacctatcc ggagtactgc atcaagaatg gagcattaca ccgccatatc ctacacacgt 3060 tggattttaa cgacacagac gcgaccgaag actggaaaat ctgcataccc accgaaaaac 3120 gaaccgaagt tttaagagaa tgccatgacg atctgaccgc agggcatttg ggagtggcca 3180 aaacactaca gcgactggcc cgaaggtact actggccagg tatgttacag gatggagcca 3240 ggtacgtgcg tagctgcata aattgccaaa cttacaaagc tcaacaacag acaaaagcgg 3300 gtaccatgca cgcgaccaac gtcgaacagc cttgggagat ggtctcgatt gatcttgtcg 3360 gacctcaacc acggtccaac aaaggtaaca tatggctact ggtaatgcag gacagattta 3420 ccaagtgggt ggagctagca gcacttaaga aagccgacag cagggccata atccgtcacg 3480 taaaggaaaa agttttcctc cgccatggtt gtcccaacac gctagtcagc gataatggtc 3540 gacaatttgt cagcagagaa tttgaagaat tcctgaagga aaacggggta aatcaccgaa 3600 aaaccccgcc gtatagtccg caatgcaacc cggtagaacg gaccaacagg gtggtaaaaa 3660 caatgatcgc ccaatatgta ggcaaacacc agaagaaatg ggatgaaaat ttgcgcgagc 3720 tagagttcgc ctacaatacc gctagcagca cagccaccgg ctacactccg gcatatttaa 3780 acacaggaag agagtttaga acgccaggta gccttcacca gaaagtggga tcgcagcaca 3840 ctacaccact gtccagtaga attcagcaca taaaggatgc cattgaatta gcgaaggcgc 3900 aaatggctcg tcaattcgaa aaacaacaga agcattataa cctccggcga cgggaatgga 3960 aaccgaatat cggcgacaaa gtagtcagga aattggtaac tttgtcgagt aaggccgacg 4020 cacgaaatgc caagctagcc ccaaaatact caaaaccgct tacggtcaag agacaggtgt 4080 cgccggttat atttgatctg caggacgaca gagggcgcta cgtgaaacat atacacgtga 4140 aagacctcaa gcccttcacc gacgcagaac aggagctata aatgcgccaa cgacgaaaaa 4200 ctacagtcag tcgtcaaacc aacgccaaga aggatggaca accggctagt cagaagcgcc 4260 ctcgtgagag ccacaccgat gtcggtagag atatcggtat acggacaaga gtttctcgcc 4320 atcatcgaca ccggagcccg aatctccacc ctgagcccag aggccgcgga atattgtctg 4380 gaacatgggg ccgaagcaat aacagcagag tcagcaatcg agcttcgttt cgccagccat 4440 accagcagca ccatccaaaa gactctacta gtccaaacaa gaataaaggg cttcgtagtg 4500 acgagtcctt gggcagaagt ggcagagcaa caaccacact ggatcatcgg catgaatgtc 4560 ctagggtccg ttcatgctcg aatttttatg agcccagtca accgggagaa aaagagccag 4620 aggcgaaggc tcaacaggca ccaggcacaa agactacgat ggcgcactac cgacgaagga 4680 atcgggagta gctccgaaga agacgaagaa ggtagaaatt tcgtttatag aataaaatta 4740 ggatattata ttttagtttt aagttagatt aaggcgaaaa taagcgaatt tgtagagttt 4800 aagacatatt cagggtaaac cgataagttg cgaccaaaca agccgttatt gtagactcag 4860 ccgcaaaaat cactagggtc taggaactag gctaggtcga ctcctatgca cacggacctt 4920 aagtctgtgg acacaaccgc tatagaaaaa ccgaaagcac cgcgccgggc acgatcaaaa 4980 caaatcggcg aagaagcggg aaattttttt tatatggagt taaacgcaca gcgacgcgaa 5040 tgaattttcg cccacataaa aaatgtaaac atgcgaaayt tcgagatcgt ctcacaaatt 5100 cacacattta cacgtcgcta gcatataaat agagacgata aagggcgcgc cggcagtaaa 5160 atggaaaact tcgatatcga aacgctggac gaaattcttg cgctgttgga ggccgacctc 5220 ccagcggacg cactggagta tagtccgggc gagcctttat cggcaccgga ctatactcca 5280 tcaccaatcg cgccgcatcc gcaaccggca ttggcatcga agccaatgca ggtgccagca 5340 ggggacaatc cgcggccaac aacatcaaag ccgcggatca tcagcgatgt cgtggtgcct 5400 cccctgctgc acctcaaacc ggcaccgaaa ataaagagca cgaggccgar gagcaccgca 5460 cgccacgttc caacgaggaa gaaccgccgg ggcccacaac acgraacrat tgtggctaca 5520 atcgacctaa cgacaccgcc atcgtcgccg aaacatcmtc cgccgccgcc tgtgacgaca 5580 acacgaccgg tcgtcacgcc ctcgcggcca cctccaccac cacgcgactc aacggccagc 5640 ctcccgcgtc gtttggtgcg gctaccggat gggcatgacg tcattgtgcc aatcggcgta 5700 aaaaaatacg tcgtgcggta caatggcgtt aagcggctac tccgcctggt accgacgacg 5760 ggcgaggtca gacatgtggg cgaggccaga agcacggaag tgagttttat tattttttta 5820 gatttatttt tcttttctcg tcgcgcatag tygttttttt tctaagcgcg aaatttcgct 5880 tcgcgtaatt tcacgctccg aagggagggg gaga 5914 // ID Gypsy-198_AA-I repbase; DNA; INV; 4096 BP. XX AC supercont1.72; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-198_AA_; KW Gypsy-198_AA-LTR; Gypsy-198_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4096 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.72; Positions 711863 715958. XX CC Positions [1267-1641] - Reverse transcriptase CC Positions [2905-3372] - Integrase core CC 'GTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 757..3918 FT /product="Gypsy-198_AA-I_1p" FT /translation="MIDSGADVNTVGMEVLDALMDSDPLKSSLFSLKNGTD FT RPLKAYATPTAIPVVATFVAELFITSDRPRLLEKFYVVHDAKPLLGRSTSM FT RYSLLRMGLNVPIRDIVANEAQLLPGEIHVVSSTNEFPKFNIPPVQLHYDK FT GLPPSRHVFTSIPPAFKEETSRRLEELLSSDIIEKVTSDMDKSFCSSLLVV FT PKGKCDIRLVVDLRGPNKCIVRTPFKMPSLESIVSDLHGAKWFSTIDLSSA FT FYHVVLHEDCRHLTNFFAGDATYRFKRLPFGLCNSPDIFQEILQTVVLEDC FT PGTVNYLDDILVFGGTKEEHDRNLEKVLRRLREHNVLLNAAKCVFSKQSVT FT FIGFRLSQDGWSVDDDKRKAIENFRRPESLSEVKSFLGLMNFTERFIVNRA FT EKTEKLRSLTKSDKFYWTHEEEKEFMFLKTEALNSIIKLGYYDHRDKTELY FT VDASPVGIGSVLVQFSCSGDPRIIACASKALTMAEQRYPQTQREALAIVWG FT VERFRYYLTSKNFTIRTDSEANEFIFGGEHRTSRRAITRAEVWALRLLPYS FT FQIERISGHQNVADALSRLIDRSQVDDSFDEDNEKHMLYALDAGSMSISWA FT EIQSASENDDELCEVRLGILSGKWPRGLRRYECQSKYLRVLDPLVFKNHLI FT VLPSSLRRRAIEAAHQGHVGCGATKHILREFFWWPNMAKEAEAFVAKCETC FT IVITRKNPPVPLCSRTLPNGPWEILQIDFLSVQGCGCGHFLMCVDTYSRYL FT HVVELKQTDARHTNAALCKIFSVWGLPLVLQSDNGPPFRSVEFVDHWEAKG FT VRVQKSIPLSAQSNGAIERQNQGVIKALAAAKLEQRSWREALEQYTYIHNT FT RKHHSRLGITPFELLVGWRHRGTFPSLWEQTEMLDRNDVAESDAYSKLVSK FT QFADRHRGAKDSDISVGDKVVVSVTQRTKTDPKFSLERFTVLTREGAKVVI FT RGEDGLEMSRNVQDVKRVPVITDLETYEEEKQITVNGDSNQSTSAVTHPGN FT NYSFADTTESLPCASTSADKSSAESNRPRRVIRKPAKLNDMYLFNIFQ" XX SQ Sequence 4096 BP; 1231 A; 823 C; 938 G; 1104 T; 0 other; atggcgcagt acggttattc agaacctgaa aacagtattc aagcgattcc cgattaagta 60 gtaattcacc tgagagggta aaacaggcaa ttcctccgag tgggtaaagc cgggattttg 120 ataacacata catattgttt gactagaata acacatttat tttgggattc ataatgattt 180 ggtcgttacc tgaacttcgt taatgtgaga tttactgtat gcctctaagt gaacatttat 240 gtttgttatt cacagactgc gaccttatca tccaacgcct atatgagtac gtttgaggat 300 acagtaattg atcctacctg cgatgttcga ccaaaagtgg ccgtcgatcc gatacttcac 360 gaaaatcaag tgcaccatct accgtccatc aagactattc ctgaacatgt ctcatctacc 420 ggtggacaca gaatcggaaa atcaaatgca gtacagttta atgaaaatca gtcacccgaa 480 atggttcgta gacaatattg atcatttttt tctctaatta gctatagtaa catgctgcca 540 atataatttt tcattccgca gaacggtgga tgtggacagc atagctcagc tcaaaacgcg 600 atctcttctg ataatgcggg atgtgatccc agcgcagtaa tcggaaatca aatcgtgtca 660 aatgttcatt tcgaaactaa tcatttgtat caatcaaaca aattaggtga cgaaagcttg 720 gttgtaggcc agatcgcagg cataaacgtc acatttatga tagattcagg ggcagacgta 780 aacacggtcg gtatggaggt actcgatgcg ttgatggata gcgatcctct caaaagttcc 840 ttattcagtc taaagaatgg gacggatagg ccactcaaag cctacgcaac tccaacggct 900 attcccgtag ttgccacttt tgttgcggag ttattcatca cttctgatcg accacgtttg 960 ttggaaaaat tttatgttgt gcatgatgcg aaacccttgt tgggtagaag tacttcaatg 1020 aggtacagcc tgttacggat gggattaaac gttccgattc gagacatcgt tgcaaacgaa 1080 gcacaattac taccggggga gatacatgtg gtctcatcga caaatgaatt ccccaaattt 1140 aatatacctc cagtgcagct gcattatgac aaaggcctac caccttcaag acacgttttc 1200 actagcatac caccagcatt caaagaggaa acaagccgaa gactggaaga attgctatct 1260 tccgatatta tcgagaaagt tacttccgat atggataagt ctttctgctc ctcactgttg 1320 gttgtcccga aagggaaatg cgatattcgg ttggttgtcg acctgcgggg tccaaacaaa 1380 tgtattgtac gaactccgtt taaaatgcct tcactagagt ctatcgtttc ggatctacat 1440 ggagcgaagt ggttttccac gatagattta tcaagtgctt tctatcatgt ggtgttgcac 1500 gaagactgta ggcatttaac gaacttcttc gcgggagatg caacgtacag gtttaagcgg 1560 ctcccattcg gtctgtgcaa ctcgccagac atttttcaag agatattgca gactgtggtg 1620 cttgaagatt gtcctggtac agtaaactat ttagatgaca ttcttgtttt tggtggcact 1680 aaagaggagc acgacagaaa cctggaaaaa gttcttcgtc gactgcgcga gcacaacgtc 1740 ctcttaaacg ctgcaaagtg cgtgttttcc aaacaatctg taacgttcat tggattccgc 1800 ctatctcagg acggttggag tgtagatgac gacaaacgga aggcgatcga aaattttcgt 1860 agaccagaat cactttcgga agttaagagc ttcctaggcc taatgaattt tactgagcgc 1920 ttcattgtga acagagcgga gaagaccgaa aagcttcgtt ctctaactaa atcagataag 1980 ttctattgga cccacgagga agagaaggag tttatgtttt tgaaaactga agcgttgaat 2040 tccatcatta aattaggtta ttatgatcat cgtgacaaaa cagaactata tgttgatgcc 2100 tcaccggtcg gaataggatc cgttctcgtg cagtttagtt gctcaggaga tccacgaata 2160 atagcgtgtg cgtctaaagc tctaaccatg gccgaacaaa ggtacccgca aacgcaacgg 2220 gaagctctag cgatcgtttg gggtgttgag cgtttcagat attacctgac tagtaagaac 2280 tttaccataa gaacggattc agaggccaac gagttcatat tcggaggtga acacagaacg 2340 agccggcgag ctattacacg tgctgaagtc tgggcgttgc gactccttcc atacagcttt 2400 cagattgaaa ggatttcggg tcatcagaat gtcgcagatg ctttatcgcg ccttatcgac 2460 agatcacagg ttgatgattc attcgatgaa gacaatgaaa aacacatgtt gtatgctctc 2520 gatgctggaa gcatgagtat ttcatgggcg gagatacagt cagcctcaga gaatgacgac 2580 gaactatgtg aagttagact gggaatttta tcaggaaagt ggcctcgtgg acttcgacgt 2640 tatgaatgtc aatccaagta cttaagggtt ttggatcctt tggtgttcaa aaatcatcta 2700 atcgtgttac cgtcgagcct caggagacgt gctatagaag ctgctcatca ggggcatgta 2760 ggatgtggcg ccactaaaca catactgcgt gaattcttct ggtggcctaa catggcaaag 2820 gaagcagaag ccttcgttgc caaatgtgag acgtgtatcg tgattactcg aaaaaaccca 2880 ccagttccgc tatgcagtcg tacacttcct aatgggcctt gggaaatcct acaaattgat 2940 tttttatcag tccaaggctg cggttgtggg catttcttga tgtgtgtgga cacatattca 3000 cggtacttac atgtggtcga attaaaacaa acggatgccc gacatacgaa tgctgcactt 3060 tgcaaaattt tttcggtttg gggtcttcca ttagtcctac aaagcgacaa tggtccgcca 3120 ttcagaagtg ttgaatttgt tgaccattgg gaggcaaaag gagtgcgagt ccagaaatcg 3180 attcctttaa gtgcacaatc gaacggcgcc atcgaacgac agaaccaagg cgttattaag 3240 gctttggcgg cagcgaaatt agaacagaga agttggagag aagctctgga acagtacaca 3300 tatattcata atacacgcaa acatcattca cgcttaggga tcacaccgtt cgagttgtta 3360 gttggttggc gtcatcgagg aacttttccc agtttatggg aacaaacaga aatgctagat 3420 aggaatgatg tcgcagagag cgatgcctat tcaaagcttg tgagcaaaca attcgctgat 3480 aggcatagag gtgcgaagga tagcgacatt tcggtgggtg ataaggttgt agtttctgtc 3540 acacaacgaa caaagacaga ccctaagttt tccttagaga ggttcacagt cctaacaagg 3600 gaaggagcta aggtggtgat tcgaggagaa gatggacttg aaatgtctcg caatgtgcaa 3660 gatgtgaaaa gagttccggt gataactgat ttggaaacat acgaagaaga aaagcaaata 3720 actgtgaatg gtgattccaa tcagtcaacg tcagctgtaa cacatcctgg taacaattat 3780 agctttgcag atactactga atctctacca tgcgcctcca catcagcaga taaatcctcc 3840 gcagaatcaa accgaccaag aagagttatt cgtaaacctg caaaactaaa cgatatgtac 3900 ttgttcaaca ttttccaata aattaaaatt tttaaaagct ggccacaaaa ctctaaagcg 3960 gactattaaa aatgtatcat acaaaaataa atgtgtacta catatgccgt tcgattagtc 4020 tcttagtttt aagataagca aaatcccaaa aaaaaacctt tgaagttttg ctatgaagag 4080 tagaggtggg gaagga 4096 // ID BEL-87_CQ-I repbase; DNA; INV; 5414 BP. XX AC AAWU01006055; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-87_CQ_; KW BEL-87_CQ-LTR; BEL-87_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5414 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 307-307 (2011). XX DR GenBank; AAWU01006055; Positions 3138 8551. XX CC Positions [4428-4949] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 84..1892 FT /product="BEL-87_CQ-I_3p" FT /translation="MLSTPVKDPDSSLLEVFSTPIGELQQSRRKMAEEFDI FT LVKQRGAVKGKITRIKNVLGQHTQNQTIPDEFLLHTQLKTIDATYEEFNHF FT QNKIYAAKPAEADDQEKKYVEFEADYNFVRAKVCQLLRGVKQEPDPQPPAA FT AQPVRVQATSSLHTPLPSFDGKPENWYKFKAIFTDVMSKHPGESEATKLYH FT LDKCLVGEAVGSIDQQTINDSDFAAAWAELTEQYENKRRIVDIHVGGLLNL FT KPMTSESGKQLRELVDGCKRHVSALNFHEFPMAGLADILVVNILASKLDIE FT TRKVWETSIEHGEIPNYADTVKFLTTRSQVLERIELADNKQKKPVAAKTTK FT IPALKASSLAASVEYRCAFCGQEHQNHECGEYLKLDPKERYEKAKQTGVCS FT NCLKKGHITRRCSSTKTCKACKKRHHSTLHLNETGVAVEQPLARSTQNSTT FT ATPSAGPAGTTAVTASCVIYQQRALLCTAIVNILDDGGKALPCRVLLDCGS FT QVNLCTEKVASLLRLKRRSVNVDVKGVDGSTTRVTALVHINVQSRDNVYTS FT NLECLVINRITGTIPAKNIDISTWPVPAGLQLADPEFYRPQHADWRRPILR FT SPENW" FT CDS 1849..3369 FT /product="BEL-87_CQ-I_1p" FT /translation="MLIGVGQFFGLLKTGKVKLAESLPFLQETVFGWVVGG FT LADSTSWKYEVTRCNVAVQDAELNDLVERFWESEAVPTASRMSSEETACEQ FT CYEQTHSRDPSGRYVVMLPFRGNVSQLGDSKQHALQRFSYLEKKLAKNADL FT KAAYCAFMEEYVRLGHCREIDPSKEEDGGYYLPHHAILKPSSSTTKLRTVF FT DASAQTSSGLSLNDTLMVGPTIQDPLFDIALRFRTYPYVFTADISKMYRMV FT RVHPDHTKFQRVFWRTDPSESLKTLELLTVTYGTASAPYLATRTLKQLAQD FT EGQAYPRAAKILENDFYIDDALAGADTLKELLLARDELEELLEKKGFELHK FT WCSNSAEFLDTIPDERREKQVSFELSGVNEMIKTLGILWNPTSDKLLFRVA FT LTDLTQPVSKRRIRSEVARSFDPLGLQAPVCVTAKLILRQCWRKKVSWDEQ FT LCKVLTTKATVQGAMEQISWRTASADGGTDRAPSHRATQGGPGAARIFGRI FT PGSLRHVRLRA" FT CDS 3471..4949 FT /product="BEL-87_CQ-I_4p" FT /translation="MSRLVSRVMKALKVKFTRKVLYSDSTIVLGWLAKSPS FT ELETYVANRVAEIHELTPRADYEYKYVRSSANPADLVSRGLFPEALIGNRF FT WFHGGFLEENDYQEDEMAIVVELPEVRVAAVVITDEQEEDYNRIFTRFSSF FT RKLQRVLAYVIRFCNKTRKRSTYGDNDKYPTIPELRSSLRSIVYMVQQQRM FT QTDIQEVLKKQGKPNERYTGRLRSLNPWVDTYGILRVNGRIKYANVSYEQR FT CPAILPAEHPVTAILVQAVHEENLHVGSSGTLSVLRQRYWILNGRNAIRAR FT LRKCVRCFKVNPSETKLFMGDLPSYRVTQAYPFERVGVDFAGPIYVRKGHP FT RKPVYCKAYVALFVCMVTKCIHIELVSNLTTDAFIAALHRFVARRGLPSDV FT YSDNATNFAGASSELHELYALLTQQLTKDALQEFCLPKEIRWHFIPPRSPH FT IGGLWEAGVKSAKHLIKRNAGETKLTEEEWSTLLTQIEGIFNSRPLVP" XX SQ Sequence 5414 BP; 1355 A; 1476 C; 1551 G; 1032 T; 0 other; tttttggtcc tacttcgccg gatattcgta gtagtttcgg tcgaaaggaa cggacaagga 60 cccggtcgat tggatgggcg tagatgctct caactccggt gaaagatccg gacagttcgt 120 tattggaagt ttttagtacg ccgatcggag agctgcaaca atcgagaaga aagatggcgg 180 aagagttcga cattttggtg aaacaacgtg gtgcagtgaa aggaaaaatc acccggatta 240 agaacgtgct gggccagcac acgcaaaacc aaacgattcc ggatgagttc ctcctgcaca 300 cccagctgaa gacgattgac gcaacctacg aggaattcaa ccacttccag aataagatct 360 acgcggccaa gccagcggaa gcggatgacc aggagaagaa gtatgtcgaa tttgaagcgg 420 attacaattt cgttcgggcc aaggtctgcc agttactgcg tggggtcaaa caagaaccgg 480 acccccagcc tccggcagca gcccaacccg tgcgagtgca agcgacatct tctctccaca 540 cgccgctgcc atctttcgac gggaagccag aaaactggta caaattcaag gccattttca 600 cggatgtcat gagcaagcac cccggcgagt cagaagcaac caagctctat catttggaca 660 agtgcctggt cggagaagcc gttgggtcaa tcgaccagca aacgatcaac gacagcgact 720 ttgctgctgc ttgggcggaa ctgaccgagc aatacgagaa taagaggagg atcgttgaca 780 ttcacgtcgg cggtttactt aacctcaaac cgatgacatc tgaaagtggt aaacaactcc 840 gtgagctggt ggacgggtgc aaacggcacg taagtgcttt gaacttccac gagttcccga 900 tggctggcct tgcggacatt ctggtggtca acattttggc gtcgaagctt gacatcgaaa 960 cgaggaaagt gtgggagaca agcatcgagc acggcgagat tccgaactac gccgacacgg 1020 tgaagtttct cacaacccgg agtcaagttt tggaacgcat cgagcttgct gacaacaagc 1080 agaagaagcc tgtcgcagca aagacaacga agattcccgc actgaaggca tcgtccctgg 1140 cggcgtcggt cgagtaccgg tgcgcatttt gcggacagga gcaccagaac cacgagtgcg 1200 gcgaatatct caagctggat ccgaaggagc gctacgagaa ggcgaagcaa actggagttt 1260 gttccaattg cttaaagaag gggcacatta cgcgtcgctg ctcctcgacg aagacgtgca 1320 aggcctgcaa gaagcggcac cattctacac tacatctgaa cgaaacgggt gtcgctgtcg 1380 agcagccact tgcacgatcc acgcagaact ccacgacggc tacgccgagc gcaggaccag 1440 ccggaacgac ggcggtcaca gcctcctgtg tgatctacca gcaacgggcg cttctctgca 1500 cggcgattgt gaacatcctc gacgacggtg gcaaggccct gccctgccgc gtactgctag 1560 attgcggttc acaagtgaat ctgtgtacgg agaaggtcgc gtcgctactg cgactgaagc 1620 gacgaagtgt gaacgtcgac gtcaaaggtg tggatggttc aacgaccagg gttacggcct 1680 tggtgcacat caacgtgcag tcccgagaca acgtctacac aagcaacttg gagtgtctgg 1740 tgatcaaccg tatcactgga accatacctg cgaagaacat cgacatctct acctggccgg 1800 tacccgccgg cctgcagcta gcggatccgg agttctaccg ccctcaacat gctgattggc 1860 gtcggccaat tcttcggtct cctgaaaact ggtaaggtca agctggcaga gagcctgccg 1920 tttttgcagg agacagtgtt cggatgggta gtcggtggcc tggccgactc cacgagctgg 1980 aagtacgaag tgacccgctg caacgtggcg gtccaagacg ctgagctgaa cgacctcgtc 2040 gaacggttct gggaatccga ggcggtgccg accgcttcaa ggatgtcgtc ggaggagacg 2100 gcatgcgagc agtgctacga gcagacccac agccgtgacc cgagcggtcg ctacgtcgtg 2160 atgcttccct tccgtggcaa cgtgagtcaa ctcggcgact cgaagcagca cgcactacaa 2220 cgcttctcgt acctcgagaa gaaactagcc aagaacgcgg acctgaaggc ggcctactgc 2280 gctttcatgg aagagtacgt gcgcctcggc cactgccgcg aaatcgaccc gtccaaggag 2340 gaagatggtg ggtactacct gccccaccat gccatactga agccaagttc gtccaccacc 2400 aagttgcgca cagtttttga cgcctcggca caaacgagtt ccggactctc gctcaacgac 2460 accctgatgg tcggtccaac cattcaagat ccgttattcg atatcgccct tcgattccgg 2520 acgtacccgt acgtgtttac ggcggacatc tcgaagatgt acagaatggt tcgggtgcac 2580 cccgaccaca cgaagtttca acgtgtgttt tggagaacgg acccttccga gtcactcaaa 2640 acgctcgaac tgttgaccgt cacttacggg acggcatcag cgccatactt ggcgacccgc 2700 acgttgaaac agctcgccca agacgaagga caagcctacc ctcgcgctgc caagattttg 2760 gagaacgatt tctatatcga tgacgcactt gcgggtgctg atacactgaa ggagttgctg 2820 ctggcgcgcg acgaactgga ggagctgctg gagaagaaag gcttcgagct ccacaagtgg 2880 tgctccaatt ccgccgagtt tcttgacacc attccagacg aacgacgaga gaagcaagtg 2940 tcattcgaat tgagtggcgt gaacgaaatg atcaagactc tcggcattct gtggaacccc 3000 accagtgaca aactgctgtt ccgcgtcgct ctcactgact tgactcaacc ggtttccaaa 3060 cgtcgcatcc ggtcggaagt tgccagatct tttgacccgc taggtctgca agcaccggtt 3120 tgtgtcaccg ctaaattgat cctgcgtcag tgttggcgaa agaaagtgtc ttgggacgag 3180 caactgtgca aggtgcttac aacaaaagca actgtgcaag gtgctatgga acaaatttcg 3240 tggaggactg cgagcgctga tggaggtaca gatcgagcgc cgagtcatcg tgccacacag 3300 ggtggccctg gagctgcacg cattttcgga cgcatccctg gaagcttacg gcacgtgcgt 3360 ctacgtgcgt aacattctgg cggacaacac agcagtagcc tacttgcagc aagtcgcact 3420 tggcgcccaa agagacgatc ccgaaactcg aactgtgcgg cttccggctg atgtcaaggt 3480 tggtgtccag ggtaatgaaa gcactcaagg taaaattcac tcggaaggtg ctgtacagcg 3540 actcgacgat tgtactggga tggttggcca agtccccaag cgagctcgaa acgtacgtgg 3600 caaaccgagt tgccgaaatc cacgagctta ccccgagggc cgactacgag tacaagtacg 3660 tgcgaagcag cgccaacccc gcagacctcg tctcacgagg actctttccg gaagcgctga 3720 ttgggaatcg cttctggttc catggcgggt tcctcgaaga aaacgactac caagaagacg 3780 agatggccat cgtagtcgaa ctacccgaag tacgagttgc agccgttgtg ataaccgacg 3840 aacaagaaga agactacaac aggatcttca cacggttcag ctcgttccgc aaactacaac 3900 gagtgctagc ctacgtgatt cgcttttgca acaagacccg gaagcgatcc acgtacggag 3960 acaacgacaa gtacccaacc atacccgaac tacgatcgtc gctgcgctcg attgtgtaca 4020 tggtccagca gcaacgaatg cagacagaca tccaagaagt gctgaagaag caaggcaagc 4080 cgaacgagag gtacactggc cgactacgaa gcttgaaccc gtgggtcgac acctacggca 4140 ttctacgagt caacggacgg atcaagtacg ccaacgtgtc gtacgagcag cggtgcccgg 4200 ccatactgcc tgccgagcat ccagtaacgg cgatcctggt tcaagctgtc cacgaggaga 4260 atctgcacgt cggatcgagc ggaactctat cggtcctgcg gcaacggtac tggatcctga 4320 acggacgaaa cgcgatccgc gcaaggttgc ggaagtgcgt acggtgcttc aaggtgaacc 4380 catcggagac caaactgttc atgggagact tgcccagcta cagagtgacc caggcctacc 4440 catttgaacg agtgggcgtt gacttcgctg gaccaatcta cgtacgcaag gggcacccgc 4500 gcaaaccagt gtactgcaag gcgtacgtgg cattgtttgt gtgcatggtg acgaagtgca 4560 tccacatcga gctggtgtca aacttaacga cggatgcttt cattgccgcc ttgcaccgat 4620 ttgtcgctcg tcgcggcctt ccctccgacg tttacagcga caacgccaca aactttgctg 4680 gagccagctc ggagctccac gaactgtacg cgctgctgac gcagcagctg acgaaggatg 4740 cgcttcaaga gttttgtctg cccaaggaaa ttcgctggca ttttatccca ccacggtcgc 4800 cgcacatagg aggtttgtgg gaagccggcg tgaagtcggc gaagcacctg atcaagcgga 4860 acgcaggtga aacgaagctg acagaagagg aatggtccac actactaacg cagattgaag 4920 gcatatttaa ctcgcggccg ttggtgccgt agacggcgga cccgggtgac ctcaacgtga 4980 tcacgcccgg tcatttgctg attggccgcc cattcaacgc gatcccggag ccggcgtacg 5040 atcaactcaa gcctggaacg ctgacacggt ggcagcacct gcagaagatg cgagctgact 5100 tctggaaacg ctggtcggcg ggctacctgt ccgagctgca gcagcggcag aagtggaaca 5160 agcagcacac tgtcgtcaag gaaggagatt tggtgctgct caaggaggag aacgttccgg 5220 cgctgcagtg gaggctcggc cgagttgtga aggtccaccc cgggcaagac ggagtcaccc 5280 gcgtggtgac ggtaaagaca gcaggaggag tttacaagcg gtcgacggca aaggttgcgg 5340 tgctcccact cgacgacgaa gcagaggaga agacagccgg ttgaggttcc cggatgtcca 5400 ctggcggggg agga 5414 // ID DNA8-75_AP repbase; DNA; INV; 684 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-75_AP. XX NM DNA8-75_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-684 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2011-2011 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 684 BP; 218 A; 117 C; 96 G; 249 T; 4 other; ggtactttcc tgttatacga gactaccctg ttttgttcaa catcacaatc ctgttatcta 60 agactaccaa aattatactt tcctgttata cgagactacg gctattatcg cgactaccac 120 ccaagactac ctccgttatc gcgactacct atcttagctg atattatagc tgaattaaaa 180 taatactaaa taagctgcga ttatatttat tatatatcta tcattacgta tattattagt 240 ttttgtaatt ggtgtttccc acggcaaatt tggtacatta cgaataatta ttatgtattt 300 aatttttttt tatanaattt gaatgtggcc atccccacaa tacataaatt attaattatt 360 ttatttgtgg atttaccatt ttgcagtatt attattatta ttttattgtt ttccgtacct 420 acgccgcgnc aaatttggca cttaaccgtg taaaatacat tacgaataat tactatgtat 480 ttaatttttt tctttatttc attctgacca tccccacaat acatattatt aaaaaatata 540 tcaaagaagg caagaaaant aaggtagtct cgtanaacag gaaagtacca ttttggtagt 600 cttagataaa aggattgtga tgtttcacaa aataagcgcg gtagtcgcga taacggaggt 660 agtctcgtat aacaggaaag tacc 684 // ID Gypsy-10_OD-I repbase; DNA; INV; 11460 BP. XX AC CABV01000282; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_OD_; KW Gypsy-10_OD-LTR; Gypsy-10_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-11460 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000282; Positions 27143 15684. XX CC Positions [4784-5263] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 516..1919 FT /product="Gypsy-10_OD-I_4p" FT /translation="MDYNITEPHLIRINKSIKFEYLKFFGRVLYNFEPKFE FT DKNDLLAQIQQKAKIIQIQVLLHNNKFILAPLNFATKTSTPKSYLKLISTF FT NKTPIETLKSTLDSTESLLFLTKEYKAWPRAKVLLADDQLLDLITMTHGAN FT EPLPEEIDLEVSSLINDETEDSGPSIVKDLNNFSVSRNNKISLPEYDLKGY FT IGDWIKLCLNQLAREKIDDHDRKVSALLRALPKTSAYEIDKRISALPATEQ FT ASLETILDIFRKYFKRNPISVKKELDELKFNKSYKDIKSFFADIRSLTEQA FT FPGKNEEVLKALSYEYLNKKLPMALQKSESFRLAENSSLETKINLLDNLYA FT SLPPTAETNVTISKDSQNAVEANYFEGNQNRGRGGYRGRGGYRGRGGQRGR FT GNNRGRGGHYHNDKRQPHGQSYNTEHKSSNNQAKGNDKGKKEQSKGTKDNK FT KDKSHVQCYKCKGKGHYKSQCPS" FT CDS 5834..7876 FT /product="Gypsy-10_OD-I_3p" FT /translation="MAAECVTFKLYEVVAFNNRSIEEAEEELRANVKGNWN FT TSTKELPVLTRSNQESRAFSKRRAVLPNFEYPKSRLKPTSSSSLKLSEVQS FT RRKQLVHTTPKIELEIETTTLIPLTTSIGENLLETTTSMQSEYDEYSYELL FT NMCRGDNCRKRRQVLAIAALTSLIIGGVGVGSAVIEHNENKSNELRIKHDE FT LLSSKLAATTETKLEEVHIQINKQNSEIVKAFRQDCQTLDELSNEQFLNSI FT VNEFEDFKIRITNETLNLKTRASFEHNKALETVINSCIVLNGPSFEAACSD FT FIELTSVRASEYAPKFDHENRLSLVLKVEFANPEMEIISQTYEVLSVPVPV FT GIRESKYYYMKVLTPKRVAFSDRVNKAFSIDECKTNFLMKNIFCNEKDLFY FT NKNFDACAASLMTNINGNACEHEILHSNMDCMFSNNEDFIMVGHFHTPSVH FT SSRKSSIIPVVKVGSQKVDHASNITIFERIDYTLGITCLSSSYSVTGRRWT FT INKIVDVDFPNLGAQTFHISDHHFDNILNQINRTALMNKTDIEAISKEFQY FT GAKTGYDKVDKLFNTARPYSHIVLWSTAAVVSAILIFTLFVYFCVGMNEFM FT KCCCPKLFKLKNLRVVNNETIIWDRDLNIDNSLDRHRQMRENNSQSNRTQE FT RRIPPEIPMRRTRFLDRNRPQSSRSLQIEDEL" FT CDS join(1970..2872,2876..5785) FT /product="Gypsy-10_OD-I_1p" FT /translation="MGHLLPTTTISIVQYEGDKILLSNDIQRVLLDTGSSI FT SFLKNSAVPTDFKVQRAALSLSNAFGSKMCAITQKIECNATIDGKIIPQVS FT FYIIPDRYEISYTGVIGMDILKRFKLNLNKSPDTPILFNKFEVVEDNDEII FT ELIDRTSDSVLFANEQIIIPPHDTVHIKIKRSKALEKKNKISIITTLDMEN FT KNTFLHSYILSKGHMLVQVENKNDVPVFVKKGAALADEAKEISQHLLNFLI FT NSKSLPVQERKIHKKEFEEWRKKRDILVKDIDLSLDIEDAAKSAPIQYSEG FT LRKILNKYNNFSRTPSDAGLSKNFVCEIRLKDTEPIFHRPYPIDAFKIGKI FT DSKLKEMTDNNILKKTISSFNSPVIFVLKKNKKLRVVNNYSAGGKKSINSR FT LILPRFPTLPIRALLAKLSTAIASLKTRFPSDKLVFCSLDLANAFYTLSLR FT EGHRDYTTFIFGDKQLQYQRMCQGLASSPSTFQAFASKILDDLEGQNEHFF FT CFNYQDDFLLISTERHLNETLDKIFSRLEKFQVIVSLAKCEFYKESVKFLG FT FVIDQNGIKASKSKVDALLQLKPPKTAREAMSFVGCLNYYCRLIPKLSALL FT SPITKGISLGKKFTLTDAMKENITILKNHIRNGIGVCHLVYPANTDDFERI FT FVAADTSLLQTGSIIGNLRIDNDKISNVRIAGYSSKALVEQETLLSARARE FT LIGLSHAIKAFKELLPHDARMIFIVDHKSLTSIIDKPALKTSGGTRVRGAY FT AQILEYPLCRIMYLPNTNSIIEIVDNLSRHPVEEINKDIFRPSLDKNSNGQ FT CNNMELSIPRQVVDIEAIKSAQERCEKLQKIKKEILEKPKAINSEYSLIND FT IIYKRVRNNKLLAVISEELTRDVINFFHVALHHCGEQMLLQHLKTEPILLQ FT NKHKNVAKCIKGCLTCGLAHKRYSTEGVKASIEPTLSPYAAMYADLVVWNL FT SEYDKQYLFLTFLDGFSRRLTVRHIKTKKAQHMIPAITELIMEAGAIGQSH FT LITDNGKEFTSREFEECMTLLAVSHSTISPRNSSANKIERCHRELRSILRA FT YDLTPANIVHKFYCATYIFNMRPSEGLSNLSPNQVLFNKNPPSYFGQLSIK FT EEEVDYVSKDVYNEHREHIQALHLSQAKNQLTKFFLKSNFNEGTLKEKDIV FT ILRDPSNNAFHSSSQRGPWYIHKVKTRNNFEITHIFNRNKLVRQGKYLVKL FT NLDEYDSKYLRETESIFVDQKTREIINSNYPTKNKTLSPLAIDYSHLKTST FT DEFVIEIKFN" XX SQ Sequence 11460 BP; 4268 A; 2254 C; 1898 G; 3040 T; 0 other; gaggatatta ctggtgactg aagaagcgga ctttaccagg taattctttt atattattta 60 ttttatattt aattgtataa ttaattatta atttattata atttataatt ttattaataa 120 attttattat ttattataat tttataattt tattaataat aatttattat aattattatt 180 atataattaa ttttaatttt tgggtggaac aaaaaaccaa cctattttct gtcagaagac 240 agaaataggt actgtaccca aactccccct agcgcctgtg acgaggctta aataatcttt 300 gtcaaaatac gcagtgacca tttcaaaacg gtcctgcgaa gctcccggta ttagagtagg 360 tagtttttaa tagaacccag gtagtttgat taggcaggat cccaggcaga ttaggtcaga 420 tatgcagata ctccgagtgc gaaaatgata aaatagcaaa ctcaggaatg ggaataaaaa 480 ataaatataa aatattatag ctacaaaaat aactaatgga ttataatata acagaacctc 540 accttataag aattaataaa tcgataaaat ttgaatacct aaaatttttt ggaagggttc 600 tttataactt tgaacccaaa ttcgaagata aaaacgacct tttagcgcaa attcagcaaa 660 aggccaaaat aatccaaata caagtgttat tacacaataa taaatttatc ttagccccac 720 ttaattttgc aactaaaacc tccacaccta aatcatatct taaacttatc tctacattta 780 ataagacacc catagaaaca cttaaaagca cacttgattc aaccgaaagc cttctttttc 840 ttactaaaga gtataaagct tggccaagag caaaggtgtt acttgcagac gatcaactac 900 tcgatctaat aactatgacg catggcgcga atgagccgct accggaagaa atagatctag 960 aggtatcaag cctcataaac gatgaaacag aggactctgg tccttctatc gtcaaagatc 1020 tcaacaattt tagcgtctca cgaaataata aaatttctct gcccgaatat gatctgaaag 1080 gatacattgg ggattggatt aaattgtgcc tcaatcagtt ggctcgtgaa aaaatagatg 1140 accatgaccg aaaagttagt gctctgttac gggctcttcc gaaaacaagc gcttatgaaa 1200 ttgataaaag gatcagtgcc ttaccagcaa ctgagcaagc atcgctagag actattttag 1260 acatttttcg aaaatatttc aaaagaaatc cgatatctgt caaaaaggag ttagatgaat 1320 taaaatttaa taaatcatac aaagacataa aatccttttt cgcagatatt aggtcgctta 1380 cagaacaagc attcccaggc aaaaatgaag aagtgctcaa agcattatct tatgaatatt 1440 taaacaaaaa attgccaatg gctcttcaaa aaagcgaatc gtttcgactc gcagaaaata 1500 gtagtttaga gacaaaaatt aatctcctcg ataatcttta cgcatctctt cccccaaccg 1560 ccgagacaaa tgtaactatc tcaaaagact ctcaaaatgc tgtagaagca aattattttg 1620 aaggaaatca aaacagaggt cgtggtggtt acaggggacg cggtggctac agaggacgag 1680 gtggtcaaag aggacgagga aataacagag gacgaggcgg acattatcac aatgacaagc 1740 gtcagcctca tggtcagtca tataacaccg agcacaagtc gtctaacaac caagcaaaag 1800 gcaatgataa gggcaaaaaa gagcaatcaa agggcacgaa agacaataaa aaggacaaat 1860 cacacgtcca atgctataaa tgcaaaggca aagggcatta taaaagtcaa tgcccaagct 1920 aaaccatgtt tacctacttg taaataattt tattcccact tagtcaagaa tgggacacct 1980 actgcctact accactatat caatcgttca atatgaaggc gataaaattc tcctttcaaa 2040 tgatatccag agagttctac tggatacagg ttcaagcatt agctttttga aaaattcagc 2100 agtcccaaca gattttaaag tccaacgagc agcactttca ctttcaaatg cattcggctc 2160 caaaatgtgt gcaataactc agaaaattga atgcaatgcg actattgacg gcaagattat 2220 cccacaagtt agtttttaca taataccaga cagatacgaa atttcgtata caggcgtaat 2280 cggtatggac atacttaaaa ggttcaagtt aaacctaaat aagtctccgg acacaccaat 2340 tttgttcaat aaatttgagg ttgtcgaaga caacgacgaa ataattgagt taatagatag 2400 gacaagtgat tcagtgcttt ttgcgaatga gcagattatt attccacctc atgacactgt 2460 tcacataaaa attaaacggt caaaggctct agaaaagaag aataaaataa gtattattac 2520 aactcttgat atggagaata aaaatacatt tttacactca tatatccttt cgaaaggaca 2580 tatgctcgtt caagttgaaa acaaaaacga cgttcctgta tttgttaaaa agggagctgc 2640 actagcagat gaagctaagg agataagtca acacttactt aattttctaa ttaactccaa 2700 gtcacttcca gtgcaagaaa ggaaaatcca taaaaaagag ttcgaagagt ggcgtaaaaa 2760 gagagatatc ctcgtaaagg atatcgatct cagcctagat attgaagatg cagcaaaaag 2820 tgcaccaatt cagtattctg aaggtctccg taagatttta aataaatata attagaattt 2880 cagcaggacg ccttcagatg caggattgtc aaaaaatttc gtgtgcgaaa ttagacttaa 2940 ggatacagag ccaattttcc atcgaccata tcccatcgat gcgttcaaaa ttgggaaaat 3000 cgattcaaag ctaaaggaga tgactgacaa taacatcctt aagaaaacca taagtagttt 3060 taacagccca gtcatattcg ttctcaagaa aaataaaaag cttagagtag tcaataacta 3120 cagcgcagga ggcaagaaat ctatcaactc taggttgatc ctccctcgct ttccgactct 3180 tcccatacga gcgctactag caaaactatc tacggctatt gcaagtctta aaacgagatt 3240 tccatcggac aaacttgttt tctgcagcct agatttagca aacgcgttct atacgctaag 3300 tttgagagaa gggcacaggg attacaccac atttattttc ggagacaaac aactccaata 3360 tcaaagaatg tgtcaaggtc tcgcctcttc tccatctact tttcaagcat tcgcaagcaa 3420 gatccttgat gacttagaag gtcaaaatga acattttttc tgttttaatt accaggacga 3480 ttttctcctg ataagcacag aacgtcatct aaatgaaaca cttgataaaa tattctcacg 3540 actagaaaaa tttcaagtaa tcgtttcgct agcgaaatgt gagttctaca aagagtccgt 3600 aaaatttctt ggattcgtca ttgatcagaa tggaatcaaa gcatctaaaa gtaaagtcga 3660 cgcactcctt cagttgaaac caccaaaaac ggccagagag gctatgagtt tcgttggttg 3720 tctgaactat tactgtcgac tcataccaaa actctctgct ttactctctc cgatcacaaa 3780 gggaatttcc cttggaaaga aatttaccct tactgacgca atgaaagaaa atattactat 3840 cttaaagaac cacattagaa atggaatcgg tgtctgccac cttgtttatc cggctaatac 3900 tgatgatttt gaacgaatat ttgtcgctgc agacaccagt ctgttacaga ccggttcaat 3960 aatcggcaac ttaaggatag ataatgataa aatttctaat gttcgcattg caggctattc 4020 atcaaaagcc ttagttgaac aagaaacact cctgtcagct agagctcgcg agttaatagg 4080 acttagccat gcaataaaag cattcaaaga acttcttcca catgacgcaa gaatgatttt 4140 tatcgttgat cataaatccc taacctcaat catcgacaaa ccagctctta aaacgagtgg 4200 aggcacccgc gttagaggcg cttatgcaca aattctcgag tatccgctct gccggataat 4260 gtacttgcca aatacgaata gcatcataga aattgtcgat aacttaagca gacatccggt 4320 tgaggaaata aacaaagaca ttttcagacc atctctcgat aaaaactcta acggccaatg 4380 taataacatg gaactaagta tcccaagaca ggttgtagat atagaagcaa tcaaaagcgc 4440 gcaagaaagg tgtgaaaaat tacaaaaaat taagaaagaa attcttgaaa agcccaaagc 4500 aatcaacagt gagtactcat taataaatga tatcatctat aaacgagttc ggaacaataa 4560 actgctggca gttatatcgg aagaactcac tagagacgtt atcaactttt tccatgtagc 4620 tctccatcac tgtggagagc aaatgcttct tcagcatttg aagacagaac caattcttct 4680 tcagaataag cataagaatg tagccaaatg tataaaaggt tgccttacct gtggacttgc 4740 acacaaaagg tacagtacag aaggcgtcaa agcaagtatt gaaccaacac taagcccata 4800 tgcagctatg tatgcggatt tggttgtatg gaacttatca gaatatgata agcaatattt 4860 gttcctcaca ttcttagacg gatttagtcg tcgactaacc gtcagacaca taaaaacaaa 4920 aaaggctcag catatgattc cagcaataac tgaattgata atggaagctg gtgccatagg 4980 ccaatcacac cttattaccg ataatggtaa agaattcaca agtcgagaat tcgaggagtg 5040 catgactttg ctagcagtca gtcatagcac catttccccg agaaattcct ccgctaacaa 5100 aattgaaaga tgtcatagag agttaagatc aattcttcga gcatacgatc tgaccccagc 5160 aaacatagta cacaaatttt actgcgcaac atatattttt aacatgcgcc caagcgaagg 5220 attatcaaat cttagcccaa atcaagtgct ttttaataaa aatccaccat cctactttgg 5280 tcaactaagc atcaaagaag aggaagtaga ttacgttagt aaagacgttt acaacgaaca 5340 tcgagagcac atccaagcac tacatttgtc tcaagcaaaa aaccaattaa caaaattttt 5400 cctaaagtca aacttcaacg agggaactct caaagaaaaa gacatagtca ttttaagaga 5460 tcccagcaat aatgcttttc acagctcttc ccaaaggggt ccatggtata ttcataaagt 5520 aaaaacgcgg aataattttg agattacaca catatttaac aggaataaat tagttaggca 5580 aggcaaatat ttggtcaaat tgaatctcga tgaatatgat tctaaatatt tacgtgaaac 5640 tgaaagcata tttgtcgatc aaaagactcg agaaataatc aactcaaact atccgacaaa 5700 aaacaaaaca ctctcgccac tagccattga ctattcccac ctaaagacca gtaccgacga 5760 atttgttata gaaataaaat tcaactaaga tataaccaca gaaagatgga caatcggagg 5820 ccagaacaga aatatggccg ccgaatgcgt cacttttaaa ttatatgaag tagttgcttt 5880 taataacaga tcaatagagg aagcagaaga agaactaaga gcaaacgtta aggggaactg 5940 gaatacttca acaaaagaac taccggttct tacaagaagt aatcaggaaa gcagggcctt 6000 ctctaaacgt agagccgtgc ttccaaattt tgagtatcca aaaagtcgtc taaaaccgac 6060 atcctccagc tccttaaaac tttcagaagt acaatcacga cgtaaacaat tggttcatac 6120 aacaccaaag atagagctcg aaatagaaac aacaactttg atccctctaa caacatccat 6180 tggagaaaac ctgttagaaa caacaacatc gatgcaaagc gagtacgatg agtactcata 6240 cgagttgctc aatatgtgta gaggtgataa ttgtagaaag agaagacagg ttctggctat 6300 tgcagctctt acaagcctta tcatcggtgg tgttggcgtc ggtagcgccg ttatagaaca 6360 taatgagaat aagagtaatg aattgagaat caaacatgat gaactcctct cgtcaaagtt 6420 agcagccaca actgagacca agttagaaga ggtgcacatt caaattaata aacaaaattc 6480 agaaatagta aaggctttcc gtcaagattg ccagacacta gatgaacttt caaatgagca 6540 atttcttaat tctattgtta acgaatttga agattttaaa ataagaataa caaatgaaac 6600 tcttaatctt aaaacaagag cttcttttga gcacaataaa gctcttgaaa ccgtcatcaa 6660 ctcgtgcata gtcttaaacg gacccagttt tgaagctgct tgcagtgact ttattgaatt 6720 gacatctgtc agagcatctg aatacgctcc gaagttcgat catgaaaacc ggctttcact 6780 cgtactaaaa gtagaattcg cgaacccaga aatggaaata ataagccaaa cttatgaagt 6840 tctatctgta cctgtaccag tgggaatacg tgaatctaaa tattattaca tgaaagtctt 6900 aacccctaaa cgtgtagcat tcagcgaccg agttaataaa gcctttagta ttgatgaatg 6960 caaaacaaat tttctaatga aaaacatttt ttgcaatgaa aaagatttgt tttacaacaa 7020 aaattttgat gcatgcgctg caagtctgat gactaatatc aacggtaatg catgcgaaca 7080 cgaaatactt cactctaata tggactgcat gttcagcaat aatgaagact ttataatggt 7140 cggacatttt cacacgccta gcgtacattc atcaagaaaa tcttcaatta tcccagtagt 7200 aaaggtagga tcgcaaaaag tcgatcacgc ctctaatata acgatttttg agcgcattga 7260 ctacactctc ggaataacat gtttatcaag ctcttactcg gtaacagggc gcagatggac 7320 gattaataaa atcgtcgatg tagatttccc aaacttaggc gctcaaacat ttcacataag 7380 tgaccatcat ttcgataata tactaaacca aattaacaga acggccctga tgaataaaac 7440 agatatcgaa gcaataagca aagaatttca atacggagca aaaacaggat atgacaaagt 7500 ggacaaactt tttaacactg cgaggccata cagccatatt gttttatgga gtactgccgc 7560 tgtcgttagt gccatcttaa tatttacact tttcgtctat ttttgcgtcg gaatgaacga 7620 attcatgaaa tgctgttgcc caaaattatt caaattaaaa aatttgcgag tagttaacaa 7680 tgaaaccatt atttgggacc gagatttgaa tattgacaac tcattagatc gccatcgaca 7740 aatgcgcgag aataatagtc aatctaatag gacccaagag cgtagaatac cacctgaaat 7800 tcccatgaga agaacgaggt ttctagatag aaatcgtcct caatcatcaa ggagccttca 7860 aatcgaagac gaattatgaa cgatccttct cagctcaaat caaccatcat gtttaaaact 7920 ctcgcaggcc tagaaagaaa aaatctaatg taccaaatac ttcttatgca taatatctca 7980 gcagcaccga gcatggagca aatccagcaa tttgaactcg aatcccgaga tgtatttgaa 8040 gccatttaca attacttact taactccatt gttgataaaa acgaattcag gccaatcatc 8100 cctctgttca ataggtcgtc aaaacccata atcaatcttt acaaacgcac agaaaacatt 8160 ttgctaaatc atttttcaag atcagctaac cttatgctaa caagctctcg ttcggaagaa 8220 acaattaatc ttatctccca taaaccacta cctgctgaac ccttaaacaa ttctgatgta 8280 atctatgaag acccaagtac cctaagcttg acaaagtcag attctagctc atctcaaaca 8340 atcatcatga agccccttaa agaaattttt agaaaataaa gcgaatcata caacaatttg 8400 agagtaaaac tcgcatcaat ctcagtcaaa atgaaatttt actgcttttt catgacaaca 8460 aaccttcacg caagcaatca acagccaaaa ctgctatttt gtacaaaccg atctgtcggt 8520 agaatagtca atcccatttt tatagtaaat ccggctctaa atcaaaccta atcgacgaac 8580 caggaacttg agcttcaatc tgttatggga aacttatttt gagggcatca aattcacatt 8640 atcaacggga ccttctatac cgctcaacca ctccaacgcc aaaaacctaa ccaccctcct 8700 aataatccca ttgataaacc aaaccagaaa tgcccttttt tgttcaagaa tgcaccaaat 8760 gcagcaatca tgaaagaaga gtttgtggaa acaagttcca cgcactctgc gccataaaat 8820 aaaagcaaat ttttgttcaa agcacaatta tcataacgtg ataataataa taataataat 8880 aataataata actttaaaat gtaccaatta taattcccgc aaaattttca tcgctcatac 8940 aaaaaaataa aatgagaatt actttaaagt ttaataatgt tgtaaatcaa ataattattc 9000 aaaaaaaaaa acaaatttgc caaaatacaa atttataaaa aaaatttttt ttctccaaat 9060 ttttcgaaac tagaacacgt tattttgcac tgcgcacgcg agttctaata aacgttaatg 9120 aatagagggc gctcgcgtac aatttaaccc actgaccgaa gcacgctcaa aaatgaccaa 9180 ttttaataaa atcacaatat ttatcaatat ccctcgtcac atggctctca aactcagttt 9240 taatcttgtt ttaaaattat ttagcataaa atgcacaact acgttgcaaa attcttctcg 9300 aacttcttcg ctatagcgta ctgtgttttt tgtaatataa aaaccagact gacgagtgcg 9360 gtcgaaaaca cgtttgtgtt ggccttcacc agttcatcca tgcaaaagaa gacaggtttt 9420 gcggaacgac tgcaaagcct ggcgaaaaca tcattgatga tcaggcctac gacattcccg 9480 gatttttcac tcagcatttg gaacgactac tgggagcgca gcagtgaagt gacaatcatt 9540 tttaacaacg aagactctgc gctagaatct gttccgtaaa ccctctggaa aaaataaata 9600 gtatggtgca gaagatggtg aaaaaaggga cacttaaaag cccttttacc agagctcaca 9660 agatatttga ggaagaatca agcgaaaacg aaagcgaaga cgaacgatac aacgaagtta 9720 taacgatctc cagttcagaa gaatccgagc ctgagtcaat cgactaaaaa accatatctt 9780 actgtttctg tacactacca atataatcaa aaaaaaaaaa tgataaaaac cgaacacaaa 9840 aaaaaatata cttgaaaaac tacataatct caaaaaaaaa aaatattaaa acggtggaaa 9900 aaatcggtgt ttttcaaaaa aaaaaaaata agttcggttt tctcaaaaaa aaaataaata 9960 aaaaaaaaga ctcgcaaatg cgacaaaaaa taacaaattc agctttcagc agcattggaa 10020 actacctgaa gcttataaag ccttcaaaaa ctggtaaaat tgacaattta tcgtcaaaga 10080 aataattacc attttcagcc gtcagctcga aacaaacaac aaaaaaatga aatcaattca 10140 tcaaaaaaaa aatcgtcttc tgtaatctca tttttctaag ttctcaaatt gttaaaaaaa 10200 aaaaaaaaaa acaaacaaaa atttcaaaat ttcatatttt ttgtcaatct tataaatcaa 10260 atatttaata aaaaacatct aatgtcaaaa ttatcctttt gcttctctta caaaaaatga 10320 aaaatattta aaaaatatat cgaatttcaa aaaataaaat tgaattttct tagaaaatga 10380 gcattacgac aagcaaaaac gcaactgagc cccagctcat cactgacgag ccatcaggat 10440 ttagccgaga aagaagaaac gtaaaaccta caaaagctct tacttttcag ctctttaccg 10500 atttaaatgg ctctgagggg tcctggaaat caagcataac caaagagaaa ttctatctaa 10560 tcttagaaaa aatggaaacg gtagttaggg aattcggcca aacaaaagag gaggggaagg 10620 gactagaaac cgatgattcc gatctcaaac actatttgag aacaacgtgg aacgactgcc 10680 acatgaggga ccaagctccc gaaaacatcc gtaataaatc tagcaaactt aggcaaaaac 10740 tctttacctg tcgctttttc gaattatgtg aatatcacgg taaaacagta tcaagagata 10800 cagatgtaga tgacacaaga aaaagagagt ccgctctccg atataaaaaa ttttgcgatg 10860 ctgttaaaag caacccctta aaagtcgaat gtgcagttga agcgcttgaa gaaattgcac 10920 gagcaaaaga gcgtcttcca gcaaacgcac ttccactcat cttatcgcta gagaacaagc 10980 tcctttttcg tgacaaccag atcatcccct tacccttaag cctatttgga tttaaaatac 11040 ttcaatcaag cataatagga cttctcttat gtgacccatt gttcatgacg atcctcgaaa 11100 atgcgccacc gccccgtaaa ataattcgta aaggctttag cgcggcccag ttgtattcac 11160 ggtgcattaa catgtggcca tcatatttca atagcttgat caccacattg aacactcttt 11220 cgggaaataa tatcgtcttc cctcagagcc aaactcctag ccgaccgtta aaaagaaagg 11280 ttgattcgtc gcacttagaa gagagagcag atgacaaaaa atctagaatc tcctaaaaaa 11340 ctgtgattac actctatttc caacttccat attattaacc tactttctct accatttact 11400 gttacaacta ccgtccgatg ttagaattta tccagtttct tttatgggga ggagtatagc 11460 // ID Gypsy-16-LTR_NVi repbase; DNA; INV; 863 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-16-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-863 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 770-770 (2009). XX DR [1] (Consensus) XX SQ Sequence 863 BP; 191 A; 220 C; 251 G; 200 T; 1 other; tgtaacgagt tgagcaccgc agcyaactcg tagcggtatg agtaccgctg cgccgcgacg 60 aaacagcgta gagtcgcgtg cgcgataata aacacgtgcg aattcggcaa taggcgcgaa 120 agagatagag agagcgagcg cgcgggctaa tccttcaggt gcgcgctcag atagctctcg 180 gccttcggaa ggtaaacacg cgacttcgcc tacaattcga aaactttcgt gatatcgtgt 240 gctgtgctgt gtccagcgga tcgaattgta tcttgtgtgt gagtgcgacg cggagtccgg 300 gtaatcctga tcctcctgcc tagcggtcca gcgcagtgac caagtcttgc agcctgtcca 360 ggattaggaa cgtcgtcgac gagagagaga aggtttgtgt ggcgcgcagt gcatgggaga 420 gagtgcatgt gcgagtgtgt atcatccggc gcgcgcgacc ttcacgacga tatacgcgtg 480 cgagcgcgat tattctggcg tcgcggcgag tcgcaaagtt gttggcgact cacgtttatt 540 tcgcgactag caaatataac cctagtctta ttattttacg tgatttgtcg tcatttctaa 600 acctcgctcg cttacttccg attcctgttc cttcctgcga ataccgccgc ctccgcgggc 660 gttatttatt agtacttatt taacgcagtt gagcagccga gcacgcgcca ggcggcgcgt 720 agacccagca aaacagaagc agctgagaca gctgcatgca acggaggtgt ttcgcgggcg 780 gagaggagtc ctccgccgca tcgagggaac gcgatccagt cgtttaacga gttattcgtt 840 gatcgaattt acaccccgtt aca 863 // ID Copia-6_SI-LTR repbase; DNA; INV; 242 BP. XX AC AEAQ01011805; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_SI_; KW Copia-6_SI-I; Copia-6_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-242 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01011805; Positions 527 286. XX SQ Sequence 242 BP; 64 A; 52 C; 26 G; 100 T; 0 other; tgttgtaata aatagtttaa aatcagttct tatatgtatt atgtctatgg tcgagtcact 60 cttgatttct catattggta tcactgttcc ctctctataa taccttttta cttcgagtct 120 ttctctcttt tcattcgttc ggtcatcaca cacatatctc tgtactaaaa ccacattgtt 180 gttctttaat aaatcgtctt ctatcaagca tattgatgta ccacaatttt ccacacaaat 240 ca 242 // ID Copia20-NVi_LTR repbase; DNA; INV; 228 BP. XX AC NW_001818714; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 20-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia20-NV; KW Copia20-NVi_I; Copia20-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-228 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1205-1205 (2007). XX DR Genome; NW_001818714; Positions 7155 7382. XX SQ Sequence 228 BP; 52 A; 73 C; 39 G; 64 T; 0 other; tgtcagaagt ataacgctgc tcgcgactag ttcgcaagac gtgcgcctca tgctcggacg 60 acatggcgct tctttctgct tccttctcat tcgagtttcg tccgaccccg aggccacatc 120 tccgctacat acgacactct gatttttgtg tgctcaataa aagtttcttc tcaaacacag 180 gactaatccc ctttgattca tcccgtcaat ccaccaacta ctccaaca 228 // ID NAVIHAT1 repbase; DNA; INV; 5547 BP. XX AC DS265642; XX DT 14-NOV-2007 (Rel. 12.11, Created) DT 14-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE hAT-type transposable element. XX KW hAT; DNA transposon; Transposable Element; NAVIHAT1. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5547 RA Jurka J.; RT "Navihat1: hAT-type DNA transposon from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1172-1172 (2007). XX DR EMBL/GenBank/DDBJ; DS265642; Positions 481866 487412. XX FH Key Location/Qualifiers FT CDS 3616..4878 FT /product="NAVIHAT1_1p" FT /translation="MEQLLGVPNISSSSGINQAQAVFDTLNDWSLLENIEA FT LCCDSTASNTGRINGACVLLEILIDKDLLYFICRHHIYELVLRSTFEIKFA FT KTTGPEVQLFKNFGDFWKSVDQTKFEPGISDTFVIESLQDVKEEILDFCSK FT YSKKDLNRSDFKEFLELVIIFLGGKLPSGITFYAPGAIHHARWMAKAIYCL FT KIFIFRKQYLLDKKDVEVKCRAVCTFIVRVYVHAWFCTPFAAQAPNQDLKF FT LKCLYEYRRIDESISDCAVRKFMNHLWYLTPQLTALAFFDFTISNKEKLKM FT CEALQSNTSAFVYGKRILANEQNLDKIVNSSVSDFICKDTYETFRRLKIDT FT TFLEKNPSKWAKDRNYTHGLEVVKNLRVVNDTAEVKLITEFNNLLTKDEKQ FT LQYLLPVIKDYRSLFSDSKKETLMRPI" XX SQ Sequence 5547 BP; 1961 A; 808 C; 909 G; 1869 T; 0 other; cgggcaaata tagcttcttt atacctcaaa aaccatgcgc atataggttt tgaacccata 60 gcatcaattt ttcaagtgcc tcaaagcacc cgaagtttct tatgggattt catgttaaaa 120 aagggtcaaa acgtttatcg ttctgaaaac ctgtgaaaat tttggaattt taactttgga 180 agtatatttt cccatgtatt ttgagccgct gaatcgaatg gtgctattag ttttcgccgt 240 aatagtgaaa taaaaaagtt ataagctaaa atcgttgaaa aactcgaaaa atcgcaaaaa 300 aacgtgtaaa aatggacttt taaagtttgt aatcagtgat aagttggaaa gtttttttca 360 aaaacataca tttttatgta ttttgaactc attaaacgaa tggtgcaatt ggcttttgcc 420 gattcaattg agaaaacaag atataagcca aaatcgtcaa aaaggtcgaa aatcacggta 480 gtgaccaaaa aatgggtttg ttacaatgtt actaatacga gatcaatttt tttatacata 540 aaaagtatag ttatacatat ctgaaaaatt cttttgtata ggatacgggc tatattcaaa 600 aaccggaagg gaaaccggaa tggctgttgt cgttccagaa ttcttattcc ggattttcgt 660 tccatcgatg ggaccgcgat ctaaagcatt ttcggatcag cgtagggatc agctgacacg 720 aagtaacgtg aatacaaaat ggtagagttg aatgtataca tattatatta tacaacgata 780 aacaataaac actgtacgtt ctctctatca tcggactcag agcaataagc actgcacggc 840 gagcatatag cacatgtaaa cataagaata tttatattaa acttattttc ctatttaacc 900 acggctcaat ttatttcaac aggtactttt taatttcgag atatattgct ccgcatgaac 960 aattttattt agtttttcaa caatggctta taatttacgc gaaagagtca atcacattgg 1020 atattgtact aatcaaatag ttgcaactaa taatctacca acgataaaac aagtattagc 1080 tgctttattc tataacctac gaagagtttc aaagtctgta cgtgaaagtg cgaagttaac 1140 cattgaagag tgtatcatta tttggaagaa agctcgaatt cctacgcaag aagataaaaa 1200 atgtgtcatt aaactggagg ctgaatatga aagatggaga aaaatacaaa gaaatgcttc 1260 acgtagatca gacactcaaa taaaaaatga acaaacatat aaagaaagca taaacaaatt 1320 atttgacatt gcttgcgcag atgctttgaa gaagatggaa gacgaatctg ataaacagtt 1380 tttacttgat cagagaggta attatatttt gtttttgaat acttaattat gacacgactg 1440 aacatggaaa atagaaatgg tggttatagt cgatgctgta gaaagaactg atgatctaca 1500 tatagttaag ctttgttcaa aagtgagttc tacggagagc tgaaattttg cattttaagt 1560 gatatctttt attctttcta agatatagac ttaaaacttt acaggtaagt gtaatttaac 1620 ttaattaaca ataaaaaaga agttatagta cttcaagtcc ttgaaatata tcagcaagtc 1680 ctaaaagcat gaagtttgac aaaaagttgg ggttgcgctg atattgtata ttacattttt 1740 tctcatatct ctggatctat gataggtaga gatttgagac tttgcataca ggcctatttt 1800 actttaatat aatgagaaaa acagttggat taaatttaac aatttttgac atttatttat 1860 taaaaaactt gacattttga tattggtcga aattgccgct aaggtaggtt ttttcgtcaa 1920 gtgaattttt atttatacgt ttaattaatg caaatagttc ttaaaataac taaatggtta 1980 gtaaagatat tttattacaa aaaaatcgca aaaacgaccg gccacgactt aattatacat 2040 tgaagaatct ggtatctgct ttgcactttc tcataagtga agcattttaa tagcatatgc 2100 agcaaaactg cattgcaatt gcaataatta agtcgtagta ggtcgttttt gcactaattt 2160 ataataagtt attcaataac aatcagttct tttaagagct atttgcatta attaaatgta 2220 tgaataaaaa ttcactggct gaaaaaacat acctcagcgg caatttcaac caataacaga 2280 atgtcaattt tttagataaa taaatgtcaa aaattgttaa atttaatcca actgtgtttt 2340 tctcattata ttaaagtaaa ataagcctgt atgcaaagtc tcaaatctct acctatcata 2400 gatccagaga tatgagaaaa aatgtaatat tcaataatat cagcgcaacc ccaacttttt 2460 gtctaacttc atgcttttgg ggctcgctga tatatttgaa agacttggag tactataact 2520 tctttcttat tattaattaa gtaaaattac acttacctgt aatgttttaa gtctatatct 2580 tcgaaaaaat aaaagatatc acttaaaatg caaaatttca gctctccgta aaactcactt 2640 ttgaacataa ctcagcttta tagcgatcac gccatgcagt tgcaataaac aatttttttt 2700 attaaaggtt ttctgaattg atttaaaaaa acaattttgt aaatttttta aagtggcctg 2760 ctctctcaaa aaatacagtt tgaaattatt gtatgtacaa aataattgat taattatgaa 2820 atactttttt ttaaacttgt ttatttcatt ttttattaat gaccttgatt gtagaaggtt 2880 agattagaat tacaaaattt tgtttttcag ctctgcaaag aaatgaaact cttagttcca 2940 gtacctcaac aacttataca gacacaagta agccatcacc aatggaagtg gactatgaaa 3000 gcctcaatat tataaagaat gaaatttcgg ttaccatcga ccaggacaag gtagtcgatg 3060 cttcattaaa aatgcaaaag gatgattctc gtagtatggc gacaaaaggt ttgcattttt 3120 gaatagtcag ttgatatcag tttttactaa tagagaaaaa atgtatttat agcataattt 3180 tattttatta acctttaatt taatctcttc tttaatttta tttacagaag atttacatga 3240 attgcttgaa acagagtttg aatctccatt atttgaaagt acacctcctc tgaaaaagct 3300 actgaaccaa ttatgactgc gaaattggct acagtattag ataaatgtaa aattagtaaa 3360 agagacgtaa tgcatttgtt gatggccgct gcggaagcat tcatcattga tacgcgcaat 3420 ttagtattaa acacatcatc tattcacagg gttgatcaaa attttcgcaa agaacgatat 3480 gaggcaatta aaaaacttgc tccacaactt ttttctggtt taccgagcac catccactgg 3540 gattgaaagt tgctttcagg atatttacga agagaaacta ctgatagact agcgattatt 3600 attacaagta atggaatgga gcaactcttg ggtgttccaa atatctcttc tagtagtgga 3660 ataaatcaag ctcaagcagt attcgatact ttaaatgact ggtctttatt agaaaatatt 3720 gaagcgcttt gttgtgattc aacagctagc aatactggtc gaataaatgg agcttgtgtg 3780 ttattagaaa tattaatcga taaagatctt ctctatttta tctgccggca tcatatttac 3840 gagcttgttc taagaagtac ttttgaaata aaatttgcta agacaactgg tccggaggta 3900 caattgttta aaaattttgg agatttttgg aaatctgtag atcaaacaaa atttgaaccg 3960 ggtattagtg atacttttgt aattgaatct ctgcaagatg taaaagaaga aattttagac 4020 ttttgttcga agtattcaaa aaaggattta aacagaagtg atttcaaaga atttttggaa 4080 ctagtcatca tttttctagg cggaaaatta cctagtggta taacttttta tgcacctgga 4140 gctattcatc atgctaggtg gatggcaaaa gctatatatt gtttgaaaat attcatcttt 4200 cggaaacaat atttgctaga taaaaaagat gttgaagtaa aatgtagagc cgtttgcact 4260 tttatagtaa gagtctacgt tcatgcctgg ttttgtactc catttgctgc tcaagcacca 4320 aatcaagatt taaagttttt aaaatgtttg tacgaatata gaagaattga tgaaagtata 4380 tctgactgtg ctgttagaaa atttatgaat cacttgtggt atttgactcc gcaactaaca 4440 gcactagcat tttttgactt tactatttca aataaagaaa agttaaaaat gtgcgaagct 4500 ctacagtcaa atactagtgc gtttgtttat ggaaaaagaa ttttagccaa tgaacaaaat 4560 ttggataaaa tagtgaattc aagtgttagt gattttattt gcaaagacac ttatgaaact 4620 tttagacgtt taaaaattga cacaactttt ttagaaaaaa atccatcaaa atgggccaaa 4680 gatagaaact acactcatgg tttggaagtt gttaaaaacc tcagagtcgt gaatgataca 4740 gccgaagtaa aattgataac tgaattcaac aatcttttga ctaaagatga aaagcaatta 4800 caatacctat tacctgtaat aaaagattat cgaagcttgt tttcagatag caagaaggaa 4860 acgttgatgc ggcccatatg aatgaatatt ttgtttgaat atacaattag aacatttttg 4920 tactgttttt taaattttct tcagaatctc gcttcgacag atcgtttttc tttatctcag 4980 cttataaatt ttcctttatt tttttaaact aataagttta tgtgattcag tagttcaaag 5040 tataaattag tagaaaatat tacaaatcag ctttttggcc tttttaccgt gatttttgag 5100 ttttttgatg attttaactc cttttctcga ttgaatcggc aaaagccaat tgcaccattc 5160 gtttaatgag ctcaaaatac ataaaaatgt atgtttttaa aaaaaacttt ccaacttatc 5220 actgattata aacgttaaaa gtccattttt acacgttttt tcgagttttt caacgatttt 5280 ggcttataac ttttttattt cactattacg gcgaaaacta atagcacgat tcgattcagc 5340 ggctcaaaat acatgggaaa atatacttcc aaagttaaaa ttccaaaatt ttcacaggtt 5400 ttcagaacga taaacgtttt gacccttttt taacatgaaa tcccataaga aacttcgggt 5460 gctttgaggc acgtgaaaaa tcgatgctat gggttcaaaa cctatatgcg catggttttt 5520 gaggtatata gaagctatat ttgcccg 5547 // ID BEL-49_AA-LTR repbase; DNA; INV; 596 BP. XX AC supercont1.247; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-49_AA_; KW BEL-49_AA-I; BEL-49_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-596 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.247; Positions 1358910 1358315. XX SQ Sequence 596 BP; 213 A; 98 C; 105 G; 180 T; 0 other; tgttcggaat atcgggacaa gcctgtaccg ccgagtatac tgaaccccct gttgaaggta 60 tgacgaattg cgaccaacga tgagagaaac gtctgtcagc agaattacca atactaatat 120 gataagagat gatgagaagg aacaattaaa aagcctggat tattgaagaa tcacgttttt 180 cataagtgat gcgattgaac tattgaattc taaagtataa ttcgtttatt ttcatatata 240 ttcgatattc ttagagagtg ctaatagttt tatatttgat acctatccac actgttaaaa 300 gttatacgat ttagttatct aaaaaacaag atcctgaaaa gcagagtaag ttaacaatgc 360 atatattatt aacgtataaa ctgatctagc caatcattca atcttaaggt caatcagacg 420 accgttccct cctgtacaaa gaattaaccc tttggctgcc tgagaaagaa gaaaaacttg 480 ttgtaagttt aaatgaattc ttaatttggt aaaaatgact aaaataaatc taattatagc 540 ttcaagctgt caacaacgag actgctctga ggactgtttg tagttctatc cgaaca 596 // ID CR1_Ele26 repbase; DNA; INV; 3382 BP. XX AC . XX DT 26-OCT-2010 (Rel. 15.1, Created) DT 26-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele26. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3382 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3382 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (26-OCT-2010). XX DR [1] (Consensus) XX CC [2] Consensus update. This consensus is generated from 5 CC sequences with >94% identity, and ~98% identical to the original CC sequence in [1]. Likely 5'-truncated. XX FH Key Location/Qualifiers FT CDS 189..3278 FT /product="CR1_Ele26_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRPTFSPHSGRTASSTMEAPTPPSTVEFLQPAFFSRP FT DPVVGCGEGVFQPVQSGKYVENADCLLPDISTPSSLRIYYQNVRGLRTKID FT SLFLAVSDLDYDVIVLCETWLDDRILSSQLFGNLYSVFRTDRSALNSDKRS FT GGGVLIAVSSRLSCSLDSAVISNELEQLWVKIVLPRNNVSIGVMYLPPDKK FT NDIGVIHRHIDSIGAVQSLLQENDMALQFGDYNLSNVFWSSNXDDNVSIDL FT TRSRLNPASSTLLDGFCFHGLSQVNTLVNGGDRTLDLVLVNDVVMQNFSLF FT EAIEALTVIDIHHPPIELVVSCPLPVTYEALFDHTGFDYKKANFESLNSAL FT YLIDWSHLDCIDDVNESVAFFTNSVMQVVAEHVPLRRPPSKPPWSNNRLRF FT LKRRRSAALRLYRSHRSQESKLHFNRSSSEYRSYNAYLYARYKRRTEQNLR FT TNPKQFWSFVNAKRNEDGLPTSMYLGDLSADSASDKCELFAMQFRRAFNDS FT ISQQNQADHALQDTPHDVFSYEMFYVTEQEVTRAISKLKLSYNPGPDGIPP FT ALLKRCSATLLRPLTKLLNQSLRLQQFPACWKKSFLFPVHKKGDKRCVSNY FT RGITSLCACSKVFEIIINDSLFDCCKNYISTDQHGFFPRRSVESNLVEFTT FT LCIKAIDAGKQVDAVYLDLKAAFDRVDHRILXQKLRKCGVSTGFSNWFDSY FT LTDRSLCVKIGSCESASFTNISGVPQGSNLGPLLFSLFINDASLILPPGTR FT LFYADDAKVYMIVNGLNDCIRLQSLLDSFEAWCVRNFLTLSIEKCQVISYG FT RKRNPIRFPYKLSGTTLERVQRVRDLGVVLDEELTFNYHYDDAISRANRQL FT GFVLKVXDGFRDPLCLKSLYCALVRPILEFAAVVWCPFHASWITRIESVQR FT KFVRRALRNLPWRDPRNLPPYHDRCRLIGIXTLENRRHVAQTAFVSKLLKG FT EIDSPSLLADVNVYAPERTLRRRHFISLGSRSTLYGQHDPVRFMTAKFNEV FT FHLFDFNVSITTLRNRFVQYFGRN" XX SQ Sequence 3382 BP; 875 A; 793 C; 722 G; 984 T; 8 other; gttctggcga gggggtcttc caccgtttac actcaggcaa gtaccttcaa gttgattgtg 60 attcctgcac tgaagttgtc tcgtttgata gcgcttatga tgttgacctt tcggcttcaa 120 acgaacttcg cacgtgcaat cgacctccac aaacaacgcc ctcggtgctc tgkgatcatc 180 cttcgaacat gcgaccaacg ttttctcctc actcgggacg cacagcatct agcacaatgg 240 aagccccaac gcccccctcg acagtcgagt ttctccagcc agcgttcttc agccgtcccg 300 atcctgttgt agggtgtggt gaaggggtct tccagcctgt gcaatcaggc aagtacgttg 360 aaaatgctga ctgtttactt cctgacatct ctacaccttc cagtctacgc atctactacc 420 agaatgttcg tggtttgcgt acaaaaatcg actcactttt ccttgctgtc agcgatttgg 480 actatgacgt catcgtactc tgtgagacct ggttagatga tcgcatccta tcatcacagt 540 tattcggaaa cctctactca gtgttcagaa cggatcgaag tgctctgaat agtgacaaaa 600 gaagtggggg aggtgtgtta atagctgtat catcgcgatt aagctgctct ttggattctg 660 cagtcattag taacgagctt gaacagctct gggtgaaaat cgttttgcct cgtaataatg 720 tgagcatagg agtgatgtat ctaccgcctg ataagaaaaa cgacattggc gtcattcatc 780 gccacattga ctctatcgga gccgttcaaa gtcttcttca agaaaacgat atggcactgc 840 agttcggcga ttacaatctg tcgaatgtgt tttggtcktc gaatgsggac gacaatgttt 900 ccatcgatct aactcgctca cgcctgaatc ccgccagttc tactcttttg gatggcttct 960 gctttcacgg tctctcgcag gtaaacacat tagttaatgg aggtgatcgt actctcgacc 1020 ttgttctcgt taatgacgtc gttatgcaaa atttttcatt gtttgaagca atagaagctc 1080 ttacggtaat cgacattcat caccctccga ttgaattagt tgtttcatgt ccactacccg 1140 taacctatga agcattattc gaccatactg gatttgacta caagaaagca aacttcgagt 1200 cactgaattc tgccctgtac ttaattgact ggagccatct cgattgcata gatgatgtta 1260 acgaatctgt tgcatttttc accaactccg tgatgcaagt agttgcagaa catgttccgc 1320 tacgcaggcc gcctagcaaa ccgccatggt caaacaatcg gttgcgtttt cttaagcgcc 1380 gccgctctgc tgccctccga ttgtatcgtt ctcaccgttc acaagaatcg aaactgcact 1440 tcaaccgatc cagcagcgag tacaggagct ataatgcata cctgtatgct cgctataagc 1500 gtcgaactga gcagaatctc cgaacaaatc caaaacagtt ttggtcattc gtcaacgcca 1560 aacgaaatga ggatggactt ccaacatcaa tgtaccttgg cgatttgtct gctgattctg 1620 ctagtgataa atgtgagcta tttgctatgc aattcaggcg cgccttcaat gacagtatct 1680 ctcaacaaaa tcaagccgat catgccttac aagacacccc tcatgacgtc tttagttacg 1740 aaatgttcta cgtaacagaa caggaagtaa ctagagcaat ttcaaagctt aagctttcgt 1800 ataatcctgg accggatggt ataccacctg cgctgctgaa gagatgttcc gctacactcc 1860 tgcgtccttt aactaagctt ctgaatcaat cacttcgact acaacagttt cctgcatgct 1920 ggaaaaaatc ttttcttttc ccggttcaca agaagggtga caaacgatgt gtgagcaact 1980 accgtggaat aacttcgtta tgcgcttgtt caaaagtatt cgaaataatt attaatgatt 2040 ccttgttcga ttgctgcaaa aattatatct ccaccgatca acatggattt tttccgagaa 2100 ggtccgtcga gtcaaatctc gttgaattca cgacgctttg cattaaagcc attgacgccg 2160 gcaaacaagt agacgctgtt tatttggacc tcaaagcggc atttgaccgt gtcgatcatc 2220 gaatacttst ccagaaactt agaaaatgcg gagtttccac cggtttcagc aattggttcg 2280 attcttatct tacggatcga tcgctctgtg taaagatagg ctcatgcgaa tcagcatcgt 2340 ttacgaatat ctctggagta ccacaaggca gcaaccttgg acccttgttg ttctctttgt 2400 tcatcaatga tgcatcactg attttgccac ccggaaccag gcttttctat gcggacgatg 2460 ctaaagtcta tatgatcgtg aatggcttga atgactgtat tagactgcag agtttgttgg 2520 actcattcga agcatggtgt gttagaaact tcctaacgct gagcattgag aagtgtcagg 2580 tcatctcgta cggtagaaag cgtaacccaa tcagatttcc gtacaagctt tcagggacga 2640 cacttgaacg tgtgcaacgg gttcgtgact taggtgtcgt tctagacgag gaactaacct 2700 ttaactacca ctatgacgat gccatctcca gagcaaatag acaactcggt tttgtgttga 2760 aggtgtstga tgggttcagg gacccgctct gcctaaagtc gttgtactgt gctctagtac 2820 gtcctatttt ggaatttgct gcagttgtgt ggtgtccttt tcacgctagc tggatcacgc 2880 gaatcgaatc tgtccaaagg aaatttgtac gccgagctct cagaaacctg ccatggcgcg 2940 atccaagaaa cttgccaccg taccacgacc gttgccgcct aattggaatc sagaccttgg 3000 agaacaggcg acatgttgct caaacagcwt ttgtttccaa actgctgaaa ggtgaaatcg 3060 actctccttc cctgttggct gatgtaaatg tgtatgcacc agaacgaaca ttgcgtaggc 3120 gtcacttcat tagtctagga agtcggagta ctctttatgg acagcatgat cctgtaagat 3180 tcatgactgc aaagttcaac gaagtcttcc atctgtttga tttcaacgtt tcgattacaa 3240 cacttcggaa tcgtttcgtg cagtactttg gtagaaactg actgttcatg tttgtgttat 3300 ttataattag tttatgttag tttattcatt aagactacat tatgtcagat ggattatttg 3360 aaatacaata caatmcaata ca 3382 // ID Gypsy-18_HM-I repbase; DNA; INV; 7800 BP. XX AC 1101284919122; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Hydra magnipapillata genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_HM_; KW Gypsy-18_HM-LTR; Gypsy-18_HM-I. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-7800 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Hydra magnipapillata genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; 1101284919122; Positions 8056 15855. XX CC Positions [4088-4567] - Integrase core CC 'ATAT' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 166..2247 FT /product="Gypsy-18_HM-I_1p" FT /translation="MSRLRSGKICWYTEPNPCAHIAANPYTSLNNSSDSTD FT LDLSDTKSFKTETSRSPAKNNMSLEQILEHLALKPSQELHPKIFKGAEVED FT ITEWLASFCRIALHNQWKDEKKLLAFPHYLEGSALVYYETLPALTRNDYHE FT LQEQFRRHYDNVNIAWNRKMELFGLVQDTDLNTYINTLDKLSHQLGIDDET FT KLNLFIKGLKPHLRDALKLKQPVNYQAAVAFAKLQNTVATDSTMSRLEAKL FT DALLNPARQPIAAVTQTEIPNSKKQIEGLQAEVESLKRFGNLPTRNERFRP FT IGRNLRTSDGQVICNKFLRVGHTQRSCNEQISSQNNIHHAPSRTNRWKNNT FT QNLSSSRCNYFSNNPAYNEKPKNIPHFDNHRQNQNYTNTLEFYPDEEDPCS FT EIWGKVYDKQVIILMDTGAKCSVINSTIYDSIKNKAAEPNPYFGKLKTANG FT TPLHIVGQFVCEIKIGTNTFKQKIMVGKNLTHPLILGYDFMHRYNVGLDCG FT QHLVGFGEKFSIQMIKEVVKDKFATLSNIEKLNSEAVIKRRSSNPRLTFDL FT TTNPNSDTYNTAGIRFKNKDDIQPPKPCNVSPNCDFRTITENESKSERNQK FT KYNTSDSTSIREENMLKNELSKVCARWNMNIAPGDETILTIHQAMESDVDC FT LFIPEANLFDQNVHEFPTITHGGSNQVKLHIVNSTLNYTTIGKIK" XX SQ Sequence 7800 BP; 2666 A; 1534 C; 1431 G; 2169 T; 0 other; tttggaggca ctacccgggt tcaagtagat tttatctctg cttggaccga tcaatctatc 60 tacgtaagtg aagattaact tttgtttatt tcgttatttt tttcacatcc ggttattcaa 120 acgaacccgt agctatattg actttattga ctttgtgatt tacgtatgag ccgtttacga 180 tcgggtaaaa tatgctggta tactgaacct aatccttgtg cccatatcgc cgctaatcct 240 tatacttctc taaacaactc atccgattct actgacttag acctctcaga taccaaatcg 300 tttaaaaccg aaaccagtcg ttccccagca aagaacaaca tgtcacttga acaaatttta 360 gaacatttgg cgctaaaacc gagccaagaa ctacacccga aaatttttaa gggcgcagaa 420 gtagaagaca ttaccgaatg gttggccagc ttttgtcgta ttgcgcttca taatcaatgg 480 aaagacgaaa aaaaattatt agcattccca cattatttag aaggcagtgc cctagtctat 540 tatgagacac ttcccgcgct cactaggaat gattatcacg aacttcaaga gcaattccgg 600 cgacattatg ataacgtgaa catagcttgg aacagaaaaa tggaactttt tggattagta 660 caagacactg accttaacac atatataaac actctcgata aattaagtca ccaacttgga 720 atcgatgatg agactaagct gaatttattt atcaaaggtc taaaacctca cctacgagac 780 gctttaaaac tcaaacaacc ggtgaattat caagctgcag ttgcttttgc aaagctacaa 840 aacactgtcg caaccgactc aaccatgtct cgtcttgaag ctaaactgga cgcattactc 900 aacccagcta gacagccaat tgctgctgtt acccagaccg aaatacccaa ttcaaagaaa 960 caaattgaag gtttacaagc cgaagttgaa tcattaaaga gatttggcaa ccttccaacc 1020 cggaacgaaa gattcagacc aatcggacgt aacctacgta caagtgacgg acaggtaatt 1080 tgcaataaat tccttagagt aggacacaca cagaggtcat gtaacgaaca aatttcctca 1140 caaaacaata ttcatcacgc accatcgcga actaaccgct ggaaaaataa tacccaaaac 1200 ttgtcttcat ctcgttgtaa ttatttctca aataatccag cctataacga aaaaccaaaa 1260 aacataccgc attttgataa tcatcgacaa aatcagaact acacaaacac gctagaattt 1320 tatcccgatg aagaagaccc atgctctgag atctggggga aagtttatga taaacaagtg 1380 attattctta tggatacggg agcgaaatgc agcgtgatta attcaaccat ttatgatagt 1440 attaagaata aagctgctga accgaatcca tattttggca aattaaaaac ggcaaatgga 1500 actccgcttc atattgtagg acagtttgtt tgcgaaatca aaattggaac caacacattt 1560 aaacaaaaaa ttatggttgg aaaaaacttg acacacccgt taattttggg ctacgatttt 1620 atgcaccgat acaacgtagg tcttgattgt ggtcaacact tggtaggatt tggggaaaag 1680 ttcagtattc aaatgattaa agaagtagtc aaagataagt tcgctactct aagtaatatt 1740 gagaaactca atagtgaggc tgttatcaaa agacgaagca gtaatccgag gctcacattc 1800 gatctcacca cgaatccgaa ctccgacact tacaatactg caggtatccg tttcaaaaat 1860 aaagacgata ttcaaccacc aaaaccatgc aacgtatctc ccaattgtga cttccgaact 1920 ataaccgaaa atgaaagtaa atctgaacgt aaccaaaaga aatacaatac ttctgactca 1980 acaagtataa gagaagagaa tatgctcaaa aacgaacttt ctaaagtttg cgcaagatgg 2040 aacatgaaca tagcgccagg agacgaaacc attctcacaa tacatcaagc aatggaatca 2100 gatgttgatt gtctatttat accggaagca aatttgttcg accaaaatgt tcatgagttc 2160 cctaccatta cccacggcgg atctaaccaa gttaagctac atattgtaaa ttcgacccta 2220 aactatacaa ccattggtaa aatcaagtga ttggagtgtc attccgttac ccaccaaagc 2280 cactgagaaa tatgaaacta aatgccctgc tgaattgatg aatttattag aagtaaagac 2340 taacaccgca acgttaagtt caaaagaacg ccttcaacta ctgaatttgc tagaacgcaa 2400 caaagatgtt tttgaacaag gattacacga aattggacat actgatgtga tacaccacta 2460 cattgacact ggcgatgctc ggccaattaa acaaagagct tattgtctgg cagaatctca 2520 aataaaattg ctggacaaac atatagaaga gatgataaca aatgatatca ttgaaccaag 2580 tgtcagtccc tggtcgagtc cggtagtcat ggtaccaaag aaagacggaa cttttagagt 2640 gtgtatcgat taccgtaagt taaataacgt cacaaaaaag gatacttatc cactacctcg 2700 aattgacgaa agcttggata tgttacatgg tgcatatact tttcttcatt ggatctgcta 2760 agtggatatt atcaggtgca actcgatgac gaatcttaag agaaaactgc gtttatcacc 2820 cataaagggc tatatcaatt taaagttctt cctttcggtc tctcaaatgc accatcaacc 2880 tttcaacgaa tgatgaacca cgtacttcga gaccatttac acacaaattg cttgctttac 2940 ttagatgata ttttagtata ctcgaagacg tataaggaac accttagcca tatacaaaag 3000 ataatctatg ctataagaca agctggatta aaattaaata ttctgaaatg caccttctgt 3060 acaaataaag taaagttttc gggacattat attaactctg aaggtattaa acctgatcca 3120 gaaaatatcg aggcgttaaa aaggttatcc tgtacccaaa aatctaaagg atgctagagc 3180 atttattggc ttatgtagtt actacagaag atttgtaaaa ggatttgcaa acattgcgtt 3240 acctctaagt cgattattaa agaacgattc cgaatttaaa tggcagactg aacaacaaaa 3300 aagttttgat caacttaaac aatccctcgt aacgtcaccc aatttagcct atccagactt 3360 tgagaaacct ttcatcatat attctgatgc aagtggcgaa gcgttaggat agctaagctt 3420 tggctgctgt tgccgcagta cagaaatgta gacctaatgt atatggcaaa catataaaat 3480 agtaacagat cataactcac taaaatggtt aatgtcaatt aaggatccga ctggttgact 3540 tgcagggtgg agcctaatat tgcaaggtta caacttcgaa atagaatacc gatcggggaa 3600 agtgcatggt aatgcggacg gactttccag acggagttat gacacagccg caacattgac 3660 agtaccaggt aaaccaatcg accaagttgt aaaagaacaa cgtaccgatc cctattacgc 3720 taatctagta atatatctag tcaagcaaga actacccaaa gacttaaaag aagcaaagaa 3780 agtcttagcg tcggaaagta actaccactt agatgaaaat ggtttactat ttcatcataa 3840 atcaaaaacc tcgagaaatc tcacccagtt ggttgtacca cggactttaa gaactgagtt 3900 aataacctgg gcccacaatt aataatgtgg aggtcacttt ggagtgacga aaacttatga 3960 aaagattaga actaattact attgggttgg tatgtacaac gacattcaaa cctgggtaaa 4020 atcatgtacc acttgtgctc aacgaaaacg aaattcggct accagcaaag cgcccctact 4080 gccatttcct gtagaaggac catgggatgt tattgcagct gattgcttag gaccttttcc 4140 tctttcgtta aagggaaacc gctatatagt tgtttttggt tgtttatttt caaagtatgt 4200 tgaagctttt gcagttccca ctatcacagc aacgtctatt gcagagttat ttgtagacca 4260 tattgtgttc aagcatggag caccaaggcg atttctaacc gatagaggta gtaattttac 4320 gagtacattg gtaaaagaag tgtgtgaaat attaaatgta aaaaagtctt ttacaaccgc 4380 gtattacccc caaaccgacg gttttgtaga acgcgtcaat gggattttgg ctcagagtct 4440 ttccatgtat gttgcctcta accaacgaga ttgggatgtt catcttccag ccgctgtata 4500 tgcgtataat actagtatat ctgctagcac tggaaaaacg ccctttaggt taacatatgg 4560 caaagacgct ctctacccca tgatactacc ctactaccaa gaaaggaccc aacagacaac 4620 actgacttac tttttcaacg atttgtgtct gacttaagat tattacgtaa ccttgcgaag 4680 gaaaatattc aaaaagcaca gacaaaaatg aaactacatt acgatcaaac aagtgaatca 4740 tatccgtttc aggtcggaca taaagtatgg gtatatacac caatcacaaa aaaaggttta 4800 acaaaaaact tacaagtttc tggcacggtc cgtttcgttt aattgaaaaa acatcaaccg 4860 tgacgttcac agtggaaaat atgaataata aagaactgcc tacgccaatt catgtgtctc 4920 gttttaaaca atggtttggt tatgaagaaa aaccaactac agacctagct ctaaacactg 4980 aagttctcac cgaagccaca gaaacgatgg acgaattctg atttaaggaa actaaccata 5040 aaatacagga agatgttaat ctcgcccccg aaattgaaca tcaagttgac agaatggaga 5100 gtctcgattt cgatatatat aaaatgaaga aaattgttaa aaaaaggaca agaggaaaaa 5160 agatacaata tctaattaaa tgggaaggtt atccatccag ccaaaatact tgggagcccg 5220 aagagaactt ttttgatcct caagtaatcc agaagtatat taatagtaaa aagactagca 5280 aaagtgtaca tatctcagct ataaccacga caaatctgtg catacgtaaa aagaaaaatc 5340 aaactaattg tagggtcatg ccgaaagtaa ccctcagagt aaacaaactg tggaaattat 5400 ggtgtctggc tcttttacca actttcatta ctggcttatt tattggcgaa gtatttgact 5460 gcgctaaagt gaaacctatt ggaatttacc aattaccaaa agttacgtct tgtaatcaca 5520 atatgcatac gctaaatgat tcggtgaaaa cgtttatggc agatgtctat gcttatagac 5580 ctcagacaat cacaataact ttatgtcatt gttatgccga aaaagtcaca cttacttgcc 5640 tgtcaaattt tcttaaccaa aaacctaaag acctaacatc caaacgaata cccgtaacag 5700 caggagaatg tcttttagcc acgaatacta aaatatcatt acaagtggat aacctcaata 5760 catagagaac ggtagcgaca gatcaatttc attgtgcctg gatgcgcaca aaaacacagg 5820 agtataccca ttttttcatg aagacatata acggacaaat caccggacga gcggtaacct 5880 tagaacaata cgtcacttag actttatgct atcatgccct gcgtcgttgc gttcccagtg 5940 aatggccaga atctataact gtatggggaa atgacaacca cgataaagaa gtaatgaaga 6000 aattcggtac ttatccaatt gagcgtatag gcgatttcat tctaatcagg agtctcaagg 6060 taggtgaagc tatactgctg gcagagcaaa ataactacgt gcttaccctg gataacggca 6120 tgcttctaaa agaccccaag ttcccagatg atgtctttaa ataatataaa tcaaccgctg 6180 caaatttctc acaaaaatta gcaaataatc cagctacagc aattcttgaa gcccacatta 6240 cgatagccct catgactcaa aagatgaata tgataagtac ttgggaacaa atgtgtttta 6300 tccaaacgga gatctctcgc atacatcgat ggatgattgc ctagttccct actacatcag 6360 cagagtgggt gcaccaatcc caaggagtaa ttgtggaatc agctggagat gctttactat 6420 tatcggaatg cattaactac acgacttacg ccatacagta taatcggaag attggcaatt 6480 tttgttttga acattttcca attaccttgc caagtttaaa cataacttat tttctagagg 6540 ttagtgatcg aaaattgata ggaacaagtc cccgcattcc ttgtaaactt agaccaaagc 6600 acacgtactt acaggaccct aaattaaata ttatgtacga aatatcggca tacggacgtg 6660 ttaaaatttg gagacccaga tggaccacgt gctttcctcg ccaaacaacc cgattccccg 6720 tattcgagga tacaacaaaa attttcttgt cgagaaacct caacgattat caccgtacac 6780 agttttacaa ttaatctcag ggtcacatga aacactgcag tcacttaaaa ctattagtga 6840 caccaacggt ggtaatgtct taactggtat tgctacggct ttaggtagtg ccctacaagc 6900 caccgcaagt ggcggaagcc agattattaa agccattggg ggtaccatca aagactcgct 6960 taatggagtt tccgacttag atgagaaact agttcattct attggcgttg cttcatcttc 7020 agttttaaaa gccgccggag gtgctgtaaa ggatgtagga gaaggagcag gttccttctt 7080 ccaaaagttt attgatggta taagtggttc tattctgtgg gtagcagtct ttttaatagt 7140 agtttatctt atcttaaaca agccgaatat aaattatgcc ttaccatgtt taaaaagatt 7200 taaaaacaaa atgatggaag atactatagt aataccacaa caggaccagt ttacatcaat 7260 gtcccccaaa gtaaaaaaac actcatcgcg taattgtaaa tgccgaacac gaaaagacaa 7320 atgagaatga aacatttcga cggtagtggc gggaattagg gcttcaataa tttcccggat 7380 ggagacatgg aacgccttca accaccatga gtttcataca aagttagacg cgttttccga 7440 attcattctg ttgattattc gagatgttat tgttttacag agattttact tgttgatttt 7500 gctgattatt tgaatttatt gttgtgcaga tacttgttga ttctgctgat tatttgagtt 7560 tattgttgtg cagatacttg ttgattctgc tgattatttg agtttattgt tgtgcagata 7620 cttgttgatt ctgctgatta tttgaattta ttgttgtgca gatacttgtt gattctgctg 7680 attatttgag tttattgttg tgcagatact tgttgattct gctgatttat cgaaattttt 7740 atttttgcgg aaattttgct tctaagtccg aggacggact ttttcttggt tcgtaagtaa 7800 // ID BEL-202_AA-I repbase; DNA; INV; 2961 BP. XX AC AAGE02028271; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-202_AA_; KW BEL-202_AA-LTR; BEL-202_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2961 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028271; Positions 17161 14201. XX CC 'CACTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 643..2715 FT /product="BEL-202_AA-I_1p" FT /translation="MSAADRKLRGLKNRKRSIVSSFAGIKTYVTGYQAERD FT KCEVPVRLENLVALWNEFNEVQTELETLEESEESLDGYLKERTEFERAYYR FT IKGTLLMLNQPSPETQIVAARQEGLNQSNVKLPDIKLPVFSGEFESWMNFH FT DLFISLVHSSTNLSTIQKFYYLRSSLSGEALKLIQTIPISNEQYSVAWNLL FT VSHFQNPRRLKRSYVQSLFDFPSMNREAAPELHALLEKFQTNVKILKQLGE FT NTDHWDVLLIYLLSSRLDAVTRRDWEEYSETHNATKFQQLIEFLEHRVNVL FT ETIASNSANLPPVAKKNNLPRSSSYGAVQQAFRPCPACSNQHLLYLCDIFS FT GMTIDEKENLVRNNHLCRNCLRWGHLARNCVSESSCRKCSGRHHTQLCITE FT SEANNQSASASGIDRQEESTAYSGITSCTSRTITNSSVLLATANIILVDES FT GHEHHARALLDSGSECSFVTESLAQRMHVRRYSANLSIDGIGQSSTRVQCK FT IRSTVKSRITDYNTTIEACILPRVTVELPSKSIDVSGWPIPTGIPLADPEF FT YQSRPIGVVLGAEVFFNIFNVAGRIPLGDSLPTLINSAFGWVMSGTAGTQG FT HVSSSAVCNLAVMANNGRFSQARHSQSRAGSKSGRSQYATHNQQPQKSRVF FT RSSTLCCAQPQFRRNDRCYPADIIRHSGMNRQSKPTSVSQQR" XX SQ Sequence 2961 BP; 892 A; 672 C; 629 G; 768 T; 0 other; tggtccttcg aaccggattg gtgcgtcgat aagtattcgc ccaaccagaa tatcttcagt 60 ttacgcaaat atcccagcaa agacgccgtt ggatagcatc aaaagagttg aatggggaaa 120 acaacaggca tggcgaatgt ggcacctaat tgcatgcagt aataaataac aatacaaggc 180 atattccatg cctcactaag gtgagtgtgg ttccattctc ttacgatttg atgaaatttt 240 ctaccggatc gttttatttt gcacatgttc tgatttacca tcgcattact ggccatcgat 300 gaagtggatg ctgattcaag aatctttcaa ccgaagtttt ggatgtcggt cgatgaactc 360 gatgaccggt caatgaactc atatcattca acgttagaca actggagaag ttgccatatc 420 gaaccatcag cttcacggcg tgcatcggca gagaaatcat ttgtcactga gtccgaccaa 480 cccaaattac taataagcaa cctgaaagcg gaagtgatca agagctattt caaataatac 540 aaggcaataa attgcctgtt accaggtaat tgcgattcca tatctgttat cctgtgttgc 600 gtggtctacg cggatccttt ggttttagat tcttccatcg ctatgtcggc ggccgaccgg 660 aagctgcgcg gtttaaagaa ccgaaaacgc agcatcgttt catcatttgc tggcatcaaa 720 acgtacgtta ccggttatca agctgagaga gataaatgtg aagtcccagt ccgtttggaa 780 aatttagttg ccctgtggaa cgagttcaac gaagtgcaga ctgaactgga aacactagag 840 gaatccgaag aatcgctcga tggttattta aaagagcgaa ccgaatttga gcgggcttat 900 tatcgcatca agggtacact tttgatgctc aatcaaccgt cgcccgaaac ccaaatcgtc 960 gctgcacgtc aagaagggtt gaatcagtcg aacgtcaagc taccggatat aaagctaccg 1020 gtatttagcg gagaatttga aagctggatg aatttccatg acttgttcat ttctcttgta 1080 cactcttcga caaatttgtc gacgattcaa aaattttatt acttgcgatc ctcattatca 1140 ggcgaagcac ttaaactgat ccaaaccata ccgatcagta atgagcagta ctcggttgcg 1200 tggaatttgc tagtatctca cttccaaaac ccaagacggt taaaacgttc ttacgtacag 1260 tccttgttcg actttccaag catgaacaga gaggctgcgc cagaactgca cgctctctta 1320 gaaaaattcc aaacgaatgt gaagatcctc aagcagttgg gggaaaacac agaccactgg 1380 gatgtattgc tcatatatct tctaagctca cgattagatg cagttaccag gcgtgactgg 1440 gaagaatatt ccgaaacgca caacgctacg aagttccagc agctaattga attcttagag 1500 catcgagtta atgttcttga aaccattgcc agcaattcag caaatttgcc tcctgttgca 1560 aagaagaaca atcttccacg ttcaagcagt tacggtgcag tccaacaagc gtttcggcca 1620 tgtccagcgt gttcaaatca acatttgctg tatctgtgcg acatattttc gggcatgact 1680 atcgatgaaa aagaaaactt agtaagaaac aatcacctat gccgaaactg tctgagatgg 1740 ggacacttgg cgaggaactg tgtgtctgaa agcagttgtc gtaaatgcag tggacgtcac 1800 catacgcaac tatgtatcac agaatcagaa gcaaacaacc aatcagcttc agcaagtggc 1860 attgacagac aagaagaatc cacagcatac agtgggatta cgagctgcac atctcgaacc 1920 ataacgaaca gctcagtgct gttagcaacg gcaaatatca tcctggtgga cgaaagtggc 1980 cacgaacatc atgcaagagc actgcttgac tcaggcagtg aatgcagttt tgtaacggaa 2040 tctttagctc aacgaatgca cgttcgacgt tattctgcaa atttatcgat agatggaatc 2100 ggacaatcat cgacgagggt acaatgcaag attcgatcaa cagtgaaatc tcgaataact 2160 gattacaaca ccacaattga agcatgcatt ctccccagag ttacagtaga gcttccatcc 2220 aaatccattg atgtttcagg ttggccaatt ccaactggaa ttcctttggc agaccctgaa 2280 ttttatcagt ctagaccaat cggtgtggtg ctcggagctg aagtattttt caacatcttc 2340 aatgttgccg gacgtattcc attaggagat tcacttccaa cactgattaa ttcagcattc 2400 ggctgggtta tgagcggtac tgctgggaca caaggacacg tatcttcatc tgccgtgtgc 2460 aacctggctg taatggcaaa caacgggcgc ttctcacaag cacgtcactc gcaatctaga 2520 gcaggaagca aatctggtcg ttcacagtat gccactcaca atcaacaacc tcagaaatca 2580 cgagttttcc gttcttcaac tttatgttgt gctcaaccac aatttcgacg aaacgaccgt 2640 tgttatcctg cagacattat tcgtcattca ggcatgaacc gtcagagtaa accaacatca 2700 gtttctcaac aacgataaaa aatccaaggc ccttataatt atgggtccag gtaaggacct 2760 gtccattatt tttcacttat caaaattagg ctaccggtct tattctatgt tacatgctta 2820 tctaatcaac gagctacacc aaaccagcga ggaattcaaa gcacgagcgg atttactaca 2880 gctacttctc caaatgctgc tccagaactt aaacggaaac atcagaagat tcagaaatgg 2940 ttatttctgc cgggggcagg a 2961 // ID BEL-637_AA-I repbase; DNA; INV; 6717 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-637_AA_; KW BEL-637_AA-LTR; Pao_Bel_Ele193; BEL-637_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6717 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5746-6306] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 5188..6717 FT /product="BEL-637_AA-I_1p" FT /translation="MLRTTAFVLRYVDNLKRRIERRQPISGILHQEELAKA FT ERFLWKTAQAEAFPEEIAALKTTQGAPEARHCAVSKSSRIYKRWPFLDEEG FT VLRTRGRIGAAPFVQYEAKYPAILPEKHPITALLVESFHCRFRHANRETIT FT NEMRQRFDIPRLRVLVSRVMRDCTWCRVTKAVPIQPVMGPMPEARLRSFVR FT PFTHVGLDYFGPVFVKVGRSQAKRWVALFTCLTIRAVHLEVVHSLSTVSCI FT MAIRRFVSRRGTPAEFYTDNGTCFQGASKELAREIQSRNEAIALTFTSAHT FT KWHFIPPSAPHMGGAWERLVRSVKAAIGTILDAPRRPDDETLETILFEAEA FT MVNCRPLTYMPLESADEESLTPNHFLLGSSNGVKISPTEPVPEHATLRNSW FT KLAQHITDQFWRRWIKEFLPVIARRSKWYEEARDLQVGDLVLAVNGNLRNQ FT WIRGRVIQVFPGRDGRVRQALVKTATGVFRRAAVKLAVLDVEEKCKPDDEL FT SETPDPRHGLRVGV" FT CDS join(33..1037,1041..4460) FT /product="BEL-637_AA-I_2p" FT /translation="MEGHHNVTGFNCKRCQRPDSADEQMVACDICQEWEHF FT NCAGVDGSIKDHPYVCSMCKPKKTGTKAKESLKPPEKNDKKSSRTSSKKAG FT TKANPAPSSASSATRMAMLEAQMKLIEEQELQQKQDLEAEEELKKREMEAA FT QRHLEEKRKLLDEENRLRELKLREEKEILEKRKVMRQQSMEKKNELLKQMS FT ECSSKSGSNVDSIEKVSSWLAKQEDLPPHSEAHVQPSMPFPTLTQTAASTS FT MQPPNAISSSVPGQMSVFATVPNCSMPPARFPTFPPAVQSNVPMLHTPPIP FT AHSSQSPLIPVSTATVHVSQPVRSVFPPLVQPTALMSALNQQPPSMIAPTG FT NEIGQSHPYSQAPASSAVPAVLDTNQLAARQVLGKDLPKFSGRPEDWPLFI FT TSFEQSTLACGYSDVENLIRLKKCLEGNALESVRSKLLLPSSVPHVVQTLR FT TLYGRPELLIRSLLEKIHQVPTPRYDRLETVIEFGIAVQTLVDHLVAAGQN FT VHLSNPALMQELVEKLPGTMRMEWAVYKSRLPVATLQSFGDFMSGMVTTAS FT QVSYELPSLDRSGKNDRSRARNKGVIQTHSTGLVAASTPTTSVIKIGKPAK FT PCACCNREGHRLVECNRFKSLSTDERWKLVEQRSLCRTCLNSHGKWPCRSW FT KGCEAEGCRHKHHTLLHGAIANSQSGMSASHLSAGQFCMFRIVPVVLHGKG FT QKLLIFAFIDEGSSETLIEDAVAKQLGAHGPTEPLTLHWTGNVSREESNSQ FT RVQLKIAAKNSESILDLHHVRTVSCLVLPSQTLRYGDMVRRFPHLGGLPIE FT DYTNVQPKLLIGLNNLSLCVPRKLREGGPHDPVAVKCHLGWSIYGGVPASA FT IRSVAVNFHAAAPSDPDKLLNEQLRDYFSIENNISASMEVDSEEDKRAKAL FT LESTTRRIGQRFETGLLWKTDAIDFPDSYAMALRRLLSLERKLRQDPELGE FT RVVQQIRDYEQKGYARKVTEQELASSDTKRTWYLPVGVVVNPKKPNKLRLV FT WDAAAKVGDVSFNSNLLKGPDLLTPLLAVLSRFRQYPVAVTGDIKEMFHQI FT RIRKEDQQSQRFLWRENPTEMPQIYVMEVATFGSTCSPASAQFVKNLNAKE FT YANQYPRAAAAIHDNHYVDDYLDSFQTIEEAVQVVEEVKFVHSMGGFELRN FT MLSNSVEVLRRIEQRPGESSKDLLVTRGESTESVLGMRWIPTEDVFTYTFA FT PRADLQAVLEENHVPTKREVLKVVMSLFDPLGCISFFLVHGKVLIQDTWIS FT KIGWDDPIGDSSLERWKQWTGLFGQLPSLRIPRCYFREPFPRNFDQLQLHV FT FVDASESAFSSVAYFRLPVGEQIQVAMVASKTKVAPINTISIPRLELKAAV FT LGTRLLESIRAHHTIPIAESFLWSDSKTVLAWIQSEHRRYHKFVAVRVGEI FT LLSTDVNSWRWVPSKINPADEATKWKNGPNLNSDSSRFRGPSFLQRDENSW FT PKLLPATTSS" XX SQ Sequence 6717 BP; 1840 A; 1640 C; 1656 G; 1579 T; 2 other; aaatctttaa gatatttgta cgagggtgaa agatggaagg acatcataac gtcaccggat 60 ttaattgcaa gagatgccag cgacccgact ccgcagacga acagatggtc gcatgcgata 120 tatgccaaga gtgggagcat ttcaactgcg ctggcgtcga cggaagcata aaggatcacc 180 cgtatgtatg cagtatgtgc aagccgaaga aaaccggtac taaagccaaa gagtcgttga 240 aaccgccgga gaagaatgac aagaagtcgt cgaggacatc gtcgaaaaag gcaggcacga 300 aagcaaaccc agcgccttcc agcgcgagtt ccgctactcg gatggccatg ctggaagcgc 360 aaatgaaact tattgaggag caggaattac agcagaagca ggatctagag gcggaagagg 420 aattgaagaa gcgtgaaatg gaagcggccc aacggcatct agaggagaaa aggaagcttt 480 tggatgagga aaaccgtttg cgcgagttga agctacgaga ggagaaagag atcctagaga 540 agcgtaaagt tatgaggcag caatcgatgg aaaagaagaa cgaactttta aagcagatgt 600 ccgaatgcag cagcaagtcc ggttcgaacg tagattcgat cgagaaagtg tcatcgtggc 660 tggcaaaaca agaggacctt ccaccacact ccgaggcaca cgttcagcca agtatgccgt 720 tccccacgct gacacaaaca gctgccagca catcaatgca accaccaaat gcaattagtt 780 cgtcagttcc cggtcagatg tccgtttttg caactgtgcc gaactgttcc atgccgccgg 840 cgagatttcc cacctttccg ccagcagttc aatccaacgt tcccatgctt cacacgccgc 900 ctattccggc ccattcgtcg cagtctccgc tgatacccgt ttcaacagct accgtacatg 960 tgtctcagcc ggtacgaagc gtttttccac ctctcgttca accaaccgca ttgatgtcgg 1020 cgctgaatca gcaaccctwt ccgtcgatga tcgctccaac aggcaacgaa atcggtcaat 1080 cgcatcctta ttcgcaagca cccgcatcgt ccgcagtccc agcagtactg gacactaacc 1140 agttagcagc gaggcaagtc ctagggaaag atttgcccaa atttagtggt cgtccggagg 1200 actggccact ctttatcacg agcttcgagc agtcaacgtt agcgtgcgga tattccgacg 1260 tagaaaatct tatccgcctc aagaaatgtc ttgaaggcaa tgcgctggaa tccgtacgta 1320 gcaagcttct tcttccctcc agtgtgccac acgtcgttca aactttgcgc actctgtacg 1380 gcagacccga gctgctgatc cggtccttac tggaaaaaat tcaccaagtt ccgactccac 1440 ggtacgatcg actggaaacg gtaatagaat tcggtatagc agtgcagacg ttggtggatc 1500 accttgttgc cgcaggccag aatgtgcatc tctccaaccc tgctctcatg caggagttgg 1560 tagaaaaatt accgggcacg atgagaatgg agtgggcagt ctataagtct cgtcttccgg 1620 tagctactct acaatccttt ggggatttta tgtctggaat ggtgactacg gccagccaag 1680 tttcatacga acttccttct ctcgaccgtt ccggtaaaaa cgaccgatca agagctagaa 1740 acaaaggagt catacaaacc cattccactg gactcgtagc cgcttcaaca ccaactacct 1800 ccgtcatcaa gatcggaaaa ccagctaaac cgtgtgcttg ttgcaaccga gaagggcata 1860 gactggtgga atgcaatcga ttcaagtctc tgagcacaga cgaacgatgg aaactggtgg 1920 agcagagaag cctctgccgc acttgcctta acagccacgg taaatggcca tgtcggtcat 1980 ggaaaggttg tgaagccgaa ggttgccgtc acaaacatca cacgcttctc catggtgcga 2040 ttgcgaattc acaatctgga atgtctgcta gccatctgtc agctgggcaa ttctgtatgt 2100 tccggatagt cccagtggtg ttgcacggga agggccaaaa gcttttgatc ttcgccttca 2160 tcgacgaagg ctcgtcagaa accctcatcg aagacgcagt ggccaaacag ttaggtgcac 2220 atggaccaac cgaaccatta acgttgcatt ggacggggaa cgtctctcga gaggaatcta 2280 attcgcagcg agtacagttg aaaattgctg cgaagaatag cgagtctatt ctcgaccttc 2340 atcatgtacg cacagtaagc tgtcttgttc taccgtcaca gacgctgcgt tacggagaca 2400 tggttcgacg ttttccacat ctaggcggcc ttcctattga ggattataca aatgttcaac 2460 ccaagctgct cattgggttg aacaatctca gtttatgtgt accgagaaaa ctacgagaag 2520 ggggaccaca cgacccggtc gccgttaagt gccatctcgg ttggagcata tacggcggag 2580 ttcctgccag tgcgattcga agtgtggctg tgaacttcca cgctgctgct ccatcagatc 2640 cggataaatt gctgaatgaa cagctgcgcg attatttttc catcgagaac aacatatctg 2700 ctagcatgga agtagattcg gaggaagaca aacgtgctaa agcgctctta gagtccacca 2760 cccgacgaat tggccagcga tttgaaacag gcttgctctg gaaaacagat gcgatagatt 2820 ttccagacag ttacgccatg gcgctacgcc gcctgctttc gctagaacgc aagcttcgcc 2880 aggacccaga gcttggtgag agggtagtac agcagatacg cgactacgaa cagaagggat 2940 atgcgcgtaa agttacagaa caggaattgg cgtccagcga taccaagcga acctggtatc 3000 ttcctgtagg ggtcgttgtc aaccccaaaa aacccaacaa gcttcggtta gtatgggatg 3060 cagcggcgaa agtcggcgat gtgtccttca actctaatct cctcaaagga cccgatctgt 3120 tgacaccgct gctggccgtg ctctctcgct ttcggcagta tccagtggca gtcacgggtg 3180 atataaaaga gatgttccac cagatcagga tacggaagga agatcagcaa tcccaacgct 3240 tcctttggag ggaaaatcca accgaaatgc cacagatcta cgttatggaa gtggccacgt 3300 ttggttcaac gtgctctcca gcttcggcgc agttcgttaa gaacctcaac gctaaagaat 3360 acgcgaacca gtatcctcgg gccgcggcgg ctatccatga taaccactac gtggatgatt 3420 acttggacag ttttcaaaca attgaagaag ctgtccaggt agttgaggaa gttaagttcg 3480 tccactcgat gggcggtttt gaacttcgca acatgctttc caattctgtc gaagtactgc 3540 gaaggattga acagcgccca ggagaaagtt ccaaagacct gttggtaaca cgcggcgaat 3600 ctacggagtc tgtacttgga atgcgatgga taccgacgga agatgtgttc acctacacgt 3660 ttgcccctcg cgctgatcta caagcggtcc tagaagagaa tcacgtaccg acaaagcggg 3720 aggtcctgaa agtggtgatg agcttgttcg atccgcttgg ctgcatctcg tttttcttgg 3780 tccatggaaa ggtgctcatc caagatactt ggatctctaa gattggctgg gacgacccaa 3840 ttggtgatag cagtttggaa cgatggaaac aatggactgg gctctttggt cagctacctt 3900 cgctgagaat tcctcggtgc tattttcgtg aaccctttcc aagaaacttc gatcaactac 3960 agctacatgt attcgttgac gccagcgaat ctgcgttttc tagtgtagca tactttcggc 4020 ttccagtagg ggagcagatc caggtagcga tggtggcttc caagactaaa gtagccccta 4080 tcaacaccat ttccattccg aggctggaac tgaaagctgc tgttctcgga actcgtcttc 4140 tggaatctat aagagcacat cacactattc caattgcaga atctttcttg tggagcgatt 4200 ccaagacagt acttgcttgg atccaatcgg aacatcgtcg ctaccacaag ttcgtggcgg 4260 ttcgcgtggg ggagatactg ctttctaccg acgtaaacag ctggagatgg gtaccgtcta 4320 aaattaatcc ggcagacgaa gccaccaaat ggaaaaatgg acccaatctg aactcggaca 4380 gttcgcggtt tcgtggtcct agtttcctgc agcgagatga aaattcctgg ccgaaactac 4440 taccggctac cacttcatcg tgatttaaga aattacattt atatacaatt taaagtagaa 4500 taccattcgt cacgtactta tatttcgaaa atctaatatg acgctataac taacactgta 4560 tattttgaaa atccattcga gtcgttctgt cacctgaaca ccttcacctg cgtatcgatg 4620 ttgaagaaac gttgtatgag cactatgttg ctaatagtag cgcatcattt ttaagattgt 4680 ttaaattcaa aaatctaaaa agtttgaaac taaataaatt ttgccaaatt tagtaggttt 4740 gactgttcca aggattcatc actcgagttt cacgatatga ttccgaatcg tgtgatagac 4800 tgagagattc gccgtgctgt ttcaaccaat aaaaccttaa ttgcataaga tctcattgca 4860 cgatggataa tctcgctaac ggtaccacaa tgaaactgtc gtcgaaaaac tctacgtaaa 4920 atgaaaattt tcggtgtttc tacttgttga tcacgtcaaa cagaaacacg gtctaaatta 4980 ggttttcaga aggttgtgca aaaaacttaa tatcaacttt tctttcaaaa aaacaattcg 5040 tgaaaattgc atgttctgga attcatcagt gaagccggtc gctcggtaca gagcttggcc 5100 accacccaag aagaattacg gtctgtccat ctgcatcggg tttcgttttc agtcgtggat 5160 ccttcacgat tcagcaagtg gaacaggatg ctaaggacga ctgcattcgt gttgcgatac 5220 gtcgacaact taaaacgccg gatagaacgt cgtcaaccaa tcagtggaat actccaccaa 5280 gaagaattgg cgaaggcaga acgtttcctt tggaaaactg cacaggctga ggcatttcct 5340 gaggagatcg ccgcactcaa gacaacccaa ggtgcgccgg aagcacggca ctgcgctgtt 5400 tcgaaatcaa gccgtatcta caagcgttgg ccctttctag acgaggaagg agttctacga 5460 acgcgaggaa gaataggcgc ggcccccttt gttcagtatg aggcaaaata ccctgcgatt 5520 ctgcccgaga aacatccgat tacggctttg ctagtcgaat cgttccattg tcgattccgc 5580 catgcgaaca gagaaactat caccaatgaa atgagacaac gtttcgacat cccaaggctc 5640 cgagttttgg taagtcgggt aatgcgagat tgtacctggt gccgtgtaac gaaagcagtt 5700 cccattcaac cggtaatggg accaatgccg gaggctcgcc tgagatcctt tgtacgacca 5760 ttcacacatg tgggtctaga ttactttggt ccggtcttcg ttaaggtcgg taggagtcag 5820 gcaaagcggt gggtggcact tttcacctgc cttaccatca gggcagttca ccttgaggtt 5880 gtccatagcc tcagcaccgt atcgtgtatc atggctattc gaagatttgt ttctcgacgg 5940 ggcaccccgg cagaattcta caccgataac ggtacctgtt ttcagggcgc aagcaaggaa 6000 ctggctcggg aaatccaatc ccgtaatgaa gcaatcgcac ttacgtttac cagcgcccac 6060 acaaagtggc attttatccc accttcggca ccccacatgg gcggggcgtg ggaacgcctt 6120 gtgcgatcag taaaggcagc aattggaacc atattagatg cacctcgtcg tccggatgac 6180 gaaactttag agacaattct ctttgaggcg gaagccatgg ttaactgtag gccattgacc 6240 tacatgcctc tggagtcagc cgacgaggaa tcactaactc caaaccattt cctcctcgga 6300 agctctaatg gagtaaaaat ttcgccgacg gaaccagtac cggaacatgc aacgttgagg 6360 aatagttgga aacttgccca acatatcacc gatcaattct ggcgcaggtg gatcaaggag 6420 ttcctacccg tgattgcaag aaggtctaaa tggtacgagg aagcaaggga cctacaagtt 6480 ggtgacttag tcctggccgt aaatggcaac ctgagaaacc agtggattcg tggccgtgtc 6540 atccaggtct ttcctggaag agatggcaga gtccgacaag ctctcgtgaa gacagckaca 6600 ggagtattca gaagggcagc cgttaaacta gcggttcttg acgttgaaga gaagtgtaaa 6660 cctgacgacg aactctcgga gaccccagat cctcgccatg gtttacgggt gggggta 6717 // ID Gypsy-31_AA-LTR repbase; DNA; INV; 242 BP. XX AC AAGE02023662; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_AA_; KW Gypsy-31_AA-I; Gypsy-31_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-242 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023662; Positions 27882 28123. XX SQ Sequence 242 BP; 73 A; 48 C; 47 G; 74 T; 0 other; tgtaaacgga tgaaaaatag ctggtgctat cggtgatgag tgaactgaat tctctgagtg 60 tgaactaata ctacatcact gtgcaaaata atactattta tacagcaaaa tatataccct 120 tctagtcagt ctaaattaaa ccacgaaagt agacagatct acgttatcta ctccgaaagg 180 tctcggttct ctcggtagtc cgctttgagg tcgatgtccg ttcgttccgc atatttgtta 240 ca 242 // ID P-18_HM repbase; DNA; INV; 3040 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3040 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 364-364 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(130..1956,1946..2620) FT /product="P-18_HM_1p" FT /translation="MVKKCCVPECRGNYDAKNKVRVFRLPSNEEERNRWIN FT SIPRSNFPNKPDTVVCERHFPENFLTTIVNGKVRPIEPPSIFPNIPASVVP FT TQPLKPRSTTKVLSSIRTEKCDELSQYLKNDLFTFIFLCEDIKNRKFICPV FT TTFVINDVLHIQSNSFYVNCIPMFAVHISSNFKFESFHAGVRCFISSLSKN FT RITYLDRWSRVEETIRFLNNIEIDHQKLVLQDLITSMNASEVGKKIYNVET FT IIRSYCYFSTSRCLYNQVRKDFKLPSVKTLTKITSMTKKVSDIDFISTIFS FT NLEENHRNCILLVDEVYVKSLLLYHGGYLFGKAENNPELLANSVLGIMVKC FT LKGGPSFLCKMIPVCHLDASFLFEQVCYVINSIKSSNGKVISVICDGNRTN FT QAFFKLFERVEGKPWLTKCGIFLLFDYVHLLKNIRNNWITEKMQELQFEKD FT QNLTAKWSDLKCILDFEKDQLIKLSRLNFVAIHPKPIERQKVSHCLRVFCD FT ETIAALKTYSKVIHPCNMNETICFLEKVTKFWKIVSVKEIYGDMKFLDSRK FT RVISEELDINLQYLLDFGDMAFKMSGEQGHRMKSLTKDTGTSIWHTCYGVV FT EMCKYLLKKQFKNNFDYVMLGEFSTDYIEKAFSKFRQGSGGTYFITVQNVV FT EKLNIEKAKLLLQLKVDVTDYAIDIEHSCQKCGYFISEESCDIINQLPELE FT SSLSLSTKMALFYIAGYATRKDDVSENDLFNDTAFYFQLYGDYTKTIDRGG FT LNIPLDCCVQWVFFCFIIFEIEKNKMCRKSLCNTFMMISDIYNFNMLKKHG FT TILSNIFLKNYCLLFSPKSNKEPALKVLKLSV" XX SQ Sequence 3040 BP; 1039 A; 386 C; 485 G; 1130 T; 0 other; catggcgata cttattacag gccttcccgt cgcgaatccg tatttttgac tgggggcgga 60 caatcttacg gaaagatata ttttagtttt tttaaagttc gatttgtgag taattgattg 120 ttacaataaa tggttaaaaa gtgctgtgtt cctgaatgta gaggaaacta tgacgctaaa 180 aataaagtac gagtttttag gcttccctcc aatgaagaag aaagaaatcg ttggataaat 240 tcaattccaa gaagtaattt tccaaataaa ccagatactg ttgtctgtga aagacatttt 300 ccagaaaatt ttttaacaac tattgtcaat gggaaagtta gaccaattga acctccttca 360 atttttccaa acattccagc tagtgttgta ccaactcaac ctttaaaacc aaggtccact 420 actaaagtgt tgtcttctat cagaactgaa aagtgtgatg agttgtcgca atatttaaaa 480 aacgatttgt ttacattcat ttttttatgt gaagatatta aaaatcgtaa atttatttgt 540 cctgtaacaa cttttgttat aaatgatgtt ttgcacatcc aatcaaattc cttttatgtt 600 aactgtatac ctatgtttgc tgttcatatc tcaagcaatt ttaaatttga aagttttcat 660 gctggtgttc gttgttttat ttcttcacta tcaaaaaatc gtataacata tcttgatcgc 720 tggtcaagag ttgaagaaac aattcgcttt ttgaacaaca ttgaaattga tcatcaaaag 780 ttagtattac aagatcttat tacatctatg aatgcatcag aagtaggtaa aaaaatttat 840 aatgtagaaa ctataatcag atcctattgt tatttttcaa cgtctcgatg cttatacaat 900 caggtacgaa aagattttaa attgccaagt gtaaaaacat taacaaaaat tacatctatg 960 acaaaaaaag tttcagatat tgactttatt agtaccattt tttcaaattt agaggaaaat 1020 catagaaatt gtatactttt ggtggatgaa gtttatgtaa agtctcttct tctttatcat 1080 ggtggttatc tttttggaaa agcggaaaat aatccagagt tgttagcaaa tagtgttttg 1140 ggtattatgg ttaaatgttt aaaaggtgga ccatcttttt tatgcaaaat gatacctgtc 1200 tgtcatttgg atgcttcttt tttgtttgag caagtatgct atgtcattaa ttcaattaag 1260 tctagcaatg gaaaagtaat ttcagttatt tgtgatggca atcgaacaaa tcaagctttt 1320 tttaaattat ttgagagagt tgaaggaaaa ccatggttaa caaaatgtgg aatttttctc 1380 ctttttgatt atgttcatct actaaaaaat attaggaata actggattac agaaaaaatg 1440 caggaattac aatttgaaaa agaccaaaat ttgactgcca agtggagtga ccttaaatgt 1500 atcttagatt ttgagaaaga tcagctaatt aagttatcaa gactaaattt tgtagctata 1560 catccaaaac ccattgaaag acaaaaagta tcacattgtt taagagtatt ttgcgatgaa 1620 actatagctg ctcttaaaac ttattcaaaa gttatccacc cttgcaatat gaatgaaacc 1680 atttgttttt tagaaaaagt tactaaattt tggaaaattg taagtgtaaa agaaatttat 1740 ggtgatatga aatttttaga ttctcgtaag agagttattt ctgaagaatt agacataaat 1800 ttgcagtact tacttgattt tggagatatg gcatttaaaa tgagtggtga acaaggtcat 1860 agaatgaaat ctttaactaa agatacaggt acttcaattt ggcatacatg ttatggagtc 1920 gtcgagatgt gtaaatattt gctaaaaaaa caattttgat tatgttatgt taggagagtt 1980 ttcaacagat tacattgaaa aagcattcag taagtttagg caaggttcag gtggaacata 2040 ctttataaca gttcaaaatg ttgttgaaaa gttaaacatt gaaaaagcta aactgttatt 2100 gcaattaaaa gttgatgtta cagattatgc tattgatatc gagcattctt gtcaaaaatg 2160 tggctatttt ataagtgaag aatcttgtga catcattaat caattacctg aactagaatc 2220 atctttgtca ttgagtacaa aaatggcttt gttttatatt gcaggttatg ctacaagaaa 2280 agatgatgtt agtgaaaatg atttgtttaa tgatactgca ttttattttc agttatatgg 2340 agactataca aaaactatag accgcggtgg attgaatata cctctcgatt gctgtgtgca 2400 gtgggtattt ttttgtttca ttatttttga aattgagaaa aacaaaatgt gcagaaagtc 2460 attgtgtaat acttttatga tgatatctga tatttataac tttaatatgt taaagaaaca 2520 tggtacaata ttatcaaaca tatttcttaa aaattattgt ttgctttttt cacccaagtc 2580 caataaggaa cctgcattaa aggttttaaa gttatctgta taacattaaa aaattattat 2640 gtacattttt ttataacatt attgatgtat attttgatat ttttttcttt ttacttgatg 2700 tatattatat ttttttcctt ttttttcttt gcattatttt aaagtttgtt catttttttg 2760 tgatttagat tttaaataat tacaactcat tattctgaaa aatattgtat tttattcttt 2820 ttttcaaaaa tgtttttttt aaatttgatg attcttaata aacactgcaa tagacatttt 2880 ttgcattaat acagtacttt taaaatttag taaactaact aaaaataatg gattagttcg 2940 caaacttttc tagccatttt ttatattttc cgtaactttg tccgccccca gtcaaaaata 3000 cggattcgcg acgggaaggc ctgtaataag tatcgccatg 3040 // ID Gypsy-6_IS-LTR repbase; DNA; INV; 179 BP. XX AC ABJB010051582; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_IS_; KW Gypsy-6_IS-I; Gypsy-6_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010051582; Positions 17432 17254. XX SQ Sequence 179 BP; 40 A; 51 C; 46 G; 42 T; 0 other; tgttgtgtac tcgccactga ctggtggcgc cgccgcctgt atgccgggaa tttgagggcg 60 ttcgacaagt ctgtgctgcc tgggggacca ccgcgtttcc cgcctagcca aaagtcatgt 120 taacaaataa acagtaaaac gttcatcgtc cgctccgacc aattcgtgcg tacataaca 179 // ID Gypsy-65_AA-LTR repbase; DNA; INV; 267 BP. XX AC supercont1.236; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-65_AA_; KW Gypsy-65_AA-I; Gypsy-65_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-267 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.236; Positions 1363777 1363511. XX SQ Sequence 267 BP; 63 A; 52 C; 72 G; 80 T; 0 other; tgtaaatcta ttagaaagtt ggtgctatcg ggcaggcaac caatacttat acgtttctca 60 ttttgggggg cttcaacgct tcggtgaaat aaattgagtc agtttagtta cagcaagcaa 120 acgagctagt tcgcttttgt gctgcgatcc gaaatacccc cagttatcgt agggtctgtg 180 attccggtaa gtcgtcggga agtaagtctc gaggttctgt ggtcatcgtc gaaggttggc 240 caagttctgc gtcggacgta gtttaca 267 // ID Gypsy-16_OD-LTR repbase; DNA; INV; 194 BP. XX AC CABV01003934; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_OD_; KW Gypsy-16_OD-I; Gypsy-16_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-194 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003934; Positions 9959 9766. XX SQ Sequence 194 BP; 50 A; 43 C; 37 G; 64 T; 0 other; tgtcgtgctc ggaaatctgc tgtcaagatt tacgtctctt agtctctctc gcacctgttc 60 tgaatctttc gcgtggcgct tgccggcttt tctgttcact tttaaaatag caaaactgga 120 taatatacga ttttactgaa actacaagtt agagtctttt tactatcgaa agcacgcaaa 180 tgtagagcac gaca 194 // ID RTE-4_BM repbase; DNA; INV; 2754 BP. XX AC . XX DT 30-APR-2010 (Rel. 15.07, Created) DT 30-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-4_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-2754 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1056-1056 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. XX FH Key Location/Qualifiers FT CDS 10..2709 FT /product="RTE-4_BM_1p" FT /translation="MGERIISQPDFLLYHIGTTPGHHGVGFIVKKYLEKSV FT EEFIGISERIAMMNLKLPGYKDSWSIVQIYSPTEQSDLETIEEFYLDLNNT FT IQERGHXNXIVMGDFNGQIGAKRPGEETVLGPFTYSNKIRSRNGELENNLT FT ILNSVYKKNTTKMWTWISPDGKTKNEIDFIMTNTKNCFTNFDVINKFNFNT FT NHKLIRAEIKSVQPKKPRPRQELTIGKLSKHQMAQLVANLRDKFTDFKKNT FT NTMGTQEKYNWIENIINDQVQHIDKIRSEPRKWLTSNTMKLLEQRNQLISA FT KEGKDRRRHLAKISKDIKESIRTDRKKSRMEIIEKHIAKTGGIKKAHRELT FT XSNDWIAKIKNNSGEWHHNRTNILGIATSYYKKIYEHNTAQDNIYLAETSS FT IPNIIQTEVTKAIDSQHRDKAPGPDGISNEILKESNEIITPVLTDMFNEIL FT KTEIIPRQWTESNILLLYKKGDKYDIGNYRPISLMSNIYKVFAKVILNRIE FT KILDEHQPIEQAGFRKGYSTIDHIHVVRQVMEKYKEHQCTYYIAFIDYSKA FT FDSLFHEKVWESLKEQGIEHKYIRLIKNVYSQNTARIQLEKKGEPFRVGKG FT VRQGDPLSPKIFSAVLESIFRALNWENLGINVDGVSLTHLRFADDIVLFAK FT SSEDITRMINELSTESEKVGLKLNPEKTKIMTNGERIPINLGNNQIDYTDE FT YIYLGQLITDDNPVLKEVERRITNGWRRYWGLKEIMKDKNLHMNIKSKLFN FT TCVLPVLMYGSQTWALSQNTISKLASCQRAMERSMMNVKKSDRLSNRIIRS FT KTKVIDIGCKIRRLKWRWAGHMVRGKDKWSKIVTRWYPRESKRKKGRPQKR FT WDDDIRQVAGVTWNRVAQERHEWKRLEEAFVDWQTDLQKIRKIQIIG" XX SQ Sequence 2754 BP; 1134 A; 476 C; 540 G; 599 T; 5 other; gtaagaagaa tgggtgaaag aattatttct cagccagact tcctcctata ccacattgga 60 acgacgccag gacaccatgg cgttggtttc atcgtcaaaa aatatcttga aaaatctgta 120 gaagaattta taggaatctc ggagcgcata gcaatgatga atttaaagtt accgggatat 180 aaagattcat ggtctattgt acaaatatat tcnccaacag aacaatcgga cctggaaaca 240 attgaggaat tctaccttga cttaaataac actatccaag aacgtggcca caanaacttn 300 atcgtcatgg gagatttcaa tggccaaatt ggagctaagc gtccaggaga agagacngta 360 ctgggaccat tcacttacag taataaaatt agaagtagaa atggagaact agaaaataac 420 ttaaccattt taaactcagt atataaaaaa aacacaacga aaatgtggac atggatttca 480 ccagatggaa aaaccaagaa cgaaatagat tttataatga ccaatacaaa gaactgcttt 540 acaaatttcg acgtcattaa taaatttaac ttcaatacga atcataaact aattagagct 600 gaaataaaat cagtacaacc aaagaaacct agaccgagac aggaactaac aatcggtaag 660 ctcagcaagc accaaatggc acaactcgtg gccaacctaa gagataaatt cacagacttc 720 aaaaagaaca caaataccat gggaacccaa gaaaaataca actggataga gaatattatt 780 aacgatcaag tacaacacat tgacaagatt agatctgaac cgagaaaatg gctaacatct 840 aataccatga aattactaga gcaaagaaac cagctaatca gcgcaaaaga aggaaaagac 900 agaagaaggc atttagcgaa aattagtaaa gacatcaaag aaagtataag gacggacagg 960 aaaaagagta gaatggaaat tattgaaaag catatagcca aaacgggagg cattaaaaaa 1020 gctcatagag aattaacaan ttcaaatgac tggatcgcaa agattaaaaa caacagcgga 1080 gaatggcacc ataatagaac aaatatatta ggaatcgcta cgtcctacta caaaaaaatc 1140 tacgaacaca acaccgctca ggacaatatc tacctagcag aaacctcaag tatccctaac 1200 atcattcaaa ctgaagtaac aaaagctata gatagtcaac acagagacaa agcaccagga 1260 cccgatggca taagcaacga aattctaaaa gaaagtaatg aaattataac accagttctc 1320 accgacatgt ttaatgagat attaaaaacc gaaattatcc cacggcaatg gacagaatca 1380 aatattttac ttctatataa gaaaggagac aaatatgaca ttgggaacta ccgccctatt 1440 agccttatgt ccaacatata caaggtattc gccaaggtca tacttaaccg gattgaaaaa 1500 atattagacg aacaccaacc catcgagcag gcgggatttc gtaaaggcta ttcaacgatc 1560 gaccacattc atgtagtacg ccaagtaatg gagaagtata aagaacacca atgcacctac 1620 tatatcgcat ttattgatta tagcaaagca ttcgattctt tgtttcatga aaaggtatgg 1680 gagtcattaa aagaacaagg aattgaacac aaatacatcc ggttaataaa aaatgtttac 1740 agccagaata ctgcgagaat acaattagaa aagaaaggtg aacctttcag agtaggcaaa 1800 ggcgtacgac aaggagatcc cctgtcacca aaaatattct cggcggtact agaatccata 1860 ttcagagctc taaattggga aaatctcggt atcaatgtag acggtgtctc attgactcac 1920 ttgagattcg cagatgatat tgtgttattt gcaaaatcgt cggaagacat cacaagaatg 1980 ataaacgaac tatcaacgga gagcgagaag gtagggttaa aactgaaccc ggaaaaaaca 2040 aaaataatga caaacggaga gagaattccc ataaacttag gaaacaacca aattgattac 2100 accgatgaat atatttactt gggacagcta ataacagatg ataatccggt gctcaaagaa 2160 gttgaacgaa gaataactaa tggctggaga aggtactggg gcttgaaaga aattatgaag 2220 gacaaaaatc ttcatatgaa cattaaaagc aaactattta atacttgcgt actaccagtt 2280 ctcatgtacg gcagtcaaac atgggcttta tcacaaaaca cgattagcaa gcttgcatca 2340 tgccaacgcg caatggagag gagcatgatg aacgtgaaaa agtcagaccg actgagtaac 2400 cgtataataa gaagcaaaac aaaggtaata gatatagggt gcaaaattag aaggcttaaa 2460 tggcgttggg cagggcatat ggttagaggt aaagataaat ggagcaaaat tgtgacacga 2520 tggtatccta gagaaagtaa aaggaaaaaa ggtagaccgc aaaaaagatg ggatgacgac 2580 ataagacaag tggcaggcgt cacatggaac agagtggctc aagaaagaca tgagtggaaa 2640 aggttggagg aggcctttgt cgattggcaa acagatctgc agaaaataag aaaaattcaa 2700 ataataggct agattaaggt ttaaataaaa tttaagtttt atatattata ttta 2754 // ID Gypsy-132_AA-I repbase; DNA; INV; 6847 BP. XX AC AAGE02026533; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-132_AA_; KW Gypsy-132_AA-LTR; Gypsy-132_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6847 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026533; Positions 52784 45938. XX CC Positions [3463-3924] - Reverse transcriptase CC Positions [4945-5421] - Integrase core CC 'ACTA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 422..2638 FT /product="Gypsy-132_AA-I_2p" FT /translation="MPRLSGEEFCTVYRTLRVDHLTAGELDYELLLRNIAI FT AEDESLCKRRRRLREILKQEKEGHENIMEFDLDPEVDFNYCLKIFQEIRRT FT LEIKDKSSDHCRARLLHLGHRLTVIRNHCSGELASKCHELYQKVIALFSNH FT FGNENSFSTDGDSSTSSGNELPQGAAALRPPPAAPYTGTKPKRMMVKYVTK FT QDFTSAMKEVADLFKGLTTELRELKTELQTRKSTPVCDFDLSDISDPIAES FT ERKGEHITTEVETKSFPRPVTQVTTAISQSIPFPPPQAVQNFTESSRSQAP FT TTLPPIGNNLMFNTTNPFTQDVSGLGQTVDPGRSEPTVTFARPPAAYREFV FT PSTAPMNGSFQISPTWYNNRYNTEVVQYPRWSESEAPPPPQVHNPGLTRSF FT SPVLGCQPSLPVMHSHPNPQRERRTPVFQPRKVIPVSQWKIQKYKADDQGL FT GMNEFLENVNQMALSEHVSEEDLFDSAIHLFDGPALSWYTAMRSQDRLRDW FT SHLIQELKQAFRHPDLDAVLRVKIYQYRQQRHETFQQYYLHMEKMFRSMSQ FT PMTELDKLEVLKMNLRFDYRKLLVGRPIKSLQSLVNLGHDLDAADSSAFAR FT VFGNSKKEACALSSSGKGSPQATRGCPNKNPQAGKFNQYPKNDKNSKAVST FT PKQLKTDNNTNDYQKPKGDQHPPKRGSADPFTKMVKNYRPPPHGECFNCRD FT DHDTGECPLPKRLFCKICAFPGVEYKNCPFCTKKPLRES" FT CDS 2875..5799 FT /product="Gypsy-132_AA-I_1p" FT /translation="MFNSNIYQRLKNVRFRNPVEPTELVTASGDPLEIAGE FT VLVPYSFQGKTRVIPTLYVPGLAVDCICGIDFWQAYRIRPTVSSFSITKPS FT NIQQSNVEQSVRLTENQKFILEQVKGAFKVASAEKLDTTPLVEHRIILGEE FT AQKLKPVRQYPYPISPKIQAGLFKEIDRLLAKGIIEESNSEWSLNIVPIKK FT SSGDIRLCLDARKLNERTIRDAYPLPHPGRILGRLPRARYLSTIDLTEAFL FT QIPLARGSRKYCAFSVQGKGMFQFTRLPFGLINSPATLARLMNRVLGQGVL FT EPNVFVYLDDIVIVSETFEEHVRLLNDVARRLSAANLSIKLEKSHFCVSEI FT PFLGYVLSSRGLQTNPEKIRPIVEFERPVTVSKLRRFLGMSNYYRRFIADY FT CSITAPLSELLKSQSKNLKWNQDAEEAFQNIKEKLITTPILASANFDHEFI FT IHTDASDQAIAGILTQKIEGREHVIEYFSKKLTTPERSYHATEKEGLAALL FT SIEHFRGYVEGSHFTLVTDSSALTFIMRTKWKTSSRLSRWSLSLQQFDITI FT LHRRGKENVVPDALSRSVLMISQKSDSSWYHTLKDKVEHTPEEYPDFRIQD FT GQLRKYVFSNDPTDHRFDWKIVPSPDARKMIVVNAHDESMHIGVDKTIAKI FT RLKYYWPRLTNDVRTHIQKCTVCKEIKPSNVPTVPLMGSMRVSSQPWQIIA FT LDYIGPLPRTRSGKQYILVIMDLFSKWTQLHAFPSISVSSLKMVLQDHWFF FT RNSVPSILLSDNASTFLSKDFGALLERFGVKHWLTTRYHSQANPVERVNRS FT INTAVRSYAREDQRSWDCKLSEIELVINSTIHASTGYSPFMITRGQEIVVD FT GGDYHRFENDDEQSLEARMVRIKSNSPRIYDLVTKNLRKAHDLSSHNYNLR FT HRKNAKPFNVGDKVYKRNTRQSNASDYYNAKLGSQYLPCTVVAKHGYSSYE FT LMDSTGRNIGVWPANLIKPA" XX SQ Sequence 6847 BP; 2049 A; 1567 C; 1471 G; 1760 T; 0 other; tttggcgccc aacgtaaaat tctggaagtt acccagcaag gaaaagtatt cagggaatat 60 tttttctctc gaggtgtggg ggagttttcg cgaaaatcgc attttctctc aatcaattaa 120 attcgccctg gttggtcctt atctagcctg ccaaaattag cttgattgcg ctggggaaag 180 tattgaaaac gtgtcgcggt cgcgggtgct gaagcacaaa ataggctcag tagaatatct 240 atcagacaga ttatcgaaaa atactaacct gctttgttca tctacggaaa attagctaca 300 tgtgagtttg aagacttaga aactatcgga gttttcgttc gtcctaaagg tgaatccaat 360 cgacatacat ccctcccata gttgagttta ttagccaatt gtccagaaga tcattttcaa 420 aatgccacga ctctcaggag aagaattctg caccgtgtac cgtacgcttc gcgttgacca 480 cctcacagcg ggtgagttag attacgaact gcttcttcga aacatcgcga tcgccgaaga 540 tgagtctctt tgtaaacgta gacgccggtt gcgggagatc ctaaaacaag agaaagaagg 600 ccatgagaat attatggagt tcgatctaga tccggaagtt gattttaact attgcttgaa 660 gattttccaa gagatcagaa gaacccttga aattaaagac aagagcagcg accattgtag 720 agctagactt cttcacttag gtcatcgttt gaccgtgatt agaaaccact gtagtggcga 780 gttagcgtca aaatgccatg agttgtatca aaaggtaata gccttgttct caaaccattt 840 tgggaatgaa aacagttttt ccaccgacgg tgattcttca acttcaagtg gaaatgaatt 900 accacaagga gcggctgcgc ttaggccccc tcctgcggca ccttatacgg gaacaaaacc 960 aaaacgaatg atggtaaaat atgtgactaa gcaagatttt acatccgcca tgaaagaagt 1020 tgctgatctt ttcaaggggt tgactactga actccgagaa ttgaaaacgg aacttcaaac 1080 aagaaaatct acaccggttt gtgactttga tctgtcagac atcagcgacc ctatcgctga 1140 atccgaacgg aaaggtgaac atatcacaac tgaggtggaa actaaatcgt ttccgcgtcc 1200 cgtaactcag gtaacgactg caatcagtca gagcattcct tttcctcccc ctcaagcggt 1260 tcagaatttc actgaatcat cacgctcaca agcgccaact acactgcctc ccattgggaa 1320 taaccttatg ttcaacacca cgaacccttt tacccaagat gtttcagggt tgggtcagac 1380 agtggatcca ggtcgctctg aaccgacagt gacattcgct agaccaccgg ccgcatatcg 1440 agagtttgtt ccatcgacgg cgccaatgaa tggatcgttt cagataagtc ctacttggta 1500 taataatcga tacaatacag aggtagtcca atatccccgg tggtcggagt ccgaagctcc 1560 cccaccacca caggttcaca accctgggtt aacgcgttca ttctcacctg tccttggatg 1620 ccaaccatcc cttccggtga tgcacagcca tcccaacccc cagagagaaa gacgcacacc 1680 cgtcttccag cctcgtaaag ttatccctgt ttcgcagtgg aaaatccaaa aatacaaggc 1740 agacgaccaa ggactaggaa tgaatgagtt cttggagaat gtgaatcaaa tggctctctc 1800 tgagcatgtt tcagaagagg atttgtttga ctcagccatc catctatttg acgggccagc 1860 acttagttgg tacacggcca tgaggagtca agatcggttg agggattgga gccacttgat 1920 tcaagaacta aaacaagcat tccgacatcc cgatttggat gcggtgctta gagtaaaaat 1980 ttatcaatac cgacaacagc gacatgaaac gttccagcaa tattatttgc acatggaaaa 2040 aatgttccgg agcatgagcc aaccaatgac agagttggac aagctagaag ttctcaaaat 2100 gaaccttagg tttgactacc gaaaattatt ggtaggaaga ccaataaagt cactgcaatc 2160 actggtaaac ctaggtcatg atctcgatgc agcagactcg tctgcattcg ctagagtttt 2220 cgggaattcg aaaaaggagg cttgtgccct cagcagcagt ggtaaaggta gtccacaagc 2280 tacgcgaggg tgtcccaata agaatccgca agctgggaaa tttaatcagt atcctaaaaa 2340 tgacaaaaat tccaaagcgg tttcaacacc taaacaactc aaaaccgaca acaacaccaa 2400 tgattaccag aaaccgaaag gagatcagca tccaccaaag cgaggatctg ccgatccgtt 2460 cactaagatg gtgaaaaact atcgtcctcc cccacatggt gaatgcttta actgtagaga 2520 cgatcacgac acaggagaat gtcctctgcc aaaacgacta ttctgtaaaa tctgtgcgtt 2580 cccaggagtg gagtataaaa attgcccgtt ctgtacaaaa aaaccgttaa gagagtccta 2640 aagccgccgg attctcctca ggcgctcgtt catcagactt tcacggcact cgttgcagac 2700 ctgggtaagt tttatgagcc ttgttccaat atcgttaggg aagacattcg gctaccgcaa 2760 gttaataaaa ttactctaac ttctcctgat ggggatgatg accgacctca cgtccactta 2820 aaaatatttg atgtggccgt gaatgccctt cttgatagcg gaagtcacag aacaatgttc 2880 aactccaaca tttatcaacg attaaaaaac gtacgatttc ggaacccagt ggaacccacc 2940 gagttggtca cggctagtgg tgatccattg gaaatagccg gagaagtatt agtaccttat 3000 tccttccaag gtaaaacccg ggtcataccc acactatatg ttcctggtct agctgttgat 3060 tgcatatgtg gaattgattt ttggcaagct tatcgcattc gaccgactgt ctcatccttc 3120 tcgataacaa agccctccaa tatccagcaa tcgaacgtag aacaatccgt cagattgaca 3180 gaaaatcaaa aatttattct ggaacaagta aaaggagctt tcaaagtagc gtctgctgaa 3240 aagctagata ccactccact agtcgagcac aggataattc tcggagaaga agctcaaaaa 3300 ctcaaaccag ttcgacagta cccctacccg atatctccaa agattcaggc cggtctcttc 3360 aaagaaatag atcgcctatt agctaagggg atcattgagg aatccaactc agaatggtcc 3420 cttaacatag ttcccattaa gaaaagttca ggagatatac gattatgcct cgacgcccgc 3480 aaactaaacg agcggacgat ccgtgatgct tatcccctac cacatcctgg gcgcatcctt 3540 gggcgattac cacgagccag atatcttagc acaatagatc tcacagaagc atttctgcaa 3600 atcccgttgg ctcggggatc gcgtaagtac tgtgcgttca gcgtccaggg taaggggatg 3660 ttccagttca ccaggctccc attcggtctc atcaacagtc ccgctacgtt ggctcgtctg 3720 atgaaccgag tgctcgggca gggtgtactg gaaccaaacg tcttcgtgta cttagacgat 3780 atagtcattg tctcggagac gtttgaagaa cacgttcgtc tgctcaatga cgttgcgagg 3840 cgccttagcg cagccaattt atcaattaaa ttggaaaaat cccatttttg tgtatcagaa 3900 atcccatttc tgggatatgt tctgtcttcc cgaggtttac agacgaaccc ggaaaagatc 3960 agaccaattg tggagtttga gcgacctgtt acagtgtcga agcttcgccg gttcctcggc 4020 atgtcgaact actataggcg ttttatcgcc gactattgca gtatcactgc tcccctttca 4080 gagcttttga aatcccaatc taaaaatctg aaatggaatc aggatgcaga agaagcattc 4140 caaaatatta aagaaaaact catcactact ccaattttag caagtgctaa tttcgatcat 4200 gaatttatca tccacacgga tgccagcgac caagctattg cgggaatatt aacccagaaa 4260 atagagggcc gagagcatgt catagaatac ttttcaaaaa aactgactac acctgagcga 4320 agctatcacg ccactgagaa agaaggattg gctgcccttc tttcgatcga acattttcgg 4380 ggctacgttg aaggaagtca cttcacactt gtgacggatt cttcagcatt gactttcata 4440 atgcggacaa aatggaagac atcatcgcga ttaagccgct ggagcttatc attacaacaa 4500 tttgacatta ccattctcca ccgccgtggg aaagaaaatg tagtgccaga tgcgttatcc 4560 agaagcgtac taatgatttc gcagaaatcg gactcctctt ggtaccatac actcaaggat 4620 aaagtggagc acacccctga agagtatccg gatttccgga ttcaggatgg tcaactaagg 4680 aaatatgtat tttccaacga tcccaccgat cacaggtttg attggaaaat cgtaccctct 4740 cccgacgctc ggaaaatgat cgttgtcaac gcccacgatg agtcgatgca tatcggggtg 4800 gataaaacga tagctaaaat ccgcctaaaa tattactggc ctcgtttaac gaacgatgtg 4860 cggacacaca tacaaaaatg cacagtttgc aaagaaatca agccgtccaa tgtaccaact 4920 gttcctttaa tgggaagtat gcgagtttcc agtcaaccct ggcagataat agccttggac 4980 tatatcggcc ctttgccaag gacacgttct ggaaaacagt acatacttgt catcatggat 5040 ctgtttagca aatggacgca gctgcatgca tttccaagca tttccgtgtc ttccctcaag 5100 atggttctgc aggaccattg gttttttagg aactctgtcc cttccatttt gctatcggac 5160 aatgctagca catttttatc gaaagatttc ggtgctttac tggaaaggtt tggagtgaaa 5220 cactggctga caaccaggta tcactcccaa gccaatccag tagaacgcgt taatcggtca 5280 ataaacacgg ccgttcgttc ctatgctagg gaagaccaga gatcctggga ttgcaaatta 5340 tctgaaattg agctggtaat caactctacc attcatgcct ccacaggtta ttcacctttc 5400 atgataacaa gggggcagga gatcgtggtg gatgggggag actatcatcg cttcgagaac 5460 gacgacgaac agtcccttga agctcgaatg gttaggatca aatcaaattc gccacgaatt 5520 tatgatttag tcacaaaaaa cctccgaaag gcgcatgatc tatcaagtca caactacaat 5580 ctacgccatc gtaaaaatgc aaaacctttc aatgtaggtg acaaggtata taagcgcaac 5640 actcgacaat caaacgccag tgactactat aatgcaaaac tggggtctca atatcttccg 5700 tgcactgtag tggcaaagca cggctattcc tcttatgagt tgatggatag cactggccga 5760 aatatcggtg tttggccagc aaatttgata aagccagcat gaaacaaaag aaatcttcac 5820 acaatagtta taacaaccaa tcctaccgtt ttggagatca gcctaaaaag aaagaaacaa 5880 ctgttatata cttaggtacg acaccacatg tggtgatatc ggatggatac ttaccagctc 5940 ataaacgctt atcttaggga gttttacgcc gttttccttg gccattggtc ccgtatatag 6000 gtattttgct cattgtgaag gtccatcgaa gcccatgcgg gtccgaaaat agtcccatcg 6060 caaatacttt gcgaaaatcg aggcagcata ttagggtaca gaatcaaatc gcgcgcggta 6120 atcgatctgt gccgatcaaa tgataatggt tcactgaagc cgatgtggaa gccgagcttc 6180 atttctctcc tcactaatcg tccctctctc gccggataac ttcgccctga aaagccgatc 6240 ggaagagtag gcaaatagtc gagcagtttt atagcacttt tgaatctcct agacgaatat 6300 atataagtag gtaactgata cctgggtctt cggaaaagat tcgtcgcgtt agataacccg 6360 atctgcgaac caaatggtcg aggtgaagag gacttgatca tcatagttag tgccattttg 6420 atcgtgggca acctaggatc ggtcctcgga aggtagtttt cttcacgtcg cggtcgtcgc 6480 gtttgtaaat atgtatagat gtatgtataa gcagttcatg tgttcatatg ttactgaaat 6540 gttccggtac ataagttgtt tttccgttat ttacatagaa tattccgtta gttttgttcc 6600 tcattatttg catgtagtta acttatatta tacttacaac taagaaaata gtgtatggtc 6660 aggatctagt aaagaaacga ttctctgaaa ggtaatgttc tacaagagtc ggtcggcgat 6720 tagatttaaa gtattccata gcacgtatgg acgccccagt aacatcttag ctacaagtaa 6780 gcaaacagat aattatcgaa aaagaaatat tgtacatatt tctttttctc tccaaccaca 6840 ggcgagt 6847 // ID Kolobok1-1_HM repbase; DNA; INV; 2748 BP. XX AC . XX DT 06-MAR-2008 (Rel. 13.03, Created) DT 01-APR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Kolobok-type family. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok1-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2748 RA Jurka J. and Bao W.; RT "A distinct subgroup of Kolobok-type DNA transposons."; RL Repbase Reports 8(3), 168-168 (2008). XX DR [1] (Consensus) XX CC This is a distinct subgroup of the Kolobok superfamily which CC includes families identified so far in hydra, starlet sea CC anemone, wasp, aphid, red flour beetle, and sea urchin. Most CC elements are flanked by TTAA TSDs. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 363..1925 FT /product="Kolobok1-1-HM_1p" FT /translation="MTKPTDLGVILNKNINDRVKQRTNYYKKRRGFMVKKN FT KNACNVSECDSIERNILGASAKKVKVVTNVPSKNIKRITGYRIIDTEILSK FT VLSLLMCPQCEKCALYLGDHQKKKKGLATLLYLKCTTLSCDYKHEFYTSVA FT CDKGFDINNRTIYAMRVLGHGHSGIEKFTQLMNMPKPMTPKNYSKITTKIN FT KHVSDCAKSVANETMSDAATEMKENATSEDIVDVGVSCDGSWQKRGYQSMN FT GVFTAILIDSGKIIDVEPMNKSCKACCMKEQLKKSNPDAYANWKNKHICLY FT NYQGTAGGMESVGAVRVFKRSIEKHKLRYTQFLGDGDSKSHLSIKDIYPNM FT EVKKLECVGHYQKRVGTRLRKLKKNVKGLGGRGRLTNAVIDRLQNYFGVAI FT RSNTNDLNKMKQAVLASLFHVASSKENNYHIYCPTGGDSWCKFNIDKAKST FT SIYKPGPGIPLSDIIHKIKPIYADLSKDSELVKCLHGKTQNCNESFNNLIW FT DRLPKSNYVGLAQFRVWSVRCCCKL" XX SQ Sequence 2748 BP; 978 A; 357 C; 465 G; 947 T; 1 other; ggggcaacta ccctcacatt tttttaaaaa tttgattttt ttttttttaa taattttaga 60 ccaaaatttt atgcagattt tgaatctgca aaaaaaatca aacatttttt tataaatttt 120 tgtttaattt atgttttttt gaactcaagt aaaaaatatt tgcatagcaa cgcgttgcta 180 tgggggcgaa tattttgttt gtgaacttat cctggagtac tgtttcaaat atatgctttg 240 atttatatct tctttgctaa ttaatattct gttccttatt atatgccatt tacaattgct 300 actttaatac gattttatca agttatcaaa gctaatttat ataacagcag aaattaaaaa 360 atatgacaaa acctactgat ttgggtgtta ttttgaacaa aaatattaat gatagagtaa 420 agcagagaac aaattactat aaaaaaagaa gaggatttat ggtgaagaaa aataaaaatg 480 catgtaatgt aagtgaatgt gactctattg aaagaaatat tttrggcgca tcagctaaaa 540 aagtaaaagt tgttacaaat gtaccaagta aaaacattaa aagaattact ggttacagaa 600 ttatagacac tgaaatttta tctaaagttc tttcattgtt aatgtgtcct caatgtgaaa 660 agtgtgcact gtacttaggt gatcatcaga aaaaaaagaa aggtctagca acgttgctat 720 atttaaagtg tactacacta tcttgtgact ataaacatga attttataca tcagttgctt 780 gtgataaagg gtttgatatt aacaacagga caatatatgc aatgagagtc ttaggccatg 840 gtcattctgg cattgaaaag tttactcagc tcatgaatat gccaaaaccc atgacaccca 900 aaaactattc aaaaattaca actaaaataa ataaacatgt ctctgattgt gctaaaagtg 960 ttgcaaacga aacaatgtca gatgctgcaa ctgaaatgaa agaaaatgca actagtgagg 1020 acattgttga tgttggtgta tcatgcgatg gatcatggca gaaaagagga tatcagtcaa 1080 tgaatggagt ttttaccgca attttaatag atagtggaaa aataattgat gttgaaccta 1140 tgaataaatc atgtaaagct tgctgtatga aagaacaatt aaaaaaatca aatcctgatg 1200 catatgctaa ttggaaaaac aaacatattt gcctgtacaa ttatcaaggt acagctggtg 1260 gtatggaatc agttggcgca gtgagagttt ttaaacgctc catcgaaaaa cataagttaa 1320 gatatactca atttcttggt gatggggata gtaaatccca tctttctatc aaagatatat 1380 acccaaatat ggaggtaaaa aagttggagt gtgtaggaca ttaccaaaaa cgagttggta 1440 ctcgcttacg caagttaaaa aaaaatgtta agggtctagg tggtcgtggt cgactaacca 1500 atgcagttat tgatcgcttg caaaattatt ttggtgtagc tatccggagc aatacaaatg 1560 atctgaataa aatgaaacaa gcagtattag ctagcctttt ccatgtagct tcttctaagg 1620 aaaataacta tcatatttac tgcccgactg gaggtgatag ttggtgtaag tttaacattg 1680 ataaagcaaa gtctacttcc atatacaaac ctggaccagg aattccactg tcggatatta 1740 ttcacaaaat caagccaata tatgctgatt taagtaaaga ttctgaatta gtaaagtgtc 1800 ttcatggcaa aacccagaat tgcaacgaga gttttaataa tttaatatgg gatagattac 1860 caaaaagtaa ttatgttggt ttagcgcagt ttagagtttg gagtgtacga tgctgttgca 1920 aattataaca tcggtatgaa gtctgtcatt ttaatttatg aaaaactaaa tatgaaacca 1980 gggcatttta ctttagctgg ttgcagactc ataaacaaaa aacgaataaa actatcatta 2040 tttaaatcga gtgaaagttt taagatgaaa cgaaagatgt tacgacgtca acgattatca 2100 aaggatgatc atcaaaagtc aatagaaaag tccaaggaat attatgaatc tggtagtttt 2160 tgaaatattc ttttaaaaag ttattttttt aaagatattt ttatgaaatg aacaataatt 2220 tatgatttta ttttttgttt tgtgttttct gagaatttaa tttttaaaca cgccgggaaa 2280 aaaatcttga aaagtataca tgctatcatg atgaaatttt cagggatcgt tcattttacc 2340 aatgtttatg atatgagcgg agtagaattt ctcatatgac ttaagatttt gagtttttta 2400 gtttttttct ttgtattttg accttttttt taataatcta atttgactgt agctttttat 2460 caaaaaaaaa tttttattta aaaatctgct tcgctcatat cacaaattta catatttaga 2520 acagttctac aaaatttcat tgacactcct ttactcataa ttcttatttt ttttacggcg 2580 taaacacagt ttttgtgggt tttttagcta tttttttgag ttgccatagc aacgagctac 2640 cattttatgt attttttggt atttttaatg tcagtataat atttactata tgactaaaga 2700 aaaagttact ttatatgtgt tgttttgaaa tttgagggta gttgcccc 2748 // ID BEL-636_AA-LTR repbase; DNA; INV; 578 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-636_AA_; KW Pao_Bel_Ele148; BEL-636_AA-I; BEL-636_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-578 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 578 BP; 149 A; 106 C; 154 G; 156 T; 13 other; tgtttacgag gaataagacc cctagctaca cctatggaac tggctctgct tggtgagatc 60 ttcgaaccta cgaattagag aaattgggag aaaactggga gatgactact tcggtacctg 120 aaccgtttcg aatcgatgac gtcaggtaga tagttcgggt gtcggagtta ggttcagaga 180 tcgaacaaat gggttagttg gttgatgaga gatgcatagg cagatcagtt agctccktat 240 agtccggtcg cgcggtcacm tamactcgat gtcwgggcct aagttttaaw ctgttagtcg 300 taagtttcaa tgttagagtt taagttgcaa taaaagtgaa ctgtagagtg ttgtttccgt 360 ccaaataaat ggtgtaattg ctgtacgtcg cgtgtgaagt tatawttgtc atmttgwgga 420 aggaagagcc ggttttgtgg agaaawtcgc cagtgtcggg ackgtgtatt catccccata 480 cagtccacat caaacccatt agatcgcagg aagggcgatt gggatttcta cctggaatcg 540 gawkcktttg gttggtaggc cccacataca ccccgaca 578 // ID R2_DPe repbase; DNA; INV; 3533 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE persimilis. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_DPe. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3533 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 121..3330 FT /product="R2_DPe_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="SSFGLIVTNLNSETVLWGCQPLGQFSLIGTNMQNTTP FT RIINTNSLTNQIPTVSSLGAQSEHSAQVNPNSGYQCTICESSFRSKSGLGV FT HMSRRHKDEFDQLRLRTDRKAQWSEEELSMMARKEIELAANGERYLNKKLA FT EVFTNRSVDAIKKCRQRERYKTKIEQLKGQAVPLPEALESETIQRRPSIRE FT RDLLVTPPNTLGTTPTELSNSEILAVLQGYPPVVCNDQWRVEVLQSIVDGA FT QASGKEITLQRLSTYLMEVFPSQNDRPIQTRPPRRPRNRRQGRRQQYALTQ FT RNWDKHKGRCIKAILDGTEGTATMPSQGIMGSYWRQVMTQTSPTYSGTNTT FT FRTEHPLEGVWSPITLGDLRVHRVSLTKSPGPDGITPRTVRSIPSGVMLRI FT MNLILWCGKLPVSIRQARTIFIPKVGNASRPQDFRPITVQSVMVRILNAIL FT ASRLTSSVDWDPRQRGFLPTDGCADNTTIVDLILRDHHKRCKSLYIATLDI FT SKAFDSVSHAAVSATLTAYGAPKEFVDYVQNSYEVCGTTLNGDGWRSEEFI FT PARGVRQGDPLSPIIFNLIIDQLLRSYPNEIGATIGDHTTNAAAFADDIVL FT FAETRLGLQTMLDTTVDFLSSVGLTLNSDKCFTVGIKGQPKQKCTVVIPET FT FRIGSRSCPALKRTDEWKYLGITFTAQGRTRYSPADDLGPKLLRLTRSPLK FT PQQKLFALRTVLIPQLYHKLTLGSVMIGVLRKCDILVRSTVRKWLGLPLDV FT STAFFHAPHTYGGLGIPSVRWVAPMLRMKRLSNIKWAHLAQSEAASSFLTD FT ELNKARGRTLAGLNELTSRTEIETYWANRLYMSVDGRGLREAGLFRPQHGW FT VCQPTRLLTGQDYRNSIKLRINALPSRSRTTRGRNELERQCRAGCDAPETT FT NHILQNCYRTHGRRVARHNCVVNNLKRILEEKGHTVHVEPSLQLETSVSKP FT DLVCIRDNHACVIDAQIITDGLFLDDVHHRKVEKYKRPEVISALRREFGVS FT GNVEVLSATLNWRGIWSNQSVRRLIAKGLISSGDSNVISARVVTGGLYCFR FT QFMYLAGYTRDWT" XX SQ Sequence 3533 BP; 1007 A; 805 C; 870 G; 851 T; 0 other; aagatatgga tctgaataat agcgtagaag gggagtcatt ccgtaattcg taaatcgtaa 60 aaatcagatc aagttgattc aagacctcct cgtggtatct tctggatgct attagactga 120 agttcttttg gtctaatagt aactaacttg aacagcgaaa cagtcctatg gggttgccag 180 ccccttggac agttcagttt gattggcact aatatgcaaa atacaacgcc tcggataata 240 aacactaatt cgttgacgaa ccaaatccct acggtctcta gcctaggggc ccaatctgaa 300 catagtgcac aggttaaccc aaacagtggt taccaatgca cgatatgtga atcgtctttc 360 cgtagcaaaa gcggactagg cgttcacatg tcacgtcggc acaaggacga gtttgatcaa 420 cttcgtctgc gtaccgaccg taaggcacaa tggagtgagg aagagttgag tatgatggca 480 agaaaagaga tcgagctcgc agcaaatgga gaaagatatc taaataagaa gctagcggaa 540 gtatttacga accgtagcgt cgacgctatc aagaaatgtc gacagaggga gagatataag 600 accaaaatcg aacagctaaa gggtcaagct gttcctctcc cagaagcatt agaatctgaa 660 accatacagc gccgccctag tatacgcgag cgagatctcc tagtaacgcc acctaacact 720 ctcggaacca ctccaaccga actgtcgaac agtgaaatcc tggcagtact acaggggtac 780 ccacctgtag tatgcaatga ccaatggaga gttgaggttt tgcaatccat cgtagatggt 840 gcgcaggcct cgggtaagga aattactctt cagcgcttgt ctacttacct tatggaagta 900 tttccctcac agaatgaccg ccccattcaa acgagacctc cacggagacc tcgtaatagg 960 agacaaggta ggagacagca gtacgcctta acacagcgta actgggacaa gcacaaaggt 1020 cgttgtataa aagccatttt ggatggaact gaggggacag caactatgcc aagtcaaggt 1080 atcatggggt cctattggag acaagtcatg acacaaacaa gcccaacata tagtggtacg 1140 aacaccacgt tccggacgga acacccactt gaaggggttt ggtccccgat aacactaggg 1200 gacctaaggg tacacagagt gtcattgacg aaatctccag gacctgatgg aattactcca 1260 agaactgtca ggagtattcc gtcaggagtt atgcttcgca taatgaacct gatactttgg 1320 tgcggaaagt tgcctgtctc catccgacag gcacgaacca tcttcattcc gaaggtgggg 1380 aatgcttctc gaccgcaaga ctttcgtcca attacggtac aatctgttat ggtaaggatt 1440 ttaaatgcca ttttggcttc ccggttgacc tcatcagtcg actgggatcc gcgtcagcga 1500 ggtttccttc caaccgacgg atgtgccgat aatacgacga tagtcgactt aatcttaagg 1560 gatcaccata aacgttgtaa atcactttat atcgcaactt tagatataag caaagcattt 1620 gactcggtgt ctcatgcagc agttagcgcc actctaactg catatggtgc ccctaaagaa 1680 ttcgttgact acgtacaaaa ttcgtacgag gtctgtggca caacgctcaa tggggacgga 1740 tggagatcag aggaattcat acctgctcga ggtgtcagac agggtgaccc gctatctccc 1800 ataatattca acttgatcat cgatcagttg cttaggtcct accccaatga gattggtgcc 1860 acaatcggtg atcacacaac aaacgcggcc gcgttcgcag atgatattgt cttatttgcg 1920 gaaactcgtt taggccttca aacaatgcta gacacgactg tcgattttct atcttcagtc 1980 gggcttaccc ttaactcgga taaatgtttt acagttggaa taaaggggca accgaaacag 2040 aagtgtactg tggtcatccc agagaccttc cgtatcggtt cgcgctcgtg tcctgcattg 2100 aagcgcacag acgagtggaa gtatttaggc ataacattca ctgcacaagg gaggaccagg 2160 tacagtccag ccgacgacct aggtccgaag ctgttgaggc tgacaaggtc ccccctaaaa 2220 ccacaacaga aattgttcgc actcagaaca gttcttatcc cacaacttta ccataagctg 2280 accctaggta gtgtgatgat aggtgttctg agaaagtgtg acatactggt acgttcgacc 2340 gtaaggaagt ggttagggct tcctctggac gtgtcaactg cattcttcca tgctcctcat 2400 acttatgggg gcctcggaat cccttcagtt cgttgggtag cgccaatgct acgtatgaaa 2460 agattgagca atattaagtg ggcccacctc gcgcaatccg aggcggccag ctcatttctt 2520 accgacgaat tgaataaggc ccggggtaga actctggctg gactgaatga gttgacatcg 2580 cgtacagaga tcgaaacgta ctgggcgaac aggttgtata tgtctgttga tggtcgcggc 2640 ttacgtgaag cgggactttt tcgtccccaa cacggctggg tgtgtcagcc cacgcgtttg 2700 ctaacaggtc aagattaccg aaacagtatc aagctgcgaa taaatgccct accatcgagg 2760 tctcgtacca cgaggggcag aaatgaattg gaacggcaat gtcgtgcagg ttgtgatgct 2820 cccgaaacaa caaaccacat cctgcagaat tgttaccgta cgcatgggag gcgggtagca 2880 agacataact gtgtagtcaa taaccttaag aggattcttg aggagaaggg ccacacagta 2940 cacgtcgaac caagtttgca gctggaaacc tcggtaagta aaccagacct ggtgtgtatc 3000 cgtgacaatc acgcttgcgt gattgatgcg cagattataa cggatggact gtttctcgac 3060 gatgtgcacc atcgcaaagt tgagaaatat aaaagaccgg aagttatatc tgcactgcgg 3120 agagaattcg gagtgtcggg caacgtcgaa gtcctatccg cgacgttaaa ctggcgtggg 3180 atctggagca atcaatccgt tagaagattg atagcaaagg gtctcatctc atccggtgac 3240 agcaatgtca ttagcgccag agtggtaaca ggcggactat attgcttcag acagttcatg 3300 tatctcgcag gttacactcg agattggact tagcctatac actatgttgg agagaagacg 3360 cttgctacct aggcaaaatg tgaaattagg tataaacatc gtggttgtaa aacttgaggt 3420 gggtttttag tacgtatgcg tgattacttc gtaatcatga atcgtgcatg ctagtggggt 3480 ttggcctcca ctagtatctt tgaagatttt ccttcctcag cgatcaaaaa aaa 3533 // ID Gypsy-21_DYa-I repbase; DNA; INV; 5041 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_DYa_; KW Gypsy-21_DYa-LTR; Gypsy-21_DYa-I. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5041 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 1376980 1382020. XX CC Positions [4059-4535] - Integrase core CC 'ATTT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 644..1720 FT /product="Gypsy-21_DYa-I_1p" FT /translation="MYDLRPTSLIRNLPGGNADNSNISTNTIDYGSSGNVN FT ISNAPTIAIGNGLGGNPDGNQEERPFTRAGSTVSLSHAKEAAIDFEGNICA FT RNWVEHLKNIGLIYNLTDDCLRMLFVTKLKGNAQRWLHVNPTRMLETFERL FT CEQLIVAFGETASKSKLRRRFEQRKWQRNERFTMYFEEKKMMSQTVNMDME FT ELLEHLIEGIPSASLRDQARIQRFANPEQMLLAFANVRLPQQAGSYVPKKS FT TSEAVVNNLRCRNCNSKGHFAKDCRKPARAPGSCFACGEMGHFVGECRLRK FT SNINSNTNNYVRRLKIFFVYFTNLPIIAECLIDSGSAISFIKESKIPVEVA FT LKAVNQVYHGINKSQL" FT CDS 2982..4928 FT /product="Gypsy-21_DYa-I_2p" FT /translation="MDKCEFMQSSIKYLGFLITKEGIRADDKGIEAITNFP FT IPGKVHNVQSFLGLCSYFRRFIKDFSTLAKPLYDILRKDKEFVFEQRELDC FT FLNLKKKLIEAPVLAIYDHKDEVELHCDTSATGFGAVLMQRKEDGKLHPVF FT YFSKRSSETKAKYHSFELETLAIIYSLRRFRIYLQGRHFKILTDCDSLTLT FT LNRVELNPRIARWALELQNYDYELIHRSGTKMQHVDALSRCNILIVETTSF FT EDNLLICQSKDLKIQKIKEDLEKNQHKLFEMRNGVVYRKTNDDRLLFFVPE FT EMENHVLHKYHNELGHIGRDKMIDAISRTYWFTNIKEKCKDHIENYLKCVA FT FSPNTGKSEGFLHSIPKGDKPFELLHIDHYGPVAAGRANKHIFLVVDGFTK FT FVKLYTTKTTSTKEVLKALSDYFRAYSKLKCIVSDRGSWFTSNEFAQFLTE FT SNIKHLKIATGSPQANGQVERINRTIGPMIAKLTEPENGIHWDNVIEQVEY FT AINNTVHRSIKQVPSKMLFGVEQKGQIIDEFRERLEEINDTVQVENLEEIR FT KRARDNQNRAQAYNETYYDKTKKKPKYYRKGDYVMVKNFDSTAGVARKLIP FT KNKGPYVIDKILRNDRFLLKDVEGFQVSRNPYKGVWSAQNIRPWIGKRTNS FT " XX SQ Sequence 5041 BP; 1851 A; 912 C; 1038 G; 1240 T; 0 other; aattcagaag tggtcgaacc tacgaaatga atcaggccga tctcaacaaa ttttccgtca 60 cccaattaaa aaggtggctt tcggcgttgg gtttaccaac ccaaggtatt aaatcggagt 120 taatggctcg cctagggcaa gtaccgcttg tacagcgagg aaatatgcca gaagaagatc 180 ggcccggcac cggaccgcaa agcatcatcg atcctgaaaa tgaaacaaat tcgtcacagg 240 aaggcgtttc cgagaatgtt ccagaacaaa atgaacttct cggaattatc acacagctac 300 gaacaacgct tattacgcag gaaaatgcgt tgcatcaagc agagacgcgt agcgcattgc 360 aaaaacaaca gctagagctt cttagcgccg ccatacatac acaacagcag aacgagacac 420 gcgccggcac acatacacaa cagcaagaga cgcgcgccgc cacacacata caacagcaag 480 aggagacgcg cgccgccaca aacacacaac agcaagagga gacgcgcgcc gccacacaca 540 cacaacagca agagaagcgc gccgccacac acacacacaa cagcaagaag agacgcgtgc 600 cgccacacac acacagttac aacaagagca gacagtttcc aacatgtacg atttgcgccc 660 taccagccta attagaaatt taccaggtgg caacgctgac aattcaaaca tctcaacgaa 720 cacaatagac tatggatcaa gtggcaacgt caacatttcg aatgccccaa caatcgctat 780 tggaaatgga ttgggtggca accctgatgg caaccaggaa gagaggccat ttaccagggc 840 tggatcaaca gtttcacttt ctcatgccaa agaagctgcc atcgatttcg aagggaacat 900 ttgcgcacgc aactgggtgg agcacctcaa aaatatagga ctgatctaca atctaacaga 960 tgactgccta cgaatgctgt tcgttaccaa acttaagggt aacgcacagc gttggctaca 1020 cgtcaaccca actcggatgc tagaaacttt cgaaaggctg tgtgagcagc tgatcgtagc 1080 atttggagaa acagcgtcga aatcaaagct gcgccgaaga tttgaacagc gaaaatggca 1140 aaggaacgag cgatttacca tgtacttcga ggaaaagaaa atgatgtccc aaaccgtcaa 1200 catggacatg gaagaactgc tcgagcatct cattgagggg atcccatcgg cgagcttacg 1260 cgatcaagct cgcatccaaa gattcgccaa tccggagcaa atgctgctag catttgccaa 1320 tgtacgcctc ccgcaacaag ctgggagcta cgtaccaaag aagtcaacat cggaggcggt 1380 ggtcaacaac ctgcgttgca ggaactgcaa ctccaagggc catttcgcca aagactgccg 1440 caaaccagca agggcacctg gatcttgctt cgcttgtgga gaaatgggcc attttgtggg 1500 agaatgtcgt ctaaggaaaa gcaacatcaa cagcaacact aacaattatg taagaaggtt 1560 aaaaattttc tttgtttatt tcactaattt acccattatt gcagagtgtc tcatagattc 1620 agggagcgcg atatcattca ttaaagaatc caaaattcca gttgaagtag ctttgaaagc 1680 agttaaccaa gtgtatcatg gaataaacaa gagccagctt taaatatttg aaaaaacttt 1740 atgtttcata ttaaaagaaa aaataaaaat tcaattcgaa atggttgtcg tcgctaatga 1800 atcaatgaga tacgatgtag tcctgggaag ggattttatg aatcaatatg gtttgagcat 1860 caatctagac acccttggcg ttaatgaaga tattaataat atgattaaaa tatttgggga 1920 aaatgatcaa caactggcaa atgccgtagt agccaattgc gatgtcagcg aaactgttag 1980 ggaaaatgaa actcttgagg aaaaactgtt aggaaatgaa actgttacgg aaaatgagat 2040 tgttagggaa aaactgttag gaaataaaac tgttacggaa aatgagatcg ttaggatatg 2100 catcttttaa ggaaaaattt ttaaaaaatg aaacgtttac ggaaaaattt gtagaaagtg 2160 aaactcttaa gagaaaattg ttagagaatg aaacttttag gaaaaaaatg tcagaggata 2220 caactgatga gatacaacga agtgaaagaa tagaaatttt aggagacaac tttgaaactg 2280 aaatgctgaa catcaaattt gttgatgatc agtttttaga ctataactgc gggtgtgatg 2340 taagttatga aacgacatgc aaatttgtta aaatgttcga agaatcttac gtcaatgcga 2400 agagaccaga tagtccattt accagatgcg aaatgaaaat tagtttagaa aaatccaaac 2460 cattcagttg ctcgcctagg cgactttcat ttagtgaaaa ggagcagtta caaaaattat 2520 tagacgaata tttagacaaa ggcataataa gaaacagtga ttcagaatat gcttccccct 2580 ttgtgctagt gaagaaaaag actggagact tgcgtttatg tatagactat agaaaattga 2640 acaaaatttt aataaaagac aattatccgt tacctttgat agatgatctt ctagacaaga 2700 gatttaaaga atggatattt tcacgtattc gttgacaatg attcaattaa atacacatcg 2760 ttcgtgactc cgttaggaca atttgaattt ttaaggatgc ctatgggcat caagaacgca 2820 tcagcagtat tccaacgctt tgtaaacaaa atttttgctg atttgatacg cgaaaacaaa 2880 gttatcgtat acatggatga cataatgata gccagtgcaa acatggaaga acattaaagg 2940 atgtgttcat aagactagtc aataacaaat tagaacttcg tatggacaaa tgcgagttta 3000 tgcaatctag tataaagtac ctaggatttc ttatcactaa agaagggatc agagcagatg 3060 acaagggtat cgaggcaatt acaaactttc caattcctgg aaaagtacac aacgttcaaa 3120 gttttctagg attatgttct tattttagaa ggtttataaa agatttctct acgttagcaa 3180 aaccacttta cgacatttta agaaaagaca aggaatttgt ttttgaacaa agagagctag 3240 actgtttctt gaaccttaag aaaaaactga tagaagcacc cgttttagct atttatgatc 3300 ataaagatga ggttgaattg cattgcgata ccagtgctac gggttttgga gcggttttaa 3360 tgcaaaggaa agaagacgga aaactccatc ccgtattcta tttttcaaag cgaagttcag 3420 agactaaagc gaagtatcac agctttgagc tggaaacttt ggctattata tactcgttgc 3480 gtagatttcg aatctattta caagggagac atttcaaaat actaactgat tgcgattcct 3540 taacgcttac attaaacaga gtcgagctaa atcctagaat cgcccgatgg gcactcgaac 3600 ttcaaaatta tgattacgaa ctgattcata gatccgggac aaaaatgcaa cacgtagatg 3660 ctctgagcag atgcaacatt ttaattgtag aaaccactag ttttgaagac aatttattaa 3720 tctgccaatc aaaagattta aaaatacaga aaataaagga agatttagaa aaaaatcagc 3780 ataaactgtt tgaaatgaga aatggagtag tgtacagaaa aacaaatgac gatcgtttat 3840 tattctttgt cccagaggaa atggaaaatc acgttttgca taaatatcat aatgaacttg 3900 gtcacattgg tagagataaa atgatagatg cgatttcgag gacttattgg ttcacaaaca 3960 tcaaggaaaa atgtaaagat cacattgaga attacctaaa gtgtgtagca ttttctccaa 4020 acactgggaa gtcagaaggt ttcctacaca gcattcctaa aggagataaa ccatttgaat 4080 tattgcacat agaccattac ggaccagtag cggcaggaag agcgaataag cacatatttt 4140 tagtagtaga cggtttcacg aagttcgtta aattatatac aacaaagaca actagtacca 4200 aagaagttct aaaagcttta tccgattatt ttagggcata tagcaaacta aaatgtatag 4260 tttcagatag aggtagttgg ttcacatcaa acgaattcgc acaattttta acagaatcaa 4320 acattaaaca tctaaaaatt gcaacaggtt ctccgcaagc aaacggacaa gtggaacgca 4380 taaacaggac aattgggcca atgatagcaa agttaactga gcctgaaaat ggtattcatt 4440 gggacaatgt aatcgaacaa gtagaatatg cgatcaataa cactgtacac agaagtatta 4500 aacaggttcc tagtaaaatg ctctttggtg tagagcagaa aggtcaaatt attgatgaat 4560 ttagggagag attagaagaa attaacgaca cagttcaggt agaaaactta gaggaaatta 4620 gaaaaagggc tagggataat caaaatcgag cacaggcata taacgaaaca tattatgata 4680 aaacaaagaa aaagccaaag tattatagaa aaggcgacta cgtaatggtg aaaaattttg 4740 acagtactgc gggagttgct aggaagttga tcccaaagaa caaaggccct tacgtaattg 4800 ataaaatact tcgtaacgac aggttcttgc taaaagatgt tgaaggtttt caagtttctc 4860 gaaatcctta caagggtgta tggagtgctc aaaacataag accttggata ggaaaacgaa 4920 caaattctta aaggaaatat acactaaaat accgtaattc gttaaatttg ccatataaaa 4980 taaacaaatt gttataacat aaatgtttag ttctaagaaa ccagttctaa gatggccgaa 5040 t 5041 // ID MSAT-1_AAe repbase; DNA; INV; 560 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Minisatellite-type sequence: consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-560 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1447-1447 (2011). XX DR [1] (Consensus) XX SQ Sequence 560 BP; 192 A; 108 C; 131 G; 129 T; 0 other; gaaggaactt ctagaacaat tcctaaagga actcctggaa aaattcctga aagaacccct 60 gcagaaattc ctgagaaata cctgaagaaa ctccagtaga aattcctgaa gctactacgg 120 gaggaattcc tgaaaataac cctgaaggaa ttcccgaagg aactcctgca ggaatatttg 180 gagtaaaatt tgaaggaact cctggaggaa tttccgaaca acctccagac ggaattcctg 240 aaagaactcc tggaaaaaat cctgaagaat tcctggagga atccctagga aacttccgga 300 ggaattcccg aagggtttcc tacaggaagt cttggaggaa ttcctgaagg aactccctga 360 tggaatacgt gtaggatgtc ctgaagtaaa tcatcgaaga tttcctgaaa gatttcctgg 420 agtaaatcct gaaggaactc ctggaaaaat tcttgaagga aatcatggat gaattcctgg 480 aggaattgtt gaaggaaacc ctggaggatt tcctgaaaga tctcctggag gaattcctga 540 aggaacttct ggaagtatat 560 // ID Copia-11_AA-I repbase; DNA; INV; 4146 BP. XX AC supercont1.46; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_AA_; KW Copia-11_AA-LTR; Copia-11_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4146 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.46; Positions 2094762 2090617. XX CC Positions [1474-2001] - Integrase core CC 'AGGAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 109..4122 FT /product="Copia-11_AA-I_1p" FT /translation="MEALNKFVRLNNHNWQTWKFRMEMLLTREDLWYVVET FT PKPDENAIQWEKDDKKARATMGLCIEENQYGLIKSAATATQFWNNLKAYHE FT KVTVTSRVSLLKRLCSLNLLEGGDLESHLFEIEDLFDRLTQAGQELNDSLK FT VAMILRSLPDSYSGLVTALESRPDADLTVVLVKSKLLDEFERRKERSVPGS FT SGRSGELKALKSFGKRNMSVSDRVCFFCKQKGHLRNDCAVLQEQQKRKSER FT KKKKNEPSAAKQAVTENKSAVLFLVNGIRSSGWIIDSGASVHMSNDKRFFE FT SIDETDVHDVILADGRSVKSVGCGTVKMSGVNGFGGAVDVILEKVLYVPSL FT NSGLLSVTKLTSKGFLVVFREDDCDITDASGKVVAKGDRCGSLYQLRMAEE FT ALKVEKMHHNESCQHQWHRKLGHRDPAVVGQLCSKKLAEGVVVKDCGIRIA FT CETCMEGKLSRAPIPHQAERKSTQVLDLVHTDLCGPTKTPTPSGSRYLMTV FT IDDFSRYTVVYLLQKKSEAVDRLIEYVRYVETLFNRKPRMVRSDGGGEYVN FT ERLQNFFKSEGIKCQYTTAYTPQQNGVAERRNRYLQEMATTMLLDAKLEKK FT YWGEAVVTAAYLQNRLPSRAVDVTPFELWYGRKPNLEHLRVYGCKAWVHIP FT DVKRGKFDSKARKLTFIGYSEQHKAYRFVDLATEKVTISRDARFLEMDEAQ FT RQEEVTVLQAPDENYTEFVPCNVPSTSKQFLVEAPVKETVEPDRCERLGDL FT QDDDEVHSEHEQFFDISSSSLEDPLAVKDEPLDSEEAEPDPREQATGKLPR FT RNRGKLPDRFSDYVVGVATQQAKEPVTFDEAVKSSNSDAWRKAMDEEFQSH FT LRNGTWELVNPPAGKRVIGSRWVYKIKQDETGAVVRFKARLVAQGYAQRYG FT VDFDEVFAPVTKYATLRVLLVLAGKDRLCLKHLDVKTAYLYGSIEEEVYMR FT QPVGYEVRGQEEKVCRLRRSIYGLKQSARCWNRKLSDVLLKFGFVACNADP FT CMYVASRNGRKVFLIVYVDDLLVGCSSEEEVAKVYEELRKEFEICWLGDAK FT YFLGLELQKEDDGFYSMSVKAYIEKLIGKLGLQDAKVARTPMDQGYVKDKT FT ASENLKDVSQYRSVVGAVMYIAVTARPDIAAAASILGRKFSAPTEIDWTTA FT KRVVRYLKATKDWKLKLGCEGGKLLEAYSDSDWAGDPDTRKSTTGFVIFYA FT GGAVSWASRRQDCVSLSSMEAEYVALGETCQDVLWLRRLLSDIGENQECAT FT TVHEDNQGCLCFVRSERTSKRTKHIETKESFIKDLCDRGIVKLQYCPTDEM FT KADILTKALGNIKHDRFTELLGFVKG" XX SQ Sequence 4146 BP; 1124 A; 802 C; 1240 G; 980 T; 0 other; ggttatcggc ccagtcgatg tgtcgtgaaa aatagtggat agagtcggaa aatcggttga 60 taaagtgccg tcgcgtgtgt tgttttgtcc cagcagatta gagcaacaat ggaggcactc 120 aataagtttg tccggttgaa taaccacaac tggcaaactt ggaagttccg tatggagatg 180 ctccttacga gagaggattt atggtacgtc gtggaaactc cgaagccgga cgaaaatgct 240 atccagtggg agaaggacga taagaaggcc cgagcgacta tgggactctg tatagaggag 300 aaccaatacg gtttgataaa atcggctgcg actgctacac agttttggaa taacctcaag 360 gcgtaccacg aaaaggttac agttacttcc agagtgtcct tactgaagag gttgtgtagc 420 ctaaatctgc tggaaggtgg cgatctcgag agccatctgt tcgagataga ggatcttttc 480 gatagattga cgcaagctgg gcaggagctg aatgattctc tcaaagttgc aatgatattg 540 cgaagcctgc cggattctta ttccggactt gttactgcgc tcgaaagcag accggatgca 600 gatctaacgg tagtgttagt taagtccaag cttctcgacg aattcgaacg tcgaaaggag 660 cgttctgtac ctggttccag tggacgttct ggtgaattga aggcgctgaa aagtttcggc 720 aagcggaata tgtccgtttc ggatcgtgtg tgcttctttt gtaaacagaa gggacacttg 780 agaaatgact gtgcagtgct gcaggagcag cagaagcgta aaagtgaaag gaagaagaag 840 aagaatgagc catcggccgc taagcaggct gtgaccgaga acaaaagtgc tgtgttgttc 900 cttgtgaatg gaattcggtc tagtggttgg attatagaca gtggtgcaag cgttcacatg 960 tcaaacgaca agcgattttt cgagtcgatt gatgaaacag atgtgcatga cgtgattctg 1020 gctgatggaa ggtccgttaa gtcagtcgga tgtggaactg tgaagatgtc tggagtgaac 1080 ggcttcggtg gtgcggtcga tgtaatcctg gagaaggttt tgtacgttcc atcgctaaac 1140 agtgggttgt tgtctgtgac aaagctgacg tcgaagggat ttctggtcgt gttccgtgag 1200 gatgattgcg atatcacgga tgcctccggc aaggtagtag ctaagggcga tcgctgtggc 1260 agtttgtatc agctgagaat ggccgaagaa gcgctgaagg tagagaagat gcatcataac 1320 gaatcctgtc aacatcagtg gcaccgtaaa cttggccaca gagacccggc agttgtgggc 1380 caactgtgct cgaagaagct tgctgaagga gtggtagtga aggattgcgg aattcggatt 1440 gcttgcgaga cctgtatgga aggaaagtta tctcgtgctc cgattccaca ccaggctgag 1500 cggaagtcaa cccaagtgct cgacctggta cacacagacc tttgtggtcc gacgaaaacg 1560 cctacaccga gtgggagcag gtacctaatg acggtgattg atgattttag tcggtacact 1620 gtagtctact tgctgcagaa gaagtcggaa gctgttgatc gtttgattga atacgtcagg 1680 tatgtggaga cattatttaa tcgtaaacct cgtatggtac gctcggacgg tggcggcgaa 1740 tacgtcaatg aacgactgca gaatttcttc aaaagtgagg gaataaagtg ccagtacact 1800 acagcctaca cgccccagca gaatggtgtc gctgaaagac gaaatcgcta cctgcaagag 1860 atggcgacaa cgatgctgct ggatgcaaag ctcgagaaga agtactgggg tgaagcagtc 1920 gttacggcag cgtacctcca aaatcgactg ccttccaggg ctgttgacgt tactccgttc 1980 gagttgtggt acggtaggaa gccaaacctg gaacatttgc gagtgtatgg ctgcaaggcg 2040 tgggtacata tacctgatgt caagcgaggc aaattcgaca gcaaggcacg aaagctgacc 2100 ttcatcggct actccgaaca gcacaaagca taccgttttg ttgatttggc tacggaaaag 2160 gttaccatca gcagggatgc caggttcctg gaaatggatg aagctcaacg acaagaggag 2220 gttactgtgc tccaggctcc tgatgagaat tatacggaat ttgttccgtg taacgtacca 2280 agcacaagta agcaattttt ggtggaagcg cctgtgaaag aaacggtgga acctgatcgg 2340 tgtgagcgtt tgggtgatct tcaggatgac gacgaagtgc atagtgaaca tgagcagttt 2400 ttcgacatta gttcttcatc tttggaggat ccgttagcag tgaaagacga gccacttgat 2460 agtgaagaag cggaaccaga tccacgtgaa caagctacag gcaagctgcc ccgtaggaat 2520 cgtgggaagt taccggatcg tttttcggac tacgtcgtag gcgtagcaac gcagcaagcg 2580 aaagagccag ttaccttcga tgaggcagta aaaagcagca atagcgatgc ttggcgaaaa 2640 gcgatggatg aagaatttca gtcgcatttg cggaatggta cgtgggagct tgtgaatcct 2700 ccggctggaa agcgcgtcat tggtagccga tgggtttaca agatcaagca ggacgaaaca 2760 ggagcagttg ttcgattcaa ggctcgactg gtggcgcaag gatacgcgca gcgatacgga 2820 gtcgattttg acgaagtgtt cgcccctgtc accaagtatg caaccttgag agtattgctt 2880 gtgctggcgg gcaaagatcg tttgtgtttg aagcatctgg acgtcaaaac tgcctatctt 2940 tacggtagta ttgaagagga ggtgtacatg cgtcagccgg ttggttatga agttcgcgga 3000 caagaggaga aggtttgtcg tttgaggcgc agcatatacg gcctgaaaca atcggcaagg 3060 tgctggaacc ggaaattatc ggacgtccta ctaaaatttg gcttcgtcgc gtgtaatgct 3120 gatccgtgta tgtatgttgc atcgaggaat ggcaggaagg tatttctgat cgtttatgtt 3180 gatgatctgc ttgttggatg ctcgtctgag gaggaggtag ctaaagtgta cgaggaactt 3240 cgcaaagagt ttgaaatttg ttggctaggt gatgcgaagt attttcttgg gttggagctg 3300 caaaaggaag acgacggatt ctacagcatg agtgtcaagg catatataga gaaactgatc 3360 ggaaagctag gattgcaaga tgcgaaggtt gcaaggacgc cgatggatca agggtatgtg 3420 aaggacaaga cagcaagcga gaatctaaag gatgtatccc agtaccggag cgtggttggt 3480 gctgtgatgt acatagcggt tacagcacgt cctgatattg ccgcagcggc ctccatatta 3540 ggaaggaagt tcagtgctcc aacagaaatc gactggacta cagcaaagcg tgtagtgcgc 3600 tacctcaagg caacaaagga ttggaagttg aagttgggtt gcgaaggcgg caagctgttg 3660 gaagcatatt cggactctga ctgggcaggt gacccggaca cacgcaagtc tacgacgggc 3720 ttcgtaatat tctacgcagg aggagctgtg tcgtgggcta gtagacgcca agattgcgtg 3780 agtctatcgt ccatggaggc cgagtatgta gcactgggtg agacttgtca agatgtgcta 3840 tggttgcgac gattgctgag tgacatcggc gagaaccagg agtgtgctac tacggtgcac 3900 gaagacaacc agggttgcct gtgtttcgtc aggtctgagc ggacaagcaa gaggaccaag 3960 catatcgaga cgaaggagag tttcatcaag gatctatgcg atcgtggcat agtgaaactt 4020 cagtactgtc ccacagatga aatgaaggca gacattttaa cgaaggcact tggaaatata 4080 aagcacgacc gcttcaccga actactaggg tttgttaagg gctaagggca aaccgttgag 4140 gaggag 4146 // ID Copia-36_DPu-I repbase; DNA; INV; 4387 BP. XX AC ACJG01003741; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-36_DPu_; KW Copia-36_DPu-LTR; Copia-36_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4387 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01003741; Positions 3931 8317. XX CC Positions [1739-2260] - Integrase core CC 'TTGGT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 449..2290 FT /product="Copia-36_DPu-I_2p" FT /translation="MPVIIGTFYEMNSIHTRKQEQPEFLNNNEFSLKLLPH FT YRYRPGDTIMAHITAVETLAGQLIDVGVPVGDEDIISKIVCNLPLSYRTFQ FT TTWASTNPAEQTRALLTTRLVNEERAIQRHQQAQESSGATNSTSNNEALYS FT NSSQSQRGRLNRRDSGRLHNRPRQMQGDRPRCTYCTLPGHKAEECRTKKRD FT ERNSERQGHGDRANTAQTERNATETEWALISAACLKACKKGKWYADSGATQ FT HMSEDRSAFNNYQAIPEGTWTVDGIGNTKLQVVGKGDIQVESRIPGIFHTG FT TIKNVLHVPHLGVNLISIAALTNLGAEVAFTANKINVTLNNQIVMTGERSG FT MSLYELEIQTAPTKMSQSAEKATTSNANAVALKAASFNTWHKRLAHVNYQT FT IQRMMRHEAVEGITLKENSEIPKEICQGCALGKMHKLPFLPGTRKVTEIGE FT LVHSDICGPMQTPTVGGAKFFVLFKDEHSSYRVAHLLKHKSEVPDRFQEYV FT HTLNFETGKKVATLRSDNGGEFTSNAFESWLKENHIRHETSVPHNPQQNGA FT AERDNRTIVESVRCLIHERNLPLWLWGEALMYAIYTLNRTLPQNKKITPYK FT SGTTKNQTWLTSENSE" FT CDS 2164..3519 FT /product="Copia-36_DPu-I_1p" FT /translation="MGRSTHVRHIHTQSNIATKQENNAIQIWHNKKPNVAH FT LREFGVKAYAHIPDAERQKLDAKSKEYIMVGYAETQKAYRLCDRTTRKIHI FT SRDVIFEEDTTVSSTHLEETEPFEPLTPSTNDKEKDQTEQPKTTKRSSSRQ FT HQPKRLFPMEQGSYDTKKRKEKDSATTTVSLKSLSLNSYVEPNHYQDALSS FT PDAAMWKEAIEDEIQSLIENETWELAQLPKGRSVIDTLWTFKKKIGIDGEK FT DRYKARYCAKGFTQLKGIDYQETFSPVVKHTTLRTMLSIATVLDLDIIQLD FT VKTAFLHGEIEEETYIKQPEGYVIPGKENEVCRLKKCIYGLKQSSRVWNQK FT FHKFLTSFGLKSTAADPCLYTCQDKDGIILVAIWVDDGLVCGSKMESINKV FT IEHLSVHFKMTSKTAKHFVGLEITRDRQHRKTFVSIPNYIEKTTKIQHGQL FT QSQGYTR" XX SQ Sequence 4387 BP; 1596 A; 1043 C; 892 G; 856 T; 0 other; ggttatgggc ccagacgcct gaaaactgat aaagtgagat ggacggaaca tttaacccaa 60 tgtttaaccc ccgagatatc agtcatattg acaaatttaa cgggacgaat ttccagcaat 120 ggaaatatgg tatttccctc atacttgatc aatatgaaat caaagacact gctgaggtaa 180 tcaaatcgca taaagtactc tctaaaaacc atgcacattc caatgagtca cacatgattt 240 tcacagggct tcgatgtcag cccaaatcca actacaaatg atgccggtga tgttaccaac 300 accgaagaaa tatatcaatg gagtagaaaa gacctcatgg ccaagcagta catatactct 360 accaacgata gtgaaaggca acgagtactc atgaactgca agacatcaaa tgaaatgtgg 420 ctaaaactaa cgacccaata tcaacagaat gccagtgata ataggcacct tctacgaaat 480 gaattctatt catacacgta agcaggaaca accagaattt ttaaataata atgaattctc 540 actaaagttg ttaccacatt acagatatcg accaggtgac actataatgg cacacataac 600 agcagtagag acactggctg gacagctcat agatgtcggt gtaccagtgg gtgatgagga 660 cataatctca aagattgtat gcaatctacc actgagctac agaacatttc aaacgacatg 720 ggcaagcacc aacccagctg agcagactag agccttgctc acaaccagat tggtcaacga 780 agaaagggcg atacaacgtc accaacaagc acaagaatca tcgggagcaa ccaactccac 840 atcaaataac gaagccctct acagcaacag cagccaatca caaagaggga gactaaacag 900 aagagatagc ggtagattgc acaacagacc tcgccaaatg cagggagaca gacctcgatg 960 tacttactgt actctcccag gccataaagc tgaagaatgc agaaccaaaa aacgagatga 1020 gagaaactcg gagagacaag gccatggcga cagagctaat acagcacaga cagaaagaaa 1080 cgctacagaa acagaatggg cactaatctc tgcagcgtgt ctgaaggcat gtaaaaaagg 1140 aaaatggtat gcagactcgg gagctacaca gcacatgtcc gaagacagat cagcattcaa 1200 caactatcaa gcgattcctg aaggaacatg gacagttgat ggaattggaa atacgaaact 1260 ccaagttgtc ggaaaagggg acattcaggt tgaaagcaga attccaggaa tattccatac 1320 aggaacaata aagaacgtgc tccacgtacc acatctaggt gtaaacctta tatccatcgc 1380 cgctttgacc aatctaggag cagaagtggc tttcactgcc aacaaaataa acgtcacact 1440 caacaatcaa attgtcatga ctggagaaag gagtggaatg tcgctctacg aacttgaaat 1500 tcaaacagca ccaaccaaga tgtcccagtc agcagagaaa gcaacgacat ccaacgccaa 1560 tgcagtagcc ctgaaagcag catcattcaa cacgtggcac aaaaggctag cacacgttaa 1620 ctaccaaaca atccaaagga tgatgagaca tgaagctgta gaaggtatca ctctcaaaga 1680 aaatagtgaa atccccaaag agatatgtca aggatgcgct ttaggaaaaa tgcacaaact 1740 acctttctta ccaggaacaa gaaaagtaac cgaaattgga gagctcgtgc attccgacat 1800 ttgcggcccg atgcaaaccc ccacggtagg aggagccaaa tttttcgtgc tgttcaaaga 1860 cgaacacagc agctatcgtg tagcacatct cctaaagcac aagtcagaag taccggatag 1920 attccaagaa tatgtgcaca cactcaactt cgaaacagga aaaaaggtag ccacacttcg 1980 ctccgataat ggcggagaat tcaccagcaa cgccttcgaa tcctggttaa aagaaaacca 2040 catcagacac gaaaccagcg tacctcacaa cccacagcaa aacggagccg cagagagaga 2100 caacagaact atcgttgagt cagtaaggtg tctcattcat gaaaggaatt tacccctctg 2160 gctatgggga gaagcactca tgtacgccat atacacactc aatcgaacat tgccacaaaa 2220 caagaaaata acgccataca aatctggcac aacaaaaaac caaacgtggc tcacctcaga 2280 gaattcggag tgaaagcata cgcccacatc ccagacgccg aaagacaaaa actggacgcc 2340 aaaagtaaag aatacatcat ggtgggctac gcagagacac aaaaagccta tcgtctatgc 2400 gacaggacca caagaaaaat tcacatcagc cgagatgtta tattcgagga ggacaccacg 2460 gtatcttcca cccacctaga agaaacagaa ccattcgaac cactgactcc atcaacaaac 2520 gacaaggaga aagatcaaac tgaacaaccc aaaaccacaa aaagaagctc atcaaggcaa 2580 catcaaccca agagattgtt ccccatggaa cagggctcct acgacacaaa gaaacgtaaa 2640 gaaaaggaca gcgctactac tacggtatct ttgaaatcac tcagtcttaa ttcgtacgta 2700 gagcccaatc attatcaaga cgcactgtca tctcctgatg cagcaatgtg gaaagaagcg 2760 atcgaggatg aaattcaatc actcatagaa aatgaaactt gggaattagc tcagctaccg 2820 aaaggccgtt cggtaatcga cactctatgg acattcaaaa agaaaattgg catagatgga 2880 gagaaagatc gatacaaggc gagatactgt gcaaaaggat tcacgcagct gaaaggtatc 2940 gactaccaag aaactttctc acccgtagtg aaacacacta ctctaagaac tatgctatca 3000 atcgcaacag tcctcgacct agacatcata caactcgacg ttaagactgc ttttcttcac 3060 ggagaaatcg aagaagaaac ctacatcaaa caacctgaag gatacgtcat acccggcaaa 3120 gaaaacgaag tctgtcgtct taagaagtgc atttatggac tcaagcaatc ttcaagggta 3180 tggaaccaaa aatttcacaa attcttaaca agctttggcc taaaaagcac agcagcagat 3240 ccatgcctat acacctgcca agataaagac ggaatcattc ttgtggcgat atgggtagac 3300 gacggcttag tgtgtggaag caaaatggaa tcaatcaaca aagtaatcga gcacctaagt 3360 gtccacttca agatgacatc aaagaccgca aaacacttcg ttggtttaga aatcacaagg 3420 gatagacaac accgaaaaac ttttgtctcc attccaaact acatcgaaaa aactacaaaa 3480 attcaacatg gccagctgca gtcacaaggt tacacccgct gaccctcaca cacaactatc 3540 gattaaaatg tgtccaaccg aagaccagga gaaggaagaa attaagaaga cgccgtaccg 3600 agaggctgtt ggatgcctaa tttatgcagc catcactgtt cgcccagaca tatcatacgc 3660 agtaggacaa gtgtctagat tctgtgagaa cccaggcaga ccccactggt cagcagtcaa 3720 gcacatcttg tcatatctag cgggaaccaa aaatcacgga atctgttttt ccgacggaaa 3780 tggcgaaaga aacaccctct taggattttg cgactcagat tacgcagggc aggtagacac 3840 aagacgctca acgtcaggac tagtcttcat ggctaataac ggaccaatct catgggggag 3900 cacacgccag acctgcattg cccaatcaac aacagaggct gaatacgtat cactcaacga 3960 agccgcaaga gaagcagttt ggttgcgccg cttgatgaat ggagtcagct gtcaaccatt 4020 acaacccact aaactctact gcgacaacca gagtgccatc cggctggcag gcaacccgga 4080 actgcacaag aaaacgaaac acattgaagt aaaatatcat tatgtcagag agcagcaaca 4140 aaaggaagaa atcaagattg aatatgtccc aaccaaccaa caaactgcag acatcctgac 4200 aaagcctctt gctggcgtca ccttcagaga gatgagagaa cggctgggag taaaggaagc 4260 tccaatcact ttttaattat tcttttcatt accgattagt caatccaggt cacgccattg 4320 atgcggtcgt ctcgaagata ctcttttttt ttatcattca attatccagg tttcgcttga 4380 gagaaag 4387 // ID DNA8-49_AP repbase; DNA; INV; 864 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-49_AP. XX NM DNA8-49_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-864 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1979-1979 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 864 BP; 249 A; 127 C; 143 G; 345 T; 0 other; cagggctcgt aagttcatgc atctgcatgt tttttttgct acacaggaaa acatgctaac 60 tgagatacaa atccgtttca taacaatagt aataataata tgaagtttga tagtgcatgt 120 ttttgcatgt tttgggtgta aaggcatatt tggcatatag tcgatttttt acagtcgtta 180 gtgcatatta tttcataaaa atatttttta gacaaatatt tgaaaagttt cttgtttctt 240 atttactttc ccgtttacaa aattatagtg ttttgattta tttttattca gtgagcgtgt 300 gactacctag tgtcctagta cctacatact tatttagatt ttatgcgttt tacgaaatat 360 cgcggaaaca atgtaatcgc taccactacc atcctccacc gtataaagtg gcagcttctc 420 accaaccagc atttataatt ttgtacattg gtacagtata gttttcttac actcgccgat 480 gagtttactc gtatgagtcg tatgcgtttg tccgttcaat ttcctgtttt agtgttttca 540 tgtgaggtat aggtactgta cacattattc atattttatt taaacaatgc cgaaagaaaa 600 acaaacagtt gcgggccgtt tgcaaaattt tgtggaggaa tttgaatttt tttaaattta 660 tggattaatt cttattattt taaaattttc tcatgatttt tttacataat gcatattttg 720 agcgcataat ttaaaaatgt tagggcatat aaatgcatat tttcgaactt tttagtgcat 780 attttaatgc atattttgac aatttttagt gcatgaatgc atgcttattt atgcattttt 840 tagtgcatga agtaacgagc cctg 864 // ID Gypsy-52_AA-LTR repbase; DNA; INV; 1701 BP. XX AC AAGE02020210; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-52_AA_; KW Gypsy-52_AA-I; Gypsy-52_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1701 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020210; Positions 28615 30315. XX SQ Sequence 1701 BP; 465 A; 352 C; 408 G; 476 T; 0 other; tgtaacggta gccctttttg ttatttgttt atttgttaat tgtttattta tttattattt 60 tctattaaaa cttagtcaat taaattgaaa gacatttaaa ttaatcaaca aaagttacat 120 ggacatgatc aaaaccaaaa tgacacgaac aattcagaaa attgcataca actgcttcaa 180 gcgtagatag gttatccgat acgcagcgta gatgcgcaag accgagaatc actgttcgtg 240 ttttcgttac cgtgttagag aaccccgaag tagtcagctt ttgaatactt ggctttgtaa 300 cggaatgaca agaaaaggct attcaatcgc gacggagatt tcaataactc agatcacaca 360 aacataatta acggagaccg tcgggtttct gtttccggga gccataccgt gagatcctat 420 ctgaggaaga acagaaatat tctcgtccac tttgttccag atcgtggtag aagtaggatc 480 gcgtagtgta gtttctcctg tggtgtgatt tcgtgagttt atttaaaacg tgaagtgttc 540 tttaaccagt attaattgtc cccatatccc cgtttaggcc tgaaatagtc agttagattg 600 tgcggtaatt cgttgtgcgt gttgaactcg agttgaccag ttcaaaggta aatgacattt 660 attgtgtgtc ctatatgtgc taattgtgtc cgtctagcca agcgcaagct tcggctgtta 720 ttttctaaaa atagaactca cgtacatcgc acaaagcaac gagcgaaaac gtcccgctgc 780 accacatcac gtcgcgcgtc ctccacgttc tggtgccaaa gttcgccggc gccctgaggc 840 cggaaaggag ttgcgaacgt caccaacgca ctccaggcag aataccattc gcctaccgtc 900 atcccgttgg gaccagatcc ttggaaacgc ctcaacgccc tcgtgaatga gcgtgattcc 960 gtaggagcaa cgtaagccat cagagcgtca tcctgaggta gcctacgtca tcggcaacgc 1020 catcatgccg taagttcgga gagtgagctt cgacaaaccg tgagtaaaac gcaacgttgc 1080 atgcaagaag attagcacgg acacacaaaa agcgctagag agtagagatt agaaccgcac 1140 acgttagtta ggaaggggga tagtagaaga cgaagaaagg tcggccgaaa agtcgttaac 1200 gaaaatgtag attagttaaa tgttaataaa tgtctttttt gaagtacttg tttccctaaa 1260 agccatgtgt tagttagtga tagttgagaa agttgaagta cgttttcacg tttcgttttg 1320 tcttgagttt tcccgttttt tgttgaggaa tccgccttcc aaagcgttat ggagtcgaag 1380 tggttgggag atctctgccc gtgatgagaa aaagggtcag gtcgagttct aggattcggt 1440 tgtctgttgg gatttgttgg ttaggtttgt gtggtctcag tttttgaaat gttagattcc 1500 gtcgaggcca ggttgaagca agagtctgct cacttgttcc cgaacactgt taagtatttg 1560 aaagctgact cttcggtcag tacctagccg ggcaccgtaa acacgacgag tggtcctcct 1620 cggaggtggc gctcaagcca ctcgtttaaa atccagtccg gaagtcagta gcagactccg 1680 agccgccttt aggccgctac a 1701 // ID Harbinger-N1_BF repbase; DNA; INV; 465 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N1_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N1_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-465 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-465 RA Kapitonov V. and Jurka J.; RT "Harbinger-N1_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 814-814 (2008). XX DR [2] (Consensus) XX CC The genome contains several thousand copies of this CC non-autonomous transposon, which is characterized by TWA TSDs and CC 33-bp TIRs. XX SQ Sequence 465 BP; 161 A; 85 C; 88 G; 131 T; 0 other; ggccacgttg atttgattat atggatgaca tccgcgctcg caccaatttt cggccatttc 60 caaaaaaaaa aaaagttttc gaccacacaa taaattgtgc aaagcagctg taaaatgagc 120 acaaatattc cgtagattga aaaaaaaagt atcagaaaac agataactaa tgccatatat 180 agaatgccaa aatgaatcta aagtcaaaga ataagatgtg aaaggacgga aagaaaaaaa 240 aatctattgt tggttggggg aggtaccagt acagaaaatt caggtccaga ggatcatggt 300 ataccatact atgtgtgttt agtcctatgt tcaaccttcc tggccacttt gatttcatac 360 tgaaggcaac tttttttttt ggccaaactt tttttttatt cgctcgctcg catcagtttt 420 ggagttcccg gaggatgtca tccatataat caaatcaaca tggcc 465 // ID Copia-11_SI-I repbase; DNA; INV; 4012 BP. XX AC AEAQ01016250; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_SI_; KW Copia-11_SI-LTR; Copia-11_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4012 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01016250; Positions 5106 1095. XX CC Positions [1500-2006] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 900..3983 FT /product="Copia-11_SI-I_1p" FT /translation="MQIQDSWLADSRATHHMTHRREWMYDFESITKGQFGV FT TVGNDQIIYALGRGSIDIIATIDGQKVQHKLCNVLFVPDIGKNLLSVGAAA FT DNGVEARLSKTGIKLMSKNGIIAYGTRVAHSLYSMDFETVVNTSANVASVA FT ANEQVWHERYGHVNYKAIRRMVKNPAVDGMQMIGKSNTDEETDQFCEACIF FT GKHSRKSFNDSKTRADELGALIHFDICGPMSIESFGGARFMALFVDDYTGM FT VFVYSMKSKADIVNCMKDVIAEANAVGHKIRRVRSDNAKEFIGKDMKKVLR FT EHSILQETSTVYCPEQNGRAERQNRTIVEMARTMIAGVELPKGLWGEAVHT FT AAYIRNLVPLERLDGKTPMELWTGKRPNVSHLRIIGSKAYTLTNEKQSKFE FT KKSQELILVGYVPKQKAYRVWKRGTRKVVVSRDVIIVESKPKQQAAIVRTT FT EDNNDGSLDNNLQSENKRQSDLSAQTISKIETDEAESIDDKEVNTDNIANR FT TRSKATLNIVTAASAFIADVHIPTTVEEAKLGSHKEQWESAMKSEYDSLVQ FT NNTWTLVELPKDKRSIKNKWIFKLKTKIDGSIDRFKARLVIKGCSQKAGVD FT YMETFSPVARHDSIRALLSIAAVKDYEIYQLDVKTAFLYGDLNETIFMDQP FT ESFNDGSGRVCKLNKSLYGLKQAPRQWNKKFDKLMQRFGLKSSNADPCVYT FT STDLCVALYVDDGLVIGHTKSKIDKLLEVIQSNFEITSSIATCYLGIEFML FT DRKLRTIKLSQTAYTKSVLKKFGMLECNGVTVPADPDVQLKRNLNDDGKTG FT EIADVPYRQLIGSLMYLSVGTRPDISYAVSILSKFLEAPSREHWTAAKRIL FT KYLKSTPNLGIVYNGMTSKPNQLIAYSDADYASCLDTRKSTSGVILMLNDG FT PIIWSSRKQSIVATSTTDAEYVAAHDATKEVVWSRQLLKDIGVEQFEPTTL FT FCDNAAAQKLIENPIFHKRSKHVDIKFHYTRELVKQNQIKIQHVATQSQLA FT DILTKPLARGKFEINRNQLNLM" XX SQ Sequence 4012 BP; 1326 A; 774 C; 933 G; 979 T; 0 other; ggttatgggc ccaggctcac gtgcaagtgt ataaaatctt acagtctctc gtgtctttaa 60 taaaaagagg atagttttct ccgagtcaca agatgaccac ggaagatttg aggttaactc 120 cgttgaccaa agatagttat cagcgttgga agtacgaggt acgttcctgt ctgaaaagca 180 ttggtgcaac tggtcatgtt gacggtatgg taaagtaccc aacggggtca ggcagcgaga 240 aagacgccga aacatggatt aagatggatg gtaaggcaca gcgggtgcta acctgtgctc 300 tttccgacga cgaccacgca gcaattcgag actgtgatac ttcgcgggat gtctggttaa 360 aaattcagtc gatttacgag acaaaatcga acgaaaataa gtatctgcta aaccaagagt 420 ttcatcgaat gcgtttcaat gaagatcaga gcgttacaaa ctattgtgct caattagcag 480 taatttgtca gaaattaaaa gcagccggcg aggaactctc cgattcaggt ttgattgcta 540 aattagttaa tgatttgcct tcaagatttg acaatttcag gtcaaattac tacatacaag 600 cagcctccgg aacgacgctt acgtttgaca aattaaagga acagctcatg ctgattgagg 660 ctaatgtatc tagtgctgct gctaatacag atactggtga tgctttaata acgcaagcaa 720 aagaacggaa ctcaaaggcg aagcgagaac aacgggcatg ttttcattgc aagaagattg 780 gacacgtcaa acgtgattgt cgaaaataga aatctgagca agctaaagaa caaggaaaat 840 caacagcctc agtagcttca ggtcaagcat ttgtaacagc aagtggatac gctcgcaaca 900 tgcaaataca ggacagctgg cttgccgact caagagcaac acatcatatg actcatcgac 960 gcgaatggat gtacgatttt gaatcgatta caaaaggaca gtttggagta actgttggaa 1020 atgatcaaat tatttacgca ctcggacgag gaagtatcga catcatcgca acgattgatg 1080 gtcaaaaggt acaacataaa ttatgtaacg ttttgtttgt acccgatata ggcaaaaatt 1140 tactgagcgt gggcgcagca gctgacaatg gtgttgaggc tcgactaagc aaaactggta 1200 ttaagttgat gtctaaaaat ggaatcatag cgtatggaac gcgtgtagca catagtcttt 1260 actcaatgga ttttgagact gttgtcaata cgtcagcaaa tgtagcttcg gtagcggcca 1320 acgaacaggt atggcacgaa aggtatggcc atgttaatta taaggcaatt cgacgaatgg 1380 ttaaaaaccc agcagtcgat ggcatgcaga tgattggtaa atcaaacaca gacgaggaaa 1440 ctgatcagtt ttgtgaagcg tgcatttttg gcaagcatag tcgaaaatcg ttcaatgact 1500 caaaaactcg agccgacgaa cttggtgcac tcatacattt tgatatctgt ggcccgatgt 1560 caattgaatc gttcggtgga gctcgattca tggcactttt tgttgacgat tacacaggaa 1620 tggtatttgt gtattcaatg aagtcaaagg ctgacatcgt taactgtatg aaggatgtca 1680 tcgctgaagc taatgcagtt ggacacaaga ttagacgagt acgttcggac aacgcaaaag 1740 aattcattgg aaaggacatg aagaaggtac ttcgagagca ttccatcctc caggagacat 1800 caacggtgta ttgtccagag cagaacggtc gtgcagaacg tcagaacaga actatagtcg 1860 aaatggcacg aacaatgata gctggagttg aattaccaaa aggactatgg ggtgaagcag 1920 tacacacagc cgcttatatt cgaaatcttg ttccactgga gagattagat ggcaaaactc 1980 cgatggaatt gtggaccggt aaaaggccaa acgtctcaca tctgcgcatc ataggtagta 2040 aggcatacac gctgactaac gagaagcaat cgaagttcga gaagaaaagt caagaattga 2100 tattagtcgg atacgtgcca aagcaaaagg cctaccgtgt ttggaaacgc ggaactcgga 2160 aggttgttgt aagtcgcgat gtaatcattg ttgagtctaa acctaagcaa caagctgcta 2220 ttgtgcgaac gacagaagac aataatgatg gctcattgga caataattta cagagcgaga 2280 ataaaagaca gtccgatttg tcagcacaaa caattagcaa aattgaaact gacgaagctg 2340 agtctattga tgataaggaa gttaataccg acaacatcgc gaaccgcaca cgcagcaagg 2400 ctaccctgaa cattgttacc gcagcaagtg cctttatagc agatgttcat attccaacta 2460 cagttgaaga agcaaagttg ggaagtcaca aggaacagtg ggagtcagca atgaagagcg 2520 aatatgattc tcttgttcaa aataacactt ggacattggt cgagttacca aaggataaaa 2580 ggtcaatcaa gaataagtgg atcttcaaac tgaagacaaa aattgacgga tcgattgatc 2640 gttttaaagc acgcctcgta ataaaaggat gttcgcagaa agccggagtt gattatatgg 2700 agaccttctc accagtcgca agacacgatt ccattcgtgc attgctgtca attgcagcgg 2760 tcaaggacta tgaaatctat caactggatg tcaaaacggc gttcctgtat ggggatttaa 2820 acgaaactat ttttatggat caaccggaaa gttttaacga cggctcaggt cgagtatgca 2880 agctgaataa aagcctgtac ggtctcaaac aggcacctag acaatggaat aaaaaatttg 2940 acaagctcat gcaacgattt ggtctaaaat cttcaaacgc agatccatgt gtgtatacaa 3000 gcactgattt atgtgtagca ctgtacgttg atgatggcct agtgattggt catacaaagt 3060 caaaaattga taagctgtta gaagtaatac agagcaactt tgaaattaca agctcaattg 3120 caacatgcta tttaggcata gaattcatgc ttgatcgtaa gctgagaaca attaaattgt 3180 cacagacagc ctacacgaaa tctgtactca agaagttcgg tatgttggag tgcaatggcg 3240 taactgtgcc tgctgatcca gacgttcaat taaagcgcaa cttgaacgat gacggaaaaa 3300 ctggagaaat cgcagacgta ccctaccggc agttgattgg ctcacttatg tacctgtcag 3360 tcggaacacg acctgacatt tcgtacgctg tcagtatttt gagcaagttt cttgaagcac 3420 catcgaggga gcactggaca gcagcgaaga gaatcttaaa gtatttgaag tcaacgccca 3480 acttggggat agtctacaat ggaatgactt ctaagcccaa tcaattgatt gcttattcag 3540 atgccgacta cgcatcatgc ttggatacac gcaagagtac cagcggagtc atacttatgc 3600 tgaacgacgg accaataata tggtcatcac gcaagcagag catcgtagca acatcgacaa 3660 ctgacgcaga atatgtggcc gcccatgatg caactaagga agttgtctgg tcacgacaat 3720 tactcaagga tattggagtc gaacaatttg aaccgactac tctcttttgt gacaacgcag 3780 cagctcagaa gctaatcgaa aatcctattt tccacaagag gtcaaagcac gttgacataa 3840 aattccacta tactagagaa ttggtcaaac aaaatcaaat caagattcaa cacgtggcaa 3900 cacagtcgca actggccgac atcttaacga agcccttagc acggggaaaa tttgaaatta 3960 atcgaaatca attaaatttg atgtaattgt tgactgttat ctcgagtggg ag 4012 // ID Transib2_DP repbase; DNA; INV; 3032 BP. XX AC . XX DT 21-MAR-2005 (Rel. 10.03, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Transib2_DP is a family of autonomous Transib transposons - a DE consensus sequence. XX KW Transib; DNA transposon; Transposable Element; transposase; KW Transib2_DP. XX NM Transib2_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3032 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC Transib2_DP is a family of autonomous Transib transposons. The CC consensus sequence encodes the 704-aa Transib2_DPp transposase CC (pos. 270-288, 543-2638). Transib2_DP elements are characterized CC by CAWTG target site duplications and 38-bp terminal inverted CC repeats. XX FH Key Location/Qualifiers FT CDS join(270..288,543..2635) FT /product="Transib2_DPp" FT /translation="MENEDKGKFEFLHSDLIDIWRNNNRQKKSVSHWIYSK FT INNNHLENEKKREIDKKISNFESFIKRNLLKCNKMMERFMTTHAKWMASKI FT TIIVDENCKTPKKMGRPTLHYTNAGTRLKRKLASDLANDNNNNTNLLMHAA FT AVSAKKERKTDVAFVVTKTIRTPEYPTESKKHLQLQKPIPLSPDEALAYLL FT ENSLNKQQYTNTRLLNKRHNCDIYPSYNKVMEAKLQCRPECIEVMETTARV FT PLQNLLDHTAHRIIKLQSDVFKQFPDASENKLICSYGFDGSTGHSIYKQKF FT LTEIPGSQLSDHSLFVTSVIPIQITDTYHRNIWTNRTPQSIRFCRPLKIDL FT VKETTAHIIIEKKDLDFQINTLKPFVYKLVENVDIVVTYEMHMTLIDGKVL FT NVLTGTKSTQCCPICGVTPTKVLEITDLSSETFTPKLGALKFGVSPLHAWI FT RFLEFVLNISYRIEIQKWHIKGDNKIKTSIRTKYIQEKIWEDMGLRISMPK FT QNGNGNSNDGNTARRAFANTKLLSSITNFSVALLNNFHIILIAISCNYYIN FT SEKFRSFCESTFCLYMQTYPWYPMSPTIHKVLVHGFQINNSTLVPLGCLGE FT NASEARNKMYKKDRLSHARKNSRVNTMTDIFYRAIDSSDPLLSSVCLKERE FT RKNKKKPLSKEVLSLLELPSTESKDSCEYENDSQSDSDSDYNIYDIELEEE FT DEVI" XX SQ Sequence 3032 BP; 1127 A; 486 C; 532 G; 887 T; 0 other; cactatggtc cagaattcgt tttttttggc ataaagtcga aatttttatg tgaatggatt 60 acgttttttc ttgcaaattc tcaaagtaaa ttcatcatac atattaaaaa tgaaaaaaaa 120 aaaattattt agcttccctg aagattttgg cgagcttttg aaagtgaaca atcgatcagg 180 tgattgtggg aaaaaaattg gcgccaaatt acttgtttac atttttacgt ggcgaccgag 240 taaatttgct caaataaatt cggctaaaaa tggaaaatga agataaaggt atgtagtatc 300 taatgtattt agttatttta tgtgtagtcc caagatttaa gtgttgtgca atgtccaatt 360 tatgtgtatt tttgtagaaa atttaactgc gtcccctgtg tttcgtagaa tataacaaat 420 gaactgagtt ctgctgagaa aaactaacta tgtgagtttg tacctacaaa tgttgtaatt 480 gactggaaag taacaacttt ggcccgtttt ggcccctttt ctttaaattt gttaatattc 540 aggaaagttc gagtttttgc atagcgatct catcgatatt tggcgcaaca acaatcgaca 600 aaaaaaatct gtctcacatt ggatttacag taaaatcaat aacaaccatt tggaaaatga 660 aaaaaaaaga gaaatagata aaaaaatcag taattttgaa tccttcatta aaagaaacct 720 tctgaagtgc aacaaaatga tggaacggtt tatgaccacc catgctaaat ggatggcttc 780 taaaataaca attattgttg atgaaaattg taagacacca aagaaaatgg gacgtccaac 840 attacattat acaaatgcag ggacaagatt aaaaaggaaa ctggcgtctg atcttgcaaa 900 tgataataat aataacacga atcttcttat gcatgctgcg gcagtttctg ctaagaaaga 960 acgcaagacc gacgtggctt ttgtcgtaac aaaaacaatt agaacacctg aatacccgac 1020 tgaatcgaaa aagcatcttc aattgcaaaa gcccattcca ctttcaccag atgaagcttt 1080 agcttacctc ctggaaaatt cactcaacaa acagcagtat acaaacacta ggcttttaaa 1140 taaaaggcat aactgcgaca tttatcctag ctataataaa gtaatggaag ctaaattaca 1200 gtgccgacca gagtgcatag aagtaatgga aactactgct cgagtgccat tacaaaatct 1260 tttggatcat acagcacatc gaataattaa attgcagtcc gatgttttca agcagttccc 1320 ggatgcatct gaaaacaaat taatctgcag ctacggattt gatggctcaa cgggtcatag 1380 tatatacaag cagaagtttt taactgaaat acctggcagt cagctttccg atcattcttt 1440 atttgtaact tctgttatac ccatacaaat tacagacacc tatcatcgga acatctggac 1500 aaatagaaca ccgcagtcta ttagattttg cagaccgcta aaaattgatc tagtaaaaga 1560 aactactgcg cacataatta tagaaaaaaa agatttagat tttcaaatta acactcttaa 1620 accctttgtt tataaattag ttgagaatgt tgacatagta gttacctatg aaatgcatat 1680 gacacttata gatggaaaag tgctgaacgt attaactgga actaaatcta ctcaatgttg 1740 tccaatatgt ggggtcaccc cgacaaaagt attggaaatt acagatctta gttcagaaac 1800 ctttacccca aagcttggag ctttgaaatt tggagtaagt ccattacatg cttggatccg 1860 gtttttagaa tttgtcttga acatatcata cagaatagag atacaaaaat ggcatataaa 1920 aggtgataat aagataaaaa cgagcattcg gacaaaatat attcaggaaa agatttggga 1980 agatatgggt ctacgtataa gcatgccaaa gcagaatggc aacggaaata gcaatgatgg 2040 gaatacggca agaagagctt ttgcgaacac taagctacta tcatccataa ccaactttag 2100 cgtagccctt ttaaataatt ttcacataat cttaatcgca atttcgtgta actattatat 2160 aaattcggaa aaatttcgtt ctttttgtga gagtactttt tgcttatata tgcaaacata 2220 tccgtggtac ccaatgtccc ctacgattca taaagtatta gtccacggtt ttcaaataaa 2280 caattcgacg ttggttccac ttggatgttt aggagaaaat gcttcagagg cacggaataa 2340 aatgtacaaa aaagacagat tatcacatgc aagaaaaaat agtcgggtaa atactatgac 2400 tgacattttt tatagagcga tagattcctc ggatccatta ctatcgagtg tctgtttaaa 2460 ggagagggaa aggaaaaaca agaaaaagcc tctttccaaa gaagtattaa gccttctgga 2520 gttgccgtct accgaaagca aagattcatg tgaatacgaa aatgactcgc aatctgattc 2580 tgattctgat tataatatat acgacataga actagaagaa gaagatgagg tcatttaaga 2640 aaaaaattag tgtgtaataa atttccatag tttatatata aatattgata atttgaaata 2700 agatattata tatgtataat gatttaacaa caaaatgtaa taaatttgaa gtatgtaatt 2760 taaacttccg actttatgca aaaaaaaaaa acaagaacaa gtacaagtat acaagtatat 2820 gtacaaaatt taaaagctct agctttaatt tcgagctctg tgttaaacaa atctgaggtg 2880 ctctggggaa gggggtcgat ggcgtggtcg aatctaaaac aaaaaaaaaa ccgaaatttt 2940 ataaaaatac tcaaaataat tgtgcaaagt ttcaaggctc tagctttaaa actagacttt 3000 atgccaaaaa aaacgacttc tggaccatag tg 3032 // ID Copia18-NVi_I repbase; DNA; INV; 4231 BP. XX AC AAZX01010628; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia18-NV; KW Copia18-NVi_LTR; internal portion; Copia18-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4231 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1152-1152 (2007). XX DR Genome; AAZX01010628; Positions 9186 4956. XX CC Positions [1603-2106] - Integrase core CC 'GCCCG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..1598 FT /product="Copia18-NV_I_2p" FT /translation="MAGELSTRHVAKFDGKNFAAWKFQINAVLIAHNLEGI FT VTGTRTKPTDATTNETKQWVKDNAKAMYVLSSALEPGQLNCLLSCVTAKEM FT WDKLSVIHEQKTATHKLLMMQRFHEYKMDLSDSVAQHVAKVQNLAAQLLDV FT GENLPDIVIMSKILTSLPMKYRNLRTAWSSVATEKQTIEHLLERLIEEENL FT LRIDEEEEAVALAAFSKKRESVGTGTKESGGASVKNSKQGGDRSNKKTYKC FT FSCHKKGHFARNCPEKKEKQQNSNQTSNASANVAFVAERRVSSKNELDEAK FT VSSWKPSSEQERKLLETNTEEVWFIDSGASTHVTHRKEWFIEYRSRRDGSS FT IVLGDDGECGVAGEGTIAVNRLIDDKWIEARIENVLHVPGLKRNLLSVGRC FT TSLGYQFEFKNRYVELKKHKEIIAVGVIQSNAIYRMFLQVIKPDRKEINIT FT STSMKVWHERLAHLNVRAMRDLTSGDSVQGVRIVEKEGFFCEPCQLGKSHK FT LPFKKKN" FT CDS 1651..4206 FT /product="Copia18-NV_I_1p" FT /translation="MPTESVNGARYYVLFKDDSSGYRYIYFLKNKSDVTEK FT FIEFESTVRNLRGNSMKFLRTDNGREYVNDRLKQYLSQHGITHVNTAPYTP FT EQNGQAERENRTIIEAARTMLKARDLPNYLWAEAVNTAVYLLNRTTVKATH FT SNKTAYEVWTNKKPNLEHVRTFGCAAYMNVPKQLTNKLSDKAVALIMVGYQ FT DNSSNYRLFNPITKKVHVSRHVIFNEDGRVKETTNQESDEDWFLPGDDVKV FT PDEVHGEHENARDQPTPEPDEPVPEPAGGHDNIGDDQADVPGEGEAPIHSR FT LRDRSKIKPPSKYLVNIAVFRPPETYDEALERPDAEKWRKTIEYELRQHEK FT NNTWELVPRHHGMRTIDSKWVLKSTPGLGNQDVFYKARLCARGFMQKEGLD FT YTETYAPVVRYDSLRMLLAHVALEDLETVTFDIKSAFLYGDLEEDIYMEVP FT KGVRVNTRGMNARNDECRSSVDANECKNVLVCKLRRSLYGLKQAPRCWNTK FT FKKFLSSFNFYESQADKCIFIGKCDDHKVYLALFVDDGLVICKSKVVLNNI FT LSELRKQFEITVGDSCYFVGLEIYRNREEKSVIICQQAYIQHVIDRFNMND FT ANPVCTPAETSLYLQKADKNVANKNNVPYREAIGSLMYLAMTTRPDIAYIV FT CYLSRFQNGYDSRHWQALKRVLAYLKGTKYLGLEYKLNKEPCNLIGYSDSD FT YAGCLETRRSTSGYVFFMAGAAVSWTSRRQELVTLSTTESEYVAAATAARE FT ALWLKRFLSDFGYTCVTGILLHVDNKSAICLTKDSKNHRRTKHIDTRYHFL FT KERCETGDLSVTYVASREQRADVFTKALPKNLFIEMRENLGLVDNSYLIA" XX SQ Sequence 4231 BP; 1369 A; 860 C; 1024 G; 978 T; 0 other; ggttatgggc ccagatctac gcgtgagtga gaagtgatta ttttctttgt gtcgcgagtg 60 aacgaacatt taacgtgtga taaaggaaga tggccggaga actgtctaca cgacacgtgg 120 cgaaattcga cgggaagaat tttgcggcct ggaaattcca gatcaacgcc gtgttaatcg 180 ctcacaactt agagggcatc gtaacgggta cgcgaacaaa accgacagat gcgacaacaa 240 acgaaacaaa acagtgggtg aaagacaatg ccaaggccat gtacgtgtta tcatctgcac 300 tggagcctgg acagctgaac tgcttgctgt cgtgtgttac agccaaagaa atgtgggata 360 agttgagtgt catccacgag cagaaaacag ccactcacaa gctgttaatg atgcagcgtt 420 ttcacgaata caaaatggat ttatccgata gtgtcgcaca gcatgtggca aaagttcaga 480 atttggcggc gcaactgctg gacgtaggtg agaatcttcc ggacattgta ataatgtcga 540 aaatcctcac gagtctgccg atgaaatatc gcaacttacg cacagcttgg agtagtgtcg 600 caaccgaaaa acaaaccatc gagcatctac tggaacggct gattgaggag gagaatcttc 660 tacgcatcga cgaagaagag gaggctgtag ctctagcagc tttttcaaag aaaagagagt 720 cagtaggcac aggcactaag gagtcaggag gtgcaagcgt aaaaaattcg aagcagggtg 780 gtgatagaag taacaagaaa acctacaaat gcttctcgtg tcacaaaaag ggacacttcg 840 ctcgaaactg tcccgagaag aaagaaaaac aacagaattc gaatcagacg tctaatgctt 900 ccgcgaatgt agcctttgtg gccgagagac gcgtcagttc taagaacgag ctcgacgaag 960 caaaagtctc ttcttggaaa cctagtagcg agcaagagcg aaaactgttg gagactaaca 1020 cagaagaagt ctggtttatc gacagcggcg catcaactca cgtgacccat cgaaaagaat 1080 ggttcatcga gtaccgatca aggagagatg gcagcagcat agtcctaggt gacgatggtg 1140 agtgcggtgt agccggcgaa ggtacgatag cggtgaacag actaattgac gacaaatgga 1200 tcgaagcgcg aatcgaaaac gtacttcacg taccaggctt gaaaaggaat ttgctctcgg 1260 ttggtcggtg cacatcgctc ggttaccagt ttgaattcaa gaatagatat gttgagttaa 1320 aaaaacacaa agagattatc gcggtcggag tgatacagtc aaatgccatc tatcgtatgt 1380 tccttcaagt gatcaaaccc gacaggaaag agataaacat aacctctacg agtatgaaag 1440 tctggcatga gcgcctggca catcttaatg tgcgagcgat gagagatctg acgtccggcg 1500 actcagttca aggtgtacgc atagtagaaa aagaaggttt cttttgtgaa ccttgtcagc 1560 tcgggaaatc gcacaaattg cccttcaaga aaaaaaattg aacgagcgag cctgccaggt 1620 gaatacttcc atagtgacgt atgcggaccc atgccaacag agtctgtgaa tggcgcgaga 1680 tactacgtct tattcaagga cgacagtagt ggttatcggt atatttactt cttgaaaaat 1740 aaatcagacg tcaccgaaaa gttcattgaa ttcgaaagta ctgttagaaa tcttcgcggt 1800 aacagtatga agtttttacg caccgataat ggtcgtgaat acgtgaatga taggctgaag 1860 cagtacctga gccaacacgg aataacacat gtgaacacag caccatacac gccggagcag 1920 aacggccaag cggagcgcga aaaccgaacg atcatagagg cagctcgcac gatgttgaaa 1980 gccagagact tgccgaacta cttatgggca gaggcggtca acaccgcagt ttatctatta 2040 aacagaacaa cggtcaaggc tacccatagt aataaaactg catatgaagt ttggacaaac 2100 aaaaagccca acttggagca cgtacgaacg tttggctgtg ctgcttacat gaacgtaccg 2160 aagcagctga ccaacaaact gagtgacaag gcggtagctc tgatcatggt gggatatcaa 2220 gataactcca gcaactatcg tctgttcaat ccaatcacca agaaagtgca tgtatcgaga 2280 catgtaatct tcaatgaaga tggtcgcgtt aaggagacaa caaatcaaga gtcggacgaa 2340 gactggtttc tcccaggcga tgacgtaaag gtaccagacg aagttcacgg agaacacgag 2400 aatgctcgag atcaacctac ccctgaaccc gacgaaccag tccctgaacc agctggaggt 2460 cacgacaata tcggagatga tcaggctgat gtaccaggag agggcgaagc tccaattcac 2520 tcacgtctac gagacaggtc taaaattaag cctcctagta agtatttggt aaatatcgct 2580 gttttcaggc ctcctgagac gtacgacgaa gcactggaaa gaccagatgc cgaaaaatgg 2640 cggaagacca ttgaatacga actgagacaa cacgagaaaa acaacacctg ggagcttgtt 2700 cctagacatc acggaatgag aaccattgac tcaaaatggg tcttaaaatc aactcctgga 2760 ctcggaaatc aagatgtttt ctacaaagca cgtctatgcg caagaggatt tatgcaaaaa 2820 gaaggactcg attacactga gacgtatgca ccagtggtgc gatatgattc tctgcgcatg 2880 ctgctggctc atgtcgccct tgaagatttg gaaactgtaa cgttcgacat caagtcagcg 2940 ttcttatacg gcgatttgga ggaagacatc tatatggagg tgccaaaagg ggtgcgtgtg 3000 aatactcgtg gaatgaatgc aagaaatgac gaatgccgct caagtgttga tgcgaatgaa 3060 tgtaaaaacg ttttagtctg taaactgcgt aggtctctgt atggtcttaa acaggcaccg 3120 cgttgctgga acactaaatt taagaaattt ctgagttcgt tcaacttcta tgaaagtcaa 3180 gctgacaaat gtatattcat tggcaaatgt gacgatcaca aagtatatct cgctctcttt 3240 gtagacgacg ggctcgttat ttgtaaatca aaagtagtct taaacaatat tttatctgaa 3300 ctgcgaaaac aattcgaaat aaccgtaggc gactcatgtt attttgtagg attagaaatt 3360 tacaggaaca gggaagagaa aagcgtaatc atatgtcagc aagcctatat acagcatgta 3420 atcgatagat tcaacatgaa cgatgcaaat cctgtatgta cacctgcaga gaccagttta 3480 tacttgcaaa aggcagataa aaatgtcgcg aacaaaaata acgtaccata tcgcgaggcc 3540 ataggttcgt tgatgtacct ggcaatgact actagacccg atattgctta tatcgtttgt 3600 tacctaagta gatttcagaa tggttacgat agtcgtcatt ggcaggcatt gaagcgtgtc 3660 ctggcctatc taaaaggtac caagtaccta ggacttgagt ataaattaaa caaagaacca 3720 tgtaatctga tagggtactc agactccgat tatgctgggt gtttggaaac tagaaggtcc 3780 acttcgggat atgtattttt catggctgga gccgcagtgt catggacttc cagacgccaa 3840 gaattagtaa ccctcagcac gaccgagtct gagtatgttg cagctgccac agcagccaga 3900 gaagcactgt ggttaaaaag atttctgtcc gatttcgggt atacatgtgt cactggcatc 3960 ttgttacatg tcgacaacaa aagtgctatt tgtttaacga aagattctaa aaaccatcga 4020 agaaccaaac acatagatac ccgatatcat tttttaaaag aaagatgcga aactggagat 4080 ttgtctgtaa cttatgtagc atctagagaa cagagagcag acgtgttcac aaaagctcta 4140 cctaaaaatt tgtttattga aatgcgtgaa aacttaggtc ttgtagacaa ctcatatcta 4200 attgcttgat cagtgtgata aacggcggaa g 4231 // ID DNA8-108_AP repbase; DNA; INV; 705 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-108_AP. XX NM DNA8-108_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-705 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2046-2046 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. It contains a piggyBac-like insertions (pos. 123-679) CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 705 BP; 263 A; 105 C; 94 G; 242 T; 1 other; cataggcgtg cgcagacttt tttcgcaggg tatgccggta tttttaaaca tacctagctc 60 aatacaaaaa ataactcccc aaaaactctt aataggtaca aataacttgc catataatgc 120 gaaaagaaat taacatgcat tttatttagc ttatttattg gttttagata ttttaatttg 180 aaaagtttgt actgtgattt cattccaata aagtaacagt ataaaaacct acagacattt 240 gatcatatta taatatataa atataataat gacaaaaaat gataatctaa ttaatcgaaa 300 ttattttatt tttgatagta tgaaaataag caataccgca ggggcctgaa tgttctggtg 360 acaaaacccg caattctcac gaaaatngtg gattttgatt tcaactatat tattatgagt 420 tagcaatcta ttttattatt tataggtacc tgtggtgatt atgaaaatat aattatgaaa 480 tctaattaat atttgatttc gtctcaactc ccaatatttt attataagaa aataatataa 540 aaaaataatc aaaatattta aattcttgaa ctaagattct atgaactatg ttagttcatg 600 attctaattt ataatcaaaa catttacaac taaaaactat atggttatga taactctgca 660 ggggatgcca tggcataccc ggcatccccc ctgcgcacgc ctatg 705 // ID Gypsy-83_AA-LTR repbase; DNA; INV; 195 BP. XX AC supercont1.247; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-83_AA_; KW Gypsy-83_AA-I; Gypsy-83_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.247; Positions 114902 115096. XX SQ Sequence 195 BP; 51 A; 29 C; 45 G; 70 T; 0 other; tgtggtgact gagtgcgggc ctgacattca tgctccttta ttgtttgtgt ctacggacgg 60 acctgtagcg aactgtcatt attattgatg tggaattgga gattgttggt cggtacttta 120 taaattaaat aaactgagtc attgaagttg taataaacgt gtatttaata ttcgttataa 180 tcctgagcta ccaca 195 // ID CR1-5_CQ repbase; DNA; INV; 4762 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4762 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 6-6 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 307..1245 FT /product="CR1-5_CQ_1p" FT /translation="MASACDYCAKQVKDEEPFVKCMGFCEGVVHMRCTKTG FT SLKMNTSFLKIINECFNMFWMCDECSKLMKVARFRTVMESVGDAVDHMQDE FT QDRKNSELRDLITENGKQLLLLSKRVSSLTPTANKLGTPRNKPTPKRTRDS FT NPVPPKPLVFGTNEVASASVTCIPPEESLFRLYVTRFANHTTATQVEGLVK FT NAIGQNELVQVTALVKKGVDPLTLPFISFKVAVNLGHKAAILNPAIWPGNC FT GFREFEDLQRKSSSPSNGDGQRAKKPMLSLATIPITLTPSGSASPNPDNRD FT VEPFLTPRSGNSDGAPAPMDH" FT CDS 1188..4682 FT /product="CR1-5_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TVFDTPLGELRRGAGPNGSLAPVFGCRLTSGCTTQST FT EEAPTLSGTVAQSAASVHLSHPDPAAGSRVGVFQSSQSGKYAHRLSPSPPD FT AYPVSSATPSSCLSRHNPDGDLRAAAPPQPTACWTSGCTTQSTEEAPTLSG FT TVAQSAASVHLSHPDPAAGSRVGVFQSSQSGKYAHRLSPSPPDAYPVSSAT FT PLRRPGRGQKSRLPCNESLNIYYQNVGGMNTRALEYCLACSAVSYDVIAVT FT ETWLKASTLSTQVFGNGYEVFRGDREQFVNSQKRDGGGVAIAVRHGLRARI FT VVSDEWKCAEQVWVAIDLDDHTVFICAAYIPPDQTSSANVFEINAASVLAV FT SAMARPYDELIVLGDFNLPGLIWRPSGDGFLYADSALSTQAAHVTEFLDSY FT SSSLLQQINSFPNENNVVLDLCFVSCPDTAPPLVIAPAPLVKDVRHHPPLL FT LSLPVRLATEHTPVVCEVRYDYRNLDVPSMLELLQSIDWENTLDKDNLDEA FT VQTFTNILSYAIDRHVPKKPVTNARVPWQTSELRRVKAAKRSALKKFSKHR FT TLPLHNYYVRLNSCYKRMSRQCHADHEVRTQKKLKSNPKGFWKYINEQRKE FT IGLPSSMFLGDETASNTEEICQLFAEKFSSVFTLEDLQPEQVTAAASNVPT FT NGLGLNSIRIDRSKILEAVAKLKISNTTGPDGVPSIVLKKCIDGLLEPLYL FT LFNLSLSIGSFPKLWKHAFMFPVHKKGDKRNVDNYRGISALCAVSKLFELA FT VMAPVFFFCKNLISEDQHGFMPSRSTTTNLLTFTTFVTDSFTARSQTDAVY FT TDLSAAFDKLNHAIAIAKLERMGIGGTLLSWFQSYLADRKINVKIGDCLSA FT ALMAFSGIAQGSHLGPLVFLLYYNDCNHSLSGPRLSYADDMKIYCQVNNAA FT DAQALQQDLNIFADWCETNRMVVNPTKCSVITFSKKRNPIKFDYQICGTLI FT PRENCVKDLGVLLDTELTYKQHISYMISKASRQLGFIFRAAKGFTDVYCLK FT ALYCALVRSTLEYCSAVWSPYYENSAVRIESVQRRFIRFALRRLPWRDPFH FT LPSYESRCKLINLDTLAVRRNVSRALVISDALTSRLQCPAILRGLDIMAPL FT RTLRNTPFLRVPVRRTNYAMHSALTGLHRAFNFVASLFDFNLSRDVIKNKF FT LLFFRR" XX SQ Sequence 4762 BP; 1137 A; 1339 C; 1166 G; 1120 T; 0 other; cacttttttc tccgtgtcta cgtctacggc gattgtttac gtttgcgata gcgctgatat 60 tttttgcgat tttacgtgtg ttttgcgcaa gtttccgaag tttattcgcg ttgtgcggtc 120 cgtttacgct aaccaccaat cccggaaagt gtcgaacggt cggttcccgt ggcctggttg 180 ctggttgaaa cctgcgtgtt tttactccgt tgagtgaacc tccaacacaa catcgtcgtt 240 aagggcatct ctaccggcgg tgtttattgg ttgcggctgc acaaattatt tgccgaaatt 300 ttggacatgg cttctgcttg cgattactgc gccaaacagg tcaaggatga ggagccgttc 360 gttaagtgca tgggcttctg cgaaggggtc gtacatatgc gctgtaccaa gacgggcagc 420 ttgaagatga acacctcgtt cctgaagatc atcaacgaat gcttcaatat gttttggatg 480 tgcgacgagt gttcgaagct gatgaaggtg gctcgtttcc gcaccgtaat ggagtccgtc 540 ggcgatgcgg tcgaccacat gcaggatgaa caagaccgca aaaactccga gctgcgggac 600 ctgatcactg agaatgggaa gcagctgctg ctgctctcca agcgcgtcag cagcctaaca 660 ccaactgcca ataaattagg tacgccaagg aataagccta ctccgaaacg tacccgtgac 720 tccaatcctg ttccgccgaa gcccttggtg tttgggacga acgaagtagc gagtgcaagt 780 gtcacttgca tccccccaga agaaagtctg ttccggctgt acgtgacacg cttcgccaac 840 cacacgactg ctacacaagt ggaaggactc gtgaaaaatg ccatcggtca gaacgaactg 900 gttcaagtga ccgcattggt caaaaagggg gtcgatccgc taactctacc gttcatctcg 960 ttcaaagtcg cggtgaatct tggacacaag gcggcgattc taaacccagc aatctggcct 1020 ggcaactgcg gctttagaga atttgaggac ctgcaacgaa aatcttcttc gccgagcaat 1080 ggtgatggac aacgggcaaa aaaaccaatg ctgtctctcg cgaccatacc gatcactctc 1140 acaccatcag gttctgccag ccccaatcct gacaatcgcg acgttgaacc gtttttgaca 1200 ccccgctcgg ggaactcaga cggggcgccg gccccaatgg atcactagct ccagtattcg 1260 gctgccgctt gacatcggga tgcacgaccc aaagtaccga ggaagccccg actctctccg 1320 gcacagtcgc gcagtccgca gccagcgttc atcttagtca tcccgatcct gctgccggaa 1380 gtagagtcgg ggtcttccag tcgtcacagt caggcaagta cgcacaccgt ctgagccctt 1440 cccctcctga tgcatatccg gtttccagtg ctactccttc aagctgcttg agccgacaca 1500 atcccgacgg cgacctacga gctgctgcac cgccgcagcc cacggcatgt tggacatcgg 1560 gatgcacgac ccaaagtacc gaggaagccc cgactctctc cggcacagtc gcgcagtccg 1620 cagccagcgt tcatcttagt catcccgatc ctgctgccgg aagtagagtc ggggtcttcc 1680 agtcgtcaca gtcaggcaag tacgcacacc gtctgagccc ttcccctcct gatgcatatc 1740 cggtttccag tgccactccc ttaaggcgtc cgggacgagg ccaaaagtcg cgattgccgt 1800 gtaacgagtc gctgaacatc tactaccaga acgtcggtgg catgaacaca cgagctctgg 1860 agtactgtct cgcatgctcc gccgtctcat atgatgttat cgcagtaacg gagacctggt 1920 taaaggccag cacactgtcc actcaagttt tcgggaacgg ctacgaggtc ttcaggggtg 1980 atcgcgagca gttcgtcaac agccagaaga gagatggcgg gggcgttgct atcgcggttc 2040 gtcacggact tcgagcgcga attgttgtgt ccgatgaatg gaagtgtgcc gagcaagtat 2100 gggtcgccat cgatctcgat gaccataccg tcttcatctg cgctgcgtac atcccccccg 2160 atcaaaccag cagtgcaaat gtgtttgaaa tcaacgcagc ttctgttttg gccgtctctg 2220 ccatggcccg cccctacgat gaactcatcg tcctcggaga tttcaacctg ccaggattga 2280 tctggcgtcc cagcggcgat ggttttctgt atgccgactc tgcgctttca acccaggccg 2340 cgcacgtgac ggagtttctg gacagctatt ccagctcctt gcttcaacag atcaactctt 2400 tcccgaacga aaacaacgtg gtgttggacc tctgctttgt cagctgccca gataccgcac 2460 ccccgctagt catcgctccg gcaccgctag ttaaggatgt ccggcatcat ccgccgctgc 2520 tgctgtccct gccggtccgt ttggccaccg agcacacgcc agttgtttgt gaagtccgct 2580 atgactatcg caatttagac gttcccagca tgttagagct cctgcaaagc attgattggg 2640 agaacacact tgacaaggat aatctcgacg aggccgtaca gacctttacc aacattctta 2700 gctacgccat cgacaggcac gtaccgaaga aacctgttac gaacgctcgt gttccatggc 2760 aaactagcga gcttaggaga gtcaaagctg caaagaggtc ggcactgaaa aaattctcca 2820 aacatcggac gttacctttg cacaactatt acgtgcgact caacagctgc tacaaaagga 2880 tgagtagaca gtgtcatgcc gatcacgaag tacgtaccca gaagaaactc aagtccaacc 2940 cgaaaggatt ttggaagtac atcaacgagc agcggaaaga aatcgggtta ccgtcatcta 3000 tgttcctagg agacgaaaca gcatcgaata cggaggagat ctgccagctg ttcgccgaga 3060 aattttcgag cgtgtttacg ctagaggacc tgcaaccaga acaagtcacc gccgccgcca 3120 gcaacgtccc aacaaacggt ctgggtctga acagcattag aatcgatcgc tccaagattc 3180 tggaagccgt agccaagctc aagatatcca acacgacagg cccggacgga gttccatcga 3240 ttgtcctgaa aaaatgcatc gacgggctcc tggagcctct ctatctcctc ttcaatctat 3300 ccctctctat tggatcattt ccgaagctct ggaagcacgc cttcatgttt ccggtacaca 3360 agaaaggtga taaacggaac gtggacaact atcgtggaat atcagcactg tgcgccgtct 3420 cgaaactttt cgagctagcg gtcatggctc cagttttctt cttctgcaaa aacctgatca 3480 gcgaggacca gcacgggttt atgccatcgc ggtcaacaac gacgaatcta ctcacgttta 3540 cgacgttcgt gaccgacagc ttcaccgcca gatcgcaaac cgacgcggtg tacaccgacc 3600 tgtcggcagc gttcgacaag ctgaatcacg ctattgcgat cgcaaaactg gaaaggatgg 3660 gaatcggcgg gactctcctc agttggttcc agtcctatct ggccgaccgg aaaatcaacg 3720 tcaagattgg agactgcttg tcggctgcct tgatggcctt ttcaggtatc gcacagggaa 3780 gtcatctcgg accactcgtc ttccttctat actacaacga ctgcaaccac tccctctctg 3840 gaccgcgtct gtcctacgca gacgacatga agatctactg tcaagtgaat aatgcagccg 3900 acgctcaagc cctccaacag gatctgaaca tcttcgcgga ctggtgtgaa acaaatcgaa 3960 tggtcgtcaa ccccaccaag tgctctgtca tcaccttctc caagaaacgc aacccaatca 4020 agtttgacta ccagatctgc ggcacattaa ttccccggga gaactgtgtc aaggacctcg 4080 gcgtgcttct tgacacggaa ctcacataca agcagcacat atcatacatg atttccaaag 4140 cctcgcgaca gctcggcttc atcttccgtg cagcaaaagg ttttacagat gtctactgtt 4200 tgaaggctct gtactgcgct ttggtccgtt caacgctgga gtattgttct gccgtatgga 4260 gcccctacta cgaaaacagt gcggtgcgga ttgagagtgt gcagcgaaga tttatccggt 4320 tcgcgctccg tcgactcccc tggcgggatc catttcatct gcctagttac gaaagccgct 4380 gtaaactgat caaccttgac accttagcag tgcgtagaaa tgtcagccgt gcactcgtaa 4440 tctccgacgc tctgacttct agattgcagt gcccagccat ccttcgagga ctagacatta 4500 tggccccgct tcgaacgctt cgtaacactc cgttcttgcg tgttcctgtt cgccggacga 4560 actatgctat gcatagcgcg ttgacgggac tacacagagc attcaatttt gtagcttctt 4620 tgttcgattt taatctgtcc cgtgacgtca tcaaaaataa gtttttattg ttttttaggc 4680 gataatttgt gtacatgtac ttagttttaa gcacaccatt tgggcatgtg gtgcctgttg 4740 gtgttaaaca aataaacaaa ta 4762 // ID Copia-116_AA-LTR repbase; DNA; INV; 316 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-116_AA_; KW Ty1_copia_Ele92; Copia-116_AA-I; Copia-116_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-316 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 316 BP; 77 A; 65 C; 86 G; 88 T; 0 other; tgttggagta tgcaatgatt gtgcctcgta tgcaaagcgg caacactgaa tgtgtagaat 60 ttggagaatg aggctgcgac gcatgcgcca gtagggcagg cggcgagagc gccaaataac 120 agagagaaaa gggagaatga aagaattgca ttcatccgta atgtgatgtt cgatccgtac 180 agacgtgttt acattaaagt tctcttgcgt acagtagttg agttattagt tgttttcttt 240 cctaccgaaa tcctggttcc gcgttccgct tcggtgaccc tgttctgcgt tgtggtgtgt 300 ccacctctgc ccaaca 316 // ID Gypsy-15_SI-I repbase; DNA; INV; 4381 BP. XX AC AEAQ01023729; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_SI_; KW Gypsy-15_SI-LTR; Gypsy-15_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4381 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023729; Positions 442 4822. XX CC Positions [3315-3776] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 69..4349 FT /product="Gypsy-15_SI-I_1p" FT /translation="MSLTPEALMALSTAFTEAIREATRMAAESTTGTTTST FT HSKPPSFKISEYRSSDGATVEDYFKRFEWALKLSKIAEEHWANFARVHMGT FT ELNQALKFLVSPRSPEDLTFEQIRDTLKNHIDRAKNKFAESIRFRKICQQK FT DETVASFALRLRQGAVHCEYGEFLDRMLIEQLLHGLESSDTCDEIIAKKPA FT TFDAAFDIAHTLEATRLTANEVKTTMTPAPEATHKLGYTQPKKKYGNKQGR FT QRSSSRNRQQQYQHKQKDFSKKKQDQRCQGCGRDHPRSQCQFRDAECHNCG FT IKGHIATVCRSKKKEKSDQVSDALVPADHVDSIQYFNKVNDINTSNLARKR FT MINVSIDGHNLEMELDTGAPCGIVSKKMLHTIKPTCLLQKTDRQFASYTSH FT KINCIGRLPVNVTIGRTTKRLNLYVVDGNFDTLMGREWISHFTHEIDFAKL FT FSSPDGIHAVSTVPPCLTAEQKGQLDQLLNRFQDVFDDVAGKLSGPPVKIH FT LKPEATPVFAKAREIPYALRDAYAREIDAKIKSGLYKKVDYSEWASTTHVV FT TKKNGKIRITGNYKPTLNPRIIIDEHPIPRAESIFNQMRGATTFCHLDITD FT AYSHLPVDEEFSHALTLNTPTHGLIRPTRAVYGAANIPAIWQRRMESVLQD FT LPKVRNFFDDVLLFADNFADLMITLEKTLERMRSHGLRLNRTKCVFASPSV FT EFLGHKIDAQGIHKSDKHIEAIRDAPKPSTHEELQLFLGKATYYNAYIPDL FT STRDRPLRDILRQETFKWTPAAEKAFKEIKTALISPQVLMPYDPSLPLLLA FT TDASKTGLGAVLSHRLSNGRERPIAYASRTMTTTEQKYPQIDKEALAIVWA FT VQKFFHYLYARHWTLITDHKPLTQILHPEKSLPVLCISRMANYADYLAHFD FT FDVIFKSTKENTNADYCSRASLSSTINAIQDSSPREKEEFDGDEFDHFIIK FT QIKQLPIDAEQIARETRKDPALGKIIKLLETGQNLERAGYKAPESAYKLAS FT NCLVFEHRVVIPLVLRKAMLDDLHAAHLGIVKMKGMARSFIYWPGIDPEIE FT RTAKSCVECAKHAHAPPKFRQHHWDYPKGPWERIHIDYAGPVAGMMLLIVS FT DAFSKWLEVKATNSMTTAATIAILDNLFTAHGVPTTVVSDNGTQFTSSEFK FT SFLQTSGVKYHKLTAPYHPSTNGQAERSVQTVKAALRAMGTTRSTLQEDLN FT RFLRHYRIAPHSTTEQSPSQLFLGRTLRTRLDLVRPDDVFTKVTQKWNSQF FT VPTFRSLKPTQVVYFLSGNPRMDKWIRGIITTRLGDLHYEIDYNGRRFKRH FT IDQIRGHEEGKTTEQSNVTPAPDVSNNWESRQPRRVRFYPDTSEAHTPVAP FT TNAKIQSSPVPQPRTSTRTTTVSEQIPSTASDRGVIRNGHASPPVLPRRST FT RDRRLPNRFSPSR" XX SQ Sequence 4381 BP; 1370 A; 1082 C; 933 G; 996 T; 0 other; aagtaagaga tacaacagtg gtgtcagcag tggaatgaga tcaatcagca aaacagcaag 60 cgtgcaaaat gtcgttaaca cccgaagcgt tgatggcatt gtcgaccgcc ttcacggaag 120 cgattcgaga agcaacacga atggccgcgg aatcgacaac ggggaccaca acctccacgc 180 actcaaagcc accatcgttt aaaataagtg aatatcgatc ctctgatggg gccaccgtcg 240 aagattattt taaacggttc gagtgggcac tcaaattaag taaaatcgcc gaagaacatt 300 gggccaactt cgctcgcgtt cacatgggaa cagagctcaa ccaggcactc aaattcttgg 360 taagcccacg atcacctgaa gatctcacat ttgagcaaat acgggatacc cttaagaacc 420 atatagatcg cgctaaaaat aaatttgcgg aaagcatccg atttcggaaa atctgccaac 480 aaaaagacga aacggtagca agttttgctc ttcgtttacg gcaaggtgca gttcactgcg 540 agtacggaga atttctcgac agaatgctaa tagaacaatt attgcatggg ctcgaatcta 600 gtgacacgtg cgacgaaatt atagcaaaga aaccggcaac ttttgacgcg gcatttgata 660 tcgcacacac tcttgaagca actcgactta ccgctaatga ggttaaaact acgatgacac 720 cagctcccga agctacgcat aaactaggtt atacacaacc gaaaaaaaag tacggcaaca 780 aacaaggacg ccagagatca tcatcacgca atcgacagca gcaataccaa cacaagcaga 840 aagatttctc aaaaaagaag caggaccagc gatgccaagg atgtggtaga gaccatccac 900 ggagtcaatg tcaattccga gatgccgaat gtcataattg cggtattaag ggacacatcg 960 ctacagtgtg tcgatccaag aagaaagaaa aatccgatca agtttctgac gcactcgtac 1020 cagcagatca cgtcgattct attcaatact tcaacaaggt caatgacatc aacacttcca 1080 acctagcacg gaaacgaatg atcaatgtct cgatcgatgg ccataattta gagatggagc 1140 tggatactgg agcaccctgc ggtatcgtaa gtaaaaaaat gttacatacc attaagccta 1200 catgtttgtt gcagaaaacc gatagacagt tcgccagtta tacgtctcac aaaatcaatt 1260 gtattggacg actgccagtc aatgtcacaa ttggccgaac taccaagagg cttaatttat 1320 acgtggtgga tggtaatttc gatacgctca tgggtcgaga atggatctca cacttcaccc 1380 acgaaataga tttcgcaaaa ctattctcat cgccagacgg gattcacgcg gtgtcaacgg 1440 taccgccctg tcttaccgca gagcaaaaag gtcagctaga tcaactcctg aatcgatttc 1500 aagatgtatt cgacgacgta gcagggaagc tatcgggccc gcctgtcaag atacatttga 1560 aaccagaagc gacaccagtc ttcgccaaag cacgtgaaat accttacgca ctccgtgatg 1620 cttacgcgag ggaaatcgat gcgaaaatca agtcaggact ctacaaaaaa gtcgattatt 1680 cagaatgggc gtcgactacg cacgttgtaa cgaagaaaaa tggcaaaatc cgcatcacgg 1740 gtaattacaa gccaacactt aatccgcgta tcattataga tgaacatccg atcccgaggg 1800 ctgagtcaat attcaatcaa atgagaggcg caacaacctt ttgtcatctc gatattacag 1860 atgcgtattc acatcttcct gtggatgagg aattcagtca cgcgcttacg cttaacacac 1920 ctacacatgg cttgattcga ccgacacgcg cagtgtacgg cgccgcaaat attccagcta 1980 tctggcaacg acggatggag tctgtcctcc aagatcttcc taaagtacgc aattttttcg 2040 atgatgtact acttttcgcc gataactttg ccgatctaat gataaccttg gaaaaaacgc 2100 tggaaagaat gcgctctcat ggtttacgac tcaatcgcac gaaatgcgtc ttcgcatcac 2160 cctctgtgga atttctagga cataagatcg acgctcaggg catccacaaa tcggataaac 2220 acattgaggc aatccgagat gcgccgaaac cgtcaacgca tgaagaactc caattatttc 2280 tggggaaagc cacatactac aatgcataca tcccggatct ttcgacgaga gaccgtccgc 2340 tgcgagatat acttcgtcag gagactttca aatggacgcc tgctgcagaa aaggcattta 2400 aggaaatcaa aacagcattg atatcacctc aagtccttat gccatacgac ccgtcgctac 2460 ctctactttt ggcaaccgat gccagcaaaa caggcttggg cgcggtactc tcacatcgtc 2520 tcagcaatgg tcgggagcga ccaatagctt atgccagtcg tacgatgact actacagaac 2580 agaaataccc acagatcgac aaagaagcgc ttgccatcgt ctgggctgta cagaagttct 2640 tccattatct ctacgctcgt cattggacct tgatcactga tcacaagcca ctaactcaaa 2700 tccttcatcc agaaaaatcg ctaccggtgc tatgtattag cagaatggct aattatgcag 2760 actatttagc tcatttcgac tttgatgtga ttttcaaatc cacgaaagag aatactaacg 2820 ctgattattg ttccagagca tcactatcat caacgattaa cgccatacag gattcttcgc 2880 cacgagagaa ggaagagttc gatggagatg aattcgatca cttcatcatt aagcaaatta 2940 agcaattacc aatcgacgcg gaacaaatcg ctcgagaaac aagaaaggac cctgcactcg 3000 gaaaaattat caaattacta gaaacaggac aaaacctcga acgtgcaggt tacaaagctc 3060 cggagtcagc ctataaactg gcctctaatt gtttggtgtt cgaacatcgt gtcgtgatac 3120 cactcgtgct acggaaagca atgctggatg atctacatgc ggcacattta ggtatcgtta 3180 aaatgaaggg aatggcacga tcatttattt attggcctgg catcgatccg gaaattgaac 3240 gtactgcgaa atcatgcgtt gaatgtgcta aacatgcaca tgctccccca aagtttcgtc 3300 agcatcactg ggactacccc aaaggaccgt gggaacgtat ccacatcgat tacgccggtc 3360 ccgtcgctgg aatgatgctc ttgattgtct ccgacgcatt tagcaaatgg cttgaagtca 3420 aggcgaccaa ctcgatgacg acggctgcaa caattgccat tcttgataac ttattcacag 3480 ctcacggtgt tcctaccact gtagtttcag acaatggaac gcagtttacg tcatctgaat 3540 tcaaatcgtt cttacaaacc agtggcgtta aataccacaa gttaacagcg ccttatcatc 3600 cctccaccaa tggacaagcc gaacgaagtg tgcagacagt caaagctgcc cttcgtgcca 3660 tggggactac tcgtagtaca cttcaagagg atctaaacag attccttcgt cattatcgca 3720 tcgcaccaca ctcaactacg gaacaatcac catcacagct attcctcggt cgaacattac 3780 gaacacgctt ggatttggtt cgacctgatg acgtcttcac aaaagttact caaaagtgga 3840 actcacagtt tgttccaacg tttcgctcac tgaagccaac tcaagtcgta tatttccttt 3900 cgggaaatcc taggatggat aaatggatcc gaggaattat cacgactcgt ctgggagacc 3960 ttcattacga gattgattac aacggtcgaa gattcaaacg ccatatagat caaattcgag 4020 gacatgaaga aggtaagacg acagagcaat ccaatgttac tccagcaccc gatgtgagta 4080 acaattggga aagccgtcaa ccccggcgag tacgattcta tccggatact tcggaagcac 4140 acacaccggt ggcacctacg aacgcaaaaa tacagtcgtc tccagtgcca caaccaagga 4200 cttcgactcg tacaacaacg gtttcagaac agatcccatc aacagcatcc gatagaggcg 4260 ttataagaaa cgggcacgcc tcacctccag ttcttcctcg gagatcgaca agggatcgtc 4320 gtcttccgaa cagattctct ccaagtcgct aatcgggaga atttccgttg aggggaggaa 4380 a 4381 // ID CACTA-3_AA repbase; DNA; INV; 9069 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-3_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 9069 BP; 2909 A; 1721 C; 1645 G; 2794 T; 0 other; cacacttaaa aactaatcgc gagctcggca gaattcatta ctgaactacc acggcaattg 60 gatattttac tgagaattca gaaaaaatac aatttttacc gactcttggt ttaacttcca 120 actgaaattt cggtacgttt taaacttttc ctgaattacg gcaacactga atgccgaacg 180 ccaccgtaaa catcactttc tccgacgtca gttaaaaatc ctgccgaaat ctcggttttc 240 aaagattttt tgccgaaaca cggtatttcg aatgtcatcc gccattattg ttgtcattga 300 acgcgagtta actccggcat ccgggaacga aggaaagttt aatttttaaa gtgaaattac 360 aagtgttaaa catggagcaa atgttggaaa tcatcgaaaa ggaagacgta agattgtatg 420 atatcttcag taagtatcgt tgtaaaacaa aatatttcat tgtagtaaat aatgtcaagt 480 ttgacctgct tcttctgccg tgaagcagtt gtgggttatg tcgaggagtt tcagcggcat 540 ataaatggcc acatcagaaa caaacatttt gtttcggatg gtgtatttta tgactgtatg 600 gttccgaaat gtagatccag atacgatcat ttcaaatctc ttaagaggca cataattcag 660 cagcatccag ttcagtttct caacaattca gatagcaaga tagaagaagt ttcagaacct 720 gttgacgatt catcgatccc tcatgaccat cccgcccaac cggaagaatg tatcgaagtt 780 ggtccaaacg attccacctc tttggaacag attaaaaaga ctatctccct tggaatttgc 840 cggttgactt cagatgtctc attgccgcaa tcgaaaattg cggaaatgat caaaatttgt 900 gaaacgttgg ttcaaatgct tggagagtat tttcaggaac aaacaaaggc gtttttggaa 960 gattgcgcca taaatttaga ttctccagaa acaatctcgt ttctgaatca gtttcatttg 1020 tccaatttgt tttctgatgt ttctacgtta tcgaagcaaa ccaatttttt gaaaacattg 1080 gccgtaagca ttcctgatcc atctgagaaa atgcttcaaa cccgagaaga catccggcat 1140 atcgatggaa ttccgaaaaa agtgaaggtt aacgaaactt atatgtacat accgattatt 1200 gacacattaa agctaatctt tagagccccc gaaacaagag atctgttaaa tgatgctgaa 1260 caagaacaat cgcctgttaa agaatattca agctttcgga ccggagaaac gtataaaaat 1320 agtaaatatt tttgtgaatt tcctaatgca atacgtttga gtctttatca agacgatgtg 1380 gagttaggaa atccattaag ctctagggct ggaataaata aagtgtcggt cttcgacttc 1440 aaaattcaaa atttcccaag taaatggaat agctccccca aaacagttta ccctcttatg 1500 tattgtacgt cgatcgatgc caaaaagcat ggatataaca aaattctgaa gcctctcata 1560 agcgacctca aaaaacttga aaaaggtgtc acggtattct acggaaacga agaatacact 1620 attcgggcgg ttgtaaccat attttgtggt gatacgttag cagtacatga gatttttgga 1680 ctccttggac ccatggctaa ctacttctgt agaatttgta ctatccaaag accggctttt 1740 catcaggacc cgttcgaaag cttcccacta cgaacaaaag catggtatga gcataatttg 1800 gatcaagtag aatctggtgc aataagttct tccgaatgcg gtttgaagcg tggtggatgc 1860 gccttaaacc agttgaataa ctttcacatt actgaaaatt ttgcacttga cgccatgcac 1920 gatattgcag aaggactggt acctcttact attcagttag tgctaagtta ctattacaag 1980 aaaaaagaat tgggcatgac tatagagttc atcaaccaac gcatacactg ttttgcatat 2040 ggctatgtgg ataaaaaaaa tcgaccatgt gcgaatttca cccatgaaat gctttctaag 2100 ccagcagtgc ataaattaaa gcaaacagct gcacagaatc tgttgttgct cagagcgttt 2160 ccttttctat tcgggcataa agttttggtt gattgcgatt atatgaagat gattggtcat 2220 ttgatcaata taacacgagt attaatgtcc actatagtct cggagcaaat gctggcagca 2280 ctagaagagc acgttagact atacgaatgc ttgttttacg aaaaattcaa aaggcgaatt 2340 aacaaaaacc accatcttga ccactacgta ctatgcatca aaaagagcgg aaacatgaag 2400 cagttcaact gtctggtgtt tgaacaaaag aataaaccaa cgaaaaatca atcatcgact 2460 tgccgaaact tcaaaaacat ctgtaaaagt cttgctcaac gccaatgctt taaaatgatt 2520 gtagatgtac tcgacaatcc cttccaagat aaaacatgtt ataatggcgg taaaatagtg 2580 cttcgagaac atgcgcgaag taaatttttt ttggacgaag ccttaattca cgtttttata 2640 cctaaatcgg tatctatcaa cggtatcgac tttcggccaa atctgatagt ttgtattcgt 2700 aactacgaca atgactacta tccttcatat ggaattatca cagaaattgt tgtgataaat 2760 agcaaagttc actttttgct gaagttatgc agtacgacag catataacaa ctttctggag 2820 gcatacgaag tttcggtaaa atcagaagaa cagttttttc cttttcatca aattcattcc 2880 catacgacat ttgctttctg gacaattcac ggaagtagtt ccaaatttgt ttccagaaga 2940 aactattgtc aagattactg aatgtgagtt atattatttc aagctttaat ttaagacaat 3000 cgttctaata aaatatattt cacagttcga gaagagataa cagacgtcga aacattctct 3060 atgctggaca acaacgactt cacaagactc aatctaacaa cgaaaatgat aaaaaccatc 3120 cagaagttcc aaaaacttgt cgaaaacgat ttagtagttg aagagttgga cgagcaagtc 3180 gaggaagaga tgaccatcga acacaagatt gagccaccca ccgaaaacaa gcttcctatt 3240 tctttggaaa aggtgtgtat ttattttgtt catgtaagtc ataattttca atcgctcata 3300 ttattcaaat atgtgactgg tcaggatatg catcgttttc tccctgggcg gcatgatata 3360 attaaaaact tactttatta cttgttatag gagatcgacg ttgccctcat attctcacag 3420 acacaaaatg ggcgagagat acacgaaatg ttggctgcag gcaacaggcc cagtgataaa 3480 attatcacaa gcattacgca tattgcatgc gactatctca agtctaccta cggagtgtaa 3540 gtatcataag taaaattcat ttaacgggtt gaccaaattt tacttaccat tttaggaggc 3600 catccaccta ctacaaagaa acacttgccc ggtccttgat caaaacttat cccgtattgg 3660 cgtcaactgc atcggatatt cctcacgtaa gtaataaacc gttattttta tgcgaacgtt 3720 cctaacgaac cgattgaatc aaactctggc atgatcactt ctggcacaag cttttctgca 3780 caccaagcat acatagactg gtgaaccatt actttcattc gttaatggta acgattaaaa 3840 gatcaaaatg atgttttacc ataggtactt atcaattttg atattaatga aaaataaatt 3900 cactatacat atttgaacta gtctgccgct attctttata aacaaataga ttttcaattt 3960 taccacccac cacccaatgt tctggattca cttaggtggc tcagatgatt gttgcgtgcc 4020 actagttctc tctttgattc tcccgttgtt cttatgcaga gcaaatactg acgtcactat 4080 ttgctttgca taagaacagc gggagaatga aatatagaac tagtggccca caacaatcat 4140 ctgcaccacc tgagtaaaac cagaaccttg cccaccaccc acttaacatc gaagcgagca 4200 agcaaaccgc agtcgggaga tctgtcattc tgtctgtagt ttcgagatta acttttcaat 4260 ttgtcccaaa tatgttaagc atttacggaa ccatcgattc ggagctcatt ttgaccaagt 4320 actaaaaata ctataaaata cggttttaat caattttgcc acgatgcaaa atttttaaat 4380 caattggttt gctacaaaag tgaatatttt gtttgaattt taaaatgatc ttcggcattc 4440 ttgtttatgt aaattgagtc taaccaagaa atcaaatttg tgctaagaat attaccagaa 4500 atcaaaatct gcaaaaatct tttatgaaac ttaaaaggtt tttgtgaagc atctgtgaag 4560 tgaaatatgc aaatatccaa ggattcgtca aggaatccga cattcttctg gaattcattt 4620 tgctaaacct ctttattttc aactatatat atttttagat tcctgaaaat tttaaaaaat 4680 acaaagaatt tctctccgcg ccttgttaaa aaccctcgta agaggaaatt gaaaaatgtc 4740 aaataacaaa aagtcttatt tagtggattc caatcgatta catatgctag aacaagggat 4800 aaacaagcac ttacaaaaaa aaaacttaaa taaaattgca cgctagtcaa ccataatttg 4860 aaccgaaata tctgagtcat tcttgagcca agctccctta gtaggttttt tttttcaaac 4920 agtaaattat ttgtttgttt agcaaattga aataatctta acggtagcgt gcaaaaattt 4980 tggtgaatat tgttaaacaa ccattctcaa atgttcgtct atattgattt gttaaccatt 5040 aatggcaaat aatattattg tgaatatttc tcgataactt tttcaaatca acgtacgcaa 5100 cttttgtact gttttgagaa aatttattga gaagaattaa aaagaattaa actgtaccct 5160 tttgaccatt ttttcgtctt attgctcaaa ttgcatgcaa tcagttcgcg ttcaatgttc 5220 aattagggac tttctattat tttcccataa aaacttgaag ttttattcaa ttatttgcgt 5280 cgtttttgta ccatcgactc aagttgtgtt ttaattttat tcgtgcttag aaatggtaca 5340 caaattatgt caagcagaat ttggaactat tggacccccc cccccccttt ggtcacgttt 5400 tctgaatgta acatcaaaac attttgtaag acttgtcacg ctgtgtgacc cctcccctct 5460 attggagcgt gacattattt gggtatgacc ccttacgtcg tttgtctctt gaaataacgg 5520 agcggtgtaa ttagtggtta ggtttttaaa ttttgtctcc tgaatacgat caactgataa 5580 tttacttttc gggaataagt ctgtttgaat actttttcga gatttaaatt tatttcaaaa 5640 tgatttgctt ccgtaaatag ttttatagtg tgttagttac tttaagaagt aagtggtttg 5700 attgacgatt tgacactaaa atatagttgt ttgcattgca tggttaagaa agtgtggctt 5760 gaataaaatt cgtgcatatt ggtcaattac gagtacagta ttccctgttt cggttgtttg 5820 ataagttaat tactcccagt tggtcgggta tcaatccccg tcgacgcagt ggggacgtct 5880 accttggaag gggaataagc ccttagttcc atcagcacat gatgggttcg gtatgcaggg 5940 tccccgatgt gtttggaggt ctccctgggt gtctctgttt agccttaata ccagaaatag 6000 ccggacataa aacaacaaca ctgatgttaa aaagcgtatc gattgatcag aaaaaaggag 6060 atcgattagg gtaagtgttc ccttagttgt ggctgttcct atagttgcgg tagtgccgtt 6120 ttcactgatt ttattgcatt agccactgaa ccgacactgc caatcgacgt cttcttcttc 6180 tttctggcgt tacgtcccaa ctgcgacaaa gcctgcttct cagattagtg ttcttatgag 6240 cacttccaca gttattaact gagagctttc tttgccgatt gaccattttt gcatgtgtat 6300 atcgtgtggc aggtacgaag atactctatg ccctggaaat cgagaaaatt tcctttacga 6360 aaagatcctc gaccagtggg attcgaaccc acgaccctca gcatggtcat gctgaatagc 6420 tgcgcgttta ccgctacggc tatctgggcc ccgacgtatt ggcttattga tacacggaat 6480 agttaaaaag agcgttcaat ttgctttaaa actgatgaaa tatcactaaa tttgctaaaa 6540 ctgttcttgc ttgtaccaat agttgcggta aagtgttcct atagtggagg atcccataag 6600 aaaacaacgg ataccgcaac tataggaaca caaattaaaa atttaccgca actaaaggaa 6660 cagtatacca atagtggagg tattattttt cactgacatg ccgtggatta ctacgatgaa 6720 atcttttttc tcattaagtc aatggtcgtt acttcccgtt acaacatcaa catgtgcatt 6780 aattgcgctt cttgaattaa ggcggttaaa tgaagttcaa tcgtgcttag tacctccact 6840 attggtacat ctaccctaat tcatgatggg gcatctgtac cagtgataga agcaatagta 6900 acgggttgag gaataaggaa ggaatcgcga aaaaaatgaa agctcaattt ccatatttga 6960 atacgtcatt caaaagcttt cctattctat ttcacatgaa aaaataatct ttgaatcata 7020 aaatcagtat ttttcagaaa aatacttcct tcatcactgg aacatgatcc agtaataatg 7080 ttttttttag gtacagaagc ctataatagt ggtttacatt gatttctcat taagactcac 7140 tattgtagga acgcaccact attattgatg caaagaagcg tgttttctgc tatttcctta 7200 tattttcaca caatcattta attttgacca gtaaaataca caaaaatata aaagttgcct 7260 ataacgcgat ataaaccgca ttaccactat tagtggacac acacaaccac tggatcagcc 7320 attctaataa tgtttattat tccaggcact gtggtttcac aagaacggac gtggcgatgg 7380 gcgtcatgct ggtaaaattc attacaggat ggaaagccta gcaaaacaat cagacagccg 7440 cgtatttctt cggcagcgtg atgccgaaca agtaccacag acatctggaa caacagtagt 7500 cgacaaagaa gaacccaaca ttgatgagct ggtaaaatag tgatatttat tattccaaca 7560 attgcttcat aaaacgattt ctgatttcag gttgacgaac tacgatgtat tgtcccaacg 7620 caccaagaaa aaggaaaaat tgaagaattg tggaagaaaa cgattattca acgaaacaaa 7680 gcacgagacg aaggattttt tttgcagtac ttgaaggatt tccccgttgc gttggcattc 7740 gacggtcaat tggttggtta aatatcagtt tttttttggt agtttataac gagcttgttt 7800 tttatttcag atttctcttg acttccaatt gctgaagccg aatgctcaat gttttgatga 7860 tgcatggaat tctattctac caaaaatact ggaccaatac cgagatgtac acatgtacat 7920 gaaaaatgat gtcatcaaag cattgaccgt tattcgtgac aaaaacccga gcagaggagc 7980 gaaacgtccg cgagaggaaa cccaagcacg taaacttaat ccacttcgtg gtgtcatcga 8040 gtggattgat gtaagtagta cacaagatcg attatgtcat tcaaatgaaa atttcaaaaa 8100 tatgtctgaa actacacatt aaaacaaaaa agtcacagaa tcactaaagt ttatgcattg 8160 atgactattt tctatctatc tctttcattc tgctgttaat ttttacacta gctgccattt 8220 tgactgactt caagcttagc aatgatttaa cccaagggaa ctgataaata gcttaaaaat 8280 ctctgcactt cactctgcag tgaggcaaat agaaaatagc aaaaatacta taaattagta 8340 actctgtgac tacctttcct tctgagtaaa agcagacgaa attcataact aatgagaata 8400 atttttttta catttcagcc ggagttcgaa atgccctcaa cagaaattcc tatgattttc 8460 atcgccgaaa agttattcga gatcggggac tgttgtgtgg cctggaaaga tattacgatt 8520 cccgtaggaa atgacgtctt ggcggccttc aagctccttt gtcaagcttt cgtggttttc 8580 aatgtgaaat gcagcccatc cgataaaata ttttattcct tttttcatgc attttgcttc 8640 aaagtagaac cgctgagtac cacgagcaac aagtttgtgg ataaactgaa ctgaacaaca 8700 acatgttgaa cttttcctaa caactaagga aacctaaacg gttagcagag aaataaaata 8760 aatgccctga ttgattacac tatgtttcat ttgtttattt gtatttatca ctggcttatc 8820 gttatttaaa caacagttgg ctccatttta cagctgaaaa tcctgtgcta ttattatctg 8880 attgttttta ttttttgctg agattccagt tcttgtcaaa ccgaaaatct cagtaaaata 8940 cagcagctgt catttttgtt gaaaccgaga atgtcagttg ttaacaatct gccgaaatcg 9000 caaccgagca ctcggctgtg cgattctcgg caaaattaaa ccgagattca gcgaaaaaat 9060 ttaagtgtg 9069 // ID Copia-1_SI-LTR repbase; DNA; INV; 359 BP. XX AC AEAQ01003789; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_SI_; KW Copia-1_SI-I; Copia-1_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-359 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01003789; Positions 1702 1344. XX SQ Sequence 359 BP; 96 A; 87 C; 81 G; 95 T; 0 other; tggagtagtt cgttgcgggt cggatgagag gcgcgatcgc ccaatcagag acgggcgaac 60 gctcggcgaa taggagtccg agactagcga gagcggctcg aaagtacgcg tgtccggctt 120 ccggacttac ttcgtgttcg gcagtccatg agagagcacg cgatcatctt acaactcacg 180 aaactgaaca aacagttttt tttgtctgta caagaaacga gtttccgaaa taataaagtt 240 cacttttcca ttaccatttt tgttaaaaat tgaaacacgc gagtgaacat ttcattttca 300 acctgctcga taattctctt cagagactct tccttcacat ccatcaatcg ttgccctca 359 // ID CR1-54_HM repbase; DNA; INV; 4384 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-54_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4384 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1882-1882 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 117..887 FT /product="CR1-54_HM_1p" FT /translation="MSITLVEVKKLITEMFMKFKIETEEMLKQQENTFINI FT VSSNTKILNDRLDRVELNILQNSTKIASIAKEVEEIKESLNFHEDLFNKKI FT KTAIDCLAKENLANFKIQNNNNELIRINNKLREIEDRSRRNNLRIDGVLEN FT DNESWAECEKKVKEIFNNILGVKNVNIERAHRIGKVVHNKHRTIILKLLDF FT KDKTEILNNSSKLKGKNIFIYEDYCKETNIIRKTLHEKMKIERQAGKYAYI FT SYDKLIVRDWNSNKK*" FT CDS join(947..1582,1552..4032) FT /product="CR1-54_HM_2p" FT /translation="MASVNDFETFTFFDNVFSLRENINKDINYFHEINKEC FT AYYYPWEVEGFLFNNICNKNLYQIRILHVNIRSINCNFEKLQNLLKETKNT FT FNIICLTETWATANDLINDSNFHLPHFEIISLERQTNKRGGGVLIYVHENI FT RHCLRNDLSISNGDGELLTIEIIIEQTKNILLSCCYRPPDGVAENLSMFLL FT QNVFKEGNNKNKKIFLSLETLIKNFSFIGDFNMNCFIYNDNNKVRNFYDSI FT FEAGAFPLINRPTRVTKYSATLLDNIITTDIFNNDIKKGILKTDISDHFPI FT FLTIPTNQANVKKPNKILTKRIFNQSKIELFKNQLSLLHWKHLNFNDNANN FT LFENFFKTFFSVYDANFPNVTHTSKTKNINNPWVTKGFKKSSKTKQKLYIK FT YLKTKTPTSEKKYKEYKNLFEKIRKSLKKNYYSNLINETKNNLKRTWQILK FT EIIGKQKTCSSFLPHMLKIDNKILYNPQIIANEFNKYFVEIGPTLSDKIPI FT TKTSFHDFLAPLDKGICSDELSSELSFEEFEKAFKTLKKNKATGPDEINGN FT IVIDCYEQIKNILFKIYKASIQQGIFPERLKIAKIIPIHKEGDRSNINNYR FT PISILSVFSKILERIMFNRVNNYFNYNNLLYNNQFGFKKDSSTEHAIIQFV FT RNISKSFEKAQYTLGIFIDLSKAFDTVDHNILIQKMKYYGLNNNTLAWFKS FT YLTNRKQVVYSNNDVQSGSLNITCGVPQGSILGPLLFLIYVNDLNKASNLM FT SIIYADDTNLFQSHTDVYKLFLKTNNELINISKWFKCNKLTLNTNKTKWIL FT FHSFSKKRSLPDIMPQIYIDETLIKRDSVIRFLGVYLDENITWKNHINHVT FT TKVSKNIGILYKARPYLSKKSLTQLYYSFIHSYINYANIAWGSTEKSKLQR FT LYRQQKHAIRVINYVDRSTHSKPFFDEMKILNVYKLNVFNVLCFTYMWKNN FT LSLPVFNDLFTLKLTNKYSLRNKDLLNEPFCRTNFNKFCINYRAPYLWNKI FT VLPNFDPFIRFSVFKNELKKFILAMDDILSYY*" XX SQ Sequence 4384 BP; 1753 A; 645 C; 551 G; 1434 T; 1 other; tttcgcaacg aacgctgaag tgaacgcacg ctttttcgct gcttaaagaa aataaaatag 60 aaaatcgtta gtagtattaa agttacttat attatattaa tttatactta ttgagcatgt 120 ctatcacact tgttgaagtt aaaaaattaa taacggaaat gtttatgaag tttaagatcg 180 agacggaaga aatgcttaag caacaagaaa atactttcat caacatagtt agttctaaca 240 caaaaatatt aaatgatcga ttagacagag ttgaacttaa catacttcaa aactcaacta 300 aaatagcatc aattgcaaag gaagtggaag aaatcaaaga aagtttaaat tttcatgaag 360 atctctttaa taaaaaaata aaaaccgcca ttgattgtct tgcaaaagaa aatctggcaa 420 acttcaaaat acaaaayaac aacaatgaac ttataagaat aaataacaaa ctcagagaga 480 ttgaggatag gtcaaggcgt aataatttaa gaatcgatgg tgtccttgaa aatgacaatg 540 aaagctgggc tgaatgtgaa aagaaggtca aagaaatatt caacaacata ttaggagtca 600 aaaatgtaaa catcgaaaga gcgcatagaa taggcaaagt agttcacaac aaacatagaa 660 caataatact aaaacttctt gattttaaag ataaaactga aattcttaac aactcttcca 720 aattaaaagg taaaaacatt tttatatatg aagattactg caaggaaaca aatattatac 780 gaaagacact acatgaaaaa atgaaaatag aaaggcaagc tggcaaatac gcgtatattt 840 catacgacaa acttattgtt cgcgattgga attccaataa aaagtaatat attatctttt 900 aaatttttat tatattttta cataaactta tttgctttta ttaattatgg catcagtaaa 960 cgattttgaa acttttactt ttttcgataa tgttttttca ttgcgcgaaa acattaacaa 1020 agacattaat tattttcatg aaattaacaa agaatgtgct tattactatc cgtgggaagt 1080 agaagggttt ctatttaaca atatttgcaa taaaaactta tatcaaatta gaattctaca 1140 cgttaatatt cgaagcatta attgcaattt tgaaaaactt caaaatttac ttaaagaaac 1200 aaaaaacact tttaatataa tttgtttaac tgaaacttgg gctactgcaa acgatttaat 1260 taatgactct aattttcatc ttcctcattt tgaaataatc tcacttgaaa ggcaaactaa 1320 taagcgcggt ggcggagttc taatttacgt tcatgaaaat attaggcatt gtctaagaaa 1380 cgatcttagt atttctaacg gcgatggaga acttttaacc attgaaataa taattgaaca 1440 aacaaaaaac attcttctaa gctgctgtta tcgaccacct gatggtgtgg ccgaaaactt 1500 gagcatgttt ttactacaaa acgtttttaa agaaggaaat aataaaaata aaaaaatttt 1560 tctttcattg gagactttaa tatgaattgt tttatatata atgataacaa caaagttaga 1620 aacttttatg attcaatttt tgaggctgga gcttttccat taataaatcg cccgacgaga 1680 gtgacaaaat actcagcaac tttactagac aatatcatta caacagacat ctttaataat 1740 gatataaaaa aaggaatctt aaaaacagat atctcggatc actttcctat ttttctgact 1800 atacccacta accaagcaaa tgtaaaaaaa cctaacaaaa tcttaactaa acgtattttt 1860 aatcaatcaa aaattgaatt gtttaaaaac caactatcgc tactgcattg gaaacattta 1920 aactttaatg ataacgctaa taatcttttt gaaaatttct ttaaaacatt tttttcggtt 1980 tacgatgcta actttcctaa tgttactcat acatcaaaaa ctaaaaacat aaataatcct 2040 tgggttacca aaggtttcaa aaaatcatca aaaactaagc aaaagttata tataaaatac 2100 ctaaaaacaa aaactcctac aagcgaaaaa aaatacaaag aatataaaaa tttatttgaa 2160 aaaatccgga aaagtttaaa aaaaaactac tattcaaatt tgattaatga aacaaaaaat 2220 aacctaaaac gcacttggca aatactaaaa gaaataattg ggaaacaaaa aacatgctca 2280 agttttttac cacacatgct taaaatcgat aacaaaattt tatacaatcc acaaattata 2340 gctaatgagt ttaataaata ttttgttgaa attggtccaa ccctatcaga taaaattcca 2400 attaccaaaa cttcttttca tgatttttta gcacctcttg ataaaggcat ttgctcagat 2460 gaactatcgt ccgaattatc ctttgaagag tttgaaaaag cttttaaaac cttaaaaaaa 2520 aataaagcaa ctgggcctga tgaaataaac ggaaatatag ttatagattg ctatgaacaa 2580 attaaaaata ttctttttaa aatctacaaa gcatccatcc aacaaggaat tttccctgag 2640 cgcttaaaaa ttgctaaaat tattccaatt cacaaagaag gcgacagatc caatatcaat 2700 aattaccgcc ccatctctat cctatctgta ttctccaaaa tcctagaaag aattatgttt 2760 aacagagtta ataattattt taattacaat aacctactat acaataatca attcggcttc 2820 aaaaaagata gttcaactga gcatgccatt attcaatttg tacgcaacat ctccaaatcc 2880 tttgaaaaag ctcaatatac attaggtatt tttatagatc tatcaaaagc atttgatacg 2940 gttgaccata atattttaat acaaaaaatg aaatattacg gtttaaacaa taatacttta 3000 gcgtggttta aaagctattt aacaaatcgt aagcaagtag tctatagtaa taatgacgtt 3060 caaagtggat ccttaaatat aacctgtggt gttccacaag gttctattct tggaccactg 3120 ctatttctta tatatgtcaa tgatctaaac aaagcctcta acctgatgag tattatttat 3180 gccgatgaca ctaacttatt tcaatcccac actgatgttt acaaactttt cttaaaaact 3240 aataatgaac ttataaacat ttctaaatgg tttaaatgta ataaattaac gttaaatact 3300 aacaagacta aatggattct ttttcattcc ttctcaaaaa aacgttcttt accagatatt 3360 atgcctcaaa tttatattga tgaaactcta ataaaaagag attctgttat aagattccta 3420 ggtgtttatc ttgatgagaa tattacttgg aaaaaccata ttaatcatgt aaccaccaag 3480 gtatctaaaa atattggcat tttgtataaa gcccgacctt acttaagcaa aaaaagcctt 3540 acgcagctct attactcatt catacacagc tatataaact acgcaaatat tgcatggggt 3600 agtaccgaaa aaagtaaact acaacgtcta tatcgccaac agaaacatgc gattcgcgtg 3660 attaattatg tggatcgctc tacacattca aagcctttct ttgatgaaat gaaaatactt 3720 aatgtctaca aacttaatgt tttcaatgtt ttatgtttta cttatatgtg gaaaaacaat 3780 ttatccctac ctgtttttaa cgaccttttt acattaaaac ttactaataa gtactcacta 3840 agaaataaag atttgctaaa tgaaccattt tgtagaacaa attttaataa attttgtatt 3900 aattatagag caccttacct ttggaataaa atagttctgc ctaattttga tccttttatc 3960 cgtttttctg tttttaaaaa tgaattaaaa aaattcattc ttgctatgga tgacatttta 4020 agttactatt agcttgaagt ataaacttga tttatgagat ttataataat atttatatta 4080 ataaatttat tttctttaat ttatatcaga tctataaaat ttattattat ggattattat 4140 aacagatttg tgagaattat tattacagat acgtgattaa tattatgaat ttattattat 4200 atatttatta ttacggccgt ttttggattt atattttcag tttttatggt atgataaaat 4260 tgtaaaggtt ctgatgataa gatcagtaga tcttctttca gaagccttgt tttctttgtt 4320 cttgtaaaat atagtattta ttttacggca aatgtaaaca gaaaaaaaaa aaaaaaaaaa 4380 aaaa 4384 // ID Helitron-4_NVi repbase; DNA; INV; 3763 BP. XX AC . XX DT 15-APR-2009 (Rel. 14.04, Created) DT 15-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Helitron DNA transposon - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-4_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3763 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 765-765 (2009). XX DR [1] (Consensus) XX CC The conensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS join(142..879,869..1168,1181..2536,2527..2739, FT 2743..3525) FT /product="Helitron-4_NVi_1p" FT /translation="RIIMYLTRNSNHDKRRYNAAKCDEVAVVFVGENGQPP FT EDRDVCIYSKHRAPNIPYISKHVDPMSYPLMYPNGGFGWMPNMKNKNNKNN FT ISMLQFYCHKFSIRNDFNPYLNLGKLTQQYIVDAWVKVEGSRLFFLNKNQH FT QLRTDLYKGVMDHLKNKSNDDNVKIGKMVILPSTFSGSPRSLQQNFLDSMT FT IVQKYGKPDLFITMTCNPKWKEIKQNLRKNESAIDRPDLVAKVFNAKVNEL FT LNEILMKFFKKNIFGKCIAYTYVIEFQKRGLPHMHLLAFIHKNHKLNNADD FT VDKLISAEIPDEMKYPRLHNIVKQSMIHGPCGKNNLNSPCMDSKTKKCTKN FT FPKQTTFNNTGYPLYRRSNNGKQIRFNKNNIADNRFVVPYNPYLLLKFNCH FT INVEVCSTIQCVKYLFKYCYKGHDCAFIKMSQGNNENECXEENFNYDEIKQ FT YLNTRYVSPPEAMYRLLQYAMYVLSHVIYRLAVHLKNEQFVFFKEGSEEEL FT INKNLNTTLTAWFELNANDPQARQYLYVDIPYFYVFNEKTKKWTPRKRVWK FT PIISRMYFVNPKDRERFFLRLLLLHVKGAQSYEELKTLENVTYATFHEVVV FT ARKLITTDEEWDKCLEEAIKVEFPKALCNLFGYICVFHHPVNVKELFEKYK FT AYFYSPLLTKEIRENISLKHINSILTSHGYTLTDFNLPKFSEDVEETNYEE FT FYATNKIENDSLYNISSLTNKQREIFDAVINSVINKANQVKCFFIDGPGGS FT GKSYLLNTIITHLNKEKISVLPVAWTGIAANLLINERTSHVIFKLHHITSL FT NINEDSTCNITPNSIYRKNLKNVEVIIWDEITMTSKHAFQAVDKLFKDLCN FT NDIEFGGKVMIVSGDFQTLPVVKHGNRTKIIENCVKNSHLWSKFQIMTLKD FT KMRVSNCDKEFKQWLLKIGDGKYSNKFEMDNDITLLPTELLSTSDIIMEIY FT GDCFDNNNANKLCERVILAPKNNDVLKINNNILNRMQGDVREYLSVDSCED FT DNKEMLPIEFLNSLTPNGLPPHKLRLKVGAVIILLRNLNLNDGLCNGTRLI FT VRKMMQFCIQAEIISGKQFGKVVLIPRIDLTSSKEEIPFDMTRRQFPVRLG FT YVMTINKSQGQTFNKVGVYYM*" XX SQ Sequence 3763 BP; 1532 A; 472 C; 597 G; 1160 T; 2 other; atgttcttta atgctattaa aggaattaga taattttttt cgacaagtaa acatatacgc 60 taaatcatac aaaatgatgc acgaagtcga aatggaagaa caaaatcgtt tgaaaaacaa 120 caacgttgag tttaatagta aagaattata atgtatttaa cacgcaattc taatcatgat 180 aaacgtagat ataacgcagc taaatgtgat gaagtagctg tggtatttgt aggtgaaaat 240 ggtcaaccac cagaagatcg agatgtatgt atatattcaa aacatagagc acctaatata 300 ccgtacatta gtaaacatgt agatcctatg tcatatccac ttatgtatcc taatggaggt 360 tttggatgga tgcccaatat gaaaaataaa aataacaaaa acaatatttc aatgctacaa 420 ttttattgtc ataaattcag cattagaaac gattttaatc cttacttaaa tttaggaaaa 480 ttaactcaac aatacattgt tgatgcttgg gtaaaagttg aaggttcacg attatttttt 540 ttaaataaaa atcaacacca attaaggaca gatttataca aaggtgtcat ggatcactta 600 aaaaataaaa gtaacgatga caacgtaaaa attggaaaaa tggttatatt accttcaaca 660 ttttcaggaa gtccacgatc gttacaacaa aactttttag attcaatgac aatagttcaa 720 aagtatggta aaccagatct tttyattact atgacatgta acccaaaatg gaaggaaatt 780 aaacaaaatt tgagaaaaaa tgaatcagct attgatagac ctgatttagt agcaaaagtt 840 tttaatgcaa aagttaatga attattaaat gaaattcttt aaaaaaaaca tttttggaaa 900 atgtattgca tatacttatg taattgagtt tcagaaaaga ggtttaccac atatgcactt 960 gttggctttt attcataaaa atcataaact gaacaatgca gatgatgttg ataaacttat 1020 aagtgcagaa atacctgatg aaatgaaata tccaaggtta cataatatcg ttaagcaatc 1080 aatgatacat ggcccttgtg gtaaaaataa cttaaattca ccatgtatgg atagcaaaac 1140 aaaaaaatgt accaaaaact ttccgaaata gtttaattag caaactactt ttaataatac 1200 tggatatccc ctttatcgac gaagtaacaa tggaaaacaa attcgtttca ataaaaacaa 1260 tatagctgat aatcgatttg tagttccata caatccgtat ttattgttga aatttaattg 1320 tcatattaat gttgaagtat gttcaactat tcaatgtgtc aaatatttat ttaaatattg 1380 ttataagggg catgattgtg catttataaa aatgagtcaa ggtaataatg aaaacgaatg 1440 traagaagaa aatttcaact acgatgaaat aaaacaatat ttaaatacaa gatatgtaag 1500 ccctccagaa gcaatgtatc gtctactaca atatgcaatg tacgttttat cacacgtaat 1560 ttaccgatta gctgtacatt taaaaaatga acagtttgta ttctttaagg aaggatcaga 1620 agaagagttg attaataaaa acttaaacac aactttgact gcatggtttg aactaaatgc 1680 aaacgatccg caagctagac agtacttgta tgttgatata ccatattttt acgttttcaa 1740 tgaaaaaaca aagaaatgga caccaagaaa aagagtttgg aagccaatta taagtagaat 1800 gtactttgta aatccgaagg atcgtgaaag atttttttta agattattat tattacatgt 1860 aaaaggggct caatcttatg aagaattaaa aacattggaa aatgttacat acgcaacatt 1920 tcatgaagta gttgtagcta gaaaattaat aaccaccgat gaagaatggg acaaatgctt 1980 ggaagaagca atcaaggttg aatttcctaa agctctgtgt aacctgtttg gttatatatg 2040 cgtatttcat cacccagtaa atgttaaaga attatttgaa aaatataaag catattttta 2100 cagtccattg ttaactaaag aaatacgtga aaatatctca ctgaaacata tcaattctat 2160 tttaacaagt cacgggtata cactgactga tttcaattta cctaaattta gtgaggatgt 2220 tgaggaaacc aattatgaag aattttatgc aacaaataaa attgaaaatg attcattata 2280 caatataagc tctttaacga ataaacaacg tgaaattttc gatgcagtaa taaatagtgt 2340 aattaataaa gctaatcaag tgaagtgttt ctttattgat ggacctggag gtagtggaaa 2400 aagctactta ctaaatacaa ttatcactca tttaaataaa gaaaaaataa gtgttttacc 2460 tgtagcatgg actggtatag cagcaaattt attaataaat gaaagaactt cacatgtaat 2520 ttttaaatta catcattaaa tataaatgaa gattcaacgt gcaacattac tcccaattca 2580 atatatagaa aaaatctgaa aaatgtagaa gtcataattt gggatgaaat aacaatgaca 2640 tcaaaacatg cttttcaggc agtggataaa ttatttaaag acttatgtaa taatgacatt 2700 gaatttggtg gcaaagtaat gattgtatca ggtgacttct gacaaacact acctgttgtc 2760 aaacatggga atcgaacaaa aattattgaa aattgtgtga aaaatagtca tttatggagt 2820 aaattccaaa taatgacttt aaaagataaa atgagagtta gtaattgtga caaagagttt 2880 aaacaatggc tattaaaaat tggagatgga aaatattcca acaaatttga aatggacaat 2940 gatataactt tacttccaac agaattatta tcaaccagtg atataattat ggaaatttat 3000 ggagattgct ttgataataa taatgccaat aaactttgcg aaagagtaat tttagcgccg 3060 aaaaacaatg atgtactaaa aattaataat aacattttaa ataggatgca aggcgatgta 3120 agagagtatc taagtgtaga ttcatgcgaa gatgataata aagagatgtt accaatagaa 3180 tttttaaatt cacttacgcc aaatggccta ccgcctcata agttacgttt aaaagtaggt 3240 gctgtaatta ttttgctgag aaatttaaat ttaaatgatg ggctttgtaa tggtactcga 3300 ttaatagtaa ggaaaatgat gcaattctgt atacaggcag aaataattag cggtaaacaa 3360 tttggtaaag ttgttttaat accacggatt gatttgactt cttcaaaaga agaaattcct 3420 tttgacatga caagacgaca gtttccagta cgattaggat atgtcatgac aatcaacaaa 3480 tcacaaggcc aaacatttaa caaagttggt gtctattata tgtagcatta tccagggtaa 3540 cgttaaaaaa atttaacaat tttactttca aaaaattaag tttcgaggat aaaagtaaaa 3600 ataatgatga ttcaaataag atatatacga aaaatatcgt atatcatgaa atattaaatt 3660 aataataaac taataaaata tgtcagttat tgagaaaaga tgtgaatgga ttcacggcac 3720 acagatacct taagtatact ttaatataat agtagaagta gat 3763 // ID Gypsy-69_CQ-I repbase; DNA; INV; 6751 BP. XX AC AAWU01038780; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-69_CQ_; KW Gypsy-69_CQ-LTR; Gypsy-69_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6751 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 517-517 (2011). XX DR Genome; AAWU01038780; Positions 1045 7795. XX CC Positions [2691-3230] - Reverse transcriptase CC Positions [4464-4940] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 667..1800 FT /product="Gypsy-69_CQ-I_1p" FT /translation="MLTRKMAEEIQAALDKLREDLERQRLEIVELQGRNEN FT ELFGDTSDDPEAAVEMPIPENNFKASKIPDAIKMIVPYHGDPKTLATWIRS FT VEEKVKFASEDCPSERDRKRAWPLWISVIRDKILDEANNVLVANHIPTDWE FT AIKTALIDQLGDKRDLSTLVTNISYLEQGAKDIVQFYKECRELLSDITAKI FT SIDRELESCARTLTGNYENMIMNSFIDGLNDPYSTLTRTSQPKSLLEAYQN FT TLGQFNANQRKKQRLRTTTHSHKITSHKYGPVVKYNSFSQSTTPNNQQIPL FT QSYSQPGPSKPFFKPSFPQSKPQLLQIKADPSTQTRQNRWPYPQRQGGVNY FT HEAPEEIQTEEEVEQTPEEVEDLNFCLTDELENET" FT CDS 2412..5243 FT /product="Gypsy-69_CQ-I_2p" FT /translation="MKSPIFKPNKNKLFEQLRLDHLNQEEKKQISKLIQSF FT QDIFFLEGQKLSFTNAIKHKINTSDELPVYTKTYRYPYFHKEEVRKQIDSM FT LNQGIIRPSISPWSSPIWIVPKKIDASGQKKWRLVVDYRKLNEKTVGDRYP FT IPNITDILDKLGRCMYFSTLDLASGFHQIEVNERDIQKTAFNVDSGHYEFV FT RMPFGLKNAPSTFQRVMDNILREHIGKRCLVYMDDIIVFSTSLQEQIQNLK FT SVFESLAKANMKIQLDKSEFLRKEVAFLGHVVTPEGVKPNPEKIATISNWP FT IPKNEKELKGFLGVMGYYRKFIKGFAKIAKPLTSCLRKGESITYTKPFLEA FT FEKCKSILTSSQVLIYPDFEKPFILTTDASNYAIGAVLSQGPVGQDRPIAF FT ASRTLSKTEENYSTIEKELLAMEWGCKYFRPYLFGRQFTLFTDHKPLTYAI FT DLKDKTGRLSRMAVRMKEYEAHVKYRPGNQNVVADGLSRIPSNSRELEVNV FT NDQSDSTSDDATVHSADSDDGDYIPMTELPLNYFKSQIILKIGEQDEEIYE FT EIFPGIHRRTVTRINFGVPFCIRLFREKMHPTRINGILCPESIINTLQVTF FT KNYFSRAKTFKIKLTQSLLQDLKTIEEQELVIKTTHERAHRGIDENHTSII FT RKFYFPQMKKKITSHINLCETCLENKYERNPYKIKFADTPIPKKPLDIVHI FT DIFESRPNLFLSAVDKLSRFAIIIPIKSRSIPDLKKAILKLITSYGTPCML FT VSDNEPALRSTEIRGLLHRLNCETYYTPSDTSQVNGIVEIFHRTLSEIFLC FT MKQKFEDLSDKEIYKLATSLYNTTIHSVTKLKPIEVFFGIKEGEERPLNLE FT TILENRNAIFDELILKLVQQQKKTNDYRNKSREHEPHLDTNESVYNKLQGI FT RNKKRQKYKKQKVRENRRKTFIDCRNIKLHKSKLKRKRKL" FT CDS 5265..6752 FT /product="Gypsy-69_CQ-I_3p" FT /translation="MYYPISIFINLTLIMVFVTTSHLTIRNLNEDPILLTE FT LRNCKLQSGTLKIIHPINLSAIQESVDYLATTFYSSSASKNPLGQIAKQKF FT KILYNNLMQIKPVVKHRQKRWDSLGTAWKWLAGSPDAQDLRIINSTMNELI FT TENNNQVDVNNRFNKKIAEITSSISQIINTLATNEVQNEISILVTITNIDM FT MNKVLEDIQDAIILSKNSIIANRLLNAQELTFIRSLLIEEGVNSDLPDEAL FT QHVTPKFAVNKELLLYILHVPRLFSKTVPIIQVYPLAHKNQIIFGYPEFFI FT KDGSTLYTTNHPDEFVQKLTYLEKIQDTCIYPLIFGTQSRCNATFFNDTTQ FT MLISEKALLIQNSWNGMLQTNCGPDNRTLSGNFVIEFINCSITFNNQTYQN FT EEITRESNIIYGAFHNFNANWTYKQRLDISSINNETLGNRKKLDHVFLEQY FT HLKLQIWTVIGGLSVTYIIILFIVFAVAKILLDHRGSGRSELHEGAVTKQP FT " XX SQ Sequence 6751 BP; 2451 A; 1393 C; 1189 G; 1718 T; 0 other; gattgttaag tggcgcagtc gtggtctgcc cgtggagtgc gcgagtgatt agtgaaattt 60 aacagcggga ccagcaccgg aagaatacta gtgattggag aacgcaaccg tgagaatatc 120 ggacttagat ttcgtgattg cacgccacca atcggagcag agaacctaca gctcacccgg 180 aaagagtgat tagtgaaatt cagcggcggg accagcaccg gcagaatact ggcgaacgga 240 gaacgcaacc gaggggatat cggacttgga tttcgcgatt acacaccacc gatcggagcg 300 aagaacctac aactcacccg gaaggaaccc atctttgaag gaacatcaag gattagcggt 360 gagtgagaaa atataatagt ataataaatt ctttcggaaa tagaagtgaa gccgctttag 420 gagaaagaaa ctgtgtcaaa tttagctgca agattttctt ttttttgctt ttttcaaaaa 480 ttatcatctt ggtcgtgttt aaagaaaaat ttaatcaaaa tcagtgttta acgaaaacag 540 aagaaaagtg agtgaaaaat taatcaaaac gaaatagtgt aacgaaatca gaagaggagc 600 aaaattgtat ttataaaatt aatttttaaa tcgctttttc aaaattattt tctcagtcgt 660 gattttatgc taacacgtaa aatggcggaa gagattcaag ctgctttaga caaacttcgt 720 gaagatttag agcggcaacg attggaaatt gtcgagttac aaggtcgtaa tgaaaacgaa 780 ttattcgggg atacttctga tgaccccgaa gctgcagtag aaatgccaat acctgaaaat 840 aatttcaaag cttcaaaaat tccagatgct attaaaatga tcgtacccta tcatggggac 900 ccaaaaacac ttgctacttg gatccggtct gtagaagaaa aagtaaaatt tgcttctgaa 960 gactgcccat ccgagagaga ccgaaaaagg gcatggccac tatggatcag tgttattcgt 1020 gacaaaattt tggatgaagc aaacaatgtt ttagtggcga accacatccc taccgattgg 1080 gaagccataa aaaccgcgct cattgatcaa ttaggagaca agagagactt gagcactctc 1140 gtcacaaaca tttcatacct ggagcaagga gcaaaagaca ttgtccaatt ttacaaagaa 1200 tgcagagaat tactctctga cattaccgcc aaaatatcta tagatcgaga attggaaagt 1260 tgtgccagaa cactaaccgg aaattatgag aacatgatta tgaactcatt tattgatggg 1320 cttaatgacc catattcaac attaactaga acatcccagc ctaaatcatt gctagaagct 1380 tatcagaata cacttggtca atttaacgca aatcaaagaa aaaaacaacg cttgagaacg 1440 acgacccata gccataaaat tacatctcat aaatatggcc cagttgtcaa atataattca 1500 ttttcccaat ctacaacacc aaacaatcaa caaataccat tacaatcgta ttctcaacca 1560 gggccatcca aacctttttt caaaccatcg tttcctcaat cgaaacccca gcttcttcaa 1620 ataaaagccg acccatcgac ccaaactaga caaaatcggt ggccttatcc acaaagacaa 1680 ggtggcgtaa attatcatga agctcctgaa gaaattcaaa cagaagaaga agtagaacaa 1740 acacctgagg aagtcgaaga tctaaatttt tgtttaacgg acgaattgga gaacgaaacc 1800 taactaataa tttcctcccc tactatgaaa taaaatcaaa caattttgga aaaattcggc 1860 tattaataga cactggtgcg aataaaaatt acattaatcc agatattgta ccgaagcatg 1920 caattgctct aggaacaaac cataaaataa aaaatattag tggaagttac aatgtaagca 1980 attatacacg ctttaaccct ttcgaaaaat tctttaaaaa cttatcacca caaaaattct 2040 ttttatttaa atttcataat ttctttgatg gacttatagg atttgaaacc cttcgatcaa 2100 tcaatgctat aattgataca ggttcgaatt cactaaaaat cggagacacc atcattcccc 2160 tcaataaaaa gctacccgat agcattatct taaacgcaaa cgagaccaac atagtttcgg 2220 ttccaacagt tcaagaaaat ggtgattttt tagtagaaaa tgacctacct atagcaccta 2280 acattttcat actgtctggg gtgtattcat atcaagatag aaaaactaat atcctcgttc 2340 ataatggtaa taccaatgaa cagaacctga acattggtgg gctcctagat ttcggaataa 2400 acaattttga aatgaaaagt ccaatattca aaccaaacaa aaataaactt tttgagcaat 2460 taagactaga tcacctaaac caggaagaga aaaaacaaat ttcgaaattg atccagtcct 2520 tccaagacat tttctttctt gaaggacaaa aacttagttt taccaatgca atcaagcata 2580 agattaacac atcagatgaa cttccggtat atacgaaaac gtaccgatac ccttatttcc 2640 acaaagaaga agtacgtaaa caaatcgact ccatgttaaa tcaaggaatt attcgtccct 2700 caattagccc atggtcgtca cccatttgga ttgtcccaaa aaagattgac gcttcaggcc 2760 aaaagaaatg gcgactagtt gtggattata gaaagctcaa cgaaaaaacc gtaggtgacc 2820 gctatccaat tcctaacatt acggatatac ttgacaagct agggcgatgt atgtattttt 2880 caaccctaga tttggcctct ggatttcacc aaatcgaggt gaacgaaaga gacatacaaa 2940 aaactgcgtt caacgtagat agcggacatt acgagttcgt cagaatgcca tttggcttaa 3000 aaaatgcgcc ttccacgttc caacgtgtta tggataacat tctacgtgaa catattggga 3060 aaagatgtct cgtctatatg gacgacatca tagttttctc aacgagcctt caagaacaaa 3120 ttcagaacct taaaagcgtt tttgagtcgc ttgcgaaagc caatatgaaa attcagctcg 3180 ataagagcga atttcttcgt aaagaggtgg catttctggg acatgtggta acgcctgagg 3240 gtgtaaaacc aaatcccgaa aaaattgcaa ctatttctaa ttggcctata cccaaaaacg 3300 aaaaagaatt aaaaggattt ttgggagtga tgggttacta cagaaagttc atcaaaggtt 3360 ttgctaaaat agcaaaacct ctgacaagct gtctcagaaa aggagaatct attacttaca 3420 ccaagccttt tcttgaagca tttgaaaagt gtaaatcaat actgaccagc agccaggtac 3480 tgatttaccc agatttcgaa aaaccgttta ttcttacgac agacgcgtcg aattacgcta 3540 taggagcagt tttatctcaa ggtccggttg gtcaagatag accgatcgca tttgcttccc 3600 gcacactctc taaaactgaa gaaaactatt ctactattga aaaagaacta cttgctatgg 3660 aatggggttg caaatacttc agaccatatt tattcggcag acaattcaca cttttcacgg 3720 accataaacc actaacgtat gccatcgacc tcaaggataa aactggcagg ctcagccgca 3780 tggcagttcg catgaaagaa tatgaagcac atgtaaaata tcgccctggt aaccaaaatg 3840 tcgttgccga cggactttcc agaattccaa gtaacagtag agaactggaa gtaaatgtta 3900 acgatcagag cgattcaaca tcagatgacg cgacggtaca ttctgcggat tcagacgatg 3960 gagactacat ccccatgact gaattgcctc taaattattt caaatctcag atcattctta 4020 aaatagggga gcaagacgaa gagatttatg aagaaatatt ccccggtata catcgaagaa 4080 ctgttacgag aattaatttt ggagttccct tctgcataag attgtttcga gaaaaaatgc 4140 accccacaag aataaatggt atactatgcc ctgagagtat cataaatact ctgcaggtta 4200 catttaaaaa ttacttttca agggctaaaa cattcaaaat aaagctaaca caatctcttt 4260 tacaggactt aaaaacaata gaggaacaag agttggtaat caagacaacc catgaaaggg 4320 ctcatcgagg cattgatgag aaccatacca gcatcatccg aaaattctat tttcctcaaa 4380 tgaagaaaaa gattacgtct cacataaacc tttgtgaaac atgtcttgaa aacaaatatg 4440 agcgaaaccc atacaagatc aaattcgcag atactcccat cccaaaaaaa cctctcgata 4500 tagttcatat cgatattttt gaatctagac caaacctttt cctatccgcc gtagataaac 4560 tatccagatt tgctatcatc atcccaataa aatcccggtc aataccagat cttaaaaaag 4620 ccattttgaa gcttatcaca tcgtacggaa ctccatgcat gttagttagc gataacgaac 4680 ctgcattgag atctacggag atccgaggcc ttctgcatcg gttaaattgt gaaacttact 4740 acacacccag tgacactagc caagtcaatg ggattgtcga aattttccat agaacacttt 4800 cggaaatatt cctctgcatg aagcagaaat ttgaagatct ctcagataaa gagatctaca 4860 aattagcaac atcattatac aacacaacga ttcattccgt aaccaaactt aaaccgatag 4920 aggttttttt cggaattaaa gaaggggaag agagaccctt gaatttagaa acaatattag 4980 aaaatagaaa cgcaattttt gacgagttga ttttgaaact agtacaacaa caaaagaaaa 5040 ctaacgatta tcgcaataaa agtagagagc atgaaccaca tttagacaca aatgaatcag 5100 tgtacaacaa acttcaaggg attagaaaca aaaaacgcca aaaatacaaa aaacagaaag 5160 ttagagagaa caggagaaaa actttcatag attgcagaaa cataaaactt cataaatcca 5220 aattaaaaag gaaacgaaaa ctctaataat actctttttt gcagatgtac tacccgatct 5280 cgattttcat caacttaact ctcatcatgg tttttgtcac tacttctcat ttgacgatac 5340 gaaatttgaa tgaggatcca attctcttaa cggaactcag aaattgtaaa ttacaaagtg 5400 gcactttgaa aattatacac cccatcaatc tctcagctat ccaagaatca gtggattatt 5460 tggctacaac attttatagc agttcagcta gcaagaaccc tttagggcaa atagctaaac 5520 aaaaatttaa aatattatat aacaacctaa tgcaaatcaa acctgtagta aagcacaggc 5580 aaaagagatg ggacagctta ggtacggcat ggaaatggct cgctggctca ccggacgccc 5640 aggatcttcg cattatcaac agcaccatga acgagctgat tactgaaaac aataaccaag 5700 tcgacgttaa caaccgattc aacaaaaaga ttgctgaaat cacatccagc atctcccaaa 5760 tcatcaatac actggcaaca aacgaagtac agaacgagat ttccatcctc gtcaccatca 5820 ccaatatcga catgatgaac aaggtattgg aggacattca ggacgccata attctatcaa 5880 agaattccat aatcgcgaac agactgttaa acgcccaaga gttaacattt atccgatcct 5940 tactgataga agagggagtc aactcggatc ttccagatga agccctccag catgtgaccc 6000 cgaaattcgc tgtcaacaag gaattgttgc tatacatcct tcacgttccc cgactattca 6060 gcaaaactgt tccaataatt caagtctacc cgttggccca caaaaaccaa atcatctttg 6120 gatacccgga atttttcatc aaagacggtt ccacactata cacaacaaac catccagacg 6180 agtttgttca gaaactaacg tatcttgaga aaatacaaga cacttgcatt tatccgctga 6240 tatttggaac acagtccaga tgtaatgcga catttttcaa tgatacaacc caaatgctta 6300 tctccgaaaa agcacttctg atacaaaatt cctggaacgg catgcttcaa acgaactgcg 6360 gaccagataa tcgaactctt agtggaaact tcgtcattga attcatcaat tgttcaatca 6420 catttaacaa tcaaacatac cagaatgaag agatcacccg tgaaagcaac ataatctacg 6480 gagcattcca caactttaat gctaactgga cttacaagca aagattggac atctcttcga 6540 tcaacaacga aaccttggga aacaggaaaa agttagacca cgtttttctc gaacagtatc 6600 acctcaagct acagatttgg acggtaattg gtggattatc agtaacgtac atcatcattt 6660 tgttcatcgt gttcgcagtt gctaaaattc ttttagacca tcgcggatcg ggacgctccg 6720 aacttcatga gggagcagtt acgaaacaac c 6751 // ID TECTH1 repbase; DNA; INV; 1649 BP. XX AC X17627; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C. thummi DNA transposable element TECth1. XX KW Transposable Element; Repetitive sequence; TECTH1. XX OS Chironomus thummi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RP 1-1649 RA Wobus U.; RT "TECTH1."; RL Direct Submission to Genbank (20-NOV-1989)Wobus U., RL Zentralinstitut fuer Genetik und Kulturpflanzenforschung d.ADW, RL Gatersleben, DDR-4325, Germany.. XX RN [2] RP 849-1649 RA Wobus U., Baeumlein H., Bogachev S.S., Borisevich V.I., Panitz R. RA and Kolesnikov N.N.; RT "A new transposable element in Chironomus thummi."; RL Mol. Gen. Genet 222(2-3), 311-316 (1990). XX DR GenBank; X17627; Positions 889 2537. XX SQ Sequence 1649 BP; 599 A; 233 C; 160 G; 657 T; 0 other; caaggccata gaactccttg cactatgtct acaatcacta cttttttact caactttata 60 aatttttact tatgtttcag tacacttact atacaatgta tcacaatcac tgcaattact 120 aaactttact gcacttaact cttttttact actctttact ttacttaagt acattaacct 180 atactttact acactttcct ttactttact ttactttagt atacttaagt acattttgct 240 taactttact accctttact acactttatg cactttactc gtacttttct gtactttatt 300 tttaatcaaa ttaattagtg cataatttca atctaaatag ttataggcct cttaaacatg 360 tggtccatta atacgttgta atcatacttt caaaaaacaa aaaataaaat aaaagatttt 420 gcaatcatta taaaaattta aaagaaaatg tttaaaaact taactaaagg ttatagaatt 480 taaaatatta tttttttatt tttttattat tctatcattt taaaccatat ttagctgatg 540 gaaaagaaag agcttacaaa aacataaagc actaatgcta ctaaggcaga aatagtttca 600 ttcatgtcaa agaaatttct aattgatacc tttagtttgt aaggaaattt gtcaaaaata 660 aaataaaagt atagtataga aaaatcgaaa aatttaaaaa aaaatctaaa atgttgtgct 720 tttgttttta aaaatgagta attttaattg aaaaaatttt taaaaaatta tgaaaaattt 780 taattcgaat tgactcgaat ttgatttttt cacgtaagct tatattaaaa aaactaatga 840 ggacgatttg tgttaagttt aaatttttaa aaaaagctat aaatacaagc ttcatataaa 900 ttagtttcaa tttatttgca tttcgaaatt gaatgaaggg actgtagaaa aacaatattt 960 taaatttatt ttgtaataat atgttggctt tttatttcaa caagattaac tttccaaaaa 1020 aaacttttta aaaatatata tcaaaagtgt tgataacatt taattttttc taaaaccaac 1080 aatatcaaaa acattgattc tcactattct ttcgttacta ttcttcgtaa ttaggaaggc 1140 atccttaatg ttgaaatatt tgaatttagt tcaatagttt tattttttac atgagaaaag 1200 cttcatttat atttacattt taaatatatt ttatatataa aatttcaaat aataagcggt 1260 tcttaagttg cgataatatt gactatttat aaataaattg aatggtaaaa taaacattac 1320 ttcaaaatta attcaaaatc aactttaaac attttggttt ttaaagtttt aaacagtttt 1380 atgttgctgc actttctata cttttactac acttgctata ctctcactat actgactata 1440 cttttacaac actttactac acttaaacta cacttgctac acttttgcta cactttgcta 1500 cattctacta tgctttggtt aactttgctt cactttacaa tatttaacta tactttgctc 1560 atgttaactt aattttgcgc aatagtttgt atacgttact gcacaatgct cactcacggg 1620 tatactattt tgggagttct atggccttg 1649 // ID BEL-606_AA-LTR repbase; DNA; INV; 390 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-606_AA_; KW Pao_Bel_Ele199; BEL-606_AA-I; BEL-606_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-390 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 390 BP; 129 A; 89 C; 64 G; 106 T; 2 other; tgttggggtc aaacctacgc cactgtaatc cgatcgcagg caaccctgct cgtcgtcaca 60 aacacttatc aattgacaag ccgtcaaacc ttgacagata gcactgtcgt aatatccgct 120 aaccattctt ctcwaaacca ttatttgtac cacacgaaca aggcagagtc agaatttgta 180 gaaagatata gattatatag atcgtgaatt cgaaccgtca tcctgttgct ttaacgttga 240 actaaaacta ggagaccaat tgtaagttaa attgtatkta ttacttgaac tacttactat 300 tataaccatt cttaataaat agttggagct cgcttaacct ataaaacggc gtgctacaga 360 aactccgaaa acgtccgaaa cgttccaaca 390 // ID BEL-197_AA-I repbase; DNA; INV; 6107 BP. XX AC supercont1.1408; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-197_AA_; KW BEL-197_AA-LTR; BEL-197_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6107 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1408; Positions 25245 19139. XX CC Positions [5179-5727] - Integrase core CC 'ACAGA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1757..3832 FT /product="BEL-197_AA-I_2p" FT /translation="MLLSELVSKIPPSIRLDCGLHTQRIPDVTMKAFSDFV FT SSIKAAACQVTMPYDSNSQDDMNRSGRKKEKSGYVNPHSAEVNIRDGASST FT KPQRHHKPCLMCHRGDHKIRNCNQFSSLTITERWNLVSRIVTTHRSKPHGT FT LFKILPVTLSHNGKSVSTFAFLDDGSNVTLLENSIANMLDLDGEESSLCLR FT WTSNVTRKESVSKRVQVGISEGGGKTQHVLSQVQTVDYLSLPRQSLDYAEI FT SSKFPHLRGLPIQSYTEAVPKILIGVDNARLKLLLNKRERRDEEPVAVKTR FT LGWTIFGGRRATDKCAMVHVCDCSVDQRLHELVRDYFEMDQLGIMVTSILE FT SEENRRARRILEDTTRCTESGRFETELLWKHGEVKFPESYAMAERILKCLE FT RRLQSDPELRKNVEAQIDDYQEQGYAHKATDEELRSSDPNRVWYLPLGIVQ FT NPHKPHKVRIVWDAAARVKGVSLSSMLLIGPDLLTPLMSVLCRFRQRQYAI FT TGDIKQMFHQLRNKKEDRQSPRFLWRAKPESTPDVYVMDVATFGTTCSPCS FT AQYAKNLNAAQYEAQYPEAARAITQNTYVDHYLDSRDTVEEASELAVQVRE FT VHSKAGFEIRNWLSNDRKILQRIGVENSQDVCPAVKSFTAEKTTIAARFLG FT MMWEPNEDLFVFTIQFHADLLPLLAGELVPSKRQVLRVVFLTL" FT CDS 5107..6108 FT /product="BEL-197_AA-I_1p" FT /translation="MWCRVYKAMPDTPRMAPLPQPRVIPFVRPFTHTGIDY FT FGPLLVKQRRSNVKRWVAIFTCLTVRAVHLEVVHSLTIQSCKMAIRRFVDK FT RGAPQNIFSDNGTYFSGATRELAADIKSIHLAIAGTFTNAKTEWHFNPPSA FT PHMGGVWERKVRSIKDAFKVLAHREKLDDEGLLAEASMIVNSRPLTFVPLE FT FPGQEVLTPNSFLLMSSSGNNSSLLRIQKVSLRTNWGVMQHLLNQFWQSWI FT KSYLPTIARRTKWFTDVRPLQEGDLVIIVDESVRNGWLRGRITRTYPGVDG FT QIRKVDVHTGSGTFQRPTVKVALLDVLEDSKADDKLGLTGRG" XX SQ Sequence 6107 BP; 1783 A; 1446 C; 1456 G; 1422 T; 0 other; aactcaaaat taacgagtca tgatgaatgc gtcgaatcct aacctcacta tgcaggaagg 60 cataaatagt tccaaaagtt gcaaccgtcc ggatagcgca gaggacatag ttgcctgcga 120 taaatgcaat tgttggtggc attttagctg tgccggtgtg caagcatcca tatgcgaccg 180 cttgtggcgt tgtccaaact gttctacttt ctgtggtacc attatttccg agcactcgaa 240 tgcttacagc gccagactac aattgaagca actcgaagag gccaaagcaa tggaagacaa 300 aattcggaac gaacaagcag aaagagaacg agaattttta gcagcgaaac atcagctcga 360 atcggagatc gaagcaaggc aatcggtggg tggcagcctt agaagccacg aaagtcatcg 420 cagccgacgc agcagagtgg gaacttaggt gacagagcag gattttacta taggcaatcg 480 agcgattgag aatcatccat cggaggctgg caaaacctct accccaatac ttacgggtca 540 ggtaggaacc ggagctacga agaaaattct ccctccatct caactgaacg accaagctca 600 gcaacaagaa aaccccaccg aggaccaaca ctcacgtgtg acccgctcaa atgagcaagc 660 aaatccgctt ccacccaaca caccgctggg cctaccgaag cagttactcc caccaaccaa 720 atccgctgga atactcgagt cgtgctccct tgaacccaac attccgttgt caggaccagt 780 ggacacaata aatccgatga ctaccgcaaa agcggacgca gcccactgcg cagtattgtt 840 gcagcatcct ccagccgacc caaaagatac gatggcgtca tcttatcctg cgtttccagc 900 gccaccttca atgtattcgt taggcgcact ggcgcagcaa cttcctctaa aaacaccatc 960 gaaactgccg caaaggaaca ccacgaagtc actcgctcct cagttgtcaa ctcctatgtt 1020 tccaccaagc gaccctatgc aatggcggaa tccaccgcca cccacggtgg ccccatcgcg 1080 ttcaacgata ccaaagtttg ttgctcccac gcaacctgac ataacggaac gtacagtgaa 1140 ccgaccatgt agaacaagct cctcccttgc ttaatccttc ggctccagta atccgagaag 1200 cttccgatga agaacacaga cccacattac gacacaggat gccgcaacaa ccgacttcac 1260 aaaataacgc acggggcaat ccagcgtacc tatggtacca accaccgatg ccaccctact 1320 ggcagcaaca gcactcggta cagggttcat tcggaggatt gataatggtg gatgtggtct 1380 cgcgggaatt accaaagttt tccggggatc ctttagagtg gctcatgttt ttgagcgcat 1440 ttgtatccac cacggtgatg tgtggaatac aaccggccga aaatctcgct ggttgcagaa 1500 atatttggtc ggaaaggcac gggaaaaagt gcacagcatt ttgactctac cggaggcagt 1560 tcctgaaata atcaaaacac tgcgtgagga atgttcccga ccagaccagc tggtttatgg 1620 tttgatgagt aaaatcagaa atgcttcagc cccgaacgtt aataaattag acacgttagt 1680 cacgttcgag cgtgaggtgc gaaatctcgt cacctacata gaagccgcca aattgtaagc 1740 gcacctgtcc aaccccatgc ttctatcgga gctggtcagc aaaattcctc caagcattcg 1800 attggattgt ggtttacaca cccagcgaat tcctgacgtt acgatgaagg cgttcagcga 1860 tttcgtatca tcgattaagg cagctgcttg tcaagttacc atgccatacg attccaactc 1920 tcaagatgac atgaatcgct ctggaaggaa aaaagagaaa agcgggtacg tcaatccaca 1980 ttcagccgaa gtgaacatca gagatggtgc ctcgtccaca aaaccgcaga ggcatcataa 2040 accatgccta atgtgtcatc gtggcgacca taagattaga aactgtaacc aatttagttc 2100 cctgacgatt actgaacggt ggaatttagt atcacgaatc gtcaccactc accgcagcaa 2160 gccacatggt actcttttta agatactacc cgtgacgttg tcgcataatg gaaaatcggt 2220 ttcgacattc gcctttctag atgatggttc gaacgtcacg cttcttgaaa acagtatagc 2280 gaacatgctc gacttggatg gagaagaaag ttcgctgtgt ctgcgatgga caagcaatgt 2340 cactagaaaa gaatccgtat cgaaaagagt gcaagtcggc atatccgaag gtggcggaaa 2400 aactcaacat gtgttgtcac aagtacaaac tgttgactat ctcagcctgc ctcgacagag 2460 cttggactac gcagaaatta gtagcaaatt tccgcatcta cgtgggctac cgatccagag 2520 ttatacggaa gcagttccaa agattctgat tggtgttgat aatgctcggt tgaagctact 2580 cttgaacaag cgagagagac gagacgaaga gccagtggct gtcaaaacga gattgggttg 2640 gacgatattc ggtggacggc gagcaaccga caaatgcgcg atggtgcacg tttgtgactg 2700 ttctgtggat caacggcttc atgagctggt aagagattat tttgaaatgg atcagctggg 2760 tataatggtc acttcaatac tggaatccga ggaaaacagg cgtgcacgac ggatcctgga 2820 agatactact cggtgtacgg aatcaggtcg gtttgaaacc gaattattat ggaagcacgg 2880 tgaggtgaaa tttcccgaaa gctatgcgat ggcggaacgg attctcaaat gtcttgaacg 2940 tcggctacaa tcagatccag agcttcgaaa gaacgtggaa gctcagattg acgattatca 3000 ggaacaaggg tatgctcaca aggccactga cgaagagctt cgaagctctg atccaaatcg 3060 cgtgtggtat ttaccgctgg ggatcgttca aaatccacat aaaccccata aggtgcgcat 3120 tgtatgggat gcggcagctc gcgtcaaagg agtttctttg agctccatgc tcttaatcgg 3180 accggatttg ctgactccgc ttatgtcagt tctttgtcga ttccgccaaa gacagtatgc 3240 tataacaggc gatataaagc agatgttcca tcagctacgg aataaaaagg aggacagaca 3300 gtcgccgaga tttctgtggc gcgcaaaacc cgagtcaacc ccggatgtgt atgtcatgga 3360 cgtggctaca ttcggaacca cctgttcacc gtgctcagcc caatatgcta aaaatttgaa 3420 tgctgcccag tatgaagcac agtatccgga ggctgcacga gcaattacgc aaaacaccta 3480 cgtggaccac tacttagaca gccgagacac cgttgaagaa gctagtgaat tggcagtaca 3540 agttcgtgag gtccacagta aagccggatt cgaaatccgg aactggctct ccaatgaccg 3600 gaagattctt cagcgaatcg gagttgaaaa tagccaagat gtctgcccag cagtaaaaag 3660 cttcaccgca gaaaagacga cgattgcagc ccgcttcttg ggaatgatgt gggaaccgaa 3720 tgaggaccta ttcgtgttta ccattcaatt tcatgcagat ttgcttcctc tacttgctgg 3780 tgaacttgta ccttctaaac ggcaagtttt gcgtgtggtc tttttgaccc tctaggtctt 3840 attgcttcat ttacggtcca tggaaaaaaa tacgtatcca gaatatctgg agatcaggcg 3900 tcgaatggga tgatcccatt atcaacaagg acttcatgga ctggaaaagg tggataagtt 3960 tattaccaga actagaacga gtgcacattt cgagatgtta cttcccaaac tacgagcgag 4020 aaagctacgg ttcactacag ctacacgttt ttgtggacgc aagtgaactt acgtactgta 4080 gtgtggcata ttttcgcata atcgaccgcg ggcaaccacg atgcgccctg gtagctgcaa 4140 aagctaaagt aactcctctg cgaccacaat caattctccg aaacgagttg aatgcaggcg 4200 taattggggt gcggctaatg aaaagcgtta tggacaatca ctcgctgcga attacaaagc 4260 ggtattttca tactgattcc actgtattac tggcttggct gcgggctgat cttcgtaagt 4320 atcgtccata tgtgcagttt aggactactg aaatcctggc ggagacatca ttagaagaat 4380 ggcgctgggt acccacccgg ctaaacatcg caggcaacca aatggggcag cggtcccagt 4440 tttgatggac ggagcaattg gtaccgcggg ccagatttcc tctggcaacc ggaatgcgaa 4500 tggcctgtaa ggaaaggtat agtcgatgaa ccaatggaag aattgagaac aacgaacgta 4560 caccgagaat cagcgacgac caacgtaatt gatttcagca agttttcacg ttggaaagat 4620 ttggtcaaat ctctagccta tgtataccat ttcgtcaacc gttgttctaa gaaacaacgt 4680 gttccaacga aatcccgcct tgtgacactg gtacgtcaag actacgtagc ggcagaaaat 4740 ggattgtggc gaatagttca agtgaaagaa ttcggagagg agattgcagc attaaaaaat 4800 ttgaagttac ctgcaaaaac aggttgcata atgaagtcca gtccattggc gaaactatct 4860 gcattcctag atgaacataa tgttctgaga atggaaagta ggatcgaccc aaaagccgca 4920 tattacccat acaattttcg aaatccaata attgtacctt aacaacatcg tgcctctgag 4980 ctactgatac tccacttcca tggaagatat ggtcacgcca atgctgagat cgttataaac 5040 aaactgcgac agatctacta tattcctaag atacgctcag tagtaaagaa ggtcatcaaa 5100 cagtgtatgt ggtgtcgtgt atataaagca atgccggata ctccaagaat ggcaccgttg 5160 ccacagccac gtgtaatacc atttgtgcgg ccatttacgc acacaggtat tgactatttc 5220 ggaccacttc tagtgaagca gaggcgtagc aacgttaaaa gatgggttgc aatttttaca 5280 tgccttaccg ttagggcggt gcatcttgaa gtggtgcatt cattgacaat ccagtcctgc 5340 aagatggcca ttcggcgatt tgtcgataaa cggggagcac cgcagaacat tttcagtgac 5400 aatggtacct attttagtgg agccaccaga gaactagctg ctgacataaa atcgattcat 5460 ctggccattg cgggtacctt tactaacgct aaaacagaat ggcatttcaa cccaccatcc 5520 gctccacaca tgggaggtgt gtgggaaaga aaagttcgct ctattaaaga tgccttcaag 5580 gttctagcac atcgtgaaaa gttggacgat gaagggcttc ttgcagaagc ctcaatgatt 5640 gtcaattcac gtcctcttac ctttgtgccg ttggaatttc ccggtcaaga agtattaacg 5700 ccgaatagtt ttctattgat gagctccagt ggcaataaca gcagtcttct gcgtatccag 5760 aaggtttcgt tgcgtaccaa ctggggagta atgcagcatc tgctcaacca attctggcaa 5820 agctggataa aatcctatct ccctaccatc gcaagaagaa caaaatggtt cacagatgta 5880 cgaccacttc aagaaggcga cctcgtgatt atcgtcgacg aaagtgtacg aaatggctgg 5940 cttcgtgggc gaattaccag gacgtatcca ggtgtcgatg gacaaatacg caaggtagac 6000 gtccataccg gctcaggaac atttcaacga ccaacggtta aagtggctct cttagatgtg 6060 ctggaagata gtaaggccga cgacaagtta ggcctcacgg gtcgggg 6107 // ID BEL-5_CQ-LTR repbase; DNA; INV; 322 BP. XX AC AAWU01007782; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_CQ_; KW BEL-5_CQ-I; BEL-5_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-322 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 164-164 (2011). XX DR GenBank; AAWU01007782; Positions 10143 10464. XX SQ Sequence 322 BP; 75 A; 82 C; 81 G; 84 T; 0 other; tgttggctgc ttgacggatc gagctgattt ggcagcactg gtcgagcgcc gcagccagat 60 tgctacgatc ggttttccca ccttgcacaa cctcggcgcg caacggcggc cgttgtgcgc 120 gacgacgaag aaggagaaga ggagagaacg acgctcggcg ccatcttttt tcatcactcg 180 cttacaagtg ctcgcggaaa caacacgttc tttacgcttt agtttaatta gttttaataa 240 agtagtgtta aagtaaagtt tccgtttcga gtgcttcttt tctacatcca cggcctcaag 300 aactgtctgc cgcaaaagtc ca 322 // ID EnSpm-1_NVi repbase; DNA; INV; 4143 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE EnSpm-type family - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4143 RA Bao W. and Jurka J.; RT "EnSpm-type families from Nasonia vitripennis."; RL Repbase Reports 9(4), 762-762 (2009). XX DR [1] (Consensus) XX CC The consensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS join(1560..2417,2378..2587,2539..3561) FT /product="EnSpm-1_NVi_1p" FT /translation="FFIYKKYTFQITTFTEYENDLQEYLHRILLEDKITDS FT SPAPKNPKSLNDMIKDINFFKPLSISADVSQGELLLMVLKYCLSNNTSVSA FT MSNLCRMINTIFQTEVMPNSRYFLDQLFDSMDKVEFHAVCPNCTAYIGQFN FT EIRDVKSCHTCTSELNLENPSNTSFFVLMDPSSQIRDLISIYQDHYDFVIK FT ERVPETNFLNDIYDGKEYRKFVRSLPEQQKTNYISSVFNTDGANKFKCSQF FT SIWPLFLMINELPRQQRMSKLVNCGLWFNKNKPDMSVFWARSLKCTRYECF FT LGTFVEMLNKITKNGITCVIKGEERVLNLHIINCCVDSVARAPTQGIKQFN FT GKCGCSWCYHLPYAIQWKMWLQLVLSSPLLEKSIVPRDHQTTIHEMMLATP FT DNPVNGIKNPSALINLPYFNIISGFVPDYMHCCLEGVAKQITEYHIASMNN FT DDITEIDYKINQITAPHQISKLSRQINLRDKWKAREWENYILYYSAVVLPE FT HLTRNHYYHWLLFVESMYILLQEKISIEELNRADEMLHEFVAKTEQYYGQK FT AMTYNMHQLLHLSQSVLNWGPLWCHFTFSFESGNHVLLKAIKCSKGVTQQI FT IRYIKMNHALTVLQEFAYPRINELVKLFCADILQSHVQNCYKISDVTYVGK FT GDSVDNPLLLNFNLSIDAAAHHKIIKDGCLYESNKREKKGQTIHSYY*" XX SQ Sequence 4143 BP; 1479 A; 565 C; 635 G; 1464 T; 0 other; ccctggtctt ttttgcgtgt aacgagagtg aatataaact cgaaagcctt gtttgtttat 60 ccaccgaaca aagttgtttt tttgtttagt tttatatttt taaatgttgt aaaaaaatgg 120 acatagatat aaacgctacc gagaatttgg cgatgaagcc atttcaaaaa ggagtgctta 180 cagacacaga aatataatat cgccgtctcg caatcacgta gaggttagta tattattatt 240 gaaaaatatt tttgttatca atcaaattat ggaaagttat caataaacag tatgtatttt 300 tttctcatag aaaacacggc aattttcatc tattggatct gaatcaataa atgatataat 360 cgatcaaaat caatcaaata cagaagtaga agactgcgat caggtaataa ttgtatatac 420 atcattttac tttttaataa tcaaattgtg tttacttggt tacaattttt atttcatttt 480 gatacacgag gtatatatca ttcaaggtaa cttctaaaaa tttaatttgt aacaaataat 540 gagttaaagt aaataatatg aacatgttga caaaaattta atacaaacga catttaactc 600 tgtttttaag ttgaggaggt tctatcatga gggcaaatag tttttataat atattttttg 660 catacgctca cttgcctagg gaatatcgat aaaacctttt tcagtgaatg tatttcatat 720 aaaaatattt atgatgcata gaggtaccta ctgttcgttg ttaaaaacaa tacgtttgtc 780 aaatattact ttagaaaaaa attctttaca aaaagtgtgt gttaaatata aaggtattcc 840 caaatgaaaa gttaaaaaat gtgattctct ttgagactcc atatagaatt agaacgaata 900 ttcttattat tatcgtcact gaggtatcca aaagactcaa aaatttgcat aggctgctgt 960 atacgtattt acttgtgtag attttttgtg tagctgtaat attaaggtat taccgttgtc 1020 tctaatcaaa atgagcactt ttttttaatt ctcgtttatt catgatattt tattttacaa 1080 aagaagttgc ataaatgaac atacgccaaa tattgattaa gtattgctat taaatgatta 1140 tatttgtcat aattatcatt aataaaactt ttctaaaaaa ggtcatttat tttttgaggt 1200 taagtgtgac tttttttgct tttaatcaat tacttttaat acacgtatct atctaatcat 1260 agttttcaaa cggtattaat agttaatgac gttcctacct taattaaggc tttacaaaat 1320 gtaaagaaaa gtgtatgaac gtgaatgctt tggaaagata tttaattaga gcccccttaa 1380 aatttatata acttatatta ggtcgactgt aacaaaatat ttgttccagt gtacgtgtct 1440 tgttcattgt tattattatt gttattatta ttcttattat tattattatc atcattataa 1500 ttattaatca tgtggctaac acatttttaa aatttttcta aaattttgaa aacaaataat 1560 tttttattta taaaaaatat acttttcaga ttactacttt cactgaatat gagaatgatt 1620 tgcaagaata tcttcatcgg atactactcg aagataagat tacagattct tctccagcac 1680 ctaagaatcc caaaagctta aatgatatga taaaggatat taactttttt aaacctttat 1740 caatttctgc cgatgtaagc caaggtgaat tattattaat ggttttgaaa tattgcttat 1800 caaacaatac atctgttagt gcaatgtcaa atttatgtag aatgataaat acaatttttc 1860 aaacggaggt tatgccaaat tcacggtatt ttcttgatca attatttgat tctatggata 1920 aagtcgaatt tcacgcagta tgtccaaatt gtactgcata cataggacaa tttaatgaaa 1980 tccgtgatgt aaaaagctgc catacgtgca catcagagtt aaatctcgaa aatcccagta 2040 atacaagttt ttttgtttta atggatcctt catctcagat aagagatttg atcagtattt 2100 atcaggatca ttacgatttt gtgattaaag aaagagtgcc tgaaactaat tttttaaatg 2160 acatttatga tggtaaagag tacagaaaat ttgttcgttc tttaccagaa caacagaaaa 2220 caaattatat tagttccgtt tttaatacag atggtgccaa caaatttaag tgttcgcaat 2280 tttcaatatg gccattgttc cttatgatca atgagttgcc gagacaacag agaatgtcaa 2340 aattagtcaa ttgtggtttg tggttcaata aaaataaacc agatatgagt gttttttggg 2400 cacgttcgtt gaaatgttaa ataaaatcac aaaaaatggt attacatgtg tcataaaagg 2460 agaagagcgt gttttaaacc tgcatattat taattgctgc gtagattcag tagctcgtgc 2520 acctactcaa gggattaagc aattcaatgg aaaatgtggt tgcagttggt gttatcatct 2580 cccttattag aaaagagtat tgttccaaga gaccatcaaa ctaccattca tgagatgatg 2640 cttgctactc ccgacaatcc agtaaatgga atcaagaatc cgtctgctct aataaatttg 2700 ccatatttta atattatttc cggatttgtc cctgactaca tgcattgctg cttggaaggt 2760 gttgcaaaac agattacgga atatcatatt gcatcaatga acaatgatga cataacagaa 2820 atcgattata aaatcaatca gattactgca ccacatcaga tttcgaaatt gagtcgacaa 2880 attaatttga gagataaatg gaaagctcgt gagtgggaga attatatatt atactatagc 2940 gctgtagttt taccagaaca tttaacaaga aatcactatt atcattggct tttgtttgtc 3000 gaaagcatgt atatattact tcaggaaaaa attagtattg aagaattaaa ccgagctgat 3060 gagatgctac atgaatttgt ggcaaaaact gaacaatatt atggacaaaa agcgatgaca 3120 tataacatgc atcaactcct tcatttatct caaagtgtat taaattgggg acctttatgg 3180 tgccatttta ccttctcttt tgaatcaggg aatcacgttt tacttaaagc aattaaatgt 3240 tcaaagggtg ttacgcaaca aataattagg tatataaaaa tgaatcatgc actaacagtt 3300 ttgcaagaat tcgcctatcc acgtataaat gaactcgtaa agttattttg tgctgatatt 3360 cttcaatcac atgttcagaa ctgttataaa atctctgatg ttacttacgt cggtaaaggc 3420 gacagtgtag ataatccatt attacttaat tttaatttat caatcgatgc agcagcacat 3480 cataaaataa taaaagatgg ttgtttgtat gaatcaaata agagggaaaa aaaaggtcaa 3540 acaattcatt cgtactattg aaagatgaca gatttgctcg cttagaacaa tttattgcag 3600 ataaagaaaa taaaatagag cttactatct gcaatattgt aaaaacacga aagtttcata 3660 caaaatatta tcataagtta cactacattg aagaaattga ggaagaaaat tctattgtac 3720 ctactagaga aataaagagt atttgtatat ttttggaaat ttcgagaatt atgtatattt 3780 tgcctatacc aaatttgtta cattattaat gctctatgtt caaataaatt ttaataattt 3840 ttgtattaat ttttactttt attataccat tggatccact acaaaataaa acataatagg 3900 tttttaaatc ttattttaga tattgtttta acactatcga aaccaattat aaaatgctta 3960 gaattaaaaa aaaacatttt tgaccagatc atacattatt atacctcata tatctaaatg 4020 attatcagct gattatcacc tgacgttttg gcatgattat caggtgaaaa tcaggtgatt 4080 atcaggtgaa aatcaggtga ttatcaggtg ataatcaggt gacttttttc agtagggaat 4140 gta 4143 // ID Sola2-N3_AAe repbase; DNA; INV; 2439 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola2-N3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2439 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1305-1305 (2011). XX DR [2] (Consensus) XX CC >96% identical to consensus. 4 bp TSDs. TIRs are ~690 bp long. XX SQ Sequence 2439 BP; 866 A; 398 C; 395 G; 780 T; 0 other; gacggatttc acgccaactc gacaaaggga ttggcagcac cccttcagaa ttcaatgaaa 60 ctttctgggt gtgaagacta tgtgaaacta agatacttta catactttgt tttttcaaaa 120 tcgatctaga ctaacatttg gaaagggtca aagttttttt tttacttttt ttataaaccc 180 gtataactcg aaaacggtaa gacctacaaa aaagtgttgt atgggggact gtcgtgaaat 240 ttcctgacgt tttagaaaaa aatattgaaa aaataaaaac acattttcta cactgaaaaa 300 aaaaagtatt caaaacttta aagtcgattt aaaaaaacgg ccatttcaga ttttgatcat 360 ccttaagcaa aaagtttcgt aattaaattt actaaaagtc gtccatacat tgcaaattgg 420 gcatatctaa gggaaaaaag tttttctaac aacgaatttt tcaagtccaa aactgatttt 480 tttttcaaag attttgttct aaaccgtcaa atatgatcgt gttttcaagt gataaatgag 540 tttgaatcag aaataaattg tattttttat tgagaatagt taaaaaacgg aattatgcag 600 tgattatgtt tttttaaatg aaggcaatct atggatttcg cctctgccta tgagaaaatc 660 aactccgtct aacaatccga cggattttta tggttcgaaa gattgcaaac attgaaaagg 720 aacaacctga tccatacccc attaaatttt cttctagaaa atactactaa acgtacctgc 780 gttacaagcc agatgaataa gaaataatgt ttgtattgtt atctacctaa aatgtccgtt 840 aacgacctga gttactcaac aatcaccgcg tgagtgtata gaaagatagg tatttaactt 900 ataattgtcg ttctacctat gcatataact atttaagctt gatgtcaata aaactctttt 960 cattttatct aagtcatttg tgaaaagaag ttaaagatgg atcaacaaga tttatcgcgc 1020 attaaaattt cgagatgttg cgccggtata aacaatccta agtgcaggcg tagccagttg 1080 cgaaacttgt caccaacaat gatcgaacac atacgtgcta tgggatatag tcatcagctt 1140 gatgaaacta tgcatatttg tgagtcatgt cgcttagtga tcaggctcca cagaagtccg 1200 cctaagaaaa aacgacggca aaggcccctg tgacgctatt ggtggcactc tcaagcgaat 1260 ggcaaaaaga gcaagtcttg caaaagacta tggaaacaca atcgcaactc cgcgagaact 1320 attcgactgg gcagtgaaac aaactgatac atgtatcacc aaattaaatt tctgttatat 1380 atctaatgaa caatatgtta aaatgtcaga ggaattggtg gaattgtttg ataaggttaa 1440 aactgtccct ggtacccaaa aatatcattg ttttatgcct attagtgata cacaaattgc 1500 agccaaacgg tataccaatt cagaggatga accaaaaata ttcaatttat tcagtaaagc 1560 ccaaaaataa tgtaaatatt catgtgcaga tactacgttt taggatatat agcaataata 1620 aaaataatga tgcatgataa aaaaaaccac gacttttttt cataagcata atattgaccc 1680 tttccagaca cctgttaaga ttggctttga ccaaataaga aatgaaatat tattgtgata 1740 tctttccaac cataaaaaaa cccatcggat tgttagacgg agttgatttt ctcataagcg 1800 gtttggtgcg agatcaattg tatttaataa aaaaaactct gtataattcc gttttatttc 1860 ttttaactac tcccaataaa acaaatataa tctattttgt gattaaaact catttatcac 1920 ttgaaaatat gatcatacta gactatttaa agcgaaataa aaaaaaaaat tttggactta 1980 aaaaattcgt tgttagaaaa acttttttcc ctaaaatatg cccaatttgc aatgtatgga 2040 cgacttttag taaatttaat tacgaaactt tttgcttaag gatgatcaaa atctgaaatg 2100 gccgtttttt ttaaatcgac tttaaagttt tgaatcattg ttttttcagt gtagaaaatg 2160 tttttttatt ttttcaatat ttttttctaa aacgtcagga aatttcacga cagtccccca 2220 tacaacactt ttttgtaggt cttaccgttt tcgagttata cgggtttata aaaaaagtaa 2280 aaaaaaactt tgaccctttc caaatgttag tctagatcga ttttgaaaaa acaaagtatg 2340 caaagtatct tagtttcaca tagtcttcac acccagaaag tttcattgaa ttctgaaggg 2400 gtgctgccaa tcgcgtgtcg agttggcgtg aaatccgtc 2439 // ID Copia-23_NVi-I repbase; DNA; INV; 6872 BP. XX AC . XX DT 01-JUL-2009 (Rel. 14.07, Created) DT 01-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; KW internal portion; Copia-23_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6872 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(7), 1514-1514 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1421..6799 FT /product="Copia-23_NVi-I_1p" FT /translation="MFLDNSLIEERLLESVKKEIKRIRAEEVAPAEKVDPS FT QRGNTLSPKLDVNTVTADGGPAYKFDEGKEQKSDFVDNDSWKNKFEANMLG FT KSTPVIKSNQLIQLGLGLNNFRHDXRTSKLGQFSLNNTDKNLEISEITAKL FT DKLEKENTDLKKTIETECNRVHDDVVTELLRHKWAIKENYSWIEEKVQFWA FT EQDRLCDEAQDESAGLSADAENAQNNNQPSNQNISNKVTSNAHVSRDYIAK FT LVQDEVNKIDSNKSIGNIQDKIQIEVTNRDSTFRREYKLTQNLNVNTFLDF FT LNSELTLADLLYVIDSTVKPTRVLDETKLEKDKVRVRDIIINRIDISYYEK FT VSDIRDPIKLLDEIKRIKENELNVTTRSVRKELHSIIYNPHKERASAFWDR FT FDRIVRLYNKLPDTPPLSEVEINDAFYEAIVVHLPQVKETEFLNKNTTGKV FT LNLKQLKDYIVQQESARIGSNGSAGQKAEAGAKAYLVTNPGVLCYNCGNKG FT HYKDECTRKGKMCFRCKRYEGHVRANCPYTENQLEKVLKENESQRRYESSN FT SNRGGRRGGFSRGGKRRNSESQKGSNPKKIKSDRGHARGRSRGKGRQNKQN FT NNNSTKTSESGECTGLYVDSKFLNAVESDKNRLIRFLADSGATEHMTNSKL FT IFKTFDNTKRLDIKCANDNDSAIIKSEGVGNISGYTKNDDFLSLKNVIYAK FT SLSENLLSLRKFVDKGLGIYLDNKXIDIFDPXSKTIFVSGIYEQPYWVIEL FT ETNNSDXNENNNVNKVVAYITTRRREYPTVSVAPNKKSESXIKETTREQDK FT LESTEISKDSKLDEPMQIDNELNVEVNSDNNKLCSSYSNFENTLNDRKICN FT LXESIDLEKLDCESNISKINNLFYTNKAMLWHVRLGHASLKKLKEFQKKFP FT NLKNLKEIKFDDSVMDCEVCIVSKFNKLPFSSTRQRATKPLQIVHADTMGP FT ISPATHPKKYRFISVFIDDYSRLAIAYPMKHKSETGHCLESFIKSSRNLLG FT YDAKFCYLRCDQGTEFTGGYTQEVLDKFEAELKLASPDTPEHNGVAERFNQ FT TIQKLTRALMYDTRLPENMWDLALNAAVYYYNRTPHSSNEMIPPLQMFKPD FT FKLNLEQLKRFGCIAYIKVQRKTGPKFDQLGKRVVLIGYKPTGYLFLKPEE FT GKYYESRDVRFNEKLVFGDKYNKQSIKDWSNPMEDINKETWLVKFDEEDEI FT LISETEGERRRRGRPRKEKGVELPSESHTLESDLNLLESDELNAFIATVEN FT DPNSYREAMSTKNKLDWQGAIKSELDSMEKNKVWTLVDRPVIQGGKRPNII FT DSRWVLKTKVGLNNDNKYKARLVIRGFKDQNHYKLMETYAPVSRLPLIRSV FT LAIINKFDLEVRQLDVKTAFLNGTIDNEIYMEIPEGIDCSPVTRRDKVCKI FT QRALYGLKISPKKWYEKFTEVVIKLGLESHDSEPCLFTWRNNEKYLIMLLY FT VDDILITSNDTHKLDEITSKLKLEFEMSDMGEPKSFLGIEINRDRQNRTMT FT LTLENYIDKMLKRFGYSEMHPQRTPMVTNQVANRERREREESENQLENTLN FT KTNGPYREIVGSLLYLANTVRPDITYAVNVLSRHQINPTDEEWKMVKRVCR FT YLKHTRSLGLKFEGKLDNLQGFSDASFADCKGSITTSGFVIKLYGDTVAWK FT THKQTFVALSTCQAEYVAMSEASQEMVSLQNSLSLILRNSFLPMTLWCDNK FT AAEASTQVSSTSKLRHMTVVREHYVRECVARNLIKISWIASKDQIADIFTK FT ALPFELHAKLTKLLLNYYLKY*" XX SQ Sequence 6872 BP; 2438 A; 1169 C; 1460 G; 1785 T; 20 other; ggttatgggc ccagcacgct tccactgcgc tgcttaaggc tttgctagac ctgtgcgtgt 60 acttggtgaa aatatctcga gtgaagtgct gaaaaatacg ggttttccct ttaagtgaaa 120 agtttgacga tcaaagttcg acgctctgca ttccagcttc gacgtgcacc gctcgacctg 180 tggaggacag gacaagctac aggcatctca gcgtggtttt gctgcggaga cgaccggtga 240 ctcttttggg actacttcag ctcgatttgt gcaacgatgt ccgatcagac aaaaagagta 300 tctgttaata tgcccaagac aaggggtaag tcaccgtctt tctctttggt aaagtgcgcg 360 agtgaaattt tgcgggaaga cggtaaaatc ttccaacgag catgacgcaa aatttgtagc 420 gcggcacgtg aggcaggcga ttcttagaat cagcgcgaag ccggacaacg tgttcgagca 480 tgcccgcgcg cgtgtttcga gcgctcggga caaagacgca gcgaaacaaa atggcgacgc 540 gtggcaacgc gggaaagaaa agccgttggc gcgttcgagc gcgagcgcca tcttactctc 600 aatgcacatg agagcgacga gacgagctag ttactcgtgc gcgcaaccga tggcgctagt 660 ttcgcgactc gaagcttgac tgcagtatac gtttacaaat accgatgctt gatgacacaa 720 tagtgtaaat taaagatgat agttcgattt ttaatttaaa attattcaag tttgagtctt 780 aaactatgtt acggcacgct gatcgaatcg ggaagcagaa acgtttatca aaattaaact 840 caagttcgac caaaattaat taaaattgca attttttaag ttcgactaaa atttcataaa 900 aacagttaac tgatttgctt gaattaaaca gaccagaaaa atctgcagtt gttcttgctg 960 ctccaagacc aggagctggg aatctcggaa aaataccctg atgattctat ttatccagaa 1020 atggacgaag agggaaacat caacatactg acgtcaagtg gtgtaaaata tgagataggt 1080 aaagcttgaa tttttgatat aagctcgaaa attaaaataa aaaaaaactc gtttttccac 1140 aacaaaatat gtacatatta aatctaaata gcatgctgtt cagttccaaa actaaagcgt 1200 tggccagtac atgcagggta gtgtatgttg ttgctgctac actgtgtgtg ctgtaaaatc 1260 acaggtttat ttttttttat ctctttatat aaaaatcagg caattagcac tgcttctttg 1320 attgtcgttc gtcaactaag catatgctta ataaattgct acatgtgata gagtgctaat 1380 tttgattgtt gttcgtcaaa acaagcaact catttataaa atgttcctag ataattctct 1440 aatcgaggaa cgtttactgg aatccgtgaa gaaggaaatc aaaaggattc gagctgaaga 1500 ggtggctcca gctgaaaagg ttgacccaag ccaacgtggg aataccttgt ccccaaagct 1560 ggatgttaat actgtaactg ctgacggtgg gccggcctat aagttcgatg aagggaaaga 1620 acaaaagtcc gattttgtgg ataatgacag ctggaaaaat aagtttgaag caaacatgct 1680 aggtaaatcc accccagtta ttaaatcaaa tcagcttata caactgggtt tgggactgaa 1740 caatttccgt catgattmga ggacatctaa gctcggacag ttttctttga ataatactga 1800 caaaaatctt gaaatttcag aaataacggc taagctggat aagttggaaa aggagaatac 1860 agatctcaaa aagacaatcg aaacggagtg caatagagtg catgacgatg tggtcaccga 1920 actgctcaga cataaatggg caattaaaga aaattacagc tggatagaag aaaaggtaca 1980 gttttgggct gaacaggatc gtctgtgcga tgaagcacaa gacgaatcag ctggactttc 2040 ggcagatgca gaaaatgcac aaaacaacaa tcaaccgtcg aatcagaaca tctcaaataa 2100 agtaacatca aatgctcatg taagtagaga ttatatagca aaactagttc aagatgaagt 2160 taataaaata gattcaaata aaagtattgg aaatatacaa gacaaaattc agatcgaagt 2220 aacaaataga gatagtactt ttcgtagaga atataaatta acacaaaatt tgaatgttaa 2280 taccttttta gactttttaa attcagaatt gactcttgct gatttattgt acgtgattga 2340 ttcgacggtt aagccaacac gtgtactcga tgagacaaag ctggaaaaag ataaagttcg 2400 agttcgcgat attatcataa atagaattga tatatcttat tatgaaaagg tatctgatat 2460 tcgggatcct attaaattgc tggatgaaat caaacgtatt aaagaaaacg aattaaacgt 2520 gactacaaga agtgtaagaa aagagttaca cagtattata tataacccac ataaagagag 2580 agcatctgct ttctgggata ggtttgatag aatagttcga ttatataata aattgcctga 2640 tacgcctcct ttgtctgaag ttgagataaa cgatgcattc tatgaagcca tcgttgtaca 2700 tttacctcaa gttaaagaga ccgagttttt gaataaaaat acaactggca aagttttaaa 2760 tttaaaacaa cttaaggatt atattgtaca acaggaatca gctcgaattg gatcaaacgg 2820 gtcagctgga cagaaagcag aagctggagc aaaagcatac cttgtgacta atcctggggt 2880 gctttgctac aactgtggca acaagggtca ctataaagat gagtgcacaa ggaaaggcaa 2940 gatgtgcttc agatgcaaaa ggtacgaagg gcatgtccgt gcaaactgcc catacacgga 3000 gaatcagctg gaaaaggttt tgaaggagaa tgaaagtcaa agaaggtatg aatcctcaaa 3060 ttctaatcgg ggtggtagac gaggtggctt tagtcgtggt ggaaaacgac gcaattccga 3120 atcacagaag ggtagcaacc ctaagaaaat caagtccgat agaggccatg cccgaggcag 3180 atcacggggg aaaggtagac aaaataaaca aaacaacaat aattctacca aaacctccga 3240 atctggtgag tgcacaggtc tgtatgtaga cagtaagttt cttaacgcgg tagaaagcga 3300 taaaaaccga ctgattagat ttctggcgga ctctggtgct acggagcaca tgactaattc 3360 taaattaatt tttaaaacgt ttgataatac taaaagactc gatattaaat gtgcaaatga 3420 caatgattca gcgattataa aatcggaggg agtaggtaac atatctggtt acaccaaaaa 3480 tgatgacttt ttaagcttga aaaatgtaat ytatgctaag tcattgtcgg aaaatttgct 3540 atcactgagg aaatttgttg ataagggytt aggtatctay ttrgayaaya agaraatyga 3600 tatwtttgat ccrrtgtcaa aaacgatttt tgtctctggr atatatgagc aaccttattg 3660 ggtaatagag ttagaaacaa ataactckga taraaatgaa aataataatg taaataaggt 3720 tgttgcatat ataaccacaa ggcgtcgtga atatccaact gtaagtgttg cgcccaataa 3780 gaagagtgaa tctgmgatya aagagacaac tcgtgaacaa gacaagcttg aaagtactga 3840 aattagtaaa gatagtaagc ttgatgaacc aatgcaaatt gataatgaat taaaygtaga 3900 agtaaattca gacaacaaca agctatgctc tagttattca aatttcgaaa acactttaaa 3960 tgatagaaaa atatgtaatt taratgaatc aattgactta gaaaaattag attgtgaatc 4020 taacatcagt aaaatcaata atttgtttta caccaataaa gccatgctat ggcatgtcag 4080 attgggacac gcttcrttga aaaagttgaa agaatttcag aaaaagtttc caaacttgaa 4140 aaacttaaag gaaattaagt tcgatgattc tgtgatggat tgcgaagttt gtattgtgtc 4200 aaaatttaat aagttaccct tcagttcgac tagacagaga gcgactaaac ctctgcaaat 4260 tgttcacgca gacacaatgg gaccgattag tccagcaaca catcccaaga aatacagatt 4320 catatctgta tttatcgatg attattcacg acttgcaata gcgtatccaa tgaaacacaa 4380 gagtgaaacg ggtcactgcc ttgaatcttt cattaaaagc tcgagaaatt tattaggata 4440 tgacgctaaa ttctgttatc ttagatgtga tcaaggtaca gaattcacag gaggatatac 4500 acaagaagtt ctcgataagt ttgaagcaga gctaaagtta gctagcccag atacacctga 4560 gcataacgga gtggctgaac gatttaatca gaccatccaa aaattaacga gagctttaat 4620 gtatgataca cgcctacctg agaatatgtg ggaccttgcc ctgaatgcag cagtctatta 4680 ctataataga acgcctcaca gctcgaacga aatgatacca cccttgcaaa tgtttaaacc 4740 agattttaaa ctcaatctgg aacaattaaa gcgttttggt tgtatcgctt atatcaaagt 4800 acaaagaaag acaggaccta agttcgatca gctgggaaaa agagtagtac taataggcta 4860 caaaccaact ggatatctat tcttgaaacc agaagaaggt aaatactacg aaagcagaga 4920 tgttcgattc aacgaaaagc ttgtatttgg agacaaatac aataaacaga gcatcaaaga 4980 ttggtctaat ccaatggaag atataaataa ggagacgtgg cttgttaagt ttgatgaaga 5040 ggatgaaatt ttgatctctg aaacggaggg agaaagacgg cgaagaggtc gcccaagaaa 5100 agaaaaagga gtcgaactcc catcagaatc acatacactt gaatccgact taaacctgct 5160 ggaaagcgat gaactgaatg cgttcattgc aactgtagaa aacgatccaa attcttacag 5220 agaagccatg agtactaaaa ataagcttga ttggcaagga gccattaaaa gcgaattaga 5280 ctcaatggag aagaataaag tctggacact tgtcgacaga cctgtgattc aaggaggaaa 5340 aagacctaat ataattgact caagatgggt tctaaaaaca aaagttggat tgaataatga 5400 caacaagtat aaagctcgac ttgtgataag gggtttcaaa gatcaaaacc attataaatt 5460 aatggaaaca tatgcacctg tttctcggct acctctgatc agatcagtgc tagcaatcat 5520 aaataagttt gatctcgaag taagacagct tgatgttaaa acagcttttc tgaatggaac 5580 aatcgacaac gaaatttata tggaaatacc tgagggtatt gactgctcac cagtaaccag 5640 acgagataaa gtctgcaaaa ttcaacgtgc tttgtacggt ttgaaaataa gtccgaaaaa 5700 gtggtatgaa aaattcacag aagttgtaat aaaactagga ttagagtctc acgattcaga 5760 gccgtgtctc tttacttggc gaaacaacga gaaatatctg ataatgttgt tgtatgttga 5820 tgacatactg ataaccagta atgacacaca taagctagat gaaattacgt caaaacttaa 5880 gttggaattt gaaatgtctg acatgggaga gcctaagagc tttcttggca ttgaaataaa 5940 cagagacagg caaaatcgaa cgatgactct gactctagaa aattacatag acaaaatgtt 6000 aaaacgtttt ggatatagtg aaatgcatcc tcaaagaacg ccaatggtga caaaccaagt 6060 agcaaatcgt gaaaggcgag aaagagagga aagcgaaaat cagctcgaaa atacactgaa 6120 caaaacaaac ggtccttata gagaaatagt tggttctctt ttgtatcttg ccaacactgt 6180 acgtcccgac ataacgtatg cggtgaacgt gttaagtaga caccaaatta atcccactga 6240 tgaggagtgg aaaatggtga aacgagtttg tcggtatctg aaacacacca gaagtctggg 6300 tcttaagttt gaaggaaagc tcgataactt gcaaggtttt tcagatgcca gttttgctga 6360 ctgcaaaggc tcgattacaa cgagtggatt tgtaatcaaa ctgtacggtg acactgtagc 6420 gtggaaaacc cacaaacaga ccttcgtagc tttatcgacc tgtcaagctg agtatgtagc 6480 aatgagcgaa gctagtcagg agatggtatc tctacaaaac tcgttaagct tgattctgag 6540 aaactccttc ttgcccatga ctctttggtg cgataataaa gctgctgagg ctagtactca 6600 agtaagcagt acaagcaaac tcagacatat gactgtagtt agagaacact atgttcgaga 6660 gtgcgtagcc agaaatctaa tcaaaatcag ttggatcgca tcaaaagatc aaatagcaga 6720 tatattcaca aaggcgttac cttttgaatt acatgctaaa ctaactaaat tgcttttaaa 6780 ctattattta aaatattaac atttgtttgc ttcttttcag acgctaaacc gtaatgctgg 6840 acgacgtcgt tagaaatctg ctcgggaggg ag 6872 // ID Waldo-5_AAe repbase; DNA; INV; 6404 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Waldo non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; Waldo-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6404 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1462-1462 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. Both sides are (AC)n microsatellites. XX FH Key Location/Qualifiers FT CDS 833..2290 FT /product="Waldo-5_AAe_1p" FT /translation="MGSQRRLCTTDEDSGSEGGPTKMESIDDTKAVGEDAN FT AFARSGRVARSPVTPLDTVASTSQVHSEPQATQQERTLLNRVFTPNSSSST FT THNESRLEKSSLADIRRKVNELYEFVKERNNVHLKIKHMVTSIRSAVSAAE FT REQNALRMRAETAEKALKEAAERPTTDVQETPRSHQNTRSEKRFRESPGEQ FT EDAKKHKNEKKGVNNQRDAGSEGEWRIVGSQQKKKKNRKVKEGKKPEQKKK FT ETRQPPRLRYKGDALVVEANDKTTYAALLKRVKEDPELKELGENVVKTRRT FT QKGDMIFVLKKDPSVKSTAYKELIAKSLGDEANVRALSQEAVVECRDLDEI FT TTEEDLKWALAEQCNLEGQMSMRLRKSYGGTQTAAIRLPVDAANKLVALGT FT IKVGWSVCPLRLVPRVAKQMERCFKCRGFGHQSRNCKGPDRSDLCWNCGGN FT GHVARDCTKRTRCLLCMPEDGNDHPTGGFKCPAYKKAKAGQ" FT CDS 2296..5241 FT /product="Waldo-5_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEITQVNLNHCDTAQQLLWQFTTETKCDVAIIAEPYR FT VPLENGNWVADRAGMVAIQVMGRFPIQEVVDSSHDGFVIAKVNGVFVCSCY FT APPRWTDEEYNSMLDLLTDTLVGRTPVVIGGDFNAWAVEWGSRLTNARGYS FT LLEALAKLDVRLCNEGAVSTFRKDGRESIIDVTFCSPLLLDDMNWRVSEEY FT SHSDHQAILYSIGRRISPAPMRMRTYERKWKTKDFDKEVFIEALRTYSRAN FT LDAEELTEVVATACDASMPRKLEPRNRRRPAYWWNETLSTLRAACFRARRR FT VQRARTEVEREERRIAFREARAAFKQEIKRSKSSCYKELCREAEENPWGNA FT YRIVMAKFKGPATPVETCPDKLRIIVEGLFPHHDPTVWPPTPYEDEEEEPG FT GVRISNEELLAVAKGLKVKKAPGPDGIPNVALKTAILAFPDLFRTTLQKLL FT DEGCFPDSWKVQKLVLVPKPGKPPGDPASYRPICLLDTLGKLLERIILNRL FT TQCTESESGLSKRQFGFRKGVSTVDAIRTVVQNAEKASKQRRRGNRFCAVV FT TIDVKNAFNSASWEAIALALHRMRVPGYLCSMLKSYFQNRVLVYETELGQK FT SIRITAGVPQGSILGPTLWNAMYDGVLMLVLPNGVEIVGFADDVVLTVTGE FT TLEEVKMLTTEAIDTIEAWMTGAKLQLAHHKTELVLVSNLKAVQRVEINIG FT RQDISSKRALKHLGVMVDDRLNFNAHVDYACEKASKAVNTIARIMPNVGGP FT RSSKRRLLASVAVSILRYGVPVWAAAVKTKRNRRMLNSTFRLMAIRVASAY FT RTISSEAVCVIAGMMPICITLAEDVECYERRGTRNVRRIVRTDSLAKWQQE FT WNNAEKGRWTYRLIPNVSLWVNRKHGEVNFHLTQFLSGHGCFRKYLHRFGH FT ASSPCCPDCENVEETPEHVVFECPRFEEVRRDMRGVTVDNVVEEMCREESI FT WNAVDSTVSRIMSELQRKWRSDQRSAS" XX SQ Sequence 6404 BP; 1790 A; 1435 C; 1932 G; 1247 T; 0 other; gggttgcaaa atggtgagtc gacaactagg aaggagcgac caacatagct ctggtcctca 60 caagccccta cctcacgctt ccacgggtct aacgatgaca aagaccgcca gctaagggtt 120 gcgtacttag ctggtagtgc aacctgggca ctgttgtcct tctgacatca gctagagtga 180 gggggtgcca ggtgggagct tgggattctt accttttcca agcgaaacta gtacatacag 240 ccgtatggaa taccaccttt agtacttccc ttgggaagtg gccgagcgtc tggtccttct 300 gactaggggc tcaagcatgg tctacctagg atgtggcggg ggttcatcag tgggctctgg 360 tgaatctcta caaaaaacca cagatctgca agtagccctg aacaagcgac ctggtaccgc 420 tttcaaagtg ccttagccca tcatctggag tgccaaccgg cacattagga tcgacgccag 480 taagatctta attacggcat actggcggca tggactgagg aactggaaca aggaacacga 540 catatctacg ctacagagct gacagccgca ctattctact cccttagtaa ggatgggtga 600 cacttccctg aaacggcggg tgggccaata gtgcggttgc ccccgtcccg ttaaaaccgt 660 ggcaggcctt caagtacgtt cgacgtctgt ccatcaagcg tgatgagtac ataggttcgg 720 atgtagcatc cccgatctac gccgatctgg cactgtaccc ggcccccata agggtccgtg 780 tcaacccacg catggcctca ctgcctacaa cgctcatagg tatgagataa ccatgggatc 840 gcaaaggcga ctatgtacga cggatgaaga tagcggctcg gaggggggac ccactaaaat 900 ggagagtatt gatgacacca aggcagttgg agaagacgcg aacgcgttcg caagaagtgg 960 tagggttgcg agatcaccag ttacaccact ggacaccgta gcgagtacta gccaagtaca 1020 tagtgagcca caagcaacac agcaggagag gacgttgcta aaccgcgtgt tcacacccaa 1080 ttcgagtagc agcactaccc acaacgagtc gcgactggag aagtcgagtt tggcagacat 1140 taggaggaag gttaacgagc tctacgagtt cgtgaaggag agaaataatg tgcacttgaa 1200 aattaagcac atggtgacga gcattaggtc cgccgtatca gcagctgagc gtgagcagaa 1260 tgcgctcaga atgcgagcgg aaacggctga gaaagctctt aaggaagcag cggaacgacc 1320 tacgacggat gttcaggaga cgccgagaag tcaccaaaat acccgctcgg agaaaaggtt 1380 cagggaatca cctggagaac aagaggatgc aaagaagcac aaaaacgaga agaaaggcgt 1440 caacaaccag agagatgctg gaagtgaagg tgaatggcgt atcgtcggaa gtcagcagaa 1500 gaagaagaaa aaccggaaag tgaaggaagg aaagaagccg gaacagaaga agaaagaaac 1560 acgtcaacct cctaggctga ggtacaaggg tgatgctctg gtcgtggaag cgaacgacaa 1620 aacaacgtac gctgcgctcc tcaaaagagt gaaggaagat ccggagctaa aggaactggg 1680 cgaaaacgtt gtcaaaacgc ggcgcacgca gaagggcgac atgatcttcg ttctgaagaa 1740 agacccctct gtcaagagca cggcgtacaa ggagctcatc gccaaatcct tgggcgacga 1800 ggccaacgta agagctcttt cgcaggaggc agtggtcgag tgcagagatc tggatgagat 1860 cacgacggaa gaagacctga agtgggcact ggccgagcag tgtaacctag aaggacagat 1920 gtctatgcgg ttaaggaagt cgtatggagg cacacagacg gcggcaatcc ggttgccagt 1980 ggatgctgcc aacaaactgg tggcgctagg cacgatcaaa gttggttggt cggtgtgccc 2040 gctgagattg gtacctagag tcgccaagca aatggagaga tgcttcaaat gcaggggatt 2100 cggccatcag tcgagaaact gtaagggccc ggacaggtcc gacctatgct ggaattgcgg 2160 cggaaatggt cacgttgctc gggactgtac aaagcggact aggtgcctgc tatgcatgcc 2220 ggaggacgga aacgatcatc cgacgggagg cttcaaatgc ccggcgtata aaaaagcgaa 2280 ggcgggccaa taaggatgga gatcacgcaa gtgaatctca atcattgcga cactgcacag 2340 caactgttgt ggcagttcac gacagaaacc aagtgcgatg tagcaataat tgcggagcca 2400 tatcgggttc ccctcgaaaa cggtaattgg gtggctgata gagcaggtat ggtggcaata 2460 caggtgatgg gcaggttccc cattcaagaa gtagtcgata gctcacacga cggctttgtt 2520 attgccaaag tcaacggagt cttcgtatgc agctgttatg cgcctccgag atggacagac 2580 gaagagtaca atagtatgtt ggatttgctc acagatacgc tggtgggaag aacgccggta 2640 gtcataggag gagatttcaa cgcctgggcc gtagagtggg gcagcagatt gaccaacgcc 2700 agggggtaca gcttactcga agctctggcg aagctagatg tgaggctgtg caacgaaggg 2760 gctgtcagta cattccgtaa agacggtcgg gagtccatca tcgatgttac gttctgtagc 2820 ccattgttgt tggatgacat gaattggagg gtgagtgagg aatattccca cagtgatcat 2880 caggcgattc tctacagcat cggccggcga atctccccag cgccgatgag aatgaggaca 2940 tacgagcgga agtggaagac aaaggacttc gacaaggagg tgtttattga ggcgcttcga 3000 acatacagca gggcgaacct tgacgcagaa gagttgacag aagtggtagc aacggcttgt 3060 gatgcatcga tgccgagaaa gttggagcct aggaaccgaa ggcgcccagc gtactggtgg 3120 aatgagacgc ttagtaccct ccgtgctgca tgcttccgag ccagaagacg cgtccagaga 3180 gcaagaaccg aagtagagag agaggaacgg agaatcgctt tccgtgaagc tagagctgct 3240 ttcaaacaag agattaagcg gagcaagtcc agctgttaca aggagctatg tcgagaagcc 3300 gaagaaaacc cttggggcaa cgcctaccgt attgtgatgg cgaagtttaa gggtccggcg 3360 acaccagtcg aaacatgtcc agacaagttg aggattatcg tggaaggtct gtttccgcat 3420 catgacccaa cagtgtggcc gcctacgcca tacgaagacg aggaagaaga acccggaggt 3480 gtgcgaatct ctaatgaaga gctactagca gtggcgaagg gtctgaaggt gaagaaagct 3540 cccggtccgg atgggatccc caatgtagca ctaaaaaccg cgatattggc gttcccggat 3600 ttgttcagga cgacgctgca gaaactcttg gatgaaggct gtttcccaga cagttggaag 3660 gttcagaagc tggtgttggt gccaaaacca gggaagccac caggggaccc agcatcgtat 3720 agacctatat gtttgctgga cacacttggt aagctcctgg aaaggatcat ccttaacagg 3780 ctgacacaat gcacagagag cgagagtggc ttatcgaaga gacaattcgg attccggaaa 3840 ggggtctcga cggtggatgc aatccggaca gttgtacaga atgcggagaa ggcatccaaa 3900 cagaggagga gaggcaatcg gttctgcgcc gtagtgacga ttgacgtcaa aaatgctttc 3960 aacagcgcca gctgggaggc tatagcctta gcactgcata gaatgcgagt tcctggctac 4020 ttgtgcagta tgctaaaaag ctatttccaa aacagagtcc tggtatatga aacggagttg 4080 ggtcaaaagt cgatcaggat cacggcagga gtgccacaag gctcaatcct tggtccaacg 4140 ctttggaacg cgatgtacga tggagtatta atgctggtgc tacctaacgg agtggaaatc 4200 gttggattcg cagatgatgt cgttctaacg gtgactggcg agacgttgga agaggtcaaa 4260 atgctgacga cggaagcgat tgacacgatt gaagcgtgga tgactggagc aaaactgcag 4320 ctggctcatc ataagacgga gctagtgctg gttagcaacc tcaaagctgt tcagagggtt 4380 gaaatcaaca tcggtaggca agatatttca tcaaaacgtg ctttgaagca tctgggggtg 4440 atggtagatg accgtctaaa cttcaacgct cacgtggact acgcctgcga aaaggcgtcg 4500 aaggcggtca acacgatagc gaggatcatg ccgaacgtcg gaggaccgag aagtagtaag 4560 aggcgtctcc tagcgagtgt agcggtctct attctcaggt atggagtccc agtctgggcc 4620 gcagcggtaa agaccaagcg taatcggagg atgttgaaca gcacattccg actcatggcc 4680 atacgagtag cgagcgccta taggacgatt tcatcggagg cagtatgcgt tatcgctggc 4740 atgatgccca tctgcatcac cctggccgag gacgtggagt gttatgagcg aagaggcacc 4800 agaaatgtga gaaggatcgt cagaacggac tcgttggcta agtggcagca agagtggaac 4860 aacgcggaga agggaaggtg gacttaccgt ctaattccaa atgtgtcact gtgggttaat 4920 aggaagcatg gagaagtgaa cttccatctg acgcagtttc tgtccggaca cggctgtttc 4980 cggaagtatc tgcatcggtt cggacatgct tcttcacctt gttgcccgga ttgcgagaat 5040 gtggaggaaa caccggagca cgtagtcttc gaatgtccta ggttcgaaga agtgaggagg 5100 gacatgcgag gagtcactgt cgataacgtt gtcgaagaga tgtgccgaga agaaagcatt 5160 tggaacgctg tcgatagcac ggtatcgaga ataatgtccg agcttcagag gaagtggcgt 5220 agcgaccaac gatctgctag ctagagtgaa cgcacgaaag ggaagaatgg ctgctcggtt 5280 ccgggagata ttcctttgcc ggggaactct ccgccggagt aggctagatc caccgtcggg 5340 gactagctga gtagacgcga cgtagcaccg gtcccgggtc gtaggagcac cagtgaaccg 5400 gaagttagcc ttcaccggaa tcgctggact gactccggca ccctaccggt cggcccgtaa 5460 aaaagtgaag agcagcagta gcagcagcag aaggagaatc ggtctcggga gaacttccat 5520 cgccggggaa ttcttcgtcg gtgtaggcta ggtccaccgc cggggactag ttgagtagac 5580 gtgtcatagc gccggtttta gggtcgtcgg ggcaccagtg aaccggaagc taggctccac 5640 cggaatcgct ggactgactt cggcaccctt ccggtcggct cgaaacgtag aaaaagaaga 5700 atcggtctcg ggagaatttt caccgccggg gaattcttcg tcggtgtaga ctaggtccac 5760 cgtcggggac tagtcgagta gagtgcgtcg ggacgctggt aatgggtagt cgaggcgcct 5820 acgaaccgga agttatgctc aaccggaatc gttggaccga cctcggcatc ctaccagctg 5880 aacaaagaaa aggagaaggt cgtcgtaact gtgtgagtac gcgccctgaa tgagtgcaga 5940 gtagtgtaca aactggagcc gaagggctca gaatgttgca tgtaggtact aacaaattga 6000 cgggtcgcca atgggcgaaa ataggtacta acaaattact aacgtgagga gccgaagggc 6060 tcagcatgag gcacaacgtt gaacggtttt aaaactgcta acaggagccg aagggctcag 6120 gagccaaagg gctcaggagc cgaagggctc aggagccgaa gggctcagca tgtagagtaa 6180 caaattgaag aggtgcgctc agcacgttag cctcccctcc gaagtaatac cggaaggtgg 6240 ttccggaggg ttatggtact aaacctagga gagtgttttt agtggggaaa gcactttgtg 6300 gatcccacac cgcgccaaaa attacactgg catgagcatg aacatataca ggccagtcta 6360 tgaagatttt taaactccta gttgcatgaa aaaaaaaaaa aaaa 6404 // ID Gypsy-25_OD-LTR repbase; DNA; INV; 183 BP. XX AC CABV01002755; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_OD_; KW Gypsy-25_OD-I; Gypsy-25_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-183 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002755; Positions 5427 5245. XX SQ Sequence 183 BP; 54 A; 50 C; 35 G; 44 T; 0 other; tgtgaggtta ggaaatcccc tacgttgcac taaagtgcgc cctcttgcgc tctgctcacc 60 tgtgcactga tatagccctg tacacgaaac ctgaactcac tctcttgaaa cactctctcg 120 aaataaacgc aatagaaagg agtccttgag tattaatcct tgggagagaa aacccaaacc 180 tca 183 // ID Gypsy17-LTR_Dya repbase; DNA; INV; 199 BP. XX AC chrU; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy17_Dya; KW Gypsy17-I_Dya; Gypsy17-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1107-1107 (2009). XX DR Genome; chrU; Positions 6458364 6458166. XX SQ Sequence 199 BP; 74 A; 47 C; 19 G; 59 T; 0 other; tgttatatat gttaattcaa atatacatct caccactctg ttgaaattac tctacttatc 60 ctcttaatat ctacgatata caacactaac cctagtatac caaaaatcat tcctgatcag 120 caaagcaaaa taaaaaagtc agttgttgtc tcaccctaaa gcagaacgcg tcgtttctaa 180 tatgcacaca tacacaaca 199 // ID BEL-37_AA-I repbase; DNA; INV; 5913 BP. XX AC supercont1.244; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-37_AA_; KW BEL-37_AA-LTR; BEL-37_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5913 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.244; Positions 1602611 1596699. XX CC Positions [4943-5503] - Integrase core CC 'ACCTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 38..5911 FT /product="BEL-37_AA-I_1p" FT /translation="MNDLANTTAHNCQSCERPDTAEDEMVQCHLCQLWEHF FT SCAGVDERVKASNIQFACKSCSDKQKASTSAANKKNLKSMTSSAKGSKAGS FT RRSKQLPNIPGSATSSSRALLEEQLKMIDEERRLREQELEEQTELKRREME FT ESKRQLQEKKKIAEEEKLLREQQLKEESAIKALQQKIRRESLEKRNEIIRQ FT LAQTSSRGGSIPDSNEKVQGWLAEQVKVSVMPSRSVPCKSIVPEQEQQPDC FT SSAVSEVASALQNLLNVPNYPSSCHLPLPAPRDNMTPPLNQISPDNGLHGP FT SLRVSGLSQQQIAARQVLGKELPLFNGNPEDWPIFISCYEQSTATCGYSEA FT ENLIRLQRCLKGHALESVKSRLLLPSSVQHVIQTLRTLYGRPELLIRSLMN FT KIHHVPAPRHDRLETLMQFGLSVQNLVDHLKAANQQNHLSNPVLMQELVEK FT LPGTMRLDWAVFKNKHQPATLETFGEFMSGLVTAASEVTFDLPAFENTPRV FT EKRKPRDTGILHTHLSETESSRQWSSGPSATPTTSYRASKMCAACGRDGHR FT VNECTQFKAASVDERWKLVQQKGLCRTCLNSHGKWPCRSWSGCDIDGCRQK FT HHTLLHSSSPTLHDMNISASHVSAGEYNWPLFRVVPVVLYGNERSQITFAF FT IDEGSSYTLLEESVAKQLDVSGPTDPLTLQWTGNVTRVESKSQLVNLKISG FT KNGSCIHDLDHVHTVRHLVLPSQTLKYKDLAHRFPHLRGLPLEDYELVQPQ FT LLIGLDNLRLCVPLKLREGGPKDPIGAKCRLGWSIYGCIPGQPKAPNIVNF FT HGAAADPDHEMNEQLRDYFTLESVGVSACYVGEVQESADEIRAKRLLKETT FT RRSVSGTSFETGLLWNTDSPNFPDSFPMAIRRMLALERRFEREPELGENVR FT SQIASYEQKGYAHKAKLEELTSVDSNRVWYLPLGIVTNPKKPGKVRLIWDA FT AAKVGGVSFNSRLLKGPDLLTPLPQVLCHFRQFPVAVCGDIMEMFHQIKIR FT FPDCQSQRFVYRNKPTDHPQIYVMDVATFGSKSSPASAQYVKNLNAKEFEE FT EFPRAALAIENSHYVDDYLDSFQTINEAIEVVNEVKLVHSRGGFTLRHFLS FT NESDVLHGVGEFEEETVKNLALERGDKSESVLGMKWLPKEDVFVYSFALRD FT DLAQILEERHIPSKREVLKVVMSLFDPLGFISFFLIHGKILIQDIWSRGTE FT WDESIAQDLYTRWRQWTGLFSELDSVRIPRCYFRHPFPKNLDNLQVHVFVD FT ASEAAYSCVAYFRLEYEGVVQVAFVSSKTKVAPLKTISIPRLELKAAVLGT FT RMLNSIKKQHSYPIAQQFMWSDAGVCLAWIRSKNHRQYHQFVSVRVGEILM FT STEPQDWRWVPSRINVADSATKWKSGPQLSMENPWFHGVQFLHESEERWPK FT ERSLSDTNEELRRTQLQLGHFSPIIDISRFSSWVRLHRATAYVLRYIDNLR FT RKRDGQQLELGILNQYELQHSEQALWKMAQGEAFPDEISLLTKSKGPPEGR FT HCIVEKSSPIYKTWPFLDNDGILRMRGRIGAAPHAPTEAKFPAILPKQHLI FT TFLIVDWYHRRFRHANRETIVNEIRQRFEIANLRSLVQKVAKKCIWCRVTK FT AAPKPPVMAPLPEMRLKAFVRPFTFVGLDYFGPVLVKVGRSNAKRWVALFT FT CLTTRAVHMEAVHSLSTESCIMAVRRFVSRRGPPREFWTDNATCFQGANNE FT LKSEIKAKTKALALTFTSAQTSWNFIPPATPHMGGAWERLVRSVKVAIGAI FT LDAPRKPDDETLETILYEAEAMVNSRPLTYIPLESADEEALTPNHFLLGSS FT TGEKILPTEPVEQRAVLRSSWKLAQYISDQFWKRWLKEYLPVITRRSKWFD FT EAKDLAVGDLVLVAGDTARNQWIRGRIEEVLPGRDGRVRQALVRTSSGTLR FT RSATKIAVLDVVESSAPGSRLDSVSNPCQVHGRG" XX SQ Sequence 5913 BP; 1649 A; 1441 C; 1473 G; 1350 T; 0 other; aacttcaaga tttttatggt cttggggaag cgtcaatatg aacgatctag cgaataccac 60 ggctcacaat tgccaatcct gcgaacgacc ggatacagcg gaggacgaaa tggtccagtg 120 tcacctttgc cagttgtggg aacatttcag ttgtgcaggt gtcgacgagc gagttaaagc 180 atccaatatc caattcgcct gtaaaagctg tagcgataag cagaaagcat cgacgtcggc 240 agcaaacaag aagaatctta agtcgatgac cagttccgcg aaaggatcga aagccggatc 300 taggagaagc aaacaactgc cgaacattcc cggaagtgcg acctcgagca gtcgggcgct 360 ccttgaagag caactgaaaa tgatcgatga agaacgacgt cttcgagaac aggagctgga 420 agagcagacg gagcttaaac ggcgtgaaat ggaagaatcg aaacgacaac tgcaggagaa 480 gaagaagatc gctgaggaag aaaaactgct tcgcgagcaa caactgaagg aagaatccgc 540 tattaaggca ttgcagcaga agatcagacg tgaatctcta gaaaagcgga atgaaatcat 600 tcggcagctt gctcagacaa gtagcagagg agggtcaatt ccagactcaa atgagaaggt 660 ccagggatgg ttggctgaac aggtgaaagt gagtgtaatg ccatcaagaa gtgttccttg 720 caagtcaatc gttccggagc aagaacaaca gccagactgt tcgtctgcgg tatcggaagt 780 agcatcagcg ttgcaaaatt tactgaacgt tcccaattat cccagcagct gccatctacc 840 tcttcctgca ccacgcgata acatgactcc gccacttaat caaatttctc cagacaatgg 900 gcttcatgga ccaagtctaa gggtaagcgg cctttcccag cagcaaatcg cagcacgaca 960 agtcttggga aaagaactgc cccttttcaa cgggaatccc gaagactggc ccatatttat 1020 tagttgctac gaacagtcaa cagccacctg cggctattca gaggcggaaa atctcatccg 1080 tcttcagcgg tgcctaaagg gacacgcttt agaatcagtg aagagccgat tactcctacc 1140 atcgagcgtg cagcacgtca tccagacttt gcgtacgttg tacggaaggc cagaacttct 1200 aattcggtct ctgatgaata aaattcacca tgttccagca ccaagacacg acaggctgga 1260 gacgctgatg cagtttggac tctcggttca gaacctcgtg gaccatttga aagcagcaaa 1320 tcagcaaaac cacctctcga acccagtact catgcaagag ctagtggaaa agcttccggg 1380 tacgatgcga ctcgactggg ccgttttcaa gaataaacat caacctgcta cattggaaac 1440 ctttggtgag tttatgtcag gactcgtaac agcagcgagt gaagtaactt tcgacctacc 1500 tgcgtttgaa aacacgccac gagtggaaaa gcgaaagccg agagacacgg gcattctcca 1560 cacccatttg tccgagactg aatcttcacg acaatggtcg tcagggccat cagcaacgcc 1620 aacgaccagt taccgagcaa gcaaaatgtg tgcagcgtgt ggacgtgacg gccatcgtgt 1680 aaacgaatgc actcagttca aagcggccag cgtagacgaa cgctggaaac tagttcaaca 1740 aaaaggtcta tgtagaactt gtttgaatag tcatggcaaa tggccgtgcc gatcgtggtc 1800 cggatgtgac atcgatggat gtcggcaaaa gcaccataca cttctccatt cttcctctcc 1860 tacactccat gatatgaaca tatctgcgag tcatgtgtcc gccggtgagt acaattggcc 1920 acttttccgc gtcgtaccgg tagtactgta tggaaatgaa cgatcacaaa tcacatttgc 1980 cttcatcgat gagggttcct cgtatacctt gttggaggaa tccgtcgcca agcagctcga 2040 cgttagtggt ccgacggatc cactgacctt gcaatggact ggtaatgtaa cgcgtgtgga 2100 gtcaaaatct caactggtga atcttaagat ttccggtaaa aatggttcct gtattcatga 2160 tctggaccat gtacatacag ttcgtcatct ggttctccca tcccaaacac tcaaatataa 2220 agatctggcg caccgcttcc cacatttgcg tggcctccct ttggaagatt atgagcttgt 2280 tcaaccacaa ttgttgattg gcttggacaa ccttcgactg tgtgttccgt tgaagcttcg 2340 cgaaggtgga ccgaaggacc ctatcggtgc gaagtgtcgt ttgggctgga gtatctacgg 2400 ctgcattccg gggcaaccaa aggcccctaa catagtcaac tttcatggcg ccgctgctga 2460 tccagaccac gagatgaacg aacagctccg tgattacttc acattggaga gtgtaggtgt 2520 ttcagcctgc tatgttggtg aggtacaaga atccgcggat gaaatacgtg cgaagcgact 2580 gctaaaggag acaactcggc gctcggtatc cggaaccagt ttcgagactg gactgctgtg 2640 gaacaccgat agtcctaact tccccgacag ctttccgatg gccatccgcc gtatgctagc 2700 cctagaacgg agatttgaac gggaaccaga gctcggagaa aacgtgcgct ctcagattgc 2760 gagctacgaa caaaaaggat acgcccataa agccaagcta gaggaattga cctcggtaga 2820 ttcaaaccgc gtgtggtacc ttcctctggg tatcgtgacc aaccctaaaa aacctgggaa 2880 ggtcagatta atttgggatg cagcggccaa agtgggcgga gtatccttca attcgagact 2940 cctcaagggt cccgatctcc tcaccccact tcctcaagta ctgtgtcatt tccgccaatt 3000 tccggtggca gtgtgtggcg acataatgga aatgttccac cagatcaaaa tccgctttcc 3060 cgattgccag tcccagcgct tcgtttaccg aaataaaccg actgatcatc cccagatata 3120 tgtgatggat gttgccacat ttgggtcgaa aagctcacct gcatcggccc aatatgtaaa 3180 gaatctcaac gctaaagagt ttgaagagga atttccacgc gcggcgcttg ccatagaaaa 3240 cagccactac gtagacgatt acttagacag ttttcagaca ataaatgaag caatagaagt 3300 ggtcaatgaa gtcaaattag tgcattccag aggaggcttc acactgaggc attttctgtc 3360 aaacgaatcg gacgtattgc acggagttgg agaatttgaa gaagaaacgg taaagaatct 3420 ggcgttagaa cgaggtgaca aatcagaatc ggtgttggga atgaaatggc ttccaaagga 3480 agatgtgttc gtgtattcgt tcgccttaag ggacgatctt gcgcaaattt tagaggaacg 3540 acatatccca tccaaacgtg aggttcttaa agttgttatg agcttgtttg atcctcttgg 3600 gttcatctcc ttctttttga tccacggaaa aatcttgatc caagacattt ggtccagagg 3660 aaccgagtgg gacgaatcaa tagcccagga tctctacacg cgttggcgac aatggactgg 3720 cttgttctcg gagttggatt ctgtgcgcat accacggtgc tactttcggc atccatttcc 3780 aaagaacctc gataatcttc aagttcacgt cttcgtcgac gccagtgaag cagcatactc 3840 ttgtgtggcg tacttccgtt tagaatatga aggagtcgtg caagtagcat ttgttagctc 3900 caagaccaaa gtcgctccgc tcaaaacgat atcaataccc aggcttgaac taaaggcagc 3960 ggttctgggt acccggatgc tgaacagtat caaaaaacaa cacagctatc caatcgccca 4020 gcagttcatg tggagtgatg ccggtgtctg ccttgcttgg atacggtcca aaaatcatcg 4080 tcagtatcat caatttgttt cggtccgcgt gggtgagata ttaatgtcaa ccgagccgca 4140 agattggaga tgggtcccgt cacgaatcaa cgttgcagac tcggctacca agtggaagag 4200 tggtccacag ctatccatgg agaatccttg gtttcatggc gttcaattcc tacatgaatc 4260 cgaagaacga tggccgaagg aacgatcact ctcagatact aacgaagaac tccgccgcac 4320 ccaattacag cttggacact tctctccaat aattgacatc tcaaggttta gctcatgggt 4380 tcggcttcac cgtgccactg catatgtgct acgttacatc gataaccttc gccggaagag 4440 ggatggtcaa caattggaac tcggcattct caaccagtac gaattacagc actccgaaca 4500 agctttgtgg aaaatggcgc aaggcgaggc gtttcctgat gaaatttcac tgctaactaa 4560 atcaaaagga ccaccggaag gacgccattg tattgttgaa aaaagcagcc caatctacaa 4620 gacttggcct ttcctagaca acgacggaat tctgcgaatg aggggacgta tcggtgctgc 4680 accacatgca ccgactgaag ctaagttccc tgccatcctc cccaaacaac acctaattac 4740 atttctcatt gtagactggt atcaccgccg ttttcgccat gccaaccgag aaaccatcgt 4800 caacgagatt cgtcaacgtt tcgaaatagc aaacctcagg tcactcgtgc agaaggtcgc 4860 caagaaatgc atctggtgcc gcgtaacgaa agctgcacca aaaccaccag ttatggctcc 4920 tcttccagag atgcgattaa aagcctttgt ccgacccttt accttcgtcg ggctggatta 4980 ttttggaccc gtgcttgtca aggtgggtcg aagtaatgcc aaacggtggg tggccctatt 5040 cacctgccta acaacgaggg ctgtgcacat ggaggctgta cattcgttga gcacggagtc 5100 ctgcattatg gcagtgcgac gcttcgtgtc acgtcgcggg ccacctaggg agttttggac 5160 ggacaatgcc acctgctttc aaggcgcaaa taatgagttg aaatcagaga tcaaagccaa 5220 aacaaaagcc ctagccctta cctttactag tgcccaaaca agctggaatt tcattccccc 5280 tgcaacaccg cacatgggcg gtgcatggga gaggcttgtt cgctcggtta aggtggcgat 5340 cggtgcgatt ctggatgcac ctcgtaaacc tgatgatgaa acactagaga cgatactcta 5400 cgaagctgag gccatggtga acagtagacc gctcacgtac ataccattgg agtcagcaga 5460 tgaagaggcg ttgacaccca accattttct tctcggcagt tcaactggag aaaagatcct 5520 acccacagaa ccagtagaac aacgtgcagt ccttcgaagt agctggaaat tagctcaata 5580 catttccgat cagttttgga aaagatggct taaggagtat ctgcccgtca tcacaagaag 5640 gagcaaatgg ttcgatgaag ccaaagacct ggcggtcgga gatttggttc tggtggctgg 5700 cgatacggcg aggaaccagt ggattcgggg acgaatcgag gaagttttac cgggacgaga 5760 cggaagagtg cggcaagcgc tcgtacggac atcgtcaggg accctacgaa gatcagccac 5820 aaagatagct gttttggacg tcgtggagtc tagtgcacca ggctcgagat tggattcggt 5880 ttcaaaccct tgccaggtcc acgggcgggg gta 5913 // ID Gypsy-152_AA-LTR repbase; DNA; INV; 1557 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-152_AA_; KW Gypsy-152_AA-I; Gypsy-152_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1557 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1030-1030 (2011). XX DR [2] (Consensus) XX SQ Sequence 1557 BP; 475 A; 304 C; 359 G; 419 T; 0 other; tgtaacggta agagtacaac cctaagtatt aaattaatta aactcgcctt taaatataca 60 accatcgcaa acggcttaag ctactcgaac ctttcaacaa tgttcccatc ccaactgctc 120 gaatggtata cccaatgtgt aggcgagaga aagaggaaaa atagaaactg taattttcca 180 cgacgtaaca agccgtgatt ggatcactca agaacaatgc agacaaatct ttacatttaa 240 aatattacga cgcaatttca tttgattggc tggtccaagg tcacgggacc tgaaaggaaa 300 agccatttgt aaaatcaaat cacttttatt gaagtgcttg gagtgatcag ttcggtgtcg 360 gaaaagctta aagttttaaa gtaaatctct ttgcagttgt tttgtggtga attttagaac 420 cttaaaagaa gagtaagtgc catgaaatca ttataaaatt gtaaagctaa tttattccca 480 ttgccagaag attatccttt gtgtgctcaa agttagtgaa aggcactagt gactaattta 540 ggttgaagaa acgtagaccc aggtaagaca aattgtgatt caaatgtatt caatctgtga 600 aataagaccc taaaatcacg ttgataggtt ttcccattaa cgtgggagcc cggtcggggc 660 accgattgcc tctaattcga gagcggaaaa ggttagaaac cgcgagtaag ttgagaagtg 720 gtgcagatta gtatcggcga gtatccacgt ggcttggaag ccacgggtac cgaagtagtc 780 caaccgctgc tctccctggt gtcgggcgga catttaagcc gagactgccg acctaccacc 840 caccacccgt tggaaaattc gctcgaatca agctgttctt ggccgccggc tgaatctaac 900 ctctcgttac tagcgtgaga gacgacggct gattctaacc tctcgggccg cgagagatat 960 cgagttgtgc acgcgttaca acaaagcaag ctttcggaat agtcaggaca gcaacctgcg 1020 gcaggtgccg acttcgtcac ccgtctcgtc gtagaggaca gctgctaacg gccgtattgc 1080 agatgaaaag taaagggtct tcaaattaag gtcagtgatc actgtttaga ataacaaatt 1140 gaaatgagct caagtaggtc tagctagaca ggaattaggt tttggaatga tccatgaatt 1200 aaatgcaagg aatcgcgtgg cgaataaata ggtttttcaa attttaaaac gaaataaata 1260 tgcgcttcca tgagagtagg ttgtgtttgt acatatttta aataaaggtt ttctaaaatg 1320 atcttgaatg atacttttga attagattag ataggatatt ggttggaaaa cgtttcgatt 1380 aactcgccca cagattgaga gtttcaggcc gggcatccct gaaataaact ccgccaacga 1440 cctccttttg gagtggcgcg taagtcgttc gcaggtttca aagaaacttt tcattccggt 1500 attttgggga ctgagcaatg caacgctctg tccgttggtt gtaagcggag cgttaca 1557 // ID Copia-1_BM-LTR repbase; DNA; INV; 186 BP. XX AC nscaf2830; XX DT 19-MAR-2010 (Rel. 15.04, Created) DT 19-MAR-2010 (Rel. 15.04, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_BM_; KW Copia-1_BM-I; Copia-1_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(4), 584-584 (2010). XX DR Genome; nscaf2830; Positions 424014 424199. XX SQ Sequence 186 BP; 48 A; 38 C; 30 G; 70 T; 0 other; tgttagagaa atggaatgca gccattttag tttccctttg tcttacgcct ggggcgctgt 60 actctgtact acttgtcaag ctgacattct gtgtactatt atattttttc taccaataaa 120 tgcgagttat attgtgatcc taaaaacgta tcattatttc ttcaacaacc acgtagtttt 180 cctaca 186 // ID Crack-17_BF repbase; DNA; INV; 3521 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-17_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-17_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3521 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3521 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 822-822 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..3310 FT /product="Crack-17_BF_2p" FT /translation="NCTYFSAVTLSHHTCTLLFPLGGNVREKNTSNAVTST FT SVNCVHLCFHTLLILSGDVAVNPGPKDPCGICNKCVRKNQKGICCDLCDKW FT FHINCTGMDTHEYFTLANSDQYWYCNPCMLPCFSDSFFDSSSNDNSTSSSV FT SDIVNDEEILESTDKIPPSKGLTIVHLNICSLYSKLDQLYVFMSSNNVDVM FT TVSETHLDSTILDSELHIEGYHLYRNDRNRHGGGVAIYVSEAYSHTEVPEL FT KQPGLEAVFCTVNIPYTKPITIASIYRPPSSPVEFFSLLRDALEKWSLTKS FT DSELFLLGDFNVDINSNSSPGAKSLKYLSREFQLQQVINTPTRVNQHSSSL FT IDHVYCSDIHRVNDYGVVNCTLSDHYAVYCIRKGKRPNSAPKYVTSRKFTK FT FQEEDFLTDLKALDWSPVYQATCVEEAWSIFKSFFITISDIHAPYISKRSK FT AMQPPWLSADIKKMMFERDKMKARARKTGDSTDWEGYRATRNKVNRLVKQA FT KANYCQHKLENCSSEPNRIWTAVKEILPRKSSVITKTLNWAGQHLCDPLSI FT ANCFNNFFVTVGRKLAERFTQQTLTLQCPTVYRQISSTFQFKQVNDDSVYN FT KLSTLSNNKATGLDHIHSRLLKAAAPCICKPVTHIFNMSLSSGHIPKEWKR FT ARVTPIHKGGDQDDPNNYRPISVLPVIMKAFEKEVHGQFLEYLHQHQILAT FT QQSGFRPGHSTATTLIDATDYMLHNISSGKLVGAVFLDLKKAFDTVDQNVL FT LQKLSWVGVHGVEHHWFTDYLAGREQTVSLNGCESDFLPVTLGVPQGSILG FT PLLFILYINDLPSSVSTVCKVCLYADDTAIFCSSKDARVIENTLNSELANL FT ATWLHSNRLTLNVAKTKWILFGSSGKLRAVPPLRLQIDQETVEQVYQFKYL FT GVLLDCNISWNEHIDMVCSKVSQRIGLLRRLRSCLTVNIASMLYRCMILPL FT LEYCDIVWENCNITKQRQLQVLQNRAARVILQMKRRSSVQDLHSRLNWKYL FT TQRRKEHMCVMVFKCINGLVPTYLDTTFVHNHLLHSHNTRQASLLHKPNYT FT NTAGHRTFAYRGATYYNTLSADIRHAPTLRQFKAALRHTPPHMDL*" XX SQ Sequence 3521 BP; 1059 A; 803 C; 712 G; 947 T; 0 other; caactgtacc tacttcagtg ccgtcacact aagtcaccac acttgtactc tgctattccc 60 attgggtgga aatgtcaggg aaaagaacac ctcaaatgct gtgacttcga cttcagtaaa 120 ctgtgtacat ctctgtttcc acactttgct aattttgtct ggcgatgttg cagtcaaccc 180 tgggcctaag gacccatgtg gcatatgtaa taagtgtgtg cggaagaatc agaagggaat 240 atgctgcgac ttatgtgata agtggtttca tataaattgc acgggtatgg atacccatga 300 atacttcacc ctagcgaact cagatcagta ttggtattgt aacccctgta tgcttccttg 360 ctttagtgat tcattttttg attcttcaag caatgacaat agtacttcta gtagcgtctc 420 tgatattgtg aatgacgaag aaatcttgga aagtactgat aagatacctc catcaaaggg 480 cctaaccatt gtacatctca atatctgtag tctgtatagc aagttggatc aattgtatgt 540 atttatgtct tccaataatg ttgatgttat gacggttagt gaaacccacc tggacagtac 600 gatcctggac tcagagctac atatagaggg ctatcacctg tacagaaatg acaggaacag 660 acacgggggt ggggttgcca tatatgtgtc agaggcttac agccatacag aagtacctga 720 gctaaaacag cctggcctgg aggcagtttt ctgcacagta aacatacctt acactaaacc 780 catcaccatt gcttctatat acagaccacc atcaagccct gtggagtttt tctccttgtt 840 gagggatgct cttgagaaat ggagtctgac caagtcagac tccgaactgt ttcttctggg 900 cgatttcaat gttgacatta actcaaacag tagtccaggt gctaagtctc taaaatacct 960 aagccgtgag ttccaactac agcaggtaat taatacacct actagagtaa accaacattc 1020 gagtagtctt atagaccatg tctactgcag tgacatacat agagttaatg actatggggt 1080 agttaattgt accctctccg atcactatgc agtgtactgt attagaaagg gtaaacgccc 1140 aaactcagcc cctaagtatg taacctcacg caagtttaca aagttccaag aagaggactt 1200 tctcactgac cttaaggctc ttgattggag cccggtttac caggctacct gtgtagaaga 1260 agcatggtca atatttaagt ccttctttat caccattagt gacattcatg ctccctatat 1320 ctccaagcgc tctaaagcta tgcagccccc ctggctctca gcagacataa agaaaatgat 1380 gtttgaaagg gacaagatga aagctagggc tcggaagact ggggacagca cagactggga 1440 gggttacagg gccacacgga acaaagtaaa cagactagtt aagcaggcaa aggccaacta 1500 ctgccaacac aagctggaaa actgctcgtc agaaccaaac cgaatatgga cagcagttaa 1560 ggaaatctta cccagaaaga gctctgttat tacaaagact ctcaactggg caggacaaca 1620 tctctgtgat ccactcagca tagcaaactg ttttaacaac ttctttgtta cagttggcag 1680 gaagcttgca gagaggttca cacaacagac actgacccta cagtgcccca cagtgtacag 1740 acagatctca tccacctttc aattcaaaca ggttaatgat gactctgtgt ataacaaact 1800 cagtacattg agcaacaaca aggcaacagg tcttgatcat attcattcaa gactattgaa 1860 agctgcagct ccatgtatct gtaaacctgt cactcacata tttaacatgt ccctctcctc 1920 aggtcacatt ccaaaagaat ggaaaagggc cagggttaca cctatacaca aaggggggga 1980 tcaagatgac ccaaacaact acagacccat ttcagtgtta ccagtaatta tgaaagcatt 2040 tgaaaaggaa gtacatgggc agttcttaga atatctgcac caacaccaga ttcttgctac 2100 tcaacagtca ggtttcagac ctggacattc tacagccaca acccttattg atgccacaga 2160 ttacatgtta cacaacatat cgagtggtaa actcgtgggg gccgttttct tagatctgaa 2220 aaaggcattt gacaccgtgg atcagaatgt cttactacag aaactgtcct gggttggtgt 2280 acatggggtt gagcatcact ggttcactga ttacttagct ggaagagaac aaactgttag 2340 tctaaatgga tgtgaatccg acttcctacc agttacatta ggggtacccc agggctctat 2400 tctgggcccc ctactgttca ttctgtacat aaacgactta ccttcatcag tctctactgt 2460 atgtaaagta tgtctgtatg cagatgatac tgctattttc tgttccagca aggacgctag 2520 agtcatagag aacaccttaa actctgaact agcaaacctt gccacatggc tacattcaaa 2580 tcgtcttaca ctgaatgtcg ctaagactaa gtggattcta ttcggtagta gtgggaagct 2640 aagagcagta cctccactta gactccaaat tgatcaggaa acagttgaac aagtgtacca 2700 attcaaatac ctaggtgttc tccttgactg caatatctca tggaacgaac acattgacat 2760 ggtatgttcc aaggtatcac agagaatagg tttgctaaga cgcttaagat catgtctcac 2820 tgtcaacatt gcaagtatgt tatacagatg catgattcta ccactacttg agtattgcga 2880 cattgtttgg gaaaattgta acatcaccaa gcagcgacaa cttcaggttc ttcaaaatcg 2940 agctgccagg gtcatcctac aaatgaagcg acgatccagt gtccaagacc tacacagcag 3000 actcaactgg aagtatctta cgcagcggag aaaggaacac atgtgtgtga tggtctttaa 3060 atgtattaac gggcttgtcc cgacatactt ggataccacc tttgtacaca accacctgtt 3120 acacagccat aacaccagac aagcctcttt gcttcataaa cccaattaca caaacacagc 3180 tggccaccgc acatttgcat acagaggagc aacatattac aatacccttt cggcagacat 3240 tagacatgca ccaactctac ggcagtttaa agctgcccta agacacactc ctccgcacat 3300 ggacctctga cctccaccgt tgacgttttt tgttgacctt gacaactgtt cttggtaaag 3360 acgcttcggt ttatttgtat atattgtttg ttgtcctctg gttgtcttac tgaatgtaac 3420 acgtttcgtt tttgttattt atgtagtggg cccccttgga aatcaacttc aaacaagttg 3480 aaggggctac ccacatgttt tgagatgaat aaataaataa a 3521 // ID CR1-39_HM repbase; DNA; INV; 4480 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-39_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4480 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1867-1867 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(156..1037,1041..2408,2281..4284) FT /product="CR1-39_HM_1p" FT /translation="MATAAMLPDLLDLDNFGELMTKLCAKAEIPLPDPNQK FT AAVLKQEHFAVAIKSMFKAFKAVFDKQEKQIIDLSERIKAIETKKPDNVVI FT PLFSNIVKSRKLNAQETNCLMAVSKFQSEKDHKANNIVLYGLKESIKSEEE FT HKKNDDYDLVCNLLSELQLPPNNNVSISRTYRIRPKQHHHHAQKKENLTNM FT DEEDETIETSVSTLKPAPLIVEIVNGNLIRKEILKLAKNLAKSKDFNNVFI FT QPDLTSAERSLRKELLTKRNNLNKEIPRENNRFTGTHYFGIRGTEVKKIKI FT VQQFITTRFLYTPINPTIKTNTIKNNSRIHLSCCSSKEVPGNITLTQIIPS FT SIPNQQKLFSNELNSDEKIMPDRKLLKTKALKPKRPKPNRTLRSQYSQQHN FT KALLRDPTTIDFSDILYYWANNPCSLNKDKHNELVARLANIRDPSLLPHVI FT WFTETWFTNDSDTSIAGYQLHRKDRIGRGGGVAIYIKDSVVAEETNVVQLN FT SPEIEQIWRMIKIGEDVILIGCIYRPHDFNDDYFVSVLATILAAKNVLQKL FT GCSSMLLYGDFNLKETSYKSIEEGSAVATIAYVAHKHPTDLKFQECLNECH FT LTQLVTFPTYRQYRDAPFSSTLDLIISDKPDRSIEITRADHLGDTPGGQAH FT CMIHGLFALTGHNTTTTPMKKRFIWSKADYNSISTCISSIDWITTFNGSSV FT HENYKILVKNYNDAVNKFIPTTTSPFYKNQPLWITPEVLKTIKKKRKALGL FT KTTTTQSINLFQQQPHHSIKINRCGLRQKYLKLSKKKEKLWGKYIAAGRET FT HEALRVKHKLACKIVKKTINKAVTDHEEKLVKDSKSHPKNVHAYVRSKQEV FT KDPLHSIETDNGIITTDKNIICSTLNNYFQSVFETEPDGSMPTFPDRTQAT FT CKINENWFTIEDIQSHLNNLEETKSIGVDGIHPRVLRNCAAAFAIPFNSIF FT RQSLLSGSVPEYWKKSNITPIFKKGSKLKAYNYRPVSLTSIPCKVMEKIIH FT KHIMAHCVENNLISKHQHGFIRKKGCLTNLLEARDILTEAMHQGYSADIIY FT TDFAKAFDKVPHKRLLHKLKAYGIHKKLLLWIKMWLKDRKQRVVLGEHVSE FT WKNVTSGVPQGSVLGPLLFILFINDLPDSIVHKIMLYADDSKIIGIIKSAS FT DNTTLQADIDNAVTWSNTWLMHFNIDKCKVMHAGRPNKRLSHDYSMETADG FT MRHTIATTTIECDLGVLISNDLKVRAQVEKAASAANRMLGKLKNAFRSRNL FT NLWRTLYISYVRPHLEFAIQAWSPYLETDIKLLEDVQDRVTKTISSISHFD FT KIKRLQVLKLTTLKERRLRGDIIEQFKIFNNYDEVQFMVPQHQLIGREVYN FT LRGHDKQLDRQFVRGCEERSNFFTNRVAVHWNSLTQHAVNAPSLNAFKDRI FT SMR*" XX SQ Sequence 4480 BP; 1623 A; 915 C; 742 G; 1200 T; 0 other; tatatatata tataaaaaca catactgaaa ctttttagtc cctgtgtaat aagtacataa 60 ttaaatcaat caataaatat ttcacatcag ggaggtaatc cactgttatt tagcgtggta 120 gccctcccac catcaatatc agcttggacc atccaatggc aactgctgca atgttgcccg 180 atttgttgga tcttgacaat tttggtgaat taatgaccaa attgtgtgct aaggcagaaa 240 ttccactacc tgaccctaat caaaaagctg cagtcttaaa acaagagcac ttcgcagtag 300 caatcaaatc tatgttcaag gctttcaagg cggtttttga taaacaagaa aagcaaataa 360 ttgatctaag cgaaagaatt aaagcaatag aaacaaagaa gccagataat gttgtaattc 420 cgcttttcag caacatagtt aaatcaagga aactcaatgc acaagaaact aactgtttaa 480 tggcagtttc aaaatttcaa tcagaaaaag atcataaagc aaataacata gttttatacg 540 gattaaaaga atcaattaaa tcagaagaag aacacaaaaa aaatgatgat tacgatctcg 600 tttgcaatct tttaagtgaa ctgcagctac cgccaaataa taatgtatcc atatcaagaa 660 cttatcgaat aagaccgaaa caacatcacc atcatgcaca aaaaaaggaa aacttaacaa 720 acatggacga agaagatgaa actattgaaa catctgtcag taccctaaaa cctgcacctt 780 taattgtcga aattgtaaac ggaaatttaa taagaaaaga aattttaaaa ctcgccaaga 840 acttagctaa aagcaaagac tttaacaatg tatttattca acctgattta accagtgcag 900 agcgcagttt aaggaaagag ctactcacga aaagaaacaa tctaaacaaa gaaataccgc 960 gagagaacaa cagatttact ggtactcact actttggtat cagaggtact gaggttaaga 1020 aaattaaaat tgtccaatag caattcatca ccacccggtt tctgtacacg ccaataaatc 1080 ctacaattaa aactaataca ataaaaaaca actcaagaat acatttatca tgttgctcct 1140 ctaaggaggt acctggtaat attacgttaa cccaaatcat accatcctca ataccaaatc 1200 aacaaaaatt attctccaat gaactaaatt ctgacgaaaa aattatgcca gacagaaaat 1260 tgttaaaaac taaagcatta aaacctaaac gaccaaaacc taatcgaact ttgcgatcac 1320 aatacagcca acaacataat aaagctttgc ttagagatcc aacaactatc gatttttctg 1380 atatacttta ttactgggcc aataatccat gctcacttaa taaagataaa cataatgaac 1440 ttgttgccag gctagctaat atcagagatc catcactgtt accgcacgtc atatggttta 1500 ctgaaacctg gttcacaaac gattctgata catcaatcgc cggttatcaa ttacatcgta 1560 aggacagaat cggcagagga ggcggtgttg ccatctacat caaagatagt gtcgttgctg 1620 aagaaacaaa tgttgtccaa cttaactcgc ctgaaattga acaaatatgg cgaatgatca 1680 aaatcgggga agatgtcatt ctcatcggtt gtatttatcg ccctcacgac ttcaatgatg 1740 actactttgt ttctgtttta gctaccatat tagcagcaaa aaatgttctt caaaaactag 1800 ggtgttcatc aatgcttctg tatggagatt tcaatcttaa agaaacgtct tacaaatcca 1860 ttgaagaagg aagtgctgta gcaaccattg cttatgtagc ccacaagcac ccaaccgatc 1920 taaaattcca agaatgtctc aatgagtgtc atcttaccca actggttaca tttccaacat 1980 atcgacagta tagggatgcc ccttttagta gcacgcttga cctaataatc tctgacaaac 2040 cagaccgctc catagaaatc actagagcag atcatctagg cgatacacca ggaggtcaag 2100 cacattgcat gatacacgga ctctttgctt taaccgggca caatacaaca actactccga 2160 tgaaaaaacg attcatctgg agcaaagccg actacaattc catatcaaca tgcatatcat 2220 ctatagattg gataacaact tttaatggaa gctctgtgca tgaaaattac aagattctag 2280 ttaaaaacta caacgacgca gtcaataaat ttattccaac aacaacctca ccattctata 2340 aaaatcaacc gttgtggatt acgccagaag tacttaaaac tatcaaaaaa aaaagaaaag 2400 ctctggggta agtatattgc tgctggtcgc gaaacacacg aagcacttcg agtaaaacat 2460 aaacttgctt gcaaaatagt taaaaagact ataaataagg cggtaaccga tcatgaggag 2520 aagcttgtca aagattcaaa atctcaccca aaaaatgtgc atgcgtatgt cagaagtaaa 2580 caagaagtca aagatcctct tcattcaatc gaaactgaca atggcattat tacaacagat 2640 aaaaacataa tttgctccac cctcaataac tattttcagt ctgtctttga aactgaacct 2700 gatggcagta tgccaacatt tcccgatcgt actcaagcaa catgtaaaat taatgaaaat 2760 tggttcacaa ttgaagacat tcaaagtcat ctaaataatc ttgaagagac taaatcaatt 2820 ggtgtagatg gaatccaccc acgtgtatta cgtaactgtg cagcagcatt tgcaatacca 2880 ttcaactcta tttttagaca atctttattg tctggctctg tacctgaata ttggaaaaaa 2940 tcaaacatta ctcccatctt taagaaaggg agcaagctta aggcctacaa ttatcgaccg 3000 gtctccctca cctcaatacc gtgcaaagta atggaaaaaa taattcacaa acatattatg 3060 gcgcactgtg tggaaaacaa tctaatctcc aagcatcagc atggttttat tcgtaaaaag 3120 ggatgtctga ctaacctcct cgaagctcgc gacatactta ccgaggctat gcatcaaggt 3180 tacagtgctg atattatcta cacagatttt gctaaggcat tcgataaagt cccgcacaaa 3240 cgtcttctac ataaactgaa agcatacggc attcacaaaa agctgctact ttggattaaa 3300 atgtggctaa aagatcgtaa gcaacgtgtt gtcttgggag aacacgtttc tgagtggaag 3360 aacgtcacaa gtggagttcc gcagggttct gtcttagggc cactgctttt tattttgttt 3420 attaacgacc ttcctgatag tatcgttcac aagataatgc tatacgctga tgacagcaaa 3480 attattggga tcataaagtc ggcatcggat aacacaacgc ttcaggcaga tattgataac 3540 gcggttacat ggtcaaacac ttggctcatg cactttaaca ttgataaatg taaagtgatg 3600 catgcaggtc gtcctaataa acgtttatca catgactact ctatggaaac cgctgatggc 3660 atgcgccaca ctattgcaac tacaactatt gaatgtgacc ttggagttct catatcaaac 3720 gatctgaaag tcagagcaca agttgaaaaa gcagcatccg cagccaatcg catgttagga 3780 aagctaaaaa atgctttccg cagtagaaac ttaaacttgt ggcgcacact ctatatatct 3840 tatgtccggc cacatctaga attcgccatt caagcctggt ccccgtatct cgaaacagac 3900 attaagttac ttgaggatgt tcaagaccgc gtcactaaga caatatcatc catcagccat 3960 tttgataaaa taaagcgtct gcaggtactc aaactaacca ccctcaaaga acgtcgtttg 4020 cgtggtgata taatcgaaca attcaagatt ttcaacaact atgacgaagt tcaatttatg 4080 gtcccacagc accaactaat tgggcgtgag gtgtacaacc tacgcggcca tgacaagcag 4140 cttgatcgtc aattcgtaag aggatgtgag gagagatcaa actttttcac aaaccgagtt 4200 gcggtccatt ggaactctct cacacagcac gcggtaaacg ctccatcttt aaatgcattt 4260 aaagatcgga tttcgatgcg ataaactgtt aaatcagcat agagactcct ggaaaagtct 4320 cttcatcaaa taaataattt aacatttaaa tatatacata tgtatatgta tatatttttt 4380 aattattgtt acagcattat tgcggtttaa atgtataaat ttgattactg aatatatcaa 4440 actcaaactc aaactcatat atatatatat atatatatat 4480 // ID BEL1_I_Dpse repbase; DNA; INV; 5439 BP. XX AC Unknown_singleton_86; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_Dpse; KW BEL1_LTR_Dpse; BEL1_I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5439 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1008-1008 (2009). XX DR Genome; Unknown_singleton_86; Positions 13362 18800. XX CC Positions [765-1319] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 255..1817 FT /product="BEL1_I_Dpse_2p" FT /translation="MRFLDKIRKRDFASEEANLAARFRNMEDVNIVIYRMV FT QEEAYSEEMALLRQGRIVSKKSSIYKLSPYLDETGVLRIRGRIDKSIGITR FT NLKQPVILPGGHNVTNLLIDYFHRKFQHQLTEIVVNEMRQLYHMPGLRARV FT RAVAKTCQRCRNKRALPEAPEMGSLPPERLAINELPFTTTGIDYFGPLEVN FT VGRRREKRWGVLFTCLTVRAVHIELANSLSTDAFMLTLEMFVARRGVPKRI FT VSDNGTNFRGASRLLREEIERILPNELETKYPQIEWSFIPPGAPHMGGAWE FT RMIRSVKSILAEILEETHVQEPVLRTALAKIENMLNSRPLTYVPLECPEAD FT VLTPNHFLRAETSSMAAKNDGSAIGAMLGKSFRIAGQIVDSFRKRWLKEYL FT PCLTTRTKWHGTPTNPIQLGDVVVLADETSPKVQWRKGVIMDLRVAKDGIA FT RSAVIRTASGLLTRPIVKLAKLDVRTTNPLMDQGVTRSLANITTSDEVCEG FT RNVTAETAAAALNITAPNSPEYHCA" FT CDS 2362..4194 FT /product="BEL1_I_Dpse_1p" FT /translation="MDGVKGQQSESGGETPCGLGEINVPEKMLGACSETEG FT GAWETHRVRDTYIDVISAATTATPNVSTTAPTSHQAPEHTNQGGGVDQAFS FT PRTSPLDSWETFAQQRADALGRPTYATAFEGMRSIPTQSRLPQGGGLMSST FT VINGGWDRAVPNWTGVHMSGPTEATRTGMPTGLNPNETMPVGATWHPMAAP FT AQGTNLHGQNPFIRAQPAQYYPSTSRNISDPYAYGSNQAVAPAGAERCLTA FT SQIAARQVINKDLPCFSGNPEEWPMFIVNFEQSTERCGLSDQENLMRLQRC FT LKGAALEAVREKLMMPATVRLAIETLRMLYGRPEAIYHTLQTKLRMEPSVK FT SSSIESLVRLALSVQNYRATVDAIGLSAYLNDPMLLRDLVEKLPFDLKLDW FT ARQSAGLPRVNIAVFDDWLFRVASCASTVMPVSPEKEVCTKEKNPRVRLMV FT HNDSSAQPPCRRCNGAHKLEDCKQFLELEGPGRWNFARENKLCWRCLGKHF FT IRHCKSTRVCSVDGCSRKHHFLLHNSNQQSSTEENRPKESTTLLHTMQNEA FT ALFRYVPLTLYGPNGQMQVFALIDEGSACTLIETEITEKLGIDGPSDELCL FT RWTGDITQRDPNG" FT CDS 4149..5438 FT /product="BEL1_I_Dpse_3p" FT /translation="MPTLDWRHHAERSKWISLAISPRDNVARRFTLQNVRT FT VKNLDLPKQTLDDSILKSHSHLEKLPILSYKDVRASLIIGLDNVKLGVPLT FT VKERENSEFIAAKSRLGWSVYGRRKGDGPASPRVMHVCECNKHPQLDTLVK FT ESFSLDAVGINMAKSTLRSKEDERAKTILENSTRFSSEMKRWETGLLWRYD FT NVELPSSREMAMRRLRCLESRMTRDKVLREFVNEQIKRYEDIGYVRKLEKD FT EMVGGKRSWYLPVFVTKNPNKGKLRLVWDAAAKVEGIALNSMLLKGPDQLP FT SLVGVISRFREKSIGICGDIREMFHQVRIRSEDQSSQQFLWRYGDTSREPD FT RYVMSVMTFGASCSPAAAIYVRDRNAERFIEEQREEVEAILRNTFVDDWLQ FT SCDSEDEMVRLALAVRNIHKEGGFEMRGWISNSSTV" XX SQ Sequence 5439 BP; 1665 A; 1320 C; 1435 G; 1019 T; 0 other; agcgacgtgt tttttggacg agtcgagagt ttccaaccgg aagcgagatg gttctccggt 60 ccgccgttcc tatttgacga ggaagaacac tggccaacta cacagctcaa ctcgagtaat 120 ctggaacgga tacacggata cacgtacacg aatggacaga tacgccgaaa ccaccatcaa 180 agctgagtag cataatccca aatgtatgtc gttttagcaa atgggaaaag ttcacgggag 240 ccctgagatg cgtgatgaga tttctggaca aaatacgcaa aagagatttc gcatcagagg 300 aggcaaattt agccgcacgc tttcgcaaca tggaagatgt caacatcgtt atataccgga 360 tggtacaaga agaggcttac tctgaagaaa tggctcttct acgacaagga agaatcgtga 420 gtaagaaaag ttcgatctac aaattatcgc cgtacctcga tgaaacagga gtactgagaa 480 tacgcggccg catagataag tcgatcggca taacgaggaa cctgaaacaa ccagtaattc 540 tacccggagg acataacgtc actaatcttc tcattgatta tttccatcgg aagtttcagc 600 atcagcttac tgagatcgtc gtcaacgaga tgcgacaatt gtaccacatg ccaggactgc 660 gagcaagagt tcgcgccgta gcgaaaacct gccaaagatg ccgcaataaa agagcgcttc 720 cagaggcacc ggagatggga agcctaccac cagagaggct agcaatcaac gaattgccat 780 tcacaaccac aggtatcgac tacttcggac cactggaggt aaacgtagga agacgacgcg 840 agaaacgatg gggagtacta ttcacgtgcc tcaccgtgcg agcagtacac atcgagctgg 900 caaactccct gtcaacagac gccttcatgc taacattgga aatgttcgta gcccgtcgag 960 gagtccccaa gagaatcgtg tcggacaacg ggacgaactt ccgaggagcc agccgcctac 1020 tacgagaaga aattgaaaga attctaccga atgaactgga aaccaagtac ccgcaaatag 1080 aatggtcctt cataccccca ggagcacccc acatgggagg tgcgtgggaa agaatgatca 1140 gatcggtgaa atccatacta gccgaaatat tagaggagac gcacgtacaa gaaccagtgc 1200 tgagaacagc actagcaaag atagaaaata tgctaaattc ccgacccctg acttacgtgc 1260 cgttggagtg tccggaagcc gacgtactta caccgaatca ctttctgaga gcagagacca 1320 gctcgatggc agcaaagaac gacggcagcg caataggagc tatgcttggt aagagttttc 1380 gcattgcagg gcagatcgtg gacagcttca gaaaacgatg gctgaaggag tacctaccgt 1440 gcctgacgac acgaaccaaa tggcatggaa ccccaaccaa ccctatccaa ctcggagacg 1500 tcgtggttct agcggacgaa acaagcccca aggtgcagtg gaggaaaggc gtcatcatgg 1560 accttcgagt agccaaggac gggatcgccc gaagcgcggt gatccgcact gcatctggac 1620 tgctaactcg gccaatagtt aaactggcca agttagacgt gaggacaacg aacccactaa 1680 tggatcaagg cgtaaccaga agtttggcaa acatcacgac aagcgacgaa gtttgcgaag 1740 gaaggaatgt tacggcagaa accgcagcgg cagctctaaa tatcactgcg cctaacagtc 1800 ctgaatatca ctgcgcctaa cagccctgaa tatcactgcg ttaacagcac aaacattaac 1860 agcgctgagc aaagggttaa ttagggcgaa gggcagacaa aggcaggaag aaggaagaac 1920 gaaaacaaag gcagcggaaa aagcactgaa gcggtacaag ttttttagcg ctagctcgta 1980 gtttaacacc cgcccgaatt tgtaaatgaa taaatcaata aaataaatca aaccaataaa 2040 accgacctga agaacaaaac cacgggaggc gtattgaaca cgggaggctt atcaaacgta 2100 cggccccaga tttggaggcg ttgcccggcg cggcaatatt ctaaaaaact tttcctgcgc 2160 aacagagagc aaatcgaata attacgaccg acagaaagaa gatcggcgtg cgacgcagtc 2220 agctgcgtca cgagggattc gactttgcag gtgagatatc ctcatacagt tggacagaaa 2280 ggctaatctc ctacattcga gcgacacgtg gttcgtgtat aacctcaaac caggactgta 2340 caaaaaagac accggagtaa aatggacggc gtcaaaggtc agcagtcgga atctggcggt 2400 gagaccccgt gtggattagg agaaataaat gtgccagaaa agatgcttgg tgcgtgttct 2460 gagactgagg gaggtgcatg ggaaactcat cgcgttaggg atacatatat agatgtgatc 2520 tcggcggcaa caacggccac acctaacgtg tccaccacag ccccaacgag ccatcaagca 2580 ccagagcata ccaaccaagg aggaggggtg gatcaagcct tctcaccaag aacgagccct 2640 ttagatagct gggaaacgtt cgctcagcag agagcggatg cattaggaag gccgacgtac 2700 gccacggcct tcgagggaat gcgctccatc ccgacgcaat ccagactgcc tcaaggagga 2760 ggcctcatga gctccacggt aatcaacgga ggctgggaca gagcagttcc caactggacc 2820 ggagtccaca tgtcaggacc cacggaagcc acgcggactg gaatgcctac tgggctgaat 2880 ccgaacgaga caatgccagt cggcgctacg tggcatccaa tggccgcacc ggcgcagggt 2940 accaacctgc atgggcaaaa ccccttcatt cgcgcccagc ctgcacagta ttacccaagt 3000 acgagtcgaa atatttcgga cccgtacgcg tacgggagta atcaagcggt cgcgccagcg 3060 ggggcggaga gatgcttgac agcatcccag atagccgcaa gacaagtcat caacaaggat 3120 ctaccatgct tttctggaaa ccccgaggaa tggcccatgt tcatagtaaa cttcgagcaa 3180 tccacagaac gctgcggact cagcgatcaa gagaatctca tgaggctgca aaggtgcctg 3240 aaaggtgctg ctttggaagc agtacgagag aaattaatga tgcccgcaac agtccgacta 3300 gccatcgaga ctcttcgaat gctatacggc cgaccagagg ccatatatca caccctgcag 3360 acgaagctaa gaatggagcc gtccgtgaaa tcgagcagca tcgaaagctt agtgcgcctg 3420 gcgctgtcgg tgcaaaatta tcgcgctact gttgacgcga taggactgtc cgcctatcta 3480 aacgacccaa tgttacttcg cgaccttgtg gagaaactcc cattcgactt gaaactggat 3540 tgggccagac aaagcgctgg acttccgagg gtaaatatcg ctgttttcga cgattggcta 3600 ttccgagtcg catcgtgcgc aagcaccgtg atgccagtgt cacccgagaa agaagtctgc 3660 acgaaggaga agaacccaag agtccggctc atggtgcaca atgatagcag cgcacaaccg 3720 ccctgccgta ggtgcaatgg ggcccacaaa ttggaggact gtaagcagtt cctggagcta 3780 gaagggccag gtagatggaa cttcgccaga gaaaataagc tatgctggag gtgccttggc 3840 aaacacttta taaggcactg taagtcaaca agagtatgtt ccgtcgacgg atgctcgaga 3900 aagcaccatt tcctcctcca caacagtaac cagcaaagct ccacggaaga gaataggccg 3960 aaggaaagca ctacgttact gcacacgatg cagaacgaag ctgcgttgtt tagatatgtc 4020 cccttaacat tgtatggacc gaacggccaa atgcaagtgt tcgcactgat agacgaagga 4080 tccgcctgta ccctaatcga gacggagata acggaaaaac taggtatcga tggcccctcg 4140 gacgagctat gcctacgctg gactggagac atcacgcaga gagatccaaa tggataagcc 4200 tagcgatctc tccaagggac aatgtggctc gcagattcac gcttcaaaac gtgagaacag 4260 taaaaaacct ggatctgccg aagcagacgc tggacgactc aatcctaaaa tcccattcgc 4320 accttgaaaa gctaccaata ctcagctaca aggacgtgag agcgtcattg atcatcggat 4380 tggacaacgt gaaacttgga gtgccactga ccgtcaaaga acgcgagaat tccgaattta 4440 tcgccgccaa gtccaggttg gggtggtctg tttatggccg cagaaagggc gacgggccag 4500 cgagtccgcg ggtaatgcac gtatgcgaat gtaataaaca cccacagctg gacactcttg 4560 taaaagagag tttctcatta gacgcggttg gcataaacat ggcgaagagc acactcaggt 4620 ccaaagagga cgaaagagca aaaacaatac tggaaaattc cacgaggttt agcagcgaaa 4680 tgaagagatg ggagacagga ctcctctggc ggtacgacaa cgtggaactt ccttcctcca 4740 gagagatggc aatgcgacga cttcggtgcc tagaatctcg tatgacacgc gataaggtgc 4800 tacgagagtt cgtaaacgaa cagattaaga ggtacgaaga catcggctac gtccgaaaac 4860 tggaaaaaga tgaaatggtc ggaggcaaac gctcctggta tctgcccgta ttcgtgacta 4920 agaacccgaa caaaggcaaa ttaagactcg tatgggatgc agcagccaaa gtggaaggca 4980 tcgctcttaa ttcaatgcta ctaaagggcc ccgaccaatt gccatcgctg gtaggagtaa 5040 tatcgcgctt tcgagagaag tctattggca tctgcggaga tattagagaa atgtttcacc 5100 aggttaggat acgttcggag gatcaaagct cacagcagtt tctctggaga tatggtgaca 5160 ccagtcgaga accggatagg tacgtaatga gcgtaatgac ctttggtgct tcttgttccc 5220 cagcagcagc gatctacgtg agagatagga acgcagaaag gttcatcgag gaacaaagag 5280 aggaagtaga ggccatactt cgcaacacct ttgtagatga ctggctccaa agctgtgact 5340 cagaagacga aatggtccgt ctagcgctgg cagtgaggaa catacacaaa gaaggaggct 5400 tcgagatgag aggatggatt tctaactcga gcactgtga 5439 // ID DNA8-16B_AP repbase; DNA; INV; 167 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-16B_AP. XX NM DNA8-16B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-167 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1963-1963 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 167 BP; 45 A; 36 C; 37 G; 49 T; 0 other; cattggcgca aataggtggg ggcttggggg gacttagtcc ccccacattt ctgccaagtc 60 cccccagttc aaaagtcagt aattagtact gagcatatat aatttttaga aaatattata 120 ggaggggctt tagtcccccc aaaataattt gtctatttgc gcctatg 167 // ID OR1 repbase; DNA; INV; 1092 BP. XX AC D32089; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE SINE element. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW OR1; Repetitive element. XX OS Octopus vulgaris OC Eukaryota; Metazoa; Mollusca; Cephalopoda; Coleoidea; OC Neocoleoidea; Octopodiformes; Octopoda; Incirrata; Octopodidae; OC Octopus. XX RN [1] RP 1-1092 RA Ohshima K. and Okada N.; RT "Generality of the tRNA origin of short interspersed repetitive RT elements (SINEs). Characterization of three different RT tRNA-derived retroposons in the octopus."; RL J. Mol. Biol 243(1), 25-37 (1994). XX DR GenBank; D32089; Positions 510 1601. XX SQ Sequence 1092 BP; 245 A; 247 C; 254 G; 346 T; 0 other; cgtggaggcg caatggccca gtggttaggg cagcggactc gcggtcatag gatcgtggtt 60 tcgattttca gaccgggcgt tgtgagtgtt tattgagcga aaacacctaa aagctccatg 120 aggctccggc agggaatggt ggtgatccct gctgtattct ttcaccacaa ctttctctca 180 ctcttacttc ctgtttctat tgtacctgta tttcaaaggg ctggccttgt cactctcagt 240 gtcacgctga acatccccaa gaactacgtt aagggtacac gtgtctgtgg agtgctcagc 300 cacttacacg ttaatttcac gagcaggctg tcccgttgat tggatcaacc ggaaccctcg 360 tggtcgtaac cgacggagtg cttccatttt atatattcct atggttcatc aggaaatatg 420 tttaccatcc ttctgagatt ttgtatatat ggatgctatt gtttgttcta atttattcaa 480 attgtaagga aaatttagaa tctaaatcta cttctgggtt atataaagtc agtcgtttga 540 agttatcatt ttattctgat acaaatgtta tgtgtttaag agctcctctc ttaacgtcag 600 acctcccctt tatactgatg ctaattcaga tttcatagtt gcttatgatt gtactctttg 660 gtgtggagca tcacatatgg tgtctaaaca gtggtagtgt tgaagcgtgg aggcgcaata 720 gtccagtggt tagggcagcg gactcgcggt cataggatcg cggtttcgat tcccaaaccg 780 ggcgttgtta gtgtttattg agcgaaaaca cctaaaagct ccatgaggct ccggcaagag 840 gatggtggtg atccctgctg tactctttca ccacaacttt ctctcactct tacttcctgt 900 ttctgtggta cctgtatttc aaagggccgg ccttgtcact ctctgtgtca cgttgaattt 960 ccccccgaaa tacattaagg gtacacgtgt ctgtggagtg ctcagccact tatacgttaa 1020 tttcacgagc aggttgttcc gttgattcgg atcaaccgga accctcgtcg ccgtaaccga 1080 cggactgttt cc 1092 // ID CR1-114_AAe repbase; DNA; INV; 4978 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-114_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4978 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1202-1202 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 418..1461 FT /product="CR1-114_AAe_1p" FT /translation="MDSCQICSNSISSDKLVTCSGTXGYFFHYACAGLSKS FT HYSSWSANIGLYWFCASCRLNFNPNVCDREKTIVRALRELLLRTDSMDTRL FT ANYGEHLRKINMTLYGYQQRTSQTNRSHDNSTFHQQIDMLNLDDTIDNTVN FT RSRSCEETSFXEVLDEINTTVGQPTDKFIVGANKRVQIICNQPSPSTSKEH FT TNQSRASTPAASSQIPKTPISVSKSSNASNPNRNSVSNHPTGTTRTSSGPL FT SVANIGQPAADSVDFYVTPFTPDQKEEDVKLYIQEVANVAPSTIKVVKLVP FT RGKDLDDLSFVSFKVTVNKTASDVIGDPWYWPDGVTVRVFDHNQKNGSSIQ FT RPPQP" FT CDS 1422..4889 FT /product="CR1-114_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="SKKWFIHPATAPALVPFLSLESTTPGRTQKCIMDVPS FT HPDPAVSSNVPCSKSRPGTSAVGGMVRGIQNAFSGKYNSSFKLSSFDRFSN FT PRSDLQSNDHLGPCCFSTTSTNMSIRNSTSPGRTQKSTLDAPNHPDPAAAA FT DACFQSRPGISAVGGKEGGFQNASSGKYNSYYELPSPGTLSNSSFQSPVLL FT PSINQSPSTSADSALKVYYQNVRGLRTKIDSWFLAVTGCDYDVIALTETWL FT DNRINSVQLFGPLYDVYRCDRNANNSDKRSGGGVLIAVKKSLSSTQLNVGD FT SVLEQVWVKVGANNTTLYISSFYLPPDKTNDHQVIESHLQSIRAIDTLSSC FT DDRILILGDYNLPGIQWIQNEYGFVSIDYPNSTLNNTYYSIIDGMAFSNLH FT QLNATRNLNGRTLDLVFCNSLDNSVSEASELLTTIDSHHPALELLISTLVE FT ETGADDDERQLNFKQTDFGALTSYLDLIDWSFMSNWTDVDSAARCFIDVLN FT TWLIEHVPLKQKPRYPPWETNELRTLKRNKNAALRRMRKTRTPATKTEFQH FT FSRAYTSMNASCYRSYVHRMQNNLRRNPKSFWGFVNSKRKDNGLPVNMFLG FT EKHCTSSTNKCSLFAHRFSSVFNNEPCSDADVSRAVQDIIPDVVDIDIFDV FT TDSMILRAASKLKNSFNPGPDCIPAAVYKKCINSMLQPLRIIFNISLKRYK FT FPDVWKTSTMFPVFKKGSKRDIANYRGITSLSAGSKLFEIIVNDFLFFKIK FT NYLSTDQHGFFPGRSVTTNLVDFTTNCLMNIENGLQVDAVYTDLTAAFDSI FT NHNILLEKIKKIGASDGLVRWLKSYLVGRNLRVKIGGCYSGTFECRSGVPQ FT GSNLGPLLFSIFFNDVTNFLPKGCRLLYADDLKIYFIIKNEDDCVMLQTVL FT NQFSGWCSRNQMTLSIDKCSAISFHRKVKPICYNYRINGRLLERLSVVRDL FT GVMLDDNLSFNHHRSSIIDKANRQLGFISKISRDFTDPYCLKALFCSLVRP FT LLETADVVWTPYHSTWVERIERIQKRFLRHALKNLPWREPDNLPPYRERCQ FT LLSMDTLEQRRHVNQAVFIAKLLKGEIDCPNLLSLLPLHVPSRMLRNHTLL FT RPGQHRTNYGANAPLPAMLNHFNLVQQFFDFNMSTTSFKRKIVTQRLP" XX SQ Sequence 4978 BP; 1374 A; 1173 C; 993 G; 1433 T; 5 other; ctsactcgtt tgtgcaatcg gtcgctctta tttgttgtga ttttttcgtc gcgaatattt 60 tgatgttgca aacattgcaa atttgtcgcc gcgcgttttt cgtgttctga atggacttgc 120 ttaccggttt tcgatgttaa tgtgacattt tatgctctct ggatcgtgaa ttttcgtgca 180 ctgtttttag tgatttgttt gacaaactat tatttgtttt tcacacaccc acattcgctg 240 tcgagctttt tttgaaacac catctgccag aaaaatcctg aagckcatct accacagtca 300 acatcacgta caccggagtg ctttcgttag wgaaatcaca tacccaaagg cgcttgaggc 360 acatcttcct accgtcgcac tgcgtagctc tgtaaaaggg attatctttg ttgcaaaatg 420 gattcatgcc aaatctgctc aaatagcata agctctgaca aattggttac atgcagtgga 480 acctgkggct acttctttca ttatgcctgc gctgggttat caaagtcaca ttattcatcg 540 tggagtgcta atattggact ttactggttc tgcgcatcct gtcgattgaa tttcaatccc 600 aacgtctgcg atcgagagaa gacgatagtt agagctttac gcgaactgct tttacgcaca 660 gactcgatgg acactcgcct cgcgaattat ggagaacact tgcgcaaaat caacatgacg 720 ctgtatggct atcaacaacg cacatcgcag actaatcggt cacatgacaa ttcgacgttt 780 catcaacaaa ttgatatgct aaacctggat gataccattg acaataccgt taatcgctca 840 agatcctgcg aggaaacatc ctttttsgaa gttcttgatg agatcaacac tactgtgggt 900 caacctacag ataagttcat cgtcggtgcg aataaacggg ttcaaatcat atgcaaccag 960 ccttcaccat ctacatctaa agagcataca aatcagtctc gtgcatctac tcccgcagct 1020 tcttcacaaa taccaaaaac tccaatctct gtcagcaaat caagtaatgc atcgaatcct 1080 aatcgaaatt ctgtgagcaa ccaccctact ggtactacca ggacatcttc tggtcctcta 1140 agcgttgcca atatcggaca gcctgccgct gattctgtgg atttttacgt tactcctttc 1200 accccggacc aaaaagaaga agacgtaaaa ctgtacattc aagaggtcgc taatgttgcc 1260 ccatcaacca ttaaagttgt caaacttgta ccacgcggga aggatctaga tgatttgtct 1320 tttgtgtcct tcaaagttac cgtcaacaaa actgcctcgg atgtaatcgg cgacccttgg 1380 tattggcctg acggtgttac tgttcgtgtc ttcgaccata atcaaaaaaa tggttcatcc 1440 atccagcgac cgccccagcc ttagtaccat ttctatcgct cgaatcgaca accccaggac 1500 gcacacaaaa atgcattatg gatgtcccta gccaccccga ccctgccgtt tcgtcaaatg 1560 ttccttgctc taaaagtcgt cctggcacct ctgcagtcgg aggtatggta aggggcatcc 1620 aaaatgcctt ctcaggcaag tacaattcgt cgttcaagct ttcttcgttt gatcgtttct 1680 caaaccctcg tagcgacctg caatcgaatg atcatctggg accttgctgc ttttcaacca 1740 cgtcaacgaa catgtctata aggaactcga cttctccggg acgcacacaa aaaagcactt 1800 tggatgcccc caaccacccc gacccagccg ctgcagctga tgcctgtttt caaagtcgtc 1860 ctggcatctc tgccgtcgga ggaaaggaag ggggcttcca aaatgcctct tcaggcaagt 1920 acaattcgta ttatgaactt ccatcgcctg gcaccttatc aaattccagc ttccagtccc 1980 cagtgctact tccatcgata aaccaatctc caagcaccag tgccgattca gcgttaaaag 2040 tgtactacca gaatgtgcgt ggtttgcgca ctaaaatcga tagttggttt cttgctgtta 2100 ctggctgtga ttacgacgtg atcgccctta ctgaaacctg gttggacaac cgtatcaatt 2160 ccgttcaact gtttggaccc ttatacgacg tgtacaggtg cgaccgcaac gcgaacaaca 2220 gcgataagcg gtccggaggc ggagtactga tcgccgttaa aaagtctctc tcttccacgc 2280 aattgaatgt tggcgacagt gtattggagc aagtgtgggt taaagtcggt gcaaacaata 2340 caaccctgta tatctcgtcg ttttaccttc ctccggataa aacgaacgat caccaagtca 2400 ttgagtcgca tctgcagtcc attcgtgcaa tcgacactct tagttcatgt gatgaccgaa 2460 ttctgattct tggtgattac aatcttcctg gaattcagtg gatacagaat gagtatggtt 2520 tcgtttctat tgactatcca aactctactc ttaacaatac atactactcg attattgacg 2580 gtatggcttt cagcaatctc catcaattga atgctactcg taatttaaat ggacgtactc 2640 ttgacttggt gttctgtaat tcgcttgaca attctgtttc tgaagccagc gaactgctca 2700 ctactattga ttcgcatcat cccgcattag agcttcttat ttcaactttg gtcgaggaaa 2760 ctggcgctga cgatgatgaa cggcagctta acttcaaaca aacggacttc ggtgctttaa 2820 ctagttatct tgatctcata gattggtcct ttatgagcaa ttggacagat gtggattcag 2880 ctgctcgctg ttttatcgat gtgcttaata catggctgat agaacacgtc ccattaaaac 2940 aaaaaccacg atatccacct tgggagacta atgagctacg aactcttaag cgcaataaga 3000 acgccgcttt gcgtagaatg cgcaaaactc gaactccggc aacgaaaact gaattccagc 3060 attttagtag agcttatacc tcaatgaatg cttcttgcta tcgcagttac gtgcaccgaa 3120 tgcaaaacaa tctccgtcgc aacccaaaat ctttttgggg gtttgtaaat agcaagcgta 3180 aggacaacgg gcttcctgtc aacatgtttc ttggcgaaaa acactgcact tcttccacga 3240 acaagtgttc tttatttgcc caccgtttct ccagcgtctt caacaatgaa ccctgttcgg 3300 atgctgatgt aagcagagct gttcaagata tcattcctga tgtagttgac atcgatattt 3360 ttgatgtaac tgattcaatg atcctacgtg cggcaagtaa gcttaaaaac tcgtttaacc 3420 ctggtccaga ctgtattcct gcagcagtgt acaaaaagtg cattaattca atgcttcaac 3480 cattacgaat aatattcaac atctctctga agcgttataa gtttccggat gtctggaaga 3540 cttctaccat gtttcccgtc tttaaaaaag gatcgaagcg cgatatagca aactatcgtg 3600 ggataacgtc tcttagcgcc ggatctaagc tcttcgagat catcgtaaac gatttcttgt 3660 ttttcaaaat aaagaactat ttgtcgaccg atcaacatgg cttcttccct ggaagatcag 3720 ttactacgaa cctagtggac ttcaccacca attgccttat gaacatagaa aatggattac 3780 aagtggatgc cgtgtatact gacttgactg ccgcctttga cagtatcaat cacaacattc 3840 ttctggaaaa aattaagaaa ataggtgcat ctgatggtct cgttagatgg ctcaagagct 3900 acttagtggg tcggaattta cgtgtcaaaa ttggtggctg ctattcggga acatttgaat 3960 gcagatctgg agttccccaa ggcagcaatt tgggaccgct gctgttctct atcttcttca 4020 atgatgtcac caacttttta ccgaaaggat gcagattatt atacgcagat gaccttaaaa 4080 tctacttcat tataaagaac gaagatgact gcgtaatgtt gcaaaccgtg cttaaccagt 4140 tctccggatg gtgttcacgg aatcagatga ctctgagtat cgacaagtgt tcagctatat 4200 cattccatag gaaagtaaag cctatctgtt acaattatcg gatcaatgga aggttactgg 4260 agagattgtc tgttgtaaga gacttaggtg tgatgttgga cgacaacctt tcgtttaacc 4320 atcacaggag ctctatcatt gataaggcca accgccagct tggctttatc tcaaagattt 4380 cacgcgactt cactgatcct tactgcctga aagccttgtt ttgttcactg gtacgaccgt 4440 tgcttgaaac tgcagatgtt gtatggacac cttaccactc cacatgggtt gagcgaatag 4500 aaagaatcca gaagagattt ttacggcatg cgcttaaaaa tttgccatgg agagaacccg 4560 ataatctgcc accttaccga gagcgttgtc agttactgag catggacacc cttgaacaac 4620 gaagacatgt caaccaagca gtttttatcg caaaactttt aaaaggggag attgattgcc 4680 caaatcttct ttctttgctg cctctgcatg ttccatcgag aatgctacgc aatcatacgc 4740 tactccgccc cggtcaacat aggactaact atggtgctaa tgctccactt cctgccatgt 4800 taaaccactt taatttagta caacaatttt ttgattttaa tatgtcgaca actagtttta 4860 agaggaagat tgtcactcag cgactaccgt aataataagt taaggtcttg tataagaata 4920 gaaaaattca ctagggcgtg ttccgatgaa tgtatcccaa taaataaata aataaata 4978 // ID BEL-120_AA-I repbase; DNA; INV; 5727 BP. XX AC AAGE02025227; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-120_AA_; KW BEL-120_AA-LTR; BEL-120_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5727 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025227; Positions 35175 40901. XX CC Positions [4750-5331] - Integrase core CC 'ACTTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 415..5709 FT /product="BEL-120_AA-I_1p" FT /translation="MLEHTPVKSKQDKMAELKTLVHLRGQAKAKVTRIRKA FT IEEVIEGGVVQFTIAQLKVYSRNLETHFQEYLNLHHQITALLPPEKLDDND FT ATYLQFETHHTETAMLVEGLISSATPQPNPPPHVLQPDAGMMQPQIVVQQQ FT PLPVPIPSFDGKPENWPRFQQVFSDIMNRSRDSNAVKLHHLERALAGGSAA FT KLIDPKTLSDGNFAHAWELLLDVYEDERKAVDSHIHGLLGLKRMNKESSKQ FT MRELLNEVTRHVEGLKLLEQDLEGVSERFVVVILSDAFDPETRKQWEATIP FT HKEIPTYEDTIKFLKERCSLLERCEASHPKLSSTKESNPKSTPKPQPAKSF FT AMATPVANSEATCEICSASHPNFKCDRFVSLTVPQRRVKVRDLNLCFNCLR FT KGHLALKCSSSKSCQECQGRHHTLLHEPKIPKPDPKPIDIPKSKPESAQVV FT VSSEVPPPVESVSPPVTSNPVHKVPRSDGTLLLTAMVDVLDSHGNSHPCRA FT FLDCGSQPHLVSRSLVECLGIDQEPVHVEVFGATGKRSVLDRKATLSFRSR FT CSNYQSKIECLVTDVVTGPLPGRRIDTQSWNIPSGLLLADPNFHTPGEVQL FT LIGMKLFFHLMLPGQLMLGSQLPILKETRLGWVVAGGAEDVDENQHCYTAS FT LRPLTECMEKFWSVESVESIDAVSKEEQDCEQLFSSTTVRNPNGTYTVHLP FT LKESVSELGNNHSLALKRFHILENRFAKNADLKQAYVDFINEYKSLSHCQP FT IAESDDAPDKLVHYLPHHAVLKPSSSTTRLRVVFDASARSTGPSLNDVLKI FT GPTVQDDLFSIILRFRKHRYAITADISKMYRRIRVDESHSGLQRIFWRENP FT NDPLQILELSTVTYGTASAPYLATRTLLQLARDESQRFPIASKVVEEDVYV FT DDVVTGADTVEQAVNLRSDLTNLFNSGGFPVHKWCASDATILDTVPEEDRE FT KFIELESSDVNRVIKTLGLLWDPEQDIFRFHCELKFEPLSMPTKRIVLSEI FT ARLFDPLGLVAPVIVLAKEIMQRLWKNKVGWDDPLDEDTLKSWLEFRNSLK FT HIDMIKIPRFVGLEGVEYVEVHGFADASSRAYGAVVYLRVVAANGIKISLL FT CSKSRIAPLSPLTIPKMELCAALLLSRLIPKSISALKIDFQRVCLWSDSSI FT VLAWLKKSANQLEVFVRNRVTEINTATEGYEWSYIRSEENPADVVSRGQMP FT QNLASNCLWWNGPSFLKSRQYEVANPDAIPDSELPCLVVTLPVVCATVVLE FT DVPFLEAFGSFRKLQRVVGYALRFVNNCKSSHADRILSKTLTVPELRKSMI FT VIIGLVQRIEMSDEISLVSSKKPSRRLAALDPMLVDGLLRVGGRLENSRLP FT YEARHQIILPNKSPVCDLLIRDMHLENLHLGPSGLLSLLRQRFWLINAKST FT IRKVLRKCVTCFRSRPRCLQPMMGRLPETRVVPSAPFSRTGVDFAGPVFVK FT VGLRKSVRVKAYICIFVCLATKAIHLELVSDLSSAAFVAALQRFVSRRGLV FT EEIHSDHGTNFVGAKSDLHDLYFLFRDDQTKGKIDEFCCAREIVWKFIPPR FT SPNFGGLWEAGVKSTKTHLRKVLSNACLTFEEFYTVLTQIEAVLNSRPLFA FT NSLDTHEPSALTPGHFLIGRELVAIPEPTLEDIPANHLTRWKYLQSLREHF FT WRRWSNEYLNTLQARSKWFKGSQNVIPGLIVLVKEDNLPPQSWKLGKIMKV FT YPGKDLIVRVVDVKTATGVYRRPVSRLAPLPIEENDDALASADPQ" XX SQ Sequence 5727 BP; 1448 A; 1400 C; 1386 G; 1493 T; 0 other; ttgtttggtc ccgattcgaa ccggatagca gcatcggtgg cattccaagc gcagctggaa 60 ggatatttag taagctctgg tggaggaaaa cgaaattgct ccgcgattat ccctccccgt 120 acggcctcac tcgcgttccg gaagtacagt tcgcgcgaag tgaaaaaagt gaaacaggtc 180 cgaacaagtt cggtgaaaat tgtagtgtgt gtgtgagccc atcgagggct gtgaatcctg 240 tccgaacaag tttcggaaaa gacagttcag ttgtgttgtg agcccgacgt gggctgaaaa 300 atcctgtccg aacaagtttc ggaaaagact ttagtgtgtg tgtgaaaaag gacagttcca 360 gtagtgtgtg tgcccaaccg ggacgaagaa aaagtgctaa aacgtgtgtc gataatgcta 420 gaacacaccc ccgtgaaaag caagcaggac aaaatggccg agctgaagac gctcgtccat 480 ctccgtggcc aagccaaggc caaagtgacc cggatccgga aagccatcga ggaagttatt 540 gaaggcggcg tcgtgcagtt caccattgcc cagctgaaag tgtactcgag gaatttggag 600 acccatttcc aagagtacct caacttgcac caccaaatca cagctttact gccaccggaa 660 aagctggatg acaatgatgc cacctacctt cagttcgaga cccaccacac ggaaaccgcc 720 atgttggtgg aaggcctaat ttcgtccgct accccgcaac ccaacccacc accgcatgtg 780 ctccaacccg atgcaggtat gatgcaacca cagattgttg tccaacagca accacttccg 840 gttcctattc cgtcatttga tgggaagccg gaaaattggc cacgattcca gcaagtgttt 900 tccgacatca tgaatcgatc cagggactcc aatgctgtga agctacacca tcttgagcgg 960 gcactagcgg gtggttccgc cgcgaaacta atcgacccga agacgctcag cgacggaaat 1020 ttcgcccacg cttgggagct attgcttgac gtgtacgaag acgaacgaaa ggcagtagac 1080 tctcacatcc atggcttgct gggtttgaag cggatgaata aggagagctc caagcagatg 1140 agggagcttt tgaacgaggt aacccgccac gtcgaaggat tgaagctgtt ggagcaggat 1200 ttggaaggcg tatccgagag gtttgtcgtc gtcatcctgt ccgatgcatt cgaccccgag 1260 acgaggaagc agtgggaagc tacaattccc cataaggaaa tcccaacgta tgaagacacc 1320 atcaagtttt tgaaggagcg atgttctttg ttggaacgtt gcgaagccag ccacccgaag 1380 ttgtcgtcca ccaaggaatc caaccccaag tccaccccga aaccccaacc ggcgaagtcg 1440 ttcgctatgg ctacacccgt agctaatagt gaggcgacct gtgagatctg tagtgcgagt 1500 catcccaact tcaagtgcga caggtttgtt tccctaacgg taccccaaag gagagttaag 1560 gtaagagatc ttaatctttg ctttaactgt ttgaggaagg gccaccttgc cttaaaatgc 1620 tcttccagca aatcttgcca ggagtgtcaa ggaagacacc ataccctact gcatgagccc 1680 aaaattccaa aacccgatcc aaaacctatt gacattccaa aatccaaacc agaatctgcc 1740 caagttgtcg ttagttccga agttccacct ccagtggaaa gtgtttcgcc tcctgttacg 1800 tctaatcctg tccataaggt tcctcggtcg gatggcacac ttttgcttac ggccatggtt 1860 gacgtcctag acagccatgg aaattcccat ccttgccgtg cattcctcga ctgtggttcc 1920 caaccacatt tggtttcccg ttcattagta gaatgtttgg gaatcgacca ggaacctgtc 1980 cacgttgaag tgtttggagc aactggtaaa agatcggtgc tcgataggaa ggcgacactt 2040 tcattccgct cccgttgttc gaattaccaa tccaaaatcg aatgtttagt caccgatgtg 2100 gtgactggtc ccctccctgg tcgaaggata gatacccagt cttggaacat tccctcaggt 2160 ttgttgcttg ccgatccaaa cttccatact cctggagaag ttcagctttt gattggaatg 2220 aagcttttct tccatttaat gttgcccggt cagttgatgc ttggtagtca gctgccaatc 2280 ctaaaggaaa ctcgtttggg atgggttgtg gccggtggtg ccgaggatgt tgatgagaat 2340 cagcattgct atacagccag tttgcgaccg ctcacggaat gtatggagaa gttttggtct 2400 gttgagtcgg ttgagtccat tgacgccgtg tctaaggaag agcaagattg tgaacagctt 2460 ttcagttcca ctaccgttcg aaatcctaat ggtacttaca ctgtccatct tccgttgaag 2520 gaatccgtaa gcgaactggg aaataaccat tcattggctt tgaaacgatt ccacattctc 2580 gaaaatcgct ttgccaagaa tgctgacttg aaacaagcgt acgtcgattt cataaacgaa 2640 tacaaatcgt tgtcccattg ccagcctata gctgaatctg atgatgcccc tgacaagctc 2700 gttcactatt tgcctcacca cgcggttttg aaaccaagca gttccactac ccgattaaga 2760 gttgtcttcg atgcctcggc tcgatcaact ggtccatctc tgaacgatgt tctaaagatc 2820 ggtcctaccg tccaggacga tctgttttcg ataatcctcc gattccgaaa gcatcggtat 2880 gccataacag ctgacatctc caaaatgtac agaagaatcc gggttgatga aagtcattct 2940 ggtcttcaaa gaattttttg gagggaaaac cccaatgatc cactccagat tcttgagttg 3000 agtaccgtca cttatgggac agccagtgct ccatatttgg ccacccgaac acttttgcag 3060 ttagcccgtg atgaatccca gcgttttcca atagcgtcca aggtagtaga agaggatgtc 3120 tatgtggatg acgtggtgac gggagctgat actgttgagc aggccgtcaa tctccgttca 3180 gatttgacaa atctgttcaa ctctggcgga ttcccagtcc ataaatggtg tgcgagtgat 3240 gcaaccattc tcgacactgt tccagaggaa gaccgcgaga agtttataga acttgaaagc 3300 tcagatgtca acagagttat caaaaccctg ggcctactat gggatccaga gcaagatata 3360 tttcgttttc attgtgagct gaaattcgaa ccattgtcca tgccaaccaa acgtatcgtc 3420 ctttccgaga tagcccggtt gtttgatcct ttgggactag tcgctccggt gatagttttg 3480 gccaaggaga ttatgcaacg tttgtggaag aataaggttg gatgggacga tcctttggat 3540 gaggatactt tgaagtcttg gcttgaattt cggaattctt tgaagcatat tgacatgatc 3600 aaaattccgc gcttcgttgg gctggaaggt gtcgagtacg ttgaagtaca tggttttgcc 3660 gacgcttcca gtcgtgccta cggagctgtg gtctacctga gagttgttgc tgccaacggc 3720 atcaagatct cattgctctg tagtaaatca agaatcgccc cgctaagtcc tttgacaatt 3780 ccaaaaatgg aattgtgtgc cgccctgttg ctatcaaggt tgattcccaa gtccatctcc 3840 gctttgaaaa tagacttcca acgcgtctgt ttgtggtccg atagtagcat tgttctcgca 3900 tggctcaaga aatctgccaa ccagcttgag gtctttgtcc gcaaccgtgt gacggagatt 3960 aataccgcta ccgaaggcta tgagtggtcc tacatccggt cggaagaaaa cccagctgat 4020 gtcgtgtcca gaggccaaat gccacaaaac cttgcgagca attgtctgtg gtggaacgga 4080 ccatcttttt tgaaatcgcg ccagtatgaa gtagcgaatc ccgatgcgat cccagattct 4140 gaactaccct gtctagttgt gactctgcca gttgtctgtg ctaccgttgt tttggaggac 4200 gttccctttc ttgaagcttt cggatctttc cgtaagctgc agcgagtagt cggatatgct 4260 ttgcgctttg tgaacaactg caaatccagc catgcagatc gcatactttc caaaactctg 4320 actgttcccg aactgcgtaa gtcgatgatt gtcatcatcg gactggttca acgtattgaa 4380 atgtcggatg aaattagcct ggtatcatcc aagaaaccca gtcgtcgttt ggctgcgcta 4440 gacccaatgt tggttgatgg cttactaaga gtaggtggta gattggagaa ttctcgtctg 4500 ccttacgaag cgcggcatca aatcatactt ccgaataaga gtccagtttg tgacttgctg 4560 attcgagaca tgcacctgga aaaccttcac cttggtcctt ctggattatt gtccttgctt 4620 cgtcagaggt tctggttgat caacgccaaa tcaaccattc ggaaggtttt gagaaagtgt 4680 gtgacttgtt tccgcagtcg tcctcgttgt ctgcagccca tgatgggccg tttacccgag 4740 actcgcgttg ttccttccgc tccattctct cgtaccggcg tggactttgc cggtccggtt 4800 ttcgtaaagg ttggcctccg aaagtccgtg agagtgaagg cctacatatg tatttttgtt 4860 tgccttgcca caaaggcaat ccatttagaa ttagtttctg atctgtcatc tgcagcattt 4920 gtcgctgctc tgcaaaggtt tgtgagccgt cggggtctcg tggaagaaat ccattccgac 4980 cacggcacaa actttgttgg agctaaatcc gatctccacg atctctattt cttgttccgc 5040 gacgatcaga ctaagggaaa gatcgatgag ttttgttgcg cacgagaaat cgtctggaaa 5100 ttcattccac ctcgatctcc caactttggg ggattgtggg aggctggggt gaagagtaca 5160 aaaacccacc tacggaaagt cttgtcgaat gcttgcctca catttgagga gttctacact 5220 gttctcacac aaatcgaagc agtgttaaac tcgcgtcccc tcttcgctaa ctctttggat 5280 acccacgaac cctctgcctt aacgcctggt catttcctta tcggcagaga attggtcgct 5340 attcctgagc ccacgcttga agatatccct gcaaaccatt tgactcgatg gaaatacttg 5400 caatctctcc gtgaacactt ttggagacga tggtcaaacg aatacctcaa cactctacaa 5460 gcccgctcta agtggttcaa gggatcccaa aatgtaattc ctggattgat cgttctggtt 5520 aaggaagaca atcttccacc gcagtcatgg aaattgggca agatcatgaa ggtttatcct 5580 ggcaaggatt tgatagttag ggtagtagat gtaaaaactg caaccggggt ataccgacgt 5640 ccagtttccc gactcgcacc acttccaata gaagaaaacg atgacgcact cgcatctgca 5700 gatccacagt gaatcccggc gggtgta 5727 // ID Gypsy-77_CQ-LTR repbase; DNA; INV; 185 BP. XX AC AAWU01044424; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-77_CQ_; KW Gypsy-77_CQ-I; Gypsy-77_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 534-534 (2011). XX DR Genome; AAWU01044424; Positions 8470 8286. XX SQ Sequence 185 BP; 51 A; 48 C; 27 G; 59 T; 0 other; tgtagtatcc acgatcagag tgtaaaactg ttatacgcaa agaattctct cagagagagt 60 acctctcttt acccaactgt actttgaatc tgaaatacat cgttcttctt cgtttgacca 120 cgcacgcgac tggacgtctt ttttattacc acctccgaca ccgacttata actttacatt 180 atcca 185 // ID BEL-602_AA-LTR repbase; DNA; INV; 509 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-602_AA_; KW Pao_Bel_Ele218; BEL-602_AA-I; BEL-602_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-509 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 509 BP; 172 A; 85 C; 97 G; 155 T; 0 other; tgttaacact accgcttggt accaccctag tggacgaacc caacagatcg acaagtactg 60 ctgtctgacg tacaaatgtc atggatactt tggcgaacgc tgtcactgtc agcgtagttt 120 agcgaagatt ataaatgcgg aaaaaagaag aattacgttt cgttagacaa aatcacattt 180 gagtcaatcc aagttgaatt cgttcatcct aattatattt cggttaaatt gttgaatttg 240 attaatattg tgtagcacac gtgagtaaac gtattctgaa cgttaaatgc gagggttaca 300 gtgctagaaa attattctta taattacagg ccggaataca ttaagaattc gtatgaaatt 360 gttcacccga attgaagtaa ttgaaaacca ttgatgtaag acaccatgta attaaaactt 420 gaaattattc taataaaaca tatttgcagt tttgagctgt acgacaaact cgtctgctgc 480 aaaaggattt ctaaatttcg gtgtgaaca 509 // ID CR1-13_CQ repbase; DNA; INV; 3854 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-13_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3854 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 15-15 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 1..669 FT /product="CR1-13_CQ_1p" FT /translation="KCRVLEIAEAAGAAAPSFVFPTVPVFRFGSTXNRQSG FT SSRSASGQPQAHNAFQTSQSTLAGFHLPAAXSXTAFSLAANTRVXPINNNP FT QSVPQQKAARPRNRKQLQPHLXPFLQPPLVQHNLAPPLQQTAPAFTCKHFY FT VKPFDPATTAEQIANYIQSRTGXXYDNFNCQRLTSESRSDSRPLTFVSFKV FT TVPDVVAFTNVITNSAFWPDFVSVDSFTSRRRS" FT CDS 673..3621 FT /product="CR1-13_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="QSKPCSALASVTSNTRRDILSVYYQNLGGLNSSLAEY FT KLACSDASYDLYAFTETWLQSHTLSSQLFDDTYEVYRKDRSSSTSSKSSGG FT GVLLAVRKELKPREMKVPDTSEVELVWVILPLTEKSVHICVVYLPPDRVND FT SELVERYTQSLLWVVSKLKANDSICVLGDFNLSCLKWISDKDGSLFVDVSR FT STVSTVTARFLDDHHLANLAQISEITNENNALLDLCFVSNDVTEYYRVSRA FT PAPLVKDVRHHPPLHLSIFDSAPCKFADSKTAVYYNFKKADYIGMNTFLNN FT MNWEELLASRNVHSAAEVFSHVLLYAIDQFVPKKCRKPPVTPEWSTPALKR FT LKSSKRSALKCFSKCHSSFYRLQVNNINNRYKHLNKKLFLAHQIRVQKHLK FT KNPKGFWNYVKKQRCESGLPNSMVRDGVDATNTREICDAFRAQFSSVFTHK FT NLSEQEISNAAARVPESLPIPSNVVISHEDFRTASAKLKCSHSAGPDGIPS FT VVLKSCSNSICLPLTKIYNMSLSSGVFPDRWKESFVFPIFKKGCKRDICNY FT RGIAALCAMSKLFEVIVLNFFLHNFSSGIAYDQHGFLPKRSTNTNLVTYVS FT TIQRAIQSGYQVDAVYTDLSAAFDKIDHRIAIAKLRRIGLHGTFIDWVYSY FT LTGRSMQVKIADCLSSPFLVTSGVPQGSHLGPFIFLLYINDVNYQLRCSKL FT SYADDFKLFTIIKKPSDCRFLQQQIKVFSDWCQLNQMVLNPTKCSVITFSR FT KRSPVCFDYTLFENSLKRDRCVKDLGVMLDAELSFKDHVAYMTSKASKLLG FT MIFRMTKYFRDIQCLKTLYCSLVRSTLEYGSIVWAPFYANEINRIEAVQRK FT FTRFALRYSPNQSLDYASRCDFLHLDFLSVRRDLAKTVFVSDLLRSNIDCP FT YLVGQLNINIRRRTLRSFAFLTLPFARTNYAFNEPFSSMCRVFNRCYSSFD FT FNLSRPVQKLQFLTFLREENPRPLNPLV" XX SQ Sequence 3854 BP; 1003 A; 995 C; 723 G; 1124 T; 9 other; aaatgccgcg ttctcgaaat tgccgaagcc gcaggtgctg ccgccccctc atttgtattc 60 ccaaccgttc cagtttttcg tttcggatcc acwamcaacc ggcaatctgg cagctctcga 120 tcagcgtctg gacaaccaca agctcacaat gccttccaaa catcacaatc aacattggct 180 gggttccatc ttcctgcagc caamagtakw acggcatttt cgcttgctgc caacactaga 240 gtgcawccaa tcaacaacaa tccacaatct gtgccacaac aaaaagctgc tcgtccccgg 300 aaccggaagc agctacagcc ccacttamag cccttcctac aaccaccatt agttcaacac 360 aatttggctc ctcctcttca gcaaactgcc ccggctttta cctgcaagca tttctatgtt 420 aaaccgtttg accccgcaac aacggccgag caaatcgcta attacatcca aagccgtacc 480 ggctgsmgct atgataactt taactgtcaa cgactgactt ctgagtctcg ttcagactct 540 cgaccactaa cattcgtgtc gttcaaagtt accgtgcccg atgttgtagc ttttaccaat 600 gttattacca actcagcatt ctggcccgac tttgttagcg tcgattcgtt cacttcaagg 660 cgtcgttcat agcaatcaaa gccctgctca gctctcgctt ccgtaacatc gaataccaga 720 cgagacattc tttctgttta ctaccaaaat cttggcggcc tgaatagttc cttagctgaa 780 tacaagctcg cgtgctccga tgcttcttat gacctttatg cctttaccga aacttggtta 840 cagtctcata ccctatccag ccagctattt gatgacactt acgaagttta ccggaaggat 900 aggtcaagtt ctactagttc gaaatccagc gggggcggtg ttcttcttgc tgtacgcaaa 960 gaacttaaac cacgtgaaat gaaggtcccc gacacctccg aagtcgaact cgtctgggta 1020 attctccctt taactgaaaa atccgttcac atttgcgtcg tttatcttcc accggaccgt 1080 gtcaacgatt ctgagcttgt cgagcgttat acccagtccc tcttgtgggt cgtatctaaa 1140 ttgaaagcta atgacagtat ctgcgtcctg ggcgacttta atctcagctg cctcaaatgg 1200 atttccgaca aagatggttc ccttttcgtc gacgttagca ggtctaccgt tagcacagtg 1260 acagcgcgct ttttggacga ccaccatctg gcaaatcttg ctcaaattag tgaaatcaca 1320 aacgaaaaca acgctttgct tgatctttgt tttgtcagca acgatgttac ggaatattac 1380 cgagtttccc gcgcgcctgc acctctggta aaagatgttc ggcatcatcc tccgttgcac 1440 ctgagtattt tcgatagcgc tccctgtaag ttcgccgata gtaagactgc tgtatactac 1500 aacttcaaaa aagctgacta cattggaatg aacaccttcc tgaacaacat gaattgggag 1560 gaactgcttg catcacggaa tgtccattct gctgcggagg ttttttctca cgttctattg 1620 tacgcaattg atcagttcgt ccccaagaag tgtcggaaac cacctgtcac tccggaatgg 1680 tccactcctg ctctcaaacg actaaaatcc tccaaacgtt ctgcgttgaa atgcttctcg 1740 aagtgccatt ctagcttcta ccgtctccag gtcaacaaca taaacaatcg gtacaagcac 1800 cttaacaaga aattgtttct cgcgcatcag attagagttc aaaaacacct gaagaaaaat 1860 cctaaagggt tttggaacta cgtgaaaaaa cagcggtgtg agtccggcct tcctaattcc 1920 atggttcgtg acggcgtcga tgcaacaaat acgagagaaa tttgtgatgc gttccgtgcc 1980 caattctcta gcgtgttcac ccataaaaat ctttctgaac aagaaatatc caacgctgcg 2040 gctcgagttc cagaatcgct acccatccct tcgaacgttg tcatctcgca cgaagacttt 2100 agaactgcaa gtgccaagct aaaatgctcc cactccgccg gacctgatgg tatcccctcg 2160 gttgtcttga aaagttgttc taacagcatt tgccttcctc taacaaaaat ctataacatg 2220 tctcttagct ccggggtgtt tcctgaccga tggaaggagt catttgtttt cccgatattc 2280 aaaaaaggct gtaaacgaga catttgcaac taccgcggaa ttgcagccct ctgtgcaatg 2340 tctaaacttt tcgaggttat cgtgcttaac ttttttctgc ataacttttc ttcgggcatt 2400 gcttacgacc agcacggatt tctccccaaa cggtcaacta acacaaacct tgtgacttat 2460 gtttccacaa tacaacgagc aatccaaagt gggtaccagg tcgatgcggt ctacaccgac 2520 ctctctgcag ctttcgacaa gatcgatcat cgaatcgcaa tcgcaaaact tcgacgtatt 2580 ggcttacacg ggactttcat tgactgggtt tactcctacc taaccggaag atcgatgcag 2640 gtaaaaatcg cggattgcct ttcatcgcca ttccttgtga catccggggt acctcagggc 2700 agccacttag gtccatttat attcctgctc tatatcaacg atgtcaacta tcaacttcgg 2760 tgctcgaaac tttcttatgc tgacgatttc aaactgttca ctatcatcaa gaaaccctca 2820 gactgtcgct tccttcaaca gcaaatcaaa gttttctccg attggtgtca gctgaaccaa 2880 atggtgttaa accccactaa atgttcagtt ataacgtttt cacggaagag gtcgccggtc 2940 tgcttcgact acacactatt tgaaaactct ttaaaacgtg atcgttgtgt caaagatttg 3000 ggtgtcatgc ttgacgcaga actttcattt aaggatcatg tggcttacat gacttctaag 3060 gcatccaaac tgctcggtat gatcttcaga atgactaagt actttcgaga tattcaatgt 3120 ttaaaaactt tatattgttc actagtcaga tccaccttgg aatacggttc tatcgtctgg 3180 gcgccatttt atgctaacga gataaatcga attgaagctg ttcaacgcaa attcactcga 3240 ttcgcattgc gctactctcc caatcaatct ttggattatg ccagtcgatg tgatttcctt 3300 catcttgatt ttttgtctgt caggagagat ttagcgaaaa ccgtgtttgt ttctgatctg 3360 ttgagatcca acattgattg tccttacttg gtcggacaac tcaacatcaa cattcgccgg 3420 cgaacacttc gctcatttgc ctttctgacc ctaccctttg ctcgcaccaa ttacgccttc 3480 aatgaaccat tctccagcat gtgcagagtc ttcaatcggt gctactcgtc tttcgatttt 3540 aatttgtctc gtcctgttca aaaacttcaa ttcttgacgt ttctccgaga agaaaatccc 3600 cgacccctta accctctagt ttaggtttac acttgtagct tatttattta tttatttatt 3660 tatttattta tttatattgt atgtaatttt cgttagctta ataggttagt ttaagtaatt 3720 atgtaaaagc ctttttgtca tttggactgt aaatctgttg acaataaaaa aagaggtttt 3780 gtgcctattt gagaaggacc catcggttgg cgatccactc aaacgggctt ttccctccaa 3840 acaaaacaaa acaa 3854 // ID Sola3-2_BF repbase; DNA; INV; 8070 BP. XX AC ABEP01046127.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Branchiostoma floridae. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-8070 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR EMBL/GenBank/DDBJ; ABEP01046127.1; Positions 30459 22390. XX FH Key Location/Qualifiers FT CDS join(3142..4122,4688..6634) FT /product="Sola3-2_BF_1p" FT /translation="LIAQSDFQEETHGSLSSSTSAGGAERSLYTVESHTSS FT GDAGSTAPEPKALLGQLLYASGSKVEKVKVLSVPWAEASQRTQRFHLQQAS FT DAVSSVLQVLAPDDSYSLWKSLRDSRLADVALGVGANVIEMEILTALVECY FT NCANQSFTRRQILSIMADKYSFSELEKLMPGLTRYQVTAAKKHMYQHGRGA FT PLPPSPDVRMRVDPGKLDHFISFITSAHVVQDLPFGEKTLKLSSGETLKVP FT NTIRAMIPERIVSQYQQYCAETHVVPMGKRTLLRVLSACSASVRTSLQGLD FT YFTAEGNNNMLLFQSMQDIDTQYISCTIWLSGMILWIKMCLTFLQVHVSQE FT TRVAEHCRKYALSDKDGHFRATCSHEHTDTCEACGNLHQIFDCLEESFTTF FT NFTSPEEQDEVRFIIDQARQDILAWQSHQLRSINQDAVWYSLLDSLDESSV FT LLVQDWAMKFLPRKFRESQTDWFGKRGISWHLTVAYRCVNAVIEAQTFVHL FT FQTCTQDSNTVVSVMKHTVEELKKDYPSLTSLYVRSDNAGCYHNVLTLQAS FT KSISETTGVTVKRIDFCEPQGGKGSCDRQAATIKSHIKAWINEGHNVETAD FT EFKAAVESHGGVPGVKVYRCEVNVDATVTSTKFEGITKLNNFEFTDAGLRV FT WRAFDIGEGKLIPWSKLPAVEPSTLLIISEPYNPGSAFRPVKSRRKKSQPQ FT LETESDSEEETVSHETPAQTQVNIFPCPEEGCVKVYQTCRGLDAHVAVGRH FT NRRLERETLLDKAKLMYAAKLEKGPSEVPRLETASEGKRTTECPVEGWALR FT GPRKRVRFTAKQTAYLDRTFHLGERTGKKSDPQTVAKAMRHARDLQGQRLF FT GVNDFLTTQQVSSYFSRLAGKRQKPTEDDEVEQLDPAVLESINSKVQNEVM FT NNVHVRHPVLYDTYNLCDLAIRGDLYELGMPFLKQICEHFDIDVSSITDRR FT RKSAFVAKLQEYLSSCPCSK*" XX SQ Sequence 8070 BP; 2447 A; 1550 C; 1773 G; 2300 T; 0 other; gagcaaatgt ctgtaaggat caaacaatgg tggcaccggt caaaaaatag atttggctca 60 tattttgcac agtgatagta cttggtacac aacccctcaa aatagctttc tttctgctca 120 gaacctcttg ttgctatggc aacaggccaa agaagtttta gggccatttt tccagatttt 180 ggcttctaaa aaacatgaaa attggccaac tttagggcac tatcactggt caatacttaa 240 acttaaagat aaatgcagaa tatgatggtt agagcaatgc ctaaacttgt atgaaatagc 300 agtaaagtca atgttctgca aaatctaagg tcttgggcaa cccgaataca atacgctgtt 360 ttcaggttgt aaaaaatttg aaatttggcc caactttgaa gcactgaaac tccataagta 420 taaaatattt ttgcttgcaa tttgcatcga tgaataaccc actaaagagc tatcagacgt 480 atacctgcga agtttgggtt cttgttttgt tgctgcgtaa ttgtccttta aagtgggcca 540 gttttcatga ttttgatgac atgtcgtttt gtgctatctt tgcacaatca taagtttcaa 600 accaaacttg ttctgtactt gacagttaca catgatatac accagtatat tgtgtatcaa 660 acacatgcat ttacagttct ggtccttgtt ttgttgctac gcaatcgtct tttaaagtag 720 gtcaattttc atgattttga taatatgtca ttttgtgcta tcttccaact accataagtt 780 ccaaagtaca attattctgt aatctaattt tacatgaaac ataaaccaat atattgtcta 840 ttaaatgcat gcttgcagag gtcaggttct tgtattatag cagtgtactt gtccctgaaa 900 gtaggtcgta ttttgcactt gtcaacaaat cagaaatttg gtatggttac acaggaataa 960 ctcactgctt gcagtttttc cagatgtttg cctttggatt gtttataaaa caaatgttga 1020 gtgtggtatt aatgtttgtg ttattgtagc tgtgtggctg actttaaaag taggtcacat 1080 gttgcacata tccacttgtc tcaatgcttt ctgtgccaac tactaaggta tgagatgtat 1140 gtagacatat taatgtcccg caccatgttg gaaaggtgct atggtggtcc attgctacaa 1200 tatgaatttc agctttattt agcttccttt cttgggggca acatataaca tatcacccag 1260 ttcctcgaat atagtgaatc tccccttagc ttggtggcga gggcaagaaa ctcgagaagt 1320 ccttatctct ttcatctagt gttcttgctg ccttatcttt ttttgtccag acactgtctg 1380 gctttgggtg aaacgttagg tttacagcac ggctagtgga aggtttgtcg ccgtcctggt 1440 tagagaaaat ttgaatccgt ttttaccatc tttttctcgc tgtcatactg tggttggatt 1500 aacacgttgg aaagaacggg gattggaatg atgtagtgga tattgatcag tgacatctat 1560 ggccagtcgt accatccaca gcaacacaac actttcttca ggagcgctgc aagtactggt 1620 aagttgtata ttatggtaga aaatttgtac atatgatagg gttatatttc tctcatttta 1680 atgcatatta ttattttgaa aaaaaaatat tgcccttttg tcgcagtatc tgtgagccac 1740 tttggtctaa atgtttatta tttatttgat ttgaaggtct ttcgtaagca gggtaaacag 1800 aaatagtaat atcgtttcat ccaattattt cttttaactt acaggccctg tgcatcatgt 1860 cgtgttcgtt tgcagagttc gcaagtgggt catgtggagc taacccaaac gctccagcag 1920 agggcgaaaa gatcataccc ctgaataact gcagtaaaga catttcaaac catctgagtc 1980 tattccaaat tcccaataca gtacagactg agcatgatct gattttagct cgagtaggac 2040 tgtttgccca tcccgattat gtcagcgaca tgacgatatg ccctaagcac agaggcgcat 2100 ttggtctcag ttggaggtgt ggaactcaaa aatgcaaaat gccagtacaa ctgtctaacc 2160 accgtcgctg gggagcacga cccaaaggcg acagggggat tgggttcttc cagtcaaaga 2220 aggtatacga gcttgccggg gagttggttc cagttggttc aggtgagact cgtttatgaa 2280 ggtagaagga attgatttct ctttgctgtc agtcagattg ctaatgagca ttgcatgtac 2340 aaaaatattc atgttgtcat gctgttgttt ctgtagttat atgccgtaac tgccgtacag 2400 ccatcggggg ggaagaagat ctgaaaggag aagctattcc ccagcagagt gaaatcggtg 2460 aacatgaacc agtgcccagc accagcggtc ttgacgagtc gcggcttgag caactgacac 2520 tggtatgact tgttctcaag tacaatgggt catgcaattt ttataaaaag ccaactactt 2580 acaaacttgc caaacattta gagaaaaatt tcttctaaca aacttcacgt ttggcaatag 2640 ttgataattg attgcataat tcaattctca ggaaacagta ccacaggaaa gcgaagcagg 2700 agaacaagga caagttcccg gcaccagcgg tcttgaggag tcgctccagc aaatgacact 2760 ggtatgttta tttattactt ccgccaagaa ggatataatt ttgtgtgttt gtcacagtct 2820 actgaaattg tgtctgtgtc aataagccga gagggccaag atagattatt ttgatgtttg 2880 gtatgtggtt attaatagat gtaaatatac aaatgtgaat ctgtcaaaat catttgaaaa 2940 taatttttga gttgggaaat tacatcatgt acatgaataa taagggaaat ctgtcatagt 3000 taaacctggg gaagggcatt acggaacatg caatttcaaa tggatgtgca gttatataca 3060 ctacaagtac tagtacaaca ccaaaacatt cagagttgaa caacttttag aagaattcat 3120 ttcgataatt gatcgcaata attgatcgca caatccgatt tccaggaaga aacacatggc 3180 agcctttcat catccacatc cgcaggagga gcagagagaa gcttgtacac agtcgagtca 3240 catacatcta gtggtgacgc cggttcgact gcacctgagc caaaggcact attgggtcaa 3300 ctgctgtatg caagtgggag caaagttgaa aaagtaaaag ttctttctgt gccatgggcg 3360 gaggctagcc aaagaaccca gcggtttcat ctccaacagg ccagtgatgc agtcagctcc 3420 gtcctccagg tactcgcccc agacgattcc tacagtctgt ggaaaagtct gagagactca 3480 aggcttgcag atgtggccct tggggtgggg gccaacgtga tagaaatgga gattctgacc 3540 gctcttgtag agtgctataa ctgtgccaat cagtcgttta cgcgaaggca aatcttgtca 3600 atcatggctg ataagtatag cttcagcgaa ctggagaaac ttatgccagg attaacaaga 3660 tatcaggtaa ctgctgcaaa aaagcacatg tatcagcacg ggagaggagc ccccttgcca 3720 ccgtccccgg atgtaagaat gagagttgat cctgggaagc ttgatcattt catctccttc 3780 attacaagtg cacatgttgt gcaggatctt ccatttggcg agaaaaccct gaagctatca 3840 tcgggagaga cgctcaaagt tccaaacacc atccgtgcca tgatcccaga gaggattgtt 3900 tcccagtatc aacagtattg cgcagagaca cacgttgtgc caatggggaa aagaactctg 3960 ttgcgcgtgt tgtccgcctg tagcgcttct gtaaggacgt cgctgcaggg ccttgactac 4020 tttacagctg aaggtaacaa caatatgtta ttgtttcaat ctatgcaaga tattgacaca 4080 cagtacatta gctgtactat atggcttagt ggcatgattc tctaatttaa aagtagtttg 4140 caagggactg gaatattttg tcatcatcaa attaatattg acctttattt gcatattcaa 4200 aagcattttc gaaccattct aaatttcagt caatttcaat cattctgaaa aacttgggta 4260 gagtttgctt caaatgttat ttgaaacatt gacaagtatt atctagttga atcagaagtc 4320 acccgccaca tacatttatc atatagaaca gttccaaaaa ttctttaatt taatttcatt 4380 caatgacatg ttttcatcta ggtggcaggg catttgagga catagaaggt gttctagaaa 4440 agatggcaga acagcacggg caagaatggg caaagactca ggcgaccatc ctaagatccg 4500 cgaagagata ccttaaagga gactataagg tacgtcatac cctgtaagtt aaaatggaaa 4560 acaaatatag tcaacgtttg acacatgttg gtttagaatg ctttaatatt agctgtcaac 4620 aaacagcata ttgtacatta tgttctggtg caaattgttt ggtataattt ttccctttat 4680 tcactaatgg ataaaaatgt gcttaacatt tctacaggta catgtgtctc aggagacaag 4740 ggttgccgag cattgccgga agtatgcact cagcgacaag gatggccact ttcgagctac 4800 ctgttcacat gagcatactg acacctgtga agcatgcggt aacctacatc agattttcga 4860 ctgtcttgaa gagtcattca caaccttcaa cttcacatca ccggaggaac aggacgaagt 4920 tagatttatc atcgatcagg caaggcagga tatcttggct tggcagagtc atcaacttag 4980 aagcatcaac caggatgcag tgtggtattc tctgttggac agcctggacg aatctagcgt 5040 acttcttgtg caagactggg cgatgaaatt tttgcccaga aagttccgtg aaagccaaac 5100 tgactggttt ggaaagcgtg gcatttcatg gcatctgaca gtggcatacc ggtgtgtcaa 5160 tgcggtaatt gaagcacaga cattcgttca tctcttccaa acatgtacgc aggacagtaa 5220 caccgtggta tctgtaatga agcacactgt ggaagagctg aagaaagact acccatctct 5280 tacatcactg tatgtcagaa gtgataacgc cggatgttat cacaatgtgc tgacattgca 5340 agcgtcaaaa tccatcagcg aaacaacggg tgttacggtg aaacgaattg acttctgtga 5400 accacaaggc ggcaaaggtt catgtgacag gcaggctgcc acgattaaat ctcacataaa 5460 agcctggatt aatgagggac ataacgtgga gacggcggat gagttcaagg ctgctgttga 5520 gtcacatggt ggggtgcctg gggtgaaagt ctaccgctgt gaagtgaatg tagatgccac 5580 tgttacatcc acgaagtttg aaggtatcac caagttgaat aactttgagt tcacagatgc 5640 aggattgcgt gtttggcggg cattcgatat aggggaaggt aagctaatcc catggtccaa 5700 gttacctgca gttgagccat ctacactgct gatcatcagt gaaccttaca atccaggaag 5760 tgcatttcgg cctgtcaagt caagacgaaa gaagtctcag ccccaacttg aaacagagtc 5820 agattcggaa gaagaaacgg tgagtcatga gactccagcc cagacacaag taaacatctt 5880 cccgtgccca gaagagggct gtgtaaaggt gtaccaaacc tgcagggggt tagatgctca 5940 cgtagcagtt ggcagacata atcgccgcct tgaaagagag acactgttgg acaaggcgaa 6000 gctaatgtat gcagcaaagc tggagaaagg tccctcagaa gtgccacgtt tggagacagc 6060 gtcagaaggg aaaagaacaa ctgaatgtcc tgtggaagga tgggccttac gtggcccacg 6120 gaagcgggtc aggttcactg caaaacaaac ggcatacctc gacaggacat ttcacctggg 6180 tgaaagaacc ggtaaaaagt cagacccaca gacagttgct aaagctatga gacacgcaag 6240 ggacttacag gggcaaagac tgttcggggt caatgacttc ctgacaacac agcaggtgtc 6300 aagctacttc tcaagacttg ctggcaaacg ccaaaagccg acggaagatg atgaggttga 6360 gcagcttgac cctgctgtgt tggaatccat caatagtaaa gttcagaatg aagttatgaa 6420 taacgttcat gtacgtcacc ctgtcttgta tgacacatat aatctctgcg atttagcaat 6480 aagaggggac ttgtatgaac ttggcatgcc attcttaaag cagatttgtg agcactttga 6540 tatagatgtg tcgtcaataa ctgatcgccg ccggaagtca gcatttgtag caaaactcca 6600 agagtaccta agttcctgcc cttgctcaaa atgagtatgg tccacaaaaa tagcagcagt 6660 tgactaggac gcatgtttac catagatatg caattgttaa ggatgggact aggtgaggtg 6720 gtgaatagaa agatgcacag atatttatgg catttcttca agtgtagaaa gaagctaagt 6780 ctttgtgtgt caatatagaa agttcagagt ggatgtatct gacattgaac acgagtaaga 6840 ctatgtgtag aatccctaga aatgtttgtt tttctttatg gtgatagtgt agttgcagat 6900 tgttacattg gttctaggaa aacgaagact gtgccatgtg tatcactgtt gttaacaaca 6960 agatttgatg gaaagtgtgg aatatgacct actttaaggg acatgtgcat acactgctgg 7020 aatacaagaa cctgaactct gcaagcattc ttggcttata ttccaaataa atttcaatta 7080 cagaataatt ttttctccgg gagttatgat agttgaaaga tagctcaaaa tgacctacat 7140 tgtattatca aaatcatcaa aattgaccca cttgaaagga cagtacatta ataacgaaat 7200 aagaacccaa actttacaag cacgcatttg gttgacaatg tactatttta tattccatgt 7260 aaatttcaat cacagaacaa atttggtttg aaacctatga ttgtgcaaag atagcaagaa 7320 ataacatatt atcaaaatca tgaaaattga cccactttaa gggacgattc cgtagtgaca 7380 aaaaagaacc tgaactatgc aagtacgcat ttgatagaca atgtattggt ttatattccc 7440 tttaaatcgc aattacagaa caagtttggt ttgaaactta tgattgtgca aagatagcac 7500 aaaacgacat gtcatcaaaa tcatgaaaac tggcccactt taaaggacaa ttacgcagca 7560 acaaaacaag aacccaaact tcgcaggtat acgtctgata gctctttagt gggttattca 7620 tcgatgcaaa ttgcaagcaa aaatatttta tacttatgga gtttcagtgc ttcaaagttg 7680 ggccaaattt caaatttttt acaacctgaa aacagcgtat tgtattcggg ttgcccaaga 7740 ccttagattt tgcagaacat tgactttact gctatttcat acaagtttag gcattgctct 7800 aaccatcata ttctgcattt atctttaagt ttaagtattg accagtgata gtgccctaaa 7860 gttggccaat tttcatgttt tttagaagcc aaaatctgga aaaatggccc taaaacttct 7920 ttggcctgtt gccatagcaa caagaggttc tgagaagaaa gaaagctatt ttgaggggtt 7980 gtgtaccaag tactatcact gtgcaaaata tgagccaaat ctattttttg accggtgcca 8040 ccattgtttg atccttacag acatttgctc 8070 // ID hAT-23_SM repbase; DNA; INV; 3422 BP. XX AC . XX DT 13-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-23_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3422 RA Jurka J., Bao W. and Tempel S.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 72-72 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(945..2420,2299..2664,2630..3121) FT /product="hAT-23_SM_1p" FT /translation="MNHNKIITLKIKPNLLLFLVLNLIIKILNSNYIFIVN FT YYSLSMSAARKKDLWNNFTRMKDGNSAVCKRCNTTIKCAGGSTSGLHNHLI FT RKHSINLLKRETLNIEAVESCNSKISSNISVSDDTNSRTKISNYFIQIKTI FT DNSFPAVLARLTARDCLSFKKICTSYDLQELFKSKGCDIPKSPMTIKRIVM FT DYGDKIRQQTVIELSERISRGELFSLTLDEWTSLKNRRFMNVNVHSYDDIY FT WNLGLVRIEGAMAADKCITYLQNKLQEFNIPTENVISITTDGAAIMKKVGR FT LIDVSHHQLCFAHGIQLAVIKVLYQKSFSDNISDENMADAATEYEHETVTI FT SKEEKGDEEEIEHFEVHTDTLPTDMPISIKHECIGPIISKIRKVVGIFRRS FT TVKNDALQKHVMVEFDKELSLILDTKIRWSSLLDMLERFYQIKNCVRKTLI FT DIKSDLELSEKELNLVSNIISALQPIKVAVETLCSRDTNTIHCGFNSNYRK FT RNLIWYQILYRPCNRLKWRWKLFVRETLTLYIADLTLKFMLDELGNQNTFI FT SNELKEALILRISERRTMYSDIFYYLHEPRSCGENSYGMFNTINKIVISKK FT IVEIIQTFSNKRFRNYFKLFQISDSATIDTESEIVCEEPELTIGSAIPSTS FT AGTSSGLSMKEKLNLVIQNRININADEEISDPKTPSGLLNIIRKEMLYFKE FT QNIKGKYLQKALKYISTVRPTSVESERALSAAGLICTRSRSSMNDETLSTM FT CFLRAFFERAIYAKLAVKIAIETKLSK" XX SQ Sequence 3422 BP; 1262 A; 473 C; 571 G; 1116 T; 0 other; taggggtaat aataccggtt taccggtaaa tttttccata ttagtgtatt tacatttacc 60 ggttttttgc aataaattac cggtaaaccg gtaaaatttg gtaatatttt aacgctatag 120 agtactgaca cgtgacgtta ttttaaagca aaagtgggca ttgaccattt aatttcaaga 180 aaaatagatt tgcttaaaat tatgcttcag atatataaat tttaaattat cataaatcta 240 atatataata tgaaaagtat aattaaaagt gttcggtgta atcaaaattc aataaactat 300 caaatttcgt tacgtttgat taaattaaac aaatagaaac tatctacaaa actgaaaacg 360 gcatttcaaa tttcatgtca gcgctcatat aatatgtaaa aagaggacat ttttcataaa 420 atttaaaata aaatatacct acttatacaa aaattatttt tctttattca attattgtgt 480 tgaaatttta ttgtccatcc gtttttgaaa taatgtgaac gctgcattat tcacttagta 540 gtaccaacca tgcaagtaaa aaaatcatag ttatcttttg atttttttat tcaaaaaaat 600 caatgttgac agaaggaaat tttgtgatcg gatttgtgta gataaaatta atagaattca 660 aataaaaact ttaacgttat ttttgtagta aaatattaac ttgtgatgtg aagtaataaa 720 aaagtaagac taagaagata taaatatgta aatttttgca atatagtggt tttatgtaat 780 tttgtcaact attttaattt aacctgagaa ataaagaagc gtaattgtag ttaaaaagta 840 caaataccta ttatcaagaa gaaaatatat gcagaatttc tttcatattt aattttggtt 900 atttccgttt taaataggtg gttattctga gtttgctcgt tttaatgaat cacaataaaa 960 taataacatt gaaaataaag cctaacttat tattattttt agtattaaat ttaattatta 1020 aaatattaaa ttctaattac atatttattg taaactatta tagcttatcc atgtcggcag 1080 caagaaagaa agatctttgg aataatttta cgcgcatgaa agacgggaat tcagctgtat 1140 gcaaaaggtg taatacaacg ataaaatgcg caggcggaag cacaagtggc ctgcataatc 1200 accttatcag aaaacacagc attaatcttc taaagagaga aacacttaac atagaagctg 1260 ttgaatcatg taactctaaa attagttcca acatttccgt ttcggatgat actaattctc 1320 ggaccaaaat ttccaactat ttcatacaaa ttaaaaccat tgacaactca tttccagctg 1380 ttttagcgcg actgacggca cgagattgcc tgtcatttaa aaagatttgc acttcatacg 1440 atctgcaaga attgtttaaa tcaaaaggat gtgatatccc taaatctcca atgacgataa 1500 aaagaatagt tatggattat ggtgacaaaa tacgccaaca aaccgtcatc gaattatcag 1560 aaaggataag ccgtggtgaa ttattcagct tgactttaga cgaatggacg tctttgaaga 1620 atcgaaggtt tatgaatgta aatgttcact cgtatgatga tatatattgg aatttgggtt 1680 tggtaaggat tgaaggggca atggctgccg ataaatgtat cacgtatctt caaaacaaac 1740 ttcaagaatt caatattcca acggagaatg taatttcaat tactacggat ggagccgcga 1800 ttatgaaaaa ggtaggcaga ttgattgatg ttagtcatca tcagctatgt ttcgcccacg 1860 gcatccagtt agccgtaatt aaggtgcttt atcaaaaatc tttttcggat aatatttcgg 1920 acgaaaatat ggcagatgct gctactgaat atgaacatga aactgtaact ataagtaaag 1980 aggagaaagg ggacgaagaa gaaatagaac attttgaagt gcatactgat acattaccta 2040 cggatatgcc aatttcaatt aaacacgaat gtattggtcc cataatatca aaaattcgga 2100 aagttgttgg aatttttcgt agatcaacag taaaaaatga cgcccttcaa aaacatgtca 2160 tggtcgaatt tgataaagag ctttcgctaa tacttgacac aaagataagg tggagtagtt 2220 tgcttgatat gttggaaaga ttttatcaga taaaaaattg cgttcgaaaa actctaatcg 2280 atattaaatc agatttagaa ttatcggaaa aggaacttaa tttggtatca aatattatat 2340 cggccttgca accgattaaa gtggcggtgg aaactctttg ttcgagagac actaacacta 2400 tacattgcgg atttaactct taagtttatg ctggatgaac taggcaatca aaacactttt 2460 attagtaacg agttgaagga agctctaata ttaagaataa gcgaaagaag aaccatgtat 2520 tccgatattt tttattatct tcatgaacca cgttcatgtg gtgaaaatag ttatggaatg 2580 ttcaatacaa ttaacaaaat agttatatca aagaagattg ttgaaataat tcaaactttt 2640 tcaaataagc gattccgcaa ctattgatac ggaatctgaa atagtgtgtg aagaaccaga 2700 attaactatt ggatctgcga ttccatcgac ttctgcagga acatcttcag ggctctcaat 2760 gaaagaaaaa ctaaatttag ttattcaaaa taggattaat ataaatgcag atgaagagat 2820 cagtgatccg aagactccat ctgggctttt aaatataata agaaaagaaa tgttatattt 2880 taaggaacaa aatattaaag gcaaatattt acagaaagcc ctgaaatata tttcaacagt 2940 tcgcccgact agcgtagagt cagaaagggc tttgtccgcc gctggtttaa tttgcacaag 3000 atccagatcc agcatgaatg atgaaaccct gagcacaatg tgttttttga gagccttctt 3060 tgagagagca atttatgcta aattagcagt gaaaattgca attgagacta agctaagtaa 3120 atgaattttg tgattaacat tataatattt cccaaaagtt ttttaatatg acattcattt 3180 tttgtgttat tagatttatt agttgtttga tttgtttttt aactacaagg gtttccagtt 3240 cgcactttca aaattttact tcaaaatata ctgtagccga taatatgtat tcattagtag 3300 gggatttttt gggtaaatat ttttattttt aatgcatact aagcttataa aaattaccgg 3360 taaaaatgta aaaaaatacc gaattaccgg tttttcaaaa acccggtaaa ttattacccc 3420 ta 3422 // ID Gypsy-7_PPc-I repbase; DNA; INV; 4624 BP. XX AC . XX DT 08-JUL-2010 (Rel. 15.07, Created) DT 08-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_PPc_; KW Gypsy-7_PPc-LTR; Gypsy-7_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-4624 RA Jurka J.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1006-1006 (2010). XX DR [1] (Consensus) XX CC Positions [3453-3914] - Integrase core CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS 291..4541 FT /product="Gypsy-7_PPc-I_1p" FT /translation="MGTNEDGEMAAEITEMKALMASMAQLLMKQCEKKPES FT LQELSNASMNAIESRIQEFVYSPEDGNTFERWWNRYVDIFEIDLKEMDDLK FT KIRLLIRHVSTSVERTFVESIAPVNWADMTLLQVKNKMLILFGDNTSTFDR FT RRTMIDLKMSKENIEDVRVLAARVNQTVENAQVKDATIDEWKVLTFLHALD FT LPRYSDVHMRMMQSARQKGKDCTLDDLISVFNDVAQLKKDSLSITESGRAV FT HYVDKRTNNKNQNKQSNKSKPQKFNDSVVCMSCGKASHNRSECKFRNANCY FT NCKRDGHIAKVCKKPKKALTVSVSTVSTSDYHIQVNINGQSTSIKIDTGAD FT ITILSEKVWRRIGEPQCEAADCTATCANGESLELIGKFNATADYKGVEASG FT DMYVTRKNINLLGQNFIKLLKLVEIREATIHEVSVPLSFCSNIQYTEWIKR FT EFPEVTASGLGCCTEMKASLQLKPDMKPVFIKARPVPYALTEKVEVELNRL FT ERCGVIEKVEYSDWAAPILTVSKPNGSMRMCADFSTGLNAAIDLPAHPLPV FT PADIFATLNGASIFTQIDLAEAYLQVPLDEEAQKLLVINTHKGLFRYKRLP FT FGVKAAPGIFQRLMDTMLSGVKNAIPYLDDIIIGGRNKEEHDETLIDVMMK FT LQQFGLRTRAEKCSFGMKEISFLGFIINKDGRHTDPKKTEAIRTMPEPQNV FT VMLRSFLGMVNYYGQFIDGMHILRSPMDHLLKDNVKWKWSKECSSAFEKIK FT EKLNSQLSLVHYDPHKEIVIAADACEDGIGAVILHRFPDGTLRAVSHASRK FT LKPAERNYGQIEKEALALIFAVGKFHKYVFGRRFTLQTDHKPLLSIFGSPK FT GVPAYTAKRIYRWAETLLMYDFRIEYINTDSFFYADALSRLISECQTEQEV FT NIALVQTEEDVNDMLAADIRRLPVTATEIQSETKKDTLLQEVIKRHLKGWP FT KKDTKDPCLTAFYSRRDSLSIVKGCLLFGSRVIVPQILQSRVMKDLHAEHP FT GIVRMKSLARSICYWPGMDQEIEKVVLKCDRCAKAAKSPVKVPLQPWPTAD FT SPWERVHIDYAGPVRGEYFLVIVDAYSKWPEVYSTTRITASITVDFMKDAI FT ARYGIPQIIVSDNGTQFTSELFREMCAAYGIKHMTIAPYHPQSNGQAERFV FT DTLKRSLKKMNGEAPNQQIIRQFLMAYRRTPNPNVPDGKSPSEVFMGRAIR FT SKIDLIRPQILSDKIDERMKVQFDKRNGTKDRWFSVGDEVYYHAPDGPNRF FT QWIPAVIIGKKGKVMYEIEVKQKKQRAHANQLRKKSSGAGQTELKGTSIPL FT DLLMDTFDLNRNVPVVDHREEFRVELEEFPNEELGDSDDQSINSGYSTARS FT AVSSTPSSPILQRDQQPMLPQLLQPLNQRPMRIRQPPQRLDVDPRQKSYRA FT TKQ" XX SQ Sequence 4624 BP; 1350 A; 951 C; 1147 G; 1155 T; 21 other; ackgsaagac acaacattgg cgttcaggac aattcggact tcggtaactt tgactgattt 60 ttwtacttcg tctckgmktt gtsttcttga gttcgcgtga ttagcaagtg kcagactgwg 120 cctggccgat cgcaactggt ggtgtggtgw cagagccgtg tctgaccacw gamagctwgg 180 cgcamaccgt gcgcatcgca aactgtgcgt gattgaaata agcagaattt catcaagtac 240 gccgaaggct ctgcccgaat tttgattttg aatcaaaaga gactgcagaa atgggtacta 300 atgaggatgg tgaaatggct gcagagatta ctgaaatgaa agcattgatg gctagcatgg 360 cacagctact tatgaagcaa tgtgagaaga aaccagagag tctgcaggaa ttatctaatg 420 catctatgaa tgccatcgag agccgaatcc aggaattcgt gtactcacca gaggatggaa 480 acacattcga acgatggtgg aatagatatg tggacatatt cgaaatcgat ctcaaggaga 540 tggacgatct gaagaagatt cgtttgctga tcagacacgt cagcacctct gtcgagagga 600 cattcgtgga gtcgattgcg ccagtgaact gggccgatat gacacttctg caggttaaaa 660 acaagatgct gattctcttt ggtgataata cttctacatt tgaccgcaga cgwacaatga 720 ttgatctgaa gatgagtaaa gagaacattg aagatgtgag agtgctagca gcacgagtta 780 atcagacggt agagaatgca caagtgaaag atgcaaccat agatgaatgg aaagtactga 840 ctttcctgca tgcacttgat ctgccgagat attctgatgt tcatatgaga atgatgcaat 900 cagcaagaca gaaaggcaaa gattgtacat tggatgatct gatctctgta ttcaatgatg 960 tcgcacagct caagaaggac tcgctttcca tcacagagtc gggcagagct gtacattatg 1020 tggataaacg aacgaataac aagaatcaga acaaacaatc gaacaagtcg aagccgcaga 1080 aattcaatga ttctgttgtm tgtatgagct gcggaaaggc ttctcataat cgatcagaat 1140 gcaaattccg taatgcaaac tgttacaatt gcaagagaga tggccatatt gcgaaagtgt 1200 gcaagaagcc aaagaaggca ctcactgtta gtgtatccac agtatccact tcggactacc 1260 acatccaagt gaatatcaat ggacaatcga cgtccatcaa gatcgatact ggtgcagata 1320 taacgattct ctcagagaag gtgtggagac gaatcggcga gccccagtgc gaggctgccg 1380 attgcactgc cacatgtgct aatggagaga gtctggagct tattggaaag ttcaatgcga 1440 ctgcagatta taaaggtgtk gaagcgtctg gtgatatgta cgttacccgc aagaatatca 1500 accttctcgg tcagaacttt atcaagctac tgaaactcgt tgagattcga gaagcgacaa 1560 ttcacgaggt mtctgtccct ctgtcgttct gcagtaatat tcaatacact gagtggatca 1620 aaagggagtt tcctgaagtt accgctagtg gattgggatg ttgcacggag atgaaggcat 1680 ctctgcagtt gaagcctgac atgaagccgg tgttcataaa ggcacgacca gtaccgtacg 1740 ctctcacaga gaaagttgaa gtggaactca acagactaga gaggtgtgga gtaattgaga 1800 aagtggaata cagtgattgg gcagccccca ttctcactgt gagcaagccg aatggatcaa 1860 tgcgcatgtg cgctgacttc agtacaggtc tgaatgctgc aattgatctt cctgctcatc 1920 ctctgcctgt tcctgcagat atttttgcca ctctcaatgg agcatcgatt ttcacgcaaa 1980 tcgatctagc agaagcgtat ctgcaagttc ctctggatga ggaagctcag aagctgctgg 2040 ttattaatac acacaaagga ctctttcgat acaagagact gcctttcggt gtcaaggctg 2100 ctcccggcat attccaaaga ttaatggata cgatgttgag cggtgtaaag aatgcgatcc 2160 catatcttga tgatatcatt atcggtgggc gcaacaagga agaacacgat gagaccctca 2220 tcgatgtaat gatgaaactg cagcaattcg gactacgtac acgagctgaa aagtgttcat 2280 tcggaatgaa ggaaatcagt ttcctcggtt tcatcatcaa caaggacggt cgccacaccg 2340 atccgaagaa gacggaggca attcgkacaa tgcctgagcc gcagaatgtt gtaatgctcc 2400 gcagttttct tggaatggtc aattactatg gacagtttat tgatggaatg catattctga 2460 gatctccaat ggatcatctg ctcaaggaca atgtgaagtg gaagtggtck aaagaatgct 2520 ctagtgcatt cgagaagatc aaggagaaac tgaattcaca actcagtctc gtmcattatg 2580 atccacacaa ggaaattgtt attgctgcag atgcttgtga agatggaatt ggtgctgtta 2640 ttctgcatag gtttcctgat ggaactttgc gggctgtgag tcatgcttcc aggaagctga 2700 agcctgcaga gaggaactac ggacagattg aaaaagaggc acttgcgctc atttttgccg 2760 ttggaaagtt ccacaagtat gtgtttggca gacgttttac tctccaaaca gatcataagc 2820 cgctgttatc gatctttggc tcgcccaaag gagttcctgc atatactgct aagagaatat 2880 acagatgggc tgagactctg ctgatgtatg atttccgaat tgaatacatc aatactgatt 2940 ctttcttcta tgctgatgct ctttctcgtt tgatcagtga atgtcagaca gaacaggaag 3000 tgaacatcgc gctagtgcag actgaggagg atgtcaacga catgctcgct gcagatattc 3060 gtcgattgcc agtgacggca actgagattc aatctgagac taagaaggat acactcctcc 3120 aggaggttat caagaggcat ctcaaaggat ggccgaagaa agacactaag gatccgtgcc 3180 ttactgcatt ttattctaga agagattcac tgagtattgt gaaaggatgc ttgctcttcg 3240 gttctcgagt gattgttccg cagattctac agtcgagagt gatgaaagat cttcatgctg 3300 aacatccagg gattgtgcgc atgaaatcac tcgcgagaag tatctgttac tggccaggaa 3360 tggaccagga aatcgagaaa gttgttctca agtgtgatcg ctgcgccaag gcagcgaagt 3420 cgccagtgaa agtaccacta cagccatggc ctactgcaga tagtccatgg gaacgagtcc 3480 acatcgatta tgctggaccc gtcagaggcg aatactttct tgtaatcgtt gatgcttata 3540 gcaagtggcc ggaagtgtac agtactacaa gaatcactgc atctattact gtcgatttta 3600 tgaaggatgc aattgcgaga tatggaattc cgcagattat tgtatcggat aatggaactc 3660 aattcacatc ggaattgttc cgtgagatgt gtgctgcata tggtatcaaa cacatgacga 3720 ttgctccgta tcatcctcaa tccaacggac aagctgagcg ttttgtggat acgctgaaga 3780 gaagtctgaa gaagatgaat ggtgaagctc cgaatcagca gatcattcgt cagttcttaa 3840 tggcatacag aaggacgccc aatccgaatg tgccagatgg caagagtccg tctgaagtgt 3900 tcatgggaag agcaatcaga tcaaagatcg atttgattcg tccgcagata ttatcagata 3960 aaatcgatga gagaatgaag gtccaattcg ataagcgaaa tggtaccaag gacagatggt 4020 tctctgtcgg agatgaagtg tactaccatg cgcctgatgg tccaaatcgg ttccaatgga 4080 ttcctgcagt tattattgga aagaagggca aggttatgta cgagatcgaa gtcaaacaga 4140 agaagcaacg agctcatgcg aatcagcttc gcaagaagtc aagtggagcc ggtcagactg 4200 agctgaaagg cacttcgatt ccacttgatc ttctgatgga cacgttcgat ctgaacagaa 4260 atgtgcctgt ggtagatcac agagaggagt tcagagtaga actcgaggag tttcctaatg 4320 aggaactcgg tgattcggac gatcagtcca tcaattctgg ctattctaca gccagatctg 4380 ctgtatcttc tactccatct tcgccgattc ttcaacgtga tcaacagccg atgctgccgc 4440 agttacttca accactcaat cagcgaccga tgagaattcg tcaaccgcct caacgactcg 4500 atgtcgatcc gagacagaag tcgtacagag cgacgaagca ataatcccgt cccctttgct 4560 acgtcctcac ctcacctctt tcccattctg taaaagatcg gcagcgccaa tcttgaaggg 4620 gagg 4624 // ID I-44_AAe repbase; DNA; INV; 7084 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-44_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7084 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1317-1317 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 307..1905 FT /product="I-44_AAe_1p" FT /translation="MSPNGLHLRPGDPGGPGGGGSGITNQRNGEYTGALVP FT SFMDPEGTLGQLHYLKLSGTSGQILTDPFLLRLSVEKHIGGPIAGAFKENR FT GLTYVLKVRSQEQVKQLLRMKQLSDGTGITIEEHSSLNQVRCVVTNPDTVG FT LSDQYLVENLANQNVKEVRRIKRRNEKGEPVNSPVMILTINGTVIPEHIDF FT GWTRCRTRNYYPSPMLCFRCWTFGHTGKRCSATQRTCGKCSKTHDEELTPN FT IAEGATAFEVSRGRTNCDEPAHCKNCEEWNKKREKNEKVDTKHAVSSRECP FT IYKKENAIQHLRVDLGISYPAARRDYETREAAMAKKKNPQTDKSFAGIVNA FT SKDCEMDEMRSMVKNLLEDSKRKDERIAQLERALEKRSVNTRLDSAKDHGP FT TAELVRQVAELTATVRQLQDNLTKKDAFIKTLLTTRSHPVASPSNSTISAA FT ETEKANMTHSHSSTEEIIVDNVTLQPFFQVQEWIKNSTHIANDQSDYGRES FT TPKEQQTQENAETMECSIEVDNISSDGDASVAPGFRR" FT CDS 2060..6805 FT /product="I-44_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="TSRPLSPPTYDSQHQWTFTYTTTFTENNQPENCSNES FT TEHNFHYETPQNRFTNNPSTNSTTAGASGSRGPVSAEATPQPVLTDNPQHP FT SVSAADGTHKGVKTYTPQYPEDGRDTQGPDKPEAISVPEPTDYPCHPHRSG FT RGTHKGVKTRPPKDIEGNGIGKLTSSTTCMLSTSLTSSCPTSSCSTSTSTT FT SSDSTNSINSKITATEKETTTISRGRGSVADSLPPELSHIPQHPCTLRRNP FT PRTRNKPVRYQAGITPTKPDNPFPPCLTEREATNADHYDRMIRKRDQQSLP FT TDGPSHRSSITDHMSSPPDPEPGLPGHPGSSNPGGKSHPKHGQDPEQGPPG FT HPERPRPGGKPHLRSTTSTSKRPINQGPTTLAVQWNMNGFHHNLPDLEVLV FT HDFQPVALAIQEIHRTTPATMNNTLGKRYRWFTKTGANIYQSVAVGVSTDI FT TTEEIDLDTDLPIVAVRLPWPFPVSVISVYLPNGKLPDLRNQLEAILLKIP FT SPMVILGDVNGHHRAWGSRTDNARGKCVVDIACSNNLAILNDGTPTFIRGQ FT TETAIDVSLVSTSITSRLLWSVDTDTRGSDHYPITLRLDKSISPQTSRRPR FT WIYDRADWAQFQYDIDSGIEQTPPTSITDLTALVTGSAEKSIPKTSPNSGR FT RALHWWSPETKTATKTRRKALRAFKRSKKNLPAEHPDLVTAKKCYQAARNE FT CRQIIREAKEKSWTDFLDTINEDQSSSELWRKINSIQGKRRVKGFSLKIDG FT VTTRDPQTIADALAEYFHGISAIDKYTPTFLKHYPNPQLSVINFPTPPDQG FT QLFNQPFSMTELNYAMGRVKGKSAGSDELGYPMLKHLSYSGKSKLLELINQ FT EWTEGTLPEEWKHSLVVPIPKSSGPANSTTSYRPIALTSCACKIAERMANR FT RLMEHLESNGLLGYRQHAFRPGYGTGTYLTNLGSILEDAMKAQEHVEIVSL FT DLAKAYNRAWAPSILKQLVEWGVSGNLLNFVKNFLTNRTFEVLIGNHRSKV FT TREETGVPQGSVISVTLFLVAMSGLFLVLPKGVYILVYADDILLLVTGKHP FT RSIRRKMQAAVNAVTKWAGNTGFDIAPEKSSRLHICNSNHRPPKTPITANG FT KTVALKKTLKVLGVTLDRKLSFHSHFLAVKESCTNRVNLIRSISSKRTRSD FT RATRHRVADAVICSRLFYGIEITSRAQDNLVKTLAPTYNNTIRAISGLLPS FT TPADSACAEAGVLPFRYKAALTLGNRTVCYLERTKEDGQTCILEENANSAL FT NAVANIQLPPVAGLHRIRPRSWSAKRPQIAQNIKTHFRRQCNPTEVQTAFR FT EVLHRKFNWHEVRYTDGSKLAERVGLGIHSRTTEIYHSLPGQCSVFSAEAA FT AILVAITTPCDYPVLVVSDSDSALSAIANPNNRHPYIQKIQAALDSPSTRA FT TFMWVPGHCGISGNERADTLAGIGRNSPLLSRKIPRDDVKSWLKERLWTAW FT SVEWHRNRGLFLRKIKNTTTTWQDQPARREQVVLSRLRTGHTRLTHDMGST FT GGSFHRQCETCRTQNTVEHFLINCPTLEHLRIIHEITGIDSTLQNDAVKER FT QLIIFLKEAGLFTLI" XX SQ Sequence 7084 BP; 2010 A; 2107 C; 1617 G; 1350 T; 0 other; caatttagtg caaaaactgc agtggaagag ctaaacccgc gtgtgatctc acggtaatca 60 cccattgttc ccgcattgtg ttgttggcgc aggcatagtg gtattcattg aagcccacga 120 ctgtgtgtgg ctacataacc cgagtgtgtg aaaatatcaa acagtgaaca aactcctttt 180 ttggcactct atcgaaaagt ttcgtcatcg cagtgtgtac tcccgactaa catatctacc 240 ctgtgaatct ccaagccggt cgactgggag atccttagat catcttagtt ccggcccggc 300 ggcgtcatgt cgccgaatgg cctccacctc cgcccagggg acccaggggg ccccggtggt 360 ggcggcagcg gaatcacgaa tcaaagaaac ggagagtata caggggcatt agtaccctcc 420 tttatggacc ccgaaggtac tttagggcaa cttcactacc tgaagctgtc cggaacctcc 480 ggccaaattc tgaccgaccc cttcctcctc cgtctgtcag tcgaaaagca catcggtggc 540 cctatcgccg gtgcgtttaa ggagaaccgc ggcctaacgt acgtactgaa agtacgtagt 600 caggaacaag tgaaacaact gctacggatg aagcaactta gtgatggtac tggtatcact 660 attgaggagc actcgtcact caatcaagtc cgatgtgtcg tcactaatcc cgacaccgtg 720 ggtcttagcg atcaatattt ggtcgaaaac ctggcgaacc aaaacgtcaa agaggtgcga 780 cgcataaagc gccgcaatga gaagggcgaa ccagtcaact ccccggtcat gattttgaca 840 atcaacggca cggttatccc ggaacatatt gattttgggt ggaccagatg ccgaaccagg 900 aactactatc cctccccgat gctgtgtttc cgctgttgga cttttggtca caccggcaag 960 cggtgctcag ctactcagcg gacctgcggc aagtgcagca agactcacga tgaagaactc 1020 acccccaaca tcgctgaagg agcaacagca ttcgaggtat ccagaggaag gacaaactgc 1080 gacgaaccgg cccattgtaa gaactgtgag gaatggaaca aaaaacgcga gaagaacgaa 1140 aaggtcgaca cgaaacacgc agtctccagt cgtgaatgcc cgatctacaa gaaagagaat 1200 gccatccaac atctacgggt cgaccttggt atttcgtacc cggcagcccg ccgggattac 1260 gaaaccagag aagcggccat ggccaaaaag aaaaatccgc aaacagacaa atcctttgca 1320 ggcattgtca acgctagcaa ggattgcgag atggatgaaa tgaggtcgat ggtaaaaaat 1380 cttttggaag actccaaaag gaaggacgag cgcatcgccc aacttgaacg cgcgctagaa 1440 aaacgcagcg tcaacacccg gctagatagc gcaaaagatc acggtcccac ggcggaacta 1500 gtacggcagg tggctgagct aacagccaca gtcagacaac tgcaggacaa tctgacgaag 1560 aaagacgcct tcatcaaaac actccttacc acgagaagcc accctgtagc atccccaagt 1620 aactccacca tctcggccgc cgaaactgaa aaagccaaca tgacgcactc ccattcctca 1680 accgaggaga taattgttga caacgtaacc ttgcaaccgt tcttccaggt ccaggagtgg 1740 atcaaaaaca gtacccacat cgctaacgac caatccgact atggtcggga atccaccccg 1800 aaggagcaac aaacacagga gaacgccgaa accatggagt gcagtatcga agtagacaac 1860 atcagcagcg acggcgatgc tagcgtcgct ccgggcttcc gccgctaaca ccatcaactc 1920 caacgccaca tccacctcta acccatctaa acggggccac cacagtcccg atggcagtga 1980 ctcctctatg aagggaccca gcagctccaa acggaaacac cgtaagagcc acgggatacc 2040 cccttccccc acaatctaga caagtcgccc gctcagtcct cccacctacg actcccagca 2100 tcagtggacc tttacctata caacaacatt cactgaaaac aaccaacccg agaattgctc 2160 aaatgagtcg actgagcaca acttccacta cgagacacca caaaatcgat tcacgaacaa 2220 cccctcaacc aactccacaa cagcgggagc ctcgggcagt cggggccccg tcagtgcgga 2280 agctacaccc caaccggtac tgacggacaa tcctcaacac ccgagtgtct ctgcagctga 2340 cgggacgcac aagggtgtaa aaacctatac cccgcaatac ccagaggacg gccgggacac 2400 acaaggcccc gacaaaccgg aagccatttc tgtaccggaa ccgacggact acccttgtca 2460 tccccaccgg tctggaagag ggacgcacaa gggtgtaaaa acccgtcccc ctaaggatat 2520 tgaaggaaat ggaatcggta agctgacgtc gagtacaact tgcatgctgt ctacgagttt 2580 aacttcttcg tgtccaacat cttcgtgttc aacttcaacg agtacaacct cgtcggattc 2640 aaccaactcc atcaatagca agataacagc cactgaaaaa gaaaccacta cgatcagccg 2700 gggccgcggt agtgtggctg attcacttcc accggaactg tcccacatac cacaacatcc 2760 atgtacgctt cgtagaaacc cccccagaac tcgaaacaaa ccggtaagat accaggccgg 2820 tataactcca acgaaacctg ataatccctt tcctccctgt ctcacggagc gggaagccac 2880 gaacgcggat cactacgacc gaatgatcag aaaacgcgat caacaatcac tgccaactga 2940 tggcccatcc catcggtcgt cgataacaga ccatatgagt tcacctcccg accccgaacc 3000 gggtctgcct gggcaccctg gatcgtccaa tccaggtggt aagtcccacc caaaacacgg 3060 ccaagacccc gaacagggtc ctcctgggca ccccgaacga cctcgtccgg gtggtaagcc 3120 ccatctgcgt tcaacaacct ccacctccaa gcgccctatc aatcaaggcc ctaccaccct 3180 cgctgtccag tggaacatga acggtttcca ccacaacctg cccgatttgg aagttctggt 3240 ccatgatttt caaccggttg ccctggccat ccaggaaata caccgcacca ccccggcgac 3300 aatgaacaac accttaggta agagatatag gtggttcact aaaaccggcg ctaacatata 3360 ccagtcggta gcagtgggcg tgtctacgga cattacgact gaagagattg atctcgatac 3420 cgacctccca attgtcgcgg tccggctccc gtggcccttt cccgtttcgg ttatctccgt 3480 ctacctcccg aatggaaagc ttccggatct aaggaaccaa ctggaagcaa tcctactcaa 3540 gatccccagc cccatggtga tacttggtga cgtcaatggg caccaccgag catggggcag 3600 tcgaaccgat aatgctaggg gaaaatgtgt cgtcgacatc gcatgcagca acaacttggc 3660 gattctgaat gatggaaccc ctacattcat ccgcggacag acagagacgg ccatagacgt 3720 ttctcttgtc tctacctcca tcactagtcg cctcctttgg tctgtggaca cggatacccg 3780 aggaagtgat cactacccga ttacgcttag actcgacaaa tcaatcagcc cgcaaacctc 3840 acggcgacct cgctggatct atgaccgggc cgactgggcc cagttccaat atgacatcga 3900 ctctggtatt gaacaaaccc ccccaacttc aatcacggac ctaactgcct tagtcactgg 3960 atcggcagaa aagtccatac cgaagactag cccaaactcg ggtagaaggg ctctccactg 4020 gtggtctccc gaaacgaaga cggctacgaa aactcgcaga aaagccctgc gcgccttcaa 4080 aaggtctaag aagaacctgc ccgctgagca ccccgacctt gtgacagcca aaaaatgcta 4140 ccaagccgcc cggaacgaat gccggcaaat catcagggaa gccaaggaga aatcctggac 4200 cgacttcctc gacacaataa atgaggacca gtcatcttcg gagctatgga ggaaaatcaa 4260 cagcattcag ggcaaacgcc gagtgaaagg tttttccctc aagatcgacg gagtaaccac 4320 ccgggaccca cagacgatcg cggatgccct cgcggaatac tttcatggga tatccgccat 4380 cgataaatac acgccaacat ttttaaagca ttatcccaac ccacaattaa gtgtgataaa 4440 tttcccgact ccccctgacc aagggcaact tttcaaccaa cccttctcca tgacagagct 4500 caattacgcc atgggcaggg tcaagggcaa gtcagcagga tcagacgaac taggttaccc 4560 aatgctaaaa caccttagtt acagtggaaa aagcaagctg ttagaattaa taaatcagga 4620 atggactgag ggcacgctac cagaagagtg gaagcacagc cttgtagtcc ccatacccaa 4680 gagttccgga ccggctaaca gtaccaccag ctaccgcccg atagcactca ccagctgtgc 4740 atgcaaaatt gcagaacgga tggcaaatcg tcggctgatg gagcatcttg agtccaatgg 4800 cctactaggg taccgccaac atgcatttcg cccagggtat ggtaccggga cttacctaac 4860 taatctcgga agtatcctag aagacgcaat gaaggcgcag gaacacgtgg aaatcgtatc 4920 cctcgacctt gccaaggcat acaaccgagc ctgggcccct agcatcctta agcaactagt 4980 agaatgggga gtatccggaa acctattgaa tttcgtaaag aatttcctca caaaccgaac 5040 cttcgaggtc ctgataggaa atcaccgatc gaaagtcacg agggaggaga ccggagtgcc 5100 tcaaggctct gtgatttcgg ttacactctt cctcgtagcc atgagtgggc tctttctggt 5160 gctaccgaaa ggagtatata tattggtata cgcggacgac atcttgctcc tggtcacagg 5220 aaaacatccc cgttcaatcc gccggaagat gcaagcggca gtgaatgccg tcaccaaatg 5280 ggctggcaac acgggctttg acatcgcgcc cgaaaagagc tcaagacttc acatctgcaa 5340 ctcaaatcac cgcccaccaa aaacacctat aacggccaac ggaaagacgg tcgccttgaa 5400 gaaaacactt aaagtgctgg gagtgacgct tgatcgaaaa ctttctttcc acagtcactt 5460 tctcgcggtc aaggagagtt gtacaaaccg tgtcaacttg atccgtagca tatcatccaa 5520 acggacgcgc agcgaccgcg ccacgcgaca ccgggtggcc gacgccgtaa tttgtagtcg 5580 cctcttctac ggaatcgaga ttaccagccg cgcacaggac aatctagtga aaacactggc 5640 gccaacctac aacaacacca tccgtgcgat ctctggacta ctaccttcca ctcctgctga 5700 ctcagcttgc gcagaagccg gtgtgttacc tttccggtat aaagccgctt tgaccctcgg 5760 aaaccgcacc gtctgctatc tggaaaggac caaggaagat gggcagactt gcatcctcga 5820 agagaatgcg aactctgctc ttaacgctgt ggccaacatc cagcttcctc cggtggccgg 5880 gctccaccga atcagaccca gaagctggtc ggccaagaga ccccagatag cccaaaacat 5940 taagacccat ttccggagac agtgcaaccc cacggaagta cagactgctt tcagagaggt 6000 actacacaga aaattcaatt ggcacgaggt ccgatacacc gacggctcca aactagcgga 6060 gcgggtcggt ctcggcattc atagtcgcac tacagaaatc tatcacagcc tccctggtca 6120 gtgctccgtt ttctcggctg aagctgcggc catacttgta gccatcacga cgccctgcga 6180 ctatccggtg ctggtcgtca gcgactccga cagtgccctg tctgcaattg ctaacccgaa 6240 caatcgacac ccttacatac agaaaatcca ggcagccttg gatagcccat ccactagagc 6300 cacattcatg tgggtccccg gtcactgcgg catctccggc aatgagcgag ctgacacact 6360 cgctggcatc ggccgtaaca gcccattact atccagaaag atcccccgtg atgacgtaaa 6420 atcctggttg aaggagcgac tttggaccgc ctggtccgta gaatggcacc ggaacagagg 6480 gttattcctt cgcaaaatta aaaacaccac gacaacatgg caagaccaac ctgctcgccg 6540 ggagcaggtc gtcctctctc ggctgcgtac gggacataca cggctgacac acgacatggg 6600 tagcacaggg ggcagtttcc accgccagtg cgagacatgc aggacacaaa atactgtcga 6660 acattttctg ataaactgtc ctaccctgga gcacctaaga ataatccacg agatcaccgg 6720 tatcgacagc accctccaaa atgacgcggt aaaggaacga cagctgataa tctttttgaa 6780 agaggctgga ctctttacct taatatgacc cctgctaaga ccctggacct gaaaacgaat 6840 acgaaatcac cacatcccct accgttacga tccctgctat gaccttgaac tcgatacgga 6900 acttccacat cattcgacaa caccagtgaa acaaagaaca ccgcaacata cactgttaat 6960 tatacaatgt aactccctct cctcctccat ccctgcaaat taaccccccc ccccctttta 7020 tatacgggga tgaaccagcc tcgggctgaa tttcccctta ataaagagtc aatcaatcaa 7080 tcaa 7084 // ID Gypsy-143_AA-I repbase; DNA; INV; 7235 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-143_AA_; KW Gypsy-143_AA-LTR; Gypsy-143_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7235 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1019-1019 (2011). XX DR [2] (Consensus) XX CC Positions [5200-5682] - Integrase core CC 'CAAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 438..2789 FT /product="Gypsy-143_AA-I_2p" FT /translation="MENYYMNADDLNDEEVAYELKLRNSAIGRSTDNRRVL FT RYLIRREENQEFPTSNTINDEYVTVAAAVKQIEDSLLAGRPIGCRSRLIHH FT YRRLRRCHATTASGEQTRLALMKFVFGLAGRYFQLDLNRLGAVSNAGVSSE FT PTGVVSPPNQSVGGQMRLATPPMVNPEMQSSAEGAVGGETRNDAVEDPLQM FT SENVLNPFIVENRHSLDSRFPAPGNLQSEMQQASLGRMSAEGNPLPFRNVR FT FRSGEDLSCLNPPRQNLSSVDPEASGGSVPQPLNVIGFNEERNSVAGTVPK FT PSRGSRTDFWSMGPPVSQGEPSSNPQEGQYRPWLDERVRQPSNAGNSVPEP FT RETVSVQPANTNANQQLNLREYVHISEIENYVQSCVRTLLNERSDRPLVRE FT TVVNNVSEQFANFGISSTGERNESVPRLSPPLQLQNAPGPQSSSNNIPFTP FT LLGIGTERNSMPFQSQLNSTQMPQFAGNTQSPIQFPASLSSGNNGTLGQRR FT LPHQTCNIIEKWPKFAGDSNPVPVIDFLRQIDLLCRSYQVSKEELRTHAHL FT LFRDDAYVWYTAYEPKFNSWDTLLYYLQMRYDNPNRDRFIKEEMRNRKQRP FT NELFSAFLTDMETMAQRLIKPLTEGEKFELIIENMKLSYKRRLALEKIYSV FT EHLAQLCYKFDTLEGTLYGQRVQTKPHLVNEVLLEEEDEFVVQADDEEEIA FT VLQAEKVTSNKARLSPADNARSSFNNGQRQPLCWNCRKVGHIWKECDRQKT FT IFCHVCGHPDTTAFRCPQQHNIRPLSEVKSKNG" FT CDS 4756..6057 FT /product="Gypsy-143_AA-I_1p" FT /translation="MTLSCDIKKGCDNIPADCISRSIYTVHIHLSDPYIDG FT LKYQIEHNPQKHPDYEVVNGTVFKFIANSTLIEDPAHRWKQVVPGPDRRDI FT IKEVHCEAQLGFLKTLTKIRERYFWPSMASDTKRYCSSCKVCKESKVPNQN FT VQLSCGKLKVCSRPWEIISRDFLGPCPRSREGNMWIQIICDFFSKFVLAQY FT MKTSTAPLVCTVMENLVFNLFGAPSICITDNAKVFVSDPFKKLLKEYGVTQ FT WNLAVYHPAPNPAERVNRVIVTAIRCALNKQADHKNWDDSVQTIARAIRTS FT VHESTGHTPYFVNFGRDMISHGQEYENLQELGDGQDYDATKRKEELTKLYE FT LVRSNLYKAYQKYSRPYNLRSNQKHTFKKGDIVYKRNVHLSDKSANFVGKF FT GSKFTKARIKGVLGTNTYILESEDGHRIPGTFHGSFLKRA" XX SQ Sequence 7235 BP; 2122 A; 1477 C; 1596 G; 2035 T; 5 other; ttttggcgcc caacgtgggg cctggctttt taagcttggc attgctgatt agtaatttga 60 aagttggcaa ttggaaaatt gactaggatc tttttaaaat tgattgaata ttcgatctga 120 attgaaatta ggataacaaa ttttgaattt tgattgcaaa tgcaatcggc tatcggttta 180 gaataataaa ttattgaatt tcggttcgtt tgaagtcggt tagtttaata tcggtttaaa 240 ttcggtaatt tttcggtgaa attttgtttt ttttttcgat aattgcttat aaagtagaac 300 attcgtacaa gtctgtcatt catttcgaat agcttgttcg attataaagt gaatttaaag 360 taagattttc ttctaaattg aattcaataa tagtgaattg aagtgattgt tttggtgtgg 420 aaagaaaatt cttcaaaatg gagaattact atatgaatgc ggatgattta aatgacgaag 480 aagtggccta tgagctgaaa ttgcggaatt cggccatcgg tcgcagcaca gataatcgtc 540 gtgtccttcg ctatttgatt cgtcgtgagg aaaaccagga gtttccaacg tcgaacacca 600 tcaacgacga atacgtaacc gttgctgcag ctgtgaagca gattgaagat tctctgctgg 660 ctggcagacc gattgggtgt cgatctaggc tgattcacca ctatcggcgt cttcgtagat 720 gccacgcgac tacagcgtct ggagagcaga ctcgtctggc gctgatgaaa tttgtttttg 780 gattagcagg acgttatttt cagctcgatt tgaaccggct aggagctgtt tcgaatgccg 840 gagtttcttc ggagccaacc ggagtagtct cgcctccgaa tcagagtgtg ggtggtcaga 900 tgcggctagc tacacccccg atggttaatc cagagatgca atcttcagct gaaggagctg 960 ttggcggcga aactcgcaac gacgcggttg aagatccact ccagatgtca gagaatgttt 1020 tgaatccgtt cattgtcgaa aatcgccatt cgttggacag ccgtttccca gccccaggga 1080 atctgcagag tgaaatgcag caagcttcgc tcggtagaat gtccgctgaa ggaaatcctc 1140 tgccttttcg taatgttcga ttccgttccg gagaagattt gtcatgcttg aatcctccta 1200 gacagaattt gtcatcggtt gacccagaag cctctggagg tagtgttcct cagcccctta 1260 atgtgattgg ctttaatgaa gaaaggaact ctgttgcagg aaccgttccg aaaccttctc 1320 gtggatcacg aactgatttt tggtcaatgg gacctccggt atcccaaggg gaaccatctt 1380 ccaacccaca agagggccaa tatcgtcctt ggttagatga aagagttcgc caaccctcta 1440 atgctggaaa ttcagtacca gagccacgag aaaccgtttc tgttcaacct gcaaatacaa 1500 acgcgaatca gcaactgaat cttagggagt acgtgcatat ttccgaaata gaaaattacg 1560 tacaatcatg cgtgagaact ttgttgaatg aacgatcaga tcgtccattg gtgcgtgaga 1620 ctgttgtaaa taacgtgagt gaacagtttg ctaattttgg aataagttct acaggcgaac 1680 gtaacgaaag tgtgccgaga ttatcgccac cattacagtt acagaacgcc ccaggacctc 1740 aatcctcgag taacaatatt ccgtttactc ctttgctagg aatcggcaca gaaagaaatt 1800 caatgccatt ccagtctcag cttaactcga ctcagatgcc tcaatttgca gggaataccc 1860 agagtccgat acaatttccg gcctcgttgt cttcaggaaa taacggtaca ttaggacagc 1920 gaagattgcc acaccagacc tgcaatatta ttgaaaagtg gcctaagttt gcaggagact 1980 caaatccggt cccggtgatc gactttttga gacagattga tctcctttgt cgttcgtacc 2040 aagtgagcaa ggaagaattg cgaacacacg ctcacctcct gttccgagat gatgcgtacg 2100 tttggtatac agcttatgaa cccaagttta actcgtggga taccttgctg tactacttgc 2160 aaatgaggta cgacaatcca aatcgcgaca ggtttataaa ggaagaaatg cgcaatagaa 2220 aacaacgccc taacgaatta tttagcgctt tccttaccga tatggagact atggcacagc 2280 gcttgatcaa accattgacg gaaggagaaa aatttgagct cattattgaa aatatgaaac 2340 tttcttacaa aagaagactt gctttggaaa agatttactc cgttgaacac ttggctcagc 2400 tttgctacaa attcgatacc ttggaaggga ctttgtatgg ccaacgtgtc caaacaaaac 2460 ctcatctggt aaacgaagtt cttcttgagg aagaggacga gtttgtggtt caagctgatg 2520 atgaagaaga aatagctgtt ctccaagcgg agaaggtaac ttcaaacaaa gctcgtctat 2580 ctccagctga taatgctcga agctctttca acaatggcca acgacaacct ttgtgctgga 2640 attgtaggaa ggttggtcat atttggaagg agtgtgaccg ccagaagaca attttttgcc 2700 atgtttgcgg gcatccggac actacggctt tccggtgtcc tcaacagcac aatattcgac 2760 cattatctga ggtgaaatca aaaaacggtt aggtaaagag ggatttgggg acccctatct 2820 gagccttaac gatcagaggt ctccaacagt atgttcgaat acttcaatac cacgtatacc 2880 atccatacac agtttaatcg ttgtcccaca cttatagtaa atgttttggg catagaagtt 2940 gaaggtcttg ttgacactgg agctagtgtg acaatcatga gctcgcagaa attaatagac 3000 cgtctcggac ttaagaatct gccgtgtcgt cttaaggttt cgaccgctga tcgaacttca 3060 tatagctgcc tgggatatgt caatattccg gtcgagttta agaaaatcac aaggattatt 3120 cctaccgtga tcgtaccgga agtttgcaaa acattgattc tcggaatcga tttcctgtaa 3180 gcattcaact ttcagtttac gattaacccg aatcaaaggc aaatggttcc agatcagtct 3240 gagaatcacg ccctattact ggcagaaagt tattttagtg aagatgaagg aaaagtcttt 3300 tttcgttagg tccctgatct tgacaccaat caatgtctag atccgaagga agacgatcat 3360 tcattggaga tgccaacagt cgaactgcca tcaaatccaa caaaaacctc gaaggatatc 3420 gttactgaac atgaattgtg cactgagcag agagttctct tatttaaggc tatccaggag 3480 cttcctgcga cacgttaagg caacttagga agaaataatc tcataaaaca ttcgatagat 3540 ttgattgtag gcgcgacccc caagaaaata ccactttata agtggtcacc gactgtagaa 3600 aaggtgatcg atgaagaaat ggagcgatta cgtaggcttg acgtaattga agaatgtctg 3660 acctccgcag acttcttaaa tccgttattg cctattagga aacccaacgg aaagtggaga 3720 atatgtctgg actcgcgacg attgaattct gtcacgaaga aasatgagtt tcccatacca 3780 aacatgtctc agatcttgca tmgtatcagt aaagctcgat acttctcagt catcgatttg 3840 accgagtcgt attatcaagt tgagctggac gaaaatgcca aaagaagaac tgcctttagg 3900 accaacaaag gcctcttcag gttcgaggtt atgccttttg gcttaatcaa cgccccagcg 3960 acgatgtcta ggctcatgac ccaagttatt ggccatgatt tggagccatt cgtgtatgta 4020 tacttggatg acatcatcat agcagctgag acttttgaag aacatgtacg actcataaga 4080 tgtgttgccg aaaggttacg aaacgcgggt ttgaccatta accttacaaa atccaaattc 4140 tgtcaaacca gcatacgata tttggggtac gtcatctctg aagccggatt atcgacggat 4200 gttgcaaagg tacaaccaat cctcgattat cctgcgccta ggtcggttaa agaagttcgt 4260 cgtctactag gtcctgcagg tttctatcag aaatgtatcg gcaaatactc ggaaatcacg 4320 actccattga ccaacctgtt taagaaggga cgtgaaaagt ttgaatggac tgctgaagcg 4380 gatgaagctc tcaacaagct gaaagaagcg ttggtatctg ctcctgcgtt agcaaatcca 4440 gacttctcag agcaattcat catcgaaact gatagttccg accttgcggt tggtgcagtt 4500 ttggtgcaat tgcagaaagg tgagaggaaa tgcatgctta cttttccaaa aagttatcca 4560 gcacacaacg tcggtacagt gcaaccgaaa gggaatgcct ggccgtacta ttaagtattg 4620 agcacttccg acattttgtc gaagggagtc gctttattgt gcaaacagat gcaatgagac 4680 ttactttcct ccagaccatg tctaacgagt ctcggagtcc gcgtacctct cggtgggctt 4740 tgaagttatc aaagtatgac attgagctgc gatataaaaa agggctgcga caacatccca 4800 gctgactgca tcagcagaag catctacact gtgcacattc atctctccga tccatacatt 4860 gatggactca agtatcagat tgagcataat ccacagaaac atcccgacta tgaagtagtc 4920 aacgggaccg tctttaagtt tattgctaat tccaccctga ttgaagaccc tgcccatcga 4980 tggaagcaag tagtgccagg acctgatcga cgagatatta ttaaagaagt tcactgtgaa 5040 gcccagctcg gtttcctgaa aacactgacc aagattcgag aacgctactt ttggccaagt 5100 atggctagcg acactaaacg gtactgctcc agttgcaaag tgtgtaagga gtcaaaagtg 5160 cccaaccaaa acgttcaact ctcatgtggt aaacttaaag tgtgttccag accctgggag 5220 ataatctcca gggattttct tgggccatgc cctcggtcac gcgaaggaaa tatgtggatt 5280 cagatcatct gcgatttctt ttcaaaattc gtattggcac agtatatgaa gacatccact 5340 gcacctttgg tgtgcactgt aatggagaac ctagtcttca atctcttcgg tgcaccgtca 5400 atttgtataa ccgacaacgc taaggtcttc gtttcggacc cctttaagaa gttgttgaag 5460 gagtacggag ttactcaatg gaatctagct gtctatcacc cggcccctaa tccagccgaa 5520 agagtwaatc gagtgatagt cactgcaatc cgctgtgcwt tgaacaaaca agccgatcat 5580 aaaaactggg atgactcggt gcaaacgata gctcgtgcca ttcggactag tgtgcatgaa 5640 agcactggac atacacccta tttcgtgaac ttcggacgtg acatgatcag tcacggccaa 5700 gaatacgaaa atcttcaaga actcggggat ggacaggact acgatgccac aaaacgaaaa 5760 gaagagctta cgaaactcta cgaactagtt cgctctaatc tctataaggc ctaccagaag 5820 tattcgcgac cttacaatct tcgatcgaac caaaaacata cgttcaagaa aggcgacatc 5880 gtatataaac gtaatgttca cctttctgac aaaagtgcca attttgtggg caaatttgga 5940 tcgaaattca ctaaggcacg aattaaggga gtgctcggta ccaacactta catccttgaa 6000 tcggaggatg gacaccgtat acccggcact ttccacggtt cgttcctcaa acgagcataa 6060 aaaggtcaag ctatgactgc actacggccg cgtagtgcat acaccagtca aaataacaaa 6120 aatacactta tgtggtgcaa cgttagtccg atccgagatg tcctcgtgtt tcctcgatcg 6180 ttgacaaaat cttgagctta tgttaccgaa taaaacccag ctatgactgt gtccttaaac 6240 cggaggcaca gcaacaatcc tttccactaa aatactcctg aaggtgctac ggagctccaa 6300 cgttcgagat gtcacatcag ctgtttcctc acgttgtcga gaaatgattt cgctcagtat 6360 agtgctctga tgcagctatt atcattgata gctagcatag aatagggcak aattttaatt 6420 ggaaaattca cacatccaac ttccatcatg cgcgatgacg caggagaagc gaagttgaga 6480 atgcgtacat ggatcagcag aaaaatcgtc aaaacactcg taaagtaact cagttctgca 6540 tatcttccat agtaatatta aatcccactt caatttgctt aattttagac ctaaaataga 6600 gtttgtaacg ttgttaaaat attaaaattt tagcataaat tacctttcgg atcaataatt 6660 tcgttcgttg tttgttttgt tttagtgagt tttgacagtt gtacgttgtt tcgtcattgg 6720 taaccagtgt tcagatgtgt agtagtatta gtgttaggtc atttcggata tgctgtaagt 6780 tagttcaata gggttataaa atcaataatt tgcgattttg tttgcagtat tcggttagat 6840 ttagtttcag tagcttatca gattagttct gatgtttatc ggtgtatgaa atcggttcga 6900 ttaattttca agtatgttga agcaaatcgg accgacattc ggaaatcttt ctcagttatt 6960 actgaggtga tcggataatt aggttagaaa tttgtatcag attcggtctg atgtaatcgg 7020 aaaattggtt tagtttagga aattaggata ttggtttcag attattctga agtattcggt 7080 tagcttagtt tcggaaatac gtaggtaata gaagaatatt cagatcattg gttaaattgt 7140 ttgtttgtta atttgaagat aagccttctt ttcttataaa aattttgaaa tttctttaaa 7200 tttcaaaatt tttatttatt tagtgggggc gataa 7235 // ID SIRE6_TC repbase; DNA; INV; 900 BP. XX AC AF227603; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Trypanosoma cruzi clone SIRE repeat region. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SIRE6_TC; KW short interspersed repetitive element. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Vazquez M., Ben-Dov C., Lorenzi H., Moore T., Schijman A. RA and Levin J.M.; RT "The short interspersed repetitive element of Trypanosoma cruzi, RT SIRE, is part of VIPER, an unusual retroelement related to long RT terminal repeat retrotransposons."; RL Proc. Natl. Acad. Sci. U.S.A 97(5), 2128-2133 (2000). XX DR Genbank; AF227603; Positions 1 900. XX SQ Sequence 900 BP; 240 A; 158 C; 235 G; 140 T; 127 other; ttnnnanncc naanngaaaa aaccccaaan gagagagnag gggtnccggn tnaattggga 60 ncgnaggnnt aggggatagg aaaagagggc nanaaaagaa tangaaaggt taaaaaccag 120 antntaaant ggnnggggnc tttaaatcga ttatgngana ntnncccncg anaaaaaccc 180 cattttaaaa agagnttngg gagggtnttt gatttncana ncggaaaaga atttgnaaan 240 gaattcggag tcanaganan aaaagatnaa gccccntnag aagagaatna aatgganaag 300 ttggggttnt aattgcaggg agggaataaa aaaataataa gnnttaagnc naggntatnc 360 ccnngaataa attttttnan aaagcccgaa naanaaatgg ggagaggggg cnttngnana 420 cgagnagaaa cnacgaaaaa anaatcttan tnanatgggg ataaangcng tcaccgggtc 480 ggngacgacg gccacccagg ngcatnnaan angntgagcc cnntcttngn gggtggcnng 540 aaggccgcta tnagngttgg gggtngccct cgccttcgtt ggagntcgta nngtttatcn 600 gccagcgccn tgcttnaggg ttggcatnac accaccangc gacagngtaa tatccttcag 660 gagcatgccg aggtngtngt cgtggcgcac gaaagtgtca ccgagagggg ngtcagncgc 720 ntaggnttcn ntgcctgttn gctcgccgcc ttcacggana gntccagcag atnggnngtc 780 aggnactccn gcaccnccgc catnnanaca gnncccgang cgccgatncg ncgggcatat 840 tggccgcggc gcagcagcga gcccacacgg cccacgggga agatcagacc ggcnttngng 900 // ID Gypsy-10-LTR_HM repbase; DNA; INV; 150 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-10-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-150 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1987-1987 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 150 BP; 51 A; 11 C; 20 G; 68 T; 0 other; tgttatgata atatgaaata tggagaatat atataaatat atatgttttg tatgatactc 60 tatatacttt atgtattgtg ttattttgtt atagactagt ataattatag ctcttttgag 120 ccaacataat taatatggtt attctttaca 150 // ID Gypsy-30_AA-I repbase; DNA; INV; 5390 BP. XX AC supercont1.271; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_AA_; KW Gypsy-30_AA-LTR; Gypsy-30_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5390 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.271; Positions 655542 650153. XX CC Positions [2597-3025] - Reverse transcriptase CC Positions [4355-4816] - Integrase core CC 'GTTGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1682..3247 FT /product="Gypsy-30_AA-I_1p" FT /translation="MSNGHPSPNTPAEDKKKCLRCGYRSHRPGDRCPASNK FT TCLQCGIVGHFRTVCRSKQRSQSNDDWVSRTSSIFCLCYIIWDSNLQGTRK FT RKHSFTATRDKYRSEKRARNVFNVKENSSDESPEDLPVYNVGNGEDEVIQC FT RVGGVQIAMLIDSGSKHNLIDDTTWELMKLRDVKITNQRVDTDKRFLAYGR FT IPLKLITTFDAELEVDDNGDLLRTRTSFHVIQKGQQPLLGKVTAQKLGVLK FT VGLPSTHSSSIRTIESSTKVFPKMKGIKLNIPIDRTVPPVIQPLRRCPIPM FT LEKVEQKLNELLEMDIIERVMKPTSWVSPLVPILKENGDLRLCIDMRRANL FT AIQRLNHPLPIFEDLVARFRNARFFTSLDIKQAFHQVELSEDCRDITTFIT FT NWGLYRYKRLLFGINCAPELFQNLMESILGGCKNTVVFIDDIMIFGETEEE FT HDAAVKQTLHVLSQYGILLNDHKCRFKQREVKFVGHQLSADGVSPSEEKIK FT SILSCRAPRTKEELRSFLGLVTYVSR" FT CDS 3956..5293 FT /product="Gypsy-30_AA-I_2p" FT /translation="MIRAVQESAAIDIGEVVQATASDDELQDLIKCIETDH FT WERENVKQYKAFRLEFSFVNKLIMRGTKLVIPTALRSRMCELAHEGHPGQS FT MMKRRLRERCWWPGIDVDAVKFCEKCAGCQLVQSIDPPEPMMRRTLPEKPW FT IDLAMDFLGPMPSGEYLLVVIDYYSRYIEIEIMNRITAQETIKRLRRIFRT FT WGPPRTITLDNAKQFVSTEFKEYCSINGIHLNHTSPYWPQANGEVERQNRS FT LLKRMKIAHAIHDDWKVELDSYLDLYNNTPHTITGKAPSELLQGRKLRSKL FT PQFDDLETTPPNTDFRDQDLVKKVQQKEREDTRRRAKLSPIAAGDIVLMKN FT LLPSNKLSTNFLNEKFTVLDRNGSNVTVQSNKSGKQFNRNVSHLKKFNALS FT DAENEESTMEPNTHQVIQKSTEYPETAATDVSGSGPMRRSLRIIRPADRFS FT P" XX SQ Sequence 5390 BP; 1745 A; 991 C; 1220 G; 1434 T; 0 other; aaccgtggcg acgaggtgaa aaccaggtta aagcatcgag taataatttg aaaatcgtgg 60 caggtatgtg aaaaaagtga aaattagtgt tggattcggc ttgtgtggtt agcgccatat 120 tggaaatcgc agaagttgac cttgaaaaaa aatcaacgct gaattggcac agagttactt 180 atctatctat aagagtttta aagtggtgct ataaggtgtt ttgatgtaag cgaacgaatg 240 tgtgaatgaa gaaaatctgt tttttcttta gcataaaatt aattcaagtg ctggaaaaaa 300 agaaaatgct aaagtgacgc cacatcattt tttttttctc tcctattgtg tgtattggtg 360 atggaaagta tttctttatg atgggaagga atgaattaca gaaacactat gtgtgtagta 420 gtgagaaaaa tcgttgatgg tgaaatgaaa ccgtatgcgt gtttagaaag aaaagagaaa 480 attctaattg gataattatg tgtcgactgt cgaaggtgtc tgtttggcac gaatatgaat 540 gagatgcatg aaaggttgtg tagccgattc aatgccggtg tgaaacggaa tacatatgaa 600 catgttgatg aataagcaac cgattcaatt ccggttgaca gtaattgatg gctacgaaac 660 cgattgaatt ccggtttgaa gaaatagtta gacaagaaac cgattaaatt acggttcaca 720 attgtgttgt tatgcaagat gttggttatg tgaccgattg aattacggtt gatgatgcgt 780 gttgtgaata ttttgctgga tattttgaaa gtgttaccga atctattcag gctcagaaaa 840 atatccaatg tctttgatgt gtaaattaac ttgatgaaaa aaaaaaacaa aatgaaatgg 900 aaagaaaaga ttatggattg tggtgaaaat aaatgtgtag caataagaaa atatgcttca 960 gtagaaaata tattgaaaag ctcatgatgt gtgttctgtt atgttttttt tggatgcagc 1020 ttcagcaaaa tggaaggaat cgaagacagt caacaaatcg ttcagcaaca tgtgcagttt 1080 tttagcccgg catcgtataa tctacctcac ttcaagtata agcacctccc acaaaccgag 1140 gtgcgaaatg cttggaacat gtggattagg tggtttgata atgtcatggc tgcagctaat 1200 gtgttagatg aggcgagcaa gaaggtgcag ttattggcca tgggtggcat ggagctacag 1260 tgtgcctact acggccttcc aggtgctgat gaagaagcaa acgaagaaga tgactgcttt 1320 ccttatcaaa gcgcgaaagc caagttaaca cagcattttt caccgaaaca ccacgacagt 1380 tttgagcgtt tcctcttttg gtcaatggct ccccaggaag atgaaccaat tgagaagttc 1440 gcttcgaggg ttcaaatgaa agcggagaaa atctgttttg gaaagactgc tcttgaaagt 1500 cgtcatattg ctattatcga caagataatt caatatgcaa cagatgatct gcgccaaaag 1560 ttgctggaga aggaggtgtt gaccctcgat gatacaacaa aaatcgtgaa cgcttttcaa 1620 gcggtgcggt atcaatctgc taaaatgaac agcaaagaaa aggaaaacgt aacaatcaac 1680 aatgtcaaat ggtcatccta gtcccaacac tcctgcagaa gataagaaga aatgcttacg 1740 ttgtggatat agatcgcatc gtcctgggga tagatgcccc gcatccaaca aaacctgctt 1800 acaatgtgga atagtaggac atttccgaac ggtatgtcga tcgaagcaaa ggtcacagtc 1860 caacgatgat tgggtaagtc gaaccagttc tattttttgt ttatgttata ttatttggga 1920 ttcaaatctc cagggtactc ggaaacgtaa gcatagcttt acggctacca gagacaaata 1980 tcgatcagag aaacgcgcta gaaatgtttt taacgtcaaa gaaaatagca gtgatgaatc 2040 accagaagac ttgccggttt acaatgtggg aaatggagaa gatgaagtca ttcaatgtcg 2100 tgtaggcggt gttcagatcg ctatgctcat tgattcaggt tctaagcaca accttatcga 2160 cgatacaaca tgggaattga tgaaattgcg ggacgtgaag ataaccaacc aaagagtaga 2220 caccgataaa cgattcttag catatggtcg catacccttg aagcttatta cgactttcga 2280 tgctgagctc gaagttgatg ataacggcga tttattgaga actagaacat ctttccatgt 2340 cattcaaaag ggtcaacaac cacttctagg aaaagtgaca gctcaaaaat tgggagttct 2400 gaaggttggg ctacctagca ctcactcttc atcaatcagg acaattgaaa gttccacaaa 2460 ggtttttcct aaaatgaagg gcataaaatt gaacatccct atcgatagaa ctgtccctcc 2520 cgttattcaa ccgttgagac gctgcccgat tcctatgttg gaaaaggtag aacaaaaact 2580 gaatgagctc ttagaaatgg acattattga gagagtaatg aaaccaacat catgggtttc 2640 acctttggtg cctatcctta aagaaaacgg agatttacgg ttatgtatcg atatgagaag 2700 agcgaatctg gcaatccaaa gattgaacca tcctctacct atcttcgaag atcttgtagc 2760 aaggtttcgc aacgcccgat ttttcacatc actcgacatc aagcaggcct ttcatcaggt 2820 agagctatcg gaggactgcc gcgatatcac caccttcatt acgaactggg ggctgtaccg 2880 gtataaacgg ctgttatttg gaataaactg cgcgcctgag ctcttccaaa atctcatgga 2940 aagcattctc gggggatgta agaacactgt tgtgtttata gacgacataa tgatattcgg 3000 agagacggag gaagagcacg acgctgcggt aaaacaaacg ctacatgtct tgagtcaata 3060 cggaatactg ctgaatgacc ataagtgtag gttcaaacaa cgggaagtca agtttgttgg 3120 acatcagcta tcagcggacg gcgtatcgcc cagtgaagaa aagatcaaat caattttatc 3180 ctgcagggct ccacgtacaa aagaagagct aagaagtttt ctaggtcttg tgacgtacgt 3240 ttcaaggtaa acttatataa ccttgtaaca tattttcgat gtgtttgtaa tttggtgtac 3300 tattaggttt attccgaatc tagcgaccat aaactaccct ttgagacagc ttctgaaaca 3360 gggaattccc tttgagtgga aagcgttaca tcaagagtct ttcgaaaaag tgaaatcact 3420 cattggatcc gtaggcagct taggtttttt tgaccctaaa gatcgcactt tgctagttac 3480 tgatgcttct ggagtaggat taggggccgt cttcatccag ttcaagaact gtcaaccaag 3540 aattatcagc tatgcttcga aaagtttatc cgagatcgag aaaacctatc cacctattga 3600 aaaagaagcc ctcggtattg tctgggcagt tgaaaggttt agaaactatt tgctcggagt 3660 cacattcgaa ctagaaacag accatagacc tctggaaaca ttattttcgg caacatccag 3720 accaacagcg agaattgagc gatggctgtt gcgaatccag gctttcaaat tcaaagtcat 3780 ttaccgcaaa ggctcggcta atttggcaga ttgtttgtct agattagcgg cacatgttga 3840 agaccctcaa tggacagaag aaacggatgt tttcataaga agagttgtgg tgcagtcttt 3900 gtctattctc tcgacttcat ccgatacgca agattttgac acaaaaactg aagaaatgat 3960 cagagcagtt caagaatcgg ctgcaatcga tattggagag gtcgtacaag caacagcttc 4020 tgatgatgag ctacaagacc tcatcaaatg catagaaacc gatcattggg agcgagaaaa 4080 tgtgaagcaa tacaaagcct ttcgactaga attctcattt gttaacaagt tgattatgcg 4140 aggaactaaa ctagttatcc ctactgctct gaggtcaagg atgtgtgaat tggctcatga 4200 aggtcatcca ggacaatcaa tgatgaagag gaggcttcgg gagcgatgtt ggtggcccgg 4260 aatagatgta gatgctgtga aattttgtga aaaatgcgca ggatgtcaac tggtacaatc 4320 tatagatcct ccagaaccca tgatgcgtcg tacgctcccg gaaaagccct ggatagattt 4380 ggcgatggat tttctggggc ctatgccatc cggagagtat ctgttagttg taatcgacta 4440 ttacagtcga tacattgaaa tagaaataat gaaccgaatc actgctcaag agaccatcaa 4500 acgtttaagg cgcattttca gaacatgggg tcctccaaga acaataacgc ttgataacgc 4560 gaaacagttt gtttcaactg agttcaagga gtattgttca atcaacggca tacatctcaa 4620 ccatacttcg ccatattggc cacaagcaaa tggcgaagta gaaagacaga accgatcttt 4680 attgaaacgg atgaaaattg ctcacgctat tcatgatgat tggaaggtag aactagacag 4740 ttacctggat ctatacaata atactccgca caccataacc ggaaaggcac ctagtgaatt 4800 gttacagggc agaaagttgc gatcgaaact tcctcagttc gacgatcttg agacgacacc 4860 gccaaatacc gatttcaggg atcaagattt ggtcaaaaag gttcagcaga aggagagaga 4920 ggatactaga aggcgtgcaa agctcagtcc tatcgctgct ggtgatattg tgctaatgaa 4980 aaatcttctt ccatcgaata agctatccac caacttcctc aacgagaagt ttactgtgtt 5040 ggatagaaat ggatcgaacg tgacagtgca atctaataaa tctggcaaac agttcaacag 5100 aaacgtttca cacttaaaga aattcaacgc tctatctgat gctgaaaacg aagaatcaac 5160 gatggaacca aatacccacc aagtcatcca aaagtctaca gaatatcccg aaaccgcagc 5220 aactgatgtt tccggttcgg gacctatgag acgatcgctg cggatcatca gacctgccga 5280 cagattcagt ccttgaagat aagtatacct tttgttctag aagtattgtt ctatttatca 5340 tcacaaaaat gcgaataaaa tgtttctata aattggtaaa aaaaagggga 5390 // ID Gypsy9-I_Dpse repbase; DNA; INV; 4981 BP. XX AC Unknown_group_264; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy9_Dpse; KW Gypsy9-LTR_Dpse; Gypsy9-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4981 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1072-1072 (2009). XX DR Genome; Unknown_group_264; Positions 12625 17605. XX CC Positions [3771-4247] - Integrase core CC 'ATTG' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2661..3665,3669..4607) FT /product="Gypsy9-I_Dpse_2p" FT /translation="MAHVCRLSAANRAFYPGRVPATQDQLHTRATPKCPLH FT HDPRPEKRVLADTNGPRQPGSHGIHDTRPRIIPMESSAFRFTLRGSHVSTD FT PRRRYRRRHGTIRVRISRRHRHRREKLPGTPEQRKRSFRTTAKGKPTGRLQ FT PWSRRGPHPRDPIAYASRRLNKAEENYSATDKECLAVIWAIRKLRCYLEGY FT RFDVITDHLESPYGRVARWALELQQYQYDVHYRAGNQNVVADALSRQPLET FT LSRIEEEDNTACTWLKQRIQQVQDEPQKYPDYTHENGQLYRYLGHGADDDD FT HIPWKLCVPKNGRTRVLRECHDEPTAGHLGVRKTILRTTQPYYPGLHRDVR FT RYVQGCVSCQKFKATQRKAVGRMLTRKPEEPFATLCADFVGPLPRSKHGNT FT ILLVFFDNFSKSVELVPLKKPTTANLERAFRERILSSFGVPKLFVCDNGIQ FT FTSRSFRTFMRSAGVELQHTAPYWPQENPTERANRTVKTMITQFVDDHQNT FT WDELLPEMTLAINSSVSETTGFSPAFLLQGREPRLPAAVYDEVTPGTGSTP FT ETATSKATRLKEEFAIVRNNIQRASQEQGRHYNIRRRLWRPAVGSLVLLRQ FT HHLSNAAEGFAAKLAPKIDGAYRVKSFPSPNIANYNIAKAGNTKPQISTT" XX SQ Sequence 4981 BP; 1438 A; 1620 C; 1233 G; 690 T; 0 other; gtggcgctcg agcagggacc caaaggaacg tcagcgaggg tttgaagaag accacaagag 60 aacaacccaa agaaaacccc gacctcgcgc aatatcaacg gcaagagttg cgcaacgagc 120 agacgaaagc gcgacacagc gacagagaaa cgcgaacagt gaagtcaccc cgagaagaaa 180 ttcgaagcgt cagccgcgaa ggaaaatcgc aagcccatcg aaacagccaa ctcgagcaac 240 aatcggagcg atggaatccc atacagaaga acagcccagt gggtattacc ctggagaatt 300 cgtcgccgag acgagctgga gtccttcgcg gaggaattcg gattcagccg agagggtaac 360 gtggaagatt tgcggagaaa tttcgccgaa ctagtcgccg ggccccgcca gtacgagcca 420 gatcgcccca gcccgacatc tcagcactac cagagaagca gaagacggac gccaaactca 480 gcctacaagt tctaggagtc atggagggca aagcaagcac ctcactgaca tcgataaccg 540 tcgaagatct accgaagccc ctgcccccgg agccaaggcc ttcgcccatg agaaggatct 600 tgcaggaata gcaggaaggg atggcccctt agacggtcat tactacagtc acgccgcaca 660 gatggcggaa cggatccgaa agtggctcct caccgaccga gccaaccgtt ggttctgcac 720 cagccgactc cagagcgcct catgggccga tttccgccag gaattcctag cctttttcct 780 gcctccccgg tacttcgagc aattggaaga ccagatcagg gaccgcaaac aagcggtggg 840 cgaaccattc aaggacttcg tcatcgagct ccggttgctc atgcaccacg ccgggtacga 900 cgaagccaag gaacttagcc ggatctatga caacaccctc ccggcgtacc agctgtacgt 960 gcgaagacac gaactgagaa gtctgacgca attaacaata ttggccacag agttcgagag 1020 tatccaggaa cgaaacgtac cagcgaccct catccgccct cgacaacccc ccaggtacac 1080 gcggccaacg gcacaacaaa acgaaagcgg gacgaatcgg gcccaatccc gaaagcctca 1140 ctggaacacc tcggaacctg caaggaagcc gcaccaggcc agcacaggaa acggcacatc 1200 gaatgatcac cacccccatc gccaacccgc acgcagcctg tcgacgctgc ggagaagctc 1260 cggagaagcc ccctcttctg gtgggaatgt ggcagaagcg tcatcaacgc cgacctcgcc 1320 cgcaaactcg acctacgagc cgcatgcgtc ccgatacgcc tagctgacgg ctcagctcgc 1380 gaggtaactc agatcttgtt agcgcgtcaa actagacgaa ccagaagtac agatgccact 1440 cctggtacta cccaatatgg tagacacaag ccatcatagg gatggatttc ctgtgcgcca 1500 ttgaaaccac aatacaatcg ccacacgcct ccatctccaa ggctctcgac gccccacccc 1560 aaacatgccg atgatcaccg ccgaagtaca cgccccggac ccacgaactc gccacgacac 1620 gagagaagag cacaaggaac aaggaacaca gaacccttgg ccacgacgag gcacaccagc 1680 cccaagccca ctccgagaaa cccaaaacta agcagagcca cgaagaaagc gagccggtct 1740 cgcgagggca ctgacgcggt tctaccaacg aaggaaaagg aacccaggac gcacctctag 1800 aagataaccc ggccccagca acgacggccg ccacgcaaca cccaggctct cgaggtacca 1860 cctccgatcg ccaccatggc ccacgaacgc gtcgacaccg ccctcgacgc gaaaacgacc 1920 cctacgacac cgaccgaccg ctactcccgc gtcctctcag gcaggaaccg ggaaaggcga 1980 tcactgccac acgaggcacc catcggaacc ccgacgcacc tggagccgac gcaccatcga 2040 tcgtcccaga gatcatggaa cgcctcccga gcaacacctc gtacggcgcc ggcatgcggc 2100 ccagcaccac gcctggagcc atgtcgttga acgtcatcgc ccaaactagc cgttggtcaa 2160 acgctacgat cgccatcgag acccgcacgt tcgtacatca cgctgggtac aacgtcgcgc 2220 tagcaccccc caggacacac acgagccccc tactcatgat cgaggcaccc gccgtcgtca 2280 aaccaccaag cgcattcgac acaacgatcc tccccgagag ccccgaccac acttgccacc 2340 caggaatcga cgacccacct taggacggtt cggaacgaat ctccgaccca tggccttccg 2400 acatcgcgcc gaaaatacag gaatttctag acgaggaact gccaacactc gcgggcctcc 2460 gagggacgac gggcctaacg gaatacacca tcgtcatgag ggacaaccga cccataaaac 2520 agcggtacta cccgaaaaaa tccagccatg cggcggatca tcgatgaaca ggtcgaccaa 2580 ctgatagctg aggacctaat cgaaccatcc cggagccccc acagcgcgcc aatcgtcctg 2640 gtctgcaaaa agaacaggag atggcgcatg tgtgtagatt atcggcagct aatcgagcgt 2700 tctatcccgg acgcgtaccc gctacccagg atcaactaca tactagagcg actccgaaat 2760 gcccacttca tcacgaccct agacctgaaa aacgggtatt ggcagatacc aatggcccgc 2820 gacagccggg aagccacggc attcacgata cccggccgag gattatacca atggaaagta 2880 gtgcctttcg gtttacactc cgcgggagcc acgtttcaac ggaccctcga cgacgttatc 2940 ggcgcagaca tggaaccatt cgtgttcgca tatctcgacg acatcgtcat cgtcgggaaa 3000 agcttccagg aacacctgaa caacgtaaaa gaagttttcg cacgactgcg aaaggcaaac 3060 ctacgggtcg actgcaacct tggagtaggc gcggtcctca cccaagagat ccaatcgcat 3120 acgccagtcg acgcctcaac aaggcagaag aaaattactc ggccaccgac aaggaatgcc 3180 tcgccgtcat ctgggcgatc cgaaaactgc gatgctatct cgaaggctac cgattcgacg 3240 tcataaccga tcatctcgag agcccttacg gtagagtagc aagatgggcg ctcgaactac 3300 agcaatacca atacgacgtg cactatcggg ccggcaacca gaacgtagtc gctgacgcgc 3360 tgtcaagaca accattggag accctctccc gcatcgagga ggaagacaac acagcatgca 3420 cctggcttaa acaaaggatc cagcaagtcc aagacgaacc ccagaaatac ccggactata 3480 cgcacgaaaa cgggcagcta taccgctatc taggacacgg agccgacgac gacgaccaca 3540 taccgtggaa actatgcgtg cctaaaaacg gccgaacgag agtactccgc gaatgccatg 3600 acgaaccaac agctggacac ctcggcgtcc ggaaaaccat cctgaggaca acacaaccat 3660 actactaacc aggactacac agagacgtgc gacggtacgt gcagggctgc gtaagctgtc 3720 agaaattcaa agccacgcag cgcaaggccg tgggcaggat gctcacacgt aaacccgaag 3780 aacccttcgc caccctgtgt gccgatttcg tcggaccatt accacggtcc aagcacggga 3840 acaccatctt actggtattc ttcgacaatt tttccaagtc ggtcgaactg gtacccctca 3900 agaaaccgac aacagcaaac ctcgaacgag ccttcagaga acgaatactg agcagcttcg 3960 gcgtgccgaa actattcgtc tgcgacaacg gaatccagtt caccagccga tcgttcagga 4020 cattcatgcg aagcgcaggc gtggagctgc aacacacagc cccgtattgg ccccaagaaa 4080 acccgacgga gagagccaat cggactgtga aaacgatgat cacccagttc gtcgacgacc 4140 accagaatac atgggacgag ctcctacccg agatgacatt ggcgatcaac tccagcgtct 4200 ccgagacgac cggattcagc cccgccttcc tcctgcaagg aagagagccc cgactccccg 4260 ccgccgtcta cgacgaagtc acgcccggga cagggagtac gccagagaca gccaccagca 4320 aagcgacgag gctgaaagaa gaattcgcca tcgtccggaa taacatacag cgagccagtc 4380 aagagcaagg gagacattac aacatccgcc ggcgactatg gcgccccgcc gtcggatcac 4440 tagtactact acgccaacac cacctctcaa acgccgccga gggctttgcg gcaaagttag 4500 ccccgaagat cgacggcgcc taccgggtca agagttttcc atcgccaaac atcgccaatt 4560 acaacattgc gaaagccgga aacacaaaac cgcaaatatc aacgacctga gagagttcca 4620 cgaccacgag tccgaagaat ccactgatct acctgagaat gctgacatcg acgaggaaag 4680 caacgagaca ttaacccgac gccccaacca cggcaccaca gagtcagcaa atcccgggat 4740 acccgaccaa aagacagcaa cggcaaaaag cgagcccgat ggcataatac caccgaaaat 4800 gcaagcgcag cgaaccgcgc gaccccactt aatcgaacag cgctgacgcc acgaccgcgc 4860 gccccacaaa tccgttgacg aggcgaagaa aacgcggaaa ccccaacgca gaccacagaa 4920 gaaaccacac gagcccggcc cactagtcct cgtaactctc gccggggaag aaaggggagg 4980 g 4981 // ID Sola1-1_BM repbase; DNA; INV; 3740 BP. XX AC BAAB01062465.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Bombyx mori. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3740 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(712..1950,1954..2667) FT /product="Sola1-1_BM_1p" FT /translation="MNRSAQILKMVFSEDQENPVVYNKPIFSPLDTVRSLV FT KSDVNNISNLNSHSYVSDSQIVPYFMKNDLPGPNNVDMEPTVNYSESVSSN FT QIQLSDLQRPRLNKVVPESNCSSEYNDDNSKSEYSNSQNDFSSDDVDNDPD FT FLYKDLELSSTSEDESITMSNRREKIIEIEQEKKKGRKRRAVPKEWKKNKA FT KLLRNSGKAYLSSSKSKKLMKERELKPTCNDKCKLKCFNKISEEKRQIIFS FT NYWKMADLEKQRQFINKYVKAIKPKYRYIREGSTRKDYNHAFYFEVDKEDI FT RVCKTFFKNTLGISERPIRTVIYMQNSVVGGFLAGDNRGRHGNHNKLDSTV FT IERIKEHINSIPRIESHYCRATTSREYIEGGLSIAQLHRDYVEKCRAESRS FT YADYQIYYNIFSKEYNISFSPKKDQCEDFAAYDNAEDKQPLEDKYFSHLEE FT KDLVRKEKELDKKNTNNKCIAATYDLQAVMPCPRGDISNFYYISKLNVLNF FT TIYELGSKDVNCYVWHEGEGLRGVNEIGSCVLNYLRRLQEQSKEDFQVIFY FT SDNCCGQQKNKFMIAMYIYAVTNLKNLTSITHKFLIKGHTQNEGDSAHSTI FT ERNISRSLKSAPIYVPEQYITLIRTAKKKGNPYKVHELNHEISLTLKRLRM FT A*" XX SQ Sequence 3740 BP; 1435 A; 519 C; 575 G; 1211 T; 0 other; cgtcggttaa atagaaaagt gacatttctc aaaaatattg taaattaata ttatctgtta 60 aaacgttttt ttatcgtatg tattttaata gaaaaatgac acatcatcaa attaaacgtc 120 aaaaactccg tactgccaca agtgtcactt tgatactctg tggtaattcg ccattttcat 180 tgttgaaaaa cgtcactctg ccatttgcaa cgcaaacgcg aacgtgcgaa cacggacgct 240 tctcaagaac agtgacattt tccaaaaaag tgcttgaata caactgtaaa aggtaagtaa 300 tatctacgaa ataatgtatt ttttcgagaa acgacgtttt tcaatttaat tttggtttta 360 gttgtgccaa aagttttaaa gaagaatgta tttttttgac atgtgacatc aatttgagaa 420 attatatgaa agaaatacga ccgattctac ttaggacatt tatctatata attatacatt 480 tagctgtaat ataatagatt caatagaaaa tagttacaat ttgaaatatg actgtaaggt 540 tcaaaaaata gattattgtt catgtttttt ttctaaataa agatttatct tttgcaatat 600 ttacttaaag gcattttaca aaattgaaag tgcaatgttt ttgaatggac ttaaaaacaa 660 tgtatttttt taaattaaat ctttatataa atatattcat ttatttttag gatgaatcgt 720 agtgcgcaaa ttttgaaaat ggtgttctct gaagaccaag agaatccagt tgtttataat 780 aagcccattt tcagtccttt agatactgtt cgcagtcttg taaaaagcga tgttaataat 840 atatcaaatt taaatagcca cagttatgtt tcggattcac aaattgtgcc atattttatg 900 aagaatgatc tacctggacc aaataatgtt gatatggaac ctacagtaaa ttattcagaa 960 tcagtatcaa gtaaccaaat acaactttct gaccttcaac gtccacgatt gaataaagta 1020 gttccagaat ctaattgttc ttcagaatac aatgatgata attctaaatc tgaatattct 1080 aatagtcaaa atgacttttc atctgatgat gtcgataatg accccgattt tttatataaa 1140 gatcttgaat tatcttctac atctgaagat gaaagcatca caatgagtaa tcggcgagaa 1200 aaaataattg aaattgaaca agaaaagaaa aaaggaagaa aacggagagc tgtacccaaa 1260 gaatggaaaa aaaataaagc aaaactatta aggaactctg gaaaagcgta tttatcgtca 1320 tctaaatcaa aaaagctaat gaaagaaagg gaattaaaac caacttgtaa tgataagtgt 1380 aaactgaaat gttttaataa gataagcgag gaaaagcgtc aaattatttt ttcgaactac 1440 tggaagatgg cagacttaga aaaacaacgt cagttcatta ataaatatgt aaaagcaatt 1500 aaacctaaat atcgctatat tcgtgaagga agtactagaa aagattataa tcacgcattt 1560 tattttgaag tagacaaaga agatatccgg gtttgcaaaa catttttcaa aaatacatta 1620 ggcatatcag agagaccaat aagaacggtg atttatatgc aaaactctgt agttggaggt 1680 tttctagcag gtgataaccg agggagacat ggaaaccaca ataaacttga cagcacagtt 1740 attgaaagga ttaaagaaca catcaattcc attcctagaa tcgagagtca ttactgtcgg 1800 gctactacaa gtcgagaata tattgaagga gggctttcca ttgcgcaatt gcatagagac 1860 tatgttgaaa aatgccgagc agaaagtcga tcgtatgctg actatcaaat atattataat 1920 atatttagca aagaatataa tatttcgttc tgatctccaa aaaaggacca atgtgaagat 1980 tttgcagctt atgataatgc agaagataaa cagccattag aggacaagta tttttctcac 2040 ctagaagaaa aagacttagt caggaaagaa aaagaactgg ataaaaaaaa tacaaataat 2100 aaatgtatag ctgccaccta cgatttgcaa gccgtaatgc catgtcctag aggtgacatc 2160 tccaatttct attacatttc taaactgaac gtactcaact ttaccattta cgagcttggc 2220 tccaaagatg tgaattgcta cgtctggcac gaaggtgagg gattaagagg ggtaaatgaa 2280 ataggctcct gcgttttaaa ttatttaaga aggctccagg agcaaagtaa agaagacttt 2340 caagttatat tttacagtga caattgttgt ggacaacaaa agaataaatt tatgattgca 2400 atgtatatat acgctgtaac taatttaaaa aatttgacgt caatcactca caaattttta 2460 ataaaaggcc acacccaaaa tgagggagat tctgctcatt ctactattga gcgcaatatt 2520 tccagatctc tgaaatcagc tcctatttat gtaccagaac aatacataac attgattaga 2580 accgcaaaaa aaaaggggaa tccctataaa gtccatgaac taaaccacga aatttctttg 2640 acattaaaaa gattgcggat ggcataggac caaattatac aacaaatgaa gatagagaaa 2700 aggtaaagat gggtgatatt aaagttatta aagtggagaa aaggtacaaa gaccgatttt 2760 ttacaagtta tcttataaag aaaatgattt taaaactgtt gtcatcaaaa ctcgaacaac 2820 caagaaaaat acagagttca cagaattaga acgtctttat tcctcaaaat taggcgtttc 2880 taataataaa aaagcaggca tacttacact aatagataaa aatattattc ctaggtttta 2940 taaaagcttt tatgaaaatc tatagatatc taaataatat cttatttcta agattgtata 3000 gataagctta agtatttggg ttttggaatc catgagatat tgtttaatgt tactggaatt 3060 tcaatgtttg tgttttttaa tactttggat aagtttgatt ctataaaata tgctttattt 3120 gatttttgag actgataata taatttagtc ctatactaac taaacaagca ttcatattga 3180 aatgtgatta agattattag atttagagta ttatatataa taataaaatc ttaatttaaa 3240 tcaaactgaa cattatttac ttaaatatta taagaagact tcacttttaa ctatttgatc 3300 tcaatgttgt ttccattgat aatttaataa gttcatttgc gacaagatac agatttgtta 3360 attttcgaat tactattgaa tgttaattca taacactttt taaataaata tttactttta 3420 taccacattt tatcttatta aaactgtttt tcgactacaa aaacgtttaa tttttgaaaa 3480 aatacttatt cagtggtatt catttcatat gtggaaaaac gacacttcca gctagaattg 3540 ccatattttt tattaaatta ttaaaaacaa ataaatcatc attatcagca tttaaatacc 3600 tattaaatga tatacaaaac ttattagtta aaatctaaca gatgccgaaa aattaatttt 3660 aataaaattc aatactttta actatgtgta ctgtaaagtt tggtcttttt gatttatgtc 3720 acttttctat ttaaccgacg 3740 // ID Gypsy-5_AA-LTR repbase; DNA; INV; 177 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_AA_; KW Gypsy-5_AA-I; Gypsy-5_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-177 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 980-980 (2011). XX DR [2] (Consensus) XX SQ Sequence 177 BP; 64 A; 29 C; 28 G; 56 T; 0 other; tgtggtgaat ctgaaaaaac atagtaggcc cgacgttcag accgtcagtt tgaaaactga 60 gatcttttag tgaacacacg ttatcagtcg atgttacttt gcaaactaaa tatcattaaa 120 ccctaagata gttttcattt aaatgtacaa ctttataaat tagtgaataa tatcaca 177 // ID Gypsy3-I_Dmoj repbase; DNA; INV; 4453 BP. XX AC scaffold_6498; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_Dmoj; KW Gypsy3-LTR_Dmoj; Gypsy3-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-4453 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1038-1038 (2009). XX DR Genome; scaffold_6498; Positions 3079539 3083991. XX CC Positions [3871-4140] - Integrase core CC 'GTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 874..4452 FT /product="Gypsy3-I_Dmoj_1p" FT /translation="MTESVIKPFLCESLGNALLQNEWEKWLRAFMIYLEVE FT DVKSVIQKRTKLLHLGGTQLQTVAYALPGAIVEFSEADQNDVFTVLVDKLT FT AYFSPKQNSTFERHLFRKISPEEGETFGKFILRLRQQIQKCNFGSTKAEIE FT EICFKDKIIDTWVSTELKKKLLEKEHTLNEIIEACQVDEEINKQSQLMQSK FT PVPETIHKITNLKSGKIGDCGRCGRPGHREDSPMCPARQVKCSRCSRVGHF FT ARRCRTNLKRPYPHSNNKEAIGNPKRKYVHRIENGNPEGNKTIQRNCFKID FT SDEEYEEMMQCRIGGADVSLVIDSGSRFNIISQDDWLTLQLKKATVLNVRG FT SSQNQFRGYASDQILNVICIFEAPISIKLNAEVIASFFVIEKGRQSLLGRE FT TATKLNVLRVGLPINNIEFVPPFPKWKGVRVKLTIDPYVNPVKQPMRRIPV FT ALEGRVNAKLEEAYKLDIIEPVEGHSPWISPMVITFKGNGDIRICLDMRQA FT NRAILRENYPLPTFESFMTKLKNARIFSRLDLKDAYHQLELDESSRQITTF FT ITPRGLFRYKRLMFGINAAPEIFQRRMEELLAPCNNVMNYIDDVIIFGDSK FT EEHDKTLEQIRSIFKDNNVTLNNEKCSWETNRIKFLGHILTDKGIEVDPDK FT TESIRLFRAPKNKEETRSFLGLVTNVGKFIADLADLTEPLRELVKKDRKFT FT WGPTEENGFQKLKSVLTKIPNLSYFNPKNKTRLIADASPVALGAILLQFND FT EGEPNIIAFASRSLTDVEKRYSQTEKESLALVWAVEKYYFYLIGLEFELVT FT DHKQLETIFKPTSKPPARIERWLLRLQAYKFKVIYKSGKENIADSLSRLCE FT QTPSSCYDIKGEHSILRIVESSAPIPISISEIAENSTIDEEIVDAMTCLQQ FT GSWDSAMSKKLYPFRHELSAIGAIMLRGVRIVIPTTLRQKVLVLAHEGHPG FT ESAMKRRLRSKVWWPQIDRDTEKFVKNCFDCILGSQATNPPPMKRTEFPNG FT PWISVATDLLGPLPNNEYILVLIDYYSRYMEFKVLRSITSESLIGACKEIF FT SRLGYPKQLRTDNGRQYISLEFKNYCKSCGIEQLKNVVFPHKLTPMFDTTE FT YEVLKREGNIVQVSGGGKTLLRNANHLKRVLGETEPADVTNAKSVPIPATK FT PTPQPAAQFEDQAPAEDHGNSTEGLKLKLVRKGG" XX SQ Sequence 4453 BP; 1578 A; 802 C; 997 G; 1076 T; 0 other; ttggcgacga ggtgaaaaaa atgtaaaaat ataacaaact agataattat agttaaataa 60 aaagaaaaag atgaaacaga acggtgaaaa agaaaaaaaa acaatataaa tgcagcgtgg 120 ctgtgtcaat gcaattccaa tgcctggata ggatccgtta gttttcgttg tctttccctt 180 ttcgcgtgac gttttttgtg ctcgcgagtt cgaagagaaa gaaagagaaa tattcgtaag 240 agaaacccta aagcaatgta attacaagaa gcgtaagtgt gaatgtatac acacatacgg 300 acgcgtacac atgaacatgc aaaataaagt aaaatgcgtg gctcacgggc aatgaaaaaa 360 aaaaaaagaa gaaagtgaaa gagatgccga ctaattattc aattattatg aaataagaag 420 tattatctga caaaggtcag gggaaaggaa aaaaatccga ctaaatatta gacaaatgtt 480 taaaaagatg acgatctgac aaaggtcaga ggcaaaaggc cgattagtat taaacaaatg 540 tttaataaat agaattatct gacaaaggtc agtggcaaaa cgccgattag tattaaacaa 600 atgtttaata aatagaatta tctgacaaag gtcagaggcg gaatgccgat cgaatattca 660 ataaatgttt aataaataga attatctgac aaaggtcaga ggcggaatgc cgattagata 720 ttaaacaaaa gtttaataaa catttcttat attatctgac agtggtcagt ggcagtccgc 780 cgatgagaaa gtcaataaaa tgcaacttgc tagacagaag catctaatta atggttgatt 840 ttatttgcag acaaacatct atacttgatc gcaatgaccg aaagcgtcat aaaacctttc 900 ttatgtgaaa gcttgggaaa cgcactacta cagaatgaat gggaaaaatg gttgcgagct 960 ttcatgattt atctggaggt ggaagatgtc aaatcagtaa ttcagaaaag aacaaaattg 1020 ttacatttag gtggtaccca attgcaaacg gttgcatacg cattgccggg tgccatagta 1080 gaatttagcg aggccgatca aaacgatgtc tttactgttt tagtcgataa gttgacagct 1140 tatttctccc caaagcagaa ttcgactttt gaaagacacc tttttaggaa aatctctccc 1200 gaagaaggtg aaacatttgg aaagttcatt ttgcggttgc gtcaacaaat tcaaaaatgt 1260 aattttggat ctacaaaggc cgaaatagaa gaaatttgtt ttaaagataa aatcatcgat 1320 acgtgggtta gtacagagct aaaaaagaaa ctattagaaa aagaacatac tttaaacgaa 1380 attatcgaag cctgccaagt ggacgaggaa ataaataagc agtcgcagct gatgcaatcg 1440 aagccagtgc cagaaaccat acataaaata actaatttaa aatcggggaa gattggggat 1500 tgtggaaggt gcggacggcc aggacacagg gaagacagtc caatgtgtcc agcacgtcaa 1560 gttaagtgca gtcgctgctc tcgcgtaggt cattttgccc gcagatgcag aacaaacctt 1620 aaacgcccat acccacattc gaataataaa gaagcgatag gaaatcccaa gcgcaaatat 1680 gtgcatagaa ttgaaaacgg gaatcccgag ggaaataaga caatccaacg aaattgtttt 1740 aagatagaca gtgacgagga gtatgaggag atgatgcaat gcagaattgg aggggcggat 1800 gtttcactag taatcgattc gggatctcga tttaacatta tctcgcaaga cgattggtta 1860 actcttcaat tgaaaaaggc aacagtttta aatgtccgcg gaagttcaca aaatcaattt 1920 aggggatatg cctcagatca aattcttaac gttatttgta ttttcgaggc accaatatct 1980 atcaagttga acgccgaggt gattgcatcc tttttcgtaa ttgaaaaagg acggcagtca 2040 ctgctgggac gagaaacggc aacgaagttg aacgttctgc gagtgggact accgattaac 2100 aacattgagt tcgtgccacc atttcccaag tggaaaggtg ttagagtgaa gctaaccata 2160 gacccctacg ttaatcccgt gaaacaacca atgaggcgca ttccggtggc gctcgaaggg 2220 agagtcaatg ccaagttaga ggaagcctac aaactggaca tcattgaacc ggtcgagggt 2280 cacagcccgt ggatatctcc gatggtgatc acgttcaagg gtaatggtga tattcgtata 2340 tgcttggata tgaggcaagc caaccgggcg attcttcgtg agaactaccc tttgccaaca 2400 tttgaatctt ttatgactaa attaaaaaac gccagaatat tttcacgact ggatcttaag 2460 gacgcgtatc atcaattgga gttggatgaa tcaagtaggc agataacgac cttcattaca 2520 cctcggggcc tatttaggta taagcgacta atgtttggaa taaatgcggc gccggaaatc 2580 tttcaacgac gaatggaaga actcttagca ccgtgcaaca acgtaatgaa ttatatcgac 2640 gacgttataa tattcgggga tagcaaagaa gagcacgata aaacactgga acagattaga 2700 agcattttta aagataataa tgtaactcta aataatgaaa aatgcagttg ggaaacaaac 2760 agaattaaat ttttaggaca catcttaaca gacaaaggaa ttgaagtaga tcctgacaaa 2820 acagagtcta ttagattatt tcgggcaccc aaaaataagg aggagactcg tagtttccta 2880 ggactagtaa ccaatgtggg caagttcatc gcagacctag cagacctaac ggaaccctta 2940 cgtgagctag taaagaaaga tcggaagttt acttggggac caactgaaga aaatgggttt 3000 cagaaactaa agtcagtatt gactaagatc ccgaacctct catattttaa tcccaaaaat 3060 aaaacacgat taatcgctga tgccagccct gtggctctgg gtgcgatact attacaattt 3120 aacgatgagg gggaaccaaa cataatcgca tttgctagta gaagcttaac agatgtagaa 3180 aagcgttact cccagacaga aaaagagagt ttagctttgg tatgggcagt agagaaatac 3240 tatttctatt tgataggact agagtttgaa ttagtgacag atcataagca actagaaacc 3300 atttttaagc caacatcaaa gcctccagcg cgtatcgaga gatggcttct gcgtcttcaa 3360 gcatataaat ttaaagttat ttacaaatcg ggaaaggaaa acattgctga tagtctgtcc 3420 cggctgtgtg agcaaacacc atccagttgc tacgatatca agggggagca cagcatattg 3480 cgaattgttg aaagctctgc accaatacca atatctatat ctgaaattgc ggaaaacagc 3540 acaatagatg aggaaatagt agacgcgatg acgtgccttc aacagggctc ttgggattcg 3600 gcaatgtcga aaaaattata tccatttcga catgaactgt cggctattgg agcaattatg 3660 cttagaggag ttcgaatcgt tatcccaaca acattgagac aaaaagtgct ggtgctggca 3720 catgagggac atccagggga atcggccatg aagcggagat tgaggtccaa agtctggtgg 3780 ccccaaattg acagagacac cgaaaaattc gtgaagaatt gttttgattg tattctgggg 3840 tcccaagcaa caaatccacc accaatgaag agaacagaat ttccgaatgg gccatggatt 3900 tcggtcgcta cagatctctt agggccgtta cccaacaatg aatatatctt agttctaata 3960 gactattact ccagatacat ggagtttaag gttctacgct ccatcacatc tgaatcatta 4020 attggggcgt gtaaggaaat ttttagtagg ctgggttacc cgaaacaatt gcgaactgat 4080 aatggccgac aatatataag ccttgagttc aaaaactact gtaaatcgtg cggcatcgag 4140 caactcaaga atgtagtttt tccccacaaa ttaacaccaa tgtttgacac tactgaatac 4200 gaagttttaa aaagagaagg gaatatagta caagtgagtg ggggagggaa aacgctctta 4260 aggaatgcaa atcaccttaa aagggtcctg ggtgaaacgg aaccggccga tgtcaccaac 4320 gcgaaatcgg ttccaatacc ggccacaaaa ccaactccac agccagccgc acaattcgag 4380 gaccaagcac cagcagagga tcacggaaac tccactgagg gactaaagct caagctcgtg 4440 aggaaaggag gga 4453 // ID Dparag19 repbase; DNA; INV; 447 BP. XX AC GU229942; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mellifera subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dparag19. XX OS Drosophila paraguayensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup II. XX RN [1] RP 1-447 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229942; Positions 1 447. XX CC Clone Dparag19. XX SQ Sequence 447 BP; 112 A; 110 C; 122 G; 103 T; 0 other; gcaaaaatac atttttgccc gtatggatgc atgcgaatcg ctactgaatc gcaactaaat 60 cgacccgttt tgaagcggat ggtgactggc gatgaaaagt gggtcactta cgacaacgtg 120 aagcgcaaac ggtcgtggtc gaaaagcggt gaagctgccc agacggttgc caagcctgga 180 ttgacggcca ggatggttct tctgtgtgtt tggtgggatt ggcagggaat catccactat 240 gagctgctcc cctatagcca aacgttcaat tcggacctgt actgccgaca actgtaccgc 300 ttgaatgcag cactctatgc agaagaggcc atctttgatc aacagaggtc gaattgtctt 360 ccatcaggac aacgccaggc cacacacatc tttggtgacg cgccagaagc tccgggagca 420 tcggatggga ggttcttttg catccac 447 // ID SMAR13 repbase; DNA; INV; 3097 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR13. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3097 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1071-1071 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1150..2862 FT /product="SMAR13_1p" FT /translation="MDPKVKDKTGEKKKPKKMISMEAKHEIIAKHERGVRI FT IDLANEYGRNPSTISTIIKQKEAIKKLQPSKGVTIISKLRTNIHDEMEQLL FT LLWIEEKQLAGDSVSEAIICEKAGAIFQDLKRDATETEGESSQGGEGFKAS FT RGWFDNFKKRSGIHSVIRHGEASSADIKAAENFIKVFEKLISEEGYLPQQV FT FNCDETGLFWKKMPRRTFITAEEKSLPGHKAMKDRLTLALCANAFGDFKMK FT PLLVYHSENPRAFKAYKLMKEKLKVLWRANSKAWVTRQFFNEWMNIVFGPS FT VKKYLIDNGLPLKCVLLLDNAPAHPPGLQDDLLDEFKFIKIVYLPPNTTSI FT LQPMDQQVITNFKKLFTKHLFKRCFEVTENTNLTLREFWKNHYNIVICLKL FT IDIAWQGVTKRTLNSAWRKLWPDVVLEREGFDGFEPIEEEIVSIGRSMGLE FT VDEADVADLIEEHAEELTTEELKELQKVSHSEVMMELSSEEVVEPEEELTS FT REISDILGKWQEVSDFVEKRHPEKLSTGRASALFDDRCLTFFRNILKRRQK FT QTSLDRYLVKVSRTESQKSEAKKA" XX SQ Sequence 3097 BP; 891 A; 637 C; 698 G; 862 T; 9 other; tacagtggta ccttgagata cgaatgtcct gacatacgaa tgttttgaga tacaacatga 60 aatattctaa aatttatacc tcgacataca acgaaaaatt tgagatacga atttgcgatg 120 tgcgataatc gcataggtga tcccttgatg gcagagcgtt cgctcagctt tcctcctgct 180 gcttctaccc gctatctcgc tctgtttggt tgttgacacc agagtcctct gtatggccgc 240 catgacaact aataaaggtt tactactact ttgtggtgag gttcgatgac ctgcacccta 300 accaacccgg aaagaccctc ttgaacgtct cgttccttag ctcggactca gccagctgtt 360 tacgtctcca cccccgctcc cgtctccgaa ggttccacag gcactgggat atggagaagt 420 tcattgaccc acgtcgagaa tcaaaacccc tattgtatcg gtccagacat cgctgccaag 480 gttcaaagac ccgcacccta acgaagctga aagcccatat tgtccgatct cagaggatgc 540 agacccccca gaaaaattaa tttcacgaga gtaagcaatt aagcacgtac aaatcagtat 600 ttaagacact ggctgaggcg agcaggacac cttcccttcc taaccttctg acggcccgta 660 tcggctctgc gagcaggaca cggcactctt ctcggctttt gcggccggac ttggcgatct 720 ttattgacgg cccgtatcga ctctgagggt aggatacggc accctatttc tcggtctggg 780 tcggctctcg cggccggact cggcatctct ctgttcaacc cccttcacgg ccagcgtcga 840 ctcttgcagc cggacgcggc actttcaagc agtctaccac aggtatggtg ttgtaaattg 900 cactgtttcc ttgtttattc ggccaataaa tttttcaagg ttaactgtgt tattaatgtc 960 acgacgacat taaattggtg tcaggtgtgt cgcgacgaca ttawattggt ktcwrgtrtg 1020 ttgsgasaac aackttrtag ttcacgcatc gttcacacgt cgctgtgtta ttcgctttgt 1080 ttgtgcctgt ttttcgtgtc attttggata gtatatatac tgtaccctgc atattttatt 1140 attttaacca tggatcccaa agttaaagac aaaactggcg aaaagaaaaa gcctaagaaa 1200 atgatctcaa tggaagcgaa acatgaaatt attgcaaagc atgagcgtgg cgttcgtatc 1260 atcgacttgg caaatgagta tggccgcaat ccttctacaa tatccacgat catcaagcag 1320 aaggaagcga taaagaagct gcaaccttcc aaaggcgtaa ccattatttc aaagctacga 1380 actaatatac atgatgagat ggaacagcta cttttattat ggattgagga gaagcaattg 1440 gcaggtgact ctgtttctga agcgattatt tgtgagaagg ctggcgccat ctttcaagac 1500 ctcaagcgtg atgcaaccga gacggaggga gaatcatcgc aaggcggtga ggggttcaaa 1560 gctagtcgtg gctggttcga caattttaaa aaacgaagtg gcattcattc agttattcgt 1620 catggagaag catctagtgc agatattaag gcggctgaaa atttcattaa ggtgtttgaa 1680 aaacttatta gtgaagaagg atacctaccg caacaagtgt ttaattgtga cgaaacaggg 1740 ttattttgga aaaaaatgcc taggcgtacg tttatcacgg ctgaagaaaa gagtctaccc 1800 ggccacaagg ctatgaaaga taggctaacg cttgcccttt gcgctaacgc ttttggggat 1860 ttcaaaatga aaccgttgct agtctaccac tcggaaaacc caagggcctt caaagcttat 1920 aaattaatga aagaaaaatt aaaagttctg tggcgtgcaa acagtaaagc atgggtgacc 1980 cgtcaatttt ttaatgaatg gatgaacata gtttttgggc cttcagtgaa aaaatactta 2040 atcgataatg gtctaccgct taagtgtgtc cttctgttgg ataatgctcc tgcacaccct 2100 cctggccttc aggatgacct tttagatgaa tttaaattca ttaaaattgt ctatctgcca 2160 cccaatacaa catctatact tcagcccatg gatcaacaag ttatcaccaa tttcaagaaa 2220 cttttcacaa agcatctttt caagcgttgc ttcgaagtta ccgaaaatac gaacctaaca 2280 ttgagggagt tctggaagaa tcattacaac attgtgatct gtcttaagct tatagatatt 2340 gcctggcaag gtgttaccaa aagaacattg aactcagctt ggagaaaatt atggccagat 2400 gttgttttag aacgagaggg atttgatgga ttcgaaccta ttgaggagga gatagtatcc 2460 attggtcgct ccatgggcct agaggtagac gaggcagatg ttgctgatct catcgaggag 2520 catgctgaag aactcacgac ggaggagttg aaagagttac agaaggtttc ccactcggag 2580 gtaatgatgg aactcagcag tgaggaggtg gtcgagccgg aggaggaact tacttcacgt 2640 gaaattagtg atatccttgg caagtggcag gaagtgtctg attttgtgga aaaaagacac 2700 ccagagaaat tgtctacagg gagagcgagt gccttatttg atgataggtg tctcacattc 2760 tttcgtaaca ttttaaaaag gaggcagaaa cagacctctc ttgatcgtta ccttgtgaag 2820 gtctcacgca ctgaaagtca gaaaagtgag gccaaaaagg catgacgtta aagtgatgat 2880 taataaatat tatgtgtgtt attactcgtt tatttatcta caaattgtgt atattatatg 2940 taatttaact gtgttttggg ttagtttcat ttcgttagaa cgcattaatc tatattacat 3000 tattttaaat gggaaaaatt gctttgagat acgatttttt tgacatacga cgatgatcgc 3060 ggaacgaatt aaattcgtat ctcaaggtac cactgta 3097 // ID DNA8-52B_AP repbase; DNA; INV; 1102 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-52B_AP. XX NM DNA8-52B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1102 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1982-1982 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1102 BP; 428 A; 154 C; 146 G; 372 T; 2 other; tagggctcgg attttaaagc ataataaata acaaaaaaag cacaaaattg ttttaaaaaa 60 gcaaaaatga agcatattta ttttttaaaa aagcaaaaaa aaagcatgac taaaaattaa 120 catgaattag ttataatttt ataaaactga ataaaaaaaa ttaatttaaa ttaaaaaaag 180 aatttactac catgtatttg gcaagatttt cagaagtgaa actgactcta ttgtctgaca 240 ggatgttttt atacatagaa aacgaacgct ctacatccac cgaagtgatc ggtgcatact 300 tcatgcaaga ggtttcgttt aaatcaaacc catgataatt aatttcgtct gatgaaacgc 360 cacacagtat agtattaatg tccagtaatt gtttccagcc tttgtttttc gaaagaacac 420 gatttagctt accxgaaata gcagttgcaa tatctccatt agcttcattt aacttttttt 480 ctgcatctaa gacaatattt atacatgttg tcaacgattc gcctaaaatt atacatataa 540 tttagtatta tgatgttaaa ttgtttaaat actttgttaa aagttacctt ttccctctaa 600 tgttgtaata gtaataatat ctggtagaaa tccaaagttg gttgaaatat acacaatatt 660 tgattgaaga gatacatttt gaaatagttg tttcactact ttaacacatt gagattcttc 720 ttcgaagtct tcaattacac tttttaattc attaaaatat ttgcaataat actggacagc 780 tttagatctg ttgatcgata atctatacat ttaaggacgt ttcgcactaa tggtacggca 840 cggtcattat tgcgggtttt cacatttctt attaattaaa taaacatttt tgttttattt 900 ttatagcata acagacatat aaaaattcca cagctagaat aactgaaaaa agcaataaaa 960 aagcaaaaaa agcatacaaa aaaaagcaaa aataaattcg tttatttgga atcagaatga 1020 cgtgaaacga atttatgtat aaaaattaga tttctatctt axatatataa aaagaagcat 1080 ttgctttaaa atccgagccc ta 1102 // ID Jockey-7_CQ repbase; DNA; INV; 4304 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4304 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 118-118 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 1..1377 FT /product="Jockey-7_CQ_1p" FT /translation="FVILGKPKVTLNQGTQDASTRTTVGSIQKNSTRGGVN FT LTSQKLSLYDEGLLKDIPLNPISTRLPAVPGGGGGVPIDDTMNTSNNRIPP FT FIPLRNDFDILDDDDNGDGQNGVEQAKKAPKVRLPPIHVMDLSVGDVFKLL FT NDNGLPRSEAFLLKHTRTSVQFLTKSKEVFDKAVSVLKSKNVKFFTHDSSG FT KAPSKFVLSGLPLAEVEDVKEELARVNILPRDIKVLSSSKSAVDQHALYLL FT YFDRGSTKLQDLRKTKALFNVVVSWRYYSRRPNEVAQCHRCQRFGHGSTHC FT YLSPKCVKCGGQHLTDVCSLPRKMELNDQNNSKSKLKCANCGGSHTANFQG FT CPSRKKFLDELEKRKKKPVRQPAPNHTSREAFPSLGQQRSGPRTSNFPAGH FT RTYAQVSATSLPPAPDVDAEGEGLFSISEFLALARDMFARLSGCRSKLQQF FT NALAELMVKYIYHG" FT CDS 1373..4039 FT /product="Jockey-7_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MANLNQLKIVNWNGRSVLPKKLEFFDFLSRHHIDIAT FT VSETWLKPKQSFFHPDYRCVRADRENQDDERGGGVLIAIRKDIHFKILDIN FT TSAIETVGIEILNAAQPVHIIAVYFPGVHRGSTWTTFKRDINTLVRRTVPF FT FVAGDLNARHRQWNCLRANKAGNILASRASSSDFFIHAPLTHSYHPKGGRR FT PSTLDLVLSNNFVDMSSLTVVNDLSSDHLPVRFDVNISVPFNQTXPATRCY FT SRANWQVFQRVLNEKIDLTSPLVTNLESPEAIDRCIAMFTSFIIEAESTAV FT PTAPVRSYKEAEFAESTRRLVQLRNTRRRQWFRTRDPLLADIVEALNVRIR FT NECTIARNQHFSNSIRHLENGAKDMWRISKALRNSVKYTPPLKNGDSLVAA FT PSEKANLLADSFANAHRNTLPSEPSVIAQVEESVAHITATDSTTAAAPVVR FT PKELQRIIQSLKPKKAPGLDKVGNIVLKRLPRKGIVVLTKIVNACLALSYF FT PSSWKHAIVRGIPKAGKDVTLPSNYRPISLLPTMSKILERVILSRIESHLE FT SRTVIPRQQFGFKRGHSTNHQLVRLTQSIQQSFARGQSVGMILLDVEKAYD FT SVWQDAILHKMKLAGFPQYILKLLHSFLKNRSFQVIVDGELSRPQLIPFGV FT PQGAVLSPILYNIFTSDVVMVNGVEYYMFADDTGFASADKHPEIIIEKLQS FT AQNKLEDYQRRWKIKINPAKTQAIFFTRRRSERFLPQSQVVSMGHAVPWSE FT QVKYLGLEFDPKLKFDKHVAASLEKCSKLTRMLYPLVSRRSRLSSRNKILL FT YKLIFRPTLSYGFPAWHGCAQSRRKKLQVRQNKLLKMMLDLPFNFPTAELE FT DASGVEELDSWTSRLLRNFWTRCSMSENDLIVNLVP" XX SQ Sequence 4304 BP; 1131 A; 1191 C; 1028 G; 952 T; 2 other; ttcgtcatcc tggggaaacc taaagttacg ctaaaccagg gcacacagga tgcgtccacc 60 cggaccacgg taggatccat ccagaagaat tccacccgag gaggagtgaa cttaacctca 120 caaaaactga gcctgtacga cgaaggtctg ctcaaagaca taccgctgaa cccgatctca 180 acaagactgc ccgccgtacc gggcggtggc ggaggagttc cgatcgacga cacgatgaac 240 accagcaaca acagaattcc accattcatt cccctgcgga atgatttcga tatactcgac 300 gacgacgaca acggtgacgg gcaaaacgga gtggaacaag caaagaaagc tcccaaggtc 360 agactccccc caatccacgt gatggacctc tcggtgggcg atgtcttcaa gctgctgaac 420 gataacggtt tgccaaggag cgaggcattt ctactcaagc acacccggac gtctgtgcag 480 tttttgacta agtcgaaaga ggtgttcgac aaagcagtgt cggtactgaa gagtaagaac 540 gtgaagtttt tcacccacga ctcttctgga aaagctccgt ccaaatttgt gctctctggg 600 ctaccactcg cagaggttga ggatgtcaag gaagaactgg ccagagtcaa catactccca 660 cgagacataa aagtgctgtc atcgtctaag tctgctgttg accaacatgc actctacttg 720 ctgtattttg atcgcgggtc gacgaagctg caggatctgc gaaaaacgaa ggcgttgttc 780 aacgtggttg tttcgtggag gtactactcc cggaggccga atgaggtggc ccagtgccac 840 cgatgtcagc gcttcggcca tgggtcwacc cactgctacc tctcgcctaa atgtgtcaaa 900 tgcggcggcc agcatctcac cgacgtgtgt agcctgccaa ggaagatgga gttgaacgac 960 cagaacaact cgaagtccaa gctgaagtgt gccaactgtg gaggcagtca tactgccaac 1020 ttccaaggct gcccttccag gaaaaagttt ctggatgagt tggagaagag gaagaagaaa 1080 ccggttcgtc aacctgcacc aaatcatacg tctcgggaag cttttccgag cctgggtcaa 1140 caaaggtccg gtccaagaac cagcaatttc ccagcaggcc atcggaccta tgcccaagtg 1200 tctgccacaa gcttgcctcc tgcaccagat gtcgacgctg aaggtgaagg tcttttctcc 1260 atctccgagt tcctggctct cgcccgggac atgttcgccc gcttgagtgg atgtcgctca 1320 aagctgcagc agttcaacgc tcttgcagag ctgatggtca agtatattta ccatggctaa 1380 tctcaaccaa cttaaaattg ttaactggaa cggcaggtcg gtcctgccga agaaactcga 1440 gttctttgac tttctctccc gacaccacat cgacatagca accgtgtcgg agacgtggct 1500 taaaccaaaa caatctttct tccatcctga ctatcgatgc gtccgagctg accgagaaaa 1560 ccaggacgac gagcgtggcg gtggtgtttt gattgccatc aggaaagaca tccacttcaa 1620 gattcttgac atcaacactt cagctatcga gacagtcggg atcgagatac taaacgcagc 1680 ccaaccggtc cacatcatcg cggtctactt tccgggagtt caccgtggct caacgtggac 1740 gactttcaag cgggacatca acacactggt gaggagaacg gttccattct ttgtagcggg 1800 ggacctgaac gctcgccatc gccagtggaa ttgtctcaga gcaaacaaag ccggaaacat 1860 cctcgcctcc cgggcgagct catccgactt cttcatccac gcccctctga cccactccta 1920 ccatcccaaa ggaggtcgtc ggccttccac gctggatctt gtcctgtcca acaacttcgt 1980 ggacatgtct tcgctgaccg ttgtcaacga tttgtcgtcg gatcacctac cggtgcgctt 2040 tgatgtaaac atcagtgtac ctttcaacca aacgcscccc gctacgcgtt gctattctcg 2100 cgcaaactgg caggtgttcc aaagagtgct gaatgagaaa atcgacttga cttccccgct 2160 ggtcacgaat ctggagagcc ctgaagcgat agacaggtgc attgcgatgt tcacctcgtt 2220 tattatcgaa gctgaatcaa ctgctgttcc gaccgctcct gtgcgatcgt acaaggaggc 2280 cgagtttgcc gagtccactc gacgattggt gcagctgagg aatacccgac gacggcaatg 2340 gttccgcact cgtgatccgc ttctcgcgga catcgtcgag gcacttaacg tgcggatccg 2400 caacgaatgt actatagcca ggaaccagca cttcagcaac agcatccggc acctggaaaa 2460 cggcgcaaag gacatgtgga gaatcagcaa ggctcttcgc aacagtgtga aatacactcc 2520 cccgttgaag aatggcgact cccttgttgc tgctccgtcg gaaaaggcaa acttgctggc 2580 ggatagcttc gccaacgctc accgtaacac cttacccagc gaaccgtcag tcatagccca 2640 agttgaagaa tcagtagctc acatcaccgc tacggacagc acgacggccg ctgcgcctgt 2700 cgtccgcccg aaagaactac aacggataat ccagtcgctc aaaccaaaga aagcgcctgg 2760 tctggacaag gtggggaaca tcgtattaaa gcgcctgcca cggaaaggga ttgtggtgct 2820 tacaaaaatc gtcaacgctt gtcttgcgct ctcctatttc ccgagtagct ggaagcatgc 2880 gattgtcaga ggaattccaa aggcgggcaa ggacgtaacg ctcccttcga actatcgccc 2940 aatcagcctt ctgccaacga tgagtaaaat cctcgagcgt gttattctgt cccgtatcga 3000 gagccacctt gaaagtcgta ctgtcattcc ccgacaacag ttcggcttca aacgagggca 3060 ctcgacaaat caccaactcg tgcgtcttac gcagagcatc cagcagtcct tcgctcgtgg 3120 ccaatcagtg ggcatgatac tcttggacgt tgagaaggcg tacgattccg tctggcagga 3180 cgccattctg cacaaaatga agctagctgg atttccgcaa tatatcctca aacttttgca 3240 ctccttcttg aaaaaccgaa gcttccaagt gatcgtcgac ggtgaactat cacgtcccca 3300 gctgattccg ttcggggtcc cccaaggtgc tgtactcagc ccgatcctgt acaacatctt 3360 cacgtcggac gtggtgatgg tgaacggcgt cgagtactac atgttcgcgg atgacacggg 3420 attcgcctcc gccgacaagc atccagaaat catcatcgag aagctgcaat cggcccagaa 3480 caagctagaa gactaccagc ggcgctggaa gatcaagatc aatcctgcaa aaacgcaagc 3540 catctttttt acccggaggc gaagtgaacg ctttcttcca cagtcgcagg tcgtgtcgat 3600 gggccacgcc gtcccctggt cggaacaagt caaatatctt ggactagagt ttgaccccaa 3660 acttaagttt gacaagcacg tcgcagcatc tctggagaaa tgtagcaaac tcaccagaat 3720 gctgtaccct ctcgtcagcc gacgatcgcg actcagcagc agaaacaaga tcctgctgta 3780 caaattgatc ttccgcccaa ccctttcgta cggcttccca gcatggcacg gctgtgctca 3840 atctcggcga aagaaactcc aagttcgaca aaacaagctg ctcaaaatga tgctggatct 3900 cccattcaac ttcccaactg ccgagctgga ggacgcttcg ggagtcgaag agctggactc 3960 gtggacttcc cgcctacttc gcaatttctg gacaagatgc tcaatgtcag aaaatgactt 4020 gatcgtcaac cttgtgccgt aatctctgtg atctagtttg taagaaaccc caattttcct 4080 ctaactatcc cccttctatc agcaagagaa acaggttttt tgtttgatgt gttttcctgt 4140 tgaaagctta ctttgtaaca taaaattaat ggcatctaaa gatacatgac tattagacat 4200 agctgcaagg atcctccaaa actctattta gatcttaaac taacttgatg ttcaaagcta 4260 aacctgaaaa ataaactgaa ttgaattgaa ttgaaaaata aaaa 4304 // ID Gypsy-96_CQ-LTR repbase; DNA; INV; 254 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-96_CQ_; KW Gypsy-96_CQ-I; Gypsy-96_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-254 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 572-572 (2011). XX DR [2] (Consensus) XX SQ Sequence 254 BP; 65 A; 62 C; 58 G; 69 T; 0 other; tggatggtcg caccttttcc acacgccact ggactcgcgg ttgacagcca agcctgtcaa 60 gcctcgactg tcgtgacagt gcgttgagca agattgacag ttgtcaccaa gaggaaataa 120 atcattacag tttagataga gaacaaacaa cacgcgtttt tattacttgg acaagtctcg 180 tccttataat tgttaattta attatcttgt gcccgcgtac cacctcggcc accgtgggta 240 cgtttggggc taca 254 // ID Gypsy-28_DPu-I repbase; DNA; INV; 6144 BP. XX AC scaffold_218; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_DP_; KW Gypsy-28_DPu-LTR; Gypsy-28_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_218; Positions 92201 98344. XX CC Positions [4934-5398] - Integrase core CC 'TGTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1409..5911 FT /product="Gypsy-28_DPu-I_1p" FT /translation="MSHEYNLRTRQPRHRSSSRPPRVRSGRTMATAQDAID FT AVAAVGNRIDLMEAGHTNITTQLATITQQLANLIGAPPPPPGPGPGIPAPQ FT PSTRRRLDPSTMEKLHGDASTSLLRSWRNRWDDYAALNQMTSYPAAEQMAA FT LRMCLDPSMQQVVKIVFGILPTTVTTPDAVLDLIGNYVRGKRNVALDRVAF FT EERRQGPAESFDDFYISLRRLADAADLCGACIDSRMATRIMAGIRDSETKK FT KLLALSPFPTTQAAVNLCRSEESARANEKILSTQTGVSSIQAKQSGRHSSS FT DAYSCGACGRSQHTAGQICPAIGKMCHICGKPNHFAPKCPNKSSKPRPPGG FT SGGSRYEHGGGGNKTDGGIAPKTKMARICIGNVKAKHRDRRTPVIAVEILD FT GNGLSARSFANVTPDPGAEVSVGGLDFLSAIGLSESDLSSSSFDLVMADKS FT APLLSIGQRDVRIRYGEQAATITVVICPEIHGVLLSWLDCIALRILHSDYP FT LPLPRLPASVQTVTTPASSSNDVTSVFLENLNIPSSPSAEQKAEIKAAVAN FT HFSDVFDQSTGLRCMSGPEMVIQLQEDAIPYYVNGARPIPFADRPEVKQLL FT DDYVEQGLIVPVEEATDWAAPLVVLRRSNGKLRIVVDHTRLNRFVRRPTHP FT TRTPRDAVAEIDSGANFFSCFDAANGYYQIPLHPSSQILTTFMTPWGRFKF FT LRASMGLSCSGDEFNRRADMAFADQVNTVRVVDDLLRFDRDFPAHVKGVCA FT LLQAARTANITLNLEKFQFAEPKVAWVGYEIQHGGVTVDPNKLQAISRFPR FT PNNITELRSFMGLVEQLAGFSADVAAAKGPLRPLLSSQNSYIWTADHEMAF FT EAVKSALLAPPILASFDPSRETSLQVDASRKNGMGYALLQKHDAQWRLIDA FT NSRWCTDTESRYAIVELELAAAEWAIRKCKLYLKGLPAFTLVVDHQALVPI FT LNTYTLDAVENPKIQRLKERLAPYIFTTVWRKGKEHAIPDALSRAPINDPG FT PDDEAANSDVTAFAHRTIISRITAISCDDNLDPDESAVPPHLPDPLVDEIR FT AVAASDGEYSALIAAIESGFPERRDRAPAAVGNYWGIRHQLSVEQGIVLFG FT NRIVIPQAARKNVLRKLHAAHQGIVRTNRRARQTVYWPGITNEITTLLSTC FT STCQERLPSNQQEPMMSDPLPTHVFEDVSADLFQSGQLHVLVYADRLSGWP FT VVHRRRRDPTAREVLHAVINNFAELGVPMRFRSDNGPQFDAGVFRSAKDRW FT GVAVSNSTPNYSQSNGHAEAAVKAVKELVEKISPSGDLDTEEFKQGLLEFR FT NTPRENGLSPAQMVFGHQLRSIVPAHRSAYATCWKSVMEARDRQAVANADA FT KMRYDLRSRGLEPLPIGTNVRVQDPKSKLWSHVGVVVAIGRYRAYRVKFAS FT GSVLWRNRRFLRRLVNTGGSEGDDDGAQDQACDNGGDGRDGAPHTTEESGD FT DETAVHVDDKSASPRVNPPAQRRSGRRRTPKVIVSM" XX SQ Sequence 6144 BP; 1370 A; 1816 C; 1598 G; 1360 T; 0 other; tggcgcagtt ggtcaaactt gccttttaga acccttgact cgaaaaattt cgtgcgagtt 60 cccccagttt ggctttaatc gaactttaat tgtttcttgc cgtagctgtt tgggggccgc 120 catcttgttt cctttgttct ggctagcttt ctcgcgtggc gccgccatct tagccagtgt 180 aggacagaga cgttcgtttt tcgggccata gttgcatata tttgattcac atttcaagtg 240 acgttggtcc cgattataat attcatggtc gtagctcttt gagccacctc gtccggaata 300 ttgtgggccg ctcattgatt gttggcagat tttttttgct tacaagatca tagtcacgtc 360 ggatacacaa gtataattgg tgttggtccc gattataata ttcatggtcg tagctctttg 420 agccaactcg tccggaatat tataggccgc ccattattcg ttgaaaatat tttccgagct 480 catatttatc tcgccctggc tatacagtgt cgttgatgct ggtcccgatt ataatattca 540 tggtcgtagc tctttgagcc atctcgtccg gaatattgtg ggccgctcac tgattgttga 600 caatattttc cgagctcata tttatctcgc cctggctata cagtgacgtt gaggctggtc 660 ccgattataa tattcatggt cgtagctctt tgagccacct cgtccggaat attgtgggcc 720 gctcattgat tgttggcaga ttgtttctct gaccacgagc tcataattat cgattatccc 780 aaccacggag gaaactgaaa tttaacccca tgccccgcgg tcgcaacagt gagttgagcc 840 gagagctacg acgttccgtg ctggaggaaa gacagctagc tgcaacaatc gaacgcgagt 900 tagcaagaca acgcagacgg gtcgctcatc aggaaccata ccggcgagta gagtttcgcc 960 cggtcgtccg accgtacatc ccactggaac gccaacctca gcagtccgtc gcacacggcg 1020 gttccgtgcc gcctcctcct ctacgggcgg acgagctccc tacacagcgc ccagaaatag 1080 caacgttcgc tcacaacgcc ggcgcagtcg cacatggtct agctacggcc agactttata 1140 tcccggccgt tcaccctgca caagagtcag ccgaggacgt tctacaacta gcaatcgacg 1200 aagagatacc actcaacgaa taacgtacac atctcctggt cgtaagcccg ctggggtcac 1260 tcggtcggag tactacactc agtctatagt gtaatgttgt tactcgtctt gtctacttca 1320 atatacttgt tcttccacca actcatcttt atctccgcgt catcactcat cgtcttcggc 1380 ccgttcccat accgttcgag attcgtgcat gagtcacgag tataatttgc gcactcgcca 1440 accacgtcat cgtagctctt cccggccccc tcgcgttcgt tccggccgca caatggcgac 1500 cgcacaggac gctattgacg cggtggcggc agtgggcaac cgaattgatc ttatggaagc 1560 aggccacacc aatataacga cgcagctggc aactatcacg caacaattgg cgaatctgat 1620 cggcgccccg ccacctcctc cgggccccgg ccctggaatt ccggcgccgc aacccagcac 1680 acgtcgacgc ctggacccct ccacgatgga aaagttgcac ggggacgcat cgacttctct 1740 cctccgttcc tggcgaaatc gatgggacga ctatgccgcg ctcaatcaaa tgacatctta 1800 cccggcagcc gaacagatgg cagcgcttcg gatgtgcctc gacccttcca tgcagcaagt 1860 agtaaaaatt gtatttggga tcctacccac aacggtcact acacccgacg cggtactcga 1920 cctcatcggg aattacgtgc gcggaaaacg gaacgtggct ctagataggg ttgcttttga 1980 ggaacgccgg cagggcccag cggagtcttt cgatgatttc tacatcagcc tcaggcggct 2040 cgcggacgcc gcagatctat gcggcgcgtg tatcgacagc cgcatggcta cccgcataat 2100 ggcgggtatt cgtgattctg aaacgaagaa gaaactcctc gcccttagcc cgttcccgac 2160 gacccaggca gcggtaaatt tatgcagaag cgaggagtct gcccgggcca atgaaaaaat 2220 cctcagcacc cagactggcg tctcttccat tcaagctaaa cagagcggcc ggcactcgtc 2280 gtcggatgca tattcctgtg gtgcatgtgg tcgatcgcag catactgctg gccaaatctg 2340 cccggccatc gggaaaatgt gccatatttg cggcaagcca aaccacttcg ctcccaaatg 2400 ccccaacaaa tccagcaaac cgagaccacc tggcggcagc ggcggctctc gttacgagca 2460 tggcggtggc ggcaacaaaa cggatggtgg cattgccccc aaaaccaaga tggcacgcat 2520 atgcatcggt aacgtgaagg ctaaacaccg tgatcgccgc acccccgtta tagccgtaga 2580 aatattagac ggaaacgggc tttcagcgcg gtcgtttgcc aacgttaccc cggatcccgg 2640 agcggaagtc agcgtgggtg ggctggattt cctatccgct attgggctgt ccgaatcgga 2700 cctatcatct tcttccttcg acctcgtgat ggcggacaag tcggcccccc tcctttcgat 2760 cggccagagg gacgttcgga tccgatacgg cgaacaagca gccaccatca ccgtagtcat 2820 ctgtccggaa attcatggcg tcctgctgag ttggctggac tgtatcgcgt tgcgaatttt 2880 acacagcgac tacccgctcc cactcccccg cttaccagca tctgtgcaga cggtgacgac 2940 acctgcttca agctcaaatg acgtcacctc cgttttcctc gaaaacttga acatcccgtc 3000 gtccccatct gcggaacaaa aagcggagat taaagcagcc gtcgctaatc atttttccga 3060 tgtttttgat caatctacgg gcctccgatg tatgagcggc ccggaaatgg tcatccagct 3120 acaagaagac gccatcccat actacgttaa tggagcccga cccattcctt tcgctgatag 3180 gccggaggtg aaacaactgc tggacgatta cgtggagcag gggctgattg tcccggtaga 3240 ggaagccact gactgggcag cgccgctagt ggtccttcgc cgttccaacg gaaagctgag 3300 aatcgtcgtg gatcataccc gcctcaatag gtttgtccgc cgtcccactc atcccacgcg 3360 aacgcctcgg gatgcagtag cggaaatcga cagcggagca aattttttct cgtgcttcga 3420 cgccgcgaac gggtactacc agatcccgct acacccatcc agtcaaatcc taacgacctt 3480 catgacgccg tggggcagat tcaagttttt acgggcgtca atggggctta gttgttcggg 3540 cgacgaattt aaccgccgtg cggatatggc attcgccgat caagtcaaca ctgttcgtgt 3600 cgtggacgac ttgctccgtt tcgaccgaga cttcccagcg cacgtgaaag gtgtgtgcgc 3660 cctcctccaa gcggcccgga cggccaacat caccctcaac ctcgagaagt tccaattcgc 3720 ggaacccaag gtagcgtggg tgggatacga aatccaacat ggcggcgtga ccgtcgatcc 3780 taacaagctg caagctattt cccgcttccc gcgccctaat aacatcacgg agcttcgatc 3840 cttcatggga ctcgtggaac aacttgctgg tttctcggcg gatgtcgccg ctgcaaaagg 3900 gccgctccgc ccgttactca gctcccagaa ctcttatatt tggaccgccg accatgaaat 3960 ggcgttcgag gcggtaaagt cggcgttact ggcgccccca attctggcca gctttgatcc 4020 gtcacgggag acatctttac aggtggacgc gtcccggaaa aacggaatgg ggtacgccct 4080 cctacaaaaa cacgatgcgc agtggcggct catcgacgcc aactcccgct ggtgcacaga 4140 cacggaatcc cgatatgcca tcgtggaact tgaactggca gcggccgaat gggccatccg 4200 caaatgcaag ctttatctga agggactccc tgctttcacg ctagtggtgg accaccaagc 4260 actcgtgccc atccttaaca cttacaccct tgacgccgtg gaaaatccaa aaatccagcg 4320 cctaaaagag cgtctggccc cgtatatttt caccaccgta tggcgcaagg ggaaagagca 4380 tgccatcccg gacgctctat cgcgggcacc gataaacgat cctggcccgg acgatgaagc 4440 ggccaactcg gacgtcacag catttgctca ccgcaccatc atctcccgga tcaccgccat 4500 ctcctgcgac gacaacttgg atccggatga atccgccgtc cctcctcatt tacccgaccc 4560 gctagtcgat gagatccgtg cggtggccgc gagcgacgga gagtactccg cactaatcgc 4620 agcaatcgaa tccggtttcc ccgagcgtcg agaccgtgca cccgccgcag tgggaaacta 4680 ttggggcatc cgccaccaat tatcagtgga acaaggtatt gtgctgttcg gaaaccgcat 4740 tgtaatacct caggcggccc gaaagaatgt cctccggaag cttcacgcgg ctcaccaggg 4800 aatcgtgcgg acgaacagaa gggcccggca gaccgtgtac tggccgggca ttacaaatga 4860 gatcacgaca ctcctttcaa cctgctctac gtgccaggaa cgcctgccaa gcaaccaaca 4920 ggagcccatg atgtctgatc ccctcccgac acacgtgttt gaggatgttt cggccgacct 4980 gttccagtcg ggccagctgc acgtgcttgt ctacgctgac cgattgtcgg gatggccggt 5040 tgttcacagg aggagacggg acccaaccgc gcgggaagta ttgcacgccg ttatcaacaa 5100 cttcgccgag cttggcgtcc caatgcgctt ccgatccgac aacgggcctc aattcgacgc 5160 gggcgttttc cgatctgcta aggaccgatg gggagtcgcc gtcagcaatt ctacgccgaa 5220 ctatagccaa agtaacggcc acgcagaagc ggccgttaag gccgtcaaag aattagtgga 5280 gaaaatctcc ccatccggag acctggacac tgaggagttt aagcaggggt tactggaatt 5340 ccggaatacc ccccgggaga acggactatc gcccgcccaa atggtgttcg gacaccagct 5400 tcgctcgatc gtcccagctc atcgttcggc ctacgcgaca tgttggaaat ccgtcatgga 5460 ggcgagggat cgccaggcgg ttgcgaacgc ggacgccaaa atgcgctacg atctccgttc 5520 ccgcggcctc gaaccattgc cgatcggcac caacgtgcgc gtgcaggacc ctaaatcgaa 5580 actgtggagc catgtaggcg tggtcgtcgc gattgggcgc tatcgcgcgt acagagtgaa 5640 atttgcaagc ggcagcgtac tatggagaaa ccggcgtttc ctgcgtcgcc tggtgaacac 5700 gggcggcagt gaaggtgacg acgacggagc tcaggatcaa gcgtgcgaca atggtggcga 5760 cgggcgtgac ggtgcgcccc ataccacgga agagtcaggc gacgacgaaa cggccgtgca 5820 cgtcgacgat aagagcgcct ccccccgtgt aaatccaccg gcccaacgcc gaagtggccg 5880 ccgtcggacc ccgaaagtca ttgtgtctat gtaagctatg ctcaccgtat atgggtaaaa 5940 ctgtgtgttc ttgatcttac atgcatcccc cttttggttt cttgttttct ctctccgttt 6000 ctgttccggg aaagcggccg cggaagtctc gtcggccgcc ccgtatatct gtatgctttt 6060 gatttcgcta ctatcatttg tgtgtataag ctgtcacgcc ctacttgggt aaaatcggtt 6120 acggcgtgac agcttgggga gagt 6144 // ID BEL5b_Cis_I repbase; DNA; INV; 3113 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of BEL LTR Retrotransposon from Ciona savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; internal portion; KW BEL5b_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3113 RA Smit A.F.; RT "BEL5b_Cis_I - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000547, Ci000230, Ci000121 Probably non-autonomous, despite 2 CC kb ORF from 660-2660. The ORF contains 6 large deletions with CC respect to the ORF of BEL5_Cis_int and has diverged 15% from it, CC but remained open. XX SQ Sequence 3113 BP; 816 A; 604 C; 755 G; 933 T; 5 other; ttgatttctg ggattatcac tgattcagtg atttcttggc tttttgtctg gattttttga 60 tttttggatt ttctatcact ggtacagtga tttctggtct cttcgtgagg ttatttctac 120 attgtacggt tatagtacaa ttgctttgta gattgtacat ttaaggtaca atcgatttct 180 gtgtttcgag aaagtgattt cttgaccttg agaaattatt tctggatttc aagaaagtta 240 ttgctggctt tcgagaaatt gatttctgtt tttcgtgaaa ttatttctgg attgtacgtt 300 taagagtaca attatctctg gattttcgtg gtttctggta cactgattta gtggtttctg 360 gttgtgtacg ttttaggtac aattgatttc tgaaagcacg tttaaggtac aatttatatc 420 tggtggtgta cgtttcgagt acaattgatt tctcggtgta tgtgagagta cgtttaaggt 480 acaatttata tctggtatat aaaggtacca ttgatttccg gcttctgaga aattatttcg 540 ggtgtttcgt anaattgatt cctggttttt atatcacnaa ttaagtgatt tctttacttc 600 gtgggtgcgt gtcgttgata tctggagctt gattcacaga tttgtctctt catttcaaga 660 tggcaaccgc tttcaacgcg gccctggtag attctttaat ttacgtcgct cctagtgcag 720 cagaactgga atttctgcgc agagttgcac tacgcacact aaccaggctg gaaaataagc 780 tgcagaagtt gatatcggaa gaagcttctc cgaaggaagt gattaaggta aaggatcgtt 840 atgacgatgc ctgcgacgag tgcttacgga tttgcggcaa atacgtctct actgtgttga 900 cggaaggcgc tcgtactgac tcagaagcga gagtgaagat cattattcaa aggaaacaaa 960 actcaaatca agattatagg caatttgtac aaaagtgcgc tgcggcccgg tctgcagatg 1020 ttgtcagtag gtccanggcg tcntctagtc ggcgaagttc cgctcgccta caagcgcaac 1080 tgctggaaat ggaagccata caaatcgaaa aagaggccga gatccaacga aagagtttgg 1140 agcgcgaggc cgagatccaa cgaaagattt tggagcttga cctgcaggtt gcaagagcga 1200 aggcaaaagc aaaggcatac aaagcttcac ttttgacgaa gagtngaagt aaaccttcca 1260 attctttagg cgctttgtcg aatgacgagg agaggcacgc cgaggagaga catcggaaca 1320 ctgttccaaa caaaatacag ttatttacct gggatcaaag gaacagggct ggcgctgctg 1380 ctgacccaat acccgtagtt ccagtggcat acaatctgcc caatatgaac ttttcaccct 1440 ttatttccaa ccaagtgccc tttggaatga cctcttccga ctatggcgtt aatcaaggtg 1500 aacgcccaga agtcctaaat attgatacaa aacatgataa aaggaactta aagatatctg 1560 aatgtgagct gtctatagag gatggacaag caattatgat ggaagaaacg gcaaccaaag 1620 ttggcaaaca tcatcagatt ggactacctt ggaggctacc tcgagttaca cttccgaaca 1680 accgtttgat ggcacttaag cgactcaagt caccaaagag gagcttaaca ggtataaagg 1740 acatgttccg tcaagtattg gtagatccta agaaccgtga tgccctgcgc ttgctcaggt 1800 ggccggatga tgacatggat acggaaccag ttgacctttg gatgaacgtg cacatttttg 1860 aagcaacttc gtcgcctagt tgtttacctt tttctcgaag gaaggtggtt ccgaaaaatg 1920 ataccggggc tctgttgcca agtaaagttc aaaaggctat aatgtggttc aaggggccgg 1980 aatttttacg tcaacccgaa gagtggccag cacgtccagt ggtgtcgcca ctgttgtcag 2040 aagatcacgc aaggttcagg atgaagaagg gtatcgtgtc ttcccttaca gaagcgacac 2100 cggaaaccac cttgcagcta atattcaaat ggcattctac attttccgtc cgaaagcgtt 2160 ctgtggcttg gttacttcgg tatcgcaagt accttctttg gaaatcgtgt cgacctcgga 2220 ttctccaccc ctcagaattc ctgtggacga gatatttatt ctattcggaa ctgaaggagg 2280 ctctactctc aattataaag gtggttcacc tagaaggttt tgctgccgaa agaaagcagc 2340 tgcccaatca tcaatcgttg tttacaacga tcacgctaca tttggtaccg ccgttattgg 2400 cttcgctgcg gaaattacta aaaggagcat ttgacgactc tctttcaccg gacgaggttt 2460 tgaagtccga tggatacaaa cgatcatggc gcgcggttca gctttgtgct gataagtttg 2520 gggagagatg gactaaggat tatttgccta caactccgac agaagtggcc gcgacccacg 2580 agaaactttg ctgtgggaga cgttgtgttg gtcgtggacg agctgaagaa gcgcggcgta 2640 tggccgaagg gcgtaattga acattgtaca tatgataaaa ctggccttgt caggcgcgta 2700 cgtgtacgaa cgatgatgtc cactatggag cgagatgtac gcaagctgtg tctgctggaa 2760 gcacatcctt gaactgttac ccagtgtgct cgtggccgca aggctgtaaa ttgtatgctt 2820 cgtttaacca ctagttctag ccacggattt ggctactgtg ttttattttt cgtatacttg 2880 ccacctaagt tgtaactgag acgatgtttt gttattacta cattcctgtg tatttatacg 2940 tagacatgtt gttgcgccca ttgtttatat acctctacgg tagtttgtaa ttgcctttat 3000 tttcgaactt aagtatgacg tagtgcagaa gtgttttttg ggtgattgaa atgcccgttg 3060 atgtgtgttg taaaattatt tcatttactg ttttcaataa tttggggccg gta 3113 // ID Copia-49_AA-LTR repbase; DNA; INV; 203 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-49_AA_; KW Copia-49_AA-I; Copia-49_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-203 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 970-970 (2011). XX DR [2] (Consensus) XX SQ Sequence 203 BP; 56 A; 48 C; 36 G; 63 T; 0 other; tgaggtggac aatcccgtag tcctacgtca ttcgacctaa gcggcgaaac cccttcgcgc 60 gcatttttgt cgtcagtagc aacaacaaca acattagttg aaaaacgttt ctgtgagaaa 120 cacgcgtttt aataaaagtt aattttcttt cgctttgtaa caacgtttgt tacttttatt 180 tcgacgatcg aaccattccc tca 203 // ID I_Ele4B_AAe repbase; DNA; INV; 5837 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele4B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5837 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1371-1371 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >96% CC identity. The consensus is ~87% identical to I_Ele4. XX FH Key Location/Qualifiers FT CDS 546..1838 FT /product="I_Ele4B_AAe_1p" FT /translation="METDGDGTSGETGESSSTKRFRIKTYPSTFLGPFVVY FT FRKKEKPINVLLISSEIYKLYKSVKEIKKVSLDKLRVIFGSRDDANSLLES FT KLFFDSYRVYAPCDSCEINGVIYDENLNCEDIVNYGSGKFKNKAIQPVKIL FT DCMRLSKLTFTDKQSSYMHSNCIKITFEGSVLPDYVDIDNVIFKVRLYYPK FT IMHCDRCLLFGHTSHFCSNKIKCFKCGGSHSSTDCNKNSDFCIYCRKKHNS FT LKDCSVYISQQQQSNQKIKNRNKLSYSEVIKTSPELSSTNMFAPLSNNVDN FT AVSNDDHENFIYKPPNKRKRIIKPVINNDCFSNPQPSTSFDIHFPPLTSPT FT SSRVIPGFQKNTTQVNTNENKDNSNSTFQKTESSDAENSILNILEELIDFL FT GINDFWKKIIKKILPFLASIFEKLNSIGPLICSLFSS" FT CDS 1841..5533 FT /product="I_Ele4B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASSKINNLNILQWNCRSIIPKIDRFKALVINYDLDI FT FCLNETWLVEAKSFRIPSFNIIRKDRNTSNGGVLIGIRENIEFKYLNFSID FT SEIEYVAVSVKHKGLEFSIICIYIPPQAKFSLLELKSIFNNIPSPFYILGD FT LNAHNLAWGSHKIDGRGSLIMDVIDELNLNILNDGSFTRIVVPPGHHSCID FT LSLCSNSLSMISSWKTIEDANGSDHVPILINVHNPNNIQYNQEPYVPDLTI FT NVDWAKFSDLVSFALLSFDNSLSTIENYKRFSKLLIDCLKKSQTKKMMHSS FT SKKKNPAFWWDNDCTIALKNKSNAFKLFRRFGSRDQYILYCKAEAQFTRVI FT KFKKRNYWRNFIENLDPDTSLSTLWTVARNMRNYNFSTSPVLEYSEDWIDQ FT FASKICPDFTPIDIKLKKKSTYNFFPNLCTEFSIEEMNLALSVTKNTTPGI FT DNIKFIVLKNLPIDGKLHLLSLYNTFLFQNIFPEEWRFIKVVSILKPGKNP FT SLVESRRPISLLSCLRKLMERMILNRLELWAEKNNIFSSSQFGFRKGRGTR FT DCVALLSSYIELSLNKKQDVVSTFLDVSGAYDSVLIDLLYNKMIDCKLPLI FT IANFLCNLFSFKIMHFFHNGESKMIRYSYFGLPQGSCLSPFLYNLFTRDII FT SIIPNGCIFIQFADDKVISISGKSREVIRHFMQCCLDNLGTWAHNNGFSFS FT VQKTKFIIFSRKHSPIDIVLYLNYHQIEQVFEYKYLGIWFDFKLKWNKHIQ FT YIQTICSKRINFLRTITGTWWGAHPSDLITLYKTTIRSVMEYGCFTFGSAI FT QTHFSKLEKIQFRCLRICLKLMNSTHTKSIEVLAGIIPLKYRFHELNCKFL FT LQCFSNNHPIINILKSLFEINPTSRLLSSFIYCSAENVMLDSSPGFYEYNM FT NVHSFRPYVDFTLHEELKHIHFSARSYFAKLLFQRKFIGVEPEQIYFTDGS FT LINNIAGFGVFNFHIAHFFKLEXPCSIYIAELTALHFACCLIRNCAPNIYM FT VCSDSFSSLHALNTINFNFKTSKIILSIKEILNDLFSRGFVIKFVWVPAHS FT NIYGNEQADSLAKLGVSRGIIYDRAIFPSEYFTKLKQNSINNWQIAWNTSD FT KGRYCYSIFPKVKRFPWFHQLSVSRNFICFYTRLMSNHYICNSHLYRINIK FT DSNLCECGESYEDIDHIVFNCTRFDLPRRNIFEKISRLGLNVPTSIRDILA FT LKNLNILKILYGYFNEISYYS" XX SQ Sequence 5837 BP; 1895 A; 816 C; 887 G; 2237 T; 2 other; tcatctttgg taagtaggcc ttgatcggat taccggtgtt tttctgcgct tgtcaatttt 60 tttccgtttt tcaggtggat ttcgtttgaa aggatttgaa ggtggattat caaacgagtg 120 gccatcgaag atttgaagaa gaagaggtac tagaaggtgt actggactaa tttcaactgt 180 tcgagttcaa tttggtgatt gtgaagaaat tgatttggct ttgtcgactt caagacaaaa 240 cgactgttga agagaattga agttcgaagg aggaaattcg accattctgg aatcaacgag 300 gatttgatga agaagacgtg aactaaaaaa tcaagcttca actcatcaag attgatttga 360 gtgctggtgg aaatacttgg ttgtctttga cggcctcaag cctttggtgt ggattgaagt 420 ctctttaagt ataattttat ttattttatt agtattattc ttattcttat tattacttgt 480 ttatattatc aattgcactt cagtattttt tcatatattt tcactcaccc cgtcaatttt 540 ccaatatgga gactgacggg gatggaacat ctggtgaaac tggtgaatca tcttccacca 600 agcgttttcg cattaaaact tatccctcca cttttttagg cccttttgtt gtttattttc 660 gtaaaaagga aaagcctata aatgttcttt tgatatcttc ggaaatttat aaattataca 720 aatctgtcaa ggaaattaaa aaagtttcac ttgacaaatt gagggttata tttggatccc 780 gtgatgatgc aaattcatta ttggaatcta aacttttttt tgattcatat cgagtttacg 840 ctccttgtga ctcgtgtgaa ataaacggag tcatttacga tgaaaatttg aactgtgaag 900 atattgtaaa ttacggttca ggtaaattta aaaataaggc aattcagcca gttaaaattt 960 tagattgcat gcgtttatcg aaattaacat ttaccgataa acaatcttca tacatgcatt 1020 ctaattgcat aaaaattaca tttgagggtt ctgttcttcc cgattatgta gacattgaca 1080 atgttatttt taaagttaga ctttattatc cgaagattat gcattgcgat cgttgtctcc 1140 tttttggtca cacatcgcat ttttgttcta acaaaattaa atgttttaaa tgtggtggta 1200 gtcattcatc tactgattgc aataagaatt cagatttttg tatttattgt cgtaaaaaac 1260 acaattcttt aaaagattgt tctgtttaca tttcgcaaca acaacaatct aaccagaaaa 1320 ttaaaaatcg aaacaaatta tcatattctg aggtaattaa aacttctccg gaattatcct 1380 caacgaatat gtttgctcct ctttcaaata atgttgataa tgctgtatca aatgatgatc 1440 atgaaaattt catttataaa cctcctaata aaaggaaacg aattattaaa ccagttataa 1500 ataatgattg tttttcaaat cctcaaccat caacatcatt tgatattcat tttccccctc 1560 ttacttctcc tacttcttct cgtgttattc ctggttttca aaaaaataca acacaagtaa 1620 acacaaacga aaataaggac aattcaaatt caacttttca aaaaactgaa agttcagatg 1680 cagagaattc tattttgaat attttggaag aattgattga ttttttggga ataaacgatt 1740 tttggaaaaa aataattaag aaaattttac catttttggc ctctattttt gaaaagttga 1800 attcaattgg acccctcatt tgttctttgt tttcttcgta atggcttcat caaaaattaa 1860 taatttaaat attttacagt ggaactgtcg aagtattatt ccaaaaattg acagattcaa 1920 agcattagtc ataaattatg atttggatat attttgttta aatgagacct ggttagtaga 1980 agctaaatca tttcgaattc catctttcaa tattatacgt aaagatcgca acacttcaaa 2040 tggtggtgtg ttaattggaa ttcgtgaaaa tattgaattc aaatatttaa atttttcaat 2100 tgattcagaa attgaatatg ttgcagtttc tgttaaacat aaaggtttag aattttcaat 2160 tatttgcatt tatattcctc cccaagcaaa attctctttg ttagagttga aatctatttt 2220 taataatatt ccttcacctt tttatatact aggagattta aatgctcata atttggcatg 2280 gggaagccat aaaattgatg gtaggggttc attaattatg gacgttattg atgaattaaa 2340 tttaaatatt cttaatgatg gttcttttac aaggattgtt gtgcctcctg gtcatcattc 2400 ttgtattgat ttatctcttt gttcgaatag tttatccatg atttcatctt ggaaaactat 2460 tgaagacgca aacggaagtg atcatgtgcc tatattgatt aacgttcata atcctaataa 2520 tatacaatat aatcaagaac cttatgttcc tgatttaaca ataaacgtgg attgggctaa 2580 attttctgat ttagtttcct ttgctttgct cagttttgac aattcacttt ctaccattga 2640 aaattataaa cgtttttcaa aattattaat tgattgttta aaaaaatctc aaactaaaaa 2700 aatgatgcat tcatcatcta agaaaaaaaa tccagcattt tggtgggata atgattgtac 2760 aattgctttg aaaaataaat caaatgcttt caaattattt agacgatttg gttctagaga 2820 tcaatatatt ttatattgta aagctgaagc tcaatttact agagttataa aatttaaaaa 2880 aaggaattat tggagaaatt ttattgaaaa tcttgatcct gatacatcac tatcaacatt 2940 atggactgtt gctcgaaata tgagaaatta taatttttct acttcaccag ttttggaata 3000 ttcggaagat tggattgacc agtttgcttc taaaatttgt cctgatttta cacctataga 3060 tataaaactt aaaaaaaaat caacatataa tttttttcct aatctttgta ctgaattttc 3120 tattgaagaa atgaatttgg cattatcagt tacgaaaaat actactcctg gtattgataa 3180 tattaaattt atagttttaa aaaatttacc tattgatggg aagttacatt tactttcatt 3240 atataataca tttttgtttc aaaatatttt tcctgaggaa tggcgtttca taaaagttgt 3300 aagtatctta aaacctggga aaaatccttc acttgttgaa agtagaagac caattagttt 3360 attatcatgc cttcgtaaat taatggaaag gatgatttta aaccgtcttg aattatgggc 3420 agaaaaaaat aatatatttt cttcatctca atttgggttc agaaaaggtc gaggcacacg 3480 tgattgtgta gctcttttat catcatatat tgaattatca ttgaacaaga aacaagatgt 3540 agtttctaca tttcttgatg tttctggtgc atacgattct gtactgatag atttacttta 3600 taataaaatg attgattgta aacttccttt aataattgca aattttttgt gtaatttatt 3660 ttcatttaaa ataatgcatt ttttccacaa cggagaatcg aaaatgatcc gttatagtta 3720 ttttggtctt cctcagggtt cttgtttaag cccattttta tataatttat ttactagaga 3780 tattatttct attattccta atgggtgtat cttcattcaa tttgctgatg ataaggttat 3840 ttctatcagt ggtaagagta gagaagttat tcgtcatttt atgcaatgtt gtttggacaa 3900 ccttggtaca tgggcgcata ataatggttt tagtttttca gtacaaaaaa caaaatttat 3960 tatattttct cgtaaacatt ctccaattga tattgtttta tatttaaatt atcatcagat 4020 agagcaggtt tttgaatata aatatcttgg aatatggttt gattttaaac taaaatggaa 4080 caaacatatt caatacattc aaacaatttg ttctaagaga ataaattttc ttcgaacaat 4140 cactggaact tggtggggag ctcatcccag tgatttgatt accctttata aaacaactat 4200 tcgttctgta atggaatatg gctgttttac ttttggtagt gccattcaaa ctcatttttc 4260 taaactagaa aaaatccaat ttcgttgttt aagaatttgt ttgaaattaa tgaattctac 4320 tcatactaaa tctattgaag ttttagctgg tatcattcct cttaaatacc gttttcatga 4380 actaaattgt aaatttttac tacaatgttt ctccaataat catccaataa ttaatatatt 4440 aaaatctttg tttgaaatta atcccaccag tagattattg agttcattca tttattgctc 4500 cgctgaaaat gtcatgttag attcatctcc tggtttctat gaatataaca tgaatgttca 4560 ttcttttcgt ccttatgttg attttacctt gcatgaagaa ttgaaacata ttcattttag 4620 tgctcgctca tattttgcaa aattattatt tcaacgtaaa tttattgggg tggaacctga 4680 acaaatttat ttcacagatg gatctttgat caataatata gcaggttttg gagttttcaa 4740 ttttcatatt gcacattttt ttaaattaga akctccttgt tcwatttaca ttgctgaatt 4800 aacagctctc cattttgctt gttgtttaat cagaaattgt gctccaaata tatatatggt 4860 gtgttctgat agttttagta gtcttcatgc tctgaacaca ataaatttca atttcaaaac 4920 aagtaaaatt attttatcta ttaaagaaat attgaatgat ttgttttcta gaggatttgt 4980 cattaaattt gtttgggttc ctgctcattc taatatttat ggtaatgaac aagctgactc 5040 tttagcaaaa ttaggtgttt ctcgtggaat aatttatgat cgggctattt ttccatcaga 5100 atactttact aaattgaaac aaaattctat aaataattgg caaatcgctt ggaatacaag 5160 tgataaagga cgttattgtt attccatttt tcctaaggtg aagcgtttcc cttggtttca 5220 tcaattatcg gtcagccgta attttatttg tttttatacc agacttatgt ctaatcacta 5280 tatatgtaac agtcatttat accgcataaa tatcaaggac tctaatcttt gtgaatgtgg 5340 tgaatcatat gaagatatag atcatattgt ctttaattgt actcgttttg atttgccaag 5400 aagaaatatt tttgaaaaaa tttcaagatt gggtcttaat gtacctacat ctattcgaga 5460 tattttggcg cttaaaaatt taaatatttt aaaaatttta tatgggtatt ttaatgagat 5520 ttcttattat agttgatact gctcgctttt ttatatttat tttcagataa ccgatccttt 5580 gtaccactcg gagtttaagt tgttttaact caagttactt ggatatacga ggaccctcat 5640 gatgactcca acggctccga tatggatcaa ttccggatga gcctttagta tttaagaatt 5700 atttttgtaa tgttgttcga aaagataaag aggttttgtg cctttttgag aaagatttcc 5760 aaaaagaaat cactcaaagg ggtttttccc tctttcaaaa ttcaagttaa aataaaagat 5820 aataataata ataataa 5837 // ID Gypsy-31_DPu-LTR repbase; DNA; INV; 128 BP. XX AC scaffold_55; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_DP_; KW Gypsy-31_DPu-I; Gypsy-31_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-128 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_55; Positions 727742 727869. XX SQ Sequence 128 BP; 24 A; 43 C; 25 G; 36 T; 0 other; tgctatgact cagcaacgtc gcccccctcc cctccctctg atcacgtcaa gcgtcagttg 60 gtctcgtctc gttggtagag tgaatacaag cctgtatcgc tctcgaattc agcctctttc 120 agacttca 128 // ID Gypsy-608_AA-I repbase; DNA; INV; 6037 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-608_AA_; KW Gypsy-608_AA-LTR; Ty3_gypsy_Ele137; Gypsy-608_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6037 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5092-5580] - Integrase core CC 'ACAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1397..2323 FT /product="Gypsy-608_AA-I_2p" FT /translation="MSAIELIRTIPLLIPTYQGDGEKLTSTVAALKACKSL FT IDNTNQAIALQVILSRLEGKARAAVGDAPQNIDEIIAKLEEKCKIKIAPET FT VVAKLNATKQVGEIGKFTETIEKLTLDLERAYISENVPVDAATRMSVKAGV FT KALAAGVKNSETRLLLKAGQFSTLTAAVEKVTENETTEKPTNSVLHFRQNS FT RNNPHNDGRSTNNYHRGRGNGNRPANYGRNGRYQSNDSSNYRRNNNGNGYH FT RQNNQNYGRNQRGRFHNQAPRVFYAQSGNQPAPQPGQVGSQGAYQGQPITQ FT QQQTQPQGQIIHHITRQ" FT CDS 2323..5919 FT /product="Gypsy-608_AA-I_1p" FT /translation="MKNGKCLQINTTCSNFIYLKLNFSETESTLLVDSGAE FT VSIFKVNSLKQSCLINSSEKCQITGINSLSTETIGTASTDISFPNGISLQH FT KFQLVDQNFPIPTDGILGRDFFTKYHCSINYDTWFLTVITNGETQEIPIED FT NLNGDFILPPRCEIYKHVYTEEINDNYVLLSKELKPGVYCANAIVSSQSHV FT VKLINTNDETIRIDGNFERTYIPLKHYHILSYDKSHVKERNQKLFDELKLK FT NTPENFKQKLLNLCGRFNDIFALEGDTLSSNNFYKQNINLSDNVPVYIKNY FT RSPETHRQEIDTQVDKLLNDNIIQASVSPYNSPILLVPKKSTTDTKKWRLV FT VDFRQLNKKIVADKFPLPRIDDILDQLGRAKYFSTLDLMSGFHQVELDDNS FT RRYTAFSTTSGHFEFNRLPFGLNISPNSFQRMMTIALSGLPPESAFLYIDD FT IIVVGCSVDHHLANLEQVFERLRKFNLKLNPSKCNFFCADVTYLGHHISEQ FT GIQPDKSKYSSILNYPEPKTADEVKRFVAFCNYYRRFIPYFADIAAPLNAL FT SRKNAKFVWDEHCKQAFISLKNKLLSPQILKFPDFNKTFILSTDASELACG FT AVLSQNHGDIDLPVAYASRSLTKGESNKSTIEQELTAIHWAILYFRPYLYG FT RKFVVRTDHRPLVYLFSMTNPSSKLTRMRVDLEEFVFDVEYVKGKENVGPD FT ALSRISIDSEQLKNLQILRVQTRSMTRTPQQSPRIQTANKTDHLMAYDSIN FT NIDAFNMLKLKFSSENNHLYATIYSKNVKGVIAQAQFPYALKKLNLTRYLK FT TINDMAKDVEIDKLAIARNDEIFKMLTPESFKVMCNETLTNVQLILYTPAQ FT ILKEEHEIMKIIEENHVTPLGGHVGINKLLQKLRRSYYWTNMKSDITKYVK FT SCLFCKQNKHTIKTKEHFTHTSTPERTFQTISMDTIGPFTRSNKGNRYALT FT IQCELSKYVIVTPLVDKQANTLAKAFVENLILKYGCPSVIKTDMGTEYRNE FT LFHNICNLLQIKQNFSTAYHPETIGSLERNHRCLNEYLRQFINEQHDDWDS FT WLPYYTFYYNTTPHTEHKFSPFELVYGRQFQFASNLTENIDPVYNTDSYFT FT EMKYRIQTACTKAKILLDKSKTSRNVTQAQKSNPLETKIGDTVWLKIENRR FT KLDPVYSGPFKIINIDHPNVTIEHFISKEQQEVHKNRIIK" XX SQ Sequence 6037 BP; 2225 A; 1262 C; 1055 G; 1494 T; 1 other; acaacgaacc tccaacgtat ttacatggcg acgcgtgcag gtgctaagtc ttcaaagact 60 tagaacagtt tacaagagca agtacaagtg cgaacgttca aatacagcaa atattttgaa 120 caatcaaata gtgaaatcaa gtgaaaatta aaaggactat gggcaagcac cagtccaaag 180 agaataatga aaacggcgac attcaattaa cggtgataca agaccagcag caacaacaca 240 gcgccgacca cgaggaacag accctgttgc tgtggctaat cctggtcaca gtgcttatcc 300 atttagcaat aaagctgtat aaaatgtaca aaaagcgtga aaaaagtatt gcagtgaaag 360 cagcccgctc ggtagcagcg attgtttaaa gcctggagaa accactcggc aaagtcacct 420 tgaagccgca catcattatc atctttaaaa ctaatcacaa atgcccggag cgcaccagct 480 cggcgaggcc aaccggtagc tgaaggacaa cagcatctac aaaaatacaa atcacaaatg 540 cccggagcgc accagctcgg cgaggccaac cagtagctgc aggacaacaa catctccaaa 600 aaatacaaat cacaaatgcc cggagcacac cagctcggcg aagccaccct gtagccgaaa 660 ttcatctgca tttttaaaaa atcatatcac aaaagcccgg ggcaacagct cggcgaggcc 720 accctgtagc cgatattcat cagcatcttt aaaaatcata tcacaaaagc ccggggcaac 780 agctcggcga ggccaccctg tagccgatat tcatcagcat ctttaaaatc atatcacaaa 840 aacctggggc aacagctcgg cgaggccaac ccagctgcgt atcaaaccga tgcagggcac 900 gacgacgacg cagctgagta acgagaacga caccggtaaa gagtgagtac tgcaagtcag 960 agcgaaatta accgataatt gggtcgattt ttttttttgt atcgtgtagt gatccgtgaa 1020 actagaatta gaaaaacatt tgtagctctc cgcaataatg tgtttgtatc attcataaaa 1080 cacaatacaa gaaatgcatg aaaaatatga tagactaaaa accatttata atgaattgaa 1140 agccaaagta agcgtcaggt cttctaaaag aatcctttta aaacagaaaa atctcgccga 1200 ttccctaatt aataaattaa cgggcgctgc tcaaattgag aactcaaagg aaaatagtat 1260 tttgttagaa aaagcgaaga aagtgcattt ggmaattctt tcgctagtag agataaaatc 1320 aaaatcacaa gttacttcct tcaagggtgc agcaacagca ctaatatttg caataaaagt 1380 aaaaaataaa gcaaagatgt cagcgataga attaattagg acgatcccgt tgctaattcc 1440 aacataccaa ggtgatggtg aaaagttgac gagcacagta gccgctctga aagcgtgcaa 1500 gtcacttata gataatacaa accaagcgat tgcccttcag gtaatccttt ctcgcttgga 1560 aggaaaggct agggccgcag tgggagatgc tccccaaaat atagacgaaa ttatagcaaa 1620 acttgaagag aagtgcaaaa taaaaatagc accggaaaca gtcgttgcta aattgaacgc 1680 aacaaaacag gtcggagaaa ttggcaaatt tacggaaaca atagaaaaat tgactttaga 1740 cctggaacgc gcttacataa gcgaaaacgt tccagttgac gcagctacaa gaatgtctgt 1800 aaaagcaggc gtaaaagctc ttgcagctgg agttaaaaat tcagaaacca gacttcttct 1860 aaaagctggt caattctcca cccttactgc cgcggtggaa aaagtaaccg agaacgaaac 1920 gacggaaaaa cctacaaata gtgtcttaca tttccgacag aattcccgga ataacccgca 1980 taacgatggt cgctctacga ataactacca cagaggccgt ggaaacggaa accgtcccgc 2040 taattacgga cgaaatggta gatatcaatc gaacgactca tcgaactatc gacgaaataa 2100 taatggtaat ggttaccata ggcaaaacaa ccaaaactac ggccgtaatc aacgtggcag 2160 attccataat caagctccgc gtgttttcta tgcccaatcg ggaaaccaac cagctcctca 2220 accgggccaa gttgggagcc aaggagcata ccaaggccaa cccattaccc agcaacagca 2280 gactcaacca cagggccaga ttatacatca cataactcgt caatgaaaaa tggaaaatgc 2340 ttgcaaatta atacaacttg cagcaatttt atttatctaa agttaaattt ttcagaaaca 2400 gaatcaactc tgttagtaga ttccggtgcc gaagtttcta tttttaaagt taattcgcta 2460 aaacaatcct gtttgataaa ttcttcagaa aaatgtcaga taaccggcat aaatagcctc 2520 tctacagaaa ctatcggaac cgcttcaaca gatatttcgt ttccgaatgg tatatcacta 2580 caacataaat ttcaattggt cgatcaaaac tttcctatac cgacagacgg aattctcggc 2640 agagatttct ttacaaaata tcattgttcc ataaattacg acacctggtt tctaacagtc 2700 attaccaacg gagaaacgca agaaattcca attgaagata acttgaacgg agattttatc 2760 ctaccaccaa gatgcgaaat ttataaacac gtttacactg aagaaatcaa tgacaactat 2820 gttctactat cgaaggaact caaacctgga gtttattgtg ctaacgcaat agtaagctcc 2880 caaagtcatg ttgtgaaatt aatcaacacg aatgatgaaa ccatacgcat tgatggaaat 2940 ttcgaaagaa cttatatacc gcttaaacat taccacattc tcagttatga caaatcccat 3000 gttaaagaac gaaaccagaa actttttgac gagctgaaac tgaaaaatac accagaaaat 3060 tttaaacaaa aacttctaaa tctttgcgga agatttaatg acatatttgc tcttgaagga 3120 gatacactgt cgagtaataa tttttataaa caaaacataa atttgtctga taacgtaccg 3180 gtatatatta aaaattatcg gtcaccagaa actcatcgac aagaaattga tacacaagtc 3240 gataaacttt taaatgataa tataatacaa gcttcagtat cgccatacaa ctcccccata 3300 cttcttgtac caaaaaagtc cacaacagac accaagaaat ggcgtttggt cgtagatttt 3360 cggcaactaa ataaaaagat agtagccgat aaattccctt tacctagaat agacgacatt 3420 ctcgatcaac taggtagagc caaatacttc agcactttag acttgatgtc cggttttcat 3480 caggttgaac tcgatgataa ttctagacgg tatacagcct tctccacgac ttctggtcat 3540 tttgaattca accgtctacc attcggtttg aacatttcac caaacagttt tcagagaatg 3600 atgaccattg ctctgagcgg attgcctcct gagagtgcat tcctttacat agacgacatc 3660 atagtagtcg gatgctcagt tgaccatcat ttagctaacc ttgaacaagt tttcgaaaga 3720 ttgcggaaat ttaatctgaa attgaaccct tccaaatgta attttttctg cgctgatgtt 3780 acctatttag gacatcacat ttctgaacaa ggtatccaac ccgataaaag taaatactct 3840 tctattctga actatcccga gccaaaaacc gcagacgaag taaaaagatt tgtcgcgttc 3900 tgtaattact acaggcgttt cattccttac ttcgcggata tagccgctcc attgaatgct 3960 ttatcgcgta aaaatgcaaa atttgtctgg gacgaacact gtaaacaagc tttcatttca 4020 ctcaaaaata aacttttatc ccctcaaata ttgaagttcc ccgattttaa caaaacattt 4080 attctcagca cagacgcatc cgaactagca tgcggtgctg tattgtcaca aaatcatggt 4140 gatattgacc taccagtggc atacgccagc cgatcattaa cgaaaggcga gagcaacaag 4200 tccaccatag aacaagaact aacagcaatc cactgggcta ttttatattt tagaccatat 4260 ctttatggta gaaaattcgt ggtcagaact gaccaccgcc cgctagtata cttgttctcc 4320 atgacaaatc cctcatcgaa acttactcga atgagagtag atttagagga gttcgttttc 4380 gacgttgaat acgtgaaagg aaaggaaaac gttgggcccg acgcactatc tcgtatcagc 4440 atagattcgg aacaactcaa aaatcttcaa atactaagag ttcaaacaag atccatgact 4500 cgaacacctc aacaaagtcc aagaattcag actgctaata agactgatca ccttatggca 4560 tatgattcta taaataatat tgatgcattt aatatgctca aactgaaatt ttcaagtgaa 4620 aataaccacc tatatgcgac aatttattca aaaaatgtga aaggggttat tgcgcaagct 4680 caatttccct atgcattgaa aaaacttaat ctaacaaggt accttaaaac aatcaacgac 4740 atggccaaag acgttgaaat cgataagcta gcaattgctc gaaatgacga aattttcaag 4800 atgttgacac ctgaaagttt taaagtcatg tgtaacgaaa cactgactaa cgttcagtta 4860 attctttaca ccccagcaca aatattaaaa gaggagcatg aaatcatgaa aatcatcgaa 4920 gaaaaccacg tgacaccctt aggaggacat gtaggtataa acaaacttct tcaaaaattg 4980 agacgtagtt attactggac taatatgaaa tccgatataa caaaatacgt taaatcctgt 5040 cttttctgta agcagaacaa gcacacgatt aaaacaaaag aacactttac acatacatcg 5100 acaccagaaa gaacattcca aacaatctca atggacacaa taggaccatt tacgagatcc 5160 aacaaaggta atcgctatgc attgactatc caatgcgaat tatctaaata tgtcatagtt 5220 acccctttag tcgataagca ggcgaacacg ctagctaaag cgttcgtcga aaatttaatt 5280 ctaaaatatg gatgcccgtc tgttatcaaa acagacatgg gtactgaata taggaatgag 5340 ttatttcata atatctgcaa cttgcttcaa attaaacaaa atttctccac tgcataccat 5400 ccagaaacaa taggaagttt agaacgtaac caccgatgtc ttaacgaata cttgagacaa 5460 tttattaacg agcaacatga tgattgggat tcatggttgc cctattatac attttactac 5520 aatactacac cacacaccga acacaaattt tcaccattcg aattagtata tgggagacag 5580 ttccagtttg caagtaattt aaccgaaaat atagacccag tatataacac agattcatat 5640 tttactgaga tgaaataccg aattcaaaca gcatgtacaa aagcaaaaat cttattggat 5700 aaatctaaaa cttcaagaaa tgtaacccaa gcacaaaaat caaacccatt agaaacaaaa 5760 attggagaca cagtttggtt gaaaatagaa aatagaagga aacttgatcc cgtgtattca 5820 ggaccattta aaataataaa catagatcac ccgaatgtaa caatcgaaca ttttatatca 5880 aaagaacaac aagaagtgca taaaaatagg attattaagt agaagtaaaa gtgaaaaaaa 5940 aaaataaata tatatatata tcatcgaaat tgtataatct atttcattat taacgaatag 6000 ttaactacat taaccattct tttcataagg gggtagg 6037 // ID Sola2-1_NVi repbase; DNA; INV; 3698 BP. XX AC . XX DT 15-FEB-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Sola2-like DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola2; Sola2-1_NVi. XX NM Sola2-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3698 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX RN [2] RP 1-3698 RA Jurka J.; RT "Sola2-like elements from the parasitic wasp Nasonia RT vitripennis."; RL Repbase Reports 9(2), 486-486 (2009). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(1296..2597,2545..3018) FT /product="Sola2-1_NVi_1p" FT /translation="MSLDESNDSTIHMETLDDSATSLNDSADFDPEKDNRE FT KSQREMELEEMLDGLKDKFSNLSNTDPLRLQILTIAPSTWSARKLAKEFGA FT TRYLAAKAKKIKSQQGILPNTTMKAGKNLPKNTIDEVFDFYTSDMNGRLTS FT STKEFVSVNIEGERVXQQKRLLLLNLSDLYRLFKQSHPDCDIGFSTFAKLR FT PKFCILPGASGTHSVCVCTIHQNVKLMLDAIDFKNLTKDTNLPLSDYRDCL FT KNMSCENPSPKCYIDQYDKCPGTTNILEHIRTLLDDKRISHVEYATWSGTD FT RSTLLKSVVKLDEFLTELDNKLKILKPHSFLSKQQSLFVKEKKENLKEIEV FT LVMFDFAENYCYVCQDASQAFHFNNDQCTVFPAIYYYKEDGELKHKSNVFL FT SNSKKHDTAAVYSIQQQLIPEIKKKSEKSYLRKRWSQATLKKKVKKVIYVS FT DGAKQHFKNKFQIANLIYHKQDFDVIAEWHYFTTAHGKSAYDGIGANFKRE FT AYKASLTAKPNDALLTSEALFNWSRKHFQNINVFYFDEKYHAKMQRKLNRR FT FEAAKAVPEISKQHGFIVDDQRNVCIKRYSKEEGGIQWSMYADN*" XX SQ Sequence 3698 BP; 1375 A; 572 C; 591 G; 1157 T; 3 other; cgttgaagct tctagttttt agtactgtat ggcatatgtg aatctttcaa atggcggacc 60 agtaatgctg ccaaaaggaa gaaactactc aatcaaagcc gttatttgta ttctgattcg 120 tataaaaata acaaattgct tcttacaatt cacttttttt atgtaattta ggtcgctgat 180 tataaatcta ttcgtagttt tcaagatggc ggatccaaga tggcggctga aaaatgtaaa 240 aatcatttta tatcgacagt ttgagtcggg tatgaatata agggttttcg gggtcgctga 300 ttacgaattt attatttatt ttttgaaatt ccaagatggc ggatccaaaa ttcaataata 360 cacttttttt aaacaaaaaa aatatttatt tgtaacaaga actaaactaa tctatctaaa 420 agtatgcgaa tcttacccaa atccttaata ataagcatga atgtaaagta aamattttaa 480 aatgtttaaa tttgtttact caaattattt agataaatta attttgcaat tttaaattga 540 aaattgtaat ctacaatact aaaaatgtag gcatacaaat ttcaagtaaa tcgactaaat 600 taaatgtggc tcaaaaaatt aagcgctatt actttgcatt tttatattta atgttgcccc 660 aactcattgt tagattaatc tgaaaaatgg tattaaattg cttcaaatca actacggcag 720 tgtataagca ggcagtgaaa agttaaaatt gtgaaaatag aaagtttaag tgtttgcatt 780 actattttta cttgctttac gttgatactg ctttgcttta aaagaaattt tccaattaaa 840 actgtttaat atcgatataa cattgtatgc ggtctatgca aatgttaaat cacagatttt 900 aaaagctaaa ttttatataa aatgtaattg atttataaaa ataagtatgt aaaaaaggtt 960 gaattaacta agtatattct tataatagtt gttcgttaat aattttgcag tagttcagaa 1020 aaagtaaatt acaagctttt gacagaggtt aacctttttt tttaatacga tttcttcttt 1080 caataaaact acattcttat cacaatgcca aaaagactcc attcggacag atgcgccaat 1140 ccatttaaaa aacccgacga gcaaggtcac attggtaaat ctttgagact aataccgaaa 1200 tctatgctag aagagtatct gatttatctg aacatagtaa ggtttgctct aaatgtgtaa 1260 amaaatttaa agataaaact gatggtaata atagtatgtc cttggacgaa tcaaatgatt 1320 ctacgattca tatggagact ctggacgatt ctgcaacgtc tttaaatgac agtgcagatt 1380 ttgatcctga aaaagataat cgagaaaaat cacagagaga aatggaattg gaagagatgc 1440 ttgatggact taaagacaaa ttttctaatc taagcaacac agatccttta aggcttcaaa 1500 ttctgacaat agctccatca acatggtctg ctagaaaatt agcaaaagaa ttcggtgcaa 1560 cgcggtattt ggctgccaaa gctaaaaaaa ttaaatcaca acaaggtatc ttacctaata 1620 caaccatgaa agcaggaaaa aatctaccaa aaaatactat agatgaagta tttgatttct 1680 atacaagtga tatgaatgga agacttacgt ctagtacaaa agagtttgtt tcagtcaata 1740 tcgagggtga gcgtgtamat cagcagaaac gtttactact tttaaattta tctgatttgt 1800 acagattgtt taaacaatca catcctgatt gtgacatagg ttttagtacc tttgctaagc 1860 tcagaccaaa attttgtata cttcctggag cttctgggac ccattcggtg tgtgtatgca 1920 caatacacca aaatgtaaag ttgatgttag atgccattga ttttaaaaat ctaactaaag 1980 atacaaattt accgctatca gattatcgag attgcttaaa aaatatgtct tgtgaaaatc 2040 cgagtccgaa atgctacatt gatcaatatg acaaatgtcc tggcacaaca aatattttag 2100 aacatatacg tacactatta gatgacaaac gcatttcaca tgttgagtac gctacatggt 2160 cgggaactga tcggtcaaca cttttaaaga gtgtcgttaa attagatgaa tttctaacag 2220 aacttgataa taaattgaaa attctcaaac cccattcatt tttatcaaaa caacaatcac 2280 tttttgttaa agaaaaaaaa gagaatctta aggaaattga agttttagtt atgtttgatt 2340 tcgcagaaaa ttactgttac gtttgtcaag atgcttctca agcatttcat ttcaataatg 2400 atcagtgcac agttttccct gccatttatt attataaaga agatggtgaa ttaaaacata 2460 agagtaatgt tttcttatca aacagcaaaa aacatgatac tgctgccgta tatagtattc 2520 aacaacagtt aatacctgaa ataaaaaaaa aaagtgaaaa aagttattta cgtaagcgat 2580 ggagccaagc aacactttaa aaataaattt cagatagcaa atttgattta tcacaagcaa 2640 gattttgatg tgatagctga atggcactat tttacaactg ctcatggtaa aagtgcgtat 2700 gatggtattg gtgcgaattt caaaagagaa gcttacaagg caagtctaac tgcaaagcca 2760 aatgatgctc ttctaacatc tgaagcactt tttaattggt ctagaaagca ctttcaaaac 2820 attaatgtct tttactttga tgagaagtat cacgcaaaaa tgcaaagaaa attaaacaga 2880 agatttgaag cggccaaagc cgtacctgaa atttcaaaac aacatggatt tatcgttgat 2940 gaccaaagaa acgtttgcat aaaaagatat tcaaaggaag agggaggaat tcaatggtca 3000 atgtatgcag ataattaaaa taagtaaata ttaataccga acgtgaaaaa ccggcccttt 3060 taagggccaa gtgatgtgca agaccgatcg cgcaacctcc tcgtggtgca cattggataa 3120 cgctaatgcc gacggcacaa aacgagtttt gacataaatt atacatgaag taacaacaaa 3180 tcttattcct gctattcaag aaataataca ggtaaaaaat tgtgtattcc tataagaaca 3240 gtagtttcct ttggtagctt agatctgata ccttggaatt tcaaaaaaca aaagcaaatt 3300 cgtaatcagc aacctcaaaa acacttatac tcatacccaa ctgtaaattt cgaaaaaatc 3360 gatttttgca ctatttatcc gctatcttgg aatttcaaaa ataaataata aattcgtaat 3420 cagcgacccc gaaaaccctt atattcatac ccgactcaaa ctttttcaca ttttggatcg 3480 ccatcttgaa aatttgaaaa ctacgaatag atttataatc agcgacctaa attacataaa 3540 aaagtgaatt gtaagaagca atttgttatt tttatacgaa tcagaataca aataacggct 3600 ttgattgagt agtttcttcc ttttggcagc cattactggt ccgccatttt gaaagattca 3660 catatgccat acagtactaa aaactagaag cttcaacg 3698 // ID Gypsy-188_AA-I repbase; DNA; INV; 7434 BP. XX AC supercont1.105; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-188_AA_; KW Gypsy-188_AA-LTR; Gypsy-188_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7434 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.105; Positions 1884281 1876848. XX CC Positions [3069-3572] - Reverse transcriptase CC Positions [4567-5043] - Integrase core CC 'AGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 412..2262 FT /product="Gypsy-188_AA-I_1p" FT /translation="MSFPHADYLTNEEINYELVLRNYKEETDKDIPTKLRL FT LRKLFQTDQKEGRDYRSPYRFEQEVDIISNRVLLIQQEWQGKEGDPRLVSR FT LKHYYLRVRRGKTTDKEVENTRRELLRSISETLRTVTDKYPGDDESKTNPG FT TLQGHENQTARDLGAYPKNLPLQSKPDDDFKDPKSSQPTYEELQEKIRELQ FT SAIQQMRTQGSQGHERRSRNWKQSEPGPDDLGSDISEPEMVKQHDKEHISN FT SSKQRATRTLEYRTDVSDSGGDEVRRTNRQFVDQIPYRHREYRHNPNREDE FT RRRNRISSDDRFSAGGSYSTDGYHHRNGRQIRRIEHWKLSFSGDNRTVSVE FT NFLYKLKKIAAREGVSQQGLLRDIHLVLDGQASDWFFTYVDEFEDWEDFEE FT KIRFRFGNPNQDQGIRQKIHERKQLRGESFSAFVSEIERLNKMLSRPLSKR FT RKFEVVWDNMRPHYRSKISIVRVRDLEHLIQLNHRIDAADSSFQIHNEFRN FT TEQRNQRNVHQIECASSQDECDEPEGVDVVDARLDHRGQMSRRRVQPTFAQ FT NTSKTNQQQTSESTGNACWNCRKTGHNWRDCKEPRAIFCYGCGELGRTIRS FT CQRCAESNRRWNAPIQGNQ" FT CDS 2235..3719 FT /product="Gypsy-188_AA-I_3p" FT /translation="MECTNPGKPVRECAMGTTDIPIEIKVPTEKGYFDPVD FT RIHTIKIQTKKCPYLRVNVFDTEILGLLDSGAGISVVNSLELSEKYGLKLQ FT PTRLRVCTADNTEYRCLGYLNVPYTYKNVTRVVPTVVVPQISKPILGCNFW FT RSFGISPMADLGNGPEQVGSFAETEEAFAFTLEPIGELPRIESTGNDTTLD FT VPTFDIPEESQEPTVETLEVEHELTDKERQELIQAIKQFPFTSSNNLGRTH FT LIEHEIVLKEGAKPKNQVMYRSAPAIQKEIDAEIQRFKDMDVIEECYSEWT FT NPLVPVRKSNGKIRVCLDSRRLNTMTVKDSYPIRNMLEIFQRLENAKYFSI FT IDLKDAYFQIPLKEECRNYTAFRTTKGLFRFKVLPFGVTNAPFTMCRLMDK FT VIGFDLIPSVFVYLDDIVIASRTLKEHLRLLKVVAERLKNANLTISLEKSR FT FCRKQVKYLGYLLTERGVAIDGSRISPILDYARPKTVKDVRRLLGLAGFY" FT CDS 3757..5424 FT /product="Gypsy-188_AA-I_2p" FT /translation="MELKSVLTSAPILANPDFAKPFIIESDASDTAVGAAL FT IQEQEGENRVVAFFSKKLSRTQRAYSSVEKECLGVLLAVENFRHYVEGSRF FT KVITDARSLLWLFTIGVESGNSKLLRWALKIQSYDIQLEYRKGANNITADC FT LSRSLNVLDLSSHDEEYEELAENIQAEPEKFPEYRVSDGNIFRYVKSKHRT FT EDNRFRWKLIPKEAERTEIIKREHEIAHFGFEKTLQGLKRRYYWSGMAKEV FT RNLCQKCLKCQTSKSGNTNVNPPMGGQKEFVEYPWQFVTLDYVGPLPASGK FT GRHTCLLVATDVFSKFVLVQPFREAKASSLVEFVRNMIFLLFGVPEVILTD FT NGTQFTSKAFRDLLSEYGVNHWLTPSYHPQVNNTERVNRVVTTAIRATIRQ FT HKDWAENLQSIACAIRNAVHESTKYTPYFVVFGREMVSDGKEYQRLRQGSG FT NTSGENSQDKRKRLYEEIKQHLSKAYERHKRTYNLRSNAECPTYIAGETVL FT KRTFEQSSKAKNFCAKLAPKYELAVVRKTLGKHCYELEDLQGKRLGVFYGS FT HLKKMHPQS" XX SQ Sequence 7434 BP; 2267 A; 1498 C; 1737 G; 1932 T; 0 other; ttttggcgcc caacgtttag aaggcagaca cattttttga atttttggcg cttattggac 60 aattgcacat ttttgagaat ttctttcgtt ttttgcacat aatactaaaa caaaaacata 120 tatacacaat acaaatcaat taatttatta tcgtatgcgg tatttttcat atataatcct 180 tttcataatt ttttttttac acataattca cacacatata tatatatatt atcatattta 240 tatcattaca attataagca ttttgttgtg tgaaagaaat attcgtagtt gtatgttgtg 300 tacaaatttg tgccgttcta aaattttctt tgtaaataga catttaaagt gaacttgtag 360 ttattttttt tattattcat aagccttgca cattctctaa taattataaa aatgtcgttt 420 ccacacgctg attatttgac taatgaggaa ataaattatg agctggttct gcgaaattac 480 aaagaggaaa ctgacaagga tattccaacg aaattgcgat tgttgcgtaa actgtttcaa 540 acggatcaaa aagagggaag ggattacaga tcaccttaca ggtttgaaca ggaggtagac 600 ataatctcca atagggttct actgattcaa caggaatggc aagggaaaga aggggatcct 660 aggctagtct cgcgattaaa acattactac ttgcgagttc gaagggggaa aactacagac 720 aaagaagtag aaaacactag gagggaactt cttaggtcta tttcagagac tcttcgcacc 780 gttaccgata agtacccagg cgatgatgag agcaaaacga acccgggtac actgcaaggt 840 cacgagaatc agactgctag ggatttgggt gcatatccca agaacttgcc gttgcaatcg 900 aaaccagacg atgactttaa ggatccaaaa tcatctcaac cgacatacga agaactacag 960 gagaagattc gtgagctcca atctgctatt caacagatgc gaacacaggg tagccaaggc 1020 catgaacgga gatcacggaa ctggaagcaa tctgagccgg gaccagatga cctgggtagc 1080 gatatcagtg aaccagaaat ggtgaaacaa cacgacaagg aacacatctc aaacagtagt 1140 aaacagcgag caacgagaac tctagagtat aggaccgacg tatccgacag cggaggcgac 1200 gaagtgagac gaaccaaccg acaattcgtg gaccagattc cgtacagaca cagggaatac 1260 aggcataacc caaaccgtga ggatgaacga cgtaggaata ggatatcgag cgatgatcgg 1320 ttcagcgcag gtggatcata ttcaacggac ggttatcatc accgaaacgg tcgacaaatc 1380 cgaaggatcg aacactggaa gctctctttc tcaggagata acaggacagt ttcagtggaa 1440 aacttcctgt acaagctgaa aaagatcgca gcaagggaag gagtatctca gcaaggtctg 1500 cttcgcgata tacaccttgt cctagacggc caggcctcag actggttctt tacgtacgtg 1560 gacgagtttg aagattggga agatttcgag gagaaaataa ggtttcggtt tggaaatcca 1620 aatcaagacc aagggatccg gcagaaaatc cacgagagga agcaactcag aggcgagtct 1680 ttcagcgctt tcgtatcgga gatagagcgt ctaaacaaga tgctgtcgag accactctcc 1740 aaacggcgga agttcgaagt ggtgtgggac aatatgcgcc cacactatcg ttcaaagatc 1800 tccattgttc gagtccgaga tttggagcac ttgattcaac tcaaccatag aatcgacgca 1860 gccgatagca gcttccaaat tcataacgaa ttcaggaata cagagcagag gaatcagcga 1920 aacgtccacc agatagaatg tgcatcttcg caagacgaat gcgacgaacc ggaaggagta 1980 gatgtggtag atgcaagatt ggatcatcgg ggacagatgt cgagaagaag agtccaacca 2040 acgtttgcgc agaacacgag caaaacgaat caacagcaaa ccagcgaatc aacagggaat 2100 gcgtgttgga actgtcggaa aacaggccat aactggcggg actgtaaaga gccacgagca 2160 atcttctgct atggttgtgg ggagttaggg cgaaccatcc gttcatgtca gcgttgcgct 2220 gaatcgaacc gaagatggaa tgcaccaatc cagggaaacc agtaagggaa tgtgcgatgg 2280 gaactacgga cattccaata gaaattaagg ttcccaccga aaaaggctat ttcgacccag 2340 ttgaccgcat acatacgatc aaaattcaga caaaaaaatg tccttattta agagtaaacg 2400 ttttcgacac cgagatccta ggattattag actcaggagc gggaatcagt gtggtgaact 2460 cgttggagtt gtcggaaaag tacggtctga agctccaacc aaccagactt agagtttgta 2520 ctgccgacaa caccgaatat cgatgcctgg gatatcttaa cgttccgtat acatacaaga 2580 acgtgacaag agttgtacca actgtagtag tcccacagat atctaaaccc atccttggct 2640 gcaatttctg gcggagtttt ggaatttcgc ctatggcaga tttagggaac gggccggaac 2700 aagtgggaag tttcgcggaa accgaagagg cattcgcttt cacccttgaa ccgataggag 2760 aacttccaag aatagagtcc acgggaaatg atacgacact tgatgtacca acctttgaca 2820 tcccagaaga atcgcaggag ccaacggtag aaacgctgga ggtagagcac gaattaaccg 2880 ataaagagcg tcaggagtta attcaagcta tcaaacagtt tccgttcacc agcagtaata 2940 acttaggccg gacgcacttg atcgaacacg agattgtgct taaggaggga gcaaagccta 3000 aaaatcaggt aatgtaccgc tcagcaccgg ctatacagaa agagattgat gcagaaatcc 3060 aacgatttaa ggacatggac gtcatagaag agtgctacag cgagtggacg aaccccttag 3120 tgccggtcag gaaatccaat ggcaagataa gggtttgcct ggactcgcgt agacttaaca 3180 cgatgacggt taaggatagc tatcccataa ggaacatgtt ggagattttc caacgattag 3240 aaaatgcgaa atatttttca atcatagatt tgaaagatgc ctactttcaa atcccgctta 3300 aagaggaatg ccggaattac acagccttcc ggacgactaa gggactcttc cgatttaaag 3360 ttttaccatt tggagtgacc aacgcccctt tcaccatgtg taggttaatg gacaaggtca 3420 ttggtttcga cctgatacct tcggtgttcg tctatcttga cgacatcgtg atcgcgtcta 3480 gaactttaaa ggagcatttg agactactga aagttgtggc tgaaaggttg aagaacgcca 3540 acttaacaat atccctcgaa aagtctcgat tctgccgaaa gcaggtcaag tatctagggt 3600 atctgttgac cgagcgcggc gtggcaatag acggctctag aatctcgcca attttggact 3660 acgcaaggcc aaaaaccgtt aaagatgtac gccgattgct ggggttagca ggtttctact 3720 aaatttgaat ggacggaggc tgcggaggac gcgttcatgg aattgaagtc agttctgact 3780 tctgcgccaa tactggccaa tccagatttc gctaaaccgt ttataatcga aagcgatgct 3840 tcagatacag cggtgggcgc agctttgatt caagaacagg aaggggagaa cagagtagtg 3900 gcgtttttta gcaaaaagtt gagccgaact cagcgagcgt attcgagcgt agaaaaggag 3960 tgtttgggtg tgctactagc ggttgaaaac ttccgccact atgttgaggg gtcacgtttt 4020 aaggtgatta ctgacgcacg aagccttctg tggttgttca ccatcggagt agagtccggc 4080 aactcgaaac ttctccgatg ggccttaaaa atacagtcat acgacattca acttgagtat 4140 cggaaaggag cgaataacat cactgcagac tgcctgtccc gttcgttgaa tgtgctagat 4200 ttatcatcac acgatgaaga atatgaagaa cttgccgaga atatacaggc cgaaccggag 4260 aaatttcctg aatatcgagt gtccgatgga aacattttcc gctacgtaaa aagtaagcat 4320 cgaacggaag ataatcggtt tcgatggaag ctgataccga aggaggcaga gcgaaccgag 4380 atcattaaga gagagcacga gatagcacat tttgggtttg agaagacttt gcaggggctc 4440 aaacgtcggt attactggtc gggaatggcc aaagaagtca ggaatctctg tcaaaagtgt 4500 ttgaagtgcc agacgtcaaa atcaggaaac actaacgtta accctccgat gggaggtcag 4560 aaggaatttg tagagtatcc ctggcaattt gtgacgctgg attacgttgg tcctctgcca 4620 gcttcaggga agggaagaca cacgtgctta ctcgttgcca ccgatgtttt tagcaagttc 4680 gtgttagtcc aaccgtttcg tgaagcgaaa gccagttcac tggttgaatt cgtacgaaat 4740 atgattttct tgctcttcgg agttccagaa gtgatactga cggacaatgg aactcagttc 4800 acttccaaag cgttccgaga tctcctgtct gaatatgggg tcaatcattg gcttaccccg 4860 tcgtaccacc cacaggtgaa caacacagag agagtaaacc gggtggtaac gacagcgata 4920 cgggccacga ttaggcaaca caaagattgg gcggaaaatc tgcaatcgat tgcgtgtgcg 4980 attcgaaacg cagtacacga gtccaccaag tacaccccgt actttgtagt gttcggtcgg 5040 gagatggtgt cggatgggaa agagtaccaa agattgcgac aaggttcggg taacacaagt 5100 ggtgaaaatt ctcaagacaa gcgaaagaga ctgtatgagg agataaaaca gcatctctcg 5160 aaagcatacg aacgccacaa gcgaacatac aatctgcgat cgaatgcgga atgtcctact 5220 tacatcgctg gggaaacggt actgaagcga accttcgaac agtcgagcaa ggcaaaaaac 5280 ttctgcgcga agctagcacc gaagtacgaa ctcgccgtag tcaggaaaac tttaggaaaa 5340 cactgctacg agcttgagga ccttcaaggg aaacggctag gcgtctttta tgggagtcat 5400 ctgaagaaga tgcacccaca gtcttagaca actttttttc aagctatgaa cctcttcgga 5460 gtgacctttt ataaaaaaaa cacgcctacg ggttcataaa tcaccgagtc actaaggtga 5520 ccaatgactt tttcaaagct atgtaacgtg agagactatt ttcaaaaata ccttcgggga 5580 aagtcatacg tcacatagtc gattggattg gggtgagaaa tggtttagga tcaccattgt 5640 cgattgtaga gtgtccctgg gcgatgattt ctcactgaag tagtaagaat catcaagcat 5700 agaaggagct gggcgatgat ctgataagta gtagtggcaa cacgaagact acagagagcc 5760 taagtgatga cttccgatcc ttagtactca gtactaagtt agtactgatt cttatgaagc 5820 aaggaaggag cccatgtttg acctctggcc ccaattgacc atacggtggc tctaaatcac 5880 ccatcactgt gattgatcgc ctgtaatgac gtcatctcac atggaaaatt cgatggacaa 5940 tgaaaactca agctatgaaa acttcccctc gaagcaacta aggtatttcc atgaaaatct 6000 ccacgaaaac accttcaagg gcaaaagaat caaagaataa agggagttga gactacgaaa 6060 cgtaatcctc ccatgtacat aatgtaaata ttagattagt tagttagtac gcgcctaacg 6120 ccctacggta atgctattat tagtacctta aaattagatt aattagtaaa gtaaattcac 6180 tcacgattct tattggaatt agcccttcgt tttcgtcctt cgatttagta tccagtccta 6240 gttgcatttt ttcattcagt catcgttttt ggttgtccag atcttcttta tcgacagtgt 6300 ttattttcgt ccttctatgg cgtttctgaa tagttcgtcc gtcttcgttc gcgtttaagt 6360 tatccagtcc atcgtttgtt atatatatcc gtccttgtgc tattcgaagc aattcttgta 6420 atccagttag ccaatcaatc actagaacca gtttccaatt aaaataatcg cccatgccat 6480 tgttgaacgg ttgaacttcg tagatgttgc ttcacctaaa aatagaaaat caaccctccc 6540 cgagcccgaa caacagttca attagccaca acaaccaacg atacaattaa ttttaaactt 6600 tccgttatgt tttcctccat ccttgtagta tttcatcgta aaaactttaa attccgagcc 6660 agctccaaat acaatttgca cttttctatt tttttccact ttttgttttg cttgttttga 6720 cgtttgtgtt cagttttgtt ccgttaagtt ggtagcgcat gagtgggtga gttacaaggg 6780 agtgagtgac ggctcgttcc cggagtgacc gatcgagggg tgattcattg tttgtggtat 6840 tgtacgttct cataatattt cgaatgaaat atttatgatg aaaaatttta aatttatgtt 6900 gtttgtgtgt tgtgaaatgt gttcggactt atgtctggag tattttcgca gcacactgtt 6960 taaagtggaa agcggctgtg atcatatgtt tcgggtagaa tatctggagg gatcgcagcc 7020 aggattcact agtccattta ttgtggaaag tgaacgcggt caagtattcg aaacatgttc 7080 tggagtagac tgacgttcag gattcacagg ttcttgttgt gggagtttta cggacactgt 7140 cttagttttc ggctcatggt ggctggagta gaagacaggg gtcctagttc atgtttactt 7200 ctctggagtt aaggcctact ccagatgaaa gtattttatt gtcacgtttt ttttttaatg 7260 ttgttgtgtg tatgcgttgc ctgccaactg acattttttc aattaaatag cccaatcatt 7320 tttcttcaat agaatcaaaa atcagctgga acatttttag ggtttaaccc tacgaaaatt 7380 tgattgaaat tctctcaatt tcaatcaaat tttcgtacct ttagtagagg ataa 7434 // ID ISL2EU-1_CS repbase; DNA; INV; 4382 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-1_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 670..1845 FT /product="ISL2EU-1_CS_1p" FT /translation="MPELLISGLKPLVKRLRNNSVPSVFAWSKKNIVTDRE FT ARLNHRSEELQGQLLFEFNDIELGAEAVVECEIDCEAHPGIPAPKITDSST FT QTEQTGSSSTFTITSFKQDNEGINFFTGLPNFTTFNAVLYSLGEGSNHLTY FT LYPLAPNNLSTTDLFFLVLIKLRQGKTNFELSRMFGLSEKDVYSIFVTWIR FT YLSLQWREIDIWPSKDLVNFYSPLSFKCEFPNTRVIIDGTEIPVKKPKAPL FT AQQVTFSTYKNRNTAKVLIGATPGGLISFISSAYGGCTSDRQIVERSNLVN FT MCDPRDSIMADKGFDVQDIFAPYDITVNIPTFFKKKSRMNLSTVMRDRRIS FT SKRVHIERIIGLGKTYKILKIPFNHTETEMADDIIFVCFMLCNFRNSIM" FT CDS 3414..2098 FT /product="ISL2EU-1_CS_2p" FT /translation="MPALSMKAIDDFLSQYGTSVSEKAELLYKDRFIQSIR FT IAEVSQHHGTFIQARCHAEMRKGVIYKIDIFIDDCDKVRESQCECAAGMGP FT SAHCKHVAMVLFAVNDFKVNKCITTRQTCTQKLQTFHHCKPYTGSPTKCSS FT LKLRQNPLHVNYLFDPRPLRFRNQPSYSSHFRNVVLNGCGPKSTIRQLYAP FT ALKQAVYNDHDYLKQHPEDTFLEHEEITVISLHRICEIINATSLQSESTCW FT HEERRKRLTSSNFGKICKSSEDCFVKLARNLIIHKSVNCAAVNHGKQYESV FT AFAKYKQIGLKDVMTTGLWVSHKQPYLASSPDGLVDDDGVIEIKCPYASRN FT CLINPATVPYLELRNNYLCLNRKHNYYYQVMGELYCTERKWCDFIVYTFCD FT FKVIRIVRDEGFIKDMIAQLVNFFNMYFRPAVLKCILYKDENKLF" XX SQ Sequence 4382 BP; 1398 A; 812 C; 768 G; 1404 T; 0 other; gggagacttt tcgtcgctag gtgtctctct tcccgactta gatgaagcct gaaaaccgtt 60 gttatgtttt tttatgcggg aaagcggtga taggaagcca aaagcacttt aaacgtacgg 120 gtttccgcga aattgtttaa aatagaccag actatttaac agtaataagt aagctgtgat 180 aattatgttc cattactagg tataaagctg catgtagcat ttaggcggta actacagaca 240 gtgtagcgct agcctagctg cagcctgcag catataaata ttgccacaaa gcatagttgt 300 agaatatccg accagatctg actggaaagg ataccatgtc gcaattattt aaatgtttat 360 tttgtaagat tacttaaata tatgtcttta ggctaaaaat ggcaacaaat tattgctgtg 420 tacctttatg caaaggaact gggggtcatc gatttcccag caatgaaacc ttaaaacaaa 480 gttggatatc agctattaaa agggtgagga ctattcgtgg aggaaaaagc tggcaaccag 540 gccaatattc aactgtgtgc tatgctcatt tcaaagagga ggactacctt tcttgtaata 600 aatatggtaa gtaatgcaca ttcttgtcct gatattaaat gactttttct aaagttgttt 660 gttttattaa tgccagaact tctaatttca ggattaaagc cacttgttaa gcgattgcgc 720 aacaacagtg tgccaagtgt gtttgcttgg tccaagaaaa atatagtaac agaccgagag 780 gcaagactta atcatcgcag tgaagaactt cagggccaat tactttttga gtttaatgat 840 attgaactgg gggcagaagc cgttgttgaa tgcgaaatag actgcgaagc acaccctggt 900 attcctgctc ccaaaataac agattccagc acacaaactg aacaaactgg ttctagtagc 960 acattcacca tcacaagttt caagcaggat aatgaaggaa ttaatttttt taccggtttg 1020 ccaaatttca cgacttttaa tgcagtgctt tatagtttgg gagaaggatc aaaccatctc 1080 acatatttat atcccctggc accaaataac ttaagcacca ctgatttgtt tttcctagtt 1140 cttataaaac ttcgacaagg caaaacaaat tttgaactca gtcgaatgtt tggtttaagc 1200 gaaaaagatg tctattctat ctttgtcact tggattcgtt atttaagctt acaatggaga 1260 gaaattgata tatggcccag caaagactta gttaattttt actcgcccct ttcgtttaaa 1320 tgtgaattcc ctaacactcg tgttattatt gatggcactg aaattcctgt taagaaacca 1380 aaagcacctt tagctcagca ggtcaccttt tcaacctaca aaaatagaaa tactgcgaaa 1440 gttttaattg gtgcgacccc tggtggtcta atatctttta tatcgtcagc atatggtggc 1500 tgtactagtg atagacaaat tgttgaacgc agtaatttgg taaacatgtg tgatcctaga 1560 gattccatta tggcagataa aggttttgac gtgcaggata tatttgctcc ctatgatatc 1620 actgtgaata tcccaacttt ttttaaaaag aaatctagaa tgaatttgtc tactgtaatg 1680 agagatcgta gaatttctag taaacgtgtt catatagaaa gaataattgg cctgggaaaa 1740 acctacaaaa ttttaaaaat tccttttaat cacacagaga ctgaaatggc tgatgatatt 1800 atttttgttt gttttatgtt atgcaacttt agaaattcta ttatgtaatc tcgatgatgc 1860 tttgccagta gttcaatgtt ttttttctgt tttcaatgtt cagggtgatg tgtatttcgg 1920 ttgggtattg tagagaatat aatttatatc gtatataata tattcatgtc cttgaatgtg 1980 attatattac aataaaactt gttatacatt tgtgaacaac atatgaactc gtcataaaca 2040 gaaaagcgaa caaatttaaa agggtaagaa ctttttatct attgtcgtta gcgctcaaaa 2100 taatttgttt tcatctttat acagaataca ctttaaaaca gcaggtcgaa aatacatgtt 2160 aaaaaagttt acaagctgag caatcatatc cttaataaag ccctcgtctc ttactattct 2220 aataacttta aaatcacaaa aggtatacac aataaagtca caccattttc gttcagtaca 2280 atacagctca cccattactt gataataata gttatgtttg cgatttaagc aaagatagtt 2340 gttacgcaat tctaggtaag gaacagtagc aggatttatc agacaatttc ttgaagcata 2400 tggacactta atttctatga caccatcatc atcaacaagc ccatcgggtg aagacgcaag 2460 atatggttgt ttgtgtgata cccataaacc agttgtcatg acatctttca aaccaatttg 2520 tttatattta gcaaatgcta cagactcgta ctgtttacca tggtttacag cagcacaatt 2580 cacagatttg tggataatta aatttctggc caatttcaca aagcaatctt cagaggattt 2640 acaaatttta ccaaaattgg atgaggtcaa gcgttttctt cgctcctcat gccaacatgt 2700 gctctcactt tgcaatgaag ttgcatttat tatttcacat attctgtgta aacttatgac 2760 tgtaatttct tcatgttcaa gaaatgtatc ctctgggtgc tgttttaaat aatcgtggtc 2820 attgtatact gcttgtttta gggcaggagc atacaattgt cgaattgtac ttttaggtcc 2880 acaaccattc aatacaacat ttcgaaaatg ggagctataa cttggttggt tgcggaatcg 2940 aagtgggcgg ggatcaaaca aatagttaac atgcagggga ttctgtctta atttgagtga 3000 actgcacttt gtgggtgacc cggtatatgg tttgcaatga tgaaatgttt gtagtttttg 3060 agtgcatgtt tgccttgttg tgatacattt gtttacttta aagtcattaa cagcaaaaag 3120 aaccatagcc acatgcttac aatgggcaga aggtcccatt cctgcagcac actcacactg 3180 actctctcta actttgtcac agtcgtctat aaaaatatcg attttgtata taacaccctt 3240 tctcatctcg gcatgacatc ttgcctggat aaacgtccca tgatgctggg aaacctcagc 3300 aatgcgaatt gactgaatga atctgtcttt gtacaataac tctgcctttt ctgatacaga 3360 cgtcccatat tgtgacaaaa aatcatcaat ggccttcata ctcaaagcag gcatctataa 3420 atcaatagca ttaacattta ttttacattt tgaattcggt acaaaatata aacaatgcaa 3480 tgataataac aaaaagcata tatatcggta ccgatgaaac caaacagttc acaatcaagt 3540 gtcggataac attaaccaaa acacaaactg ttataagtgt tcatatatat attaaaaaga 3600 atgatgtaac attgacacat aacaaatcaa ttttattttt tgttttcgct gcagagttag 3660 cctaaccact aatgatttaa aaacacaaag catataccaa aacaaatata ttgtacctta 3720 tttaaataca aacctttgta ttagaattta catctttata ctttaactgt gatggtgttg 3780 ccattgcata atcttgttgt gatgatgttg atggttgctg gctattaaaa ttgaagttgc 3840 gttgataact ttccagcctg aaattgcaat ctattaaatt tatttcgctc ctcttaatac 3900 cggtatcaga ggacaacact aaaaggaaag gagggaaagc gggaaggagg gaaggcggga 3960 aactgctcac taaaactatg aaaattaatt ttaaccctgc tctaaacact aatataaaac 4020 ccagaactaa ccctaacccc aaccccatgt ttaaaactaa cttataagcg ctaatcttca 4080 ttttaatgtg cagtttccct ccctcccgcc ttccctcttt ttagttttta ctgacattac 4140 cggtgccggt agcccactgt tacctctcca ctaagtctcc ttttttaccg ctaactttag 4200 cgcctctttc acgtaactct tcttttaact ccaaaatggt caaagcgcca aatttataca 4260 ttttgggcat tcaccgttgc tgactctact cttatggtaa cctatctggt gcatgtaaac 4320 aaacgacgtc gtcctcaaaa catctaacta acgcaacgcc cactagtgtt gaaaagtctc 4380 cc 4382 // ID hAT-6_AP repbase; DNA; INV; 2428 BP. XX AC Contig27391; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-6_AP. XX NM hAT-6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2428 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(8), 1790-1790 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(700..1083,1087..1848) FT /product="hAT-6_AP_1p" FT /translation="MSIKLRSDFQIFLIGSQITQIHGSKLPSNGQVLSVFF FT YNMRTVNLTTRESAALAVRECCIFWEKARIPIRAIHHCITKLINLYEEWRN FT LQKNAQKVGESYRLKENDLKKKIDLLFDIAHFDALKLIKIVDKQFLVNQRL FT PGRPGCLGGIDIKGEIKENIHIQRKLNESKRKEIYNTSCASTSTLLDNSFE FT QNLETDSLESSDISELETSINSLELIEPPTKKKKCRGKINFITPKLAGALD FT RCQLSIRDSVYVLQATLEALNFNTDDYVINQTSIHRARESYRCERGEPIKL FT QFKKSLPNYAVVHWDGKMLPDTTTRNATVERLPIVITSKNIEQILGVPKLE FT RSTGEEQAAAVCNALRDWGLCNIVQALCCDTLRPIPVV*" XX SQ Sequence 2428 BP; 878 A; 317 C; 398 G; 835 T; 0 other; taaatcggtg tgtatacata taaaagtgaa ctgcaaacaa tttcaggtca attaaataaa 60 gttaacccgt gccgacttga agttaaagtt tgagcttaat acatttttaa ataattttag 120 acaatttttg tatttccgta aatatttcgt aaaccattat gaaaatcgta attttttata 180 atatgttttt tcaagtaaat taaatttcca atatatttta taatataaca ttttttaata 240 aatctattaa tttataagat gcataagttt taatcttaaa aaacagaatt tctccacaaa 300 tttataattt acattttatt attttattgt aggattaata ataataatct tagcttataa 360 tatttggtaa aattataaaa tgttcaatgt tctaagtaat attatgtact tcttatacac 420 ttatttgtgt aattattagt tcttcatatt ttcgtattat actcgtataa ataataaacg 480 ataagattgt gaaaacgatt aagatgtgtt gaaatacgta ttgtttatct ataacttatc 540 gcattaatag tttccgtagg tattagtctt gctttataat ttataaaaga tattattttg 600 ttagtgatac tcgtttattt cagttgtgtc aacgtcttat gtgttaaagg ttattattgt 660 gtttgtcgtt agtaattcat aataatttaa atagcaaaaa tgtcaataaa actgagaagt 720 gattttcaaa tatttttaat tggatctcaa atcacacaaa ttcatggctc aaaacttcca 780 tccaacgggc aagtactttc tgtatttttc tacaatatga gaaccgttaa tcttactact 840 agggaaagtg cggcgttagc tgtacgagag tgttgcattt tttgggaaaa agcccggatt 900 cctattcgag ctattcatca ttgcataact aaattaatta acttgtacga ggaatggcgt 960 aatctacaga aaaatgcaca aaaagttggt gaatcttatc ggttaaaaga aaatgattta 1020 aaaaaaaaaa tagatttgtt atttgatatt gctcattttg atgcgttaaa attaattaaa 1080 atataagttg ataaacagtt tctagtaaac cagcgtttac cagggcgtcc tggttgtctc 1140 ggaggtatag atattaaagg agaaataaaa gaaaatattc acattcaacg aaagttgaat 1200 gaatcgaaaa gaaaagaaat ttataataca agttgtgctt caacttcgac attactcgat 1260 aactcatttg agcaaaattt agaaactgat agtctagaaa gtagcgatat atcggaatta 1320 gagaccagca ttaattcttt agagctaata gaaccaccaa ctaaaaaaaa gaaatgccgt 1380 ggaaaaataa attttataac accaaaattg gccggagcgt tagatagatg tcaattaagt 1440 attcgagatt ctgtatatgt tctccaggct actttagaag ctttaaattt taatacagac 1500 gattacgtaa taaaccagac ttctatacac agagctcgag aatcataccg ttgcgagcgt 1560 ggtgaaccaa taaaattaca atttaagaaa tcactaccta attacgcagt agtacattgg 1620 gacggaaaaa tgttaccgga tactacaact agaaatgcaa ctgttgaacg acttccaatc 1680 gtgataacta gcaaaaatat agaacaaatt ttaggagtac caaaacttga gcggtcaaca 1740 ggtgaagaac aagctgcagc tgtgtgtaat gcattacgag attggggact atgcaatata 1800 gtacaagctc tgtgttgtga cacactgcgt ccaataccgg tcgtttaaat ggagcatgca 1860 ttttaattga aaaaaagtta ggtagagact tactacatct tccgtgtaga catcatattt 1920 atgaacttat tttgagagcc gtttttgaaa ttaaaatacc tcaagttact acaagcccaa 1980 gtattccatt atttaaaaat tttcaaaaac aatggtgtga aatacatact aataaatata 2040 atagtgggat tgaagatcaa gcatgtggtg aatttagaaa atgtgaagga agatattctt 2100 agttttgtca aagataaatt aaaaattaaa cactcgtgga gattaccgtg aatttttaga 2160 attagtagta atgtttcttg gtggtgattt ggaaaacaat ataacaattc atcctccggt 2220 gctatgcatc aagctcggtg gatggcacgg gcaatttatt gcttgaagat atttttattt 2280 agaaattatt ataatattgc agagtctgtt aaaaaggcaa ttggagatat ttgtgttttt 2340 atcattatat tttatgttaa agcttggtta ggttagattg acctaaaatt ttttgcagtt 2400 cacttttata tgtatacaca ccgattta 2428 // ID BEL-116_AA-LTR repbase; DNA; INV; 382 BP. XX AC AAGE02029143; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-116_AA_; KW BEL-116_AA-I; BEL-116_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-382 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029143; Positions 15379 15760. XX SQ Sequence 382 BP; 98 A; 94 C; 75 G; 115 T; 0 other; tgttctgttt caccgaaatt atctcttata gataagggtc cgatacccct tatcgagtgg 60 agagtttctc atactctctc tcagcacacg ataggagatc gggtgatcaa catcccgatt 120 gtccactcag tttttagttt agcgctagcc accggtgaag gcaaacgtcg gcaaataaaa 180 attacttttc gcactgtata cctcttgttc gacctcaata aaagtttaat aacttacgcg 240 agtgttacaa tagtgcttcg tagtgaaagt tatatcactt ccgactgctg cttagattta 300 tggaccagct tgtaccacca tcaccttgct tctggtgatt tgcctgattc cggacctagg 360 tgagccacga cctgcccaaa ca 382 // ID Gypsy-5_SI-I repbase; DNA; INV; 4823 BP. XX AC AEAQ01012110; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_SI_; KW Gypsy-5_SI-LTR; Gypsy-5_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4823 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01012110; Positions 15207 10385. XX CC Positions [2851-3327] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(541..2247,2251..3693) FT /product="Gypsy-5_SI-I_1p" FT /translation="MTSVKGQKTTSTVKDALNILRIEDMIKDVEETFTEAQ FT NQKVTKANVPLITIKFKAFTVKALVDTGAQISAVTKKLYDALVTTNDIIDA FT LPVRKFFLKGDFARRGAAVSNKARIEFEFNGKKFQYEFYIVEQMAYRVVLG FT IDFLVKYHTNVLCDDRITVNFEAEADNEPKTIATITMEETDEALRQIIHKN FT AEIFQDGIGLVNHYKHKIVVKSNAPYKKRLYPIPDKHFKKVTSYIDDLEEQ FT GIVRKAATQYINPLVVVIKKSGDIRLCLDAKEINKRMCEDYAQPPTIEEVV FT RRFGPNGYYTSLYISNAFWQIPLEEQSQNYTGFTFNGEMYVFRRMPFGIKT FT AGASFTRAMSKALGPNANDFLIVYLDNILIASKTLEEHIKHIDYVLGKLKE FT VGFRLNKDKCEFIQREIKFLGHTFNEIEAEINADTKLAIRNCQRPKNKRDV FT QVFRGLVNWDRRFIKNLARMTKPLEELLRKGKKFAWNDEQQKSFEEIKLAF FT EEAPNLYLTRPGYKFGIYVDAIKTGLGARLYQYKEDGREKFTIAYASRALH FT GAESNYTITELECLAVVWALKKHTLLLRRQVRVHTDHRALQFLSTCVQNNT FT RIARWFSFLQEFDLDVIHIPGKVNTFADTLSRSADERRVLMKNDKCIGLIE FT NARDVTRTDDWVKIIREAQQAHPDLISAPQTDPHVLLERDNIVRHMDGNSD FT KIVLPEEKNWKMMKQAHKLLAHFGTDKLIGFMKKYFIGKKFEKYARDVVAS FT CKVCHATKFYTRPTVGMELPDRPGKAISLDLCGPWPQSKRKYKHVLVIMDK FT FSKLVKIYPLKDKKLATIINKLENDYFPNIEVPEEILTDNDGQFLSRRWQQ FT FAEEYNCTIKKTTPFNLQSNPVERVMRELGRIMRTYASHEHTLWANTVQRA FT ERVINATVHSSTGFTPNELHFGVDDHLELPREILPRIYEEILQDDKIIEAR FT VNLERNAQKRKKQADKFQTVNVYQPGDIVWIKTRPRSDARKRKVAKIHLLY FT DGPFQVLDVLRRNAYLIGDLNGNVRGAYNTRLLRPDREPHMRPR" XX SQ Sequence 4823 BP; 1825 A; 1020 C; 1027 G; 951 T; 0 other; taaaaacgaa aaaagtactt atttcatgga caaacgagat aacgaaccac aaaacgtgga 60 acaaaatcac atttccgtta agaaaagagc ctcgagaacg gaacgcaata aagaaaataa 120 acaattaggg gccaccaata ctttaacaaa ttccgcaaag aaaagagtcc caacagcgga 180 acatactaat aaaaaacaaa cacaggggac cacaaataaa caatatacta acagtacaac 240 agtccgagaa caagagtcaa ttacagaaaa gaagattcta agggctaaaa atgagaagca 300 gcgtaaaacg accggaacca acccaattcc agtaactcta gaagacggaa aagtggaagt 360 gatgaaggaa acaccgagtc aagctgtact agatcagcgc aaagtgatct tttcaaagga 420 tttgaaaagg gaacgaccca gtgacgacga aattgtcacg agaaaagttc cagaaaaaat 480 aaacccacaa attcaaacaa ttcgtgcgga taacgccaat tcccacgaga aatcagctaa 540 atgacaagtg ttaaaggcca aaaaaccaca agcacagtca aagatgcatt aaatatcctt 600 aggatagagg atatgatcaa agacgtggag gaaaccttca ccgaggcgca aaaccaaaaa 660 gtcacaaagg ctaatgtgcc gctaattact ataaaattca aagcgttcac agtgaaggcg 720 ctagtggaca ccggagcaca gatttcagcc gtaacaaaga agctgtacga tgctttggtc 780 accaccaatg atataattga cgcattgcca gtgagaaagt tcttcttaaa aggcgatttt 840 gcaagaagag gagcagcagt gtcaaacaaa gccagaattg aattcgaatt taacggcaag 900 aagtttcagt acgagttcta catcgtcgaa caaatggcat acagagtggt acttggaata 960 gacttccttg taaaatatca caccaacgtg ctatgcgatg atagaatcac ggtgaatttc 1020 gaagcggaag ccgacaacga accaaagacc atcgcgacaa taacgatgga ggagactgac 1080 gaagctttac gccaaataat ccacaaaaat gccgagattt tccaagatgg cataggattg 1140 gtcaaccact acaagcacaa aattgtggtt aaatcaaacg caccttacaa aaagagatta 1200 tatcccattc ccgataaaca ttttaagaaa gtcacaagct atattgacga cttagaagaa 1260 caagggatag ttagaaaagc agccactcaa tacataaacc cacttgtggt tgtcattaaa 1320 aagagcggag acattcgtct ttgtctcgac gccaaagaaa taaacaagag gatgtgcgaa 1380 gattacgcgc agccaccgac aatagaggag gttgtccgca gattcggacc aaacggatac 1440 tacacctcac tctacataag caatgccttt tggcaaatcc cattggagga gcaatcgcag 1500 aattataccg gatttacgtt taatggcgaa atgtacgttt ttcgtcgcat gccattcgga 1560 atcaaaacag ccggagcttc tttcaccagg gctatgagca aagccttagg cccaaatgca 1620 aacgactttc taatcgtcta tctcgacaac attttaattg cctccaaaac cttggaagag 1680 catattaaac atattgatta cgtccttgga aagctgaagg aagtaggttt ccgtttgaac 1740 aaggacaaat gcgagttcat acaaagggag atcaaattct taggccacac tttcaacgag 1800 atcgaagcgg aaataaacgc tgatacgaaa ttggccatta gaaactgtca aagacccaag 1860 aataagagag acgttcaagt gtttcgcggt cttgtaaatt gggaccgcag atttatcaag 1920 aacctagcca gaatgacgaa gccattagag gagcttctaa gaaaaggaaa gaagtttgca 1980 tggaacgatg agcaacaaaa gtcatttgag gagataaaat tagccttcga ggaagcacca 2040 aacctttatt tgacccgtcc aggttacaaa ttcggcatat atgtcgatgc aataaaaacc 2100 ggattgggag cccgactcta ccaatacaag gaggacggca gagaaaaatt taccattgca 2160 tacgccagcc gtgcgttgca tggagctgaa agtaactaca caatcaccga actagaatgt 2220 ttagctgtag tttgggccct gaagaaatga catactcttc tactaaggag acaagtgcga 2280 gtacacactg accaccgagc cctgcagttc ttgagcactt gcgtgcagaa caacacaagg 2340 attgctagat ggttcagctt tctacaggag ttcgacctag atgtaatcca tattccaggg 2400 aaagttaata catttgccga tactttgtcc agaagcgcag acgaaaggcg cgtactcatg 2460 aagaatgaca agtgcatcgg tttgatcgaa aatgcaagag atgtaaccag gaccgacgat 2520 tgggtaaaaa tcatccgcga agcgcagcaa gcgcacccgg acttgataag cgcaccacag 2580 acggatccac acgtgttact ggaaagagac aacatcgtcc gacatatgga cggaaactct 2640 gataaaatcg tcttaccgga ggaaaagaac tggaagatga tgaaacaagc gcacaaattg 2700 ttagctcact ttggtaccga taagttaatc ggattcatga aaaagtattt catcggaaaa 2760 aaattcgaga aatacgccag agatgtggta gcatcgtgca aggtgtgcca cgcaacaaag 2820 ttttacaccc gaccaaccgt aggaatggag ttgccggaca gaccaggcaa ggccatatca 2880 ctggacctat gcggaccatg gccacaatca aaaagaaaat acaagcacgt actcgtgatt 2940 atggacaaat tctccaagct agtaaagata tatccactga aggataaaaa actggccaca 3000 atcatcaata aattggaaaa cgactatttt ccaaacatcg aagtgccaga agaaattctt 3060 acggacaacg acggacaatt cttgtccaga agatggcaac aatttgcaga agaatataac 3120 tgcacaataa aaaagacaac gccattcaat cttcagtcca acccagtaga aagggttatg 3180 cgtgagctag gacgaataat gcgtacttat gcttcacacg aacatacctt atgggcaaat 3240 actgtacaac gcgcagaacg cgttatcaat gccacagtac actccagcac tggatttacg 3300 ccaaacgaat tacattttgg tgttgacgat catctggagc taccaagaga aattttgccg 3360 cgaatttacg aggaaatctt acaagacgac aaaattatcg aggcgcgtgt gaacctagag 3420 agaaacgcgc agaagagaaa aaaacaagcg gacaaatttc agactgttaa cgtctatcaa 3480 ccaggagaca ttgtatggat caaaacaagg ccacgttcgg acgcgagaaa gagaaaagtc 3540 gcaaaaatcc atttactata cgacgggcct ttccaagttc tcgatgtgtt gagacgaaat 3600 gcatacctaa ttggagacct caacggaaac gtgaggggag cctacaacac ccgtctgcta 3660 aggcctgatc gcgagccaca catgaggcca cgataggagg ccctagatga acaaatagac 3720 gaacccgagc aggactcacc acaaacgagc gtgtaccagg atgccgaaga cggttcaaaa 3780 agttacagtg aaagatacga gagtcttgat caagaattac cacaagacga tcaattagag 3840 gagaacgaat atgatccgga tgaatatcta tcagaggatc aagacgaaca atactatccc 3900 gacgaggatg gcaaacagca tattacagag gaggaagaag attactccga accggaagct 3960 gacgaacaaa tttcggacta cgaacgattt gataccgctg agtacaaaca tgaatatgta 4020 cccatagaag cagagcaata cctcgaagag ggtgaccaaa ttgtggatac aaccacaaac 4080 caaacactgg atgattccga aattatcgaa ttatccagcg aacaggaacc aatcgaagaa 4140 tattcgcagg aagactcaca atatcctgac aagcacgaga caagaagaac tatccgagga 4200 agaacacttc acggaatatg atcccagagg aggactattc ctagacgaga taccacgatg 4260 gcatgaagaa aaaaacaaat acattgtcga agagccacaa gatggtgatc cagaggatag 4320 aatcacgtca attcgatcgc agaaccgcag tatagcgaca cagggtcaac aattgacaca 4380 attaatggca aacttgccaa accaacacaa ggagacctac caggacaacc gtgttcaagc 4440 cagaatgaag aaaacaagat attctaccag gggtggaacc acacatgcca cactgaaaag 4500 tgaatatcac ctgtgcgtaa ttactattaa ggaagaaccc aagaaacagt ttaatcttcc 4560 agaatgcact ataacgccaa ttcgcagaaa aattgcaata aatggcgtat atataaacga 4620 aaacaataaa agaccacgag aagaggtcga aacagtgaaa ccagattcaa aagtgaaaag 4680 agataccgca aataatacgg taacaacaaa aaatctaaaa aatattaccc aaagatactc 4740 cataaggaac atcaaggaac atcccaccaa atctacaaaa attgccgttt tattaaaaca 4800 aaacgacaat ttggcgggaa aat 4823 // ID Gypsy-31_OD-I repbase; DNA; INV; 5750 BP. XX AC CABV01003571; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_OD_; KW Gypsy-31_OD-LTR; Gypsy-31_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5750 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003571; Positions 6069 11818. XX CC Positions [2174-2635] - Reverse transcriptase CC Positions [3728-4195] - Integrase core CC 'TGTGA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..1097 FT /product="Gypsy-31_OD-I_2p" FT /translation="MENFEFINIKTAIEDKIQRANWIFVKSLENHERDNKD FT DLTAQFERLAATFVYNSCSDETIKSNGEELCSFEKAEDRLTFLSSFYAIET FT KSEKIRRYKEKLEKLARLSNEKFELFLSKVTSTVEQITDKADAREYIINEK FT FRQILSPETESFLRDMCMLEKSPKEIAVFLDQRERNITVPKIAQLQIDNKI FT DTLVQSTSEMIAKTSIAFEEQIARMEDKHKKSEEMAEQRAKILNNQIENLT FT ATISKLTFERNNQQPSRPYFPNQNQSNGQNSFQPRPRFCNYCKVDTHYRSQ FT CPHVTCFSCGNKGHMRNRCPNNQNTAQTQNAPKPADFQAKNLNA" FT CDS 2039..5698 FT /product="Gypsy-31_OD-I_1p" FT /translation="MYRTPFALRAEMKRILESFIQQGIIEPTRSEFNSPCL FT LVRKKNGSFRLVVDYRRLNQITTQQHHPLQQIEDVLSYLEKSKIYSCIDLR FT KGFHQCSVEKSSRPALAFSNEWGQYTWRKMPMGTKNAPLHFAKCIDHIFRD FT VPKSKICCYLDDLIVHSKTESEHFENLEKFFIILSQNNLRINIDKANFFER FT NASILGYEICDGKVKPSEDKIEAVKKLNIPRSREEAQSLFGLLSQHRKFIQ FT GFAPLGLEISKTYRGNFQWTENASVALNKLKSIICENTLELKIPPMDECLY FT VLETDASNESFGGVLYLCIDNNRTHEHDLNCLRPCAYHSLNFTPTQMKYCT FT LEKELYAGRSCMEKWKVYLAFVNFIWITDNSCVKFANSFKTNNWKIQRWLS FT EIMGFSFSIIQRKSRQMKISDCLSRNTANISQIKFSNTGFVDFQKKDTILK FT QVRNYVSLNRWPNNPPRELYQYFLNRQNLEILSSDELIINCASGLKKFCVP FT ESLQGEIIAEYHNHTHPGIDICFNKISTKYFWPGIRSTVSEFIRTCHYCQS FT AKPNNHPNRASLGKFRTPSGPYEAISIDLIGPLKETDFGNIYIFTYIDGFS FT KKVYAEPLNQKKSEYLLQVFRSLLFRNPKFPKFVILDNAPEFNAIAKFLKD FT NKIEPHYIPPRHPQSNGLVENANRTIKARIRAKTNYINWDQYLHEIIHEIN FT SSHHSVLKLSPFSVETGVLDNHNFQDSNWRSYGNKVEIDFKDIREKIDAEK FT NKRLAKFRNEKFKEYELEDLVLIKNFRSKFPPFIGPFRIIYKSPTGTWYNC FT KNDDKEFRRHADDLKPYKLRETSAFEQLQSNEIGQKQQKTEPRTAVYYDFS FT DDESSYGDNDFDKIDDLTIINKDYENNGPNRDENAESEDSSESSSESSSSS FT SDDSEIFRVPEKIKNILERREKELLEITEIINTLPKPNISSDVSGSIVSPA FT ESIISAKSPDNSSNTSRGATYLEHDVDKNVSYFEDNELEIEPANNGEILDL FT STEAETHLISYDKLSESVTGNNRKRERSDDSLISRKVAKFTHENNLQNMAL FT LIKMEYEDKSFFEFINDKWKNNAVENIPIERGMFLTLRELTKDLLLFILFK FT LNKEFCVDENVTVLRLRIQETIAKEYPNWRRSRSGKYLFFASFRTQKERNL FT YDLSLPELKVVCAHYNLPTPQKYSKTFLTAFIEQELPKVSRNHPRRKNELI FT FVPDLNETEI" XX SQ Sequence 5750 BP; 2042 A; 1031 C; 1046 G; 1631 T; 0 other; tggtgaacac aaatgggaaa actcctgaag aggtacttta cagtcacaat ctgttttcgg 60 ctgtttcgaa ccagcttatc cgtccccaga tggaaaattt cgaattcata aatataaaga 120 cggccattga agacaaaatc cagcgagcaa actggatttt cgttaaaagc ctcgaaaatc 180 atgaacgtga taataaagac gatttgacag cacaatttga aagattggct gcaacttttg 240 tttataactc ttgcagtgac gaaacaatca aatcgaacgg agaggaactc tgctcgtttg 300 aaaaggctga agacagattg acatttttat caagttttta cgcgattgaa acaaaatccg 360 agaaaatacg gcgttataag gaaaaacttg aaaaactcgc gcgacttagt aacgaaaaat 420 ttgaattatt tttgtcaaaa gtgacctcga ctgttgaaca aataacggat aaagccgacg 480 caagggaata cattattaat gagaaatttc gacaaatttt atctcccgaa acggaaagct 540 ttcttcgcga tatgtgcatg cttgagaagt ctcctaaaga aattgcggta tttttggacc 600 aacgagagag aaatattacg gtgccaaaaa ttgcgcaatt acagattgat aataaaattg 660 acacgttagt tcagtcgacc tctgaaatga ttgcaaagac ctcgattgcg tttgaggaac 720 aaattgctcg aatggaagac aaacataaaa aatcggagga aatggccgaa caacgtgcca 780 aaattttaaa taatcagatt gaaaatttaa cggccacaat ttcaaaattg actttcgagc 840 gtaataatca acagccatcc cgcccttatt ttccaaacca gaatcaatcc aacggccaaa 900 actcgtttca accgcgcccg cgcttctgca attattgcaa agttgacacg cactatcgaa 960 gtcagtgtcc acacgtcacg tgtttttcat gtggaaataa aggtcacatg cgcaacagat 1020 gcccaaataa tcagaacacg gcccaaactc agaatgcccc taagccagca gattttcaag 1080 caaagaattt aaacgcctaa atggagataa agaccctggt caggctttag ctcctttaag 1140 cgttaaaatt aatcgcatag gattacaata ttatgttaat atttcaattt tttcaaaagt 1200 aattaaatgc ttagtagata caggaagtca acttaatctc ttgcctaaat cattcgttcc 1260 cgaaaatgtt aaaatatgcc cacccgatct tgttgcccgt aattatggtg gaggttctgt 1320 tgttattctt gggtttgttg aggagaaatt ttatatcgaa aaccaacttt ggggctcatc 1380 acgtttttat atagtgccag ataattgcga accaatatta ggatcagaag caattaaatc 1440 gaacgaaata accgtcattt tgcatcgaaa cagattaata aaaagcggtc cgatagaaag 1500 attagctaaa ttatgcgcgt taaaagtaga cgaagttcga attgatggaa tcaaggaaga 1560 ctgctttttc gcccgttcgg aacaaaattt aacatttagg ggtaaaagcg agcaattgat 1620 cgatttaaaa attgacaatc ttacagagcc tagaaatctt ttcgtagacg aaactcagct 1680 cggaaattca aaactagaaa taattcaatc attccaatct gttgatcctt cacatccctt 1740 tatacaagtt cttattataa atccgcttaa ttcaacaatt aaaataccag ctggaacagt 1800 tttcgcaaag ttatcagaga ttgcgcaagt tgcaaatttg aaatcgaaaa gaaaagatct 1860 gtcgaaagtt cttagtcgga taaaagtcgg agaaatatca tcaaaaaatc gtggaaaatt 1920 cgagaatctg atacagcaat tttcttttct ttttcaagat gaagacgaca tactgccaga 1980 aactgctctt gaggaatttt ctattgacat tggagataat aagcctacaa tcagctctat 2040 gtaccgaacc cctttcgcct tgagagcaga aatgaaacga attttagaat catttattca 2100 gcaaggtatc attgaaccaa ctagatccga gttcaattca ccttgtctac ttgtccgtaa 2160 aaagaatggt tcctttcgtc ttgtcgtaga ttacaggcgc ctaaatcaaa taacgaccca 2220 gcagcaccat cccttgcaac agatagagga cgttttaagc tatttagaaa aaagcaagat 2280 ttactcttgc attgatttac gtaaaggatt ccatcagtgt tcagtagaaa aaagctcccg 2340 gccagcttta gcgttttcaa atgaatgggg ccagtacact tggcggaaaa tgcctatggg 2400 cacaaaaaac gcaccgcttc attttgcaaa gtgtattgac cacatttttc gggacgttcc 2460 gaaatcaaag atttgttgtt atttagacga tttgatagtt cacagtaaaa cagaatcgga 2520 acacttcgag aatttagaaa aattttttat tattttatct caaaataatt taagaataaa 2580 tatcgacaaa gcgaattttt tcgaacgcaa tgcttcaatt ttgggatacg aaatttgtga 2640 cggaaaggtt aagccgtcgg aggataaaat cgaagcagta aaaaaattaa atattccgag 2700 aagtcgcgag gaagctcaga gtctgttcgg tcttttatca cagcacagaa aatttataca 2760 aggatttgca cctttagggt tggagatttc aaaaacttac aggggaaatt ttcaatggac 2820 tgaaaatgca tctgttgctc taaacaagtt aaaatcgata atttgcgaga acacgcttga 2880 gttaaaaatc ccgccgatgg atgaatgtct gtacgttctg gaaacggatg cgtcgaatga 2940 atccttcggg ggtgttctgt atttatgtat cgataataat agaactcacg aacatgattt 3000 aaattgtttg cggccttgtg cttatcattc gttgaatttc actccaactc aaatgaaata 3060 ctgtacgcta gaaaaagaat tgtatgcggg tcgatcttgc atggaaaaat ggaaagttta 3120 tttagctttt gtcaatttta tctggattac tgataacagt tgcgttaaat ttgcaaattc 3180 gtttaaaact aataattgga aaattcagcg ttggcttagt gagataatgg gattttcctt 3240 ctcgattata caacgaaaat ctcgtcagat gaaaataagc gactgtctat cccgaaacac 3300 agcaaatatc agtcagataa aattttcaaa taccggcttt gtagattttc aaaagaaaga 3360 cacgatcctt aaacaagtac ggaattatgt ttcactgaac agatggccaa ataatcctcc 3420 tcgcgaatta taccaatact ttctcaatcg tcaaaatctc gaaattttat cctcagacga 3480 acttattata aattgtgctt ctgggttaaa gaaattctgc gttccggaat cacttcaagg 3540 cgaaataatc gcggaatatc ataatcatac tcatccagga atcgatattt gttttaataa 3600 gatttcaaca aagtattttt ggcccggaat aagatcaaca gtttcggaat ttataagaac 3660 atgtcattac tgtcagtcgg caaaaccaaa caaccatcct aatcgagcgt ctctcggcaa 3720 atttagaact ccctcgggcc catatgaagc tatttcaatc gatcttatag gcccccttaa 3780 agaaacagat tttggaaata tttatatctt cacatatatt gacggattct caaaaaaagt 3840 atacgctgaa ccattaaatc agaaaaaatc cgaatattta cttcaagttt ttcgaagcct 3900 tctttttcga aatcccaagt tcccaaaatt tgttattcta gataacgccc cggagtttaa 3960 cgcaattgcg aaatttctta aagataacaa gattgaaccc cattacattc ctcctcgaca 4020 tccccagagt aacggccttg ttgaaaacgc taatcgtaca attaaagcaa gaataagagc 4080 aaaaacaaat tacattaatt gggatcaata tttacatgaa ataatccatg agattaatag 4140 cagtcatcat tcggttctta aattatcgcc attttcagta gaaacaggtg tcttagataa 4200 tcataatttt caagattcta attggcgttc atatggaaac aaagtcgaaa tcgactttaa 4260 ggatatacga gaaaaaatag acgctgagaa aaataagaga cttgctaaat ttagaaacga 4320 aaaatttaaa gaatacgagc ttgaggatct tgtgcttatt aaaaattttc gctctaaatt 4380 tccaccattt atcggtcctt ttagaataat ttacaaatct ccaacgggga catggtataa 4440 ttgtaaaaat gatgataagg aatttcgaag acatgcagat gatcttaaac cctataaatt 4500 acgagaaact agcgcatttg aacaattaca aagtaacgaa attggccaaa aacaacagaa 4560 aactgaaccg aggacagctg tatactatga tttttcggac gacgagagtt cttacgggga 4620 taatgatttt gataaaatag acgacttaac gataattaat aaggattatg aaaataatgg 4680 gccaaatcga gacgagaacg ccgaatcgga agattcgagt gaaagttcaa gtgaaagttc 4740 gagtagttcg tcggacgatt cggaaatttt tcgagtacct gagaaaatta aaaacatttt 4800 ggaacgtcgc gagaaagagc tgcttgaaat tacagaaata ataaatacct taccgaagcc 4860 gaacataagt agcgatgttt caggaagtat agtatcacca gctgaatcca taatttccgc 4920 gaaatcgccc gataatagtt caaacacctc tcgtggcgcg acttatttgg aacatgatgt 4980 agataaaaac gtttcctatt ttgaagacaa tgagctagaa attgaaccgg caaataatgg 5040 agaaatctta gatctatcaa cagaagcgga gacgcatctc atttcttacg ataaattgtc 5100 ggaatctgtt actggtaaca atagaaagcg ggaaagatca gacgattcac tgatctctag 5160 aaaagtagct aaatttactc acgaaaataa tttgcaaaat atggctcttt tgattaaaat 5220 ggaatacgaa gataaaagtt tttttgaatt tatcaacgat aaatggaaaa acaatgcagt 5280 agaaaatatt ccaattgaaa gaggaatgtt tttaacttta agagaattga caaaagatct 5340 tcttttattt attcttttta aattaaacaa agaattttgc gttgacgaaa atgtcaccgt 5400 gttacgattg cgaattcaag aaacaatcgc aaaagaatat ccaaattggc gtagatctcg 5460 ctcgggtaaa tatctttttt tcgcgtcatt tagaacgcag aaggagagaa atttatacga 5520 tctgtccctt ccggaattaa aggttgtttg tgcgcattat aacctgccaa caccacagaa 5580 atattctaaa acttttctga ctgcttttat cgaacaagaa cttccaaaag tttcacgaaa 5640 tcatccccgt cgtaaaaatg aattaatctt tgttccggac ttaaatgaaa cagaaatata 5700 aatcgagtac aaaaaaaaga ttctcgccct ggacccataa gtggtcacag 5750 // ID Gypsy-4_PPc-LTR repbase; DNA; INV; 237 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_PPc_; KW Gypsy-4_PPc-I; Gypsy-4_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-237 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1001-1001 (2010). XX DR Genome; chrUn; Positions 23867569 23867805. XX SQ Sequence 237 BP; 55 A; 73 C; 36 G; 73 T; 0 other; tgggtaaaga gattggtctg cagatattct tcttgtctac acgtcttcta ttggattatc 60 tactcgacaa gactcacaat aagattgaat atgaagatcc ctcactcatc cctcatttcc 120 ctctcacttt gactcggtcg cccctctata taagggtcga cctccccctc tttactcact 180 ctcataccga gataaatcag tgatcgctcg ctctcttctc gccttccaca cggcaca 237 // ID DNA-1_DPu repbase; DNA; INV; 1031 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE DNA transposon from Daphnia: consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-1031 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (22-MAY-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC >94% identity to consensus. TA TSD. XX SQ Sequence 1031 BP; 321 A; 199 C; 201 G; 309 T; 1 other; cagtcccctc cataagtatg ggatcacccc gcgccgttcc ataaggcggc gtgtattttt 60 catatgggtt tttcgggtgg caaaaatcga aaaaatgaca taaattcacc aaacttggga 120 tttatgtcat tttacgtctt ttctattgtc tgaatttttt tcagattttt tcgcctattt 180 tttaggcctc taaaagtggc gcacaaagta tcggatcact ccaacgcccc ccgtgttatc 240 ttatgggcca cacttgtttt tcacattgct ttttatacat ttaattttca aggtggaatt 300 tttattttct aaccctattt acatttccct tttataaatc aactgtgaga tttccattgt 360 gatcggaacc atttccgaat tttcccagat ttattaattt ccccagttgc tgaaatmaga 420 gactacagaa gagttcattt agggtttgca acctctatcg cctgccgtag gccggaaaat 480 tagtgtcgtt tctatacgcc ccttaagtta aaaaggtagc agagcgccag tggcgtatag 540 aaacgacact aattttcctg cctacgccag ctgatagagg ttgtaaaccg taaatgaagt 600 cttctgcagt ctctgatttc agaaactcgg gaaatgaata aatctgggaa aattcgaaaa 660 tgaatcagat aaccatgaaa aactcacagt tgatttatga aagggaaatg taaatagggt 720 tagaaaataa aaattccacc ttgaaaatta aatgtataaa aagcaatgtg aaaaacaagt 780 gtggcccata agataacacg gggggcgttg gagtgatccg atactttgtg cgccactttt 840 agaggcctaa aaaataggcg aaaaaatctg aaaaaaattc agacaataga aaagacgtaa 900 aatgacataa atcccaagtt tggtgaattt atgtcatttt ttcgattttt gccacccgaa 960 aaacccatat gaaaaataca cgccgcctta tggaacggcg cggggtgatc ccatacttat 1020 ggaggggact g 1031 // ID ANM4 repbase; DNA; INV; 1885 BP. XX AC AF126012; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Antheraea mylitta mariner transposon Anm4.2 transposase DE pseudogene, complete sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ANM4. XX OS Antheraea mylitta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Saturniidae; Saturniinae; Saturniini; Antheraea. XX RN [1] RP 1-1885 RA Nagaraju J., Prasad D.M. and Nurminsky I.D.; RT "Cloning and characterization of mariner-like elements from RT silkmoths."; RL Unpublished. XX RN [2] RP 1-1885 RA Nagaraju J.; RT "ANM4."; RL Direct Submission to Genbank (03-FEB-1999)Centre for DNA RL Fingerprinting and Diagnostics, CCMB Campus, Uppal Road, RL Hyderabad, Andhra Pradesh 500007, India. XX DR GenBank; AF126012; Positions 1 1885. XX SQ Sequence 1885 BP; 582 A; 396 C; 372 G; 535 T; 0 other; atccccggga aattctcgat caccacggcc atagactgca ccacaatacc gagaagagtc 60 gcaggtgcgt tggcagcttg ttttagtggg aagggtgagg gagaagagga agaagggata 120 aggagggata taggtttcca gcgtggcgtt aaaaatgtaa agaaaattct ttatttttca 180 taacataata ataagtcttt acctatgaaa ttgccgttct gcagtactgg cctctcaaat 240 caaaataatt tttgtttggc taacaattct aaattcaaat taattttatt ttaaaaacat 300 ggaattcttt tttccaaaat tagtcatttg tttattgatt actgttacat tttcatcgca 360 ttttcaatct tttttgtcgt catggatacg ttaaaaattc gaataattat ggaatatgag 420 taccgtcgtg gcagcaacgc ggcacaagag gcacgcaaca tgtcattgac gtttacggag 480 ctaacactac aaaccaacgc acaacttgtt attggtttgc tcgctttccg ttttgggaat 540 ttcgacctga aaaatgaacc tcggggtcgt ccgaagacta tggtcgacaa tgacgaatta 600 aaggcgattg tggaagctga tgatacacaa tctacggcta aattagcagc agctttcgac 660 gttagcgtca aaacaatatt ggtctatttg cgtgaaattg gcaagggtaa aaaagcttta 720 caggatgggt gctccacgaa cttcaatgat tcgccagcgc gaagtacgcg tcgagacgtg 780 ccttgcacta ctcaacagac acacaaatga aggaattttg aaccgcattg ttacctgcga 840 tgaaaaatgg actctgtttg ataactgcaa gcactcagca agttggttgg atcctgttca 900 gcacctaaat aatgccctaa acgagaactt actcctagaa atctgatggt cactgtttgg 960 tggtgtagcg ccggtgtgat tcatcacagc ttcttaccaa acgtggtgag cattactgct 1020 gatatgtact gtgaagaact gaacccaatg atggagaagc tggcacgtct ccaaccagca 1080 ttggtctaaa ggtctctccg ctgttcttgc aagacaacgc acgacctcat actgcacagc 1140 agacggtgtc taaattgcaa gaactggggt tggaagttct ccgtcatccg cccttactca 1200 cctacagaat actacttttt ccagaatttg gacaactcct tggcgggaaa atgaattcaa 1260 tacccgagag gcagacaaaa tgcctttgaa agaatttgtt gcctcccgtt cagctgattt 1320 ctttaagaag gacatcaata aactgccact gcgttggcga agatgcattg attgcttagg 1380 tgatttcttc gattaataca aaaaaaatta catgaataaa attcgtttaa atttttcata 1440 tacaacggca atttcataga taaagaccta tagataatgt tacatgaaac gcaatatgtt 1500 tactaatata ccaaattatg cttatgtgta gtaaaagtaa aaacaataaa tattatgtaa 1560 caacatatta atttagattt ttacatatcc ccattccccc attatgtatc ttttcaactg 1620 ttttccattt tttttttatt gtagcacggc aaaattactg cactatgcct gatgataagc 1680 acagtgcaat ccaaatataa tagccgacca cccgattccc cacactatcc atcgcccggt 1740 cgtccactat ccccccgggg ctgggaaaaa cccggaaatt gccaggggga atttccccaa 1800 aaaccacaac cccccaaaaa aaaaaaaaac ccctttttgg ggttttcccc aatacattgg 1860 tggttgcaac catcaggctg ttgaa 1885 // ID Ginger1-1_BF repbase; DNA; INV; 4680 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Branchiostoma floridae. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4680 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 77-bp long. XX FH Key Location/Qualifiers FT CDS 1802..3436 FT /product="Ginger1-1_BF_1p" FT /translation="MDEQLYSQILHFYTCSDGERKYPADIYNLQPIDRTNA FT KSGFRQVAKPFHAENGALYRGNQEVVVKSRVKSILKAFHDDPATGGHFGRD FT KTLQKVTKRFFWKGMKKDVEAHAHKCQKCFVVNPKVSREAPPLHSISVPSK FT VWSMVGIDLIGPLQETKSGNKYIVAVTDHFSKWSEAKAIPDKSATSVAEFL FT FSVVCRLGCPDVMISDQGREFVNEVIDKVMDHFNTDHRISSAYHPQTNGQR FT ERDNRTLKDSLSKVVNECGDDWDNYIEGVLFAHRTSIHASTKCTPFEAMYG FT RQAKLPVDLKPVDDNEMIEAADTITPDTLQTLSSIREEINASIESNIVAAQ FT KRQKRNYDLRHASQTAIAVGSTVYMKNKLTIHRMGPGIKYRWVGPYTVVES FT LSKGRVKLQNAKGVVLRNTYHGSGLKVFEEGESAASVHAAESASVHAAESV FT SVHAAESVSVHAAESVSVHAAESASVHAAESASVHAAESASVHAADCNVPR FT SCEESVSGCQDLDELLSLVGNASRTSEHQEDNSVPTSAASSVAPNFGII" XX SQ Sequence 4680 BP; 1408 A; 931 C; 1023 G; 1318 T; 0 other; tgtagtaggc cagaatatac ccctggccag tttatacccc gggtataatc tagattaacc 60 cctggggtat attctggcct aggccagatt atacccttta ggccaaagta taccccgagg 120 gtataatcta gcctaggcca gattataccc cggggtatag tctacattat accccatcct 180 ggtgcgcttt gggttacaga ataaaggcgt taaggacaat tgatacagcc tagattcatt 240 tttttattga tgtttacaat gtagtacaag gacaaaatac atgccccgat gaactttttg 300 aatttgacat taaacgttaa tcttacactg cgtttgagtt cagcacgctg caggcttcgt 360 ctgcaaactc tcccacactc cacggcgcca tgaatacagt cctcactatc gacgcggaaa 420 agtctaggtt gtttaaataa tgtgatatct gggagttttg caccgtgtcc gaggaagtca 480 taaccggtca atcactgacc ttgtgattgc ccctgaactt tgaactgaat taatgcacat 540 ggagtgtacg ttgaacacac gcctttatct attagatcta acaaacaatg gcatgagccc 600 tgatgcattt cacatctttt actttatgtg ttttttttta ggttcaatac aggggcactt 660 tgttgcaatt ttcattgact tttttcaacg tttgtcgtta cgtttggatg ttgtgaggca 720 agctgaaaac aacagctgag tgttgccatg gtgttgcatc gtattacagt tgggcgggtg 780 catccaaata tttggaggga tggccagagg gattataatg ttaacagtaa caattacatg 840 catctgtcag ctaaactatt tcatatttac caatatcaaa catcacaaat agctgattat 900 gtgcatttta tgtttaaaaa aaatacatat tcatttgaaa aaatatcaac gataagggat 960 cttccaaaat ttggaatgta ccatactgcg aatgaaaagg tagatgcact acataaatta 1020 ctatcatcaa ggaggttcct tgctattgca aaaattcttg cactgtaatg gctatctaca 1080 tgttatatgt tatgtattaa aggtgctttg tgatacacat ttatgccatt gtgacgtaat 1140 aatcaaaatg attagaaaca aaccctggtg tgaaaatgga acagtccgta gcgttgtcct 1200 ctttccgcgc ctttttgaag atgacacgct ttctgtgact gggcttggga tatcaacaca 1260 cttttaaaat aaggtaagat aaacttagtg tttttagtgg taaatgaaaa gcgaaacatt 1320 tcttttatca aggtttaatg tgatctaagt tctatgacag aacggtaaag atttcagcaa 1380 gttttgtttg ttggatgttt atgtatgaaa gttgtcaatt agggcgggaa aagattactt 1440 ccatatatgg cgaaagcact taagtttaac gtcgcattca ctgtaaattg attttccgtt 1500 ggccatacat agtaaatcat tttagtaaca gatctgtcct tagaattacg tttgttacgg 1560 ccttaagatt tcacatgaag gtgataaaaa tataagttgc acatgaaagt gataagtgac 1620 gcaaaaaaca cctgtcgcaa tcagtacaag agatgcatca tttggggcat tctgaaaata 1680 aaccagctta aatttttata tatcacatct gtacgtatgt cgatttacaa tcatcaatag 1740 cgtgacagtt gcaattctct actaaaaaga atctatcatt catttttttt tagaattcaa 1800 aatggacgag caattatata gccaaatact gcatttctac acatgctccg atggcgagcg 1860 gaaatatcca gctgacatct acaatctaca gcctatcgat cgcacaaacg caaaatcagg 1920 gtttcgtcag gttgcaaaac cattccatgc agaaaatgga gccctgtatc gcggaaacca 1980 ggaggtcgtt gtcaaatcta gggttaaaag cattctgaaa gcattccacg acgaccccgc 2040 tacaggaggc cactttggga gggacaagac acttcaaaaa gtgacaaaac gattcttctg 2100 gaaaggtatg aagaaagatg ttgaagcgca tgcgcataaa tgtcaaaagt gctttgttgt 2160 caacccaaaa gtgagcaggg aggctccgcc cctccattca atctccgtac caagtaaggt 2220 gtggagtatg gtgggtatag atctgatagg ccctctgcaa gaaacaaaaa gtggcaacaa 2280 gtacattgtc gcggtgaccg accacttttc gaagtggtcg gaagccaagg ccattcctga 2340 caagagcgcc acttctgtgg cagagttcct cttctcggtg gtatgccggc tggggtgtcc 2400 tgatgtcatg ataagtgacc aggggcgaga gtttgtaaat gaggttattg ataaagtcat 2460 ggatcacttc aacaccgacc accgtatttc ctcggcgtat catccccaga caaatgggca 2520 gagagaaagg gacaatcgaa cactgaagga ctccctcagt aaggttgtca atgaatgcgg 2580 ggacgattgg gacaattaca tcgaaggagt cctgtttgca caccgcacct ctatccatgc 2640 ctcaacaaaa tgcactccct tcgaagcgat gtacggccga caggcgaaat tgccagttga 2700 cttgaaacca gttgacgata atgagatgat agaggctgcc gacacaatca cccctgacac 2760 actacaaaca ctaagctcaa taagggagga gattaacgcc tccattgaaa gtaacatcgt 2820 ggcagcacaa aagcgacaga aacgcaatta cgatctgcgc catgcaagtc aaacagcaat 2880 tgctgtcgga tctactgtgt atatgaagaa taaactgaca attcatagaa tgggaccagg 2940 gataaaatat cgatgggtag gtccttacac agtcgtggag tcgctgtcta aaggacgcgt 3000 taagctgcaa aatgcaaaag gtgtcgttct ccgaaacacc tatcatggaa gtggactgaa 3060 agtctttgag gaaggcgagt cagccgcgtc agtccatgca gctgagtcag cgtcagtcca 3120 tgcagctgag tcagtgtcag tccatgcagc tgagtcagtg tcagtccatg cagctgagtc 3180 agtgtcagtc catgcagctg agtcagcgtc agtccatgca gctgagtcag cgtcagtcca 3240 tgcagctgag tcagcgtcag tccatgcagc tgactgcaat gtgcctcgta gttgtgaaga 3300 aagtgtgtcc ggctgccaag atctcgatga gctattgagt ctggttggta atgctagccg 3360 tacaagtgag catcaagaag ataactccgt accaacttca gccgcatctt cagttgcgcc 3420 aaactttggt atcatctgaa aggaggagta ctgtaccgat cgaaataagt tgtcactggt 3480 ttgatatatg tttcggcggc gttcataggc ggtcgagttg ggtttgacgt cacgccggcg 3540 ccatcgtgtt tcgaatcttt aaaatcttta ctccatatac atatatagat ataaattttt 3600 tgttacgtct gaaattgctt aaaatcctgc tcaaataatg attgtttaag atatctggat 3660 ccgatcatcg tctgtaagac cagtgcatac tagttatttt acatactaca aacagtttaa 3720 aggagtgttt aactagaatt gttcagcatt ctagtaatac gataagggac cagacagagt 3780 tatgatgtga aaatggatgt gcccagccta ttggaacggc ttgcttgcct gtattcatat 3840 aatcaaggac gctgatataa cgtccttgat ataattggtg gttattcttg caaaaagtta 3900 ggcagatgtt atatcaagtg tgttacttga tggaatattg tttaaggatt ggttgttttt 3960 gaggcgataa tgtgatgaag ataatatgtt tgaactcaat atgttaccta ttcatgacac 4020 atgattattg aatgtggatc tacagttttc aataaaacat ttttgaatgt tccccttcag 4080 aagtcgcgtg attccatttt actccaatga ttttcatctt ttatgcggcc gcggcattaa 4140 ttttgctgtg aaaaggcccg gaattccata ggcacagatt agtctgggga ccaaagctga 4200 atgaagacct gatgattagt attgatattc aaaccttgcc aaatctacga ggaatcacaa 4260 ggtaaacgtc gatttgaata gaacacgaca atagatttga gaagaacact gtaaccatac 4320 atactgcacc agtcgatctg aatctatgca agaactttac acaacatccc aggaataaca 4380 atgttacgtt agggttaaag ctagataggc cccaaaataa tgacctcggg gagaaaagtt 4440 acgtactcta tgtacgggtg aaaatatggc agtgaatggt ctgattctga gagcacatgc 4500 ataaaattgg ttaaaccatg ttcattctac aggtattaaa gcccaaagtc tacccctgcc 4560 ctgatcaaac acccgtagac cagaatgtac cccatgagtt taatctagat tataccccgg 4620 gggttaatct agattatacc cgggtatatt ctggccaggg gtatattcca gcctgctaca 4680 // ID BEL-616_AA-I repbase; DNA; INV; 5901 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-616_AA_; KW BEL-616_AA-LTR; Pao_Bel_Ele86; BEL-616_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5901 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4920-5498] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 366..5873 FT /product="BEL-616_AA-I_1p" FT /translation="MFLRSGKVKKVLFPSTPFSTPTGEFSVLRSVREQKEE FT AEQRQKEKEMAESVVALEQCCHAARSKVVRIKSAILNAEQDPEKFTHHGLK FT LYLKTVDSSYEEYNNFQNRIYLTDPQRREEFEPKFVEFEELYEFVRISLCE FT MIQHYEDIEKAIANEAALEREKQLLAVKFQGNAVAISDRAGSSGVPESVVP FT YRMPPSLLLQQTPLPTFDGKYEHWHKFKARFCDIVDRCTQDSPATKLHYLD FT KALVGEAKGAIDEQTLNDNNYDGAWRILVERYENLSMVIHGHVTRLLNLKQ FT MAKESSVELRTLIDDCTKRVESLEFHKLKMDKMSEAIVITLLASKLDPSTR FT RSWEASCEHGKLPVYKDTIAFLRKHCHVLERCEQNVMPIKSKIVPMKTQPS FT IITSKTHTVMISKTTDGCPVCGSAHSIDACGAFKRLNVDERYMKAKQLGLC FT FSCLLRGHRTVACKSGKTCPTCSKKHHALMHPEEKKPVAVEFPQPPAAEQA FT TSSSHPLTAAKCTIPIEPMQPKQILLATAIVMVYDSNEVAHKCRVLLDSGA FT MANFMSTRMAELLRLRKEGANVPIDGVNGMKTVVKYRVNARVASRTTGFQS FT SLDYLVVPRVTGALPASKIDPHSWQIPKSVELADPHFYEPGRIDMLLGAEL FT FFEVLQKGKIKLAPNLPLLQESQLGWLVSGPVSDSAVIGTVKVCQAGKSEE FT TDEQLCQLLRRFWTIDEFGGDASPRPDDRCELSFQETHSRTKDGRYVVMLP FT FRDNVGELGESRNQAMRRFLSLERRLEGQPELKQMYVDFISEYLALGHCRV FT VTDSECDRPESVYYLPHHCVIKPDSSTTKLRVVFDASAKSSSDLSLNDVME FT IGPTVQESLFNVILRFRLHKFVFTADIPKMYRQVMVHEAHRKYQRILWRED FT KNQPMKELELNTVTYGTAAAPFLATRSVVQLARDEQEDFPAASEAVEKCFY FT VDDVMTGADTLSDARQLQRDLIALLDRGGFQLHKWCANDEALLEDIPADAR FT EKQMNFEDCEINGVIKTLGILWDPSSDEFLFHVKPIRDGSEFPTKRSVLSD FT MSKLFDPHGLLAPVVLIAKLVMQQLWQQNVGWDDAIPKEQFRIWNRFRSEL FT CCLNSLRIDRRITIDDTVAVELHGFADASKVAYGCCIYMRSVKSDGAAEVR FT LICGKSRVAPMKELQRDIKVDATPDEMTIPRLELCAALLLAEQVDKVRETL FT ALTTAKVVLWSDSKIVLNWLKQMKPNTPVFVQNRVVKIRKLFAWHTWHYIS FT TQHNPADLVSRGVFPEDLMRCEEWWFGPSVIPVVEDDECGVVEPEEYAHQI FT VTTLVPERDSVIPKKQLEPYEVILKYSSYRKLQRIFGYVVRFLHNCRLKGN FT KTTRRSGALNSEDYTEAQKAMVTIVQLVVYKEEISCIQANQSVKGKLRNLN FT PIYDDNERLLRVGGRIRNSDLPKDQKHPMILPENNHFTEVLIEALHREHLH FT VGLNGLLAVVRQKFWPVNAKRTIHRVLRKCVVCFRTNPRDVQQYMGDLPSC FT RVTAAQPFARTGIDFAGPFFIKVGRMRAKVKVYVCLFVCMSTKSIHLELVS FT SLTTDGFLAALHRFANRRGNPSELFSDNGTNFRGADRELSELLNLLQSQVL FT DDKVNEFCQPRGIKWSFNPPKAPHQGGIWEANIKCMKSHLCKTLNESYLTY FT EELNTLLIQIEGVLNSRPLVQLTDDPFDYEALSPGHFLVGRELTAVAEPLY FT DDLKETSLSRYQLVQKRMQHFWQRWSNEYVTGLQKRSKWYKDPTLLRTGLL FT VIMKEDNMPPKTWKLGRIVETHPGKDGVVRVVTIRTSNSIYKRPTTQIAVL FT PIDDCTDQAEPKEK" XX SQ Sequence 5901 BP; 1588 A; 1272 C; 1665 G; 1375 T; 1 other; ttttggtccg atcgtgccgg atgtcggatt agtggatgga agcagtttag attctcggtt 60 cgtgcgcgga acgagaaaga atcgacgcga gtttcggcag ttgcgacggt cgaatcgcga 120 agtgaatccg attgcatgcg ttgcatcgga tagtgttttc cgataaacgt ggtcggaaat 180 tgaattctgt tcccgattga gtggtcgaag aaagcttccg gttacttgcg cgttccggaa 240 gtgtgwgctg ctagtcggac ggcagaatag aactttgaaa agtggaggtc aagcctcaac 300 gaagaagaag agcgtgaaaa ttgtgcggta gtgagaagaa gaactaaaag aactattcgc 360 atgtaatgtt tttgcgttct ggtaaagtga agaaagtgtt gtttccgtcg acgccatttt 420 caaccccgac cggtgaattc agtgtgcttc gtagtgtgcg tgagcagaag gaggaagctg 480 aacagcggca gaaagagaag gaaatggcgg aaagtgttgt tgcgttggag cagtgttgcc 540 acgctgcaag gtcgaaagtg gtgcgaataa agtcggccat tttgaatgca gagcaggatc 600 ctgaaaagtt cacgcatcat ggcttgaagc tgtatctgaa gacggtcgat tcttcttatg 660 aggaatacaa caattttcag aatcgtattt acctcaccga tccacagagg agagaagaat 720 tcgagccaaa attcgtcgaa tttgaagagc tgtacgagtt cgttcgaatt tccctctgtg 780 agatgataca gcactacgaa gacatcgaga aggccattgc gaacgaggca gcgttggagc 840 gagagaagca gctgctggct gtgaagtttc aaggtaatgc tgtggcaatc agtgatcgtg 900 ctggatccag tggcgtaccg gaaagtgtcg tgccgtatag aatgccacca tcgttgttgt 960 tgcagcaaac accattgcca acattcgacg gcaaatacga gcactggcac aagttcaagg 1020 cgcgattttg cgatattgtc gaccggtgca cccaggattc cccggcgacg aaactccatt 1080 atttggataa ggcgttggtc ggagaagcga agggggcaat cgatgagcag accctcaacg 1140 ataacaacta cgacggtgca tggagaattc tggtcgaacg ttacgagaac ctctcgatgg 1200 tgattcatgg ccacgtgaca aggttgctga acttgaagca gatggccaag gagtcgtcag 1260 tggaactgcg aacgttgatc gacgactgta cgaagcgtgt ggagtccctg gagttccaca 1320 agctcaagat ggacaagatg tctgaagcca tcgttatcac gttgctagcg tcgaagttgg 1380 atccgagtac ccgaagaagc tgggaagcat cttgtgaaca cggtaagtta cctgtgtata 1440 aggacacgat tgcattcctg cgaaagcatt gccatgttct ggagcggtgt gagcaaaatg 1500 tcatgccaat caagagcaag atcgtgccga tgaagaccca gccatctatc attaccagta 1560 agacgcacac tgtgatgatt tcgaagacga ccgatggatg tcccgtgtgc ggaagtgctc 1620 attcgattga tgcatgtgga gcgttcaaaa ggttgaatgt tgacgagcgg tatatgaagg 1680 cgaagcagct gggattatgc ttcagctgcc tgctacgagg acatcgaacg gtagcctgca 1740 agagcggtaa aacgtgtcca acttgctcga agaaacatca tgcactgatg catccggaag 1800 agaagaagcc tgttgcagtt gaattccctc agccacctgc ggcagagcaa gcaacgagtt 1860 cctcccatcc actgacagca gcaaagtgca cgattccaat cgagccaatg cagccgaagc 1920 agattttgtt ggcgacagct atcgtgatgg tgtatgattc caacgaagta gcgcataagt 1980 gccgtgtgct gctggattcc ggagcgatgg caaacttcat gtcaaccaga atggcggaat 2040 tgttgcgact gcgaaaggag ggtgccaacg tcccaatcga tggagtgaat gggatgaaga 2100 ctgtcgtgaa gtacagagtg aacgctagag tggcatcaag aacaactggt ttccaatcgt 2160 cgttggatta tctggttgtg ccccgagtga ccggtgcatt gccggcttct aagatcgacc 2220 cacacagctg gcagattccc aagtcggtgg aactagcaga tccgcatttc tacgaaccag 2280 gtcgcattga catgctactc ggtgcggagc tgtttttcga agtgctgcaa aaaggtaaga 2340 tcaaattggc cccgaatcta ccattgttgc aggagagtca gcttggatgg ctcgtttctg 2400 gaccggtgtc cgactctgcg gtgatcggaa ccgtgaaggt gtgtcaagct ggaaaatcgg 2460 aggaaaccga cgagcaactg tgccaactgc tgcggcggtt ttggactatc gacgagtttg 2520 gaggtgatgc gtcgccaagg ccagacgata gatgtgaact aagtttccag gagacgcaca 2580 gtagaaccaa ggatggccgt tacgtggtaa tgctgccatt ccgcgataat gttggagagc 2640 tcggagaatc ccggaaccaa gcgatgcgac gatttttgtc tctggagcgt cgcttagaag 2700 gtcagccgga gttgaagcaa atgtacgttg atttcatcag cgagtatctg gcacttggcc 2760 attgccgagt tgtgacggat tccgaatgcg ataggcctga gtctgtctac tatcttccac 2820 accactgcgt gattaaaccg gacagctcaa ctaccaagct gagagtggtg ttcgacgctt 2880 cggcgaagag tagttcagat ttgtcgttga acgatgtgat ggagattggg cccacagtgc 2940 aagagtcact cttcaatgtg atattgagat ttcgcttgca taaattcgtc ttcaccgccg 3000 acatcccgaa aatgtatcgg caggtgatgg ttcacgaggc tcatcgaaag tatcagcgaa 3060 ttctgtggag agaagataaa aatcagccaa tgaaggagtt ggagttaaat actgttactt 3120 acggtacagc tgcagcgcct ttcctggcga cacgatcggt tgtccagttg gctagagacg 3180 agcaagaaga tttcccagcg gcgagcgagg cagttgagaa atgcttttac gtcgatgatg 3240 tgatgactgg tgctgacacg cttagcgatg ctaggcagct ccagcgagat ttgatcgcac 3300 tattggatag aggtggattc cagctccaca aatggtgtgc gaatgatgag gcgttactcg 3360 aggacattcc agctgatgcc agggagaagc agatgaactt cgaagactgc gagatcaacg 3420 gagtgataaa aactcttggc attctttggg atccctccag cgatgaattc ctgttccacg 3480 tgaagccaat tagagatggc tctgaatttc ccacgaagag gtcagtgctg tccgatatgt 3540 caaaactgtt cgatcctcat ggcttactag caccggtggt actcatcgcc aaactggtaa 3600 tgcaacagct atggcaacag aacgtaggat gggatgatgc gattccaaag gagcaatttc 3660 gtatatggaa ccgattcaga tccgagcttt gctgtttgaa ttccctccgg atcgatcggc 3720 ggattactat tgacgatacg gttgctgtgg agctgcatgg atttgctgac gcttccaagg 3780 tggcgtacgg atgttgcatc tatatgagaa gcgtgaagag tgatggtgct gcagaagttc 3840 gactgatttg tggcaagtca cgagttgccc caatgaaaga actacaacga gacatcaagg 3900 tggacgcaac ccctgatgag atgacgatcc cgaggctaga gttatgcgct gcgttgctgc 3960 tggctgaaca agtggataaa gtgcgcgaaa ccttagcatt aacaaccgca aaggtggtgc 4020 tttggtcgga ctcgaaaata gtgttgaact ggttgaaaca aatgaagcca aataccccag 4080 tgttcgtaca aaaccgcgta gtgaaaataa gaaaactttt tgcgtggcac acgtggcact 4140 atatttcgac tcagcacaac ccagcggatt tggtgtcacg tggtgtattc cctgaggatt 4200 taatgcggtg tgaagaatgg tggtttggcc caagtgtgat tcctgttgtt gaagacgatg 4260 aatgtggtgt ggtggaaccg gaagaatatg cccaccaaat tgtgacgact ttggtaccgg 4320 aacgtgattc tgtaataccg aagaaacaac ttgagccgta cgaagtgatc ctgaagtaca 4380 gcagttatcg aaagctgcag cgtattttcg ggtacgtggt tcgtttccta cacaactgcc 4440 gcttgaaggg aaacaagacg acgcgcagaa gtggtgcact gaatagtgaa gactacaccg 4500 aagcacagaa ggcgatggtg accattgtcc agctggtggt ctataaggaa gaaatcagtt 4560 gtatccaagc gaatcaatca gtgaagggaa aactgcgcaa tttgaaccca atctacgacg 4620 acaacgaaag gctgctgcga gttggcggtc gcataaggaa ctccgacctg ccgaaagacc 4680 agaagcaccc gatgattttg ccggagaata accacttcac ggaggtgcta atagaagcgt 4740 tgcatcgtga gcatcttcat gtcggtttga atggactact tgcggtggtc cgacagaaat 4800 tttggccagt gaacgcgaag cgaactattc atcgagtcct gaggaaatgt gtagtttgct 4860 tccggacaaa ccccagagac gtacagcagt atatgggcga tcttccgagc tgccgagtga 4920 ctgctgccca acctttcgcc aggactggaa ttgatttcgc aggaccattc ttcattaagg 4980 ttggacgaat gagagctaag gtgaaggtgt acgtgtgtct tttcgtatgc atgtctacca 5040 aatccataca tctcgaactg gtgagctctt tgacgactga tggtttcctg gcagcacttc 5100 atcgatttgc taaccgacga ggaaatccat ctgagctgtt ctctgacaac ggaactaatt 5160 ttcggggagc agaccgagaa ctttccgaac tgttgaacct gctacagtct caggtgctgg 5220 acgacaaagt gaacgagttc tgccagccca gaggaatcaa atggagcttc aaccctccca 5280 aggctccaca ccagggaggt atttgggaag ctaatatcaa atgcatgaaa tcccatttgt 5340 gcaagacact aaacgagagc tatttgacat atgaagagtt gaacacttta ttgatccaga 5400 tagaaggtgt attgaactcg agacctcttg tgcagctgac ggatgatccc ttcgattacg 5460 aagcgctgag cccaggacat ttcctagttg gacgggaatt gacggcggta gcagaaccgc 5520 tgtacgatga tttgaaggag acgagcttgt cacgatacca gctggtgcaa aaacgaatgc 5580 agcatttttg gcaacggtgg tcgaatgagt acgtgacggg tcttcagaag cgcagcaaat 5640 ggtacaagga ccctacgttg ctgcgaactg gattgctggt gattatgaag gaggacaaca 5700 tgccacccaa gacctggaag ttgggtcgca ttgttgaaac gcatccaggt aaggacggag 5760 ttgtacgtgt ggttacgatc cgcactagca acagcatcta caagcgaccg acgacccaga 5820 tcgctgtgct tcccatcgat gactgcacgg atcaagcaga gcccaaggag aagtgagacc 5880 ttcgcctcac ccgggggagt a 5901 // ID Gypsy-138_AA-I repbase; DNA; INV; 4405 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-138_AA_; KW Gypsy-138_AA-LTR; Gypsy-138_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4405 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1009-1009 (2011). XX DR [2] (Consensus) XX CC Positions [3356-3823] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 230..3859 FT /product="Gypsy-138_AA-I_1p" FT /translation="MTEEDNTKHVVAGPSGSVKIAMNFGSFDNYVAGDDFE FT VYEERMLQHFLLHDIPEGRKVAFLLTHLGMDTYAILKKLLQPVNPSTKTYA FT ELVAALKKHFRPDVNKVSERYRFHQADQKAGQIVSEYVVELKALAEKCEFG FT DFLTEALRDRFVFGIYDSRLRTHLLKQKNLTFEKAVDEALNWELAEKDNKV FT RENVLSAHVVRSGKSYRSRSKSRSKFANSNGGQKNQYGRRRVCEKCGRDHE FT QGKCPAKSWKCFACGRQGRAANVCFSKNSKQNRSGSQDSRRNQTVGTVGAG FT EDLATEIANLRMQLNSVKNETLLVKSKEATETLFVDNRPIEFEIDSGACAT FT VISDSLYRKWFSSRPLFSINDDFSTVTGQGIKVIGGFSADVSKCQRGPSEK FT LALVVIESSKTFRPLLGRTWMDVLWPGWRNKFKESIVSMNRVESTLLDSIR FT LKYPNVICDDNTPIREFEAEIVMEDNVSPIFHVAYSPPFQQRPAIEAELNR FT LCQENILKKVAFSKWASPIVVVPKPNGSLRLCIDCKVTINPYLRSEHYPLP FT RIDDIFAKFANCNFFCVIDLRGAYQQLKVSENSQAYLTINTHIGLFQYVRL FT PFGVATAPSIFQSIMDQILGDIEGCVTYLDDVLIGARTLEECKQVLDKVLE FT RLSRFNVKINTEKSKFFVTSVDYLGHTVSSDGIRPNQSKVTAIVNAPAPTN FT ISELQSYLGLLNYYSKFIPNISSELRVLYRLLRKDVVFSWDSDCEKCFVRS FT KELILKNNVLQLYDPNKPIVVAADASPYGVGAVLSHIVDGEEKPVIFASCT FT LSPAEKNYSQLHREALAIIFAVKRFHKYIYGHRFKLVSDCEALKEIYHPRK FT GTSVISASRLQRWAVILSMYDYEWEYRPSRNLAHADALSRLPVSGGTEIEE FT LSINRLQECPELPLKTCDVAEFTAKDDVLSKVYDYVLYGWPRKIPDSLHYY FT YKLRNSLSCQDNCLFYGDRVVIPDRLQKIVLKMLHTNHDGIVRGKMLGRGL FT FWWKGMQKDIEEHFKQCSVCDQRRSVPKEKVTSKWTSCTRPMERIHIDFFE FT FEGKMVLLIIDSFTKFIEARVMNRTNAEQVNERLEDFFKFVGLPEQMVSDN FT GPPFSSFEFVQYWESRNVKIIKSPVYHPQSNGAAERGVQTVKNFLKKRLLE FT KQGNRELLSRKLNDILAWYNSTPSTATEKSPSEMIFRFKPRTVLTSINPKT FT " XX SQ Sequence 4405 BP; 1392 A; 745 C; 1097 G; 1170 T; 1 other; aaagttggcg acgaggtaaa gtggagagtg tagtgacgat cgacgaaagt gtacctaccg 60 catggcgaaa atcacgtgga gcgcagaaac ggagtgaggt taagtgctac cgtttgttca 120 gtgcacaata ggctgatata caaaagcgta tccagttttg aagagcacgt ggtatcgcga 180 aacgaagtgt gttattgcta tcgtttgatc agtgtggaac cttccaaaga tgaccgagga 240 agacaatacc aaacacgtgg tggccggacc gtccggtagt gttaaaatcg cgatgaattt 300 cggatcgttt gataactacg tcgcgggtga tgactttgaa gtgtatgagg aaagaatgct 360 tcaacacttt cttctgcatg acattccgga aggcagaaaa gtggcgtttc ttttaacgca 420 cctcggaatg gacacgtatg ccattctgaa gaagctattg cagcccgtga atcctagcac 480 taaaacgtac gcggaattag tagcagcgct caaaaagcac tttaggccgg atgtgaataa 540 ggttagtgag cgatatcgat tccatcaggc agatcagaaa gcgggtcaaa tagtgtcgga 600 atatgtggtc gagttaaaag ctctcgcgga gaagtgtgaa ttcggtgatt ttttgacgga 660 agccctccga gacaggttcg tgttcggtat atacgacagt aggctccgta cccacctgct 720 caagcagaaa aacctaactt tcgaaaaagc ggtggatgaa gcgttgaact gggaattagc 780 cgaaaaggat aataaagtgc gggaaaacgt gttgagtgcg cacgtggtgc gatctggtaa 840 gtcataccgg agtcgtagca aaagtcgttc gaagttcgcg aatagcaatg gcggtcagaa 900 aaatcaatat ggccggcgaa gagtgtgtga aaagtgtggt agagatcacg aacaagggaa 960 gtgtccagcg aaaagttgga aatgttttgc gtgtggcagg caaggccgtg ctgcaaatgt 1020 gtgcttttcg aaaaattcga agcaaaatcg tagtggaagc caggattcga gaagaaacca 1080 gaccgtaggc acggttggtg ctggtgaaga tttggcaaca gaaatagcaa atcttcggat 1140 gcagttgaat tcggtgaaga acgagacctt gttagtgaag agcaaggaag caaccgaaac 1200 gttatttgtg gacaaccgac cgattgagtt tgaaattgac agtggggcat gtgctacagt 1260 gattagtgac agtttgtacc gaaagtggtt ctcgagtcgt ccgttgttca gtataaacga 1320 tgatttttca actgtaacgg gacaaggtat taaagtgatt gggggattca gtgcagatgt 1380 gtccaagtgt cagcgtggtc cgagtgaaaa acttgcgcta gtagtgatcg aaagcagcaa 1440 aaccttcaga ccgcttctgg ggcgcacgtg gatggacgtc ctctggcccg gatggaggaa 1500 caagttcaag gaaagtatcg tgagtatgaa ccgtgttgaa agtaccctcc ttgattcgat 1560 tcgtttgaaa taccccaatg tgatttgtga cgataacact ccaattcgtg agtttgaggc 1620 ggagattgtc atggaagaca acgttagccc tattttccac gttgcttatt caccaccctt 1680 tcaacaacga ccagcgattg aggctgagtt gaatagattg tgtcaggaaa acattctgaa 1740 gaaagtcgcg tttagtaagt gggcttctcc catagttgtt gtgcctaagc ccaatggcag 1800 cttgagactg tgcattgact gtaaggtaac cataaatcct tatttgcgtt cggaacatta 1860 cccgttaccg agaatcgacg acatttttgc caagtttgca aactgtaatt tcttttgcgt 1920 tatcgatctt cgtggtgcgt atcagcagtt gaaagtttcg gaaaactctc aagcgtatct 1980 aacaattaat acgcacatag gactttttca gtatgtgcga ttgccctttg gcgtggcgac 2040 tgctccatcg atcttccaaa gtatcatgga tcagatctta ggagacatag aaggatgtgt 2100 aacgtattta gacgacgtgc tgataggagc aagaacatta gaagagtgca aacaagtttt 2160 agacaaggtg ctagaacgtt taagcagatt caacgtgaaa atcaataccg aaaagagcaa 2220 gttttttgta acgtctgtgg actatttggg tcacacagtt agtagcgatg gaattcgtcc 2280 aaatcagtcc aaagtaactg caattgtgaa tgcaccagct cctacaaaca ttagtgaatt 2340 acaatcgtac ttgggactgt tgaattacta ttcaaagttc attccgaaca tctcttcaga 2400 gttgagagtt ctgtacagac tattgagaaa agatgtagtg tttagttggg atagtgattg 2460 tgagaagtgt tttgtgagaa gcaaagagct gattttgaag aacaatgtgt tgcaactcta 2520 cgatccaaac aaaccaatcg tggttgcagc ggatgccagc ccatatggcg taggcgctgt 2580 gttatctcat attgtagatg gagaagaaaa accagtcatt tttgcctcgt gtacattatc 2640 gcctgctgag aaaaattact cgcagcttca tagggaagct ctagccataa tatttgctgt 2700 gaagcgtttc cacaaataca tttatggaca tcgttttaaa ctcgtgtcag attgtgaagc 2760 gttgaaggaa atttatcatc ctcgtaaagg aacatcagtg atttcagcat ctagacttca 2820 acgttgggca gtgattttgt ctatgtacga ttacgaatgg gaatacaggc caagtaggaa 2880 tttggctcat gcagatgcat tatccagatt acctgtgtca ggtggcacag aaattgaaga 2940 actgtccatc aatagattgc aagaatgtcc agaattaccg ttgaaaactt gtgatgtagc 3000 ggaattcaca gcgaaagatg atgttctttc gaaggtatat gattatgttc tgtatggatg 3060 gcctcgaaaa atcccagatt ctttgcacta ttattacaaa ttgcgtaatt cgttaagttg 3120 tcaagataat tgcttgttct atggtgatcg agtagtgatt ccagataggt tgcagaaaat 3180 tgtgcttaaa atgcttcaca ctaatcatga tggaatagtt agaggaaaga tgttaggtag 3240 aggattattt tggtggaaag gtatgcaaaa ggacattgaa gagcatttca aacagtgttc 3300 agtgtgtgat cagagacgaa gcgttcctaa agaaaaggta acatctaagt ggacaagttg 3360 tacgcgtcca atggaaagga tccatataga cttttttgag tttgaaggca agatggtgtt 3420 actgattata gattcattca caaagttcat tgaagcaaga gtgatgaata gaaccaatgc 3480 agaacaggtt aacgaacggt tagaagattt tttcaagttt gtgggattgc cagagcaaat 3540 ggtatccgat aacgggcctc cgttcagttc gtttgagttt gtgcaatatt gggaatcgcg 3600 taacgttaag attatcaaat ctccagtgta ccacccacaa tcgaatggtg ctgctgaacg 3660 aggtgttcag accgtgaaga actttttgaa aaagagattg ttagagaaac aaggcaatag 3720 agaactatta tcaaggaaac tgaatgacat tttagcttgg tataatagta ctccttcgac 3780 agctactgag aagtcaccca gtgagatgat tttcagattc aaacccagaa ctgtattgac 3840 aagtattaac ccgaagacaa gmtcatgtga gacagggaca gataaggtca atcgtaagag 3900 cgtaacgttt gatgaatcca aaaatgcaga atatgtttat gatgtcaaaa agaaaaattg 3960 tttgaaagaa aattgtaacc ccaaccaaga ttttaaaaaa ggggaaaagg ttatgtaccg 4020 aaaccatttt aaagaattag tgagatggat tccagcaatc attcttgaga gaatcggaaa 4080 atttttgtat aaaataaagt tgttggagaa cgatagtatt aaaaatgttc atgtaaacca 4140 gttaagatat ccaagctcaa ttgtaaaaac gaatattgca aaagtgccag aacgatcatc 4200 agaaaatata actattaagc gtaggcgtag tgaatcgtta ggtgaatcac cacctagaaa 4260 atttcaagct gatgagagac aaaatcgatg cggatcaaat gaatcaattt tcctcagaag 4320 atcagtaaga accagaaagt taccagagtg gtttagattt gaagattatg aaaggtagta 4380 gtaagaaatc aaaaaagggg aaagt 4405 // ID Gypsy-242_AA-I repbase; DNA; INV; 4381 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-242_AA_; KW Gypsy-242_AA-LTR; Gypsy-242_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4381 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1085-1085 (2011). XX DR [1] (Consensus) XX CC Positions [3269-3736] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 82..2262 FT /product="Gypsy-242_AA-I_1p" FT /translation="MSDNDLKDAILRLTELIVQQQERITALERPDPTDGSE FT KIIESLASGIQEFQFDPDDGLFFDAWYSRYEDVFKEDGKRLDDAAKVRLLL FT RKLSTTVHERYVDSILPKHPREFSLDETVGKLKKLFGRRKSIFHARYQCLQ FT YFKSEADDFTSYAAKINKHCEAFQLSKLSNDQFKALQFICGLQSPRDADIR FT ARLISKLEAEESAPARPDGPSTITLENLVEECHRIVNLKHDTQMVENKDSK FT TINAVTRNHKDSNPKKAKTPKTPCWKCGEMHYTRFCPFSSHTCTKCKQKGH FT KEGYCSNDKTKPKPFHFKPKENLKTKGIFSIRNVESKRKFVTVELNGVPVK FT LQHDSASDITVISEENWIKIGQPATRRTAEAAITASGDNLKLLAEFHTDIT FT IGNVSKSGRIFISNNPELNVLGIETMDLFDLWSIPINSLVNAIQQRPEDFT FT KQLKQKFPEVFKSTLGRCTKAKIKLYPKQDARPVYCPKRPVAYAALPKVEA FT ELQRLQDKGIISPVQFSDWAAPIVVVRKADNVSVRVCGDYSTGLNDALESD FT RHPLPHPDDLFAELSGARYFTHLDLSDAYLQVEVEESSRKLLTVNTHRGLF FT QYNRLPPGVKSAPGAFQRIIDSMVAGIPGVKPYLDDILIAGKTKEEHDRSL FT YAVLERIREYGFHLNLEKCKFAVSQIEFLGHIVDKDGIRPDPSKTLAISQM FT PPPKDVQQLRSYLGAINYYGRFVKQMK" FT CDS 3050..4369 FT /product="Gypsy-242_AA-I_2p" FT /translation="MFGDRIVIPEKFRKQVLRQLHKGHPGMDRMKSLARSY FT IYWPNVDEDVEDFVRECRSCAQAAKAPTKTTLESWPVCTQPWQRVHIDYAG FT PIGGYYYLVIVDAYSKWPEIFRTRNITATATLDILRETFARYGNPETLVSD FT NGTQFTSEKFQQFVQENGIDHMRTAPYHPQSNGQAERFVDSLKRGLKKLSE FT GESTPTLQHLQTFLFVYRNTPNKCSPQTKTPAELFLGRPARTTLDLLKKPV FT RTPTSSNDQQNLQFNRRHGAIKREFQPGDLVYAEYHSNNKKSWIPGRIVER FT KGSVNYNVFLHLGSREKLIRSHTNQLRERYEAKEPSVAQPVNLPWQILLQE FT HQDYTHIETDDLEEETFPPGPDVVPVPEVSATTSANDDRLRCNTPELVHPE FT AIAVAEPDEIQPDEATGQEPSVDTDVEPRPLRSKKLPAWLAPYDIF" XX SQ Sequence 4381 BP; 1251 A; 1159 C; 1030 G; 931 T; 10 other; atattttggc gacgagaagt acgaaccgaa gtcaacctca agtctctcga cggaacatct 60 atcgaagtac cgtaagaaga aatgtccgac aacgatctca aggacgctat tctacgtctg 120 acggagctaa tcgtgcaaca gcaggagcgg attactgcac tggaacgacc cgatccaacc 180 gacggaagtg agaaaatcat agaatcgctc gcctccggga ttcaagagtt ccagttcgat 240 cctgacgacg gccttttctt tgatgcatgg tattctcgat acgaagatgt ttttaaagag 300 gacggaaaac gcctggacga tgctgccaag gtgaggttgc ttctccgaaa actcagtacg 360 acggtccatg agcgatacgt ggacagcatc cttccgaagc acccaaggga attcagccta 420 gatgaaaccg ttggcaaact gaagaagctg ttcggacgtc ggaagtcaat tttccatgcg 480 aggtaccagt gtttgcagta cttcaagagt gaagcagacg attttacctc gtatgcggcg 540 aagattaaca aacactgcga agcgttccag ctgtcaaagc tatctaacga ccagttcaag 600 gctctccagt tcatttgcgg cctgcaatca ccacgcgatg ccgatattcg tgccaggctg 660 atctccaagc tagaagctga agaatcagcc ccggcgcggc cagatggtcc aagtactatc 720 actttggaaa atctggtgga ggagtgtcat cgtattgtta acctgaagca cgatactcaa 780 atggtcgaga acaaggatag caagaccatc aacgccgtca ctcggaacca taaagattca 840 aacccgaaga aagccaagac ccccaagacg ccttgctgga aatgcggtga aatgcactac 900 actcgctttt gcccgttctc cagtcacacc tgcaccaagt gcaagcagaa aggccacaaa 960 gaagggtact gctcaaacga caaaaccaaa ccaaagccat tccacttcaa gccaaaggaa 1020 aatctgaaga caaaaggaat tttctcaatc cggaacgtgg aaagcaagcg caaattcgtc 1080 acagtggaac tcaacggagt accggtcaag cttcaacacg attcggcatc ggacatcaca 1140 gtcatttcgg aagaaaactg gatcaaaata ggacaaccag cgactcggag aactgccgaa 1200 gcagcaatca ccgcctctgg agataatctc aagcttctag cagaatttca caccgacatc 1260 accatcggca acgtgagcaa gtcagggcgc attttcattt caaacaatcc tgaattgaac 1320 gtgctgggaa ttgaaaccat ggacctgttc gatctttggt ccattccaat caacagtctt 1380 gtcaatgcaa tccagcagcg tcctgaagat tttacaaagc agctcaagca gaagtttcca 1440 gaagtgttca agagtacgct gggccgatgc acgaaagcga aaattaagct gtacccgaag 1500 caagacgctc gtccggtgta ctgtccgaag cgaccggtgg cctacgcggc acttccgaag 1560 gtggaagcag aactccagcg gctccaggac aaaggtataa tttctccagt gcaattctcg 1620 gactgggcgg caccgatagt cgtcgtacgg aaggcggata acgtctctgt ccgtgtgtgc 1680 ggtgactatt ccactggctt gaacgacgct ctcgaatctg accggcaccc tcttccgcat 1740 cctgatgatc tgttcgctga actatccgga gcacggtatt tcacccacct ggacttgtcm 1800 gatgcctatc tgcaagtcga agtggaggaa tcatcaagaa aactgcttac ggttaacaca 1860 catcgtggct tgttccagta taaccgcctg ccgccaggag tcaagtcagc cccaggagct 1920 ttccaacgca tcatcgacag catggttgcc ggcatacctg gagtgaagcc ttacctggac 1980 gatattctca tcgctggcaa gaccaaggag gaacacgacc gtagcctgta tgctgtcctt 2040 gagcgaattc gtgaatatgg ctttcacctc aacctcgaaa aatgcaagtt cgccgtttcc 2100 cagatcgagt tcctcgggca catagtggac aaagacggta ttcgcccaga tccatcgaag 2160 acgttagcga tttcccagat gccaccaccg aaggacgttc agcagctccg ttcttacctg 2220 ggagctatca actattacgg acgattcgts aagcagatga agmagctacg ggcacccctc 2280 gacaatctgc tcaagaagga cgcccgctgg aattggacat ccgagtgcca acaatcattc 2340 gacaaattca aggccattct ctgctcgaat ctgctcctga cccacttcga cccagccaag 2400 gagatcattg tcgcggcgga cgcatccaag tatggacttg gggccgtaat catgcaccgt 2460 ttcccgaccg gcgaggtcaa ggctatcgca catgcatcaa gatctctgac gtcagcagaa 2520 gccaactacg gacagataga gaaagaagca ctcgccctgg tttttggagt cactcgtttt 2580 cacaaaatgc tstacggacg cagattcatc ctccagacgg accatcagcc gctgatcaag 2640 gttttcggat ccaagaaggg gatwccggtg tatactgcca acagactgca acgttgggcc 2700 ctsactctgc tcctgtacga cttcgacatc magcacattt cgactgacaa gttcggttgc 2760 gcagatgtcc tgtcaagact gatgtccacc aaacccgacc agatgaagat tacgttctcg 2820 ctgcmgtgca cgtggaaacc gaagtcaaag ctgttctcgc agacacggtg agcaaactcc 2880 ccgtgacgct cmagatgata gttgctgaaa cgcggaaaga tcaagttcts cagcaagtta 2940 tgcaattcct gaaagaaggt tggcccgtaa gtagtaagaa gattacgacc ctgctgtgag 3000 aaaattcaac gcccgacgcg atggattgca gatagtcgac aactgtttga tgtttggtga 3060 ccgcatcgtc attccagaaa agtttcgcaa gcaagtactt cgtcagctcc acaagggaca 3120 ccctggcatg gaccgaatga aatcactggc aagaagctac atttactggc caaacgttga 3180 cgaggacgtt gaagactttg ttcgtgagtg cagatcctgt gctcaagcag cgaaggctcc 3240 gacgaagact actctggaat cctggcccgt gtgcacccaa ccatggcaac gtgtacacat 3300 agattacgca ggtccaatcg gcggatacta ttacttggtt atagtggacg catattccaa 3360 gtggccagaa atttttcgaa ctcgtaacat aaccgcaact gcaacgttgg atatactacg 3420 tgaaaccttc gctcgctacg ggaatccgga aaccctagtc tcggataacg gaacccaatt 3480 cactagcgaa aaattccagc agttcgtcca agagaacgga atcgatcaca tgcgtactgc 3540 tccataccat ccccaatcga atggtcaagc agagcgtttt gtcgattctc tcaagcgcgg 3600 actcaagaag ttaagtgaag gggaatctac gccaacacta cagcatcttc aaaccttcct 3660 gtttgtatac cgcaatacac cgaataagtg cagtccgcag acgaagactc cagcagaact 3720 gtttcttggt agaccagccc ggacgacttt ggacctcttg aagaagcccg ttcgaacacc 3780 tacgagcagt aatgatcaac agaatctgca gttcaacaga agacacggtg ctatcaaacg 3840 tgagtttcaa cctggagatc tggtgtacgc cgagtaccac agcaacaaca agaaatcctg 3900 gattcctgga cggattgtag agcggaaagg ttcagtcaac tacaacgtgt tcctgcacct 3960 cggatcccgt gagaagctta tacggtcaca cactaatcag ctacgtgaac gatatgaagc 4020 caaagaacca tccgttgctc aaccagttaa tcttccatgg caaatcctcc tacaagaaca 4080 tcaagattac acccatatcg aaacagatga cctagaagaa gaaaccttcc ctcctggacc 4140 cgatgtcgta ccagttcctg aagtttcagc aaccacttca gcaaatgatg accgcttacg 4200 atgtaataca cctgaactgg tccaccctga agcgatagct gttgccgaac ctgatgagat 4260 ccaaccagat gaagcgactg gacaagagcc atccgttgat accgatgttg aaccacgacc 4320 gcttcgctcc aagaagttgc cagcttggtt ggccccatac gacatattct aaaaggggag 4380 a 4381 // ID Gypsy-618_AA-LTR repbase; DNA; INV; 315 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-618_AA_; KW Ty3_gypsy_Ele27; Gypsy-618_AA-I; Gypsy-618_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-315 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 315 BP; 71 A; 66 C; 75 G; 103 T; 0 other; tgttggagtg ctaacggaat atgcggggtg cccaaccaag cacgccgctg agctcgagtt 60 ggcagcgtga gctaccacac tatgatgcca tcgttcccgt atgtatgttc ctgtacctat 120 aaatacatcc taagccatat gtgtcgtttt gtattgtatt gtgttacgta atagtcagtt 180 gaataaagat gtcgttttta gaaggtttta ttgtaacgac gcgcgtttta atttgttaaa 240 tatccgcgcc tatttattgt ggtttcgcgc ctagtattcg cgttcgcgag ttcggccacc 300 ttaggtccga aatca 315 // ID ECORI_TB repbase; DNA; INV; 385 BP. XX AC AF093063; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Trichogramma brassicae EcoRI satellite monomer, complete DE sequence. XX KW SAT; Satellite; Simple Repeat; ECORI_TB; EcoRI; satellite repeat; KW tandem repeat. XX OS Trichogramma brassicae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Trichogrammatidae; Trichogramma. XX RN [1] RA Landais I., Chavigny P., Castagnone C., Pizzol J., Abad P. RA and Vanlerberghe-Masutti F.; RT "Characterization of a highly conserved satellite DNA from the RT parasitoid wasp Trichogramma brassicae."; RL Gene 255(1), 65-73 (2000). XX DR Genbank; AF093063; Positions 1 385. XX SQ Sequence 385 BP; 118 A; 74 C; 64 G; 129 T; 0 other; gaattccaag tgtcgtgtaa aaatttcaaa tcgatcccgc aaaccctcga aaagttatac 60 ccgtttaaag tcactttttg gcccaaaaat gccacattaa taaggttatc tttggttcta 120 acggtcggat tgactcgaaa atttttatgc acgtctagaa ttggataatt tatgacctta 180 caacttgatc gaaaggttac aaataaaact ctgaaaactt ttcacggtca tcgtgacttt 240 tcgaaataaa agtaatttca tcttttgcgg tcttcgtgac acttccggct ggtgatacaa 300 cttttgtccg gtctagaatt ttttgaatat atcttccgct ctaatggtcc gattgactta 360 aaaattggtt tacagacaca aaatt 385 // ID BEL-650_AA-I repbase; DNA; INV; 6322 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-650_AA_; KW BEL-650_AA-LTR; Pao_Bel_Ele200; BEL-650_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6322 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5379-5936] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1188..6320 FT /product="BEL-650_AA-I_1p" FT /translation="MLTSPHSAFVPAHMSTPQAQSTPPSVHTVSSRLSPIT FT SSMIPTSAPIAASMSAASDVARRQLFGGPNSQQLAARHVVPRELPAFTGDP FT VEWPLFLSCYQNTTDMCGYSDGENLMRLQRCLKGNALEAVRSHLMHPSSVP FT MIIDTLKTLYGRPEQIINALLTKLRATPTPKPENLESLIGFGLACKNLCSH FT LQAAGLQEHLSNPLLLQELVSKLPATLKLNWSLFKRQQANANLTTFGCYMD FT QIVTAASEVTFINETDGHRSKQEKPKTKEKMFVNTHASEEKKASYSITDRK FT ENKPKPCVACQKDGHKIKDCHVFQKMTLGDRWRLVQERYLCRRCLGSHGKF FT PCKASFCGENGCEDRHHKLLHPGNPQLPRSSEPPRTTSTVTVHRQFKQPVL FT FRILPVSLHWNGKTIRTFAFLDDGSSRTLVESRVAEELGVTGDVHPLCLQW FT TSGIERMEDESQLIQLQISGAMNTTRFDLKEVHTVKSLNLPVQSLDFSELS FT RQFPHLQGLPVQSYTGAVPTILIGLDNSFVMATRKSREGVKGSPIAAKTRL FT GWAVYGSTSPDAERSVHPLFHISVRSPDQELHDLVKEFFAMESVGTISSTM FT ESDEDRRARKILEATTVRTESGRFQTGLLWKYDDVEFPNSRPMAEKRFRCL FT ERRLASKPEPYDNVKTQISQYQEKGYAHKASEEELASFDPKRTWFLPLNVV FT TNPRKPNKVRLVWDAAAKVQGQSFNSALLAGPDFLAPLQAVLSSFRQFEVA FT ISADICEMFHQILIRPEDRSAQLFPWRDEPSKPFEVFVMDVATFGSTCSPC FT AAQFVKNRNAEEYSEQFPLAAEAIIKHHYVDDFLCSVHSEQEAVELAEQVK FT LVHSKAGFVIRNWLSNSKSVLTRVGEQQAVPEKKTWTDRSVANERVLGMIW FT KPEPDVFVFQGVFREEIQALLLDDTVPTKREVLRVVMSIFDPIGLVAVYVV FT HGKVLVQHIWRSGIGWDERIPDGLLEQWKRWIKLLQQLDHVEIPRCYFPGY FT SPEHYRAVELHIFVDASEEAFAAVAYFRIVEGSRVRCSLVSAKTKVAPLKL FT LSIPRLELSAAVLGARLAKSVMENHTIPISHKVFWSDSCTFLSWLQADPRK FT YRQFVAFRLAEIRELTRVNEWRWVPSRLNVADEATKWGKGPEITSDSRWLL FT APKFLYQHPDLWPQQTIKEHESLEELRPIHLHCDIGKEQLLEFTRFSKWER FT LIRAVAYASRFLNNLRCKIKKQPLNNTEWLSRDELLKAETFVCKQIQREAF FT GDEFAILQGNKDRPVAEQMKLEKTSKIIKLSPFLDEQGVIRMDGRIAGAQQ FT VSFDFKFPIILPKGHCGTKLIVDWYHRQYKHCNSETAVNEMRQRFHVSEMR FT VAFKQAGKQCQWCKIYKAVPEIPRMAPLPQARMASHVRPFTYVGVDYFGPM FT LVKQGRSEVKRWVALFTCLTIRAVHLEVVHNLTTESCKMAIRRFIARRGAP FT REIFSDRGTNFVGANRDLKMELRQINTNLASTFTNTDTQWRFNPPSTPHMG FT GSWERMVRSVKSALASLSTGGKPDDETLRTLLAEAESIVNSRPLTYLPIDS FT EEQEALTPNCFLMLSTSGVNQPAREPIEERTMARCNWDLCSQLLDKFWTRW FT IKEYLPTITKRTKWFKDCKPVQAGDLVVVVEDRLRNGWLRGRVLRVFPGSD FT GRVRNAEVTTASAGVLVRSVAKLAVLEVGGTVEVNFDQYGSG" XX SQ Sequence 6322 BP; 1622 A; 1658 C; 1663 G; 1379 T; 0 other; aactttatag atttctaacg agaaatctaa cggagatgcc gggttccgcg accattcctg 60 gcagtgggat gccaatgcac caacagcatc cagaaggtac cgatctgcag agggaaagcg 120 atcctaaccc caatgttgag gttatgactg tgcctgctgc atcgatcgac acccgttcca 180 gaaggagccg atcatcaaga gctagctctc gcgcgttgcg tgcttccctc gatctgcagc 240 ggctcgaaga ggaaaaacgt ctcgagaagg aattcatcgc caggaagtac gagatcctcc 300 aggcccagct tgaagatgga gaaggtagcg gcagtagtgg cagaacacgt cgatcctccc 360 atggcagttc acagagaacg cacgattggg tacagagcca attagtcccg attaccagta 420 ccaccaccgg accaacaacg attcccgtgg ttactcaatc ccggacaggc actattccga 480 aggatgtttc cccagtagtg gttccagtgt caagcagtgc catgatcatg gacgagtcga 540 atgccatcga catgcgatct ttgtccatca acacgaccca gtggttccag cggtgccaca 600 gatgcagcaa actgcaggct catcgtttgc tccgacacgt catcggcttg cacatctgca 660 ggcaaggaac aacgagcaag cttcagcgat gaataccagc accgactgtg atggtgcagc 720 aggtggcttc acaacaattt cgacggatca tcatgcactg ccgaccgcgg tgaatttctc 780 tgtctcgcca acacgtacac cagaggcttt gatggagcag cttagcgagt tgcagaaaca 840 tgcggacgat cttcaacgac agctgatgag taggacagac cgccaaactc aggattctgt 900 tcggcatgca gatagtgctc ggcaatttca gccaacccag tcatcgacag tgagcggtat 960 cgccgaatac atcatcaagc caccgaatgt gccaaggcca gcgcgcatcg ccgcttacga 1020 atttccgatc agcgacggta atatctgcgt caccagttcg gtaagtcaaa cgtgctccaa 1080 tactatttca acattcgctc cgcaattaaa tgctgtgagc tcgtcaattc cttatcgact 1140 cgagctgttg atcgtataaa taaaacacct cctccggctg ccgggccatg cttacttccc 1200 cccattctgc attcgttccg gcgcatatga gtactcctca agcccaatct actcccccct 1260 ccgtgcacac agtctcgtct cgattgtctc caatcacttc gtcgatgatt cccacatctg 1320 ctcccatcgc cgcttccatg tctgctgcca gtgatgttgc tcggcggcag ttatttggtg 1380 gaccaaactc acaacaactc gctgccagac acgtcgtgcc cagagagctt cctgctttca 1440 ccggtgatcc agtcgagtgg ccgttgttcc tgagctgcta ccaaaatacg accgacatgt 1500 gtggctattc cgatggtgaa aatcttatgc gactgcagcg gtgcctaaaa ggaaacgcat 1560 tggaggcagt acgaagtcat ctgatgcatc cttcgtctgt accgatgatc atcgacacgt 1620 tgaagacgct gtacggacgc ccagagcaga ttatcaacgc tctcctcacc aaactacgtg 1680 cgacaccaac gcccaaaccg gagaatttgg aaagcctgat cggattcggt ttggcgtgca 1740 aaaatctgtg cagccatcta caagcagcgg gtcttcagga acacctttcg aaccccttgt 1800 tgcttcagga gttggtgagt aagcttccag ccacgctcaa gctaaattgg tcacttttca 1860 agcgccagca agccaacgcc aatctcacca cattcggttg ctacatggat cagattgtga 1920 cggctgcaag tgaggtgacg tttatcaacg agacggatgg gcatcgtagc aagcaggaaa 1980 aaccgaaaac gaaggagaag atgttcgtga atacgcacgc ttcagaagaa aagaaggcgt 2040 cgtacagcat taccgatcgc aaggagaaca agccgaagcc ctgtgttgcc tgccagaagg 2100 acggccacaa aatcaaggac tgccatgtct tccaaaagat gaccctcggt gatcgctgga 2160 gactcgttca ggagcgttac ctgtgtcgtc ggtgcctggg ttctcacgga aagttccctt 2220 gcaaagcgag cttctgtgga gagaacggct gtgaggatcg ccatcacaag ctccttcatc 2280 caggaaatcc acagctaccc cgatctagtg aacctccaag aactacaagc acggtaacag 2340 ttcatcgtca gttcaagcag cccgttcttt tccggatact tccggtgtcg ctacactgga 2400 atggcaagac catccgcaca tttgccttcc tcgacgacgg ttcatctcga acactagtgg 2460 aatcgagggt agctgaagaa cttggagtga ctggcgacgt tcatccactc tgtctacagt 2520 ggactagtgg aattgaacga atggaagacg aatcgcagct gatacagttg cagatttcag 2580 gagctatgaa caccacgagg ttcgatttga aggaagtgca caccgttaag agcctcaacc 2640 tcccggtaca atcattggac ttcagcgagc tttccagaca gttcccgcac ctgcaaggct 2700 taccagttca gagctacacc ggcgctgttc caacgattct gatcggcttg gataactcgt 2760 tcgtgatggc tacacggaag agtagagaag gagtaaaagg tagcccaatc gcagccaaaa 2820 ccaggctagg ctgggcggtc tacggaagca cgtcccccga tgctgaacgg tccgtgcatc 2880 cgttgttcca catttctgtc cgttcgcccg atcaagaact ccacgacctt gttaaggagt 2940 ttttcgctat ggaaagtgtt ggcacaatat catctacgat ggaaagcgac gaggatcgtc 3000 gcgcaaggaa aattctcgaa gccactacag ttcgtacgga gtcaggcagg ttccaaacag 3060 gattgttgtg gaagtacgac gacgtggaat tcccgaatag caggcccatg gcggaaaagc 3120 gattccgctg tctcgagcgt cgtttagctt caaagcctga accgtacgac aacgtgaaga 3180 cccagataag ccagtaccaa gaaaaggggt acgctcataa ggctagtgaa gaggaattag 3240 ccagcttcga tcccaaacgg acctggtttc taccgttgaa cgtcgtaacc aacccaagga 3300 agccgaacaa ggtgcgacta gtgtgggacg cagctgcaaa ggttcaaggc caatcgttca 3360 actccgccct tctcgctgga ccagattttc tcgccccgct tcaagctgtt ttgtcttcat 3420 ttcgtcagtt cgaggtggcc atcagtgccg atatctgcga gatgttccac cagatcctaa 3480 tccggccgga agaccgatca gctcagctgt tcccgtggcg cgacgaacca tccaagccat 3540 tcgaggtgtt cgtcatggac gtagcgacgt tcggttccac ctgttcaccc tgcgctgccc 3600 aattcgtaaa aaaccggaac gccgaagagt attcggaaca gtttccactg gcggcagagg 3660 caattataaa acatcactac gtggacgatt ttttgtgcag cgttcactcg gaacaggaag 3720 cggttgaact tgcagagcag gtgaagcttg tgcactccaa agccggattc gtcattagga 3780 attggctgtc gaactccaaa tccgttctaa cacgggtcgg ggagcagcaa gcagtgccgg 3840 agaagaaaac gtggaccgac agatcggttg cgaacgagcg ggtcctcggt atgatctgga 3900 agccggaacc ggatgtcttc gttttccaag gtgtgttcag ggaagaaatt caagcacttc 3960 tgctggacga cactgttccg accaaaagag aagttctgcg agttgtcatg agcatcttcg 4020 accccatcgg attagttgcg gtttacgttg tccacggaaa ggtgctcgtg cagcacatct 4080 ggcgttcagg gattggctgg gacgagcgca tacctgacgg actattggag caatggaagc 4140 gatggataaa acttttgcaa cagcttgatc acgttgaaat tccacggtgc tacttccctg 4200 gctactctcc cgaacactac cgagccgtgg agctgcatat tttcgtggat gcatctgaag 4260 aagccttcgc tgcagtagcg tactttcgca tcgtcgaagg atcccgagtt cgctgttctc 4320 tggtgtcggc gaagacgaag gttgcgccat tgaagctact gtcaatcccg agactcgaac 4380 tatcggcagc tgttcttggc gcaaggttgg cgaaatctgt aatggaaaat cacacgattc 4440 ccatcagcca caaagtgttc tggagtgatt cttgcacctt tctctcctgg ttgcaagcgg 4500 atccaagaaa atatcgacaa ttcgtggcat tccgactggc agaaatccga gagctaacgc 4560 gagtcaacga atggcgctgg gttccctccc gtttgaacgt tgccgacgag gcgacaaagt 4620 ggggcaaagg tcccgaaatc acctcagata gtcgctggtt gctagctccg aaattcctct 4680 atcaacatcc ggacctgtgg ccgcagcaaa caataaaaga acatgagtcc ctggaggaac 4740 ttcgtccgat tcatcttcac tgcgacatcg gcaaagaaca attgctagag ttcacgaggt 4800 tttcgaaatg ggagcgcctg atccgtgctg tagcctacgc aagtaggttc ctgaacaatc 4860 ttcgttgcaa gataaaaaag caaccactca acaacacgga gtggctgagc agagatgaat 4920 tgctcaaagc ggaaacattc gtttgcaagc agattcagcg agaagcgttt ggtgatgagt 4980 ttgccattct acaaggaaac aaagatcgtc ctgttgctga acaaatgaag ctagagaaga 5040 ccagcaaaat catcaaatta tctcctttcc tggacgagca aggagttatc cgtatggacg 5100 gaagaattgc gggcgctcag caggtctcat ttgactttaa gtttcctatc attctcccca 5160 aaggacactg cggaacaaag ctaatcgtgg actggtacca tcggcaatac aagcactgca 5220 actctgagac ggcggtcaac gagatgcgcc aaagattcca cgtgtcggag atgagagtcg 5280 ccttcaaaca agccggaaaa caatgccagt ggtgcaagat ctacaaagcg gttccggaaa 5340 tcccacggat ggctccacta cctcaagcaa ggatggcgtc acacgtccgg ccgttcactt 5400 acgttggcgt tgactacttc gggccgatgt tggttaagca gggacgcagc gaggtgaaac 5460 gttgggtcgc ccttttcacc tgtttgacca tacgtgcagt gcatctcgag gtggtccaca 5520 acctgacaac agagtcctgc aagatggcga ttaggcgctt catcgcacgg aggggggccc 5580 cgcgggaaat tttcagcgac cgagggacaa atttcgtggg tgctaaccgg gacctgaaga 5640 tggagctaag gcaaattaac accaacctag caagcacttt caccaatacg gacacgcagt 5700 ggcgcttcaa ccctccttcc acgccgcata tggggggatc ttgggagcgc atggtccggt 5760 cggtcaaaag cgcgctggca tcactgtcaa cgggaggtaa gccggacgat gaaacgcttc 5820 gtacgctact ggctgaggca gaatctatcg tcaactctag gccgcttaca tacttgccaa 5880 tcgattctga ggagcaagaa gcactaacgc cgaactgctt cttgatgctg agcaccagcg 5940 gagtcaatca gccggctagg gagccgatcg aagagaggac gatggcacgc tgcaactggg 6000 acctctgcag ccaactgttg gataagttct ggacacgttg gatcaaagag tatctgccta 6060 cgatcactaa gcgtacgaag tggttcaagg actgcaagcc tgttcaagct ggagatctgg 6120 tggttgttgt ggaggaccgg ctccggaatg gctggctgag aggtcgggta ctacgcgtgt 6180 ttccgggaag cgacggccgt gtgcggaatg cggaagttac gacagccagt gcaggagtgc 6240 tggttcggtc agtggcaaag ctggcagttc tggaagtcgg tggtactgtt gaagtaaact 6300 tcgatcaata cgggtcgggg ga 6322 // ID LIN15_SM repbase; DNA; INV; 6104 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Non-LTR retrotransposon; consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN15_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-6104 RA Jurka J.; RT "Non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1909-1909 (2009). XX DR [1] (Consensus) XX CC The 5' and 3' termini are approximate. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 526..1833 FT /product="LIN15_SM_2p" FT /translation="MKSHIKYHDGDFVMLDLLSDSDLNVGNATLSSSELSR FT SQMRDLFKSFPELAELFFSESDCNTDKFKNLINNLKSEHGFPEQISFVGGK FT FIKLNETKDSDHSQAEGESDIMFNIGLLNFHEMCLKDHTLRSLFSQPKSSE FT KIIDIIFHLNYLQLKGELKHTSNGSYGDYVCNYVRNCLSLYSISDIETYNT FT LSLDDIEKWAPSLVEMIPPLSYHSSSCKERLLRLRQWRLSSEIDRLHNIIS FT PETITHINSFLSEIDKVYPQKATQLKALQNLSNNINAIXSESDDETIAKAK FT RFLFFVIRDKKKFLLGSKYSYRIPLHCRKICTTNMTTHIFSIIQSLLLTLT FT VSDSEQIADRSDSITLVNTTQSVEINPGTQSGSIVDISVVETGSTEHARVK FT RRLEVDIDEVESIIHTTSGTQQNHINTKSIKKARNSLAKNK*" FT CDS 2251..5982 FT /product="LIN15_SM_1p" FT /translation="MALNQLFNLKAAMTFHPHENFLYQENSYDCGPYICAY FT AMMISGVWQSFPDHFIDIIREEVHQLQLSCQQKDQKVSGGGLPKFMEIIKR FT NGVRKTSSATQAFLLKQNSCISSLLDEYFKTRTNTTVLSPELTMNIVAQNL FT TFIRENVTFRQFTGIDHIISIVPNDSNWSLFIFSLVLGNSYIFNFHDRXVT FT ERLIAIGENMTEYLNKFIISNRKVIFISEHNIEHASITSDDSLLFASAFII FT LIESLIFSKQVSHTRTSEFLKSVKIKQANEICLNLNSINCKINEEITAKYF FT SNLKLNSNFMLLNETMCTAILDDCTRHLTKFLNYEDLIKAQVIFAIIAPPK FT HREILLILDYNTDEYYIINPTTLCGTESYQSVCRIFRNKINEIKETPGRVI FT NMGDIPNDIRGSGLVSKLLICAFIKNYANDLPLINLNLREIGTNISSLLPI FT ISESKKILMSDLTKITKIKKSSLNLEQRKLKIDELLHDVSHLTVNEIIDII FT VRQFPDRVKSKXKPYLGNKTRPQKINKGILVRKFMTDMKWTVDKVFNNTNV FT SIRPTFQKIVLNFLIKSPFHGNWNNPLNFCKRSKQALNLELFSNFEIMFQL FT KRADNTSPGIDGIQYRDLLLLDPEGKLLTFLFNKIIVNKIIPDSWKSFKTI FT LTPKPNKDGKYDDVSSWRPIALLSVIYKVFASCLASRLTYWINTNNLLHIG FT QKGGSRHDGCVEHNSILSAALEHSKYSKNCPLAIAWLDIKDAFGSVPHDYM FT WSLLRFIGVGEDFTSILQLLYSDTSTFYSCGPILTPNIPIKQGVKQGCPIS FT MILFALAINPVLEAVSRSDCEPFMIGESPVKILAYADDIALIAKSVEDLQK FT ITNIAVRTAAEIGFEYRPEKCGYLQLPKVHIDSEILINNTKIKKLLSKEFY FT QYLGVPVGEEPDQSPYDILDKAVSDTRKLANSELLGWQKLKAYKIFVHSRL FT IFPFRTREIKTGALSKSNGNSNRSVNTSSQLRGCFRKMLCLPNQSEVSYFY FT NATENGGANCVDLLDEYHTQTITHFFRLLTSECDYARQINIDSLKFVTGPR FT LGIKTPSLQESLDWFNGKESKPHHSGRKTRFQRARIAVAFFEKTHNISVSF FT IVKDHKPSLYITTAMRGTIILTSDLRKTTSKVLHMALCDSYLSKWEKSCVS FT NFIASAVKLSPKINKAIFRSELSEFAWNFIHRARTNTLSIFAKHHNKGDKR FT LCRLCHAEDETMSHAIQSCKVHLTLGANDTMTV*" XX SQ Sequence 6104 BP; 2030 A; 1262 C; 1039 G; 1768 T; 5 other; cagaaataaa attaacgtta ttaaaaaact acaaatttga ataaacgaaa cttaaccttg 60 actcattttc cagaacattg aacaaaactg agaaacccca attctttaga gctttcccaa 120 accttggtaa gctattaaaa tctaaagaat ctcctcaaaa atttaagacg ttaaataatg 180 aactggctct ataaaatctg cgataaccct ctaaatattg ctgaatttgc tggatttttc 240 tataggaacc gcaatatttg cgtctctatc aaatttaagt tagagataag atggcccatt 300 aaataataat cccattcaca aaacctccat agttgattct aaaatagtaa aaacttacaa 360 taatggaccc atttcacccg gcatcgagac aataaaaaag ttcttgtttg ttttatcaac 420 caaactttct gaaacggcca caatatcaaa cgccaaagtt cggggggaac agattaataa 480 cgtctaaatt tattaaaaaa gcaaagagct ctcccggatt atttgatgaa atcacatatc 540 aagtatcacg atggtgattt tgttatgttg gacctcctca gcgattctga ccttaatgtc 600 ggaaacgcaa cattgtcctc ttcagaactc tctcgatcac aaatgcgtga tctttttaaa 660 tctttcccag aactggcaga gctgtttttc agtgagagtg attgcaacac tgataaattt 720 aaaaacctaa ttaataatct aaaatcagaa catgggttcc ctgaacagat ctcatttgtg 780 gggggtaaat ttatcaaact taatgagact aaagattctg atcactcaca ggctgagggt 840 gaatcagata taatgtttaa tatcggcctt cttaattttc acgaaatgtg tttgaaagat 900 catacattaa gatctctttt ctcgcaacca aaatcttcgg aaaagattat tgatataatt 960 tttcacctca attatctaca attgaaaggt gaactcaaac acacatcaaa cggatcatat 1020 ggtgattacg tttgtaacta tgttcgtaat tgtctgtcat tatattctat ctcggacata 1080 gaaacataca acacgctttc tctcgacgat attgagaaat gggcaccgag cttagttgag 1140 atgatacctc cmttatctta tcatagctcg agttgcaaag aaagactttt aaggctccga 1200 cagtggagat tgtctagcga aattgacagg ctacacaaca taatctcgcc tgaaacgata 1260 acacacataa attctttcct gagcgaaatc gataaagtct accctcaaaa ggctacccaa 1320 cttaaagctc ttcagaactt gagcaacaac atcaatgcaa tcragtcaga gtcagatgac 1380 gaaacgattg caaaagccaa gagatttctc tttttcgtaa ttcgcgacaa aaagaaattc 1440 cttttaggta gtaaatattc ttaccggatt ccactgcatt gtagaaagat ttgcactact 1500 aacatgacta cgcatatctt ttctataata caaagtctcc tccttacttt aactgttagc 1560 gatagcgagc aaatagcaga cagatctgat tcgattactc tggtgaacac cactcagtct 1620 gtagaaataa atccaggaac gcagtctggt tcgatagttg atatcagcgt tgtagagact 1680 ggctctactg aacatgccag agtgaaacgc agacttgaag ttgacattga cgaggtcgag 1740 tcaataattc atactactag tgggacccaa caaaaccata ttaacactaa atctattaaa 1800 aaagccagaa attcactggc gaaaaacaaa tgaaggtgcg tgcctgcatc agtctcaaaa 1860 tctgctctaa tctctaagga gactgatgca gacagcacaa taaaaatcat cgaaataaat 1920 aaacctctaa atgaagcagt tcgtgatcct acaatttggt tttctgatga tgatattgat 1980 ctttacctag aaaatcacat cagtagcctt gagtttgcgt ccataaattg tttcatcgtt 2040 gagattttgc gcactaaccc tggcgaagat ctttttccaa tcccggaaaa ggttttcaaa 2100 gcgcaaataa ttttatgccc tttaaatatt gacaaaactc actggatctt gtttgtatat 2160 tgtaagaact ccttgacatc ttactttatt gatccaatcc ttcaatatcg gcatcggttt 2220 gtcaacaaaa catatgcact tcaagttaca atggcattga atcagctttt caatcttaaa 2280 gctgccatga ccttccaccc acatgaaaat ttcttatacc aagaaaatag ttacgactgt 2340 gggccttaca tctgtgctta cgcaatgatg ataagtggag tttggcaaag ctttcctgac 2400 cattttattg acatcatcag agaggaggtt caccaacttc aactctcctg tcaacagaaa 2460 gatcaaaagg tttcaggagg tggcctaccg aaattcatgg aaatcatcaa acggaatggg 2520 gtcaggaaga ctagcagtgc tacccaggct tttctcctta aacagaattc gtgtatttca 2580 tccctattag atgaatactt caaaacgaga accaatacga ctgttctatc tccagaacta 2640 acaatgaaca ttgtcgccca aaacctcaca ttcattagag aaaatgtgac attcagacaa 2700 ttcactggaa tagatcacat catcagtatt gtcccaaatg actcgaattg gagtctgttt 2760 attttctctc ttgtgttagg gaacagttac attttcaact tccatgacag awtagtaacg 2820 gaaagactaa ttgccattgg agaaaatatg actgaatacc tcaacaaatt catcatctcc 2880 aatagaaaag tgatcttcat tagcgaacac aacatcgaac acgcttccat cacatctgac 2940 gactcacttc tctttgcttc tgctttcatt atattaatcg aaagtcttat ttttagcaaa 3000 caggtttccc acacaagaac cagtgagttt ctaaaatcag ttaaaattaa acaagcaaat 3060 gagatctgtt tgaatttaaa ctcaataaac tgtaaaatca acgaggaaat aactgcaaaa 3120 tattttagta atttgaaact aaattcgaac tttatgctac taaatgagac aatgtgcact 3180 gctattcttg atgattgcac acgacatttg accaaatttt taaattatga ggatctcatt 3240 aaagcacaag taatattcgc cataattgcg ccccctaaac atcgagaaat tcttttgata 3300 ctagactata atactgatga atactacatc atcaacccwa ctactctgtg tggaacagag 3360 agttatcaat cagtctgcag aatttttcga aataaaatca acgagattaa ggaaacacct 3420 gggcgcgtca taaacatggg tgatattccc aatgacattc gtggatctgg tcttgtgagt 3480 aaactactca tttgcgcttt cattaaaaat tatgctaatg atttacccct gattaactta 3540 aacttgcgcg agataggtac caacatcagc tctttacttc caatcatcag tgaatcgaaa 3600 aaaattctaa tgtcagattt gactaaaatc acaaaaataa aaaagagcag cctgaacttg 3660 gaacaacgta aactcaagat tgacgaactt ctccacgatg tttctcacct gactgtcaat 3720 gaaatcattg atattatagt aaggcaattc cctgaccgtg taaaatcaaa amacaagccg 3780 tatttgggta ataaaacacg accccaaaaa ataaataaag ggatattagt aagaaaattc 3840 atgactgaca tgaaatggac ggtcgataaa gtctttaaca acacaaatgt cagcataaga 3900 ccaactttcc agaaaattgt cctaaacttc cttataaaat caccttttca cggaaattgg 3960 aataatccgc tcaatttttg taaacgttcc aaacaagctc ttaacttgga gcttttctca 4020 aattttgaga taatgtttca actcaagaga gctgacaaca caagtcccgg tattgatggc 4080 attcaatacc gcgatttgtt gctccttgac cccgagggta aattattgac gtttttattc 4140 aataaaatca ttgtaaataa aataatacca gacagttgga aaagcttcaa aactattcta 4200 actcctaagc ctaataaaga tggaaagtat gatgatgttt catcctggcg tccaattgcc 4260 ttactttctg tgatctacaa agtgttcgcc tcatgtcttg cgagtcgttt gacttactgg 4320 ataaatacca ataatctttt acatatcggc cagaaaggtg gttccagaca tgatggatgc 4380 gttgaacaca actcgattct ttcagctgct ctcgaacatt caaaatacag caagaactgt 4440 ccacttgcta tagcttggct tgacatcaaa gatgcctttg gcagtgtccc acatgattac 4500 atgtggtcct tacttcggtt cattggagta ggggaggatt ttacatccat tttacagttg 4560 ttatattccg acacgagcac gttctacagt tgtggcccaa tattgactcc gaacattccc 4620 atcaaacagg gtgtcaaaca aggatgtcct atatcgatga ttttgtttgc cttggccatc 4680 aaccctgtac ttgaggcagt atcgcgttct gactgcgagc ctttcatgat tggagaatct 4740 cctgtcaaga ttcttgccta tgctgacgat attgctctga tcgctaaatc tgttgaagac 4800 cttcagaaaa tcacaaatat agctgttaga actgcagctg aaataggatt tgagtatcgg 4860 cctgaaaaat gtggatacct tcaacttcca aaggttcaca ttgacagtga aatattaatc 4920 aacaacacaa aaatcaagaa attactgtca aaggaatttt atcaatacct cggtgttccg 4980 gtcggtgagg aacctgatca gagtccttat gacattcttg ataaagctgt ttctgacact 5040 cgaaaacttg ctaactctga gttgctaggt tggcaaaagc taaaggctta taagatcttt 5100 gtccattctc gtttgatctt cccctttaga actcgagaaa tcaagaccgg tgctctttcg 5160 aaatccaatg gaaacagcaa tcgctctgta aataccagta gtcaattaag aggttgtttt 5220 agaaaaatgc tgtgtttgcc taatcagtct gaagtaagtt acttttataa cgcaactgaa 5280 aacggagggg caaattgcgt tgatcttctt gatgaatatc acacgcagac catcacccac 5340 tttttccgac tactaacttc agaatgtgac tatgcgagac agatcaacat tgactcttta 5400 aaatttgtca ctggtccgcg attaggaatc aaaacaccgt cattgcaaga aagtcttgac 5460 tggttcaatg gtaaagagtc taaacctcac cattctggca gaaaaactcg ttttcaaagg 5520 gctcggattg cggttgcgtt tttcgaaaag acacacaata tcagtgtatc attcatagtc 5580 aaagaccaca aaccatcgct ttacatcaca acagcaatgc gtggcactat catattaacc 5640 tcggaccttc gaaaaacaac atcaaaggtt ttacacatgg cactctgtga ctcgtacctc 5700 tctaagtggg aaaagagctg cgtctcgaac ttcatagctt cagctgtgaa actttctcca 5760 aaaataaaca aggcaatctt tcgcagtgag ttatcagaat ttgcatggaa cttcatccat 5820 cgagctcgca caaacactct ttccatcttt gcgaaacatc acaacaaagg tgataagaga 5880 ctttgtcgac tgtgtcacgc tgaagatgag acaatgtcac acgcaattca atcgtgtaag 5940 gtccatctga ctcttggtgc gaacgacaca atgactgttt aaaacttatc gcatcaaatc 6000 tcactaagaa ttcaaacttg attgtcgtgg ttgatcacgt ttgttcactt gtcccaaact 6060 ccaaagaacg cgtcgatcta atgatcacga cttgagcaga aaga 6104 // ID DNA8-103_AP repbase; DNA; INV; 270 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-103_AP. XX NM DNA8-103_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-270 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2041-2041 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 270 BP; 93 A; 65 C; 35 G; 76 T; 1 other; cagaggcggt ttgaatatat tatacttagt ggtgcaagac aatatataga cacccaccca 60 ccccttttca tcatagaaat tataaatacg ctttcaacag tttcagctac gattatcatt 120 tatcaaattt caatgattag gtacttcatt actttattaa acattaaaat taaacattgc 180 ggtmagcaca tactcaacca agaccaccca cccacccacc taatattggt ggtgcagtgc 240 accaaaaaca gtatactcaa accgcctctg 270 // ID Copia-19_SI-I repbase; DNA; INV; 4078 BP. XX AC AEAQ01024750; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_SI_; KW Copia-19_SI-LTR; Copia-19_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4078 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024750; Positions 359 4436. XX CC Positions [1610-2071] - Integrase core CC 'GAAAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1658..4045 FT /product="Copia-19_SI-I_1p" FT /translation="MTLIDDFSRYCFVYLLKEKSEAARRIMEFVILCDTQF FT GRKPKLLRTDQGREYLSTELQTFLKDNGIVHQTTASYSSQQNGVAERKNRS FT LIEMARCMLIEAELPHTYWGEAVITANYLQNRLPSRAIDKTPYELWFDRTP FT NVHYLRTFGCKAFMHVPDEKRRKLDNKAIELRLVGYSETVKGYRLLNTITN FT KIYVSRDVKFIEKIYRKSYFELTDDSVEENHSPKIPIPSKIESSEHASSEF FT ITNEPAPQAEETIRKSSRSTKGIPPQRLIETCKLVSCSIPEEPLTYEQAIS FT SPENKHWKQAMEEEIASFQKNRVWTLTKLPSDKKVIGGKWVFKRKLDDKGN FT ISRYRARFVAQGYLQVYGEDYDEVFAPVAKQATFRTLLTIAGKRNMSVQHY FT DIKTAFLNGELEEDIFMKQPAGFSVEGKEELVYKLHRSLYGLKQSARAWNQ FT KIHDVLTKEEFKRANADKCLYSRGSNGKWTYILIYVDDLIIASDNKDQIKK FT LEEKLSQVFNLCNLGKLKFYLGIHIEQESDGMFAIHQRTYIDRIIERFGLK FT DAKPSKIPLDPGYEKFKELSEKLPNNKQYRSAIGALLYLSTYTRPDIAAAT FT SILSRKISAPTEADWEEVKRVIRYIKGSRHLRLKLGEDNTNEDTALIGYAD FT ANWAGDVTDRKSNSGFVIKLYGAPISWASRKQTCVALSSTEAEYVALSEAC FT QELWINRLLNDFNICLKEPGILYEDNQSCLKMLDSEKMSNRTKHIDTKYHF FT CKELKTNQQVHFYYCPTSEMIADILTKPLEAVKTRQHVSGLGLIN" XX SQ Sequence 4078 BP; 1486 A; 702 C; 843 G; 1047 T; 0 other; aaggttatgg gcccaggaca catcttgcat ataaaacgaa atagataaaa gtaagttaac 60 ctcaaaaatt ctttctgaag acgctataaa cgtgtggatt catacaagga ttgttgcaaa 120 aaaaacgtgt ctaaaatggc agatattgca aaaatcagtg tagagaaatt gaataatagc 180 aactatcaaa cgtggaaata cgaggctgaa ctcgttcttt gacgtgaaaa cacgtggttt 240 gcagtcacca cagatcctcc tgccgcgaca gaaaaaggat atgaagcttg gacacaggcg 300 gatagtaaag ccataggcac aatcggtttg cttgttgaaa agggtcaaca tgtgcacacg 360 agaccagcaa aatcagcaaa agaagcatgg gaaaacttaa agaagcatca tgaaaaggca 420 agtctttcat ctaccgttca tttgtatacc aaattagcta acacaaaatt ggcagaaaat 480 ggtaacgttg aatcgatact ccatgaaatc gaatgtattc ttgaccacct tgtcgctgta 540 ggagagccag ttaccaagaa acttaaaatt ggtatgatgt taggaagttt acctagcagc 600 tacggtacgc ttattactgc ccttgaatgt cgtgcagaaa acgatctaac tgttgaaatg 660 gttagagaga aaatactcgg ggaatacaca cgccgaagaa ctgcaaacga gggtacacaa 720 gaactaaccg aaacagcact gaagatcaag aaagaaacat ggaagaaaga taaaaaagaa 780 aaaaaagagg gctaaaatgc gttttttgtc ataagcaggg acacatacaa gaaaaatgtt 840 acaagtatct cgcacaattg cagttgaaaa atactggaag tgaaaaacag caaactgcaa 900 aaatcgttaa gcaaattcca gaaaattctg aggaaggatc atcgaaatca agcaatgatt 960 ctgaactatt ttgctttaag acgagcatgg gcgatcaaga tgcatggtac gtagattctg 1020 gagcttcctg tcatttaact aataataaaa gttattacaa cagattcaat aagaatgtct 1080 gtgaaataat aacattagca gacgggcgca agacaaaatc tgaaggaaca ggagatataa 1140 cactttcttg cagcaatgac aacggaaatg taaagcatgt tcaggtaaaa agcgttcgat 1200 atgtgccaaa tttagattgt aatcttcttt cagtacgggc gctcacttca cgaggcatac 1260 gtgtgaattt ccttgagaat cgatgtgaga tcataaatgg tgatgtcatt gttggaactg 1320 ctgatgctgt acatggtttg tttaaactgc gttgtacaca gaaagtgttg atgactacaa 1380 atggatcaca tgaaaaatgt atacacactt ggcatcgacg ttttggacat cgtaatcctg 1440 aagctataaa actgttagag agtaagtcga tggcatgcgg aatcaagatt gaaaaatgta 1500 atgagcgttt tccatgtgaa acgtgtacaa aaggaaaatt tacacgtttg ccatttccaa 1560 agaaatctag tcagcagtca aagatacctc tagatttaat acacagtgat ctttgtgggc 1620 cgatgcagac aagacaccga gtaataaacg ttacataatg acattaattg atgattttag 1680 tcgttactgt ttcgtatact tgttaaaaga aaagagtgaa gctgctagaa gaataatgga 1740 attcgtcatt ctatgcgata cacagttcgg acgtaaacca aaattgttaa ggacggatca 1800 aggtcgcgag tatctaagta ctgaactaca aaccttctta aaggataatg gcattgtgca 1860 tcaaacgact gcatcatact cctcacagca aaatggagtt gctgaacgca aaaataggag 1920 tctcatcgaa atggcacgat gtatgctcat cgaagctgaa ctacctcaca cttattgggg 1980 tgaagcagta ataactgcca attatctgca aaatcgctta ccttcgagag caatcgataa 2040 aacaccctat gaactctggt ttgatcgcac tcctaatgtc cattatttac gaactttcgg 2100 atgtaaagct ttcatgcatg ttccagatga aaaaagaaga aagttggata acaaagcaat 2160 agagcttcga ctagtaggtt attccgaaac tgtaaaaggt tatcgattac tcaatacgat 2220 tactaataaa atatatgtta gtcgtgatgt caaatttatt gaaaaaatat atcggaagag 2280 ttattttgaa ttgactgatg attcagtaga agagaatcat tctccaaaaa ttccaatacc 2340 ttcaaaaatc gaatcttcag agcacgcctc ttcagagttc ataactaatg aaccagcacc 2400 tcaagcagaa gaaacaatcc ggaaatcatc gagatctacg aaaggtattc caccacaacg 2460 actaatagaa acatgcaagt tggtttcatg ttctatacct gaagaaccgc tgacatacga 2520 gcaggctata tctagtccag aaaataaaca ctggaaacaa gctatggaag aagaaatagc 2580 atcgtttcag aagaatcgcg tatggacttt aacaaaattg ccttcggata aaaaggtcat 2640 cggaggaaaa tgggtgttta aacgaaaact cgatgataaa gggaatatta gtcgttatcg 2700 tgctcgcttt gttgcccagg gttatttgca agtatatgga gaagactacg atgaagtatt 2760 tgcaccggtg gcaaaacaag caacatttcg tacgcttttg acaatagcag gcaaacgaaa 2820 tatgtcagtg caacattatg atataaaaac agctttctta aacggcgaat tagaggaaga 2880 tatttttatg aagcagccgg caggtttctc tgttgaaggt aaagaggaac ttgtatataa 2940 actacatcga agtctttacg gactgaaaca atcagcaaga gcatggaatc agaagattca 3000 cgacgtactg acaaaggaag aattcaaacg cgccaacgct gacaaatgtt tatattcaag 3060 aggatcaaat ggaaaatgga catatatttt aatatatgtt gatgatctca tcattgcaag 3120 tgataacaag gatcagatta agaagttaga agaaaaactt agtcaagtgt ttaatttatg 3180 caaccttggt aaattaaaat tctatcttgg gattcacatc gaacaagagt cagatgggat 3240 gtttgctata catcaacgta catacataga ccgaattatt gaacgttttg gtttaaaaga 3300 tgcgaaacca tcgaaaattc cattggatcc aggatatgaa aaatttaagg aattatcaga 3360 gaaattgcct aacaataagc aatatcgtag tgcaatagga gcattattgt atctgtccac 3420 ctacacacga ccagatatcg cagcagcaac atctatctta agtagaaaaa taagcgcacc 3480 aaccgaagcc gattgggagg aggtgaagcg cgtaatcaga tacatcaaag gatctaggca 3540 tttgcgtcta aaattaggag aagacaatac caacgaagat actgccttga ttggatatgc 3600 tgatgcaaat tgggcgggag atgtaacaga tcgaaaatca aattcaggat tcgtaattaa 3660 actttatgga gcaccaatca gttgggcaag tcgcaagcaa acgtgtgtgg ccttatcttc 3720 tacagaggct gaatatgttg cactctccga agcctgtcaa gaactatgga ttaaccgatt 3780 attgaatgat ttcaatattt gtctaaaaga accaggtata ttatacgagg acaaccagag 3840 ctgtctcaag atgttggact ccgagaagat gtcaaatagg acgaagcata tagatactaa 3900 gtatcacttt tgtaaggaat taaaaacgaa tcaacaagtt catttttatt actgtccaac 3960 atctgaaatg attgcagaca tcttaacgaa gccacttgaa gctgtaaaaa cacgtcaaca 4020 tgtatctggg cttggtttaa ttaattaaat gttccatagt cataactttg agagggga 4078 // ID BEL-612_AA-I repbase; DNA; INV; 6176 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-612_AA_; KW BEL-612_AA-LTR; Pao_Bel_Ele159; BEL-612_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6176 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5227-5787] - Integrase core CC 'GGGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(22..2499,2503..6174) FT /product="BEL-612_AA-I_1p" FT /translation="MSNPDPSATNTSDPKTPENKDLTTHCVCCDRPETVDD FT CVQCDQCNGWWHMMCAEVSASVADRSWTCGHCLPLSVSSRTTTSSVRVARL FT ALKKKQLEEQHAMEQRHLAEKYKLMEEELNEMDETSSNRSRMSRRTSLEKV FT KLWQRKYAEQSGDLHDIPADGRQAAGDALCNQGTNDRGDLAHPSNVKEPTP FT PPPAIKSPHERAEVDVNRLDSVQRGRLNAVVIEARQESTLYKPPLNSTVKS FT LSIPELNNAKWQHNTGTIPKQQKVINNPGKTDPQMPPVAPQGKPLFDHTTS FT TSLPPKSENSLQQIIKQFGSLAPSTMLAFNRTNMNALGTNTLPPSGTSTGF FT VPQANLELSSGSVPPPAEQPNVGDLPQANSISHSVSVPPLVGLQTVGSSPQ FT VNPGFSIGTVPPPPGLPIVENFTPSPSQLAARQVMSRELPTFSGDPADWPI FT FISSFMNSTLACGYNSAENLARLQRCLKGHAYESVKSRLLLPESVPSVIET FT LRLLYGRPELLISALLQNVRSVPAPKPEKLESIIDFGLAVRSLCDHLEAAG FT QQEHLSNPTLLMEMVEKLPAHTKLQWADYMQQHPLVNLKTFGDFMLRVVTS FT VSRVTMYVGGSSSDQHKTKQKGVVYAHTSEAELMHEPVFEQNRVCFCCKKP FT GHRVAECSIFKAYSVDDRWKFVQSKGLCRSCLNSHGRRSCKNATQCVFEGC FT QYRHHPLLHSNRSDNVSERSTQGLSTVQNHIHRQFKQALLFRIIPVVVSGP FT RATIETFAFLDDGSDLSLIESSLVDQLGIDGWKKPLCLKWTGNVTRIESES FT KHVRVMIRGMNSQQQFSLNDIRTVELTLPEQSLNYDELSQRYRYLQGLPVD FT SYDKAVPRLLIGVNNANLTVPLQVREGRKDEPIAVRTRLGWCIFGGRGNVA FT AHSLNYHACECSNDQDLHNTVKEYFAMDDAGVKSPVVLESVDDNRARKIME FT QTTVRRGDRFETGLIWKYDVIEFPDSFNMAVKRLECLERRMMRDPTLATNL FT KNQIMEYQLKGYAHRATEEELSQADPKRVWYLPLGAVTNPRKPGKVRLIWD FT AAAKVDGISLNSMLLKGPDQLTSLPAVLSRFRQYPVGVSADIKEMFHQLLI FT RPADRHSQRFVFRNNPADPVDIYLMDVATFGSTCSPASAQFVKNQNAEAFV FT NIFPRAVEGIVDNHYVDDYLDSFENEDEAERVSREVRSIHQKGGFLLRNWL FT SNSTIVLRGLNEEEQKDTKCLYLSGLENSDRVLGMLWKTTEDELWFPMIMK FT EEVQKVIESGERPTKRQMLKCLMGIFDPLGLLSVFLVHGKILLQDVWRTGL FT QWDEKVPERIFERWIRWISLFPEIGNLRIPRCYFKGASAQMYGRLQLHIFV FT DASEAAFSAVAYFRVVNAEGNPECSIVAAKTKVAPLKPLSIPRLELQAAVL FT GSRLMSFVQKSHRIEVKQRFMWSDSATVLAWLRSDHRRYKQYVACRVGELL FT STTDVEEWRWVPTKLNPADAATKWGKNACPKDNDEWFKGPKFLRLSEEEWP FT KQTSFPIQSEEELRPCFMVQEVIIPESVVDFSRFSKWRKLLGAIAYMHRFV FT DNCRRKHRGEAQNSIYLSPEELRRAKNTVMRAVQWQEFPDEMITLSRGLTG FT LPKTSSLYQLTPTIDECGVLRVDGRIGAAPNTAFDSMFPVILPRKHLVTNL FT VTEDFHRAFHHGNSETVVNEIRQFYHISRLRTVVKQIAKACQWCKVNKATP FT KIPRMAPLPVARLSSYSRPFTYTGIDFFGPLLVKVGRSAAKRWICIFTCLT FT IRAVHVEVAHSLSTSSCMSCIRRFVCRRGAPAEIYTDNGTNFQGAERLLKE FT QLLKMHDELAATYTNTDTKWIFIPPGAPHMGGAWERMVRSIKSAMETAYNS FT DRKLDDEGLATLVVEAEGIVNSRPLTYLPLDAAEGEALTPNHFLLGSSRGV FT RQPAIPFNDPAVAVKNSWNLIQHQLDVFWKRWIREYLPMLTKRMKWFGEVT FT PIAEGDLVLIVDDGRRNGWIRGRVKDVLVAADGRIRQATVQTARGILRRPV FT SKLAVLEVELGGKTGTDGQCYGGE" XX SQ Sequence 6176 BP; 1745 A; 1449 C; 1556 G; 1422 T; 4 other; aaactttaag aattgttcgc catgtcgaat ccggatccat cagcaaccaa cacgtccgat 60 ccgaaaacac cggagaataa ggacctaact acccactgtg tttgttgtga tcgtccggag 120 acggtcgatg attgtgtcca atgtgaccaa tgcaacggat ggtggcatat gatgtgcgct 180 gaagtgtcgg cttcggtagc cgacagatcg tggacctgcg gacactgttt gccgcttagc 240 gtttcttctc gaaccacgac gtctagcgtt cgggtggctc gcttggcctt gaaaaagaag 300 caactggaag agcagcatgc tatggaacaa cgccacctcg ccgagaagta caagttgatg 360 gaagaagagc tgaatgaaat ggacgagaca agcagcaacc gaagtcggat gagcagaagg 420 acgagcctgg aaaaagtgaa actatggcag cggaaatacg ctgaacaatc cggagacctg 480 cacgatattc cagcggatgg tcgtcaagca gctggcgacg ccctatgcaa tcagggtaca 540 aacgatcgtg gggatttagc tcatccgtcg aatgtcaaag aacctacacc accaccacca 600 gcgatcaaga gtcctcatga gcgggctgaa gtcgatgtca acaggttgga ttcggtgcaa 660 cgaggaagac taaacgctgt cgttattgaa gctcgccagg agtccacgct atacaaacca 720 cctttgaatt caacagtaaa atcgcttagt attccggagc taaacaatgc taagtggcaa 780 cacaataccg gcacgatccc taagcagcag aaggttatca acaatccagg taaaaccgac 840 cctcaaatgc cccccgtagc acctcaaggt aagccacttt tcgatcatac cactagtacg 900 tcattaccac caaaatcaga aaactccctc caacaaatca tcaagcaatt tggtagcctg 960 gcaccatcta ccatgcttgc ctttaatcga acgaatatga atgcactagg aacaaacact 1020 cttccacctt ctggaacgag tactggcttt gtgccacagg caaatctgga gctatcttcc 1080 ggatccgttc ccccacccgc ggaacagccg aatgttggcg atttgccaca ggccaattcg 1140 atttcacatt ctgtatcagt tcccccactc gtgggactgc agactgtagg ttcttctcca 1200 caagtaaatc cgggtttttc aatcggaaca gttcccccac cgccgggact tccgatcgtt 1260 gaaaatttca ctccatcccc ctcgcaatta gccgcacgtc aagtaatgtc acgtgaacta 1320 ccgacgtttt ccggcgaccc tgcagattgg ccaatattta taagcagctt catgaacagt 1380 acgcttgcat gcgggtacaa ctctgcagaa aatctcgcgc gtctacaacg ctgcttgaaa 1440 gggcacgcgt acgagtctgt caaaagtcga ttgttgcttc ctgaatcggt gccatcagtg 1500 attgaaaccc tccgtttgtt gtatggacga ccggaattgc taatcagtgc ccttctacag 1560 aacgtgcgta gtgttcctgc acccaaaccg gagaaattgg aatccattat tgacttcgga 1620 ctggctgtgc gcagcctctg tgatcacctg gaagctgctg gacaacaaga acatctctcc 1680 aacccgacgt tgctgatgga aatggtkgaa aaacttcccg cgcacaccaa actacagtgg 1740 gctgattaca tgcagcagca tcctttggtc aacttgaaga cattcggaga tttcatgctt 1800 cgagtcgtca catcagtcag cagagtaaca atgtatgtcg gaggcagcag cagcgaccag 1860 cataaaacga agcagaaagg agtagtgtac gctcatacca gtgaggcaga actaatgcac 1920 gagccggtgt ttgaacagaa tcgagtgtgt ttctgttgta agaaaccagg acatcgtgtt 1980 gcagagtgtt ccatattcaa agcgtattct gtagacgacc ggtggaagtt tgttcaatct 2040 aagggattat gccgcagctg tctcaactca cacggaaggc gaagctgcaa aaatgcaacg 2100 caatgcgtat ttgaaggatg ccaataccgc catcatccgc tgctacactc caatcgatcc 2160 gataatgtca gtgagcggtc tactcaagga ttgtcaacag tgcaaaacca cattcatcgg 2220 caattcaaac aagctttgtt attccgcatt atacctgtag tcgtttccgg accgcgagcc 2280 accatcgaaa ccttcgcctt cctagatgat ggctcggatc tatccctcat agaaagtagc 2340 ctggtggacc agctgggcat tgatggctgg aagaaaccct tgtgccttaa gtggacaggg 2400 aacgtcacca ggatagagtc agagtcaaaa cacgttcgag tcatgatcag aggaatgaac 2460 agccagcaac agttctcgct caatgatatc cgtaccgttm aggaactcac actcccagaa 2520 caaagtctga actacgatga actttcgcaa cgctaccgct atctacaagg tttaccagtg 2580 gacagctacg ataaagcggt acctcgtttg ctgataggcg tcaacaacgc gaatcttacg 2640 gtcccactcc aagtgaggga aggcagaaag gatgagccta ttgcagtgag aacccgcttg 2700 ggatggtgta ttttcggtgg tcgtggtaac gtagccgctc attcgctaaa ttaccatgcc 2760 tgcgaatgtt caaatgatca agatctgcac aatacagtga aagagtattt cgcgatggat 2820 gatgccggag ttaagtcacc agttgtgcta gaatccgtag atgacaatcg agcaaggaaa 2880 ataatggagc aaactactgt ccgtagaggt gatcggtttg aaacgggcct aatatggaag 2940 tacgacgtta tagagtttcc ggacagcttt aatatggcgg tcaaacgttt ggagtgttta 3000 gagcgcagga tgatgcgaga tcctactcta gcgacgaacc tgaaaaatca gattatggag 3060 tatcagctca aaggttatgc gcatcgagcg acggaggaag aactttcgca agcggatccg 3120 aagcgagttt ggtatctacc actaggagcg gtaacgaacc ctagaaagcc aggaaaagtg 3180 cgactaattt gggacgcggc tgctaaagta gacggaatat ccttaaactc tatgctgctt 3240 aaaggtccag accagttgac atccctccct gctgttcttt cacgatttcg ccagtaccca 3300 gtgggcgttt cagcagatat aaaagaaatg tttcaccaac ttctaattcg accggctgac 3360 cggcattctc agcgcttcgt gtttcggaac aatccagcag atccggtgga catttactta 3420 atggatgtgg cgacctttgg atctacgtgc tcgccggcct cggcacagtt tgtaaaaaac 3480 cagaacgccg aagcattcgt taatatattc ccacgagcag tcgaaggaat tgtagataat 3540 cactacgtag acgattatct cgatagcttt gaaaacgaag acgaggcaga acgagtatca 3600 cgagaagtcc gatcaataca tcagaaagga ggtttcttgc tcagaaactg gttgtccaat 3660 agcaccatag ttctacgtgg actcaatgag gaggaacaaa aggatacgaa gtgtctctat 3720 ttgagtggat tggagaacag cgatcgtgtt ctaggaatgt tatggaaaac gactgaagac 3780 gagctatggt ttccaatgat catgaaggaa gaagtacaaa aggtgataga aagcggtgag 3840 cggcctacta aacgacaaat gttaaagtgc ttgatgggaa tattcgatcc gttgggactt 3900 cttagcgtat ttctagttca cggaaaaatt ctgctccagg atgtatggcg caccgggttg 3960 caatgggacg aaaaggttcc agagagaatt ttcgagcgtt ggataagatg gatcagctta 4020 tttcctgaaa tcggaaacct acgtattccc cgatgctatt tcaagggagc cagcgcacaa 4080 atgtacggac gacttcaact ccacatcttc gtcgacgcta gtgaagctgc gttttccgct 4140 gtagcatact ttagagtggt caatgcagaa ggaaatccgg agtgctcaat agttgcsgca 4200 aaaacaaagg tcgcacctct gaagcctctt tcgattcctc gactggagct gcaagcagca 4260 gttttgggta gccgactgat gtcatttgtg cagaaaagtc accgtattga agtcaaacag 4320 cggtttatgt ggagtgattc cgccactgta ttggcctggt tgcggtctga ccatcgccgc 4380 tacaaacagt atgtggcatg ccgcgtaggg gaactgttat caaccacaga cgtggaagaa 4440 tggcgatggg ttcccactaa gctgaaccca gcagacgctg caacgaaatg gggaaagaat 4500 gcgtgcccga aggataacga tgagtggttt aaaggcccga aatttctgcg tctttccgaa 4560 gaggagtggc cgaaacaaac gtcatttcca attcaatctg aggaagaact wcggccctgt 4620 ttcatggtac aagaggtcat cattcctgag agcgttgtag acttcagtcg gttctcaaaa 4680 tggcgaaaac tactaggcgc aatagcgtat atgcatcgat ttgtcgacaa ctgtaggcgc 4740 aaacatcgag gagaagcgca aaattcaata tacttgagcc ctgaggaact gaggagagcg 4800 aagaacactg tgatgcgggc tgtgcaatgg caggagttcc cagatgaaat gattacgctt 4860 tcgcggggcc taaccggatt acctaaaaca agcagcttgt accagttgac gccaacgatt 4920 gacgaatgcg gagtgctacg cgtcgacggt agaattggtg ccgcaccgaa taccgcattt 4980 gactcaatgt tccctgttat tcttcctaga aaacatttgg tgacaaatct agtaacggag 5040 gatttccatc gagccttcca tcatgggaat tccgaaacag tagtcaacga aatcagacaa 5100 ttctaccaca tatctcgact aagaacggtg gtgaaacaaa tcgctaaagc gtgtcaatgg 5160 tgtaaggtga ataaagcaac tccgaagatc ccgcgaatgg cgccgctgcc tgtagctcgt 5220 ctgtcctcgt attcacggcc cttcacgtac actggcatcg acttcttcgg accactattg 5280 gtgaaagtag gtaggagtgc tgcgaaacgc tggatctgta tcttcacctg tctcaccata 5340 cgagccgttc atgttgaggt ggctcacagt ctttccacgt cttcgtgcat gagctgtatt 5400 agaagatttg tttgtcgtcg aggagcaccg gcggaaatct acacggataa tggtaccaac 5460 ttccaaggag ctgaacgcct cctaaaagaa cagctcctga agatgcacga cgaattagct 5520 gcgacgtaca ctaatacaga tactaaatgg atattcattc cacctggcgc acctcacatg 5580 ggcggagcgt gggaaagaat ggtgcgctcg ataaagtcgg ccatggaaac tgcgtacaac 5640 agtgatcgga agctagacga cgaaggacta gcaacgttag tcgttgaagc cgagggaata 5700 gtcaacagcc gaccgcttac ttacctgccc ttagacgctg ctgaaggaga agcactcact 5760 ccaaatcatt tccttcttgg aagctctaga ggtgttcgac agccggcaat accattcaat 5820 gatccagctg tagctgtaaa gaattcctgg aacctgatac aacatcaact tgacgttttc 5880 tggaagcgtt ggattcgaga gtatctcccg atgttgacca agcgaatgaa gtggttcggt 5940 gaagtaacgc ccatagctga aggagatcta gttctaatag tggatgatgg ccgaaggaac 6000 ggctggattc gcggaagagt taaggacgtt cttgttgcag cagacggcag gatacgccaa 6060 gctacagttc aaactgcgag ggggatattg cgtagaccgg tatcgaagtt ggccgtgttg 6120 gaggtagagt taggtggtaa aactggaacc gatggccagt gttacggggg ggagga 6176 // ID Gypsy-100_CQ-I repbase; DNA; INV; 6850 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-100_CQ_; KW Gypsy-100_CQ-LTR; Gypsy-100_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6850 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 579-579 (2011). XX DR [2] (Consensus) XX CC Positions [4846-5322] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3919..5694 FT /product="Gypsy-100_CQ-I_1p" FT /translation="MANYYRRFINNFSELTSPLTDLLKNKPKKVEWTAEAD FT LAFRTIKEKLISAPVMSNPDFKLPFTVQTDASDHAIAGVLTQVQNGEEKVI FT AYHSEKLKGAELNYHAAEKEGLAALRCIEKFRCYIEGSKFTLVTDSSALTF FT IMRSKWKTSSRLSRWSIILQQYHMIIKHRKGKENVVPDALSRSLEIAEIDD FT GSDWYSKLYASIKRDPENYIDFKIENNILYKLVSSQSDVLDYAFDWKQCVP FT TSKRADILVKEHDEAFHVGYEKTVAKIKERFYWPRMAAEIKKYVMNCETCK FT RIKHSTISVVPEMGNQRITTKPFQILTIDYIQSLPRSAKGNAHLLVLMDVF FT SKYCLLVPVRKICASSVCEILEQFWFRRLSVPQYLISDNATTFQSKEFQQL FT LNKYEIQHWASARHRSQANPTERLNRTINAMIRSYVRENQRLWDTRISEVE FT FVLNNTVHATTKYTPYRVIFGHEIVTKGSEHRLQSSLELSDEERMERMQIV FT SRSVHQRVVDHLRKAHEETKRRYDLRHKRYAPTFDVGQKVYKRTFRQSSAG FT DHFNAKLGEVYEPVVILAKKGSCAYEVGDDAGKSLGVFSAGDLKA" XX SQ Sequence 6850 BP; 2238 A; 1210 C; 1443 G; 1951 T; 8 other; tattggcgac caacaaaaaa aaagttaata atttcaatta ttcactttat tttggaggtt 60 aacgaggggg gagaatctgg tttgaatcgg caaaatcggt cttcggattg gcgtcggtgt 120 ttctggtaaa cccacttgct tttaaactgt tcaatttaat gtagataatt gtagttctga 180 agtgaattct tcaactgata ttataaacca cttaatttgg gtggcaatga ttgttcaaaa 240 gtaaattaat cgctcaataa cttgcaattt tggcgtgatt cgatcgaaaa ccgcatgtaa 300 aatatttatt ggttataatt tgtcgtgaaa tcgtttgtct caattaattt attaaatttg 360 gtgttcattt gtacatttta aggtgtgaat cattttgcat gagctgtatt ttattctatt 420 tattttattt attattttaa actgtttaag tgtgtttttt tttctatcca aaatggatca 480 atttattcgt caatattttg atatgaacac tgctcatttg ttggaagacg agctgaacaa 540 cgagctgacc attcgcaata ttgagtttgg ttcggagtct cgatcagctt tagagagaag 600 attaagagga caattaaaag aagaaagaga gtccaaagtc ttaacatttg agtacgaaaa 660 aggatgggat ttattgatcg atgaactgga attatgtgat aataaaattc aggagatcaa 720 aaatatttta gaaaacagaa cagctwaaag tgtcccagat caaaaattca aaacacgatt 780 acttcattta tttatccgca tgcttcgagc taaagttcac actaccgagg acgctgatat 840 taatacgatt aaggaaatgg taggagtttg cgctaaacta ttaagaaatt atttttcacc 900 gacttcaccg tttgaggaga tacgagcggc agaagcagaa attatcaatt ctagcatacg 960 ccaagcgcga gatgaactcg acaaagataa gaccccaact gagatcacta ctcagaacaa 1020 tggtaacgta acccctattc cgtccaaatc aaattcaata tattcgggag tgggaactga 1080 acaggaagaa gtagaggaag aagaagaaaa ggaaaaagat gatgaagaag aggaaggaaa 1140 cgtcgaagaa gagggagatg cgggaaaaag gaatgagcaa caaagcgaat taaataagtt 1200 gcgagaaatt aacgctgagc ttagaactat tgtkgatcaa ttattagaga gaatacacaa 1260 attagaaatt gaaaattcca aaccagatga tgtcatgaaa aatagcaccc aatttaacaa 1320 tcagaataaa cctaagttgg aaggcccatc aaaaccaaaa tttagttaca aagatttttt 1380 aaattggctg gtcaaagatg aaaattcagg tgaacacagt aacaacaatt caaatggaaa 1440 tgtcaaagat aataaaaawt ttcataaaca accaaaaccg agttcctcaa gccataaggg 1500 atcgggagga gaagtgactg gaaagcggtt ccctgtgcac aagtggaaaa ttaggtacga 1560 tggctctgat ggaggtcgtc gcctgcacga ttttttgaaa gaagtagaat ttaatgcaag 1620 atccgaaggt tttaacaaac aagaattgta caattcagct catcacttgt ttgaagggaa 1680 agctaggtca tggtttatgg agattaatgc aaataatgag ttagaaactt gggaaaatgt 1740 tgtcagtgag ttgaaactag aatttcttcc acctgatctt gattattttt atgaacggca 1800 gttgcatttg cgtaagcaag ctgcaaggga aaagtttcaa gattattatt ttgatatgat 1860 tcgactcttt agaaatttgt ctagtccgat ggaggaggac agaaagttta aaattatttt 1920 tcgtaatgct cgaagcgagt acaaaagcgc tatgctagct gcaaacgtta aaactcttcc 1980 cgcgatgaag gaatttggga aaaactttga cgccatcaac tggcaatggt acactaaggg 2040 tgagaaagaa ggtccccgag gaacacgaac aaatgatagg cagataaacg aaatcaaaac 2100 agagagaaga aacccaaacc cgaacaatgc tggaaataat caaaacaata atcgtccgtg 2160 gaataaggga gagaatttct cgagagagtt tagaaacacg aatcgtccaa atccaaatta 2220 cagaaatggt cagaataagc agaatccacc aactaaacct gaaaattcgt tgcaaaaaga 2280 gcccgagcgg aagccattgc aaaaaactag tcaacctgat gatcaaaatc cagggcctag 2340 tacaaatcta aatactttgg aaaagatttt aaagtcgtat gtcccgttac gtaaaggtac 2400 ttgttttaat tgccataatt ttgggcacaa cttttatcaa tgtaaacagg aaaagcaaat 2460 attttgtgaa gagtgtggtt ttcctggttt tttaaaatgt gattgtccgt tttgtccaac 2520 aaaaaactca gagaaggctg ctcaatgagg cagagtagtc gaaaggaatt acctgaaagc 2580 cctcttgaag aaaattccac caaaagtatt cttacgcagc tcggttacgc gcacgtggwg 2640 gctgaatcta acgatgatgg ggaggtcaac acaatttttc tccacccgca tggtgataac 2700 aggccatttg tgcgaatcga cttgctagga atccaaataa ctgcactttt agacagcggg 2760 gcaaatagaa acattttagg gaaaaattca cataaactcg tcacgaattt gaatttgaac 2820 tgttcaccgt csgacatgtc tttgattact gccgaaggac atccagtaga agttgcagga 2880 gagattgata ttccagtttc ttttaacgga actacaaaga tcgtctcttt tgtagtcgcg 2940 ccctctctca agcgacagtg ctatcttgga atgtcgtttt gggaccaatt cggtatctat 3000 ccttctctcc gagagtcgtt tatcgaaaca attgatgact ctgaggagca gatcgatgat 3060 gaagttctgt tgactcccaa acaacaagcc aagttagaac aaattaaatc tttatttctg 3120 gtaccaaagc ctggtgatat cggctgtaca aagttgttaa ctcacactat tgaacttagt 3180 gacgaatacc gtgacaaacc accaattaga caaaacccat acccatggag tcccgaagtt 3240 caacgaaaaa taggaattgc tttagataat atgattaggg atgatattgt agaaccgtct 3300 acttcagaat ggtctcaacc agtagtacct gtttcaaaaa gagacagcga tgctgtacgc 3360 ctctgtctcg acgcgagaaa gctgaatgag cgcactaagc gtgacgctta tccgcttccg 3420 catcaaaatc gcattttaag cagacttggt tcttgcaaat atcttaccac cattgatcwt 3480 agtcaagcct ttctacaaat accgcttagc cccgaatcta ggccatttac agcattttcc 3540 atacctggac gtggattgtt tcattttaag aaactaccgt tcgggttagc aaacagtcca 3600 gccagtttga gcaaattaat ggacaaagtg ttaggattcg gtgtgctcga gccaaatata 3660 tttgtgtact tggatgacat agttgtcgca agtcacacgt ttgaagamca cgttgaacac 3720 ttgaaagagt tagctagaag gttgaatgaa gctgatttga gaatcaattt acaaaaatcc 3780 aaattttgtg tcccggaatt gccctactta ggttacattt tatcaaaaga cggtcttagg 3840 cccaatcctg atcgcgttca cgcgattgtt gtgtatgagg tccctaagtc tgttcgtgcg 3900 ttgcgcagat ttctcggtat ggcgaactat tatcgacgat ttatcaataa tttcagcgaa 3960 ttgacatcgc cactcacaga tctactcaaa aacaaaccta aaaaggttga gtggactgct 4020 gaggctgatc tggcattccg gaccattaag gagaaattga ttagcgctcc tgtaatgtca 4080 aaccctgatt tcaaattacc ttttacggtg caaaccgatg caagcgatca cgcaattgcg 4140 ggggtgctta ctcaagttca gaacggagag gagaaggtca tagcgtatca ttcagaaaag 4200 ttaaaaggcg cggaacttaa ctatcacgcc gcggaaaagg agggattagc agctctgagg 4260 tgtatagaaa agtttcgatg ttatatcgag gggagtaaat ttacactagt gaccgactcg 4320 tcagctttaa cattcattat gcgctctaaa tggaagactt cttcaaggtt gtcacgttgg 4380 agcataatct tacaacaata tcatatgatc ataaaacacc gaaagggaaa agaaaacgtt 4440 gttccwgacg ctctctcccg gtctttagaa atcgcggaga ttgacgatgg tagcgattgg 4500 tattctaaac tttacgcttc tataaaacga gatcctgaaa attatattga ctttaaaata 4560 gaaaataata tcctttacaa actagtttct tcacagtccg atgtattaga ttacgcgttt 4620 gactggaagc aatgtgtacc gacttctaaa agagcagaca ttttagtcaa agaacatgat 4680 gaagcattcc atgttggcta cgaaaagact gttgcgaaaa ttaaagaaag gttttattgg 4740 ccacgaatgg cagctgaaat taagaaatat gtaatgaact gtgaaacttg taagagaata 4800 aaacattcaa caatttcggt tgtcccagaa atgggcaatc aacggattac aaccaaacca 4860 tttcagattc taacaatcga ctacatacaa tctcttccga ggagtgccaa gggcaatgca 4920 catctgctcg tcctgatgga cgtgttctct aagtactgtc tgttggtccc tgtccggaag 4980 atctgtgcaa gtagtgtttg tgaaattcta gaacagttct ggttccgtcg tttatctgta 5040 cctcagtacc tgatatcaga taacgcaaca acattccaat ctaaagaatt tcaacaacta 5100 ttgaacaaat atgaaataca acattgggct agcgcaagac accgaagtca agcgaacccc 5160 acggaaagac tgaataggac gataaatgca atgattagaa gttatgttag ggagaatcag 5220 agactttggg atacgcgaat ctccgaggtg gaatttgtgc tgaacaacac cgttcacgcg 5280 actaccaagt acaccccgta cagggtgatc ttcgggcatg agatcgttac caagggctcc 5340 gagcatcgac tccaatcctc actggaattg agtgatgaag agaggatgga gcggatgcag 5400 atcgtcagta gatcggttca ccagagagtt gttgaccatc tacgaaaggc gcacgaagag 5460 acgaagagac gttacgattt gaggcacaaa cgttacgccc cgacctttga cgtcgggcag 5520 aaagtgtaca aacgtacgtt ccgtcagtct tccgccggcg accactttaa cgcgaagctt 5580 ggagaagtgt acgaaccggt ggtgattctc gcaaagaaag gttcttgtgc gtatgaagtt 5640 ggtgatgacg cagggaaaag tcttggtgtc ttttccgctg gcgatttaaa agcataaatt 5700 tttgcatgtt ttatgctgga agtgatagaa agttggtgta aacgatttgt gatgggaaag 5760 ttgagagccg atttctgtta tcagcatgta gaaagtcggt tcctcacacc gtttcagcta 5820 ttgtttcgac ccagccgttc ttatcatgga attgtatccc gggatgatga gcaagaaaag 5880 gagaggaata tggataacga ttggagatgg aataattgaa ggtcagtttg tctattgtaa 5940 aagggtacca aaaacaaaac agtaaagctt tacaaatttt atactcttct gtgatatatt 6000 tattttcttt tgtgtctttt aatcgcattg tccttgatcg taaacagttc ttttggtcat 6060 ggaatagcga gttgtgggcg gacaaaatga atttatttat ttatctctga gtttttgtta 6120 cagaaaatgt ttgtgggtag tttttgaagc accagtattt atgtttttat gctgtattat 6180 tttaacttcg tcacaaggac gcttgtgaac gttgaatcca ttgaaggagg agcaatttta 6240 aattgtttct tatttataca ccaccaaatt aagtacaaat acaccaaatt tgattccctt 6300 ctgtttaaga atccacgtac gaagtcaacc ctatttaact gatcttggcc aatcttttaa 6360 agtaaaaatc gatacatttc gacttcgtat ctgcgcaaaa attgttcgtt ttcttaaact 6420 ggatcgaatc aaatacgcaa cataagtaaa tttcgcgata tgaattgaaa aactaaggcc 6480 atagtttgca gtgtcactct agttagtgct cacaagtgca acataatacc acccaaaacg 6540 aatccgcgcg cttaattaat gaaaatttaa atttgtagaa gtattccgga atacagtaag 6600 aattcatatt tttttttgta attttaatgt gtaatttatt gtttgatgtt tattgtttgt 6660 taatgattgt ttcccatctt ttgatgcgag tataatattt cagtcgaaga tcatttcgtt 6720 tcttcactta tgcacagttt gaacggattg attgtaaggg tacagttgcg gcattttgtt 6780 tgattctaat cttattcaaa aaacaaaaaa aatatctaac aatatttttt ttccccgagc 6840 ggtgcggtag 6850 // ID BEL-211_AA-I repbase; DNA; INV; 5568 BP. XX AC AAGE02024031; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-211_AA_; KW BEL-211_AA-LTR; BEL-211_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5568 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024031; Positions 20918 15351. XX CC Positions [4621-5181] - Integrase core CC 'CTGAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 43..5568 FT /product="BEL-211_AA-I_1p" FT /translation="MMPPLAPPPERSCSMCQSEDDSQMVCCDKCSNWFHFD FT CVGVTEDIANHPWICPSCVKANTMNPVSGSTPVPISAPSSSSASNIPAPPV FT PPVVIAPSASPMQPIQLSVELQLKMLEEERALERRYLQRKYQLMMEASGSG FT TTSFPQPSTNAMPDDSFRMPPCHSSPLRPPPATAPELHLLEGTGGANETAL FT LNSSQIAARHAVAKELPVYSGEPEEWPLFIATYENTTRLCGYTPEENMVRL FT QRCLRGKALEAVKCQLLHPANLEHALSTLKMLFGRPEIIVHCLIEKINKIP FT APKADRLNTLVDFALAVRNMVATVRACNLEEHLYNISLLQGLVEKLPTMVK FT LNWATHRVRLQRVSLSEFSDWLYTVAEAASTVTMSSTPPHVYEGKPRRSGN FT REDGFLSLHAEHQQESTWTQRMSTGCVVCQRSCNAVEFCPQFLNLNHSDRW FT ATLRDHKLCRTCLGVHRGPCRSNDICGKNGCTFKHHRLLHNERSDSRTSVR FT YSVQSEQNENANFNQHNCNTHRWNEKSILIQYVPVVLHHNGNTVHTYAFID FT CGSHITLLEEELAADLNLRGEKHPLCIRWTADHCRFEDSAERVSLHVSGTG FT NGENKFLLSEVFTVKDLKLPPQTLSVSKLCRKYGYLKGLPVESYNNIRPRL FT LIGMNNIRLGHALDSREGNENGPIATKTRLGWTIFGTCQDDHPKPSTIPCS FT FHICSHSTIADEDLHTVVKNYFALDSLGITANKQLLSVEDERALKILRSKT FT RLENGRYTTSLLWKYDDVRLPNNKAMALRRHSCLLKRIKREPALGETLRKK FT VADYVRKGYIRKLSYQEIHQPGNRIWYLPIFPVINHNKPGKVRIVWDAAAA FT IGGTSLNSVLLKGPDQLEPLPFVLYKFRERRVAICGDIEEMFHQVRIDEVD FT QQSQRFLFQDKPDAPEPDEYVMNVMTFGATCSPSSAHFVINENAERFAGQL FT PAAAGAIRKCHYVDDMLASVDTEPEAIQLAKDVRFIHQQGGFKMRGWISNS FT STVTEALNGENVSDKNLDLIKEVALEKVLGMWWNTDTDEFQFKLSKERHVD FT LLSGAKYPTKRDILSMLMSIYDPLGLIANFLMPLKLLLQEIWRSGVSWDEP FT IEFLHMQKWKSWLSYLPQAGSVRIPRCYNIRLSYDQLNIELHTFVDASELG FT YCAVAYLRFEKAGQVDCVLVGAKTRVAPLKFVSIPRLELQAAVVGTRLAKS FT IQEGHSISITQRYFWTDAADVLCWLRSDHRAYSQFVAFRISDILESSDIAE FT WKYVPTKENVADEGTKNQRQLKLGSDSRWRRGPPFLWGPKNTWPVESPSKK FT MTTLEMRPSMLHHTVIPVRDCLQPNTFSDWLRLRRITALVARFPMNIRKKI FT AGEPIVVGTLTCTELLQAEVYHYKRAQYDVYADEMALLSTHEKGKQILPKG FT NRLYKLTPTIDNDGVLHIHGRIDACDYVGKNTKYPIVLPRGHALTRLILQS FT IHEEYHHHCHETFVNEVRKKFYIPKIRMECRSIRSNCQQCKVRRAQPNPPA FT MGDLPVARLAAFVRPFSYIGVDYFGPFHVAVGRRVEKRWGVLVTCLTVRAI FT HIEVAHSLNTDSCIMALRNCFARRGVPIQIISDRGTNFIGAEKELKKALEL FT VDQDAMIKKFTTSSTSWTFNPPATPHMGGSWERLIQSVKKVLYDIQPHRVP FT RDEVLKNTLIEVENIVNSRPLTHVPIDDESSPALTPNHFLVGSSNGLKPLV FT PFDDCATALKQSHKTSQILANFFWKRWINEYLPNITRRTKWFAPVKPITIG FT DIVVIVDPKQPRNCWPKGRVLATTLSKDGQVRQATVQTSGGIFQRSAVNLA FT VLDVGAKRSESDQGPITGGD" XX SQ Sequence 5568 BP; 1604 A; 1364 C; 1289 G; 1311 T; 0 other; aattaaaatt ttcgtttcac ggattactca aggagtgtct acatgatgcc acctctcgct 60 ccacccccag agcgaagctg ctctatgtgt caatctgaag atgacagtca aatggtttgc 120 tgcgacaagt gtagcaactg gtttcatttc gactgtgtcg gggtgacaga ggatattgct 180 aaccacccat ggatttgccc gagctgcgtc aaggccaata cgatgaaccc cgtgtctgga 240 tccactcctg tgccgataag tgcgccgtcg tcatcatcag catcaaacat acctgcacca 300 ccagtaccac cagtagttat agcaccatcc gcttcgccca tgcaacctat tcagctgtca 360 gtagagctgc agctaaagat gctcgaggaa gagcgtgcgc tcgaacgaag gtacttgcag 420 cgtaaatacc agctaatgat ggaagcatct ggatcaggaa caacgagttt tcctcagcca 480 agcacaaatg ccatgccaga tgatagtttt cgtatgccac catgtcactc ctctccgtta 540 cggccacctc ccgctaccgc ccctgaactc cacctccttg aaggcacagg tggggcaaac 600 gagacagcat tgctaaattc cagccaaatc gcagctcgtc acgcggtcgc gaaggagcta 660 cccgtctaca gcggagaacc agaagagtgg ccactattca ttgccacata cgaaaacact 720 acaaggctat gcggatacac gccagaggaa aatatggtcc gactacagcg gtgtctacgc 780 ggaaaagcgc tggaagcagt caaatgtcag ttactacacc cggcaaattt ggaacacgca 840 ctttctactt tgaaaatgct cttcggacgc ccggagatta tagtgcactg ccttattgaa 900 aaaatcaaca aaatccccgc acccaaggca gacaggctca atacgcttgt agattttgct 960 ctggccgtgc ggaatatggt ggccaccgtg agagcctgta acttggagga acatctctat 1020 aacattagtc ttcttcaagg tctcgttgaa aaactaccca cgatggtaaa actaaattgg 1080 gcaacacacc gtgtacgtct acagagggtt tccttgtccg aattcagcga ttggctgtac 1140 actgtggcgg aagcagcgag taccgtaaca atgtcatcta cgccaccaca tgtttacgaa 1200 ggaaagcccc gtagaagcgg taatagagaa gatgggttcc taagtcttca cgccgaacat 1260 cagcaagaat ccacctggac acaacggatg agtactggtt gcgtagtttg ccaaagatct 1320 tgtaacgccg tagagttctg cccacagttt ttgaacttaa accattccga tcgttgggca 1380 acattgcgtg accacaagct atgccgcact tgccttggtg tccaccgagg accgtgtagg 1440 tcaaatgata tctgtggcaa gaatggttgc acgtttaaac atcaccgtct gctacacaat 1500 gaacgtagtg attctcgtac aagtgtccga tattcggttc aatccgagca aaacgagaat 1560 gctaatttca accagcataa ttgtaacaca cacaggtgga acgaaaagtc gatactgatc 1620 cagtatgtcc ctgtagtgct ccatcacaat ggaaacactg tgcacaccta tgcatttatt 1680 gattgtggat cgcatatcac gttactagaa gaagagttgg ccgcggatct gaatcttcgc 1740 ggagaaaaac atcctctgtg catccgatgg accgccgatc attgccgttt cgaagattct 1800 gcagaaagag tctcgctcca tgtatctgga actggaaatg gggaaaacaa atttctgctc 1860 tcagaggtgt tcaccgtaaa agacctcaag ctaccaccgc agacactctc cgtatcaaag 1920 ttgtgcagaa aatatggtta tttaaaaggt ctgcccgtag agtcatataa caacatacgc 1980 ccacgcttgc ttatcggtat gaacaacatc agactcggtc atgcgctgga tagtcgagaa 2040 ggcaatgaaa atggacccat cgccacgaaa acaagactgg ggtggacgat tttcggaact 2100 tgtcaagatg atcatcccaa gccgtctact attccctgca gcttccacat ctgttcacac 2160 tctaccatag cagacgagga tttgcacact gtagttaaga actattttgc ccttgatagt 2220 ttgggcatca cggcaaacaa acaactacta tcggtggaag atgaacgagc cctgaagatt 2280 ctacgttcca aaacgcggct cgaaaacggt cgatacacga caagcctcct gtggaagtat 2340 gatgatgtac gtctacccaa caataaagcg atggctttaa ggcgtcacag ttgtctcttg 2400 aaaaggatca aaagagaacc ggctctaggt gagacactga ggaaaaaagt ggccgactac 2460 gttcgaaaag gatatattcg caagctgtca tatcaggaga tacaccaacc aggcaatcgt 2520 atttggtacc tgccaatttt tcccgtcatc aaccacaaca agccaggcaa agtacgtatt 2580 gtgtgggacg ccgccgcggc tataggcggc acatctctca actccgttct ccttaaaggc 2640 ccagatcaac tggaacccct accgtttgtt ctatacaaat ttcgagaacg ccgggtggca 2700 atatgtggtg acattgagga aatgttccac caagtcagaa tagacgaggt ggatcaacag 2760 agccaacgat ttctatttca agacaaaccg gatgcccctg agccagacga atacgttatg 2820 aatgtcatga cgtttggagc cacttgctcg ccaagcagcg cgcacttcgt catcaacgaa 2880 aatgctgaac gtttcgctgg tcaactacca gcagcagccg gagcaattcg gaagtgtcac 2940 tacgtggatg acatgttggc tagcgtcgac acggaaccag aagccatcca gctagcaaag 3000 gacgtacgct tcatccatca acagggtgga ttcaaaatga gaggatggat ttcaaattct 3060 tcaacggtta cagaagcact caacggcgaa aacgtttccg ataagaactt ggaccttatc 3120 aaagaggtcg ccctggagaa ggtgttaggt atgtggtgga acaccgacac agacgaattc 3180 cagttcaaac tatcgaaaga gcgccatgtg gacctgttat ccggtgctaa gtatccgacc 3240 aagcgagata tacttagtat gctcatgtcc atctacgatc cactaggcct tattgctaat 3300 ttcctcatgc ccctgaaact attgctacag gaaatttggc gatcaggcgt gtcctgggac 3360 gagccaattg agtttcttca tatgcaaaag tggaaatcat ggttgagcta tttgccacaa 3420 gctggatccg ttcgaattcc tcggtgttat aatatcaggc tctcatatga ccaactaaac 3480 atcgagctgc atacatttgt ggacgcaagc gaactcggtt actgcgctgt tgcttatcta 3540 cgatttgaga aggctggaca agtcgattgc gttctagtgg gcgccaaaac gagggtggca 3600 ccactcaaat ttgtttcaat accgcgcctg gaacttcaag cggcggttgt cggcacccgg 3660 ttagccaaga gcatccaaga agggcactcg attagcataa cgcaacgcta cttctggaca 3720 gacgctgcag atgtactgtg ttggcttcgt tcggaccacc gggcatactc gcaattcgtt 3780 gcatttcgga taagcgatat tctggagtcc agtgatatag ctgaatggaa gtatgtacca 3840 accaaagaaa atgttgcaga cgaaggtaca aaaaatcaac gacaacttaa acttggttct 3900 gacagccgat ggagaagagg gccccccttc ctctggggac caaagaacac ctggccggta 3960 gaatcacctt ctaagaagat gaccacacta gaaatgcggc caagtatgct tcatcataca 4020 gtgatcccag ttcgcgactg tttacaacct aacacgtttt ctgattggct tcgtttgcga 4080 cgcattactg ctctagttgc aaggtttcca atgaacataa gaaagaaaat tgcaggagag 4140 ccgatcgttg ttggtactct aacatgtaca gaacttctac aggcagaagt ataccactat 4200 aaacgtgccc aatatgacgt ctacgcagat gaaatggccc ttttgagcac ccacgagaaa 4260 ggtaagcaga ttctgccgaa agggaaccga ctgtataagc taaccccaac aatcgacaat 4320 gatggtgtcc tccacatcca tggtcggatc gatgcatgtg attacgtcgg caaaaatacc 4380 aaatacccga tcgtgttacc acgtggacat gcactcacaa gactgattct ccaaagcatt 4440 catgaggagt accatcatca ctgccatgaa acgtttgtca acgaagttcg caaaaagttt 4500 tatatcccca agattcgtat ggaatgcaga agtatccgct ctaactgcca acagtgtaaa 4560 gtacgtcggg cacaaccaaa tccacctgcg atgggcgatc ttcctgtagc acgcttagca 4620 gcatttgttc gcccattttc gtatatagga gtggactatt ttggcccttt ccatgtagca 4680 gttggtcgca gagttgaaaa acgttggggc gttctagtaa catgccttac cgttagagct 4740 attcacatag aagtagctca ttcacttaac acggattcgt gcattatggc attgcgcaac 4800 tgctttgcac gtcgaggagt ccctatccaa attattagtg accgtggtac caatttcata 4860 ggcgccgaga aggagcttaa aaaagcgcta gaattggtcg atcaagacgc aatgataaag 4920 aagtttacta cctcgtcaac atcgtggaca ttcaaccctc cagccacacc acatatgggc 4980 ggatcctggg agaggttgat acaatcagtg aagaaggtcc tttatgatat ccagccacat 5040 cgagtaccaa gagatgaagt actgaagaac acacttatcg aggttgagaa tatagtgaac 5100 agcagacctt tgacacatgt gcccatagac gacgaatcat caccggcgct aactcctaat 5160 cattttttag tgggatcgtc gaatggtttg aaacccttgg tgcccttcga tgattgtgca 5220 acagcgctta agcagtccca caaaacctcg cagattctcg ctaacttctt ttggaaacgt 5280 tggatcaacg aatacctccc aaacatcact cgtcggacaa aatggtttgc tccggtaaaa 5340 cctataacga tcggagatat agttgttatc gttgacccta agcaaccaag gaactgctgg 5400 ccaaaaggga gagtgttagc tacgactctt tcgaaggatg gacaagttcg tcaagctact 5460 gtacaaacat ctggagggat ttttcaaaga tctgcagtaa accttgctgt gctagacgta 5520 ggcgcaaaaa ggagtgagtc ggaccagggt ccaatcactg ggggggac 5568 // ID Gypsy-173_AA-I repbase; DNA; INV; 3947 BP. XX AC supercont1.174; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-173_AA_; KW Gypsy-173_AA-LTR; Gypsy-173_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3947 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.174; Positions 877193 873247. XX CC 'ATAGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 614..3784 FT /product="Gypsy-173_AA-I_1p" FT /translation="MAQSALATNIEPYRKGAGFTEWAERLELLFLINEVKE FT EHKRAYLATLGGPVVYAELKLLYPNTDLNAVAYNDMISKLKARFDKVAPDI FT IQRLNFNNRIQQKDESVEDFVLAVKLQAEFCTFGDYKSVAIRDRVIAGIRD FT KALQQRLLNEENLTLATAEKIIATWEVAGANSKGLADNSGIEKIATIASTN FT KPDSKPRSTLEKLLATIELARAENPDLGEGHSRGPVRSRLGFRPNIQRRHP FT TYNAQMGPYNPRQRGDWRGTRDRQRGQYVYNRLPQGASSSASIFQQVMDQV FT LHGIDFVYCYLDDVLIAGSNVKECKERLLLVLKRLSEANIKVNLDKCRFFV FT KELDYLGHIISDKGLLPCPEKTLTIQKAKTPKNEMELKSFLGLINYYNKFI FT PRLSVKLYHLYNLLKNKVKFVWDEHCDKAFEESKNLLLNAQFLEFYDPKKP FT IVVVSDASSYGLGGVISHIIDGAEKPISFTSFSLNSAQRKYPILHLEALAL FT VSTIKKFHKYLYGQKFQVFTDHKPLVGIFGKEGKNPIYVTRLQRFILELSI FT YEFEIQYRPSTRMGNADFCSRFPLEQMVPDDLDAECVKSLNFGKELPIDYK FT VIADWSKEDGFLQQVVSHLQNGWPKKLEKCFADIFANHQDLEIIGDCVLYQ FT NRVVIPKGMQSKVLKLLHANHAGIVKMKQLARRNVYWFGINNDIEKFVSSC FT DICGSMAIVPKPKVLSKWIPTARPFSRIHIDFFYFEHHVFLLIVDSFSKWL FT EIEWMRNGTDASKVINKLIVYFARFGLPDMLVSDGGPPFNSGAFISFLERQ FT GIDVLKSPPYHPSSNGQAERFVRTVKEVMKKFLMERSVMELELENQINLFL FT ITYRNNCLTKDGDCPSERIFSYNPKTLIDLVHPKKHPKQLLKDQLPHDDTN FT IPREDGDVHMRSNDPLNDLIAGEELWYKNHNPHVKARWVKANFIKWHSRNI FT LQVQTGSVPIMAHRNQVRKCNTDNRSKPNTVVLLRRDDEGGVEMNQFPDDR FT EVATASVPQESGDNRRRRKRKYEDPMDEQSLILRRSSRSKKPKKDDIFVYN FT " XX SQ Sequence 3947 BP; 1288 A; 676 C; 869 G; 1114 T; 0 other; agtggcgacg agttcggcga aaatcgaacc agagaaagtg gtaagcctag cagtcgtgga 60 acggcgcaga cgttggaacg ttcgattaag ctgaagaaaa gtaagagtga tcacggcaag 120 atttacagtc agccattgtg ctatttcagc attgtcaact gtgattgcct attgtgatct 180 tttgtgaaac atcctgcaac gcacattggt ggcacaaatc tcaaacaaga taaacgcagt 240 ccaatccgat agtgaaaagg gaacaacggc tattggattt catccagatt agcagccaaa 300 caacggttgg aaaacgtatc gatcggttga ggggagtcgt ccggcaaacc aacccgagac 360 cagtgcattg ttgagaggta gatacgaacg ctgtgacgtc ggcaaaggtc gatacattct 420 tttggtgagt caattttgtt acatattttt aaacacaaaa tagcgacaat tttcttttgt 480 ttatctctat acattgtgtt tgaacaaaac tgttccctat tgacgaaaat tcacagcgct 540 tttgtttgat ttaccttttg ataattttga attaattgtg tggtataata gcctttagta 600 gtactgacta aatatggcgc aatctgcttt ggcaaccaac atcgagcctt acaggaaagg 660 tgctggtttc acagaatggg cggagcgcct agaactattg tttttaatca atgaagttaa 720 ggaggagcac aaaagggcat atttagccac cttgggtggt cccgtagtat atgctgagct 780 aaagctatta tatccaaaca ccgatttgaa tgctgttgca tacaacgaca tgatatctaa 840 actgaaagct agatttgaca aggtggcgcc tgatattatc cagcgtctta actttaataa 900 tcgtattcag cagaaggacg aatcggttga agattttgtt cttgctgtta aattgcaggc 960 agaattttgt acatttggag actataagtc tgttgccatt agggaccggg taattgctgg 1020 tataagggat aaggcccttc agcaacggct ccttaatgag gaaaatttaa ccctggctac 1080 agcggaaaag attatagcca cgtgggaggt agcaggggca aactccaaag gtttagccga 1140 taatagtggt atagagaaga tagctacaat tgcatcaact aataaacctg atagtaaacc 1200 acgtagtact ttggaaaaat tgcttgcgac gattgaattg gctagagctg agaatccgga 1260 tttaggtgag ggtcatagta ggggaccagt caggtctaga ttaggtttta gacccaacat 1320 tcaaaggagg catccgacct acaatgctca aatgggacca tacaacccca gacagcgagg 1380 agattggagg ggcactcgag acagacaaag aggacaatat gtgtataatc gactaccaca 1440 gggggcgtca tcaagcgcct caattttcca acaagttatg gatcaggtgt tgcacggaat 1500 tgattttgtt tattgctatt tagatgacgt attaattgct ggatcgaatg ttaaggaatg 1560 taaagaaagg ttgcttttgg tattgaaaag attatcggaa gccaacatta aggtaaattt 1620 ggacaaatgt cgctttttcg ttaaggaact tgactattta gggcacatta tcagtgacaa 1680 aggtttactg ccatgccccg aaaaaacatt gactatacaa aaagcaaaaa cccctaaaaa 1740 tgaaatggaa ttgaaatctt ttcttggttt aatcaattat tataacaagt tcattcccag 1800 attgtcagtt aaactgtacc atctctataa tttgttgaaa aataaggtta agtttgtttg 1860 ggatgaacac tgtgataaag catttgagga aagtaaaaat ttacttctca atgcacaatt 1920 tttagaattt tatgatccaa agaaacccat tgtggttgtt tcggacgcat ccagctatgg 1980 cttgggagga gttatatcgc atattataga cggagctgaa aagccaataa gttttacatc 2040 gttttctttg aactcagcgc agcgtaaata ccctatttta cacctagaag ctttagcttt 2100 agtgagcact ataaaaaaat tccataaata tctatatgga caaaagtttc aagtttttac 2160 ggatcataag cctctagtcg gcatatttgg caaagagggt aaaaatccta tttatgtaac 2220 gcgtttacag agattcatac ttgagctctc catttatgaa tttgaaatcc agtatagacc 2280 atctacccga atgggcaatg cagatttctg ttccaggttt ccgttagaac aaatggtacc 2340 tgatgattta gacgcggaat gtgtaaaaag cttaaacttt ggaaaagaat tgcccattga 2400 ttacaaagtg atagccgatt ggtctaagga ggatggattt ctgcaacaag ttgtttctca 2460 tttgcaaaat ggttggccaa aaaaacttga aaagtgtttt gcggacattt ttgcaaatca 2520 tcaagatttg gaaatcattg gcgactgtgt attgtatcag aacagagtag tgattcctaa 2580 aggtatgcaa agtaaagtat tgaaattgct tcatgccaac catgcaggaa ttgtaaaaat 2640 gaagcagctt gcgaggcgta atgtctactg gtttggaatt aacaatgata tagaaaagtt 2700 tgtctcatca tgtgatattt gtggcagcat ggcaatagta cctaagccta aagtgctatc 2760 taaatggatt ccaaccgcta gaccgttcag cagaatacat atagattttt tttactttga 2820 acatcatgtt ttcttgttaa tcgtggatag tttctcgaaa tggcttgaga tagagtggat 2880 gaggaacggt accgatgcaa gtaaagtcat taataagttg attgtatatt ttgcacgttt 2940 tggattgcca gatatgctag tttctgatgg aggaccaccc tttaactctg gtgccttcat 3000 aagtttcctt gaaagacaag gtatagatgt attaaaaagt ccgccatacc acccatccag 3060 taacggccaa gccgagagat tcgtaagaac ggtgaaagaa gttatgaaga aatttttaat 3120 ggaaagaagc gttatggagc tagaattgga aaaccaaata aatttatttc taataacgta 3180 cagaaacaac tgtttgacga aggacggcga ttgcccgtcc gaacgaatat tttcgtataa 3240 tccaaaaact ctaatcgatt tagttcatcc caaaaagcat cccaaacaac ttctaaaaga 3300 tcaactaccg catgatgata ctaacatccc tagagaagat ggagatgtcc atatgaggtc 3360 aaatgatcca ttgaacgatc tgatagcggg tgaagagtta tggtataaaa atcataatcc 3420 tcacgttaaa gctagatggg ttaaagcaaa ttttattaaa tggcattctc gaaatatatt 3480 gcaggtgcaa actggaagcg ttccaatcat ggctcatcgc aaccaggtgc gcaaatgtaa 3540 cacagataat cgttccaagc ctaacacagt ggttctctta aggcgtgacg atgagggagg 3600 agtcgaaatg aaccagtttc ctgacgatcg tgaagtggca acggccagtg taccgcaaga 3660 gagtggggac aatcgaagaa ggcgtaaaag gaaatacgaa gacccaatgg acgaacaaag 3720 tttaattctt cgaagatcgt ccaggtcaaa gaaacctaaa aaagatgata tatttgttta 3780 taattaatca catttgtgaa ctcgaatctt gtttgtttat aaaattgaat tttaaagtga 3840 atacattatt ctatcgaata tagatggcca ttagcggcgt agtcaaaata ttctatattt 3900 aatcgtcggc aagagccgga gtaaggacat ctaaagggat gagaagc 3947 // ID BEL-129_AA-LTR repbase; DNA; INV; 648 BP. XX AC supercont1.127; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-129_AA_; KW BEL-129_AA-I; BEL-129_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-648 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.127; Positions 1983984 1984631. XX SQ Sequence 648 BP; 207 A; 108 C; 136 G; 197 T; 0 other; tgttgtgaga acccctgcgc tctcactatg accataaaac gtgagtcccg aatgctccac 60 gagccgtcag tctgacatcg atatcaggaa tataacgatg catgatcgta aaacgtcaga 120 tggagcatga aatgggaacg tgtttatgct gcggccgtag ataacaataa ccatagcaga 180 gagaagtaga gaattgttcg attttggagt tacttaatag actattcagt gaaactgtaa 240 gttaatatta ctaatttgta ccatgcattt ctaatagtat tatttacagt atcagattca 300 gtgttattga gagtgaattc gccaagttgc agtcagttag taaactagct cgattagtga 360 tgtaagtcag tttgaaattg gaggtttcaa tgaagtttaa ttagaaatta atataaaata 420 ttttttgtat agctcaagtc tgcttcccag taagatacgc acagttattc gcggaggttc 480 tggacaggtt tggttaaatc gcactgaaat gtaagaattt gaatttatga aaatgagccc 540 acgaccacta aagaattatt tcaataaatt tcagcttaaa gctgctgctg ccagtaacgt 600 agttgctacg agaggtgata tttccatcca gaactttctg tccgaaca 648 // ID Gypsy-17_DWil-LTR repbase; DNA; INV; 218 BP. XX AC scaffold_180762; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_DWil_; KW Gypsy-17_DWil-I; Gypsy-17_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180762; Positions 44657 44874. XX SQ Sequence 218 BP; 74 A; 32 C; 38 G; 74 T; 0 other; tgtagcaata tgtattaata ttaagtatat cttactcttt cccccagtgc aatatgttta 60 gaattaagta tatcttactc ttttctttgt tacaaattag tattaagttt tatgcaatgg 120 tacagtacga gaagcgtgct gtctcacttt aaagaaaaga acaaagaggt tcaaataaaa 180 agaatgagag cggctcagtt tttatggaga gctcacca 218 // ID Ginger1-6_HM repbase; DNA; INV; 5214 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.01, Created) DT 02-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5214 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 44-bp long. Tpase gene contains 2 introns: 289-724, CC 887-1006. XX FH Key Location/Qualifiers FT CDS join(177..288,725..886,1007..2847) FT /product="Ginger1-6_HM_1p" FT /translation="MAVHVYSVDDVKQYLSDGIIKFFNSRRIAKNFKLCGM FT YGKLYYIKSSKKLLVLLTDHEKKLAFRDVHDSDHGAHVGLNNTRAKLKSSF FT YWLGMVSDITKWIKECDKCQRMEKIKTVAPELKPIRVNGLWDFLGIDLIGP FT LPITSKGQKYILTVTDLWSKYVEAFPIPEKSALCVSKCLTTLFYRFGPPKK FT ILSDQGREFVNSVNELLFSLFNVKHLITSAYHPQTNGQDERTNQTVKRALS FT KLCNEAQDNWDELLEPVLFGLRTCVQNSTKFSPFFLMFGREPQLFSALAMN FT PTNSNLIDGNNDIHSTESQIQERIDSCNKVSIVVNSNILYAQTKMKKYYAA FT KQMKGCKSFSFKIGDKVLLKNCRKIGRKGSRMEHDWLGPATITSFKQNGAV FT VTRNGKVWRNAVALINMKPYIEKNQTLSSAIILNEHNYCLPRKIKMARNNK FT NIIEPIKSNKRHIVDSKCYVKGRKNKLTKVENSLSKNVYPAISVNNKFFLN FT LSTQSVKLGLDILKNPCGWLNDTLIDIAQSFLSSQFPHIAGFQSSCIFNKN FT NSGGYIPSGKFIQILNVRNSHWILISNVSSDTCLKSSVQYYDSLFTSSKED FT VPLLVHRVAQSLLICEDVSSISFDVMNCQTQDNGNDCGLYAVANATALCNG FT IDASLIVWEKKSMRKHFLMCIEKGKLEMFPYITILNGLKRCALSFLCDDTC FT HCALG" XX SQ Sequence 5214 BP; 1821 A; 717 C; 730 G; 1946 T; 0 other; tgttgcgtta aaaaagttag tcgttaaaaa agtttacaaa ctgttatacg catgcgttac 60 gcttgcgcag ttttgataaa aagtttccgt tagggagtga aattcagcgt taaaaaagtt 120 tcatcgttcc attagggtat gtaaaatttt aaatataaac tcatgttata ctaaaaatgg 180 ctgttcacgt ttactcagtt gatgatgtaa aacagtatct ttctgatgga ataattaagt 240 tttttaactc acggcgtata gcaaaaaact ttaaactttg tggtatgtgt ttatttataa 300 atgacataat aaaatttaaa tggtaatatt tatattataa tatatattat aaatatatta 360 gtaatatttt atatatatat atatatatat atatatatat atatatatat atatatatat 420 atatatatat atatatatat atatatatat atatattgcc aaatgatctg ttaagcacaa 480 ttaaaataaa tatttgaata aaattaacaa cgcttttaaa atgtttgtat acatgtatca 540 aactactacg atggatagtt ctatatatat atatatatat atatatatat atatatatat 600 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 660 atatatatat atatatatat atatatatat atatatatat atatatatat ataatttttc 720 ttagatggta aattgtacta cataaaatca tctaagaagc tcctagttct tttaacagat 780 catgaaaaaa agttggcttt cagagatgtc catgattctg atcatggggc acatgttggt 840 ctcaacaata ctagagctaa actaaaaagt tctttttact ggctaggtta tttttttact 900 taaatataga tttttaaaac taaagtttgt taaataaatt tgaaataaaa aatgctaatt 960 catttgctta tgatactgtt atatattaaa tttctttgtt tttaaggtat ggttagcgat 1020 attaccaaat ggatcaagga atgcgacaaa tgccaacgaa tggaaaaaat taaaactgtt 1080 gcgcctgaat taaaacctat tagggtaaat ggtctatggg attttttagg tattgatcta 1140 attggcccac tacctattac atcaaaaggg caaaaatata ttttaacagt gacagacctc 1200 tggagtaaat acgttgaagc tttccctatt cctgagaaat ctgctttgtg tgtatctaag 1260 tgtctcacca ctttatttta tcggtttggt cctccaaaaa aaattctttc cgatcaaggc 1320 agggagttcg taaacagtgt aaatgagctt ttattttctt tatttaatgt caaacacctt 1380 ataacatctg cttatcaccc acaaacaaat gggcaagatg aaagaacaaa ccagactgtt 1440 aagagagccc tttcaaaatt atgcaatgag gcccaggata actgggacga gttgttagaa 1500 ccagtattat ttggtcttcg cacatgtgtt caaaattcaa ctaagttttc accatttttt 1560 ctaatgttcg gcagagaacc acaattattt tcagctttag caatgaatcc aaccaattca 1620 aaccttattg atggtaataa tgatattcat tcaactgaaa gtcaaattca agaaagaatt 1680 gattcatgta ataaagtcag cattgtggta aattcaaata ttttatatgc tcaaacaaaa 1740 atgaaaaagt attatgccgc taaacagatg aagggttgta aatcattttc atttaaaata 1800 ggtgataaag ttttattaaa aaactgccga aaaattggtc gtaaagggag ccgaatggag 1860 cacgattggc ttggacctgc tacaatcaca agctttaaac aaaatggtgc tgttgtaact 1920 cgtaatggaa aagtttggag gaatgctgtt gctttaataa acatgaagcc atatatagaa 1980 aaaaatcaga ctttatcatc agcaataata ttaaatgaac acaattattg tctaccaagg 2040 aaaattaaaa tggcaagaaa taacaaaaat ataatagaac ctataaaatc aaacaagaga 2100 catatagttg actcaaaatg ctatgttaaa ggtagaaaaa ataaattgac taaagtggaa 2160 aactctttat cgaaaaatgt ataccctgca atttctgtta acaataaatt ttttttaaac 2220 ttatcaacac agtcagttaa attaggcctc gatattttaa aaaatccatg tggttggtta 2280 aatgatacat taatagatat agcccaaagc ttcctatcat ctcagtttcc tcatattgcc 2340 ggatttcaaa gttcatgtat ttttaataaa aataattcag ggggttacat cccttcagga 2400 aaatttattc aaatactcaa tgtcagaaat tcacattgga tactgataag taatgtgtca 2460 tctgatacat gtttaaaatc atcagttcaa tattatgatt ctctttttac cagttcaaaa 2520 gaagatgttc cattacttgt gcatcgtgtt gctcaatcat tattgatttg tgaagatgtt 2580 agctctataa gttttgatgt tatgaactgc cagacccaag ataatggcaa tgactgtggt 2640 ctttatgctg ttgcaaatgc tacagctcta tgtaatggaa ttgatgcaag tttaattgtc 2700 tgggaaaaaa aaagtatgag aaaacatttt ctaatgtgta ttgaaaaagg taaacttgaa 2760 atgtttcctt atataaccat attaaatggg cttaaaagat gcgctttatc atttctttgt 2820 gatgatacct gtcattgtgc tttgggataa ttttagggta ggatatatta ttacatttat 2880 gattttttaa atgaatattt gcatatgtat gatactgtca aaaaattaaa aaaaaaaaac 2940 aaatttattt tatagatgct tttgtgatag aaagaagatc tattttcttt ttgatgcatc 3000 caaatgaaag aaggaaactg atttttgttg ataatatata tatttctgaa attctttaat 3060 cttttttaat tattttacaa gtttataaaa aaaaaaagtt ttttgattta tatcttgcaa 3120 caataaaaaa aaaatatttt tgctcatttt ctcacatctt gaacataaag tttttttata 3180 attatttttt ataatagttt taagtttaaa ggatattata tttcttttgt attgaaagaa 3240 atattcttat ttttacctac cattttttta tttcagctgg cgtactgttg aaacatggtt 3300 ttaatttagt aaatttttat gatttgaaat ttgctatatt aatcataagc caacagggaa 3360 aatctagaac tctacaaaat acaaggtaag aataatttgg tttaatggta acaaaacttt 3420 tttttatgtc cagataagca actttgtgct actgtattat gttttattca atattttagc 3480 cattgttgtg ctagtaatga tatttattta taataattta ttttatgtct tttaattatt 3540 tttaacagag tgtcttggaa gttactctaa tttagtaagt tggtaaaatt atttgaaatt 3600 taattgtttg ttaaatacgt ttatctctag acgtcttata tttcttgttt tttaagaaat 3660 taaaactcaa aagattattt aagtttaatg tttttaatgt ttaaattata cttttaataa 3720 tggaaaaaaa taattttaat caatgttgta ttgttaaaaa atactttgaa tagtctttat 3780 ttactttttt tctacgtatt tgactttcat ggcttcttgt cgaagttggc tctcatcccc 3840 attttgattg cggtaaaaaa tgttttgaaa agtaactcta taaggaaagt tatgaaatcc 3900 ttaactccac ttcactttcc aattgtaata gtttgtttat ttatttatgt tacttttttt 3960 ttactgatac acgttcattg gactcctagt actcacgtct tccatatgtt agaccaattt 4020 aaaagacttt caactatttt tatattgata gctcttacta tcgagacatg ctactacact 4080 ggtcacacac aagtatacca cattgacaat ttacattaca cacttagtta taagaactcg 4140 aaacagttca caactctcat tgacccaact tatactgtga caaaccactg cacttaaaga 4200 cactcatgtc aaaaaattac ttactattgt ttgcaagttg tactgaaccc cgctaagtgt 4260 cctctggttt cagtgtcaat tcaggagtct gaggcaatcg tcctgactgg atactctgag 4320 aatagtcctt ttgggataac acaatgggat atcttaaaaa gataccctgg tgtagaccca 4380 gagtggaact tcttttggga taccagtctc aaatgaaggc tggaggtctt catgtgttca 4440 tactgtacca taaggcacaa atccatactg tactcgagac gttctcgaca cttatatcta 4500 taatgtatgt ttatcaccct aataatacta aacactcgtt gactacttat aaagcgaata 4560 attctactcc attaccaatt ctgtctatac atcacattta ccctagacac taacaacaat 4620 ttagtagcat tactctggat actctgaaaa tagtcctgaa atgactttcc ttggctattg 4680 ttttttattc ttatttgttt gttactacat tttacttata tcagttttta gtatttttct 4740 actttaagac tctttgtttt tccactggat atctgtcttt tgtactttta tctcactctg 4800 ctaaactctt tttaacttat tattcatctc ttaggtgttc actgctgggt gcagcttcac 4860 taaaatgatt aaaaaaggag tttatttaat catttttagg agtagctagc agttttagca 4920 aagtgctata gcgatcggct ttaataatta gaaacaaaat aaaaatttaa attatataga 4980 aaaacaataa aaatatcaaa atttaatcat aaatatttgt tcatattttg tacatcttac 5040 tcatcgctaa aatattttgc atatttgcaa ggtttgcgct gccattttta aaactttttt 5100 aacgggtaaa cttttttaac gcctatacgc gttaaaaaag ttttaattat tttgcgcacg 5160 cgcagaattc acattttgta aactttttta acggctaact tttttaacgc aaca 5214 // ID LOA-10_AAe repbase; DNA; INV; 6327 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-10_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6327 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1420-1420 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 982..2553 FT /product="LOA-10_AAe_1p" FT /translation="MEIDRESDIVESNSLEEIESCSSSILDTPNREDMYVD FT DDDYDDGINVTILSSSATLEFNKKAVENSMVEKKLNASATSNSNGITENNS FT GNKTPFTGANPKDKKHKRLNGAAKKRFKYLVDHGHSRDEARLLAEKPFRVL FT ESDPHKRRRNADLSESNSSETNPPKRLARQLDRGQTLQVRSSVQNRLEQAR FT KGEGSSVRKEAAGNSGPHKPLYSEVANYIRIGILPEGFPNIELSTQQLTAT FT QNEILKKVAEQRKEPIKPKFGSRVFRPGHMIVTCKNQDTANWLKETISQIQ FT PWENACLIAVDEKDIPRPEILVGFFPLSEQDSNEEIFALVESQNEGLLVDS FT WRVLKRYTVKQHHVELMFTVDNVSMKSLESNKFIIDYKFGVAYIRHRNTNT FT ESTEGSHNEITRGEKTASDSVGYTREDPQFSGAVEDAQMADPTQKDEADGL FT ELEKTIIMENGCEGSSKTHKDKSDKNLNRSINPLPGSSKGGNNSLPYRQIR FT GQILPDRPKQSTKAEQSACKTLETNKHE" FT CDS 2558..6208 FT /product="LOA-10_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTQRSIKFIQENLHHAKAASAILSRTFTKDKIDVALL FT QEPWTKNHKILGINILGCKLIYDNTQLVPRAAILVNCNAKFTPITEFIGKD FT IAAISLEVPTAKGNTLIYVASAYFPGDVEEVPPPEVAAFVSHCRRQNKGFI FT IGCDANAHHTIWSSTDINSRGESLFSFISQNDIDICNRGNAPTFINAVREE FT VLDLTLCNSTISGSIEKWRVSDEISLSDHRQILFEFATKDIVRETFRNPRK FT TNWELYDSRLRLQNESTSNIINTSVELEKAADCLTTSIMVAYNESCPIKVR FT STSRDVPWWNDRLDKLRRNARKLFNRAKRTSNWSQYREALTNYNIEIRRSK FT RKNWRHMCESIDSTPETARLQKVLSKDHTNGLGTLKKNDGSFTENTQETLE FT LMMETHFPGSLPCSSDQTSTFSLSNGSDMSNEEASDLADHIFTYSKIEWAL FT DTFDPFKTPGLDGIFPIHLQKCKEKIIPTLLTLFKCSFLLRYIPTKWRQVR FT VVFIPKVNKKDKTLPKSFRPISLTSTFLKLMEKLIDEYIKSDIIKHKPLNR FT AQFAYQTGKSTTTALHSLVTKIEKTFDYKELLLAAFLDVEGAFDNASFTSM FT RRAMRSRGFSKSIIDWIEEMLSKREISAYLGDSVVRVRAVQGCPQGGVISP FT LLWSLVVDDLLIKLQHQGFEVIGFADDIIIIVRGKYDAIVSDRMQSALSYT FT LRWCQNEGLNVNPNKTTIVPFTKRRKISISSLKLGDVRLSLSSEVKFLGVI FT LDSKLSFNRHVEQQIEKARNAFWGCKRTFGRKWGLKPRMILWIYTAIVRPT FT ITYASVIWWEKTKQISTQSKLNKLQRLATAALTGAMRSTPSKALDAMLNLP FT PLQDFIQMDAVKNASRLRRSSTINDSDLKGHMSIVKTLNINPIFSISEDCM FT MKRNFFDHLFCVPDITRQDWEVGQPNFRSGSTIFFTDGSKQDNLVGAGISG FT PGVNVSLPLGCWPTVFQAEIFAILECADICLKRRYKNANICICSDSKAALN FT ALKSNVYTSKLVWECTMLLQQLSCRNKVNLYWVPGHCGIEGNEKADQLAKI FT GSSTQFIGPEPYFGIAPCGFKLELKNLERTKIKLTWTNTSDSRQAKRFIEP FT DARKTLRLINLNKHELSTYTGLITGHCPSKYHLKIIGKLQEDKCRFCKLES FT ESSEHLMCECVALYRKRCRYLEKGLLEPWEIWNSHPKQVLSFIRNAVPDWD FT TRQLMGG" XX SQ Sequence 6327 BP; 2104 A; 1210 C; 1259 G; 1754 T; 0 other; atttggcaac cctgctgagc tctgctagaa tcttaaaccg cccccgtatt tactgactgt 60 tttctaagtg taaaccgcct tataaatcgc gagtgacctt ttaacagttt gtttaatcgg 120 tgaagtgaga agattgattg cggaaaggta agggagtgca tcttttagcg tttaattaag 180 agatcattag tgctggaaaa agtaccccgt tatatttcta tagtggcaca aacaaaagcc 240 acaaaacgag cgagagaggc tgaaattgaa tcaggccttg catgcaagtt tccactgtta 300 ccaatggtgt gtgtgttttc aacatattga tttgtatggc ataaataggt gaggtgatgc 360 tgctaatttg tgcctcttaa tacttttaat agtgaatgaa gagctttctg tgctcacaaa 420 ctgctattta aaaaacaaaa ataattagaa gtagtgtagt gtaatagtaa atattaccgt 480 aatttctctt tgaacacatt accggacaat ccataatatt caagtgctac ggaatacaaa 540 atgcctaata ccaatgcctg tttttgcata gtaagggtaa ctggtatgca gttttgattt 600 ttttttctct ctcttttttt tcaagttaat ccaatccttt tttttctctt tgcataaaag 660 ctgtttatct acctgcctaa tttaaattaa atgtttttgc tacttccttt tatatcaatt 720 tctaacaaac tgatattttc atatgttttt ttttgtctct tatctccatt ataaaactaa 780 gctggcataa taaattaccc aaaattacgt acttttttcg tctctgctga ttttctccga 840 ttcttctttt tttttttttt tcgctcacct tccccatatt ttatgtatac ctaatgcagc 900 aatcatgtgc gatctgttcg aatttcacag caactgaaag tagattcccc tcattgttac 960 agaatccaac tacgtattaa aatggagatt gatagagaaa gcgatatcgt cgaaagcaac 1020 tctttagagg aaatagaatc gtgttcctca tctattctcg ataccccaaa tcgggaagat 1080 atgtatgtcg atgatgatga ctatgatgac ggtataaatg tcactatttt atcgtcgtct 1140 gcaactctcg aattcaacaa gaaagctgtt gagaatagca tggtggagaa gaagctaaat 1200 gcttctgcca catctaactc aaacggcatt acagaaaata atagtggcaa taaaacgcca 1260 ttcactggtg caaaccccaa agataagaag cataagcgtc ttaacggagc tgctaagaag 1320 aggtttaagt accttgttga ccatgggcac agtcgagacg aagcacgtct tttggcggaa 1380 aaacctttcc gagttctgga atctgatcca cataagcgtc gcagaaatgc cgacctaagc 1440 gaatctaatt ctagcgaaac caatccacct aaaaggcttg cccgtcagct tgatagagga 1500 caaactttac aagttaggtc ctcagttcaa aacagactgg aacaagctag gaagggggaa 1560 gggtcgtctg tacgcaaaga ggctgctgga aacagtgggc ctcataaacc attatactcg 1620 gaagtagcga actatataag gattggcatt cttcccgaag gcttccctaa tatagaactt 1680 tctacccaac aacttactgc gacgcaaaat gaaattctta aaaaagttgc tgaacaaaga 1740 aaggaaccaa tcaagcctaa gttcggcagc cgcgtgttta ggccagggca tatgattgtc 1800 acctgcaaaa accaagatac tgctaactgg ttaaaggaaa caatttccca aattcaacct 1860 tgggagaacg catgcctaat tgcagtggat gaaaaggaca tacccaggcc tgaaattttg 1920 gtgggatttt ttccactaag tgaacaagac tcaaatgaag aaatctttgc tctagtggaa 1980 agtcagaatg aaggcttgct agttgattct tggagggtcc tcaagagata cacggtaaag 2040 caacatcacg tggaacttat gttcaccgtt gataacgttt ccatgaagtc tctagaatcc 2100 aacaagttca taatcgacta caaatttgga gtagcctata taaggcaccg aaacactaat 2160 acggagagta ctgaaggtag tcataacgaa ataacgagag gtgaaaaaac ggcttcagac 2220 agtgtaggat ataccagaga agatcctcaa ttcagcggag ctgtagagga tgcacaaatg 2280 gccgatccta ctcaaaaaga tgaagctgat ggacttgaac tcgaaaaaac aattatcatg 2340 gaaaatggtt gtgaaggctc ttcaaaaacc cataaagata agagcgacaa aaacctaaac 2400 agatccatta atcctctacc gggttccagc aagggtggaa acaactctct gccataccgg 2460 caaatacgag ggcaaatact acccgacaga ccaaaacagt caacaaaggc tgaacaatct 2520 gcctgcaaaa cattagaaac aaataaacac gaataaaatg acgcagagaa gcataaaatt 2580 cattcaagag aaccttcacc atgcaaaggc agcttcagca atcttatcta gaacttttac 2640 aaaggacaaa atagacgtgg cgcttttaca agagccatgg acgaaaaacc acaaaatttt 2700 aggaatcaat atattaggat gtaagttaat ctatgataat actcagctcg tcccgagagc 2760 tgcaattcta gttaactgca atgctaaatt tacaccaatt acagaattta tcggaaagga 2820 cattgctgca atttctttag aggtgccaac cgcgaaagga aatacgctaa tatacgttgc 2880 gtccgcctat tttcctggcg atgttgaaga agtacctcct ccagaagttg cagctttcgt 2940 gtcccactgc aggagacaaa acaaagggtt catcatcgga tgtgacgcta acgcccacca 3000 cactatctgg agcagcaccg atatcaacag tagaggtgag tctcttttca gttttatatc 3060 tcaaaatgat attgacattt gtaatagggg taacgcccct acttttatta atgctgtgag 3120 ggaagaagtg ttagacttaa ctctctgcaa ttcaacaata tcgggtagca ttgaaaaatg 3180 gcgggtatct gatgaaatat cgctatcaga tcataggcaa attttattcg aatttgctac 3240 caaagatata gtaagggaaa cgtttagaaa tccccgaaaa accaattggg aactttatga 3300 ttctcggtta cgattacaaa atgagtcaac ttctaacatc attaatacat ccgttgaact 3360 tgaaaaggca gctgactgcc ttactacttc tattatggtg gcttataatg aaagctgtcc 3420 aataaaggta cgctctacta gtagagatgt tccttggtgg aatgatagat tagataaact 3480 aagaagaaat gctcgtaaac tattcaatcg ggcaaaacgt acatcgaact ggtctcagta 3540 tagagaggcc ttaaccaatt acaacataga gataagaaga tcaaagcgga aaaattggag 3600 acacatgtgt gaaagcatag atagtactcc tgagactgca agacttcaaa aggttctttc 3660 taaagaccat accaacggtc tcggaacact taaaaagaac gacgggtctt ttactgaaaa 3720 tacacaagaa actctagaat taatgatgga aactcatttt ccagggtcac taccatgttc 3780 atctgatcaa actagcacat ttagcttatc aaatggatcg gatatgtcca atgaagaagc 3840 ttcagatcta gcggatcata tatttacata ttctaagata gaatgggcat tggacacttt 3900 tgatccattt aaaaccccag gattagatgg aatttttcca atacacttgc agaagtgcaa 3960 ggaaaaaatt attcctaccc tgctgaccct attcaagtgc agctttctat taaggtatat 4020 tccaaccaaa tggagacaag tccgagtggt gtttataccc aaagtaaata aaaaggataa 4080 aacattgccg aaatcattta ggccaataag tcttacgtcc acatttttaa aattaatgga 4140 gaaattaatt gatgagtata ttaaatcaga tataattaag cataaaccgt taaatagagc 4200 tcaatttgcc tatcagactg gcaaatccac gacaacggct ctccattcct tagttacaaa 4260 aatagagaaa acttttgatt acaaagaact acttttggca gctttccttg atgttgaagg 4320 tgcgtttgac aacgctagct tcacatcgat gagaagagcc atgagaagtc ggggtttttc 4380 aaaaagtatt attgattgga tagaagagat gctgtcaaaa agagagatat cagcatatct 4440 tggtgattca gttgtaaggg taagagctgt tcaaggctgt ccccaaggcg gagttatctc 4500 tcccctcctc tggtccctag ttgttgatga tcttcttata aaactgcagc atcaagggtt 4560 tgaagtaatt ggattcgcgg atgatatcat cattattgtt cgtggaaaat atgatgcaat 4620 cgtttctgat cgaatgcaat ctgcgttgag ctacacttta aggtggtgcc aaaatgaagg 4680 gttaaacgtt aaccctaaca aaaccacaat tgtaccgttt actaaaagac gcaaaatctc 4740 catatcgagt ctaaaactag gagatgttag actgtctctg tcttcagaag taaaatttct 4800 tggagttatc ttagatagta agctcagttt taacagacat gttgaacaac agatagaaaa 4860 agcaagaaat gctttctggg gctgcaaaag aacttttggt agaaaatggg gactaaaacc 4920 aaggatgata ctttggattt atacagccat tgtgagaccg actattacat atgcatctgt 4980 tatttggtgg gaaaaaacca aacaaatctc aactcaatcc aaattgaaca agctgcaaag 5040 attggctaca gctgcattaa ctggggcgat gcgtagcact ccttctaaag ctttggacgc 5100 aatgttgaat ctacctcctt tgcaagattt tatacaaatg gatgctgtta aaaatgcttc 5160 acgactcaga aggtcctcta ctattaatga tagcgatctc aaaggacata tgagcatcgt 5220 aaaaacatta aatataaatc caatattttc aataagcgaa gattgcatga tgaagcgaaa 5280 ctttttcgat catctctttt gtgttcctga tataactcgt caggactggg aagtgggaca 5340 accgaacttc cgttctggct caacaatctt ttttacagat ggctcgaagc aagataacct 5400 tgttggtgcg ggaatatctg gcccgggagt aaacgtttca ctaccactag gttgttggcc 5460 aacagttttt caggcggaaa tttttgccat cttagaatgt gccgacatct gtctaaaaag 5520 gcgctataaa aatgctaata tctgtatttg ttcggacagt aaggcagctc tcaatgcttt 5580 aaagtcgaac gtttatacat ctaaactggt ctgggaatgc accatgctac tgcagcagtt 5640 gtcctgtcgc aataaggtca atctttattg ggttcctggt cattgcggaa tagaagggaa 5700 cgagaaagct gatcagctag ctaaaatagg gtcatccact caatttattg gacctgagcc 5760 ttattttgga atagctccat gtggttttaa actagaatta aagaatttag aaaggacaaa 5820 gataaaactt acctggacca atacatccga ttcccgtcag gcgaaacgat tcatagaacc 5880 ggacgccaga aaaacactaa ggttaataaa cttgaataaa catgagttaa gtacttatac 5940 tggactaatc acaggtcact gtcctagtaa ataccactta aaaataattg gaaagttgca 6000 ggaggataag tgtcgcttct gcaagttgga gagtgagagc tctgaacatt taatgtgcga 6060 gtgtgttgca ctttaccgta agcgttgtag atatcttgaa aaaggcttat tagagccttg 6120 ggaaatctgg aactctcatc ccaaacaggt actaagtttc atacgaaatg cagtacctga 6180 ttgggataca cgccaactga tgggtggata gtcacttctt atagtgatga gtcctctcat 6240 cgcggcaaaa acaaagagga tgacaccaca aaagatcaaa taaatggtcg cagtggtaat 6300 atgtccccga cactggaaaa aaaaaaa 6327 // ID Copia-108_AA-I repbase; DNA; INV; 4086 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-108_AA_; KW Copia-108_AA-LTR; Ty1_copia_Ele43; Copia-108_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4086 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1493-2032] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 77..4086 FT /product="Copia-108_AA-I_1p" FT /translation="MEKDSDAVRVPLFDGSNYPSWKFRMLVLLEEHEMTEC FT IEEEVADVERLQVSAADNESVKATKLVALEARRKKDRKCKSILISRISDSQ FT LEYVQDQPTPRQIWLALQRVFERRSIASRLHLKKKMLTLRHEGGDLQEHFL FT KFDRLVREYKSTGAVIEDIDVVCHLLITLGPEFATVVTAIETMPEDKLTIE FT FVKCRLLDEEIKCRGKEFDSFGVKREPSAFAGKPTGAVRKWKCFSCQKVGH FT KAAECPERQKRKKEDNKKSSANVSEEKAGGICFAASGGVVREEKVLWIADS FT GASEHMSNNKNLFEELVPLKDPVEIAVALNGKSAIAKCSGKIKMIAVSGSK FT KRECTLENVLYAPDLRCNLFSIRKVEMAGMEIVFKDGSVKVRKDSEVVACG FT RRRGLQYEMDFYAESTSASSLYSCGKLQKCNELWHRRYGHLSEKNLELIVK FT KNMVTGFEKCAVDDNGEMFCEPCIEAKQTRKPFVSSSEKRASRVLELIHTD FT VCGPVTPVGHDGSRYYVSFIDDWSRFTIVYTIQSKDQVLGCFKLYEAMVTA FT KFGRKIARVRSDNGGEYRNEQFEKFCQQKGIQMECTVPYSPEQNGISERMN FT RTLVEKARSMLIGSGVDKVFWCEAVETAAYLVNRSPASALKEGKTPFELWE FT GRKPNISGMRVWGSPAFCHVPKENRKKLDSKAWKGIFLGYHCNGYRIWDPK FT RKKVVVQRDVIIDESRGTSKEVPALGKTSTDNEMIRITHDEEVGDTDFGNT FT EVPGDSHETSFEDCDEGSDSDSAADEDSAGAVEAPSDQIRTSGRRREPPAW FT HRDYDMNCAYAMSATNFVENFPDTLKEMRKRDDWPYWKAAVEEEMNSHRKN FT NTWSICKLPEGRKAVSSKWVFKLKRGENGAVDRYKARLVARGFSQRFGYDY FT TETYSPVARMDTVRTVLAVANQERLKVHQMDVKTAFLNGNLEEEIFMTLPE FT GSEGELNDVCRLNRSLYGLKQASRAWNERFHRFVTKLGFRQSENDRCLYVR FT GEGSQRVFLLLYVDDILIVCRNLKEIEKVKKRLAQEFEMSDMGPVGNFLGM FT SIDRNEEQRILRISQRCYLESLLCRFGMENCKPAATPMETRLQLKKGDEAN FT KTSKPYRELVGCLAYVAQCSRPDLCASVNFFSQYQSCPTDEHWNFLKRVLR FT YVRGTLDIVLEFHGSDKAEPFVVYSDSDWGNDVNDRRSITGCVFRVFGGTT FT AWLTRKQQTVALSSTEAEFVALCTAACEGIWLRRLLEDLGVVIEGPVKHFE FT DNQSCIKVTEEPRDSRRLKHVDVKFNFVRELVKDKRIDIRYIPSEKQLADI FT MTKGLASTAFNRLREALGLQKLSRG" XX SQ Sequence 4086 BP; 1156 A; 743 C; 1200 G; 987 T; 0 other; ataggttatg ggctacagtt tcgcgtgtgc cgggaagtag tagcgtgttt cggacgattt 60 caagtgtttc gtgaagatgg agaaggattc ggatgcagtt cgcgttccgc ttttcgacgg 120 gtcgaattat ccgtcatgga agttcaggat gctggtcttg ctcgaggagc atgagatgac 180 cgagtgcatc gaagaagaag tggcggatgt ggaaaggttg caagttagtg cagcggataa 240 cgaatctgtg aaagcaacga agttggttgc gttggaagcg cgaaggaaga aagatcgcaa 300 gtgcaaatcc attctcatct cgcgaatcag tgatagccag ttggagtatg tacaagacca 360 gccaacgccg aggcaaattt ggctggccct ccagcgtgtg tttgaaagaa gaagtattgc 420 aagtcgtttg catttgaaga agaaaatgct caccctccgg catgagggcg gtgatttgca 480 agagcatttc ttgaagtttg atcggttagt gcgcgaatac aaatcgaccg gtgcggtgat 540 tgaggacatt gacgtggtgt gtcatttgtt gatcacgctc ggacctgaat ttgcgacggt 600 ggtgaccgcg atagaaacaa tgccggaaga taaattgacg attgaattcg taaagtgtcg 660 tttgctcgac gaggaaataa agtgccgtgg caaagaattt gattcgttcg gcgtgaagcg 720 cgagccgtcg gctttcgccg ggaaaccgac tggtgctgtg agaaaatgga aatgtttttc 780 gtgccaaaaa gtcggacaca aggctgcgga gtgtcccgaa cggcagaaga ggaagaaaga 840 agacaataag aagtcaagtg caaatgtttc cgaagagaaa gcgggtggaa tatgtttcgc 900 cgctagtggt ggtgttgttc gggaagagaa agtattgtgg attgcagatt cgggtgcttc 960 cgaacacatg agcaacaaca agaatctctt cgaagagcta gtgccattga aggatccagt 1020 tgaaatcgct gttgcattga acggtaagtc tgcgatagca aagtgttctg gcaagattaa 1080 gatgattgcc gtcagtggaa gcaagaaacg tgagtgtact ttggaaaatg ttctctatgc 1140 tccggatctc cgttgtaatt tgttctcgat ccgcaaagtg gaaatggcgg gaatggagat 1200 tgtgttcaaa gacggtagtg taaaagttcg gaaagattct gaagtggtcg cctgtggacg 1260 acgtcgtgga ttgcaatacg aaatggactt ttacgcggaa agtacaagtg cgtcgtcatt 1320 gtattcgtgc gggaagttgc aaaaatgcaa tgaactttgg catcgtcggt atggtcactt 1380 gagcgagaag aacttggaac tgatcgtgaa gaagaatatg gtgaccggat tcgagaagtg 1440 cgcggtagac gacaatggtg aaatgttttg cgaaccatgt atcgaagcga aacaaactcg 1500 gaaaccattc gttagtagct ccgagaagag agcttcgcgt gtgctagaac taatccatac 1560 ggacgtctgt ggccctgtaa cgccggtggg gcatgacggt agtcggtact acgtcagttt 1620 cattgacgat tggagccggt ttaccatcgt ctacacgatc caatcgaagg atcaggtgct 1680 cggatgtttt aaactttacg aagcgatggt cactgcaaaa tttggaagaa aaattgcgcg 1740 tgttcgtagt gataacggtg gagaataccg aaacgaacag ttcgagaaat tttgccagca 1800 gaaaggaatc cagatggagt gcacggtgcc gtattcgccg gagcagaacg gcatcagtga 1860 gcggatgaac cggaccctgg tggagaaggc caggtcaatg ctcatcggtt ccggtgtcga 1920 caaggtattt tggtgtgagg ccgtagaaac ggccgcttat ctggtaaata ggagccctgc 1980 aagtgccctg aaagaaggta agacaccatt tgaattatgg gaaggtcgaa aaccaaatat 2040 ttccggaatg cgagtgtggg gaagtccagc cttttgccat gtccctaagg aaaaccggaa 2100 gaaactcgat agtaaggcat ggaaagggat atttctggga taccactgta acggatacag 2160 gatctgggat ccgaagcgaa agaaagtggt agtgcaacga gatgtaatta tcgacgaatc 2220 tcgtggtaca tcgaaggaag ttccagcgct tggcaagaca tctacagata atgagatgat 2280 tcggatcacg cacgacgagg aggtaggaga caccgacttt ggtaatacgg aggtaccggg 2340 tgattcgcat gaaacaagct tcgaagattg tgatgaagga tcggattcgg actcggcagc 2400 tgatgaggat tcagcagggg cagttgaagc accatcagat caaataagaa ctagtggtcg 2460 tagaagagaa cctcctgcgt ggcacagaga ttacgacatg aactgtgctt atgcaatgag 2520 tgcaacgaac tttgttgaaa atttccctga tactctgaag gaaatgagga agcgtgacga 2580 ttggccgtat tggaaagccg ccgtcgagga ggagatgaac tctcatcgga agaataatac 2640 gtggagcatt tgcaagttac ctgaagggag gaaagcagtc tcgagtaaat gggttttcaa 2700 gcttaaacgt ggtgagaacg gagctgtgga ccgatataag gctcgactgg tggcccgtgg 2760 attttcgcag cgtttcggat atgattacac tgagacatac tcaccggtag caagaatgga 2820 tacggtgcgt acagttttgg ctgtggcaaa tcaagaacgt cttaaagtac atcaaatgga 2880 cgtcaagacg gcgttcttga acggaaactt ggaagaggaa atttttatga cgttacccga 2940 aggctcggaa ggagaactaa acgatgtttg ccgtttgaac cgttcgctgt atggtttaaa 3000 acaggcgtca agagcctgga acgaaagatt tcatcgtttt gtgaccaagt taggatttcg 3060 gcaaagtgag aatgatcggt gcctgtatgt acgaggtgag ggtagccaac gagtgttcct 3120 gttactgtat gttgacgaca tacttattgt ttgccgaaac ttgaaagaaa tcgagaaggt 3180 aaagaagcga ctggcgcagg agtttgagat gtcggacatg ggtcctgttg gtaattttct 3240 tggaatgagt atcgacagga atgaggagca gcgaattttg cggatttcac agcgttgtta 3300 tctcgaaagt ctactttgcc gatttgggat ggagaactgt aagccagcag cgacaccaat 3360 ggagactcga ttgcagctga agaagggtga tgaagcaaac aaaaccagca agccatacag 3420 ggagctcgtg ggctgtttgg cgtacgtagc gcagtgttca aggccggatt tatgtgcatc 3480 agtgaacttt ttcagccaat accagagctg cccaacagat gaacattgga atttcttgaa 3540 gcgtgttctg cgatacgtac gaggaacttt ggacatagtt ttggagtttc atggaagcga 3600 taaagctgaa ccgtttgtcg tttattcaga ttcggactgg ggcaacgatg ttaacgatag 3660 acgatcgatt actggatgtg tttttcgtgt ttttggtggc acaacagcct ggttaacaag 3720 aaaacaacag accgtagcac tgtcctcaac agaagcagaa tttgtggctt tgtgtaccgc 3780 cgcatgtgaa ggtatttggc tccgtcggct tcttgaagat cttggcgtgg taattgaagg 3840 tccagtgaaa catttcgagg ataaccaatc gtgtattaag gtgacggagg agccacgtga 3900 cagtcgacga ctaaagcacg tggatgtcaa attcaacttt gttcgcgagc tagtgaaaga 3960 caagcgcatc gatatccgct acattccatc ggagaaacag ttggcggata ttatgactaa 4020 aggactggca tcaacggcct ttaatcgttt acgagaagct ttaggactgc agaaattgag 4080 cagggg 4086 // ID Sola1-4_AA repbase; DNA; INV; 3171 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-4_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3171 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(355..1146,1207..1722,1770..2396,2299..2601) FT /product="Sola1-4_AA_1p" FT /translation="SQSEKTDESSSSSEDEGFQGFCDHEQRVDESGFEAFK FT AAVNGEIVPNVTPPGSWRSFSRAVLNETTGEALNSHTNHKASSNAKLPECG FT GAKITRKEPSNVAERLCEYGVGSFENISTLKEPTEDIWNDPIDSDGSFFDA FT EETSATMKRTKKAIQKEDSAKKRRMSHGIKLVDCKCKKRCPDKVGKDARTQ FT NHTRFWSLDSLGQDSFFRETVQSVPLKRRVKDRFVLDGPKKRRSYVFNLMS FT ENELIEVCRKFYLNTIGYNENCGKHGFRCVDGDDNYNRKVSQRGKHTRDMT FT KRDAVKAHINSFNPTISHYRREHAPNRLYLPSDLTEKAMYEHYLNKKNVHV FT SYQFYGRVLNQMNISIVKLGHEECKLCTTSTEHQKQSGHSTVQSSEGDDLC FT TICDKFSVRLNFAAMSREAFRKDGDNVKPGCVVLAVDLQKVSVIQLPRLDG FT YKSIVFSQRLLAYNETFAPVGSYMKNLPVITCLWDESIAGRHASNIASCFH FT RVSHRFKDSEEIIFWLDNCSAQNKNWNLFQHIILLLNSNDIKIKKISFKFF FT ETGHTFMAADSVHAAIEKKMRRSKAAETFPDFAGIVKNAKNHMEVMDMKFN FT DFFNTTMNTTQNMINQCTPRPYIEKNQTHCFQEGKPRTFLQQRFRWKINAH FT PGRILKKIKHIVFKKGSHELSYSNAFDGSELMNKCVLFSKKQLKQISSENF FT NLENTLKRQAVPKGITKDRKEALLKVILPLVSNTSKSYWNDLPLQ*" XX SQ Sequence 3171 BP; 1052 A; 583 C; 645 G; 891 T; 0 other; ctgccgtgaa tcgcaagtca gtcctatctg taaaaacaag gaaatgagaa aactgctggt 60 aaagatttcg caagttttct catttatttt gtttaaccca cggataaatt caacaaaaat 120 catgcagtat aatagaatat atatgtgctc atcagtacat attagacaaa attatgaaaa 180 ctgttttttt cgactaaaaa agccaaagag actgacttgc gattactttt tgcatcaatg 240 cgaaaagtaa tcgcaagtcc gtcacattgc catgacaata ttgcgtgtgt tattctccac 300 aaattgttgc tatgaaaaat tgaccatctt cagagtacaa gttgatttat ataaagtcag 360 tctgaaaaaa cggatgaaag ttccagctca tctgaagacg aaggcttcca aggtttttgt 420 gatcacgaac agcgtgtgga tgaatctggt tttgaagcat tcaaagctgc agtgaatgga 480 gaaatagtac ccaatgtaac tccccccgga agctggagat ccttttccag agccgttctg 540 aatgaaacca cgggagaagc acttaattcc cacacaaatc ataaggctag ctcaaatgcg 600 aaattgcccg agtgcggagg ggcaaagatc acccgaaaag agcccagtaa cgttgccgag 660 cgtttatgtg aatacggcgt cggctctttc gagaatatat ccacgctcaa agaacccacg 720 gaggacatct ggaatgatcc gattgattcg gacggaagct ttttcgatgc agaagaaact 780 tctgcaacta tgaaacgaac gaagaaagcc attcaaaagg aagactccgc caagaaacgg 840 cgcatgtcgc atggtattaa attagtcgat tgcaaatgca aaaaacgatg ccctgataaa 900 gtaggtaaag atgctagaac acaaaatcat acccgctttt ggagtttgga cagccttggc 960 caggattcat tttttcgaga aacggtgcag tccgtgccgc tgaaaagacg agttaaagat 1020 cgttttgtac tggatggacc aaagaagcgg agaagctatg tattcaattt gatgtccgaa 1080 aatgagctga ttgaagtttg ccggaagttc tatctaaata caatcggtta caatgagaac 1140 tgcgggtaag atgtgctgct ctgtgatagt gtttaaaaat ctgctacaat gaacatttcc 1200 ttttagaaac atggttttcg atgtgtcgat ggtgatgata attacaatcg aaaagtgtcc 1260 cagagaggaa aacacactcg tgacatgacg aagagagatg ctgtaaaagc tcatatcaat 1320 tccttcaacc caacgatttc acattaccga cgtgagcatg ctcccaatag gctctatttg 1380 ccatccgacc taacggaaaa agcgatgtat gagcattatc tcaataaaaa aaacgttcat 1440 gtcagctacc aattctacgg ccgtgttttg aaccagatga atatatccat cgttaaatta 1500 ggccatgaag aatgcaagct gtgcaccact tcaaccgaac atcaaaaaca gtcgggacat 1560 tcaactgttc aatcctcgga aggagatgat ctttgtacga tctgtgacaa attttctgtc 1620 cgcttgaatt tcgcagccat gtctcgagaa gcttttcgaa aggatggaga taatgtaaaa 1680 cctggatgcg ttgttctggc tgtcgacttg caaaaggtaa gttaaaccac ttttttaaaa 1740 taacgtgttt tgacccctgt atattttagg tgattcaact cccaagactt gatggataca 1800 aaagtattgt tttttctcag cgtttactgg cgtataatga aacttttgct ccggtgggta 1860 gctacatgaa aaatctacct gttataacat gtctgtggga cgaatctatt gctggacgcc 1920 atgccagtaa catcgcaagc tgtttccatc gtgtcagcca tcggttcaaa gattcagaag 1980 aaatcatatt ttggctagat aattgcagtg ctcagaataa aaattggaat ttattccaac 2040 atattatact gctgctcaac tccaatgata ttaaaattaa aaaaatatca ttcaagttct 2100 tcgaaactgg acacactttt atggcagccg acagtgttca tgctgcaatt gagaaaaaaa 2160 tgcgtagaag taaagcagca gaaactttcc cggattttgc aggaatagtt aaaaatgcga 2220 agaatcacat ggaagtaatg gatatgaagt tcaacgattt tttcaacact accatgaata 2280 caactcaaaa catgataaat caatgcacac cccggccgta tattgaaaaa aatcaaacac 2340 attgttttca agaagggaag ccacgaactt tcttacagca acgctttcga tggaagtgaa 2400 ttaatgaata aatgtgttct attctcaaag aagcagttaa agcagatcag tagcgaaaat 2460 ttcaacctgg aaaatacact gaaaaggcag gctgttccga aaggcatcac gaaagatcgg 2520 aaggaagctt tactcaaagt tatcttgcca cttgtcagta ataccagtaa atcctactgg 2580 aatgatttgc cactacaata atcgtttata gaaatgcttc gaaaatactt caaaatatat 2640 gaaatcattt tagttataga ataaatccgt ttaaaaatat aatacatttg tttttgtatt 2700 tttttctatg taattattat ataattcatt attgggactg acttgcgatt acttttgatt 2760 atgggacaat ttgacttggt gttgtctttt attttttcag aacaaaattt tttttttgat 2820 tgctgcagct aatctatata aataaaaatg gaatggtgtt tgtatgtcac gaaatggctt 2880 acgaacgggt caacggattt gaatgattct ttctcagttt tgttcgtcag gggttccgac 2940 gtgttcgtgt gtataaaaat cccaggatat tcaccgggaa agttgaaaaa acgagcgcga 3000 acgaaactgt cattttatat gggacgatca aaagcgtttt tcaacagcct acttgatggc 3060 aagacgaagt ttgccgggac cactagtaat atatagaaaa ataaggcgaa aagtggttgt 3120 aactcaaaaa tgacaaaaat gcagatggga ctgacttgcg attcacggca g 3171 // ID Copia-102_AA-LTR repbase; DNA; INV; 146 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-102_AA_; KW Copia-102_AA-I; Copia-102_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-146 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 146 BP; 31 A; 35 C; 25 G; 55 T; 0 other; tgatagcatg gcaaccctgc cgtgtggaga ctttacatca cgaacttttt ccatttcctt 60 tccatttgag ctttgaataa aagtacagtt gaattcgttc tctctacctg attggttgtg 120 ttttattagc tctgctctac ctccca 146 // ID I_Ele28 repbase; DNA; INV; 7104 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele28. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7104 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7104 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 7 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 688..2436 FT /product="I_Ele28_1p" FT /translation="MSASSPPLNPGDPGGPGGGYQRNRKNGEYTDRLIPEF FT LDREGNAGQLQFLRMKAKTGAIPNNPFLLRLSVEKHVGGQIAGAFKENKGI FT SYVLKVRSQAQFDKLLTMSRLADGTEISVEEHSNLNQVKCVVSNADTIGLE FT EGYIAEQLREQKVKEVRRIKRRNRNTNALEETPTLILTISGTVVPDHIDFG FT WSRCRTRNFYPNPMQCFRCWSFGHTGKRCTAPARVCGRCSKVHQEDQEMET FT ANEQGTTNEPTPRRVCTEPTYCKACNSAQHALSSRECPAFRKENAIQIIRV FT DDGLSYPQARREFEAREAQKERDQGRNSYAGVASSSKDAEIAELRGTVKRL FT ENEAAKREQRMADMEQALAYRPISERLETAKDHGPIEELIRQVAELTATVR FT QLQADLKEKDQIIAEMKEKQQVTPRNSMTTPTPIPQSSSPYTTVSSVECLP FT NFADPTMTAQVAEWVRNNVAAKKEKMANEKKRVVPKKQRKKSNDKQTALLQ FT DDQSPHMSHSPLSRIGKESEFLATNMELDDSDNSLKSVGTATSSAAGSKRI FT HSISSAESSINSADEVRITRNKTRLPRDTAADISMQ" FT CDS 2491..6957 FT /product="I_Ele28_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="SHYPITFTSTFPPADVQNPELNERQQSSSNDSTNHTQ FT TTSSTAEETSGSRGPPSADTTPQPELGDNPRHPNVLAAEDPEDGRERLGPA FT GADVTPQPEPADNPSLPLRSGRGTDKGGNAYPPEDVEGSKPAMNLRRNPPR FT RRNPPNRYGFSDINNPKPTATEYLSPKLKPLSVLESTNASAYDAMVAIRDE FT ACAPPTAGPSGVFRRPSPLYPYQDLSGHLGGKTPSGKPHPLRRPHRKGTPP FT TAGPSGLAHHFVIPDPSQDQLGHLGPQNPSATTVRKSTTLSIQWNMNGFLQ FT NRNDLELLVQEYQPIILALQEIHRVSETTMDSTIGRKYKWHKTAGTSLYHS FT VAIGVLADIPAVRLNITTDLPIVGVRLDLPIAVSALSVYLPNGKLPNLRER FT LQDALDQIPGPIIMLGDINGHHQSWGDKKTDARGCCVMDLACGNDLVVLND FT GSPTFFRGTTETAIDVTLASSEIGHKLLWHVKEDPHSSDHSPIIVTLTNST FT QPETTRRPRWLFEQADWAGFQELVEKELDQYASPSIEDITASIIKAASETI FT PRTSPNPGRRALHWWNDTTKAAVKHRRKKLRTFKKIKNKLPPDHPNTVQAK FT EEYQEARNACRQTIREEKEKSWTKFLDSINDQQSSAELWRRVNCLRGKRRV FT KGMCLTVDNVPTRDPQIIAEALGEYFQELSSMGKYPEAFRRKHHCSHSAIS FT NFEIAPDCGQPFNTPFQMAELEFALRRAKGKSAGPDELAYPLLKNLPPSGK FT QGLLRAINSIWTSGTLPADWRHSLVVPIPKGSGPANSTSSYRPIALTSCAS FT KVMERMANRRLVEYLESNGLLDDRQHAFRSGRGTGTFLAALGQILDDALSK FT GDHAELVSLDLAKAYNRAWTPGILEKLARWGITGNLLRYLKGFLTERTFQV FT TVGNSRSNIQSEETGVPQGSVIAVTLFLVAMSGVFEVLPGGVFLVVYADDI FT TLVVTGAHPKAIRRKAQAAATAVGKWASTTGFEISAAKSTRIHICGINHRP FT PRKPIKLNGETIPLKRSLRIMGITLDRHLTLKNHLIGVKHSCKDRLNLLRS FT ISAKRTCSDRATCVRVADAIIGSRLLYGIEMTCRNQDLLTETLAPIHNKAI FT RTVSGLLPSTPADAACAEIGVLPFRHKVALTIANRAIGYLEKTRGNGQACL FT LSEKANLALSTAANTTLPPVAGLHRLGPRSWSARAPEVDNSVKNRFKRQSQ FT PEKVRAYFCERLAKRYTNWEIRYTDGSRSAGRVGVGVHGNRHEEQHSLPTE FT CSVFSAEAAAIFQALTAPSDGSLLIVTDSHSSITAVENPRNKHPFIQGIQM FT LLDTAPYPRALMWVPGHCGIAGNERADVLAGIGRNSAPLTKKIPADDLRNW FT VKDVFWRTWTDEWRSNRELFLRKSKPNTIPGEDLSSRREQIVLSRLRTGHT FT RVSHDMAARGAGFRRQCELCNTQNTVEHFLCACPQYRDLRETHEITDIPRS FT LANDPVNERQVINFLKEAGLFGEI" XX SQ Sequence 7104 BP; 1972 A; 2055 C; 1782 G; 1295 T; 0 other; cagttgacag ttcgggttcg tcgcgtttgg atcgcttttt tcagcgtcgc gagtgaaaag 60 tggtgatttt cggagtgctt ttttcaatcg aatttctcga ctttccgaac tttccgcatt 120 ttccggagca aatttcggag tgtccgcggc caatccgccg gtttccgtaa atttgttcgc 180 cgtttttccg ccgaattttc ggtcgcaaaa gtgcccactc gtgtggagaa agccacaaag 240 gcaagtgagc gtgggtatta cgccaagggg cgaagaaagt gccggatctt tttcggaaaa 300 taagtggtgc taggctacgg ggcccaaaaa gcttaattaa ttagctctcg ttggtggttc 360 cgaagaagaa agtgttggcc atttttaggg ccccataaga gcccctgagt gccaaataga 420 ccagtgacaa ccaaacctcc taccccccac tttttgtgaa gtgttttcca cccccccccc 480 ctcaaaattt tcccctcccc ctacgattgg tagtgccggt taaggccgtc gtagaaccct 540 ccctcttaag ccaccgaggt gtacagaggc gagttttttg cataaaaacg tcagtgcggt 600 ggacgttcca cgagtgaatc aagtggtttc ccgaaaaggc cggtcgactg tgtggtctaa 660 ccaccatagt cccggcccgc cggcgtaatg tcggcgagca gccctccgct caaccccggg 720 gacccagggg gccccggcgg aggctaccag cgtaatcgta agaatggtga gtacaccgat 780 cgattgattc ccgagttcct cgacagggaa ggaaacgcag gtcagctgca gttccttcga 840 atgaaggcga aaactggtgc catcccgaac aacccgttcc tcctacgcct ctcggtagaa 900 aagcacgttg gtggacaaat cgcgggcgcg ttcaaagaga acaaaggtat ttcgtatgtt 960 ctcaaagtgc gtagccaggc ccaattcgac aaacttctga ccatgtcaag gctagcggac 1020 ggaaccgaga tcagcgtgga agagcattct aacctcaacc aggtcaaatg cgttgtttcc 1080 aatgctgaca ccatcggcct agaagaggga tatatcgccg aacagcttcg cgagcaaaaa 1140 gtaaaagaag tccgacgaat caaaaggagg aacagaaaca ccaacgctct ggaggaaacc 1200 ccaaccctaa tcctcaccat cagcggcacc gtggtcccgg accatataga ttttggctgg 1260 tccagatgtc gaacccgcaa tttctacccg aacccaatgc agtgcttccg ctgctggagc 1320 ttcggacata ctggaaagcg ctgcacggcc cccgctagag tctgcggaag atgcagcaag 1380 gttcaccagg aagatcagga aatggaaacc gcaaacgagc aaggaaccac gaacgaacca 1440 acgcccagac gagtgtgcac cgagcccaca tactgcaaag catgcaacag cgcacaacat 1500 gctttatcga gccgcgagtg cccggcattt aggaaggaga atgccatcca aatcatccgt 1560 gtggatgatg gcctttccta ccctcaggcg cgacgggaat tcgaggcgag agaggcccaa 1620 aaagaacgcg atcaaggtcg gaactcctac gccggagtag ctagcagcag caaagacgcg 1680 gagatagctg agctccgagg caccgtgaaa aggttggaga acgaagctgc aaagcgcgaa 1740 cagcgaatgg ccgatatgga acaagcccta gcataccgac ccatcagcga acggctggaa 1800 acagccaagg accacggccc gatagaggag ctaatccggc aggtagcgga attaaccgcc 1860 accgtgagac agctccaagc ggaccttaaa gagaaggacc aaatcatcgc agaaatgaag 1920 gaaaaacagc aggtaacccc ccgaaactcg atgaccacac caacgccaat cccacaatca 1980 agctccccat acactaccgt cagcagcgtc gaatgtctgc cgaattttgc ggacccgaca 2040 atgactgccc aggtagccga atgggttcgg aacaacgtag ctgcaaaaaa ggaaaaaatg 2100 gccaatgaga agaagagagt tgtccccaaa aagcagagga agaaaagcaa cgataaacaa 2160 acggcactac tacaagacga ccagagcccc catatgagtc actcccccct gagccgcata 2220 ggcaaagaaa gcgagttcct tgcaacaaac atggaactgg atgatagtga caattcgctg 2280 aagtccgtcg gcactgcgac ttcgtcagca gctgggtcaa aacggatcca ctcaatctcc 2340 agcgcggaat caagcatcaa ctcagccgat gaagttcgga tcaccaggaa caaaactagg 2400 ctccctcgtg ataccgcagc agacatatca atgcagtaac cgccgaacag gtcgctcgaa 2460 caaaacatca gcacatcacc agaacagtaa tcacactacc caatcacatt cacctccacc 2520 ttcccacccg ctgatgtcca aaacccagag ctaaacgagc gacaacagtc aagctccaac 2580 gacagtacca accacaccca aacaacatca agcactgcag aggagacctc gggcagccga 2640 ggccccccca gtgcggacac tacaccccaa ccggaactgg gggacaaccc tcggcacccg 2700 aatgtccttg ctgccgaaga cccagaggac ggtcgggaac ggttaggccc cgccggtgcg 2760 gacgttacac cccaaccgga accagcggac aaccctagcc ttcccctccg gtctggaaga 2820 gggacggaca agggcggtaa cgcctatccc cctgaggacg tcgagggatc caagcctgca 2880 atgaacctgc gccgaaatcc tccccggaga agaaacccgc caaaccgtta tggcttcagc 2940 gacatcaaca accccaaacc aactgctacc gaatacttat ctccgaagct aaaaccactg 3000 tcggtcttgg aaagcaccaa cgcttcagct tacgacgcga tggtcgcaat acgtgacgaa 3060 gcttgcgctc caccaacagc cggcccttcc ggagtatttc gacgaccttc accactttat 3120 ccctaccagg atctgtctgg gcacctcgga ggtaaaactc cgagtggtaa gccccaccct 3180 ctccggcgac ctcatcgaaa aggaacgcca cctactgccg gcccctccgg cctagcacac 3240 cattttgtca tcccagatcc ctctcaggat caactagggc acctcggacc tcaaaatcca 3300 agcgcaacga cagtaaggaa gtcgacaacg ctctccattc agtggaatat gaacggtttc 3360 ctccaaaatc gtaatgacct ggaactgctg gtgcaggagt accagcctat catcctggct 3420 ttacaggaaa tccaccgggt aagcgaaacc accatggaca gcaccatcgg tagaaaatac 3480 aaatggcaca aaacggccgg tacttcgctc taccattccg tggcaatagg agtactagcc 3540 gatatccctg ccgtcaggct aaacatcact acggatttac caatcgttgg cgtccggctg 3600 gacctgccca ttgctgtgtc tgccctgtcg gtctacctac cgaacgggaa gcttccgaat 3660 ttgagggagc gcttgcagga tgcccttgac caaatccctg gcccgatcat catgctcggt 3720 gacattaacg gccatcacca gtcctggggt gacaaaaaga cagatgcacg aggatgttgc 3780 gtaatggacc ttgcatgcgg taacgatctc gtcgtcctca acgatgggtc ccccactttt 3840 tttcgcggga caactgaaac cgcaatcgat gttacacttg cctcctccga gattggccac 3900 aagttgctgt ggcatgtgaa agaggatccc cacagcagcg atcattcgcc aatcattgtc 3960 accctcacca actccaccca gccagaaacg acccggcgcc cccgatggct cttcgagcag 4020 gcggattggg ccggcttcca agaacttgtg gaaaaggaac tcgaccagta cgcatccccg 4080 tcaatagagg acatcactgc ctccatcatt aaggccgcaa gcgagacgat tccaagaact 4140 agccccaatc caggaaggcg agccctccac tggtggaacg acaccactaa ggcggcggtg 4200 aaacaccggc ggaagaagtt gcgaaccttt aaaaagatca agaacaaact accgcctgac 4260 catccgaata cggtacaagc caaggaagaa taccaggagg caagaaacgc ttgtcggcag 4320 accatccggg aggaaaagga aaagtcgtgg acgaagttcc ttgactccat caacgaccaa 4380 cagtcctccg cggagttgtg gaggagggta aactgcctgc gaggaaagcg gagagttaaa 4440 ggaatgtgtc tgacagtcga caacgttccc actcgagacc cgcagataat cgccgaagcc 4500 ctaggcgaat attttcagga actctcgtct atgggaaagt atcctgaagc ttttcgcaga 4560 aaacaccact gttcccactc cgccatcagt aattttgaaa ttgcccctga ctgcggtcaa 4620 cctttcaaca cccctttcca aatggctgaa cttgagttcg ccctccgacg agctaaaggg 4680 aagtccgctg gaccagacga gctggcatac cccctgttaa aaaacctccc tccaagcgga 4740 aagcaaggcc tactccgggc catcaactcc atatggacgt cggggacatt accggccgac 4800 tggaggcaca gtctggtggt tccaataccg aaaggctccg gaccagcaaa cagcacctcc 4860 agctaccgcc caatcgctct gaccagttgc gcctccaagg tgatggaaag aatggcgaac 4920 cgccgactgg tggaatatct cgagagtaac ggcctcctag acgacaggca gcacgctttc 4980 cggtccggtc gtggcaccgg aaccttcctg gcggcattag gacagattct tgatgacgcc 5040 ttatctaaag gagatcatgc ggaactagtt tcgctggacc tggcaaaagc ctataaccgg 5100 gcctggaccc caggaatact agagaagcta gcccggtggg gaatcaccgg aaatctactc 5160 cgttacctga aaggtttttt aaccgagagg acattccaag tcacggttgg aaacagccgc 5220 tccaatatac aaagtgagga aacaggagtc ccccagggct ccgtgattgc ggtcactctt 5280 ttcctggtag ccatgagcgg agtatttgaa gttctaccgg gaggagtctt cctcgtggta 5340 tatgcggacg acatcaccct agtggtgacc ggtgcccacc cgaaagcgat caggcgaaaa 5400 gcccaagcag ctgccacagc tgtaggcaaa tgggctagca cgacaggctt cgagatctcg 5460 gctgccaaat ccaccagaat ccacatctgc gggatcaacc accggccacc taggaaaccg 5520 atcaaactaa acggagagac cattccactg aaaaggtctt tgaggataat gggtatcacc 5580 cttgaccgac acctaacctt aaaaaaccac ctcatcgggg tcaaacatag ctgcaaagac 5640 aggttaaacc tgctccgtag catctcagca aaaagaactt gcagcgacag ggccacctgt 5700 gtaagggtcg cggacgcgat catcggcagt cgtctactct acgggatcga gatgacttgc 5760 cgaaaccagg atctgcttac cgaaacatta gcgcccatac ataacaaagc cattagaacc 5820 gtatccggcc tgctcccatc cacccccgcc gacgcagcgt gcgccgaaat cggagttttg 5880 ccattcaggc ataaagtagc cctcacaatc gctaacaggg caataggata cctggagaaa 5940 acccggggaa acgggcaagc atgcctcctc tcagagaagg caaacctggc cctcagtact 6000 gcggccaaca ccacgcttcc cccggtggca ggcctccacc ggctcgggcc cagaagctgg 6060 tcggccagag cacccgaggt agacaacagc gttaagaacc ggttcaaaag gcagtcccaa 6120 ccggaaaagg tccgggcgta cttctgtgag aggctagcaa agcgatatac caactgggag 6180 atccggtaca ccgacgggtc aaggtcggcg ggacgtgtgg gcgtaggagt ccatggcaac 6240 agacacgagg aacaacacag cctcccgacg gaatgttccg tcttttcagc ggaggctgcc 6300 gcgatcttcc aggcactgac agccccaagc gacgggtcac tactaatcgt gacagactcc 6360 cacagttcca tcacagcagt agagaaccca aggaataaac atccatttat ccagggcatc 6420 cagatgttgc tggacaccgc cccatacccg agggcattaa tgtgggttcc cgggcattgt 6480 gggatcgccg gtaacgagcg agcagatgtc ttggctggca tcgggagaaa tagcgcccca 6540 ctaacgaaaa aaatccccgc tgacgactta aggaactggg tcaaggacgt cttctggcgg 6600 acctggacag atgaatggag gagtaatcgg gaattgttct tgagaaagtc caaaccaaac 6660 accatacccg gcgaagacct atcgagccga cgtgaacaga tcgtcctttc ccggttgcgt 6720 acgggtcaca cccgggtatc ccacgacatg gcggcaaggg gagccggatt ccgccgtcaa 6780 tgcgagcttt gcaacacgca gaacaccgtg gaacactttt tatgcgcatg ccctcaatat 6840 cgggacctac gagaaacaca cgagatcacg gacatcccca ggtcgttggc gaacgaccct 6900 gtaaatgaac gccaggtcat caattttctg aaggaggctg gccttttcgg cgaaatatag 6960 tagtagagag gaaactctgt attttgaaat gtaattattc ctagggctta acccttctgg 7020 tatagccaat ttctttttta aacggagatg aaccagccaa gggctgaaaa tctccctaat 7080 aaagataaat aataataata ataa 7104 // ID BEL-22_AA-I repbase; DNA; INV; 6173 BP. XX AC supercont1.310; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-22_AA_; KW BEL-22_AA-LTR; BEL-22_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6173 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.310; Positions 129287 123115. XX CC Positions [5235-5783] - Integrase core CC 'ACTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(21..1793,1797..6173) FT /product="BEL-22_AA-I_1p" FT /translation="MDDSPNGTPDPNYTVSCVACDRPPTADNFVCCDRCAD FT WWHMSCAGVTDSIENKDWLCSKCLPAPSISITSTSSTRRARLQLELQRLTE FT QRELEKLALNVELEKRFLERKYAMLEKSLQDEEVDHRSNRSRSSKTDAEKR FT KKDILDWVEQQSNEGVGKHPGRSPSVKSVVSNIEKRNQKPSYAMINQEDGA FT VGGEVPIGNRLVPPPILIPPGSGKLTAQYQSKEDVNLLQLHKDRASSYTLD FT EITALQDHLAKCKLQLLESTTQTQRKLDASDVSKGVLPKAYSNQRHHPKKD FT SFLETRSSKLPEIHRVDAFEEPEALQNAVRLFPDSRALMKKRNVPIGQQLN FT RSHEQQQKYEPEPRPLSPLQRVRKEASHQMHDVGRTNERIFATVNEGNQPF FT HKHRFEERVPAMLQPSDENIIEPVSNFIPPINQTIFGPTNQQLAARQALPR FT DLPTFSGNPADWPVFISQYEYTTDACGYSDGENMIRLQRCLKGPAWESVRS FT RLMLPASVPLVVQALRMRYGRSELLIDSLIARVRSAPTPKVDRLETLIEFG FT TMVQALCDHIIAAHLNDHLANPTLLKDLVEKLPAEYRMRWAGYRHHLVVNL FT KTFSDFMEEIVNDAYSVSSVTNTFERPIKTDRTKQKDRIFVHSESKKPDCS FT ICNRDGHRVKDCKTFLALPIDDRWKKVQSLQLCRICLTNHGRRACRITKRC FT ETDGCGFRHNPLLHSPKQNTRPPTDAHVSCHFNRGTLPSLFFRIIPITLRN FT GTKTIDTFAFLDEGSSVTMIESSLAKELNISGYTKPLCLKWTGDMCREERD FT SRLISVLISGVDSSKQYRMNDVQTTQTLGLPSQSLPIEELVRIYRHLGGIP FT IRGYRNAIPKLLIGVDNLRLSLPLKSREGVNGAPVAVKTRLGWCVYGGRRR FT IEQAEFSYHICECGGEEGFGSAMRKYFSFEEAGVMPRTTIMSDEDKRAQCI FT LEETTKRVGGRFECGLLWKYDNVELPDSYPMAVKRLHCLERRMAKDPILKQ FT NISQQIDEYQLKGYARRVPLNQLTDLDPRRTWYLPLGAVTNPNKPGKVRLI FT WDASAKVDGICLNSLLLKGPDLVTPLCFVMFRFRQHSVAISADLKEMFHQV FT RIREDRPAQSFLFRSDPENEPQIFHMNVATFGATCSPAIAQHVKNANAKEF FT EEKHPRAVEGIVFNHYVDDYLDSFATEEEAKQVTTEVREIHRCGGFELRNW FT CSNKDEVLQHLGQMVQNPVKQLRFDDQEQSERVLGMRWATDTDEFCFSAKM FT RNETAAIISGDQKPTKRQILKCIMSLFDPLGLLAPYLVFGKILMQNVWRQG FT IGWDEVVDDRLFEQWQHWTKMIDHINYIRIPRSYFRKAGAKGFQNIQLHTF FT VDAGGDACACVSYFRIVNEEGTVEIALVSAKTKVVPLKPITIPRLELQACV FT LGTRLAKYIVEGHSIIIKKRIFWSDSSSALSWINGDPRKYSPYVSFRVAEI FT LESTDRAEWRWVASEDNPADEATKWGAGPCFKSNSIWYTGPEFLYLPECEW FT PRPTPFVTSEEELRACYLHRELHIPQCIVDVSRFSKWTHLQRVMAYVHRFI FT NNCRYKCRHSGPLMRSELQKSENSIFRMVQWQSFPEEMVTLSTNHFLPLNE FT QRPLEKTSKIYKMTPLLDENQILRVDGRISSAQCASADVKFPIILPRTHRL FT TVLLIDDLHRQLKHGNKETLVNEIRQKYCIPKLRALVNSVVFRCLHCKIRN FT ARPIVPQMASLPKARLSPFVRPFSFVGLDYFGPFLVKVGRSQVKRWIALFT FT CMTIRAVHLEVAQSMSTESCVACVRRFVCRRGFPVEIHSDNGTNFQGAERL FT LREQINTELASTFTSTTTAWHFIPPATPHMGGAWERLVRSVKTALTAAYNN FT EKLDDEALYTMIIEAESIVNSRPLAYLPLDSAEQEAITPNHFLLGSSTGVK FT QPAVQQGSDCVIMKRSWTMIQKYLDMFWKRWVLEYMPTLTKRTKWFGETRN FT VREGDLVIIVDETKRNGWTRGRVLQAITASDGRVRQAVVQTSKGVLRRSVA FT RMAVLDVAREDGTKEDEGSHRGGG" XX SQ Sequence 6173 BP; 1821 A; 1368 C; 1490 G; 1494 T; 0 other; attcttcaaa aattcaagag atggatgatt cgccaaacgg tactccagat cccaactata 60 cggtaagctg tgtcgcttgc gatcgtccac ccaccgcgga caattttgtc tgctgtgacc 120 gttgtgcgga ttggtggcat atgtcctgcg ctggtgtgac tgactcgatt gagaataagg 180 attggttgtg cagtaaatgt ctaccagctc catcgatctc catcacatcg acatcttcaa 240 ctcgtcgagc cagactacag cttgagcttc agcggctaac agaacagcgc gaactcgaaa 300 aattggctct aaacgtagaa ctggagaaaa ggttcttaga acgaaaatat gcgatgttgg 360 aaaaatcact tcaggacgaa gaagttgatc accgtagtaa tcgtagccgc agtagtaaaa 420 cggatgctga aaaacggaaa aaagatatcc tggattgggt ggaacaacaa tcgaatgaag 480 gagtcggaaa gcaccctggc cggagtcctt ccgtgaagtc tgtagtttct aacattgaga 540 aaaggaacca gaaaccgtcc tacgcgatga tcaaccaaga agacggtgct gtaggaggtg 600 aagttcccat aggcaatcga ttagttccac caccaatttt gataccacca ggatcaggaa 660 aactaacagc acaatatcaa agtaaagaag atgtcaacct gttgcagttg cataaggacc 720 gagcttcaag ctacacgcta gatgagatca cagcactgca ggaccatttg gcgaaatgca 780 agctgcaatt gttagagtct acaacgcaaa cacaacgtaa actggatgcc tctgatgtct 840 caaaaggagt cttacctaaa gcctattcga accagcgaca tcatcccaaa aaggattcgt 900 ttctcgaaac tagatcgagt aaactgccgg aaatacaccg tgtcgacgcg tttgaagaac 960 cagaagcact ccaaaatgcc gttaggctgt ttccggattc ccgcgcgctt atgaagaagc 1020 gaaatgtccc aattggacaa cagttgaatc gttcacatga gcagcagcag aagtatgaac 1080 cggaaccaag accgctgagc cctcttcagc gtgtccgaaa ggaagcaagc catcagatgc 1140 acgatgtggg acgtacaaat gagcggatct ttgcaactgt caacgaaggc aatcaaccat 1200 ttcataagca ccgctttgaa gagagagtac cagcgatgct tcagccgagc gacgaaaata 1260 tcatcgagcc ggtgtcaaac ttcattccac ccataaatca aactatcttt ggacccacta 1320 atcagcagct agcagcaagg caagctttac cgcgagatct gcctacattc tcagggaatc 1380 cagccgattg gcccgtcttt ataagccagt acgaatacac aaccgatgct tgcgggtaca 1440 gtgacgggga aaatatgatc cgattacagc gttgcctcaa gggacctgct tgggaatctg 1500 ttcgtagccg gttgatgttg ccggcttcag ttccgcttgt agtccaggcg ttaagaatgc 1560 ggtacggtcg ttctgaactt ttaatagatt cgttgattgc gagggtgaga tcggctccaa 1620 caccaaaagt ggatcgacta gaaactctga tcgagttcgg taccatggta caagcccttt 1680 gtgatcacat tatcgctgcg cacctgaatg accatttagc aaatccaact ttgctgaagg 1740 acttagtgga aaagcttcca gctgagtaca ggatgagatg ggctggatat cgttaacatc 1800 acttggtggt aaatctcaaa acattcagtg acttcatgga agagattgtg aacgatgcgt 1860 atagtgtatc aagtgtaaca aatacatttg aaagacccat caaaactgat cgcacaaagc 1920 agaaagatcg gatatttgtt cactctgaat cgaaaaaacc ggattgttca atttgcaaca 1980 gagacggaca ccgcgtgaag gattgtaaga ctttccttgc actaccgatc gacgatcgct 2040 ggaagaaggt gcagagtctt caactgtgtc gtatctgctt gactaaccat ggacgcaggg 2100 cttgtcgtat cactaaacgg tgtgaaaccg atggatgtgg atttcgccac aatccactgc 2160 tacattcgcc taaacagaac actaggcctc ctacagatgc ccacgtgtcg tgtcacttca 2220 acagagggac tcttccttct ttatttttcc gaatcattcc gatcacctta cggaacggta 2280 cgaaaacaat cgatacgttc gctttcttgg acgaaggatc gtctgttacg atgattgaaa 2340 gcagtctagc taaggaactg aacatttctg gatacacgaa accactgtgc ctgaagtgga 2400 caggcgatat gtgcagagag gaacgagatt cacgactcat atcagtgctt atctcaggcg 2460 tagatagttc aaaacaatat cgtatgaatg atgtgcagac tactcagacg ttaggtctgc 2520 ctagccaaag tcttccaatc gaagaattgg tcagaatata taggcacctt ggtggaattc 2580 ccattcgagg atatcgtaac gcaatcccga aattgctcat tggcgttgac aacctgcgat 2640 tatcactacc cttgaagtct cgcgaaggtg taaatggcgc accagttgct gttaaaacac 2700 gcctaggctg gtgtgtatat ggaggtcgtc ggcgtataga acaagcagaa tttagctacc 2760 acatctgtga atgtggtgga gaagaaggct ttggatcggc catgaggaaa tacttttcat 2820 tcgaggaagc aggtgtaatg ccacgaacaa caataatgtc ggacgaagat aagcgagcac 2880 agtgtatact ggaagaaaca actaaaagag tgggaggccg gttcgagtgt gggttactgt 2940 ggaagtacga caatgtggag cttccagata gttacccgat ggccgtcaaa cgattacatt 3000 gcttggagcg cagaatggcc aaagatccaa ttctgaaaca gaacatttcc caacaaatcg 3060 acgaatatca gcttaaaggt tatgctcgtc gagtgccgct taaccaatta acagatttgg 3120 atcctcgtcg tacctggtac cttcctttag gggccgttac aaacccaaac aaaccgggga 3180 aggtgcgatt aatttgggat gcatccgcaa aagtagatgg catctgcctg aattcactgt 3240 tactcaaagg gcccgatcta gtaactcctc tgtgtttcgt tatgttccga ttccgacagc 3300 attcggtggc gattagcgcg gacttgaagg agatgttcca tcaagtccgt ataagagaag 3360 atcgtccagc tcaaagtttt ctattccgct cggatcccga gaatgaacca cagatattcc 3420 atatgaatgt cgctacattt ggagcaacct gttcaccagc catcgcacaa catgttaaga 3480 acgcaaacgc gaaggaattc gaggaaaaac acccacgagc cgtagaaggg atagtgttca 3540 accactatgt tgacgactat ctcgacagtt ttgccactga agaggaagcg aaacaagtta 3600 caacggaagt cagagagatt cacagatgcg gtggatttga acttcgcaac tggtgttcaa 3660 ataaagatga agtccttcaa catctaggac aaatggttca aaacccagta aagcagctgc 3720 gtttcgacga tcaagaacaa tcagaacgag tcctgggaat gcgttgggca acggacacag 3780 acgaattttg cttctccgcc aagatgcgaa acgaaactgc tgccataata tcaggcgacc 3840 aaaaacctac gaaacgtcag atactcaaat gcataatgtc cttgtttgac ccccttggat 3900 tacttgcgcc gtatttagtg tttggcaaga tccttatgca gaacgtgtgg agacaaggta 3960 tcggctggga tgaggtagtg gatgatagac tctttgagca gtggcagcat tggaccaaga 4020 tgatcgatca catcaactac attcgtattc cgaggagtta ttttcgaaaa gcaggagcaa 4080 aggggttcca aaacattcag ctacatacct ttgtcgatgc tggtggagat gcttgcgctt 4140 gcgtttccta tttccgtata gtcaacgaag aaggtactgt agaaatcgca ttagtgtctg 4200 ctaaaactaa ggttgtccca ctgaaaccta ttacgattcc tcgtttagag ctccaagctt 4260 gtgtgctagg aacacgactt gcaaaataca ttgtcgaagg acactcaatc atcatcaaaa 4320 agcgtatctt ctggagcgac tctagttccg ctctgagttg gatcaatgga gatccgcgaa 4380 agtacagtcc gtacgtatca tttcgagtag ccgagatttt ggaatcaacc gatcgagctg 4440 agtggcggtg ggtagcatct gaagataacc cggctgacga ggctaccaaa tggggagctg 4500 gaccatgttt caagtcgaac agtatttggt atacaggacc agaatttctc tatctacctg 4560 agtgcgagtg gccgagaccg actccctttg taaccagcga ggaggagctg cgagcatgtt 4620 atctacatcg agaactacat attcctcaat gtattgtaga cgtcagtagg ttctcaaagt 4680 ggacccattt gcagcgtgta atggcgtacg ttcacaggtt tatcaacaac tgccgttaca 4740 aatgccgaca ttccggtcca ttgatgagat ctgaactaca aaagagtgaa aattcaattt 4800 tccggatggt tcaatggcag tcttttcctg aagaaatggt aacactatct acgaaccatt 4860 tccttccact aaatgaacaa cgccctttag agaagacgag taaaatctat aagatgacac 4920 cactgttgga tgaaaatcag atcctccgcg tagatggaag aatcagctca gctcaatgtg 4980 catcagccga cgttaaattt cccataattc tcccacgaac acatcgatta acagtacttt 5040 tgatcgatga tctgcatcgt caactcaaac atggaaataa ggagactttg gtcaatgaaa 5100 tcaggcagaa gtattgtata cctaaactac gagcgttggt caattccgta gttttccgtt 5160 gcctacactg taaaattcgg aatgctcgac caatcgttcc acaaatggca tcattgccta 5220 aagcaagact atcaccattc gtacggccgt tcagctttgt gggacttgat tattttggac 5280 cgtttctggt gaaagtagga cgtagtcagg ttaaaagatg gattgccctt ttcacttgta 5340 tgaccatccg cgcggtccat cttgaagttg cccaaagtat gtcgacggag tcttgtgtag 5400 cttgcgtccg tcgatttgtg tgccgcagag gttttccagt ggaaattcac agtgataatg 5460 gcactaattt tcaaggcgcc gagcgcttgt tgagagaaca aatcaacacc gaacttgctt 5520 caacttttac aagtacaacg acggcctggc acttcattcc ccctgccaca ccgcatatgg 5580 gcggagcttg ggaaaggctt gttaggtcgg taaaaacggc acttactgcg gcgtataata 5640 atgaaaagct ggacgatgag gctctctata cgatgattat cgaagcggag tcgatagtaa 5700 acagtagacc gttggcatac cttccgctgg actcggcaga acaagaagcg atcactccta 5760 accacttctt gttagggagt tctaccggag ttaagcagcc agctgtgcaa caaggaagtg 5820 actgtgtaat catgaagcga tcttggacta tgattcaaaa atatttggac atgttttgga 5880 agcgttgggt gctcgagtat atgccaacgt tgacgaagag gaccaaatgg tttggagaaa 5940 ctagaaacgt acgagaaggg gatctagtca tcattgtcga cgaaactaaa aggaatggct 6000 ggacacgtgg tcgagtgcta caagcaataa cagcttctga tggcagggta cgtcaagcgg 6060 ttgtgcaaac ctccaaaggt gtactacgac gatccgtcgc aagaatggcg gtccttgatg 6120 tggccagaga agatggaacc aaggaagatg aagggtccca tcggggggga gga 6173 // ID hATw-4_BF repbase; DNA; INV; 5763 BP. XX AC ABEP01038455.1; XX DT 13-JAN-2009 (Rel. 14.02, Created) DT 13-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Branchiostoma floridae. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5763 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Branchiostoma floridae."; RL Repbase Reports 9(2), 517-517 (2009). XX DR EMBL/GenBank/DDBJ; ABEP01038455.1; Positions 21044 26806. XX CC TIR is 11-bp long (2 mismatch). XX FH Key Location/Qualifiers FT CDS 1140..4151 FT /product="hATw-4_BF_1p" FT /translation="MATNGDILQFCGQHKRLKVREKMELLKETFTSIDSRV FT GTKVLWSSISRLAGKRKKLSKDKKELKSLLNQTFNSPKQNQTPQSRAPWTP FT RKKKLAKHLEEARKANKHLKKKLGSAEEWKNNMQCVQTELETLISEYTAAV FT QECDKITEDTEEKMKLMNNNNGELEKYIKEIENAFDSQSKKLQDATEKLSK FT WNVRNFNKREKSKKKKISICEEKLKHQDTLIAEMMARIKDLEVTNKKLTTE FT KVHLQKKASTLKAKLKMESPQAIVCANTEKIDKLEQEKCELEEENTMLEDT FT VKAMKTLETKHDKGYTNQIREVVIDLLSKNVSVKHVSSVIRTVLEKLAKMN FT ITGHLPSETLIRTLAVEGRMLSVIQAGKAMLEGHATLHNDGSTQEKKKFNT FT TSVTTPSGTLSLGLYEMAEENAQAVFEGNKQTLKEVAEILAKFDSTDPEME FT LTNLLSMITSTMSDRGSVMKAFNSAFEEWRGEIIKDTSSSVEKVNNFYCYM FT HVLINMASVALDALKCFDVCAIEKKGMPTKNGRAVTEDALYAACRTFQKQC FT DERTGQAAAFQAFLYGKDPGRKCQFSKLIGNRCNIQFVNGAAFFFHLEDMR FT EFLEIWKGGGTLSMSHSCLQQHLSNKVAIAGCRALGILEKRVTTPMWREIL FT AATSILDLNRQSLTMRAEFLRLSEDASELLVPDNKTIFPGREVVDEDDVCH FT ALLKDSEDAELDALTISALELIFSSWVILIEKQLTDYLPGGMYDNPDENLK FT QKSKHVPTTNVTCERSIGYRNRLHQLKPSAKTLHIESLLLLSTNHTVDWLG FT KLSEDDKERALETARKAAPGCIKKQKEREQEILSHRLELLRQKQDEKERKI FT RRVSQEKSKICEAVRQDGGEWKTPEEVEQRLCECATDSKKKEKILAQLRFH FT KVVLEDPLPTDKRYLVQQQMTVNGSVKKFSMSELKENLLELIGMNTLRCNN FT NNEEDEDEDSDQEVEQGTKYMATDRRLELIQAEKAKLKKKRDKQRERL*" XX SQ Sequence 5763 BP; 1958 A; 1002 C; 1259 G; 1544 T; 0 other; tagcgagcgc aattgccccc tatatgccgc gcggaaaatg actcccacgc gttacgggcc 60 gcgatcggcc aactaaaaaa attcttgaaa aacaacttta atactcacaa tcaccaaatt 120 acttgtgctt ttcaatcaat atcttatctc tatgtgtact aaatttcttt taaaaacaag 180 ttaccgatgt ttagttagag cgtgattaac gggcagggcc acccggccgg cgagctacgc 240 ccgcggcggg caccatttcc gccgaacgca aaaatgcgtt ccacactttg accccgtttc 300 tgccggatat accgaggatt atggtccata gaaacttcat gaatgcatta tagagactgt 360 ggggagtatt ctggtgtgtg ccgcaaattt ctacgtccaa ggtaagtcta tttatgacta 420 ttttaaaaac tttgaccacg atttttcccc cgatagaatt aaattatcta tttgcggcac 480 cggaaatacg atagaacatg ttcttagctt tattttgata ccagatttat aacatttact 540 ttggtatata cacggcggaa atcgctattt tcagtgctgt tctttggcgg gaagccgcgc 600 cgcggggcgt gggagggggg cgtggcgaac gcagtgttat tcactggctc ctcacacgtt 660 tcttgcatag agaattaagt tagtgcgtac gtactatgag gctagatggg atttagtttt 720 cagaatcctg tctgatatct tttggtaagc catttaatta taactttaca tttcttctca 780 gttatgatgc cagagaacct gttgtgacta gtgcctagta gtggacttct cttctatctt 840 cttcttctct gtacttcttc aaatcataca agtgacacag agggaaggaa ttggacaggt 900 gagacaatga tttgcaagtt tcttttttct ctgttgttga aaattgttat tcaatttcac 960 aacaatataa aggaattaaa acactttctt cacaatcatg tacatctatt taggtaaagg 1020 aatagttcag ttgtgaattt ttgtactatc attcagtgtt gcctgaagat gttttctttg 1080 ttttattatc ttgtatatta tatcttgcta attatttttg gtgattttac ataggcacca 1140 tggccacaaa tggagatatc ctacagttct gtggacaaca caaacgactc aaggtgagag 1200 agaagatgga actgctcaag gaaacattca catccattga cagcagggtg ggaactaaag 1260 tactctggag cagtatttca cgcctagcag ggaaaaggaa gaaattgtcg aaggataaaa 1320 aggaactgaa atctttgtta aatcagactt tcaactcccc aaagcaaaac caaactccac 1380 agtctcgagc tccatggaca ccgagaaaga agaaacttgc aaagcatctg gaagaagcaa 1440 gaaaggccaa caaacatttg aagaagaaac tgggtagtgc agaagaatgg aagaacaata 1500 tgcagtgtgt gcaaacagag ctggagactt tgatcagtga atatacagct gctgtgcaag 1560 aatgcgacaa gatcactgag gacacagaag aaaaaatgaa actgatgaac aacaacaatg 1620 gcgagctgga aaagtacatc aaggaaatag agaatgcttt tgactctcaa tcaaaaaagt 1680 tacaagacgc aacagaaaaa ctgagcaagt ggaatgtaag gaacttcaac aaaagggaga 1740 agtcaaaaaa aaaaaagatc tcaatctgtg aagaaaaact gaaacatcaa gacacactca 1800 tagcagagat gatggcaagg attaaggact tagaagtcac caacaaaaag ctgactacag 1860 aaaaggtcca cctacagaag aaagcctcaa cactgaaagc aaagctgaaa atggaaagtc 1920 cacaagctat tgtgtgtgct aacacagaaa agatagacaa actggagcaa gagaagtgtg 1980 agctagagga agaaaacacc atgttagaag atacagtgaa agcaatgaaa acacttgaaa 2040 ccaaacatga caagggatat acaaaccaga taagggaagt tgtgatagat ctgctgtcca 2100 aaaatgtgag tgtcaagcat gttagttctg tgataagaac agttttggag aaactggcaa 2160 agatgaacat cacaggacat ttaccttctg agactttgat cagaactttg gctgttgaag 2220 gacgaatgct ctcagtgata caggcaggca aagccatgct ggagggtcac gcaacacttc 2280 acaatgacgg gtcaacacag gagaaaaaga aattcaacac tacaagtgtg accactccat 2340 caggaaccct gtcactagga ctctacgaga tggccgagga gaatgcacag gctgtgttcg 2400 aaggcaacaa gcagacgctg aaggaggtag cggaaattct ggcaaaattt gacagtactg 2460 acccagaaat ggaactgacc aatctgttat caatgataac atcaacgatg tcggacagag 2520 gttcagtaat gaaagcattt aacagtgctt ttgaagagtg gaggggggag ataatcaagg 2580 acacgtcaag ctcagtagag aaagtcaata acttttattg ttatatgcac gtccttatca 2640 atatggcttc tgttgcactt gatgcattaa aatgtttcga tgtgtgcgcc attgaaaaga 2700 aaggcatgcc aaccaaaaat ggaagggctg tcactgagga tgcactgtat gcagcttgtc 2760 gcacgtttca aaaacagtgc gatgaaagaa cggggcaagc ggcggctttc caggctttcc 2820 tgtatgggaa ggacccaggt agaaagtgcc agttttccaa attaattggg aatagatgca 2880 acatccaatt tgtgaatggt gcggccttct tctttcactt ggaggacatg agggagttct 2940 tagaaatctg gaaaggtggt ggtaccctaa gtatgtccca ctcttgtctg cagcagcact 3000 tatccaacaa agttgctatc gctggatgca gggcgctagg tatcctggag aagagagtga 3060 ccacgccaat gtggcgagaa atcttagctg ctaccagcat cctcgatctg aacagacagt 3120 ctctcacaat gagagcggaa tttcttagac tatcagagga cgcatcagaa ctacttgtgc 3180 ctgacaataa aaccatattt cccggccggg aggtggttga tgaggatgat gtgtgccatg 3240 cccttcttaa agattctgag gatgctgagc tggatgctct caccatcagt gccctggagc 3300 tgatcttctc gtcctgggtg attctgatag aaaaacagct caccgactac ttacctggag 3360 gaatgtatga taatcctgat gaaaacctga aacagaaatc taaacatgtt cccacaacaa 3420 atgtcacatg tgaaagaagc attgggtacc gaaataggtt acaccagctg aaaccatctg 3480 caaagacctt acacatcgaa tcactgcttc tactctctac aaaccacaca gtagactggc 3540 tgggaaaact gtctgaagat gacaaagaac gtgcattaga gacagctcga aaagctgctc 3600 caggatgcat caagaagcag aaagaacgag aacaggaaat tctgagccac cgcctcgaac 3660 tgctgaggca gaaacaagat gagaaggaaa gaaagataag aagggtatcc caggagaaat 3720 ccaagatttg tgaagcagta aggcaagatg gcggtgaatg gaagactcca gaggaggttg 3780 agcagagact ctgtgagtgt gcaactgatt caaagaagaa agagaagatc cttgcacagc 3840 tcagatttca caaagtagtt ctggaggacc ctctaccaac agacaaaaga tacttggttc 3900 agcaacagat gactgtgaat gggtcggtga aaaagttttc aatgtcagaa ctaaaggaaa 3960 acttgcttga gctgataggt atgaacactt taagatgcaa taacaacaat gaggaggatg 4020 aagatgagga ctcagatcag gaagttgaac agggtacaaa gtacatggct acggacagac 4080 gactagaact gatacaggca gaaaaagcaa aactgaagaa gaagcgagac aagcaaagag 4140 agagacttta ggaatttgtt tgattatatt tgttgtaata tggctttgaa cagaagactt 4200 gaactgataa agagcaaaag tgaagaagcg agacaagcaa agagagacag taggaatttg 4260 ttaatatttg atatgatatt ttagatacaa tagtattcac atatttctgc agaaaaaagg 4320 tatgacttgt aaaaatgatg taattaatat gcaaaaaagt aattgatgtt aattatttat 4380 ttatgaagtg gatgacaaaa agttaaataa gtaatcttga agtaattaag tcaaaaacag 4440 ttgaaaacat agtataagtg catttttttc tcaaagccat gaattgattg gattattatt 4500 tattatatgc attattggtg taattaatat gcaaatatgt taaatgatta tacaacaaga 4560 attggtatga taacttgaac ttgctggtca cccatgctgt gtcagctgac aattgtataa 4620 tatatgtatt tttaaatgta tcttctgtaa tttatgaaag ctgcagtgat tagaatgctg 4680 aaattggatt atttatatta tctaaaacac taatcatata ctaattagaa tacttaagat 4740 catctatttg ttagctatat tgtgacagct gccagttcca ttactctgtt tagaggtggg 4800 cataaagaaa gtgtgtatgt agatgatgtc tatccaattt gaaggttttg tctaattaat 4860 atgcaaatat gcaaatcagt cattacataa gaatttaaat acaactgaca ctttgctgat 4920 catgtagtga cagatgtctg aggtaacctg agagagaaat gtagcctgaa tgtgtatagg 4980 atacctgaat atagtatgtg atattaacct aattagtatg cagacatact aatttcaata 5040 attaaatatt gcaaaacact aatcatatac taattagaat acttaagatc atctatttgt 5100 tagctatatt gtgacagctg ccagttccat tactccgttt agaggtgggc ataaagaaag 5160 tgtgtatgta gatgatgtct atccaatttg atggttttgt ctaattaata tgcaaatatg 5220 caaatcagtc attacataag aatttatttg caactgacac tttgctgatc atgtagtgac 5280 agatgtctga ggtaacctga gagagaaatg tagcctgaat gtgtatagga tacctgaata 5340 tagtatgtga tattaaccta attagtatgc agacatacta atttcaataa ttaacataat 5400 catttactaa ttatagacta agtgtactgt ctacttgctg gatttggtta gacagttgca 5460 tcaacttgaa tgtctgttgg tgttaagaaa ataaatcatg ctaatgatgc atatagtgtg 5520 tgtgtagtgt attgatttgc tctaattaca ggtccctccg gtgccatttg ggccccattg 5580 aactaatgat taaggttttc cacaaaaata cccaaattaa ttacttcaaa atttaatttc 5640 tagacttaag ttcatccgat ttgaaatccg aaaacggcat tttgttcggg gctgcaagag 5700 ctataattta agatagattt cactgtgtta tcacattagg aaaaaaatca cctgtgctag 5760 cta 5763 // ID Mariner-37_HM repbase; DNA; INV; 4851 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-37_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4851 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 395-395 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1469..2326,2330..2986) FT /product="Mariner-37_HM_1p" FT /translation="MVKTRGGGCRSSPRFHRMSVGKIASVTEATICYNTTH FT LHDGITPSTSGSSTLDICQMKQLPTIDVSLTRVNGSNHSIDQSVTVKKRGK FT FLKWSEENMMSAIKHVRSEPSFSVRKAAILYNVPRTTLRDRLSGKVVIGAK FT PGRKPTLPFALEEKLIDYASNRAKLGVGFGKSQFLNYAGKLAAKHKFNFKK FT GLPSNQWWTRVKHRHGHKFTLRQPEGTASVRHQCMDKDKVSKYFSALNDVM FT VKNGFHVKPESIWNMDETGLQLDVKPRKVVARKGTKNLHSRTSGNESITVI FT ACVNAQGKFIPPQVIVKGKTSRSLWGFNTEFAPPGTNWSWSDSGWTKQGLA FT KLWFTKTFLENIGPQRPQLLILDGHDSHNFIELIDIAIQNQIHIVEMPSHT FT SNWLQPCDRTLFKPLKDYYRSTAQDLMGQFPGIITCRSNFAGIFASAWEKA FT MTSSNITSGFKLCGIYPFNPQAIPSDAYLPNYLHIAEDISVNTTENNCSLV FT SIVLSCFC*" XX SQ Sequence 4851 BP; 1662 A; 822 C; 795 G; 1572 T; 0 other; gggtaacttt acataatttc gcccatcact caatctcgcc caccatcttt aattgcttat 60 atgtcagtta actcacacgc tgcaccatca aaaagacgtt cttttagagc agcaaaccag 120 ccttatcaac atcaaaacaa ctttgacctc aaaagaattg caaataaaaa agtatctctt 180 aaagaaaatt tgttggtttt cggtaatttt tttgctactt aatttgccga attcaaacca 240 gctgatgcaa atctaccgag ttaattattt attatgcatg ttaaattata atttaatttt 300 gttctccact gtttaatctc tgattgtaat aacatttata atcgtttgaa atggtagaat 360 atgcataaaa ttgttttctg ctcaacctac caaattaact caatctcgcc cacagaataa 420 atatattgtc gcccaatgca tttaatctcg cccgggtggg cgagaataga tgcagcataa 480 tatattaaga cttacgctac cagcagtttt aacaaaaaaa acctttcgcc gtgaccccca 540 ttattttttc attaaattgg gatgacatat ttaaaaaata attgaaaaac aaaatcataa 600 aatttcgtaa ggatacatat taagagcata atctttttga atgcttacaa aggagtattt 660 tttatgtatt tttattctat atattattga ggaaattatt ttataaatta attaccctta 720 attcaacatc cttggcatta tcgtaaaaat gaaccccaaa cttacttgat aattgctttg 780 acaaaaattg atgatatcca cgttaattta tttttgattg ccttcaaaac aatactcttc 840 ctctgtgtat tgttattaag gaaattaatt aaaaaataaa ttgctatggc tctgcgttga 900 gtaattgtat aaaaagctca tgttcatcaa aaccaaataa attagctaga taaagaacaa 960 agaaagtatc gcaaataagt aaactttgtc tgctattgca ttttcttttc ctgataatta 1020 ttggtaagaa agaatttgga attttatctt tttatgctat gttttttttt ttagaaataa 1080 ttttttagtg tttagaaata atgcatattt tatttacgat tgttttattt attttcaatt 1140 taagaactaa tgttttagtt attaaaataa ctaaaacatt aggtccaaaa cgttaggttc 1200 aaaaggtcca aaactcattt tatgcataat aagaatgttt ttattcttct ttaatagact 1260 tttgtaatta cttagctaaa tgcttattta cttcattgta aattaagtac tgcgaagttc 1320 tagggttggg attaggttac aaatactaac atatggtttt caccacgtgt taatatttgt 1380 agctcttgta aatttggagc actttattaa attaatatga cattcatact gcaatttaat 1440 ttatctaggt attgtcttga aaaaaaacat ggttaaaaca agaggtgggg gatgtagatc 1500 atcaccaaga tttcaccgga tgtcggttgg aaagattgct tctgtaacag aggctacaat 1560 atgttataat acaactcatc ttcatgatgg tataactcct tcaacatctg gtagtagtac 1620 tcttgacatt tgtcaaatga aacagctacc aacaattgat gtttctttaa ctcgtgtcaa 1680 tggttccaat cattccattg atcaatcagt cactgttaag aaacggggca aattcctgaa 1740 atggagtgaa gaaaatatga tgagtgcaat taagcatgtc agatcagagc catcattttc 1800 agtgcgtaag gcagcaatat tgtacaatgt gccacgtaca acactgagag acagactttc 1860 tggaaaagtt gttattggag caaagccagg aagaaaacct accctacctt ttgctctgga 1920 agaaaaactt atcgattatg ccagcaatag agcaaagtta ggggtaggtt ttggaaagtc 1980 acagttcctg aattatgctg ggaaacttgc cgcaaaacat aaattcaact tcaagaaagg 2040 acttccaagc aaccagtggt ggactagggt taaacaccgg catgggcata aatttactct 2100 gcgacaacct gagggtacag cttcagttcg acatcagtgt atggataaag ataaagtgtc 2160 aaagtacttt agtgctttga atgacgtcat ggtgaaaaat ggttttcacg ttaaaccaga 2220 atccatatgg aatatggatg aaactggtct acaacttgac gtaaaaccaa gaaaagtggt 2280 tgcacgaaaa ggcacaaaaa acctccatag tcgaacgagt ggaaattgag agtcaatcac 2340 tgttattgca tgtgtgaatg cacagggcaa gttcattcca ccccaggtta tagtgaaggg 2400 aaagacatcc aggtctttat ggggttttaa cactgagttc gcgcctccgg gaacaaattg 2460 gagctggtcc gatagtggtt ggactaaaca gggcctagca aaactatggt ttaccaagac 2520 atttttagaa aacattggtc cacaacgtcc tcagttattg attctggatg ggcacgattc 2580 tcacaacttt attgagctaa ttgatattgc tatccagaac cagatacata tagttgaaat 2640 gccttcacac acatcaaact ggcttcaacc atgcgaccgc accttattca agccactgaa 2700 ggattattat cgctctacag cacaagatct gatgggtcag tttccaggaa ttattacttg 2760 tcgttcaaat tttgcaggga tatttgcttc agcttgggaa aaagcaatga cttctagcaa 2820 cattacatct gggttcaagt tgtgtggaat ttatccattt aatcctcagg ctattcctag 2880 tgatgcatat ttaccaaact atctacacat agctgaggat atatcagtta atacaactga 2940 aaacaattgt tctttggtta gtattgtact atcgtgtttt tgctaatact actatacaat 3000 aaaaattatt atttaaactt tagctagcaa ctgtgactaa tattttatgt acaaattgat 3060 aaaaattctg aatgaagcat aaagtttact aacttgcatt tcaatagtat tatttattat 3120 tttcttagga attccaatca tcaattgttc aagaaattaa tttttcagac gttgatgctt 3180 ctatagatgc tggaaaagtt gctaattatc tagaacaatt acaaaatggc actgaagaag 3240 ataacgcaac tcacagttcg ttacaaagca acacttctat tttggaagca gagactgttg 3300 ttcgtaataa tcagcttgca tatttagtca atccagttga agaaaaaaat gttgactgca 3360 ttgtgaatag ttttaccaat cttactaagc aaaaaaaaaa agaaatctta gaaccatcaa 3420 actttaattc tactacagtt tcagattttc cgtttgcaaa ctcttcctac cccctggtaa 3480 gtcaatcatt ttacagtttt ttttcttaat ttcatttgat caagtgatca atagttgcat 3540 agcaacaatg caacttttaa taaattttta gttttaaaat aattcaaatt ttttaaaatt 3600 atttaaatac ttgttctttt attgaatata tatgttttta tcaatgcagg tgatggagat 3660 gatgatgtct tggcatatcc taaggccacc gcacggaaaa agatagcgaa cagaaaaggt 3720 cagcttaaat acttcgtttt aacttctcaa gaagcatact cagcaaagct gaaatatcac 3780 caagacaaaa tccaacagga aagagaaaaa tgcgaaagaa aacaactgcg atttgttaat 3840 gctgctaagc gactcaaaga aaaagaagaa aaaaaggtat tggctaagaa aaaaaataat 3900 tagaaaatcg tgccaacaag tctatagcaa caacaaaaaa gaagttcatt tcattatcat 3960 ctgaattgca accattggtt gatcccgaga acaatactga ctaatatgta ctaaatattt 4020 gacaatattt ttttatataa ctaaaaatgc atacaatttc gtcttatcta tataaacttt 4080 tttattattt ttatttaaga atactctaat ggtcttattt gtgataagac ctaatggtct 4140 tatcactaag caccgccaca agggcttaac catttaatga gttggttccc catttcgagc 4200 tttaccgata acgtataagt gtttcgagca ggattcgaac tcgcaaccct tatacaatga 4260 gtaccatgct ctaatcacta cgccacggct gctcaatata aaaattaaac tcatgactgg 4320 ggcaaaaaga agacaatatt tgtcatatca ctacggctcg taagcattga tgtgttgaaa 4380 ctatacttta tttacacatt ttacaattac aattgagatt gatctcagaa tccagattca 4440 agtcagaatc tagggagaaa gttcgcattc acaaaaaccc catggtttta tacttttcta 4500 attaaaatgg tgcaattgag catagaaatt aactaacgag atttaatctc actcacctgt 4560 atataacgtc gccctacgca agtataagtt cttttttaat ttacacagtg ggcgacttta 4620 aatcaacatg gcgagattga gttcaagtgg gcgagaatca gtgcttttga cacactgtac 4680 aatttataat ggtactgcca actactctca taaaatctaa ataattttac atagttttgt 4740 agctgaagga ataagctatc taatgatgta tagttcagct gtgcaagtct aataatagcc 4800 tgtcaacatc aatttcagtg cggatgtggg cgaggttcga ttaagttacc c 4851 // ID TE-2_CQ repbase; DNA; INV; 1334 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A transposon family from Culex quinquefasciatus - consensus. XX KW Transposable Element; nonautonomous; TE-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1334 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 96-96 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. ~18bp TSDs. XX SQ Sequence 1334 BP; 442 A; 249 C; 273 G; 369 T; 1 other; aattcgcaat tcgcaatttt cagtgtaaga acgggccttg accgatctta tgcactaggt 60 tcccgacgaa cacgcactgc ccttacacct acatctcacc cttgctctga gtcagtacga 120 gcagcacgct agaacacgct ttgagtgttc gtgccaggca tgcacacctt cttttccggt 180 tacgcatttt aactcggccg ggggtggtac attacgtagg gtttgatgta agtataagcg 240 cctaaccatt tatagtgtgc ctatcaactt tcattaaagc aaaaactgtt ttatttttag 300 tttgaattca aaaactaatt gttatttact gtgttttgtt ttctcctgaa atcttcccta 360 gtgttgagtc gtgtttatat gttgctattt cttttgtcgc ggtgttttgt tacaattttt 420 ggtcctaagc attttataaa agtttttcaa aagtacaata gtaatatttg tgttaattct 480 ttgatcattc caaaaattga gtaagggctc gaacctcatt tgtttgaaga aaagggtgaa 540 gattgaaaac aatggacagt gaaagaaata tactatktga agaatcaata gtgaaagaaa 600 gatagtaatt tatgaagtag ataaagaaat taacaatagt taataaatag aggtaacaaa 660 caagcttaga ttgtattgag ggcaatttac agggaagatg agaaattatt caaatagata 720 aataaagaaa aggcacatga taaacaaaat tgacatcgct gccaaattaa gggaaagcag 780 aaagtttgtt acagcagcat caattggtaa aatagagtgg aaaagcgttg agaaataaga 840 gaatcaaatg agtaaaatcg aaatcaaatg gaggatagta aacagaagcc gcgttagaaa 900 atcagtgaaa caaaataaac agaagatagg tggtagctgg aatgaaaaga agagaaaccc 960 cgttgtgcga tgtctcaggt acatccacag cagttgactc aacatactgc gaatcaaaac 1020 aataccgtct acaatcacaa attacttccc tttcccacat tgtcgcgccc ctttctttta 1080 gccgtctcga ctctgaccca cggtggtcaa aggtttacga tccgtttcca caggccacca 1140 gctaggtcat catgaaaacc aagccaacgt ggaggtaaga aaataggaca cttgcaagga 1200 accagagcta tactgtcttg atcaagaatg atctaacagg actacggacg tagtcagtga 1260 cgctccttcc aggatgttcc tcgcctgagc gtcaactgaa gaggtatcat cagcatcatt 1320 cgcacttggc ctgc 1334 // ID Gypsy-618_AA-I repbase; DNA; INV; 4623 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-618_AA_; KW Gypsy-618_AA-LTR; Ty3_gypsy_Ele27; Gypsy-618_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4623 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3504-3974] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1332..2540 FT /product="Gypsy-618_AA-I_1p" FT /translation="MVNGKTVRMQLDTGSDVTIISSKTWKKIGSPQMNSSS FT IVAKAATGKPLELLGEFSCDMSIGEKNMRSLVRVSKENLHLLGSDTIETFG FT LWSVPFDTLCGKVSSESMCVDALKQKFPEVFSEQLGLCSKTKVKLELKPGK FT TPVFRPKRPVAYAMCQAVNDELDRLEQDGIISPVDYAEWAAPIVVVRKANG FT SIRICGDYSTGLNDALQPNQYPLPLPQDIFASLANCTVFSQIDLTDAFLQV FT EIDEGSRQLLTVNTHRGLYQYNRLPPGVKAAPGAFQQIIDTMLAGLPCVSG FT YLDDVVVGGIDEADHKKNLEAVLQRLKEFGFTVRAEKCTFGKEQIGYLGLL FT MDRHGLRPDPAKIETILKLPVPSDVSGVRSFLGAINYYGKFVPNMRSLRFP FT LDELLKEGA" FT CDS 2823..4610 FT /product="Gypsy-618_AA-I_2p" FT /translation="MIFGRKFRLQTDHAPLLRIFGSKKGIPVYTANRLQRW FT ALTLLLYDFVIEYVPTHKFGNADVLSRLIDNHAKPDEDYVIASVILEEDMR FT SVVNEASSVFPLSFKVIEKDTQSDPKLRKVYRYLQDGWPQGTKIVDAEIGR FT FHGCQESLNTVGGCLMFGERLVIPEKHRSRCLRQLHQGHPGIQRMKAIARS FT FVYWPSLDEEIVAYVKSCKHCAAVAKSPPHSPPVPWPKPVGPWKRVHVDYA FT GPIEGDYFLLAVDAYSKWPEIIPTRRITTAATISILRGLFARLGMPEVLVS FT DNGTQFTSEDFQKYCANNGIHHVTTAPFHPQSNGQAERFVDTFKRSVKKIQ FT EGKSTSSLQEALDVFLLVYRTTPNRQVVDGKTPSEAMFGRRIRTNLDLLRP FT PPVSPAVLAKSEDDSNKRCFATHDPVYAKLFTKNKWRWAPGVVCERIGHVM FT YSVWIEDRRLIRAHINQLRSRSSAAGSEQVENGRRDRSQLPLEILLDSCSL FT PKHSLQPTAPPAEQPMTPLQSSSASHEIQSSQQFDDQASPRTSVRSSGSSS FT TPSSTTSLSGASPEFASASSGPATPIQVQPRRSSRTRRPPRWFDPYRLY" XX SQ Sequence 4623 BP; 1227 A; 1073 C; 1247 G; 1073 T; 3 other; taaaagtggc gacttcggaa gtggagcatt agtgtagttt tgcgctgcaa aattcaggca 60 aaacgtaccc cccacgcaag aggaggagaa tgtcgtccgg agacaatggt cagcctaatc 120 cggtcggtaa tgaccaacag ccgttgggaa atggcgatgt tagacaacag cagccgtcca 180 ttcgtcctcc actacttcaa caagtcccac cgcgtctgcc accgcctacc gatccgtata 240 tccagtggtt ccagcagcaa caacagcaat acgtcacgga gctgtttcgt caacaacaag 300 aggcaattag tcgacagcaa gaagccatga atcagcagca acaaatgttc atgcatcagc 360 aggagcaatt acttcggagc atcatgactt cgatccatgt acaagtgccg ccgaatccgg 420 aggcaatact ggactcgctt gccaacaatg tgaaggagtt caggtatgat ccggagagca 480 gcgtaacgtt ttcggcgtgg tacaagaggt atgaggactt attcgagaag gatgctagtc 540 ggctcgatga gagtgcgaag gttcgtctgc tcatgagaaa gctgggaatg tccgagcacg 600 agcggtatgt gagcttcata ttgccgtaaa gccccaacaa gacttttcgt tcgaggaaac 660 agttaaaaag ctcaattcgc ttttcggcgc ctcggagtcc gtcatcagcc ggagataccg 720 ctgtctacaa atcgtgaagc agcccacaga ggactacgtc acgtacgctt gtcgcatcaa 780 caagtcgtgc gttgagttcg agctcacgaa actaacggag gaacaattca aatgcttagt 840 tttcgtttgc gggctgaagt ctgacagaga cgccgaagta aggacacgct tgttgactcg 900 catcgaagaa cgtgatgacg tcacgttgga gcagatttcc gaggagtgcc aaagattagt 960 gaatttgcgg cacgacacag aaatgattga gaacccggca tcagtgaacg caatgcgatc 1020 agtggaagca attcaagaag agttccccga agttcaagcc gaattccagc agtgagaaga 1080 gaaacggaaa tagtgctaat ggtccggaag tgttatgttg gctgtgtgga gcaccgcatt 1140 atgcgagaga ctgttcgttc aggaatcata agtgcggtga ttgttctaag accggccaca 1200 aagaaggata ttgccggagt gccgagaaag tgaaaactgg aagatttggt cggcgtaaag 1260 gcaatgcttc tacaaaggtg gtgacagtga atgcttgcag tgtgcagaag cgtaggaagt 1320 tcgtttcagt gatggtgaac ggcaaaacgg tgagaatgca gttggatact ggctccgacg 1380 tcaccattat ttcgagcaaa acgtggaaga agattggaag tcctcagatg aattcgtcat 1440 cgatagtagc gaaggccgct actggcaagc ccctcgagct gctgggggag ttttcgtgcg 1500 atatgagcat cggtgagaag aatatgcgca gcctagttcg agtgtctaag gagaatctcc 1560 atctgctagg gtccgacacg atagaaactt ttggactgtg gtcggtgccc ttcgatacgc 1620 tttgtggaaa ggtgagcagt gagtcgatgt gcgtagatgc gttgaagcaa aagtttccag 1680 aggtgttttc tgagcaatta ggtctctgtt ccaagacgaa agtgaaactg gagctgaagc 1740 caggcaaaac cccagtattt cgtccaaaac gaccggtggc atacgctatg tgtcaggctg 1800 tgaacgacga actagaccgc ctagagcaag atggaatcat ttcaccagtg gattacgcag 1860 aatgggcagc tccaatagtg gtggtccgta aagcgaatgg atccattcgg atctgtggcg 1920 attactccac gggcctgaac gatgcgttgc agcctaatca atacccgctg cctctgccgc 1980 aggatatttt tgcgagtctg gctaactgta cagtttttag tcagattgat ttgactgatg 2040 cattcctgca ggtagagatt gatgaaggat caagacagct gctaacagtg aacacacacc 2100 gcggcctgta tcagtataat cggctgccgc cgggtgtgaa agcagcgcca ggagcttttc 2160 aacaaatcat tgacacgatg ctggctggtc tgccctgtgt tagtggatat ctcgatgacg 2220 tggtggttgg tggaatcgac gaggccgacc acaaaaagaa tttggaagcc gtgttgcaga 2280 gattgaagga atttggtttc acagttagag cggagaagtg tacattcggc aaggaacaaa 2340 tcggctattt gggattgcta atggatcgtc atggattgag gccagatcca gcgaagatcg 2400 aaaccatttt gaagcttcct gtgccctcgg atgtcagtgg tgttcggtca ttcttgggtg 2460 caatcaatta ttatggaaag ttcgtgccca atatgcgatc tttgaggttc ccgctcgacg 2520 agttgctgaa ggaaggagcc amgtttcagt ggacagcgga gtgtcagmat tcgttcgatc 2580 gtttcaagga aattttgagc tcagatcttc tgctaactca ctacgatcca cagcgtgaga 2640 taatagtgtc tgccgatgcc tcatcagttg gagtaggtgc cacgatcagc cacaaatttc 2700 ckgatggctc agtgaaggtg gtgcagcatg cggccagagc gctaacgaag gcggagcaac 2760 gctacagtca gccggaccgc gaaggactgg caatcatttt tgctgtgaca aaatttcata 2820 aaatgatctt cggtcgcaaa ttccggctcc agacggatca cgcacctttg ttgaggattt 2880 ttggatcgaa aaaggggatc cctgtgtaca ccgcaaatcg tctacaacgg tgggcgctaa 2940 cgctcttatt gtatgatttt gtcatcgaat atgttccgac acacaagttt ggaaatgctg 3000 acgtcctctc gcggttgatc gataaccacg ccaagccgga tgaagattac gtcatcgcca 3060 gtgtcatcct agaagaagat atgaggtcag tagtgaatga agcttccagt gttttcccgc 3120 tcagtttcaa ggtaattgaa aaggacactc agtcagatcc gaagcttcgt aaagtttacc 3180 gttacctgca agatggttgg ccgcaaggga ccaagattgt tgatgctgaa atagggcgat 3240 ttcatggctg ccaggagtcg ttaaacacag taggtggatg ccttatgttt ggagaacgac 3300 tagtgatacc tgagaaacat cgaagccgat gcctgcgaca gcttcatcag ggacatccag 3360 gtatccaacg aatgaaggcc atcgctagaa gttttgtata ttggccttcg ctggatgaag 3420 agatcgtcgc atacgtcaag tcctgcaaac attgcgctgc tgtggcaaaa tcaccgccgc 3480 attcgccgcc agttccgtgg cctaagccag taggtccatg gaaacgggtc catgtggact 3540 atgcgggacc gatagaggga gattattttc ttctggccgt tgatgcgtac tccaaatggc 3600 cggagataat tccaacacga cgaataacaa cagcagcaac gattagcatt ctgagaggtt 3660 tgttcgcacg ccttgggatg ccggaagtgc tcgtcagcga caatggcaca caatttacga 3720 gtgaagactt ccagaaatat tgtgccaaca acggaattca ccacgttacc acggctccgt 3780 ttcaccctca atccaacggg caagcggaga ggttcgtgga tactttcaaa cgttcggtga 3840 agaagattca ggaggggaag agtacgagtt cgctacaaga agccttggac gtttttctgc 3900 tggtttatcg gacaacccca aaccggcaag tcgtcgatgg aaaaaccccg tccgaagcga 3960 tgttcggacg ccgcatacgt acgaatttgg atttacttcg tcccccgcct gtttcgcctg 4020 ccgttctagc caagtcagaa gatgactcga ataaacggtg ctttgccaca catgatccag 4080 tgtacgcgaa gcttttcacc aagaacaagt ggcgctgggc tccaggcgtg gtctgcgaac 4140 gcattggcca cgttatgtac agcgtatgga tagaggaccg tcggttgata cgcgcacaca 4200 ttaatcaact gcgcagtcgc tctagcgccg ctggttcaga acaagtcgaa aatggaaggc 4260 gtgaccgatc tcagcttccg ctggaaatcc tgcttgattc ctgtagtttg cctaaacatt 4320 cgttacaacc gactgcaccg ccagcagagc agcctatgac gccactgcaa agttcgtctg 4380 cgtctcatga aatccaatcg tctcaacaat ttgatgatca agcaagtccc cgtacgtctg 4440 ttcgaagctc aggatcgtct tccacaccat cgtcgacaac gtccttatcc ggagcatccc 4500 cagaatttgc ttcagctagc agtggacctg caacgcctat acaggtacag ccacgccgct 4560 cttcaagaac ccgaaggccg ccgcgttggt tcgatcccta ccgcctgtat tgaaagaggg 4620 gga 4623 // ID Gypsy-21_CQ-I repbase; DNA; INV; 4318 BP. XX AC AAWU01010825; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_CQ_; KW Gypsy-21_CQ-LTR; Gypsy-21_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4318 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 421-421 (2011). XX DR GenBank; AAWU01010825; Positions 48709 53026. XX CC Positions [3456-3929] - Integrase core CC 'TGTA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 261..4318 FT /product="Gypsy-21_CQ-I_1p" FT /translation="MYDREKLQKLTVKELRELCAKKKVSVSGRKAALVNRL FT LETVRESDSESDDSLFSALDVMSSTKKEKKNATAAKKEDAPRASQFTFNDI FT GDSLEKFSGESADSDVCEWLEEFEKACETFGWNDVQKFVYGRRLLSGTAKL FT FVQSSTGLNDWGTLKNALEEEFEVKVSSAEIHELLRGRKKKSDESFLQYIY FT HMQSIAKRGHIDEETVCEYVVRGIVDDPVNKVGLYGARTLAELKERVKQYE FT KMKSQMRDVSRSAKVFVKSERAKKSETPRTARSEKSESAHTASDGVRCYNC FT GSRGHYANDCEAKSRGPKYFVCSEYGHRARECPKKEVRERKEDSELHTVAE FT TKMPVVEIAVGNSLLRTLFDTGSRYNVLCVSAHKKIGEPPMNTTPMIFRGF FT GAAETRACGTVTIDVCVDGETYPSMMFYVVPTGSMSYDAILGMQALHHLDV FT CVKPDGVVFKKKCEAQEEVESDEVNDLMALTSGKSGVTLDVAPRYAAEVGK FT LVADYKPKSEVRSRVETRIILSDETPIHIAPRRFAPKEKKILEKTIDEWLK FT AGIVKESVSEFASPVTLAPKKDGSMRVCVDYRQLNKRIVKDCFPMRNIEDQ FT VDKLKSAKVFTTLDLKNSFFHVPVEVSCQKYTNFVTHAGQYEFEKTPFGLS FT NSPASFSRFVADVFREQIRKGHMMIYVDDVIIPSATEEENIVVLKEVLAVG FT AENGVQFNWEKSQFLKSEVEYLGYVISGGGYKVSPAKIKAVKNYREPKSAK FT EVQRFVGLTSYFRKFIPGYALIARPLTSLLKKEAQFVFGETQRRSFEKLKE FT CLVSDPVLKIYDPDAETELHTDASKEGYGAVLLQKSDDGRFHPIYYMSKQT FT TSAEKNYSAYLLEVLAVVRAVERFRVYLLGVPFKIVTDCVAFKHTVKAKKL FT NPTVAKWVIALDEHKYHVEHRAGSRMKHVDALSRAPVMLTVSDPLVEMIKS FT AQQRDERLRAITELLKTQEFEDYFVNGDVLMKVVNGREVVVVPAELQGEVI FT KQAHENGHFGVRKLEEVIKSDYYIHQLTTKINRQIECCVKCILAEKKRGKT FT DGLLNPIAKGEVPFDTYHVDHLGPMDVTEKQYKYLFVVVDAFSKYVWIYPT FT KTTNAQEAIQRLRQQSELFGNPRRIVSDKGAFTSHDFKEYCAEQNIVHVEV FT TTGVPRGNGQVERVNQVILAMLTKLSVHDKTKWYKHVANIQRWINASFHQT FT TGVSPFEAMFGVQMRHEGDLRLSELLEEIKVAQFDDQRNEIRAKARESIEK FT AQEEQRRSYNLRAKQAPNYREGDVVAIKRTQFGPGKKYAAEFLGPYKVTKV FT KPNNRYDVEKICGEGPRKTSTAASHMKLYRVRSPEPLQDDR" XX SQ Sequence 4318 BP; 1149 A; 971 C; 1407 G; 791 T; 0 other; tgggggctca accacgattt cccgcgagtt gttgagctga gtgagagtgt ttccgcgtgc 60 caaatcgcga aaagttggca tttggctgat ggcgtcacca gagtccgcca ttttgcctga 120 gtgaagaagg aacgtgtgtg cgtgtgtgtg acgaacgaga gagttcacac ataccacagt 180 gagaaacgag actccgcgag tagtgagaga agagagagca tctccgcact cgtagtgcaa 240 gtgtgttagt gaggcgaacg atgtacgacc gggaaaagct tcaaaagcta acggtgaaag 300 agctgcgcga gctgtgcgcg aagaagaaag tgtcagtgag tggcagaaaa gcagcgctgg 360 tgaatcggtt gctggaaacg gttcgcgaaa gtgattccga gagcgacgac tcgttgtttt 420 cggcgctcga cgtgatgtcg tcaacgaaga aggaaaagaa gaacgcgacg gcggcgaaga 480 aagaggacgc gccgcgggca tcgcagttca ccttcaacga catcggagac tccctcgaga 540 agttctctgg ggaaagtgcc gacagtgatg tgtgcgagtg gctggaagaa tttgagaaag 600 cttgcgaaac gtttgggtgg aatgatgtgc agaagtttgt gtacggcaga cggctgctga 660 gtggcaccgc gaagcttttc gtgcagagtt cgaccggatt gaatgactgg ggcacgctga 720 agaatgccct ggaggaagag tttgaagtca aagtgtcaag tgccgagatc cacgagctgc 780 tgcgtggccg aaagaagaag agcgacgaaa gctttctgca gtacatctac cacatgcaaa 840 gcatcgcaaa gcgcggccac atcgacgaag aaaccgtgtg tgagtatgtg gttcggggca 900 tcgtcgatga tcccgtcaac aaagtaggcc tgtacggtgc acgcacactg gcagagctga 960 aagagcgtgt gaagcagtac gagaagatga agagccaaat gcgcgacgtg agccggagcg 1020 caaaagtgtt tgtgaagagc gagagagcga agaaaagtga gacgccgcga acggcgagaa 1080 gtgagaagag tgagagtgca cacacggcga gcgacggcgt tcgttgctac aactgtggaa 1140 gccgaggcca ctacgcaaac gactgcgagg cgaagtcgcg tgggccgaag tatttcgtgt 1200 gtagtgagta cgggcatcga gcgagagagt gtccgaagaa ggaggtgcga gagagaaagg 1260 aagactccga gctgcacacc gttgccgaaa cgaagatgcc agtggtggag attgcggtgg 1320 gcaattctct gctgcgtacg ctgttcgaca cgggtagccg ttacaacgtg ctgtgtgtga 1380 gtgcgcacaa gaaaatcggt gaaccgccga tgaatacgac gccgatgatc ttccgtggtt 1440 ttggtgctgc cgagacgcga gcgtgtggca cggtgacgat cgatgtgtgt gtcgacggcg 1500 agacgtaccc gagcatgatg ttttacgtcg tgccgactgg atcgatgtcg tacgatgcga 1560 tcctgggtat gcaggcgttg caccacctgg atgtgtgcgt gaaaccggac ggagtcgtgt 1620 tcaagaagaa gtgtgaggcg caggaggagg tggagagtga tgaggtcaat gacctgatgg 1680 cgttgacgag cggaaaaagt ggcgttactc tcgacgtggc gccgcggtat gctgctgaag 1740 tgggaaagct ggtcgcggat tacaagccga aaagtgaagt gagaagccga gtggagacgc 1800 gaatcattct gagcgatgag acgcccattc acatcgcgcc gcgacgattt gcgccgaagg 1860 agaaaaagat tctggagaag accatcgacg agtggttgaa ggctggaatc gtcaaggaaa 1920 gtgtgagtga gttcgcgagt ccagtgacgc tggccccgaa gaaagatggt tcgatgagag 1980 tgtgtgtgga ctaccgccag ctgaacaagc gcatcgtgaa ggactgcttt ccgatgagaa 2040 acatcgagga ccaagtggac aagctgaaaa gtgccaaagt gttcaccacc ctcgatttga 2100 agaactcctt cttccatgtg ccagtagaag tgtcatgcca gaagtacacg aatttcgtga 2160 cacacgccgg gcagtacgag ttcgagaaaa ccccgttcgg gctgagcaac agtccagcga 2220 gcttcagccg ttttgttgcg gatgtgttcc gtgagcagat ccggaagggg cacatgatga 2280 tctacgtgga cgacgtgatt atcccgtccg cgacggagga ggagaacatc gtcgtgctga 2340 aagaggtgct ggctgtgggt gcagagaacg gagtgcagtt caactgggag aagtcgcagt 2400 tcctgaaaag tgaggtcgag tatctgggtt acgtgatcag tggcggaggc tacaaagtgt 2460 cgcccgcgaa gatcaaagca gtgaagaact accgtgagcc gaaaagtgcg aaggaagtgc 2520 aacgttttgt gggactaacc agctacttcc gtaagttcat ccccggatac gcactgatag 2580 caagaccctt gacgtctttg ctgaagaagg aagctcagtt tgtgttcggc gagacccaga 2640 gaagaagctt cgagaagctg aaagagtgcc ttgtgagtga cccagtgctg aagatctacg 2700 atcccgacgc cgagaccgag ctgcacacag acgcctcgaa ggaaggttac ggagctgtcc 2760 tgctgcagaa gagtgacgac ggtaggtttc acccgatcta ctacatgagc aagcagacga 2820 cgagtgccga gaagaactac agcgcgtacc tcctcgaagt gctagctgtc gtacgtgccg 2880 tggagagatt ccgcgtttac ctgctgggag tgccgttcaa gattgtgacg gactgtgtgg 2940 cgttcaaaca taccgtcaag gcgaagaagc tcaacccgac tgtggcgaag tgggtgattg 3000 cgttggacga gcacaagtac cacgtggaac accgcgctgg gagcaggatg aagcacgtgg 3060 acgccctgag tcgagcacca gtgatgctga cggtaagtga cccgctagta gagatgatca 3120 agagtgcaca gcaaagagac gagaggttgc gagcgataac cgagctgctg aagacgcaag 3180 agttcgagga ctactttgtc aacggagacg tcctgatgaa agtggtcaac ggtcgcgaag 3240 tcgtcgtagt tcctgccgag ctacaaggag aggtgatcaa gcaagcgcac gagaacgggc 3300 actttggtgt gcgcaagctc gaggaagtga tcaagagcga ctactacatt catcaactca 3360 ccacgaagat caatcgccag atcgagtgtt gtgtgaagtg catcctagcg gagaagaagc 3420 gcggcaaaac cgatggtttg ctgaacccga ttgcgaaggg cgaggtcccg ttcgatactt 3480 accacgtcga ccaccttggc ccaatggatg tgaccgagaa gcagtacaag tacctgttcg 3540 tggttgtgga tgcgttctcg aaatacgtgt ggatctaccc gacgaagacg acgaacgcac 3600 aagaagccat ccagagattg cgccaacaga gcgagctgtt cggaaacccg agaagaatcg 3660 tcagtgacaa aggagcgttc acttcgcacg acttcaagga gtactgtgcg gaacagaaca 3720 tcgtacacgt ggaggtgaca acaggcgtgc cgagagggaa tggtcaagtc gagagagtca 3780 accaggtgat cttggcgatg ctgacgaagc tgagcgtgca cgacaagacg aagtggtaca 3840 agcacgtggc gaacatacag cggtggatca acgcgagttt ccaccagacc accggagttt 3900 ccccgtttga ggccatgttc ggagtgcaga tgcgtcacga aggcgacctg cgcttaagtg 3960 agctgctgga ggagatcaaa gtggcgcagt tcgacgatca gcgaaacgag atccgagcga 4020 aggcacgaga gtccatcgag aaggcccaag aagaacagcg acgatcgtac aacctgcggg 4080 cgaagcaagc cccaaactac cgagaaggcg atgttgtggc gatcaagcga acgcagtttg 4140 gccctggcaa gaagtatgcg gccgagttcc tcggtccgta caaggtgacg aaggtgaagc 4200 cgaacaaccg ttacgacgtc gagaagatct gtggggaagg cccacggaag acgagcacag 4260 cggcttccca catgaagctg tatcgggtcc ggagcccgga acctcttcag gatgaccg 4318 // ID Gypsy-13_OD-I repbase; DNA; INV; 7193 BP. XX AC CABV01000243; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_OD_; KW Gypsy-13_OD-LTR; Gypsy-13_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-7193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000243; Positions 13447 6255. XX CC Positions [3001-3435] - Reverse transcriptase CC Positions [4732-5229] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1007..2116 FT /product="Gypsy-13_OD-I_2p" FT /translation="MAPVPTKADLIAVLKYTQRKLAEADINSTDDAVKADI FT ARLKREETDLKECILDMAINGEDVKPTVVGDHGQINAKTMQQHLLMNFISN FT LSFTASNEESINNFLTKLEAIASSVPLIPFPEILASIKPVLPASVIKSLAG FT KTIVSHEDLKRHLLVLYGSSVSIYQRVESWMCKQKPSGKSWTKHCSDISGS FT LESIKMSYATLLRRRDESARSREQREAQPFAITYDDGFEMILFLKLLQDIA FT AADQSLHRTVSVELDAIKTPLCLASRVEQVKCQTLPLSAFTKAHAKPKQPQ FT KQQVEQDKSDKPATKGKGKKGKQNENQKKGFQHKEKSSIKQTVAGGSNFAN FT GNSQVTFALRDADQTDYDSVEEFASKN" FT CDS 2290..5781 FT /product="Gypsy-13_OD-I_1p" FT /translation="MKISVAKRLKLKIESTEQRISGVGSSNNRCVGQTVTD FT IKIGHHGIWKKTLIFLVPDEQLGIPLIIGRDPLHNRSTKISSNLENRTLFF FT RGRNGWCRVPFISDPLHENFDSKNDSHGSYNLVDLSTPDLLNSVNELGIDI FT NVNAPTDQVRQMAQLMLKHKSVFSCEARPIGNFSKFVARIDTEPNKTIHVP FT QYRLPVAFEKSISEEIEKLAKLKVIVPVEDNQGWNTPLGGVRKPDGSVRLV FT LNFKVTLNKILVRDDNFSIPDMEVQSRVPPGNVFFMSLDVSKGYWNIRVRE FT QDQIKLSIWWENRNWKFTRLPFGLKNAGNIFCRAIASALERMKYRTNVRVF FT IDDLLVFAPTFEVFSNAVDEALCLLNKAGFVVNPKKCSILYPECRWLGRLV FT SSEGMRADPENVSAIQRLESPTTYRGLQSLLGALNWIRSFASVKVGDNIAE FT ENFSHIIKPITELLKINTPRGKFTWTQEAEDSFVRVKAKLQDGSMVHFPDF FT KKPFILVTDASKLALGYVLMQPIANKCCIVRLNSKTLNPAQQNYSTTEREA FT LGIVWAVEDCKSFLRGVSFVIRTDHQALTFLDCKVPKNDKCARWANILSVY FT DFVVQYIPGADNHIADFLSRPDGKGAGPVKRTDDDVKLAGRFEKFHQWQIY FT IPSWVKPGSPPKTFLNGDDLTIPEDFVAALSRSEPSICVKMLAKVASAQFD FT DPVVSNLLDAIDREYTPKQDEQCEETKWLYNRRKNFSRCESTGALSVNDML FT YIPPSLRPEVLQSYHDSCAHAGANRMMNLTSHLTWPRKAHDIKNYVNSCPC FT VHRKGGRGQSHQPKIRPTVRGVTIFEKILVDFIDMPPSRAGFRYCLTILDT FT FSRYLIVVPSRKHRAIDAVNILRDQVFSTFPRPKIISSDRGSHFTSNLNEE FT FAKEHNTKWKFHCAYHPQSCGALEVQHRVLKDSIFVAVHSQNKDWPTVLPE FT VVRILNSLPNAATKTSPFEMVFGVKPDLSEFDLPRETAEVTPSEYLKKKAA FT DRKLLYERVSMCQEKTDATRQRHDGPEIVATDIEAGSQVLIYRPLSATSKR FT SKLRWIGPYNVVRSNGSVVEIVDAEGKHDFIHRSQVQPFKARPKRLGPLPN FT FPDLRVPLRRTGVDNVKEPPVVLNPPVDVDMRDLSLPNERRRSNQPSSPRK FT LWNRR" XX SQ Sequence 7193 BP; 2083 A; 1975 C; 1480 G; 1655 T; 0 other; gaagggattt acttccgatt atccgattcg agagccgtat aactacattt agttggtcat 60 tgcgcgtcat ttcttgatca ttattctcgt ttattcggtt actatgaaca attcgagatt 120 ttttggcact gtttttcaag aaataacgaa aattaagtag cttttttcga gtaatcatag 180 tttttttacc atttcagctg gctgaaatcg ttaatttttg taatttttga aagaaaagct 240 gcataatttc accgttttga gccgtaaacg ttaaaataac cgatttttga gaaaagcgcg 300 cccagcccgg gcgctgctat cataatctag ggttaattcc tgggtcatgg gagcgattgc 360 gacacgcaac ctcaaccgat tttcaaaagt cgacttcgac tattttcaaa tttcaaaata 420 ttttcaaagc tgtgagtcac cagtgactca tactcatccc aaggcttacg gtagccccgc 480 aaggggtacc taagggacca aaattgaaaa ttcttcgaaa attgtgactc ggtcaacttg 540 accccaagtg accccgagcg cccccaaggc gccccggggc accccaaaat gactcaaaat 600 cataaattta ctgttttaag ttcaaaattc acggaatttt tgaaaaatcg attgccataa 660 atcacgttcg cattaatatc ttcaaaattg agatcaactt cctgaaactt gtatttttct 720 ggtatgaaaa actttctcgc gcgctattta tgaaactgaa aatgtcatca ctgcagtcgg 780 ttgccttaat agatagctga gtccacgcaa gtgctcacta ttctaggcta taaaatcctc 840 actgtaagtc atttgaaatt cataaatatg caaccgtgat agaaacatac tagaatttta 900 tgagtcgatg aatgtctcac ccccaatcaa attaatgcgt tctaagaatt agttcaaatg 960 caatataggc atgtaagtga gcttgataaa gacaaacctg ctcaccatgg caccagtgcc 1020 aaccaaggcc gacctcatcg ccgtactcaa gtacacccag cggaagctgg ccgaagctga 1080 catcaacagc accgatgatg ccgtcaaggc cgacatcgcc cgtctgaaga gagaagaaac 1140 cgacctcaag gaatgcatcc tcgacatggc aatcaatggc gaagacgtca agcccaccgt 1200 cgtcggcgat catggtcaaa tcaatgcgaa aaccatgcag cagcatctgc tgatgaactt 1260 catttcaaac ttgtcattta ccgccagcaa cgaagagtcc atcaacaatt tcctaacgaa 1320 attggaagcg atagcctcaa gcgtcccatt gattccattc ccggaaatcc tagcttcaat 1380 caagcctgtg cttcctgcga gcgtcatcaa aagccttgcc ggaaaaacca tcgtttccca 1440 cgaagatttg aagcgacatc ttctcgtgct ctacggatca agcgtctcca tctaccaaag 1500 agtagaatca tggatgtgca agcagaaacc atccggcaaa agctggacta aacattgctc 1560 cgacatctcc ggcagccttg aatccatcaa aatgtcttac gctaccttgc tgagacgccg 1620 tgatgagtca gcccgatccc gagaacaacg agaggcacaa cccttcgcaa tcacgtacga 1680 tgatggcttc gaaatgattc ttttcctcaa gcttctgcaa gacatcgcag ccgcggacca 1740 atcacttcat cgaaccgtct ccgtggaact cgacgccatc aaaacgccgc tgtgcctagc 1800 atcgcgtgtg gagcaagtca agtgccaaac cctacctctt tctgcattca ccaaagctca 1860 tgctaaacca aagcagcccc agaaacagca ggtggagcag gacaagtccg acaagcctgc 1920 aacgaagggc aagggcaaga agggaaagca gaacgaaaac caaaagaagg gcttccaaca 1980 taaagagaag tcgtctatca agcaaactgt cgccggcggt tccaacttcg cgaatggaaa 2040 cagccaggtc accttcgccc tgcgcgatgc cgatcaaact gattacgaca gtgtggaaga 2100 atttgcatca aaaaactaag ctcagtcgat ggtaagcagt catccaccga cactttgttt 2160 gctgcagcag actgcatttc cttccccgaa ctcacaacgt ctgattcgcc gatttacatt 2220 ccgatccagt ttccgctcca aagtaaatgg atttccggtc tgtacgatcc tggttcattc 2280 gctaccatta tgaaaatttc cgttgccaaa cgactcaaat tgaaaatcga atctacagaa 2340 cagcgaatca gcggcgttgg ttcgtccaat aaccgatgcg tcggacaaac cgttaccgac 2400 ataaaaattg gacaccacgg catatggaaa aagacactca tctttcttgt tccagacgag 2460 caattaggca ttcccctaat tatcggccgt gacccactgc ataaccggag caccaaaatc 2520 tcgtcgaatt tggaaaatcg caccttattc tttagggggc ggaacggctg gtgtcgcgta 2580 ccattcattt ccgacccctt gcacgagaat tttgacagca aaaacgacag tcacggttcc 2640 tacaacctgg tcgacctgtc cacccctgac ctcctcaaca gtgtcaacga gctcggcata 2700 gacatcaacg tcaatgctcc aaccgatcaa gttcgccaga tggcgcaact aatgttgaaa 2760 cacaaatccg tcttctcctg tgaagctcga cccatcggaa acttctccaa gttcgtcgca 2820 agaatcgaca ccgagccgaa caagaccatt cacgtcccgc agtaccgact tcccgtagct 2880 tttgaaaaat caatctcgga agaaatcgaa aaactcgcaa aactgaaggt cattgttcca 2940 gttgaagaca accaaggatg gaacactccc ttgggaggcg tccgcaaacc cgacggaagt 3000 gtacgtctcg ttctgaactt caaggtcacc ctcaacaaga tactcgtccg cgatgacaac 3060 ttctcgattc ccgacatgga agttcaatct cgcgttcccc ctggtaacgt tttcttcatg 3120 tcgctggacg tatcgaaggg atactggaat attcgcgtac gagaacaaga tcagatcaag 3180 cttagcatct ggtgggaaaa tcgcaattgg aagttcacac gattaccgtt cggtctcaag 3240 aatgctggta acatcttctg tcgtgccatt gcttccgctc ttgaaagaat gaaataccga 3300 accaacgtcc gagtcttcat cgatgatctc ctcgtcttcg ctccgacctt tgaagtattc 3360 agcaacgccg tcgacgaagc tctatgtctt ctcaacaaag ctggcttcgt cgtcaacccc 3420 aagaaatgca gtattctata tcctgagtgc cgatggcttg gaagactcgt cagctcagaa 3480 ggcatgcgtg ccgatcctga aaatgtctca gccattcagc gtctcgaatc accaacaacg 3540 tatcgtggcc tgcaaagtct tctcggagcc ctcaattgga tccgcagttt cgcgtccgtc 3600 aaggtcggcg acaacatcgc tgaagaaaat ttcagccaca tcatcaagcc aatcacggaa 3660 ctgctgaaaa tcaatactcc tcgtggaaag ttcacctgga cgcaggaagc cgaagacagc 3720 ttcgtccgtg tcaaggctaa attacaagac ggctccatgg tacacttccc cgattttaag 3780 aagccgttca tactggtgac tgatgcgtca aaactcgccc ttggttacgt tctgatgcag 3840 ccaatcgcta acaaatgctg cattgtccga ctgaactcaa aaactctgaa tccagctcag 3900 cagaactact caaccaccga gcgcgaggca ctcggcatcg tatgggccgt cgaagactgc 3960 aaatcctttc ttcgtggcgt tagcttcgtt atacgaacag atcaccaagc actaacgttt 4020 ctcgactgca aagttccgaa aaacgacaag tgtgcccgct gggcaaatat tcttagcgtg 4080 tacgatttcg tggtacagta tatacctggc gccgacaatc acatcgccga ttttttatca 4140 cgaccagatg gcaagggcgc tggtcccgtc aaaagaactg atgacgacgt aaaactcgct 4200 ggcagattcg aaaaatttca tcagtggcaa atctacattc cttcatgggt caagccaggc 4260 agtccaccaa aaactttcct gaacggcgac gacctcacta ttcccgaaga tttcgtagcc 4320 gccctctcca gaagtgaacc ctcaatctgc gtcaaaatgc tagccaaagt cgcgtccgcc 4380 cagttcgacg atcctgttgt gtcaaactta ctcgatgcca tcgaccgtga atacacacca 4440 aaacaggacg agcaatgcga agaaacgaaa tggctctata atcgtcgtaa aaatttcagc 4500 agatgtgaat cgaccggagc gctttccgtt aacgatatgc tatatatacc accgtcactt 4560 cgcccagaag ttcttcagtc ctaccatgac tcatgcgctc acgcgggagc aaatcgaatg 4620 atgaatctga cgtcacacct gacttggcca cgcaaagctc acgacataaa aaattacgtc 4680 aactcatgcc cgtgcgtcca ccgaaagggc ggccgtggtc agtcccatca accgaaaatc 4740 cgtccgaccg tcagaggtgt aaccatcttc gaaaaaatct tggtagattt catcgacatg 4800 cctccgtctc gagccggctt ccgatattgc cttacaatac tagacacctt cagcagatac 4860 ctgatcgttg ttccttccag aaagcaccgc gcgattgacg ctgtgaacat cctccgtgac 4920 caagtattct cgacatttcc tcggccaaaa ataatttctt cggaccgggg ctctcatttc 4980 acgtctaacc taaatgaaga gtttgccaaa gaacacaaca caaaatggaa atttcactgc 5040 gcctatcatc cccaaagctg tggcgcactc gaagttcagc accgtgtcct taaggacagc 5100 atcttcgtgg ccgtccactc tcaaaataaa gactggccaa ccgttttacc agaagtggtc 5160 agaatactta attcattgcc gaacgcagct acaaaaacct ctccgttcga aatggttttc 5220 ggagtcaaac cagatctcag cgaattcgat cttcctaggg aaaccgcaga agtaacgccc 5280 agtgaatatc tcaagaaaaa agctgctgat cgaaaactcc tttacgaaag ggtctcaatg 5340 tgtcaagaaa aaactgatgc cacccgtcag cgtcatgacg gaccagaaat tgtggccacc 5400 gacattgaag ccgggtccca agtcttgatc taccgccctc tcagtgcaac aagcaaacgc 5460 agtaaacttc gatggatcgg accatataac gttgtacgct ccaatggctc cgtcgtcgag 5520 attgttgacg ctgaaggcaa gcacgacttc atccaccgat cacaagttca acccttcaaa 5580 gctcgtccga agcgccttgg cccactccct aacttccctg atctcagagt tcccctccga 5640 agaacaggcg tcgacaacgt caaggaaccg cctgtcgtac tcaatccacc agtcgatgtg 5700 gatatgcgcg acctctcctt gccgaacgaa cgacgaagaa gtaatcaacc cagcagtccc 5760 cgaaaactct ggaacagaag gtgaccccga ctccgagcag cttgaagaaa ctgacaacgc 5820 cgatggaaac aatgaagagc aagtcacccg cccgacccga ccctctactc cgacgaatga 5880 agctgagcta gatgaaattt tcagcacacc gctgtcacaa cctgcaaacc cgttcaagac 5940 tccagctaat acaaaaactg gttccctgcc tggaaagtcc gtcaacgctc ccagaatgac 6000 acgttcacga tcccgacttc tcaacgacga aagcgctgac cgcgatctcg ccaagaaaat 6060 tcacagtgaa atcaacaagc cagctacacg atcttcgtct cttcctccac agacaaaacc 6120 ctctgtcaaa cctcccttca aaatttgaag gccatactca ctcacggtta acgtgcgttg 6180 ggagcatgga ttactgtgct aaacacaata aaaacactcg ttaaacgtaa tcttatgccc 6240 tttctaatct attttataat ttttaaactc attagaatat ctagaaattc ttaagaaaag 6300 cagtgccaat ttttctcaaa aaatagttga cagtgcccct taggaccctt aggactctta 6360 gttaattttt ctatagaatc cgcaaaatgg ctggtcaact catcatcgac agccctgatt 6420 ccgacatcaa caatggtgcc atcgtcgcca acattgaagc caaaagtccc gaagttgatc 6480 ccatcatcac agccgtggac aacgttgaag tcagcgtcaa cgatgatgtc aaccctgcca 6540 tggacgtcga tcccatattc gatgcggcat cccctggtcc cctcgtgatc aacgaagacg 6600 tcgcctccac cagcactggt cccgtcatct ctggcgagga catcgtagcc atcttcgatg 6660 ctgcagccaa tggtcagaac acttcaaccc tcatcggaca actcaacccc cttcacactt 6720 ccactccgat gaccgctcga ccgactgatg tgcaggagtc tgatgagatg ttcaacctga 6780 tggtcaaaaa caagaagaaa aagcgacgag gagctcagct caaccgagtg cagcgcaaaa 6840 agaaccgact tgcagccctt gagcgaaatg gtatctctgc tgcgttggct gaagcaactc 6900 cagtgatgaa aaacagatcg accttcatgt gcatcgaaat cttagaaaaa attcagaagc 6960 ttgcagtgaa acgtaaaatc gaccgcaacc gacctgagtt ctggcgcacc aagcacggaa 7020 ttgacctggt cgacatgatg tgcaagtacg cccgtggacc tcgacgtgaa cttgcttccc 7080 gacaagtcgt caactggcgc ctttcagctg agcacaaggc cgacaatttg ggaagactct 7140 tctggaactc cgacaacgac gccaaagctg atgtataatc ggctgggagc atc 7193 // ID Gypsy-28_DWil-I repbase; DNA; INV; 4226 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_DWil_; KW Gypsy-28_DWil-LTR; Gypsy-28_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 1786005 1781780. XX CC Positions [1766-2308] - Reverse transcriptase CC Positions [3326-3802] - Integrase core CC 'TACA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1439..2815,2819..4207) FT /product="Gypsy-28_DWil-I_1p" FT /translation="MCRMMLYSAGDKTSEDNQLSIFNVTETDSDIDVPSDF FT ENDVRAMINDYKAVEYQNVEERTQNAAGLVAESTDAVSLQIVPDGVILPFR FT QAPSRLPAIEAAAVKQQVDEWLNAEIVRRSSSNFASRVVVVKKKDGCYRIC FT VDFRKLNSMVMKDGFPVPIIDDVIEKLQAAKFFTVMDLENGFFHVPIEETS FT KKYTAFVTKEGLFEFNKAPFGFCNSPAVFIRYVTNVFQPLLNADVLDLYMD FT DIIIHATTAEECLHKMKKIFDTAAAFGLKIKWKKCHFLKTTVDFLGHTIYD FT GQIKPGQEKTKAVRKFKVPENIKAIQGFLGLTGFFRKFIKGYSIIARPLTD FT LLREDVDFRMDAVALKAFEELKRALTVEPVLKLYDRNANTEIHTDASKMGF FT GAAFLQWKDGQLHPVWFWSKKSLETEARQSSYILEVKAVYLACIKFRQYVL FT GIKFKVVTDCAAFKTINKKDVPREVALWIVYLQDFNFDVEHRAGERLKHVD FT CLNRYPQGIMIIRSETSALFKKAQQEDTMSKALIELLQNQPYEDFKLKGGL FT LYKTVGGNELLVVPKQMEKEIIVEAHNVGHYAMQKTMHAVQQSYWIPHFES FT KVARIINNCVKCIIYNKKFGKKEGLMQGIDKGDRPLETIHVDHLGPLDATS FT KQYKYIFAMVDAFTKFVWLYPTKTTSCEEVLRKLKEWSAIFGNAVRIISDR FT GSAFIAKDFEEYIKSNGIHHVWTTTGVPRGNGQIERVNRSIITIISKLSAD FT EPVKWFKYVPNVQRAINSHVNVSRKRSPFELMLGVQMNNGPENQILKLLQE FT EVCDKFDEERQSMRASAKREITKAQEAHKKQYDSKRKIDNVYKVADLVAIR FT RTQFVAGRKLASEYLGPYEVIAAKRCGRYDVKKAAADIEGPMLTATSSDNM FT KYWRNNQDSDELSSGSDD" XX SQ Sequence 4226 BP; 1432 A; 701 C; 1010 G; 1083 T; 0 other; tttggacctc gtgctatcgg ttaatataaa gacaataata aattgtgaaa tggtttttaa 60 aagttggtaa aaatcattta aagttgtaag ttgctattgg aaaattcccg agaacagttg 120 taaaattaaa agagttgttg ttgttgaaca aaatcggtta tgacgaaaac gaagcgtact 180 ccgacaaaaa acactgacaa aaaacgcgcg gtcgccgaaa tgtcgcgaca aaacgagacc 240 ttggaaaaag ctgacgaaat tgaaatacaa gaagagaagc aagaagttgg atgcataaca 300 agagccagag aagtacgcga agcgttgatg ctggatttgg tcgaaaaacg agaagagaac 360 acgaaagaga acaatatgga agaagagaac aacacgatgt tgaaaaattt gagttgctga 420 cgctgaaaat agaagctgac gaagtgaaat agggtgactc aggcatttaa gcagacgatt 480 ttgctaaagt ggtacccgac tatgatggac aaagtatgcc tgtggagatg tggttcaata 540 attttgcact gaatgctgag gcctatggac taactatgaa gcaaaaatat gttcaggcac 600 gatctaaaat gactaaagtt gcaaagttat ttttagattc aactccagta cgcagttatg 660 atgatcttcg tgtccagttg gaaaacgaat ttggcatgga cgacgtatgt agcgctaata 720 tacatcagaa gcttacgtga gcgtaagatg cagaaagagg aaacctttcg ggagtacatg 780 tttcatatga agtcaatcgc agcagaaggc aatatagata tgaaatcggt actgcgctat 840 atagtagacg gtttgaacat gaagaacgac ttaaagatca acttatattg cgcccagtca 900 ttcagagagc taagagaaaa atatgaaatc tatcagcacg caatggaaat ggcaaagatc 960 gatgtgccac aaggtctgaa gaaagttagc cagacaaact acgagaaaaa acctcgttgt 1020 tttacttgcg gatcgagcga gcatcttaga aaggaatgca atgtaacaac aaaatgtttt 1080 cgctgcaatg gtgagggtca tatctcaaag aactgtccaa agaatgcgaa tattggtgca 1140 gtgtataatt cgaagccatt ggagactgca aacattgggg tagtataaaa ttcaaaacgc 1200 ataaaggaag tggacataaa aggttgcaac attaacgctt tggttgacac aggagcggat 1260 gtgtctttgg tgtccagaag cgctgtagag aagattggag ttaaaagtct gcagaaagct 1320 gtcaagaatt tggttggtct aggcggaaaa gttactcatc cgcttggtac ttttcaagct 1380 gaggtcaagg tggacgacgt ccagacagtc caaaaatttg ttgtggtcgc aaacgaagat 1440 gtgccgtatg atgttgtact cggctgggga taaaacttct gaggataatc agttaagtat 1500 ttttaatgta acagaaaccg attctgatat agatgtaccc tcagatttcg aaaatgatgt 1560 gcgagcaatg atcaacgact acaaggccgt tgaataccaa aacgttgaag agagaaccca 1620 gaatgctgca gggctagtag cagaatcaac tgatgcagtt agtctacaga ttgtcccaga 1680 tggtgttatc ttgccgtttc gccaagcacc aagtcgtctg ccggctattg aagctgcagc 1740 agtgaagcaa caagtcgacg aatggctcaa tgctgaaatc gttcgccgat cgtcatccaa 1800 ttttgccagt cgagtcgttg tggtgaaaaa gaaagatggc tgctatagaa tctgcgtgga 1860 ttttcgaaag ctaaacagta tggtaatgaa ggatgggttt ccggttccga taattgacga 1920 tgtcatagag aagctgcagg ccgcaaagtt ttttacagtc atggacttag agaacggatt 1980 ttttcacgtg ccgattgaag aaacaagtaa aaaatacaca gcatttgtga caaaggaggg 2040 attattcgaa ttcaacaaag ctccatttgg attctgtaat tcacctgcag tatttatccg 2100 atatgtaaca aatgtttttc aaccgttgtt aaatgctgac gtgctggatc tgtatatgga 2160 tgatattata atccatgcaa cgacagcaga agaatgtctc cataagatga agaaaatttt 2220 tgacacagca gcagcgtttg gacttaaaat taaatggaag aagtgtcact ttctaaaaac 2280 cactgttgat tttttgggcc atacgatcta cgatggacaa attaagccag gacaagagaa 2340 gaccaaagct gtcagaaagt ttaaagtacc agaaaatata aaggcaattc aaggatttct 2400 gggcttaact ggattttttc gcaagtttat taaggggtat tcaattattg ctcgaccatt 2460 gactgatctt ttgcgtgaag acgttgattt caggatggat gcggttgcac tcaaggcatt 2520 tgaagaatta aaaagagctc tcactgtcga accagttctg aaactgtatg atcgcaatgc 2580 caacacagaa attcacacag acgcgtcaaa aatgggattt ggagcagcat ttttgcaatg 2640 gaaggatggc cagttgcatc cagtgtggtt ttggagcaaa aagtctttgg aaactgaggc 2700 tcgacagtct agctacattt tggaagtgaa agcagtatac cttgcctgca taaagtttag 2760 acaatacgtt ttgggcatca agtttaaagt tgttactgat tgcgcagcct tcaagtagac 2820 aataaacaag aaggatgttc ctagagaggt ggcgctatgg attgtctatt tgcaggattt 2880 taacttcgat gtggaacata gagctggaga gagacttaaa catgtggatt gtctcaatcg 2940 ctatccacaa ggaataatga ttatacgatc ggaaacgtcg gcactattta aaaaagcaca 3000 gcaagaggat acaatgtcaa aggctctgat tgaattgttg caaaatcaac cctatgaaga 3060 ttttaaatta aaaggcggtc ttttatacaa aacggttggc ggtaatgaac ttttggttgt 3120 cccgaaacaa atggaaaaag agatcattgt tgaagcccac aatgttggtc actatgcgat 3180 gcaaaagaca atgcatgcag ttcagcagtc ttattggata ccgcattttg agtcaaaagt 3240 tgcccgaatt ataaataact gcgttaaatg cataatttat aacaaaaagt ttggcaaaaa 3300 ggaaggattg atgcagggga tagacaaagg agatagacct ttagaaacga tacatgtgga 3360 tcatcttggg cctttagatg caacatcaaa gcaatataag tatatatttg ctatggtcga 3420 cgccttcacg aaatttgttt ggttatatcc aacaaaaaca acaagttgcg aggaagtact 3480 acgaaagtta aaagagtggt ctgctatatt tggtaacgct gttcgaatta tcagtgacag 3540 aggatctgcg tttatagcta aagatttcga ggaatatatt aagtcgaacg gaattcacca 3600 tgtttggaca acaacgggcg ttccaagagg caacggtcag atcgaacgtg taaacagatc 3660 tatcattacg attatttcca agctgtctgc ggatgaaccg gttaaatggt ttaagtatgt 3720 gccaaatgtg caaagagcta ttaactctca tgtcaatgta tccagaaaaa gatcaccatt 3780 tgaactaatg ctgggtgttc aaatgaacaa tgggccagag aatcagattc ttaagttgtt 3840 acaggaggaa gtatgcgata aattcgatga agagagacag tcgatgagag cttctgccaa 3900 acgggaaatt actaaagccc aggaagcaca caagaaacaa tatgattcaa aacgcaagat 3960 tgataacgta tataaagttg cagatttggt agcgattcgg cgtacacagt tcgttgcggg 4020 tcgcaaactt gcgagtgaat atcttggacc atacgaggtt atagcagcta aacgatgcgg 4080 aagatacgat gttaagaaag cagcagcaga tatcgagggg ccaatgctca cagctactag 4140 ctcagacaat atgaagtatt ggcggaacaa ccaggattcg gacgagttgt catcggggtc 4200 ggatgactaa gatcaggatg gccgaa 4226 // ID Copia-132_AA-I repbase; DNA; INV; 3592 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-132_AA_; KW Copia-132_AA-LTR; Ty1_copia_Ele73; Copia-132_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3592 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1489-2028] - Integrase core CC 'GGGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 61..3582 FT /product="Copia-132_AA-I_1p" FT /translation="MEEEKAERVFLPLFDGTNFAAWKLRMLILLEEHELVE FT CVETYAADVPELQEVDGDSEAVKKTKELARERRKKMDRKCKSLLVSRIHDS FT QLEYIQDKPAPKDIWNALRRVFERRSIASRMHLKRQMLTMRFESGSLQQHF FT LRFDKLVREYRATDAVLEEIDVICHLLLTLGSAYATVVTAIETMPEENLSL FT EFVKCRLLDEETKRKGVEMCSPAASGEAAFSGSKKPPKTYRMKCFGCKQEG FT HKLSECPMKKKNSNKSEKSKAHLAEQGGVCFVGCSRGSEIQCGSRKVNWFI FT DSGCSDHLVNDKSLFDDLKVLDSPIEIAIAKNEQCIVAKHSGTVRVYSEVN FT GKQIECTVKDVLYVPELRCNLFSVMRVDDAGMKVIYEKGKVRIMRGSEIVA FT TGARVGRLYGLNFSTVRESVNESLLTCGRIPKSLELWHRRFGHLNARSLEK FT LIRDEMVTGLKVDGAVKTKDMVVCEPCVVGKQTRKPFAVRDGKRSSRVLEL FT VHSDVCGPVTPVGVLGVKYFVTFTDDWSHFAVVFLITSKDEVFECFQTYEA FT LMSAKFERKIHRLRCDNGGEYKSKQFERFCKSKGIQVEWTVPHTPEQNGVS FT ERLNRTLVEKARSMLEDSGADKRFWGQAIQTAAYLLNRSPSSAIAPNVTPY FT ELWEGCKPDVSKLRTFGCGVYVHVPKEQRTKLDAKAWKGIFLGYANNGYRV FT WNPKAGKIVHARDVDFLEASEAKLQVDRVLPNDVFSVPKRKEQNDDSDDEE FT EASSVSDGESNEEFDSFRDDGSSDPGEGTVAVDGTGSSGEADRSSGETGRP FT QRNRKVPPWHRDYEIDYAAYALNATSYVEDLPNSIADAKRRSDWNLWQAAI FT IEEMDSLKKNGTWKLTKLPENRTPITNKWVFKVKRGVDGAPDRYKARLVAR FT GLSQKYGIDYSETYSPVAKLDTLRTVLALANQEKLVVHQMDVKTAFLNGTL FT EEEIYMVQPEGFEQGRELVCRLEKSIYGLKQASRAWNERFHNFVEERLKFK FT RSENDQCLYTRRIGSEKLIIVLYVDDILIAGSLKAVKMVKDLLSREFEMTD FT MGNIRSFLGMKIDYDLEKGVMWLIRLLEDLWWKPKEATTFYEDNQSAMRII FT ENSKDFGRLKHVDVKFHFLRDLVKQKRIRLRFLTSSDQPADMMTKGLPVAA FT FQRHRAAIGLSSCSG" XX SQ Sequence 3592 BP; 973 A; 694 C; 1087 G; 838 T; 0 other; ggttttgggc ccagtaatcg cgtggcgcga gtgtaaccgt ttcgtaaaaa gttgttcgcg 60 atggaggagg aaaaagccga gcgtgttttc ctgccgcttt tcgacggcac gaattttgca 120 gcatggaagc tccgcatgct aatactcctg gaggagcatg agctcgtgga gtgtgtggaa 180 acttatgctg ctgatgtgcc ggaactgcag gaagtggatg gcgatagtga agcagtgaag 240 aagacaaaag aattggccag agagagaagg aagaaaatgg atcgcaagtg taaatcgttg 300 ctagtgtcgc gtatacacga ttcgcagctg gaatatatcc aggacaaacc ggctccgaag 360 gatatttgga atgcgttgcg ccgtgtgttt gagcggcgca gcatcgcgag caggatgcat 420 ctaaagcgtc agatgctcac aatgcgtttc gagagtggtt cgcttcagca acattttctt 480 cggttcgaca agctagtgcg cgaataccgg gcaactgatg cagtgctgga ggagatcgac 540 gtgatctgcc acttgctcct cactcttggc agtgcgtatg ccacagtggt taccgcgatt 600 gagacaatgc cggaagaaaa cttgtcgctt gagtttgtca agtgccggct acttgacgaa 660 gaaacaaagc gtaaaggtgt cgaaatgtgt tcccccgccg cgagtggtga agctgctttt 720 tccggctcga agaagccacc gaaaacgtac agaatgaagt gtttcgggtg taagcaggag 780 ggacacaagt tgtcggaatg ccctatgaag aagaagaata gcaacaaaag tgaaaaatcg 840 aaagcgcatc ttgccgagca gggtggtgtt tgttttgttg gctgcagcag gggttccgaa 900 atacagtgtg ggtcgcgaaa agtgaactgg ttcatcgatt caggttgttc ggaccacctg 960 gtgaacgata aatcgttgtt cgatgatttg aaggtgttgg acagtccgat tgaaatagcg 1020 atcgcgaaaa atgaacagtg tattgtggcg aaacattccg gaacggtccg tgtgtactcc 1080 gaagtcaacg ggaaacaaat cgaatgcaca gtgaaggacg ttttgtacgt gccagaatta 1140 aggtgcaatt tgttttcggt gatgagagtg gacgatgcgg gaatgaaagt gatctacgag 1200 aaaggcaaag tgcgaattat gcgtggatcg gaaatcgtcg ccactggtgc gcgtgtcgga 1260 cgattgtacg gtttgaattt ctcaaccgtc agagaaagtg tgaacgaatc gttgcttacg 1320 tgcggtcgaa ttccgaagag tttggaattg tggcatcgtc ggttcgggca tttgaatgca 1380 agaagcctcg aaaaactgat acgcgatgaa atggttaccg gtttgaaagt ggacggcgcg 1440 gttaagacta aggacatggt agtgtgcgag ccgtgcgtcg ttggaaagca aactcggaaa 1500 cctttcgcgg tacgtgatgg aaagcgttcg tcgcgtgtgc ttgaactggt ccattcggat 1560 gtgtgtggtc cggttacgcc ggttggagta ttgggagtga aatacttcgt aactttcacc 1620 gatgactgga gtcattttgc cgtagtgttc ctgatcacat cgaaggatga agtgttcgag 1680 tgtttccaga cgtacgaggc gttgatgtcg gcaaagtttg agcgtaagat tcaccgtctg 1740 cgttgcgata atggtggtga gtacaaaagc aaacaatttg agcgtttttg taagtccaag 1800 gggatccaag tggagtggac tgttcctcac acacctgaac agaacggagt gagcgagcgg 1860 ctcaatcgaa ctctggtcga gaaggctcga tccatgctag aagattcagg agcagataag 1920 cggttctggg gacaggcgat ccaaaccgct gcgtatttgc tgaatcgtag cccgtcgtcg 1980 gctattgctc cgaacgtgac accgtatgag ctgtgggaag gttgcaagcc agatgtaagt 2040 aaacttcgaa cgttcggctg tggtgtgtat gttcacgtcc cgaaggagca gcgaacgaag 2100 ctggatgcaa aagcgtggaa ggggattttc ctgggatacg ctaacaatgg ttatcgcgtg 2160 tggaacccca aagcaggaaa aatcgttcat gcccgagatg tagatttcct tgaagctagt 2220 gaggcgaagt tacaggtcga cagagtattg ccgaatgatg tcttttctgt tcccaaaaga 2280 aaagagcaaa acgacgatag tgatgatgag gaagaagcga gttccgtatc cgatggtgag 2340 tcgaatgaag agttcgacag tttccgtgat gatggttcat cggatcctgg cgaaggaacg 2400 gtggcggtcg atggcactgg atcaagcgga gaagcagatc ggagcagtgg tgaaacgggt 2460 agaccacagc ggaatcggaa agttccgccc tggcacaggg actacgaaat cgactatgca 2520 gcatacgcgc tgaatgcaac aagctacgtg gaagacctcc cgaattccat agcagatgcg 2580 aagagacgaa gcgattggaa cttgtggcag gctgctataa tagaggaaat ggattccttg 2640 aagaagaacg gaacatggaa attgacgaag cttccggaga accgtacacc gattaccaac 2700 aagtgggttt tcaaggtgaa gcgtggcgtt gatggagcac cggaccggta caaggcacgg 2760 ttggtggctc ggggattaag ccaaaagtac gggatagact acagcgaaac gtattctcca 2820 gttgcgaagc tggatactct gcgcactgta ttggctctgg cgaaccagga aaagttggtc 2880 gttcatcaaa tggacgttaa gacggccttc ttaaatggaa cattggagga agagatctat 2940 atggtgcaac cggagggctt cgagcaagga agagaattgg tatgtcgtct ggagaaatcc 3000 atctatgggc tgaagcaggc atcgcgcgcg tggaatgagc ggttccacaa cttcgtcgaa 3060 gaaagactca agttcaaacg gagcgagaat gaccaatgcc tgtacactcg aaggattggt 3120 tcggagaagt tgatcatcgt tttgtacgtt gacgacattc tcatagctgg ttcgttgaag 3180 gcagtaaaga tggtgaagga tctcttgtct cgcgaatttg aaatgaccga tatgggtaac 3240 atcagaagct ttttgggaat gaagattgac tacgatttgg aaaaaggagt aatgtggtta 3300 attcgtttgc tagaagatct gtggtggaaa ccgaaagaag caacaacgtt ctacgaggac 3360 aaccagtctg cgatgcggat tatcgagaat tcgaaagatt tcggacggct caagcacgtg 3420 gacgtgaaat tccatttctt acgagatctg gtgaagcaaa agcgtattag acttcggttc 3480 ctgacttcgt cggatcagcc agctgatatg atgaccaagg gtctaccggt ggcagctttc 3540 caaagacatc gagctgctat tggtctatcg agttgcagcg gttgagcagg gg 3592 // ID DNA-TA-4_AAe repbase; DNA; INV; 629 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-629 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1273-1273 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. TA TSDs. TIRs are 24 bp long. XX SQ Sequence 629 BP; 217 A; 123 C; 100 G; 189 T; 0 other; ggggaagggg tggtaaaatg aacaccttaa ggatatcatc ttgtttctga tagaaaaacg 60 aggaatatgt atagttttat cgcacaagat caaaaacagg gatgttagct aaagcaggaa 120 cacgaattca gaccaaaata gctaaatttt cacatatttt tatcgctagc aaaaaacaat 180 tgaaatgttc attttacctg cactatgtgg gtaaaatgaa cagctgttgg tggtaaaatg 240 aacatcatgc aaaaaacgtg agcaaaacta atatttttaa caattttcgc tgcattcccg 300 aaaatatatc tataaatgat tgtaacccat tcaaaaagaa atgtaaaagt ttcagatttg 360 acatcacatc acaacgatat tcccatcact gaaaaacctc aagacctccg agccaaaagt 420 tacgtcagct ccaattttca aacgtggttt tctaaaatct tagttttcgt tgatattttc 480 acttcaattg acctccgtat caatccgaac actcggatat atgctctaaa tcgtccgtgc 540 gtgataaaaa ttcattgaaa tttgtttttt ctcggtaaaa ggcacggtgt tcattttacc 600 acccctgttc attttaccac cacttcccc 629 // ID Rehavkus-1_TC repbase; DNA; INV; 9485 BP. XX AC AC154132; XX DT 30-APR-2006 (Rel. 11.04, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed copy of the Rehavkus-1_TC DNA DE transposon - a fossilized copy. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus-1_TC; Rehavkus group. XX NM Rehavkus-1_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-9485 RA Kapitonov V.V., Gentles A.J. and Jurka J.; RT "Rehavkus-1_TC, a family of Rehavkus DNA transposons from the red RT flour beetle genome."; RL Repbase Reports 6(4), 193-193 (2006). XX DR EMBL/GenBank/DDBJ; AC154132; Positions 91701 82217. XX CC Rehavkus-1_TC belongs to the Rehavkus group of the MuDR CC superfamily of "cut and paste" DNA transposons. Transposons from CC this group are widespread in different metazoa, including CC insects, sea squirts, sea urchin and fish. The beetle genome CC harbors several ~95% identical copies of this transposon. Its CC ~1340-bp inverted termini are composed of a 412-bp terminal CC inverted repeat and 164-bp subterminal minisatellite-like unit . CC Similar structure of termini characterizes all Rehavkus CC transposons identified so far in different species. The CC transposon is flanked by a 9-bp target site duplication and CC encodes the 1079-aa Rehavkus-1_TC transposase, whose C-terminal CC portion contains the Ulp1 cysteine protease. XX FH Key Location/Qualifiers FT CDS 2062..5292 FT /product="Rehavkus-1_TCp" FT /translation="MPKNFDFKELADFIKSLAPNTSLTYIANLPLSDRIIH FT ESSIYMFRLSKPSKRRLNDCKNRIKKILSRNEGNIKQFFHKKIVESKIISN FT SPVFFDNHYTTEKNDFSINSPESTNQFSSFDNYNDFHITDFSVNDVPNSIK FT NLYNQPNKSKFTLLPENSQNDISIKRLKPMSFKNCNIKEGTFSLSYEQFND FT IKFNNSLINYQITVRKKFATVNNICPLNFKKHQSDKKGKKFTIYAYCAHEN FT CKVFIIKLSPEKHDIIGTVFSNTLNYRHDNQKRTTFVRNFERQLLGNKLQT FT STPFMTRVRDTNFVPSDQIRAGNYDNIKSKHVYKKIKSQTLSKLDRDTDEF FT FDLIKMSRENRQYIYRVFEPLTVYLFSIEQLEVIKHALSQTNNGFIVLHLD FT ATGSVVHSPPETKKRVYYYAGVVHIKDKELVCPVFEMITSEHDGYSIGSWL FT QAFESFIVKSKQLKWSIFSYVVTDFSFAFINAITTQWNKMDDISVYLNIVY FT DYITSETTVRPKITLKLCCAHFMKLITNDINKICGSNKTQKKFFKHSIAAV FT FMKTSLPEVEKWFKKMVLITSKEFLDLDGKLSNVISKMFDTLNNSKDLEDI FT DEETETETEDTEIPHTLYQKSKFYRHFFKICEQDKKMINSAIDSDNINIFY FT NPLILDLILKKYLPILPLWSNIYPNLRDRDSIGRFSNAPVENWFCQVKNNI FT LKPTGLRLRSTRFIRVVREVVLSKSKECALNIDKNQCAIIKKKRIQTKNID FT LGLESEENWNKKIKSEKKKSYFEKPINEIHPSRSKFNSKVKEIKNVDFYNN FT NLIRDDRYYTLGNSGVNYVVGYYSNAHPMKELYLEDFETLGNFKNFRKMWL FT SNFLMDISLHVLYTKNQHKDTVRIITTHETLSIFENTCPKLTHTLEGIWLL FT PLLINQNHWCFVIIDFNKKTFSFINPLKISNLNQTETYLKKFLLGIRAPRR FT FSFTHWRCNIFKHPVQKNAYDCGVYVLLFAEKFIKEENPDLYSCYEPEEFR FT IILRNLLLEMSDDMENRCLICGQLNLNVTTDWVECDSCKRWIMLNCLPPRL FT APSNISENFTCFACQNYQNL" XX SQ Sequence 9485 BP; 3320 A; 1427 C; 1447 G; 3291 T; 0 other; attacggtga cgatagcact ttggaaagag ctagcaccat gaaatttcca gcggaccagt 60 caaatcacgg atgtgcgttt tcagcacttt gcgacgttgc tacgtttctc acaaattata 120 agaaaataaa ttatttactc attttctttt aaaattcaag tgtattgtta ttaaaaaaaa 180 taaagtaaat gcaatttcaa acttgttgta tctttgaaag caatggggga tgtagatagt 240 ttactagttg acacaaaaag gtgaaaccgc ctaataaagg caaaacaaaa acgaggaggg 300 catgctaccc aattatttag ggggcagatt ccacctaaat aagataatta ttcgttaatt 360 tcttcttcaa aaacattatt tgaaaaaaaa agttaataga acgacatact aattcaaaca 420 gagggttaga tgtcttccca aacttttaga ggattgaaat tttcaagaaa ggtaggtatt 480 taatgggctt tcatgtggaa tgggttaaat gcgtcgacaa aaagaaatag tactagaaaa 540 aaatatttaa tattattaat aaaatgttga catatttcaa actaagggtt ggatgtcttc 600 ccaaactttt agagggttga aattttcagg aaaggtagag tatttaatgg gctttcatgt 660 ggaatgggtt aaatgcgtcg acaaaaagaa atagtactaa aaaaaatatt taatattatt 720 aataaaatgt tgacatattt caaactaagg gttagatgtc ttcttaaact tttagagggt 780 tgaaattttc aggaaaggta gagtatttaa tgggctttca tgtggaatgg gttaaatgcg 840 tcgacaaaaa gaaatagtac tagaaaaaaa tatttaatat tattaataaa atgttgacat 900 atttcaaact aagggttgga tgtcttccca aacttttaga gggatgaaat tttcaggaaa 960 ggtagagtat ttaatgggct ttcatgtgga atgggttaaa tgcgtcgaca aaaagaaata 1020 gtactagaaa aaaatattta atattattaa taaaatgttg acatatttca aactaagggt 1080 tggatgtctt cccaaacttt tagagggatg aaattttaag gaaaggtaga gtatttaatg 1140 ggctttcatg tggaatgggt taaatgcgtc gacaaaaaga aatagtacta aaaaaatatt 1200 taatattatt aataaaatgt tgacatattt caaactaagg gttggatgtc ttcccaaact 1260 tttagaggga tgaaattttc aggaaaggta gagtatttaa tgggctttca tgtggaatgg 1320 gttaaatgcg tcgacaaaaa tggaaagtta attttgtaag acataaaaaa gccagttgaa 1380 gccagttttg cacttttatg tcaatcaaat tacaattcgt ccgaaattct gtatttgtaa 1440 ttattatttt ttataataat tataatgctt atctctttct tggcagcgac cttgtgataa 1500 gtatctgggt aataataata aaatcacaat tcgtgcattt tgataaatta taaatttttg 1560 aagaactgct tttctcgtag gtaccttaaa acgaaaatta gtttattgaa gtcgtgttaa 1620 tgagccgatt tttacgagat aaaaaggtaa aaagggtata ccgattattt ggaaaacatt 1680 ttattggtct acttagaatc atcccggctt cccggctacg cggtattttc ttagaattta 1740 gagtaatcgt ctgctacctt atctctgtct gctataacga gttataccga ttctcgattc 1800 atctgaattt gcagttattg catggctttg gtgatagcag gtgtattttg ctgagctttt 1860 gcgcgaatta attttgtaag aacctttttg cttttcggaa tcttttcttt tgtttttagt 1920 ttattggttt tgttttcagc atttatagtc gttagtttta gaatttttag attttttagt 1980 ttttgaattt ttgggttttt ttagttttag aatttttagt ttttagtttt ttattattat 2040 ttttgttgtc tactttaaaa aatgcccaag aattttgatt tcaaggaatt agccgatttt 2100 atcaaaagtt tggctcctaa tacatcactt acatacatcg caaatctacc tttaagtgat 2160 cggattattc atgaatcttc tatttacatg ttccgccttt ctaaacctag caaacgaagg 2220 ttaaacgatt gtaaaaatcg tattaagaaa attttgtcgc ggaacgaagg aaacataaaa 2280 cagtttttcc ataaaaaaat cgtcgaatcc aaaatcattt ctaatagtcc cgtttttttt 2340 gataatcatt acactactga aaaaaatgat ttttcaatta actcaccaga atccaccaat 2400 cagttttctt ctttcgataa ttacaatgat ttccatatta ccgatttttc agtgaatgac 2460 gttccaaatt caataaaaaa tttatataac caacctaata aatcaaaatt tactcttttg 2520 ccagaaaatt cacaaaatga tatatcaata aaacgtttaa aaccaatgtc ttttaaaaat 2580 tgtaatatta aagaaggcac tttttcacta agttatgaac aatttaatga cattaaattt 2640 aataattctt taattaacta tcaaattact gtgaggaaga aatttgcaac agttaataat 2700 atttgtcctc ttaattttaa aaaacaccaa tcagacaaga aaggaaaaaa atttacaatt 2760 tatgcatatt gtgcacatga aaattgtaaa gtgtttatca ttaaattgtc gccagaaaaa 2820 catgatatta ttggaaccgt ttttagtaat acactaaatt atcggcacga taaccaaaaa 2880 aggacaactt ttgtgagaaa ttttgaaaga cagctattag gaaataaatt gcagacatcg 2940 actccattca tgacacgagt tcgggataca aattttgttc cctcggatca aattcgcgca 3000 ggaaattatg ataatataaa atccaaacat gtgtacaaaa aaatcaagtc acagactctt 3060 tctaaattag atagagatac agacgaattt ttcgatctta taaaaatgtc gagagaaaac 3120 cgccagtata tttatcgcgt ttttgagccc ctaacagtgt atttatttag tattgaacag 3180 ttagaagtta ttaaacatgc actgtctcaa acaaataatg ggtttatagt tttacattta 3240 gatgcaaccg gaagtgtcgt ccattctccc cctgagacga aaaaacgtgt ttattattac 3300 gctggagtgg tacacatcaa agataaagaa cttgtttgtc ctgttttcga aatgataact 3360 tccgaacatg atggttattc tattggttca tggttacagg cttttgaatc atttattgtt 3420 aaaagtaaac agctaaaatg gtccattttt tcttatgtgg ttaccgattt tagtttcgct 3480 tttattaacg ccatcaccac acagtggaat aaaatggatg atattagtgt atacttaaat 3540 attgtatacg attatataac atcagaaacg actgttcgac caaaaatcac tttaaaatta 3600 tgttgcgctc attttatgaa attaataaca aatgacatta acaagatttg cggtagcaat 3660 aaaactcaaa aaaaattttt caaacattct atagcagctg tttttatgaa aacatcttta 3720 cctgaagtgg agaagtggtt caaaaagatg gttttaatta cttcaaaaga atttttagat 3780 ctcgatggga aattaagcaa tgtaataagc aaaatgtttg atacattaaa taatagtaaa 3840 gatctagaag acatagatga ggaaactgaa accgaaactg aggacaccga aattccccat 3900 acactttatc aaaaaagcaa gttttataga cattttttca aaatatgtga acaagataag 3960 aaaatgatta atagtgcaat agattcagat aatatcaata tattctacaa tcctctgatt 4020 ttagatctca ttttaaaaaa atatttacca attttacctc tttggtcaaa tatttatcca 4080 aatttacgtg atcgtgatag tattggaaga tttagcaatg cgcctgtgga aaattggttt 4140 tgtcaggtga aaaataatat tttgaagcca actggactaa gacttcgaag tactagattt 4200 attcgagtag ttcgagaagt tgtcttgagt aaatcaaaag aatgtgcact taatatagat 4260 aaaaatcaat gtgctatcat aaagaaaaag cggatacaaa caaaaaatat tgatttagga 4320 ttagaatcag aagaaaactg gaacaaaaaa attaaatccg aaaagaaaaa gtcatatttt 4380 gaaaaaccaa ttaacgaaat tcatccttcc cgttcaaaat tcaattccaa agtaaaagaa 4440 attaaaaacg tagattttta taataacaat ttaattcgag atgacaggta ttatacattg 4500 ggaaactcag gcgtcaatta cgttgtagga tattattcta atgctcatcc aatgaaagaa 4560 ctttaccttg aagatttcga gactttggga aattttaaaa attttagaaa aatgtggcta 4620 tcaaattttc ttatggacat aagtcttcat gttttatata caaaaaatca acataaagac 4680 actgttcgta tcatcacaac gcatgaaaca ttatcaattt ttgaaaacac ttgtccaaaa 4740 cttactcaca cactagaagg aatttggttg ttgcctttgt taataaatca aaaccattgg 4800 tgctttgtga taatagactt taataaaaaa acatttagtt tcattaaccc actcaaaata 4860 agcaacctta atcaaactga aacatactta aaaaaatttt tattaggaat tagagcacca 4920 aggcgatttt cttttacgca ttggagatgt aatattttta aacatcctgt tcaaaaaaat 4980 gcttacgact gcggtgttta tgttttatta tttgcagaaa aatttataaa agaggaaaat 5040 ccagatttgt attcgtgtta tgaaccagaa gaatttcgaa tcattttaag aaatttgtta 5100 ttagaaatgt cagatgatat ggaaaataga tgtttgattt gtgggcaatt aaatttaaat 5160 gtgacaacag attgggtaga gtgcgactcc tgtaagcgct ggatcatgct aaactgtcta 5220 cctccaagat tagctccttc taatatttct gaaaatttta catgttttgc ctgtcaaaat 5280 tatcagaacc tgtagtttat ttctttattc tttattcttt gtaccatatg attcaatacg 5340 ggtgattcaa aagtgtcgta caatctgtta aaggacatct gtcaaaggtc aaaggacatc 5400 aaactgaatt gaaggttcct atggcaaaaa attctattag cctattttca caagataatg 5460 ttctttaaag ttaaattttt cttttaattt tttggtgttt cagatcatgt tttttgactt 5520 tagttagtac aaacgacacc tctggcaaca gatacctgtt gaaaaaaaaa cactagttaa 5580 tgtcttatga tgtacgaaac aattaaaaaa cttgcaaatt cgacaataaa ttaaactcaa 5640 cgtcataaga aacaaaagtc gataccgttg ctaaggaaaa caaattaaaa aattttaatt 5700 aaaaattaag ttgcttgaaa aaaaagtgac ttcattttta tgcttaattg aacatctttt 5760 atcagtttat caccattcga gagtgtgtct gcacttttga atcaccctgt cttataccta 5820 atttttaata agtggtggct atgaaaacaa aaattatgac aacgtgggtg taatttcaat 5880 ttttggtcta ataaattgta tgagatttgt cattaattca caaaaagtgg cgctgagctt 5940 gttttacact gatttgaaat atgtactttc tgtttcaaaa ctatagaaat atagaactaa 6000 aaaatacatt ttatcaacct tattaaaaaa aatcggtaaa atgcgatgtt agcattttcg 6060 taattttgca acagggagtt atcattgaat gccactaagg gttagttgca taattaaaca 6120 acactgtccc cggtcattcc ttggaagggt gactggataa tgtcatgcca aaaacatcgc 6180 aactaactcg tcttttgaag aagacgcggt cccgttcact attagtggcc gttaggtctg 6240 gtcagaggct gtcgatgcga cctgaaaact ctgacacttg ggtgatgaca ttaccaccat 6300 acccatattt atttatttat ttttaccaca gagaggcaaa aatgaaaaaa acagacaaaa 6360 ggaaaatagt tcgtactgaa aaaattatta aaactgtgtg acaagtagta gagaaaacac 6420 tgacttgttt acctataccg gtgggacacg gggggacaaa ccttattgcc aaaatggcaa 6480 cgttgcaggt tcataatttg aggtgaacta catcacaaaa gcatgatttt tgtgtataac 6540 aagaatctgt gatacaactg aacactactt acaaaatttg agtgtaaagc cttcattatt 6600 tcataatttt catacagaac aaatgttaaa agtgttagct agaagctaag ctataaagtt 6660 ttaaaacttt tgttttctat tgccactgtt tattagaaat taaatctatc atacggtatt 6720 actccgcttt tattgttgaa gcacagtaat ttttagttta tctaggcttg ggcattagtg 6780 catgttgcat aggtaacagt ggcgttcgtt cttatttatt agataaaatc caagattttt 6840 aaaatttcca gatagtaaaa ctggggacga gcaaaatgga gattatgtat ttgttttagt 6900 aattttaatt ttggaaattt tagaaatact acacaatttt ttagttattt gttcggtgct 6960 tgcatgatat ggcttatatt tttttaaata agaattaatt ttaagtttcc tctctcagaa 7020 ttagtgcctt gtcgcggcga gagagcttac cgagtaatat ttattaaaga cattccataa 7080 ataaggtaac taatattact taaatattaa ttaattttta gtattttctt actgtgtaaa 7140 tattactcga acaaaactaa ttaaggttcc tttacaaaaa agtactaact ctctgcgttt 7200 gacagaaaag aacgtttgaa agtaatcaat cacggaaggc cttgttccaa aaataggtag 7260 aacgaataac aaaagtgtgg actcgaatta tttttcgttc tttgtaacat gcctgcctat 7320 acctggaagc gagcttctgg gattggtcag tttgaaacct ctgtatcact taaatgaaaa 7380 ataaatggcc tacttttttt gaatttttat catctctatt gcttttctac cagcacattt 7440 ccgctgggaa gtgagttctg gaccacgctg tataaattac ttttaaatta cctattaaag 7500 atagctttgt tgcataaatt tatcacatta ttttgtgaaa taaatattaa gcttttccaa 7560 agatgaattc agcagctcta gaaaggtgta acaatttgtc taataatttt tgttttaagg 7620 tatctacgag ataaatgttt tccaaataat cggtaaacca tttttacctt tttatctcgt 7680 aaagatgggc tcattaacac gactttaata aactaatttt cgttttaagg tacctacgag 7740 aaaaacagtt cttcaaaaat ttataattta tcaaaatgca cgaattgtga ttttattatt 7800 attacccaga tacttatcac aagatcgctg ccaagaaaga gaaaatttca accctgtaaa 7860 agtttgggaa gacatccaac ccttagtttg aaatatgtca acattttatt aataatatta 7920 aatatttttt tctagtacta tttctttttg tcgacgcatt taacccattc cacatgaaag 7980 cccattaaat actctacctt tcctgaaaat ttcaaccttc taaaagtttg ggaagacatc 8040 caacccttag tttgaaatat gtcaacattt tattaataat attaaatatt ttttttagta 8100 ctatttcttt ttgtcgacgc atttaaccca ttccacatga aagcccatta aatactctac 8160 ctttcctgaa aatttcaacc ctctaaaagt ttgggaagac atccatccct tagtttgaaa 8220 tatgtcaacc ttttattaat aatattaaat atttttttag tactatttct ttttgtcgac 8280 gcatttaacc cattccacat gaaagcccat taaatactct acctttcctg aaaatttcat 8340 ccctctaaaa gtttgggaag acatccatcc cttagtttga aatatgtcaa ccttttatta 8400 ataatattaa atattttttt tagtactatt tctttttgtc gacgcattta acccattcca 8460 catgaaagcc cattaaatac tctacctttc ctgaaaattt catccctcta aaagtttggg 8520 aagacatcca tcccttagtt tgaaatatgt caacctttta ttaataatat taaatatttt 8580 ttttagtact attttttttg tcgacgcatt taacccattc cacatgaaag cccattaaat 8640 actctacctt tcctgaaaat ttcatccctc taaaagtttg ggaagacatc caacccttag 8700 tttgaaatat gtcaacattt tattaataat attaaatatt tttttctagt actatttctt 8760 tttgtcgacg catttaaccc attccacatg aaaacccatt aaatactcta cctttcctga 8820 aaatttcatc cctctaaaag tttgggaaga catccatccc ttagtttgaa atatgtcaac 8880 cttttattaa taatattaaa tatttttttt agtactattt tttttgtcga cgcatttaac 8940 ccattccaca tgaaagccca ttaaatactc tacctttcct gaaaatttca tccctctaaa 9000 agtttgggaa gacatccaac ccttagtttg aaatatgtca acattttatt aataatatta 9060 aatatttttt tctagtacta tttctttttg tcgacgcatt taacccattc cacatgaaag 9120 cccattaaat actctacctt tcttgaaaat ttcaaccctc taaaagtttg ggaagacatc 9180 taaccctctg tttgaattag tatgtcgttc tattaacttt ttttttcaaa taatgttttt 9240 gaagaagaaa ttaacgaata attatcttat ttacaagata caagtttgaa attgcattta 9300 ctttattttt tttaataaca atacacttga attttaaaag aaaatgagta aataatttat 9360 tttcttataa tttgtgagaa acgtggcaac gtcgcaaagt gctgaaaacg cacatccgtg 9420 atttgactgg tccgctggaa atttcatggt gctagctctt tccaaagtgc tatcgtcacc 9480 gtaat 9485 // ID BEL-43_CQ-I repbase; DNA; INV; 3007 BP. XX AC AAWU01000931; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-43_CQ_; KW BEL-43_CQ-LTR; BEL-43_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3007 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 239-239 (2011). XX DR GenBank; AAWU01000931; Positions 38684 41690. XX CC 'CCGTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 787..3006 FT /product="BEL-43_CQ-I_1p" FT /translation="MDQTMKKKLTYALQQRHLAEQKVVFVEDLLLGLYQPT FT LPQLTTLYDFLCAAFREYSQHHLTVLELIPIEDLAQQEAEFERNDASFYNV FT ATAVAEMQIEAERKPTSTLIAQRQQHLQAKVWLKQMHKAWEANQVDVPKPA FT PSHKLLQNQQTAPTRPIVIDSASGLGDKSLTIQSSDQPSAADSRSARDINA FT NPNANARSDLLEATSCESKVVTAESESGRVQPAMVRCSDDEADQRHVCSDS FT GLQRGVQIHNHSQIQICRLRIGSIPNLTTLMTVPENQPLLQAAARAPDKCE FT VPFSSHVTPDKIAVHKPTRNPPKLAPLDQCKTGLPPKPLQEQQTATAEPVA FT IESTYKLGSDSRPSQNAEQPNSVASRLSEKVTTNPSMDTKSDFPDESSSEP FT THPTGHLKDVTTEAELDNELPAVFPSKQVPPHNSDLKPEATVKTEPLQPKV FT VQNSIHHGEPEQRRVHSDSGLQKMDVPESEQKPNCFHQNGSYNMDFAMVMT FT VPVYKPLPQPAAKAPDKRRLPLPSRVHLDKSVVHVEEGRPPDDLVLAGEPE FT LKCSRSTQSRSHSEPASSEMHVVNQSRKRHKSVSNLSNLTPYKFANVHTSR FT RVPSIQITNRQRWCTIWFLIAQVQSQPVALNLVISMIGPQGSFHEPPSQNE FT PFQVRDQLSRFQRTVYRRRKSWHSTIPPDKSRDPESAQPCKRRSNLREFHA FT VLPAIEDVGERREPADQSAAIHRKRCVGRTKELRDLNGGS" XX SQ Sequence 3007 BP; 791 A; 942 C; 766 G; 508 T; 0 other; tggtcacatc gaaccggatt tgaaaccagc tcaagcctcc aggattgaag tcgtcggttc 60 caaccgggac tagcccagcc gctccgaaag caaccggaga gtaccaagaa gcgctactgc 120 ccaagttccc cctgcgtcca atcgcagttc gaccgtttcc gagagactgt ccagccgtgg 180 aggtcatcgg aagtcggatt cccggaagtg tccaccctag gaacttcctt tctgcagttg 240 aaccgtcgaa ctgaactcgc gcggatgcac aaagccggcg agtcgttcct gcccaagttt 300 tcccgttcgc tccatacccg cgttccggaa gcgaccggaa agtgccggcg agagctcctg 360 tctaagttcc ccacctcgct gccccatcga ccatttccga gcgactgtcc agccgtggaa 420 gtcatcggga gtcggatatc cggaaggtaa ctcgcgcgga tgcacaaagc cggcgagtca 480 ttcctaccca agttccccct gctgtccatc ctaggaactt cgtccaagca gttcacccga 540 cgaattgctc tcgcgcggat gaacaaagcc ggcgagtagt cccagttcaa gcgttcccga 600 tccaacccgt tccgagccga actcgcgagc aaatgcgttt gcgcgcgtgt gtgtgtcacc 660 gatcaaaaaa gtgacgcaac gagaagtgtc gtcggccatc cgaggggacg aacttttcca 720 gacgaagcga cccagcagca caaaccaaat cctcgacgtg ggcttcccag ctagcgggaa 780 gcggccatgg atcaaacaat gaagaagaag ttgacatatg cgttacagca gcgacatctg 840 gcggaacaga aggtggtgtt tgtagaggat cttcttctgg gtttgtacca acccacgcta 900 ccacagctga caaccctgta cgacttcttg tgtgcagcgt tccgagagta cagccagcac 960 cacctcacgg tcctcgaact catcccaata gaggacctcg ctcagcagga ggctgaattc 1020 gagaggaacg atgcgagttt ctacaatgtt gccactgcgg tggcagaaat gcagatcgaa 1080 gcagagagaa agccaacatc aaccctgatc gcgcaacgcc agcagcactt gcaagcaaaa 1140 gtatggctga aacagatgca taaggcctgg gaggccaacc aagtggacgt tccaaagcct 1200 gcaccgagcc acaaactcct ccagaaccaa cagactgctc ccacgaggcc gatcgtcatc 1260 gattctgcca gcggacttgg cgacaaatcc ctgacaatcc agagctcgga ccagccgagt 1320 gcagccgatt cccgatcagc gagggatatc aacgcgaacc cgaatgcgaa cgccaggtct 1380 gacttgctcg aggcaaccag ttgtgagtct aaggtcgtca ccgccgaatc ggagtcaggc 1440 cgagtgcagc cagcaatggt gcgatgttcc gacgacgaag ccgaccagcg ccatgtctgc 1500 agcgatagtg gcctccaacg tggggtccaa atccacaatc acagccaaat ccagatttgc 1560 cgcctccgga tcggatctat cccgaacttg acgactctga tgaccgtccc ggagaaccaa 1620 ccgcttctgc aggccgcggc tagggcacca gacaagtgtg aagttccctt ttcaagccac 1680 gtaactccgg acaagatcgc agtccacaag ccaacaagga atccgccaaa actagcgccc 1740 ctcgatcagt gcaagaccgg attaccaccc aaacccctcc aggagcagca gacggcaact 1800 gcggagccgg tggccatcga atccacctac aagcttggca gcgattcccg gccaagccag 1860 aacgcggaac aaccgaattc agtcgcctcc cgactaagtg agaaagtcac cacgaacccg 1920 agtatggaca ccaagtccga cttccccgat gagagtagca gtgagccgac gcatccaacc 1980 ggacacttaa aggacgtcac caccgaagct gaactcgaca acgagctgcc agcagtgttc 2040 ccgtccaaac aagttccccc acacaacagc gaccttaaac cagaagcgac cgtgaagact 2100 gagccactac aaccgaaagt ggttcaaaat tccattcacc acggcgagcc agaacaacgc 2160 cgtgtccaca gcgacagcgg tctacagaag atggacgtac cagaatccga gcaaaagccg 2220 aactgcttcc atcagaacgg cagttacaac atggactttg cgatggtgat gaccgtccca 2280 gtgtacaaac cgctgccgca gccagcggcc aaggcaccag acaagcgtag acttccctta 2340 ccaagccgcg tacatctgga caaaagcgta gtccacgtcg aagaagggcg tcctcccgac 2400 gacttggttc tcgccggaga acccgagttg aagtgtagcc gatcaaccca atcccgatcc 2460 cactccgagc ctgcatcgtc agaaatgcac gtggtcaacc agtcgagaaa acgccacaag 2520 tcagtaagta atctgtccaa cctaaccccg tacaagttcg ccaacgtaca cacgagccgt 2580 cgtgtgccca gcatccagat cacaaaccgc cagaggtggt gcaccatctg gtttctgatc 2640 gctcaagtgc agagtcaacc agtcgcgttg aatctcgtga tctcaatgat tggcccccaa 2700 ggtagctttc atgaaccgcc cagccaaaat gaaccgttcc aagtacgaga tcaactcagt 2760 cgtttccagc gaacggtcta tcgccggcgc aagagctggc attcgacaat cccaccggac 2820 aagtcacgcg atcccgagtc agctcaaccc tgcaagcgca ggagcaacct ccgagagttt 2880 cacgccgttt taccggccat cgaagacgtc ggcgagcgac gagaaccggc ggaccagtcc 2940 gcagccatcc accggaagcg ttgtgtgggc cgaaccaaag agttgaggga cctcaacggg 3000 gggagta 3007 // ID APO2_AP repbase; DNA; INV; 489 BP. XX AC AF407669; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Acanthamoeba polyphaga isolate Apo2 repetitive DNA sequence. XX KW APO2_AP; Repetitive DNA. XX OS Acanthamoeba polyphaga OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RA Webb R.S., Garman C.G., McIninch P.S. and Brown L.B.; RT "Amoebae associated with ulcerative lesions of fish from tidal RT freshwater of the James River, Virginia."; RL J. Aquat. Anim. Health0-0 (2002)In press. XX DR Genbank; AF407669; Positions 1 489. XX SQ Sequence 489 BP; 66 A; 131 C; 151 G; 139 T; 2 other; tgttggtcgt tgatacgtgt tgtgttgtgg tgttgtgttg ttgtattgtt gtattgtttt 60 gttgttgtgg tgttgtggtg ttgtgttgtg tgaatcacca tcagctgatt ggccactgct 120 actgagagaa gaagagggag gcagcgtacg gccgtagacc tcggcggcgt gttggaagtc 180 gtgggcgaga taggccagct gctggtactt gttctcgtcg tccttgaggc gcaccangtc 240 ctggaagcgc tggttccaga tgctgttcga gctgcagcgg aggtcgccgc ggtcgatgct 300 cctgagcacc cgctccatct cctcctcctc ggacgtgttc cactcctcct cttcccggca 360 tctcgtagtc ggcgtcgtcg gngcctcgtc ttcgtcagac gaagtagatg acgatgactc 420 gccgacgcgg cgagcccatc gtcctcatca tcacccgctg tctgtgtcgt atcccgccag 480 ttcgccctc 489 // ID Kiri-14_AAe repbase; DNA; INV; 4284 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiritsubo non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-14_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4284 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 709-709 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 280..1062 FT /product="Kiri-14_AAe_1p" FT /translation="MSVNLNKPLTRASSTSTVNTHTTPGRKDDSNAVMTVN FT DLWIKMQTMFTQSNERLEAKIDSCMDRLNNRMDVLEVNLSASKQECGDNIA FT AISEQVDHIRSDIFQTNRRMDVFGTSSELVISGIPYLQTEDLRQAFRNIAI FT AIGYLEDSVPTTALKRLSRHPIVPGTAPPILCEFALRTARDDFYRSYLQRR FT SLSLRQIGFDNNNRVYINENLPPSVRRIRSEAIKLKKQGQILRVFTKEGTV FT YVKRDEASDAVAVNSIAELN" FT CDS 1290..4121 FT /product="Kiri-14_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MGSRFGNNSSASLLIPKAVFTSVLESDHLCICHLNVQ FT SLTARNFTKFNELSMIFRDSLMDVICMSETWLDDSINDSMINIRGYKLIRN FT DRDRRGGGVCVYVRNNLVARILKTSNNNNSSPHSKTEYLCLEIECQGERFF FT LGVYYNPPEVDCSMLLFEHFEDYTVRYDSTFFVGDFNTDLRKTNNRQRRFS FT DLLSSLSLQCINIEPTFYHHFGCSLLDLVITDSPDLVVKQDQVSMPGVSNH FT DMLFFSLKIGCPQSDHGILYRDYKNFDSTALQNAFESIDWHTFLRCEDPNW FT LVEYFNNCAKYLHDTFIPLRTRTFKPNVWFTGEIELAIVNRNLAYRNWKHN FT KTIELRIVYNRLRNRVTQLIRSAKENFEKRNINTNLPSKVLWNRVKQLGLS FT KSKSTTGIDNFSSDDINDYFSSNFSTDSSDYSSFYDVSNGFMFNTVEDYEV FT INAIFSVKSNAVGFDEVPLSFIKILLPLALPLIVHIFNSIIKTSTYPLAWK FT YVKVIPIKKKGNSSTLSNLRPISLLSSLSKAFEKILKIQMSDYLTGANLLT FT PHQSGFRKDHSTTTALLKVHDDISQKIDKKGIALLLLIDFSKAFDRVSHSR FT LLKKLNCRFNFSLSSISLIKSYLTDRFQAVVNSGSTSSAHMIKSGVPQGSI FT LGPLLFSMFIDDLPSVLDFCSIHIFADDVQVYLCSGNTIDIEQMSVNINHD FT LRKIFSWSCINLLPINSSKTKALLISRKRNPPNPPDIFINNEKIEFVEHAS FT NLGVIFCKDLNWDAQINSQCGKVYGSLKRLHLSTRHLDIETKIRLFKTLLL FT PFFTYGDFLYSNANAYALNKLRLCFNACVRYVYNLPRFSRISHLHTSLIGC FT SFSQFYKYRTCLMFFKIINTMKPDYLYSKLVPLRQSRARNYSIPNHVSTYY FT GQSFFVRSIANWNQLPSSLKSSSSVTIFKRNLLVWLSSTE" XX SQ Sequence 4284 BP; 1261 A; 812 C; 770 G; 1431 T; 10 other; caaawaasgg ctwtakcaca ckcaamctcg aatttcctga agggatgtgt gtaaccgagt 60 agagtgtttt tgtcgacgca ktttcaaacg actttttcgg ctgktaaawc tkaagtttaa 120 atcgagtttt tgcgcgatct gggactaaag ttgtggtgta gtgattgcag ttgtgttctg 180 agctgttgta attaactgta aatccagtgc gaacttagtg tttggatcaa cttgtcaatc 240 gccattctat ataaagccct tcagcagtgt atcgcaacaa tgagtgtgaa ccttaacaaa 300 ccactcactc gtgcatcttc tacatccact gtcaacactc atactacccc aggtcgtaaa 360 gatgactcca acgctgttat gacggtcaac gatttatgga tcaaaatgca gacaatgttt 420 acacagtcta atgaacgttt ggaagccaaa atcgattcgt gtatggacag actgaacaat 480 cgaatggacg tgctggaggt taatctgtcc gccagtaaac aggagtgtgg cgacaacatt 540 gctgcgatct cagaacaagt cgatcacatt cgttcggaca tctttcagac taatcgcaga 600 atggacgttt ttgggacgag ctctgaacta gtaatctccg gcattccgta tcttcaaacc 660 gaagatttga ggcaggcttt tcgtaatatt gctattgcta ttggttatct tgaggatagc 720 gttccgacaa ctgcacttaa acggctttct agacatccaa tagtaccagg aaccgcacct 780 ccaatcctct gtgagtttgc cctacgtact gcaagagatg atttttatag atcttatctt 840 cagcgaagat cattgtcgct ccgtcaaatt ggtttcgata acaataatcg cgtctacatc 900 aatgagaatc tgcctccttc agtgagacga attcgttcag aagccatcaa actaaagaaa 960 cagggccaaa ttttaagagt cttcacaaag gaaggaactg tatacgtgaa gcgtgacgag 1020 gcaagtgatg ctgttgctgt taactctatc gccgagttga attagcccaa gaacatcata 1080 tgatcgtgcc ctgcgaaaat gttgctgctg ttgctgttgt actgctgtcg ttgctatttt 1140 tatcagagtt tggaaatgcg tcgatgatta taatgttata acaagttttt tttttttgtc 1200 aatgaactcg aatttgattt gctcattcaa cactattact ctgtactgga ttgaggttac 1260 tgaaaggtgt ttgctgctac cttacaacaa tgggtagtcg tttcggaaat aatagttctg 1320 cgagcttgct gattcccaag gctgttttca cctcagtctt ggaaagtgat catctttgta 1380 tttgccatct gaatgtacaa agtctcacag cacgaaattt caccaaattc aatgaactgt 1440 caatgatatt tcgtgatagc ttaatggatg taatttgcat gtctgaaaca tggttagacg 1500 attcaattaa cgatagtatg attaacattc gcggttataa actgattaga aatgatcgtg 1560 accgacgtgg tggaggcgtc tgtgtctatg ttcgaaataa tttagtagcg cgtattttaa 1620 aaacatcaaa caacaacaat tcatctcccc atagtaaaac tgagtatttg tgtctggaaa 1680 tcgaatgtca aggtgagcgt ttcttcttag gcgtgtacta caatcctccg gaggtagact 1740 gttcgatgtt gctttttgag cattttgagg attacactgt tcggtatgat tctacttttt 1800 ttgtgggtga ctttaacaca gatctgcgga aaaccaataa cagacaaaga agattttcgg 1860 atttattatc tagtttatct cttcaatgta tcaatatcga accgactttc tatcatcatt 1920 ttggatgctc gttgctagac cttgttataa ctgatagtcc tgatctagtg gtcaagcaag 1980 atcaagtttc aatgccgggc gtctctaacc atgatatgct ctttttttct ttaaaaatag 2040 ggtgcccgca atctgatcat ggcattctct atagggatta caagaatttc gacagtactg 2100 cattacaaaa tgcatttgaa agcatagact ggcacacctt tttgcgttgt gaggacccta 2160 attggcttgt cgagtatttt aacaactgtg ccaaatacct tcatgatact tttataccct 2220 taagaactcg tacttttaaa ccaaatgtgt ggttcactgg ggaaattgaa cttgctattg 2280 ttaacagaaa cttggcatat aggaattgga aacataacaa aactatcgaa ctaaggattg 2340 tatacaatcg tttgcgtaac cgtgttactc aattgattag aagtgctaaa gaaaatttcg 2400 aaaaacgtaa catcaatact aatctgccaa gcaaagtgct ctggaataga gtcaaacaac 2460 taggtctttc taaatccaaa tcaactaccg gtattgataa tttttcttct gatgatatca 2520 atgactattt ttcctctaat ttctcgacag attctagtga ttattcttcc ttctatgatg 2580 tatccaacgg atttatgttt aacacagttg aggattatga agttattaac gcaattttct 2640 cggttaaatc gaatgctgtt ggcttcgatg aagttccttt aagttttatc aaaatattat 2700 tgcctttagc tcttcctctt attgtacaca tattcaattc tatcatcaaa acttctactt 2760 accccctagc ttggaagtat gttaaagtaa taccaataaa aaagaagggt aattcttcga 2820 ctctttctaa tcttcgaccc attagtctgc tgtcctcgct ttcaaaagct tttgaaaaaa 2880 tactcaaaat tcaaatgtca gactatctta ctggcgctaa tttgttaacc ccgcaccaat 2940 caggatttcg taaagatcat agcaccacta cagccctttt gaaagtacat gatgatattt 3000 cacaaaaaat tgataaaaaa ggaatagctc tgttattatt gattgatttc tcgaaggcct 3060 ttgatcgcgt ttcgcatagt aggttgctca agaaactaaa ctgtcgtttt aatttctctt 3120 tgtcttccat ttctctaatt aaatcctatt taactgatag gtttcaagct gttgtaaatt 3180 ccggatcaac atcgtcagcc catatgataa agtccggtgt acctcaggga tctattcttg 3240 ggccattact tttctctatg tttattgacg atctaccctc cgtactcgat ttttgttcga 3300 ttcatatctt cgctgatgat gttcaagttt atttatgttc tggaaatact attgatattg 3360 aacaaatgtc tgtaaacata aatcatgatt tacgaaaaat atttagttgg tcgtgtatca 3420 accttttacc catcaactcc tcgaaaacaa aggccttgct aatctctaga aaaagaaatc 3480 cacctaatcc ccctgatatt tttattaata atgaaaaaat tgaatttgtt gaacatgcta 3540 gtaacttagg agtaattttt tgtaaagatt tgaattggga tgcgcagatc aattcccaat 3600 gtggtaaagt ttatggcagt ctcaaaaggt tgcatctttc cacaagacat ctggacatcg 3660 agactaaaat taggcttttt aagactcttt tattgccgtt cttcacatac ggtgattttt 3720 tatattcaaa tgctaatgct tacgctctta acaaacttcg tttatgtttc aacgcttgcg 3780 tgcgttatgt ttataatctt cctaggtttt caaggatctc ccacttacat actagcttaa 3840 taggttgttc cttttctcaa ttttataagt atagaacttg tttgatgttt ttcaagatca 3900 ttaacacaat gaaacccgat tatctgtata gcaaattagt gcctttaaga caatctcgtg 3960 caagaaacta ttccatccct aaccacgttt caacttacta tggacaatct tttttcgtta 4020 gaagcattgc taattggaat cagctaccct cttctttgaa atctagttca tcggttacta 4080 tttttaaacg gaacctatta gtgtggctct ccagtacaga gtaatatctc aattgataga 4140 gaacctcagg aattatagtt ttaagttaga ttgtaagaat atgataacat ttgaaatttg 4200 tattttcttt ttgtggcaat ttaaaaggct ttgccttacg ccacattttc attggaataa 4260 ataaaataaa taaaataaat aaaa 4284 // ID hATx-2_HM repbase; DNA; INV; 2889 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hATx-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2889 RA Jurka J.; RT "A distinct, diverse family of hAT transposons from Hydra RT magnipapillata."; RL Repbase Reports 8(12), 1821-1821 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1153..1953,1934..2704) FT /product="hATx-2_HM_1p" FT /translation="MVVKYADTIKETVISKFNNLRSQGMRFSLTFDEWTSI FT KNRRYLNVNVHTEDEFWSLGLARVVGSLPAEKCVDLVQKVLTQFMLNYDED FT IVCITTDGASVMQKVGRLSNCEQQLCIVHGIQLGVQEVLYKKPSTCAKRSL FT KVICNYNNISESDEDISTDDDTNKTNDDEEDFSGGFQVVMSEKDDPIELTD FT DLQPLITKVRTVVKLFRRSPTKNDQTLQKYTVEEFGKGLPLILDSKTRWSS FT LHTMLERFVKLKNCIRKSLIDLESKNLTLNPKIEITDKEFNSILDVVSCLA FT PVKLAVEALCRNDATPLSADTTLMFMINNIGDTDLAVKLKATLVRRINERR FT TTFSSLLHYLHKGHQRYENLDPALSFEHLSKSAIVNXIARLNDRLSQRAYN FT PSTSVSSDSDSVDNTANLSLKEKLDMAILNNKKCNNKERSNVSNNLSKTIR FT NEMVVFEDGGTRGVYLQNSYEYLKTVKPTSVESERGFSASGNIVTKLRSSL FT EDDTLNALCFLRAYFTKEKKVKNEKVL*" XX SQ Sequence 2889 BP; 1008 A; 485 C; 537 G; 857 T; 2 other; agtgcatcca ataccggtag taccgatacc ggaataccgg tattaccgga ctttttaaag 60 taccgaaata ccggtattgg gttagtctaa taccggtatt aaacttccag gcgcaattag 120 caacgttcgt tcagcttgta ataaaattat gaaatcacgc tcgctcaaga gtttgggaga 180 agaaagtgat aagttgattt tttatctaaa attattgata aaaattttaa ttaaagataa 240 aacttgttta atttaaaatt aaaattatgt cttctattaa aagagatttt attatatatt 300 atttttatta tatattattt tattatatat tagcattatg tcactcgctg agcacgtgtt 360 tgtgttttgc actaacgcaa tgaataaaaa gtttgaagtc gttaccgctg atttctataa 420 tggaaattcc attatagaaa tcagcggtta ttccattcca ttccaggaat gtttactatt 480 ttaaccatgc gcacgtggat acgctatgtg aataggaaat ataaaaaata taaaaaacta 540 atgcaaactc aatgcaaata gaaaattact ataagaaaat tgaaacttca gattattttt 600 attaaaaaaa aaattgtgcg cttttaatta agaactaata atcaatattt tatgtctgat 660 gaattcattt actttaaagt ccagaaacac atgaaatgtc ccaaaaggta ttcaaagagt 720 tcagtaccgg cgtacgagtt gacgaaaatt ctgtgtggtt ttattttcta cgtgataaag 780 aaaatcaact tgcaaagtgt aagaaatgca gcaaagagat aaaatcaaat ggtggcagta 840 ccagtggact gcatacacat ctaagaacta accataaaat tgacttactg aagagaaaat 900 tggtccaaga tccagcatcg acatcggttt cgttttcatt ggcaccgttg ccaacggcca 960 aaaaacccaa acctcagatc accgacttct tacaaaatca acacgacaat tccttgcctg 1020 cagttttttc tagattgact gcccttgata gatggactac cttttagcgt atttgtcacg 1080 tcagcagagc tgaaaacatc gctacaagct cgtgggtttg ttgttccaaa atcggcgacg 1140 acaattcgca atatggtagt aaaatacgca gacacaatca aagaaactgt aatatctaag 1200 tttaataacc tccgatcaca agggatgcga ttcagtttga catttgacga atggacatca 1260 attaaaaatc gcagatattt aaatgtaaac gttcacaccg aggacgaatt ttggagcctt 1320 ggattagcta gagttgttgg ttctctgcct gcagaaaaat gcgttgattt agttcaaaaa 1380 gtgttgacac agtttatgct gaactatgat gaagacattg tgtgtattac tactgatgga 1440 gcttccgtaa tgcagaaagt gggacgtttg agtaattgtg aacaacagtt gtgcattgtg 1500 catggcattc aactgggtgt tcaagaagtt ttgtacaaaa aaccgtcaac ttgtgcaaaa 1560 agaagcttga aagtaatctg caattacaac aatatcagtg aaagtgatga ggatattagc 1620 actgacgatg atacaaacaa aaccaatgac gacgaagaag acttttccgg cgggtttcaa 1680 gttgtaatga gtgaaaaaga tgaccctatt gaactgactg atgatctgca gccgttaatc 1740 actaaggtgc gcaccgttgt aaaattgttt cgccgttctc cgacaaaaaa tgatcaaaca 1800 cttcaaaaat acacagtaga ggaatttggt aaaggccttc cgttgatact tgactccaaa 1860 actcgctgga gtagcttaca cacaatgctc gaacgttttg tgaaattgaa aaactgcatt 1920 cgcaagtcgc tgattgacct tgaatccaaa aattgaaata actgataagg agttcaattc 1980 aattttagat gttgtttcat gtcttgcgcc agtaaaattg gctgtggaag cattgtgtcg 2040 aaatgatgca acaccgttat cagccgatac aacactaatg ttcatgataa ataacattgg 2100 agacacagat ttagctgtta agttaaaagc tactttagtg cgacgaatca atgagcgacg 2160 tacaactttt tccagcttac tgcactatct gcacaagggc catcaacgtt acgaaaattt 2220 ggatccagcg ctatctttcg aacaccttag caaatcagct attgtaaata ycattgctcg 2280 actgaatgat cggctcagtc agcgagctta taatccttcc accagygtat cttctgatag 2340 cgattctgtt gacaacactg caaacttgtc attgaaagaa aagcttgaca tggcaatttt 2400 gaataataag aaatgtaaca ataaggaaag atcgaacgtt tcgaataatt tatcaaaaac 2460 catacgcaat gagatggtag ttttcgaaga tggaggaact cgcggtgtat atttgcaaaa 2520 ttcatacgaa tacttgaaaa ctgtaaagcc aacaagtgtc gaatcggagc gtggattttc 2580 agcaagtggc aacattgtca caaaattacg atcatctctc gaagatgata ctctgaatgc 2640 actatgtttc ttacgggcat actttacaaa ggagaaaaaa gtgaaaaatg aaaaagtttt 2700 gtaattttga tataaaataa aatagcattt tgaatcaaag aagttttact atttacttga 2760 gattttattc atattatgat taattaaaat aataaaaata ataccggtat tataccggta 2820 ttaccggtat tgaaataatt cagtaccgaa ataccggtat tgaaaaatgt tatcggtatt 2880 ggatgcact 2889 // ID BEL-14_DPu-I repbase; DNA; INV; 8879 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_DP_; KW BEL-14_DPu-LTR; BEL-14_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-8879 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [7927-8475] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 1849..7065 FT /product="BEL-14_DPu-I_2p" FT /translation="MRERSLCNQLKSAKIVTPFEPFVPSTSGISTAWRSIT FT TATFSATLPNCSQDDFDGERDWIQAVIYDHQAILAKCDDYLAQTNPKPAST FT VSRASSRHSSVSSRQARIHEAERKEREAQLMLQQVEDETRRREEEDAKIRE FT AEDYKRKVESERKQREIRDEIDLQRLSGAIMRQQLNDLTVDDQAAPTRASS FT VVSGLSRVSRVSSHPSIPNNSTFLAQTPAAFTQASTMATTAPITRTMATPR FT TQPSTVVISTAAQTQAMTTPLVGSAARMPTPTRLFSQPAATVPSTPKATLF FT GRLLQAITPGTLTRSASVPHIPLNAGASNVIVTTSITNPMFSTPSFTTAQP FT VTTSQPTSTASNVVTPPTSFFQAPTPIVPPTTTSANAPIPTSAVPQAPTST FT VPPFVPPTSTSVAAPVSTSAAHPFTPVTTATIHVPTTSNVTHVPVSVPLPP FT RSSASVPFPPGSSASAPLPPVSSASASSWFSSWRNVGQPNLFGPRQPAQPP FT SAPPAQPPSAPPAQPPSAPPAQPPSAPPAQPPSLRRPSHRLLRRPHRSAPP FT AQPPSFPPASSASHAWPPNPPPTTSIQMSTVPTSSIPTSATQVTSAPPYST FT SHVPVSLGPSYGPGLSAYNPIVSSAPFGMGSIYGPATTAALPTSSSFTPLT FT SFGHVPTFFPSVSAPPFVPTSLTFASGFPGPSAPPPTTHNPHGVYTPDAWI FT HGLGAAHAAPGRSSIKPPRMKAPSFDGDPRNWPMFIQMFKVFVHDAVSSDA FT ERIAHLHDALTPSIRKDIGGALLNPGLYQHALNELHKRYGNPQIVSQACTE FT SILKLRPFKDNDFNALRSFSADLHSVVATLRLGGYGMELYSHATLSQLVAK FT LPPALKSRWGEKSWAMQPTLASIEDLDQWLDGVAMAEKSIRASSVESSHQR FT PVKPTEEKRRTHKPNVFNITSTTPAKTDAEDKRPRCPGCNSTHQHRLKDCR FT KFKDLPVEKRAKIVKDSNLCLRCLGSGHIGKDCTKPERCSKPECDGTHHSL FT LHGAPRLFPKRTDPKPPTPATFSGSIASRSSGSRILLPIVPIILKTDGKEF FT PLYALLDSGSEISVIKGETANLLNLHGRVERVATRTVDGETKSVDRKIVNF FT SVSSTDGRFSFDISDVHVMETFELNKRSIDLASLSKQWPHLVHVPVYPTTQ FT EDVAILIGHDHPAAIEIFETRKDPFDQRAPRAYLTAFGWCIGGPSGRLDNG FT ASNCYHSSTAERDCDVMLQQFIEADTFGTKPNVTKPIGAEEKRAWEILNST FT TRHTGERYEVGLLWKADDSVVPNNFFSTQRRFTNLEGKFAKNEELAETYRS FT VINTYVNLKHARKLSKEEIDSGPDGRTWYCPHHPVFNPNKPGKCRVVFDLS FT AKYKGVCLNDVLLKGPDLLTNLIGILLRFRQHAVPIVADVEKMFHQVRVRP FT SDGPAFRFIWRDPGDTKPPDIYQMDVHLFGAVSSPAVCANALQQAVKDSKD FT AESLLPQITRHFYVDNWLASFPSAAEAISTAHRLTEALKVGGFPLTQWATS FT NETVRKSLPGIQQEGASINMDLDADPIERTLGLVWDFRRDVFVLGAKADPG FT GRTKRDLLKSIFSIFDPLGFLAPIVFQAKVLMQDIWRHKFDWDDELSQDLI FT DRWIRWAETLPSLSGLVLERCIAPSREDIVATELHVFGDASELGFGAVAYV FT RFLFSDGSASVRFIISKARVAPLKFLTIPRLELNAAVLAARLGAQVRTELD FT IVFDRTLYWTDSTTVLSWI" FT CDS 7531..8877 FT /product="BEL-14_DPu-I_3p" FT /translation="MQSNAFQQEVNDLRAGRQIEEISKIVKLTPYFDHHGL FT MCVGGRLEKAPLPIDVRHPIILPRSERMTELILFKLHRDRGHLSASQLHHE FT ARQQYWIPKGRIAAQRAYHQCHRCRRINAKARNPFMAALPASRLKIGYPAF FT THTGVDYFGPIEVLIFRRTIKRWGVLFTCQTSRCVHLEMAYSMDTSSFIAA FT LDRFQNRRGVPASYHSDNGTNFVGAQRELAECLQNLNQHAIQRHLSRQPSK FT WVFNPPAAPHFGGSWERMVRAAKIALQAVLGNQRLTDEILLTALTLVENIL FT NSRKLTPMSEDATDPECLTPNHLLLGRATPNLPPDVFTKDDLNAKQRWRIS FT QAVADQFWHRWMKEVLPSLTEREKWYREGPNLEVGDIVVIIDPATPRGSWP FT TGRILKTFPGDDGVVRSATIQTKGTERHRPAHHLFPLESVRIREGALRMTK FT RRAG" XX SQ Sequence 8879 BP; 2057 A; 2731 C; 2120 G; 1962 T; 9 other; accagtagac tcgtttacaa atacatcggc ttgacctaac tgacgttccg ttttgtgtct 60 tttccgtccc gtaaccggcg tggtgtggca acatttggtg catcgaccgg gaaatccgaa 120 tcccggcacg gcggccatta ccacgcggtc ggcgttacac tggccatcga aggagaggct 180 gccgatatca acagccatct catcccgctg cccagccgtc gccgacacga aagtcccgcg 240 tggcagaaat tccgttgcag tttgctgatt cttccggccc aagaagaaac tggccagctg 300 gcagtttacg tcgcagtcgt ccagatcttc aacccgaaga acagcaacat ttacaattat 360 tatatttcat ttcttgattg tccgttttgt cctgttcaat ataataattg ctgtgtgtcg 420 tgtgacgaga gttcattggt cgatctcgtc agatcgtccg tggatctccc cccgcgttgt 480 ttgaactcat tggtcgatct cgccagatcg tccgtgagtt ctccctttct ctcccgcccg 540 attgcgttgg tcggttccgc cgaaccgtcc gcaaaatccc gtttcactcc gtcgtcattc 600 gtcaatcgcc gttccagtca gctgtccagt gtccaagtgt ccagtgcagt gtccagagct 660 atccagcccg tcatctggtt tccgtttgct gtctccacgt tccagctgtt ccagcatcac 720 agtttgtccg gcttcatcgc ccgtcagctg acttttcatt cccgctcagc ttgtccggct 780 cgttattttc cgtccagctt gtccgccttt cggcagtgtc agctggtttc atttgcttgt 840 ttgtccggcc gttggctcgc gtcagctgcc tttcccgccc agtctgtccg gctcttccgc 900 tggtcaactg gatttctttc ttttcgttgg ccgtccttac tgtcagccag gctaatcatc 960 cgccttggcc agtgtcgtcc agctgcatag acgtctcgtt tttctcacgt gtccaactcc 1020 tcagctccat cattccgcgc aaattcaaca gcgacgagaa gaagaagaag gacaatcttt 1080 ccaaaggcaa aaggaggaat acaaagaaaa tccaggcgtt acaggaagca gcaatcgcgt 1140 atttcgcgcg cgccgacgcc gccgctgcca tcgaggccgc tgccatcgaa gtcgccgcca 1200 ttgaagctgt tgacgtcgtc gttgaagaac ccgctgacga cgctgccgtt cccttggtaa 1260 taggcattgc cgaccaacat tcgaggccgc gtctcgcgga ttttgtgcct ccgccaactt 1320 cacgccgcgt cgtttaccgt ccttacgacc ggtggccatc accagaggag ccacctacac 1380 caccaggcga gccggaatca gaggaagagg atttcatccc caatctttcc gactgcgagg 1440 tacgaagtga ggtgggccct tcctcggacg acgaagcacc tgagccgacg tcgccgccac 1500 cgccgacacc accgtcgccc cctcgtagag tggtgcaaat taggaacacc aaaaccaata 1560 agacgcgttg cgtcggccta gatcaacttt tccgcccgga atcgcaagac aagatcgtcc 1620 gtcgcaaata attcgccgct tcatctaatt gtaacctcat tgtgatctga cttatcgttt 1680 gtatttcttc actcgtcccg catcaaataa acatcatcat ggctgaatct ktagttccgg 1740 ctctcacttc aatcgccgaa gaggaggaag tagtgtctga tcttgatacg atcaccgatc 1800 ccgctcgtct caaaagattc aggacaaccg caaagatgat ccatacgaat gcgggaaaga 1860 agcttatgca atcagttaaa gtcggcgaag atcgtgacac cgttcgagcc cttcgtgccg 1920 tctacgtccg ggatttcgac agcctggaga agcatcacga ccgctacgtt cagtgcaacg 1980 ctccctaact gctcgcaaga cgatttcgac ggagaaaggg attggatcca ggccgtcatc 2040 tacgaccacc aagcaatctt ggctaagtgt gacgactatt tggctcaaac taatcccaaa 2100 cccgcctcca cggtttcaag agcctcatca cggcattcca gcgtctcctc acgtcaagct 2160 agaattcacg aagctgaacg aaaggagagg gaagcccagt tgatgctcca acaagtggaa 2220 gacgagacca gacgcagaga agaagaagat gcaaaaatcc gagaagcgga agattacaaa 2280 cgaaaggtgg agagtgagag aaaacaacgc gagatacgtg acgagataga tcttcagcgt 2340 ttgagcggcg caatcatgcg gcaacaactc aacgatctta ccgtcgacga tcaggcagca 2400 ccaacacgcg cttcatccgt cgtgagtggt ttatcgagag ttagccgtgt ttcatcgcac 2460 ccttccatcc cgaataattc gacctttctc gctcaaactc cagctgcgtt tacacaagcg 2520 tcgaccatgg caaccacggc tccgataact cgaaccatgg ccactccgcg gacgcagcca 2580 agcaccgtcg tgatttcaac ggcggctcaa actcaagcga tgaccacacc gttagtcgga 2640 tctgcggcta ggatgccaac accaacgaga ctcttcagcc aaccggcagc gacggtgccg 2700 tcaacaccaa aggcgaccct tttcggcaga ctacttcaag ctatcacgcc cggcacactt 2760 acacgttccg catccgtgcc gcatattccc ttgaatgcag gcgcgtcgaa tgtcatcgtt 2820 acaaccagta taacgaatcc gatgttttcc acaccgtcgt ttactacagc acagccggtc 2880 acgacgagtc agcccacgtc aacggcttca aacgtcgtta ctccgccaac gtcgttcttc 2940 caggctccta caccgatcgt tccgccgact acgacatcgg caaacgctcc aattcctacg 3000 tcggcagttc ctcaggcccc tacatcgact gtaccaccgt tcgtcccacc gacttccaca 3060 tcagtcgccg ctccggtctc tacatcagcg gctcatccgt tcacgccggt gactacagcc 3120 accatccacg tcccgacgac cagtaatgtg acgcacgttc cagtgtcggt cccgttgcca 3180 cccaggtcct cagcttcggt cccatttccg cccggatctt cagcttcggc tccgttgcca 3240 cccgtgtctt cagcaagcgc ttcgtcgtgg ttttcatcgt ggaggaacgt tggtcaaccg 3300 aatttattcg ggccaaggca accggcccag ccaccgtctg ctccgccggc ccagccaccg 3360 tctgctccgc cggcccagcc accgtctgct ccgccggccc agccaccgtc tgctccgccg 3420 gcccagccac cgtcgctccg ccggcccagc caccgtctgc tccgccggcc ccaccgttcc 3480 gctccgccag ctcaaccacc gtctttcccg ccagcgtcta gcgcaagcca cgcctggccc 3540 ccaaatccgc cgccgaccac ttcaattcag atgagcacgg tcccgacgag ttcaattccg 3600 acaagtgcaa cgcaagtcac ctccgcaccg ccttacagca ctagtcatgt gccggtcagc 3660 ctaggtccgt catacggtcc gggtttgtcc gcctacaacc cgattgtgtc ttcggctcca 3720 tttgggatgg gttccattta cgggccggcc accacggccg ccctcccgac ttcatcgtca 3780 tttacgccgt taaccagttt tggacacgtc ccgacgttct ttccaagcgt ctcggcacca 3840 ccattcgtgc caaccagcct gacgttcgcg tccggtttcc ctggaccttc cgctcctcca 3900 ccgacaaccc acaacccgca cggagtttat acgccagacg cctggatcca cggcctcggt 3960 gccgcccacg ctgcgccagg ccgttcgagt atcaaaccac cgcggatgaa ggcgccgagc 4020 tttgatggag accctcgaaa ttggccaatg tttattcaga tgtttaaggt cttcgtccac 4080 gatgccgtta gttccgacgc ggaacgcatc gctcatcttc acgacgccct tactccatcg 4140 attagaaagg acattggtgg agcgttgttg aatcccggtc tttatcaaca cgctctcaac 4200 gagcttcaca aacgatacgg caatccgcaa atcgtctctc aggcttgtac cgaatcaatc 4260 ctgaaactcc ggcccttcaa agacaacgat tttaatgccc tccgctcctt ctccgcggat 4320 cttcactccg tcgtggcgac tttacgactc ggggggtatg gaatggagct ctacagtcac 4380 gctactctct cgcagctggt tgccaagctc ccgccggcac tgaaaagtcg atggggtgag 4440 aagagttggg cgatgcaacc cacgctcgcg tctatcgaag acttagacca atggctcgac 4500 ggagttgcta tggccgaaaa atccatccgg gccagttcag tggaatcgtc tcatcagcgg 4560 cccgtaaaac cgactgagga gaagcgacgt acgcacaaac cgaacgtttt caacattact 4620 tccacgactc ccgcaaaaac tgacgccgaa gacaaacgcc cgagatgccc aggatgcaac 4680 agcacgcacc agcatcggct aaaggactgc agaaagttca aggatttgcc tgtcgaaaaa 4740 cgagccaaga tagtgaaaga ttcgaatctt tgtcttcgct gtttaggcag cggtcatatc 4800 ggcaaagact gtacaaagcc cgaacgctgc tccaaacccg aatgcgatgg gactcaccac 4860 tcgctactgc atggagctcc acggctcttt cctaaaagga cggatcccaa gccaccaact 4920 ccagcgacat tctccggttc aatcgcatcg agatcgtccg gaagccgaat tttactgccg 4980 atcgtcccga tcatcctaaa gacagatggg aaagagttcc cgctctacgc gcttcttgac 5040 tccggaagtg aaatttccgt gataaagggg gagacggcga accttctcaa tttacatggc 5100 agagtggaga gggtagcgac gagaacggtc gatggcgaga cgaaatctgt cgaccgaaag 5160 atcgtcaatt tcagtgtttc ttcgacggac ggtcgtttct ctttcgacat ctcagatgtt 5220 cacgtcatgg aaaccttcga actgaacaag cgatcaatcg accttgctag cctgagcaaa 5280 cagtggcctc atctcgttca tgttcccgtt tatccgacca cgcaagaaga tgtcgccatc 5340 ctgatcggcc acgatcatcc agcagccatc gaaatcttcg agacccgaaa ggaccccttc 5400 gaccaacgag ctcctcgcgc ttatctaaca gctttcggtt ggtgcatagg cggtccaagc 5460 ggtcgactcg ataacggagc aagcaactgc tatcactcca gcacggccga aagggattgt 5520 gacgtcatgc tgcagcaatt catcgaagcc gacacattcg ggacgaagcc gaacgtcaca 5580 aaaccaattg gtgcggaaga gaagcgagcg tgggagatcc tgaacagcac taccagacac 5640 actggcgaaa ggtacgaggt gggtctgctc tggaaagcgg atgattccgt tgtcccaaac 5700 aacttctttt ccacccaacg gcgatttact aacctcgaag gtaaattcgc taaaaacgaa 5760 gaactcgccg agacttaccg gtccgtcatc aacacctacg tcaacctgaa gcatgcccga 5820 aaattatcga aggaagaaat tgactctggt ccagatgggc gaacctggta ttgcccgcat 5880 catcccgtct tcaatccgaa caagccggga aaatgcagag tcgttttcga cctgtccgcc 5940 aagtataaag gagtgtgcct caacgacgtt cttttgaaag gtcctgatct tctgaccaat 6000 ctcatcggca ttcttctccg attccgacag catgcagttc cgatcgtcgc cgacgtggaa 6060 aagatgttcc atcaagtgcg agtgagacct tccgacgggc cggctttccg cttcatctgg 6120 cgtgatccag gtgatactaa acctccggat atctaccaga tggacgtaca tctcttcggt 6180 gctgtctctt caccagccgt ctgtgccaac gcactacaac aagccgtcaa ggacagcaaa 6240 gatgcggaat cgctgctacc ccaaatcact cgacatttct acgtggataa ttggctagcg 6300 tcctttcctt ccgccgccga agctatctcg actgcccacc gattgactga agctctgaag 6360 gttggaggtt tcccattgac gcagtgggcc acttccaacg aaaccgtaag gaagtcactt 6420 ccagggattc agcaggaagg agcatctatc aatatggact tggacgccga tccgatcgaa 6480 cgaactctcg gtttagtctg ggatttccgt cgcgacgtct tcgtcctagg tgcaaaggcg 6540 gatcctggtg gaagaacaaa acgggatctt ctaaagtcta tcttcagcat cttcgacccc 6600 ctcggctttc tagcgccgat agttttccaa gcgaaggtcc tcatgcaaga tatatggcgc 6660 cacaaattcg attgggacga cgagctgagc caagatctta ttgaccgctg gatccgctgg 6720 gcagaaactt tgccctcact ctccggactc gtcttagaac gatgcatcgc tcccagccga 6780 gaagacatcg tagcgaccga actgcacgtt tttggggacg cctcagaact gggcttcggc 6840 gctgtcgcat acgtccgatt tttgttcagc gatggatcag ccagtgtccg gttcatcatc 6900 tcgaaagcca gagtcgctcc gcttaaattc ctgacaatcc cacgtctaga acttaatgct 6960 gccgttcttg ccgcgcgtct cggcgcacaa gtgcgtacgg agctkgacat cgtctttgac 7020 cgaactcttt attggaccga ctcsacaacc gtcctcagct ggatcamttc ccgtaattgc 7080 cgtttcaaca attacgtcgg aaatcgtgtt ggcgaaatat tcgaaagctc aacgcccgmt 7140 saatggagtt acgtcccaag cgccagcaat cccgccgacg acgccagccg tggtttggat 7200 ccttccgagt tcaccgttga ccatcgttgg ttctccggtc caccgtttct tcaaggtctc 7260 gacgaatggc caaaactcaa agttctcccg ccggtggaag maagcgaccc agagatccgc 7320 gaaacgtcct gggtcgggct tgtacagagg gagagcgacg aaatcgatct tctcatcgaa 7380 cgaaaatccc ggcctttgat catcttcaac accgtcgcct atatttatcg cttcatccgg 7440 aacgcccgtc aaaaggaccg taaccagcgt tcaataggcc agctctcggt cgaagaaatc 7500 gaagttggma aagcgttcat tctccgccgg atgcaatcta atgcgtttca acaggaggtg 7560 aacgacctac gggccggccg ccaaatcgaa gagatttcca agatcgtcaa gttgactcct 7620 tacttcgacc accacggcct catgtgtgtt ggcgggcgac ttgaaaaggc gcctcttccg 7680 atcgacgttc gccacccaat cattctacct cgttccgagm ggatgaccga gcttattctc 7740 ttcaaactcc accgtgatcg cggccatctc tcggccagcc agcttcatca cgaggcgcgt 7800 cagcaatatt ggattccgaa gggccgaatc gccgcccaac gagcttacca ccagtgtcac 7860 cgttgcagaa ggataaacgc gaaagcaagg aatccattca tggctgctct acccgcctcc 7920 cgattaaaaa taggctaccc cgcgtttacc cacaccggtg tagattattt cggccccatc 7980 gaggtgttaa ttttccgtcg caccatcaaa cgttggggtg ttctatttac atgccagaca 8040 tcaagatgcg tccacctcga aatggcttat tcaatggata caagttcctt catcgctgcg 8100 ctcgaccgtt tccaaaatcg tcgcggcgtc ccagcgtcct atcacagcga caacgggact 8160 aacttcgtcg gagcgcaacg agagctggcc gaatgcctcc agaacttgaa ccaacatgcc 8220 atccagcggc acctgagtcg tcaaccgtca aaatgggttt tcaatccccc agctgctcca 8280 catttcgggg gtagttggga acggatggtt cgagccgcca agatcgcgtt acaagcggta 8340 ctgggaaatc aacgcctcac cgacgaaatc ctattgaccg ccctcacgct cgtcgagaac 8400 atcctcaata gtcgcaagtt gaccccaatg agcgaggacg cgaccgatcc agaatgtctg 8460 accccaaacc atctattgct gggccgtgca actccaaatc ttccgcctga cgtcttcacc 8520 aaggatgatt tgaacgccaa gcagagatgg cggatctcgc aggctgttgc ggatcaattc 8580 tggcatcgtt ggatgaagga ggttctaccg agcctcaccg agcgcgagaa atggtaccga 8640 gaaggcccaa atctcgaagt aggagacatc gtcgtcatca tcgatccagc tacacctcgt 8700 ggatcgtggc cgactggaag gatcctcaaa actttccctg gcgatgacgg agtggtccgc 8760 tccgcaacca tccagacgaa ggggactgaa cgccatcgac cagctcacca cttgtttccc 8820 ctggaatccg tccggatccg agaaggtgct cttcgtatga cgaagcgcag ggccggcga 8879 // ID Gypsy-26_AA-LTR repbase; DNA; INV; 192 BP. XX AC supercont1.85; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_AA_; KW Gypsy-26_AA-I; Gypsy-26_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.85; Positions 1513349 1513158. XX SQ Sequence 192 BP; 50 A; 45 C; 43 G; 54 T; 0 other; tgtagtatat tacacaacct tgctactatg agtaccatac cataagtatc attaactttg 60 cgctcgtcgt ccatgatgct gctgtcacac acactgataa acaggattcg gtacccaggt 120 cggtgtgtga tctcgttcgt gggaagagac actttttgtt agactacggg cacgacgaga 180 cgcattgtca ca 192 // ID BEL-35_CQ-I repbase; DNA; INV; 2976 BP. XX AC AAWU01003719; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-35_CQ_; KW BEL-35_CQ-LTR; BEL-35_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2976 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 223-223 (2011). XX DR GenBank; AAWU01003719; Positions 68176 71151. XX CC 'AACAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 334..2976 FT /product="BEL-35_CQ-I_1p" FT /translation="MWRSLLKLHTEPESEEEEVEEKKPVVEVLVPSPVKPL FT TPKKLWKKKKAKMADSEIKALSKKRSLVKGKLTRVLHAVQPSAQPGAQADA FT QPREADVAPPQLRVHQRNVEKYYGECHEIHDKILDLVETDEVSKQQEEKWL FT EFEQLYNRTLVALEMLLTAHERPAQVVLPAVQAAQQVIIHQQALRAPLPTF FT DGRYENWSKFKAMFQDLMRNSSDSDAVKLYHLDKALVGEAEGKIDLRTIQD FT NNYQGAWKELEEQYENTRLIIDLHIQGILQLKKMDKRSSTELRDVVERCSR FT HVEGLRFHKQELLGVSELIVVNILAAALDRETRELWEATIEKGELPTYQAT FT FDFLRKRCHILERCELSDPEASAVKPTVSKMPSATKPSVNVFTAATTTSEV FT VCELCGGNHLNYKCSAFCSMSVPQRLEKVREARACYNCLRTGHLVRKCPSE FT RTCKCGERHHNLLHKDSPKEQASPKSEAAPKPPPATEQATNGSSNDNADAS FT EEQTTSCCNSGLLQTSRPVLLQTAVISVVDKHGRLHPCRALLDSGSQAHIL FT SAAMARKLELPLTKCNVMLIGANAVKTPARSSVILNLSSRYTNFRDSISCL FT ISEKPTGIIPSVRIDTSRWKIPSGVRLADPQFFEPNDIDLVIASNYVWDLL FT RSDKVKLLNGAVTLRETDLGWIVTGTYDPLDQVSLPIVHSNNVRLDILEDA FT VEELVSVEELTDSTPPSAENFNKLVSPTHRSDESSRCVIQLPFRNTVDDQS FT QALRRFTQLERSRSDPPGSKSQYSVITDNPAWADPLPEWMMMEWLECPRSL FT NELKDITTPLRGVESDAVRSEDKLPPKTVELIDDPAPSCGGLKLAIVEIPS FT GERTSYSGSFVTKTCGLKEFQLRSRAGE" XX SQ Sequence 2976 BP; 767 A; 804 C; 850 G; 555 T; 0 other; tttggtccat tcgtaccgga catttggaca ctcgaccgtc ggatccgcga gtgaagtgtt 60 cagtggattc cggaaaatcg tgcgagcgga agaacacaac cggaagaaga tttcgagtgt 120 gtgatctgga agtgagaaaa gatccttccg gaaggcgact ccgtggcctg caacagcgga 180 aacagtggcg gcaagagccc gcaaaagaag tgccagaaag tttcgtggag tgagtgagca 240 gtgaaaaaac gcagcgacgc cattttgtag tggttcccac gagtggaaca atccgtagag 300 tgcaggcggc acaagcgttg ctggtctgac tctatgtggc ggtcgctcct gaagctgcac 360 acggaaccgg aaagcgaaga agaagaagtt gaggagaaga aacccgtcgt cgaagtgcta 420 gtgccgtccc cagtgaaacc gctgacgccg aagaagctgt ggaagaaaaa gaaggcaaag 480 atggcggaca gtgaaatcaa ggcgctttcg aagaagcgaa gcttggtgaa gggaaagctc 540 acgcgagttt tgcatgccgt gcaaccaagc gcccagccag gtgcccaagc agacgcccaa 600 ccacgggagg ccgacgtggc cccgccgcag ctgagagtcc atcagcgcaa cgtggagaaa 660 tactacggtg agtgtcacga aattcacgac aagattttgg acctggtgga gacagatgag 720 gtgtccaaac agcaggagga aaagtggttg gagttcgagc agctctacaa ccgaacgctg 780 gtcgcgttgg aaatgctgct gaccgcgcat gaaagacctg cccaggttgt gcttccggct 840 gtccaagccg cacaacaggt cattatccac cagcaagcgt tgcgtgctcc actcccaacg 900 ttcgacggac gttacgagaa ctggtcaaag ttcaaggcca tgttccagga cttgatgcgc 960 aattcgtccg attccgatgc cgtcaagctg taccacctgg acaaggcgct cgttggtgaa 1020 gcggaaggga agatcgactt acgaaccatc caggataata actaccaggg agcgtggaag 1080 gaactggagg agcagtacga gaacacgcgg ctgatcatcg atctccacat ccaaggcatc 1140 ttgcagctga aaaagatgga caagcggtca tcgacggagt tgcgagatgt cgtcgagcgg 1200 tgttccaggc acgtcgaagg tctacgcttc cacaagcaag agctgttggg agtgtcggag 1260 ctcatcgtgg tcaacatcct cgcagctgcc ctggaccgcg agacacgaga gctctgggaa 1320 gcgacgattg agaagggtga gctaccgaca tatcaagcga cctttgactt cttgaggaag 1380 cgatgccaca ttctcgagcg atgtgagctg tccgacccag aggcatctgc tgtcaagccg 1440 acagtctcta agatgccatc tgccacgaaa ccttcggtga atgtgttcac tgcggccacg 1500 accacaagcg aggtggtgtg cgagctttgc ggtggcaacc atctgaatta caagtgcagt 1560 gccttctgta gcatgtcagt cccacagagg ctggagaagg tgagagaggc acgagcttgc 1620 tacaactgcc tgagaaccgg ccatctggtc aggaagtgtc cctccgagag gacgtgtaag 1680 tgtggtgaga ggcatcacaa cctactccac aaggactcac cgaaagaaca agcatcgccg 1740 aagtctgagg ctgcacccaa gcctccacca gccactgaac aagcgaccaa tggttccagc 1800 aacgataacg ccgacgccag cgaggagcag acgacgtcgt gctgtaacag cgggctgctg 1860 caaacctccc gacctgtgct gttgcagaca gcagtcatta gcgtggtgga caagcacggt 1920 cgactgcacc cgtgtcgcgc actcctggat tccggatcgc aagcccacat cctttcagcg 1980 gcgatggcac gtaagctcga gctaccgctg acgaagtgca acgtgatgct gattggagca 2040 aacgccgtca agacaccggc gaggagcagc gtcatcctga acctctcttc aagatacacc 2100 aacttccggg atagcatttc ctgtcttatc tcggagaagc caaccgggat tattccatcc 2160 gtgagaattg acacgtctag atggaaaatt ccaagtggcg ttcgcctcgc agatccgcag 2220 ttcttcgaac ccaacgacat tgacctggtg attgcgtcca actacgtgtg ggatctgctg 2280 aggtcggaca aggtgaagct gctcaacggc gccgtcacgc tccgagagac agatctggga 2340 tggatagtca cgggcaccta tgacccgctc gaccaggtga gtcttcctat cgtccattcc 2400 aataatgtgc gcctggatat cctggaagat gccgtagaag aattagtatc agttgaagag 2460 ctgacagact caacaccccc ctccgccgag aatttcaaca agcttgtctc accaacccac 2520 cgcagcgacg agagcagccg ctgcgttatt caacttccgt tcagaaatac cgtcgacgac 2580 cagtcccaag ccttgagaag attcacccag cttgaacgca gtcgttccga tccacctggc 2640 tcgaaatcac agtactctgt catcaccgac aaccccgcct gggcagaccc actgccagaa 2700 tggatgatga tggagtggct cgaatgcccg agatcgttga acgagctcaa agacatcacc 2760 accccgctcc gtggtgttga gtcggacgcc gtccgtagcg aagacaagct tccaccaaaa 2820 accgttgagc tgatcgacga tcctgcaccg tcctgcggcg gcttaaagct ggccatcgtc 2880 gaaataccat ccggggagcg aaccagttac tccgggtcat ttgtgacgaa gacttgcggt 2940 ttgaaggagt ttcagcttcg cagccgggcg ggagaa 2976 // ID BEL-201_AA-LTR repbase; DNA; INV; 230 BP. XX AC AAGE02028917; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-201_AA_; KW BEL-201_AA-I; BEL-201_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-230 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028917; Positions 35589 35818. XX SQ Sequence 230 BP; 82 A; 35 C; 48 G; 65 T; 0 other; tgtcatggca acactgacga ggacggtcct gcatctgact ctggtctaag ttgcggagaa 60 atgtcattcg ataagagtga taagaaagcg gtggcaaaaa cagaaaagat aaaatcgaaa 120 acagttagtg gcaaacgtgg ttcatcagaa attatctaaa acagtgatta tattactact 180 ataagatagt ttattcttac ttaacgtttg atatttacag gcttaattca 230 // ID Gypsy-59_AA-LTR repbase; DNA; INV; 234 BP. XX AC supercont1.119; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-59_AA_; KW Gypsy-59_AA-I; Gypsy-59_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.119; Positions 1702753 1702986. XX SQ Sequence 234 BP; 75 A; 33 C; 73 G; 53 T; 0 other; tgtaacggat gaagatatca taccttgggg aacaatgact ttgattcgaa tgagatttag 60 taacgggttt acttggcgta tgaatgaaag acggaataaa agataaagtg gtagaaccgg 120 gatgaccgac aagagctgga tcagtcgtgt tcggacggtg atgacgatcg gttgacaagc 180 gagtcgtgga gtgagcaatc agacggagaa gagattcgga cgcggaattt taca 234 // ID DNAX-8B_AP repbase; DNA; INV; 152 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-8B_AP. XX NM DNAX-8B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-152 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2062-2062 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC tsd TA or TATA. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 152 BP; 36 A; 46 C; 42 G; 28 T; 0 other; ctccggccca cgagagaagt aactcgtgtc ccgaccaaat cgcgtccgaa tccgcgcatc 60 tccacgtatt tgacatacag cgaaaggcac taagtggggg aattccgcca gtcagcatgc 120 gcgaaacggg ttacttctct cgtgggccgg ag 152 // ID Sola3-1_AA repbase; DNA; INV; 6027 BP. XX AC AAGE02019464.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6027 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(1279..3297,3317..3601,3605..4330) FT /product="Sola3-1_AA_1p" FT /translation="MAEMAKCALENISECGQYLPKSDDKYYSAEELASTKV FT IDLFPKLANDKFFFDYSEKWLIEHRSGIKLQDKDSICAKHRYSFGEKWKPL FT KRCYHPDHLQNPTSAKRMKVLGVRPCKSHHFKYISKHDPTFPLFAPLCRLH FT EKTEDISVKADETDTPENDEVSFDQPSDLKLLNSVLSLLGVPNIRATIDTS FT IENLSKKFLDSKRQKIEFALEKIKNLFVTIAPGQEEKYKKLLSSHEDDNST FT RQGELKAYRKLYDAETSDNRKLRVLSNIDKERFSRREIEDQLGCSRSSTSK FT ARKMQTSNESSNEANNSVSKQRLDIEKCKHFLLWLYESGSLGDLAYGTTMV FT RYSSGYKMHIPKPILTGLRSHTIANYIDFCHNTNTDALSRTSLFKILNLLK FT PSQRHSLAGLDNHMVAGTEGFEVLKRLCEKFFEDDKGILSDLTAIQRYIKG FT PFANNCSKNSKTASHCTVFGLSDTKNDHFAQKCDHDHSQVCTSCAKIFQLF FT ETISDKISIENDQNVKAEMMYDYENSKKAVLKWQQHILRHVQQNQAKLDAF FT SDIQTDHTKAVWIRDWGQKVLPMKYREEQSSYFGKKGMSIHFDVFIFIDVE FT TKKFNKAVFVTLLDNCKQNAVETLALIPGILAKFRSLYPDICSLFIRINNA FT GCYSGKQILHNCMHVLCEGVFVCVLIISFSQFMLLANGVAESTFQLCKEAG FT IKLLRLDHNEPQKGKDQCDREAAVCKSYIRAYVNASNNTTTAADLKKGIMF FT QGGPRNTYVGVLKIRDSMSGVKNREGVQAIHSISFEFPFMTFWRYYDIGTG FT VKHRIQNINFNFNVDVISSYDKTANSQLGLGTPSMKTQILDASKQSSIPKN FT STTDAIYNFVSNLITTNDTRLQGQNDSQTAEENCEVVSEEFCTYFSKGWGV FT GSRENVRFDVRVKQFMLELFSLGEKTGKKISPEDAHRAMRSKLDRITGIKE FT FQPKSYISVEQIKSLFARFSTLHRQNKLVLCDVSEDSFLNVEEHHEYKLVI FT STI*" XX SQ Sequence 6027 BP; 2036 A; 1060 C; 1095 G; 1836 T; 0 other; gagccagttc gcgagagccg caaccccatc ttgtatcgat gttctccgat caggctgaaa 60 ttttcagggt ttgttcttct acataataga atatatttcg caaaatttca gaattttatt 120 ttaggggaaa gtggggcaaa aaaatcccca atgatttcat gtcgcaaaac atgccaaatc 180 aataaaaatg caataactac agtaaaaatg ctcggattta tttgaaattt ggcatgatta 240 ctctttgtta tatcatctta aatatgggca taaaatttgg tttaaaaaaa aattctttat 300 cgagaaaaac ctgtttttca ttaaaaaaat gatgattttc aaatcatttt ttttcgtgcc 360 aaccgaacta ggtcaaaaaa ccaaatatga cgttttgtag gccgtctcaa gggctttcat 420 ttgagatgtg atctaaagtt gccgtttcaa aaacttccga aaattttttt ttttcggggt 480 ggtgttttat tttaacccgc tagtccagtc cgcgagaatc gcaatatcat attgtaccga 540 ttttctccga tcaggctgaa attttcaagg tttgttcttc taccaaatag aagatatttc 600 acaaaatttc agatttccac attaggggaa agtgaggcaa aaatatccct aacgatttcg 660 tatcactata gtaaaaatgc tcggatgttg tttaaatttg gcatgattac tctacaaaat 720 ataataaaat aagaataata ataagataat aatagtatat tataacaaaa ggaataataa 780 taacaacaat aataataata atatgatgat aataataata attattatta taacgataat 840 aatatatcag taaaagtcaa aatcccatac aaagtcacaa cagtgcatta tatgtgcaat 900 cagtctaaac gaagctaaac tcaattattc acaaaatgat cgctaggaaa cataccatca 960 ataaaacaca ctcacacatg ggactatccg agtcaaactc atctaaacat aggcttggac 1020 caactttcaa aagaatgcgg gagttggtat ccaaccaatt agcgtgagaa gatacaacgc 1080 ttacttacaa cttaactaat aaacaaattg gtgcaatgct ctattttttg ctcgggagtg 1140 acatttgttt cggatatggc agcggcaaca tggtatgata caatcggtat cacaactact 1200 aacaaggccg ctaagtaagc tgtcagtttg caacaaatct tcgtctgtga attagtgttc 1260 aatttcaatt tgattaaaat ggccgaaatg gcaaagtgcg cgttggaaaa tatttctgag 1320 tgtggtcaat atttgcccaa atcagatgac aaatattaca gtgctgaaga gttagcttca 1380 accaaagtaa tcgatctgtt tcctaaactc gcaaatgata agtttttctt tgattactcc 1440 gaaaaatggt tgattgagca tcgttccggt attaagcttc aagacaagga ctccatatgc 1500 gcgaagcaca ggtacagttt cggggagaaa tggaagccat tgaaaagatg ttaccacccg 1560 gatcatttgc aaaatccaac atctgcaaag agaatgaagg ttttgggtgt acgcccgtgt 1620 aaatcccatc actttaaata tataagtaag catgacccga cttttcctct ttttgcccca 1680 ctttgtcggc tacacgagaa aacagaagat atttctgtaa aggccgatga aacagacact 1740 cccgagaatg atgaagtgtc ttttgatcag ccatctgatc tgaagttatt gaactccgtt 1800 cttagcctgc tgggtgttcc aaacataagg gctaccatcg acacttccat tgaaaatttg 1860 tcgaagaagt ttctagattc taagcgacaa aagatcgagt ttgccctcga aaagataaag 1920 aacctgtttg taacaattgc ccctggacag gaagaaaaat ataaaaagct tctttcttcc 1980 catgaagatg ataacagtac caggcagggt gagctgaaag cgtatcgcaa attatacgat 2040 gcagaaacgt ctgataatcg gaaattaaga gtactttcaa acattgataa ggaaagattc 2100 tcacgtcgtg aaatcgagga tcagttgggc tgctcacgat ctagtacttc aaaagctcgt 2160 aaaatgcaaa caagcaatga gtcgtctaat gaagccaata atagcgtatc aaaacagcgt 2220 ctagacatcg aaaaatgcaa acatttcctg ctttggctgt atgaaagtgg ctccttgggt 2280 gatctggcct acggaacaac catggttcgt tatagttctg ggtacaaaat gcacatcccg 2340 aaacctatat tgactggtct tagaagtcac acgattgcca attatatcga tttttgccat 2400 aatacaaaca ctgacgcatt gagtcgcact agtttgttca aaattttgaa cttactgaaa 2460 ccatctcaaa gacacagcct tgctggtctt gataaccata tggtagctgg aacagagggc 2520 tttgaagttc taaaacgctt atgcgaaaaa tttttcgaag atgataaagg cattctatcg 2580 gatctaacag ctattcaacg gtacataaaa ggtccatttg ctaacaactg cagcaaaaac 2640 tcaaagacag cgagtcattg tacggtattc gggttatctg atacgaaaaa cgatcatttt 2700 gctcagaaat gtgatcatga tcattctcaa gtgtgtactt cttgtgctaa aatattccaa 2760 ctgtttgaaa ccatatcgga taaaataagt attgaaaatg atcagaacgt gaaagcggaa 2820 atgatgtatg attacgaaaa ttccaaaaag gctgttttga aatggcagca acacatcttg 2880 agacacgttc agcaaaacca agccaaacta gatgcatttt ccgatattca gacagaccat 2940 accaaagctg tatggattag agattgggga caaaaagtgc taccaatgaa ataccgtgaa 3000 gagcaatcgt catactttgg caaaaaaggg atgagtatac attttgatgt attcattttc 3060 atcgacgtgg aaacgaaaaa gttcaacaag gcagtttttg tgacgcttct ggacaattgc 3120 aagcaaaatg ctgtagaaac tctggcattg attcctggaa tactagctaa atttcgtagt 3180 ttgtatccag acatttgttc gctttttatt agaataaaca atgctggttg ttattcaggt 3240 aagcaaatat tgcataattg tatgcatgtg ttatgtgagg gcgtttttgt ttgtgtatga 3300 atgcaaaacc atttgactta taataagttt ttcacaattc atgttactag ccaatggagt 3360 agctgaatcg acctttcagt tatgtaaaga agcaggaatc aaacttctgc ggttagatca 3420 caatgaacca cagaaaggaa aagaccaatg cgatagagag gcagccgttt gcaaatccta 3480 catccgggca tacgtgaatg cttctaacaa tactacaact gcagcagacc tgaagaaggg 3540 gattatgttc cagggaggac cccggaatac atatgttgga gtactgaaaa tccgggatag 3600 ttaaatgtca ggagtgaaga atagagaagg cgtccaggct atccactcca tctcattcga 3660 attcccattt atgacctttt ggcgatatta cgatattgga actggcgtca agcatagaat 3720 ccaaaacatc aattttaatt ttaatgttga tgttatttca tcttacgata aaacagccaa 3780 ctctcaacta ggactcggaa ctccttcgat gaaaacgcaa attctagatg cttctaagca 3840 atcaagtatt ccgaagaatt caacaactga tgctatttat aactttgtat caaatttgat 3900 aacgacaaat gatacaagat tgcaaggtca aaatgactca caaacagcag aggagaactg 3960 tgaagttgtt tcagaagaat tttgtacata tttcagcaaa ggttggggag ttggctctag 4020 agaaaatgtt cgcttcgatg tgcgtgtaaa gcaattcatg ctggaactgt tttcccttgg 4080 cgaaaaaacg ggaaagaaaa tttctccgga agatgctcat agggcaatga gaagtaagct 4140 tgaccggatt actggaatca aagagttcca acctaagtct tacatttccg tggagcaaat 4200 aaaatcatta tttgcgagat tttcaacgct tcacaggcag aacaagctcg tgttatgtga 4260 cgtaagcgag gacagttttt tgaatgttga agaacatcat gaatacaaat tggtaatttc 4320 cactatttag ttttattttt attttatttt attttttttt ttcgtttatg tgttacagat 4380 ggagttgcag caagaaggtg aagactttct agaaaatgcg ctagagaaaa tcaacgaaga 4440 gtacgatgat tggtctgacg atgacaatga atcgaataaa taaaccgtaa ccatagcatt 4500 tagaagttat ttttctaatt ctacaataaa ttagtttttc agatttctct gtgaaatacg 4560 aaagatatat taaaagtttt aaaagtaaat ttatttgtat tacttcataa gtaaattcat 4620 gtattctttg aagcattttt gtaagtgtat ctctctagaa aaaaaaaata taagggatta 4680 tttttcccta cttctccctt actttcttta tgaattgttt attattgttc aattcattat 4740 ttattcattg tttttttttt ttgcgggact tatagcattt atttttgtat tctcaacatt 4800 catttgtttg gagccatttt accccacttt ccctttacgc aaaaacatag aaacaatcaa 4860 aaagtattgt tatatataga agaacaaacc ctgattaaaa acaaacacca cccaaaaaaa 4920 aatcatattt tttaaggcta tgcagcccgt tattcgtttt ggcaacaatg atgacttttc 4980 aacttgcatt tcaaagtgag aaaactcagt cttgatagtt tatattgact tgaaaaagta 5040 tcactgtacg cgcttacatg cataaagtat gctgatactt tttcagctgt gtcagtgcaa 5100 aaccaactga ttttctttga ttcggaatcg tgagatgaat aagcaacaat catcaacgac 5160 gcgtacaaat ttcaatgacg gcctacttcg ccttaaacgg catctgtaga tcacatcaga 5220 ttctcaaatg aaagcccttg agtcgaccta caaaacgtca tatctggttt tttgactgag 5280 ttcggttagt tataaccgat ttgaaaatca tcatttattt tttttataaa aattggcatg 5340 tatttggcat gtttcgcgat atgaaatctt tggggatttt tttgccccac tttcccctta 5400 tataaaattc tgaaattctg cgaaatatat tcttttatgt agaaaaacaa actataaaaa 5460 aatcagcccg atcgaagaac atcgatgcaa gatggggacg tggctctcac gaactggctt 5520 ttaaaatcaa acaccacccc gaaaaaaaaa aaattttcgg aagtttttga aacggcaact 5580 ttagatcaca tctcaaatga aagcccttga gacggcctac aaaacgtcat atttggtttt 5640 ttgacctagt tcggttggca cgaaaaaaat gatttgaaaa tcatcatttt tttttaatga 5700 aaaacaggtt tttctcgatt aagaattttt tttaaaccaa attttatgct catatttaag 5760 atgatataac aatgagtaat catgccaaat ttcaaataaa tccgagcatt tttactacag 5820 ttatagcagt tttattgatt tgacatgttt tgcgacacga aatctttggg gatatttttg 5880 ccccactttc ccctaaaata aaattctgaa attttgcgaa atatattctt ttatgtagaa 5940 gaactaaccc tgaaaatttc agcccgatcg gagaacatca atacaacatc ctttggaaag 6000 ggggttgcgg ttctcgcgaa ctggctc 6027 // ID Loner_Ele1 repbase; DNA; INV; 6094 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Loner non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; Loner; KW Loner_Ele1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6094 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6094 RA Kojima K.K. and Jurka J.; RT "Loner non-LTR retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >98% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 675..2033 FT /product="Loner_Ele1_1p" FT /translation="MDESDGGDTPPRDLVARTRFYQPSSAGPWVVYFRRKK FT KILNVLSISRELTRKYPGTVIHQVKESKLRVTAPSYKAANEIAQHEAFTVE FT YCVYVPAREVEVEGKIIDASLTCEDIKTGKGVFENRSIPDVAILDCKQLQS FT VSMEGNKKVFSPSASFRVTFPGSVLPKFVCIDSAFYPVRLYVPKVFNCTNC FT KQLGHSASYCDNKPRCGKCNQAHLEDACTQEAEKCSYCREELHDLQKCPAY FT KRRIRNESRSIVKRSKKTYAEMVKSLNPSAKESSSAIPVGNSFQELSSDEA FT NDDETDGGDSSEVHAPGKRKSQSSPGLRRKVMKSSLKSLPNSKGGGANLKL FT PKKQVFDASVPCCSKDLPPPPPPIKYTSKRSSRQDSKKKATEVPRSSSKTP FT GAKRGLITFTALVDRICNALGVSDSFRGILDLFLPAIEEYLRELTISWPLL FT ASIISFDG" FT CDS 2029..5709 FT /product="Loner_Ele1_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDNSSSEVQDMITVLQWNCHSLKPKLDTFKFLLHNSD FT CDIFALCETWLSSEDDLNFYDFNIIRQDRDDNYGGVLLGIKKCHSFYRIPI FT PTHPGIEIVACQISVKGKDLCVASVYIPPRVSINRRHLWNAVSMLSSPVLI FT LGDMNSHGTGWGESYDDYRAAIFYDLCDDFNLNILNTGEVTRISSDGKESR FT LDLSLGSSSLSFDCMWKVIDDPHGSDHLPIITSIRNNFERSEQSIQVPFDL FT TRNIDWQKFASAVTFGIESFNTLPPLDEYRFLTELIYKSALESQKQRVPSA FT SFKRRPATPGWDDECTQLYREKSNAFKAFRRYGTSELYLEYSKLEKRLKNL FT IKAKKRSYWRRFIDGLSRETSMTTLWRVARNMRNRSSSNESDEYSNRWIFN FT FAKKVCPDSVPAQPPSWETQRDDALDRPFTMLEFSMALLSSNNSSPGRDMI FT KFNLLKNLPDIAKRRLLDLFNLFMEHNIVPPEWRQVKVIAIQKPGKPASDH FT NSYRPIAMLSCLRKLLEKMILSRLDHWVESNNMLSSTQFGFRKGKGTNDCL FT ALLSSDIQLAFADKEQLASVFLDIKGAFDSVSIEILSESLHNSGLPGILNN FT FLYNLLSEKHMSFTLGQLTTSRISYMGLPQGSCLSPLLYNFYVKDIDNCLE FT EPCTLRQLADDAVVSMKGPRAEILQRPLQNSLDNLSTWARNLGIEFAPQKT FT QLVVFSRKRNPAQLKLKLLGTDIDQSLTSKYLGVWFDSKGTWRTHIRYLTE FT KCQQRINFLRTITGTWWGAHPEDLIKLYKTTILSVLEYGSFCFLSAANCHL FT IKLERIQYRCLRIALGCMHSTHNASLEVLAGVLPLQNRFWELSLRILIKCG FT VSNTLVIENFEELLKLNTQSKFMRVYLYYMSSDISLPCYNPPRVRFTNDSS FT SVEYDLSMKQAIHGIPDQLRCITIPRIFNAKYQNVDSYRRYFTDGSRINGS FT TGFGVFNENSSAFRKLQEPCTVYVAELAAINFALGMISNMPADHFFIFSDS FT LSSIEALRSMKPVKHASYFLTKIREQMCALVERSYKITFVWVPSHCLIYGN FT EKADSLAKVGAEEGEIYDRRISHDELFHLVRQSSLHGWQSDWLNDQLGRWL FT HSIIPRVSLRAWWKGWDVSRDFIRVMSRLMSNHYSLDAHLHRINLALSNIC FT NRCGSGYDDIDHVVWQCPDMDASRAQLMDTLAARGRQPYVSVRDVLGSRDL FT VYMLSIYDYLRLCCIKV" XX SQ Sequence 6094 BP; 1694 A; 1361 C; 1289 G; 1750 T; 0 other; tcattccttg cttagcgctt gaacgagtcg gtcgacagga tcccggtctc atcgtacggc 60 actttttgca gagttcattt aggccttttt ccccaccgcc cgcgtgggtt tttggtttaa 120 ttagtgagtt aaacaagttt cagttttgtg caaagtgagt gcttagacga ttgcgctttg 180 cgaaggaagc gaactagtga aactagtatc gttccaccgc aataaaatca tcgcccatca 240 tcaattatcg gtgattgatt tttctcccgc accgagaaca acaacaaggc agaaggccga 300 cccggtcggt ctactaccta cggcgcgtgg acgttctcct gctgtagaac ggataagtat 360 acattatcta ttgaattgct ttcgcatctc cgaacaacag tgttgagttc aaggttgttt 420 ttcgctcctc ctctgtctct ctcccggtgc aattttgccc gtatagggga gttgggtaca 480 aaatagccca ctgattggtt aatttttttt cttttttttt ttttgtgaat ttgatttacc 540 catttagggg aggtgtgtac aaaatagtcc actgattaat aatttttttt ttacacatat 600 tttttttgtt tttgtttttt ttttttatta ttattttttt ttcttcattt ggttgcggtt 660 aagctttttt ccgcatggat gaatcggatg gaggcgatac gcctcctaga gatctcgttg 720 ctcgaacgcg gttctatcaa ccgtcttcgg ctgggccttg ggttgtctat ttccggcgca 780 aaaagaagat tttgaacgtg ctctcgattt ctcgggagct gacgcgtaaa tatcctggga 840 ccgttattca ccaagtgaaa gaatccaagc tgcgtgtaac tgcacctagt tacaaagcag 900 cgaatgaaat tgctcagcat gaggctttta ctgtggaata ttgcgtctat gtgccggcac 960 gagaagtcga ggtggaaggg aaaattatcg acgcgagtct aacatgcgaa gacattaaga 1020 ctggcaaagg cgtctttgag aaccggtcca ttcctgatgt ggctatactg gattgtaaac 1080 aattacaatc agtttccatg gaaggaaata agaaagtgtt ttcgccgtca gcctcgtttc 1140 gagtgacttt ccccggttcc gtcctcccga aatttgtttg tattgacagt gctttttacc 1200 cagtgcgtct ttacgtgccg aaggtattca attgtaccaa ctgcaagcag cttggccaca 1260 gtgctagcta ttgcgacaac aagccccgtt gtggcaaatg caaccaagct cacttggaag 1320 atgcttgcac acaggaagcc gaaaaatgta gctactgtcg cgaggaactt catgacctgc 1380 agaaatgccc ggcttataaa agacgcattc gaaatgagag tcgctcaatt gtgaagcgct 1440 ccaagaagac atacgcggaa atggtaaaat ctttaaatcc gtccgcaaaa gagtcttcat 1500 cagccatccc agttggtaac tcttttcaag agttgtcttc cgatgaagcg aacgacgacg 1560 aaaccgatgg gggagattct tcggaggtac acgcacccgg taaacgaaag tctcaatctt 1620 ctccgggact gcgccgaaag gtaatgaaat cttccctcaa aagtttgcct aattcaaaag 1680 gaggtggggc aaatttgaaa ttgccgaaga aacaggtctt tgatgcttcc gttccgtgtt 1740 gcagcaaaga tctgccgccc ccgcctccac cgattaaata tacttcaaaa agaagctcta 1800 gacaagatag caagaaaaaa gctactgaag tccctcgctc ttcctcgaaa acccccgggg 1860 ccaagcgagg actgataact tttacggccc ttgtggatcg catatgcaat gccctcggag 1920 tttcagactc attcagaggc atactcgacc tttttctccc cgcaatagaa gagtatctca 1980 gggaattgac tatttcgtgg cccctccttg cttcgatcat atctttcgat ggataattca 2040 tcgagtgagg tacaagatat gatcactgtg ctacagtgga actgtcatag tctaaaacct 2100 aaattagaca catttaagtt tttgcttcac aactccgatt gtgatatatt tgcactctgt 2160 gaaacatggc tttcttctga agatgatctt aacttctacg attttaacat tatacgccag 2220 gatcgagatg ataattacgg gggtgtgctt ttagggatca aaaagtgcca ctcattctac 2280 agaatcccca tcccgactca tccaggcata gaaatcgttg cttgccaaat aagcgtaaag 2340 ggtaaagacc tttgcgtagc ttctgtttat attcctccaa gagtttcaat aaaccgccgt 2400 catctctgga atgcagtctc catgctctca tctccagttt taatactggg tgatatgaac 2460 tcacatggta ctggatgggg cgaatcttat gacgattata gagcggctat tttctatgac 2520 ttatgcgacg atttcaactt gaacattttg aacacaggtg aagtgacacg tatttcatcc 2580 gacggcaaag aaagccgcct tgacctgtct ttgggatcaa gttcactatc attcgattgc 2640 atgtggaagg tgatcgatga cccccatggt agtgatcact tgcccataat aacttctatc 2700 agaaataatt ttgaaaggtc agaacagtca attcaggttc cttttgacct cactaggaat 2760 atcgattggc aaaaatttgc atcagcggta acctttggta tcgaatcatt caacactctc 2820 ccaccgttgg atgagtatcg attcctaaca gaattgattt ataaaagtgc attagaatcc 2880 caaaagcaac gagttccaag tgcatccttc aaaagacgac cagccacacc tggttgggat 2940 gatgaatgca ctcagttgta tcgagaaaaa tccaatgcct ttaaagcttt tcgtagatat 3000 ggtacctcag aactttattt agagtactcg aagctcgaaa aaagactgaa gaatcttatt 3060 aaagcgaaga agcgtagcta ttggcgacgt ttcatcgatg gcctctcacg agaaacttca 3120 atgacgacgc tttggagagt ggctcgcaac atgcgaaacc gctcatcctc aaatgagagt 3180 gacgaatact ccaatcgttg gatattcaac ttcgcgaaaa aggtctgtcc agactcagtt 3240 ccagctcaac cgccatcctg ggaaacccag agagacgatg cgttggacag accattcact 3300 atgctggaat tttctatggc tcttctttca tcaaacaact cctctccggg acgcgatatg 3360 attaagttca atcttcttaa aaatctccct gatatcgcta aaagacggtt gcttgacctg 3420 ttcaacttat tcatggagca caacattgtt ccacctgaat ggagacaggt caaagtaata 3480 gctattcaaa agcctgggaa accagcgtct gatcataatt cgtaccgccc gatcgctatg 3540 ttatcatgtt taaggaaatt gttagaaaaa atgatcctat cccgactcga tcactgggtc 3600 gaatcaaaca acatgctttc cagtacacaa tttggctttc gcaagggcaa aggaacaaac 3660 gattgtctag cgttgctttc atcagatatt caactagcct ttgcggacaa ggagcaattg 3720 gcttcggttt tcttggatat taagggcgct tttgattcag tttccataga aatattgtca 3780 gaaagcctgc acaatagtgg attacctgga atattgaata atttcttgta caatctgttg 3840 tcagaaaaac acatgagctt cactctcggt caactgacaa cttccagaat tagctatatg 3900 ggccttcccc aaggctcatg tctgagtccc ttgctttata atttttatgt taaagatatt 3960 gacaactgtt tggaagaacc atgcacgcta agacaacttg cagatgatgc tgtggtttct 4020 atgaagggcc cacgagcgga aatcttgcaa agaccattgc aaaattccct tgataatcta 4080 tccacttggg ctagaaattt agggatcgag tttgctccgc aaaaaactca actggtcgta 4140 ttttcaagga agcggaatcc tgcccaacta aagcttaagc ttttgggaac agacatcgac 4200 cagtctttga cttcaaaata ccttggggtc tggtttgatt caaaaggcac ttggcgaact 4260 catattaggt atttaacgga aaaatgccaa caaaggatca attttctacg aacaattacc 4320 ggaacatggt ggggtgctca cccggaagac ctcataaaac tctataaaac aaccattctc 4380 tctgttctag agtatggctc tttctgtttt ctctcagcag caaactgcca cttgattaaa 4440 ttggaacgaa ttcaatatcg ttgtttgcgt atcgccttag ggtgtatgca ctcgacacat 4500 aatgcgagcc tagaggttct ggcaggggtt ttaccgctac aaaaccgatt ctgggagctc 4560 tcgctaagaa tacttattaa gtgtggagtg agcaacacac tcgtcatcga aaacttcgaa 4620 gaattgctca aactgaacac tcagtcaaaa ttcatgagag tttatctcta ctacatgtca 4680 tctgacatta gccttccatg ttataaccct ccacgtgtac gcttcaccaa tgacagttcc 4740 tctgttgaat atgatctgtc catgaaacaa gctattcatg gaattccaga tcaacttcga 4800 tgtattacta tcccacgtat tttcaacgca aagtaccaga atgtcgattc ctacaggaga 4860 tatttcactg acgggtcccg tataaatgga tccactggct tcggtgtctt caatgaaaac 4920 tcttccgcct tccgaaaact tcaggaacct tgcacggttt atgttgctga gctggcagca 4980 atcaacttcg ctttggggat gatctctaac atgcccgcag accacttctt catcttctcg 5040 gatagtctca gttctattga ggcactccga tcgatgaaac ctgtaaagca tgcatcttac 5100 tttcttacaa aaataagaga gcagatgtgt gcactggtcg aaagatcata taagattacc 5160 tttgtatggg tcccctcaca ttgcttaatt tatggcaatg agaaggcgga ctctctcgca 5220 aaggtgggcg ctgaggaagg tgagatatat gatagaagaa tttcacacga tgaattgttt 5280 catttagtac gtcaaagttc tcttcacggt tggcaaagcg actggctaaa tgaccaactg 5340 ggacggtggt tacattccat aatccctaga gtttccttgc gagcgtggtg gaaaggttgg 5400 gatgtaagtc gagatttcat tcgcgtgatg tcaagactta tgtccaacca ttactcgcta 5460 gacgcacatt tgcacaggat caacctcgcg ctgagcaata tatgtaatag gtgtggttcc 5520 ggttatgatg acatcgatca cgtagtttgg cagtgcccgg atatggacgc ctccagagca 5580 caacttatgg atacccttgc ggcccgaggt agacaaccct atgtttcagt tagagatgtg 5640 ttgggctccc gcgatctcgt ctacatgttg tcgatttacg attatcttcg cttgtgttgt 5700 ataaaagttt aattctcttg tgttctcttc cccagttttt ttttttctct ctgtgttatg 5760 tcccatttgt gttggattga tgttcctggc catccggcaa tcgaaaaccc acatgaagtt 5820 ggtatacgcc atgatacgac aaacgagcaa ccatgaaata ccacccggat gagctgcaaa 5880 gaaccctgta acccaatccc aacctagccc ttcctaaaaa tatttgtgac cactaacctc 5940 gagttgccac gagtaccctg gctccaaccc ggactaaact tggtacttaa gaagttaaac 6000 aattgtaaaa aagtacaaaa atgaatctcg gctccgtaaa gcgttaaacg cgatagagcc 6060 ttaaataaat gaattggtaa aaaaaaaaaa aaaa 6094 // ID Gypsy-104_AA-LTR repbase; DNA; INV; 159 BP. XX AC supercont1.206; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-104_AA_; KW Gypsy-104_AA-I; Gypsy-104_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-159 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.206; Positions 119155 118997. XX SQ Sequence 159 BP; 44 A; 44 C; 27 G; 44 T; 0 other; tgaatcaatg aatagacagt tcaattcggt ctttggacca agtaacaaca ggagttatta 60 ctggaatatc accttccttc acaggcgatc atgcgtcatc tccctgcttc atcaagactc 120 gtcggttcat cttcgcttcc caccaacgag actacatca 159 // ID Gypsy-18_AA-I repbase; DNA; INV; 4385 BP. XX AC supercont1.213; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_AA_; KW Gypsy-18_AA-LTR; Gypsy-18_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4385 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.213; Positions 1460912 1456528. XX CC Positions [3367-3828] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 97..2946 FT /product="Gypsy-18_AA-I_1p" FT /translation="MTEPTPELMEALSGMIAQALKASLGSVVQQQVAALNA FT NGDPAAPAALAPKVPSFSMSEYRSTEESSVSDYFNRFEWAAQLSHIPADQY FT ANYARVHMGAELNNALKFLVSPNDPAATTYQDLRTTLIDHFDRAKNKFVES FT IKFRHIVQEKDESVAKFVLRLKQGAAHCEYVAFLDRMLIEQFLHGLEARDM FT CDVIIAKDPATFKAAYEIAHALEATRNTAREVNTGAPSPNPEAANKLGYET FT PRTKKVNRQSPSSHPKQHRYLTESPFEEQRNGSQQDRGSGPCNGCGGHHMR FT SDCRFRDARCNKCNKKGHIAKVCRSTRSSAQSFSTDQVESSEDPACEIDVV FT QTLGQLCDSTAIGKKLIDVIMEGQHLQMELDTGAPCGIISESKLRAIKPNF FT SLLPSDRQFASYSGHRINCLGRLPVKVSVGSASRRLNVYVVSGDSDPLFGR FT EWIAQFVDQINLNRMFTPSASVNAVTTTELISSQELKLAELLENYKDIFSD FT VPGKLIGPPAKVHLKPGASPIFAKARDVPLALCDRYAAEIEKKITAGFYEK FT VDYSEWASPTHIVVKTNGSIRITGNYKPTVNPRMIIDEHPIPKVEAIFNKL FT KGASFFFFLDLTDAYTHLPIDEEFRHVLTLNTVTHGLIRPTRAVYGATNIP FT PIWQRRMENVLQGLDNVVNFYDDIIVAAANFEQLMLSLTAVMDRMKQYGLR FT LNRSKCIFAVPSLECLGHRIDRNGLHKSDKHIEAIRDAPRPETPEDLQLFL FT GKSTYYSAFIPNLSTRARSLRDMLTADTFVWTPEANMAYQDLKKALISPQV FT LMQYTPDLPLLLATDANKTGLGAVLSHRLENGTERPIAYASCTMSKTEQQY FT PQIDKEALAIVWAVNKFFALSVCSKIYFDHRPQTTHADIAPRKIPPYAMYQ FT SNGELCRLLGPFQLRRRIPANQIEHQCRLLLTNSQQVNSFRRKQPI" FT CDS 3235..4353 FT /product="Gypsy-18_AA-I_2p" FT /translation="MKGLARSYVYWPGIDTDIERTAKSCMECARYAPAPPK FT FSSHHWEFPRAPWERIHIDYAGPFAGSMLLIIVDAFSKWVEVKTTTSTTTS FT ATIQILDGLFSTFGAPITVVSDNGQQFTAAEFKDFLQSSGVKYHKLSAPYH FT PATNGQAERYVQTVKNGLKAMGTTSGDLQSNLNTFLQQYRRIPHSETGDSP FT AKLFLGRNIRTRLDLVRPQTTNTAITEKQQAAFEPSFRSFSPGQRVHVLSG FT NTKMDKWISGTVSARLGDLHYEVVHQGKHLKRHVDQMRRFEDGSLLAHTPS FT NEALLTTIPGVVRRAFSYGDTVTSQQAPINSESGASAERTPAAADEQFSTR FT NASPVGEREVSSPVLRRTTRLRRAPLRYSP" XX SQ Sequence 4385 BP; 1205 A; 1130 C; 1049 G; 1001 T; 0 other; ataattggtg tcagaagtgg gattgtggga gaaaatatcc ggcatcccaa aagtgaggtt 60 aaaactacgg aaaccacatt cttccgagca ccgaaaatga ctgaaccgac tccagagctc 120 atggaggcac tgtccggaat gatcgctcag gctctgaaag catctctcgg aagtgtcgtc 180 caacaacaag tagccgcctt gaacgcgaat ggagaccctg ctgctccagc tgctctagct 240 ccaaaagtac cctcgttttc gatgtccgaa taccgctcca cggaagaatc atctgtctcc 300 gactacttca atcggttcga gtgggcggca caactgagtc acattccggc agatcagtat 360 gccaactacg cccgagtgca catgggggcg gaactcaata atgcattgaa gtttctggtg 420 agccccaacg atcctgcagc gaccacctac caagatttgc gcacaacact catcgatcat 480 ttcgatcgtg ccaagaataa gtttgtggag agtatcaagt ttcggcacat cgtgcaggaa 540 aaggacgaat ccgtcgccaa gttcgttctc cgtctgaagc agggcgctgc acattgcgaa 600 tacgtcgcct tcctcgatcg gatgctgatt gaacagtttt tgcacgggct ggaggcacga 660 gatatgtgcg atgtgattat tgccaaggat ccagctacgt tcaaggcagc gtacgaaatc 720 gcgcacgcgc tggaagctac gcgtaacacg gcacgggaag tcaataccgg tgcaccgtct 780 ccgaatcctg aagcagcgaa caagcttgga tacgaaactc cgagaaccaa gaaggtcaat 840 cgccagtcgc cttcgtctca tcctaagcag catcgctatc tgacagaatc accattcgag 900 gagcaacgga acggtagcca acaggacaga ggatcaggac cgtgcaacgg ctgtggtgga 960 caccacatgc gaagtgattg tcgcttccgt gacgccaggt gcaataagtg caataagaag 1020 ggtcacatag ccaaggtatg tcgttcaacg aggtcgtcag ctcaaagctt ttctaccgat 1080 caagtcgaat catcagagga cccagcttgc gaaattgacg ttgtgcaaac actaggccag 1140 ctctgcgact ctacagccat cgggaagaag ctgatcgacg tcattatgga aggtcaacat 1200 ctgcaaatgg agctcgacac tggagctccg tgtggcataa tttccgaatc gaagttgaga 1260 gcaatcaaac caaatttctc gttgctgcca tccgaccgac agtttgcaag ctactctggc 1320 caccgtatca attgtcttgg acgtttacca gtgaaggtat ccgttgggtc ggcatcccgt 1380 cggctaaacg tatacgttgt gtctggagat tccgaccctc tctttgggcg cgaatggatc 1440 gctcaattcg ttgaccagat caacctcaac cgtatgttca ctccaagtgc atctgttaat 1500 gcagtgacga ctactgagct tatttcaagt caagaattaa agctggcaga gctgctggaa 1560 aactacaagg atatcttcag cgacgttccg gggaagctga taggaccacc agcaaaagtg 1620 cacttgaaac caggtgcatc tccgattttc gctaaggctc gcgatgtacc acttgcttta 1680 tgtgatcggt atgcagcgga aatcgagaag aaaataacgg caggtttcta cgaaaaggtg 1740 gattactcag aatgggcttc gcccacccac atcgtggtga agacgaacgg gagtattcgt 1800 ataactggca actataaacc taccgttaac ccaaggatga tcatcgacga gcacccaatt 1860 ccgaaggtcg aagcaatttt caacaagctg aaaggagctt ctttcttttt tttcttggat 1920 ctcaccgatg cctacacgca ccttccgatt gatgaagagt tccgtcatgt actcactcta 1980 aatacggtaa ctcacggact gattcgacca acgcgagctg tgtatggtgc gacaaatatt 2040 ccgcctatct ggcagcggcg catggagaat gtgctacaag gcctggacaa cgtcgtcaac 2100 ttctacgatg acatcatcgt cgccgcagcc aacttcgaac aactgatgct atccctaaca 2160 gcagtcatgg acagaatgaa gcaatatgga ttacggctca atcgatccaa atgcatcttc 2220 gccgttccat ctttggaatg cttgggccat agaatcgata ggaacggcct ccacaagtcc 2280 gacaaacaca tcgaagcaat acgagatgca ccacgaccgg aaacacctga ggatttgcaa 2340 ttgttcttgg gtaagtctac atattacagt gcatttatcc cgaatctctc tactagagca 2400 cgaagtttgc gtgatatgct gaccgcagat acgtttgttt ggactcccga ggcaaacatg 2460 gcatatcaag atttgaaaaa ggcccttatt tccccacaag ttttgatgca gtacacccct 2520 gatctaccgt tactcttggc aaccgatgcg aataagactg gacttggagc tgtactgtca 2580 catcggctgg aaaatggaac cgagcggccg atagcgtatg ctagctgcac aatgtccaaa 2640 accgaacagc aataccctca aatcgacaaa gaagcgctcg caatcgtgtg ggcggtaaac 2700 aagtttttcg cattatctgt atgctcgaaa atttactttg atcaccgacc acaaaccact 2760 cacgcagata ttgcacccag aaaaatccct ccctacgcta tgtatcagtc gaatggcgaa 2820 ctatgccgac tacttggccc atttcaactt cgacgtcgta taccggccaa ccaaattgaa 2880 caccaatgcc gattattgct cacgaattcc cagcaggtca actcattccg acgtaaacag 2940 cctatctaac gggagaggaa gtgccaggga tgaattcgac cagtttgttt tacttcaaat 3000 acaacaactt ccggtaagag cggatctcat tgcgcgcgag acacggaaag atccccacct 3060 cggaaaaatt gtccaagcat tggagctggg gcaacatctt gcacaatcag gctacaaggc 3120 accggaagcg aaatatacaa tggctgctaa ttgtttggtc ttcgaacatc gagtggtcat 3180 accagctgtt cttcgtccgg cgatttgcac gtagctcact tcggcgtcgt taaaatgaaa 3240 gggttggctc gctcatatgt ctattggccc ggaattgaca cagacatcga acgaacagca 3300 aagtcgtgta tggaatgtgc gcgctatgca ccggctcctc ctaagtttag cagccaccac 3360 tgggaattcc cgagagctcc atgggagcgc atccacattg attatgctgg tccctttgct 3420 ggttctatgc tgctcataat cgtcgatgcg ttcagcaaat gggtcgaggt gaaaaccacc 3480 acatccacga ctacatctgc tactatccag attctcgatg ggcttttctc cacgtttgga 3540 gcacctatca cagtagtgtc tgataatggc cagcagttca cggcagctga attcaaggac 3600 ttcctacaga gcagtggggt caaataccat aagctttctg caccctatca tccggccaca 3660 aatggtcaag cggagcgcta tgtgcagacg gtgaaaaatg gcttgaaagc gatgggaacg 3720 acgagtggag atctgcaaag caatctcaac acctttctgc aacagtatcg aagaattcct 3780 cattcggaaa ctggagattc accggcaaag ctgttccttg gacgcaacat tcggacaagg 3840 ctggatctcg tgcgtccaca aacaaccaac acagcaatca ccgaaaaaca gcaagcagcg 3900 tttgagccat cgtttcgttc tttctctcca ggacagcggg ttcacgttct atcgggaaac 3960 accaagatgg acaagtggat ttcaggaacc gtttcagctc gattgggaga tttgcactac 4020 gaagttgttc atcaaggaaa acatctgaaa cgtcacgtcg accagatgag acgttttgag 4080 gatggcagcc tgcttgcaca cacaccatca aacgaagcac tgttgactac gataccaggt 4140 gtggttcggc gcgctttttc atatggggac actgtgacgt cacaacaggc gccaatcaac 4200 tcagagtccg gtgcatcggc ggagcgaacc cctgctgcag ctgatgaaca attttcgaca 4260 cgcaatgcca gtcctgtggg ggaacgagaa gtctcttccc ctgtattgcg ccgaacaacc 4320 agactacgtc gtgcgcctct aagatactcc ccgtaatatc tgaatttctt agaaggggga 4380 agaaa 4385 // ID BEL-170_AA-I repbase; DNA; INV; 2371 BP. XX AC supercont1.309; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-170_AA_; KW BEL-170_AA-LTR; BEL-170_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2371 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.309; Positions 648903 651273. XX CC 'CCCCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(65..1318,1322..2371) FT /product="BEL-170_AA-I_1p" FT /translation="MKVFADGGVMADRIDAVRPVDLVLLHKILYCDDTTLN FT WLRRRVLRFEGFDFERRSPQFEVRMRIIEDAEPEHVQTTCEILDVPIGDAA FT CSKQNDAVAYLRVVKAEKIDVSFVASRGRVAPVRFMSIPKLELQAALLGYR FT LSLMIKEEMGVQVHKTVYWTDSRTVLCWLKTKKALKTFVANKVGEILEQSS FT PDDWLYVTTLENVADEGTRDAEELDISTTGRWFTGPNMLKQNVTDWSTMFA FT NDHAFEADIDEFCSETLVVNTVMESESILPDVNRFSQWRRLIRSTAYVRKF FT IDMCKRKSSIKQIQPCEEQRAICLWLKAVQNEQFPEDYTVLMNRGELKPSS FT KLKSLAPFLDNEGIMRVGGRLNKANVEPGVKNPIILGASGRFVSLLVQNYH FT QEAHHHGRELVLNEIRQTYWIIGLSLLRKVVFGCQYCRVRKPKQVTPIMGQ FT LPECRLDMYKLPFTHTGMDFFGPFDVTVGRRREKRYGVVFTCMTTRAVHVE FT VAHSLTADSCIMSIRRMMARRRKPVFIYSDNGTNLRGADRELKEALNQLSF FT GDIEREVLTRGIQWMFNPPAASHMGGSWERVIRSIRTSLRETLKERAPRQE FT VLETLLVEAEHLVNSRPLTYVSSDVNDCEALTPNHFLIGASSNVQSPGAFS FT QSDLDWGKQWRMAQKLIDNFWKRWMKEYMPTLIPRQKWRQSIGEVREGDVV FT YVSDMINERGKWPLAIVERCWTGDDNIVRVAEVRTHSGVYRRPCIKLKKVC FT LEGDIRDDKPVSHQGAQ" XX SQ Sequence 2371 BP; 704 A; 433 C; 630 G; 604 T; 0 other; taaatggtcc ttcgacgacg aatatatgcg aagtggacaa cagtttatga gagaaggctg 60 agaaatgaaa gtttttgctg acggtggagt aatggccgac aggatcgacg ctgtgaggcc 120 tgtcgatctg gttctgctgc acaagattct gtattgtgat gacacgacgc tgaattggtt 180 aaggaggaga gtactgcggt tcgaaggctt tgattttgaa cgacgatctc ctcagtttga 240 ggttaggatg cgaatcatcg aagatgcaga accagaacat gtacaaacta cgtgtgaaat 300 attggatgtt ccaatcggag atgcggcttg ttcgaaacaa aatgatgcag tagcctactt 360 gagagttgtg aaagccgaaa aaattgatgt ttcctttgtc gcttcaagag gaagagttgc 420 tccagtacgt tttatgtcaa ttccaaaact agaattgcaa gctgccttac tgggttaccg 480 cctttctttg atgatcaaag aagagatggg cgttcaagtg cacaagactg tgtactggac 540 tgatagtcgc acggtgttgt gttggctgaa aacgaagaaa gcattgaaaa catttgttgc 600 aaataaagtt ggagaaattt tggaacagtc gtccccagac gattggcttt atgtcaccac 660 tctggaaaac gttgctgatg agggaacacg agatgcggaa gagctggata tttctacgac 720 tggaaggtgg ttcaccggac cgaatatgct gaagcaaaat gtcaccgatt ggtcaacgat 780 gtttgcaaat gaccatgcat tcgaagccga tatcgatgaa ttttgttccg aaacattagt 840 tgtgaacaca gtgatggaaa gtgagtcaat tctacccgac gtaaaccggt ttagtcaatg 900 gagaagatta atccgatcga cggcttatgt acgtaagttc attgatatgt gcaagcgtaa 960 atcgagtatc aagcaaattc aaccgtgtga agaacagaga gctatttgtc tatggctgaa 1020 agcagtgcag aatgaacaat tccctgaaga ttacactgtt ttgatgaatc gcggtgaatt 1080 gaagccgtca agtaagctga agtcgttagc accttttttg gacaatgaag gaataatgcg 1140 tgtcggaggg agactgaaca aggcaaacgt ggaaccaggt gtaaagaatc caatcatact 1200 tggagcatcg gggagatttg tatctttgct tgttcagaat tatcatcaag aagctcatca 1260 tcatgggcgt gaactcgtat tgaacgaaat caggcaaacc tattggataa taggcttatg 1320 aagtttgctg cgaaaagttg ttttcggatg tcagtattgt cgtgtcagaa agcccaagca 1380 ggttacgcca atcatgggac aacttccgga atgccgttta gatatgtata agcttccttt 1440 tacgcatacc ggaatggatt tttttgggcc cttcgatgtt acggttggac gacgtagaga 1500 aaaacggtac ggagtggttt ttacttgtat gacgactaga gccgtccacg ttgaagtggc 1560 ccactcgttg actgcagatt cgtgtataat gtccattcga cgaatgatgg cgcgacgcag 1620 gaaaccggtt ttcatctact ctgataacgg cacaaatttg cgcggtgctg atagagagtt 1680 aaaagaagca ctcaatcaac tgagctttgg agacatcgag cgtgaagtgt tgacaagagg 1740 catacagtgg atgttcaacc ccccagctgc gagtcatatg ggtggatcat gggaaagggt 1800 tattaggagc atcaggacat ctctaagaga aacattgaag gagcgagcac cgaggcaaga 1860 agtcttggaa acgttactag tagaagctga acacctcgtc aatagtcgtc ctttgactta 1920 tgtatcatct gatgtcaatg actgtgaagc gctaacccct aatcattttt tgattggagc 1980 atcgagtaat gttcagtcac caggagcatt cagccaaagc gatctcgatt ggggaaagca 2040 gtggaggatg gcgcaaaaac tcatcgacaa tttctggaaa cgatggatga aagagtatat 2100 gccgactctg attccgagac aaaaatggcg acaatccatt ggagaggtaa gagaaggtga 2160 tgtcgtatac gtatcggata tgataaacga acgtggcaag tggcccctgg caattgtgga 2220 aaggtgttgg acaggtgatg acaacatagt aagagttgct gaagtcagaa cccactccgg 2280 agtttatcgc cgaccatgca tcaagctgaa gaaggtctgt ttggaaggag acattcgtga 2340 tgacaaacca gtgtcacacc agggggcgca a 2371 // ID DNA-TA-6_AAe repbase; DNA; INV; 2220 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2220 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1275-1275 (2011). XX DR [2] (Consensus) XX CC ~91% identical to consensus. TA TSDs. TIRs are 24 bp long. XX SQ Sequence 2220 BP; 728 A; 391 C; 391 G; 709 T; 1 other; caggtatgtt ccgtttttat caacacgatc ggcaattttc agttgacaaa aacggaaccg 60 tgacaaaaac ggaataattt tcttagacaa aatattttac aaatttgaaa agaaaattgc 120 agttttgtca ggataataca cattttcatc atataaatgc acagtattga tgtttaggag 180 ctgtgaggag catttagatt gttcattctt ataggttcgt aatgaaagtt gattgcagac 240 tcctatcaat caatatgatt ttgtaataaa tcattaatag ttttagtttt cttggttaga 300 atttcaataa atgtaatata gaatattaaa acgataatca cataaatgtc ctttgcatca 360 taaggtgcgt catatttgta aaaataacta caaggaagtg ggtcaagctg atcatttttt 420 tcaagaaaga ttaaacaaga aacaattctt tttctactgt tcatcctcat ccttctgctt 480 cttcttcttt atggcattac atcccaactg agaaagagcc ttcttctcag cttagtgttc 540 ttataaacac ttccgcagtt atcaactgag agttttttgc caaggtacat tttttgcatt 600 tttggccaca tccgattggc gattaaatgt ggagttatgt tgcgaaagtt ttgcaaactg 660 aaatgaatat ttacggtttc ctgtgctttg aattgccaaa ttctatcagt aaaatgattg 720 tggtgatctt tcttctaaag ttattactac caacttcgac agggatgcac taatcgccaa 780 tctagaacag cgccaccatc gccttttagc tttacgttag atcacgaata gactggcaat 840 aaactaacct taagttctag ttttttgttg agtaaaaata atattaaata ctagtggtcc 900 cgtcaagctt cgttttgcca tcaagtaggc tgctgaaaaa tggcatgcaa tctcccatac 960 aaaacgacag attagtttcc aggtgttttt ccagttattc cagataattt tccaaatcgt 1020 ttccttcact caaacacgtc ggaacacttg accaatgcaa cagtgaaata atcatgtaag 1080 tctatcagcc cgttcttaag gcatttcgtg acgtacaaac accattccat ttttatttat 1140 atagaagtta tcaaaaacac aaaagtgtac aaactctaga acaagcatca ccaaatagaa 1200 ctttttggta agaaactctt ttgtaaagtt ttatgaataa gcgtatggct gttggtataa 1260 gactcctaat taacctttta gtctccatgt ggggaagggg ttcatgtctg ctcaaagatc 1320 aagatcccag caaatttggg atggcaacaa aatcttctac agattgctta gttttgctgg 1380 acctcttccc ctaccccttg gaacaggatc taatcaatgg ctgtttgcta taatttgtgc 1440 aatgctatcg agtttgagag atagaagata atgtccaaaa tatgacgggg tcgacagtct 1500 tatcaaaaat aaaataaact cgatcatgtc tcttgccgtt gcccaaacat tcttgagttc 1560 attcccaagt gcatccctta tcaaaaagaa ctgaacatat atgggatcgt ccattaatca 1620 cgtaagcggt tatgggggga ggcaggggga tttgagatat cttaagtgcc atacaaatta 1680 ttttttattt tcatacaaaa aatcatacca tagggggagg gggtaaatat gtctatcaaa 1740 tttacataga aaattctttt atgccatgtt atataagaaa aaggagaggt ctctgcaact 1800 ctcttaagta cgtattcaac taatcacaaa ttccagttgt tgctaggacg tggccagagc 1860 aactctcgat tgttgaagga tgggccaatc ttgtccaaca gaccatgcta agtttgaaaa 1920 ctccaattga ttagttgcat aagaaggagt caaaccttat tgaatcgtat atgacttgac 1980 acctaccatg tatcgtaggt gctcaacatt aacataatgt cgaaaattca actctgaggc 2040 tattaaaaag catttmaatt gcttcaaaat aagtatttag tttatattct aaacattttg 2100 aaaattaaaa aaaacaattt ttgtgtgttg ataaaaacgg aataatttgt tgacgaaatc 2160 ggaacgtgac aaaatcggag cgtgacaaaa tcggaacgtg ataaaaacgg aacatacctg 2220 // ID Gypsy-1_HAS-I repbase; DNA; INV; 11420 BP. XX AC AEAC01014456; XX DT 20-JAN-2011 (Rel. 16.02, Created) DT 20-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Harpegnathos saltator genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_HS_; KW Gypsy-1_HAS-LTR; Gypsy-1_HAS-I. XX OS Harpegnathos saltator OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. XX RN [1] RP 1-11420 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Harpegnathos saltator genome."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AEAC01014456; Positions 11611 192. XX CC Positions [6430-6975] - Reverse transcriptase CC Positions [8146-8634] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 9483..11042 FT /product="Gypsy-1_HAS-I_3p" FT /translation="MTTSYKDLQRQYNELRRKQEETRQRRKELEKYKQQRL FT QEKIDLIRQSIAKDEEAIQQLEASSSSEETEKTQTIEPPCGETPTKIQKYK FT SHIQPRQIAPRISRSASTEKEISPEAQVWNIGDSDNSVDTVQNQPTSSKPS FT RSKQDKELRTNARDSSEKENSSPRTEEPSIQLRKQPVVKLRRLGERSRDRN FT ILAKRKEDDNSNKLSSKQQTRSYYRERTPNPRKSSKADKLERRREQKRLAN FT QRYRRKVKETLEAAKRQLAHARSDTTDSSDAEPTRVTPVNSPSGGSVQRPI FT PAVQRGRKAERKAATRDETESRDAKLIRRPVIVEDRRQSPKIRVKKQSHHQ FT QGTVVGGRAGERNLTTPTRRPGPSQDHPNRARSIQDHPDLERSIQDRPRQE FT RSNQHHPDRERTVSDHPDLERTIRDHQDLEMQDRDHQDLEVSDRDQRDLEV FT QEPDLETRERDHPDLEGRDPKDPDPEGSDHEYPDPEDPEIMKQICEGEDPE FT HRPGGNDNEDDDRRVLLQWLTGPE" FT CDS join(3547..7956,7960..9408) FT /product="Gypsy-1_HAS-I_1p" FT /translation="MSIAEGKGNPKGDVAFIESEIGPATRSGRIYGPSQPI FT IKPPTFVPEGSQGLSRTTSEISLASRTDAPNVVTASGNADTDRAMLGNSVL FT DPSLTGWGFEKSSILNSTGRSAIAKSRPPIDPRIRQMSEQFDEMKAMVKGM FT AVQLEFLTKINTNKPDSDEPKMTTEQWNSYRVQKRYGSNESIETKDDARSQ FT ISAVSANRDHIGNRNENNRMHERRDDRDEDLYDPEHDREYISERHRSKIPR FT CVEIEAKQRYATRDPPRGNPFGELGFSGRGDSQHPIRFFDDFIDIADYEEL FT RDRDRLYFFKHCMTGRAREWFDLQEFREFGEVVIKFRNKFWGRVEQNKLRD FT RVRNGMYNANTGRTMAEYVEKFARDVKYISPQIPEFELVQDLSQHFTVEIY FT NDLLRLRVDTIPELCHQLNIIQDYEHNRIHMAIKEKMSRELEAAQRSARYK FT PYGESIYGKRDEPEYPRQSPDRKYQREPRSDSRDSRRGYYSAKRAEFRDYR FT DPPTKWIPRKETSREYVDRRESSAGRKGGSARNNERFEEQRTDPRSRSANR FT DNRERSRQVKNERVEERISTPRDPSGTRNNRENSRQRDVSRQREPARERET FT SRIRETSRPRESSRSREQQQGVSSLRTEEILTELREFERREQNFKEDNKNV FT PIINVAIIAEKGEQELDRLSVRTLLDTGATVSAIDPSVIRNFEQRIREDRK FT TEDKELFPRVRINKTHMRGAFNQKGVHTNALVQLTIEMKDMQNRPKRYTHE FT FYAIETLFIPMILGADFLGKYRIAVTCPKPTALIATNRTVESNEALDLRMP FT STPEPVHETTVKWEIIDLCEPEPGIIKLEEFEAKEPKNENLNASRESLHTR FT SEEGSEDACAPETEGLEDTLAPAIEDPEDPLAPRVEDPENSLVSTDENFTH FT RLENLLEHYSLLLDGSLGKIRGYEHEIRMTTDKPFKGKSYPIPQKHLEEVR FT RQLREMENLGVVSRAATQYISPLVAVTKPNGKIRICLDARNINDRMENDHA FT QPPTIEEVLANIGHKSIFSKLDITQAYWQIPLTANSRQYTGFSFDHQTYIF FT ERLPFGIKTAGASFTRAIEAALKGKPELRKHVIVYLDDVLIASENETDHLS FT HLASVFEALQEAGFRLNRDKCEFARDRIVFLGHDITRVHVTITEETKRNIV FT DFPRPRNKKEIQAFLGLVNWDRRFVPKLSRHTVHLEKLLKKDTRFQWDQRM FT ERAFSDIKESVRNAPSLYLINPDRKYGIETDASTVGIGARLFQYDDAGREY FT TIAYASRSLKGAERRYTVTEVEGLALVWCLKKWRVLLMGRPVRVKTDHRAL FT KFISACAVASQRMARWLAFLQEFDLDINHIPGTANTTADALSRKPVKRNHR FT LPTERTTRWKRYRGATDFDSSSASEESDPSHEWDHEGRKMYVPRICLMADI FT HEGENTRDWVNLIREAQINDPAIIQRTAENPERYRTRNGFVRKTMDDGSDR FT VVIPDIAWELVLRIHRYLTHFGTDKVLEFARRYFEIANIERLTRDVVASCQ FT LCIATKVYARPTRGPEYYELPNDIGEVVSVDLYGPLPRSYDGCAYVLVVMD FT QFSKFTKMYPMRNQKLRTIEQILEEYYFRDVNRVPKTILTDCGGQFLAGRW FT KEFARRLGIKTRKTSPYNPQSNPVERVMRELGRSLRAYACDDHRRWDEIVP FT RVEQVINATEHSSTGAPPAVLENRQVDLVIGPPEILRSRGAPPIDRDRLLE FT AASERLRIMARKRRRQSEKRGTAIQYQQGDLVWIKSHRRSDQRHSKIHKLF FT PLYEGPFRVRDEIHPNVYTIERMTSKLIGPFNTRQLRPHRTSAWRPVEVNR FT EQPSQERSTTTGTMESAEIDTPYESGHSDPPSTPQRKRPRRRVRLAAVARR FT LSSDSEDSCERWIQPRRTSKVENQIDSPPDTDDTWEIVKRPKRVRKQSKIR FT NGGSKTKLSPTQTNNPPLTNPREHPEDIA" XX SQ Sequence 11420 BP; 3714 A; 2639 C; 2791 G; 2276 T; 0 other; aagtttattt tataatttta taagccgagt ttatttctat aacgtcgaag tcggtaatac 60 cgaccgggga gaacgccctc taggcgaacc cacggaggag cgtgactatg tgttcggttc 120 gggaaataag gttccgcatc tccacacact cgacggactg tgtgaagatg cggaaggcgc 180 gaaggcgcca agttagtagc tctcgaaccc cagagcagac cgaacgcgta tgtgaaagca 240 ctccgtcgcg tgcgacgtta aataaaaaac ccccgcggac gcgacgcgaa taggcgagta 300 agtaacggcg gatagtgacc ctttgtgtaa aaggttgaga ataaagactt ttgtgaaata 360 cacggtgtta gaactttgtt ccgggatccc cgtccgtctt cggaaatctg cgccgtggct 420 taaccaagtg gacgggtggc gaccatctta tcgaacctct agaacctgca ttcttcaccc 480 tctggacccg aggaaggact cgcagtccgg ttacactgtc aaaagaggaa ggcgaacgac 540 cggccgaccc gtgttctaca aggaggacgt caaccgcggc tcattaccga gctgactgac 600 accgacgagt tccgggttag ccgccggaca ccacaaaacc tacgtggcga ccgtgagtgt 660 aagagtgatt aagtgaatgt cgtaaaacaa cgttccgaag tgaaccaaag tgtagtgccg 720 ccgaaaaaat tgtgaaattc cgccccgaag cagcgtcgcc gatattgaga ttgaccatat 780 attactacgc gcgcacggaa gtagtgcccg ccatattaaa aaagaacgcg ccgagccgcc 840 atagcgagcg gacgtgcgga gagtcgcgcg cgcgtataca gtgacattat tacatcgccc 900 acgtcgcgaa accgtcgaga gtgtccggtg cgagtgcgcg acaaagggac taaaaggatt 960 atcagtgaaa aagtgcccgc cgcagctgtg aggacgaagg aatccatccc gggacagatc 1020 tggttgactg gacgaacatc cgaaagtgat ttcggcaacg agggacaaag actgcgtacg 1080 tgaacgaccc gacgcaaaga aaacaccaac ggtgagtcga gaaactcaaa ttactacaca 1140 ttacacacga cagaccaaag attagtgccg gtagtgcgcg cgagaggtta gaataacctc 1200 aacatagagt cagtgcttag agttagcatt tcgagacagt gaacgcaatt ctcgcgttaa 1260 aattcaagta gagattgaaa atctaaagct aacgacatcc ggaccgtcaa gatcgaacaa 1320 cgcgttccgg agatattgat tttaaagatt ttattcattt ttatgacact tgacaaagga 1380 acaccctcat acgtgaggcg aatcgaaacc gcgagtgaaa aactagcgag agaataaaaa 1440 gacgaatctg tagagtccag aaacgtcgag atcgataaac ggagttagga aatataaaga 1500 tttatagatt tgctgtcagc ggataacaaa gatagtttgc taattgcata gcgcatgccg 1560 aaccaagcat gagagcgaat tcctgtttag attcgacaca agtagacgta aaaacgaact 1620 cagcgagcca agaaacatta aattcggttc agaaataact gagatattca aagttacaat 1680 tggaacagtc gctgcgactg cgactaaaaa tatagagtca cttttcccgg atagattttt 1740 ctctggacga attacggcaa agccgcttaa attttcaaaa agaagatcaa attctgagtc 1800 tgctcgatca agaaccattc aatttgcata attagaacgc gaaatattaa ttattttcac 1860 tctaaattga ctcgtttccg actaaatttg gcactaacag gtctgaaagc gaacctttcc 1920 cacgaacctc tgagcaaaat tcggcttaga tgccatagta gagtagctaa aaagaaacta 1980 gcaaacaaaa aatcaaaggg attaattaac aattaccaga gttaccgcta aacaaagatt 2040 cagtgtcaga aaaagtcatt ttacaccact attttgagat aaaatttaaa caaggcacct 2100 gacatcattg caactaaaat tttcaaaaag gaaataaaaa acccgacttg ttgacacccc 2160 aacggtgtaa ttagtgcaaa tagaaccgga gatattaatt attttgtgga aaaaaaggcc 2220 catttccgaa gaaaatctgt atgtcaaggt atatgagcaa acttttcaac tgtaagactc 2280 agccaaattt cgctcagatg cttcatttgg tagcgtaata agctacttgc atgcgaaaaa 2340 ttattaagat taagcaacaa ttgactgaga tagaacattg acagttttcc taacctagct 2400 cgtggtaaag tgaaagtgat cgtcacgaaa aagcatttcc gagcattgga cgtatttcaa 2460 gcaataatat tcaaaaagta aagcaaaaac tgaaaccagt gacacctcag cgatcaagat 2520 taagtcatta atagctgaga tattaaataa actagttttg aggtcgctgc gacttgcgac 2580 aaagaaataa catcaatatt ttggaataac ttttttgttt gaagtcagat tttgatgcaa 2640 cttgaatttt tagttaagga ataaaaaact gaacttgcag acaccctgag tggataatta 2700 taacaaatag tttattttaa attaattatt ttgtaccaaa aatgaccact ttccgatgag 2760 attttacacc ataaggtcac agagtgaact tttaaacgga aagacttaac tgaattccac 2820 ttagaggctt tactagctaa cattgaaaca tacttggaca cacaacattg ccaaaattta 2880 ttaacaattt accgagctat ggttaataca actttcagtc ttctcagata ttagataaca 2940 aatattctgt gttcctatcc cacttcccga ttaatttcga ctacaagcga aggaaacaaa 3000 taacatacgc gttatattaa gcgcaagatc taattatctg tacaagtaga cattaaataa 3060 accagatagt tcgaggtcaa agggacccaa aatcattcgt gatactctaa gacataatat 3120 tgcgagcgtc gtcccctcgc aatgtacatc catacgttac tcaacagtgt acaataaagt 3180 tcacaatatt caaccgtagt tataaataat atttaagctt agtatataag tgaacaactt 3240 acccggtgta ttgtatttgc atttttttta tataaaatag tttttaagtc ttatatttac 3300 atacgcgccg aatattatag aatatttatt agcaaataag gattatctcg ggatcatcaa 3360 aaagcgcaaa cgtaggtaaa cttttaacga tataaatacg cttacgccgt aaacgcctag 3420 aatctcgaag tcgttattaa caaatcgata aactagaacg cgatcacttt cggttagcga 3480 gaaatacgat agatccgtga gactttggta agacgcgtag agaatccgcg tagaaagaga 3540 gtaaaaatgt cgatcgcaga gggcaagggc aaccccaagg gggatgtagc gttcatagaa 3600 tccgaaatcg gacccgcgac gcgatcagga cgaatatacg gaccttcgca accgatcatc 3660 aaaccgccaa cgttcgtacc cgaaggaagt caaggcctaa gtaggaccac atcggagatc 3720 agcctagcgt ctcggacgga cgcgccgaat gtcgtgacag ccagcggcaa cgcagacacc 3780 gacagagcga tgttgggaaa ctccgttcta gatccaagct tgaccggctg gggttttgaa 3840 aaatcgtcga tattgaacag taccggtaga tccgcgatcg cgaaaagcag accaccgata 3900 gatccgcgaa ttagacaaat gagcgagcaa ttcgacgaaa tgaaagcgat ggtaaaaggt 3960 atggcggtcc aattagaatt tctaacaaag ataaatacga acaaacccga cagtgacgaa 4020 cccaaaatga caactgaaca atggaacagt tatcgcgtgc aaaagcggta cggaagtaac 4080 gaatcgatag aaaccaaaga cgacgcgaga tcacagatat ccgcggtttc cgcgaaccgt 4140 gaccacatcg gaaaccgaaa cgaaaacaat cgaatgcatg aacgccgcga cgatcgagac 4200 gaggatctct atgatccaga gcatgatcgc gagtatatat cggaaagaca cagatcgaaa 4260 ataccgcgtt gcgtagagat agaagcgaaa caacgctacg cgactcgcga tcccccgaga 4320 ggaaacccct tcggggaact tggattttca ggtcgcggcg attcgcaaca tccgatacga 4380 tttttcgatg attttataga cattgcggat tatgaggaac taagagaccg cgatcgtctg 4440 tacttcttca aacattgcat gacagggcga gctagagagt ggttcgatct ccaagagttt 4500 agagagttcg gtgaagtagt aattaaattt agaaacaaat tctggggaag agtcgaacaa 4560 aataaactta gagatagagt tcgcaacggc atgtacaacg ctaacacagg ccgcacaatg 4620 gcagagtacg tagagaaatt tgcccgcgat gtaaaatata tctctcccca gataccagaa 4680 tttgagctgg tgcaggatct cagccaacac tttaccgtag aaatatacaa cgatctcttg 4740 agactccgag tcgataccat tccggaactg tgccatcaac ttaacattat ccaagactat 4800 gaacataatc gcattcacat ggcgattaaa gagaaaatgt cgcgtgaact agaagccgcg 4860 cagcgatccg cgagatacaa accgtatggt gagagcatat acggtaagcg cgacgaaccg 4920 gaatatccgc gacaatcgcc ggaccgaaag tatcaacgcg aaccgcgatc cgacagccgc 4980 gactcgagaa ggggatatta ttcggcaaag cgagcggaat tccgcgatta ccgcgatccg 5040 ccaacgaaat ggattcccag gaaggaaaca tcgcgcgaat atgtggacag gcgggagtcc 5100 tctgcgggac gtaagggggg ctcagcgcga aacaacgaaa gattcgagga acaaagaacg 5160 gatcctcgaa gtcgatcggc taatagagac aatcgcgaac gttcgagaca ggtcaaaaac 5220 gaacgggtcg aggagcgaat atcgactcct cgagatccat ccggtactag aaacaaccgc 5280 gaaaactcga gacaacgcga cgtatcgcga cagcgagaac ccgcgagaga gagagaaacg 5340 tcgcgcatac gcgaaacctc tcgccctcgc gagtcctcga ggtcacgcga acagcaacag 5400 ggagtatcat cccttagaac ggaggagatc ttaacggaac tgcgcgaatt tgaacgccgc 5460 gaacagaatt ttaaagaaga caataaaaac gtgcccatca tcaacgtggc aattatcgca 5520 gagaagggtg aacaggaact cgatcgctta agtgtcagaa cccttctgga cacaggagcg 5580 acggtatccg cgatagaccc gagtgtcatc cgtaatttcg aacaacgtat tcgggaggat 5640 agaaagaccg aagataagga attgttccct cgcgtacgca taaacaagac tcacatgcgc 5700 ggggcgttca atcaaaaggg agtgcacacg aatgcactcg tgcagcttac aatcgaaatg 5760 aaggacatgc aaaatcgtcc aaagagatac actcacgaat tctacgcgat agagaccctg 5820 ttcattccaa tgatactagg cgccgacttc ttagggaaat atagaatagc tgtaacgtgc 5880 ccgaagccga ccgcgctcat agcaacgaat cgaaccgtag aatcgaacga agcgctagat 5940 ttaagaatgc catcgactcc cgaacccgtt cacgagacca cggtaaaatg ggaaataata 6000 gatctgtgcg aaccggaacc cggcataata aagctcgaag aattcgaagc gaaagagcca 6060 aagaatgaaa acctaaacgc gagccgcgag agcttacaca cgagaagcga agagggttcc 6120 gaggacgcat gcgcgcccga aacagaggga ctcgaagata cgcttgcgcc cgctatcgaa 6180 gatcccgagg acccactcgc gcctcgcgtc gaggatcctg aaaacagtct cgtgtccacc 6240 gacgagaact ttacgcacag actagagaat ttgttggaac attatagttt gttgttagac 6300 ggatccctcg gaaaaatccg aggatacgaa cacgagatcc gcatgactac cgataagcca 6360 tttaaaggta agtcgtaccc tattcctcag aaacacctgg aggaagttcg tagacaatta 6420 cgcgagatgg agaatctcgg agttgtctcg agggcagcga cgcaatatat aagccctttg 6480 gtggccgtga ctaagcccaa tggaaaaata cgaatctgcc tagacgcgcg taatataaat 6540 gaccgaatgg agaatgatca cgctcagcct ccaaccatcg aagaggtatt ggcgaacatc 6600 ggccataaga gtattttctc gaaattagat ataacgcaag cttactggca gataccgtta 6660 acggcgaact cgagacaata cacaggattc tcctttgacc atcagacgta catctttgag 6720 aggctgccgt tcggcatcaa gaccgccggc gcgtcgttca cgcgagcgat cgaggcagca 6780 ctcaaaggga aacccgaact gcgtaaacac gtcatagtat atctcgacga cgtgttaata 6840 gcgagcgaaa acgaaacgga ccacttgtcg caccttgcga gcgtgttcga agcgttgcaa 6900 gaggctggtt ttcgtttgaa cagggataaa tgtgaattcg cgagagatcg tattgtcttt 6960 ttgggacacg atatcacgcg agtccacgtg accatcaccg aagagacaaa gcgaaatata 7020 gtagactttc cgaggccgcg aaacaaaaaa gagatacaag cttttctcgg tcttgttaac 7080 tgggaccgtc gtttcgttcc gaaattatcg cgacacacgg tacacctcga gaagctgcta 7140 aagaaagaca ccaggttcca atgggaccag agaatggagc gagcgttctc agatattaaa 7200 gaaagcgtac gcaacgcgcc tagcttgtat ttgataaatc ccgacagaaa gtacggtata 7260 gagacggatg cgtcgaccgt cgggatagga gcaagactgt tccagtatga cgatgcaggt 7320 cgcgaataca ccatagcgta cgcgagccgc agcctcaaag gcgccgaacg ccgttacacc 7380 gttacggaag tcgagggatt agccctcgta tggtgtctta aaaagtggag agttcttttg 7440 atgggacgac cggtgcgcgt taagacggac catcgagcgt tgaaatttat cagcgcgtgc 7500 gccgtggcga gtcagcgcat ggctagatgg ttagccttcc ttcaggaatt cgacctggac 7560 atcaatcata taccaggaac agccaacacc acggcggacg cgttgtcgcg caaacccgtc 7620 aagagaaatc accgacttcc cactgagcgt actacccggt ggaagagata tcgaggagcg 7680 accgatttcg actcttcctc cgcgagcgaa gaaagcgacc ctagtcacga gtgggaccac 7740 gaggggcgta aaatgtacgt gccccgcata tgccttatgg cagatattca cgaaggcgag 7800 aacactcgcg actgggtaaa cttgatccgt gaagcgcaaa tcaatgatcc cgcgattata 7860 cagcgaacag cagagaatcc agaacgatat cgaacgcgca acggatttgt aagaaagacg 7920 atggacgacg gaagcgaccg agtcgttata ccggactgaa tagcctggga actcgtactc 7980 aggatacatc ggtacttgac ccatttcggg accgataaag tactcgagtt cgcgaggcga 8040 tattttgaaa tcgccaatat agagagacta actcgagacg tagtggcctc atgccaattg 8100 tgcattgcga ccaaggtcta cgcgaggcct acacgaggtc cagaatatta cgaacttcca 8160 aacgacatcg gcgaagtcgt ttccgtcgac ctatacggtc cattgccgcg atcctacgat 8220 ggatgcgcct atgtactcgt ggtcatggac caattctcaa aatttacaaa aatgtacccg 8280 atgagaaacc aaaagctcag aaccatcgag caaatcctgg aagagtatta cttccgtgac 8340 gtaaatagag taccaaaaac gattttgaca gattgcggtg gccagttcct tgcagggcgc 8400 tggaaggaat tcgccagacg tttaggaatt aaaacacgca aaacgtcacc ctacaacccg 8460 cagagtaatc cggttgagag ggtgatgcgg gagttgggac gttccttacg agcgtacgcg 8520 tgcgatgacc accgacggtg ggacgaaatc gttccgcggg tcgaacaggt catcaacgcg 8580 accgagcact cgagcaccgg agcgcctccc gcggtattag aaaaccgcca agtagatctc 8640 gttataggac ctcccgagat actgcgatct cggggagcac ctcccataga ccgtgacagg 8700 ctgttggaag ctgcgagcga gagactccga attatggcgc gaaagcgaag gcgtcagagc 8760 gagaaacgcg gtaccgctat tcagtatcaa cagggggatc tcgtctggat aaaatcgcac 8820 aggcgtagcg accaacgcca ttctaagatt cataaattgt tccccctata tgagggacct 8880 tttcgcgtgc gagacgagat ccacccgaac gtatacacga tcgagcgtat gacatcaaag 8940 ctgatcggac cgtttaacac gcgccagttg cggcctcatc gcacttcggc ctggagaccg 9000 gtagaagtaa acagggagca gccgagccag gagcgatcga caactacggg aaccatggag 9060 agcgcggaaa tcgacacacc gtacgagtcg ggacactcgg atccaccgag taccccgcaa 9120 cgaaagcgac caaggaggag agtccgacta gccgcagtag cgaggagact gtcatccgac 9180 tccgaggatt cgtgcgaacg ctggatccag ccccgcagaa cgagtaaagt cgagaaccaa 9240 atcgattcgc cacctgatac agacgacaca tgggagatag tgaaacgacc aaaacgagtc 9300 cggaaacaat caaaaatccg gaatggcggg agtaagacga aattgtcacc cacccagact 9360 aataacccac ccttaacgaa tcctcgcgag catcccgagg atatcgcata aatggttcag 9420 tagatctcga ggaaaatagt gagaaaactc taaaaaacga gtgtttctct tactaattac 9480 agatgacgac gagctacaag gatctccaac gacaatacaa tgagctgcgc agaaaacaag 9540 aggaaactcg acaaaggaga aaagagctgg agaagtataa acaacaaagg ctacaagaga 9600 agatcgatct catcaggcaa tcaatagcaa aagacgaaga ggctatccaa caactagagg 9660 cttcatcatc gtcggaggag accgaaaaga ctcagaccat cgaacctcca tgcggagaaa 9720 cgccgacaaa gatacaaaaa tacaaaagcc acatccagcc cagacaaata gcgccaagaa 9780 tatccagaag tgcatcgaca gaaaaagaaa tatcgccgga agcgcaagta tggaacatag 9840 gagactccga caattcggta gacacagtac agaatcaacc aacgtcatct aagccatcga 9900 gatctaagca agataaagaa cttcgcacaa acgcgagaga cagcagtgag aaggagaaca 9960 gctcacccag aaccgaagaa ccgtccatac aacttcgtaa gcaaccggtg gtaaagttaa 10020 ggcgtctcgg agaaagatcg cgagaccgaa atatactcgc aaagagaaag gaagatgaca 10080 atagcaacaa gttatccagc aaacaacaga ctaggagcta ctaccgagag agaacgccaa 10140 acccacgaaa atcatcgaaa gcagacaagt tggaacggag aagagagcaa aaacgtttgg 10200 cgaatcaacg ataccgtcga aaggtaaaag agacactgga agctgccaag cgacaacttg 10260 cccacgccag gagcgacacg accgattcga gtgacgccga acccacacgc gtaacgcccg 10320 tgaattcacc gtcaggcggg agtgtgcagc ggccaatacc tgcagtgcag agaggacgaa 10380 aggccgaacg taaagcagca accagagacg agaccgaatc gagggacgcc aagctgataa 10440 gaagaccagt gattgtggaa gaccggaggc agagcccgaa gatccgagtc aagaagcagt 10500 cccatcacca acaggggacg gtggtcgggg gcagggctgg tgagagaaat ctaacaacac 10560 caaccagacg cccaggacca agccaggacc atccgaaccg tgcaaggagc atccaggacc 10620 atccggacct ggagaggagc atccaggatc gaccgcgcca ggagaggagc aaccagcacc 10680 atccggaccg ggagaggacc gtaagtgacc atcctgacct cgagaggacc atacgggatc 10740 accaggatct cgagatgcag gatcgggatc atcaggatct cgaggtgagt gaccgggatc 10800 aacgggatct cgaggtgcaa gaaccggatc tcgagacacg agagagggat catcctgacc 10860 tcgagggacg tgacccgaag gatcctgacc ccgagggaag tgaccacgag tatccggacc 10920 ccgaggaccc cgagatcatg aagcagatat gcgagggtga ggacccggag catcgacccg 10980 gagggaacga caacgaagac gacgacaggc gagttcttct gcagtggctg actggtcccg 11040 agtagacctc gagacagaga tcaggtaatc gcgagatcaa aggatctcga ggtcagtcat 11100 aaatgactgc acgaaaccga tcaaaagtta tgttatattt ttgtttatta tgcgactagg 11160 cttattatta tataatttta gttaggacct cgagggctta gagaaggtaa gaatgtgatc 11220 tcgagtcccg gtaaaaccta gtctttgcaa tacacgtacc tgctataatc ggaaaagatt 11280 catattagtt ttgtttagtt cattatttta tacttacctg atttgtagaa gcatcccacg 11340 aagtcctgtg atgactcgac ctgcgaacat aaaagaaata ttattatata atcatatata 11400 gccatatata tatatatttt 11420 // ID CACTA-1_AA repbase; DNA; INV; 8393 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 8393 BP; 2678 A; 1520 C; 1556 G; 2639 T; 0 other; cccaagtagc acttgtaaca tagctacttt ttaaagtata aaaaaaatgt tgcaagataa 60 ttacatagcg aataccctga aacaagtcaa gttgcatata ctaggcaaat gtcactagtt 120 aaaaacattc cgacttttta tgagaataaa ttacaaatat gttacacagt agtgaacgca 180 agttgtttgt tgaaaagtaa ctgaaatcaa acattttaag ttgcaatgca ttaataaggc 240 tactatccaa taacatcttt gcaaaatgtt gtgatattgt ggtcactggt gaacttgaat 300 gtaataaact tgttttgatt taaagttttc caaaatacgg atgtgaaaat ggatcttcca 360 aacttaatca gtgaaaaaga taagaaaata tggagaaaac tcaagcgtaa tggtaatttt 420 gctagaacaa aaaaagtgtt aatacaacgc gctaaaaaaa tcgaacccaa taaactactt 480 aatcatggaa taataaatca tgcacataaa gaatgtatct caaatgtttc ggaagtgccg 540 gatcaaattg ggtcatcgat ccaatgcagt gaaaatctaa atcctcaaaa ggatgaagta 600 ttggattccg attctgagcc tgaatctgaa attaacgaca ctaacccaaa ggcagacatg 660 gaagatcttc gagaggactt gaaaaattgg gcaatcaagt ttcaaatcaa acatacagca 720 gtaaattctt tgcttagttt gttaaaaagc catatcccag ataacatact ccccaaagat 780 gcaagaacat tagtaggaac ccctcgtagc tcttgtgtca cttcggatcc aaacattgga 840 gggcaatact ggcattatgg attgcccaaa atcctcaaac acacgttatc aacgtatggg 900 gaacttccaa ataaattatc gctgaatgta aatattgatg ggttaccaac gtttaagagc 960 tcatcaacat cattttggcc aatactcgta aatgttcacg agcttcgaag tacaatgtcc 1020 cctttggtag ttggtgtttt ttgtggagca agtaatttga cagcgaataa tcttttctta 1080 tagcgtaatt aaactgcctt tattttacag ctaaacccaa agacgtgaac ggttttctta 1140 gaccattcat ttctgaattg aattctctta ttgatcatgg actttttatc aatcaaaaat 1200 tgataaccat agaactacgt tgctttatct gtgatacgcc agctcgctcc atgcttcgag 1260 gctagtatta gataaatagt gccaagaact aactaatatt ccagattctc ttttctaggt 1320 gtcatcggtc acaacggtta caattcgtgc ttaaagtgcg taactgaagg tgaatattcg 1380 tatgccttca aaactatgat attccctgaa atgaacgccg ctctacgaaa cgattcggat 1440 tttcgctcta gaacatacga agggcaccat aagtttgata gtattctcga ggatattcgt 1500 gggctagata tgatcaaaga ttttccaatt ggagattcgt tgcatttgat cgatttaggt 1560 ataatgaaac gactactgaa cgggtggaaa gcgggtactc tcaataacta taatgcaaga 1620 tggtcagcca aacaaataga gaatatttct tgttttctta aaacatgcaa gttgccacga 1680 gagtttaaac gaccagtaag aggtttggat catcttcctc gatggaaagg atccgaatta 1740 agaacctttt tgttatacat aagtgtagtc gttctgaaaa aatttttcga gtctgacgaa 1800 atactggatc acttcctaaa cttcttttgc gccattcaaa tttgtatgag atacgaccaa 1860 aacccagaaa attatgaagt ggcacgatgt ttaataatag attttctcaa tggagtgaaa 1920 accttgtacg gtgaacaact tttctgtagt aacttccata gtttaataca tttaattgac 1980 gatgtagaaa gatttggacc tctcgacact tttgacgcct atccatttga gtcaagacta 2040 tacaccttga agcggttgat tcgtactgga aatttgccat tgtcacaagt tgctagaaga 2100 ataagcgaaa tgcaggaaaa tgtttcacaa ccttctgaac ctaaacagcg ccctattcta 2160 aaacgaaaac ttagtggaaa taacttctca gaccaacatt tttctcaaat agagaaaaat 2220 tcagatgtgt attcctttgt cgatttcgga gacttttgta tagatacgca aaaaactgta 2280 gacaaatgga tcttaacaag aaacatgaaa gtaatgacgg tagataaaat tgtccaaaat 2340 aatgcatctg aaagtgtatt tttattcggg cagatattga gtgatctcag tgatttgttt 2400 gagaaaccag ttgtatcttc atctttgcaa attttctgct ccgacctaga agtttctcct 2460 gcagtgttaa ttgaatcaag cgatatattc tgtaaaatgg tcaaaataga ttgtcaggac 2520 cttatgttac ctaaatcggt tttcttccct ttgatccata caattagaca cgctaggcaa 2580 cacagtaagg attgatcaaa ataaaataaa acaatattcc aatcagattt gaaaagaatg 2640 ataattttat tgctgtacta tttctttaat ccgagccatt tagtattgtt aattggtact 2700 gctcatgtaa taaaaatcat gtattggaaa tgtgtacgtt taatcatcac tatcggaaga 2760 acgatcatca tcacttcctg tctcatcttc agttccacac tcactgctat ggctctgaac 2820 agcgggtggt agctgcaaca cgtctctgga catctgggaa gactcgtcaa gatttgaaca 2880 attctctgtt tgcacagaaa cttcagcgat atcctcagtt ttctgtcgtt tgtgcctctt 2940 ccttgaggat gatgctcgac gctcaccatt tccgttttca taccgtttaa gctccaacat 3000 cttgtacctg agaaggttct taaacaagtt ttcattctgc tcgcgagtag tagagcaatc 3060 tgctgcgtgt aaaacatgat gcatcgtttt aatgaaattg ggaaaggttg cttgaaagtt 3120 gatgttcagg ccaggatcac ctgttgatcc tttcttcgtc cttttgtttc ccatccaaga 3180 ataaggaatc agaacatttg gagccaacag tttccgcatg agtgttcgga acaatggtcc 3240 actgtctcgt tttccgttca aacagtaaat cgaatgcaaa taacggaaaa acttttcgga 3300 aaattttgga tcagccaaac attgttcaaa acccttaaac gattcgtcgg aatcgatttt 3360 tgtcattttg tcgaactcgg aaaaatcctc ttcttggcgc gttgcaattg ttggcacatc 3420 attcttattt cttaaagcca taagtttatt gaagttggcc atgaaatctg tcagtttcac 3480 taacagtcct tcctggacgt ccaagcgttc atttgttttc cgaatctcag tagtgatgtc 3540 gaatagtttc ccctcaaaaa cattgagccg atcatttatt ttccggagct cgatagttgt 3600 tggcgaataa tcaggttcat cacaggctgg cgatgaggca ccaaccatgt tacaatcgtc 3660 atctcccaga tcaacctcgt cttgctttat gggaatggac aatggttcct cggtgatttc 3720 gttgctgtta tcatcgatcg catagtaata tccattagca agttgtgcat actgctttcc 3780 atcggtaccc gtgacaactg gaagacttat ctgctgagac acctaaaaca ggaaacatca 3840 aatctcttag tatcaacatc atgaaaccaa tctagatcag tgtttcacca tacttgccat 3900 gtagcataat ctacgtaagc ctaattgaca atttcagcat ttgaactaca gttaaacctc 3960 catgtgtcga tgttccatta ctcgacatcg attcatggaa ccatactaaa aacaaaaaat 4020 catggttact gttatggttc ttacaacttt acaaagaata tctgttctat gagtcgatgg 4080 ttccttcaat atcgactcat ggaggtttga ctgtacatta ctaaacgtat ggatgcaaat 4140 actatgttta aatatcaata gcacattcgt tctatcgctc atctagattc ctgcgcatga 4200 tattccggtc atataacgat aatgtactaa aactacagtg gctcttacgt cgattatgcg 4260 aacatatgca gcatatcagc ataccagcag ccactcattt tattagaagt tttgtttcaa 4320 aaaatatcaa gacaccgttg gaagagcaca tacccctttt acagctgata cacaaatttt 4380 aaatttttcc tgcgaaaaac gttacgtagg gggcaagggg attcaaattt taaatttcat 4440 tgttacgtag taattagatg ctgcctaatt gaacttcaat ataatcatcc aaattagtgg 4500 ttggaattgt tttttgagaa acactgctct agaaactgga tcacagtggg acgaaactga 4560 aaataaacgg tcagaacatt ttagacgcca tatgtgtctt cgggacgttg ttttgaaata 4620 gcagcccgag taataataga aaataatttg ttaattaaac tgaacaaatc atgcgaccaa 4680 aaatacattg ttagaaatat atttttgagc gatcgctacc taacaaaatt ttagttctac 4740 tcctctagag gttttgcccg tatagttgca gatacgtccc actgtgtgga tgcatttata 4800 ttagatttat gatagaaaat gcttaccgaa ttttcctcaa cagaatctga cgaaaggctg 4860 tatacgctcg ccactgctgg tgtatttccg taaacgaggt tttcatttcg aggaacatga 4920 agttggtgta cagtctggac agtgtgtggc atcgaggaag tggaggatgt agatttcata 4980 ctttgattat gaactctttg cattgcattc gttactggac tgaactccga agccttgctg 5040 gtatgatgcg attgtaccac atctactgga acatgatgag cagccataac tggtcggctt 5100 gcatagggta atggctttgg aactgcaacc tgcaactgag ctccatcaga ggaaacggta 5160 cacaacatcg gcgcaggaga agtgaattgc gtattgatgg gcattccggt aggtgtcgaa 5220 aactggcagg catctgaccc tacagcttgg gattccggaa cgaccttgcg atttgcgatt 5280 gtcttgagct gttgatcttg cgcagtagat tgcgttttga gcaatggaac atctggcagg 5340 gcgggtggag caatcatcat gagctgggtc ttatcgccag gagctgagca atttctggca 5400 ggacacctcg gatagacaga atgcgattcg gaaactgttg cggaactcgt attttctgga 5460 agggtggttg gtagagttgc ccgctcacct gcttcagctg agccgaattc cgttccagtc 5520 gatgatgcct ggggtttata atggatgtta ttgaagacga gggtattgcg atcgtaacta 5580 cttaccaatt tatagctctt tgatttgaat ttaggttgtt tttttgctgg tttcgctctt 5640 gtaccgtgac ccatctgagc ggcgtcttcg gagtcagtca ctttttcaag taaagccatc 5700 atttcttctg ccgattcgaa agttccagct cgtcctgcta gcttgcactt ttgctttttc 5760 cacatttgtt tatctgggat ggaattttcg tttttggaaa gactgaccag gttgtttggt 5820 ggccaaaaaa cccatttatc ctccacccaa ttggctggaa ctacggtcat tgacggcaac 5880 gattttttgg ttttacgcgt ttggacgata cagaaagaca ttctgattta ttttaatata 5940 cgagaggcac gaataaataa agcaaaccac gaaaaatatt aaaaacttcg agctcaaaga 6000 ttcgcgcaca aactctgttc tacaatcaaa acaaacgcaa gaattggtgg ctgtcaaaac 6060 tgtagttaca ctgcggcaaa acgttaaacg caaatcagtt atttggtaac atcatttgtt 6120 tatttgtgtc gataatacaa catagagcaa tacgaaatat tgcggctgta ttatttaatg 6180 tttcaatgtt accgctgatg tataaataca agtgacgtgt caacttatag gaagtaaatt 6240 atccgtttca aatcagttac agtgtaatcg tctggatcca ttgcgactgt aatgtaatgt 6300 ttacatcatg ttggaaattt ataacactgc cgattaaatg ttttggtatg cttggtttta 6360 tttgttgaaa taaatttaaa tacagtttta ttttattaaa acacactttc tcatttttct 6420 ttttcaccaa aactgcctat tagtaacgta ttgcaaaatt atatttcgaa aagatattgg 6480 cagctttgct aaggtaaaaa ataaccatta tcgctgtcac taagtctgat taacaggaat 6540 aaataaataa taattggtca ttgtaatttc tagatacctg actagctaaa tctaacagaa 6600 ttcaaacttg acgagttgtt tagttatgac catttttttc taacaatgtg ccgttcacga 6660 attgtgtaat gtatattttt tgcagaccaa tggaagtttt gaaaaacgtc tactggtccg 6720 gagtgggtca ctaggtaaga ttgagtaaaa atgatcggaa cttcctttcc tcaaccaatt 6780 ggtataaatc caactaaact tgcttcataa tttcaactac gaatactaga aatgaaaaag 6840 acgatattgg ataattatag gtttactatt gcgcctccat ggaattggtc caagataggt 6900 ctaattttgt ttttcttcta aacgggcaaa tatgtttgta aaatttcatc attcacaaaa 6960 aaaaaacact cagacctgag ggagaaagag tattattcaa ggtcacccag aatcgaaaaa 7020 taatctacag ttattcctga tttcgaagtc tccccatttt gtaaaacacc ttgttcacaa 7080 caaattcact gaacttgtca tctatgaggc aactatcttt gtaatgagtc cacttggact 7140 tgtctgtgtt agattttgaa aaaattgaag ttttacacca aaatgtgaca gtttaaaaaa 7200 aaaaattaaa aaaaaaaaca aaattgaact atttgggatc agttaaagga gatccgagca 7260 taaatcaaaa attgcaaaat tgtctaattt agttctcaca ccacattcgt agccaaaact 7320 ttgaaaacaa ctagagttgg agtcagattt atactaaata gtcgatcaaa tacatattga 7380 caaaattcca atggtcctca aatctagaca tcggcatgat tcctttggag cgtcgaaaat 7440 tgtcgaaatg gtagcgtttg gaactttttt tttccacttg ctaggcagtt acgattcaag 7500 aaaaaataac ctttttgttt aatattataa caaagtgcaa gtttttattt ccacatggac 7560 acaagtgatg ttgctcaaat gtgttttaat ttacacttgt attcaataat atttatgttg 7620 ctttgagtta aacgatattt tcgagcattc atccggcagt gtaaagccaa cgatatggcg 7680 ataaattcga ggaaaaaaag gtttttaatg aataatacaa tttaattaac atattttaat 7740 ggaaaactgg aagaatcaat agaattataa atgtctcaag ttcgttcggt ataaacataa 7800 tacaaatatc actttacttt caaaaacgat cgcatacttt tcatattgat tctatacata 7860 agctaataat ttgccgctca atccgtaaat ataaggaagt ttcaaagcgc gtgaattctt 7920 taaaactaga aggctcaaca tcaccgtaaa ttttgattcg catgttgtat tttaaatcag 7980 aattacaaat ttatatcaat atatatagag cgagataatt tatctaagat ttaagaaaaa 8040 ctctgccgtt cagcgatagt tagcatcgtg aactgaatct gtaacacagg aaaaaaagga 8100 aattatttcg aaatttagtt aaattgaaac aaccattcac ttgttgagcg tttctcaaag 8160 atgtgaaagt ttattttttg ttctacatca ctaatgtgtg acaaaggagg ttatatatta 8220 attacataaa aacatcaatt gcaacaatca agaagttgca tttaggctcc acctcttgcg 8280 ctttttaggt tacacagatg tgatggtcaa tgttgcaaaa gcccgcattg atatttaaga 8340 tttgtttcat aaaaattgtt gtcactaaat tgtaacttag tgtgttgctt ggg 8393 // ID Gypsy-30_DWil-LTR repbase; DNA; INV; 367 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_DWil_; KW Gypsy-30_DWil-I; Gypsy-30_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-367 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 1825338 1825704. XX SQ Sequence 367 BP; 96 A; 70 C; 102 G; 99 T; 0 other; tgtcgcataa tggacgtact ttggacactg tacgtgtctg tcacataatg aatatagttt 60 gaccactgtg cttgccagag caagctgtgc aatgagtttg ggatattggg aatgcttctt 120 agaatttgac tatagcgaag aatatgactg gcgcctaggt gccgtacgag tgaggcgaca 180 gcgttgtcgc gctgtcgccg agtgtagtcc agcttggcgt ctaagaagaa gtagatcggt 240 gcgtgatcga agaatgcgag tcagttgcgt tcggataatt gtatgcgtcg gttggaattc 300 acttcactct taaagtttct gaaacaatat ggcagaccag cagatagaac taaatgaccc 360 tgcgaca 367 // ID Gypsy-40_OD-LTR repbase; DNA; INV; 237 BP. XX AC CABV01005033; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_OD_; KW Gypsy-40_OD-I; Gypsy-40_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-237 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01005033; Positions 512 748. XX SQ Sequence 237 BP; 54 A; 71 C; 48 G; 64 T; 0 other; tggtttttat ctgctcctaa ccaatcgcta tgaaattacg atcataacac cttgtccgac 60 aaccctagtt ggcgttatcc ggcaaggtcc ggcaaggtcc ggcgatgtcc ggcgtccggc 120 gcgtccggca gactcggccg cgcgcctata taagcgcgcc cctgttcact atttctcatt 180 cttgtttcta ctaccaatac aagacaaacc aaatacaagt tgctatcttt ttacgca 237 // ID SmTRC1 repbase; DNA; INV; 4744 BP. XX AC AM268206; XX DT 19-AUG-2008 (Rel. 13.08, Created) DT 19-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE SmTRC1, a EnSpm-type DNA transposon. XX KW EnSpm; DNA transposon; Transposable Element; SmTRC1. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4744 RA DeMarco R., Venancio T.M. and Verjovski-Almeida S.; RT "SmTRC1, a novel Schistosoma mansoni DNA transposon, discloses RT new families of animal and fungi transposons belonging to the RT CACTA superfamily."; RL BMC Evol Biol 6, (2006). XX DR EMBL/GenBank/DDBJ; AM268206; Positions 1 4744. XX CC The features of SmTRC1 are: termini is 5'-CCC and GGG-3', 2-bp CC TSD. The transposase is distantly related to normal EnSpm CC transposase. XX FH Key Location/Qualifiers FT CDS 878..2557 FT /product="SmTRC1_1p" FT /translation="MHLMTSTYLSIRDADRVLNEFRLIVPDLPTSIRSILR FT TCTSVQPKSISSGVYYHLGLKTNLLRYVELWLCTCDFDSLQLYINVDGLSM FT SRSSSQHLWPVLGRIVAPRLSDVFMIGIYGGNTKPAQFNEISADTISEIKE FT MTETGLLSVRFNKYIAIKLSAVICDAPARSDVRYTVNHNGKAGCDRCVVNG FT RRLDGKMTFPNGEYTLRTDDSFRNQTQYIHHKGHSLFESLSIDMILTFPLD FT PMHMVYLGVTKKLVTLWIELGHKRLKNMNSCVIRTINKLISRCVESTPSDF FT PRKCRTLDYLSVWKASECRLFLLYLGPVILQNILPESLYINFKCLALSMYL FT LAHPKFYNRVTESVRMDLRNFLREYEWCYGCENLVYNVHSLQHLLDDVLAH FT GPLDSFSAFPFESYMRQIKQSVRSGYSVAKQAAQRYAEQMSFCDRLQVDIS FT TNDNPVIGMADSRKQVIMFRNSQIKSFHPDNVVVVKGKPGLITDIQDSGLL FT RFRQFTDPQNYFTDPFPSTDIGIFKCSVVSSAYSWVSIRDVDCKCMSINCV FT NHLVIVPLLHTVI" XX SQ Sequence 4744 BP; 1434 A; 862 C; 1004 G; 1444 T; 0 other; cccaaacaac caataaaggg gaaataaagg cgaacaaagg ggaaataaag cggacgaaag 60 cgtaaataaa ggggaaatgt cgctttttcg cctgcacgtc ggttatttag gggaaaagtc 120 gatttttccc ccttacgtcg tatatttggg ggatatttgt agacaatttg ttgtaacgta 180 taacatttga atgaaccgca ataaggaaca acattttgct gagtctgtca aaaataaaga 240 attattacat taaatttatg aatttcatca ttattactca tgcaataaaa tacgaacaca 300 ggctttcgta tgtgatgcgt ttgtacttac tttttctgtt gtttagctag ggtgttccga 360 catggttggc cgccatcaca aattacttac acgatgcaag tgagtgacgt gctctgtctt 420 tttagttcct tctaactgtt tatgttgtta gtctaataac atgagtttaa gcgaagtcag 480 tgaaagagct cgtagacgta gggcaactgt gaacttgtca agagtcctcg gatgtcatag 540 atctacttat aacagacgtt ggaataaatt acgtaaggat tttctgcgga gacatgactt 600 gctttcgttt agtaactttg ctaatgaaag cacaactaca ggtaactgac tttaacaact 660 atatatgtat aaactatgtc agtagacaga gaaagtaatt gtcatgagat tccttcaact 720 gtattacctt ccacaagctg taccaatgaa aggcttctgg aagataatgg tacgtgtttg 780 aagttcatac gctaataaat ttcttataga atgttctacc gtcgatttgc cctgtcaaaa 840 gcctttgagt cgcaaagaga agattaacag gatactaatg catttaatga catccaccta 900 tctgagcatc cgggatgcag acagggttct gaatgaattt cgccttatcg taccagactt 960 gccgacgagc attagaagta tcttgagaac gtgcacaagt gtacaaccta agtccatcag 1020 cagcggggtt tactaccatc ttggattgaa aactaacctc ctaagatatg ttgaactttg 1080 gttgtgtact tgcgattttg acagtctgca attatatatt aatgtagatg gactttctat 1140 gtccaggagc tccagtcaac atttgtggcc tgtattagga aggatcgttg cccccagact 1200 tagtgatgtt ttcatgatag ggatctacgg agggaatact aagccggcac aatttaacga 1260 aatttctgcc gacactattt ccgaaataaa agaaatgact gaaacgggtc tactaagtgt 1320 aaggttcaat aaatacatag ctattaagtt aagtgcagta atatgtgatg caccggctcg 1380 ttcggatgtt aggtacactg ttaatcacaa cggaaaggca ggttgcgacc ggtgcgtggt 1440 taatggtaga cgacttgatg gaaagatgac tttcccaaac ggtgaatata cgttaagaac 1500 ggacgacagt ttcaggaatc agacacaata tattcaccat aaaggtcatt ctttattcga 1560 aagtctatca atagacatga tattaacttt ccccttagat cccatgcaca tggtttactt 1620 aggtgtcaca aagaaacttg tgactctctg gatagaatta ggacacaaga gactaaagaa 1680 catgaactca tgtgtaatta ggaccataaa taagttgata tctaggtgcg tagaaagtac 1740 tccctctgac tttccgcgta aatgtcgaac attagactac ttatctgtat ggaaagcgtc 1800 agagtgtagg ttgtttcttc tatacttagg cccggttatt ctccaaaata ttctgcctga 1860 atctttgtat ataaatttta aatgtcttgc tttgtctatg tacttgcttg cacatcctaa 1920 gttttataat agggttacag aatctgtcag gatggattta aggaattttc taagagaata 1980 tgaatggtgt tacggatgtg agaacttggt atataatgtg cattccctac agcatcttct 2040 tgacgatgtc ctagcacatg ggccgttaga tagcttctca gcatttccat tcgagtcgta 2100 tatgagacaa attaagcaat cagtacgcag tgggtattcc gtggctaagc aagcagctca 2160 gcgttatgcg gaacaaatgt ctttctgcga taggctccag gttgacattt cgactaatga 2220 taacccagtt ataggtatgg ctgactcacg caagcaagta ataatgttta ggaatagcca 2280 aataaaatct tttcatcctg ataacgttgt agtagtaaag gggaagccgg gattgattac 2340 tgatattcag gatagtggcc tattgagatt caggcaattt actgacccac aaaattactt 2400 tacagatcca tttccttcca ctgacatagg aatatttaaa tgctccgtag ttagcagtgc 2460 ttacagttgg gtttccatcc gtgatgttga ctgtaaatgt atgtctatta attgtgttaa 2520 tcatttggtt attgtccctt tgttgcacac tgtaatttaa ttgtatttcc ttaaatttcc 2580 tgttatgata tctttagtct ccaatcaata agtacgtaat ccttaagaaa caaggcaaaa 2640 gtaagctcgt gattgcatcg ggactttggg tgattggaga aaaacgctgt ctgtatccga 2700 gggagaacat tgaccttctt ctgcaaaaca atgacctacc ggaagaaaat tggtctatgt 2760 tcaagtgcag tattgttcac aaatgcggtt agttacatac atgcaaatta tcgcttcttg 2820 tagaaagtct caaatcagcc aatggattgt tagcagcgtt gcaagtgtgc tatgctactg 2880 atgagaacta tccgcccgac gaaactacgt tgcccttaaa cacagataaa caaaagaggt 2940 cagttcaaca atctcactta catttcgtag aagcgcatta aaacgtaaag aaagttttct 3000 ttacaattcc atggacatgg atgtcgcgga gatgcggctg ggaatacccg agtccaatat 3060 ccaaaacttc ggatgtttga gaacgttgtc tctcccattg ccaagtaagt atctttgatt 3120 tggacatgtc tattaagcag ttcgacacaa ttagacagct cgccctccac atccgaggat 3180 aaaaatctaa ctgctcagta agtcatccct acgatctact tctatttttc agatttgaaa 3240 gaatgtatac tttgcagctt aaaatgtcag caagcttgag tgaaattagt acaaatataa 3300 agctgctgct ctcaaggttc agtgactgcc aagatccgcg tcaatttgac tgtggactat 3360 ccagccaaca gttccctttg tcatctgaag aggaactggg aactttggac gcttgtctag 3420 agcaaaaaga cgtgagagac agatttgtaa gcacacaaca tgatttttat ccttcattcc 3480 tattacagat ggccatgttg acacgtctaa tggatgatga tcctaaaacc tctatgcgat 3540 atattctgtc ctacatcatg aaaactgaag tcgccattaa tttcacgtta ctgggtactt 3600 catcaaaacg agcgattcag aagtgccggt tttacggctg cgtacgaagt aagttaaata 3660 aggtttttaa acttcaggtg cacttactaa tcgattcctg tcgtcatctg tcaacaagaa 3720 agaccttacg aaactatatg acatggctac acaaagttat tttcacgaaa tgagagacaa 3780 agtacagaag ggtcatcgac ggaaccaaga acacgtacgt acgactttaa agtgaatatt 3840 tttctcgtta tctataggta gagtctttcg tgttcaagga tataaccgta agttcagctt 3900 caagtacctt gttgtgcaaa acgagtttcc tagatatagt aacgttcttc ttcatcacat 3960 tttagaattc acaagaattg tttgactctc catagtccaa caggcctgct ggaaagaatg 4020 aatacgtgaa atgcctggtc ggtgacaatg gcaattacat aaggagcatt gatactctgc 4080 caactgctca ttactttgta cattgaagaa ctttttgttt tcaataaaaa tttggaaatt 4140 ataccatatt ttacgtactt tacattattc gtgtgctgtg ttgtgggaat gaaagcgaac 4200 acaggaggcg gttgcggtaa atgcattagt gtgtatgtag tggaaaagta gtaaatgtag 4260 cggatattta aggtagaagt cagaaattcg tccgatattt agggtgaaaa taggaaattc 4320 gtcggatatt taggggacaa gtcggctatt cgtcgtatat ttaggggaaa agtcggctat 4380 tcgtcgtata tttaggggac aagtcggcta ttcgtcgtat atttagggga aaagtcggta 4440 attcgtcgga tatttagggg aaaagtcggt aattcgtcgg atatttaggg gaaaagtcgg 4500 taattcgtcg gatatttagg ggaaaagccg gctattcgtc ggatatttag gggaaaaatc 4560 ggaaatttag cgacatttga aaaacagagc gacagaaagg ggaaattacg ctttatttcc 4620 cccggacagg cgaaaaagca gacattttgg cgcatttccc ctttatttac gacttatttt 4680 ccaatgcatt ttccgcttta tttccccttt gttcgccttt attccccttt attggttgtt 4740 tggg 4744 // ID Ingi2 repbase; DNA; INV; 5879 BP. XX AC . XX DT 28-JUL-2009 (Rel. 14.07, Created) DT 28-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE A family of Ingi non-LTR retrotransposons - consensus sequence. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; I group; KW Ingi2. XX OS Trypanosoma brucei OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma. XX RN [1] RP 1-5879 RA Kapitonov V.V. and Jurka J.; RT "Ingi2, a family of Ingi non-LTR retrotransposons from RT trypanosoma."; RL Repbase Reports 9(7), 1397-1397 (2009). XX DR [1] (Consensus) XX CC The consensus sequence was derived from multiple alignment of 25 CC copies (they are ~87% identical to the consensus sequence). XX FH Key Location/Qualifiers FT CDS 157..5676 FT /product="Ingi2_1p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="MKIMKISLVTPATSSWCQGLVLRPGGSQVPALYPVRQ FT DWGWKQAFGDQRRSKKQHRHSGVKKYIRLLLCTCERASIGIHRLLLLLSGD FT VEQNPGPIIRGAQWNAGGLSQAKRIALEKKLHEDMVVFCLLQETHLAAEEC FT CALKISGYQHMGMARSPHGGGVSILVKDGVGVIQGPKEEGLPERVSATLMF FT SADLMLTIKSAYFPQKKNVTTSTLESLESSNRTLVIGSDTNAHHELWDPLR FT PNDTAGECIVDWCLQNDIKIANDGTPTRRMPHTATRSSPDVTMYRDCEVTR FT WEASLSPDSDHYWITFEVCIGIGVEMIAITKPRRALYAWNKANWREFRKVC FT DVTMLKKLRKSRKNVDAMNEAVTKGIRTAAKKTIPKGKGVSPPFWTRELTE FT LDTKIRECTNEKHRDALIYKRRKVLAETSLQRWKENVSKMSVTDTKSWNLV FT KSIYAPRPLTSPVLVVNDHPLTRRQQANKLAQMYRERSTKAPDAPPMSIPY FT RVSGEFAPITMAELETALKELSSGTAPGKDDIHCEQLKQLGRCSKKCVLKL FT FNCSLQRHRVPEKWKQGVIVSLLKPNKPAKCMSSFRPITLTSTLCKLMERI FT VARRIRDTVECKLQPQQAGFRPGRSTSLDALMTLTDSVTHRVAGEKTAAVF FT IDYARAFDSVDHRCIIDALKRYEVDMHLIAWVANFLEGRTAVVRINNTLSK FT KIQLTCGVPQGSVLGPLLFIIAMDSLSAKLNAIPALKHCFFADDLTIVCSN FT TDREAIRQTLQTGLDCIASWSAEHYMEVSAEKTEYTFFGVRDRDSLHLMLG FT EHRLNETRNPRLLGVTMQPHKGVSKHAHTLQTAANMRLMKLRAVASPEWGA FT SKESVRAFYLALIQAKLCYGIAAWWFNASPSDKEKLERVQAQAAHVIAGIP FT KYANRQDVMKEARLHKIDDIAHRRAIEYYLTRKANGKASADTVEKMFPPSH FT PIHTRLQSVSHLYEEVDRPEKPHTASVLQLARRPYFNTTTPGGLHSDASDS FT EKKRHTMQRVRRFKDYDYQVWTDGSVILDESSGAGAIIYPRQGNAVKVKKG FT AGRLACSYRAECVAMEAGLMKLKSVIKINESRKTKVAIFTDSLSLLMALKT FT GPAVVTDSILRRIWNLIIHFMKLRVAMSFQFVFLHCGIPRNDKVDELANKG FT NHMPQTYPAWITDIVTGITRQLRNQAQRDFAEGRGPNTHRSALLNHIKPTP FT KKKELDREDESLLAQFRTGTSKHFGWLHRVITRKVDELECRWCTNHTGNIL FT KNTPVPTQNTVKMEETEEGKVVIATRQNDPVICPVCNIVCARRQAGVVHMM FT KIHQWVRENALAALKMAPRAAQAYNGVYRCHVCGKVFDRKGLLQQHMPQHP FT PTTPVVWTERQKRKREEEQSLPTVHKCQWCPKQYATYGWLTRHIRLKHPEK FT QTEVTQQTEMQDVSSEDEQEQAPQAYECLRCKQVYKSKTWLTRHKCNPEQS FT KGEPSTPLQEALPEREACKICGKEYHHSWMLRHMNAKHPGHDISLRPQPIK FT IRKVTKEISKLQAKEDTSGDNKNKDDNEPPLYVCDKCKRGFKSKTWLTRHK FT CVSIERNGTKSENVNNPHRAATQLETTTKNGSKPGSIDRALPNSDSSHTQN FT LNSTTRKRKRETEHDTPADEMDMCDQREEPKHKRHKAQGHTENPRREFACG FT RCENTYMSWCTLVWHTRTHHKNAITVKRKREEGPLEITPLAPLVHQCPHCS FT KVCAHKQYLTMHLQAAHGQARSSDRRNELKEKCKETSAHLLQCPALSKLRS FT EHGVDSLKGEDIYFSVKLANFLRALFFPTANPAPTIDTKKLITRPRVKRKR FT KKGHHHAKYPLTTCDTLG" XX SQ Sequence 5879 BP; 1929 A; 1304 C; 1481 G; 1164 T; 1 other; ttgtggctgg agaaaaggca aagagggtga aaccgaacca gaaaaggaga agtgcaraaa 60 ttgtagtttt ctcgtcaata agaaatttgc tctgattgct gccgctttta tgagttttgt 120 agaattttag catttaaagg atttttgctc aattttatga aaattatgaa aatttccctg 180 gtgacgccgg ccacctcatc gtggtgccag ggtctagtac tccgtccagg aggaagccaa 240 gtgcccgcat tatacccggt ccgacaggat tggggctgga aacaggcttt cggcgaccag 300 agaaggagca agaagcagca tcggcactct ggggtgaaga aatatattag actgctcctc 360 tgcacgtgcg agcgtgcctc gattgggata cataggcttc tactactact tagcggagat 420 gtggaacaga atcccggtcc catcattcgt ggagcccagt ggaatgctgg gggtctctcc 480 caggcgaagc ggatcgcctt agagaagaag ctccacgagg atatggtagt cttctgtcta 540 ttgcaggaga ctcacttggc agcagaagaa tgctgcgcat taaaaatcag tggataccaa 600 cacatgggga tggcacgttc tccacatggt ggtggggtat caattttagt caaagatggc 660 gtgggagtca tccaaggtcc gaaggaggag ggtctgccgg aaagagtctc ggcaacactg 720 atgttttcag ctgatctaat gttaactata aaatcagctt acttcccaca gaagaaaaat 780 gtcactacct ccacgcttga gagccttgag agctcaaata gaacactggt gataggatcc 840 gatacaaacg cacatcatga gttatgggac ccgcttcgcc caaatgatac agcgggagaa 900 tgcattgtag actggtgtct acaaaatgat atcaaaattg caaatgatgg caccccaacg 960 agaagaatgc ctcacacggc aaccaggtcg tctcccgacg ttacgatgta cagagactgc 1020 gaagtcacaa gatgggaggc aagcctcagt ccagatagtg accactactg gataacgttt 1080 gaggtgtgca ttggcattgg tgtagaaatg atcgccatta caaagccacg aagggcgttg 1140 tatgcttgga acaaagcaaa ttggcgggag tttagaaaag tgtgtgatgt aacaatgttg 1200 aaaaaattaa ggaagtcgag aaagaacgtg gatgccatga atgaggcagt gactaaaggc 1260 attcgcacag cagcaaagaa aacaataccg aaaggtaagg gagtatcgcc accgttctgg 1320 acacgagaac tgacagaact ggacacaaaa ataagggagt gtacaaatga gaagcacaga 1380 gatgccctta tatacaagag aaggaaggtt ctcgctgaaa cgtccctaca acggtggaag 1440 gaaaatgtgt cgaagatgtc agtgaccgat acaaaaagtt ggaatttagt taagtccata 1500 tatgcaccga gaccactgac atctccggtt ctggtagtga atgaccaccc tctaacacgc 1560 agacagcaag ctaataagct agcacagatg tatagagagc ggtcgacgaa agcaccagat 1620 gctcctccca tgagtatacc atatcgagtg agcggagaat tcgcaccaat aacgatggca 1680 gaacttgaga cagcactgaa ggaactctct tcaggcactg cccccggtaa ggacgatata 1740 cattgtgaac agttgaagca attaggacgc tgtagcaaga aatgtgtcct caagctcttc 1800 aactgtagcc tacagagaca ccgggtgccg gagaagtgga agcagggcgt tattgtctca 1860 ctactcaaac ccaacaaacc ggccaagtgt atgtcatcgt tccggcccat cacgctcact 1920 agcaccctct gtaaactcat ggaaaggatt gtagcgagac gcatacgaga cacagtagag 1980 tgtaagctac aaccacagca agctggtttt cgtccgggga ggtctacgtc gttagacgcc 2040 ctgatgacac tcacagactc cgtgacacat agagtagcag gagagaaaac tgctgcggta 2100 ttcatagatt atgcgagggc atttgactct gtggatcaca ggtgcataat agatgcatta 2160 aaaagatatg aggtagacat gcacctcatt gcgtgggttg caaattttct ggagggacgg 2220 actgcagtag tacgaataaa caacaccctc tcaaagaaaa tacaactcac gtgtggtgtt 2280 ccacaaggtt ccgtgttggg gccgttgcta ttcattatcg caatggactc cctcagcgcg 2340 aaattaaatg ccataccagc ccttaaacac tgcttcttcg cagacgacct cacgatagta 2400 tgctctaaca ctgaccggga agctattcgg cagacgctac aaacaggatt ggattgtata 2460 gcgagctggt cagcagagca ctatatggag gtgtctgcag aaaaaacgga atacacgttc 2520 tttggggtga gagacagaga ctcattacac ctaatgctgg gagagcacag attaaatgag 2580 acgagaaatc ccagactcct tggggtgaca atgcagccac acaagggtgt aagcaaacat 2640 gctcacaccc tacaaacagc agcaaatatg agattaatga aactacgggc tgttgcatca 2700 cctgaatggg gggcgagtaa agaaagcgtc cgagcatttt acttagcctt aatccaggca 2760 aagttatgct atggtatcgc agcatggtgg ttcaatgctt ctccatcaga caaggagaag 2820 cttgaacgag tacaagcgca ggcggcacat gttatcgcgg gtattccaaa gtacgcgaat 2880 cggcaagatg tcatgaagga ggcccgcctt cacaagatag atgatattgc tcacagacga 2940 gcaatagaat actatttaac aaggaaggcg aacggtaagg catcagccga tacagtggaa 3000 aaaatgttcc cacccagtca tccgatacac acgcgcctgc agagtgtgtc acacttgtat 3060 gaggaagtag acagaccgga gaagccacac acggcaagcg tattgcaact ggcgaggaga 3120 ccgtacttca acaccacaac accaggtggt ctccattctg acgcttccga cagcgagaag 3180 aaaagacaca caatgcaaag ggtacgtaga ttcaaagact acgattatca agtgtggacg 3240 gatgggtcgg tgatactgga cgaatcctct ggtgccggcg cgataatata ccctagacaa 3300 ggtaatgcgg taaaagtgaa gaaaggagca ggcaggttag catgcagcta cagagccgag 3360 tgtgtagcca tggaggcagg acttatgaaa cttaagtcag taataaagat caacgagagt 3420 agaaaaacga aagtggcaat attcacagac tcgttatctt tattaatggc acttaaaaca 3480 ggtcctgcag tagtgacaga ttccatacta cgacgcatat ggaatctgat catacacttc 3540 atgaaactga gagtagccat gtcgtttcag tttgtgtttt tgcattgcgg tatcccgaga 3600 aatgacaaag tggacgagct cgcaaacaaa ggaaaccaca tgccgcaaac ctaccctgca 3660 tggatcaccg acattgtcac aggtataacg cgccaattac gtaatcaggc acaacgcgac 3720 tttgcggaag gccgcggccc aaacacacac cgcagtgctt tactaaacca tataaaaccg 3780 acaccaaaaa aaaaggagct agacagggaa gatgagtcac tgttagctca atttagaaca 3840 gggacttcaa aacattttgg atggttacat agggtgatca cacgaaaggt tgacgaactg 3900 gagtgtagat ggtgtacaaa ccacacgggg aacattctca agaatacccc tgtccccact 3960 caaaacactg taaagatgga agaaactgag gaaggcaagg tggtaatagc tacaagacag 4020 aatgaccctg tcatatgccc agtttgcaac atcgtgtgtg cgagacgaca agcgggagtt 4080 gttcacatga tgaaaataca tcagtgggtt cgtgaaaatg cactagctgc attaaagatg 4140 gcgccaagag cagcacaggc gtataatggc gtctatcgat gtcacgtatg tggtaaagta 4200 ttcgaccgaa agggactact gcaacaacac atgccacaac accctccgac aactccagta 4260 gtatggacag agcgacaaaa gaggaagcga gaagaagaac agtcgttacc aactgtccac 4320 aagtgtcaat ggtgcccgaa gcagtatgcc acgtatggct ggttgactag gcatatacgt 4380 ctgaaacatc ctgaaaagca aaccgaagtg acacaacaaa cggaaatgca ggatgtgagc 4440 agcgaagatg aacaagagca agcaccacaa gcatacgaat gtctccgctg caaacaggtg 4500 tataaaagca aaacctggct caccagacat aagtgcaatc cggagcagag taaaggggaa 4560 ccaagcacac cactacagga agcactgccg gagagagaag catgtaagat atgcggtaaa 4620 gaataccacc acagctggat gttacgccat atgaatgcaa aacaccccgg gcatgatatc 4680 tcactcagac cacagccgat aaagataaga aaagtgacga aggagatatc aaagttgcaa 4740 gcaaaggaag acactagtgg cgacaacaaa aataaggatg acaatgagcc accactttat 4800 gtttgcgaca aatgcaagag gggtttcaaa agcaagacat ggctaacacg ccacaagtgc 4860 gtgagcatcg aaagaaatgg cacgaaaagt gagaatgtga acaaccccca tagagcagct 4920 acacaactag aaaccactac caaaaatgga tcgaagcccg gctccataga tagagcactc 4980 cccaactcag actcgtcaca cacacaaaac ttgaactcca cgacaagaaa gagaaaaaga 5040 gagaccgagc atgacacgcc agcagatgag atggatatgt gcgatcagag agaggagccg 5100 aaacacaaac ggcataaggc tcaagggcac acagagaacc ctcgtagaga gtttgcatgt 5160 ggaaggtgtg agaacaccta catgtcatgg tgtacactag tatggcatac caggacacac 5220 cacaaaaatg caatcactgt aaagaggaag cgtgaagaag gaccccttga aatcacgcca 5280 ctggcccctc tcgtgcacca atgcccacac tgctcgaagg tatgtgcaca caagcaatat 5340 ctaacaatgc acctccaagc agcacatggg caggcacgta gctctgatag gcgaaatgag 5400 ctaaaagaga aatgcaaaga aacatcggct cacctgttac agtgtccggc tttgagtaaa 5460 ctcaggtctg agcatggagt agacagtctg aaaggggaag atatatactt tagtgttaag 5520 ctggcgaact tcttaagggc actcttcttt ccgacggcaa atccggctcc caccatagac 5580 acgaagaaat tgataacccg accgcgagta aagagaaaaa gaaagaaagg tcaccatcat 5640 gctaaatatc cactgactac atgcgacaca ctcggatgac agccagtgag attttcctca 5700 aatgatggaa gacaatatcc aaagggggct tcctcaacaa ggcaccgtta gccaggcccc 5760 acaggaggaa gcgagtggcg gatcgtgagg atgggaaaca tatcagcgaa agaagtagcc 5820 gccatggaag tctatataga ttgattgagg tgcgtgcagg ttaattcaca atgaaaaaa 5879 // ID BEL-119_AA-I repbase; DNA; INV; 6042 BP. XX AC supercont1.255; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-119_AA_; KW BEL-119_AA-LTR; BEL-119_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6042 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.255; Positions 1249929 1243888. XX CC Positions [5104-5661] - Integrase core CC 'CGGTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..1597 FT /product="BEL-119_AA-I_2p" FT /translation="MRAAHDCGACDAPNSADVGMVACDGCGVWYHYACAKV FT SPGVQHRSWRCSKCPPEPIPGTTGAKKKTNKKQPASLTVLVAPSEIHKSSD FT ASQRKTSEKSKNSDKSKTLVVPILAEGDNTTPKKHPEIPAVDKSAHGEMRS FT SKSSTSTARARAQLALQRLEDERRLEEQKLKEERARLEEERIRLEKERQLK FT EQEHAIKAKELAMQEKYLRDKFELEERIADDGSSSKSSVLSRKDRTSAWLK FT SQHDLSKRGDNSVTQYSGWPNLVADLAASDRFQRSVDLKVDQCELEPRQTP FT EVAPNYQILGDNVQNEPVGNHFRSARHPATINAGLSQSSCEDRAAVGLNPR FT SHGAGPNSDQILARQIWPKKLPVFSGDPEEWPIFVHSFETANNACGFTDVE FT NIIRLRECLRGPARDAVVTKLMFPQSVNAIMETLRRLYGRPELLVKNLLAK FT VHRIETPKPERLESLINFGLTVQQLCDHLEAANLSGHLSNPTLLGELVEKL FT PASIKLEWARFKRAYSEPPSNTLERLWES" FT CDS 1588..6042 FT /product="BEL-119_AA-I_1p" FT /translation="MGELVYDASEVTSPIQPKTAVVKADKDKPKEKGHVYS FT HEDAAEVQNRGEERRPCPICGKTDHRVRNCERFQQLGLEGRLKAVDRCNLC FT EVCLFDHGQWRCRSRIRCNVGNCRDRHHPLLHRSGHGPAQEQRQRQFRASE FT CNTHERPQRSVLFRIIPVTLFNGNRKCETFAFLDEGSSLTLIESSLARQLG FT ATGVSEPLELRWTSSVKRNENSSKRVDFEISGKGQPQRYILKNAHTVGELN FT LPSQSLTINELSERFPHLRNLSVSSYTEAVPRILLGLENLSLFAPLDCCIG FT RPGEPIAVKSLLGWSVYGPDGNAPPKSGFVNVHECNCDADRELNDLVRQQF FT ILEDSVIATVPSPESVEEKRAREILASTTKFVEGRYETALLWKADEIDLPD FT SLPMAMKRMKSFEAQLAKDSSLRESVNQQIVDYIRKGYVHKATEEELREIN FT RRQVWYLPLGLVTHPMKQKKRLVWDGKAQVNGISLNSQLLKGPDLLVSLPS FT VICKFREKRVGFGGDIREMFLQLRMRTEDKFFQCFLFRFDSRHPPEVFIAD FT VAMFGATCSPCVAQHVLRENADKWADEFPLAAAAIKDKTYMDDYYDSADTP FT EEASELATQVRTIHARGGFEMRNWVSNSEEVLEKLGENSSMEPRLLQTSPE FT EKWERVLGMLWHPESDTLTFSTELGEQLIPYVSGKQRPTKRIALKIIMSLF FT DPLGLLAPYLIHGRTLIQDLWRSGVQWDEKMRDEEYEKWTRWVELLPSISE FT LNIPRCYFGEAYPPCYQTLQCHVFTDASESGYGCAVYFRSADNTGRLQCSL FT IMAKSKVAPLKHLSIPRLELEAAVLGARMLNTVLSNHSLQPRKVYLWTDSS FT TVLAWIHSDHRRYKQFVAHRIGEILSLTQSECWRWVPSKHNVANCLTKWVR FT DTEPDSNGRWFRGPAFLYLSEDLWPQQRTKVNPTEELRSTYLLTHISLPSQ FT LIDIGRFSKWSVLLRTVACIYRFISNCRLRLAGRPIETILATQNLAKLSKG FT PVTACAVPLKQEEFLQAERFLWRMVQGEHYPDEVRILLKNRDQPIEKWIAI FT ERTSPLYAFSPFADEFGIIRMEGRTADAVHANFDAKYPIILPKDSEITRRL FT LDHYHRQYGHANKETVVNEVRQRFQISRLRAAVDNVFRDCQSCKVNKCKPL FT PPRMAPLPEQRLTPYVRPFSYVGIDYMGPLEVTVGRRKEKRYVAVFTCMVI FT RAVHLEVSYDLSSESCIMSIRRFIRRRGSPVQIFSDNGTNFVGANRELQKQ FT IQRIDLECAGTFTDARTKWSFNTPSAPHMGGVWERMVRSVKEAMATLDDGR FT KLTDEILWTTLIEVEGMINSRPLTYMPQEPENPEALTPNHFILGHSSGAHE FT PLEPPEDLGQTLRSSFMRSQHLANIAWERWSKEYFPAINRRMKWLDEARSL FT KVGDVVYITEGKRRSWIKGIIDEVIPGKDGRVRQAIVRTASGKLKRPVIKL FT AVMELGESTVDPPLDSRGGG" XX SQ Sequence 6042 BP; 1659 A; 1392 C; 1627 G; 1364 T; 0 other; atctcaaaga tttaagacca ggatgcgagc tgcccatgac tgcggggcct gcgatgcgcc 60 gaactccgct gacgtaggta tggtggcatg cgacggatgt ggtgtttggt accactatgc 120 ttgtgccaaa gtgtcgccgg gagtccaaca ccgatcgtgg agatgcagta agtgcccacc 180 tgaaccgatt ccggggacca ccggagcaaa gaagaaaaca aataaaaaac agccggcgag 240 cttgactgtc ctcgttgcac cgagtgagat acacaagtcg agcgatgcca gccagagaaa 300 gacctcggag aagtcgaaga actccgacaa atccaagacc ttggttgttc ccattcttgc 360 ggagggtgac aatactactc cgaagaaaca ccctgagatt cctgcggtgg acaaatcggc 420 ccatggggag atgcgatctt ccaaatccag cacttcgacc gccagagctc gagcgcaatt 480 agcattgcag cgattagaag acgagcgacg attggaggag cagaagttga aggaagagcg 540 agcgcgattg gaagaagaac gaattcgtct agagaaggag aggcagttaa aggagcaaga 600 gcatgcgatc aaggccaagg agttagcgat gcaggagaaa tacctgcgag acaaatttga 660 gctggaggaa cgaatagcgg acgacggaag cagcagtaag tccagcgtac taagtcgaaa 720 ggatagaact agtgcttggc tgaaaagtca gcatgacctg agcaagagag gagacaacag 780 cgtcacacag tattcggggt ggcctaattt agtagcggat ttggcagcat cggatcgttt 840 tcaacgtagc gtcgatttga aagtcgatca atgcgagtta gagccgcgcc aaacccccga 900 agtggctcct aactatcaaa tcctcggcga taatgtccag aatgaaccag tcggaaacca 960 ctttcgaagt gcgcgacatc cggctaccat taacgctggt ttgtcgcaat cttcatgtga 1020 ggatcgtgcg gcagtcggtt tgaatccacg ttcgcacggc gcggggccaa atagcgacca 1080 gatattggcc aggcaaatct ggccgaaaaa gcttcccgtc ttttcgggcg accctgagga 1140 gtggccgata ttcgtccata gcttcgagac agcgaataat gcgtgcggtt ttacggacgt 1200 ggaaaatatt atccgcttga gggagtgctt gcgaggtcca gcgcgagatg ccgtcgttac 1260 gaaattgatg tttccccaga gcgtgaatgc gatcatggaa acgttgcggc gattgtatgg 1320 tagaccagaa cttttagtga agaatctact ggctaaagtc caccgaatag agactcccaa 1380 accggaacgt ttggaatcgt taataaactt cggtttaacg gttcaacagt tgtgtgatca 1440 cctagaagcc gccaatttga gcggccatct gtcaaaccct acgttacttg gcgagttagt 1500 ggagaaacta ccagcttcaa taaaactcga gtgggcgaga ttcaagcgag cgtattccga 1560 accaccctca aacactttgg agcgtttatg ggagagctag tgtatgatgc cagcgaagtt 1620 acttctccaa tccagccgaa gaccgctgtt gtgaaagccg ataaggacaa gccgaaagag 1680 aagggacacg tctattccca cgaagatgca gctgaagtac agaaccgagg agaagaacgg 1740 cgaccttgtc cgatatgcgg taaaaccgac catcgtgtac gaaactgtga acgatttcag 1800 caattgggtc tagaaggtcg tcttaaagcc gtcgatcgat gtaacttgtg cgaagtctgt 1860 ttattcgacc atggtcaatg gagatgccgt tcgaggatcc gatgcaacgt cggaaactgt 1920 cgcgatcgcc atcacccact gcttcaccgt tcagggcacg gtccagcgca agaacagcgt 1980 cagcgtcaat ttcgtgcctc ggaatgcaac actcatgagc ggccacagag atcggttctg 2040 ttcaggatca tcccggtaac gttgtttaat gggaaccgaa aatgcgaaac tttcgccttt 2100 ctagacgaag gttcatcttt gacgctcatc gaatcaagtc tagcacggca gcttggagca 2160 actggcgttt cagaaccgtt ggagttgaga tggacgtcga gcgtgaagag aaacgagaac 2220 agttcaaaac gtgttgattt cgaaatttct ggaaaaggac agccccagcg atacatattg 2280 aaaaacgcgc acactgttgg agaactaaat ctcccgagtc agagcttgac catcaacgaa 2340 ctatccgaga gatttccgca ccttcgaaac ctctctgttt cgtcatacac cgaggccgtg 2400 cccaggattc ttctaggatt agaaaatctt agtctgttcg cgccgttgga ttgttgcatt 2460 ggcagaccag gagaaccgat cgcagtaaaa tcgctcctag ggtggtccgt gtatggtccg 2520 gatgggaatg caccacctaa aagcggattt gtaaatgtgc acgagtgtaa ctgtgatgca 2580 gacagagaac tgaacgatct agttcggcag cagttcatcc tggaggacag tgtgatcgca 2640 acggtgccct cgcctgagtc ggtcgaagag aaacgcgccc gcgaaatttt agcgagcacc 2700 actaagtttg tcgaagggag atatgaaacc gctctgcttt ggaaagctga cgaaattgat 2760 ctacctgata gcctccctat ggcgatgaag cggatgaaaa gttttgaagc tcaattagcg 2820 aaggattcaa gtcttcgaga aagcgtgaac cagcaaatcg tcgactacat acggaaaggc 2880 tacgttcaca aggcaacaga agaagagctg agagagatca acagaaggca agtctggtac 2940 ctcccgttgg gcctggttac gcatccgatg aagcagaaaa agaggctcgt gtgggacgga 3000 aaggcacaag taaatggaat ctccttaaat tctcaacttc taaaaggccc tgacttgctt 3060 gtgtcgcttc cgtcagtgat ctgcaaattt cgtgagaagc gcgtaggatt cggaggggat 3120 atccgcgaaa tgtttttgca gcttcggatg agaactgagg acaaattctt ccaatgcttc 3180 ctgtttcgct ttgactcacg gcaccctccg gaagtcttta ttgcagacgt agcaatgttc 3240 ggcgccacat gctcgccatg cgtcgcgcag catgtactac gagaaaacgc cgataagtgg 3300 gccgacgaat ttccactggc tgcggcggcg ataaaagata aaacttatat ggatgactat 3360 tacgatagcg ccgacactcc agaagaagcg tcagagttgg caacgcaagt tagaaccatc 3420 cacgcccgtg gagggtttga aatgaggaat tgggtgagca acagtgagga agtgttggaa 3480 aaacttgggg aaaactcgtc catggaacct cgtttactgc agacaagccc ggaggaaaaa 3540 tgggaacgag tcttgggaat gttgtggcat ccggaatcag acacactgac gttttcaaca 3600 gaacttggag aacagctgat cccctacgtg tctggaaaac agcgcccgac aaagcgaatc 3660 gctctcaaaa tcatcatgag cctgtttgac cctttagggc ttctagcacc gtatcttatc 3720 cacgggcgca ctttaatcca ggatctgtgg agaagcggtg tgcaatggga tgaaaaaatg 3780 agagatgaag aatacgaaaa atggacgcgt tgggtggagt tgctgcctag cataagcgag 3840 ctcaacattc cacgatgcta tttcggcgaa gcataccctc cttgctatca aacgttacaa 3900 tgccacgttt tcaccgatgc tagcgagagt gggtacgggt gtgcagtgta tttccgcagt 3960 gctgataata ctggacgctt gcaatgctcg ctgattatgg caaaaagtaa ggtcgctcct 4020 ctaaagcatc tatcgatacc tcggttggaa ttggaagcgg ccgtacttgg agcaagaatg 4080 ttaaacaccg ttttgtcgaa tcactctctt caacctcgca aggtgtacct atggacagat 4140 tcttcgactg ttctcgcttg gatccattca gaccacaggc gttacaaaca atttgtagcc 4200 catcgtatcg gggaaattct ctctctgacg caatcggaat gctggcgatg ggtaccgtct 4260 aaacacaacg tagctaactg tttaacgaag tgggtacgtg atacggagcc ggattcgaac 4320 gggaggtggt tccgtggtcc agcgtttttg tacctttctg aggacctgtg gccacagcaa 4380 cgaaccaagg tgaatcctac tgaggaattg cgatcgactt acctgctcac tcacatttcg 4440 ctgcccagtc aactgataga catcggcaga ttttcgaagt ggagtgtact gttgcgcact 4500 gtggcgtgca tttatcgctt tatcagtaac tgccgattac gcctagctgg acgaccaata 4560 gaaacgattc ttgcaacgca aaacttagcc aaattatcga aggggccagt gacagcgtgc 4620 gctgtgccat tgaagcagga agagtttcta caagctgaaa ggtttctctg gagaatggtg 4680 cagggagaac actaccctga tgaagttcga atactgctga agaatcggga tcagcctatt 4740 gagaagtgga tagctatcga gaggactagc ccactgtacg cattttcacc ctttgccgac 4800 gaatttggaa tcattcggat ggaaggaagg accgccgatg ctgtacatgc taatttcgac 4860 gcaaaatatc ctatcatcct tccaaaagat agcgaaatca cacgacgtct tctagatcac 4920 taccatcgtc aatatggtca tgcaaataaa gaaaccgtcg taaacgaagt tcgtcaacgg 4980 tttcagattt cacgtcttcg ggcggctgtt gacaatgttt ttcgtgactg tcaatcttgt 5040 aaagtgaaca aatgcaagcc tcttcctcct cgaatggctc cgctacctga gcaacgtctg 5100 accccgtatg ttcgcccttt cagctacgtc ggcatagatt acatgggccc actggaggta 5160 actgttggtc ggcgcaaaga gaaaagatac gtcgcagtat ttacgtgcat ggttattcga 5220 gctgtgcact tagaagtctc gtacgatttg tctagtgaat cgtgcatcat gtcgatccgg 5280 aggttcatac gcaggcgcgg atctccagtt caaatcttct ccgacaacgg gactaatttt 5340 gttggtgcca atcgagagct ccagaagcag atacagcgaa tcgatttgga gtgtgccggc 5400 acatttaccg acgctcggac gaaatggtct ttcaacaccc cctccgcacc acacatgggc 5460 ggagtgtggg agcgcatggt gcgtagcgtt aaagaggcca tggctacgtt ggacgatggg 5520 aggaaactta ccgacgaaat tctgtggact acgttgatag aagtggaggg aatgatcaat 5580 tctcgaccac ttacctacat gccgcaagaa ccagaaaatc cagaagcgtt aacgcccaat 5640 cattttattc tggggcattc atctggtgct catgaaccgt tggaaccacc ggaagatctg 5700 ggccagactc ttcgaagcag ttttatgcgc tctcaacatt tagcgaacat cgcttgggag 5760 cgatggtcta aggaatactt tccagccatc aaccgaagaa tgaagtggct ggatgaagcg 5820 aggtcgttga aggtcggaga cgtagtttac attacggaag gcaaaaggag gtcgtggatt 5880 aagggcatca tagatgaagt cattcccggt aaggacggta gagtgcggca agcgatcgta 5940 cgtacggcat ccggtaagtt gaagcggccg gtaattaagt tggcagtaat ggagctgggt 6000 gaatctacgg tagatcctcc cctcgattca cggggcgggg ga 6042 // ID Sola1-N6_AAe repbase; DNA; INV; 1283 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1283 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1296-1296 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. 4-bp TSDs. TIRs are 28 bp long. CC ~72% identical to Sola1-N4_AAe. XX SQ Sequence 1283 BP; 373 A; 224 C; 278 G; 408 T; 0 other; ctgcccatac tcgcaataca gtcccattag gaaaatcacg atgtcgagaa aaacgcttct 60 aaacataaaa ccctccattt tcgttcttcc gctgctggaa gtagtaaatg ccggtggcat 120 tgctcaatat actatcattg caatgtgcaa aaaccgtaaa taatttaaat ttagtaaaac 180 tggttgtttt ttctatccaa attagggtta ttgcagatgg gactcgttat ccgagtattt 240 ctctcgctaa acgcttgaat acccgcatac cagtcccatt actggggatt cgcttacttt 300 gttatcgtgc accttgacac attgttgttc aagtttacga tcgattgcaa tgtacttttc 360 tttttgcagc tttcagttat caactatctt tcaatagtaa taatgtccct tgaagttctt 420 aagctgtaca gtggtgatat gtaattgaga aaatggtgaa gatttcctga attcaaatta 480 accgttgcgg tcggccatat ggaaggcttg ctgaaatcgc agcacgtgac gggtgaagga 540 tttgaaaact atcaaagaaa ggtgaataga acatgttttg gtccacttga tgaaggcgaa 600 tggcggttcc caagatatag ataatcctga tgggagtcag gagctttcct gtttctcggc 660 atttggtgct gcaaacactt agtttgacca tgtagacact ggcgacgatg ctagcgacga 720 tgtgcgtaat cttgcggcgg ccaaacaagc ttctcacttc tcgctgttgg acttcagcct 780 gccggagatt ctgataatga ttggccgcag gaagcagtag agtgaatgga gcagtagaat 840 agtctcagga aaactatttt gctttatttt agtttacagg caactagcct gatgtggtgt 900 tagtattaat cgaagaacag aagaagtcgt tatggtaaaa ccttttttaa acaaaaataa 960 gagtttttct tgattatagt tgatttgtac tttcctgatt tgtttgattg tttctataag 1020 gaaatgagaa tcgcaatgca agctttcaga aatgatcaca ggaataggtt tagttgaatt 1080 tcttcataat catcgaatgg gactgttttg cgggtacttt caatatggga cgcacatggt 1140 ttggtatttt tttcacattt tgtccataca aagacacaat ttggagtttt attttagaaa 1200 gtacactaaa aatgaatact attcatggag ctaactgaaa agtggcaaaa aatcaaatgg 1260 gactgttatg cgagtatggg cag 1283 // ID RTE-1_PPac repbase; DNA; INV; 2938 BP. XX AC . XX DT 20-MAY-2010 (Rel. 15.07, Created) DT 20-MAY-2010 (Rel. 15.07, Last updated, Version 2) XX DE A family of RTE non-LTR retrotransposons - consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-1_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2938 RA Kojima K.K. and Jurka J.; RT "RTE non-LTR retrotransposons from nematodes."; RL Repbase Reports 10(7), 1060-1060 (2010). XX DR [1] (Consensus) XX CC >99% identical to consensus. ~7-bp TSDs. The 3' terminus is CC composed by (AAATG)n microsatellites. CC This sequence was derived from sequence data generated by Genome CC Sequencing Center at Washington University School of Medicine in CC St. Louis. XX FH Key Location/Qualifiers FT CDS 100..2895 FT /product="RTE-1_PPac_1p" FT /note="includes endonuclease and reverse FT transcriptase domains." FT /translation="MVLPFFLATMNVLSLATDGRLTLFDQAVMKIKADVIG FT LCEVRRKEEGAIDLTSSSGTLYHTGRFGNRSAGCGFFVSRRMKPKVVRFLT FT ISPRIALLDCRLPNNVLLRLVQCYAPCSNHSDDQYDAFLSELESVFRQVVP FT GQRKFRKVYRVIMGDLNARVGKALPGDTAIGKFGYGDRNDRGEKVITLCET FT LRLRIGNTMFQKQESHCHTWVSPTGTTTTRIDYILYPRDFPALDVDVVNRM FT DFTSDHRMVRASFSVEGRPLGHRGGGKNEVVLVRDQFKDNLVRSIEGIPVG FT CDYSTLVSSIQQAATLSSTPAPSVPRFSPRTRQLFAERSKIKATVQGRDRV FT QWVAINKALRGSLQQDAIDRMTRMVEKAVEEGKNYSKAIGSQSVGRERIVK FT MQGGKGIVTTQADLEGVFREYYNKLYKGDGGRNYTVRETEEEFIPIGRDEV FT ESVLRGFTSGKSPGEDRVSGEMLKAGAEILSPYLTAIFNKILIEGFIPQGF FT GDSRTILLFKGGDKTLPKNYRPISLLPVIQKTLTAVINNRIGCSLDNLRAR FT EQQGFRGGHSTVDGIFILNTLIANCREFKRPLYLLFLDVSKAFDSVLCDSV FT LTAMERDGVHSTFIRFLQSLTVQSKSSVVVNGKGVEVDIGRGVRQGDGLSP FT RLFVAVLDTVFRGLNWGKKGICINGEYLSHLLYADDCVLISHDPRQIQSMV FT RELERELLAVGLCLNGAKTVALSTIPNSRPVIVAGETVVVKDKVVYLGQEV FT TMDSNRFKGEIHRRVRSANWAFSKYSEFFRKRGCPMVLKKRLFDGVVLPAF FT LYGAETWYLCKRDKQILSVAQRKLERKMLGITVLDHIRNEHLRSITKLKDV FT VREAEKRKWMWAERLSNFSHERWSLRILEWTPQGRRNRGRPLKRWRDDFVN FT AAGPQFLQLARDRTQWRTLMATQLRSHQ" XX SQ Sequence 2938 BP; 834 A; 668 C; 744 G; 692 T; 0 other; ttcattctga gttagtcgcc ccattccgac cggttggttc gccgatctct ccgtctccgc 60 ccaacacgga attggccatc cgtgggcgta cggccaacca tggttctacc gttcttcttg 120 gcaaccatga acgtcctctc cctcgctacg gacgggagat tgacgctatt cgatcaagct 180 gtcatgaaga taaaggctga cgtaatcgga ctatgcgaag tgagaagaaa ggaggaagga 240 gcaattgatc tgacgagctc atctggcaca ctctaccaca cgggacgctt cgggaatcga 300 tctgctggct gcgggttctt cgtcagtcgg aggatgaagc ccaaagtggt gaggttcttg 360 acgatttcgc cccgcatcgc tcttttggat tgccgccttc cgaataatgt ccttctccgt 420 ctagttcaat gctacgcacc ctgctcaaac catagcgatg atcagtacga tgcattcctt 480 agcgagcttg aatccgtatt tcgtcaagta gtaccagggc aacggaaatt tcgaaaagtg 540 tacagagtga tcatgggcga cctgaacgca cgagttggga aagcgctccc aggggatact 600 gctattggga aattcggata tggggacaga aacgatagag gggaaaaggt aattactcta 660 tgtgagactt tgcgccttcg cataggtaat actatgtttc aaaagcagga aagccattgc 720 cacacctggg tttcacctac tgggaccaca actacccgta tcgactacat tctctatcca 780 cgggacttcc ctgctcttga tgtagatgta gttaatcgga tggacttcac ttcagatcat 840 agaatggtcc gggcctcatt cagcgtggaa gggagaccgc taggtcatcg tggaggaggg 900 aaaaatgagg tggtactagt cagggaccaa ttcaaagata accttgtacg ctcgatagag 960 gggattcctg tgggatgtga ctacagtact ctggttagct caatacaaca ggctgcaaca 1020 ctttcctcta cgcctgctcc aagtgtccct cggttttctc ctagaactag gcaactgttc 1080 gcggaaagat caaagatcaa agccacagtt caagggagag atcgggtcca atgggtagcc 1140 atcaacaagg cgctccgggg tagcttacaa caagacgcaa tagatcggat gactagaatg 1200 gtggaaaagg cagtagaaga gggaaaaaac tacagtaaag cgataggaag tcagagtgtc 1260 gggagagaaa gaattgtcaa aatgcaaggt ggaaagggca tagtcactac ccaagctgac 1320 ctagaaggag tctttagaga atactacaac aaactctaca aaggggatgg cggacgaaac 1380 tacacagtga gagaaacaga agaggaattc attcccattg ggagagacga agtggagagt 1440 gttctaaggg gattcacatc cgggaaatcg ccaggggagg acagggtatc cggggaaatg 1500 cttaaggcag gggctgaaat actatcacca tacctcactg caatcttcaa taaaatcctc 1560 atcgaaggtt ttattccaca gggttttggg gacagtagaa caatcctcct gttcaagggc 1620 ggggacaaaa cacttcctaa gaactaccgc cccatttccc ttctcccagt catccaaaaa 1680 acccttaccg ccgttatcaa caataggatt ggttgttcac tcgacaatct tcgggcaagg 1740 gagcaacagg gattccgtgg tggtcactca acagtagacg gtattttcat cctaaacact 1800 ctcatcgcaa actgtcggga attcaaacgc cccctctacc tactctttct cgacgtaagc 1860 aaggcttttg actctgttct atgtgattct gtccttactg ccatggaacg ggatggagtt 1920 cattctactt tcattcgttt cctccaatca ctcacagttc agagtaaaag ctctgtggtc 1980 gtcaatggga aaggagtaga agtagacatc gggagagggg tcagacaggg ggatggtcta 2040 agtccgcgtc tattcgttgc cgtcttagat acagtattca ggggactcaa ctggggaaaa 2100 aagggaattt gcattaatgg agaatattta tcacatctcc tttatgcaga tgattgcgtc 2160 cttatttcac acgacccgcg ccaaattcaa tccatggttc gggagctgga acgggaacta 2220 ctagcggtgg gactctgtct caacggagca aaaactgtcg ctctttcaac tattcctaac 2280 agcagaccgg tcatcgtggc aggggaaaca gtagtagtaa aagacaaagt ggtttatctc 2340 gggcaagagg tcacaatgga ttcaaatcga ttcaaggggg aaattcaccg ccgagtaaga 2400 agcgcaaatt gggctttttc caagtactct gaattcttta ggaaaagagg atgtcccatg 2460 gtattaaaaa agcggctatt cgatggtgta gttctccctg cattcctata cggagccgaa 2520 acatggtatc tttgtaaacg agacaaacaa attctttccg ttgcccagcg taaacttgaa 2580 cgaaaaatgc tcggcattac tgtacttgat cacattcgga atgagcacct gcgtagtatt 2640 acaaaactga aggatgttgt gcgagaggcg gaaaagcgta aatggatgtg ggctgaaaga 2700 ctctcgaact tctcccacga aaggtggagt ctaaggattc tcgaatggac tccgcaagga 2760 agaagaaatc gcgggaggcc actgaaacgt tggagagacg actttgtcaa tgccgccggt 2820 ccacaatttc ttcaactcgc acgcgaccgt actcagtgga gaactcttat ggcaacccag 2880 ctgagatccc accaataacc agctgataaa gttgtttaaa tgaaatgaaa atgaaatg 2938 // ID BEL-23_CQ-LTR repbase; DNA; INV; 338 BP. XX AC AAWU01010308; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-23_CQ_; KW BEL-23_CQ-I; BEL-23_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-338 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 200-200 (2011). XX DR GenBank; AAWU01010308; Positions 15947 15610. XX SQ Sequence 338 BP; 105 A; 84 C; 61 G; 88 T; 0 other; tgttaaggcg caggtcaaat gtgaccaacc aaattttctc gtgtccaaca aatcgatcaa 60 tgatcgcgat cacgcacaca ctaattagca cacattagcc tacataattg gcctgcggcc 120 cttttgtttt aagatttcta tcagtcgaat aaagccaatc gaagagtgta caacttagac 180 tctatcgcga acaataaagt taagtgcaca tctcgtcgcg cctttttgta ctacccaagc 240 cgtacttttt aagtagtttt gatcgcgact agaaccgcgg aaagaattcg aacgatcact 300 ttaagccacc tacaatccgg cgaatcagtg cacaaaca 338 // ID CR1-119_AAe repbase; DNA; INV; 4543 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-119_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4543 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1207-1207 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 748..1572 FT /product="CR1-119_AAe_1p" FT /note="PHD zinc finger at the N-terminus." FT /translation="MATSEVCHSCARSMSASEVVCGGFCKATFHFSCATIS FT EELYKQICGKPAIFWMCNGCREMMGNARFKNALSSMNAATIELNDTYQKLL FT EDMKSEIKQSLIAEIREEIKGGFNKLSPAVFSPLPRRFKFGNTNSPKRRLD FT EEAATSNQPAKIIRGIGPSAANISVRVDSPADKFWVYLTKISPEVTETDIE FT KLAKECLRVDEVAVKSLIPRGRPSSTLSFISFKVGVDPESKSKALDPASWP FT QGIEFREFIEDEGRRTQHFWKPTPNVDSGPIIIS" FT CDS 1506..4478 FT /product="CR1-119_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RTKDSTFLETDAECGLRPDNHLVEPVDASSLSNSSPT FT LPNLSHLTVYYQNVRGLRTKTNSLFLSLASCDYDIVVFTETWLHADISNAE FT LSRNYAIYRCDRNSRTSALRRGGGVLIAVKSELNCKSVHLVDCESLEQIAV FT QIVLPHQSLFVCSIYIRPCSQPDYFIKHAESVQGLLDMANPHDTVIVMGDY FT NLPHLAWNYDEDVLGFLPLNASSEQELVVVESLLTTGLKQINYAVNSNGKL FT LDLVFVSDSDTVELFEPPSAMMKVDQYHKPIVLKVDFQPGSKPQSYRTFHE FT YDFARCDFSVINAKISALDWDQLLSLSNVDAAVSVFYDNIYQIIHDTVPVK FT TRRPVMSLQQPWWNPQLRNLRNRLRKARKRYFRSRTSEKMRAVELAEAEYN FT SLLEACFREYMDNIQTSLRDNPTSFWSFVNSRKKSSGIPTDLTYCNRSSSC FT DASSANLFADFFRSVFDNNQPPTSQQYIDQLPSYNIFLPRPDFSLELIKQT FT LGSVDVSKGPGPDRIPPSFIRNCASSLALPVASIFNRSLRDGVFPDAWKLA FT SITPIFKSGNIHRVENYRPISILNCLAKVLESLLHDKLYPAVHGIISEFQH FT GFVKKRSTITNLMSFTNTVIRSNEKRQQTDAVYIDFSKAFDKVPHAIAIAK FT LSRLGLPAWVVSWIKSYLSSRKAFIKIRGAKSDVFAIPSGVPQGSILGPLI FT FVLFVNDICDQLHCCKLLYADDLKIYRVIKSILDCCALQEDINRLTSWCRI FT NGMLANAAKCKVITFARVNSPIKFDYTIDHAPLERVDAIKDLGLILDDKLR FT FNKHISMVIAKANSMIGFLRRNTMEFTDVHALKSLYCSLIRSVLEYGVQVW FT APYHAVHVTRIERIQKKFVRFALRRLPWTDPDNLPAYEHRCALIGLHSLSS FT RRTFLQRLFIFDIVSNHIDCSSVLQCVNFHAPTRQLRNFNLLWTPGHRTAY FT GFHNPVDRCCRLFNDVYDAFDFNISKLVFKNRIKDL" XX SQ Sequence 4543 BP; 1215 A; 1117 C; 934 G; 1275 T; 2 other; gcaactctgc cgctctgtga gaactgtgat ckctgtcgtc gcatcgattt attatcaaat 60 ttcgtcgttc tgtgccgcct aaattgctct gtcccgcata attttgtcca caagtgccca 120 agccatccgt cacacaaact gtttgcactg tttagtcagg taggattagt gaaaatctta 180 tttttccgag ctggcttgag tgtggtcaga gaaagccacc tccgccatcg ccctatataa 240 acaatctccc acctcgccgc tctccacctg cactactgtc gcagttctgt gtatcgtacc 300 gatcaccgat ataaacaaaa caaccgatcg tcagattaat aatttgcctg ttgccgcccg 360 ccgattccga taaatcgtgt catttaaatc acctgtagca ttccaagcca atacaaaacg 420 tttgtttctg ctggccttct cattgcgcaa tccgaaaaac ccgccaaagc ttcgccaaga 480 ctgaaatcat catcatcgtc gtagaaatat ttcctcatcg ccgtcatcgt ttgcgtgaag 540 ataaacaacg cttgctgtac cacaacttgt gtgcgcttgt atgtattgaa gatttttaga 600 aaacaatcga tctcgtattg tgcccagtct gtcaacgtca tacttcagtc atcgatccgc 660 actagtggcg ccatctagtt gtcactgctc gcactgcagt tagagcaacc ccactgttgt 720 tgccatccag tgtagttaaa catattcatg gctacatcag aagtttgtca cagctgcgcg 780 cgcagcatgt ctgcttctga agttgtttgc ggcggttttt gtaaagctac tttccacttc 840 tcgtgtgcaa caatatcgga agagctttac aaacaaatat gcggtaaacc cgctatcttc 900 tggatgtgta atggctgtcg tgaaatgatg ggtaatgctc gcttcaaaaa cgctctgtcg 960 tccatgaatg ccgcaacaat tgaactcaac gatacgtacc aaaaacttct cgaagacatg 1020 aaaagtgaga ttaagcagag tctcatcgca gaaataagag aggaaatcaa aggaggtttc 1080 aacaagcttt ccccagctgt attttcaccc ctcccacgac gtttcaagtt tggaaatact 1140 aattctccta agagacgact wgacgaagag gccgcgacat ccaaccaacc ggctaaaatt 1200 attcgcggta tcggtccatc ggctgccaac atatcagttc gcgtggacag ccccgctgat 1260 aagttttggg tatacctcac gaaaatatca ccggaagtta cagaaactga catcgagaag 1320 ctcgccaagg aatgtttgcg tgtagatgaa gtcgctgtta aatcgttgat tccgagaggt 1380 cgaccgtctt cgacactgtc tttcatttcc ttcaaagttg gtgtagaccc ggaatctaaa 1440 tctaaggctt tggatcctgc atcctggccg caagggatcg aattccgaga gtttatcgaa 1500 gatgaaggac gaaggactca acatttttgg aaaccgacgc cgaatgtgga ctccggcccg 1560 ataatcatct cgtagaacca gtcgatgcaa gtagtctttc caactcttct ccaacgctgc 1620 ccaatctttc tcacttgacc gtttattacc agaacgtcag agggttaagg acaaagacaa 1680 attcgctatt tctctcgttg gcttcctgtg attacgatat tgtggtattc accgagacgt 1740 ggctgcacgc tgacatttcg aatgctgagc tatcgagaaa ctatgccatc taccgctgtg 1800 accggaattc tcgaactagc gcacttcgac ggggtggtgg cgtgttgatt gcggtgaaat 1860 cggagctaaa ctgcaaatcc gtgcatcttg tagattgtga aagtctcgaa cagatcgctg 1920 tacaaattgt cctgccacat caatcgcttt tcgtttgttc catctatatt cgtccttgca 1980 gtcaacccga ctacttcatc aagcacgctg aatctgtaca agggcttctt gacatggcca 2040 accctcatga tacagtgatc gtaatgggtg attacaactt gcctcatcta gcctggaatt 2100 acgacgagga cgtgctagga tttctgcccc tcaacgcttc gtctgaacaa gaacttgttg 2160 tagtggaatc gttgctgact actggactta agcagatcaa ttatgctgtc aactcaaacg 2220 gtaagttgct tgacttagta tttgtcagtg actcagatac cgttgaactg ttcgaaccgc 2280 cctctgccat gatgaaagta gaccaatacc ataagccaat agttttgaaa gtcgacttcc 2340 aaccgggtag caaaccgcaa tcatacagaa cgttccatga atacgatttt gcaagatgtg 2400 atttcagtgt catcaatgca aaaatatcgg ctttagactg ggatcaattg ctttcccttt 2460 ccaacgtaga tgctgctgta tccgtgtttt atgacaacat ttatcaaata atacacgata 2520 ctgtccccgt taagacccgt agacctgtta tgagcctcca acagccatgg tggaacccgc 2580 aattgcgcaa tttgcgaaac cgtctacgta aagcccgaaa acgttatttt cgttccagaa 2640 cttctgagaa gatgcgtgct gtagagctgg ctgaagcaga atataatagt ctgcttgaag 2700 cttgtttccg cgagtacatg gataacattc agactagttt aagagacaat ccgacgtcct 2760 tttggtcctt cgtaaactct cgcaagaaat catctggcat tcctacagac cttacctact 2820 gcaaccgtag ctcttcctgt gatgcctcct cggcaaattt atttgctgat ttcttccgga 2880 gcgtcttcga taataatcaa ccgcctactt cacagcagta catagaccaa ttaccatcct 2940 acaacatctt tctgccacgg cctgacttca gcctcgaatt aatcaagcaa accctcggct 3000 cagttgacgt ctcgaaagga cctggcccag atcgcatccc gccgtccttc atcaggaact 3060 gtgcatcttc gttggcatta cctgtcgcat ccattttcaa ccgctcgctt cgggatggag 3120 tttttccaga tgcctggaag cttgcctcaa tcacgccgat cttcaagtcc ggaaacattc 3180 acagagttga aaactatcgt cctatctcga ttctcaactg cctggctaaa gttttggaaa 3240 gccttctaca cgacaagttg tatcccgcag ttcatggtat catctcggaa ttccagcatg 3300 gatttgtgaa aaagcgctcg acaattacga acttgatgag cttcaccaat accgttatcc 3360 gcagcaatga gaaacgccaa caaacagacg ccgtttacat agacttttcc aaagcttttg 3420 acaaagtacc acacgctatc gctatagcaa aactgagccg cctcgggctt ccggcatggg 3480 ttgttagctg gataaaatca tatttatcgt caaggaaagc attcatcaag attcgcggcg 3540 ctaagtccga cgtgtttgcc attccttcgg gtgtcccgca aggtagtatc ctgggacctc 3600 taattttcgt acttttcgtc aacgatattt gtgatcagct acactgctgt aaattactgt 3660 atgccgacga tctgaaaatt taccgtgtga tcaaatcgat tcttgattgt tgtgcactcc 3720 aggaggatat caatcgatta acttcatggt gtcgaatcaa tgggatgtta gccaatgctg 3780 ccaagtgtaa ggttatcact ttcgcacgtg tgaatagccc aatcaaattc gattatacga 3840 tcgaccatgc cccgcttgaa agagttgatg ccatcaagga tctcgggctc atcttggacg 3900 acaaattgcg attcaacaag cacatatcta tggtcatcgc gaaggcaaac tcaatgatcg 3960 gctttttgcg tcgcaacact atggaattca ctgacgtgca cgctttgaag tctctctatt 4020 gctccttgat tcgaagcgtt cttgaatatg gagtacaagt ctgggctccg tatcatgcag 4080 tacacgtaac gcgaatcgaa aggatccaaa aaaagtttgt tagatttgct ctccgacgac 4140 ttccgtggac tgatcccgat aatctacctg catatgagca tagatgcgcg ctaataggac 4200 tacactcctt gtcaagtcgt cgcacttttc tgcaaaggct atttattttt gacattgtca 4260 gcaatcacat tgactgtagc agtgttcttc aatgcgttaa ttttcatgcg cccactcgac 4320 aacttcgcaa cttcaacttg ctgtggaccc ccggacatcg tactgcatac ggatttcaca 4380 acccggttga cagatgttgc agattattca atgatgtgta tgacgccttt gattttaaca 4440 ttagtaaact agtttttaaa aataggatta aggatcttta agcagtctgt gtgacttttt 4500 tttgtcaaag atgtagaata acaaataaat aaataaataa ata 4543 // ID ITmD37D_Ele7 repbase; DNA; INV; 1298 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37D DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37D_Ele7. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1298 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1298 RA Kojima K.K. and Jurka J.; RT "ITmD37D-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~97% identical to consensus. This consensus CC is ~97% identical to the original sequence in [1]. TA TSDs. 27-bp CC TIRs. XX FH Key Location/Qualifiers FT CDS 160..1170 FT /product="ITmD37D_Ele7_1p" FT /note="transposase." FT /translation="MFAKEVVTSLLLAGKSVKQIAETVGCHLATVYRIKRA FT LHDGTSPRKPGSGRPRSARSLRVIRSVKEKLKRNPIRSIRKLAKDANISKT FT TMQRLVKDDLDSKSRARVKRHLVTDRIKALRLERSKILLSVFKKKRPIVVF FT SDEKLFTVDPVCNSRTNRFISPQKMEDVPENVKFAFTTKHPAGVMVFGAVA FT SNGLKMPPVFIKAGLKVNTEVYMNILKENVLPWMKDNFGEHEHVVFQQDGA FT PCHTSKRTQMWLKENMEFWPKDVWPPSSPDLNPLDYSIWAYVQAKACERSH FT PSVEALKASITKAWNSMSASYIQKVCSRFRSRLEKVIEQNGGHIE" XX SQ Sequence 1298 BP; 395 A; 256 C; 304 G; 342 T; 1 other; cagtatggcc cataaaaaaa tgcgaaagtt tttgaagcct gtttttggat acattctaat 60 gcatttttac ctatatttgg atgtgtgtgt aatctctagg catcgacctg tcaagtgatg 120 taaacaaaac agagatattt tgaaacaaac gtttgtaaga tgttcgcaaa ggaagtagtg 180 accagtttgc tcctcgcggg aaaatcggtg aaacaaattg cggagactgt gggttgtcac 240 ttggccactg tttaccgcat caaacgtgct ctccatgatg gtacaagccc acggaagccg 300 ggaagtgggc gtccgcggtc ggcacgttct ctaagggtga taagatcagt gaaagaaaag 360 ttgaagcgga atccaataag atcgatccga aaattggcta aagatgctaa tatctcgaaa 420 accaccatgc agcgattggt caaagatgac ttggattcaa agtccagggc aagagttaaa 480 cgtcacctgg taacggatcg gattaaagca ctacggctgg agcgttcgaa gattttatta 540 tctgtgttca agaagaaacg tccgatcgtc gtgttctctg atgagaaact ttttaccgtc 600 gatccggtgt gcaactcacg tacaaacagg tttatttcgc cacagaagat ggaggacgtc 660 ccagagaacg tgaaattcgc attcaccaca aaacatccag ccggagtcat ggtgtttgga 720 gcagtcgcgt cgaatgggtt gaaaatgcct ccagttttta ttaaagctgg tttaaaagtg 780 aacaccgaag tctacatgaa cattttgaag gagaatgtgc ttccatggat gaaagacaac 840 ttcggagaac acgaacacgt tgtttttcaa caagatggag cgccatgtca tacttccaaa 900 aggactcaaa tgtggctgaa ggaaaatatg gaattttggc cgaaggatgt ttggcctcct 960 agcagtccag atttgaatcc cctggattat tccatatggg cgtatgttca agccaaagcc 1020 tgtgaaaggt cccatcccag tgttgaagct ctgaaggctt ccatcacgaa ggcatggaac 1080 tcgatgtctg cttcttacat acagaaggta tgctcccgtt tccgctcacg cctggagaaa 1140 gtaatcgaac aaaacggtgg acacatcgaa taatgtaatc ttgacatgtt caataaatgc 1200 attttcaaac caaatacaag wttttacttt tattatgtat caaaaagatg aagcaaaagt 1260 ggaatacaga ttttcgcatt tttttatggg ccatactg 1298 // ID Polinton-1_EI repbase; DNA; INV; 16504 BP. XX AC . XX DT 15-MAR-2006 (Rel. 11.02, Created) DT 15-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Maverick; Tlr; Polinton-1_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-16504 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This transposon is characterized by 597-bp terminal inverted CC repeats and 6-bp target site duplications. The consensus sequence CC was built based on multiple alignment of several copies that are CC >90% identical to each other. It encodes a family B DNA CC polymerase (POLB-1_EI), retroviral integrase (INT-1_EI), ATPases CC (ATP-1_EI and ATP1-1_EI), and one unclassified proteins CC (PTV6-1_EI). XX FH Key Location/Qualifiers FT CDS 1601..6271 FT /product="POLB-1_EIp" FT /translation="MPRVTKQDFKNLLNIYYFIHPERDIDMESINTTQDVI FT KAMFYTEDKLNIAKRVLDTYERKGKTYFKTNQYNLNKLEKAMARIYNENNV FT QRQNVENRYSELTKQQVNIKSKINDVKEQLNKVFNKHIRTSKIRLFRNNEY FT ECIVFREPSDIESFTECLINNREESHFDERYVHFEEPEVDTSNFKHVKSDI FT PVVEREIYKVNNLDQLKNIIQQSLQNANKPCKINIGFNYITEEMKLINDEE FT TYEYEIKKIENNKLSITKDNIIKNNEDIVNLTTNIEQSIISKADKAKNNLV FT SSKQVYIGITDIVIIKYNFGKTGLANKELEIYTKYRYNIMIANTKSDNLCM FT FKALCYYYAQTDDKIKNMTYNYKQLLALAKTYYNKMYGFTDLKNTVFNMFQ FT YETEIESFCKFFNISVYKYSFDVKTSQLTLLHKYKLEKVKSRMNIISHTFK FT DGNEHVMFVKDIDKLTGYLFCKVCGIKSFINTKEGQYKLHIHESRCNGAYD FT SKMKSDKAAPYCPRIFKDKLYTYCRAHGLTYSPIKYYITYDFETTETTIDN FT KIDNKTDNETEIIDAILSEFSLAVSNNLYDTINSRYFCKFVLDDKVGTKFQ FT NDVLIENDHFVEDAIQYICDLGKDIQEANIQHFLANLNKPEYSENEDLLKL FT LKSRSSNIPVLGWNSEKFDTRFIINHLDHIDANNINILGSTNSCKMLTIKY FT KSEGSDEEPDTCINIQFKDAMNYVSPISLDEASKSYGKNEIKVKGVFPYQQ FT LKTENIKEQLLRTEPFKREDFFNSLKQTELSESDYNKYLEDYAKYDNFYQY FT TKYYNIQDTAIMFPIIDNLIDLYHQDDQDMLHSISLASLAASTKYLKCYDD FT FDVDTEYKIYPETEKESNNVYKLSEYDYKNMVYRYQLQDKLAGRDITNNIT FT NDDFKHFEXIINVGMCKYCNCKFTRTNKPTLDRIDNKLGHSINNVELACNY FT CNKYVSNRDQVVAKFMIQIRNYAQFYNLTYNIEDTKVIDLVRKGITGGLST FT VHHRRNIKGENTITKLNYDMLNNKINIYDTKNIITHFVCVDFNSLYPSCFS FT SIENENNPYTDNKMYMPGSVKMSMFDLTEKQQKFARYIINKKEELFIAEVK FT GRIPNEKMNKCINFLPIIRNIDVSLDETVIGTLMYDYMMNNFYKTTNHQDK FT PIIERKLTQTYNTTYNSSKDKYMIFSSYYLWFLIDEFGFEIEDIKSFVTFT FT KHDSFSKFVNEKMNTRQEAILNHDKVRDLFNKIQLNSSYGQDGMRTDNFNK FT TKIVDKKNAFKSNKSVYHIDTLKISESRYLTTENNSSYKIITPVQCALFTL FT DNAKYWYLNFIYNFMNKCLDQTRFHYVEGDTDSMYFAISGADEEHLEYMDE FT KCNEQAFKYIIINQEFYNKHVYSWLPYNFYCSDETCKPKLETDQERIKHEK FT KLLGASIEKQGDNIIALGSKCYTTYNNEHIDKPISLKIKGVNKNSNKHICS FT QSYLEVLEKESTIDGTNYLLQYKKITNIDECISKQTELEHNIDRYTKLVLE FT TDKNTESDIKQTYINILGDLKDQLESKSYLKNIKSSVYKYV" FT CDS 9284..8442 FT /product="ATP-1_EIp" FT /translation="MNKFIYRSEKSSDFDKSHTSERSKENEDNIDSNIDTY FT VDNIVETKLNNSKKKFISKISNDVNFRYNSFNLCIGPQGSSKSTSVMKELM FT KLSLIPNDYHLIIYVNNNSSDDTFTSLVNYINIPVVKTDYEHVEKQCEQLF FT KLKDDYNKMVDEEIHKDSSILQYLYVDNFDKQRLHTFILFDDASFIFEKNS FT KSKFKNWFCQCRHLNITVFACIQIFNSIDPKLKSQLSTVFLFKGFSRERLQ FT LIYRQCCIDMSFSDFFNFYMRLQKYQKIIIDNIDCSLVIK" FT CDS 6407..7672 FT /product="ATP1-1_EIp" FT /translation="MLQTWGTYNTNFIFPKRDKIEIANKRLEDELKYCRNQ FT YKIEKDENCIDQYYVKYSITYKNIDEKGYKLSSKEEFDKDIEEVNRIYKYR FT PDIVAYSTYDYKKFTSERKDDDGEYVDNDEDYGDERLNMQMSSARKPTGAS FT MDINKSDVAIVDFDIHTSLFDEKYNIALRDYIIKKYHLKNGLVKTTSGGLH FT YYCYPDMIPHELKNKNKVVKIISNPLYEVDIFIPRPEIDDGPKVMKYGSQA FT YNHSGEIGIYQPINRRQKNITVRDFEINDLANLGNLYDVYNILKLGESLTI FT PNINAELCNKTKTRIYDDEGNEILQDIEYGTDKLEDKDILTHIKSEIHAHT FT NKGITLFKLFTIFNRFVKEDREYCYKYCYEHMNLTSKATEQFNSAKESSAN FT IKLYKNLNPNNMLKKYVTNSVKPDVKKE" FT CDS 15858..14818 FT /product="INT-1_EIp" FT /translation="MSFREGIDIKQKQIKEHQTRRLRHREQILNQKRNKIF FT MENIPFNNLTIDYPFKSLNKYESILKKDINKKQIISQDFNPKTMKKLYSRL FT YYSPYFNSYEVDLAFFDSGNSTKQNIYMFIININTKYLYVFSLYNKNNDSL FT IMPFNTLLNVGVVFDNIRFDGESALNSTYMSNYFNKLNIRTYSSSSKHINK FT NRVVDRVIRTIRTAYDNTMSRYVNKNQDILIQEIVTYYNFSKHNKTGYRPI FT DMQYIPNYNNMYKCYQLEYDYIKQMVKKNDEILYKQFENGFLSYIYGDKLI FT IYLDNSKDNTFRKTRNYFSTKATMIRYDHGNIVCKLDDNDKIVTIPCYFAL FT RDYI" FT CDS 11367..12890 FT /product="PTV6-1_EIp" FT /translation="MITVNAYQQQVPFNSNAHSKIAITDPSFHITNIDKSY FT LSATIKLRLKLVLDEAKTATTCVEKESNACMLFIGLKDSTQWLDSMKINWK FT NGTTYYENTKMIYETTISGMCRTQQEKESRAGVYTAWKNAFEHKNNVCGVY FT VKVDDLLRNSTFDAEMKLIIPLDNFLPLQSFTSYPNAIFNEFELDIKQRIT FT QNFVYCQVNPAILCDEFNENHLASDKITCSTSDWMKGVSKEFTQIGDKSLI FT LGYCYFSSGNSYSQFTGTVQIQESTCTLFQSVINGFNVKDSVLKEIYDKYS FT KTPLYVPAQITTEYDFPPNQSVSRINCSNMMSYLNTSMLFFLFPRSNNQCT FT VSKNPHISDIKVITANRTIPDQQMDTLSSEFSEFQLANTNFDTLFAAPQSI FT INSLTFNEYSENTNDSSKHIKYLSDATDFMMSIPLERYGSGCFFDGLYSTA FT NIQLTLQAIPMFGDYNPYIKPSGSTGSVNTNAINIILLNDAMWSLSPNKVE FT LLTNVDLDTMFK" XX SQ Sequence 16504 BP; 6575 A; 2062 C; 2111 G; 5749 T; 7 other; agtagttata gtaggtcacc tctctcgaac tcgcaaaaat cgcacactat taaaatcact 60 ttattagggt tttttaactt tttggacccc cctttttaac acacagtgtt atcctacagc 120 gatttttcaa tttcaaaacc ttcgaaaacc ccaaaggatg aaacagtttt tattatcgat 180 tttcctggtt tttgagaaaa attttttcaa aaatcgcaaa ggattaacca tcatttatat 240 gggtttttac gacctcattt ttcaaggtag aatgacataa aattgcctaa aatttgctca 300 ttttacacta aaatgttgtt tattttaaaa tttatataaa attaaaaatt tgaggtaata 360 tatgttaaaa taaacataaa ttagtatttt aataataatt gcaaaaagtt gcgagttcga 420 gagagtcgac ctactataac tactctaagt gaaacctaaa agttcgagag aggtgactta 480 gtatagtttt gaaaataatt ttaaaaataa attttgaaca agactagacc ataaactaat 540 atatcaagga gaaccgaaac gaagtgaggt tccacgacca acatatataa taaaaaatat 600 ttaatttcga ggtcgagaat tcgcgacctc atggaaagtt gagatgtttt ttattatttt 660 tttaaaattg ttttttcatt ttttctttct ataaaatgaa tacagaagtt aaagcagtct 720 tcaacacaag acctaccaga tgtccaaaac gtcttattga aaaattacaa gaaaatagtt 780 tttatcttgt tcgaggaata tgttacgatt tatataacca acctatttta tgtacatccg 840 aaatgaacaa atctaaaatc agatacataa gaccaaatac agccattaaa caatattctg 900 aaaaaatatt ataatgccga caactttaaa caaataggtg gtgaaataca aactgatgac 960 aatgttacaa tgtatttctc atcaagtaaa gaattttata ttaacaccaa atccagtttt 1020 acgtggaatg gattcaaaat tataaaatgt gaatttgatg aaaacactga gtaaactata 1080 aatttcgttt atttatcaat ttttattttt tgtttatttt caaaattgat tttatatttt 1140 tcctttctat aaaatgaaca cagatcgtaa actcatttca aaaagtaaat tcacaaaaga 1200 caatttgtca tgttatgaat acaaatatga aattatagac aaacatggta atccaaaaat 1260 tattacaaga tttaccatga ataagaaaac atcaaaaagt cacaaatatg ttcaagatag 1320 agatcatgaa cgagtttcaa attatattaa gaaatatatt gaagaacata aaaatgaaaa 1380 catatcttac ccagatataa ttaaaatatt aaaatctgat attaaaaaag aattcaacat 1440 cataatcacc aagaatattt taatatcatt aatttgggaa tgatgttgtt attgacactg 1500 ttttaatgac ttatgataat taatatttta tttttagtat ttttaatttt catattttct 1560 caaatatttt ttcatttttt attttttact tctattaaag atgccaagag ttaccaaaca 1620 agatttcaaa aatttgttaa acatttatta cttcattcat ccagaacgtg atattgacat 1680 ggaatcaatt aatactactc aagatgttat taaagctatg ttttatactg aagacaaatt 1740 aaatattgct aaacgtgttc tcgatacata tgaaagaaaa ggtaagactt atttcaaaac 1800 aaatcaatat aatttaaata aacttgaaaa agctatggct agaatttata atgagaacaa 1860 tgttcaaaga caaaacgttg aaaacagata ttcagaactt actaaacaac aagtcaacat 1920 taaatctaag atcaatgatg ttaaagaaca acttaacaaa gttttcaaca aacatattag 1980 aacttcaaaa atcagactgt ttagaaataa tgaatatgaa tgtattgtat ttagagaacc 2040 atctgatatt gaatcattta ctgaatgctt aatcaacaat agagaggaat ctcattttga 2100 tgaaagatat gtacattttg aagaaccaga agttgatact agtaatttca aacatgttaa 2160 atctgatatt ccggtagttg aaagagaaat atacaaagtt aataatcttg atcaattaaa 2220 aaatattatt caacaatcat tacaaaatgc aaataaacca tgtaaaatca acatcggttt 2280 caactacata actgaagaaa tgaaattaat caatgatgaa gaaacatatg aatatgaaat 2340 taagaaaata gaaaacaaca aactaagtat taccaaagat aatataataa aaaataatga 2400 agatattgtt aaccttacaa ctaacattga acagtctatt atttctaaag cagacaaagc 2460 taagaataat ttagtgagtt caaaacaagt ttacataggt ataactgata ttgtcattat 2520 taaatataac tttggtaaaa ctggtttagc caataaagaa ttagaaatat atacaaaata 2580 tcgttacaac atcatgatag ctaataccaa atctgataat ttatgtatgt tcaaagcatt 2640 atgttattac tatgctcaaa ctgatgataa aattaaaaat atgacctata attataaaca 2700 gttattagca cttgctaaaa catattataa caaaatgtac ggattcactg atttaaaaaa 2760 tactgttttc aatatgttcc aatatgaaac tgaaattgaa tcattttgta agttcttcaa 2820 tatttctgtt tacaagtaca gttttgatgt taaaacatct caacttactc ttttacataa 2880 atacaaacta gaaaaagtta aatcaagaat gaatattata tcgcacacat tcaaagatgg 2940 aaatgaacat gttatgtttg taaaagatat tgataaattg actggttact tattttgtaa 3000 agtatgtgga attaaatcat tcattaacac aaaagaaggt caatataaat tacatataca 3060 tgaatcaaga tgtaatggcg cctatgattc aaaaatgaaa tctgataaag ctgcacctta 3120 ttgtccaaga attttcaaag ataaattata tacttattgc cgcgcacatg gtttaacata 3180 tagtccaatt aagtattaca taacatatga tttcgaaaca actgaaacca caattgataa 3240 taaaattgat aataaaactg ataatgagac tgaaattatt gatgcaatat tatcagaatt 3300 cagtttagct gttagtaata acctatatga cactattaat tcaagatatt tctgtaaatt 3360 tgttcttgat gacaaagtcg gaacaaagtt ccaaaacgat gttttgattg agaatgatca 3420 ctttgttgag gatgctattc aatatatttg tgatttaggt aaagatatcc aagaagctaa 3480 cattcagcac ttcttagcta atcttaataa accagaatat agtgaaaatg aagatcttct 3540 taagttattg aaatcacggt catcaaatat acctgtattg ggttggaata gtgagaaatt 3600 cgatactaga tttattatta atcatcttga tcatattgat gcaaataata ttaatatact 3660 tggttccact aattcttgca agatgttgac tattaagtac aaatctgaag gatctgatga 3720 agaacctgat acatgcatta atattcaatt taaagatgcg atgaactatg tttctcctat 3780 tagtttagat gaggcttcta aatcatatgg taaaaatgaa attaaagtaa aaggtgtatt 3840 tccatatcaa caacttaaaa cagaaaatat taaagaacaa ttattaagaa cagaaccatt 3900 caaaagagaa gactttttta actctttaaa acaaactgaa ttgtcggaat ctgattataa 3960 taaatattta gaagactatg cgaaatatga taatttctat caatatacta aatactataa 4020 tatccaagat actgcgatta tgtttccaat tattgataat ttaatagatt tatatcatca 4080 agacgaccaa gatatgcttc attctatctc attagcaagt ttagcagcta gtacaaaata 4140 tcttaaatgt tatgacgatt ttgatgttga tactgaatat aaaatatatc cagaaacaga 4200 aaaagaatca aataatgtct acaaattatc agaatatgat tacaaaaata tggtttatag 4260 ataccaatta caagacaaat tagctggtcg agatataact aataacatta ctaatgatga 4320 tttcaaacac tttgarraca tcattaatgt tggcatgtgt aaatattgta attgtaaatt 4380 taccagaact aataaaccaa cattagatag aattgataac aaacttggtc attcaattaa 4440 taatgtagaa cttgcttgta attattgtaa taaatatgtt agtaatagag accaagttgt 4500 tgctaaattt atgattcaaa tacgtaatta tgctcaattt tataatctta cttacaacat 4560 tgaggacact aaagttatag atcttgttag aaaaggtata accggtggtt tatcaactgt 4620 tcatcataga agaaacatta aaggtgaaaa tactattacc aaattgaatt atgatatgtt 4680 gaataataaa ataaacatat atgatactaa aaatattatt actcactttg tttgtgttga 4740 ttttaattca ctttatccat catgtttctc aagtattgaa aacgagaata atccatatac 4800 tgacaataaa atgtatatgc caggttcagt aaagatgtca atgtttgatt taactgagaa 4860 acaacaaaaa tttgctagat acattattaa caagaaagaa gaactattta ttgctgaagt 4920 caaaggtaga atcccaaatg agaaaatgaa taagtgtatt aactttttac caattattcg 4980 taatattgat gttagtcttg atgaaactgt tattggaacc ctaatgtatg attatatgat 5040 gaacaatttt tataagacaa caaatcacca agataaacca attattgaac gaaaactaac 5100 tcaaacatat aatacaacct ataattcatc aaaagataaa tatatgatct tcagtagtta 5160 ctatttatgg ttccttattg atgaatttgg gtttgaaata gaagatatta aatcatttgt 5220 tacttttact aaacatgatt cattctccaa atttgttaat gagaagatga atactagaca 5280 agaagctatt cttaatcatg ataaagtaag agacttattt aataaaattc aacttaattc 5340 atcatatggt caagatggta tgagaactga taatttcaat aaaactaaaa tagttgataa 5400 gaaaaatgcg tttaaatcaa ataaatctgt atatcatatt gatacattaa aaatatctga 5460 atcacgatat ttaacaacag aaaataatag ttcttataaa ataattacac ctgttcaatg 5520 cgctttattt acactcgata atgcgaaata ttggtatctc aatttcatct ataacttcat 5580 gaataagtgt cttgatcaaa cacgatttca ttatgtagaa ggtgatacgg atagtatgta 5640 ttttgcaatt tcaggagctg atgaggaaca tttggagtat atggatgaaa aatgcaatga 5700 acaggctttt aaatatatta ttatcaacca agaattctat aataaacatg tttactcatg 5760 gttaccatat aatttctatt gttcagatga aacatgtaaa ccaaaattag aaaccgacca 5820 agaacgcata aaacatgaga aaaaactatt aggagcatct attgaaaaac aaggtgataa 5880 tattattgcc cttggttcta agtgttatac tacttataat aatgaacata ttgataaacc 5940 gatttctctt aaaattaaag gtgtgaataa aaatagtaat aaacatattt gctcacaatc 6000 atacttagaa gtattagaaa aggaatcaac cattgatgga actaattatt tattacaata 6060 taagaaaata accaacattg atgaatgtat tagtaaacaa accgaattag aacacaatat 6120 tgatagatat actaaattgg tacttgaaac cgacaaaaat actgaatctg atataaaaca 6180 aacatatatt aatatattag gtgacttaaa agaccagctc gaatcaaaat catatttaaa 6240 aaacatcaaa tcatctgttt acaaatatgt atgacttact cttcataaaa ctgctttatc 6300 tagtcttttc attaaaaata aggtatataa cgacaattgt caaacatgta ctccattata 6360 tatttatact gatccaataa catatactaa agaagatcaa aaaaatatgt tacaaacatg 6420 gggtacttat aatactaatt tcatattccc aaaacgagat aaaatagaaa ttgctaataa 6480 aagattagaa gatgaattaa aatattgtag gaaccaatat aaaatagaaa aagacgaaaa 6540 ctgtatcgac caatattatg ttaaatatag tatcacatat aaaaatattg atgaaaaagg 6600 atataaacta tcaagtaaag aagaatttga caaagatatt gaggaagtta atcgaattta 6660 taaatatcgt ccagatatag ttgcttattc aacatatgat tataagaaat tcacaagtga 6720 aagaaaagat gatgatggag aatatgtcga taacgatgaa gactatggtg atgaaagact 6780 aaatatgcaa atgagtagtg ctagaaaacc aactggagca agtatggata ttaataaaag 6840 cgatgtcgcc atagttgatt ttgatatcca tactagttta tttgatgaaa aatataatat 6900 tgcattaaga gactatatta ttaagaaata tcatcttaag aatggtttag taaagactac 6960 atctggtggt ttacattatt attgttatcc agatatgata ccacatgaac tcaaaaataa 7020 gaataaagtt gttaaaatca tatctaaccc attatatgaa gtagatatct ttatcccaag 7080 accagaaata gatgatggac ccaaagtcat gaaatatgga agtcaagctt ataaccatag 7140 tggtgagatt ggtatatatc aaccaattaa tcgaagacaa aagaatatca cagttagaga 7200 ctttgaaatc aacgatttag ctaatcttgg taacttatat gatgtttata atatattgaa 7260 attaggtgaa tcattaacaa ttccaaatat taatgccgaa ttatgtaata agaccaagac 7320 tagaatatat gacgacgaag gaaatgaaat attacaagat attgaatacg gaacagataa 7380 attagaggat aaagacatat tgactcatat taaatcagaa attcatgctc atactaataa 7440 aggaattaca ttattcaagt tattcactat tttcaataga ttcgtaaaag aagacagaga 7500 atattgttat aaatattgtt atgaacatat gaacctaaca tctaaagcaa ctgaacaatt 7560 taattctgca aaagaatcta gtgctaatat caaattatat aagaatctta atccaaacaa 7620 catgttaaag aagtatgtta ccaattctgt gaaaccagat gttaaaaaag aataattatg 7680 attattaacc atgatcatca aaatataaca aatttatttt atacttattt tcaattatat 7740 aagaatctta atccaaacac catgttaaag aagtatgtta ccaattctgt gaaaccagat 7800 gttaaaaaag aagtaaacac aactttcaaa attaatgaac atactagtat ggagaaaatt 7860 actgctcaca atcttcaatt aatcaaagaa agaaaagaag ctgaaaaagc aaagaaagat 7920 gaagaaaaca gtaatgagaa agatgataag aacagtagtg agaacagtag tgaagattat 7980 gaatacaaag aagtagatta caatgatcca gactattatg aagaataatt atgattaata 8040 accatgatca tcaaattata acaaatttat tttatactta ttttcaattt taatattttg 8100 atttatattt ttatcaaatg actaaacaag aagctatacc agaatttaag aatttatatg 8160 aaactgtgat aaactttgat agtaaagaag attttgaaag atattttaac aaatacgaag 8220 aagatatcaa aaaaattccg actagaggat tgaacattaa attcaaaata cctggttatc 8280 atataaatcg tgttaaaggt caaataacat tatacaaaaa taaaaatgaa aataatgata 8340 aaaataattc tgataatgat ttcaacatag aagatagatt ggatctgata atgagtcata 8400 ttaagagttt agataataaa ttaaatactt tattgaaatt acttaataac caatgaacaa 8460 tctatattgt caataataat tttttgatat ttttgtaatc tcatgtaaaa attaaagaaa 8520 tctgagaatg acatatcaat acaacattgt ctatatatta gttgtaaacg ttctctacta 8580 aatcctttaa acaagaatac tgtactgagc tgtgatttta atttcggatc aattgaatta 8640 aatatttgaa tacatgcaaa tactgttata tttaagtgtc gacattgaca aaaccaattc 8700 ttgaatttag attttgaatt cttttcaaat atgaaagatg catcatcaaa taaaataaat 8760 gtgtgcaatc gttgtttatc aaagttgtca acatataaat attgtaatat tgaagaatct 8820 ttatgtattt cttcatctac cattttattg taatcatctt tcaatttaaa tagttgttca 8880 cattgttttt caacatgttc gtaatcggtt ttgacaacag gaatattaat ataatttacc 8940 aatgatgtga aagtatcatc tgaactatta ttattaacat atatgatgag atgataatca 9000 tttggtatta aactcaattt cattaattct ttcataacag atgttgattt agaactacct 9060 tgtggtccta tacataaatt aaagctgtta tatctgaaat taacatcatt acttatctta 9120 gatataaatt ttttcttact attattcaat ttagtttcaa ctatattgtc tacataagta 9180 tcaatattag aatctatatt atcttcgttt tcttttgatc gttctgatgt gtgcgactta 9240 tcaaagtctg aggatttttc agatctgtaa ataaatttat tcatttctat attatgaaat 9300 tccaattttt agacttaata aatggaagaa gaacaagctc aacaagttat tcaaaatcca 9360 attcaacaag ttggaaaaaa taaatatatg gtaggtaaac catcaaagag atttaaaatt 9420 gaaaaaccac aaggtgcaat tcaatataaa cgaactcctt atattaaagg aacacataaa 9480 ccagtttatg gcgatgaacc aggtgtacta acagattcgt cagcaatgat gaataatcct 9540 gacaataaat atattttaaa tttaccagca aatcaagcta atcctatata tgatatcaga 9600 tataatccta aaatgttaga tccaaaacaa gcagaagtat ttgctaagca gaatagaatg 9660 catgctttta atagtgatat aaatggtgat gaaataccag atgttggtgt atatgatgaa 9720 gaaggtcgtc ttagatattt gaatggttat tcacttattc caaataaacg aaatagttat 9780 tatgcgtatt ataattcggt tccattagaa gaacgtatgg tggtgaagat aaaataccaa 9840 aaattaaacc aagtgtttat aaatggtttg aaaacacaat taaagatgtg ttaaataaaa 9900 ctccaaatgt tgcaaaaatt gctaataaaa atatgataaa cactaaacaa agtatatctt 9960 ttatgtattc tacattaatg gcttcattat tgggtgtaga tgtaaatgaa tttataaata 10020 acacattata cagaaaaggt gttgataaat atgctaaatt tgaaaccaat atcgatcata 10080 taaaaagcgg tataattgaa attatgaaac aaataccaac aaatgtttta tataaattat 10140 ttgtaccaga tgttgaagat cctgataata ttaatgctaa aaatggattt gtaactactt 10200 taaataattt acaagaatta attaatcaag tagtaaataa tttaactgaa ataagagaat 10260 ttgttatatc ttatatacaa gctccatcaa attacagaac taaaatgttc ccaccagatt 10320 taactaaatt acaaatgtct tatcaaggta aatactataa tacaacacta tgtgctgagg 10380 acttcgtccg actaaaaata tatcaatata ttctttattt atattttatt tttggtaaaa 10440 atataaagtg tgtaatcaca attttaataa ttatttatta acaggttgac caaatatact 10500 tgaattatta tccatataat cttgaatacc gtatttaatc tgttcgccga catgttgtct 10560 ataatcaggt tcttttataa atttgttata taacggaact gttttatcaa ctatttctga 10620 accataatca attgctttgt taatatattt tccatattca ggtgagatat tttcaatcat 10680 aggtttcatt ataggtttaa gaggtttaac gatattatca ttaatccata atgctgcttt 10740 aggaacaaaa tctttcaatt ttttaagttt ctgccaaaat agaggacttt ttaccatttg 10800 ttaaaatata aaaaacataa aatcaatatt tatttatcga catttttgac ctttggtatg 10860 actgkgacag agtcccgact acgtcttacg tctttaatgt cttaatttta tttcgacatc 10920 catatatttt tgagatcatt atgaccaaag tgacgtaatg atttaagagg atctgatcct 10980 ttactaacat caatcattct ttcaccaaat ttagaatatg tttgaaattt tttatcagca 11040 atattaccaa ctataggacc aataactgga acagaattta ttattggttt agctatattt 11100 gtaactttat tcattatgtt tccatatgtt tttaatactt ttccaccttt attctaaaat 11160 tttattggca aagtttttta gtgttttata aaatccaggt tgtcgcattt aaaatattta 11220 aaattcaatt ttttactctt aataaatgtc taacataaga attccttcag ttttagaaga 11280 tgccaataga ggagatatcc taaaatattt cgatgaacaa caaaaaataa ttacatcttg 11340 caaatgatgg tgtacatgat gcaagaatga taactgtaaa tgcttatcaa caacaagttc 11400 catttaatag taatgctcat tctaaaatag cgataacaga tccatctttt catattacta 11460 atattgataa gagttattta agcgcaacaa ttaaattaag acttaaatta gttctagatg 11520 aagcaaaaac tgcaactaca tgtgttgaaa aagaatctaa tgcttgtatg ttatttattg 11580 gtttaaaaga ttctactcaa tggctagaca gtatgaaaat taattggaaa aatggaacta 11640 catattatga aaacactaag atgatatatg aaacaactat ttctggaatg tgtagaactc 11700 aacaagaaaa agaatctaga gctggtgttt atacagcttg gaaaaatgct tttgaacata 11760 aaaataatgt atgtggagta tatgttaaag ttgatgattt acttagaaat tcaacatttg 11820 atgctgaaat gaaattgata attcctttag ataatttctt gcctttacaa tcatttactt 11880 catatccaaa tgcaatattt aatgaatttg aattagatat taaacaaaga attactcaaa 11940 attttgttta ttgtcaggtt aatccagcaa tattatgtga tgaatttaat gaaaatcatt 12000 tagcatctga taaaattaca tgttcaactt ctgattggat gaaaggtgtg tctaaagaat 12060 ttacacaaat tggtgataaa tcacttattc ttggttattg ttatttctca agtggaaata 12120 gttattctca gtttactgga acagttcaaa ttcaagaaag tacttgtaca ttattccaaa 12180 gtgttattaa tggttttaat gttaaggatt cagtattgaa agaaatatat gataaatata 12240 gtaaaacacc attatatgtt ccagctcaaa taacaactga atatgatttt ccaccaaatc 12300 aatctgtttc tagaattaat tgttcaaaca tgatgagcta cctaaataca tctatgttat 12360 tcttcttatt cccaagatct aataatcaat gtactgtatc taaaaatcca catattagtg 12420 atatcaaagt tataacagca aacagaacta taccagatca acagatggat acattatcaa 12480 gtgaatttag tgaattccaa ttagcaaata cyaattttga tacattattt gcagctcctc 12540 aatctataat taatagttta acgtttaatg aatattctga aaatactaat gattcatcta 12600 aacatattaa atatctaagt gatgctactg attttatgat gtcaattcct ttagaaagat 12660 atggttctgg atgtttcttt gatggtttat attctacagc aaatattcaa ttaacacttc 12720 aagcaatacc tatgtttgga gattataatc catatattaa accaagtgga tctactggtt 12780 cagttaatac taatgctatt aatattattt tattaaatga tgctatgtgg tcattatcac 12840 caaataaggt tgaactactt actaatgttg atttagatac tatgttcaaa taaaccatca 12900 aagatggtct gatacatcaa aaatgtcgta aaataataat aatatattta atctttattt 12960 ttagtaataa atggaacgat ttctaaagaa gtytgaacat ttttttaatt ttaataaaaa 13020 tggaagtatt ttcaacagct tatcaagata aaagtaaaaa tagtgttttt tgcacatcat 13080 attttaatcc ttcaacaata acatctaatg aaattaaaac tgataaaatt acaagcatag 13140 atggaacagt aagtataaat acaaaagata tttcttctac tgattatgta aattctgaaa 13200 tattaacata ttataataaa tctaaaagtt acagcgacaa tgtagcatta gtatgtaata 13260 catattctga tatgattgcc tcagcatgta actcttacag cgacaatgtt ggttctaatt 13320 gtaatgcgta cacagataca aaaatagctt caataccaac tccaaaaata atatatccat 13380 atccaaatat aatttcrcaa gatgtaacaa catataaaaa tgttacttat tctattaaat 13440 cagaaactga taaaactgca tattttagaa taaaaattga taatgttaca gaattcttaa 13500 caacaaatag aatagtaaaa gtagttccat atgtaattat taataatgaa atttataaag 13560 ttattgaaca agttagtggt ggaagtatgt tgaatggaaa aatatataca acagtacaaa 13620 atctatttaa tataaatgag catccaataa taataatacc agatacagtt ggttatttat 13680 ggggtgatgg atattattgt tttaattatt tcagtttata tttaccaaag aattttacat 13740 caaatagtaa tttatcatca ataacaacta aaccaatatg tgtttattta tattgtaatc 13800 ttaataattt attttatcct ggtattaatt attcaggaag taatccaaat ttaatgtgga 13860 gtaatataac tatatataat aatggaatat atctaataga accaaattta caaataacaa 13920 atacatcatc tattatagaa atatgtcaag attattcact aaaaatatat atgtcatcta 13980 aagctgtatc taattttaca gattataaaa catatattat taattcatta ggaagaacta 14040 ctacaactac aatttatgat atttttgaat caagtactgt ttataataat tataattcta 14100 gaaacgattt atggtaattt tttaaatcac taacataatc gaagattcca taatgtgttt 14160 ttttttcaaa tcactaaatg gttgaattag aaaaaatagt agaattagct tcattattat 14220 catttatatt tctgataata tttattacaa tattaaaaat atttacatat ttgtttaaaa 14280 agagattatt caaattaaaa gaacaaaaaa tattaacaac agatataaca ccaagaatac 14340 atagtgttta aaacactaac tctaactgaa caataaaata ataatctgaa tcaatattga 14400 ttatattatt tttgtaatca gtaaaccata tttttatttt tcgtttattg tctttgattt 14460 caaatgattt ttgattatca ctgttagcaa aaattataaa tccatcttgt gaaggataat 14520 cattactaga attcaaatca cttcttaaaa tatttccaca atcaatctga ccagatgaat 14580 tatagaatct agtagcaata gtattaatat atttattatt agtagaacaa acaaattcaa 14640 ttggtaattc aacttcaaat tgatatgtat ttattacttt gcaattatta ttatgtaaat 14700 tcatgtataa ttttctcatt ttatagaaca aaaaataaac atatacatga ataacaaatt 14760 gtaaataagc ttcgcgtaga ttgtattcga ttcgacttta ttattaatat tgaattagat 14820 gtaatcacga agtgcaaaat aacatggaat tgttacaatt ttatcattat catcaagttt 14880 acatactatg ttaccatgat catatcttat cattgttgct ttagtactaa aataatttct 14940 tgtctttctg aaagtattgt ctttgctatt atccaaatat attattaact tgtctccata 15000 tatataagat aaaaatccat tttcaaattg tttatataat atttcatcat tcttcttaac 15060 catttgttta atataatcat attcaagttg atagcattta tacatattgt tgtaattagg 15120 tatgtattgc atatcaatag gtcgataacc agttttatta tgttttgaaa aattatagta 15180 agtaacaatt tcttgaataa gtatatcttg atttttgttt acatatctcg acattgtatt 15240 atcataagca gttctaattg ttctgataac tcgatctaca actctgtttt tattaatatg 15300 tttggatgaa gatgaatatg ttcttatatt caatttattg aaataatttg acatataagt 15360 gctgttcaat gcagattcac catcaaaacg tatgttatca aaaacaactc caacattaag 15420 taaagtatta aaaggcatta ttaaactgtc attgttttta ttataaagtg aaaatacata 15480 taaatatttt gtgttaatat taataatgaa catgtagata ttttgtttgg tactgtttcc 15540 agaatcaaag aaagctaaat cgacttcata actattaaaa tatggagagt aataaagtct 15600 tgaataaagt ttcttcatgg tttttggatt gaagtcttga ctaattattt gtttcttatt 15660 gatatctttt ttcaatatag attcatattt atttaaactc ttaaatggat aatcaattgt 15720 taaattatta aaaggtatgt tctccataaa tattttatta cgtttttgat ttaatatttg 15780 ttctctatgt cgcaatcttc ttgtttgatg ttctttaatt tgtttttgtt taatatcaat 15840 tccttctcta aaactcattt ctaatttatt ttaaactata aagtcaaggt ccggaattca 15900 gacttttttt ttattatata tgttgttcgt ggaacctcac ttcgtttcgg ttctccttat 15960 tatgttagtt tatgttctag tcttgttcaa aatttatttt taaaattatt ttcaaaacta 16020 tactaagtca cctctctcga acttttaggt ttcacttaga gtagttatag taggtcgact 16080 ctctcgaact cgcaactttt tgcaattatt attaaaatac taatttatgt ttattttaac 16140 atatattacc tcaaattttt aattttatat aaattttaaa ataaacaaca ttttagtgta 16200 aaatgagcaa attytaggca attttatgtc attcgacctt gaaaaatgga ggtcgtaaaa 16260 acccatataa atgatgttta atcctttgcg atttttgaag aaatttttcg caaaaaccag 16320 gaaaatcgat aataaaaact gtttcatcct ttggggtttt cgaaggtttt gaaattgaaa 16380 aatcgctgta ggataacact gtgttaaaaa aggggggtcc aaaaagttaa aaaaacccta 16440 ataaagtgat tttaatagtg tgcgattttt gcgagttcga gagaggtgac ctactataac 16500 tact 16504 // ID CR1-50_HM repbase; DNA; INV; 3779 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-50_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3779 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1878-1878 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 114..890 FT /product="CR1-50_HM_1p" FT /translation="MEMENNFLNQVNWPSFGQGNHSPASKLLHEWCGKISK FT YVLELSKKVKDLEDKEQNNLKIIDSLKDDLSKSKQNVNSANIGNEWVQIVT FT KGSKSGKKPPEQLVVANATINELHDRERRKRNLIVFGIPESTQTDLLIKRN FT EDEKKITDVFDFIGKAEVKPVYTRRLRSKDKTKPGPILVELDEPSIRNPVL FT LAAKKLRNSMDHKQIYISPDLTEAERQLDLQLRQERNRLNANLDTNSPFRY FT GIRGNQLQKFKKNQNST*" FT CDS join(802..2988,2970..3779) FT /product="CR1-50_HM_2p" FT /translation="MQILIQTLHFDMASVVTNCKSSRRIKTQLKAPLLPES FT ITELNCIYLNATSLENKLDXLNVVTSLHCPKIVGVSETWFKNNSIVNIPGY FT NIYRYDRIGDRRGGGVCLYIKNSIDSYELVDTDFSISKIEQVWAVVYFGQD FT KYLVGCLYRPNYVVDMVDLDNIFKIARRYVDVNAFKDVLIMGDFNFPAIKW FT SNGSIASISNESGIEHKFYKTLSDTFLYQHINIPTFQLTNGLPSNVLDLIF FT TTESGSVCAVDPGFVLGDINKGHLIIYFKFVLKNNVKRLACSNFKFLYSKA FT KFDKISDFMSNIEWVKLYENKTVQDMYDELIYYTTEACNLFVPTIDISQIK FT SSTTPWINNEIKQLIRRKRNLRYINCSHRWNDVKLIKEYKQLCKLLKTEIY FT NARVLFEKNLVLRSKLNPKLLYKYLNSTNSIKDPIKAIRMPNGDLSHEPNK FT IVNCLNKHFQDVFTIEEKGDLPPFHLELNDVYKFEDIEPDDISFELVLSKL FT NSLKDNKSPGPDKLCSIVLKNCAISFTLPLTLIFRESLKTSQLPIQFKSAN FT VTPIHKKGDKTVASNYRPISLTSIPCKILESILRERIXKHLYKNNLLTIQQ FT HGFVKGKSCTTNLLETPDFISSCNSNGLPVDVVMLDFAKAFXSVPHKRLFA FT KLNAYGINGLALKWMEAFLXNRRQRIVLGEFVSEWVEIFSGVPQGSVIGPL FT MFIIYINDLPSKLINKTKLFADDTRYCPRYQILSKVTLDECTSSLQRDLDV FT SFKWTEDWLLKFNVDKCVVMHYGVNNKKNPLYINGIKLPESNSVKDLGVIF FT SKNLKWKDQVMSVTNNANNMLGRIKKSFARLDCYLLKSLYVTFIRPFLEFS FT VPVWSPYQKGDCKVIEQVQRRATKLIKSIRCFSYEDRLKXLKLTTLTKRRE FT RGDMIQLFKIMNNVNITEHNLQXIQNHARGHCFKYYRETTKNLLRHNFFYN FT RSANLWNSLPLELVNSSSVNSFKAGFDCWTNNNLSIXLYIYILS" XX SQ Sequence 3779 BP; 1420 A; 584 C; 626 G; 1134 T; 15 other; aaaawctttt tgtcatcatt tgtctaaaag tccctgtgta ataagattaa aaagttcagt 60 acaggaagta atccactgtt tagggtggta gtcttccaac aaggtgatag aaaatggaaa 120 tggaaaacaa ttttttgaac caagtaaact ggccatcatt tggacagggt aatcactctc 180 ctgcctccaa actgttacac gaatggtgcg gtaaaatttc aaagtatgtc ttagaactat 240 caaagaaagt aaaagatctt gaggataaag aacaaaataa tttgaaaatc attgacagcc 300 tgaaagatga cctaagtaaa tcaaagcaaa atgtaaattc agccaatatt ggaaacgaat 360 gggtgcaaat tgttacaaaa ggaagcaaaa gcggtaaaaa gccacctgaa caactagtgg 420 ttgctaatgc aactataaat gagctgcacg atagagaaag gagaaaaaga aatctaattg 480 tttttggaat tccagaatca actcaaacgg atctgttaat taagagaaat gaagatgaaa 540 aaaagataac agatgttttt gattttattg gcaaagccga agttaaacca gtttatacca 600 gaagacttag atcgaaagac aaaacaaagc ctggtccaat tttagttgaa ctggatgaac 660 catccattag aaatccagtc cttttagcag caaaaaaatt aagaaacagc atggatcata 720 aacagattta tataagtcca gatcttactg aagccgaaag acaacttgat ctccagctaa 780 gacaagaaag aaacaggctt aatgcaaatc ttgatacaaa ctctccattt cgatatggca 840 tccgtggtaa ccaattgcaa aagttcaaga agaatcaaaa ctcaacttaa agcaccttta 900 ttacctgaaa gcataacaga acttaattgt atttacctga atgctacctc actagaaaat 960 aaattagatk awttaaatgt tgtaactagt ttgcattgtc caaaaattgt tggagtaagc 1020 gaaacatggt tcaaaaataa ttcaattgta aacattcctg gttacaatat atacagatat 1080 gatagaattg gtgatcgtag aggcggcggt gtttgcttat atataaaaaa ttcaattgac 1140 tcgtatgagc tggtcgacac tgattttagt attagcaaaa tcgaacaagt ttgggcagta 1200 gtttattttg gccaagataa atacctggtt ggttgtctat acagaccaaa ttatgttgtt 1260 gatatggtag atcttgacaa tatttttaag atagcaagaa gatatgttga tgttaatgcc 1320 tttaaagatg tactaattat gggagatttc aatttcccag ccattaaatg gtcaaacggt 1380 agtatagcat caatttcaaa tgaaagtggc atagaacata aattctataa gactttaagt 1440 gatacttttc tctatcaaca tataaacata ccaacttttc aattgactaa tggcttaccc 1500 tcaaatgttt tagatttaat attcacaaca gaatctggta gtgtatgtgc tgtcgaccca 1560 ggttttgtgc ttggtgacat aaacaaaggt catctaatta tttactttaa atttgtccta 1620 aaaaataatg ttaaaagatt agcttgtagt aactttaaat ttttgtatag caaagctaaa 1680 tttgataaaa tttctgattt tatgtccaat atcgagtggg tgaagttgta cgaaaacaag 1740 acagtgcaag acatgtacga cgaattaatt tactatacca ctgaagcatg caatttgttt 1800 gttccaacca tagacatttc acaaatcaaa agctcaacaa ctccttggat taataatgaa 1860 atcaagcaat taataagaag aaaaagaaat ttaagataca taaattgttc tcacagatgg 1920 aatgatgtta aactgattaa agaatataag cagctatgca aattattaaa aacagaaata 1980 tacaatgctc gagttttatt tgaaaaaaat ctagtgttaa ggtctaaact taatcctaaa 2040 cttctataca agtatttaaa cagtacaaat tcaataaaag atccaataaa ggccataaga 2100 atgcctaatg gtgatttatc ccatgaacca aataaaatag taaactgtct aaataaacat 2160 tttcaggatg tcttcactat tgaagaaaaa ggagatcttc caccgtttca tctcgaatta 2220 aatgatgtat ataaatttga agatattgaa ccagatgata ttagctttga gttagtttta 2280 tctaaactta actcactaaa agacaataaa tcacctggac ctgataaact ttgttcaata 2340 gtcctcaaga attgtgctat atcgtttact ttacccctaa ctcttatatt tagagaatcg 2400 cttaaaacca gtcaactacc aatacaattc aagtcagcaa atgtaacacc gatacataaa 2460 aaaggtgaca aaacagtggc cagcaattat cgccctattt cattaacttc tattccttgy 2520 aaaatattag aatctattct aagagaaaga attraaaaac atctttacaa aaacaattta 2580 ttaacaatcc aacagcatgg ttttgtaaaa ggcaaatctt gtaccacaaa tttgctagaa 2640 actcctgact ttatttcctc gtgcaacagc aatggtttac ctgttgatgt agtaatgcta 2700 gactttgcta aagcttttra tagtgttcct cacaagagac tctttgctaa attgaatgca 2760 tatggtatta atggcttagc tctaaaatgg atggaagctt ttctaartaa tagaagacaa 2820 agaatagttc ttggtgaatt tgtttctgag tgggttgaaa tatttagtgg cgtgccacag 2880 ggctctgtta ttggaccgtt aatgtttatt atatacatta atgatctgcc gagtaaatta 2940 ataaataaaa ccaaattgtt tgctgatgat accagatatt gtccaaggta actttagacg 3000 agtgtacctc aagtctgcag agggatttgg atgtatcatt taaatggaca gaagactggc 3060 ttttaaagtt caatgttgat aaatgcgtag taatgcacta tggagtaaac aacaaaaaga 3120 accctctata tattaatggc ataaagttac cagaatcaaa cagcgtaaaa gatctaggyg 3180 taatattttc taaaaatctc aagtggaaag atcaagttat gtctgtaaca aataatgcaa 3240 ataatatgtt aggtcgaatc aaaaaatcat ttgcaagatt ggactgttat ttacttaaat 3300 cactctatgt aacctttatt cgtccttttc tagaattctc agttccagta tggtccccgt 3360 atcaaaaagg tgactgtaaa gtaattgaac aagtgcagcg tcgagccacc aaactgatta 3420 aatcaattag gtgtttcagt tatgaagatc gattaaaakt tttgaaattg acaactttaa 3480 ctaaacgaag agaaagaggt gatatgatcc aactattcaa aataatgaac aatgtwaata 3540 taacagagca taacttacaa aytattcaaa atcatgctag aggtcactgt ttcaaatatt 3600 atagggaaac aacaaaaaat ctgctaaggc ataatttctt ttataataga tcggcaaact 3660 tatggaactc attgccttta gaactagtca attcatcgtc tgtaaactcc tttaaagcag 3720 ggttcgactg ttggacgaac aacaatctat ckatayatct gtatatatay atcytatct 3779 // ID Copia-98_AA-I repbase; DNA; INV; 3283 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-98_AA_; KW Copia-98_AA-LTR; Ty1_copia_Ele66; Copia-98_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3283 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC 'ATCTC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 349..3246 FT /product="Copia-98_AA-I_1p" FT /translation="MSEKKFLFARLGNQNYQTWKLRMEMLLKREDLWSVVA FT DVKPEPVTAVWTRSDQKCHATVVLYLEDSQLCLIKDAESAKDVWNKLKTYH FT EKATMTSRVSLLKKICSLNLCESGDVEKHLIELEELFDRLACAGQALEDPL FT KIAMMLRSLPDSYSGLVTALESRPEADLTMPFVKQRLFDEYQRRAERSVDT FT GEKVMKMQSKGQKKRLCYHCQKPGHFRKDCRLLLQQQQQQSKKSEEVPKSG FT KQEKKKKSDAKQVAESDSQLCFTVANVRHRSCWYIDSGCSSHMSNDRSFFV FT KLDASVRTEVVLADGSTTKADGIGEGFVDCVNSEGNVKKILFKDVLYIPKL FT DSALLSVRRLTQRGCKVNFSGSNCDIMSASGEKVALGELYGNLFVLKTVEY FT VKLSKEIRHLPNCQHTWHRRFGHRDPAAVERIQTEQLGVGLQLKDCGLRQV FT IEQEPLSYEEAVRGPEQDLWKAAMREEYDSLMKNNTWSLVELPNGRKPVGC FT KWVYKRKEDVCGNVSKFKARLVAQGFSQQAGVDYDAVFAPVATQPTFRILL FT TVAGYKRMTVRHLDVKSAYLHGQLQEEIYMQQPKGFTVRGKEHLVCRLHRC FT LYGLKQGAKVWNDTISAILGELGFEQGRSDACLFTKKLESGELVYVLIYVD FT DMIVASVSEAEVDKVEKLLKKRISLSSLGEVSHFLGIRVTKDKDGFYALDQ FT QTYVAKVASRFKLDKAKGSKVPMDVGYYRSREGSKLLSSNSDYRSLVGALL FT YVAVNTRPDVAACVSILSRKISEPSEEDWTELKRVVRYLLLTSDYKLRLSV FT NRQERMVLLGYCDADWGGDPTDRKSVTGYVFQLNQSSICWASRKQTAISLS FT SMEAEYVALSEACKELVWLRRLLEEIGVKQNGPTTVYEDNTSCIEFVGVDR FT QSRRSKHIDTRIFYARNLCEQGVVSVQYCSSESMMADVLTKPLGAAKQRRF FT AEMLGLVNSGQ" XX SQ Sequence 3283 BP; 871 A; 654 C; 961 G; 797 T; 0 other; ggttatcggc ccaggaatcc gtcgaaaagt ggtgaagtga ttttcttccg gacattgcgt 60 tccagcgtgg attggtggcg tccattttgt ttggctgttc ggctgctcag atcaacgcgt 120 tcctttgtgg tggtgcgatc cgttattcgc cgcgtgtgtt gagaacagta gtgctgttgt 180 gtttggcagt gttcgcttcg gcgcttgtgc tgactggttc gtgttcgttt tcggtgcgaa 240 agcttctttc gatgctgctc gatcgattgg tgcgtgcgaa caattgagtc ggtggaaaat 300 actttcgatg cggttgaaag ttattgtgcg cgagtgaata ggaagaaaat gtccgagaaa 360 aagtttctgt tcgcacgtct gggtaatcaa aattaccaga catggaagct gcgtatggaa 420 atgcttctaa aacgggaaga cctctggtct gtggtggctg acgtaaagcc tgagcccgta 480 accgcagttt ggacaaggtc agaccaaaag tgccatgcaa cagttgtgct ttatctcgaa 540 gacagtcagt tgtgtttgat taaagatgcc gagtcggcga aagacgtgtg gaataagttg 600 aaaacatacc acgaaaaagc tacaatgaca tctcgcgtct cgttgttgaa gaagatatgt 660 agcctaaact tgtgtgaaag tggtgatgtt gaaaaacatt tgattgagct ggaggagtta 720 ttcgaccgac tcgcgtgtgc cggacaagct ctggaagacc cgttaaaaat cgcgatgatg 780 ctcagaagcc tcccggattc gtatagcggc ctagtgacgg cgcttgaaag ccggccggag 840 gctgatttga caatgccgtt tgtcaaacaa cgactgttcg atgaatacca acgaagagcg 900 gaacgaagtg ttgatacggg tgaaaaagtg atgaaaatgc aatccaaagg gcagaagaag 960 aggttatgtt accactgcca gaagcccgga catttccgga aggattgccg gttgttgctg 1020 cagcagcagc agcagcagtc aaagaaaagt gaagaagttc cgaaaagtgg caagcaggaa 1080 aagaagaaga aaagcgatgc gaagcaggtt gcagaaagtg attctcagtt gtgtttcacg 1140 gttgcgaatg tccggcatcg gagctgctgg tatatcgaca gcggctgctc tagccacatg 1200 tcgaatgatc gctcgttctt cgtgaaactg gatgcaagcg tcagaacgga agtggttctg 1260 gccgatggtt ccacgacgaa ggccgatggc atcggtgaag gatttgtgga ctgtgtgaat 1320 agtgaaggaa atgtgaagaa gatcctgttc aaggatgtat tatacattcc aaaactggac 1380 agcgctttgt tgtcagtgcg aaggctgaca caaagaggct gcaaagtgaa cttttctgga 1440 tctaactgtg acataatgag tgcaagtggt gaaaaagtgg ctcttggcga gttgtacgga 1500 aatctgtttg ttttgaaaac ggtggagtac gtgaagctga gcaaagagat tcgacacctg 1560 ccgaactgcc aacacacgtg gcatcggcgg tttggccacc gtgacccggc ggccgtggag 1620 cggattcaaa cggagcagct cggcgttgga ttacagttga aggactgtgg tttgagacaa 1680 gtgattgaac aggagccgct tagttacgaa gaagctgttc gtggaccaga gcaagatctg 1740 tggaaagcgg cgatgagaga ggagtatgat tcgcttatga agaacaacac ctggagtctc 1800 gtcgagttac ccaacggccg taagcccgtc ggctgtaaat gggtttacaa acgaaaggag 1860 gacgtctgtg gcaacgtgtc gaagttcaaa gcgcgccttg tcgctcaggg cttctctcag 1920 caagctggcg tggattatga cgccgtgttt gcgccggttg ctacccaacc gaccttcagg 1980 attctgttaa cagtggcggg ttataagcga atgactgtac ggcacctaga cgtcaagagt 2040 gcttacctgc acggacaact acaagaagaa atatacatgc agcagccgaa aggctttacg 2100 gttcgtggca aggagcacct cgtctgcagg cttcatcgat gtttgtatgg cttgaagcaa 2160 ggagccaaag tctggaatga cacaatcagt gcaatactag gagaactagg tttcgagcaa 2220 ggtagatcgg atgcgtgttt gtttaccaag aagctagaga gtggtgagct ggtgtacgtc 2280 ttgatctacg tcgacgacat gattgttgcg agtgtctcgg aagcggaagt cgacaaagtt 2340 gagaaactgt tgaagaaaag aattagtttg tcgtctctcg gagaagtctc tcatttcctt 2400 gggattcgtg ttacaaaaga caaggatggt ttctacgctt tggatcagca aacctacgtt 2460 gcgaaagttg caagccgttt caaactggat aaagctaaag gatcgaaagt gccgatggac 2520 gtcggctact accggagtcg tgaaggaagc aaactgctct caagcaacag tgactatcga 2580 agtttggttg gtgcattgtt atacgtggcg gttaatacgc ggccggacgt ggcagcgtgt 2640 gtttcgattt tgagccggaa aataagtgaa ccgtcggaag aagattggac ggaactaaag 2700 cgagtcgtcc gttatctact cctaaccagt gattacaagt tgagactatc agtgaataga 2760 caagaacgga tggtgctact tggctattgc gacgcagact ggggtggtga cccaaccgac 2820 cgaaagtccg ttaccggata cgtatttcag ctgaaccaat cgtcaatttg ttgggccagt 2880 cgtaagcaga ccgcaatatc cctttcaagc atggaggccg aatatgtggc cctctccgag 2940 gcatgcaagg aacttgtttg gctccgcaga ctgttggaag aaattggagt taagcagaat 3000 gggccaacta cggtctatga agataacacc agttgcatcg aattcgttgg agtagatcgt 3060 caatccaggc ggtccaaaca tattgacacg aggattttct acgccagaaa cctctgtgag 3120 caaggcgttg tttcggtgca gtattgttcg tctgagtcta tgatggctga tgtgttgaca 3180 aagcccctcg gtgcagcgaa gcagcgacgg tttgccgaga tgctgggcct ggttaacagt 3240 ggtcaatgag ttgcagatca acattgcaaa tatcgaggag gag 3283 // ID L2-4_Cis repbase; DNA; INV; 5542 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2010 (Rel. 15.1, Last updated, Version 3) XX DE Non-LTR Retrotransposon from Ciona savignyi. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; L2-4_Cis. XX NM L2-4_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5542 RA Smit A.F.; RT "L2-4_Cis - CR1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000278, Ci000009. Multiple related subfamilies. ORF1 pos CC 100-792, ORF2 pos 892-4206. 75% similar to L2-1 at DNA level, CC 65% identical (79% similar) at protein level. Copies are <3% CC diverged from consensus. Re-classified as Crack clade. XX FH Key Location/Qualifiers FT CDS 102..791 FT /product="L2-4_Cis_1p" FT /translation="MEATMLGKLAAMEEAMKNIQGKQEIIEAKLDNKSPDT FT APVSSISDSTVTAREVAEMRSSLNGLEASMKELRAMLQRTDERIDDLEQYG FT RRNCLVLHGCQDIPKDHRFLGYVLAIFNKLPLPFPVSPAQIDIAHVLPTKN FT GRTPIIVKFVHRIVRNQIFAAKRHLKGTKMAITESLTARRLRIMEKAKEAF FT GFRMVWTINGVINAMVNDKRQVINRLXDITALLARNSSST" FT CDS 894..4205 FT /product="L2-4_Cis_2p" FT /note="PHD zinc finger, endonuclease and reverse FT transcriptase." FT /translation="MSTPKRCFCQSCSKLIRINQNFIKCDNCKSNYHYKCL FT TDVSNYSHISVNNIKKQLAPSISNWLCDHCSFEHCLPFSTISDTELSLLLT FT STSSYGNLDPNKLNLFFNDLDYDILNDGDEVTSVRCPELYITSEECKSHLF FT TNHQGFSTLSLNIRSLTNPNNFNKLEALVSSLKFKPEVIAITETWIVANQT FT GHYSSLPGYVFLSNSRSKSRGGGVGLYIRSDLNFQVMDCLTVMEERIFESL FT FVVFPDIFPQHSTKLKSSLTYGVIYRSPKLDSESNALFLHNLSNTLEKIES FT RDSQCILLGDLNYDLLKENNNNNITNFTNLMHDYCYQSVINKPTRITDSGA FT TVIDHIWSTFDPTILKSGIITDSISDHLPVIMTINTEITKTCTELPSRSFT FT NLNTTIFNNNIQNLDLHEIFCESDPDTAYXKLIDQYNVEFENCFPIIHLKS FT KKAQNQWYTIKIAQLNKTKQKLYKKFIRKKTMQCKDEFVTARNIYNREIRG FT AKKNHYRKLFESNRKNMKATWRSINSLLGRXSQPTSKXFEINGRMTNDPVT FT IANQFNDHFSNVASELVRHIPXSTSLPSDYLGPQSPSSIYIFPTSPLEIQA FT IIHXMKSKSSSGVDNIPSKLVKSTPNNILQALAHIFNLSLQSGKFVNAFKS FT AEVIPIYKKGKHTSVNNYRPISLLPSFSKILEKLMYNRTHSFLKQNNFFHN FT HQFGFRKGHSTSYASNVLVNLLSDQMEKKKSVLGVFLDLSKAFDTIDHKIL FT LNKLSHCGIRGVALRWFTSYLSNRRQIVNFNGVLSSNTNSVTLGVPQGSIL FT GPLLFLIYINDLPNCLKHSETIMFADDTSIFTPGPNQSVICQNANNDLERV FT STWLSTNKLVLNIDKTKFMYFSSSTKPELLPSSIIINNYQIEQVNSFKFLG FT LTIDNRLSWKPHMLNLAKKINTNLIMVRKIRPLIDRSSLLSLYHSLILSHI FT QYCISSWCFGNNQILAKLQRVCNRYIRMIFNLGKRENTVKTMQQHQLLTIY FT DMHKLEILSIMHRCNYQKLPTALLXSIQLKPMTTRMTTRSKTKFAIPFCSK FT TISQQSIKYIGPKYWHQLPIKIREIQPHHKFIKAVKQHLLNDPHPLP" XX SQ Sequence 5542 BP; 1760 A; 1173 C; 828 G; 1768 T; 13 other; atcattctat ttgaataagg ttgatgctgc tagagtaagg atttttataa cttggcttaa 60 cttgaatatt attgtgtacg gtacctgaat cggaaaaaga aatggaagca actatgttgg 120 gaaaactagc tgcaatggaa gaggcaatga aaaacataca aggcaagcaa gagataatag 180 aagctaagct tgataataaa agccccgata ctgcccctgt gtcttctatt tcagattcta 240 cggtgaccgc gagggaggtt gctgagatga ggtcttcctt gaacggtctt gaggcgtcca 300 tgaaagagct aagggccatg cttcaaagaa cggatgagcg aatagacgac ttggagcagt 360 acggacggag aaactgccta gtactacatg ggtgccagga catcccaaaa gatcatagat 420 ttctaggtta cgtattagcg atttttaaca agcttcccct gccctttcct gtctccccag 480 ctcagattga cattgcacac gtgctgccaa cgaagaatgg acggactccg atcatcgtga 540 agttcgtcca cagaatcgtt cgaaaccaaa tcttcgctgc caaacgacat cttaaaggaa 600 cgaagatggc aatcacggag tcactgacag cgagacgact ccgcatcatg gagaaagcca 660 aagaggcctt tggctttcga atggtgtgga ccatcaacgg agtcataaac gcaatggtca 720 acgacaaaag gcaagttatt aacagactan ctgacattac tgcccttctt gctcgcaatt 780 cctcttccac ctaggtatta atactatagt atactgacaa atatcccaac cctattggta 840 ccatgtgcaa attagtttgt acatagcatt tggccagtta ccttacccac tctatgtcaa 900 cccccaaaag atgtttttgt cagagctgca gcaagctcat ccgaataaat caaaatttta 960 ttaagtgtga caattgtaag tctaattatc attataaatg tctaaccgat gtctccaatt 1020 actcacatat atcagtaaat aatattaaaa agcaattagc cccatccatc tcaaattggc 1080 tttgtgacca ctgctctttt gaacactgtc tcccattctc aacgatttct gacactgaac 1140 tttctctttt acttacttcc acctcttctt atggaaattt ggacccaaat aagctaaatt 1200 tattttttaa tgatttagat tatgacatat taaatgatgg ggatgaggtg acttctgtta 1260 ggtgcccgga gctgtatatt acctctgaag aatgtaaatc acatttattt acaaaccacc 1320 aaggtttttc taccttaagt ctaaatatta gatcattgac aaatccaaat aattttaata 1380 aactggaagc actggtttct tctctcaaat ttaaaccaga ggtaattgct attacggaaa 1440 cttggatagt tgcaaaccaa actggtcatt attcttccct tcctggttat gtatttctna 1500 gtaacagtcg gtcaaagagc aggggtggtg gggttggatt gtatattaga tctgacctga 1560 attttcaagt aatggattgc cttactgtaa tggaggagcg gatttttgag tccctttttg 1620 ttgtctttcc tgacatattt ccccagcact caaccaaatt aaaatcatcc ttaacttatg 1680 gtgtcattta ccgatctcct aaattggact ctgaatcaaa tgctttattc ttacacaatt 1740 tgtcaaatac tcttgaaaaa atcgaatcaa gagactcaca gtgtatactn cttggcgact 1800 taaattatga ccttctgaaa gaaaataaca acaacaacat aacaaatttt acaaatttaa 1860 tgcacgatta ttgctatcaa tctgtcataa acaaaccaac tcggattact gactcgggtg 1920 caacagtcat tgaccacatc tggagcactt tcgaccccac aattttaaaa tcaggtatta 1980 ttactgatag tatatcggac catctcccag tcataatgac tattaacact gaaataacca 2040 aaacttgtac ggaattgcca tcacggtctt ttaccaactt gaacaccacc atttttaata 2100 ataatattca aaacttagac cttcatgaaa tcttttgtga atctgaccca gatacggcct 2160 atnctaaact aattgaccaa tataatgttg aatttgaaaa ctgttttcca attatacatt 2220 taaaaagtaa aaaagcacaa aatcaatggt acaccataaa aattgctcag ctgaataaga 2280 caaaacagaa actttataaa aagtttatac gaaaaaaaac aatgcaatgt aaagatgaat 2340 tcgttactgc cagaaatatt tacaaccgtg aaatacgtgg tgcgaaaaaa aatcattaca 2400 ggaaactttt tgaatccaat agaaaaaata tgaaagctac atggcggtcc attaactctc 2460 tgctaggcag acnaagtcaa ccaacttcaa aatnctttga aataaatgga agaatgacaa 2520 atgacccagt tacaatagca aatcagttta atgatcattt ctctaatgtt gcatccgagt 2580 tggtaagaca catcccacan tccacatcac ttccatcaga ctacttgggc cctcaatctc 2640 cctcctccat ctacatattc cccacaagcc cattggaaat ccaagcaata attcacgana 2700 tgaagtccaa atctagcagc ggcgttgaca atattccctc aaaacttgta aaatcaactc 2760 caaataacat tctgcaagcc ttagcccaca ttttcaacct atcacttcaa tctggaaaat 2820 ttgtaaatgc ttttaaatct gctgaagtca ttccaatata taaaaaaggc aagcatacat 2880 cagtcaataa ctacaggcca attagtttac ttccatcctt ttctaaaata ctagaaaaat 2940 tgatgtataa cagaactcat tcctttttga aacaaaacaa cttttttcac aaccaccaat 3000 ttggatttag aaaaggtcat tctaccagtt atgcaagcaa tgtattggtt aatctattat 3060 ccgatcaaat ggaaaaaaag aaatcagtcc ttggtgtttt tttagacctc tcaaaagcct 3120 ttgacaccat tgaccataaa atactnctaa acaaactgtc acattgtgga attcgtggtg 3180 ttgccttaag atggttcacc agctatcttt caaaccgtcg tcaaattgtt aattttaatg 3240 gtgttttgtc ctcaaacacg aactcggtaa ctctaggtgt cccacaaggg tcaatacttg 3300 gtccacttct atttctcatt tatataaatg atttaccaaa ttgcctnaaa cacagtgaaa 3360 caattatgtt tgcagacgac accagtattt tcaccccagg tccaaaccaa tcagtcatat 3420 gtcaaaatgc aaataatgat ttggaaagag tatctacttg gctctcgacc aacaaattgg 3480 ttctcaatat tgacaaaact aaattcatgt acttttccag ctccaccaaa cccgaacttt 3540 taccatcttc cattatcatt aacaactatc aaattgaaca agtcaactca tttaaatttc 3600 tgggtttaac catagataac agactatcat ggaagccaca tatgcttaat ttggccaaaa 3660 aaataaatac aaatcttata atggtacgta aaatccgacc actcatcgac cgttcttcac 3720 ttttatcact atatcactcg ctaattctta gtcacattca atattgcatc tcctcctggt 3780 gttttggtaa caaccaaatc ttagccaaat tacaacgtgt gtgcaatagg tatattcgaa 3840 tgatttttaa ccttggcaaa cgggagaaca ctgttaaaac aatgcagcaa catcaacttc 3900 tgaccatata tgatatgcat aaactcgaaa tcttatcnat aatgcaccgg tgcaattatc 3960 aaaagcttcc cacggccctt cttcantcta tacagttaaa acccatgacc actaggatga 4020 cgaccagaag caaaacaaaa tttgccattc cattctgcag taaaacgatt tcccaacagt 4080 ccataaaata tattggcccc aaatactggc atcaattacc aataaaaatt cgcgaaattc 4140 aaccacatca taaatttatt aaagctgtta aacaacacct cttgaatgat cctcatcccc 4200 ttccatgaat ttactcacca tttctcactt taatactacc ctaatattta actatattgc 4260 cagtcatttt tactataata tttatttacc tattatatac attacctttt ccttattatt 4320 actttccttc atttattccc ctatcctcct actgcacact cctctcacaa agctatgact 4380 tttgttttgt tttttgtttt tcttattttt gtttctattg ttaagattga attgctttaa 4440 cacaatttga ctatctcacc atcactgtat tagcccactg tataactcac agcatgctgt 4500 cttattttta tcatttcacc atttaagtca caggggtttt tactgttaat ggcatgccga 4560 tggcaatttg tatttgtagt gctgttttct ttttagtttt tttagccggc cctaatagta 4620 tgcccctatt tattgcttga ctgcatgcca catcttttta gtacttcacc agcactatta 4680 tatttactct tattattcac tgtctttcct atcttaaatt ctgttagcag tctttaagac 4740 ccatactgcc cagaaattct tgtttcagtt gtaaagaaat tccatcggcg gaaacgattg 4800 ctctnccacc ttccccggct cattcacctg ctcttcgccg gcggctgcga tcctttttct 4860 ccgatgatgc cccaataaag ttgagatttg gcagtggaac ccaattactg tggagacttc 4920 acattaaact gtttcggact tactattcat taattacaca caaaaagcgg aagcttgctg 4980 ctgtttgtct tttttcttat ttttgcagtc ttattttaaa tatttatgta tattgtatac 5040 atattactat cataaattag tttatttata tattctagtg cagtaatgtt ctgcacttaa 5100 agtgcttcat ttattaaacg tagtgcaata actgtttttg caccaaaaat aatttaatgt 5160 ctgaaagtgt gttttgaaac ggttttttaa ccactggtgg aatttcattt tagggggttt 5220 ttctttttcc acttaatccc aaagccggct cccccctcag tactgggtcg cgcttgcaca 5280 cttttactca agtattaaat aacaccaaat ttaattagtt aattttgatt taatttgttt 5340 cgttgaaact tatttgtatt aatgtaaggt ttaacggcca cgtcgttttt ttccttaatt 5400 tgtaatgtgt ttatccactg acgtgtttta actactgtag ttttgccgtt ttatcctgtc 5460 tcctcaatgt aaaataattg ttttattctt ccttactgta acgaaggagc gaaaaaataa 5520 actaaactaa actaaactaa ac 5542 // ID Gypsy-3_SI-LTR repbase; DNA; INV; 313 BP. XX AC AEAQ01004791; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_SI_; KW Gypsy-3_SI-I; Gypsy-3_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01004791; Positions 979 1291. XX SQ Sequence 313 BP; 90 A; 77 C; 82 G; 64 T; 0 other; tgtaagaaat tctccgaccc gacaattcga attggcgcca aacgatgggc ccgagcaaag 60 gaggcgcgca actattggac gagtaggagc caataggcgg cggacgcaga cccgtccgtc 120 taaccaacgg aaagtaggtg cgtcggcgac gcatcgtcgg ctgtaacaac aacacgcgcg 180 gacacgtgac gcaactctta ctcttgcgag aactttaagc gagtgtacac gtgttaagcg 240 agtagagcga aataaagcag tttttactat aacaaccgca gtttagagtt tttcctcgag 300 taattttcct aca 313 // ID BEL-229_AA-LTR repbase; DNA; INV; 558 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-229_AA_; KW BEL-229_AA-I; BEL-229_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-558 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 914-914 (2011). XX DR [1] (Consensus) XX SQ Sequence 558 BP; 169 A; 92 C; 114 G; 183 T; 0 other; tgttccggca gcactgctcc catgaaaggc agtgaactcg ccggcgattg tagtccgttt 60 gtcaatgcga cggtgacata gctgatagag agattagaca acgtgttaac aagattgttg 120 aaagtgaatt tataatttct tggatttcat atgaattgtt ttaaaagcta gatctaaaag 180 taagttgggc cgaaatttga aggattgaat tatgtctcta atatacatca tatgaactag 240 atttctcttt gccctagatt attactagtt tagtgtgtcc gggtagaact gaagtcatcc 300 tgattgataa gtgttcaata acgggtaaac ttaatctaaa ttcatttaat cgattgccta 360 atttttactg ttgccttttg caagatagaa cttcttcgaa ccgtattgtt caggaccagc 420 gtttcgacgt aaaacgttag gagaagacta tcgtaagtgg atgtaaccta taattcaata 480 atttgcctaa taaactatat attatagctt taagcttcgc tggacaaaag ctgtaaaggt 540 tcttctggcg tccgaaca 558 // ID DNA8-95_AP repbase; DNA; INV; 968 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-95_AP. XX NM DNA8-95_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-968 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2032-2032 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. Putative hAT element. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 968 BP; 343 A; 121 C; 131 G; 373 T; 0 other; cagggctgtg aatttaatgc atttgcatgt tttttttttg ctacactgta taataaaatg 60 gattcggcca aaatttgttt tgacttattt atagttcaaa taccaaatct tattttgcat 120 atttttgcat atttcatggg ttagggcata taaatgcatg ttttaatgat ttttttgtat 180 aaatgctttt ttttcgtaat atttacattt tttcggtcaa tctgttctaa attgacatta 240 aaatattaaa tgtatttgta taaactatga cttgtatagt ttcatggaaa ctattctatt 300 tttactcggg tatttaacgt gtattattga cttttgtact attgaagtcg agattaaaca 360 gttctcaaca aaaacaaaag atttcaaatt atgtgtatta tttcaaaaat attaactggc 420 gaaaaaaaga acgtaggttt ggatatacct accagaagat ttaaccagta gcgacatgac 480 gtacctattt caagtttgcc ccaataactt cgtcagatgt agaacgctcc ttttctttat 540 ataaaacttt gttggcacca aacaggaggt ctttcaagtt tgaaaatcta aagaaattat 600 tgattgtaca atacaataac tactttaaag gtaaaatata taattttaag tagaaacata 660 agctattcta agcatatttc ttttataaat cttgtatacg acgaagatca agactagtca 720 agcccaaaaa gtcatatact catatcagtc atatcaataa ataacaatac taatatttaa 780 actaaaacta ataaaaaatt ataatgaaaa aaataaaaat agaagttaca tttttgtggt 840 tttttttgca tatttttgaa aaattagagc atattttaag aatttttaga gcatattttg 900 tcagttttta gtgcatttat atgcatgcat atttaatagt ttttaagtgc attaaattca 960 tagccttg 968 // ID Gypsy5-NVi_LTR repbase; DNA; INV; 580 BP. XX AC AAZX01005749; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5-NVi; KW Gypsy5-NVi_I; Gypsy5-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-580 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1136-1136 (2007). XX DR Genome; AAZX01005749; Positions 11082 10503. XX SQ Sequence 580 BP; 199 A; 78 C; 144 G; 159 T; 0 other; tgcgaaagag gaccggaaga ggcgggaacc aagcggaagg acgcgatgct ataggtcgat 60 gaccgatata gagggcgctg acgcagaccc aaagggatgg gcctgttgga aaaatcgggt 120 aggaaaggga aaaggacgcg agggtgaaac gaggaaaagc gtgcggagga cttttttttt 180 gtaaaataac tttaagtttt caaataaatt tagtatttca aagtaaataa cgtgaagtgt 240 aaacaattta atttttcgag tatatttaaa tacctataaa ttctacaaac tacacaagaa 300 ggaagaagtt tactatccag taaactttga tttcacttca acagcaaaaa tgtaagtaaa 360 atattaataa atgaaattat aatttacaag ggtggaaaat tgtatgtgag taggtttttg 420 aacgctgccg ctcatataca aaatttaggg tgaacttgtg tgattgactg gagggttgac 480 tggacttcgc atgtcggagg ccttttgacc tacagttcaa tcaaacttgt tcgtttgtga 540 gggttgggat ataaagttta gttgaaatac gcgttaatca 580 // ID Gypsy-14_DPu-LTR repbase; DNA; INV; 638 BP. XX AC scaffold_264; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_DPu_; KW Gypsy-14_DPu-LTR; Gypsy-14_DPu-I. XX NM Gypsy-14_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-638 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 744-744 (2010). XX DR Genome; scaffold_264; Positions 39753 40390. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 638 BP; 143 A; 155 C; 115 G; 225 T; 0 other; tgtaatagaa cggcgtgtgc gtcatttggc ctacggctat gatgacgcat cacacctgta 60 cagcttccgg tgcaacccct tttcctttct gccgtcgccg ggttcttgac cctttctaga 120 ttgcatcttc acgcataaat tacgcgtcct ccacgcatat ttcaaacccc ttttacccca 180 tttaaatgct ctttaaaccc caattaagtg agttgacaga actgtcaact cactcggggg 240 tgggttttac atgggatact tttcctatcg ttttctttat acaatgacgc atccttcctc 300 ttttcgtata aaagggtgcg atattcttct cttcagggcg cagatccttt ctgtagtttg 360 accgatcggg tgtttccgat tcagttcaag taatacaact ctggttcaat tgtgtgtgtt 420 aattgtgttg tacaactcgt caactacaac tgagtaaagg tactgtctct cattctttat 480 tcttcctctc ctatttagtt atatcctttt ctatcaattg tacttctgga ctgatattaa 540 taaacattat cattgttgag tcacgcctca tattttatac attcgttccc gtaagagaca 600 agatgcacgc gccgtgtagt ctcttacatc tggtggca 638 // ID Odin-3 repbase; DNA; INV; 9507 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version 2) XX DE Odin-3 non-LTR retrotransposon from Oikopleura dioica. XX KW Non-LTR Retrotransposon; Transposable Element; Odin-3. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-9507 RA Volff J.N., Kehrach H., Reinhardt R. and Chourrout D.; RT "Retroelement dynamics and a novel type of chordate RT retrovirus-like element in the miniature genome of the tunicate RT Oikopleura dioica."; RL Molecular Biology and Evolution 21(11), 2022-2033 (2004). XX RN [2] RP 1-9507 RA Kojima K.K. and Jurka J.; RT "Odin non-LTR retrotransposons from Oikopleura dioica."; RL Direct Submission to Repbase Update (07-JAN-2011). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 5 CC sequences with >88% identity. Both termini are uncertain. XX FH Key Location/Qualifiers FT CDS 1338..6587 FT /product="Odin-3_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MQTTLAFRPKSSPIRNPNNRTPKPVNLFPNRASTPPL FT HLFAGIIEKSPELASNPLLQSIASTSLEDALERSPARRSLRTAKKSSQSNG FT GTSAESTSSSVNDSETTSLNGTLGLSETDNSQNTFIGPPIAGGNIFLNELS FT KYNTDEFTGQRLQPNFAQLKEFVDSQLVKSKARVMAGHAKIGNNYFLPQSA FT FTEVASLMGGSVTLNEYIDSGNHSAIPNDSSEIADLNNFAKNQMECLTRDF FT HKIGIEPREGREIWLTLIGNHCQKIITRLNAIIRDYHSMSLTQKKELYILE FT PSDFAAIKHIRERLIELLDIFKNGLKLVEHFANGGQWLGRLIPLPHSKKIF FT KLMTTKIVHEQHRAQQSPIKTLLNLNTFHLDRMGIAYCNKNYSRIVLGIYA FT NGGLQIKRQITYMIAKKGNENLRNKTQVKIVKKSLENLLIRNSKVNPKLVD FT LNNLDKIKYNKEAWNIALKFMFLFNNNILTYDNITKDNYPLHIFQFHSSLR FT KPNDSTNCRPIGXNETSDNCDEGNLDSQNANTKVNDRXNAAEIKRGIIPRT FT RITNTQIPSPLTFLSNLIPLTLLANIRADWEDYNADLAKTRAAKNFLATNS FT CDKDILKTINSATIVNPSESDITHLEKTCEIYNKEALNNYKPLRICNINPG FT KAKTNGETFRMIVRELPKIDLFCLNEIRLPHSTMTNSAIWPEGYSIMTHDR FT IEAVNPAMLNTDESLCYSAIMIKNESFSECELERHSVGPLTAIEITNDKRQ FT SLVVASIYRFCENDNKAKCFYNILFGSTPEIFCEWIEQILKLTRVNKAQVI FT LAGDWNCDFMRKRDNDQVVKRLSFMLKHLTNLSHSPTFFRGTKTASIIDHI FT FVSHPANTNYKNLRLNISLKFDGHTGQSIISNFNTNYRAFKLIQITKNGSD FT AIIKKTAIKYFNRIQNSLRMLDDPKKRLESSIKWLNRIYNETKVVVAKICK FT NKNAFAVKKGKDRVDYENLLRFLKESPKYAPRSGEVNRPIDRAKIGLSILI FT LKLKKRDQRLAHKKLSHRAENDRNVIWDIFKLEIQQTQAWNPEQECSPDEL FT ADMVDSLQKVTLSANENFTPLVDHPFPFEKLSKISYSINGESQFPSILAMF FT DLAKQHTKGNSGINKSFLSILPISVMDSLIVSPIQNAIDTGTYPLSLRTSR FT ITILPKKSRGIRPIAINEVLATIIEKIVAWNLTNYLEATNSIPPMQAGFRS FT TLNCANSIADIIRNSETNRLKGNTTATIFLDLKNAFGSIKHANLDKTFALY FT FNGPALSIVSQSLFRNYRVNGNGFFSTTRAGLRAGVPQGGTISPLCFILFV FT SQIQNIIKGKEGVKINLFADDMGVQVTAKNYAELVTKTNDIIGILETALQN FT MGLMLVGSKSHILITGKSKSKNFGLDLEFEIDGTIIKPEKQAKYLGTTLSV FT SNGQLNLDREADIMICKFRNIVDKLNCIKSTLTLTTNGDLLRACAYGTFLH FT NTSTTPYFSNKTLSQLQKIYDKGICIASNGKILSSDSEVIRLADCPNRAYL FT PRLDYERIIKNLNRAGHPTLSKVKGLAACNFMRTLLRELKSTSLAEFTSSC FT LSLRITHSSERLSNFCPSLFQHESFLKKMGTLDTLKIKHCFSSELSTNQVI FT ARELLINLLQEELITFTIESPPNIPKAASHKLIWPLSFRKYINSLPVNIRN FT LILTSPNKEALKNKFSTIHNHVENSYYCVDCISFDSTLDNTQAREDDLAAF FT IHEHQSKLNELDLDCSQPIAELVMDVIRMA" FT CDS 6769..7617 FT /product="Odin-3_2p" FT /translation="MIIKLREKEIKLVNFQELSSIEKLQNANHTLLPSGII FT ELAFPGLALTNKISARADLTALLEKFENNKISVPIRSVALPGEPMLETFQK FT WFNIQNSSFNPFATAIFWAHLINLNHVEXSSPIPNLTGGLALXINFLPRCI FT TAVKNPDPFLTGHIQRTFTGAPRTLPLFNLAHCHPAFDLSNCGLHVNPESP FT ELLDGWMXVPALLGQLKSHWLDLVPSIPNFKTGIPTQTRDDLIKHFTELHL FT VKGDNIIATRAALFCISKALQHNNIEISNPIWQRIAHLSPTI" FT CDS 7804..9351 FT /product="Odin-3_3p" FT /translation="MSTTSSGIFTDLHAKRAPPYTSEEAKQYRFNLYIDNV FT CINIIQDTPEDCLSELTCCGILVTNALNRHGIEITDDTVVLFKMISCDALL FT KAILNKVPHEYIDFFASLNLLTNEVQTFSTASCLAGYLQTITEKIVPSGII FT ALGNNQNSIYRISGELAFIRIRGELHVTALKQWSENNPIFSPXWDVNIKRI FT LAPSAINAEALTNWIASRLYIKRKGWDIKYAFAIHLASSSPGQRYPTPSGK FT VPILTKDAHLSKAELTQKINSPPTVRTTTIKPAELEVLNLANAXLVSKFSN FT PALLHPLRVFALSKSKKVLDLALRLYRSKMADVQDFLTIKACKAHNGLMLS FT SSICMASSPDSNCSSRISEILNEASTLPLERSEHLYKNEIHRIVRDITPNG FT SLTTLLSRLVKSINGHNASPTWADLLVEITQLDKKDAAVVKNYCSEKKLLP FT PDXKGDPNSTLVLLTDSDEDDRIAERASSTLMPRSVPQQEDLDFSPPSLSP FT INSPEQDSPPFWTKSTASED" XX SQ Sequence 9507 BP; 2934 A; 2747 C; 1639 G; 2063 T; 124 other; aacatcmgag gcaaagcmtt caaagagacg caggcwcact ggcgmttccg catgctctgg 60 gaccctacca ctagagacga agttggccaa wtgaagacct ccaactgcat cacacgcgag 120 gaattcaacm gaattgccaa gcttctcgtc cgctacgaca gcccsctwcc agcccaatcw 180 gtcmaagtat acactgagat gcgccwggag cagaagcagg agcgcattaa ctcgcamgcc 240 aaaacacacc tsgacatcct gcccaagctt gcaacwctgc cggagtacaa gcagcacgtc 300 acwgccatwc tcagcggcaa tgacctgcct gccccwtcca acccgcagct mwccagcgcg 360 tacgtgacca gcctccaaca acaaatwggt aagaaataca acaacaaaat caaawcgatt 420 cagwmcaacc tagaccagat wcggaacmta atcccgggcc tccagaatca aaccaccact 480 kcccacaama cgccgcaggc cgccscatcc gctgawaatg cmaacagggk kaccgacaac 540 gkcgagtcmt caatggawtw tacaaacgaw aaccagcmgc csgcccaaca gcamgccccw 600 cagccmtcag cgtacgtgcc kggactgccc gtcatgccag cmaatwccaw ctcgacaaaa 660 aacccctcag cgagtcacta gtgaacgacm ccgtctctag tggctcgcct acmcagcctw 720 gcmcgccgca wcagccwccw acacagccac cacaacacct tgccacwcca caccagccsc 780 cactgccwaa gccmawtgaa gctscttttg macagcaaaa ctccmtgcgm macaatacwt 840 cgagacagga gctccckcgt gcgacctcgg agcctgcctc agaaatggag tcgcaawgca 900 gcawctccag ccgtgccatm acggacgcct ttgacaaatg caacgtcaga tccccagcca 960 ggtacakcmc aaaccagagt ctctctcacc accaaggcca acamwkcmmt ctaacgmccc 1020 taccagwcac ccccaawgwc wtwcaaaaca camctggcac gkccccgmwc ccwkccgcma 1080 ackgcwccac actcaactcc gcwcccwgca mccaatccwa ctgctcsctc atcagcttca 1140 atgakgatga cagtattatc agcctccaca acgamaatgt actcacctcc acactsctcc 1200 cmwgcagcgt caccgaacaa acagtaacga atgcwcatag ctcgatcaag aaaaaaggga 1260 agmgatcgct cgaccaagcc aacctagcca gccaagagac wccttcccag gatagaggac 1320 ctaaaagtca aaggttgatg caaaccacgc ttgcattcag accaaaaagc tctccaatcc 1380 gcaaccccaa caaccggaca ccaaagccgg taaatctctt ccccaatagg gcatccacac 1440 cgcccctgca tctctttgct ggaattattg aaaaatcgcc ggagcttgcc tccaacccac 1500 tgctacagag cattgcctcc accagcctcg aggatgccct ggaaagatcc ccagcccgta 1560 gaagccttcg cacagcaaaa aagtctagcc aaagcaacgg tgggacttct gccgaatcaa 1620 cctcgagctc agtcaatgac tcagagacaa caagtctcaa cggaacactc ggactaagcg 1680 aaaccgacaa ctcccaaaac accttcattg gtccgccaat tgctggaggt aatattttcc 1740 taaatgaact ctcaaaatat aacacagacg aatttacagg gcaaagactt cagcccaact 1800 ttgcccagct caaagagttt gtggactcgc aactggtaaa atcgaaagcc cgagtgatgg 1860 caggacatgc taaaattgga aacaactatt tcctaccgca atcagccttc acggaagtag 1920 ccagcctgat gggagggtct gtcaccctca acgaatacat tgatagtggg aaccactcag 1980 cgattccaaa tgacagcagc gaaattgcgg atctcaacaa cttcgcaaaa aaccaaatgg 2040 agtgcctaac ccgcgacttc cataaaatcg gcattgagcc gcgagaagga agggaaatct 2100 ggctcactct catcggaaac cactgccaga aaatcataac acggctaaac gccatcatcc 2160 gggactacca ttcaatgtca cttactcaaa agaaagaact ctacatcctc gaaccttctg 2220 actttgcggc gatcaaacac atccgcgaac gactaatcga actcctcgac atttttaaaa 2280 acggcctcaa actagttgag cacttcgcga atggaggaca atggctaggc agactcatcc 2340 cactgccgca ctccaaaaag atcttcaagc tcatgaccac caaaattgta catgagcaac 2400 atcgtgcgca gcaaagcccc atcaaaaccc tgcttaatct gaatacattt cacctggaca 2460 gaatgggaat tgcttactgc aacaaaaact actccagaat tgtacttggc atctacgcca 2520 acggtggcct ccaaattaag cggcaaatca cctacatgat cgccaaaaaa ggtaatgaaa 2580 atctcagaaa taagacccaa gttaaaattg tcaagaagtc actagaaaac ctacttatta 2640 gaaattctaa agtcaaccca aaattagtcg acctcaacaa tcttgacaaa ataaaatata 2700 ataaagaagc atggaacata gcgttgaaat ttatgttcct ttttaataac aacattctta 2760 cctatgataa tatcactaaa gacaattatc cgctccatat tttccaattt cattcctcat 2820 taaggaagcc caatgactca acaaactgca gaccaatagg cgwgaatgag actagtgaca 2880 actgtgatga gggaaacctt gactctcaaa atgccaacac aaaagtcaat gaccgccwga 2940 atgcagctga aattaaaaga ggtatcatcc ccagaaccag aattactaat acacaaatcc 3000 ctagcccact tacctttctc tccaacctga tccctcttac cctgcttgcc aacattcgag 3060 cggactggga agactacaac gcagaccttg ccaaaacgag agcagccaag aattttctag 3120 caacgaatag ctgtgacaaa gacattctca aaacaatcaa ctctgctact atcgttaacc 3180 cgtctgaatc tgacattaca catcttgaaa aaacctgtga aatctataat aaagaggctc 3240 tcaacaacta caaacctctt cgcatctgta atatcaaccc tggcaaggcw aaaacwaatg 3300 gcgaaacctt tcgaatgatt gtacgtgagc tccccaaaat tgacctcttc tgtctaaatg 3360 agatccgctt gccacactct acaatgacaa actcggccat ttggccagaa ggctattcca 3420 tcatgactca tgacaggatt gaggcagtaa acccagccat gctgaatact gatgagagcc 3480 tctgctacag tgccataatg attaaaaacg aatcttttag tgaatgtgaa cttgaaagac 3540 actcagtcgg gccactaaca gccattgaaa taacgaacga caaaaggcaa tcactcgtag 3600 ttgcgagcat atacagattc tgcgaaaatg acaacaaagc caaatgcttc tacaatatcc 3660 tgtttggctc aactcccgaa atcttctgtg aatggattga gcagatcctc aaactcacaa 3720 gagtaaacaa agctcaggta atcctagcag gcgactggaa ttgtgatttt atgagaaagc 3780 gagataatga ccaagtggtc aaaaggctca gtttcatgct aaaacacctt actaacctat 3840 cgcactcacc aaccttcttc cgcggtacaa aaactgcctc cataatcgat catatctttg 3900 tctcccaccc tgcgaacaca aattataaaa atcttagact aaatataagc ctaaaatttg 3960 acggccatac tggtcaatcc atcataagca actttaacac caattaccga gcatttaaac 4020 tgatacaaat aacaaaaaat ggttcagacg caatcatcaa aaaaacggcc atcaaatatt 4080 tcaaccgaat ccagaacagc ctcagaatgc ttgacgaccc caagaaaaga ctcgaatctt 4140 ctatcaaatg gctaaaccga atctacaacg aaacaaaagt agttgtcgcc aaaatctgca 4200 aaaacaaaaa cgcatttgca gttaaaaaag gtaaagatag agttgattac gaaaacctac 4260 tcagatttct caaagaatca ccaaagtacg ctcctcggag tggtgaagtt aaccgaccaa 4320 tcgaccgtgc taaaattggc ctcagtatcc tgattctgaa gctcaaaaaa cgtgaccagc 4380 gactcgcgca caaaaagtta tcgcatagag cagaaaacga ccggaatgtg atctgggaca 4440 ttttcaaatt agaaatacaa cagacacagg catggaatcc agaacaagag tgctcgccag 4500 atgaacttgc agacatggtt gactccttac agaaagtcac cctatctgca aacgaaaact 4560 ttacgcctct agttgaccac ccttttccct tcgaaaaact atcaaaaata tcatactcta 4620 tcaatggcga aagccaattt ccttccatcc tagccatgtt tgacctggcc aagcagcata 4680 ccaaagggaa cagcggtatt aataaatcat tcctctcaat tcttcccatt tcagtaatgg 4740 attcgttgat tgtttcgcca atccaaaacg ccattgatac aggtacttat ccactatcac 4800 tgcgaactag tcgcatcacg attttgccca aaaaatcacg cggcatacgc ccgattgcca 4860 ttaatgaggt tcttgccacc attattgaaa aaattgttgc gtggaatcta actaattacc 4920 ttgaagccac gaattccatc ccaccaatgc aggcaggatt tcggagcact ctcaactgcg 4980 ccaactcgat cgcagatatc atacgaaatt ccgaaacgaa tcgtctcaaa ggtaacacca 5040 cggcaacaat attccttgac ctcaagaacg cctttgggtc aattaaacac gcaaatctcg 5100 ataaaacatt cgcgctctac tttaatggac cagccttatc aattgtttcc caatcgctat 5160 tccgtaacta tcgtgtaaat ggaaatgggt tcttcagcac tacccgtgcg ggactccgtg 5220 cgggtgtacc gcaaggcggc actatttcac ctttatgctt catactattt gtgagccaaa 5280 ttcaaaatat cattaaaggg aaagaaggag tcaagatcaa cttgttcgcc gatgatatgg 5340 gtgtccaggt gactgcaaag aattacgcag aacttgtgac caaaacgaat gacatcatcg 5400 gaatcctcga aactgcgctc cagaacatgg gactcatgtt agttgggagt aaatctcaca 5460 tcctaatcac aggtaaatca aaaagcaaaa actttggcct tgacctagaa ttcgaaatcg 5520 atggaaccat aataaagcct gaaaaacaag cgaaatacct agggaccact cttagcgtca 5580 gtaacggtca acttaatcta gatcgtgaag ctgacattat gatctgtaaa ttcaggaaca 5640 ttgtcgataa acttaactgt attaaatcca ccctcactct tacaacaaac ggtgatttac 5700 ttcgggcgtg tgcatatggc actttcctcc acaatacctc cacaacccca tacttctcga 5760 acaaaaccct cagccagctt caaaaaatct atgataaggg gatctgcatt gccagtaatg 5820 ggaagatcct ctcctcagac tcagaagtga taagattagc cgactgccct aacagagctt 5880 accttccccg tcttgactat gaacgcataa taaaaaatct taacagagct ggacatccca 5940 cactcagcaa ggtgaaagga ctcgcagcct gtaacttcat gagaactctc cttcgcgagc 6000 ttaagtccac cagtcttgct gaattcacta gtagctgcct ctcgttacga atcactcact 6060 ctagcgagag actttcaaac ttttgccctt cactattcca acatgaatcc tttctgaaaa 6120 agatgggaac tctagatact ctcaaaataa agcactgctt ctcttcagag ctgagcacta 6180 accaggtcat cgctagggag ctcctcatca accttcttca ggaagaactc atcaccttta 6240 ccatcgagtc ccctcccaac atccctaaag ctgctagcca caagctcata tggccactct 6300 cctttaggaa atatatcaac tccctccccg tcaacatccg caatctcatt ctcaccagcc 6360 caaacaaaga agccctcaag aacaaatttt ccacaatcca caatcacgta gaaaactcgt 6420 actactgcgt ggactgtatc agttttgata gcacacttga taatacacaa gcgagagagg 6480 acgaccttgc cgccttcatc catgagcacc aatcaaaact caatgaactt gaccttgact 6540 gctcccaacc cattgctgaa cttgtaatgg atgttattag aatggcctaa aaatgataag 6600 tgaaagacca cgtatcctca gaacccaata ttcmctccta caaactgata acctacaggc 6660 tacaccatga gtcaaaagcg ctaaccattc cctttaggag tcctgctgct ccaacaagtt 6720 gaaatcgcca ttgaaaatgg ccacgacgtc agagcccttg accgattcat gatcatcaaa 6780 ctccgagaaa aggaaataaa actcgtcaac ttccaagagc taagctcgat tgagaaactt 6840 cagaacgcca accataccct tctcccctcg ggcattatcg aactagcctt cccaggcctt 6900 gccctgacca acaagatctc ggcccgggct gacctcactg cactacttga gaagttcgag 6960 aacaacaaaa tctctgtccc cattcgttct gttgccctcc ctggtgagcc tatgctagaa 7020 acttttcaaa aatggttcaa tatccagaat tcatccttta atcccttcgc cactgccatc 7080 ttctgggccc atctcatmaa cttgaaccac gttgagkcct catccccaat tcctaactta 7140 actggaggac tcgccctcma cataaacttc cttcctagat gtattacggc agtcaaaaac 7200 ccggatccct tccttactgg ccacatccaa cgcacattta ctggtgcccc tcgcacwctg 7260 ccactgttca acctcgccca ctgccaccct gcatttgacc tcagcaactg tggcctgcac 7320 gtcaacccag aatccccaga actgctcgat gggtggatgc awgtccckgc cctcttgggc 7380 caactcaaat cgcactggct tgacctagtg ccgtccattc ccaatttcaa aacaggaatc 7440 ccaacccaaa cgcgggatga ccttattaaa cactttactg agctccacct tgttaaaggc 7500 gataacatca ttgccactcg cgctgctctc ttctgcatct ccaaggctct ccagcacaac 7560 aatatcgaaa tatcaaaccc catctggcag cgcatcgccc acctctcgcc aactatttaa 7620 tctcagcgcc gcagtgttcg tgccaaacta gctcaccgct cctcctgact wtttttctct 7680 tcccctacac gtagaactta tctgctcgtg aagataccaa gaaactgaat tacttactta 7740 ttttcttgcc ccaatctctc accataatac ccaacaaaaa cgctaaccgc ctcccacttt 7800 agaatgagca caacaagctc kggcatattc actgacctac atgcaaaaag agcwccaccc 7860 tacacgtcgg aagaggccaa acagtaccga ttcaaccttt acatcgacaa tgtctgcata 7920 aacatcatcc aagacacacc cgaggactgc ctcagcgagc tcacctgctg cgggatcctc 7980 gtgacgaacg ctctkaatag acacggtatt gagattacag acgacacwgt agttctcttc 8040 aagatgatct cctgtgacgc cctacttaaa gccatcctaa acaaagtccc ccatgagtac 8100 attgacttct ttgcctcact caaccttctc acgaatgaag tccaaacctt cagcacggcg 8160 agctgtctag caggctacct gcaaacaatc acggagaaaa ttgtcccctc gggcattatt 8220 gcccttggta acaatcaaaa ctcaatctac cgcattagcg gcgaacttgc gttcattcgt 8280 attcggggcg aactacacgt gacggctctc aagcaatggt ccgaaaacaa tcccatcttc 8340 tcgccatwct gggatgttaa catcaaacga atactagcac catccgcaat caacgctgaa 8400 gcactcacta actggatagc tagcagactc tacatcaagc ggaaaggctg ggacatcaag 8460 tacgccttcg ccatccacct tgcmtcgtcc tctcctggcc aaagataccc caccccgtcc 8520 ggcaaggtac caatcctcac caaagacgcc catctctcta aagctgaact tacccaaaaa 8580 atcaacagcc ctcccactgt ccgcacaact accataaagc ctgctgagct tgaggtgctc 8640 aacctagcta atgctstcct ggtctcaaaa ttcagcaacc ctgcactcct ccacccgctt 8700 agagtattcg cactcagcaa atccaaaaaa gtccttgacc tcgcactgcg actataccgc 8760 tctaaaatgg ctgatgttca agatttcctt actattaaag cttgcaaagc ccataacggc 8820 ctcatgcttt ctagctctat ctgtatggcc tcctcccctg acagcaactg ttcgagcaga 8880 atctcggaaa tccttaacga agcgtcgaca ctaccgcttg agagatcgga gcacttgtat 8940 aaaaacgaaa ttcatagaat cgtgcgcgat atcacgccga atgggtctct tacgacgctt 9000 ctttcgagac ttgtcaaatc cattaatggc cacaatgcct cacccacgtg ggcggacctg 9060 ctagtggaaa tcactcaact tgataaaaaa gatgcggcag tagtcaaaaa ctactgctcc 9120 gagaaaaagc ttctcccgcc tgacmaaaaa ggagacccta attcaactct tgtgctcctt 9180 acagactctg acgaggacga tagaattgct gaaagggcat ccagcacact catgcctcgc 9240 tctgttcccc aacaagaaga tcttgacttc tctcctccct ctctcagccc aatcaacagc 9300 ccggaacagg actctccccc tttctggacg aaatccacag cctctgaaga ctaaaaatcc 9360 ctcctctctc tctcccgtct aaagaacgct ataataagcc ttgacactac tattgattac 9420 acataactct cttgcctcat gttagtatag tctaaaccca ctcactgcct taagaawttc 9480 ttattactgc cctaggatct ttagtta 9507 // ID Gypsy-21_DWil-I repbase; DNA; INV; 4264 BP. XX AC scaffold_181130; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_DWil_; KW Gypsy-21_DWil-LTR; Gypsy-21_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4264 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181130; Positions 44904 40641. XX CC Positions [1819-2361] - Reverse transcriptase CC Positions [3379-3855] - Integrase core CC 'TATATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 259..4263 FT /product="Gypsy-21_DWil-I_1p" FT /translation="MSPPLRTPAKSKNEKNEKKIVEKKTLNETFGLVEDEE FT SIKNNMEVKDVSDVEKSDPRDLDIKMQQLIHMFSPKIEADERKYAEVKINP FT EHFEKVVCVFDGYNVPVEKWFENFEQNAEAYELTDKQMYVQARNKMSGAAQ FT LYLESETVSDFQSLRQMLCCEFESNLSSADLHQQLRDRKMHSGESFHEYML FT NMRKIASAGKIEDAAVIRYIVDGLNIKSEFKFSMYNCKSLKERIKERMQYG FT VFEHVKKFDNIEKNNRKITSIAKVENIKKKEYCFNCGSSEHKRKDCNSADR FT KCFSCNESGHISTNCPRKLSAVRKINIQKRVKSVSLNGVNVDGLIDTGSDV FT TIVKNSFLKYLGKIDLKESCLMLCVLGKSQTKAKSFFETEVIVDNLKTQQK FT FLVVESNTMDCDVILGYDFVEKFRVSIDERGYTFMSLVEDTKSQEEYADNW FT YNVFDISCSYSAPPQYQKQVEALIQDSFQLNDAEKVNCPIELKIVPDGAII FT PFRQSPSRLSSPEAGAVKNQVDEWEENGIVRKSTSNVASRVVVVKKKDNTL FT RVCVDYRKLNSIVLEDCFPVPLMDEVLEKLQAASFYTIMDLENGFFHVPIE FT ESSKSYTAFITREGLYEFNRAPFGFRNSPALFIRFVNHIFQDLINKNIMQL FT YMDDIIVYAKSANECLNKTKLVLETAAKYDLKIKWGKCSFLQKRITFLGHE FT IEDGKIWPGKDKTMAISRFGIPTNVKSIQAFLGLTGFFRKFIPQYAQIARP FT LTKLLKKDVKFIMGPSELQALQQLKTVIISEPVLHLYSREAPTELHTDASK FT DGFRAVLLQLFDNNLHPIYFWSKKASEADSKRHSYILEVKAAYMALKKFRH FT YLLGIPVKLVTDCIAFKQTTSKLDIPREVAQWILYLQDFNLNVEHRSAVKM FT QHVDCLSRFPQSCLVVKDEITAHLKKAQQTDDHIIAIIEVLKQQPYKSFNV FT KSGLLFKEVEGNELLVVPKLMEREIIQRSHEVGHFSTRKTMHAIHQQYWIP FT HLESKVSRLIANCIKCIVHNKQLGKKEGYLHCITKGVTPLHTLHLDHLGPM FT DNTSKQYKFILAMVDGFSKFVWLFPTKSTGQEEVIKRLNEWTDIFGFPERI FT ISDRAVAFTSNAFKEFLRSNDVEHVLSTTGVARGNGQIERVNRCILSIIGK FT ISADEPTKWYKGVSQVQKAINTTMHSSTKMSPFEVLFGTRMRNHIDDRLLD FT ILQEEQVDQFNDERQKLRQEAKINIEKAQATYKRNYDKNRKGEAIYQVGDM FT VAKKRTQFVAGRKLASQFLGPYEVIKAKRNGRYDVKKATNGEGPMVDYMRL FT WRYVAENDELLSSESEDDYQEGR" XX SQ Sequence 4264 BP; 1473 A; 695 C; 992 G; 1104 T; 0 other; tttgggggct cgccggtttt gtttgtgtta gtgcaggtgt aatgtggaga aagtgattag 60 aaaatgaaaa tgtggttgtg ctggaattta aaaaaacaat aaagttgtaa agttgcggtt 120 gattttgtaa aagttggtac aaaacatcca aataaaagtt gtacagttgt gaaaattcca 180 gtagtgcgcg gtagcttttc taagaaatcg ttctcgaaca gtcgatcagt cggcgccacg 240 aagtcgcgaa aaaccgaaat gtcgccacca ttgcgcacac cagcgaagtc gaaaaacgaa 300 aagaacgaaa agaagattgt tgaaaagaag accttgaacg aaacgtttgg tttagtggaa 360 gatgaagaat cgataaaaaa caatatggaa gtgaaagacg ttagtgacgt agaaaaaagt 420 gatcccagag atctagatat aaagatgcaa cagttaatac acatgttttc tcctaaaatc 480 gaagcagatg agagaaagta tgcagaagtc aaaataaacc ctgagcattt cgaaaaagtt 540 gtatgtgtgt ttgacggtta caatgtaccg gtagagaaat ggtttgagaa ctttgaacag 600 aatgccgaag cttacgagct cacagacaag cagatgtatg tgcaagcgag aaataaaatg 660 agtggtgcgg cacagttata tctcgagtca gaaacagtga gtgattttca gagtttgaga 720 caaatgttgt gttgtgagtt cgagtcaaat ttgagtagcg ctgacctaca tcaacagcta 780 cgcgatcgta aaatgcacag tggtgaatcg tttcacgaat atatgctaaa tatgagaaaa 840 attgccagtg ctggaaaaat cgaagacgcg gcggttatac ggtacatagt tgatgggctt 900 aatattaaaa gcgagttcaa gttctcaatg tacaattgta aatcgctaaa agaaagaata 960 aaagaaagaa tgcaatatgg tgtattcgaa catgtaaaaa aatttgataa tattgagaaa 1020 aacaatcgaa aaatcacgtc tattgcaaaa gtggaaaaca ttaaaaagaa agaatattgt 1080 tttaattgcg gttccagtga gcataagcgt aaagactgta acagtgcgga tagaaaatgt 1140 tttagttgta atgaaagtgg acatatttca actaattgcc ccagaaagtt atcagctgtt 1200 cgtaaaataa atatacaaaa acgggtcaag tcggtgtcgt tgaatggcgt aaatgttgac 1260 ggtttgatcg atacgggctc agatgtcact attgtaaaga actcttttct aaaatacttg 1320 ggtaagattg atttaaaaga aagttgctta atgctctgcg tcctaggcaa aagccaaaca 1380 aaagcaaaaa gctttttcga gacagaagtc atcgtggata acctaaagac gcagcaaaaa 1440 ttcttggtag tggagagcaa cactatggac tgtgacgtta ttttgggata cgactttgtg 1500 gagaagttta gagtatcgat cgatgagcgt ggatatacat ttatgagcct ggttgaagac 1560 accaaatcgc aagaagaata tgctgacaat tggtacaatg tttttgacat ctcatgtagt 1620 tattcggccc caccgcaata tcaaaagcaa gtcgaagcgt tgatacagga tagctttcag 1680 ctgaatgatg cagagaaggt aaattgtcct attgaattaa aaattgtacc agatggcgca 1740 ataatcccat ttaggcagtc acccagtcgt ttgtcgtcac cagaagcggg agctgttaaa 1800 aatcaagttg atgagtggga agaaaacgga atagtgcgca aatccacttc gaatgttgct 1860 agcagagttg tagttgtaaa gaagaaagat aatactttgc gggtttgtgt agactacagg 1920 aagctaaata gcatagtctt ggaagactgc tttccagttc ctttaatgga tgaagtgcta 1980 gagaaattgc aggcagcaag cttttatacg attatggatt tggaaaacgg attcttccat 2040 gtgcccatag aggaatctag caaatcatat acagcgttta ttactagaga aggattatac 2100 gagtttaata gagcgccatt tggatttcgt aattctcctg ctttattcat tcgtttcgtt 2160 aaccacatat ttcaggatct gattaacaag aatataatgc aattatatat ggatgacatt 2220 atagtatatg ctaagtcagc aaatgagtgt ctaaacaaaa ccaaattggt attagaaact 2280 gcagcaaagt atgatttgaa aataaaatgg ggaaaatgta gttttttgca aaaacgcata 2340 acctttctag gtcacgaaat tgaagatgga aagatttggc ccggtaaaga taagacgatg 2400 gcgatcagtc gtttcggaat tcccacaaac gtcaaatcga ttcaggcatt tcttggttta 2460 accggattct ttagaaagtt tattcctcag tatgcgcaga tagcacgacc ccttaccaag 2520 ttgctgaaaa aagatgtcaa gtttattatg ggtccaagcg agcttcaggc gttgcagcaa 2580 ttgaagacgg tgataataag tgaaccagtg ttacacctct attctcgaga ggcaccaacc 2640 gagttgcaca cggatgcttc aaaagacgga tttagagcag tactacttca gctatttgac 2700 aacaatttac atccgattta tttttggagt aagaaagcat ctgaagcaga ttcgaagagg 2760 catagctaca ttcttgaagt caaggcagca tacatggcgt taaaaaaatt tcgacattat 2820 ttgctaggga tcccggtgaa attggtgaca gactgcattg catttaagca aactacgtca 2880 aagctggaca tcccaaggga ggttgcgcaa tggatattgt atctacaaga tttcaatttg 2940 aacgtggagc accgatcagc agttaagatg cagcacgtgg actgtttaag ccgtttcccg 3000 cagtcgtgct tagtagtaaa agatgaaata acagcgcatt taaagaaggc acagcagaca 3060 gatgatcata ttatcgcgat tatagaggta ctaaagcagc aaccatataa aagcttcaac 3120 gtgaaaagtg gtttgctatt caaagaagtt gaaggtaatg agttattggt tgtaccaaag 3180 ctaatggaac gcgagataat acaacgatcg cacgaggttg gacatttttc gacgcgcaag 3240 acaatgcacg ctatccacca acagtactgg ataccgcatt tggaaagtaa agtatcaaga 3300 ttgattgcaa actgcataaa atgtatagtt cataataaac agctgggtaa aaaagaagga 3360 tatttgcatt gtataaccaa aggagtcaca ccgctacata ctttacattt ggatcactta 3420 ggacccatgg acaacacctc caaacagtac aaattcattt tggcaatggt agatggtttt 3480 tcgaaatttg tatggttgtt tccaacgaag tccacaggtc aggaagaagt aattaaaaga 3540 ctcaacgaat ggacagacat ttttggattt cccgagagga tcataagtga cagagcggtg 3600 gccttcacat caaacgcttt taaagaattc cttcgatcaa acgatgtaga gcatgtactg 3660 tcaactacag gagtagcaag gggcaatggc caaatcgaac gagttaaccg ttgtatcctg 3720 tctattatcg gaaaaatatc agcagatgaa cctacgaaat ggtataaagg ggtttctcaa 3780 gtacagaagg ctattaatac aacgatgcat tcatcaacga aaatgtcacc ttttgaagtc 3840 ctgtttggaa cgagaatgcg taaccatata gatgaccgtt tgctggatat attacaggag 3900 gaacaagtag atcagttcaa cgacgagcgt caaaagctac gacaagaggc taagattaat 3960 atagagaagg cgcaagcgac ttacaagagg aactatgata agaacaggaa aggcgaagct 4020 atctaccaag ttggcgatat ggtagctaaa aagagaacac agtttgtagc gggccgtaaa 4080 ttggcaagtc agtttttggg accctatgaa gtgatcaaag ctaagcgcaa cggccggtac 4140 gatgtaaaga aagcgaccaa tggtgaaggg ccaatggtgg actatatgag actttggcga 4200 tatgtggctg agaacgacga gttgctgtct tctgagtcag aagacgatta tcaggagggc 4260 cgag 4264 // ID BEL-14_AA-I repbase; DNA; INV; 5467 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5467 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 867-867 (2011). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 46..5430 FT /product="BEL-14_AA-I_1p" FT /translation="MEELIVKRNSLFQKVKWELETAKKVLARNPPICEVRE FT RVDRLHRLGDAFFAVQGEIEDCTTDLEAIASVFNYRAEFEERYYQAKAMYI FT QMDDGSVLGDDMKDESGSTLQNAVVALLEAQRVLMSNQAVVSTQTNALTAQ FT LEDARLQPPGSRVPVAEPNLDQLLNVRLPPINIPTFNGNRKEWRSFKDLFA FT STIHNKQTLRDSQKLQYLLSYLEGEAKSLVSSFAITDANYMQVWNKLLERY FT DQNKYTVFSLVKEFLEQPVVSSANPSSLRKLVSTSDEVIRQLSAMGAQYET FT RDPWLIHLLLEKLDKESRSQWAAKLVDLQDPTFQQFLKFLENRCDALETCS FT SFTRTSTTITEAMKRETRKDQARPMEKRLQSFYVESQQCPKCSSEHQIYQC FT ESFKASSVADRRELVQKSRLCFNCLRPSHCVKNCSSKTSCKNSGCNQRHHT FT LLCQQGDSQAANPVQKPIIQSACSSPASELPEQEVHSTDELSTFKTDVSSE FT LPTKVAVLPTALIKIRDKDGSFQLARAMIDSCSGASLISEACMTRLGLSRS FT NARFPVTGVAGSAAGTTKGVVKLEISSRFNDEVILTTEAYVLDVLTAPIPS FT QTIDVKRMKFLEAIQLADPVFWKTGKIDVILGVELFLPMLRIGQVTDDNGL FT PIAQNSVLGWLVAGRFEGGTGIQALCASLNVDSNVNIDKSLRMFWETEEIP FT SQNVLTADETRAVEIFKTTYHRDETGRFVVRLPFDETKPVLGESLNTAIKR FT LRAMERQFLLNPTLKQQYEDFMMEYLRLGHMELVPESEIAKPPNECFYLPH FT HAVHKADSLTTKLRVVFDGSCSSSSGVSLNDRLLVGPNNTENLLDVLCRFR FT TYPIVFVSDIEKMYRQVRVSEDQADFLRVVYRKNPEEPVQHYRLLTVTYGT FT SCAAYLATESLRQAARDSATKYPVAADRILKGFYVDDLMSGANTVDEARTL FT VNEISTILNGAGFALRKWSSNVPELIEDITDSQQGPIPIQFTSELDSVKAL FT GIRWSPREDSFDFNVSLDVTSRNTKRQLLSDASRLFDPFGWLSPCIVKIKI FT LFQQLWLHDLTWDDPLPSAIEEEWVSIKNNLQALEQLRIPRWVANHKGGMQ FT LIGFSDASESAYAAVVYGRSVDSNGKVHITLIAAKTKVAPIQQVTLPRLEL FT NAAVLLTALVKKISSALNHLKLEVNAWTDSTVVLEWLSSHPRKWKTYVANR FT TATILEVLPRSSWHHIPSIENPADCASRGVTPHELLEHPLWWTGPPWLHNE FT TEFLEHLTTDVSNTSQQVSIIHTEVSQKNVIRSSSSFTIEQYLLHRYSSIR FT FVSRILAVITRFIDNLKAARIKHNRKTGPITPAELQASTLWMVRCAQHDMF FT AKELECLWSGTLLPRKSKLRPLHPFIDAEGTLRVGGRLQNADLPYEMKHPA FT ILPGNHRCTTLIIHDAHLSNLHAGPTLLVATLNQRYWILRCHTAVRQVTQN FT CLQCCRMRAKTATQLMGSLPAVRTMPVRAFVHVGVDYAGPITVKSGNPRKP FT LLTKGYIVVFVCLSTKAIHLEAASNLSTATFIEALKRFIARRGLCTEIWSD FT NGTNFVGADRQMQDFFQSTEFETQANRFSSNVGIKWTFIPPSAPHMGGLWE FT AAVKSAKLHLYKVLSRGPHTYEDLCTILAQIEACLNSRPLCAISNSPDDYN FT VLTPGHFIIGQPMNLLPESSVPSVPINRLDSWKMRQKQVEEIWQRWRNEYV FT SSLQPRTKWQNTKDNLQVNRLVLVQNESTPPAHWELARVVAVHPDRHGMVR FT IVTLRRGSSEYQRPIQRLVPLPSA" XX SQ Sequence 5467 BP; 1509 A; 1315 C; 1275 G; 1368 T; 0 other; tttatggtcc ttcgcgatcc gggtagaaaa agtaaccggt agaccatgga agaactcatt 60 gtgaaacgta attcactctt tcaaaaggtg aaatgggagc tcgaaaccgc aaagaaagtg 120 ctagctcgca atccgccgat ttgtgaggtt agggagcgcg tcgatcggct tcatcgactc 180 ggagatgcct tttttgcggt tcaaggtgaa attgaagact gcactacgga tttggaagcc 240 atcgcttcgg tatttaacta ccgtgccgag tttgaggaga gatattacca ggcaaaagca 300 atgtacattc aaatggatga tggatctgtg ctgggcgacg atatgaagga cgaatcaggc 360 agcaccctac aaaatgcagt ggttgctctg ctggaagcac aacgtgtact tatgtccaat 420 caagctgtag tttcaacgca aacgaatgcg ttaactgccc agcttgaaga cgctcggcta 480 cagccacctg gtagccgagt tccggttgca gaaccaaact tggaccaact gttgaacgtt 540 cgtctccctc ctatcaatat tccaacattc aacggaaatc gtaaagagtg gcgatcattc 600 aaggatcttt ttgctagtac aatccacaac aagcagacat tacgggattc gcagaagctc 660 cagtatttgc tctcttacct tgagggcgag gcaaaaagtt tagttagctc cttcgcgatc 720 accgacgcta attatatgca agtttggaac aagcttcttg aacgatacga ccagaacaaa 780 tacaccgtat tttcgttggt gaaggagttc ctagagcaac cagtcgtatc atctgcgaac 840 ccgagctctc ttcggaagct ggtatcgaca tcggacgaag ttataaggca gttgagtgca 900 atgggtgccc aatatgaaac gcgtgaccct tggctcatac atctactctt ggaaaaactt 960 gacaaggagt cccgttccca atgggcagca aaactcgtgg atttgcagga tccaacattc 1020 cagcaattct tgaaattcct tgaaaaccgt tgtgatgctt tggaaacttg ctcttcgttc 1080 acgcgaacca gcacaacgat cacggaagcc atgaaaaggg aaacacggaa ggatcaagct 1140 aggccgatgg agaaaagatt gcaaagcttc tacgtagaat cccagcagtg tccgaaatgc 1200 tcttcggaac atcagatata ccagtgtgag tcgttcaagg cctctagcgt cgctgaccgt 1260 agggagttag tgcagaagtc tagactgtgc ttcaactgcc tcaggccatc acactgcgtg 1320 aaaaattgtt cgtcgaaaac atcgtgcaaa aacagtggat gcaaccagcg gcatcatacg 1380 ctcctctgtc aacagggtga cagccaggca gcgaatccag tgcagaaacc gataatccaa 1440 tcggcctgca gttctccggc ctcagagctg ccggaacagg aggttcactc tacggatgag 1500 ctgtcaacct tcaagacgga tgtgtcgtca gaactgccta ccaaagtcgc ggtattacca 1560 acggcactga tcaaaattcg cgacaaagat ggttcgtttc agctcgcaag agccatgata 1620 gattcctgct ctggggcgtc actaatcagc gaagcttgca tgacgcgcct tggactttca 1680 agaagcaacg ccagatttcc cgtcactggt gtggcaggat ccgcagcagg tactaccaaa 1740 ggtgtggtaa aactggaaat atcgtcccga ttcaacgacg aagtgatctt aacaacagag 1800 gcatacgttc ttgatgtact gactgctccg attcctagcc agactatcga cgtgaagcga 1860 atgaagtttc tggaagcaat ccagttggct gatccggtgt tttggaaaac aggaaaaatc 1920 gacgttatcc tcggggttga actctttcta ccaatgcttc gtatcggtca agtaaccgac 1980 gataatggat tgccaatcgc acaaaactca gtgcttggtt ggctggtagc tggacgattc 2040 gaaggtggaa ctggcattca agctctatgt gcgtccttaa acgtcgattc aaatgtcaac 2100 attgataaat cgctacgaat gttttgggag actgaagaaa ttccttctca aaatgttctt 2160 actgcagatg aaacacgagc cgtggaaatt ttcaagacta cttaccatcg cgatgaaact 2220 ggtcgatttg tcgttcgact accattcgac gaaacgaagc ccgtgctagg agagtcgctt 2280 aatacggcga tcaagagatt acgtgccatg gaacgacaat ttctattgaa cccaacgctg 2340 aagcaacaat atgaagattt catgatggaa tatctacgtt tgggtcacat ggaactcgtt 2400 ccagaatccg agatagcaaa acctcccaac gaatgttttt atctgccaca ccacgctgta 2460 cacaaggcgg acagcttaac gaccaaattg cgcgtcgttt tcgatggttc ctgtagtagc 2520 tcatccggtg tatcactcaa cgatcgtttg ctcgtcggtc cgaacaacac cgagaaccta 2580 ctcgatgttt tgtgtcgctt ccgcacatac ccgatagtgt tcgtaagtga tattgaaaag 2640 atgtataggc aagttcgcgt atccgaagat caggctgact tcctcagagt tgtgtatagg 2700 aagaaccccg aagaacctgt tcagcactat cgtctgttaa ctgttaccta tggaacttcg 2760 tgtgccgcat accttgcgac ggaatcactt cgacaagccg cccgagacag tgctacgaaa 2820 tacccggttg ccgcagatcg aatcctaaaa ggtttttacg tagatgattt gatgtccgga 2880 gcaaatactg tggacgaagc tcgaacttta gtcaacgaga tttcaactat cctcaacggt 2940 gctggatttg cccttagaaa gtggtcgtcg aatgttcctg agctcattga agacatcact 3000 gatagccagc aaggtccaat ccctatacaa ttcacgagtg aactggattc agttaaagct 3060 ttgggaatac gatggtcgcc gagagaagat tcgtttgact tcaacgtttc actcgatgtc 3120 acaagtcgca acaccaaacg tcagcttctt tcagacgctt caaggctgtt cgatcccttt 3180 gggtggctat ctccgtgtat cgtcaagatc aaaatcctct ttcagcagct ctggttgcat 3240 gatctgactt gggatgaccc actaccatca gccattgaag aagagtgggt gagtatcaag 3300 aataatcttc aagctttgga gcaattacgc attccacgtt gggttgcgaa tcacaaaggg 3360 ggaatgcagt taatcggatt ctcagatgca tcggagtcag cctacgctgc agttgtctat 3420 ggtcgatctg ttgattcgaa tggtaaagta catattacac tcattgccgc caagaccaag 3480 gtggcgccga ttcagcaagt tacgcttcct cgacttgaat tgaatgctgc tgtgcttcta 3540 acggcgctgg taaagaaaat atctagtgcg ctcaatcacc tcaaactaga agttaacgct 3600 tggacagatt ctacagtggt tttggaatgg ctgtcatcgc accctcgaaa gtggaaaacc 3660 tacgttgcta ataggactgc tactattctg gaagtacttc caagaagttc atggcatcac 3720 attccatcca tcgaaaaccc agcagactgc gcatctagag gagtgacacc acatgaactt 3780 ttggaacacc ctctttggtg gactgggccc ccgtggcttc ataacgaaac tgaattcctt 3840 gagcatttga ctactgatgt ctctaataca agtcaacaag tttcgataat tcacactgaa 3900 gtctcgcaga aaaacgtcat tagatcgtca tcgtctttca ccatcgaaca atatcttctg 3960 caccgctatt catcaattcg attcgtcagc cgaattctcg ctgtcatcac tcgcttcatt 4020 gataatttga aagctgcacg tatcaagcac aatcgcaaga ccggaccaat taccccggca 4080 gagctgcaag catcaacttt gtggatggtt cgctgcgcac aacacgacat gtttgctaag 4140 gaactggaat gcctttggag tggaactctt ctacctcgaa aaagcaaact tcgcccgttg 4200 catcctttca ttgatgctga aggaacactt cgagttggag gaagacttca aaacgccgat 4260 ttgccgtacg aaatgaaaca tccagcaatt ttacctggta atcaccgttg cacaacactt 4320 atcattcacg atgcgcattt gtccaaccta catgctgggc caacgttgtt ggtagcaact 4380 ctcaatcaac gctattggat tttgcgatgc cacactgcag tccgtcaagt aacacagaac 4440 tgcctacaat gctgtagaat gagagcaaag actgcaactc agctcatggg aagcttgcca 4500 gctgttcgca ctatgcctgt gcgagctttc gtccacgttg gagttgatta tgctggtcct 4560 atcaccgtaa aaagtggaaa tccacggaaa ccgctactaa ccaagggcta tatagtggtt 4620 tttgtgtgcc tgtccactaa ggccattcac cttgaggcgg ccagcaatct atcaacagct 4680 acgttcattg aagcgctcaa gaggttcatc gcccgccgtg gactatgtac tgagatttgg 4740 tccgacaatg gaacaaattt tgtcggtgct gatcgtcaaa tgcaggattt tttccaatct 4800 acagaatttg aaacacaagc caaccgtttt tcatccaacg ttggtatcaa gtggacattc 4860 ataccacctt ctgcacccca tatgggtggt ttatgggagg ctgcggttaa gagcgcaaaa 4920 ctgcatctat acaaagtgct cagccgtgga ccacacacgt acgaggactt gtgtaccata 4980 ctggctcaga ttgaggcatg ccttaattca aggccattgt gtgctatctc caactcacca 5040 gacgactata atgttctgac gccagggcat ttcatcatcg ggcagccaat gaacttgttg 5100 ccagaatcat ctgttccgag tgttcccata aatcgtcttg actcttggaa aatgcgccag 5160 aagcaggttg aagaaatatg gcagcgttgg aggaatgaat atgtgtcatc cctacaacca 5220 cgcacaaagt ggcagaatac gaaggacaat ttgcaagtca acaggcttgt tctagtgcaa 5280 aacgaaagca ctccaccagc tcactgggag ctcgcacgtg tagttgcagt ccaccctgat 5340 cgtcacggca tggtgaggat tgtgacacta cgtagaggat catcggagta tcagcgaccg 5400 attcaaagac tcgtgccact gccgtctgcg taggtcattt gaggcattcg cctcaaggcg 5460 gggactt 5467 // ID Harbinger-N6_BF repbase; DNA; INV; 309 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N6_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N6_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-309 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-309 RA Kapitonov V. and Jurka J.; RT "Harbinger-N6_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 819-819 (2008). XX DR [2] (Consensus) XX CC It has TWA TSDs and forms a palindrome. XX SQ Sequence 309 BP; 83 A; 73 C; 71 G; 82 T; 0 other; aggtggggtc acacctgcgt atatatttaa gtccgtatga ggcgcgcatg ggagtatttg 60 cctccaccag gccccacagg tcgttggaaa aataatagaa attggacaaa cgtaaacagg 120 taacatgtca gacgagttag ctgggtcgct aaccatactt ctcggtcagc taactcatct 180 ggcatgttat gtatctatat ttgtccaatt tgtactattt tccccgcgac ctgtggagcc 240 tggtagagac taagagcttc atacgcacct caaacggact tgaatatata cgcatgtgtg 300 accccacct 309 // ID Gypsy-22_DWil-LTR repbase; DNA; INV; 370 BP. XX AC scaffold_181130; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_DWil_; KW Gypsy-22_DWil-I; Gypsy-22_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-370 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181130; Positions 406878 406509. XX SQ Sequence 370 BP; 165 A; 78 C; 39 G; 88 T; 0 other; tggtcactaa attggagacg tggcatcgac tcttttctgt tcccagaacg attgataatt 60 tatatatgat gatcatacct acacaaaaca tacatacaca aaacatacat acataaaaca 120 tacatacata tgatacataa aacatacata cataaaacat acatacataa aacatacata 180 cataaaacat acatacacac atgcattaat tcgattgtaa cagacacaca aagatagaca 240 aaaggtagag aatccaaaaa cattatgaat aaaaagacac ttctgtttta acatctgaat 300 cgaatagacg tttgagttac tcggggaaaa accaaataca cacacatctc acacaccctc 360 attcaaacca 370 // ID Copia-1_ACA-I repbase; DNA; INV; 4252 BP. XX AC AEYA01002308; XX DT 23-MAR-2011 (Rel. 16.03, Created) DT 23-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Acanthamoeba castellanii genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_ACA_; KW Copia-1_ACA-LTR; Copia-1_ACA-I. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-4252 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Acanthamoeba castellanii genome."; RL Direct Submission to RU (23-MAR-2011). XX DR Genome; AEYA01002308; Positions 233876 229625. XX CC Positions [1628-2122] - Integrase core CC 'GGTTG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1400..3109 FT /product="Copia-1_ACA-I_1p" FT /translation="MLVLPLDPLLLERAYATRVDLSLDVLHRRFGHACERR FT LCTMLKAKGIMPATKSLLPCAICVKNKTMRKPICKGPAVRSTTPMEQLHTD FT VCGPFPVATKTGKRYFISIINNTTRFAIIIAICEKSNIEAMLRSHLAATPA FT DLKCQRLRSDQGSKYTGKRVQELLRAAGIVHKATSSHTPEHNGVAEQFNRT FT IVEMVQCMLHDSGIAQCYWGEALHTTTAIYNHLPTNANDSTSPLDRWDADH FT AGALEDMHQFRAKVEVLVPPGERTKLSARTRTRVYLGPVDSGTSNHQVLVQ FT GHILVTCEVVFPYDAHDVLMGNNMSITVAPAVVNPLAKEANVGSAPQWASI FT NQSTTPAPSVLGASGSAKTSPPRSPSQVELSVIKRTPQGESPMQVEPLDPL FT TPPDMPVMDDDPALKESEEEEHAAPPSLEADNVIVVVKKKWPMLKKPLAHE FT CKKNTKFYNDDFHAFAAEVLQALDVSDSACTPNSYDEAVALPEAAEWIKAM FT KAEYGALDHNGMFELALLPVGLRAIRLRWLFKIKHKASGVINHLKARWVRK FT GYTQRLGIDYNKTFSLVVTMEHL" XX SQ Sequence 4252 BP; 1006 A; 1180 C; 1182 G; 884 T; 0 other; ggttatgggc ccacgctcaa ctggaagatg aacccatttg aagctcaatt gaaacttctc 60 aactccaaac tggtactcaa atccgcaagt ggttatgcga cgtggcgcaa tgatttgtct 120 gtgactctac agtcttgagg actgtttgag tacgcctttg ggtccatgga ggaactgaag 180 gggaggaagg atgagagtga agtgagtttg gaactttgaa aggacgattt cagacagaag 240 gtgcaacaga ctgtgggcta catctgcatc tgtctggatc ggccattctg agccatgatt 300 aaggggcttg agaccaaccc gtggaagatg atggctaccc ttgatgtgaa tcttgcgctg 360 aaggctaatg tgagcaagtt gacgctgttg acacagctca tcaacatcaa gcgtgagact 420 ggagaagcgc tcaactccta ctttggctgg attgttgaca tcaacatgga acttgtcaac 480 aacgacttgg cgctccctga agtgttcatc cttatggtga tcctgaatgg gctgctggtc 540 gagtacaata ctgtgtggcc agtaattgag gtgcagacca agattgacct gcatgacgtg 600 atgagtcgac tctgtgactg tgaggtgaac ttgaatgtgt gttctgaaca gaatgaggct 660 gccaatgcca tgtggcgtga caagcctgat cgtgcgcgcg gcaagcgtgg tggcgaaggt 720 cacggcccgt cgaagcagga caagggtggc gagcgccatg gccacagttg gaagtcgagc 780 aagggcccca agcaggggcc gcaggagggc gacgagacca cctgccacta ctgccacaag 840 cccggccact ggaggaacaa gtgtccgaag ctcaattggg gcgacagtgg cgatggtggt 900 ggcggcggtg gcgacagcca caaagtgaat gccacttggg cacagcacaa gagcgacaag 960 gacgaggaaa tcgtcctgtc tgtcgctgac caggtaaacg ctgtggccaa cgagaccaac 1020 cagtggtatc tggatttggg cgcgacctgc catgtcacct gtcgtcgtga actgctccat 1080 gactaccagc ccacgctgtc ctcgccttcg catatgcgac tctgtggttc aaagccgcag 1140 cactgtcaac ttggtgctgg gtgacaactt caagtgctgt gtcaagggga caggcaccat 1200 ccgcgccacc attgtcatag acggtacaat gaagactgtc gtgctcatgg atgtgtacta 1260 tgcccctgag ctggcgaaaa atctggtctc tatggcgtga atcgccaagt tgggttgcac 1320 catcctcatc aacaccgcgg ggtgccgcgt ggtggacggg cgtggacgcg tggtccttca 1380 agggacagcc cgaggcaata tgctggtgct gccacttgac ccgttactcc tggagcgggc 1440 atacgccaca cgcgttgacc tctcacttga cgtgctacac cggcgttttg ggcatgcgtg 1500 tgagcgtcgc ctgtgcacaa tgctcaaggc gaagggcatc atgcctgcca ccaagtcgct 1560 cttgccgtgc gccatttgcg tcaagaacaa gacgatgcgc aagcccattt gtaaggggcc 1620 agcagtgagg agcactacgc caatggagca gctccacact gacgtgtgcg ggccgtttcc 1680 cgtcgccacc aagactggca agcgctactt catctcaata atcaacaaca ccactcgttt 1740 cgccatcatc atcgccatat gtgagaagtc caacattgaa gccatgctgc gcagccatct 1800 ggctgcaaca cctgcggacc tcaagtgtca acgcctccgc agcgatcagg gcagcaagta 1860 cactggcaag cgagtgcagg agcttctgcg tgctgccggc atcgtccaca aggccacgtc 1920 atcacacact cctgaacaca atggcgtcgc cgaacagttt aacaggacca ttgttgaaat 1980 ggtccagtgc atgttgcacg acagtggtat tgcccagtgc tactggggcg aggcactgca 2040 caccaccacc gccatctaca atcacttgcc caccaacgcc aacgacagca cgtcgccact 2100 ggatcggtgg gacgctgacc atgcgggggc actcgaagac atgcaccagt tcagggccaa 2160 ggtggaggtg ctggtaccgc ccggcgagcg gaccaagcta tctgcgcgca cacgcaccag 2220 agtttacctt gggccagtgg acagtgggac cagcaaccat caggtgctgg tgcaagggca 2280 catccttgtg acatgcgagg tcgtcttccc ctacgacgca catgacgtgc tcatgggcaa 2340 caacatgtcc atcacagtgg cgcctgccgt tgtgaaccca ctggccaagg aggccaatgt 2400 gggctctgct ccgcagtggg cctccatcaa ccagtccacc acaccagcgc catctgtgct 2460 gggcgccagt ggatctgcca agacttcacc gccacgctca ccaagtcagg tggagctgtc 2520 tgtcatcaaa cggacaccac agggcgagtc gcccatgcag gtcgagccac tggaccccct 2580 cacaccacct gacatgccag tgatggatga tgaccctgct ctcaaggaga gcgaggagga 2640 ggagcatgct gctccaccca gcctggaggc tgacaatgtc attgtagtgg tcaagaagaa 2700 gtggcccatg ctgaagaagc cactggccca tgagtgcaag aagaacacca agttctacaa 2760 tgatgatttc catgcattcg ctgctgaggt gttgcaggcg cttgatgtca gtgacagcgc 2820 atgcactccc aacagctacg atgaagctgt tgccttgcct gaggctgctg agtggatcaa 2880 ggccatgaag gctgagtatg gtgcgcttga ccacaacggc atgtttgaac tagctctgct 2940 gcctgtgggg ctgcgagcca tcaggttgag atggctattc aaaatcaagc acaaggccag 3000 tggtgtcatc aatcatctca aggcaagatg ggtcaggaag ggctacactc agcgactggg 3060 cattgactac aacaagacat tctcactggt tgtcacaatg gagcatcttt gacagttgct 3120 caccctcacc atgatcctca atcttgaggt ccaccagatg gatgtagaga acgccttcct 3180 caatgctact ctctctgtcc acatctacat tgagcagccg caaggattcc tcaacctgga 3240 gcggccagac catgtctgcc tgctgtgcaa gagtctatat ggactccatc aagcgctgct 3300 tgagtggaac aggatgatgg accgccatct gtgcagccac cacttccttc ccactcacac 3360 caacccctgc atctatgtgt tgcaggagtc aagtgccctc gtcatcatca ctgtctacat 3420 caacgactgt gtgatagtcg cccctcttga gcacgttgag cacaccaaga cggtgctgca 3480 tgacagcttc aggatgaagg acctgggcca agccaggtcc atcctgggca tggaggtcat 3540 gtgtgactgt gaggaggggg ccctctacct gtgtcaggct ggcaagatca tggagattct 3600 ccacaacttc agcatggcca acgtgaaatc catcagcacg cccatggacc ctggcctcat 3660 cctccacaag ctcaatgtca cagcaccagg gcacctcaga aaaccatacc agtcggtagt 3720 ggggcggctg agctatctgt ctcaggccat gtgtctggat attgtgttca ctgtcaatgt 3780 gctgagctgt catgtcaaca gctacaacca gtcccattgg gggcgccatc aagcacattc 3840 ttcagtacct gcgcaccacc aaggaccttg caatcaagta cgccaccagt gggtcttgtt 3900 cccagtctgg cggcctgcta ccagtaggct acacagacac agactggggc ggggacgttg 3960 agacacagca ctccatgtca ggcatttttt tcacacttag tggcagtccc attcaatggg 4020 gcgcacacac tcacaagtgc atagccacct ccacaatgga ggcagagctg aacaccatcg 4080 ccgagacaac gtttttatgt gaaagtgtga gtaagctgaa cattgaattc aaatacctcc 4140 ccacggaggt catgcctggt gatgtgttca ccaaggcgct ggggcgcgca tgtgtagttg 4200 agctacgctc tctcatcaac cttgttgaac tgaagattga gagcaagggg cg 4252 // ID DNA8-13_CQ repbase; DNA; INV; 1159 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-13_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1159 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 90-90 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% identity. CC 8-bp TSD. ~300-bp TIRs. XX SQ Sequence 1159 BP; 404 A; 172 C; 166 G; 417 T; 0 other; tagagcgtcc aatttcccgg ggttacaaat ttcccgggaa acgggaaatt ttcagccaat 60 ttcccgggaa atcccgggaa ttcccgggaa attttaaatt tattgaaaat tgttatgatc 120 ttggttttaa ttaatattat gcaacaaaat tgtataggac atcaacatta atggtttaaa 180 taagtgtgaa gatcaattaa cagcttgact gcatgtaaaa aatcattcaa ctacaagaaa 240 atgtattttg gtattttttg atgagaaatt ttaaacttgc ctctgtgacc agtttattga 300 aatgagaaaa aaaaaacatg cagaaattat gtttttcatg acaaatactg gtttcagctc 360 ttgaacaaat ggtaaagctt tttagtgttg gttttactcc aaacaaccca atgttttcaa 420 atatttgaag caagctttga atatattttt gcattttttg agaacaaaca atagaaagta 480 atttttcaat gatttttttt gtattattat gcaacatgat ttaaattata cgataaatta 540 acatcattcc acaaaaagtc ttctattttt tcacatttaa ccctctaaca cctgtagttg 600 caccagtgct ccataattct tttacttcaa attgtaattt ctttagtact aggtattcaa 660 ccaattgaat taattctttt tgcacaaatt cgattttcat ctcatttgat catggtaaac 720 aaaaacgtaa aaaaatctca attttaattt ggaggtaagg agttaatatt gttttgatga 780 aataacatca atctgttaaa agcttaccta attgtaataa tttaatactc attaagcaag 840 atctattccc agttgcttca aaaattaata ttatcagaaa taattctgat atcataattt 900 ctgataatct gattaattat atataaacca aacaatacat ttaattgaac aaaatcatgt 960 tgagtttgtt catttattat tttttcagag cattttcaaa aaatagttcc aaaaatttaa 1020 gaaaatgtat acttccgctt gaatttcggg aattcccggg aaatttacaa atttcccggg 1080 aaacgggaaa tatttttttt cgggaaatcc cgggaattcc cgggaatttt tttcccggga 1140 cgggaaattg gacgctcta 1159 // ID Helitron-3_AAe repbase; DNA; INV; 6792 BP. XX AC . XX DT 30-DEC-2010 (Rel. 16.01, Created) DT 30-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE Helitron-like sequence from Aedes aegypti: consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6792 RA Jurka J.; RT "Helitrons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (30-DEC-2010). XX DR [2] (Consensus) XX CC >98% identical to consensus. The ORF is corrupted. XX FH Key Location/Qualifiers FT CDS 796..1989 FT /product="Helitron-3_AAe_1p" FT /translation="MVKELPRQLDDDCAFNVCIKKHMIHKSSYLSGYVKKG FT TVKAWLNYLVATPLYRRNGITFSEEHLAAIEPVQQLGIPQTANTSFDLEVI FT DASNEVELLIGQQHTLLWNEDKCLEIAPGQNRTPLSIIYDEFAEELSFPDI FT YLGYPRSFNPEVRVTPFMMATSEIRRRDRRGAKPEHILFMAMKIMRLRVSE FT GLKNTFKCMGTANITRAQLQDREFLETCIERNLSFLKSIPNSIQYWQQRKR FT DVFAMIRQLGKPTMFLTLSANEIRWPHLLTVLHKLSNGSSDAGVANIMQQL FT TALQRATLVSEDPVTCCVYFNKLVNVIMQLLSSTRYSPFGKYYVVDYFKRI FT EFQHRGSPHAHILLWLANDPREDMSENMPATVQLIDMLCSVRADDLTETYG FT NQVN" FT CDS 2719..5349 FT /product="Helitron-3_AAe_3p" FT /translation="MWPHERIKSRKRTKQMDEEELEEDSTDVWTLNIIQRY FT EAREGMDEVCLADFAALYTEERRAKNTYKIRRFPRILRWCGYNMSELVEYK FT REMVLLFLPFRNEVCDVLDRNKFLMLYEDNEAAILAKHKEYDCELNLDQVV FT AEYIRLSDEDDVQQEVANKKRDEYVRTISMEPNDDDINNLPNGPMAAVVKQ FT RANVMSKQEYCEMVRATNAEQRDLILHIIHSLHSFDDSIKPVQIFFTGPAG FT CGKTFTLRILMETINRFSQAHNSQNNSYVACASTGKAAVAIGGTTVHSAFR FT ITMSRRQSSKLSFESLQLYRNAFANVKAIIIDETSMLGADVLNTVHVRLQE FT ITGNYDDPFGGMQIVFCGDLRQLPPVNARPVFKPSVNSMHGAVLWQSLEFY FT PLVQVMRQTNEQFSTILTKIGNGEQLSPDEVVIIESRFRTAEWCQQHVPRA FT IRLFHRNVDVERYNSIALVDREAAECTADDIYSGFKDNSQLAGARTKVHKM FT SVAETGCLPYLLRLAVGTPYMLTTNVDVEDGIVNGAIGELMFVERDEDDTQ FT QQITRLWIKFENDSIGRMLRVKARPLVYSKPGVLKAEWTPIAKRSANISLN FT GGVKCKRVQFPVVSACALTVHKSQGGTFSEVVYNYEKSQEQQLVYVGLSRV FT TSIEGLYLTNPSNDFKFHHGKAAVTPRIKDLRTELGRLNNHRLRTIGEDVM FT ATIASNTSAVMLMSINVQSLNAHSLDITTDRVLNAVDLLALSETWMENGTQ FT TVLAEFDCITQEKRDETRAGGVAIYQNTAASTAAVSHAIDKLSASYDAMLG FT EADKYGDICAAEISVMGTRTLLFSLYISPGMTKTEFVFLIFIINSFLLQAP FT RSSKRNFSWRVIWSCTAKRPCQSS" FT CDS 2115..2729 FT /product="Helitron-3_AAe_2p" FT /translation="MDQTRILLPITTGDGRLDQLRRAAVSMRDALETKAYD FT SMEAFLVDRNCTYEYYLDVIRSSIRRPTVMLKRSMSELWTNPFNPWIGKIM FT RSNMDLQFVLEEFSCAAYVVEYVNKTNRGISSLHRELIKLQEEHPDHDYNG FT LLKKVSIKMLNSVEMSAQEAAWYLLRQPMSEASRKVLFCLLHIGAGHANSH FT LFYLIGGIHSDDVAT" XX SQ Sequence 6792 BP; 1830 A; 1578 C; 1560 G; 1824 T; 0 other; ttaacaattg ctttaaattt tatcattttt attatactgt caaaaactgt gtctaaaagt 60 tattccccta tcacgttcta ctcgaggaga attcaacacc actcgtacga ttcccgtacg 120 taccccatca ccacgtcatg ttcgagaaca attaaacgtc ggtgatattc cagtggtgct 180 tgacgcattt catgattcct catcgtatcc catggaagaa gaagccaacg atgttcctgt 240 tatcccggaa gaagacgatg gcgatgatcg gtatgaaaat gaggatcctt tgctgtattt 300 aactcagggt gaacccaact tggaccgtgc tgatcgagaa ttcgataaac gtttcataga 360 aaacgagttc ggctatgcct gtgacgtgtg tgcacgaatt tggtttcaga atgatctgaa 420 accaatcagt aatgccgatg gtgccgtgtt gttggcggcc aactatttcg aaactgttga 480 aggattctca gcgtgtctta cttgtcgaag cagcttgaaa cgtggcctga tccctacact 540 gtcccaaacg aatggcttta catatccaaa atttccgtca aatctgcctc cgcttgatcc 600 gatcaccacc agactggttt caccaagaat ctgctttatg cagctgcgcc gtctgcggca 660 tgcagcaggt ttgtagttta tagtggtatt tttatttata gatatatttt attcatttcc 720 ctgtataatt tttgtgaaat aggtagtttg gccattattg gtcaaattat caacgttccg 780 gtggatgtgt ctgcgatggt gaaggagctt cctcggcagc ttgacgacga ctgtgcattc 840 aatgtgtgta tcaagaagca catgatacac aagtcaagct atttgtccgg gtacgtcaag 900 aaaggtacgg ttaaggcctg gcttaactac cttgtcgcca caccgctgta taggcgaaac 960 ggcattacgt ttagtgaaga acacttggcg gccatcgagc cagtgcagca acttggaata 1020 ccccaaacgg caaatacatc tttcgatttg gaggttatcg acgcatcgaa cgaggttgag 1080 cttctcattg gtcaacagca taccctgttg tggaatgagg acaaatgtct cgaaattgcg 1140 ccaggccaga acagaactcc actatccatc atctatgatg aatttgcgga agagttgtcc 1200 ttcccggaca tatatctggg atatccaaga tccttcaatc ctgaagttcg cgtaacgcca 1260 ttcatgatgg ctactagcga aataagacgc cgtgatcgac gaggagcaaa gccggagcat 1320 atccttttca tggcgatgaa gatcatgcgc ttgcgagttt ccgagggatt gaagaacact 1380 ttcaaatgca tgggtacggc caacataact cgcgctcagc tgcaagaccg ggagtttttg 1440 gagacatgta tcgagcgtaa tctgtccttc cttaaatcca tccctaactc gatccagtac 1500 tggcagcaaa ggaagcggga tgtctttgcc atgatccgcc agttggggaa gccaaccatg 1560 ttcctcacat tgagcgcaaa tgaaattcgt tggccgcatt tgctcaccgt actgcacaag 1620 ctgtcaaacg ggtccagtga cgccggtgta gcaaatatca tgcagcagct gactgcgcta 1680 cagcgtgcta ccctcgtaag cgaggatccc gtgacctgtt gtgtatattt taacaaactg 1740 gtgaatgtga tcatgcagct gctctcgtcc acacgataca gtccatttgg gaagtactat 1800 gttgttgatt atttcaaacg catagaattc cagcaccgtg gcagtccgca cgctcacatt 1860 cttctctggt tggctaacga tcctcgggaa gacatgtcag agaatatgcc tgccactgtt 1920 caactgatcg acatgctctg ctccgtccgt gcagatgacc taactgaaac ttatggcaat 1980 caggtaaatt gaaagtatct atagtgtgat gagtgatttc ttatgagtat aatgttttct 2040 aggtccataa acatacgttt acttgcttca aacgcaatga taagcggtgc cggttcaata 2100 taccatattg gccgatggat cagaccagga tactactccc gatcacaaca ggagatggcc 2160 gacttgatca gttgcgcaga gctgctgtga gcatgcggga tgctctcgaa accaaggcat 2220 acgatagtat ggaagcgttc cttgttgacc gcaactgcac gtacgagtat tatcttgacg 2280 tgatccgttc ctcgattcgg cgaccgacag tcatgctgaa acgctctatg tcagaactct 2340 ggaccaaccc tttcaatccc tggattggta agattatgcg ctccaacatg gacctgcagt 2400 ttgtcttgga ggagttctcg tgtgctgctt atgtcgttga gtatgtcaac aaaacaaaca 2460 ggggcataag cagtttacac cgcgagctga taaagctgca ggaagagcat cccgatcacg 2520 actataatgg attgctgaag aaagttagca tcaaaatgct caacagcgtg gaaatgtccg 2580 cccaagaagc tgcgtggtat ttgctcaggc aaccgatgtc ggaagctagc cgtaaggtac 2640 tattttgttt gttacatatt ggtgcaggtc atgctaattc tcacttattt tatcttatag 2700 gtggaattca ttccgacgat gtggccacat gagcggatca agtcgcggaa acggacgaag 2760 caaatggacg aagaagaact cgaagaagac tccaccgatg tttggaccct caacattata 2820 caacgctatg aagcgcgtga aggcatggac gaggtgtgct tggcggattt tgcagcactt 2880 tatacggagg aaagaagggc gaagaatacg tacaagatac gcagatttcc ccgtatattg 2940 cgttggtgtg gatacaacat gtcggaactg gttgagtaca aacgggagat ggtgctgctc 3000 tttctgccat tcagaaatga agtctgcgat gtactcgatc gcaataagtt tctcatgttg 3060 tatgaagaca acgaggctgc tatactcgcc aaacacaagg agtacgactg cgagctgaac 3120 ctggaccagg tcgtagccga gtatatccgt ctgtctgatg aagatgacgt acagcaagag 3180 gttgctaaca agaagcgcga tgagtacgtt cgcaccatca gcatggagcc gaacgatgac 3240 gacataaata acttgcccaa tggacccatg gcggcagttg tcaagcagcg tgcaaacgtc 3300 atgtccaagc aggagtactg tgagatggta cgtgcaacaa atgcggagca acgtgatttg 3360 attttgcaca tcattcacag tttgcacagc ttcgatgaca gtatcaaacc cgtgcagata 3420 ttcttcacag gcccggcagg ttgtggcaag acgttcactt tgcgaatact catggagacc 3480 ataaaccgct ttagccaagc acacaattcc cagaataatt cgtatgttgc ttgcgcatca 3540 actgggaaag cagctgtggc tatcggagga actacagtgc attccgcctt tcggatcaca 3600 atgtctcgga ggcaaagttc aaagctaagc tttgagtcat tgcagttgta ccgcaacgct 3660 ttcgcaaacg tgaaagctat catcatcgac gaaaccagca tgcttggtgc tgatgttttg 3720 aacaccgtcc acgttcgact ccaggaaatc accgggaatt atgatgatcc gtttggtgga 3780 atgcagattg tattttgcgg tgatcttcgt cagctacctc cagtaaacgc acgtccagtc 3840 ttcaagccga gtgtcaactc aatgcacggt gccgtcttat ggcagtctct cgagttctac 3900 ccgttggtcc aggtaatgcg acaaaccaat gagcaattct ccaccatcct taccaaaatc 3960 ggtaatggag agcagttgtc tccggacgaa gtcgtcatca ttgagagcag attccgtaca 4020 gcggagtggt gtcagcaaca cgtgccgcgt gcaatccgct tgttccatcg aaacgtcgac 4080 gtggagcgat acaattccat cgcattggtg gatagggagg cagctgaatg cacggcggat 4140 gacatttatt ccggtttcaa ggacaactct cagctggctg gtgctcgcac aaaggtgcat 4200 aagatgagtg tagcagagac tggttgccta ccgtacctac tacggcttgc tgtgggtact 4260 ccgtacatgt tgacgaccaa tgtcgacgtt gaagacggaa tcgttaacgg tgcgatcgga 4320 gaactaatgt tcgtcgaacg cgacgaagac gatactcagc agcagatcac taggttgtgg 4380 ataaaatttg agaacgactc cattggaaga atgttgagag tgaaggccag accactggta 4440 tactccaagc ctggtgttct gaaagctgag tggaccccaa tcgccaagcg atcagcgaac 4500 attagcctca atggtggagt taagtgcaag agggttcaat ttccggtggt gagcgcttgt 4560 gcccttactg tacacaaatc gcaaggaggc accttctcag aagttgtgta caactacgag 4620 aaaagccagg agcagcaact ggtttacgtt ggcctttcaa gggttacttc catcgaagga 4680 ctgtacctga cgaaccccag caacgatttc aagtttcacc acggaaaagc cgctgtaacg 4740 ccaagaatca aagatctccg aacagagctt ggacgtttaa ataaccaccg tctgcgtacc 4800 atcggcgagg atgttatggc aacaatagca tcaaacacgt ctgctgttat gttgatgagt 4860 ataaatgtgc aaagcctgaa cgcacactca ttagatatta ccacagaccg tgttctcaat 4920 gccgtggact tgctcgcact gagtgaaaca tggatggaaa atggcacaca gacggtcctg 4980 gctgaatttg actgcatcac ccaagagaaa cgtgacgaaa caagagctgg cggtgtggcc 5040 atttaccaga acacagctgc atccaccgcg gccgtttctc acgccatcga caagctcagc 5100 gctagctacg atgcaatgtt gggtgaagca gacaagtacg gtgatatctg cgccgccgaa 5160 atctcggtta tgggtactcg cacgttactg ttttcgttgt acatttcacc aggtatgact 5220 aaaacggagt ttgtgtttct cattttcata ataaattctt ttcttttaca ggcaccacga 5280 tcaagcaaaa gaaatttttc ttggcgcgta atctggtcct gtacagcaaa acgtccatgc 5340 caatcatcgt aacgggggat ttcaacatcg acgtgttgaa gccggaaaac cttgagttca 5400 tcgatttcat gaggacatat ctgcacctag atttggtctc agatcgatct caagcgacta 5460 ccatgggagg gtcatgtctg gatctgacgt tcacccggaa cattcgcgta cagtgcaaga 5520 gatactgtgc gtatttctca taccaccgac cgatactctc agtgcttgag atttgaacgg 5580 tctgccaatt gagaatctcg gcctgcctcc agttgagctc tgctttattc tccccccgtc 5640 gttgacttcg ttctcactcc gtccatcaat acgctttctt catgatgttt cgtgacccgc 5700 gattcaaggc tcaactatct ggtaaataga cttctaggct aagatgtaca aaatatctaa 5760 ttcttttcgt tttcttttac atcgttcagc gttaacaatg aaacattgtc cagtaaattg 5820 tttccatcga caactattcc tgctgccagt gcatctctga cggtagagtg tcggctttcc 5880 tcccgttttg ctgtgcctgg ttttgaaatc tccaaccttt ctttctttgt caaatccttg 5940 catcgtgaga gaagacgttt cgtaacccgc gattcaaggc tcgataatct ggtaaatgtg 6000 atttgatggt ttgaggtttg cataatatgt aattcgtttc attctcttcc aaatcgttca 6060 gcaccaacag tggaacatcg tccattcgaa ggtttcataa actgcagttg agctctgcct 6120 agtgcacagt ctgtttctgt gacgaagttc cgattgattt tggatatcaa atcatggcca 6180 ccaacctatc gcaacattat gcttgattct tctgttgatg cgctcttcaa aatatcgtgg 6240 atgttttgca acccgtgata aaaggctcaa ccatctggta aatataattt aacgcaaaac 6300 tttacaaaat atataattct tttctcccac tctttcagag tcaacaacga atccgtcctg 6360 tcgagtcatc tgatgggcct cgaccatcct tcggtcacac cgtcaacatc atctttcaca 6420 acaaacggcc tttttcaagg ccaccaacat ttcgataaag agttagttaa acgatatttc 6480 ctattatatg tatcacaaat atcactttaa tcctccacat cgaccatcga gaatgatggt 6540 tttggtaaaa ttgaaaatag gtacctgtgg ctaaatctat atatgcactc aatttaaaga 6600 aaagatcgtc aaatttcggc ttgtgagagt tgtcaatata cacagggttg gacaagtttt 6660 atactcttta taccattggt ttccctaaca tagacatcaa atcaatatat ccagccaatt 6720 aatgaaccaa ctaacgttat ccaacgtcaa cttggcggtc gtatctcgga aacaacctct 6780 tacttttttc tt 6792 // ID BEL-93_AA-LTR repbase; DNA; INV; 549 BP. XX AC supercont1.273; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-93_AA_; KW BEL-93_AA-I; BEL-93_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-549 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.273; Positions 125574 126122. XX SQ Sequence 549 BP; 154 A; 115 C; 131 G; 149 T; 0 other; tgttcgatcg tagcagtatt gcttagaatt acaccgcggc tcccggtata accggtgtat 60 aatgcgctac tttaagatta aaatttagat taaccggcag cactgtaccc gaacaccgcc 120 gatctgaccg gctccatgta tattattccc actagccgac cactgagaga tcagctgtgt 180 cggtccgtag aaagtgaaag ttcaagttcg aggtcagagt tgttttgact agcacgaaga 240 gcaggtctac aagatagcat gtagtgagtt gtgtctctag aataaattgt gctctgtatc 300 tgaagagtgt gctatttttt aataaacgga tgagtgtttt ctgcaaattt gtaactgcca 360 aaagtactta cctgtgtcct atcgccgaaa ccgatgccat tcgaagacct gtggaagaga 420 gaagaatact tacctgtgtg gtgccattcg atcgccaatc ctcgccgccc tgtgggtgaa 480 aaatagaaat tcaacgagaa aacacaacgt aggggatttg ttcgcgggct tttatggtaa 540 gaacgacca 549 // ID L1B_BM repbase; DNA; INV; 939 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Bombyx mori non-LTR retrotransposon - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1B_BM; KW integrase-like protein; pol polyprotein; KW protease and reverse transcriptase-like protein; KW Repetitive element. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Ichimura S., Mita K. and Sugaya K.; RT "A major non-LTR retrotransposon of Bombyx mori, L1Bm."; RL J. Mol. Evol 45(3), 253-264 (1997). XX RN [2] RA Abe H., Ohbayashi F., Shimada T., Sugasaki T., Kawai S. RA and Oshiki T.; RT "A complete full-length non-LTR retrotransposon, BMC1, on the W RT chromosome of the silkworm, Bombyx mori."; RL Genes Genet. Syst 73(6), 353-358 (1998). XX RN [3] RA Abe H., Ohbayashi F., Shimada T., Sugasaki T., Kawai S., Mita K. RA and Oshiki T.; RT "Molecular structure of a novel gypsy-Ty3-like retrotransposon RT (Kabuki) and nested retrotransposable elements on the W RT chromosome of the silkworm Bombyx mori."; RL Mol. Gen. Genet 263(6), 916-924 (2000). XX RN [4] RA Ogura T., Okano K., Tsuchida K., Miyajima N., Tanaka H., RA Takada N., Izumi S., Tomino S. and Maekawa H.; RT "A defective non-LTR retrotransposon is dispersed throughout the RT genome of the silkworm, Bombyx mori."; RL Chromosoma 103(5), 311-323 (1994). XX RN [5] RA Nakajima Y., Hashido K., Tsuchida K., Takada N., Shiino T. RA and Maekawa H.; RT "A novel tripartite structure comprising a mariner-like element RT and two additional retrotransposons found in the Bombyx mori RT genome."; RL J. Mol. Evol 48(5), 577-585 (1999). XX RN [6] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [5] (Consensus) XX SQ Sequence 939 BP; 227 A; 289 C; 220 G; 199 T; 4 other; ctaccaccct gggacagtgg ttccgaaaat ggcgcataga catcaaccca gcgaaaagca 60 cagcggtgct ctttcaaaag gggtcgccct ccgaacacca cgctgagcat ycctcccgcg 120 attaggagca gcaatacccc cgcctcacac cgttcgcccg atcacgctct tcggccaacc 180 cataccgtgg gccaggaagg tcaagtacct gggcgtcacc ctcgatgcaa cgatgacatt 240 ccgcccccat ataaaaacgg tccgcgaccg tgccgccttt attctcggaa gactctaccc 300 catgatatgt aagcggagta aaatgtccct tcggaacaag gtgacactct acaaaacttg 360 catacggccc gtcatgacct atgcaagtgt agtgttcgct cacgcggccc gcacacactt 420 aaacaccttc caaatcattc aatcccgttt ttgcaggata gccgtcggag ccccgtggtt 480 cgtgaggaac gtcgacctac acgacgacct ggacttagaa tccatcagta aatacatgaa 540 gtcagcgtcg gaacgccact tcgataaagc ggcacgacac gagaaccccc tcatcgtggc 600 cgccggtaac tacattcccg atctgcggac agaatagaaa gcagtcgacg tcgccctaaa 660 cacgtcattt cggatccttc agatccacta acttttgcat taggtgcttt taggtacact 720 wsaagcaggc ttagggaccc cggtcaccgt tctcgtcgaa cccgtygctt gcgacgaagg 780 gctcgacgtg caaattaacc ctcagacatc agcccactga gtttctcgcc ggatcttctc 840 agtgggtcgc gtttccgatc cggtggtaga ttcatttgcg aagcaactgc tcttgctagg 900 gttcgtgtta gcaacatcgt caggtttgag ccccgtgag 939 // ID BEL-2_SI-I repbase; DNA; INV; 5220 BP. XX AC AEAQ01001442; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_SI_; KW BEL-2_SI-LTR; BEL-2_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5220 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01001442; Positions 7402 2183. XX CC Positions [4002-4598] - Integrase core CC 'GTATT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1116..2174,2178..3707) FT /product="BEL-2_SI-I_1p" FT /translation="MNCEKFRRKPAKERLEVVTANRLCINCMGRHQVSACS FT SPKACARCAARHNTLLHDAYETAPGAVVAPSPSVHAAHHPHFEPPQMLFAT FT ARVLVTDRFGTIHAVRALVDPESETSLITDSLAQRLRLPQTPVSVATYGVG FT GLRSGVSRGRIAMNLTSRVGNAVFAISALVMPRLSIYGGSVEANSSGWSHI FT RDLDLADPDFSSTDPVELLLGVDVYAKVARPGMRLGADEEPIAQQTALGWI FT IMGPANEHRFRREVALRDAYSQFMAEYLELGHMTPAPSLPADCSRACYLPH FT HGVWKGGGEEARIRVVFNGSSRTSSGISLNGELPAGPNLLPTLCDVIMRWR FT KHQFVLAADIKMYRQIWVHPEDRDLQRIYRRIGNRVQEFHLNTVTYGLASA FT PYLVVRTLRQLVEDEGERFPRAAEALRRDTYVDDVLSGASSQLETCELREQ FT LTQLCLAGGFRLRKWLANHEDLLRDIPAEHRSSLPSTVALTSEEHDVLGIR FT WLPAEDNFAPTVKSMTEELVTKRTVLRQTARLFDQLGWLAPVVINAKLLIQ FT AAWLRRLDWDAPLAADDVECWRRLRTELPVLEEIRIPRWSRADSPGAIIEM FT HGFADASERAYGAVLYLRTIADGRACLTLLIAKTRVAPLRQVSLPRLELCA FT AALLARLATHTAFVLTLDGAAVHLWTNSTVVLGWFQGHPTRWTTFVANRVA FT EVQGAVPDAHWHHVPGRINPADCASREMSPLELRAHELWWRGPQYLWASSA FT AWPAETAVALSRGLPEQRAARVHTACEEEEPEELNRFSSLRCLLWVTACMR FT CWLTLAASRADLTRGAVLSATELADARAMWIRCAQRIAFRDEIKALSRGRI FT ERSRGVHSGC" XX SQ Sequence 5220 BP; 1039 A; 1435 C; 1598 G; 1148 T; 0 other; ttttctggtc cttcgagccg gatcgtgcgt cgcagccggt gcatatctgc ctacattttt 60 ctctcgcgta gtaaacgtgc cagtgctgta gtgtgagtgg tggagcccaa cctccaaatt 120 ctccaaatgg agacgttatt ggagcagcaa tctgagctcc gcggacgcat ctctcgcgtc 180 gtcgcgaatc tcaagaagac cggcgtcgcg aacctgaccg tcgaccacgt ccaagcggcc 240 ttgaccattt tggaaaagcg ataggcaaag ttcgaggaga accacgataa gctcgggctt 300 gctcacggga aggagttaag gaaaaccgag tactatacga acgactatgc tgaagaggtg 360 gagatgctct acaccagcca acgagctgca ttaatggacc tagggcggtc gttgaaggca 420 aagaccgaag gaaatgtttc gacgatggtc agggccgaaa gcacgcagcg agccccgctt 480 ccttggctcg agcttccaac gttctccagc aatttcgagg actggccggt attccgggat 540 ttgttcgact tcatcgtcgg tcaggatccg catttgtccg acgtccaaag cctccactat 600 ctaaagacga gggtcaaggg aggagcggaa cagtttttga ggggcttgcc ctcaacggac 660 gcgaattata aacgcgcatg gtcaactttg aaaggccact atgaaaaaaa aaggttgtta 720 gtcaaggctt acttgtccgc gattacatcg ttgcccaaaa tgaaagcgga ctgcgtagcc 780 gacctgcgac ggatctatca cggcatggta gcatccaccc aagctttgaa aggtatcggc 840 gagcccattt ccaataccac tcacttgctc gtgcatctca tggtggagtt gctcgacgcg 900 ggtacaagac gcgagtggga aaattccatc gcaaaaaaag cgacccgccg tcgtttgacg 960 agttcaagga cttcctcgag gaacagctcg tgacgcacga gtcgctacgg tctgtcaagg 1020 gggagtcctc ctcgggaaga tccgctcatc acaccgggaa gcagagcaga agctccgagc 1080 gcagctgcgt ggtgtgcaag cagaatcact ttattatgaa ctgcgagaag tttcgtcgga 1140 aacctgcaaa ggagcggttg gaggtagtga cagccaatcg cctgtgcatc aactgcatgg 1200 gacgacatca ggtgagcgcg tgctcttcgc cgaaggcgtg tgctaggtgc gccgcgcgcc 1260 acaacacact cctgcatgac gcgtacgaga cggctccggg agccgttgtt gcaccttcgc 1320 cgtcggtgca tgccgcgcat caccctcact tcgaaccccc tcagatgctt ttcgccaccg 1380 cgcgcgttct ggtgacggat cgattcggaa ctatacacgc ggttcgagcc ctcgtagatc 1440 cagaatcaga gacttcgttg atcacagact cgctggccca acgcctgcga ctgccgcaga 1500 caccggtgtc cgtggcaact tacggagtgg gcgggttgag gtcgggtgtg tcgcggggac 1560 gaatcgcgat gaatctgacc tcgcgcgttg gaaacgcggt atttgccatc tccgccttgg 1620 tgatgccaag actctcgatt tacggtggtt ccgtggaagc taactccagc ggatggtcgc 1680 acattcgcga ccttgatctc gctgaccctg atttcagctc tacggacccg gtcgagttgc 1740 tactcggtgt cgacgtctac gccaaggtgg cacgaccagg catgcggctg ggagctgatg 1800 aggaaccaat tgcgcagcag accgctctgg gatggatcat catgggtcca gcgaacgagc 1860 accgctttcg gagggaagtc gctcttcgtg acgcgtactc acaattcatg gctgagtacc 1920 tcgagcttgg ccacatgacg ccggcgcctt ccttaccggc ggattgttca cgcgcctgtt 1980 acctgcccca tcacggagtg tggaagggcg gcggcgagga ggctaggatc cgcgtcgtgt 2040 tcaacggttc ttcacggaca tcctctggta tctccttaaa tggcgaactt cctgcagggc 2100 ccaatttatt gccgacgctc tgcgacgtga taatgcgctg gcgtaaacac cagtttgttc 2160 tcgctgcgga catctaaaag atgtaccggc agatttgggt gcatccggag gaccgtgacc 2220 ttcaacggat ttacaggagg ataggcaacc gagtccagga gttccacctt aacacggtta 2280 cttacggcct ggccagtgca ccctacttgg ttgttcgaac gttgcgtcag cttgtcgagg 2340 acgagggtga acgctttcct cgagccgctg aggccctgcg gcgggacacg tatgtggacg 2400 atgtgctgtc cggggcttcc tctcagctcg agacctgcga gctgcgcgag cagctgactc 2460 agttgtgctt ggcgggcggc ttccgcttgc gcaagtggct cgccaatcat gaagacctgc 2520 tgcgcgacat ccccgctgag caccggtcgt cgctgccgtc gacggtcgcg ttgacgtcgg 2580 aggagcacga cgtcctgggg atacgttggc ttcccgctga ggacaacttt gctccaactg 2640 tcaagagcat gacagaggag ctggtcacca agcgcacggt gttgaggcaa accgcgcgtc 2700 tcttcgacca gctgggctgg ctcgccccgg tcgtaataaa tgctaaactt ctcattcagg 2760 cggcgtggct acggcgactt gactgggatg ccccccttgc tgcagacgac gtggagtgtt 2820 ggcgacggtt gcggacggag ctgcccgtcc tggaggagat ccgcattccg cgctggtcac 2880 gtgcggattc tccgggtgcg attattgaga tgcacggttt cgccgatgcg tcggaacgcg 2940 cctacggagc agttttatat ctccgcacaa tcgcggacgg cagggcgtgt ctgacgctgc 3000 tgatcgccaa gacgcgagtc gcgccgctca gacaggtctc ccttccgcga ctggagcttt 3060 gcgcagctgc tctgttggca cggctcgcga ctcacaccgc tttcgtccta accctagatg 3120 gagcggcggt tcacctttgg acgaactcca cagtcgtttt gggatggttt caaggacatc 3180 ccacgcgatg gacgaccttc gtggcaaaca gggtcgccga ggttcagggt gcagtgccgg 3240 acgcgcactg gcatcacgta ccggggcgga tcaatccggc cgattgcgca tcacgagaaa 3300 tgtcacccct ggagttgcga gcgcacgagc tgtggtggcg gggacctcag tacctgtggg 3360 cgagctctgc tgcttggccc gccgagacag cggtggcctt gagtcgtgga ctgcctgagc 3420 agcgagctgc tcgggtccac accgcttgcg aggaggaaga accagaggag ctgaatcgct 3480 tctcctcgtt gcgttgtttg ctttgggtga ccgcctgtat gcgctgctgg ctgacgttgg 3540 cagcttcgcg tgccgacttg acgcggggag cagtgctctc ggctactgag ctggctgacg 3600 ctcgggccat gtggatccga tgcgcgcaga ggattgcctt ccgagacgag attaaggctc 3660 tgtcgagggg gcgtatcgag cgctctcggg gtgtccactc cggttgctga gtccgtttct 3720 caacgatagg ggactgctgc gcgtgggtgg tcgactgaag cacgcgctat tgaatgcgaa 3780 ccagcgtcat cccataatac ttccgcaggg ttcgcatctc acctatctgg tggtcgcgga 3840 cgagcacggt cgtgttcttc atggaggcac ccagttaact ctcgcatcgt tacgccggag 3900 gtactgggta ctgcaaggtc ggcaattggt gaagcactac atttattgtt gtattccatg 3960 cttgcgatgg cgggcggcga atccccaacc acggatgggc agcctcccac gagccagagt 4020 cgttccgaat tgaccgttca ctcatacggg cttggactac gctggaccga tcctcctgcg 4080 caccacgaag ggtcgaggtc acagggccta caaggccatt gtggcagtct ttgtttgttt 4140 cagttcgagc cgtccacctc gaagtcgtta gcgactacac agctgacgcc tttcttgcag 4200 cgttccagcg cttcgtgtcc cgcagaggat tgtgcttgca cctgtacggt gattgcggga 4260 ctaacttcgt gggcgcggat cgacagctgc gcggtctttt gggaatggcg agcgagagta 4320 gcaggagtat cgtaggaaag ctagcggagg agggtgtgga gtggcacttc aaccctccag 4380 ctgcccctca ctttgggggt ctctgggaag ctgcggtcaa ggccttcaaa caccatctgc 4440 gccgagtcat cggcgagtca acactgactt acgaggagat ggctacattt ttggccgaag 4500 tggagacgtg tctcaattct cgaccgttgc aggcgctttc agacgacgcc gacgaattgg 4560 atgtgttgac accgtgccat tttctcgttg gcgctccgtt gaaggctatc cccgaaccgg 4620 tattgacggg gattcccccc agtaagttga cacgctggaa gctactccag caaatgaggg 4680 atcatctttg gcagcgatgg tcgcaggaat acctccaggc gctgacccct cgaccaaggt 4740 ggtggcgtgc agagggtggc ctgaaggagg gtcagcttta cctgctgaag caggaaacaa 4800 cgccaccgac gcggtggccc ttggtgcatg tcacgcgtct ccacccgggt gatgatggcg 4860 aggtgcgggt cgtgttcgtt cggggagcta caggggagct tcggcgacca gttatcaggc 4920 tcgtccctct tccaacggac gaggagagta tgcagcgcca tgcttagctg agctcactca 4980 gaggtcgcat tcggagaagg ctcgttgtat tgtgcgctcc ggtttaaggt ctctcctctg 5040 ttcgcaccgc gccagttgag cgagttgtgc agctcgttgc ttataataat agtacatagt 5100 tctatagtag cgttagttat aataagttgt gttcttaggt tgtgaagaac taggccagca 5160 ttgttgcgct gtttcgggcg gagtaaaatc gccgaatttt tatcggcgag gcgggcggga 5220 // ID P-1N1_TV repbase; DNA; INV; 923 BP. XX AC . XX DT 26-OCT-2009 (Rel. 14.1, Created) DT 26-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Nonautonomous P DNA transposon from Trichomonas vaginalis - a DE consensus. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-1_TV; KW P-1N1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-923 RA Kapitonov V.V. and Jurka J.; RT "First examples of protozoan P DNA transposons."; RL Repbase Reports 9(10), 2161-2161 (2009). XX DR [1] (Consensus) XX CC This is a young family of nonautonomous P DNA transposons CC identified in the Trichomonas vaginalis genome. The consensus was CC derived from multiple alignment of ~300 copies of P-1N1_TV, which CC are less than ~1% divergent from each other. P-1N1_TV is a CC nonautonomous DNA transposon transposed by the autonomous P-1_TV. CC The 923-bp P-1N1_TV is almost 100% identical to a deletion CC product of P-1_TV (after deletion of its internal region at pos CC 289-8380). TIRs are 20-bp long (4 mismatches). The genome CC contains over 300 copies of P-1N1_TV, which are 0.4% divergent CC from the consensus. Some copies are 100% identical to each other. CC Therefore, it is likely that P-1_TV and P-1N1_TV are still active CC transposons. XX SQ Sequence 923 BP; 338 A; 124 C; 121 G; 340 T; 0 other; aaaggtaact aaactccttt cgccgatccg atttgctcca tatttaaaat atagcttgca 60 tatgtagaag aaaagctaat tttgaagtac gcccgaaatc caccaacagg atttcgagct 120 actaaaaagg tacttttaaa tgtgattttt tttataaaaa attaaattta ttttaagatc 180 agactaaaaa ctaattttat ggaaaatcta tatcgacaaa tttattactt attgggtata 240 tagatttttt gattatggcg attatgtctg atatgcttcc aataagccaa aacacatttc 300 atactgattt ctataaaagg tatctttatt atagaagtgt aagctatatc tgtttgtttt 360 tttattataa caaattcaaa attctcttat aaggactaga tgaaatctgt attattaaat 420 cattagctaa atgaaggttc aaatagaaag taaatatcaa tttatatgat actgaaacca 480 aatagttctt ttaataaaag ataacacgct gaaaattatt gagtatttgc aaatctcaaa 540 aatctattgg tgaatttggc tcaactttta aaattataac atccatatca aatcataaaa 600 gcagacaaaa attaaagcca aacctatcgg ccaatttttt tttgatattt tgatttttga 660 ggtatacttt taaaagtgga tttaaagcac ttatgaaaaa acattaactt ctgtcgttat 720 gattattgat ttagacaata atgattttaa tctttgatta tcaaaacact aaagtcatct 780 ttttaacatt tgcttggtag cattaactca agtctaaagt actattttat gtggcactct 840 tagaattttt gtattcattg tacacgctga aatagtacca ttaaagttat ggcggacttt 900 caaaaaatag tttggtcacc ttt 923 // ID BEL-26_AA-LTR repbase; DNA; INV; 208 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-26_AA_; KW BEL-26_AA-I; BEL-26_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-208 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 1555319 1555526. XX SQ Sequence 208 BP; 64 A; 40 C; 32 G; 72 T; 0 other; tgagtaattt gttttaactt tcaatttctt ttcatttgat gtgttgcgac acaatgataa 60 tttttgaatt caacccaaca ctgtactaac gaaaaattca gaaataaacc aaaccttgta 120 tcgaacggaa gtatttttac gtttttcgat agtgtgccaa accatccgat tttcctgccg 180 aaatctaagc ctgagctatc gttggaca 208 // ID CR1-18_CQ repbase; DNA; INV; 3242 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-18_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3242 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 22-22 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 2..3001 FT /product="CR1-18_CQ_1p" FT /note="reverse transcriptase." FT /translation="ADGPXRTSTPIPGSTTTLSNVPPVSAAPTHAEQSGVT FT PVVDERLLLYYQNVAGLNTKISDYALAISDAIYDLYAFTETWLNEDTLSHQ FT IFGPGFHVIRADRSAANSVKTSGGGVLLAVRSNLRPRQLSPPHCSALEQIW FT VALPLASTTMFICVXYIPPTHTNNSEIISQHCDSLSWISSQMKPNDSILII FT GDFNLPCLRWTLNPAGFFIPDAKHSTINSTVSQLLDDYSIANLGQLCGIQN FT NYGNVLDLCFASIGMSTSCNLTQAPSPLLRACRFHQPLLVSIECRVHAFRD FT TSSQFFYDYRRGNFQGMNEFLKNVDWNHLLADRDANSAAVAFTEVVLRAIN FT IFIPKKLHLPACHPAWSNDQLKRLKTQKRAALKKFNKHPTRRWKVQYNSIN FT RKYSRLNNTLFLRYQRXIQHRLKQNPKQFWNHVNSQRKETGLPSVMELDSR FT EASSPIAICELFRSQFSSVFTNEDLDDTTVQQAASNVPDRDTIAQHPLVDP FT DTVSRTCSALKYSTSSGPDGIPAAVLKSCSDSLALPLSKLFNISLKSGVFP FT LSWKKSYIFPVHKKGPKRNVRNYRGIAALCAVSKLFELIVLDHIKLNCNNY FT IAQEQHGFMPKRSTCSNLVAYTSFISQSMQKRQQVDAIYTDLSAAFDKINH FT RIAFAKLERLGFCGSFLKWLCSYLSDREMCVKIGDVLSAVFAVFSGVPQGS FT HLGPIIYLLYMNDVHLLLKCHKLSYADDIKLFAVINSSNDALLLQDQLNIF FT AHWCDDNRMVLNASKCSVITFTRKRSTISFDYNLSNTSLSRSSSIKDLGVV FT LDCQLSFIEHISYTVSKASKVLGFVFRVAKHFRQVSCLKALYCSLVRSTLE FT YCSVVWAPFYQNGIERIEKVQRKFTRYALRHIPLVDPLNPPSYADRCNSLG FT LDLLSVRRDVAKAIFVSDLLKSSIDCPEILEQVNFYIQRRTIRSHQFIRIP FT RASTNYGRNAAVSSMCRVFNDCYVHFDFHLSRNVLRKLFLNHFKT" XX SQ Sequence 3242 BP; 825 A; 849 C; 643 G; 922 T; 3 other; ggccgacggc cccwttcgca cctcaacccc aattcctggc tcaaccacaa cactatccaa 60 cgttccccca gtatcagctg ctcccaccca tgcagaacag tccggggtta caccagtagt 120 agacgaacgt cttttgttgt actaccagaa tgtcgcaggt ttaaacacga agatctcaga 180 ttatgctctt gccatctccg atgccatcta cgatctgtac gccttcacag agacgtggct 240 aaatgaggat actctctcgc atcaaatatt cggtcctggg ttccacgtaa ttcgtgccga 300 tcgttccgca gcgaacagtg taaaaacttc cggtggtggt gttctgctag ctgttcgttc 360 gaatctgcgc cctcgtcagc tgtctccgcc acactgctca gctcttgagc aaatctgggt 420 tgcgctaccg cttgcctcca caactatgtt catttgtgtt ttwtacatcc cacccacgca 480 cacgaacaac tctgaaatca ttagccaaca ctgtgattcc ctttcttgga tatcatctca 540 gatgaaacca aacgatagca tcttgataat aggcgatttc aacctcccct gcctaagatg 600 gactcttaat ccggcagggt tcttcatacc ggacgccaaa cactctacca tcaacagtac 660 tgtctcgcaa ctgctcgacg actacagcat agcaaacttg ggtcaacttt gcgggataca 720 aaacaactat ggcaatgtcc tcgatttatg tttcgctagc attggaatgt ccaccagttg 780 caatctcacc caagctccat caccattact tcgtgcttgt cgattccacc aacccctact 840 cgtttcgatc gagtgcagag tccacgcttt tcgcgacacc agcagccaat ttttctacga 900 ttatcgtaga ggaaacttcc aaggcatgaa cgaatttctc aaaaatgtcg attggaacca 960 ccttctagct gaccgtgatg ctaattcagc tgctgttgct ttcactgagg ttgtgttacg 1020 agctatcaac atcttcatcc ccaagaagtt acacctgcca gcgtgtcatc cggcgtggtc 1080 caacgatcaa ctgaaacgcc tgaaaactca gaaacgggca gccctcaaga agttcaacaa 1140 gcacccaacg cgtcgctgga aggtccaata caactctatc aaccgaaagt acagccgctt 1200 gaacaacact ttgttcttga gataccagcg cmgcatccag catcgcctaa agcagaaccc 1260 gaagcagttt tggaaccacg ttaacagcca gaggaaggaa actggcctac caagcgtcat 1320 ggaacttgac agcagagaag catcttcacc aattgccatc tgcgaactgt ttcgctctca 1380 atttagcagc gtattcacta atgaggattt ggatgataca acggtacaac aagctgcatc 1440 aaacgttcct gatcgcgata ctattgccca gcacccactc gttgatcccg acactgttag 1500 ccgaacttgc agcgctttga agtactccac cagcagcggt ccggatggca tccccgcagc 1560 tgttttgaag agctgctctg acagtcttgc tctccctctt tcgaagctgt tcaacatttc 1620 cctcaaatct ggcgtgttcc cactgtcctg gaaaaaatcc tacatatttc cggtacataa 1680 gaaaggacct aagcgaaatg ttcgaaatta tcgaggtatt gctgctctgt gtgccgtcag 1740 caagttgttt gagttgattg ttcttgacca cattaaactt aactgcaaca actacatcgc 1800 tcaagagcaa catggtttca tgccaaaacg ctcgacgtgc tccaaccttg ttgcctacac 1860 ttcctttatt tcccaatcta tgcaaaaacg ccagcaagta gatgccattt acacagatct 1920 ctccgccgct tttgacaaga taaatcatcg gattgctttt gctaaattgg aaaggttagg 1980 cttttgtgga tcgtttctta agtggctgtg ctcgtacctc tctgaccgtg aaatgtgtgt 2040 caaaattggt gatgtcttgt ccgcagtttt tgctgttttc tccggtgtac ctcaaggaag 2100 tcatctcggt ccaatcatct acctgttgta catgaacgat gtccacctgt tacttaagtg 2160 tcacaaactg tcgtatgctg atgatatcaa gctctttgct gtcataaact ctagcaacga 2220 tgcgctcttg ctgcaagacc aacttaacat cttcgcccac tggtgtgacg acaacagaat 2280 ggtccttaac gcctcgaaat gctctgtgat tacctttacc cgcaagcgat cgacaatatc 2340 ctttgactat aatctatcaa atactagtct tagtcgttcg tcatctatca aagaccttgg 2400 tgttgtgtta gactgtcaat tgtcctttat tgaacatatt tcctacaccg tctctaaggc 2460 atccaaagtt cttggttttg tttttcgtgt ggctaaacat tttcgtcaag tttcgtgttt 2520 gaaggcactt tattgctcac tggttcgctc gacactagaa tattgttcgg tggtctgggc 2580 tccgttctac caaaacggta tcgagcgaat tgaaaaggtt caacggaagt tcactaggta 2640 tgcgcttcgc cacatccccc tagtggaccc actaaatcca cctagctacg cagatcgctg 2700 caattccctt ggtcttgacc ttctatctgt tcgtcgtgac gtagctaaag caatcttcgt 2760 gtctgatttg ttgaagtctt caattgactg tcctgagatt cttgagcaag tcaacttcta 2820 cattcaacgc cgtacgatca gatctcacca attcattaga attccacgcg catcgaccaa 2880 ctacggtcgc aacgcggccg tgtcgagcat gtgtagagtc tttaacgatt gctacgtgca 2940 ctttgacttc catttgtctc gcaacgtact ccggaaactt ttcctaaacc attttaaaac 3000 ttgacccatc gctaccagaa gctttctact tgtgatatcc aactgtgatc ttagtttagg 3060 ttaggattag ggaaagtttt atttgtgttc tatgtattag tttaagaaat cttgtatcat 3120 tggagtttgt aaacttgttg atgcgttaag atgaggtggt ttttatgcct tcttgagaat 3180 gtgtctgaca cagctcaagg gggctttttg cccaccaaaa taaaaaatga aatgaaatga 3240 aa 3242 // ID hATw-3_BF repbase; DNA; INV; 5108 BP. XX AC ABEP01054281.1; XX DT 12-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Branchiostoma floridae. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5108 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Branchiostoma floridae."; RL Repbase Reports 9(2), 516-516 (2009). XX DR EMBL/GenBank/DDBJ; ABEP01054281.1; Positions 36118 41225. XX FH Key Location/Qualifiers FT CDS 883..3909 FT /product="hATw-3_BF_1p" FT /translation="MLFIPGSFFFFLVETEETYPGPISRKLGLSLSRLEGG FT FLSELGNGFSNELVLELFSFIRKSGLREETILKWLQNIATSELKEKLLHVQ FT EASVRIFIGRLQKKRRELVTKKRTKELQILMTKKFEGPKPLQERCANLCSG FT FGPSPAVAQPATQSLHQLINFSRDVVKQNNVLEYTVESLSEDYIKCLSEMN FT EKTTEIQELKQSIKDQQRMSRVCREESVKQNSLLLRLEKDSEKTYSRLQDA FT LVKLSEYNTRNVNKRLKRRDEKIKAQENKMKMQESTLNEKDSLIREKDEEI FT QDLKSMVEELQEKLDKEVKNKLHAQQIKSHYKSSLLRSHQLCDEDSVQLEE FT KLACLEAENKELHENMKQFLDSDEVKTFEDGRFTDEVRQVYMMLLSMDTGL FT NNVEWIIRTVLQKLGGLRCGRLPKYSTAQTMFAEARRLSQVHVAEVVSKSN FT NITVGTDGTTKKHEKYGSVSFFTQDGQFVAGVTEHQVDMFRKVIHELATAC FT GTSEEEMWLSVKNTMSDRHIVNKCLNKQLEEIREEALKIVVKDWDDKSLED FT KEKMKKMWNHFCSMHYIVGLATSAEVGLKKFENASLTSSEGATGMENSCPS FT QAESGTHRVVRATCKAFSHTGACEKSGHPKEFEAYLQTYTPAKDNKLISFR FT GERFNILFKNGGATYHHKNDLLAYLDTCEAPNRLLQAIRSDLSVPVYVAGC FT CALGIINKIVTAPLWRLIESERSILDMCQHFHQLHISFKSWEKDPGTLMDG FT EAIFPSVQREEEDVYKSLFSHDDAEVKRLTCEALKSIMAEFIVVTERMLKD FT YLPGGVFYTPTQDQREEMSSCPTNNTGVERTFAHLDRDVRLAPNATTLTRE FT GKIMFRLNKTGQYLDTIPMEEKCTVFKEARKTARKDRKLHQQERVQLKQHR FT QELLREKTRKQTQKKAAKEAALETLKSSVRQLGLWETAEQIDSGLSKLLTK FT KARLQALKDQIKFRKEVLGVGDQHPKSLFQYSTGGKQFQETDLKKNLLMII FT RN*" XX SQ Sequence 5108 BP; 1669 A; 913 C; 1184 G; 1342 T; 0 other; taggcatgtc aaacctaccc cgttcccgca ttagaatgtg gcgagacgcg gggccgtaaa 60 tgttttttaa aaaatcttaa aaatcaaccg cttgcacgat gacaataaaa aatcgatgtc 120 aataatctgt gggatgttaa ctatcataca attgtatttt gagtcgaaaa taccggtcat 180 ccaaggagct aaaaatcatc aaagttggag atacagccga aaaagcgccc cgttttgtcg 240 ccgcgcgcgc ggcacgctat ttcgaaggcc gcccggcgga aatgcgattt tcaactcaaa 300 agccacaagc cgccttgcct cggagaactc tgggaatgat tccaggtgaa agggcagaga 360 tcaagggtca ctccagtggt tgtacaaatt tgctacgtca ttctgagcgg gagatatgag 420 ccagaattcg gagacctccg aaagtaagtg gatatttttg ccatttttct tgaatttgga 480 agttttgttc acacacagta acaaatataa atatattgcg atcactaagg tgtagttttt 540 gtttctgttt gtgcgagttt ggcttgtgga tttcgcgatg cgttttgaca gttttttggc 600 aggccaaaaa atggcgtgcg ttgtgcgcac ggccgtggag ggggaagggg gcgtgagacg 660 cgcactgtgt gcgctgtaag tcaacatgta accacaattc gcgctaatta accggaatgg 720 catatattta ccctttagaa gaaactttca tatccgagga tttagcatgt ttactaataa 780 ataggccccc aaagaattat cactatcgta aattatatgt tgcgtacaga cacttttata 840 cagagtttat agccatgcca tcgggatttg tcacacactt gcatgttatt cattccggga 900 tcattttttt tctttttagt ggaaacagag gagacatacc cagggccaat aagcagaaag 960 ttggggctgt ctctgagccg gcttgaagga gggttcctga gtgagttggg taatggattt 1020 tccaatgaac ttgtgctaga actgttttct tttatacgga aatcggggct tagggaggag 1080 acgatactga aatggcttca aaacattgca acctccgagt tgaaagagaa gcttttacat 1140 gtacaggaag cgagcgtacg aatcttcatt ggtagactac agaaaaagag gcgtgagctc 1200 gtgacgaaga aacgaaccaa ggagctacag atattgatga cgaagaagtt tgagggccca 1260 aagccgctgc aagaaagatg tgcaaatctt tgcagtggct ttgggcccag tccggctgtt 1320 gcacaaccag ctacccagtc tttgcaccag ttgatcaact tttcccgtga cgtggtgaag 1380 cagaacaatg tgctggaata tacagtagag tcactgtcag aagattacat caaatgttta 1440 tcagagatga atgaaaaaac aacagagata caagagttga aacaaagcat taaggaccaa 1500 caaaggatga gtagagtctg tagagaggaa agcgtgaaac agaattctct gttgcttcga 1560 ttggaaaaag actcagaaaa gacatacagc cgtttacagg acgctttggt gaaactaagc 1620 gagtacaata ctcggaatgt taataaacga ctgaaacgaa gagacgagaa aataaaagca 1680 caggagaaca agatgaagat gcaggaatca accttaaacg agaaagactc cttgatcaga 1740 gaaaaggatg aagaaataca ggacttaaaa tcaatggttg aagagctgca agagaaacta 1800 gacaaggagg taaaaaacaa gcttcatgca cagcagataa agtctcacta caagtcaagt 1860 ttgctgaggt ctcatcagtt gtgtgatgaa gattctgtac agttagaaga gaagttagca 1920 tgtcttgaag cggagaacaa ggagttacat gagaacatga aacaattcct tgattctgat 1980 gaagtaaaga cgtttgaaga tggtaggttc acagatgagg ttcgacaggt ttacatgatg 2040 ttactgagta tggacacggg gctaaacaat gtggagtgga tcataagaac tgtactgcag 2100 aagttgggag gtctcagatg tggcaggcta ccaaaatact ccacagcaca aacaatgttt 2160 gctgaagcca gacgtttaag tcaggtccat gtagctgaag ttgtgagcaa gtcgaataac 2220 atcactgtgg gtacggacgg taccacaaag aaacacgaga agtacggatc agtcagcttc 2280 ttcactcagg acggacagtt tgtggctgga gttacagagc accaagtgga catgtttagg 2340 aaggtcatac atgaactagc aacagcttgt ggaacgagtg aggaggaaat gtggctgtct 2400 gtgaagaaca cgatgagcga caggcacatt gtgaacaagt gcctgaataa acaactagaa 2460 gaaatcagag aagaagcctt gaaaatagtt gttaaagatt gggatgataa gtctttagaa 2520 gacaaagaga aaatgaagaa aatgtggaat catttctgca gtatgcatta tatagttggt 2580 ttggctacat ctgcagaagt tggattgaag aagtttgaga atgcatctct cacttcctca 2640 gaaggtgcca cagggatgga gaactcctgt ccgtcacaag cagaatcagg aacacacagg 2700 gtagttagag ccacctgcaa ggcattctcc catacaggag cctgtgagaa gtcaggacac 2760 cccaaagaat ttgaggcata cctgcaaacc tatacacctg ctaaggacaa caaactgatc 2820 tccttcagag gcgagaggtt taacattctt tttaagaatg gtggggctac gtaccaccat 2880 aagaacgact tgttagcata cctcgacacc tgtgaggcac ccaacagact cctccaggct 2940 attcgatcag acttgagtgt tcctgtgtat gtagcaggtt gctgtgccct tggcataatc 3000 aataagatag tgacagcccc actctggaga ttgattgaat ctgagaggag tatactggac 3060 atgtgtcaac atttccacca gttacacata agcttcaaat cctgggagaa ggatccaggc 3120 acactgatgg atggggaagc aatattccca tcagtgcaga gagaggagga ggatgtatac 3180 aagtcactgt tctcacatga tgatgctgaa gtcaaaagac tcacatgtga ggcactgaaa 3240 agtatcatgg cagagttcat tgtggtgact gagagaatgt tgaaggacta tcttcctgga 3300 ggagtcttct acacccctac acaagaccag agggaagaaa tgtcctcatg tcccactaac 3360 aacactggcg tagagcgcac gtttgcacat ctggacaggg acgttagact ggcaccaaat 3420 gccactactt tgacaaggga ggggaagata atgtttcggt taaacaaaac aggacagtac 3480 cttgacacca tccccatgga agagaaatgt actgtattca aggaagctag aaagactgca 3540 cggaaggaca ggaagcttca tcagcaggag cgtgtgcagt tgaagcagca tcgtcaggag 3600 ctgttgcgtg aaaagacacg aaaacaaaca cagaaaaagg ctgccaagga agcagcactt 3660 gagactctga aaagtagtgt aagacagctt gggttatggg agacagcaga acagatagat 3720 tctggtttgt caaaactact caccaaaaaa gctagactgc aggcactaaa agatcaaatc 3780 aagttcagaa aagaggtgct tggagttggt gaccaacatc caaagtcatt gttccagtat 3840 tcaacggggg gaaagcagtt ccaggaaact gatctaaaaa agaatctgct gatgataatc 3900 agaaactaat ttgaatgtgg aatcttttaa gaatgtgtaa catatgtatt gatacttgat 3960 attaacacca ttctattatt atgattatga tgttaattta gaatttgatt gttagaactt 4020 aaaagtaaca tgcaatctac attgattcat cctgtgtgat tagcacaata ctatcataat 4080 tatgttaatt ggtgtatctc taaagcatgt ggttattcaa agtaaaagga aatctacatt 4140 gttgcatcct gatttcgtaa tcccgcttat gtattgatac taattagcac caaattagta 4200 ttatcattga cgctaattaa gtatatctgt taattaagcc tttgctaatt aatacttgtt 4260 taaagccaca tgcaaatgca atttacattg atagatctag cttctgtaag ttcagttaac 4320 agccaacatg tgtgcctttc attaacagta ctggtgcata gatttgacca tgactcatta 4380 gcatattgct cattagcata ttttcttaat tagtcattga ttttcctgtt gtaatgcgtt 4440 tctgtacttg agccaacatg caagactagt tgtatttcag gaaatgtgta attcaaaaag 4500 ggataagtag tcccaataat tgaatgtggt tgaggataat taacctcatt tgcatatttt 4560 gcaattaatg aaagctataa aaatacttac ttttctttat gattaatctt tcagggatgt 4620 ataaattcta aggtcttcat taaacctgta gcatgttccg agaagtttag agtaattgaa 4680 tgccaatatt gggggtgaat agattcgata gggttaatta gggttaatta gggttaaatt 4740 agtaaagata atgatggtgt gtcaagaaaa caatatgata acctgtccaa ttatattatc 4800 tttaaatgct gaaaatttga acatgattgg tcaaaccatt aagaaattac attaaaaagt 4860 gtgtcaaaaa ctttgggctt gtggtcattt ccggtgccat gtcgaggaca accgttccca 4920 tagggctaat agataagcgt cttctaataa aaacgtccaa cttttcaact ttaaaaaatt 4980 ctttaaaatt tcaactttgt ccgattaaga caggaaaaaa aacattgtta tcggaagaat 5040 ctgatctgtc ctttgacagc aatttcagta acctttatcc tacctaaaat tttggtgtga 5100 catgccta 5108 // ID BEL-38_CQ-LTR repbase; DNA; INV; 772 BP. XX AC AAWU01044769; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-38_CQ_; KW BEL-38_CQ-I; BEL-38_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-772 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 230-230 (2011). XX DR Genome; AAWU01044769; Positions 15212 15983. XX SQ Sequence 772 BP; 258 A; 162 C; 162 G; 190 T; 0 other; tgttggcgcc aacgccaaag tagatttaat ttgcccaaga aatgatttga atctcagaag 60 aaatagtttc caaccgatat gattttttaa cagtgtcagg taaagtaaat ttgcaataat 120 aatccatcca tttccacagt gtcagttaat ccttctattg tttaagcaca taacacaaca 180 caaatgtacc cgcacataca cgccacacat tgaagaagaa aggaaacagc agcatgtaaa 240 taaccgcaga ccaactcacc attcgtgtcc tttccctttc acacatattt cattcatgta 300 tctcacagca cacaataggc tcaaggacgc gcagcgcgcg tctgagcagg cagcttgcgc 360 aaacagacca ataggagagc agggaatagc aatgaaataa atgaatgaat gaaaagaaga 420 agtggcgtgt acatggttta gggaaaaagg attgcgcacc cgataggaga ttaaataata 480 agaatgtttt tgcaaagaag attcgggaag aaatgccgaa gttttagttt aaaaacccaa 540 tgattttgtt agtccgtaag tcagtttaag cttgtaaata tagaagttaa catcgtgaaa 600 caagctgttt tcttgcatcc tggccaaagg agtccacttg aagaaagtcg tgattacagt 660 ccactcgccg attgtcgcct tgaagctgtg ttgtgccacg gaattccctg caacacagtg 720 gaaaggttgg acaatcaaaa gccctcctgg agccgggcct agcgctccta ca 772 // ID DNA-TA-7_CQ repbase; DNA; INV; 953 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-953 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 57-57 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. 10 bp TIRs. TA TSDs. XX SQ Sequence 953 BP; 328 A; 200 C; 168 G; 257 T; 0 other; ggggaaatat accctttcta atcaaacacc tatcttcgtc atatggagag tttgatgctc 60 gattaaagct ccaaaaatac tatttaggct ataaacttac cagcaacagc accgcctcga 120 gtaagcacgc aaatttatgc catttactgg ccaaaaagat caaatttaat gcacttttga 180 tcataatttc tattttgcga gaccatttct catcattttg gtggcacaca catccacacg 240 caagatcaga ggttaactgc ttaacgaaag tcgccatcaa tatcttttca cttgcgctaa 300 cgatttcaaa gcgcttaaga tataaaattg cttccaacta ccggcaacat gttcttttga 360 ccattactta gtcactcact tgaaaaataa tcccgaaaca tgtaaataat tagcaagcgc 420 catcaaaaaa acaaacgcgc taagtctttt gacgtttcaa ttttgactac ccattccaat 480 cgaaggctga agaaaaccga gcgagaagcg aaggcaaata aagaacaaag ggtggccacc 540 aacaccacca acatgctggt gcagcgcctg ttggagtgag caagtgctgc caggaaactt 600 tcgctataaa tgctactttt cgtgagtgtt cgcttaaaat ttcatcaatt ttggcagcac 660 taagcagacc caaaagagcg ctggaagaaa aaaagaacta agctgaaaat agaaaaatgg 720 gcgtggccca atgcactcga ctgattagaa tgcatttttg aaatattttt tttaaatgct 780 tgtaaaagga atagcataac ttttaataaa agtgaaaata aacagaaatg ttcacaaaaa 840 gttgctctac tcgttggtgt acttggtttt agtgaatttc aaggaacaat ccagattatc 900 cagcatcatt caaacacgac cgcttattag gcttaaaatg ggcatatttc ccc 953 // ID SATREP_MC repbase; DNA; INV; 169 BP. XX AC U96680; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Meloidogyne chitwoodi repetitive unit pMcCo122. XX KW SAT; Satellite; Simple Repeat; SATREP_MC; KW complex satellite repeat. XX OS Meloidogyne chitwoodi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RA Castagnone-Sereno P., Leroy H., Semblat P.J., Leroy F., Abad P. RA and Zijlstra C.; RT "Unusual and strongly structured sequence variation in a complex RT satellite DNA family from the nematode Meloidogyne chitwoodi."; RL J. Mol. Evol 46(2), 225-233 (1998). XX DR Genbank; U96680; Positions 1 169. XX SQ Sequence 169 BP; 75 A; 14 C; 23 G; 57 T; 0 other; gaatataaat aagtaagatt aaactctttt gaacaaaata cagacgagag taatggactt 60 atgaaattgt aggtctgtta ttaaaaaaaa aattttggaa ttttcaaaaa gatctaaaag 120 tttttaaaaa aattattact ttatgattca tatatcattc gaaagagct 169 // ID Gypsy-140_AA-LTR repbase; DNA; INV; 1837 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-140_AA_; KW Gypsy-140_AA-I; Gypsy-140_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1837 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1014-1014 (2011). XX DR [2] (Consensus) XX SQ Sequence 1837 BP; 480 A; 396 C; 418 G; 543 T; 0 other; tgtaacgttt tacgtttttt ttagtttaga atacataaat aataaacaaa aaaaataaaa 60 caaaaaagga ttttagtttt tatagcgttg aataaaaata ttttgtttgc gttcgtaaaa 120 tgcagcgatc catgtagcgt aagcgtatgt ggattgcgtt tgccataacg aattatgaaa 180 gcttcgcaaa gcgccatgta cacattttgg cggactacag gcgtaattca gatttgtcgc 240 catacctacc acacgccatg caaaagcagt tggcgcgatt ttctataata gttgcttttt 300 gtttgaatag ttggacggtc agaataggca gtttcttggc gcaagagttt cttcgactcg 360 atcatttcgc tacgaagttg gctggagagc agaagtgcgc gagtcgtgtg ataaagttaa 420 gttcagttgc agtaaatttt aaaagttgaa gttaaatctg gtgaaattaa atttgaagtc 480 ttaggtaaaa gttagtgagt gagtttgaaa gtaacaagtt gtgttaaacg ttttgtgtat 540 ttctttcttt agaggactga ggtccgtgaa taattcgcgt ttctgcgcaa aaggttttat 600 taagtttatc tcaacggaaa aaggtaatta taattaatta attgcatgaa gacttaaaaa 660 ctaatgtgat ttcgcgttgg acatgcagcg ccctgcgctt tttccgtacg gctggatttc 720 tatccgttgg cattcggcat ccgtcaccgc cgaatcgtga gtttagtccg ttcatcgaga 780 attctggagt ccgttcatcg tcgagacccg tgtgcagcgc aaaaggcccg cctttaaagg 840 tagttaaacg gaggcgtgag tgaagtcgct aaagtctctg tagcgaattt tctccttcgg 900 acgccaatta ccgaaggtaa aggtcccatc acgttacacc ttgtggacca tttccgagag 960 ttcggttcgt tgaagcgtga cgccaattgc acgccgcgaa cctcgatcct aacccaaaga 1020 tagggctgca ttactccccg ctcggattag cggacaggta cgcgaagccg ggtcgctggg 1080 tacgtgttcc tctattggca ggagcggatc gtcccgggta ttaataggcg aaatcgccgt 1140 ctcactctcg agatccgaaa tcggaagctt ccgttccagc cgccactccc tccgcaacgt 1200 tatttcggcc attttgcacg cgggtcaatt cttcgctgcc cgccaatttg cagcgacgga 1260 cacgccggtg gtcgccatct cgttaccgcc atcgccgagg tcgaccgtcg ccatcatcga 1320 gtctgcggag tcgacatttc gtttcccgtc aacgagccat catcgagtca acgccaaaat 1380 cctgcgcagg tacgtcaaga gcttatttta catggccatg tttgctcttt tccattccgt 1440 acggtttttg ctagtccaac ccgaatgaat ggtgaatcgt ttcctaaaaa tgaacagttt 1500 tgaagaaaga ttagttactt tttccgttac gtttaactat gcttaaataa aactactcgc 1560 taacttacat tacgtttcgt tccgtttatt agctcatttg agtaagcatc ctcttctgaa 1620 ttccctaaac gactttaagg ctgtggagga gtaaaagtca ttcttgcgtt tgattttcta 1680 gtgttgtttg accaagaacc gttcaaaact ctcccagaga gaaaaagtct ctaaacgtcg 1740 ggcaaatctg aacacgactg gtggtctctc attgagagtg gcgcttaagc caccgagttt 1800 gctaacaatc tctagtaaag ttttacgaac ggttaca 1837 // ID BEL-204_AA-LTR repbase; DNA; INV; 526 BP. XX AC AAGE02024599; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-204_AA_; KW BEL-204_AA-I; BEL-204_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-526 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024599; Positions 16168 15643. XX SQ Sequence 526 BP; 138 A; 91 C; 151 G; 146 T; 0 other; tgtctgcaag cagcttcagc atttctcgga tgaagtaatg atactcgtca cgttgaggga 60 ctccagggat gctataaact ggtttgcgct actatggcta atacggttgt agtagtttac 120 cggttcttag tactgcggac tctcggacta cctactacca ctaagcggac ggtcatcgat 180 ggtcatcggg agccgagagt agcagagagt ttagttggta tccaaacgtc gtcgcgatca 240 gaagcaaagg ggcgaaccct tgctcgtttt aatctgttag ccttagtgtc gtaattaata 300 aagttaatgt agtgttttta tgtaagaata aagttgtgtt tcgtgtgtta ttatgtgtgt 360 ttaactgcat gcgaaggaag gcataaggat gacgaggacc tgaagggatg aaggaacaag 420 gacggccagg atcaaggatt tggattggat aaggactaag gaaggtacgt tctggtgaag 480 gtcgagtgga ggtcaaattt catttcggtc ggacggttcc ccgaca 526 // ID Jockey-4_CQ repbase; DNA; INV; 4116 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4116 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 115-115 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 55..1203 FT /product="Jockey-4_CQ_1p" FT /translation="MTKKRRGSPLPGPPGDPLLAKNPFAPLANPGPAPPAT FT VPRKEKLPPFFVRSIEGNLKADLDALIKRGLKATMKLCTDGFKIIVPSKPH FT FNAVLEYLRQKKAAFFTHDIPADKPFKVVLRGLYEMQEGELTDYLSDCGLK FT VVAVHKIRRKETNTRFRDQLYLLHLEKGSTTLKDLKLIKAVANVVVEWEQY FT RPVHRDVTQCWKCLNFGHGGRNCFLASRCPNCGENHDQAQCAAEILVAKCC FT NCGGDHPSTDRACPKRAEFIQFRKNVAARKNNGRPKPTPINQESFPALPNR FT NRVPNLEPIPLPGLRPKPAPNPVPPGFQRFDPTPGVKPNPPGDGPRPRTKL FT LTAAELLNTFKIMYAKLMQCQTVEEQSMAVLEFTYQIVYG" FT CDS 1058..3844 FT /product="Jockey-4_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MAPDREPNCSPRLSSSTPSRSCMPSSCNAKQSRNRVW FT RSSSSPTKSFMDNYTPTRILIWNSCSIGRKTLELSDFLRSNNIEVAAISET FT HLKPGDKVWVPDYVPVTLERTRCKKGGVAVLVHHLVHFRVLPSLQTKFIEA FT VGVEVDTSTGPVAVVAVYCPKQCRIVDGSLADYKNDLLKLTRRFPRFIIAG FT DLNARHSLWGNPSNNRNGNVLAEDLQAGHYVVLHPESPTFFSHAGVGSTLD FT IALTNLSEFCTPLQALTELSSDHLPVIFNLEARVNERQRLRRHNYHRANWI FT RFKQFVDDRVEENPVLDSKEAIDRALLVLEDSLNEAKDLFIPTAEVSSKFV FT NIDPETKRIMSLRNAVRRQYQRTGNPGRKALFKKLNKIVSVRVEKFRNRQF FT SKHLKSIPPYSKPFWRLTKILKTKPKPIPPLKSDDLAVTPVEKANLIAGHF FT LTSHNLGRDIVSPMEGPVLESIHQLAHTPWVLPDDQKITLEELSGWLKGLK FT NMKAPGFDGIFNIVLKHLGEKARTLLVAIFNRCLELGYFPTAWKRAKVVPI FT LKPGKDPSSPTSYRPISLLSSLSKLFEKLIYRRLLAHVEENNILLDEQFGF FT RRGHSTVHQLQRVTKMIHRAKSVSKTTVMALMDIEKAFDNVWHDGLVHKLM FT QFNVPTYLVQVISDYLDRRTAQVNIGSSVSDPYDCPAGVPQGSILGPLLYN FT LDTSDIPPLPGGGTLSLFADDSAISYEGRNIRHLVTKLQNGLDVYTRYLSD FT WKICVNAAKTQAIVFPHRNTDRLKPTTKLKVLGTEVDWSQVVRYLGLLIDS FT KLLFRYHLDDRIIKGIAMLKKLYPIINRRSKASLKNKLAVYKMVVAPMLLY FT GSPVWQGCAMTHRKKLQVIQNKFLRLILNQPWRTRSSDLHRLASIEPVDAR FT LASLAEKHRNRALVSEHGSIRGLYP" XX SQ Sequence 4116 BP; 1014 A; 1202 C; 1073 G; 825 T; 2 other; agctcgttga taacacaaaa agttttggag gataaggagt agaagtcagt tacgatgacg 60 aagaagagga ggggcagccc ccttcccggc ccgccgggtg accccctgct ggcgaaaaat 120 ccgttcgcgc cgctcgccaa ccctggaccg gcaccgccgg caaccgttcc gcggaaggag 180 aagctcccgc ctttttttgt gaggtcgatt gagggcaacc tcaaggccga cctcgacgcg 240 ttgatcaaac gtggcctcaa ggccaccatg aagctgtgca cggatgggtt caaaatcatc 300 gtgccgtcga agccccactt caacgcggtg ttggagtacc tccggcagaa gaaagctgcg 360 ttcttcacgc atgacatccc ggcggacaag ccgttcaaag tggtcctgag gggcctctac 420 gagatgcagg aaggagaact caccgactac ctatccgact gtggcctgaa ggtggtcgcg 480 gtccacaaga tccgccgaaa ggagaccaac accaggttcc gcgaccaact ctacctcctc 540 cacctggaga aggggagcac gacgctgaag gatctcaagc taatcaaggc cgtcgcaaac 600 gtggtggtcg agtgggagca gtatcgcccg gtccatcggg atgttacgca gtgctggaag 660 tgcctgaact tcggccatgg cggccgaaac tgttttctcg cctcgcgctg tccgaactgt 720 ggtgaaaacc acgaccaagc ccagtgtgcc gctgaaatcc tggtggccaa gtgttgcaac 780 tgcggtggtg atcatccgtc gaccgatcgt gcctgcccca agcgtgctga atttattcag 840 ttccggaaga acgtggctgc ccggaagaac aacggtcgtc cgaagccaac cccaatcaac 900 caggagagct ttccagctct tcccaaccgc aacagagttc cgaatctcga gccaattccg 960 cttccgggcc tgcggcctaa gccagctccg aatccggttc cccccggctt ccagagattt 1020 gacccaactc caggagtcaa gcccaacccg cccggcgatg gccccagacc gagaaccaaa 1080 ctgctcaccg cggctgagct cctcaacacc ttcaagatca tgtatgccaa gctcatgcaa 1140 tgccaaacag tcgaggaaca gagtatggcg gtcctcgagt tcacctacca aatcgtttat 1200 ggataattac accccaaccc gcatcctcat ctggaactcc tgctctattg gcaggaaaac 1260 attggaacta agcgactttc ttcgatcaaa caacattgaa gtcgctgcca tttcggaaac 1320 gcaccttaag ccgggggata aagtctgggt gccagactac gttccggtta cactcgagag 1380 aacgcgttgc aaaaagggcg gcgtcgctgt cctggtgcat cacctggtac acttccgggt 1440 cctacccagt ctacagacca agttcattga ggcggtcggc gttgaggtcg acacctcgac 1500 gggcccwgtc gcagtcgtgg ctgtctactg ccccaagcaa tgtcggatcg ttgacggatc 1560 gctggccgac tacaagaacg acctactcaa gctgacgcgt cgctttccac gattcatcat 1620 cgccggcgac ctcaacgccc ggcactcgct ctggggcaat ccaagcaaca acaggaacgg 1680 gaacgtactt gcggaggatc ttcaagctgg ccattacgtc gtcctgcatc cggagagccc 1740 gacgttcttc tctcacgctg gcgtcggttc aactctggac atcgccctga cgaacttgtc 1800 ggagttttgc acaccactcc aagctctcac cgaactctcc tcggatcatc tgccggtgat 1860 cttcaacctg gaggccaggg tcaacgagag gcagcgtctg aggaggcaca actaccatcg 1920 cgccaactgg attcgattca agcagtttgt cgacgaccgg gtggaggaga acccggtgct 1980 agactcgaag gaagccatcg accgtgccct gctagttctc gaggactcct tgaacgaggc 2040 caaggacctg ttcatcccta cggccgaggt ttcaagtaag tttgtgaaca ttgaccctga 2100 aaccaagaga atcatgtccc tcagaaacgc ggttaggagg cagtaccaaa gaactgggaa 2160 cccaggtagg aaggctcttt tcaagaagtt gaataagatt gtctcggtca gggtagaaaa 2220 gttcaggaac cggcagtttt caaaacacct caagtccatc ccaccctact ctaagccctt 2280 ctggcgccta actaagatct tgaaaacaaa acccaaacca atcccgcccc tgaaatccga 2340 tgaccttgcc gtcacgcccg ttgaaaaagc caaccttatc gcgggtcact tcctgacctc 2400 gcataacctg ggacgtgaca tcgtaagccc aatggagggt cccgtgcttg agagcatcca 2460 ccaacttgcc cacacgccgt gggtgctgcc ggacgaccaa aagatcacgc tggaggaact 2520 ctccggctgg ctgaagggac tgaagaacat gaaggcaccg ggcttcgatg ggatcttcaa 2580 catcgtactg aaacacctgg gggaaaaagc gcgaaccctt cttgtggcca tcttcaaccg 2640 ctgcctggag ctgggatact tcccgactgc ctggaagcgg gccaaagtag tcccaatact 2700 caagcccggc aaggatccgt ccagcccgac cagctatcgg cccatcagct tgttgtcttc 2760 cctaagcaag ctgtttgaga agctgatcta ccgccgcctc cttgcccacg tcgaggaaaa 2820 caacatcctg ctggacgagc agttcggctt ccgccggggc cattcgacag tccaccagct 2880 gcagagagtc accaaaatga tccatcgcgc caagtccgtt tcgaaaacca ccgtaatggc 2940 gctgatggac atagagaagg cgttcgacaa cgtgtggcac gacggtctgg tacacaagct 3000 gatgcagttc aacgttccca cctacctggt gcaggtgatc tccgactacc tggataggag 3060 aaccgcccaa gtcaacatcg gaagctctgt atcagacccg tacgattgcc cagctggagt 3120 tcctcaaggc agcatcctgg gcccgctgct ctacaacctg gacacttccg acattccacc 3180 tctgcccggc ggcggcacgc tttccctttt cgccgatgat tcggccatca gctacgaggg 3240 gcggaacatc cgacatctcg tcaccaaact ccagaatggc ctggatgtct acacgcggta 3300 tctctccgac tggaagattt gcgtgaacgc ggcgaaaacg caagcaatcg tgtttcccca 3360 ccgcaacacg gatcggctca aaccaactac aaagctgaag gtgcttggaa cggaggtgga 3420 ctggtcgcag gtcgttcggt atctcggact gctgatcgac tccaagctgc tgtttcggta 3480 ccacctcgac gacaggatca tcaagggcat cgcgatgctc aagaagctgt acccgatcat 3540 taaccgtcgg tcgaaagcat cgctgaagaa caagctggca gtctacaaga tggtggtggc 3600 tccgatgctg ctgtacggtt ccccggtctg gcaggggtgt gccatgaccc accggaagaa 3660 gctgcaggtg atccagaaca agttcctgcg actgattctg aaccagccct ggagaacaag 3720 gtcctctgac ctccatcggc tcgctagcat cgaacctgtc gacgcgaggc tggcttcact 3780 ggccgagaag catcggaaca gggcgctggt gtcggagcac gggtcaatcc gcggattgta 3840 cccctgagtg agtgtgsttg agtgtttgag agtgtgttag ttttaaggca gcgtgctttc 3900 tgattaagct aggtaggata ttttaataat tcaaaaaaac agcagcctga aatgcgcgat 3960 gtagacttta gaattagttt aaaattgcag ttcacttgaa caccataata actaagcctg 4020 aaaagcagtt gtgctgaatc ccttactctg taaacccctg gattttaaac cgacgaaata 4080 aaacaaaatt tgatttaatg caaaaaaaaa aaaaaa 4116 // ID DNA-12_AAe repbase; DNA; INV; 257 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-12_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-257 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1267-1267 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. 4 bp TSD. >4000 copies. XX SQ Sequence 257 BP; 87 A; 45 C; 49 G; 76 T; 0 other; tctcgcggaa cattactaaa ggacctatgt acaaatgaga gattctctcc tctctcgttc 60 tctttcgatt ataacagtgg aatactaaag attttgggaa gtttttcact atagatcgaa 120 agtcaatttc cttgactagc gtttcataca aaaaacgcaa cagaagaggt taatgtgact 180 cagttattaa tgaaagagaa agtaaacaaa gagagcctct caatgttaga taggtccttt 240 agtaatgttc cgcgaga 257 // ID Gypsy-63_CQ-LTR repbase; DNA; INV; 2006 BP. XX AC AAWU01018550; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-63_CQ_; KW Gypsy-63_CQ-I; Gypsy-63_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2006 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 506-506 (2011). XX DR GenBank; AAWU01018550; Positions 11790 13795. XX SQ Sequence 2006 BP; 503 A; 532 C; 480 G; 491 T; 0 other; tgtaacatgg tcactctatt cgtttagttg tttttcattt agctataatt atttaatata 60 aacaagtcca actcaatcaa gggtttgtgt cgaagctagc acatggcaag ctttagtaga 120 acagtgctac acctcacttc ctcacacccc tgccaatccg tccgataagc caaatagtcc 180 tggagaaagc acaacttgca ccaccacgta tgggaagact tgctgggtta aacctgccga 240 gagaacttgc acgatcaagc cctctcggaa agcctggaat gaattagaac aggaaagcca 300 gatgacaaat atgtaagagg gaacgagatg ccttcagtaa tattgccaaa atctgttcgg 360 gtaaattcgt tattcgaaac tacgaaattt ctcacgaaat tacgtacttc cgttcaacct 420 ttccctaatg acgctttctg aggaaagtta caatttctac tcgtccactt ggtcctgagt 480 cgcggaagag gcagcagcgc cgatccagta gtcaagcccc ccggtgtgtt tagtgacggt 540 aactcgagtg aaacgagtga tctgtagatt tgtgtagtta atctttgttc tttcctgtag 600 tgtccaacct tagcttagct agtagtttgt gcggtccgat tctgtgtgcg aagaggaatt 660 aaaaggtaaa gtacctagtg tgtgtgttga tgtgtgatgt gtgttaatcg tcgccaaaca 720 gccgacgacg gcgttctgtt ctagagtcgg aaccaaccgc ctcctaccag ctcgtgctcg 780 ccacaccgcc accaaaagcc ttagccgccc tgctgacccc gcggtctcag cgatcgagcc 840 ccggaaggcc cactgggatt cggaagggag accacacgct acacgtggct ccagggagag 900 ccatccattg cgccagccgt tcctgtccag ccgctggtcc agattcgccg cagaagttca 960 ccagaccgta gacgtcggtc ctcgtggtca cgcgccgtcc aggttccaga aggaggcgac 1020 ccgtccacca ggccgggcag ccggccaagc ccgatttcaa ccaggccgcc cccttaagat 1080 cgcgcgcctt tcatctgctc cggaagaata tcgcaccgcg cggtgcacta gcgacgcgta 1140 agccaatcaa cgaaaccggt ccgtgagtac tcacctataa cgtaaaaacc gcatgcgtac 1200 gcactcgtag acgtcgtccg agttgccaaa aacacacaca cacattttct aaggctcgtt 1260 gccctttccc gagcctagtt ccgttggtca ccgttacacc tgtgacctcc gggctaacgt 1320 aactaagccc cagaacaaat ccgcacacag atagttagta gagagagaga gaaccactaa 1380 gagaagagga aatgaacacg taggacacta caaacatgta acttaaacct aggttaaaaa 1440 tgtacaatta aaaaccacat ttagcatgca atatatctcg tttttcacct cccccaagac 1500 gttgttagcg tagttatttt ggtgaacgaa atgtctctct caccagatta gttgcacggg 1560 tctagaggtc cctcgcttcg cattcgacgt ctccacgttt ttgttactgt tgttttactc 1620 caagtgcaag gctgcttggg aatctcagcc tggggtcttt tcccgtcgca gcgattggtg 1680 cgggggtaga aattatcctg tgaggatcca gagcgaacag gtgactggtt gttcggacgg 1740 acaacgggtg gccccgttca gttttggata tttttggagt catcgcacgc gaagcggagt 1800 acctcttacc tattttgagt cccttgaacg ttcccccatt catttcaggt ttttacaaaa 1860 cgacccgccc tgcggctggg tctagtcggg cataaaccag gcaagtcggt cctctacgga 1920 ggtggcgcat aaaccgtttg ctaattaaca aacccgtggg acggttatcg gacggattcc 1980 ttcgtggcta cgatttgcgt accaca 2006 // ID Gypsy-83_CQ-LTR repbase; DNA; INV; 169 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-83_CQ_; KW Gypsy-83_CQ-I; Gypsy-83_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-169 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 546-546 (2011). XX DR [2] (Consensus) XX SQ Sequence 169 BP; 48 A; 47 C; 24 G; 50 T; 0 other; tgtaataaca accctgcaat gtcgacatac aagtcgacat accttttgta agcaaagtcg 60 ataagctagt gtgagcaata aaactctgtc tcttctactg cgaaccaccg acctactcgg 120 tcctgttttc cttatctcat catccgagtc tcaatcactt tcacttaca 169 // ID BEL-55_CQ-I repbase; DNA; INV; 3419 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-55_CQ_; KW BEL-55_CQ-LTR; BEL-55_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3419 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 263-263 (2011). XX DR [2] (Consensus) XX CC 'AGATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 623..3418 FT /product="BEL-55_CQ-I_1p" FT /translation="MDSTTKKKLTDAFRQRYRTERKVIFVQELLLGLYEPT FT LAQLTVLYEFLVKTYREQSRHHLQVIGLIPDECLAAQEVEFEKYDALFFEV FT ATALAELQIEAEQTVPSVTTPIAANRQRLQAKAWLKQMLKAWESNQVDIRD FT SGIETQAPESQEVVLRDCGLSKPKPLHEQAEVYRKLEAITDRVATSVEEML FT LPQHASNGSMPESVSKTISASLNQGPTSYNQCRDVVHQATKPFETRSDPAV FT CRVSPGNSRDVAASASPEPLLPAKLCDSLPSGVTSQRRDTQSDTSNERNRK FT HFVEPPKQDPVSHRVSAGIHEDVAASASLQPLIPSNLVDIQVAITTKHDEI FT VNAEPLLPATELVRDQHNEANHSCVPWDSSPEVTSRLVKVPPVVIHDALVL FT RSVHPDGKSSAVFDPSSKPAKPTNPQGHHRANQDGGLIPGKDPLAPSEVKA FT PDKLEHIHTTPNSQPQTFDAHPSGRQTVRITSRTMPHSCVPISRHDPVTPT FT LTGADQSKPEANRCSRIVHSVTAFSTTTVDFYITLKHWNPAKALSQRHACR FT AMFVTSAPKKRFVVKAPDRTEDLAHKGSAGRQTLRHENVLQEPRPICIQQP FT LKDHSKSDFNRSQAPYEFCKPRKRLLRILDDLTTRGAPVEDPTIVCILTHP FT WDTGTKSPFIVTAESDQVQPLQQEVVQYSDDEVNQRCVHSDSGPQTKVRIR FT KHTQNQINRVQPRQMTVPKYQTLMQTAAKAPDKRELPFPSSVSPGNSAFHV FT NEQRRLEDLGFIGELEVKCSRSSHSWSYAERGSSNVFVVHQSRKPSHKSTS FT NQESSASSPETRPLLIALEPSQPVQMNLVIAILGIQGSCHEPPSQDELFQR FT TVDRRRRFWHPTIPPDELSHEPKSAQPSKRRNNFRGIHAVLPAIEDVGERR FT DPADQSAAIHRKRCVGLTKELRDLNGGS" XX SQ Sequence 3419 BP; 913 A; 1061 C; 844 G; 601 T; 0 other; tggtctcatc gaaccggaat tgaaccccgc ccaagcctcc aaaagtgaaa tcactggatt 60 caaccgggac tagtccagcc gttccggaag cgtctggaat acgtctacca agcactcctg 120 tacaagttcc ccacccgctg ccccatcgca gctcgacgtt tccgagcgac tgtccagccg 180 tggaagtcat cggaagtcgg attcccggaa gtgaactcgc gcggatgcac aaagccggcg 240 agttgttcct gcccaagttc ccccctgctg ccccatcgca gcttgaccgg tttcgagcga 300 ctgtcccacc gcggaagttg cctgaattcg gattcccgga agtgtccacc ctaggaactt 360 cctccaagca gttcacccga cgaactgctc tcgcgcggat gtacaaagcc gttgcgtcat 420 cccactccaa gcattcccgg aacaacccat tccgagccga aaccgcgtac aaactagctt 480 gcgcgtgtgt gtatgtgtgt ctccgccaaa aagtgtcgtc gtccagccga ggaggcgaac 540 tttccccgac caaacagcag ccactgtccg accgataaca ggtgcctcga cgttgacttc 600 cttagaccaa agggaagtag ccatggactc cacaacaaag aagaagctga cggatgcgtt 660 ccggcagcgc taccgaacgg aacggaaagt aattttcgtt caagaactgc tgcttggtct 720 gtacgaaccg acgctggcac agttgacggt gctatacgag tttttggtga aaacgtaccg 780 agagcagagc cgacatcacc tgcaagtcat cggacttatt ccggacgaat gtctcgccgc 840 gcaagaagtc gaattcgaga agtacgacgc gctgttcttc gaagttgcta cagcgctagc 900 agagttgcaa attgaagccg aacaaaccgt accaagtgta accacgccga tcgccgccaa 960 cagacagcgc ctgcaagcaa aagcatggct gaagcagatg cttaaagcct gggagtccaa 1020 ccaagtggac atccgtgatt caggcatcga aacccaagca ccggagagtc aggaggtggt 1080 cttgcgagac tgcggtcttt caaagcccaa acccctccac gaacaagcag aagtgtaccg 1140 gaagttagaa gccatcaccg atcgcgtggc tacgtcagtt gaggaaatgc tgctcccaca 1200 gcacgcatcg aacggatcaa tgccagaatc tgtttccaag accatttccg cctcgctcaa 1260 ccagggtccg acgtcctaca accagtgtcg agacgtcgtg caccaagcca ccaagccttt 1320 cgaaaccaga tccgatccag ctgtatgccg agtttctcct ggaaactcca gagatgtcgc 1380 cgccagcgcg agtcctgaac cattgctgcc agccaagtta tgtgattccc tgccgagtgg 1440 agttacaagc cagcgtcgag acacccagtc tgacacgtcc aacgaaagga accggaagca 1500 tttcgtggaa ccacccaaac aagatccagt aagccaccgt gtgtctgctg gaatccacga 1560 agacgtcgcc gccagtgcga gtctccagcc actgattcca agcaatcttg tggacatcca 1620 agtcgccatc accacgaagc atgacgaaat cgtaaatgcc gaaccactgt tgccagcaac 1680 ggaacttgtc cgagaccaac acaatgaagc aaatcacagc tgtgtcccgt gggacagtag 1740 ccccgaagtg acgagcaggc tggtgaaagt gccacctgtc gtcatacacg atgcacttgt 1800 tctgcgtagt gttcacccgg acggaaaatc ctcggcggtg ttcgacccaa gttcgaaacc 1860 tgcgaagcca accaatcccc aaggacatca ccgcgccaac caggatggcg gtctcattcc 1920 gggaaaagac ccgctagcgc cgtctgaagt gaaggccccg gacaaactcg agcatatcca 1980 taccacgcca aacagtcaac ctcaaacctt tgatgcccat ccaagcggcc gtcaaactgt 2040 tcgtatcacc agcagaacga tgccccattc ctgcgtgcca atatcacgcc acgacccagt 2100 cacacccacg ctgaccgggg ccgaccagag caagcctgaa gccaacagat gttctaggat 2160 tgtccattcc gtaacagcgt tctccacaac cacggtggac ttctacatca cactcaagca 2220 ttggaatccg gccaaagcgt tgtcccaacg acatgcttgc cgggccatgt tcgtgaccag 2280 tgcccccaag aagcggtttg tcgtcaaagc accagatcga accgaggact tggcacacaa 2340 gggaagtgct gggagacaga cactacgaca tgaaaatgtg ctccaagaac cacgccccat 2400 ctgcattcag cagccgttga aagaccacag caaatcggac ttcaaccgaa gtcaagctcc 2460 ttacgagttc tgcaagccaa ggaagaggct actgcggatt ctggacgacc taacaacaag 2520 aggcgcgcct gtagaagatc caacgatcgt ctgtatcctg actcatcctt gggatacggg 2580 aacaaaatcg cccttcattg tcactgcaga atcagaccag gtgcagccac tgcaacagga 2640 agtggtgcag tactccgacg acgaggtcaa ccaacgctgt gtccacagcg acagcggccc 2700 ccagactaag gtccgaatcc gcaagcacac tcaaaaccag attaaccgcg tccaaccgcg 2760 gcagatgaca gtcccgaagt accaaacgct gatgcagact gcggcaaagg caccagacaa 2820 acgtgaactt ccctttccaa gcagcgtatc cccgggcaac agcgcattcc acgtcaacga 2880 gcagcggcgt ctcgaggact tgggtttcat cggagaactt gaggtgaagt gcagccgatc 2940 atcccactcc tggtcctacg ccgagcgtgg atcgtccaac gtcttcgtag tccaccagtc 3000 gagaaagcca agccacaagt caacaagtaa tcaggagtcg tcagcatcat cacctgaaac 3060 cagaccgctc ctgatcgcac tagagccaag ccaaccagtc cagatgaatc tcgtgatcgc 3120 aattctcggc atccaaggta gctgccacga accgcccagc caagatgaac tgttccagcg 3180 aaccgtcgat cgcaggcgca ggttctggca tccgacaatc ccaccggacg aactgtcaca 3240 tgagcccaag tcagctcaac ccagcaagcg cagaaacaac ttccgaggga ttcacgccgt 3300 cctaccggcc atcgaagacg tcggcgagcg acgagatccg gcggaccaat ccgcagccat 3360 ccaccggaag cgttgtgtgg gcctaaccaa agagttgaga gacctcaacg gggggagta 3419 // ID BEL-68_AA-LTR repbase; DNA; INV; 479 BP. XX AC supercont1.93; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-68_AA_; KW BEL-68_AA-I; BEL-68_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-479 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.93; Positions 110610 110132. XX SQ Sequence 479 BP; 183 A; 67 C; 107 G; 122 T; 0 other; tgtacgcacc accaaacatt gaaaacccta cgtttcttgc acctgctggt gtgtattaga 60 caatgactat tggttggtgg aattggagat gaagtaggtg aaaatttggt agttgaatta 120 tcccggaggg caaagtaaat ataaaaaggc aaagctctgt aaacaaacga aagcaacaaa 180 ttgatttagt ttaaagatgc agataaaaag tgcggggaaa tagttgaggg taaaaagtat 240 ataagttcga tatagtgatt aagaaacagt tcgtgagtgc ttcgttggca gcgcctgcca 300 ccgatgccaa gaaaaagacg aaaataatta gatttggtaa agaaaccata acatgtaagc 360 aaggaatata actaaattaa ggggaaaaat attaataata aaattatagt ttcaagcttc 420 gctacgagaa tcgctgctgt gaaaattatg gttgatttcg aaaatcacta gcccgaaca 479 // ID Sola3-2_HM repbase; DNA; INV; 5948 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Hydra magnipapillata. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5948 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1455..4253 FT /product="Sola3-2_HM_1p" FT /translation="LPQTNQYNEKTLIESRLQTYLSCDDWLCAYHRFSFGI FT GWKPPNRCQYPSHIFQVGKKAPKIRSIPLNTLATYNKSNSSLPIGAVFCFK FT HLKTISIDNIIDMIEPIPTVKSSSYVDKVYVPDLAIISDKTIEESKLKASS FT LCEALGASPLSFHITQKRIEDLTQGTKQKVKKKFEKCKVQLEKIFAEAIAP FT GQSEALITQVLNSEDENVATEMVPEDLIVQVKMFQESDSLGKILLLSVVNH FT NKYTKETLMKVFDCKKHQIEKGRKLQKENIGLSIPKKGKVKICRMPQEKIE FT HFLEFLFSRGLLQDVAYGINKIKFDNGEKQKVANAILMMKFSHTITFYKEI FT CLETNYVPMSDSSLWRILCGIKPSQRKSLAGLDDVTAAGMNGFQTLIAISE FT KWKYKNVTKFLEKGKRYLKSNYPSKCSENSTLHSHSTSFALSDIDPDLNQS FT NITADDEQCIDCTDLISVINQVRELVIQSEDEDLLYDLNVATEDVEAYMKH FT QIRDAQQKMAKIMAFEQLDEETGFWLKDFCQKILPAKFREGQKEYFGKKGM FT TLHVDVFFFYENGRMKKKVYFTAVYRCEQSLVDVLCLADVVLAKVKNDFPY FT LKNLYAKSDNASSYHGNFYLEALYNVCKAKQFFLKRYDYNEPSRGKDQCDR FT ESAGAKCVIRSFVDAGNNLLTAEDLYEALHNGKGIQNADVAVVIINSKASI FT LSGSNLIPNISSYHSFQFFPDHMIMWRYFRIGKGKKWNYTNVTFHTSVEIV FT KPFSSTCKIYTAPTISKKPRVDRRVNTLKFCTQFGCTDSFFDNESLEVHLL FT SENHNFQSVKKSMVDRAKLSYINKMKISNLNSYDYLPSSQSVIKGIENFTD FT ITGWALPKRKVFRYSSKQKTLLMQMFMSGEEQGKKMSPEQVHQQLRTKLKP FT SEYVTTQQIRSLFSRRVNLLLYLLIPKV*" XX SQ Sequence 5948 BP; 2139 A; 912 C; 971 G; 1926 T; 0 other; gagccattat ctgagaaaga taggcaaagt ctggaagtga tgtctttaaa atctctttat 60 tttcagatat cttataaaac acaatgaaat taggccacag ttaaattttt tttcgaaaac 120 tcccattatt tgcggagata tcgggggccg aaatttggcc cattttgtaa aaatattggt 180 gcatgaaatg gaattactac ttttagggac aattttagca atatgaactt gagaaacaca 240 tttctttttc ttttttctat tactctagct ggatttctct gcttaaaata aaaattcaaa 300 atgtttacac tttttgttgt ctagaaatca cggttcaaaa ccgctaaatt tgaaatttta 360 cccgtacttt gaccatttat atctcctaag gccattactt gcacataatt tacttaactg 420 tttttgaaag ccaataaggt atacattgaa tttatagaaa aaaaattatg agtatgcaag 480 agcctgagaa agagtaagct tccaaaaatg aattttcgcg gttaatacgc tttgttagat 540 tacaaataac aatcgtggcc ttaggtctgc catgaaacgt ataagttatt ttgttacctt 600 atacatgcat cttccaatat ctgaaaaaaa atccatgtgc tcaagcgata aaaagcctag 660 agaagtccta gaaatagtct aattttgcca tattcttaac caacaaaata ccctagcaac 720 gacatagata tgcttagcaa cggctacttt tactctttaa tagttagtac tcaaattaaa 780 aataaataaa aaatacattt ttataacaaa actgtataaa ctgtacaaaa gtgcttgata 840 aaaattattt tttaagcttt tttaatgtaa caaaatgcgc aacaataggt tcaaagaatg 900 cgatggcaaa agtgtaataa ccaaagtaag gaagtgacat taaagatact ttttttaatg 960 actgtaatga ttgctacaca agcaattccc tttgggcagt catttgtaat actttcttta 1020 gacaattatt aatgattgca agactatttc cttggagaca cattcttctc gaagaattag 1080 cacatgaatt taataaatag ccatagcatg agatgagaaa ctacgaaata aagttttttt 1140 gttttgttat cttttgttta ttcaataaaa aataataaaa taaaaaataa aagttaaata 1200 caaaatgtca gagagcaaca atatagtcag tctctatgaa aaatgcgtgt attttgataa 1260 aggacagact tcttgtggcc cttttaagcg ttggaaacaa catgggatct ttcaaattaa 1320 tgaattaact gctgacataa gtaatcattt acaagtacta aaggtataat ttatcaatta 1380 atatatatat ttttaattat aagcaacaat acctccaaca aatagatgac aaatttaaca 1440 ttactatttt atagttaccg cagacaaacc agtataatga aaaaactcta attgaaagtc 1500 gattacaaac ttatctttca tgcgacgact ggttatgtgc ataccaccgt ttttcttttg 1560 gtattggatg gaagccgcca aacagatgtc aatacccaag tcatatattt caagttggta 1620 aaaaagcacc aaaaattaga tccatacctt tgaataccct tgctacttat aataaaagta 1680 attcatcgct acctattgga gccgtatttt gcttcaagca cctaaaaact atttctatag 1740 ataatattat tgacatgata gaaccaattc caacagtaaa gtctagttct tatgttgata 1800 aagtttatgt tccagattta gcaatcattt cagataagac catcgaagaa tcaaaattaa 1860 aagctagttc tttgtgtgaa gcactaggag caagtccact gtcttttcac ataacgcaaa 1920 agcgaataga agacttaaca caaggaacta aacagaaagt taaaaaaaaa tttgagaaat 1980 gtaaagtgca gctcgagaag atatttgctg aagcaattgc acctggtcaa tcagaagcac 2040 ttataaccca agtattaaac agtgaagatg aaaatgttgc aactgaaatg gttccagaag 2100 acctaattgt acaagttaag atgttccaag aaagcgattc tttaggaaaa attcttttgc 2160 tttcagttgt taatcataat aagtacacta aagagacatt aatgaaagtc tttgattgca 2220 aaaaacatca aatagaaaaa ggacgcaagt tgcagaaaga aaacattgga ctttccatac 2280 caaagaaggg taaggtaaaa atatgccgaa tgcctcaaga aaaaattgaa cattttttgg 2340 aatttttatt ttctcgaggt ctgcttcaag atgtagcata tggtataaat aaaataaagt 2400 ttgacaacgg agagaaacaa aaagtggcta atgccatttt gatgatgaag tttagccata 2460 caatcacatt ttataaagaa atttgcctag aaactaatta tgtccctatg tcagatagtt 2520 ctttatggag aattctttgt ggaataaaac cttctcaaag gaaatcttta gctggtcttg 2580 atgatgttac agcagctggc atgaacggtt tccaaacatt aattgccatt tctgagaagt 2640 ggaaatataa aaatgttact aagtttcttg aaaaaggtaa aaggtatctg aagagtaatt 2700 atccttccaa atgtagtgaa aattcaactt tgcatagtca ctcaacttca tttgcccttt 2760 cggatataga tccggactta aaccaatcaa acataaccgc tgacgatgaa caatgcattg 2820 actgtactga cctcatttca gtaatcaatc aggttcgtga gttggttata caaagtgaag 2880 atgaagatct actatatgat ttaaatgtgg ctacagaaga tgttgaagcc tatatgaaac 2940 atcaaattcg tgatgcgcag caaaaaatgg ccaaaataat ggcttttgag caacttgatg 3000 aggaaacagg attttggcta aaggattttt gtcaaaaaat attgcccgct aaattccgtg 3060 aaggccaaaa agaatatttt gggaaaaaag gaatgactct acatgttgat gtcttcttct 3120 tctatgaaaa tggtcgaatg aaaaaaaaag tgtattttac tgctgtatat agatgtgagc 3180 aaagccttgt tgatgtactc tgtctcgctg atgttgtttt ggccaaagtt aaaaatgatt 3240 ttccttattt aaagaatctt tatgctaagt cagataatgc atcatcttat catggaaact 3300 tctacttaga agccctttac aatgtttgca aagcaaaaca atttttcctt aaaaggtatg 3360 attacaacga gccttcgcgt ggaaaagatc aatgtgatag agagtcagca ggagcgaaat 3420 gtgtcattag gagttttgtt gacgcaggaa acaatttatt gactgcagaa gatctttatg 3480 aggctcttca taatggtaaa ggaattcaaa acgctgatgt tgctgtagtt ataataaatt 3540 ccaaagcctc tattttatca ggatcaaatc tgattcccaa cataagcagt tatcattctt 3600 ttcagttttt ccctgatcac atgataatgt ggagatactt ccgaattggt aaaggtaaaa 3660 aatggaacta tacaaatgtg acattccata cttcagttga aattgttaag ccctttagtt 3720 cgacttgcaa aatttatact gccccaacaa tttccaaaaa accacgtgtt gatagaagag 3780 ttaacacttt aaagttttgt acacaatttg gttgcactga ttcatttttt gacaatgaat 3840 cattagaagt acacttgtta tcagaaaacc acaattttca atctgtgaag aaatcaatgg 3900 tagacagagc aaaactatct tatatcaaca aaatgaaaat ttcaaaccta aattcttatg 3960 attatctgcc gtcatcccaa tctgtcatta aagggattga aaacttcaca gacatcactg 4020 ggtgggcatt accaaaaaga aaagtgttca gatattcttc aaagcagaaa acattgttaa 4080 tgcagatgtt tatgtctgga gaagaacaag ggaaaaaaat gagcccagag caggttcacc 4140 agcagctgag gacaaagtta aagccaagtg aatatgtgac aactcaacaa attcgatcat 4200 tattctctag gcgagtaaat ttacttctat atttactgat acccaaagta tgaatgatat 4260 aaaatttaat tttattcttt tcttgagttt taagataaat attttgaatt taccttcatg 4320 cttcaagtac atttttttag atggagtaaa caaaaaagag aaggaaagct agttaccatt 4380 ttttgtgaag atgctaatct aactgatgac atgattgatc tgactgatga tgcgagacat 4440 acaagtcaaa ataaagatga tgatttaaac aacgatattc agctcatagc aaaggagtta 4500 gcagctgatt ttcaaattgg tgattgggtt gctattgaat ataatatgca gtggtatcct 4560 gcaataattc aaagtgtaag acgttataac atcaaaatat ttaatttttg agtattagat 4620 cttttttatt aagttatata attattaatt tattaaccac aacatttaaa tgtaagaaat 4680 attcagaaag caccaatgat tagacgttgt tttactttta acttaaccct gtaggtcaat 4740 cgtgacaatg ttcatgtttc ttgtatggaa tattctacta cacctggaaa aaactgtttc 4800 aagtggccaa ccaaggaaaa tattctccaa tatttatatc ttgacgttat ttgtaggatt 4860 gatgcaccaa caccaactaa tcaaagaggg gactattctc tctcaataga cgatatgtat 4920 aacatagaaa ttcagattga cagataaact tctttaaaac tttatttggt ttgctttctt 4980 aaacaagttg tatttgaaaa aacgtttatc cgaaatttac ataattgttt taacaatgat 5040 ataaacactt caaagtgtgt gatgtttgga tctttttctc attaaagaag cttagaaaat 5100 aacttttatc aagcgctttt atacagtttt ttgttataaa aatgtatttt ttatttattt 5160 ttaattagag tactaattat taacgaatta tagaagccgt tgctaagcat atctatgtcg 5220 ttgctagggt attttgttgg ttaagaatgt ggcaaaatta gactatttct aggacttctc 5280 taggcttttt atcgcttaag cacatgaatt ttttttcaga tattggaaga tgcatgtata 5340 aggaaacgaa ataacttata catttcatgg cagacctaag gccacgattg ttatttgtaa 5400 tctaacaaag cgtattaacc gcgaaaattc atttttggaa gcttactctt tcccaggctc 5460 ttgcatactc ataatttttt ttctataaat tcaatgtata ccttattggc tttcaaaagc 5520 agttaagtaa attatgtgca agtaatggcc ttaggagata taaatggtca aagtacgggt 5580 aaaatttcaa atttagcggt tttgaaccgt gatttctaga caataaaaaa tgtaaacatt 5640 ttgaattttt attttaacca gagaaatcca ctcagagtag tagaaataag aaaaataaat 5700 gagtttctca aattcatatt gctaaaattg cccctaaaag tagtaattcc gttccatgca 5760 ccaatatttt tacaaaatgg gccaaatttc agcccccgat atctccgcaa ataatgggag 5820 ttttcgaaaa aaaatttaac tgtggcctaa tttcattgtt ttctataaga tatctgaaaa 5880 tgaagagatt ttgaagacat cacttccaga ctttgccgat tttgcctata tttctcagat 5940 aatggctc 5948 // ID Rehavkus-1_NVi repbase; DNA; INV; 8458 BP. XX AC AAZX01001051.1; XX DT 14-MAY-2009 (Rel. 14.06, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE Rehavkus DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus; FB; FB4; NOF; Rehavkus-1_NVi. XX NM Rehavkus-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8458 RA Bao W. and Jurka J.; RT "Rehavkus DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(6), 1158-1158 (2009). XX DR [1] (Consensus) XX CC TIR is ~960 bp long, TSD is 9 bp. XX FH Key Location/Qualifiers FT CDS join(4576..4965,5110..5946,5808..6413) FT /product="Rehavkus-1_NVi_1p" FT /translation="MEENMLCRYTDSNLILVHSEFFEYRQCRHLLPNPCPI FT AMAIIVLLGILLYTYMRILQQIYSKLATDLLLCADSPSSYDVLISMKQQRN FT ATIRHQKLAYTQIKIISKQPEVSETFLISELKKWPDIFNNGGQKNKVTPHD FT SDYAERDISNPYNYDELDFILEIPKHDVIDCLETKKKKEEWTEKIREKIKI FT LKNLLCPFALNGHITKDKFKFYGRCSDCETGIFGGSNNNEDNLSIRVKTST FT TYEIPHSKKLKQNGSRRREVKKVLRNQTASDYRESELGKIDTNYEPPDLNS FT TVTLRKVKEEALDESIGYTEFKRTSITNKVFVNLCNIRKLSVNPFYTVYFS FT DYQLQFWNEIQHKKLPLSFESTGGLLRKYKFYPGITSRTTYYYVMVVGIEK FT KLYQYYKQFYPYTMFIDWWFIKKIQVLSWYNIKNNILLCNGCRNRKKIVPV FT LQAILSIHHVLVIQEILEIPKEISTDGSLALQNAICQSFNKMTFKEYSAQC FT FQVLLNQNQELPYCYYCHDVAHFLKSIADWKCMEQVNQNVRDFYLRSVGYS FT TQVNKLEKFKDVVRSIVVIANSQLAEEGSEPHKHISKLIKVFKTYEHEIQN FT IEKPAFTDDVPH*" XX SQ Sequence 8458 BP; 2970 A; 1275 C; 1334 G; 2879 T; 0 other; agcaccgtaa agtacgaaaa taacttgcgg gaatcgcaat aaaaaaattt cattaattgc 60 ggaattgtta acaacgttag taactgcaat taaaaaaaaa ctgcgggtgc tctcaatttt 120 ttataaaaag taagaaactt tttagctagc ggcaaaacgt agatataccg ttttcccagc 180 tcgtgccact tgtataacta ctagatcaat gcaaattgat cgcaatttaa ctgcgagtta 240 tgaaaataat tgcggtttta cctaattcgc agtaattgtt gagtaatttg ttgcacaatt 300 atttcaatca tttaggcgcc aaatccgggc tgtcgggcat tccacgttcc gcgtttgcgc 360 agtctgtata gagtgtaggt ggcgtataga cgcttccgta gactctctgc aatcagcacc 420 ataacctaaa attcttcagg tcaaacttca aagtttggat aattcaactc tattaaacga 480 tccaatactt atcaaccttc ataaaatcag ttaactgctt atagcatcct cattgtcaat 540 ggtggaactg gtaagatctt tttttttaaa cttaaattat aacaaataat cataaataat 600 catctttaat tcatatttat aatcaagata ctctgcgttc agaatgaagt ggtcctactg 660 ttttagtaat gacatcaacg catgaattag ctttgaaaaa cgcaaacagt ctccaaagga 720 gtagaaatag caattactag atgtaaagtg catttcatct ctcagttact tcattagtta 780 taaaaagata atagaatctg cttagattag ttaaagatga caaggatgga gtcgtggctc 840 acaactattt ttgttcaagc gcaattgtca taataaatat ttggaaaaat taaaaagtgg 900 cgatcaaatg gacacttcga gtgcttaagc aaatctgtgt gatttgttgc tattagtatt 960 aaaagaagta tcttaaaatt gttactttca tcttcttgct ccgatcttta aacctttagc 1020 acacattgct ataggtaaaa aaagtatttg caagtttgct aagtatttgc ttgttttttt 1080 atagatacct tgttcaactc tgatcagctg tgccaactga atgccctggt gaaccagcag 1140 caactggggt cgagaggaac aatcagcacc aattggagcc cgactttcta cattttaagc 1200 ttactctact tgatgaaaaa atatatagtc caagatggat agtgcagttc tgcttgagtt 1260 ggaatgcctg ttacaagcca gtaacgattt gtgcaagaaa tgtaagtact tgatctattt 1320 gaaattccta atcgttctga tggaatagtt ttaagcatgg caaaatttat gatatataaa 1380 cttgttttta taggagttga tgtacagagc gaagcttgtc aacttttttc aagaaatttt 1440 tacaataact tctacaaaaa ttgtaacaga cgatgtagta aatagttgga aacggaacat 1500 tcatcataac attaatcatg attgccaacg actgttagaa cttgctgaag atgataagga 1560 tggtttcttt taggcattga ttcttttttt ggaacttggc acatttatat ttttattaaa 1620 agtacaaggt agaaaatatt gtgcagaaca agcaatatat gaatgtggca atgtccagca 1680 gagcaagtca tcagtcttcg tttcagataa aaagcatgtt tcaagagttt ttacaagttt 1740 acaaggaata caggagttga aaagtatgaa gtccataaat ttgaataggt tttgaaattt 1800 atttacatat ttcagttatt attgaaataa tctcgtaaag gtctgaaaaa ataatgtttt 1860 aaaattaacc aaaattacat gtaattattt tacaaaaaaa aatgtattta ctctgtataa 1920 acagtttttc ttgaataaaa cagttaacta aaaattagac agagctggtc taatttttag 1980 tttactcttc ttggaatggc gaaatataat taagtcgttt cgttcttccc aaccagcatt 2040 tggcaatttg ataaacgtgt aataatgcat tagtgaatca gtaccaaggt atggtaaatc 2100 gaatgctatt actcttacta aattgtaatc ttcattacac aaataaagat aacctagaat 2160 actatttaaa tagtattgcg tatttttatg tattgtaaaa ttagtctcca attttacaca 2220 taaatattct tttattttgt atctgagtgt tgcttcattt ttacattcat aacattgttt 2280 gctatttttc gaaaagtgta ttttaaaata gttttctcat tcagatattg atgaatcatc 2340 tattgctgat tagtatttac agttcgttga tagtgattcc aagtgctgat taaaccaagt 2400 gcttttaaac attataataa ctggatgata aaacttctac tatagagttg aaaaaacttg 2460 atgtcgtgat agtaatttta tttcttttaa aagttcttgg tttcaaaaga ttataatttt 2520 tattattttt cgttttttgc ttgatttgag ggtttttgat taatcagatc aatttctgga 2580 cttttgttac attttttaat tgattctttt tgggtaagta tactttgact ttgatcatct 2640 aacatattag tagtcaactc atgatcatgt tctttagtta aagaaatgtt caaatcacta 2700 ataactgagt caacaattga tgaaatatct tgtttacaaa agctatctac agataactgt 2760 gaatcactgt tggtagatga gaaactctct gaagtagtac aagaggtgct atcgctgtca 2820 gaatccattt tgttcaatcc cattcagttt tcaaatttaa ataatctaat tcatcactta 2880 ttttaggtac aactcttttt gatggattta ctatatttct tatactttct agattagttt 2940 ctaaatagtc aatatttttt tgtaaataga tatctaccga tggtaaaggc ccactttcct 3000 tttccaaaaa agattcaaaa tcacttttat aatgcttcaa taattcacca aaggattttt 3060 cttagttatt actttcttta aataatattc tggtccacag aacgaattca acacagcatt 3120 aaattaaatt tttggatatg tcttcattgc aatatgcatt cctttggcat tccatacctt 3180 ctgttgcagt atattgtttt gctctcaaaa caatatcaat aatgtatttt aaaatgtgtt 3240 gacatttagt ataatcgata tctaatacct tatttaaaac actgtcggaa tttgagattt 3300 tttcacactc atcttttatg atgtttatat tttctttgct atacaatgga tctatgttgg 3360 attcttttat tcttgaaata ggatctttaa aaaattcatg agtagtatca ctctgaaatt 3420 cactaaaact tagtacaagc acttttaaaa gtatctcttc aaaatcatac aaaattgtct 3480 gaaaactgag caatacaata caatatatat aaaaatattt tacagactgt agggaaattg 3540 cactaaaaca atttcaattg caaatcttgg ctattagaaa tctgatgttt gtataaatta 3600 aacactttgg taagagatct acctgacttt cttttaggta aaataaacat gatgttcaat 3660 attcttcaaa tgatttgagt ccgttaaaaa catgatttgg agttatgtaa taattattac 3720 gatagtctag tactaattta ccaggaatag gtaacgcact tcttcacact aaaatttcat 3780 caaaaaatct gttcaatata gctgtactac tttcttctgt tgacagttgg caaataggaa 3840 catatatatt atctattctt ataccaaaac tcaataaatt tatagattta gattctgttc 3900 catctggaaa atgaaaaggt ttgcaaaaac atcctatatt cccaaacatg aattctttgt 3960 ttttcgatgc actacgagca aactctatgg ctttcataga ccaatgaaaa atattgaatg 4020 gaaaagttgt gacttctaaa ataatagact tgtacatgac attatttctc aaagtcagta 4080 aatctctttc tttattacag ctttggtttt tttgttctgc agaactgatt gctttgacag 4140 tgttgctatt attatcagat gttccacacg ccaccaagct ttgcaaatca aaaagtacat 4200 tatttctatt taggcaacat acgattgaat gttgcgaatt gtcatttctt tttcgcaagg 4260 atattctcta ttcatatatt cacacacttc ttgccacact agatcacctt cactttttac 4320 acgattagaa tcatcaataa tcgcaaattg tcttaaagca ataactctat tttcgtgtaa 4380 tattgcaggc tttggacccc ttttacccat tgtttgtgga tgaaacaaaa tgctcaatag 4440 aaaatatagt acacaggaga agaaagtaaa acaattttgt ctgagtatga gctatatcat 4500 atagcttttt cgtcattttc tgataagtag aaagataagc gcataatgta cataattctc 4560 ccctttgtag attagatgga agagaatatg ctatgtaggt ataccgattc aaatctcatt 4620 cttgttcaca gtgaattttt cgaatatcgc cagtgtcgcc atttattgcc aaacccgtgt 4680 ccaatagcaa tggcaatcat agtattactt ggtatattac tatataccta tatgcgtata 4740 ttacaacaga tatattccaa attggccaca gatctgttgc tatgtgcgga ttcaccctcg 4800 tcatatgacg ttttgatttc tatgaaacag caacgcaacg caacgatccg acatcagaaa 4860 cttgcataca cgcagataaa aataatttca aaacaaccag aagtctcgga aactttttta 4920 atatctgagc taaagaagtg gccggatatt tttaataacg gtggatgact taaaggctat 4980 gctagtcaaa tttggaagga tatacaaaca tctttaaagt taaagatgtt accaaaatca 5040 ttattttttc cgtttataaa aacgaaaaac gcaaagatga gttgaaaagt ttttttaaag 5100 ttcagatgac aaaaaaataa ggttactcca catgattcag attatgcaga acgtgacata 5160 agtaatcctt ataattatga tgaattagat tttatcctag aaataccaaa acatgatgta 5220 atagattgtt tagaaacaaa gaagaagaaa gaagagtgga cagagaaaat aagagaaaaa 5280 attaaaattt taaaaaattt gctatgtcct tttgctttga atggtcatat aacaaaggat 5340 aaatttaaat tttatgggcg ttgtagtgac tgtgagacag gcatatttgg aggaagcaac 5400 aacaatgaag ataatttgag tatacgtgtg aaaacttcta ccacgtatga aattcctcat 5460 tcaaaaaagc ttaagcagaa tggatcaaga aggcgtgaag tcaaaaaagt tctgcgtaat 5520 caaactgcaa gtgattatcg cgaatcagaa cttggaaaaa ttgacactaa ttatgaacca 5580 ccagatttaa atagtacagt tactttaaga aaagttaaag aagaggcact agatgagtcg 5640 attggatata cagaattcaa aaggacgtct ataacaaaca aagtatttgt caacctttgc 5700 aatatcagaa aattatcagt caatcctttt tacaccgtgt atttttctga ttatcaatta 5760 caattttgga atgaaattca acataaaaaa ttacccttat catttgaatc gactggtggt 5820 ttattaagaa aatacaagtt ctatcctggt ataacatcaa gaacaacata ctattatgta 5880 atggttgtcg gaatagaaaa aaaattgtac cagtattaca agcaatttta tccatacacc 5940 atgttttagt tattcaggaa atattggaaa ttccaaagga aatttcaacc gatggctctt 6000 tagcattaca aaatgctatt tgtcaatctt ttaataaaat gacgttcaaa gaatacagtg 6060 cacagtgctt tcaagtatta cttaatcaaa atcaggaatt accatattgc tactattgcc 6120 atgatgtggc acatttttta aaatctattg ctgattggaa atgtatggaa caagttaatc 6180 aaaatgttcg tgatttctat ttaagaagtg ttggatattc gactcaggtc aacaaattgg 6240 aaaaattcaa agatgttgtg aggtctattg tagttattgc taacagtcaa ttagcagaag 6300 agggttctga acctcataaa cacatttcaa agctgatcaa agtttttaaa acttatgaac 6360 acgagattca aaacattgag aaaccagcat ttacagatga cgtgccgcac taatttaaca 6420 tcattcatag acaatctttt tactaaaaca tcatcatcta acattgaaga aacgctgtct 6480 gataaagcat atttttatta ttgcccagat tttatcaaat tatttaagaa tatatgttat 6540 actttcccat cgtggactaa ttgtatgaaa gattattttc aataaccaaa tttagttagt 6600 acattagcaa gatcggaaac atattttaaa tatcagaagg accgaatacc taatccaatt 6660 ataatggaaa aatttatatt acgtgactgt aaaaaactga agtcatcagt taaagtaggc 6720 ttcatgaaat tgaaaaatga taaaaaacaa gaaaaaaagt ggaaatttat ttgacttgaa 6780 gctaaataat ttaattagta ctgtgctaaa taatgagtat aacaaattgt aagtcagatc 6840 gaaactgaag gctatgctag tcaaatttga agaaggaatt gcttgcttct aaggagaaga 6900 gttttttaag tactaaatta aagagtaata ataataataa gcagatgaaa gtaattaagt 6960 ttttactaaa gatttgaaaa ctatattacg tatagttgaa aagaaaccag agaatcttga 7020 gcatttgtat acaatgttca aaaaatatca cactcaaaac aatttgaggt ttggaagctt 7080 tgtatttgat catatagatt tggtatactg tgtggcaatg agaacatttt atcatctggg 7140 tgagccagat ttagttctta aaagcaatac tctttttacc tatcacaatg cgtggtaaag 7200 gtttaaagat tggagcaaga acatgaaagt aacaattttc aaccaaatgt actattataa 7260 tattagaacg aataaaagat acgacatgtt tcacattgaa ttagaaaagt gtgagaaata 7320 tacaaactta aaacatatat tggtctagat tagtaattga ctctaatata aaaaattaca 7380 tcaaaagccg atacttcttt caatactaat agtattatgg ataattctca cctaaaatat 7440 gtataatgag atcataaagg cgtagcaaca aattacacag attttcttaa gcactggaag 7500 tgtccatttg atcgccactt tttaattttt ccaaatattt attatgacaa ttgcgcttga 7560 gcaacaatag ttgtgagcta cgaccccatc cttgtcatct ttaaataatc taagcagatt 7620 ctattatttt ttaataacta atggagtaac tgagagataa aatgcactgc acatccagta 7680 cttcagtctg tataaaatta ttcagtcttc ctggcacggc aattgctatt tctactcctt 7740 tggagactgt ttgcgttttt caaagctaat tcatgcgttg atgtcattac taaaacagta 7800 ggaccacttc attctgaacg cagagtatct tgattataaa tatgaattaa agatgattat 7860 ttatgattat ttgttataat ttaagtttaa aaaaaaaaag atcttaccag ttccaccatt 7920 gacaatgagg atgctataag cagttaactg attttatgaa ggttgataag tattggatcg 7980 tttaatagag ttgaataatc caaactttga agtttgacct gaagaatttt aggttatggt 8040 gctgattgca gagagtctac ggaagcgtct atacgccacc tacactctat acagactgcg 8100 caaacgcgga aaatggaatg ccctacagcc cggatttggc acctaaaaga ttgaaataat 8160 tgtgcaacaa attactcaac agttattgca aatgaggtaa aaccgcaatt attttcataa 8220 ctcgcagtta aattgcgatc gatttgcatt gatctagtag ttatacaagt ggcacgagct 8280 gggaaaacgg tatatctacg ttttgccgct agctaaaaag tttattattt attataaaaa 8340 attgagagca cccgcagttt tttttttaat tgcatttact aacgttgtta acaatttcgc 8400 aattaatgaa atttttttat tgcgatacac gcaagttatt ttcgtacttt acggtgct 8458 // ID Copia-19_DPu-LTR repbase; DNA; INV; 383 BP. XX AC scaffold_98; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_DPu_; KW Copia-19_DPu-LTR; Copia-19_DPu-I. XX NM Copia-19_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-383 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 702-702 (2010). XX DR Genome; scaffold_98; Positions 367084 367466. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 383 BP; 87 A; 90 C; 73 G; 133 T; 0 other; tgttgattta tccaaacatg ttacactgta ctggcaacag atggcgacag gtggcggacg 60 gggtagcttg tatggtgcca agtacgcggc cttctctctc cagctccatg ctgtttagcc 120 tcgtccatct tgttcatgtt ccgtctgcct ggctatactg tcacgttttt gaagtttgtc 180 aagattccat ttctcctgtg tttcagactg tttggtctcc atccacgttc aatattcgtt 240 ttcttctgtg ctcacttata ttatcaatgg taactatatg taataatgta atgcatatta 300 tcctgtcaag ccatcactat gacaatgcat gtacctgtga gaatacagaa gcctcgccta 360 caatttatcc ttgtatatca aca 383 // ID PERERE-7 repbase; DNA; INV; 5014 BP. XX AC BN000798; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 02-JUN-2010 (Rel. 15.07, Last updated, Version 2) XX DE Schistosoma mansoni Perere-7 non-LTR retrotransposon (EST). XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Perere; KW PERERE-7. XX NM PERERE-7. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-5014 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000798; Positions 1 5014. XX FH Key Location/Qualifiers FT CDS 27..1076 FT /product="PERERE-7_1p" FT /translation="RLPQSWCVARRLRKAQIKTSCPILFQFGSIIEANHLL FT KGQSLLRRIPTIKKVRITHDKTAIQRRNSSEVQNNQPSLRGSGSYNVGHSA FT IPKTPYPTPLKNGGTPHTATSPKNKTKHQAAVTIHPKPKHSTTTKCPPTYA FT STVSKHSSCVSNVKPKPKPHINRSNSYNRPNTVMRNNTKFSLNTHKYSRSL FT NTQHAPTHSSRMHYNKHNELAVPPYPNVRLRPTKTTLTHDANKPYSLKHAG FT YRNQHSPHNRNTTPGNNARQSCNNYLKHTTPLTTRNRSRTPHCTQGNHNVD FT LLGDPPIDSKYYQNQTTLYNSLISPNLAHTPPHPFLSLSPSTVLLWGLQML FT RATIPLF" FT CDS 1037..4237 FT /product="PERERE-7_2p" FT /translation="MGSPNVEGDNSTILRPLSIYSDLMGSDTSFSPSITVS FT SLKSSLAEHDHPPNTPPIHLTHPILPNTREHTSFAKLTSDHLSPVNYAGIK FT LNKQERRANTPTFSSSHSKSDKPQGLQHTILLDKHSSFLTLMLINARSIQN FT KMTQLRALLLLHSPSLVLITESWLSSDITDTYLQIDTYELFRCDRVTKKGG FT GCLIYASKNMRAEIFSDHTLSKVPDSTWLTVHTLECKILIGCIYRAPNTPS FT LTDTIISESFAKAASLDFACKVVCGDFNLPTINWKTHSGPPCYEGILRSLD FT IHGWQQHVNLPTRNHNILDLIFCHNVTPASMVVGEKFHNSDHKIVFCTLPI FT LAKRKKLKTLNTCFQYRDYANADWSNFRFLMKHSDWNTFFTSDNLLETLEI FT FNTTTKSALDTTIPSKIAFKTIAPYINQKAKSKLRKLRKTYFTSKDFSALH FT QVYSVLEGVQSHHNKNKQTEERTALTAASKVQNLCRLLAKRMRKNENHNIH FT SLLHDGVTYYDQTVICELLSVWFSQNGNTSNAPDFNIKCISKSHISTINFD FT IRSIHTAIKTLKPNASSGVDDIPSILYKMSGPDIHTLLLKIYTLSLEAGTY FT PEAWKVTYVLPNTIRTEKPGRKLQTYXYHSQYFAYNGKVIQSQLSDHLLKE FT ELIDPTQHGFIRTRSCSTCLIDFFNEVTRIRDQKKLVIILYFDIKKAFDKV FT PHNLLINRLQSVGIINPLLQWIKSFLTNRYQITKLNSTTSTPRPITSGVVQ FT GSVLGPLLFIIYINNICKCFTTGKTYLYADDLKVIYKTDIGDVRSTMQTIQ FT HELNKVDDWCKRWGLELNTEKCGWLCIGNTSLKLKLTLNKNPLLRLTSVTD FT LGVHYSDSLNFSEHISTKASQMRRLLGFILRNFFQKETKIILYKACVRPIV FT EYCSFLSSNLRLSDILKVEGIQRDFTRKILKNDQLTDYKSRCHILGLEPLW FT KRRLRSNLILYFKLLKNLTYSTCNIALYQDSHGYNLRDKVSTVKIEKHRSK FT TRENFFLIKYALIWNSLPSEVRMADKLHTFTRLIDQALTCIDLVQTNTSTS FT HTDIIGSLHI" XX SQ Sequence 5014 BP; 1641 A; 1280 C; 801 G; 1289 T; 3 other; ctaaaaatac tctactacaa gaatraagac tgccgcaatc ttggtgtgta gcaagaagat 60 tacgtaaagc acagattaaa acctcttgtc ccattctwtt tcagtttggt agtatcatag 120 aagccaatca cctccttaaa ggtcagtctt tactgaggcg tattccaacc attaagaaag 180 ttagaatcac ccatgacaag actgctatcc aacgtcgaaa ctccagtgaa gtgcaaaaca 240 atcaacccag cctacgtggt agcggcagtt acaacgttgg ccatagcgct atccctaaaa 300 caccataccc cacccctctc aaaaacggag gcacgccgca tactgctact agtcctaaga 360 acaagactaa acaccaggca gctgtaacca ttcaccctaa acccaaacac tcaactacaa 420 caaaatgccc acccacctat gcatctacag tttcaaagca ctcttcttgt gtctccaatg 480 taaagccaaa acccaagcct cacatcaatc gctctaatag ctataaccgc ccaaacactg 540 taatgcgaaa taatactaaa ttctctctaa acacccataa gtactccaga tctctcaata 600 cacaacatgc acctactcat agtagtcgaa tgcattacaa caagcataat gaactggcag 660 tacccccata tcctaacgtg aggttacgcc cgacaaaaac cacacttaca catgatgcca 720 ataagccata tagcttgaaa catgcaggtt acaggaatca gcacagccca cacaacagga 780 acaccacccc cggcaacaat gccagacaat catgcaataa ttatttgaaa catacaactc 840 ccctaactac ccgtaatagg agtagaacac cccactgcac acaaggcaac cataatgttg 900 acctgcttgg ggacccacct atcgattcta aatactacca gaaccaaacc accctttata 960 actctctgat ctcccctaat ttggcacaca caccacctca tcctttttta tctctctccc 1020 cctcaacagt attgctatgg ggtctccaaa tgttgagggc gacaattcca ctattctgag 1080 accactttct atatactccg acctgatggg gagtgacaca tctttctcac cgtcgataac 1140 tgtctctagt cttaaaagct ccttggcgga acatgatcac cccccgaata caccgcctat 1200 ccatttaact cacccgattc tacccaacac tcgtgaacat acttctttcg ccaaactaac 1260 ctccgaccac ctttctcccg ttaactatgc tggtattaaa ctaaacaaac aagaacgtag 1320 agccaatacc cccacttttt cctcttctca tagtaaaagc gacaaaccac agggtttaca 1380 gcacaccatt ttattagaca aacactcatc cttcctgaca ctaatgttaa ttaatgcccg 1440 atctattcag aataagatga ctcagctacg cgccctatta cttttacaca gcccctcact 1500 tgttttgata acggaaagtt ggttgtcctc cgatatcaca gacacgtacc tccaaataga 1560 cacgtatgag ctcttccgct gtgatcgagt caccaagaag ggtggcggtt gtctcatata 1620 tgcatcaaaa aacatgaggg cggaaatctt tagtgatcat accctcagta aagtaccaga 1680 ttcgacgtgg cttactgtac acacccttga atgtaaaata ctaatcggat gtatatatcg 1740 agctccgaat acccccagtc tgactgacac cattataagt gagtcattcg ctaaggctgc 1800 ctcgcttgac ttcgcctgca aagttgtctg tggtgatttc aaccttccaa ccatcaactg 1860 gaagacacat tctggcccac cgtgctacga aggaatacta aggtccttag acatacacgg 1920 ttggcagcaa catgttaacc tgcctacccg aaatcacaac attcttgact taatattctg 1980 tcacaatgtc acaccagcat caatggtggt tggcgagaag ttccataata gcgatcataa 2040 aatagtcttc tgcacccttc cgattcttgc caagcgcaaa aaattaaaga cattaaatac 2100 gtgtttccaa tatagagact atgcaaacgc cgactggagt aactttcggt ttcttatgaa 2160 gcattctgac tggaacactt tcttcaccag tgataacctt ttagaaacac tggaaatatt 2220 caatactacc accaaatctg cactagacac taccatccct tccaaaattg ctttcaaaac 2280 aattgctccc tacatcaacc aaaaggctaa gtctaagcta cgaaaactaa gaaagacgta 2340 ctttacatca aaagacttta gtgccctgca ccaggtatac tctgtacttg aaggagtgca 2400 aagtcatcac aacaaaaaca aacagacaga agaacgtact gcgctcacag ctgcatctaa 2460 agtccaaaac ttatgtcggc tccttgcaaa acggatgaga aagaacgaaa atcacaacat 2520 tcactcttta ctgcacgatg gcgtgacgta ctatgaccaa accgtcatat gcgaactgct 2580 aagtgtatgg ttttcacaaa acgggaatac atctaatgcc ccagatttta acataaaatg 2640 cataagcaaa agccatatta gcaccataaa ctttgacatt aggagtatac ataccgccat 2700 caaaaccctt aagccgaatg cgagctctgg tgtcgatgat ataccatcaa ttctctataa 2760 aatgtcgggg ccggatatac atacgttatt gctcaaaata tacacattat cattagaggc 2820 gggtacatat cctgaagcct ggaaggtaac gtatgttctt cccaacacaa tcaggaccga 2880 gaaacctggt cggaaactac agacctataa ntatcactcc cagtatttcg cgtataatgg 2940 aaaagtgata caaagtcaac tatcagatca cctacttaag gaagaactaa tagacccaac 3000 acaacatggt ttcattagaa caagatcatg tagtacatgt ctgatagact tctttaacga 3060 ggttactcgc atacgtgacc agaaaaagct agtgattata ctttatttcg atataaagaa 3120 agcctttgac aaagtacccc ataatcttct aatcaacaga ctccagtcag ttggaattat 3180 aaacccccta ttgcaatgga ttaagtcttt cctcacaaac agataccaaa taaccaagct 3240 taactctaca acatccacac ctcgaccaat aaccagtggt gttgtgcagg gtagtgtctt 3300 gggtccattg ctctttataa tttatatcaa caatatatgc aagtgcttta ccacaggtaa 3360 gacctacctt tacgctgatg acttaaaagt catctataaa actgacattg gtgatgtacg 3420 aagtactatg caaacgatcc aacatgaact caacaaagta gacgactggt gcaaacgctg 3480 ggggctagag ctcaacacag agaaatgcgg ttggctatgt attggtaaca cgtcacttaa 3540 actaaaacta acacttaaca aaaacccact gctcagactg acatcagtaa ctgaccttgg 3600 ggtacattac tccgacagtc ttaacttctc tgaacatatt tcaactaaag catctcaaat 3660 gcggagattg ctcggcttca tccttcgtaa tttctttcaa aaggaaacga aaatcatcct 3720 gtataaagcc tgcgtaagac ccatcgtcga atactgctcg ttcttatctt ccaacctacg 3780 cctatctgat atacttaagg tggaaggaat acagcgagac ttcactcgca aaatacttaa 3840 gaacgaccaa ctcactgact acaaatccag atgccatatc ttaggactag aacccctctg 3900 gaaaagaagg cttaggtcta acttaattct ttatttcaaa ctccttaaaa accttactta 3960 ttccacatgt aatatagctc tctaccaaga ctcacatgga tacaaccttc gagacaaagt 4020 atccactgtc aaaattgaaa aacacagatc gaaaacacgt gaaaacttct tccttatcaa 4080 gtacgcattg atatggaaca gtctgccttc cgaagtacgc atggcagaca aattacatac 4140 gtttacaaga ctcattgatc aagccctcac ctgcatagat ctggtgcaaa cgaacacatc 4200 cacgtcccac acagacatca taggctctct acatatatag actgaatcgc cttatttccc 4260 taccttgtag ttgtattaga tactttcccc tcactaattc ttttcaattg aagtagacag 4320 gcaactcact ggacctatca atactctgtc gcttaacgac gcttataaat accctaaaac 4380 gtcagaagaa acgcacaatc gactgcctac tgccagtggt cgtacaacca cagaacctgt 4440 cacccatcaa ccggttagca tagaacaaaa caacctcagc gcgaatttag tgtgagttac 4500 ttcttgtcta tttatcagct ttttactcat ctgtacacta tgttgccttg ccattaatat 4560 tatttcacca ctcgttatct taccttatct gtaactaata taaatctcca cttttccgcc 4620 tactatggta aacaaatatc ctacactata cccttcctaa tgcctatact ctgattcgta 4680 acctacactt cattatatag tcaataaaaa cattgcatcg aagccttacc cacagtccat 4740 aatttgtaac tgtctgataa gcttgtaaca tggtacttat agaaatagtg ttttccaaca 4800 ggatgtgaaa cacagagaat tgtgtgaggt atgatatgta tccaaaaaat aagcataaat 4860 gagcctaaca tcacaaaagg agggagggtg gagttactga ccagcaagca ctgatggtgt 4920 gggcgatgat ttcaatccac ccgtgtttgt ggggcagaac attttgtttg taccaggctc 4980 tttatggatg agaataagtc aatagatctg tgat 5014 // ID BEL-10_CQ-LTR repbase; DNA; INV; 236 BP. XX AC AAWU01000654; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_CQ_; KW BEL-10_CQ-I; BEL-10_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-236 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 174-174 (2011). XX DR GenBank; AAWU01000654; Positions 73011 72776. XX SQ Sequence 236 BP; 72 A; 63 C; 41 G; 60 T; 0 other; tgtttgggta gaacccacgc agtgtgtaga cttttgatta aaccccctta accgtgtaca 60 gtccacttcc caacacaaca tcacacacac actgacacag caaagggccc agtaaatgtc 120 acttccccaa ataaacttcc gttaggagcg atcattctcg aacgagcaga ataaaaagca 180 acacgttttg ttagagttcg tcgcgacttt tatttatttc gtttcgaccg caaaca 236 // ID BEL-642_AA-LTR repbase; DNA; INV; 221 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-642_AA_; KW Pao_Bel_Ele132; BEL-642_AA-I; BEL-642_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-221 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 221 BP; 66 A; 48 C; 45 G; 57 T; 5 other; tgaaaaacat cgcagctggt cagtagtaag gtcagcaaat ttgtatgcta tcagtctcaa 60 ataaaccatc aaacgataca gwccgttatt caaccacacg tcgcgagtcg caatttgtaa 120 tmttgtgaag gcaaattaag wccaataaat tacgwtccgt tcgtcaaagt acgttcgcgg 180 gtgttgcawg ttctctacgc gtgatccgac ggctaaattc a 221 // ID SACI-6 repbase; DNA; INV; 4236 BP. XX AC BN000804; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni Saci-6 LTR retrotransposon (EST). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW gag-pol polyprotein; SACI-6. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4236 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000804; Positions 1 4236. XX FH Key Location/Qualifiers FT CDS 328..3930 FT /product="BN000804_1p" FT /translation="METRSKRQKENDELDGKISRAVCGMYDVNDDKCDDIT FT LNSMSSSQLSTSRLKLDKALLRKKNLKKRLELERQLKMLDLQEEVDMAESE FT CNVIKDDERLPVDKCEINAKVENYLNGETGGNNHSGEVCNISKIDWVDELR FT DTLHTLVGNMALPKIDMMYFDGQPGQYYCFISQFNSLIESKLSDKGQLLSY FT LLYYRKGKARTAIESCISLPSHLGYDRAKRILYDLFGKEHLVARELIAELL FT NHKSVGRSADVLTDFAIKLRNVCITLKEMGYMSDVNSTANLEIIVSCLPLG FT LQNKWAEVADKIMMHGKEPSFEEFVAFVEERARIARTRYGRLVQCNSRFVK FT GNSEGQADRSRFHAIRRDPNDSVKVSSCAICLGDHEATDCPRLAKMSVRER FT RQEIRRRGLCYLCLRKGHIAMSCNSGFKCDVENCKVRHNSLLHIDGTDNYV FT MNLAKDGNSSRVCLGIVPVRLYGPKGCLETYALLDSGSDTSLVCEELINQL FT GIEGKETSIRVATVNGTTNCECLEVNLEVFSLDERGSIRINKVYTTKKLPI FT DHAAPLTELQLKRWKHLKDITLPRLQSNFVGILIGCDAPDAHWVLEQRLGD FT RKHPFAVRTHLGWMIVGPKGAPRSLYQVQWCHCSNDILRDIERLYDHEFED FT TDTFRNGYSVEDKRALEIVSSSFKLEGGHSQVGLPWKYDRPSLPNNLELAE FT RRLECLRKRFMKDNSLLQKYQAVMNKQLSKSYIIEASKEGFDRDAVCWYIP FT HHPVINPKKPGKLRIVFDCAAVYQGFSLNDQLLRGPNSVNSLLGVLLRFRL FT GNVALAADIEEMFLQVRIPRQDRGAFRLLWWEDGDIKRTAKARVSLLKVQT FT IPRLELTAAVLAARMGSQLQSELDIKFAEVKFWTDSTIVLHYIRNEKSQFK FT TFIANRISTIHSLTKVDQWRFVPSKENIADFASRGVKFNTDDVKVWEEGPR FT FLKKPKECWPAVNIQGPEPHLLELKKAMSTNVMVEESTVGIDWHFSPPAAS FT HWGGVWERMIRSVRRVLGALVKEQPLTDECLETFMIEAERIINSRPLVPVT FT DDSSDLDAITPAKLLLLRENVTELTNVLSNDRYSKRWKQANYLAQVFWRRW FT SKEYVSLLQRRYKWTQLERNIREGDLVMICSEFSEKNKWPLGLVQRVLPSK FT NGLVRQVELRTRKGILVRDIRKLCLLEAIDGRYLLGSLGVLV" XX SQ Sequence 4236 BP; 1257 A; 686 C; 1043 G; 1249 T; 1 other; cttttcggac attttggttt cttttgtcat tggtttggta tttgggagtt tttgctccgg 60 gaacctacaa aattgaagaa cgactttggt cgattatttg tgaattgcga ataaacgtgg 120 tatcttattt ggagttttgg ttctcttgcc tacttgggtt cataatttgc gaatatcgat 180 ggccagtact tcaatagctg gcacggacat acactcataa attggacgta acataaagtt 240 cttcaattac tggttctttg ntcacactgg attagtctgg acagtaattt gggtccaaat 300 tctggttcta ctgaacgaag gtgtactatg gaaactcgaa gcaaaagaca gaaggaaaat 360 gacgaattgg acggtaaaat ttccagagct gtttgtggta tgtatgacgt taacgatgat 420 aaatgtgatg atattactct taatagtatg tcctctagcc agctaagtac ttctagatta 480 aagttagaca aagcattact gaggaaaaag aatttaaaaa agagacttga gttggaacgg 540 caacttaaga tgttagatct tcaagaagag gttgatatgg ccgaatcaga atgcaacgtc 600 atcaaggacg atgaacgatt gcccgtagat aaatgtgaaa taaatgctaa agtggaaaat 660 tatttgaatg gtgaaacggg gggtaataat cactcaggag aggtatgtaa catttcaaaa 720 atcgattggg ttgatgaact gagggacact ttacatacat tggtcggtaa catggcttta 780 cctaagatcg atatgatgta ctttgatgga caaccgggtc agtattactg tttcattagt 840 caattcaata gtcttataga gagtaaactg tcagataagg gccaattact atcgtatttg 900 ttatattatc gcaaaggaaa ggccagaacg gcaattgaat cttgcatttc tttgcccagc 960 catttaggct atgatagggc taaacgaatt ttatatgatt tatttggtaa agagcacctt 1020 gttgcgcggg aactaattgc tgagttatta aaccataaat ctgtcggaag gtcagctgac 1080 gtattaactg actttgccat taagttacgt aatgtatgta taacgttgaa ggaaatggga 1140 tacatgtctg acgttaattc tacggctaat ttggagataa tagtttcatg tctgccacta 1200 ggattgcaaa ataagtgggc tgaagtcgct gataagatca tgatgcatgg gaaggagccg 1260 agcttcgaag aatttgtggc ctttgtagaa gagagggcca gaattgcccg aacccgttat 1320 ggtagattag tacagtgtaa ctcaaggttc gttaaaggga attctgaagg ccaagctgat 1380 agatcgcgct ttcatgcaat cagacgagac ccaaatgact cagtcaaggt atcaagttgt 1440 gcaatttgct taggtgatca tgaggcgaca gattgtccaa ggttggcaaa aatgagtgta 1500 agagaaagga ggcaggagat aagaagacgt ggtctatgtt acttatgttt gaggaaaggt 1560 cacatagcta tgtcatgcaa ctcaggcttc aagtgcgacg tcgagaactg caaggttaga 1620 cataattcat tattgcatat tgatggtact gataactatg tgatgaacct agctaaggat 1680 gggaactcct caagggtttg tttgggcatt gttccagtca gactatatgg tcccaaagga 1740 tgcttggaaa catatgcact tctggatagt ggttcagaca cttctcttgt atgtgaagaa 1800 ttaattaatc agttgggaat cgaaggtaaa gagacttcga taagagtggc gactgtgaac 1860 ggaactacca attgtgaatg tttggaggta aatttagagg tattttcatt agatgaacgg 1920 ggttctataa ggatcaacaa agtttacacg accaagaaac ttccgatcga tcatgcagca 1980 cctttaaccg aactccaact gaaaaggtgg aaacatttaa aggacataac ccttccgagg 2040 ttgcaaagta attttgtagg gatattgatt gggtgtgatg ctccggatgc acattgggtc 2100 ttggaacaac gtctggggga caggaagcat ccgttcgctg tgcgaaccca tctcggttgg 2160 atgattgtag gtcccaaggg ggcaccaagg tctctatatc aagttcagtg gtgccattgt 2220 tctaatgata ttttgcgaga tatagaaaga ttatatgatc atgagttcga ggatacagat 2280 acgtttcgta atggatattc tgtcgaagat aaaagagctc tagaaatagt tagtagttcc 2340 tttaaactgg aaggcggcca ttctcaagtt ggtctaccgt ggaagtatga taggccaagt 2400 ctaccgaata atttggaatt ggctgaacgc agactagagt gtttaaggaa aaggtttatg 2460 aaagataata gtcttctgca gaaatatcaa gctgtgatga ataaacagtt aagtaagagc 2520 tacatcattg aagctagcaa ggagggattt gaccgtgatg ctgtttgttg gtatattcct 2580 catcatcccg ttatcaaccc taaaaagcct ggaaaactca gaattgtttt tgattgtgca 2640 gctgtctatc aaggtttttc tcttaatgat cagcttttaa gaggaccaaa ttccgtcaat 2700 agtttactgg gtgtacttct acgattcaga ttaggtaacg tagcattagc cgctgatatc 2760 gaagagatgt ttcttcaagt aaggatcccg agacaagata ggggagcgtt tcgtctatta 2820 tggtgggaag atggtgatat aaaaagaacg gctaaggcaa gagtatccct gttgaaggtc 2880 caaactatac cacgcttgga actcacggca gcagttttag cagctcgcat gggatcccag 2940 ctgcagtcag aattagatat taagtttgca gaggttaaat tttggacaga ttccacgatt 3000 gtcttgcact atattagaaa tgagaaaagc cagtttaaaa cattcatagc aaatcggatt 3060 tcgactattc atagtctcac taaggtggac caatggagat tcgtgccttc taaagaaaac 3120 atagcagact tcgcatccag aggggttaag tttaacacag atgatgtcaa ggtatgggag 3180 gagggtccac gctttctcaa gaagccgaag gagtgttggc ctgctgttaa tatacaaggt 3240 cctgaacccc atctcttgga attaaagaaa gcaatgtcca cgaatgtgat ggttgaagaa 3300 tccactgttg ggattgactg gcatttcagt cctccggctg ccagtcattg gggaggagta 3360 tgggagcgca tgattcgctc tgtacgtaga gtattgggtg ctcttgttaa ggaacaacct 3420 ttaacagatg aatgccttga aactttcatg atagaggctg agcgtataat taacagtcga 3480 cctttggtcc cggttacaga cgactcgagt gatctcgatg caataacacc agcaaaactg 3540 ttactattga gagagaacgt tacagaactt actaacgttc tatctaatga taggtattct 3600 aagagatgga agcaagcgaa ttacttagca caagtttttt ggagacgttg gtctaaggaa 3660 tatgtttcgc ttttgcagcg aagatacaag tggacccaac tcgagaggaa cataagggag 3720 ggtgacttag taatgatatg ttctgagttc tccgagaaaa acaaatggcc cttaggtcta 3780 gtacagagag tgttaccaag taagaatgga ttagtgaggc aggtagagct gaggacaaga 3840 aaggggatcc tagtgagaga tattagaaaa ctgtgtcttc ttgaagctat agatggtagg 3900 tatctactcg gatccctagg cgtactggtg taagtcctag ggatcacgag gttgttagtg 3960 tgttgaatgc ttgttgttgg ttgtttgtga tgtatatact tcgcttctgc tacctattta 4020 ctcgtctgtg aaaagaacca aggttctttt ggtcgggagt gttacgggag aactgaagac 4080 atttggcccg gttaaaacct gtttactgta cagcaattta ctgtgttttt ggatgtgttt 4140 gttttactta acctttggac ttgtttgtta tatcattctt ttcggacatt ctggtttctt 4200 ttgtcattgg tttggtattt gggagtttct gctaca 4236 // ID Gypsy-203_AA-LTR repbase; DNA; INV; 192 BP. XX AC supercont1.83; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-203_AA_; KW Gypsy-203_AA-I; Gypsy-203_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.83; Positions 1399557 1399366. XX SQ Sequence 192 BP; 65 A; 50 C; 24 G; 53 T; 0 other; tgtgacgaag atgacttgta tttgttaaaa gtgtattcca taacacctgc atattggtaa 60 accttgcttt tatacgttca tcattcacac acacacacac acacactcac acacactgat 120 acacgacata gagagaaccc aataaatcat tctgtactga gactccgtcg actccattac 180 tctaatatca ca 192 // ID DNA-1_AAe repbase; DNA; INV; 5697 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5697 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1256-1256 (2011). XX DR [2] (Consensus) XX CC ~96% identical to consensus. TIRs are 700-1000 bp long. CC Terminnal TTAA can be TSDs. XX SQ Sequence 5697 BP; 1731 A; 1059 C; 1113 G; 1790 T; 4 other; ttaaccttcc gtcagtcgct caaaaaaaag ttacaccagc ggtcgcatcg tgtactgagt 60 acacgcggag caattttgat tgtcaatcta aagctaggca agatagagta ttgatgtctt 120 cggcaaattt cttcagtkca acggtaccaa tcggtcagtg gacaaatgaa actttgaaaa 180 catcctcacc cttcagtggc catcggaatg tgtcccgggg gaaccgatga agtggacatt 240 tcgcaatgca acatctcgat cacaaatatc tcaggatcta gattttttag caagatggtg 300 tcttcggcaa agttgttcag taaatcaagg actaacatgt gataggctgc ttgtttcgga 360 attctgtcac cagttggcgc tagtgagcat gtaatttttc aaacacggat atctcaggat 420 cctgattacc tagaaagatg cggtcttcga caaagttgtt tagtaggtca aggactatca 480 tatcataggc catctaactt gaaattctgc catcagatgg cgctagtgag catgcaatat 540 ttcaatcacg aatatctcat gatcccgatt ttttagaaat atggtggttt tggcaatgtt 600 gttcgatagt tcaaggagta atatgtgatg atctatttaa ttcggaattc tttcaccaga 660 tgacgctagt gagtatgaca ttttttgaac acagatatct caggatcccg attacctaga 720 aagttgacgt cttcggcaaa agtattgagt acgtcaaggt ctagcatgtg cgctagtgag 780 catgctgtat attgatcgcg gctatctcag gatagcagta ccataccgaa ataccgtaag 840 gtgggtccgg ggagataagg gtctgaggcc aagggtaata tcgatgcact gtacaccact 900 tgagcaattc aacaagaatt gctcaagtac agtgcatcgg ctggttttag tgggtcgtcg 960 ttggtgcggc aaccccatac tgcctgagtt accttctcag gcgtctgttt gcagatttcc 1020 ctattgtaaa caaattctca ggatagcgat attatagcaa gatggtgtct tcagcaaagc 1080 tgtttagtag atcaaggacg cagatatctt tggatcctga ttcctaaaaa aatgatcctg 1140 ggatatcctc attactaaga aatattacat attcggcaaa gttgttcatt agatagaagc 1200 ctaaatggga aggacctcaa gttttaaaaa tcatccaaca ggtggcgcta gtgagcatgc 1260 atatttatga ttaattatat ctgaagatcc cgattacatc aaaagataaa gttttcggat 1320 aaacggttca ctaaatcaag tacaaatttg tgatgggcct cacttggttc ggaattcttc 1380 cgctagatgg tgatactgag takgctgtat ttccaaccca aatatctagg atcgcgattg 1440 gataatcatg aaaatgatgt cttcaacaaa gttgtttatt ctgtccaagg ctaaagtatg 1500 tttaacatgt tataatccgt ttgattcgaa actatgctac caaacggcgc tattcagcat 1560 gaaaattcca taagtaaata cttcagaatc ttaactacak tcaaatttcc agtagatatt 1620 taagggatca tcgactaatg gaaatattga gtcatggaac agatattctt tggaaagctg 1680 attgaagggt gcatcatagt aaccatgatt tttgtttttg tttcaagtat ggtttcatga 1740 gtcgatatcg agctatggaa catcgattca tggaggtttg acagtattta gaaatatggc 1800 gttttctgtt ctctttacaa attgtgttgg atttttcttg aaacaaattt ttataaaatt 1860 cgtcctcctt ttcattttaa catatttttt ttatagaatc actttccgag ttgagtcttc 1920 gttgagtcta gtacatgata ttgaagacgg gcttacaatg gagatcaaag tacgcgtatc 1980 tgtcmaagga tacacactct agtggaatta aacgatacag tactaaattc ggctttcatt 2040 cacttcaagg tattctacta cactacttga aatttaagat tatcaagaaa gtagggttta 2100 tccacagaat ttgtttatat agagaagcgc gaatactgga atcaaagaac tgtcaaacgc 2160 ggttccaatc gagcaatagg ccttttcacg agatgtttca aataaactga ttcggaaggt 2220 ttttgacagt cttgtatgga aatgacgatc ccgtgtatgc accagtcctc caccgataga 2280 aacgaatgga aacacggacc tgtctcatga aaaggcctgt ctaatgggcg gcagaaagct 2340 gttacacaaa aacaattggg tgtgagttag tgcaaagcga tatgaaagtg acgtcaccgg 2400 ctagctcggt ctattatggc aactcacagg atgacacgcg cactacaaca ttgctcgaat 2460 cacctcgaat gtgatagaag agtgcatggt gacgtcacga ataattcccc ttctctgtcc 2520 ctctctcatc acgctcaatt gactgtcctg ctctttgaca ttcactgctg gaattgtttc 2580 gatttttttt atatggaatt gacgttcact gctggaattg ttgacatatt ttttatatgg 2640 agttttgagg ttatgtcagg cgtgcaacga tctatagtta ttgcctgtgt tgtttataaa 2700 taaattcaaa atgctctcag cgcttctcta gtataatcat attctctggg tttatcatta 2760 tagtcaacgc tccatgattg ctctgataat tgaagcgatg atcgactcat ggtaatatcg 2820 cgttatgtac agtaatacct tggaaagctg ttgttgggga taattacagt gactcatgga 2880 ggtttcacaa tataccagtg cagagaaagt tcaatgttat ttatgttgtt ataaaccgaa 2940 gcatgtcgaa attaaacgat cttttcatga ctcatttttg tagtcatatt attcaaacaa 3000 taaaatagaa aatcatcctt caacattgat tttattatac actttgttct ttattgatgc 3060 tatatcataa caaatttagg aaacactttg gcatggtgtc gaaaacgaat cttaccaaaa 3120 tcgaacagta attatgttgt gaaaaattcg gttagccgac atctcggtag attatgttcc 3180 gagattcgat atttaaaatt accgaattct cagctcttgg gttcttggcg aaatgtttcg 3240 ctgaggtcgg agacctaata taagtgtgta tggatcttca ttcacctgta tatcgcgtag 3300 atatagtgcg tcttatgtga ggagaactgc tttgcgtttt acgaagtctt ttcacgcaaa 3360 cataatcaat gtattcatca gaagttcgat taataggcta aatataaaaa taagattgat 3420 caatttgaat ctagagtttg aacgataatg aaccttatat ctctcgtttt attttgtgct 3480 tgcctatcaa ttcactggtg gcgaaatatt ttaacatttg acaaaacgtg cactgatagc 3540 ccaaatgatc agaatcataa gatgttcaga aaataaaatg ctcactttcg ccgggttaca 3600 taattccaaa tataattgcc catagcatgt cagcactttg cttactgaat aactttgaag 3660 aagacgtcaa ctttcaaaac gatcgggaac tcgagattct gcattagaaa aatttgaatt 3720 aggacctcta gcgtagcctg atggaagtac atctagcatc ccactgcatt ttaaattgaa 3780 tgcggaagaa ttttgagtat attttacccc ttacacgttg gcccgtgata tacagaataa 3840 cttgatcaaa gacgccatct ttctaatcta attttatttt caaaatggaa atataaatat 3900 gtttgaaaat ccaatctccc gcatcttcct tatcatcctt accataaaaa tagtgttctg 3960 tgaaattttt agctttttcg gtggtgattt aaaggtggcc caaagacaat gtaggtttat 4020 atggaaatta ctatggaaaa tttttgagaa atgttccaga cacgttagta ctgtaatgta 4080 agtacaaatt tatcatccca tagtgaaaac tcattcttca aaccctaatg aaggatgttg 4140 ctgaagaaac aaatcccttt tgagctaaat tgtttgggat tttctttttt atgtttgcag 4200 ggctataatc ccgtatgaag ctaaataaaa gcaaacaaac tcatgaatat ctcttctcct 4260 atccgttgga atgaatttat ctcttcagca aattacttta ttatagtttg aagaatgagt 4320 ttttactatg tgatgataag tttgtactta caatctagta ctaacgtgtt tggaacattt 4380 tttaaaattt ctccatagta atttgcatat aaacctacat cgtctttggg ccacctttta 4440 ataaccacct agaaaactga aaatttcaca gaacattatt tttatggtaa ttatagtaac 4500 gaacatatgc gagattggat tttcaaactt ttttaaatta tgaaccgctc taataatcac 4560 taacaccgtg ttgtggaaga attgcgaatc aaacggcaca tcacatgttt agtccttgat 4620 ctactttaca gcttgccccg atcaaaatac agcatgctca caagcgccat ctggtagtag 4680 aatttcaagt tgggtggcct atcacatggc atatcaccta ctcaacactt ttgttgaaga 4740 cgtcaacttt ctaggtaatc gggatcctga gatatctgtg ttcaaaaaat gtcatactca 4800 ctagcgccat ctggtgaaag aattccgaat taaatagatc atcacatatt agtccttgaa 4860 ctatcgaaca actttgccga aaacaccata tttctaaaaa atcgggatcc tgagatattc 4920 gtgattgaaa tattgcatgc tcactagcgc catctgatgg cagaatttca agttagatgg 4980 cctataatat gatagtcctt gacctactaa acaactttgt cgaagaccgc atctttctag 5040 gtaatcagga tcctgagata tccgtgtttg aaaaattaca tgctcactag cgccaactgg 5100 tgacagaatt ccgaaacaag cagcctatca catgttagtc cttgatctac tgaacaactt 5160 tgccgaagac accatcttgc taaaaaatct agatcctgag atatttgtgt tcgagatgtt 5220 gcattgcgaa atgtccactt catcggttcc cccgggacac attccgatgg ccactgaagg 5280 gtgaggatgt tttcaaagtt tcatttgtcc actgaccgat tggtaccgct gaactgaaga 5340 aatttgccga agacatcaat attctatctt gcctagtttt agatttataa tcaaaatgct 5400 ccgcgtgtac tcagtacacg atgcgaccgc tcgtgtgacg ggcccggtaa aaaaaagtta 5460 cacgagcggt cgcactggtg tactgagtac acgcgtaaaa acttaggtta gaaacacgaa 5520 gttttcgaat acttttcgtt gattttttcg aaaaatttct tccctacaag caagaagaag 5580 agtagatctt ggaataggac caacacccgg atcaatttga aggggttttc aacgtgtttt 5640 tcgagttttc gcaaagtgcg tgtactcagt acactagggc gaccgacggt gggttaa 5697 // ID P-2_HM repbase; DNA; INV; 5510 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5510 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 348-348 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1320..2993 FT /product="P-2_HM_1p" FT /translation="MQEEIQKNSLQVTSDLDDDMKNLMSNAFDLHSVSPFM FT KFFWNEQQKYFLSNPTSVRYHPMIIRYCLSLCAKSPSAYEDIRFDKKSGTG FT FLVLPSRRRLRDYKNCIHPQRGFNPDIVKELKSKVKDFSEIEKYIVLLFDE FT IKIQENLVWDKHTGEIIGFVDLGDIDLNYATLQKIDAVATHVLVFMIRSIV FT NPFKFTLANFATLGISSAQIFPLFWKAVAICELECKLKVIATTCDGAASNR FT KFFKMHSRLSNSNVKKDVVVTYKTINLYCPERYIYFIADPPHLLKTTRNCI FT YSSGSGKCTRYMWNDGLYILWSHIADIFKEDQICGLHLLPKLSYDHIKLSS FT YSVMNVKLAAQVLSTTVSKVLLNYGSPDTFATANFCSMIDSFFDIMNISNP FT YESIQKSKPFLVPFSSLDDNRLNWLKAEFLPYFEKWLSSINTRQGKFTLND FT KGKMFISWQTYEGIKITVNSAVELITFLLSNGVSYVLTERFCQDPLENYFG FT HQRQIGSRKDNPSVRDFGYNDNTIRNQKVFRPIKDGNSHEDIHFEINEEPV FT PCRKRVKTSLQ" XX SQ Sequence 5510 BP; 1916 A; 900 C; 797 G; 1895 T; 2 other; catagtcata aaaataaata gtctggaaac acgggttgaa aacaataggt tttgcgttga 60 tgttaggttc ttgtacccag cctactttaa ggtgcaaaac aaacaattct ttagagatta 120 agaataaata caatgcctgg agcaaattgt tcagtatatg gttgtcatac ctctcgttat 180 tcacatggaa tagctatatt cagtcttcct aaaggtcagg atgactttac cgttcaatgg 240 agaaaaaaat tgagcgagat tataacaaaa gatcgtttaa ttgattcagc tttaaggaag 300 caaattgcca gtgaatcttt gcatatttgt gagagacact ttaaagaaga tgaagttaat 360 agaagtaagc ttaactactg tgcttatata tatatatata tatatatata tatatatata 420 tatatatata tatatatata tatatatata tatatataga ttcaactaga gcaagactta 480 taccaggaac tttaccttca aaaaatttac caataaagag tgttcaaaca accaggtaaa 540 aattttcaaa ataatataag cttagttttt ctctattttt ttatttattt tttttatttt 600 tgttttttag tttattaaaa aagacaaggc tttcgacact aagtattacg tcaaaaaaag 660 attgctgtga attaggcaaa gttcaaaatc atatacatta tttttataaa tcattttaag 720 agtttgtaga tcgagtgagc atactaaaat tggaatcatg ggaacttgtt atatcgaaag 780 atttagtcaa agtttttaaa gtagacaaca ttcacctaca accccaatat gaagtatatg 840 ttcaacctag ccttatattt acaatacgat cttttgcttg gaacttacca gatacgcatg 900 atatttattt gaattataaa caatcagtta aaaacattac tttatcaagt attattgctc 960 ttttatcaac ttactcattg tgtcagggtg ttgcatcagg ttcaaactgt tttaaaaaac 1020 atattcttac aaaattgtac aatccaagtg ttaaatcaat yactacattt gagagtgaat 1080 ttaataggtc aactttgtgt tttttattga taaattctgg caatatttgt aatgaatgta 1140 aaactttgga attgtcacaa aatgctaaac tacaaaaagc tgtaaaacgt aaagctgaaa 1200 cgctgacaaa acctgctcat ataaatgccc ccatctcatt aacttcacct gatagaatta 1260 aacttacctt gcaaaactat cgatctgaaa atgctttttt aaaacaagaa ataaaaagta 1320 tgcaagaaga aattcaaaaa aattcattac aagtaacctc tgaccttgat gatgatatga 1380 aaaatctaat gtcaaatgct tttgatttac atagtgtttc accatttatg aagttttttt 1440 ggaatgagca acaaaaatac tttttatcaa atccaacatc tgtgcgatat catcctatga 1500 ttataaggta ttgcctaagt ctttgtgcaa agtctcctag tgcatatgag gatattcgct 1560 ttgataaaaa gtctggtact ggttttcttg tattaccaag ccgtcgtcgt cttcgtgact 1620 acaagaattg tattcatcct caacgtggtt ttaatcctga tattgtaaaa gagttaaaat 1680 caaaagtaaa agattttagt gaaatagaaa agtatattgt attgttgttt gatgaaatta 1740 aaattcaaga aaatcttgta tgggataaac acacaggaga aattatagga tttgttgacc 1800 ttggtgatat tgatttaaac tatgcaactt tacaaaaaat tgatgcagta gcaactcatg 1860 ttctagtgtt tatgatacgc agtatagtta acccttttaa gtttacacta gctaattttg 1920 ccacacttgg aatatcatct gcccaaattt ttccattatt ttggaaagct gttgctatat 1980 gtgaactgga atgcaagtta aaagttatag caacaacttg cgatggtgct gcttcgaaca 2040 gaaaattttt taaaatgcat tcacgcttat ctaattcgaa tgttaaaaag gatgtagttg 2100 tcacatataa aactataaat ttatactgtc ctgaaaggta tatttacttt atagctgacc 2160 ctccacatct tctaaaaaca acaagaaact gtatttacag ctcaggctct ggtaaatgta 2220 ctagatacat gtggaatgat ggtttataca ttctatggag tcacatcgca gacattttta 2280 aagaagatca aatatgtggt ttacatctac tcccgaaatt atcgtatgac cacattaaat 2340 tgtcatctta ttctgttatg aatgtcaaat tggcagcaca agttctaagc actactgtaa 2400 gtaaagtcct tcttaactat ggatctccag atacatttgc aacagcaaac ttttgttcaa 2460 tgattgacag tttttttgat attatgaaca ttagtaatcc ttatgaatca attcaaaagt 2520 caaaaccatt tttagttcca ttttcttctc ttgatgacaa tcgtttgaac tggcttaaag 2580 cagaattttt gccatatttc gaaaagtggt taagttcaat taatacccgt caaggtaaat 2640 ttactctaaa tgacaaaggt aaaatgttta tatcatggca aacctatgaa ggaattaaaa 2700 ttacagtgaa ttctgctgtt gagttgataa catttttact atctaatggt gtatcttatg 2760 tattgacaga aagattttgt caagatcccc ttgaaaatta ttttggacat caacgccaaa 2820 tcggttcccg caaggataat ccatctgtcc gtgattttgg ctacaacgat aatactattc 2880 gaaatcaaaa ggtattcaga ccaataaaag atggaaattc ccatgaagat attcactttg 2940 agataaacga agaaccagta ccatgcagaa agcgagtaaa aacttcatta caatagggta 3000 ttcattaaaa acggctattt tcatagccac aaattattaa aacacattat tggcaaagtg 3060 aacaggtagc atgcaacgat atgcaatgta cgttttgggg acacccttaa tatatatctt 3120 gctgtattga tagatatatc ttactctaag gagcctaata caaggtgggg gaagaaattt 3180 attctagatc taagcgcgca ggaggagggg taataaaaaa gtgacattat ttttacttaa 3240 aatatatccg attttgcaat tttccgataa aaattctact aattccactg tgatgatagt 3300 ttaaattttt ttaaatgtgc acatcacatt gtagctatcc cctcccctat gtgacattaa 3360 gtaatacttt ttgatccccc tccccttaag actctcacgt actatttaaa cagcccctat 3420 ttaaataaat cataaataaa gtaacccttc ttcaccttct agcatacgca ggcattattt 3480 atcattaaaa aattcaaaac tttaaaatct aaatttattg taggaatttg ttaattcaac 3540 atttttctct atttaaataa gaaaacaggc agtttacaag agcctttcaa agacaggtta 3600 ataattttaa agttttccct tcccctaatg tccattgatt atgtcaacag tattaaggca 3660 tttttttcct cttctctcct ctcttgtcaa acttacagag cttcccctag ggctaacaga 3720 gctttactag gcttccccta caaaattaca tctartttta taaacttctc ccctgatccc 3780 taaatacaaa actacaatac aaacgcttta tcaacaattt ttgaaaataa catcaatcaa 3840 ttgatgttat taacagaaaa gcctaaatga accttaatag caacttcttc cccctccctg 3900 aacacacaca cttttttcac acaattttct tgctttttct ttccttaaca tgctttagga 3960 tccacatgaa taatgattgt ttttttacaa aaattccctt ttaattgaca actattttac 4020 tagtactaat tttaaaatta ccctgtatac ttggagtaat ttcatgaaaa cagcaattct 4080 attttattaa aaaattatta ttaatattct gataccataa tgtctttagc catttgtagc 4140 tatgaaaata gctgtttaaa aaaattgata tattaagatc aggtaagtgt atctatttgt 4200 ccaatgtgca acctatactt gattactctt cttctttagt gaagttcgca gtgactttga 4260 cttaagttgt ttaactttaa ctttataagc ttgtagatta tccttgacgt aggaaaatgt 4320 ccttgctctt acatacaaat tcaagatgtc ctcaagaaca tttaaacaaa cctctttgtt 4380 aacaacaaca tcagtttcac ttttgatctt tgaaaaattt gataaaatat ttgtgtcttt 4440 gagaagtgtt agtacaattt ctttactgtc aattttattg atagacttta cagccatatt 4500 tttaaatgtt gtttcagcag taataaaaat ggatattgcc tcaattttca ttttccataa 4560 gcctgatctg ttacgtgtgt tcacaagttt atgctcaggc aaatcatcac ttccttgata 4620 ttgggcagca agtaaaatag acatatactg ttggtaatat tcttccttaa aaatatctga 4680 aaacctgatg cgacgatata acgtgctaaa aacataccca cttagatagg caataacaga 4740 ttgatctcta tttgaaagta cacaagattt ttcagataaa attgaatcac ctaaaactga 4800 ttgaccagaa agatgtgaca atacatggtt acatacctcg aatccaagta aggtggaaca 4860 ctgtgtattt aatcctataa aaatagattc aatgttgtca aagatacatt tgtaaaaatc 4920 tggataaaaa ttttcggcat ttcctcgaaa taaagataaa accggtttaa taaactgata 4980 tgctgtgtta gatatatcga aattaatcac ataggcttta aaatgatttt gtattgattc 5040 aaagcagtta tctgatgaaa gctttgcaac acttttttct aaatatattt gaaaattttg 5100 aagatcaagt atttcttctg gtattttaga tttctcagta tcttttaaaa cttccgagtg 5160 aactgataaa gcatctgaat gtttgcaatt agtatgacga gtgagacccc tagagctcaa 5220 acatatttta ttacaaaatc gacacgaaaa tgttgtttgt ggtgtatgcg aatcatgttg 5280 tttttcaaca acatcagtta acaaatcctg gtcaataact gcaaaaatag catcaagctc 5340 atcttgattt aactcgacaa tgtgctcaaa aagaaataaa tcatctgaac cttcggccat 5400 cattaagttt attttgcacc ttattcctgc gggccacccg attttcaact taaacaataa 5460 aaactgttag aaacggcctg cgtttccaga ctattctttt tatgactatg 5510 // ID CR1-29_BF repbase; DNA; INV; 1876 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-29_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-29_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1876 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1876 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1600-1600 (2009). XX DR [2] (Consensus) XX SQ Sequence 1876 BP; 512 A; 542 C; 365 G; 457 T; 0 other; acctacctgg tagggatgac tctcccccta cttttctccc acctgtctct cccgagacgc 60 tcctcgagtc actacatgta acagtggacg aagtacgcca acaactcctc acgcttaaag 120 ttgggaagtc ttgtggtcca gacgacatta caccaaacct cttacggcgc gtggctgacc 180 ctatctcggc acctctgacc aaattgttca acatgtcgct gtcccgcggc caagtgccct 240 ccatgtggaa agacgcaaac gtcacaccaa ttcacaaagg tggcagtcgc catgttccca 300 acaattacag gccaatatcc cttctgaata tcattccaaa agtcctggaa aacctcataa 360 acaaacgcct ggtggagcac atccgcccaa tgctgactca acaccaaagt ggctttcgct 420 cgaaagacaa cacaactctc cagctactcc gtctggtgga cgagtggacc aaagccatgg 480 acagaggaga ggtggtcgga gtcgcatttt tggacctacg caaggccttc gataaggttt 540 ggcataaggg cctcctagcc aaacttgaag ggtacggaat tcgagggccc gttcttgagt 600 ggtttgcaag ctacctcaca gatcgacgtc agcgagtagt tgttaacggt gcgacatccg 660 actggaagac cccccttgct ggagtcccac aaggctctgt tctagggccc acactcttta 720 tattatacat taatgactta ccctacagat gtactgcatg cgcggcgaac ttgttcgctg 780 atgatacttc agtctcaaca tcccatcggt ccatctccat cgttacagac tcactcaaca 840 aggatttggc cactgtatct gactggttgc tatgctggaa actcgaggcc aacggagaca 900 agtgtaaggt catgtacctc acttcccgcc ccctcccacg tacgatccca ccagtcatcc 960 tggccggcac cacactacag attgtctaca gcttcaaaca cctaggcgtg accatgactc 1020 acaaacttac ctggtccctc cacgttgaag ctacaacaaa caaggccaga cgtacagccg 1080 gcatgctctg ctctctacgg aagaaaatcc ccaagaccac actactacag ttgtacaaaa 1140 ctttgactag accggtgctg gaatacgcag acatcgtttg gtcaggcctg accaaacgcg 1200 acgaaaaact catcgaatct gttcagtacc gggtcgccag agtgattagt ggtcatcacg 1260 gatttcctta cccttcctat cgtactgttt acgcccacct tggtcttcca tcgctcacct 1320 ttcgccgcaa attccacaca ctgactgccc tattcaaact gaggaatgga cactgcccac 1380 cccacctctg caacctcgtt cccactacca gatcatcagc cattcaatcc agataccctc 1440 ttagaaatgg aaacaacttg tcaataccac ttcccaaaag cacaagaact cagaaaacct 1500 tcttccacag agccactcag ttatggaata atctcacacc cgaaacacaa acagctccca 1560 caatatcatc cttcaaggcc aaagcctggc tctcactcgg tgacccattc tctgtgaact 1620 aatgtaacta tttcttttgt acagattgta tagatatgta tataatgtaa attttgtaac 1680 cgttttgctt aaacaatgta agttaacgct ggtgatttta tgattctgtg tttattgcag 1740 aaaattgtat ttgttatatt taacatgtat gtattctctc ccagggctag cccttgacaa 1800 tagcccatgg ctagttgggc agccctggct gtacgcactc agccgaacga ataaataaat 1860 aaataaataa ataaat 1876 // ID CR1-80_HM repbase; DNA; INV; 4270 BP. XX AC . XX DT 07-JAN-2009 (Rel. 14.02, Created) DT 07-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-80_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4270 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 367-367 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 14..730 FT /product="CR1-80_HM_1p" FT /translation="MAITLLEIQEMFILHKKEIQLLFKEQEKVFVGLLSDN FT LKILNARLDNFEKKLSDYDKKIHLLENDVKDLKESLNFHENLIDQKIKAAK FT EIETPNIINILNEKCRISEDRSRRNNLRIVGIPENANESWDETEEKVQNLL FT CNKLGVKGVEIERAHRVGPKKEAQSRTIVLKLLRFKDKSNILKETKKLKGL FT SIYVNEDFSQETASLRKKLFTEAKERKSKGENVIVRYDKLIILKKGLQ*" FT CDS join(792..1211,1198..1947,1983..2615,2569..3852) FT /product="CR1-80_HM_2p" FT /translation="MANKLNATNFFEIFNKQNTILLNNDIDPDVNFFEDVI FT INSNYFDIEEVSNLMNASQSLFSTLTLNIRSMSKNFENFKSMLKNINFEFM FT VICLIETWCKSEKNNFQIPGYKAIHQTRGGGVGGGVCIFIHESIHYLENRN FT SKIEILSIQNADYESLTIELLNHNKKNVLVTALYRPPSGNKKSFQKHLKWY FT LKKVKNKQIYITGDFNLNLLNVKIDNDVKHFLNSLFQYNILPLINKPTRVT FT LLTESLIDNIFTNIFTTCNIQSGIIKSDISDHFPIFFISDFYIKEKKSKVL FT TEIKRQITEFNIKEFRNHLLKVNWKQLHNCKDTNKAYAFFYNEFLNAYDKA FT FPMKEIKTKCKNLQSPWITKGLIKSSKKKQKLYEKYLKKKTYQNEKKKKKL FT KKIISQLYYKIILEITKKTWDIIKEVIGKEKKTRGNYLPKYLYTEDERTIT FT NEKKDIANSFNNFFINVGKNLANKITPGKKNFKSYLKEKDYVMDELEVSPD FT ELRFAFNMIKPNKSAGLDGISPKVVKEVFDIIENPLLIIFNLSFKNGVFPD FT LLKLARVVPIFKDGDISKLSNYRPISILPCFSKILERIMHNRLYNLSHNSQ FT SFKCIIGFTTYLTTHKVLNNNQYGFRKGHSTEHAVIKLVKEILNGFENNQY FT TLGVFIDLSKAFDTVDHDILLYKLEIYGIKNNNFKWFCSYLKNRKQCVSYD FT KTYTKLENITCGVPQGSILGPLLFIIYINDIYLSSSILNFNLFADDTQVFY FT THSNIKTIFTTMNSELDNLNEWFKANKLSLNPTKSKYTLFSKSSKSETLPL FT KLPDILIEKTKLQRTYCVKFLGVLIDEQINWKEHINLIENKISKSIGIMYK FT TRYMLDKNCLKSIYFSFIHSYISYCNIAWASTYPSRLTCILKKQKQASRLI FT LNANKYTSANPLLRKLGVLNVYQLNIYNILLFMFRLKYHMLPNIFEDLFFI FT IDHKYQTKHSLLNYQVPKTLLKQSEFSITYRGPHLWNSFLLNNSKTTTSIQ FT TFKSILKKQLLDYDVSQILFYF*" XX SQ Sequence 4270 BP; 1712 A; 594 C; 572 G; 1387 T; 5 other; aaaaatataa aaaatggcta taactttatt agaaatacaa gaaatgttta tcctacacaa 60 gaaagaaata caattattat ttaaagaaca agaaaaagta ttcgttggtt tacttagtga 120 taacttaaaa atactcaatg cacgcttgga taattttgaa aaaaaattat cggattacga 180 taaaaaaatt cacttgttgg aaaacgatgt aaaagactta aaagaaagtt taaactttca 240 tgaaaattta atagaccaaa agataaaagc tgctaaagaa attgaaaccc cgaatattat 300 taatatttta aacgaaaaat gcagaatctc cgaagacagg tcgaggagaa ataacttaag 360 gatagttgga attcctgaaa atgcaaatga aagctgggat gaaactgaag aaaaggttca 420 aaatttatta tgcaataaat taggagttaa aggagttgaa attgaaaggg cgcatcgagt 480 aggccctaaa aaagaagctc aatctcgaac cattgtatta aaattattga gattcaaaga 540 caaatcaaac atcctaaaag aaacaaaaaa actaaaagga ctaagcatct acgtaaatga 600 agatttctcg caagagacgg caagccttcg taagaagtta tttacagaag cgaaagaaag 660 aaaatcaaaa ggagaaaacg ttattgtcag gtatgataaa cttattatwt taaaaaaagg 720 tttacaataa gacgaaatga tactgttaga aaacaatttt aatcttaatt ttgcatattt 780 taataataac aatggctaac aaattaaatg caacaaattt ttttgaaatt tttaataaac 840 aaaacacaat tcttttgaat aatgacattg atccagatgt aaactttttt gaggacgtca 900 taatcaactc aaactatttt gatattgagg aagtttctaa cctcatgaat gcgtcacaat 960 ctttattttc aacattaact ttaaacatac gtagtatgag taaaaacttt gaaaatttta 1020 aatcaatgct aaaaaatata aattttgagt ttatggtaat atgcttaatc gaaacatggt 1080 gcaaaagtga aaaaaataac tttcaaattc caggttataa agctattcac caaactcgag 1140 gtggtggcgt tggtggcggt gtatgcattt ttattcacga gtcaatccat tatctagaaa 1200 atagaaattc ttagcattca aaatgcggat tatgaatcat taactattga attactaaac 1260 cacaataaaa aaaacgtatt agtaactgct ttgtatagac caccaagcgg aaataaaaaa 1320 tcttttcaaa aacacttaaa atggtattta aaaaaagtaa aaaataaaca aatttacatc 1380 acgggggact ttaacttaaa tcttcttaat gttaaaattg ataacgacgt aaaacatttc 1440 ttaaactcac tttttcaata caacattctt cccttaatta acaagcctac tcgtgtaact 1500 cttctaactg aatctctcat tgataatatt tttactaaca ttttcacaac atgcaacatt 1560 caaagtggaa tcataaaaag cgatataagc gaccactttc caattttttt tatatctgat 1620 ttttatataa aagaaaaaaa aagcaaagta ctaacggaaa taaaaaggca gattactgaa 1680 tttaatatca aggaatttag aaatcattta ctcaaagtaa actggaaaca actacataat 1740 tgtaaagata caaataaagc ttatgcattt ttttacaacg agtttcttaa tgcgtacgac 1800 aaagcgtttc cgatgaagga aattaaaacc aagtgtaaaa acctacaaag tccatggatt 1860 acaaaaggac ttattaaatc ttctaagaaa aaacaaaaac tctatgaaaa atatttaaaa 1920 aaaaaaactt atcaaaacga aaaaaaataa taaaacttac aaaaacctgt ttgaaaactt 1980 gaaaaaaaaa gctaaaaaaa attatttctc aactttatta caaaataata ttggaaataa 2040 caaaaaaaac ttgggatata ataaaggaag tgataggaaa agaaaagaaa actcgtggta 2100 attatttacc gaaatattta tatactgagg atgaaagaac aataactaat gagaaaaaag 2160 atattgcaaa cagttttaat aactttttta taaacgtggg taaaaaccta gcaaacaaaa 2220 taacaccagg aaaaaaaaat tttaaatcat atctaaaaga aaaggactat gtaatggatg 2280 agcttgaggt atcgcctgat gaacttcggt ttgcttttaa tatgataaaa ccaaacaaaa 2340 gcgcaggcct tgatggtata agccctaagg ttgttaaaga agtttttgat attattgaga 2400 accccttact tattattttt aatctttcat ttaaaaatgg cgtttttcct gatctcttga 2460 aactagcgag agtagttcca atatttaagg atggtgatat ctcaaagtta tctaattaca 2520 gacctatttc aatactccca tgtttttcta aaattctaga gcgaataatg cataataggc 2580 tttacaactt atctcacaac tcacaaagtt ttaaataata atcagtacgg atttagaaaa 2640 ggacactcga cagaacatgc agtaattaaa ttagttaaag aaatattaaa tgggtttgaa 2700 aacaaccaat atactcttgg agtgtttatt gacctctcta aggcctttga tactgtggac 2760 catgatattc ttctttacaa acttgaaatt tatggaatca aaaataataa ttttaaatgg 2820 ttttgttcct atcttaaaaa tcgcaaacaa tgcgtatctt atgacaaaac ctacactaaa 2880 ttagaaaaca ttacctgcgg agttccacag ggatctattc tagggccttt gttgtttatt 2940 atttatatca acgatattta cttatcatct agtatactaa actttaatct ttttgctgat 3000 gacactcaag ttttttatac tcactctaac attaagacaa tctttaccac tatgaatagt 3060 gaacttgata accttaatga atggtttaaa gccaacaaac tctctctgaa tcctactaaa 3120 agcaaatata ctctcttttc taaatcctct aaatctgaaa cgctgccttt aaaactcccg 3180 gatattttaa tagaaaaaac caagttrcaa cggacatact gygttaaatt cttaggcgtt 3240 ttaatcgatg agcaaataaa ttggaaagag catattaatc taatagaaaa taaaatttca 3300 aaaagtattg gaatcatgta taaaactaga tatatgttgg ataaaaactg tttaaaatct 3360 atttactttt catttattca tagctatatt agttattgta acattgcttg ggcaagcact 3420 tatccgtcaa gactaacttg tatcttaaaa aaacaaaaac aagcaagccg ccttatatta 3480 aatgctaata aatacaccag tgccaatcca ctgcttcgta agcttggagt actgaatgtt 3540 taccaattaa atatttataa tattctttta ttcatgttta gattaaaata tcatatgcta 3600 cctaatattt ttgaagacct tttctttatt atagatcata aatatcaaac aaagcattca 3660 ttgcttaatt atcaggtacc aaaaactttg ttaaaacaat ctgaattctc aataacttac 3720 cgagggccac atttgtggaa ctcatttctc ctaaataata gtaaaactac aacttcaatc 3780 caaactttta aaagtatctt aaaaaaacag ttacttgatt atgacgtatc tcaaatacta 3840 ttttattttt aatttttatt ttgttctctc ttatttttat ttcgttaact tttctttttg 3900 tttaaaaagc ggcatcacac tttatatcat tacttatctt tatatttgta ttgtatttgg 3960 aaaaaaaaaa aatgtattta atattatttt actgagattg taatatatat aaacatgtgt 4020 atgtatatat gtgtatgtac atrtgtatat atatatatgt atgtatatgt acatgtatgt 4080 atgtgtatgt gtgtataagt atatatatat atatattttc ttatatgatg ggtaattttt 4140 tttttttttg gtttgatatg taaatgatac aamggggctg gatgataaga cttttgtctt 4200 ctgcctgctc ctgtcatatc ttaactcaaa atgtatatat tgtaaataat ttattataaa 4260 tatgacgaaa 4270 // ID CR1-57_AAe repbase; DNA; INV; 5594 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-57_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5594 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1144-1144 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 19 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 289..1680 FT /product="CR1-57_AAe_1p" FT /translation="MTSVTCSACTQIITTESDRVYCFGGCEQILHIRCSEL FT RQSDSNALRNNVALKYMCFACRKKQVCLNDLQAKCTELLERINEIGATVTK FT HETSFSQLENRLLTRIESVLMPALVSKIESMISTRKVNDNRLSYATVTRDA FT TTTAVCASAVTVSPVPNGSDGRESLDEGWILRSGKRRGVTKSVTSTKTSAE FT NKNSDNSQNRRKRSSNVQSTAVKKIEQTVIIKPKSVQQADVTLKQVRDKID FT PVNFSVKGIRTRENGDVVVRCETSEHAQKLVDAAVDVLSDCYEISVQKPLK FT PRIKIIGLSEDLDASEFVSVLKKQNELPTSAEITLIRMRKIEKWKQFPFIA FT IIETDCQTFETLIQRERVHVRWDRCRVAEDVNVLRCFKCSKYGHKASSCNN FT PLCCPICTGDHEATDCDAAFEKCINCELRNKLSKSPYDEQLDVNHSAWSSA FT CPVYQKRLKNVRLMVDYSA" FT CDS 1684..5193 FT /product="CR1-57_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSVADRGITVSMGKLNTGICPAGALIDTAPIQMDISP FT QPIATTDTCLTDFGKSRTMCYQSSNFHDSLVGNPDCSGNLNTGICPAEAPL FT DAAPIQLDISYQMAIGTDATYNSRTICGSSCHQDCLVDNPACSGKFNERIC FT RAEVRTDAARFNSHLVLQSPVTSIHRNGWNDADGPGVPGNGGSICPAEAEN FT DTAPPESNSRCCFSVDEEDARNKLTFFYQNVRGLRTKIDDFFLAVSTCCYD FT VIVLTETWLDDQIMSPQLFGNSYTVFRNDRNRRNSRKSRGGGVLIAISTRI FT SCSLDPSPMHDSLEQIWVKLNMPGLSISVGVLYLPPDRKADLTSINNHIDS FT VESVFSNLRPNDLAYLFGDYNQSNLRWNTTPRTSPTIDVMRSSMSTACSSL FT LDGFSLHGLIQINHVLNQNERLLDLVLVNEAAFGNCALSEAIEPLCALDND FT HPALELDCYLPRPLKFDNVSYESGFDFRKADFHGLSEALLRCDWRFLNAAG FT NIDAAVDSYTRMMQTILPDHVPLRRPPAIPPWSNFRLKNLKRKRSKALRKY FT CRTRCIVAKQALTRASNKYRLYNRLLYKRYAIRTQNNLRRNPKQFWNFVKS FT KRNETGLPSEIFLGSSHASSTPEKCNLFARHFQGAFNETKADDSQIDIACR FT NTPINVCELSISNITSQQVTAAISKMKYSTASSPDGIPSCVLKRCCNALSP FT VLASLFNMSLQQCRFPSSWKLSIMFPVYKKGDKRDVENYRGITSLCASSKV FT FEIVVYDMLFAASKDYICTSQHGFYPKRSVSTNLVIFASHCIRNMDDGLQT FT DVVYTDFKSAFDRVDHNILLKRLELLGVSFNTICWLKSYLTNRKLRVMIGS FT EKSVAFNNLSGVPQGSNLGPLLFSLFINDLSRLLPTGCKLFFADDVKIYLT FT IKSFDDCIRLQELLNIFVEWSSINKLTLSLHKCSVISFHRKLKPMIYNYTI FT GNRNLERIDHVRDLGVILDTALTFRIHFEDIISRANRQLGFIFKICDEFKD FT PLCLRSLYCALVRSLLESNVVAWCPYQLTWINRIEAVQKKFIRRALRFLPW FT RDPVNLPPYEHRCRLLGLDTLVSRRSAQQAGFAAKVILGEIDAPEILSQLN FT FYAPERILRQRNFFMAGGSNTVYGQHNPINAVQTVFNEAYELFDFNMTAAT FT FQRRVLQRGTH" XX SQ Sequence 5594 BP; 1637 A; 1179 C; 1146 G; 1632 T; 0 other; aatgtgtgca aagcaagtgt gtattgtgta ttgtgcttac gtttaatttc tgtttatcct 60 tgcaaaaact gcaatacatg tgatctaaat tgcatcagat tgcagaactt tttatgctct 120 tgccacgcat agtggatgat tattgtgcag tgcaagtgtg ttaacaaaat ttaaagataa 180 ataactatca aaaattacat cggtgcagca acaacaacag aatcgacaga atcacaaagc 240 gttcttctgt tccattgaag aggggcatgc gtgccctgcg tcgctcccat gacgtcggtt 300 acgtgtagcg catgtactca gataatcacc accgaaagcg atagagtcta ttgttttggc 360 gggtgcgaac aaattctgca tattcggtgc tctgagcttc gtcaatctga ttccaatgct 420 ttgcgaaaca atgtcgctct caaatatatg tgctttgcgt gccgtaagaa acaggtatgt 480 ttaaacgatc tgcaagctaa atgtaccgag ttattggaaa gaataaatga aattggtgcg 540 actgtcacta aacatgaaac tagcttcagc cagctggaaa acaggcttct gaccaggatt 600 gagtctgttc taatgccggc gcttgttagc aagatcgaaa gcatgatttc tactcggaaa 660 gtaaacgaca atcggctctc ttatgcaact gtcacacgtg atgcaacaac tactgcagtc 720 tgcgcttctg ccgtcacagt gagtccggta cctaatggtt ctgatggtcg tgagagcttg 780 gacgaaggat ggatactacg atcgggcaaa cgtcgtggtg tcactaaatc ggttacaagt 840 acgaaaactt ctgcagagaa caaaaattcc gataacagtc agaataggag aaaacgatca 900 agtaacgtac aatctactgc tgtgaaaaaa attgagcaga ctgtcatcat caaaccaaag 960 tcagttcaac aggctgatgt tactctgaaa caggtccgag acaaaattga tccagttaat 1020 ttctctgtaa aaggcatacg aaccagagaa aacggtgatg ttgttgtacg atgtgaaaca 1080 agcgagcatg ctcaaaagct agttgatgct gcggttgacg ttctttccga ttgctatgaa 1140 atttctgtac aaaagcctct taagcccaga ataaaaatca ttggtttgtc cgaggatctt 1200 gatgcttctg aatttgtatc ggttttgaag aaacaaaacg aactacctac ctcggcggaa 1260 ataacactga tccgaatgcg aaaaatagag aagtggaaac aatttccatt cattgctatt 1320 attgagaccg attgtcaaac cttcgaaaca ctcatacaac gtgaacgtgt ccatgttcgt 1380 tgggatcgtt gccgggttgc tgaagatgta aatgttctcc gctgctttaa gtgctccaaa 1440 tatggacaca aagcttcctc atgtaacaac ccgttatgct gtccaatatg cactggtgat 1500 catgaagcta ccgattgtga tgctgccttt gaaaaatgta ttaattgcga actgcgaaat 1560 aagctgagta agtctccgta tgatgaacaa ctggatgtca atcactcagc ctggagttca 1620 gcgtgtccag tctatcagaa gcgtctgaag aatgttaggc tcatggtgga ttattcagcc 1680 tagcaatcag ttgctgatag gggtattact gtttctatgg gtaaattaaa caccggtata 1740 tgtcctgccg gagcacttat agatactgca ccaattcaaa tggatatttc accacagccg 1800 attgccacaa ccgacacttg tttgactgac ttcggaaaat ctaggacgat gtgttatcaa 1860 tcatctaatt ttcacgattc tcttgtgggc aatcctgact gttcaggtaa tttaaacact 1920 ggtatatgtc ctgccgaggc accccttgat gctgcaccaa ttcaattgga tatttcatac 1980 cagatggcta ttggaactga cgccacctac aactccagga caatatgtgg ttcttcatgt 2040 caccaagatt gtctagtaga caatcctgct tgctcaggta aatttaatga acgtatatgt 2100 cgtgccgaag tcaggacgga tgctgctcgt tttaatagcc atttggtttt acagtctccc 2160 gtcacttcta ttcaccgcaa tggatggaac gatgctgatg gtccaggtgt cccaggtaat 2220 ggtggcagta tatgtcctgc cgaagcggag aatgatactg caccacctga atcaaattct 2280 cgctgttgtt tcagtgtgga cgaagaagat gcacggaata agttaacctt tttctaccaa 2340 aatgtacgtg gactacgcac aaagatagat gatttctttc ttgctgtatc aacttgctgc 2400 tatgatgtta ttgttcttac ggagacatgg ctcgacgatc aaatcatgtc acctcaactc 2460 ttcggaaact cctacaccgt cttcagaaac gatcgaaatc gacgcaatag cagaaaatca 2520 cgcggtggag gtgttcttat cgcaatttca acaagaataa gctgcagctt agatccttct 2580 cccatgcatg actcattgga acaaatctgg gtaaagctga atatgcccgg actcagtatt 2640 agcgttggtg tcttatacct ccctcccgat agaaaggcag atttaacaag cataaacaac 2700 catatcgatt ccgtggaatc tgttttttcc aacttacgac caaacgatct tgcttatctc 2760 ttcggtgact acaaccaatc aaatttacgc tggaacacaa cgcctaggac aagtccaacc 2820 atagatgtta tgcggtcttc tatgtctaca gcgtgcagta gcctccttga tggtttctct 2880 cttcacggtc tgattcaaat caaccacgta ctcaatcaga atgaacgtct tttggatttg 2940 gttcttgtta acgaagcagc ctttggaaac tgtgcacttt ccgaggccat tgagccactc 3000 tgcgctcttg ataatgatca tcctgcttta gagttggatt gttatctgcc tagacctcta 3060 aagtttgaca atgtttcgta tgaatctggt tttgactttc gcaaagctga ttttcacggc 3120 ttaagcgaag cactacttcg atgcgattgg cgatttttaa atgctgcggg taatatagat 3180 gctgctgttg acagctatac aagaatgatg caaactatct tgcccgatca tgttccgtta 3240 cgcaggccac ccgcaatacc tccgtggagt aactttcgtc ttaaaaactt gaaacggaag 3300 agatccaaag cgcttcggaa atactgtaga actcgttgta ttgttgccaa gcaggcctta 3360 accagagcaa gtaataaata tcgactgtat aatcggcttc tttacaaacg ttatgctata 3420 cgaacgcaaa ataatttacg cagaaatcct aaacaatttt ggaattttgt aaaatcgaag 3480 cgcaatgaga ctggtctgcc atcagaaatt ttcttgggct cgtcgcatgc ctcctctact 3540 ccggaaaaat gcaatctttt tgctaggcac ttccaaggcg catttaatga aactaaagcc 3600 gacgattcgc agattgacat tgcctgtaga aacactccta tcaatgtatg tgagctttcg 3660 attagtaata tcacaagtca acaggtcact gctgcgataa gcaaaatgaa gtactccact 3720 gcttcaagtc ctgatggaat cccgtcctgt gtattgaaaa gatgctgcaa tgccttaagc 3780 ccagtgttag cttctctgtt caatatgtcg ctacagcaat gtcgttttcc gtccagttgg 3840 aaactctcta tcatgttccc cgtatacaag aaaggtgaca agagagacgt tgagaattac 3900 cgtggaatta catccctatg tgccagctca aaagtcttcg aaatcgtcgt ttatgatatg 3960 ttattcgctg ccagtaaaga ttatatttgt accagtcaac atggattcta cccaaagaga 4020 tctgtttcta caaacctggt gattttcgct tcacattgta tacggaacat ggacgatggg 4080 ctacaaacag atgtggtata tactgatttc aaatctgcat tcgatcgtgt agatcacaac 4140 atcttgttga agcgactcga gctgcttggc gtatctttca atactatttg ctggcttaaa 4200 tcgtacttaa caaatcggaa actacgtgta atgattggct ctgaaaaatc agttgcattc 4260 aacaatttat ctggcgtacc acaaggaagc aatctcggac ccttactatt ctcgctcttc 4320 atcaacgatt tgtctcgtct actacctaca ggatgcaaat tattttttgc cgatgacgtt 4380 aaaatatact taacaatcaa aagtttcgac gattgcatta gattacaaga attgttaaat 4440 atctttgttg agtggagctc aatcaacaag cttaccttga gtttacacaa gtgcagtgtt 4500 atatcatttc atcgcaaact taagcccatg atctacaatt ataccatcgg caatcgtaac 4560 ttggagcgta ttgaccacgt acgggacttg ggcgttattt tggatactgc tctcactttt 4620 cgcatccatt ttgaagacat tatttccaga gctaatcgac agttaggatt tatttttaaa 4680 atctgtgacg aattcaaaga tcctctgtgt ctccgttcac tatactgcgc acttgtacga 4740 tcactgttgg aatccaatgt tgttgcctgg tgcccatatc agttgacttg gataaacaga 4800 attgaagctg tccaaaagaa atttataaga cgtgcacttc gctttctgcc ctggcgtgat 4860 ccggtgaact tgcctccgta cgagcatcga tgccgtctcc ttggacttga tacactggtc 4920 agcagaagat ccgcacagca ggctggattt gcagcgaagg tgatacttgg agaaatagac 4980 gcaccagaaa tcctgtctca gttaaatttt tatgcaccgg aacgaatcct gcgacagcga 5040 aactttttca tggctggagg cagtaatacg gtttacgggc aacacaaccc gatcaatgcc 5100 gtacagacgg tattcaatga agcatatgaa ttgttcgact tcaacatgac tgcagctaca 5160 ttccagagaa gagtactgca aaggggtacc cattgagcat gaatgcagca ttagattcaa 5220 caaacagact ttcaggcgag aatatgtcca ataacttaac gttagaaaaa tttagctaca 5280 tgtctctttt tgtttttttt ttcagatgct gacggagaat acctctgtat atttgcaata 5340 cttgactcat ttttaaattg tatgtcctgc ttacttaatg tgtttagtgt atttttataa 5400 gtcatgttat tttatgtaga aagatatttg aaaagatgcg gggtttttac gtcttttgga 5460 gactggccac aatttctcct agtgtcaact ccaagggact tttccccacc acctcaatat 5520 tcattaagac aaaagtcaga tgaaggaaat aaaataaata aataaataaa taaataaata 5580 aataaataaa taaa 5594 // ID hATm-30_HM repbase; DNA; INV; 5068 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-30_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5068 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1924-1924 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(705..779,991..1242,1398..1727,1775..2539, FT 2520..2753,2830..3162,3216..3542,3546..3917) FT /product="hATm-30_HM_1p" FT /translation="MIEFVYIDNLKRLRLVHINMQLDNFGPRTGIFRFRNY FT KMQKNSIKRRRVSARTKEHFLVGKPAKMCSMKLPTRADVIKQFLYLKKKNG FT PSYQNKRLICCPLSNKFRFNLVSFLSQILQIFLFSFQCTSLDTTCKKECVL FT GALWFPWLRSGFPMIGPVTLQRNITKYINRYLKLKKNQHNSKARCVQSRTD FT FLKTVRGTNYFLFKKLFIISFEIHSSNLNYKYFQGLFWLSNTKKIFIKAIK FT ADKDRLHSDKSEDLLFLKDQMGEIKMTLGPIDKRRAEIEQKRNSRIQSKVI FT ASTGFCEKSFVQSRGFKVCSTDGVHLPKTISSEPSSESESENNYMDCSIRS FT FKNQEQTIVINREVIKETMRTAIGMDITAHKITATLIKLVTAAGGDPNKLP FT LSYTSAFRMKNNIINKDAENIKILIKKEIEQGKVKIQLHFDGKLIKVCIYL FT ILTGVLIENNCFSYIVNLILVLGLDIKSRDLISNHGDVAIDRICVLVRAGG FT TDHLLGAPRVHSGTGLNQFKAIKEILKEFEIADHVRMVCFDTTAINTGMIR FT GAIYRFFRLRNDVGKPLLMLACRHHVLEVILKHYWLAVTIDKTTGPDNALF FT KGLKSKWNDIDVKNCVITHFKWQSIYDSWFNEQAEKARKCLMTIVNEGVFK FT NSQVNFSKHLNLQVRHEKRADYQELAELALMFLEPTFKPLNIHKPGAVHHA FT RFMAKSIYYLKIMLLLNSTKQILKLSQEKINEVKKMAIFISIFYAPQFLRA FT EKSDTAAIQVSFLAFDTNNHSILALLFDKIVYPNCIEILTLQDQHFIWHMA FT HLEKFDVAAASVFKRWGKHSWYLDPTTVIFALANSDQSVSDITKEEMARKL FT VQYPRTNKYDFERKSNSGFIPQKTEFLKLTEPPSLVNYVGAE*" XX SQ Sequence 5068 BP; 1822 A; 755 C; 864 G; 1627 T; 0 other; ggggtgttca aattttacaa attttttttt tggaacatgt atgcaagcta aaaagatgtg 60 taatcgtgag tgattttaag ataatatttt gatttttcaa aaaaaaaaaa atttcacctt 120 gctacaccca accaaagttt ttcaaaattt gtaacttttt acaagattga aattttctcg 180 agatctaaga tacatacaaa catgtatacg acttatttgt tagcttacta tgtctagagc 240 atgcttgaaa aaaattggca ccctagcttt atgggatcac cttgctacaa cttgatttcg 300 gaaaccgtga gagtatggac gggggggggg tggaggtaag agcaaaaaaa aaaagttttc 360 gtttagcaaa tcactttgtt acaatatggt tataaaaccg cataataatt tttatttaat 420 tttatcgtaa aaatattgtg atatagaatg aaatagaaaa gtaaattatg aataataaaa 480 atacatttag ataaagtaat aaaaaataag aatcgtttac aatctatata aagtataata 540 taaaccacaa attttaaagt atacaaataa taaaaccgca aagtatcatt attattagga 600 caagagaaaa acatgcaaaa ggccacatgt tttttggttc aaaaaaccga tttacttatg 660 ataaattttg tattttaaaa atattatttg ttttacaaaa cctcatgata gaatttgttt 720 atattgacaa tttaaaaaga ctgcgtttgg tacatataaa tatgcagtta gacaattttt 780 aacaattatt gaatatttat attttatcta cacttttttt atttatcaaa aatcagtatc 840 taaagttttc attgcttttg gtttaaagtt agtttaatgc tggtttttta ttaaaagtac 900 attcgtaaag ttgtttatgt ttagttattt ttaataataa tgttaaacta atattttttg 960 aatgtattta atgcaatttt tgaactatga ggacctagga caggtatttt tagattccga 1020 aattataaaa tgcaaaagaa ttctattaag agaagaagag taagtgctcg caccaaggag 1080 cactttttgg tgggtaaacc agctaagatg tgttccatga agctaccaac tagggctgat 1140 gtgattaagc aatttcttta tcttaaaaaa aagaatgggc caagttatca aaataaacgt 1200 cttatatgct gtcctctgag caacaagttc aggttcaatt tataatttat actttaattg 1260 gtttaaaatg aacaaaaaac ctacagaaaa acatatctat ttataagtaa ataacacttc 1320 aatagcattg aggaaaaaat ataatgttat atgtagtttc acaaaaatta tatttgttga 1380 aaaaaaatgt ctgataagtg agttttttgt cacaaattct tcaaattttt ttattcagtt 1440 tccagtgtac ttctttggat actacttgca aaaaggaatg tgtactagga gcactttggt 1500 ttccatggct aagatctggt ttccctatga ttggacctgt caccctgcaa agaaatataa 1560 caaaatatat taacagatat ctgaagctca agaaaaacca acataacagt aaggcaagat 1620 gtgtgcaatc tcgcactgac tttttaaaga ctgtaagagg tactaactac tttttattca 1680 aaaagttatt cataatctcc tttgaaattc actcctccaa tttaaattaa ataaacctca 1740 aaatatttac ttgaattgtt aattttttat ttaatataaa tattttcaag gactgttttg 1800 gctcagtaac acaaagaaaa tttttatcaa agcaatcaaa gcagacaaag ataggcttca 1860 ttccgacaag tctgaggact tgctttttct gaaggatcaa atgggagaaa taaaaatgac 1920 gcttgggcct attgacaagc gtagagccga gatagaacag aagagaaata gcaggattca 1980 gagtaaagtt atagcatcaa caggtttttg cgaaaaatca tttgttcagt ctagaggttt 2040 taaagtgtgc tctactgatg gtgtacatct acctaaaaca atttcttcag aaccaagttc 2100 agagtctgaa tcagaaaata attacatgga ctgttcaata aggagtttta aaaatcaaga 2160 acaaaccatt gttatcaacc gagaagttat taaagaaact atgagaacag ctataggcat 2220 ggatataaca gctcataaaa ttacagcaac cctgataaag cttgtcaccg cagctggagg 2280 tgatccaaat aagttgcccc tttcttatac ttcagctttt cgaatgaaaa acaatattat 2340 taacaaagat gctgaaaata ttaagatttt aataaaaaaa gagatcgaac agggtaaggt 2400 gaaaattcag ctacactttg acggaaagtt aatcaaggtt tgcatatatt taattttaac 2460 tggtgtttta attgaaaaca actgtttttc ttacatcgtt aatctaatcc tagttctagg 2520 acttgatatc aaatcacggt gatgtagcaa ttgataggat ttgtgttctt gtgagagctg 2580 gaggtactga tcatctactt ggcgctccaa gagttcattc tggtacaggg ctaaatcagt 2640 tcaaggctat caaagaaatt ttgaaagagt ttgaaattgc tgatcacgta aggatggtct 2700 gcttcgatac tactgctatc aatactggaa tgattcgtgg tgctatatac aggtgaactg 2760 tttttcaata taatttaaaa aaaggactct aaaagctaat tacaataaac aatttagtta 2820 taatcttaat ttttcagact gcgaaacgat gttggtaaac cattgttgat gcttgcttgt 2880 agacatcatg ttttagaagt catattaaaa cattattggt tagcagttac catagacaaa 2940 accacaggac ctgacaacgc actctttaaa ggcttgaaaa gtaagtggaa tgatattgat 3000 gtgaaaaact gtgtcatcac tcacttcaaa tggcaatcta tttatgattc ttggtttaat 3060 gaacaagcag aaaaagcaag aaaatgttta atgacgattg tgaatgaagg agtcttcaaa 3120 aactctcagg taaacttttc taaacattta aatcttcaag tttgaataca aaaaaataat 3180 gaatgaaatt aaatatccgc attaaacttt tttagagaca cgagaagaga gcagactacc 3240 aggaacttgc tgagctagct ctcatgttct tggaaccgac attcaagcct ctcaacatac 3300 acaagcctgg agcagtacat catgctcgtt ttatggcaaa gagtatctac tatctaaaga 3360 taatgcttct tcttaattcg accaagcaaa tcctaaaatt gagccaggag aagattaatg 3420 aggtgaaaaa gatggcaatt tttatttcaa ttttctatgc tcctcaattt ctgagagctg 3480 aaaaatcaga tacagctgca atccaagtta gttttttagc ctttgataca aataatcata 3540 gttaaatttt agcattatta tttgataaaa tagtgtaccc aaattgtatt gaaattttaa 3600 cattgcagga ccagcacttt atctggcaca tggcccactt ggagaagttt gatgtagctg 3660 cagcttcagt cttcaaaagg tggggaaaac acagctggta cttagatccc actactgtta 3720 tttttgccct agccaattca gatcagtctg tgtctgacat tacaaaggaa gaaatggcta 3780 gaaagctagt acagtatcct cgaacaaaca agtatgattt tgaaagaaag agcaattccg 3840 gttttattcc acaaaagacc gagtttctca aactcaccga gccaccttca ctcgttaatt 3900 atgtaggagc tgagtaatgg ttagtgtttg atatacttga acacactgaa agacatgtta 3960 agtggctttt ttatccttcg tctacatgaa atatagatcc agatttcatt gcttttcaac 4020 tatttgttaa aagcttggca gttgttaatg atgcagcaga gagagccgta aaagcaattc 4080 aggaggttgt ttcacagaca tatgacgaga agaaactgca gaaaatgtta atagtcaaga 4140 ataaaataaa aaaaaacaat ttcacgcact aaagcagcat ataaagaggc agccgaacag 4200 ctaacacctg ctgaaaagct aagccttgcg tacgaatatg aatgtcttga ggagacggag 4260 gacaagttat cactatctag tactagtgat tttgacagta gctcggatat tgttgatgaa 4320 gatgatgtgg ttgagactat agaggctgaa aacaaggtct tcaataatgc agaaacctaa 4380 atctatattt taagttaata tttttttcaa cgctatcatt tatttttctg aagcacaaaa 4440 gaaggggggg ggggaggaaa aggtacggga aaaaattatc catacatgcc cctccctgtt 4500 tgggtaaaga gacatcaata tgttttttat aatatgtttt attttgctag ccataaggag 4560 gttgtacaaa actctttaga tctatataac tgctgcaaca ccttactata aaatcttgac 4620 tgaatcttga gctgcggtat tatggcggag gttttattat tttttataaa aaaaaatttt 4680 agacgttttt atatttatac agtttcaact taaatttgaa aaaaaaaaaa aaaattgtca 4740 ccccctcccc gtccatactc tcacggtttc cgaaatcaag ttgtagcaag gtgatcccat 4800 aaagctaggg tgccaatttt tttcaagcat gctctagaca tagtaagcta acaaataagt 4860 cgtatacatg tttgtatgta tcttagatct cgagaaaatt tcaatcttgt aaaaagttac 4920 aaattttgaa aaactttggt tgggtgtagc aaggtgaaat tttttttttt tgaaaaatca 4980 aaatattatc ttaaaatcac tcacgattac acatcttttt agcttgcata catgttccaa 5040 aaaaaaattt gtaaaatttg aacacccc 5068 // ID Gypsy-24_DPu-I repbase; DNA; INV; 4839 BP. XX AC scaffold_38; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_DP_; KW Gypsy-24_DPu-LTR; Gypsy-24_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4839 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_38; Positions 568901 564063. XX CC 'GAGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 571..3969 FT /product="Gypsy-24_DPu-I_1p" FT /translation="MAAFKPYGKPPPFDLDEYKDSFELWHKKWTIFLSLST FT IDSALEEDERDLYKAHTLLSCLSTDTLQAVLSMGLTDEQLDSHTDVINHLR FT ARCNAGRNRHVWRQQFAAKTQGVQQSADDWLCELRDLARKCEFATDCCANC FT EKTRILGQLIFGVESDEVRVKLLEQGATLSLDQALTIIRTAEASNKQSKNL FT KTGDAAAIQGATSTYKRFKEKSGKPPAGGSSSAGAKFTGCWNCGSKTRCKP FT LTACPAQGKECKKCGLLNHYFKVCRNKAAAKKQQGIYIDPSPPTVGALNAS FT DLVELAVNPEKWTSSVVIDFLPDTGADLDAIPESLYKRKFSKVALQKGVQP FT VTAVGSPIISIGVFSAVIVWTTRDKVSRSVNTAIHVLRELKQPVLSKRSQQ FT MMGMLPAAYPHTYVGMVASTPPTDSQRQADLKTLMNECPRIFDGTCHQMSG FT PPCHFELVENARPVAMRGSRPVSVPLLPKVKAELDNLESANIIRKVVKPTA FT WVHPFLVPLKKNGDIRLVIDFRELNKCIIRPNFETATPFQAVRTIPPGLKF FT FTIIDALKGYHQVPLDDESIDLTTFSTPFGRYQYLRLPFGVSHAGDDYCRR FT VSEVFDDLPNCRRIVEDVLVFSATYDEHVVAVRRLFHRAAAHNIAINTAKI FT VFAQPAVTFGGYVVDADGFRPDPELTRAIRDFPTPSSITDVRSFFGLCQQV FT GNFSDQLAAALDPLSPLLNTGYTWEWTTQHEEAFTAARTLLSTVRDLAFYD FT PKRKTSLHVDASRLNGLGFLLKQLDDSKVWRVVQAGSRFLSSAETRYAMIE FT LECLSAAWAMNKCRQFLEGLPSFELVTDHKPLVPILNSYSLDKLDNPRLLR FT LRLKMQRYAFVARWVPGKQNSDADALSRAPVDKATTSDELGEGLPSYQGKI FT ALMSLSGNQLSQADPNLDPVLEKIKCAAANDPVMVKLRNQITSGFPNDKCN FT LDSDLRPFWSVKERLAIDDSDDMIVMGPRIVIPHSIRADILRDLVQMHQGA FT TKTCQRARLSVYWPNIDNDIVNATKHCETCTKHLPSQQPEPFRVRPPATRP FT FEQIHADFCELNGRHFLVMVDKFSGWPHVVAFRDKKTTARNVIGHCRSFFS FT NVGAPVTFWSDNGPQFGAAEFQPFF" XX SQ Sequence 4839 BP; 1234 A; 1262 C; 1146 G; 1197 T; 0 other; tggcgcagtt ggtgatcgtt ttcagtgaag gtacatttat cgtgcaattt attgtgacaa 60 tttattggcg ttaatttccc gaccagagtg tgtgtgaact gcagtttttg tgttgtcgcg 120 acagtttttc gtctcgaccg catggctgcg cacggcacta cccagttgac tagtaactat 180 tgtgtggcct aacacttttg tccgccattg tttcgattgg atcgggtgaa gacatttttt 240 ttgcatccaa gtttatccaa ggcatccgac tttaactttt ggtttgaggt gaagttttcg 300 ctcatgttgg ctttattcat ggtagcgatc gtgatttatc agccattgag caaacgtttg 360 aacatttttt cgtcatgctt tagttggtca ttgtcatggt agctatcgtg atcgatgggc 420 cactaagcgg tgcattttga gttcttgtct ccacgttaca gtgttattcg tcagcaacaa 480 gcatcacttt tttgtgtttt ttctctctcc aagctggaga gcgaggaaag ggagcttaac 540 attggctggc tttggagcaa gcacatcagc atggcggcgt ttaaaccgta cggtaaacct 600 ccacctttcg acctggacga atacaaggat tcgttcgaac tatggcacaa gaaatggaca 660 atttttctta gcctgtcgac tattgattcg gcactggaag aagatgagcg agacttatac 720 aaggctcaca cgctgttatc gtgcttgtca accgacacac tccaagcggt tctttccatg 780 ggcctaaccg acgagcagct cgacagtcac acggacgtca tcaaccatct ccgtgctcgt 840 tgcaatgccg gccgcaaccg ccacgtgtgg cgccaacagt tcgcggcaaa gacacaaggc 900 gtgcaacaat cagcagacga ttggttgtgt gagttgcggg acttggctcg aaaatgtgaa 960 ttcgcgacgg attgttgtgc gaattgtgaa aaaactcgca ttctaggcca gcttatattt 1020 ggcgtcgaga gtgatgaagt gcgcgtaaaa ctcctcgaac aaggtgcgac tttatcgctg 1080 gatcaggcgc taacaatcat tcgaactgcc gaagcatcaa acaagcagtc gaagaactta 1140 aagacaggtg acgctgcggc aattcaaggc gcaacctcta cttacaagcg tttcaaagaa 1200 aagtccggca agccgccagc aggtggcagc agttcagcag gcgcgaaatt cacgggctgc 1260 tggaattgcg gatcgaagac acgctgtaag cctttgacgg cctgtccggc ccaagggaag 1320 gagtgtaaaa agtgcggcct actgaaccat tacttcaaag tttgccggaa caaagcagca 1380 gccaagaaac aacagggcat ttacattgac ccctcccctc cgacagtggg ggctttaaac 1440 gcgagtgatt tagtggaact ggcagtcaat ccggaaaaat ggacatcatc tgttgtgatc 1500 gattttctcc cagacacagg cgctgacctc gacgccatcc cagaatcact ctacaagcgg 1560 aagtttagca aggtggcgct gcagaaaggt gtacaaccag ttacagcagt ggggagcccg 1620 atcatcagca ttggtgtttt cagcgcggtt attgtgtgga cgacacgtga caaagtgtcc 1680 cggtcagtga acaccgccat tcacgtgtta cgtgaactaa agcagccagt gttgtccaaa 1740 aggagccaac agatgatggg aatgcttcca gccgcctatc cgcacacgta cgttggaatg 1800 gtggcgagca cgccacctac cgactctcaa cgacaggccg atctcaaaac tctcatgaac 1860 gaatgcccgc ggatttttga cggaacatgc caccagatgt cgggaccacc atgtcatttc 1920 gaattggtgg aaaacgcgcg gccagtggca atgcgaggct cacgcccagt gtccgtccct 1980 ttgttaccaa aagtgaaagc agagttggac aatcttgaat ccgctaacat tatccgaaaa 2040 gtggtaaagc cgacggcttg ggtgcatcct ttcttagttc cgctgaagaa aaatggcgac 2100 atccgtctgg tgatcgactt tcgtgagttg aacaagtgca taatcaggcc aaacttcgag 2160 acggccacac cgtttcaagc agtgcgcacc attccgcccg gattgaaatt cttcaccatc 2220 attgacgcgc taaaaggtta tcatcaggta ccactagatg acgagtcgat cgacctgaca 2280 accttttcta ctcctttcgg gcgataccag taccttcgtc ttcccttcgg cgtctcacat 2340 gcaggtgacg actactgccg tcgtgtatca gaagtgtttg atgatctccc gaattgcaga 2400 cgcatcgttg aagatgtact cgtcttctcc gcaacttacg atgagcacgt cgtagcggta 2460 cgaagattat tccatcgagc agcagctcac aacattgcca tcaacaccgc caaaatagtg 2520 ttcgctcagc ccgccgttac tttcggtggt tacgtcgtcg acgctgatgg atttcgaccg 2580 gacccagagc tcacccgagc aatccgcgat ttcccaacac ctagctcgat tacagacgtc 2640 cgttcatttt ttggactctg tcagcaagtc ggcaactttt ccgaccaact ggccgctgca 2700 ttagatccat tatccccgct gctcaatact ggctacacgt gggaatggac aactcaacat 2760 gaagaagctt tcacggcagc aagaacactt ttatctaccg tgcgtgatct agcgttttac 2820 gacccaaagc gcaaaacaag ccttcacgtt gatgcatcgc ggctcaacgg actcggtttt 2880 ctcttaaaac agctcgacga ttccaaagtg tggcgtgtgg tgcaagcagg atcccgtttt 2940 ctctctagcg ccgagacacg ttatgccatg atcgaactcg agtgtttaag tgccgcttgg 3000 gccatgaata agtgtcgtca gtttcttgaa ggacttcctt cgtttgaatt agtgacggac 3060 cacaaaccat tagtgcccat actgaacagt tactctctag acaaactaga caacccccgt 3120 ttgcttcgct tacgcctcaa gatgcagcgc tatgcatttg ttgctcgttg ggtcccagga 3180 aaacaaaatt ccgacgcaga cgccctctct cgagccccgg tcgacaaagc aaccactagt 3240 gatgaactgg gtgaaggact accatcatat cagggaaaaa tcgctctgat gagtctctcc 3300 ggaaatcagc tgtcacaagc tgatccaaac ctcgacccag tgttagaaaa aatcaagtgt 3360 gctgcagcca acgatccggt tatggtgaaa ctgagaaatc aaatcacctc gggttttcct 3420 aacgacaagt gtaacttaga tagtgatttg cgtccctttt ggagtgtaaa agagcgtttg 3480 gcaatcgatg actctgatga catgatcgta atgggacctc gcatcgttat tccgcattcc 3540 attcgtgctg acattctgcg tgatttagtg cagatgcacc aaggtgcgac aaaaacatgc 3600 caacgagccc gtttatcggt atactggccg aacattgaca acgatatagt gaacgcaacc 3660 aaacattgtg aaacgtgtac caaacatcta ccttcacaac aaccagagcc atttcgagtg 3720 cgcccaccag cgactcgtcc atttgaacaa atccatgcgg atttctgtga attaaatggc 3780 cgccattttt tagtgatggt tgacaaattc agtggttggc cacatgttgt tgcttttcgt 3840 gacaaaaaaa caacggcacg caatgttatc ggccattgcc gcagtttctt ttccaacgta 3900 ggagccccag tcacattctg gtccgacaac gggccgcagt ttggtgcagc ggagttccaa 3960 cctttttttt aacggattgg gggattactt ctctcacgtc ttcaccgtat tacgctcagt 4020 ccaatggccg ggctgaagca gaaatcgact cgatgaaggc gctcatccgt ggatcatgga 4080 cagctggcgc attcaacgaa gaaaaattcg ccaaaagtat cctgcttttt cgaaacgcgc 4140 cgcgatctgg tggtgcaagc ccagcacagt tgattttcaa ccgcccaatg cgtgactgtc 4200 tgccagcaca tcgccgctca ttcgcgccag aatggcaaaa agcagccgac gtactcgaga 4260 aacgtgcgag acggtccaag gagctacaaa tagctcacta caacaaacac acgcgaccac 4320 ttgctccgtt tttcgtcggc aaccacgtct tgatccaaca tcccaccagc aagctgtggt 4380 gtacccccgg agtaatcgta gaagtcggcc atcatcgaga ctacctcatc aagactccag 4440 ccggacggat ttttcggcgc aaccggcgtt ttcttcgccc cctcgtaccc gtatttccga 4500 ctactacacc agctactaca ccagctacta caccaatcca gccggaagtc gttcagccgc 4560 ctcaaccgga acaaatcgag ccaccggtac aagagaatcc tgctccaccg gtgcaagaaa 4620 accctgttcc acccgtttta cgccgttcaa caagaacaaa acagccgcgg cgtcacttct 4680 tcccagccga gtggacgcaa taacacccat cccattatca tgctcaccca ttcatggtta 4740 tgtgtgtgtt gtgtattgtg tgttaccagt tgcgtttctt taaaatctta aagagcaaag 4800 ttaactatgt tgtgttagct ttgctcgttg aaaggggcg 4839 // ID Gypsy-1_PPP-I repbase; DNA; INV; 5137 BP. XX AC ADBJ01000004; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_PPP_; KW Gypsy-1_PPP-LTR; Gypsy-1_PPP-I. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-5137 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2158-2158 (2010). XX DR GenBank; ADBJ01000004; Positions 1175738 1180874. XX CC Positions [4040-4516] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 146..5074 FT /product="Gypsy-1_PPP-I_1p" FT /translation="MSDKTTLERILDGDTSIDYTKLKNVSGMTLKALADLA FT RKRIYEQVPDMEKLLNEYERLKSEYQLINMRYESAESIIRVLNTTVDTQAG FT FINSLKDQVKQQSTFLQHSTPRSNSLTSDSSQNHDDDENNNNRQSSSNHHD FT ENNNNRQSPPTESSTMRLSQLHSTGTSTSSQLKYNDAELRLLRLSISDINN FT RSIKTTRDNLTFRAKLHHDCSPERVEIAISTISQIEAVHSRLSFKSVEELR FT DITDTWEHFLNWLESALHVRSTMLGNLKGVVTNYVLDKNLSNYSRFMELVT FT EYHKTGVADDTLIVAFAKALNRYAPYLERNFASLAHLGDAAIHTFNNTGKK FT ITRRHLETWDLYLKDLSVSSRVLSNDDTETQRIRMGRSSFTKRRNHRMTRK FT SRATLQKKLPTSQPTTPKTKDQEAVTTKVLEVVTTTKRKPTSQIQINGLVW FT HSSGYYIKVVIGDQTFDAMVDLGSQASLIDPDLLPTLGLESSPCNEYLKPV FT AQVKGFYAKALVDTKVTIPGSIDDKDYNISVRSKLFVVKNQHKLIIGNNLF FT GLISPIAHLFADTPHLIIPVKLGKQIVSAKVLVTSLNKRAPKSMMLTNLNK FT LDLTKVSRTAVKAVASVPLPNNSSDATATTSVPTTPDVKDSLPSEPVATVA FT TEEPPITEQQSECATSDAVDNIDGVDLPVQEIDDDTDDDDEEEEDEDEPVL FT SSSDAALDAMAATILSAYKRIIVDALPENHSPDRGKFNMNILLKGDASAFN FT IKHGRRSIELESKISAEVAKLLEIGVIEEAPPNTEFCSPAFFVNKGTSKER FT MVVDFKHLNSMTVDDVFPMERLDEIIESIGGAKIFSVIDAKSGFYQMLLNP FT GSRRFTTFAANKRLYMFTRPCFGLKNSPAYFNRWLQHVLDPLVKKGFVRVY FT VDDILIFSKSVAEHEQHLKQVFELLDKNDVYVAKSKCHLFKYSVSYAGHML FT SDKGIKPLYNKVNAILNRSVPTTVKEMRSFIGAINHYRRYLNHMGPQLARL FT TSTISTKYRKINLTDQEIADFNDIKTELCSSRCLMSPRYDRTFHVYTDASD FT VGSGLMIAQYDDNNNLRPVLFDARKFDSAQRNYSARDRELLAFIHAVTRYG FT YLLSRPFVFHTDHKNLIYNSQNDMDNPRLVRWSEILSRFSFQTSYIPGKEN FT CMADYLSRAPDFYTPWDNDLLNDILASYSSELPKATRDWFNSFKRRQDITV FT INDLYYWIDGDNMRLVVLDPASITSIINEAHSSPYSGHVAYGRMLTKLNKS FT YVWPNMCDTISKFVKKCVQCQRSSIKKIKEGFLASLPLPDRPWCDISMDLL FT ALPAADTGEGNLFVVVDRFTKMTRLFPCHKDVTAIQLGNWFAREIIAVFGA FT PSSIVSDRDPKFTSELWTSSMKAIGTELKMTQPGRAQADGQTERTNRSILS FT HIRKWTDKKHSWATDIRFIEACINNNISYTTQFTPNQLLYGYEPKLPWNAN FT YFGIREYKDCQNQYRLEAKNNALDAQLQQAKQHDKVISNFNTYNINDLVLV FT RRSRLNTHKSSSSDDRKLTQSWGGPFIIVKQVSDINYRIRLQNKRATERTI FT HVDDIKPFIGDPTGTDLLDINAIIDKRQSKRGTGTAIEYLVKFDTDSDDHN FT KWINENTLIQYIPTMIESFNKQQKK" XX SQ Sequence 5137 BP; 1691 A; 1204 C; 958 G; 1284 T; 0 other; tctatcctat ttattaaact tttttaaatc ttttttttat ctttatatat aatatctatt 60 taggatatac ctcaatctcc ccggggagga tatcaattaa ttatatttat atattatctt 120 tatatacttt aaaataacat ttacaatgtc agataaaact acattagaaa gaattctcga 180 cggtgatacg tcaatagatt ataccaagct caagaacgtc agcggtatga cactcaaggc 240 tctcgctgac ttggcaagaa agagaatata cgagcaagtg cctgacatgg agaaactact 300 taatgaatac gaacgtctta agagcgagta tcaattgatc aatatgagat acgagtcagc 360 agaaagcatt atcagagttc tgaacacgac cgttgatact caagcaggtt tcatcaacag 420 cttaaaagac caagtaaaac aacaatcaac attccttcaa cactcaacac ctcgttccaa 480 ctctcttacg agcgacagtt cacaaaacca cgacgacgat gaaaacaaca acaatcgtca 540 gtcttcatca aaccaccatg atgaaaataa caacaatcga caatctccac ccactgaatc 600 atctacaatg agattatcac aattacattc aacaggtacc tcaactagct cccagttaaa 660 atacaacgat gccgaactta ggttactcag actatctatc agcgatatca ataacaggtc 720 aatcaagaca acaagagata acttgacctt tagagctaag ctccaccacg actgttcacc 780 agaacgagtc gaaatagcga tttccaccat ctcccagatc gaagcagttc attccagact 840 aagtttcaaa tcagttgaag aattgcgtga cattactgat acatgggaac acttcttaaa 900 ttggctagaa tccgctctgc atgtcagaag tactatgttg ggtaacctca agggtgtcgt 960 tacaaactac gtattggaca agaatctatc caattactca aggttcatgg agttggttac 1020 cgaataccac aagacaggag tagcggatga caccctaatc gttgcattcg ctaaagcgct 1080 caacaggtat gcaccctacc tcgagcgtaa cttcgcatca ctcgcccatc taggtgatgc 1140 tgctatccat acattcaaca ataccggaaa gaagatcacc agacgtcacc tggagacctg 1200 ggatctatac cttaaagatc tgtcagtgtc ctcaagagta ctctctaatg acgacacaga 1260 gacccaaaga ataagaatgg gaagaagcag tttcaccaag agaagaaatc acaggatgac 1320 aagaaagtcg cgagcgacgc tccaaaagaa gctgccgaca agccaaccta ctacaccaaa 1380 aacaaaggat caggaggcgg tcaccacaaa ggtactggag gtggtcacta ccaccaaaag 1440 aaagccgaca agccagattc aaattaacgg cctggtatgg cactcatctg gttactacat 1500 caaggtagtg attggtgacc agacattcga tgctatggtg gatctaggat cacaagcatc 1560 attaatcgat ccggacctac tacctacatt aggactggag tcatcaccgt gcaacgagta 1620 ccttaagccg gttgcacaag ttaagggttt ctacgccaag gcactcgtag acaccaaagt 1680 caccattcca ggatcaatcg acgacaaaga ctacaatata tccgtgagat ctaagttatt 1740 tgtcgtgaaa aatcaacaca agttgatcat tggtaacaat ctatttggat tgatttcacc 1800 aattgcacat ctttttgcag acactcctca cctcatcatt cccgtgaagt taggaaaaca 1860 aatcgtttct gccaaagtac tggttacatc tttgaacaag cgcgcaccaa agtcaatgat 1920 gcttaccaac ttgaataaac ttgacttgac caaggtcagt aggactgctg tgaaggcagt 1980 ggcatccgtg ccattgccta ataacagctc cgatgccact gccacaacat cggttcctac 2040 tacgcctgac gtcaaggaca gtttaccaag tgaacctgta gctacagttg ctacagaaga 2100 gccaccgatt accgaacaac aatccgagtg tgctacatct gatgcagtcg acaatatcga 2160 cggtgtagat ctccctgtac aagagatcga tgacgacaca gacgatgacg acgaagaaga 2220 agaagatgaa gacgaacctg tactctcgtc atccgacgcc gcattagacg ccatggccgc 2280 cactattctc tcagcttaca agcgtataat agtagacgcg ttgcctgaaa atcactcacc 2340 cgacaggggc aagtttaata tgaatattct actcaagggt gatgccagtg ctttcaacat 2400 taagcatgga cgacggtcca ttgaactgga gagtaagatc tccgctgaag ttgctaaact 2460 actggaaatt ggagttatag aggaagctcc acccaataca gaattctgct cacctgcttt 2520 ctttgtcaac aaaggtacaa gcaaagaaag aatggttgta gattttaaac atctgaattc 2580 catgaccgtg gacgacgtat tcccaatgga aagactagac gaaatcattg aatcgattgg 2640 cggtgcaaag atcttctccg tcatcgatgc caaatctggt ttctatcaga tgttactcaa 2700 tcctggttct aggcgattca ccacattcgc tgcgaacaaa agattgtaca tgttcacacg 2760 accttgcttc ggtttgaaaa acagtcctgc atatttcaac cgctggctac aacacgtcct 2820 tgatcctcta gtcaagaaag gtttcgtaag agtatacgtt gatgacattc tcatcttctc 2880 gaaatccgtg gctgaacatg aacaacatct caaacaagta tttgagctgc ttgacaaaaa 2940 cgatgtatac gtcgccaagt caaaatgcca cttgttcaaa tactctgtat cctacgccgg 3000 tcacatgttg tccgacaaag gaatcaaacc gttgtacaac aaggtaaacg cgattctcaa 3060 cagatcggta ccaactactg tgaaagaaat gagatctttc attggtgcaa ttaaccatta 3120 cagaaggtat ctcaatcaca tgggtccaca actggctaga ttgacaagta caatcagtac 3180 caagtacaga aaaatcaacc tcaccgatca ggagattgca gacttcaatg atattaaaac 3240 ggagttatgc tcatccagat gtcttatgtc gccacgctat gaccgaacat tccacgtcta 3300 cactgatgcg tcggatgtcg gatctggtct catgattgcc cagtatgatg acaacaacaa 3360 tctccgtcct gttctattcg acgcaaggaa atttgattct gcccaacgga actacagtgc 3420 acgtgatcgc gagttgctag ctttcatcca tgctgtcacc agatatggct acctactaag 3480 tagaccattt gtgtttcaca ccgatcacaa gaacctaatc tataattctc agaatgacat 3540 ggacaaccca cgactcgtca gatggagtga aatcctttct agattctctt tccagacatc 3600 gtacatccct ggaaaggaga actgcatggc cgactacttg tcccgtgctc ctgatttcta 3660 cactccttgg gacaacgatc tcctgaatga catccttgct tcttattctt cagaactccc 3720 caaagcaact agagattggt tcaattcatt caaacgtaga caggacatca ccgttatcaa 3780 tgatctttac tactggatcg acggtgataa tatgcgattg gtagtgctcg atccagcgtc 3840 gatcacatca atcataaacg aagcgcactc ttcaccgtat agtggtcacg tcgcttatgg 3900 aagaatgctc accaaactaa acaagtcata tgtgtggcca aacatgtgtg atactatctc 3960 taaatttgta aagaagtgtg ttcaatgcca acgttcatcc atcaagaaaa tcaaagaagg 4020 attcttggca tcactacctc ttccagacag accttggtgc gatatatcaa tggatctgtt 4080 agctttgcct gctgctgata caggcgaagg caatctgttc gtagtggtcg acagattcac 4140 caaaatgact cgattgttcc cttgtcacaa agatgtcact gctatccagc ttggtaattg 4200 gtttgcacgt gagatcattg cagtattcgg tgctccttca tctatagtat ctgacagaga 4260 tccaaagttc acatctgaat tatggacatc ttctatgaaa gctattggaa cagaacttaa 4320 gatgacacaa ccaggtcgtg cacaagcaga tggtcaaaca gaacgcacca acagatccat 4380 acttagtcat attagaaagt ggactgataa gaagcactca tgggcgaccg acattcgttt 4440 catcgaagca tgtatcaaca acaacatcag ctacacaaca caattcactc ccaatcaact 4500 actgtatggt tatgaaccaa agttaccatg gaacgccaac tactttggaa tcagagaata 4560 caaagactgt caaaatcagt atcgattgga agcaaagaac aacgcgctgg atgctcaact 4620 acaacaagca aaacagcatg acaaggtcat ctctaacttc aacacataca acatcaatga 4680 tttagtttta gttagaagat caagactcaa cacacacaaa tcaagctcat ctgacgatcg 4740 caagcttaca caatcatggg gtggaccatt cattattgtt aaacaagtat ctgacatcaa 4800 ctatagaatc agattacaga acaagcgtgc tacagaaaga acaattcatg tggacgatat 4860 caaaccattc attggtgacc caactggtac agatctactc gatatcaacg caatcatcga 4920 caaacgtcaa tccaaacgtg gaactggtac agctattgaa tacttagtta aattcgatac 4980 tgacagcgat gatcacaaca aatggataaa tgaaaacact ctcattcaat acattccaac 5040 catgatagaa tcattcaaca aacaacaaaa aaaataagta atctcaactc tgcttattct 5100 atttccaatt caatatttga tgtcatggag ggagata 5137 // ID NHAT-1_SM repbase; DNA; INV; 373 BP. XX AC . XX DT 15-JAN-2008 (Rel. 13.01, Created) DT 16-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Non-autonomous hAT-type DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW NHAT-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-373 RA Jurka J.; RT "NhAT1_SM: Non-autonomous hAT-type DNA transposon."; RL Repbase Reports 8(1), 22-22 (2008). XX DR [1] (Consensus) XX CC 8 bp TSD. Present in ~2000 copies in the genome. The youngest CC copies are ~99% identical to consensus sequence. It has been CC derived from hAT-11_SM like autonomous DNA transposon. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 373 BP; 136 A; 44 C; 66 G; 127 T; 0 other; cagggcttct taaaccatgg gtcgcgaccc catttggggt cgcgtagcaa aattctgggg 60 tcgcgagaga taaaaataca atttaacaga aaatgttttt taacttgtaa aattattgtt 120 ataaagataa aataacaatt atcgtagata aatatatcat aaaatcttct agttcggaaa 180 gattgtttgt acctgcattt ctattaagag tattgtaata acttgaataa catttactat 240 gaattagaat tagaagatgt tataggaatt ttaaaaaaat ataaatttat ttttctttta 300 tatttgtatg gggtcgtcaa gaaactcgca atcataaatg gggtcgtgaa ttacaaaagt 360 ttaagaagct ctg 373 // ID CR1-20_BF repbase; DNA; INV; 4768 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-20_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-20_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4768 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4768 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1591-1591 (2009). XX DR [2] (Consensus) XX SQ Sequence 4768 BP; 1443 A; 1083 C; 893 G; 1349 T; 0 other; aacaggtcac caatgagtct gttagtgctg agacattacc aaagtctaaa actttggcag 60 ctgaaacgtc acctaaacgc gtggctggtg agccagtgaa gaacagccta cctccttccc 120 catcagacat tgaaacgttg gaggctgaaa cgtcaccaaa gcgcgtggct ggtgagccag 180 tgaagatcag cctacctcct tccccttcag acattgatac gttggaggct gaaacgtcac 240 caaagcgcgt ggctggtgag ccagtgaaga tcagcctacc tccctcctca ccagtcccag 300 aagtcccgaa acctagtctt gaatccatca cactggaact tcggaatgta gcaaacacat 360 tggcctcagt tcaagaaaat gtgtacatgt tatcacaagg actacgggat gaaaatgcgt 420 ccctaaaaga agtgattcgt attcttatga ccgatggaaa acactcgtcc agtacagtca 480 gggaggacac atggaatcac gtgtctaaag gcccccctgc aactagtaat cttttccctt 540 cgagtagtac tagtgttccc ctgtcaaaca gatatcagtc acttctggat atagaggagg 600 agcgcagtga tgatgagtac caacgtagtc ccacgccaca agtaaagaac tctaggctgt 660 ttcccagaag atcaggtttc agtcaaggta cccgtggtcc acagtctccc agaagatcag 720 gtttcagtca aggcacccat ggtccacagt ctcccagaag atcaggtttc agtcaagcta 780 cccgtggccc acagtctccc agaagatcag gtttcagtca agctacccgt ccacagatgc 840 cccaacacaa aagtcagccc tggaaaagac cggatgttgt gattttggca gactccatga 900 cgaaagactt gaatccaagg cgtatgtccc gaagcactgg gaaagtagtc atatgtagga 960 cacacagagg agctcgttta gagcaggttg aaagattggc agagaatgat ctatctgaca 1020 tccaacctca gctgctcatt cttcacgttg ggacgaacaa ccttcatgaa gacatttctt 1080 ccataacgag aaaattcgag accctgggca acagtatgtc cggaaactcg actcaactct 1140 gcttctccag tttggcagtc agagacgggt ctactgtgaa agtgagcaaa gttaaccaat 1200 atctagcttc cttgtgtgct aggaaagggt gggggttttt ggacaacaaa gacattgagc 1260 tgcaccactt gtacgatgga gtccatctca ccgcgaaagg aacagctctt ctagccaccc 1320 atcttaacga ctacatccgg cagggttttc aggaggatgt attcaacagg agaggccgat 1380 ggagcgaagt ccaaaggaga ctagtgcaac tcagtcggaa gagttactga ctcagcagac 1440 agacacagac tcctgcctaa acgataataa taacgatgaa agattgtcaa ttgggaacat 1500 ttctcttagt agtagaactc tgagatcggt tagctcttct gcagcgcccc cgctacatag 1560 ggatgacaat agcaaagatc gtaaggaatt gcataaagta gatagcgcaa gccaaatcaa 1620 agttttatac atgaatggtc gtagtatcaa atctgttaac agtcggcgaa ataaattagt 1680 gctgttccaa aaccttgttg aatcaaatca gccttcttta gtggcagtaa ctgaaacttg 1740 gctaacccca agtatcaaag acgacgaatt gcttcccaag cgctacttgc tttatcgcaa 1800 agatcgttgt actgtcattc ccggcaaaat cggcgggggc attctcttag ctgtcaatgg 1860 caatattctg tcaagaagga ggtctgatat tgaacctgtt gacgaaatct tggtctgcga 1920 aatgctgaca gaaaaacagg ggaaaatcgg tatcatcttg ttttatagac cccctaatgg 1980 ggacttactg gcatttacta ataacctcca atatacattg cgtttggccg accaagaata 2040 caacaaggtg gtcttattag gtgatttgaa tctccctaat atcgattggt gtagtttgat 2100 aggcaattct actacagaaa accagttttg tgatgttttg aatgatttct cactaacaca 2160 ggttaataat gttacttcta acagtcataa ccatcttttg gatgtgattc tgactaacgt 2220 tcctgaaaaa tgttccaaag ttgaaaaagt atcatctgat tttcctactg atcacactat 2280 tctacagttt gacctgaatt ttcgtacggt attccgtcaa tcaaacatga aaagacatgt 2340 ctaccagtat aaaaaatgca actttgataa cctacgacaa agtctcagag acactctcgg 2400 ttctcaaggc tttgtagaat ctgatagcat tgatgactgt tgggatacgt ggttacaatg 2460 tttgtgtggc ttggttgaaa accatgtacc aaaagctaaa ataaaaaatt cttattccat 2520 gccatggtgt gatggtgatg tctaccatct tagaaataag aagttgactg catggcgaag 2580 ggcaaagaaa actaatcgcc cagcccactg gcataagttc cggaaactta gaaatggtct 2640 acaaaaactt attcattaca agtacaaact gtttcttgag aacttgggat ccttggtcaa 2700 taataatcct aagaaatttt ggactttctt taaaaataaa acaaaatcac attgcttgcc 2760 acaaattctg aaatacaaac attatcaagg aacgacacct actgataaag caaatctgtt 2820 taacaagttc ttcttctctg tttttactac ccctgtagta aatcagcctc tccctgagat 2880 cagtgcttgg agacatccct tcctcggcaa cattgacttt acaatcaagg aagttctaga 2940 tgtcttgctt accttagatt gtacaaaagc tattggccct gactctatat ccccagtgat 3000 cctacgcgag ggtgcacacg tattagctcc tcaacttact gccatcttca acaaaagtat 3060 tcaatcgggc atagttccag ctaaatggaa agaagccaac gtctgtcccg tctttaagaa 3120 aggcagtaaa gactctgttg aaaactatcg acccatttcc ttactcagtg tagtaagtaa 3180 ggttatggaa cgttgcattt ttaaccgtat tttccctcac cttcaagacc aaattcaccc 3240 actacaacat ggctttatta aaggaaaatc tactactacc caactattgg aaacctttca 3300 aaccgtcggg acaactttgg atcacagtgg ccagatagat ataatttacc tcgacttttc 3360 caaagctttt gacagcattc cacatcacct tcttgtacac aaacttaaga catttggatt 3420 ccactcgaat ctcctaaatt ggttctctga ttatctgtct aatagaagac aacgagtagt 3480 cattgaagga gtttcttcag actggcttcc tgtaatatcc ggagtcccac aaggttccat 3540 tttgggacct ctactttttt tactttatat taatgacctg ccctctacta tagaatgtcc 3600 catggccctt tatgctgatg actcaaaatg ttacaagcaa atttcatcac cagctgattg 3660 tatttccctt cagcatgata ttgacaacat ggtcgaatgg agtgtcacat ggatgatgaa 3720 atttaatacc tctaaatgta acgtgctcac tatctgtagg tctaaaacac cagtgctcta 3780 tgactacgaa attgagggac aagtgctctc atcagttacg gaatttaatg acctaggtgt 3840 aactatgtcc aactgcctat cttttaagag tcatataaaa aacatcacag ctagtgccaa 3900 ctcgaccctc ggctttatta agagatctgt tggttaccat gcccctgcaa atgttaagaa 3960 aatcctatat attacactag ttaggcccaa actcgaatat tgctcgttga tctggtcccc 4020 ccacactcat aacttaatta cttcccttga aaaagtccag agacgtgcta caaaatacat 4080 tctcaacaac catacaactg attacaggga tagattgatc tactgccaac tcttacctct 4140 atcttaccgc agagaattat tagacttatc cttcctattt aaatgtttct tgggacaata 4200 cggtattgac attcatcaat atataagttt tccactcaca tccctaagat ccaactcaca 4260 agccaagctc cagccccaaa aatgtctcac cactacgttt aaaaactatt tcttccaaag 4320 aatcgttcac atctggaacg ctttacccct caacattaga actctaaaac tttccccagt 4380 tgttacaatt aataccttta agaaagcaat cctagctcac tatctctctc ttaccaacga 4440 cagatttaac tccaacgacg tatgtacttg ggtcacccac tgtcagtgtc ccagttgtac 4500 acctttctag atatttcata tttcatctta ccattatttc tatgatttat ttctttatat 4560 ttgttattat tgtatatttg cttttattgt cgttgacaca atcattatta tttgattgta 4620 caattgttat tatttgttat cattatgatt atttttctat tattgtataa tttagttgca 4680 cgggaggcaa ctagtaaagg ttacaaagta cctgtttttg cctccctacc attttgtaaa 4740 cacatatggt gaagttcaaa taaataaa 4768 // ID Gypsy-219_AA-I repbase; DNA; INV; 5401 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-219_AA_; KW Gypsy-219_AA-LTR; Gypsy-219_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5401 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1041-1041 (2011). XX DR [2] (Consensus) XX CC Positions [4267-4737] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1644..3656 FT /product="Gypsy-219_AA-I_2p" FT /translation="MKQTFELAKRYQNDDFHGEAGTSRGPVKSRLGIRPYD FT YGQQARGYRPTGSKRVSFKPQGGAVPRARPDYSQMVCNFCGMKGHIKKKCF FT KLKNLHRDAVNLVDSYRPGPSADRHIAELMERMQTNESEDEQSDSGDCLEC FT MLVSSINKISDPCLVFVEIDGKTVEMEVDCGSSVSVISKSRYFSKFSNPLR FT KYSKKLIVVNGSKLKIEGEAKVMVKFNNKQEMMQLLVLDCDNEFYPLLGRT FT WLDVFYPNWRHYFTNSLVINSLETESSKIAVDEIKKKYSQVFEKNFSKPIK FT GFKAQLVMRDESPIFKKAYDVPYRLREKVADYLDKLEKEKVITSIKTSEWA FT SPIVIVMKKNNDIRLVIDCKVSINKSIIPNTYPLPTAQDIFANLAGCKIFC FT ALDLEGAYTQLELMERSKQFVVINTMKGLYTYNRLPQGASSSASIFQQVMD FT KVLEGIAYVSCYLDDVLIAGRNLEECRKNLFLVLERLAKANIKVNFEKCKF FT FVTELTHLGHIISEKGLMPCSDKIATIEKAKAPRNETELKSFLGLLNYYHK FT FIPHLSSKLYHLYNLLKNNVKFVWDDNCQKAFEESKSLLLKTNFLEFYDPK FT KPIVVISDASSYGLGGLIAHVIEEEEKPISFTSFSLNEAQKKIPYFTFRSV FT SIGLHDKKISQISVWSTLYSLH" FT CDS 3547..5283 FT /product="Gypsy-219_AA-I_1p" FT /translation="MKHKKKYPILHLEALALVCTIKKFHKYLYGQHFTVYT FT DHKPLVGIFGKTGKNSIFVTRLQRFILDLSIYDFDIIYRPSHKLGNADFCS FT RFPLRQDVPENLDPEIIKSINFSKHIPLDYQMIAEGTRNDEFLQEIKGFLH FT KGWPERVSKRFSQVFANQQELELVDDCLLYKDRVIIPKIYQYQVLKLLHNN FT HAGVVKMKHLARTMVYWYGINADIDNFVTQCDACNSMMIPHKSENSSKWMP FT TAKPFSRIHIDFFYFEHRTYLLIVDSFSKWIEIELMKKGTDCNKVLKKLIA FT FFARFGLPDVLVSDNGPPFNSHDFKFFLERQGIKVMKSPPYNPSSNGQAER FT LVRTVKEVLKRFLLEPEIMEIDLEDQISLFLFNYRNNNMTSDGHFPAERIF FT SYTPKTILDLINPKRQYKHHLEVPQPVDEIIKKSSAGKIPMYPKDTIDDLM FT VGDALWYKNHNPNHHSRWIKASFIKQYSPNTFQIRIGSAETMAHRDQIRIR FT KDDSTWRRPNVFITRCDPNKTLNDAVVNPSEGQEVRKGVDDGLLVESTRSR FT KRKRVVPEADLQEPRRSKRSKKVNRSDDYHYD" XX SQ Sequence 5401 BP; 1802 A; 872 C; 1130 G; 1596 T; 1 other; cccttcggtt taaatattta aaagtggcga acgagaaaaa aaaattgtgc gtgaaaatat 60 ataccccgga aaaagtggac agcttgtaaa gagccaactt tccagtttct gcacggcaca 120 tcggcggacg ccattgtatc tggggcattg aaggacggct gatcatttgt ggcattactg 180 cggcggtaga aaatcagcgg cttcatccga caggtgcttg aaatccagct gtgagtcgaa 240 atcaccgaat tgattggtaa ctcatcctcc gtcgtaagta gaaactctcc gtgggtaagc 300 ggagtgattt ctggttattt ttttatttct aacgttgcgt aaccattttg tgcaaagcgt 360 ctgaagagac aaagaaaagg ccttttcgtc tattgaaaca cacaacataa tagggtgacc 420 caatctttta tatgtttaac aattaatgat aatagacttg caaaaagatt tcttttgtgg 480 ttctatttaa taagagattt tattcttttt tttaacagat aacagctagc gtatgaagcc 540 attgtgtagt gatttttatc acatgattca ttagctgggg attgtttcta agcaaaaaga 600 aaagtttcaa cggaacggtt ttggataagt cactagcgcc tttggacatt gctggaaggt 660 tgagacacgg tccggaggtg agcctaatcc aaattcttca catcagcgac agttattaag 720 gatcgaccgt ttggaaaccc accaactgcg tccggcagga ttaacgacaa cggtcgtctc 780 gttggcaaag gtcaaaagga cattttcggt aagtcagttt tttatttttc tcttaattga 840 atcacaaagc cagtgatatt tttctaattg ctaattgagt ttgtacattt gttttgtttt 900 actattgtta ttttttgtgg atttgtgcaa ttatttctgg ccgttactcc tatattcttt 960 atcattgacg aactgcgtac atattttcgc atatattttg cataatcata taattgtata 1020 gaaaacatag aaagtttaac gatgagttcg aacattgcct gtaccattga gccataccgc 1080 agaggtgcta gttttaacga ttggtactcc aggctgaaat tcttttttaa agttaacaaa 1140 atcgtcgacg aagacaaaat ggcgtatttc gtcacactca gcgggccagt gatttttgaa 1200 gaaattaaac tactgtaccc tgctggcaat tttgaagagg cggctttcga cgatttaatc 1260 gcgaaattaa aaaaccgttt agacaaaata gaacccgatt tagtccagcg atataaattt 1320 agtaccagaa tccaaaatca cgacgaaact actgaagatt tcattttgtc gttaaaatta 1380 caagccgaat tttgtggttt tgggaatttt aaagatatag ctatcttaga tcgcattatt 1440 gcgggaatta aggacaaaaa cttgagacat cgattactag gagaagaaaa gctgactttg 1500 gtaaatgcag aaaaaatcat agcgacttgg gaggtggcaa aggcgaatgc tgatacagca 1560 gaggagagac aaacgtcgaa cccagtacag attgcagcaa tccagagcgg atcaaaggaa 1620 attatggaaa agcttatggg aaaatgaagc agacgtttga actagcgaaa agataccaaa 1680 atgatgactt ccatggagag gcaggaacca gcaggggtcc agttaaaagt cgtcttggga 1740 tcagaccata cgattatgga caacaagcga gaggatatcg accaaccgga tccaaaaggg 1800 ttagttttaa accacaagga ggagctgtac ctcgggctag accggattat tcgcaaatgg 1860 tttgcaattt ttgcggtatg aaaggccaca taaagaaaaa gtgttttaag ttgaagaatc 1920 tccacaggga tgcggtcaat ctagttgaca gttacagacc aggacctagt gcagatcgac 1980 acatagcaga actgatggag aggatgcaaa ctaacgagtc ggaagacgag cagagtgatt 2040 caggtgattg tttggaatgt atgttggttt cgtccattaa caaaataagt gatccgtgtt 2100 tggtttttgt tgaaattgat ggtaaaactg ttgagatgga agttgattgt ggttcgtcag 2160 tttctgttat tagtaaaagc agatattttt cgaaattttc aaaccctttg cgaaaatata 2220 gcaagaaact aatcgtggtc aatggatcaa aacttaaaat tgaaggtgag gcaaaggtca 2280 tggtcaaatt taacaataaa caagagatga tgcaattgct tgtgcttgat tgcgataacg 2340 aattttatcc tttattgggg cgtacttggc tagatgtgtt ttatccaaat tggagacatt 2400 attttactaa ttcattagtt attaatagtc ttgaaacaga gtccagcaaa attgctgttg 2460 acgaaataaa aaagaagtac agtcaggtgt ttgagaagaa tttttcaaaa cccattaaag 2520 gattcaaagc tcaacttgta atgagagatg agtcacctat ttttaaaaaa gcctacgacg 2580 ttccttatag acttagggaa aaagttgctg attatttgga caaattagag aaagaaaaag 2640 tcattacttc tatcaagaca agcgaatggg catcacctat tgttattgtt atgaagaaaa 2700 acaacgacat acgtttagtt attgattgca aagtatctat caacaaatca attataccaa 2760 atacttatcc attacctaca gcacaagata tttttgcaaa tttggcaggt tgtaagattt 2820 tttgtgccct tgatttagaa ggagcataca ctcaacttga gttgatggag aggtccaaac 2880 aatttgtggt tatcaatact atgaaaggat tatatacata taatagactt ccacaaggag 2940 cttcttcaag tgcttctatt tttcagcagg taatggataa ggttttagag ggtatagcgt 3000 atgtgtcatg ttacttagat gatgtgttaa ttgctgggag aaatctggaa gagtgtagga 3060 aaaatctttt tttggtctta gagagactag caaaagctaa tataaaggta aattttgaga 3120 aatgtaaatt tttcgttact gagctcactc atctagggca tataattagt gagaaagggc 3180 taatgccatg ctctgacaaa attgctacaa ttgagaaagc caaggctccc agaaacgaga 3240 ccgaattgaa atcatttctg ggtttgttga attactacca taaatttatt ccacatttgt 3300 cttcgaaatt atatcatttg tataatctat tgaaaaataa tgtaaaattc gtttgggatg 3360 acaactgtca aaaagctttt gaagaaagca aaagtctgtt attgaaaacc aattttttgg 3420 aattttatga ccctaaaaaa ccgatagtag ttatttctga tgcatcaagt tatggtctcg 3480 gcggtttgat agcacatgta atagaggaag aagagaagcc aattagtttc acatcatttt 3540 cattaaatga agcacaaaaa aaaataccct attttacatt tagaagcgtt agcattggtc 3600 tgcacgataa aaaaatttca caaatatctg tatggtcaac actttacagt ttacactgat 3660 cataagccgc tagtaggaat ttttggaaag acgggtaaaa actcgatatt tgtgaccaga 3720 cttcagagat ttatcttgga cttgtccata tatgattttg atataatcta tagaccatca 3780 cataaacttg ggaatgcgga tttttgttcg agattccctt tgaggcagga tgttcctgaa 3840 aaccttgatc cagaaattat taaaagtatt aatttcagca aacatattcc gttagattat 3900 caaatgatag cggagggtac gaggaatgat gaatttttac aagagatcaa aggattttta 3960 cacaaaggtt ggcctgaaag agtgagcaaa cggtttagtc aagtttttgc gaatcagcaa 4020 gaattagaat tagtcgacga ttgtttgctc tacaaggaca gagtgattat accaaaaata 4080 taccaatacc aagttttaaa actattgcat aataatcatg caggtgtggt caaaatgaaa 4140 catttggctc gaaccatggt ttactggtac ggaatcaatg cggatataga caattttgtt 4200 acacaatgtg atgcttgcaa tagcatgatg attccgcata aatcagagaa ttcgtccaaa 4260 tggatgccca cggctaaacc ttttagtaga atacatatag atttttttta ttttgaacat 4320 cgtacctatc tattgattgt tgacagtttc tccaaatgga tagaaataga actaatgaaa 4380 aaaggcactg attgcaacaa ggttttaaag aagttaatag cattttttgc tcgttttggc 4440 ttaccagatg tgttggtatc ggacaatggt cccccgttta attcccatga ttttaaattt 4500 tttcttgaga ggcaaggtat caaagtgatg aaaagccctc cttacaaccc ttcaagtaat 4560 ggacaagccg aaagattggt aagaacagtg aaagaggtgt tgaagagatt cttgttggaa 4620 ccagaaatca tggagataga tctggaagac cagataagtc tattcttatt taattataga 4680 aacaacaaca tgacaagtga tggacacttt ccagctgaga ggatattctc ttacacacca 4740 aagaccatct tagatttgat taaccccaaa agacagtata aacaccattt agaggtacca 4800 caaccagttg atgaaatcat caagaaatcc tcagcgggga aaatcccgat gtatccaaaa 4860 gataccattg atgatctgat ggtgggagat gctttatggt acaaaaatca taatcctaac 4920 caccattcta gatggataaa agcatctttc attaagcaat actctcccaa tacattccag 4980 atacgtattg gaagcgcaga gaccatggca catcgggacc aaatcaggat tcgtaaagat 5040 gactcaacgt ggcgaaggcc taacgttttc atcacgcgat gtgacccgaa caaaacattg 5100 aacgatgcag ttgtgaatcc aagcgaagga caagaggttc gcaaaggcgt agacgatggt 5160 ctgttggtcg aaagtacacg aagcagaaaa aggaaacgtg tggtccccga agctgattta 5220 caggagccta gacggtcgaa gcgaagtaaa aaagtgaacc gaagtgatga ttatcattat 5280 gattaagttt tgaattatct attaacgagc tcatgtttag attttaaatt tttgaattcg 5340 atttcttgat aaatgtcaag ttctgattat atawtatact ttcttggtaa agggggagaa 5400 t 5401 // ID R1_Ele1 repbase; DNA; INV; 5466 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-sequence-specific R1 clade non-LTR retrotransposon family DE from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5466 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5466 RA Kojima K.K. and Jurka J.; RT "Non-sequence-specific families of R1 clade non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (05-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >97% identity, and ~97% identical to the original CC sequence in [1]. This family shows no sequence specificity. XX FH Key Location/Qualifiers FT CDS 268..1839 FT /product="R1_Ele1_1p" FT /translation="MNTNLNQNQNNSSSAGSSSSREANPFARSGLARSPLR FT RAEAGGGSSARSASAGNHISSPANASGEVMDGAWLMRAINKSRDGLSAMEV FT AAQQLDSIIDFASSKSNISKDLKQALFRLRKSMFAAKQNHAEPMATVAAAE FT PVELKVPKSTQTEPFVFAGSPKSAEANAYNKQSQKRARQPSGEELPGGARK FT ARRILTPKTGRSAGKSDPSQASRKAEKGGPEKAGPSRSDGNRGLRPLRGPQ FT SPQVRADQGGDAPWTTVVRKKKKEKQEVQERRDTRPKKSRRVGAKREKGDA FT IIIKTEESKYSEVLKAMRSDAKLADLGADVRSVRRTRTGEMILELKRDKER FT KGAAYKSLAEEVLGEGVEVRALTQSVTLKVMNLDEITNAEELVTALRQQCE FT IQVPTAAVQLRKGPAGTQVALVHLPVADANKSAKVGKIKVGWSVCSLNIHE FT QPVICFRCREPGHKSWGCKGPDRTKLCRRCGEEGHKALGCKNPPKCLICSG FT KSVNNNHPTGGSRCPTFKRAINEKSQCR" FT CDS 1821..4808 FT /product="R1_Ele1_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RKVTVQVTQLNLNHCDAAQQLLYQAVAEWGTDIAIIS FT DPYRVPAGNGNWVVDGSGKMAAIWTTGKYPVQELVSTTYEGFVIAKVNGVL FT FCSCYAPPSWSTERFTQMLDCMTTVLTGRRPVVIAGDFNAWAVEWGSRITN FT QRGQILLEALAVLDVDLANVGTKSTFSRNGAESIIDVTFCSPGLTSSSNWR FT VDNAYTHSDHLAVRYSIDYNNSRQRVEEAARSRPSPRRWKTSYFNDEVLRE FT ALRRERNLLGLSGDELVAVLSRACDATMPRKVHPRNGRPPTYWWTQAIANL FT RRACLRARRRMQRARTEQEREERRAVFTAAKVALKSEIRASKKACFERLCQ FT SANANPWGDAYRIVMAKTRGAIAPTEQSPQMLEGIIEGLFPRHNPSPWPPF FT VGQPGIGAGDEDRVTDEELVGIAKSLSMGKAPGPDGVPNLALKVAILEAPG FT MFRSAMQICLDEGVFPDAWKRQSLVLLPKAGKPPGDPSAYRPICLIDTAGK FT VLEKIILNRLLRYTESVNGLSSNQFGFRKGKSTVDAILTVKKTAEIALQRK FT RRGIRYCAVVTLDVRNAFNSASWAAIADALLRLGIPEYLYKILGSYFQNRV FT LVYDTEVGRKCFHITSGVPQGSILGPVLWNVMYDEVLRLKFPAGVVIVGFA FT DDITLEVYGESIEEVELTAAHSIAIVEEWMSSRKLELAHHKTEAVVVNNRK FT SVQQAVISVGDCTITSKRSVKHLGVMIDDKLTFGSHVDYACKRASTAIVAL FT SRMMSNSSAVYASKRKLLASVASSILRYGGPAWGTALSTKSYRSKLESTYR FT LMCLRVASAYRTVSHDALCVITGMVPIGILIMEDIECFEMRGTRGIRRTAR FT LASMVKWQRAWDSSTKGVWTHRLIPRLDIWVNRRHGELTFHLTQVLSGHGC FT FRQYLHRFGHAGSPECPVCAGLEETAEHVLFVCPRFRTMRDRMLATCGRDT FT TPDNLVQRMCEDEFGWNAVSSAITHIVSELQRRWRVDSRSG" XX SQ Sequence 5466 BP; 1334 A; 1337 C; 1695 G; 1100 T; 0 other; agccgtgtag ttccggcgta tagaatagaa ctacggacga cctgttccgg tggtaagagt 60 ccaccaaacg gggtaacccc aattcaaggt gtgatgcgaa aaaccgtgct gaggaatgaa 120 tggtcgaagg ggtgaaaaag atgttcggcc gttaacggag cctgtggggc acctgggcac 180 ccctcacagt attttgtccc ttaccgcgtt aatgcagggc tctggcgtgg tggacctctt 240 ttcccgtgct actcgtggga tccaatcatg aatacaaact taaaccaaaa tcaaaataat 300 agtagtagtg caggtagtag cagtagtagg gaagcgaacc ccttcgcaag aagtgggtta 360 gctaggtctc cgttgaggag agcggaagca ggaggaggca gcagtgcacg tagtgccagt 420 gctggaaacc atatttcatc cccggctaac gcatcgggtg aggttatgga tggagcgtgg 480 ttgatgaggg ccatcaataa aagtagagat gggctctctg cgatggaagt ggctgcacag 540 cagctcgact ccataattga ctttgcgtcc tcgaagtcta acatcagcaa ggacctcaag 600 caggccctgt tcagacttcg taagtcgatg ttcgcggcca agcagaacca tgctgaaccc 660 atggcgactg tggctgcggc agaacccgtg gaattgaagg tgccgaagtc tacccagacg 720 gagcccttcg tcttcgcggg tagccccaaa agtgcggaag cgaatgctta caacaagcaa 780 tcgcagaagc gcgcgaggca gccgtcaggg gaggagctac ccggcggcgc tcgcaaggcc 840 aggcggatat taaccccgaa aaccggcaga agtgccggaa aatcggaccc cagccaggcg 900 tcccggaaag ccgagaaggg tgggcccgaa aaggctggcc cctcacggag tgatgggaac 960 agggggttgc gacctttgag aggtcctcaa tcaccacagg ttagggcgga tcaggggggg 1020 gacgccccct ggacaaccgt agtgagaaag aagaagaagg agaagcagga agttcaggag 1080 cgcagggata ccaggcctaa gaaaagtagg agggtaggtg ccaagcgcga aaagggtgat 1140 gcgatcatca tcaagacgga agagtccaag tactcggaag tcctgaaggc gatgcgcagt 1200 gacgcgaagc tcgcagatct tggagccgac gtacgcagtg tcagacgcac tcgtacaggt 1260 gaaatgattc tcgagcttaa gcgcgacaag gagcgcaagg gcgccgccta caaaagtttg 1320 gcggaagagg tccttggcga gggtgtcgaa gtgagggctc tgacgcagtc agtgactctg 1380 aaggtgatga accttgacga gatcaccaac gcagaagagc tcgtcacagc actgcggcaa 1440 cagtgcgaga ttcaggtgcc caccgctgcc gttcagctac ggaaaggtcc ggcaggtact 1500 caggtggcct tagtacacct acctgtggcg gacgcaaata agtccgctaa ggtaggcaag 1560 atcaaggttg gttggtcagt atgctcactg aacatacatg agcaaccggt gatctgcttt 1620 aggtgtaggg aaccaggaca caagtcctgg ggctgtaaag gccctgatag gaccaagttg 1680 tgtaggcgtt gtggtgagga aggtcataag gcactaggct gcaagaaccc tcccaagtgc 1740 ttgatttgtt ccggcaagtc cgtgaacaac aatcatccaa cgggaggctc aaggtgcccg 1800 accttcaaac gagccattaa cgaaaagtca cagtgcaggt aacgcagctg aacctgaacc 1860 actgtgacgc agctcagcaa ctgctgtacc aggcagttgc tgagtggggg acggacatcg 1920 ccatcatatc ggacccatac cgagtacccg ccggcaacgg caactgggtc gtggatggat 1980 ccggaaaaat ggcggcgata tggacgacgg gtaaataccc cgtccaggag ttggtgtcta 2040 ctacctacga gggcttcgtg atcgccaagg taaacggggt cctcttttgt agctgctatg 2100 cgcctccgag ttggtcgacc gagcggttca cgcagatgct ggactgcatg acgaccgtgt 2160 tgacagggcg aaggccagta gtaatagcgg gtgacttcaa tgcctgggcc gtggaatggg 2220 gaagccgcat cacgaaccag cgaggtcaaa tcctgctaga ggcactggcc gtgctagatg 2280 tcgatctggc taatgttggt accaaaagta ccttcagccg aaacggagcg gagtcgatta 2340 tcgacgttac tttttgtagt cctggcctaa cgagtagttc gaactggagg gtagacaatg 2400 cctacactca cagcgaccac ctggcggttc gctacagtat cgactacaac aacagcaggc 2460 agcgcgtaga ggaagcggct aggtcacggc caagccctcg taggtggaag acatcgtact 2520 tcaatgacga agtacttagg gaggcgctcc gtcgtgagcg taacctactc ggcctaagcg 2580 gggacgaact ggtagcggtg ctttcgcgtg cgtgcgatgc gaccatgcct aggaaagtcc 2640 accctagaaa tgggagacca ccgacgtact ggtggactca agcaattgcg aacctgcgcc 2700 gtgcctgcct acgggccagg aggcggatgc agcgagcacg taccgagcag gagcgtgaag 2760 aacgacgggc ggtgttcacc gctgccaaag tcgcgctgaa gtccgagata agggcaagca 2820 aaaaggcctg ctttgagaga ctctgtcaga gtgccaacgc gaacccgtgg ggtgatgcct 2880 acaggatcgt tatggcgaag acaagaggtg caattgctcc tacggagcaa tctccacaga 2940 tgctggaggg gatcatcgag gggctctttc cgcgccacaa ccctagccca tggcctcctt 3000 ttgtaggaca gccggggatt ggggctggcg atgaggatag agtaaccgat gaggaacttg 3060 tagggattgc aaaatcccta agcatgggga aggcaccagg tccggacgga gttccaaacc 3120 tggccctcaa agtcgcaatc ttggaggctc ccggtatgtt cagatctgct atgcagatat 3180 gcctggacga gggagtattt ccagatgcgt ggaagaggca gagcctggta ctattgccaa 3240 aggcggggaa accacctggt gacccgtcgg cgtatagacc aatatgcttg attgacacgg 3300 cggggaaggt gctcgagaag atcatcctca acagactgtt gaggtacact gagagtgtaa 3360 atggtctctc aagcaaccag ttcggcttcc ggaaagggaa gtccaccgta gacgctattc 3420 tgacggttaa gaaaaccgct gagatagcac tccagcgtaa gaggaggggg attcgctact 3480 gcgcagtagt gactctggat gtaaggaacg catttaatag tgccagttgg gcggctattg 3540 ccgatgcgct cctgcgtctg gggatacccg agtacctgta caagattctc ggaagttact 3600 tccagaatcg ggtattagtc tatgacacag aggtgggtcg gaagtgcttt cacataacct 3660 caggagtccc gcaaggttcc atcctgggcc cggtgttatg gaatgtcatg tacgacgagg 3720 tgttgagatt aaaattcccg gcgggtgtgg tcatcgttgg ctttgccgac gatattacgc 3780 tggaggtcta cggtgaatcg atcgaagaag tggaattgac tgcagcccac tcgatcgcaa 3840 ttgtggagga gtggatgagc tccaggaaac tggaattggc tcaccacaaa actgaggcgg 3900 ttgttgtcaa caaccgaaag tcggtgcagc aagcggtgat cagtgtaggc gactgcacga 3960 tcacttcgaa gcgctccgtc aaacacttgg gggtgatgat cgacgataag cttaccttcg 4020 gtagccacgt cgattatgcc tgcaagagag cctccacagc tattgtggca ctgtcccgga 4080 tgatgtccaa tagctctgcg gtgtacgcca gcaagcgaaa gcttctggcc agtgtcgcct 4140 cgtccatact gaggtatggt ggcccggctt ggggcacggc cttaagtacc aagagctacc 4200 gtagcaagct agagagtact tataggctaa tgtgcctgag ggttgcgagc gcgtaccgta 4260 ccgtgtcaca cgatgcactc tgcgtcatca ccggtatggt gcctattggt atccttatca 4320 tggaagacat agagtgcttc gaaatgcgcg gcacaagagg catacgcagg actgccagac 4380 tggcctccat ggtcaaatgg cagcgtgcgt gggacagttc caccaaggga gtgtggactc 4440 acaggttgat tccgaggtta gatatctggg tcaataggcg ccatggggaa ctaacattcc 4500 acctgacaca ggtcctttcg ggccatggtt gctttagaca gtatctacac cgtttcggtc 4560 atgcgggttc tcccgaatgt ccagtttgtg caggtttaga ggaaacggcg gaacacgttt 4620 tgttcgtgtg cccgcgtttc cgcacaatgc gtgaccgcat gttagccaca tgcggtcggg 4680 acacgactcc ggacaatttg gtccagagga tgtgtgaaga cgagtttggc tggaatgccg 4740 tttcatcggc tatcacccac atcgtctcgg aactgcaaag gaggtggcga gtggactcga 4800 ggagtggcta gtgcagacgc taaacaacag gtggtccaag ggttcgcagt cggctacgta 4860 ggtcataccg gtgccctacg gtcgaaatcg acccttacag cgattaagtg gccgcgggga 4920 gaacatcctg gtagcgctgc tgtcgtggcg ccggcctact gggtggatac gagcctctgg 4980 ttgttcgggg caggtggagg ccccttcgtc agcaatccca gctggtgcta gctgataggg 5040 cctgagcctt cagtaggtca aattgcgaag cccgcagtat cagttcttga tacctgcggt 5100 gcagctgggc gcgggcgtag tggtcgaccc tgcccgcttt cagcggacaa cgggaggtga 5160 ggaccacctg ggaagctggc aaagcgccag catgctacct tggtggaccc ccctaagcgt 5220 gtcatcgatg ttcgttgctg catggctacg cagctaacct ggaggatgcg atgtgcaata 5280 gcccctctcc gaagcaatgc cttcttggtg gtcccggaga gacgaagggt ttggcggcaa 5340 tggaaatggt ttagtgggtc gggggtgtag tcctgtttac tgcttgcatg tagtaaatgg 5400 tccctaaccc cacacggcgt ctgttgagca gattatcccc ccattgctta gaagaaaaaa 5460 aaaaaa 5466 // ID DNA8-64B_AP repbase; DNA; INV; 924 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-64B_AP. XX NM DNA8-64B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-924 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1998-1998 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 924 BP; 270 A; 123 C; 158 G; 371 T; 2 other; actagggctg ggatttcgat gcactttgca ttgttttcca aagcttactg tgcaatgctc 60 gtcttgtcac aaatttgttt tatgtacatt tgaatgagaa taagaaagtt tattttgcat 120 ttttttgcat tttttggcat tttttagttt ttaggtcatt ttcttaattt tttagggcat 180 tttagagttt ttggagcatt ttttgagttt ttggaaacca agtgtatgaa tttgaaattt 240 tctattttaa gatttttaca ccaacatggc ttttaaataa attaactagg gctgggattt 300 taatgctaga ttataagtag gtactcgtaa ttcgtattat attatcggct tatcgccatt 360 atcggacaag gtttgagtaa cctctacacg catttcctgt taaataatct taaatcggta 420 aaatcgacgt ccgatgatat cgttcgacgt tcagtacgag ttttgtcgtc gttaacaccg 480 ttttattact ctagttaatt ttttttgata atttaatata aaaatgccaa aagtaaaatg 540 ttcgacagcg agtcgactgc gtgcgtttgt acaagaattt ggacataata ttttttxtac 600 cgacggaxcg gtgttattct gtaaaatatg caacgtgaaa gttactgctg aaaaacgatt 660 tgccgtacaa caacatttgt caagagataa acatatcaat ggtttacact cgtaatagat 720 ttttataatg cattttttaa aactttttat tgtaatttat actaaaatta ttaaaaacaa 780 aaaaattttt agatcataaa agcatttttt ttagggcatt tttcgggttt ttttagggca 840 ttatttagag tttttaggtc ataaaagcat tttttttagg acatttttcg ggttttttta 900 gggcattaaa atcccagccc tagt 924 // ID Copia-3_DPu-I repbase; DNA; INV; 5035 BP. XX AC scaffold_55; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_DPu_; KW Copia-3_DPu-LTR; Copia-3_DPu-I. XX NM Copia-3_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5035 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 669-669 (2010). XX DR Genome; scaffold_55; Positions 586087 581053. XX CC Positions [2004-2528] - Integrase core CC 'CTCCT' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 502..2040 FT /product="Copia-3_DPu-I_1p" FT /translation="MQVFAALQHDQAQADPLVDGQPAQPVIEAQALDEPTL FT NQAAIDEWEQREINARAMILFNVDDDQQVSIRACRTAMEMWDQLNMEYSLM FT APDHAIMSLGNFFQYKYNPSHTISGHIATLKRMYDELLGTPGQVSEEQLKM FT VILKTLPPSFDRLRSAWDSVPIQERTVTALTSRLMIEERRTLEKHNGQRDP FT DDVAYFAVDTTYPEQGLAAQSHRRGGRGGNRGYRGKRGDYQGRGRRYDNQN FT SDEKTCFYCDKPNHVILNCRTRLRHEAEERQRNARNDVISKNRDINTKKIG FT FYSSSTCFSARSTDDFFADSGASHHMSDQRSYFSSMTPIQKGQWLVKGIGG FT ISLPVLGQGTIDFTATVDGKQFPGEFKIVLFVPSLNANLISVGTATNAGVE FT IIFTGNTVLFKHEGTVIMTGQRSGKELYHLNITVNDRTTAAVAKQKIPFSI FT VHQRFAHLNCRAIQRMAEKNVVDGLDLQDSKTPSDPCNGCIYGKMHHRCPF FT PKDELELPDQIKSSTQM" XX SQ Sequence 5035 BP; 1659 A; 1146 C; 1064 G; 1166 T; 0 other; ggttatgggc ccaggtttct ttcagtaaac tgcaaaaaaa agaaaaattt attatggcat 60 ctgctacaac aacccacgag gatagaaatt ttccaaagct gaacggcaag aacttcccat 120 catggcagac taacatgctg gttttgctcg agcaaaaggg aaagctgcac aagattgtca 180 agaaaactga cataaagcct gcaccggtac taaaatttat ttttttttgt gtgtgtctga 240 gtttctctat aagttaagac ttaatcatac atatgatcct gcaagcatgg tgtgggtaaa 300 atttcttttt attatcgtat gatgtcatgt atgttaacgt catttatctt tggagtgaat 360 atgcaaaaaa aaaaaagata aagtgttaat cgtggtgtga gcaaagaccc acgatacgtc 420 tgtatctatt catgagtctc aagtcttcca ctctaaagtg atgcgagtag tacccttact 480 aactcaagat tatactttgt catgcaggtg ttcgctgcat tgcagcatga tcaagctcaa 540 gcagatccac tggtggatgg ccaaccagca cagcctgtca ttgaagctca agccttggac 600 gaaccaacat tgaaccaagc agccatagac gaatgggaac agagagagat taatgcaaga 660 gccatgatat tattcaatgt cgatgatgac caacaagtct ctatcagagc atgcagaaca 720 gctatggaaa tgtgggacca actcaacatg gagtattccc tgatggcacc agaccatgct 780 atcatgagcc taggaaactt cttccagtac aagtataatc caagtcacac aatttctggc 840 cacattgcta ctctgaaaag aatgtacgat gagctccttg ggacgcctgg ccaagtctcg 900 gaagaacaat tgaaaatggt gattttgaaa actttgccgc caagcttcga ccgacttcgt 960 tcagcctggg atagcgtccc aatacaagaa agaacggtta ctgccctgac ttcaagactg 1020 atgatcgaag aacgccgaac gcttgaaaaa cacaatggac aaagagaccc tgatgatgtt 1080 gcctactttg ctgttgacac cacttatcct gaacaaggtc tcgctgctca aagtcatcgt 1140 cgtggaggaa gaggaggcaa tcggggctac agaggaaaaa gaggtgacta ccaaggaaga 1200 ggtcgcagat acgacaacca gaattcagat gagaagacgt gcttctactg tgataagccg 1260 aatcatgtga tactcaattg tagaacccgt ttgcgtcatg aagctgaaga aagacagaga 1320 aatgccagaa atgatgttat atccaagaat cgagatatca atacaaagaa aattggtttc 1380 tattcgtcgt ctacctgttt ttccgcgcgc agcactgacg attttttcgc cgattcaggg 1440 gccagtcatc acatgtcaga ccagcgttcc tacttctctt caatgacacc catacagaaa 1500 gggcaatggc tagtaaaggg aatcggtgga atctccttac cagtccttgg ccaaggcacc 1560 attgacttta ctgcaacagt tgacggaaaa caattcccgg gagagttcaa aattgtttta 1620 ttcgttccct ctcttaacgc gaacttgatc tcagtcggga ctgccaccaa tgctggtgta 1680 gaaataatat tcaccggtaa cactgtactg ttcaaacacg aaggtactgt catcatgact 1740 ggacagagat caggtaagga actctatcat cttaatatta ctgtgaacga tcgaactaca 1800 gctgctgtag caaaacagaa gatccctttc tccatcgtgc accaaagatt tgcccatcta 1860 aactgccgtg ccatacaaag aatggccgag aaaaacgtcg tcgacggatt ggacctgcaa 1920 gactcgaaga caccatctga tccgtgtaat gggtgcattt atgggaagat gcatcataga 1980 tgtccctttc caaaggacga actcgagcta cccgaccaga tcaaatcatc cactcagatg 2040 taggtggacc attcaggata ccttctattt gtggaggacg gtattacgtc atattcaaag 2100 atgatttttc cggctattca gaaggtttca tcatgaaaaa taaatccgaa gtcaaggaac 2160 tattcgtcaa attctgcgcg gccactaaac gtcaaactgg tagactagtt ggaacccttc 2220 acactgacat gggaaaggag tatgaaaaac aatggtttac tgactatctc gcaaaggagg 2280 gaatactaca tgagactaca gcatcttaca ccccacaaca aaatggagtg gctgagagag 2340 ccaacagaac tctaatggag gcggaaagaa gtatcatgtt taacaatccg gaatcatcac 2400 taaacaagca gaaaaagtct ctccttgaac tgtggggacc tttcctgctt gcctcaatct 2460 atgttcgcaa ccggtcgatg accaacactg aagacgcaac tccctaccaa aaatatttcg 2520 acaaaattcc aaaagtcgac catctgtgag tgatcggatg tagatcgatg gtacacgtcc 2580 ctactgctct tcgtcacaaa cttcaaccca aagcagaaga atgctggttt atcggatact 2640 gcaaaactac aaaagcgtgg attttctgga acgacaaaac aagaaaaact attgtcagca 2700 gagatgcgga gttcttcgag aaggaaatat actgtgggaa cacagacgaa cagcaaagca 2760 gcacaactct agaattcgca gaaccaattg aaattgtgac aacaatactg gtaatatttt 2820 tttttatata atctaacaca aagataatta actaataaca cgatctgtgt ccgtttgatg 2880 tttatcgttt atttatggcc cactgcctac gcaacagcaa aagtcagttg aaaatattac 2940 ggaagcggaa ccacttggtc caccacccga ccaatcagaa aacgatgaaa ctgcaggtcc 3000 ggctgactca gaaaatatca accctagtag taacaacgat gaagcatcac ttgaacctca 3060 cgccgagact catcgaaata gagaacaact gcagccagca tcagcaatcc cggaacagat 3120 tgttgacatg gatgccgatc aagctcctga gcagattact gccgtgcaac aaaaccatca 3180 tcaaataatc gaacagcaac tgcccagcaa caatggcctc cgccggtcaa atcgtatcaa 3240 gcttatggcc tggcaaaaga gtttggcaaa aatggttgac tgggtgcctt acggtaataa 3300 cgatagaaat agtaaacatg acaccatgtg cctaaatatg catatcaaat aactctttct 3360 tgccctcatg tgttagatga agaagtacct cttcaaaggt gctcggctct cctggcagaa 3420 ccagatgaac cgcagagcta taaacagctc atgatgtcag aaaacgcaga acactggaaa 3480 gaagccatga aagaagaata caattaccta atgaaaaacc aaacttggat tctcgccaaa 3540 ctgccaccag gtcgcattgc tctgaaccac aaatggatcg gaaaatataa accggcatat 3600 ccaggtgttg aggagcgctg gaaagccaga ttgaccgtcg tcggaacaag gcagcaatat 3660 aagatcgact tcaatgaaac cttcgcccct gttccgaagc aaagtgcaat caaaatattc 3720 ctgtcctatg cagcagccat ggaccttgaa ctgactcagt ttgatattaa aactgcattt 3780 ctctatgcaa aactaaaaga aactatatac atgaagcagc ctgagggttt cgtagtggaa 3840 ggtaaagagg accacgtttg tcaacttctc aaatcactct atggccttaa gcaggctccg 3900 tacgaatgaa atgaagaatt caacgagttc atactcgcgt ttggtcttgt aaggagtgag 3960 gccgatcctt gaacatctag agagaaaatt cgaaatccgc acactcacca actaacccac 4020 cactaaacca accaccaacc acactcacca acaagattcg ttggcttgaa cattgttcgg 4080 gaccgcccta atcgcctcat gtacatcgac caagaaaaca caattgacaa aatgttaaaa 4140 gactacaaca tgtgggactg caagccatca tcacttccag ccgacccaaa cgctcggctc 4200 tcagcctgtt cgaagcccaa gagtgaggag gagaaagaag gtaaagatga gctggaggaa 4260 catactgaag aagaagagaa agaaatggaa aaaataccgt accgagaagc agtaggatct 4320 ctaatctatc tggccaccac ctcacgccca gacatatcat ttgccgtcgg tcaagtctcc 4380 agatattgtg ccaagtatac aagaacacac tggaatgcgg taaaaaggat cttcgcctat 4440 ctaaaaggta caaagaacct aaaactatgc tacggtggaa ctgaactacc tcccgccgtc 4500 gtctactgtg acgcggatta tgctggtgac ctcgacgatc gacgatcaac atctggctac 4560 atcttctttt acaatggcgg gccggtatgt tgggcgagca ggaaacagtc catcacagcc 4620 caatctactg gagagtctga atatttatct gcgaacgaaa ctacaaaaga aagtgcttgg 4680 ttctgccaag ttgtactgga catcaccgga ataaatataa tgcccctcat gattaagtgt 4740 gacaatgacg ctgccatcgc tgcagctcaa cgccctcatg gacaaggccg catgaagcat 4800 attcccatca agatgaatct agtccgcgaa cacataagga ccggcaagat aaaaatggaa 4860 tacgtcgcgt cagaagatca actggcagat atatttacca aaccgctgaa atcaccagcc 4920 ttccaggaga tgaggaaaag gattggacta agttaagcat cagaacaaaa tacataaggt 4980 aagaacacat cttcaatgag tcagtcaatt gcccaattat aagattgagg aggag 5035 // ID Chapaev-11_HM repbase; DNA; INV; 3352 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3352 RA Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(12), 1826-1826 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(549..848,1136..2629) FT /product="Chapaev-11_HM_1p" FT /translation="MNSTCCCCWEKYGERSKSLKKMNKNLEEQIQMFIFPG FT YSMNNDCYPSVVCPGCRRNLYRLKKGDSPRGEWGAKVTKVILMIFSIFFSF FT FFFLFIYKCIFIVASGIIKSKMATESLIDGSTFCLSTGGNPLKVTVGTPEN FT KSKRTSIKQLSIEIIKQLQIVLELSNRKTKEMLSTLRKGLGNKLSIEPNIF FT GRLSELEEYISNYYSVQKCEFVDSKGHLILRDLVYVQNTSDFVLDLLKLRG FT LDPTSAFIRISLDGGGSFFKVIINVFDCQKDNESDEYLNSGVQRSQFLAIV FT EDIPESNYNLRLVIEKLNLQDVSFYVAFDLKCGNALFGLSGHSGKKACLWC FT EGISTLNLGTKRTLGSIDYWYDKYSLENNSVKSSMQEFMNCINPRILYINL FT DPNTLIEQLVPPPELHLLIGFVSLLGNFLLDVWPGFDDWLKSKNVIQRGYQ FT GRGWDGNNSNKILENLDSLETQILETFTRLIPIVQCLRDFRNVKNCCFSSN FT LQTGFKEAITNLKNSFLSAQEVAINLNKQINTTWKVHILLCHVQPYVEHHN FT KGLGNFAEQCGESIHAKFKPTWTRFKRQIEHSEHGDRLLSAVVDFGAKRI* FT " XX SQ Sequence 3352 BP; 1132 A; 462 C; 538 G; 1220 T; 0 other; cactgtgcac cagagagacc gccaggtttt ttgtcgtggc actcaaacca agtccttatt 60 ttatgaaagt attatatata ttgttttaga aaatatattt tttgaaatcg aaacgcgttg 120 ctatcatacc ttttttgata taaatatatt ataaaatggt tataaaattt cttaattttt 180 cgccggatct tgacttgaaa aaaatgttta aaaaaatgtt tttatccttg gaaaccactg 240 atagatattt ttttaatgga ggtaacaaaa tttattttgt aaaatatatt gtttaatata 300 atagtttaaa aataaaatta taaatgtcaa aaaaatattt ttaacaaatt tttatatata 360 gttaactaaa tattagctat ttattatatt aattatactg taacctcata ttagatgggc 420 ttataaattt aattaccccc tgtactaact aatcaaaaaa aattcagaga ttaaaaagat 480 ggtagtataa aaattttcaa attgcggaag gtcgtgctat ttaggagttg tatttctttc 540 acttaaaaat gaattctact tgttgctgtt gctgggaaaa gtatggcgaa agatcaaaat 600 cattgaaaaa aatgaataaa aacttagaag aacaaattca aatgtttatc tttcctggat 660 attcgatgaa caacgattgc tatccatcag ttgtttgtcc tggttgccga agaaatcttt 720 acagactcaa aaagggagac tcacctcgtg gtgaatgggg agctaaggtt accaaggtaa 780 ttttaatgat tttttccatt tttttttctt tttttttttt tttgtttatt tataaatgta 840 tatttattta ggtagagtgg aaaacccttt tgagaaatag ccccagaacc agtctaactc 900 ttccagatga gataactaat ttaccgacaa ctaagtcaag tatatgttct ttttgctata 960 cattacctgc acctggttat tctcataact gtacacccac cagtgctgtg gcaaatatca 1020 ttacactgtc atttcttcta ggttctcttc aagctgaaca ggtaaaaaaa actatatttt 1080 gtttataagt ttttagtttt ttatgcaaat ttcattttgt gaaaacttta cgtaggttgc 1140 ttctggaatt ataaaaagta aaatggctac agaaagtttg attgacggat caacgttttg 1200 tttatctaca ggaggaaatc cgctcaaagt cacagttggt acaccagaaa acaaatcaaa 1260 aagaacttct attaagcagc tttccatcga aataataaag caattacaaa ttgtgttaga 1320 actttcaaat agaaaaacaa aagagatgct ttctacttta cgcaaaggtt taggaaataa 1380 gctttccatt gagcctaata tttttggtcg tctttctgaa ttagaagaat atatttctaa 1440 ttactacagt gtgcagaagt gcgagtttgt tgatagtaaa ggccatttaa ttttacgaga 1500 tcttgtatat gttcaaaata catcagattt cgtacttgac ttattaaaat tgcgtggttt 1560 agaccctact tcagccttta taagaatttc attggatggt ggaggttcat tttttaaagt 1620 aatcatcaat gtctttgatt gtcaaaaaga taatgagtca gacgagtact taaatagtgg 1680 cgttcaacgg tctcaatttt tagctattgt tgaagatatt cctgaatcta actataattt 1740 gagacttgtt atagaaaaat taaatttgca agatgtcagt ttttatgttg cttttgactt 1800 gaaatgtgga aacgcgctgt ttggtttatc aggacattct gggaaaaaag cttgtttatg 1860 gtgtgaagga atatctacat taaatttagg gacaaaacga actcttggca gtattgatta 1920 ttggtacgat aaatacagtt tagaaaataa ctcagtaaaa tcatccatgc aagagtttat 1980 gaactgcatc aatcctcgaa ttttatatat taatttagat ccaaatactt taattgagca 2040 actcgttcct ccacctgaac ttcatttgtt aattggtttt gtttcactgc ttggaaattt 2100 tttgttagat gtctggccag gatttgatga ttggctaaag tcaaaaaatg ttattcagcg 2160 agggtaccaa ggcagaggct gggacggaaa caattcgaat aaaatattag aaaatctgga 2220 ttctcttgaa acacaaattt tagaaacatt tactcgtttg attccaattg ttcagtgttt 2280 gagagatttt agaaatgtaa aaaattgttg cttttccagc aatttacaaa ctggcttcaa 2340 agaagcaata acaaatttaa agaattcatt tctttccgca caagaagttg caataaattt 2400 aaataaacaa ataaatacaa catggaaagt ccacattcta ctgtgtcatg tacagccata 2460 tgtggaacac cataacaaag gattgggtaa ctttgctgaa caatgtggag aaagtattca 2520 tgcaaagttt aagccaacat ggacaaggtt taagagacaa atagagcatt ccgaacatgg 2580 tgacaggcta ttatcagcag tagttgattt cggagcaaaa agaatttgat tttgtttata 2640 aatttattta tatagttata atatatatat atatatatat atatatatat acatgtatga 2700 ttatgttata aatgtacagt tttctgagca atgcagatag tttgtttcaa atttgagtat 2760 acaagaaata gttttgaatg tgtagacaaa aacacttaat acgtagatcg actcaatagt 2820 tatttttaat caaagaagag ttaaatttat ctgtatcttt gacatattgc tctggagcag 2880 gaagaagaat tgttcgtatt ttgttgaatt atgtacaata ttttgataaa tattataaat 2940 attttgattt ttttttgttt gtttatactt tgtttaaaat atttgacatt tatttttaaa 3000 aattattgta acgggcatct caaatgttta aatgggcata ggcccctcca ttttaccctt 3060 tgcctcattt acgacccttg actttttttt tttttaaaaa aaaggttttt tctattttta 3120 ttaaacaaat acttttgaat taaaaacttt tttttgtaac ttgcaactag catacctaaa 3180 tgggcttgtg ccccccttgt atttaccctt ttgccccatt tatgaccctt agcaatttgt 3240 ttggatttta aaaatcattt tttttattgt tataatacaa atactcttga ataaaaaagg 3300 tttcatttga gtgccacgac ataaaacctg gcggtctctc tggtgcacag tg 3352 // ID Gypsy-181_AA-I repbase; DNA; INV; 5357 BP. XX AC AAGE02024317; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-181_AA_; KW Gypsy-181_AA-LTR; Gypsy-181_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5357 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024317; Positions 19011 13655. XX CC Positions [4403-4852] - Integrase core CC 'ACGTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2384..5332 FT /product="Gypsy-181_AA-I_1p" FT /translation="MGDVKKAGVRVSSSTKDSDRRFLSYGSSTPLSVRGTF FT TAEIKIQNRASFAEFYVVEKGQTCLLGDATAKQLRVLKIGVNVNSVEKKEP FT FACIKNVEVHIHMEPDAKPVIQPVRRLPIPLEAKVNRKLDELLARDIIEPK FT SGPTSWVSPLVVVAKANGGLRLCVDLRRVNQAILREHHPMPVIEQILARIG FT NGTVWSKLDIKDSFLQVMVAAKSRDILTFITEKGLFRFKRLPFGLITAPEI FT FQRVVDEILCGCEGTHWYLDDIFVEGETLEEHDARLEKVMTRLTERGVQLN FT LEKCQFRVTELEFLGHKITEFGIAPSDSKVAAIIAFRLPTTEGEVCSFLGL FT ANYLNKYIPHLATLDEPLRELTRKEVKFSWKSRHETSFQAIKHAMSDVRIL FT GFYNTYDRTAVVADASPTGLGAMLVQTDSQGKDRVISFASKSLTETEKRYC FT QTEKEALGLVWAVEKFQFYLLMRKFDLVTDCKALSYLFTPRSRPCSRIERW FT VLRLQCFYYSIVHIRGDQNVADVLSRLSCTQVEPFDSAEEIIVQHIAVVDA FT KAVAVSWEELVRESRDDAEITNVFECLKQNSTESLPVTYRIVSSELCDVEG FT VLLRGDRIVVPVALRQRVLNLPHNGHPGVRMMKSFLRSAVWWPKLDADVES FT LVKSCRGCVFVSSPDVPEPMLRRALPSGPWEDIAIDFLGLLPDGHHLLVIV FT DCYSRYLEICEMSCIDSTETISRLREVFSRFGIPTLIKADNGPQFSSNEFS FT TFCDDYGIQLVNTIPYWPQMNGQVERQNRSILKRLRIAQELGKDWRKELHD FT FLLVYHATTGEAPSKLMFGRRIKSQLPHVPVHYDDEAVRDFDGLQKEKGKC FT YSDRKRQAAFSNVKEGDKVFVKRMKKNHKLEADYSPEEYVVVKKTGSDVVV FT WSRVSGKEFRRNVTHLKKIPEEQGTVQITSTEEFDHAIHRNGSDGQAPQIC FT DTSQQQQVPLRYQRERQPPTKFKDSISH" XX SQ Sequence 5357 BP; 1597 A; 948 C; 1347 G; 1465 T; 0 other; atttggcgac gaaggtgaat tcaaggtagg catttattct gaaattccgg gaagtttgaa 60 gaagaaatgg ctgaaatatt tctggcggat gttatacatt cctcgagaaa agatggcgtc 120 gtcgaaacga aaaaatctgg gcaagttggc agtgatgtcg ttgggttgtg tgagatacgg 180 tgttgcaaga gggtgtacat gtgttggtag tggaatctct gagacatgcg attgagaaga 240 atctatattt ttgagagaaa gttacacaaa aagaaaactg aaataggggt gatcatgtta 300 taattgtgtt agaatcatta caaaatacaa agtgaattat gaatatgaaa caaatgaaca 360 attaaattcg tgcgggaata cacaaaaagc ttatgcttag cgtgaggcac aaatttgtgt 420 taatcaaaat agtagaatca aacgtcatgg tgatttacaa atgtttgggt gagagaaaag 480 aaaaaaaaaa catcgtatgg gctgatcgtt atttgagaat gaatgctggt gtggggccag 540 ttttgtacaa aatacaataa ttgtcatgag ggttgacaaa tgtttgtgtg agacaagaga 600 aaaacatcgt gtgggtcgat tgttgattca gttgagattg aatactggtg tggggccagt 660 tttgtataaa atgcaatatt taattgtcat gagggttgac aattgtttgt gtgagacgag 720 agaaaaaaca tcgtgtgggt cgatcgttgg ttcagttgag attgaatact ggtgtggggt 780 cagtttttta caaaatgcta taatgtcatg agggttgacc aatacttgtg agggacaaga 840 aaagagatcg tttggggcga tcgataatca gcttaattgg tgtggggcca atttcagagg 900 caagatactt cctatccatg aatctaagat cataagggtt gatcgaaatt cgtgagggac 960 gaatatgaat gaatgagttg gacaatcaat cagggtaaac ggcccttcag tggaagaagc 1020 ttcaatagtg gaggtgctac accaatggta cattcaccct aaggaagatg ttcaggcaag 1080 attggagaaa attaaatatt tgtgacaatg ttgtgggaac aagatttgaa tttgagtttt 1140 attatgagtg atactatgat aaatcaagcg ctggtgatat agtttaagaa ataaatatat 1200 ttttttgttt atgttggaat gatacatgaa tggatggttt tgtcgacttg ttgattacag 1260 atggatgata cgagacctat gccacaattc aggtgtgagg atatcgcaaa gtcaagattg 1320 catcatgagt ggaaggattg gaaaagcgga ttggagagat acttcgatgc caatgatata 1380 actgaccagt aaaagaaacg ggcgaagttg ttatatttgg gtggccctca actggacaag 1440 gtgttcacca accttcccga tgggaacaaa tttccattgg ttgctacaga aaaaaggttc 1500 tatgatgttg ccattgcagc attggataac tactttcaac cagtgaagca ggatattcta 1560 gaacgtcacc gtttgcgtca aatgaaacag ctaccgggtg aaaagttttc acactatatg 1620 gtagttaatt tattttttta aatcgtttcc aaaattctta ctttgaaatg ttttcatgca 1680 actttaggtt cgtttaagac agcaagccac actgtgtggt tttgataagt actcgaaaac 1740 aacgaagcag gtcctgactg agatgatgat aacggatgtc attgtagaag gttgcttgtc 1800 ggtcgaactc cgacgaagga tgctgctgaa ggaccaaact ttagatgaaa tcgaagagat 1860 tgcgacaggt ctggaaggtg ttgattctca aatacaagac ctaacgagaa agataagcga 1920 cagtcaggac ggtaccaagg tgtatagagt tagggatcgt tccacaccgg ggttcaggcg 1980 tcctccagtc catgcgtcta agttagataa aagtcgttat cgcgggcgtc agtttccacg 2040 ctcaaatatt atttgcttta attgtggtag atcaggacat attgctacat ccgattattg 2100 ccctgcaaaa ggtaaacagt gccataatag caagcgaatg ggccattttg acttcacgtg 2160 caaatcgcgt cgaatttcgt tggaaaaaaa cacaaccatc accaccgagt aagaaagttc 2220 gggtggttga gcaacaggaa tcacccagcc ctgatgaagg aaaagtttat tacgcttttt 2280 tcaacggtaa cctcagtaac atgctaacgt ttgaagtggg aggtgtgtct ttgaagatgc 2340 ttgtagattc tggggcagat gctaatttgg tgaccctcga ggcatgggag atgttaaaaa 2400 agcaggtgtt cgagtctcca gttctacgaa agattcggat cgtcgcttcc tctcctatgg 2460 tagtagcacc cctctttcag ttcgaggaac atttacggca gaaatcaaaa ttcaaaatcg 2520 agcctctttt gctgaattct atgtagttga gaaagggcaa acctgcttac ttggtgacgc 2580 cacggcaaag cagctccgag ttttgaaaat tggcgtaaat gtgaatagcg ttgagaaaaa 2640 ggaaccgttt gcgtgcatca aaaacgttga agtgcacatt catatggaac cggatgcaaa 2700 gcctgtcatc caaccggtga gaagattgcc tatcccactc gaagctaagg tcaatcgaaa 2760 gctagacgaa ttgttagcaa gggatatcat cgagccaaaa tcaggtccaa cttcgtgggt 2820 ctctccctta gtagttgttg cgaaggcaaa cggcgggctg cgattgtgtg tggatcttcg 2880 aagggttaat caagctattt tgagggaaca ccatcccatg ccggtgatag agcagatact 2940 ggcgcgtatc ggaaacggta ccgtatggag taagttggat ataaaggatt cgtttctcca 3000 agtcatggtt gctgcaaagt ctcgagacat tctcacgttt ataactgaga agggattatt 3060 taggttcaaa cgtttaccat ttggcttgat aacggcgccg gaaatatttc aacgagtcgt 3120 cgatgagatc ctctgtggct gtgagggtac tcattggtat cttgatgata tttttgtgga 3180 gggtgaaact cttgaagagc atgatgcgag gcttgaaaag gttatgacaa ggcttacaga 3240 acgaggagtc caattgaatt tggaaaaatg tcagttcagg gtgacagagc ttgagtttct 3300 tggtcataaa atcacggaat ttggaatagc accgtcagat tcaaaagtgg cagcaataat 3360 cgcttttcgc cttccgacaa ccgaaggtga agtctgcagc ttcttagggc tggcaaacta 3420 tcttaacaag tatattcccc atttggcaac ccttgacgaa cctctgaggg agctaactag 3480 gaaggaagtt aagttctctt ggaaatcacg tcatgaaaca tctttccagg cgataaaaca 3540 tgcaatgtca gacgttcgca tcctcggatt ctacaatact tatgacagga cggcagtggt 3600 tgctgatgcg agccctacgg gtcttggggc gatgttagtg caaacagata gccaagggaa 3660 agatcgagta attagctttg cgtcgaaatc actaacagaa acagagaaga gatattgcca 3720 gacggagaag gaagctcttg gtctagtttg ggccgtcgaa aaattccaat tctatctgtt 3780 gatgcgaaaa tttgacttgg tgacggattg caaagctcta tcatacctgt tcacgccccg 3840 atctcgtccg tgttcaagga tcgaacggtg ggttctgagg ttgcagtgtt tctattattc 3900 gatcgttcat attcgagggg atcaaaacgt tgcagatgtt ctttcaagac tatcttgtac 3960 tcaggttgaa ccttttgatt cagctgaaga gataattgta caacatatcg ctgtggtgga 4020 cgccaaagcc gttgctgtat cttgggaaga attggtccgt gagtctcgag acgatgcaga 4080 aatcacgaat gttttcgaat gtttgaaaca aaactctaca gaatcattac ctgtaactta 4140 tcgtattgtt tcctcagaat tatgtgatgt tgaaggagtc cttctgcgtg gtgaccgcat 4200 agtagtacca gttgccctac gccaacgagt tctgaactta ccacacaatg gtcatcctgg 4260 ggtacgaatg atgaagtcat ttctccgatc tgccgtttgg tggcccaaat tagacgcaga 4320 tgtggaaagt cttgtgaaaa gttgccgtgg atgtgttttc gtttcttctc cggatgttcc 4380 cgaaccaatg ctcagaagag cacttccttc tggaccatgg gaagacattg ccattgattt 4440 ccttggcctt cttcccgatg gacatcactt gctggtaatc gtagactgtt atagtcggta 4500 tctagaaatt tgcgaaatga gctgcatcga ttcgacggag acaattagtc gtttaagaga 4560 ggtgttcagt cggtttggta ttcctacgtt gatcaaggcg gataacggcc cacaattctc 4620 cagcaatgag ttcagtacat tctgcgatga ctacggtata cagctggtaa acacgatccc 4680 gtattggccg cagatgaacg gccaagttga gcgacaaaat cgctcaatct taaagcgact 4740 gcgtatcgcc caagaattgg gcaaagattg gcgaaaggaa ctacacgact ttcttctcgt 4800 ataccatgca actacaggtg aagcaccttc gaaacttatg ttcgggcgtc gaattaaaag 4860 tcaactacca catgttccgg ttcattatga tgacgaggcc gtgcgtgatt ttgatggtct 4920 ccagaaagag aagggaaagt gttattctga taggaagagg caggccgcat ttagcaatgt 4980 caaagaggga gataaagtgt tcgtgaaacg catgaagaag aatcataaat tggaagcgga 5040 ttattcgcca gaggagtacg tagttgttaa gaagaccggc tctgatgttg tcgtctggtc 5100 cagggtatct ggaaaggaat ttcgtcgtaa cgttacacat cttaagaaga tccctgagga 5160 acaaggcact gtacaaataa cttccaccga ggaattcgat catgctatac ataggaacgg 5220 ttcagatgga caagcgccac agatctgcga taccagtcaa caacagcaag taccgttacg 5280 ttatcaacga gaacgacaac cgccaacaaa gttcaaagat agcatatctc attgaattaa 5340 gaaagttatg gtggggt 5357 // ID Gypsy-96_CQ-I repbase; DNA; INV; 1906 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-96_CQ_; KW Gypsy-96_CQ-LTR; Gypsy-96_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1906 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 571-571 (2011). XX DR [2] (Consensus) XX CC 'AAACAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 95..1906 FT /product="Gypsy-96_CQ-I_1p" FT /translation="MDGASSMDQQPAGQQQQMPGGVAPVEQQPVPDGQVAA FT GQQQQAFIRSPLHPPPELQRLLEQQRGYVDEMFRQQHEVVRLQLARIQQQQ FT LEFMQRQGHILRNIVAAVNEQVSPHPEAQECAKMSLTMGNSRRAEEVRFEQ FT QESNIASSIVEGVNTGSDGDRRKFVPIVNNKSARVNLDTGSATTKASTGTW FT RKVGCHPTSTSVAEAVAAQMLEVDGKFYYEMTVGGWTFVGLARATRKQLFG FT CDTTDGFGPSQVPNVNISQQGGNSSPIGGIFKAVMKVDSGCLSSFVDRKLA FT ADVVPGFRSAERRTTAKYRPLEDVLDQLKQDLFFSPEAIPERFLVVREEGE FT RLQGKYTCRRQYKHQQPGTTAVLEASQQLTEPKLAGPWRISGLLDDLEEPL FT AVHKNEAGRSITTIQMQVLVVCPKAYGSIPNRQGVGCASPSEAILERIRTK FT LEFEPPPSTTSSALASERTKRTRMFARHDSVYSKYTDSKRQWSTGVDYDRS FT NMVMDDRRQIEPHAAQPRSSADWDPGPETVRKAVAAAGETGTFLALGLWLS FT QQPLTNITITPAVRVVKERTCAIGTGAMLFFKVEETCSGSTRTSSIKAGGC FT WSQLKQ" XX SQ Sequence 1906 BP; 456 A; 492 C; 592 G; 366 T; 0 other; aaagtggcga ctcggtgatt gttttgaccg agttttctcg gtcgaaattt cgcccacgcg 60 gaaaagtttt ctccggggaa gatccggacg cgagatggac ggtgcgtcct ccatggacca 120 gcagccggcg ggtcagcagc agcagatgcc gggcggcgtt gcgccagtgg agcagcagcc 180 ggtgccggac gggcaagtcg cggcggggca acagcagcag gcgttcatcc gttctccact 240 ccatcctccg ccagaacttc aacgcttgct ggaacaacag cgaggttatg tggacgagat 300 gtttcggcag caacacgagg tggtcagact tcagctggca cggattcagc agcagcagct 360 ggaatttatg cagcgccaag gccacattct gcgaaacatc gtcgcggcgg tgaacgagca 420 ggtttcgccg cacccggagg ctcaggagtg cgcaaaaatg agcctcacga tgggcaactc 480 caggagggct gaggaggtta ggttcgagca gcaggaaagc aacatcgcca gttcgatcgt 540 tgagggtgtc aacacgggca gtgatggaga tcgccgtaag tttgtgccga ttgtaaacaa 600 caagtcggct cgtgtaaatc ttgacacggg atctgccacc actaaagcat cgaccggaac 660 gtggcgcaaa gttggctgtc atccgacgtc aacatccgta gcggaagcgg ttgcggccca 720 gatgctggag gttgacggga agttttacta cgagatgacc gtcggcgggt ggacattcgt 780 tggtctggcc cgggcgacgc gaaagcagtt gtttggttgc gacacgactg acggtttcgg 840 accgtcgcag gtgccgaacg taaacatctc gcaacagggt ggcaactcat ctccaatcgg 900 tggtattttc aaggcggtga tgaaggtgga ctccggctgt ttgagcagtt tcgtcgacag 960 aaagctggca gcggacgtgg tacctgggtt ccggagtgcg gaacgccgaa caaccgcaaa 1020 gtatcgaccc ctggaagacg tgctggacca gctcaagcaa gatctgttct tctcacccga 1080 ggccattccg gagcgatttc tggttgtgcg ggaggaaggt gagcggctcc aaggcaagta 1140 cacatgtcgt cgccagtaca aacatcaaca accaggcacg acggcagtgt tggaagcgtc 1200 gcagcagctg actgaaccga agctggctgg accgtggaga atttctggtc tccttgatga 1260 cctcgaggaa ccgctagcag tgcacaagaa cgaggcgggg aggagcatca ccacaatcca 1320 aatgcaagtg ctggtagtct gtccgaaggc gtacggtagc ataccgaacc gacagggtgt 1380 cggctgtgct tcgccttctg aagcgatttt ggagcgcatt cgaacaaaat tggaatttga 1440 accgccacca tcaacaacgt catcagctct tgcaagcgag cgaacgaagc gaactcgaat 1500 gtttgcgcga catgattcgg tgtattcgaa gtacactgac agcaagaggc agtggtcaac 1560 aggtgtggac tacgatcgaa gcaacatggt gatggacgac cgacgacaga ttgaaccaca 1620 tgctgcccag ccacgcagtt ctgccgattg ggatccagga ccagaaactg tgaggaaagc 1680 ggtcgcagcg gccggggaga ctggaacgtt tcttgcgctc gggctctggc tgtctcaaca 1740 acctcttacc aacatcacga tcaccccagc agtccgtgtc gtcaaggaac ggacctgcgc 1800 catcggtacc ggtgccatgc tgttcttcaa ggttgaagaa acctgcagtg gttcgacccg 1860 taccagttca attaaagcgg ggggatgttg gagccaactg aaacag 1906 // ID Gypsy-31-LTR_NVi repbase; DNA; INV; 311 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-31-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-311 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 999-999 (2009). XX DR [1] (Consensus) XX SQ Sequence 311 BP; 61 A; 108 C; 83 G; 59 T; 0 other; tgtcacggac gagcaccagg ctctccgtcg gcacatccgg tggtaccacc gacgatgcag 60 gtgcatcgtc agcgaggtaa gccgccacct gagagctcgc gcgtgcagaa atgcaaggta 120 tgcatctcca cccgcgcgac cgctcgccgt cgacgagtcg cgtcgacgac agagcgggca 180 gatcaccggc gaccaccaaa ctggacggag ctcccttttt gggactaata cacgagcttg 240 ttctccacct ctgccttctt cttttctgcg cctccaagtg ctctctggga gcacccaccc 300 attccgtgac a 311 // ID Polinton-1_NVi repbase; DNA; INV; 14499 BP. XX AC . XX DT 10-APR-2009 (Rel. 14.04, Created) DT 10-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-14499 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 791-791 (2009). XX DR [1] (Consensus) XX CC The sequence is incomplete. XX FH Key Location/Qualifiers FT CDS 9202..7967 FT /product="Polinton-1_NVi_3p" FT /translation="MTQSLPCDGFKWVEDLNCDFFNVPDDAPVGYILEVDL FT EYPESLHDTHKDLPLCPEHAAPPGSNQEKLLTTLNNKERYVLHYVALKQAL FT KYGLRLKHIHRALQFNQKPWLKPYIDLNSEMRKNAKNEFEKMLFKLFNNAV FT YGKTMENERKRVDVKLVNKWEGRYGCEAYIAQPNFHSCAIFNENLVAVQLA FT RTEISVRKPIYIGLXVLDLSKSLVYRFHYEYMQERVGDKAKLLYTDTDSLI FT YEVSDVDMYAVMKTDLHEFDTSDYPADNQFNIPLVNKKKVGLMKDECNGNI FT MTEFVGLRSKMYSVKVQGQQPIKKAKGVKSSVVKSTIEFDDYIHCLRDKAT FT ITRQQKNIRSRLHVIRTEKEQKIALSPHDDKRYLIPGQTDTLPWGHYMIDG FT YQLIEEDEPSAKRVRLH*" FT CDS join(11951..11481,11477..10179) FT /product="Polinton-1_NVi_4p" FT /translation="KVISENSATFQQNCRTTNCCPTNCRTSFSPLISEKKM FT DLTNEFVDIVKELWRIVKYACDGDLNSSDLQRSIDLTSKAISEVEELIESS FT ALNIYKKKTCQTSCGQLYSIHRLLNNCLLKVGSGLEGVTPSVTWDDTKSAF FT KNNLKTGVITNVKYLDIIFMEDACVLFMLKIEEALLVHKSIKVDTILAAEY FT MIVEENKEIVEVKYFNTKSASIYATSDLKEWFAVNVKQPIERDMEEFQERD FT SGWTLRSILNLTVNMKKFSPMKAASYIELPMAIQKKRACVNVQNQDNECFK FT WAILSALHPVDINAQRVSKYQAYANELNFKDIEFPVSVKQIPKFEKQNDVS FT VNLFILKKKGSVFNVSCHLTASKKDXHVNLLXVQDTYIDXXEEEQQGNLRV FT PRFHXVWIXDLSRLVNLQLSKNSHKKHLCDRCLHYFHDKPKLKAHVVDCLQ FT ANMCKVSLPQEKDSVLEFKNVNNKERVPIVVYADFECILKPVDDNRAYQEH FT EPFSIGCYVQYTYDEFKYISYRRKTEQDEDPAKWFVSQLYMLAEELTNMHK FT NPLPMKDMDQRSFDCATTCHICGGPFNVTDIKIQDHCHLTGQ*" FT CDS join(4241..3441,3487..3182,3186..2761,2768..1485) FT /product="Polinton-1_NVi_1p" FT /translation="MIVYKRRGRGLVNRLINNLPIELHLPGYQYCGPGTKL FT TKRLARGDPGINPLDVACKEHDIVYSQNRENIEARNAADRVLAHKAWQRVV FT SKDSGVKEKAAAFAVTSAMKLKSKFGMGVPFKKIVKAASKSIVPSKCARKV FT ILSALKGARKAVKEAGGKRNIGIPRILPVPSKVGGFLPFLIPIFAGLSATG FT AIAGGAAGIAKAVNDAKSAKQALEESQRHNRKMEDIALGKGLYLKPHKTGL FT GLRLKPGNELRKKKDCRRKITSESSHEKKKIVDVRLPQRALTNIDLLKYAK FT ILKIPYFRGVFMRNALPINGPHYRESAIVNLDDATGPGTHWVAYRKRGSEV FT VYFDSFGDLQPPLELMLYLGVDKIKVKYNYERYQDYDTFNCGHLCLEFLEN FT QKDSIGQLLGFTQRLLSSNVSHSSDLPVSILKVNVLRVERKITTGAYINGK FT KVHTIHEFFPAVPPGYKIIAVXSQVIYLRITVKSIDQLQIRIVDEDGHLVN FT FRGEVITVRLHLKQVSKYNGYRLRSRVYKESKMSKSHQSSANVKNANTAKD FT SVSKKPRLKSWWKITCIFFCCQNGRNSHYEIHAHQPYNVSSYNNSDEIRIS FT VQHQDLCLLPSASSLYVCGRLTKADGTLVKNTKFVNNAVCHMFEEIRYVIW FT VSINPSQSLIIHNAGWLDVEEKENLINSEGYFDISIPLSMFICFTEDYRKV FT VVNVKHEIILTRSRNDLNSVIQTPTKVATATTAAEYENFKIELMKVEWLMP FT YVVLSNQHKIRMLSHIQKGKSIDMSFRSWELYEYPLLPTTPKHVWTVKTSN FT QLEKPRFVILGFQTNRKEAKEINASRFDHCNISNVKLFLNSQYYPYGNLNL FT DIERNQYAMLYDMYANFQHAYYDKSIEPMLKKQHFINYLPLIVIDCSKQNE FT ALKNASVDVRLEFESKDNFPAGTSAYCLIIHDRS*" FT CDS join(4341..5039,5075..6004,6008..6394,6348..7316) FT /product="Polinton-1_NVi_2p" FT /translation="MFKKQSVKLPVVNFDTIAEPEKKNKRHGDLLPDSIRA FT VFCGPSNCGKTNSLLALITHPNGVRFENVYVYSKSLNQSKYKFLKDLLEPL FT NGIQYFAFSENDDVIAPDNALPNSIMIFDDIACEKQNNVRAYYCMGRHKKV FT DCFYLCQSYAQVPKHLVRDNVNLLVVFRQDEMNLKHLYNDHVNSDMSYSQF FT RELCSKCWMDDKHGFLLIDKDSAINNGRYRKGFDTFAINIRHTSINMADIQ FT RQKNILHQLVKAQNAVKRKYSLLKFGKDKFEQSIGETFKPIVDPLEKLVKT FT VEKSEVSKRSRAIKKEVKDEMVKNESSNDSFDDDTIEPYHESDFETAASED FT DDDDLNDTKIRRGVIQEHSNKEVADMYMTMLKHQQDNDKIYGVRKEGSEWM FT LGDSSIVFGDRSIKVKDGRYPKTAGLMELLIAKQPKIYDQSDLQNYRSILE FT ATNAHKKQYLSTQPIRVHNSKKYTNIINPMFSGKSGKGLPRYKIARRDTRM FT DYVYWDDPNELVDRLRLLIAEQSAGNPSHINEIHSIIEELREGGYIYAVSF FT TNIIIKMSVDVFGRNLKKSEGSRGPPGFGFKITTDGQYDMDKKRLCNVGDA FT QNIDEAVSLFQLQMENQKILDLISRQERDLNNLDSLLEAYREQFDLKLLTY FT KLEIDTIKEAVSQITSNIHSANVKRLCHRSQATFILQTFNMQHKKVIVEEL FT HKPARRNYPRRSFDVRGLDETWQADLVEMQPYAKENKSFRYMLTVIDVFSK FT FAWAVPVKKKTGEEVAAAMKSILRQGRVPKNLHTDRGTEFYNTNFQTLMKQ FT YKINLYSTYSNLKASICERFNRTLKNKMWIQFSLQGNYKWLKILPDLITKY FT NDTKHRTIXMKPNEVSTANQSQIFKRFTCESRLPKKPKFKIGDKVRLSKVK FT QVFEKGYTPNWSTEIFTISRVAATNPVTYHLKDYQDNPITGGFYEQELLKA FT KYPDVYLVEKVLKKRGKQVYVKWLGFDSSHNSWIDKTEI*" XX SQ Sequence 14499 BP; 4371 A; 2760 C; 2776 G; 4557 T; 35 other; gccgtcatca gcacatgtcg ttatcggcac acgccgtcat cagtacatat cgttatcggc 60 acatgccgtt tctgatacag gccggtactg gcacactcag atgagagagc ggtaagagag 120 ctgtgagaga gcggtcgata acgtcatcaa cgctggcgtt gtcgaccgcg agatccccga 180 tcgatgcgtc tcggtgctgg cggtgacgtc aacggcccga tgttaccttc tgctgttgat 240 accgacagct tgcttcttta ccgtcatgta tcttctagat gacaaaaaga aaaaagatat 300 cgttggctat gcaggggatt tgaaccggga accttgcggt cgccggccga gcgtcttacc 360 gacttacccg ttacgggagt gatgactagt attcaatcaa gttagctcac tacttaaatc 420 tcttataacc ctctttttca ggcatagaat aagtatttgt ggaaaacttt tgtgaatact 480 gtttccaatc agaagatttt tccttctttt cctgtttctt atcatttttc gaatcaggaa 540 gaggcttata acctgcacta ctaggtggtg aacccatgat gtacgaatga gcatgatgtg 600 gaagttttct cgtatttata agtgatggga gccccaacca tcgctctcac ctgtgaggtt 660 caaccatgta acttaacctc tccataaata ttttttcact cccactccat ggatgaacct 720 tgtctctcca attgcttaca ttgctcttgc atgtatttct tcataagtcg tacattgtga 780 agagcacagc tggctttata ggcgccatta tgatgaggac aaaccgttac gagtttgatc 840 ttgtctagag gtggatatcc catttccgag atgtcgatta cgttgaattt gaaacgttcc 900 aaccacttct tcttcaagga tcccattacc attactgcag tagagtcatg caaggcttcc 960 cgaagaattt taccgatttc tgcatattcg atttctccag atgaccatgg tataccatga 1020 taatgatatt ccaaccatgt attctcttgc ttatactttt ctgtgagtcg acgccatgga 1080 aaaggctgtt tgaaaagtag tacaataggt tcgatatcat ctcttagaga aactatagat 1140 aactccttga ggatgaagtc attgccgggc tgcttaaagc cttgcatgtc cactaagtac 1200 tccatctcgg aatgactctg actagcagca agcttttctt tatatcatat ccccaccact 1260 actacttcca tccccaccac aagcagcgca agaaaggagg cttgaatgcg cgcgcatagt 1320 agtgcaagag tggggatatg aaagcgctct actcaggaga acattggaat gcaagtttca 1380 aatctctcca tgaattcgat ttttgttatg atacaatgtg tgttattaaa taaatcaaac 1440 gagttttttt acaacaccat ttgcgcaatt atactgaact atgatcaact acgatcatgt 1500 ataattaaac aataagcaga ggttcctgcg ggaaaattat ctttggattc aaattccagc 1560 cgaacatcaa ctgacgcgtt tttcaaagct tcgttctgct ttgaacaatc gatgactata 1620 agaggtagat agttgatgaa gtgttgcttt ttcagcatag gctcaatact cttgtcatag 1680 tatgcatgtt gaaagttggc atacatatcg tagagcatag catactgatt gcgttcaata 1740 tctaaattta gatttccata aggatagtat tgtgaattta agaagagttt aacattactg 1800 atattgcagt gatcgaaacg actagcatta atttctttcg cttccttcct atttgtttgg 1860 aagccgagga tcacgaagcg gggcttttct aactgattag atgttttgac tgtccaaaca 1920 tgtttgggtg tagtgggaag cagtggatat tcgtacagct cccaactacg aaaactcata 1980 tcgatcgatt tacccttctg aatatgacta agcatccgga ttttatgctg attagataat 2040 actacatatg gcatgagcca ctcaactttc atcaactcaa tcttgaaatt ctcgtactct 2100 gctgcagtag ttgcagttgc gaccttcgta ggtgtctgta ttacggaatt cagatcattt 2160 ctcgatctag tgagaatgat ttcatgcttc acgttaacga ctaccttgcg atagtcttca 2220 gtaaagcata taaacatact tagaggtata gatatatcaa agtatccttc actatttatc 2280 aaattctctt tctcctcaac gtcgagccag ccggcattat gtatgattag actttgactg 2340 ggattaatcg atacccatat tacatatcgt atttcttcaa acatatggca gacggcgttg 2400 tttacgaact ttgtattttt cacgagagta ccatcagctt tggttagcct accacataca 2460 taaagcgaac tagctgacgg caaaagacac aagtcttgat gttgcaccga gatacgaatt 2520 tcatcactat tattgtaaga tgaaacgttg taaggttgat gagcatgaat ctcataatga 2580 gaatttcttc cattttgaca gcagaagaat atacacgtta ttttccacca actcttaagc 2640 cgaggctttt tagaaactga atcttttgcc gtgttagcgt tcttaacgtt cgcggacgac 2700 tgatgacttt tcgacatttt cgattcctta tatacccgcg accgtaaacg atacccatta 2760 tacttgcttc agatgtaatc tcactgttat tacttcaccg cgaaaattaa ctaagtgtcc 2820 gtcttcatca actatacgaa tttgaagttg atctatgctt ttcacagtga ttcgaaggta 2880 aataacttga gaagwaactg caataatctt atagcctggt ggtactgcag ggaagaattc 2940 atgtattgta tgcactttct taccgtttat ataggcaccg gtagtgatct tacgctcgac 3000 tcgtagaacg ttgactttga gaatggatac aggtagatca gagctatgag agacgtttga 3060 tgaaagaagt cgttgtgtga aacctagcaa ttgacctatt gaatcttttt gattctcaag 3120 raattcaaga catagatgcc cacaattaaa tgtatcatag tcctgatatc tttcatagtt 3180 atacttttat tttatcaacg cccaaatata gcattaattc tagaggtggt tgaaggtcac 3240 caaaactatc aaagtacaca acttcgctac ctctctttcg atatgctacc caatgtgtgc 3300 caggaccggt agcatcatcc aaattaacaa tagcagattc acgataatgt ggtccattga 3360 taggaagagc attacgcata aagacacctc gaaagtatgg gatttttaag attttagcat 3420 actttaagag atcaatatta gtgagagctc tctgaggtaa tcttacgtcg acaatctttt 3480 ttttttctta actcatttcc aggcttaaga cgaagaccga gccctgtttt atgtggtttc 3540 aaatagagac ctttaccaag agctatatct tccatttttc tattatgcct ctgactttct 3600 tcaagagcct gttttgctga tttagcatcg ttaacagctt tcgctattcc agcagctcct 3660 cctgcgatag caccagtcgc acttaaacca gcgaatatag gtataaggaa tggtaagaag 3720 cctcctactt tcgatggaac tggtagaata cgaggaatac caatgtttcg tttaccacca 3780 gcctctttga ctgcttttcg agctcctttt aatgctgata gaatgacttt acgagcacat 3840 ttgctgggaa cgattgattt cgatgcagct tttactatct tcttaaatgg aacacccatt 3900 ccaaatttag actttaactt catggcactt gtaacagcaa aagcggctgc tttttcctta 3960 acccccgaat ctttcgagac aactctttgc caagctttat gggcaaggac tctatcagca 4020 gcgttcctag cttcgatgtt ttctcgattc tgagaataga ctatatcgtg ttccttgcag 4080 gctacgtcta ggggattaat acctggatcg ccacgagcta atcgtttagt gagtttagta 4140 cctggtccac agtattgata acctggtaga tgaagttcaa ttggaagatt attgatcaat 4200 cgattaacaa gacctcgtcc tcgacgtttg taaacgatca tttcacttta aaggttaaac 4260 aacaactaat atcgagagta taaataccag ctctatttat agtctcacaa cagttgtaat 4320 gcagacctct ctatacaaac atgttcaaga aacagtctgt taagcttccg gtagtaaact 4380 ttgatactat tgcagagccg gagaagaaaa ataaacgaca tggtgattta cttccagata 4440 gtattcgagc agtattttgt ggaccatcaa attgtggtaa aaccaacagt ttgctagctc 4500 ttataacaca tcctaatgga gtaaggtttg aaaatgttta cgtctactct aaatcactca 4560 atcagtccaa gtacaaattt ttaaaagatc ttctcgaacc attgaatggt atacaatact 4620 ttgcattcag tgagaacgat gatgtcatag ctcctgataa tgcacttcca aactctatca 4680 tgatcttcga tgatatagcc tgcgaaaaac aaaacaatgt tagagcatac tactgtatgg 4740 ggagacataa gaaagtcgat tgtttctatc tatgtcagtc ttatgcacaa gttccgaaac 4800 atcttgtaag agataacgtc aatctcttgg tagttttccg acaagatgaa atgaatctca 4860 aacacttata caacgatcat gtcaactcgg acatgtcgta ttcgcagttt cgagaattat 4920 gctcaaaatg ttggatggat gacaaacatg gattcttact gattgacaaa gatagtgcta 4980 ttaacaatgg tagatatcga aagggttttg acacttttgc tataaatata agacacactt 5040 agtcgacatt cattagtcta tttttgactg ttaaagtatc aacatggctg atattcagag 5100 acaaaagaat atacttcatc agcttgtgaa agcgcaaaat gctgtaaaac gcaagtatag 5160 tctgttgaaa tttggaaaag ataaatttga acagtcaata ggagaaactt ttaaacctat 5220 tgttgatcct ctcgagaaac tagttaagac agttgaaaaa tctgaagtat ctaaacgtag 5280 tagagcaata aagaaagagg ttaaggatga aatggtcaaa aatgaaagtt ctaatgacag 5340 ttttgatgat gatacaattg aaccttatca tgaatctgat tttgaaactg cagctagtga 5400 ggatgatgat gatgatttga atgatacgaa aattcgtagg ggagtaattc aagagcacag 5460 taacaaagaa gtagctgaca tgtatatgac tatgttgaag catcagcagg ataatgataa 5520 aatttatgga gttcgaaaag aaggttcaga atggatgtta ggtgactcat caatagtgtt 5580 tggagatcga agcatcaaag taaaagatgg aagatatcca aaaacagctg gtttgatgga 5640 attgctcatt gcaaaacaac ccaagattta tgatcaaagt gatttgcaga attatcgaag 5700 tattctggaa gcgacaaatg ctcataaaaa acaatatttg agcacacaac caattcgtgt 5760 tcataatagt aaaaaatata ctaacattat caatcctatg tttagtggra aaagtggaaa 5820 aggattacct cgttataaaa ttgctagaag agatactcga atggattatg tgtactggga 5880 cgatccaaat gagctagttg atcgtttacg cttacttata gctgaacaat cagctggcaa 5940 tcctagtcat atcaacgaaa tccactcaat catagaagaa cttcgcgaag gagggtatat 6000 atactgagct gtttcattta caaacatcat tatcaaaatg agtgtcgacg tgttcggacg 6060 taatttaaaa aagagtgagg gaagtcgtgg ccctcctggt tttggtttca aaatcacaac 6120 agatggycaa tacgacatgg ataaaaagag gttatgcaac gtgggagatg cacaaaacat 6180 agatgaggct gtcagtttgt ttcaacttca aatggaaaac caaaaaattc tagatttaat 6240 atcccgacaa gaaagagact taaacaattt agattctctt ttggaagcat acagagagca 6300 atttgattta aaattactta cctacaagct agaaatcgat actataaaag aggctgtgtc 6360 acagatcaca agcaacattc attctgcaaa cgtttaacat gcaacataaa aaagtaatag 6420 ttgaagaact tcacaaacca gctcgacgga attatcctcg tcgcagtttt gatgttcgag 6480 gattagatga gacttggcaa gctgatttag tagagatgca accttatgct aaagaaaata 6540 agagttttag atatatgtta acagtcatag atgttttctc caagtttgca tgggctgttc 6600 ctgtgaagaa aaagactgga gaagaagttg cggcagcaat gaagtccata cttcgtcaag 6660 gtcgagtacc taagaatcta catacggata gaggcacaga gttttataat accaactttc 6720 aaactctcat gaaacagtat aagatcaatc tgtactcaac gtatagtaat ttaaaagctt 6780 ctatatgtga acgttttaat cgtacactca agaacaagat gtggattcaa tttagtttgc 6840 aaggtaatta taagtggttg aaaatyttac ctgatttgat cacaaagtac aatgatacca 6900 agcatcgaac tattrgaatg aaacccaacg aagtgtctac agcgaatcaa tctcagattt 6960 tcaaaagatt tacttgtgag agtagattgc cgaagaagcc taagtttaag attggtgata 7020 aggtacgatt gagtaaagtg aagcaagttt tcgagaaagg ctatacacct aactggtcaa 7080 ctgagatttt cacaataagt agagttgcag caacgaaccc agttacatat catctgaaag 7140 actatcaaga taatccaatc acaggtggat tctacgaaca ggagctcttg aaggcgaagt 7200 atccagatgt ttatcttgta gagaaagtct tgaagaagcg tggaaaacaa gtatacgtta 7260 aatggttagg gtttgatagt tcacataata gttggattga taagactgaa atataatttt 7320 ctttttttkt tggaagtgat gagagcccca ccttcgctct cacctgtgag atacaaccat 7380 agttaactat ctctccatta acttccttta tttattcaaa caattattta caatggttaa 7440 catttagaca atacaaaatt caatgttatt ttcgcctttt gaggtcaggg acagcttcat 7500 ctccttggct tgactagaca tactctcaaa ggtctctttg tccttctcca atgcagctga 7560 tacacgggaa gggaggaaca cctgacattc atcgtctaga gtagccacaa ttctcagtcc 7620 atatctggtt ttgactggac gtagggatgt caccatatgc ttctgtccgg tctgtagctc 7680 agacatcttc ttcgtgggaa ggaatcctcc gctggcaatc ttgttgatag cgtccatctt 7740 tgaggtcttt gctctttagc tcgttgatcg agaatgattt cgagagcgtt gggcttcctt 7800 aagtatagtc ttccccacca ctactaccag cggtgacggc ggctcacaca tcggattggt 7860 cggcagggag tctgcgcagt agagctgtga tctgaaacaa aatgtatgat taatataata 7920 acaattataa ttgttattat attaattata aatgattata acatacctaa tgtaatctga 7980 ctctcttcgc agatggctca tcttcctcga taagctgata tccatcaatc atgtaatgac 8040 cccacggtag agtatccgtc tgaccgggga tcaggtatcg cttatcgtca tgtggactca 8100 gagcgatctt ttgctccttc tcrgtcctga tgacgtggag gcggctgcgg atgttcttct 8160 gytgtctggt gattgttgcc ttgtctcgaa gacaatggat gtagtcrtca aactcgatrg 8220 tcgacttgac gactgacgac ttyacaccct ttgccttctt gataggttgc tgkccttgta 8280 ccttgacsga gtacatcttg ctacggaggc cgacaaactc cgtcatgatg ttgccgttac 8340 actcatcctt cataagtccg actttcttct tgttgactag agggatgtta aactgattat 8400 cagcaggata atcagaagtg tcgaactcat gaaggtcagt cttcatcaca gcatacatat 8460 ctacatcgct gacctcatag atgagactgt ctgtgtcygt gtagagcaac ttagccttgt 8520 ctcccactcg ctcctgcatg tattcgtagt ggaagcggta aacaaggctc tttgacaagt 8580 ccaacactgy caatccaatg tagatgggct tgcggacaga gatctcrgtg cgggcaagyt 8640 gkacagccac gagattctcg ttgaaaatcg cacagctgtg gaagttgggc tgagcaatgt 8700 atgcctcaca gccrtagcga ccttcccact tgtttaccag cttcacgtcg actctctttc 8760 gctcattctc catggtctta ccatacactg cgttgttaaa gagtttgaag agcatcttct 8820 cgaactcatt cttcgcattc tttctcatct cgctgttcaa gtcaatgtat ggcttcaacc 8880 atggcttctg gttgaattgg agagcacgat ggatatgctt caatcgaaga ccgtacttga 8940 gagcttgctt gagagcgacg taatgcagga catagcgctc cttgttgttc agagtggtta 9000 gcaacttctc ttggtttgat cctggtggtg ctgcatgctc tggacagagt ggaaggtcct 9060 tgtgagtatc gtggagactt tccggatact cgaggtctac ttccaagatg tatcctacag 9120 gagcatcgtc gggaacattg aagaagtcac aattcaggtc ttccacccac ttaaagccat 9180 cacatggcag ggattgtgtc attgccctag aaatatagga tttattagta tatcatcatt 9240 aatataatta tttcaatgtg aacttaccat ccgtataggt tgttgacatc atagtacatc 9300 aggtactttg actcttcatc ctcatcgtat ccagactcca tgtatcgatt attggccttg 9360 gcatacctat tacagcattg gctgatacca cctcggatgc cccgctcgac gaagagcagc 9420 atgtctatat cagttaacag ctggagcttg atgccagtat acttcaacat ggcatcccat 9480 gtaaatccgg gggtggtata gtaatgagca ggatctagct cataggtatg aagacacttg 9540 gatctgaagt cttcaaacac gttggcgagg agaagtacgt cagtttttag atagaggtct 9600 gagtactggc caagagtttt gatgttgaaa gcattccaga ccgtctgagc atgcttgtag 9660 tcctcgtctg atattccact atcatagagg ctactataga aactctcctt gtctggaagg 9720 gaggtctcgt tcaagtagtc aaaagatttg atgtgatcgt acgggaacac acctttcctt 9780 ctaagcaatt caatctgaac atcgctgaac tccttcttga actctttatc gactatttcc 9840 aacttgtcta aatacgatgc aagtttgtca agtgaagatg gcatgaatcg gaaggagtcg 9900 ataaacttga agttgatgtc gcttccatca atatgcttgg tgaaagagat gtacttctct 9960 ttattttgcg gtatgataga gactcgtccc ttgaagttat tggcaagctc ctttatgaac 10020 aagtgcgaat catatccgct caagttgtgg aatacaactg gaatacttct agagtcctta 10080 tagttgagat tacaactgtt gtgagccgct ccacgatatc tggaaaaaaa gaatagttat 10140 tagataaatt gataataatc agaataataa taaacaactt actgtcccgt aaggtgacaa 10200 tgatcttgaa ttttgatgtc cgtcacgttg aacggtcctc cacagatgtg acaagtcgta 10260 gcacaatcaa acgatctttg atccatatcc ttcattggca agggattctt gtgcatgttg 10320 gtgagttctt cagctagcat gtagagttgt gaaacaaacc atttcgcagg atcttcgtct 10380 tgttctgttt ttcttctgta actgatgtac ttgaattcgt cataagtgta ttgaacatag 10440 cagccaatgc tgaagggttc atgctcttgg taagctcgat tatcgtcaac aggcttgagt 10500 atrcattcga agtctgcata cacaactatt ggaactctct ctttgttgtt gacgtttttg 10560 aattccagca cgctgtcttt ttcttgaggt aacgacactt tgcacatatt tgcttgtaaa 10620 cagtcgacta cgtgagcttt caattttggc ttatcatgaa aatagtgaag acaacgatca 10680 cagagatgtt ttttatgtga attctttgat aattgtaaat ttactaaacg agacaaatct 10740 tyaatccata cgyaatggaa tcgtggaact ctcaaatttc cttgttgttc ttcttcctym 10800 tcatctatat aagtgtcttg cacrarcary aggttcacat gkttatcttt ctttgaagct 10860 gtcaagtgac atgatacatt aaatacacta cccttcttct tgagtataaa gagatttaca 10920 gagacgtcat tctgcttctc aaacttcggt atttgcttta cagatactgg gaattcaata 10980 tctttgaagt tcaactcatt cgcataggct tgatattttg aaactcgctg tgcatttata 11040 tctacaggat gtaaagctga gagaatggcc catttaaagc attcgttgtc ttgattttga 11100 acattgacac atgctctttt cttttgtata gccattggta actcgatgta tgaagcagct 11160 ttcatggggc taaatttttt catattcacc gttaagttga gaatgctcct taaagtccat 11220 ccagagtctc gttcctggaa ctcttccatg tctcgttcaa tcggctgctt aacgttaact 11280 gcaaaccact cttttaagtc actagttgcg tagattgaag cagactttgt attgaaatat 11340 ttcacttcca caatttcttt attttcttcg acaatcatgt actctgcagc gagaattgtg 11400 tcaactttta tcgacttatg cactagtaac gcttcttcaa ttttcaacat aaagagaaca 11460 catgcgtctt ccatgaatca gattatgtcc aaatatttca cgtttgttat aacaccagtt 11520 ttgaggttat ttttgaatgc tgatttagta tcgtcccacg tcacactagg agtaacccct 11580 tcaagtccac tgccaacttt tagtaagcag ttgttgagta atcgatgaat actataaagt 11640 tgaccacayg aagtttgaca tgtctttttc ttataaatat ttaaggctga tgactctatc 11700 aattcttcca cttcactaat agctttagag gttaagtcaa tcgatctttg caagtcgcta 11760 gagtttaagt ctccatcaca agcatatttt acaatacgcc acaactcttt cactatgtct 11820 acaaattcgt ttgttaaatc catttttttc tcacttatta atggagaaaa tgatgttcgg 11880 cagtttgttg ggcagcaatt cgtcgttcga cagttttgtt gaaacgttgc actattttca 11940 cttatcactt tttaagttgt ccagcagaga ttcgtcgttc agcagcgatt caacgtctga 12000 cagcgttaca ctttttcaca gcactygtca ctttttacac cttcctttaa cgattcagca 12060 tccaccagat tgyagttcag cagttaagaa gttcgttgta cactttcgtt cgatgatctc 12120 ttggcacttt tacacaaaac tttttatata acttgtcact ttttacacag cagctcctta 12180 ctccacgagt cgatgaacaa taatgatttt ttaaagcatc aattttttct taagtagagg 12240 tctccccact ataatgcagt agaaaagctg taacgctgat tggttcgtaa gaatgtagaa 12300 ggggaagcat tgatctgatt gattaagagg taagtggagg gggaagtcgt catctgattg 12360 gttgacatag ctctcgcaag cagtataaag aggaaccaga tacgttttag taatcatttc 12420 gtcatggcct acgtagtaga tatccaaggc ttcaagagca cttgcaacga gttcatcttc 12480 aaagaggtag cgatcgttgc tctcgaggaa gatgcaactc catcagtctt gctctttcga 12540 ccaccgtact cttggaacac cctaccgaga tggagaaaat gtgagaaccg gtggttagaa 12600 cacaactatc taggaatatc ctggaatgat ggagagatac cttacgaaga tctagtgcat 12660 actcttcata atcttctacg aggagctcat aagatctatg tcaaaggtcc ggagaaaaga 12720 agatggttgg agagactagt acctcatgtc tttgaccttg gtgaaatgag atgtccatct 12780 ttgaaagagc tgcgaaagaa gagttataca acatgcaaca accacaaact atgtaaaaaa 12840 ccggtatgtg cagctgaaag cgcatgcgca ctgaagactt gggtacttgg acagaaacca 12900 aatatatcag aggrtwtttg atgatgaagt agagaatatg taaaaaatca tagatccctg 12960 gtatatatca gtcgattacg tttcctacat ataccgatag agatgggtaa agagctctgt 13020 gtctgttttc agcctgaaat cagtgataac gtttcctgca tataccgaca gagcgttcgg 13080 ggtctctttt catcttgaaa tcaagtatca aatttaaacc agatgtaata tgatcacata 13140 tgacgatttt ctctatataa ttttgtttaa tttttttatt taaaattaca gcaaaattaa 13200 tacgctgata taaagttaaa tataaatttt attcaatgtt tatatataca tataattttt 13260 tattcgggct tattttatat taacacatta tttcaatgta caccatccgt taaatgaaac 13320 tttgaattat cccgctcgta tgtgaagtgt catggccgat gggtaggata ttcaaagatg 13380 gctgacgggt ggggtattca aaggtataaa aagacatatt taattttcca attattttat 13440 taaattttcc agtaataaaa tagcaacgtt tggtatattg tgatataaac aatcacgtac 13500 tgttttaaaa acaaatagat tttgttgtcg ggtgacctct ccacatattt catacaaaaa 13560 atctctttat tcataaataa acttattcct tcttgcataa aatgagatgg ttgatcatcc 13620 tgttccaaaa ttaatgacgt atagaaatca tgcattgtgc caaataccta aaaagtaaat 13680 ttattattaa tatttgtaaa caataatata ttagttttac actttatatg aaaaacgtct 13740 gcgaagaacg agttgaaaaa tttacaagtg taatacaggc ttgtgtaagt taaagtagcc 13800 tggaatttaa cataatctta acctaactta cgtaaaggtc gtattacact tataaatttt 13860 tcaaatcttt ctacactgac gtttgtcatg ttgagtgtaa acttgtttta ttatgattat 13920 aaaaaattac cgaagcgact atttccaata aaagtaatct tttctgtata agcggtacgt 13980 caatttgttt taagaaattg tgtagaattc caaataactc gatgccagtt ttcttgaagt 14040 ttgttccatc ttcttgagtg tatagcattg ttgcatagaa cgagttattg tcgccataaa 14100 actcctctcc aacttccaaa aacaacctta tcttaacaca tatatatata tatatatata 14160 tatatatata tatatatata tatatatata aacgtatatt gatacacgac gtctgtttct 14220 atataatttt ttaataatca tgtatagtcg atatacataa tcatatatga tcgtctattc 14280 atacacaaag tttgtatcta tataatttta ataactcata tatggttatg tatactaatt 14340 tatagtcaaa cataatcata aataatcata catcaattta gttgtagttg attgaagata 14400 agtacttata tatgattgta aataaaaaat atatatttaa tcgactacaa ataaattgtg 14460 taaacttatt tatgctcata catgactata ttaaatact 14499 // ID P-29_HM repbase; DNA; INV; 4264 BP. XX AC . XX DT 28-DEC-2008 (Rel. 13.12, Created) DT 28-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-29_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4264 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(12), 2082-2082 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(152..838,735..1307,1202..2761,2739..3062) FT /product="P-29_HM_1p" FT /translation="MVNSCAAAGCTNRAMKNDNRAFHKFPINNRELCKKWI FT VAIKRETFIPTEHSRICSDHFLPSDYNIPDFNNKPKLKPFAVPSIFIDSKP FT KVKRKFNSISPQTNKKRLINKKVSETGSHHHIDFQSDISDNNNVAANRNEI FT YINSVISETTESIDCQVDINSTFNIIKSSNKIINDDTIISKSTLSTNASPT FT KAKLKAKIKILKQKLRRKEKKNKFIKGYAKKSKQKKNCFKQKLKFLNKSCD FT EKKKKINSLKVMLKNLSKKKIVSEDAANVLCENFSGLTFDIIKNQFKNNTS FT LPKGHRYTDEIKKFAITLHFYSPKAYNFLRPILHLPASSSLANWTSSVDCE FT PGFFEDVLSYISIKAETDINYKDSCALIIDAMAIKTSVLNDKKTGRFIISL FT ALQIMEKILLQLNQIHQLQKHWYLCWKDRSIYHFIGFTDYGKDIVAIEPDT FT PATEALVFMLVGLRGHWKTPIGYILCSKITAENLSCLIKKALTISSDKKIN FT VRSITFDGVNVNFNSVESLGCTFGKRLEKINGRFTFEGYNHYLYVIPDPSH FT MLKLARNALCDLEVLKDCDGKYIKWSYIKALYEIQEEEGLKFANKISIKHI FT YFQRHKMNVKFAAQTLSSSVADAIEFLMFSKHPNFKHAEGTINFIRVIDKL FT FDMLNSKNLFSKSYKKALFLNDYPYWNSTFDQIINYLSKLTDANGTPLLKH FT RRKTFVLGLIVAAKSLQHLAYELLTLNVHPFKFCLTYKFSQDHLELLFSCI FT RGMNGFNNNPDVIQLKSSLKRILLRNSIVGSKHANCLTFEDIANGSIFALK FT WSKKSAIYSNPETDFLNFDIDIVTIQNYLDNTSIYKEAILGYISGFIVRRL FT LSKITCSVCANSLISQNNFVEHSYGTSALSLINIKNRGGLIAPRNDVIQTI FT IVCESVFKFLIIINXIYICXRIFTQIKIMSXNLCQQCQXIYVSSDDFKKPK FT MNAERNLKLKMIYAVNKIISEKHFFKDLNDHDAEHEAVTEDMHSTQLIKML FT ANDFFTIRLQRYGQLYTEKVLRKSMIGLRQQSNKLIIFKGL*" XX SQ Sequence 4264 BP; 1672 A; 525 C; 554 G; 1509 T; 4 other; caatgatata ttaaaaataa ctagatgtct aacttccgcg tattttttga agcaaaaatc 60 ggccattatg gggggcaaat ttaaaaaata gtaaatcacg gttgtttgga tcaaataata 120 ttcgatattt aatattcata atttataaat catggttaat tcttgtgctg cagcaggttg 180 cacaaataga gcaatgaaaa acgacaacag agcatttcat aaatttccaa ttaataatag 240 agagctctgt aaaaagtgga ttgttgctat aaaacgtgaa acatttattc ctactgagca 300 cagccgcata tgtagtgatc actttttacc ttcagattac aatatacctg attttaataa 360 caaacctaaa ttaaaacctt ttgctgtgcc atctattttt attgattcca aaccaaaagt 420 caaaagaaag tttaattcta tatcgccaca aacaaataaa aagagactca ttaataaaaa 480 agtttcagaa acaggcagtc accatcacat cgatttccaa tcggatataa gtgataacaa 540 taatgttgct gctaatagaa acgaaattta tattaatagt gttatttcag aaacaacaga 600 gagtattgat tgccaggtgg atataaatag tacctttaat attatcaaaa gtagtaacaa 660 aattattaat gatgatacca ttatttctaa aagtacatta tcgacaaatg caagtccaac 720 taaagcaaaa ttaaaagcaa aaattaaaat tcttaaacaa aagttgcgac gaaaagaaaa 780 aaaaaataaa ttcattaaag gttatgctaa aaaatctaag caaaaaaaaa attgtttctg 840 aagatgctgc taatgttctt tgtgaaaact ttagtggatt gacttttgat attataaaaa 900 atcagtttaa aaacaatact tcacttccaa aagggcatag gtatactgat gaaataaaaa 960 aatttgcaat aactctgcat ttttactcac ctaaagcata caattttcta cgacctatac 1020 ttcatttacc tgcttcaagc tctttagcca actggacttc ttctgtagat tgtgaacctg 1080 gtttttttga agatgtttta agttatatta gtataaaagc agagactgac attaattata 1140 aagatagttg tgcattaatt atagatgcaa tggcaatcaa aacatcagtt ctaaatgata 1200 aaaagacagg tcgatttatc atttcattgg ctttacagat tatggaaaag atattgttgc 1260 aattgaacca gatacaccag ctacagaagc attggtattt atgctggtag gattacgagg 1320 tcattggaaa acacctattg gctatatact gtgtagtaaa ataacagctg aaaacctttc 1380 atgtcttatt aaaaaagcat taacaatttc ttctgataaa aaaattaatg tacgtagcat 1440 tacatttgat ggtgtaaatg ttaactttaa ttctgttgaa tcactaggat gtacatttgg 1500 gaaaagatta gaaaaaatta atggaagatt tacttttgaa ggctataatc actacttgta 1560 tgtaatccct gacccaagtc atatgctaaa attagctaga aatgcgttat gtgatttaga 1620 ggtacttaag gattgtgatg gaaaatatat taaatggagt tatataaaag ctttatatga 1680 aatacaagaa gaggaaggat tgaaatttgc aaataaaatc tcaataaaac atatttattt 1740 ccagcgtcac aaaatgaatg ttaagtttgc tgcacaaaca cttagtagtt cagttgcaga 1800 tgcaattgaa tttctgatgt tttccaaaca tccaaatttc aagcatgctg aaggaactat 1860 aaattttata agggtcattg acaaattatt tgatatgctg aactctaaaa atcttttttc 1920 aaaatcatat aagaaagctc tttttttaaa tgattatcct tattggaatt caacttttga 1980 tcaaataata aactacttat caaagttgac agatgctaat ggtactccat tattaaaaca 2040 taggagaaaa acttttgtat tagggttaat tgtggctgca aaaagccttc aacacttagc 2100 ctatgagctt ttgactttaa atgttcatcc ttttaaattt tgcctaacat acaaattttc 2160 acaagatcat cttgagttat tgttctcttg tattcgtgga atgaatggat ttaataataa 2220 tccagatgtc atccaactta agtcatcttt aaaaagaatt cttttgcgca attctattgt 2280 aggctcaaag catgcaaatt gtttaacatt tgaggatatt gcaaatggct ctatttttgc 2340 attaaagtgg agtaaaaaat ctgcaattta ttcaaatcct gaaacagatt ttttaaactt 2400 tgatattgat attgttacaa ttcaaaatta tcttgataat acctctatct ataaagaagc 2460 aatactagga tatatatctg gatttattgt aagaagatta ttatcaaaga taacatgttc 2520 tgtgtgtgca aactcactta tatcccaaaa caattttgtg gaacattctt atggaacatc 2580 agctctttca ttgattaaca taaaaaatag gggtgggctt attgctccta gaaatgatgt 2640 tatacaaact ataatagtat gcgagagtgt tttcaaattt ttaataatta ttaatawtat 2700 ttatatttgt takcgaattt ttactcaaat taaaataatg tcaayaaatt tatgtcagca 2760 gtgacgattt caaaaaacca aaaatgaatg cggagcgaaa ccttaagctg aaaatgattt 2820 atgcagttaa taaaattatt agtgaaaaac attttttcaa agaccttaat gatcacgatg 2880 ctgagcatga agctgtcacc gaagacatgc attccacaca actaataaaa atgttagcaa 2940 atgacttttt tacaatcaga ctgcaaagat atggccaatt atatacagaa aaagttttaa 3000 gaaaatcaat gattggtcta cgtcaacaat caaataaatt aattatattc aaaggtttat 3060 aacataaata tttaatatat aaaaggataa taaattattt aaaatttaaa aattcagtat 3120 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3180 atatatatat ataatatata tatatatata ttatattatt taaaaattca gtatatatat 3240 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3300 atatatatat atatatatat atatatttat aaaattcttt gtgtatatat atatatatat 3360 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3420 atatatatat atatatatat atatatatat ataatatgaa aataatataa ataaaaatat 3480 taaattaatt tttattaact gttacttttc ttgatttgtt gtttatattg aaaaactcgc 3540 ttttcaaagg aaagatattc tttatatacc tgaataccta tggagctttt ttgtattgaa 3600 aaccttacta gatataacat aacattgaat taaatttatt taattgacgg taaatcaatt 3660 atagaaaaaa gtaataatga tttttatgat aacaagaagt gtttaaccat aaataaaatt 3720 ctaacatttc tagaaggaac ctctaagttg tgtcacacag agcttagtta atagtcaaga 3780 cagttttata aaatgtttca ttacaagttt taactctgtt taccttgcag ttatgttata 3840 atacctccta gcattactgc agagggtaat taatgactaa caggtgcata aaatgttaat 3900 agattaaaga ttaaaatcca actaaatcga ttttagttga atgttaaatt ttaaaatcat 3960 acaaayttaa attttaaaat caaaaaaaat gaaataagta tgattttgaa atgtttaaaa 4020 aaacaatgca tgttttacat aattattaaa ttttaaatca aaatcaaaat aacatcaaat 4080 ttttcagata tctgtgtatc gattttttat tgattacatt tttaaacaat tttaagcttc 4140 gtatttgttt tgttttgata aatagcgtaa agattttgcc ccccataatg gccgattttt 4200 gcttcacttt ttcttatacc gatttacggc agttagacat ctagttattc ttaatatatc 4260 attg 4264 // ID BEL-216_AA-I repbase; DNA; INV; 7457 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-216_AA_; KW BEL-216_AA-LTR; BEL-216_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7457 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 889-889 (2011). XX DR [1] (Consensus) XX CC Positions [5465-6049] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2825..6430 FT /product="BEL-216_AA-I_1p" FT /translation="MPKELVLADPTFHESGAVDLIIGAEVYLELMIAERQI FT KLGGSGPILQNTLLGWIVSGGVPDESTSAPAVSACATEKIEEGLARFFELE FT SCRTTSTLSLEESACETHFERTTIRDSSGRFVVQLPKKQFMLDRLGDTQAI FT ATRRFMALERRLDADPALKKMYTEFIDEYLRMQHMREISPRELSTFPVTYF FT LPHHAVLKPDSTTTKLRVVFDASCASTTGVSLNDALMVGPVVQEDLTSITL FT RFRLRKYAMTADIEKMYRMIKMHPMDHPLQCILWREDSTKPLRFFVLTTVT FT YGTSSAPYLATRCLKKLAEDEKANFPAATNTIIYDFYVDDMLKSVDSIEEA FT AQLSKDLVHVLGTAGLTLRKWSSNSRELLDQIPPYLRDERSSLDLELSNPT FT VKTLGIKWEPRSDIFRFTVPQWNPATEFTKRIILSDFAKLFDPLGLVGPSL FT VPAKVFLQELWRSKCSWNDILPDELQKWWREFRGSLEGLTHLQVPRWIAFG FT SDTISVELHMFCDASAKAYGACIYLRCTSFDGTVTSTLLTAKSRVAPLEDL FT KKKRKQASIPRLELSSALTGVYLFEKVVQSIKITAQPYFWTDSTIVKCWIA FT APPSRWNMFVANRVSEIQHITRGGIWNHIAGLENPADVLSRGMTPDLLKDF FT EMWWQGPPWLHLDKSSWPRTANINPEDLDPSLLEERSTVSVPAHVFNPSPV FT FSLRSSLQDLVRIVSLIRRFAHNCRNPGDRRAGFLRHEERAEALHQLLILA FT QQESFPEDLITLRRTGEVKPTSRLKQLAPRLVNGLILVGGRLRNANISAGR FT KHPVILDHHHPLSVLIAAHYHQKNLHAGQQLLVASMRERYWPLAARTLARK FT VIHSCVKCFRVRPKAHEQLMADLPAERVNPAPAFLRVGVDYCGPFYIQYPY FT RKGGPIKCFVAVFVCLVVKAVHLEVVGDLTTQAFIAALRRFVSRRGRPEII FT MCDNATNFVGARRELDELRKLFNRQEFVKAVTEEASWENINFKFIPAKSPN FT FGGLWEAAVKSMKEHLKRTLGNTVLVSDEMATLVAQIEASLNSRPITPLSN FT DADDLEILTPGHFLVGRPLTAIPEPSLQDLNEARLSRWQRVQYFLQRIWNR FT WSTQYLSNLHARTKWTRQRNNLFIGTMVLLSEDNVPPLKWRMGRVTEIHPG FT NDGNIRVVKVRTKDGDFLRAISKVCVLPIRDNEPSPSSAHGED" XX SQ Sequence 7457 BP; 1850 A; 1968 C; 1847 G; 1764 T; 28 other; cgccgcgcaa tgagaatata taccaaggtg tggaaaaaga agcaagggct atgtattagt 60 aaagagaaaa aaacgtaaac acaaataact acgtcactaa tttacacggc ttcttatggt 120 agctcgcttg cttgtagttg tgaaactttt tgcaccgtac atggatgtca cgcacactaa 180 atgtcttctt cctcacctcg gtatatattc tcattcacgc cgcgtgttta atttttgcat 240 gtcgtacaat tgtgaacmag ctctktgakg tscaataaac cgkctcagtc cgtctaagtt 300 gccacgcgat tttaagtttt ccgaagtgcg taaatccgac gactacagac ccagtgttaa 360 gcccgatagt gaacattttt ggtccaagtc gaaccggatc gtcgggaaac gcgaawscaa 420 agtgaacttt tgggccattg tgaggtaccg cgtggtcgct gattgttctc cacgagggag 480 aacaatagga cacgtgcgtc cccgtcgtca agtgcggtcc tgccatctgg cacctcccgt 540 gagtgttcct gcggtgtgct gtgacggacc cgaagctgct atcatcggag cttcsaccgg 600 agaccggacg gatttgccag gcgaaccagg ataagtactg agggctgggc aggtttgtct 660 gtggcgcgga atcttttgca tgcgagatcg ccgccgatgg aaggttagaa ttgagcgaat 720 tgactcacaa aacgakawac catactgaat aaagaattcg tatgattgct tgctagatcg 780 aaagtttgtg gtgtccaaat aatttatcgc tgagttgagc cctgaccagt ccgccacctc 840 tgttggcttt ctccgcttcg ttggttggtt tgcattgtcc cgttccccct caattgcctc 900 cactgtcaag tcaaaccgcg gggtcgattt gagtttcgcg tgcgagtgaa gtgctaaagt 960 gacagtgcga gtgacttaca gtgttcaaat atgccaggtg gtgaggatct gcggctgctc 1020 ttcaagcaag agcgwtcgtt gmgagtttcg ctggacaacc tgaaggagtt cgttcgctcc 1080 cacccagacg gtscggatcg gcgtggcgtg gamctacgmc tatgcaaact cgacgaaatc 1140 tgtggcaaat ttatcgacgt gcgaatgcga atcgagctgc tcaccgacga tgcagccgag 1200 ggtgatattt cgaccgacac ggaagaaacg gaagaggcta agctacaacg tcacatgagg 1260 ctacgaaagc agcaggatcg ggaaaatgca agaatcatca aggactttga gaatgaagtg 1320 taccacctga agcaasttct gcttggscgg ctccaaccca gcccgccagc aggcccaagt 1380 cagcagcatc gggctgcttc tcatgcaccc ccacagtcca aggtcaagct gcccgagctg 1440 aagctccctt ccttcagtgg taggatgtcg gactgggtca catttcgtga cacgtacaag 1500 aacctgatcc acgatgacaa caatctctcc aacatggata agttcacgta cttacgcacg 1560 tctctcaccg gcgatgcact acaggagatc gcctcaatcg agatttcctc cgccaattat 1620 gacattgcat ggaaggcgtt ggaaagcgtc ttcgaaaaca agaagttgct ggtgatgacg 1680 tacctagatg ctctgttcgc tctcgagcca cttcgtcaag aaaacttcga gacgttgagc 1740 aagcttgtta acggttttga aaaaaacctg caaatgctca ccaagatcgg tgaagatacc 1800 gawggttgga gcacgctgct gcagtacatg gtctgcaaac gccttcattc gagcacgctg 1860 agacagtggg agwckcatta cagctcaaag gaagtwccga agtacaagga tctgattaag 1920 ttcctkaagg gtcactgttc ggtgttacaa tccatcgccc ctggaaggca ggtccagggc 1980 gagccgaaga agtctgttcg tccatccctc agccatgctg gagtccagtc ctcaagtagc 2040 tgtccgtttt gcggagaatc ggcccattcc gccttcaagt gtacgaagtt ttccaaaatg 2100 agggtttcgg aaagggtcga cgccgtgaag aagcattcgc tttgcttgaa ctgtttgtca 2160 tcaggacaca tcgcacgatt ctgcaccaga gggtcgtgct tccactgtgg tcaacgtcac 2220 cactcgttac ttcacgcgaa ctcatcgacc accgcgaatt caccgaccaa gtctggacag 2280 tcaactggga atcagtcatc gaagaaacca cacggacaga atatgcaacc accacaagct 2340 aagccaacac caaacggtca ctcaacacac actcagccta cacaaagcgg acgcaatgat 2400 ccacagagtc cgacaaacgc aagtaacgct caatcacaca ttccgccaca gactgcccca 2460 gtacatccct acacactgct cctctagcaa cacaacgaca cctcccacac actgttctgc 2520 tctccaccgc gatcgtaaat ctctacgatc agttcggtaa ctctttgctc gctcgtgcat 2580 tgctagactc tgggtctcag cgatgctaca tgtcagagac catctcgcaa aaacttaagt 2640 tcaagcgtac tcgtgagcac ttaccgatcg ctggaattgg tggttcacga actgcttcca 2700 ccaaagcagt tttcgctgag gtccattcgc ttgtcacgaa atacgtgacg aacctcaaat 2760 ttcacgtgtt gccgcgagtt actgtcgatc ttccacgcga agtatcgata ttcgatcctg 2820 gaacatgccg aaggagctgg tattggccga tcccacmttc cacgagtctg gagcagtcga 2880 cctgataatc ggtgctgaag tgtacttgga gctgatgatc gccgaacgtc aaatcaaatt 2940 gggagggtct ggtccgattc tgcagaacac tctgctgggc tggattgttt ccggaggcgt 3000 tccggacgaa tctacatccg caccagctgt gtccgcatgc gcaaccgaga agattgaaga 3060 aggattggca cggttctttg agctggaatc gtgtcgcacc accagcacgt tgtcgttgga 3120 ggagtcagcg tgtgaaacac atttcgagag aacgacgata cgggactcaa gtggcagatt 3180 cgtcgttcag ctacccaaga aacagttcat gcttgatcgc ctaggagaca cccaagccat 3240 tgccactcgt cgcttcatgg cactagagcg aagattggat gccgatcctg ccttgaagaa 3300 gatgtacacc gagttcatcg acgagtacct ccggatgcag cacatgcgcg agatttcacc 3360 acgagagttg agcacctttc cggtcacgta tttcttgccg caccatgcgg tacttaagcc 3420 cgacagcaca acgaccaagc tgcgcgtcgt gtttgatgcg tcatgcgcaa gcaccactgg 3480 agtttcgttg aacgacgctc ttatggtggg tcctgttgtc caggaggatc tcacctctat 3540 cacgcttmga tttcgtctac ggaagtacgc aatgacagct gacatcgaga agatgtacag 3600 aatgatcaag atgcacccga tggaccaccc actccagtgc attttgtgga gagaagattc 3660 aaccaagccg cttcgattct tcgtcctgac aacagtcacg tacggcacat cctcagcgcc 3720 atatttggca acgcgctgcc tgaagaagct ggcggaggat gagaaagcca attttcccgc 3780 cgctacaaat acgatcatct acgattttta cgttgacgat atgctgaaga gcgtcgacag 3840 catcgaggag gcagcccagc tctcgaaaga cctggttcat gttctgggta ctgctggact 3900 cacattaagg aaatggagct ctaattctcg agaactgcta gaccaaatcc caccctatct 3960 gcgagatgaa cgctcgtctt tggacctcga actttcgaat cccaccgtca aaacccttgg 4020 aatcaaatgg gaacccagat cggacatatt ccgattcact gtgcctcagt ggaatcctgc 4080 tactgagttc accaagagaa tcatcctgtc cgattttgcg aagctcttcg atccgctagg 4140 cctagttgga cctagtttgg ttccagctaa ggtattcctc caggagcttt ggagatcaaa 4200 gtgttcctgg aacgatattc tgcccgacga gcttcagaaa tggtggagag aattccgagg 4260 aagtttggaa ggcctcactc atcttcaagt tcctcgttgg atcgctttcg ggagcgatac 4320 aatttccgtg gagctccaca tgttctgcga tgcttccgcg aaggcctatg gtgcttgcat 4380 ttacctgcgg tgcacctcgt tcgatggcac ggtcacctcg acgctattga ctgcgaaatc 4440 tcgagtagct ccgctcgaag atcttaagaa gaagcgaaag caagcctcca taccccgtct 4500 agaactgtcg tctgcgctca ctggagtcta tttgttcgag aaggtcgtcc aaagtatcaa 4560 aatcactgct caaccgtact tttggactga ctcaacaatc gttaagtgct ggattgctgc 4620 tcctccatca cgatggaaca tgttcgtggc caatagagtg tctgaaattc agcacatcac 4680 kcgaggagga atttggaacc acatcgcggg gctggagaat ccggcagacg tcttgtcaag 4740 gggaatgact cctgatttgc tgaaggactt cgaaatgtgg tggcaaggac caccgtggtt 4800 gcacctagac aaatcttcct ggccgagaac ggctaacatc aacccggagg atttggatcc 4860 ttcgctactc gaggagagat ccacggtctc agttccagct catgttttca atcccagtcc 4920 tgtcttcagt ctacgatcgt cactccaaga tttggttcga attgtgtccc tcattcggcg 4980 attcgcccac aactgcagaa atcctggcga ccgtagagca ggattcttaa ggcatgaaga 5040 acgagcagag gcattacacc agctgctgat ccttgcgcaa caagaaagtt ttcctgaaga 5100 tctcatcaca ctgcgaagaa ccggagaggt taagccaacc tcaagactga agcaactagc 5160 tccacgtcta gtgaacggat tgattctggt tggcggccgg ctacgaaacg ccaacatttc 5220 ggctggccgg aagcatccgg tcatactaga ccatcatcat cccctctcag tcctcatcgc 5280 cgctcactat caccagaaaa atctgcacgc tgggcaacaa ctgctagtcg ccagcatgcg 5340 ggaaaggtat tggccgctgg cggctcgcac cttggctcga aaggtcatcc acagctgcgt 5400 aaaatgcttt cgggtccgtc cgaaggcgca cgagcaactc atggcggact taccagcaga 5460 acgtgtcaat ccagctccag cgtttctgcg cgtaggggtg gactactgtg gtccgttcta 5520 catccagtac ccctaccgca agggcggtcc aatcaaatgc ttcgtcgccg tgttcgtttg 5580 tcttgtcgta aaggccgttc atctggaggt cgttggagac ctcacgactc aagccttcat 5640 cgccgctttg agaagatttg tctcccgccg aggtcgccca gaaatcatca tgtgcgataa 5700 cgcaacaaat tttgtcggcg cacgccgtga gttggatgag ctgcggaagt tgttcaaccg 5760 tcaagagttc gtgaaggcgg ttacagaaga ggcttcctgg gagaacatta acttcaagtt 5820 tattccggcg aagtccccca acttcggagg cctttgggag gccgccgtca agtcaatgaa 5880 ggagcatttg aagcggaccc tcggcaacac agtgctcgtg tcggacgaaa tggctaccct 5940 ggtggcacaa attgaagcta gcctcaactc gaggccaatc acaccccttt cgaacgatgc 6000 ggacgatcta gaaattttga ctcccggtca ctttttggtg gggaggccgc tcacagcaat 6060 accagaacca tcgctgcaag atctcaacga ggcaagactg tcaagatggc agcgagtgca 6120 gtattttctt caacgaattt ggaatcgttg gtcaacgcag tacctgtcca atctgcacgc 6180 ccgcacaaaa tggaccaggc aacggaataa cctgtttatc ggcactatgg tgctgctcag 6240 cgaggacaac gtgccacccc tcaagtggcg aatgggtcga gtcacagaga ttcacccggg 6300 gaacgacggc aatatccgtg tcgtcaaggt tcgcacgaag gacggcgact tccttcgagc 6360 gatctccaaa gtttgtgtcc tgccaatacg ggacaacgaa ccgtcgccct cttcggctca 6420 cggggaggat tgattccttc tcccacgagc gctctacggc gctgccgagg cctccgggtc 6480 tcggcgttcc agttaagttt ttgtttatca atatatggtc aaaaagctca gagaatgttt 6540 cgtcctccat agtcaaattc acccacggcg gccggagccg cccaaccgat tttccgaagt 6600 ttgtcatccc actactacga tgtctggtcc atctattttg tcgcagttwc gtcaatctca 6660 actcagttcc tgtggtcgtg caatcaagct gatgcgtcaa gtattctgtg ggggttgagg 6720 aatccccaca caaagttacg tatattgcat gccgttagtg atttcgtgca atcaaacacc 6780 atttcatttg cccagcagga tacagtccat acattcaccg tacctcaagg gtttacctgg 6840 cggacgatgg cctacacgat aaggtccact atcctcatag aagtcatgat gtccggtgtc 6900 gctcttgtgt cttcatccag catccacgtt catcctgaag ttgttccgat gccaggaaca 6960 gtcatccagc camcaacaga ggatagatca acggaatcat cggtggcgca tcggcgcctg 7020 cggaggcgga cggcctccgc gacccaggga agtcgttttt tgtttcattt ttaccgttaa 7080 aaaatcgctt ctcatctcat tcatagattc ggctcgaagg accacctcaa ctgaggtcgg 7140 ttggtttccc attacttgga cgtaaccgtg gtgagcctca gcgcgcacag tactaacgat 7200 cgataccaca caggccgtgt gccggaaatc gctcgcgaac gaaggaagag aatsaactac 7260 atcggtagcg gttagaggtt cccaaataca gtcaacctgg cagctgatca tcatcttcag 7320 ctaccaacag tttctgcagt gtttacatgc acgtagtatt aatattattc gctagttgat 7380 aaggtgttgg aattgttttt gcgttacgat aagaatatta gctaatttga aatccggtta 7440 tttcaaggcg gccggta 7457 // ID GLSAT2 repbase; DNA; INV; 142 BP. XX AC M11264; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE G.lateralis satellite DNA. XX KW SAT; Satellite; Simple Repeat; GLSAT2; KW Satellite repetitive element. XX OS Gecarcinus lateralis OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; OC Eubrachyura; Grapsoidea; Gecarcinidae; Gecarcinus. XX RN [1] RP 1-142 RA Fowler F.R., Bonnewell V., Spann S.M. and Skinner M.D.; RT "Sequences of three closely related variants of a complex RT satellite DNA diverge at specific domains."; RL J. Biol. Chem 260, 8964-8972 (1985). XX DR GenBank; M11264; Positions 627 768. XX SQ Sequence 142 BP; 57 A; 35 C; 36 G; 14 T; 0 other; aacaagaaga acaagacgaa gaaagaagag gaataacatc aacaacacca agaagagcac 60 ggacgactac aacaaggaga gggggaaggc aggaaggaat tgccgcgtgg cacattccgc 120 atttcaccaa gctcccctct cc 142 // ID BEL-169_AA-LTR repbase; DNA; INV; 482 BP. XX AC supercont1.326; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-169_AA_; KW BEL-169_AA-I; BEL-169_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-482 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.326; Positions 459571 460052. XX SQ Sequence 482 BP; 178 A; 74 C; 84 G; 146 T; 0 other; tgttgcgaca gaatcggtaa cactgccaag cacgacctat acatagaccg aggaatcacc 60 tatcatggaa tgaagagaaa tgtcatcgca aaaactgaaa gtcatttcta aaagcaaatt 120 gtgcattcta ttaactaagt tgaattcgat tataaaacca aaaattatta acctaaagta 180 ttctattttg aattggtatc tattaaacga gtgtgtggag taatttgttt gaactaaagg 240 ttagtaatgt gaattaaatg agctagaagt taaattatat cactataata actttagcag 300 gaaaatcact cgtccacggt agccgtttag tatcaacgaa gattatcaag caaactgatt 360 tgtaagttga aatacgttgt tacatgatag aaactaaatt tattactctt gtagtttgaa 420 gcatatcaac gacgctaaat aaacgctttc aataacagta ctggctttgt tctacgggaa 480 ca 482 // ID P-2_TV repbase; DNA; INV; 7707 BP. XX AC . XX DT 26-OCT-2009 (Rel. 14.1, Created) DT 26-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE P DNA transposon from Trichomonas vaginalis - a consensus. XX KW P; DNA transposon; Transposable Element; P-2_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-7707 RA Kapitonov V.V. and Jurka J.; RT "First examples of protozoan P DNA transposons."; RL Repbase Reports 9(10), 2164-2164 (2009). XX DR [1] (Consensus) XX CC This is a young family of autonomous P transposons. While the CC transposase-encoding region is finely reconstructed, structure of CC terminal parts pf the transposon can be obscured by a wrong CC assembly of the genome. However it is still possible that the CC terminal regions are correct, forming 1909-bp terminal inverted CC repeats. XX FH Key Location/Qualifiers FT CDS 3541..6060 FT /product="P-2_TV_1p" FT /note="P transposase." FT /translation="MNSEKIEINLSCVDIATFKKLEKLIKEKLPGVIYNNI FT ADYQTSPNQSKLLQENTIKSMIAASKNRALSYQRSLQMKKINEQRKIQDGT FT ADELEQEAEKADLETLNDNEDDSDDQDEDETNGDESLSLSEILLDDYTPLF FT KYPPFADKLTCVIINNMLYNLNFPNSKSRRYYAETLQFAFMLYCQQPSEYL FT LARQKLMLPSKATLFRVFGKDIAAWKELLLSKSSSLPVFLRFRELYPYTDE FT ELREQCHFCIGIDAITGKLWKKSGHDDQIEDLFALYLMPLDCKGPSICLHI FT IQHTNGLASGIMSEILDYIATVQKTLNVPVFTTDGDKSYDFVYQIFFHHWY FT SLVTGFDGLVEVLKSAKFQDLKIIFIADVVHIGKNLRTLLIFPNKLIFLQP FT YDPTLHVDITKIRLFSKLGDAINNCDKSTTLNDKYPVTIFNVQTVKDMLDN FT NCYTEATWLLPLALWFEGVRNTILPVEARLSMFKLAFFILHFFLRLQDVVG FT VTDRLPNIETPDHRLKFKGYLFARRMHLLKMATSLLTLYGFLEEFPNINFQ FT HLTTLPVEHLFALFRKFAHSKKEVKDLMSAGAHLVMNQTYREQNFDRLVSE FT CRLDNSGANMANVEEMPEDIKSQYIFTDEDIASITTDLMYLVGWHEDQVFS FT VWGRLIKYNHTQPLSVFEPYSKDFPRNGVEILRQKLDQIIEIKAPVPFSEN FT RGYCPSQAYETQLQKVPMAKTPLYENFNDILQFANSNSSSSTENSTTTDST FT SQGIQSQNSNFPNENLENSQNAESTDDFQSLITPIQEDHFPSANELKQGFD FT IAHDVLISPPPENQHPEIDEEETQKAIYLLQNLDFPKK" XX SQ Sequence 7707 BP; 2777 A; 1125 C; 1182 G; 2620 T; 3 other; aaaggtagtc tcacgaaaat acgttatttt cccgatgcct tggaagcagc tttgcaggtt 60 acccgtaagg taaccagtag aagtttctga gcaaggatga agaagattaa agagcctttt 120 tattgcaaac gctatatttg taagtgtctt tgtagaactt ctcataaatt caaagatttc 180 caaaaacgaa tttttcgatt ttgatataat aaatgatatt taaacaaatt atagatgtgt 240 ttgcaattaa aaaaatccga gttaactaca atatcttcaa tttttgtcgt tcactttttc 300 atgtcatact gatagcaaga atactcaccc aatttggtta cacttcatga tagtttgaag 360 taaaaaaatg ccacagtttt ttttatttga tatcgttatc taaattttgt acagaaatga 420 aataaattat aatatgaaac gtgtgttatt atgcttgttg ggaaacaatc caatacttat 480 gtctgaatga aatgattatt tctacgagat ttttcatatt cagctaataa tgctatcaaa 540 atcaaataac aataaaaagt tattttgtac ataatgtaac ttaatcttta atactatata 600 gaacttcgaa atacgactaa acggtgcgag ttacgcgatt aaataccttt ttttcgatcg 660 aaacttgaaa gtgaggataa ctcaatgggt aggggtatgg gttcgacttt ttctattggg 720 caactatttt tttcttttca ttgtttcagc atttttttat gcgattatgc ataattattt 780 gatttcctat atattcaata ataaatttgt tccgaactct taacaaataa ataaattttt 840 agaatattta aatttatagc tccctacatt aatataaagt gttcgaatag ttcatatata 900 gtttctttaa aataatttgt ttcattttaa attggtttgg cacaatgctt gacataacat 960 gcacaaagct atgccatttt aaaatatgta tatcaatcaa gccgattaag tccatcctaa 1020 acgaaaatca tcgatactca tgaatttatg atagtcgttc ttgatataat tgcgcgtttt 1080 aaaaaatgct tacttttgaa cacagaactt atctatccat atttatcaat gaaagtttat 1140 aagatgcatt aatgacaaaa ccttacttta tcaattataa tcttatcgta aatttcaaac 1200 tacaatactg tggggcaaat attaaataac cactatgttg cggattttat gtaagtctgc 1260 aaactgtaaa atgtaattaa acacaaataa aaaaaaatgt aaattttaat cgagaaattt 1320 agcaatcttg aaagccttga tcggcatctt ttacaagatg tcagtaatgc atattcaata 1380 actatgaaaa tctagccaag catggatttt tttcgaaaaa aaattgtatt aaatttcatt 1440 tttcattctt tatacaagtt ttaacctttg gtgcaattat atcctatcgc aaatgcatta 1500 ttgacgttat tacttggata cttcttcata acgctaaatc ttcaaattag ttatgtgttt 1560 gttactaata gaagttggaa aggtattaaa aaaggaaatg ttaataaata tacatgttta 1620 attaaaaaac aaacaagaat aaattaaata attttatttc ttaggaaagt ctaagttttg 1680 taatagataa attgcttttt gtgtctcttc ttcatcaatt tctggatgtt gattctcagg 1740 tggcggacta attaaaacat catgggcaat atcaaaccat tgttttagtt cgttggcaga 1800 aggaaaatga tcttcttgta ttggcgttat taatgattga aaatcatcag ttgattctgc 1860 attttgggaa ttttccaaat tttcatttgg aaaatttgaa ttttgagata ttgtttaagc 1920 actcagatct catttttaaa atatctgcac caaattctta catctgataa ttttcatgtg 1980 taaccaatgt ttcccaatcc atttgccata ccctactaat cgcaaattca ttccctttaa 2040 ttttgtatta aatttaaatt ccaatcttaa tttctacatt atttatgcta accctttaaa 2100 ttttgttctt atttattcac cctcagatac aaatttattt agaatgtatt cctctgaggt 2160 ctatatttgt attcgaaaat aaaatcttta aaactctcta agttatcatt ttctcttcag 2220 atatctaata tttttctaat gacataacta ctttggacgc atatccaaat tccaaaaaaa 2280 atatggaaaa aataaaaatg aaattgaaaa atacaatttt tgatatgtaa aaagagattt 2340 tgaaaaaaat tgggaaaaag agtatctaat aaaattgttt gagatctatg tagacatagt 2400 ttttacatta aaaaatcaat aatattgaaa ttgatctgaa tatatctttt tttttaagtt 2460 tgaatgcatt aatctcaaag catttgataa aatactggtt ttatttcaaa attctaaata 2520 atttgtaatt agatctttat gatgtaattt aattttaaac aagtacagat aaagagaatc 2580 ataataaagg tgatctatat tgcttaagta aaattgagta taatagatcg atcttagcaa 2640 caagatctga ggaagtagat ttgagagtcg agagtttaga tcttagcaac aagatctgag 2700 gaaatagatt tgagagtcga gagtttggat cttagcaaca agatctgagg aagtagattt 2760 gagagtcgag agtttggatc ttagcaacaa gatctgagga agtagatttg agagtcgaga 2820 gtttagatct tagcaacaag atctgaggaa gtagatttga gagtcgagag tttagatctt 2880 agcaacaaga tctgaggaag tagatttgag agtcgagagt ttggatctta gcaacaagat 2940 ctgaggaagt agatttgaga gtcgagagtt tggatcttag caactagatc tgagagtttt 3000 atcatttccg gcatattggg ccgaaaagga gtgatccgcg taagtatgaa aaaaagtgaa 3060 ttttgttgta tggtgttttt tattatatcc gtttctctca caattttcaa taaatattat 3120 tgttcaaacg gaaatataaa caaacaattc aaaaataaaa atgtaaatgt aaattgaaaa 3180 tgtattataa acatgctaaa aagcgtatat tttaaactta aaggaaaaat gaaattcgta 3240 aaaatgagca tgatgtttga taagggwgtt ttaatttttt ttcaaaaaaa ttaaggaact 3300 taaattggtt tcgaggcgta gatagggata aaaagatgct ttaagaagat ctcagtagaa 3360 atatggttta taagaggata taggtcagga tataagatta tgaaaattga agaagtgaat 3420 tatcaagatg tatttatgag taaactatat tgttttaagc tatctaaatc tgttttgata 3480 taaaacgaaa aagaaatgct ttaagtgttc ggccacttaa tcattttccc acaaatcatt 3540 atgaacagtg agaagataga aattaattta tcatgtgtag atattgccac atttaagaag 3600 cttgaaaagc ttattaaaga aaaacttcct ggtgttattt ataataatat cgcagattat 3660 caaacaagtc ccaatcaatc aaaactatta caagagaata ctattaaatc aatgattgca 3720 gcttccaaaa accgtgctct aagctatcaa agatctctgc aaatgaaaaa aatcaatgaa 3780 caaagaaaaa tacaagatgg aactgcagac gaattagaac aagaagcaga aaaggcagat 3840 cttgaaactc tcaatgacaa tgaagatgat tctgatgatc aagatgaaga tgaaacaaat 3900 ggtgatgaat ccttgagttt aagcgaaata ttattagatg attatacccc attatttaag 3960 taccctccgt ttgcagataa attaacgtgt gttatcatta ataacatgtt atataattta 4020 aattttccaa actcaaagag ccgacgatac tatgcggaga cacttcaatt tgcctttatg 4080 ttgtattgcc aacaaccctc agaatatttg ttggcaagac agaaacttat gttgccatca 4140 aaagcaactc tatttcgagt ctttggtaaa gatattgcag catggaagga acttttgctt 4200 tcaaaatcat cttcactccc agtattcctc agattcaggg aattatatcc atatacagat 4260 gaggaattac gtgaacaatg ccatttttgc attggaatag atgcaatcac aggcaaatta 4320 tggaagaagt ccggccatga cgatcaaatc gaggacttat ttgcattata tttgatgcca 4380 ttagattgca aaggtccatc aatatgtttg catataatac aacatacaaa tggtcttgca 4440 tctggaatta tgagtgaaat cttggattac attgctacag ttcagaaaac actgaatgtt 4500 ccagttttta caactgatgg tgacaaatct tacgattttg tttatcagat attctttcac 4560 cattggtaca gcctagttac tggctttgat ggattagtag aagtccttaa aagtgcaaaa 4620 ttccaagatc ttaagattat cttcatcgcg gacgtcgttc atataggcaa aaacctcaga 4680 acgttattaa ttttccccaa taagttaatc tttttgcaac cttatgatcc aacgctgcac 4740 gtcgatatta caaaaattag attattttcc aaacttggag atgcaattaa caactgtgat 4800 aagtctacca cactcaatga caaatatcct gtaacaatct ttaacgttca aaccgtcaag 4860 gatatgctag ataataattg ctacactgaa gcaacttggc tccttccact tgcactttgg 4920 tttgagggag tccgaaatac catattacca gttgaagcaa gattgtcaat gttcaaactt 4980 gcttttttca tcttgcattt ctttctcaga ttgcaagatg tggtcggagt cactgataga 5040 cttcctaata tagagacacc agatcatcgt ctcaaattca aaggttatct ttttgccaga 5100 agaatgcatc ttctgaaaat ggcgacatct cttctcactc tttatggatt tttagaggaa 5160 ttcccgaata taaattttca acatctcaca acccttccgg ttgaacattt atttgcctta 5220 ttccgaaaat ttgcgcattc aaaaaaagaa gttaaagatt tgatgagtgc gggagcgcat 5280 ttggtaatga accaaaccta ccgtgaacaa aattttgatc gtcttgtgtc agaatgcaga 5340 cttgataatt caggagcaaa tatggcaaat gtcgaggaga tgccagaaga tattaagtcc 5400 caatatatat tcacagatga agatattgcc agtattacga ccgatcttat gtatttagtt 5460 ggttggcacg aagatcaggt attttcagtt tggggaagat taataaaata caatcacaca 5520 caaccattaa gtgtttttga accttacagc aaagattttc cgcgtaatgg agtggaaatt 5580 ttaagacaaa aactggatca aattattgaa ataaaagctc cagttccttt ttcagaaaat 5640 aggggctatt gcccatctca agcatacgaa acacagttac aaaaagtgcc gatggccaaa 5700 acacctcttt atgagaattt caatgacata ctacaattcg caaactcaaa ttcctcttct 5760 tctacagaaa actctacaac tacagattct acatctcaag gaattcaatc tcaaaattca 5820 aattttccaa atgaaaattt ggaaaattcc caaaatgcag aatcaactga tgattttcaa 5880 tcattaataa cgccaataca agaagatcat tttccttctg ccaacgaact aaaacaaggg 5940 tttgatattg cccatgatgt tttaattagt ccgccacctg agaatcaaca tccagaaatt 6000 gatgaagaag agacacaaaa agcaatttat ctattacaaa acttagactt tcctaagaaa 6060 taaaattatt taatttattc ttgtttgttt tttaattaaa catgtatatt tattaacatt 6120 tcctttttta atacctttcc aacttctatt agtaacaaac acataactaa tttgaagatt 6180 tagcgttatg aagaagtatc caagtaataa cgtcaataat gcatttgcga taggatataa 6240 ttgcaccaaa ggttaaaact tgtataaaga atgaaaaatg aaatttaata taattttttt 6300 cgaaaaaaat ccatgcttgg ctagattttc atagttattg aatatgcatt actgacatct 6360 tgtaaaagat gccgatcaag gctttcaaga ttgctaaatt tctcgattaa aatttacatt 6420 tttttttatt tgtgtttaat tacattttac agtttgcaga cttacataaa atccgcaaca 6480 tagtggttat ttaatatttg ccccacagta ttgtagtttg aaatttacga taagattata 6540 attgataaag taaggttttg tcattaatgc atcttataaa ctttcattga taaatatgga 6600 tagataagtt ctgtgttcaa aagtaagcat ttttttaaaa cgcgcaatta tatcaagaac 6660 gactatcata aattcatgag tatcgatgat tttcgtttag gatggactta atcggcttga 6720 ttgatataca tattttaaaa tggcatagct ttgtgcatgt tatgtcaagc attgtgccaa 6780 accaatttaa aatgaaacaa attattttaa agaaactata tatgaactat tcgaacactt 6840 tatattaatg tagggagcta taaatttaaa tattckaaaa atttatttat ttgttaagag 6900 ttcggaacaa atttattatt gaatatatag gaaatcaaat aattatgcat aatcgcataa 6960 aaaaatgctg aaacaatgaa aagaaaaaaa tagttgccca atagaaaaag tcgaacccat 7020 acccctaccc attgagttat cctcactttc aagtttcgat cgaaaaaaag gtatttaatc 7080 gcaccgttta gtcgtatttc gaagttctat atagtattaa agattaagtt acattatgta 7140 caaaataact ttttattgtt atttgatttt gatagcatta ttagctgaat atgaaaaatc 7200 tcgtakaaat aatcatttca ttcagacata agtattggat tgtttcccaa caagcataat 7260 aacacacgtt tcatattata atttatttca tttctgtaca aaatttagat aacgatatca 7320 aataaaaaaa actgtggcat ttttttactt caaactatca tgaagtgtaa ccaaattggg 7380 tgagtattct tgctatcagt atgacatgaa aaagtgaacg acaaaaattg aagatattgt 7440 agttaactcg gattttttta attgcaaaca catctataat ttgtttaaat atcatttatt 7500 atatcaaaat cgaaaaattc gtttttggaa atctttgaat ttatgagaag ttctacaaag 7560 acacttacaa atatagcgtt tgcaataaaa aggctcttta atcttcttca tccttgctca 7620 gaaacttcta ctggttacct tacgggtaac ctgcaaagct gcttccaagg catcgggaaa 7680 ataacgtatt ttcgtgagac taccttt 7707 // ID Gypsy-263_AA-I repbase; DNA; INV; 4892 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-263_AA_; KW Gypsy-263_AA-LTR; Gypsy-263_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4892 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [3933-4394] - Integrase core CC 'CCAAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2505..4340 FT /product="Gypsy-263_AA-I_1p" FT /translation="MINFFVTLIRHICFISFRYITTFITHKGVFRYKRFMF FT GISCAPEMFQKIMEQILADCECVINYIDDVFIFGETEQEHDEALRKVLEVF FT NSRGILLNQAKCVFKVPEITFVGHRLSSKGVQPMEEKTAVLKSFRAPSSTE FT ELRSFLGLVTYMGRFLPDLGTVTEPLRGLLRKGVSFHWEGRHEDAFQQIKT FT MICNISHLHYFDNHFRTRLIADASPVALGAVLVQFSEADDIPRVVSYASKS FT LSSTEKRYCQTEKEALALVWAAERFSMYLLGREFELETDHKPLETIFSPSS FT KPCARIERWVLRLQSFKYVVKYRRGAANVADPFSRLATVTENNEFENEGKF FT MVLAIMDSVAVDTDGIERASMDDAELCSVRECIQSGNWNKPEVKPYEVFRN FT EIGVMDNIVVRGNKMVIPLSLRSRMLKLAHEGHPGESLMKRRLRDRMWWPG FT MDREITKFVTMCDGCRLVGLPSRPEPMQRRPLPSRPWIDVALDFLGPLPSG FT EYLLIIIDYYSRYKEIEIMNRITAKATIDRLERIFTRLGYPMTITLDNARQ FT FISAELEQFCSEHGVHLNNTTPYWPQENGLVERQNRSVLKRLQISQSLQRD FT WKADLNDYLMMYYT" XX SQ Sequence 4892 BP; 1392 A; 1062 C; 1238 G; 1195 T; 5 other; aactggcgac gagctgaaga atccagcgtg gatgaatttg ttttgatttg gtaagaaata 60 gagcggaaaa ttggctcgta caaacgcatc atttggccac aacaaattga ggttagcgta 120 tgttgktttc gataataata gctggtgctg tcggtgccga attgtgaaat tgtgaatacg 180 ggcgcttgga aagaatgtgc tactcaaact agttttacga ttgagcataa acacagagtt 240 gagtttagcc aagaataaca aactgcagct atatggcatt cggaacacag catcgctgtg 300 gtggtctacg agaccgtatg agattcagaa cacagcattg ctgtgatggt ccttgagacc 360 atgtctgttg aagtaattac agtattgctg tacttaactt cagggtagta actgagtttt 420 agaacgcaac actgttgcga tgatcctcga gatcatgtct gagaatatak aatagtatgg 480 gtacwgatgg tcattgaaac cacacacgga acaaacgata ttagttccat tgcacaaaca 540 aatggaatat aatttctaat gtgatacatc atggtatttt gttttatatt tctttagatc 600 aagatggaca agtggaatat ccaaccgttc aatttcaaag cgctgcctgt tacgcaagtg 660 agagatgcgt ggatcaaata caaacgtaac ttcgaataca ttgcattggc aaacgaagaa 720 gagagccaaa cgaaattgca atatattttc ctcgcaaaag ccggaccaga tgtacaggag 780 gtatttgcat cgttgcaagg aggggccgcc gggggcggcg cggccggccc cggggggggg 840 gggcggcccc atagacgttg agaaatgaga aggataacga agcacccttc gatgacgtta 900 ttgcgaagct ggacgaatat ttcgcgccac gccagcacga aacgtttgaa cgacacaatt 960 tttggacgtt gaagcaaaat ccagatgagt cgctggagaa atttctgctg agagtaacgg 1020 atcaagcgtc ggaggtgcaa ttttggcaag accaaagagg aaagtgccga aatcagtgtg 1080 atcgacaaaa tcattcttct tgctccatca gatcttaaag agaggctgtt ggaaaaagaa 1140 aagctttcgt tgaatgaact taccaaaatg gtgaattcct actcttcggt aaaattccaa 1200 gcaagtcaga tgcaaccggc aacgatcaac agttaccgtt caatgctatc aacaaaagcc 1260 agcgatcgca gaacaatcga tccgcccgtc ccggtgctct cgtgcggcga gaggtccaca 1320 gcggaaacga tcatgcccgc caggacgagg atgcgcatgt gcaagccgga catttcgcga 1380 gcgtgcgaac gcacgggccc taaggggacg gaaccggcca gaaccagggc gcaagcgacg 1440 cgtgcgaacg atcaggcggg agggcgaacc acacagcttc tcgcgtgtcg gcgacgggga 1500 gagtcttggg tcaagatcgg ggcgtgctcc aaacgcatga ctctggcgaa gaacatccgg 1560 cgacgaactg ggagaattag gttcaaggcc aagtgaaatc cggacagtcc gactcacgtt 1620 caggctacgg aaaggctcga ccctgggtag gaggtgttcg aggcggatca cgacggggga 1680 aggccctcag gcacggcacg ttctcgtcgt tgcagaaggt tcacaatcca tacttggcaa 1740 agagacggcc aaatctctgg gagtgctgtc gatcggattg ccaagtacac gttcaactgc 1800 aataaacgcg atacgcgaaa acaatgacaa acgacctttt ccgaagatca aaggagtcca 1860 acttcgcata ccgattgaca agaccgtttc gcctgtggcc ttgcaccgag tcccgcgggc 1920 gggccgccgg gcggccgccg gcccccgcgg actcggcgcc attgcccaac acgctcgacg 1980 accgccactt gctttgatgg atcgcattga ggagaaattg aacatgctgt tgaaggtgga 2040 tataattgag cctgtgcatg aatacagcca gtgggtgtct ccgctggtgg caatagttaa 2100 agacaacgga gacttgaggc tatgtgtgga catgaggcgc gcaaatgaag ccatacgtcg 2160 agaaaaccat ttaatgccta ccttcgagga cttcttacca cgcctcaaga aagctaaatt 2220 ctttagctta ctagacgtga aagaagcatt ccaccaggtt gagctcgaag agtcctgcag 2280 gtaaataatt gattgatgat gattaaccat gttcattatt tccaacttgg gcagtggttc 2340 ccagtctcgt tcattacgcg gaccacttta taatttcaag gtttacataa atcgcatttg 2400 attggctatc ggcgcgtggt gttcaaattg aacactacat ttgagtggac actttgaatt 2460 tgttgtttgt atttatctga aaaagcactg tttaaattca agatatgatt aactttttcg 2520 ttactttaat caggcatatt tgttttatat ccttcaggta tattacgacg tttatcactc 2580 acaaaggtgt attccgttat aagagattca tgtttggaat ctcttgcgca cctgagatgt 2640 tccaaaagat aatggaacag atccttgctg attgcgagtg cgtgataaac tatatcgatg 2700 atgtgtttat cttcggcgaa acggaacagg agcatgatga agcccttcgg aaagtgttgg 2760 aggtttttaa ttcgcgcggg atacttttga accaagctaa atgcgtcttc aaagtaccag 2820 aaataacatt cgtgggtcat cgtctttctt caaagggtgt gcaaccaatg gaagagaaaa 2880 cagcagtcct aaaatcattc agagctccat caagtacgga ggaattgcgc agttttttgg 2940 gactagtcac gtatatgggt cgttttctac cggatttagg cacagtgact gaaccactgc 3000 gtggattgct ccgaaaaggt gtttcgttcc attgggaagg aagacatgag gatgcttttc 3060 aacaaattaa gacgatgatt tgtaacatta gccatctcca ctatttcgat aaccattttc 3120 gcacccgcct aatagccgat gcctctccgg tcgctttggg agccgttttg gtgcaatttt 3180 cagaagcaga cgatatccca agagttgtaa gctatgccag caaaagcttg agctctaccg 3240 aaaaaagata ctgccaaact gagaaggagg ctcttgctct cgtttgggct gctgagcgct 3300 tctcgatgta tttgttgggc cgtgagtttg agctagagac agatcataaa cctctggaga 3360 ctattttctc accwtcatct aagccgtgtg cgcgcatcga aagatgggtg ctacgcctac 3420 aatctttcaa atacgtcgtt aaataccgca ggggagcagc aaacgttgct gatccgttct 3480 ccaggctcgc gaccgttaca gaaaataatg agttcgagaa tgagggcaag ttcatggttc 3540 ttgcaattat ggactcggtt gctgttgata ccgatggaat agagagagca tcaatggatg 3600 atgcggagct ttgtagcgtc cgtgaatgca tacagtcagg caattggaac aaaccggaag 3660 ttaaaccata tgaggttttc cgaaacgaaa taggagtgat ggataatatt gttgtacgtg 3720 gaaacaaaat ggtcataccc ttgagtttgc gaagtcgtat gttgaagcta gcccacgaag 3780 ggcatcccgg cgagtctctc atgaagagaa gattgcgaga tcgaatgtgg tggccaggta 3840 tggaccggga gatcactaaa tttgtaacta tgtgcgacgg ttgccgactg gtgggactgc 3900 cgtccagacc tgaacctatg cagcgacgtc cattgccttc gcgaccgtgg attgacgttg 3960 ctttggattt tctcggccct ctcccatccg gagagtatct tttgatcata attgactatt 4020 actcgcgata taaggagatt gaaatcatga atagaataac cgcgaaagca acgatcgatc 4080 gtctagaacg tattttcact aggcttggct atcctatgac aataactcta gacaatgctc 4140 gtcaattcat tagcgcagag cttgagcagt tttgttcaga gcatggagta catctgaata 4200 acacaacccc ttactggccc caggaaaacg gccttgtcga gcggcagaat aggtctgtcc 4260 tcaagcggct tcagataagc caaagtctgc agcgagactg gaaagcggac ctaaacgact 4320 accttatgat gtactacacg wcaccacata ctgttaccgg aaaaacccca acggagttat 4380 gtttcggtcg cactataaga tcgaagctac cttcattgca ggatgttgaa gttgcctatc 4440 gggatgatga agtttatgat cgggaccgac tggctaaaca aaaagggaaa gagcaaggag 4500 atcacaatcg tcgggctaca ccgtcaaata ttcagattgg agacgatgta ctgatgaaga 4560 acttactgcc gaacaacaag ctgacaccga ctttcgaccc aaccgagtac gtggttttgg 4620 acaaatctgg ggctcgggtt accgttcaga acaagagtaa cgataagatt tatcaacgaa 4680 attccgccca tctgaagcag attcctaagc aggttgaccc acatcaaacc gttgacagta 4740 acgaaaaccg ttcggaccgc gacacaatcg atgaagttga aacctctgga catgacacga 4800 tctctaaggt gcgccggaat atccgacgtc ccctacgttt tgatgattat attttggatt 4860 ccgagacttc gtctctaaaa gaacaaacga ga 4892 // ID Gypsy-67_CQ-I repbase; DNA; INV; 6862 BP. XX AC AAWU01019862; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-67_CQ_; KW Gypsy-67_CQ-LTR; Gypsy-67_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6862 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 513-513 (2011). XX DR GenBank; AAWU01019862; Positions 84358 77497. XX CC Positions [3530-4033] - Reverse transcriptase CC Positions [5087-5563] - Integrase core CC 'AAGT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 330..2711 FT /product="Gypsy-67_CQ-I_2p" FT /translation="MASNNNQVGLGTYLNPSELLPDEVEYELNLRRINISH FT LQLEDRRLVLRRAQLAEMNQPVVLTSRRTIIQDTNFIQSKLVDLRYELERC FT GARSELLTRLRHIRLRVLRTEAPTRDQVMVKNELLEEIDFYFGVYASPLGG FT PRRNASGGPQLSNTSGQSNPDGAGTQSNVHGAQGGAGPNTQPGRSRTGTVP FT RRTTNPTPNPETAPPRRLIFDLASNSWIDNTGANSSHGGNFDSPLARGGQN FT RQSTQNHGPSEHSSPYVDETARVPPRNSRLNPNVRSELSQIVDDVLTTRME FT GMMDQLMQSMLTFIDQNQHRNRDETEEPEIGNLNLGNNQAGGRPYRDRSEP FT PPGGNGINANPRGRPIDGRGSGNPDFRPRDTPFDQGLPDNYPDGDGAQGYD FT TRNRNPENNDGIRGDPSLGNDGNRGSSSQNPPFGRGNLDRYSGNRDLSRGN FT SSGNPDRWPDRNHGQRCTEGEFERYRESGGRPEQPSPNRSMDQGSVPDYDP FT GRRGFEGRYDNHRIPINKWPIKFSGDQKVMSVEEFVKRVEILAGNNQVSDE FT ELLRKANFLFKSESVAETWYYTFSHKFASWQMLKHQLRLRFETPNKEKVLA FT RQIRNREQYPNETFQIYRSEIERLNQQMSRPLGDGELCSILFDNMKDWYRP FT HFAFMRPENLTVDLLSDLCHELDKSVYRTYAPRPRPYHVNCLEEDGAYAYP FT PDEEPYGEEVNAMSRSRPRRFARPEQGEPNQNAPETENNVLCWNCRQFGHF FT WRGCQRNIKLFCHVCGRPDVTTANCPENHPRMENQKNPFLEGN" FT CDS 2552..5938 FT /product="Gypsy-67_CQ-I_1p" FT /translation="MLELSSVRAFLERMPEEHQAVLPCVRKTRRHDRELSG FT KSPQNGEPKKPILGRELGRTSSNANNTPDSSFDRFSELLTINVRENRCPYV FT TVSIFGTSVEGLLDSGAEACILSDAELLNKHNFTRSPTKLKIRTADGTAHN FT CLSSVYIPYTFRGVTQVIPTLFVPNLAKKLILGNEFWKLFKITPAVEEHGQ FT LTQLNVNEVGVDEISINFIETYFAEESAEINLVLEPDTGFIRPVPVEEDLS FT LELPAVEPPYQKPIVSVETEHPLSPTENKQLLAVVRKLMGNGKLGRTHVLE FT HKIEILEGERPKKPPRYRVSPAIQKEMDKEIERMKELDVIEESDSDWCNPL FT LPVRKSSGEWRLCLDCRRVNEVTKKEAYPFPDMQVILGRIEKARFFSIIDL FT SKAYWQIPLAMESRDYTSFRAGKSLYRFKVMPFGLTGAPTTQTKLMNKVLG FT YDLEPNVFVYLDDIVVTSHTFEEHLQLLQKIADRLAAANLSISIEKSRFCQ FT KKISYLGYVLTEEGLSIDSQKLEPILSYPAPTTIKEIRRLLGMVGFYKQFI FT PNYSTVLAPITDLLKGKSKIGWTERAEKALNEIKSLLTSPPVLANPDFSQP FT FVIESDASLVAAGAVLVQYQAGIRRPIAFFSKKFSATQVKYSATERECLAV FT ILAIEKFRHYVEGTRFKVVTDAQSLKWLNQVSIEGNSARLVRWALKLQQHD FT IILEYRKGKLNVVADALSRAVPEPDVFTIDLDLERLKKNIAKDEERYKDFR FT VIRGKVYKFVASSTPNDQRFEWKMVPTKAERILILDAEHSVAHFGFYKTLK FT KIQEKYYWPGMTVDVKRFCRGCEVCRTSKYPNTARVPPIGRSKVASMPWQM FT ISVDYMGPFPRSKHGNTVLLVITDHFSKFVIIQPMREAKTTALVAFLESMV FT FLQFGVPEIVISDNGPQFKSALFGNLLKRYKVNHWKNASYHPANNPTERVN FT RVIVAAIRTYLKDDQREWDENINQVALAIRTAVHESTEFTPYFINHGRNYI FT SSGEEYSRIRETNGGTEQEAEKLGEDMKIIFEKVKLNLKKAYDRYGKYYNL FT RSKAKAPTFEKGEVVLKKNYGLSNKAKRFNAKLANTYSPAKVIAKLGSHCY FT ELEDLTGQKLGVFRSADLQKR" XX SQ Sequence 6862 BP; 2059 A; 1569 C; 1710 G; 1524 T; 0 other; ttttggcgcc caacgtgggg cgataattga ttagggcgct ttagagagtt gtagttgtag 60 acaaaaaaaa aaaaacgatt attctcggaa cgacacagta tttgcaaatc gagtaaatta 120 ggattattcc gggcgcgatc agtggaaaat aaaggtagaa tttagggaaa tctattcgcg 180 aatctattgg tatttgggaa attgtataac ggggtgaaaa attggggttt gcttacttta 240 aatttgaatt agcataactt ataattattt aaaaattatt gaaaaaaaaa acaaggaatt 300 atcaaaaaaa aaaaaaatca tttctcaaca tggcttcaaa taacaatcaa gtcggtctgg 360 gaacgtattt gaacccgagc gagctcttac ccgacgaagt agagtatgag ttgaacctcc 420 gacgaataaa catttcgcat cttcagttgg aggatcggag gctggtgttg cgacgtgccc 480 agttggcaga aatgaaccaa cccgttgtcc tgacctctcg tcgaacgatt atccaagaca 540 cgaactttat acaatccaag ctcgtcgact tgcgatatga attggaacga tgcggtgcca 600 ggtcagagtt attaacacgg ttgagacaca tccggttacg tgttctgcgt accgaggcgc 660 cgacgagaga ccaggtcatg gtaaagaatg aattgttgga ggaaattgac ttttactttg 720 gagtttacgc cagtcctcta ggtggaccgc ggaggaacgc ttctggggga cctcagctgt 780 caaacacaag tggacagagt aatcccgatg gcgccggaac gcagtcaaat gtgcacggag 840 cacagggtgg agcaggtccg aatacgcaac ccgggagaag tcgcacggga accgtaccga 900 gacgaacgac caatccgacc ccaaacccgg aaacagcgcc tcccagaagg cttatcttcg 960 atttggcatc aaactcgtgg attgacaata ctggagcgaa ttcttcgcac ggtggtaact 1020 tcgactctcc cctagcgaga ggcggccaga atcgtcagag cacgcaaaat cacggaccat 1080 cggagcattc gtcaccgtac gtcgatgaaa cagcacgggt gccgccaagg aattccagac 1140 tgaacccgaa cgtgagatct gaacttagtc agattgtgga cgacgtttta acaacacgaa 1200 tggaaggaat gatggatcaa ttaatgcaaa gtatgctcac atttattgac caaaatcagc 1260 atcgaaatcg agacgagacc gaggaaccgg aaattggaaa cctgaacctc ggcaacaacc 1320 aggctggcgg aagaccatac cgggatcggt cagagcctcc tccaggcggc aatggaatca 1380 acgccaaccc cagaggaagg ccgatcgatg gacgtgggtc cggcaacccg gacttccggc 1440 cgcgagatac cccgttcgat caaggattac cggacaatta tccggacggg gatggcgccc 1500 aaggttacga cacgcgaaat cgaaacccgg agaataacga cggaattagg ggagatcctt 1560 cgttgggaaa tgatggaaat cgtggttcct caagccagaa tccgccattt ggccgtggaa 1620 atttggaccg gtattcagga aaccgcgatt tgtcgcgagg caactcttcg ggaaacccag 1680 atcggtggcc ggatcgaaat cacggacaac ggtgtacgga aggagaattc gaacgatacc 1740 gcgagtcagg tggaaggccc gaacaaccaa gtcctaatcg gtccatggac caggggtcgg 1800 ttccagatta tgacccaggg agacgtgggt tcgaaggacg atatgacaat caccggatcc 1860 cgatcaacaa gtggccaatc aaattcagcg gcgatcagaa ggtcatgtca gtcgaagagt 1920 ttgtaaaacg cgtcgagata ctcgctggga ataaccaagt cagcgacgag gaactgttga 1980 gaaaggctaa tttcctcttt aaatcggaat ccgtcgcgga aacctggtac tacacgttta 2040 gtcacaaatt tgcatcgtgg caaatgttga agcaccaact ccgccttcga tttgaaactc 2100 cgaacaagga aaaagtacta gcgagacaga tccgaaatcg agagcaatac cctaatgaaa 2160 cgttccagat ctaccgaagt gaaatcgagc gattaaatca gcagatgagt agacctctgg 2220 gtgacggcga gttgtgcagt atcttatttg ataacatgaa ggactggtac cgtcctcatt 2280 ttgcgttcat gagacccgag aacctgactg tagacctgct gagcgatcta tgccatgagc 2340 tggacaagtc ggtttatcgc acgtacgcac cccgcccgcg accgtatcat gtcaattgcc 2400 tggaagagga cggagcttat gcgtacccac cggacgaaga accgtatgga gaggaagtca 2460 atgcgatgtc cagatcccga cctcgacgat tcgctagacc agaacaagga gaaccgaacc 2520 aaaacgcgcc ggagaccgag aacaacgtac tatgctggaa ctgtcgtcag ttcgggcatt 2580 tctggagagg atgccagagg aacatcaagc tgttttgcca tgtgtgcgga agaccagacg 2640 tcacgaccgc gaactgtccg gaaaatcacc ccagaatgga gaaccaaaaa aacccattct 2700 tggaagggaa ttagggagga cctcttccaa cgcaaataac actcccgaca gctcattcga 2760 tcgtttttct gaactgttga ccatcaacgt tcgggaaaat cggtgccctt atgtgaccgt 2820 ttcgatcttc gggacttctg tcgaaggact gttagactca ggagcggagg catgtattct 2880 aagtgacgcc gagctgctca ataaacacaa cttcactagg tctcccacga aactgaagat 2940 ccgaaccgct gacggaaccg cacacaactg cttgagttcc gtttatatcc catacacgtt 3000 ccgaggggtt acccaggtga tccctaccct gttcgtacct aatctggcca agaaattgat 3060 tcttgggaac gagttctgga agttgttcaa gatcacaccc gcggtagaag aacatgggca 3120 actcactcag ttgaacgtta acgaagtagg cgtggacgag atctcgatca attttatcga 3180 gacgtacttc gccgaagaat cggcagagat caacctcgtt ctcgaaccgg atacgggatt 3240 catccgaccg gtacccgttg aggaagatct tagtctcgag cttccggctg tagaacctcc 3300 atatcagaag ccgatcgtga gtgtggagac ggagcatccg ctgtccccaa ctgagaacaa 3360 acaacttcta gctgtagtgc gtaagctgat gggaaacgga aaattaggac gaactcatgt 3420 tttggaacat aaaatcgaaa ttttagaagg agagcggccg aagaaaccgc cacggtatcg 3480 agtttcaccg gccattcaga aagaaatgga caaagagatc gagaggatga aggagctcga 3540 cgtgattgaa gagtcggatt cagattggtg taatccgttg ctcccggtcc gaaagtcgtc 3600 cggagaatgg cggttgtgcc tcgattgtcg tcgagtcaac gaagtgacca aaaaggaggc 3660 gtatccgttc ccggacatgc aggtaattct cgggagaatt gagaaagccc gattcttctc 3720 gatcattgat ctgtccaagg cgtattggca aataccgcta gcgatggaaa gtcgagatta 3780 cacgtcgttc cgagctggga agtcccttta ccgcttcaag gtgatgccgt ttgggttgac 3840 gggagcgcct acaacacaaa ccaagctgat gaacaaagtg ctgggatacg acctggaacc 3900 gaacgtcttc gtgtatctgg acgatatcgt cgtcacctcc cacactttcg aggaacatct 3960 tcagttgttg cagaagattg ctgatcgcct ggccgctgcc aacctttcaa tcagtatcga 4020 aaaatctcgc ttttgtcaaa agaaaatctc ttacttgggc tacgtcctga cggaagaagg 4080 gttgtcgatc gatagccaaa agctcgaacc cattttgagc taccccgcac cgactacaat 4140 taaggagatt cgtagactgt taggcatggt cgggttctac aaacaattca tcccgaacta 4200 cagtacggtg ctcgcaccga ttacggacct cctgaaggga aagtctaaga ttggatggac 4260 cgaacgagcg gaaaaggctc tgaatgagat caaaagttta ctcacatccc caccggtgct 4320 cgccaatccg gacttctccc agcctttcgt catcgagtca gacgcgtcct tagtcgccgc 4380 tggagctgtt ttggtgcagt atcaggccgg aatccgtcgt ccaatcgcgt tcttttcgaa 4440 gaaattctca gcaacccaag tgaagtattc tgcaactgaa cgggaatgcc tcgccgtaat 4500 cttggccata gagaagtttc gacactatgt ggagggcacc agattcaagg tcgtcaccga 4560 cgcgcaatcg ctgaagtggc ttaatcaggt cagcattgag ggcaactccg cgcggctggt 4620 cagatgggca ctgaaactgc agcaacacga catcatcctg gaatatagga aagggaagtt 4680 aaacgtggtg gcagatgccc tgtccagagc tgttccagaa ccggatgtgt tcacgataga 4740 tctagatctc gagcggctca agaagaacat cgccaaggat gaggagcggt acaaagactt 4800 ccgggtgatc cgaggaaagg tctataagtt cgttgctagc tcaactccaa acgatcaacg 4860 attcgagtgg aagatggttc ctacgaaagc cgaacgcata ctgattctgg acgccgagca 4920 cagcgtggcc cacttcgggt tctacaaaac gttgaaaaag atccaggaga agtactactg 4980 gcccggcatg accgtcgacg tgaagcggtt ctgtcgaggc tgtgaagtct gcagaacctc 5040 gaaatatccc aacacagctc gagttccgcc gataggtcga tcaaaggtgg ctagcatgcc 5100 ctggcagatg atttccgtcg actacatggg accatttccc cggtctaagc acgggaacac 5160 ggtcctgttg gtgatcaccg accatttctc gaaattcgtg atcattcaac caatgaggga 5220 ggccaagact accgccctag tggcgtttct ggaatcgatg gtatttttgc agttcggggt 5280 ccccgaaatc gtaatctctg acaacggacc gcaatttaaa tccgcgttgt ttgggaactt 5340 gttgaaaagg tacaaggtta atcactggaa gaatgcgagc taccacccgg ccaacaatcc 5400 gaccgaacga gtgaaccgag tcattgttgc ggccattcgg acgtacctca aggacgatca 5460 acgcgaatgg gacgaaaata tcaaccaagt ggcccttgcc atcaggaccg cagtacacga 5520 atctaccgag tttacgccct acttcatcaa tcatgggcgc aactatatca gttccgggga 5580 agaatattcg cggatcaggg aaaccaacgg aggaacagaa caggaagccg agaaactggg 5640 agaggacatg aaaataatat ttgaaaaagt gaaactcaac ctaaaaaaag cttacgatcg 5700 ctacgggaag tattataatc tccgttcgaa agcgaaagcc cccacttttg aaaaagggga 5760 agtagtgttg aagaaaaatt atgggttatc taataaggcg aaaagattta acgctaaatt 5820 agcaaacact tactctcccg ccaaggttat cgccaagttg ggtagtcatt gctacgaact 5880 ggaagattta accggtcaaa aattgggagt tttccgttcc gcggatctcc agaaacgttg 5940 atcttgggat aagaaagcac tagcttgtac aactctaaac taaactaaag tacattgagg 6000 gtaaacaaac caaaccagac ttaccatcat tttttctttt attctcttct ttgcaaaaca 6060 ataaacaata ccccattcaa acactcccac ttttcatttt ccttaagtaa acactcccct 6120 aaaccatgcc acaatactag ccaccaccac caccactatt tacttaattt agttaataaa 6180 tttttagaaa taactcgaag aatatttaga taaaagttca taaaccgcaa accagagcgg 6240 aaaaaaactc gcgacgggac tgacgtttga attgttttga tttgccgagg ttgtcggtct 6300 gcaagtcggt aaagcggttt agtacacgag agcaaaaagc cagtggcgta cgctctcaga 6360 ctaggttaga aaaaaagagt aggaaaagat aaaactttag gcggaattta tggaaccaaa 6420 ctgagttcaa caaaggtact aaaattagcc attttgttcg gataactatc tgaaacggag 6480 gagtatatct cctaacaagg gaatacattg ttctatagct gaattcaata agcggcaaga 6540 gaaggaatga ccttcagtta tgattcacgc ggaatcgagt acgggtcata gtggattttt 6600 atctgcaagt cgtataataa acgggaaaaa tcgaaattag gaagaacacg taatctgaag 6660 aagcaggact acaaatagta atccagtgcg cttcgtaaac gaaataaaat gtggattagt 6720 tacgctgtgg ttctaagaat aatcgttgag aactaaaata ataagtaagg aacaaaagcg 6780 ccctcatcaa ttcagtttaa gtgatcgtat aagtgtaaaa aaaattcttg cataattttt 6840 tttttgttga ggaaggggga aa 6862 // ID hAT-N17_AP repbase; DNA; INV; 297 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N17_AP. XX NM hAT-N17_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-297 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2117-2117 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 297 BP; 99 A; 39 C; 66 G; 93 T; 0 other; cagtggttct caacctgggg ggcgcgcccc cctagggagg cgtgacaaaa tttcaggggg 60 ggcgtgaaaa tgttgaaaaa aaatgttaat tgtatattta ataatagtta attttaattt 120 taaattataa ttttttttca ttgaagttgt attcatatta atccaataac caacattaaa 180 gtatagttta catataaata ttatgcatac atcaatctta tagtagtgag gggggaggcg 240 tgaactcata aaatagtttg aaagggggcg tggacctcaa aaggttgaga acccttg 297 // ID CR1-104_AAe repbase; DNA; INV; 4835 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-104_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4835 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1192-1192 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 29 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 417..1229 FT /product="CR1-104_AAe_1p" FT /translation="MACNICQKVISDDDGVTCRGFCGNSYHMLCVRLDLSH FT EALEAHRKNLFWFCDGCADTLSNDGIQKFAHHCNHDIAHDNVTLQSLKDDI FT CGLKEVVNALTTKFESKSLTPSLRAPRRWIADTLPVPNTPKRIRDDGPMNQ FT PNAKPCNIRGTKTASELVKTVAPPEDLLWIYLSAFHPSTSENDIVNLVREC FT MDMGTDAQPKVVRLVPKDKDPATLSYVTFKVGVSKALKEIALSMDTWPDNI FT FFREFDNSLNSNRRVLRITASNSQSGGGGR" FT CDS 1232..4549 FT /product="CR1-104_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSSLDPHVPASVFNPERTACSIMEAPSPPDPVALPGC FT SRLHSRSGPVVEDGVGVFQPLITGKYTFHTERPLLESLPNSSTSMVQDPNA FT SPNRIVYTSTHRSSTTLGHCPRSSSTAIIPECTAQNTETCTHSVSTSGSAN FT ISSSSRTISDNAKLTVYYQNVRGLRTKIDDVFVAISATNYDVIVLTETWLS FT DPILSTQIFGDAYTVFRNDRNARNSRKSRGGGVLIAVARHINSAADSTPIV FT DSLEQLWVKLKFTNFSMSVGVIYLPPDRKSDLDSVDNHVESIGAVLSHCQT FT NDHAILLGDYNQSGLVWDLHTDDPPKVDVLRSHISPACASLLDGFNLHGLT FT QINTIFNRNNRLLDLALVNETSLGNCTLSEIADPLSTLDADHPPFVIELGL FT TKPEFDDSLDPPSLDFLRANYSGLAEALASINWAAVDFDTEVDEAVQIFSR FT ELRSAMAEHIPLRRPSSKPPWGNSHLRRLKQVRSKLQRKYRRNQCPVSKQL FT FNRASYQYRIYSRFLYKRYTQRTQQNLRRNPKQFWKFVNSKRKEKGLPVEM FT HLKTEVGSSAREKCNLFASHFKSAFNNFVSSASQIEMACRDTPENAFQMER FT FQITTEHVQTAILKLKLSYVSPDGIPSCIFKKCSDALIQPLTSIFNASLRQ FT GKFPTSWKSSEMFPVHKKDDKRNVENYRGITSLCACSKIFEIIMNDMLFAS FT CKNYITPDQHGFFPKRSTATNLVQFVSFCLCNLDSGSQIDAVYTDLKSAFD FT RVDHGILLARLEKLGVSRDLVHWFKSYLTDRQLTVKIGLDSSEPFSNLSGV FT PQGSNLGPLLFSIFINELSSVLPPGCRLFYADDVKLYVVINCLQDCLHLQQ FT SINHFVDWCNSSLLTVSLQKCHVISFHRKRTPITFDYSIADQPLTRVTEVR FT DLGVTLDTALTFRVHYNDMISRANRQLGFMFKIADEFRDPLCLRSLYCALV FT RSILESNVIVWCPYQSTWISRIEAVQRKFVRYALRSLPWHDPTNLPRYEDR FT CRLLGIETLEQRRNNSQAVFIAKILLGELDSPEILNRINIYAPTRVLRQRG FT FLMLESRNTGYGSHDPIRFMAATFNGVYELFDFNASAERFKRRLREMNR" XX SQ Sequence 4835 BP; 1306 A; 1103 C; 1013 G; 1412 T; 1 other; tcgtgtggct tgtttatgta tgagcacgac ttggtaaatg cgttttactt aagcgttttt 60 tatcagtttt agattgattt tgaattgttt tttttgttga tcggtcaatt gtttcgtgag 120 ttttctttga cgatttattg gagttcatag tgataatagt gtgtttgctg cttaaaagta 180 gtttttaaat aatttattgt ggtgtcaaca aagtcaacta gtattgactg catcggtgtt 240 gctttattac aacgactcgt atgcaacatt tcatcgatcc atcccgtgct taaaagcttc 300 gattgtgcct gagttcaact tgctgcatat ccaaaggcgt tttcattttt ccatcgtaca 360 gtgaagatcc tttgaacatt tctgttgccg taatagtcac tacgttggtt cggaatatgg 420 cctgcaacat ttgccaaaaa gtcatttcag atgatgacgg agttacctgc cgtggtttct 480 gtgggaactc ttatcatatg ctatgtgtta ggttggacct ttcgcatgaa gcattggaag 540 ctcacaggaa aaacctattc tggttttgcg acggatgtgc cgatacgttg tctaatgacg 600 gcattcaaaa atttgctcat cactgtaatc acgatattgc gcatgacaac gttactcttc 660 aatcacttaa ggatgatatt tgtggactga aagaagtcgt taatgcactt acgaccaaat 720 ttgaatccaa atcgctcact ccgtccttgc gtgcacctcg gcgatggatt gctgacacat 780 taccggttcc gaatacgcca aaacgaatac gcgacgatgg accaatgaat cagcctaatg 840 caaagccatg taatatccgt ggaactaaaa ctgcatccga acttgttaag acggttgcac 900 cccccgagga tctgctgtgg atttatcttt cggccttcca tccgagcact tccgaaaatg 960 acatcgtcaa cctcgtgaga gaatgtatgg atatgggcac tgacgcacaa ccgaaagttg 1020 taagacttgt tccgaaggac aaggatccag caacattatc ttatgttacc ttcaaagtgg 1080 gtgtgagcaa ggcgcttaag gaaattgctt tatctatgga cacgtggcct gacaacatct 1140 tcttccggga attcgacaat tctttaaaca gcaatcgccg tgtgttacga attactgcga 1200 gcaacagcca atccggcggg ggaggtcgat aatgtcatcc ctagatccac atgtacctgc 1260 ttccgttttc aacccagaac gcactgcctg tagcatcatg gaagccccta gtcctcccga 1320 cccagtcgcg ctccctggtt gttcccgcct tcatagtcgt tctggccctg tggtcgagga 1380 tggtgtcggg gtcttccaac cgttaatcac aggcaagtac acatttcata cggaacgtcc 1440 wctgcttgaa agccttccga attccagcac atcaatggta caagatccta atgctagtcc 1500 gaaccgaatc gtctatacat ctacgcatcg ctcctcaaca accctgggcc attgtccgcg 1560 ctcctcatct actgcaatta ttccggaatg cactgcccag aatacagaaa cctgcactca 1620 ttctgtttcg acgtcgggat cagcaaatat ttcttcgagc agtcgaacaa tctccgataa 1680 tgcgaaattg accgtgtact accagaacgt acgaggcttg cgcacgaaaa tcgacgatgt 1740 atttgtagcc atctccgcaa ccaattacga tgtaatcgtt ttaactgaaa cgtggctttc 1800 cgacccgatt ctttcgacgc agatcttcgg ggatgcatat actgtatttc gaaacgatcg 1860 taatgcacga aacagtcgta aatctcgtgg tggaggagtt ttaatcgctg ttgcacgtca 1920 cattaatagc gctgctgact caacgccaat tgttgattct ctcgagcaac tttgggtcaa 1980 gctaaaattc acgaattttt caatgagtgt gggggttata tacttgccac cagatcgcaa 2040 atcagacttg gacagtgttg acaatcatgt tgaatctatt ggagccgttc tgtctcattg 2100 tcaaaccaac gatcacgcta ttctgctcgg tgactataat caatcgggcc ttgtttggga 2160 tttacacacc gacgatcctc ctaaagttga cgtattgcgg tcgcacattt ctccagcatg 2220 cgcttcgctt cttgatggct tcaatctcca cgggttgaca caaatcaaca caatatttaa 2280 taggaacaac cgccttttgg atcttgccct agtgaatgag acatcactag gaaattgcac 2340 cttatccgaa atcgccgatc ctctttccac gttggatgca gatcaccccc cgtttgtaat 2400 agaactaggt ctaacgaaac ctgaatttga cgactcgctg gaccctccta gcttggactt 2460 tctcagagca aactattcag gtttagctga agcactagcg tcaattaatt gggcggccgt 2520 tgactttgat acagaagtcg atgaagcggt ccagattttt tcccgcgaat tacgcagcgc 2580 catggctgaa cacattcctt tacgtagacc atcctcaaaa cctccatggg ggaattctca 2640 cttgcgacga ttgaagcagg ttagatcaaa attacaacgc aagtaccgta ggaatcagtg 2700 ccccgttagt aaacagttgt tcaaccgagc tagctatcag tatcggattt acagtcggtt 2760 cctctacaag cgatatactc aacgaacaca acaaaatctg cgaagaaatc ctaagcaatt 2820 ttggaagttt gtcaactcaa aacgcaaaga aaaaggccta cccgttgaga tgcatttgaa 2880 aaccgaagta ggaagctctg caagggaaaa atgtaacctg tttgcgtccc atttcaaaag 2940 tgcgttcaac aatttcgtat cttccgcctc acagattgaa atggcatgca gagatacacc 3000 ggaaaatgct tttcaaatgg aacggttcca aataacgacc gaacatgtgc agactgcaat 3060 cttgaagctt aagctctctt atgtcagccc agacggaatt ccgtcgtgca ttttcaagaa 3120 gtgcagcgat gccttgatcc aaccgttgac atcgatattc aatgcctcat tgcggcaggg 3180 caaatttcct accagttgga agtcgtctga aatgtttcca gtgcacaaaa aggatgacaa 3240 gcgcaatgtt gaaaattacc gcggtataac atcactgtgt gcctgctcca agattttcga 3300 gattatcatg aatgacatgt tgtttgcttc ttgcaaaaac tacatcaccc cagatcaaca 3360 tgggtttttt cctaaaagat caactgccac aaatcttgtc cagtttgttt cattctgctt 3420 atgtaacttg gactctgggt cgcaaataga tgcagtatat acggacttga aatccgcctt 3480 cgatcgtgtg gaccatggga tactactcgc tcggcttgag aaacttggcg tctcacgtga 3540 tctcgttcac tggttcaaat catacctcac ggatcgtcaa ctcacagtta aaattggatt 3600 agactcatcc gaaccgtttt ccaatttgtc cggagtgcca caagggagca acttgggtcc 3660 tctgctgttc tccatattca tcaacgagct ttctagtgtg ctaccacccg gatgccgttt 3720 gttctatgca gatgatgtga agttatacgt ggttatcaac tgtctccaag actgtcttca 3780 tcttcaacag tccattaatc acttcgtcga ctggtgcaac agtagcttgt taacagtgag 3840 tctacaaaaa tgccatgtta tatcgttcca tcgtaagcga acgcccatca cttttgacta 3900 ttccatcgct gatcaacctc taacacgtgt aaccgaggtt cgtgacttag gcgtaacatt 3960 agatactgcc ttaacctttc gggtgcatta taacgatatg atctccagag ctaatcggca 4020 gctcgggttt atgtttaaaa tagctgacga gtttcgcgac cctctgtgtt tacgatctct 4080 gtattgtgca cttgttcggt ctattctgga atcgaatgtt attgtatggt gcccgtatca 4140 atcaacatgg atctctagga tcgaagctgt tcaacgaaag tttgtgagat acgccctacg 4200 tagccttcca tggcatgatc ctacgaatct gccaaggtac gaagaccgct gcagattgct 4260 aggtatcgag acgcttgagc aaagaaggaa taattcacag gctgtattca ttgcaaaaat 4320 cttgttagga gagttggact ctcctgaaat actgaaccgc atcaatatat atgctccaac 4380 gcgggtgcta cgacaacgtg gtttcctaat gttggaatcc agaaacacag gctatggctc 4440 acatgatcct ataagattta tggcagcaac gtttaacgga gtgtacgagt tgttcgactt 4500 caacgcttct gctgaacgtt tcaaacgaag gcttcgggaa atgaaccgtt aacgaaacac 4560 cgaaaaattc atcaaattac aatgttttta ttctttttat agcacaaaac tgattattgt 4620 tagaatttaa gaactatttg tttgttgtac tgtaagttat ttgttcttag ttttatgttg 4680 tatttgtatt ttttctcttt aaaagatgtg gggtttttat gcctatttga gcaagtcaaa 4740 cgatgtggcc agctcaaata ggcttttccc tgccaggctt cataaggact ttacgtccga 4800 tgaaggttag aaacaaataa ataaataaat aaata 4835 // ID Copia-10_CQ-LTR repbase; DNA; INV; 136 BP. XX AC AAWU01001056; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_CQ_; KW Copia-10_CQ-I; Copia-10_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-136 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 336-336 (2011). XX DR GenBank; AAWU01001056; Positions 14036 14171. XX SQ Sequence 136 BP; 39 A; 24 C; 30 G; 43 T; 0 other; tgttggcgaa gcaaccgttg aattaattaa ttagttatag taagctacga ctagggtgaa 60 ttgaataaag ctagtcattc gttgtctatc actcgaacct agtagacgta tctactgcct 120 ttaggttatg ggccca 136 // ID Chapaev-3_HM repbase; DNA; INV; 5498 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5498 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 29-29 (2008). XX DR [1] (Consensus) XX CC Chapaev-3_HM is a young family of autonomous Chapaev DNA CC transposons that were active in the hydra genome less than a few CC million years ago (they are 2% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of 6 copies; it codes for a 975-aa Chapaev CC transposase (ten exons). Chapaev-3_HM is characterized by 4-bp CC target site duplications, 11-bp terminal inverted repeats, and CC 22-bp subterminal inverted repeats (separated by 17- and 7-bp CC regions from the 5' and 3' TIRs, respectively). The Chapaev-3_HM CC TPase forms a distinctive group of Chapaev TPases (including CC Chapaev-4_HM, Chapaev-6_HM, Chapaev-7_HM, Chapaev-8_HM) whose CC 240-300-aa N-terminal portion composed of the Chapa zinc and RING CC fingers is similar to the N-terminal portion of RAG1 (pos. CC ~100-380). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(451..1392,1488..1780,1959..2171,2555..2794, FT 2904..2980,3068..3273,3381..3659,4133..4328, FT 4658..4857) FT /product="Chapaev-3_HMp" FT /note="Transposase." FT /translation="MATATAANHLSCLVKLCRLCGNYVGNDSFDVISIRVR FT VDQAFFTEVGEDRAETHPPKICMKCYTVMRHIEQRGTTPSNFSIKHWPQTC FT SLERCICFVKKSGRKKKKQCGRPPSGNKVIWTRESLNVILDSFSDMSRMCY FT ESINIADHPHKDLCVCMFCKRIIHRPVLLNNCQHLFCATCIFPNLIGKLET FT EAKCPICCSNITLGSISKATAMQNILESIALKCSKNTCNDNCVMQFDTKSI FT SNKEEHEQVYKSENISYSCSASSHSSLTVADIYKIDKTCDIPKELDYAFAH FT FAKLKMAKNNSHSFELPSGGPRAIQFSATPKIYKTSTDISQKTIQRRQKQL FT HQSLIENAGPTKKAKMQQVALMLNSFDKNDKIDILKKANISQVEIGSEDIV FT SLKADCGLPWEKMKKMIRFFKSKNVKLPSLSNQRKVSKNWSGDDLVVEDKE FT FLVESINKRGTFDLQQTPTAYINDLSSHLIKVLDELERHNQLMYDKITPRE FT IHIKIGGDYGGSSFKMCYQVANVVNPNSKNNTTVFNIFEAKDYRANLKIGL FT SRHTDEIRQIQSMKWREKNLRVFIFGDFEFLCAVYGISGATGRHCCLFCEI FT TKNEMQLSMKDRNQTILLRTLESLKSDFERFKNDGGNNKNAKKFNNVIDEP FT LFDIPIEQIAIPALHITLGIFLKFFNLLEKESHLLDIKLAGVLAINDKRID FT LSEYDDYVEKQVEIFQLKYSIEDINNKILLIQDACVIEVFHNPENSENIKY FT ENIIKLCNSVPQTVFNHINLPEHPIQMEAIGICKKFKVLFEKFSKCHKEMN FT SCEIFNNTKVQSFEATVNELLTFFRTNWPNEGGESIHAEFNNIERLYSNML FT GTQKLESVMRDHYTRNNLISKSLKVPIIP" XX SQ Sequence 5498 BP; 2102 A; 718 C; 798 G; 1880 T; 0 other; cacggtcgtt taatatcgat ccctttcaga catttatagt gcgcatgcgt ttacattgat 60 tttttttttt tgagtcgaaa tcctatattt tagaaaccgt ataaataggc acacatttta 120 aaactgttgt actttttcct tcttaatgtt gttcttcctc attcaacgac tttgaagacg 180 tttataggac caaacatgat agcaaaataa aacaaagttt actttcattg gtttttaaca 240 aattttaaat ggctctaaaa atatgtattt tttcaaaatt tcttgttttt taaactactg 300 taactttttt aattcaattt tttgctttga gaactaaata gattctaaaa caaaaatata 360 aatactttta aaaaacacca aatttattta ccagccattt tttaaagtta aagaaatttt 420 gaattaaaaa agtgtaccta actacctaat atggcgactg ctactgctgc taaccactta 480 tcttgtcttg tcaagctttg cagattatgt ggaaattatg ttggcaatga ttcatttgat 540 gttatatcaa taagagttag agttgaccag gcatttttta ctgaggtagg agaagacaga 600 gcagaaacac acccaccaaa aatttgcatg aaatgctata cagtaatgag gcatatcgaa 660 caacgaggaa caaccccatc aaatttcagt ataaaacact ggccacaaac atgttcactc 720 gaaagatgca tctgttttgt aaagaagtcc ggtagaaaga agaagaagca atgtggaaga 780 ccaccatcag gcaataaagt tatttggaca agagagtcac ttaatgtaat attagacagt 840 ttttctgata tgtctcgcat gtgctatgag tctatcaaca tagctgatca cccacacaaa 900 gacttatgtg tctgcatgtt ttgcaaaaga ataatacata gacctgtctt acttaataac 960 tgccaacatt tattttgtgc tacatgtatt tttccaaatt taattggaaa attagaaaca 1020 gaagcaaaat gcccaatatg ttgctcaaac ataactttag gtagtatttc aaaagcaaca 1080 gcaatgcaaa atatccttga gagtatagct ctgaagtgtt ctaaaaatac atgcaatgac 1140 aattgtgtca tgcagtttga cacaaaaagt ataagcaata aggaggaaca tgaacaagta 1200 tacaaaagtg aaaacatatc atacagttgt tcagcatcat cacactcctc attgacagtt 1260 gcagacattt acaaaattga taaaacttgt gacattccta aagaattaga ctatgcattt 1320 gctcactttg caaaactaaa aatggcaaaa aacaattcac attcatttga acttccttct 1380 ggtggtccac gggtaagaaa tatataattt ttttttaaaa ttgcagattc ttaaaagaag 1440 taaatcaaat atttgcatat agtgaaaaaa taaattgtct attacaggct atacagtttt 1500 cagcaactcc aaaaatctac aaaacatcaa ctgatattag tcaaaaaacc attcaaagac 1560 gtcaaaagca actacatcaa tcactaatag agaatgcagg accgacaaaa aaggcaaaaa 1620 tgcaacaagt agcgcttatg ctcaattcct ttgataaaaa tgataaaatt gatatactaa 1680 aaaaagctaa tatttctcaa gttgagatag ggtcagagga cattgtttca ctgaaggcag 1740 actgtggttt accttgggaa aaaatgaaaa aaatgattag gttaataata tactttttat 1800 tacactgctt tttgcataca aagctaattc agttatctca aattataaat atttataaat 1860 atttacatta caaaagtagt tttagcatgt ttttttttta cttttttttt tcaacatata 1920 caaaccaaaa aaagtcaaaa agttatttca tattttagat ttttcaagtc aaaaaatgtc 1980 aaactacctt cactttcaaa tcaacgaaaa gtttcaaaaa attggtcagg tgatgatttg 2040 gttgttgaag ataaagaatt tctggtggaa tcaataaaca aaagagggac ttttgattta 2100 caacaaacac caacagcata tataaatgat ttatcatcac atttaatcaa ggtgcttgat 2160 gaattagaaa ggtaagttca aaataaaact aaaattataa acatgataaa tgcttcttta 2220 ttattcaaat ccagatttgt attctattaa attttattta aaaataattt gtgtttatta 2280 taaataccaa cttattatat taaataccag attcttggat ataccaaaaa atggctaata 2340 ttcatgacat taaattatgt gctcaatttc agattccaat ttatgctaaa ttgatgctaa 2400 aaaatgactg aatgttcata tcagttgtca ttacaacatt gatgttgatt agaagtctaa 2460 tttgttgtgt tatgaagcaa aaatgttgat gcttaagttt gtaaaaatta tttggttata 2520 aataaaaatt gcgttaaaaa taatatcatt ataggcataa ccaactcatg tatgacaaga 2580 taactcctag ggaaatacat ataaagatag gaggtgacta tggaggtagt tcgttcaaaa 2640 tgtgttatca agtagctaat gttgtaaatc ccaattcaaa gaataataca acagttttta 2700 atatatttga ggccaaagac tatcgtgcaa accttaaaat tgggctatca aggcacacag 2760 atgaaataag acagattcag tcaatgaaat ggaggtgaat aaaaaaatat ttctgaaatt 2820 tagtttggga taattacatt aataaatgtt tgtagaaata tctattttag ttatgatttg 2880 attttaaagc attgttgttt tagagaaaag aatttacgtg tatttatatt tggcgatttt 2940 gaatttcttt gtgctgtcta tggcatctca ggagcaactg gttagaccat gatttcagta 3000 ctttttgagt taactattaa aagtgcatag tgtttataag aaatttaaaa ataatatcat 3060 ttaacaggtc gacattgctg tttattttgt gaaatcacaa aaaatgaaat gcaattgagt 3120 atgaaagatc gaaatcaaac aatccttttg agaacacttg aatctttgaa atcagatttt 3180 gaacgcttta aaaatgatgg aggtaacaat aaaaatgcaa agaagtttaa taatgtaatc 3240 gacgagccat tatttgatat accaattgaa caggtttgta gtagttgttg tcaaaacatt 3300 tttgccagta atatatgtac tacacatcaa aaagttaaca aaaagttaag tattatatat 3360 tattataaat ctctttatag atagcgatac cagctctcca catcacatta ggaatatttc 3420 ttaaattttt caacttgttg gagaaagagt ctcatcttct tgatatcaaa ctagcaggag 3480 ttcttgcaat taatgataaa cgtattgatt taagtgaata tgatgattat gttgagaaac 3540 aggtagagat attccagtta aaatattcca ttgaagatat aaataacaaa attcttctta 3600 ttcaagatgc ttgtgttatt gaagtatttc ataaccctga aaattcagaa aacataaagg 3660 taatgtacaa agagagaatt gcactgttag aatcaaagag aacagaaaag gtatatttac 3720 attaaaaagt acaagaaaca ttgacttatt ttaaggacat tcggattatt tatacatcaa 3780 taattgtatt tactttataa ggagggacga attgcagaat gttttaaagt tgagttgaat 3840 caaggagccg gcccaatagt aaaggaaatt gaatctgttt tgcagtcatg tggagtgcaa 3900 cgtcaagctt atcatagtcg ctcctttatt ggaaatcata ttcataaaat gttattggta 3960 acattgttat ttaaatatat ttattctttt aggcaataac attgatctgt ttttctgtat 4020 actatgcagg tttatggagt tttgacaact tttttttagc attcccattc tgatcttata 4080 taatttatat acttaaaatt tattttttag aaaattattt tttatttttt agtatgaaaa 4140 cataataaag ttatgcaact cagttcctca aacagttttt aatcatataa accttccaga 4200 acatccaatt cagatggagg caattggcat atgtaaaaaa ttcaaagttt tgtttgagaa 4260 gttttcaaag tgtcacaaag aaatgaattc gtgcgaaata ttcaacaata caaaagtaca 4320 gagttttggt aaggttagac tatatatata acagagtttt agtaaagtta gactttatat 4380 ataacagagt tttggtaagg tttagactat atatataaga cccctctccc ttatctgccc 4440 ttgtatgtat atatatatat atatatatat atatatatat atatatatat atatatatat 4500 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 4560 atattaacac tcttaagatc ttaactgttg cctcttgctg ataggaacgg agattttttt 4620 accaatacat ttattagtta agtttgtttt attttagaag caacggttaa tgaactattg 4680 acgttttttc gaactaattg gccaaatgaa ggtggcgaaa gtatacacgc agaatttaat 4740 aacatagaaa gattatactc taacatgttg ggcacccaaa aactagaaag tgtcatgaga 4800 gaccattaca cgagaaataa cttaatatca aaatctttaa aagtgccgat catcccttga 4860 aaataaaaat tacaaattaa aattatatag tttacgttat atatatatat atatatatat 4920 atatatatat atatatatat atatatatat atatatatat atatatatat atatatattt 4980 agtgtttgaa taaagttact tagaggtgtt attaaaataa aaaaagtcag ccatattatt 5040 tttcgaataa ttgagaaata gtgaaaaatt attaaacttt ataacctaat atcattagtt 5100 aaaaaaatgg ctggtaaaaa tactatgcta attattaaaa ttcagctttt attctttcca 5160 atgatatata acactacaac taaaaaattg atttttttaa gttttattaa agaataaaag 5220 tggtagtaaa atctgcgata aaatatgaga cgaattgaaa ggtgaattat agctgtaggc 5280 gtcacatgtt tatagttatt ataatttttc ctatttacta aaatgtaaga taagataaaa 5340 acataataat tagtaagtaa atatatattt atatcgtata tatcaagatt taaaaagtca 5400 caaaacgctt cgtttttaag gctagtgaaa aaacagcata attgagatca tgcgtaatac 5460 acatgcccac tataaatgtc cattgcgaaa cgaccgtg 5498 // ID CR1-111_AAe repbase; DNA; INV; 5034 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-111_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5034 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1199-1199 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 275..1306 FT /product="CR1-111_AAe_1p" FT /translation="MESCQACSKSLDSDRVVTCSGSCGMTFHFICVGLSKS FT QYACWTSKVGMFWFCTSCRLNFEPAVYERDKIMLKALRELLIRTDAMDTRL FT GNCGENLRKFTKILHGGNQSKSSNCSNLQTTLLQSIDEMTLDDVVDDPINR FT SRSCDETSFFEVLDEVNSSIAMAPDRFVVRGNKRVQIVESTSRMNASTPAA FT LPHVSKCLPPDTICAPPSRSSPNIHSIGNNTRSPSGRTLKPNNSFLKVANG FT ALQAIDDECYYVTPFTPDQTEDDIKLHVCDITNADPSVVKVTKLVPRGKKL FT EELSFVSFKISVCRSYSGVVSDSWYWPEGITIRPFEPNSKNEIPARLPKSM FT Q" FT CDS 1258..4962 FT /product="CR1-111_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="TKLKKRNPCTASEIYAIEQRQISSEIATADECQSSPS FT PGRTESSTLEALEPPITVLTIQPAIIRRPGPVFGTGERVFQPVLSGKYSRL FT EENIRSEELFHNSPSEIFVSTFDVSANIPGRTIYSTLEAPEPPITVVTTQP FT AIIRRPGPVFGTGEGVFQPVLSGKYLQLRNTICSENVFVRSVDDPHHSTGF FT PIPGCTPASPSEAPYPLVTVEPILPAASSHPGPVYERGEGVFQTPIAGKYT FT NDQNNMLPVISIASSSFASHPTFNELTEPPGASSIVPVNASTTHHHLALYY FT QNVRGLRTKIAHLRLLLSSCDYDVIVFTETWLCAGIENREITSEYTIFRCD FT RNELNSRHSRGGGVLIAVKNNLPCDTIHIPNCENLEQIAVRIKLRERFLYI FT TAIYIPPNSSPDVYSAHANAMQFVTDLASVFDVVMSIGDFNLPNLHWQLDS FT DTNGYIPSNMSTEHERNLVESLFAMGLRQVNRFTNTNRKLLDLVFVNLPEH FT VDMILPPSALLRLDNHHVPYILLLEELEANPAMPAQEDWAFDFHACDYRQL FT NNVLADIDWDSEFDNRPLEEILPLFYDRLNAVINDHVPKRLRRSSSSFNKP FT WWTLELRNLRNSLRKSRKHFFRTRSENDRSTLREMETSYKVLLSSTYCNYI FT SRIQANVKEDPRRFWNFVNHQKTCNGIPSTVNYNGVFAHSNLEAANLFATF FT FESVYNKVAPVQRHNSFAHVDVCDITLTATEFVPDETREALGELDPDKSAG FT TDGIPPLLLKNCAASLAKPITKIFNCTIRERAFPSVWKTARIVPIFKSGTC FT SNVSNYRGISILCSLSKVLEKMMHNVLYNVARPFITDTQHGFMKHRSTTSN FT LMCYTTRLSTELAARRQVDSIYIDFAKAFDTVPHVLIVEKMKYIGFPAWIA FT QWIFSYLTDREAFVVVNNVRSRTFAITSGVPQGSVLGPLLFNIFVNDLGNL FT ISSSKLSFADDLKLFRVILSEADCEALQEDINNVLIWCDNNGMNVNSKKCK FT IISFCRRENLFQHNYTMGATAVDRVHSICDLGVTIDSKLRFTEHIGIITAK FT AFSTLGFIRRHANSFTDIYALKTLFCSMVRSILEYAAPVWCPQYVTHILTI FT ERVQKKFLRFALRDLPWRDPVNLPHYSDRCQLIKLEALSTRRTHQQRMLIF FT DLLTGNIDCPELLEQIPINVPPRRFRYSPFLVIPYNRTNHGSNNPFYVCLR FT SFNDVAELFDFNLSKTVFSNRLREL" XX SQ Sequence 5034 BP; 1393 A; 1243 C; 1052 G; 1346 T; 0 other; cattgtgagc atcgcgtttc acggtcgtag ttctgtgaat tattgtgttt agtgtttcgt 60 gatactggat caattattac tgacatctac aaaacaccag cacgctgtca atttttcgaa 120 gcgccatctg ccagaaaaac aaggaagcta gtagaagttc gagtataaca gttcaacagg 180 gctgcatact ggtaggcgct ttaggcgcat cattatttta ccgttgctgt gcgtagtgtt 240 gtaacattct tcctcgacac ttcacggctt cactatggaa tcatgccaag cttgctcaaa 300 atctttggat tcggatagag tggtcacctg tagtggatca tgtggaatga ctttccactt 360 tatatgcgtt ggcctgtcta aatctcaata cgcctgttgg acgtcgaaag tgggcatgtt 420 ctggttctgc acatcatgcc gactgaattt cgaaccagcc gtatacgagc gagataaaat 480 aatgctgaaa gctctacgtg aactgttgat tcgaacagac gctatggata cacggctcgg 540 aaattgcgga gaaaatctca ggaaattcac caaaattctc catggtggca accaatccaa 600 atcatcaaac tgctcgaatc ttcaaacaac attgcttcag agtattgatg aaatgactct 660 cgacgatgtc gtagatgatc caatcaaccg gtccaggtca tgtgacgaaa cgtctttctt 720 tgaagtgctg gatgaggtga acagctcaat agctatggct cctgataggt ttgtcgtccg 780 aggaaataaa cgcgtgcaaa tcgtggaaag cacgtctcgt atgaacgcat ctactcccgc 840 agctttgcca catgtatcga agtgcttgcc gccggataca atatgtgccc cgcctagtcg 900 aagcagtcct aacatccact ctatcggaaa taacactaga tcgcccagcg gaagaaccct 960 caaacctaac aactcttttc tcaaagttgc aaatggcgcg ctccaagcaa ttgacgacga 1020 atgttattat gtcactcctt ttacgccaga tcaaacggaa gacgacatca agctgcacgt 1080 gtgtgacatc actaatgcgg atccctcagt tgtcaaagtt acaaagctgg ttcctcgtgg 1140 gaaaaaactt gaagaacttt cgtttgtatc cttcaaaata tcggtttgca gatcttactc 1200 aggtgttgtc agcgacagct ggtactggcc agaaggaatt accatacggc cctttgaacc 1260 aaactcaaaa aacgaaatcc ctgcacggct tccgaaatct atgcaataga acaacgtcaa 1320 atttcttccg aaatcgccac agccgatgaa tgtcaatctt ctccatcacc gggacgcaca 1380 gaatcaagca ctttggaagc cctggagccc cccatcacag tcctgaccat ccagccagcg 1440 atcatcagac gtcccggtcc tgtgtttggg acgggcgaaa gggtcttcca gcctgtgcta 1500 tcaggcaagt actctcgtct cgaagaaaat atccgctctg aagaactttt tcacaatagc 1560 ccatcagaaa tcttcgtttc tacattcgac gtatcagcca acatcccggg acgcacaata 1620 tacagcacat tggaagcccc ggagcccccc atcacagtcg tgaccacgca gccagcgatc 1680 atcagacgtc ccggtcctgt gtttggaacg ggtgaagggg tcttccagcc tgtgctatca 1740 ggcaagtacc tacagttgag gaacacaatc tgctctgaaa atgttttcgt tcgtagtgtt 1800 gacgacccgc accattcaac tggatttcca ataccgggat gcacgcctgc tagcccctcg 1860 gaagcccctt atccactcgt cacagtcgag cccatcctgc cagcggccag cagtcatccc 1920 ggtcctgtgt atgagcgcgg agaaggggtc ttccaaactc caattgcagg caagtacacg 1980 aacgatcaga acaacatgct gccagtaatt tcaatcgctt ccagctcctt tgcatcacat 2040 cccaccttca acgagctaac cgaaccacct ggcgcgagca gcattgtccc agttaacgca 2100 tcaacaacgc atcatcattt ggctctgtac taccagaacg ttaggggtct gcgaaccaaa 2160 attgctcacc taaggctgtt actgagtagc tgtgactacg atgtcatagt cttcacggaa 2220 acatggttgt gtgctggtat tgaaaatcgc gagattacgt ctgagtacac gattttccga 2280 tgcgaccgca acgaattaaa tagtcgccat tcacgcggtg gtggtgtgct catcgcagtc 2340 aaaaataatc tgccctgtga caccattcat ataccaaatt gtgagaacct tgagcaaatc 2400 gccgtacgta taaaattgcg tgagaggttt ctctatataa cagcaatcta tattccgcca 2460 aactccagtc ccgacgttta ctctgctcat gcaaatgcca tgcagtttgt tacagatctt 2520 gcttcagtgt tcgatgttgt tatgtcgatt ggggacttca atcttcccaa tttgcattgg 2580 caacttgata gcgatacgaa cggttacatt ccctcgaata tgtccactga acatgagcga 2640 aatttggttg aatcgttgtt cgccatggga ttgcgtcaag tcaacagatt cacaaacacg 2700 aatcgaaaac ttctcgatct cgtctttgta aatcttccag agcatgtaga tatgattctt 2760 cctccttctg cccttttgcg gttggacaat caccatgttc catacatctt actccttgaa 2820 gagttggagg ctaatccagc catgcccgct caagaagatt gggcattcga ttttcatgcc 2880 tgcgactatc ggcaattaaa caacgttcta gcagacatcg actgggactc cgaattcgac 2940 aacagaccat tggaggaaat tttaccgttg ttttacgacc gactcaacgc cgttatcaac 3000 gatcatgtac caaaacgttt aagaaggtca agctcgtctt ttaacaaacc gtggtggact 3060 ctcgaactcc gtaaccttcg caacagcctt agaaaatcaa ggaagcattt tttccgtaca 3120 agatccgaga acgacagaag cacgcttcgt gaaatggaaa catcgtacaa agttctcctt 3180 tcatctacat actgcaatta tatttcgagg attcaggcaa acgttaagga agatcctaga 3240 cgtttttgga atttcgtcaa ccaccaaaaa acttgcaatg gaattccgtc aaccgtcaac 3300 tacaacggtg tttttgctca ttccaactta gaagcagcta atctctttgc tacatttttt 3360 gaaagcgtat acaataaggt agcgccagta caacgtcaca actccttcgc acacgttgac 3420 gtttgcgaca tcaccttaac agctaccgaa ttcgtacctg atgaaacacg cgaggcacta 3480 ggtgagcttg accccgacaa gagtgcgggc acggatggca ttccgcctct tcttctgaag 3540 aactgcgctg catctctagc caaaccgatt actaaaatct tcaactgtac cattcgtgag 3600 agagcctttc cgtctgtgtg gaaaacggct cgcatcgttc cgattttcaa atcaggaaca 3660 tgtagcaatg taagcaacta cagaggcatt tccatacttt gtagtctcag caaagtttta 3720 gaaaaaatga tgcataacgt tttgtacaat gtagcacgac ctttcataac tgacactcag 3780 cacggtttca tgaagcaccg gtctacaacc tcgaatctga tgtgctacac taccagattg 3840 tctacggaac tggcagcaag acggcaggta gactccatct atatcgattt tgctaaggct 3900 ttcgacaccg ttccgcacgt tctaatagtc gaaaaaatga aatacatcgg atttccagcc 3960 tggattgcac aatggatttt ttcctatctc acggatcgtg aagcttttgt ggttgtcaac 4020 aacgtacgct cgcggacctt cgctataaca tctggtgttc ctcaaggcag tgtgttagga 4080 ccactactat tcaacatctt cgtgaacgac ttgggcaact tgatttcgtc atccaaactc 4140 tcttttgcgg atgatcttaa attatttcgt gtcattctct ccgaagccga ctgcgaagcc 4200 ttacaggagg acatcaacaa cgtgctaata tggtgcgata acaacggcat gaacgtgaac 4260 agtaagaaat gcaaaatcat ttcattctgt agacgcgaga atttatttca gcataactac 4320 acaatgggtg caacagctgt agatcgagtg cattcaattt gcgatttggg cgtgacaata 4380 gattccaagc tacgtttcac tgagcatata ggcattataa cggcaaaagc cttctcaaca 4440 ctcgggttca ttcggcgcca tgccaacagc tttaccgaca tttatgcttt gaaaaccctg 4500 ttttgctcaa tggtgcgaag cattctcgaa tacgctgctc cagtctggtg tccacaatat 4560 gttacgcaca tacttacgat cgagcgagta cagaaaaagt tcttgcgttt tgcactgcgg 4620 gatcttcctt ggagggatcc agtaaatcta cctcactatt cagaccggtg ccaactaatc 4680 aagctggaag cgctatccac aaggcgcacg catcaacaaa ggatgctgat tttcgacctg 4740 ctgaccggaa atatcgactg tcccgagcta ctggaacaga ttcctataaa cgttcctcct 4800 cggcgatttc gttactcacc gtttttggtc attccgtaca acagaaccaa ccacggtagt 4860 aacaatccat tttatgtgtg tcttcgttca ttcaatgatg ttgcagagtt atttgatttt 4920 aatttgtcta aaactgtttt tagtaataga ttaagagagc tttagattaa tgttagtata 4980 taatttttca gtctgtgcgt tgtaaacgaa gacggtgtat gtaaataaaa taaa 5034 // ID Gypsy-235_AA-LTR repbase; DNA; INV; 201 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-235_AA_; KW Gypsy-235_AA-I; Gypsy-235_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-201 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1072-1072 (2011). XX DR [1] (Consensus) XX SQ Sequence 201 BP; 57 A; 48 C; 37 G; 59 T; 0 other; tgaaggagac gaattttgaa atggtctaca ccctcaattt cggctgaact cttacccctc 60 gatttcaaca aaccgatctt atcccttcag tttaggttca gtgatttggg agagttttgt 120 agagtatata cgttcacgtt cccgccagca cacttcctgt tcagctagcg taggaaaaac 180 atcccgaaat ataatactac a 201 // ID BEL-64_CQ-LTR repbase; DNA; INV; 366 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-64_CQ_; KW BEL-64_CQ-I; BEL-64_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-366 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 282-282 (2011). XX DR [2] (Consensus) XX SQ Sequence 366 BP; 96 A; 86 C; 75 G; 109 T; 0 other; tgttgggaca cactagtgcc caaacacgaa gccgtataga taagagagaa tttgaattcg 60 gttacattag tattaataca ctatcacctt atcacaacgc tcggccagcg agcccgatca 120 cgagagagtt tggctaccta ctttgaaggt caagttcgtg gtgatcgcca ctcgtaagtt 180 aagttgaaat atacagttcg atctgacctc acgcgttttt cgttcctttg gcaataaatt 240 tctcgtcgaa ataaattcgt ctttttcgat tgccaattac gacgcgtttt ttgatcctcg 300 aatattctgt cgcgggagtg tttgaaggcc accgatatca cgccctttga cgaccattcc 360 ctaaca 366 // ID Copia-39_AA-LTR repbase; DNA; INV; 136 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-39_AA_; KW Copia-39_AA-I; Copia-39_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-136 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 963-963 (2011). XX DR [2] (Consensus) XX SQ Sequence 136 BP; 38 A; 22 C; 27 G; 49 T; 0 other; tgtagtggaa acggaaatgg tcagagtagg ctgctaatgt ttaatgtcag taatttgatt 60 aggaaatata cctcattcat gtttgttcgt tcattctaac cagacgtgtt ttaatcactg 120 ctagtcactc tttaca 136 // ID Gypsy-20_SI-I repbase; DNA; INV; 4333 BP. XX AC AEAQ01023379; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_SI_; KW Gypsy-20_SI-LTR; Gypsy-20_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4333 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023379; Positions 4962 630. XX CC Positions [2267-2809] - Reverse transcriptase CC Positions [3823-4299] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 284..3808 FT /product="Gypsy-20_SI-I_1p" FT /translation="MAELEPPVSSQDKRAAGSADKHMKSTKKGGRKLTDDR FT QKVRKYNDQDIDEMRIVELKDALRRLNVSTTGNKSELQMRLRQESARERRK FT EKAGTSDKDAMTDDDEISDNEETMSESEKKVAGKEKRRRSASKNRQVKREH FT CRRGETSDTDTSDTDISDTDASDTDTSVERTSNGRNQRRSLNDRRRYRRNA FT FTIKDVEGSLTYFTGDDKLPIEKWITEFEDTSDLLQWDELQQLIYGKRMLK FT GSAKRFISFEKGITSWKILKQKLKREFKTELNSALIHSQLFKRRRQPGESS FT RQYIYAMQEIADQGYVEEEALIQYIVDGLPDEDNSKTFLYEACTIRELKKK FT LEVYDRLKEKTQRKKTPLKKDKDNLKSSMKETKKELSKSGSRQTDKKHCFN FT CGSAEHDVKSCTNADKGPKCFKCNCFGHIASKCDVKQTPPSETTPATVSCV FT NSIDSKFILVDIAGAKYQALLDTGSDVSLIRDDLYQRIGKPKLKKTMRVFT FT GLGNTTTKPSGAFNLKLSVEGEDYEIEIFVVPTSSMTSEIILGRNFLQDVE FT VVIKRGHIGVKRPTAGELFAEEMREAVHHEKEDAEEKGFKELTVLPCTVAD FT EIEVPEPYRSRIETMIAEYRPRKDVETSIETKIVLKDEIPVNHQPRRLAPK FT EKKVLDKQISEWLQSGIIKPSQSEYASPVVIVSKKDGSSRICVDYRSLNKK FT IIRDRYPMPLIDDKIDALAPFRVFSIIDLQNGFLHVPVNKNSQKYTAFVTP FT IGQYEFTRTPFGLCNSPTNFLRFVNEIFRDLIQRSIVFTYMDDLIIPGVDE FT TEALSKLTETLTVAANNGLNINWRICKFLQRRVEFLGHIIENGCVKPSPAK FT IKAVQGFPRPTTFKQLQSFLGLTGYFRKFIKDYAKLAHPLYELKEGQHFHF FT GQSQQLAFSRLKDALSRDPVLRIYNPKAITELHTDASKQGYGATLLQRKPD FT EKHFHPVYYLSYKTTEAEKKWCSYELEILAIIKAVKKLRVYLLGIQFIIVT FT DCQAFQRTLTKENLPPKVARWALMLEEFEYKIEHRASERMKHVDALSRFPV FT MLLEDTVLLLIKGEQDKEERLRVIKQLLTKEPYEDYILENGLLMKKMDAKN FT IVVLPSSMHHDVIRRTHENGHFGVKKMIESIREEYYIPKLKEKLEQYVECC FT VPCILAEKREEKKKAC" XX SQ Sequence 4333 BP; 1545 A; 787 C; 971 G; 1030 T; 0 other; aatttggggg ctcagccggg attcacgaca aggacgagaa tcagacggtt ttacatcaag 60 agccagaacg acgtctatgc gaaagacgat cggcagtgcg cataagagaa tcgtcgtaca 120 gaaagagatc agctttgaac cgtaagtcag tcgctaagtt gaatcagttg taagcttcgg 180 tttagtaagc cgagctggta agtaagccgg cgatacggcg cgcgctaccg aacctaacct 240 cagttgtaaa gttgtaaagt tgagagacta agaaacagtc gcaatggctg aactcgagcc 300 accagtcagt agtcaagata agcgggcggc ggggagcgcc gacaagcata tgaagtcgac 360 gaaaaaaggc ggacgaaagt tgacagatga taggcaaaag gttaggaagt acaacgacca 420 ggacatcgac gaaatgagaa tagtcgagct caaggatgct ttaaggcggt taaacgtttc 480 taccacaggt aataagtctg agttacagat gagattgcgg caagaatcag cgcgggaacg 540 ccgtaaggaa aaggcgggaa cgtctgacaa ggacgccatg acagacgacg atgagatatc 600 ggacaatgaa gagacgatga gcgagagtga gaaaaaggta gcgggtaaag aaaagagaag 660 aagaagcgca tcaaagaatc gacaagtaaa gagagagcat tgcaggaggg gcgaaacgtc 720 cgacaccgac acttccgata cagatatttc tgacacggat gcttccgaca cagacacttc 780 agtcgaacga acgagtaatg gacgaaatca gcgtagatcg cttaatgata gaagaaggta 840 cagacgtaat gcatttacta ttaaagatgt ggaaggcagc ctaacctatt tcacgggcga 900 cgataaatta cctattgaaa aatggattac agaatttgaa gacacaagcg atttgcttca 960 atgggatgag ttgcaacaat taatatacgg caaacgtatg ttgaaaggat cagcgaaacg 1020 ttttatatca tttgaaaaag gcatcacgtc atggaaaatt cttaaacaaa aattaaaaag 1080 ggaatttaaa accgaactaa atagcgcatt gatacacagc caattattca agcgtcgacg 1140 ccagcccggt gagagtagtc gacaatacat ctacgccatg caggaaatag ctgatcaggg 1200 atacgttgaa gaagaggcat taatacagta tatagttgat ggactacccg acgaagacaa 1260 cagtaagacg ttcctatacg aagcatgtac tatacgagag ttgaaaaaga agctagaagt 1320 ttatgatcgt cttaaagaaa aaactcagcg gaagaagacg cccttgaaaa aggacaagga 1380 caatctcaag agcagcatga aggaaactaa aaaagaacta tctaaatcag gcagcagaca 1440 aacagataag aagcattgct ttaattgtgg atcggccgaa catgatgtca agagctgtac 1500 gaatgcagac aagggaccga aatgctttaa atgcaattgc ttcggacaca tagcttcaaa 1560 gtgtgacgtc aagcagactc caccatccga aacaacacca gctaccgtaa gttgtgtaaa 1620 tagcatagac agtaaattta ttctagttga cattgcaggt gcaaaatatc aggcattgct 1680 agatacaggg agcgatgtca gcttaatacg agacgacctt tatcaacgca ttggcaaacc 1740 taagctgaag aaaacaatgc gagtgtttac tggactggga aacacaacta ctaaaccaag 1800 tggcgccttt aacttgaaat tgtctgtaga aggggaagac tacgaaatcg agatatttgt 1860 agttcccacg agttctatga cgtccgaaat aatattagga cgtaactttc tgcaagatgt 1920 tgaggtggta ataaagagag gtcacatcgg agttaaacgt ccaacagctg gagaactttt 1980 tgccgaggag atgagagaag cagtacatca cgagaaagaa gatgctgagg agaaaggatt 2040 caaggaattg accgtgttac cctgcacagt tgccgatgaa attgaagtgc ctgaaccgta 2100 tcgaagccga attgagacaa tgatagcaga gtaccgaccg aggaaggacg tcgaaacatc 2160 aatcgaaacc aagattgtat taaaagacga gatacccgtc aaccatcagc ctcgtaggct 2220 tgcgccaaaa gagaaaaagg tactagataa gcaaataagc gaatggctac agtcaggtat 2280 tattaaacct agtcaaagcg aatatgccag tccagttgtt atcgttagta aaaaagatgg 2340 ctcaagtcga atatgcgtag attaccgttc tcttaataaa aaaataattc gcgatagata 2400 tcctatgcct ttaattgacg ataaaataga tgcacttgca ccttttcgag tcttttcaat 2460 aatagactta cagaacggat ttcttcacgt accagttaat aaaaatagtc agaaatatac 2520 ggcttttgtg acaccaattg gccagtatga gttcactaga acaccatttg gactttgtaa 2580 tagccctacc aatttcttaa gatttgtaaa cgaaatattc cgagatttaa ttcaaagatc 2640 gattgtattt acgtatatgg atgacttgat cataccaggt gtcgatgaaa cagaagctct 2700 ttccaaacta acagagactt taacagtagc agctaataat ggtcttaata ttaattggcg 2760 gatatgcaaa tttttgcaac ggcgagttga gttcttaggc catataatcg agaatggatg 2820 cgtgaaacca tccccagcaa aaattaaagc agttcaaggt ttcccacggc ctacaacgtt 2880 caagcagtta cagagtttct tagggctcac agggtatttt agaaaattta ttaaagatta 2940 tgcaaaacta gcccatcctt tatatgaatt aaaagaagga caacatttcc atttcggaca 3000 gtcgcaacag ttagcctttt cgcggttaaa agatgcttta tcaagagatc cggtacttcg 3060 aatttacaat ccgaaagcta ttacagaatt acatacagat gccagcaaac agggatacgg 3120 ggcaacgctc cttcagagaa agcccgacga gaagcatttc catcccgttt attacctaag 3180 ttataaaacg actgaagccg aaaagaaatg gtgttcttat gaattagaaa tactagctat 3240 aattaaagct gttaaaaaac tgagagtata cctgctaggc attcaattta tcattgtaac 3300 cgattgtcaa gcttttcagc gtactttaac taaagagaac ttacctccca aagttgcccg 3360 ttgggcatta atgttagagg aattcgaata taaaatcgaa catcgtgcaa gcgaacgaat 3420 gaagcatgta gacgcgttga gcagatttcc ggtaatgttg ctagaggata cagttctact 3480 actaatcaaa ggcgaacaag acaaagaaga acgtttacgc gtaatcaagc aattgttaac 3540 taaagaaccg tatgaggatt acattctaga aaatggattg cttatgaaaa agatggatgc 3600 caaaaatatt gtagtattgc cgtcgagcat gcatcacgat gttatacgaa gaacgcacga 3660 gaatggacat tttggtgtaa agaaaatgat agagtcgata cgcgaagagt actacattcc 3720 aaaattaaaa gaaaaattag aacaatacgt agaatgttgt gtgccttgca ttttagcaga 3780 aaaaagagag gaaaaaaaga aggcatgtta aagccaattc caaaaggtga cgcacctttt 3840 agcacttatc atatggacca tttaggaccc atgacaagta cggctaaatt atacaaacat 3900 ttgcttgtaa taatcgatgg cttttcaaaa ttcgtgtgga tttatccgac taagactact 3960 aatactaaag aagtcctaga taagttgaca acaatgcagc aaatttttgg caatcctcaa 4020 cgtattgtta ccgatagagg cactgcgttt acgtcgtctc aattccgcga ttattgttct 4080 acagagaata tagagcatgt aacgataact accggagtac ctcgcggtaa tggacaggta 4140 gaacgcataa acagaattat tattccagtt cttaccaaat tagcattaga ccatcctgat 4200 cgttggtatc gtcaagtacc gaaattgcaa atgtgtatta atagttctta tcaaagaagc 4260 gtaggaatga gtccttttga agttttattt ggaatcaaaa tgaaacaaca ggaagatatt 4320 cgtttattag ata 4333 // ID Gypsy-597_AA-I repbase; DNA; INV; 6062 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-597_AA_; KW Gypsy-597_AA-LTR; Ty3_gypsy_Ele58; Gypsy-597_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6062 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4935-5414] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 928..2118 FT /product="Gypsy-597_AA-I_1p" FT /translation="MSEYEKLKHLYNYIKKSINTPLTENTLREKSDSCKEI FT YENLQIYFSENSVDEILISNAKKWYLEINQVLKAKLDNLNTKNQLDETTIS FT MVQPEVERQFDVKQATALVQPYDGSAAGLESFIDSVNFLKELIQPNHTAMA FT VKFLKTRLTGKARCGFPDTIVTIDALIEHIKIKCQDTLTPDLIVAKLNATK FT QRGQILNFCEEIENLCSKLENSYISQKVPENVAKAMTTKAGVNALINGINA FT SETKLILKAGNFSSIREALQKAQESATDNTSSQVFNLRNKQFNPNERKMNF FT RGNGFNRNRNFGNCYRGGFSQRPQNNYYHNSSRGRYQRGGYHNGYNNRGRF FT NDNRPRYDNPNRVYVANVDQPYGPPQNQNRDAPNRAAPQQQQQQQSQRNFL FT GQC" FT CDS 2166..5759 FT /product="Gypsy-597_AA-I_2p" FT /translation="MANSVCTLLVDSGADISIFKTDKILTNHNINYKVKTR FT ITGITEGEIETMAITETVLNFGNGIRVNHSFHLVDANFPILTDGILGRDFL FT SLFRCNIDYESWLLNFNIDNIAVSVPIQDTVNENYIIPQRCEVIRKIKDLK FT ITEDMVLFSKEIKPGLFCGNAIISPTCQYVKFMNTTSRPINVSNVDLEFML FT EPLTRYQILNTRKNNKPDDTQKRNNEILKEVDLTNVPDYAKNDLKQLITKY FT TDIFSLPDEPVTTNNFYEQTINLNDPVPTYIPNYKTIHSQSNEIQSQVNKM FT LKDNIIEPSVSSYNSPILLVPKKSTDDKKKWRLVIDFRQLNKKILSDKFPL FT PRIDSILDQLGRAKYFSTLDLMSGFHQIPLDENSKKYTAFSTQSGHYQFTR FT LPFGLNISPNSFQRMMTIAMSGLTPELAFVYIDDIIVVGCSMQHHLHNLSV FT VFERIRKYNLKLNLSKCKFFNTEVTYLGHRITDRGILPDSSKYDTVKNYPV FT PKNSDEVRRFVAFCNYYRKFVPNFATIAYPLNQLLRKNIKFEWSSKCEDAF FT NALKSFLMSPKLLQYPDFKQTFILTTDASDVGCGAVLSQNIDGNDLPIAFA FT SKSFTPGEKNKSVILKELTAIHWAIDYFKAYLYGRKFVIRTDHRPLVFLFN FT MKNPTSKLTRMRLDLEDFDFTIEYIQGKTNVTADALSRIVTTSDELKSKSI FT FTVNTRAMTKRKLKESKRSQELPDENKSQNNAVNNKPDQLVIYTTENPTET FT RKLPKLRSAMENENLIFTLHEASQKNKIIIRITTQLRSIRQTLVFALSVLE FT KEMQMKRITKIALSSHDCIFEWVPKETFKELVLKSIKNLQIIIYNPPKFIS FT DLDEIKRILENYHQTPTGGHIGQHRLYLKLREIYNWTNMKSSIAQFIKACE FT LCKLNKITKHTKEKLTITNTPSKPFEVVSIDTIGPLPKSNNNNRYAVTIQC FT ELTKYVVLVPIPTKEANVIAKSLVENFILIYGKFLELKSDQGTEYKNEVLD FT QICKLLQVKQTFSTAYHPQTIGSLERNHRCLNEYLRNFVNEHQSDWDDWLQ FT FYAFTYNTSPHTDHGLTPYELVFGVKARVPHETYTDRIDPVYNFDLYSKEL FT RYKLQKSHEIVKQKLLEQKQIRKEINDRSINPLQIKIGDIVYLQKENRRKL FT DSFYSGPYKVKSITEPNCEIINLQTNQLSLVHKNRLIKS" XX SQ Sequence 6062 BP; 2307 A; 955 C; 1034 G; 1766 T; 0 other; gatggcgacc ggactggttg acactttgtg aaatattcac agtcaacaaa gttttagtgg 60 aaaatcagac aaagttttag tggaaaaact gacaaagtct tagtggaaaa tcagacaaag 120 ttttagtgga aaaacagtga tattataagt gataatgggc tttttcagta acgatgagat 180 tgttgccaat gcatccacgc ccgagaacac agtaattgca tctacactcg tagtgattgt 240 ggcagtttta attcttttcg tcataatgag gatgtacaca aagtacatgc aaggaaaagt 300 gaaagaagaa gcgaggagag aagtacaatt aagtaacctc cgcacagttt gaaaaggtta 360 aaaggtgatt acgaacaaaa cattcttggt tccagtacag tatgtgtgca aatgaaaatc 420 taataccgcc gtacagtgca caattatgtg agcattttga gagttcacgg aaaaagttat 480 ttcagaaaag tggaaaagaa tgactttaca tggttcaagg tgaaattgcg agtcgtgtaa 540 atttgagatt ttttttgcgg tggagcagtg gtgcaacaat tcaacgtaca gggatggtgc 600 cagtgtccag tgattctagg cagcttaata agaagatgat gcctgcttgc tgccgcggag 660 attcgtggtt gcggaatgcg gaccagttcg tacgtaagat gattccggct taccgtcgct 720 gaggtttgta gtcgcggacc agtcgactag atgaggtcag tgtatctatt tctctacgtg 780 caattacatc actattacat caccatcata tactgattta ggaaaaatta ttatgtaaca 840 catagcaagc atcacaaatt tattgttatt ttagtaattt catttatagt taataatttc 900 caaagattta ataatcttga atttcaaatg agtgaatatg aaaaattaaa acatttatat 960 aattatataa aaaaatcaat taacactcct ctcacagaga acaccctgag agagaaatca 1020 gattcttgta aggaaattta tgaaaatttg caaatttatt ttagtgaaaa ctctgtggat 1080 gaaattttaa taagtaatgc aaaaaagtgg tatttagaaa taaatcaagt tttaaaagcc 1140 aaattagata atttaaacac caaaaaccag ttggacgaaa ctacgatcag tatggtgcaa 1200 ccggaagtgg aacgtcagtt tgacgttaaa caagcaacag cgctagtgca gccctatgat 1260 ggtagtgcgg ctggtttgga atcgtttatc gattctgtaa attttttaaa agaattgatt 1320 caacctaacc atacggcaat ggccgtaaaa ttccttaaaa cacgtttaac aggaaaagct 1380 cgctgtggct ttccagatac gatagtaaca attgatgcat taattgagca cattaaaata 1440 aagtgtcaag atactttaac accagatctg attgttgcaa aattaaatgc aacaaaacag 1500 agagggcaaa tattaaattt ctgtgaagag attgaaaatc tctgtagcaa actcgaaaat 1560 agttacattt ctcaaaaggt cccggaaaat gtagcaaagg caatgaccac aaaagcaggc 1620 gtaaatgctc tcattaatgg cattaacgct agcgaaacta aactaatcct aaaagctggc 1680 aatttctctt cgattagaga agccttgcaa aaagcacaag agagtgctac tgataataca 1740 tcatcgcaag ttttcaattt gcggaataaa caatttaacc caaatgaaag gaaaatgaat 1800 ttccgaggta atggtttcaa cagaaaccga aattttggaa attgctatag aggaggtttt 1860 tctcaaagac cccaaaacaa ttattatcat aactcttcgc gtggaagata ccaacgtggt 1920 ggttaccata acggatacaa taatcgtgga agatttaatg ataatcgtcc acgatatgac 1980 aaccctaatc gagtatatgt ggcaaatgtg gatcaaccgt acggtccacc acaaaaccag 2040 aaccgcgatg ctccaaacag agctgctccc caacaacaac aacaacaaca aagtcaaaga 2100 aattttttag gtcagtgtta accataaacc tttctgcttc taattttatc gagattggag 2160 tagaaatggc aaattcagtt tgtactttac ttgtagattc cggtgcggac atttctatat 2220 ttaaaactga caaaatttta acaaatcaca atataaacta caaagttaaa acaagaatta 2280 caggcattac agaaggagaa attgaaacaa tggcaataac agagacagtg ttgaattttg 2340 gaaatgggat aagagttaat cattcatttc atttagttga tgcgaatttt ccgattttaa 2400 cagatggtat tttaggaaga gattttttgt ctctttttag atgcaatatt gattacgaat 2460 cttggctttt gaattttaat attgataata tagctgtatc tgttccaatc caagatacag 2520 taaatgagaa ttacataatt ccgcagagat gtgaagtgat tcggaagatc aaagatttaa 2580 aaatcacgga agacatggtt ttattttcta aggaaattaa gcctggtctt ttctgtggta 2640 atgcaataat ttcaccaacc tgtcagtatg taaaatttat gaatactaca tcacgaccca 2700 tcaatgtttc aaatgttgac ctagaattta tgttagaacc attaacccgt tatcaaatat 2760 taaatactag aaaaaataat aaacccgatg acacacagaa aagaaataat gaaattttga 2820 aggaagttga cttaacaaat gttccagatt atgcaaaaaa tgatttaaaa caattaatta 2880 caaaatatac agatattttt tcactacccg atgaaccagt aactacaaac aatttttacg 2940 aacagaccat taatttgaat gatccagttc caacttatat accaaattat aaaactattc 3000 attctcaaag taatgaaata caatcccaag ttaataaaat gttgaaggat aatattattg 3060 aaccctcagt atcctcatat aatagtccaa tactcttggt tccgaagaaa tcaacagatg 3120 ataagaaaaa atggcgtctt gtcatagatt ttagacaact caacaaaaag attttaagtg 3180 acaaatttcc cttaccaaga attgattcca tattagatca gcttggtaga gcgaagtatt 3240 tttcaacttt agatttaatg tcgggatttc accaaatccc attggatgaa aattcgaaaa 3300 aatatacagc tttttcgacc caatcaggtc attaccagtt tacaagattg ccgtttggac 3360 taaacattag cccaaacagc tttcaaagga tgatgacaat tgcaatgtct gggttgacgc 3420 cagaactagc atttgtttac atcgatgata taattgttgt aggttgttca atgcaacacc 3480 atttacacaa cttatcagtt gtttttgaaa gaattcgaaa atataattta aaattaaatt 3540 tgtcaaaatg caaatttttt aatacagaag taacgtactt agggcacaga ataaccgata 3600 gaggtatttt gcctgacagt agtaaatatg ataccgtaaa gaactatcca gttcccaaga 3660 acagtgacga agtcagaagg tttgtagctt tttgtaacta ctaccggaaa ttcgtcccaa 3720 attttgcaac aattgcatac ccgttgaatc aattgttgag aaaaaacata aaatttgaat 3780 ggtcatcaaa atgtgaagat gcttttaatg ctctaaaatc atttttgatg tcaccaaaac 3840 tattacaata tccagatttt aagcaaacat ttatattgac taccgacgca tcagatgtag 3900 gatgtggtgc ggtattatca cagaatattg atggcaacga tttacccata gcgtttgcca 3960 gtaaaagttt tacgcctgga gaaaagaata aatcagtaat tttgaaggaa ttgacagcaa 4020 tacattgggc tatcgattac ttcaaagctt atttgtatgg acgtaaattt gtaattcgta 4080 cagatcatcg tccattggta tttttattta atatgaaaaa cccaacatcc aaactgaccc 4140 ggatgagatt agatttggaa gatttcgatt ttactattga atatattcaa ggtaaaacaa 4200 atgttactgc cgacgcgtta tcacgtatag taacgacgtc agatgaatta aaatctaaaa 4260 gtatatttac tgttaataca cgagccatga caaaaagaaa actgaaagag tctaaacgta 4320 gtcaggaact acctgatgaa aacaaaagtc aaaacaatgc cgttaataat aagcctgatc 4380 aacttgttat ttatacgaca gagaacccga ctgaaactag aaaactacca aaacttcgaa 4440 gtgcaatgga aaatgagaat ttaattttta cattacatga agcatcacaa aagaataaaa 4500 taataatacg aattacaaca caactgcgaa gtataagaca aacattagtg tttgcacttt 4560 cagtgttaga aaaagaaatg caaatgaaga gaattacaaa aatcgcgcta tcgagtcatg 4620 actgtatttt tgaatgggta cccaaagaaa catttaaaga actagttttg aaatctataa 4680 aaaatttgca gataataata tataatccac ccaagttcat tagtgatctt gatgagatta 4740 aaagaatact tgaaaattac catcaaacac cgactggagg acatataggt cagcacagac 4800 tatacctcaa gttgagagaa atttataatt ggacaaatat gaaaagttca atagcacaat 4860 ttataaaggc ctgtgaattg tgtaaactga acaagattac caaacataca aaggaaaaat 4920 taacaattac aaatacaccg tctaaaccgt ttgaagtggt atctattgat actataggac 4980 cacttcccaa atcaaacaat aataatagat atgctgtaac gatacagtgt gaattaacta 5040 aatatgtagt tttagtccct attccaacaa aagaagcaaa cgtaattgct aaatctttag 5100 ttgaaaattt tattttaata tatggaaaat tcttagagtt aaaatcggat caaggaactg 5160 aatacaaaaa tgaagtcctt gatcaaattt gtaaactatt acaggttaag caaacttttt 5220 ctacagccta tcatcctcaa acgattggtt cgttggaaag aaatcacagg tgtttaaatg 5280 aatacctgcg aaattttgtc aacgaacatc agtcagattg ggatgattgg ttacaatttt 5340 atgcatttac atataatact agcccacaca ctgatcatgg tttaacacca tatgaattgg 5400 tttttggagt taaagcaaga gtaccacacg aaacatacac agatagaatt gatcctgttt 5460 ataattttga tctctatagt aaagaattaa gatacaaact tcaaaagtca catgaaatag 5520 ttaaacaaaa attacttgaa caaaaacaaa tacgaaagga aataaatgat cgttcgataa 5580 acccattaca aatcaagata ggagatatag tttatttgca aaaagaaaac agacgtaaat 5640 tagattcatt ttattcaggt ccatataaag taaaatcaat aacagaacca aattgtgaga 5700 tcatcaattt acaaacaaat caattatctt tagttcataa aaataggtta attaaatcct 5760 aggaagtgtt agttttaaaa ccgagcctaa tctcataaaa aaaaataatg aaaatcaaaa 5820 aacagtaaaa aaatgagata ggtcaaaatt cttagcgcaa gctgagaagt gagaaattcg 5880 gatatttaat taagcatttt ttcttcttgt ttttgaatac atttaccaat attctatcaa 5940 aataatataa tctaacattg taaaggagat tgaaacaaca tttttttttt catttatatc 6000 acatatgttg acaagaattg aatgtcttca ctacgttacg tcattcaccc aaagggggat 6060 gg 6062 // ID Gypsy-91_AA-LTR repbase; DNA; INV; 1142 BP. XX AC supercont1.249; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-91_AA_; KW Gypsy-91_AA-I; Gypsy-91_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1142 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.249; Positions 128551 129692. XX SQ Sequence 1142 BP; 295 A; 276 C; 251 G; 320 T; 0 other; tgtaacctta cacatgaccg tcactgttgt gtcgcgtgag tacctatgga actcaaggtc 60 taccacaatg aatgagagca aaaattatct taccatgtac aaaacctcat cacactcaca 120 acagaacaac acaacacatc ttccgatcag cgataatcga ccatagttga tcatgatctc 180 cgtctcctct ggacttcaac accactgact gatctgtgta gaggtaaggg agagcacagc 240 atccgagccc gattcttgtt tggctatcac cacacagcca acagataacc gaactgttgt 300 ctagctacgc tcatatatgg tcttgaaccg cagtgtctcg atcagtagcc agttaatccc 360 aaagcaggca agtcctaggc ataattacta ttgcgcctga ctggcatttt tgtgatttgt 420 agtgagtatt aagtgcatct gtggtgtggt gagaaaattt agaagtgcgg gctggaaagg 480 taaaattcgt cgatgaagtt ctgcaggtga tatacaagaa gtgcttgttt tctacagccg 540 cctaaatgcc cagaacaccc aagttaactg catgtcgatg catgccatga tggagtagtt 600 cgtatacgtt gttttgagat cggtaagcgt ggaagaaatt agttctacgg catatcccaa 660 attgtgcttc taagccgaaa cttatgtgct cccttcctca actacctcta ttcaggcctt 720 tcttttcaac ccctaaatcg atggcggtgg agaccatcag catggctcag tgcttgggca 780 tcccagacgg ccatagtggg ctttgtcgcg gagtggttgc tgtcctaccg gcaatacggt 840 ttgtccggac aagctcctga ttctgcggtc gccgagggat acacagaaca ctgacgccca 900 tttcacctag gtttaagtac acaccctaac cgtaagtaat aaaaccatca ctaattttac 960 tatgttaaat ttatgctttg attttgccat ggaatgactc gtaatgattt cagcgggttt 1020 ccctgtattc gaacccggta gtaaggttga ttggttcatg ggtttaggaa ccaatgtctc 1080 cgcttctcct ttctaccatc tttgtctccg tctcatgcga cctttcccct ggggaggcga 1140 ca 1142 // ID Crack-10_BF repbase; DNA; INV; 3991 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-10_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-10_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3991 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3991 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 815-815 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 360..3764 FT /product="Crack-10_BF_2p" FT /translation="MNINLRLYRARIGLFGPIPQLCLYKDNNVRWRFYQVL FT LITICLILTSNQNGILTAANQENELKTNQLMHHWQNASQQVTSLYHKNLLT FT EQCNDQVNLICLAFDIHPNPGPISSTCGTCSKRVTNKQRAICCDTCDTWFH FT ASCVNLSTEDYDYLSNSSDQWDCFGCTMPSASRSFFEDCTNLSEQYQANDS FT ILSDNLDYQTVFDQLKSKKGLKVAHLNIRSLLPKLDELRAGMIHNPIDVLA FT LTETWLDASIDDNELYISNYTLYRKDRDRHGGGVACYVNDRLQHNLISELT FT DFNIENVWVEIKYSIGKPIVIGTVYRPPSTTTQFFDTLEPAMTAATALSDE FT IFVLGDLNCDTLSKSSSKKIDNLCNLFQASQLIDKPTRITENSSTCIDVII FT TTSPENVTDYGVCSTGLSDHSFIYVTRKVRQPRGTPRTATVRSYRNFDESS FT FQEELFNAPWSKVEEHADVNGALDCFHSILHNICDKHAPWVTVRIRGHEPP FT WMTQEYLSMARDRDYYFHRAKKTKQPGTWETAKRLRNKCNNMAQCLKKTYY FT RSEIESKQNDSKGLWSTLKTLLPGQTKHADKVPKQSENNIANEFNSYFTSI FT GAKLAAAFSSVYMATIGPPKSMFSFADIPTEFTHQQLLNIPLGKSTGVDGV FT SSRLIRHAAPAIAAPLTYIYNLSLSTGTVPTGWKKAKVTPLYKDGDKTDCS FT NYRPISVIPSFMKILEKAIHTQVYNFLTEHKILNPSQSGFRPKHSTTTTLI FT NVTDTILDNMDKGLLTGAVFLDLKKAFDTVSHEILLDKLRINGIQDNALLW FT FESYLSNRCQVTVVDGTISDCLGIAEGVPQGSVLGPLLFILYINDMPNYIS FT HGQIVLYADDTALFYASKSVADINRALNADLCNIEKWLDANRLTLNVRKCK FT SMLFGTIRRLRLETEELNLTLSGTYLEVVACFKYLGVWFDSCLTWSIHINK FT LCSTVSSRLGVLRRLVPILPPKTLSMLFNCMILPKIEYCDIVWGNCGKSLS FT DNLQKLQNRAARLVLGLSHRSHVDNDHLSALGWNSLASRRKMHLLQTVFKS FT IHRQLPEYLQIFNRTSHTYATRLNSNLSLQLPKVRLESGRRKFEYRGAFSW FT NELPPTVKVASSALSFKKLLRNGNHC*" XX SQ Sequence 3991 BP; 1196 A; 862 C; 809 G; 1124 T; 0 other; tcaaagaggt cagaggtcag ctatagcagt ggcacccact tttggtaggc tcctccgaat 60 ttgcgtattt ccttcgtttc cttattctat tttgcttcat atttcagttc agggctcact 120 ttgttattga gacctacagg agccagtcct actggcttta tacttatgtt acagttatct 180 tttgcgttgc tttgttctgc ttgattttag ttgtcatggg gagcctatcg cagggtggta 240 tgccatacat gtgtaacgcc tgacgcattg tatcactcta gtttaggtaa cagaccacag 300 aagttcattg tactgacatt tttttgtgat cgcctggaaa cccaagtgct ttcaccagaa 360 tgaacattaa tcttagattg tacagagcgc gtatcggctt atttggccca attcctcagt 420 tatgcttgta caaagacaac aatgtaagat ggcggttcta ccaggtgttg ctgattacca 480 tatgtctcat ccttacctct aaccaaaatg gcatcctaac ggcagccaat caggaaaacg 540 agctgaaaac aaatcagtta atgcatcatt ggcaaaacgc aagccaacag gtgacaagcc 600 tgtaccataa gaatttactg actgagcaat gtaatgacca ggttaacttg atttgtctgg 660 catttgacat ccatccaaac cctggaccaa ttagcagcac ctgtggtacc tgcagtaaga 720 gagtcaccaa caaacaaaga gctatatgct gtgatacttg cgatacctgg tttcatgcgt 780 cttgtgtaaa cctgtcaacc gaggactatg attatttgtc aaacagtagc gatcagtggg 840 attgttttgg ttgtaccatg ccaagtgcat ctcgttcctt ctttgaagac tgtactaacc 900 tatcagaaca gtatcaagcc aatgatagca ttttgtcaga taatctagac tatcagactg 960 tttttgacca gttaaaatca aaaaagggtc tcaaagtagc acatcttaat atacgtagtt 1020 tattaccaaa acttgatgag ttaagagccg gcatgataca caatcccatt gatgtgctag 1080 cgctcacgga aacctggttg gacgccagta ttgatgacaa tgaattgtac atatccaact 1140 acacgcttta cagaaaggat agagaccgtc atgggggtgg cgtggcttgt tatgtaaatg 1200 atcgtctgca acataatttg atttctgagt taactgattt taacatagaa aacgtctggg 1260 tggagattaa atattcaata gggaaaccta ttgttattgg tacagtatac cgcccaccga 1320 gcacaaccac acaattcttt gatacccttg aaccagccat gaccgccgcc acggcgctat 1380 ctgatgagat ttttgtactt ggagatctga attgtgacac tctttctaaa agttcgagta 1440 aaaaaataga caacctttgt aacttgtttc aagcatctca actcatagat aagccaacac 1500 gtattactga aaactcatct acctgcattg atgtcattat aacaactagt ccggagaatg 1560 ttacggacta tggtgtgtgc tcgaccggac tcagtgacca ctcgtttatc tacgtcacaa 1620 ggaaggtcag acaaccaaga ggtactccca gaacagccac agtccgttcg tacagaaact 1680 tcgatgaatc gtctttccaa gaggaactgt ttaatgcgcc ctggagcaag gtggaggaac 1740 acgctgatgt taacggcgca ctggactgtt ttcattctat actgcacaac atttgtgata 1800 aacatgctcc gtgggtgacc gtacgtattc gaggacacga gccaccctgg atgacgcagg 1860 aatatctgtc gatggcccgt gaccgagatt actactttca ccgagctaaa aagacaaaac 1920 aaccagggac ttgggagaca gcaaaacgat tgcgcaacaa atgtaacaat atggcacaat 1980 gcctgaagaa aacgtactac cgaagtgaaa ttgagtctaa gcaaaatgat agtaaaggcc 2040 tttggtcaac cttaaaaaca ctcttgccag gtcaaactaa acacgctgat aaagttccta 2100 aacaaagtga aaacaacatt gcgaatgaat tcaattcata tttcacttcc attggagcca 2160 aactagccgc agccttctct agtgtttata tggcaactat agggcctcct aagtctatgt 2220 ttagttttgc cgacatacca acagagttca cacaccaaca attgctgaac atccctcttg 2280 gcaagagcac aggggtggat ggagtaagta gcaggctgat ccgccatgct gctccagcaa 2340 ttgcagcacc actgacgtac atatacaacc tatcactctc cacaggaaca gtacccactg 2400 gctggaagaa agccaaggtc acaccgctat acaaagatgg ggacaaaact gactgcagta 2460 actacagacc tatatcagta attccatctt ttatgaaaat attggagaaa gcaatacata 2520 cacaagtcta caacttcctt accgagcata agattctgaa tccttcacag tcaggtttca 2580 gaccaaaaca ctcaacaaca acaacattaa tcaatgtgac agacacgatc ctagataaca 2640 tggacaaggg gcttttgaca ggtgctgtct ttttggacct gaaaaaggcc tttgacacgg 2700 tgtctcatga aattcttctt gacaaactac ggataaatgg catacaagac aacgcattgc 2760 tgtggtttga gtcatacctg tcaaataggt gtcaggtaac agttgtagac ggcaccataa 2820 gtgattgtct tggtatagct gaaggcgtac cacaagggtc tgttctcgga ccgctattgt 2880 ttatattgta cataaatgat atgccaaact acataagcca tggtcaaata gtactctatg 2940 ctgatgacac agcactgttt tacgcatcaa agtctgtcgc tgatataaac agagcgttga 3000 atgctgatct ctgcaatatt gagaagtggt tggatgcaaa caggctaact ctcaatgtcc 3060 gtaaatgcaa gtctatgctt tttggcacta taagaaggct gcgtctagaa acagaagaac 3120 taaatcttac cctttctggt acttatttgg aagttgttgc atgttttaaa tatctaggtg 3180 tttggtttga ttcctgtctt acatggagta tacacattaa caaactatgc agtacggtgt 3240 cctctaggtt gggagtgtta agacgtttag tacccatttt accacctaag acactttcca 3300 tgttgtttaa ctgtatgatt cttcctaaga ttgagtattg tgatattgta tggggcaact 3360 gcggaaaatc tctctctgac aatctccaga aactccaaaa ccgtgcagca cggcttgttc 3420 tcggcctgtc acatagatca catgtagata atgaccatct gtctgcatta ggttggaatt 3480 ctttggcatc tcgtcggaaa atgcacctcc tgcaaaccgt cttcaagtca atccatcggc 3540 agttaccaga gtacttacaa atatttaaca gaacatctca cacatacgca acaaggctta 3600 attctaactt atcactacaa ctaccgaaag tcagacttga atctggccgt aggaaatttg 3660 agtacagagg tgcgttttca tggaacgagt tgccgcccac ggttaaagtg gccagttctg 3720 cactgtcatt caagaaactg ctaagaaacg gtaatcactg ctgacccctg acctcctgac 3780 ccggcacgaa ttttgatcaa tgagttacga ttttgattta tgttcattta tatattgttt 3840 ccttgatttg tttatcgttt agtttatctt atttctttgt tatgctattg tgtgtatgtt 3900 gtttgtacag ggcacacctg aaaagcagta cgcaaagtac tgagtgtgtc caccctgtat 3960 aaagaaaaca gaaataaata aataaataaa t 3991 // ID BEL-164_AA-I repbase; DNA; INV; 6313 BP. XX AC AAGE02018006; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-164_AA_; KW BEL-164_AA-LTR; BEL-164_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018006; Positions 18014 11702. XX CC Positions [5363-5923] - Integrase core CC 'GTAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 30..4880 FT /product="BEL-164_AA-I_2p" FT /translation="MVDNGNDSQVPRHCASCTKPDELDDMVACDACHNWHH FT YSCAQVDATVQDRAWKCASCELPTNSDQSPSGTGVKEVNKTGAKPKAPIGL FT PQTLAVPSKKSGAKSSAGSRKSKKVTGEQSTTSSARMRLEAELKAVEEQQR FT IKEEELAAEKELQDLERKMEAELRERELAIETKRIAEAKAELKKKISDERE FT FRMKQMAIRRQSEEEKAKLIRQASEYGSTRGSLVGSATRSDSKVEDWLEKS FT TQQTAGPLQQFTSAVDPSCGQTQVFLEKTNRQQPADIRADRSLDSPPLTLP FT NKAVPSFKLHPTQPLVSQVRKDQQAHTNAFVNRHFVDEFDLLNVGGASGNQ FT ANSAGQNQKRVGGNIDGWGNRDHASARERDEGACRLQNVGEAPLPEGGAYH FT QRNDPPVRNMGDNRRGQAATDQSGAGIGVRIRQPVSRTDESINQGRNNEEE FT FLESPTHRQLTARQVMGKDLPNFSGNPEEWPIWISNFRRSTSTCGFSDDEN FT LIRLQRCLKGPALEAVRSRLLCPASVPHVIRTLEMRYGRPETLIRSMTERV FT RRQPSLRINDLDGIIEFGLVVDNLIQHLKNAGQQAHLTNPSLLHDLVGKLP FT VDYKLKWSAYKSMRLDVDLSTFGQFMSTLVELAYEVADDFPNIEKIQKQKQ FT KPKERAFVQTHTESTGPPVVRATNSGTPGTRTPKKLCAVCKAEGHRVADCS FT RFQTMDVSGRLNVVQQSGLCRSCLNFHGKWPCKTAKECSVNDCRLKHHPLL FT HSPAATTHAVVSTSHLSETTTSEGPFFRIVPVTLYGKDCVVNVYAFVDEGS FT KITLLEDTVADQLGVEGPTEPLNLQWTGNIKRSEPNSRRITAAISGIGSSK FT QYQLINAHTVGGLLLPSQTVNYEEMCSRYPHLRSLPISSYEKVSPKLLIGL FT DNLKLTVPLKIREGRWTEPIAAKSRIGWSIYGCAATTETTVVCGLHFGGWT FT NQEQDLNQLVRDYITLDNAGVTCSNTPLESEEDKRARMLMESTTKRIGGAF FT ETGLLWKVDHPRFPSSYGMAYKRMCAMERRLKKDSTLYERVRQQVREYEMK FT QYAHKATPHELSTTSSDQCWYLPLGIVLNPKKPNKLRMIWDAAATVDGVSL FT NTALLKGPDFLTSLPAVIGNFRLYRYALTGDIKEMFHRFFIRAQDRQFQRF FT LFRDHPHEEPVIYVMDVAIFGSACSPSSAQYIKNLNAREFEAEYPRAAAAI FT IHRHYVDDYLNSFGTVEEAVKIGRQVKNIHAEGGFEIRNFLSNDPTIAAHV FT GDESSEAEKHIHVEKNERIESVLGMKWIPSSDTFTFTISLRDNLQHVLNRS FT HVPTKREVLRTVMSFFDPLGLISFYLVHGRILMQDIWAIGIGWDDLINEKI FT YMQWKKWIDLIPELSNLRIPRCYIRGATESSYCELQVHVFVDASQSAYAAA FT AYFRAETPEGPVVNLVAAKSKVAPLKLLTIPRLELQAAVLGSRLLNSVIAM FT HALPVTKRVLWSDSNTVLAWIRSDQRRYHQYVGFRISEILSLTDVNEWRKV FT PTKENVADDATKWGAGPNISSDSRWFEGPQFLRQPEECWPGRDVQIDPTEE FT EIVACNIHDAIQPALVDVARFSKWEKLHRTMAYVHRFVQNVQRTHRNEAR" FT CDS 5228..6313 FT /product="BEL-164_AA-I_1p" FT /translation="MRQLYEIPRLRSIVLKVTEKCMTCRIRRATPKPPPMA FT PLPKVRVTPYVRPFTFVGVDYFGPVLVKMGRSNVKRWIALFTCLTIRAIHL FT EVVHSLSKESCVMAVRRFVSRRGAPAEIFSDNGTNFLGANNQLKREIEELD FT QHLASTFTNTTTRWSFNPPGAPHMGGAWERMVRSVKVAIGGLLETQRRPDD FT ETLETIIIEAEAMINSRPLTYIPLESADQESLTPNHFLLGSSSGVKQMPVM FT PTDYQATLRNGWKLAQHLSDSMWKRWIKEYLPVISRRSKWFDEVKEVEVGD FT LVLIVDGAIRNQWMRGRVDKVITGKDGKVRQVWVRTANGVLKRPVVKVALL FT DVVDSGKPEVSNRGLRVGG" XX SQ Sequence 6313 BP; 1718 A; 1469 C; 1757 G; 1369 T; 0 other; attcctttag aatttttcat cgcaaagcaa tggtggataa tgggaatgat tcgcaggtgc 60 cacgccattg tgcttcatgc acaaagcccg acgaactcga tgacatggtg gcctgtgatg 120 cttgccataa ctggcatcat tactcctgcg cccaagtcga tgccacagtt caagatcgag 180 cctggaaatg tgcatcttgc gaactcccaa caaacagcga tcaaagtccc tccggaactg 240 gtgtgaagga ggtgaataaa actggtgcga aacccaaagc gccgatagga cttccgcaga 300 ccctagctgt tccgtcgaag aagtcaggag cgaagtcgtc ggccggcagc cgaaagtcga 360 agaaagtgac cggggaacaa agcactacgt cgagtgcccg gatgcgactg gaggcagaat 420 tgaaggctgt ggaagagcag cagcgtatca aagaagaaga gttggcggcg gagaaagagc 480 tccaggacct cgagcggaag atggaggcgg aactgcgaga aagggagttg gcaatcgaaa 540 ccaaacgaat tgcggaagca aaagccgagt tgaagaagaa gatatccgac gaacgagagt 600 tccggatgaa gcagatggcg atcaggaggc agtccgagga agaaaaagcg aagttgattc 660 ggcaagcttc cgagtacggc agcaccagag gcagtttggt cggcagtgct acacgatcgg 720 atagtaaggt ggaggactgg ctggaaaagt cgacgcagca gacagcggga ccgttacaac 780 agttcacgtc agcagtcgac cctagctgtg gtcaaacgca agtattccta gaaaaaacta 840 accgtcaaca accagccgat attagggccg ataggagctt agattctccc cctctaaccc 900 tgccaaataa agctgtcccc tctttcaaac ttcaccccac acaacctctc gttagtcagg 960 tgcgtaaaga tcagcaagct cacacgaatg ctttcgttaa tcgacatttt gttgatgagt 1020 ttgaccttct caatgtgggt ggtgcttccg gtaaccaggc caattctgcc ggccaaaacc 1080 agaagcgagt gggcggtaac atcgatggtt ggggaaatcg tgatcatgca agtgctcgtg 1140 aacgcgatga aggggcatgt cgtcttcaaa atgtgggtga ggcaccattg cctgaaggcg 1200 gtgcttacca ccaacggaac gacccaccag tgcgaaatat gggcgacaac cgtcgcggtc 1260 aagcagccac ggatcaaagc ggagcgggaa tcggtgtcag gattcggcaa ccggtttcga 1320 gaactgatga gtccatcaac caagggcgta acaacgaaga agaattcctc gagagcccaa 1380 cacatcgcca gttgactgcc aggcaggtga tggggaagga tttgcctaat ttttctggga 1440 atccagaaga atggccgatt tggatcagca attttcgtcg ttcaacatct acttgcggat 1500 tctctgacga tgaaaacctg attcgccttc aacgctgtct caaaggtcca gctctcgaag 1560 cagttcgcag cagactgtta tgtccagcta gtgttccaca cgtgatccgc actttggaga 1620 tgcggtatgg tcggccggag acacttattc gttccatgac ggaacgagtt cgtcggcagc 1680 catctctgag gatcaatgac ctggatggga tcatcgaatt cggcttagta gtcgataatc 1740 tgattcagca cttgaagaat gcagggcagc aggcacatct aactaatcca tcgttgctgc 1800 acgatctggt tggaaagttg cctgttgact acaagctcaa atggtccgct tataagagta 1860 tgcggcttga cgtggatctg agtacgtttg ggcagtttat gtcaacttta gtggagctgg 1920 catatgaagt agcagatgat tttcctaaca tcgaaaagat ccagaagcaa aaacaaaaac 1980 ccaaggaacg ggcgtttgtg cagacgcaca cggaatctac tgggccacca gtggttcgcg 2040 cgactaacag tggtactcct ggcactcgca ctccgaagaa actgtgcgcc gtttgcaaag 2100 cggaaggaca ccgggtcgcc gattgcagcc ggttccagac aatggacgtc agcggaagat 2160 tgaatgtggt gcagcagagc ggattgtgcc gatcttgcct aaactttcac ggaaaatggc 2220 cttgcaagac ggcgaaggaa tgtagcgtca acgattgtcg cctgaaacat catccactgt 2280 tgcattctcc agcggcgaca acccatgctg tggtgtcaac aagtcatctg agcgaaacaa 2340 ctacaagtga ggggcctttc ttcagaatag ttcctgtgac actgtacggc aaggattgcg 2400 tagtgaacgt ctacgcgttc gtggacgaag ggtcgaagat tacgctattg gaagacacgg 2460 tggcggatca acttggagtc gaggggccaa cggaaccttt gaacttgcag tggaccggaa 2520 acattaagcg tagtgaacca aattcaagac gtatcactgc tgcgatttcc ggaataggtt 2580 catcgaaaca gtatcaactg attaatgctc atacagtcgg tggattgctg cttccttccc 2640 agacggtgaa ctacgaagaa atgtgcagtc gttatccgca tttacgtagt ctgccgatta 2700 gcagctacga gaaggtctcg ccaaaactct tgattggtct tgacaacttg aagctcaccg 2760 taccactgaa aatccgagaa ggaaggtgga cggagccgat agccgccaag agtcgcatcg 2820 gatggagcat ctatggatgc gcggcgacaa cagaaacgac ggtagtgtgt gggcttcatt 2880 ttggaggatg gaccaatcag gagcaagatc tcaatcaatt ggttcgcgac tacattacgc 2940 tggacaacgc cggtgtaacg tgttccaata cgcccttaga gtccgaggag gataaacggg 3000 cgagaatgct gatggagtca actaccaaaa gaatcggtgg ggccttcgaa acgggacttt 3060 tgtggaaggt cgaccaccca cggttcccga gcagctacgg catggcctac aaacgaatgt 3120 gtgccatgga gagaagactg aagaaggatt cgacactcta cgaacgtgtt aggcagcagg 3180 ttcgggaata cgagatgaag cagtatgcgc acaaagcaac gccgcatgaa ctttccacca 3240 ctagttcgga ccagtgctgg tacttgccgt tggggatcgt cctgaaccca aagaagccga 3300 acaagctgag gatgatctgg gatgcagcgg caacagttga cggggtctcc ctaaatactg 3360 ctctgctgaa agggccagac ttcctaacct cattaccggc ggtcattgga aactttcgat 3420 tgtaccgcta cgccctaacc ggagacatca aagaaatgtt tcaccggttc tttatccggg 3480 ctcaagatcg tcagttccag cgattcctgt tcagggatca tccacatgag gagccagtaa 3540 tctacgtcat ggacgtagcc atctttggct cggcctgttc accaagcagt gcgcagtata 3600 ttaaaaacct gaacgccaga gaattcgaag cagagtatcc acgagcagca gcagccatca 3660 ttcatcgcca ctacgtggat gattacctga atagcttcgg aacagttgag gaagcagtga 3720 agatcggacg ccaagtgaag aatatccacg ccgaaggtgg gttcgagatc cgcaatttct 3780 tgtccaacga cccgacgatt gcagcacatg ttggcgacga atcatcagaa gcggagaagc 3840 atatccacgt agagaagaac gagcgaatcg aatccgtact gggtatgaag tggattccaa 3900 gcagcgacac attcacattc accatttccc tgcgagacaa ccttcagcac gtcctgaata 3960 ggtcacatgt tcccacgaaa cgagaagtct tgcgcactgt aatgagtttc ttcgacccgc 4020 tgggactgat ctcgttctat ctggtgcacg ggcgcattct gatgcaagat atttgggcga 4080 tcggtattgg ttgggacgac ctgatcaacg agaagatcta catgcagtgg aaaaagtgga 4140 tcgatcttat cccagagttg agcaatctcc gcattccgcg ttgttacatc cgtggagcca 4200 cagaaagcag ttattgcgag ttgcaggtcc acgtgttcgt cgatgcaagc cagtcggcat 4260 atgcagctgc agcgtatttc cgcgcggaaa cgccggaggg tccggtggtc aacctcgtag 4320 cagcgaaatc caaagtggca ccgttgaagc tgttgacgat tccacgactc gaacttcagg 4380 cagctgtttt gggttcccgt ctgctcaaca gcgtcattgc catgcacgct ctcccagtaa 4440 caaagcgggt actgtggtcc gattcaaata ccgtcctggc ctggattcgg tcggatcagc 4500 ggcgatatca tcaatatgtc gggtttagga tcagtgaaat tctatcgctg acggacgtca 4560 acgagtggcg gaaggtacct acaaaagaga atgtagccga cgacgcaaca aagtggggag 4620 cggggccgaa tatcagttcc gacagccgat ggttcgaagg accccagttt ctccgacagc 4680 ctgaagagtg ctggcctgga agagatgttc agattgatcc cacggaagag gagatagtgg 4740 cgtgtaacat tcacgacgca atacagccag cgctggtgga cgtagccagg tttagtaaat 4800 gggagaaatt acatcgcacc atggcgtacg tgcatcgctt cgtccagaat gtgcagcgaa 4860 ctcatcgaaa cgaagcacgg tagtctgggg ctttgtcgca ggacgaattg gtgcttgccg 4920 agagatggtt atggatacta gcacaaacag agtcctttac catggaaata cagctgttgg 4980 agaacactaa agggattccg gacggtgtcc acggagtcct gcctaaatca agcaccttgt 5040 ataagatgtg gccgtttatt gatgaagcag gggtgttgcg gaaacgaagt cgtctaagca 5100 atgcagattg gatggcgttc aataccaaat atccggtgat tctttcgcgt caacacccaa 5160 tcacttttct tctgactact accatcgtag atttcgccac tccaatcgcg agactgttgt 5220 caacgagatg cgacaacttt atgaaatccc aaggctccgt tcaatagtgc tcaaagtgac 5280 ggagaaatgt atgacgtgta ggattcgccg ggccactcct aaaccgcctc ctatggcacc 5340 tttgcctaag gtcagagtga ctccgtatgt gcgtcctttc acgttcgtag gtgtggatta 5400 ctttggtccg gtgctggtga agatgggacg cagcaacgtc aagaggtgga tagccctttt 5460 cacctgcttg acaatccgtg caatacacct ggaagttgtg catagtttat ccaaagagtc 5520 atgtgttatg gcggtcagga gatttgtctc gaggagagga gctccagcgg aaatcttcag 5580 cgataatggt acgaactttc ttggtgcaaa caatcagctg aagcgagaga ttgaagaact 5640 tgatcagcat ttggcgtcca cctttaccaa tacaacaacc cgttggtctt tcaaccctcc 5700 tggagctccc cacatgggtg gcgcatggga gcgcatggtt aggtcagtga aggtagcaat 5760 cggaggctta ctggaaacgc agcgacgacc cgatgacgag acactggaga cgattataat 5820 tgaggcagaa gcgatgatca actcacgtcc actaacttac ataccattgg agtcagcgga 5880 ccaggagtct ctaactccaa atcacttctt attaggaagc tcaagtggag tgaagcagat 5940 gccggtgatg ccaaccgact atcaagctac gctgcgaaat gggtggaagc tcgcacagca 6000 tttgtcggat agcatgtgga aaagatggat aaaggagtac ttgccggtga tctcaagacg 6060 gtcgaaatgg ttcgatgaag tgaaggaagt ggaagtagga gatttggtgc tgatcgtgga 6120 tggtgcaata cggaaccagt ggatgagagg aagagtggac aaggtaatta caggaaagga 6180 tggaaaggtt cggcaagtat gggtgcgaac agcgaacggc gtgctgaaaa gacctgttgt 6240 gaaggttgcc ctactcgacg tcgtagatag tggaaaaccg gaagttagta acagaggttt 6300 acgggtgggg gga 6313 // ID Gypsy-49_CQ-I repbase; DNA; INV; 6971 BP. XX AC AAWU01035675; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-49_CQ_; KW Gypsy-49_CQ-LTR; Gypsy-49_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6971 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 477-477 (2011). XX DR Genome; AAWU01035675; Positions 33500 40470. XX CC Positions [3388-3891] - Reverse transcriptase CC Positions [4942-5256] - Integrase core CC 'ATTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 291..1892 FT /product="Gypsy-49_CQ-I_2p" FT /translation="MVRFYILPNPEYLEEEEVDFELTMKREYDADQNLSEK FT RRLLRAIFKDERSNPAPVYFEFNILKEIDILIPQIGDADKRLRTKFEYRDV FT SRLRHFLRRLTAATANNDSEIIKRTEMCKLVENILRGHKQRVTYDPTELVS FT ETDKESKHDEEAANKSWRNPDKSALVQKYKTPQKGLDEVETESGEDSKHNY FT EKGKPKLFGTGKDKTNSTGTGTKPKKPLVSKDDLAELLKQYLDLNSTQGRN FT KGNSYKKESGGLGEFQDSESGVVGKNEMYKNNQGEKGTRNRSRFCWSSEDS FT LPDRRDRHGGKNRNHSIDSLPDRRDRHGGKNRNHSIDSSPDRRERHGGTNR FT NRSIDSSPDRRDRHVGKYRNRSKDASPARRGRHNHRHRDQNSFEESSTDRQ FT RRHRERNRNRNLRRNRSSPNSSDSGEGYPESGHRRRQYSRERYPRGGNRHS FT RVEHWDISFSGDVKSLQVEDFLTRIRKLARHEQVGQHELLNKIHYKLKGEA FT NDWWFTREHRHGTDLRGKLGSVLEIRIGSEEYVRNLRI" FT CDS 2887..5781 FT /product="Gypsy-49_CQ-I_1p" FT /translation="MDFWNAFQIRPMIQGEGRMTDIDLLDVNGEPIGNLTT FT EELLVDQYINRENEINLTVCALELTEQEQFFHRAQPEEDESLDVPSLELPN FT NPEEAIENIETEHELTSPEREKLTETLKHFQWTSESKLGRTTLIEHEIEIM FT EGAKMRDLPMYKYSPSMWEKIEKELERYEQLDVIEECTSESASPLVPVKKS FT NGKLRVCLDSRRINSITKKDAYPMRNMNEIFHRLQKAKYFSIIDLKDAYFQ FT IPLKESSRNLTAFRTPKGLFRFKVVPFGLKNAPFTMSRLMNKAIGFDLEPY FT VFIYLDDVVVATESLEEHFRLLREVAERLKKANLTISVEKSRFCRNQVRYL FT GYLLTENGLAIDSSKLEPILNYSRPKCIRDVRRLMGLMGFYQKFIKNYSHV FT TAPITDLLKKEKKFKWTEAAESALEELKSVLTSSPVLANPIYNRPFIIETD FT ASELAVGAALLQDFPDGRRIIAYYSKKLSSTQRKYAATEKECLAVVLAVEN FT FRHYVEGTTFAVITDAKSITWLFSISATNGNSRLLRWALKLQSYDFSIQYR FT KGKDNILADCLSRIETLQTVDNDYAELRDRIRKNSNKFSDFKVAGNKIFKY FT VHDSNTVQDKRFLWKYYPQMSERSDLIRVAHEPGHLGYEKTIAKLRERYFW FT PKMAQETKVYCKSCLPCKTSKSTNINTTPPMGSQKQNCDYPWQFITFDYLG FT PFPPSGKARSTCLLVVTDVFTKFVLIQPFRRATASTLVHFLEQSIFLLFGV FT PEIILSDNGSQFTSKMFADLLKHYGINHWLTPSYHPQVNNSERANRVITTA FT IRATIKSNHKTWADNIANAIRNATHDSTKYTPYFLTFGRNMISDGREYDMI FT RETNAQSQEMEINDEERKKLYEQVRMNLKNAYTKQAKYYNLRSNVKAPKYQ FT VGEKVLKKSTFLSDKGKDFCAKLAPKYTEAIVSKVLGDSYQLEDLSGKSIG FT IFHASFLKKY" XX SQ Sequence 6971 BP; 2354 A; 1232 C; 1472 G; 1913 T; 0 other; ttggcgccca acgcaaaatt gataaatatc attttcgaga ttgccttgtc taaattgttt 60 tttttttccc tctgatactt tacaaggaag ggagaatata ttttcgaata ttgcgtttgc 120 atcctccatc tattattatt attttttttt taatattttc gatatttttt tttattgaac 180 tcagcgtttt gtatatatat atttttttcc atattctaaa ttggttgtag tttaatattg 240 aattggagtt tcttaacttt cgtttagatt tttaaattta aagtgtcaaa atggttcgat 300 tttacatttt accgaatcct gaatatttgg aagaggagga ggtggatttc gagttgacga 360 tgaaaaggga atatgatgct gatcaaaatt tgagtgagaa aagaagactt ctcagggcaa 420 ttttcaagga tgaaagaagt aatccagctc cggtttattt tgagttcaat attttaaagg 480 agatcgatat tctaattcca caaattggag atgctgataa aagattaagg acaaaatttg 540 agtacagaga cgtgtcacgt ttgcgacact ttctgagaag gctgacagca gcaacggcta 600 ataatgatag cgaaattata aaacgtacgg aaatgtgcaa attagtggag aacatcttaa 660 ggggacataa acaaagagta acatatgacc caactgaatt agtatctgag accgacaaag 720 aaagcaaaca tgatgaggaa gcagcaaaca aatcatggag aaaccctgat aagtctgcgc 780 tagttcaaaa gtacaagacc cctcagaaag ggctcgatga agtcgagacc gagtcgggag 840 aggatagtaa acataattat gagaaaggaa aaccgaaact ttttggaaca gggaaggaca 900 aaactaactc aacaggaacc gggaccaaac caaaaaaacc tctggtttcg aaagatgatt 960 tagcagagtt actaaaacaa tatctggatc taaactctac tcaaggtaga aataaaggaa 1020 acagttataa gaaggaatct ggaggtcttg gggaatttca ggatagtgag tcaggagttg 1080 ttggaaaaaa tgaaatgtat aaaaataacc agggggaaaa aggtacacgg aatcgctcca 1140 gattttgttg gtcttcggag gactctttac cagatcgacg ggataggcac ggaggtaaaa 1200 accgaaacca ctcgatagac tctttaccag atcgacggga taggcacgga ggtaaaaacc 1260 gaaaccactc gatagattcg tcgccagatc gacgggaacg gcacggaggt acaaaccgaa 1320 accgctcgat agattcgtcg ccagatcgac gcgataggca cgtaggcaaa taccgaaacc 1380 gctcgaaaga tgcttcacca gctcgacggg gtcggcataa ccataggcac agagaccaaa 1440 attcatttga ggaatcttca accgatcgac agaggcgaca cagggagaga aatagaaata 1500 gaaacctcag acggaacaga tcttcaccga acagtagtga ttcaggggag ggatatccag 1560 agtcaggaca tcgaagacgt cagtatagca gagagcggta cccaagagga ggaaaccgac 1620 actccagggt tgagcactgg gacatcagct tcagtggaga tgtcaaatcg ctccaagttg 1680 aagatttttt aacaagaatt cgaaaattag ctagacatga acaggtagga cagcacgaac 1740 tattgaataa aattcattac aaactgaaag gggaagctaa tgattggtgg ttcacccgag 1800 agcatagaca tgggacagat ttgagaggga aattaggttc cgttttggaa atccgaatag 1860 ggagcgagga atacgtgcgc aacttaagga tctagagcaa aagaggggtg aaaaatttgt 1920 tgcttatgtc aacgaggtag aaagattaaa tcagtgtctt aaacacccac tttctgaacg 1980 ggaaattttt gagttggtat gggataaaat gcgaccccat tacaggtcca aacttgtaac 2040 tgtaacggtg aacaacctag atgagttatt agcggttaac cgcagaatcg acgcaactga 2100 cccaacgttg tacaggtcgg gaggacactc aagaaatgaa attcagcatg taagaggtca 2160 ggatggtttt gaagacgagt actcgtcaga agaagaagct tcagtaaacg cagttcagcg 2220 gaagttcaaa gatgatagga aaccctcaaa accaaacgga caatcggtta agtcgcgaag 2280 tgcacaggta actgaagatc cagagatttt gtgctggaat tgtcgaaaaa tgggtcatca 2340 ttggagaaca tgcagggaaa ctaaactagt gttttgttac gcgtgtggaa atattggacg 2400 taccactcga acgtgtgaaa tgaaccaccc tattcaacag gtgcaagcac gcacccagaa 2460 tagctcaggt aatcatcagc aatcgttaaa ctagaggcgg aatgttcttc ggggaaccca 2520 ctcattccgc ttaagaaaga ggttcctcca aacccgaata tagaagagac acaatttcaa 2580 aaatttaagc aggttttgac tatcagagtt gatccaaatt taagtccaaa aattaaggtt 2640 tccatttttg atagcgatat tatcgctttg ttggattcgg gagctaatat aagtgtccta 2700 aattcaacta aatttttacg aaaacatggt cttaaactgt ttaaagcaag tgtttcaatt 2760 tgcacagcag acaacacgga acacgaatgc ttaggatacg ccaacatccc ctatacattt 2820 gcgggggtga ctagggtaat tccaactatt attgtgccac aactttctaa gccactgatt 2880 ctaggaatgg atttttggaa cgcttttcag attcgcccaa tgatccaagg agagggtaga 2940 atgaccgata ttgacctttt agatgttaat ggtgaaccaa ttgggaattt aacgacggaa 3000 gaattgttgg tagatcaata tatcaacagg gaaaatgaga ttaatttaac tgtttgtgca 3060 ctagaattga ctgaacaaga acaatttttc catcgtgctc aaccagaaga agatgagtca 3120 ctcgacgtgc cttccctgga acttccaaac aatccggaag aggccattga aaatatagaa 3180 actgaacacg aactaacttc tccagaaaga gaaaagttaa cggaaacttt aaaacatttt 3240 cagtggacta gcgaatctaa acttggtaga accactctaa ttgaacacga aatcgaaata 3300 atggaagggg caaaaatgag agatttacct atgtacaaat attcaccaag catgtgggag 3360 aaaattgaaa aagaattaga aagatacgaa caattagacg ttattgagga atgcaccagt 3420 gaatcagcta gtcctttagt accagtaaag aaatccaacg gtaaactacg cgtttgtctt 3480 gattcaagaa gaataaattc cattactaaa aaagatgcat atcctatgcg aaacatgaat 3540 gaaattttcc acagactgca gaaggccaaa tattttagca taatcgattt aaaagatgca 3600 tattttcaaa tccccttaaa ggaatcttca agaaatttga cagcttttag aaccccaaag 3660 ggcctatttc gatttaaagt agtccccttc ggtctcaaaa atgctccatt cactatgagt 3720 aggttaatga ataaagcaat tgggttcgat ttagagccgt acgtgttcat ctacctagat 3780 gacgtggtgg tggcaactga atcgttagaa gaacatttta ggctcttgag agaggtcgcc 3840 gaaagattga agaaagctaa tttaacaatc tcagtcgaga agagtagatt ttgtcgaaat 3900 caagttagat atttagggta tttattgaca gaaaatggct tagccattga cagcagcaaa 3960 ctagaaccga ttctcaacta ctctcgtcca aagtgcatca gagatgtaag aagattgatg 4020 ggtttaatgg gtttttacca aaaatttatc aaaaactaca gtcacgtgac agcacctatt 4080 actgatcttc taaaaaagga aaaaaagttc aagtggacag aagcagcaga aagtgcacta 4140 gaagaactta agtcggtcct aacttcgtcg cccgtactcg cgaacccgat ttacaatcgt 4200 ccattcataa ttgaaaccga tgcatccgaa ctcgcagtcg gcgcagcatt gttacaagat 4260 tttccagatg ggagaagaat aattgcgtat tatagtaaaa aactttcaag tacccaacgt 4320 aaatacgcgg ccacagaaaa agagtgtcta gcggtagtcc tagcagtaga gaatttcaga 4380 cattatgttg agggaacaac gtttgctgta attaccgacg cgaaaagcat tacatggtta 4440 ttttcaatct ccgccacgaa cggaaattcg cgattactgc gctgggcact gaaacttcag 4500 tcctacgatt tctcaattca atatagaaaa ggaaaagata acatattagc ggattgttta 4560 tctagaatag aaacacttca aacggtagac aacgactacg ccgagttgcg cgacaggatt 4620 agaaaaaata gcaacaaatt ttccgacttt aaggtggcag gcaataaaat tttcaaatac 4680 gtccacgatt cgaacaccgt tcaggataaa cgttttctct ggaagtacta tcctcagatg 4740 agcgaaagaa gtgatttgat tagggtagcc cacgaaccag gtcacttagg atacgaaaaa 4800 accatcgcga agctaagaga acgttatttt tggcccaaaa tggctcagga aaccaaagtt 4860 tactgcaaat cctgtttacc gtgtaaaacg tccaaaagta ccaacataaa taccacccca 4920 ccgatgggtt ctcaaaaaca gaattgtgat tatccatggc aatttattac atttgactac 4980 ttaggcccgt ttccaccttc gggcaaagct aggagcactt gccttttagt agtaacagat 5040 gtttttacaa aatttgtact aatacaacca tttaggagag cgacggcgag tacgctagtt 5100 cattttctag aacaatcaat ttttctatta tttggcgtac cggaaataat tctatctgac 5160 aacggaagtc agttcacttc taaaatgttt gcagacttgt taaaacacta tggaatcaac 5220 cattggctca ccccgtcata ccacccccaa gttaataact cagagagggc aaacagggtg 5280 atcaccacag cgattagggc aacgatcaaa tcgaatcata aaacatgggc ggataacata 5340 gcaaacgcga ttaggaatgc aactcacgat tcaactaaat atacacctta ctttctcacg 5400 tttggccgaa atatgatttc tgacggtcga gaatacgaca tgataaggga aactaacgct 5460 cagagtcaag aaatggaaat aaacgacgaa gaaagaaaga aattgtatga acaagtaaga 5520 atgaacctca aaaacgcgta cacaaaacaa gcaaaatatt acaatttacg ttccaacgta 5580 aaagcaccga aatatcaagt aggtgaaaag gtcctcaaga aaagtacatt cttgtctgat 5640 aaggggaaag acttttgcgc gaagctcgca cccaagtaca cggaagcgat tgtcagcaaa 5700 gtgttagggg atagctacca attagaggat ttatcaggga aatcgatcgg tattttccat 5760 gcctcgtttc tcaaaaagta ctaaccttag ggtggaccag ctatgacatc ttcttttaga 5820 attttacaac ggctttgcct aaaaaaaaac gtctaggtac caccctagta acggtaaaca 5880 taacaacgtt gatgtgcatg cattgacaaa cactttattc tgtaattaaa caacataaac 5940 ataactcact gtattttttt ccattgtaaa tattttctat tctgattttt cgtgcgttgt 6000 cacttttaat tgattatttt gtagattgta atttgtagca taagtatgta ggctccattc 6060 ttgtttacag cagcataaat tcccattgtt gttggccagt tgagtcgtag caacgtagat 6120 gattttcttt tgcggttctc tgatattttt tttgttaatt tttcctagaa ataagtagaa 6180 aacggttaga atactgaatt ttgtgtagaa ttaccaaata ctctccagat ttaggccact 6240 agtttgcagt ttttccgttt tccgagattt ttcacgtgaa ttcataaatt ccttcccacg 6300 tttgttttgt ttcgaaatct gtcagcgacg ctgtcaaagc tttgcagggt tgcttttaat 6360 gctgtcagtt gaatgttgca gtatggtgtt gcgcgactgt cgcacaagtt ttaatttata 6420 gcaaccaaat ttctatagat gtgtcaaact gacggttgat gattagattt ttggaagctg 6480 taaaattaac caatgtttac gaaagtgccg aaagtcaatc aaggagtcaa tatttggatt 6540 gatgttgttg atttgaagca aaattttaag tgaatacatg agtttacgag ttgtcgggac 6600 atttccgata tggtagacgc ggtacaagct ttgtttacgt tttgagttga gaggtggttt 6660 tgatattctg cgttctatgc tgcgattggg aaaggcccaa taatatagac aagcagttat 6720 ttttgatttt gttgttgttt ttaagaaacc cttgcaggat ttgatgacgg atgcaaacac 6780 aatttctttc actactacat ggtttgaact ttttttcttt agtatttgtt ataaaagtta 6840 aacaattgta aatatgttta cttatagtta gttgtttatt tcttaatgga gttatttttt 6900 agtttatatt tgggttatga aaatttgatc aattttcatt gatcaaattt tcataaaatt 6960 gcgtggtgtg a 6971 // ID Mariner-23_HM repbase; DNA; INV; 2635 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-23_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2635 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1957-1957 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 568..2286 FT /product="Mariner-23_HM_1p" FT /translation="MTTSKKSYNEEDIEAALNDIKEKNTSIRKAAFKYGIP FT FSTLIGRKNNNNASFKGSGSTTVLSQQTESIMVHAFKFLCDWGYGLTRNDV FT MDFVCGYLLRTNQSGLFKNGRPGKDWFYAFINRWSKEISTRKTHTLASARA FT ASCTQEIIDNYFEVVTKQYQLSGITSGCHIWNCDETGFCGDQGKATVVCRK FT GAKRVLKLTGNNEKIHYTVNNCCNADGYFTPPFVVYKAKRNFRAEWALGGP FT TGTKYSISKSGWMEHDTFIEWLKEVYIPECQAISGTHILHLDGHTSHVSLA FT AVLLCRENNIILICLPAHSSHILQPLDRGVYCHVKQVWKAVLTKYYKDTKC FT KNLDKENFPTLLKQVYESGKCFTRLHAIAGFEYTGLFPLNKNKINQQALAI FT SETFNQPTSSTPLRTIKSLALSSEPATAASRFDKYRALHERMKIEMENELK FT SFFQEKNQSHVKKSRQPNIPKIAGQCITEHGVADHLKKREEVKQLKLAEKQ FT AKKRKTICKLIMPLPNPAEKEVTQNIEPITTPSAKRRKVIKCRKCKKELHS FT DMMSCTRIQIAILGSATKHVCQKIL*" XX SQ Sequence 2635 BP; 966 A; 418 C; 422 G; 829 T; 0 other; gagtaaaagc gggtcaaacc gatcatacta ttttttcgat cactgatcga aaaatttaaa 60 taaaaccgga actgatagga atatttaaaa aataaaaaca ccagaagaaa gagaatcatc 120 tcctctattc aaatgtgtca aaataaaatc caagtcattt ttcagtcatt ttatttttat 180 aaaagtttaa aagagagtcg taaaaatttg cttttttctt ttatttgcta tcgtaataat 240 tcgggaacca taattcgttt aaaaataatt ttacctctat cttgtagaaa atttaatttt 300 aataccgatt acataatttt tttttcgttt tatagttaaa aattagacaa aaaagtaaga 360 aaaattaata ttaaattttt tctggtaaaa ccgatcattc acatatttac tttaaattcc 420 tttattaagt agtatgtgaa tttttaaatt aggtacctag gtgaattaaa ttattgaatt 480 ttctcatgtg ttagttccgg gttcttccta tataaagtta ttttacaaaa atgactcaat 540 gatatttatt aattattcac tttcaagatg acaacaagta aaaagtcata caatgaagaa 600 gatatagaag ctgcgttaaa tgatataaaa gaaaaaaata cttcaatcag aaaagctgca 660 ttcaaatatg gcattccatt ctcaactcta ataggtcgca agaacaacaa caatgctagc 720 ttcaagggat caggatcaac aaccgtactc tcacagcaaa cagagtcaat tatggtgcat 780 gcgtttaagt tcctatgtga ttggggttat ggtttaacac gaaatgatgt catggacttc 840 gtatgcggtt accttcttcg tacaaatcaa tcaggcctat ttaaaaacgg caggcctggc 900 aaagactggt tttatgcatt tataaaccga tggtctaaag aaatttccac aagaaaaact 960 cacactttag catctgccag agccgcttct tgtacgcaag aaataattga taattatttt 1020 gaagttgtca ctaaacaata tcaattgtct gggataactt ctggttgtca catttggaat 1080 tgcgatgaaa cgggtttctg tggtgatcaa ggcaaagcaa ctgtagtatg tcgaaaaggt 1140 gcaaagcgtg ttttgaaact cactggtaac aatgaaaaaa tccattatac agtgaacaat 1200 tgctgtaatg ctgatggata ctttacacca ccattcgtgg tatacaaagc aaaacgaaac 1260 tttagagctg agtgggcttt aggtggtccg actggcacca aatattcaat ttctaaatct 1320 ggttggatgg agcatgacac atttattgaa tggttgaagg aagtgtatat accagaatgt 1380 caagctatta gtggtactca tattttacat cttgatggac atacgtctca cgtcagttta 1440 gcagctgtat tgctatgtcg ggaaaacaat ataattttga tttgtctacc agcccactca 1500 tctcacattc ttcagccact ggacagaggt gtctattgcc atgttaagca agtgtggaag 1560 gcagttctta ccaaatatta caaagacaca aaatgcaaaa acttagataa agaaaatttt 1620 cctacgttgc tcaaacaggt gtatgagagc ggtaagtgtt ttacacgttt gcacgctatt 1680 gctggttttg aatatactgg gttatttcca cttaacaaaa acaaaatcaa tcaacaggca 1740 ttagctatct ccgaaacgtt taatcaaccg acgtcttcaa cgccattgcg aacaataaag 1800 tctttagcat tatcttctga accagcaaca gcagcttcga ggttcgacaa gtacagagca 1860 ctacatgaga gaatgaaaat cgaaatggaa aatgagctaa agagtttttt tcaagaaaaa 1920 aatcagtctc atgtaaaaaa atcgaggcaa cctaatattc caaaaattgc tggccaatgt 1980 attacagaac acggtgttgc agatcatctc aaaaaaagag aagaagttaa gcaactcaaa 2040 ttagctgaaa aacaagctaa aaagcgtaaa accatctgca aactcataat gccattgcca 2100 aaccctgcag aaaaagaagt aactcaaaat atagaaccaa ttacaacacc ttcagccaaa 2160 agaagaaaag ttattaaatg cagaaagtgt aaaaaagaat tacattcgga catgatgtca 2220 tgtacgagaa tccaaattgc aatacttggc tctgcaacaa aacatgtttg ccaaaaaatt 2280 ttgtaatggg ttcagatttt ttttgttgta aaaaatgtaa accttaaatc ttttattatt 2340 tctgtatgct atataaaaaa tatataaaaa tatatacatt tacatatata aaatgttaaa 2400 tttattaaat gtattttata ttgatcggtt ttaacagact ttctgccaaa agcgatcttt 2460 tttttttctt ctagttttca tattttcaaa tacaattaaa aaaaaaaaaa aataataaat 2520 aatgtgcttg ttggtatttt ctatcttatt taaaaaaaat agtttaaaaa tttttaaaaa 2580 taataaagtt acttgatttt tattgttcgg tgatcggttt gacccgcttt tactc 2635 // ID TED repbase; DNA; INV; 6964 BP. XX AC M32662; XX DT 06-FEB-1997 (Rel. 2.01, Created) DT 06-FEB-1997 (Rel. 2.01, Last updated, Version 1) XX DE Internal part of retrotransposon TED inserted in Autographa DE californica nuclear polyhedrosis virus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; TED; KW TED retrotransposon; Gypsy group. XX OS Autographa californica MNPV OC Viruses; dsDNA viruses, no RNA stage; Baculoviridae; OC Alphabaculovirus. XX RN [1] RP 1-6964 RA Friesen D.P. and Nissen S.M.; RT "Gene organization and transcription of TED, a lepidopteran RT retrotransposon integrated within the baculovirus genome."; RL Mol. Cell. Biol 10(6), 3067-3077 (1990). XX DR GenBank; M32662; Positions 274 7237. XX CC LTRs of TED are named LTRTED. CC TED, found inserted within the insect virus belongs to the family CC of CC retrotransposons that includes Drosophila melanogaster elements CC 17.6 CC and Gypsy and thus represents the first nondipteran member of CC this CC invertebrate group to be identified. The internal portion of TED, CC flanked by long terminal repeats (LTRs), is composed of three CC long CC open reading frames comparable in size and location to the gag, CC pol, CC and env genes of the vertebrate retroviruses. CC Three conservative domains of TED open reading frame 2 (pol) CC encode CC protease, reverse transcriptase, and integrase functions, CC respectively. CC TED integration within the baculovirus genome thus represents one CC of CC the first examples of transposon-mediated transfer of CC host-derived CC genes to an eukaryotic virus. CC misc_binding 1..17 CC /note="tRNA primer binding site" CC /bound_moiety="tRNA primer" CC CDS 290..1622 CC /note=ORF1 (gag) CC CDS 2033..5287 CC /note=ORF2 (pol) CC CDS 5712..6950 CC /note=ORF3 (env). XX SQ Sequence 6964 BP; 2399 A; 1552 C; 1210 G; 1803 T; 0 other; ggcgcagtcg gtaggatact cgcgggccgg ttcgcgagca atagagtcat ctcatatcga 60 tattcgcgtt cgcgagtcgc ccgggcggct ttcaagccac atcctgagta tcctgaatct 120 acgggacaac tgcggagcca ggcggagaat ccagacatcg attcgcagag aaagaaccaa 180 tatgaaactt atgtaagtac acttgaatcg ttctcgtgtt gcttcctttc tattcctagt 240 tactttaatc tctaaaaatt tttcgtcacg aaaagtttta cacccgggaa cctgcatcac 300 gccaggctcg ccggaatatc attgatatta atagagactt agataattca gatttaaact 360 taggtttaga tagattattt agtataaaaa tgccacacga tcctgacgta attttcaagg 420 ctttgcgact tgtgccggaa tttaatggca accccaatat tttaacgaga tttataaata 480 tatgtgataa actagtagag caatatgcga gcgctgagcc gggaagtgag ttaggaaatt 540 tatgtttgtt aaatggcatc ttgaacaagg tcactggaac agctgcctct accataaacg 600 caaatggcat tcctgaaacc tgggtaggca ttagatcatc tttaattaac aacttttcag 660 accagcgcga tgaaacggct ttatataatg acctctcatt agcttcacaa ggtaataaga 720 ctcctcagga gttctacgaa caatgccaaa ccttattcag taccataatg acgtatgtaa 780 cgttgcatga gactttacca acgactatcg aagctaaacg ggcactttac aagaaagtaa 840 ctgtgcaggc ttttgtgcgg ggactaaaag aacctttagg ttcacgtata aggtgtatgc 900 gccccgagac tatcgagaaa gcccttgaat atgtgcagga agagctaaat gtaatatatc 960 tgcagcaacg caatgagtct tcgagggctc acagctctcc taaaatgctt cctataccgc 1020 aacagtctcc tgtgacccca ttcaatacat taggtattca cagaccgccg gtacctaact 1080 ggccggttcc gatgggacaa cgtggcaatc aaccaccacc tcaacccttt aaatttaatg 1140 tgcctaacca atatcataat cgcatgccta ctaaaactca acagatgctt agagcaccgc 1200 caccaaatta tcatcctcag agtaacgttt tccgcttacc accacgtaat ccaccaccaa 1260 atcaaattgt aaaaccgatg agtggagttc aacattttgt cccaaaaact ttacctgtaa 1320 tgacgggaca tgactggcgt aaatccggga atccgccgcc aaataattac ttcaaaactc 1380 gcgaattaaa cgttaacgaa ttctactcgt ctgacgactc atataactca gtcgactatt 1440 attccgaacc agggtgcgac tattatactg actattataa caacccctat gactatgaca 1500 caaacgcgac ctgttacgac ttaccttacg acgctagtga gacagaagct caaccaggcc 1560 ctagccatgt acatgaaagt caggattttc aatcgaccaa accatcaaac gaacaaggat 1620 agatattaac ctacagtacc aaagacaact accatatata gaattttctg atccaccatt 1680 aaaattcttg attgacactg gtgccaatca atcatttatt agcccccaag ccgtccaaaa 1740 atacttttcc aattactcgg tgaattacga cccatttgaa ataacgaaca tacacggtgt 1800 cagtagaaat gagcactcaa ttacattacc atgtttccag gagtttaacg aaacccaaga 1860 tattaaatta tttatatacc attttcatga ttatttcgat ggattaatag gactagattt 1920 attgtctaaa tgggaggcca agatagattt aaaagacttt ttactaataa ctaaatttgc 1980 aactaaccgc attaagctat ataactctcg taatgtcaac ctgtacgagg atatgatacc 2040 tgcaagaagt tctaagttag taaggatacc tattaacgct acggatggcg aagttttagt 2100 agaagaacaa atgttttgca actgcattgt ccatgaatgc gttaccatgg taaaagatgg 2160 ccgtggatat gtagaattag agaatccaac ccctaatgat gtaatttttt acctggatca 2220 gccagcttct gccgaattat tcaatatcaa gtgcacacaa gttgaacagt cacaacgtgt 2280 agacgatgtt ttatcacgat tgcgtacaga ccatctcaat gaggaggaaa aagctaacct 2340 tttaagactt tgctctcgat attcagatgt attttacatt gacggggaag cccttacatt 2400 cactaataaa attaaacacc gtataagaac aacggacgaa gtacccgtgt acaccaaaag 2460 ttaccggtac cccttcatcc atcgccagga agttagggac caaatcacga aaatgttgga 2520 ccaaggaatt ataagaccat cagactctgc atggagctca cccatatggg ttgtgcccaa 2580 gaaaatcgac gcttctggga aacaaaagtg gcgtctcgta gttgacttcc gtaagttgaa 2640 cgagaagact atcgatgaca aatacccgat accaaacata agtgacgtac ttgacaagtt 2700 aggtaagtgc caatacttca ccaccttaga tttggcaagt gggttttatc aggtggagat 2760 ggaccctcaa gatatatcga aaaccgcgtt taacgtagaa cacgggcatt ttgaattcct 2820 tcgaatgcct atgggattaa aaaactcacc atctactttt caaagagtta tggacaatgt 2880 cctaagaggt ctccaaaata acatctgtct cgtctacctt gacgatatta ttgtctatag 2940 tacttcccta caggaacacc tggagaacct ggaacgagtt ttccaaagac ttagagaaag 3000 taacttcaaa attcaaatgg acaagtccga attcttgaag ctcgaaactg cttatcttgg 3060 tcacatcata agcagggacg gtatcaagcc taaccctgat aagatttccg ctattcaaaa 3120 atatctgatt ccaaagaccc ctaaggaaat aaaacaattt ttaggccttc tcggttatta 3180 ccgaaaattc attccagatt ttgcacgact cacaaaaccc cttacacagt gcttaaaaaa 3240 aggtagtaaa gtaactctta gtcccgaata tgtaaatgct tttgaacact gtaaaactct 3300 gttaaccaac gacccaatat tacaataccc agactttacc agagaattta acctcacgac 3360 agacgcttct aacttcgcta ttggagcggt actatcccaa ggaccaatag gatccgacaa 3420 acccgtctgt tacgcttctc gaacactcaa tgagagcgaa ctaaactata gcacaataga 3480 gaaagaatta ctggctatag tttgggctac aaaatatttt agaccctact tattcggtag 3540 aaaatttaag atattgactg accacaaacc actacagtgg atgatgaact taaaagaccc 3600 aaactcacga atgactagat ggcgactacg actaagtgaa tatgacttct ctgtagtgta 3660 caagaaagga aagtctaata ccaacgctga tgccctttct cgtgttgaga tccataccac 3720 ggaaatagac gaaattgact cagtaataga aaacattaaa gaacttagct caatgattaa 3780 taacccctcc gagacatctc gacctcaaag acaaaccgaa aacacagacg aaatcagaca 3840 gaactcgaca accgacactg tacatactag tgatgaacac cctattctgg aagtgcccat 3900 tacaaacgaa ccactcaaca gatttcatag acagattcat cttaccgtag taggagacat 3960 aaaacgacga cctattgtga caaaaccttt cgaaagtcat acccgaatag caatccaact 4020 atcagagtca aacttagaac aagatgttat aagtgccatc aaagaatacg ttaacccaaa 4080 agtcaaaaca gccctgatca taaatccgcc cttaaagatg tattctatta tccctattat 4140 acaaaagacg ttcagaagtt catcccttaa cttagtgtta accaaagtcg aacttgaaaa 4200 tgtcaaagag tatcttagac aacaagacat tatacgacat taccatgacg gaaagacaaa 4260 ccaccgggga ataaatgaat gctacctagc actctcaaaa aggtattact ggcccaggat 4320 gaaagatcaa atcactaaat ttatcaatga gtgtactatc tgtggtcaag ccaaatatga 4380 caggaaccca atacgacctc agtttaatat tgtaccccca gctacgaaac ctctagagac 4440 cgttcatatg gacctattta cagttcaaaa tgagaaatat ataacgttca ttgacgtatt 4500 tacgaagtac ggtcaggcat accacctacg tgatggcacc gctattagta ttttacaggc 4560 attgttacga ttttgcactc atcacggatt acccataacc atagttactg ataacggcac 4620 cgaattttcc aatcaattat tctctgaatt cgtacgtatt cataagataa ttcatcataa 4680 gaccttaccc cacagcccga gcgataacgg aaatattgaa cgtttccatt ccacaattct 4740 tgagcatatt cgaattctaa aactacaaca taaggatgaa ccaattgtta accttatgcc 4800 atacgctatc ataggctaca atagttccat acatagtttc accaaatgca gacctttcga 4860 tctactaaat ggacactttg atccaaggga cccgcttgac atagacctaa ccgaacacat 4920 tctgcagcaa tatgctcaga atcatcgtca acagatgaaa caagtctacg aaattatcaa 4980 tgaaacatct cttgctaatc gtacggcttt aatagaaagt agaaacaaaa cgcgcgaatc 5040 tgaagtagaa tacatccctc agcagcaagt attcattaaa aaccctctcg ccagccgtca 5100 gaaggtagca ccacgctata cccaggatac ggtcttagca gatctgccca tacacatcta 5160 tacgtctaaa aaacgtgggc ccgtagctaa agcgcgacta aaacgtgttc ctaaaggtaa 5220 cacattgtta caggactctg ctgctactga caatacatgc gacgcatcct caagagataa 5280 gacttgagac cctagctgat ggacccggac tattaccata caaactggga ccgacacgac 5340 taaccataca ctatcattcc tttatacaac ccattgacct caacgatatt gaaaataaaa 5400 tcgactctgt acagactcaa cttaatacat ttaggactaa actcgataac gaaacttacc 5460 tactctatga atatcaaatt gattatctta ctaataaggt tggcaaatta ctacaccaaa 5520 ttaaatcttt agaacctgtt agagttaaaa gaggtcttat agatggccta gggtctatag 5580 taaaaagtgt cactggcaac ttagactacc aagatgccct taaatacgac gaggctctta 5640 aaaccttaca gaccaacgaa ggcaaattaa catcagaatt taatagccat ctgagtcttt 5700 gcaaagagtg gatgtcccaa cacaataaag tgttagaaca actaacttta aatcaaataa 5760 gagttaatgc cactttagaa ctactattac aaaaagaagc ttatagggac tatagcttaa 5820 ttaaatttgc gaaattcgca caaatcttag gaattataac aaacaacgta gaagatttaa 5880 tgttagaaat aatcagatta gaaaatatga tggcttttat acgcgcatct agtactcatc 5940 attccatgat tgatatagag gccttgcagt caatgataga tagattaaaa tccctttata 6000 ctccaaatca aattctaaat ttagaactta gggaatatta cagcttaatc aagccaggat 6060 cttatttcat cgataaacgt atagtaatag tatataattt tccaattgtt tcccaagata 6120 catatgacct atacaaacta tccattgtac ccaacaaaag acaacttgcc cttattcctt 6180 cctctcctta tatagcaaca gatgagaaat cgttcgtgta catagaggct gaatgcccga 6240 agtatagcag tacttatctc tgcgaaaaga agaccggcca gcagatccag tcgaaacctg 6300 attgtattca gaaactcatc gttcatcaga gtctagagaa tacttgtcaa ttcacgaaga 6360 tatctctcat caaggaagca gtagaaaaat tagacgacca acattacgtg ctgtctctac 6420 ccgaacctac caaagttcag ttggcatgtg ggagaaagga cttcaacaca cttcaaggaa 6480 gctacctcgt aaccatccct atgggttgct atctacagac tccagaatta actataataa 6540 acgatgacaa cgcgataaag ggtcaaccat tgaaactagc gaagatacca tacgatgaaa 6600 tgaatctgac tgccgtctct acccacatca atttcagctc gatcgatctg gaagacttac 6660 acagcatcca aactaaattc atgttgggaa aacctattga catcgaggag attcaaccaa 6720 ctgccctgta ccacacaacc atcccactat acgtcatatt actgggcgca atcctatttt 6780 tcactctgag attaattcgc aaatacaaat gttggagact aaaatcagaa gataaagaaa 6840 agcagtcatc tcttgagata catacatacg aagacgtcaa gaaaaatacc agaaaacgag 6900 atgactttcc agcaacattt tctcttaata tggtcaaaaa tagttgctga tctatgggtg 6960 gagg 6964 // ID Copia-14_CQ-I repbase; DNA; INV; 4183 BP. XX AC AAWU01015579; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_CQ_; KW Copia-14_CQ-LTR; Copia-14_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4183 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 343-343 (2011). XX DR GenBank; AAWU01015579; Positions 12740 16922. XX CC Positions [1645-1935] - Integrase core CC 'GTATC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 80..1642 FT /product="Copia-14_CQ-I_3p" FT /translation="MSTGGDGADAPGRAATKQTLSNLPPMELLMGRENWPD FT WKFAVQTMLEVEGLWDAVEPAAGAVVDPVMDKLAKGRIILMVHKSNFHHLR FT GVQTAKEAWTKLENVFEDSGLTRRMALINKLTSTRLDNSESMQAFVSDILE FT AAHQLRGIGFDVTDEWVGSFLLNGLPASYRPMIMALENCGQAITGDLIKTK FT LLQEVSVPSEGAAFASNKQHKKSTKSHQSNPPSSHLSTSKGPKCHIARDCK FT MKEPKKDGSARCTVLSALNEEDDNWHFDSGASNHFSKSDAMLEDLQQYGGT FT VVAANRGAMKVVAKGRMQLTPKSCPDSPIVVNDVQVIPELSSNLLSVNQIV FT KRGCTVIFNSAGVKVLNSDGDVMATGTRYRELFKLDLRSQETKSALACSSA FT VGLEVWHQRLGHLNLPSVRKLASGLANGIKIVGADKEDCKVCPMGKHTRQP FT FNKKGSRATGMLDVVHTDICGPFEVPSMGGSKYYILFVDDRSRKMWVFFLK FT TKSEEEVLRTFKEFTGWSNASPSAS" FT CDS 1803..2954 FT /product="Copia-14_CQ-I_4p" FT /translation="MSTVRSWTGEEVLGGSRGHSCLPAQPVSHARSRIDAG FT RGLDKQEAGFVAPENLRHQSDGPNPQTEATEAGPEVAQVHLCRLRRTRERV FT SVLRPAVTESLQQPRGPLHQRRRTGEAGESTSRASGAAGHRTVRAAGSKWF FT AESLCGNRRRHRERAGESRHGPDVDNSEDEFLGFEETAGSVTDVTSVLPPQ FT SSSNPPLSPELRRSGREHKLPGKYDDFHVSLRGLPRPNPLQEVESTDTSDD FT DASEDNSSEKSDDPVPPSRDTSCVVATARSLAVDGDPLSHKDARSREDSAS FT WKKAMFDEYEALMANDTWTLTELPKGRNAIKCRWVFKTKHDASGNVDRHKA FT RLVVKGFSQRKGEDYDETYSPVVRHSSLRYLSRSPPDMTCR" FT CDS 3007..4173 FT /product="Copia-14_CQ-I_2p" FT /translation="MEQPPCFEDPTRRNLVCRLNKALYGLKQSSRVWNGKL FT DAALKRFGLHSSKYDPCLYNRIADGKILFVAIYVDDVVIVSNDEGVKNEIK FT AKLSSTFRMKDLGPVSSCLGIRVTRGQGCVALDQQAYIESMLTRFNMQNAK FT PVSTPSNPCVKLIKEMAPKTADEAEKMKRVPYPEAVGCLMYLATCTRPDIM FT HAVNQLSRFNANPGPKHWEAVKHLFRYLRGTCGLKLRYRKHGNTELVGYAD FT ASWASDLDDRKSTSGYIFMLQGGAVAWSCKRQPTVALSTCEAEYMALAATV FT QEASWWHGLLAQLGRKQPIELRCDNQSAICIAKNGGYTPRTKHIDIRHHYI FT RDALDKKIVQLTYVSTDEQTADGLTKSLERIKLERNRSAMGISGSA" XX SQ Sequence 4183 BP; 1049 A; 1112 C; 1266 G; 756 T; 0 other; ggttatgggc ctgttcgaac gcgtttgttc gagttaaaaa agtttttcgc cgcgagtttt 60 cggaagtttc gcgatcgaaa tgtccacggg cggagacgga gctgatgctc ccgggcgcgc 120 tgccacaaag caaaccctgt cgaatctgcc gccgatggag ctcctgatgg gccgtgaaaa 180 ctggccggat tggaagttcg ctgtgcagac gatgttggag gttgaaggcc tttgggacgc 240 agttgaaccg gcagcaggcg ccgtggtcga cccggtgatg gacaagttgg ccaaggggcg 300 gatcatcctc atggtccaca aatcgaactt ccaccacctc aggggggtgc aaacggcgaa 360 ggaagcttgg acgaagctcg aaaatgtctt cgaggattcc ggattgactc gacggatggc 420 gctaatcaac aaactgacgt cgacgaggct ggacaacagc gagtcgatgc aagcgtttgt 480 gtccgacatc ctggaggccg cacaccagtt gcgcggaatc ggctttgacg tgacggatga 540 gtgggtcgga tcgttcctgc tcaacggatt gccagcatcg tatcgtccaa tgattatggc 600 cttggaaaat tgcggccaag caatcacagg ggatctaatc aagacgaagc tgctgcagga 660 ggtgtctgtt ccgtcggagg gggccgcgtt cgccagcaac aagcagcaca agaagtcgac 720 gaaaagccac cagagcaatc cacccagcag tcacctgagc acatccaaag gaccgaaatg 780 tcacatcgct cgtgattgca agatgaagga gccgaagaag gacggaagtg ctaggtgcac 840 cgtgttgtcg gcactcaacg aggaggatga caactggcac tttgactccg gcgcatcgaa 900 ccatttttcg aagtcggacg cgatgctgga ggatcttcag cagtatggag gaactgtggt 960 tgcagccaac agaggggcga tgaaggtcgt ggcgaagggg cggatgcagc tcaccccgaa 1020 aagttgccca gattccccga tcgtcgtgaa tgacgtgcaa gtcatcccgg agctgtcatc 1080 gaacctgctg tcagtcaacc agatcgtgaa gcgaggctgc acggtaatct tcaactccgc 1140 tggagtgaag gtgctgaact cagacggtga cgtcatggca actggaactc gttaccggga 1200 actgttcaag ctcgatctgc gatcgcagga aacgaaaagc gcactcgctt gctcgtctgc 1260 ggttggtctc gaggtctggc accagcgcct gggacacctg aatttgccga gcgttcgaaa 1320 gctagccagt ggactcgcca acgggatcaa gatcgtcgga gccgacaagg aggactgcaa 1380 ggtgtgcccg atggggaagc acacacgaca gccgttcaac aagaagggtt cgcgcgctac 1440 cgggatgctc gatgtggttc acaccgacat ctgcggaccg ttcgaagtac catcgatggg 1500 cggaagcaaa tactacatcc tgttcgtgga cgaccgttcc cgaaagatgt gggtgttctt 1560 cctgaagacg aagtcggagg aggaagtctt gagaaccttc aaggagttca ccggatggtc 1620 gaacgccagt ccgagcgcaa gctgaaggtg ctgcgaagtg acaacgggaa ggagtacgtg 1680 aacaacggta tgagaagcta cctgaaacag cacggcatcg tgcaccagac gacgaacccc 1740 tacaccccag aacaaaacgg catgtccgag cgcggaaacc gcacaattgt tgagcgcgct 1800 cgatgtctac tgttcggagc tggactggag aagaagttct gggcggaagc cgtgggcaca 1860 gctgtttacc tgctcaaccg gtctcccacg caaggtcacg aatcgacgcc ggaagaggtt 1920 tggacaagca agaagccgga tttgtcgcac ctgagaatct tcggcaccaa agcgatggtc 1980 caaatcccca aacagaagcg acggaagctg gacccgaagt cgcacaagtg catctttgtc 2040 ggctacgacg aacacgtgaa agggtatcgg ttctacgacc cgcagtcacg gaaagtcttc 2100 agcagccgcg aggtccgctt catcaacgaa ggcgtactgg agaagcagga gaatcaacgt 2160 caagagcaag tggtgcggct ggacatcgaa ccgtacgtgc cgccggcagc aagtggttcg 2220 ccgaatcgct ttgtggaaat cgacgacgcc accgagaacg agctggagaa tcccgacacg 2280 gaccggatgt cgacaacagc gaggacgagt ttctcgggtt cgaggaaaca gctggctcag 2340 tgaccgacgt gacctccgtg ctcccaccgc aatcatcttc aaatcctccg ttgtcaccgg 2400 agttgaggcg cagcggtcgg gagcacaaac ttccaggcaa gtacgatgat ttccatgttt 2460 cgctcagagg tttgccgcgt cccaatccct tacaggaggt cgaatcaacc gataccagcg 2520 acgacgacgc cagcgaggac aactcgtcgg agaaatccga tgacccggtc ccgccgagtc 2580 gtgacacgag ctgcgtggtg gccacagcca ggagtctggc agtcgacggc gatcccttgt 2640 cccacaagga tgcgcggtct cgagaggact ctgcaagctg gaagaaggcc atgttcgacg 2700 agtacgaggc gctgatggcg aacgacacct ggacgctgac ggaactaccg aagggcagaa 2760 acgccatcaa gtgtcggtgg gtcttcaaga ccaagcacga cgcgagtggt aatgtagatc 2820 ggcacaaggc gcggttggtt gtgaaaggat tttcgcagcg aaagggggaa gactatgacg 2880 aaacctactc gccagtcgta cgtcacagtt ctctgcgata tctgtcgcgc tctccgccag 2940 atatgacttg tcgatgacag atgacgcaca accgccttct gcagagcgag ttgaagagaa 3000 attttcatgg agcagccacc ctgtttcgag gacccgacca gacgcaacct ggtctgccgc 3060 ttgaacaaag cgctctacgg attgaagcag tcaagccgag tgtggaacgg aaagttggac 3120 gccgcactga aacgttttgg cctgcattcc tcgaagtacg acccgtgcct gtacaaccgg 3180 atcgccgacg gaaagatact gttcgtcgca atctacgtcg acgacgtcgt gatcgtttcc 3240 aatgacgaag gggtgaagaa cgagatcaag gccaagctga gcagcacatt ccggatgaag 3300 gacttgggac ctgtcagcag ctgcctgggc atccgcgtca cccgcggaca aggctgtgtc 3360 gcgctagacc agcaagcata cattgaatcg atgctcaccc ggttcaacat gcagaacgcc 3420 aagcccgtgt cgacaccatc gaacccgtgc gtgaagctga tcaaggagat ggcgccgaaa 3480 acggcggacg aggcggagaa gatgaaacga gttccctacc ccgaagccgt aggatgtctg 3540 atgtacctgg ccacctgtac ccggccggac atcatgcacg cagtgaacca actgagtcgt 3600 ttcaacgcaa atcctgggcc gaagcactgg gaagcagtca aacacctgtt ccggtaccta 3660 cgaggcacgt gcgggttgaa gctgcgttac aggaagcacg gcaacacaga gctagtcggg 3720 tacgccgacg caagctgggc atcggatttg gacgaccgga agtcaacaag tggctacatc 3780 ttcatgctgc aaggaggtgc agttgcttgg agctgcaagc gacagccgac tgtcgcgctg 3840 tcaacctgtg aagccgagta tatggctctg gctgccactg ttcaggaagc ttcgtggtgg 3900 cacggattgt tggctcaact gggacgaaag caaccgatcg agctacgctg cgataaccag 3960 agcgccatct gcatcgccaa gaatggaggt tacacgccgc gtaccaagca tatcgacatt 4020 cgtcaccatt atatccgtga cgctctcgac aagaagatcg tgcagctgac ctacgtcagc 4080 acggacgaac agacggcgga cggcctgaca aagtctctgg aacggatcaa gctggagcgc 4140 aaccgatcag cgatggggat cagcggatca gcttaaggag gtg 4183 // ID BEL-171_AA-LTR repbase; DNA; INV; 667 BP. XX AC supercont1.315; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-171_AA_; KW BEL-171_AA-I; BEL-171_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-667 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.315; Positions 1016871 1017537. XX SQ Sequence 667 BP; 225 A; 123 C; 116 G; 203 T; 0 other; tgtggccgac gaacgctgtg gtctccacag tggccggttg atctataacc aacacatgga 60 tcgagtgacc ggcatgaaga gataagggaa gatgaactga atggaagaca aacgacataa 120 aaacaatcca aatttttagt atcagagagt tgaactacgt gctatatgct ttttcctaaa 180 cttaaaattt ttcgtgaatt tttcatttac cttattatct acttaatatt aaatagtgcc 240 tacagtacta gtggttaaga acacattgat ttgctacctc atataagtgc gaaggtaaca 300 atctagtaga attcttatat tgatagattt ataacctaaa gaatatgttc taggttctca 360 ataagcttag taccatcaga ttgatgtaac gattgaactc cgactagaat ttatatgatt 420 attgtaagta tttacatgaa ttcgaatttg ataccaactt acccagcgta taactttaga 480 cgggataaaa tacgtaatca gatccgttaa ggtgatacga cccgacctcc tagaagagtt 540 caccaaataa tgtaagtagg aaatagcttc ccacaacttt tatgaagcca ataaaattct 600 ctttagcttt tagcttacca actcaagatc gggtctgctt aaaagaattt ggaaccttcc 660 cccctca 667 // ID CACTA-1_Dpulex repbase; DNA; INV; 5783 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-1_Dpulex. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5783 BP; 1811 A; 1018 C; 1042 G; 1862 T; 50 other; cacaattata tggttcaata gtagtagggt tctactataa agtagatcct ctgcctaata 60 cgtagataac gctactctct aggttacaaa aaagtgctac ttaaatgagt gctaccagaa 120 aagtagcccc agtttgaacg agtgtgtgtg gtttcttact taagaggttc cataatcatg 180 caaatacata aatttggata cactctcgtt tttaatagga tattctaatc ttatacgttg 240 gttatccgag attcgaatac ctctcagcta gaataggtct ggtagatggt cttacctatt 300 ctaaatcttc gactatctat aagacaatcc ttattcgaaa tagaataccc ttcgtacctc 360 cgggtattcg aagatcgagt ttgaataaaa cgccccctcc gtattcgaat tcgaatacca 420 ggtattcgaa tttttgcatc agcgtcagta aacgttgcca tccaagagat atttattaat 480 aaaaatcatt ttaaaataca gaatactgga tagaaattcc acaaataatt cttattagac 540 accgggattt gatttaatat aacagattta gcaatattga aacaataaat attttttgtc 600 aaatctaatt tgaaatatgg caaacatcca gcatgcaaaa aaagttaatg acggtcgtca 660 gtgcagacta ttttgacagt tatatgattg agggctttaa tggaatctga aacttgtatc 720 tcaactttat ctgcagaaga cctgctgttg catctcctgt cgactaaaaa actgccggct 780 ggaattccag aaaaattgct aggtaatatc tatttagtag gaatctattt taagttgaac 840 attttctgtg ctgtgtgttt gttggttggt gagacccttt tccaaaatgt ccattgattc 900 ttttaaatgc tttttatgtt cagttatagt taaaggcaat tattctcaac tgacaagtca 960 cttgaatctt tttcatggct tccttaaaac aacaggtcat agtaacagaa ctttgtattg 1020 tgggaacgct gactgcaagg ctggcttcaa tagttttgct aaatatagag accacattat 1080 caattgttta gttaaagatc cacaaccaga tgttagacaa gtacttcctg atgtttcttt 1140 tcaaccagga gtaactgatt tcgatgttag tgatccttta ctaagtcata atgattcata 1200 catttcagat ccagaacagg acagttttca ccaagcgaat gcaacagcag ataaaacaac 1260 tcagagatta gctagattct ttctggaatt gagggccaaa cacaatgtta gtcattctgc 1320 cattaatttc attgctaaaa atattgatac tatctttaaa gatgttgctc agttttgtga 1380 tttagaaaca aatttgacat taaccaacat tgaaaaagcc actagcaaat tgaattctcg 1440 ccagaaaagg attagttatt ttaccagtaa tatgggattt gtagaaccag tcgcgaagtg 1500 cgtaggccat aacaatcaac cagaactgac tgtaaatagt agaggggaaa ctcagcctag 1560 aaagaatacc ttccagtaca tctctattag gaaaacgttg agtagattat tttctaatgc 1620 tgaattttac aataaatatt ttagtgaatc tcctagtact gatggttttc ttcgtgggca 1680 tcgtgacagt tctcatttca ggagtcataa attgttttcc aaagtaaaaa atgcccttcg 1740 tttacaaata ttttttgatg aatgtgaatc aactaatcct ttgggctcca aaactaagaa 1800 acatgaactt ggaatgttca attttaagat tttaaacctt cctgagtgcg aaaactccct 1860 attgagtaat atacagtgct ttgctgtttg taatagtaag gatttaaaag actctgaatt 1920 ccggtttgtt attgatgaat tcatgaaaga aattaaatta ctagagtctg acgagggaat 1980 gcttcttgat attccaagca aacctggttt caggctccga ggtgctgttg taaatgtttg 2040 tgcagattca aaaggtgccc atcaactctt tgggtttgct gggacttctt cagccaaatt 2100 ttgtcgtctt tgtttaattt ctcgttcaga tttgattaaa agttgtaaag ttggaatttt 2160 aagaaccaag gaaaattaca atgcagccat agaggaagca aagatcagca aagacaacat 2220 tccgttgagt ggagtacaat tcgacagccc tctaaatcag tgtaggtatt tgcatgttgc 2280 agaacatacc gtccttgatt gcatgcacga ctttctggaa ggtatagttc cgtttataat 2340 taagcttgtc cttttcgttt acgtaaccaa ctctcattat aaaataacag cagacctgct 2400 aaatagtaga ctgcagaggt tcgcgttcag tttttatgat caaagtaaca aaccttcccc 2460 aaaatttaag accaatctgt tgcgaaaaaa aggtaactat aacacgaagc aacgggcatc 2520 acagaattgg tgtttaattc gaatgcttcc tcttttaatt ggtgatttga tcccggaagg 2580 agacgaatat ttttccttga ttctcatttt actagaaata atggatattg tttttgcacc 2640 tagtgtttca attgagcaga ctattattct agaagatctt attaaccgaa tgtattcaag 2700 gttttattca ttatttccat tgactagacc tattaacaaa ttccaccata tggcacacta 2760 tccagcagct atccggactt atggtccagc tgttggatat tggtgcatga gatatgaggc 2820 ttaccataat agttgtaaac gtgttgcaca tataaactgt aattttatga acattccaaa 2880 atcagttgct tatcatctgc aaacgatttc ctgtcacaat ctattaacta atgatttatt 2940 tcacgatgat attcccacgg tcggtcccag gtcaaaaaaa aacttagtta acaggcaaga 3000 agactcaatc actgatggaa atagttcttt aaatttggat tcaaacgcaa ctgttataag 3060 atgggtaaag tttaaaggct ggcattaccg accaggaacc gtaatattac taaaacacag 3120 ctttgaaaat ctttcggggt atccggaatt cggtaaaata ataaaaattt tgtcaagagg 3180 ccaggcaatt tattttgttg ttacagtttt agagaccatt actttcaacc atcactttca 3240 tgcctttgaa ttacaatctt gttttcctgc taaatctatt ctgattgcac atgataattt 3300 gagaccaaat gttgctccaa catggttatt aaaaagtttt aatgtttccg aaaatgtctc 3360 ttacgtaact gttcgttgtt ttgtctaaaa ttaacatatt tctcattcct tttaaatagc 3420 tgctgatatt gatggacagt gccttctctg tttagatgaa gcaaatattg aagtacttca 3480 actaacactg ggacacaaga tgaaactgct tagtttcata aaaacaatca aaagtcagca 3540 gccccaagag caagaaaagg aatctactgg aattgtggct tgtgatccaa cgatagaata 3600 cgaagcaatc gtcaacgaga atcagatact tgatatcatt gtgacggacc aaccttcaaa 3660 taacaatgta tttcgtcact acatatttac ttcttgcttt ttctaactgt tggaattttt 3720 actttacttg taggtgattg atttcaacgt catcgacatt ttaaatgcga ccgacgaggg 3780 gaaagtaatt gtgcaaaact acctggaacg acttgaggct ttcattactg aaaaagagcg 3840 catcacaatt gttcagatct tagtgaatcg cttagtgaca ttgactggaa aactcgactt 3900 atttcccccg acggagaaaa gaaaaattct tggtcaagcc atagtggctg cgtttccttg 3960 cctgggaatc aacgtggatg gcaaagtgtt ttcttgtcat ttctataatc aaacaactgg 4020 atctggtttt attgagacaa ggctgaaaag actgagggaa gttcatcgtg aagaaccacg 4080 ccggaagagg ccaacacagg actcttctcc aatccaaaaa aaaccgcgtg tcacgaaacg 4140 gcctcatgta aaaaaaaatg ccgactacat cttcaatgaa gccgaatgcc agtttaaggt 4200 ttatttttta aataacattc aatccacaat aaaagcttta actttgttta tatttacagg 4260 ttgaatggtt gaagagtcat ccgcctggag gagataatga cacagtcatc attgagtaca 4320 gcaaggcgac ttttgggtat cgccaaaaag aaatcaaaca aactctgact gaacctggtc 4380 tcataatgaa agaatatccc cgcttcaagg acttccaaaa tggatctctg gtaatgtgct 4440 acagcaatac gttcgagatg tttcagctaa cttattttta ttcctaaagt ttcaggtaga 4500 atttcagtta ctctatcccg atgccaagaa cttcgaaaaa gtctttaaag aaaaattctt 4560 atttaaaata ttggctctgg ctaatagaaa aaacatggac attcctgact gccctgacgg 4620 taagttgatt atagtctact tttcgagtta atttgtcatt tttcacgaat taccttttaa 4680 cagaatgtct taaggccgtt ttcgtgttac taaagctgac tccaagcatc gttaaactcc 4740 gaaaagttaa ttttcgtttg attacggaac gactgatagt ctttttaaag gcaattttct 4800 tttcagttta aaagtttact ctttttcgtc tttatttata actatatcaa ttttttattg 4860 tgttgtgttt acaggataat gctgacttga ttcaattaag cgaaacgcga gaccctcatt 4920 tgaagcagcc attcatctgc tgcatgggaa cgatggaaac tcctacaaac ttctggattg 4980 ttatcgatcg tgatataata ctgtgtggta acgactttgc aactgctttc gttaacttgt 5040 tttgctcgtt ttatgcatnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5100 nnnnnnnnag ttgaggtgtt ccaaaaagtg caaatactgt tgtcttattt tgttctcctg 5160 ttctttataa taacaaactg ggttttttat gataaaaaat ttgttgtatt tgtacaaatg 5220 ttgtcttgca ccgttgtagt atgctccttc ccatggccta atctcagatg aatacagact 5280 tctgaaagac atttatattt tgtaatttat tcatagtcgt gaagaagtat atttatagat 5340 tggtgggtat gcatgaagaa tcgggggaag ggtaaattca ggatgctagt aggaatcgaa 5400 tatccctagt ataggagcta ttcgctggat atccgaactc ttagaatagg catggaatag 5460 ggggtattcg agggatattc gaacccctat tctagaacag gtatgctact aatagaatac 5520 ccccctatac tattagtagc aaaaaccaaa acgctactag ctcgaatacc ctcgaatacc 5580 cccaaaaaaa cgagagtgta cttaaatttc acccatactt ataagagatt cccttttaaa 5640 aacttgtgtg atgcaagttt gatgtaagta gttatgaagg ctcaatatag ccccttgggc 5700 tccttcggga tttagaatta gcatagaaaa gtaaattcct atgtattttt aattggggat 5760 tgcatatagt ttgctagttt tct 5783 // ID PIF_Harbinger-1_Ngru repbase; DNA; INV; 2143 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-1_Ngru. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-2143 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2143 BP; 599 A; 404 C; 394 G; 746 T; 0 other; agagcaggtg caaactcgcg tgtaactcgt attttttatt attttcattc gttattgtca 60 ctctccgatt catcttcttc cttgtcatcc ttcgtttgga aattgtcaac caattcgaca 120 gtgacgttac ctgcttccgt tgctggttgt ggagatgaat attcgatggg ttgtgtgttt 180 ggttgagtgt gttctgttgc atttgaactg tcaatcttct tcttctttga ataaaccatc 240 actgccctcg caggtttaac aatgtcaact ttcttgtcac atacaatcac aaaaatcttg 300 tttccatcag aatccatttc cgtttcaatc ggcgccaaag tttgcatatc tccgtctttt 360 ggaatgtatt gtttcaaagt gtacagtgag gtcgtaccac tttgcttttc gtgtatgaca 420 ccaacgtcaa actcaatttc atctaatacc ttttgcatcg tttttgaatc taatatcgtt 480 tcaatggaaa catcaatcaa caattgcctt tcttttctac taacaccttg aaaaacaaat 540 ttcaagtgtg aagggtaacg aatcgcaaaa atatgcttgg ttgcagtttc gaaatgatga 600 aacgctaaca agtcggtttc atatttgact gttttagtga tgtgtgattt tgatggagtg 660 cgaaataatg attcttctga tgagtcttca gatgatgatg aatcatctgt gttcgtttcg 720 tattcacttt cgtcttcctc ttcgtcatca tcaacaactt taggtgcttc taactttcgt 780 ttctttgctt ccttgacacg taccaattca tcagcatatt ttgcggattg ttcttcaaag 840 tatgcctttt tgttctccat ttcaattgat gacttgtctt taattgtgct tttgtatttt 900 cgttgaactt gagaccaact gtaaccgttt tcgttcgcga acgcaacaaa agcgtctttg 960 tttgtttgat aaactttacc tctatttgcg agccagaaga ggcgtaatct ttctgtatca 1020 acttcttgcc actttttgga cttctttttc ttcagtgact gggctgtatc aacagtacta 1080 ctattggcaa cagagctaga agtcggggct tgtacttttt ctgtgttctt cctcttcttg 1140 gtgcccattt ttttttcaat ttcgaatatc gtttttcaca aaaacattcg tgttcacttt 1200 tgaattatga aaaaatgcaa aacttgtttt ctgtgtttcg atcatacatg ccaaaagcaa 1260 ctgaagaaac attcaaagcg ttttttgcat tcaatcctaa ggaattacaa attgtttatg 1320 acacgtttgt tggtagagta atagcaccaa agtggtttct aatcacctta gaatggttaa 1380 cgaattatcc taaacatcat gtacaacatg tttattggaa tatttctaga agaacacttg 1440 ttggaagatt atggaagacg gttttgtttc tctctcgtct tctccctgat tttgattttg 1500 atgatagact agaagaagaa ccagcaaaac gaattttcaa agatatcaag tgtatcgttg 1560 acggtgttga aattagaata cgtcgcccat ctaaaagacg attcactaaa aaacaaaatt 1620 ggatccacca gaaaacattt ttctcaaaca agaagaaaat gcatgcgatg aaatatcaag 1680 tggttgtcac tctccgtaaa gggaagatta taaatgtcag tcgtgcgtat agaggacgtg 1740 ttcatgataa aaagatattt gatgattaca taagggaaca cggtcacgaa tttggtgaac 1800 aagaacatct tggtgacctt ggttatgttg gagtaccaga gatatttaca cctttcaaga 1860 aaccgcgggg tggtgaatta gaacagtatc agaaaacaat taacaaagtc attggtgctg 1920 cgcgtgtaac tgttgaaaat gtgattggtc gtatcaaacg atttgacata cttaaccaca 1980 catttaggca cccacttgtc aaacacgaag atgtgttcat catttgtgcg aaactggcga 2040 acttacatct tcgctttcat cccgttcgaa agtgtgacga accgctcttg gacgtttaat 2100 aaaaataaaa aataacgttt atttaaaagg ttgcacccgc tct 2143 // ID Gypsy-21_AA-I repbase; DNA; INV; 4284 BP. XX AC supercont1.124; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_AA_; KW Gypsy-21_AA-LTR; Gypsy-21_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4284 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.124; Positions 1276959 1281242. XX CC Positions [3251-3538] - Integrase core CC 'ACCAG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 14..3538 FT /product="Gypsy-21_AA-I_1p" FT /translation="MESTNMPANQEPGIQDVILQMAQLLQQMAVQQQLPQH FT QSQEQILESLSSNITELSFDEENGVTFGRWFDRYQDLFENDARQLDDAAKV FT RLLLRKLDTASHSRYLNYILPKLPKEVKFEDTVKILKKIFGNQISTFRKRF FT KCLQLVKSDCEDIISYGGHVNRACEDFQFQNLTLDHFKCLMFVSGLKASKY FT ADVRARLLSILENEKPEAPATLQSLIDEYQRLVNLKEDTTIIEQQGSSKSM FT VHAVRDRKSATHQKVSVKYESKKPKSPCWQCGQMHFVKECGYSTHICKECN FT HVGHKEGYCSSSMKSAKPSNSAQTDNSKKRKKGKSSAETKGVFIVNHIEHR FT EKERKFVTMMINDVPISLQLDSASDISIVSEAVWEELGKPKMKPSSSQASN FT ASGEPLLLLGEFDCNVTLKGTTKRGRCFVTSVRNLNLMGIEWIDLFNLWSI FT PFDTICNQVVVPSRQEIDREIQQLKMNHKDVFSESLGFCNRTKVQLHLKPG FT SKPVFCPKRPVPFHSVPMIDQLNRLQSLGIITPVDFSEWAAPIDAVKKPDG FT RVRICADYSTGLNNALEANNYPLPTPEEIFAQLAGSQYYSVIDLSDAYLQC FT KVTEDSTQLLTINTHRGLFRFNRLAPGVKSAPGAFQRLIESLVADIPGVRL FT FIDDIIIAGKDWKSHAESLNLVLQRLKDWGFHLKMQKCKFFQTEVRYLGHI FT TDRSGIRPDPEKVKTISRIPPPQNISELRSFLGAVNYYGKFVPNIHDLRHP FT LDQLLRKDVKWNWSADCQRSFEKFKHVLKSNLLLTHYNPNLPIVVAADASK FT TGIGAVIFHKFPNGSLKAIQHASRSLTTAEQAYGQPEKEALALVYGVTKFH FT KFLLGRRFTLQTDHKPLLSVFGSKKGIPVHTANRLQRWALIMLNYDFDIEY FT VSTNEFGCADMLSRLIDRSMRPEEDYVIASVSLEEDMVSILNTTVEAVPVS FT ALQDATKKCKILQQVKRYILDGWPAVSKQVAQEVQPYYNRRESLSIISECV FT LFKERVIVPETFRRRLLNQFHRGHPGVVRMKSIVRNYVYWPGIDGEIEDFV FT QRCVHCASAAKAPVKTKPEPWPAPEKPWSRIHIDYAGPVDGNYFLVVVDPF FT TRWPEVVLTKSITAKTTIKLLKQMFSTFGIPETVVSDNGTQFTGSEFKNFC FT KSFGIRHIRTAPYHPQ" XX SQ Sequence 4284 BP; 1255 A; 1062 C; 968 G; 999 T; 0 other; tttggcgacg aggatggaat ccacgaacat gccagctaat caagaacctg gaattcaaga 60 cgttatcctt caaatggccc agcttcttca gcaaatggcg gtccagcagc aacttccgca 120 gcatcaaagt caagagcaga ttttggaatc cctgtcatca aacattaccg aattatcttt 180 cgatgaagaa aacggagtca cgtttggccg ttggtttgat cgttatcaag acctcttcga 240 gaatgatgcc cgccagctgg acgatgccgc aaaagttcgt cttcttctcc ggaagctaga 300 caccgcttcg cacagccgct atctcaatta cattttgccg aagttgccaa aagaggtcaa 360 gtttgaagat accgtgaaaa tcctcaagaa aatttttggt aatcaaatat ccacattccg 420 gaagcgattc aaatgtctcc agttggtgaa gagcgattgt gaagacatta tcagctatgg 480 aggacacgtg aaccgagcct gtgaagattt tcaattccag aatctcactc tcgatcattt 540 caagtgcctg atgtttgtca gcggattgaa ggcctcgaag tacgccgatg tcagagcaag 600 attactttca attctggaaa acgaaaagcc tgaagctcca gccactctcc agtcgttaat 660 tgacgagtat caacgccttg tcaacctgaa ggaagacacc acgatcatcg aacagcaggg 720 cagttcgaaa tcaatggttc acgccgtcag ggacaggaag agcgcaacgc atcagaaagt 780 ttccgtgaag tatgaaagta agaagccgaa gtctccttgt tggcaatgtg gccaaatgca 840 cttcgtgaag gaatgcggct acagcaccca catctgcaag gaatgtaatc acgtcggtca 900 taaggaagga tattgttcct cctccatgaa gtctgcaaag ccatccaatt cagctcaaac 960 agacaacagc aagaagcgga agaagggaaa atccagcgct gaaactaaag gggtattcat 1020 cgtcaaccac attgagcatc gtgaaaagga acggaagttc gtcactatga tgatcaacga 1080 cgttccgatc tctcttcaat tggattcagc cagcgatatt tccatcgttt cggaagcagt 1140 ttgggaagaa ctcggcaagc cgaagatgaa accatcatca agtcaagcct ccaacgcttc 1200 gggcgaacca cttctgctgc tcggtgagtt tgactgcaat gtaacgctaa agggcacaac 1260 caagcgaggc cgatgtttcg tgacatccgt acgaaatctc aatctgatgg gcatcgaatg 1320 gattgaccta ttcaacctct ggtcgattcc gtttgataca atctgcaatc aagtcgttgt 1380 accttcaagg caagaaatcg accgagaaat tcaacaactg aagatgaacc ataaggacgt 1440 tttcagtgaa tcgcttgggt tttgcaaccg cacgaaggtc cagctgcatc tgaagccggg 1500 aagcaagcca gttttctgcc caaaacgacc tgttccattt cattccgttc cgatgatcga 1560 tcaactcaat cgacttcaaa gcctaggaat cattacacct gttgattttt cggaatgggc 1620 agcaccgatc gatgctgtca agaaaccgga tggcagagtt cggatttgcg cggattactc 1680 gaccggactt aacaacgcac tggaagccaa caactatccg ctgcccacac ctgaagaaat 1740 ttttgcacag ctggcaggaa gtcaatacta cagtgtgatc gacctatccg atgcatatct 1800 gcagtgcaag gtgactgaag actccacgca acttctaaca atcaacacgc atagaggttt 1860 gtttcgcttc aaccgcctag ctccaggtgt caagtcagcg ccgggagcgt tccaacggct 1920 gattgaaagt cttgttgcgg acattcctgg agtccgtcta ttcatcgatg acatcatcat 1980 tgctggtaaa gattggaaat cacacgctga atcgttgaac ctagtgctac aaaggctaaa 2040 ggactgggga tttcacctga agatgcaaaa atgcaagttt tttcaaactg aagtccgcta 2100 cctaggtcac attaccgacc gcagtggtat ccgtccagac ccggagaaag tgaagaccat 2160 ttctagaatt cctccaccgc agaacatctc tgaattgcga tcgtttcttg gagccgtgaa 2220 ctactatggt aagtttgttc caaacattca cgacctgcgt caccctttgg atcaactact 2280 gcggaaggac gtcaagtgga attggagtgc cgattgtcag cgatctttcg aaaaattcaa 2340 gcatgtgctg aaatccaacc ttctcttgac ccactacaat ccaaatttac caattgttgt 2400 tgctgcagac gcatcgaaga caggaattgg agccgtaatc ttccacaagt tcccgaacgg 2460 aagcttgaaa gccattcaac acgcttcacg ctctctcaca actgcagagc aagcatatgg 2520 acaaccggaa aaagaagctt tagcccttgt gtatggtgta accaagtttc acaagtttct 2580 actgggaaga agattcacgc tccagacgga tcacaagcca cttttgtcgg ttttcggatc 2640 gaagaaaggc ataccagtgc acactgccaa ccggttgcag cgctgggctc tcatcatgct 2700 gaattatgat ttcgacattg aatatgtttc gaccaacgaa tttggatgtg ctgacatgct 2760 ttcccgcctc atcgaccgaa gcatgcgtcc agaagaagac tacgtcatcg cttcagtttc 2820 gctagaagaa gatatggtaa gcattctgaa cacgacggta gaagcagttc ccgtttcagc 2880 actccaagat gccacaaaaa agtgcaagat tctccagcaa gtaaaacggt acattctgga 2940 cggatggcca gctgtttcca agcaagtagc ccaagaagtt caaccgtact acaaccgacg 3000 tgaatctctc agcataatca gtgaatgcgt actgttcaag gaaagagtga tcgttccgga 3060 gacttttcga cgacgcctgc tcaatcaatt ccaccgtggt catcctggcg tggtaagaat 3120 gaaatcaatc gttcgcaact acgtgtactg gcccggaata gatggtgaaa ttgaagattt 3180 tgtccaacgt tgcgttcatt gtgcttctgc tgccaaggct cctgtcaaga cgaagccaga 3240 gccatggcct gctcctgaaa aaccgtggtc ccgcatccat atagactacg ctggaccggt 3300 agatggcaac tactttctgg tggtcgttga tcctttcact agatggcctg aagtagtgct 3360 gaccaaatca ataacggcaa aaacgacgat taagctcctc aagcagatgt tctcaacgtt 3420 tggaattccc gagaccgtgg taagtgacaa cggaacacaa ttcaccggtt cggagttcaa 3480 gaatttttgc aaatcttttg gaattcgtca catccgtacg gctccgtacc acccccagtg 3540 aaacgggttg gcggaacgtt ttgtggacac cgtgaagaga agcgtgaaga aaattcgttc 3600 gggaggagag tccctagaag attcactgca cacttttctc accgtatacc gctccacttc 3660 agcacaagat ctcggaggac agtcaccagc gaagaagaag ctaggacggc caatgcggac 3720 ggtggcagct ctattgaaac ctccagatag tcatatacgt cagaaccccg aagaaacaga 3780 acgccagaat aatgctaccg aacgagtcca taggaacttc gaccgaggtg atacggtgta 3840 tgcccaagtc caccaagcca attcatggtc ctggatgccc gctgtaatca tcgaacggat 3900 tggaaaggtc aactacaacg tattactgaa ccaaggcaag cgattaatcc gttctcacat 3960 caatcaactg aaaaaccgag ttgaagcaga aataccccgc tgtcaagaat caccactttc 4020 cattttcttt gacgaattcg acctgccaca ggaacctgct gaattacccc caatcaacat 4080 tcaagaacta aatgctgttc atgatcaagt tccactcgaa gaagtaccag ctttactaat 4140 acctgcagca gaagaacctg aacaagaaca acacgaagaa cctgagccaa ctccaacaga 4200 cgagccggaa gttcccgatc gacctagacg attacttagg cttccgtcaa aacttcaagg 4260 atattggcta ttctaagggg gaga 4284 // ID Crack-31_BF repbase; DNA; INV; 2531 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-31_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-31_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2531 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2531 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 836-836 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 3..2342 FT /product="Crack-31_BF_2p" FT /translation="KSSCKKIDTICNTFQAKQLIDEPTRVTENSSTCIDLI FT IATAPEKVIECGVESTGLSDHSFTYAVRKAKQPRGTPRTARVRSYRQFNEA FT AFQEELYGAPWAEVEKCSNVNDALCCFHTILTNICDKYAPWITVRIRGHEP FT PWMTQEYLALARDRDFHFSKAKKTKLPHVWETAKRLRNKCNNMAQYLKKQY FT YHNEIKAKGKDSKSLWTTLKTILPGQTRHTTTENVSKQNEENTANRFNAYF FT TSIGAKLAAAFTSVYTPIFGPPRFSFKFTNISTEFTYKQLIGLPLGKSTGL FT DGVSSRLIRHAAPAIAGPLTYIYNLSLSTGEVPREWKRAKVTPLFKGGDKD FT DCCDYRPISVLPSFMKTFEKAVHAQIYTFLNGHNILNKFQSGFRPAHSTTT FT TLIHVTDTILDNMEKGLFTGAVFLDLKKAFDTVCHTILLDKLRMSGFQDTE FT LAWFRSYLDDRKQTTVVNGVASDFMGISVGVPQGSVLGPLLFILYINDMPD FT ILQHGKVCLYADDTALFYAANSISDLNEALNEDLRNIDVWLHSNKLTLNVK FT KCKAMLFGTRNKLRLGTDKLTVTLSGTCLEVVSSFKYLGVWFDSCLTWQDH FT IDKLCKGVSARLGVLRRLKPVLPKRTLEMLFNCMVLPKLDYCDVVWGNCGK FT CLSDKLQKLHNRAARIVLGLSYVSHIGSDELSSLHWKTLETRRNEHLLHTV FT YKSVNHLLPDYLNVFHMVSETHSYPTRHSLSQSVQLPQIHLECGRRKFSYR FT GVCQWNHLPMTLKTAPNIATFKHMMKLG*" XX SQ Sequence 2531 BP; 772 A; 551 C; 533 G; 675 T; 0 other; caaagagttc ctgtaagaaa atagacacca tctgtaatac cttccaagca aaacaactca 60 tagatgagcc aacccgtgtt acggaaaact cttctacctg tatagacctc attatagcaa 120 cagctcccga gaaggtaatt gaatgtggtg tggagtccac cggtcttagc gatcacagct 180 ttacctatgc agtgcggaag gccaagcaac cccggggaac tccaagaacg gcgagagtca 240 ggtcctacag acaattcaac gaagctgcat ttcaggaaga gctgtatggt gctccatggg 300 ccgaagttga aaaatgtagc aatgtcaatg atgcactgtg ttgttttcac acaatattga 360 ctaatatatg tgacaagtat gctccttgga taactgttcg aatccggggt catgagccgc 420 cctggatgac tcaagaatac ttagccttgg ctcgtgatag agattttcat ttcagtaagg 480 cgaagaaaac aaaactccct catgtttggg agacagccaa gagactaagg aataaatgta 540 ataacatggc acaatatctg aagaagcaat attaccacaa tgaaatcaaa gccaaaggta 600 aagacagtaa aagtctctgg acgacgctaa aaaccatctt gcccggccaa accagacaca 660 ctaccaccga aaatgtctct aaacagaatg aagaaaacac tgcaaataga tttaatgcct 720 actttacatc tataggtgct aaactggccg cagctttcac tagtgtctat acaccaatct 780 ttggaccgcc aaggttctcc ttcaagttca caaacatatc tacagagttt acttacaaac 840 aactgatagg tcttccccta gggaaaagca ctgggcttga tggggttagc agtcgactca 900 ttcgacatgc tgcacctgct attgctggtc ccttaaccta catctacaat ctgtctctct 960 ctactggcga agtacccagg gaatggaaaa gggctaaagt gactccattg tttaaaggtg 1020 gtgacaaaga tgattgctgt gactaccgac ccatttctgt actcccatca ttcatgaaga 1080 cattcgaaaa agctgtccat gcccagatat acacctttct gaatggacat aacatactta 1140 acaaatttca gtcaggtttc agaccagctc attctactac aacaacattg atccatgtca 1200 ctgacactat tctcgacaac atggagaagg gactttttac cggtgccgtc tttcttgatt 1260 taaagaaagc tttcgacacc gtatgtcaca ctattcttct cgacaagtta agaatgagtg 1320 gtttccaaga tactgaactg gcatggttcc ggtcctattt agatgataga aagcaaacca 1380 ctgttgtaaa tggggtggca agtgatttca tgggaatttc tgttggggtt ccccaaggtt 1440 ccgtactagg gcctttgttg ttcatcctat acattaatga catgcccgac atattgcaac 1500 atgggaaggt atgtctatat gcagacgaca ccgcactgtt ctacgcggcg aattcaatca 1560 gtgacctaaa cgaagcactg aatgaagacc tccgtaacat agacgtatgg ctacactcaa 1620 acaaactgac tttaaatgtg aagaagtgta aggctatgtt atttggcact aggaacaagc 1680 tgcgtctagg tactgataaa ctgacagtaa ctctttctgg cacatgtttg gaggttgttt 1740 ccagctttaa atatctcggt gtctggtttg actcgtgcct aacgtggcaa gatcatatag 1800 ataaattgtg caagggtgta tcggccagac taggtgtgtt acggcgtctt aagccagttt 1860 tgccaaaacg tacccttgag atgttattca attgtatggt ccttcccaaa ttggactact 1920 gcgatgttgt gtggggtaac tgtggaaaat gcctttcaga taagctacaa aagctgcaca 1980 accgggctgc tagaattgta cttggtcttt catatgtgtc tcacatcggt agtgatgaac 2040 tttctagtct acactggaaa actctggaaa ctagaagaaa tgaacacctg ttgcatacag 2100 tttacaagtc tgttaaccac ctactgccag attatctcaa tgtattccat atggtatcag 2160 agactcacag ttaccctact cgccatagtt taagccaatc agtacagctc ccacaaattc 2220 acttggaatg tggaagacgt aagtttagct acaggggcgt ttgtcagtgg aaccaccttc 2280 caatgacttt aaaaacggct cccaacattg caacattcaa gcatatgatg aaactgggct 2340 gacctctgac ctcatgaccg gactgaagta tgattatgga aaacggtctt tgtatatatt 2400 atttgtatgt tatgtatggt gctgatgtgt catgtattat gtgttatcca gggccccact 2460 gtaaagcagt gcgatgcact gagctgggct accctggcta aagaaaacag aataaaaaaa 2520 aaaaaaaaaa a 2531 // ID CR1_Ele19 repbase; DNA; INV; 4077 BP. XX AC . XX DT 26-OCT-2010 (Rel. 15.1, Created) DT 26-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele19. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4077 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4077 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (26-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 6 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 216..1001 FT /product="CR1_Ele19_1p" FT /translation="MAISVCCSCAGELVKAEEIVCNGFCRSSFHLKCVHQS FT VATRDAVINCSQLFWMCCACTKMMANANFRHAISSTNNAVEAISAEHSKAL FT IELRQEMEQNTAKINSILSQIPSALQNRTGRRGSTSSNTNRKRPRIDEEDT FT QPESNVTVGTKEIDPQVTIPLAESKTDDNLFWLYLSGFDPKASEDDIRGLV FT QQNLNTTDTIDVRKLVPKGKKLEELTFVSFKVGVGVQLKDLALLTSTWQKG FT ITFREFDFHPRPTFQFHRTQQ" FT CDS 919..4002 FT /product="CR1_Ele19_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="PLPGRKESLSANSISTLDRLFSSTERSSNETEIIXPL FT SSNPARLCYPPVHINAPTVCTSFAPLTTRDNIVKTVDTHNPLKSSMTVYYQ FT NVRGLRTKTNELFAALSCCDYDIVVLTETWLNGNVLNSELTNEYAIYRCDR FT NPSTSQHLRGGGVLIGVKNTIPSNVLSVSGAERLEQVVVQATVGCVSLVVC FT AIYLPPNTELDLYEQHAACINELHNRLSDGNRVVVLGDYNLPLLRWCFDDD FT IGCFIPMNASSEQELSLVENVVASGLQQVNHLINENGRLLDLAFVNSGNTC FT EIVEPPLPLMNVDRHHQPFVMVIETLLSELDDDCSNLAYDFNQCNYDELNF FT AISQVNWVEILNLDALDDAVDRFYGELNSIFRRYVPLRRQTQLPRNKQPWW FT NGELRNLRNRLRKARKRYFRSRSDDLKALVRDLECKFNSLNASRFRSYISR FT LESKMKDDPKHFWTFLRNRTSNCGAPRIVNYQNETSTTWIESANLFSSFFR FT SVLSNNSPPLSETYLNSLQTFDFNLPATTLTEQDVYSKLRDVDRSKGPGSD FT GLPPSFIKACSISLALPASLLFNRSLAESTFPSRWKDALITPIHKAGNLHD FT VRNYRGISILSCLAKVFESLVLDLVYPAVQNVIMNEQHGFVKKRSTTTNLM FT NYVSTLIENLEKRQQIDAVYIDFSKAFDQVPHQLAVEKLKRIGFPDWLTKW FT ISSYLSGRFASVKLGAVKSDPFQVTSGVPQGSHLGPLLFVLFVNDLCSELK FT SSKTMYADDLKIFRVISSLVDCCALQEDIEHVLRWCERNGMEVNVQKCNII FT TFARIRSPIVFSYAVKDCSLNRVSLVKDLGILLDSKLRFTEHISSVVAKSF FT AVLGIIKRNAREFKDVYCLKTLYISLVRSILEYGVVVWTPFHAVHIERIER FT VQKAFIRFALRRLPWRNRPHLPPYEHRCALINLPTLKTRRTYLQRLFIFDL FT LENNIDCPELLTRLRFNVPARTTRRTEFFRRSTHRTQYGQNSPLYVCCQSF FT NDVNQYYEFRISKLAFKSRIRF" XX SQ Sequence 4077 BP; 1163 A; 918 C; 866 G; 1127 T; 3 other; ggtaaaccta tatgttgatt tttcgcaaaa tcgttttcta atttcaacat ttttgtgttt 60 catatacctg ttttgatact cgtgacgtgc tcatcgtgct gaccaaccac tttcccgtcc 120 ggaacaactg gagtacaatc gtttcaagcg caataacatc caaaaaatcg tcaatttcat 180 ccgtactaca acttcactac ccacaatctc aaacaatggc tatttctgtt tgttgttcat 240 gcgctggtga gctggtgaag gcagaagaaa tcgtttgtaa tggtttctgc cgatcgtcct 300 ttcatctcaa gtgtgtgcat caatctgtag ccacccgcga tgctgttatc aactgctccc 360 agctcttctg gatgtgctgt gcatgcacca aaatgatggc aaatgctaat tttcggcatg 420 caatatcgtc caccaacaac gctgtggagg ccatcagcgc tgagcacagc aaggcgttga 480 tcgagcttag acaggagatg gagcagaaca cagcgaaaat aaacagtatt ctttcacaaa 540 ttccttccgc actacaaaac cgaactggac gaagaggaag tacttccagc aacacgaatc 600 gtaaacgacc acggattgat gaagaagata ctcaaccgga gagcaatgta accgtaggta 660 cgaaagaaat cgatccacag gtgacgatac cattagcaga aagtaagacg gacgataact 720 tgttctggtt gtatctgtct ggtttcgatc cgaaagcatc agaggacgat atacgaggct 780 tggtacaaca aaatctgaac accaccgata ctattgacgt gcgaaaattg gttcctaaag 840 gcaaaaaact tgaagaactc acattcgtgt cgttcaaagt tggcgttggt gtgcaactaa 900 aggacttggc gctgttaacc tctacctggc agaaaggaat cactttccgc gaattcgatt 960 tccaccctcg accgactttt cagttccacc gaacgcagca gtaacgaaac tgaaatcatc 1020 wcacctctct cgtcgaaccc agcacgcctc tgttacccac ccgtacatat caacgcacca 1080 actgtgtgca catctttcgc accgctaacc actagggata atattgtgaa aacagtggac 1140 acacacaacc cactgaaatc ctcgatgaca gtgtattacc aaaacgttag gggactacga 1200 actaaaacga acgaactgtt tgctgctctt tcttgctgtg attatgacat cgttgttcta 1260 acagagactt ggctcaacgg caatgtgtta aattcggaac tgacgaatga gtatgcgata 1320 tatcgatgcg atcgaaatcc ttctactagc caacatctcc gtggtggtgg cgttttgatt 1380 ggtgtgaaaa acacaatccc gagcaacgtt ctatctgtca gcggagcaga gcgattagaa 1440 caagtggtcg ttcaggcaac cgttggatgc gtctctttgg tggtatgtgc tatttatctc 1500 ccaccaaaca cagagctcga tctctatgaa caacatgctg cttgtatcaa tgagcttcac 1560 aatcggttga gcgacggtaa cagggttgta gttctgggag actacaactt gcctcttctt 1620 cgctggtgct ttgatgacga catcggatgc tttatcccaa tgaatgcttc atctgaacaa 1680 gaattatcgc tggtcgaaaa tgttgtcgca tctggtttgc aacaagtaaa ccacctaatt 1740 aacgaaaacg gtcggttgtt ggatttggca ttcgtgaata gtggaaatac ttgcgaaatt 1800 gttgaaccgc cattgccttt aatgaacgtt gaccgtcatc atcagccttt cgtgatggtc 1860 attgaaactc tccttagcga actcgatgac gactgctcga atttagctta tgatttcaat 1920 caatgcaatt acgacgaatt gaattttgca atttcwcaag taaattgggt ggaaatttta 1980 aatctggacg ctttggacga cgctgttgat agattctatg gcgaattgaa ttccattttc 2040 cgacgatacg tacctttgag aagacaaacg cagctgccca ggaacaaaca accttggtgg 2100 aatggggagc tgagaaattt gcgtaacaga ctacggaaag ctcgcaagmg gtattttcga 2160 tcgagaagtg atgatttgaa ggctctggtg cgcgacttag aatgcaaatt taattctctg 2220 aacgcatcac ggtttcggtc atacatttcc cgactcgaaa gtaagatgaa agacgacccg 2280 aagcatttct ggacattcct gcgaaaccgt acatcgaact gtggagctcc tcggatcgtc 2340 aactatcaga acgaaacatc gacaacgtgg atcgagtcag caaacctgtt ctcgtcgttc 2400 tttcgaagcg tgttaagtaa caactcacca cctttgtcgg aaacgtacct gaatagtttg 2460 cagactttcg attttaactt acctgctact accctcacgg aacaggacgt ctattcaaaa 2520 ttacgtgatg tggatagatc aaaggggcca ggatcagatg ggctgccacc gtcattcatc 2580 aaagcgtgct caatctctct cgctcttcct gcatcgttgt tattcaatcg ttcacttgcc 2640 gaaagtactt ttccatcaag gtggaaagat gcactgatca cgcccattca taaagctggg 2700 aacttacacg acgttcgaaa ttacagagga atttccattt taagctgcct agcaaaagtt 2760 ttcgaaagct tagtgttgga tctcgtttac ccagcggtgc aaaacgtcat tatgaatgaa 2820 caacacggat ttgttaaaaa acgttcaaca actacgaatc tcatgaacta tgtgtcaacg 2880 ctaatcgaaa atttagagaa acgacaacaa atcgatgcgg tgtatattga tttttcgaag 2940 gctttcgacc aagttccaca tcagcttgct gttgaaaaac tcaaaaggat tggatttcct 3000 gactggttga caaaatggat ctcttcttac ctcagcggcc gttttgcatc cgtgaagctt 3060 ggggcggtta agtcggatcc tttccaggtt acttccggcg tcccccaagg gagccatctt 3120 ggaccgcttc tcttcgtgct ttttgtgaac gacctctgca gtgaattgaa gtcttccaaa 3180 acaatgtacg ctgacgattt gaaaattttc cgcgttatat catctctagt ggactgttgt 3240 gctttacaag aggacatcga gcacgttttg agatggtgtg aaaggaacgg aatggaggtg 3300 aatgtacaga aatgtaacat aatcaccttc gcgcgcattc gctctccgat agttttcagc 3360 tacgcggtga aggattgctc tttgaacaga gtctcactcg taaaggatct tggtattctg 3420 ctggacagta aactacgttt caccgagcac atttcatcag tcgttgccaa atcgttcgct 3480 gtattgggca tcataaaacg caatgctcgt gaatttaaag acgtttactg cctcaaaaca 3540 ctgtacatat cactagtgcg cagtattctc gagtatggtg tagttgtatg gactcctttt 3600 catgcagtgc atatagagcg aatcgaacga gtgcagaagg cgtttatcag atttgcgctt 3660 cgacgacttc catggaggaa tcggcctcat ctgcctccct acgaacaccg gtgtgcactg 3720 atcaacctgc ccacattgaa aacccgtaga acttatttac aacgactttt catctttgat 3780 ctcctcgaaa acaacatcga ctgtcctgaa ttgttgacgc gacttcgttt taatgtccca 3840 gcaagaacta ctcgcagaac ggaattcttt cgacgctcca cgcatagaac gcagtacggt 3900 cagaatagtc ccctttatgt ttgctgtcaa tcttttaacg atgtgaatca atattatgaa 3960 tttagaataa gtaaactagc tttcaaatca agaattagat tttaagaatt agtctgagcg 4020 attttttttt tttaatcgaa ggctgtaaat aaataaataa ataaataaat aaataaa 4077 // ID BEL-628_AA-LTR repbase; DNA; INV; 667 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-628_AA_; KW Pao_Bel_Ele79; BEL-628_AA-I; BEL-628_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-667 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 667 BP; 226 A; 140 C; 115 G; 183 T; 3 other; tgtcgcaccc taacgggacg cgatgcaccg acagtcaatg cagtcaatca acagtaagcg 60 ccgaccctgg tcaggcctat cgtgagtaga caaaacaaag tggaaagttg atcagcagat 120 aacgtgtagg tgctttattt atctttctac ttaaatccta atttgaattg tttattctac 180 cttaatctag tatttacggc taggccgagt gttcaatttg ttatcctact taaactctac 240 gtaaactaaa acacacagca caggtaaaat aagaactaaa aaccatgcaa atcttaaccc 300 taattatgca catattcaca gcaattgcga cataaatcat taaactaata gtacggatgt 360 tgtagctagt acggacgaaa gtaccaacag aagaacacta aacgtgagta actaattatt 420 atacggatta tgcatgcaac gacacaaaaa cgaattccta cgtatcccaa acacaacata 480 ttagctatca agcactgaac ccacagtggg ctgaattmgc gtaattaatg ccgaaattcg 540 aaattaatts tgaaatattt tttggttktc tctattccgc aaagggattt tataacgctt 600 cgtgaataaa ttgttacgaa atcggaaacc gttctgtctg tcaatccgtc cgtctgcttg 660 cgcaaca 667 // ID Gypsy-57_CQ-LTR repbase; DNA; INV; 1136 BP. XX AC AAWU01017621; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-57_CQ_; KW Gypsy-57_CQ-I; Gypsy-57_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1136 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 494-494 (2011). XX DR GenBank; AAWU01017621; Positions 10331 9196. XX SQ Sequence 1136 BP; 327 A; 256 C; 259 G; 294 T; 0 other; tgcagcgacc tcaaatttaa atcattaaat ttgtccattt ttcgaacctc accaccgttc 60 tgagcatgcg cgcttacgca agcataaata attttacccg tcgactcgcg ttgcgcgaca 120 aataattaag cttatgtaca ggtgtggatc atgagtcatc ctcacctgaa aagcccttta 180 ttaaatttcg ccaattatta ggcaatcaaa aaaatggtta agctttcaat tatgaatgtg 240 cgatgttatt ccacagggga aattgaaggt gtggcttaac caaattcatt ggcatgtttg 300 ttcaatggag aattcgacct tctcattatt gaccaacatt tttgagaata attgaggaac 360 attattttga tattcctata acataattta aagtttcacg acaatggttc gaacccatga 420 cttccgattt tgaggtgtac tcactacacc atacggaaag tcatgaagaa ttgcagaggt 480 ctgcacaatt taattcagga ggttcatcga tgcttctcat cgaaacggac cacttttagt 540 agaacaaaat ttagtagagt tttcggggac tgactataaa aaccccaaaa ttcttgagga 600 ggccagttag ttccagaatt agttccagtc agttccagcc agctcaactc cagtccagtt 660 ccagttccag aagacgcccc tgaccagcca gacccgccag cagccaccag tagcagaggc 720 cccagtccag ccggacttag cggtcggtgg aggccgcagt ccgccgggac ttagccgctg 780 tgaagagtcc gccgggactt agccgccgtg aagagtccgc cgggacttag ccgctgtgaa 840 gagtccgccg ggacttagcc gccgtgaaga gtccacccgg gacttaggag ttcggggtct 900 ggcatggctc ggcgaagagg tctggaggac aagtctggct tcaacataga gaactggctc 960 gacgaagatc ttaatgcaaa gtgatgtgat cgtgtgcgta aaagttgaaa gtgttaccgc 1020 aaaaatgtaa tctttaaaaa gtgtaatata cagataagtg aagtttactg atctgttttg 1080 actctacgag taaatatcac aaaaatcctg aaagcctaaa ccgctcgggc cgttca 1136 // ID Gypsy-247_AA-LTR repbase; DNA; INV; 551 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-247_AA_; KW Gypsy-247_AA-I; Gypsy-247_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-551 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1096-1096 (2011). XX DR [1] (Consensus) XX SQ Sequence 551 BP; 172 A; 150 C; 101 G; 122 T; 6 other; tgtagcatac cagttggatc cataacccaa aatcaaaacc catgttgtaa aataacacac 60 accaaccatc ctagcaccaa agcaatgacc aggcaattat gcaatcacaa acggcacata 120 tggaagtcat tcaccaccga ggtgcatcgc tctatcgtct ccccccacct ccctcggtga 180 ggtccggagc cggtkgcgkg cagckcgkag gagtcaaata acaagccaag ccaaagtgat 240 cgcgacaaat aaccgatatt gcttagcacg caaagatgag catttttgtc acattgtcaa 300 tccagttgtc tcacctackc acgaattact ctttccatca tgctcactkt agcaaactgt 360 tccggacctc ggaaatgtat aaaaaccttt gaactgcaat aaacgtaacg attctgatac 420 attagctcga cccaattgga cgtgtccaaa tttggccatc gaccactaat tgaagaaccc 480 ttaagcgacc caactgacag acgtcaggat ggctatcgtt tagcaagtaa agtcaagttc 540 gtcgcgattc a 551 // ID BEL-224_AA-LTR repbase; DNA; INV; 396 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-224_AA_; KW BEL-224_AA-I; BEL-224_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-396 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 906-906 (2011). XX DR [2] (Consensus) XX SQ Sequence 396 BP; 120 A; 73 C; 68 G; 133 T; 2 other; tgtttgatac ggtccacatt agctaataat tttgtaatta gattactttc aatttgaatt 60 taaaatatca tgaattagaa tccatctcca cgtatagaaa atatcgcata gatgaatctg 120 cataaaacgc atttgtacaa gctwttgtta cctttttgtg cgttgcatcg attgcacgtc 180 agaccccttt ttgttcattc ggtttctatt agackatcag taagagttca gatcgccacc 240 aacagcgtgc gaaataaagt gtaccaagaa aagtattgaa tatatttgcc catttcttta 300 ataaatcagt tgtgtcgtta gttgagtaaa cgcgtcgtgt tttggagtga ttatttcccg 360 aaaaataagc ccgtgattcg accccctgat acaaca 396 // ID Tx1-1_AAe repbase; DNA; INV; 4984 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Aedes aegypti. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4984 RA Kojima K.K. and Jurka J.; RT "Tx1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1457-1457 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >95% CC identity. It is positioned at the deepest branch of the Tx1 CC clade, and does not show sequence specificity. XX FH Key Location/Qualifiers FT CDS 121..1185 FT /product="Tx1-1_AAe_1p" FT /translation="MEGLVKNTLMFRFPDGAAGPSVVEMARFAKAFDADRF FT TMESVYKISEERCICIKFMNERTMKDALMQNHEEHVFEYSNGDKVQIKMSV FT AGGCSKYIRIFDLPPEVPDQEISTVLSRYGVVRRMVREKFPAQLELDLTTG FT VRGVYIEIKKEIPATLFFLNRRGRIYYEGVKHKCFLCKEEGHLKADCPRNA FT ANQRKSQTADTSGETVGASAKVAPTTSGRSSYAETLKNKVSVAEVSGSTMT FT VLVPAAKVTHGNVKEGSSSVVGVIGDDSRPPAAEIVGNDGDSQESESDSAA FT VLDAEMDESGHAVKRQHDSSSTEDEGFNRVTRSRKQKKDEDALEIIAATSA FT QLKNTGKSKLKN" FT CDS 1597..4851 FT /product="Tx1-1_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNFIRKVVSININAIKVNVKKSLLKEFILDNDIDVVF FT LQEVAFDDFSFLPSHEAFVNRNEASSGTAILVRKSIDTRDCLKCPSGRIIS FT IVIDNVNFVNIYGHAGSQQRAERDELFEISAVDHMSKTAANSTVIGGDFNC FT ILEEDDTRGNFKNICRGLKTLTNLFDLKDVELKLKCQKKQFTFFRNDSASR FT IDRFYSTKCFLNEIIKFETKAVPFSDHHAIQFCYKFDPNHRIPVYGRGYWK FT IKGFLLQDDDLTAEFAKTYESLRSRAKYLNNFAEWWCFDVKNKIKQFYNSK FT FFEYKQRDRAKKEVWYTKLKELEERQLNNENVSNEMFEVKSRIVSAENQRL FT QLLCSGFQPNTMLEAEKMGIYQVSKYVKMHSNHNHVQLKNGDGLLTDQIQT FT KEIIYDHYKNLFDEINTNQSALPMLNSIRKVLSNDSKQALTSPITENELLD FT TIKNSAMKKSPGPDGITYEFYLVRYSILKEDLLKLFNGIYSGAISPIENFS FT SGIVTLIPKTGNKFDLNNYRPISLLNSDYKLLCKIIANRLKIYVEELIEEG FT QTAGVKNKSCTNNLDVIRTLTVKAQQSKKFKFALLSIDLEKAFDTVCHKRL FT WETLEKFQLPDQLITCIRQLYGKASSRILVNGFLTPSFKIRRSVRQGCPLS FT MILFCLYIEALIRHLYVNVSGILISGRFVKVLGFADDLTLIIRSDNEFDTV FT MQIIGNFSIFAGIKLNVKKSGFLRYNNCKLGPQLIEEKKELKILGLLISTN FT YKVMVENNFKKIIQNINVTTQAHASRRLSLLEKVIVLNSYILSKLWYASQT FT LPPSNVHIAQIRKTMGYFLWGYHRIFKIERTQLYLHNLKGGLNLLDPDSQC FT KSLFIRNILFGDSGTTEHYLISTSNLKGLTLNTVRYISKAKEIKEIPYIVT FT NKQIYNYLIDKLSIRVKVEEKYPQIMWSTIWENINHNFLTLSTRSTLYEVF FT NDIVPNKIKMYNYLANVDNFGCDVCGKPDSNMHRIKDCPKTLEIWQWISKI FT VKNRXKIQIESPHMLLYKGIGKRNSKYKAALWLVCEAIVYNTNNHRNPSLF FT QFQKGIRDVRWNNRKLFSKHFDKYLNIC" XX SQ Sequence 4984 BP; 1688 A; 788 C; 1057 G; 1449 T; 2 other; cagttgacga tcagctctcg ttcggtattg acgtgctttg ttagatcgaa aacaataatt 60 caaaagctgg ctcacagcta aagtgctttg tgtgaaataa agtggaacag tgcgagaaaa 120 atggaaggct tggtgaagaa cacgctcatg ttcaggtttc cggatggtgc tgcagggcca 180 agtgtagtgg agatggcaag atttgctaag gctttcgatg cggacaggtt cactatggaa 240 agtgtctaca agatctcgga agaacgatgc atatgcatca agttcatgaa tgagcgaacg 300 atgaaggatg ctttgatgca gaatcacgaa gaacacgtgt tcgagtattc gaatggggac 360 aaagtgcaaa tcaagatgtc tgtcgctggt ggatgcagta agtacatccg gattttcgat 420 ttgccaccgg aggtgcccga tcaggaaatt tctaccgttc tctcgagata tggagttgta 480 cgccggatgg tccgggagaa atttcccgcg caactggagt tggacctgac aacaggagtt 540 cgtggagtct acatcgaaat caaaaaggag atcccagcta ctctgttctt tctcaatcgt 600 cgaggaagaa tttattacga aggagtcaag cacaagtgtt tcctgtgcaa agaggaagga 660 catcttaagg cggattgccc gcgaaatgcg gcaaaccaga gaaagagcca aactgccgat 720 acmagtggcg aaacggtggg tgcctcggct aaagttgctc ctactacttc cgggcgttcg 780 agttatgctg aaacgttgaa gaacaaagtg tcagtcgctg aagtgagtgg aagtacgatg 840 acggtgctag taccggctgc aaaagtaaca catggcaacg ttaaggaggg cagttcttcg 900 gttgtcggtg tgattggaga tgactcgcga ccacctgctg ccgaaatcgt tggtaacgat 960 ggagatagtc aggagtctga atcggattct gcggctgtcc tggatgcaga aatggatgaa 1020 tccggccacg cggtgaagag acaacacgac agttcctcga cggaggatga gggctttaat 1080 cgggttacga gaagccgaaa acaaaagaaa gatgaggatg ctctggagat cattgcagcg 1140 acgtctgctc agttgaaaaa cactggtaaa agcaagttga aaaactaagg tcagttgaag 1200 gttaaagccc tgtttattca tgacgacaat ggctatgttt taaggttatt gttttggaaa 1260 cgatatggta cagcgtcagg atcgatttgg ccatagttga tgacaatagc aatgtttcga 1320 ggttattgat ttggaaagtg atttatagta tatgcaagtt cttgatctta ttatgatagt 1380 ggctatgatt tgaggttatg agattggact attctatttt tggttatact acttgaacat 1440 ttatgatgaa atatgctacg tctttaaggt ttgatttggg tatgtgaatt aactctaact 1500 ggcacacttg aacattggcg atgacaacag ctacgaataa cggttatgat tacggttttt 1560 ctgcgtcttt ttcggtagtg cagttgtatg tgagagatga actttatccg gaaagttgta 1620 tccattaaca ttaacgcaat caaagtgaac gttaaaaagt ctttgttaaa agaattcatc 1680 ctggacaatg atatagatgt ggtgtttcta caggaagtag cgtttgatga tttttcattc 1740 ctgccatctc acgaagcctt tgttaacaga aatgaagcta gttctggtac agctatactg 1800 gtgcgtaagt cgattgatac acgtgattgt ttgaagtgcc cttctggaag aataatatcg 1860 attgtgatag ataatgttaa ctttgtaaac atctacggtc atgctggatc gcaacagaga 1920 gcagaacgtg atgaactgtt tgaaatttca gctgttgatc acatgagcaa aactgcagca 1980 aactcaacgg tgataggggg tgattttaat tgcattttgg aagaagatga tacgcgtggt 2040 aatttcaaaa atatctgtcg tggtttaaaa acattgacaa acctgtttga cttaaaagat 2100 gttgaactca agcttaagtg tcagaagaaa cagttcacct tttttagaaa tgactccgct 2160 tcaaggattg atcgctttta ttcgacgaaa tgttttttaa acgaaattat taaatttgaa 2220 acaaaagccg taccgttctc cgaccatcat gcaatccagt tttgttataa gtttgatcca 2280 aatcacagaa tccctgtgta cggtagagga tattggaaaa ttaaaggttt tttattgcaa 2340 gatgatgatt taacagccga attcgcgaaa acttatgaaa gtttgcgctc tagggcgaaa 2400 tacttaaata attttgcaga gtggtggtgc tttgatgtta aaaacaaaat taagcagttt 2460 tacaattcaa agttttttga gtacaagcaa cgcgaccgtg ctaaaaaaga ggtttggtac 2520 acaaaattga aagagctaga ggaaagacaa ctgaataacg aaaatgtaag taacgagatg 2580 ttcgaagtta aatcgcgtat tgtaagtgca gaaaatcagc ggcttcaatt actttgttcg 2640 ggatttcaac ctaatacgat gctagaagca gaaaaaatgg gtatctatca agtttcaaag 2700 tatgtgaaaa tgcacagtaa tcacaatcat gtgcaattaa aaaatggtga tggtctttta 2760 accgatcaga tccagacaaa ggaaattatt tacgatcatt acaagaatct gtttgacgaa 2820 atcaatacca accagagtgc tcttccgatg ttaaattcta ttaggaaagt actaagtaac 2880 gactctaaac aagctttaac ttcgcccata actgaaaatg aactacttga cacgatcaaa 2940 aacagtgcaa tgaaaaaatc tcctgggcca gacggcataa cgtatgaatt ttacttggta 3000 cgttactcta tattaaaaga agatttgctc aaacttttca atggaattta ttctggcgcc 3060 atcagtccga ttgaaaactt ttccagtgga atagtaacac taataccgaa aactggaaat 3120 aagttcgacc tgaataatta tagaccaatc agtctattga attccgatta taaacttttg 3180 tgcaaaatca ttgctaaccg tctcaaaatt tatgtagaag aattgattga agaaggtcaa 3240 acagcgggtg ttaaaaataa aagttgcacg aataatcttg acgtgatccg cacacttact 3300 gtaaaagcgc agcagtccaa aaagtttaaa ttcgcgttac taagtattga tctcgaaaaa 3360 gcttttgaca cggtttgtca taaacgatta tgggagacct tggaaaagtt tcagctacca 3420 gaccaattga ttacctgcat aaggcagtta tacgggaaag cgtcttctag aattctcgtt 3480 aatggatttt taaccccatc atttaaaatc aggcgatcag tgagacaagg atgtccacta 3540 tctatgatac ttttttgctt atacattgaa gcactgataa gacatttgta tgtaaatgtt 3600 tcaggaattc tgatttctgg acgttttgtt aaagtcctgg gatttgctga cgatcttact 3660 cttataataa gatctgacaa tgaatttgat acggttatgc agataatagg aaatttttca 3720 atatttgcag gcattaagct taatgttaaa aaatctggat tcttaaggta taataattgc 3780 aagctaggac ctcagcttat tgaagaaaaa aaggaattaa aaattcttgg tttattaata 3840 agcaccaatt acaaagtaat ggtagaaaac aatttcaaaa aaattataca gaatataaat 3900 gtaaccactc aagcacatgc gtcaagacgt ttaagtttat tagaaaaagt aatagttttg 3960 aattcttaca ttctttccaa gttgtggtat gcttcacaaa ctttacctcc aagtaatgtg 4020 cacattgcac aaattaggaa aaccatggga tattttctgt ggggctacca cagaattttc 4080 aaaattgaac gaacacagtt gtatttgcat aacctcaaag gaggtttaaa tttacttgat 4140 ccagattcac agtgtaaatc actgttcatt aggaacatac tattcggaga cagtggaaca 4200 actgagcact atttgatttc aaccagtaat ctaaaagggt taactctcaa tactgttaga 4260 tatatatcaa aagctaaaga gataaaagaa attccttaca tagtaacaaa taaacaaata 4320 tacaattact taattgacaa attgagtatt agggtaaaag tggaagaaaa gtatccacaa 4380 attatgtgga gcactatttg ggaaaatata aatcataact ttctaacttt gtctacaaga 4440 tctacacttt acgaggtatt taatgatata gttccaaaca aaattaaaat gtataattat 4500 ttagcaaacg tagataattt tggatgcgat gtttgtggta aaccggatag taacatgcac 4560 agaatcaaag actgtccaaa aacgctagaa atatggcaat ggatttcgaa aatagtgaaa 4620 aatagattwa aaatacaaat tgaatcgcct catatgttgc tttacaaagg aataggaaaa 4680 cggaatagca aatacaaggc agctttatgg ttagtttgtg aggctattgt gtataatact 4740 aataatcaca gaaatccaag tttgtttcaa tttcaaaaag gaataagaga cgttagatgg 4800 aataatagaa agttatttag taaacacttc gacaaatatc ttaacatctg ttgaaatcga 4860 tacagaaatt tgtttaagat ttagcattaa gaatagcttt aaaacatctt tgtatataat 4920 tcaatacttt gtaatgtttt ttatgaatgc ttgttaaaag gtaaataaac agttaaaaaa 4980 gaaa 4984 // ID RTE-6_BF repbase; DNA; INV; 3465 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-6_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-6_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3465 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3465 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1704-1704 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 307..3447 FT /product="RTE-6_BF_1p" FT /translation="PLTEKENSGKPNLERRLAPPSFCRSPVKGQGRQKTER FT RERQPDMSSAKHSIGLAQEGNNRGTTAQPAIKLPTEGTVIGTWNVRTLHAC FT GRLEELTHELDRYTWDILGLAEVRWTGFGETTTEEGHKIWYSGEEKQHRHG FT VAFIVRNEISNCVISCTPVSSRVISIRISAKPHNITIIQVYAPTTDHEDEQ FT VEQFYEELENTIEKAPKKDLLVVQGDWNAKVGTDAYQHWAGTVGQYGVGET FT NDRGLRLLEFAQQQRLTLTNTLFPHKLSRRTTWHAPNGQVHNQIDYILTPQ FT RFKSSINRARTRTYPGADIGSDHDLVLTTLKLRLKCQRCPRNPRIRFNLEK FT LKDPEVVEMFQAQIGGKFAALNVLDTDVDELAGNIKEVLMSTAEETLGRHR FT KKNKPWVSDDILDLCDRRRELRQQKYTSQENAEQYRQAHKTVRRKMREAKE FT KWIEDQCTEIQHGMASGNSQKAYETLKTLTRTQQPRATVIEDSSGNLLTES FT TAVLNRWTEYCDKLYNYELQPDTAKLNNPPWAERVPEDLPILRDEVADAVR FT NMKPGKSPGIDNIPAELLKHGGEVVVDKYSELCQKIWQEKKWPKEWTQSLV FT IPLPKKGNTRQCQNYRTISLISHPSKILLKVILKRLKSKAEELLAEEQAGF FT RPGRSTTEQIFNCRLIIEKHLQHQRDLFHNFIDFRKAFDRVWHDGLWWVLR FT GFGVEEGLIQSIEALYQSASSAVFLNNKVGEFFRTTVGVRQGCLLSPVLFN FT LYLEKIMQDALYDHHTSISIGGRRICNLRFADDIDLLAGSNTELQDLTDKL FT TESTGSFGMEVSTEKSKVMVNSRDETHAVILMNGELLEEVGTFTYLGGTIT FT KDGTSETDVRRRFAMATSAMARLSRIWKSNNISFSVKFKLYKTLVVSILLY FT GCETWTLLAATERKIRAFEMRCLRKLLHISYYEHVTNEYVREQVRSLVGPQ FT ETLLATVKRRKLQWFGHVTRHNNLAKTILQGTVEGGRRRGRQRKAWQDNIA FT DWTGLSLQERITLARDRRTWRQLSIFMSTLSPQRPGGQGTR" XX SQ Sequence 3465 BP; 1045 A; 813 C; 907 G; 700 T; 0 other; ggtcagaggt caccagacaa gatggctgcc ttctgaggtg tgtccagctc tggggtttcc 60 atcgatagga atcctcagtg gggacagccc aagaaggcct acaactttgt cagtagataa 120 cttagacagg gtttaaagtc actttatgtt catttttgag ttttttgctg ttagtcagac 180 atttgttacc ggacagtgat acacccccct cccaccctac cgcatgtggg agttggctac 240 ttctgctttt acttttgagt ttcaaacgtt cttttcattt gtactggatt accaacgcga 300 ccctaaccct tgacggagaa ggaaaactct ggaaaaccca acttggagag gagattggcg 360 ccaccaagtt tttgtagaag cccagtgaaa ggacaaggta gacagaagac agagagaaga 420 gaaaggcaac cggacatgtc atcagccaag cacagcatcg gtttagccca ggaaggaaat 480 aacagaggta caactgcaca acctgccatc aaacttccaa cagaaggtac agttatcgga 540 acttggaatg ttcgtactct acatgcctgt ggtagactgg aagaacttac acatgagcta 600 gatcgttaca catgggatat tcttggcctt gctgaagtta ggtggacagg ctttggtgaa 660 actacaacag aagaaggaca taagatctgg tacagtgggg aggaaaaaca gcaccggcat 720 ggagttgcat ttattgtcag gaatgagatt agcaattgtg ttatcagctg tacgccagtc 780 tccagtcgag ttatctcaat ccggatttct gccaagccac acaacatcac cattatccag 840 gtttatgcac ctaccacaga ccatgaagat gaacaagtag agcagttcta tgaggaacta 900 gagaacacca tagagaaggc accaaagaaa gacttacttg tagttcaggg agactggaat 960 gccaaggtag gcacagatgc atatcaacac tgggcaggga cggtaggaca atacggtgtc 1020 ggggagacaa atgacagggg actgagacta ctggagttcg cacagcagca gcgactgacc 1080 ctcaccaaca ccttatttcc ccacaagctg tctagacgca ctacgtggca cgctcccaac 1140 ggccaagtac acaaccagat agactacatc ttaactccac aacgcttcaa gtccagcatc 1200 aacagagcac gcacaaggac ctatcctggc gcggacatag gaagcgacca cgatctggtg 1260 ctaactacac tcaaactgag actgaagtgc cagcgctgtc ctaggaaccc ccgtatcagg 1320 tttaaccttg agaagctgaa ggacccagag gtggtagaga tgttccaggc acagataggt 1380 ggcaagttcg cagcactcaa cgtccttgac actgacgtgg acgagcttgc agggaacatc 1440 aaggaggtgc tgatgtccac agcagaagag acactgggaa gacacaggaa gaagaacaag 1500 ccttgggtgt ctgatgacat tcttgatttg tgcgacagaa gaagggagct cagacagcag 1560 aagtacacca gtcaggaaaa tgcagaacag taccgacaag cacacaagac agtcaggagg 1620 aagatgagag aagccaagga gaagtggata gaagaccagt gtacagaaat acagcatggg 1680 atggcttcag ggaacagcca gaaggcttat gagacactta aaactctcac aaggacacaa 1740 caacccaggg ccacggtgat cgaggacagt agtggaaacc ttctgacaga aagcacagcc 1800 gtcctcaacc ggtggacaga gtactgtgat aagctctaca actacgaact tcaacctgac 1860 acagcaaagc tgaacaaccc cccttgggcg gagagagtac cagaagacct cccaatccta 1920 agggatgagg tagctgacgc agtgcgcaac atgaagccag gcaaatcccc aggaattgac 1980 aacatccctg cagagctgct gaagcatggt ggtgaagtgg tggtagacaa gtactctgag 2040 ctatgtcaga agatttggca ggagaagaag tggcccaagg aatggacaca gtctctggtt 2100 attcccctgc ccaagaaggg gaacacacgc caatgtcaaa actacagaac catcagtctg 2160 atcagccacc cgagcaagat acttctcaaa gtaatcttga aaaggctcaa gtctaaggca 2220 gaggaactac tggcagaaga acaagctggc ttcaggccag gtcgaagcac cactgagcaa 2280 attttcaact gccggctcat aattgagaaa cacctacagc accaacggga cttgtttcac 2340 aactttattg actttagaaa ggccttcgac agagtgtggc acgatggctt gtggtgggtc 2400 ttgaggggat ttggtgttga agaaggctta atacagagca tcgaagcctt gtaccagagt 2460 gcatctagtg cagtgttcct caacaacaaa gtgggggaat tcttcaggac aacagtcggg 2520 gttcgccagg gatgcttgct gtctccagtt ttgttcaacc tgtatctgga gaagatcatg 2580 caggacgctt tgtacgatca tcacacctcc atctccattg gtggcagacg catctgcaac 2640 cttcgctttg cagacgacat tgaccttcta gccggcagca acactgaact ccaagacctc 2700 acagacaagc taacagagag tactggatcg ttcggcatgg aggtcagcac cgagaaaagc 2760 aaggtgatgg tgaacagtag ggatgagaca catgcagtca tccttatgaa cggggaactg 2820 ctggaagagg tgggaacatt tacttaccta ggagggacca ttacaaaaga cggtactagc 2880 gagacagatg tacggaggag gtttgccatg gcaacatctg caatggctcg tctgtccagg 2940 atctggaagt ccaacaacat cagcttttct gtcaagttca agctctacaa aaccctagtt 3000 gtgtccatac tgttgtatgg gtgtgagacc tggactctat tggcagcaac agagaggaag 3060 atcagagcct ttgagatgag gtgcctgcgt aagctactac acatctcgta ctatgaacac 3120 gtcacgaacg agtatgtacg ggaacaggta cgcagcctcg ttggcccgca ggaaacactt 3180 ctagcaacag ttaaaagacg taaactgcaa tggttcgggc acgtcacccg acacaacaac 3240 ctggccaaga cgatcctaca aggaacggtg gaaggtggac gtcgccgagg acggcagcga 3300 aaagcttggc aggacaacat cgctgactgg acgggcctca gcctacagga gaggataacc 3360 ctggctaggg atagaagaac ctggaggcaa ctctcgatct tcatgtctac cttgtccccc 3420 caacgacctg gaggtcaagg gactaggtag gtaggtaggt aggta 3465 // ID Ronin1_Cis_LTR repbase; DNA; INV; 267 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Ronin1_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-267 RA Smit A.F.; RT "Ronin1_Cis_LTR - Gypsy LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000110 Some copies < 1% diverged. XX SQ Sequence 267 BP; 50 A; 54 C; 49 G; 113 T; 1 other; tgaagtgata ttacgtcacg tccctctcct tagttctcct gtgttttagt aataatcgtt 60 tcgccttcat tcttagttaa tcaacgcgcg ttttcgttgc tatttaggtc cttgtgtttt 120 gcccttgctt tgccagttac cgttatgttg tgctktgttt gttttaaact aggttttcca 180 ttattgcaga aatagcctgc ttattaacgt tgaataaagc taagttaatc gacttctgtt 240 gtcgttttcc gttagcgtct cagttca 267 // ID Gypsy13-I_Dya repbase; DNA; INV; 5737 BP. XX AC chr2h; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13_Dya; KW Gypsy13-LTR_Dya; Gypsy13-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5737 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1089-1089 (2009). XX DR Genome; chr2h; Positions 790072 795808. XX CC Positions [4843-5319] - Integrase core CC 'GAGA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 873..2414 FT /product="Gypsy13-I_Dya_3p" FT /translation="MPLEHSFESTQPQHQADAAANCCIICEEFMLNTQPCV FT ITHCNHVFHKSCLESHIQVHKECPCCLKTVQPKDWKVVVGNHSPKARSTVT FT YPPKRPVVISSRPCTRSLAHQLESLTQPAPTEAEVLIRVHDQSNSPITENQ FT HPPGSTPTNQGVRPRAYQAANAPGRRGRPPRQRRQNNPPVANSVPRAEIES FT IVRETVTRVLSSISGSGSLAPPPDQIAGHGSPGTQRSYRSAHGIHPLGPHK FT IADVLQKWGVRFDGTVEGLSVEEFLYRVKILTAEHFDGDLEIISRNIHNLL FT IGKASSWFWRYHKRVQTFTWREFCVAFREQYRESRTATELREQIRSRKQKP FT GESFDTFYESVCKLMDKLSVSFAEDELVEMIQGNLLIDVKERLLFENISSI FT SELRRLVHKRENFLKETGRVRPPQTKPGPYRVYAMDDGQLQETQECDEENE FT IAAIQRTTPLLCWNCEAPGHIWENCLAERRIFCYGCGAPQIYKPNCQNCKR FT KTENRQGGTSNPGRVSQK" FT CDS 2597..3748 FT /product="Gypsy13-I_Dya_2p" FT /translation="MRNYYRSLKKKRTRIVSTVFQDNIDRRSYASVKFMDF FT SELGLLDTGANISCIGSSLALVDFSKYTNFKPILSKVRTADAKSHKVLESL FT SVKMKYQDKEKDLELFIIPSIAQRLILGIDFWRAFELAPGIISSILNEDIE FT VEEGEEKYPLTENERQQLEIVKELFPNFEKQGLGRTAIIKHSIDIGESKPV FT KQRYYPVSPAVEKLMYAEIDRMLSLGVIEPSQSAWSSPMRLVVKPNKVRLC FT LDSRKLNEATKKDAYPLQSIEGIFARLPKANIISKLDLKDAYWQIELTEEA FT KPLTAFTVPGRALYQFTVMPFGLCNAPSTMSRLMDDLIPADLRHCVFGYLD FT DLCIVSEDFASHSTVLVRIADQFKKANLTLNIKKVTSALLR" FT CDS 3817..5685 FT /product="Gypsy13-I_Dya_1p" FT /translation="MNWPKPKNMKQVRGFLGVCGWYRRFIKDFADITCPIT FT EVLKTKKCFKWSPEAQDAMQKLKQILTSAPVLQNADFSRKIFVHCDASDYG FT IGAVLVQVSDTGEERPLAFMSKKLSKSQRNYSVTERECLAVILAIEKFRCY FT LELQPFEVVTDHSSLLWLMRQQNVSGRLARWIFRLQAFKFTISHRKGKDHI FT VPDALSRICDPEFVILEWIGPEIDLESSSFKDSDYLQLKKTIEENSEKFPD FT LRTADQFVYIRTEHPSGNTIQDSDCWKLWVPEALRIAAIRQAHDCIVSAHG FT GMQKTIERLRRHLFWPGLSKQVREYVRACDVCKETKAPNTTLRPPMGQQSV FT SDRPFQKLYIDILGPYPRSKKGHIGLLIVLDHFSRYHWLCPLKKFTTSAVI FT DFLGNRIFAEFGTPETIVSDNGSQFKSHEFEAFLTKFGIKHSFTALYSPQS FT NAAERVNRSLLAAIRSYLRADQTEWDINLTSISSSLRSSLHQALGCSPYKA FT LFGFNMINHASDYKLLRKLSLLEESPLLNCQDKLYWLRENIQQNSRTAYER FT NATQYNLRSKPVSYKEGQEVFKRNFYLSNFSKNFNKKLGPQFSKCRVKKKI FT GTAYYVLENLQGKEIGTYHAKDLRS" XX SQ Sequence 5737 BP; 1821 A; 1100 C; 1247 G; 1569 T; 0 other; tagatttttt ggcgcccaac gtggggcaag tgcttattcc ttacatatat tttagtccag 60 caggcagtca cacagatttt tggtgcagcc gccgatgctg aaaggcaggt gctaaagtat 120 ttgtctggta tatagtacta acccctgatt tttgaaagtt gtatgtaaag ctagaaaaga 180 atctaatttg actattggaa agcgagccct tattagggag tgctgatagg tttcgaccgt 240 agaaaaatta attccataaa aatataattc tttggaattg aaaaagctta tgaccaaatg 300 acatcgacgt attatccttc gagttatcga gtgaaatgaa gatttgcttc gctttagtcc 360 ttgtaattgg tgggaaaagt accgggttgt tacaacagac attaatctgg aagctctttg 420 tcttcgattg agatttttgt tatcattatg aaagtagtca agaaagctgt ttgatattga 480 tatacatctt tggaattcga gtttttgaga attcgaaaac tttcgaccaa gattacaggt 540 agcttgttat tcagaattat taacgactgg agtaaagtaa gagaaattct tagtcctatg 600 tgttctttgg tgtgatcagt gtttgaaaac tgactagagg taagctactg gaatacaata 660 taaatgcatt tgtgtttatt tgtccgtaga agtctgcaca gtacagtaga taatggatgt 720 tcatacaaag aagtaacact tattaattgc catagcccaa tttaagtaaa ttcgctaagc 780 acagaattag ttgaaagcct agaaatatgt atccgtatta tgtctccctg agaatgccat 840 cgaagtaaaa tataaaagaa tttttattta gtatgccact agaacatagc tttgagtcaa 900 cccaacctca gcaccaagca gatgctgcag caaattgttg tattatttgc gaagaattta 960 tgttaaacac acaaccttgt gttataacgc attgcaatca cgtttttcac aagagttgtt 1020 tagaatcaca tatacaagta cataaggagt gtccttgctg cttaaagaca gtacagccca 1080 aggattggaa agtagttgta ggtaaccatt cgccgaaggc aaggagtacg gtcacctacc 1140 ctccaaagag acccgttgta ataagttccc gtccgtgtac tcgttcttta gcacatcaac 1200 tagaatcact tacccaaccg gcaccaacag aagccgaggt cctgatcaga gtccatgatc 1260 agtctaattc accaattaca gaaaatcaac atccacccgg tagtactcct acgaatcaag 1320 gagtcaggcc acgagcgtat caagccgcca atgcacccgg tagacgaggt cggccaccaa 1380 ggcaaagaag acagaataat ccgccagtag ccaatagcgt accgagagcg gagatcgaat 1440 ctatcgttcg agagacagta actcgtgttc tgtcatcaat ttccggaagt ggtagccttg 1500 caccaccacc cgaccaaata gctggacatg ggagtccagg tactcaacgt agttataggt 1560 cagcccacgg tatacatcct cttggtccac ataaaatagc cgacgtctta cagaaatggg 1620 gagtaaggtt tgatggcaca gttgaaggat tatcggtaga agaatttcta tatcgcgtga 1680 agatattgac agctgaacat tttgacggtg atcttgaaat tatcagcaga aatattcata 1740 atcttttaat tggcaaagcc agcagttggt tctggagata ccataaaaga gttcaaactt 1800 ttacttggag agagttttgc gtagcttttc gagaacaata tcgtgagagt agaacggcaa 1860 ctgagttacg tgaacaaatt cgctcgcgga agcaaaagcc aggagagtcg ttcgatacgt 1920 tttatgagag tgtttgtaag ctcatggata aactatcagt gagttttgca gaagatgagt 1980 tagtagagat gatccaagga aatttactaa tcgacgtgaa agagagactg ttatttgaaa 2040 atatttcatc tatctcagag ttgcgtaggt tggttcataa gcgagaaaat ttccttaagg 2100 aaacaggccg agtgcggccc ccccaaacta agcccggtcc atatcgcgta tacgcgatgg 2160 atgacggcca actacaagag actcaagaat gtgatgagga aaatgaaatt gcagcaattc 2220 aacgtacaac tcctttattg tgctggaatt gtgaggctcc tggccacata tgggaaaatt 2280 gcttggccga gagacgtatt ttttgttatg gttgcggagc cccgcaaata tataaaccca 2340 attgccagaa ctgcaagcga aagacggaaa accggcaagg ggggacatcc aatccaggaa 2400 gggtgtccca aaagtagaag atagcgctag aatagatttt gaaagagtta caattctaca 2460 aagtaagcta gctcaagagt cgaataatga aattaacata ccttttaagc cgtatcatga 2520 gaggttaaga aactattatt tagtgcgaga acgaattttc gggctcggaa aagaaagtaa 2580 gcgaaatacc cgcagaatgc gaaattacta tcgttcgtta aaaaagaaga gaactaggat 2640 tgtgtcaact gtttttcaag acaacattga tcgtcgtagt tatgctagtg taaagtttat 2700 ggacttttct gaactaggtt tactcgacac tggtgcaaac ataagttgta tcggatcatc 2760 actagcgctt gtggactttt cgaaatatac aaattttaag cccatattgt cgaaagttcg 2820 aaccgcggac gccaagtctc acaaagttct tgaaagtcta agcgtaaaaa tgaaatatca 2880 agataaggag aaagatttag aattatttat tattccttct atagcacaga ggttgatatt 2940 aggtatagat ttctggcgcg cttttgaatt agctccaggg atcattagct ctattctgaa 3000 tgaggacata gaagtagagg aaggtgagga aaaataccca ttaacagaaa atgaaagaca 3060 gcaacttgag attgttaagg aattatttcc aaattttgaa aaacaaggat taggtcgcac 3120 ggccataatc aagcatagta ttgacatagg tgaaagcaag ccggtcaaac agcgttatta 3180 tccggtgagt cctgcagtgg aaaaactaat gtacgcggaa attgaccgaa tgcttagcct 3240 cggagttatc gaaccttcgc aaagtgcctg gagttctcca atgcgcctgg tggtaaagcc 3300 aaacaaagtt cggttatgtc ttgactcacg caaactaaac gaagctacaa agaaagatgc 3360 ctatcctctg caaagcatcg aaggcatttt tgctcgtctg ccaaaggcaa atataatttc 3420 gaaattagat ctcaaagatg cttactggca aattgagttg acagaggaag cgaaaccatt 3480 gaccgctttt acagtcccag gtcgagcatt atatcagttt actgtcatgc cttttggctt 3540 atgtaatgct ccctcgacga tgtctcgtct aatggatgac ttaattccgg cggatttacg 3600 tcattgcgtt ttcggatatc tcgacgatct atgcattgtc tcagaagatt ttgcgtcaca 3660 ttcaactgtc ttagtccgca ttgctgatca attcaagaaa gcaaacctta cactgaacat 3720 aaaaaaagtc acttctgcgt tactaaggtg aactatctag gttatgtgat aggaagtggt 3780 ggaatagcca ctgatccaga gaaaatatca tgtgtaatga actggcccaa acccaaaaat 3840 atgaaacaag tccgaggttt cctaggtgta tgtggttggt acaggcgatt tataaaagac 3900 tttgctgata taacctgtcc cattacggaa gtcttaaaaa caaaaaaatg ttttaaatgg 3960 tctcctgaag cccaggacgc catgcagaaa ttaaaacaaa tactaacgtc tgcgccggtg 4020 ttacaaaatg cggatttttc ccgtaaaatt tttgtacact gtgatgctag tgattatgga 4080 attggtgctg tgctggtaca agtctctgat accggagagg aaagaccatt ggcatttatg 4140 tccaaaaaac tgtcaaagtc gcaaaggaac tatagcgtaa ccgaacgcga atgtctggcg 4200 gtgatattag cgatcgaaaa attccggtgt tatttggaat tacagccttt tgaagtagtg 4260 acagatcatt caagccttct ctggcttatg cgacagcaaa atgtatctgg aagactggcc 4320 cgttggattt ttagattgca agctttcaag ttcaccattt cgcaccgaaa agggaaagat 4380 cacattgttc cagatgcgtt gtcccgaatt tgtgaccctg aatttgtgat ccttgaatgg 4440 ataggtccag aaattgactt agaatcgtca tctttcaaag attctgatta tcttcagctc 4500 aagaaaacaa tagaagaaaa tagcgaaaaa tttcctgatc tgcgaacggc agaccaattt 4560 gtgtacatac gcaccgaaca cccttcggga aataccatcc aggactcgga ttgttggaaa 4620 ctgtgggtgc cggaggcgtt acgaatagca gcgatccgcc aagcacatga ctgtattgta 4680 tctgcgcatg gtggaatgca gaaaacaatc gaacggctgc gaagacactt gttttggcca 4740 gggttgtcca aacaagttag agaatacgtt cgagcctgcg atgtttgtaa agagactaaa 4800 gccccgaaca caacattacg gccacccatg ggtcaacaaa gtgtttcgga cagacccttt 4860 cagaaacttt acattgacat cttaggtccc tacccaagaa gcaagaaagg tcacatagga 4920 ttacttattg tccttgatca tttcagtcgt taccattggt tatgtccact aaaaaagttt 4980 actacctccg cggttatcga tttccttggg aatagaattt ttgctgaatt tggaacaccc 5040 gagaccatcg taagcgacaa tggatctcag ttcaagtcac atgagtttga agcctttcta 5100 accaagtttg gaatcaagca cagtttcaca gcgttatact cccctcagag caacgctgcg 5160 gaacgggtta atcgctcttt attggctgcg ataagatcct atctgagagc ggatcagact 5220 gaatgggata tcaatttaac tagcatttcg tcgtcattac gatcctctct gcatcaagct 5280 cttggatgct caccttataa ggcattgttc ggattcaata tgattaatca tgccagcgat 5340 tataaattgt taagaaagct atccctctta gaagaatctc cgctgttaaa ttgccaggat 5400 aaactatatt ggttaagaga aaatatacaa caaaatagcc gtacggctta tgaacgtaat 5460 gcgacacagt acaatttgcg atccaagccg gtctcgtaca aagaaggcca agaagtgttt 5520 aagagaaact tttacctaag taatttctcg aagaatttta ataagaaatt agggcctcag 5580 ttttcaaagt gtcgagtgaa gaaaaaaata ggaacagcct actatgtatt agaaaattta 5640 caaggcaaag aaatcggtac ataccacgct aaggacttac gaagctaatt ccagctaatc 5700 gtgcctcttt cgaccctctc gttagattat ctggtgg 5737 // ID BEL-2_AA-I repbase; DNA; INV; 5756 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_AA_; KW BEL-2_AA-LTR; BEL-2_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5756 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 853-853 (2011). XX DR [2] (Consensus) XX CC Positions [4750-5340] - Integrase core CC 'GAGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 298..5754 FT /product="BEL-2_AA-I_1p" FT /translation="MKAEAMRKNAHHHKTLLRRRDNIIGSAKLIKSFDDQY FT DVEQANQVPLRIERLDELWRSFETVQDEIEVIEDSDQEFSDLRQEFHDMYF FT SLKASLITKLPTNEAPALNQAVPVAPPPPIPVMSVKLPELKIPEFDGQPEQ FT WIEFRDLFKSVIHSNVQLSAVQKIHYLRSSLKGEASRLISSIPITANSYSI FT AWKAICDRYENTNYLVKQHMSALFRIPSIKKGSATALAELADEFNRHVGIL FT DKLENPNRHWNSFLVERLSSLLDEKSLLEWESQCDEEGTEYTDLLEFIHKR FT SRTLQKCAASCVAPASGYAKPAKGKASSSHVASEHVPKCPTCKQAHPITQC FT DAFSKLSPNQRLECAKKHRLCINCLRGGHMAKDCRSSLCRTCGKKHHSLLH FT LPSLTSASSIVETPGESESPVTSQACAVICPSTTVPTSQSDGSRSVVNEPS FT VVHAMSIPQVVSSLRSSKSSIDRVCPPSSSSVVSSDILEHEPSPTCLQQPA FT ASLTQSNNARDSTVFLSTAVIRVKDVNDVYHFARAMLDSGSQSNFISESLC FT QKLNLKRIRHNLPVSGIGQAIVNVRYKVNLTFASRFGGFDQLLECLVLPKL FT TVNLPCRSVDITQWNIPNNLPLADPRFNICHGIDLIIGAELFYTLLETQRI FT TLADEFPILQKTTLGYVVSGKASIQPTESVLCHVATNQELNAQLERMWEIE FT EFSHGKAFTPEEQQVEEHFIRTAARDESGRYVLRLPFRESAMPLLGDSYKA FT AVNRFLLMERRFTKDHELRQEYTEFMEDYIRLGHMEECSRVAGPQFFLPHH FT AIRRPDSTTTKTRVVFDASSKSHGQLSLNETLFTGPTVQPSLLAIVLNFRI FT PKYVFSADVEKMFRQVWIHPDDRRFLQVVWRPDATVPLKTYQLKTVTYGLA FT CSPFQAARVLNKLADDEGDKFPLAAPIVTKCFYVDDVLGGGDDLDKTAESC FT RQLQDLLSQGGFSLRKWSTNHPKVLQQIPEELRATSSLTEIGKSASTKALG FT LLWNPSTDLFGFQVPTLQPVDIITKRSVVSEMSQLFDPLGLLGPVVVSAKI FT FVQRLWAKQFSWDESLPKEETDWWKNFRSELVQLKNVTVSRCVRPSCNENY FT QLHCFCDASSKAYGCCVFVVGPNDRGRIECRLLIAKSRVAPLRGLSIPRLE FT LCAAVLGSELVHNLLATTNFSSPVTFWSDSTVVLQWIQSPPNVWKVFVSNR FT IAEIQRLTRTCRWRYVPSSLNPADRISRGIQPSQIAEDLLWWEGPPFLSQS FT VDTWPESPKGIPSPNDFEEEKRATVVLTAFEVDDTIFERYSELGKLLKVVA FT LCIRFGRNCRRTPVDRTVGNLNAAEVDEALKRLVRMSQISSFPKEIHHLTN FT KTVDTRCSYNFDSKSPLRNLNVFVDEHGLLRLDGRLKNITAPFDSKCPLIL FT PAKHRYSLLIARSLHLRTAHSGPSLLLATIRQRFWPLRGRDLVRRTVRSCI FT TCFRCRPTECNQQMAPLPTVRVVPSRVFSRSGLDYCGPFNVRPLYGRGANI FT KMYVAIFVCLAVKAVHFEIVTDLTTAACINAIKRFVARRGRLTELHCDNGT FT AFVGADRELNALRRRYLEQFQSDEWTNYCVEAGITFRFIPARSPHFGGLWE FT AGVKSFKFHFRRIFGGKSYTFDEFSTAAVHIESILNSRPLTPLTDHPDDLN FT VLTPGHFLIGEPMFSIPEPDVTNTTVSRLSRLQEMRRSVQHFWKVWSTDYI FT SQLHQRTKWRTAKPNVQVGALVLLTQDGLPPFMWNLGRVEEVYTGSDGLVR FT VVLVRTNRGSFKRAVTQVRVLPIEDPEDDDQQRTTRSMVETDRFNGAQ" XX SQ Sequence 5756 BP; 1540 A; 1384 C; 1358 G; 1474 T; 0 other; ataaatggtg ccgaaacccg ggacgttgat cccacagtga ctgagtgctt ttcgattttt 60 aattcccacg tgctccggat cattccattc acgcttcaat tgtgctgttg tcgagtcgtt 120 gccgtattgc accgagattc taccatccca tcgcagagtg gaagacgtcc gcgagaaggt 180 tttgtccagg aatcgaagtc gagcattgtg tcgaacgaac ggagaggtta gacgaaacac 240 gtggcgtagg tcaatcatca tctgcaggtg ggtacactac tatttttcac ctaagtgatg 300 aaagcagaag ccatgcgaaa gaacgcacat catcacaaaa ctcttctgcg gcgaagagat 360 aacatcatcg gctcggcaaa attgatcaag tcctttgacg atcagtatga tgttgagcag 420 gccaaccaag tgccattacg aatcgagcgt ctcgatgagc tatggcggag cttcgaaacc 480 gttcaggatg aaattgaggt tatcgaagat agcgatcaag agttttccga ccttcgtcag 540 gaattccacg atatgtattt ttcattgaaa gcttcgttga tcaccaaact gccgacaaac 600 gaggctcctg cactcaatca agctgtaccc gtagctccac cgcctcctat tcccgtaatg 660 tctgttaaac ttccggaact aaaaataccg gaattcgacg gacagccaga acaatggata 720 gaattccgtg acctttttaa atcggtcatt cattctaacg tacagctatc tgctgtgcaa 780 aaaatccatt accttagaag ctcgctcaaa ggtgaagctt cccgtttaat ttcatcgatt 840 cctatcacgg cgaatagcta ttcaatagcg tggaaggcca tatgcgatcg ttatgaaaac 900 acgaactatt tggtgaaaca gcacatgtcg gccctgttcc gtattccttc catcaaaaag 960 ggatcggcca cggcattagc tgaacttgca gacgagttta atcgacacgt cgggatttta 1020 gataaacttg aaaatcctaa ccgtcactgg aactcattcc tagtagaacg attgagtagt 1080 ttgttggacg agaaatcgct tctagaatgg gaatcacagt gtgatgagga agggacagaa 1140 tacaccgatt tgttggaatt catccataaa cgatcgcgta ccttgcaaaa gtgtgctgca 1200 tcttgtgtag ctcccgcgag tggttatgca aaacccgcta aggggaaagc ttcctcttct 1260 cacgttgctt cagaacatgt acctaaatgt ccaacgtgta aacaagcaca tccgataaca 1320 cagtgcgatg ctttctctaa actttcacca aatcaacgat tggagtgtgc caaaaaacac 1380 cgattgtgca tcaactgcct gaggggtggt catatggcga aggactgccg tagcagcttg 1440 tgcagaacat gtggcaagaa acaccacagc ttgctgcacc ttccctcgtt aacctctgct 1500 tcttcaattg tcgaaacacc aggggaaagt gaatcaccag tcacttctca agcatgcgct 1560 gtaatctgtc cctcaactac agtgcctacc tcacaatcag acggttcgcg ttcggtcgta 1620 aatgaaccat cggtggttca cgcgatgtcg ataccccagg tagtgtcgtc gttacgcagt 1680 agtaagtcgt caatcgatcg ggtgtgcccc ccttcttcgt cgtcggttgt gtcgtccgat 1740 attcttgagc atgaaccctc tccaacatgc ctacaacagc ccgccgcttc attaacacag 1800 tcaaataatg ctcgtgacag taccgtcttt ttatcaacag ccgtcatccg agtaaaggac 1860 gtgaacgatg tataccactt cgcacgagct atgctagaca gtggctcaca gtcgaatttt 1920 atttcggagt ctttgtgcca gaaactaaac ctaaagcgta tacgtcataa ccttcctgtc 1980 agtggaatag gccaagcaat tgtaaacgtt cgctataaag taaatttgac atttgcctct 2040 cggttcggtg gattcgatca gctgctcgaa tgtttggtgc tgcccaaact gacagtcaat 2100 cttccttgcc gcagtgttga catcactcag tggaacattc caaataatct accgctagct 2160 gacccaagat ttaatatttg ccacggtata gatttaatta ttggagcgga attgttctat 2220 acgctgctag aaactcaacg tataacatta gctgacgagt ttcctatttt acagaaaaca 2280 accttaggat acgtcgtgtc cgggaaagcg tctattcagc ccacagaatc ggtgctgtgt 2340 cacgtagcta ccaatcaaga actgaacgct caattagaac ggatgtggga aatcgaggaa 2400 ttcagccacg gaaaggcctt tactccggaa gagcaacagg ttgaggaaca ttttattcgt 2460 accgccgctc gtgatgaaag tggccgatac gttctgcggc tgccttttcg agaatcagct 2520 atgccactcc tgggagactc ctacaaagca gctgtaaacc gttttttgtt gatggagcgt 2580 cgtttcacga aagaccacga attgagacag gagtatactg agttcatgga ggattacatc 2640 aggctggggc atatggagga atgctcacgc gtcgcgggcc cgcagttctt tcttccacac 2700 cacgccattc gacgaccgga cagcacgaca accaaaacaa gggttgtctt cgatgccagt 2760 agtaaaagcc acggtcaact gtcgttgaac gaaacactgt ttaccgggcc tacagtgcaa 2820 ccaagtctgc ttgccatcgt actcaatttt cggatcccaa agtacgtatt ttccgcagac 2880 gtggaaaaaa tgttcaggca agtgtggata catcctgatg accgaaggtt cctccaggtt 2940 gtctggcgtc cagatgcaac agtaccattg aaaacgtatc aactgaaaac cgtgacctat 3000 gggctggctt gttctccctt tcaagcagca cgggtcctaa ataagctagc tgacgatgaa 3060 ggcgataaat ttccacttgc agctcccatc gtcacaaaat gtttttacgt cgatgacgtg 3120 ctaggcggag gagacgattt ggataaaacc gctgaatcat gccgacagtt acaggatctc 3180 ctcagtcaag ggggattttc cttaaggaaa tggagtacaa atcatccgaa ggtattgcag 3240 caaatcccag aagagctccg agctacctcc tcattaacgg agatcggtaa aagtgcatct 3300 acaaaagcac ttgggttgct atggaaccca agtacggatc tatttggatt tcaggtacca 3360 actcttcaac cagttgacat cataacgaag cgctcggttg tttccgaaat gtcgcagctt 3420 tttgaccctc tggggctgct tggcccagtt gtcgttagcg ccaagatatt tgtccaaagg 3480 ctgtgggcaa aacagttttc ttgggatgaa tcactgccaa aagaagaaac cgattggtgg 3540 aagaactttc ggtcagagtt ggtccaattg aaaaatgtta cagtgtcgag atgcgtgcgg 3600 cctagctgta acgaaaatta tcagttgcac tgtttttgtg acgcctcgtc aaaagcatac 3660 ggttgctgcg tttttgttgt cggacctaac gatcgcggac gcatcgagtg taggctgctc 3720 atcgcgaaat cgagagtagc cccattacga ggactatcta ttcccaggtt ggaattatgt 3780 gccgcagtgc ttggcagtga attagtccac aacctactgg caaccactaa tttctcgagt 3840 ccagttacgt tttggtcaga tagtactgtt gtgttgcagt ggatacaatc accacccaat 3900 gtttggaagg tgtttgtttc aaatagaata gccgagattc agcgccttac acgaacttgc 3960 cgatggagat acgttccatc ctctctcaat cctgctgacc gaatatccag aggtattcag 4020 ccaagtcaaa ttgctgagga tctcttgtgg tgggaaggcc caccattctt gtcccagtca 4080 gtggatacat ggccggaatc tccaaaaggc ataccatcac ccaacgattt cgaagaagaa 4140 aaacgtgcga ctgtcgttct aactgcattc gaagttgatg acaccatttt tgagcggtac 4200 tccgagttgg gaaaactttt aaaggttgta gcattgtgta tccgtttcgg aagaaactgc 4260 cggcgaactc cggtcgatcg aaccgttggc aacttgaatg cagcagaggt agatgaagcg 4320 ctgaaacgac tggtaagaat gtcacaaatt agtagttttc caaaggaaat acaccacctt 4380 acaaacaaaa cggtagacac tcgatgctct tataattttg acagcaaatc ccctttgaga 4440 aacttaaacg tgtttgtaga cgagcacggc ttactgcgtt tggatgggcg tttaaagaac 4500 ataactgccc cgtttgattc caaatgcccc ctcatattac cagccaagca tcggtacagc 4560 cttttgattg cgcgatcatt gcatttaagg acagcccact ctggtccttc attactactc 4620 gccactatcc gtcagcgatt ttggccgtta cgtggccgcg accttgttcg aagaacagta 4680 cgtagctgca taacgtgttt tcgttgtcgc ccaaccgaat gcaatcaaca aatggcacca 4740 ctaccaacag ttagggtggt tccatcgcga gtattctcaa gatccgggtt ggattactgc 4800 gggccattta acgtacgtcc gttgtatggg aggggggcaa acattaagat gtacgtcgcg 4860 attttcgtat gcctggcggt taaagcggta cattttgaaa tcgttacgga ccttacgacc 4920 gccgcatgta ttaacgcgat aaaacggttt gttgctcggc gcggtcgatt aaccgagtta 4980 cattgtgaca atggcacagc attcgtcgga gctgatcgcg aactaaacgc gttgcgtcgt 5040 cgttacctcg aacaattcca gtcggatgaa tggacgaact attgtgtcga ggctggaata 5100 acatttcgct ttataccggc acgatctccc catttcgggg gattgtggga ggccggagtt 5160 aagtcgttta agttccattt ccggcgcatt tttggaggaa aatcttacac cttcgacgaa 5220 ttttctaccg cggcagtgca cattgagagt atactaaact cccggcctct cacacctctt 5280 actgaccatc cagatgacct caacgtgctg accccaggtc attttttgat tggggaaccc 5340 atgttctcga taccagagcc tgatgtaact aacaccacag tttcgcgact ttctcgcctg 5400 caggaaatgc gtcgatcagt acaacatttc tggaaggtgt ggtcgacaga ttacattagc 5460 cagctacatc agcggaccaa atggagaacg gctaagccca acgttcaagt tggagctctg 5520 gtgctcttga cgcaggacgg tctcccaccg ttcatgtgga acctcggtcg agtggaggag 5580 gtatatacag gaagtgacgg cctagtacgt gtggtgttgg tacgcaccaa ccgtgggtca 5640 ttcaagcgag ctgttactca ggtccgagtt ctgccaatcg aggaccctga agacgatgac 5700 caacaacgta cgacgagatc aatggttgaa acagaccgtt tcaacggggc ccagga 5756 // ID Gypsy-253_AA-I repbase; DNA; INV; 4574 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-253_AA_; KW Gypsy-253_AA-LTR; Gypsy-253_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4574 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1107-1107 (2011). XX DR [1] (Consensus) XX CC Positions [3435-3893] - Integrase core CC 'CGATC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 306..4487 FT /product="Gypsy-253_AA-I_1p" FT /translation="MTEERTDGTDAIGGAAAAAAAAAPVIPPTFAFEAFDK FT NKTRWSRWVKRFEGALQIFGIGEALKRNMLLHYMGTETYNVLCDHLSPEDP FT EAKTYEEIVTLLGQYFDPEPLEMVELWKFRQRMQREGESVAEYITALQREA FT KYCGFGDYLQKGLRNQLVFGLRNQRIRTRLIEEKDLTFDKAKQIALSMEAS FT GEGAEVLNRRMQEVNLMDRKKLQPGRDAAVKVTNPTNKFMCFRCGSEAHLA FT DKCEHKNKICGLCKKKGHLKRVCLSSNAQTSKIKNHPKKHSANLVDNSEVS FT ESEGSDEERVYVVDVCKLEHCSNDMSKIFLNVRVGGALIQFEVDSGSPVSL FT ISSTDKAKYLERLSMQPTDIELRSYCGNKIRVLGVIQAKVVYGGQMERLRL FT FVVEGKRHPLLGREWMRALRLDWNDILGGTVSSVDKITLLNPLPAAVKGLI FT EEFSSVFDESIGNIEGIQAALHFKSDSKPVFLKARSLPFSTRDAVEREIHS FT MVESGVLVKVDRSEWATPVVPVMKSGGKIRLCGDYKLTVNKSLLVDEHPLP FT TINEMFSNMAGGQKFTKLDLAQAYLQMSVRAEDQPTLTLNTHLGLYQPTRL FT MYGVASAPAIFQREISQILQGIPGVTVFLDDVKITGPDDKTHLDRLREVLK FT RFHERNMRVNVAKCEFFADSIEYCGYIVDQHGIHKMQQKVDAIQNMPRPEN FT REQIRAFLGLVNYYGRFMENLSTRVYPINNLLKDKTPFEWTDACEQAFLWV FT KSEMQSDRLLVHYDAKLPLVLATDASPYGVGAVLSHIYPDGSERPIQYASQ FT TLNATQQNYSQIDKEAYAIIFGVRKFHQYLFGRKFILVTDNKPVVQIFSPD FT KGLPTLCALRMQHYAVFLESFDFDIRYRPSKDHANADGMSRLPVTDRSIRN FT HVEEIDVVQINQIETLPVDAKELGEITIRDRSVKELIQGLKSGRTVDGRFR FT FGVDQKEFSMQGLCLMRGIRVYVPPPLRSRVLSELHTGHFGVSRMKSLARS FT YCWWENIDRDIEEVARNCVDCARVRASPAKVQVHCWERPSEPFQRIHADFA FT GPFMGLYFLIIVDAHSKWPEVKVIPDMTTETTIEKMREFFATFGLPSVLVT FT DRGTQFTSDQFQKFLKKNGIVHKMGAPYHPATNGQAERYVQTFKDKIKALK FT CPRSEVHVELQQILMAYRRTVHPATGKSPSMLVFGRQIKSRLDLLIPEQQN FT SARKDFRGEEKPVRSFAINDRVAARDFMSQSEKWRFGVVAERVGKLHYMIE FT LDDGRMWKRHIDQLKPGPVAQYLQPGEQNPENIQRRSEVLLPTQDYQAAVQ FT VKVELDSTGQELEKTVDVPEQNTAISDTGRQSTPVSTAVNDTGPGSGEKGK FT RSVNTRALIGNDGETVRKSRREIRAPRRLNL" XX SQ Sequence 4574 BP; 1289 A; 936 C; 1228 G; 1117 T; 4 other; gttggcgacg agcggattag taatcccgca ttgaaagtat tcggccggcc atatcagaac 60 cgtggcggag ctcccattga ggcgcgcgga cagcttggac gaggaggaga gatccggaca 120 agtagcattc tgcattgggg ctcatcaaak cgaagtgata actgacaccc acaccactat 180 acgttgttgc tgttttccgg tgaggggaaa cggagtacac atattwctgc tatagtgtac 240 gtgctgctgc tactaccatc gtgtgcgttg gtatgcwgtg tgtgtgaggt gcgacgaagt 300 gaaaaatgac ggaagagaga acagatggaa cagatgcaat cggaggcgca gctgccgctg 360 ccgccgccgc cgctccggtc atcccgccta ccttcgcgtt cgaagcattc gacaaaaaca 420 aaacaaggtg gtctcggtgg gtgaagcgat tcgaaggcgc actgcagatt tttggcatcg 480 gtgaggctct gaagaggaat atgctgcttc attatatggg aaccgagaca tacaatgtgc 540 tttgcgacca tttgtctcca gaagatccgg aagccaaaac ctacgaagaa attgttacgc 600 tgctggggca gtacttcgac cccgaaccat tggagatggt tgagctgtgg aagttccgtc 660 agcggatgca acgagaggga gaatccgtag cggaatacat cacggcgtta cagcgagaag 720 cgaagtattg cggttttggt gattatctgc agaaaggact acggaatcaa ttggttttcg 780 gcctccgkaa ccaacgaatt cgaaccagac tgatcgaaga aaaggatttg acgttcgaca 840 aggccaagca gatcgcgttg tccatggagg cttccggcga aggagcagaa gtgctaaacc 900 gacgaatgca ggaagtcaac ttgatggatc ggaaaaagtt gcagcccgga agagacgcag 960 ctgtgaaggt aactaacccg actaacaagt ttatgtgttt ccgttgcgga agtgaggcac 1020 acttggctga caagtgtgag cacaaaaata aaatttgcgg cctgtgcaag aaaaaaggcc 1080 atttaaagcg agtgtgcctc agctccaatg ctcaaactag taagattaag aaccacccaa 1140 agaaacattc agcaaacctg gtagacaata gcgaagtatc agagtcagaa ggctccgatg 1200 aagagcgggt atatgttgtt gatgtgtgta agttagaaca ctgctctaat gatatgtcaa 1260 agatcttcct aaatgtgaga gtgggtggag cgctgatcca gtttgaggta gacagtggat 1320 ctccggtgtc gttgatcagc agtactgaca aagctaagta tttggagaga ttgtcaatgc 1380 aaccaacaga tattgaactg cgtagttact gtgggaacaa aatcagagtt ctcggtgtca 1440 tccaggcgaa agttgtgtac gggggtcaaa tggaacgctt gcgtttgttc gtagtggaag 1500 gaaaacggca tccattgtta ggacgcgagt ggatgcgagc cttacggctg gactggaatg 1560 atatcttggg gggcaccgtt agttctgttg acaaaattac tttgcttaat cctcttccgg 1620 cagcagtcaa agggttgatt gaagagtttt cctctgtctt tgatgaatca attggtaaca 1680 ttgagggtat tcaagccgct ttgcacttca aatcagattc aaaacctgtg tttctcaaag 1740 cacgctcact tccgttttca actcgagatg ctgtggaacg ggagattcat agcatggtcg 1800 aaagtggagt tctggtgaaa gttgaccgta gtgagtgggc tacaccggtg gttccggtga 1860 tgaagtctgg agggaaaatt cgtctttgcg gagactacaa actaacggtt aataagagtt 1920 tgctagtcga tgaacatcca ttacctacga tcaacgaaat gttttccaac atggcaggtg 1980 ggcagaaatt cactaagctg gatttggctc aagcttattt gcaaatgtcc gtgcgtgctg 2040 aagaccaacc aacgttgaca ttgaatacac acctgggtct gtatcagcca acaaggttga 2100 tgtatggagt tgcttctgca cctgccatct ttcagaggga gatatctcaa atattacagg 2160 gaataccagg agttacagta tttctagatg atgtgaagat cactggtcca gacgataaaa 2220 ctcacttgga tcggctacgt gaggttttga aaaggttcca tgagcgcaac atgcgagtga 2280 atgtggcaaa gtgtgagttt tttgccgaca gtatcgagta ctgtggatac attgtggatc 2340 agcacgggat tcataaaatg caacagaagg ttgacgctat ccagaatatg ccgcgcccag 2400 agaatcgcga acagatccgt gcgttcttag gtctcgtgaa ttattacggt agatttatgg 2460 aaaacctaag cactagagtt tatcccataa ataacctact gaaagacaag acaccatttg 2520 agtggactga tgcatgtgag caagcctttt tgtgggtgaa atcagaaatg cagtcggatc 2580 gtttgctagt acattatgac gctaaactgc cgttggtgtt ggccacagac gcctcaccat 2640 atggggttgg cgcggtactg agccatattt acccagatgg tagtgaacgg ccgattcaat 2700 acgcttcaca gacactcaat gctacccagc aaaactacag ccaaattgac aaggaggctt 2760 acgcgattat ctttggggta cggaaatttc accaatacct tttcggacga aaatttattc 2820 tggtgaccga caacaagcct gtagtacaga tattttcgcc tgataaaggt ttgccgacgc 2880 tatgtgcact gcgtatgcaa cattacgccg tttttctcga gtcatttgat ttcgatattc 2940 gttatcgtcc ttcaaaggac catgcaaacg ctgatggaat gtcacgttta ccggttacag 3000 accgtagcat ccgaaatcat gtggaagaaa tcgacgtggt tcagattaat cagattgaaa 3060 cgcttccggt agatgctaaa gagcttggcg agattactat cagggaccgg agcgtcaaag 3120 aactaattca aggactgaaa tcaggacgta ctgttgatgg ccgctttcgg tttggagtcg 3180 accagaagga gttcagtatg caaggtttgt gcttaatgcg agggatacgg gtatacgtgc 3240 ctccaccgct tcggagccgc gtcctcagcg agttgcatac cggtcatttt ggtgtctcca 3300 ggatgaagtc attggcgaga tcttactgct ggtgggagaa cattgatcgg gacatagaag 3360 aggtggctcg gaactgtgtg gactgcgctc gagttcgagc aagtcctgct aaggtacaag 3420 tacactgctg ggaacgtcct tcagaaccgt ttcaaaggat ccacgctgat ttcgctggac 3480 cgttcatggg gctgtacttc ctaatcatag ttgatgctca cagcaagtgg cccgaggtga 3540 aggttattcc tgacatgact acggagacta ctattgagaa gatgagggag tttttcgcta 3600 ctttcggact accatcagtt ttggtaacgg atcgaggcac acaattcacg tccgaccagt 3660 ttcagaaatt tctgaagaag aacggaattg tgcacaagat gggagcgcca taccatcccg 3720 cgacaaatgg tcaagcggaa cggtatgtcc agacatttaa ggacaaaatc aaagcactga 3780 aatgcccgcg ctctgaagtc cacgttgagt tgcagcagat attgatggcg tatcgaagga 3840 ctgtgcatcc tgccacagga aaaagtccat ccatgttggt ttttggaagg caaataaaat 3900 ctagactgga tcttctgatt cccgaacagc agaactccgc tcgcaaagat tttcgggggg 3960 aggaaaagcc ggtgcgctct ttcgcgataa atgatcgagt agctgcacga gactttatgt 4020 cacaatcaga gaagtggaga tttggagttg tagctgagag agtcggaaaa cttcactata 4080 tgatcgagct tgatgatgga agaatgtgga agcgtcatat tgaccaactg aagccggggc 4140 ctgttgctca atatttacaa ccgggtgaac agaatccaga gaacattcag cgacgtagtg 4200 aagtgttgtt gcccacccaa gactatcaag ctgctgtcca agttaaagtt gagcttgatt 4260 cgactggaca agagctagaa aaaactgttg atgttcctga acagaatacg gcaatatcag 4320 acactggaag acaatctacc cctgtcagta ctgcggttaa cgatacggga ccaggttcag 4380 gggaaaaggg taagcggagt gtaaatacaa gagctttgat agggaatgat ggcgaaactg 4440 tcagaaaatc tcgccgtgaa attagagctc cgcgtagatt gaacctgtaa tgtgttgtga 4500 ctaacctact aaccaaatat atgtggatct gtagacactt acatgacagg atattactta 4560 agggggggga gaga 4574 // ID CR1-7_CQ repbase; DNA; INV; 3288 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3288 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 9-9 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 36..3185 FT /product="CR1-7_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="LAVGVRSGQTSGASYRAVPDDRRAAAVGSRADPDADG FT ACTTDRFTCQPTNXGRCRNGDYRVETAATSNSPQIAASTQPTHSVLLPRRQ FT RRPNRKLVPSSLLVYYQNAGGIRTKTNQFNLALASSDYHVIAVSETWLRDG FT ILDSELSSNYQFFRKDRSPATSELRRGGGCLVAVKNGLDATLVKLAGYDHL FT EQTIIRVKVRRQNVFLCCIYIRPNSPNDIYLSHYAAVREVLSKAGVNDLVI FT VVGDYNLPGLRWTFDDDVNAFIPNNASTEIELDLMESLLTSGMQQVCNLSN FT ARGNILDLAFVNDADRVDLIEPPSAILKPDRHHKQFVLKVDLHHNPDQVSQ FT HSADVADFDFNRCDHVAVTDALNQIDWDNVLNSEDANTQASQFYSVVFDVI FT QQLVPRKRIARDRSIKQPWWNAELRHKRNILRKARKRLFRSKSPEDNVCVE FT RLETEYELLNETLYREYLDNIQTRLTDDPSSFWSYVKSRKRTDGIPSVVDS FT GDRSSNSPEEAANLFADFFKGVFNHGLHTAPADYLDEIPSHNIDMPFPVLS FT EAEVLEALAAVDSSKGPGPDKLPPSFIKMCAGSLAHPVSLVFNRSLTNGQF FT PDVWKVAAITPIHKSGRRKDVQNYRPISILSCLPKVLEQLIHKRMYAAAVP FT IISEFQHGFVKKRSTVSNLMLYVSSINSSLEKRCQVDSVYVDFAKAFDKVP FT HNLAIEKLRRMGFPRWLTVWLSSYLTGRTGFVRIGSSRSLQFAVTSGVPQG FT SHLGPLIFLLFVNDICTHLKSQKLLYADDLKVFRVVSSAIDCVALQQDICA FT INEWCVRNGMKVNEEKCNVISFSKKQDTIHYRYEMNSHLLTRVDCIKDLGV FT MMDHKLTFDKHVAVTSAKAFATLGFLRRNTVDFENIRALKTLYISLVRSIL FT EYAVQVWAPHYAVLCDRLEKVQKSFTLYALRRLPWRNPAYLTSYDDRRRLL FT QLESLEHRRIFLQRLFVFDVLTNRVDCPEIRQKMSLHEPQRALRTYLPFRI FT ARHTTVYGQNHPIDRCCRYFNPVWCFFNFDMTKELFKTKIRNVL" XX SQ Sequence 3288 BP; 869 A; 831 C; 751 G; 836 T; 1 other; tcgctgattt tgtgccgtga gttcgaggac tataactcgc agtcggggtt cgatccggac 60 agacctcagg cgccagctac agagcagtcc cagatgatcg aagagcagca gccgttggat 120 cccgtgcaga cccagacgct gacggtgcct gtacaacaga ccgtttcacc tgccaaccaa 180 ccaacwccgg ccgatgtcgc aatggagact accgagtaga aactgctgcc acatccaact 240 ctccacaaat tgctgcttca actcaaccaa ctcattcagt gttattacct cgacgccaac 300 gccgaccgaa tcgaaagctt gtcccttcca gtctcctcgt ttattaccag aacgcaggtg 360 gtattcgaac caaaactaac cagtttaact tggccctagc cagcagcgat taccatgtga 420 tagcggtgtc tgaaacgtgg cttcgtgatg gtattctgga ttcagagctc tcatccaact 480 atcaattttt tcgaaaagat cgtagtcctg ctacgagtga actgaggcga ggaggtggtt 540 gccttgttgc cgtgaagaat ggactcgacg caactcttgt gaaactcgcc ggctatgatc 600 atctggagca gaccataatt cgtgtcaaag ttcggcggca aaatgtattt ttgtgctgca 660 tctacataag accgaacagc ccgaacgata tctacctttc gcactacgca gctgtccgtg 720 aagtgttatc gaaagctggc gtaaacgact tagtgattgt agttggagac tacaacttgc 780 ctggtctgcg ctggacgttt gacgacgacg tgaacgcatt cattcccaac aatgcctcga 840 ccgaaattga acttgacctt atggaatcct tactgacgtc cggaatgcag caggtttgta 900 atctctccaa tgctagaggc aacatacttg acctagcttt cgtgaacgac gcagatagag 960 ttgacctgat cgaacctccg tcggctattc tcaaacccga ccgacatcac aaacagtttg 1020 tactgaaggt tgacctccac cacaatcccg accaagttag ccaacactct gctgacgttg 1080 ctgattttga cttcaaccga tgtgaccatg ttgctgttac ggatgcattg aatcaaatcg 1140 actgggacaa cgtgctgaac tcggaagacg ccaacactca agcatcacag ttctacagcg 1200 tagtatttga cgtgattcaa cagctcgtgc cgaggaaacg gatagctaga gatcgatcaa 1260 tcaaacaacc atggtggaat gctgagttac ggcataagcg gaacattctg cgcaaagcaa 1320 ggaaacgact gtttcggtcg aaatcaccgg aggataacgt ttgtgtcgag cgactggaaa 1380 cggagtacga gctgctgaac gaaacactgt accgtgagta cctcgacaac attcaaaccc 1440 gcttaacgga tgacccttcg tcgttttgga gttacgttaa gagcaggaag cgaactgatg 1500 gcattccttc cgtggtcgat tcgggtgatc gtagctcaaa cagtcctgaa gaagcagcta 1560 atttatttgc cgactttttc aaaggcgtgt tcaaccatgg cttgcacacc gctccagcag 1620 attatctcga cgaaattccc tcccacaaca ttgacatgcc atttccggtt ctttctgaag 1680 cagaagttct agaagctttg gctgcagttg actcttcaaa aggccctggt cctgataagc 1740 tgccaccttc tttcattaaa atgtgcgctg gatcgttggc acaccctgta agcttagtct 1800 tcaaccgatc gcttacaaat ggtcagttcc cagatgtgtg gaaagttgct gccattacgc 1860 caattcataa atctggacga aggaaagatg ttcaaaacta cagaccgatc tcaatactat 1920 cctgcctgcc aaaagttcta gaacaactga ttcacaaaag aatgtacgct gccgcagtcc 1980 ccatcatatc ggagtttcag catgggttcg tgaagaaacg atccacagtc tcgaacttga 2040 tgctgtacgt tagctcgata aattccagcc ttgagaaacg gtgccaggtt gattccgtgt 2100 acgtggactt tgccaaagcc tttgataagg tgcctcataa cttggcaatt gaaaagctga 2160 gacgtatggg attccctcgc tggctcacgg tatggctctc ttcctatctg accggcagaa 2220 cagggtttgt tcgcatagga agctcacgat cgctccagtt tgccgtgacc tccggagtgc 2280 cgcagggaag ccacctcggc ccattgatct ttttactgtt cgtcaacgac atttgcactc 2340 acctcaaatc ccaaaagttg ctgtatgcag atgacctcaa agtgtttcgt gttgtttcct 2400 ctgccatcga ttgcgtcgcc ctgcagcagg atatctgcgc tattaacgaa tggtgtgtta 2460 ggaacgggat gaaggtgaat gaagagaaat gcaacgtcat cagcttctct aagaaacagg 2520 acacgattca ctaccgatat gagatgaatt cacacttgtt aactcgtgtc gactgcatca 2580 aagacctcgg cgtaatgatg gaccacaagc tcaccttcga caagcatgtc gctgtcactt 2640 ccgctaaggc ctttgcaacg ctgggattcc ttcgacgtaa cacggtggac ttcgagaaca 2700 ttcgggcact taaaacgctg tacatctctc tggtacgaag tattctcgag tatgctgtgc 2760 aagtctgggc accgcattac gcagttctgt gtgaccgctt ggagaaagtg cagaaaagct 2820 ttacactata cgctctacgt cgcctgccct ggaggaaccc tgcgtaccta acaagttacg 2880 atgaccgccg aagattacta caattagaat ctctagaaca tcgccggatt ttcctgcaac 2940 gtctgtttgt gtttgacgtt cttacaaacc gagtagactg cccggagatc cgtcagaaaa 3000 tgagccttca cgagccccaa cgtgcgctga gaacctacct gccgtttagg atcgctaggc 3060 acactaccgt ttacggacaa aaccacccga tcgatcgctg ctgccgttat ttcaacccag 3120 tttggtgctt ttttaatttc gacatgacaa aagaactatt taaaactaaa attagaaacg 3180 tgttgtgacg tacctttatg tttttaatgt tttatatctg tttatacgtt tagtttataa 3240 gtatatacag tctgtgcggc ataatgccga agacggtgac aaataaaa 3288 // ID Neptune1_Ap repbase; DNA; INV; 2765 BP. XX AC . XX DT 27-DEC-2006 (Rel. 11.12, Created) DT 27-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Neptune1_Ap is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Neptune1_Ap. XX OS Acropora palmata OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; OC Scleractinia; Astrocoeniina; Acroporidae; Acropora. XX RN [1] RP 1-2765 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Neptune1_Ap is a Penelope-like element (PLE) from the elkhorn CC coral, Acropora palmata. It belongs to the Neptune group of PLEs. CC Its ORF contains regions homologous to reverse transcriptases and CC to GIY-YIG endonucleases, with the characteristic CxxC motif in CC between. The element is apparently inactive, its copies are CC 85-95% identical. Consensus sequence was assembled from trace CC archives. XX FH Key Location/Qualifiers FT CDS 100..2535 FT /product="Neptune1_Ap_1p" FT /translation="MRITIRAMCVKRDDLDKQILQCRSELSKICPVVLLNS FT IRAKIQELNAALFNHLHQIKTLKLKHFIGPKTSSHGTFDSQNTVVTIPENL FT PLTDSEKSVLSRGLNFVPIAKSTDEFSVKQDVEKFPRRIQLKAFFREKEDN FT SYASDKDIFETLHVRKSKWTPPEGQFASLDFFIKKCRHDVHKLKSNCNTKL FT SNLSKEEWTALINLKNRNDLVIKAADKGGATVVWRTDLYHQEAIRQLSDPT FT FYTKVNKDLTPANQKIVKDTIQELITKQELPVTAQNLIITTPRTSCIYFKP FT KIHKTQQPRPSIVSACSCPTELISSYFRQSHDTHSQITTFIYQRQQPRTRN FT IPGNFQFSQAENKIIFTMDITSLYTVIPNNEGLQALKYFFNQRPIKKPSSE FT TLLRLAELVLTLNCFSFGDNYYKQINGVAMGTKMGPGYANLFVGLIENKFF FT SNYHGPKPDLYKRYIDDCVGATSSSKEELNLFINSVNSFHPALKYTWEISE FT NSLAFLDIKLSINDNGLSTSVHYKPTDSHNYLLHSSSHPQHVKNAIPFSQF FT LRLRRLCSDDTDFNNKCEEMCQFFKKRGYPDSAVTTGKHRAQEIDRETALQ FT TSQNEETDRIPFTLTYHPQNLAIKNVILKNFKILRNDPETKHIFSLPPLIS FT FKRDKNLGNFLVRSAFKFNNQPGTFTCKRTRCKTCPFISNTVKISGPNRSV FT KVTDHFTCISTNVIYCITCTLCKKIYIGETGRRLADRFREHLRDVEQNNTD FT ASKPVARHFNLPNHSHHNMTICGLSLHHGNTESRKNLEQKFIFQLGTLSPH FT GINERLSFH*" XX SQ Sequence 2765 BP; 905 A; 776 C; 388 G; 696 T; 0 other; gtgtgcggct tgccaatttt tcatgcgtct tcattttctc gttgcaatca ataagtctac 60 caaatcccat gcgttcaaaa ttctttttca cgtaacatta tgagaatcac tatcagagcc 120 atgtgcgtaa aacgcgacga cctggacaaa caaattctcc agtgccgctc cgaactttcc 180 aaaatctgtc cggttgtcct gttaaattct attcgcgcta aaatccaaga acttaatgct 240 gcacttttta accatttaca ccaaattaag accctaaaac tcaaacattt cattggtccc 300 aaaactagca gtcatggaac atttgatagc caaaacactg tagttacaat tccagagaat 360 cttcctctta ctgactcaga gaaatctgtt ctcagtagag gcctaaattt tgttcccatc 420 gcaaaaagca ccgacgaatt ttcagtcaag caagacgtcg aaaaattccc tcgccgcatt 480 cagttgaaag cctttttccg tgagaaagaa gataattctt acgcttcgga caaagatatt 540 tttgaaacac ttcacgttcg taaatcaaaa tggactcccc cagagggcca atttgcctct 600 ttagattttt tcatcaaaaa atgccgccat gatgttcata aactaaaatc taattgcaac 660 accaaactct ccaacctctc caaagaagaa tggactgccc tcataaatct caaaaaccga 720 aatgacctcg tcatcaaagc agccgacaaa ggcggcgcga cagtcgtttg gcgcaccgac 780 ctctaccatc aagaagcgat tcgccaactt tcggacccga ctttttacac caaagtcaac 840 aaagacctaa ctcccgccaa ccaaaaaatt gtcaaagaca ccattcaaga actcataaca 900 aaacaagaac tacctgtcac cgctcagaat ctcatcatca ctactcctag gacctcgtgc 960 atttatttca aacctaaaat tcacaaaacc caacaaccca ggccgtcaat tgtttcagca 1020 tgcagttgcc ctactgaact tatctcgagc tattttagac aaagtcatga cacccatagt 1080 caaatcacta ccttcatata tcaaagacag caaccacgca ctcgaaatat tcccggaaat 1140 tttcaatttt ctcaggccga gaacaaaatc attttcacta tggacataac atccttatac 1200 actgtaattc ccaacaatga aggcctccaa gcactcaaat acttttttaa tcaacgtcct 1260 atcaaaaaac caagctcgga aactttactc cgtctagctg aattggttct cacactcaac 1320 tgtttttcgt ttggtgacaa ctactacaaa caaatcaacg gtgttgcaat gggaaccaaa 1380 atgggacctg gctacgccaa cctcttcgta ggcctcatag aaaacaaatt tttctccaac 1440 taccacggac caaaacctga tctttacaag cgctacatcg atgactgcgt cggcgccact 1500 tcatccagca aagaagaact taacctattt attaactcag tcaattcttt tcacccggct 1560 ctaaaataca cctgggaaat ttccgaaaat tcattagctt tcctcgacat taaactttct 1620 atcaacgaca acggtttatc cactagcgta cactacaaac caactgattc tcataactac 1680 ttgctacatt cgtcctctca tccacaacac gtaaaaaatg ccatcccatt ctctcaattt 1740 ctcagactga gacgcctctg cagtgacgac accgacttta acaacaaatg cgaggaaatg 1800 tgccagtttt tcaaaaaacg cggctaccct gactccgctg taaccacagg caaacaccgc 1860 gcccaagaaa tcgaccgaga gaccgcacta caaacttcac agaacgaaga aaccgacaga 1920 attccattca cccttaccta ccacccacaa aaccttgcaa tcaaaaatgt cattctcaaa 1980 aacttcaaaa ttctccgcaa tgatcccgaa actaaacaca tattttctct accaccactc 2040 atttcattca aacgcgacaa aaacttaggc aatttcttag tcaggagcgc attcaagttt 2100 aacaaccaac caggaacctt cacatgcaaa cgcacacgat gcaaaacttg tccctttatt 2160 tctaacacag ttaagatctc aggacccaat cgatccgtca aagtcactga ccattttaca 2220 tgcatctcca caaatgtcat ctattgcata acctgcacgc tatgcaagaa aatctacata 2280 ggcgaaacag ggagaagact ggcggaccgc ttccgcgaac acctacgaga cgtagaacaa 2340 aacaacacag atgcgtccaa accagtcgcg cgccatttta atcttcctaa tcactcccac 2400 cacaacatga ctatttgcgg gctatcctta caccacggga acacagaaag ccgcaaaaat 2460 ctcgaacaaa aattcatttt tcaactgggt acactctctc cacacggaat taatgaacgc 2520 ctctcattcc actaatttat tcacaaattc atgtgaccat atttccacca atggcaaagc 2580 tcctctacac tctcatataa accacaacaa cccacaattc ctctattcgc tctgacgaag 2640 ggctaacgct cgaaacgtca gctttctaaa tctttcacgg tggtaattca acctttatca 2700 actcgtttga taaaaccaaa tttttgtttt gatctctccc accgacgcag caccacagtt 2760 tcttt 2765 // ID Gypsy12-I_Dya repbase; DNA; INV; 4402 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy12_Dya; KW Gypsy12-LTR_Dya; Gypsy12-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4402 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1085-1085 (2009). XX DR Genome; chr3R; Positions 208022 203621. XX CC Positions [1995-2501] - Reverse transcriptase CC Positions [3555-4106] - Integrase core CC 'TACA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1659..2564,2568..4385) FT /product="Gypsy12-I_Dya_1p" FT /translation="MQLIIGEQLCLQAEIVFNCNGLQVRKNKRNNKQNFLN FT ICVEVDNELFDINEAASEYAKQEVRELISNYEPVKTKTTNVEMRIVLNDKA FT PIFSRPRRLAFSEMHIVDEQIKEWLENDIIEESTSEFTSPIVLVKKHDGSV FT RLCVDYRRINKVIVKDHFPLPLIEDQLDRLQEAKIFSTIDLKNGFFHVPVA FT ELSRKYTSFVTHREQYQFLKVPFGFSNSPGVFQRHVNAIFRDLSRRGIALP FT YVDDIIIPAKTEQDAVQNLKEVNATCKDYGLQLNLKKCNFLKTRIQFLGHV FT IENRKIYPSPKIEAVSKFKMPQTIKQVESFLGLTGYSRKFIPNDAGIARPL FT TELTKKNTRFRFDLDEENAVNILKKLLTENPVLSIYNQTYETEVHTVASID FT GFGEVLLQMNVDDGQLHPVYFMSKKTTDAQRKYSSYELEILAVIEALTKFR FT VYLLGIHFKLVTDCNAFTKTLEKQSLCTRVARWILFLQEYNYTVEHRSGVR FT MKHVDALSRYPVMLIKGDSISTALKDAQARDEKIRAIKEVLSDENKVYDDY FT FLKRGVLYKLVGDDELVIVPNEMQKRVIISSHDRGHFAAKKTNCREFCIPH FT VEDKIKKYIECCIPRILTDHKRGKQEGLLHPLQKEDVPLFTYHIDFLGPLE FT STHKDYKHILAVIDSFTKFCWLYSTKSTTANEVIVKLSNQSVNFGSPAFII FT SDRGSAFTSQDFVKYCEDEGIKPVKTTTGLPRVNGQVERLNSIIISVLSKL FT SVDDPTNGINTTAFELLVGIKMRTKDDLQIRDLINKEAVAVFNEERNELRV FT KAKTQILKLQNENKKDYNLRRKQARRYDIGDLIAIKRTQFGSGLKLKPKFL FT GPYKVVKVKYNNAYDVQKVGNSDGPKNSATCAEYMKPWPIGNDDSDDDEAF FT GSKA" XX SQ Sequence 4402 BP; 1553 A; 757 C; 981 G; 1111 T; 0 other; ttttgggcgc tcgtccggaa ttgagcaaat gtgaacggaa ttgagcaaat gtgaacggga 60 ttgagcaaat gtgaacaaaa ttaatgatat taaacaagag ttgaacgtgt aaagcaaaaa 120 agtacagtcg caattttttt ttaccataat ggcgtgttta aataatttca taagtgctag 180 tttacgctca agtaagaagg agagcattgc tgcactgcat aagtttgtgt tcgagagtga 240 aggaaacaga agaatcgcca aagactacgg gaatttactg gtttcgattt tgccgaaaat 300 gaccagagat atggtgcaaa gaaggaatac gcagaagata atttgacgca ggttgacctt 360 gtggcaatat gtaacgtgct aggcataggg tacagagacg aagatttgca tatacacatt 420 ttccgcaact tgaagaagaa cagtttactg tcgtcaaccg aagccgacga aacagacgat 480 gaaacagaca gtgaagcaga aaacgttgta ttccagacgc cgaacagagc aacgaacgcc 540 gaaagaagtg atgtagaaat gtacaaggaa tagacgagat gaataccagg cgcaacgtat 600 taactgaaaa cgacagtatg tcgacgccac gctttgcgat cagttttcgc gacgttgaag 660 agagcatacg cagttatgat ggcactaaca gtattccaat cgaagtatgg atagaagagt 720 acgaagagca agcgacgctt atgtattgga atgacttcca gaagtttctg tttgcaaaaa 780 aaaatcactt aaaggaattg caaaattgta tgtgttgagc gagagaggtc taaattcgtg 840 gaattctctc aaagaagcat tgctatccga atttaagtca agtgtcagca gcaagctagt 900 tcacgagcaa ctcgcgcaaa cgaaacgtgg caacaacgag tgtgttttcg agtactttta 960 tagaatgaaa aatattgctt ctagaggtaa aattgcagat gacgctttca tacagtattt 1020 aattgacggc atcgacgaaa aaagtgtaaa caaatcaata ttatatggtg ccaaaaatat 1080 gagtcaattt aaagacaaac taaagtgttt cggaacaatg atagcaaagg aaaatcagag 1140 tgagaaagac agatcgaagc gcaatgagaa cattgtgaaa gagaaagaca gaaaaaatca 1200 gaatgcaaac aaagacgcga aaaaagagct gaaatgctac aactgtggtg aaaaaggaca 1260 tgtgttgaat aactgccaga ataaagaaaa aggcagaaag tgttaatcag tacggacata 1320 tttctaaaga gggcgcaaat attgacaaaa atgttttaaa cacgaacact cgcagtttga 1380 tttcaaaaaa cgatttgacc agcaaagaag tttatattgc aaataaaaag tgtacggcgt 1440 tattcgatac gggtagtaaa ttcaacatca tacgggaagg tttttattgt gatattggaa 1500 agccgtagct ggaaagctgc aatgtcgttc taattggatt cgggagtgac cgcgactcca 1560 ataaaataaa accaattgga caatttaaac aacgaattct gattgatgaa gaggattatt 1620 tgttagattt ttacgtcgtt ccaatcaaat ttcttgatat gcaattaatt atcggagaac 1680 agctatgttt acaagcagag attgttttta actgtaacgg cttacaagtg cgaaaaaata 1740 aaagaaataa taaacaaaat tttctaaaca tttgtgttga agtggacaat gagttgtttg 1800 acattaacga ggctgcgagc gaatatgcca agcaagaggt gcgtgagcta atttcgaatt 1860 atgaaccggt caaaacaaaa acaacaaatg tagaaatgcg tattgtgcta aatgataaag 1920 cgccaatttt ctcacgtcca cgcagactag ctttttcgga aatgcatatt gttgacgaac 1980 agattaaaga gtggcttgag aatgacataa ttgaagagtc aacatccgag tttacaagtc 2040 ccatagtact tgtaaaaaag cacgacggtt ctgtaaggct gtgtgtagat tatcgcagaa 2100 ttaacaaagt aattgttaaa gaccatttcc cattaccgct cattgaggac cagttggatc 2160 gtctgcaaga ggcaaaaata ttcagtacaa tcgatttgaa aaacggattt ttccacgtac 2220 cagttgctga actcagcaga aaatatacgt catttgtgac acaccgagaa caatatcagt 2280 ttttgaaagt accctttgga ttttcaaatt cgccaggcgt ttttcagcgc cacgttaatg 2340 cgatatttcg ggatctgtca cgcagaggaa tcgcattgcc ctatgtagat gatattatta 2400 tccccgctaa gacggaacaa gacgctgtac aaaacttgaa ggaagttaat gcgacgtgca 2460 aagattacgg tctgcagctt aatttgaaaa aatgcaattt tctgaagacg cgcattcaat 2520 ttctgggcca tgttatagaa aatcgcaaaa tttacccctc accatagaaa atcgaagctg 2580 tatccaaatt caagatgcca caaactatta agcaagtaga aagttttttg ggcctgacgg 2640 gctattccag aaagtttata ccaaatgacg caggaatagc tagaccttta actgagttga 2700 caaagaaaaa cactagattc agatttgatt tggacgaaga gaatgcagtt aatattttga 2760 aaaagttact aactgagaac cctgttttaa gtatttataa tcagacttac gagacggaag 2820 tgcatacagt tgcgtctatt gatggattcg gagaagtttt attacagatg aacgtcgacg 2880 atggtcaact gcatcctgtg tattttatgt caaaaaagac cacagatgca cagcgtaaat 2940 acagcagcta tgagttagaa atattggcag tcatcgaagc tttgactaag tttagggtgt 3000 atttgttagg cattcatttt aaactagtaa cagactgcaa cgcttttacg aaaactttag 3060 aaaaacaaag cttatgcacc agggtggctc gctggattct ttttctacaa gaatataact 3120 atacagttga acatcgatct ggcgtgagaa tgaagcatgt cgatgctctc agccggtatc 3180 cagtgatgct cataaaagga gatagcataa gtactgcttt gaaggatgcg caagcacgcg 3240 acgagaagat aagagccatc aaagaagttc taagcgatga aaataaagta tacgatgact 3300 actttttaaa aagaggtgtg ctgtacaaac ttgtaggtga cgatgagctg gtaatagtgc 3360 caaacgaaat gcaaaagcga gtaattatca gttcccatga cagaggtcat tttgctgcaa 3420 agaagacaaa ttgccgagaa ttttgtatac cacacgttga agacaagatt aaaaaatata 3480 tcgagtgctg tattccacgc attttaacgg atcacaaacg aggcaagcaa gaaggactac 3540 tacacccact ccagaaagaa gacgtcccat tgtttaccta ccatattgac tttttgggtc 3600 cgctggagtc aacgcataaa gactacaaac acattttggc cgtaatcgat tcttttacga 3660 aattttgctg gctctattct acgaagtcaa ccacagcaaa cgaggtgatt gtgaagctca 3720 gcaatcaaag tgttaacttt ggcagcccag cattcattat ttcggataga ggctcagcat 3780 tcacatcgca agactttgtg aaatattgcg aggatgaggg aataaagcca gtgaagacga 3840 ctactggctt gccaagggta aacggacagg ttgagcgact aaattctatt atcatttcag 3900 tactctcaaa gctaagtgtc gatgacccca cgaatggtat taacacgacg gcttttgagc 3960 tactggttgg tatcaaaatg agaacgaaag atgatcttca gatacgagac ttgataaaca 4020 aagaggccgt ggcagttttc aatgaagaac gcaacgagtt acgagtaaag gctaagactc 4080 agatattgaa gctgcagaac gagaacaaga aggactacaa tcttcgacgc aagcaagcaa 4140 gacgatatga tattggcgac ctaattgcta ttaaacgtac ccagtttggt agtggcctaa 4200 agctgaaacc taagtttttg gggccttata aagttgtaaa ggtgaagtac aacaacgcct 4260 atgatgtcca aaaagtaggc aacagcgatg gaccaaagaa ttctgcaacg tgcgcggaat 4320 atatgaagcc atggccaatt ggcaacgatg acagcgatga cgatgaagca ttcgggtcga 4380 aagcttagtc agaatggccg aa 4402 // ID Sola3-1_HM repbase; DNA; INV; 5258 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Hydra magnipapillata. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5258 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1392..4145 FT /product="Sola3-1_HM_1p" FT /translation="MHELSVGKKSSAVRSVSINMSTNVSKHYNKSVPVGSV FT FCHNHLKSERILNQTKTSTAANEELVEDTTDFDFEPKEILLSGEKVEGATF FT TGNNLSEALHISPFQFQIKNKKISDLSNGTKKNLRRKFERAKVQLEKRFAE FT AVAPNQSEDFISIVLKRSLETSDIDVVPDDLKKLVKIYEESDSMGKHIVLC FT LIEHEKFTKKLLMKVFGCSKYKIDQVRKMKNANIGITIPVEKEIKRNRLDQ FT KKAEHFLDFIFNSNLLQDVAYGVTKIKFDSNDVYKISRAVLTAKMSHTIAF FT YNEVCRNENYSPLSESSLWRILHAVKPSQQKCLAGLDDITAAGMNGFSMMQ FT DLSLKYKDRELSNMFERSKRYYKTSFQFNCSATYPNSISHCPIFALSDLTN FT KQLQQISIAPNDKICYECENLNKSLVEIKRLASNNSADEDTIYDIEIAEKD FT IIDYKKHLMRDCQQKKAKVFAFESLDEESGFWLKDYCQKVLPSKFREGQKE FT YFGKKGMSLHVDIFFTKKNNVLQKKVYFTALYRCEQSLSETLSIAQLVLPK FT FKHDHSGVNKLYAKSDNASSYHGNFIMEALFMLCKNNQIQLKRYDYNEPCR FT GKDQCDREAAGAKSLIRSFVDAGNDLVCAEDIYTALHYGHGMKNSAVGVAT FT IKGKSKLSGTKIAKISQYHSFEFFQNYMTMWRYYAVGDGVIQEYSNVNFGL FT QMLLTHPYSDTDKKTRVFKNNMKEKLREDRSLNLFHFCSEPICNGSFNTSE FT ELEEHMMSGRHIISTLKSGMDNVRQSFIMKMQDQSNLHSYKPNSQQEVFEE FT VCSDISKYVKGWALPTRSTFRYSMRQKDILYKLFMEGEVSGKKFSPEQVHL FT LIRKELQVSEYVTTQQIRSLFSRWSRLKKDNRLSEPIDVGSTNADNEEEEE FT TADQGNNFYIYSH*" XX SQ Sequence 5258 BP; 1887 A; 764 C; 893 G; 1706 T; 8 other; gagccatttt ctgagaaaat caggcaatgc tcaaaagtga tctttcaatc agtctcaatt 60 ttacatattt ttaatttttc tagtttttga gttatggtcg gtcaaacttt tgccctttta 120 atacttggaa agtcactttt ggggttttgt gatttttaaa ctaagattaa aaccaatgag 180 accactgcac actaagtttc ttttcatatt atggaagtat aggtagaact ttaaaaaaaa 240 aatcaaaaaa accgaggaca accctttctt tgtctgaaaa tcataagctg aaatcacaca 300 tttttgaatt tggggcctac tttaaataca tttatctcgg agcgtaaaaa gtgcccgtga 360 cttatcttat ttttttttta ttgaaagagt atctcattgt tatttgaatg gtctgaagaa 420 actgaaagga tattgtgtac aaaaaaactt atgagtcatc aaagtgtcaa aaaatgtttt 480 aacyagcgtt caaatatcag ccattatggg aaatgggaaa ggtcccaaga ttgaagatac 540 agattttata ttactccata ttacaatttt ttaaatgcga aaaaaaaact tagggggtaa 600 tattacattg ctgaaataac tttacgtcaa aatgatgcta aatctgattt tttttttttt 660 tatttatacg attgtggact tgaatttttt ttttttcgag taaacatttt taacttgatg 720 tttatttgaa cataattaat tattgttagg cattatttct taatgttaca gctattcagt 780 atggtttatt gagcatttac ttctgaatta ttcaaaacag agaatgagca ttttatgcta 840 aatacacttt aaaaaaaaaa cacttagttg caaaaaatat gtcagagcta gaggttgttg 900 ttgaaaatgt ttgcaaagtt aatgatgtca agtgtggacc atatcctcgt ttaaatgaac 960 aaaggttatt caagatacaa gatctgacat tggatgtcaa agcacactta gtaaaaatta 1020 aagtaagaaa tgctcttagc agttgtaaaa aaaaccgttt gcatcatata gttgtgtaaa 1080 caattgttgt ttaattcctt cagtttacat tgcataatac ataagtttta tttttttagg 1140 ttttaaagat acaccaagag acggaatgga cagaaaagtc acttatagaa aaccgagggt 1200 gcatatcaat ttctgataac gactatatat gcccttacca tagatttaac tttggtaaga 1260 atttgtttac aaccgtattt caacaacaaa aaaaattaac tactaaacct ttaaaataaa 1320 tatttcaaat tttgattttt gttaaaattc aaggtattgg ttgggttcct ccaagaagat 1380 gctgtcatcc aatgcatgaa ttaagtgttg gtaaaaaatc atctgctgta agaagtgttt 1440 caataaatat gtcaacaaat gtttcaaaac attacaacaa aagtgttcca gtaggttccg 1500 tattttgtca caaccaccta aagtcagagc gcattctaaa tcaaactaaa acaagtactg 1560 ccgctaacga agagctagtt gaagacacaa ctgattttga ttttgagcct aaagaaattt 1620 tactttctgg agaaaaagtt gaaggtgcta catttactgg taacaaccta tcagaagcac 1680 tccacatcag cccttttcag tttcagataa agaacaaaaa gatatctgat ctcagcaatg 1740 gaacaaaaaa aaatttacga agaaaatttg aaagagccaa ggtacagctt gaaaagaggt 1800 tcgctgaagc tgttgcccca aatcagagtg aagactttat ctcaattgtt ttaaaaagat 1860 ctttggaaac ttcagacatt gatgtagtac ctgatgatct aaaaaaattg gttaaaattt 1920 atgaagaaag tgactctatg ggaaaacata ttgttctgtg tttgattgaa catgaaaaat 1980 ttactaagaa gttgctaatg aaagtttttg gatgctccaa atacaagata gaccaggtac 2040 gaaaaatgaa gaatgctaac ataggtataa ctattcctgt cgaaaaagaa attaagcgaa 2100 atagactaga ccagaaaaaa gctgaacatt tcctcgattt catatttaac agcaaccttc 2160 ttcaagatgt tgcatatgga gttacaaaaa ttaaatttga ctcaaatgat gtatataaaa 2220 tatctcgtgc agttttgaca gctaaaatga gtcatacaat tgcattttat aatgaagttt 2280 gcagaaatga gaattactct cctttatcgg aaagcagttt gtggcgaata cttcatgctg 2340 ttaaaccatc tcaacaaaaa tgtttagcgg gactggatga tattactgct gcaggtatga 2400 atggtttctc tatgatgcaa gatttatctt tgaagtacaa ggatagagag ctgagtaata 2460 tgtttgaacg tagcaaacga tattacaaaa ctagtttcca gtttaactgc agtgcgacct 2520 accctaattc tatctcccac tgcccaatat ttgctttgag tgatttaacc aacaagcaat 2580 tgcaacagat ttcaatagca cctaatgata agatttgtta tgagtgtgaa aatttaaaca 2640 aatccttagt tgagataaaa agattagcat caaataattc agccgacgag gatactattt 2700 atgatattga aattgctgaa aaagatatca ttgattataa aaaacattta atgcgtgatt 2760 gtcaacaaaa aaaagccaaa gtttttgctt ttgaaagtct tgatgaggaa tctggatttt 2820 ggttgaaaga ctattgccaa aaagttctac catccaaatt cagggaaggt caaaaggaat 2880 attttggaaa aaaggggatg tcactgcatg tagacatttt ctttactaag aaaaacaacg 2940 ttttacagaa aaaggtgtat tttacagctc tttataggtg tgagcaaagt ttatcggaaa 3000 cactatcaat tgcccaatta gttcttccaa agtttaagca tgaccattct ggtgttaata 3060 aactctatgc aaaatctgac aatgcttcat cgtatcatgg aaactttata atggaagctc 3120 tttttatgtt gtgcaaaaat aaccaaatac agcttaagcg ctacgattat aacgaacctt 3180 gtcgtggcaa ggatcagtgc gacagrgaag ctgccggagc aaagtcattg attcgtagct 3240 ttgttgatgc tggtaacgat ttagtgtgtg cagaggacat ttatactgct ttacattacg 3300 gccatgggat gaaaaattct gctgttggtg ttgccactat aaagggcaaa agcaagttga 3360 gtggaacaaa aatagctaaa ataagtcagt accattcgtt tgagtttttt caaaactata 3420 tgacaatgtg gcgatactat gcagttggcg atggagttat tcaagaatat tctaatgtca 3480 attttggtct gcaaatgtta ttaacacacc catatagtga cactgacaag aaaacacgag 3540 tttttaaaaa caatatgaag gaaaaattgc gcgaagatag atcactaaat ttgtttcact 3600 tttgctctga gccaatttgt aatggatcat tcaatacatc agaagaacta gaagaacata 3660 tgatgtcggg aagacacatt atcagcactt taaaatcagg aatggataac gtgagacaga 3720 gtttcataat gaagatgcaa gatcagtcaa acttacatag ttataaacca aactcacaac 3780 aggaggtatt tgaagaagta tgctctgaca ttagcaaata tgttaaaggt tgggctttac 3840 ctacacgcag taccttccgg tatagtatga gacaaaaaga tatattatat aaactcttta 3900 tggaaggtga agtatcagga aaaaagttca gcccagagca ggtgcacctt ttgataagaa 3960 aagagcttca agtctcagaa tacgtaacaa ctcagcaaat aagatcacta ttttcacgtt 4020 ggagcaggct aaaaaaagac aatagattaa gtgaaccaat tgatgttggt agtactaatg 4080 cggataacga agaagaagaa gagacagcag atcaaggtaa taatttttat atttactctc 4140 attaatatat attttaatat tattttgtaa ttataaatat ttgtgtttgt ataagtgttt 4200 gtataaatat ttatattttt ctttctgtag atcaagaaga attagcaaat gaagaatttg 4260 aaaaggaatt tataagcatt gccattgatt tatctagtgc atggcatgaa aacgattaga 4320 ttgttgttat ttatataggc aattggtata tatatataaa cttatggatg atcaatttga 4380 aaaagcccaa aaggcatttc aagcaacaat gcactaaaat caaagatata agtgtaagga 4440 tatgtaaaac tgatttataa cgcaagcaaa cacataataa gataaggacg tcatataata 4500 tattgattaa ttccgctgct acattcgttt tttttttttt aaatatataa aaaaatttaa 4560 ctttttgtgg tggagttatt ttaacaataa aatattaccc cctaagtttt ttttttgcat 4620 taaaaaaatt gtagtgtgta gtaatataaa atctgtatct tcaatcttag gacctttccc 4680 atttcccata atggctgata tttgaacgct arttaaaaca ttttttgaca ctttgatgac 4740 tcataagttt ttttgtaaac aatatcctct magtttcttt agaccattca aataaaaatg 4800 rratactctt tcaaaaaaga ataagataag ttgcgggcac cttttacgct ccgagataaa 4860 tgtatttaaa ataggcccya aattcaaaaa tgtgtgattt cagcttatga ttttcagaca 4920 aagacaaggg ttgtcctcgg tttttttgat ttttttttta aagttctacc tatacttcca 4980 taatatgaaa agaaacttag tktgcagtgg tctcattggt tttaatctta gtttaaaaat 5040 cacaaaaccc caaaagtgac tttccaagta ttaaaagggc aaaagtttga ccgaccataa 5100 ctcaggaact agaaattttt ttttaaaaac ttttttttaa aaacttacag gctttttttt 5160 taccctattc gcattactgt ataaaaaatt gagcacgatt aaaggatatc acttttgagc 5220 cgacagtact tttgcctgat tttctcagaa aatggctc 5258 // ID DNA7-1_CQ repbase; DNA; INV; 468 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA7-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-468 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 76-76 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 22 sequences with >95% CC identity. 7-bp TSDs. XX SQ Sequence 468 BP; 141 A; 93 C; 96 G; 138 T; 0 other; cacgggattg aaactcgtct gaaaaacctg tatcattttc tcattcgaaa cagtggcgcc 60 acgtggtggt agctgccctg tttattacac tgtgaaatta tccatggcca atccaaggaa 120 ccagaaggtg acaacagaaa tgtccgtcca aaaggggtgc tgcaaaaaaa cttattattt 180 ttaacaaatt ttcgaacaaa tcagcaacga attacaagtt tgacagtcga actacactta 240 aaagttggtt ttgttatttg tttttgcaaa accgcatatt tttaacgaaa gtgccaaaaa 300 tattcgtaat caccttttgc ctcttggtgg cgctttgtgt tagtatgtgt gtggtcgatg 360 tttgcggacg tgcacgcaga acacagaaca gtgagaacta gcgaaaatgc ctcactttct 420 tctgatacaa gtcgccatat tgaaattgct tgaagattca tacccgtg 468 // ID TTAA11_AP repbase; DNA; INV; 374 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 0) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA11_AP. XX NM TTAA11_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-374 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2076-2076 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 374 BP; 120 A; 78 C; 72 G; 104 T; 0 other; cacgttgacc gccaccggcc gccggcggtc acacgattat gcatcatctg tacgccaagt 60 atatttttca gtgtcattcc aaatttgtat attatactac gtatttgaaa aaaagtttaa 120 aaatataaaa aaaatttaaa aaatacattt attacatcaa aaaatttgta atgattcccg 180 ctacgccacg gcactttttt gggtgcatta ttgtagcgcc atggaaaagt taataatata 240 aaaccatatt catttatggc cgccggcggt catgtggcgt agtagtaatg atcaactgtg 300 gccgccggcg gtcacaaaaa catacaattt gaaatttaca aatgaccgcc ggcggccaca 360 tggcggtcaa cgtg 374 // ID CR1-49_HM repbase; DNA; INV; 4624 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-49_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4624 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1877-1877 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(767..1444,1448..3754,3717..4427) FT /product="CR1-49_HM_1p" FT /translation="MPPKADNSELNNKYETLLKQVEALSKTVVEQTSTIIA FT LTKRIQQLEEKPIASSTTTETWNFVVGKKSTKTPAQLDMLNCVANENKDRM FT KRDKNLIVFGIPASTKKEPNEKQVDDENKIKELLTNVKINSDNIIKCFRLR FT SKDPNKPPPIIVETTDTVTRNNIVKAARRKFEGIYVNPDLTEAQRRQDRQL FT RDELKKKNEPLNLKNNWAKTSHYYVIRNNAVVKVDKQSTVSNLKPYNDNLN FT INEISLERKSFSTISLLQQSAFHKIPCCYTNATSLNNKMPEFLASIETYNP FT KIIFVTETWWHESSITNIENYMLFRKDRVSSRGGGVAIYVDTSLKPNEVSE FT IVLLDSPEQIWCEISYGSEHILLGCIYRPHTLDPLENIEASIIKANELLSK FT IYSGLLICGDFNLPGINWDLSSIPTVSNTDILSSNFIDILAESFLIQFVTK FT PTFQLDEITSNNTLDLIIAENENRVLDIIHHAPLTSLKKSHHVLQWDYVVK FT SAAVVGKHIEKRVYHKSNFVKISEYFNNINWQMEFNGHNTDDCYNIFLNHY FT EAACKNYVPVTNLKWIRRRSEWMNDDIKNKIKAKNKLWFRCRANGYKNLKL FT VNEYKACNIQLKKLIYKARCEFELDLAIKSKHNPKILFKYVNSKQNVRTTI FT TALKGINGDTITNPIEIANRLNNHFQSVFNKNEDDELPYFIKRTEAICKYS FT QITLDLVKNKLSNLSVDKTVGLDGVSAYVLKNCSDSFSIPLSAIFQKSLDS FT GCCPKVWKKANVTPLFKNGSRLDPGNYRPISLTSVVCKVMEKILRDTMVNH FT LVEHSLISKNQHGFVNKKACVTNLLESIDMMSKALSDKISMDVAFIDFSKA FT FDMVPHKRLIYKLEAYGFTGNLLNWIKSFLHQRFQRIVMGDYVSSWLEVFS FT GVPQGSVLGPILFIIFVNDISDIVNHPCKMYADDTKLLARLDHPLASQMLQ FT TDIYNIADWCNTWKMKLNLDKCKIMHIGKKKQLLCLLNASVKKNNFYVYSM FT PANNNNSVELASTLVERDLGIMITPDLKWHNNSTFATNKANRVLGMLYRTF FT SHMTPQLLKILYTTFVRPHLEFAVAASNPSSRIDIDKIEKVQRRATRLVPS FT YRHMSYKDRLNILNLTSLETRRIRGDLIQAYKIINNIDIVSWSEPHVLRVG FT QSNKIKTRGHHLKMTREYVKNCEQRHQFFTNRIVNHWNALPSEVVCASSVN FT SFKAKLDAEISNNPHKYRDYN*" XX SQ Sequence 4624 BP; 1663 A; 790 C; 731 G; 1439 T; 1 other; tttttcgcgt gtttcacttg aagagataag acgtatttta ttttggcgta agtcttcttt 60 tttttcctgc attataagta ttttgctatc aactttttca ctccaatata tctcatctct 120 atttatatat ctcatcaact atatctcata cttctatttc ttagttttgg tatttacttt 180 acttttttag taatactttt ttagtattac tttcgctact tttctacatt tactattata 240 tattccttac gtgcttttga gtaatttaat tatagtattt tgtttcttaa tatttactcg 300 tatatacata attatatacc tctattatat gttattgttg taaacattta tttaattaac 360 ggttttattt tttcttaaat tcagtgttat tatcttagag agttttttga aagtttcagc 420 gactttttat tttttatttt tattatactc aagtcgtatt ctaaactctg aagytgcctt 480 tctatcgttt tctaatcttt ttgtttattt tcacttagat aaataagtca ttgttcgtat 540 agagaaacta ttgcatttat tgtaaaaaaa aaaaaaaaaa aaaaaaaaaa aaagttattt 600 ataaaaaatt tatataaatt atatgaaatt tatattcgta ataataatac aaatatatta 660 ttatatgtct atatatattt atatatatac atatatatat caatatatat ataatttaaa 720 tattatataa actatatata ctgttgttta aacatattta cgctaaatgc ctccaaaagc 780 cgacaattct gagttgaata acaagtatga gacgcttcta aagcaagtag aagcgttatc 840 aaaaacagtt gttgaacaaa catcgacgat catcgccctc acaaagcgaa tccaacaact 900 tgaggaaaaa cctattgctt cctcaactac aacggaaaca tggaactttg tggtcggaaa 960 aaaatcaact aaaacgccag ctcaactaga tatgcttaat tgtgtagcta atgagaacaa 1020 agatagaatg aagcgtgaca aaaatcttat cgtctttggc attccagcat cgactaaaaa 1080 agaacctaac gagaagcaag tagacgatga gaacaaaata aaagagcttc taaccaacgt 1140 taaaattaat tcggacaaca taataaaatg ttttcgctta cggtcaaaag atccgaataa 1200 acctccacct attatagttg agacgacaga cactgtcacc agaaataata ttgttaaagc 1260 agcccggcgt aaatttgaag gcatctatgt taatccagat ctgacagaag cacaaagaag 1320 acaagataga caattacgag acgagttgaa aaagaaaaac gagcctctta atttaaaaaa 1380 caactgggca aagacaagtc attactacgt tattcgtaat aatgctgtag ttaaagtaga 1440 caaatagcaa tcaacagtaa gtaatttaaa accatataat gacaatttaa atataaacga 1500 aatttcttta gagcgaaagt cattttcaac tattagtcta ctacaacagt cagctttcca 1560 taaaatacca tgctgttata ccaatgctac atctcttaac aataaaatgc cagagtttct 1620 agcatcaatt gagacataca accctaaaat aatctttgtt acagaaactt ggtggcatga 1680 atcatcgata actaacattg aaaactatat gttattcaga aaagatcgcg tgtccagtag 1740 aggaggtggg gtggctattt atgtcgacac ttcattaaaa cctaatgaag taagcgaaat 1800 tgttttatta gattcccctg aacaaatttg gtgcgaaata tcgtacggtt ccgaacacat 1860 ccttctgggt tgtatataca ggccacacac gctagatcca ctcgagaata ttgaagcgtc 1920 cattattaaa gcgaatgaac ttctgtctaa aatatacagc ggcttgttga tttgtggaga 1980 ctttaatttg ccagggatta actgggattt gagcagtatc ccaactgtca gcaacacaga 2040 tatcctttca agtaatttta ttgacatcct cgccgagtcg tttttgatac aatttgttac 2100 taagcccact tttcaacttg atgaaattac atcgaacaat actcttgatc taattatagc 2160 cgaaaatgaa aatcgggtcc tcgatattat tcaccatgca cctttaacgt ctctgaagaa 2220 atctcaccac gtccttcaat gggattacgt tgttaaaagc gctgcagttg taggaaaaca 2280 cattgaaaaa cgcgtttatc ataaaagtaa ctttgtaaag atatccgaat attttaacaa 2340 tatcaactgg cagatggaat tcaatggaca caatactgat gactgttata acattttcct 2400 gaaccactac gaggctgcat gcaaaaatta tgtcccagtg actaacttaa aatggattcg 2460 tcgtagaagt gaatggatga atgatgacat caaaaataaa ataaaagcta aaaataaact 2520 atggtttcga tgtagagcga atggatataa aaatcttaaa ctagtcaacg aatacaaagc 2580 gtgtaatatt caattgaaga agctcatata taaagctcgt tgtgagtttg aacttgattt 2640 agcgataaaa tcgaagcaca acccaaagat cctgttcaag tatgttaata gtaaacaaaa 2700 tgttagaact acaataacag ccttaaaagg aatcaatggc gacactatca caaatccaat 2760 agaaattgca aatcggctga acaatcactt tcaaagtgtt ttcaacaaaa atgaagacga 2820 tgaactgcca tattttataa aacgaactga agccatctgc aaatactcgc aaataacgct 2880 agatctggtt aaaaacaagc tgagcaattt atcagttgac aaaaccgttg gtttggatgg 2940 tgtaagcgca tatgtactga agaactgttc tgacagcttc tcaatacctt taagcgcaat 3000 ttttcagaag tcgctagata gtgggtgttg tcctaaagta tggaaaaaag ctaatgtaac 3060 accactgttt aaaaatggaa gtagactcga cccaggcaat taccggccta tatccttaac 3120 atcagttgta tgtaaagtga tggagaaaat attacgtgac actatggtca atcatttagt 3180 cgaacatagc cttatttcaa aaaatcaaca tggttttgtc aataaaaaag cttgtgtgac 3240 taatttgttg gagtcgattg acatgatgtc aaaagcatta tctgacaaga tatctatgga 3300 cgtagctttt atagatttca gtaaagcatt cgatatggtt ccccacaaga gactaattta 3360 caaacttgag gcatatggtt ttactggaaa tctcctgaat tggataaaat cgtttttaca 3420 ccagcgtttt caacgaatcg tgatgggcga ttatgtttcc tcgtggcttg aggtatttag 3480 tggcgtccct caaggatccg tattaggtcc aatacttttt atcatttttg tgaacgatat 3540 atccgatatt gttaaccatc catgtaaaat gtatgctgat gacactaaac tgctggcccg 3600 tttagaccac ccattagctt cacaaatgct tcaaactgat atctacaaca tagccgattg 3660 gtgtaatacc tggaaaatga aacttaatct cgataaatgt aaaatcatgc acataggtaa 3720 aaaaaaacaa cttttatgtc tactcaatgc cagctaataa caataacagt gtcgaacttg 3780 catcaacttt agtcgaacgt gatttaggca tcatgattac gccagactta aaatggcata 3840 acaattcaac ttttgccacc aataaagcta accgtgtatt aggcatgttg tatcgtactt 3900 tcagtcacat gactccgcaa ctacttaaaa ttctatatac aacttttgta cgacctcatc 3960 tggaatttgc ggtagcagcc agcaatccat catcaagaat agatattgac aaaatcgaga 4020 aagtacaaag aagagctacc cgtttagtac cttcgtaccg tcatatgtca tacaaagatc 4080 gtctgaatat actgaactta acgtccttag aaacccgtcg cattagggga gatctgatac 4140 aggcctataa aattattaac aatatagaca ttgtctcatg gagtgaaccc catgtccttc 4200 gtgttgggca gtcaaataaa ataaaaacta gaggacatca tctaaaaatg actagagaat 4260 atgttaaaaa ttgtgagcaa aggcaccagt tttttactaa tcgaatagtg aatcattgga 4320 atgcattgcc gtcagaagtt gtatgcgctt catcagttaa cagtttcaag gctaagcttg 4380 atgcagaaat ttcaaataat ccgcataagt atcgagatta taactgaaaa aaaaaaaaaa 4440 aaaaaaaaaa caataaagga aaaacaataa agtacaaact tatactttat actatatatc 4500 ttattgtaaa tttttatata tattgatatg aataaaatta atgtatgaag tagataaggc 4560 tgccaaaacc tgtaggttat acagggaaac ctgttgcagc aatttctcta ctactactac 4620 tact 4624 // ID Gypsy-228_AA-I repbase; DNA; INV; 4307 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-228_AA_; KW Gypsy-228_AA-LTR; Gypsy-228_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4307 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1059-1059 (2011). XX DR [2] (Consensus) XX CC Positions [1783-2289] - Reverse transcriptase CC Positions [3863-4345] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 68..1006 FT /product="Gypsy-228_AA-I_2p" FT /translation="MIVENENFARFEVAGSSPAESILGNGKSAKFSEASQF FT GQADFDLGFESLAYDEPASGTSFGRNALRDEVIIYYGEVSEEHESEGENQL FT DCSARSNSSSVRNPENFRNLSVNTNCYRNQQPVASITEAGNKSILESSSRT FT NFSTIFLQANEISTMLPTFSGHVQEDVNHFLKIISNTKEALKLDDQTMKIL FT LIKQLTAKAKYWLHSNTDFMLKSYNEIFEGLQARFAININVFQMRKQLARR FT TWTGHEPFAEYYQSKQIIAQKLNLPEKEFVQYLVEGIENQVLRNQARMQNF FT STVSRVGGNRSYVFHAITRDI" FT CDS 1087..3147 FT /product="Gypsy-228_AA-I_1p" FT /translation="MTINVLKDTDKRQHNEKVKCDGLVKFRLLNSITNLTA FT LFDTGSPISLIRRGLINEKDIQKIPYQKSFRGIGGKRVKVLGLFTNKILIQ FT NILFSIDFHVVPDNVLNLLDAIIGRDIIMQPGILTIIENDIKMYSVDHENL FT KSEDPVNEVFDNSLFNINSNSLQNLKIGDVSKDHLAIFSEIFQSKYLNYEK FT PSILHTDYEMVISLKKGEAFHFKPRRLSFDQKRKVDEKIKELLDAGIIKES FT SSPFASPIVLIPKKNDDIRLCVDYRKLNKDTVRDNFPLPVIEDLFDNLKDK FT SYFSILDLKSGFHQIKISKECTKYTSFVTPSGQYEYLRVPFGLCNAPAVFQ FT RFINKIFANLIKEGKLCVYLDDILIATKSIEEHFEILKQVFEILSQNLLEI FT NLEKCKFLFNEIDYLGYTVNSKGRKPNSSHIETVKNFPPPKNSKDIHRFLG FT LMSYFRKFIRNFAIIARPLYSLLQKDAVFRFEEEHTKAFTLLKQRLISAPI FT LAIFDPLAETELHCDASSFGFGAILLQKQSDNKFHPVSFFSKKTDEFESKL FT HSFELETLAVIHAIKRFHIYLAGKKFKIFTDCNALVQTLKKKEINPKINRW FT ALFLESYNFELEYRNEDKMKHVDALSRMYMSKRVKNSDDKNNDSKDVICLL FT DESEIERNIIIAQEQDIKLKNIKQILETTTFPNFLLLG" FT CDS 3668..4771 FT /product="Gypsy-228_AA-I_3p" FT /translation="MIDNVLRLCHDNCGHLGIEKTINEIKKLYWFSCMRKY FT VKKHIKNCLSCIFYNPSGKKEGFLKIIDKGNRPFNTLHLDHYGPISLPSSA FT RFKYILVATDAFTKFTKFFPTKTTNSDEIMQHLKNYISFYSKPLRIVTDCG FT TGFTSNKFKNFLNGHCIDHVKTAIRTPQANGQVERMNKTLTPMLAKLCNET FT NFKWDQLLPKVEYIFNNTYNKSICNYPSILLFGQLQNNIANENCNFESFVL FT NRQPSNEVQNLLEVREKAQQDNLKSQNYNKKLIDKKRIKTNNYREGDFVVL FT KINESHKLSPKFKGPYKISKILPNERYVVVDIEGFQISNVPFNAVCSPQNM FT RKWIDDDDLTDDCFDEDIENIRMAE" XX SQ Sequence 4307 BP; 1626 A; 635 C; 739 G; 1307 T; 0 other; acgtcagaag tgggatccac caacggctac agctgaaaat ctatacagga gaatatcgaa 60 accagagatg attgtggaga atgaaaattt cgcccgtttt gaagtagctg gaagttcacc 120 tgcggaatct attctaggga atggaaaatc tgccaagttt tcagaagcaa gtcaattcgg 180 acaagcggac ttcgatctag ggtttgaaag cctcgcctac gatgaaccag cttctggaac 240 cagctttggg agaaatgctt tgagagatga agttattatc tactatggag aagtctcgga 300 agaacatgag agtgaaggag aaaatcaact ggattgttcc gcaagatcaa actcatcttc 360 tgtgagaaat ccggagaatt tcaggaattt atcggtaaac acaaattgtt atagaaatca 420 acaacccgtt gctagtataa ctgaagctgg aaacaagtca attttggaat catcatcaag 480 aacaaatttt tcaactattt ttttacaagc aaatgaaatt tcaacgatgc taccgacgtt 540 ttctgggcac gttcaagaag atgtgaatca ttttttgaaa attatatcaa atacgaaaga 600 agcactcaaa cttgacgatc aaaccatgaa aattttacta atcaagcaac tgacagcgaa 660 ggccaaatat tggttacact caaatacaga tttcatgttg aaaagctaca atgaaatatt 720 tgaaggttta caagctagat ttgccatcaa catcaacgtg tttcaaatga gaaaacagct 780 tgcaagacgt acttggacag gacatgaacc atttgccgaa tattatcagt cgaaacaaat 840 aatagctcaa aaactcaact taccggaaaa agaattcgtt caatatcttg tggaaggaat 900 tgaaaatcaa gtacttcgta atcaagcaag gatgcagaac tttagtacag tcagccgagt 960 cggaggaaac agaagttatg tttttcatgc aatcacccgg gacatttagc agcaaattgc 1020 agatcaaacc aaaagccata tgatgctcgt attacaaatg gaaattcaaa acaaaaacat 1080 aatgacatga caattaacgt gttaaaggac acagacaaaa gacaacataa cgaaaaagtt 1140 aaatgcgacg gattggtaaa atttagatta ttgaatagta ttacaaattt aactgcttta 1200 tttgacacgg gtagcccaat ctcactgata cgtcgaggtt taattaatga gaaagatatt 1260 caaaaaattc cttatcaaaa aagttttcga ggtatcggtg ggaagcgggt gaaagttcta 1320 ggattattca ccaacaaaat tttaattcaa aatatccttt tcagcattga tttccacgta 1380 gttcccgata atgtgctaaa cctcctcgat gcaataattg gacgagacat catcatgcaa 1440 cccggtattt taacgataat tgaaaatgac attaaaatgt attcagttga ccatgaaaat 1500 ttgaaatctg aagaccctgt aaacgaggta tttgataata gtttgttcaa tatcaattca 1560 aactcactac aaaatttaaa aattggggat gtttcaaaag atcatttggc aattttttca 1620 gaaatttttc aatctaagta tcttaattat gaaaaaccgt ctatactaca cactgattat 1680 gaaatggtaa ttagtttaaa aaaaggagaa gcctttcatt tcaaacctcg aaggttgtct 1740 tttgatcaaa agcgtaaagt tgatgaaaaa atcaaagaac ttttggatgc aggtatcata 1800 aaagagagta gttcgccttt tgctagccct atagttttga ttcccaagaa aaatgatgat 1860 attaggctat gtgttgatta caggaaatta aacaaagata cagtacgaga taattttcca 1920 cttccggtca ttgaagattt gtttgacaat cttaaagata aatcatattt ttctattctt 1980 gatctgaaat ctggttttca tcaaataaaa atttccaaag agtgtactaa atacacatct 2040 tttgtgacac ccagtggtca atacgagtat ttacgtgttc cttttggctt atgtaatgct 2100 cctgcagtat ttcaacgctt tataaacaaa atttttgcta atctaatcaa agaaggtaaa 2160 ctatgtgtgt atttagatga catattaata gctacaaaat caattgaaga acatttcgaa 2220 attttgaaac aagtttttga gatattaagc caaaatttat tggaaatcaa tttagaaaaa 2280 tgtaaatttt tgttcaatga aattgattat cttgggtata ctgtcaatag taaaggtcgt 2340 aaaccaaaca gtagtcatat agaaacagtt aaaaattttc ctcctcctaa aaatagtaag 2400 gatatccata gatttctagg tttgatgagt tacttcagga agtttattcg aaattttgct 2460 attattgcta gacctttata cagtttgtta caaaaagatg ctgtatttcg tttcgaagaa 2520 gaacacacga aagcttttac acttttgaaa caaagattga tttcagctcc tattctagca 2580 atttttgacc ctcttgctga aactgagctt cattgtgatg ccagttcatt cggattcggt 2640 gcgatattat tacaaaaaca gtcagataat aaatttcatc cagtcagttt ctttagtaaa 2700 aaaacagatg aattcgaatc aaaacttcac agctttgaat tagaaacttt agctgtaata 2760 catgcaataa aaagatttca tatttattta gctggcaaaa aatttaaaat attcactgat 2820 tgtaatgcat tggtgcaaac gttgaagaaa aaggaaatta atccaaaaat aaaccgttgg 2880 gctttgtttt tggaaagcta taattttgaa ttagaatata gaaatgaaga caaaatgaaa 2940 catgtagacg ctttaagtag gatgtatatg agtaaaagag tcaaaaatag cgacgacaag 3000 aataatgatt caaaagatgt tatttgctta ctagatgaat cagaaataga aaggaatatc 3060 ataatagcac aagaacaaga tattaaattg aaaaacataa aacagattct tgaaacaact 3120 acatttccaa attttttgtt attatagatg gcattttgtt tcgaaaggaa agtgagcgaa 3180 atttattagt tgtgccaaaa atatgataga caatgttttg agactgtgtc atgataattg 3240 tggacattta ggcatagaaa aaactataaa cgaaattaaa aagctatatt ggttctcgtg 3300 tatgagaaaa tatgttaaaa aacatataaa gaattgtttg tcgtgcattt tctataatcc 3360 atccggaaag aaagaaggat ttttgaaaat aatagataaa gggaatcgcc ctttcaacac 3420 attacattta gatcattatg gtcccatttc attaccgtct agtgctagat tcaaatacat 3480 tttagttgcg acagatgctt tcacaaaatt tacaaaattt tttccaacca aaacaacaaa 3540 ttctgatgaa ataatgcaac atttgaaaaa ttatatcagt ttttatagca aaccattaag 3600 aattgtgact gattgtggaa ctgggtttac ttctaacaag tttaaaaatt ttcttaatgg 3660 gcactgtatc gatcatgtaa aaacagctat tcgtactccg caagcgaatg gacaagtaga 3720 aagaatgaac aaaacattaa cacctatgtt agcaaaatta tgtaatgaga ctaattttaa 3780 atgggaccaa ctgttaccta aggtagaata tatatttaac aacacttata ataaatcaat 3840 ttgtaattat cctagtattt tactttttgg ccagttgcaa aataatatag caaatgaaaa 3900 ttgtaatttt gaatcttttg ttttaaatag acaaccatcg aatgaagttc aaaacctttt 3960 agaagtaaga gaaaaggcgc aacaagataa tttgaaatcg caaaattaca ataaaaaact 4020 aattgataaa aaacgcataa aaacaaataa ttatagagag ggagattttg ttgtattgaa 4080 aataaatgaa tcacataaat tatctccaaa atttaagggc ccgtataaaa tttctaaaat 4140 tttgccaaat gaacgatacg ttgtagttga tattgaaggg tttcaaattt caaatgttcc 4200 ttttaacgcg gtttgttcac ctcaaaatat gagaaaatgg atagatgatg atgatttgac 4260 tgatgattgt ttcgatgagg acatcgagaa tatcaggatg gccgagt 4307 // ID BEL-38_AA-LTR repbase; DNA; INV; 695 BP. XX AC supercont1.26; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-38_AA_; KW BEL-38_AA-I; BEL-38_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-695 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.26; Positions 1258236 1257542. XX SQ Sequence 695 BP; 203 A; 147 C; 180 G; 165 T; 0 other; tgtctgggag tgacaccaaa cccaaccgat atcacccata tcgcaaaccc aaccgaccca 60 tatagcaatg cacaaatgtg acctacattg cgagcaagaa tgcaacgaga tccattgtgc 120 tgtgttggtc aggcagtcag tggtcgccgt gaccattgct gagaggagcc aagagcggag 180 cgatcgcaac atgatggctc catgtatatt gctgcggcta cccgtaaggc tactgcagtg 240 gtgaatatcg caaccggtga taagagagcg ataaggaacg atccgactct cagtgggccc 300 gaagaaatat catcgcatcg atcgaggttg gatgatgacc atttaggata ggtagtgtta 360 gaagtattcc aacgacccac ctatcgtgag tggaggctgc gcgcgggaat tttaatttgt 420 tgttatgtgt taggattaag attgaatgtt agaatataag ccgaaagtta atgtaccgtg 480 taatgtgtaa taaatgtagt gtctagtaaa tgtgttttta taaatgtcgc gtgtgttgat 540 tgtgcctaat tggatcccga aacacccgga atactacagc aagggagttt gtagtggaag 600 ccctcacccc taaaacctca ccatccatcg agtgttggag agccaccaac ggaacaaggc 660 tatcgaaggc gaggaagtaa atcggttacc caaca 695 // ID SATII_PC repbase; DNA; INV; 157 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Palorus cerylonoides AluI satellite II repeat region, consensus. XX KW SAT; Satellite; Simple Repeat; AluI satellite II repeat; SATII_PC; KW tandem repeat. XX OS Palorus cerylonoides OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Palorus. XX RN [1] RA Mestrovic N., Mravinac B., Juan C., Ugarkovic D. and Plohl M.; RT "Comparative study of satellite sequences and phylogeny of five RT species from the genus Palorus (Insecta, Coleoptera)."; RL Genome 43(5), 776-785 (2000). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Palorus cerylonoides AluI satellite II repeat region, RT consensus."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [2] (Consensus) XX SQ Sequence 157 BP; 49 A; 22 C; 22 G; 64 T; 0 other; ctggaaatcg ctttttttcc aaaaccctcg ttttttggat gatttcagca actaccgtat 60 tttcgttatt ttaaatatta aaggtaatct gattttattt ctgataataa aggcatagat 120 cttacacatt ttgaatatag gcaattgtta aatatag 157 // ID DNAX-2_TCa repbase; DNA; INV; 1506 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-2_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1506 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 671-671 (2009). XX DR [1] (Consensus) XX CC 3bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1506 BP; 533 A; 280 C; 249 G; 444 T; 0 other; atgcccggtt gcacaaaaga taattgacaa ttaacaaagc gttaatcgtc catcaaaaat 60 tttgtgcaac acaaaacacc gttgacggct gattaatctc tcatcaaata tttgatgaac 120 gtcaaagata attgtcaatt aatcgttgat taactccgtt ttccgttgca caaacgcaaa 180 tcaaggcaaa tgtcatagtt aatcagtgat taacgaatca tcaatttgat ggccgaacga 240 cgggcgatta atctctgatt aaccgtcttt aacacttgtt gtttgagaat ttaaggtgtt 300 tttgataaaa tttcattaac taaactgaac cattacctaa ggggtaagac aataacaaga 360 attaaagaag aaaaccttga atttatttta ttggatgaga atggttatgt gcaaatgact 420 gaagtgtctg agaggcctgt gcagaaaatc agagatcttg aggatctgtt tgagatgttg 480 gttaaccatg agtgtaaaaa gcgctttagt ttaaataaaa accagagtaa aaattgcacc 540 aaaacgcaac taatcaaatc tctcagtgca caaacaaaat tttaattttt tttgcgtttt 600 tggttttttg tcttttgtag aacatttgta gggttatcac agaaagtcac aacacaactc 660 gctcacacat gaatttgtat cgtttgttca atgaaacaat tcgaggaata actcattttt 720 taaccaataa cataatatcc actttattgc ttcccaatgt gccaatgcgc atgattcact 780 tttaaacagc attgtccaac agttcgaata acttaacaaa aaaagaaatg caaaaacttt 840 attagtttta gttgaaataa aaccccaatg ttcaaatcaa ggtttattga ttttggtaat 900 aaacacgtgt agattccttt acgccctaac ttcatctacc cttagggcta ttcccttgtt 960 tcgccttcag gaatcgcttc ttggtacccc cttggcactt agtggcaagg caaaaacccc 1020 cgaaaaaaga aaagtggaga gttcgcgaac acgtcaaaat ttcacaacgc caattttgct 1080 tcttgcaccc tttaggtcta gaaattttcc ataaactcat atcactagaa ttaacaaatc 1140 aacacaaggc cacaaacacc atttctggca tcatagattg tctacaccaa acagaacttt 1200 ttgaagacga aagaacacaa gttaaatcgt caaataaagc actgagtaaa acccgaagtc 1260 cgagaaaaca caatattaat aaaaaaaatc aggctgtcaa atgtatatca aatgacaaat 1320 tcaagacgta aggccttgtt gccaaattgg ttttttgcta atggatgatt aacggagcat 1380 catatgatga acgattaaaa catgagcaaa ttgttttgtg caacgcaatc aaaattaaca 1440 agccatcaat ttttgacgga ccatcatatg atggccgatt aatttgactt ttgtgcaacc 1500 gggcat 1506 // ID BEL-106_AA-LTR repbase; DNA; INV; 787 BP. XX AC supercont1.254; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-106_AA_; KW BEL-106_AA-I; BEL-106_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-787 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.254; Positions 385287 384501. XX SQ Sequence 787 BP; 222 A; 223 C; 189 G; 153 T; 0 other; tggaagtagc caaactttca tccgggctaa accttggagc ccggcccaaa ttcaaaccaa 60 aatctacttc tgatgtgtga cattgtcaca caaaaaacat acgaatgcag caccagcata 120 aaatgtgctg ccaacggtcg caatcgtcgc gacaaagtcc atagcagtca gcaacatata 180 tccggctggc cagcggtcac gaccgcgagg gtctaacgtg catcgatgtt cccgctgcat 240 cagcaatgcc atacgctgcc catcaactat aaactgggca gcaaccaccc aagtcgcccc 300 gagaaggaag acatcgttcg acatcgcaac acacctatcg caagaaattg gagatgtggc 360 cattctcgga aagcagcccc gaagtcacgg tggagaacaa tatctccacc catacagtgg 420 cagggtatac gatcgatgcc ctgttcatca tcgccgtgat catcaccgtc atcgtctggc 480 gttggcggca aaacatgcgg cgcatccggt cgttggaggc agcgaccagg cgcagccaac 540 tcaacgttta agcgggcgtc acccgacgcc gcttacgcag caatttaggt gctccaaccg 600 gagcacatca aagttggtgc acccggcacc aaaggacgag cgaaagggaa gagtgcacac 660 ggcactactg tgtacaacat aatcaaactc tgtaagattc tgaaatatat ttcgtcagtc 720 tataagcgag cgcgaacggg tacagtcttt tttcttgaaa gaacggtccc cggtttgtgc 780 accatca 787 // ID Gypsy-176_AA-LTR repbase; DNA; INV; 1569 BP. XX AC supercont1.156; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-176_AA_; KW Gypsy-176_AA-I; Gypsy-176_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1569 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.156; Positions 239938 238370. XX SQ Sequence 1569 BP; 446 A; 342 C; 309 G; 472 T; 0 other; tgtaaccgtt ggtcccataa tgtatgcttg tagctccacc acaagtatgc tttccttttc 60 ctaatttgtc ctttatgagt tcttaccaaa agtaccattc gaccctgtcc taattaaaag 120 aactcaaaca ctcatgttgt ctgacagtca ctcacctgcc caatttagtt catttgcagc 180 ggtttgaatc ttaaacttgt atgcaagaca caatgtataa cggcacgaaa ggtaacccac 240 ggagggcaat ttaacaaatg ttcggcaaac ttcgtgccca tttctcacat atttaacgtt 300 tctgagcttg gcctaaatct aattctgctt ttctgtccgt gacctttcaa ggagtcagaa 360 gtgcgcgacg tgtccttaaa atttaaagtg tgtattgcat tgtatatagt tggtgtaaat 420 tatttaaatt attggtatat ttctaggata aaacttaaaa gcaaaaaact aaaagtagtt 480 aaaatcgcct taaatctagt atcaccagtt attagtacgg ttagtctagt gcgaagaaaa 540 gtttaggcat tgtgtgataa tgtggtcttt tgttctttta tatagttgta aacctattgt 600 aaaaaccaaa aaagaatagg aaacaatcgt gtgccaacaa ccattataaa aaggtaattg 660 taatctagat tgttaattat gtgactagat atagtggtac aagcggttag ggtatcaccc 720 tatttcgaca accttctaat ctgcttggac aggtccgtcc agcgggcctt tttggtttct 780 ctgttcaagt gacttatcca ggacgattgc tggtaagccc atattttccc cagagccacc 840 tgtgcgacat ccgaggactg aaggatcggg aaacgccaag aaatctcgat ttacgacaaa 900 cggacagaaa ttgtatccca gccttggaag acgacaaagc tctacataag ggaaccgtgt 960 cctgttcagg acgaattagc caccgccatt gttgaaccac catttgtcac catcgtccag 1020 ccgccaccac cattgccatc tgccacttcc accgcatgtc aacccgccgc cacctttata 1080 aaccactgca aaaagtgtta aaccgcaagt accgctagct agagttaaga gagtgacgtc 1140 acttcaatat taagccaatg ttaggaataa aatcaggcat gttgttaaaa tttgaatcag 1200 tggcatagtt ctttcctagt tatccctgat ttgggtaatt tgcacctctt ttggttagtt 1260 tcggcttttc gtgtacatta ataagtcccc caggcctcgc cttgattgga ccgttctgaa 1320 gggcagatag tgtggttttt gagcaaccct tccgtgagaa gtctgtggtt catctttcca 1380 gcgtaacatc gttttttatt tctcccagcg atcccactcg gtgaacataa ttgtggtagt 1440 gtgagtgtaa ccgactgaca gcataagcga ccctgttaac ataattaccg taaatggtag 1500 taatattatc ctagcctagc ataagtgcat ctaggcgaaa ttcctccact gagcagcgtg 1560 taggctaca 1569 // ID BEL-10_DWil-I repbase; DNA; INV; 8555 BP. XX AC scaffold_181117; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_DWil_; KW BEL-10_DWil-LTR; BEL-10_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-8555 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181117; Positions 203477 194923. XX CC 'GTATC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3090..5060 FT /product="BEL-10_DWil-I_1p" FT /translation="MFVFFCSIKLPKNTLSSWEQSLGDKTAIPEWRMMDDF FT LTERFRTLEAVENTTALAISVPTPKPPRNPLPATRKANTFEAKVTTKPKSC FT DLCMKENHPVRRCPRFLQMTVPERVAYIRKKSLCLNCFARGHQVKDCPSQH FT NCFTCKGRHNKLLHKDARPENNPTPVPAQPSSHNIVRTYFTSEARSSRDKL FT LGTAIVDFWHKGTTYASRALLDPASEANFISERMFRLMKLPFQTVRAEVTG FT INGKVNRTLKVCHISIRSSHGPPVQIEIEAFVLSQVADDLPSCTIAQATLE FT GMPNIPLVDPSFRLKAKIDLLLGIDVIPAVTLSGIQTEVCGSLMAQETRFG FT WVISGPLQGVPVSSCAAFTTRVAVSREEGLETLLTRFWEVEDLPGKPVEDS FT DSVCEVNFQKTTRRDVSGIYTVSLPFRDPTNINLGHSRPGALAQFLKNESR FT LLKNIPAKTEYDAVIQEYLDLGHMRQVPFDGTNNNFYLPHHAVFKPDSTTT FT KVRVVFNASSKSTNGVSLNDILYPGPVLQSDLTTQILKWRFLRVVFKADIT FT KMYRQIMVDPIHTPFQRILFRNKEGQLGDYELNTVTFGINCAPFLAIRVLQ FT QLASDVQSKFPLASDIIKSYMYVDDVLTGAHNEHGIAWSMNFKTLLVQLAF FT LCASGDRM" XX SQ Sequence 8555 BP; 2303 A; 2149 C; 1863 G; 2240 T; 0 other; atttggtcct tcgagccgga tactaagatc ctctcgaacc attagcatcg gtggaatttc 60 atccatattt caagctaagt aattttttcc aaattatagg tatatgagta catgtacatg 120 cctccagcat ccgtacccat actcacatac atataattgt tgcccttgat tctcctaatc 180 caagtaaaat tactttcgcg caaatttttc ctcctcaaag gagagaaaat tttggtcccc 240 aaatttttta taaatatata catatgtgtc cagtacataa ggcgccatat tggttcgccc 300 acttcaagtt tttttttctc gcactttggc atatataggc caagtatata aggcgccata 360 tttttatttg cctatttcca atatatattt ttttttcgca atttcaaaaa atattaacac 420 atatgtatgt atgtatgaag tgaatcaaac ctacgaaatt aaaagtgatc tcctacatat 480 aaaatatata tatgcaagca gagtgtgtga atgtgacaaa aaattacaaa tttatccagt 540 gctacattaa atccaaaaat aaaggttttg tgcaaataca gtccactcct gggaaaagtg 600 tgcgcgcagt gaaaggtggt tctaacgaag agaagaaaat tttcaataca actaataaag 660 atttaaaaaa aaaaaaacaa aagtgaacca ccctctcacg tataaaagcc acacttagac 720 atttctggtg tcccgcgaca ccccataaag ttggggcata tcggttagac acagcaaaag 780 aatataaaaa aaaagaaaat taatttatat aaagaaatcc gatccattca cacctctgca 840 tatcgccatt ccacattttc atatttaaaa aaaaaaaaaa aaattttaaa tacatatata 900 tatactccaa cgttttacat atccatacac atacatatat atacatattt ttccatacat 960 aaaattgtac atatatatat tcatatccac atacatatat atacatattt ttccatacat 1020 aaaattttac atatatatat tcatatccac atacatatat atacatattt ttccatacat 1080 aaaattttac atatatatat tcatatccat atacatatac atatttttat ccatacatat 1140 attccacaaa aatttctaat tttttcgcac acaaatatat ttttccatca ccgcgaacaa 1200 aatcccgcta ccccttgcag cctatataca tacatacata cgtgcataca taagtaccgc 1260 ttaactgcag tataccagca cagctgcaag ctcatgtgcc aaagaccggc acaaagcaaa 1320 cgaccaaagg tagctgtgct gtatacccta ggttggcgta tccttgattc tatttcaaat 1380 ccaaagtatt cccacgcgtt ccaaggccaa ttccaaggtg tttccaaacg tttcaagtcg 1440 tttctttcgt tctatttcaa ttccaaagtg tttcaacccg ttccaagtcg catctttgga 1500 gttatattgc aattccaaag tattcccaca cgttccaagt cgtatctttg gagttagatt 1560 ccaattccaa agtgtttcca ctcgtttcaa gtcgtatctt tggttctatt tcaatagaaa 1620 agtggttcca cccgttccaa gtcgcatctt tggagttaga ttccaattcc aaagtgtttc 1680 caagtcgtat ctttggagtt agattccaat tccaaagtgt ttccacgcgt ttcaagtcgc 1740 atctttggtt ctatttcaat agaaaagtgt ttccactcgt ttcaagtcgt atctttgtag 1800 ttatatttga attccaaggt gttccaagcc gctcctaggg ttatttggca gtgttccaag 1860 ttattccaaa cgtaccagct cggtttctgg attttcaatc tatttcaagg cttttcaaag 1920 ccgttcttta agtggtttgg tagtgttcca gttttttccc gaacgttcca gttggtcttt 1980 ttggctaatt ttcaagccgt gccaaaccag ctcaccactt tataaccgcc actcgctggg 2040 gttatgagca cgagcccgtc aaaggacgag aaagcagctg agaacactgc agcccccagc 2100 aaatcctcaa ttgctgctcc aggtcagcgc cagaggagta ccagccctac tactcctcct 2160 tcccgagcat ttcagaccct ctctagtcca ttaactctca cgtccgcacc atcgagcacg 2220 acctttcagc aagccatgac gagcaagcca aaaagaggtc aaatcattgg taccacttcc 2280 acagattttg gtaaccgccc caagagtcac tcggtccgaa aacaaaagtg ttttagctgc 2340 ttccaccatg gctttgaaga agtttgtttc catttgcgat aaactaattc agtttgaagt 2400 cgacgcccac ttcacttcac tgtccgacgt cagcgtattt acgctccaag tccgtcgtaa 2460 tcgagttcaa gcgttgtggg agaaggtcga gcaggaatat gagaaatgcg aggcactcac 2520 agatggtaaa gacgataact ctgaagattt agccatccaa gctaaatatg acaaatgcta 2580 tcaagtttac gagcgatgca cggctcgatt aaacgagctt atccatgaag ggtctgctcc 2640 tactcctccg aaccgtcatg cgcctacctc tgcaccagca gaacgaggct gtcgattgcc 2700 acccattgac acagaagtct ttacgggaga ttacttaaaa tggcccacgt tccgtgactt 2760 gtttacagcc atctacataa gcgattcgcg aatttccccc gttgaaaagc tgtaccatct 2820 cctagcgaaa acgaggggtg aagccaacct cattgtagca aaattcccgc ttacaaatga 2880 tggttttgaa acggcatgga atgcactcaa acagcgtttc gaaaataagc gacttatagc 2940 gactcgtgaa cagccaattc cgagcacttc tcgacctgcc tcaaatttcc catgagtccg 3000 gaaaagcatt acaggatcat cagaatgctc tgcaggcatc cattcgcgct tttgagcact 3060 gtggcctgcc agtaaccact tggggaggca tgttcgtatt tttctgttcc atcaaattgc 3120 caaagaatac cctttcatca tgggaacaat ctctgggaga caagactgcc attccagagt 3180 ggcgaatgat ggacgatttc ctcactgagc ggttccgtac gctcgaagca gtggaaaata 3240 caactgcact ggctatttcg gttcccacac cgaaacctcc acgaaatccg cttccagcca 3300 cccgaaaggc taatacattt gaagccaagg tgaccaccaa accaaaatcc tgtgatctct 3360 gtatgaagga gaatcacccg gtgcgtcgat gtccccgctt ccttcagatg acggtgccag 3420 agagggtggc atacataagg aaaaagtccc tttgtctgaa ctgtttcgca agaggacatc 3480 aagtaaaaga ttgtccgagt cagcacaatt gcttcacgtg caaggggcgt cacaataagc 3540 tcttgcacaa ggacgcccga ccggagaata atccaacgcc ggtcccagct cagcccagtt 3600 cgcacaatat tgtccggact tactttacct ctgaggcaag gtcctcacga gataagttgc 3660 taggtacagc catcgtggac ttttggcaca agggcactac ttatgcttca cgggcgctat 3720 tagacccggc ttcggaagcc aatttcatat ccgaaagaat gttccgactc atgaagttac 3780 cattccaaac tgtcagggct gaagtgacag gcattaacgg aaaggtcaat agaaccttga 3840 aggtctgtca cattagcatt cggtcatcgc atggacctcc agtccagata gagatagaag 3900 cttttgtctt gtcccaagtt gctgatgatt tgccgtcatg tacgatagcc caagccacac 3960 tggaaggaat gcccaatatt cctctagtcg acccctcctt cagattaaag gcaaagatcg 4020 atctacttct tggcattgat gtcataccag cggtgactct ttccggcatc cagaccgaag 4080 tgtgcggatc tttaatggct caagagacga gattcggttg ggtcatttca ggcccgttgc 4140 aaggggtccc cgttagttca tgtgccgcgt tcaccacgag ggtcgctgtc agcagagagg 4200 agggacttga aactcttctc acaaggtttt gggaggtgga ggatctaccg ggcaaaccgg 4260 tagaagattc tgactctgtt tgcgaggtca atttccagaa gaccacaaga agagatgttt 4320 cggggatata tacggtctca cttccgtttc gagatccaac caacatcaac cttggtcatt 4380 ccagaccagg ggcattggct cagttcctaa aaaacgaaag ccggctcctt aaaaatattc 4440 cggcaaaaac tgagtatgac gccgtcatcc aagaatattt ggacttgggc cacatgcgtc 4500 aagttccatt tgacggcact aataacaatt tttatttgcc acaccatgcg gttttcaagc 4560 ccgacagcac cacaacaaag gtacgggtag tctttaacgc ctctagcaag tccaccaacg 4620 gagtaagtct caacgacata ctttaccctg gtccggtact tcagtccgat ctcactactc 4680 agattctcaa gtggcgtttt ttacgagtag ttttcaaagc cgacatcaca aaaatgtacc 4740 gccaaattat ggtcgatcca atccatacgc catttcagcg gatcctgttc cgaaataaag 4800 aggggcaact tggtgattac gagctgaata ccgtcacgtt tggcattaac tgtgctccgt 4860 tcttagccat ccgagtactt cagcaactcg caagtgatgt acaatccaaa tttcctttgg 4920 ctagcgatat tatcaagtcc tatatgtatg tcgatgatgt cctgacaggg gctcataatg 4980 agcacggcat cgcatggtca atgaacttca agacgctctt ggttcagctg gctttccttt 5040 gcgcaagtgg agatcgaatg tgaagagcgt gctcagttgc attccaaaaa gtcatttact 5100 gaatgctgat ttcctcgata tcgaagaggc tagcacagcg aaaacactgg gaatccgatg 5160 gaaagcgact acggatgagt tttattttgt catctcaccc caagatgtca agtcagctta 5220 tactaagcga gaagtcctcg ggcagatcgc caagttattc gatcccgctg gctggctggc 5280 tccctttgtg attcaagcca aggtgtttat gcaagagctt tggctacaag agctagggtg 5340 ggatgatcaa ttgcccagtg aagtccatca cagttggcag gattttacaa agagctactc 5400 cttccttgac cagatccgaa ttcaacgatg ggttcttcat gacccaaatt ccgacatcca 5460 attccattgt ttttgcgatg cgtcacagcg ggcttatggt gccgcaattt acgtgcgcgt 5520 tagcaatgac caaggcatat catgctgcct tcttgcagcg aaatctagag tagccccggt 5580 taaaacggtt tcattgcccc gccttgaatt atgcggagcc acacttttag ctgaactggc 5640 agctgcagtt cttccgcaac ttccagttga taatgcggag gctttttatt ggtcggattc 5700 caccgtcgta ctgtcgtggt taaataagcc accctgcaca tggacaacgt ttgtcgcaaa 5760 tcgggtagca aaaattgtga cggggcaaca aatgacaccc catggaatca cgttcgttca 5820 gaggacaatc cagcagatct ccctagccgc ggcctgagtg cgcaggaact ggtacacaag 5880 gatttgtggt ggcacggccc accatggcta cgggaaccac aagagtcctg gcagcgagcg 5940 acaccactcc cactagacac taccttggag aagcgggtag tgaaggtcca cgtcgcgatc 6000 gctaagccag ccaacgagat actatctcgt ttttccaatc tggcacgtgc cttacgagtg 6060 atagcatatg tgatccgttt cggcagaagg tgtcgtaaac ttccaaacga ttactccggg 6120 gggtgacctc tagcgagatc aatcaagtac tccaggcgtt gatccgagtc actcagcgtg 6180 attactttcc ggcggaacat cggtgtctac agcagaagaa atctctgccc acgtctagca 6240 ccattctcaa tttgaatcct ttcattgacg catcaggtgt gatccgagca tgtgggcgcg 6300 tgcaacaagc agcggccctt agttacgatg agcgtaaccc cattctgttg ccagtagtaa 6360 gtccattgtc ccggctgttg gtgcttttca cgcatcagat ctctctccat gggggcagcc 6420 aactggtagt ccgtctcatc cgacagagat actggattcc aagactgcgg aatctagtaa 6480 aatcggtggt caattcatgc aaagtctgtg tgatttacaa aagaaggctg caatcacagc 6540 taatgggcac ccttccggcc cagcgaacta ctttcgcacg tcctttcacg accaccggta 6600 tagattttgc aggtcctttc gacatcaaga gtttcaccgg ccgcgcttgc cgcatatcaa 6660 agggacgtgt gtgtgtcttt gtttgctttt ccaccaaagc cgttcacctt gaagcaacgt 6720 cggatttgag taccgaagca tttctggcag cattcgctcg ctttgtatcg aggcgagggt 6780 gtccccagca agtgcagtcc gacaacggaa aaacttcgta ggtgcatcca gagtgcttga 6840 aaaagatttc ctaagttcta cccagcagaa aatattatcc ccctacagcc atcagaatct 6900 gtcctggcat ttcatccctc ccggggcacc acacatggga gggttgtggg aggctggagt 6960 taagagtttt aaaactctgt tctacaaggc cacctccaac caaaggtata cttttgaaga 7020 gttctctacg ctgctggcta aggtagaggc gtgtttgaac tcgcggccga tttctcctat 7080 gtctgaagac cccagcgact ttttagcttt gaccccaggc catttcctag tgggcggtcc 7140 attgctatcc gtgacggaac ctaaaatcaa ggatcaggta ccgtctatca ttaaccggtg 7200 gcagcgtctg aaggccgtga gtcagcattt ctgcacccga tggaaagacg agtatctgaa 7260 ggagctacat aagcgcaata agtggcgatt cccttctaaa aatcttgaag tgggcgatct 7320 ggtggtactc aaagacgaca atcttccatt taatgaatgg cgtctcgggc gaatccacca 7380 aacacaccct ggcagtgacg gcaatgtccg cgtcgtcgaa ctgcttaccg ctcgcggtat 7440 tgtgaagcgt cccgtcgcaa agttgatctt tcttcctcca gaagaaagaa ttcaaaccca 7500 ttacgtaaaa ctacagtagc gtccagcact tggtccttcc tcccaattaa gaagggccga 7560 gctgccagta gtaatatgtg tattctatcg tcctacatta atccgctttc ttcattattt 7620 gtcccggcag cttcctgata ttcgtcaacc tagatcccgt ttccatggct ccgagaccgc 7680 gttcagtaca ggcatcgaaa gaggagcgca gatgttcgcg aggaactgag tccttccgtt 7740 gccgagtctg tcgcggaatc catcctctga agcgatgccg tcgcttcctg cggctgaacg 7800 tggagaagag gttgcgaata agtactgcgc caattgcctg gcccaccaac attccggagg 7860 aagctgctta agcggtgata agtgccgaat atgtgaggag gatcaccaca cgctgctcca 7920 cttccatgaa cagccacgtc gacgtactcc cagctccgtc gtacgccgag tcacccccga 7980 gtcctccaga cgaagggtcg caccaacgcc agcctccgat ccaaaactga ccttgaccac 8040 actgctgcag caccgcaatc cgcatctgat gcccacggcg atggtgcgcc ttgagacggg 8100 gggaaaaacc ttcgacgtga aggccctggt ggacccttgt tctgcggtat cgtcgatggc 8160 gacgtcactg gccacggcct ttaagttgac agccgtcaat ataggggcag agaaagcggt 8220 ggcagcagtg atccggtccc caatcagtga aggatggcga ttggaggcga tcctgaaggt 8280 tgtcgatggc ttgtgctgcc gcactccaag cgctccggtg gacccgcaga tcgccaaaaa 8340 gttcgagggc atcgtcctgg cggatgatac gttctaccga ccgtcgtcag tgtctttggt 8400 cctgggcgcg gatgtgatca cggaggtcat gttggagggg agtctacccg gggttggcgg 8460 gcggccaatt gcgatgcgca cagtctttgg ctggaccctg tccggagcgt gccattagac 8520 tgcctgggcc ttgcactcct gcaagggggg gagca 8555 // ID CR1-45_AAe repbase; DNA; INV; 5021 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-45_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5021 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1132-1132 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 23 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 431..1258 FT /product="CR1-45_AAe_1p" FT /translation="MSTACDRCAKAIKRNDEVITCMGFCDHIVHLRCANID FT KNFAKSISDATNLYWMCDECAKLMKLVRFRNAVSSVGDAICELTKNQEVAN FT TELKNELVKHSQQIAHLSNRINSITPSMPISAERRAMKRRRIDNDNQPAKS FT LVGGTRIADNNGVATVPVPSTMFWVYLSRLHPSVNYEAVEKLARECLQCEA FT AKAVPLVKQGTDVNSLNFISFKVGVDPKFRAVALDPSSWPKGILFREFEDS FT RRGNYWMPEPSTPSIIVTSDAEETPQQASMEATEN" FT CDS 1262..4948 FT /product="CR1-45_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="PLGTHSTGRRKQPRRIVESTLETSSQYDSVLPFAVIG FT LQSRRGPDVETVEEVLQSASPGKYNYFQEQALPDQFPASSRGTSSRRILLS FT PDLINHVSIVSDPNFNFNQFDEASFTPGRTSASLMEGPNQPSAVEPLLPSI FT ISRPGPVTGFGIGLFQAAYSGKYTSNMCSMIPDALTNSSFPSPPTTLLDFQ FT ILGRTDTSVMEAPEPPATVELTQPAIDSRLSPVCGSGAGVFQSSVNGKYAT FT LSNNAIVDVMTASSSSLPVDNTIIATSSPFQADNIWLYYQNVRGLRTKIDD FT FYLATNDCNLDVIILTETGLDDCINSIQLFGTIYNVYRCDRSTRNSHKTCF FT GGCLIAVGQQYPSSLIELSNGQNLEQICVSSIIKGQKLKLMVVYIPPDRSQ FT DVSTIEAHIASVQELCDKSSSDDKVIVCGDYNQPRLSWTVGTEGLTPNGSR FT VLPPAAAALVDGFDFLNMTQANNHLNHLGRVLDLVFSTSGDINVLRADFPL FT LPIDPHHPPLDISVAASSTQRSISSHRESRVLNYRRLDFVALTEYLSNVNW FT NELLNVQDVNEMTERFCRAICSWLNDNLPVKRPPVQPVWSTSELRRLKRDR FT NAKQRKLRRHRSAENQRAFKTAADSYHRLNTLLYKSHVSRVQAGIRRDPKN FT FWGFVNSKRNSPSIPSNVYLDETESASVEESCELFAEFFASVFSPGSTSQQ FT EADAAASNLPTDAVDIDIFNITSDMIITASRKLKSSYSPGPDGVPAVIFRR FT CIASLAVPLSRIFNTSFEQKTFPDMWKQSFMFPVFKKGDKRNVKNYRGITS FT LSAGSKLFEIVVSSVILNYSKSYISVNQHGFMPGRSVTSNLLDFTSACISQ FT IETKAQVDAVYTDLKAAFDVIDHRILLCKLSRIGISDRLLSWLESYLSNRI FT LRVKLDSAVSRAFSNSSGVPQGSNLGPLLFALFFNDVALLLGSEYVLIYAD FT DLKIYLTVKSVEDCRRLQGLINTFADWCRLNKLLISVSKCMVITFHRSKNP FT IIFDYRIEGNVLERVDQISDLGIILDAKLSFNQHISSMISKASRQLGFVTK FT VSREFTDPHCLKTLYCALVRPILENASIIWTPYHLSWNIRIERIQRRFIRI FT ALRSLPWRDPQNLPPYNERCRLLGLDSLQRRRSIQQSIFVAKLLNGEIDAP FT ALLTRVNFRTMGRQLRSSTMLTTRFHRTSYGCNEPLTACLRTFSLVEELFD FT FGETTGCFKRRVTSSRILQ" XX SQ Sequence 5021 BP; 1323 A; 1196 C; 1054 G; 1448 T; 0 other; ttctggcatc actgttgaat gtttgtatgc tgtttttcgg cccggagttt ttaactcatt 60 agaaagtgaa atcggcgtca taaaaccgtt attttgtttt cgtatgtttg gagtgactct 120 gtgaattgta ctcgtgtgtt aattagtgtc cattgccgtc gtgatacatt atcacttatt 180 gcgagtgaac tgtgttttgt ttgctccact tgttcacaaa agcttacctg ccccataccg 240 attttccact gaaagacatc tgttggtggg ttacacaagc tttaactgca agctctggag 300 ttccatcgat ttcatcgact accgaggaat acttttggta ttgtgatagt ggtgatacac 360 attcaaccat aaccatacac agctagcaat ttgttcttcg tgtgtagaaa ggctcaggag 420 tatttacgac atgtccaccg cttgcgatcg ttgcgccaaa gcaattaagc gtaacgatga 480 agtgatcacg tgtatgggct tctgcgatca tatcgtccac ctcagatgcg caaatattga 540 taaaaatttt gccaaatcca tttctgatgc gaccaatttg tattggatgt gtgatgaatg 600 tgcaaaatta atgaagcttg tacgtttccg taatgccgtc tcttctgttg gtgatgcgat 660 ttgcgaactt accaaaaatc aagaggtggc gaacaccgaa ctgaaaaatg aactcgtcaa 720 acatagccaa caaattgctc atctttcgaa tcgaatcaac tctatcacac catcaatgcc 780 catctccgct gaacgtcgcg cgatgaaacg tcgccggatc gacaatgata atcagcccgc 840 taaatcactt gtcgggggaa ctaggatcgc tgataacaat ggtgttgcaa ctgtaccagt 900 accctccacc atgttttggg tttacttgtc tcgcttacat cccagtgtga attacgaagc 960 tgttgaaaaa ttagcacgag agtgcttaca atgcgaggcg gcgaaggctg tgccattggt 1020 taagcaggga accgatgtga attctttgaa cttcatttcg ttcaaagttg gcgtcgatcc 1080 taaatttcgt gctgtagcgc tggatccttc ttcatggcct aaaggtattc tcttccgaga 1140 atttgaagat agtcgtcgtg gaaactattg gatgcctgag ccaagtactc cttctattat 1200 cgttacgtct gacgctgaag aaactcctca acaagcatcc atggaggcca ccgagaattg 1260 acctcttggt acacactcta ctggacgaag gaagcaaccg cgacgcatcg tagaaagcac 1320 tttggagacc tcttctcagt acgactcagt cctgcctttt gctgtcatcg gtcttcagag 1380 tcgtcgcggt cctgatgtcg aaactgttga agaggttctc cagtctgcat ccccaggcaa 1440 gtacaactac tttcaagagc aagcacttcc tgatcaattt cccgcttcca gcagaggcac 1500 atccagtcgt cgtattttac tatcacctga ccttatcaac catgtgtcca tcgtttctga 1560 cccgaacttc aattttaatc aatttgacga agcatcattt actccgggac gcacatccgc 1620 tagtcttatg gaaggcccta accagcccag cgcagtcgag cctttgctgc catcgatcat 1680 cagccgtccc ggtcctgtga ctgggtttgg gatagggctc ttccaagccg cctactctgg 1740 caagtataca tccaacatgt gcagtatgat tcctgatgcg ctcaccaatt ctagctttcc 1800 atcaccgcct acaacgctac tcgattttca aatactggga cgcactgaca ctagcgtcat 1860 ggaagcccca gagccccccg ccacagtcga gctaactcag ccagcgatcg acagccgtct 1920 cagtcctgtg tgcgggagtg gtgcaggggt cttccaatct tccgtcaatg gcaagtacgc 1980 tactttatcg aacaatgcaa tcgttgatgt aatgaccgct tctagctcat cactccctgt 2040 cgacaatact atcatcgcta ccagttctcc gtttcaagct gacaacattt ggctgtatta 2100 ccagaacgtt cgtgggttaa gaacgaagat tgacgatttc tacctcgcca ccaatgactg 2160 caacctcgat gtcattattc taactgaaac tggtttagac gattgcatca actctataca 2220 gctctttggt actatctaca atgtataccg ctgtgatcgc agcaccagaa atagccataa 2280 gacatgtttc ggtggctgtc tcattgctgt tggtcagcag tatcctagct cattgattga 2340 actaagtaac ggccaaaact tggaacaaat ttgcgtatca tctattatca agggacaaaa 2400 gctgaaactg atggtagtct acattccccc tgatcgaagc caggatgttt caaccattga 2460 ggcacatatc gcatctgtgc aagagctgtg tgacaaaagc tcctcggatg ataaagttat 2520 agtatgcggg gattacaatc aaccgcgatt gtcatggacg gttggtaccg aaggtcttac 2580 ccccaacggt tctcgtgtcc taccccctgc cgccgctgct ttagttgacg gtttcgactt 2640 cctcaacatg acacaagcta acaaccatct caatcatctc ggcagagttc tggaccttgt 2700 attttccact tccggagaca tcaacgtctt gagagctgac ttcccgttac tgcctatcga 2760 tcctcatcac cctccgttag atatttccgt ggctgcttca tcaacgcaac gtagcatttc 2820 tagtcatcga gaatccagag tgctaaacta tcgtcggctc gattttgttg cgcttaccga 2880 gtacttgtca aacgtgaatt ggaacgagtt gttgaatgta caggatgtga acgaaatgac 2940 ggagcgtttc tgtcgcgcta tttgctcttg gcttaatgac aacctccctg taaaaaggcc 3000 accagttcaa ccagtatgga gcacttcaga actgcgtagg ttgaagcgtg atcggaatgc 3060 caaacaaaga aagcttcgta gacatcggtc tgccgaaaat caacgagcct tcaaaacagc 3120 tgctgactct tatcatcggt tgaatactct cttgtataag tcacacgttt cgcgtgttca 3180 agctggcatt cgtagagacc cgaaaaattt ctggggattt gtgaattcaa aacgaaatag 3240 tccgtccatc ccgtccaacg tttatctcga cgaaactgaa tcggcttcag tggaagaatc 3300 ttgcgagtta tttgccgagt tttttgcatc cgttttctca ccaggtagta cttctcaaca 3360 ggaagccgat gctgcagcgt cgaatcttcc tactgacgct gtcgacattg atatcttcaa 3420 tattacgtca gatatgatta tcaccgcctc caggaaatta aaaagttcgt actcaccagg 3480 tccagatggt gtgcctgcag taatttttcg tcgctgtatt gcctctctag ccgttccctt 3540 aagtcgtata ttcaatacgt cttttgagca gaaaaccttt cctgatatgt ggaaacaatc 3600 tttcatgttt cccgttttca agaaaggtga taaaagaaat gtgaagaact acagaggaat 3660 aacaagcctt tctgccggat cgaagctgtt tgaaattgtt gtcagcagtg tcattctcaa 3720 ctacagcaag agctacatct cagtgaatca gcacggattt atgccaggga gatctgtcac 3780 atcaaacttg ctcgatttca ccagcgcatg catctcacag attgaaacta aagcgcaggt 3840 tgacgctgtg tatacggact tgaaggccgc cttcgacgtt atagatcacc gtattcttct 3900 gtgtaaactt tctcgcattg gcatatctga tcgacttttg tcatggttgg aaagctattt 3960 atctaaccga attctgcgag ttaagcttga ttccgctgtt tcgcgtgcat tcagcaatag 4020 ttcaggtgtc ccacaaggaa gcaatcttgg ccctctgctt ttcgcattgt tctttaacga 4080 tgttgcactt cttttgggtt ccgaatacgt acttatatat gccgatgacc tgaaaattta 4140 tcttactgtc aaatctgttg aggactgtag gcggcttcaa ggtctgataa atacttttgc 4200 agactggtgt agattgaaca agctcttaat aagcgtttct aaatgcatgg ttataacatt 4260 ccaccgatca aaaaatccca tcatctttga ttatcgcatt gaagggaacg tacttgaacg 4320 agtcgatcag ataagtgatt taggtatcat cctggatgct aaactcagct tcaatcaaca 4380 tatttcgagc atgatctcaa aagcatctcg tcaacttggc tttgttacga aagtttctcg 4440 tgagttcact gacccgcact gcctcaaaac tctctactgt gcattagtac gtccaatact 4500 cgaaaatgct tcgattatat ggactcctta tcatctgtca tggaatatca gaatcgagcg 4560 tattcagcgc agattcatcc gtattgcgct acgaagtctg ccttggcgag atccacaaaa 4620 cctaccaccg tataatgagc gatgccggct tctaggactt gattcactgc agcgacggcg 4680 aagtatccaa caatctattt tcgttgctaa actgctgaat ggagagatcg atgcacctgc 4740 gcttctaact cgtgtcaact tccgtacaat gggcaggcaa ttgaggtctt caactatgtt 4800 gacaacaaga tttcaccgca cttcttatgg ctgtaacgaa ccactgaccg catgtttgcg 4860 aaccttttct ttagttgaag agctgtttga cttcggagag actactggat gctttaaacg 4920 gagggtgact agttcaagga tactacaata actattaagt atattcatgt agacaactgt 4980 cagatgaatt aatcaaataa ataaataaat aaataaataa a 5021 // ID Copia-4_SI-LTR repbase; DNA; INV; 438 BP. XX AC AEAQ01007866; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_SI_; KW Copia-4_SI-I; Copia-4_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-438 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01007866; Positions 457 20. XX SQ Sequence 438 BP; 157 A; 82 C; 117 G; 82 T; 0 other; tgcggcttac gcaacgagag aagaaaagcg ttacggtaaa agtgccacgt atgcaaaaaa 60 ccaggacacg tggcgaaggt gtgctggcac cgaagatcca acgatgatgg cgacaagaaa 120 gatcgaccag acaaaaagcg cacgaattag cgcagtgaca ctcgatctca aggccgttcg 180 caacagagtc gtcaggaggg acgcaatgat aagagtgaga aagctaacaa agccaccgac 240 gactacgcgt tcgaggcatg cgacgaagag tgtgcgagcg atggcgacag ttacataata 300 agttggattc tagatagcgg ttgtaacagc catatgataa aaaaaaattt gtaattgatg 360 attttaatca cataaatggt aatgttctat tggccggtaa aggaaatgta acgaaatcag 420 aaggaatagg ctcaatca 438 // ID BEL-34_CQ-LTR repbase; DNA; INV; 284 BP. XX AC AAWU01003986; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-34_CQ_; KW BEL-34_CQ-I; BEL-34_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-284 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 222-222 (2011). XX DR GenBank; AAWU01003986; Positions 10112 9829. XX SQ Sequence 284 BP; 72 A; 95 C; 69 G; 48 T; 0 other; tgttcggatt ctgtaaaccc ctcggaccgt gtggaccgag agttctaaac cgcgaacggc 60 agatccctct gatcgccatt tcgccgcgaa ggccgcgaga gaagaggtga gagagaaact 120 ggaccgcgat ctccgctggt gagccttcga ggattctgcg atccagcttg cgcaacagta 180 cgcttcagcg cttcactccg cgaagacccg cacgaaaacc tcctccggag catcccacat 240 cccaacaacc aaccaggtac cctcaaccaa aaccggttcc aaca 284 // ID BEL-60_AA-I repbase; DNA; INV; 5881 BP. XX AC supercont1.98; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-60_AA_; KW BEL-60_AA-LTR; BEL-60_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5881 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.98; Positions 1792248 1798128. XX CC Positions [4907-5467] - Integrase core CC 'TTCTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 32..5881 FT /product="BEL-60_AA-I_1p" FT /translation="MDDRVFRSVTTPRNCQSCERPDAAENEMVQCSICLLW FT EHFGCAGVGSEVKHFSVRYVCQKCSDRQGTSGALQVPMEDKRKSKGSKASS FT KGTAKRGKVIPDPPKSVSSSVRAKLLEEEFKLVEEERQLMEQELQEQEEIK FT KRQLIEEERKLAEKRRLAEEESLFRDRKLQEELEMKRKQMQIRKESLEKRQ FT AIIRHAASMSSRSGSIVDSEEKVNKWLNSQKNVGKVNEVLEDDDNNAKQFV FT SPDPLDVPKGPELEYPDLGTLPFVPNPSHRLNQPEPIVRSDPGHLVSAYSP FT HTLTQVQIAARQVLGKDLPTFGGNPEDWPIFISNFEQSTASCGYSDAENLV FT RLQRSLKGNALEAVRSRLLLPASVPHVIQTLRTLYGRPEILIRSLMSKIQQ FT VPAPKHERLETLIQFGLSVQNLVDHLKAAGQLNHLSNPALMQELVEKLPGT FT MRLDWAFYKNKNLPATLETFGDFMSGLVTAASEVSFELPTLSNTSRMDKRS FT RETGIVHAHTAEQMSMSTGESSSNSNAKVSKPCAACGRAGHRVAECYQFKN FT ATVDERWQLVQQKGLCRTCLNSHGKWPCRSWNGCGIDGCRQKHNTLLHSSS FT PLLENVGMSASHVSSENLKWPLFRIVPVVLFSDSSSQTIFAFIDEGSSYTL FT LEESVATQLGVRGQLEPLTLQWTGKVKRKESNSRRIQVDISGKGCTPRHKL FT IDARTVNRLVLPSQTLKYGDLAKQFPHLRGLPLEDYELVQPKLLIGLDNLR FT LCVPLKLREGGPRDPIGAKCRLGWSIYGCIPGQSPQPAIVTFHVGTVSDPD FT RELNEQLRDYFTLENAGVLGSREILECEDEKRARLLLESTTRRLPVGSGFE FT TGLLWATNNPSFPDSYAMAVRRLEALERKLKDEPETERRVREQVVEYVRKG FT YAHKASLVELTSVDAGRVWYLPLGVVRNPKKPGKIRLIWDAAAKVGNVSFN FT SKLLKGPDLLSPLPKVLSQFRQFPVAVCGDIMEMFHQIKIRAPDNQSQRFL FT FRDSPSEYPSVYVMDVATFGATCSPASAQYIKNLNAEECSKDYPRAAAAVK FT NKHYVDDYLDSFETLQEAIAVVNEVKLVHSVGGFTLRNFSSNKSEVLDGIG FT ASSKHEPKSLDLERGEKSESVLGMKWIPCEDVFVYTLGLREDLQYILTEDH FT VPTKREIARVVMSLFDPLGFIAFFLVHGKILLQDVWATGTEWDQRIPNDIN FT NRWQQWSHLFEQLHQLRVPRCYFRAPFASSLNGLQLHLFVDASEAAYSCVA FT YFRLDTDNGTQVALVGAKTKVAPLKTLSIPRLELKAAVLGVRFLETIQNHH FT EFTIHSRYCWSDSGTVLAWIHSHDHRRFHKFVAVRVGEILSSTEQKEWKWV FT PTKLNVADLATKWGIGPQIAMESPWFQGPHFLHGPESSWPQQRQASSTNEE FT LRPVSFHFVHSDPILDFFRFNNWTKLHRTMAYVLRFFDNIRQKKRGQKLEL FT GALTSDELRRSEEILWKSAQVESYPYEVAILAKSQGPPELRHNVVDSSSDI FT YTKWPYLDERGILRSRGRIGAAPYISVEAKYPVILPKNHVITLLIVDRYHR FT NVRHANRETVVNEIRQRFEISTLRRLVDKVEKNCVWCRIAKAAPKPPSMAP FT LPEMRLTAFVRPFTFTGLDYFGPVLVKVGRSNAKRWVALFTCLTTRAIHLE FT VVHTLSTESCIMAVQRFVSRRGIPREFWTDNATCFQGISNELQIISEATKK FT AMAEKFTSPQTTWKFIPPATPHMGGAWERLVRSVKVAIRAIIDGPRRPDDE FT TLETVLLEAEAMINARPLTYVPLESADQEALTPNHFLLGNSSGAKFLPSGT FT IDTRSTLRSSWKLARFMTDDFWNRWLKEYLPVITRRCKWFKDVKNIEVGDL FT VFVVDGKMRNQWVRGRIEEVYPGRDGRVRQALVRTSTGVIRRAATKLAVLD FT VLEKCKPWSLGDPERLDPQQGLRAGV" XX SQ Sequence 5881 BP; 1599 A; 1384 C; 1501 G; 1397 T; 0 other; aactttaaga aatttgtggt agctagcaag gatggatgat cgcgttttcc ggtccgttac 60 tacccctcgg aactgccaat cctgcgaacg acctgatgct gcagaaaacg agatggtgca 120 gtgcagcatc tgtctgctat gggagcattt cggttgtgct ggagtaggaa gcgaagtgaa 180 gcatttcagc gtaaggtatg tctgccaaaa gtgctccgac agacagggaa cgtcgggcgc 240 gcttcaggtt ccaatggaag acaaacgcaa atcgaaaggc tcaaaggcaa gttccaaagg 300 cacggcgaaa aggggaaagg ttattcccga tccgccaaag agtgttagtt ccagtgttcg 360 cgccaagctt ttagaggagg aattcaagct agtggaagaa gaacgacaat tgatggagca 420 agaacttcag gagcaggagg agatcaagaa gcgtcagttg atcgaagagg agcgcaagct 480 ggcggaaaaa cgacgtctcg cagaggaaga gagcctcttc cgtgatcgga agctacaaga 540 agagctggaa atgaagcgga agcaaatgca gattcgcaaa gaatctttgg agaagcgcca 600 ggctattatt cggcatgcag cctcgatgag tagcaggagt ggctcgattg tcgactcgga 660 agaaaaggtc aataagtggc tcaactcgca gaaaaatgtc ggcaaagtga atgaagttct 720 tgaggatgac gacaacaacg ccaaacaatt cgtcagcccc gacccactgg atgtaccgaa 780 gggcccagaa ttggaatacc ccgatcttgg cacgctgcca tttgttccaa atccgtcgca 840 tcgtttgaat caaccggagc caatcgtgcg gagtgatcca ggtcatctag tgtctgcata 900 ttcgcctcat acactcacgc aagtgcaaat agccgcgcgt caagtattgg gaaaagatct 960 cccaactttc ggaggtaacc ctgaagactg gccgattttc atcagtaact tcgaacagtc 1020 cacagcctcc tgtggatact ccgatgcgga aaatttggtt agactgcaac gatcgttgaa 1080 aggaaacgcc ctggaggcag ttcgtagccg gcttctgtta ccagcaagtg ttccacacgt 1140 tatacagact cttcgcactc tgtatggaag gccagaaatt ctcattcggt cgttgatgag 1200 caaaatacag caggtaccag ctcctaaaca cgagcgactg gaaacactga ttcaatttgg 1260 actctcagtg caaaatctgg ttgatcatct caaggcggca ggccagttga accacctttc 1320 aaacccggcg ttgatgcaag agcttgtaga aaaactgccc gggaccatgc gattagattg 1380 ggcattctat aaaaataaaa atctaccagc aactctcgaa acgttcggcg atttcatgtc 1440 cggattagtg actgcagcaa gcgaagtatc attcgagctc ccgacactga gtaatacgtc 1500 aagaatggat aagcgatcca gagaaactgg gatcgttcat gcacataccg ctgagcaaat 1560 gtcaatgtcg actggtgaat catcgtccaa ttccaacgcg aaggttagca agccgtgcgc 1620 agcatgtggc cgagctggac atcgagtagc ggaatgttac cagtttaaaa acgcgactgt 1680 agatgaacgc tggcaactcg ttcagcagaa gggactatgt cgaacatgcc tgaatagtca 1740 tggcaaatgg ccatgccgct cctggaatgg ctgcggaata gacggatgtc gacagaaaca 1800 caatacgctg cttcattctt cctctccact gctagaaaat gtaggaatgt cggcaagtca 1860 tgtgtcgtct gaaaatttga agtggccgct cttccgcatc gttcctgttg ttctcttcag 1920 tgacagttcc tctcaaacaa tatttgcgtt catcgatgag ggttcttcgt atactctact 1980 ggaagagtcc gtcgcaacgc agctgggagt caggggccag ctagaaccct taacccttca 2040 atggaccgga aaggtgaaac gcaaagagtc aaattcaagg cgtatacagg ttgacatctc 2100 cggtaaagga tgtacccccc ggcacaagct gattgatgca cgaacagtta atcggctcgt 2160 cctaccctca caaacactca agtatgggga tttggccaaa caatttcctc acctacgtgg 2220 tcttccccta gaagattacg aacttgttca acccaaatta ctgatagggt tagacaatct 2280 tcggttgtgc gttccattga agcttcgcga aggagggcca agggatccta ttggcgcaaa 2340 gtgtaggctg ggatggagca tttacggttg tatacctggt caatcgcccc aacccgccat 2400 tgttaccttc catgtcggta cggtttccga ccctgatcgt gaactgaatg agcagcttcg 2460 tgattacttc actctggaaa atgcaggcgt tttggggtca cgtgagattc ttgagtgtga 2520 agatgaaaag cgagccagac tacttttgga aagtacgaca cgtcgtttgc ctgttggttc 2580 tggttttgaa accgggttac tctgggcaac aaacaatccg agttttcccg acagctatgc 2640 catggctgta cgtcgtctgg aagcgttgga gcgaaaactc aaagatgaac cggagacaga 2700 gcgacgggta agagagcagg ttgttgagta cgtccgtaaa gggtatgccc acaaagctag 2760 tctggtggaa ttaacatccg tggatgccgg tagggtgtgg tacctacctc taggtgtcgt 2820 tagaaacccc aaaaagcctg ggaagatacg tctcatttgg gacgccgcgg cgaaagtggg 2880 aaatgtctct tttaattcca agttactcaa gggacccgac ttgttgtcac ctcttccaaa 2940 ggtcctcagc caatttcgtc agtttccggt cgccgtttgt ggcgacatca tggagatgtt 3000 ccatcaaata aagatacgtg ctcccgataa tcagtcccaa cgcttcttat tccgcgacag 3060 cccatcagaa tatccttcgg tgtacgtgat ggatgttgcc accttcggcg caacctgttc 3120 tccggcctct gcacaatata taaaaaacct gaacgcggag gagtgttcca aggactatcc 3180 acgggcggca gctgcagtta aaaataagca ttatgtggat gattatttgg atagctttga 3240 gacactccaa gaagcaatcg cagtggtcaa cgaagtgaaa ctggtacact cagttggtgg 3300 atttacgttg cggaacttct cttccaacaa atctgaagta ctagatggaa ttggagcgag 3360 ttcaaaacat gaaccgaaga gcttggattt agagcgagga gagaagtcag aatcggtgct 3420 cgggatgaag tggattccgt gcgaggacgt ttttgtctac accctaggtc tccgagaaga 3480 tctgcaatat atactcaccg aagaccacgt tcccaccaaa cgagaaatcg ctagggtcgt 3540 catgagcttg ttcgaccccc tcggattcat cgcattcttc cttgtgcacg gtaaaatttt 3600 actgcaagat gtgtgggcga caggcacaga gtgggatcaa aggattccga acgatatcaa 3660 taatcggtgg cagcaatggt ctcatctctt cgagcagttg catcagcttc gcgtacctcg 3720 atgttatttc cgcgctccct ttgcttcaag tctcaatggt cttcaacttc acttgttcgt 3780 agatgctagc gaggcggcat actcctgtgt agcctacttt cgcttggaca ccgataatgg 3840 aactcaggta gcacttgttg gtgcaaagac caaagtggcg cctttgaaaa cactttcaat 3900 ccctcgacta gagctgaaag cggctgtact aggcgtccgc ttcttggaga ctatccaaaa 3960 ccatcacgag ttcacaatcc atagtcgtta ctgctggagc gattcgggga cagttcttgc 4020 gtggattcat tcccatgatc atcggcggtt ccacaaattt gtcgcagtac gtgtcggcga 4080 aatcttaagc tcaacggaac aaaaggagtg gaaatgggtg ccaacgaagc tcaacgtcgc 4140 cgatttagct accaagtggg gcattggtcc tcaaatagct atggaaagcc cctggtttca 4200 gggaccacat ttccttcacg gcccagaaag cagctggccg caacagcgac aggctagttc 4260 aacaaatgaa gagctgcggc cggtttcctt ccactttgtt cattctgatc cgattctgga 4320 cttttttcgg ttcaacaatt ggacaaaact gcatcgcaca atggcgtatg tattgcgatt 4380 tttcgataat attcgtcaga aaaaacgagg tcaaaaacta gaactcggtg ctcttaccag 4440 tgatgaactt cggcgttctg aagagatttt gtggaagtcc gctcaagttg aatcgtatcc 4500 ctatgaagtt gctattctcg caaaatctca aggtccacct gaactccgcc acaacgttgt 4560 agacagttcc agtgatatct acacaaaatg gccctatctt gatgaaagag gaatcttgcg 4620 cagtcgtggc cgaataggag ccgcgccata catatctgta gaggcgaaat atcccgtaat 4680 tcttcccaaa aaccatgtta ttacacttct gattgtggat aggtaccacc gcaacgttcg 4740 tcacgccaac cgtgagacag tcgtgaacga gatccgtcaa cgcttcgaaa tctcgacgtt 4800 gagacgatta gtagacaaag tcgagaagaa ttgcgtctgg tgtcgcattg caaaagccgc 4860 tcccaaacca ccttcaatgg ctccacttcc tgaaatgcgt ctcacagcgt tcgtacgacc 4920 gtttaccttt accgggctag actactttgg ccctgtactt gttaaagtag gccgcagcaa 4980 cgccaaacgg tgggtggcac ttttcacctg cctcactacc agggcaatac acctagaagt 5040 ggtacacact ttaagcacgg aatcatgtat catggctgtc caacgctttg tctctcgccg 5100 aggcattcca agagaattct ggacagataa tgccacgtgc tttcagggaa taagcaacga 5160 gctgcagata atatcggaag caacgaagaa ggctatggct gaaaaattca caagcccgca 5220 gactacatgg aaatttatcc ctcctgcaac accacatatg ggcggtgcgt gggagaggct 5280 cgttcggtca gtcaaggtgg cgatcagagc gataattgat ggaccacgca gacccgatga 5340 tgaaacttta gaaacagtgt tactagaagc agaagccatg attaatgcca gacctctcac 5400 ctacgtgccg ttagagtccg cagatcaaga ggctttaaca ccaaatcatt tcttgttggg 5460 caattcctct ggtgcaaagt ttctcccttc gggaacaata gacacccgtt caacgcttcg 5520 gagcagctgg aagttggcta gattcatgac cgatgatttc tggaacaggt ggttgaagga 5580 atatcttcct gtcatcacac ggcgatgcaa atggttcaag gatgtgaaaa atatcgaagt 5640 aggtgatttg gtattcgtgg tcgacggcaa gatgagaaat cagtgggttc gtggacggat 5700 cgaggaagtc taccctggac gagatggtag agtacgacag gcgttggtgc gaacgtcgac 5760 aggggtcatc cgaagagcag cgaccaaatt agctgttctt gatgtcttgg agaagtgtaa 5820 accttggagc cttggagatc cagagcgcct agatcctcag caaggtttac gggcgggggt 5880 g 5881 // ID Gypsy-8_SI-I repbase; DNA; INV; 4512 BP. XX AC AEAQ01018735; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_SI_; KW Gypsy-8_SI-LTR; Gypsy-8_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4512 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01018735; Positions 6198 1687. XX CC Positions [3395-3862] - Integrase core CC 'TTGTC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1787..2926,2930..4357) FT /product="Gypsy-8_SI-I_1p" FT /translation="MLEKEGIITKIDCVDWGSPLVIIPKPDGSVRLCVDFK FT IAVNPQLKGAHYPIPRVNDVLNNFRNANFFCKLDIFKAYLHVGVDDESKAI FT QTISTHCGTYKMNRLSFGIKTAPSEFHRILDQILSGLEGTIAYFDDIVVFS FT QTYEECKQRLITCLERLRAYNLHLNIKKCRFFEKEISYLGYVISGNQISKC FT PEKTKAILKSPRPENVDDVRKFLGLFTYYSRFIPDVSSITFPIRELLQTKK FT RFVWSSKCEAAFIKLKNEIASDRVLVPFDPSLPVTVSCDASPVGIAGALSH FT IINGIERPVEYMARALTAAKRNYSQLDREAVTIVFSIQKFYRYLYGRSFTL FT ITDNRPLTRIFQQDAKLPAITSTRLLRYATFLSNFDYVKHRKAEEHGNVDY FT LSRAPWESSPVIQDEDEEINEQVINQISTSAITSETIAEETSKDKELSKLR FT EELASGKIYDPIYSLHNNIVFRGRRVFIPASLRSEILKELHCIHPGISKMK FT NLARRYCYWRNIDKEIKELVRSCPECAKIQNEPKKVSLHHWKDPKINFQRV FT HIDYAGPFQGHQFFIVVDAKSKWPEVRIIERNPTSTSTIHFLENIFSSHGL FT PEVMVSDNAAIFKSEEFRQYCKNNGIFQKFIAPGHPATNGLAERYVQILKR FT KLKAMENKPGTMTTKIEKILYRFRATPLQEGKSPSELYLNRQIRTKLDLWH FT PSHTSQNLVKNSNTRQLSVGDRVLSRAYGRTERWKTGSITQRLGRLNYIIL FT LDDGYIIKRHINQLRPSGIPKANVEEKSVCFNPKIRYWYPVQDDEPEVQPP FT HQQAQDPEEMPIPQTQNTQAANEQNMSRGNLKPILRRSERIRKSPIHLKDF FT VQK" XX SQ Sequence 4512 BP; 1664 A; 894 C; 813 G; 1141 T; 0 other; attggcgacg aggtaaacat cagaatagag aatcagatct tgtagaatca gatccttttc 60 acgccaacgc cacataacct catcagcaac cgacggtcca gaaagaagca taccaccaag 120 tcagtcggaa tcaaaaaaat aatggacacg caacaattcc aggagttcat gaaagtattg 180 acatcatgct tcgaaagtta agcacgaacc caacaggcaa gtcaacaccc aataaatccg 240 aacaataatc tagtatcaaa tttcgggagt ttcgattcag acaaggaaga cttctcacat 300 tataaacaac ggctggaaaa ctatctagag ctaagaaacg tattcaatga caaagacatc 360 tgtgccaaag tcttactgaa ctgcatctga tgaaaatact acgaactact aatgtcttta 420 accgctccaa acttacctac agaaaagacg tacgagcaat taatagaact cttggaaacc 480 tatttgtgcc caaaaccgaa tgtggtggta caacaacatc gttttctttc gtgcattcaa 540 aagccggaag agaatatcgc atcatacatt gcggaactaa gaaaattcat tgtgtcctgc 600 gaattcaaat gcaattgcgg acgcacagtg gcagacatct tcctacaagc acaatttatt 660 cggggacttc gagatgcaac gattcgagag aaactattgc agacaacgga gctaaccttc 720 aaaaaagcgg tggatacagc tctcgcactg gaagcatcaa aattagataa cgtcgaaata 780 tcaggtactt cgaagaaagc aaccgaaact gtaaatcgca tttctaaaac taaattcaaa 840 actaacaaac gttcaagccg tctagaaaag cgtgtttatc cacagaaaaa tcgttcaaat 900 agtcaatcgc attcaaaaag taaaatcgat tactgtgcat aagggttaga aaatgtatgc 960 atacgttgcg ggcgagataa tcataaagtt caagactgtc gtgtaaacta caatagcgta 1020 aaatgtcact cctgtagtaa accgggacac gtcgaaaagg tttgcaattc gaaattattg 1080 aaaacaaaga acagtaatgt acgtaaagtc gaagcagaaa tccaggaaag taataaaagc 1140 ttatacgata ctaatagtat tagaaatttt cacggaatta atactgtaat tgatatttat 1200 gaaaaatcag ctcataaagc aaacgagaca aaaaaaattc tatacaaata tattaattaa 1260 tggtaagagt caaaagttcg acgtagactc gggagcggga tttaccctct tgccaaaaga 1320 ttaccttgag gagctcaaaa tatcaactcc gatacagcca acagatatcc ggtttcgctc 1380 atacaccgga aaaatttttc aaccaatagg tgttatggaa gtttcaagaa tataaaaagt 1440 caaagaatat aaaaagtcaa aaacaaaaga actcatgttt attgtaacaa aaggaacccc 1500 actacttggt agaacgtgga tacgacatct caaaattaat ttactcgaac ttgacacaaa 1560 tagtcattct gaaagttcaa actcggattt tattcatttt cttagcacgc agagcgatga 1620 cacaaataat attttaaatc aattcaaaga tatttttgaa caaaaaatag gatgcattcc 1680 aaattttaca tgttccttga aattatgaga taatgccaaa ccagtattct taaaagcccg 1740 cgatatccca tttgcacttc gaaataaagt cgaaaacgaa cttaatatgt tagaaaagga 1800 aggaattatc acaaaaatag actgtgtcga ctgggggtct ccattagtaa ttatacctaa 1860 accagatggt tccgtccgac tatgtgtcga ctttaaaatt gcagtaaatc cacagttaaa 1920 aggagcccat tatccaattc ctagggttaa tgacgttttg aataattttc gtaatgcaaa 1980 ttttttctgt aaattggata tatttaaggc atacctgcac gtaggcgtcg atgacgaaag 2040 caaagccatt caaaccattt caactcattg cggcacgtac aaaatgaacc gattatcctt 2100 cggaatcaaa acagcgccaa gcgaattcca taggatctta gaccaaatat taagtgggct 2160 tgaaggaacc atcgcatatt tcgacgacat tgttgtattc agccaaacgt acgaagaatg 2220 taaacagcgt cttattactt gtctagaaag actcagagca tataacttac atttaaacat 2280 aaaaaagtgt cgattttttg aaaaagaaat ttcctattta gggtatgtca tatcaggaaa 2340 tcaaatttca aaatgcccag aaaaaactaa agctattcta aagtcaccta ggccggaaaa 2400 tgtcgacgac gtaagaaaat ttcttggatt attcacgtat tattctaggt ttattccaga 2460 tgtttcgtca attacatttc ctatccggga attattgcaa accaagaaga gatttgtctg 2520 gtcctcgaaa tgtgaagcag catttattaa gcttaaaaac gaaattgcca gcgatcgcgt 2580 cttagtgcct tttgatccgt ccctgccagt cacggtatca tgcgatgcaa gtccagtagg 2640 aatcgcagga gcactgtccc atatcatcaa cggcattgaa aggcctgtag aatacatggc 2700 tcgtgctctc acggcggcca aacggaatta tagccagctc gacagggaag ctgttactat 2760 cgtattttca attcagaaat tttatagata cttatatggc cgatcgttta cgctaataac 2820 agacaaccgt ccgctaacaa gaatttttca acaagacgca aagctacctg ctatcacatc 2880 gacacgatta ctacgttatg caacgtttct aagcaatttt gactactagg taaaacatcg 2940 gaaagcagag gaacacggta acgtcgacta cctgtcacga gctccgtggg aatccagtcc 3000 ggtgatccag gatgaagatg aagaaataaa cgaacaagta atcaaccaaa tttcaactag 3060 tgcaatcaca agtgaaacta tagcagaaga aacgagcaaa gataaagaac tttcaaagct 3120 aagagaagaa cttgcctccg ggaaaatcta tgatccaatc tacagcctac ataataacat 3180 tgtcttcaga ggtcgacgtg tttttatacc cgcatccttg cgctcagaaa tcttgaaaga 3240 actacactgc attcatcctg gtatatcaaa aatgaaaaac ttggccagac gatactgtta 3300 ttggcgcaat atcgacaaag aaattaaaga gctagtgcgc tcatgtccag aatgcgcaaa 3360 aatacaaaac gagccaaaga aagttagcct tcatcattgg aaagatccta aaataaattt 3420 tcaacgcgtt cacatagact acgctggacc atttcaagga catcaatttt tcattgttgt 3480 agacgcaaaa tcgaagtggc cagaggtcag aatcatagaa aggaatccaa catctacttc 3540 aactatccat tttctggaaa acatctttag ttcacatgga ttgcctgagg tgatggtatc 3600 tgacaatgct gcaattttta aaagcgaaga atttaggcag tactgtaaaa ataatggcat 3660 ttttcaaaaa tttattgctc ctggacatcc tgctacaaat ggattagcag aaagatacgt 3720 acaaattctt aaaagaaaat taaaagccat ggaaaacaaa ccaggaacaa tgactacgaa 3780 aatagaaaaa attctatatc gttttcgggc cacaccatta caagaaggga aaagcccatc 3840 agaactctat ttaaacaggc aaattagaac aaagttagac ttatggcacc cctcacatac 3900 aagccagaac ttggtcaaaa attcaaatac aagacaatta agcgtgggag acagagtgct 3960 atcacgagca tacggaagaa ccgaacgttg gaaaactggc tcaataacac agaggctagg 4020 acgactgaac tatataatac ttttagatga tggctacatt ataaagcgac acatcaacca 4080 gttacggccg tccgggattc ctaaagccaa cgtagaagag aaatcagtct gttttaatcc 4140 gaaaatcagg tactggtatc cagtacaaga tgacgaaccg gaagtacaac cgccgcatca 4200 acaagcacag gatcctgaag agatgccgat tccacaaaca cagaacaccc aagcggcaaa 4260 tgaacagaat atgtctcgag gaaacctgaa accaattctt cgtagatctg aaagaatcag 4320 aaaatctccg atacatttaa aagattttgt tcagaaataa gattgttcca tactccatat 4380 tttagttata aatacgcgat aagacttttt tcttattttt actataccaa attagattac 4440 tgtattactg tattatcagt ctatcacatt ataattatct tcataattca cagcataata 4500 agcgtgggag at 4512 // ID Copia-1-LTR_BF repbase; DNA; INV; 190 BP. XX AC . XX DT 16-JUN-2009 (Rel. 14.06, Created) DT 16-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE LTR retrotransposon from Branchiostoma floridae: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSD; KW Copia-1-LTR_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-190 RA Kapitonov V.V., Bao W. and Jurka J.; RT "LTR retrotransposons from Branchiostoma floridae."; RL Repbase Reports 9(6), 1167-1167 (2009). XX DR [1] (Consensus) XX SQ Sequence 190 BP; 51 A; 35 C; 47 G; 57 T; 0 other; tgttagttta cttagtttag aataatccac ccccctcacg aatggtacag tccgggtctg 60 gaatgttctt tgtagagtga ccttgagtga atacacatgc aggactgact cgtgagtatg 120 tgtgtcagcg tgtttattgg acatccatgt atatccctaa gttaagaggg agcaggaagt 180 cagattaaca 190 // ID TPRP1 repbase; DNA; INV; 623 BP. XX AC J03991; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.parva repetitive element. XX KW Repetitive element; TPRP1. XX OS Theileria parva OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida; OC Theileriidae; Theileria. XX RN [1] RP 1-623 RA Allsopp A.B., Carrington M., Baylis A.H., Sohal S., Dolan T. RA and Iams P.K.; RT "Improved characterization of Theileria parva isolates using the RT polymerase chain reaction and oligonucleotide probes."; RL Mol. Biochem. Parasitol 35(2), 137-147 (1989). XX DR GenBank; J03991; Positions 1 623. XX SQ Sequence 623 BP; 158 A; 131 C; 133 G; 201 T; 0 other; gtccttgtag gaatgggtct ggtctactgc atatatcctg ccatagctcc tggaatgatt 60 gttccatttt acctcattga taagattgag atggttctgt tgatagctac catatttcca 120 gctctgtatg tagcaattgc cagaagtggt aaactgatac caggttttgg tggtttcttt 180 gatcccgtgt ctcctacatg taattgggga gccaccaata aacctgtttg gctaccaagt 240 gatttaggca ctggatacca ctggcaccta actgacctag tgattcccac catgatcatt 300 ttagcctatt tatttattta ctcactgcat tacagagact catcagttgc cagatcaatc 360 atcaaccaac ccaaaatgtc tacatgtcta accattctat tctatatgtg tcatgagatc 420 tcattggctg taggatttcc aggtatcttt ggtggtaatg gaggtggtag catcttagcc 480 ctaacggcac agctgatagg tgcatttctg atgtgtctgt tagctccata ttctgaaggt 540 tacattattg agtataagag acatgatcct agcaattggc ctacagcagg aatgactagg 600 tggaatgccc ttaggtactg gac 623 // ID Kolobok-16_HM repbase; DNA; INV; 2838 BP. XX AC . XX DT 16-JAN-2009 (Rel. 14.02, Created) DT 16-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2838 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 425-425 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 448..2232 FT /product="Kolobok-16_HM_1p" FT /translation="MNKQGNKTKQNLSRRIKRKWNGVVGNKKSSSSYILSS FT TMTSSTTQAASSCSSERKIKLNYSNTFDSEKDNYFVFINFSILKELIAKTA FT CNNCFQPLLLTDVDSSRKGFAHLFQLKCEHCDYVKRFNSSNKSINAKFDAV FT TANSPYDVNIRAIIAFREVGCGYGAIKTFSSCMNLKCISENGFQKLNKTIM FT VAYKSASEKSMLLAIKDSKKVDIAQDIPCVRVSIDGTWQKRGHNSLHGVVT FT AISGDKCIDIEVLSKYCMGCKMWNSKKGTPEYQCWIIDHQCEINHKSSSGS FT MESAGAVTIFNRSVKKNNLIYKEFLGDGDTSSFKDVKNSNPYQDFDITPIK FT LECVGHVQKRLGTRLRNLVKAHKGTKTPLSGRGNLTEKCINSMQNYYGMAI FT RQNVNNLYAMKKAVYAILFHFTNFENQQMQHQFCPRGLTSWCKYWALNNTN FT YKSKSCIPIWIKNLILPIIKDLQADDLLIKCLHGTTQNANEALNSIIWSRV FT PKHTFVSKSTIEMGTYSAVLHYNDGANGVLEVLKYFGLNGIVTLASSSKVD FT KTRIRHMKSKSTDKSKMQRKKIRAVKKGFIDDQQIKETTDSYISGGF*" XX SQ Sequence 2838 BP; 1072 A; 389 C; 426 G; 951 T; 0 other; ggtggcacac aactgaaaaa aagcaagttt tttgaaaaaa atcaattttt agatttttgg 60 gtattttgat tctataacca ttccttgaac atataaaaaa aaaattagtc tcaaaaatga 120 tataaaagtt gatttaatat cattttagta gatgacactt gaaaataaat tccctaacaa 180 cgccttagca accatcagtc ataggagttt ttctgaattc aaaacagtaa aacaccatca 240 caaacttggt tttaatttgt tatacgcaga ttatacgcag actcttgtta gttttccgtt 300 ttaaaagtca taattcaaaa ttttatttaa gaacaagagt gtttttaatc tgttaatgtt 360 agtcttttat actgtaactt ttctctcatt ttataccaaa taatttaaaa agtgatttca 420 atttagttgg aataaagtta tttttaaatg aataaacaag gaaataaaac taaacaaaat 480 ttatcaagaa ggataaaacg aaaatggaat ggtgttgttg gaaataaaaa atcatcttct 540 tcttacatct tatcatcgac gatgacatcg tccacaactc aagcagcaag tagctgttct 600 agtgaaagga aaatcaaact caactactcg aatacatttg attctgaaaa agataactat 660 tttgttttta ttaatttttc aattcttaaa gaacttatag ctaaaactgc atgcaacaac 720 tgctttcagc cattactttt aacagatgtt gattccagta gaaaaggatt tgcacatttg 780 tttcaattaa aatgtgaaca ctgtgattat gttaaaagat tcaattcatc aaacaaatca 840 ataaatgcta aatttgatgc agtaacagct aactcaccat atgatgttaa tattcgtgca 900 ataatagctt ttcgagaagt aggctgtggt tatggagcta taaaaacatt ttcatcttgc 960 atgaacttaa aatgtatatc tgaaaatgga tttcaaaaat taaacaaaac tattatggta 1020 gcttataaaa gtgcttcaga gaaaagtatg ttattagcga ttaaggactc taaaaaagtt 1080 gacattgctc aggatatacc atgtgtaaga gtttcaattg atggtacatg gcagaaacgc 1140 ggtcacaatt cattgcacgg tgtagttact gcgatttctg gtgataaatg cattgatatt 1200 gaagttttgt ctaagtactg catgggctgt aaaatgtgga acagtaaaaa aggaactcca 1260 gaatatcagt gttggataat tgatcatcaa tgtgaaataa atcacaaatc ttcgtctgga 1320 agtatggagt ctgcaggagc agtaacaatt tttaatcgct cggttaaaaa aaataatcta 1380 atttataaag agtttcttgg tgatggagac acttcttcat tcaaagatgt caaaaactca 1440 aacccttacc aggattttga catcactcca ataaaacttg aatgtgtagg tcatgtgcaa 1500 aaaagattgg gtacacgact tagaaattta gtcaaagcac acaaaggcac taaaacacct 1560 ttatcaggta gaggtaacct aacagaaaaa tgtataaatt ctatgcaaaa ctattatggc 1620 atggccattc gtcaaaatgt caacaacttg tatgctatga aaaaggcagt ttatgctatt 1680 ttatttcatt ttaccaactt tgaaaaccag caaatgcaac atcagttttg cccaagagga 1740 ttgacaagtt ggtgcaaata ttgggctcta aataatacaa attacaaatc aaaatcatgt 1800 atacctatct ggattaagaa tcttatttta ccaatcatca aggatcttca ggctgatgat 1860 cttctaatta aatgtttgca tggaacgacg caaaatgcca atgaagcatt aaattctatt 1920 atatggtctc gcgtaccaaa gcacacattt gttagtaagt ctacaattga aatgggaaca 1980 tactcagcag tgctgcatta taatgatggt gctaatggtg tattagaagt tttaaaatac 2040 tttggcttga atggaattgt tacattagct tcgtcaagta aagttgacaa aaccagaatt 2100 cgacatatga agtcaaagtc aactgataaa agtaaaatgc agagaaaaaa aataagagca 2160 gttaaaaaag gtttcataga cgatcagcaa ataaaagaaa caactgatag ttacatctct 2220 ggtggatttt gataattttt atgttcttat atattttttt cttatttttg cttttttctc 2280 aatatgtatt tttttaaact ttgaaacaca ttttctcaat aatcaaactc aaacaatctc 2340 attaacaaaa cagaattaca taataaaatt ttcaggattt gtttatcata atataatact 2400 tcatttaacc ctcagaattt tcataaaaat ttatagaaag tgaaatattg atgtttatag 2460 tgctggattt attgatattt catatatata tatatattat atatatatta ggtgattttg 2520 ttagaaatgc aatatttcat gaaataattt aaatcataaa aattcccacg gtcaagcagg 2580 gtgaaatatg attaggaagc tatgttcaaa attttagttt cagttgttaa ggcaaaatta 2640 agttattagg ttttgaagat ttactaaaaa cacatgtttt acaacacatt tttggccatt 2700 atggataata aaattaatta taaaaaaaaa ttatctgttt tttattttat tttaatataa 2760 attaattgtc attaaaaaaa aaaaaaatca aaattttgat tatttacaaa aaaaacttgt 2820 tttcagttgt gtgccacc 2838 // ID hAT-2_HM repbase; DNA; INV; 4435 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4435 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1991-1991 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 933..3110 FT /product="hAT-2_HM_1p" FT /translation="MVTLKKKKCQYGKTAFICTYCNAKFCTQNSTRAKKHL FT VTDCTKSPKAVKKKFATDVMKQSFAPKTKNGILLVKSLIELDTNTTNNAGV FT SDLIIENDCFIDCCDNNYAIFKRKSGTSSTMNVANNELHSDLQECSLSNIL FT EEKKAFCSTSIYTSNQKSISYFFDKVSTQECDEINIQYARAIYATASPFRM FT SENPFWTNFFKKIRPGWSPPNRYALSHNLLDTVYNQVNKFIQDLISQSETF FT VLMTDGWSCISNDSHIQIMLATPTPIYIRSIHPKENSHTGQYIFDQIVEII FT EDKNVISNKRKIKAFCTDNAANMVAVWKLIENKYDWMSAYGCFSHGLNLLA FT CDLANKNKIISETLKQNKQLVKFFKSKHGLKSIVERCSLHSLNKSFILILP FT VVTRWSSQFYMLQRNILLRDSLRLAVTDEKAQKYLDKHNNKKGIDLKELVL FT NETFWKKTDLIFKLLKPISEAILESESDAASVSIVPQVFFYVITEISDVLQ FT NSTLLTLHQKEDIITCIKKREDFVVRPIHFAANLLDPRFVGQKLNHSEMLL FT AEQHVFRTAENETLAADIILDELMSFKARSGIFSQDNRSYIWKLNEDCNPV FT TWWKAYAPTTLLNRIALILLSCNCSIASVERANKEFSLQKSKVRNRLTDTR FT ASKLLFIAYNLKMQNKLAKRNSKKIKHSCLKTKNIDEKSQKNHSCLSIDND FT DDDDDDDDEYDSEYSSEDSIDFHIP*" XX SQ Sequence 4435 BP; 1659 A; 574 C; 659 G; 1543 T; 0 other; gggttgaaaa tttcccggaa attttgaagt cgggaaaatt tccggaaaat ttccggaaat 60 tttcaaatac ttgttaaaaa aaaaaatttc agggcttctc tgattatttt tatatgttta 120 ccataatttt actgttttaa taaagaaggt ttatataaat aaactaataa atttcagaaa 180 taaataccta ggtattcact aataatatta gtaagtagtt ctaaagtaaa tctaatggta 240 ataatggtaa aaataataat tacgacgacg acgattatta taacgatggt gatgatgatg 300 atgatgatga tgatgatgat gatgatgatg atgatgatga tgatgattat aaaaatatga 360 ttataaatta aaatactaac agctttttta aataaaatag ataagtaaat aacttattta 420 cttatttatt ttatacaata ttatgaattg gtaaaagtgc aatattatta attttttggt 480 gtataacttt atagtatttt aagtagtatt aaaactttat aatattatag atgcataact 540 ttatagtatc tgaaatgatt gctacacaaa cattcctttt tgacattttt taatgattta 600 tataaagctg tcaacctaaa ccaaagaata ggtgatacaa gagaaacaac aaataacttg 660 ccattatata tgttatataa ttaaataagt tttttactct ttttgaaata ataacaataa 720 aaacattaca aaaatacttt ataggtatat tagtactata gctataatga taataataaa 780 ttagtgttga atataaatta tttattagat ttgcagtcgg tatgaactgc ttttctaaac 840 taagtataca ttttaaatta ttctcattta tttttgtact gtaaaaattt ttttagcaac 900 aattatgcct agagaattag atcaagtctg gaatggtcac tttaaaaaaa aaaaaatgtc 960 aatatgggaa aacagcattt atttgcacat attgcaatgc aaaattttgc acacaaaact 1020 ctacacgagc taaaaaacat ttagttacag attgtactaa atctccaaaa gctgtaaaaa 1080 aaaagtttgc aacagatgtt atgaaacaaa gttttgctcc taaaacaaaa aacggaatat 1140 tattagttaa atctttaatt gaactcgaca caaatacaac aaataatgca ggtgtgtcag 1200 atttaataat agaaaatgat tgttttattg attgctgtga caacaattat gccatcttta 1260 agagaaagtc aggaaccagt tccacaatga atgtggctaa taacgaatta cattctgatt 1320 tacaagaatg ttctttatca aatattttag aagaaaaaaa ggctttctgt tcaacttcta 1380 tatatactag caatcaaaaa tcaatatcat atttttttga taaagtttca acacaagagt 1440 gtgatgaaat aaatatacaa tatgcaagag caatttatgc aacagccagc cctttcagaa 1500 tgtcggaaaa tccattctgg actaattttt ttaaaaagat tcgaccaggt tggtcacctc 1560 ccaatcgtta tgccctgagt cataatcttt tagacacagt atataaccaa gtaaataaat 1620 tcatacaaga tttaatatcc cagtcagaaa cttttgttct gatgacagat ggttggtctt 1680 gtataagtaa tgattcacat atacagatta tgttagctac tccaactcca atatacatca 1740 gaagtattca tccaaaagaa aactcccaca ctggacaata catattcgat caaattgttg 1800 agattattga agataaaaat gttatatcaa acaaaagaaa aataaaagct ttttgcactg 1860 acaatgcagc taacatggta gcagtttgga aattaataga aaataaatat gattggatgt 1920 cagcatatgg ctgtttttct catggcctaa atttattagc ttgtgatctt gcaaataaga 1980 acaaaataat ttcggaaaca ctgaagcaaa ataaacaact agttaagttt tttaaaagta 2040 agcacggtct aaaaagtatt gttgagcgtt gtagcctaca ctccttgaat aaaagtttta 2100 ttttaattct tcctgtagta acaagatggt catcacaatt ttatatgtta caacgcaata 2160 ttctactccg tgattcattg cgactggcag taactgatga aaaagcacaa aagtatcttg 2220 acaaacataa taataaaaaa ggaattgact taaaggaatt agtattgaat gaaacctttt 2280 ggaaaaagac ggatttaatt ttcaaacttc tgaaacctat atcagaagcc attttggaat 2340 cagagagtga tgcagcgtca gtatctattg ttccacaagt ctttttttat gtaataactg 2400 agatatctga tgtgttgcaa aactcaacac ttttgacttt acatcaaaaa gaggatatca 2460 ttacttgcat taaaaagcgt gaagactttg ttgtaagacc gatacacttt gcagcaaatt 2520 tacttgatcc aagatttgtt ggtcaaaagt taaatcattc agaaatgctt ttagctgagc 2580 agcatgtatt cagaacagct gaaaatgaaa cactagccgc tgatataata ttggatgaat 2640 taatgagttt caaagctcgt agtggaatat ttagccagga taatcgcagc tatatttgga 2700 agcttaatga agattgtaat ccagtcacat ggtggaaagc atatgcacca acaacattat 2760 tgaatagaat tgcattaata ttattgtctt gtaactgcag cattgcatct gttgagcgtg 2820 caaacaaaga gttttcattg caaaaatcta aagtaagaaa tcgcctaacc gatacaagag 2880 catcaaagct tttatttatt gcttataatt tgaaaatgca gaataaattg gctaaacgta 2940 attctaaaaa gataaaacat tcatgtttaa aaacaaaaaa cattgatgaa aaaagccaaa 3000 aaaatcacag ctgcttatca atcgataatg atgatgatga tgatgatgat gatgatgaat 3060 acgatagtga gtattcaagt gaagatagta ttgactttca cataccatga aaatggctaa 3120 gcattaacgt ttgcaaaaac atgctatttt tttataatag tctagttggg gagtttaaat 3180 aaaatatata tataaaatta tttttttatt tttatttatt agttaaaaac ccactataca 3240 agtaagctaa attaaagttt agttcaggat aaatgaaaaa tcttaaagtt tttttttttt 3300 ttttatcaag attttcattc aaacatgatc atctaggaat tttttttttc aagaatgtcc 3360 ttgaagattg tttttttgtt cctttgagtt ccagaactca actgctcatt ttagagtttg 3420 aagatccgtt tcaggaatta catctattaa gtcaatgatt ttttttttta ctttgcatta 3480 ttttctctag atttagttca tgactattgc acaaaccttt gacactacct aaaaaaggtt 3540 ctgaagctgt tttgtttttc agctgctact ttacaatatt aaatgtcatt ttattgctct 3600 tcaggtgaat aaataagttt agcttgtttt gttgtgtaat tattgataca tactcaataa 3660 aaaataaggg atgaataaaa tttatgcaat gttttaaaac gttaaggagt gacttcagtg 3720 cgtttcttcc aactatatca agcttttgca ttaatatttt tttatgttca caaagcttag 3780 accataaggt agtggaacaa ccaaagtttt atataactgg acaaaggaat ttgggtgaac 3840 aaaacggatt ccagattgaa ttaaagtttg gattactgcc cggattttag gaaaggtgtc 3900 ccatataaag cctagatgac ttaatttatc gtgaagtttt atttgttcga tattgtgcgg 3960 tttttaattt gcttttttcc ggtaagtatg aactcggttt tgtcagcatt caattttatg 4020 cagtttttgt gtatatatta caggtattaa taagcttttg taaaaaaatt ccaaccctat 4080 taaatactta aagaaagatg tatatatata aaaaaagatg ttgtgcctat agaggtagca 4140 tgtttttaat aaatttcttt ttttattaga ttttaaacga cttagaaatt ttaaataaac 4200 ttaaaattta aattttctcc gaaacgtctt gtggttttta gtttttatgt atttaagcaa 4260 aaaaaaaaaa tgacagttat ctcattttta agagatttga tgcttaattc tttgcaattt 4320 ctttaataca tagtagtaaa ttaaaaatta aaaaaaaaaa aaattcaaaa tttccggaaa 4380 ttttaagcaa attttcccgg aaaatttccg gaaaatttcc ggaaattttc aaccc 4435 // ID Gypsy-41_AA-I repbase; DNA; INV; 6269 BP. XX AC supercont1.338; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_AA_; KW Gypsy-41_AA-LTR; Gypsy-41_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6269 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.338; Positions 310506 316774. XX CC 'GGGGGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1988..4105 FT /product="Gypsy-41_AA-I_1p" FT /translation="MFRKIKMFVYLMYKSQFKCLHPMLLILVLSIKKICPN FT NYEKIFFYIAILQMEEIRQIPPFRCETIEKGKISREWESWKWSLECYFEAY FT GVTDQKLKRAKLLHLGGVDLQRIFRSLPDHDKTPLVTLEPKVYDLAIELFD FT SYFQAGRQDVIERRRLRKIKQEHNEKFSHYVIRLRQQALNCGFEKYSAEVG FT EILKEIYLIDVVVENCRSDELRKAILKRDRTLREIEEIAATIEDTDQQLKD FT LKENGNIVRDTPVFGIDNARTVKPASKRLYLDEGNDRAPFRSFPKRSVFQE FT PSRGSSFQRKEKVVCYSCGYQGHLSKSSECPAKGRTCRRCRGLDHFEAMCR FT KRKQSSEVRGKPKKVFNVQHVPETNNPCDEEQEEISENSTKVYYAFYSGNE FT SNTLDCVIGGFSLKVLIDSGADANLIRMETWNMMKQERIQVLKSSKGCSRV FT LKGYGSDRPLSIVGTFEAKVTIGRKTTVAEFFVVKGGQKDIIGDLTAKRLG FT VLKVGIEVNNVDAKVKPFSKIKDVQASISIASDARPVFQPLRRVPIPMEEA FT VNRKIDQLLQRDIIEVKQGPATWVSPLVIVGKASGEPRICLDLRRVNEAVI FT REHFPMPVVDEYLARLGTGKVWSKLDIREAFHQVELAEDSRDYTTFITSRG FT LFRFKRLPFGLVTAPEIFQRIMEEMLSGCEGTYWYLDDIIVEGETKEIHDL FT NLQKVE" FT CDS 4496..6244 FT /product="Gypsy-41_AA-I_2p" FT /translation="MASAAKLGFYAVKDKTMVMADASPTGLGAVLLQENDK FT GEPRVICFASKSLTDTERRYCQTEREALALVWSVERFSMYLYGKNFDLMTD FT CKVLQYLFAPRSRPCARIERWVLRLQSFDYQVVHIPGPQNIADCLSRLATV FT ASHAFDPEEELMIREIATMAGSAVALKWNEIKHESEKDEEIAEIIQILKSG FT NQQDLPLVYRVIVNELSSIDTVLLRMDRIVIPNTLRNRVLQIGHEGHPGMK FT MMKSHLRTNVWWPKMDNEIEKFVKQCKGCTLVSAPNPPEPMIRRELPTQPW FT LDVAADFLGPLPEGQYLLVVIDYYSRFMEVSEMKEITASETIRELAAIFGR FT YGLPTTLRTDNGPQFSERCDEFREFCDSSGITLINTIPFWPAMNGEVERQN FT RSLLKRLRIAQELGKDWRIELRKYLLTYHSTSHSTTGSSPAELMFGRKIRS FT KLPVVPAMALDDGEIRDRDRVVKEKGKIYADIKRKAKDSDIEIGDRVLAKR FT MKKNNKLDADFSPEEFEVIRKMGADTTVRSCQSGKEYRRAVTHLKKVEVSE FT SPTTSGTNNLQGTENIQGRARRNRTEPAKHKDYISH" XX SQ Sequence 6269 BP; 2078 A; 1011 C; 1439 G; 1741 T; 0 other; attggcgacg aagataaagg taggaaaaca gttaaaaaga taaaaatagt gaaggttgca 60 ttgtgatttg gaaaaaaaaa tcaaatttga agcgtgatct gcttcccgct atcgtatgaa 120 aaaatggagt atcggatgaa gtgcacataa attgtggaat ttttgagtga attttacctg 180 aattatgaat aattaattga taaatgtaac taaaatgtaa tatttccaaa tgcagcagtt 240 tcatgcattg aattaaaaaa aatgaaatga agagaccaaa ttggtgattg agaagtgaaa 300 aaatcagctg ttcctatggt aacagaatca ccctctacca cccacttacg tacctgagtt 360 gatcagttga tagtatgtgt gggttaacta gaccatcatg ttgcagcaac gaaaaaaaaa 420 aaaaatatgc cgtcttcttg ccagtacttc actgtgttgg ctatttttgt tcacttacct 480 attccttttc aagagcaaga cttcattctc agaactgtgt tattcacgaa aaaatggttc 540 aaaagtttgg ttttctttct aaacactatc aaaattgttg acaaacttga aggaggcgaa 600 cggcaaatcg acagggtgaa atgttgagcc tacctgcttt tcatgcgtaa cacgttcgtg 660 tatgtgtatt ggtgatatga atgtatgaat gacatagatt ttttttctgg agtgatgtat 720 gtttctcctc gatctgtcat atatggactt ggttgtgttg atggtttcta aacaacatta 780 gtcatcagaa actgtcattc atataaaata ctcaaaacaa aacagaatcg gctcggaata 840 ctctatgaaa aatgaagatg ggtagaatca tgaatgaaag gttccgctaa ttgacaccaa 900 gttatgtctg gaagaataac ggttgcggtt gatgaattga gatgcactgt tgaactatga 960 acgatatatt tggctaaaat tgatttgaaa gaattgaaat ttatgcgtat taatggcaaa 1020 gagtaaaaac aaagaaataa agcactaaaa aaaaaaaaag gttttctgat tgtttttttt 1080 tttgtatata tatttataat cgtaaattaa acgttttatt tgtttgttgt aaaatgttca 1140 tacgtaaatt atgtattgat attggttgaa accgcagcaa taagggatta tgacaaaaca 1200 cttgtagcgt gttgttttgt aaattaagat gcgaaccagt agcgttcagg aatattcgtt 1260 tgaaccagta gcgttctaag gatacaaaga accagaagcg ttcttaggat acaaagaacc 1320 agaagcgttc ttaggataaa aggaaccagt agcgttctaa ggatacaaag aaccagaagc 1380 gttcttagga tacaaagaac cagaagcgtt cttaggataa aaggaatcag tagcgttttt 1440 agtatgtgtg aaaacgttga acgtcaacta gttgcgattc acaggatgta ttcgaccagt 1500 agcattcttt ggatgcaatg aaccagaagc gttcttatga tacaatgaac cagaagcgtt 1560 cttaggatgg tgtactgtaa gcgtgaacca ataaggattc acaggatgta ttgaaccagt 1620 agcgttctta ggatgtaatg aaccagaagc gttcatagga tttagtgaac cagaagcgtt 1680 cagaagcttg tatattatga accagtagcg tcataggatc aacataaaag aattgattga 1740 gccagttgtg ttcaaggact aatgtgtata ccggtacgtt tattcttcat aattgatcat 1800 agaatagtat ttgaaccata agcgctcgat ggatgtaatg aactaattgt gttaaaggga 1860 tttgtgaata aaccagaagc gttcatagga tgctatgaac cagaagcgtt cattgggtga 1920 tgttatacaa aagaatgtag ggttttattt catcacactg gtagcattac tttactgata 1980 gggagtaatg ttcagaaaaa taaaaatgtt tgtttatctt atgtataaaa gccaatttaa 2040 atgccttcat ccgatgcttc taattttggt attgagtatt aagaaaatat gtccaaataa 2100 ttatgaaaag attttttttt atatcgcgat attacagatg gaagagattc gtcagatccc 2160 accttttcgt tgcgaaacta tcgagaaggg caaaatttct cgtgaatggg aatcttggaa 2220 atggtcgttg gaatgctatt ttgaagcata tggcgtcacc gatcagaaat tgaagcgtgc 2280 taagctgctt catctgggag gcgtggacct ccaaagaatt tttcgcagtt tacctgacca 2340 cgataaaaca cctttagtta cattggagcc aaaagtttac gacttggcta tcgagctgtt 2400 tgattcatat tttcaagctg gtcgacaaga tgtcattgag agacgtagat tacgaaagat 2460 taagcaggaa cataatgaga aattttctca ttatgtgatc cgtcttcgtc agcaagcgct 2520 aaattgtggt tttgaaaagt attctgcaga agttggagaa attttgaaag aaatttattt 2580 gattgatgtc gttgtggaga actgtcgatc agatgaactt cgtaaagcta tactcaaacg 2640 agatcgtact ctaagagaaa tagaggaaat cgcggcaaca atcgaggaca cagatcagca 2700 gttaaaggat ttgaaggaga atggcaacat tgtacgtgat actccagtgt tcggaattga 2760 taacgctcgc acggtcaaac ctgcgagtaa gagactttac ctggatgaag gaaatgatcg 2820 agctcccttc agaagttttc cgaaacgatc ggtatttcag gaaccatcac gaggatcatc 2880 gttccagaga aaagagaagg ttgtttgtta ctcttgtggt tatcaggggc atctttcaaa 2940 gtcttcagaa tgtccagcaa aagggcgcac atgccggcgt tgccgagggt tggaccattt 3000 tgaagcaatg tgccggaaac gcaagcagtc ttctgaagta cgaggaaaac cgaaaaaagt 3060 tttcaatgtt caacatgtac cggaaaccaa caatccatgc gatgaagaac aagaagaaat 3120 ttcagagaac tcaacgaagg tatactatgc tttttatagt ggaaacgaaa gcaatacctt 3180 ggattgtgtc atcggaggct tttctctgaa agttttgatt gattctggag ctgacgcgaa 3240 tttaatccga atggaaacat ggaacatgat gaagcaggaa agaattcagg ttctaaagtc 3300 atcaaaggga tgttcgcgag tgctgaaagg ctacggaagc gatagacctc tctctattgt 3360 tgggactttc gaagccaaag ttactatcgg ccgtaaaacc actgtcgctg aattttttgt 3420 cgtaaagggt ggacaaaaag atataattgg agatcttaca gctaaaaggt tgggagtgtt 3480 gaaagtgggt attgaagtga ataatgtcga tgccaaagtc aaaccattca gcaaaataaa 3540 agacgttcag gcttccattt caattgcctc agatgctaga cctgtttttc aaccgttacg 3600 ccgagttcca ataccaatgg aagaagctgt gaacagaaaa atagatcaac ttctgcaacg 3660 cgacattatt gaagtgaagc aaggaccggc cacatgggta tcgccattag ttatcgtggg 3720 aaaagcatct ggtgagccaa gaatttgcct ggacttgcgc cgagtaaatg aagctgtgat 3780 acgtgagcat tttcctatgc cagttgtcga tgagtacctg gcgcgacttg gcactgggaa 3840 ggtgtggagt aaacttgata ttcgagaagc gtttcatcaa gtcgaactag cagaagactc 3900 gagggattac acaacgttca tcaccagtcg tggattattc agatttaagc ggcttccgtt 3960 cggattggtg accgcgcctg aaatttttca gagaattatg gaagaaatgc tttccggatg 4020 tgaaggtaca tattggtact tggatgacat tatcgttgaa ggagaaacta aagagatcca 4080 tgacctgaat ttgcaaaagg tagaataaaa tatctatatt gaactgtaaa aaaaaaaaaa 4140 atatatgtaa atgtcctaat aattttttta ggttcttcag aggctcaaag agagaagtgt 4200 agagttaaac tgggataaat gtgagttcgg ggttaacagg ttagagtttt tagggcatcg 4260 tatttctgag gaaggtatta gtccatcgaa tgcgaaagtt aaagctgtgt tatctttccg 4320 tccaccttgt acggaatctg aagtgcgcag ttttctcggt ttagcaaact acttgaataa 4380 gttcatcccg gatctagcta cactagatga acctcttcgt gagcttacga agaaagacgt 4440 gaagttcgaa tggagcgata aacacaaaga agccttcgaa gaaatcaaga aaaagatggc 4500 gtcagcagct aaacttggct tttacgcagt aaaagataaa actatggtga tggctgatgc 4560 tagcccaacg ggtcttggag cagtgttgtt gcaagaaaat gacaaaggcg agcctcgtgt 4620 tatatgtttc gcgtccaaat cattaacaga tactgaacgt agatactgtc aaaccgaacg 4680 agaagctctg gcgttagttt ggagcgtaga gcgtttcagc atgtatttat atggcaagaa 4740 tttcgatttg atgacagact gcaaagtttt gcaatatctt ttcgcaccac gatcacgtcc 4800 atgtgcacgg atcgaacgat gggtattacg tttgcaatcc ttcgattatc aagtagtaca 4860 cattccagga cctcaaaaca ttgcggattg tctctctcgt ttggcaactg ttgcatcaca 4920 tgcttttgat cctgaagaag aattgatgat cagagagata gcaactatgg caggatcagc 4980 agtagcgttg aaatggaatg agatcaagca tgaaagtgag aaggatgaag aaatagcaga 5040 aatcattcaa atattgaaga gtggtaatca gcaagattta ccgttagttt acagagttat 5100 tgttaacgaa ttgtcaagca ttgatactgt gctattgagg atggatagaa ttgtaatacc 5160 aaatacttta cgaaatcgtg tactacaaat tggacatgaa ggacatccgg gaatgaagat 5220 gatgaaaagt catttaagaa caaatgtttg gtggccgaag atggataacg aaatcgaaaa 5280 attcgttaaa caatgcaaag ggtgcacttt agtatcggct ccgaatccac cagagccaat 5340 gatacgacga gaactgccga cccaaccgtg gttggatgta gctgctgatt tcttgggccc 5400 acttcccgaa ggccaatatt tgctggtggt tattgactac tacagtcgat ttatggaagt 5460 cagtgagatg aaggaaataa cggcttctga gaccattaga gagctggccg caatattcgg 5520 acgttatggc cttccgacga cgctacgaac agacaatgga ccacaattca gcgaaagatg 5580 tgatgaattt cgagaattct gtgattctag cggaataaca ctcatcaata ctattccatt 5640 ttggcctgcc atgaatggag aagtagaaag gcagaataga tcgctactta aaagacttag 5700 gatcgctcaa gagctcggaa aggattggag gatagagttg cgcaagtatc ttttgacata 5760 tcattctaca agtcacagta ctaccggaag ttcaccggct gaacttatgt ttggaaggaa 5820 aattcgaagt aagttgcctg tagttccagc catggcatta gatgatgggg agattcgtga 5880 tagagacagg gtagttaaag aaaaaggaaa gatatatgca gatatcaaaa ggaaagctaa 5940 ggatagtgat atcgaaattg gtgaccgtgt tttagcaaaa agaatgaaga aaaataacaa 6000 attagatgca gatttttcac cagaagagtt cgaggtaatt cggaaaatgg gagcagacac 6060 aacagttaga tcttgccaat ccggaaaaga atatcgacgt gccgttactc atctgaaaaa 6120 ggttgaagtg tctgagtcac caacaacttc cggtaccaac aatcttcaag gcactgaaaa 6180 cattcaaggt cgagcaaggc gtaatagaac ggaacctgct aaacataagg attatatttc 6240 gcattgaatt attaaaagct aggtgggat 6269 // ID Hoana8 repbase; DNA; INV; 3123 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana8 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoana8. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-3123 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 352..2019 FT /product="Hoana8_1p" FT /translation="MVSLWGRNWLSKILSAYLVSFSYFNHSLLDIRVSNMT FT SEAEAIPRKITSVELKTIPRGKGKSLIWNILYEIQREDGSLVDGWIFCSTC FT EKVLKYSQKQTSNLSRHKCCVQLKNPPEEKNVADADKKEAIKKCTSWIVQD FT CRPFTAVSGPGFKELVKFFIKIGATYGSNVNIEDLLPDPTTLSRNTGKEAA FT EKKVELAQNLKNAVNAGDVSATVDMWTDNYVQRNFLGVTVHFFKDARLQSL FT VLGIKSMEYQRSTADNISRKLRTLFADFNIDNVEKVKFVNDRGSNIKKALE FT EYTRLNCSTHLLSNALESSFNEATDLADTLDTCKKVVKYLKKSNMQHHLQT FT TLKNPCPTRWNSHYTMLKSIVDNWAEVNKIINETPTAPVLNENLAELKSLV FT ALLEGFERVFKELQICKSPSIFFVVPSIAKIFELCETQMTDFLSIAKLKEN FT LLNHINAIWKENLSIWHKVAFFLYPPATVIQQDLHEIKSFCVYQMEINTAE FT NDQDSNSSFILNNNNAMRLTPPSVASKQSFYTLAEGIIILVKSVQRSEGDI FT SDPIKYIYS" XX SQ Sequence 3123 BP; 1011 A; 608 C; 590 G; 914 T; 0 other; cagagaacgg cacgggtggc acttttcggc aacttatgca tgctcataaa aattcgtata 60 tggatggcac tctccactca tcagaaaaat tacatgatgc gacacacaca aattccaata 120 agtgtaatga gtgtattgcc tttctagcac ttgttcactt tgtgtatgtg tgactttaaa 180 cactcagaca ctcaggtatt ttacgctcgt aagtattaat catacttgcc acacaaatga 240 gagattagta aaaatcatga ttgtgtcaaa aaaacatgag tagtaaaaac acacatgaaa 300 gtcttgagtc ttttcagaaa cacctaggtg ttttcagatt gatctgttgc tatggtttca 360 ctttggggac gaaattggtt gtcaaaaata ttatccgcat atttagtatc attttcttac 420 ttcaatcatt cattattgga tatccgagta agcaacatga ctagtgaagc cgaagctatt 480 ccccgaaaaa ttacaagtgt cgaacttaaa acgataccta gaggaaaagg taaaagctta 540 atatggaata ttttatatga aattcaaaga gaagatggat cactggtgga tgggtggatt 600 ttttgcagca cttgtgaaaa agttttaaaa tattcccaaa agcaaacctc caatttgtcg 660 agacataagt gctgcgttca attaaaaaat cctccggagg agaaaaatgt ggcagatgcg 720 gataagaagg aggccataaa gaaatgtaca tcgtggatcg tccaggattg ccggcctttc 780 acagcagtgt cgggacccgg atttaaagaa cttgtgaaat ttttcattaa aataggagcc 840 acgtacggaa gcaacgtaaa tattgaagat ttacttcccg atccgacaac tctaagccga 900 aacaccggaa aggaagcggc cgaaaaaaaa gttgaattgg cgcaaaattt aaaaaacgct 960 gtaaatgccg gagatgtgtc agccacagtc gacatgtgga cggataacta cgtgcagcgg 1020 aacttcttag gagtaaccgt ccactttttc aaggacgcta gactgcaaag tctggtcctg 1080 ggaattaaat cgatggagta ccagaggtcc acagcggata acatttcgag aaaactgcgg 1140 accctcttcg ccgacttcaa catagacaac gtagaaaaag ttaaatttgt aaatgatcga 1200 ggatcaaaca taaaaaaagc tttggaagaa tatacccgac taaactgcag cacccatctg 1260 ttgtctaacg cgttggaaag ctccttcaat gaagcaacgg atttggcgga taccttggac 1320 acgtgcaaga aggtggtaaa atatttaaaa aaatctaata tgcaacacca tttacaaacc 1380 accttaaaaa acccgtgccc cacccgatgg aactcccact ataccatgct aaaatcaatt 1440 gttgacaact gggcggaggt caacaaaatt ataaatgaga cgccaactgc acctgtccta 1500 aatgaaaatt tagcagaact aaaatcttta gtggcgttat tagagggctt tgagagggtt 1560 tttaaagagc tacaaatttg taaatctccc tcaattttct tcgtagttcc ttcaatcgca 1620 aaaatctttg aactatgcga aactcaaatg acagactttt tatcaatagc taaattaaaa 1680 gaaaatcttt tgaatcacat aaacgctatt tggaaggaaa atttaagcat ttggcataaa 1740 gtcgcctttt tcctttatcc accagctact gtaatccaac aagatttaca tgagataaaa 1800 tctttttgcg tttaccaaat ggaaattaat accgccgaaa acgatcaaga ttccaattca 1860 agcttcattt tgaataacaa taacgcaatg agacttacac ccccctcagt tgcttctaag 1920 caatcttttt atacccttgc agagggtatt ataattttgg tcaaaagtgt gcaacgcagt 1980 gaaggagaca tctccgaccc tataaagtat atatattctt gatcaggatc acctcctgag 2040 ttgatatgag catgtccgtc tgcccgtctg tccgtttctt tggtgttttt taagttagag 2100 ggttgggact ttccacacat gttatatttg accaaaatat cttgtgtgca aaatttcata 2160 aggatcggcc gatatcctat agctgtcata gaacgatcga aattggcata actttggtgt 2220 tttttaagtt agaaagatgg gatttggtac agattactct tttggcaaaa taattcgaca 2280 tgccaaattt cataaggatc ggccatttat atacgatccg ctacatatct aataatacaa 2340 gatgcgtggc gccacctagc ggactgcgac tgaactgcaa gggtatataa acttcggctc 2400 cgctcgaagt tagctttcct ttcttgtttt ttcaaattta gtccacccta ttgactatac 2460 atccgaaacg ccattagaag aactggaaag gtattgccga gaaagagtgg ctctaaccga 2520 gggctttgaa ccaactgatt ggtggaatca aaataaaaat gcataccccc gactttctca 2580 tcttgcactg caagtccttt ctatacctgc cagtagtgcc gcctgtgaga gggtgttttc 2640 tatggccggt aatattatga ccgaaaagag gaataggctt acaccaaagt ctgtagacaa 2700 cataattttt ctccacttca tctttaaaaa ttcacgcgaa gcttaaatat tttaatttca 2760 ataaagtccc ctattatgta aattcaatgt tttttatttt ttatatgtta ttataaatta 2820 atatatatgt tcaagttatt ttaataaaac catccttgcc tattattgtt ttatgaaatt 2880 ttttttgtgc ctttccttcc actcatcttt ttttcatcag tgcctacgat tgattttcaa 2940 aaattttaca gagtaaagaa aaggcaagta ctcaaaatca tgtgtgttta cacacacccg 3000 aaattttttg acacagtaac tactcgtagc cgaaactttg tgcacgaaag gcactcggat 3060 gtataaaaca cccggtgttt ttcgggtgta ctcattgagt gagtggcatg ctgccagact 3120 ctg 3123 // ID Helitron-1_AP repbase; DNA; INV; 2907 BP. XX AC Contig2609; XX DT 21-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 15.12, Last updated, Version 3) XX DE Helitron-type DNA transposon family. XX KW Helitron; DNA transposon; Transposable Element; Helitron-1_AP. XX NM Helitron-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2907 RA Jurka J.; RT "Helitron-type DNA transposons from pea aphid."; RL Repbase Reports 9(2), 466-466 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. It can be incompletw. XX FH Key Location/Qualifiers FT CDS 265..2898 FT /product="Helitron-1_AP_1p" FT /translation="MIHGPCGNSNNRSPCMESGSCSKKYPRPFIQETQTGD FT DGYPKYRRRAPENGGFTVEINGKTLDNRWVVPYNPVLLRTFGAHINVEYCN FT SVKSIKYICKYITKGSDQAAFGFENDNDEVKLYESDRYISSSEAVWRILAF FT PIHERFPTVFHLSVHLENSQRVYFNPNDSSRLTDMINNPPKTTLLAFFDLC FT KTDDFAKTLLYVDVPSYYVWKNNRFERRKRGINVIGWPGIKRDQALGRVYT FT IHPNNTDCYYLRLLLHEIRGPTSFLKLKTVNGTIQPTYQTACKALGLLEYE FT RHWDTTMEEAVLCGSPFKLRELFAIMLIFCQLSDAISLWEKYKDSLSEDIR FT HRVELDIQPENVNSIINEVYNKCLVTIEDTVLSLGGTFLQHYGLPQPSRCE FT AVLQNKDYLREINYDSNILAQYINFNENLLNNEQSYVYNKILEGIEKNTGK FT TFFLDAPGGTGKTFVINILLARVRKDHGIALAVASSGIAATLLEGGKTAHS FT VFKLPLNLTRVETPMCNISKQSNIAQVLKDCKLIVWDECTMAHKGGFEALN FT RTLKDIRGYNSLMGGVTVLLAGDFRQTLPVIPRGTRADEVKSCLKASYLWP FT HIQKVALNKNMRVHVKGDTSAGIFAEMLLKIGDGNFPSLEGEITIPSNLCT FT VVSSLSELTSRIYPDIINIKMKPIEWLCERAILTPKNDKAAEINEILLKAF FT NEKAVEYKSFDSVIQSDDAVHYPLEFLNTLNPSGLPSHKLILKIGAPIMLL FT RNLNPPKLCNGTRLQVKALHKNVIEAIVITGCARGDIVLIPRITLIPTDYP FT FEFKRIQFPIKVCFAMTINKSQGQSLSMAGIDLREECFSHGQFYVACSRVS FT SASSLVILALKGSTKNTVYKEVLR*" XX SQ Sequence 2907 BP; 982 A; 483 C; 557 G; 885 T; 0 other; tcctacgatc gtcatgatat tatagctagg atatttcatt taaaagttaa gacgttaatg 60 aaattgttaa caaaagggaa tttgtttggg gaagtgcaat gttttatgta ttcggtcgaa 120 tgacaaaaac gtggtttacc acacatacat attttgttat ggctaaaaca gcgcatttct 180 ccagataaat tagattttat cattagtgcg gaaatacctg atcctgaaaa agatcctctc 240 ctttacggta ttatcaaagc caatatgatc catgggcctt gcggtaattc aaacaataga 300 tcaccctgta tggagtcagg tagctgtagc aagaagtatc ctagaccttt cattcaagaa 360 acacagacag gagacgacgg gtatccaaag tataggcgaa gagctccaga aaatggtggt 420 tttactgttg aaataaatgg taagacactt gataatcgtt gggtagtacc gtataaccca 480 gtgcttttac gtacatttgg tgctcatatt aatgtagagt actgtaactc tgtaaaatct 540 ataaaataca tttgtaaata tattacaaaa ggaagtgatc aagcagcatt tggttttgaa 600 aatgataatg atgaagtcaa gctttatgaa agtgatcgat atatcagcag ctctgaggct 660 gtatggagaa ttctggcttt tcctatccac gagagatttc caacagtctt ccatctttcc 720 gtgcatttag aaaatagcca gcgtgtttat ttcaacccta atgactccag tcgtttaaca 780 gacatgatta acaatccacc aaaaactact cttttagcat ttttcgactt atgtaaaaca 840 gatgattttg caaaaacact tctttatgtt gatgttccat cttactatgt atggaaaaat 900 aatagatttg agagaagaaa acgcggaatt aacgtaattg gttggccggg aataaaacga 960 gatcaagctc tgggtagagt atataccatt caccctaata ataccgattg ttactatctt 1020 cgccttttac ttcatgaaat ccgaggtcct acatcttttt tgaaattgaa aactgttaac 1080 ggcacaattc aacctactta tcagacagcg tgtaaggctc taggcttact ggaatacgaa 1140 aggcactggg atacaactat ggaagaagct gtgctttgtg gttccccatt taaattacgg 1200 gaactttttg caattatgtt gatattttgt caattgtcag atgctataag tctttgggaa 1260 aaatataagg acagtctttc agaagatata agacatcgag tggagttgga tatacaaccc 1320 gagaatgtga attcaattat taatgaagta tacaataaat gcttagttac tattgaagat 1380 acagttctat ctctgggagg tacatttttg caacattatg gtttacctca accatcaaga 1440 tgtgaagcag tcttacaaaa taaagattac cttcgagaga tcaattatga ttcaaatatt 1500 ttggcacagt atataaattt taatgaaaat ttactaaata atgaacagtc ctatgtttat 1560 aacaaaatat tagagggtat tgaaaagaac actggcaaaa catttttctt agacgcacct 1620 ggaggtactg gtaaaacatt tgtcataaat attttattgg ccagagtacg aaaagatcat 1680 ggaatagctt tggctgtagc ttcttcaggc atcgctgcta cattgctaga aggaggtaaa 1740 acagctcatt ctgtatttaa attaccatta aacctaacga gagtagagac acctatgtgt 1800 aatatttcca aacagagtaa tatagcacaa gttctaaaag actgtaagtt aatagtttgg 1860 gatgaatgca caatggccca caaaggtggc tttgaagccc tcaatagaac attaaaagat 1920 ataagaggat ataatagttt gatgggtgga gtcactgtgc tgctagcagg agattttcgg 1980 caaacattac ccgttatacc taggggtaca agggctgatg aagttaagtc atgtctaaaa 2040 gcttcgtatt tgtggccaca tatccagaaa gttgcactta ataaaaatat gagggtacat 2100 gtgaaaggtg acacatctgc tgggattttc gctgagatgt tacttaaaat aggagacgga 2160 aatttccctt cattggaagg tgagatcact attccctcaa atttgtgtac agttgttagc 2220 tctttatcag aactaacttc cagaatttac ccagatataa taaatatcaa gatgaaaccc 2280 atcgaatggt tgtgcgagag agccatatta actccaaaga acgacaaggc agccgaaatc 2340 aatgaaattc tactaaaggc atttaatgaa aaagctgttg aatataaatc ttttgattct 2400 gtgatccaat cggatgatgc tgtccattac cccttagaat ttcttaatac tctgaatcct 2460 tctggtcttc catcacataa acttatcctt aaaattggag cacctataat gttactaaga 2520 aatcttaacc cacctaaatt gtgtaatgga actagattgc aagttaaagc tctgcacaaa 2580 aatgtaatag aagctatagt tatcactggg tgtgctagag gagatatagt tttgatcccg 2640 agaataacgt taataccaac tgattatccc tttgagttta aaagaataca attcccaatc 2700 aaagtttgct ttgctatgac tattaataag tctcaaggac aatcgttaag catggcaggg 2760 attgacttaa gagaagaatg tttttcacat ggacagttct atgtcgcatg ttctagagtt 2820 agctctgcca gtagtttggt tattttagcc ctaaaaggca gtaccaaaaa tactgtttat 2880 aaagaggtat tgagatgaac acaaaaa 2907 // ID Gypsy-199_AA-I repbase; DNA; INV; 5370 BP. XX AC supercont1.68; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-199_AA_; KW Gypsy-199_AA-LTR; Gypsy-199_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5370 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.68; Positions 1932926 1938295. XX CC Positions [4352-4828] - Integrase core CC 'AGCAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1825..3093 FT /product="Gypsy-199_AA-I_1p" FT /translation="MVAFGKRFDQHPKKFSAKRKFPFNQSELGCLNCGHRD FT HQTHDTACPARGKLCYTCKRVGHFGSRCRFRKFKEKHTIPSKVRMIEPQQS FT PSTPDASGDTGVGKTYYAFYSGNETNVIECAIGGVKLDLLIDSGSDANLIT FT DEAWESLKQAKVVVHQSTKGSSKVLKGYASDRPLPIIGTFVAEIIAGERSA FT RALFYVIKGGQRCLLGDQTSKDLGILKVGLDIQNVAAQIIPFSKIKAIQVQ FT VHVDPAAKPVFQPLRRVPIPLEDAVNRKLEQLLARDIIEVKDGPASWVSPL FT VVVGKSNGEPRLCLDLRRVNEAVLRERHPMPTVDDFLARLGKGKCWSKLDI FT KEAFLQIELAPESRDVTTFITSKGLYRFKRLPFGLVTAPELFQKTMDQILT FT GCEGTYWYLDDVFVEGRDREEHDKRLEEV" FT CDS 3317..5332 FT /product="Gypsy-199_AA-I_2p" FT /translation="MSFRRPENEAEVRSFLGLANYLNKFIPNLATLDEPLR FT TLLKKGTKFEWNESHQKSFENVKIAIAEAQELGFFNKLNQTSVMADASPFG FT LGAILMQTDQAGESRVISCASKSLTDTEKRYCQTEKEALAIVWSVERFQAY FT LYGSTFNILTDCKALEFLFTDRSRPCARIERWVLRLQAFDYRVVHIPGEKN FT LADVLSRLASLPAVPFDRDDELMIKNIALCSATTAALKWDDIVRETKKDAE FT IQQVLECLGNASIQELPLQYRVVANELCRCEDILLRTDRIVIPQSLREKVL FT QIAHEGHIGIRMMKLHLRSSVWWPKMDAAVESFVKKCRGCILVSAPEPPEP FT MVRKETPNGPWQEIAIDFLRPLPDGQTLLVVVDYYSRYIEVCEMNQTTTRE FT TINQLSTIFSRFGVPLTLRADNGPQFNASCEEFQAFCEDLGVQLINTIPYW FT PQQNGEVERQNRSILKRLKIAQQLGQDWRQILAQFLLSYHATPHPTIGRAP FT SELMFGRRIRTKLPEVPIFTRVNEEHRDHDTLQKEKGKEYADGKRKARFSE FT ISVGDRVLVKRMRKDNKLSSNFAPEEYVVTRKIGADCTVKALENGKEYRRN FT VAHLKKLLDDIGDSSKTGESSLKETTTAPSNSGNTEVSTKTSSAMTEPDSQ FT EDIQRSVKRNHNVPVKFKDYLSH" XX SQ Sequence 5370 BP; 1697 A; 944 C; 1253 G; 1476 T; 0 other; aacctggcga cgaggataaa ctatcgaggt gagctcaagt ttttgatata aagttgtaaa 60 tatcgattgt ttgtgatttc tctatgactt ggaaaagcaa atggtggcgt gagtatcaga 120 atcgtaacgg ggggaaaacg atgcttgtgg tataatataa tagtattagt gagatctgaa 180 ctattttcct gtgggtgtag aagtgacaag atggaaaagg gtaaccattg aaatagtgtt 240 gcaaacacgg aaaagagaaa ctttttgttt tttttttgtt tttaattact tgctgaatta 300 tgagaaaaaa aatttacgat gatgtgggaa gtattcttcg agcgtaaaca ttgtttggta 360 cggtaaattg gggtggaatc ttaagcatta agaaaatatt aacggattgt tcccggaaat 420 atcatgtgtg tgtttcacga ctgtataaaa atgaatattg actattcata gatacctaat 480 gtatggatta catttcgaaa tatttcaata tggattattc acttataatt agaattcatt 540 agaaaacaaa tgaacatgga ttgtttgtgg acatattctg tatggattac agttcaagga 600 aaggcataac taacctgaat taaccggaca ttgactgttc atggaaactt aatgtatgga 660 ttacattttt gagatagttg aatatggatt attcagttaa agcttatgta tggattactc 720 ttatggacat ttcctgtatg gattacagtt taagaaagga catcactcaa ctgaatatgg 780 attattcaga acagtaccct atatggatta taggatgtaa acgctatgtt tacggttacc 840 aaatattgag catggattgt tcattaagta attgatagtt actgtatgga ttacagaatt 900 caaacaaggt atgtttatgg agtacttagt taagcatgga ttgctaaagc ttttgtgtgg 960 atcacataac tgaaatctca agtactttgt atttagttaa ataaatgaga aaaataaaaa 1020 gaaggaaaac aaaagaaaaa aatgttcaat ttcaaatcat ttgattcaag ttctgaaagt 1080 atatgacagt gctatggtgt tttctctttt gttaatgtct ttagatggaa gacagtcgtc 1140 cagttcctcc gttccgatgc gacgacattg agaaaaatag gctgcacaaa gagtggagaa 1200 tttggaaagg ggcattagaa tgctattttg acgcgtatga tgtgatagat cagaaaaaga 1260 aacgagccaa actattacat tttggtggac ctcagcttca gcgagtattc ctgaatctcc 1320 cagaacgtga aaagtttcct ttggtgtcca ctgaaaagca atggtacgac gtcgcgataa 1380 acgctttgga tgggttcttt caaccttgca agcaggattg ccttgagcga cacagattga 1440 gaaacatgaa acaaaagcag ggcgaaaggt gaagttgttt taaaatgacg attgaacgta 1500 aaatactaat taaataaata aaaatttagt tttgctgact ttgtgctccg ccttcgtcaa 1560 caagctagtg actgtgggtt cgacaaatat ccagaagaaa caagagatgt tcttacggaa 1620 atttttctga ctgatgtcat catagaaggc tgtttgtctt ccgagcttcg tcgtcgcatt 1680 ctccaacagg atcgatcttt ggttgagatc gaagctcttg gtgctgctct agaaggtgtc 1740 gaaaatcaaa tacaagattt tggagataag tcagggaatc aaaccagtgg tgatcacaag 1800 gtccttgaaa ttaacagcaa accgatggtc gcattcggca agcgttttga tcaacatccg 1860 aagaagtttt cagcgaaaag aaagtttcca ttcaaccaaa gcgaacttgg ttgtcttaat 1920 tgtggccatc gagatcatca aactcacgac acagcgtgtc cagcaagagg taagctctgc 1980 tatacctgca agcgagtcgg tcatttcgga tcacgatgcc gattccggaa attcaaagag 2040 aagcatacga ttcctagtaa agtgcgaatg attgagccac aacaatctcc gtcaactcca 2100 gacgcatccg gagacacagg tgtaggaaaa acgtactatg ccttttactc aggaaacgaa 2160 acaaacgtaa tcgaatgcgc tattggtgga gtgaaattag atttgctgat tgattctgga 2220 tcagatgcga atcttattac agacgaagca tgggaatcgt tgaagcaagc gaaggtggtt 2280 gtccatcaga gcaccaaagg aagctcgaaa gttctgaaag gttatgcaag tgatcgtcca 2340 ttgccaataa taggaacatt tgtggccgag ataatagccg gcgagcgttc agcacgagca 2400 ttgttttacg tcatcaaggg tggtcaacgt tgccttcttg gcgatcaaac ttccaaggat 2460 cttggtattt tgaaggttgg attggacatt caaaacgttg cagctcaaat cataccgttc 2520 tcgaaaataa aggccattca agttcaagta catgttgatc cggctgcaaa gcctgtgttt 2580 cagccgctaa gaagggtacc cattccactg gaggatgcag tcaataggaa attggagcag 2640 ctattggcca gagacattat tgaagtcaaa gatggtccgg cttcgtgggt ttctcctttg 2700 gttgtggtgg gcaaaagcaa cggagaacca agattatgtc tagatttgcg acgtgtcaac 2760 gaggccgttc tcagagaacg ccaccccatg cctacagtgg acgatttttt ggcaagactg 2820 gggaaaggaa aatgttggag caaattagac atcaaagagg cgttcttaca aattgagctt 2880 gcaccggagt ccagagacgt gaccaccttc attaccagta agggtcttta cagatttaag 2940 cgacttcctt ttggtctagt aacagcacct gaactattcc aaaaaaccat ggaccaaatt 3000 ctgaccggat gtgaaggcac ttactggtat ttggacgatg tgtttgttga gggacgtgac 3060 cgtgaagaac acgataaacg attggaagag gtatgaattt ttaagtcagt ttggagtttt 3120 tctttttaat ggattgaata aagaaatgca catgaattat ttatatttgt ttctataggt 3180 acttcgaaga ttcaaggagt ggaatgttga actcaattgg gagaagtgtg tttttcatgt 3240 gaacgaggtt gagtttttgg gtcacaaatt aactgaaaag gggatacatc cctcagattt 3300 gaaaaggaac gcaataatgt cctttcgacg tcctgaaaat gaagcagaag tgcgtagttt 3360 cttggggcta gcaaactacc tcaacaagtt cataccaaat ctagctacgc ttgatgagcc 3420 tttgcgtact ctgttgaaga agggaactaa gtttgaatgg aacgagagtc accaaaaatc 3480 attcgaaaac gtcaaaattg ccattgcaga agctcaagag cttgggtttt tcaataaact 3540 caaccaaact tcagtgatgg cggatgctag tccgtttggc ctgggggcta tattgatgca 3600 aaccgatcaa gctggcgaaa gtcgagtgat aagctgcgcg tcaaagtcat tgacagatac 3660 cgaaaagcga tactgtcaaa cagaaaagga agcgttagct attgtatgga gtgtagagcg 3720 attccaagca tatttatatg gcagtacgtt caatatactg acggattgta aggctctaga 3780 gttccttttt accgaccgtt cgagaccatg tgccaggatc gaacgttggg tcctacgtct 3840 tcaggctttt gattatcgcg ttgttcacat acctggtgaa aaaaatcttg cagatgtatt 3900 atcccgcctc gcatcattgc ctgctgtgcc attcgatcgt gatgacgaac ttatgataaa 3960 gaacatcgcc ttgtgttctg ctacaaccgc agcattgaaa tgggatgaca ttgtacgaga 4020 aacgaaaaag gatgctgaaa tccagcaggt tctagaatgt ctaggaaacg catcgatcca 4080 ggaattgccc ttacagtatc gggtcgtagc aaatgagctg tgccgctgtg aagacatttt 4140 actacgaact gatcgaattg ttattcctca atctttgcga gagaaagtgc ttcagattgc 4200 tcatgagggt cacattggaa ttcgaatgat gaaactgcat cttcgaagtt ctgtctggtg 4260 gcctaaaatg gacgcggcgg tggaatcgtt cgttaaaaag tgtcgtggat gtattttggt 4320 ttctgcaccg gagcccccag aaccaatggt acgcaaagaa actccaaatg ggccctggca 4380 ggagattgcc atagatttcc ttagaccttt gccggacgga caaacgttgc tggttgtagt 4440 tgactattac agtcgttata tcgaagtttg tgaaatgaac cagaccacta ccagagaaac 4500 aatcaatcaa ctgtcgacga tattcagccg attcggagtt ccactaacac tgcgagctga 4560 taatgggcca caatttaatg ccagttgcga ggagtttcaa gcattctgtg aggatttggg 4620 ggtgcagttg attaacacca ttccttattg gcctcaacag aacggagagg tcgagcgaca 4680 gaatcgatcg attctcaaaa ggttgaagat agcgcaacaa cttggacaag attggaggca 4740 aattttagca caatttctgt tatcttatca tgcaacacct catccaacaa ttggacgcgc 4800 accttcggag ttgatgttcg gcagaagaat tcgaacgaaa ctaccagaag taccaatttt 4860 cactcgagtt aacgaagaac atcgagatca tgataccctt cagaaagaaa aaggaaaaga 4920 atacgcagat gggaaacgaa aggcacgatt cagtgaaata tcagtaggcg atcgtgttct 4980 agtaaaaagg atgcgaaaag ataacaaatt gagttccaat ttcgctcccg aggaatacgt 5040 tgtaacaaga aaaattggag cagattgtac agtaaaagca cttgaaaatg ggaaagaata 5100 ccgccgtaat gtagcgcatt tgaaaaagtt gctggatgat attggtgatt cgtcgaaaac 5160 aggagagtct agcttgaaag aaactacaac agcgccatcg aactccggca atactgaagt 5220 atctacaaag acctcatcag cgatgacgga accggattcg caagaggaca tacagagaag 5280 tgttaaacga aaccataatg tacccgttaa gtttaaagat tatctatccc actaagtgtc 5340 tctaaaacat tgttattgta aaaggggggt 5370 // ID hATm-49_HM repbase; DNA; INV; 2839 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-49_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2839 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1943-1943 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 448..2571 FT /product="hATm-49_HM_1p" FT /translation="MISRSKASKNRHILFGEGRELPNFCLPTHKQVGKAYL FT SEKNYETEPLRSVCKRLAEKLSQIWIKASVPTISLRGIELQLEKYIGDVQK FT VIRSKSATVKQDEIEKDLDTLFDICSCKCKELSSCCCPMVLKVPAIEHDFL FT TDQRTIRVGRIAGIDKKESARAERRLKRKSVSLSKEETAVKCARVVEASDN FT ESDNETDNNSDTETYSPSVSDVDTEVTSTRLKNRCLKNIALQADGFGISDR FT AVAAIVSATLVDFGIEHMGPVDRNKIRRERKLLRTSCKNADCRITGLYFDG FT RKDQTLKLEGDRQTVISEEHISILSEPNSKYVDHCTPTSGSAKSIADSIID FT TVDCTNLVCIGSDSTNVNTGLNNGVIRRMEIELGRPLHWSICLLHLNELPL FT RHLFHSLDGATTGPHSFNGPIGKSISMSVQKPIVSFAIIVGEPIICDKQLL FT SSDQLYFFEIVNAVKTGVVPDRFNSRTPGRLNHARWLTLANRVLRLYISTE FT DPSNELILLAKFIVNSYAVVWFDIKKHSNCWNAAPHYYKMIETSRFLPVRY FT RIIVQQVLKRNSYPAHIESIILAMLKDNRQVIRKLALKRILRAREDDIPNR FT KFQLPEIRFQATSYEELIDWQTQPRLEPVLTKHIPTVQILAWLSSVEDIVI FT DIEPFPCHNQSVERVIKIVTESASKVYGHENRDGYIRVQLERRKLMPHFDT FT KSEFKNI*" XX SQ Sequence 2839 BP; 926 A; 487 C; 526 G; 900 T; 0 other; ttaggccgga tcataaaaat taattttttt tagaatttgt gaaaagtggc gtgatatagt 60 gttgtttaat atttacaaaa aaaatatttc taaagaattg attatacttt gcatttttaa 120 gttttaaggt tttatatgct aagtatattt actttagtac atgcattaac tgaaacatgc 180 atttaaccct aatactagat ataatcccaa ggttatgaaa ttgaaccaaa cctataccta 240 cctaactaaa ttaaccctaa tgatatgatt aactccaacg ctattataac tttatattat 300 tatattatta ttattatatt attctatttt tatatgattg tttattaatt tctgctttta 360 tcaactgatt gcagaattta tccgtctcat cttgtatttt tagaaattaa ctgaaattct 420 gatatgttat aaaattgtaa aacaataatg atttcaagat ctaaagcttc aaagaatcgt 480 catatacttt ttggagaagg tcgtgaatta cccaattttt gcttgccaac ccacaagcaa 540 gtaggcaaag catacttgtc tgagaagaat tatgaaactg aaccactacg gtctgtttgt 600 aagagactag cagagaaact ttctcagata tggattaaag catcagttcc tactataagt 660 cttcgaggca tagaattaca gctggagaag tacataggcg atgttcagaa agtgattcga 720 tcaaagtctg caacagtcaa acaagatgaa atagaaaaag atcttgacac attgtttgac 780 atttgttcgt gtaaatgcaa agaactttcg tcttgttgct gtccaatggt tctgaaggtg 840 ccagcaattg agcatgactt cctaactgat caacgcacta tacgtgttgg cagaatagct 900 ggaatcgaca agaaggaatc cgcacgtgct gagaggcgtt taaaacgaaa atctgtttca 960 ttatcaaagg aagagactgc agttaagtgt gccagagtcg ttgaagcatc tgataatgaa 1020 tcagataatg aaacagacaa caattctgac accgaaactt attccccttc cgtatcagat 1080 gtcgataccg aagtcacatc tactcgtctg aaaaaccgtt gtttaaaaaa tattgctcta 1140 caagcagatg gttttggcat atcagataga gcagttgccg cgattgtatc agcaactctg 1200 gttgattttg gtatagaaca tatggggcca gtggatcgga ataaaattcg cagagaacga 1260 aaactactga gaacatcctg taaaaatgct gattgtagaa ttactggatt gtattttgat 1320 ggaaggaaag atcagacttt aaaacttgag ggggataggc agactgtgat ttctgaggaa 1380 catatttcca ttttatcaga gccaaattca aagtatgttg atcattgtac gccaacaagt 1440 ggatcagcca aatccattgc agattcaata attgatactg ttgactgcac gaaccttgtt 1500 tgtattggtt ctgattcaac taacgttaat actggtttaa acaatggagt tatacgcaga 1560 atggaaattg aacttggtcg accactgcac tggtccattt gtcttttaca tctgaacgag 1620 cttcctttgc gacatctttt tcactcctta gatggtgcaa caactggacc ccattcattt 1680 aatggaccaa ttggcaagtc tatttcaatg tctgttcaga aacccatagt cagctttgcc 1740 ataattgttg gtgaacctat tatatgcgat aagcagttgt taagcagtga tcaactgtac 1800 tttttcgaaa ttgttaatgc ggtgaagaca ggtgttgtac cagatagatt taattctaga 1860 acgcctgggc gactaaacca tgcaaggtgg ttgacactag caaaccgtgt tctacgcctc 1920 tatatatcta ctgaagatcc atcaaatgaa ctaatacttt tggctaaatt tattgtgaat 1980 tcatatgcgg ttgtttggtt tgacattaag aaacattcaa actgctggaa tgctgctcca 2040 cactattata aaatgataga aacatcacga tttcttcctg tgcgttatcg aataattgtt 2100 cagcaagttt tgaagcgcaa tagctacccg gctcatatcg agagcattat tctggctatg 2160 ctcaaagaca accgtcaagt gatccgtaag cttgcgttga agagaattct tcgtgctcgt 2220 gaagatgaca taccaaatcg aaaatttcaa ctccctgaga tacgatttca agcgactagt 2280 tatgaagaac tgattgactg gcaaacacag ccccgcttgg aacctgttct gacaaaacat 2340 attccaactg tacaaatatt ggcatggctt tcgtcagttg aagacatagt tattgacatt 2400 gagccatttc catgtcacaa tcaaagtgtt gaacgtgtga ttaaaattgt gactgaatca 2460 gcatccaaag tttatggtca tgaaaatcgt gatggctata ttcgcgttca actggagcgc 2520 cgcaagttga tgcctcactt tgacacaaaa tctgaattta agaacattta agtatagttt 2580 ctaatcgacg tttgctaatg atatattgtt ttgactgttc aaataatttg aattttatta 2640 tacattcaat caataaataa tgtaacaaat gagcgttttt aagttaattt atttgaaaac 2700 ttcaaaggta aaaatgcaaa gtatagcaaa gtagtagcga aaaacggttt gcagcattgc 2760 attcagcgta tttgagagcc aattactgga aattcccgaa gaaattttcc aaattttttt 2820 aagttatgat ccggcctaa 2839 // ID Gypsy-9_AC-LTR repbase; DNA; INV; 221 BP. XX AC AASC02062576; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_AC_; KW Gypsy-9_AC-I; Gypsy-9_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02062576; Positions 1801 2021. XX SQ Sequence 221 BP; 59 A; 47 C; 48 G; 67 T; 0 other; tgtggtataa tcaattagcc aacctggcat tgtattagac atctacaact ggttgtacta 60 ctcgaataaa tctggctggt tgttggcctg ctgaacacaa tgatggctgc acgtttaaat 120 ctgttccttt gtacagaccg ttctttgtaa cttattgtaa gtgacttaag ttaccccgag 180 tggcctaagt agccccggct gaaggtaggt aaaacaccac a 221 // ID Gypsy-36_OD-I repbase; DNA; INV; 10136 BP. XX AC CABV01002103; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_OD_; KW Gypsy-36_OD-LTR; Gypsy-36_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-10136 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002103; Positions 3312 13447. XX CC Positions [4704-5189] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 485..1870 FT /product="Gypsy-36_OD-I_1p" FT /translation="MELLRRKIELIYNKRHRKDDISNFKLLDTEEEFDESQ FT VIEVREEEPVSVKKEIKQTEPEKFESAIDKTMRSVRTGLKSIIGPTSSENS FT SEDDEYETIEEPEESEIEGDGENTEDTEYDSVNTIKNDTTGRSSILKDIRI FT QRFTANDDMDYLQSWFNANSFQLSLQSSKKVDDQKIVGHLLSAIDNRGIKE FT SLISLLKDYSDKGIPITLKRFKSAIDKISRKSSETLIRDIDSIIFNSNKDT FT PRGLYKEIEMKTAQLIPDKTGARSYVDMMFKKKIAPNNVVFNQLNSDVTGE FT KLIKAAEDFLSLTKIDQVNNTMKGYNNNRGRPQQGQKSFHNKGPNSYGRPR FT MYEKQPGWQNHQDNKSFHQDKTTQKQNYERKGNYERRGGYSSRGRYNNGRQ FT NFYQPRQNSQNENQGRKDYQDKRSQGNSWSQGQDKTARIQNQGGCFQCGGP FT HLKRDCPQSNFKEGNRK" FT CDS 1875..5729 FT /product="Gypsy-36_OD-I_2p" FT /translation="MNSASQAQGLPVIDVDIKFCNKCVRISALLDTGSSLN FT LIKRKYVEATENISRCERFIIKGGDGKVLDRISEKINPNIIFPSLDIEVPE FT CKLFLTNANINFECILGYPLIKLLDINFSKITMQMNPLEINEIKIIQEHDT FT FMCSTNRPKLENEITSKNLVIIGPGESFSIHVNEAALEEVMIDRSEDMKIC FT QVQFDVLNGHSKILHGINEGKHTVIIEKDTILGEKNKNHRILTSLNTLIMA FT NQLSPPELEIHNEEYKEWKQKRNILTKNNNLIKVIQQKMLEADVEVNTRII FT MTNIAQKYDKVFARSDNDVGFNKNYVMKFKFKENIENKKPESRKAYRFPNN FT EEKDIMRFLDELVETGVLENSNSSWTSPSMIIKKKNLKYRLVTNFRLWLNP FT LLSVPQWPIRPIRSILHSIGEHIRDLKNAQPGSPIKFITVDLKSGYFTLNI FT APEDRKFTAFPTPKKLLQYKKVSQGLSIAPAVFSQFIAEIYESPKRFHNAF FT RIETYLDDCILVATEKTMPLALEWLLKTTYDNDIILDIQKCSFGTNNVEFL FT GFNLNEIGLRPIKSKYEALEKLQIPRSQRELQQFLGAVQFYVRVIPRAIQT FT LKPLTMEISNKTFKGSSQVEEAVIKLREILKSSKELFHPLPSHDENYILAV FT DSSFTGYGSLFGLAKIREDQAEDIKVIAYGSGHFDKRLQLDSSRNRELCGL FT EKSLTYFKDLIDITKPLYAFVDHKSLATLENNEIAKKEPVTRVRKALATIM FT TMPNLIIKYVSAKSDLIRAVDGISRTISETQEVGQTYVTNWAESVLPQKLN FT NIQVLRNKKSEHPSVISFNENDIMKAQKSDGYCSDLLRKSDEKILWDVGTS FT TFEKKDGIMYRFNKAGNSEVILPAKLAYETLVIYHYSNAHCDKRLLAKLIS FT RNNCYVHKLQKIIKDIKSDCFICQMRKRRVPQKSHRKNIPSIRPMQEMCID FT IIVLKKNSLTPYCFTITDKFSEIFDALPIQDKTAYSVRKAMMICIFKYNLG FT IGTKVLSDNGSEFRSEIFRNLLKTHHITHSFITPLNASSNPVERIHKNFRK FT HLLTTQKLSEEELMTDYEHLQLAIASYNNKPKSNLNWVSPTQILTNITPHN FT ILSFGYNTDVENNYTNLNEDEQAEARKIWSRTLMNYHMEIGNDKYIKYIQD FT QGTISKDEVKESDVVVVQDYSPLGHGRKTTGPYFVQRNQRGKCKLEDLLTG FT QILERNEKLLRKIILKEDVRLKLRKTSLMRDGYGIPIPIKDVTEIKTDDEE FT RRRRRRRRRRRKTAWTAHSKRDNE" FT CDS 6060..8492 FT /product="Gypsy-36_OD-I_3p" FT /translation="MFWIAYLIPICQGAFLNNYTNTVVSLENKSLVLNIDN FT KPIEINFHINFPVVEHSTTSKCFRDPYTYKSINVLIANSIIDRLSGIQDFF FT TCSDFKQCEELDDRHNELQTKTLDRFNKLFQNTPALIESKEHYTRPVNTRP FT KLPFVRLQDGTVVKVKRSSENPLISQAYIFNKDTVGDIIESVLSNLDSQNK FT NYARNAIKTNINYVPFQSDMENSTHALAFSNWIKNTNGENFKLELKGKCIG FT RNFYWVFIKLKFLVEEIGFRSPINRQTLEWLDSEQGHTWLKENLTYNIGVF FT HLTTEEFAHKENEGIKYYRQECTLRVYEYISISCPMPHEPTEKQEYFLGAR FT VELSQEACNMFRDFLNQTGLPTFSVTAHYPFYLADRHRPRRQAAIAASAAF FT GAVSSIIFQRFFRGVENDDDKTIKLLSEKVNEIISESENIRTNTKIELDRI FT EGRFCETEKLVITNTVNTMVNENVDKFMNQVENFMTKKALDLVTTFSHYCE FT KINSDERLITPNDCTNIYDENRIDFLGIYKSNNQAHKIILKYILKVPLAEK FT MDYQQIRINPIPIPKNLTKNSRTFGVTSLANWNFIDKNLAEIISARECYEK FT VGLTICDYATIDNNIKQICAKHLALGKTAPIECYSPLTSRDSCLTQVINEG FT ILLSTSQDTSILMTRSDEITTKTLEFKPGQYFLPASELINQEILIKCNNKI FT VTRLFYTKNESENVNITSRAINLNIKFNSFESKNGSENEISFWTLPGVKQI FT IRNQEHKNGFLLVEKILTTIILIIIALTLIRNTAKCLLKICRTFTGLAPKK FT KKSRTGKNY" XX SQ Sequence 10136 BP; 4157 A; 1822 C; 1792 G; 2365 T; 0 other; tattgaggac ccttgtatcg tggtgactga agcaaagtgg aacgccttaa agagaattgg 60 aagatgatgt cttcaacgaa actcgaggga ccaccgcaaa catcttcgtc tcagacgttt 120 ctagaagtga taaaaataga aatcggaaca attcctatga gacatgcaca actgctagta 180 agttcattga agacaaaaca aagcgaaaag gggtccacat gatacaatca tcaaagcttt 240 tgacaaacga ttctcaaaaa cggacattca acaaaaatat gctgttctgg ttgtattctt 300 tgtggaagaa ccacaactag aagaacaaga aaacgaactt tactgtgtaa cagctattcc 360 aacaatagaa tggcacaaaa agaacaaagc aaatcctatt tatgagtgga acgaaataat 420 cgaattcgca aaaaacaaac cacaatcagc tcagatttat gatcaacaaa agtcaagctg 480 cagtatggaa cttctacgtc gaaaaatcga gcttatctac aataaaaggc acaggaaaga 540 tgatatttca aacttcaaac tgctagatac agaagaagaa tttgacgagt cacaagttat 600 agaagtccga gaggaggaac ccgtaagtgt taaaaaggaa ataaaacaaa ccgagccaga 660 aaagtttgaa tcagcgatag acaaaacaat gagaagtgtc aggactgggt taaaatcaat 720 aattggaccg acttctagcg aaaattcatc agaggatgat gagtatgaga caatagagga 780 accagaagaa tcggagattg aaggagatgg ggaaaacaca gaggatacgg agtacgactc 840 agttaacacc atcaaaaatg acacgacagg aagaagttca attttgaaag acatcagaat 900 tcaaagattc acagcgaatg acgatatgga ttacttacag tcgtggttta atgcaaactc 960 attccaacta agtctgcaaa gttccaaaaa agttgatgat caaaagatag taggacactt 1020 attatcagca atagataata gaggcatcaa agaaagtctc atttccttat tgaaagacta 1080 ctcagacaag ggaatcccca taactcttaa aagatttaaa agtgctattg acaaaatttc 1140 gaggaaaagt tcagaaactc tcataaggga catagactca atcattttta acagtaataa 1200 agacaccccc agaggactgt acaaagaaat tgaaatgaaa acggcacaac tgattccaga 1260 taaaacagga gcacggtcgt acgttgacat gatgttcaag aagaaaattg caccgaataa 1320 cgtcgttttc aaccaactga actcagacgt cacgggtgaa aaacttatta aggcagctga 1380 agatttctta tcgctaacaa aaatcgacca agttaacaat acaatgaagg gttataacaa 1440 taacagaggg cgaccacagc agggacagaa atcctttcac aacaagggac cgaatagtta 1500 cggaagacca agaatgtatg agaagcaacc tgggtggcaa aaccatcaag acaataagtc 1560 tttccaccaa gataagacca cacagaaaca aaattacgaa aggaagggaa attacgaaag 1620 aagaggaggc tactcgtcaa gagggcgcta caacaatgga agacagaact tctatcaacc 1680 aagacaaaat agtcagaacg aaaaccaagg aagaaaagat tatcaagaca agagaagtca 1740 agggaactcg tggagtcagg gtcaagataa aacagcaaga attcaaaatc aaggtggatg 1800 tttccagtgc ggaggaccac acttaaagag ggactgccca caaagtaact tcaaggaagg 1860 aaacagaaag taaaatgaat agtgcatccc aagcacaagg cttacctgta attgatgtag 1920 atataaaatt ttgtaacaaa tgtgtaagaa tatcagcctt actggatacc ggtagctcgc 1980 ttaatctaat caaacgcaaa tatgtagagg caaccgaaaa tatttcgaga tgcgaaagat 2040 ttataatcaa aggaggagat ggaaaagtct tagacagaat ctccgaaaaa attaatccaa 2100 acataatttt tccttcactt gatatcgagg ttccagaatg caaattattt ctaacaaacg 2160 ctaacataaa ttttgaatgt attttaggat acccacttat aaaattacta gatataaatt 2220 tctcaaaaat aacaatgcaa atgaatccat tagaaattaa tgaaattaaa attatacaag 2280 aacatgatac ctttatgtgt tcaaccaaca gacccaagtt ggaaaatgaa ataacaagca 2340 agaaccttgt tataattgga cccggtgaat cattctcgat ccatgtaaat gaagcagcat 2400 tggaagaagt tatgatagac agaagtgaag acatgaaaat atgtcaagtc caatttgatg 2460 tgctaaatgg acactcgaaa atactacatg ggataaacga gggaaagcat acggttataa 2520 tagaaaaaga cactattctt ggggagaaga ataaaaatca cagaatttta acatctctaa 2580 acacactgat catggcaaat caattatcgc ctccagaact ggagattcac aacgaagaat 2640 ataaggaatg gaaacaaaaa cgaaatattt tgacgaagaa taataattta ataaaagtaa 2700 tacagcaaaa gatgctagag gcagatgtag aagtaaatac tagaataata atgacaaaca 2760 tcgcccaaaa atatgataag gtattcgcaa ggagcgacaa cgacgtcgga tttaacaaaa 2820 actacgtgat gaaattcaaa ttcaaggaaa acatagaaaa caaaaaacct gaatcaagaa 2880 aagcttatag atttccaaac aacgaagaaa aagatataat gcgattcctg gatgagttgg 2940 tggaaacagg agttttggaa aacagtaata gttcatggac aagtccatcc atgatcataa 3000 aaaagaaaaa cttaaaatat cgactcgtta caaatttccg actgtggctt aacccactac 3060 taagtgttcc gcaatggcct ataagaccaa ttagatcaat actccatagt ataggcgaac 3120 acatacgaga tctaaagaac gcccagccag ggtcaccgat aaaatttata actgtagact 3180 taaaatcagg atactttaca ctaaatattg caccggaaga cagaaaattc actgccttcc 3240 caacacccaa aaaacttttg caatataaaa aagtctccca aggattaagt atagctcctg 3300 cagtattttc acaatttatt gcagaaatct acgaaagtcc aaaaagattt cacaacgcgt 3360 ttaggataga aacttatctc gatgattgca ttttagtcgc aaccgagaaa actatgcctc 3420 tcgcgcttga atggcttcta aaaacaactt atgataacga tataatactt gatatccaaa 3480 aatgtagttt tggaactaat aacgtggaat ttctaggttt taacttaaac gaaataggat 3540 taagaccaat taaatcaaaa tacgaggctc tcgagaaact ccaaattcca aggagccaga 3600 gagaactcca acaatttctt ggagcagtcc aattttatgt aagagttata cctcgagcca 3660 tacaaacgct caaaccactg accatggaaa tttcaaacaa gacattcaaa ggatcctcac 3720 aagttgaaga agctgtcata aagttaaggg aaatattgaa atcgtcaaaa gaactttttc 3780 atccactacc aagtcatgat gaaaattaca tacttgccgt cgatagtagc ttcacaggtt 3840 atggaagctt atttggactt gcgaaaattc gcgaggatca agcagaggac ataaaagtaa 3900 tcgcgtacgg ttcaggacat tttgataaaa gactacagtt agactcaagt agaaacaggg 3960 aattatgtgg tctagaaaaa tcgcttacgt atttcaaaga ccttatagat ataacgaaac 4020 cactttatgc ctttgttgac cacaaaagtc tagcaacact tgaaaataac gaaattgcga 4080 aaaaggagcc agtaactaga gttaggaaag cactagccac aatcatgact atgccaaact 4140 tgatcattaa atacgtttcg gctaaatcag atctaatacg tgcagttgat ggaatctcaa 4200 gaacaattag tgaaacgcaa gaagtaggac aaacttacgt cacgaattgg gcggaatctg 4260 tgctaccaca aaagctaaac aacatccaag tactgcgtaa caaaaaatca gagcacccca 4320 gcgttatatc atttaacgaa aatgatatca tgaaagcaca aaaaagtgat ggctactgta 4380 gtgacttgct taggaaatcg gatgaaaaaa ttctgtggga tgtaggtaca agcacatttg 4440 aaaagaaaga tggaatcatg tacagattca ataaagcggg taactcagaa gttattttac 4500 cggccaagtt agcctatgaa acgcttgtga tttaccacta ttcgaatgca cattgtgaca 4560 aaaggttgct tgcgaaatta atatcaagaa acaattgcta tgtacataaa ttacaaaaaa 4620 ttataaagga tatcaaaagc gactgtttta tttgtcaaat gcgtaaacga cgggtaccac 4680 aaaaatctca caggaaaaac ataccgagca tccgaccaat gcaagaaatg tgcatcgata 4740 taattgtact aaagaaaaac tcacttacac catactgctt cacaataact gataaatttt 4800 ctgaaatttt tgacgctcta cccatacagg ataaaacagc ttactccgtt aggaaagcaa 4860 tgatgatatg tattttcaaa tacaacctcg ggataggaac aaaagtactt tccgataatg 4920 gatctgaatt tcgaagtgag atatttagga atttactaaa aactcaccac attacacatt 4980 catttatcac accactcaat gcttccagca atccagttga gaggatacac aaaaacttca 5040 gaaaacattt actaactact caaaaactgt ccgaagaaga acttatgaca gattatgagc 5100 acctgcaact cgcaattgca tcttataaca acaaaccaaa atcaaacctg aattgggtca 5160 gtccaaccca aattctgaca aacataacac ctcataatat cctatcattt ggatacaata 5220 ctgatgtgga aaacaactac acgaatctaa acgaggacga acaggcggaa gcaagaaaga 5280 tctggagtag aacactaatg aattaccaca tggaaattgg caacgacaaa tacataaagt 5340 acatacagga tcaaggaacg atcagcaaag atgaagtaaa agaatctgat gtggtggtcg 5400 tacaagacta cagtccactt ggccacggac gaaaaaccac aggaccatac ttcgtacaac 5460 gtaatcaaag aggaaaatgc aaactggaag atttgcttac tggacaaata cttgaaagaa 5520 atgaaaaatt acttagaaaa ataatattaa aagaagatgt cagacttaaa ctaagaaaaa 5580 cgagtttaat gagagacgga tacggaattc caataccaat taaagatgta acagaaataa 5640 aaacagatga tgaagaaaga agaagaagaa gaagaagaag acgaagaaga aaaacagctt 5700 ggacagcaca ttccaaaaga gacaatgaat aaagatgaag atttaggact tacaaaatca 5760 tccaagcaca gagttcctga gaatggaata agaccacctg aaatctactc agaagcggac 5820 caagaaaccg cagaggataa acgaaataat gagacgaaaa agaaatcaga agattcagaa 5880 gaaccgaacc gtataaccac agagactaag aggaccagtg aaactcaaag cgacaacaat 5940 gatctattga aaataatcga agtgataaag aagaagggaa ataccagacc acaaccctca 6000 aggtcacagc caagacgaag caagagaacg aagaaaattg taagttacaa aatgtagaga 6060 tgttttggat tgcttattta attcctattt gccagggagc tttccttaac aattatacta 6120 acacagtagt gtcgttagaa aacaaaagct tagtcttaaa tattgacaat aaaccgatag 6180 aaataaattt tcacattaac tttcccgtag tagaacactc tacaacatca aagtgttttc 6240 gtgacccata cacatacaag tcaataaatg tcctgatagc gaactccatt atagacagac 6300 tatcgggaat acaggatttt ttcacttgtt cggacttcaa acaatgtgaa gaactcgatg 6360 acagacacaa cgagttacag actaaaacac ttgacagatt caacaagcta tttcaaaata 6420 caccagcact aatagaatca aaagaacatt acacaagacc agtgaacact agaccaaaat 6480 taccatttgt gaggctacaa gatggtacag ttgtaaaggt taaacgatcg tccgaaaatc 6540 cactaatatc tcaggcttat atttttaata aagatacagt aggagacata attgaaagcg 6600 ttttatcaaa ccttgacagc cagaacaaaa attacgcaag aaatgccata aaaaccaata 6660 taaactacgt tccttttcaa tctgatatgg agaactccac gcacgcactc gctttctcaa 6720 attggatcaa aaatacaaac ggggaaaatt ttaaattaga actaaaggga aagtgcatag 6780 gaagaaactt ctactgggtt ttcataaaac taaaattttt ggtcgaagaa ataggtttta 6840 ggtcacccat aaatagacag actcttgaat ggctagacag tgaacaaggt catacgtggt 6900 taaaagagaa tctcacatac aacataggag tttttcatct taccactgaa gagtttgcac 6960 ataaggaaaa tgagggaata aagtattacc gtcaggaatg taccctacga gtatatgagt 7020 acatctcaat atcctgtcca atgccccatg aaccaaccga gaaacaagaa tattttctag 7080 gagcacgagt agaactttca caagaagctt gtaacatgtt tcgcgacttc ttaaatcaaa 7140 ccggattacc gacatttagc gttacggcac actacccatt ttacctagcc gataggcata 7200 gacctcgccg acaagcagct attgccgcgt ctgcagcatt cggagcagtt agttccataa 7260 tttttcaacg tttcttcaga ggggtagaaa atgatgatga caaaaccatt aaactactct 7320 cagagaaagt taatgaaatt attagcgaat cggaaaatat taggacaaat actaaaatcg 7380 agctcgatag aatcgaagga agattctgtg agacggaaaa attggtaata actaatacag 7440 tcaacactat ggttaatgaa aacgtagaca aattcatgaa ccaagttgag aattttatga 7500 caaaaaaagc acttgactta gtaaccactt tctcccacta ttgtgaaaaa attaatagtg 7560 acgagagact aatcactcct aacgactgta ctaacattta cgacgaaaat cgaatcgact 7620 ttttaggaat ttacaagagt aacaaccaag cacataaaat tattcttaaa tatatcctaa 7680 aagtaccgct ggcggaaaaa atggactacc agcaaataag aataaatcct attcctattc 7740 ctaaaaattt aacaaaaaat agtcgtactt tcggtgtaac atcactcgca aactggaatt 7800 ttattgacaa aaatctcgcg gaaatcatct ccgcaagaga atgctatgaa aaagtaggac 7860 taacaatttg tgattatgca acaattgata acaacatcaa acaaatttgc gcaaaacatc 7920 ttgcactagg aaaaacagca ccgatcgaat gctactcacc tttaacttcg cgcgattcgt 7980 gtctaacaca agtcataaat gaaggaatac ttttgagcac aagtcaagac acgtcaattc 8040 taatgacgag atctgatgaa atcacgacaa aaacattgga attcaaacca ggtcaatact 8100 ttttacctgc aagtgaatta atcaatcaag aaattctaat aaaatgtaat aacaaaatag 8160 tgacaaggct attttacacg aaaaacgaat cggaaaacgt taacataact tcaagagcga 8220 taaatctaaa tataaagttc aactcttttg agtcaaaaaa cggaagtgaa aatgaaatct 8280 ctttttggac attgccagga gtcaaacaga ttatccgaaa tcaggaacac aaaaatggat 8340 ttttattggt agaaaaaata ttaacaacaa ttattttaat cataatagcg cttacactaa 8400 ttagaaatac ggcaaaatgt ttactaaaaa tctgtagaac attcacaggt ttagcgccaa 8460 aaaagaaaaa atcgagaacc ggaaaaaatt attaaactgt acattttaaa aaattacttc 8520 caattactgt aaataagatt attttaaaag aaaaagatcc caaaaaatca agggcaaaaa 8580 agatggtgag aaaacaccct ataataaatt aaatcctgta caaagaaaat tacgacccta 8640 cgaaaacgaa aagcaacgaa aagactcgcc aaaatcagag agaaaaatca aaatgagcga 8700 attttccgaa accgaagcat ggacctacac gaaactaggg ccctacttcc caacttggaa 8760 aataaacaaa ggagaaatcg aacttaacga gaaagaggat ttcagaatct ggatccaatg 8820 gacaacaaaa ttttcagatg gagcgaccca ttcagcagaa gagcaacaaa atttgacggg 8880 ggtaagaaag gaaagctacg actccaatct ggtttttcca tggccatcga caaattcacc 8940 ggaagatgtt agaaaacgaa tgctactgga aaaagctgga tacgaaaaat tcggagataa 9000 caagatgtgg attataactg accattttag tcgtcatatt tataaaaatt aatttcactt 9060 tacaaaaatt ctcatgtatt atactgttaa ccttatcaaa tttcgtaact aaaacactat 9120 cattctaaat aactcgcata ccgtaaagaa agtgcttacc aattcaaacc accgccgtga 9180 aaaatcagaa cccggaacga cacaacgtca atgaagagca tagaccatga agtattttga 9240 atagacaaca gcttagaaga actcgatctc acaactctcg atctcagacc aagaaaaaaa 9300 atgatgaaga gaaacacaaa tcgcagctat aaacccaatt cctggaacga caggaagaac 9360 gtatcctgga acgacgaatg gaataatgga agttcctggg agagtaaatc cgattacgaa 9420 gctttcgacg aaggaacacc aaaatacgga gcgataaagc ctggagaaac gattcagaga 9480 ctaatttctg atcagtttat gataacgata acagcagtca agggcaccta cgtcagtgac 9540 gtaaaaatcg acgcgataaa aatgcgattc agtcatgatg aaaacaaatt tataagaaaa 9600 acgattactg aaaaagacga actaatcgag actctcaagg cgaaactgaa ggaagccgaa 9660 gaaaagttaa ctcagtaggt caaatacaag gcctagtcaa gttttcaaat aaaagtttca 9720 actaaaaaat tccttacgaa atttttatga attcagtttt tacgtaattt tctattattt 9780 aatcataatc agcttgaact tttagcacaa cttacataca tttctttaaa aactctgtca 9840 gtaacattca gtcaaatgcg tgaaccttat tttctttctt atcgtttcaa caatgtgtat 9900 aaaatacttt tttcaaaaaa tgtttaaaaa aaaaaaaata ttcgcttaca aaaaaagaca 9960 aaaaaaaaaa aaatttttta taaaattgaa aacaagaaaa aaatagataa aacatcaaaa 10020 aattaaaaca aaaaattaat aacctgcata actgtacatt tttcaatgtg ttttcacaac 10080 cggatacgac aacaaccaaa aatccaagag ggaggaagga tagaggatcc acaacg 10136 // ID Homo7 repbase; DNA; INV; 2743 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 01-APR-2011 (Rel. 16.03, Last updated, Version 2) XX DE Homo7 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; transposase; HOBO; KW Homo7. XX NM Homo7. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2743 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-76 (2009)DOI 10.1007/s10709-008-9259-5. XX RN [2] RP 1-2743 RA Jurka J.; RT "Removed LTR from the consensus sequence."; RL Direct Submission to Repbase Update (01-APR-2011). XX DR [1] (Consensus) XX CC Original consensus included a Gypsy-type LTR, 834 bp long, CC starting at position 537. Removed from the current consensus. XX FH Key Location/Qualifiers FT CDS 736..2313 FT /product="Homo7_1p" FT /translation="MSGVWDYFKKCADGKTAKCTKCGQICQTSGNTTNLSA FT HLIRKHPNLSTIEESKAAGPISTLLQKKYDASSARKKTLDSALTTYITSDR FT RPYCVVEDKGFRHLVEVLDPRYQLPSRRTLRDVCIPNLFIEMKQKLREILD FT KIEFCAVTTDGWTSKANENYLSVSCHFITEEFEMRTAVLSTTKLKEETNHS FT AQNIANSLRDVFNEWGVAQKITAIVTDNASNMKSACEILSKSNVPCVAHTI FT NLIVQECLANENLKPLLAKCKQIVGFFKSSTIAYAKFREAQNKENPLSLKQ FT ECPTRWNSAYHMVKRFLETKDAICKVLLNTPKALSPLSADEIVILEDLEKL FT LSPFEHATKSTSSSSLVTVSMVIPVCCGLLHNLESLKGNLATEEGERACTI FT LIYGTRNRLLKYESRSNTRISTLLDPRFKKDGFLSPFNVSEAEKFLESELS FT MFNALSEPHPSSSNLPAPKPAVQPHPLFDFLKKKKSEKIRSIRADSIIGLK FT QYFNTENVEPSISPLDYWKVKIHNIRVNHF" XX SQ Sequence 2743 BP; 920 A; 475 C; 546 G; 802 T; 0 other; tagtgctggg aaaaagatcg atctagcaaa cgatatatcg atgtttcata tccatcgaaa 60 atatcgatgt agaaaacttc gatacttgat atatcgccat cactatacca ttaattcccc 120 atcacttgaa ttaaaagcgt gggataagac tgtcaggtaa tttgcaaatt attttaacaa 180 tataattcaa atactaacgt aaacccactt ttcatttagg attttggtgt tggtagtgtt 240 gtagtgtaag gcggcctttt ttgcccacct tacatgtttg taggtaataa gtaaagatat 300 cgaatgaaat agagttttgt tatggtgaat gggatttgaa taaggatttg aggcatagtt 360 tgattttgta gtgattaagg gaaatgaaat ggattttgtt gcagattttg atttggtaga 420 tagtaaggtt gtattttgaa gtgtattatt agaatgagag ttagtttgta agagtggatt 480 tcgatataga ttttggttta cattttgaag tcatttataa gttgatagga gttagtgtag 540 agaaatggat aagtttttaa agagcaagga caatggcgaa ggtaaatatt attttactgc 600 ttatgtcagc cacacacgtg gtctgcagct tttgcgtttt aaaacaatgt aacaaacacg 660 catttgtttg tagattgcga cgcggataag gaaaatggag aggaggagat accacgtaag 720 aggaccaaat atggtatgtc tggggtatgg gattatttta aaaagtgtgc ggatggcaag 780 actgccaaat gtacaaaatg tggccaaata tgccaaacaa gtggcaatac cacaaatttg 840 tctgctcatt taataagaaa acatcccaat ttgagcacca ttgaagagtc gaaggctgca 900 ggccccatct caacgcttct tcaaaaaaaa tatgatgcct catcagccag gaaaaaaaca 960 ctggacagtg cgcttacaac gtacattacg tctgatcgtc gcccatactg tgtagtggag 1020 gacaaggggt ttcggcatct cgtcgaagtt cttgatcctc gctaccagct tccatcaaga 1080 agaacattgc gcgatgtctg cataccaaat ctctttatag aaatgaagca gaaactgcgc 1140 gaaattcttg ataagattga gttctgcgct gttactactg acggctggac ttcaaaagcg 1200 aatgaaaact atttatcagt tagttgtcat ttcattactg aagagtttga gatgcggaca 1260 gcggttctgt caactacaaa acttaaagaa gaaacaaatc attccgcaca aaacattgcc 1320 aattctctac gcgatgtatt caatgagtgg ggagttgcac aaaaaataac ggcaatcgta 1380 actgataacg catcgaacat gaaaagcgct tgcgaaattc tttcaaaaag taatgttcca 1440 tgcgtagcgc atacaattaa tttgattgta caagaatgtt tggcaaatga aaatttaaag 1500 cctttattgg caaaatgtaa acaaattgtg ggttttttta agagcagtac catagcttat 1560 gcgaaattca gggaggctca aaataaggaa aacccactca gcctaaagca agaatgccca 1620 acaaggtgga acagtgccta tcacatggta aaaagatttt tggaaaccaa agatgccata 1680 tgtaaggtgt tactcaacac tccaaaggca ttgtcgccct tatcagcgga tgaaattgtc 1740 attttggaag accttgaaaa gcttttgtct ccctttgagc acgcgactaa gtccacatca 1800 tcgagcagct tggtcaccgt gtcgatggtt attccagtat gctgtggcct tcttcataac 1860 cttgaaagcc tcaaaggcaa cctagctact gaggagggtg agcgtgcttg tacgatatta 1920 atatatggca caagaaaccg actacttaag tacgagtcgc ggtcaaatac aagaatctcc 1980 actctcctgg acccccgatt caagaaagat ggatttttat caccttttaa tgtcagcgaa 2040 gcagagaaat ttttagaaag tgaactctct atgttcaacg cattgtccga gccacatcca 2100 tcttcctcta acctgccagc accaaaacca gcagtccaac cccacccact attcgacttt 2160 ctaaaaaaga aaaaatctga aaaaattaga agcattcgag cagattcaat tataggattg 2220 aaacaatatt ttaacacgga aaatgttgaa ccaagtattt ccccattgga ctactggaag 2280 gtaaaaatac ataacataag agtaaatcat ttttaattaa aacaatttct tcacagatct 2340 caactgacaa cgcatttaaa agatgcgtga aaaagttttt gtgcgtccca gcaacgtcca 2400 cagaaagtga acgaatgttc agcaaagctg gacacatagt aaatgagaaa aggagttgtc 2460 tcaagccgca gcaagtagat atgctgttct ttataaacaa aaatgattgg acgaagtaaa 2520 tgatcttatt aaataatata atctttgtta catagtttag tttagtttat atgaaacgtt 2580 ttttaatttt aataagaaat tcatcttttg ttttttatat gaaaactttt tttttaattt 2640 tgggagttaa aatatgttta tgaaaatatc gatacatcga tctgagaccc atacataata 2700 tcgaaatatc gatctacgta agttcgatct tttcccagca cta 2743 // ID CR1-68_AAe repbase; DNA; INV; 4742 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-68_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4742 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1156-1156 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 310..1167 FT /product="CR1-68_AAe_1p" FT /translation="MACERCQKKISTKKERVVCHGYCGASFHAVCVNVDGP FT LQEQLAQNGKNVFWMCDGCAGLFTNAHFRVMMTNFDGKASVLPEAFQSMRL FT QIEQLQSAVDALTTKVEEKSSTPTPFATPNLWPNRDRLNTPVNSTKRRRGQ FT DGHMLGVPTVVGNVGTKAAGVIKTVQLNQRDDDNLLWIYLSAFHPLTSEGQ FT IASLVSECLDLTSTSTKVVKLVPKGKDVNSLQFVSFKVGIAKQLKEKALSS FT DSWPENIQFREFDDLRSKNSRRIVSLLSIEDKPLTMETAESNSAT" FT CDS 1074..4673 FT /product="CR1-68_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="SSLKKQSTNSQFAVNRGQTIDNGNRRIELRNLKSTSS FT SNQLGRTQGSTLEAPSDPVTVEPCSTVNINQVSCHHSRPGPVNGGLEGVFQ FT PALPGKYSCIFSRPIPEDSSNSSFNTKHPAEEHEPHANQPKAPLNSISCPS FT SSLDQPGRTQGGTLEAPSDPVTVEPCSTVNINQVSRHRSRPGPVNGGLEGV FT FQPAIPGKYACPVELSTAEDFSNFSSNSGTSSSAYRSNFSQTVSTSKYNTS FT RPIAPNPLTIFYQNVRGLRTKTTDLRLALSSCDYDVIALTETWLGSDILNS FT EFSSDYAIYRLDRNPATSQLQRGGGVLIGVKKHLHCKLVTLDNAENLEQVA FT VCISLPTQAVYICCVYIRPNSDTAVYSDHALCVQQLCELAETTDSVIALGD FT YNLPRLLWEFDADMNSYIPVNASSEQEITLTQAVLSCGLQQIQDVRNENER FT LLDLAFVTDTDRIEIFESPCGILKLDAHHKPFIIKLDYSSNPDDIIQDRCY FT YDFNSCNEDSIHARLRSIDWETFLSDCDVDDAVSRFYSTVYDIIHEYTPIR FT RRRHRIHFKQPWWTPELRNLRNRLRKVRKHFVKNRTEENRQSLRAAEEQYS FT VRLHTAFHAYILRNESNAKRDPSSFWTFVKKLKRNAGIPHDVCYRDENAAT FT VEETADLFAKFFKHVFSNTAPSHSPDEHREWQSSLMHYDFPVPQLNVTFEE FT IKKVLSSVDPSKGPGPDNLPPKFVKQFAASLASPITIIFNRSLLNGVFPAV FT WKIASITPIHKTGNIHNVENYRGISILSCIPKALEKIVYDVVYQAVRPIIS FT EYQHGFVKKRSTTTNLMVFVNSLVTSVEKRRQVDAVYVDFSKAFDKVPHLL FT AVDKMKCLGLPDWTTQWLRSYLTDRTAFVNINGINSSSFEIPSGVPQGSHL FT GPLIFILFVNDMCNLLSSFKLMFADDLKIYRVIESQLDCCALQEDIDRLLR FT WCTLNGMQMNIQKCSVISFNRLREPLRFDYTIGQTNLHRVYAVKDLGITMD FT SKVRFNEHIAAITAKAFATLGFLKRNTNAFQDIYALKSLYCALVRSLLEYG FT VVVWAPFHAVQVNRIEAIQRNFIKYALRILPWNDTVNLPPYEERCRLISLD FT SLATRRKLLQRMFVFGLLNGDVDCNSLLGQLNFSIPQRPLRNYSLLWQPAH FT RTNYGYNEPWAMCCRLFNETCDVFDFNLSKSAYKHRIKYLA" XX SQ Sequence 4742 BP; 1324 A; 1117 C; 990 G; 1311 T; 0 other; ccactcacgc tctgagaata cagattgttt gtttacctct gtgttgaatt agttgaattt 60 taacgttgtt tacgaaatat taaatatttg ttttaagttg attgaaaccg tattgtgagc 120 agtgtataat ttttgtgcca aagctatcgt ctgtaaccgt tatttgtcca gaatagcact 180 atcaaaaaag gaacattcag tgttgttaga tagcaatacc gtcactcagt gccaatcaat 240 tgtcccatac ttcagttcat cgtgcactgc attcgtcttt cgtactgccg atttgcacct 300 cttctcatca tggcttgtga gcgctgccag aaaaaaattt caaccaaaaa ggaacgtgtc 360 gtttgccatg gctattgtgg cgcgtcgttc cacgcagttt gtgtgaacgt tgacggacct 420 ctccaagagc aattggcgca aaatggaaaa aatgtgtttt ggatgtgcga tggatgcgct 480 ggcttattca caaatgctca cttccgtgtg atgatgacca actttgatgg gaaagcttca 540 gtcttgccag aagcatttca atctatgcga ttgcagattg aacagttgca atctgctgtg 600 gacgcattaa cgacaaaagt tgaggaaaaa tcgagcacgc ccacaccctt tgctactccg 660 aatttgtggc ccaatagaga tcgcctcaac acccctgtga attcaacgaa acgtcgtaga 720 ggccaagacg gtcatatgtt gggcgttcca acggttgtag ggaatgtagg caccaaggca 780 gctggcgtta tcaaaactgt gcaattgaat cagcgagatg acgataattt gctttggata 840 tatctctcgg cttttcatcc attgacctct gaagggcaga ttgcctcgct cgtcagtgaa 900 tgtttagatt tgaccagtac atcgacaaaa gtagtgaagc tagttcccaa aggaaaagat 960 gtcaattcac tacaatttgt ttccttcaaa gttggcatcg ctaaacagtt gaaggagaag 1020 gccctttcga gtgattcatg gcctgaaaat atccagtttc gggaatttga tgatcttcgc 1080 tcaaaaaaca gtcgacgaat agtcagtttg ctgtcaatag aggacaaacc attgacaatg 1140 gaaaccgcag aatcgaactc cgcaacttga aatcaaccag ctcttcgaat caactgggac 1200 gcacgcaagg aagcactttg gaagcccctt ctgaccccgt aacagtcgag ccatgttcca 1260 ccgtcaacat caaccaagtt tcctgccatc atagtcgtcc cggtcctgtt aacggaggct 1320 tggaaggggt cttccagcct gcacttccag gcaagtactc atgtatattc agtcgaccga 1380 ttccagaaga ttcttcaaat tctagcttca acaccaaaca tcccgccgaa gagcacgaac 1440 cacatgcgaa ccaaccaaaa gcacctctaa acagcatttc ctgcccatcg tcatcattgg 1500 atcaaccagg acgcacgcaa ggaggcactt tggaagcccc ttctgacccc gtaacagtcg 1560 agccatgttc caccgtcaac atcaaccaag tttcccgcca ccgcagtcgt cctggtcctg 1620 ttaacggagg tttggaaggg gtcttccagc ctgcaattcc aggcaagtat gcgtgtcctg 1680 tagaactttc caccgctgaa gacttttcga attttagctc gaactccggc acatcatcca 1740 gcgcttatcg atcgaacttc agccaaacag tctccacgtc aaagtacaac acctcccgcc 1800 cgattgcgcc aaatcctctt acgatattct accaaaatgt ccgtgggttg agaaccaaga 1860 ctaccgatct tcgtctagcg ctctcgtcct gtgattatga tgtcatcgcg ttaactgaaa 1920 cgtggctcgg tagtgacata ctaaactccg agttttcttc cgattatgcc atataccgct 1980 tagatcgaaa tccagccact tctcaactgc aacgcggagg cggggtgctg atcggcgtca 2040 agaagcatct gcactgtaaa ttagttacat tggacaatgc cgagaatttg gagcaagttg 2100 cggtgtgtat atcgttgcca acgcaggccg tctacatatg ttgtgtatac ataaggccga 2160 attctgacac tgctgtttat tcggatcacg cattatgcgt gcaacaactg tgtgagcttg 2220 ctgaaactac cgactcagtc attgctttgg gagactacaa cctacctagg cttctgtggg 2280 aattcgatgc tgacatgaat tcttacattc cggtaaatgc ctcctctgaa caagaaatca 2340 ccttaacaca ggcggttctt tcatgtggat tacagcaaat tcaggatgta cgcaacgaaa 2400 acgagagact acttgatcta gctttcgtaa ctgatactga cagaatcgaa attttcgaat 2460 ctccctgtgg aattttaaaa cttgacgctc atcataaacc ttttattatc aaacttgact 2520 acagcagcaa tcctgacgac atcatacaag atcgctgtta ttatgatttt aactcttgca 2580 acgaagattc aatccatgct cgattaagat cgattgattg ggaaacgttt ttgtcggact 2640 gtgacgttga cgacgcagtt tctcggtttt atagcacagt ctacgatatt atccacgaat 2700 acactcccat tcgaagacgc agacatcgca tccacttcaa gcagccttgg tggactcctg 2760 aactccgcaa cctccgaaac agattacgca aagttaggaa acactttgtc aagaatcgta 2820 cagaagaaaa taggcaatca cttcgtgccg ccgaagaaca atactccgtt cgtctgcata 2880 ctgctttcca tgcttatatt ttgcgtaacg aatcgaatgc aaagcgggat ccatcttcct 2940 tttggacttt tgtcaaaaag ctaaaacgta atgccggcat acctcacgac gtgtgttata 3000 gagatgaaaa tgcagctacc gtggaagaga ccgcggatct cttcgcaaag tttttcaaac 3060 acgtattttc aaataccgct ccctctcatt ctccggacga acaccgcgaa tggcagagca 3120 gtttgatgca ttacgatttt ccagtaccgc agttaaatgt aacttttgag gagataaaga 3180 aagtcctctc atccgttgat ccctctaaag gacctggccc cgacaactta ccacctaaat 3240 ttgttaagca gtttgctgca tcgctcgcct cacctatcac tataattttc aatcgctcac 3300 tgctaaatgg tgtctttccg gctgtatgga aaatcgcttc catcacgcct atccacaaga 3360 ctgggaacat ccacaatgtc gaaaattatc gcggaatttc gattctgagc tgtattccga 3420 aagctttgga aaagatcgta tacgatgttg tgtaccaggc agttcggcca attatttcgg 3480 aatatcaaca cggctttgtg aagaaacgtt ctacaaccac caacctgatg gtttttgtca 3540 actcgctggt tacttcagtt gaaaaaagac gccaagttga tgcagtatat gttgatttca 3600 gcaaagcttt tgataaagta ccgcacttgc tagcagtgga taaaatgaag tgtttgggtc 3660 tacctgattg gacaacgcaa tggctgagat cttatttaac cgaccgcacc gcttttgtca 3720 acattaacgg catcaattcg tctagcttcg aaataccatc gggggtccct caaggaagcc 3780 acttaggacc cttgattttc atcttgtttg tgaatgatat gtgtaacctt ctgtcttcgt 3840 ttaagttaat gtttgccgac gacttgaaaa tttatcgagt gatagagtct caacttgatt 3900 gttgtgctct ccaggaggat attgatcgtc ttttgcggtg gtgtacattg aatggtatgc 3960 agatgaatat tcaaaaatgt tctgttatat ctttcaaccg tctgcgcgag cctctgcggt 4020 tcgactacac gataggacaa accaacttgc accgtgtata tgcagttaaa gacttgggca 4080 tcactatgga ctccaaagtc cggtttaatg agcatatagc cgcgatcact gctaaagcat 4140 tcgccacgct gggatttttg aaaaggaaca cgaacgcgtt ccaggatatc tatgcgctca 4200 aatctctgta ctgtgccctt gttcgcagcc ttctggaata cggagttgta gtttgggctc 4260 ctttccacgc tgtgcaggtg aacagaatag aagcaatcca acggaatttc atcaagtacg 4320 ctctaagaat tcttccctgg aatgataccg tgaacttacc gccctatgaa gaacgatgta 4380 ggctgatcag cttggactca ctagcaacta ggagaaaact attgcaacga atgtttgtat 4440 ttggcttact gaatggtgat gtagattgca actccctttt gggacaactc aatttctcca 4500 ttccccaaag accccttcgc aactacagtc ttctgtggca accagctcat cgcaccaact 4560 atggatacaa cgagccatgg gccatgtgct gccgtttgtt caacgaaaca tgtgacgtct 4620 ttgatttcaa tttgagtaag tctgcgtata agcatagaat taaatatcta gcttaagtaa 4680 atatctgtgc aacactagtt gaagatgtaa gaaataaata aataaataaa taaatataaa 4740 ta 4742 // ID BEL-33_AA-I repbase; DNA; INV; 5810 BP. XX AC supercont1.344; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-33_AA_; KW BEL-33_AA-LTR; BEL-33_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5810 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.344; Positions 1156201 1162010. XX CC Positions [4867-5415] - Integrase core CC 'CCCCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(10..3213,3217..5565) FT /product="BEL-33_AA-I_1p" FT /translation="MNIHRHNRNSTEVSSYNCAVCSGPDHDDVAMVGCDNC FT SQWFHFKCVGVSADVKEVSWSCINCKEKAAHQSYGRTAEDSGQKKPAAAQT FT KASEADVKDVSVEVDELEKELQRLEETRKLELRKMELEKTVFRRRLEVQRE FT LVEQRQKVEREKRQMELELEKEHLQKAIADEEAYRKAQQAMRNEMQSKLEQ FT LRIQRTSDNVQAICGPERSKIAGSNAANGQPKNYCETYQKHSTPKGTGIIP FT EMSGTRNESPPIRLNEENTTSESEGDDVSETTQVSVYKGPTKAQLSARQFL FT VRKLPVFSGHPEDWPMFISSYETANEACGFSNVENLARLQESLKGQALEAV FT RSRLLLPNAVPQIIETLRMLYGRPEQLLNMLLAKVRKADPPKADRLASFIG FT YGIVVQQLVDHLDATNMRDHLFNPMLIQELVEKLPAGTKMEWVRYKRKVSA FT VTLRTLADFLSEIVRDASEATMFAEGASVPDNRSKKGKVREGYLNAHRMSP FT DEEEEQRNSMNSKKPCRVCNRVDHRIRNCDQFRKLSWENRWEVVYKWKLCK FT NCLNEHGDSRCKLNYRCNVERCGEAHHPLLHPEEALSDCNVHGVQHHSVIF FT RLIPVTIYNGKIAVNTVAFIDEGSSYTLVETSLVNQLNLKGRTQPLRVTWT FT AGVSRVEKDSQIVDLSISARNSSKRFTIHKAHTVQELKLPEQTLGFKIMAN FT KYQHLQSLPVVDYNHAVPQILIGLKDLHLYAPLETRMGKPGEPIAVKSKLG FT WTVYGPISAAAPERNLIGHHLCEGISNQEIHDLIKNQYAMEESGISIALLP FT ESADDQRARMILENTTVKVGERYETGLLWRTDERCLPDSYPMALKRMKSLE FT KRLLKDDAMYGKVRTLIADYVKKGYAHKATVRELQESRPEQVWYLPLNVVV FT NPKKPEKIRLVWDAAATVEGISLNSQLLKGPDMLSSLPSVISRFREREVGF FT GGDIREMYHQLLIRKDDKQYQRFLYRNDAESSVDVYVMDVATFGATCSPCS FT AQFVKNKNANEAAEKYPDAARAIVDNHYVDDYFDSTDTVEEAIQRSKEVRY FT VHSCAGFEIRNVANCDEILQSLGEPKVEQQVHFNQDKDTGTERVLCIVWSA FT DRDEFSFSTKLRDELMPFLSGERRPTKRIVLSCVMSLFDPLGLLAPFTVYG FT KILIQDLWRSGTEWDEIIDIECLEKWKNWIRRLPEVEEIRVPRYHFTKGSE FT VELKTLQLHVFVDASQYAYGCAAYFRVLMAGQPVCSLVMAKSKVAPLKQLT FT IPRLELQAAVLGARLMNTVIDTHTLAITECFIWSDSRTVLSWIHSDQRKYK FT QFVSFRIGEIHSLTKLNQWRWVPTKCNVADEVTKWQAGRKFESKSAWFNGP FT SFLYLPESELPEQTLVIANVLEEMKTVHLFHDVSILEPLVNPERISRRKVL FT VRSTACVLRFISNCRLKAKGQKIETILASNNIRQLVQKKFPSITVPLKREE FT HQKAESYLWKIAQNDAFGEDMKILQKNQHLPCEKQFSLEKGSYLYDKSPFL FT DDNGVIRMEGRTKYGEFIPFELRFPIILPRRHPITSKLLEHYHQKLGHANT FT ETVVNELRQRFCIPYLRADLKRITQMCVRCKLNKSKPVTPRMAPLPVQRFT FT PYKRPFSYTGVDYFGPVTVTVGRRSEKRWIALFTCLTTRAVHLEVAHSLSA FT QSCIMTIRRFICRRGSPIEFFSDNGTNFKSASKEILRNIDVECGEIFTDAT FT TRWNFIPPSAPHMGGAWERLVRSVKAALSELDDGKKLTDEILLTTIIEAED FT LVNSRPLTYLPLDSGSETALTPNQFLRGMVTAVQEQPIGQIDDAEALRDNY FT KRSQRLADLLWKRWLSEYLPTLNQNKVAC" XX SQ Sequence 5810 BP; 1781 A; 1139 C; 1472 G; 1418 T; 0 other; atcttaaaaa tgaacataca ccgacacaac cgcaactcga ccgaagtatc tagctacaac 60 tgtgcagttt gtagtggtcc ggatcacgat gatgtggcca tggtgggatg cgacaattgc 120 tcgcaatggt ttcacttcaa gtgcgtggga gtttctgctg acgtaaaaga agtttcatgg 180 agttgtatta actgcaagga aaaagctgcg catcagtctt atgggcgtac ggcagaagac 240 agtggccaga agaagccagc tgctgcccaa acaaaggcat cagaggcaga tgttaaggac 300 gtatcagtgg aagtggatga gcttgaaaag gagttgcaac ggctggaaga aacacggaaa 360 ttagagttga ggaaaatgga gctggagaag actgtattcc gtcggcggct ggaagttcaa 420 cgagagttgg ttgagcagag acaaaaggtg gaacgtgaga agaggcagat ggagcttgaa 480 ctcgaaaagg agcatttaca aaaggcgatt gctgatgagg aggcctatcg taaagcacag 540 caagcaatgc ggaatgaaat gcagagtaaa ttggaacagc tgcgtattca acgcacctcg 600 gataatgttc aagctatatg cgggcctgag agaagcaaaa ttgctggatc aaacgctgca 660 aatggacaac cgaagaacta ctgtgaaacg tatcaaaaac attcgacacc aaagggtaca 720 ggaattattc ctgaaatgag tggcaccaga aacgagagcc ccccaataag gcttaatgaa 780 gaaaacacta cctcagaatc ggaaggagat gatgtttccg aaacaactca agtatcagta 840 tataaaggtc caaccaaagc tcaactttcg gctagacagt tcctagtgcg taagctaccg 900 gttttcagtg ggcatccgga agattggccg atgtttattt cgagctacga aacggcaaac 960 gaagcgtgtg gtttctccaa tgtggaaaat ctggctcggt tacaagagag tctgaaggga 1020 caagcgctgg aagcagtgcg cagccgtctg cttttgccga atgccgttcc tcaaatcatt 1080 gagacactgc gtatgttata tggtagaccc gagcagttac taaacatgct gctagcaaag 1140 gttcgcaagg cggatccacc aaaagctgat cgattggcgt cgttcattgg gtacggaata 1200 gttgttcagc agttagttga ccacctggat gctacgaaca tgagggacca tctgtttaat 1260 cctatgctga ttcaggagtt agtggaaaag ctacccgctg gtacgaaaat ggaatgggtc 1320 cgctacaaga ggaaggtgag cgcagtaact ctgcgaacct tagctgattt tctatctgaa 1380 atcgtcagag atgccagtga agctacaatg tttgctgaag gagcatccgt acctgataat 1440 cgatcgaaga aaggaaaagt gagagaaggg tacttaaatg cacaccgcat gtcaccagat 1500 gaagaagagg aacaacgtaa ttcgatgaat agcaagaagc cttgccgcgt gtgcaatcgt 1560 gtagaccatc gcattaggaa ttgtgaccaa tttcgtaaac tgagctggga aaatcgctgg 1620 gaagttgtct acaagtggaa actgtgcaaa aattgcctaa atgagcacgg tgatagtaga 1680 tgcaagctca actaccgctg taacgtagaa cgttgtggag aagctcatca tcctttacta 1740 catccagaag aggctttatc agattgcaac gtacatggcg ttcagcatca ttccgtaatt 1800 ttccgcctga ttcccgttac tatatacaat gggaaaattg cggtaaacac cgttgcattt 1860 atagacgaag gctcttcgta tacgttagta gaaacttcgc ttgtaaatca attgaatctc 1920 aaaggcagga cacaacccct gcgtgttact tggacggctg gtgtatcgag agtagaaaaa 1980 gattcgcaga ttgtagattt gtcgatttct gcgagaaatt catcgaaacg tttcaccata 2040 cataaggcac atactgtgca agaacttaaa ttgccagaac aaacgctggg atttaagatt 2100 atggcgaaca aatatcaaca tctccagagt ctacctgtgg tagattacaa tcacgctgtt 2160 ccgcaaatat tgattggatt aaaagatctt catctttacg cccctctgga aactaggatg 2220 ggaaaacctg gtgagccaat agcggtcaag tcgaagttag ggtggacggt ttatggacca 2280 atttctgctg ctgctcctga gagaaacctt attggtcatc atttgtgcga aggaatttcc 2340 aaccaagaga tacacgacct cataaagaat caatacgcta tggaagaatc gggtatttca 2400 attgctttgc tgccggagtc agccgacgat cagagggcca ggatgattct ggaaaataca 2460 acggtgaaag ttggcgagcg ttacgagaca ggattactat ggaggaccga tgaacgatgt 2520 ttgccggaca gctacccaat ggcgttgaaa cgaatgaaaa gcctagagaa acgtttgctc 2580 aaggatgatg ctatgtatgg aaaggtgcgt acgttaattg ctgattatgt gaagaaaggc 2640 tatgctcata aggcaacggt tcgagagctt caggagtcta gaccggaaca agtttggtat 2700 cttcctctga acgtcgtagt gaatcccaag aagccagaga agatccgttt ggtgtgggat 2760 gctgcggcca cagttgaagg aatctcatta aactcccagt tacttaaggg gcccgacatg 2820 ttatcttcac ttccttcagt aattagccgg tttcgagaaa gagaagtggg attcggtgga 2880 gatatcagag aaatgtatca ccaattgctc attaggaagg atgataagca gtaccaacgt 2940 tttttgtacc ggaacgatgc tgagtcaagc gtggatgttt acgtgatgga cgtagcaacc 3000 tttggggcga cgtgttctcc gtgttccgct caatttgtta agaacaaaaa cgcgaatgaa 3060 gcggctgaga aatatccaga tgcagcaaga gcaattgttg ataatcatta cgttgatgat 3120 tatttcgaca gtactgatac tgtggaagaa gcgatacaac gatccaaaga ggtacgatat 3180 gtgcattcat gtgcagggtt tgagattcga aactgagtag cgaattgcga cgagattcta 3240 caatcgctgg gggagcctaa agtggagcaa caagtacact tcaaccagga taaagatact 3300 ggaactgagc gtgttctgtg tatagtatgg agcgcagata gggacgagtt ttctttttcg 3360 accaagttgc gtgatgagtt gatgcctttc ttgagtggcg aacgaaggcc aacgaaacgt 3420 atcgtgctaa gctgtgtaat gagtctattt gacccgttgg gactcttggc accatttaca 3480 gtttacggaa agatactgat acaggatttg tggcgaagcg gaaccgaatg ggacgagatt 3540 attgatatcg agtgtctcga aaaatggaaa aattggatta gaagattacc agaagttgaa 3600 gaaatcagag tgcctagata ccatttcaca aaaggatcag aagtcgaatt aaaaactctt 3660 caattgcatg tttttgtcga cgccagtcaa tatgcctatg gatgcgccgc ctatttccgg 3720 gtactgatgg cgggtcaacc agtatgttct ttagtaatgg ctaaatccaa agttgctccg 3780 ctaaagcaac taactattcc gcgtctggag cttcaagcag cggtgcttgg agcacggctg 3840 atgaatactg tgattgacac gcacactctt gccataacag agtgtttcat ttggtcggat 3900 tccaggacag tgttgtcctg gatacactcc gatcaacgaa aatacaaaca atttgtttcg 3960 tttcgcattg gagagattca cagccttacc aaactcaatc agtggcgatg ggtgcctacg 4020 aaatgcaacg tggcggatga ggttacaaaa tggcaagcag gacgtaaatt cgaatcaaaa 4080 tcggcatggt ttaacgggcc aagctttctg tatctgccgg agtccgagtt gccagaacag 4140 acattggtta tagcaaacgt actagaagaa atgaagacag tccatttatt ccacgatgtt 4200 tcaatactag aaccattagt gaaccctgag agaatctctc gcaggaaggt tctagtgcgt 4260 tctacggctt gcgttttgcg gtttatctca aattgtcgat taaaggcgaa ggggcagaaa 4320 atcgaaacga tactagcatc taacaacatt cggcagttag tacagaagaa atttccttct 4380 ataacagtgc cgttgaaacg tgaagaacat caaaaagctg aatcctattt atggaagatt 4440 gcgcaaaatg atgcttttgg agaagatatg aaaatattgc aaaagaatca acatcttccc 4500 tgcgaaaaac agttttcact ggaaaaaggt agttatctgt atgataaatc tccatttttg 4560 gatgacaacg gcgtgattag gatggaagga aggacgaagt atggagaatt cattcctttt 4620 gaactgagat ttcccataat tttaccgaga aggcatccaa taacatcaaa actactagaa 4680 cattatcatc agaagctagg ccacgcaaat accgaaactg tggtgaacga acttcgtcaa 4740 cggttttgca taccgtatct tcgtgcagat cttaagcgta taacgcagat gtgcgttcgg 4800 tgcaaactaa ataaaagcaa gccagtcaca ccaaggatgg cccctctccc agtacagcgg 4860 tttacacctt ataaaagacc ctttagttat acaggagtag actatttcgg tccagtaacc 4920 gtgacggttg gtagaagaag tgaaaaacgg tggatagcat tattcacatg tctcacaacg 4980 cgggctgtgc atctggaagt cgcccacagc ttgtcagccc agtcctgcat tatgacgata 5040 aggcgattca tatgtcgacg aggctctccc atcgagtttt tttccgacaa tggaacaaac 5100 ttcaaaagtg cgagcaagga aattcttcgg aatatcgatg ttgaatgcgg agaaatattt 5160 actgatgcta cgacccgatg gaattttatc ccaccctcgg ctcctcatat gggcggggcc 5220 tgggaacgct tagtacgatc agtaaaagcg gctttaagtg aactcgatga tgggaaaaag 5280 ttaacggacg aaattcttct tactactata atagaagcag aagacttagt gaattcacgt 5340 ccattgactt atcttcctct ggacagcggt tcagaaacag ccttgactcc aaatcagttt 5400 cttcgtggaa tggtgactgc ggtgcaggaa caaccgattg gacaaatcga cgatgcagaa 5460 gcgttgcgcg acaactacaa gaggtcacaa cgtctggcag atctactctg gaagcgatgg 5520 ctttcggagt atttgccaac attgaaccag aacaaagtgg catgctgaaa cagaaccgtt 5580 ggtggaaggt gatgtggtgt atgtaacgga cgaaaatcag cgcaaatctt ggattcgtgg 5640 catagtcatg caaggattta aaggaaagga tggtaggatt aggcaagcgc tagtaagaac 5700 aacaagaggt ttgctgaaac gaccggtaac taagttggcg gtgcttgaaa ttcagaatcg 5760 taaatctggc cataaggcgg agcctatgcc aatgttacgg ggcggggcta 5810 // ID Gypsy-25_CQ-I repbase; DNA; INV; 4888 BP. XX AC AAWU01010957; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_CQ_; KW Gypsy-25_CQ-LTR; Gypsy-25_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4888 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 429-429 (2011). XX DR GenBank; AAWU01010957; Positions 42650 47537. XX CC Positions [3897-4358] - Integrase core CC 'CACTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 822..4865 FT /product="Gypsy-25_CQ-I_1p" FT /translation="MEKFEIPAFRFKQMPPNEVRDEWARYKRQFQFLANAN FT SVTNKTKLKNIFLARAGVDVQDVFSTMPDADVEERAGVDPFKVAIDHLDSY FT FAPKHHDAYQRFLFWSMQPKDSDETLDKFLLRASDMANKCNFGATAQEAKE FT ISVIDKAIQLAPPDLREKLLQKEVLTMDDVSKTISAHQAIKFQVGQMSNNG FT EQPKPFREVNAVRNEAYGRTVCGRCGNYDHSYSDFSCPARPYPCGACGKRG FT HFAKMCRTQKRHNFRNSREQTSDQRKRKFNEQTSTQRKRPRTEYVRKLDTR FT SDEGKCTDFIFTVGDGEEFLWVSLGGVMVQMLIDSGSQKNIIDDTTWDKLR FT NQGVAVSDARTKSHVNFRAYGHTEPLTVKLVFDATIGVEDKEHRTSTKATF FT YVIAGGQQPLLGRTTAKELGVLALGLPSTQVPDVYRLQCEQKRQFPKMKGI FT KVNIEIDPTVTPVAQHARRPPLALLNKIEDKLDSLLASDIIEQVREYSPWV FT SPLVAVVKDNGEIRLCVDMRRANLAIKRETHLMPTFEDFLPRLKEARVFSR FT LDVKDAFHQVELDESCRHITTFISHKGMFRYKRLMFGICNASECYQKIIEQ FT ILAGCPNAVNYIDDILVFGRDEQEHDAALAKVMSIIKEKNILLNHLKCVFK FT VTEISFLGHHISGKGIRPAEDKISSLRAFCAPKTAEELRSFLGLVTYIGRF FT LPDLATATAPLRHLTHKGVQFLWEQEQEEAFRNLKSMICNLETLRYFDNTL FT RTRVIADASPVGLGAVLVQFADRRNDSEVRIIGYASKSLSPTERRYCQTEK FT EALALVWAVERFSVYLLGRHFELETDHKPLELIFSPSSNPCPRIERWVLRL FT QAYRFTVQYRKGSSNIADPFSRLCNVEGGDDFDPDSKFLVLVIQESAAIDT FT CELEEISQNDTELSAVRECIRNGNWNQPEARAYERFQGELGFVGDALLVRG FT TKLVIPRALRSRMLALGHEGHPGETVMKKRLRDRVWWPGMDKDVVKYVTSC FT EGCRLVGLPSKPEPMCRREMPARPWIDVAIDFLGPLPSGEYLLVIVDYFSR FT YKEIEIMKRITAEETTCRLHTIFTRLGYPVTITLDNARQFISTHFDEYCKL FT HGIQLNYSTPYWPQENGLVERQNRSLLKRLQISHATGRNWKEDLNDYLMMY FT YTSPHATTGKTPTELCYGRTIRSKIPSITDIESTPNMDDVRERDQLLKQKG FT KESEDRKRRAKESDLQVGDSVLMKNLLPGNKLTPTFDPSEYLIVNKEGPRV FT TLQNKVSNKIYERNVTHVKRIPPAEPEEPSHQELSPDDLSLARDAHAELMS FT SPPVEDSQTPNVYRSRRNVKPPAKYRDYVISRTESN" XX SQ Sequence 4888 BP; 1364 A; 1242 C; 1318 G; 964 T; 0 other; agtggcgact gtggatggga tccgaaacgg taacggtaag aatagtttca acacctttcg 60 agtggtgcaa ggacggaaag caaacaacga aggtcggggt gaagcagcac acattttcgg 120 aagggagaag cccaagcaaa aaaaaaaaaa aaaccagaaa caggagcttc gagctcgggg 180 aattgagcca cgagctcggg ggggaattga gcttcgagct cggggaaatt gagccacgag 240 ctcggggaat tgagcttcga gctcgggata ttgagcttcg agctcgggaa taatgagctt 300 cgagctcggg atattgagct tcgagctcgg gaataatgag cttcgagctc gggatattga 360 gcttcgagct cgggatattg agcttcgagc tcgggaataa tgagcttcga gctcgggata 420 ttgagcttcg agctcgggaa tatggagctt cgagctcggg atattgagct tcgagctcgg 480 gacattgagc ttcgagctcg ggaatattga gcttcgagct cgggacattg agcttcgagc 540 tcgggacatt gagctgcgag cttgaaatgg agcttcgagc tcagaaccca accgggaatt 600 ctacaacaaa ttttaacaac ggcagttgat aaactcgggg gcaataactc aaataataat 660 cttaatttcc cgtctcacac agacaacaat cggtgctcta ccaggacgaa acaagacagc 720 ttatctacca gcccagagtt gctaaagcaa ccaggtaaac actaagaatt acgggaagta 780 aacaagaata caattgagaa ttttgttcta gaacccccac tatggagaag ttcgaaattc 840 cggcattccg gttcaagcag atgcccccga acgaggttcg cgatgagtgg gctcggtata 900 aaagacagtt tcagttcctg gccaatgcca actccgtaac aaacaagacc aaattgaaga 960 atattttcct cgctcgggct ggagttgacg tgcaggatgt attcagtacc atgcccgacg 1020 cagacgtcga agaacgcgcg ggggtcgacc cgttcaaagt tgctatcgat cacctggact 1080 cgtatttcgc gccgaaacat cacgatgcat accaacgatt tctattctgg tccatgcagc 1140 cgaaggacag cgatgagact ttggacaaat tcttgttgcg cgcgtcagac atggccaaca 1200 agtgcaactt cggggccacc gcacaagaag caaaagaaat cagcgtgatc gacaaggcca 1260 tccagctagc gccaccagac cttcgtgaaa aactcctcca aaaggaagtc ctgacaatgg 1320 acgacgtgtc caaaaccatc agtgcacacc aagcgatcaa gttccaagtc ggtcagatgt 1380 ctaacaacgg agaacagccc aaaccgttcc gcgaagtgaa cgcagtccgg aatgaggctt 1440 acgggcggac tgtctgcggc agatgcggaa attacgacca cagctactcg gacttttcgt 1500 gtccagcgag accgtacccg tgcggcgctt gtggcaagcg cggacatttt gccaagatgt 1560 gtcgcacaca aaagcggcac aacttccgca actcccgcga gcagacatcg gaccaacgta 1620 agcgtaaatt taacgagcag acatcaacac agcgtaagcg accaaggacc gaatacgtac 1680 ggaagttgga cactcgatcg gatgaaggta aatgtacaga tttcatcttt acggttggcg 1740 atggcgaaga gttcctctgg gtgtcgctgg gaggtgtcat ggtgcaaatg ctaatcgact 1800 ccggaagcca aaagaacatc atcgatgata ctacctggga caagttgagg aaccagggag 1860 tcgcggtctc cgacgctcga acgaaatcgc acgtgaactt ccgggcttat ggtcacacag 1920 aaccacttac ggttaagctg gtgttcgacg ctacaatcgg agtagaggac aaagagcata 1980 gaaccagcac caaagcgact ttctacgtga ttgccggcgg gcagcagccg ctactggggc 2040 gcaccaccgc caaagagctt ggtgtgttag cgttgggttt gcccagtact caagtcccgg 2100 acgtctaccg actccaatgc gagcagaaac gacaattccc gaagatgaaa ggaatcaaag 2160 tgaacatcga aatcgatccc actgtaacac ctgtcgcaca acacgctcgc cggccgccgc 2220 tggccctgct taacaagatc gaagataagc tcgactcttt gttagcgtca gatatcatcg 2280 agcaagtacg cgagtacagc ccgtgggttt cgcccctcgt cgcagtcgtt aaggacaacg 2340 gtgaaatacg gctctgcgtg gatatgagaa gggcaaacct cgccatcaag cgggaaacac 2400 acctgatgcc gacgttcgag gatttcctgc cacggctcaa agaggctcgc gtgttcagcc 2460 ggttggatgt gaaggacgcc ttccatcagg tggagctcga tgaatcgtgc cggcacataa 2520 cgacgtttat atcacacaaa gggatgttcc gttacaagcg cttgatgttc ggcatctgca 2580 acgcctccga atgctaccag aaaatcatcg agcaaatcct ggcaggctgc cccaacgcgg 2640 taaactatat cgacgatata ctagtcttcg gccgggacga gcaggagcac gacgcggcgt 2700 tggccaaggt catgagcatc atcaaagaaa agaacatcct cctcaaccac cttaagtgcg 2760 tattcaaggt caccgaaatc agttttctcg gacaccatat ctcaggaaag ggaatcaggc 2820 cagcagaaga taaaatcagc tctctcagag cgttctgtgc accaaaaact gctgaagagc 2880 ttcggagttt tcttggtttg gtgacgtaca tcggaaggtt cctcccggat ctggctacag 2940 caacggcacc actgaggcac ttgacgcaca aaggagtaca atttctctgg gagcaggaac 3000 aggaagaagc gtttcgaaat ctcaagagca tgatctgcaa tctggaaacg ttgagatatt 3060 tcgacaacac actccgcacg cgcgtgatcg cagatgcttc ccctgtaggt ctgggcgcag 3120 tgcttgtgca gttcgcggat cgcagaaacg attcagaggt ccggatcatc gggtacgcca 3180 gcaaaagctt gagccccacg gagagacgat attgccagac ggaaaaagaa gctcttgcgc 3240 tggtctgggc ggtcgaacga ttctcggttt acttgttggg ccgacatttc gaactggaga 3300 cggaccacaa acccttggaa ctgatatttt ctccaagctc gaacccctgt ccgcggatcg 3360 aaaggtgggt gctacgttta caagcctacc ggttcacagt ccagtacagg aagggcagca 3420 gtaacatagc ggacccgttc tcgaggcttt gcaacgtaga aggtggagac gatttcgacc 3480 cggacagtaa gttccttgtt ttggtcatcc aagaatccgc cgccattgac acgtgcgaac 3540 tggaggagat ctcacaaaac gacacggaac tgtcggcggt ccgcgagtgt atccggaacg 3600 gcaactggaa ccaaccggaa gcgagggcgt acgaaaggtt ccagggggaa ctcggttttg 3660 tcggagacgc cctgctggtt cgtggaacta agctggtaat accgcgggcg ttgcgcagca 3720 ggatgcttgc cctagggcac gaaggacacc cgggcgagac cgtgatgaag aagcgcctga 3780 gagaccgcgt ctggtggcct ggaatggaca aggacgtcgt gaaatacgta acgagctgcg 3840 aaggctgcag attggtaggt ctgccgtcca aaccagagcc aatgtgtcgt cgagaaatgc 3900 ccgccaggcc gtggatcgat gtcgccatcg attttctagg ccctctccca tccggtgagt 3960 acctccttgt tatcgtcgac tattttagcc ggtacaagga gatcgagatc atgaaacgca 4020 tcacggcgga ggagaccaca tgccggctgc atacgatctt cacgcgtctc gggtacccgg 4080 tcacgatcac gctggacaac gcaaggcaat tcatcagtac gcatttcgac gaatactgca 4140 aacttcacgg gatccagctg aattactcga ccccgtactg gccgcaggag aatggtttgg 4200 tggaacgcca gaaccgttct ctcctcaagc gattgcagat cagccatgcc acgggacgga 4260 actggaaaga agatttgaac gattatttga tgatgtacta cacaagccct catgcgacga 4320 ccggaaaaac ccctacggaa ctatgctatg gcaggacgat caggtcgaag atcccctcta 4380 tcacagacat cgaatcgacc cccaacatgg atgatgttcg tgaaagagac cagcttctga 4440 aacaaaaggg aaaggagtcc gaagaccgga aacgacgcgc caaagaatct gacttgcagg 4500 ttggtgactc ggtgctgatg aaaaatttgc tgccaggtaa caagttgacc ccaactttcg 4560 atcccagcga gtacttgatc gttaacaaag agggacctcg agttacgttg caaaacaagg 4620 tctcaaacaa aatctacgag cgaaatgtga cacacgtgaa acgaatccca ccagcggaac 4680 cggaagaacc atcacaccag gaactatcgc cagacgacct aagtctcgct cgcgacgccc 4740 acgctgaact catgtcatcg ccgccggttg aagactctca aacgcctaac gtttacagga 4800 gccgccgaaa cgtgaagccg ccagctaagt acagggacta cgtcatttcg cgaacagaat 4860 caaattagta tctagagaaa aaacgaga 4888 // ID BEL-165_AA-I repbase; DNA; INV; 6891 BP. XX AC AAGE02018869; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-165_AA_; KW BEL-165_AA-LTR; BEL-165_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6891 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018869; Positions 135820 128930. XX CC 'TCTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 22..6891 FT /product="BEL-165_AA-I_1p" FT /translation="MMNSLSNVRQTRSQTRFQQANANLNPDDDDKASSKCS FT EDSFIPSRVDEDGGELCDCAGCNRPNNTERYMVQCGKCERWYHFTCANVNT FT ATVHSVSFVCAVCMPVVSEPAHSQISGISGTSSAHRSRVERELQRLEEERK FT LLEDLNRERIARERALNEKELQEKLEQGKQFIARKHELLNRKDGEEGNSVR FT SMRSSQRSAKRTEDWVRQTKAADEVIGEQIPASSSVNADLSVCASSGNLQV FT VHPSSTPLRIPPAISSPVGDGRVDVGLEAKSIPRTLGSITIGSEGSLEGAV FT GGDAEKDVQRKGLSALPFVDLQPYADLLKVEEIVPSASASVMPGIKRIGTH FT YKRWSVETGELRRQQRQTEAEHRAIQDLVVKHQLDIDTRRKREVELVNQIK FT SLQWRHDNELQLIRESEEGLRIQLNQRDREQAVLKVQVETLEKLIVEEKER FT SRATEVKLQELLMLRNKECDALQLQLAELESEIQCLRDSERLLRTQLEAAE FT QRENESHRLRNEAMKEYWDLHDEVQQFISGTRDGLGDSGCSPPLPPPPASW FT FESTNGPNIENSIRVDSNHAFPPPPPPLSVDSSSLNRQQEYVTQPISCGVG FT VMNEPVPIPPTSSMFPSVPTTHRQIPSPTLVPAQFGPTPQQIAARQVVTRE FT LPIFSGDPVDWPLFISSYQHSTQICGYTDSENLLRLQRSLKGSAKDSVSSF FT LLHPSTVPQVMSTLQQLYGRPEQIVNNMIAKVRATPPPKPDRLETLVSFGL FT IVQNLCGHLKAVGLERHLANPILLQELVDKLPATVKFSWALYQEQVPMVDL FT NVFSDYMAKVSSAASGVTQLASIPQKAVKEERSRPKDKSFVNAHVTSEQPK FT ANHREVTQKAGSGSNSGHLNEKGAKSCSVCNVESHQIENCSSFKRLDLDGR FT WKAVKANKLCARCLTSHARWPCKGEVCGINDCPKRHHRLLHFEPPVTSKTT FT SAVVSVHRQLSSSTLFRILPVTLFGAKGQFNTFAFLDDGSSVTLVERSIAE FT ALGASGEVETLNIEWTGGINKTIAGTEVVTMEISEAGVNKRYKLSEVYTVE FT NLGLPQQTTDYAELASRYTHLKKLPVKSFRCAVPGILIGQSNVHLLATLKL FT REGKLNEPIATKTRIGWAVCGSLRKPRANTIHRQLHVHAEPNTEDLHEYVS FT RFFDIESLGVALVPTVKGVEEQRAYSILDATTRRLNSENFETGLLWKHDYV FT EFPDSRPMAERRFRVLEKRLARDPQLYDSVRQQIADFKAKGYIHEATAEEI FT EGFDLRRTWFLPIGVVVNEKKPGKVRVIWDAAAKVDGVSLNSMLLKGPDLL FT TPLLSVMFPYRERQVAVSADIKEMFLQISIRPEDRSALQFPYRDFPELPMS FT TMVTDVAIFGAACSPAHSQFIKNLNASEQEAEFPRGAAAVKKRHYVDDYVN FT SHDTAEEAIEAAKEVIEVHKRAGFHIRNWMSSDKSVVEQLGEPNQKTAKAM FT LSEKDTGLERVLGMAWRQDEDVFTYTLQFGEKVRILMENTTIPTKREMLRL FT VMSIYDPLGIVASFVIHGKILIQEVWRSKIDWDSCISEEIATRWTDWTTVL FT KRMDGLRIPRCYFSGYDPASFKTLELHIFVDASEQAFAAVAYFRIVDQGQI FT RVALVSSKTKVAPLRGLSIPRLELMAALLGARLRRTVEENHTLKVQKTCFW FT SDSSTVGSWIRSETRRYRQFVAFRVDEILSLSKIDEWQWISTKINVADEAT FT KWGKGPSCNVDSRWFQGPEFLYSNSEDWSMTPDEDTDESEMELREAYVFSH FT HIRKPMIDTNRFSRFERMLRSVAYVHHFVDSLRYKKAHGSADFAGLTSAEL FT QKAERTLWSIAQSEAFPEEVATLKQNPNRSTGQQKQIEASSSIVKQSPFAD FT EYGVLRVGSRAAEAEVLTYDAKFPIILPRNHRITELLLDFYHRKYGHANDE FT TVVNEVRQKFHVPRLRVEIRLARKRCMWCRVYKSTPVCPKMGPLPAVRLQP FT DVRPFTFVGVDLFGPYLVKVGRSVAKRWVCLFTCLTVRAIHLEVVASLSTD FT ACKKAIRRFVARRGSPQEIFSDNGTNIIGASRELQDEFRRMATELGSTFTN FT AHTQWRFNPPAAPHMGGCWERMVRAVKAALGCVPVVRKLDDESFATVLAEA FT ESMVNSRPLTFIPLETANHESLTPNHFLLLSSNGVREPEKFPTDAGMALRS FT SWNTVKHTLDNFWRRWVVEYLPTIIRRTKWFRDVKAIEVGDLVLVVDENVR FT NRWIRGRVIQTMPGKDGVPRRAEVQVSGGTLKRPVTKLAVLDVIRSGDAGP FT DVKATRGGG" XX SQ Sequence 6891 BP; 1860 A; 1581 C; 1930 G; 1520 T; 0 other; atcttaagaa ttcaccggat catgatgaac tcgctgtcga atgtgcgaca aacccggtcg 60 caaacgaggt tccagcaggc gaatgctaac ctcaacccgg atgacgacga caaggcttcg 120 agtaagtgtt cggaagattc tttcatccca tccagagtgg atgaagacgg aggtgagttg 180 tgcgattgtg ctggctgcaa tcggccgaat aatacggagc gatacatggt tcagtgtggg 240 aaatgtgagc gctggtacca cttcacctgt gccaacgtga acacggccac ggtacactct 300 gttagcttcg tttgtgcggt atgtatgcct gtcgtgtccg aaccggcaca tagccagata 360 agcggtattt cgggtacgtc tagtgctcat agatcccgag tagaacgtga attgcagcgt 420 ctggaagaag agcggaagtt gctcgaggac ctgaatcgcg agcggattgc aagggaacga 480 gcgttgaacg agaaggaact ccaggagaaa ctggagcagg gtaagcagtt tatcgcacgc 540 aaacatgaac tgctaaatcg gaaagacggc gaagaaggaa acagtgtgcg gagtatgcgg 600 agcagccaga ggagtgcgaa gcgcaccgaa gattgggtac ggcagacgaa ggcagctgat 660 gaggtaatcg gtgaacagat acctgcgtca tcgtctgtga atgctgattt gtcggtatgt 720 gcgtcctcag gaaatctgca agtagtacac ccttcctcaa ctccgttgag aattccacct 780 gctatttctt cacctgtcgg tgatggcagg gttgatgtgg gattagaagc gaagtcgatc 840 cctcggacct tgggaagcat caccatcggc tctgaaggtt ctcttgaagg cgctgtcgga 900 ggagacgcgg aaaaggacgt tcagagaaag ggactgtcag ctttgccgtt cgtcgatctt 960 cagccctatg cagatctgct aaaggttgaa gaaatcgttc caagtgctag tgctagcgtc 1020 atgccgggta taaaacgtat cggtactcac tacaagcgat ggagtgtcga aacaggtgag 1080 ctccggaggc agcaacgaca aacggaagca gagcatcgag ccatccagga ccttgttgtt 1140 aaacaccagt tggacatcga cacaagacgg aaacgcgagg ttgagctagt gaaccaaatc 1200 aaaagtttgc agtggcggca tgataacgag ctgcagctga tacgtgaatc ggaagaaggt 1260 ttacgaatcc agcttaatca acgagaccgt gaacaggctg tgttgaaggt tcaggtggaa 1320 accctggaga aacttatcgt cgaggagaag gagcggagtc gtgcgacgga agtgaaactt 1380 caggagcttc tcatgctgcg caacaaggaa tgcgatgcgc tgcaactaca gctggcagag 1440 ctggagagtg aaatccagtg tctgcgtgac agcgaacggc tgctgagaac ccaactagaa 1500 gcagcagagc aacgcgaaaa tgaatcacac cgtctacgga acgaagcgat gaaggagtat 1560 tgggacctgc acgacgaagt tcaacagttt atcagcggta ctcgagatgg gctgggagat 1620 agcggttgtt ctcctccatt gcctcctccg cctgcgtctt ggtttgaatc tacgaatggg 1680 ccaaacattg aaaattctat tcgcgtcgat tcaaatcatg catttcctcc tccgcctcct 1740 ccattgtcgg tagattctag ttcgctcaat cgtcaacagg aatatgtgac tcaacctata 1800 tcgtgtggtg ttggagtgat gaacgaacct gtccccattc ctcctacttc gagtatgttc 1860 ccttccgtgc caacaaccca tcggcaaata ccctcgccga ctctggtgcc agctcagttt 1920 ggaccaacgc cccaacagat tgccgctaga caagtagtca ccagagagct tccgattttc 1980 tccggggatc cggtcgattg gccgctattt attagcagct accagcattc aacacagatt 2040 tgtgggtaca ctgattcaga gaaccttctg cgtttgcagc gaagcctgaa aggtagtgcg 2100 aaagattcgg taagcagttt tttgctccac ccgtcaacgg tgccccaagt catgtccact 2160 ctgcaacagt tgtacggccg tccagaacag atcgtcaaca atatgatcgc caaggtgcga 2220 gcaacacctc caccgaagcc cgaccggttg gagacgctgg tcagtttcgg gctgatagtt 2280 caaaatctgt gcggacactt gaaagctgtt gggctagaga ggcatctagc aaacccgatt 2340 cttctgcagg agctggtgga caagctacca gcaacggtaa aattcagctg ggcgctttac 2400 caagagcaag ttccaatggt ggatctgaac gtattcagcg actacatggc gaaggtgtct 2460 tcggcggcta gtggcgtaac gcagcttgca agtatcccac aaaaagctgt gaaagaagaa 2520 cgaagccgac caaaggacaa atcgtttgtg aacgcgcacg tgacttcaga gcagccgaaa 2580 gcaaatcacc gagaagtgac gcaaaaagcg ggatccggat ccaacagtgg acatctgaac 2640 gagaagggtg ccaaatcttg ttccgtatgc aacgtcgaaa gccaccaaat cgaaaactgc 2700 tcatcgttca aaaggctgga tttagacggt agatggaagg cagtgaaagc aaacaaactg 2760 tgcgctcgtt gtctcacgtc acatgctcgg tggccatgta aaggagaagt ttgcggaatc 2820 aatgactgcc caaagcgaca tcatcgttta cttcacttcg aacctccagt gacttctaag 2880 actactagtg cggtggtctc cgttcatcgt cagctgtcgt catccacgct cttccgcatc 2940 ctgccagtga ccttattcgg ggcaaaggga cagttcaaca ccttcgcatt tctagacgac 3000 ggatcgtcag taacactcgt cgagcgatcg atcgctgaag ctcttggtgc cagcggagaa 3060 gtcgagactc tgaacatcga gtggaccgga ggtattaaca agacaatcgc cggaacagag 3120 gtcgtcacga tggagatttc cgaggctggt gtcaacaaac ggtacaagtt gtccgaggtc 3180 tacacggtcg aaaatctcgg cttaccacag cagaccacag actatgctga gcttgctagt 3240 cgatacacac atctcaaaaa gcttccggtg aaaagcttca ggtgtgctgt accaggaatc 3300 ctaatcggac aaagcaacgt tcatcttctc gctacactga agttgcgtga agggaagttg 3360 aatgaaccga ttgcaacgaa gaccaggatt ggatgggcag tgtgtggcag tctacggaag 3420 ccacgagcta atacaataca cagacaacta catgtgcacg ccgaaccgaa cacagaggat 3480 ctccacgagt acgtcagtcg ctttttcgac atcgagagtt tgggtgtagc tcttgtgcct 3540 accgtgaaag gtgtggaaga acaacgtgcc tacagtattc tggatgcgac cacacgccgg 3600 ttgaacagcg agaactttga aacaggcctg ctctggaagc atgactatgt agaatttcca 3660 gacagtcggc cgatggcgga acgacgcttc agggtcctgg aaaagcgttt agctcgagac 3720 ccacagcttt acgacagtgt tcgtcagcaa atagctgact ttaaagccaa gggttacata 3780 cacgaagcaa cagcggaaga aattgaagga ttcgacctgc gtcgcacgtg gttcttaccg 3840 attggagtcg tcgtgaatga aaagaagcct gggaaagttc gagtgatctg ggatgcagct 3900 gcgaaagttg acggtgtatc attgaactcc atgttactca agggtccgga tcttctaact 3960 ccgttgctat ccgtgatgtt tccgtaccgg gagcgtcaag tcgcagtgtc cgcagacatc 4020 aaagagatgt ttctgcaaat ctcgatccgg ccggaagatc gtagcgcctt acagtttccc 4080 tacagggatt ttcctgaact gccaatgagt acgatggtta ccgacgtggc aatattcgga 4140 gcagcttgtt cccccgcaca ctcgcaattc atcaagaact taaacgcatc agaacaagaa 4200 gcagagtttc ccagaggtgc agcggcggta aagaaaaggc attacgtgga cgactacgtg 4260 aacagtcacg atacggcgga agaagctatc gaggcggcga aagaagtgat cgaggtgcac 4320 aagcgtgcgg gatttcatat ccgaaattgg atgtctagtg acaaaagcgt ggtcgaacag 4380 ctaggggaac cgaatcagaa aactgcaaaa gctatgctat cggaaaagga caccgggctt 4440 gaacgggtgt taggaatggc gtggagacaa gatgaggatg tcttcaccta tacgttgcag 4500 ttcggcgaga aggtgcgtat tctgatggaa aacaccacga ttcctacgaa aagggaaatg 4560 cttcggctcg tcatgagcat ctacgaccca ttagggatag tggcgtcatt cgtaatccac 4620 gggaaaattc tcatccaaga agtgtggcgg tccaaaatag attgggatag ctgtatttcc 4680 gaagagattg ccacgcgctg gacagattgg accactgtgc tcaagaggat ggatggattg 4740 cgtataccac ggtgctactt ctcaggatac gacccagcga gcttcaaaac tcttgagtta 4800 cacatttttg tggacgcgag cgaacaggct tttgcagcag tagcatactt ccggatcgta 4860 gaccagggac agatacgagt tgcacttgtc tcatcgaaaa cgaaagtcgc tccacttaga 4920 ggactttcaa taccacggtt ggagttgatg gcagcgttgc ttggagctcg tctacgcagg 4980 actgttgagg aaaatcacac gctgaaggta cagaaaacct gcttctggag cgattcatct 5040 acggttggtt cgtggatcag gtctgaaacg cgtcgatatc gtcagtttgt cgccttcaga 5100 gtcgatgaaa tactgagtct ctccaaaatt gatgagtggc aatggatttc gacgaagatc 5160 aatgttgcag atgaagccac caagtggggg aaaggtccgt cgtgtaatgt ggacagccgt 5220 tggttccagg gtcctgagtt cctctacagc aacagtgaag attggtcgat gactccagac 5280 gaagatacgg atgaaagtga gatggaacta cgggaggcgt acgtcttcag tcatcatatc 5340 aggaagccaa tgatagacac taacagattt tcgcggtttg agcgaatgtt gcgttctgta 5400 gcatatgtcc accatttcgt agacagccta cgctacaaaa aggctcacgg atcggcggat 5460 ttcgccggct tgacgagtgc agagcttcaa aaagcagaaa gaacgttgtg gtcgattgct 5520 cagtccgagg cgtttcctga ggaagttgcg accctgaagc agaacccaaa tcgaagcact 5580 ggacaacaga agcagataga agcgtccagc agtattgtga aacaatcacc attcgccgat 5640 gaatacggag tattgcgggt aggcagtaga gcagcagaag cggaggtact cacgtacgat 5700 gcaaagtttc cgattattct tccgagaaac catcggatta ccgaacttct gttggacttt 5760 tatcatcgaa agtacggtca tgccaacgat gagactgtag tgaatgaagt gcggcagaaa 5820 tttcacgtgc cacgtttgcg agtggaaatt cgcttagcca ggaaacgttg catgtggtgt 5880 cgcgtctaca aatccacacc agtttgtcct aaaatgggac cgctgcctgc ggtacgatta 5940 caaccggatg tacgcccgtt tacgtttgtt ggagtggatc tttttggacc atatttggta 6000 aaggttggac gcagcgtggc gaaacgatgg gtctgccttt tcacctgcct tacggtcagg 6060 gctattcatc ttgaggtagt tgccagtttg tccacggatg cgtgcaagaa ggcgatcaga 6120 aggttcgtag cacgacgtgg ctctcctcaa gaaatatttt ctgacaacgg cacaaacatc 6180 atcggagcga gtcgggagct tcaagacgag ttcagaagga tggctacgga attggggagc 6240 acttttacga acgcccatac gcaatggcgg ttcaaccccc ctgctgcacc gcatatgggt 6300 ggctgttggg agagaatggt acgggcagta aaggcagctc tagggtgcgt tccagtagta 6360 cggaagctgg acgacgaatc gtttgcgacg gtattggctg aagcggaaag catggtcaac 6420 tccagaccgc ttacgttcat cccactggaa acggccaacc acgaatcact gaccccgaac 6480 cattttttgc tgttaagttc gaatggcgta cgagaacccg agaaatttcc gacggatgca 6540 gggatggcgc tgagaagtag ttggaacacg gtgaagcata cgctggacaa cttctggcga 6600 cgttgggttg tagaatatct accaactatt atacgtcgga cgaagtggtt ccgagatgtg 6660 aaggcgatcg aggttggaga tctagttctc gtagtggacg aaaatgtgag gaaccggtgg 6720 atacgtggac gagtgatcca aacgatgccg ggaaaggatg gcgtaccgcg tagagcggaa 6780 gtgcaggtat cgggcggaac gttgaagcgg cctgttacga agttggctgt gctggacgtg 6840 ataaggtctg gtgacgccgg accagatgtt aaggcgacac gggggggagg a 6891 // ID P1a_Cis repbase; DNA; INV; 2614 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE P DNA transposon from Ciona savignyi. XX KW P; DNA transposon; Transposable Element; P1a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-2614 RA Smit A.F.; RT "P1a_Cis - P DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000947, Ci000957, Ci000859 26 bp TIRs. Multiple internal CC deletions with respect to P_Cis1; also 5-10% divergence from it. XX SQ Sequence 2614 BP; 897 A; 486 C; 451 G; 776 T; 4 other; catagagata ctaatataca ctagagtggc ctttgttaaa taattgataa tgctgcattg 60 ttgtaaaatg aatgcatttg ccgtgcaatg gctgtgttgc cattcaaata atggtttcat 120 tttttataat ttgtatttaa ctgaacgtac ttttaagcgg agttttaaga gaaaataaat 180 aattcttact accaaacttt attttccata attttaggaa caaatttatt aaacttttgc 240 atttcaaatg catcatataa tttttaaata gaaatatatg cgcaacactg gacaagacga 300 cgttcaagtc gtcagccatt tcttcgtgcg aaacaagtcc acagaaacgt taaaagttgt 360 aacctatttt aatacctacc ggtcacgttt aatacctacc actcagacac tgtctttgaa 420 atatacaccg ggtatatatc tattgtttca gtgttactgt atttgacgca aacggcactt 480 tacgttgcca cgttatatta tatacaataa aaactgtacc aacagtagcg cgtcggaccg 540 tcagccggaa cggtgcaagc ttgaaacctc catcgtatgc aatacactgc tatagctcca 600 ataaccaagc attgggcgag tgggcccatt aaaccggagt atcctatcct cgttaggtac 660 ttttagaaaa taatatcagt accggtagcg gaaatgattg tttcttgttc tgcgtggtgt 720 tgcataaaac gtcaaatcaa gactgcaatg tagatttttc acaggtacat tgtttaacgc 780 ataatattat acaaatatac cacatacacg aagttgaact gtgagaccac aactactgta 840 atactggtag tccaaaacaa aagataaaaa acataaaaca gaagttaatc aaaacaggaa 900 aacataaaaa cagaagttaa tcaaaatagg aaaacaaaaa agtgttactt gtgcagaatt 960 agcaantgat atntttcact catattggat gagatggcaa tcaaaaagca agtggaatgg 1020 aatggaaagg agtacacagg atttataaaa tttagatgac gattctctat ctgttgcanc 1080 aaacgctctt gtgtttatgc tngttccaat aaatgcaaac tggaaaattc caatagctta 1140 tttcttaatt aatggcttgt ctgggaaaac tcttgccgaa gtagtgcgaa aggtgttgtc 1200 ctgcctgcat gacaataaca tactagtttg ttcattaact tgtgatggct gtgggagcaa 1260 ccagagtatg ttaaatgaat taggagtgca tgtgagatat ccaatcaacc aaagttattt 1320 tcttcaccca gaaaatccca atcaaaaaat ttccatcttt cgagataatt gcctatgttt 1380 aagttaatgc ggaaccttct tggtcaaaaa aaggtgttgc aaaattcaat tacaggtcaa 1440 catataaaat ggggttatat aaaaaaaact tgcacactat acaaacaaag gaaggtctaa 1500 agcaggctaa taaattaaaa cggaattata ttgagtacaa gtacaaacct tgagtgcttc 1560 cactggtgaa gatatctagt gtcgtacaca gcttgacttg aaattatttt ggggggcatt 1620 agaggggcca atggttggaa caacaaccct acatgtcagc aaatccgcct acagaaagct 1680 gttaagttta actgatgtta gtaacagccg gaaattgttc ccaaacagca gagtttacca 1740 caagcctttc tattttagca ccgatttaaa tggcatcacc taatatcgat attgaatgaa 1800 aatacaacct tgattcagac atatcagacc aaaatgcaac tgatcatacc tatctgcaaa 1860 catgttactt ctcgtctggt acatatttgt caccgatgat agaaactata gtgttgtaca 1920 tatcaggatt tgttgcaaaa aagttaatac aaaagttact ttgcaacatt ttcattaact 1980 caacgtacac cccagtacct ccagatcatg accaaagctt catactcatc tcagtgttct 2040 aacacacctg cagagtgaaa agtttttatt ttattccttg aaattcttcc ctttaaaatt 2100 tgaacaagtg caaaaaccta gtaagaatgt cttcccagtg cgaatctgct ggcaaccagt 2160 ttgtagccta gcctacagat tttttagtta aacattcaat ggagttttta accagtttta 2220 gtttatcttt gctgtgcatt tgaaataaac taaaataact tttctgaaaa tttggtccgt 2280 gtaagtacgt aagcctatgc aaaagtgcgt actgccggta ccggtactcc gatatacgcg 2340 gtggcctagc ctactctaag gcgaattaaa aaccttttta tataaaataa ggtttaaaac 2400 ttaattaaca actaatatat atcagcaaag catgtatatt aatcagaaaa caaatgcaca 2460 ctgcacattt gttaactatt taccatacct gggtaaatca gcaccaaaca cgtgtcggaa 2520 ctcttgccgt gcaaagcttc aaaatggcgg ccaccatggt gacaacaatt acgcaacagt 2580 aatgccgcct ctagtgtata ttagtatctc tatg 2614 // ID Hoana3 repbase; DNA; INV; 2542 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana3 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoana3. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-2542 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 387..2294 FT /product="Hoana3_1p" FT /translation="MCEKMDKAKVQRALKENDPDYKLVKPENSKSDVWQLF FT SLIEKCGEKINFVCCTKCKIVFAYTAKTGTGTLLRHKCDVVKAQLNNQPSM FT KEFVKHDVPRSVLRNLACKQLKLVAKDLQPLNATEGTTIHTQISFFSITFF FT SQYTQLVSLQIFNLQLHYLILVGEGFQEYVQEVINVVSTYGKQNFKDLVVS FT RRTLTRDIMVTEYTRIKSELQKTLHLYELAFTTDMWTDTYTQRSFISLSAH FT YITNSFELKVSILGIKEFTSEKKTGVNILAHVQNILEEYDLKSSLHKSVIV FT TDNGANVVAAFNKYKRISCACHNLNLVMEDVLERNRNQELEILLDNCKKLV FT GYFKHSELNNKLTKSLKQDVRTRWNSIYIMLSSILEGQEEIEKLLLQKNEI FT KRIANIDFVFMKHLINFLAAFKECSEQLSSEKQPTMHIYVLWFEKLKKHCN FT SLNEFVDADIISQLRSMTLNSLQKRFQPATINFVGLFLNPPFKELKFLGEE FT KKNHVILTVKNMIDELKKNLDFNFESESENSPPKNTNHNFAEFFDNRQTKK FT IKLMKSDTDIEIDNYLTIEYSADTPILTFWENAHHLKLLRMLAKQILNIPV FT SSATSERVFSCSGQILNDRRTRLTSSNLDKILFLNKNM" XX SQ Sequence 2542 BP; 903 A; 422 C; 445 G; 772 T; 0 other; tagaggtggg catgggctcg tgttttcgtc gtggcacgcg tgcgcacgcg tggccacgcg 60 tgttcgtaac aaaaataaaa atttaacacg cgtgcgtgcg ttttcgtgcc gaccctccaa 120 aacacgacaa aacacgaagc attaaaaatt gtattttctg gaaatttcga gaaaaaaaaa 180 tttttgcaac gttgtatttc gattttgtca tttgacaaag tgccaaaatg ccaaaatgcc 240 aagtgccgtg atgccactcg atattggcaa aaaaaaaata ttttttacaa ctttttcatt 300 cgatagtggc gactttcgat aattttcagc ttaagcttct caatgacata gttgaccacc 360 tctatatttt gtgtaaaacc aatagaatgt gcgaaaaaat ggataaagca aaagttcaac 420 gggctttaaa agaaaatgat ccggattaca agcttgtgaa accggaaaat tctaaaagtg 480 atgtgtggca attattttcg ttgattgaga agtgcggtga aaaaattaat tttgtttgct 540 gcacaaaatg caagatagtg ttcgcctata cagccaagac aggcactgga actttattgc 600 gacacaaatg tgatgtagta aaagcacaac ttaacaatca gccatcaatg aaggaatttg 660 tgaaacatga tgttcccagg agcgtgcttc gcaatttggc atgcaagcag ctgaaactag 720 ttgccaaaga cttgcagcct ctgaatgcta ctgaaggtac aacaattcat acacaaatta 780 gttttttttc aattaccttt ttttctcaat atacacaact agtatcttta caaatattta 840 acttacaatt acattaccta atacttgtag gtgaaggatt ccaagaatac gtccaagagg 900 tcatcaatgt tgtttctacc tatggcaaac aaaattttaa ggacttggta gtgtcgagga 960 gaactttaac ccgagatatt atggtgacgg agtataccag aatcaagtcg gagctacaaa 1020 aaactctaca tttgtatgag ctagcattta ctacggatat gtggacagat acctacaccc 1080 aaagaagttt tatatcatta agcgctcatt atataacaaa ctcatttgag cttaaagttt 1140 ctatattagg tatcaaagaa tttacctcgg aaaagaaaac gggcgttaac attttagccc 1200 acgttcaaaa tatcttagag gagtacgatt taaaatcaag tttgcataaa tctgtaattg 1260 taaccgacaa tggcgcaaat gttgtcgcag cttttaacaa gtacaaacga atttcatgcg 1320 cctgccataa cttaaattta gttatggagg acgttttaga aaggaatcgt aatcaagaac 1380 ttgaaattct tttggataac tgtaaaaaat tggttggata ttttaagcat tccgaattga 1440 acaataaatt aactaagagt ctgaaacaag atgtaagaac tcgatggaat tccatttata 1500 tcatgctctc gagcattctt gaagggcaag aggaaattga aaaactactt ttgcaaaaaa 1560 acgaaattaa acgaatagcc aatatagatt ttgttttcat gaaacatctt attaattttc 1620 ttgcagcctt taaagaatgc tccgagcagc tgagttcaga gaaacaacca acgatgcata 1680 tttacgtact ctggtttgaa aagttaaaaa aacattgcaa ttcacttaat gagtttgtag 1740 atgcagatat tataagtcaa ctaagatcta tgacgttaaa ttcattgcaa aaacgctttc 1800 agccagcaac aataaatttt gttggcctct ttttaaatcc gccatttaaa gaattaaaat 1860 ttttagggga agagaaaaaa aaccatgtaa ttctaacagt aaaaaatatg atagatgaat 1920 taaaaaaaaa tcttgatttt aattttgaat ctgaatctga aaacagccca ccaaaaaata 1980 ccaatcacaa cttcgcagag ttttttgata atcgacagac taaaaaaatt aaattaatga 2040 aatctgatac agatatagaa attgacaact atttgactat tgaatacagt gcagatacac 2100 ctatattaac attttgggaa aatgcacatc atttaaaact tttgcgaatg ttggcaaaac 2160 aaattctgaa tattccagtt agctctgcta caagtgagcg agttttttca tgttcagggc 2220 aaattttaaa cgatcgtcgt actagactta caagctctaa cttggataaa atattgtttt 2280 taaataaaaa tatgtaaaat attaataaga ttttcctaca tatacaatta tactattata 2340 tattaagata ctttaatttg ataaaaaaaa tatatgtacc tattcctatt taaataaata 2400 atgtaaataa tttatttcgt gttgggccgt gttgggccgt gttgccaaaa tttttggccg 2460 tgcgtgtgcg tgcgtgctaa aaaaacttcg ctgtcgtgtc gtgttgggcc gtgtcgaaaa 2520 attcagccca tgcccacctc ta 2542 // ID BEL-648_AA-I repbase; DNA; INV; 6178 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-648_AA_; KW BEL-648_AA-LTR; Pao_Bel_Ele213; BEL-648_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6178 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5240-5797] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..1417 FT /product="BEL-648_AA-I_2p" FT /translation="MPKDDAHDCGACNRPNSADVGMVSCDGCKVLYHYTCA FT KVSPGVLKRSWRCSTCQTEPPAEPTGTKKRGVRKQSASLTVLGAASSEIPK FT SSAASQKDLEKSKASQQSKTSEKSITVVVPIPGTSGVITPKKNPEVPTVAT FT HGEKKSTKSSTSTAKARAQLALQRLEEERLLEEQKLMDERARVEEERIRLE FT KEKQLKEQEHAIKAKELAMREKYLRDKFELKEQIADEESSSNSSPISSRTR FT TKAWLQDQQKLKPIQDDNSVTHKSEYNLQNIEAGSNRLQPPSVVRSYRPRP FT PSVVGPTQLQPADLENDRQVEDLRFNAPLGFPNQRGDFGNEMQPVALEGVP FT NLQGNREGCENRMQPEDYLPSAVQNEAGFNAGMLPALGGCLEGPMNEMRPG FT LHGAGPNANQIAASKSGRRSCPRSLATRKIGQYSSTVSRRPTLRVASQTWK FT ISSGCGSACEGQRETPSSRS" FT CDS 1420..3693 FT /product="BEL-648_AA-I_1p" FT /translation="MFPQNVRTIMDTLRRLFGRPELLVKNLLNKVHRAEAP FT KPERLDSLINFGLTVQQLCDHLEAANLRSHLSNPTLLGELVEKLPASIKLE FT WARFKRAHADPTLRHFGMFMEELVNDASEVTMPIQQKSAVSRADRDKLKEK FT GHVYAHEEVADSQSRSKERQPCPICSKIDHRVRNCEQFQKLSLDARLKAVE FT RWKLCEVCLFDHGQWRCRSRFRCNVGSCRDRHHPLLHRPKQEQRPSQSQAS FT ECHTHQRPQRSVLFRIIPVTLFNENRKCETFAFLDEGSSLTLVEAGLARQL FT GANGVAEPLELLWTSSVMRNENDSKRVDCEIAGKGLPRRFALKNAHTVEEL FT KLPNQSLAMKELSERYPHLRNLPVSSYSEAVPRILLGLENLSLFAPLDSCV FT GRPGEPIAVKSLLGWSVYGPVGNGRAKEGFVNVHECNCGADRELNDLVRQQ FT FILEDSMIRSTPPLESTDEKRAREILECTTKFVDGKYETALLWKADDVELP FT NSLPMAMKRLKSFEAQLAKDPSLRDAVNQQINDFVQKGCIHKATEEELIEI FT SRRQVWYLPLGLVTHPKKQKKRLVWDARAQVNGISLNSQLLKGPDLLVSLP FT SVICKFREKRVGFGGDIREMFLQLRMRTSDKYFQCFLFRSDSQQPPEVYIA FT DVAMSGATCSPCVAQHVLRVNADKWAKEFPQAAAAIKDKTYMDDYYDSADT FT PEEAAELAVQVRTIHAHGGFEMRNWVSNSAEVLEKLGESAVVEPRILQSSV FT KSDGKEF" FT CDS 4133..6178 FT /product="BEL-648_AA-I_3p" FT /translation="MAKSRVAPLQHLSIPRLELEAAVLGAKLLNTVMTNHS FT LKPRKVYLWSDSSTVLSWIRSDHRRYKQFVAHRIGEILSLTQTENWRWVPT FT KENIADCLTKWVDDTEPDSNGSWFKGPAFLYHPEDLWPQQRVKANTQEELR FT SRYLLAHISVPIRMIDPSRISKWTVLLRTVACVYRFISNCRLRIAGRPIEA FT IPVTRNQEKLLKRTVAAQVVPLKQEEFQQAERFFWRMVQGEYYPDEVRTLM FT NNRDQPIDKWTPVEKDSPLYRFSPFADEYGVVRMEGRTADAAYASFDARFP FT IILPKDSPITQRLLEHYHHRYGHANKETVVNEVRQRFEIPHLRSVVEKISR FT NCLSCKVNKCKPHPPRMAPLPEQRLVSNVRPFSYVGIDYMGPLEVSVGRRK FT EKRYVVVFTCMVVRAVHLEVAYDLSSDSCIMAIRRFTRRRGSPVEIFSDNG FT TNFVGASRELQKQVERIDLECAGTFTDARTKWSFNPPSAPHMGGVWERMVR FT SVKEAMAALEDGRKLTDEILWTTLVETEGLINSRPLTYMPQEPDNPEALTP FT NHFLLNSSSGAHEPLEPPEDLGQALRSSFARSQQLADVAWERWSKEYFPTI FT NERSKWLNEAKSLKVGDLVYVAEGKRRSWTRGIVDEVFPGKDGRIRQVIVR FT TASGKLKRPVVKLAVMELGGSTESPPVDPRGGG" XX SQ Sequence 6178 BP; 1630 A; 1513 C; 1754 G; 1279 T; 2 other; tcctcaaaga tttgtgctta agatgccaaa ggacgatgcc catgattgcg gcgcctgcaa 60 ccgaccgaat tccgccgacg tcggaatggt gtcctgcgat gggtgcaagg tgttgtacca 120 ctacacgtgt gctaaggtgt caccaggggt tctaaaacgg tcttggagat gcagcacttg 180 ccaaaccgaa ccgcctgccg aaccaaccgg tactaagaaa aggggagtta ggaagcagtc 240 ggctagcttg actgtcctcg gtgcagcatc gagcgagatc ccgaagtcaa gtgccgccag 300 ccagaaggat ttggagaaat cgaaagcatc ccagcagtcg aaaacctccg agaaatcgat 360 taccgtcgtt gttcctattc ccggcacgag cggcgtaatc actccgaaaa agaatccaga 420 agtcccaact gtagcgaccc acggggagaa gaaatccacc aaatccagca cttctaccgc 480 caaagctcga gcgcaattgg cattgcagcg attagaagag gagcgactat tggaggagca 540 gaagttgatg gatgaacggg cgcgagtaga agaggaacgg atccgtcttg aaaaggagaa 600 gcagctcaag gagcaagaac acgcgataaa ggccaaggaa ctagcgatgc gagagaagta 660 tctgcgagac aagttcgaac tgaaggagca aattgccgac gaggagagca gcagcaattc 720 cagtcccatt agtagtcgga ctagaacgaa ggcctggctg caggaccagc agaagttgaa 780 accgattcaa gacgacaata gtgtaacgca caaatcggag tacaatctgc agaatatcga 840 ggcaggatcg aatcgacttc aaccgccatc cgtggtaaga tcttatagac ctcgaccacc 900 atccgtagtc ggaccgactc agctacagcc agctgacttg gagaacgatc gccaagtgga 960 agatttgaga tttaatgcac ctctcggttt cccgaaccag cgtggagatt ttgggaacga 1020 gatgcaaccc gtcgccctgg aaggagttcc taacctccaa gggaaccgtg aaggctgcga 1080 aaaccggatg caacccgagg attatcttcc cagtgccgtc cagaatgaag ccggattcaa 1140 tgctggtatg ctgccagcgc ttggtggatg cttagaagga ccgatgaatg aaatgcgacc 1200 aggattgcac ggggcggggc cgaacgccaa ccagatagcg gcaagcaaat ctggccgaag 1260 aagttgccca cgttctctgg cgacccggaa gattggccaa tattcgtcca cagtttcgag 1320 acggccaacg ttgcgtgtgg cttcacagac gtggaaaata tcatccggct gcgggagtgc 1380 ttgcgagggg cagcgagaga cgccgtcgtc acgaagctga tgtttcctca gaatgtgagg 1440 acgataatgg ataccctacg ccggttgttc ggtagaccag aactactggt gaagaatttg 1500 ctgaacaagg tacaccgggc ggaggcaccg aaaccggaac gattggattc gttgatcaac 1560 ttcggtttga cggtgcaaca gttgtgcgat catttggagg ccgccaacct tcgaagtcat 1620 ttgtcgaatc ctacgctgct tggcgagtta gtggagaagt tgccagcatc catcaaactc 1680 gagtgggcga gatttaagcg agctcacgcc gatccaacgc ttaggcactt cgggatgttc 1740 atggaggagc tggtgaacga cgccagcgaa gtcacgatgc caatacagca gaagtcggca 1800 gtttcaagag ccgacaggga caaactgaag gagaagggac acgtgtacgc ccatgaggag 1860 gtcgccgatt cgcagagccg cagcaaagag agacagcctt gcccaatatg cagcaagata 1920 gaccaccgcg tacggaactg tgaacaattc cagaaactga gcctggacgc tcgtctaaag 1980 gccgttgaac gatggaaact gtgtgaggtt tgtttgttcg accacggcca atggaggtgc 2040 cgatcgagat ttcggtgtaa cgttgggagc tgccgagatc gtcaccaccc gcttcttcat 2100 cgtccgaagc aagagcaacg tccatcacaa tcccaagcgt cggagtgcca tacgcatcag 2160 cggccgcagc ggtcggtcct gtttaggatc attccggtta cattgttcaa cgagaaccgt 2220 aaatgtgaga ctttcgcctt cttggacgaa gggtcttcat tgacactcgt agaggctggc 2280 ttggcgcggc agcttggagc gaacggtgtc gctgaacccc tagagctgct ttggacttcg 2340 agcgtaatga ggaacgagaa cgattcgaag cgtgttgatt gcgaaatagc gggaaaagga 2400 ttaccccgtc gttttgcatt gaagaacgcg cacaccgttg aggagctgaa gcttccgaac 2460 cagagtctgg ccatgaagga gctatcggag cggtacccac acctccgaaa tctacctgtt 2520 tcgtcgtact ctgaagctgt ccctaggatc ctgctcggac ttgaaaatct gagtcttttt 2580 gctccattgg atagttgtgt cggccgcccc ggagaaccaa tcgccgtgaa atcgctgcta 2640 ggctggtctg tctatggacc ggttggaaat gggagagcga aggagggatt cgtcaacgtt 2700 cacgaatgca actgtggcgc tgatcgagag ttgaacgacc tagttcggca gcagtttatt 2760 ctagaagact cgatgatccg gtcaacacct ccgctggaat ccaccgacga gaagcgagct 2820 cgcgaaatct tagagtgcac tacgaagttc gtggacggaa agtatgaaac tgctctactt 2880 tggaaagccg atgacgtcga actaccgaat agtctcccga tggcaatgaa gcgactgaaa 2940 agtttcgagg ctcaactagc gaaagatccg agtttgcgcg atgcagtgaa tcagcagatc 3000 aacgacttcg tacagaaggg ctgtatacac aaggccaccg aggaagaact gatagagatc 3060 agtcgaagac aggtttggta cttaccgcta ggcctggtaa ctcacccgaa gaagcagaag 3120 aagaggctgg tgtgggacgc aagggcgcaa gtcaatggca tttctctcaa ctcccaattg 3180 ctgaagggcc cagacctgct ggtatcgctt ccgtccgtga tctgcaaatt tcgcgagaag 3240 cgcgtcggat tcggaggcga tattcgagaa atgtttttgc agctgcgtat gcgaacgtcg 3300 gacaagtatt tccagtgctt cctgtttcgc tccgattccc aacagccacc ggaagtctac 3360 atcgctgacg tagcaatgtc tggtgccaca tgttcgccat gcgttgcgca gcatgtgctg 3420 cgagtgaacg ccgacaagtg ggcgaaagag tttcctcaag cggcggcggc gattaaagac 3480 aaaacatata tggacgacta ctacgacagc gccgacactc ccgaagaagc tgcggagcta 3540 gccgtacaag tgcgaaccat ccacgctcac ggaggcttcg aaatgaggaa ctgggtcagt 3600 aacagcgcgg aggtgttaga gaagcttggg gaaagtgccg tcgtggaacc gcgaatcctg 3660 caatcaagtg tgaagagcga tgggaaagag ttttaggaat gttgtggcat ccacggacac 3720 tctgacgttt tctacggaac taggagagca gctgcttccc tacgtgtccg gaggaaagcg 3780 accaacgaaa cgcacggctc tgaagatcat aatgagcctg ttcgatcctc tgggtttgct 3840 ggcgccgtat ctcatccacg gaaggaccct gatccaggat ctttggagaa gcggaataca 3900 gtgggacgag caaatgaagg acgacgagca cgagaactgg aagcgctggg taaaattgct 3960 gcctggtatt aaagacctca gcattccacg atgctacttc ggtgatgtag atccacgctg 4020 ctataagtcg ttgcagtgtc atgtgttcac cgacgccagc gaggtcagtt atggctgcgc 4080 ggtgtacttt cgcataacgg acgamtacga acgagtgcac tgcacgttga tcatggccaa 4140 gagcagagta gcccctctac aacacttgtc gataccccga ctggagctgg aagcagccgt 4200 gctaggagcg aaactgctga acaccgttat gacgaaccat tcgctgaagc cccgcaaggt 4260 gtatctgtgg tcggattctt cgaccgttct ttcgtggata cgttcggatc acaggaggta 4320 caagcagttt gttgctcatc gtatcgggga gatcctgtcg ctgacccaaa cggagaactg 4380 gcgatgggtt ccaacgaagg aaaacatagc cgactgcctg acaaaatggg tggacgacac 4440 agagcctgat tcgaacggta gctggttcaa aggtcccgcg ttcctgtacc atccagagga 4500 tctatggcca cagcaacgcg taaaggccaa cacgcaagaa gaactacggt cgagatattt 4560 gctcgctcat atttcggtgc ccataagaat gatcgatcct tccagaatct ccaagtggac 4620 tgttctatta cgaaccgtcg catgcgtgta ccggtttatc agcaactgcc gattgcgaat 4680 cgcgggacgc ccgatagagg ctattcccgt aactagaaat caagagaagc tactaaaacg 4740 gacggtagcg gcgcaggtcg ttccactgaa gcaggaagaa tttcaacaag ctgaacgttt 4800 tttttggcgg atggttcaag gagaatacta tccagacgaa gtccgaaccc tgatgaacaa 4860 ccgtgatcag ccgattgaca aatggacacc cgtggagaaa gacagtccgt tatacagatt 4920 ttctcccttt gccgacgaat atggagtagt cagaatggaa ggaagaaccg ctgatgctgc 4980 gtacgccagt ttcgatgcaa gatttccaat cattcttcct aaggacagcc cgatcactca 5040 acggctactg gaacactatc atcatcgata tggtcatgcg aacaaagaga ccgttgttaa 5100 cgaagttcgc cagcgattcg agatcccaca tctacggtcg gtggtggaga aaatttcccg 5160 gaactgcttg tcctgcaagg tgaacaaatg caagcctcat cctccacgaa tggctccctt 5220 accggaacaa cgattggtgt ccaatgtccg acctttcagc tacgtcggta tagactacat 5280 gggtccgctg gaggtctccg taggtcggcg caaggagaag agatatgtcg tggtgtttac 5340 gtgcatggtt gttcgggcag tccacctaga ggttgcctat gacctgtcga gcgattcttg 5400 cattatggca attcgtagat tcacccgtcg gcgtgggtca ccggttgaga tcttttctga 5460 caatggcacc aattttgtcg gtgccagtcg agaactgcag aagcaggtgg agcgaatcga 5520 cctggagtgt gccggtactt ttactgatgc gcggacgaaa tggtcgttca atcccccctc 5580 cgcaccccat atgggcggag tgtgggaaag aatggtgcgg agcgttaagg aagccatggc 5640 cgcattggaa gacggccgaa aactcaccga cgaaatattg tggactacgc tggtcgaaac 5700 tgaagggctg atcaactctc gaccacttac ttatatgccg caagagccgg acaatcctga 5760 ggctttaacg cctaaccatt tcctcttgaa cagctcatcc ggcgcacatg aaccgctgga 5820 accaccagag gacctaggac aggcccttcg aagcagtttt gcgcgttcac aacagctagc 5880 ggacgtcgca tgggagcgat ggtcgaaaga gtactttcct acaatcaacg aaaggtctaa 5940 gtggctaaac gaggcgaaat ccctgaaggt cggcgacctc gtgtacgtag cggaaggtaa 6000 acgaagatct tggacgagag gtattgtaga tgaggtcttt cccggaaagg acggwagaat 6060 ccgtcaagtg attgtaagaa cagcgagtgg taagttgaag cgtcctgttg ttaaactggc 6120 tgttatggag ctgggcggat ctacggagag ccctcccgtc gatccacggg gcggggga 6178 // ID Tx1-12_BF repbase; DNA; INV; 5178 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-12_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-12_BF; KW Tx1-12_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5178 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5178 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 849-849 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 219..1394 FT /product="Tx1-12_BF_1p" FT /note="ORF1p." FT /translation="MATAMTHPSSYAQAAATTGPVTTSREETAGSNVRSGC FT KPLFFLERDVHEVSQNEPYFSNVEVYKAISRVIDGSKILGIQQVRSMWRIY FT LSEIADRAKLLVSGFNVRNKHVDLRDVHPFRQNEGVRITVKDVPLSVDDSV FT VAEGLRRYGAKLLGPLRREKLRVDGRLTNCETGDRFGYMQEPATPADNIPR FT NVELAGRWRARIFYRDQTNDRNCSKCLGKGHLKRDCTSEWRCSQCNKLGHK FT RADCSGVVASTGENSLETGAEVRGAVPASEGYSCEDADTEPERSAAARQSQ FT ITDFTVAASKKKNKKDQQRPGGQKSTGSRVTRSNSRQQPNSRQIQQSRKEA FT AGVQPSRQVQAPDPRVMELIEAGRHSWSSPPGSDATGSTLTSSSEDDA*" FT CDS 1441..5127 FT /product="Tx1-12_BF_2p" FT /note="endonuclease and RT." FT /translation="MTSNLSFISLNVRGLRNHSKRTTLYHWLERQKADIFL FT LQETYYTEDKMDVFERDWNGPQFHCFGDSHSRGVSILCKKKIDMNVTDVIK FT GNDGRKLLIKLKDLTILNVYVPTEKKEQQTFLTSLRDWINDNNVDNLIAGG FT DFNMVNNTAMDRNSKQSLAPSAGEKELIRFKRQLKLVDIWRERHPQVRQFT FT WRRKNPDLISSRIDFWLIQKSIIDSVVKCDIRCSIKTDHQAIFLKLLLTDF FT ARGPGVWRFNTSHLSDDCYVNGVKSTLIDVVKEFKDNNVSKQMLWEICKNK FT IKDFTDKFSRQKAFQLNNDLVRLETQLTELDKKIDETLDKDVLDVYAQVKS FT KVETIYEYKAKGAQVRSREQWIEKGERSNKYFLSLESSRSKQNTITRLKHE FT NKEETNPFKILDLESAFYENLYCCDNPATDKQIDTFLESVDVPSLSDEDRL FT LLDDPITKNECFSALKYMKQNKAPGTDGLPAEFYLKFWKEIGDMVFESLKE FT GIENKRLSVTQKRALLKLIYKKGDKTDLSNWRPISLLNTDFKILAVVLAKR FT LQDFLPKLISEDQTAYIKSRFIGQNVRLIQDVIYYSNFYNIPAAVVWLDFE FT KAFDSVNWNFMLRTLDKFGFGPTFTSIVKTMYTDIESCIINAGWQSAYFKL FT QRGIRQGCPLSALLFNLVVEILATKIRQTRTVKGINVEINDNMFELKISQV FT ADDTTLTLCGDDSIRNALQIVDDFSNVAGPTLNRRKCEAMWIGANRGRRDT FT PFGIDWPNRPIKSLGIYYGLSEEECETLNWEKRIDNFENMLLKWLSRKLTL FT YGKVLIIKTLAISQLIYNMSMISTPDWALKRIEKACFSFLWNRKPDKIKRK FT VLYSNYQEGGLKMVDIYSKRSSLQAVWVKRLMQSALDDTKERHWHLIPLHF FT INHFGKDFLLFRFNFCKGGDAPSLRVLPPFYRQVVLTWHELGGGKTDFVRQ FT RQEIIWCNKNYKLNNSTLFFEHWIRSGLIYVNDLCENSEYVASDILLRKLS FT DKRNWMSEFMQVKRAIGYSWTDENDCTGRETLYGTCVWFNSSMISLDTLNS FT RYLYSVSLMENVVRPTAQAYWDSVTPNLDWKAIWESQTCYIRYNKLREFNF FT KLLHKILPCRYLLNKWKIRDTSGNVYSPFCDICNKLENYEHMFLECTRLRK FT FWGRLNNTEDNQSVKNVLLEHNKCNITLLTIAKFSIYISWIRWKDNPRQLL FT LDDVHNTFMSLCSLYSAE*" XX SQ Sequence 5178 BP; 1703 A; 976 C; 1140 G; 1359 T; 0 other; gaaatgtcag aggtcgtctc ccaagggaga cgggttttcg agcgctctta cgcttacgat 60 cgcttattcc atacaatttc aaagtcttga ggtgttatat tagacaatcg tgtgtcggag 120 atatccaaga ccacactgtg cctcccacga gccggtaaac aagctggaga ggcgcagacg 180 gactgagaaa agccgatagt gcggcggaaa gttggatcat ggcgaccgcc atgacacatc 240 cctcgagcta cgcgcaggca gcagcaacaa caggccctgt aacaacatca cgggaagaga 300 ctgctggatc taacgtgagg tcgggctgca aacccctgtt cttcctggag agggatgtac 360 atgaggtatc tcagaacgag ccgtactttt ccaacgtcga ggtgtataaa gcaattagtc 420 gcgtcattga tggctccaaa attctaggca tccagcaggt gagaagtatg tggcgaattt 480 acttaagtga aatcgccgat cgagcaaaac tactcgtgtc tggatttaat gttcgcaaca 540 aacatgtcga cctacgagat gtacatcctt ttcgacagaa cgagggagta cgcatcacag 600 tcaaggatgt accactttcc gttgacgata gtgttgttgc agagggccta agaagatacg 660 gcgccaagct tctgggaccg ctcaggcgcg agaagttaag agtagatggc cgcttaacca 720 actgtgaaac cggagacaga tttggttata tgcaagagcc tgctactccg gcagataaca 780 ttcctaggaa cgtcgagttg gcaggacgct ggagggctcg gattttctat cgagaccaga 840 cgaacgacag aaactgctca aagtgccttg gtaaaggaca cctgaagaga gattgcacct 900 ctgagtggag atgctcacag tgcaataaac tagggcacaa gcgagctgac tgtagcggag 960 ttgtagcctc tacaggcgaa aacagtcttg agacaggagc tgaagtgagg ggtgctgtac 1020 ctgcttctga gggctactcc tgtgaagacg cagacactga acctgaacgg agcgcagcgg 1080 caagacaaag ccagatcact gactttactg ttgcggctag caaaaagaag aacaaaaaag 1140 accaacagag acctgggggc cagaagtcga cagggtctag ggtgactaga agtaacagtc 1200 gacaacagcc caatagcaga cagatacaac agagtcggaa ggaagcagca ggtgtccagc 1260 ccagcagaca ggtgcaggca cctgacccac gagtaatgga gctgatagag gccggacggc 1320 actcttggag cagtcccccg gggtctgatg ccacaggatc aacgcttaca agctcaagtg 1380 aagatgacgc ataggctaaa cttcatggac ttatgattgt gagtttcttt ttgtctcaac 1440 atgacatcca acttatcctt tatctcgctt aatgttcgag gtcttagaaa ccacagtaaa 1500 cgtacaaccc tttaccattg gttggaacgc cagaaggcag acatctttct cttacaagaa 1560 acctattaca ctgaagataa gatggatgtc tttgaaagag attggaatgg acctcaattc 1620 cattgttttg gtgactctca tagtagaggt gtgtccatac tttgtaaaaa gaagattgat 1680 atgaatgtca cagatgtaat caaaggaaat gatggtagaa aactattgat taagttaaaa 1740 gatcttacaa ttcttaatgt ttatgtacca actgaaaaga aggagcaaca aaccttcctt 1800 acctctctaa gggattggat taatgacaac aatgtcgata atctgattgc tggaggggac 1860 tttaatatgg taaataacac agctatggat aggaatagta aacaatctct ggcgcctagc 1920 gcaggagaaa aagaactaat tagatttaaa agacagctaa aactcgtgga tatctggcgg 1980 gagcgccacc cccaggtgag gcagttcacc tggaggcgga agaacccaga cttaatttca 2040 agtagaatag acttttggct cattcagaag tcaattattg attctgttgt gaaatgtgac 2100 atacgctgtt ctataaaaac tgatcatcaa gcgatatttt taaagttatt actgacagac 2160 tttgccagag gacccggagt gtggcgtttc aatacttcac acctttcaga tgattgttat 2220 gttaatggtg ttaagtccac tcttattgat gtcgttaaag agtttaaaga caataatgta 2280 tcgaaacaaa tgctctggga aatctgcaaa aataagatta aggactttac ggataagttc 2340 tctagacaaa aagcattcca actgaataac gaccttgtgc gcttggaaac tcagttaaca 2400 gaacttgata agaaaataga tgagacgtta gataaagatg ttcttgatgt atatgcacag 2460 gttaagtcaa aagtagaaac tatatatgaa tataaagcga aaggggcaca agtaagatct 2520 agagaacaat ggatcgaaaa aggggaaaga tccaacaaat atttcttatc tcttgagagc 2580 tcccgttcaa aacaaaacac cattactaga ttaaagcatg agaacaaaga ggaaacaaat 2640 ccgtttaaaa ttttagattt ggaatcagct ttctatgaaa atctttattg ttgtgataat 2700 cctgctactg acaaacaaat cgacacgttt cttgagtcgg ttgatgtacc ctctctttcg 2760 gatgaggaca gacttctgct tgatgatcct ataacaaaaa acgaatgttt ttcagctttg 2820 aaatacatga agcaaaataa agctcccgga actgacggct taccagccga attctacctg 2880 aaattctgga aagaaatagg agacatggtc ttcgaatcct tgaaagaagg gatagaaaat 2940 aaacgccttt cagtaacaca gaagagagcc ttattaaaac ttatttataa aaagggtgat 3000 aagactgact tatccaactg gagaccaata agtttgctga atactgattt taagatatta 3060 gcagtcgtcc tagcaaaacg actgcaggac tttcttccaa aattgatttc tgaggatcaa 3120 accgcctaca ttaagtccag atttattgga caaaatgtta gacttataca ggacgttatt 3180 tattactcca atttttacaa catacctgct gcagtcgtct ggctagactt tgaaaaagcc 3240 tttgattcgg taaactggaa ttttatgctt agaaccttag acaagtttgg gtttggaccc 3300 acatttacct ccatagttaa gaccatgtat actgacatag agagttgtat cataaatgct 3360 ggttggcaat ctgcttattt caaactgcaa agaggaatca gacagggatg tcctctgtca 3420 gccctactgt tcaacctggt agttgaaatt cttgctacta aaattcgaca aacgcgcaca 3480 gtcaagggta ttaatgtcga gattaatgat aatatgtttg aactaaagat aagtcaagtt 3540 gcagatgata caactcttac tctttgtgga gatgattcta taagaaatgc cttacagatt 3600 gtagatgact ttagtaatgt ggctggacct actctgaatc gtagaaagtg tgaagctatg 3660 tggattgggg ccaatcgagg acgaagagac actcccttcg gaatcgactg gccaaataga 3720 cccattaaat cccttggaat ttactatggt ctgtcagagg aagagtgtga gactttgaat 3780 tgggaaaaaa gaatagataa cttcgaaaat atgcttctaa aatggcttag tcgcaaatta 3840 accctttatg gtaaagtact gattataaaa acattagcta tctctcaatt aatttataat 3900 atgtccatga tctcaacacc tgactgggcc ttgaaacgta ttgaaaaagc atgtttctct 3960 tttctatgga accgcaaacc ggacaaaata aaaagaaaag tgctgtactc aaattaccaa 4020 gaaggagggt taaaaatggt tgatatatat tcaaaacgca gtagcctaca ggctgtgtgg 4080 gttaaaagac taatgcagtc ggcactcgat gacactaaag agcgccactg gcatcttata 4140 ccattacact ttattaatca tttcggaaaa gatttccttc tcttccgctt caatttttgt 4200 aaaggaggag atgctccttc tcttagagtg ctcccaccct tctatagaca ggtagtcttg 4260 acctggcacg aacttggagg gggaaaaaca gactttgtaa gacaaagaca agaaataatt 4320 tggtgtaata aaaactacaa gcttaataac agcacattgt tctttgaaca ctggattagg 4380 agcggattga tatatgtaaa tgacttatgt gaaaactctg aatatgtagc atctgatata 4440 ttgttaagaa agctaagtga caaaagaaac tggatgagtg aatttatgca ggtcaaaaga 4500 gctataggct atagctggac tgatgaaaac gattgtactg gtcgtgaaac attatatggc 4560 acttgtgtat ggttcaattc cagcatgatt tcacttgata cacttaacag tagatatttg 4620 tattctgtta gtctcatgga aaatgttgta agacctactg cccaagcata ttgggattct 4680 gtaaccccta atcttgattg gaaagctatc tgggagtccc agacatgtta tatcagatac 4740 aacaagctcc gcgaatttaa tttcaaattg ctccataaaa tacttccttg cagatacctc 4800 ctaaacaagt ggaaaataag agatacgtct ggaaatgtat acagtccttt ttgtgatatc 4860 tgtaataagc tggaaaacta tgaacacatg tttttagaat gcactagact aaggaaattt 4920 tggggtagat taaataatac tgaagacaat caatctgtta aaaatgtact gttggaacat 4980 aacaaatgta acattacatt gttgactatc gccaagtttt caatttatat ttcttggata 5040 cgatggaaag ataacccgag acaattgcta ctcgatgatg tacacaacac gtttatgtcc 5100 ctgtgctccc tgtacagtgc tgagtaaata aaataaagtg ccgtggcgga cacggaaaag 5160 ttaaaaaaaa aaaaaaaa 5178 // ID hAT-63_HM repbase; DNA; INV; 3257 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-63_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3257 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2051-2051 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(603..1676,1752..1973,1977..2669) FT /product="hAT-63_HM_1p" FT /translation="MFPDSNIAKSYEMSETKLGYIINFGNGPYFYRLLTEN FT IKKSPYYTLSLDESLNDSFKNVQLDLNIQFWSSNLNHVESRYFDSKFLGHP FT TAKNLLKSMTQLSMEGTNVNCSLLNLLELQHDQQELPKLLNVGSCNLHLIH FT GFKSGFQSAEWNIKKLMKASYNLFYDSSARRADYVTVTKCESFPCAFCSTR FT WVVDQKVADRLCLIWENVKKNNQPSIGRLFLKVSNQNERAMTSVWGRVVNK FT AKIIKTKAKAAHYKSKAKIKGXKAQGQTEGQEFKVEAKTKSFKANVYANIF FT KTKYFNFCLALRPIINTTVSSKWCDNKTTFFKNRPRVWAELNKINKKIGFF FT IGQDQNFQGQDQVEGQLWFXGTKDKLILTKLKFFGYIASFLQKILLRYQTN FT AQMIPFLFEDLFSTVKSLMGLIIKQDLLSSIVNGRKLQYDFTKKENFLLLK FT LMIIGFAAEHDIIELRKKDALSQDVINRFIKDCCLIIIGILEKIIERSLNQ FT FSFLEATACISPVSNLSKQKADLLHQMKSVLHLVLSCSHITSNECDQAFIQ FT FLKLLEESSSTXLLVFKYFRTGNCLNKFHFETNKVGVDYKVLSKVLIFIFT FT SCHGQASIERGFSITSNMLSEKMKGMSLISRRTIKDFMVSNKLKAHEIEIN FT NEMINGVCSASLQ*" XX SQ Sequence 3257 BP; 1190 A; 466 C; 581 G; 1015 T; 5 other; cagggttgcc accaaatcag ggaaatcagg gaattcaggg aaaatcaggg aaatttagat 60 acaatcaggg aaaatcaggg ggaaatgatt gatttaggaa aaaatcaggg aaaatcaggg 120 aactttttaa tttttcttta atgttttgaw attttttcta ataaaaatct tacttttcaa 180 gggtaccagg gttaatgttt aacattttac caatttaagg gtatcgtggt taatgtttaa 240 catttaacaa gaaayttaat catatcgtgt ttacattaat tcaatgtttg caactcaatc 300 ctttgtttga aaaatggctg gtaataggca gtgtaaaaaa gtctgccaag tgcaaaatat 360 gaaaagttga atttaatttg tcaaatatgc agaagagcta tgataagtca catggatggc 420 aaaaaacatg cacaaaagtc aaagccagta tcttgcctgc acagtctaca tctaaaaagc 480 aagcacttca aacagtagat taatgtgata aaagttgtac agaaatctac tctgttttaa 540 atgcaataca gaataatttc tcattaaact catgtaatga aaatggccta ctttatcaga 600 aaatgttccc tgatagcaat attgctaaat cttatgaaat gagtgaaaca aaacttggtt 660 atataattaa ttttggtaat gggccttatt tttacagact tttaactgaa aatataaaaa 720 aatctcctta ttatactctt tcacttgatg aaagcctgaa cgattctttt aaaaatgtcc 780 aattggatct aaatattcag ttttggagca gcaatctaaa tcatgttgaa tccagatact 840 ttgactctaa atttttaggt caccccacag caaagaattt acttaaaagt atgactcaac 900 taagtatgga aggtacaaat gtgaactgtt cccttcttaa tcttcttgaa ttgcagcatg 960 atcaacaaga gctacctaaa cttcttaacg ttggaagctg taaccttcat ctaatacatg 1020 gtttcaaatc gggttttcaa tctgctgaat ggaatataaa gaagttaatg aaagcatcat 1080 acaacctttt ttatgattct tcggctagaa gagctgatta tgtaactgtt actaaatgtg 1140 agagttttcc ttgtgcattt tgctcaacac gctgggttgt agaccaaaaa gttgcagata 1200 gactttgttt aatttgggag aatgttaaaa aaaataacca accaagtatt gggagactct 1260 ttctaaaagt aagcaaccaa aatgaaagag ctatgacaag tgtttggggc agagttgtta 1320 acaaggccaa aattatcaag accaaggcca aggccgctca ttataagtcc aaggccaaga 1380 tcaaaggtta maaggcacaa ggacaaaccg aaggccaaga gtttaaggtc gaggccaaaa 1440 ctaagagttt caaggccaac gtctacgcaa acattttcaa gactaaatac tttaactttt 1500 gccttgcgtt aaggccaatt attaacacaa ctgtaagtag caagtggtgt gacaataaaa 1560 ccacattttt taaaaataga ccaagggtat gggcagaatt aaacaaaatc aataagaaaa 1620 ttggtttttt tataggccaa gaccaaaatt ttcaaggtca agaccaagtc gaaggctaac 1680 agtttcaaag ccaagattaa ggccaaggcc tcaatttttg gtcttaaggc aaggtcaagg 1740 tcaaggacta acaactctgg ttcgrgggaa caaaagataa gttaattttg actaaattga 1800 agttttttgg ttacattgca agttttttgc agaaaatctt actgagatat caaaccaatg 1860 cccaaatgat tccctttctt tttgaagatt tgttctcaac tgttaaaagt ttaatgggtt 1920 taattataaa acaagatctc cttagcagca tcgttaatgg caggaaactt caataatatg 1980 attttacaaa aaaggaaaac tttctgctat taaaattaat gataattgga tttgctgcag 2040 aacatgacat tatcgaattg aggaaaaagg atgcattatc tcaggatgtc atcaatagat 2100 tcataaaaga ttgctgtctc attatcattg gaattctgga aaaaataata gaaagaagtt 2160 tgaatcaatt ttcttttctg gaagcaactg cttgcatcag tccagtgtca aatctttcta 2220 aacaaaaagc tgatcttctt catcagatga aatctgtact acatctagtt ctttcttgct 2280 cacatattac ttcaaatgaa tgtgatcaag ctttcataca atttttaaag ttgttagagg 2340 aaagttcatc tactytttta cttgtattca aatattttcg cactggaaat tgtttaaaca 2400 agtttcattt tgaaacgaat aaagttggtg tagattataa ggttttgtca aaagttttga 2460 tatttatttt tacgtcatgt catgggcaag catcaattga gagaggtttc agtattacca 2520 gtaatatgct cagtgaaaaa atgaaaggaa tgtcacttat atctcgtcgc actatcaagg 2580 attttatggt gtcaaacaaa ttaaaggctc acgaaataga aataaacaat gaaatgataa 2640 atggtgtttg tagtgcctca ttacaataaa aaaactattt ggaggaaaat agaaaactcc 2700 aagcaaaaga aaatagcgtg ataatgcttt tagatcagga aatattggga cttaaagaaa 2760 agcaaaaagt attaaacgat gtatgtaaaa catacaatta gaaatttatt acattcatta 2820 catttttttg aatttcatta tcttgtatcc tagataagtc tgacaggttt tcaatgcctg 2880 ttgaacatat ccattttaaa tataccctgg ttcaaagaaa aatttcatat gttgattgaa 2940 gtatttgatt atttatttta atttttaagc aaattgattg gttaattact tattggggga 3000 ttatagtatc tgttttttta gaggtaaaaa tgctctgggg ttattaggga agttttaatg 3060 ctaagtttgg gcatttaaat gctggaaata tcaggtggaa ataagcaaat ttcaagggga 3120 aaaatctaga aaaaagggag aaaaaagcaa aattttagca aaaaatcagg gaaaatcagg 3180 gagaaaaaag caaatttttg gcaaaaaaat cagggaaaat cagggaattt tgttcctaaa 3240 taacggtggc aaccctg 3257 // ID Gypsy8-LTR_AP repbase; DNA; INV; 130 BP. XX AC Contig23376; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8AP; KW Gypsy8-I_AP; Gypsy8-LTR_AP. XX NM Gypsy8-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-130 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 452-452 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 130 BP; 40 A; 17 C; 13 G; 60 T; 0 other; tgttgtagtt gtcaacactg tcgttattat tattattatt attattttat cattatgtac 60 agcaaactgc tttttcatgg ttatgtatct ttcattatac tcttaatatt tcaagttaaa 120 ataaaacaca 130 // ID Copia13-NVi_I repbase; DNA; INV; 7069 BP. XX AC AAZX01023252; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia13-NV; KW Copia13-NVi_LTR; internal portion; Copia13-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7069 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1159-1159 (2007). XX DR Genome; AAZX01023252; Positions 8226 1158. XX CC Positions [4312-4821] - Integrase core CC 'TATT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2104..6987 FT /product="Copia13-NV_I_1p" FT /translation="MNQNFERFTTKISGDVQQLMRSVDSLKLRGQDGQAVK FT DWQEKPTLSANNKPSIVSCTRPAEPIKLHPRGRGRGFRPANRNPLTLLAPG FT PRELGPAEPRCSSTLVADSYEWNEKNEVRREQSPYKDWRVESAERLGIALD FT DLNVSRRFSDQDLTQLNKLNLFKSMNSGNTSINDAKLKQINITRMYKLKLD FT SEFAPFEDFLFSELKSRKLFYVFEKNNQSLICPDELEHDKNIIREIILNRI FT DDKYHLYTMNIKDPVELLDKIKEIKKNELRINAHTAQKRLFNLKLNKDKET FT AFNFCLRFEDLVRKYESNEGIEKMSDTAKKSLLFNAVEQAYPEIITAENVA FT SHGTGRKYTYNELKDLIFQIDNSKQRTPEKSALLGFAEERNRSSDERSKSR FT ERPNKRCERCGSTAHLSKDCKHDEPKCFNCNKFGHIAVDCSEPRKEPPRKR FT ATDRNRSISPKRKQRREHSRSRSKSPFRRPGTPAKGTTNTDSVCMNVSLNE FT WDDETQKFIVDSGATDHMTNSKLILTQLHEKTERVIGCANKDKRSNIIATN FT QGQVKFELSYGNVIKLDNVVYAENLRKNLLSLKRFTDQGLCVYLDNKTINV FT YEPNTREKILEGKFENPFWIIDLKPKLDNIQKMPSGALIAHESNEKIVKKK FT VTFADEKSVIDSANQRKENGDESTVINTINLWHIRLGHTALNKVKLLKGLY FT PDIKQFRENDHGVCLKDCEICKVAKMNRPKFDNNKEMAKKPLQRVSADTMG FT PITPATHPKGCRFIVVLVDNYSRFAMAYPVKQKSDAAECIDDFVVKCRNIC FT GRDEKFCYLDCDQGTEFTSGKTQKVLDKYGAELRTICPYTPQLNGIAERYN FT QTIQKITRALMYDSRLPANAWDLAVKAAAYLYNLTPHKSIGLKSPLLMINP FT ESKAYINQIKRFGCVAFTKVQGQQETKFDKLSNKTVLVGYTDTGYLLLNPE FT DGKVYESRNVQFIESKVFGDIYKPEDIKNWTITNDTLNKETWLFEFDEESP FT DQPEVVIKAPKKKGRPRKENQKTAMKPISKINHRPVTRKQTQSKKIVSAEK FT AQGETEGLNIYDPDALMTLGTSESEKMQDKIEGLSEYELYTLLTGINGDPK FT SYREAVSSPEADRWQEAINKEREALQKNRVYEVVSRTSCDGKKLNILDSRW FT VFKRKDSGTKYPDYKARIVIRGFKDSNLYDLCETYAPVSRLALVRAVLTTA FT NKHDYVLWQLDVKAAFLHSPIKKDIFLEIPDGFEEYENQKQTKVWKLHKSL FT YGLKTSPKNWNEYFSEFAKQLKFQANSRDPCLFIYAEEKSRIIMLLYVDDI FT LLTGNDEPKMREIQKELSKKFDMKFLGEPKEFLGITITRNSKERTIKLDQI FT KFINKMQTKFGYAQAKGQSTPMVTNQVLNRQRKQREEESTNDKAINQRGYR FT EVVGSLMYLMSGTRPDLAYAVNMLSRHQQNPTEHDWKMVTRVFKYIAYSKN FT WQLTYKGLKDGMETFSDASFADCKDSLTTSGHVVTLYGDAIAWRTHKQSYV FT ALSTCQAEYVAMGDACRESISISLSLKVIPNMTYFPIILWCDNKAAVACTK FT MDGSNKLRHMAGVYEDYVKQCVKLHYVKIEWIESKEQRADIFTKPLPSDSH FT NKLSRLLLGN" XX SQ Sequence 7069 BP; 2538 A; 1432 C; 1602 G; 1497 T; 0 other; ggttatgggc ccagcacgct tccactgcgc gaattagtga aaagtgacac gaaaaaaaaa 60 aaaatagaag aaaagccaag agaagtcgag acgagttgac acctgtacaa agacacgaga 120 acgacctaag ctgagaaaca aaggcaaaag tcacgatggc cggaaaatcc tacaaatggg 180 caaacccgca aactgcacgt aagtacgcat aaaaataaaa acgtgcttat gaaactcgtg 240 aagtaaaatc agcggtacgc aagtgtataa gaaaagtgaa gtgaaaaaat aaaaatcgcg 300 agataaaaaa aaaccgcatt agaaaagaaa aggcgcgaaa atcgcggaaa agagaaaggc 360 gcgaatcagt gttgcatcag ttcggcttgt cagaactcga aaatccgaac gcgcttaaac 420 aaagaaagtc gaccaaaaca aaaacgcatt tccgagaagt tcaaacgaac agaaacgcgc 480 ggtaaaaaaa ttcccggtac gaagcaacag gcgaaagaaa ggggggttgc taggcaaccg 540 agccaacatg gccgcctcat ctctctcctg cacgtaagct atacggaacg gagggagagc 600 cgccgttgcc gcgctctcgg ctcacgacga cgacgcgcaa gttcaggcgc acgcagaaaa 660 agacgacgct agtttaacct ggctcgtaat aagttaacac gcgcgtgcat gaaaacggag 720 ggagagttgc ggtcattaca accacacgta gtggaaacga tacacggacg cgcacgcata 780 catacgtgca caaaaagttc gcgctgagtc aactcaaacg caaagcgaag tcaatacaag 840 attgtgcgca agcgcacacg aattcacgtg accactcact tgttgcgcgg caatttttga 900 atttcttaaa cgcacaccta tgcatctagt gcaagtctgc ctgcatgctg cactgtttac 960 agacaccgcg caggtggggc tgctgatgca tgggtaaact aagcaaccat ttaacgtatt 1020 aaactagcag aaaatcatct ataaagaaat ttaacgcgat aaacgcgaac gtcggctttc 1080 gcatctagtg taagtctgcc tgcgagctgc actgtttaca gacaccgcgc aggtggggct 1140 gctggtgcgc gagctcactc tgcaataaat taagcgaaga attaattcaa cacaacgact 1200 ctggtttatc aaaataaatt aagtttgaat attaattaca aaatctttcg acagcacgta 1260 gcacaagtac tgactgggat gacatacgcg ccaagattct ggagtactac gacctgaagg 1320 acaaagaacc aaatgagagt ttctatcttc aactctacgg cggaggcgtt tggttaatga 1380 attctaggga cgaaacactc caactaggta atgaagtaaa gtcataaata acaatattta 1440 tatgtaagtc tacacttgta cggaaagttt gacgggctac tagccaacgt aacgtacgac 1500 acgaagtctg tgtggatctg tgagaggttt aacacgaacc cacgaatgtg acttggtacg 1560 tgcgcacgta cagatactta gtcagtgtcc cacgctcaaa aactagaccc tcacacagat 1620 ctgccaaacc tgctctcgtg cttagtgggt ctcggactgt gggagccgtg cttaggcaca 1680 cattccgcaa aaagagacta agtcggtttg ggaactttgt cgaaaagtga ctggaccgac 1740 agcaaggtgg cccgatactt gtgcacaaag tatagggagc tttgcagggg tatcctctca 1800 accgcagact aacgtttttt gtatatttac agacatgaac atcctcagtt tggaagaagc 1860 agccagactc ctgggaagtg ttgaaagcta cagaggatcg agtggtaggg ttgagaggac 1920 ccgtaagcca ctacagcgac gcagggagag caggagccac tacgtccacg cccagggatg 1980 acgaccagac atacagcatg aatgagatca ccaagctgtt gcgtgagagc accctggctc 2040 aagtcaagct cagcgaaagg ctggtagaag gaatgaagtc ccctcgtgga ggagaatgct 2100 aagatgaacc aaaacttcga aaggttcaca accaagatca gcggagatgt acagcaactg 2160 atgaggtccg ttgatagtct gaagctcaga gggcaggatg gacaggcagt aaaagattgg 2220 caggagaagc caactctttc agccaacaac aagccaagca tcgtcagctg tacacgcccg 2280 gcagaaccga tcaagctgca tccacgagga cgcggacgtg gctttcgacc agccaaccgc 2340 aatccgctca cgctcttggc tcctggacca agggagctcg gacctgctga acccaggtgc 2400 tcgtccacac tagtggccga ctcgtacgag tggaacgaga aaaatgaagt aaggagagaa 2460 cagtcgccat ataaagattg gcgcgttgaa tccgccgaac gtcttggaat cgcgctcgac 2520 gatttgaatg taagtagacg tttttctgat caagacttaa cgcaactgaa taagctgaat 2580 ttgttcaagt caatgaactc aggtaacacg agtataaatg acgcaaaatt gaaacagatt 2640 aacatcacgc gaatgtacaa gttaaaatta gattcggaat ttgccccctt tgaggatttc 2700 ttattctcag agctaaagtc gcgaaaactg ttctacgtct tcgaaaagaa caatcaaagt 2760 ctgatctgtc cagatgaact cgaacacgat aagaatatca tacgcgagat aattctgaac 2820 agaattgatg ataagtatca cctatacacg atgaacataa aagatccagt tgaattactg 2880 gataagatta aagaaattaa aaagaatgaa ttgagaatta atgctcatac agcccagaaa 2940 cgactcttca acttaaagct caacaaagat aaagaaacgg cctttaactt ctgcttgagg 3000 tttgaagacc tcgtaagaaa gtatgaaagt aatgaaggca tagaaaaaat gagcgatacg 3060 gcaaagaaaa gtctactgtt caacgcagta gagcaagcct acccagagat cattacggca 3120 gagaacgttg caagccatgg tacgggtagg aaatatactt acaatgaact aaaagattta 3180 attttccaga ttgacaactc gaagcagaga accccagaaa aatcagccct gctcggattc 3240 gcagaggaaa gaaaccgttc ctctgacgaa cgatccaaaa gtagggaaag gcccaacaaa 3300 cggtgcgagc gatgtggtag cacggcgcac ttgtccaagg actgcaagca cgacgagcct 3360 aagtgcttta actgcaataa gttcgggcac atcgccgtag actgcagtga accacggaaa 3420 gagccaccac gaaagagagc gactgatcgc aacagaagta tctctccaaa gagaaagcag 3480 aggagagagc atagcagatc tcgatcaaag tcacctttca gacgacctgg cactcctgca 3540 aaaggtacga ccaatactga ttcagtatgc atgaatgtga gtctgaatga atgggacgac 3600 gagacgcaaa aatttatcgt agattcaggc gcgacagatc atatgacaaa ttctaaactc 3660 atcttgactc agctgcacga aaagactgag cgcgtaatag gctgtgccaa taaagacaaa 3720 cgatctaaca taatcgcaac caatcaaggc caagttaaat tcgaactatc ttacggtaac 3780 gtaataaaac tagataatgt tgtatacgca gaaaatttga gaaaaaacct gctgtcactc 3840 aaacgcttta cagatcaggg cctgtgcgtg tatttagata acaaaacgat aaacgtttac 3900 gaaccaaata cgagagaaaa gattctagaa ggaaagtttg agaatccatt ctggataatc 3960 gatttaaagc caaaattaga taacatacaa aagatgccct cgggtgcgct aattgcccat 4020 gaaagcaatg aaaagattgt taagaaaaag gtgacattcg ctgatgaaaa gtcagtcata 4080 gattcagcaa atcaaaggaa agaaaatggt gacgaatcta ccgtaataaa taccattaat 4140 ttatggcaca ttagattagg acatacggct ctgaataaag taaagcttct gaaaggatta 4200 taccccgaca ttaaacaatt ccgagaaaat gatcatggtg tgtgtttgaa agattgcgaa 4260 atatgcaaag tagcgaaaat gaatagaccg aaattcgata ataacaaaga gatggccaag 4320 aaaccattgc aacgggtcag tgctgacact atgggaccga ttacaccggc tactcatccg 4380 aaaggctgta ggttcatagt tgtattagta gataattatt cacgctttgc catggcctac 4440 ccagttaagc aaaagtcgga cgccgccgaa tgcatagatg actttgtcgt gaaatgtaga 4500 aatatatgcg gaagagatga aaaattctgc tacctagact gtgatcaagg aacagaattt 4560 accagcggaa agacacagaa ggtattggat aaatatggtg ctgaactcag aacaatttgt 4620 ccatacactc cacagttaaa cggcatagca gagcgctaca atcaaaccat acagaaaatc 4680 accagggccc taatgtacga ttctagacta ccagcaaatg cctgggatct ggctgtaaaa 4740 gcagcggcct acctgtataa cttgacgcct cacaagtcca ttggtcttaa atcacctttg 4800 ctaatgatca acccagaatc aaaagcatac ataaatcaga ttaagagatt tggttgtgta 4860 gcttttacca aggtccaagg acaacaggaa actaagttcg ataagctctc gaataaaaca 4920 gtgttagtag gatatactga cacaggatac ttgttactaa acccagaaga tggtaaagta 4980 tacgaaagca ggaacgtgca attcatcgaa agcaaagtat ttggagatat ttacaagccc 5040 gaagatataa agaattggac aataacaaac gatactctga ataaagaaac ctggcttttt 5100 gaattcgacg aagaaagccc agatcaaccg gaagtggtga ttaaggcacc caagaaaaag 5160 ggtagaccac gtaaagaaaa tcagaaaaca gccatgaaac ctatttcaaa aataaatcac 5220 agaccagtga ctcggaagca gactcaaagt aagaaaattg tatcagctga aaaagcacag 5280 ggtgaaacag aaggtctaaa catatacgac ccagatgcac tgatgacgct tggtacgtca 5340 gaatcagaaa aaatgcagga taaaattgaa ggcctaagtg aatatgaatt gtatacacta 5400 ctgacaggta ttaatggaga tccaaaatct taccgagaag ctgtatcatc accagaagca 5460 gatagatggc aagaagccat aaacaaagag agagaagcac ttcagaaaaa ccgagtatat 5520 gaagtagtca gtagaactag ctgcgacggt aaaaagttaa acatcctgga ttcgagatgg 5580 gtattcaaac gtaaagacag tggaacaaaa taccccgact acaaagcaag aattgtcata 5640 cgaggtttca aggattcaaa tttgtatgat ctttgtgaaa cgtacgcacc agtgtcaaga 5700 cttgcactag taagagctgt actgactact gctaataaac acgactatgt gctgtggcaa 5760 ctggacgtaa aagcagcgtt tcttcacagc cctataaaaa aggatatatt tctagaaata 5820 ccagacggtt ttgaagaata cgaaaatcag aaacaaacga aagtttggaa gcttcacaaa 5880 tctctttatg gcttgaaaac tagtccaaag aattggaatg aatacttttc agaattcgct 5940 aaacaactaa aattccaagc gaacagtaga gatccatgcc tgttcatata cgccgaagaa 6000 aaaagccgca taatcatgct gctgtatgtt gacgacatat tgctgacagg caatgacgag 6060 ccaaagatga gagaaataca gaaagaactg agtaaaaagt ttgatatgaa attcctgggt 6120 gaacctaaag agttcttggg aatcactata acaagaaaca gtaaagaaag gaccataaaa 6180 ctggaccaaa ttaaattcat aaataagatg caaacgaaat ttggatatgc acaagctaaa 6240 ggtcaatcaa cccccatggt aacaaatcaa gtgctgaaca gacaacgaaa acaacgagag 6300 gaagaatcaa caaatgacaa agctataaac cagcgtggat acagagaagt ggttggatct 6360 ctcatgtact tgatgtccgg caccaggcca gatttggcat acgcagtcaa tatgttaagc 6420 agacaccaac aaaatccgac tgaacacgac tggaaaatgg taacaagagt attcaagtac 6480 attgcttact caaagaattg gcagctgacc tacaaaggcc tgaaggacgg aatggaaaca 6540 ttctcagatg caagttttgc agattgtaaa gactcgctca cgacaagcgg acatgtagtt 6600 acgctgtacg gagatgcaat agcctggaga acccacaagc aaagctacgt ggctttatca 6660 acttgtcaag ctgaatacgt agcaatgggt gatgcttgta gggaatcaat tagcataagt 6720 ttatctctga aagtgattcc aaacatgaca tactttccaa tcatcttatg gtgtgataat 6780 aaagcagcag tcgcttgcac aaagatggat ggaagtaata agttgagaca catggctgga 6840 gtgtatgagg actatgttaa gcagtgtgtc aagttacatt atgtaaaaat agaatggata 6900 gaatcaaagg agcaaagagc ggatatcttt accaagcctt tgccctctga tagccataac 6960 aaattatcca gattattgct aggaaattaa cagatctttc atgttacaga ataactaagg 7020 gtcagcctgg aaacgatgtc gaggacaatc gggagggagt gttggaata 7069 // ID CR1-1_BF repbase; DNA; INV; 4878 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-1_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4878 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4878 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1628-1628 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1311..4775 FT /product="CR1-1_BF_2p" FT /note="APE and RT domains." FT /translation="MELHYTCIFLALCLMIGKTQPGSQPKDKQEMKEVHVT FT CTFRTDCTSDWATFAFMIKHSGVNGTPFSHKKTMIHNDSTKLSLIMILLLS FT GDIEINPGPRPPKCPCGSCNKAVQNKHAAICCDDCNTWYHKDCIELGSQVY FT TVLATHRSYSWICCDCGIPNFASDMFDITDDSFHSVNSFDTLNSSTSLPGD FT TPTATSTPLRSKQGKTHTRGNGHKKGSKAKLLLVNCQSVRNKTAELATVID FT TYKPDILAGTESWLNENISSSEIFPDSYIVHRKDRQTGPGGGVFQANRDDL FT IVTHRTDLDTDCEIIWTQTQLAGKKPLMIGTYYRPPSDQGSSLDELDSSIN FT KMGTKINSDNVIIFGDFNTPGINWDVSTTDNAQNHTGQAQKLLHLVDNHGL FT FQTVQEPTRNGNILDLVLVNNPNIIEKTTVVPGISDHDMVLVDVNLAPKQN FT KKPKRKVYIRTKSDEPAIKNDLEDYATNFNQRTKDMSVTQKWDDFKGKMKH FT TMNKHIPSKMTSSRYNLPWFNRNLRRHCRKKQRLYNKAKRTGQEEDWNKYK FT KVKKGVQKSIRSAHSKYVANILGEAITDKPKTFWSYIKGLRRDLVGVATLK FT VGSTIISDSEKKAEALSSQFKSVFTEEDTTDMPSLGNPCTTPMEHIDISTY FT GVEKMLQGLNPSKASGPDQIPPWFLKMTASEIAPVLTNIFQQSLDTGEIPQ FT DWRDANICAIFKKGDRAVPSNYRPVSLTCISCKLLEHIIHSHIMKHLEGHN FT ILTDYQHGFRAKRSTETQLILTVHDIAGALNNKRQVDLAILDFTKAFDKVP FT HSRLITKLEYYGIQGSTLNWLKAFLTNREQTVVVEGKASAPVKVASGVPQG FT TVLGPLLFLLYINDLPDQLDSSVRLFADDCLLYIEVSAKNDSQLLQKDLNT FT LEEWQRKWLMQFNPDKCYIMHITNKRTPNVSPYQFCGQTLATTKSHPYLGV FT TLTTGLKWGVHVNKITTKAKQTLGVIKRNLWACPARVKSLAYTSLVRPNLE FT YAATVWDPYTKKDKDKLERVQNQAARFCTNNYNRDASVTKMKTDLQWTSLE FT DRRKMSRLCMMYKMTNKLVDVPTDKYLIPAQKRTRNSHDFKYQSFQARIDV FT FKNSYFPRTIVDWNLLSPSTVGASSLDSFKERLQLDVQKLGVTDRSV" XX SQ Sequence 4878 BP; 1524 A; 1004 C; 1147 G; 1203 T; 0 other; gcgcaggcgc ggtggcgagc agcagcgact tacgagagct ctgagaattt tcctgaattt 60 caccgtattt tgtaccagtt actatttata attcttctca gtttatagta taggcaatac 120 tcgcaaatac cgggaacctg taataccggt aatagtaaaa ttttcggtat ctaaatcaaa 180 ttccggacag gtttgaaaca atagcttata gagtgggagg gctgcatgcc gcggtaagta 240 cactgaggga aggggaaccc gtttatatac tgatttcgtt gatatctggg actgcttatg 300 attgcacagt ttacttagta actcggacca cttatacttt ttgtactgat atatttcttt 360 cgccttgtaa cggctaaatc ttgtggaatt tgcaagaatc tgaacatcgt ttaccgcctg 420 atttgcatgt gaattaccga cagagtggga gggcgcgcaa actgtttgtt acttgaactc 480 actttgatag ctcattttga ggtttgaatt acatatatct ggttaaccac ttacaaattt 540 tgtactgata tatctgctcc acattgtaat ttgttaactc ttgtgaaatt tgtgagagtt 600 acatcgcgtc caacctgatt tacatgtaaa ttacgtacag ggtgtgaggg cacgcaagct 660 gtttttgatc tgaaatcacg atttgattta cgtatcttgt gtcgaatcgt cggtcaactt 720 tgtgtattaa ttgcttgcaa caatatttat gacgtcctac agttctattt gtgtataacg 780 gaagcacgtg tagggtctct acttgtctgt gtggcgggaa cctgtgtggt aaggtcgcct 840 gtgtggcggg tacctgtgtg gtaaggtcgc ctgtgaggcg ggtacctgtg tggtaaggtc 900 gcctgtgtgg cgggaacctg tgtggtaagg tcgcctgtgt ggtgggaacc tgtgtggtaa 960 ggtcgcctgt gtggcgggaa cctgtgtggt aacgtaccct gtgtggcggg aacctgtgtg 1020 gtaacgaagc ctgtgtggcg ggaacctgtg tggtattacg tagcctgtgt ggcgggaacc 1080 tgtgtggtaa cgtagcctgt gtggcgggag cctgtgtggt aaggtcgcct gtgtggaggg 1140 aacctgtgtg gtaaggtcgc ctgtgtggag ggaacctgtg tggtaaggtc gcctgggtgg 1200 cgggaacctg tgtggtaggg tcgcctgggt ggcgggaacc tgtgtagtaa ggtcgcctgt 1260 gtggcgggaa cctgtgtttg tagtagaaac gcatgtgctg ggattagaac atggaactac 1320 attatacttg catattcctg gctctatgcc tgatgattgg caagactcag ccagggtcac 1380 agccaaaaga taaacaggag atgaaagaag tacatgttac atgcacattc agaactgact 1440 gcacctctga ctgggccacg tttgccttca tgattaaaca tagtggggtg aatgggactc 1500 ctttcagtca caagaaaaca atgattcaca atgatagtac aaagttgtca ctgattatga 1560 tcttgctcct aagtggagat attgagatca acccagggcc aagaccacct aagtgtccct 1620 gtgggagctg taacaaagca gtacagaata agcacgctgc tatttgttgt gatgactgca 1680 atacgtggta tcataaggac tgtattgagc ttggctctca ggtatacact gtattagcaa 1740 cccaccgatc atactcatgg atatgctgtg actgcggcat accaaacttt gcatcagaca 1800 tgttcgacat cactgatgac agctttcatt cggtaaattc atttgatacg ctgaactcca 1860 gcacaagttt gccaggtgac acaccaacgg ccacatccac tccacttaga tcaaagcagg 1920 ggaaaactca cactagaggt aatggtcaca aaaaggggag caaggctaaa ctcctgctgg 1980 taaactgcca gagtgttaga aacaaaactg ctgaattggc aactgtgatt gacacttata 2040 aaccagacat attagcaggc acagagtcat ggctcaatga aaacatatca agcagcgaaa 2100 tattccctga cagctatata gtacaccgta aagacagaca aactggccca ggaggtggtg 2160 tatttcaggc aaacagggat gatctgattg tcacacacag aacagatctg gacacagatt 2220 gtgagatcat ttggacacaa acccaacttg caggaaagaa accgcttatg attggtacat 2280 actacagacc tccatcagat caaggcagta gtctggatga gcttgacagt tctataaata 2340 aaatgggaac taagataaac tccgacaacg taatcatttt tggagatttc aacacccctg 2400 gtataaattg ggatgtcagc acaacagaca acgctcagaa ccatacagga caggcacaga 2460 aactactgca cttagtggat aaccatgggc tattccaaac agttcaggaa ccaactagaa 2520 atggtaatat tttggacctg gtcttagtta ataacccaaa cataattgaa aagaccacag 2580 tagttccggg aataagtgac catgacatgg ttctagtgga cgtgaacttg gcacccaaac 2640 agaacaaaaa accaaaaagg aaagtgtaca ttcgcacgaa atctgacgag ccggctatca 2700 agaatgacct tgaggactat gctacaaact ttaaccagag aacaaaagac atgtcggtga 2760 cacagaaatg ggatgacttc aaaggaaaaa tgaagcacac aatgaacaag cacattccta 2820 gtaaaatgac atcaagcaga tacaacctgc cttggtttaa cagaaactta agaagacact 2880 gtcggaagaa acaacgtctt tacaataaag ctaagagaac agggcaagag gaagactgga 2940 acaagtacaa gaaagtaaaa aaaggagtac agaaaagcat aagatcagca cactcaaaat 3000 acgtggcgaa catacttgga gaagctataa ctgacaaacc caaaacattc tggtcataca 3060 taaaaggcct gaggagagac cttgttgggg tagccactct gaaggtgggg agcaccatta 3120 taagtgacag tgaaaagaaa gcagaagcac tcagttcaca gtttaaaagt gtattcacag 3180 aagaagacac cacagacatg ccatctcttg gaaacccctg caccactcct atggaacata 3240 tagacatttc tacgtatggt gtagaaaaaa tgttgcaagg ccttaatccg tccaaagcat 3300 ccgggcccga ccaaataccg ccttggttcc tcaaaatgac agccagtgaa atcgcacctg 3360 ttttgactaa cattttccaa caatcgcttg acacaggaga aatcccccaa gactggagag 3420 atgcaaacat atgcgccatt ttcaaaaagg gagatagagc ggttcccagt aactacagac 3480 ctgtttcact tacctgtatt tcctgtaaac tactcgaaca cattattcat agccatatca 3540 tgaaacactt ggaaggccac aatatactaa cggactacca gcatggtttc agagcaaagc 3600 gatcaacaga aacacagctt attttaacag ttcacgacat agctggggca ctgaacaata 3660 aaagacaggt ggatctagca atacttgatt tcactaaagc cttcgacaaa gtcccacata 3720 gcagactcat aacaaaactg gagtactatg gaattcaagg ttctacatta aactggttaa 3780 aggctttcct gacgaataga gagcaaacgg tagtggtaga agggaaggcc tcagctccag 3840 ttaaagtagc atcgggagtt ccgcaaggga ctgttctggg accattgctc ttcctcttgt 3900 acataaacga cttgccagac cagctcgatt caagtgtgag gttattcgcc gatgattgtc 3960 tgctatatat agaggtatct gcaaagaatg actcacaact tcttcagaaa gatctgaaca 4020 ccctggaaga atggcaaagg aaatggctta tgcagttcaa tcctgataag tgctatatca 4080 tgcacataac gaacaaacga acaccaaatg tgtcccctta ccagttttgt ggtcagacat 4140 tagcaaccac aaaaagccac ccatacctag gtgttacgtt aacaactggg ctgaagtggg 4200 gtgtccatgt taacaaaata acaaccaagg ctaaacagac attgggagtg atcaaacgta 4260 acttgtgggc atgcccagcc cgcgtaaaat cacttgcata cacgtcactg gtaagaccta 4320 accttgaata tgctgcaaca gtatgggatc cgtatacaaa gaaagacaaa gataaacttg 4380 aaagggtaca gaaccaagct gcccgattct gtacaaacaa ctacaacagg gatgctagtg 4440 ttaccaaaat gaaaacagat ctacagtgga catcacttga agacagaagg aaaatgtcca 4500 gactttgcat gatgtacaaa atgaccaata agctggtgga cgtaccgact gataagtatc 4560 taataccagc tcaaaaaaga acaagaaaca gccatgactt caagtaccag agtttccaag 4620 ctaggattga tgtgttcaaa aattcgtatt ttcccagaac tatcgtagat tggaatttgt 4680 tatcaccaag tacagtaggg gcatcttctc tggatagttt taaagaacgc ttgcagttag 4740 atgtgcaaaa gttaggtgtg acggatcgtt cagtgtaata taaccagctg ctgccgcgcc 4800 gcgtgcctgc gaagctggtg tgttacgccg aatggcggtt ataccggcta tatagataca 4860 gatacagata cagataca 4878 // ID TRE3D repbase; DNA; INV; 4708 BP. XX AC NW_633189; XX DT 11-DEC-2010 (Rel. 15.12, Created) DT 11-DEC-2010 (Rel. 15.12, Last updated, Version 2) XX DE Non-LTR retrotransposon from Dictyostelium discoideum. XX KW L1; Non-LTR Retrotransposon; Transposable Element; TRE3D. XX NM L1A_Mim; LTR6_MD; LTR86_MD. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-4708 RA Jurka J.; RT "TRE3D: Non-LTR retrotransposon from Dictyostelium discoideum."; RL Repbase Reports 6(6), 358-358 (2006). XX DR EMBL/GenBank/DDBJ; NW_633189; Positions 21850 17143. XX CC This is a recently inserted copy 99% identical to another copy on CC chromosome 1. XX FH Key Location/Qualifiers FT CDS 1094..4540 FT /product="TRE3D_1p" FT /translation="MEKLKILLWNCRGSSSPLSNKKAEETIKRIGPQITLL FT TETNGSTFNHYKSMFNYERIDHGNGKGTGVSIENRNLSTGYLTVNFKDTEG FT RILSVKFNSYININILLIYAPASIPRRNAFIIAAKNLIKDQKIKHDIIAGD FT FNADHNNNSHYGIGIRSIIEESNLKDIGANNNIATFSRSNSRLDRIYCKPT FT MVGSNPLRVHEDIYNKSDHTPISIDLKMNNNSANDININNIKIEKLPWTLC FT KETLADPTIHEEISQIIENNKQYINTLDDWIHFKNYTIRHHLKKQQNLIKK FT EKNKKKSRLHRLIEREDLYPGQRKQLEDEMEELLKEEEANKQWETKLKLHL FT NQETPSRYLTSKLKKRKKDRSIFQIKNKDGTIVSEKVEIAKCFLEFYQEQY FT DEKPDNESKHKELLDKWIVDKQYISRLSLDSAITPKEVNNAIKNSNPHKSP FT GLDGINAALYRNHSASLSTILAKVFNDTLTNQKEINPNFKEGFITTLFKKG FT DELQIANRRPITLLNTDYKLLSKIINNRLLAVTKKIINKFQNGFVPNRYIQ FT DNIQIMKEVIERANKSRNSTTLITFFDFNKAFDSISHKSIRRTLSHIGIPK FT TIVELIMNLLKETTNKIKINDFLVGHITVNRGTKQGDPLSPTIFALVMEAL FT LIDILKEIEIKGFKLSDQKRIKLTAFADDMSTFNNSAEELKLVLDKINNYC FT LGTSSKLNQEKTVMICIGNIPPNLPFKISEAPERYLGLNFTKVGLNSKITS FT IINSIKDSLNNWKSQATTIRAKMTIVKTYALSRLSYHQYLDSLNESHITEL FT NNIIKWFLFSAIKNTYTEEHKYRTLMTMDRAYGDWKEGGIKMWDLGIRQVA FT FKVWNMNRLLHIIKIKACNILQEWYMEQISNSKYISIGLREMVDRWKDFRN FT NFAPQHNKLKSLPDCIKGQNKKPLQLKEIYNMLITHKYKSIRKTDGQKNLI FT SINNINIPSIFQHINHISHQKGRNTLFRFFSRSLPGINYERNVPCKICNNI FT IRDPYTHLFIDCMQVKEIENIIISTFNNLSFFKIRNWDLNSLDISKTNKKE FT RIYPNLIGIIIHQLWRIICHKLFNQDESKTPPSFDPTLIEKELTNLIKIEK FT FILIKKIERSETIYKLNNRDQLIINFNTSWHNPNTPNPIPL" XX SQ Sequence 4708 BP; 2236 A; 802 C; 579 G; 1091 T; 0 other; atcacgtaat aaaataataa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaata 60 cataaaaaaa taaaaaaaaa aaaaaaaaaa atgaacaata actttttcaa tctcatcagt 120 aacaaatttg tattcaatga agattccttg aaggtctgat cacaggtaaa aaaaaaaaaa 180 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaagtatt caatgaagaa ttactaaata 240 actcaacata taaaatcaga ctatactacc caaatacagc agaattaaat agtaatactc 300 tcaaattcat tgacaataca ctcctcaaag ataaaacaat aattagaaaa tcaaaaggta 360 taacagtgtt gggtagaatt ctaaaagaaa atttgaaaca tctacacgat tcattttgca 420 catttcaaat aaaggatgag gatggaatat atagaattca aagattcatc ccatactttc 480 ctaatcacca ctacctccac ttccatacct tcacaaaaaa tataacaaaa gcaataacaa 540 aagaagatgt acaaacaaac atgggagtag aagcagagga agttgatgaa tgcaacgtta 600 aaattatgaa tgatgtgcac agatattcag caataatact attcaacaaa tcaacaaata 660 caaatataat caataatcaa attcaatcat ataaaaacaa caacaatcaa attaaagtat 720 acttctgtga atctattagt agagaagata gggaggaagc taaaaagaaa aacaaaaaac 780 aaacaacaac caatcaacct aacaataata caaatataat acaaggtaat aataataata 840 ataataataa taataataat aataataata ataataataa taataccaat aactccaata 900 actccaataa ctccaacaac aacaacaata ataataacaa cagcaataat aacaacgaca 960 aaaatgtcaa caacaacaac aaaacaatca tcacaaataa attaaacaac tcaaatacaa 1020 caaacaatat tactacacaa acaaaaccaa gtggtggtca accaccaaat ggtaacccaa 1080 ccctatcaag aggatggaaa aattaaaaat tttattatgg aattgcagag gtagttcaag 1140 ccctctctca aataaaaaag ctgaagaaac gatcaagaga ataggtcccc aaatcacatt 1200 acttacagaa acaaacggtt caacattcaa tcattataaa tcaatgttca actatgaaag 1260 gatagatcat ggaaacggta aaggcactgg tgtatcaata gagaatagaa atttaagcac 1320 tggctatcta acagtaaatt tcaaagatac agaaggtaga atattatcag ttaaattcaa 1380 ttcgtatatc aacatcaaca tccttttaat ttatgcccca gcatcaatac ctagaagaaa 1440 tgcattcatt atagcagcaa agaatctcat taaagatcaa aaaatcaaac atgacatcat 1500 tgcaggtgac tttaatgcag accacaacaa caattcacac tatggtattg gtatcagaag 1560 tataattgaa gaaagcaacc tgaaagatat aggtgcaaac aacaatatcg caactttttc 1620 aagatctaat agcagattgg acagaattta ttgtaaacca acaatggtgg gttcaaaccc 1680 gctaagggtg catgaagata tttacaacaa atctgaccac acacctattt caatagattt 1740 aaaaatgaat aacaatagcg caaacgatat caacatcaac aacatcaaaa ttgaaaaact 1800 cccgtggaca ttatgcaaag aaacattggc tgatccaaca attcatgaag aaatctcgca 1860 aatcatagaa aacaacaaac aatacatcaa cacactagat gactggatcc actttaagaa 1920 ttatacaatc aggcaccatc tcaaaaagca acagaacttg ataaaaaagg aaaaaaacaa 1980 aaagaagagt aggttacata gattaattga aagagaagat ttataccctg gtcaaagaaa 2040 gcaactagaa gacgaaatgg aagagttatt gaaagaagaa gaagcaaata aacaatggga 2100 aaccaaattg aaacttcact taaaccaaga aactccaagt agatatctaa caagtaaact 2160 aaagaaaagg aaaaaggaca gatccatatt ccaaattaaa aataaagatg gaacaattgt 2220 atcagaaaaa gttgaaatag ccaaatgctt cttagaattc tatcaagaac aatacgatga 2280 aaaaccagat aatgaaagta aacacaagga actattggat aaatggatag tggacaaaca 2340 atacatttcc agactatcat tagatagtgc aatcacacca aaagaagtca ataatgcaat 2400 taaaaattct aatccacaca aatctccagg actagatgga atcaatgctg cattatatag 2460 aaaccactct gcatcacttt caacaatctt agcaaaagta tttaatgata cattaactaa 2520 tcaaaaagaa atcaatccca acttcaaaga aggatttatc acaacactat tcaagaaagg 2580 agacgaacta caaatagcta atagaagacc aattactcta ctcaatacgg actacaaact 2640 attaagtaag atcatcaaca accgtttact tgcagttaca aagaaaataa taaacaaatt 2700 tcaaaatggg tttgttccaa atcgttacat ccaagataat attcaaataa tgaaggaggt 2760 aatcgaaaga gcaaacaaat caagaaatag tactacgtta atcacattct tcgatttcaa 2820 caaagctttc gactcaatta gccacaaaag tatcagaaga acattgagcc acattggcat 2880 accaaaaacg atagtagaat taataatgaa tctattaaaa gaaacaacga ataagatcaa 2940 gataaatgac ttcctggtgg gacacattac agttaacaga ggcacaaaac aaggagatcc 3000 attatcacca acaatatttg cacttgtaat ggaagcacta ctaattgata tacttaaaga 3060 aatcgaaatt aaaggattta aattatcaga ccagaaaaga atcaagctaa cagcattcgc 3120 agatgatatg tcaactttca acaactcagc agaagaatta aaattagtat tggacaaaat 3180 caacaactac tgtctaggta catcttcaaa acttaatcaa gagaaaactg ttatgatatg 3240 cattggtaat ataccaccaa acctcccatt taagattagt gaagctccag aaagatactt 3300 gggtctcaat ttcacgaaag tgggcctcaa ttcaaaaatc acatcaatca tcaactcaat 3360 caaagatagc ttaaacaatt ggaaaagtca agcaactaca ataagagcaa agatgacaat 3420 tgtcaaaaca tatgcgctat cgagattatc ctaccatcaa tatctagact cactaaatga 3480 atcacacatt actgaactaa acaatatcat taaatggttc ctcttctccg caattaaaaa 3540 tacatacaca gaagaacata aatatagaac attaatgaca atggatagag catacggtga 3600 ttggaaagaa ggtggaatta agatgtggga cttaggtatc agacaagtag cattcaaagt 3660 ttggaacatg aatagattat tacacattat caagataaaa gcatgcaaca tactacaaga 3720 atggtatatg gaacaaatca gcaacagcaa atatatatca attggattaa gagagatggt 3780 agacagatgg aaagatttta gaaacaactt tgcaccacaa cacaataaac taaaatcact 3840 acctgattgt atcaaaggtc aaaacaaaaa gccacttcaa cttaaagaaa tctataacat 3900 gttaatcact cacaaataca aatcaataag aaaaacagat ggacaaaaaa atctaatttc 3960 aataaataac atcaacatac catccatctt ccaacatatc aaccacatat cacatcaaaa 4020 aggtagaaat acactattca gatttttctc aaggtcactt cctggaatca actatgaacg 4080 taacgtacca tgcaaaatct gtaacaatat aatccgtgac ccatacactc acctttttat 4140 agactgtatg caagtaaaag aaatagaaaa cataataatt tcaacattca acaatctctc 4200 attcttcaag atcagaaatt gggatctcaa ttcactagat attagcaaaa caaataaaaa 4260 agaaagaatc tacccaaatt taatcggcat aattatacat caactatgga gaataatttg 4320 tcacaaacta ttcaatcaag acgaatctaa aactccaccc tcatttgatc caacattaat 4380 tgaaaaagaa ctaaccaact taattaaaat agagaaattc attttaataa agaagatcga 4440 aaggagtgaa acaatataca aattaaataa cagagaccaa ttaataatca acttcaacac 4500 atcatggcat aacccgaaca ctccaaaccc aattccatta taataaaaaa aaaaaaaaaa 4560 aaaaaaaaaa aaacaagtag tagtaatcat aatataacca aaaatatcaa catataacaa 4620 tataaaaatt taaaaattta aaaatctaaa aatttaaaaa tgtaataaaa tataacaata 4680 caccgctttt aggtaaaaaa aaaaaaaa 4708 // ID BEL4-I_AP repbase; DNA; INV; 5997 BP. XX AC Contig12244; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL4AP; BEL4-I_AP; KW BEL4-LTR_AP. XX NM BEL4-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-5997 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 435-435 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [4747-5340] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 4591..5559 FT /product="BEL4-I_AP_3p" FT /translation="MHGGPQAILSAVRQKYWPLNGRSIARSVVHRCIKCFK FT YRPVFVQPIMGDLPEARVRPTRAFKKTGVDFAGPFMIKSSLRRNAPLNKVY FT ACLFVCFITKAVHVELVGDLTTQAFLNALQRFCDRRGLCTDIYSDNATNFV FT GANRQLQELQALFQSTEHQERIQDTLSKSSIQWHFIPPRSPHFGGLWEAAV FT KSLKAHLYRTLGNASLTFEELNTILIRVEAILNSRPLTPLSSHPSDLSVLT FT PGHFLIGDALTSLPDRDERDVPTNRLSRWRRVVQFTQQLWARWTRDYLTQL FT QERGKWMSSSGPKLGVGTVVLMRDDNVPPLQ" FT CDS join(895..2310,2314..4296) FT /product="BEL4-I_AP_1p" FT /translation="MALIHDNKLMSDIQKFFYLRSCLSGDAKKVIDCLQTT FT TENYQVAWTSLIMRYDNKRVLIQVRVKALFDLQPLKKESAVHLHTLIDTVS FT GHLKALESLGQTPDSWGVSLSHLITSKLDANTRRAWESEATKMEDFIVPIL FT IDFLKSRFRILEVIESNKNMNTQVPTDIPRFKKSGDRSSSFTTTTSFRCYN FT CNGPHSIYKCTKFLALTILERIKRINDLKLCNICLRAHSDKCKSRNCPKCA FT RPHNGLHSSRTNNNNIHAGGAGNSGESSNTTPPAENDRNDIANEAMINSAP FT ASVNAHARQNSEAGQVLLSTAEILVFDSQNKPVLCRALLDSGSQHNFMTES FT LTRRLNLKRIKSSCSIIGINDASHTSTYKVTTTIKSRIYDYSLNLEFFALP FT NLTSKLPLMPVNLTELKVPVDIRLADPSFHVPKKVDIILGAEIYFELLTKE FT QIQTVSRGPIFQGTRLGWVIAGPIPSPNLAENSTFSLCTSVMSEFANIENQ FT IAEFWRLEEVKNCNVYTLEEKRCKQQFELNVKRGEDGRFTVGLPFRDNAPK FT LGKSYDVAIRRMLSLERRFQTDTKLKEEYSKFMKEYSELGHMEITQNTIPD FT GKGYYLPHHAVRNESSTSTKLRVVFDASARTSTNESLNDVLLKGPTVQEDL FT VSIMTRFRIHKYVFTADIKKMYQQIWVSESQRDFQRILWREDPSQPLNVYR FT LKTVTYGIVTASYLATACLQKLSEQESSKFPEACQALSCDFYVDDFLGGAM FT SKQSALRLRDDLITILRGAGLELCKWSSNDPDLLLGVEAVPSANNGLDLQY FT TENVTKILGLYWNKDIDAYQYKVLMYNNKTRPTKRAILSEIAAIFDPLGLI FT SPVIICFKIIMQKIWQTRVDWDATLPTEIEGEWRKCRANLIHLNTLKITRS FT LVVDGDIADIQLHGFADASLTAYGACLYLRVKNYNGEVIANLICSKSRVAP FT LKTISLPRLELLAAVLLVRLAAKYATSLSIPIEHKYFWSDSTVVLSWISSE FT SARWKTFVAHRIGEIHETTSISQWHHVSTKDNPADIVSRGYCPTKIGESNL FT WWRGPMWLHKDVSDWPKENNIYSRADEVLLEARTVSHVITDRYDLSIIERY FT SSLTKLIRVTAFCLRFINNASKKEN" XX SQ Sequence 5997 BP; 1901 A; 1173 C; 1292 G; 1631 T; 0 other; tggtccttcg aaccggatta tttagttgat gcaaaataat attatataat atcgcagttt 60 aattcataag agcatctagt gctcatcgcg taaatcgatc gtccagtgat ttaaacgcga 120 aatccaaagt gcgtgcacaa tacttcatta tagagaagta caaggaagaa cttcgtcaag 180 gaagaacttc gtcaaggaag aactacgtca aggtcatcga aggcaaggta agcgagttgt 240 taatagcaca tagtctataa gtcagtgttt cggggtttat ttccttttcc cgtttaccta 300 ttcataatat tctgacactc tcgagtctcg gttgacatta tatattagcg tatataagta 360 tagccagtgg aaagtacgtg gtagaattag acagcgcata gaatagcgac acagtacaca 420 ctacaatata gataatacaa tgagtgagct ggcgaacatt atcgtacgcc ggggccagct 480 aaaaggccaa ttaactcaca tagcaaatta catccgcgat aataaaggta gtccagatat 540 cgatcaaatt acggtaagat tggaaaagac cacagaaaca tggcatgaat ttcaacaagt 600 tcaagcgcaa attgaagaag aaaccgagac aactaatgag agcgagaaat accgtagcga 660 gttcgaggag ttgtattatg ataactagct aaatgcaata aaataattag agccgcgtcg 720 ccaggcgtta attcaaatcc gtcatctaat cggacttcga acgaatccag caacgacgaa 780 gcaaatggtc atgcaagcct acagcccgta cagtcggcaa tcaagttagc tgctatcgag 840 ataccaaggt ttacaggagt gtatacagag tgggccgcat tctacgatat atatatggcg 900 ttaatacacg ataataaatt gatgagcgac attcaaaaat ttttttattt acgatcgtgt 960 ctcagtggag atgcgaaaaa ggttatagat tgtctgcaaa ctacgacgga aaattatcaa 1020 gtagcatgga caagtttaat catgaggtac gacaacaaaa gggtgttgat ccaggtacgc 1080 gtgaaagcgt tatttgatct acagccactg aaaaaagagt cggcagtaca tttgcatacg 1140 ctaattgaca cggtgtctgg tcatttaaaa gcgcttgagt ctttagggca aactccagac 1200 agttggggcg tatcgttatc tcatttaata acgtcaaaac tggatgcaaa cacgagacga 1260 gcatgggaaa gcgaagcgac caagatggaa gatttcatag tgccaatact aatagatttt 1320 ttaaaatcac gttttcgaat tttggaggtg atagaatcaa ataaaaacat gaatacgcag 1380 gtaccaacag acattccaag gttcaagaaa tcaggggaca ggtcttcatc atttacaact 1440 accacatctt tcagatgcta taattgcaat ggcccacact caatttataa atgtaccaaa 1500 tttctagcac tcacaatact agagagaata aaaagaatta atgacttaaa attatgtaac 1560 atatgcttac gcgcacactc cgataaatgt aagtcacgta attgtccgaa atgcgcgcgt 1620 ccacataatg gattgcattc gtcgcgcact aataataata atatacatgc aggcggtgcg 1680 gggaacagcg gcgaatcgtc caacaccact ccacctgcgg agaacgatag aaatgatatc 1740 gctaatgagg caatgattaa tagcgcaccc gcatcggtaa atgctcatgc acgtcaaaat 1800 agtgaagcag gtcaggtatt gttatcaacg gccgagattc tcgtattcga cagtcaaaat 1860 aaaccggttt tgtgccgtgc gttactggat agtggatcgc agcacaattt tatgaccgag 1920 tcattgaccc ggcgtttaaa tttaaaacga ataaagtcat catgttcaat aatcggcata 1980 aatgatgcat cgcatacgtc aacgtacaaa gtaacaacta ctattaaatc gcgtatatac 2040 gactattcat taaatttgga atttttcgca cttcccaatt taacgagcaa gttaccgttg 2100 atgccggtca atctgacaga actgaaggtc ccggtggata taagattggc ggatccatct 2160 ttccacgtgc ctaaaaaggt agatatcatt ttaggcgccg aaatatattt tgaattacta 2220 actaaagaac aaattcagac agtttcacgg gggccaattt tccagggcac tcgtttaggt 2280 tgggttatcg cgggacccat cccttcgcca taaaatctag ccgaaaattc aacattttca 2340 ttatgtacga gcgtaatgtc cgaattcgca aatatagaga atcaaattgc cgaattttgg 2400 cgcttagagg aggtaaagaa ttgcaacgta tacacattag aggaaaaaag gtgtaaacaa 2460 cagttcgagt taaatgtaaa gcgtggtgaa gacgggcgtt ttacagtagg tctacctttt 2520 cgtgacaatg ctcctaaatt aggaaaatcg tatgacgtag ccattcgcag aatgttatca 2580 ttagaacgac gatttcaaac cgataccaag ctcaaggaag aatatagcaa atttatgaaa 2640 gagtactccg aacttggaca catggaaatt acgcaaaaca cgattccaga tggtaaaggt 2700 tattatttac cgcaccatgc agttcgcaac gaaagcagta ccagtactaa actgcgcgtg 2760 gttttcgatg cgtcggcgag aacaagtacc aacgaaagtc taaatgatgt gcttctaaag 2820 ggacctaccg tacaagagga tttggtctct ataatgacgc gatttcgtat acacaaatac 2880 gtattcacag ctgatataaa aaaaatgtac cagcagattt gggtgtcgga aagccaacgt 2940 gatttccagc gcattttgtg gcgtgaagac ccgagccagc cactaaatgt atatcgacta 3000 aaaactgtta cgtatggcat agtcaccgct tcatacctgg caacggcctg tttacaaaaa 3060 ttatccgaac aagagtcgtc aaaatttccc gaagcctgtc aggccctaag ttgtgatttc 3120 tatgtagacg attttctcgg cggagctatg tcaaaacagt ctgcattaag attacgcgat 3180 gacctaataa cgattttgag gggcgctggt ctagagctgt gcaaatggtc gagcaacgat 3240 cccgacttgt tgcttggtgt tgaagcagta ccaagtgcca acaatggctt agatttgcag 3300 tatacggaaa atgttacaaa aattttggga ctttattgga ataaagacat tgatgcatat 3360 caatacaaag tgttaatgta caacaacaaa acgcgaccaa caaaacgcgc gatattgtcc 3420 gagatcgccg caattttcga tccgcttggt ttaataagcc ccgtaataat ttgttttaaa 3480 atcataatgc aaaaaatatg gcaaacgcgg gtagattggg atgcaacgtt accaacagaa 3540 attgaaggcg agtggcgcaa gtgccgtgca aatttaatac atttaaacac gttaaaaatt 3600 acgcgttcgc tcgtagttga tggtgacatt gcggatatac aattacatgg gttcgcggac 3660 gcatcactta cagcatatgg cgcttgtcta tatctccgcg taaaaaacta taatggtgag 3720 gtcatagcta atttaatttg ctcgaaatca cgtgtcgcac cattaaaaac aatttcattg 3780 ccacgtctag agctattagc cgccgtccta ctagtgcggc tggcggccaa gtacgcgacg 3840 agtttatcta taccaattga acacaaatat ttttggtctg actctaccgt cgttttatct 3900 tggatatcgt cggaatcggc aagatggaaa acatttgttg cacaccgcat tggcgaaata 3960 catgaaacta cttcgatttc gcagtggcac cacgtcagca ccaaggataa cccggcagat 4020 atagtttcgc gcggctattg tcctacaaaa atcggtgaat caaatttgtg gtggagggga 4080 cctatgtggt tgcacaaaga tgtttctgat tggccgaaag aaaataatat ctatagtaga 4140 gcggacgaag ttttattaga agcgcgaacg gtatctcacg taattacaga caggtacgat 4200 ttgtcgatca tagaaagata ctcatctcta acgaaactaa tacgcgttac tgcattttgc 4260 ctacgattta ttaataatgc gtcgaagaaa gaaaattaaa ttggtgcgct tcaggcagat 4320 gaaatcgaca aggctgcgaa tagtattatt aagttagtgc aagctagtaa ttttacagga 4380 gagatcaagg acttacgtaa atcaggccaa gtaagctcta aaattgtatt gttacgcctg 4440 catccgtttt tagataaaaa tgatttgatc agggtgggcg gtcgcctagg aaatgcattt 4500 actctaaatg aatcgcagca acatccgata attttgcctg cgaaaaatgt tttaaccaaa 4560 ttaatattta tacacgaaca tgatagatta atgcatggcg gcccacaggc tatattatct 4620 gcggttcgtc aaaaatactg gccacttaac ggacgcagta ttgcgcgtag cgtagtacat 4680 cgctgcataa aatgttttaa atatagacct gttttcgtac agcctatcat gggtgatctt 4740 cccgaggctc gagtacggcc tacacgggca ttcaaaaaga caggtgtaga ttttgcgggt 4800 ccatttatga tcaagtccag tttaaggcgg aatgctccat tgaataaggt atatgcttgc 4860 ttatttgttt gttttattac taaggcggtg cacgtagaat tagtaggcga cctgaccact 4920 caagcgtttc ttaatgcgtt gcagcggttc tgtgatagac gcggactgtg cacggacatt 4980 tattccgata atgcaacgaa tttcgtgggc gccaaccgtc aactccaaga attgcaagca 5040 ctttttcaat ctactgaaca tcaggagcgt attcaggaca cattatccaa atcaagcatt 5100 caatggcact ttataccacc gaggtcacca catttcggcg gcctttggga agcggctgtg 5160 aagtcactga aagctcattt atatcgtaca ctaggcaatg cctcactcac ttttgaagag 5220 ttgaacacta tattaatacg tgtagaggca attttaaatt cgcgaccttt aactcctttg 5280 tcctcacatc cctccgattt atccgttctc acccccggcc actttcttat tggtgacgca 5340 ttaacttcgc tacccgatcg tgatgagaga gatgtcccga ccaatcgatt gtcgagatgg 5400 cgacgagttg tgcagttcac gcaacagtta tgggcccgtt ggacgaggga ttatctaacg 5460 caactccagg aaaggggaaa atggatgagc tccagcggac ctaagctcgg agttggtacg 5520 gtcgtgctca tgcgcgacga caatgtgcct ccgctccaat gatccatggg cagggtagtc 5580 gaggtcattc acggttccaa tgggcaagta cgggtggcca aggtgcgaac ttgcaaaggc 5640 gaattttccc gggcagttcg ttatttatgt ccccttctct tagagggcaa ttcaccactc 5700 gatcaacagt aaaaaaaaaa aattgttttt tttttttaag tcagggttaa taaccgagta 5760 ttatatatag tatggaatta attaacttta ttgtataaaa tttgtggaat tatttaatat 5820 ttatagtaac catatgtcat gatttatttt tttttttggc gtacatatgt atttttattt 5880 ttatgtcatg tataagctgt tgattgtata acaatgtata tattaatgtg cataataatg 5940 tatttgtata ataactataa gaaaaatttg aaacttaatg ttccaaggtg ggcggca 5997 // ID Kolobok-9_HM repbase; DNA; INV; 2818 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2818 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2067-2067 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(475..1023,1016..1531,1446..2270) FT /product="Kolobok-9_HM_1p" FT /translation="MAKGNKQNNARKLKRKAPLNHFRKQQKNDLLVKTNSL FT NTISMCSSAKKLNMGIQNHLECQDLYDYNILINFSVLKNLFLEIACPKCHM FT LTINLFDNTQKRMGFSHNLELKCTSCDQWSHTVYSSKECVKNSSNTTGRSI FT FEANARAVIAFREIGRGHSSINTFTQCMNMYGLAENAFDNLNYKTTNEIYK FT AYDDAALKSMKKAAEDLQDSSDGPTQQRIKIDGAWQKRGHSSLNGFVTGIV FT EDKVVDVQAFSKHCKGCLMWEKQKGSSSYDRWKAKHVCNINHVKSSGAMEG FT AGAVKIFERSVEKNSLIYKEYLGDGDSSSYKEVVAANPYKDFGINTSQIGM FT YWACSEALRYSSQQIPIKILVLTPLKLECIGHVQKRLGTRLRNLVKSHKGT FT KNPISGKGKLTEKTINSMQTYYGKAIRNNLNNVYAMKKAIGSVLFHCTAFT FT DENQRHEMCPRDDTSWCKWQLDKTNNTNKFHKTKTFKNTISLPMSIHYLLK FT PIFVSLSDDELLNKCVHGQTQNANESFNSVVWTRCPKNVFIERRTFECSIN FT SAILHYNDGSGGVNAVLGHFGLHGKATADKSIARDCQRVTRMEKRSSPIVK FT KQRKKLRAFKKGYMDKENESEKRESYTAGGF*" XX SQ Sequence 2818 BP; 993 A; 388 C; 486 G; 951 T; 0 other; ggtggttgcc caagaaataa atcttgattt tttttgaaat tttttttttt tttttaatgt 60 tttttatgag cagaatttta tcctctttaa aatggtatat gaatctcata tgtataattt 120 atataaagtc ttttatttct aatttttata tagtggtata taaagttttt tcttcacatt 180 tttgtttttg acatttttct tcatttgtga cttcaattga agtgtcaaat atcttgggtt 240 tacctcttca aataatcatc aactccgtat caaagtaatg tcgtttggtg tatcttaggt 300 tttatggatt ttaataaact aaaagtcatt atttgaatgg atatttttcg ttttgaccaa 360 cgtgatgcaa cataaaatat actatttatt agatttttat gaatgtttat tgtggtttaa 420 tgacttattt gtatttttat tggttcttac tcttttagac ttaaatatat aataatggca 480 aaaggaaata agcaaaataa tgcaagaaaa cttaaaagaa aggctcctct caatcatttt 540 agaaaacagc aaaaaaatga tctattggtc aaaacaaaca gtttaaatac aataagcatg 600 tgttctagtg caaaaaaact aaatatgggt attcagaatc atttggagtg tcaagacctg 660 tatgactata atatccttat taacttctct gttctcaaaa atctattttt agagatagct 720 tgtccaaaat gtcatatgct aaccattaat ttatttgata acacacaaaa aagaatggga 780 ttttcacata atctcgagtt aaaatgtacg tcttgtgacc aatggagtca tactgtttat 840 tcatctaaag aatgtgtaaa gaatagcagt aataccactg gaagaagtat atttgaagca 900 aatgcaagag cggtgatagc ttttcgagag attggtcgag gtcattcaag tataaacacg 960 tttactcaat gtatgaacat gtatggttta gccgaaaacg cttttgataa cctaaactac 1020 aaatgaaata tataaagcat acgatgatgc agctttaaaa agcatgaaaa aagctgcaga 1080 ggatcttcaa gattcttctg atggtccgac acaacaacgt ataaaaatag atggagcgtg 1140 gcagaaaaga ggccattcat ctcttaatgg atttgttact ggtattgttg aagataaagt 1200 agttgatgta caagcttttt ctaaacactg taagggttgc cttatgtggg agaaacaaaa 1260 aggttcatca agttatgata gatggaaagc aaaacatgta tgtaatatta atcatgttaa 1320 gtcgtctggt gctatggagg gagctggtgc tgttaaaatc tttgaacgtt ccgtggaaaa 1380 gaacagcctt atttacaaag aatatttagg tgatggagat tcttcgtctt ataaagaagt 1440 cgtagcagca aatccctata aagattttgg tattaacacc tctcaaattg gaatgtattg 1500 ggcatgttca gaagcgctta ggtactcgtc ttagaaatct tgtgaaatca cataaaggta 1560 caaagaatcc catatcagga aaggggaagt tgactgaaaa aacaattaac tcaatgcaaa 1620 cctattatgg aaaagctata agaaacaact taaacaatgt ttatgccatg aaaaaagcaa 1680 tcggctcagt gttgtttcac tgtactgcgt ttaccgatga gaaccagcgg catgaaatgt 1740 gtccacgtga tgacacttct tggtgcaaat ggcagttaga taagactaat aatactaata 1800 agtttcataa gacaaaaaca ttcaaaaaca ccattagttt accaatgtcc attcattacc 1860 ttttaaaacc gatctttgta agtttatcag acgacgagtt attgaacaaa tgcgttcacg 1920 ggcaaactca aaacgctaat gaatcgttca actcagttgt atggactcga tgtcccaaga 1980 atgtttttat tgaacgaaga acatttgaat gttcaattaa ctcagctatt ttacattata 2040 atgatgggag tggcggggtt aatgcagttt taggacattt tggcttacat ggaaaggcga 2100 ctgctgataa atctattgca cgagattgcc aacgtgttac tagaatggaa aagcgatcaa 2160 gtccaatcgt taaaaaacaa agaaaaaaat taagagcatt caaaaaaggg tacatggata 2220 aagagaatga aagtgaaaag cgtgaatctt acactgctgg tggtttttaa gcatcttttt 2280 cttattattt tagacaaaaa cttgattttc tcatgtttat ggttttacat aatttttaaa 2340 attttaaaac atgatatctc aagattgaat ggataaaata agttgaaatt ttcaggaaat 2400 actcatatat tatatataat ccatacatca taggcttttt ctttattgag gtaaaaaact 2460 tcttaatttg taaaatttat acaaaaaata ctacgcttta aacactcttt ttactaatag 2520 aggcataact tttgactaag tgagtcaata aaaaaattga tatgtcaaat ggattttact 2580 atcgttctta agctatgatg aaaatttgaa gtagtaacta tcttcacatc tgcagatatt 2640 gtgttttgaa atatttcatt tttttgtctc attttagcga taaaataggg taatcgagac 2700 aaaaaaatgg tttggtgtgt ttgaaatatt ttttaagttg agaccagcca tttataccaa 2760 aatttggtta attaaattca cagggtcaaa aaaaaagttt atttcttggg caaccccc 2818 // ID BEL1-LTR_AP repbase; DNA; INV; 334 BP. XX AC Contig29878; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1AP; BEL1-I_AP; KW BEL1-LTR_AP. XX NM BEL1-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-334 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 430-430 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR Genome; Contig29878; Positions 1243 1576. XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 334 BP; 100 A; 51 C; 58 G; 125 T; 0 other; tgcgtatcgc atattcgcat cgtatatgtg ttttgtgtca cttgtatatt atgaatatct 60 ataatgtcga ttattgttat tattatttat tgtacgatac tgcgataaga tattggccgt 120 acgtctgatc gctgtccgta catcgcagta ttgtacgata acagttttta gtcgttttgc 180 ggtctgcgta cctgtgaggt ctgcgctatt ttgtacagag cgtaagacct cacacaaaaa 240 ttataattat tattatttta aagtgtaatt aaagaaataa agagattgca tttaaaatac 300 cacaattacg attcaattat taatatacag tcca 334 // ID Gypsy-4_DPer-LTR repbase; DNA; INV; 909 BP. XX AC super_55; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_DPer_; KW Gypsy-4_DPer-I; Gypsy-4_DPer-LTR. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-909 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_55; Positions 4060 4968. XX SQ Sequence 909 BP; 253 A; 226 C; 294 G; 136 T; 0 other; tgtgaggaaa ggtattccca cacgcagggg gcggtacagg gttaagagct atcggccgtt 60 atccggggat atccggaggc ggcgaattcc cgaggagagc tctcccggag acccaagatg 120 agagcggagg cccccggaga gagaacagcg ggatgacggg atcgggccgg cagtggccca 180 gcgcatccgg tcaggtcccg ttgtaaagcg agcagagaat gaaggaaaca ccgatggaaa 240 gtacgggcac cgcatggcgg aaagtgccgt tagcgccagg cggcgccgac gcagaagcga 300 tcctataaga gggggccgcg tccaggaaaa gggacagtca gcgagcggag ctatcgagtg 360 tgaagtgaat agtgagcaac aagagaccga gcaaagggca aagaactttg gaatgaacgc 420 gttgtgtgag gattgagagg tggactcgag catacagcca agagccaaat ttcccaatcc 480 cttgaaccaa cggtgccacg cttggagtgc atcccacagg gagtgagaca ccttcgggcc 540 gaaaagccaa gtgagtcgag tgagagcgaa cgcgtcaggc gccgactgag aggcggattt 600 gggcctcgtg ccaagaatcg ctggccccag tcagtagagc cggcggtgcc acgcttggag 660 acgccttggg gggtccaggg taagtcacgc tcaccaaaaa cgcagaggag aaccaacggt 720 agaaccagca ccggagtcgg aatagcgtaa gatagctcta agataccaat aaagcgcaat 780 tgtcataaca gtacaaaacc caacccgtgc cttcttactt gtcgtcgtgg caacatacaa 840 atattgttgc agcgagccca ccgaccgctg agtccagctg aatcagacgg aaaagaacgg 900 ttcgttaca 909 // ID I-14_AAe repbase; DNA; INV; 5122 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-14_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5122 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1369-1369 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 109..1350 FT /product="I-14_AAe_1p" FT /translation="MPGNDGASQQTAAPRKRCYQPDFPGPFLVIAEAQGEA FT NLSISTMAKRIIQNFGRDYVKAAPISRRKMKILMRTANAANSLLEMDVENV FT SFMVPQRLIETLGVASIELEVEDEELRDAISFDKGKSIQLFNPDIVEIRRV FT VKKHGTELTPLTTVIITFSGLTLPTHVEINRALYKVKPYIYPLRQCRNCWR FT LGHSLKNCRSKKRCHCCLENDISDDHECNQTSPQCVNCGGKHAANDTRQCP FT KIKRQXEADRDRQNSYSQGPTDWFSLLGKKEAAPLEAPCVIISPPNAESTM FT VPAIQSQPSLQASSDLQVPSTCCGPASSGSKQPSVTGSVTRAKRTKRASTG FT DISEDGLPSLELHVEEQVCTAIKEGLNTNNVREMIRELQDSSSGFTEQQFR FT SEILALISERVKQYIGSLRL" FT CDS 1366..5022 FT /product="I-14_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNTSTSMWPLQFMQWNCRGINSKKASMINLINSTKSS FT VLLISETHLDADKQFDIQGFHVFRKDHRSNARGVLVAARKELQPKSFPLTT FT PEGIDAVACQIACSLGNMVIVSVYIHPNSTVSQAELEEFLGGIPQPCIVGG FT DWNAKHPLWGDHRQDRRGDSLQSALLNLNLVVLNNGSYTRVDVHRASSVID FT ITLASADISLLFEWRTLDYAFGSDHFPIAFGTEEITTEIKQQPFNYRKMDW FT DAFRNRVEDEISANMGVVDYETFFQIVWGSLTQSSPAQRTQNNPKMVPLPY FT WDDELKSALSDSRRAFKVWRRSLEYAAYEGYKEAELKFKKLVKKKRKESWR FT KHCDTLDSQTSVRDLWRWAKRFKGRKSGNQCQLRNEQVLEQLLDRLAPAYT FT CTQPMEILECDCGRHSPGGFFTIHDLQTAFKQGKDSSPGVDGICYSVLNNL FT PFNGQVCLLKIFNAIFDCGNIPEAWKIFKIIPILKPGKPPEEAASYRPIAL FT ASCFRKTYENMVKEKIEWYLEHNLRFPELICGFRKGKGTFDALHILIDEVR FT TAFHNHQHVVLCSVDVMSAYDNVQIPLLVRELRRVGISECLVKSIYGLFVE FT RIISGSIDSLRNIERRTWIGLPQGSPLSPLCFNILISSLMVSRTPGPVKID FT FADDMSIIVRGTTLEDSVSIIQGAVSDLADGLTSVGLEVSPTKCSAMIFSN FT RALGESYPIYINNTMVNYKSCIKLLGLYLTPTLSWGMQISYIKQKASLYVN FT FMRSVAGQTWGAHPDALLTIYKTCIRSILDYGCIFFESAPTDELVKLDRIQ FT WACIRIILSSTKTTHTGSLEVLCGLLPLKLRREMLACKFVNKRFSLESWYE FT KFVIPVIEGRAAQSGIGKAILMYRVYAGPLSTIRILPCFHFTPQIRQKLVS FT IDLSIHRLLQNNHRYCTEEIENMIATRYPNATLLGTDGSRSSAACGYAVVS FT QTGEALCQTKLPPTVSVFHAELLGIYRALEHIYACQGLEFLILSDSLSSLT FT SLANRKLQPKLPPLWYQTRKLIRDIEESGKSVTLMWVPAHRNIAINEAADR FT EAKAASVHGSPERYQFSSFDIVFPSKRKAVSIWQEVWDNGSKGRFCYSILP FT TVTTNPWWSSYNLNRRQIVVLSKLISNHSRIAAHLKRNNIIEDDNCECGDG FT ASSPDHLIFTCARYEAHRQELWRAVIKEGIAPDVQIILKTQNSILYELLAN FT FFIRTNIDL" XX SQ Sequence 5122 BP; 1543 A; 1130 C; 1126 G; 1321 T; 2 other; cagttacaat cgaactgcgg tgagccacag acgtgttgag ccaactttcg atatatctac 60 ctatctctat tatcggatcg gttattgttc ggtcgctttt gtgaaataat gcctggcaac 120 gacggtgcat cgcagcaaac cgctgctcct cgaaaacgtt gctatcagcc ggacttccca 180 ggacctttcc tggtaatagc agaagcacaa ggtgaagcta atctatcaat ttcaacgatg 240 gccaaacgga ttatccagaa ctttggccgc gattatgtga aagctgctcc aatatcaaga 300 cgtaagatga aaatcctcat gcgaacggcg aacgctgcta acagtctact ggagatggat 360 gtggaaaatg tttcatttat ggtccctcaa cggctgatcg aaactctcgg tgtagctagt 420 attgagctag aagtggaaga tgaggaatta cgcgatgcca tctctttcga caaaggcaaa 480 tcaattcaac tattcaaccc tgacatagtt gaaattcgcc gggttgtaaa gaaacatggc 540 accgagctga cccctctaac gacagtgata atcactttca gtggtttaac tctacctacg 600 cacgtggaaa taaatagggc gctctacaaa gtcaagccat acatctatcc actacgtcaa 660 tgtcgtaatt gttggaggtt aggtcatagc ctaaagaact gtaggagcaa aaagaggtgt 720 cattgctgtt tagaaaacga catttcggac gatcatgaat gcaaccagac ctcaccgcag 780 tgtgtcaatt gcgggggtaa gcatgcagcg aatgacacta gacaatgtcc gaaaataaaa 840 cgtcaaatkg aagctgatcg agatagacag aattcgtatt cgcaaggscc tacagactgg 900 ttttcgcttc ttggaaagaa ggaagcggcc cctttggaag ccccatgcgt aataatttct 960 cctccaaatg cagaatctac aatggttcct gcaattcaat ctcaaccttc tctacaagca 1020 tcatcggatc tccaagtgcc atcaacctgc tgcggtcctg ctagtagtgg cagtaagcaa 1080 ccatctgtta ctggttctgt cacacgagcc aaacgtacga agcgagcatc aacaggggac 1140 atttctgagg acggcctccc aagcttagag ctgcacgtgg aggagcaagt atgtacagct 1200 ataaaagaag ggttaaatac caacaacgtc agggaaatga ttcgtgagct gcaggattcg 1260 agctcgggtt ttactgagca acagtttcgg tcggaaatac ttgccctcat cagcgaaaga 1320 gtgaagcagt acataggttc actgcgttta taatgaacaa ctactatgaa cacttcaacg 1380 tcgatgtggc ccttacaatt tatgcaatgg aattgtaggg gaattaactc taaaaaggct 1440 agcatgatca acctaattaa ttctacaaaa tcatcagtat tgctgatttc cgagactcac 1500 ctggatgctg acaagcagtt cgatattcaa ggcttccatg tcttcaggaa agatcatcgg 1560 agcaatgcac gaggagtact agttgccgca aggaaggagc tacaaccgaa atcgtttcca 1620 ttgactacac ctgagggcat tgatgcggtg gcctgccaaa tcgcatgcag tctcggcaat 1680 atggttattg tatcagtcta catccatcca aactcaactg ttagtcaagc agaattggaa 1740 gagtttctgg gtggcattcc gcaaccttgc atcgtaggcg gtgattggaa tgctaaacat 1800 cctctttggg gtgaccatcg tcaagatcgt cgaggagatt cacttcagtc agccttatta 1860 aatttgaatc tcgttgtgct gaataatggc agctacacgc gggttgatgt tcatcgtgct 1920 tccagtgtga tagacatcac cttagcatca gctgacataa gtcttctttt tgaatggaga 1980 actttggact acgcattcgg aagcgatcat tttcccattg ctttcgggac tgaagaaatt 2040 accaccgaga ttaagcaaca accattcaac tatcgcaaaa tggactggga cgcattcaga 2100 aatagagtag aagacgaaat ctctgcaaat atgggtgttg tagactatga aacctttttc 2160 caaattgttt ggggaagcct cacgcaatct tcaccagccc aacgcactca gaacaacccc 2220 aagatggttc ccttaccgta ctgggacgat gaactaaaat ctgcactatc ggacagcaga 2280 cgcgctttca aagtgtggag acgttcactg gaatacgctg catacgaggg atacaaggag 2340 gcggaattga agtttaagaa gctagtgaag aaaaaaagga aagaatcttg gagaaaacac 2400 tgtgatacat tggattctca gacttcggtt agagatcttt ggagatgggc taagcgattt 2460 aaaggaagga aatcgggaaa tcagtgccag ctgcggaatg agcaagtatt ggagcaattg 2520 ttggatcgtc tagcccccgc atacacttgc acccaaccta tggagatatt ggagtgcgat 2580 tgtggaaggc attcgccagg aggattcttc actattcacg atcttcaaac ggccttcaaa 2640 caagggaaag actcctctcc cggtgttgat ggaatttgct actcagtgct caacaatctg 2700 ccattcaatg gacaagtatg tctcttgaaa atcttcaacg ctatatttga ctgtggaaac 2760 attcccgaag catggaaaat tttcaaaatc atcccgattt tgaaaccagg caaaccacct 2820 gaagaagcag cgtcttaccg cccgatagcg ttggcttcct gctttcgtaa aacttacgaa 2880 aatatggtaa aggaaaaaat cgaatggtac ctagaacaca acttgcgatt ccctgaatta 2940 atatgtggtt ttcgtaaggg aaagggtact ttcgatgccc tccacatatt aatagatgaa 3000 gttcgcactg cttttcacaa ccaccaacat gtagttttat gtagcgtgga tgtgatgagc 3060 gcctatgata atgtgcaaat tcctttactc gttagagagc tgcgaagagt aggtatatct 3120 gaatgtctcg tcaaatcaat ttatggtctc ttcgttgaac gtataatatc ggggtcaatc 3180 gatagcctca gaaacatcga aagaaggaca tggattggat taccacaggg ttctccgtta 3240 agcccccttt gcttcaacat cttaatcagc tcgctgatgg tatctcgtac tcctgggcct 3300 gtcaagattg atttcgctga tgacatgtcc atcatagtac gagggacaac tctagaagat 3360 agcgtttcca ttatccaagg tgcagtgtcc gatcttgcgg acggattgac ttcagtgggg 3420 ttggaagtat cccctacaaa atgttcagca atgatattta gcaatcgtgc tctgggagaa 3480 agttatccaa tctacataaa caatacaatg gttaactaca aaagctgtat caagctacta 3540 ggattatatc taactcctac actctcctgg ggtatgcaaa tcagctacat caaacaaaaa 3600 gcatcacttt acgtaaattt catgcgatcg gttgctgggc aaacttgggg tgcccatccc 3660 gatgctttgc tgaccatcta caaaacatgt attcgatcaa tactcgatta tgggtgcatc 3720 ttctttgagt cagcaccaac cgacgaactc gtcaaactgg atcgaataca gtgggcttgc 3780 atccgcatta tcctaagttc caccaaaacc acccacactg gatcattaga ggtattgtgt 3840 ggacttctac ctctaaaact ccgaagagag atgttggcct gtaaatttgt caacaaaaga 3900 ttttcactag aatcatggta tgaaaaattt gtgataccag taatagaagg acgagcagcc 3960 caatccggga taggcaaggc gattcttatg tatcgcgtat atgccggacc tttatccaca 4020 atacgaatcc taccgtgctt tcatttcaca ccccaaataa gacaaaaact tgtttccatt 4080 gacctgtcga tccataggct gttacaaaat aatcatcggt actgcactga ggaaattgaa 4140 aacatgattg ctaccaggta cccaaatgcg actttactgg gtacggatgg atctagatct 4200 tcagcagcat gtgggtatgc agtggttagt caaacaggcg aggcattgtg ccaaaccaaa 4260 cttcccccta cggtgtcggt ttttcatgcc gaacttttag gcatttaccg agctttagag 4320 cacatatacg cttgtcaggg attagaattt ctcatcctgt ccgatagtct tagcagctta 4380 acaagcctag ctaatcggaa gctacaaccg aaactaccgc cattgtggta tcagacaaga 4440 aagttgattc gagacatcga agaaagcggt aagtccgtca cgctaatgtg ggttccagca 4500 catcgtaaca ttgctatcaa cgaagccgca gatagagaag ccaaagcagc atccgttcat 4560 ggtagcccgg agagatacca attctcttca ttcgacatcg tcttcccctc taaacgcaaa 4620 gcagtgagca tttggcagga agtttgggac aacggctcta aagggagatt ctgctacagt 4680 atcctgccta cggtgactac caatccgtgg tggtcttctt ataaccttaa ccgtcgtcaa 4740 atagtggttc tatccaaatt gattagcaac cattcaagaa tagccgctca tctcaagagg 4800 aacaacatca tagaagacga taactgcgag tgtggcgacg gagcttcatc acctgatcac 4860 ctgatattca catgcgcaag gtacgaggct caccgccaag agctttggag agccgttatc 4920 aaggaaggaa ttgctccgga tgttcaaatt atacttaaaa ctcaaaacag tatcttgtat 4980 gaattgttag ctaacttttt cattcggaca aatatcgatc tataaaatgt atttcatttg 5040 ttaatctcac ctggctaaaa tatgttgtaa aacaagaggc caaataaaga agtgttgaat 5100 taaaaaaaaa aaaaaaaaaa aa 5122 // ID Copia-17_CQ-I repbase; DNA; INV; 4229 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_CQ_; KW Copia-17_CQ-LTR; Copia-17_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4229 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 349-349 (2011). XX DR [2] (Consensus) XX CC Positions [1592-2119] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 332..4216 FT /product="Copia-17_CQ-I_1p" FT /translation="MAAWLEKDGEARHAIGMAVGNDQLVHICRKKTAKEMW FT DTLLGIHEVESMNTMMTVMRKMCSMKLPEDGNLPEHLKQLTAMHNRLEVAD FT EGLKPRQFVAVILSSLPMSYGTLINVIEGYPREQITVDFVKSKLRDEWRRR FT QECQEFQNGPSDEKALKISAKSGSRAKPKAAELKKKKSGVCHFCQEEGHFI FT RDCPVMAEKRKEVMKSDAKKGSAKVNLAAEVHSEKEVCLMATTACETNRWY FT LDSGATSHMTSDRAVLSNWNPAKQPEICLADGTKIVCSGTGSGKLVSVDEG FT GARVKVTLNDIIHVPSLAGNLLSVSKICDLGLKVLFDRDGFEILRGQEAVL FT LGERHGGLYRLKQFVEKAMLAKPEHPQLCEHLWHRRLGHRDPDAISAIVRE FT ELGFGLQMKKCDVQCVCGVCCEGKMSRLPFPKESLSKSQAVGDLVHSDLGG FT PMEVATPAGNRYFMTMLDDFSGYTMVYLLKAKSEAESKIREYFNLVKNQFG FT RPPRVLRTDGGGEYSGSSLKKFLAESGTIHQMTAPYSPQQNGKAERKNRYA FT VEMVRCMLAESGLGKQYWGEAITAANYLQNRLPSSSVEKTPHELWYNKKPS FT YSHIRIFGSEAFVHVPKEKRLKLDMKAEKMVFVGYAEGRKAYRFLDPDTNR FT IVISRDAKFIEQDGVTELQRNRPEPAEKADAVEMVQLPSYSAGRNEGSTSR FT EVQTEPVAEVDEPETTDTEEPEDTASEEQNESVYENASEGELSFHGFPLDE FT IARRSLRPNLGVPPRRLIEEIFVAREEEMEPRTLKEALSCDDSSEWKRAML FT EELKSHATNGTWDLVELPAGRKPVGCRWVYKRKRNAAGEILKHKARLVAQG FT YCQKYGEDYDEVFAPVTKQTTLRALLAVASKKKLILKHFDVKTAYLYGKLE FT EELYMKQPAGFEKKGAEKLVCRLRRSIYGLKQSARCWNHRLHAVLVGMEFV FT QSGADPCLYTKDVDGERTYILVYVDDILVGSAAETNIQEIYGCLKKEFEMT FT DLGDLNYFLGLEIHREDGKYSVSLEGYIERVAERFGLRDAKVAKTPMDEAF FT VKVEPESVALQESTTYRSLVGALLYIAVCARPDIAVSTSLLGRRVASPTEA FT DWTAAKRVVRYLKGTKHWRLHYGDTEAGLVAYSDADWAGDLKTRKSTSGMV FT FLLAGGAISWASRLQNSVTLSSMESEFVALSEASQETVWLLHLLDDLGEAP FT QKPVVVMEDNQSCIKFVGSERVTRRSKHIETRECYVKELCNDGVIELVYCP FT TEDMLADLLTKPLGAVKMAKLSSLLGSKSGVGKR" XX SQ Sequence 4229 BP; 1054 A; 1032 C; 1388 G; 755 T; 0 other; agaggttatg ggcaccagtg cagcgcggaa ttagtggaat cgcggataaa atcgtcgttt 60 tttcgaagtg caaaagttgg tcgggaaagt tggagtcgcg tcgcgtgtga aaaagcaaaa 120 tggcggacga caagatcgag aagttgaacg accagaactt cgaaatatgg aagttccgga 180 tggagctcaa gctgacgaag gagaagctca tcaaggtcgt cctggagccg aaaccaagac 240 tgaagccggt cgcaggtagc ggccaaacgg agcgacggga gaccagacgg atgctgcagc 300 ggcgggggtt ccagcggtgt cggcagcaga gatggcggcc tggctcgaga aagacgggga 360 ggcgcgtcat gcgatcggca tggccgtggg aaacgaccag ctggtccaca tctgccggaa 420 gaagaccgcc aaggaaatgt gggacaccct gctcggcatc cacgaagttg agtccatgaa 480 cacgatgatg acggtgatgc ggaagatgtg ttccatgaag ctgccggaag acggcaatct 540 cccggaacac ctgaagcagc tgacggcgat gcacaaccga ttggaagtgg cggatgaggg 600 gctaaagcct cgtcagtttg tggcggtgat tctttcgagt ctcccgatgt cctacggcac 660 gttgatcaac gtcatcgagg ggtacccgag agaacaaatc acagtggatt tcgtgaagag 720 caagttgcgc gacgagtggc gtcgccggca agagtgtcag gagttccaaa atggcccaag 780 cgacgagaag gccctgaaga tcagtgctaa aagtggttcg cgtgcgaagc ctaaggcggc 840 agaattgaag aagaagaaaa gtggtgtgtg tcatttctgc caagaagagg gccattttat 900 tcgtgactgt ccagtgatgg ctgaaaagcg gaaggaagtg atgaaaagtg acgccaagaa 960 gggcagtgcg aaagtgaatt tggcagcgga agtgcactcc gaaaaggagg tgtgcttgat 1020 ggcgacgacg gcgtgcgaga cgaaccggtg gtacttggac tccggggcga cgtcgcacat 1080 gacgagcgac cgggcggtgc tgagcaactg gaatccggcg aaacaaccgg aaatttgctt 1140 ggcggacgga acgaagatag tgtgcagcgg taccggaagt ggaaagctcg tgtccgtcga 1200 cgaaggtggt gctcgggtga aggtgacgct gaacgacatc atccacgtcc cgtcgctggc 1260 aggaaacttg ctctcggtga gtaaaatttg cgatcttgga ctgaaagtgc tattcgaccg 1320 agacgggttc gaaatcctgc ggggccaaga agcggttctg ctgggcgagc ggcacggcgg 1380 gctctaccgc ttgaagcagt tcgtggagaa ggcgatgctt gcgaagccgg agcacccaca 1440 actgtgcgaa catctgtggc accgacgatt ggggcaccgt gaccctgatg caatctctgc 1500 aatcgtccga gaggagcttg gatttggcct gcaaatgaag aagtgcgatg ttcagtgtgt 1560 ctgcggggtg tgctgcgagg gcaagatgag tcgcctaccg ttcccgaagg agtccctgag 1620 caagtcgcaa gcagtcggag atctggtcca ctctgacctc ggaggcccca tggaagttgc 1680 gacgccggca gggaaccgat acttcatgac gatgttggac gatttcagcg ggtacacaat 1740 ggtctacctg ctgaaggcga agtcagaggc cgagtccaag atacgggagt acttcaactt 1800 ggtgaagaac cagttcggac gtcctcccag agtccttcgc acagatggcg gcggcgagta 1860 ttccggatcg tcgttgaaga agtttctggc tgaaagcggg acgatccacc agatgacggc 1920 tccgtactct ccccagcaga acgggaaagc ggagcggaag aaccggtacg ctgtggagat 1980 ggtgaggtgc atgctcgccg agtctggcct tggcaagcag tactggggcg aggcgataac 2040 agctgcgaac tacctgcaga atcgcctacc ttcgtcgtcc gtcgagaaga ctccgcacga 2100 gctgtggtac aacaagaagc catcgtacag ccacatccgg atcttcggct cggaggcatt 2160 cgtgcacgtg ccgaaagaga agcgcctgaa actcgacatg aaggcggaga agatggtctt 2220 cgtggggtac gctgaagggc ggaaggccta ccgtttcttg gaccccgaca ccaaccggat 2280 cgtgatcagc agagacgcca agttcatcga acaggacggt gtcacggagt tgcagaggaa 2340 tcggcccgaa ccggctgaaa aagccgacgc ggtcgagatg gtgcagctac catcgtatag 2400 cgccgggcgg aacgaaggat ccacaagcag agaggtccag acggaaccgg tggctgaggt 2460 cgatgagccg gagaccactg acaccgaaga accggaagac accgcaagtg aggaacagaa 2520 cgagtcggtc tacgagaacg catccgaagg cgaactgtcc ttccacgggt ttccactgga 2580 cgagatcgcg cggcggtcgc tgcgaccgaa ccttggagtc ccacccaggc ggctgatcga 2640 agagatcttc gtcgctagag aggaggagat ggaaccgaga acgttgaagg aagcgctgag 2700 ctgtgacgac agttcagagt ggaagcgagc tatgttggag gagctcaaat cacacgcgac 2760 gaacggtacc tgggacttgg tggaactacc tgcaggccgc aaaccagtcg gctgtcggtg 2820 ggtctacaag cggaagcgaa acgcagccgg agagattctc aagcacaaag cgcgcctggt 2880 ggcacagggg tactgccaga agtacggcga ggattacgac gaagtctttg cccctgtaac 2940 gaagcagacc acgctccgcg cgttgctggc cgtcgcaagc aagaagaagc tgatcctcaa 3000 gcacttcgac gtgaagacgg cttacctgta cggcaaactc gaggaggagc tgtacatgaa 3060 gcaaccggcc ggtttcgaga agaaaggagc agagaagctg gtgtgcagac tcaggaggag 3120 catctacggg ctgaagcagt ccgcccggtg ctggaaccac cgactgcacg ccgtcctggt 3180 tggaatggag ttcgtccaaa gtggtgcaga tccctgtctg tacacaaagg acgtcgacgg 3240 cgaacggacg tacatcttgg tgtatgttga cgatatcctc gtcggcagtg cggcggagac 3300 gaacatccaa gaaatctacg gctgcctgaa gaaggagttc gagatgacgg atctcggaga 3360 cctgaactac tttctgggtc tggagattca ccgagaggac ggcaagtaca gtgtatcgct 3420 ggaggggtac atcgaacgag tggccgaacg ctttggactt cgcgacgcga aggtcgccaa 3480 gactccgatg gatgaagcgt tcgtcaaggt ggaacccgaa agtgtcgcac tgcaagaatc 3540 gaccacgtac aggagtctcg ttggagcact tttgtacatc gcagtgtgtg caagaccgga 3600 catcgcggtg agtacatcac tgcttggtcg ccgagtggcc tcgccaaccg aggcagactg 3660 gacagccgcg aagcgcgtcg tacgctactt gaaggggaca aagcactggc ggctgcacta 3720 cggcgacact gaagctggtc tggtcgcgta ttctgatgcc gactgggccg gagacctgaa 3780 gacacgaaag tccacgtcag gaatggtgtt cttgctagct gggggcgcta tttcctgggc 3840 aagccggttg cagaactctg taaccttgtc ttcgatggag tcggagttcg tggcgttgag 3900 cgaagcgagc caagaaactg tttggctact acacctgctg gacgatcttg gcgaagctcc 3960 ccagaaacct gttgtagtga tggaagacaa ccaaagctgc atcaagttcg tcggatcgga 4020 acgtgtcacc cgacgctcaa agcacatcga gaccagagag tgctacgtga aggagctgtg 4080 caacgacgga gtcatcgagt tggtctactg ccctaccgag gacatgctgg ccgacttact 4140 caccaaaccg ctcggagcgg tcaagatggc caagttgtcg tcgctgctgg gttcgaaatc 4200 cggtgtcggc aagcgttgag gaggagtat 4229 // ID TTAA21_AP repbase; DNA; INV; 245 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA21_AP. XX NM TTAA21_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-245 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2089-2089 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 245 BP; 91 A; 30 C; 34 G; 90 T; 0 other; ccctacactg catacaaggg catatatgca ttataacatt agatggttat gcacctaaag 60 tatgttcgat taaagctaat gaaaacctgg ctatcatttg cttaaaccag tgtaaaaatt 120 agtagttcgt aaaatttttt taaaatttaa aatttttttt tttatgattt ttcctaaaat 180 tattgaatgg aactatataa catataattt ttttagcaaa aaaaaaaatg tcatgcagtg 240 taggg 245 // ID Gypsy-258_AA-LTR repbase; DNA; INV; 112 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-258_AA_; KW Gypsy-258_AA-I; Gypsy-258_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-112 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1118-1118 (2011). XX DR [1] (Consensus) XX SQ Sequence 112 BP; 38 A; 13 C; 30 G; 31 T; 0 other; tgtaacgata tacaagagtt gaaataatac tttagtttca gtgtaggaaa tgttgtacat 60 ttgggaacac tggaatgtgg tacgatgcaa tgcacgaacg aaggttgagg ca 112 // ID BEL-45_CQ-LTR repbase; DNA; INV; 368 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-45_CQ_; KW BEL-45_CQ-I; BEL-45_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 244-244 (2011). XX DR [2] (Consensus) XX SQ Sequence 368 BP; 106 A; 89 C; 105 G; 68 T; 0 other; tgtccgccaa cgcgtccggt ttcgagcatc ttcgagcgtg aaatgaaaaa tcacgacaga 60 cggcagcact ggttgctgcg attggcgaac cccgccggcg ttgccaaacg tgacagcggc 120 gaaagacgca acggcgactt cgtctcgacc aaaaagcgcg cgcaaagaag gagagaggag 180 agaggtagag agaaaattta ctccgttcgc gagcctcgta cggtgaacgt cgcgcagtgt 240 aaataaaaat cggaatcgaa agagtttttg caaagaaatc gcgtgttatt tctttttccg 300 aaagcggatt acagtcccgc tagaagcccg aggaaacatt ctcggttgag gacgaagtgg 360 ccccaaca 368 // ID Gypsy-252_AA-I repbase; DNA; INV; 4004 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-252_AA_; KW Gypsy-252_AA-LTR; Gypsy-252_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4004 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1105-1105 (2011). XX DR [1] (Consensus) XX CC Positions [3242-3697] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 23..3988 FT /product="Gypsy-252_AA-I_1p" FT /translation="MKIKKHPSAPDKPNLRISLKRKKWPPRSNLAKFTAEQ FT IERIATLRPIGRPRGTVSSQQYIGETFRMDSFKRPDTLELSGNVAEHWKKF FT KQQLEFFMIAIGYDTKADKVKVATLLNLIGDDAVEVYNTFVFDDEEGRTYA FT KVVEKFDAYCNPRNNEIYSRWKFYSRCQAENEPFEHFLTDLRTLVKDCCTL FT EEDKMIRDRIIFGCHSTELRDKLITKDCTKDVTLNAAIEAARLDEVKRKQL FT RLINRGSSSNVRVEAVDRSAVPPNVMPNRSIQSGSRDNRSIKQCEYCLTSH FT AKGRCPAYGQQCRRCQKMHHYQRACRSRPVLNQEIRVESSDGSTRARDWGD FT ADECNDVIVSSVVREMPPLSLVNIKKERWYADVKVENTVLRFKLDTGSDVN FT FIPCDVFNKMCLNVKSKPIITFRAYGGVVYKPKAIVELKVCHNELELIDEF FT WVTDVSTQPILGLTTCVRLGLLKRCSPNIDSVHCESTHDLVEKHRDVFEGV FT GKFSEKCHLSLKSDAIPVQHPPRRIALALKEKLKRKLQEMEKSSIISRVDT FT PTGWVHHVTVVEKPNGDIRVCLDPAELNAALRRDFYQIPTLEDLASALSGA FT EYYTVLDLKDGFYHMELDDASSDICSFSTPFGIFKFLRLPFGLNVSAEMFQ FT KWNEHFFGDLEGVIIYIDDILIYGRTREEHDRRLSAVLRRAAEINAKFNPK FT KIQLRVRRVKYMGHLFSAKGMEPDPERTKAIIEMDMPNNTKALQRFLGTVN FT YLRSFLPRLAEMTAPLRELLKKDTVFKWLPIHSDAVNNIKKSITTAPVLSI FT FDEHKGLVIQSDASKCGLGACLLQDGKPLAYSSRSLSTSEQNYAQVEKEFL FT AILFACKKYHHYIYGREVKVQTDHKPLVALMEKPVSKIHASRLQRIRMKLS FT KYNLKLHYVPGKYMYLADFLSRAFIQSEEKISTSLDYVVHTINVSSDKLEE FT FRKETSNDPILSKLLQVYRDGWPNDRSKLPDDLRYYWQFRDKLYEDGGLLF FT MEEKLFVPSCMRRQMLQLLHETHGGITKTKKRAATIFFWPRMNQDIEDTVA FT KCAVCEQFHRSNVKEPLLSLPIPQRCFEQLGSDFFEFGGKSFLMIADYFSK FT WIHVAETKGKSAKDVCEVFRRTFCIFGVPKTIVCDNNPFNSVAVKDFCSNW FT GCRVLTSSPNYPQSNGFSERCVQTVKRIMRKCTEDNVDFYYALLEYNNTPA FT AGLSVSPAEILMSRSLRSKLPVAENRLKGMKDDVIREQLKENQQRYKQDYD FT KSAHPRSEFEEGDLILAQDQATKKWVRGEILRKLDEPRSYVVKTNLANYRR FT NSRFIRKRNT" XX SQ Sequence 4004 BP; 1275 A; 743 C; 973 G; 1013 T; 0 other; tggcgcagcc ggtaataaag tgatgaaaat caaaaagcat ccatcggctc cggataaacc 60 gaacctacga atttcgctta agcgaaaaaa gtggccgccc aggagtaatt tggcaaagtt 120 tacggctgaa cagattgaac gaatagctac attgcggcca attggaagac cccgcggaac 180 agtcagttcg caacagtaca tcggagagac attcaggatg gatagcttca agcgaccgga 240 tacattggag ttatcaggaa atgtggcgga gcattggaaa aagtttaagc agcagttgga 300 atttttcatg atagctattg gatatgacac gaaggctgat aaagtgaaag ttgcgactct 360 tctgaacctg attggtgatg atgctgttga agtgtataat acgttcgttt ttgatgatga 420 agaaggtcgt acgtatgcaa aagtggtgga aaaatttgat gcatactgca atccacggaa 480 caacgagata tattcacgct ggaagttcta ttcccgttgt caggcagaaa atgaaccgtt 540 tgaacatttt ctgaccgatc tccgcacttt ggtgaaagac tgttgtactc tggaggaaga 600 caaaatgata cgggatcgca ttatctttgg ctgccatagt actgaactga gggacaagct 660 tatcaccaag gactgcacga aagacgtcac attgaatgca gcgatcgagg cggcgaggct 720 ggacgaggta aagaggaaac aactaaggct tatcaaccgt gggtcctcga gcaacgtaag 780 ggtggaagcg gttgatagaa gtgcggtacc acctaatgtg atgcccaacc gttctataca 840 gagtggaagc cgtgacaata gaagcatcaa acaatgtgag tattgcctaa caagccatgc 900 taaagggaga tgtcccgcat atggacagca gtgtaggaga tgccagaaaa tgcaccacta 960 ccagcgagct tgccgaagta gaccagtgct gaaccaagaa attcgcgtgg agagctccga 1020 cggttcaacg agagcgagag attggggtga tgctgatgag tgcaatgacg tcatcgttag 1080 ttcagtggta cgagaaatgc cgccactcag tttggtaaat attaaaaaag agaggtggta 1140 tgccgatgtg aaggtagaaa atactgtgct gaggtttaag ctagatactg gatcggacgt 1200 aaattttatt ccgtgtgatg tttttaacaa aatgtgtttg aacgttaaat ctaagcccat 1260 aataactttt cgtgcatatg gaggtgtagt ttataaaccc aaagctattg ttgagttgaa 1320 ggtatgtcac aacgaacttg aactaattga cgaattttgg gttacagatg taagcacaca 1380 accgatacta gggctcacaa catgcgttcg gctaggacta ttgaagcgtt gttctcctaa 1440 tattgactca gttcactgtg aaagcaccca tgatcttgta gaaaaacatc gagacgtttt 1500 tgaaggagta ggtaaattta gtgaaaagtg tcatctttct ctaaaatctg atgctattcc 1560 agtgcaacat cctccccgcc gtattgcgct agcgttaaaa gaaaagctaa aaaggaagct 1620 acaggagatg gaaaagtcga gtatcattag tcgagtagat acaccaacgg ggtgggttca 1680 tcatgtcact gtcgtcgaga aacctaatgg agatattagg gtgtgcctgg accctgctga 1740 attgaatgct gcgttgagga gggattttta tcaaattcct accctggagg atttggcttc 1800 agcgttaagt ggcgcggagt actacaccgt gttggacctt aaagatggct tttatcatat 1860 ggaactcgat gatgcttcta gtgacatttg cagtttcagt acaccgttcg gtattttcaa 1920 gttcctgcga ctaccatttg ggctaaatgt atctgccgaa atgtttcaaa aatggaatga 1980 acactttttc ggagatctcg agggcgtcat catttatatc gatgacatat tgatttatgg 2040 gcgaacccga gaagaacacg accgtcggtt aagtgccgtt ttacgtcgag ctgcagaaat 2100 caacgccaaa ttcaacccga agaagattca actgcgagtg cgaagagtga aatatatggg 2160 ccacctgttt tcggcgaaag gaatggaacc ggatccagaa cgtacgaaag cgatcatcga 2220 aatggatatg ccaaacaata cgaaagcatt gcaacgattt ttgggaacag tgaactacct 2280 tagaagcttt cttccacgtt tagcagaaat gactgcccca ctcagagagc tgctgaaaaa 2340 agatacggtt ttcaagtggc tgccaattca ttcggacgcg gtcaacaata tcaaaaaatc 2400 catcaccact gcaccagttt tgagcatttt tgacgaacat aagggtttag taatccaatc 2460 ggatgcgtca aaatgtggac taggagcgtg tttgctccaa gatggaaagc cgctggcgta 2520 cagttcaaga agcctttcaa ctagtgaaca gaattatgcc caagtggaaa aggagttttt 2580 ggccattctt tttgcgtgca agaagtacca tcactacatt tatggaagag aagttaaagt 2640 gcaaaccgac cataaacctc tagtagcgct gatggagaaa ccggtatcaa aaattcacgc 2700 ctcaaggttg caacggatta ggatgaaact ttcaaagtac aacctaaaac ttcattatgt 2760 gcccggcaaa tacatgtact tagctgattt tctttctaga gcgttcatcc agtcagaaga 2820 aaaaatatca accagtctgg actatgtagt acacacaata aacgtttcga gtgacaaatt 2880 agaggagttc aggaaggaaa caagtaatga tccgattctc tcaaagttgc tacaggtcta 2940 cagagatggt tggcctaatg atcgttccaa actacctgat gatcttcgat attactggca 3000 gtttagagac aagttatatg aagacggtgg gctgttattc atggaagaaa agctattcgt 3060 gcctagctgt atgagacgac aaatgttaca actgcttcat gaaacgcatg gtggaattac 3120 aaaaacgaag aagcgagcag caacaatttt tttctggccg agaatgaacc aggacatcga 3180 agatactgta gcaaagtgcg ctgtttgtga gcagttccac agatcaaatg taaaggaacc 3240 tcttctatca ctcccaattc cacaacgatg tttcgagcag ctgggtagtg attttttcga 3300 atttggagga aaatcatttt tgatgatagc cgattacttt tccaagtgga ttcatgtagc 3360 ggaaactaag ggtaaatcgg caaaggatgt ttgcgaggtg ttcagacgca cattttgcat 3420 ctttggtgta ccaaaaacca ttgtttgcga taataaccct ttcaactctg tagcagttaa 3480 agatttttgc tcgaactggg gatgtcgagt tcttacgagt agcccaaact atccacaatc 3540 caatggattt agtgaacggt gtgttcaaac agtaaaaaga ataatgcgaa agtgtactga 3600 agataacgtg gatttctact acgcgttgtt ggagtataat aacactccag cagctggcct 3660 aagtgtatca ccagcagaga ttttaatgag caggagttta aggtcgaagt tgccggtggc 3720 cgagaatcgg ctcaaaggaa tgaaggatga tgtcatcaga gaacaattga aggagaatca 3780 gcaaagatat aagcaagact acgataagtc agcacatcca agatcagaat ttgaagaagg 3840 cgacttaata ctcgcacaag atcaagcaac caaaaagtgg gtaagggggg aaatattgcg 3900 aaaattagat gaacctagat cgtatgttgt gaaaacgaat ttagcaaatt ataggcgaaa 3960 ctctagattt ataagaaaaa gaaatacttg aaaaagggag atgt 4004 // ID Gypsy-2_TCa-I repbase; DNA; INV; 4498 BP. XX AC ChLG3; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_TCa_; KW Gypsy-2_TCa-LTR; Gypsy-2_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4498 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG3; Positions 29081813 29077316. XX CC Positions [1742-2245] - Reverse transcriptase CC Positions [3354-3863] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 152..3535 FT /product="Gypsy-2_TCa-I_1p" FT /translation="MGTEKVVADMMTMFTDALRVMEQNRVASEERLMKMML FT EQQNRGTTIQTVPDFSKSVETFDGDCESGAAIEWLEKINVTAEIHSWSNEC FT CLETARNHLVGAARKWYDAHRTEIKTWTDFVTQFKKTFEIRVSTTECWRQM FT QSRQQIPGESVSTYFHDKFQLCQKLKLPFAETKEQIVVGLSNKNIVQGLVA FT KSHTDADELLHDIIEFSRILNNSRAFSGINERSKKPSKNINEIKCYNCLQA FT GHLASKCTQPKQTICFQCKQTGHVAKDCKNSYGEKSQNFSRKPVATVKKEV FT SLISSHQKPREDSDGIEKYLRVTKVDDVNIHAMIDMGSSVCTVRSSVVKEN FT NWPITRKDVQLFGFGNNSPIPCLGVVNAGIQIDEVRVDNVDICIVADDAQS FT VDMIIGRPFTENENLIYLRVDDKLIFKKREDFPFSEWEISTDEHKKTAEDS FT EVVSEECEITEDQIIVDPSTPPEITHELIKLFNEFRSCFAFNLKELGCTDV FT MKMKIRDTNVPVSAKPYRASLREREIIKNIVRDWKENGIVTETNSQYASPV FT VLVQKKSGEPRLVVDYRRLNKQTEKVPFPIPSIDEQFEMLSESRIFSTLDL FT AHGYLQIPLDEESKEKTAFITPDETGQFERLMFGLTNGPYEFCRLMHIVLG FT PLRNKVCMTYIDDILLGAKDWTDMLDKLKMVLTALKKAKLTLKLRKCVFGL FT KEVEFLGYTISSEKMQPAKSKMRAISEFPTPKNIHELKRFLGLTSYFRRFL FT RNYAIRARPLTDLTRKTQTFTWGEDQSAAFEDLRDSLCAEPVLKLYNPKAT FT ITELHTDACSAGVAGMLLQTDSAGQLHPVYYISKKTTDVEEKYHSTKLELM FT AVVWCIERLRNLLIGLKFVVYTDCQAITFLNNFKTQNAQICRWFNLLQEYD FT VEIKHRPGGRMAHVDALSRAPVDDSQDTLDHIHERLEVLQTLDETDYVLMI FT QYGDAELRELIQILKKPNEERTPVERQRVLPYKLDSARLLRNVNVNGSGKW FT LYVIPKSMRKSIVVKHHDQLGHFGVDRTVAKIRERYWFAKMKQYVQRHISA FT CAECLLNKIPSGKRPGELHPPRPPRRPFEKNSFRPCRPFSKNRRRKFLHSC FT GNRCFNEVCETVPHEEYKIQ" XX SQ Sequence 4498 BP; 1531 A; 810 C; 999 G; 1158 T; 0 other; atcagaagtg ggattttccc aaaaagtcca gtgtgtggag aaaataagtg agcgcgtgtg 60 tttggcgttt gtaaaagtgg tgttacagcg tcattcaatt tttctctttt gacgaattgc 120 gatcagccga tcagtggaag aaatcatcat catgggtacc gagaaggttg ttgcggacat 180 gatgacgatg ttcacggatg ctttacgagt catggaacaa aaccgtgttg ccagcgaaga 240 aagactaatg aaaatgatgc ttgaacaaca aaatcgtgga acaacgattc aaactgtgcc 300 cgatttttcg aaatcagttg agacatttga tggcgattgt gagagtggtg cagcaatcga 360 atggttggag aagataaacg tcacggcaga aatccattcg tggtcaaacg aatgttgctt 420 ggagacagcc aggaaccatc ttgtgggagc ggcaagaaaa tggtatgatg cccatcgcac 480 ggaaatcaaa acatggaccg acttcgttac ccaatttaag aaaaccttcg aaatcagagt 540 aagcacaacg gagtgttggc gacaaatgca aagtaggcaa cagatacccg gtgagtctgt 600 atccacatat tttcacgata aattccaatt gtgccaaaaa ttgaagttac ctttcgcgga 660 aacaaaggaa caaatagttg tgggactatc gaataaaaat attgtgcaag gactagtcgc 720 gaaatcgcat acagacgcgg atgaacttct tcatgacatc attgaatttt cgagaatttt 780 gaataattcg cgcgcgtttt ccggtataaa tgaacgttcg aaaaaaccga gtaaaaacat 840 taacgaaatt aagtgctaca attgtttaca agcgggacat ttggcgtcaa aatgtactca 900 accaaaacaa acgatttgtt ttcaatgcaa acaaacgggt catgtggcaa aggattgtaa 960 aaattcttat ggggaaaaga gtcagaattt ttcgcgaaaa ccagttgcaa cagtcaagaa 1020 agaagttagt cttatttcgt cccatcagaa gccaagagaa gatagcgatg gaattgaaaa 1080 atatttacga gtgacaaaag tagacgacgt aaatatccat gcaatgattg atatgggcag 1140 ttctgtttgc acagttagaa gttcagttgt caaagaaaac aattggccca tcacgagaaa 1200 agatgtgcaa ttgtttggtt ttggaaataa ttcaccaata ccatgcttag gtgtagtgaa 1260 tgctggtatt caaatcgatg aagttcgcgt agataatgtt gatatctgca ttgttgctga 1320 tgatgcccaa agtgtggaca tgattattgg cagaccattt actgagaatg aaaatctaat 1380 ctatcttaga gtggacgata agctaatctt taaaaaacga gaggattttc ccttttcgga 1440 atgggaaatt tcgaccgatg aacataagaa aacggcagaa gattcagaag ttgtttccga 1500 agagtgtgaa attactgaag atcaaataat agtagaccca tcaactcctc ctgaaattac 1560 acacgagcta atcaagcttt tcaacgaatt taggtcttgt tttgcattta atttaaagga 1620 actaggttgc acagatgtga tgaaaatgaa aattagagat acgaatgttc cagtaagtgc 1680 caaaccatat cgagcaagtt tgcgagaacg agaaattatt aaaaatatag tccgtgattg 1740 gaaggaaaat ggaatagtga cggaaacaaa tagtcaatat gctagcccag tcgtcttggt 1800 ccagaagaaa agtggtgaac ctcgactagt tgttgactac aggagattga ataaacaaac 1860 tgaaaaagtt ccttttccaa ttccaagtat tgatgaacag tttgagatgc tgagtgaatc 1920 aagaatattt tcgacattgg acctcgctca tggatatttg caaattccgt tagatgaaga 1980 atcgaaggag aagactgctt ttataactcc agacgagaca ggccagtttg agaggttaat 2040 gttcggatta acaaatggcc catatgagtt ttgccgatta atgcacatag tccttggtcc 2100 actcagaaat aaagtgtgca tgacttacat tgacgacatt ttgcttggtg cgaaagattg 2160 gacagatatg ttagacaagt taaaaatggt tttaacggct cttaagaaag caaagttgac 2220 tttaaaattg cgtaaatgtg tatttggact aaaagaagtt gaatttcttg gctacacaat 2280 ttcaagtgag aagatgcaac cagctaaatc caaaatgaga gccatatcag agtttccaac 2340 gccgaaaaat attcatgaat tgaagagatt cttaggacta acaagttatt ttcggagatt 2400 tttgcgaaac tacgcaatac gtgcccgacc tttgacggat ttgacaagga agactcagac 2460 atttacgtgg ggagaagacc aaagtgcagc ttttgaagat ttgcgcgata gtttgtgtgc 2520 tgaaccggtt ttaaaactgt ataatccaaa agcgactata accgaattac atacggacgc 2580 atgtagtgcg ggcgtggcgg gaatgctact tcaaacggac agtgcaggac aactacatcc 2640 cgtatattat atctcgaaga aaaccactga cgttgaggag aagtatcatt cgacgaaatt 2700 agaactgatg gcggtagttt ggtgtatcga gagattgcgc aatttgttaa tcggactaaa 2760 atttgtggtc tacacggact gtcaagccat aacatttcta aataatttta agacgcaaaa 2820 tgcccagatc tgtcgttggt ttaatttgtt acaggaatat gacgtcgaga ttaaacatcg 2880 tccaggtgga agaatggccc atgtggatgc cctttcccgt gctcctgtcg atgactctca 2940 agacactctt gaccacattc atgagagact agaagtcctg caaacacttg acgagaccga 3000 ctacgtgcta atgattcaat atggagatgc tgaacttcgt gagctaatcc aaattttgaa 3060 aaaacctaat gaagagagaa caccagtgga acgtcagcga gtgcttccct acaagttaga 3120 tagtgcgaga ctactcagaa acgttaacgt gaatggttct ggaaaatggc tgtatgtgat 3180 cccaaaatca atgaggaaaa gtattgtggt gaaacatcac gaccaattag gacatttcgg 3240 cgttgaccga acagttgcaa aaattcgaga aagatattgg ttcgcgaaga tgaaacaata 3300 tgtacaaaga catatcagtg cgtgtgcgga gtgtttacta aataaaatcc catcaggaaa 3360 gcgacccgga gagctacacc caccgagacc accacgacga cctttcgaaa aaaattcatt 3420 tagaccatgt aggcccttta gtaaaaaccg acgacgaaaa ttcttacatt cttgtggtaa 3480 tcgatgcttt aacgaagttt gtgaaactgt accccacgaa gagtacaaaa tccagtgaaa 3540 gtgtaagcgc cttacaaaac tgtattgaca attacggaat accccgtctc gtagtggcgg 3600 accagggttc atgtttcaaa tcagatgaat tcggtgaatt ttgcgacagc ttaggcatac 3660 aggttttatt aatacctccc cggtggcctc aagcaaatgg acaagtggaa agagtaatgc 3720 gaactcttat acccacatta atgtgcgaaa tggatatcga agaacaatgg gacaaaaaga 3780 ttgtcaaggt agaacgcaat ttaaattcta tgttgaataa aaccactggt agagttccat 3840 ttgaagccct tcatggatac ttacccgtgt tcgacgatgg aaaactcact aagcttgcag 3900 aaggcgaaga ttgtacctgg actcctcccg aaagtatcca agcagaaatt cgtgaagcaa 3960 ttgttgaaaa acaaaaagca tataaaagac gatatgataa gaagaagttc caaggtgtga 4020 catacaacgt tggagacatc gttatgttca aaacctacat ccctggagga accggacaat 4080 catcaaaaac taaagccaaa tatagaggac cattaacagt tattgagaaa ttgccttcgg 4140 atatatatcg tatctccagt cttgcggatg agggacgcat ttttacgact acagccaatg 4200 tttctcaact caaactctac agaaacccaa tcgaagacac agaacccgaa gacacagaat 4260 ccgaagacag tgacagtagt gaagatgaac ttccacaaat cgaaactgtt gaagttcaga 4320 tacattctac accagatgca attgtaaccg aagaactacc aaaacgaaat agacggatgc 4380 caaagaaact ggctgactat aaactatttt aagtgatttt gtttcattag tgtattaaat 4440 ttcattgctc cttgttttat gtcgagcgga ggtccgctcg aaggtcagaa tgaccgag 4498 // ID BEL-31_AA-I repbase; DNA; INV; 6161 BP. XX AC supercont1.241; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-31_AA_; KW BEL-31_AA-LTR; BEL-31_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6161 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.241; Positions 1408702 1402542. XX CC Positions [5212-5772] - Integrase core CC 'AACAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 21..2063 FT /product="BEL-31_AA-I_2p" FT /translation="MSNPDHSANTSNPATPANEQTTSCECCDRPETIDDCV FT QCDQCNGWWHMTCAEVTASVADRSWTCSHCLPLSVSSRTTTSSVKAARLAL FT KKQQLEEQQAMEQRHLAEKYKLLQEELNEMDETGSNRSRISRRTSLDKVKQ FT WQQKCVEQSEGAQGLPPVNQSVNLAAVPQASLNDSRDGSSMQQVPSKSNVS FT AVPPADPNQTQQSDVHACKLVSVGLTQGYSEHKPPLSSTVKACSLPKFNHA FT KRDQQQYTGAISKAIPNQQHLANAVGKSHSQEHPVAQQGKPLRDPFIIPPQ FT SSKYDNSLQQIVKQFGSLATSATASSFHTNMLPFGTNTLSLPETSVGIAPP FT VNPNQSSISAFPAVGLSNVGPSPHANPTLYSGPVPSVPGLQNVGSLPQPNP FT VTSFGIVPPPSGLPIIDQFTPSPSQLAARQVMSRDLPPFSGDPADWPIFIS FT SFMNSSLACGYNSAENLARLQRCLKGPAYESVKSRLLLPDSVPQVIDTLRL FT LYGRPELLINALLQKVRSVSAPKAEKLETIIDFGMAVRSLCDHLEAAGQRE FT HLSNPTLLMELVEKLPAHTKMQWADYMQQHPVVNLKVFGDFMLGVITAVSR FT VTMYVSGSSGSQQKLKQKAAINAHTSETDPVREPVREKERVCVCCKKSGHR FT IAECSIFKSYPVDNRWKFAQKWTVSELLKRAW" FT CDS 2959..6159 FT /product="BEL-31_AA-I_1p" FT /translation="MAVKRLECLERKMVRDPELAANLKNQISEYQLKGYAH FT RATREELAQADPKRVWYLPLGVVTNPRKPGKVRLIWDAAAKVDGISLNSML FT LKGPDQLTSLPAVLSRFRQFKVGVSADIREMFHQLRIRESDRHSQRFLFRS FT DPMKPIETYLMDVATFGSTCSPASAQYVKNKNAEEFSELFPRAVEGILENH FT YVDDYLDSFGNEEEAERVSSEIRSIHQSGGFQLRNWLSNSAVVLRGLNEVD FT PRASKNLCWNTNDSSDRVLGMLWQTADDELRFSMKLKEEVQQVIDSRKRPT FT KRQMLRCLMGIYDPLGLLGVFIVHGKILLQDVWRTGLQWDEAVPDEIFERW FT IRWTSLFPKIGDLRIPRCYFKAATDKMYERLQLHVFVDASEAAFSAVAYFR FT VTNKESYSECTLIAAKTKVAPLKPLSIPRLELQAAVLGSRLMSFVQESHSI FT EVKQRYLWSDSATVLAWLRADHRRYKQYVACRIGELLSTTEVAEWRWVPSK FT LNPADAATKWVKNACPGVTDVWFKGPNFLKESEENWPKQVRLANHPEEELR FT PCFVIQESIIPECVVDFTRFSKWRRLLGTVAYVHRFIDNCKRRQKKENLQR FT LHLTQDELKKAKNTLMRIVQWQEYPDEMTLFSKGLRELPKTSTLYQLTPTM FT DEYGVLRVDGRIGAAPHVAFDAKFPVILPRKHLVTKLLVDDFHRAFRHGNS FT ETVVNEIRQYYHIFQLRTVVKQTAAACQWCKVMKAAPKVPRMASLPVTRLS FT AFVRPFTYTGIDFFGPLLVKVGRSSAKRWICLFTCLTTRAVHVEVAYSLST FT PSCVKCVRRFVCRRGAPAEIYTDNGTNFLGAERLLREQLKTLHSDLAATFT FT NADTKWSFIPPGAPHMGGAWERMVRSIKSAMETAYNSDRKLDDEGLETLVV FT EAEGIVNSRPLTYLPLDAEEGESLTPNHFLLGSSRGVRQPAMPLNDPATAV FT KNSWNLIQHQLDIFWKRWIREYLPMLTKRMKWFGEVRPVAVGDLVLIVDES FT RRNGWTRGRVQEVMTAGDGRVRQAIIQTARGMLRRPVSKLAVLEVESGGKT FT GTDGQCYGGE" XX SQ Sequence 6161 BP; 1697 A; 1438 C; 1612 G; 1414 T; 0 other; aatctttaaa aattatcgct atgtcgaacc cggatcactc agccaacacc tcaaacccag 60 caacaccggc caatgaacaa accactagct gcgagtgttg tgatcgtccg gagacgatcg 120 atgattgcgt tcaatgcgat caatgtaatg gatggtggca tatgacgtgt gcagaagtga 180 cagcatcggt agcagatcga tcgtggacct gcagtcactg tttacctttg agtgtttctt 240 cgcgaacaac aacatctagc gttaaagcgg cacgtttggc tttgaaaaag cagcagctag 300 aagaacaaca agcgatggag caacgtcatc tggccgaaaa atacaagctt ctccaggagg 360 agctgaacga gatggatgag accggtagca acagaagcag gatcagtcgg cgaacgagtt 420 tggacaaggt gaaacagtgg cagcagaaat gtgttgagca gtctgaaggc gcgcagggtc 480 taccgccggt gaaccaatcg gtgaaccttg cagcagttcc tcaggcatca ctaaacgact 540 cgagagacgg cagttcgatg caacaggttc catcaaaatc caacgtttca gcggtacctc 600 cagcggatcc gaaccaaaca cagcagtcag atgttcacgc atgcaagcta gtttcggtag 660 gcttgactca gggatacagt gagcacaagc ctccgttgag ttcgacggtg aaagcatgta 720 gtcttccgaa gttcaaccac gcaaagagag atcagcagca gtacaccggc gcaatctcta 780 aggctattcc gaaccaacaa catttagcca acgctgtagg taaatcccat tcccaagaac 840 accccgtagc acaacaaggt aagccacttc gtgatccttt tatcattcca ccccaatcat 900 caaaatatga taactccctc caacaaattg taaaacaatt cggaagtctg gcaacatcag 960 ccacggcttc ctctttccat acaaatatgc ttccatttgg aacgaatact ctttcactac 1020 cagaaacgag tgttggcatt gcgccaccgg taaatccgaa tcagagttcc atatcagcct 1080 tcccagcagt gggactatcg aatgttggcc cttcgccaca tgcaaatcca accttgtatt 1140 ctggaccagt cccttcagtt ccgggactac agaatgttgg ctctttgcct caaccaaatc 1200 cggttacatc tttcggaata gttcccccac cgtcgggact accgattatt gatcagttta 1260 ctccctcccc ctcgcagtta gccgcgcgtc aagttatgtc acgcgattta ccgccgtttt 1320 ctggcgatcc tgctgattgg cctatattca tcagcagctt tatgaatagt tcgctagcat 1380 gtggctacaa cagtgctgaa aatttagcgc gacttcaacg ctgtttgaaa ggccccgcgt 1440 atgaatcagt caaaagccgg ttattacttc ccgattcagt gccccaagtg attgatactc 1500 ttcgtttgct gtacggtaga cctgaattgt taatcaacgc cttactacaa aaagtgcgca 1560 gtgtttcagc accgaaagca gagaagttgg agaccatcat cgacttcggc atggcagtac 1620 gtagcttgtg cgatcatctg gaggctgctg gtcagcgtga acatctttcc aacccaacgt 1680 tgttgatgga gctagtggag aaactgcctg cacataccaa aatgcagtgg gcggattata 1740 tgcagcagca cccggtggtc aatctcaagg tgttcggtga tttcatgctt ggagttatca 1800 cggcggtcag cagagtaacc atgtacgtga gtggaagtag tggcagccag cagaaattga 1860 agcagaaagc ggcaattaac gcccacacca gtgaaacaga tcccgtccgt gagccagtac 1920 gagaaaagga gcgagtttgt gtttgctgca agaagtcagg tcaccgaata gccgaatgtt 1980 ccatcttcaa gtcatatcct gtcgacaacc gttggaagtt tgcgcagaaa tggactgtgt 2040 cggagttgct taaacgcgca tggtagaaga agttgcagga atgcaacgca gtgcgtcttc 2100 gagggttgcc aataccggca tcatccgttg ttgcattcga atagatccgc ttctggtgca 2160 cgatcgtcgc aacaattgtc aaccgtgcag aatcacattc accgacagta caagccggtg 2220 cttctgttcc gcattgtacc ggtcattatc tccgggcctc aagcaactgt cgaaacattc 2280 gccttcttgg acgacggttc agacctttcg ctcatcgaga acagtttggt ggagcaattg 2340 ggagttgatg gatggaggaa gccgttatgt ctaaaatgga cagggaacgt caccagagtg 2400 gagtcggatt caaagcacgt acggatactg atcaagggag cgagcagcca gcaagaattc 2460 tctctgaacg acgtccgtac cgtggaagaa ctcactttac cagaacaaag tctagactac 2520 ggtgaactgt cgcaacgcta ccgttatctc aaaggtttgc cggtagtcag ttacaccaaa 2580 gcagtacctc gtctgttgat aggcgtcaac aacgcaagtc tcactgtccc actccaagtg 2640 agagaaggca agaaagatga acctattgca gtaaagacta gacttggatg gtgcatattt 2700 ggcggtcgtg gtaaggaggc atcacattcg ctgaactacc acgcctgcga atgctcaagt 2760 gaccaggagc tgcataacgt agtaaaagaa tacttcgcaa tagaagatgc aggagtgaag 2820 ccaccagttg tgctggaatc ggaagaggat aagcgggcga ggagaatcct ggaacagacg 2880 actgttcgca ttggcgagcg ttttgaaacg ggtctactgt ggaagtacga cgtcattgaa 2940 ttcccggaca gttacatcat ggcggtaaaa aggctggagt gcttagaacg caagatggta 3000 cgagatccag agttggcggc taatctgaag aaccaaatat cagagtatca gctgaaagga 3060 tatgcacatc gagcgacacg ggaagagtta gcgcaagcag atccgaagcg cgtgtggtat 3120 ctgccgttgg gcgtagtaac gaatccacgg aaaccgggca aagtgcggct catatgggac 3180 gcggctgcta aagtggatgg aatatccttg aactccatgc tgctcaaggg tccggaccag 3240 ctaacttcgc ttcctgcagt tctatcacgt tttcgacagt tcaaggtggg cgtatctgca 3300 gacattagag aaatgttcca ccagctacgg atccgtgagt ccgatcgcca ttctcagcgt 3360 ttcctattcc gtagcgatcc gatgaaaccg atagaaacct atctaatgga tgtggcaacc 3420 tttggatcca cgtgttcacc tgcatcggct caatatgtga aaaacaaaaa cgccgaagaa 3480 ttttccgagc tttttccgcg agctgtcgaa ggaatcctgg aaaaccacta cgtggatgat 3540 tacctcgata gttttgggaa tgaagaggag gctgaacggg tgtcaagcga aatccgatca 3600 attcaccaaa gcggagggtt tcaactcagg aactggctgt cgaatagtgc cgtagtcctg 3660 cgtgggctca atgaggtgga ccctagggcc agtaaaaacc tatgttggaa cacaaacgat 3720 agtagtgatc gggttctggg catgctatgg cagactgctg atgacgagct ccgattttca 3780 atgaagctga aagaagaggt tcaacaagtt atagacagca gaaaacggcc cacaaagcga 3840 caaatgttga gatgcttaat ggggatttat gatcccctgg gtcttctagg cgtgttcata 3900 gtccacggta aaatacttct tcaggatgtg tggcgaacgg gtttacagtg ggacgaggcg 3960 gtgccagacg aaattttcga gcgttggata aggtggacca gcctgtttcc caagatcgga 4020 gacctgcgga tacctcgatg ctattttaaa gcagcaactg ataaaatgta cgagcgacta 4080 caattgcatg tcttcgttga tgcaagtgag gccgcatttt ccgctgtggc gtacttcagg 4140 gtaaccaata aggaaagtta ctcggaatgc actttaattg ccgcaaaaac aaaagttgca 4200 ccacttaagc cactttcaat tcctcgttta gaattgcaag ccgcagttct aggcagccgt 4260 ttaatgtcct tcgttcagga gagccacagt atcgaagtga agcagcgata cctgtggagc 4320 gattcagcaa cagtattagc ctggttgcga gcggatcatc gccgctacaa acagtatgta 4380 gcttgccgga taggagagct actgtccacg acggaggtag cagaatggcg atgggtgcct 4440 agcaagctca atccagcaga cgcagcgaca aagtgggtta aaaatgcgtg tcctggtgta 4500 acagatgtat ggttcaaagg accgaatttc cttaaagaat ccgaggagaa ctggccaaag 4560 caagtacgac ttgcaaatca tccagaagag gaattacgac catgttttgt aatccaagaa 4620 tcaattattc cggagtgtgt ggtggatttt acgcgcttct caaagtggcg acgactgcta 4680 ggtacagtag cgtatgtgca tcgtttcatc gataactgta agcgtagaca gaaaaaggaa 4740 aatttgcagc gtctacatct aactcaggat gagctgaaga aagcgaaaaa tactttgatg 4800 cgaatcgtgc agtggcagga gtacccagat gagatgactc tattttcgaa aggattacgt 4860 gagctgccca aaacaagtac cttgtaccag ctgacgccaa caatggacga gtatggagta 4920 ctgcgcgtcg atggaagaat cggggcggca ccgcacgttg cgttcgatgc taagtttccg 4980 gtgatcctcc ctaggaaaca tctggtaacg aaacttctgg tggacgattt ccatagagcc 5040 ttccgtcacg ggaactccga gacagttgtg aacgagattc gccagtatta tcacatcttc 5100 caattaagaa cggtagtgaa acaaaccgcc gccgcatgcc aatggtgcaa ggtgatgaaa 5160 gcggctccta aagttccaag aatggcatcg ctaccggtaa cacgtttgtc ggcgtttgtt 5220 cgaccattta cgtacaccgg aattgatttc ttcggcccgc tgctggtaaa agtgggaaga 5280 agttctgcca aacgttggat ctgtctattt acatgtctga ctacccgggc cgttcatgtg 5340 gaggttgcct atagcttatc cacgccttcc tgcgtcaagt gcgtccgccg ttttgtctgc 5400 cgccgaggag caccggcaga gatctatacc gataacggta caaatttcct aggtgccgag 5460 cgtctattgc gagaacaact taagacgcta cacagtgatt tggcagcgac tttcaccaat 5520 gcagatacca agtggagctt cattccacca ggagcgcctc acatgggtgg tgcttgggaa 5580 agaatggtgc ggtcaattaa gtcggccatg gaaacggcct acaatagcga taggaaactc 5640 gacgatgaag gactggagac attggtggtc gaagccgaag gaatagttaa tagcagaccg 5700 ctcacctacc tgcctttgga cgccgaagaa ggagaatctc tcacaccaaa tcactttctt 5760 ctcggaagct ccagaggtgt ccgccagcct gcaatgccgc tcaacgaccc tgctacagcg 5820 gtgaagaact cctggaactt aattcaacac cagttggaca ttttctggaa gcgatggatt 5880 cgggagtatc tcccgatgtt aacgaagcgt atgaagtggt tcggagaagt taggcctgta 5940 gccgttggag acctggtact tattgtggac gaatcccgga ggaacggatg gactcgtgga 6000 cgagttcagg aagttatgac ggccggagac ggaagagttc gacaagctat cattcagact 6060 gcgaggggga tgttacgtag accggtatcg aagctggccg tgttggaggt agagtcaggt 6120 ggtaaaactg ggaccgatgg ccagtgttac gggggggagg a 6161 // ID BEL-7_SI-I repbase; DNA; INV; 6252 BP. XX AC AEAQ01030469; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_SI_; KW BEL-7_SI-LTR; BEL-7_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-6252 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030469; Positions 642 6893. XX CC Positions [5135-5686] - Integrase core CC 'CTTTT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 861..3110 FT /product="BEL-7_SI-I_2p" FT /translation="MSKDELSKLIVRGRIKAKLTAFSTYLERAATESVKIA FT ELPIRLEKAEGFWQEFDITQSQIEDLENTVAQFTERENFENLYHAIITSAR FT ALQSRQDQANRPSSVNSVPERSVGNPQVNVFAQPVGPTKVKLPNIELPKFD FT GSYNKWIPFRELFESLIDANAALPAIQKLHYLKLALTHEAAKVIQSLELSN FT ANYEIAWDLLKQRYENKRLIVHHMQELLDLAPIVKESHAALRQFIDGISQH FT IQPLIKLGQPVEHWNTILIHIFTPKLDKITKREWELKRATIDTFPTLNEFI FT EFLNTRSAFLESLSYSANNTNSVSSVNKSHNKAVICACVSEEKSSCPVYQG FT NHELPDCGTFRKWSAAEHMTEIKKRRLCIRCLKNFHGRNCKASECKRCKGY FT HHTMLHLEKTIKPDKEANQTEPSTGATKTSATDKQIVATVTQAPTTTLMHC FT ATKGPLQVMLATAIIYIKDQKGILHECRAFLDSGSHSNFITCKLSDRLQLL FT KQQTTMTITAINEIPVKTSQQITATIQSRYNAFQATLPFFTIPKITKRLPL FT NKVDLRKVEIPERIQLADPEFHIPATIDILLGASVFWDLLYVGQIKIFKQQ FT PTFRKTQLGWIVAGAMELHHMPTTMTTYCGFSSERTLCDQLQKFWQIEEIT FT SSRHLSQEETTCEKHFTDTIRRTEDGRFVVQLPLKRNPSKLGDSYEIARKR FT FQSIERKLNKDPTLKKEYHAFIQEYLQLGRMSEVEERDSTVKQAICRIMR" FT CDS 3200..6076 FT /product="BEL-7_SI-I_1p" FT /translation="MMVGPTIQDELFCIILRLRQHKYVMSADITKMYRQIW FT MHESQRDLQRILWRWTPDEPLRTFRLNTITYGTSSAPFLATRCLTEIAQLH FT HSKHPAAAEIIRRDFYVDDLLTGKDEIGELVKLQRDIIQILKTAQFELRKW FT RSNAPELCTTEIDKESTVPIGEEVKTLGLPWNTTSDSLTYKACTNSSNKRI FT TKRSMLSDIAQLYDPLGLLGPVIVQAKIMLQELWQLKLGWDESMPMQSHSS FT WKKWRQHIECVNHISIARQVICDKPTFIELHGFCDASEKAYGACVYICSIN FT AKQERTIKLVCAKSRVAPLKRISMPRLELCGVLLLAQLCGKIKDAMIIPIK FT DTHHWSDSTITLAWIAGEPHQWHTFVANRISEIHRLTDKHKWHHVRSDQNP FT ADLLSRGVNAEDLKGKQLWWEGPSFLQQQSGFNSHTPPDLKDIPERKGTNP FT CLAAMESEGPDTTLIQRFSSFTRLKRVVAYCLRFAANARLKGSRLYLKRSG FT VLSVQELEDSTVVIIKQVQSVEFADELRSLQSNSPVNHKSELRNLHPFLDS FT KGQIRVEGRLSHTQLPYSQKHPIVLPGKHHITELLVRHEHYRNLHAGQQAV FT LPAIRTRYWPLSGRIAIRRVLRRCIVCFRAQPVCIKQLMGDLPRHRVIQAR FT PFLNTGLDYAKPFSVKLSRNKTSKAYVCLFVCFAVKAVHLELVNDLSTTCF FT LNALKRFVARRGRCINLYSDNGTNFVGANNALKEVKEILTKEATQDQIKTF FT LAEQSIAWHFIPAYSPHMGSLWEAAIKSAKTHMRKVIGSMTLTLEELYTVL FT TQIEACMNSRPLTLISQDPNDLIALTTGHFLIGEPLTAIPECEVSDVPSNR FT LNRYEYLIQLKQHFWKRWSQEYIAQLQPRTKWQIAKENNIRIGTMVLLKNE FT NTPLMTWLMGRILESHPDKKGLIRVVTIRTKNGTCQRAISKICLLPVDSMP FT LTDA" XX SQ Sequence 6252 BP; 2018 A; 1363 C; 1387 G; 1484 T; 0 other; ttagtcattc gagccggatc gggcaccaag aagaggttag ccataatccg tccacgcaac 60 aagtaaggct gggcggacac acgcatctcg gaaggacgca gacgaatgag acagatccac 120 gataccgtcg ccaagtaaag aaggtcagtt aagggcagat agtcgattga aggtacgaga 180 ccacgcagga aatagcaaga ctcgcgccgt ctttacatta gaacggtacc gacgatcggc 240 gaaactagca tcaaggcatc cccccccccg cccgaagtaa aacaacgagc gtgaggcata 300 acagcaacgc acgtttgcac gcaaagaaat aaattctctc accgcctcgt tctttcgggg 360 ggacagagga aagaactcaa caatcgtggt gaatgaaggg ccagagagcc gatcgaggta 420 agcaaagatc gacgaagaca gcaaagcgta tcgttctccc ctcgccgctc ggcagaagtc 480 actaacgagc gagaggcaaa gcgacgacgc gcttcaacca aggttaggtt aggatagccg 540 gtcgaggtaa cgacgatcgg cggatacatc gacagcgaat tgtcccccct cctcccccac 600 gccgctcggc agccgaaatc accattgaac gagagatatt acgaagacgc gcgtcttttg 660 cgaaaaggag aatattgccc cgccgccccc tcctcagtaa acagaagcag gagcttcgtc 720 aagcgcggcc atcattaagg cgacatctca aggagcgcaa gtattctact caaacgacat 780 acgtggtcta acgagtttgt tctttattgt tgaaaaaaaa aattgtattg agtaattaat 840 attaagtaat attaagtaaa atgtcaaaag atgagctgag caaattaatt gtgcgaggta 900 gaattaaagc gaagttaacc gcgtttagca catatctcga aagagcagca acagagtccg 960 taaagatcgc ggaattgccc atcagattag agaaggcgga agggttctgg caagagtttg 1020 atataactca atcacaaatt gaagatttag agaatacagt agcgcaattt actgagagag 1080 aaaatttcga aaacttatac catgccatta taacttctgc cagggcactt cagtcgcgtc 1140 aagatcaagc aaatcggcca agttcagtaa actctgtacc tgagaggtca gtaggcaatc 1200 cacaagttaa tgtttttgct cagccggtag gacccaccaa agtaaagttg ccaaacatag 1260 aattgccaaa gtttgacggg agctataata aatggattcc atttcgcgaa ttgtttgaat 1320 ctcttataga cgctaatgcg gcgctaccag caattcaaaa attacattat ttaaaactgg 1380 cattaactca tgaggccgct aaagtaatac aatcgttaga gctctctaat gccaactatg 1440 aaattgcatg ggacctactg aaacagagat acgaaaacaa acgattaata gtacatcata 1500 tgcaagaatt gcttgatctg gcaccaattg taaaggagtc tcacgctgcc ttgcgacaat 1560 ttatagatgg catttcacaa catattcagc ctttaattaa attaggacag ccagtagaac 1620 attggaatac aattttaatt catattttca cgcctaagtt agataaaatt actaagcgag 1680 aatgggagtt gaaaagagct acgatagata cattccccac actaaatgaa tttattgaat 1740 ttttgaatac tcgcagcgcg tttcttgaat cactgtctta ctcagctaat aatacaaact 1800 ctgtttccag cgttaataaa tcacataata aagctgtcat atgcgcatgt gtgtcggaag 1860 agaaatcaag ttgtcctgtt taccaaggta atcatgaact acctgattgt ggcacctttc 1920 gtaagtggtc tgcggcagag catatgacag aaattaagaa aaggcgactg tgcatcaggt 1980 gtttgaagaa ttttcacggc aggaactgca aggcgtcaga atgcaaacgg tgcaaaggct 2040 atcatcatac catgctacat ctcgagaaaa cgataaagcc ggacaaagaa gcaaatcaga 2100 cggagccttc tacaggagca accaagacgt cggcgacaga taaacagatc gttgccacgg 2160 tcactcaggc accaacaacc acattgatgc attgcgccac taaaggaccg ctacaagtaa 2220 tgttagcaac ggccataatc tatataaagg atcaaaaagg catacttcat gaatgtcgag 2280 ctttcttgga cagtggttcc cactccaatt ttatcacctg caaactcagt gaccgccttc 2340 aattgctgaa acaacaaacc acaatgacga tcaccgcaat taatgaaatt cctgtaaaaa 2400 ctagtcagca aattacggca acaatccaat ctcgctacaa cgcatttcaa gcaacgcttc 2460 catttttcac cattccaaaa atcaccaaga ggctgccact aaataaagta gacttgagga 2520 aggtcgaaat tccagaaaga attcagttag cggacccgga atttcatata ccggctacca 2580 tagatatact attgggagcg agtgtctttt gggaccttct gtacgtagga caaataaaga 2640 tattcaagca gcagccaacg ttccgaaaga cacaacttgg atggatagtt gcaggagcaa 2700 tggagctcca ccacatgccg acaacaatga cgacgtattg tggcttctcc tcagaacgga 2760 cattgtgcga tcaattgcaa aaattctggc agatagaaga gattacatcg agccgccatt 2820 tatcacagga ggagacaact tgtgaaaaac attttactga cacaatcaga cggacggagg 2880 atggtagatt cgtagtacaa ttgccactaa agagaaatcc ttcgaaactg ggagattcct 2940 atgaaatagc gagaaagaga tttcaatcaa ttgaacgcaa attaaataag gaccccacat 3000 taaagaagga atatcatgca tttatacagg aatacttgca gctgggccgt atgtcggaag 3060 ttgaagaaag ggacagcacc gtcaaacaag ctatctgccg catcatgcgg tgattaaagc 3120 agaaagcacc accacgaaag tgagagtggt ctttgacgcc tcttacccca ctagctcggg 3180 aaattcgcta aatgacatta tgatggtcgg gccaacgata caggacgaat tattttgtat 3240 catcttacgt cttcggcagc acaaatatgt gatgtcagcc gacatcacta agatgtacag 3300 gcaaatctgg atgcacgagt cacaacgcga ccttcaaaga attttatgga gatggacacc 3360 cgatgagcct ctacgtacat tccgattaaa tacgattacc tatggtacgt ccagcgctcc 3420 ttttttagca acgagatgcc tgacagagat cgcgcaacta caccatagca aacatcctgc 3480 agcggcagaa atcatcaggc gagatttcta cgtcgacgat ttgttgacgg gtaaggacga 3540 gataggtgaa cttgtcaagc tacaaaggga catcatccaa atattaaaga ctgcccaatt 3600 tgagctgagg aaatggaggt caaatgcacc agagctctgt acaacggaaa tagataagga 3660 gtctacagta ccaatagggg aggaagtcaa gacccttggc ttaccgtgga acactacatc 3720 agatagtctt acatacaagg cttgtacaaa tagcagcaat aaaaggatca caaagcggag 3780 catgctgtcc gatatagctc agctatatga cccgcttgga ttgttgggtc cagtgatcgt 3840 gcaagcaaaa ataatgttgc aagaattgtg gcaactcaag ctgggctggg atgagtccat 3900 gcctatgcag tctcattcaa gttggaaaaa atggcgccag cacattgaat gcgtcaacca 3960 catctccatt gcaaggcaag tgatctgcga taaaccaaca ttcatagaac tacacggctt 4020 ttgcgacgcg tcggagaagg catacggcgc gtgtgtgtat atctgcagta taaatgcgaa 4080 gcaagaacgc acaattaaat tagtatgcgc caaatcgcga gttgcaccac tcaaaaggat 4140 atcgatgcca cgtctagagc tttgtggagt gctcctgtta gcccaactct gtggcaagat 4200 aaaagacgcc atgataatcc ctataaagga cacgcaccat tggagtgact ctactattac 4260 gttggcgtgg attgcaggtg aacctcatca gtggcacaca tttgtcgcaa atagaatatc 4320 ggaaatacac cggctaacgg acaagcacaa atggcaccat gtgcgatcag atcagaaccc 4380 agctgatctg ttgtcgcgag gtgttaacgc ggaggactta aaaggaaaac aattatggtg 4440 ggaaggacca tcattcttgc aacagcagag cggttttaat tcacacacgc ctcccgatct 4500 gaaggatata ccagaacgca aaggaactaa tccgtgtcta gccgctatgg aatcagaagg 4560 gccggacacc actctcatcc agaggttttc atcgttcaca cgtttaaaaa gagtcgtcgc 4620 gtattgcttg cggtttgcag ccaatgcacg actcaaggga tctcgcctat accttaaacg 4680 aagcggagtt ctatctgtgc aggagctaga ggatagcact gtagtcataa taaaacaagt 4740 tcagagcgtg gagtttgccg acgagttgag atctctgcaa tctaattcac cggtcaatca 4800 taaaagtgaa ctacgtaatt tacatccgtt cctagattcg aaagggcaaa tcagagtgga 4860 ggggagactt agtcacactc agctaccata ctctcaaaaa catccgattg tgcttccggg 4920 gaagcatcat atcaccgagt tgctcgtaag acatgaacat tatcgaaact tacacgcggg 4980 acagcaagcg gtgttgcctg ctattcgtac acgatattgg cctttgtctg gaagaatagc 5040 gattcggaga gtcttgagaa gatgcatagt gtgttttcga gcacaaccag tttgtataaa 5100 gcagttaatg ggcgacttac ctcgtcatcg agttattcaa gccagaccat tcctcaacac 5160 ggggcttgat tatgccaagc cgttttcggt aaagctgtct agaaataaaa cctcaaaggc 5220 atacgtatgt ctattcgtat gttttgccgt taaagctgta cacttggaat tagtgaatga 5280 cttgagcact acatgtttcc ttaacgctct taaacgcttt gtcgcaagac gtggacgttg 5340 catcaattta tactctgaca atggtaccaa ctttgtgggg gccaacaatg cacttaaaga 5400 ggtaaaggag attctgacaa aagaagccac tcaagatcaa ataaaaactt ttttagccga 5460 gcaatcaatt gcttggcatt ttattcctgc atactcccca cacatgggaa gcctctggga 5520 ggctgcaata aaatcagcaa agacacacat gcgcaaggtg attggttcga tgacactgac 5580 cttagaagaa ttatatacag tattgacaca gatagaagcg tgcatgaatt cgcgtccgct 5640 gacgcttatt tcacaagatc ccaatgatct aatcgcacta acgaccgggc acttcctgat 5700 cggggaacca ctcacggcga ttccggagtg cgaagtgtcc gacgttccta gtaatagact 5760 caatcggtac gaatatttaa tacaactcaa gcaacatttt tggaaaaggt ggtcgcagga 5820 gtatattgca caacttcagc cacgtactaa atggcaaata gccaaagaaa acaatatccg 5880 tattggtacc atggtacttt taaagaatga aaatactcct ttgatgactt ggctgatggg 5940 tcgtattctt gagtcacacc cagacaagaa aggtttgatt cgagtggtaa ccattcgcac 6000 taaaaatggc acgtgccaaa gagctatatc aaaaatatgt ttgttacctg ttgatagtat 6060 gccgttaaca gatgcataat ttaagacccg tgtcacattc aaactatgtg ttcagaatta 6120 aaattagaat aatttaagca aattttaatt taagttgttc tgttccttat tagcatgttt 6180 atttacattt aagtcaaatt tatatactca agatatttgt tgtaaaacgt aagttttcca 6240 aggcgggcgg aa 6252 // ID Gypsy-1_RP-I repbase; DNA; INV; 6135 BP. XX AC ACPB02032162; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_RP_; KW Gypsy-1_RP-LTR; Gypsy-1_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-6135 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02032162; Positions 1241 7375. XX CC Positions [3764-4243] - Integrase core CC 'GATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 27..1163 FT /product="Gypsy-1_RP-I_2p" FT /translation="MPLTDPTMEALRREVEELRRANRELQTLDEERLRMEG FT ELTTLRTMVGTDGPSLASERESTGHSRRREMGLQALIKPWSGETGAPPAKD FT FLNEIEMVAESGAWTDHDKRLICKLKTTGAAAVFVSSHPIFSAAVSSFEDY FT KRAILERFQDHRTPEQNLLALNSVSQRRGEGVREFADRCRQLGEFATSHTG FT SGEQKQWERNFMERVVLAAFIQGLAGEPGRQLRFNPPTGLAEAVKRAHLVE FT EAESGPGIARSTLERGIYAIARNADSSYGRRGRCFNCAQEGHFARECPRPY FT QSPPREWKSSGTRGRVREPRARGVKCYTCGELGHICRMCPKRATPGSVRPD FT YVVRKSNYLEAAPKSLLPQSAPPAGGMPPRGQDTGL" FT CDS 1370..5059 FT /product="Gypsy-1_RP-I_1p" FT /translation="MWRGQTIAQEFLIAEIDVEGDGILGIDGMAALKMRLD FT MRTLEFEGDEGKGALDGRENVLLVGKAEPQSRFERNEFPEVEAERDESGDI FT YLVAAEKITIPPRCEVLVRGRASGWRSGATVLVEPYPTETNRDWCVGRGIS FT QVVHGAVWVRIVNVSQTEVMVGKSERLGWVTEIETQGSEWVGVCQESGGVP FT RARKGLGKLLGDKLGHLRGEERDLLATVLSQYEDVFQDPDTSSLGCTSQIK FT HTINTGNATPIAKRPYRVPYHQRETMKELVDDMLTKGVIEPSDSPWSAPVV FT LVRKKTSDGSVKYRFCTDFRALNAVTKVPVYPLPNINETLESLGRSRYFST FT LDLASGYHQIEIATEDQAKTGFSTPEGHYQYRRMAFGLAGAPATFQLLMDR FT LLADLKGVTCFVYLDDVIIHSATIEEHAERLSGVLSKLREAHLVVNLAKCQ FT FATERVCYLGHVVSRRGVQVDPEKVTAVKDFPPPGNVRELRSFLGLVGYYR FT RYISGFAEKARPLTYLTRLDQRFQWGEAEEQAFKGLKDELCSDSVLAYPDF FT SLPFILATDASGVALGAVLSQLQDGAERPIAYASRQLNKAEQNYSTTEREM FT LAVVWATKHFRCYLYGRPFNLITDHAALRWLLTVKDPSSRLTRWQLRLAEF FT QYEVFHKPGRKNSNADALSRRVATINGEADSGRGTIRALQLADADCVHFKN FT TNTAQCVMDLDGLLYRFAGGNEPYQLMVPEPMRETILQECHDSPFAGHPGL FT ERTLALLRQGYYWPTMSQDAKRHIEKCRSCLERKTPQQLRVPLQVPYAAGR FT PFEQVSMDIVGPLPRTREGNMYLLTMIDHLTRYAEAVPLKSQTAQETAQKF FT VNHLVVRHGAPSRLLTDQGRNFTSALVRDACDTLGIKKLFTTAYHPQSNGR FT VERFHRTLVDALSHFVRRDGEDWDRWVPYVLMAYNSTPHSATGFTPNYLLY FT GRELPRPSALDPRPRQEVDQETSYAEALRGRLREAHRIAAETTERAFEAQA FT AQYNKKARVRSLVEGQWVYLHNPAGRRGEAQKFRSRGRGFRVQRSITDVTF FT EIAMGDGSTRMVHINRLKPVVVEGIPQETTEEDVPTSLSQNLGAEDTHDLF FT YEGLSLEGAPGIVGEVAGDGARNAESGQTAESIEEEQTEINTSGGDDLYAG FT VNLRPRGYQRADDRTETTELATELNVGELSPSAYSPPDDTIRDPTYTPVGT FT PRLPCTRSSYVLRSQRSRED" XX SQ Sequence 6135 BP; 1535 A; 1483 C; 1885 G; 1232 T; 0 other; attctggtgt cagaagtggg atctctatgc cactaacgga ccctaccatg gaggctctga 60 ggagggaggt ggaggaatta cgccgagcca atcgtgagtt acagacccta gatgaggaac 120 gtttacgcat ggagggggag ttaaccacgt tgaggacaat ggtaggcacg gatggccctt 180 ccttagcgag cgagcgtgaa tcgactggtc attccaggcg gagggagatg ggcttacaag 240 ccttgatcaa gccatggtcg ggggaaaccg gggcgccccc agcgaaggac ttcctgaacg 300 agatagagat ggtggcggag agtggtgcct ggacagacca cgataagagg ctgatctgta 360 agctaaagac gacaggcgca gctgctgtgt ttgtgtcgag ccaccctata ttttccgctg 420 cggtgtcctc ctttgaggac tacaagagag ccatcctgga aagattccag gaccaccgta 480 cccctgagca aaacttactc gcgctcaact cagtgtccca gcgtcgtggg gagggagtcc 540 gagagtttgc ggatcgctgt aggcaattgg gggagtttgc tacttctcac acggggagcg 600 gggagcagaa gcagtgggag cgcaatttca tggagcgagt ggtcctcgct gcgttcatcc 660 agggcctcgc gggtgagccg ggacgccagt tacgctttaa tccccccaca ggcttggcgg 720 aagctgtgaa gcgagcgcat ctggttgaag aggcggaaag cgggcccggc attgctagat 780 ccaccctgga gagggggata tatgccatag caagaaacgc ggatagtagc tatgggcgcc 840 gggggagatg tttcaactgt gcacaagagg ggcattttgc gcgggagtgc ccccgcccct 900 accagtcgcc ccctcgtgag tggaaaagct cgggtactag gggtagggtg agagaaccac 960 gggcgagggg tgtgaaatgt tatacctgcg gggaattggg gcacatctgt aggatgtgcc 1020 cgaagagagc tacacctgga tccgtgagac cagactatgt tgttaggaaa tccaactatt 1080 tggaggctgc cccaaaatcc ctcctcccgc agtcagcccc acctgcggga gggatgccac 1140 cacggggcca agatacagga ttataaacgt tgccactccc agctccagat tcatcaggat 1200 cccggtgaag ctctcaggag agaccaggac cttgctcgtg gatacggggg cgggtgtttc 1260 catcttgcgg caaccagtac cgggggttcc gttgttagag agtggggtgg tagcgcgggg 1320 tgtgacgggt agccccgttg gaatcaaggg agtacagcgg ttagacttga tgtggcgggg 1380 tcaaaccatc gcccaggagt ttttaatagc ggagattgat gttgagggag atggcattct 1440 agggatcgat ggtatggctg ctttgaagat gagattggat atgcggacgc ttgaattcga 1500 aggagatgag gggaaagggg ccttggatgg ccgggagaat gtgttacttg tagggaaggc 1560 tgaacctcaa tccagatttg agagaaacga gttcccagag gtcgaggcag agagagatga 1620 gtcgggagac atatacctag ttgcggcgga gaagataact atcccgcccc gttgcgaagt 1680 gttggtgcgt ggtagggcct ctggctggcg ctcaggcgcc accgtgctgg tagaacccta 1740 cccaacagag accaatcggg attggtgcgt aggcagaggg ataagccaag tagttcatgg 1800 cgccgtgtgg gtgcggatag taaatgtatc gcaaacggaa gtgatggtag ggaaatctga 1860 gaggctaggg tgggtgacgg aaatagaaac tcagggaagc gagtgggtcg gcgtttgcca 1920 agagagtggc ggggttccca gagcccgtaa aggattggga aagctccttg gggacaagct 1980 aggccattta cggggggagg aacgagacct cttggctact gttctatccc agtacgagga 2040 tgtttttcag gacccggaca catcctcgct agggtgtact tcccaaatca aacacaccat 2100 taatacggga aatgcgaccc ctattgcgaa gcgcccttac cgggtgccat accaccaacg 2160 tgaaactatg aaggagctgg ttgacgacat gctgacgaaa ggcgtaattg aaccgtctga 2220 cagtccatgg tcggcaccag tggtgttggt acggaaaaag acctccgatg gatcggtgaa 2280 gtaccggttt tgtaccgatt tccgggcgtt gaacgctgta acgaaggttc cggtttatcc 2340 gctaccaaat atcaatgaaa cgctagaaag tctgggaagg agccgctatt tcagcacctt 2400 ggatctggcc tcgggctatc accagattga aatagctacg gaggaccagg caaaaaccgg 2460 attttccaca cccgaggggc attatcagta ccgtcggatg gctttcggcc tggcgggggc 2520 tcctgctacg ttccagttac tgatggaccg gctattagcg gaccttaaag gagtaacatg 2580 ctttgtatat ctggatgatg tgatcatcca tagcgcgacg atcgaagaac atgccgagag 2640 actaagtggg gtgttgagca aactgagaga agcgcaccta gtggttaacc tcgccaagtg 2700 ccaattcgct acggaaaggg tttgctactt agggcacgtt gtgagtcgga ggggagttca 2760 agtggatcct gagaaagtaa ccgctgtgaa ggactttccc cctcctggca atgttagaga 2820 gctacgctcg tttctagggc tagtaggtta ctaccggagg tatatctcag ggtttgcgga 2880 aaaagcccga cccctaactt atctgacacg cctagaccag cggttccagt ggggagaggc 2940 agaggaacaa gcctttaagg gcctaaagga tgagctatgc agcgattcgg tgttagcata 3000 tccagatttc tcactaccct tcattttagc tactgacgct tcaggagttg ccctcggggc 3060 ggtactgtct cagctacaag acggggcgga acggcccatt gcttatgcaa gtcgccaact 3120 gaataaggca gaacaaaatt atagtacgac cgaacgggaa atgctagcgg tcgtctgggc 3180 cacgaaacac tttaggtgtt atttatacgg ccggcccttc aacctgatca cggaccatgc 3240 agcgctgcgg tggcttctca cagtgaaaga cccatcatcg cgtctaactc gatggcagct 3300 ccgcctagcg gaatttcagt atgaagtgtt tcataagcct gggcgtaaaa actccaacgc 3360 cgatgcgctg agtcgacggg tggccacgat aaatggggaa gcggattcgg ggagagggac 3420 gatccgggcg ctgcaattag cggacgcgga ctgtgtgcac tttaagaaca ccaacacagc 3480 tcaatgcgtc atggatctgg acggactatt atatagattt gcggggggaa acgagccgta 3540 ccaactgatg gtaccagagc caatgcggga gaccatttta caggagtgcc atgactcccc 3600 tttcgcggga caccccggac tagagaggac cctggctctg ttgaggcaag gctattattg 3660 gccgactatg tctcaagatg ctaagaggca tatagagaag tgccggagtt gcctggaacg 3720 gaagacgccc cagcagttac gagtacccct gcaggtacca tacgcggccg ggcgcccctt 3780 tgagcaggtg tccatggaca tagtgggacc cctaccacga acaagggagg gtaacatgta 3840 cctccttacg atgatcgacc atttaacgcg ctacgccgaa gcagtaccgc taaaaagcca 3900 aacagcccaa gagacggccc aaaaatttgt gaaccacttg gtagtccgac atggagctcc 3960 tagtcggcta ctaactgatc agggaagaaa tttcacctcg gccttggtaa gggacgcctg 4020 cgatacgctt ggaataaaga agttgttcac gaccgcttac catccccaga gtaacggaag 4080 agtggagagg ttccaccgca cgttggtaga tgcgttatcg cattttgtgc gccgcgatgg 4140 ggaagactgg gacagatggg tgccctacgt attgatggca tataatagta cgccacattc 4200 tgcgacggga ttcaccccga attatctcct ttacggccga gagctgccgc gccctagcgc 4260 actcgaccct aggcctcgtc aagaggtaga tcaagaaacg agttatgcag aggcattacg 4320 tgggaggtta agggaagcac accggatagc cgctgagacg accgagcggg ccttcgaggc 4380 acaagcggct cagtataata aaaaggcgcg ggtacggagc ttggtggaag ggcaatgggt 4440 ttacctacac aaccccgctg gacgccgagg agaagcgcag aagttccgca gccgtgggag 4500 gggctttcgc gtacagcgga gtatcacaga tgtgacattt gagattgcga tgggcgacgg 4560 aagcactcgg atggtgcata taaatcgctt gaaaccggtg gtggtggagg gaatccccca 4620 agaaaccacg gaggaagatg tccccacctc cttaagccag aacctaggag cggaggacac 4680 gcacgacttg ttttacgaag ggttgtcact cgaaggggcg ccggggatcg tgggagaagt 4740 tgcaggggac ggcgcaagga acgccgagtc tggacaaact gcagaatcca tcgaagagga 4800 gcagactgaa attaacacct caggaggtga cgatctttat gcaggggtca acttgaggcc 4860 gcggggttac cagcgggctg atgacagaac ggagacgaca gaactagcca ctgagctgaa 4920 cgtgggagag ctctccccgt cggcgtactc acccccagat gacacgatac gagaccctac 4980 atacacccca gtcggtacac ctaggcttcc ctgcactcgc tcgtcctacg tgttgcgttc 5040 ccaaaggagc cgggaggact gatgaagtaa tccggtttag gaaactattg ggcggtggtc 5100 ggatgaattt ggttgagcag cggaactgag tccccgaacg agggacgagt gctgctgacc 5160 cgggaccaat gacaaaatta gccaactagc cttgacaacc acgatgggag gcacccgcac 5220 agccattagg gaattaaggg taaccacctc taacctcatg cacctcctct tcctcacgtc 5280 ctccggaatc ttgagatcaa taccttggag gcaaaggtgg aattgcggtc gctcgagggt 5340 gccctcagca ccctagccag gagaaagtta gcaggctatt ggccccaagt gactgagtag 5400 gatagtgtat aaacagtcag atattatccg agagactatg gttcgtgtcg ggaaccgctg 5460 ccggcggata tgtaccatta ctacgacatc gctgagatag acgctgcctc cacacatgac 5520 gacattcaca tcttggtgga cataccctcg gtgaccaaag gaagatatat tttgacaact 5580 accgggtgga ccaatacccg tgaaggatcc gataactgga ctatacctgc tgctacaacc 5640 taaggccgac ttcacactgg tgggagtaga ctgagagagc ttcgtgtttc tgaaaacgga 5700 ggacctcagg cactgttggg gaaggagtcc actaatctgc ccagcacgat ttccggtctt 5760 ctccctccat agaccgtctc gccatacact actgttcaag aaactataag agaaggcgaa 5820 ccccttgtga gcgtggcgga tgctggtaac gagattcaag cttacgtaga gctggaggga 5880 ggagtcgaac gagtggttat acagtttgct cgagccagtc aagctgaatt gcacctccgt 5940 ggcatgaggg cacccaccga cacagagaga tcctggagac aaccaaccag cgaatgcagg 6000 ggcaccattt gttccctatc gtccgccgag cgtgaagacg cgaagctccg gagagcccgg 6060 tggagagttg ggaagtgaca agcgagtggt gggtcctccg caaggcggta gaacccactt 6120 taacgggggg gtaag 6135 // ID DNA8-36_AP repbase; DNA; INV; 262 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-36_AP. XX NM DNA8-36_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-262 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1966-1966 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 262 BP; 80 A; 43 C; 45 G; 94 T; 0 other; cacagatata tactaatact agatagccgt cttctcgtta tcagtgaaac acgccgccat 60 atgcaaagaa tactactcga taaacaaacg ttatcacata gactaattta ctattataat 120 gaatatgata ttatattgat aatagtctaa tattgataac gtttgtttat cgagttgtct 180 tctttgcata tggcgtcgcg tttcagctta ttctgtggtg gtgtattcga agacggctat 240 ctagtattag tatatatctg tg 262 // ID Ingi-3_AC repbase; DNA; INV; 4358 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Ingi non-LTR retrotransposons from a sea slug - DE consensus sequence. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; I group; KW Ingi-3_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4358 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 451..4320 FT /product="Ingi-3_AC_1p" FT /note="AP endonuclease, RT, RNase H." FT /translation="MMTGSKQDFRCGVSSVESRQADEVPGPLPRRGPPQAT FT GGTSVSSRGARRRDQRNTTTGQSSPESQQRPLKTFQWNCEGIYNKKDALKD FT TLIRERIDIVCLQETHLTSAMRFNIRGYQTFRKDRNYAPKGGVLTLVANSI FT PAKEIRVQDTGESEIIGVELKLPDRIVTVYNCYAPPDKALTLQVMDIPAED FT CLVLGDFNSHSPSWGYRDLDAKGEEIEDWQAFTKLHLLNKPDDPPTFFSRA FT WKTTSHPDLSFASNNISRGSTKQVMEQLATSDHKPILITSQVSPPTVSTST FT LPRWNYKKADWTKFASLTDVYTTKIHAKTSNTNRSAREFTQAILQAAKESI FT PRGARKDYVPNWSEELRMLNDKVTTAREAVENLPTTENNIQLKKAAAEFKK FT ATNSAVRKSWHEKTSSLNLHRQGNKLWRIVRSLNGENLNKDAPVILDKDGE FT SNVDKQAANLLIQQYKEVGKLEISPDRIEEVQRKIHAFESPQEPKDDVMTK FT PLVMTELDRALLQLKPRQAPGPDKISNDMLMHIGNTAKKKILQIFNSSWKN FT GKIPKAWKTAIQIPLLKDGKPKNSAESYRPISLTSCLCKLLERILNYRLCW FT FLEKNHLLSDNQAGFRKHRSTEDQITYIAQMVEDGFQRKEHTVAVWIDLEK FT AFDKVWTEGLILKLQNLNISHNMLKWISQYLSQRRACVKLQGKRSKMETIS FT NGVPQGGVLSATLFLIFLNDISTTITKKVTAAEYADDIALLCSAESVGTAQ FT VRLQSTLNNMMKWTEEWGQKINKNKTTYTTFSLSPKPRKVNLHLDGNTLQH FT ESNPTYLGVTFDPRLTWQAQTERCRSKGNRRTALLRKLAGSDWGANHTVLR FT GAYVGYVRPVLEYGIAAWGTASDSNFHKVARIQNQNLRIITGAMRSTPIHV FT METLTGLEPLNDRRDLKAIVHQEKVKRMPTHPMKDRSSKSQSKRLKRTSFL FT HMARHSSSGLEISLEESPTPLDPCQVHPLQTNHPPQIRENLEKLGTKSEVP FT PPILKSLTLDYLDRNFPSNRWNRVYTDGSATDAVRNGGGGIYIEWIDGSTE FT SHSIPTGSLSSNYKAETSALETAATILLNHQKSKANTVILTDAKSVLQALQ FT KPQTAQVRQLLSLLCDLNSQANTTLQWIPGHCGIHGNEKADDLAKEGAKMT FT QIEDGMDLSESKTLIKTALRNRWKRSHPKHNRHDPYYILNRADQVTIFRLR FT CGHNRLKHHMHTKLKVGETAICECGHDQEDANHILQHCPRYATLRTQTWQN FT VAPLERKLYGSLSELKRTAQFVRDIGLTV" XX SQ Sequence 4358 BP; 1489 A; 1124 C; 948 G; 797 T; 0 other; gggccccggg gcgaccaacc ccaacccagt gcgtccacag gtggcggata gtggaacggc 60 ctctagatat agaggttagc tgcgaatatc taaataagca gtcgcagacc aaatggcggt 120 gaggatggag ggcggggtga agactgtggt gtgggatccc attcttctac tcaatgtaaa 180 ataccctaag tgcccggtga aagactcgcc agcgagactg gggcgcgata tccccacacc 240 ccgggtagat ggaactcgtt agccagggat ggcagttcat ctaggagaag gaaaactctg 300 atcaaaaacc cctgctgcct agcaggcaaa cccaggaaag ttataggata tgggagcact 360 cccagacaga aaagcacgca gaagacggag aggtggaccg aaaggtgccg atagtcgggc 420 ttgcccaaac cgcctcacgc aagtctcatt atgatgacag gatccaaaca agattttcga 480 tgtggcgttt catctgtgga atcccgccag gcggatgaag tgccgggccc tctgcctcga 540 aggggcccac cccaagctac tggtggtacc tctgtctctt cccggggagc caggagaaga 600 gatcagcgga atacaacgac tggccaatca tctccagaga gtcagcagag acctctaaaa 660 accttccaat ggaactgcga aggaatttac aacaagaaag atgccctaaa agacaccctc 720 atcagagaaa ggattgacat agtctgtctg caagagacac accttacttc agcaatgagg 780 tttaacatca gaggctacca aacctttaga aaggatagga actacgcacc taaaggagga 840 gtgctcaccc tagtcgcaaa ctccatcccg gcgaaagaaa tcagggtgca agatacaggt 900 gagtcagaaa tcataggagt cgagctcaaa ctcccagacc ggattgtaac agtctacaac 960 tgctatgcac ctcctgataa agctctcacg cttcaagtga tggacattcc agcagaagac 1020 tgcttagtac ttggtgactt caacagtcat tctccaagct ggggatacag agacctagat 1080 gccaaaggag aggaaataga agactggcaa gctttcacca agctacacct gctcaacaaa 1140 ccagacgatc cccccacttt cttctccaga gcctggaaaa ccacgtctca cccagacctg 1200 tcatttgcca gcaacaacat cagcagagga tcaacaaaac aagtcatgga acaacttgca 1260 actagtgacc acaaacccat cctaatcacc agccaagtct ctccccccac cgtatcaact 1320 agcaccttgc ccagatggaa ctacaagaag gcagactgga ccaaatttgc atctcttaca 1380 gatgtatata ccaccaaaat tcatgccaaa acctcaaaca ccaacagaag tgcgagggag 1440 ttcacacagg ccatactgca agcggcaaaa gagtccatcc ccagaggagc tagaaaggac 1500 tatgttccaa actggtctga agagctcaga atgcttaacg acaaagtcac cacagcaaga 1560 gaagcagttg aaaatctacc aacaacagaa aacaacattc agctaaagaa agctgcagcg 1620 gaatttaaga aggcaactaa ctcagccgtt cggaaaagtt ggcatgaaaa gacaagctcc 1680 ctaaacttac acagacaagg aaacaaactg tggaggattg tcagatcatt aaacggagaa 1740 aacctaaaca aggatgcacc agtaatactt gacaaagatg gcgagtccaa cgtagataaa 1800 caagcagcaa acctactcat tcagcagtac aaggaagtgg gcaagttaga aatatcccca 1860 gacagaatag aagaagtaca gagaaaaata catgcttttg aatccccaca agaacctaaa 1920 gatgatgtta tgaccaaacc actcgtaatg acagaactgg atagagccct actccagctc 1980 aaaccaagac aagccccagg cccagataaa ataagcaatg acatgcttat gcatatagga 2040 aacactgcta agaagaaaat cctccaaata ttcaacagca gctggaagaa cgggaaaatt 2100 ccaaaagctt ggaaaactgc tatacagata cccctcctca aagacggaaa acccaagaac 2160 agtgcagaaa gctaccgccc aatcagcctc accagctgtc tgtgcaagct tctagagagg 2220 atcctcaact acagactctg ctggttccta gagaaaaacc atcttctatc agataaccaa 2280 gctgggttta ggaaacatcg atcaaccgaa gaccagatta catacattgc tcagatggta 2340 gaggatggct tccaaagaaa agaacacact gtagccgtct ggatcgacct ggagaaggcg 2400 tttgataaag tctggacaga aggactcata ctgaaattac aaaatctgaa catctctcac 2460 aacatgctga aatggatctc acaataccta tctcaaagaa gagcctgtgt gaaactgcaa 2520 ggaaaaagaa gcaaaatgga gacaatcagt aatggagtcc cacaaggggg agttttatca 2580 gcgaccctgt tcctgatttt cctcaacgac atttcaacca ccatcaccaa gaaagtgaca 2640 gcagcggagt acgcagatga catcgctctt ttatgctcgg cagaatcagt aggaacagcc 2700 caggtacgac tccaaagcac acttaacaac atgatgaagt ggacagaaga gtggggacag 2760 aagatcaaca agaacaagac cacgtacaca acattcagcc tctcaccaaa accccgaaag 2820 gtaaacttgc atcttgatgg aaacactcta caacacgagt caaatcccac ctatttgggt 2880 gtgacgtttg atccaagact aacgtggcaa gcccaaacag agaggtgcag aagtaaaggc 2940 aacagaagaa cagcactgct tagaaagttg gcagggtcag attggggagc aaaccacacc 3000 gtcttacgtg gagcctatgt tggatatgtc cgcccagtcc tggaatatgg tatcgcagcg 3060 tggggaacag catcagacag caacttccac aaagtagcga gaatccagaa tcaaaatctg 3120 cggatcatca cgggagcaat gagatccacg cccattcatg tcatggaaac actaacgggg 3180 ctagaacccc tgaacgacag acgcgatctg aaagcgattg ttcatcagga aaaggtcaaa 3240 agaatgccaa cccatccaat gaaggataga tcttcaaaaa gccagagcaa acgcttaaaa 3300 agaacaagct tcctccatat ggccagacac tccagctctg gactcgaaat ctctcttgaa 3360 gaatctccaa ccccactaga cccgtgtcaa gtccaccccc ttcagactaa tcacccccct 3420 caaattcgag agaacctaga aaagcttggc accaaatctg aagtccctcc tccaatactc 3480 aagtcactga cgctcgacta cctggacaga aacttcccaa gcaaccggtg gaatcgagtt 3540 tacacagatg gatcagcaac tgatgcagtc agaaacggcg gtggaggtat ctacatagag 3600 tggatagatg ggtcaacaga aagccattcc atccccacag gatctctatc ttccaactac 3660 aaagcagaaa catcagcact cgaaacagca gccaccattc tcttgaatca ccaaaaatct 3720 aaagccaaca ccgtcatcct cacagatgcc aaatctgtcc tacaagctct ccaaaagcca 3780 caaacagctc aagtacgaca actcctttca ctcctatgtg atctaaactc ccaagccaac 3840 acaactctgc agtggatacc aggtcactgt ggcattcatg gaaatgaaaa agcagacgac 3900 ttagccaaag aaggagccaa aatgacccag attgaagatg gcatggatct ctcagagtca 3960 aaaacactaa taaaaactgc tttaagaaac agatggaaaa ggtcccaccc taagcacaac 4020 agacatgacc cctactacat actaaataga gcagaccagg tcacaatctt tagactccgt 4080 tgtggccaca acagactcaa acaccatatg cacaccaaat taaaagtggg agagacagcg 4140 atctgtgaat gtggacacga ccaagaggat gccaaccata tccttcagca ctgcccgcgc 4200 tacgccaccc ttcgcacaca gacgtggcag aatgtggctc cccttgagag aaagctctat 4260 ggcagtctct cagagttgaa gcgaactgca cagtttgtgc gcgacatcgg actgactgtc 4320 tagctgctgt catcgagaac gaggaagaag aagaagaa 4358 // ID Gypsy-9-I_HM repbase; DNA; INV; 4051 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-9-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4051 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1984-1984 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 22..4035 FT /product="Gypsy-9-I_HM_1p" FT /translation="MMHLSNISCLDLENDDFVEYIERFDNFLLANDIEDAE FT KQKAVFLSTIGGPAYKLIRSLCENDTKNKSFTQIVKLMKDHLKPIPNSIAQ FT RFQFYKRDRKDEESVSGYITELRRLSEHCEFGENLNDYLRDRFVCGLNNEN FT VQQKLLTIKNLTLETTLDTARAYETAYKDTKAIRTKESFIEQDELHKVEIR FT TKSEESKECFRCGFRGHQANNCNYRYLKCHTCGKIGHIKRKCRSEKKELDH FT RKEKKLGVKKIGIQDESEKGCTAETEGNKDDFLALYSISEESDKVTGPVMV FT NVKINGSEVSMEVDTGAAVSVMGISAYKRIKKNEGKLQNSGVVLKTYTGEL FT IRPEGIGLVEVVYNGQCCKLPITVVKGNVPTLMGRDWMQRLNLQWLELFKR FT MRGINFCDKVDSRVKTLVEKYPEVFSDRLGCLKNFQCHIPLREGAEPKFYK FT ARPVPYALRTRIEQELDRLEDQGVWRKVQYSKWAAPIVPVLKNSKDPTGPL FT RICGDYKITVNQAAPVDSYPIPNITDQLATIAGGERYTKLDLSQAYQQLEL FT DETSREFLTINTHQGLYQPTRLQFGVHSATGIFQREMDRRLGRLSFVKVRV FT DDILISGKTDAEHLNNLESVLKILKESGLTLKVSKCSFMQPEVEFCGFIIS FT QKGCKPTARNVEAVMNAPRPTNIKELRGFLGMANYYNAYIPRMASITEPLH FT NLLRKNVSWEWNRNSEQAFESVKTILCNAPLLAHFDPSRKIVVHCDASPYG FT VGAVLSQQQYDGSEKPVSFASRTLNTAERNYAQIEKEGLALVFAVKKFHQF FT LFGQKFTLYTDHKPLLGLFSESKELPTRAAARVLRWALLLSAYDYQLLYCP FT GEKNATADGLSRLPLDVSKEKSRLKTIEVAMLELVKTPITEKQLRGATQND FT PILGVVLNRVLEGGLTMEPSKVEMKPFALRFSELSTEGGCLLWGRRVIVPS FT VLRETVLEELHEVHPGISRMKALARSYVWWPGIDTEIEDKVKNCETCQRSQ FT KNPLTWSHPWEYPSRPWERLHIDHAGPMNGKIYLVVVDSFSKWIEVEVVRS FT TEAKATIKVLRKLFSTHGIPRVIVSDNGSGFSSEEYKEFLYSNRIKPVYSA FT PYHPASNGQAERMVQTFKNSIKNFQGNDIETQLCRFLFKYRVTPHSTTGVS FT PAELLLGRRLRNPLSMLHPEVSSRINEKKLSDSSVDKVRFFQPEDTVYVKN FT YSRGEKWIAAIIISAVGNVSYKVLTLDGRIQMRHVDQIVNRYVGKVNSTKT FT PESINTPILSNADKPNEVIETPCSSEILSENRDEISMDVEPEKLCEQPGAF FT SEPTSSINRRSGRVRRKPSYLEQYE*" XX SQ Sequence 4051 BP; 1430 A; 610 C; 916 G; 1094 T; 1 other; attggcgacg aagataaaaa aatgatgcat ttatcgaata tatcttgttt ggatttagaa 60 aatgacgatt ttgtagaata tattgaaaga tttgacaatt ttctactggc caatgacata 120 gaagatgctg aaaaacaaaa agctgtattt ttatcaacca taggaggacc tgcttataaa 180 cttatccgaa gtttatgtga aaatgacaca aagaataaga gttttactca aatagttaaa 240 ctaatgaagg accatttgaa accgatacca aactccattg cacagcgttt tcaattttat 300 aaaagagata gaaaagatga agaatctgta agcgggtata ttactgaatt acgtagacta 360 tctgaacact gtgagtttgg tgaaaacttg aatgattatt taagggacag gtttgtttgt 420 ggattaaata atgaaaatgt gcaacaaaaa ttattaacca taaaaaatct aacgttggag 480 acaacacttg acacagcgag agcatatgag acagcatata aagataccaa agctatacgt 540 acaaaagaaa gctttataga gcaggatgaa ttgcataaag tagaaattcg gacaaaatca 600 gaagaaagta aagaatgttt tcgttgtggg tttaggggtc atcaagcaaa caattgtaat 660 tatcgttatt taaaatgtca tacatgtgga aagatagggc acataaagag aaagtgcaga 720 tcagaaaaga aagagttaga tcataggaag gaaaagaaat taggtgtgaa gaaaattggg 780 attcaagatg aatcagaaaa aggatgtact gctgaaacag agggaaataa agatgatttt 840 ctagctctat attcaataag tgaagagtcc gataaagtga ctggaccagt tatggttaac 900 gtaaaaatta atggaagtga ggtttctatg gaagtggata caggggcagc tgtctctgta 960 atggggatct cagcttacaa aagaataaag aagaatgaag gaaaattaca aaattctggt 1020 gttgttttga aaacctacac aggagagctg attagaccag agggaatagg gttggttgaa 1080 gtagtgtata atggacaatg ttgtaagtta cctataactg tagtaaaagg aaatgtgccc 1140 acattaatgg gtagagactg gatgcagcgt ttaaatctgc aatggttaga actgtttaaa 1200 agaatgagag gaattaattt ctgtgataag gttgattcaa gagtgaagac attagtagaa 1260 aaatacccag aagtatttag tgaccgttta gggtgcttga agaacttcca atgtcacata 1320 ccactacgtg agggtgcaga acctaagttt tataaagcaa gaccagtacc ctatgcttta 1380 aggactagaa ttgaacaaga acttgatcgc ttagaagacc aaggagtttg gaggaaagtc 1440 cagtattcta aatgggcagc ccctatagtg cctgttttaa agaattctaa ggatcctact 1500 ggtccattaa gaatatgtgg agattataag attacagtaa atcaagcagc cccagttgac 1560 agttatccaa taccgaatat cactgatcag ttagccacta ttgcaggagg agagagatac 1620 accaagcttg atttgtctca agcttatcaa caattggaat tagacgaaac ttcgcgagag 1680 tttttaacta taaatactca ccaagggttg tatcagccga ctcgtcttca atttggtgta 1740 catagtgcga ctggcatctt tcaaagggag atggacagga ggctgggcag actgtcattt 1800 gtaaaagtgc gagtagatga catacttata tctgggaaaa ctgatgcwga gcatctgaac 1860 aacttagaat ctgttttaaa aattttgaaa gaatcaggac tgacgctaaa agtatctaag 1920 tgctcattca tgcaacctga ggtggagttt tgtggattta taattagtca gaaaggttgt 1980 aaaccaacag cacggaatgt ggaagcagta atgaatgcgc ctcgacccac gaacatcaag 2040 gagttgaggg gatttttagg aatggccaac tattacaatg cctacatacc caggatggct 2100 tccatcacag agcctctcca taatttgtta aggaagaacg taagttggga atggaataga 2160 aacagcgaac aagcctttga aagcgtaaaa actatattgt gcaatgcacc tttattagct 2220 cactttgacc catcaaggaa aatcgtggtt cactgtgacg ccagtccata tggcgtagga 2280 gccgttttga gtcaacaaca gtatgatgga agtgaaaaac ctgtcagttt cgcttcaaga 2340 acattaaata cggcagagcg aaattatgca caaattgaaa aggaaggatt agcattagtt 2400 tttgctgtaa aaaaatttca tcagttctta tttggacaaa aatttacatt gtatacagat 2460 cataagcccc tattaggact attttcagaa agcaaagaac ttcctacaag agctgctgct 2520 agagtgttac gctgggcttt gctattgtca gcttatgatt accaattact atattgtcct 2580 ggcgaaaaga acgcaactgc agatggtcta agccgtttac ctttagatgt atcaaaggaa 2640 aagtcaaggt taaaaaccat agaggtggct atgttggaac tagtaaaaac cccaattaca 2700 gaaaaacagt taagaggagc tacccaaaac gatccaatat taggagtggt actgaataga 2760 gtgttagagg gagggttgac gatggaaccg agtaaggtgg agatgaaacc atttgcatta 2820 aggttttctg agttatcaac ggaaggaggt tgtttactgt ggggaagaag agtgatagtg 2880 ccaagtgtct taagagaaac ggtacttgag gaattacatg aggtgcatcc cggaataagt 2940 agaatgaagg ctttagcaag aagttatgtt tggtggcctg gaattgacac agaaatagag 3000 gataaagtaa agaattgtga gacatgtcaa agaagtcaga agaatccatt aacctggtct 3060 catccttggg aatatccaag tagaccatgg gagcgactac atatcgatca tgcaggccca 3120 atgaatggga aaatatattt ggtagtagtt gacagttttt caaaatggat tgaggtggaa 3180 gtagtgcgca gtactgaagc caaagcaaca ataaaggtac tcagaaaatt attttctacc 3240 catggtatac cgcgagtgat cgttagtgac aatggatctg gattttcgag tgaagaatat 3300 aaagagtttt tatattcaaa tagaattaaa ccagtgtatt ctgcaccgta tcatcctgct 3360 tcaaatggtc aagcagaacg aatggttcaa acatttaaga attcaattaa aaactttcaa 3420 ggaaatgata tcgagacaca gttgtgccga tttcttttta aatatcgtgt tacacctcat 3480 tctacaactg gagtttctcc agctgaactt ttattgggta gaagattacg aaatccgttg 3540 tcgatgctac atcctgaagt tagctctcga atcaatgaaa agaagctgtc agacagttct 3600 gtggataaag ttcgtttttt tcaaccagaa gacacggttt acgtcaaaaa ttatagtaga 3660 ggtgaaaagt ggattgcagc tataattata tcagcagttg ggaatgttag ttacaaagtg 3720 ttaacgttag atggtcgtat ccagatgaga catgtggacc aaattgtaaa tcgttatgtg 3780 ggaaaggtta actctactaa aacaccagaa agcataaata ctccaattct gagcaatgct 3840 gataaaccaa atgaagtgat tgaaactcca tgttcaagcg aaatcttgag tgaaaatcgt 3900 gatgagatta gcatggatgt tgagcccgag aaactgtgtg aacagcctgg tgctttttct 3960 gaaccaacct cttcaattaa ccgaagatca ggtcgagtac gtcgtaaacc atcttattta 4020 gagcagtacg aatgatttat gggtggagga t 4051 // ID BEL-622_AA-I repbase; DNA; INV; 7884 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-622_AA_; KW BEL-622_AA-LTR; Pao_Bel_Ele104; BEL-622_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7884 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5228-5809] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 861..1901 FT /product="BEL-622_AA-I_3p" FT /translation="MPDLRSLNKLEHQLRKSLEGIENFVHNFDVKRDVRQV FT NVRLEALEKIYGQFVEVRMKIELLLEDALEDGEGDEETLMQENDKVRQDFE FT DSYYSLKAELMAFQSLGTSASRNSTLQAPDTQAATQFAKVKLPEIKLPSFG FT GKIHEWVPYRDSFRSLIHDNTLLSDVDKFTYLRSSLSGDALQEINSIDLSA FT ANYTVAWKALESRYENKKLIVKAHLDALFTVEALKKESYEGLNQLIGDFEK FT NLQMLEKIGEKPSDWSTILAYMVCSRLDMATLRHWETHHNSKDVAKYSALM FT EFLKGHCSVLQSIAPSKATSSVQQRQFRPAVCHTSSSSPRSAISAVSRVIP FT CFSA" FT CDS 1448..4699 FT /product="BEL-622_AA-I_1p" FT /translation="MEGVGKSLREQEVDRQSASRRTFHRGSAQEGKLRGAQ FT PVDWRLREEPADVGEDRGETIRLEHHSGLHGVFQIGHGNPTPLGNPSQQQG FT RSEVFGTDGVFEGSLLRPTIDCSVESYKFRATATVSSSGVPYFFKFAPKCH FT FCSEPRHSVFQCVKFQRMSIPERIEAANRNKLCRNCLYPGHFARTCEKGAC FT RQCHQKHHTLLHTEQPRSSVPPTQSRPPTVNQQSRQPQSKPPTPNQQTRTA FT NTANTQDSHTQPATEHATTSQTHVALPITPTQNIILSTALVSIQDRYGRSV FT PARALLDSCSQHCLMTKSFASKLKLDETPVYLSIQGIGSSQAVSTRLVTAV FT VGPRSPNISSFAEEMSFHVLPKLTVSLPTVSINTAAWNLPNSTLLADPNFF FT EMGPIDVIIGAEYYMELLKEERRRATDDGPTLQDTVFGWIVSGRIPEGPAA FT SYSLVHVCSTAEIQDQLSRFWEVETCQSTSTLSVEESACEEFFDRTTYRND FT EGRYVVTLPKKERLIQQLGDSRSTAIKRFLGVERRFAMNPELKKQYAEFMR FT EYLAMGHMREVSEEDTSAPSYYLPHHAVLKPDSTTTKLRVVFDASCKTSTG FT VSLNDALMVGPVVQDVIVDITLRFRTHRFALVGDVAKMYRMVLMNAADQRL FT QRIVWRDSPSESIRTFALTTVTYGTASAPYLATKCLQRLSEKGEESHPAAA FT KVLKKDFYVDDMLSGVDDIEEGKRLVSQMNDLLQSAGFSLRKWNSNSKELL FT SAVPEGLRDERSILELDSSSAAVKTLGLTWEPSTDCFRFSSPAWNESTAIT FT KRCVLSDASRLFDPLGLVGPVIIQAKIFLQDLWKHDCEWDEPLSPQLQDQW FT REYRWNLTGLDGIAVPRWVGTSRSTEAVELHGFCDASNKAYGACVYVRTVA FT ADGSVMVHLLTSKSRVAPLENLKKNKKSLSIPRLELSSALLLAHLYEKVAK FT NITVATKCRFWTDSTIVVCWLSSSPSRWKQFVANRVSEIQHITKGSVWNHV FT AGEDNPADIISRGMSPAQLQYESRWFHGPKWLMSDEHYWPSSAQINESEFA FT KADLEEKSIVAALPAFPQVRYSVFARR" XX SQ Sequence 7884 BP; 2016 A; 2019 C; 2068 G; 1751 T; 30 other; ataaaktgtc tttttgataa aagtccgtcg cgagtttttg cataagtgtc cgaaaactgt 60 gattacttgg cgcgcgaaca tttggtcctt cgagccggat tacgtctccc ggcgaaggtt 120 ttcggacacc ggtccgtgaa gttggacatt tttagtgacg gacagtgatg gacgattgca 180 ctattgtgct ggtgactggt gaatagtgtc accattttcg acgcgggttt gcgccatcgc 240 gaaggaactt tgcgcgccgt tgcacagtgg ttcgatctag aacaaagtga ggcggaattt 300 ttcttcgagt gaagtggacg aactgcatag gagcggccga cggtgtttac ccggtcaagt 360 gcatccagga cgacaacgac gacgacgacg aattgaacgt gcagcgcaag gacggatgcg 420 accaggaaga aattaaggga gtagtttgca tgtgtggtgg cgttaattgc atggaacgcc 480 attcctaaac tctgtggcaa tttgttgaaa taaatgattg attgcatgtg atgtgagtcc 540 aactttattt aagagcagta gccctggtgt ccgctcatca aagtcttgtt tggtctcggt 600 ttcgctggaa tttgtcggct ttagttttct tccgaggttg gttcccctgc tgctgtcgaa 660 tttgggttgt aagtctgtcg ttccgggtcg cgttgggtgg ttcgtcgagt cgatcgtgtc 720 tagttaaagt gcaaagtgca gtksagtggt ksgaatttac tkcgaccgtg cgtgggtgac 780 ttgtactttt cgattgagta gcctacactg cttaatcgat agtgatagtg ccagtgacgt 840 cagtgattcg ttgattcacc atgccagacc tacgttcatt gaacaagctg gagcatcagc 900 tgcggaagtc gctcgaagga atcgagaatt tcgtgcacaa tttcgacgtg aagcgagatg 960 tacgacaagt gaacgttcgc ctggaggcgc tggagaagat ttacggccaa ttcgtagagg 1020 tgcgaatgaa gatcgagctg cttttagagg acgctttgga ggatggcgaa ggcgacgagg 1080 aaacgctaat gcaggagaat gataaggttc gccaagattt cgaggacagc tattattcgc 1140 taaaggcaga gctgatggcc ttccaatcgc ttgggaccag cgcttcaaga aattccacgc 1200 ttcaagcacc cgatacgcaa gcggcgactc agtttgcgaa ggtgaaactt ccagaaataa 1260 agctgccgtc gtttggtgga aagattcacg aatgggtgcc ataccgtgac agcttccgga 1320 gtctaatcca cgacaacact ctactctccg atgtggacaa gttcacctat ttgaggtcat 1380 cgttgtccgg tgatgcgctg caggaaatta attcaattga tttgtctgca gccaattaca 1440 cggtggcatg gaaggcgttg gaaagtcgct acgagaacaa gaagttgatc gtcaaagcgc 1500 atctcgacgc acttttcacc gtggaagcgc tcaagaagga aagctacgag gggctcaacc 1560 agttgattgg cgacttcgag aagaacctgc agatgttgga gaagatcggg gagaaaccat 1620 cagactggag caccattctg gcttacatgg tgtgttccag attggacatg gcaaccctac 1680 gccactggga aacccatcac aacagcaagg acgtagcgaa gtattcggca ctgatggagt 1740 ttttgaaggg tcactgctcc gtcctacaat cgattgctcc gtcgaaagct acaagttccg 1800 tgcaacagcg acagtttcgt ccagcggtgt gccatacttc ttcaagttcg ccccgaagtg 1860 ccatttctgc agtgagccgc gtcattccgt gtttcagtgc gtgaaattcc agcggatgag 1920 tattccagag aggattgagg ctgcaaacag gaacaagctc tgccggaatt gtttgtatcc 1980 tggtcatttc gccagaacct gcgagaaagg ggcgtgtcga cagtgtcatc agaagcatca 2040 cacactgctg cacactgaac agcctagatc ctccgttcca cccacgcagt caagaccccc 2100 gacagtcaac caacaatcta gacagccaca atccaagccg ccaacaccga accaacagac 2160 tcggaccgct aacactgcca acacccaaga ctcacacact cagccagcca cagaacacgc 2220 caccacaagc caaacacatg tagctctccc aattacaccg acacaaaaca tcattctttc 2280 gaccgcgcta gtcagcattc aagatcgcta cggcaggtcc gttccagcac gagcactgct 2340 ggactcgtgt tcgcaacatt gtctcatgac aaagagtttc gctagcaagc tcaagctaga 2400 cgaaacgcca gtctatctgt cgattcaagg cattggatcg tctcaagcag tgtcaacaag 2460 gctggtcacc gccgtagtgg gtccgagatc gccgaatatt tcgtcattcg ccgaagagat 2520 gtctttccac gtgctgccga agctgaccgt gtcgctgcct accgtcagta tcaataccgc 2580 wgcgtggaat cttccgaatt caacgcttct tgcggatccc aatttcttcg agatgggacc 2640 gatagacgtc atcatcggag cggaatatta catggagctg ctgaaggagg aaagacggag 2700 agcaaccgac gacggcccaa ctcttcaaga caccgtattc ggatggattg tttccggtcg 2760 tattcccgag ggtccagcgg catcgtactc ccttgttcac gtctgctcaa cggcggaaat 2820 tcaagaccag ctatcgcggt tctgggaagt agagacttgc cagtcaacca gcacgctatc 2880 cgtcgaggag tcagcatgtg aggagttctt cgacaggacg acatacagaa acgacgaagg 2940 gaggtacgtg gtcacgctgc cgaagaaaga gaggttgatc cagcagctag gcgattctcg 3000 atcgacagca atcaagcgtt tcttgggggt agagcgacga ttcgccatga atccggaact 3060 gaagaagcag tacgcagagt tcatgcgcga atatctggct atgggccata tgagagaggt 3120 ttccgaggaa gacacgagcg ccccgtcgta ttacttgcct catcacgctg ttctgaagcc 3180 ggacagcacg acgacgaagc tgcgtgtcgt attcgacgca tcctgcaaaa cgtccaccgg 3240 ggtttcgttg aatgatgccc taatggtagg ccccgtcgtt caggacgtga tagtggacat 3300 cacactacga ttccgtaccc accgatttgc cctcgtagga gacgtcgcaa aaatgtaccg 3360 tatggtactc atgaatgcag cagaccagcg gctgcaaaga atcgtttgga gagacagccc 3420 ctcggaatca atccgtactt tcgcactcac cacagtcacc tacggtacag cgtccgcccc 3480 ctacttagcc acgaagtgtc tgcagcgtct gtcagagaaa ggtgaagagt cgcatcctgc 3540 tgctgctaag gtcctcaaga aggattttta cgtcgatgac atgctgtctg gcgtagacga 3600 catcgaagaa gggaagcgcc tcgtcagtca gatgaatgat ctgctccaat cggcaggatt 3660 ctccttgcgg aagtggaatt ctaacagcaa agaattgctg tcagcggtgc cggagggttt 3720 gagagacgaa cgatcgattc tagaactgga ttcatcgagt gccgcagtca aaacgttagg 3780 tttaacctgg gagcctagca cggattgctt ccgattcagt tcgccagcgt ggaatgagtc 3840 cacagcgata acgaagcgtt gtgttctttc ggatgcgtct cgtttgttcg acccgttggg 3900 actggtaggt ccagtgatca tccaagcgaa gatcttcctg caggacctct ggaagcacga 3960 ctgcgagtgg gatgaaccgc tcagcccgca gttacaagac cagtggcgtg aatatagatg 4020 gaatttgacc ggcttagacg gaatcgctgt tccccgatgg gttggaacga gtcgcagtac 4080 cgaagcggta gaactccacg gcttctgcga cgcgtcgaac aaggcgtatg gcgcatgcgt 4140 gtacgtccgc accgtagctg cagatgggag tgtaatggta cacctgctaa cctccaaatc 4200 acgagtagct ccgctcgaga atttgaagaa aaacaagaag tcgctatcca tccctcggct 4260 agagttgtcg tcggcactac ttctagcaca cttgtacgag aaagtcgcca agaacatcac 4320 cgttgccacc aaatgtcgtt tctggaccga ctcgaccatc gtcgtatgct ggctttcatc 4380 gtcgccttcg aggtggaagc agttcgtggc aaatcgcgtc tccgaaatcc agcacattac 4440 aaagggtagt gtgtggaatc acgtcgccgg tgaagacaac cctgcggata tcatttctcg 4500 aggcatgagt ccagcgcagc ttcagtacga gtctcgttgg tttcacgggc cgaaatggtt 4560 gatgtcggac gaacattact ggcccagttc ggcgcaaatc aacgagagcg agttcgctaa 4620 ggcagacctg gaggaaaagt ccatcgtcgc ggccctcccg gcgtttcccc aagtgagata 4680 ttcggtcttc gctcgtcgct agtagatcta gttcgactga ccgtttacat tcgacgattc 4740 aaatggaatt cgtcaccggc gaatcgatcc tgtaggaaag taggcagcat cacttccagg 4800 aatatgaaga agcgattagg gagctggtaa agctgtctca gcgggaaagt ttcccgcaag 4860 aattctcaga ccttgccaaa cacggtcagg tacaggattc gtcaagaata tcggcgctaa 4920 atccccaact tgtcggggtg tattgtgcgt tggcggccgg ttgaaaaacg cggcagtggc 4980 agatagtagg aagcatcctt acattctcga ccaccggcat cccttcacca aaatcgtcgt 5040 agtccactac catcgcaaat atctccacgc cggtcagcaa cagatggtat cagcagttcg 5100 agagcagttt tggccaacga gcgtcagaaa tctcgcaaga caggttgtcc acgaatgtgt 5160 ccagtgcttt cgtgtgaaac ccaagatcca ggagcaactg atggctgatc tgccccccga 5220 aagagtcagg ccatgcttcc cgttccagaa ggtaggagtc gactactgcg gaccgtttta 5280 cgtggtctat ccgcagcgcc gagctcgtcc ggtcaagtgc ttcgtggcag tatacgtgtg 5340 cctagttacc aaagctgtgc atttggagct agcagcagat ctgtcgacgc aggcattctt 5400 agcgacattg caacgcttca ctgcacgccg aggtaaaccg tcgctgataa tgtgcgacaa 5460 cgcgacgaac ttcgttggmg cacgacgcaa gctagatgag cttgcgcagc tttttgcaag 5520 tcagcaattc accgaagcag ttctacgaca gacgatcgaa gacmgaatcg agttccggtt 5580 cattccagcc cgctcaccga acttcggtgg actgtgggaa tcagcggtga aatcgttcaa 5640 gaccctgttc aagcgaacga ttggcacgcg cagtctggaa tacgacaaca tgctcacggt 5700 gctggctcaa gcagagtcaa tcctgaattc cagaccgctg accccaatca gcaacgatcc 5760 ggacgatttt gaggccctca cccctggaca ttttttgatc caccggccgc tgacggccat 5820 tccgggacct gatcttggcg aagttccgga aaacagactt tcggcttggc aaaagatgcg 5880 gatttcaacc agaggttgtg gaagaaatgg tcggagcagt atctgtccaa cctccacaat 5940 cggacgaaat ggacgcgaca gagggacaac ataggcgttg gcactatggt tgtcataaag 6000 gaggagaatt tgccaccact gaagtggcag ttggcacggg tgacagaggt tcacccaggt 6060 tctgacggca acatcagagt tgtcaccgtt cggaccaaag acggcagtta ccagcgagcg 6120 atctccaaga tctgtgtgct gccaatacga gataatctat cgtctgctga aggggagaac 6180 taggactcct ccaacggcgg aggtcgaaag gcctccgcac ccccagttat gttatgtatt 6240 tgttaaactt tcgaaaaagt aatccgatca tctttctcca gacaaacctc cttccgtttt 6300 ggttcaacca aaactgcaga cactcaaata caaattatga gtcgtcccgc gtccaatcca 6360 ccgatattga ggtcggccgc cgcccaatat cggtgccggg ccggccccgc ctcgtgttcc 6420 gcccggttgg tggtagttgc atgtgcatcg ttgcatgtcc cacgggcagt cgcaagtcaa 6480 cgctctagca taccgcggga gggcccggct aggatcggaa caagctccga ttggtgccgc 6540 ggtattcggc gcgggcgggc ggggcaggcg cagtgtttgt cctgaaacta aagctgttcc 6600 aaaccgcccc cagcccccgc ccgtttgata ttscgaggta cccgagcctg ccacggcgag 6660 ccggggttaa gggtcgcatt magaggaaga acggtgcgca attcaacctc ctggattcac 6720 gctatgattg tccacgggta tggccaacct aaggatcgtc actgcatcac gggtccagaa 6780 tcgtaaccgc cacggagagc atcaataccg aagagtatca acgtcatcga gcaacagggc 6840 aaatgaaact atcatcgtca gtcagaggga ccgtcggcgc cgctcgggcg tccaccggag 6900 gccwtagggc ctccgttcag gcaagtcgtt ttcgttttca gcattacatt tcaaaaagca 6960 ccctctcatt ttattcagag accaccagcg ttacacggga cacggattgc ctcagcatcc 7020 gttcacacca aatcatccgt tcatcatcac caagtcgaag tcgaaacgtc caagacaaac 7080 ttcctaaatc agccaagtca gtcggagttc actgaagcgt cgtaacgtct cgtccaaatc 7140 gtcgtcacca agagtccgga gamgtcatca acccaaatcg ttcaacggaa ccaagctatc 7200 gtcatcgaag ttgaaattca tcatcgtcga gagcattatg cagtcatcat caagagtccg 7260 agtacccagc caaatcgtca acatcatcaa gtcmaagcat agtccacgca tcgtmgtcag 7320 tcsctaaggt taaggtcatc gcgaacgcac gcgagactta gaagtcggcg taawcagagt 7380 catccaccaa agwcgtcgag gctgaaatca gggatgcamc aawttgagcg gcgtcaaaaa 7440 tgcaattcmg cwcctacctg cagcacawsc gaagawcgat taccggakmc agcaagtcct 7500 ggcagacagc aaaatatcga atggaacaag ttcagacaaa acaaattgtc tgccaggtaa 7560 gttgttccgg ttttacaaaa tcttacttca cgacactata tccacctcca ttctccwccg 7620 aggaaaacaa caacaccatc gatmgtccta cccaacactg atccacaagc aggctgccgg 7680 catcgagacc gttggtatat aggccaaggt caactcggtg gctaacatcg tcgcaatata 7740 tgggtcagca gtaccagcca gcagagcagt acagtgtagt atataagsaa ctgaaaattt 7800 caatagtttt aaggcaattt tgaaaacatc tgggagttag aaagggaaag tttgaaatct 7860 agggattcca aggcggccgg cata 7884 // ID DNA-6_AAe repbase; DNA; INV; 2094 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2094 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1260-1260 (2011). XX DR [2] (Consensus) XX CC >93% identical to consensus. 8-13 bp TSDs. TIRs are ~130 bp CC long. XX SQ Sequence 2094 BP; 615 A; 424 C; 394 G; 661 T; 0 other; gggaccctaa cgaaaggcgg aaacacgaaa ggcggaatca caaaaggcgg aaactcaaaa 60 ggcagactaa cttagacgtg taacaaaagg cggaaaataa cataaggcat aatctaaaaa 120 ggcggaatcg ttcaaaaacg atttcaaaat gcttacttta gcttccaaac gataattaga 180 ctgagctgtt taggttccta ttcatatcag cgttttcaaa atcactaagc ggacttagaa 240 tttcttttga aaaacggtgt tttttgtttt tcaatttata tgtaggtatt ggccaataat 300 acatgacagg taaataccgt ccaatgccct ccattacgct gcggtgccac tataaccgag 360 aaatatcttc ccgcaccagg ttgatgttag gtgaaataac cgtcgcctgt ttctatttct 420 tttatttatt gtcgcctgtt tcagccttag gtttgcgcta aattttcgat aaatctagcc 480 aattttgttg ctttttcttc tgtggcacta cgcccccact gagacacagt ctgcttctca 540 gcttagagtt caatgagcac ttccaccgtt attaactgag agctttcttt gtcaaagttg 600 tcattttcgc attcttatat cgtgtaattc tacgtcacta gcccgaaaga cacttgctcg 660 aaagtaacat tagcccgaat gatggacata gtctattctt tgaaaaagaa atatctaata 720 cattatcagt gatttgtcct tcttttaatc atcggctatt ctttctggac ttatcacctt 780 gaatatgttt ggcgccacga gccactaaag agccagttga aattgattct tctttgaaac 840 atatactgtt ttttcgccta atattgctgc tagtggtatt caagctgctc atattcgcgt 900 taatgttatg gcacagaaaa acagacgtaa cagctagaac aattacccat ttgaaacttt 960 gtctcgcaaa cgttactgtc taatgctgaa aatgctgtgc ttggcaaacc gagaatcagg 1020 tggcagtagt gcgcaaaagc gatacgagcg ctaagactgg ccaatcggcc aaccataaaa 1080 tattagaatt gtgcacaatt ttgctgtacg atgcacataa ttcgagtgtt ccgtctgttt 1140 gtcagtggtt atagaatcct cgcattccaa aatgcttcat agaagtgcct ttgtagttat 1200 ttttcccaat acggtgctat aatagattta ctacaaactc cgtaattcgg accatcgcat 1260 tggctcatgg aaacatcgag tcatggaaca gaaatgcttt ggagagttgc acagaataca 1320 aaactattaa gtcatattct ttactagtgt gaaaatcaaa tttgatttgt gaacatgtga 1380 catcattcct atcgtagcat taccacccgg acaactagat cattttttac aaatttgttc 1440 gaggtggata ggaatgacct agggaaacag ttatgttatg ttatgttatg taacagttat 1500 ggccatagtg gttccctatt tggccatatg ataaatttcg aaatcattca catttggaat 1560 ctttttgaat ggtttaccgt gacggcatat ttaatacctc acaactgaga cacaaaaaat 1620 aattcaaaat ttgaaaacaa tttaattaac acgtatagcg aaatagggaa ccattacgtc 1680 catcactggt acacctaccc tatatcacct ttttaatatt gtacaccgta aaaaactgtt 1740 tgaggggacc atcatattaa ccatgaaatt taggttttgg ttttagtatg atatcatgag 1800 tcgttataga gtcatggaac atcgactcat ggaggtttca ctgtaaaagc aaaccgcaga 1860 atcacataag gctaatttaa atttggcgga ctttttgaac cagcagagca aacggccggc 1920 cggctggttc aaaaagtccg ccaatttaaa ttagccttat gtgattccgc ctttcgtgtt 1980 tctgcctttc gtgattccgc cttttgtgcg tagtcgcctt ttgttagttc tgcctttcga 2040 aattccgcct tatgtgtttc cgcctttcgt gtttctgcct aacgtaggac accc 2094 // ID hAT-28_SM repbase; DNA; INV; 2311 BP. XX AC . XX DT 14-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-28_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2311 RA Bao W. and Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 77-77 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(334..1416,1346..2221) FT /product="hAT-28_SM_1p" FT /translation="MLKQKRSCKFLDKYAEEYPFMKKGRTETEGYCSHCKS FT YLSISHGGISDIKAHVMTTKHVCAVNSSASCGKITSFFVKPNTTEADQVIA FT AELSFSYHIIKHHQSFVSASCTNKLFPKIFSDSKISKKYASGATKSTCMIT FT KVLAPHSVSVAVNEINSSFFYSLSTDASNHLSEKIFPLVVQYFTLSGTKVK FT LLKLGNLKGETADLVANFCIQSLKDLNLEMKNCVAFSGDNANVNFGGRMRA FT GTNNIYTKLKGAIGGQIEGIGCPAHVVYNTLETACDQLSCDAGMIVFKLFN FT YFSIYMVRTERLKDFCAFVSVEFKKLLSYTKTRWLSLLPSIERVLKIFDGL FT KSYFLSEDKAPKVLLDFFKYLMDLNLIFCLKIKLQRFFSIFFNNPLSEAYF FT WFIHSQASSVSFLILKIEGKNQSVVEALLVLKATVSIFEEKLIQCFIPTSV FT LSLLKKLLDSGIITDQDIEKFKKEVICFYETFCDYLKKWVKPLEHFEIFSW FT MLLKEKPEWKDVELVLDYFKEKEIEINDTLLFNQIQIVRAIVDSESKENLD FT WKFKIASEKWVCVLQKMDKNFPEQYIDLLKICEYVFAIPAHNANVERIFSL FT MGIQWTDERNSLSSSTVEAILQCITNFDLNCSEMYDYILCKKELLKAAKSS FT NKYL" XX SQ Sequence 2311 BP; 836 A; 313 C; 371 G; 791 T; 0 other; tagggctctc agacgtcccg gaaaaccggg attgtcccgg ttttgaaaaa ttaaaattaa 60 ttaattaact tttaattaac ttttaaaatt tgtaagtcaa atcattttta ttcgaatatg 120 cattgatttt aaccgttaaa atcaatgcat attcgaataa aaatgattaa aataaataaa 180 tcttttttat attggtaaag taaagttgat atttatcatt ttgaaattgt gtttgaattt 240 ataaaagtta ttcaggtatt ttattaattt tggctaaatt gatgttcatt taatattgta 300 aaataattat ttaggtgaaa taatacccat aaaatgttaa aacaaaagag atcctgtaaa 360 tttctagata aatatgctga agagtatcct tttatgaaaa aaggtagaac agaaactgaa 420 ggatactgta gtcattgtaa gtcctatttg agtatttctc atggtggtat atccgatata 480 aaagcacatg taatgactac aaaacacgtc tgtgcagtaa attcatcagc atcgtgtgga 540 aaaattacca gtttctttgt caaaccgaat acaaccgaag cagatcaagt tattgcagct 600 gaattatcct tttcgtatca cattattaaa catcatcaat cattcgtatc tgcaagttgc 660 acaaataagt tgtttccaaa aatattttct gattcaaaaa tttcaaaaaa atatgcctct 720 ggcgccacaa aatctacatg catgatcaca aaagttctgg caccacactc agtttcagta 780 gctgttaatg aaattaattc tagctttttt tatagtttga gtacagacgc aagcaatcat 840 ctatcagaaa aaatttttcc attagtagtt caatacttta cattatcagg aactaaagtt 900 aaacttctaa agctcggaaa tttaaaagga gaaactgcag atctcgtggc taatttttgt 960 atccagtccc taaaagattt aaatttagaa atgaaaaatt gtgttgcatt tagtggagat 1020 aacgcaaatg ttaatttcgg tggccgtatg agagcaggta cgaataacat ttatacaaag 1080 ttgaagggag ccataggtgg ccaaattgaa ggaattggat gtcctgcaca tgttgtatac 1140 aacactttgg agacagcatg tgatcaatta tcatgtgatg ctggtatgat tgttttcaaa 1200 ctttttaatt atttttctat ctatatggta agaacagaac gattaaaaga cttttgtgct 1260 tttgtttctg ttgaattcaa aaaactatta tcttacacaa aaactagatg gctctctctc 1320 ctaccatcaa tagaaagagt tttgaaaata tttgatggac ttaaatctta ttttttgtct 1380 gaagataaag ctccaaaggt tcttctcgat tttttttaat aatccattaa gcgaagcata 1440 tttttggttt attcatagcc aagctagttc agtcagtttt ctaattttaa aaattgaagg 1500 aaaaaatcag tcagttgttg aagccttgct tgtattaaaa gcaactgtct ctatttttga 1560 agaaaaatta attcaatgtt ttataccaac gagcgttcta tctttattaa aaaaacttct 1620 agattcggga ataatcacag accaagatat tgaaaaattt aaaaaagagg taatatgttt 1680 ttatgaaaca ttttgtgatt atttaaagaa gtgggttaaa ccattagaac attttgaaat 1740 attttcctgg atgctgctga aggaaaaacc tgaatggaaa gatgttgaat tagtattgga 1800 ttattttaaa gaaaaggaaa ttgaaataaa tgatacactt ttattcaatc aaatccaaat 1860 agtgcgcgca atagttgact ccgaatcaaa ggaaaatctt gattggaaat ttaaaatagc 1920 ttcggaaaag tgggtttgtg ttttacaaaa aatggacaaa aactttcctg aacaatatat 1980 tgatctatta aaaatttgtg aatacgtttt tgctattccg gcacacaacg caaatgttga 2040 gcgtattttt tcgctaatgg gaattcaatg gacggatgaa agaaattcac tatctagtag 2100 cacagttgaa gctatattac aatgcataac taacttcgat ctaaactgta gtgaaatgta 2160 tgactatata ttatgtaaaa aagaattact aaaagcggct aaatctagta ataaatactt 2220 ataaatgtaa gaaacaaaaa aaatattttt ttaaaatttt gttgtttttc tttatgtccc 2280 ggttcatctt gatggaaatc tgagagccct a 2311 // ID Academ-2_Aplcal repbase; DNA; INV; 3747 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-2_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3747 BP; 1090 A; 777 C; 801 G; 1079 T; 0 other; tagtgggctc cctaactcca cccccactgc aaattttagt aggaacatgc cacttggcac 60 acacgtacct tgaacaaaac taagaatttt gataaacgga cacaagctct atctgcataa 120 tttcactcta atttgcataa attaaaaatg gccgccattg aaaaatcaag caaaatgaaa 180 ataactctaa attatcatac acatcttcga ttttcatgaa aatggtgtca aaatatatgt 240 ttttggggat gccttaacca ttgaaatcaa aattttggat gtagaaggtc cggtttttta 300 tttattcaat atgttagtca catgatgtga ttatgaagta tatatatttt gcatatttca 360 aattgacaag cttggacctc cgctttcacc agtcacccca ttgatgatgc ttcccctttt 420 tagggaatca gctcacagtc caatgatggt attccatggg atgaacattg ttgctgctgt 480 tacaaagcaa ctgaattcag gacagacccc ggtcatggtt gctgatcagc cactctttac 540 attggcaaag aagattccgt ggaaattttc agatgtcttg ggagaggaca agtttgtggt 600 gatgcttggg gcactgcaca ctgagaaaat gctgtttgaa gggcctaggg gagtggctgg 660 aagggagtgg atggacatca cttctgtcag tcagtggtgt ggcttcaagt ggagttgctg 720 attctatgat gtaggtgaca catctgaccc gcactcgcta catgcaccaa gtcacagctc 780 ttgctttgta tagtttactg agacaagccc atcgtgaata cctgacaaca actattgaag 840 atgagccatt ggaactggat gcttggatta ggtatcagcg agaccgagaa ccgcaattct 900 tgtactggca ccagactctt gagctgcagc tgatggttct gcagtttgtg aagttgattc 960 ggactgcatc gtttgatctg tacttagtga ctttagaaca tctcatgcca tgggtgtatg 1020 ctcttgatca catacactat gtccgcaatc tacccattca tctacgagat atgtgtggcc 1080 ttcgagaaat acatccaaca gttcataaag agttcatgaa gggaaaattc gttggacaga 1140 agacaagtcg tgcattttct agcattgcct tggaccaaat tcatgagcag ctcatcggtt 1200 ctctgaaggg tgatggagga ataattggac tgactgagga tcctgtagcc cttgaaagat 1260 ttatgattac tggaccagaa cttgctcgca tcattgagga gttcgaaaac acacctcaaa 1320 acagtcacag gaagcaccat gaacagtatc ctaaatgtcg agagactttc agagaagatg 1380 tgaatggctt aatcagtgca ttctctgatg tgggaaatcc tttccttgaa gtatctgggc 1440 aattgatttc tcttgacagg tccagaatca tgacaggtga tgttatgagc tctgtgagaa 1500 acatcacaaa aattggaaaa gacaagtatg atacattctt ttcagaaagg gtttcatcca 1560 gcacatcccc atggacctcc acctttcacc tgaatagatt gcctcttttc ggtgtaaaag 1620 ctcaacaacg taaagaaaaa tccgaccttg cttccatgaa ggaagaacga acccagttta 1680 tttaaatgat gctctctggg cactcaggaa gggacattgg tgaagaaatg tttgcacatg 1740 agaacactac attaccacca tctctcagtt cccagggtca gatgcacaaa ggaacaaaat 1800 ctgaaattct caagtgcctc gaaggtgata ttgagattct acttctgata cttcccctaa 1860 ttctgatgtt gttattctag atggagcctt catcgtgcaa tcactgagac ctggtacagc 1920 atcaacattt caggattatg cagaccaggt gtttcttcca tattgtgatg tcatggctaa 1980 agggtgtgtc gtgggatgcc tacaaacacg atagtctcaa atccagcact cgggaaacca 2040 gaggcagtgg catgcggtgt cgagtgacat catcaaccaa agtagcgagt aactggcaag 2100 gatttttacg ggtcgatgag aacaaaacag aactgttttc tctgttagca gagaaaatgc 2160 aagagatgta cacagagggc aagcaagttg ttacaacctt cggagagaat gttctcactt 2220 cacctgttcg ctcaagtact cgcgctttag caccttgtat tcacgaggaa gctgattcaa 2280 ggattttttc catgttgcag atgcgacaca tgaaggattt aaacacatca ccattcgtgc 2340 tagtgacagt gatgtggtga taatagctgt gtcctgtttc caagatctca accttgatga 2400 gctctggata gcctttggat ccggcaaatc atacagacat atcccagttc accagattgc 2460 tatgcagttg ggaccatcca agtctcgagc tcttctagca tttcattctc tgaccggctg 2520 tgacaccacc tcagcatttc aaggaaaagg caagagaaca gaatgggctg tttggcagga 2580 attcccagag ctcactgcag cacttcagcg cctgtcaaca ttcccaagta gaatcgatga 2640 tgatgtcatg tcctaagtag aggcattcat cgttcgcctt tacgacaggt ccttggacat 2700 cacatcagtc aatgcagcac gcatggaatt gttctactac aaaggcaggc attttgagaa 2760 cttgccacct acaaaggatg cattgctaca acacacatta cacgcagcat atcaggcagg 2820 ccacatctgg ggccaagccc tcatctgttc tccaaccttg cctgactcat atgaatgggg 2880 atgggtcatg acaaatgaca agtggttacc tcgatggaca acacttgccc ctattacaaa 2940 gaaccacaaa tgcctcgtga catgtcagtg caagaaagtg tgcaagcctc catgcaaatg 3000 ttgcaaagct ggtgtgaaat gcagtactct ctgtgcatgc aggggagctt gtttcaatca 3060 aggaaaataa tgtgtacata tcagtgaaca taacgcattg gttattgaaa gaataggcaa 3120 aatatgggtg aacctggcct gctaaaggga ttttaaatta caaaaacaaa accacatgtg 3180 tagtcctagc atcttcccaa aattatggtt gtgcgtcatt ttaaaaaatt tttcaaggcg 3240 gccattttta atgtatgcaa attaagaatc aatttgacct tttatgataa ttctttgggt 3300 gtcattgagt ttgtcagcca gaacttacta tgaaacaaca catttatatt tctatcattc 3360 ttctgtcaaa agttatatat tcggttcgta ttttcattat acttcagtgg cggccatttt 3420 tcatttatgc aggttaagag tgaatttgac cttttattat aactttgatg gtgtcattga 3480 gtttgccagc catatattta tatgaaacaa caccaaaatt aatgttctac catttgtctg 3540 tcaaaagtta tatccagtta tcattttcat tatatttcag tggcggccat ttttaattta 3600 tgcaaatgag ggcctgaata tgcatgtgga ccttgtgtcc gtttctattt tccaatgata 3660 tggtgtaagg tacttacatg ccaagtggtt ccttcctact aaaatttgca gtgggtaatg 3720 ggactgctgg cctaagtcag cccacta 3747 // ID TransibN2_DP repbase; DNA; INV; 40 BP. XX AC . XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 13-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE TransibN2_DP is a nonautonomous DNA transposon - a partial DE consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; TransibN2_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-40 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC TransibN2_DP belongs to the TRANSIB family of DNA transposons. CC This element is characterized by 5-bp target site duplications. CC TransibN2_DP is characterized by its TIR only. There are >10 CC elements in the genome that have this TIR and different internal CC parts. XX SQ Sequence 40 BP; 14 A; 9 C; 13 G; 4 T; 0 other; cacagtgggg cctcagccga aaagaggtgg caaaagtcaa 40 // ID Transib-5_HM repbase; DNA; INV; 3092 BP. XX AC . XX DT 30-JAN-2008 (Rel. 13.01, Created) DT 30-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3092 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 5-5 (2008). XX DR [1] (Consensus) XX CC Transib-5_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome just a few CC million years ago (they are ~1% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of ~20 copies; it codes for a 663-aa Transib CC transposase. Like other Transib transposons, Transib-5_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 637..2525 FT /product="Transib-5_HMp" FT /note="Transib transposase." FT /translation="MKIIDLIPKVREICITNKDIVSLLYEEILISEGLDGE FT KVHSEKLTEKIRILVIKLKAKWIAVNKTYDRLLMKEKKWLQTEIQIVMKAD FT NVSRKEFSGGRPKKDWKDLGERSKRTRVVGLSAHDPEVLALAAVKSAKSSG FT DLQDFAYVVKESIQKASEIRKSMDSKEPIMMSEKEALSLKVNCDLSDDQYQ FT MIRNSSIKHNANIYPTLHALFKEKSNCYPEDLVVTETSAESKLQSMVNHTL FT KKVIVLSEDNLKTLDPDQNVLNGRFILKAGFDGASSQSLYKQRYDNTDIEE FT AKKNEESLFQTAIVPLKLSVESKTIWENKKPNSSHFCRPLRLQYKKETTEL FT SKSEEKHIRDQMADLEDYVEIITLADGRKVSINIKSHVELTMFDGKVVNAL FT TNTKSSASCNICGAKPSEMNKLDLVKIKPVNDDAMSLGLSSLHCWIKCFEY FT ILHLGYRLEMTIPRTDVRLAEDKVLVQARKKTIQDRFRAELSLLVDVPKTG FT FGNTNDGNTARRAFMHGEAFADITGVQVELILRVRTVLQAVCSGYDLDIDR FT FSQYCNETLAFIVNNYDWYRMPPTLHKLLVHGPAIAENLELPIGYYSEEAQ FT EAQNKVIRHARLNHSCKISRLNVMKNQFQYLL" XX SQ Sequence 3092 BP; 1120 A; 444 C; 557 G; 971 T; 0 other; cacagtgcct cacagatacg aaaaaacggc aaaaacccta ttaatttttt aaatttattt 60 tgatgttttt taactttatc ttgattgtta taagcatact gaacaaggta aacccatttt 120 ttatgatttt gtaaattttt actctttttg tgagcttgtt atatgcaggt tgtttgtggg 180 caaaaattca ctagaaattt tagagttttc aagtagatct taaaattgtc acaaaatggt 240 catttttaaa aatttttctt tcattttttc ggtgaacatc atatatatat atatatatta 300 tatcatccgg taagtaactg acattggact tttatcaaca ccaaaaatta atgttcgaat 360 ttcacaattt gtgaaaaaat taaaaatttt cagaaaataa tattcacaat atttactttt 420 tctaaaaatg tccacacttg tccaccctga attttgatgg ggaagtatct tgaatatata 480 caggaatgcc cggaaaaaaa aaacccgccc caaagtagca cgtttttgag aaattgaatt 540 ttaaagtttt ttgcaattaa taagaaatcc gcaaaaagtt gtttacaaat ccaaaaaatt 600 agttagtaat atgtgccttt gtgattacat ctcaaaatga aaataataga cttgattccc 660 aaagtaagag aaatttgtat aactaacaag gatattgtga gcctccttta tgaggaaatt 720 ttgatatcag aaggtttgga tggtgaaaaa gtgcatagtg aaaaattaac agaaaaaatc 780 cgaattttgg tgataaaatt gaaagctaag tggattgcag taaacaaaac atatgacagg 840 ctgttaatga aagaaaagaa atggttgcaa accgaaattc aaattgttat gaaagctgac 900 aatgtaagca gaaaagagtt cagtggaggg cgaccaaaga aagattggaa agatctagga 960 gagagaagta agcgaacaag agttgttgga ttatctgcgc atgatccaga ggttttagct 1020 ctggctgctg ttaaaagtgc aaaatcatca ggtgacttac aagactttgc ttatgtggtt 1080 aaagagtcta ttcagaaagc atctgaaatc agaaaaagta tggacagcaa ggaaccaata 1140 atgatgagtg agaaagaagc actttcatta aaagttaatt gtgatctcag tgatgatcag 1200 tatcaaatga taagaaactc ctcaatcaaa cacaatgcca atatttatcc aacattacat 1260 gctctattca aggaaaaaag taattgttat ccagaggacc ttgttgtaac tgaaacttct 1320 gctgaaagta aattacagtc catggtgaat catactctta aaaaagttat tgtattaagt 1380 gaagacaatt taaaaactct ggatccagat caaaatgtgc tcaatggtcg ttttattcta 1440 aaagcagggt ttgatggtgc atcaagccaa agtttgtata aacaaaggta tgataacaca 1500 gacattgagg aggctaaaaa aaatgaagaa tcccttttcc aaactgcaat tgtacctctt 1560 aagttaagtg tagaaagtaa aacaatttgg gagaataaga aaccaaatag ttctcatttt 1620 tgcagacctc ttcgtcttca atacaaaaaa gaaaccacag aacttagcaa gtcagaagaa 1680 aaacacattc gtgatcagat ggctgatttg gaagattatg ttgagattat tacacttgcg 1740 gatggaagaa aagtcagtat caatataaaa tctcatgttg aactaaccat gtttgatggc 1800 aaagtggtta atgccttaac caacacaaaa tcatctgcaa gctgcaatat ttgtggtgcg 1860 aaaccatcag aaatgaacaa gctagatcta gttaaaatta aacctgtgaa tgatgatgct 1920 atgtcccttg gtttatcatc attgcattgt tggattaaat gttttgaata catacttcat 1980 ttaggctaca gactggagat gactattcca aggacagatg tcaggttagc cgaagataag 2040 gtacttgttc aagctagaaa aaaaactatt caagataggt ttagagcaga acttagttta 2100 ctggtagatg ttccaaaaac tggatttggg aatacaaatg atggtaatac tgcccgaaga 2160 gcattcatgc atggtgaggc ttttgctgac ataacaggag ttcaggttga gttgattctc 2220 cgggtgagga cagttcttca agctgtttgt tcagggtatg acctagatat tgatagattt 2280 agccaatact gcaatgagac tctagccttt atagttaata actatgattg gtacagaatg 2340 cctccaactc ttcacaaact tctagttcat ggacctgcaa ttgctgaaaa tctagaacta 2400 cccattggtt attattctga ggaggctcag gaggcccaaa ataaagtgat aagacatgct 2460 aggttaaacc attcatgcaa aatttccaga ctaaatgtaa tgaaaaatca atttcaatat 2520 ttgctgatca gaactgatcc agaaatttcg tcgatcaact tcataaaaca caaatcttcc 2580 aatggaaacc ccttgtctga acaagttctt aacttgctga aacactaagt ttagtttttt 2640 tgtttaaatc taaattatat tgttacatgt tatatatgtt atatgttaca acatacagaa 2700 attgtagata tttgtttctg aataatgtaa tctttattgt tttcactgtt tactgacaaa 2760 atcggctgtt attaaaggat tatatgtaga gtgtttaaaa acattgaaaa tttgtcactt 2820 tttatttaaa taaactatat atatatttat gaatttgtct ctataagtac aataaaaaac 2880 gacagaaaag catgttgaga aagtgggaaa aaaattcaga atttgcaaat ttgcaaaaat 2940 attttgaaaa aagtgtgcaa aattattttc tgactttcgg gttgggttac tgattaaatt 3000 ttaatatatt tagagccaaa aaaggttgaa atccattcag aaacactgat ttgagggttt 3060 ttgccgtttt tccgtagctg tgaggcactg tg 3092 // ID BEL-78_AA-I repbase; DNA; INV; 6023 BP. XX AC supercont1.278; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-78_AA_; KW BEL-78_AA-LTR; BEL-78_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6023 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.278; Positions 457149 451127. XX CC Positions [5066-5623] - Integrase core CC 'CTGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 418..5139 FT /product="BEL-78_AA-I_1p" FT /translation="MEEELALKEKQLEDERQMREKKLEIEKRFIQRKMQQE FT RELREKELAQEKALLEKQLADQVEFQNRQRELRERFQREKFEILSANLELE FT GAVGGMLPGVDVAQDNQKMVEAWLDKEKPVHTANQKLDEAEAKSIAQERIA FT KKQYALETPKRHPEILAVPPGAGNIKLPNFQPLDELDEEAASDDTSEDDGS FT EDTPPVRVVMPTGPTKAQRAARQVLSKKLPIFTGKVEEWPLFYSSFVNSTT FT ACGFSNIENLVRLQECLKGAALESVRSRLLLPQAVPQVIDTLQMLYGRPEQ FT LIHSLLNKVRRAEAPRADRLETFIPFGLAVQELCDHLMAANFHDHLVNPVL FT IQELVDKLPAATKREWVQFKRFVQPVTLTTFAQFTSKLVSEASEVILVIDP FT KTKSEKVKGKDRGFVNAHSNAQERSFIKSKPVAPCQVCRKGEHRIRNCEKF FT RQLNLGERVKVMEQLKLCERCLNEHEGWCKFKITCNVGDCRHHHHPLVHRE FT QSNASAHQIRSHLNPASTSRFHCMHMSSTSSIIFRIAPVILFNGSKSIRTM FT AYLDEGSELTLIDEDLVRVLKASGMSQPLKLRWTGNIVRDERNSECVSLEI FT SGINCGKRFDLKDAHTVERLHLPKQCISFGEIGRQYRHLQGIELLDQTGDQ FT PKLLIGLNNAYLLAPLDSRIGKADEPIGVKSHIGWTVYGPRRSSLPTGTFV FT GYHAGLTNEELHSAMKKYFTAEDVEGTVSQLPESATDRRAREIMQNTTVKR FT NGRYETGLLWKMDEFTFPDSLPMALHRLRGLERRLDKDPQLEKKVQQQIQE FT YQTKNYIHEASVEELENADQAKVWYLPLNVVLNPKKPEKVRLVWDAAAQVR FT GISLNSMLLPGPDLLVPLPAVLQPFRERAVAFGGDLKEMFHQFIIREADRQ FT AQRFLFRTDRQREPQIFVMDVGTFGATCSPASAQFIKNLNASQYAGKYPDA FT ARSIVRLHYVDDYLDSADSVNEAVKRAKEVRFIHSQAGFTIHKWISNSPTF FT LEELGEHETEDKISLHLDEGTAERVLGLIWRPREDVFMFSTSVRDDLAPYL FT LATARPTKRLALSCVMSLFDPLGLLAPFTLYGKILLQDLWKSGCDWDQEIN FT DECFKKWTWWRNILSSIENIQIPRCYLKGVSADAYNTLQLHVFCDGSSQAY FT GCVAYFRLETPGGIICSLVMAKTKVAPLKQLSIPRMELQAAVLGARLANTV FT IANHRVKVTKRFLWSDSRTVLSWIQSDQRKYKPFVGFRIGEILQETAVDDW FT RYIRTKMNIADLLTKKKRDSNLTTDGEWFHGQKILYAPEESWTGINRMAPS FT TNEELRACHLYHASHTVEFAGPLVDVTRFSKWNILVRTMACVFRFISNCRE FT KRRNKPIEVLKTNKRENNLECRSASTISILLKQEEYEMAEIALWKLAQSDG FT FWEEKAILLRNQTTPMEKWSKIDKKSALYKLAPFLDEYQVLRVDGRLSQAE FT FIPYDSRFPIILPSHHPVTNLLIDHLHRRFGHAFKDTVVNELTVLSSQSLF FT LCFTGGRVVPTMQIEEESTAYTQDGSSSSRAAYTIRTTVQLCWDRLFRSSR FT CSRWKAL" FT CDS 5036..6022 FT /product="BEL-78_AA-I_2p" FT /translation="MAPLPVERLTPFVRPFSYVGIDYFGPVDVVVGRHCEK FT RWIVLFTCLTVRAVHLEVAHKLDAVSCIMAIRRFVLRRGPPISIFSDNGTN FT LKAANKELQEQINRIDTACANVFTNARTQWRFNPPSAPHMGGVWERMVRSV FT KHVMSELHDGRRLNDETLLTVISEAEEIVNSRPLVYMPQDSLHAEALTPNS FT FVRGITSCLENPAIPPTNQAEALRNAYKRSQVTADALWKRWIEEYLPALNN FT RSKWLEESRPLAPGDLVFIVDGQHRNGWVRGQVESVFSGKDNRIRQALVRT FT AHGVYRRPVTKLAVIEVASSGKSDPEPGSGQDLRARE" XX SQ Sequence 6023 BP; 1755 A; 1350 C; 1470 G; 1448 T; 0 other; attactcaaa ataattaccc acgatatggc cggacaatgc aagaggtgca acgaaccgga 60 taacgatgaa atggtagcgt gcgatacgtg ctcaacctgg tatcatttca cgtgtgtagg 120 ggagtcaccc ggagttgcaa accgctcctt ctgctgtcac acatgcgcag agaaaaagaa 180 gaaacccaag agaggccaaa agggcaaaac gacgaatcag ttgggcccac atgctagcat 240 acccgagccg tcaattgatg cgtcatctaa taagagtgtc tcatctaaaa aaaccatagc 300 tacagcgaac gaaaacgacc cagcgggaaa accgtcatct tgaggacacg acacagatca 360 gtttcttctc acacgagccg tgggtcacat tcacgtgtgc agttagagtt tcgccgcatg 420 gaggaggagt tagcactgaa agaaaaacag cttgaagatg agaggcaaat gagggaaaag 480 aagctggaga tcgagaagcg attcattcaa aggaagatgc aacaagaacg ggaattgaga 540 gagaaagaac tagctcaaga gaaggcgttg ttggaaaaac aactagcgga tcaagtggag 600 tttcaaaatc gtcagcgtga gctacgagag cggttccaga gagaaaaatt cgagattcta 660 tcggcaaatt tagaattgga gggtgctgtc gggggtatgc tacctggcgt ggatgtagcg 720 caggacaacc agaaaatggt ggaagcctgg ctggataagg aaaaaccggt tcatacagca 780 aaccaaaagc tcgatgaggc agaagcaaaa tcgattgctc aagagcgcat cgccaagaaa 840 caatacgcac ttgagactcc aaagcgacac cccgaaatac tggccgtgcc acctggggca 900 ggtaatatta aattaccaaa ttttcagcca cttgatgaac tcgacgaaga agctgcaagc 960 gatgatactt cagaggatga tggttcggaa gatacgccac cggtcagagt ggtaatgcct 1020 acaggaccta ccaaggcaca acgtgcagct cgtcaagtcc tctctaagaa gcttcccatc 1080 ttcacaggaa aagtggaaga atggcctttg ttttatagca gtttcgtgaa ctcgacaact 1140 gcgtgcggct tttcaaacat tgaaaacctt gtacgcttac aagaatgttt aaaaggcgct 1200 gcattagagt ccgttcggag ccggcttctc ttaccacaag ccgttccaca ggtaatcgat 1260 acgctccaaa tgttatacgg acgccccgag cagttgatcc atagtctctt gaataaggta 1320 aggagagctg aggcgcccag agccgaccgc ttggaaacat ttattccgtt tggactcgca 1380 gttcaagagc tgtgtgacca tctaatggca gcaaactttc atgatcacct tgtgaatcct 1440 gtactcatcc aagagctggt ggacaaactt cctgcggcta cgaagagaga atgggtacag 1500 ttcaaacgat tcgttcaacc ggtgactctt actacgtttg cacaattcac atcgaagctg 1560 gtttctgaag ccagtgaagt cattcttgtc atcgatccta aaacgaagag tgaaaaggtg 1620 aaaggcaagg atcgaggatt tgtgaatgca cactcaaatg ctcaagaacg atccttcatc 1680 aaatcgaaac cagttgcacc ttgtcaagta tgccgaaaag gagaacatcg catccgcaat 1740 tgcgaaaaat ttcggcagct gaacctcggc gaacgtgtga aagtcatgga acagttgaag 1800 ttatgtgaac ggtgtttaaa cgaacatgaa ggatggtgca aattcaaaat tacctgcaat 1860 gtaggtgact gccgccacca ccaccacccc ctggtacatc gtgagcaatc aaacgcttca 1920 gcgcatcaaa tacgttcaca tttgaatcct gcatcaacta gcagatttca ctgcatgcat 1980 atgagctcca catcgtcaat tatcttccga atcgctcctg tcattttgtt caatggtagc 2040 aaatccatcc gtacgatggc atatttggat gagggttccg aactaacttt aatcgatgag 2100 gatcttgtgc gtgttttgaa agccagtggt atgtcgcaac cgttaaaact gcgatggact 2160 ggaaacattg ttcgtgatga gcgcaattca gaatgcgttt cgttggagat ttccggaata 2220 aattgcggta aaagattcga cttgaaggat gcacataccg tcgagagact gcatcttcct 2280 aaacagtgta tcagctttgg tgagattggt cgtcaatatc ggcatttaca aggcattgaa 2340 ctactagacc aaacaggtga tcaacctaaa ctcttaattg gactcaacaa tgcctatcta 2400 ctagctcctt tggactctcg cattggtaaa gcagacgagc ctattggagt gaaatctcac 2460 atcggatgga ctgtctacgg tcctagacgt tcgtctcttc ccacgggaac tttcgttgga 2520 tatcatgctg gtctgacaaa cgaggaacta cacagtgcta tgaagaaata tttcaccgcc 2580 gaggacgttg aagggacagt gagtcagctt cctgagtctg cgacagatcg tagagctcga 2640 gaaattatgc aaaatacgac agtaaaacgg aatggtcgct acgagaccgg ccttttatgg 2700 aaaatggatg aattcacttt tcctgacagc ctgccaatgg cgttacatcg actacgtgga 2760 ttagaacgtc gcctggacaa ggacccacag ctagaaaaaa aggtccaaca gcaaatccag 2820 gaataccaaa ccaaaaacta tatccatgaa gcgtcagtgg aagagcttga aaatgccgat 2880 caagccaagg tttggtatct tccccttaat gtggttctaa atcctaagaa acctgaaaag 2940 gtgcgattgg tttgggacgc tgcggcacag gtacgaggaa tatcactcaa ctcgatgctg 3000 ttacccggtc cagatttatt ggtgcccttg ccggcggtgt tgcaaccatt tagggagcgt 3060 gcagtagcct ttggaggaga tcttaaagag atgttccacc agttcatcat acgtgaggcc 3120 gacaggcaag cgcagcggtt tctctttcgc acggatcgtc aaagggaacc acagattttc 3180 gtgatggatg ttggaacttt cggcgccact tgctccccgg catcggcgca gtttattaaa 3240 aatcttaatg cttcacagta tgccgggaaa tatccagatg cggcgcgctc catagttcgc 3300 ttgcactatg tcgatgatta cctcgacagt gcggatagcg tcaacgaggc agtaaaacga 3360 gccaaagagg ttcgcttcat ccactctcaa gctggattca ccattcataa atggatatca 3420 aattcaccaa catttctcga agaactgggt gaacacgaaa cggaggacaa aatttccctc 3480 caccttgatg aaggcactgc agaacgagta ttgggtctta tttggcgacc tagagaagac 3540 gtatttatgt tttctacatc tgtacgagat gatctggctc catacttgct agcaaccgct 3600 cgcccaacta aaagattggc tctcagctgc gttatgagcc tctttgatcc tttaggctta 3660 cttgcgcctt tcactctgta tggcaaaata ctactacaag acttgtggaa aagcggttgt 3720 gattgggatc aagagattaa cgacgagtgc ttcaaaaaat ggacttggtg gagaaatatc 3780 ctttcttcta tcgaaaatat acaaattcct cgatgctatc ttaaaggagt ttcagctgac 3840 gcctataaca ccttgcaact ccatgtgttt tgtgacggaa gctctcaagc atatggatgc 3900 gtagcttact tccgtttgga aacccctggc ggcataattt gtagcttggt gatggccaaa 3960 actaaggtcg cccccttgaa gcagctttct attccccgaa tggagctaca agcggcggtc 4020 ctaggagctc gactggccaa cacggtaatc gccaatcaca gagtaaaagt tacaaaacgc 4080 tttctttggt ctgattcgcg gactgttcta tcttggatcc aatccgatca gagaaaatac 4140 aaaccttttg ttggatttcg gatcggggaa attcttcaag agacagcagt ggacgattgg 4200 cgttacataa gaacaaaaat gaatatcgca gatctgctaa ccaaaaagaa gcgggatagc 4260 aacttaacca cagatgggga gtggtttcat ggacaaaaaa ttctatatgc accagaagaa 4320 agctggaccg gaatcaaccg aatggcccca agtacgaacg aggagttgcg agcttgtcat 4380 ctttatcatg ctagccatac agtggagttc gcaggaccac ttgtcgatgt tacacgcttt 4440 tcgaagtgga atatattggt gcgcactatg gcgtgcgtgt ttcggttcat ctctaactgt 4500 cgtgagaagc gacgcaacaa accaatagaa gtcctgaaga ctaacaagag ggagaacaat 4560 cttgagtgca ggtcagcgag tacaatatcg attcttctga aacaggaaga atatgaaatg 4620 gctgagatcg ccctatggaa gctagcccaa agtgatggtt tctgggaaga aaaagctatt 4680 ctgctgcgaa atcagacaac accaatggag aagtggtcga agattgacaa gaaaagcgcg 4740 ctctataaac ttgcgccttt cctggacgaa taccaggttc tacgcgttga tggacgcttg 4800 agtcaggctg aattcattcc atacgactcc cgttttccaa taattttacc aagccatcat 4860 ccagtcacca atctccttat tgatcatctt catcgacgct ttgggcatgc tttcaaggat 4920 acagttgtga acgaattgac ggttctatct tcccaaagtt tgttcttgtg tttcacgggt 4980 ggtcgagtgg tgccgactat gcaaattgaa gaagagtcga ccgcttacac ccaggatggc 5040 tcctcttcca gtcgagcggc ttacaccatt cgtacgaccg ttcagttatg ttgggatcga 5100 ttatttcggt ccagtagatg tagtcgttgg aaggcactgt gaaaaaagat ggatagtact 5160 ttttacctgt ctgacagtac gtgctgtcca tctagaagtg gcgcataaat tagacgcagt 5220 gtcttgtata atggctatac ggagatttgt cctgagacga ggtcctccga tctcaatatt 5280 ttcggacaac gggacaaacc tgaaggcagc aaataaagag ctacaggagc agatcaatcg 5340 cattgatacg gcgtgtgcca acgttttcac aaatgcccgg acgcaatgga gatttaatcc 5400 accctctgcg ccccacatgg ggggcgtttg ggagcgcatg gtgcgctccg tgaaacatgt 5460 gatgtcggag ttacacgatg gtaggaggtt gaacgacgaa actttgctga cagtgatatc 5520 cgaggccgaa gaaatcgtca attctagacc gcttgtgtac atgccacaag attcgttaca 5580 tgctgaggcg ttgaccccaa atagtttcgt tagaggtatt acgtcgtgct tggagaaccc 5640 agcaatccct cctacaaacc aagctgaagc tcttcgcaat gcctacaagc gttcacaggt 5700 tactgccgat gccttatgga aaaggtggat agaagaatat ctacctgcgc ttaataaccg 5760 aagcaaatgg ttggaggaaa gcagaccatt ggctccggga gacttagttt tcatcgtaga 5820 cggacaacac cgcaacggat gggtacgagg ccaagtggag tcagtattct ctggcaagga 5880 caatcgcatt cgacaggcat tagttcgcac cgctcatgga gtgtatcgtc gccccgtaac 5940 caaattagca gtgattgaag tagcatcttc tggtaaatct gatcctgaac caggttctgg 6000 acaggattta cgggccaggg aat 6023 // ID I_Ele42 repbase; DNA; INV; 5990 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele42. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5990 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5990 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 14 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 306..1649 FT /product="I_Ele42_1p" FT /translation="MAGSSSAPPGGTAGHQSRSNLPQWMIGPDDIGQTVVL FT VLRRDTKSTDDFDHGLSQNSPLPHPFVVGSSIQQIIGVEAARSINTTREGR FT GSRYLLRTKSRIISDKLMKITELCDGTRVEIVPHPTLNTVQGMVYEPDSLD FT VDEKLIEQELKSQGVLAVRRIKKRVNGKLQNTPLLVLSISGTILPEFVYFG FT LLRIPIRKYYASPMMCFNCGFYGHSRRFCNQTAICLRCSTTHTVVEGEQCN FT HDPKCLHCKGGHPTYSRDCPKYKEEEKIVRLKTDLGISNAEARRMYNESNR FT TDSFSKVVQDEVQQELAKKDQLIASLQKQVAALAKEIGLLKKSQRSFAHSL FT SPVPQNQKSSATQSKQIKPTTSTTHQQPTQNTSNRISRRDKIASSTPCKRT FT TERSNEKMDYDILTRSRSGKRHFEISPTDSGKSSGKRSLAPPGSKSPTSIE FT VDE" FT CDS 1649..5911 FT /product="I_Ele42_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MMIQQPRINHIKTIDMTDPAMTKDPRMEDQTTTHANF FT DDSSIQSEHDLEQRLHTTPATYQLHRKRFEGDESVVLPSLPEEEAVLESAV FT VQDVILPPVDHQTSSPAVEVIHLPPLPNSAKSNCNIYDALRNITVHPSTSK FT DHATRNFDSPSARLPTSTLTREVPDWSDEPLAAVDVDHPYDPASINLTLPP FT SVQSNPIGYSNIPAMSNDRTHCEQSTLSNDDSSYQSSQEQRRNLLFLQWNI FT RGFWSNHTELYHLVHRNPPIAIGLQEMMAKDADKALQRQYTWCFSDRHYSQ FT GVGAAGLGIRTEVPHKFIGISCPIPVCTARLFDPYNITVVSIYVPPNSPDK FT HIIDTLNSIISNAEPPFIIGGDFNAAHEAWGSSKASKRGRQLLCWFVENDF FT IVLNNGEPTFLSSRHGSTSCIDITAVSKSFASRLHWSIANDTHGSDHFPIK FT VYSQEALLEKRNRKRWLYKNANWTLFEHNILESMEIEADLSIDKLGDIILK FT SAKASIPRTSGKPSKRAEIWWSEEVKAAIKTRRKALRKLKRTATNDPEHER FT IRKAFQSARSIARKTVSTAKRESWDVFCNSFNPQTPSETLWNNFNKLNGKR FT KNGTKGLVIRGTYEQDPSVITEHFADFFQQASATSVSHESNLPQSVTNENS FT PLNEDFHISELLRAIDSTKGYSTGSDNIGYPMIRHLPIAGKYAMLRSFNRV FT WAEGRFPQSWREGIVIPIPKPNMPRNTVENYRPITLLSCIGKIYERMVNHR FT LVTFLEEQNILHPQQHAFRAGRGTSTYFAELKEIIENAKQSKTHLEFALLD FT IQKAYDQAWRPHILHQLNRLHVGNRIKSCIKHYLQDRNFRVSYGGSLSSQK FT SLDNGVPQGSVLAVTLFLLAINPVFEVIPKNIQVLVYADDIVLVASSKQLS FT KVRQHLSNAVKAVNEWAKSVKFSLSPSKSCILHVCQRTGHRGYRSLPDICL FT NGEVIPEVNSARFLGVWMNRRGKFSIHAAKTKIALQNRINFIKAISPKASR FT ESLWKISNAVCISKLLYGVELFGNEITTLLQPVFNQLLRISSGALRSSPTH FT SLAAEAGELPLQLRVIELFVRKYCKITEKSRRTYTHYRRAVDLTLCEVVGV FT ELPNIAKLQRVGRRPWNSPRIKTDLSIKHAFKKGQNTRIARVITEDHLISK FT YANYMKIFTDGSKSGSEVGVGVIGPTFFVERSLXSQCSVYTAEAAALLTAA FT HHAPRTETVILSDSASCITAIEKGESTHPFLQEFEKISAHKNITVCWIPAH FT SGIQGNELADKAAERGRSSRLLYRSIPASDVIFWVKQQLRRNFQEQWSNNK FT TTFLQFCKPTVSKWIDRENRLEQKMLTRLRIGHTIITKKHLYDKMSPRKCD FT TCNIDLTVEHILLNCRKFELIREELAMSNNIEIVLGNSKENESTLIKFIKK FT CKLVDKI" XX SQ Sequence 5990 BP; 1897 A; 1431 C; 1225 G; 1436 T; 1 other; tagcgctcgt aattttcgtt ttttttttcg ccaaccgctg tattaagaag acccaaatca 60 agttacaggt gggagtagtg aagttttccc aaaactcgtt agttcagttg ttaaaaacag 120 taaactgtta gtgcaattat cacgcggcgc cgggttgtgt agttacatac atgcacactc 180 gctattgtgc tagctggtaa acagcccgag tgaacagata tttccaaact ggtgctgaat 240 ctgtacaaat acctccgtct ggaaaaaaaa atcgaagtgt ataaagcgtt agcaagctac 300 ggcccatggc cggtagctca agtgcccctc cggggggcac agctgggcac cagagccgta 360 gtaatttgcc ccaatggatg ataggacccg atgatatcgg ccaaaccgtg gtgttagtac 420 ttcgtcgcga cacaaaaagt acagatgatt ttgaccacgg gctatcacaa aactcccctc 480 taccacatcc attcgttgtt ggatcgtcaa tccaacaaat tatcggcgtt gaagcagcta 540 gatcaatcaa cactactcgt gagggacgcg gatctcgata tttgctccgt accaaatcaa 600 ggatcatttc tgataagctt atgaaaatca ctgagctgtg cgatggtacc cgtgtggaaa 660 ttgttccgca tccaaccttg aacactgttc agggtatggt ttacgaaccg gattccctcg 720 acgttgacga aaagctgatt gaacaagaat tgaaatctca gggagttctg gcagtgcgtc 780 gaattaaaaa acgggtcaac ggaaaactgc aaaacacccc actacttgta ctgtcaatca 840 gtgggacaat actgcctgaa tttgtctact tcggcttgct tcgcatacca attcgcaaat 900 attatgcttc gcccatgatg tgttttaatt gcggtttcta cggccattcc cggagattct 960 gcaaccaaac cgctatctgc ttacgctgct caacaacaca tacagtcgtt gaaggggagc 1020 aatgtaacca cgatcctaag tgcttgcact gtaagggcgg acatcccaca tactcacgcg 1080 actgtcccaa atataaggaa gaagaaaaga tagtgcgact gaaaacagat ctgggtattt 1140 cgaacgcaga agcaaggcgc atgtacaacg aatcaaatag aaccgactcg ttcagtaagg 1200 ttgttcaaga tgaagtacaa caggaactag caaaaaaaga tcagctcata gcatctctac 1260 agaaacaagt ggctgcactt gctaaagaaa ttggattgtt aaaaaaatca caaagatcgt 1320 ttgcacacag cctgtcccca gttccgcaga atcaaaaatc atccgctact caatccaagc 1380 aaataaaacc aaccacttct acaacacacc agcaacccac acagaatact agcaatcgca 1440 tatctcgtag agataaaata gcctcgtcta caccatgcaa gcgcactacc gagagaagca 1500 atgaaaaaat ggactacgat attctaacgc gaagcagaag tggtaaacgg cattttgaaa 1560 tatcccccac ggactccggc aaaagttcgg gaaaacgaag tctagcgccc cctggatcga 1620 aaagccctac ttccattgaa gtagacgaat gatgattcaa caaccacgca taaatcacat 1680 caagaccatc gacatgactg accctgctat gacaaaggac ccgagaatgg aggaccaaac 1740 tactacccat gctaatttcg acgacagctc tatccaatcg gaacacgatt tggaacaacg 1800 gctacatact actccagcta cataccaact acatcgaaaa cggtttgaag gcgacgaaag 1860 tgtagtactc ccttcacttc cggaagagga agcggtactg gaaagtgcgg tggtccagga 1920 cgtcattctt ccacctgtag atcaccaaac ttctagtccg gccgttgaag ttattcatct 1980 tcccccgcta ccaaattcag ccaaaagtaa ctgcaatatc tacgacgctc ttcggaacat 2040 caccgtacat ccatctacga gcaaggatca cgccacaaga aactttgatt cgccaagtgc 2100 gaggcttccc acgtcgaccc tgaccagaga ggtgccggat tggtccgatg agcctctggc 2160 ggcagtcgac gtggaccacc cctacgaccc ggcaagtatc aatcttacat taccccctag 2220 tgttcaatca aatcccatcg gttactccaa cattccagcc atgtccaacg atagaactca 2280 ctgcgaacaa tcaaccctga gcaacgatga cagcagctat caatctagcc aggagcaacg 2340 cagaaaccta ttatttttgc aatggaacat aagaggtttt tggagtaatc acactgaatt 2400 gtaccacctt gtgcatcgga acccaccaat agctatcggt ttgcaagaga tgatggcgaa 2460 agatgcagat aaggcgcttc aacggcaata tacatggtgt ttctcagacc gacattactc 2520 tcaaggtgta ggtgctgccg gtctcggaat tcgaactgaa gtaccccaca agttcattgg 2580 gatatcttgc ccaatccctg tctgtaccgc tcgtttattt gacccgtaca atatcacagt 2640 ggtttcaatt tatgtgccac caaatagtcc agacaaacat attatcgata ccttgaacag 2700 catcatatcg aatgcagagc ccccctttat catagggggg gacttcaacg ccgcccacga 2760 agcttggggc agttcaaaag catccaaaag aggacgacaa ctattatgtt ggttcgttga 2820 aaatgatttc attgtgttaa acaacggaga acccaccttt ttgagctcgc gccacggcag 2880 cacatcttgt atagatataa cggctgtgtc aaaaagtttc gcaagccgac tgcactggtc 2940 aatcgctaat gacacacacg gcagtgatca ttttccgatc aaagtgtact cccaagaagc 3000 gttattggaa aaacgaaatc gcaagcgatg gctgtacaaa aacgccaact ggactctttt 3060 cgaacacaac atactcgaat ctatggaaat cgaagcagac ctttcaatcg acaagctagg 3120 ggatattatc ctcaaatcag ctaaggcatc aattccaagg acatctggca agccaagcaa 3180 aagagcagag atctggtggt ctgaagaagt caaggcagca atcaagacgc gtcgaaaggc 3240 cctgcgaaaa ttaaaacgta ctgcaacaaa tgaccccgag cacgaacgta taaggaaagc 3300 attccaatcg gctcgttcca tagctagaaa aactgtgtcc acagctaagc gagaatcctg 3360 ggacgtattc tgcaacagtt tcaatccaca aactccctcc gaaacgctat ggaacaactt 3420 caacaaactt aacggaaaga ggaagaatgg aactaaaggc ctcgtcatcc gtggtactta 3480 tgaacaggac ccatccgtca tcactgaaca cttcgccgat tttttccagc aagcttcagc 3540 aacttctgtc agtcacgaaa gcaatttgcc gcaatctgtt actaacgaga actcaccctt 3600 gaatgaggac tttcacatca gtgagctctt gcgcgccatc gactcaacaa aaggctactc 3660 tacaggtagc gacaatattg gatatccaat gattcgacac ctacctattg ctggaaaata 3720 cgctatgctg cgatcattca atcgcgtttg ggccgaagga agatttccac aaagctggag 3780 agaaggtata gtgataccca ttcctaaacc caacatgccc cgaaacaccg tggaaaacta 3840 ccgacccatc actcttctga gttgcattgg gaagatctac gagaggatgg taaatcatcg 3900 tttagtcact ttcctagaag agcaaaatat tctccaccca caacagcatg cttttcgagc 3960 tggtcgagga acgtcaacat atttcgccga actaaaagaa ataatagaga acgccaaaca 4020 atcgaaaact catctcgaat ttgcgttact tgatattcag aaggcttacg atcaggcgtg 4080 gcgaccgcac attctgcatc agctaaaccg cttgcacgtg ggcaatcgca tcaaatcttg 4140 catcaaacat tatcttcagg atcgcaattt tcgagttagc tacggcggtt cactttcgtc 4200 acaaaaatct ttagacaacg gtgttccgca aggatcggtg ctggcagtta cgttatttct 4260 attggccata aatccagttt tcgaagtaat tccaaaaaat attcaagttt tggtttatgc 4320 ggacgatata gtgctagtgg cttcttcgaa acaactctcc aaagtacgac aacatctctc 4380 gaacgcagtt aaggcggtaa atgaatgggc aaaaagcgtt aaatttagcc tctctccatc 4440 taaatcgtgc atcttgcatg tttgccaaag aacgggacac agaggatacc gatcactacc 4500 tgacatttgt ttgaacggtg aagtcatccc cgaggtaaac tcggctcgat ttctcggagt 4560 ctggatgaat cgaagaggaa aattttcaat ccatgctgcc aagaccaaaa tagctcttca 4620 aaaccgtatt aacttcatca aagctatatc gccaaaagcc agcagagaat ctttatggaa 4680 aatttctaat gcagtttgta tatcgaagct gttgtacgga gtagaattat tcggaaatga 4740 aatcaccacc cttctacaac ctgttttcaa tcaactgcta agaatctctt ctggagctct 4800 tcgttcttct cccacacata gcttggccgc tgaagcgggg gaacttccat tacaattacg 4860 ggtaattgag ttgttcgtcc gaaaatactg caagataacc gaaaaaagta gacgaacata 4920 tacgcactat cgaagagctg ttgatctcac tctctgcgaa gtcgtaggtg ttgagcttcc 4980 gaatattgct aaattgcaac gtgtcggccg ccgcccttgg aattcaccac gcattaagac 5040 tgatttgtct attaagcatg ccttcaagaa agggcagaat accagaattg ccagagttat 5100 taccgaagat catctcatct ctaaatacgc caactacatg aagattttca ctgacgggtc 5160 aaaaagtggt tccgaagtcg gtgttggagt tatcgggccc accttttttg tcgaaagaag 5220 tcttcwatca cagtgcagtg tctacacggc agaagctgcc gctttactaa ctgcagctca 5280 tcatgctccg agaactgaaa ccgtgatatt gtctgactct gcaagctgta tcactgctat 5340 cgaaaaaggt gaatcaactc acccattcct acaagagttt gaaaaaatat cagcgcataa 5400 aaatataaca gtttgctgga taccagccca ttctggtatt caaggaaacg agctagcaga 5460 caaagcggct gaacgtggtc ggtcgtctag gctcctttat agatcgatac ccgcttcaga 5520 cgtcatcttc tgggttaaac aacaacttcg cagaaatttt caggagcaat ggtccaacaa 5580 caaaacaact ttcctgcagt tttgtaaacc aacagttagt aaatggatag atagagagaa 5640 tcgactggaa cagaaaatgt taacaagact ccgcataggc cataccataa taactaaaaa 5700 acatttgtat gataaaatga gcccccgaaa atgtgatacc tgtaacatag atttaacagt 5760 tgagcatatt cttttgaatt gtagaaagtt tgagttgatt agggaagaac ttgccatgag 5820 taacaatatt gaaatagtct tgggaaattc caaggaaaac gaatcaacac taatcaagtt 5880 catcaaaaaa tgtaaactag tcgataaaat ttgaagttct tttaccccag aaataaaaga 5940 ggcgaatgaa ccataaggtt taaaacctct ataataaaaa aaaaaaataa 5990 // ID MARINER_CA repbase; DNA; INV; 1290 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Chymomyza amoena transposon mariner Camar1 transposase gene, DE consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINER_CA; KW mariner transposon Camar1; transposase. XX OS Chymomyza amoena OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Chymomyza; fuscimana group. XX RN [1] RA Robertson M.H., Lampe J.D., Witherspoon J.D. RA and Soto-Adames N.F.; RT "Recent horizontal transfer of mellifera subfamily mariner RT transposons into insect lineages representing four different RT orders."; RL Unpublished. XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Chymomyza amoena transposon mariner Camar1 transposase gene, RT consensus sequence."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [2] (Consensus) XX CC 99% average similarity to consensus between 6 copies of the CC sequence from Chymomyza amoena. XX SQ Sequence 1290 BP; 423 A; 217 C; 257 G; 393 T; 0 other; ttgggttgcc caaaaagtaa ttgcggattt ttcatatagt cggcgtttat aattttttta 60 acagcttgtg actatttaat tgtatttttt cttttgacat ttattagctg tgactattag 120 cttgctttag aaaaaaagtg cgcgtaattt tgtttacatt tgtttgtttg gcgccctttt 180 taatatggag tccacaaaag agcatttccg tcatatttta tattattatt tccgtaaagg 240 aaaaaacgca gagcaggttg ctaaaaagtt acgtgatgtg tatggtgata aagccttaaa 300 agaaagacag tgtcaaaatt ggtttcgcaa attccgttct ggagattttt cacttaaaga 360 tgagccacgt tcaggtcggc caaatgaagt tgatgatgac caaatcaaag cattaatcga 420 attggatcgt catgtaactg agcgtgagat aggagagaag ttaaatatac caaaatcaac 480 cgttcattat cacataaaaa gtcttggact ggtgaaaaag cttgatattt gggtaccaca 540 tgaattgaaa gaaattcatt taacaaaccg aatcaacgct tgtgatatgc atcttaaacg 600 caatgaattc gatccgtttt taaagcgaat cataactgga gatgaaaaat ggattgttta 660 caataacgtt aatcgaaaac gatcatggtc caagcatggt gaaccagctc aaaccacttc 720 aaaggctgat atccaccaaa agaaggttat gctgtctgtt tggtgggatt ggaagggtgt 780 ggtatatttt gagctgcttc caaggaacca aacgattaat tcggatgttt actgtcaaca 840 actggacaaa ttgaatgcag ccatcaacga gaaacggcca gaattgatca atcgtaaagg 900 tgtcatattc catcaggaca acgccagacc acacacatct ttgatgaccc ggcaaaaact 960 gagagagctt ggctgggaag ttttgatgca tccaccatat agccctgacc ttgcaccatc 1020 agactaccat ttatttcgat ctttgcagaa ctccttaaat ggtaaaactt tcggcaatga 1080 tgaggctata aaatcgcact tggttcagtt ttttgcagat aaaggccaga agttctatga 1140 gcgtggaata caaaatttgc cgggaagatg gcaaaaggtt atcgaacaaa atggaaatta 1200 tatatttgat taaagttcat tctaagtttt attaaaaatg catttacttt cttttaaaaa 1260 atccgcaatt actttttggg caacccaata 1290 // ID Copia-33_DPu-LTR repbase; DNA; INV; 290 BP. XX AC ACJG01006013; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-33_DPu_; KW Copia-33_DPu-I; Copia-33_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-290 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01006013; Positions 15455 15166. XX SQ Sequence 290 BP; 81 A; 67 C; 47 G; 95 T; 0 other; tgttgaagtt tatcaaaccc acacgctgtt gctgaaccct atctgtttgc tttaacaaat 60 acctctatcc ctctggtgtc caaactgtta atctgttgtc ggctgctact gccccgtatc 120 tcgcccactt cttgtcacat tgtgctgtat gacagaactc tcgttaattg tctttatcga 180 agacagaaaa ggtatcttac agttagaaga agaaggtacg tcaaatacag actgactagt 240 attaaatcat tgtctcaatt gttatgtcat ttcactaact aacatcaaca 290 // ID BEL-597_AA-I repbase; DNA; INV; 7047 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-597_AA_; KW BEL-597_AA-LTR; Pao_Bel_Ele56x; BEL-597_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7047 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [6010-6591] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 5521..6951 FT /product="BEL-597_AA-I_3p" FT /translation="MNNCRRPTQRNTSLFLSTQEIAAAKTLLVRLVQEESF FT PEEIKSLKKRFQVPPKSSLKLLRPFIDKDGIIRVGGRLRHSLEDYTTRHPA FT VLPQSHIFTRLIIESYHQQIFHGGHRATLAVIRQEFWPIHGKRAVTSVLRK FT CQRCFRFDPKPIQQPMGQLPSARVRPARPFLITGVDYCGPFYLKPPHRRAA FT PHKVYIAVFICFTTKAIHLELVMDLSSAGFISALRRFIGHHGCPKEMHSDN FT ATNFQGAKHELHDLYKTLHSKSGQTAIGTELSQQEISWHFIPPRAPNFGGL FT WEAAVRSSKTALKKEIGSHQLTHENFCTLLVQIAAALNSRPLSPLSDDPSD FT IDALTPAHFLIGTSMKALPDPDLTAIPSNRLTHYQQRQQMFQRYWQRWTQE FT YLTGLQQASKHVQPTPIRIGNIVVVREDNIPPLEWPLARIIEVHHGADGVI FT RVVTIQTSKGTYKRPVNRICPLPIEDKTDVED" FT CDS join(2953..3636,3640..5370) FT /product="BEL-597_AA-I_1p" FT /translation="MVEENQEEAIESSSAGSYSIGTKQGGFSSSVFLSTVV FT VSIRDQHGGKQLARALLDSGSQVNVMSERLCQALKLKRRTICVPITGVGQS FT ESTAKHAVSATVSSRITEFSVGLDFLVLQRVTAELPSTTVSVSHWKIPDKL FT QLADPQFNVSGRIDLLLGAEHFFRFLYEREMKRVVLGPELPVLVDSVFGWI FT VAGRDASLQQEKSVRCCTATAVGNLEELLERFWRVESCEQPAWSKEEQDCE FT EDFKHTHSRSEDGRYIVQLPKHVGFERMMGDSKVLALDRYRKLENRLERNQ FT SMKEQYHAFMKEYLEMGHMRRITEGELDADSNKADQPKAFYLPHHAVLKES FT STTTKVRVVFDASAQMDSGYSLNDILLKGPVIQDDLLNLLVRFRKHEVALV FT GDVEKMYRQVLQDVNDPLRLRIFFRFSKEEPINVYELLTVTYGLKPSSFLA FT TRALKQLAIDEGSMDSPAKSALEEDFYVDDYIGGAASVEDAVSLREELTSL FT MAKGGFPIRKWCSNRPEVLAGIPVDQLGTNLSITFDLQPDEKIKTLGITWE FT PKADQLRFLFSVDDNGGEWTRRRILSAIAKLFDPLGLIAPVVVTAKIIMQE FT LALLQTGWDSPVPLQLEQKWMHFHGRLSKLAEYSIPRFAFITDYVDVQLHC FT FADASEFAYGACLYVRSVDPLGSVRIELLSSKSRVAPLKRLTIPRLELCAA FT REAARLYRTVTMALAMDSVKSFFWSDSTVVLHWLKAPPNTWKTFIANRVST FT IQTITYGHLWQHVAGKENPADLVSRGMPVEDFLKNDLWKSGPVWLKEAPAS FT WLM" XX SQ Sequence 7047 BP; 1783 A; 1631 C; 1856 G; 1767 T; 10 other; tttggtgccg tgaccaggat cggcggttga gttggaagtg tcattggccg tgaggacaat 60 tgggtcattg ttggatggaa tttggacttt ggacggaacg gaaccatttt tgctaaagtt 120 ttgtgagcgg aaatatttca acaattgtga agtcggtctc tctcatcgat atcaccaaac 180 ggattcgcta gcaagactca cggtgggata cttgctgaaa atataattca aggcctaatt 240 gagktaggcc acaggtgagt ggcgctcctt ttccttaaaa tccattgagt ggtgctaccg 300 agattctctc tctcctttcc tgtgaggtac gcgatcgttg tatccacgtc ttgctacgac 360 ggctgcgagg tgcttagtga agttggttcg acaatcacca cgcacggkgg ataaacaaca 420 ttaccgttgc ggactggatc gtcgtatgcg aagtaaggcc agcgcgcgat agtgtctgsa 480 atagatagag ccgagtttgg attgatckgt gatcgattgc tgtgcataca ttgcaacgtc 540 ggttggattc ggttggactc gttgttggag cacgtgactg gtcggattac tagtactact 600 agggactacg aggaacggta tacatccatg ccgttgttta catctagatc ctgaagtaca 660 caacatacaa ggcctattgt cgacaggcct caggtgagtg cctgtccaag tatataccct 720 tatcgttgtg gtgcttctgg ttctttttca ccctttcatg tgcttgctgg gacgttcgtc 780 aattgagctg tcggtgtatt ggattgttta cgtctggctc gatcttggac gagattgcgc 840 tgcgctgacc agtggtagga gcaacagaag gcaactatat ctatacctac aagcagcaaa 900 ggtttggcaa tcacgagacg gggtaagcga atattgagca ttctttggat tcgtcatacg 960 gggtgcgttt agtggtttac ttcaaattgc acatacattt ttgggcggtg tgaggcgaaa 1020 ttatttgcat ttaatcaagg cctgttgtag acaggctaca ggttagtagc tgtccaacct 1080 ttatgtactg gtttggtggt gctgcgtaag atttctttct ctttcattgg gctgttggaa 1140 gcggtaacgg atcgatcgac ctagataagt ttgactggga tcaatcgcaa ggatcagcta 1200 caaatactat tgggtcacgg gggccggagt ttggacgatt tcattgttta tacaaggcct 1260 gaattggaca ggccacaggt gagtggctgt ccattgattg tacaaccata tcggcggtac 1320 ttcctggttc tttttggttg gcctattatt ctcggtccct ggatttttcc acgtttggct 1380 gatctgatag tgtgggtatg gctggaggat taaggcggcc cgccggcgcg gccgcgccgg 1440 ccgccgggcg gccgcgcgcc gcacaaaata gcgcgcgcgg cggccccgcg gcgggggcaa 1500 gacccaacgc gccgcgcccc gcctcgaggg cctttttgcg gctctgcgtc gacatgttca 1560 gtttctggat agttataatc cagaagcgcg cgggcggcag tctaggctgg acaaattaga 1620 ggcaacatgg gaggatttga tgagttgcgg agcaaatttg tgagctagac gcagaaggca 1680 actggaacag gagacgaaca acgcgtacgc tgtatttgag cggcagtatt acgaaatcgg 1740 gctgctttac tggcaaaatt ggccctgctc aacagtgcca aatttggata attccattcg 1800 gaacaccagt gcgttgggaa tgcacacagg ggttcgcctt ccacaaattt cgttgcctga 1860 gtttgatggg gattacaaag ggtggctctc cttcaagtca acctttgtat cgttgattca 1920 tgattctcat gagcttagtg atgtgcagaa attccattac ttgaaatctg cattgaaagg 1980 ggaggcagcc aaattgatcg agtcactacg atcacaacga taattacgtc attgcatggg 2040 agaccattac gaagcgttat tccaacgaat atctgctgaa aaagcgtcac cttcaggcac 2100 ttatggagtt tccaaagatt gagaaggaat cggcggtaag gatacacgga ttggtggacg 2160 agttcgatca acggttgaag atcttgaagc agctgggcga gaaaacggag cattggggag 2220 cgatgatcgt ccactggatg tgctcgaagc tggacacaca cacgttgcag ctctgggagg 2280 accacgctgc ctccttgcgg gatccaactt ttgcatcgtt ggtggacttt ttggagaaac 2340 gaacgagggt attggacgct gtacaatcga actgtaggcg ggcccgggcc cccgcgggcg 2400 ggccccccgg cccccggccc cgcgggcgcc gcccgcccgc gggggccggg cccgccgcgg 2460 ccccgcgcgc ccgcgcgcgg ccgcctaagg ccccccggcc gccgcccgcc gggggggcgc 2520 ggggggggcc ggcgcgcggc cgcaaatcag cgcgggccgg cccgtgtagc accgaaagtt 2580 acagtgaaac gggagcggtt cgctgtacat acgtccgtgg aaagtggaag gaatgatccc 2640 gtatgccaat gttgcggcga tcagcattat ctggtacggt gctccaagtt tcagaatttt 2700 gggctaaagg agaaactgga tttcgtgaat agcaaacggt tatgcagcaa ctgcttcaag 2760 tctggtcatt gggtacgaga ttgtaattcc aaattcaact gcagaacgtg tggcaaaaag 2820 cacaactccc tcattcatcc tggctttcaa gcaaacggtg gaggtagcag cagcggcaaa 2880 gataatgatt tcaatagttc ggcaaacggt gcaaaaagac aggatgcagc tagatcgcat 2940 gtcgcagcag agatggtgga ggaaaatcaa gaagaagcaa tagaaagttc ttctgctgga 3000 tcctacagca ttggaacgaa gcagggaggt ttcagttcca gtgtattctt gtctacagtt 3060 gtagtatcga tacgcgatca gcatggtggc aagcagctag ctcgtgccct gctggatagc 3120 ggctcacagg tgaacgtgat gagcgagcgg ctctgtcaag cattaaaact gaagcggaga 3180 acaatatgtg tccccataac gggagttggg cagtcggaat cgacagcgaa gcatgcagtt 3240 agcgcaacgg ttagttctcg aataacggag ttttcggtcg gattggattt cttggttctg 3300 caacgtgtga ctgctgagct gccctcaaca acggtatcag tttcccactg gaaaatccca 3360 gataagcttc agctagcaga tccgcaattc aatgtcagcg gtcgcatcga tctgctgctt 3420 ggggctgaac atttcttccg tttcttatac gagagggaga tgaagcgagt tgttttgggg 3480 ccggaattac cagtactcgt ggattcsgta ttcggatgga tcgtagccgg cagagacgct 3540 agtctacagc aagagaaatc ggttcgttgt tgtacggcaa ctgccgtcgg aaacctcgag 3600 gaattactcg aaaggttttg gagagtggag agctgtgamg agcaaccggc ttggtccaag 3660 gaagaacagg actgtgagga ggatttcaag catacgcatt caagatcgga ggatgggcga 3720 tacattgttc agttgccaaa gcatgtcggc ttcgagcgaa tgatggggga ttcaaaggta 3780 ttggcactgg ataggtatcg gaaattagag aaccgattag aaaggaacca gagcatgaag 3840 gaacagtatc atgctttcat gaaggagtat ttagagatgg gacacatgag aagaattacg 3900 gagggagaac tggatgctga ttcaaacaag gcagatcaac cgaaggcatt ctacttgccg 3960 caccacgcgg tattgaagga atcgagcacc acaaccaagg tgcgcgtcgt ttttgatgcg 4020 tcggcgcaaa tggatagcgg atactcactc aacgatattc ttttaaaagg acccgttatt 4080 caggacgatc ttctgaatct actagttcga tttcgtaaac acgaagtggc gcttgtagga 4140 gatgtggaga aaatgtacag gcaggttctt caggatgtca atgaccctct tcgtctccga 4200 atatttttcc gcttctccaa ggaggaacca atcaatgtct acgaactgct aacggtgacg 4260 tacgggctga aaccatcatc atttttagca acacgcgctc tcaaacagct tgctatcgac 4320 gaaggatcca tggattctcc agcaaaaagc gctttagaag aagattttta cgtkgacgac 4380 tatattggag gagcagctag cgttgaggat gccgtcagtc ttcgtgaaga gttaacgtcg 4440 ttgatggcaa agggaggatt tcccatacgg aaatggtgtt ctaatcgccc cgaagtattg 4500 gctggtatcc ccgtggatca gttgggaact aatctttcca ttacatttga tttacaaccc 4560 gatgaaaaaa tcaaaacctt ggggattact tgggagccga aagccgatca actacggttc 4620 ctattcagtg tggatgataa tggaggagaa tggacaagaa ggagaatttt gtcagcgata 4680 gccaaactgt ttgatcccct tggattgatc gctcctgtag tagtgacagc aaaaattatt 4740 atgcaggagt tggctctact tcaaaccgga tgggattctc cggtgccttt gcagttagag 4800 cagaaatgga tgcacttcca tggccgtttg tctaaactgg ctgaatacag catccccaga 4860 tttgcgttta twacggatta cgtcgacgta cagctacatt gtttcgccga cgcttcagaa 4920 tttgcgtacg gagcgtgttt atacgttcgc tctgtagatc cactcggcag cgtacgcata 4980 gagctgctct cctctaaatc tcgggttgct cctctgaaac ggctgacaat cccaagatta 5040 gaactttgcg cagctagaga agctgcacgt ctttaccgaa cagtcaccat ggcccttgcg 5100 atggacagcg tcaagtcatt cttctggtcg gactcaaccg ttgtcttgca ttggttgaaa 5160 gctccgccaa atacgtggaa aacctttatc gcaaatcgcg tttccaccat tcaaactatt 5220 acgtacgggc atctatggca acacgtagcg ggaaaagaga accccgctga tcttgtttct 5280 cggggaatgc ctgttgagga tttcctaaag aatgatctat ggaaaagtgg acctgtctgg 5340 ttgaaggagg cacctgcttc atggctgatg sagtactctg aatctcttcc acctgaagtg 5400 gatttggaac ctcgtatcgt cgttcaagct aacactgcat cactggagcc aaacttcatc 5460 ttttccctac gatcttcgct acttcckctc gtcagagtag tggctcgctg cttgcgattt 5520 atgaacaact gccggcgtcc gacacaacga aatacctcat tatttctttc aacacaagaa 5580 atcgctgccg caaaaactct tctagtgcgt ttggttcaag aagagtcgtt cccagaagaa 5640 attaaatcgt tgaagaagag gtttcaagtt ccacccaaat catcattgaa acttcttcgg 5700 ccctttattg acaaagatgg aatcattagg gttggcggcc gtcttcgtca ctcccttgag 5760 gattacacga ccagacatcc cgcagtactt ccgcaatctc acatcttcac tcgcctgatt 5820 attgaaagtt atcaccaaca aatttttcat ggaggacatc gagcaacact agcggtaata 5880 aggcaggagt tctggccaat tcatggaaaa agagctgtaa cctctgtttt acgaaaatgc 5940 cagcgctgtt tccgtttcga tcccaagcct attcagcaac cgatgggaca gcttccgtca 6000 gcccgagtgc gcccagctcg gccgtttttg ataactggtg ttgactactg cgggcccttc 6060 tatttaaagc ccccgcatcg acgagcagct ccccataaag tgtacatagc cgtgtttatt 6120 tgcttcacaa caaaagccat ccatcttgag cttgttatgg acctatcttc ggctggattc 6180 atctcagcgc ttcgcagatt catcggccac cacggatgtc cgaaagaaat gcattccgat 6240 aatgcgacaa actttcaagg tgccaagcat gagttgcatg atctctacaa aactttgcac 6300 agcaagtcgg gtcagaccgc tattgggact gaactgtcgc agcaagaaat ttcgtggcac 6360 ttcatacccc ctcgagctcc caattttggc ggattgtggg aagcagctgt ccgctcctca 6420 aaaactgctc tgaaaaagga aattggctca caccagttga cccacgaaaa tttttgcact 6480 ctattggtgc agattgccgc cgccctcaac tcaagacccc tatcgccatt atcagatgac 6540 ccctctgata tagatgccct caccccagcc cattttttga tcgggacttc catgaaagcc 6600 cttcctgatc ctgatctgac agccataccc tcgaaccgcc taacccacta tcagcaacga 6660 caacagatgt ttcaacgcta ttggcaacgc tggacccagg agtatctcac gggacttcaa 6720 caagcgagta agcatgttca acccactcct atccgcatcg ggaacatcgt ggtagttcga 6780 gaagacaaca tacctcccct cgaatggcca ttagcgagga taatcgaagt tcatcatggc 6840 gcagatggag tcataagagt tgtcaccata caaacgtcca agggaacgta caagcgtcct 6900 gtgaaccgaa tttgtccatt acccatagaa gacaaaacag acgttgaaga ttagctggta 6960 atgttagtat tgttaaaaat gttagtaata atgagttgaa acaagtttca agggggccgg 7020 taatgttagc tgttagaagt ttagtat 7047 // ID SMAR1 repbase; DNA; INV; 1369 BP. XX AC . XX DT 22-SEP-2007 (Rel. 12.09, Created) DT 22-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR1. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1369 RA Jurka J.; RT "SMAR1: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 990-990 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 254..1303 FT /product="SMAR1_1p" FT /translation="MNFAREDIRKLIYYNWKRGLVPSQIANEINLTLGKDT FT TNERTCRRWVANFKEGNFDVEDKERCGRPSLDINDKILALLEEHNSTVEMS FT VELGVCQQTIRNHLLAMGMRYLCNHWVPHKLSEQQMANREKICNELLCKYA FT ANDFLSQLITMDEVWIYWSNDGTYHHRSWRGAGDAPDVEVRRTLSPRKHLM FT SVFWDCKGVIMMDILPQGTTITADVYCEQLSRLVTAIQQKRRRLLGGGFHQ FT IHYLHDNARPHTAAKSVQKLTDIGFTVLPHPPYSPDLAPSDFYLFSPLKSA FT IRGRDFNGADEIQVVLDAWLDSKPRSFFADGIRKLPDRWRRCVLHNGAYFE FT HLSDTDD" XX SQ Sequence 1369 BP; 377 A; 293 C; 297 G; 402 T; 0 other; tacaggataa accaataagt tttgtccggt gctcatactc agctttccaa tgctgtaatg 60 agtgtatagg tctgcaactt tgtgtgtcta tgctatatat gttgtagatt tgtaaaattt 120 taaaaaaatc acttattaat aatttttaac attgctatta actttgatat gtcttacatg 180 gattaagcac ttttaataaa tttcttgttt gactgaaata ctttttgttt tcattagaat 240 acgtcgattt gaaatgaatt tcgcacgaga ggatatccgc aagttaatct actacaattg 300 gaaacgtgga ttagtgcctt cccagattgc aaatgaaata aatttgacac tggggaagga 360 cacaacaaat gagcgcactt gtcgacgctg ggttgcaaac ttcaaggagg gaaactttga 420 tgttgaagac aaagagcgat gtgggaggcc ctctcttgat atcaacgaca aaattctagc 480 tcttttggag gagcacaaca gtactgtgga aatgagcgtg gagcttggag tttgtcagca 540 gacaattcga aatcatctgc ttgcaatggg catgcgttac ctgtgcaacc actgggtccc 600 acacaagttg tccgagcaac aaatggccaa ccgtgagaaa atatgcaacg agctgctgtg 660 caagtatgct gctaatgact tcctcagtca gctcattaca atggacgaag tctggatcta 720 ttggagcaac gacggaacct atcaccaccg ttcgtggaga ggcgcaggtg atgcccctga 780 tgtggaagtt cgccgcacac tctcacccag gaaacatctc atgtccgtct tttgggactg 840 taaaggtgtg attatgatgg atattctgcc gcagggtact acaatcactg ccgatgttta 900 ctgcgaacag ctgtcccgtc tggtgacggc aatacaacaa aaacgacgcc gtcttcttgg 960 tggtgggttt catcaaattc actatctcca cgacaatgca cgaccccata cggcagccaa 1020 atctgtgcaa aaacttactg acattggttt tactgttctg ccacatcccc cttactcccc 1080 agacctcgca ccatccgatt tttatttgtt ttcccctttg aaatctgcga ttcgtggaag 1140 agatttcaac ggagctgatg aaattcaagt ggttctggac gcttggttgg acagtaaacc 1200 acgcagcttc ttcgccgatg gaatcaggaa actgcctgat cgatggcgcc gatgtgtcct 1260 tcataatggt gcatattttg aacatctctc tgacacagat gattgacatt tttgctgttt 1320 ttaaatttac acatgtttca ccggacaaaa cttattggtt tatcctgta 1369 // ID BEL-87_AA-I repbase; DNA; INV; 6264 BP. XX AC supercont1.41; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-87_AA_; KW BEL-87_AA-LTR; BEL-87_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6264 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.41; Positions 2613185 2619448. XX CC Positions [5299-5856] - Integrase core CC 'TCTAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 34..6264 FT /product="BEL-87_AA-I_1p" FT /translation="MDGDHDTTAFNCKTCNLPDSVDAQMVACDKCHLWEHF FT TCAGVDETVKDRQYVCKDCIAKVSAEKSKRTKQLKVDNRSTKTGGRSTSRK FT GSKNPSIPASQSSSARAAALEAQMKVIEEEKAMKEQELKEQEELKKWELEE FT EQRQIEEKKQLMEEERKLRQRKLEEQRALLAKQQMIRRESMEKKNELIKLM FT SECGSVNSTSSMPDSRDKVKSWLADQKSNGSSMGKMVKERESNNGLHPTGF FT HDPLPYQLAPPLHRAAQLATQSVLEKKRGPIKLQQARSEIVVQPFHQYVSS FT SQPRPQQSMKENFTMAQQLHQSQSEDGLPLQRIQTLVHQHQPLPNRQLHQT FT TGRPQLCQPLPQQQYDNCFRMDATQSQRVFEQPIQHHAEIPVPNRQAYPPE FT FHGVANPQVDWQTNPRGQMFVPPQVPIQQTTMVAEPTVLQSQQIAARQVVG FT KQLPRFDGNPMDWPMFISSYEQSTAACGYTNAENLIRLQQCLSVHAKESVC FT SKLLLPESVPHAIETLRIRYGRPELLLKSLLEKVRRTPGPRQDKLETVIEF FT GLAVGNFVDHLRAAQLQQHLSNPMMMQELVDKLPGSMKMEWASFKSQHPFA FT DLETFGLFMEKQLTAASAVSFELPLHDKPYKSEKHKMRENAGIHAHSQGDR FT QGDKQPNLTEQHRKPTKVCGVCTREGHRAADCPQFKELNADERWKTVQKKG FT LCRTCLNSHGKWPCKSWQGCAVEGCRLKHHTLLHSPSPTHHVNCSSSHSSS FT PKALFRILPVILYNEGRSVSVFAFIDEVSEITLLEDNVAKQLGVKGPVKAL FT TLQWTGNVKRNESKSQEVSLGISGKSGSGKFDLRNARTVSCLFLPSQRLDY FT GELAQRFPHLRGLPMDSYETIQPKLLIGLDNLRLGVPLKLREGSSYDPIAA FT KCRLGWTVYGCTGERSSNAAIVNFHTTESSTADHLLNAQLRDFFTIENDGI FT TNRSENLVSEEDKRAKMILEKTTRRTGDSFETGLLWRTDNPEFPDSYPMAA FT RRLVGLEKKFAKEPWLGDQVRQQIHDYEKKGYAHKATQAELDSVNRSRLWF FT LPLGVVQHPRKQKVRLIWDAKATVGSVSLNSKLLKGPDLLTPLVAVLSSFR FT QFPVAVCGDIREMFHQIKIREQDRLSQCFLWRDRPSDEIQIFVMDVATFGS FT TCSPVSAQFVKNLNAEELSIQYPRAATAITRHHYVDDYLDSFKTVQEAVEV FT VKEVKLVHSRGGFELRNFFSNSGEVLQGIGEVSGDMTKDLALQRGENMESV FT LGMKWIPSVDCFTYTVNINAGLQLVLEESHIPTKREILKVVMSLFDPLGLI FT SFFLIHGKVLIQGVWASGTGWDDRINENLFKQWRQWVNCFDQLDKLRIPRC FT YFSTPYVSTENRLEAHVFVDASETAYCCVVYFRLIKADCVEVALIGSKSKV FT APLKTLSIPRLELKAAVLGAQYLQFVLNNHEFPVHQRYLWTDSTTVLAWIL FT SDHRRFQKFVAVRVGEILTLSDPQEWRWVPTKINVADRATKWGKGPELQAE FT SSWFCGPAFLHQGEEHWPERRRNISTQEELRPVHTHWSAVSLIDATRFSRY FT EKMLRTAAYVLRFIDKLLRKRSDESLPLGLLNKNDLKRAEETLWKIAQLES FT FPEEYTILSSTQGSPESRHACVSKSSPIFKAWPYLDDRGILRMRSRIGAAV FT FAPFEAKYPAVLPRQHPITKLIVDWYHRRFRHANRETVVNEVRQRFEIAKL FT RSLVEKVSRECALCRVKKAYPKPPPMAPLPPARLQAFVRPFTYVGVDYFGP FT VFVRVGRSQVKRWVAVFTCLTIRAVHIELVYSLSTESCIMAVRRFVARRGP FT PAEIYSDNGTCFQGASRDLEGQRKINEALATTFTSAQTKWLFIPPAAPHMG FT GAWERLVRSVKVAIGAVADAPRKPDDETLETILVEAEFTINSRPLTYIPLE FT SADQESLTPNHFLLGNSSGLKILPSKPIPQRGALRSNWKLAQSICDDFWRR FT WVKEYAPVIARRTKWFAETRDLKVGDLVLMVGGTARNQWVRGRIEKAISGR FT DGRVRQALVRTTSGVFRRPTVKLAVLDIVESSEPERVQGALGHHAGSRGGV FT " XX SQ Sequence 6264 BP; 1741 A; 1456 C; 1623 G; 1444 T; 0 other; ttctttaaga atttttgtac gtggatagac gcaatggatg gagatcacga tacgactgcg 60 ttcaactgca aaacctgcaa tctaccggac tcggttgatg cgcagatggt cgcctgtgac 120 aagtgccact tgtgggaaca cttcacgtgc gcgggcgtag acgagacagt gaaggatagg 180 cagtatgtat gcaaagattg catagccaag gtgagtgccg agaaatctaa acgtacgaag 240 cagcttaagg tggacaatcg gtccaccaaa accggtggaa gatcgacttc gagaaagggc 300 tcgaagaatc cgtctattcc ggcgagtcaa tcgtcaagtg ctcgtgctgc ggctctggaa 360 gcccaaatga aggtcatcga ggaagaaaag gcgatgaagg agcaggagct gaaagaacaa 420 gaagagctaa aaaaatggga actcgaggaa gaacagcggc agattgaaga gaagaagcag 480 ctcatggaag aagagcgaaa gctccgccag cgaaagcttg aagaacaacg ggcgttgttg 540 gcgaaacagc aaatgattcg cagagaatct atggagaaaa aaaacgagct gatcaaattg 600 atgtcggaat gtggaagtgt caacagtaca agctcgatgc cagattcgcg ggacaaggta 660 aaaagttggc tggcggatca aaagtccaac ggaagttcga tgggaaagat ggtgaaggag 720 cgagaaagca ataatggcct tcatcctact ggtttccacg acccgttacc ataccagctg 780 gcacctccgc tacatagggc tgctcaactc gccacgcaat cggtccttga gaagaagaga 840 ggacctatca aacttcaaca agctcgttct gaaattgttg tgcagccatt tcaccagtac 900 gtttcgagtt ctcagcctcg tccccagcag agcatgaagg agaacttcac tatggcgcag 960 caactgcatc aatcccaatc ggaggatggg ttaccgttac agcgtatcca aacgttagtt 1020 catcaacatc aaccgctgcc aaatcggcaa ttgcatcaaa caaccggtcg gccgcagtta 1080 tgccagcccc ttccacaaca acagtacgat aattgttttc ggatggatgc aactcaatct 1140 caaagagtgt tcgagcagcc aattcagcac cacgctgaga tccctgtgcc aaatcgtcaa 1200 gcttatccac cagaatttca tggtgtcgcc aatcctcaag tagattggca gaccaaccct 1260 cgagggcaga tgtttgtacc accgcaagtt ccgattcagc agacaacaat ggttgcggag 1320 ccgaccgttc ttcaatcgca gcagattgca gcccggcagg tagttgggaa acagttgccc 1380 aggtttgacg gaaacccgat ggactggcca atgttcatta gtagctatga acagtctacg 1440 gctgcttgtg gttacacgaa tgcagaaaat ttgattcgtc tacagcagtg tctctcggta 1500 cacgcaaaag agtcagtttg cagcaaactg ctactgccag agagtgtacc tcacgccatc 1560 gagacactta gaattcgcta cggtcgtccg gaactgctac tcaaatccct gttggaaaag 1620 gttcgccgca caccgggtcc tagacaagac aaactagaaa cggtgatcga gtttgggttg 1680 gcagtaggga acttcgtcga tcatctgcgg gcggcgcaac tgcaacaaca tctctcaaat 1740 ccgatgatga tgcaggaact ggttgataaa cttccaggct ctatgaaaat ggagtgggct 1800 tccttcaaga gtcaacatcc gtttgcggac ctggaaactt ttgggctttt catggagaag 1860 caactaacag cagcgagcgc agtaagcttc gagcttccac tgcatgataa gccgtataaa 1920 agcgagaaac ataaaatgcg ggagaatgca ggtatacatg cacattccca aggtgaccgc 1980 caaggagata aacagccgaa tctgacagag caacatcgga aaccgacaaa agtgtgtggc 2040 gtctgtactc gcgagggtca cagagctgcc gattgccctc agttcaagga gttgaacgcc 2100 gacgagcgtt ggaagactgt ccaaaagaag ggtttgtgca ggacatgctt gaacagccat 2160 ggcaaatggc cgtgcaaatc atggcaaggt tgcgcagtgg aaggatgccg cctaaagcat 2220 cacactcttc tgcattctcc ttcaccgaca catcatgtca actgctcatc cagtcactcg 2280 tcttctccga aagctctctt ccgtatacta cccgtcattt tatacaacga aggaagaagc 2340 gtttccgtgt tcgcctttat cgacgaagtg tcggaaataa cactactcga agacaacgtc 2400 gcaaagcagc taggagtgaa aggtcccgta aaggcgttaa ctcttcagtg gacgggcaac 2460 gtgaagcgga atgaatccaa gtcccaggaa gttagtttgg gaatatcagg aaaaagtggt 2520 tcagggaagt tcgatcttcg aaatgccagg accgtaagct gtttgttcct gcccagccag 2580 cgacttgact acggagaatt ggcacagcgt ttccctcatc tcagagggct tccaatggat 2640 agctatgaaa ccatccaacc aaaacttcta ataggattgg ataatcttcg cttaggagta 2700 cctttgaaac tccgtgaagg tagttcgtat gacccaatag cagcgaagtg tcgactgggt 2760 tggaccgtat atggatgtac cggtgaaagg agcagtaatg cagcgatcgt aaactttcac 2820 actaccgaat cctccactgc agatcatcta ctgaatgcgc agttgcgtga ttttttcacg 2880 atagagaacg acggaatcac caatcgcagc gaaaatctag tttctgagga ggacaaacga 2940 gcgaagatga tattagaaaa gacaacacga cgaactggcg atagttttga gacaggcctc 3000 ctatggcgga cggacaatcc agagtttcct gatagttacc caatggctgc tagacgactc 3060 gtaggtttgg agaagaagtt cgcgaaggaa ccatggctcg gagatcaagt gcgacagcaa 3120 attcacgact acgagaaaaa aggctacgca cataaagcaa ctcaagcaga actggactcc 3180 gttaaccgca gccgtctatg gtttctccct ttgggagttg tacaacatcc caggaagcaa 3240 aaggtacgac taatttggga tgctaaagca acagtgggtt ccgtctcgtt gaattccaag 3300 ctactgaagg ggcctgactt actgactccg ttggtcgccg tgcttagctc cttccgtcaa 3360 tttccggtgg cggtgtgtgg agacatccga gagatgttcc accaaatcaa aatcagggaa 3420 caagaccgtc tctcccagtg ttttctgtgg cgggataggc catccgatga aatccaaata 3480 tttgtgatgg acgttgcgac atttggatcc acatgttcgc ccgtatcagc tcagttcgtc 3540 aaaaatctca atgccgaaga gctgtccatt cagtacccac gggccgcgac tgcgattacg 3600 aggcatcact atgtggacga ttacctggac agcttcaaga cagtccagga agctgtagag 3660 gtcgtgaaag aggtgaaact ggtccactct agaggcggat ttgaacttag gaacttcttc 3720 tctaattccg gagaagtact tcaaggtatc ggcgaagtat cgggtgatat gacgaaagat 3780 ctggcgttgc agagaggtga aaatatggaa tcagtattag ggatgaagtg gataccaagc 3840 gtggattgtt tcacttacac ggtgaacata aatgcaggtc tgcaacttgt cctcgaggaa 3900 agccatattc ctacgaagcg ggagatactt aaggtggtaa tgagcctctt tgacccgctg 3960 ggccttattt cattcttcct catacacgga aaggttctca tccaaggagt ttgggcttcc 4020 ggcacaggct gggacgaccg gatcaatgaa aatctcttta aacaatggcg acagtgggta 4080 aactgttttg atcagctgga taaactaagg atacctcgat gttacttctc tacgccgtat 4140 gtcagtaccg aaaatcgatt ggaagcccat gtcttcgttg atgccagcga aactgcatac 4200 tgctgcgttg tatacttccg tctgataaag gcagattgtg ttgaggtagc cctaatcggc 4260 tctaaaagca aagtagctcc tctgaaaact ctttcgattc cacggttgga actaaaagct 4320 gcagttctag gtgctcaata tcttcaattc gttctgaaca atcatgaatt tccagtgcat 4380 cagagatatt tgtggaccga ttcgactacg gtactagcgt ggatcttgtc agaccatcga 4440 cgattccaga agtttgttgc tgtacgagtg ggggaaattt tgaccttgag cgaccctcaa 4500 gaatggagat gggtcccgac aaaaatcaat gttgcagaca gagccacaaa gtggggcaaa 4560 ggccccgaat tacaagccga atcttcatgg ttctgtggcc cagcgttcct gcatcaaggt 4620 gaagagcatt ggcctgaacg tcggcggaac atttctactc aggaggagct tcgacccgtt 4680 catacgcatt ggtcagcagt ttctttgatc gatgcgacaa gattcagtag atacgagaag 4740 atgctacgaa cagcggctta cgtcctacgt ttcattgaca aactacttcg caagcgttct 4800 gatgaaagtc taccgttagg tttactgaat aaaaacgact taaagcgggc tgaggaaacc 4860 ttgtggaaaa ttgcccagtt ggaaagcttt ccagaggagt acacgattct ttcttctact 4920 caaggatcac cggaatcgcg acatgcttgc gtttccaagt ccagtccaat tttcaaagca 4980 tggccatatc tcgacgaccg agggatactt cggatgcgta gccgaattgg cgctgcagta 5040 ttcgcaccat ttgaagccaa atatcccgct gtattacccc gccaacatcc aattaccaag 5100 ctaatcgtcg attggtatca tcgtcgtttt cggcatgcta acagggaaac cgttgtgaac 5160 gaggtaaggc agaggtttga aattgcaaag ctccgaagtc tcgttgagaa agtgtccaga 5220 gaatgtgcct tgtgtcgtgt gaagaaagcg tacccaaagc caccgcctat ggctccactc 5280 ccaccggctc gtcttcaagc gtttgttcga ccatttacat atgttggagt cgattacttt 5340 ggtccagttt ttgtcagggt tggaaggagc caagtgaaac gctgggttgc agtattcacc 5400 tgtctcacaa ttcgtgcggt gcatatcgag ctggtgtata gtctatccac cgaatcgtgc 5460 attatggcgg ttcgcagatt tgttgctcgg cgtggtcctc cagctgaaat ctacagtgac 5520 aatgggacgt gtttccaggg agcaagtaga gacttagagg gacagcgcaa aattaacgaa 5580 gccttagcta cgacctttac gagtgcccag acaaagtggc tgttcattcc cccagccgcg 5640 ccccacatgg gtggtgcctg ggaacgactt gtacgatcgg taaaggttgc cattggagca 5700 gttgcagatg cccctcgaaa accggacgat gaaaccctgg aaactatctt ggtcgaggca 5760 gagttcacca taaattcaag acctctcacc tatattcctc ttgagtcggc cgatcaagaa 5820 tcactcacac ccaaccactt tttgttgggt aattcgagcg gtttgaagat cctgccatcc 5880 aagccaatac cacagcgagg ggctctacga agtaattgga agctggcgca atcgatttgc 5940 gacgacttct ggcgcagatg ggtgaaggag tatgccccag ttatcgctcg tcgcaccaaa 6000 tggtttgcgg aaacacgaga cctgaaagtt ggggatctgg tattgatggt aggtggaact 6060 gcacgtaacc agtgggtccg tggacgcatt gaaaaggcta tttccggaag agatggacga 6120 gttcggcaag cgctggtgcg cactacatcc ggagtcttca ggagacctac ggttaagtta 6180 gcggttctgg acatcgtgga gagcagtgaa cctgaacgtg ttcaaggagc cctaggacat 6240 catgcaggtt ctcggggtgg ggta 6264 // ID Copia-8_SI-I repbase; DNA; INV; 4076 BP. XX AC AEAQ01014066; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_SI_; KW Copia-8_SI-LTR; Copia-8_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4076 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01014066; Positions 482 4557. XX CC Positions [1561-2064] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 58..3270 FT /product="Copia-8_SI-I_1p" FT /translation="MEATHLKIEKLRDKDNWHQWRFVIRTLLEEDQDVLDV FT CEGRLVRPDANDQDSASKLKKFEKADKTARKLIVTTVERKPLDLLLNCTTA FT KEMWTKLSSVYDLKSDENLSLVQKQFFELKWDCNESVAHNLSKLEQLATKM FT KSLGGEIPNSMILTRALSVLPQKFNHFHSAWDSKSDTKKTFKNLTMRLMNE FT EVRLQKQENSEETTIALFSKSGSRNTSSKNNKSKQKEQSTKEKGVVCFSCG FT QRGHVKKDCTGCYSCGLKGHIKKNCPKQSKKTGTREEANSAEQDGGTTRQA FT FLGSSDAAVNDFWVIDSGASDHMTRRRDWFSFYEEYEEPLMVRVGNGDGIP FT AYGRGNIEIETLVNGQWISGVIQDVLYVPCLKRNLFSVKVAAKKGVNFSLT FT NHGSRFELTRDKKIIATGSDNGDLYKIDLRVLIPKECNIVSKFSKIDTVKS FT KVDTLQLWHERLCHQDKRHVKTLLKNMGINVLDVDDFCDGCAYGKHHRLSF FT HNRVERATRVREIIHTDVCGPMQQESLGGKRYSLIFKDEFSGFRKIYFLRE FT KSEVFEKLKMFCKELENQFGEIVREIHSDGGKEYINKNVNSFLSSKGIKHS FT VNVPYTPQQNGVAERDNRIIVEAARSMIYAKPDLPLSLWAEAMNTAAYVIN FT RTGPTKQSGRTPYELWFGKTANISNFKIFGTECFVHIPNAKRRKLDKKAEK FT GFVVGYIEDCKGYRVYIPSLKDIVLSRDVLFKKEVVVPNQIEIKSGGEQND FT TYDSGILYDKDLQKNEKEKETQCDCKTKPVQSSSNDNSNVRQLRDRNKIKK FT PELYGSPVSFFTEMLPENYSEAINSNDKCNWEVAMHDEINSLLENETWMLV FT EKPETKKILKSRWVYTVKSNPDGSKRHKARLVIKGCFQTEGIDYKETFSPV FT VRFDTVRTLLSVAAHDGLNLAQFDIKTAFLYGSLKEDIYMYQPEGFNNGTA FT RVCKLLKSLYGLKQAPRCWIEHFTGFLEFFGFSRSIADPCFYIYKSEIDKM FT LLAVYVDDGLLAATNKLLIERFFLELRKHFKITETNNVTSFLGVEIVKLPD FT GSTLLTKENLLEKC" XX SQ Sequence 4076 BP; 1403 A; 592 C; 937 G; 1144 T; 0 other; ggttatgggc ccagtaccac gctattgaga agtgaacaaa aaagtattga gtgaaaaatg 60 gaagcgacgc acttgaaaat tgagaagctt cgtgacaagg ataactggca tcaatggcgg 120 tttgtgattc gtactctttt agaggaagat caagatgttc tggatgtctg tgagggcagg 180 ctcgttcgcc ctgacgccaa tgatcaagat tcagcaagca agctgaagaa attcgagaag 240 gcagacaaaa ccgctcgaaa attgattgtc acgacagtcg agaggaagcc gttggatttg 300 ctgttgaact gcactacagc aaaggaaatg tggacgaagt tatcttcggt atatgattta 360 aaatcagatg aaaatttgtc gcttgttcaa aaacaatttt ttgagttaaa gtgggactgc 420 aatgaaagcg ttgcgcataa tttatcgaag cttgagcagt tggcaacgaa aatgaagagc 480 ctcggtggtg aaattccaaa ctcgatgatt ttgacacgtg cgctttcggt tcttccacag 540 aagtttaatc actttcatag tgcgtgggat tccaaaagtg atacgaagaa gacctttaaa 600 aatttgacga tgcgattaat gaacgaggag gttcggctgc aaaagcaaga aaattccgag 660 gagacgacga ttgctttatt ctcaaagtca ggatcgcgaa acacgtcctc gaagaacaac 720 aagtcgaaac aaaaggaaca atcgacaaaa gaaaaaggtg ttgtgtgttt ttcatgcgga 780 cagcgaggcc atgtaaaaaa ggattgtacg ggctgttatt cttgtggatt aaaggggcat 840 ataaagaaaa attgtccaaa acaaagcaag aagacaggga cgcgagagga agcaaattct 900 gcggaacaag atggcggaac gaccaggcag gcgtttctcg gaagtagtga cgctgctgta 960 aacgatttct gggtaatcga ctcgggcgca tcagatcaca tgacaagacg acgtgactgg 1020 ttcagttttt acgaagaata tgaagaaccg ttgatggtaa gagttggaaa tggagacggg 1080 attccagcct acgggagagg taacattgaa attgaaactc ttgtgaatgg acaatggatt 1140 tcgggtgtca ttcaagatgt tttgtatgta ccatgcttaa agcgaaattt attttctgta 1200 aaagtcgcag ctaaaaaggg cgtaaatttt tctcttacga atcatggttc acgttttgaa 1260 ttaactcgcg acaaaaagat tatagcgacg ggctcggata acggtgattt gtataaaatc 1320 gatctgcgtg ttttaatacc gaaagaatgt aatattgtca gtaagtttag caaaattgat 1380 actgtaaaat caaaagtgga tactttgcag ttatggcatg aaaggctgtg tcatcaggac 1440 aagagacatg ttaaaacttt gttaaaaaac atgggtatta atgttttaga tgtagatgat 1500 ttctgtgacg gctgtgctta tggcaagcat cataggttaa gttttcataa tagagttgaa 1560 cgtgcgacga gagttcgaga gattatacac actgacgtgt gtggtcctat gcaacaggaa 1620 tcactcggtg gtaaaagata ttcactaatt tttaaagatg aattttcggg ttttagaaaa 1680 atttattttt tgcgagaaaa gtcagaagtt tttgaaaaac taaagatgtt ttgtaaagaa 1740 ctcgaaaatc agttcggtga gattgtaagg gaaattcaca gcgatggtgg taaggaatat 1800 ataaataaaa atgttaacag ttttttaagt agtaaaggta taaaacacag tgtgaatgtt 1860 ccttatactc cacagcagaa cggcgttgca gaacgggata acagaatcat tgttgaagca 1920 gcacgatcga tgatatatgc gaagccagat ttacccttat ccttgtgggc ggaggccatg 1980 aatacagcag catatgtcat taatagaaca ggtccgacta agcaatcggg aagaactcca 2040 tatgagttat ggtttggtaa gacagctaat ataagtaatt ttaaaatttt tggaacggaa 2100 tgttttgttc acataccaaa tgcgaaacgt agaaaacttg ataaaaaggc agagaaagga 2160 tttgtagttg gttacattga ggactgcaag ggctatcgag tatatatacc aagtttaaaa 2220 gatatagttt tgagtcgtga tgttctgttc aaaaaggaag tggtagttcc taatcaaatc 2280 gaaataaaga gtgggggaga gcaaaatgac acatacgatt ctgggattct ttatgacaaa 2340 gatttacaaa agaacgaaaa agaaaaagaa acacaatgtg attgtaaaac aaaacctgtg 2400 cagtcttcaa gtaatgacaa ctcaaatgta agacaattaa gagacagaaa taaaattaaa 2460 aaaccagagc tctatggttc tccagtttca ttttttactg aaatgctgcc tgaaaattat 2520 agcgaagcta taaattctaa tgataagtgt aactgggaag tagctatgca tgacgagata 2580 aattcgttac tcgaaaatga aacatggatg ctcgtagaaa aaccagaaac gaaaaagatt 2640 ttaaagagtc gttgggttta tacagttaaa tcgaatcctg atggttcaaa gcgacataaa 2700 gctcgtttag ttataaaggg ctgttttcag acggaaggaa tagattataa agaaacgttt 2760 agcccagtag tacgttttga tacagtaaga actttattga gtgtagcggc ccacgacgga 2820 ttaaatcttg ctcaatttga tattaaaacc gcttttttat acggatcatt aaaggaggat 2880 atatacatgt atcaacccga gggttttaat aatggtacgg cacgtgtatg caaactatta 2940 aagagtctat atggactaaa gcaagcacca agatgttgga tagagcattt cacgggtttt 3000 ctcgaatttt ttggtttttc aaggagcata gcagatccat gtttttacat ttataaaagt 3060 gaaattgata aaatgttatt ggctgtttat gtagacgatg gtctgttggc tgcaaccaac 3120 aaactgctca tagagaggtt cttccttgaa ttgcgtaaac attttaaaat aaccgagaca 3180 aataatgtaa cgagttttct tggagtggaa atcgttaagt tgccagatgg atctacttta 3240 ttaaccaagg aaaatttgtt agaaaaatgt tagagaaatt taatatgagt aacgctaaca 3300 tagtttctac tccaattgaa actggttggg acttatccag tccatgtaaa ttagaaaaag 3360 agattccgta tcgagaagcg gtaggtaact taatgtatct gcaagtaatt agcagaccag 3420 atattagttt tgcggttaat attgcgtcta gagcgttaga aaatcctagt atagcacact 3480 ggttacttgt aaaacgtatt atgagatacc taaagggtac tgcggacatt ggattgttgt 3540 attgtaagac aggtggcttt gaagcatata gcgacgcaga ttttgcaggt gataaagaga 3600 ctcgaaaatc gacgtctggt attctatgta aaaacgcaaa cgcagcgatt gtttggcaaa 3660 gtaaacgaca acaatgtatc tctctgtcta ctacggaatc cgaatatgtt agcgctgcct 3720 cagctgttaa agagataatc tggttaaaaa gcttattgac tgaatgtgga gattgcgata 3780 acgaggggtt ttgtttattc atagacaaca tgagtgcaat aagattaatt aaaaaccctg 3840 aattccatca gagaagtaaa catatcgatg taaaatttca ttttatacgc gacatgtatg 3900 aaaaaggtgt gattaatgta aaacatgtta gtagcgatga gcaaactgca gacattttta 3960 ctaaggcttt agcaaaaccg agatttataa atttacgttg taaattagga ttaatcacta 4020 aggaaaatat taataagtcg ttttttttgt aaaatgttga gttttgggga gagtgt 4076 // ID BEL-188_AA-I repbase; DNA; INV; 6505 BP. XX AC supercont1.88; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-188_AA_; KW BEL-188_AA-LTR; BEL-188_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6505 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.88; Positions 1280708 1287212. XX CC Positions [5720-6112] - Integrase core CC 'CACGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 38..2116 FT /product="BEL-188_AA-I_1p" FT /translation="MLGSKKATDRRSSVIRQLTEEESSTVGCCPSCDRPDV FT ADKWMVQCELCERWFHFSCAKVNESIKDRSFACTTCALPPCPESTRTGASR FT TSSTRRRLELLRLEEEKEVQERILREHAEEEAAQQKKAQEEEKDRRKRIME FT EKLEIARRYITKKFEVLQEEEQSIDGRSRKSNLSTRSKLDNVQKWVEDHDL FT AAATGSGTNHVPAGSTPITEAGPNRASVETSESVILPTSSAHAHAPLARVA FT ATGLYDKDEIASWGILVPRFGQYASNPTSGLNTLPRTTPQTTVQDGQTSHA FT APFFAAAEISPPASVQQTGPGAFSMEISPPPANEKSRADLERDLHELQQQL FT ANLKRASSTQHYPELRPNNSSEPTAATTSLSNTLGSGMSNLHASTQAGHVV FT DRQVHIPISRSVSQASGLGQDTATYVNPMVVSFPSQSNLSGIPNLLFNHTY FT QATSADWYRQYHESVPNYRRQNPLLPVSTAAGRVPRQYEQSFPNPPNQHPP FT FPSGVPSYPFVASVPSVPNPIVTPSVSQEVVHHPPVSVCGPSPQQIAARHV FT VPKELPEFYGDPIDWPLFVSSYNNSTRMCGYSDDENLMRLQRCLKGNAKEA FT VRGHLYHPSSVPQIMATLETLYGRSELIVRCLMNKVLATPAPKADKLESLI FT SFGLVVQPTTCSSGGFARFQYVYDHDCIGSQQRYTLRRTVAESG" FT CDS 2052..5726 FT /product="BEL-188_AA-I_2p" FT /translation="MTTIVSAASNVTLYVEPLRKVDKPKGKEKGFVNAHSS FT ESEQKADRNVRKESGGASTVYQPKPCLMCKKDGHKVKDCISFKKLSLENRW FT KTAQQLSLCRRCLIPHGKWPCKATECGENDCKARHHRLLHPGNPQSPTSEA FT TTSTRGQTTSIPSATVNVHRQLPCPVLFRILPVTLYGKKVSINTLAFLDDG FT SSYTLMEKSLADELGVEGDTEPLCLQWTSNIKRTEWDSQNVQLEISAVGKN FT QKHSLEFVRTVESLNLPPQSLQYEEMAQKFEYLKGLPVDSYPSTAPRILIG FT ADNAKLLLTLKKREGRYNEPVAAKTRLGWTIYGKMQGLDGKPDHHLLHICE FT RSNDQQLHDAVKDFFTIENVGVALAPLVESEEDRRSREILEATTVRTPSGR FT FQTGLLWKFDYIEFPENRFMAEKRLMCLERSLAKKPELYVKVRQQILDYQT FT KGYAHKLTEEESSDSDPRRVWYLPLGVVQTPSKPGKVRVVWDAAAKTGGVS FT LNSMLLKGPDLLTSLPSVLFRFRERQVAITGDIKEMFHQILIRPEDRQSQR FT FLWRENSEDPIQEFVMDVATFGSTCSPCSAQYIKNKNAEEWKNVYPEAAAA FT IVENTYVDDYANSSDTVEEAVRIALEVKEIHASAGFEIRNWLSNSPQVLQR FT VGEQRLQVTKSFIAEKSTESERILGMAWQPESDEFTFALRLRTDVQLLLAE FT EIIPTKRQVLQVVMSIFDPLGLVAAFVIQGKCLIQDIWRAGVGWDDYIPND FT LLPHWRRWVNVLQNLNQVRIPRCYFPGYDPRSIGSLELHVFVDASETAYAC FT AAYFRIVDRGQVRCALVSAKTKVAPLNPMSVRRLELQAAVIGVRLMKSVQS FT NHSLPIKRRVLWGDSKTVQSWIRSDQRKYRQFVAFRVSEILNATSLEEWRW FT VPTRLNVADAATKWGKGLSFNADSRWFLGPEFLYEDESRWPNQEYSPPNTL FT EEIRSIHAHHRVTSDPLINYQRFSNYQRLLRTVGYVLRFIDRCRRRSITLS FT DMDVLSREELLNAEIVLWKLVQLEEYPDEVATIRRNKSLRPEQAIRLEKGS FT PLRKLSAFLDDHGVLRIEGRIDAAEFVPYGPKHPVILPKAHRFTELLVLRF FT HQRYAHGYGETVINELKQIYHIPNLRTVVRKVARRCSWCIVYRAVPKTPRM FT APLPAARVTPYVRPFTFIGIDYCGPFLIRIGRSNVKRWVVLITCLTIRAVH FT LEVACSLSTESCKWR" XX SQ Sequence 6505 BP; 1795 A; 1543 C; 1684 G; 1483 T; 0 other; aactttaagg tttctaacct tacaaacgac cttcggaatg ctgggctcta agaaggcaac 60 agatagacgg tctagtgtca ttcgacagct gacggaagag gaaagttcta cagtcggttg 120 ctgtccttct tgtgaccgcc cagatgtcgc cgacaagtgg atggtacaat gtgagctttg 180 cgaaaggtgg ttccacttct cttgtgcgaa ggtgaacgaa agcataaagg accgtagctt 240 tgcgtgcact acatgcgcac tccctccatg tccggagtcc accagaacag gtgcgtcaag 300 aaccagttca acaagaagac gactggaact tctacgattg gaggaggaga aggaggtaca 360 agagaggata ttaagggaac acgcagaaga agaagctgcc cagcagaaaa aggcgcagga 420 ggaggaaaag gatcgaagga aaaggattat ggaggaaaag ctggagatcg cgagacgtta 480 catcacgaaa aagttcgaag tgctacagga agaggagcag tcgatcgacg gtagaagtcg 540 aaagagtaat ttgagcacac gcagcaaact cgacaacgtt caaaaatggg ttgaggatca 600 cgatttagca gcggcaactg ggtccgggac gaaccatgtc cccgctggtt ccacgccaat 660 cactgaagct ggccccaatc gagcatcggt cgagacttct gaaagtgtta tacttccgac 720 ttcttctgct catgctcatg cccctctcgc tcgtgtagcg gctacaggac tgtacgataa 780 ggacgaaatc gcatcgtggg gaatattggt gccccgtttc ggccaatacg catcaaatcc 840 aacttccgga ttgaacacac tacctcgaac tactcctcaa accacagtgc aagatgggca 900 aacttctcat gccgccccat tcttcgcagc agctgaaatt tcaccacctg catcggtaca 960 gcaaactggg ccgggggcgt tttccatgga aatatctcca ccacccgcca acgagaaatc 1020 aagagcagat ctggagcgag acttacatga gctacaacag cagttagcaa acctgaagcg 1080 agcgtcgtct actcagcact atcctgaact gcgtccgaac aacagcagtg agccaacggc 1140 agctaccacc agcctaagta acacattggg ttcaggtatg tcaaacttgc atgcttcgac 1200 acaggcaggt cacgtggtag acagacaggt gcatattccg attagtcgtt cagtaagtca 1260 ggcttcaggt ttaggtcagg acactgctac atatgtgaat ccgatggttg tatcatttcc 1320 ttcacaatca aatctatcag ggattcccaa tcttttattt aatcatacct atcaggctac 1380 ctccgccgat tggtatagac aataccacga aagtgtccct aattatcgaa gacaaaatcc 1440 gcttcttccg gttagtaccg ccgccggtcg tgtgccgcga caatacgagc aaagttttcc 1500 gaatcctccg aatcaacatc caccatttcc gagtggcgtt ccttcatatc cgttcgtcgc 1560 gagtgtacca agtgttccga acccgatcgt cacgccatca gtatctcaag aagttgtgca 1620 ccatcctccc gtctcggttt gtggtccgag cccccagcaa atagccgcta ggcatgtggt 1680 tcccaaggag ctaccggaat tttatggcga tccgatagac tggccattgt tcgtgagcag 1740 ctacaataat tcgacgcgta tgtgtggtta tagcgatgat gaaaatctca tgcgtcttca 1800 acgttgcctc aaggggaacg ctaaggaggc agtgcgaggg cacctttatc atccgtcatc 1860 tgtaccccag atcatggcaa ccttagagac actttacgga cgctcggaac taatagtgag 1920 gtgcctgatg aacaaagttc ttgcaacgcc agctcccaaa gcggacaagc tggagagcct 1980 tatcagcttc ggactcgtgg ttcaaccaac gacatgttcc agtggtggat ttgcgcgctt 2040 tcagtacgta tatgaccacg attgtatcgg cagccagcaa cgttacactc tacgtagaac 2100 cgttgcggaa agtggataaa cctaaaggca aggagaaagg attcgtcaat gcacattcat 2160 cggaatctga acagaaagcc gatcgtaacg tcaggaagga aagcggtggc gcaagtacag 2220 tctatcaacc gaagccgtgc ctaatgtgca agaaagacgg acacaaagtg aaggactgca 2280 tcagttttaa aaaattgtcg ctggaaaatc gctggaagac cgctcaacag ttgagtcttt 2340 gccgtcgctg cctgattccg catggcaagt ggccatgcaa agccacagaa tgcggtgaaa 2400 atgattgtaa agcgcgtcat catcggctac tgcatcctgg aaaccctcaa tcaccgactt 2460 ccgaagcgac aacgtctacg agaggacaga ccacttccat tccgtcagca acagtaaacg 2520 ttcatcgaca gctaccatgc ccggtgcttt ttcgcatact tccggtaacc ttatacggga 2580 aaaaggtatc gatcaacacg ttggcatttc tcgacgatgg atcatcgtac acgttgatgg 2640 agaagtcctt agcggacgaa ctaggtgttg aaggcgatac cgagccgctg tgtttgcagt 2700 ggaccagcaa tattaagcga accgaatggg attcccaaaa cgtgcagcta gaaatctccg 2760 ccgttggtaa gaatcaaaaa cattcgctgg aatttgtacg aacagtggag agtttgaacc 2820 ttccaccgca atccttgcag tatgaagaaa tggcacagaa attcgaatat ctcaaggggc 2880 ttccggtgga cagttatccc agtaccgctc cacgcatttt aatcggtgcc gataatgcaa 2940 agctattatt aaccttaaag aagcgtgaag ggcggtataa tgagccagtg gcagcgaaaa 3000 caaggctagg ttggactatt tacggaaaaa tgcaaggcct cgacggtaaa ccagaccatc 3060 atcttcttca catatgtgag agatcgaacg atcagcagct tcatgacgca gtgaaggact 3120 ttttcaccat cgaaaacgtc ggcgtcgctt tagctccgct cgtggagagc gaagaagatc 3180 gaagatcgcg ggagatacta gaagcgacaa ccgtgcggac accatctgga cgtttccaaa 3240 caggtttgtt atggaagttc gactacatcg agtttccaga aaaccgattc atggcagaga 3300 agcggttgat gtgcctggaa cgaagtttag ccaagaaacc ggagctgtac gtcaaagttc 3360 gtcaacaaat cctggattat cagacgaaag ggtatgctca caagcttacc gaagaagaat 3420 caagtgacag cgatccaaga agagtctggt atttgccgtt gggggtggtg caaaccccaa 3480 gcaagcctgg aaaggtcaga gtagtatggg atgccgcagc aaaaactggc ggcgtctctt 3540 tgaactctat gctactgaag ggcccggatt tactcacatc tctgccatcc gtattgttcc 3600 gtttccgtga gcgacaagtt gccataacag gcgacattaa ggaaatgttc caccagatcc 3660 tgattcggcc agaggaccgg caatctcagc gcttcttgtg gcgggagaac tcagaggatc 3720 ctatacagga attcgttatg gatgttgcaa cgttcgggtc tacctgttca ccatgctcag 3780 cccaatacat caaaaataag aatgccgagg aatggaagaa cgtatatccg gaagctgctg 3840 cagctatcgt ggagaacaca tacgtcgacg actatgcgaa cagttccgac acggttgagg 3900 aagcggttcg catagcccta gaagtgaagg agatccatgc aagtgcgggt ttcgagattc 3960 gtaattggct ctcgaactcc cctcaagttc ttcaacgggt cggggagcaa agattgcaag 4020 taacgaaatc cttcattgcg gagaagtcta ccgaatcgga acgaattcta gggatggcat 4080 ggcaaccgga aagcgatgag ttcacgttcg ccttaaggtt gcgaaccgac gtgcagctac 4140 tgttagcgga agagattatt cccacgaaac gtcaagtcct gcaagttgta atgagtatct 4200 tcgatcccct gggacttgta gcagcattcg taatccaagg caagtgctta atccaggata 4260 tttggagagc cggcgtgggc tgggatgact atattccaaa cgacttgctt cctcactggc 4320 ggcgatgggt taatgtgctt caaaatttga accaagtgag gattcctcgc tgctactttc 4380 ctggttatga tcctcgaagc ataggcagcc tagaactcca cgtgtttgtt gacgcgagtg 4440 aaaccgctta cgcttgtgct gcctattttc ggattgtcga ccgcgggcaa gtgcgttgcg 4500 cattagtgtc cgcaaaaaca aaagtggcac ccctcaatcc aatgtcggtc cgtcgactag 4560 agctgcaggc cgcagtcatc ggagttcgtc tgatgaaatc ggttcagtcg aaccattcac 4620 tacctatcaa acgtcgtgtc ctctggggtg actctaaaac agtccagtcc tggatacgtt 4680 cagatcagcg aaagtaccgt caatttgtgg cgttccgagt cagcgagatt ttgaacgcaa 4740 cgagcctgga agagtggaga tgggtcccca cacggctcaa tgtcgctgat gcagctacga 4800 agtggggcaa aggactaagt ttcaacgcag attctcggtg gtttctcggg cctgaattcc 4860 tttacgagga cgagtctaga tggccaaacc aagagtactc cccaccgaac actttggaag 4920 aaattcgttc aattcacgct caccatcgtg taacttcgga tcctttgatc aactaccaac 4980 gattttcgaa ttatcaacgg ttgctacgga cggtcggata cgttctccgt ttcattgatc 5040 gctgccgaag aagatcgatc acactttctg atatggacgt actttcgaga gaggaattgt 5100 tgaacgcaga aattgtgctg tggaagttgg ttcagcttga agaataccct gatgaagtag 5160 ctacgatccg cagaaacaaa tctctacggc cggaacaagc gatacgattg gaaaagggaa 5220 gcccgttgcg caaactgtca gcatttctcg acgaccatgg agtgctgcga atcgaaggaa 5280 gaattgatgc ggccgaattt gtcccatacg gacccaagca tccggtaatt ctccctaagg 5340 cgcaccgttt tactgagttg ctagttctac gatttcacca gcggtatgcg cacggttatg 5400 gagaaactgt gatcaatgaa ctgaaacaga tctaccacat ccccaacttg agaactgtag 5460 ttcgaaaggt tgcgagaaga tgttcttggt gcattgtcta tcgagctgtt cctaagacac 5520 cgcggatggc accgcttcca gccgctagag taacaccgta tgttcggcca ttcaccttca 5580 tcgggattga ttattgtgga cccttcctaa ttcgcattgg gcggagtaac gtgaaaagat 5640 gggtggtact tatcacctgt ttgaccatac gggctgtaca tttggaagta gcgtgcagct 5700 tgtccactga atcgtgtaaa tggcgataag gcggttcata gctaggagag gagctccgct 5760 ggaaatttat agtgaccagg gcacgaactt cgtaggagct agtaaggagt tgcggacaga 5820 atcaacggcg gtaaatcgtg ccttggctga atcatttacc aatcgtgaaa cacagtggcg 5880 tttcaaccct cctgctgccc cacatatggg tggcgtgtgg gaacgtatgg tgcgaacagt 5940 gaagaattcg ttggagacat tatcgaccaa cagaacacca gacgaggaga cgttccagac 6000 ccttttgatt gaagttgaag gaataatcaa ttctcggccg ttgacgtttg taccattggg 6060 aactgaagaa gaggaagctc ttacgccaaa tcatttcctg atgctcagtt ccagtaatgt 6120 aaatcaacct ccccaggagc cggtggctga taccgggcga acactaagga ctaattggga 6180 ccaaatacgg agcctgttgg atcagttttg gaagaagtgg ataaaaggat acctcccaac 6240 tatctgccgc cgtacaaaat ggttcgacga caccaaaccg gtgaaggtcg gagatttggt 6300 ggtggtggtc gaagaatccg tgcgaaacag ttggatgcgc gggaaggttg tgaaggtgtc 6360 tcccggcaaa gatggaagaa tacgagaagt agaggtgcag actacgaaag gagtatttcg 6420 gcgtcctgtt actaaagtgg ccgtgcttga tgtggcagtc gatggcatag ctgaggaaca 6480 ccgagcagca atacgggtcg gggaa 6505 // ID L1_Ele3 repbase; DNA; INV; 4465 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4465 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4465 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >92% identity, and ~94% identical to the original CC sequence in [1]. It is close to IseAg1 in Anopheles gambiae. XX FH Key Location/Qualifiers FT CDS 332..1198 FT /product="L1_Ele3_1p" FT /translation="RTHEVEIDGKKHKLRITIEDGSVEVKVHDLPEDVTKE FT KIVEFLRAFGEVVSIHELTWGESYEFAGVPLGIWSARMLLHRNIDSWVTID FT GQQAYVVYKGQVISCKHCKEQAHSGISCVQNKKLLVQKSYANVAKQSGNRP FT PPKKSSGVKPPSTKPVGPTAPSLTSSAAFPELPKPPSRTEQSAPVTKANSQ FT SDKIDLTASPSPQQQRAHGSSSSHRSTQQATEATQPNVAVVDLFKKPIHAL FT RSHSKNGNGNETDESSASTSSRRSRGRPPGKKPRREDDDDEQDEDSQP" FT CDS 1201..4038 FT /product="L1_Ele3_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MALTSYNIASINIGTITNPTKLAALRTFINSQSLDIV FT CLQEVENDQLSLPGFVVYTNVDHTRRGTAVALKEHIQVSHVEKSLDSRLIA FT LRVHDTTICNLYAPSGSAQRAAREDFFNGTLAYYLRHPTQHVILAGDFNCV FT LRACDSSSPNTSPALKTAVQQLQLHDVWEKLRPRDAGFTYVCRNAQSRLDR FT IYVSSGLREHLRTAHTHVCSFTDHKALTLRLCLPQLGHEPGRGFWSLRPHL FT LTDENVAEFQTQWQYWTRQRRNYGSWIEWWISYAKPRIKTFFKRKSRIVFD FT EFHDAHQRLYAELRLAYDGYYQHPEMLPTINRLKGEMLALQRNFSHMFMRM FT NETHVAGEPVSIFQLGERRRRRTVITQLRTEENETVAEPAAIEANLLNFFS FT RLYTEEATAENNNDDGFTCERVIPPDDPLNQACTDEITTAEILSAIRXSAP FT RRSPGNDGIPREFYLRXFDVIHRELNLLLNEALAGNLPPAFVEGIIVLVKK FT KGGDNTARSYRPISLLNSDYKLLSRILKSRLECVMRAHHVLSDGQKCSNSE FT RNIFQATLALKDRIASLRHHRRAGKLISFDLENAFDRVRHSFLFGTMCSLG FT FSRDLVALLARIASRSTSRLLINGRLSRPFEIQRSVRQGDPLSMHLFVLYL FT HPLVCRLEQACGNDLLVAYADDISVIVTSAGQIELMNGIFIRFEAVAGARL FT NLRKTVAIDVGFCEGNPINTQWLQTADVVKILGITFANSIRLMTTLNWNAL FT VGKFAQQMWLHSLRMLTLHQKVIMLNTFGTSKLWYVSSVLPPLGVHTAKIT FT STMGAFLWRGMPARVPMMQLARSKEHGGLNLHLPALKAKSLAINRHLKEID FT SLPYYKSLLFHDNPRPAISIDHPDLKIILSNYSHIPPLIQENPSADLIHRH FT FVSQTDLPKVERSHPTIDWPRAWRNISDKQLSSAERRSSTCW" XX SQ Sequence 4465 BP; 1105 A; 1259 C; 1161 G; 929 T; 11 other; cagttatcgc tcagcttctg agacgcawag acgtgtttac gactgcgctc ctcattagtc 60 aggcatattt tttctgatcg cttttcgatt gtttcgttgt gccgcgtcgc gtgtgtggtg 120 attacgcgat gaggcgagaa aacaccttcc ggatcgacta ctcgagcttc cccgtgaagc 180 cggctttcga gaaggtgcac ggcttttgcc gtacagttct cggcctgaas aaggamgatg 240 tgawaagact mcagtgcagt agaggtgaac aatgtgcatt cgtcaaggtc ggcgacctgg 300 cgctcgcaca gaaaatagtg gaagaacatg acggacgcac gaagtggaga tagatgggaa 360 gaagcacaag cttcgcataa ccatcgagga tggaagtgtc gaagtgaagg tgcatgacct 420 ccccgaagac gtgacgaaag agaagatcgt ggagttcctg cgcgcgttcg gtgaagtggt 480 ctcgattcac gagctaacgt ggggagagag ctacgagttc gccggcgtac cactcggcat 540 atggtcggcm cgaatgctac tgcaccgaaa catcgactcg tgggtcacca tcgatgggca 600 gcaggcgtac gtggtgtaca aggggcaggt gatctcgtgc aagcactgca aagagcaggc 660 acactctggc atatcatgtg tccagaacaa aaaacttctg gtgcagaaga gctacgccaa 720 cgtagcaaag cagagtggca accgtccgcc accgaaaaaa tcgtccggtg tgaaaccacc 780 gagcacgaaa ccggtaggtc ctaccgcgcc atcactcacc tcgtcggctg cttttcccga 840 gctgccgaag cctccgagtc ggaccgaaca gtctgccccg gtgacgaaag caaacagtca 900 atccgacaag attgacttaa cggcgtcccc cagcccgcaa cagcaacgcg cgcacggttc 960 gtcgtcgtct caccgatcga ctcaacaagc caccgaagct acccaaccca atgttgctgt 1020 ggtggacttg ttcaaaaagc cgatccatgc gctgcgatcg cacagcaaga atggcaacgg 1080 caacgaaacc gatgaatcgt cggcatccac gagcagcagg cgaagcagag gccgaccacc 1140 cggcaaaaag ccccgccgag aagacgatga cgacgagcag gatgaggatt cccaaccgta 1200 atggctctca catcctacaa tatcgcgtcc atcaacatcg gcaccatcac caaccccacg 1260 aaactagcag cgctgcgcac cttcatcaac agccaaagtc tcgatattgt gtgtctgcaa 1320 gaagtggaaa acgaccagct ctccttgcct ggcttcgtcg tatacacgaa tgtagaccac 1380 acgagaagag gaacagctgt tgctctgaag gagcacattc aggtatctca cgtcgagaaa 1440 agcctggaca gccgcctaat cgctctgcga gtgcatgata ccacgatctg taacctctac 1500 gctccctccg gctctgcaca gcgcgcagca cgggaggatt tcttcaacgg cactctcgcc 1560 tactatctcc gccatcccac tcagcacgtc atcctcgctg gcgattttaa ttgcgtgctt 1620 cgagcatgcg attcgtccag ccccaatacc agccctgcac tcaaaaccgc tgtgcagcag 1680 ctgcagctcc acgatgtgtg ggaaaagttg cgcccccggg atgccggttt tacgtacgtt 1740 tgccggaacg cacaatcgcg cctcgaccgc atttatgtga gcagcggttt gcgggagcac 1800 ttgcgtaccg cgcatacgca tgtctgctcg ttcacggacc acaaagcgct gacactccgt 1860 ctttgcctcc cccagctagg acatgagccc ggccgtggtt tctggtccct tcgaccgcat 1920 cttctcaccg acgagaacgt tgccgagttc caaacacagt ggcaatattg gactcggcag 1980 cgcagaaact acggctcgtg gatcgagtgg tggatttcgt acgctaagcc aagaataaaa 2040 acgtttttca agcgtaaatc gcggatcgtc tttgacgaat tccacgacgc gcaccagcgt 2100 ctatacgcgg agctgcggct agcgtacgac gggtactacc agcatcccga aatgttaccc 2160 accatcaacc gattgaaagg cgaaatgttg gcgcttcagc gcaacttttc gcacatgttt 2220 atgcgaatga acgagacgca cgtggcggga gaacccgtct cgatatttca gttgggggag 2280 agacgaagga gaaggaccgt catcacccag ttgcgaacgg aggagaatga gaccgttgcc 2340 gagccagcag cgatcgaagc gaatctgcta aacttcttct cgcgtctcta cacggaggaa 2400 gcaacagcag aaaacaacaa cgatgacggg tttacatgcg agcgtgtgat cccacccgat 2460 gatcccctca atcaagcctg cacggacgag atcaccacag cagaaatcct ctctgccatc 2520 cgakcgagtg ccccacggcg ttcccccggc aatgacggca tcccamggga attctatctg 2580 cgcmtgttcg acgtgatcca tcgggagctc aatctcctgc tcaacgaagc actagcaggc 2640 aaccttcccc ctgcttttgt ggagggcatc atcgtgctgg tgaagaagaa gggaggcgac 2700 aacacggccc ggtcgtatcg acccatctcg ctgctcaaca gcgactacaa gcttctctcg 2760 cgcattctga aatccaggct ggagtgcgtc atgcgagccc accacgtgct gagcgacggg 2820 cagaaatgct ccaactcgga gcgtaacatt tttcaggcca ctcttgctct gaaagatcgg 2880 attgcgagtc tccgtcacca ccggcgcgcc ggcaagctaa ttagctttga tttggagaac 2940 gcattcgatc gggtccggca ctcattcctg ttcggcacca tgtgctcgct cggctttagt 3000 cgggatctcg tcgctcttct cgcgcgcatt gccagtcgat ccacctctcg gctgctcatc 3060 aacgggcgtc tctcacgtcc gttcgagatt caacgttcgg tccggcaggg tgacccgttg 3120 tccatgcatc tcttcgttct ctacctccac cccctggtgt gtaggctcga acaagcatgt 3180 ggcaacgatt tgctggttgc gtatgccgac gatatcagcg tcatcgtgac gtcagccggg 3240 cagatcgagc tgatgaatgg gatattcatt cgcttcgagg ctgttgccgg cgcgcggttg 3300 aatctgcgca aaacagtggc gatcgatgtc gggttttgcg aaggtaatcc aattaacacc 3360 caatggctac agacagccga tgtagtcaaa attttgggta ttaccttcgc aaactcgata 3420 cggttgatga ccacgctaaa ctggaatgcg ctggtgggga agtttgcgca acaaatgtgg 3480 ctgcactcgc tgcgcatgct gaccctgcac cagaaagtca tcatgctgaa cacgttcggc 3540 accagcaagc tgtggtacgt ttcgtcggtg ttgccaccgc taggagtgca cacggcgaaa 3600 atcacctcaa cgatgggcgc gttcctgtgg agaggaatgc ccgcccgcgt cccgatgatg 3660 cagctggcgc gcagcaagga gcatggggga ttaaaccttc atctgccggc gttgaaggcg 3720 aaatcactcg ccatcaaccg tcatctcaag gagatcgatt cccttcccta ttataaatcc 3780 cttcttttcc acgataatcc ccgcccagca atctcaatag atcatcccga cctaaaaata 3840 atcctatcaa actactccca cattccaccc cttatccaag aaaacccctc cgccgatctc 3900 atccaccggc atttcgtttc gcaaacggac ctgcccaagg tggaacgaag tcatccgacg 3960 atcgactggc cacgtgcgtg gcgaaacatc agcgacaagc agctctcttc ggcggagcgt 4020 cgaagctcta cctgctggtg aacgagaagt gggagcaccg aaaactgctg tccgtgatgc 4080 ggagagcaga caacgagttt tgcacgcact gcgacggacg aacggttaga aacgctccac 4140 cataaatttt gcgactgtgc tcgtgtcggc ccggcttgga cggtcctaca gcagaggctg 4200 gcagtcgtga tgaatggatg gaggmgactg acgttcgaag acctggtgag acctgcsttc 4260 gcaggcgtaa atagaggtag acgagtcgag attctgaaaa tgttagttaa atacatcacc 4320 tttgttaacg agtgtaacgg tagaatcgat gttagagcac taaattttca tcttgatctg 4380 taatttttag atgtaaataa ctgttttgta aatcgctgaa ctgactaaat aaaaccgaat 4440 tttacaaaaa aaaaaaaaaa aaaaa 4465 // ID Gypsy-7_RP-I repbase; DNA; INV; 4668 BP. XX AC ACPB02048231; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_RP_; KW Gypsy-7_RP-LTR; Gypsy-7_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4668 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02048231; Positions 3822 8489. XX CC Positions [3674-4147] - Integrase core CC 'CTTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2189..2839,2843..4492) FT /product="Gypsy-7_RP-I_1p" FT /translation="MAVDYRELNKITLKDHYPIPRIDDHIDQLKDKVYFTT FT LDLKDAFHHIKIKESSIPYTAFVTFKGQYEYMRMPFGLTNGPPHFMRFINL FT AFRKLLVAEKILVYMDDILIATSSVEENLNTLIEVLEILVQNCLELKFAKC FT SFLQTKINYLGYLVDDKGIRPSPANVAAVANYPVPKNFRELHSFLGLISYF FT RKFIHNFAIVAQPLYKLLKSKINFVWSEESNCFERLKNFLISEPLLAIYSP FT TALTELHCDASAQGYGSILMQKQSDNKFHPVCYFSKRTTEAESRYHSFELE FT ALAIIYSLERFRVYLQGIPFKIITDCNSLKLTLEKKDINPRIMRWSLILQN FT FEYILEHRGNERMRHADAISRHSQLMVISENSLERNLALLQDKDDDITKLK FT LELEKNQSKTFELSNGLVYLKSYDLLLFYVPKSMENAVIQECHHTVGHQGC FT EKTLGFLTRVYWFPNMKKKIQKYIDNCLKCITYSKKSSGKPEGKFYSYDKE FT GEPFHCVHIDHYGPLATTKFKYRYIFEIIDSFTKFVKFYPVRTTNADEVIK FT WLECYFSHYSKPVKLISDRGTAFSSSKFTEFCKDYGIQHVMIATGTPHANG FT QIEIVNRTLTPILAKLTTSPTQWYEVLAKVEFAFNNSINRATGETPSMILF FT GINQKGNHNDDLRQILEFQHPVFRDLEQSRYNAINKLQNSANYYEKYYNSK FT HKAPGKYEVNDYIMIKNTDVTPGINKKLIPAFKGPYVIKNPEGHQTTQLPY FT EGICSPMNMKPWLPNS" XX SQ Sequence 4668 BP; 1645 A; 745 C; 872 G; 1406 T; 0 other; gcttcaggtg tggggtacga ggcagatcaa gttggtaaga ggaaaataaa gttatattgc 60 aaaaaaaaat ggaacgaatt gaacattgaa aaatagtgaa gtgttgtcgt atatactatt 120 tgattgcaag taagtaatag cagttcagat tgatcatcgg ccgtaagtaa aacccagtta 180 catattacat tggttcattt tatttgcaaa tatgacgtca ttaggagaaa ttgatagaat 240 taacaaaaat attagcaatt ctaccattaa cgctattcga gcacttcatt tattaatttt 300 tgagtgcgaa ggaaatcgtc agaaaagagc tagactcaga aaattggaag gatttagttt 360 cttgttggat tcagaagaat ataaggccaa acttaaattc atacatgaaa atttaactga 420 tggtgacctt acactaatct gcaatattct taacattgac catgacggaa atgcggacga 480 actagccaaa cgtatttgta caagtctaat gaatttagaa gatttaaaaa gattaaatga 540 agaggttaat gagcaagaag atgaagatac tgaagagcaa gatgtaaaag aagaagaaaa 600 taaagaagat gaagaaagaa gtgattcaga gttgtgttac catcccaaat ttttattaag 660 agatatagaa gattctattc gtccttttac tggcaatgac aattacaccg ctaattgatg 720 gattaaagag tttgaagaga tgaggaaatt attgaaatgg aacgatttgg aaacttttat 780 atatgccaag aaatctataa aaggatttgc aaaaacacta attaattgtg agccgaatat 840 caacagttgg gttaatttga agaaggtgtt aagagctgaa tttccaacta cgataaatag 900 tgctactcta catcacatgt tagttaagag aaaaatgaaa aaagaagaga cacttaaaga 960 atacttttgc cacatgcgcg aactggccgg aaggggttgt atcgaagatg atgcattaat 1020 taaatacaca atcgacggca tcaatgatga tgttagaaac aaagttgttc tttttggctg 1080 ctctacagtt gcagaattta agaaaagaat tgatatttat taagaaatta agttaagcag 1140 taaagtacat caagagaaat atcaattgtt agaattcaag aaatttccag agtcaatgca 1200 ggaaagaaga gtgaaaaatg caggaactac gggtaatgcg aatgttgttc gatgttacaa 1260 ctgcgggaaa atgggtcatg tttcttcagc gtgcacggct ccgaaacgag aacctaattc 1320 ctgtttcaaa tgcggatctc aaggacatca aaaaaaggac tgtccactaa tacgtcccac 1380 atcgagacag ctcactggag gtaccacgca aaaagagact tctattggct gaattgcaga 1440 gagctcatcg ttagcaccaa cctatgaagt aaaaattaag ccagaggtaa acgacgtaga 1500 aattagcgcc tttttagaca gtggtagccc gatttctcta atttctgaac atattttgcc 1560 tgttaatgtt gttataagtc cgtacacgcg taaatttcaa tttgaaggag tcaatcaaag 1620 taaattaaat atattaggag agcttaaaca aaatgtatct gttaattcgt ttaaaattcc 1680 aattactttc tatgttgttc cggaaaatac tataaatcct gtctgtttat taggtcggga 1740 ttttctttca tatgatggac tgcaggttaa gtttcagggt aataatgttt gtattgatgt 1800 taaaagtaga ttagttgaag aaccgatagg tgaatgtatt tctgatattt taaatattga 1860 tatttccaat aacgtaaaac ttaatgaaca tgattttgat ataaataatg aattatcatg 1920 ggaaattcaa agtaatgtta aaaatattaa ggaaaaatat aatgacgttg taaaacctaa 1980 agagcccgta acaaacttag agctagagct cacagttaag cccgattgtc aaccattttg 2040 tttcagaccg cgaagacttt cctatgcgga aaagttagct gtaaccgaaa ttattaatga 2100 cttactaagt agagatataa ttcgaccaag caattctcaa tttggaagcc caatcgtatt 2160 agtgcgtaag aaagacggaa atatccgaat ggcagttgat taccgagagt taaataaaat 2220 cactctcaaa gatcactacc caattccgcg gatagatgac cacatagatc agttgaaaga 2280 caaagtctac tttactacac tagacttaaa agatgctttc caccacataa agattaagga 2340 atcctcaatt ccatacactg catttgtcac cttcaagggg cagtatgagt atatgcgaat 2400 gccatttggc ttgactaatg gtcctcctca ctttatgaga tttataaatc ttgcatttag 2460 gaaattgtta gttgcagaga aaatattagt ttatatggat gatatcctta tcgcgacatc 2520 ttctgtggaa gaaaatttaa atactttaat cgaagtttta gaaatcctgg ttcaaaattg 2580 cttagaatta aaatttgcta aatgtagttt cttgcaaacc aaaattaatt acttggggta 2640 tttagtggat gataaaggga ttcgtccgtc cccagcaaat gttgctgcag ttgcaaatta 2700 tccagtaccc aaaaattttc gcgaattgca tagcttccta ggattaattt cttatttccg 2760 aaaatttatt cataactttg ctattgtcgc ccagccacta tataagttgt tgaaatctaa 2820 aatcaatttc gtatggtcct aagaagaatc aaattgtttt gagcgactta agaatttcct 2880 aatatcagag ccgctgttag ctatttattc acccactgca cttactgagt tgcattgtga 2940 tgcaagtgca caaggctatg gttcaatcct tatgcagaaa cagtctgaca ataaatttca 3000 ccctgtttgt tattttagta aaagaacaac cgaagcagaa tcgcgttatc attcctttga 3060 gctcgaagcg cttgctataa tttactctct cgagcggttt cgcgtttatc tgcaaggtat 3120 cccgtttaaa attattactg actgtaacag tctaaaatta acgttggaaa agaaggacat 3180 taacccgcga ataatgcgtt ggtcgcttat tttgcaaaat tttgagtata tccttgaaca 3240 tagaggcaat gagagaatgc gccacgcgga cgctatcagt aggcacagcc aattaatggt 3300 gatttcagaa aatagtttgg agcgtaattt agctctgtta caagataaag atgacgacat 3360 cacaaaatta aagctcgagc ttgaaaaaaa tcagagtaaa acatttgaat tatcaaacgg 3420 cttagtatac cttaaaagct atgatttgct actattttac gtccctaaaa gcatggaaaa 3480 tgctgtaata caagagtgtc atcacacggt aggtcatcaa gggtgtgaga aaacattagg 3540 gttcttaact agggtgtatt ggttcccgaa tatgaaaaag aaaattcaga aatatattga 3600 caactgtttg aaatgcataa cttactcaaa aaaatcgtca gggaaaccgg aaggtaaatt 3660 ttattcgtat gataaagaag gagaaccatt ccattgcgta catattgacc attacggtcc 3720 tttagccact actaaattta aataccgtta tatctttgaa attatcgact catttaccaa 3780 atttgtcaaa ttttatccag tgcgtactac taacgctgac gaagtaatta aatggctaga 3840 gtgctatttc tcccactata gcaagcccgt caagttaatt tccgacaggg gtactgcttt 3900 cagttctagt aaatttactg aattctgtaa agattatggc attcaacatg ttatgattgc 3960 aactggcact ccccatgcca atggccaaat tgaaattgtt aatagaacgt tgaccccaat 4020 actagcaaaa ctgactacct cgccgactca gtggtacgaa gttttagcga aagtcgaatt 4080 tgcctttaac aatagtatta atcgggcgac aggtgagacc ccatccatga tcttatttgg 4140 gataaatcaa aagggaaatc ataacgatga tttaagacaa attttagaat ttcaacaccc 4200 agtttttcgc gacctagaac aaagccgata caatgcgatt aataagttac aaaatagtgc 4260 taattattat gaaaagtatt ataacagtaa acataaggca cccggaaaat atgaagttaa 4320 tgattacatt atgataaaaa atacggacgt gacacctgga ataaataaaa aattaatacc 4380 tgcttttaag ggtccctatg ttatcaaaaa tcctgagggg caccagacta cccaactacc 4440 ttatgaagga atatgttcac ctatgaacat gaaaccgtgg ttacctaata gttagccaaa 4500 cttatttgtt ttaagtttgc ttaatttttc aaacttaatg ttttatactt ttctaatttc 4560 tgttttggat tctgaatgtt attgttattt attttcataa tttaaatttt aattttgcta 4620 agaattactg tttgattgag agaccaatca aggtcaggtt ggacgagc 4668 // ID DNA8-71B_AP repbase; DNA; INV; 649 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-71B_AP. XX NM DNA8-71B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-649 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2006-2006 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 649 BP; 216 A; 71 C; 82 G; 280 T; 0 other; tagagcgcgg atatttaggt ttttacatat ttttattggc tttatgcagc tctacgagga 60 tgagaaatgt ggacgattta tccatcaaag agttcgtagg aaaaaattta ttttgcatat 120 taaagcatat tttggacatt agagaatatt tggcctattt tacatatttt agaatatatt 180 ctattattgt ttctaaaaca tcatgtatat tgttaatttc tcgatttgtg attttttcta 240 gctacaatat tttatgaagt acatcacatt ttttatattc aaataactta gctgtcacca 300 aagggcatgc aagttatgta ttgtttttat taaaatattt tgttttttta tatatatatt 360 tcattatata atcatttcac tatcatttca catatcatta tacctatgta tgaattgttt 420 taaatttttt atatttaaat aatttacctg ttacctacca aaggaatgat agtgtatact 480 gtataactat aaagttatat attgttttta ttgaattatt ttgaaaaaaa aaatcatatt 540 atttgcatat ttaaaagttt aaagcatatt tagagaatat tttagggttt ttttagagca 600 tattagcatc atattttaag cttttaagaa cctaaaaatc cgcgctcta 649 // ID Gypsy-102_AA-I repbase; DNA; INV; 5082 BP. XX AC AAGE02017444; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-102_AA_; KW Gypsy-102_AA-LTR; Gypsy-102_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5082 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017444; Positions 51615 56696. XX CC Positions [4093-4413] - Integrase core CC 'ATGCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1715..4015 FT /product="Gypsy-102_AA-I_2p" FT /translation="MDAGEFECMMVSSINKINNPCLVNLLVDGKNLNMEVD FT CGSSVTVIGKNQYFRNFNKNLSKCKKQLVVVNGNKLNIMGEANVLVIFRGI FT KAKLQLLVIDYENEFIPLLGRTWLDVFFTDWRNFFTSSAINGLVEHNVDST FT IDEVKKKFSNVFIKDFSSPIKGFEAELVLKTDSPIYKKAYNVPYRLRDKVV FT DYLNRLEKEGVITPIKTSEWASPVVIVMKKNNDIRLVIDCKVSINKLIIPN FT TYPLPTAQDLFAGLADCKVFCALDLEGAYTQLSLNKRSRKFMVINTIKGLF FT VYNRLPQGATSSASIFQQVMDQVLEGLAYVYCYLDDVLIAGKNFSDCKEKL FT LLVLQRLKNANIKVNWEKCKFFVLELDYLGHVLSEKGLSPCQDKIATIQKA FT KAPRNTTELKSFLGLINYYNKFIPHLSSKLYHLYNLLKKNVTFKWDENCDK FT AFEIGKCSLLSTNFLEFYDPKKAIVVVSDASSYGLGGVIAHEVDGVEKPIC FT FTSFSLNSAQKSYPILHLEALALVCTIKKFHKYLYGQHFKVFTDHKPLVGI FT FGKEGKNSIFVTRLQRFVMECSIYDFDIAYRPSGRMGNADFCSRFPLDEPV FT PNSYDVEYVRNINFSNEFPLDYSAIAMETKTDEFLQQIILFMRHGWPDKVE FT KQFVNVFANQYDLELVDECLLYQERVVVPSKMQKNILKLLHTNHNGIVKMK FT QLARRTVYWFGINTDIEKFVARCETCNSMAMVPKPKITSKWTPTTKPFSRI FT HVDFFFLLQSYILANS" XX SQ Sequence 5082 BP; 1618 A; 833 C; 1116 G; 1515 T; 0 other; gctggcgaac gagaaaattt ggaaagcgtg cataatttaa agtgctgcgg gtgataagtc 60 cgtggtgata ttgtagtgat tgaatttggg caacttgtgg agaaaaccgt cagcgtcatt 120 agcgtggaaa gaaccgcatc ggcagtttga gaaaagttga caacagcggc gggagaaatt 180 tttcgtttct acgattcgac gtgaggtgat ccgtacccag tagaggtgat tttagttggc 240 gttgtcggaa gtcatccggt ttttggagta agtgcagtga atgtacggtg gaagtgattg 300 ttcttttcat ttctatgcac ctgatgccac ggggctattc agtggacaaa tcaggcgaat 360 atcgccattg ttttttgatt gtgctaggtc atgaacagtg aataggcggt gcacatttag 420 cagaaccatt taaaaaaaaa aaaaagtaga attgttaaag aaaacgaaac acttctattg 480 aagattgttt taagattcta attaattgat atttttctat tgtttattta ggaatctgct 540 gattggcacg tactatccct acttactggt agccactccc tgggaaaaag acctggtgct 600 tgcgaacaag cagaatatcg ctcaagtgac gaaaccgact tctacaacaa tcttccgttg 660 tttgttccgc tggctgactg gctagtatcg gggagactca atagtatctg gtaagctact 720 caaattttct ttgcattcct gtgattggtg tgaagtttta cgagatggcc gctcttacga 780 cgattatgga gccctataga aaggggacat cttttactga gtgggctgag cgtttggcgt 840 cggtttttcg gttaaataaa attaaagatg aagacaaaaa ggattacttt gctacgttat 900 gtgggccggc cgtttacagt gaactcaaac tgttgtttcc caacgataaa tatgacgata 960 tcgtctacga caatatgatc acaaaattaa aagggagatt tgacaaatcg gagtcggata 1020 tcattcaaag atttaaattc aatcaccgtg tgcagctgcc ggatgagacg atagaagatt 1080 tcgttctttc ggtgaagctg caggcggagt tttgttcatt tgaaaattac aaaaaaaggc 1140 aatccttgat cgcataatag ccggggttaa agacaaagca ctgcagcagc gtttgttgag 1200 tgaagaaaat cttactttag aaaacgcgga aaaaatggtg gttacttggg aaatggcctc 1260 ttgtaatgct agaaaaatta gttctaatga agtttacggg caaatttctt ctctccgtca 1320 ccgcggtcca acgggaaaaa ctttcaaaaa gcttgcggac acttttcagt tagcagcagg 1380 gtcgtcatgg caaaataatc cgagaggacc agtgaagaac cgtttaggtt accagtaaca 1440 gcaaggacag cggtcttaca atcggccgaa acaaataggc tggaacaaag ggtcctggaa 1500 acaaaaggaa tacgaggata gattttgcga attttgtggg tcgaaaggac atttgaaacg 1560 caagtgtttt aaattgaaaa acctgaaaag agattctgta cagtttgttg aggattggaa 1620 gacggaaaat ggcgttgata aggatggcga aaccagtatt agtggacttt tcaacagact 1680 taaggcggac aattcggaca gtgaggacga tgacatggat gcaggtgaat ttgaatgcat 1740 gatggtttcc tcaatcaaca aaattaacaa tccttgttta gtaaacttat tggtcgatgg 1800 taaaaattta aatatggaag tggattgtgg atcttcggta acagtaattg gtaaaaatca 1860 atatttccgg aatttcaaca aaaatttgag caaatgcaaa aaacaattag tggtcgtaaa 1920 tggcaacaag ttaaatatta tgggagaggc aaatgttctt gtgattttca gaggcataaa 1980 agcaaaacta cagcttttag ttatcgatta cgaaaatgaa tttattccgt tgttaggcag 2040 aacatggcta gacgtttttt ttacggattg gaggaatttt tttactagtt cagcgattaa 2100 tggattggtt gaacataatg ttgattcaac aattgatgag gtgaagaaaa agttttcaaa 2160 tgtttttatc aaagattttt catctcctat taaagggttt gaagctgaac tagtgttgaa 2220 aactgattcg cccatttata aaaaggctta caatgttcct tatcggctgc gtgacaaggt 2280 tgttgattat ttaaatcgtt tggaaaaaga aggggttatt actccgatta aaactagcga 2340 atgggcgtca ccagttgtta ttgtcatgaa aaagaacaat gacataaggc ttgttatcga 2400 ttgcaaggtc tccataaata agctcatcat acccaacact tatcctttgc caacagctca 2460 agatttgttt gctggtttgg cagattgcaa agttttttgt gcgcttgact tagaaggagc 2520 atatactcag ctatctttaa ataaacgatc cagaaaattc atggtgataa atacaattaa 2580 aggacttttt gtttacaaca ggctacctca aggagcaacg tcaagcgcat caatttttca 2640 gcaagtaatg gatcaggtgt tagaaggttt ggcatatgtc tattgttatt tggatgatgt 2700 attgatagca gggaaaaatt tttcggattg taaagaaaag cttctcctag ttttgcaaag 2760 gctaaaaaac gcaaacatta aagtaaattg ggaaaaatgc aaattttttg ttttggaact 2820 tgactacctt ggtcatgttt taagtgaaaa ggggctatct ccttgtcaag ataagatagc 2880 tacaattcag aaggcaaagg ccccgagaaa tactactgag ttgaagtcct ttttagggct 2940 aataaattat tacaacaaat ttattcctca tttatcttca aaattgtatc atttgtacaa 3000 tttgttaaag aaaaatgtta cttttaaatg ggacgaaaat tgcgataaag cttttgaaat 3060 tggaaaatgc tctttactat caaccaattt tttggaattc tatgacccta agaaagcgat 3120 tgttgttgtt tccgacgctt ccagttatgg actaggtgga gttattgccc atgaagtaga 3180 tggtgtcgaa aaacccatat gtttcacgtc attttcctta aattcagccc agaaatctta 3240 tcctatactt catttagagg ccttagcatt agtgtgcaca attaaaaaat tccataagta 3300 tctttatggc caacatttca aggttttcac tgatcataag ccattagtgg gtatatttgg 3360 gaaagaagga aaaaactcca tatttgtgac tcgacttcag cgttttgtta tggaatgctc 3420 aatttatgat tttgacattg cgtatagacc ttcagggaga atgggaaacg ctgatttttg 3480 ttccaggttt cctctggatg agccggtacc aaatagttat gacgtagaat atgtaagaaa 3540 tataaatttt tccaacgaat tcccccttga ctatagtgca atcgcaatgg agacaaaaac 3600 ggacgaattt ttacaacaga taattttatt tatgcgacat ggatggccgg ataaggtcga 3660 gaaacagttt gtcaatgtgt ttgctaatca atatgattta gaattggtag atgaatgtct 3720 attatatcag gaacgagttg ttgtacccag taaaatgcag aaaaacattc ttaagctact 3780 tcatacaaac cacaatggga tcgtcaaaat gaagcaatta gctcggcgta cagtttactg 3840 gtttggcata aatactgaca tagaaaaatt cgtagctcgt tgtgaaacat gcaatagtat 3900 ggcaatggtg cccaaaccaa aaataacgtc taaatggacg cctactacta aaccgtttag 3960 tagaattcat gttgattttt tttttcttct ccagtcatac attcttgcta atagttgata 4020 gcttttctaa atggatcgag gttgaatgga tgaagaaagg gactgattgc ccgaaagtcg 4080 ttaagaaatt agttgttttt ttttgctcga tttggactac cagattgctt agtgtcagac 4140 gggggtcctc ctttcaactc acattccttt ataaattttc ttaaaaaaca aggcataaat 4200 gtgcttaaaa gtcctcctta caatccagca agcaatggcc aagcggagag acttgtaaga 4260 acggttaaag aggttttaaa gaagttcctc attgatccag aagttatgga acttgatttg 4320 gaggatcaga ttaatctttt tctgtttaat tatagaaaca cttgtttgac ggaggatgag 4380 gccttcccat ctgaaaaagt tttttctttc aaaccaaaga ccattattga cctaatcaac 4440 cccaaaaaca gacttaagta taacatctct acacagtctg aacgctcaca agatgatatg 4500 tcgtttgaat cgggtcagat tgacaagaaa gaccctttag acaacctgat gagtggtgat 4560 aacgtatggt ataaaaacca caatccacat aataccagaa gatggataaa agcggtattt 4620 atagcaagat tctctcgtaa tgttttccag gtggaaattg gaagcgtgcg aacaacggca 4680 catcgcacac aaattcgacc gttttgtagt ctggatgagt ttgatcgccc caacgtgatg 4740 tttccagtgt cggatactcc aagcatggta gacgcaagca atacaccgtc accggtcggt 4800 cgtgaagaac ttacagcatg ctcggtaccg acgtcgatcg gtggtgatga acccgcagtt 4860 cgttcggaac cattgagtat ggaaagtcgt acaaaacgga agaggagatt ggggacaccg 4920 acgatttcaa tcactgattt aaggcgatca aaacgagcca ggcctaataa gggcgattca 4980 gattttgtat attataagta gacaatatag tttcaagatg aattaatgtt cgttcgaatt 5040 tcaaactaga attaaggtca agtttttaaa gggtgaagaa gt 5082 // ID Gypsy-44_AA-LTR repbase; DNA; INV; 289 BP. XX AC supercont1.385; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-44_AA_; KW Gypsy-44_AA-I; Gypsy-44_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.385; Positions 216481 216193. XX SQ Sequence 289 BP; 90 A; 67 C; 58 G; 74 T; 0 other; tgtggtggat acatggctta agtggttgcg ataaacgcaa cccaataaag taaacatgga 60 tcagcataaa cgcgatcacg ccatccgggg cgatgagcgc atttatgctg agtaaagcaa 120 aaacagtcac gtagccaccg cccgagtagg tttcctctag cataagtttc tttgaataaa 180 tgaattcttg ttcggacctc attcggatca ctcgagtcat ttagtatcac gccccgacat 240 atatcgtcaa tggacttcat catcatcaac caaataagtt aacaattca 289 // ID BEL-9_CQ-I repbase; DNA; INV; 5976 BP. XX AC AAWU01008465; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_CQ_; KW BEL-9_CQ-LTR; BEL-9_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5976 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 171-171 (2011). XX DR GenBank; AAWU01008465; Positions 6368 393. XX CC Positions [4980-5558] - Integrase core CC 'AATC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 429..5924 FT /product="BEL-9_CQ-I_1p" FT /translation="MQLRSGAKKCLFFTPCGKSPGSPVLRVTTPVLSVREQ FT LEARRAEKTRIAMAETVAALASCCEKTKAKVTRIKRAVLTAEQDHKKFSVH FT ALKLYLKTTDAAYEEFNSFQNKIYLADPTKKDEFEPKFIDFEELYEFTRIA FT LCEMLQAYEDDEKAALAAAAQVAAERAIKPVVKCQHGDSSSSVPVSFPPTL FT VLQQSALPTFDGRYENWFRFKQMFCDIAEKCTADSAATKLHYLDKALIGKA FT QGAIDQQIIRDNDYAQAWVSLQEQFENLPALVNDTVSRLLGVKAMMSDSYV FT QLKNLLDDVEKSVNSLEYHNLKMDRLSEAIITNLTASKLDMATRKIWESSV FT ARGALPEYKKMMKVLRNQQAVLERCERAKPTQKTKVTNSNPRTSTPPTKAH FT TATVGKHESCAMCNAEHLLEKCEAFLKLNVNARYGKAKQFGLCFRCLKRGH FT RTADCKQKDKCSECSRAHHTLMHPDSKKEPEKQPPETSAAGTNASSATVSK FT DGSTSTNCPLYCNTNETPKQVLLATAVVNVVDACGVQHKCRALLDSGAMAN FT FMSQSFADLLNLKKMPANVPVVGVNGMETVVKFKVEAKVESRTTAYNFCLN FT YLVMPKVTGPLPVDKVEVERWPIPKDLALADERFYEPGRIDLLIGAEVFFE FT LLQSGKMMMSAELPILQESVLGWLVSGRVAETGTTGTVRVCHALAKQTPEA FT QLAGLLRQFWSVDEQEFPATTSETSSDCEKHFLETHSRNESGRYVVRLPFR FT SNVGELGLSREQAEKRFYALERRLDKNANLKQQYKMFIDEYISLGHARVID FT DAKADANYLPHHCVLKPDSATTKLRVVFDASAKSSTGLSLNDVMMTGPTVQ FT RTLFDIVLRFRCHKYVFTADVPKMYRQVEVHEKDRKYQQILWRDDHQQPLR FT TIELSTVTYGTAAAPFLATRALNQLALDERHDFPEASTSVLNNFYVDDVLA FT GADDLCQAKQLQQDMIEMLARGGFQLHKWCANHAALLEEIPVENRAKTLNF FT EKNDEGEVVKTLGLLWDPVSDVFKFSVKPFKKSTERPTKRQIMSDIARLFD FT PLGYLGPAVMIAKLVMQQLWKEKFHWDDPVPEDVAKMWERFRSELCEISEL FT KIPRAVAAAVDVTSYELHGFADASMKGYGCSVYLRSLKRDGTAEMMLLCAK FT SRVAPLKELNRQPKDDAELEDLTIPRLELCASKLMVEQVVKVLDAVELDIS FT RVVLWTDSQIVLDWLKRLKPDTSVFVRNRVAAILKLSKKFEWRHIPTALNP FT ADLISRGVYPLKLIVCDLWWHGPEFLRTTVATVQPPNLVADEANPDGDQPG FT DPDPEIHVLAASTDTNENPPLAVINDCSDYRKLQRIFGYVTRFLYNCRHKP FT AERRGSYLKIGELRNAQLLMVMVVQHAVYGEEISCIRKNQPVKGKLRNLNP FT MYDETERVLRVGGRIRHSDLPRDQKHPMILPERQHLTELLINTLHRENLHV FT GRNGLLAVIRRAFWPVNAKRTIYRVLKMCVQCFRVKPSDVPQFMGDLPSSR FT VTEALPFSRTGVDYAGPLLLKQGRMRSPVKAYIALFVCMTTKALHLELVTS FT LSAEAFLGALHRFVGRRGNVSVMRSDRGTNFVGGDRQLKELHELLKTQLLE FT RKIADFCQVRGIDWKFNPAKAPHQGGLWEAGVKSVKHHLNRTLKEAYLTYE FT ELNTLLVQIEAILNSRPLCQQSDDPCDYQALSPAHFLIGRELTAVAEPLYG FT GVRENTLTRYQIVQKRKQDFWRRWSRDYVTELQKRGKWDKAPAMIRIGMLV FT MLKEDNTPPQTWRLGRIVDTHPGGDGVVRVVTVRTSNGATFSRPTTQIAIL FT PIQDNEEEKD" XX SQ Sequence 5976 BP; 1493 A; 1537 C; 1806 G; 1140 T; 0 other; tttggtcact tcgaacctga tatcgaaccg cgagtgaata tccggattgt tacgccgggg 60 acggaatttc ggtgaaattg tgggcggcga gacggattca tccggtgccg cttggaattc 120 cgcgcgtagc tcagattgtg gactcccgcg aggcctgtgg ccaatttgat tgttgggagg 180 gttgttgaca cgttgattgt ggacgcaacg gggaagtgac ggccgtttca aggtgattga 240 tgcccgctga acctgcgctg aagagcggtt ctgcggcaag gatttgatcg gccgaagaag 300 tgagtgaaag aagaagacgt gagagtccac aaactgagcg agaagtgtga ttcggcgaaa 360 agtcgaaaga agaagaaaaa gtgcttctga aaaactaaac tgaaagagaa agaaaagtga 420 aaaagggtat gcaactgagg tctggggcaa aaaagtgctt gttttttacg ccctgtggaa 480 aaagcccagg atcaccagtg ctccgagtga ctacgccggt tctgtcggtg cgtgaacagc 540 ttgaagcgcg aagagcagag aagacgcgta tcgcgatggc ggagaccgtc gcggcgctcg 600 ctagctgctg cgagaaaacc aaggcgaagg tgacgcggat taagcgggcg gtcttgaccg 660 ctgagcagga ccacaagaag ttcagcgtac acgcgttaaa gctgtacctg aaaacgacgg 720 atgctgccta cgaagaattc aactcgttcc agaacaagat ttacctggcc gaccccacga 780 agaaggacga gtttgaaccg aagttcatcg attttgaaga gctgtacgag ttcactcgta 840 ttgctctctg cgagatgctg caagcttacg aggacgacga aaaggcagcc cttgccgcgg 900 cggcgcaggt cgctgcagag agagctatca aaccggtcgt gaaatgtcaa cacggcgaca 960 gttcctctag tgtccccgtg tcgttccctc caaccctcgt gttgcagcag tcagcgctcc 1020 caaccttcga cgggcgctat gaaaactggt tccggttcaa gcagatgttt tgcgatatcg 1080 ccgagaagtg cacggcggat tcggcggcta cgaagcttca ctacctcgac aaggcgctga 1140 tcggaaaggc tcagggagct attgatcaac agataatccg ggacaatgac tacgctcaag 1200 cctgggtctc cctccaggaa caatttgaga acctgcctgc gttggtcaac gacaccgtct 1260 cgaggctgct gggcgtcaag gcgatgatga gtgattcgta tgtccagctg aaaaacctgc 1320 tggatgacgt cgagaagagt gtgaactcct tggagtacca caacctaaag atggacagac 1380 tttcggaggc catcatcacc aacttgaccg cttcgaagct tgacatggcg acacggaaga 1440 tctgggaatc cagtgttgcg cgtggtgctc tccccgagta caaaaagatg atgaaggtat 1500 tgcggaacca gcaggcagtg ctggagcgct gcgaaagggc gaagcctacc cagaagacca 1560 aagtgacgaa ctcgaacccc cgcacatcga cgccacctac gaaggcccac acggcgactg 1620 ttggaaagca cgaaagctgt gcgatgtgca acgctgaaca cctgctggag aagtgcgaag 1680 catttctgaa gctgaacgtg aacgcgcggt acggcaaggc gaagcaattt ggactctgct 1740 tccgttgttt gaaacggggc caccgcacgg cagactgcaa gcagaaggac aagtgctcgg 1800 agtgctcgcg agcacaccac acgctgatgc accccgacag caagaaagaa cccgagaagc 1860 aaccaccgga aacttcggct gccggaacga acgctagcag cgccactgtt agcaaggatg 1920 gatcgacctc gacgaactgc cccctctact gcaacaccaa cgaaacacca aagcaagttc 1980 tgttggcaac cgcggttgtc aacgtcgtcg atgcatgtgg agtacagcac aagtgccgag 2040 ccctactgga ctcgggcgcg atggccaact tcatgtcgca gagcttcgcg gacctgctga 2100 acctgaagaa gatgccggcc aatgtcccgg tcgttggcgt gaacggaatg gagacagtag 2160 tgaagttcaa ggttgaggcc aaggtcgagt cccgcaccac ggcgtacaac ttctgcctca 2220 actatctggt gatgccgaag gtgactggac cgctgcccgt ggacaaggtc gaagtggagc 2280 ggtggccgat tccgaaggat ttggcgctgg cagacgaaag attctatgaa cccggccgaa 2340 ttgatttgct catcggagca gaagtgttct tcgagttgct gcagagcggc aaaatgatga 2400 tgtctgcgga actacccatt ctgcaggaga gcgtcctagg atggctggta tcgggacgtg 2460 tagctgaaac aggaacgacg ggcacggtgc gagtctgcca cgcgctggcc aagcagacac 2520 ccgaagcaca actggctggt ttgctgcggc aattctggtc ggtcgacgag caagaatttc 2580 ctgcaacgac ctcggagacg agcagtgact gtgagaagca tttcctggag actcacagcc 2640 ggaacgagtc cggacgctac gttgtgcggt tgccattccg tagcaacgtg ggcgagctgg 2700 gactgtcgag agaacaagcc gagaaacgat tctacgcact ggaacgtcga ttggacaaga 2760 acgcaaacct gaagcagcag tacaagatgt tcatcgacga gtacatctcg ctcggccacg 2820 cccgagtgat cgacgacgcc aaggcggacg caaactatct accccaccac tgcgtactga 2880 agcccgacag tgcaaccacg aagctcaggg tagtgttcga tgcgtcggcc aagagctcca 2940 ccggtttgtc gctgaacgac gtgatgatga ctggcccaac agtgcagcgc accctcttcg 3000 atattgtgct gcgattccgg tgccacaaat atgtcttcac cgccgatgtg ccaaaaatgt 3060 atcggcaggt ggaggtgcac gaaaaagatc ggaagtacca gcagattctc tggagagatg 3120 atcaccagca gccattaagg acgatagagc tttcaacggt cacctacggc acggctgcgg 3180 cccccttttt ggcgacacga gcgttgaatc agttggcgct ggacgaacgg catgactttc 3240 ccgaagcaag cacatccgta ctgaacaact tctacgtgga tgacgtgctt gccggagcag 3300 atgacctatg ccaagcaaag cagctccagc aggacatgat cgagatgctg gcaagaggcg 3360 ggttccagct gcacaagtgg tgcgccaacc atgcagcgct gctggaggag attccggttg 3420 aaaaccgggc caagacgctg aacttcgaga agaacgacga gggcgaggtt gtcaaaacgc 3480 tcggcctact ttgggacccg gtgagtgacg tgtttaagtt cagcgtgaag ccgttcaaga 3540 aatcgacgga acgtccgacg aagcggcaga tcatgtccga cattgcccgt ttgttcgatc 3600 cgctcgggta cctcggacca gctgtgatga tcgccaaact tgtgatgcag cagctctgga 3660 aggagaagtt ccactgggac gacccagtac cggaggatgt ggcaaagatg tgggagagat 3720 tccggtctga gctgtgcgag ataagcgagt tgaagattcc acgcgccgta gctgcagcag 3780 tagacgtgac gagctatgaa ctccacggct tcgcagacgc gtcgatgaag gggtacggct 3840 gctctgtgta cttgcggagt ctgaaacgag acggtactgc ggagatgatg ctgttgtgtg 3900 cgaagtcgag ggtcgcaccg ttgaaggaac tgaaccgcca accgaaggac gacgctgagc 3960 tggaagattt gaccattccc cgactggagc tctgcgcgtc gaagctgatg gtcgagcaag 4020 tggtcaaggt tctcgacgcc gttgaactgg acatcagccg cgttgtgctc tggacggact 4080 cgcagattgt tctggactgg ttgaagcgct tgaaacctga cacttcggtg ttcgtgcgca 4140 accgggttgc tgcgattctg aaactgagca aaaagtttga atggaggcac ataccaaccg 4200 cactgaaccc agctgacttg atctcacgcg gtgtgtaccc actgaagctc atcgtgtgcg 4260 acctgtggtg gcatggccca gaattcctgc ggacaactgt tgccactgtc cagccgccga 4320 acctggttgc cgacgaagca aaccctgacg gcgatcaacc tggagatccc gacccagaaa 4380 tccatgtatt agcggcctct accgacacga acgagaatcc acccttggcc gttatcaacg 4440 attgcagtga ctacaggaag ctgcaacgga tctttgggta cgtgacgagg ttcctgtaca 4500 actgccggca caagccggcg gagcgacgcg gatcctactt gaagattggc gagctgcgga 4560 acgcgcagtt gctgatggtg atggtggtac agcacgcagt gtatggagag gaaataagct 4620 gtatccgcaa gaaccagccg gtgaagggca agctccggaa cctgaacccg atgtacgacg 4680 aaacagaacg tgtgctgcgg gtcggtggac gcatccgtca ttccgatctg cctcgcgacc 4740 agaagcaccc catgattctt ccggaaaggc aacacctcac cgaactcctg ataaacacac 4800 tgcaccggga gaacttacac gttggtcgca acgggttgct ggctgtgatt agacgtgcct 4860 tctggccagt caacgcgaag cgaaccatct atcgggtgct gaaaatgtgt gtgcagtgct 4920 tcagggtgaa gcccagtgac gtgccgcagt tcatgggaga tctgccgagc agccgagtga 4980 ccgaagccct acctttctcc agaaccggcg tggactacgc cggaccgctt ctactgaaac 5040 aggggagaat gagatctcct gtgaaggcat acatcgctct gtttgtatgc atgacgacga 5100 aggctctgca cctcgaactg gtgacgtcgc tctcggctga ggcctttttg ggcgcactgc 5160 atcgttttgt tggacgacga ggaaacgtgt ccgtgatgag gtcggaccga ggcacaaact 5220 tcgtcggagg agatcgccaa ctgaaggagc tgcacgagct gctgaagacg cagctgttgg 5280 agcgcaagat cgcggacttc tgccaggtcc gcggaataga ctggaagttc aaccccgcca 5340 aggccccgca ccagggtggc ctctgggaag caggagtgaa aagtgtgaag caccacctga 5400 accgcacgtt gaaggaggct tacttgacgt acgaggagtt gaacacactg ttggtgcaaa 5460 ttgaggcgat tttgaactcg cgccctttgt gccagcaatc cgacgatccc tgcgactacc 5520 aagcgttgag cccggcacat ttcctcattg gccgagagct caccgcggtg gctgagccgc 5580 tttacggagg agtacgagaa aacacgctga ccaggtacca aatagtgcag aagaggaagc 5640 aagatttttg gcgtcgctgg tcccgcgact acgtgacgga gttgcagaag cgaggaaagt 5700 gggacaaggc gccggcgatg atcaggatcg ggatgctagt tatgctgaag gaggacaaca 5760 cgccgccaca gacctggcgg cttggtagaa tcgtcgatac tcatccggga ggcgacggcg 5820 tggtccgcgt tgtgacggtc cgtacaagca acggtgccac gttcagtcgg ccgactacgc 5880 agatcgcaat tctgccgatc caggacaacg aggaggagaa ggattgagcc cagctcaacg 5940 gggggagaat gttgcgtgaa aactagattt aagatt 5976 // ID Gypsy-16_IS-LTR repbase; DNA; INV; 132 BP. XX AC ABJB010391617; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_IS_; KW Gypsy-16_IS-I; Gypsy-16_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-132 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010391617; Positions 9623 9754. XX SQ Sequence 132 BP; 49 A; 18 C; 37 G; 28 T; 0 other; tgtttaagag gcagacaaga aggaagacga cgataggcca ggatatatca agaagaggag 60 gatgaagaag tggagagtat aaaaacacca gttttaatct tgacgcgttt acgaatcgtt 120 atacttggcg ca 132 // ID Transib-N1_CQ repbase; DNA; INV; 1011 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Transib DNA transposon family from Culex DE quinquefasciatus - consensus. XX KW Transib; DNA transposon; Transposable Element; nonautonomous; KW Transib-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1011 RA Kojima K.K. and Jurka J.; RT "Transib DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 632-632 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. 457-bp TIRs. 5-bp TSDs are usually CANTG; indicating CC it is a non-autonomous Transib element. XX SQ Sequence 1011 BP; 322 A; 191 C; 185 G; 312 T; 1 other; cagtgggcga aatggaaccc aaaatcggac ttaattgaac gcggctggtt ccctggtatg 60 aawaatagtg tttcttatgt aaaaaaatcc ggggaatcga ttggtgatgg tttcatccac 120 cgcacgaaac ggcaaagggt ccattttgcc ccaattcccc attttttaac attttttctt 180 gaaaatcggt ctgtatttag agcggaggct ttatgcggcc ttccaaaatg cacttaactt 240 atgtgaaaat gtcccaggaa tccagtaaaa ataaccactt gcaccgcaaa aatcatctag 300 ggtccattta accccaattc cgctataaaa gcatttttgg ccgttttcaa atgttaggtc 360 agattttaaa aatctgaaaa tatttttatc gtaaagatca gacaatttta cataagaatg 420 acgatttgca cttgaaagtt tgacacattt atgttgatat agaaattaaa ctaaataaaa 480 agattgaaga atcactttcg ggccaatttt cccaaacgca gaataaactg cctttcgcat 540 acattttatt tttatcaaca taaatgtgtc aaactatcaa gtgcaaatcg tcattcttat 600 gtaaaattgt ctgatcttta cgataaaaat attttcagat ttttaaaatc tgacctaaca 660 tttgaaaacg gccaaaaatg cttttatagc ggaattgggg ttaaatggac cctagatgat 720 ttttgcggtg caagtggtta tttttactgg attcctggga cattttcaca taagttaagt 780 gcattttgga aggccgcata aagcctccgc tctaaataca gaccgatttt caagaaaaaa 840 tgttaaaaaa tggggaattg gggcaaaatg gaccctttgc cgtttcgtgc ggtggatgaa 900 accatcacca atcgattccc cggatttttt tacataagaa acactattat tcataccagg 960 gaaccagccg cgttcaatta agtccgattt tgggttccat ttcgcccact g 1011 // ID Slatif1cons repbase; DNA; INV; 491 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Slatif1cons. XX OS Scaptodrosophila latifasciaeformis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Scaptodrosophila; OC latifasciaeformis group. XX RN [1] RP 1-491 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with show less than eight percent divergence. CC Slatif1cons. XX SQ Sequence 491 BP; 133 A; 130 C; 134 G; 94 T; 0 other; ttggggtgcc gcacgaatga cgccaaataa ccttctggac cgaatcacgc tgcgatatgc 60 tgctgaaacg gaacgaactc cgacccattc tgaaagcgga tggtgacggc gacgaaaaat 120 ggatcacata cgacaatatc aagcgaaaac ggtcgtggtc gaaggccggt gaatcgtccc 180 aaacagtggc caagccggga ttgaccgcca ggaaggtttt gctgtgtgtt tggtgggatt 240 ggaagggaat catccactat gagctgctcc catatggcca gacgcttaat tctaccatct 300 actgcgaaca actggaccgc ttgaagcagg cgatcgacca gaagcgtcca gaattggcca 360 acaggaaggg tgtaagtgtt ccaccaggac aacgccagac cacacacttc gttgatgact 420 cgtcagaagc tacgggagct cggatgggag gttttatcgc atccaccata ctccccagac 480 ctcgccccaa g 491 // ID Gypsy-25_IS-I repbase; DNA; INV; 4149 BP. XX AC ABJB011009743; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_IS_; KW Gypsy-25_IS-LTR; Gypsy-25_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4149 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB011009743; Positions 6511 10659. XX CC Positions [3210-3548] - Integrase core CC 'GAGGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 621..4085 FT /product="Gypsy-25_IS-I_1p" FT /translation="MKALTIARQHETVRQQQKELRFPENTESVDQIHTRRK FT FTTPQKAAVAWDRTLSKRYQHDTKAQERSCIWCGHETHPRSRCPAKDAVCR FT SCGKGGHFMTVCRSTSSLRNVTEQQEGLVSTGQLMTASKPATATKNNGKQE FT PGFLGTLSNFGASKWELSILVQSQPVLFKIDTGADETVIPEDLFRSLFKDR FT TLHSPRRQLQGPDGKSLHVLGVASMNLSFKNYSSEEEVYILRGLNTPLLGK FT PAIEKLKVLSEICEVRGLYPEEDFPSLFQGLGTMEGDYDVKLAPDATPFAL FT TSPRRVPLPLFNKTKVELERMQNLGVISPVTEPVEWCAPMVVVPKASGDIR FT ICVDFTQLNKYVQREWHPIPSVEHSLGMLQKATAFSKIDANSGFWQIPLSE FT NSKPLTTFITPFGRFCFNRLPFGISSAPEHFQRRMNDVLLGLEGVACHMDD FT VVIWGSTKDQHDSHLRAVLERLTQAGITLNREKCLFSVTKISFLGHVIENN FT QVRPDPAKLSAILQMPNPTSKKELRRIMGMATYLARFVPNGAQLLEPLSSM FT LSTKHDFVWNQPQQLAFDRWKSMLSSDPVLGIYDPSKETVVSADASAYGLG FT AVLRQKEGEWFKVIAYASRLLTDTERRYAQIEKEGLALVWASEKFRDYLVG FT RNFTLETDHKPLVSLFGSKALDDLTPRLQRMRMRLMRFTYAIVYVPGKDLT FT AADALSRSPILQSEPFELENEIDGYITHVASSMPLTTYSIQKVASEQRKDK FT VCMRLQKIIQTGWPRRRDVELDLKPYWENKDNISIVDDVLVYTSRVIIPES FT MRDQVLRQLHEGHFGMDKCKGRAKDSVWWPGVDSDIERLIKNCHVCLKQSA FT NHKMPLLPTDFPQRPWERVAMDLFFWDGAWWLIVTDYFSRYPELTRLASLS FT SKSVINHCKSIFARHGIPDVVLSDNGPQFSSAEFDKFAKEYSFRHITSSPR FT YPQSNGMAEAAVKIIKTSLKKTGDPYKTLLAYRTSPLKNGYSPAELLMGRR FT LRTTIPVSSKCLVPCLPDLSSVRTYETSQRRKQKQYFDRRHGVRDLVDLDE FT GTDVWVVDLQRQGVVESNANEPRSYWVDTDAGEPIRRNRTHLIPLPSQIPR FT PEIVPVHEDVQRDETEQPEPGPHSAENNHTYTRSGRCVRPPKRYPEQ" XX SQ Sequence 4149 BP; 1204 A; 1047 C; 1011 G; 887 T; 0 other; tggtgtcaga agtgggatcc ggccatgcat aatccctcag cttgcgaaac aaccggctct 60 tcagacgccg aagcgaccgc aaagcagctg ccacggaaca gttctggtca acctctaata 120 caaggtacgc aatcagacat cagctctata atggcgtcgt tccaaatagc tccacccgac 180 ccatttaact tcaccactcc aaacgagtgg cctgcttgga agaaacgttt tcttcggtac 240 cgcacagcct cgggattaca agctgctacc gaagaaaagc aagtcgacac gcttgtctac 300 ttgatgggaa cgcaagcgga ggacatcttc aacactttca agctaagtcc tgagaacgct 360 aagaagtttg aagtcgtgtt aggccactac gagacctatt tcttccaaga cgcaacataa 420 tcttcgaaag agcacgattc aatacccgaa cacagggaga gagtgaatca gtggaagact 480 ttgcgacggc attgcacaca ttagccgaca cctgcaactt tggcgagcta aaagaagaac 540 tcatcaggga tcgcttggtc gtgggcctgc aagacagaaa agtttcagaa aaactacagt 600 tggactcgtc gctggacttg atgaaggcac tcacaattgc ccgtcagcat gagacagtcc 660 gacaacagca gaaagagcta cgattcccag aaaacacaga atcagtagac cagatacaca 720 ccagaaggaa atttactacg ccgcaaaagg ctgcagtcgc ttgggacaga actttatcaa 780 aacgttacca acacgacacc aaagctcaag aacgatcctg catctggtgc ggacacgaaa 840 cacaccctcg atcgagatgt ccagcaaaag acgcggtgtg ccggagttgt ggaaaaggag 900 gacactttat gacggtttgt cggtcgacat cttctcttag aaatgtcacc gagcaacaag 960 aagggctcgt ctcaacagga cagttaatga ctgcctcgaa acccgccacc gcaacgaaga 1020 acaatgggaa gcaagagcca ggatttctcg gaacgttatc aaactttgga gcctcaaagt 1080 gggaattgtc aatattggtc caaagccaac cggttctatt caagatagac accggtgcag 1140 atgagacggt gatccccgaa gaccttttta ggagtctctt caaagaccgc actctacact 1200 cgccgagacg ccagctgcag gggcccgacg gcaagtcgct tcacgttcta ggcgtcgcaa 1260 gcatgaacct ttcgttcaaa aactacagta gcgaagagga agtttacatt ctgcgtggcc 1320 tcaacacacc gctgcttgga aagccagcta tagaaaaact caaagtactc tctgaaatct 1380 gcgaggtgcg tggactgtac cctgaggaag attttcccag cctttttcaa ggtctcggca 1440 caatggaggg tgactacgac gtaaagcttg caccggatgc gacgcccttt gccttgactt 1500 cacctcgcag ggtccctctg cctcttttta acaagacaaa agtggagctt gagaggatgc 1560 agaatctggg agtcatttca ccagtgacag aacccgtgga gtggtgcgcc ccgatggtag 1620 tggtccctaa ggcctcgggt gacattcgca tctgcgttga ctttacgcaa ctgaacaagt 1680 acgtccagag ggaatggcac ccgatcccat cagtggaaca ctctttggga atgttgcaga 1740 aggcaaccgc tttctcgaaa atcgacgcca actcggggtt ttggcaaatc cccttaagtg 1800 aaaacagcaa gcccttgact acattcataa caccttttgg aaggttctgt tttaacaggt 1860 tgccgttcgg aatttcttcg gcgccagaac acttccaaag aagaatgaac gatgttcttt 1920 tgggtcttga aggagtggcg tgccatatgg acgacgtcgt gatctgggga tctacaaaag 1980 atcaacacga tagccatcta agagcagtcc tggagcgtct cactcaagcg gggatcacac 2040 taaacagaga gaaatgcctc ttcagcgtta cgaagatttc tttcctggga cacgtgattg 2100 agaacaatca agttcgtcca gatccagcaa agctgtcagc tattctgcag atgccaaatc 2160 caacatcaaa gaaggagtta cgacgcatca tgggaatggc aacgtatctg gcgcgcttcg 2220 tgcctaatgg tgcacagctc ctagaaccgc tatcgtcgat gctaagcacc aagcacgact 2280 tcgtctggaa tcaacctcaa cagcttgcct tcgaccggtg gaaatctatg ctttcatcag 2340 acccagttct aggcatctac gatccaagca aggaaacagt tgtcagtgct gacgcatcgg 2400 catacggtct aggtgctgtc ttacgccaga aagaaggaga gtggttcaag gtgatcgcct 2460 acgcgtcaag actcctaaca gacactgaac gccgatatgc acagatcgag aaggaaggct 2520 tggccctcgt atgggccagc gagaagtttc gagactacct cgtgggaaga aacttcactc 2580 ttgaaactga ccacaagccc ctcgtctcac tctttgggtc caaagcgctc gatgacctga 2640 caccgaggct acagaggatg cgcatgagac tcatgcgttt cacgtatgcg attgtttacg 2700 tccctggcaa ggatctcact gcagccgacg ctttgtcaag gtctcctatc ttacaatccg 2760 aacctttcga gctcgaaaat gaaattgacg gctacattac acacgtcgcc tcttctatgc 2820 ctctgacaac atacagtatt caaaaagtcg ccagtgaaca acgtaaagac aaagtgtgca 2880 tgcgtcttca gaaaatcatt caaacagggt ggccgagaag aagagatgtg gagctggatc 2940 tcaaacccta ttgggaaaac aaagacaaca tctcaattgt tgacgacgta ttagtataca 3000 cgtcaagagt gataatcccg gaatcgatgc gcgaccaagt cctgcgccaa cttcacgaag 3060 gccatttcgg gatggacaaa tgcaaaggac gagcgaagga ctctgtctgg tggcctggag 3120 tcgattcaga catagaacgt ttgatcaaaa actgtcacgt gtgcctcaaa cagagcgcca 3180 atcacaagat gcctcttctg ccaacagatt tcccacaacg tccatgggaa agagtagcga 3240 tggatttgtt tttctgggat ggagcctggt ggctaatcgt aactgattat ttctcacgct 3300 acccagaact gacccgcctt gccagcctct cgtcaaaaag tgtgatcaat cactgcaagt 3360 ccatttttgc gaggcatggg ataccggacg tggtgttatc tgacaacgga ccccagtttt 3420 caagtgccga atttgacaaa tttgcaaaag agtacagctt tcgccacatt acttcgagtc 3480 cacgatatcc ccaaagcaac ggaatggcag aagccgcggt caaaataata aagacgtcac 3540 tgaagaaaac tggagacccg tataagaccc tgttggcata cagaacaagt ccactgaaaa 3600 atgggtactc tccggcagaa ctcctgatgg gcaggaggtt gcgaacaact attcctgttt 3660 catccaagtg ccttgttccc tgccttccag atcttagcag cgtaagaaca tatgaaacat 3720 cccagagacg aaagcaaaaa caatacttcg accgtaggca tggggtcagg gacttggtcg 3780 accttgacga aggaaccgac gtgtgggtag tggacctgca acgacaagga gtcgtcgaaa 3840 gcaatgcaaa tgagcccaga tcttactggg ttgacactga cgccggagag cccattcgca 3900 gaaacagaac acacctgatt cctttaccaa gtcagattcc caggccagaa atagtaccgg 3960 tacacgagga tgtgcaaaga gacgaaactg agcaaccgga gcctggacct cattccgccg 4020 agaacaatca cacctacaca aggagtggaa gatgtgttcg tccaccaaag aggtatcccg 4080 aacaatgaac gttgccgatg tgagacttag taggggagat gtgtgagata gtttacttag 4140 taggggaga 4149 // ID Gypsy-28_DPu-LTR repbase; DNA; INV; 190 BP. XX AC scaffold_218; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_DP_; KW Gypsy-28_DPu-I; Gypsy-28_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_218; Positions 92011 92200. XX SQ Sequence 190 BP; 46 A; 46 C; 46 G; 52 T; 0 other; tgttgtgtta atgtgccctt cttcctgcga ccgctagatg tcgtgcggcg ctcaagcgct 60 gaagcttgga agcagcgagc aggaagaaag gcagtctggt ttcggccgag tctgtagaga 120 gaagctcttt cgcctcgctt aatacactcc gttaatttac ttaattaact acccagatta 180 caacttaaca 190 // ID BEL-30_AA-I repbase; DNA; INV; 5285 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-30_AA_; KW BEL-30_AA-LTR; BEL-30_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5285 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 601435 606719. XX CC 'CGTGT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 1372..3384 FT /product="BEL-30_AA-I_1p" FT /translation="MVAQVPLEPQQKSVVLPTAVIRVQGFDGGLLQVRALI FT DSGFEASLISEACIKKLGLLRCNGKVMVSGLGQQAAGTTRGMAKLVIANRF FT TDECVLQTSAYILGKLTSTLPTQHCSIHPSLLDKQIQENLADPAYHQPGPI FT DVILGSDVFLALLKPGQVKDNGGVPVAQNTIFGWIVSGNQAIYSSSVQGNI FT SIINLHAELDINRTLRLFWEQEEVPKPSQLTPSELAATELFRSSLTRDGNG FT RFIVRLPFDDSKPALGESLTSAIKRLRSMERRFRQDEEFHALYSEFRSEYL FT ALGHMEEVPPDELEVETSKCFYLPHVKTESTTTKLRVVFYAVVSLNDRLLA FT GPDVNQDLFSVFLRFRTHQVAFSADVEKMYRQVVVHPTDRDFQRIVFHERE FT DQPIRHYRLCTVTYGTKCAPYLAIEAMKQAAREYQPVYPEAVQRIELDTYV FT DDFLSGVRNVQQARIMKNQVMEILESAGFHLRKWTTNCPELLQEEFEGDQA FT PVDVKLAEQTNSVKALGIQWLPTEDTFSFKVTLKPNRVNTKRQILSDSCKL FT FDPFGWLSPVTIRVKILFQQLWLTELSWDDRLPPAVETAWLEIKESLQQVE FT QIRIPRFAPNFNGTIELHGFADALEMAYAAVVYAKGRNEAGDIIVNLLVAK FT TRVAPIKQVSLPRLELNASVIH" FT CDS 3387..4484 FT /product="BEL-30_AA-I_2p" FT /translation="MESIKQAFSHLEVEPWAYTDSSIVLQWLSAHSRKWKS FT YVAYRTSAILEVLPRDRWIHVRSEVNPADCASRGLAPSELVTHPLWWHGQD FT ELKTEVEHWKQVPVEETEDEEVLEPRRVKVLHAAAVAIRTNYDIEMRLLER FT RSNYTLIVRTLAYVNRFLMAIRAEDANLEPGLSPNEIYDAKAQLARAAQYA FT SYKQELDLLQKGKELSAKHKLSALHPFLDSQGTMRVGGRLQNASYSYDVKH FT PIILPGNHRVTELLLRDLHLRNLHAGPTLLTATVNQQFRPSGGGSLNSPRM FT YSVRATKRKDGDTADGQPTGVSGDGHSSIRPRRRGLRRPSEGSCLLRSGRE FT DDQRLHRSLRMYGDQGGALGGRK" XX SQ Sequence 5285 BP; 1392 A; 1334 C; 1428 G; 1131 T; 0 other; ttggtccttc gtcgtcggat tccgcgaatc gcgatggaaa aactagtgaa agtacggaac 60 gcgcaaatgg tgcaagtggt cggggaggag tcctacagcg aggcaacgga acggctcgac 120 cagctcaagg aacttgttgg aaacttccgc aagacacagg caagcatcga agagctcctt 180 gacgacccgg aagcggttgc ctcggtgcac aatattcgtg aagagttcaa cggagcctac 240 tttagtgcaa aagacatcct ggagaagtac atcgcggata ataatccgga tgaaaccaat 300 tccgtatcga gtagtggaac gatcgtcgat aacgatctcc acgaggcgat gaaaatgttg 360 ctcgtctcgt cgcgaacaac aaccaagcaa acctcgatcc agcagcggat atggtgtcga 420 atcgttcacc tttcctgagt gtgcggcttc catcgatcag cgtgccgacg ttcagcggtg 480 atcggaaaag ctggaggtcg ttcagacatt ttcgaatcga ccatccacag ccggaacgat 540 ctgaaagatt cgatcaagat gcagtatctt gtgtcgtacc tcgatggcag tgcgaagcag 600 ctagtgagtt ctttcccgat tttcgacgca aactaccagc aggcatggga ggcgctcacg 660 aactactacg ataagaaaaa atataccgtt ttcgctctta ttcgtgagtt cgtggaccag 720 ccggcggtga tcaccgccac atccggaaat cttcgcaagc tagtcactac tgctgatgag 780 gtggtgcgac aattgaatgc gctcggggag gaatataaca cacgggatcc gtggcttatt 840 tacctcctgc tggaaaagct ggaaaaggaa agccgttcat tatgggcgcg gtggatcatc 900 gaagaagaaa acccttcctt cgacgatttc atcaagttcc tggacaaccg atgtgatgcg 960 ctggaaacct gcacagcttt cacgaagaag aagacagtag aagcagtcaa gaaggagcca 1020 gcgaagaagc cactcagtga gaagaagatg cagtcgctgc acacagctgc agcaggggaa 1080 aagtgctcga aatgctccaa ggagcatccg ttgtttcagt gcagtgaatt taaagacatg 1140 gacattcaga gcaaacgcga actagtgaag aacgcgagat tgtgctttaa ctgccttcgg 1200 ggatcgcaca ccgcgaagtt ttgagtttct aaagcagtgt gccgaaacga gaactgtaag 1260 cagcggcatc atacactgct atgtgagcgt aaagctgaag atgacaggaa gaaaccagtg 1320 gtaaacagtg aatcaaacca gcaacagaag agtgaagcta ccatcaacgc aatggtggcg 1380 caggtaccgt tggaaccgca gcagaagtcg gtcgtgctac ctactgctgt catccgggta 1440 cagggattcg acggcggtct gctgcaagtt cgcgctctca tcgactcggg cttcgaagca 1500 tcattgatct cggaagcttg tatcaaaaag ctcggcctac ttcgttgcaa cggaaaggtg 1560 atggtttccg gtttgggaca gcaagcagcc ggaacaaccc gtggtatggc gaagctggtg 1620 attgccaacc ggttcaccga cgagtgtgtg ttgcaaacaa gtgcctacat cctggggaag 1680 ctcacctcga ctcttcctac acagcactgc agtatccatc caagcctttt ggacaagcaa 1740 atacaggaga accttgctga tccggcatat caccagccgg gtccgatcga tgttatattg 1800 ggatctgacg tctttcttgc tctcctcaag ccagggcaag tcaaggacaa cggaggtgtt 1860 cctgtggcac agaacacgat cttcgggtgg attgtgtcag ggaatcaagc catctactcg 1920 tcgagtgtgc aaggcaacat ctcgattatc aatctccatg ctgagttgga catcaatcgc 1980 accttgcgtt tgttttggga gcaggaggag gttccgaagc caagccagct tactccttct 2040 gaactagcag ccactgagct gttcaggtcc agtctcacac gtgatggaaa tggacgcttt 2100 atcgtgcgac ttccgttcga cgattcaaag ccagcgctgg gtgaatcact cacttcagcc 2160 attaaacgat tgcggtcgat ggaaaggcgt ttccgacagg atgaggaatt ccatgcgttg 2220 tactcggagt ttcgcagcga atacttggct ctgggacaca tggaggaggt gccgccagat 2280 gagctggaag tagagactag caagtgtttt taccttccac acgtcaagac ggaaagtacg 2340 acgaccaaac tgcgggttgt cttctacgcc gttgtttcgc tcaacgacag actcctggca 2400 ggacccgacg tcaaccagga tcttttctcg gtgtttctgc gtttccgtac ccaccaggtg 2460 gctttctcgg cagacgtgga aaagatgtac cgccaggtgg tggtgcaccc gacagaccgt 2520 gacttccaaa ggattgtgtt tcacgaaagg gaggatcaac cgatcaggca ctatcggttg 2580 tgcactgtga catacgggac caagtgcgcc ccgtatttag caatcgaagc catgaagcag 2640 gcagcccgtg agtaccaacc agtgtacccc gaagccgtcc aaagaatcga gctcgacacc 2700 tacgttgacg attttctgtc tggcgtacgc aacgttcagc aagcaaggat aatgaagaat 2760 caagtcatgg aaatacttga gtccgctggt ttccacctgc ggaaatggac tacgaactgt 2820 cccgaactac tacaggaaga atttgagggc gatcaagcac cggtcgacgt caagcttgcg 2880 gagcagacaa actcagtgaa ggcactcgga attcagtggc tgccgacgga ggacactttc 2940 tctttcaagg tcaccctgaa accgaacaga gtgaacacca aaaggcaaat actatctgat 3000 tcgtgcaagc tattcgatcc attcggatgg ctctccccag taaccataag agtcaagatt 3060 ttgttccagc aactttggct cacggaactc tcgtgggacg accggctgcc tccagcagtc 3120 gaaacagcgt ggcttgagat caaggaatct ctacagcaag tggagcaaat acgcatccct 3180 cgattcgcac ccaacttcaa tgggacaatc gagctgcatg gatttgcaga tgcgttggag 3240 atggcgtacg ctgcagtggt gtatgccaaa ggaagaaacg aagctggtga catcatcgtc 3300 aatctcctgg tcgccaaaac cagggtagct ccgattaagc aggtgtcatt gccaaggttg 3360 gagctcaacg cctctgttat ccactgatgg aatcaatcaa gcaagcattc agccacttgg 3420 aggtggaacc ttgggcgtac acagatagca gcatcgtact ccaatggttg tccgcccatt 3480 ctcgtaaatg gaaatcgtat gtggcctacc gtacgtctgc aatattggaa gtgctcccgc 3540 gcgaccgctg gattcatgtg cgaagtgaag tcaatcctgc agactgtgcc tcacgaggtc 3600 tcgcgcctag cgaacttgtg acacacccgc tctggtggca tgggcaagac gaactgaaaa 3660 cggaagttga acactggaag caagtacctg tcgaagaaac cgaggatgag gaggtattgg 3720 agccgcgtcg agtgaaggtt ctgcacgccg cagctgtggc gatccgtacc aactacgaca 3780 tcgagatgcg cctgttggaa cgacgatcga attacacact gatcgtgcgc acacttgcgt 3840 acgtgaatcg attcttgatg gcaatcaggg cggaggacgc gaacctcgaa ccaggacttt 3900 cgccgaatga aatctacgat gcaaaggcgc agctagctcg agctgcacaa tatgcttcat 3960 ataagcagga actggatctg ttgcagaaag gtaaggaatt gtcagccaaa cacaaattgt 4020 ctgcccttca ccctttcctg gacagccaag gaacaatgcg agtgggagga cgattgcaga 4080 atgcatctta ctcatacgac gtcaagcacc cgatcatact tcctgggaac catagagtaa 4140 ccgagttgtt gttgcgagat cttcatctgc gaaatcttca tgctggtccc accctactca 4200 cagctacagt caatcaacaa tttcggcctt caggcggcgg ttcgctcaac agtccaagga 4260 tgtactcggt gcgtgcgact aaaaggaaag acggcgacac agctgatggg cagcctaccg 4320 gtgtctcggg tgatgggcac tcgagcattc gcccacgtcg gcgtggatta cgccggccct 4380 ctgaagggtc atgcctcctg cgttcggggc gtgaagacga ccaaaggcta catcgtagtc 4440 ttcgtatgta tggcgaccaa ggcggtgcac ttggaggtcg caagtgacct ttcaaccaat 4500 atgtttatat gtgcactgaa gcgattcatt tcacgaaggt ctcatccgaa cgagatttgg 4560 tcggattgcg gaacgaactt cgtgggaacc gaccgttggc tgaaggaaat tcaaactgca 4620 cgagagaccc acaacgaagc aactgatcga tttctcacca acttgattat taagtggtgt 4680 tcaacccgcc ttcagcgccg caccgtggtg gcatctggga ggcagcggtc aagagcgcca 4740 aaaagcacct agtggcagtc ttgggcaacg aagcagcaac gtttgaagaa ctctcaacaa 4800 tcctggcaca ggtggaagcg tgcctcaact cgcggccgct atgtccactt tcaaccaatc 4860 ctgacagctg tgaagcgctg accccaggac attttctggt tgggcagtcg atgaacctca 4920 tcccggagcc tgatgtccgt cacattccaa tgaaccgtct ggataagtgg caacttttac 4980 acaagcacac aacggaaatc tggcgtcgtt ggcgagatga atatcttgct aatctgcaac 5040 cgcgaagcaa atggcgaaca acggaggaga acctagaaaa gaatcagctt gtattggtca 5100 agaacgacaa tgccccgccg actcaatggg agttggcccg tattgctgag ctgcacccgg 5160 actcgacagg agtcgtccgg acggttaccc tgcggcgtgg tcaagcggag tactctcgcg 5220 gcccatccag aagatctgcg ttctaccaac ggattgaggc agcagagcct caaggcgggg 5280 gtgga 5285 // ID Crack-13_AAe repbase; DNA; INV; 3053 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-13_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3053 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1229-1229 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). 5'-truncated. XX FH Key Location/Qualifiers FT CDS 3..2621 FT /product="Crack-13_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="HGGLAVFVRNELQMDVCRNIVIEGFHHIHCRFKFEAN FT AINLHAVYRPPSFDVRRFLHEIEGMISLSKNNEKCVLVGDLNIPLNITTNN FT IVGEYIRLLETYNMLPTNTVVSRQSSNNILDHVVCSTSMIEDVFNETIFTD FT VSDHCLICSSINMKCHTSESTLHKRIIDHSRLNELFERSILNIPQNLSVNE FT RLNYVITHYNEILNDCSRLVTQRVKLKGSCPWMTLDLWKLIKIKDKLLKKK FT RNNPNDNHTTELFAHVSKILQDKKAKTKRDYYEGLLSGNNQKAAWKIIKNV FT MGNQKSNSKPTAIRNGAQLITDPQQVCSSFNDHFCAVGPKLSSTINTNRNI FT NRFGTLGSLQSTIFLKPSTRNEIILLINRLDSKKSAGPDGIPVSFIKHHHS FT FFASLLMETFNEIIVSGEFPDCLKIARVIPVFKSGDPKDMNNYRPISTLSI FT IDKLLEKLLVSRILEFSTKHHLIYGQQYGFRQGSSTLTACHELVDEIYDAL FT DNKQIVGALFIDLKKAFDTVDHDLLLRKLESYGIRGIAKDLLKSYLTDRHQ FT YVSIGEHKSQLRLVTTGVPQGSNLGPILFLLFVNDVARLNLHGKLRLFADD FT TAVFYRGTNCDTILNQIKMDLEVLLEYFGENVLSLNLNKTKYMLIHTPRRK FT IPAHGPISIGGHNLDKVYEHEFLGLTIDSVMSWAAHINKLKSKVSSLCGIL FT RKISSFMPQTCLKQIYFALVHSRLQYVTAVWGSASKSHLRELQVLQNRCLK FT IVLRKPLLYPTVNLYSNRNDSLLPIKALYEYQVLVQMQKIVKGTSTHHNTV FT LHMSQQSRSSRQANHIFLFRPNSEFGKKKITYFGSKLYNALPENCKSFQNM FT FHFKRFVRNALKQKVEQFVL" XX SQ Sequence 3053 BP; 984 A; 679 C; 561 G; 829 T; 0 other; cacatggagg tttggctgtg tttgtacgta atgaactaca aatggatgtg tgtagaaaca 60 tcgttataga aggattccat cacattcact gtcggttcaa atttgaagca aatgccatta 120 acttacatgc agtttaccga ccacccagct ttgatgttag gcgattcctc cacgagatag 180 aaggcatgat atctctttca aaaaacaacg agaaatgtgt tcttgttggc gacttgaata 240 tcccgttgaa cattaccacg aataatattg taggcgaata tatcaggtta ttggaaacct 300 acaatatgct gcctactaac acagtcgtat ctaggcaaag tagtaacaac atattagacc 360 atgtggtttg ttcaacttca atgattgagg atgtattcaa tgaaactatc ttcactgatg 420 tgagtgatca ttgtttgatt tgttcttcaa tcaatatgaa atgccatacc tcagagagca 480 ctttgcataa acgtatcata gaccattctc gactaaacga attgtttgaa cgatcgatcc 540 tcaatatacc gcaaaatttg tcagttaacg aaagactgaa ttatgttatt acgcactata 600 acgaaatact gaatgactgt tcaagactag taacacaacg tgtgaaactg aaagggtcgt 660 gcccttggat gacattagac ctgtggaaat tgatcaagat caaagataag ttacttaaaa 720 agaaacgaaa caacccaaat gataaccata ccacagaact tttcgcacat gtttcaaaaa 780 tacttcaaga caaaaaagct aagacaaagc gggactatta cgaaggactt ttgagtggaa 840 acaaccagaa ggcagcctgg aaaatcatca aaaacgtgat gggcaaccag aagagcaata 900 gtaaacccac ggcaattcga aacggggcgc agctaataac ggatccgcaa caagtatgtt 960 ccagcttcaa cgatcatttc tgcgcagtag gacccaagct ttcttccacg ataaatacaa 1020 atcggaacat caatcgcttt ggaacgttgg gctcgttaca gtcaacaatc tttttgaaac 1080 cttctacaag aaacgagatc atattattga tcaatcgact ggattccaag aagtcagcag 1140 gaccggatgg cataccagta tcattcatca aacatcacca tagtttcttc gcatcgctac 1200 tgatggaaac tttcaatgaa attatcgtca gcggtgaatt tcccgactgt ctcaagattg 1260 cgcgggtcat acctgtattc aagtccggag atccaaaaga catgaacaac tatcgtccaa 1320 tttctacatt gtcgatcatt gataagttac tagaaaaatt acttgtctcc agaatactag 1380 agttttcaac aaaacatcat ctcatatatg gccagcaata tggttttcga caaggatcca 1440 gcacactcac tgcgtgtcat gaactggtag acgaaatata tgacgcttta gacaacaagc 1500 agatagttgg agctttgttc atcgacttaa aaaaggcctt tgacacggtt gaccatgacc 1560 tgctgctacg aaagttggag tcatatggaa taagagggat tgccaaagat ctgctaaaaa 1620 gctaccttac tgatcgccat cagtatgtat caataggaga acataaaagc cagcttcgcc 1680 tcgtgactac tggagtccca caaggaagta atttggggcc catattattt cttttgtttg 1740 tcaacgacgt cgccagattg aaccttcatg gaaaactccg tcttttcgct gacgataccg 1800 cagtctttta ccgcggtacg aattgcgata ctatccttaa ccaaatcaaa atggatcttg 1860 aagtgctatt agagtatttt ggggagaacg tgctgtctct taatctcaat aaaaccaaat 1920 acatgctgat tcacactcca cgaagaaaaa ttcctgcaca cggcccgatt tctatcggtg 1980 gtcataatct ggataaggtg tatgagcacg agtttttggg cctcactatt gattctgtaa 2040 tgagctgggc agcacacatc aacaaattga aatcaaaagt tagctctctc tgtggaatac 2100 tacggaagat atccagcttt atgcctcaaa cctgtctcaa gcagatttat tttgctttgg 2160 ttcactcacg actgcaatac gtaacagctg tatggggttc tgccagcaaa tctcatctac 2220 gcgaacttca agttttacaa aaccgctgtc ttaaaatagt gctacgcaaa cctttgctgt 2280 atcccaccgt caatctatat tcgaatagaa atgactcact gttaccaata aaagccttat 2340 acgaatatca agtattggta cagatgcaaa aaatcgtcaa gggaacttcg acgcaccaca 2400 atacggtcct ccatatgagt caacaaagtc gctcatcacg gcaggccaac catatttttt 2460 tatttcgacc caattcggaa tttgggaaga aaaaaatcac ctattttggt agtaaactct 2520 acaatgccct accagaaaac tgtaagagct tccagaatat gtttcacttc aaacgttttg 2580 tacgcaatgc actaaagcag aaagtcgagc aatttgtatt gtaatatgca ttttttctaa 2640 tgccacttcc gtatctgttc tttgtcattc cacgccacca ccacccttcg ccaccaccca 2700 ccgccagcca ccaatcacca ccacccactg ctagccaccc gccgccagcc acccgccgcc 2760 atcgccacca ttcatcactt cacccatcgc ttgccaccaa cttcagtaaa gatcaatgct 2820 cataaatatt tataatgtga attgaaagtt tgtgaaaatt ctaaactata gcgcttcctt 2880 caaagagcta cgctcattgg aagtgcagtt atcgctcaat cgttgaatta tacttatttt 2940 tttgaaaaga tgagaaggtt ttatgcctat gggagaagtg gcttaaagta gctacactct 3000 cacgggcttt tcccttctcc aagagaaaaa aaagcagtaa ataaataaat aaa 3053 // ID Gypsy-1_OD-I repbase; DNA; INV; 6705 BP. XX AC CABV01000585; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_OD_; KW Gypsy-1_OD-LTR; Gypsy-1_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6705 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000585; Positions 26231 32935. XX CC Positions [4594-5091] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 766..1971 FT /product="Gypsy-1_OD-I_1p" FT /translation="MNKLTNELRLLEEQLAALFQAKPEDGTESARIYTTMT FT ANIKTKEEEILAAVKAAPASSALDVINDGMMPNASTAQWRQNNLMSSYVAG FT LKFSGRDDEDINEFLFKIKAIGEACNLIPFSEIWAAARPQLPSSMLRAVDG FT VKIEDYKTLKETLSTLFGSHQNAHQRLESWMNKEKRWGMSFVQHHAELSGK FT LNSIKASYAEMLKERHSSHSGDRSYQPTWDDAFQILEYLKILQDCRGQSDS FT IFKTLSIELSKIKTPTALAMRAEQIRSQVPTQSRALNVSNNKKGNNKQQNK FT GNNISGDNNQSRSKPSTPKGGDRPTKDSQGKSQHNENSDRRVHDNTRGGRN FT SGGNNHNNRSNNNKWNNKRQGNNNQFAIQDTSFADYENIEFDINSGDEETF FT DHTYGTKN" FT CDS 2944..5787 FT /product="Gypsy-1_OD-I_2p" FT /translation="MQMEALLPPGQKYFMTADIKCGFWNIKVREEDQHKLA FT IQWKGTNYQFQRLPFGLKSAPSIFCRAIARCLETMQKNIKLYIDDICIFSE FT TFDVFVKTIDEVFSLLTDHGFVVSPKKVFALQPEIRWLGRLISPIGSRANP FT ENTQAILKITAPTSYKGLQRLLGMLQWIRQYAACRNKENVASKSFSQVIKP FT ISALLKTNTPRGKFTWTRNASKALEEIKERLASPEFIYFPDWNHTFVLTTD FT ASIDAIGYCLTQEIEGKSRIIRVNSKTLNGAQQRYSTTERECLGVYWAILD FT CKFYLSGSRFIVRSDHNPLTFLDTKTPKNDKVVRWLNQLSNFNFIIVFVRG FT VDNNVADFLSRPRDSDRPTRSSSPEDSEPAGEYHKFHKYSIYVPSWVDPNA FT DVSISMDGADLTDPGPVIYAITSSRPDIDTATRTSIASAQYDDLAVRKILD FT CLEYKTPILDDPSDEEVAWLFRHRNQLTRCPITACLLVGQKIYVPRVLRRE FT VLTEFHDNRNHNGAKRMTEQTSHLTWPSKATDISNWSQSCFCAHRKGGRGQ FT SHQPPLQPTKRGTRIFEKILVDFVEMPLSKSGFKYLLTCICTFSRFLIAIP FT SRRCRAVDTVRMLTEHVFYKYPKPDVISSDRGVHFTSHVNQAFAAKMNIDW FT KYHCGYHPESTGALEIQHRSIKDSIFIAVHATGLEWPEVLPSTIHIINFQP FT NAGTKVSPHEAVYGLKPNLSKYDPKPSLEATSLDTYTKKQAEVLAKIHKNI FT ARCQLQADKALKKSADPKRPAKEIFVGDKVLIYRPFSAIAKRSKLKWIGPY FT SVTDSFGNIVVIEDCEGCTDYIHRSQCQKIEERKPHLGPLPPFPQFDVPLR FT RTNFHHPRPSPQISDNLPTNMLDEEPVNPLEDEPELETTLFETPPTSPVRP FT KTPPQRVLTRSQAKRISQEEQFQPSQTPVRRSARISARPPWKPA" XX SQ Sequence 6705 BP; 2137 A; 1661 C; 1309 G; 1598 T; 0 other; gatcggactc tattaaatgg tgtcagaaga caaccactgt caaactcctt tgtttcaaac 60 tgataattga aagacaggaa tctcctcagt agatcaattt gctaggattc tagagctagg 120 ttaaaaccga gaatggcccg ttaggcccat tcaggcgagt ttttaatcaa attaaacctt 180 ttttttgact atttttccta atttcgagat ttatgaaatc tcggccaatc agagagcgcc 240 attcgctgga actgccacac agggtcgctt gcacgtggaa agctgttcat gcttacgcta 300 tgcattcgcg cgggggcgcg cgagcacgcc cgcgcgaacg cctacgcatt cgagccgacc 360 gctagcacac gacgcaaaaa cactcactcc aggcagaaat cagaggaaat attcaaaatt 420 taaccaaatt tttcaaaatt tataattttt aagtaaaatt ttacgctctt ctcattggtc 480 gagacaaaat gaagaaaata tgaataaaga agtctccaag gtataaatag ctcaaaattg 540 cgcgaaaaat ttaagaatta aatatctagc ggtcagctac tactatattt tgcttacgtt 600 tgggggcatg taggcttaat ataaacataa tacaatataa atgcgctcat tttagaaatt 660 acatatccta gttaatcgaa agtcacaaat tagtagaacc attgataaat tgagctcaaa 720 aaagtcaaat tctacatacg aattccagcc accctccttc ttacaatgaa caagctcacc 780 aacgagttga gacttcttga agaacaactt gctgcgctct ttcaggcaaa gcccgaagat 840 ggaaccgaaa gtgcccgaat ctacaccacc atgaccgcga atattaaaac gaaagaagaa 900 gagatcctgg ccgctgtcaa agctgctcca gcttcttccg ccctagatgt catcaacgac 960 gggatgatgc caaatgcttc tactgctcaa tggagacaga acaatctcat gagctcctac 1020 gttgccggat taaagttcag tggacgcgat gatgaagata tcaacgagtt tctctttaaa 1080 atcaaagcga ttggagaagc ttgcaacctt atcccttttt ccgaaatttg ggctgccgct 1140 cgcccacaat tgccatcctc aatgctgcga gcagtcgacg gagttaaaat agaagactac 1200 aaaacgctga aagaaacact atctacactt ttcggctcgc atcaaaatgc gcatcagcgc 1260 ctcgaatcct ggatgaataa agagaaacgc tggggcatgt cctttgttca gcatcacgcg 1320 gagttgagtg gaaagctcaa tagcattaaa gcctcatacg ctgagatgct aaaggagcga 1380 cactcttctc acagcggaga tcgctcttat caaccgacat gggacgatgc cttccagatt 1440 ctcgaatatt tgaagatcct tcaagactgc cgaggccaat ccgactctat tttcaaaact 1500 ctttcgattg aattatcgaa gataaaaaca ccaacggcgc tcgcaatgcg cgccgagcaa 1560 ataaggtcgc aagttcctac ccagtcacgc gctctgaacg tttccaacaa caaaaaggga 1620 aataacaagc aacaaaacaa gggcaacaac atttctggtg acaataacca gtcacgatcg 1680 aagccgtcta ctcccaaagg cggtgaccga cccaccaaag attcacaggg caaaagccag 1740 cacaacgaaa attcagaccg acgagtccat gacaatacgc gcggaggtcg caactccggc 1800 ggaaacaacc acaacaaccg ttccaacaac aataaatgga acaacaaacg acaaggcaac 1860 aacaatcagt tcgccatcca agatacaagc ttcgccgatt atgaaaatat cgagtttgac 1920 atcaacagtg gagacgaaga gacattcgat cacacttacg gtacaaaaaa ctagaagtta 1980 tggacgattc ccagtgcacg tccgatcgta aagaaagcac tgctcttcct gtaaacgagt 2040 ccctccgaag ccaatcgccc caatatctcc cagtttgctg tctaaattcc aaccagccaa 2100 aatcagcaat agcactgttg tttgaccctg gatctttcgc cagtattctg actaaacagg 2160 tagtcgacga cctcgggtta gagatacagc catccgacca aaccatcagc ggagttggat 2220 ccaaagacaa accatgcctt ggtaaaatag aaacagacat catgatcggt aactccacaa 2280 cctggccaaa aacaacgtgg tatatcgtct caaccgacat gttaggaata ccaggaaata 2340 ttggccgcaa cccgcttcac tgccggctca acgaaatctc atacaacttg ggtaacagga 2400 ccctattctt ccgctctcgt cagggctggg cgcgagtccc gtatatttcc aatcctcgca 2460 ccgaaaattg ggacagctca gtcgacaagc actttatcta caccattaca gaaccaattc 2520 cgaccgccga gttggtcgac aaattacgga ccgagcttgg tccaacagtt tatctggaca 2580 aacaaaattc cgacgaagcg cgcgaaatct gcatgctact tctgaagcac aagcaggtgt 2640 tctcttcgga agatcgaccc attggtcttg tccacggctt tgaagccgaa atatgcacac 2700 tcccaggcag aactgctatg gtcaatcaat atcgagtgcc acagaagcac gaagagccac 2760 ttacgcgaga aattgaaaaa cttgtttcaa ttggagtatt aaaaatttcg cgaagataac 2820 cgagggttca atacacccat tggtggcgtg tctaagcctg acggtacgat tcgccttatt 2880 ctcaatttta aaataactct taatcgcatt atacaaaacg aagattgctt caatataccc 2940 gatatgcaaa tggaagctct cctaccgccc ggtcagaagt atttcatgac tgccgatata 3000 aaatgcggct tttggaatat aaaagtgcgt gaagaagacc agcacaagct ggctattcaa 3060 tggaaaggca ccaactatca attccaacgg cttccatttg gtcttaaaag tgctccttca 3120 atcttctgtc gcgctatcgc acgatgcctc gaaactatgc agaaaaacat caaactgtat 3180 atcgatgata tctgcatatt ctcagaaaca tttgatgtgt ttgtaaaaac catcgacgaa 3240 gtattttctc ttttaaccga tcacggcttt gttgtaagcc ccaagaaagt attcgctctc 3300 caaccggaaa ttcgctggtt aggccgactt atttcaccga ttggaagccg cgcgaacccc 3360 gaaaatactc aagctattct gaaaattacc gcgccaactt catataaagg tttacaacgt 3420 ctgcttggaa tgttacagtg gattcgacag tacgctgctt gcagaaataa ggaaaacgtc 3480 gctagtaaga gtttctcgca ggtgataaaa ccaatatctg cacttttgaa aacgaacacg 3540 ccacgcggta aatttacatg gactcgaaat gcttcaaaag ctcttgaaga aatcaaggag 3600 cgtctcgcca gccccgagtt catttatttt cccgattgga atcacacgtt cgttcttaca 3660 accgatgcca gtattgacgc tataggctac tgtctcacac aagaaattga aggtaaaagc 3720 cgaattattc gggtcaacag caagactctg aacggcgccc aacaacgcta cagcactaca 3780 gaacgagaat gtctcggtgt ttactgggcc attctcgact gcaaattcta tctctccggt 3840 tcccgtttta ttgttcgtag cgatcacaat ccccttacct tccttgacac gaaaacgccc 3900 aaaaatgaca aagtcgtccg ttggttaaat cagctcagca atttcaactt cattattgta 3960 ttcgtccgcg gtgttgacaa caatgtcgca gattttctat ctcgtccgag ggactcggac 4020 agacccaccc gtagtagcag tccggaggac agcgaacctg ctggcgaata ccataaattc 4080 cacaagtaca gtatctatgt cccgtcttgg gttgacccaa atgcagacgt aagtatatca 4140 atggatggtg ctgatctcac agaccctggg cctgtgatct acgcaataac atcttcaaga 4200 ccggacatag ataccgctac ccgcacttcc atcgcaagtg cacaatatga cgatcttgca 4260 gtccgaaaaa ttctcgattg tctcgaatac aaaactccca ttttagacga tccttcagat 4320 gaagaggtcg cctggctttt tcgccataga aatcaattaa cgagatgccc cataacagcc 4380 tgtcttctcg tcggacaaaa gatctacgtc cccagagttc ttcgaagaga agttctcacc 4440 gaatttcatg acaacagaaa ccacaatgga gcaaaaagaa tgaccgagca aacttctcac 4500 ctcacttggc caagtaaagc tacggatatt tcaaattggt cccagtcatg tttttgcgcc 4560 catcgaaaag gcggtcgcgg acaaagtcac cagccgccac tccagccgac aaaaagagga 4620 acacgcattt tcgaaaaaat tttagttgat tttgtggaaa tgccattatc aaaatcaggg 4680 tttaagtatc ttttaacgtg catttgcaca ttctcaagat tcctcattgc aataccaagt 4740 cgacgctgcc gcgctgtaga tacagtcagg atgcttactg aacatgtttt ctataagtat 4800 ccaaaaccgg atgtaatttc gtctgatagg ggagtgcatt ttacatccca tgtcaaccaa 4860 gcattcgccg cgaaaatgaa catcgactgg aaatatcatt gtggttatca cccagagagc 4920 acaggagctc tcgaaattca acatcgcagc ataaaggaca gtatttttat agctgttcac 4980 gctacaggtt tggaatggcc agaggttctg ccaagtacaa ttcacatcat aaactttcaa 5040 ccgaacgccg gaacaaaggt ttcgccccat gaagctgttt atggccttaa gccaaattta 5100 tcgaaatatg atccgaaacc atctttggaa gcaacgtccc tcgataccta cacgaaaaaa 5160 caagctgaag ttctggcaaa aattcacaaa aacattgccc gttgccagct tcaggctgac 5220 aaagctctca aaaaatcagc tgacccaaaa cgtcccgcaa aagaaatttt tgttggtgat 5280 aaagttctaa tttatcgccc attttcagca atcgccaaac gatcgaagct caaatggatc 5340 ggcccttaca gcgtcacaga cagtttcggc aatattgtcg tcattgaaga ttgtgaaggt 5400 tgtacagact acatccatcg atcgcagtgc cagaaaattg aggagcgaaa acctcaccta 5460 ggtcccttac caccgtttcc acaattcgac gtgccacttc gaaggacaaa ttttcatcat 5520 ccaagaccct ctccgcagat ttctgataat ttgccaacaa atatgctcga tgaagaaccc 5580 gttaacccac tggaggacga acctgaactc gaaacaaccc ttttcgagac ccctccaaca 5640 tctccagtga gacccaaaac cccaccacag cgcgtactta ctcgctcaca agcgaaaagg 5700 atttctcaag aagaacaatt tcagccgtct cagactcctg ttagacgttc tgcacgaatt 5760 tctgctcgcc cgccgtggaa accagcttaa gtccgactaa cagtaatatt ttatttcaaa 5820 ttttatgttt gaaggacgat ttacagttaa attttatatt aaaaaacaaa taatttgact 5880 aactggttag aaaataaaaa attaaaaaat aataaaaaaa aaatcgcaaa atttacgtat 5940 cgtagttatt ttcacgccaa aagttctcga ggacttgatt aacgattata ataattaaac 6000 atagaatgtc tcgtattacg ggatcttgtg ttagtcaaga ctttgataag gaacaagtcg 6060 gaaaattcgt caacgccctc aaaaggcgcc gcgaaacctc aagcgcccga acattgcccg 6120 ctttatcgac gccgaagcct aatcatacct cagatgacat tctcgatctg gattgcccag 6180 aatctctcat tggaaccgtc acgccacaag aacgatcacc atcttcaccc ttgacggcca 6240 gcgaagacga atttgagcaa cactgctcag aacaggccca gcgaaagaaa cgaaaaaaat 6300 acgacgcaaa gaagaagctc cagaaggttc tacgtgaaac gaaacacaag tcagtaaccc 6360 ttgtacagaa tacgccagaa gcacgaatca aaacacgaca acacttaagc tccgtgctct 6420 caatgattcg ctccgccact caaaatggac atttgcgaaa gtctaaagaa acaacgcttg 6480 aaggaagact ccgcgaagct acccgtttat tggacttttg ggaatctgac gaaggtcaga 6540 acatttctga cgtctgctca cgcgttttcc agggcccata ccttcaacta tcacataata 6600 ttatgcgcga gtttgatcaa aaaactgaag gtatgagtaa aagactccga tcggactcca 6660 acgacgcgca gaaagccggt ccaaacaata agtaagaagg acatt 6705 // ID BEL-9_SI-I repbase; DNA; INV; 6453 BP. XX AC AEAQ01023320; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_SI_; KW BEL-9_SI-LTR; BEL-9_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-6453 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023320; Positions 6792 340. XX CC Positions [5364-5939] - Integrase core CC 'GATGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1089..5357,5361..6281) FT /product="BEL-9_SI-I_1p" FT /translation="MAENADIQRLKVKRRIAKAGFTRTENFIKSMQPDTVD FT IDELRIKLDKLAELRKSIEECVIELAVYDIEATDEQIDQQIASYEEKYIGL FT KLMGERVIKAREKPAAASDSNHLEQVIAGPSPFAYNTRNRTQETHIRLPKI FT ELPNFSGAYEEWHSFFGIFDSLIHSNDSLNNIKKFHYLKSALKGEAAEIID FT SLEITDANYRDAWSRLKNRYDNERIAIQNHIKAIFELPVTKRENGNTLRNI FT LDSTLKHTRALAALNRPVNEWDDLLIYIIRSRLDYLTIKEWENSLESKQML FT KFKEFVDFLTKRCETLEAVARRSPAIEFNRTRQNFGKVTTVHAAITGVKCA FT HCKGDHQIYQCREFKELPVADRLSKVKSLRLCLNCLKGKHTVKDCIASTCR FT KCTKRHSTLLHDDRYSTKDDKEETVKSNREDETNKKDNTICIHAQLTRLKN FT ATQAILSTATVLVKDNKGKYIEGKALLDSGSQSHFVTEEFIKKLGLKTKND FT LIKVNGINKQASHALKSVDLHVTSRFGTFSMDMSYIILPNIVENLPTIESR FT VADWDIPKNIKLADPQFYRSSKIDILLGTEKFWELLCIGQIKLGKNKPIIQ FT KTLLGWVVSGTVDISNSDRNGQTSCNLSTMEILNQTVQKFWEIEGFQNDKK FT LTSEEKYCEELFNSSYKRQTDGRFIVKYKKEVLPLLNDSREIALKRFLSLE FT RKLSKQPTFKEEYVAFMREYQQLGHMTRVNTNKENKFRVFLPHHAVIKETS FT TTTKTRVVFDASSQSSQGRSLNDAMYKGPVLQSDLFSLIIQFRCFKYVLCA FT DIEKMYRQILISDEQRALHSILWREDPKEEIQEFELNTVTYGTKSASFLAV FT RCLIQLAESESDNFPKAAEVIRNQFYMDDLLTGGNTETEIIKLKEDLTKLL FT AKGGFRLHKWKTNYHKSYENNQSVDSTDSNKDCVDIVKEKESKLLGVLWNP FT HHDTFHYEITQGDHEPRVTKRVVLSQVCKLFDPLGLVGPVVTSAKILMQEL FT WSLKINWDESIPMHLYKSWCQIRSQLTLLNELRIPRLIISKKNESVIQLHG FT FCDASQKAYGACVYIRERDKQGNIKVTLICSKSRVAPMKILSLPRLELCGA FT VLLVNLMSRVLGSLKFELEQRFYWTDSKIVLAWIGSLSRRWHVFVANRVSE FT IHSNSSPSQWKHVGSKDNPADLISRGTTPEQLINCDIWWNGPHWLRQEVEV FT WPREGEELLKDIPEEKKQSIVASATNCETIIEYNRFSSLYKLFRVSAYVLR FT FIYNIRNNKKDRIEGAINAKDLKRAKLTIVKLVQAEEFKEDIRRLKTDNKL FT QRSSRLISLLPFLDDKGILRVGGRLRHATIPEEAKHPAILPSRHHVTRLLI FT VHYHEKLFHAGVQTTLNSIREEFWPILARNSVKEIIHKCVQCRKASPKASW FT QLMGQLPDVVNAARPFYNTGVDYCGPFYVRDRIRRNSKQYKAYVSIFVCMA FT VKAVHIELVEDLTTDSFIAALKRFVARRGKVKNIYSDNGTNFVGADRVLQN FT TLKDKEFKKEIQDFATREMINWHFIPARCPHYGGLWESAVRLLKLHLRRTI FT GESCLTVSEMTTVLVQVEAVLNSRPLTPLSEDPNDLHALTPGHFLIGENLQ FT VYPDIDLRDVPENRLSRWQHVEQLKQRFWYRWQREYLNTCQQRTKWKSKAK FT ESFKVGQLVMLKEGENKPLKWMLARILEIHPGSDNLVRAVTLKTERGIYKR FT AVVNIAPLCD" XX SQ Sequence 6453 BP; 2310 A; 1132 C; 1410 G; 1601 T; 0 other; tggtccttcg agccggatcc cggaaaaata aagccgggaa gaagaaatca acgaaacgtg 60 ccgggcgcaa aatgggaaaa gccccgtgaa gagacaagct tcggactttc gtgcaaaccg 120 cacaaggcag atacgcagca gtggccaagg tatcaaggaa gagagctaga aggcacgggt 180 agtgtttcca gcattattac aaggaaagag aagcggtctc caagcgggcg agtccaccat 240 tccaagcgac gagtccacca ttccaagcgg acgagtccat tattcgctgg aggaaatccg 300 ccatcttgag cgtgagttca acattcacaa acgaggcgac gagccttcgt acaagaagga 360 acgctcggaa aacgaggcgg cgagtcctca cattggaggt tattggttct acctggcata 420 aggcggtgag caatatccat aacaaatacg taagtgacga cgagagcaca tacatacgta 480 gcaacgtaca cacgtacgtg aaggcgagtg cgcatacata cgtagcggtg cgcacacgca 540 tgcctgacga cgagtgcgca tacacaagta gcgacgtaca cacgtacgtg acgacgagtg 600 cgcacacaca cgtagcggcg agcacacgca tgcgtgacgg cgagtgcaca tacaagcgtg 660 gcggcatagc tctacacacg cagcagagca gcaaacatac aagcgtggcg gcgtagctct 720 acacacgcat acatcaggaa cagcgagttc ttcacacaga gcagcgagca tacacgtggc 780 ggcgaagctc tacgcacgtg acagcaagtc cttcacacac gaggcggcga gtcctagtac 840 aagggggacg agtttctttc tacaaacgag gcagcgagtc ttcgtaaata gcgattaatt 900 cgaatttatt gaagtaagcg agcaaatagt cacaaaaaac gtaagacaca taacctcttc 960 caggttaggc acacgcaaat gcaagtttgt aaatttctta ataattggtt gcgacaaaat 1020 atttctacga tattcggtag tactagcgtt attacaaatc ttaaggttac gcgaaaatta 1080 aaggtaaaat ggccgaaaac gccgacatac aaaggctaaa ggtaaaaaga aggatagcga 1140 aggctgggtt cacgcgcacc gaaaatttca taaaatcaat gcaacctgac acagtagata 1200 tagatgaatt aagaataaaa ttggacaaat tagcagaatt acgaaaatca atagaagaat 1260 gcgtaataga gttagcagtt tatgatattg aggcaactga tgaacaaatt gatcagcaaa 1320 tagcaagtta tgaagaaaaa tatataggtt taaagttaat gggtgagaga gtgataaaag 1380 cgcgcgagaa accagctgct gcatcagata gcaatcatct cgagcaagtg atcgcgggac 1440 catctccgtt tgcatataat acaagaaata gaacacaaga aacgcatatt agattgccaa 1500 aaattgagtt gcctaacttt agcggagctt atgaagaatg gcattcgttt tttgggattt 1560 ttgattcact aattcattca aatgactcgt taaataatat caaaaaattt cactacctaa 1620 aatcagctct aaaaggtgag gccgcggaga tcatagactc attagaaatt acggatgcga 1680 attaccgcga cgcttggtcg agattaaaaa acagatatga taatgaaaga atcgcaattc 1740 aaaatcatat caaggctata tttgaattac ctgtaacaaa aagagaaaac ggcaatacgt 1800 taagaaacat attagatagc actttaaaac acactcgagc tttagcagca ttaaatcgtc 1860 cagtaaacga atgggatgac ttgttaatat atataataag aagtaggtta gattatctta 1920 caatcaagga gtgggaaaac agtttagaaa gcaaacaaat gctaaagttt aaggagttcg 1980 tagatttttt aactaaaaga tgcgaaactc tagaagcggt agcgagacgg agtccagcca 2040 tagaatttaa tagaactcgt caaaatttcg gaaaagtgac gacggttcac gcggcaataa 2100 caggtgttaa atgcgctcat tgtaagggag accatcagat ttatcaatgt agggaattta 2160 aagagttacc agtagcggat cgtttaagca aggtcaaatc cttaagatta tgtttaaatt 2220 gtttaaaagg taaacataca gtaaaagatt gcattgccag tacttgcaga aaatgtacaa 2280 aaaggcatag cacgttgctt cacgacgatc gttactctac aaaagacgac aaagaggaaa 2340 cggttaaatc gaatcgggaa gacgagacaa ataaaaaaga caacacaata tgcatacatg 2400 ctcaattaac aagattaaaa aatgctactc aggcaatatt atctacagca acagtattag 2460 ttaaagataa taaaggaaaa tatatagaag gaaaggctct gttggatagc ggatcccaat 2520 cccactttgt cacggaagaa tttatcaaga aattagggct aaaaactaaa aatgatctaa 2580 ttaaagtaaa cggtataaat aaacaggcgt cgcatgcctt gaaatctgta gatttacatg 2640 taacctctcg ttttggtacg ttcagtatgg acatgtcata tataatattg cctaacatag 2700 tagaaaattt acccacaatc gagtcaagag tagcagactg ggatattcct aaaaatatta 2760 aattggccga cccacagttc tatcgctcaa gtaaaataga catattatta ggaacagaaa 2820 agttctggga gttattgtgt ataggtcaaa tcaaattagg aaaaaacaag cctattattc 2880 aaaaaacact actaggatgg gtggtttcag gtactgttga cataagtaat tcagatagaa 2940 atggtcaaac tagctgtaat ctcagtacaa tggaaatttt aaatcaaacg gttcaaaaat 3000 tttgggagat agagggtttt caaaatgata agaagctcac ctcagaagag aaatattgcg 3060 aggaattatt taattctagt tataaacggc aaacagatgg cagatttata gttaaatata 3120 aaaaagaagt gctaccttta ttaaatgact caagagaaat agcgttaaag cgttttttat 3180 cactagagcg aaaactcagt aaacaaccta cattcaaaga agaatacgtt gcattcatga 3240 gagaatatca gcaattagga catatgacac gagtaaatac aaataaagaa aacaagttta 3300 gagtttttct tccgcatcac gcggtaataa aagaaactag cacaacgacc aagaccagag 3360 tggtctttga tgcttctagc caatcttctc aaggccgatc gctaaatgat gctatgtata 3420 aaggtcctgt attacaatcg gatttattct ctttaataat tcaattcaga tgtttcaaat 3480 acgtgttatg tgctgacatt gaaaaaatgt acagacaaat attaataagt gatgaacaaa 3540 gagcactgca tagcatcctt tggcgtgaag atccaaaaga agaaatacag gaatttgagt 3600 tgaatactgt aacttatggc actaaatcag cttcttttct agcagttaga tgtttaatac 3660 aattagcaga atctgaaagt gataattttc cgaaagctgc ggaagtcatt cgtaaccagt 3720 tctacatgga tgatttgcta acagggggta atactgaaac agaaattata aaactaaaag 3780 aggatttaac taaattatta gctaaaggag gttttagatt gcacaaatgg aaaaccaact 3840 atcataagtc ctacgaaaat aatcagtcag tagacagcac agacagtaac aaggattgcg 3900 tagatattgt taaagagaaa gaatcaaaat tgcttggcgt attatggaat ccacatcatg 3960 acacatttca ctacgaaata actcaagggg accacgagcc tcgggtaaca aagcgagtag 4020 tactgtcgca ggtgtgcaag ttatttgatc cgttgggttt ggtagggcca gtggtgacat 4080 ctgccaaaat attgatgcag gaattatgga gtttaaaaat taactgggat gaatcgatac 4140 ctatgcactt atacaaaagt tggtgtcaaa taagatcaca attaactttg ctaaacgaat 4200 tacgaatacc gagactaata atttccaaaa agaatgagtc agtaatacaa ttacacggtt 4260 tttgtgacgc aagtcagaaa gcatatggcg cctgtgttta cattcgcgaa agagataaac 4320 agggaaatat taaggttaca ttaatttgct ctaaatcacg tgtagcgcca atgaaaatct 4380 tgtcattacc aaggttagaa ttgtgtgggg cagtcctatt ggtaaatctg atgagtcgtg 4440 tactaggcag tctaaaattt gaattagaac aaagatttta ctggaccgat tcaaagatag 4500 tacttgcatg gataggctcc ctctcacgaa gatggcatgt gtttgtcgcg aatcgcgtta 4560 gcgagataca cagcaattcg tcaccttctc agtggaaaca tgtgggatct aaagataatc 4620 ctgctgacct tatatcaagg ggtacaaccc cagaacaatt aataaactgt gacatttggt 4680 ggaatggacc acactggttg aggcaagagg ttgaagtttg gccgcgggaa ggtgaagagt 4740 tgttaaaaga cattcccgaa gaaaagaaac aatcgattgt agcatcggct actaattgcg 4800 aaacaatcat agaatataac cgattttctt ctctttacaa attgtttaga gtaagcgcgt 4860 acgttttaag attcatttat aacattagaa ataataagaa agatcgaata gaaggcgcga 4920 taaatgcgaa ggatctcaaa agggcaaaat taacaatagt taaattagtg caagcggaag 4980 agttcaaaga agacataaga agactaaaaa cggataataa actgcaacga agttcaagac 5040 tgatatcact cttaccgttc ctcgatgaca aaggaatact acgagtaggt ggcagattga 5100 ggcatgctac cataccagaa gaggctaaac atcctgccat actcccctct cgtcatcatg 5160 ttaccagact gctgatagtg cattatcacg agaagctttt tcatgcagga gtgcaaacta 5220 ctttaaactc gatacgagag gaattctggc caatcttagc gagaaacagc gttaaggaaa 5280 ttatacataa atgtgttcaa tgtcgaaagg cgagcccgaa ggctagttgg cagttgatgg 5340 gacaactgcc ggatgtttga gtaaatgctg ccagaccatt ctacaataca ggagtagact 5400 actgcggacc cttttatgta agagaccgta tccgccgtaa tagtaagcaa tacaaagctt 5460 atgtgtcaat cttcgtgtgt atggcagtta aggccgtaca tatcgaactg gtagaggacc 5520 taacgacgga ctcatttatt gcggctttaa agcgatttgt ggcaagaaga ggtaaagtaa 5580 aaaacatata ttccgacaat ggaacaaact ttgttggagc ggatcgagta ttgcaaaata 5640 ccctgaaaga caaggaattc aagaaggaga tccaagattt tgcaaccaga gaaatgataa 5700 attggcactt cataccggcc aggtgtccac actatggagg tctttgggaa tcagcagtgc 5760 ggttactaaa gctacattta agacgcacta taggcgaatc atgtctaact gtttctgaaa 5820 tgacgactgt attggtacaa gtcgaagcag ttctaaattc cagaccatta actccgttgt 5880 cagaggaccc aaatgatctg cacgcattaa caccaggtca ttttttaatt ggtgaaaatt 5940 tgcaagtgta tcctgacatt gatctaagag atgttccaga aaaccgatta agcagatggc 6000 aacatgtgga gcaacttaaa cagcgatttt ggtacagatg gcaacgggaa tacctaaata 6060 catgtcaaca gagaaccaag tggaagtcta aggccaagga gtcgttcaag gttggtcagt 6120 tggttatgct gaaggagggt gaaaataaac cattaaaatg gatgttagca aggatcttgg 6180 agatacaccc tggttcagac aatttagtga gagcagtcac cttaaaaacc gaacgtggaa 6240 tttataaaag agctgtcgtt aacattgccc cactatgcga ctaaacgatg tgatgtcata 6300 cgttatatta gttatgtaag agtttatagt ttatttgttt aaaatttttg cgtttaattt 6360 agaatagata gcgatctcat aaatggtaga gttacttgaa gaattttatg ttgaaaataa 6420 aaacaatttt tttatttgtc aaggggggcg gcg 6453 // ID Copia-13_SI-I repbase; DNA; INV; 4043 BP. XX AC AEAQ01018600; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_SI_; KW Copia-13_SI-LTR; Copia-13_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4043 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01018600; Positions 4417 375. XX CC Positions [1432-1965] - Integrase core CC 'GATTT' target site duplication CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 403..2475 FT /product="Copia-13_SI-I_1p" FT /translation="MKKEQGQSMIEFVNGFVDKVEQLEDAGIKLPDELVSV FT IMLNSLPIEYENFCVAIESRDNIPTIDFLKTKLIEEEARRIDQDDGKNQDP FT ENNALVANNKGSKKNVKDTHTKYTKNKFNGKCYKCNKFGHKSTECKSKFKH FT NTGNNTNDAMFACARHTGIQVSTQWCLDSGATSHMCHDGRKFSELDASKKC FT DVYTATDDSVGSEGVGDVRMKVKLKDSETNDIKLKDTILVPRFRSNLLSVS FT RMTDNGYTVMFKRNCAFVNRQDGSTALIAKRQGQLYVVDEAEQSRAQAAQD FT AGNDSPLRWHQRFGHLNLADLKKLKSKEMVIGLSMKGESEELKCEICDKCK FT IHQLPYKSSTKREKEVLGLVHSDICGPMRVPSLGGAKYFVTFIDDKSRYIE FT VVMLKRRSDVVSAFKAYKKRAEKETGCQIKKIRTDNAKEYLSKEFNDFLEE FT EGIKRQLSVEYCPQQNGVAERANRTLVEMARCMLLQSGVPNTLWGEAINAA FT AYIRNRCPTKILENLTPIEAWSSEKPYVGYMRIFGSKVVALEKKPHRDKFK FT PRGKEYILVGYSQESKAYRLWSPGTGTVVKRRDVRFTEDLRNKTEDTKEIL FT ETSLNPDWHKLTSTQRTNGESNENSDNSEWEEENEEQFNTPEEEIDTPTTS FT RRGPGRPKLLRTGKRGRPRKLYNETKQGEEASKSTGEEDDQA" XX SQ Sequence 4043 BP; 1433 A; 694 C; 963 G; 953 T; 0 other; ggtgcgcgtg agataaggaa cacgtgagta attcaagtcc tgttttacga caaaatacta 60 acctagaaag ctcatcatgt ctgtctctat cggcacgcaa aatatcgaga aattaaacga 120 tactaactat gaatcgtgga aaattcaaat aaagagcgtg ctcgtatgta atgaactgtg 180 gaaatacacg agtggatcgg aaactcgtac gcctgaaaac agcgttacgt ggacgtcgaa 240 ggatgagaag gcattagcat taattctact gagcgtctcg aagaatcaac tgagtcacat 300 taagaaggta gcaacgtcgc gagaagcatg ggaaaagctg agtagcatat atgaatccag 360 aggcccggta aggaaatcag tcctgtataa gcaactgtat ctatgaagaa agaacaagga 420 cagagcatga tagaattcgt aaatggcttc gtggataaag tagaacaatt ggaagatgct 480 ggaatcaaat taccagacga gctggtatcg gtaattatgt taaattcatt accgattgaa 540 tacgaaaatt tttgtgttgc tatcgaatcg cgagataata tacctacgat cgattttctc 600 aagactaagc ttatcgaaga agaggcacga cgcatcgatc aagatgatgg caaaaatcaa 660 gatccggaga acaacgcact ggtggcaaat aataagggct ctaagaagaa cgtaaaggac 720 actcacacta aatacacgaa gaataaattt aacggaaagt gttacaaatg taacaagttt 780 ggacacaaat cgacagagtg caagagcaaa ttcaagcata acaccggaaa taatacaaac 840 gacgctatgt ttgcgtgtgc ccggcacacc ggaatccagg tatctacgca gtggtgtttg 900 gatagtggcg cgacgagtca catgtgtcac gatggaagga aattctctga attagatgca 960 agcaagaaat gtgacgtcta cacagccaca gatgactccg ttggatccga aggtgtcggc 1020 gacgtaagaa tgaaagtgaa attaaaagac agtgaaacaa atgacattaa attaaaagac 1080 acgatattag taccgcgatt tagaagcaat ctattgtccg tatctcgcat gactgataat 1140 ggttatacgg tgatgtttaa aagaaattgc gcgttcgtga atagacaaga cggttctacg 1200 gcgttaattg ctaagaggca aggtcagttg tatgttgtag atgaagctga acaatcgcga 1260 gcccaggctg cacaagatgc cggaaacgat agccctctac ggtggcacca aaggtttggc 1320 cacttgaatc tagcagattt aaagaaactg aagtctaaag aaatggtaat tggactgagc 1380 atgaagggtg aatctgagga gcttaaatgt gagatctgcg ataaatgcaa gatccatcaa 1440 cttccttata aaagctcgac taaacgagag aaagaagtac tcggtttggt gcactcagac 1500 atttgtggtc ctatgagagt tccttcgcta ggcggagcta agtacttcgt aacctttata 1560 gatgacaagt ccagatacat cgaggtcgtc atgttgaaaa ggcgttcgga tgtggtatca 1620 gccttcaaag cctataagaa acgtgctgaa aaggaaaccg gttgtcaaat aaagaaaatt 1680 cgtaccgaca acgccaaaga atatttatct aaagagttca acgatttctt ggaggaggaa 1740 ggaatcaaga gacagctaag cgtcgaatac tgtcctcaac aaaatggagt tgccgaacgt 1800 gctaatagga ccttggtaga aatggcacgc tgtatgttac tgcagtccgg agtccctaac 1860 actttatggg gagaagctat caacgcggca gcctacatcc gtaataggtg ccccacgaaa 1920 atattggaaa atttgacgcc catagaagct tggtcatccg aaaaaccata tgtaggatac 1980 atgcgcattt ttggcagcaa agtcgtagcc ctcgagaaga aacctcacag agacaagttc 2040 aagccgagag gtaaggaata tatattggtt ggttactcgc aggaatccaa ggcttacagg 2100 ctgtggagcc ccggtacagg aaccgtagta aagagacgtg atgtaaggtt caccgaagat 2160 ctaagaaaca agacagaaga cactaaagaa atacttgaaa cgtcattgaa tcctgattgg 2220 cacaaattaa cttctactca aaggacgaac ggagaaagca atgaaaattc tgacaactca 2280 gagtgggaag aggaaaatga agaacagttt aatactccag aagaagagat tgacacgcca 2340 acaacttcaa gaagaggtcc cggtcgacca aagctattaa ggactggaaa acgaggtaga 2400 ccaaggaaat tatataatga gacgaaacaa ggagaagaag caagtaaatc aacaggcgaa 2460 gaagacgatc aagcatgact agatgccaag caagatgaat acgactcatt gatgaagtac 2520 aacacgtggg agctcgtacc acgacctaaa gcgaaaaagg tgctatctaa tcgatgggtg 2580 tacaaaataa agaggaagca ggacgggtca atacttaaat ataaagcacg tctggtagtg 2640 cgaggctgcg agcaaacgta tggtgttgat tacgaagaga tattcgcgcc tgtagcaagg 2700 tatgagacca tccgtgcttt tctcgctgga tgcgttcagg aagagatgca tgtgcatcag 2760 atggacgtcg tcaccgccta cgtacaaggc gatttagcgg acgagatcta tatggaacaa 2820 cccggaggtt tcgaggctca agtgaggaag ataaggtctg tttgctaaag aggcctctct 2880 acgggctgaa acaagcaggt cgatgttggt atggcaaatt ggacaattat ttaaaaagta 2940 taaacatgat aaatagtgat atagatcctt gcgtttatgt aagtaaaagt agagatataa 3000 taatcgtagt atacgtagat gatttgctta tcgcgtctaa aagtctttgt aaattgcaaa 3060 aatttaaaga aactttacga agtaaatttc aaatgaatga tttgggtcct gtaaatgata 3120 ttctcggaat aaatgtagaa agggatgggc ctaccggaaa aatgaggtta acgcaacgca 3180 aatacatcat agagacgcta caaaaattta acatggaaaa ttgtaagcca atatcaacgc 3240 cacttgagtc aagtcaaaaa ttaacgaaaa caatggattc gaacacggga gacatgaagc 3300 acaagccata tagagaatta ataggtagct tgatatattt agctaacgcg acgcgtccgg 3360 atttagcatt cgttgcaagc gcattaagtc gtttttgtac agatccgaaa gaggctcatt 3420 ggaaactcgc gaaaagaaca cttcgatatt tacaatgtac tatagattac ggaattatat 3480 atactaaaag tttaagaaaa tggtcgctta tgcagattca gattgggctg gcgacattga 3540 agatcgcaag tcgtgtagtg gaaatgtaat tgtattagct aacggtccaa taagttggaa 3600 gtcgaagaaa caaaaatctg tggcgctctc cacgatggag gccgaataca tgtcgctttc 3660 cgaagtttgt aaagaaatag tatatttaag acgattgtta gaacacatag gatttaagtc 3720 gtgtgtaaca ggtccaacgg aagtgtactg cgataatcaa agcgcgatag agctaaataa 3780 aaatcacgta tttcatggtc gaagtaaaca cattgacatt agatatcata attcacgaga 3840 aatgagagat aaaggtgaaa ttagtttaag ctatttacct acggataaaa tgttagcaga 3900 cttgttgaca aagcctttag tcaaagcgaa acacgaagaa tgtgtgagat tattaggatt 3960 aagacaatga tgtagttagg aggagtttgt tttcacaaat ctaaaatttt gtatgcattg 4020 ttatgtggat actttaagac gaa 4043 // ID hAT-57_HM repbase; DNA; INV; 3559 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-57_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3559 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2045-2045 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 975..3251 FT /product="hAT-57_HM_1p" FT /translation="MSSRAKLTRSQTELYLLGETITELTGSKLPSLRMALG FT FFLYHHLELKETIRQSSAVTITEITKFWQKARIPMRDHQNCQTKLEQTFEE FT WRLLKKNKARKTTTQLGRESAFVLRLDDLFDVAHADALTNTSVLQEDKDFL FT VAQREKGRRGSMAGVDETLAAKEKRASKRREQMFARQQRMEQMNQLAVSTA FT ELKSSSSESENSTAEDVEETETSDNADEGAVGGTTFQTPKRKRGRKTVVTP FT ELAAALDRTKMSDRKAVFVIAETAKSLGQNINEFALNRDSIRRHRLEHRVQ FT RSANIKAEFRDNVPLVVHWDGKLIPDLLGKEKVDGLPVLVSGKGVSQLLTV FT AKLPSGTGEAQAAAVFGAIENWGIADSIRAMCFDTTSSNTGRLAGACVLLE FT QKLEKELLSLACRHHIMELIIGAVFQVCMGATSCPEVPLFKRFQQYWGFID FT TANYEPGIATDDVERLVADIKQDTIDYANKHLEQSQSRDDYKEFLELVIIF FT LGAAPARGVRFMSPGAMHHARWMSKVIYSLKIWMFKAQFKLTPAEVRGLRD FT VCVFTVRVYLKAWISAPQASGAPYNDLLLLKSLIEYSSIHSAISKSTSRKF FT SNHLWYLSQELVSLAFFDSRVSSSTKRLMVSAMQNEEDQDQDHSKRITVDL FT DSFGNKNLEDFVTAKSMNLLRMMEMPYGFLTVDPDLWEDRDDYKLAAETVE FT SLKVVNDHAERGVALIQEYSGFLTRDETQLQFLLQVVEDHRRMYPDSRKQT FT LSGLPKP*" XX SQ Sequence 3559 BP; 1148 A; 644 C; 711 G; 1056 T; 0 other; tagggtggag cttatttttg aacttttgaa atatgatctt cttaccctct cattttgttc 60 ctatatatga aaaaataatt caggcaaaat atgagaacaa ttaaacaata tttaggggta 120 gctcaatgat gataaagttt cggatactcg acagctgcgg tttggctaca cttgattttt 180 acataatatt attacttata ccagaataat tctcatgtta aattaagaac ttaattttaa 240 ctacaaagct aaattgcact gtcctaaatc atcatatact ccacaattat ggtctatact 300 atcatatttg atcattgacc aaccaatgca atcatggcaa atggaaataa cccagattca 360 attttttgct gactgtgtaa tttttacata aatcaaaaca taatcccaga cagacaggat 420 cactagctag acagacttgt ttactgcaat gctgagttgc tgactaagca acactgttaa 480 agctgtatgc ttattttgtt tactaaatag aggcattgtt ttagttatta aaatataaat 540 ggcaatttaa gttgaaatat tgtttaaata aattttcttg attttcaaat ttaaagaaga 600 tttctataat tttggtgttg actttcttct ctccagtacc tgattctcat caattggtaa 660 attaaaagat tgatatatta aaatacctta actctttgtg tcttgtgagt gtaaaattgt 720 aaatagtaaa tatattaata attaattaat tctatattat aatatggacg ttaaactgat 780 aatgctgatg aatgtgatga tgatgatgag ataaataaat gaacataaat tatatgtaat 840 atacaatata tttttaaaaa taattaaata aaccatacaa tacatgttaa aatttctgac 900 ttcaaatggt caaatttttc caatgcattt tttatattta tttcagataa atattcaatt 960 ctgaattatt gaaaatgtca tctcgggcaa aattaaccag atctcaaact gagttgtatc 1020 tacttggcga aactattact gaattaacag gcagcaagct gccatcatta cgtatggcgc 1080 ttggattttt cttgtatcac catctcgaat taaaggagac tataagacaa tcatcggctg 1140 ttaccatcac ggaaataaca aagttttggc aaaaagctag aatcccgatg agagatcatc 1200 agaactgcca aacgaagctt gaacagacat ttgaagaatg gcgtcttctt aagaagaaca 1260 aagcgcgaaa aacaacaact caacttggga gagagtcggc ttttgtttta agacttgacg 1320 atctctttga tgtcgctcac gccgatgccc tgaccaatac atcagtttta caagaagaca 1380 aggatttctt agtagcccag cgagaaaagg gaagaagggg gtcaatggct ggagtagatg 1440 aaacacttgc tgctaaggaa aaaagggctt ccaaaagaag ggagcaaatg tttgccaggc 1500 aacagcgaat ggagcaaatg aaccagctag ccgtttctac agctgaactg aaatcatcta 1560 gttctgaatc tgaaaacagt acagcagagg atgttgaaga gactgaaact agtgacaatg 1620 cagatgaggg agctgttgga ggcaccactt ttcagacacc taaaagaaag aggggacgaa 1680 aaactgttgt tactccagaa ctcgctgcag ctctagatcg aacaaaaatg tcagaccgca 1740 aggctgtttt tgttatagca gaaactgcca aaagcttagg acagaatatc aatgaatttg 1800 ctcttaacag agactcaatt cgccgacaca gattggaaca cagagtccag agatctgcaa 1860 atataaaagc cgagtttcgg gataacgttc ctcttgttgt tcactgggac ggcaaactga 1920 ttcctgatct acttggaaaa gagaaagttg atggcctgcc cgtattggtt tcaggaaaag 1980 gagtctctca gctactcaca gttgccaagc ttccttctgg aacaggcgag gcacaggctg 2040 ctgccgtctt tggagccatt gaaaactggg gcatagccga cagcatccgt gcaatgtgtt 2100 ttgacacgac cagctcaaat actggtcggc ttgctggtgc ctgtgtcctc ctagaacaga 2160 aacttgagaa agagctctta tcactggctt gtaggcatca cattatggaa ctgataattg 2220 gtgcagtgtt ccaagtgtgt atgggagcaa catcttgtcc agaagttcca ctattcaaac 2280 gtttccagca atactggggt ttcatagaca cggctaatta tgagccaggg attgcaactg 2340 atgatgtcga acgtctcgtg gctgacataa agcaagacac tatcgattat gccaacaagc 2400 atcttgaaca gagtcagtcg agggatgatt ataaagaatt cttggaactt gtgattattt 2460 ttcttggtgc tgctcctgcc agaggtgtgc gattcatgtc acctggagca atgcaccatg 2520 cccgctggat gtccaaagtc atatacagtc tcaagatctg gatgtttaag gctcaattta 2580 aattgactcc tgctgaagtg agaggattac gtgacgtgtg tgtattcaca gtacgtgtct 2640 acctgaaagc ctggatatct gcacctcaag catcaggtgc tccttataat gatctgctgc 2700 tgctgaagtc actgattgaa tattcatcaa tccattcagc gatctcgaag tcaacatcac 2760 gcaagttttc caatcacttg tggtatttgt cacaggaact tgttagcttg gctttcttcg 2820 acagtcgggt cagttcatca accaagagac tgatggtgag tgcaatgcaa aatgaagaag 2880 accaagatca ggatcattcg aagcgtatca ctgttgatct tgattcattc gggaacaaga 2940 acttagaaga ttttgtcaca gcaaagtcaa tgaatctgct ccgaatgatg gaaatgccat 3000 atggatttct aacagttgat ccggacttgt gggaagacag agatgactac aagttggctg 3060 cagaaacagt cgagtcttta aaagttgtaa atgaccacgc tgaacgtgga gtggccctca 3120 ttcaagaata cagtggattt ttaacacgcg atgaaacgca gctgcaattc ctgctgcaag 3180 tcgtagaaga ccatcgcaga atgtaccctg atagcaggaa acaaactctg tctgggctgc 3240 cgaaaccatg aatttaaaac aataattcct gagtttgctc tggacattgt caggctgata 3300 ccctaaactg taaaaactga gcttagaact catatttaat gtgtttaaaa atgttaattg 3360 tgttgtttaa aatatataga ttagactcgt ttctcgtata atttttgtaa tattggaggc 3420 acttgagcta cccctaaaaa caattatttc ttagccaaac tttttatgca gtgtttctat 3480 gttatatgga acagatttgg ggggtaagac tttaaaaaat acaaatttta ttttttttta 3540 ttttataagc tccacccta 3559 // ID hAT-2N2_BF repbase; DNA; INV; 436 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-2N2_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2N2_BF; hAT-2_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-436 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-436 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 920-920 (2008). XX DR [2] (Consensus) XX CC This transposon shares ~100-bp termini with the autonomous CC hAT-2_BF. XX SQ Sequence 436 BP; 137 A; 101 C; 75 G; 123 T; 0 other; tagtggtgcg tacctaaagc ccggacttgg aattggacct aaaaatgttt aggtccaagt 60 ccaaaaaaaa cctaaatgtc cggagccaag tccggacttg caaaaagcta tctgtcactt 120 atattccctt tttttgttct aaaccatcta aaaccttcca tagacagtga aatttgcact 180 ggaaaaagac aaaaaacatt tcgtgtttac actggctgta tataatttgc atgtattttt 240 cacattgcat ttcaattcat gctaacatgc aattttatat gcaccatccc ccctccccct 300 caaaaaagaa aactttctag tgcacataat atgcaatgat gggttcaggt ccggacttaa 360 agtccggacc tgaacctgaa ctgctggacc tgaatccgga cctgaacctg aatatgagtt 420 taggtacgca ccacta 436 // ID Dparam2cons repbase; DNA; INV; 490 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mauritiana DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dparam2cons. XX OS Drosophila paramediostriata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup III. XX RN [1] RP 1-490 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones that show less than eight percent divergence. CC Dparam2cons. XX SQ Sequence 490 BP; 141 A; 132 C; 118 G; 99 T; 0 other; tttgggttcc acacgaactg accgaaactc aggaagaacg gcgaaaagtc acctgtgaaa 60 tgctcatcgc caggcacgat aggaagtctt tcctgcaccg aattgtgacc ggagatgaaa 120 aatggatctt cttcgagaag cccaagctct ccagaggttg ggttaataca gatggagcgg 180 tgccatcgac ttcaaagcca aatcgcttcg gaaaaaagac catgctctgc ctttcttggg 240 atcaaaaagg tctggtgtac tacgagctgc tccggcccgg tgaaacggtt aaccagattc 300 gttaccaaca acaactgaga gatttggacg taactcagct cgaaaaacgc ccataatggc 360 accaaagaca cgagggaaaa atcttgcgcc atgacaacgc tcccgcccac agaactcttg 420 cgacgaaaga agtgttttct gaacttggct gggaagtctt accccacccg ccgtactccc 480 cggacctcgc 490 // ID LOA_Ele6B_AAe repbase; DNA; INV; 6067 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW LOA_Ele6B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6067 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1424-1424 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. The consensus is ~80% identical to LOA_Ele6. XX FH Key Location/Qualifiers FT CDS 547..2232 FT /product="LOA_Ele6B_AAe_1p" FT /translation="MESKEESMDTVPTEIMEVDEGKKEDDLLQSVSADELG FT GAGSVSSSILNSDDEDDGVNVTIIATRTDMQEYQQNEQREILKPKLSGAKK FT KQFKKLLISGHGREEALSMVLAPSNVSTPKRPRNINSSTNSDGKPIPKKQK FT GLLNHQSVNKRMENIRQDQPTTMSGVQQPTYSEVAKWVRVGILPTDYPQTQ FT LTTEQMDIVQGAILQKVTQQRKESFKPKFSNCWQRTGHLVINCQDNDTADW FT LYSVVPALLPWEGAELAVIDADDIPRLEVMIGFFPQSVNDDNDTIRIFIES FT QNDGLSTDKWRIIQRNVLYEKHVEWFFTVDEASMQHFKACNFLINYKFGQT FT TLRKKGMYKSGSDGMAKSSDDPKGQELVSSVCSGNLHVPATCATDGKLPEL FT KSSKYHQQTNQNTGLPKDQIPQGIPEAQNHPGTSKSSSYFGHPVTGKCSGH FT SKDQQYSRTHTDPNSSEGSKDRNTPGLATDKNETGPPKNQNCPGTSKYHDV FT SEPSRDRNFSGAKKDRNISGPSKDRTISGPPKDRNVSELSKERNCSGLSKD FT RNSAKSKKKQNHLDH" FT CDS 2289..5966 FT /product="LOA_Ele6B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNNIKFIQINLHHAKGATGVLCQRFKKENFSVALIQE FT PWAHKSRVLGINSLIGKLIYDENSSKPRTAILIKGGIKFVPITQFITKDIV FT VIRLEIPTSRGKTEAYIASAYFPGDGDTVPPPELADFVNYCKMHNKYFIIG FT CDANSHHTIWGSTNINKRGEDLFEYISSNNMDLCNKGDQPTFVTKVRQEVL FT DLTMCSPVLSTNIKNWHVSDYESLSDHKHIVFEYIAGALIKIAFKNPRNTN FT WERYASKVQFENITSDNVIESTEQLDRIAENITEKILQAYDASCPVKEELT FT SRDVPWWNKTLNNLRKKSRQLFNHAKVSGQWDQYKKVLTMYNKEIRKSKRR FT TWRSYCENMEKTPDVAKLQKVLSKDHSNGLGTVRKTNGKFTVDTQETLKIM FT METHFPGSYLASSEDERGAHIDESCRPTGTISNSSIPNTIFTRSKVEWAIN FT SFEPFKSPGKDGIFPALLQKGGEALIDYLTNIFKASLTFGHIPKIWTQVRV FT VFIPKAGKKDKTNPKSFRPISLTSTMLKIMEKIIDEYIKSEYLRTIPLNRF FT QFAYQPYKSTVTALHNLVTKIESTLDKKEIALVAFLDVEGAFDNTTYKSIK FT KSMDKRNFDPSVRVWIMKMLKSREVSSELGESSITIRTTKGCPQGGVLSPL FT LWSLVVDDILNKLVERGFEVIGFADDIAIIVRGKFDNIITDRMQSALNCTI FT EWCQNEGLNINPSKTVLVPFTRKRKITLKSLIINGSILQYSSSVKYLGVIL FT DDKLNWSLHLEQVISKATNALWVSKKTFGKKWGLKPKMIHWIYTVIVRPRI FT VYASFVWWPKTNQITSQQKLAKFQRLVCMAITGAMHSTPSKALEAVLYKLP FT LHQYVQMEAERSALRLRRTTIFLEGNLSGHLSILKDFRIDKTVQMIEDWME FT KILNLDKPYEVINTNRQIWESGGPIISPGSIIFYTDGSKMNDSAGAGIIGP FT GIHVSIPMGRWTTVFFAEVYAILECAYICLKRNYRHARICIFSDSQAALNA FT LKCFTCQSKLVWECITILKQLAMKNQVHLYWVPGHCGIEGNEKADLLARQG FT ASSQYVGPEPFCGISKCVTQLELKQWKDNMIQFNWKAIKNLRQSKLFITPN FT KQITEKILNLNKKSLRIFIGLLTGHCPARYHLKKISRSQIDVCRFCGCEKE FT TSQHLLCDCPALFASRKKYFNKGILNPSEIWLQNPNLVVKFILKVIPDWDN FT SHHQITAVILNGNSPY" XX SQ Sequence 6067 BP; 2142 A; 1107 C; 1281 G; 1537 T; 0 other; gtgctgagtg ctgagagtaa ggcagcttat agttggctat agaagctctt agttttctag 60 aaaaataatt ctaaaagtga tccctttatg ggcaaagtta tcgggaaagc tagataaagt 120 gccatagagc gttgtgcaag tggaagacag caagatagtg cagtgtatcg tgtgaaaatc 180 aatgcgaaaa agtgcaattc ttgctggcat tatcctgcac tggggaaagt gaagtaaagg 240 gttacaagta gtggagtaga agggggatac aacccaacag caacgggcgg cattgtttgt 300 tgagtgccag tagtgacagt agtgcacata tagaaaggtc gtgtaggtgc tatcgagtca 360 ctaccactgc atcacgtgaa gagtgtatct aagaatctct caaaacagtg cgcggtgtgt 420 cagaaggcga atattaaaaa aaaggaaatt gttttgtata tatatatata tatttttttt 480 ttcttctcaa tttggaaaat tattaacaaa aatctaaatt aaaaataatt cgtgttgtgc 540 agcattatgg agtcgaagga agaatccatg gatactgtac ctaccgaaat catggaagtg 600 gacgaaggaa agaaagaaga tgatcttcta caaagtgtct cagcggatga acttggggga 660 gcgggatccg tctcgtcctc cattctgaac tccgatgatg aggacgacgg ggtgaatgtt 720 accatcatag ctacacgcac tgatatgcag gaatatcagc aaaacgagca aagagaaatt 780 ctaaagccaa aattgagtgg ggcaaaaaag aagcaattta aaaaactgtt gataagtggt 840 catgggcggg aggaagcact ttcgatggtc cttgcaccat caaacgtgtc gaccccaaaa 900 agaccacgaa acatcaacag cagtaccaac agtgatggaa aacctattcc caaaaaacag 960 aagggactgc tcaatcacca gtccgtcaat aaacggatgg agaatattag gcaggatcag 1020 ccgacaacaa tgagtggagt tcaacaacca acatacagtg aggtggctaa atgggtacgg 1080 gtaggaatac ttccaacgga ctacccacaa acccaactga cgacggaaca gatggacatc 1140 gtacaggggg ccattctaca aaaagttaca cagcaaagaa aagagtcctt caagccgaag 1200 ttttccaact gttggcaaag aactgggcat ttggtaatta actgtcaaga caacgatacg 1260 gcagattggt tatattcggt ggttcctgcg ctactcccat gggaaggcgc ggaactagcg 1320 gtaatcgatg cggacgatat accaagattg gaggtgatga tcggcttttt cccccaaagt 1380 gtaaacgacg acaatgacac tatccgaatt ttcattgaaa gtcaaaacga tggcttgagt 1440 accgacaaat ggagaatcat tcaacgcaac gttctctacg agaaacatgt tgaatggttc 1500 ttcactgttg atgaggcatc catgcagcat ttcaaagcat gtaatttcct catcaactac 1560 aagttcgggc aaacaactct gcgaaaaaaa ggtatgtaca aatctggttc agatggaatg 1620 gctaaatcga gtgacgatcc caaagggcaa gagctggtaa gtagcgtatg ttctgggaat 1680 cttcatgtcc cggcgacatg cgcgacagac ggcaagttgc cggagttaaa gtcctctaaa 1740 taccatcaac agaccaatca aaatactgga cttcctaagg accaaattcc tcaagggatc 1800 ccagaagccc agaaccatcc agggacctcg aaatcctcga gttattttgg gcatccagtg 1860 actgggaagt gctctgggca ttcaaaagac cagcagtact ccagaacaca tacggaccca 1920 aattcctcag agggttcaaa ggaccggaat actcctgggc tagcgacgga caaaaatgaa 1980 actgggccac caaagaacca gaactgtcct gggacttcaa agtaccatga cgtctctgag 2040 ccttcaaggg atcggaactt ctctggggca aaaaaggacc ggaatatatc tggaccttcg 2100 aaggatcgga ctatttctgg gccgccaaag gaccggaatg tgtctgagct ttcgaaggaa 2160 cggaattgtt ctgggctatc aaaggaccgg aattcagcaa aatctaaaaa gaaacagaac 2220 catctggatc attaaaagac catagacgtt ttttaaaatt gaatatatct aataaaagta 2280 aaaacaaaat gaataacatc aaatttattc aaataaacct tcaccatgcc aaaggagcaa 2340 ccggagtcct ttgtcaaaga ttcaagaaag aaaactttag cgtggcattg atacaagagc 2400 cctgggctca taaaagccga gttcttggaa tcaatagtct gataggaaaa ctaatttatg 2460 acgaaaatag ttcaaaaccg cgaacagcaa ttttaataaa agggggaatc aaattcgtac 2520 ctataacaca attcattacc aaagatatag ttgtgattcg gttggagatt cctacatccc 2580 gggggaaaac agaagcctat atagcttctg cttatttccc tggagatggg gatacagtac 2640 ctccacccga actggcagat ttcgtaaatt actgtaaaat gcacaataaa tacttcataa 2700 tcggatgcga tgcgaattcc catcacacaa tatggggaag caccaacatt aacaaaagag 2760 gtgaagacct ttttgaatac atatcttcaa ataatatgga tttatgtaat aaaggagatc 2820 aaccgacatt tgtaacaaaa gttagacaag aagtattgga cttaacaatg tgtagtcctg 2880 tactttcaac aaacataaaa aactggcatg tttcagatta cgaatcatta tctgaccata 2940 aacacattgt ttttgagtac attgctggtg cattaatcaa aatagctttc aagaatccta 3000 ggaataccaa ctgggagcga tatgcgtcaa aagtgcaatt tgaaaatata acctcggata 3060 acgtaatcga atctacagaa caactggaca gaattgcaga aaatataact gaaaaaatat 3120 tacaggcata tgatgcaagc tgtccagtca aagaagaact aacaagcagg gatgttcctt 3180 ggtggaataa aaccttaaac aatttaagga aaaaatctcg gcaacttttt aaccatgcga 3240 aagtttctgg gcaatgggat caatacaaga aggtccttac gatgtacaat aaagaaatac 3300 gaaaatcaaa aaggcgaacc tggaggtctt attgtgaaaa catggagaaa actccagatg 3360 tagcgaaact gcaaaaagtg ctttctaaag accattcaaa tggtttggga actgtacgga 3420 aaacaaacgg taaattcact gttgatacac aggaaactct taaaataatg atggagacgc 3480 atttcccagg atcatattta gcgtcaagtg aagatgaacg gggtgctcat attgatgaga 3540 gttgtaggcc aaccgggact ataagcaatt catccatacc taataccatc ttcacgcgat 3600 ctaaagttga atgggcaata aactcctttg aaccgttcaa atcgcctggg aaggacggaa 3660 tttttcctgc acttctacag aagggtggag aagcactgat tgattatcta acaaatattt 3720 tcaaagctag tctaacgttt ggtcacatac caaagatatg gactcaagtt cgtgttgtct 3780 ttattcctaa ggccggaaaa aaggacaaaa caaatccaaa atcttttaga ccgataagtc 3840 ttacatctac catgcttaaa atcatggaaa agattataga tgaatatatt aaatcagaat 3900 atttgcgcac aattcctctt aataggtttc aatttgctta tcaaccatat aaatctactg 3960 tcacagcttt acacaatcta gttacaaaaa tagaaagtac tcttgacaaa aaagaaattg 4020 ctctcgttgc atttcttgat gtggaaggtg cttttgataa tacgacttac aaatctataa 4080 aaaaatcaat ggataaaaga aactttgacc ccagcgttag agtttggata atgaaaatgc 4140 tgaaaagtag agaagtgtct tccgaattgg gggaatcatc gataacgata aggactacta 4200 agggttgtcc tcaaggaggt gtattgtcgc ctttattatg gtcccttgtt gttgatgaca 4260 ttctcaataa actagtagaa agaggcttcg aagttatagg atttgctgat gatatagcca 4320 taatagtacg tggaaagttt gataatatca tcacagatag aatgcaatct gcactaaact 4380 gcactattga gtggtgccaa aacgaagggc taaatataaa cccatcaaaa acagttttag 4440 ttccatttac tcgtaaaagg aaaattacac tcaaaagctt gataattaat ggatccattt 4500 tacaatattc atccagtgta aagtatctag gggtcattct agatgataaa ctaaactgga 4560 gcttacactt agagcaggtc ataagtaaag ccacgaatgc tctctgggtg agtaagaaaa 4620 catttggtaa aaaatgggga ctcaaaccta aaatgattca ttggatttac acggttattg 4680 taagacccag aattgtttat gcctcatttg tttggtggcc taaaacaaat caaattacat 4740 cccaacaaaa gttagcgaaa tttcaacgtt tggtctgtat ggctataaca ggagcaatgc 4800 acagtacacc atcaaaagcc ttagaggcag ttctttataa gcttccatta catcaatacg 4860 tgcaaatgga agctgaaagg agtgcgctaa ggctgcgaag aactaccatt tttctagaag 4920 gcaatctttc tgggcatctt agtatactga aagattttcg tattgataaa acagtacaga 4980 tgatcgaaga ctggatggaa aaaatcctga acctagataa accatatgaa gtgattaata 5040 cgaatcgcca aatatgggaa tcaggagggc ctattatttc accaggatcg attatttttt 5100 acacagatgg ttccaaaatg aatgatagcg ccggagctgg tattataggc ccggggattc 5160 atgtttcaat acctatggga agatggacaa ccgtcttctt tgcagaagta tacgccatct 5220 tagaatgtgc atatatatgt ttgaaaagaa actacagaca tgcaaggatt tgcatatttt 5280 cagatagtca ggcggctctg aatgcattaa aatgtttcac atgccaatca aaattggttt 5340 gggaatgtat aacaattctt aagcaattgg ctatgaaaaa ccaggtccat ctgtactggg 5400 tgccaggcca ttgtgggatt gaaggaaacg agaaagccga tttacttgcc agacaaggcg 5460 caagttctca gtatgtagga ccagaacctt tctgtggaat atctaaatgt gtaacacaat 5520 tggaattaaa gcaatggaag gacaatatga ttcagtttaa ttggaaagct ataaaaaatt 5580 taaggcaatc aaaactcttc ataactccaa acaaacaaat aacggaaaaa atcttgaatt 5640 taaataaaaa atcactaaga atatttattg ggctgcttac gggacactgt ccggcaagat 5700 atcatttgaa aaagataagt cgaagtcaaa ttgatgtctg tcggttttgt ggctgtgaaa 5760 aagagacctc tcaacatctg ctttgtgact gtccagcatt atttgcaagt aggaaaaaat 5820 atttcaacaa gggcattcta aatccttctg aaatttggtt acagaaccct aatctagtag 5880 taaaatttat tttaaaggtc attccggatt gggataactc gcatcatcag ataacggcag 5940 ttatcctgaa tggtaactct ccgtactgag cacgttgcga tagtaaacag gggtatacca 6000 caatagatca attaatggtc gcagtggttc aaatcccaac aaaaaaaaaa aaaaaaaaaa 6060 aaaaaaa 6067 // ID R1-3_AP repbase; DNA; INV; 5510 BP. XX AC Contig971; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE Non-LTR retrotransposon. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-3_AP. XX NM R1-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-5510 RA Jurka J.; RT "Non-LTR retrotransposons from pea aphid."; RL Repbase Reports 9(8), 1796-1796 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC The termini are approximate. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 787..2142 FT /product="R1-3_AP_1p" FT /translation="MEDMEDTQEPGSPVQGTSHQPPRSPMSSGESPEQKKK FT KGSPDPVPLARESIGWIRYSLETAATKKSNVPVETQRSMFDKLKTLDSAIH FT DMVIMNLQLASQLEESRRSAEICVGAAAAQFGTKLWLREVAHEQTLEAVVA FT RYAEREALRLHELEKRGPPPEQVAMQEVPAETFAQVTRAARPAKTTAGRKP FT DRSVSRAANRNKALKEAKLTEHIPAYIIKSCGGKDPKEVRSVVWSKVASQN FT IQPKCHSIIAKDGRVIIKPMNRETADILKSLSKSSSLIVEDSPRWPRVMFR FT GVQTDIRSEDLQSSIVSQNAHLNIDEDTTDEILRPIFKQGKRDMDTTNWVM FT EVNPRYYDRFEDAIVYIGFMRCKVAAYEEVTQCHVCLKYGHPASKCQEKEP FT RSTHCSRAGHKAADCPAAEGDPTCANCRGKHNARDKTCSARTAFLLGRAKR FT TDYGVTQ" FT CDS 2142..5162 FT /product="R1-3_AP_2p" FT /translation="MSAQPLTIVQLNMGRASAVSDLLLEFCQNNEVDVALV FT QEPYTNRGRLVGFEVAPFRCYLSKPTRRRGGLLHLDHGAAIIVFNPNLVVA FT ARDSDLIENFVSIDIDCAADGIITMISGYFKFRVPTAIHVEVLDRIFQSTT FT NEVLIALDANAFSTRWFSRINDRRGETLTTWLDEHDLHTVNTRSPFTTFNG FT PRGRTNIDVTIFGQSLLGKIGDWKVVPDVTTSDHQVLMYTLHLQRRQFIHR FT TTRFNLDQRQRDAFVQEFARATTQRDDEHENLDSKAQHLHEDITVAAELFA FT PRRNPRKKVVPPWWSPELTRLRKEVRNASRRCRQTGDRVVYNTRRNEYTQL FT LRSSKVASWRRFCTLEGKNPWGKLYRWMKGNKATVAIGLIKRADGSLCQDI FT DESVTTLLNVLIPNDPTRPVGVRTEAVTGNLNPIACPELKALAWTIAPNRA FT PGTDGITGSMVRALWLTLAPRLLSITNECLARARFPESWKVAQVVPILKGK FT DRDVMQPKSYRPVSLLPVLGKVIEKVINLRLQEQITPSLTGRQYGFTRDRS FT TFDAFQNLLTWSDLRQERHVLTIFHDITGAFDNLEWTALQRDLQSLGASNH FT IRLLIADYLSGRTAKITIGGVEKSVRLTKGCPQGSILGPVLWNVTMEALLR FT VVFPEHVNIQAYADDVAMSVAAPNRRILIERAEEALIPVLAWAEERGLQFS FT AQKSVAMITKGDMVPGFTLAFGGERIVTVDCVKYLGIRLDQKRQYTEHLDE FT MKKASETIFTKMRGTLGSGWGMKRENIAILYRGVFLPKITYGARFWAHTTT FT SNRAIKLLGSIQRRALIGMTGAYCTMSTDALQILAGVPPLDLEIHWMVLKA FT EAATIPNHLRPQTLANGREELLEAWQARWAATQKGRWTYRIFPEVRNRLKL FT PLALGHEVTQFLSRHGNFRAKLAGFDLQPTPLCVCGDGEEDVGHVLFDCAL FT HIEHRAHLELAVHRAGYLWPCDPAVLVSTKRIYSALVRFAKTAAYLERTVR FT A" XX SQ Sequence 5510 BP; 1444 A; 1317 C; 1554 G; 1195 T; 0 other; gtgtccactt tacagtggag tccgtcagac gtgtgttgct acagttgatc gcgctccgag 60 gggcgccctc tagcggtttt ccacggcata cgagctaagt cggtgagttc ttgcaaattt 120 tttgtgtttt cagttcgagc cagtgcataa taagttcgcg tacagaccgg tgccggtttt 180 gagtcggtct gtccaaaacc ccctctgtgg ggaccgtttt tgtgtccgcc cataaggttt 240 acacggccgc cttgtggtaa tagctggtat ttgccgagcg atttttggat tttcgtcagt 300 tacgagacca ggcgtcaagt gcctgacggc gtgagttcgg ctattatata attgtctatt 360 tctgtgcatc acagccccca cttagtggct ggtataggta gtataccacc ccctccatct 420 ttctttgcgc agttatatct ccgtaagttt tacacggatc aacccgggac caagggggaa 480 cccagtttct tggggttagg ttatcgggaa caaagccccc acccccccca cccgttttga 540 gttaagtata actcggcggg aaatagtggt agagtcgtag gggcaacgcc aatctatccg 600 ccagggattg ggccataatt ttccacatta gggaacgggg caccccccca cccccctttt 660 tgagttttgt tttttcgatt ggaaatcggt tcatcagtac cataggacct tcccccccac 720 agggaaactt ggggtttcga atttaacccc cccgaaaggg gtgacaaaaa ttgagtttcc 780 tccaaaatgg aggatatgga ggatacccag gagccaggtt cgccggtgca aggcaccagt 840 caccagccgc ccaggtcccc catgtcatct ggggaatcac cagagcaaaa gaagaagaag 900 ggatcaccgg accccgtgcc tctggcccga gaatcgatcg ggtggatccg ctacagcctg 960 gagacagcgg ccacgaagaa atcgaacgtg ccggtggaga cccagcggtc catgtttgac 1020 aagttaaaaa ctttggactc ggcgattcat gacatggtca tcatgaacct ccagctcgca 1080 agccagctgg aggaatcgag gaggtcggca gaaatttgcg tgggggcagc agccgctcaa 1140 ttcgggacta agctctggct aagggaagtc gcccacgagc aaacccttga ggccgtcgtt 1200 gcaaggtatg cagaaaggga agcgttgcgc ctacatgagt tggagaagag aggtcctccc 1260 ccagagcagg ttgccatgca ggaggttcct gctgagacct ttgcccaggt gacaagagcc 1320 gccaggccgg ctaaaaccac tgctgggcgt aaacctgaca gatcagtgtc cagggcagca 1380 aataggaaca aggcactcaa ggaggcaaaa ctaacggagc acattccggc gtatataatt 1440 aaatcttgcg gtgggaaaga cccaaaggag gtacggagcg tcgtatggag caaagtggct 1500 agtcaaaata tacagcccaa gtgccactct ataatagcta aagatggtag ggtcatcatt 1560 aagccaatga accgtgaaac ggcggatata ttgaagtccc tatctaagtc ctcgtccctc 1620 atcgtggaag acagcccccg atggcccagg gttatgttta ggggagtcca aacggacatc 1680 aggtctgagg acctgcagag ttccattgtc agccagaatg ctcacctaaa catcgacgag 1740 gacacgacag acgagatcct gaggccaatc tttaaacagg gtaaacgaga tatggacaca 1800 acaaactggg tcatggaggt caacccaaga tactatgaca ggtttgagga tgctatagtt 1860 tatataggat tcatgaggtg taaggtagcc gcatacgagg aggtgaccca atgtcatgtc 1920 tgcctaaagt acgggcaccc agcctcgaag tgccaagaga aggagccgag gtccacgcac 1980 tgctccagag caggtcataa agcagcggat tgtccagctg cggagggtga ccccacttgt 2040 gctaattgta ggggaaagca caatgctcgt gacaaaacct gttcggcaag gaccgcgttc 2100 cttctgggga gggccaagag gaccgactac ggggttacgc aatgagcgcc caaccactca 2160 ccattgtgca gctaaatatg ggcagggcct cagcagtcag tgatctactg ctggaatttt 2220 gccaaaataa tgaggtcgat gtcgctctgg tacaggagcc ctatacaaac agaggtaggc 2280 tggtgggatt tgaggtcgct ccttttaggt gctacctatc gaaacctacg aggagaagag 2340 gaggcctact gcatttggac catggtgccg caatcatagt ctttaaccca aacctagttg 2400 ttgcggctag ggactcggac cttattgaaa actttgtttc aattgacata gattgcgcgg 2460 ctgatgggat tatcaccatg ataagtggct attttaagtt cagggtccca acagcgattc 2520 atgtagaggt actggatcgc atctttcaat ccacaactaa cgaggttttg attgccctag 2580 atgctaacgc tttttcaacc aggtggttca gtcgtataaa cgacagacgt ggggaaaccc 2640 tgaccacctg gctcgatgag catgacctac acacagttaa cactcggagc ccgttcacga 2700 ctttcaacgg acccagaggg aggacaaata ttgacgtgac aattttcggc cagagtttgc 2760 tgggaaaaat tggtgattgg aaggtcgtcc cggatgtcac cacgagtgac catcaagtgt 2820 taatgtacac tctgcatctg cagcgaaggc aattcatcca ccgaactacc agatttaatc 2880 tagatcaaag acaacgagat gcgtttgtgc aggagttcgc acgtgctaca acgcagagag 2940 atgatgaaca tgaaaacctg gacagtaagg cgcaacacct gcatgaggac atcacggtgg 3000 cggccgagtt atttgcccca aggagaaacc ccaggaagaa ggtggtccct ccttggtggt 3060 ctccggagtt gaccagactc agaaaggagg ttcggaacgc ttcaaggcgg tgcaggcaga 3120 ccggggaccg tgtagtgtac aataccagaa gaaacgaata tacccagctg cttaggtcaa 3180 gtaaggtagc ttcttggagg aggttctgca ccctagaggg gaaaaacccc tgggggaagt 3240 tatatcgatg gatgaagggt aataaggcaa cggtagctat tgggctcatt aaacgtgctg 3300 atggatcatt atgccaggac attgatgaat cagtgaccac tcttctgaat gtgttgattc 3360 caaatgaccc aacgcgcccg gttggggtcc gcacagaagc agtgactggg aatctcaacc 3420 caattgcgtg tccagaactg aaggcactgg cttggactat cgctccaaat agggcaccag 3480 gcacggatgg aattacgggg tcaatggtgc gtgcactctg gctaacgtta gcccccagac 3540 tgctcagtat tacaaatgag tgcctagcta gggctaggtt ccctgagagt tggaaagttg 3600 cccaggtcgt accgatctta aagggtaaag acagggatgt tatgcaaccc aaatcataca 3660 ggccggttag cctgctgccg gttcttggga aggtgattga gaaagtaata aacctcaggc 3720 ttcaagagca aattacccca agcctcacgg gtaggcaata cggttttacc agggatagat 3780 caacctttga cgctttccaa aacctactca catggagtga cctcaggcaa gaacgacatg 3840 tcctgacgat ttttcatgac ataaccgggg catttgataa ccttgaatgg actgcactgc 3900 aacgtgatct acaaagtttg ggagccagca accacattag attactgatt gcggactacc 3960 tcagcggccg cactgcgaaa ataactattg gcggggtgga aaagtcagta cggctcacga 4020 aaggatgtcc gcagggttca atcctggggc cagttctatg gaatgttaca atggaggcat 4080 tgctaagagt cgtatttccc gaacatgtta acatccaagc ttatgcggat gatgttgcca 4140 tgtcggttgc ggcaccaaac agacgaatcc tcatcgagcg agccgaagag gcccttatac 4200 cggtgctggc ctgggccgag gaaagaggcc tacaattctc tgctcaaaaa tcagtggcca 4260 tgataactaa gggagatatg gttcctggtt tcacgttggc attcggtgga gagcggattg 4320 tcacggtgga ttgtgtgaag tacctgggca tccggctgga tcagaagagg caatacaccg 4380 agcaccttga tgaaatgaaa aaggcatcgg agacaatttt cacaaaaatg agagggaccc 4440 ttggatccgg gtgggggatg aaaagagaga acatcgcaat tctgtacagg ggtgtcttcc 4500 taccaaaaat tacgtacggg gctaggtttt gggcacacac aactacgagt aacagagcaa 4560 ttaaattatt gggctcgata caaagaaggg cactaattgg aatgactggg gcatattgta 4620 caatgtccac cgatgcactg cagattttag cgggggtccc accactagac ctcgaaatcc 4680 actggatggt tttgaaagcg gaagcggcga cgattcctaa ccatttgcgt ccacagactt 4740 tggcaaatgg cagagaggaa ctgcttgagg catggcaagc gaggtgggcc gcaacacaaa 4800 aaggtaggtg gacctacagg attttcccgg aggtgcggaa taggttaaag ctacccttgg 4860 ccttgggcca tgaggttaca cagtttctgt ctaggcatgg aaacttcagg gccaagttag 4920 caggcttcga tctacaacca acgcccctgt gtgtgtgcgg agatggcgag gaagacgtgg 4980 gccacgtact gtttgactgt gcccttcaca tcgaacatag agcgcacctg gagctggctg 5040 ttcacagggc agggtatctg tggccttgtg acccggcggt gctggtctcc acaaagagaa 5100 tttacagcgc actggtgaga tttgccaaaa cagcggccta cctggaacga acggtacggg 5160 cataatacag gacggacacg acactctgcg cgccagagtg ccaaaccccg catgtgttct 5220 gaacgagtgt caaggagtga gttggcagaa gcggttcaca tgtggggccg tcctgagggg 5280 acactcgtgc gtcggggacg cgggtggcgc tgacgcggac aacagtggtt cgcggacaac 5340 cggaggcgcc gcggtgaccg aaagcttcgg tgagtaggtc acactacatc gggctctgtg 5400 tctggggcca tggaagaata cagacacagt ctgccccgct tgtcgtacaa ggcgacttaa 5460 ggggcaggct tgcgacctac gcaccgcggg cgagcgcccg gggtaaaggc 5510 // ID piggyBac-N1_CQ repbase; DNA; INV; 617 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous PiggyBac DNA transposon family from Culex DE quinquefasciatus - consensus. XX KW piggyBac; DNA transposon; Transposable Element; nonautonomous; KW piggyBac-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-617 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 99-99 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. The TTAA TSDs and CCC..GGG termini indicate that it CC is CC a non-autonomous piggyBac element. XX SQ Sequence 617 BP; 246 A; 94 C; 101 G; 173 T; 3 other; ccctctacaa cccaaccccg cctctagacg ggcttcgatc taaaaaatcg ccaaaaatcc 60 attttccaac caatttttga tcttcaaaaa gcattggaaa gaagaactct taaaattttg 120 gaaaatttca gggttggaag tttgacttgt tttatgtgac tttgccaatg tttttaaaaa 180 tgtcattttt ttaggggtca actttggctg tgttttttac taacatttcc tatattttma 240 gtaaaaagaa gtatgcagta atttttctag tgccccagac tatgcctcta cgcatttttt 300 tgcaatttaa atgatgatgg tkccattcta tagcagaaaa tgtgaaaaac atgcaaaaaa 360 ttgaaaaagt kactgtaaaa acatgaaaaa attagatagg caaaatgtaa tgataggagg 420 tggtagagta ggccaaatac taccaaaaac aaacataaac taaacaagat aaatgcaaat 480 taaaatacta aaaatgaaac aagaaaaaca taaaacaaga gaagtaaagt ttttcgtaga 540 acaaaagttg ctcaaaatga cctcctgaac acgggaaaaa taaaaatttt cgaaaaaaaa 600 atttgggcag tagaggg 617 // ID Gypsy-22_OD-I repbase; DNA; INV; 6206 BP. XX AC CABV01001706; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_OD_; KW Gypsy-22_OD-LTR; Gypsy-22_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6206 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001706; Positions 17361 23566. XX CC Positions [2891-3364] - Reverse transcriptase CC Positions [4665-5141] - Integrase core CC 'ATTAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2957..4873 FT /product="Gypsy-22_OD-I_3p" FT /translation="MPNLREVFNEVRPGIQYLASVDFRSGYWQIVLDTRDR FT YKTAFTWKGQTWQYTRLPFGLTCAGQIFSRAVSEALETIPDLEGIFVYVDD FT VLLATPTFSAYLKKLRSLFEASRKFGFRLHPGKCHFLAKSVKFLGRVLSPQ FT GMSVDPDHSSGIDALTPPTTRTELRCLLGRLTFIREFLDCRLHERIDTSCF FT SKLVYELNKLNRNGDFYWSHEANAAFENVKKRLKSSPIISYPDPSKDFLLV FT TDASEVAVGAVLLQLIDGKESIVGVASRTFTAVETRWSTTEREAYGILFGV FT RRFDYFLRSRPFVVKTDHAALQYIDSVDHKNAKLSRWMDELSSYRFCVEYI FT PGEENVWADMFSRPFGLKKPAHSDVHVPAGKFVDFENGLRAYVPSWCTKNQ FT VRQPVGTSPKLLREGITRALLARKDEFILNATAEGYLKLACDQRDNHSIRG FT LIDALQKFKHDPKAIKLDKNDHNYDVYRVNWPFFMICARTDLLLHVKDGKH FT RQVLPPENVARLLHHSHDSCAHPGTSRVADSLQSYWWPSKDERHQELRRLM FT RHLCQEERKSRSSVESRYRPLQTWRKTIRRYLHRLHRLRQGKWQALLLHNS FT RQLHKVLHRQACAWRTRNRCCQMYCRRSGTSTSVHPQICFQ" FT CDS 4494..5813 FT /product="Gypsy-22_OD-I_4p" FT /translation="MILVLIRVPVGLLTRFRATGGRPRMKDIKNYVASCVI FT CAKKKGSQGHPSNPDIGHCKRGERPFDVIFIDFTVFDKVNGKRFCCTILDS FT FTKFFIAKPVPGERAIDAARCIVEEVVLRHQCIPKFVSSDKGTAFISATMK FT ELYKQLGMVAEYHTAWRPQSTGNLERQHRTFKSMIFMMQHERRMTWIDCVP FT FVTAYMNSHKNKGTGQAPHTLVTGRRPRFHLPETIHDPDVSSDSPSTYGKI FT IRERLELLHSIAKIANKAADLELEKKLDKQHRARPLNPGDQVTLYRPASKR FT AKTAGYDWLGPFTVVVAEEKVVKIKDQQENVEWVHRAHLRYVPERLARLRN FT LEMMRELELLEASTSQDPLISEENRPESASGTYPQIADALPKLGRPATRNL FT DKSSGTVRKTPQLTSKKNQPAGSTPSASTFSSSETSDFSQIFYNFS" XX SQ Sequence 6206 BP; 1660 A; 1728 C; 1397 G; 1421 T; 0 other; actggtgact ctctggagct catcaatctt ctcgtacgac tcaacggact actcgtctct 60 cacaggcaag taacacgtcc ctcaccagat cacctggttt accactcggc cacttgtgat 120 caacaggaag aattcttgac ttctcaccct cgaggacctc ttggtgcgtg atcttcgttc 180 gatcgcggtc tgcttcagcc tggacttacc tgctcacgct caccgaagcc ccctaagtag 240 taattgaacc tttagaaggt tgattactgg ggtaaaagca ggtactttat ccagttccgc 300 cgattctgct caaacattta gccgccaccc gcgattcttt tgggccaact cacgaattcg 360 acgacttgaa aacttcacct ggcacacttg acttgctttg ctcatcaaat accccaaatc 420 aaggatttta agtttttcca gaaactgaaa tccggggtca acgcaggtac aagcttctgt 480 tagcagtgac tctgctcaac ctagattgcc atctgaacaa caatctcgtc gccgctgttg 540 ctggcgttaa caacaacgaa ccagccacaa cgatcttccg ccaacttgct tccgcgcaac 600 gcaaccgact tcttcacaac gatgacttct caacctgcag ctccgctttg gttgctttcg 660 ataccaagcg agtccatcga caacgccagc cgtgcgaatc ttcaactttg gtccaagcaa 720 ctccacgagt tcacggccag caactttaag aacacggctg aaggtgtcgc tgcccaagtc 780 gagactgcat ttgctcaggt gaaactcatg cgtgagaagc ttgatagttg cgaaccgccc 840 agccgccaaa ccactcccat gacaccaagc tatctggttg aaatccgaca agccttcgaa 900 ctggagaagg agaccgatct tcgagaccct gaccagtgcg aaatttggat ccgcaaaatc 960 cgccaagcca agtcggactt cgtcgaccat ttgcaggagc acttgaagcc tgacgccgaa 1020 gctctcctgg cccgaacagc gcgtaaatac ttcagtaacg aagtccgcca gtcattccag 1080 cgccagaatc ttcccgtcaa caccgtcgcc gaactcacag aggccatgga gacgatgtac 1140 agctccaagc tctcggtatt ccagtacatg aagctcgttc acgaggtcaa gtacacgaag 1200 ggcaagggtc ttcaaacctt cataagcgag ctaaccacgg cacagcgtcg agcgttccgc 1260 catgtcgaac gtctccaccg tcagcgtgcc aaagtcgaaa tcgtcgaagt tgaagtaacc 1320 gccactaccg ctgcttctgt cgtaaagaag gaacctggag agaccggaac tagatgctcg 1380 acatgcacat gtgctataaa agctcatgaa ttctcggatc tcacggctgc atgcttggtt 1440 tacctgaaag tcagcgaact gtacccgaaa cgtcgcactg aaactctgcg acaccatcga 1500 tgagtgcgag accgcacagg acgtctataa taagtccaaa atactcatcg atcgtcttcc 1560 agcaaatatg acgaacgaag attcttctgc ctacggcgcc aagttcggta actggaccaa 1620 aaagtccaac ggccagcccg agaagaaaac atcgctagag tcacagctca agacgatctg 1680 tgatcgccta aacaccatcg aggggaagaa acaggagccc aagactgaag gtgaaccatc 1740 caagaagcac aacaagagca aaaatcgagg tcagaagaaa actgacgcca atgtcgctga 1800 aacttctgaa gctacagctg atatcctcga acccagctcg caacattttc gttcgtcagt 1860 gtagagccgt gcgtgcctac actgaccaat tctcatcgcg cacattcccg aacagttcgc 1920 ccttcacgtc cttggcaagt aaacttcaag ctcactcttg gtgggtcaac ttttacagtc 1980 gaagacggta ttttggactg tggaagcgac gatttccttc ttccgaaacg tctcattccg 2040 ccaaatctgc tacagcatct tgacgagtcc aactatatca tcaaaggcgt gaacggagcc 2100 tccaaggctt ctggacagtt cgaatctgca gtgtgcattg gcgacgctac cttcaatgac 2160 atccggataa tcgtcacgga ccatgaacgg actccaccgc ttattggacg aaccgtgatc 2220 gaccatcctt cgaccgtcat ttttgggcga cgaggtacag aaatcttcgt ccagcgccga 2280 ctcgcaccag actccgagat cgtgactcaa tcgttcgagc tcaatggaga aagcagatac 2340 tggaacacga cgcgaaaaac tccttcagct cctgctcagc aaatcggcct tccagttcgc 2400 catgaagctg acttgagctc tggtgctcaa aatgtcgatg atgatgtctt ttcttgcgca 2460 tctctgaacg acacgagtat ccaagctaga aatggtactg taagtacttc aataccgccc 2520 aaacgcgaca aacctggtca aaacgcgaat acttcggaac ttcgcacttg gctcgagacg 2580 cacaagaatc tggttctacc caagcatcac gttaatcctg acgaacttca ctccatcgcc 2640 aagctgctcg ttgagtttga agacgtcttc ggaggtgaaa acgagcccct gggagagttc 2700 actaagccgg taaggattcc gactactgga aagtcagcct gccgcaagca acaccagatc 2760 cagcacagaa accaagaagt agtcgacgct gaagtcgaac gaatggctgc tgctggcgtc 2820 atcgagccat gcgcgaaccc gcgtggtttc aacactccgc tccactgcgt ggaaaaaaag 2880 gacgagaagc ccaagagttg taagcaactt caagaatacg ctcaacaccg tattgaagga 2940 tcctgacccg tacacgatgc cgaatttgag ggaagtcttc aacgaagtac gtcctggcat 3000 ccagtacttg gcatccgtcg actttcgctc cggctactgg cagattgtcc tagacactcg 3060 agatcgctac aaaactgcgt tcacgtggaa aggacaaact tggcagtaca caagacttcc 3120 atttggactt acttgcgccg ggcaaatctt cagccgagca gtcagcgagg cactcgagac 3180 gatcccagat ctcgaaggca tcttcgttta tgtcgatgat gttctgctcg cgacgccgac 3240 ttttagcgcg tatttgaaga aacttcgaag cttgtttgaa gcttcacgca agtttggctt 3300 ccgacttcat ccaggcaagt gccactttct ggcgaaatct gtcaaattct tgggacgagt 3360 tttatcgcct caagggatga gcgtggatcc agatcattct tctggtatag acgcgcttac 3420 tccgcccact acaagaactg aacttcgatg tctactcggc aggcttacct tcatacggga 3480 gtttctggat tgccgactgc acgaacgaat cgacactagc tgcttctcca agctggtata 3540 cgagttgaac aagctgaacc gcaatggtga tttctactgg tcccatgaag cgaatgcagc 3600 atttgagaac gtgaaaaaac gactcaagtc cagcccaatc atctcgtatc cagacccgag 3660 caaagacttc ctgcttgtta cagatgcaag cgaagtcgca gttggtgcag tacttcttca 3720 gctgatcgat ggaaaggagt ccatcgtcgg agttgcgtca cgcaccttca ccgctgtaga 3780 aacgcgctgg tcgaccacag aacgcgaggc ctacgggatc ttgtttggag ttcgtcgatt 3840 cgactacttt ctgcgtagcc gcccattcgt cgtcaaaacc gaccatgcag cgctacaata 3900 catcgattca gttgaccaca aaaacgcgaa gttgagccgt tggatggacg agttatccag 3960 ctaccgtttt tgtgtagagt acatacctgg cgaagagaac gtctgggctg acatgtttag 4020 tagaccattt gggttgaaga agccagctca ttctgacgtt cacgtccccg ccggtaaatt 4080 tgtagacttc gagaatggtc tacgggccta cgttccaagt tggtgcacga aaaatcaagt 4140 tcgccaacca gtaggtactt ctccgaagct tctgcgtgaa ggaatcacca gagcactact 4200 tgctagaaaa gatgagttca tactcaacgc tactgctgaa ggttacctca aactcgcgtg 4260 tgatcagcgc gataatcaca gcattcgtgg actcatcgac gctctgcaaa agttcaagca 4320 cgatccgaag gcaattaagc tcgacaagaa cgatcacaac tatgacgttt atcgagtgaa 4380 ttggcctttc ttcatgattt gcgcacgaac agatctcctg ctccacgtaa aagacggaaa 4440 acatcgacaa gttctacctc cagaaaacgt cgcccgtctg cttcatcatt ctcatgattc 4500 ttgtgctcat ccgggtacca gtcgggttgc tgactcgctt cagagctact ggtggccgtc 4560 caaggatgaa agacatcaag aactacgtcg cctcatgcgt catttgtgcc aagaagaaag 4620 gaagtcaagg tcatccgtcg aatccagata tcggccactg caaacgtggc gaaagaccat 4680 tcgacgttat cttcatcgac ttcaccgtct tcgacaaggt aaatggcaag cgcttctgct 4740 gcacaattct cgacagcttc acaaagttct tcatcgccaa gcctgtgcct ggagaacgcg 4800 caatcgatgc tgccagatgt attgtagaag aagtggtact tcgacatcag tgcatcccca 4860 aatttgtttc cagtgacaag ggaactgctt tcatctcagc tacgatgaaa gagctgtaca 4920 agcagctagg gatggtcgct gagtatcaca ccgcctggag accccaaagc acaggaaatc 4980 tggaacgaca gcatcgcacg ttcaagtcca tgattttcat gatgcaacat gagcgaagga 5040 tgacttggat cgactgcgta ccctttgtga ctgcgtatat gaacagtcac aagaacaaag 5100 gtaccggtca ggcaccacac acactcgtca ctggtcgccg tcctcgtttt catctgccag 5160 aaaccatcca tgatcccgac gtcagctcag actcgccctc cacgtatgga aaaatcatcc 5220 gtgagcgcct ggaactgctt cactccatcg ccaagatcgc caacaaagcc gctgatcttg 5280 aattggagaa gaaactggac aagcagcacc gagctcgacc gctcaatccg ggagaccagg 5340 tcactctgta ccggcctgct tccaaacgcg ccaagactgc tggttacgac tggctcggac 5400 cgttcaccgt cgttgttgct gaagaaaagg tcgtcaagat caaagatcag caagaaaatg 5460 ttgaatgggt gcaccgagca catttacggt acgtacctga acgtctcgcc cgccttcgta 5520 atcttgaaat gatgcgcgag cttgaacttc tcgaagcttc aacttcgcag gacccgctca 5580 tttcagaaga aaatcgaccg gagtccgctt ctggaactta tccacaaatc gctgatgcac 5640 ttccgaaact tgggagaccg gcaactagaa atctagataa aagttccgga actgtgcgaa 5700 agactcctca acttacatca aagaagaatc aaccggctgg ctcaactcca tcagcttcaa 5760 ctttctcttc cagcgaaacc agcgacttct cgcagatctt ctacaacttc tcttgattcg 5820 cgctcgacaa ctttaggacc gaaccctgcg ctcatagact tgttttctaa aatagagcaa 5880 cgtcagatgc gacaaaaatc tactagggtt cgcaaaaaga ccagcaaaat ttctcttgat 5940 ccgaagaaaa agtcgtacaa ctaacctgtc gctgattggc tctgctgcca gcatttactc 6000 atcgcaaagc cgggtaaaat tcttctagac ttgcttgtga agcgccgttt tcctttctgc 6060 gcgtcatttt cctttcgacc gccgcgccgc ttactcatcg ccgcactcgt ctccagctca 6120 actttcgact cctgatcata tctactcaac cttctactac attcgagaat ctcctatcgt 6180 cagcagactc tagaagtttg gagggc 6206 // ID TBSAT1 repbase; DNA; INV; 177 BP. XX AC K00392; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.brucei satellite dna. XX KW SAT; Satellite; Simple Repeat; Repetitive sequence; TBSAT1; KW Satellite repetitive element. XX OS Trypanosoma brucei OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma. XX RN [1] RP 1-177 RA Sloof P., Bos L.J., Konings F.A., Menke H.H., Borst A.P., RA Gutteridge E.W. and Leon W.; RT "characterization of satellite dna in trypanosoma brucei and RT trypanosoma cruzi."; RL J. Mol. Biol 167(1), 1-21 (1983). XX DR GenBank; K00392; Positions 1 177. XX SQ Sequence 177 BP; 64 A; 24 C; 28 G; 61 T; 0 other; ctaataaatg gttcttatac gaatgaatat taaacaatgc gcagttaacg ctattataca 60 caataacttt taatgtgtgc aatattaatt acaagtgtgc aacattaaat acaagtgtgt 120 aacattaatt tgcaagtttg caacgctgtt ctttagtgtt taatgtgtgc aacaaag 177 // ID Tx1-14_CQ repbase; DNA; INV; 4879 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-14_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4879 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 646-646 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 156..1313 FT /product="Tx1-14_CQ_1p" FT /translation="MEKRVRVNTLKVTFKRGSKEPTDAEMFQFCREQHFKP FT EEIYSVHKDKELGAIMIKLKNENLMRAAIQALQPVLNFSYSDGTTTEVTVS FT DADNSFKYVRVFNLPPEIDDKEIYQALSQYGTVRQQVRERYHQDTGFPVFS FT TVRGLYMEISKEIPPQIRIRHFQARIYYDGLVNKCFICRSPDHVKQNCPRR FT AAPVTKTPAQGRLYSDVAAVKSLLPGFLNRPLSETDSCQMTPLGKPVGGKQ FT SPPDKPEVPVEPMEAAMEQSEGLTSPPSALQVQLVESPVFRPNLEPEGTPD FT QGKQPDQEGKWQLQGRKGXSKKRPESFSPGADTSKKLLRSQSLRSRSRSQR FT RHPKGVPETSEPQGALSTSDGQSDDQIFKKPLPPPSPSPEQQK" FT CDS 1243..4599 FT /product="Tx1-14_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MDNRTTRSSKNHCHHHPHPLNNRSKFNFYPKMTFSKK FT IATINLNSIKTEVNKSLLRDFVQNEDLDFIFLQEVLFEDFSFLGAYRAFVN FT ISEHDKGTAILVRKNQEFRNIIYDPSGRILSIVCDDINLVNVYAHSGTNKK FT RERDDLFRDEITPHVIKGNCSHTVIGGDFNCVLNKRDSRSTSANLCQGLKD FT LTTSMHLKDVANVRNGSNVEFTFHRAGSASRLDRFYVSAQFLDKVVEIRTV FT PVAFSDHSAVVMKFLVDQSALCQRGRGYWKLNSSLLNKDDISCRYVVDFYQ FT LKDREIYTRNRSTWWNTVVKPKTQFFFKNESRIFNCRINSAKSELYAKMYD FT LAAERRNGADNRXQLQEVKSELMRMELDRLKFYGSRFPQTSMLEGEKXSVF FT HVSNQMHKGCDGSQFQLRDGDDLITDSKRLKKKIEDFFTGTFQAEDSASLE FT SSVEALRDVSNSLDLQQQQELVRPIDEAELKAVVAGAAKKKSPGPDGISYE FT FYSVYFEMLKNDMLELFNRYLDGSLKPPKEFTAGIITLIPKVNQADNLNEF FT RPISMLNTDYKIFTKILANRLHAVVGDLLGAGQVACTVNQSCSTNLIDVRT FT IIAKAAESKRFKGFLLSVDLEKAFDKVNHVFLWKVLEKFGVPERFINCIKN FT LYESATSRIIFNGILTSDVKIGSSVRQGCPLSMVLFVLYIEGLVRKLSANI FT RGVLLSGKFIRVLAYADDLVILIRDDAEFDLVLQIINSFAEASCIRLNRAK FT SAFIRINNAKGGPQQFREADEIKILGVIFTERWSAIVKRNFDKLLGDIQFR FT LSKHRTRNLNLIERTWLLNVFVLSKLWYICQVFPPDNVHIAKLRKIAGQFI FT WGPRQIFKTERKQLYLDYEQGGIKLVDPEAKAKALFIKTILYGTGNNQVED FT TTLLNFSKKQALSRNTREWLELAVSLKPNPMSTSVQTLYRHFINENRTTPK FT VEERFPALNWNQLWENLGQSFLSSECRSQMFLLYNDLVISKEKLVKYNIGR FT LQDDLCEWCGVPESNEHKIKHCVKTEEIWQWLRRTLTDRLGICGDPEDVLT FT RQINGRNQKEKAGLWLTMFLMSYIVKKYPKISLYCVQKGIRDSRWNNRTYF FT KHCFGSELNVC" XX SQ Sequence 4879 BP; 1385 A; 1090 C; 1263 G; 1136 T; 5 other; agacgttttt tctctattgc tcgtcgatag cgtattccaa gccttttatc tcaaccggaa 60 caatggactc gtccgttgtg tgattgctgc gatctgcaga attctattgt gtgcgcgaag 120 tgaggttata cccggatacc acgtttggtg gaaagatgga gaaaagagtg cgagttaaca 180 cgttgaaggt taccttcaaa cgtggttcta aggaacctac ggacgcggaa atgttccagt 240 tttgccggga acagcatttc aagccggaag aaatctactc ggtccacaag gacaaggagt 300 tgggagccat catgatcaag ctcaaaaatg agaatttgat gcgtgcggcc atccaggccc 360 tgcagccggt actgaacttc agctactcgg acggaacaac cacggaggta acagtttcgg 420 atgcggacaa ctctttcaag tacgtgcggg tcttcaatct tcccccagag attgacgaca 480 aggaaatcta tcaggcgttg tcgcaatacg gcactgtccg gcagcaggtc cgggaaaggt 540 accatcaaga taccgggttc ccggtgtttt ctacagttcg cggactgtac atggagatca 600 gcaaggagat acccccacag atccggatcc gacacttcca agccagaatt tattatgatg 660 gtcttgttaa caagtgcttc atttgccgga gccccgatca cgtgaagcag aactgtcccc 720 gacgagcagc accagttacc aagaccccag cgcaaggacg gctgtacagc gatgtggccg 780 cggtgaagtc tctgctgccg ggttttctga accggccgtt atcggaaaca gattcctgcc 840 agatgacgcc ccttggcaag ccagtcggcg ggaagcagtc tcccccagac aagccggaag 900 tgccagttga accaatggaa gcagcaatgg agcagagtga aggtctgaca tccccaccgt 960 cagctctgca agttcaactc gtggagtcgc ccgtctttcg tcctaatttg gaacctgaag 1020 ggacaccaga ccaaggcaaa caaccggatc aagaagggaa gtggcagctt caaggtcgga 1080 agggamgttc gaagaagcga ccggagtcgt tttcgcctgg agcagatacc tcgaagaagt 1140 tgctgaggtc gcagtcactt cgctcgmgga gtaggagtca gcgccgtcat cctaagggag 1200 ttccggagac aagcgagccg cagggagcgc tttcgacgtc tgatggacaa tcggacgacc 1260 agatcttcaa aaaaccactg ccaccaccat ccccatcccc tgaacaacag aagtaagttt 1320 aatttttatc ctaaaatgac cttctcgaag aaaattgcca ccattaatct aaatagtatt 1380 aagacagaag tcaacaaatc actgcttagg gatttcgttc agaacgaaga tcttgatttt 1440 atattccttc aagaagtttt attcgaagat ttttcattcc tcggagctta ccgagctttt 1500 gtgaacataa gcgagcatga caagggaact gccattcttg tgagaaagaa tcaggaattt 1560 cggaacatta tttatgatcc tagtggacgt attctgtcca ttgtctgtga cgatatcaat 1620 cttgtcaatg tttatgctca ctccgggacc aacaaaaaac gcgaacggga cgacctgttc 1680 agggatgaga tcacgccgca cgtgatcaag gggaactgct cacacactgt cattgggggg 1740 gacttcaact gtgtccttaa caaacgtgac agccggagca cctccgcgaa cttgtgccag 1800 ggattgaaag atttgacgac ctcgatgcat ctcaaagatg tggcgaatgt gcgtaatgga 1860 tcgaatgtcg aattcacgtt ccatcgcgct ggatcggcat cacggctaga tcggttttac 1920 gtttcggctc agtttctgga caaagttgtt gaaattcgaa cggttccggt tgccttctcg 1980 gaccattcag cagtagtgat gaagtttttg gtcgaccagt ctgcgctttg tcaacgcggg 2040 cgaggatact ggaaactcaa cagcagtctt ctcaacaagg acgatatctc gtgtagatac 2100 gtcgtcgact tctaccagct caaggatcgt gagatttaca ccagaaacag aagcacttgg 2160 tggaacacgg tagtcaagcc taagactcag tttttcttca agaatgagag caggatcttc 2220 aattgccgca tcaactctgc gaagagcgaa ctgtatgcca agatgtacga ccttgcggcg 2280 gaaaggcgaa atggagctga caacaggamm caacttcaag aagtaaaatc ggagctgatg 2340 aggatggagt tggatcgttt gaagttctat gggtcgagat ttccccaaac ttcgatgcta 2400 gaaggagaaa aawtgagtgt cttccatgtt tctaaccaga tgcacaaggg ttgcgacggc 2460 agccagttcc agttgcgaga cggagatgat ttgattacgg attcgaaacg tttgaagaag 2520 aagatcgagg acttctttac gggaactttt caggcggaag attcagcaag cttagaaagc 2580 tcagtggaag ctctcaggga tgtgtctaac tctttggact tgcagcagca acaagaactc 2640 gtaaggccga tagatgaagc ggaactcaag gctgtggtcg ctggtgcggc caaaaagaag 2700 tcacctggac cggacggaat atcctacgaa ttttactccg tgtattttga gatgttaaag 2760 aacgacatgt tggaactgtt caaccgttat ctggatgggt cattgaagcc gccaaaggaa 2820 ttcactgctg gaataatcac tctcattcca aaagtgaacc aagcagataa tctaaacgaa 2880 ttcagaccaa tatcgatgct caacacggac tacaagatct ttacaaagat tctggcgaac 2940 cggctccacg ctgtggttgg agatttgctg ggtgctggac aggtagcttg cacggtcaac 3000 caatcgtgct ccaccaactt gattgatgtg agaactatca tcgccaaggc tgcggaatca 3060 aaacgtttca aaggtttcct actgagtgtg gatttggaga aagctttcga caaggtgaat 3120 catgtcttct tgtggaaagt gttggagaaa tttggtgtcc ctgagcggtt catcaactgc 3180 atcaagaacc tgtacgaaag tgcaacatca cggatcatct ttaatggaat tctcacatcg 3240 gacgtgaaga tcggaagctc tgtacgccaa ggatgccctc tgagcatggt gctgtttgtg 3300 ctgtatattg aaggattggt gcggaaatta tcggcgaaca tccggggagt tttactgtct 3360 ggaaagttca ttcgggtgtt ggcgtacgcg gacgacttgg tgatactcat cagggatgac 3420 gcagagttcg atttggtgct ccaaattatc aactccttcg ctgaagcatc ctgcattagg 3480 ctcaaccggg ccaaatcagc atttattcgt atcaacaacg cgaaaggtgg tccacaacaa 3540 ttcagggagg cagacgagat caagatcctc ggagttatat tcaccgaaag gtggtccgca 3600 atcgtcaaac ggaattttga taagctactt ggagacattc agttccgttt gtcgaaacat 3660 cggacccgta acctaaattt gatcgaacgg acgtggctac tgaacgtttt cgtgctgtcg 3720 aagttgtggt acatttgtca agtgttccca ccagacaatg tgcacatcgc aaaactccgg 3780 aagattgctg ggcagttcat ctgggggccg aggcagatct tcaagactga gaggaagcaa 3840 ctttatctgg actacgagca aggagggatc aagctggttg accccgaagc gaaggcgaag 3900 gcacttttca tcaagaccat tctgtatggg actggcaaca accaggtaga agacacaact 3960 ttgctaaatt tttcaaaaaa acaagcgctg agccgaaaca ccagagagtg gctagaatta 4020 gcagtatcac tcaaaccgaa tccaatgtca acatccgtgc aaactctgta caggcacttc 4080 atcaacgaaa atcggacaac acccaaggtg gaggaacgat ttccagcttt gaattggaac 4140 cagctgtggg aaaatctcgg acaaagcttc ctgagtagtg agtgccgctc tcagatgttt 4200 ttgctgtaca acgatctggt gatctcgaag gaaaaactgg tcaagtacaa cataggacgg 4260 ctgcaagatg atctgtgtga gtggtgtgga gtcccggaga gcaacgagca taaaatcaag 4320 cattgtgtaa agacggaaga aatctggcag tggcttagaa gaacgttgac tgatcgtttg 4380 gggatctgtg gagatccaga ggatgtactg actaggcaaa ttaacggaag gaatcagaag 4440 gaaaaagcag gactgtggtt gacaatgttt ttgatgtcgt acatagttaa aaagtatccc 4500 aaaatcagtt tgtattgcgt tcaaaaagga attagggaca gtagatggaa taatcgcacg 4560 tatttcaagc attgttttgg cagtgagctc aatgtttgtt agtgtattag gagaacccac 4620 ggttgcccgg gagacccggg gcccccctga ccacaagtca cttccgagtg gaacagggac 4680 caccccccgg gctcctcggg caaccgtggt ggcctgggag ccggggcccc cccgaccacc 4740 caccccccgg gctcccatgg cgaccttgga aaagctgtgg aacaaatgta aaggattgga 4800 ggtggaacaa atgtaaagga ttgtacaaat gtagaaataa aaacttgtaa aaacccaaaa 4860 aaaaaaaaaa aaaaaaaaa 4879 // ID RTEX-8_BF repbase; DNA; INV; 1937 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-8_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-8_BF; KW RTEX-8_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1937 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1937 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1724-1724 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The 3' terminus is composed CC of the (CATT)n microsatellite. XX FH Key Location/Qualifiers FT CDS 1..1815 FT /product="RTEX-8_BF_2p" FT /note="RT." FT /translation="LSQIVPIHKSGDPSLPDNYRGISIISCLCKLFSSILN FT NRLVSYAENNSLFKPHQAGFRKNFKTTDNLFVLSTLVSKYLSKNSRLFACF FT VDFSKAFDSVWRNGLLYKLNKLGIQGNFLRTIQDMYSKTTNRVKYSQGLTE FT PFVTNCGVRQGCNLSPTLFNLFLSDITSVFDNDICNPPTMYSKRVSCLLYA FT DDLVLFSESKQGLQCSLNRLEEYCQTWHLNVNLRKTKIIVFTKGGRIPKDV FT YFMYKNNPVEIVTNYNCYLGIVVNSAGTFKANNKHLYSKGLKALFGINQSL FT DKADAPLSVRNKLFDTCVKPIILYGSEIWGSVKSPKSCPIETIHLKFCKQT FT LHVPRSTSGLAARAELGRFPIHLEASLNAIKHLIRLRQKVPADSFQSDALS FT CQIDLDKAGAKCWASGVRQSLEECGYGYVWHCPLQHNTNSSQIINSINQRL FT KDIYFQTFLREIHNDNKGGSAKNKLRSYRLFKSTYNVEQYLDIDNIRHRTA FT VTKLRVSCHKLHIETGRHNRTPLEQRICRYCNLDKLEDEHHFVVECSLYKK FT ERAELYRLIESMFPHFIQLSNIDKFIFLMNLDKPLLIKHICSYIFNITKKR FT EEAECTQ" XX SQ Sequence 1937 BP; 631 A; 388 C; 342 G; 576 T; 0 other; cttagccaga ttgttccaat tcacaaatcc ggtgacccat ctctcccaga caactaccgc 60 ggaatttcca taattagctg cttatgtaaa ctattctcct ctatattaaa caatcgctta 120 gttagttatg ccgaaaacaa tagtcttttc aaaccccatc aggcagggtt taggaagaac 180 tttaaaacta cagataatct tttcgtgtta agtacattag ttagcaagta tctaagtaaa 240 aattcccgtc tctttgcatg tttcgtagat tttagtaaag cgttcgactc cgtctggaga 300 aatggcctcc tttacaaatt aaacaagtta ggtatacagg gcaactttct tcgaactatc 360 caagacatgt attcaaaaac tacaaatcgt gtgaaataca gtcaaggtct gactgaacct 420 tttgtcacaa actgtggtgt cagacagggt tgtaatttaa gcccaacttt attcaattta 480 tttttaagtg atattacatc tgtctttgat aatgatattt gtaacccccc aactatgtac 540 agtaagcgtg tctcttgtct attgtatgca gatgatttag ttcttttctc tgaatccaaa 600 caaggtctac aatgttcttt gaatagacta gaagaatact gtcaaacatg gcacctaaat 660 gtgaacctta ggaaaacaaa aataattgtt ttcactaagg gtggtcgtat accaaaagat 720 gtgtatttca tgtataagaa caatcctgtt gaaatagtga caaactataa ttgttattta 780 ggtattgttg tcaattctgc gggtactttc aaggcgaaca acaaacactt gtacagtaag 840 ggtctgaaag ctttattcgg gataaatcaa tccctagaca aagccgatgc tcctttgtct 900 gtcagaaaca aactttttga tacatgtgtt aagcccataa ttctctatgg atcggaaatc 960 tggggttctg ttaaaagccc caaatcctgt cctattgaaa caatacatct aaaattctgt 1020 aagcaaaccc tacacgtgcc aagatccaca agtggcctcg ccgctagagc cgagttaggg 1080 aggttcccca ttcatttgga agcctcccta aatgctatta aacacttaat tagacttcgc 1140 cagaaagtac cagcagacag tttccagtca gacgctcttt cttgccagat agatttagac 1200 aaggctggag ccaagtgttg ggcgtcagga gttcgccagt cactggagga atgcggctat 1260 ggttatgtat ggcattgtcc tctacagcac aatacaaatt cgtctcaaat tatcaattct 1320 atcaatcagc gtctgaaaga catttatttt caaacttttc taagagagat acacaatgac 1380 aacaaaggtg gatccgctaa aaacaaattg aggtcttaca gactgttcaa gtccacttat 1440 aatgtggaac aatacctgga tattgacaat attcggcaca ggacggccgt cacaaaacta 1500 cgagtcagtt gccacaaatt acacatcgaa accggaagac ataaccgcac tcctctagaa 1560 caacgaatat gtagatactg caatctagat aagttagaag acgaacatca ttttgtagta 1620 gaatgtagtt tgtacaagaa agagagagct gaactctaca gactaataga atccatgttc 1680 ccccacttca tacaactaag taatatagac aaatttatat tccttatgaa tctagacaaa 1740 cctttactta taaagcatat ctgttcttac attttcaata tcacaaagaa acgagaagaa 1800 gccgaatgta cgcagtagaa tacgatcctt gagtagtgta gattcattgt atcattatat 1860 catgttgtat catttgttgt gacctgtact aaacccagtg ggtatgaatg tacaataaag 1920 gtcttcattc attcatt 1937 // ID BEL-18_AA-I repbase; DNA; INV; 3186 BP. XX AC supercont1.352; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-18_AA_; KW BEL-18_AA-LTR; BEL-18_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.352; Positions 1774 4959. XX CC 'AGATC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1244..3184 FT /product="BEL-18_AA-I_1p" FT /translation="MAKYPNETSAMKILHLRNSLEGDAKCKIDQDVMNNND FT YERAWKILEDAYEDKRLILDTHIDAILDCAVISSDNRGKSISQLVETCSKH FT VDALDGHSYPVEGLGELILLNVLYKKLDKETQEQWEMKIPRGEVPEYETFM FT EFLRERGRVLQRTNRSQQQVQQSTVQAKQRVAGIKAPTQALKSFMQTGSEA FT CPCCPEEHAIYRCMKFQDMNFSERKSVAARMSLCYNCLKLNHRVNDCQSNQ FT RYKVQGCGRKHHSLLHPEIWQNSVERSPTPEVDKDYESNQQEPDGRRATTM FT CALIDTTKRQVLLSTAEVLVVGHGGFTIKCRALLDSGSDSHILTERLANKL FT RLKMKRVDFPISGLNDIQTRVKYLVSTKIRSRINTYATSDLNFLVVPRISA FT RLPQVEIKSGSCRVPTGLQLADPSFHVPGEVDMIIGNEIFFDLVKAGRLKI FT ENTAITLAETEFGWIAGGSAQLKKDAPRAVQCQLNYRTQVGKIMEKFSEIE FT NDPLGIKSAAVDSAVKKRARNGRRRGVQQLSFNGKRYQLGDSFVMTRNRCE FT PSIYSSSGKRIWCNEFKANPYDAAKSKTLLVKHGVTRRDLQEQQLVNEMVR FT EKRDVLVEDSPFPASAGTVTRMSERLQPGRSIKSEVVTVLLSQRAQRGV" XX SQ Sequence 3186 BP; 906 A; 646 C; 887 G; 747 T; 0 other; tttttggtcc ttcgagccgg atgttgtgag tggttttccc ggatgttgtg tccggaagtt 60 gttgctgttt cgtgtgtgga agaagagcac caagtgcaca agtgcggtcc gtgaccgaat 120 gatacaagac gattatgtcc gtggcatagt gcgtgttcgg aaaaatttac tgcatcgtgt 180 gcagaagcgt gacccgtggc caaatcggaa ttcacgtcga tccgtgatcg atgattggtt 240 gagattgtgt ccgtggcatg atccgacgta ccgtgtgcgt agtaccaatg gctgtaggca 300 gtttgaattt ccggaagtta attgcatcgt gtgcagtgga atagcccgtg gctattagcg 360 aacaatgtat gaagtggccc gtggccaaag acaatcagtg agcgtgtacg tggcgtgatg 420 ttgcccggcg agtgcttcgt gttgcccgtg gctgagttta ggaatgcaca gtggcacagg 480 agtaagtgaa cattcgtgcg caccggttac gtattttggc taaaagccaa ggaagaaaca 540 gttccaaaac gtttatcggg ttcgtgacac gcgtggtgag ttggatgtgt ccgtgacgct 600 ttgcgagctc gattgaacag gagtgaacaa gttcgaacat gagcgataat gaaacaccgc 660 gtggagtgtt aaaaggcgta tcgaaaaaga accttttaga agcgcttacg gaatcacgta 720 gtgaaaacgt gcctcccgtt ccgtcagtgg cagatcaaaa gattgcgaag caagtgttcg 780 agcaaaagaa gatgatggag tccaagttgg aagtgctgac agatcggcga gatctgataa 840 cggagaagct gatacgaatg aaggagtctt tgcgagggga aggtgtgagc atacatttgc 900 tcaaccttca ctttgaaaca ctgcgacgat gtgcggatga attcgacagg attcatagcg 960 aaatctcggc attgctgccg agaaaacagc gaaccgtagt acgtcaggag tacgcggtat 1020 tcgaggatat ccacaacgaa ctttacgtag acttgcagac aagaattgca cgaacgcaag 1080 aagcatgtcg cgtaacttca ggtacgtcaa gtagtttaag cattccaggt tccagcagcc 1140 gattgtagtg caagcagctg ccccgcagct acacgctccg ttcccgaaat tcgatgggac 1200 accggagaat tggtacagct ttaagagcct cttcaagagc attatggcaa agtacccaaa 1260 cgagacttca gcgatgaaga ttttgcactt gcgcaattct ctcgaaggcg atgcaaagtg 1320 caagatcgac caagacgtca tgaataacaa cgattacgag agggcctgga agattctcga 1380 ggacgcgtat gaggataagc ggctgatact ggatacgcat attgacgcta ttctggattg 1440 cgccgtcatc agcagtgaca accgtgggaa atcgatttca caattggttg aaacttgttc 1500 gaagcatgtt gatgctttgg atggtcacag ctatcccgtc gaaggtttag gggagttgat 1560 tctgcttaac gtcctctaca agaagcttga caaggagacg caagagcaat gggaaatgaa 1620 gatacccaga ggagaggtac cagaatacga aacgtttatg gagttccttc gtgaacgagg 1680 acgcgttctg cagcgtacga accgctcaca gcagcaggta cagcagtcaa cggttcaagc 1740 gaaacagcgt gttgctggta tcaaagcgcc aacacaagca ttgaaatcgt tcatgcaaac 1800 gggtagcgaa gcgtgtccat gctgtccgga agagcacgcc atctacagat gtatgaaatt 1860 ccaggacatg aatttctcgg agcgcaagtc cgttgccgca aggatgagct tgtgttacaa 1920 ctgtttgaaa ttgaatcatc gagtcaatga ctgccaatcc aatcaacggt acaaggtgca 1980 aggttgcggt cgtaagcatc atagcctttt gcatcccgag atttggcaga atagtgtaga 2040 gaggtctccg acgccagaag ttgataaaga ttatgagtca aatcaacagg agccagatgg 2100 tcggcgtgcg acaacgatgt gcgcgctgat tgatacaacg aagcggcaag tgctgctatc 2160 aacggcagag gtcttggtgg tcggacatgg aggtttcacc atcaagtgcc gtgccttgtt 2220 ggattcagga tccgatagcc atattcttac ggagagattg gccaacaaat tgaggctaaa 2280 gatgaaacgc gtcgactttc cgattagcgg tctcaacgac atccagacca gagtgaagta 2340 tctggtatcg acgaaaattc gctcccgaat caatacgtat gcgacgagtg atttgaattt 2400 cttggtcgta ccaagaatct cagcaaggct tccacaggtg gaaataaaat ccggttcttg 2460 tagggtaccg acaggtttgc agttagcaga tccatcgttc catgtaccag gagaagtgga 2520 tatgattatc ggcaacgaga tattcttcga cttggtgaag gctggacgcc tgaaaataga 2580 gaacactgcg attacgttgg ctgaaacaga gtttggatgg attgctggag gatcagcgca 2640 gttgaaaaag gatgcaccac gtgcagtaca atgccaacta aattatcgca cacaagttgg 2700 caagatcatg gaaaagtttt ccgaaatcga gaacgatcct ctcggtatta aatcagcagc 2760 ggtcgattca gctgtcaaaa agcgtgcgag aaatggacga agacgtggtg tacaacaact 2820 ctcattcaac ggcaaaaggt atcagcttgg tgattctttc gttatgacga ggaaccgttg 2880 cgaaccatca atctattcat catctggaaa acgaatatgg tgtaacgaat tcaaggctaa 2940 tccgtatgat gcagcaaaat cgaaaacgct gttagttaaa catggagtga cgcgaaggga 3000 tttacaagag caacaattag tgaacgaaat ggttcgagag aagcgagatg tgttggtaga 3060 agatagtccg tttccagcat cggcaggtac tgtaacgagg atgagtgagc gacttcagcc 3120 tggacgatcg atcaagagtg aagtagttac tgttttgttg agccaaaggg ctcaacgggg 3180 ggtgta 3186 // ID BEL-32_CQ-LTR repbase; DNA; INV; 215 BP. XX AC AAWU01004057; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-32_CQ_; KW BEL-32_CQ-I; BEL-32_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 218-218 (2011). XX DR GenBank; AAWU01004057; Positions 7862 7648. XX SQ Sequence 215 BP; 53 A; 62 C; 45 G; 55 T; 0 other; tgttccggtg gctccccacc ggtgaattgg caaccctgca caactgtcag attttgattg 60 gctgacaaga tcaacaagaa ggaaaacacg ctgtcattct tctttcacca ctcgatgcga 120 tcagaagcaa atacacgttt agtttagtac gcggtcgtct ttaattctcc tccggtctcg 180 agtcccgtcg gcccagtcca aggtccactc aaaca 215 // ID Gypsy-9_IS-LTR repbase; DNA; INV; 162 BP. XX AC ABJB010083307; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_IS_; KW Gypsy-9_IS-I; Gypsy-9_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-162 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010083307; Positions 7876 7715. XX SQ Sequence 162 BP; 38 A; 46 C; 45 G; 33 T; 0 other; tgttacgtac cggcgccacc ggcaggtgga ctaaggagcc gatgccgcga gcctcgtgac 60 aatcacgtgc tagttagagc tcgggcactt gttcctggac cgagaaccgc gatggacgca 120 ccgtccacac tctaaagctg acagtaaagc ttgtatttat ca 162 // ID hATm-54_HM repbase; DNA; INV; 4098 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-54_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4098 RA Bao W. and Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1948-1948 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1775..2038,1986..3413) FT /product="hATm-54_HM_1p" FT /translation="MDCDLDNKNDSETDPDYSITAPAERNQTYLTNLAKAA FT IRYEISDVATAALATAVLIDYGIVKESNKAQIITEKKIFGGKKKNQQYYKL FT QKKKFLGEKKRISSIISSKHLADVKELEVIGVDGKKDKFTLTHSIRYNSNG FT DPVVVAAIKEEYHLTFTAESGPTSQSYLTHVVIPNGIGATMANATANVLLE FT FNSVQSLIAVILDNTSSNTGCDSGLVVKLEKIINKNLHLLGCLLHQNELPL FT RHIISELDGKTVSPNSFKGPIGEAASNLTLHEQPLFQFVPIKSEVELFVNK FT EIISDLSTDQRKLYEYCVGISKGFISERFVSKKPGPVCHARWLTLALRIMM FT LYVRTKEPSLELLAVTKYIVQVYCVMWFAIKHSGQYKDACLLLYKNIQLVK FT LQESRIQSIVFTNLQGNSYCCLQENFLYCMLQDENIHIRNRALNQILIIRK FT LKEDITFKTQLKPKSVMKINFDAGSWDFLVSVDEIQIEPPATLSITSEKIR FT EAFESGQKLCLIDFPNNSQSVERAVKLTSEAAKQVYGLEKRHKFILAKNQS FT RKENKRNVKKCDYFIK*" XX SQ Sequence 4098 BP; 1513 A; 584 C; 650 G; 1349 T; 2 other; taagccactg ccagcagcaa gatttaataa gagtcaccct gggatttgtt cttatactat 60 aaatatatag acttaaaaaa aaataaacgt gcaaaagttc ataatcattt agggatatca 120 atttaagtaa atttgtgatc acataattta aggacgaggg gcgttataat ttataatatc 180 gaaacttagt aaaaagttat aattagaata acaaagtcat atttttcggt aatttttgtt 240 ttaactgggt tgttgttttt ttataccttt tgttttacac tagaaataat gttttatatg 300 aacaaactat tattatacat aacttatgta tttagtactt tttcaataac ttgttagctt 360 taaagaataa atgtatttgc agtttatttt tgtttacact tttgattgat gcaaaagtta 420 ttatttattt aaaaaaaaaa tttttcctgt tgaattaatt aatttgaaat gtacataaat 480 tatataactt tataaaggta tataatatat atatatatat atatatatat atatatatat 540 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 600 atatatatat aaagaaaatg cctaatacaa taaaaagttt gagaactcgt aatgaaaaca 660 agagatatgc tggtttgcca tctgatcttc cagttagtga tcttccaact tataggcaag 720 tggttcaata ttcctacttt attttaaaaa acataaacag tgtcagaaat attaattctc 780 caaagattgt ttcaaaagtt gccagtgaca acttattgat ctgtggatta gagtaaatcc 840 taagctgcct ctgatacacg agaagtcagt aaccaaaaaa tgttttagac ttttcatcaa 900 agttttgtct gcagaaggca aattcaaaaa ttcaaataaa aacaaggagt atcttgattg 960 caattttgag cgtttatttg atttgtcatc atgtaattgt ggacttccag taatagcttg 1020 caatgataag taagaaaggc ctatttttat tttaatatat tttaaaatat ttattattat 1080 aaaacaaaaa tcaatattat taaatagttt catggtttgc agttctagtt ttcatttaag 1140 tttattttgt agataaataa attatgtttt gaacattttt atagatctgt gaagtgtagg 1200 aaccaaccag attgcaagac aacacatttt gcttgtctct gtgaagttaa caaaaaggta 1260 aataagttga aattataaaa gttaatttaa tttaaatcta tagtttgtta aacaagtaat 1320 cagaatcaaa agcatgtcag aaaaataacg taattaattt tctttgactt tggcaacaaa 1380 tcattttgtt atatatatta ttaagaaacg catcattgta atataataca agaatatatt 1440 atatattgtt tttttacttg actccatgtc tttgtaatat acaatatatt tgtttttaat 1500 tgacttaatg taaagtaatt gttaaaaagc tcttaaacat attataattt ttatgttttg 1560 caattaaaaa aagttttaca tttttcaggt accactatta gatagagaat atttgaagga 1620 tcaaaaattg aaaattggaa ctattggaag atttcaaatt ggtacgattg ataaaaaagc 1680 agttgccatc gaaaaaagaa agcgagaccg tgaaattaat cagcagataa gatctgctaa 1740 ctcacttgct ccagaaattc aatgtgatca ttatatggat tgtgatttag ataataaaaa 1800 tgactcagaa acagatcctg actattcgat aactgcacca gctgaaagaa atcaaactta 1860 tttaacaaac ttggcaaaag ccgctattag gtatgaaata agcgatgttg caacagctgc 1920 attggcaaca gctgtattaa ttgactatgg aattgttaaa gaaagtaata aagcacaaat 1980 aataacagaa aaaaaaattt ttgggggaaa aaaaaagaat cagcagtatt ataagctcta 2040 aacatttagc tgacgttaaa gaattagagg ttattggagt tgatggaaaa aaggacaagt 2100 ttacattgac acattctatc aggtataaca gtaatggaga tccagttgtt gttgctgcaa 2160 tcaaagaaga ataccatcta acatttacag cagagagtgg acccacatca caaagttatt 2220 tgactcatgt agttatacct aatggaattg gtgcaaccat ggcaaatgct actgctaatg 2280 ttttgttgga atttaacagt gttcagagtt taatagcggt tattttagat aatacaagtt 2340 caaatacagg ttgcgatagt ggtttggttg tcaaacttga aaaaattata aacaaaaact 2400 tacacctttt aggatgtttg ctgcatcaga atgaacttcc actgcgacat atcatatcag 2460 aattagatgg aaagacagtt agtcctaatt ctttcaaagg gccaatcggg gaggctgcat 2520 ctaacctaac acttcatgaa cagccattgt ttcaatttgt acccataaag tcagaggtag 2580 agttatttgt taacaaagag attataagtg atcttagtac agaccagcgc aaactctatg 2640 agtattgcgt tggtatttca aaaggtttta tttctgaaag gtttgtttca aaaaaaccag 2700 gaccagtttg ccatgccaga tggctaactt tagcactcag aattatgatg ttatatgtta 2760 ggactaagga gccatctctt gagctattag cagtaactaa gtatattgtg caagtttact 2820 gtgtaatgtg gtttgctatc aagcactctg gacaatacaa agatgcttgt ctactcctat 2880 ataaaaatat tcagttggtc aagcttcaag aaagcaggat tcaaagtatc gtttttacca 2940 atctccaggg aaattcatac tgttgtttgc aggaaaattt tctttactgc atgctgcagg 3000 atgaaaatat acatataaga aatcgagctc ttaaccaaat tcttataata agaaaattaa 3060 aagaagacat wactttcaaa actcagctta aaccaaaatc ggtaatgaaa ataaattttg 3120 atgctggttc ttgggacttt cttgtttcag ttgatgagat tcaaattgag ccaccagcaa 3180 cacttagcat tacttcagaa aaaattagag aggcatttga atctggacaa aaactttgcc 3240 taattgattt tcctaacaat tcgcaatccg ttgaaagagc agtaaaacta acctctgaag 3300 ctgctaagca ggtttatggt ctagaaaaaa gacacaaatt cattttggca aagaaccaaa 3360 gtcgaaaaga gaataagaga aatgtaaaaa aatgtgatta ttttataaaa tgaatatcta 3420 attttgcaat tttctgtgcg atttaattaa aaacagcttg ctatgacaaa tgcyataaaa 3480 aacattctga ctaagttgtt tagtgttgat aaattaatat tccttaaaaa taacaaagct 3540 acgaaagata taaatcgact gtaaaaaacg taataaatgg ctaaaaatta aatcagggcc 3600 atattaaggt gggggcggat gggtctgcag cccctggccc ttacaaaaaa ttccccaaag 3660 cccttcggag gcgttgtgaa taaataaata aatagagatt cagtaatcta tttaaggtgg 3720 acctatacaa aaagatctat ataaaagttt tacccccccc cctaataaat ttaagacccc 3780 ttacttgggg ggcttgcccc acaaaccccc ctggaaggaa ttttctggcc cctagatccc 3840 caggtatagc ctcgctaacg ctgggcattt ttacctcatc gccacattag gccctcctat 3900 aataacagcc ccaggcctaa gaatgcctta atccggccct caattcaatt aagatatata 3960 tttgaaccga tttcccaaag tgaatataaa ttaaacaaag ttaaatttat atttttcttg 4020 ttatttatat tgcaatgaac aaattccagg gtgactcttt tcagttttca aaaaaaacac 4080 cttctaggca gtggctta 4098 // ID Gypsy-85_AA-LTR repbase; DNA; INV; 1172 BP. XX AC supercont1.248; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-85_AA_; KW Gypsy-85_AA-I; Gypsy-85_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1172 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.248; Positions 1430355 1429184. XX SQ Sequence 1172 BP; 407 A; 261 C; 222 G; 282 T; 0 other; tgttaccgtt ggggcataaa atccaaatca ctatgccgaa taaaaataga aacctgagat 60 atatttaatt tgtcagtgtg ttaaaattgt tagtcactta aagctatata aaacaagaac 120 tatgtatata ctaaatgtaa ttaaaactta taagaaacct ttaaaatgag catgcacatt 180 taatatgtta gtaaaattaa ataaaattat taaaaaggaa attataattg ttaaacataa 240 caaaatttgt aaaacacatt tagtacatta cacacaacac actaaaatct gctgcagggg 300 aaacttttga ggagttgtcg cgagttgagg aaaggactgg caaaaagaga tggaaaaggg 360 aatttgatat ggaaagggta ggctactggg gaaattaaaa agctggctcg gtctctttcc 420 taaagtgaac gccaaagcta tcagacgcta gattttagct gaagctataa aaaaagctaa 480 actttaactt tttaaaggaa aaccaccagt ggcactaaat acaccgtatt aacacggtaa 540 aataagtgaa tcaggtatgt tattttacca attaagaata attaccaagc aaaataccca 600 gtagcccatg aacctaaccc taaaccaacc ataacccttt cacaccatta ggacccaata 660 gtgctttgca caaaagcttt ccagacgcca agcaagaatt tggatctggt atagcctgag 720 gaaccggata tcaccccgtt tggtccacga tctcatcgta ttgagccaac aggacatcga 780 gcacgctaca ctgtacgatt tccaccccat cgtcggtcga aggccattcc cgacaggaat 840 tcaaccctcg aggacccgac attccgtcca gagtggagca accacgtacc gcctaggtcg 900 agacaggcca cgtgaacctc tcgcatcctc ccaaaagtga ctcactcacc ccagcaacag 960 cagtaagcgc cggccggcca tcacatcacc taacacccaa acagatcttg tgagtagtac 1020 cgaaataaaa ttgtgaaacg tatttagagt gttgcctttt gattgagccc ttggttagcc 1080 tctcgaatag ccgatcctga gagcaaaaag ctcacgctct agctcgccgg tactcaggtc 1140 ttgctgaggg gtcatccgac tcccaagtaa ca 1172 // ID BEL-4_SI-I repbase; DNA; INV; 5508 BP. XX AC AEAQ01012976; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_SI_; KW BEL-4_SI-LTR; BEL-4_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5508 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01012976; Positions 12406 6899. XX CC Positions [4389-4853] - Integrase core CC 'CTTCT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 1487..2542 FT /product="BEL-4_SI-I_1p" FT /translation="MMNLASLHASTPELMISAIIYIQNSNHQQIKCRAFLD FT TCATANFITVSMANRLKLPISTQSLSINAINGTCTESKGIVRITIKSLHNE FT FTKNLTCLTIPSITDLVPSETFPRNAVKIPRNIKLADPEFHLPRSVDLLIG FT SGATLSLFAVGQISLSKDGHDLYLQKTRLGWVIAGNTTSQIPSNVTCHLTR FT LEELISKLWITEEITTDKLQSPEEIDCETYYLRTVSRNKSGRYVVRLPFRK FT NNTRLGDSRILALKRLSALERKLNANNTLKTEYTRVITEYINLNHMSEIKN FT PDDHGYYMPHHAVIKESSSTTKVRVVFDASAKTNNGLSLNDTLMVGPTIQN FT KLFSHLIRF" FT CDS 3156..4853 FT /product="BEL-4_SI-I_2p" FT /translation="MQDVWRCGLQWDESVPQNIYTEWLKFVKQWEAMGRVS FT FPQNLLIKNYQNVQIHGFCDASNIGYGACIYVRSTGEHGDVTVRLLCAKSR FT VAPLKITTIPRLELCGAIILAQLYQEIKDVFVFNVTKVVFWSDSTIVLHWL FT NISPHLLKTYIVNRVTIIQEITGSYEWRHVRTANNPMDALSRGQLPYDFLQ FT NNAWSTGPTWLIKDASQWPNEFKRLTEIPELKKNTCLTVVHKDYGIFDKYS FT SYFKLLRIIAWCLRFCSNNKNTDNLCTDEINNAESRVLKLLQDSQFSDVIR FT ELKKNIAYKGKFANLNPFLDEIGLIRVGGRIQRSNLSFAQKHPILLSNRHR FT ITDNIIRELHEKHLHTGIQTTLYTLRQRFWISDGRNQVRRIVRTCTRCSRF FT NANAIKYKLGNLPAAWVRATTPFAHTGVDFCGPFFIKEQKFRNRNEIKVYV FT CVFICMAIKAIHLEIVSDLTSDGFFAALRRFNARRGIPEHIYSDNGTNFVG FT ANNKLKEIYALINSEEHKTLVNQFANQHRIKWHFIPPLAPHFGGMWESTVK FT SFKHHFKRVIGDSILRMKN" XX SQ Sequence 5508 BP; 1894 A; 1105 C; 1020 G; 1489 T; 0 other; tttttttggt gcctcctgtg aggtttcacg gtttactcga ggacaaggaa accttcgacg 60 agtccgcgac tcaagacagt ggttctcaaa cgacgtccca cggattcaat ccgctgtctc 120 taggcgttgt cgcgacgact tcacgtgagt ggagggaaaa cagaatacaa attctcatta 180 tacgaaatcg atcgaagtaa gtcgtttgat tattctttaa ttattcaacc atggccgaca 240 aaatcaaatt aatcattcaa aaacggacgt ctttaaaatt acaactcact aacttgagta 300 atattctcga caaaggtagt gtagatcgtt ccttaataaa actaagaatg aatcgtatca 360 ccgaactata tcacgcgttc gaaggatata acgacgagct cgcgttgtta gatccgagtg 420 atgcgcatca atccaaattc gaaaacatac aagaacgtta ttatttaatc gcgagtaaag 480 tagaaaatat tttacatccg cctgatacaa gagaggagac agaatctaac gtgtcgatta 540 acggaactcg tagtgacagc acggagacaa taattaaaaa tcgacgaatc aaattaccgg 600 aagcttcttt accgactttc gatggaatgt acgagaattg gctgtcgttt aagaatgcat 660 ttgctaacat gatcggctcc cgaaccgatc tatccgatat tgataaatta cattatttaa 720 aatcagcttt gatcggagac gcagctagca aaattcaaat attcgcgatt gacgggatca 780 attatactaa agcgtgggaa ttacttgagc gcgcctacga agtcaaacgg attttaatat 840 cgcgacatta ttcgatgatt ctcaatttgc ccgcgataga caaggaatct actagcgggt 900 tgtacaaact tgtggatgac gcacaacaac atgtcgcgtc actgagtact ctaggaatta 960 acgtcgagcc gcaattaatc gtacatattt tagagactaa actacctaaa aatacgttat 1020 aaaaatggga gacgaaaaaa acaagtttcc gaaagttgag caaatgtacg aattcctgta 1080 caaatccgcg gtctgtgcgt cgaaatgcga aagaacaaag gcgatagaag gagataataa 1140 agacgagccc gcgataaaga agaaaaaatt gtctcactcg aatcgtgcac tcttattgaa 1200 taagacacgc agttgtttaa tgtgtaaaaa taaacgacat ccgttatata cgtgcgagaa 1260 attcaagcaa ttgtccgtac aaaagcgtat tgatacggtg aaaaacgcga agatttgcta 1320 caactgctta cgttcacata aagatttccc ttgtaaattc tcgaactgca tcatatgtca 1380 aaaacgtcat aatacacgat tgcacgttga caattacggc gcctcaaata aatccaaccc 1440 ttctaaagca gcatcgaacg ggacagattg actagccgat actcgaatga tgaatcttgc 1500 gtctctacac gcctccacac cagagttaat gattagtgcg ataatttaca ttcaaaatag 1560 taatcatcaa caaattaaat gccgagcgtt cttagacact tgtgccactg caaattttat 1620 tacagtatcc atggcgaatc gtttaaaatt accgatatcg acacaatcat tgtcaattaa 1680 tgcaattaat ggcacatgca ctgaatcgaa aggcatagta cgcatcacga taaaatcttt 1740 acacaatgaa tttacaaaaa atctaacatg tttaacaatt ccgtctatta ccgatttagt 1800 tccatccgaa acattccctc gaaacgcagt caaaataccg cgaaacatca aactcgcgga 1860 tccggaattc cacttacctc gctcggtcga tctattaata ggttccggag cgacactttc 1920 tttattcgcg gttggtcaga taagcttgtc taaagacgga cacgatttat atttgcaaaa 1980 gacacgtcta ggatgggtga tcgccggaaa caccacatca caaattccgt caaatgttac 2040 gtgccattta acgcgtttag aagaattgat atccaaactt tggattactg aagagattac 2100 gacagacaaa ttacaatctc cggaagaaat cgactgtgaa acatattatt taagaacagt 2160 ctcgcgaaac aaaagcggtc gatacgtggt tcgattaccg tttcgaaaaa ataatacccg 2220 tctcggcgac tcgcgtattt tggcactcaa acgtttatcg gcactcgaac gcaaacttaa 2280 cgcgaacaat actttaaaaa ctgaatacac cagagtaata actgaatata tcaatttaaa 2340 tcatatgtcc gaaataaaaa atcccgacga tcatgggtac tatatgcctc atcacgctgt 2400 aattaaagaa tcgagtagta ctacgaaggt acgagttgtg tttgatgcgt cggcaaaaac 2460 aaataacggc ttatctctca acgatacatt aatggtaggt ccaactatac aaaataaact 2520 attttcgcat ttaattcgtt tttgatctta caaatacgtt ttaacagcag acatcgaaaa 2580 gatgtataga cagattctcg tacacgataa ggatcgacgt tatcaacgca tattatggcg 2640 cgaaaacggt gaaataaaaa cattacaatt aaatacgctt actttcggga tttcttcatc 2700 gcaatccgtt tctcgcaatc cgcaccatca aaaaattagc agacaatgaa cgatcttcat 2760 atcctcgagc agccaaagta attgagtcac atctatatgt agatgattta ttaaccggca 2820 ctgagacgat aaatgaggct cggacgttgc gaaatgaaat aataacatta ttagctctcg 2880 gcggctttag tattagacaa tgggcatcta acgataaacg cgtagtcaat gatttaccga 2940 ttggcgcact acacgagaat tttgcattaa atacagattg cgcttttaaa aacattaggc 3000 gtaatgtgga atgcacgcga tgataaaata tattacgcga cgaaattaat cggaaataca 3060 acaaaaatta cgaagagaac tatattatca gagattgcca aaatatttga ccccttagga 3120 ttgctggccc ggttatttta tatattaaga aaataatgca agacgtgtgg cgatgtggtc 3180 tacaatggga cgagtccgta ccgcaaaaca tatatacaga atggttaaaa ttcgtaaaac 3240 aatgggaagc aatgggccga gtttctttcc ctcaaaattt attaatcaag aattatcaaa 3300 acgtacaaat acatggattt tgcgacgcga gcaatatagg ttacggtgca tgtatatacg 3360 tgcgctcaac gggagagcac ggtgatgtaa ccgttagatt gttatgcgcc aaatcgcgag 3420 tagcgccatt gaagataact accatcccac gacttgaatt atgcggtgca ataatattag 3480 cacaattata ccaagaaata aaggacgtgt tcgtttttaa cgtcaccaaa gttgtttttt 3540 ggagtgactc gactatagta ttacattggt tgaacatatc gccacacttg ttaaaaacat 3600 acatagtgaa tcgcgtcacc atcattcaag aaatcaccgg ttcatatgaa tggcgacacg 3660 tcagaaccgc aaacaatccc atggacgcgt tatcaagagg ccagttgcct tacgattttt 3720 tgcaaaacaa cgcatggtcc acaggaccta cttggcttat taaagacgcg tcgcaatggc 3780 ctaatgagtt taaacgttta acagaaatac ccgaattaaa gaaaaacacc tgtttaacag 3840 tcgtacataa ggattacggc atttttgata aatactcatc atattttaaa ttgctcagaa 3900 taatcgcttg gtgtctcaga ttttgctcaa acaacaaaaa tacggataat ttatgtaccg 3960 atgaaattaa taatgcagaa tcacgagtgt taaaattgct ccaagacagt caattctcgg 4020 acgtaattcg agaattaaaa aagaacatcg catataaggg taaattcgca aatttaaatc 4080 ccttccttga tgaaatcgga cttattcgtg tcggtggacg tatccaacga tccaatctct 4140 catttgcgca gaaacacccg attcttctct cgaatcgcca tcgtattaca gataacatca 4200 ttcgcgaact tcatgaaaaa catcttcata caggtattca aacaaccttg tatacattga 4260 gacaacgttt ttggatatct gacggccgaa atcaagtgag aaggatcgtt cgcacatgca 4320 cacgatgctc ccgattcaat gccaatgcga ttaaatataa attaggcaat ctcccggctg 4380 cttgggtgcg tgcgacaaca ccctttgctc ataccggtgt agatttctgc ggtccatttt 4440 ttatcaagga acaaaaattt cgtaatcgta atgaaatcaa agtatacgta tgcgtattta 4500 tttgtatggc tataaaagcc atacatctcg aaatagtaag tgatctaaca tcggacggat 4560 tttttgcggc tttacgccga tttaacgcca gacgcggtat tcccgaacat atttactcag 4620 acaacggtac caactttgta ggtgcaaata acaaattaaa ggaaatatat gcgttaataa 4680 actcggaaga acataaaaca ctcgtaaacc aatttgcaaa tcaacaccgt attaagtggc 4740 attttatacc gccactcgct ccccattttg gtggaatgtg ggaatcaact gtaaagtcat 4800 ttaaacacca cttcaaacgc gtgataggtg actctatttt acgtatgaag aattagaaat 4860 gttcacaacc gaagtcgagg gcattttaaa ttctagagca accacgtcca tatcggcaga 4920 tccaaacgac ttacttgttt taacccccgc tcattaccta attggcaaac ctattacttc 4980 tttgccggaa agcaatttat taagtgttcc agaaaatcgt ttatccatct ggaaacatat 5040 atcaaaagtc cgacaagact tctgggcaag atggagtatg gaatacttaa atgaactcca 5100 aacacgaaat aaatggaaaa aagacaaatc aaagcttaac atcggttttg tcatacttat 5160 caaagacaaa aatattccat gcacttaatg ggcgttaggc agagttacac acgtacaccc 5220 tggagaagat ggaattgtac gagtcgcaac tataaaaaca tcttcaggag aaaaaaaaat 5280 agctgtaaaa tcattatgtc tgttgcctat agaaaccgat taagcacgtg gtacagtatg 5340 aaatattgat cactaaaata ttcatataca tcattagatg taaatattta tccattcata 5400 tcacatgata cattacatac gcatatacat gtaatatata catgtatagc gtatatgtac 5460 atgcgagtat atattattga tccgtatttc ctctcaacgg ggggagta 5508 // ID Copia-128_AA-I repbase; DNA; INV; 4073 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-128_AA_; KW Copia-128_AA-LTR; Ty1_copia_Ele170; Copia-128_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4073 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1473-1976] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..3365 FT /product="Copia-128_AA-I_2p" FT /translation="MSSAKKAGIVQFAGEGYDTWKFRVETQLSAHGVKETI FT TEDPPAEEPARTAFLEKDEKAKALLVAYIADSHLEYIRDKQTAREMWTSLA FT NTYAKKGFAAQTYIRRSMALLRMEEGTPLADHFRRFDELGRQLKNAGATVT FT ELDLVSQLFISLPPSYDVVTTAMENLDDQLKLETVKARLLAEEQKRSGRES FT GSSAASGSDGVVLAARSGNRRPQSSKFAGKCYHCGEIGHKKFACPKVKNRS FT DRAQVAKIPIDKAVALASGAKVSEKPDEIEWVLDSGASSHMVGDEGYFQEI FT HELHVPLRIDSAKDGERLVATKEGTVKARLCVGENRYTLSFGHVLLVKQLK FT YNPMSLSKLLNAGVHVEFEPDRAILTKDGETIGVAKSKGNLFYPRMNRLPS FT GSALAAGSNTELQLWHKRYGHLSFKNALELKNSAMVDGLDGLHGDAKFCDA FT CAESNMVRQPFCGTRPQTRRPLERVHTDVRGQITPSTSDGYRYFLSFVDDY FT THFGVVYLLKRKDQVFDYFKVYEAMATAKFGTKIANLRCDLGREYFSKERL FT AYYDAKGIQVESTVGYTPQQNGVAERFNRTIVEKMRAMLTESGAPKYLWGE FT AVLNAVYVTNGSPTEAIKGKETPAERRHGEKPDVTKLRVFGCQAFSWVPKR FT KRGKLDPTGRKCAMLGYAPTGYRLWDQEKRKMFVARDVRCNEAAFPFKAPS FT ESDGRLVIPEDVSPVPEQEGEIEEQQMIKAVGEARNDARDSDDEDVGDQDP FT DEDSATEVEDNDEEGATALPPQSERNETTGEVPRSSGRECRRPGWFSDFLT FT YKALSAGANSLQVPECYNDVKGHPNEPDWREAIRDELVALQKNQTWTVVKR FT PAKVQPVPCKWVFNLKTDADGRPARYKARLVAKGYAQKRHVDYEETFAPVA FT RLTTIRTLLALATTNGMHIHQLDVKTAFLNGKLKEKVYMQCPEGVKMRPDE FT VCLLHRALYGLKQSPRCWNSHFNEFLTTLGFVRSKHDYCLYVRACGDATVY FT LVVYVDDLLIAAVDEADIERVKTTLMQKFEMTDMKELHHFLGIKIHRDLDR FT GLMKLSQTSQIETIIARFGMQNCNPVKTPPNQDCS" XX SQ Sequence 4073 BP; 1066 A; 951 C; 1226 G; 830 T; 0 other; cgtgtttctc tttaggttat cggcccagtc acgccatacc taagagccag tgaaaagtga 60 aaagttctcg aagccgttcc agccccaaga tgtccagtgc gaagaaagca ggaatcgtgc 120 agtttgccgg tgaaggctac gacacgtgga agttccgggt ggagacccaa ctttcggcgc 180 atggagtgaa ggaaaccatc acggaggacc ccccggcgga ggaaccagcg aggaccgctt 240 tcctggagaa ggacgaaaaa gcgaaagcgc ttctggtggc atacattgcg gacagccatc 300 tggaatatat tcgtgacaag caaaccgcaa gggagatgtg gacgtcgttg gcaaatacct 360 atgctaaaaa aggttttgca gcccagacgt acatccggcg gtcgatggcg ttgttgcgga 420 tggaggaagg aacccccctg gcggaccatt tccgtcggtt cgatgagctg ggaagacagc 480 tgaagaacgc cggtgccacg gttacggagc tcgacctggt gagccaactg tttatttcgt 540 tgccacctag ttacgatgtg gttaccacgg cgatggaaaa tttggacgac caactgaagc 600 ttgagaccgt caaagcacgc cttctagcgg aggagcaaaa gcgatctgga agagaatccg 660 gttcgagtgc tgcgagcggg tctgatggag tggtgttggc ggccagatcc ggaaatcgac 720 ggccacaatc atcgaaattc gccggtaagt gctaccactg tggagaaatc gggcataaga 780 agtttgcctg cccgaaagtg aaaaatcgtt ccgatcgggc tcaagtagcg aagattccga 840 tcgacaaggc ggtggcgctt gcgtccggag cgaaggtgtc ggaaaagccg gacgaaatcg 900 agtgggtcct agattcggga gcgagttccc acatggtcgg cgacgaaggt tacttccagg 960 agatccacga gctgcacgtg ccgcttcgta tcgacagtgc aaaggatggt gagaggctgg 1020 tagcaacgaa ggaaggtacc gtgaaggcca ggttatgcgt cggtgaaaat cggtacacgt 1080 tgtcgtttgg acacgtgttg ctcgtcaagc aactgaaata caacccgatg tcgttgtcaa 1140 agttgctgaa cgcgggcgtg cacgtggaat tcgagccgga tcgtgccatt ttgacgaagg 1200 acggcgaaac catcggtgtg gcgaaatcga agggcaacct tttttatccg cggatgaatc 1260 gtttgccaag cggatcggca ttggcagccg gtagcaacac ggagctacag ctgtggcata 1320 agcggtatgg gcaccttagc ttcaagaatg cgctcgagct gaaaaattcc gcgatggtgg 1380 acgggctgga tggtctccat ggtgatgcaa aattttgcga cgcttgcgct gagagcaaca 1440 tggtaagaca gccgttttgt ggtacgagac cgcaaactcg aaggcccctg gagcgcgtgc 1500 acacggacgt gcgtgggcaa atcaccccgt caacttcgga tggataccgc tactttctgt 1560 cgttcgtgga cgactatacg cacttcggag tcgtgtacct gctgaaaaga aaggaccaag 1620 tgttcgatta tttcaaggta tacgaagcga tggcgacggc taaatttggg acgaaaattg 1680 caaatttgcg gtgcgacctc gggcgtgagt atttttcaaa agagcggcta gcctactatg 1740 atgcaaaagg cattcaagtg gagtcaacgg tcgggtacac cccgcagcaa aatggtgtgg 1800 cggaacgttt caaccgcacc atagtggaga agatgcgagc gatgctgacc gagtcgggtg 1860 cgccgaagta cctttgggga gaagcggtcc tgaatgctgt ctacgttacc aacggaagtc 1920 cgactgaagc aatcaaaggt aaggaaacgc ctgcggaacg gcggcatgga gaaaagcctg 1980 atgtcaccaa gctgcgagtt ttcgggtgcc aagcgttttc ctgggtaccg aaacggaaga 2040 gaggtaagct ggatccaacc ggacgaaagt gtgcgatgct cggttacgca ccaactggat 2100 atcgtctttg ggatcaagaa aagcggaaaa tgtttgtggc ccgcgatgtc cgctgcaacg 2160 aagcagcatt cccattcaag gcaccgagtg agagcgacgg acgattagtg attccggagg 2220 acgtatcacc ggttcccgag caagaggggg aaattgaaga gcagcagatg attaaagcag 2280 ttggagaagc tcgaaatgat gctcgtgact ccgacgatga ggacgttggt gatcaagacc 2340 ccgacgaaga ttctgctacg gaagtggaag acaacgacga ggaaggcgca accgcactcc 2400 caccacaatc agaaaggaac gaaactactg gtgaagttcc gaggtctagt ggtcgggagt 2460 gcaggagacc aggttggttt tctgattttc ttacgtacaa ggccctttct gctggtgcga 2520 attccctaca ggttcccgaa tgttacaatg atgtcaaagg tcatccgaac gagcctgact 2580 ggagggaagc aatccgagac gagttggtgg ctttgcagaa gaaccaaacg tggaccgtgg 2640 tcaaacgccc tgccaaggta caacccgtac cgtgcaagtg ggtgttcaac ctgaagacgg 2700 atgctgacgg gcgtcctgcg cgctacaagg cgaggctggt agctaagggc tacgcgcaaa 2760 agaggcacgt ggactacgag gagactttcg caccggtggc ccggttgact accatccgga 2820 cgctattggc actagcaacg acgaatggga tgcatattca ccagctagat gtcaagaccg 2880 ctttcctgaa cgggaagctc aaggagaagg tgtacatgca gtgtccagag ggggtgaaga 2940 tgcgccctga cgaagtgtgc ttgctgcatc gtgcacttta tggtctgaag cagtcgccca 3000 ggtgctggaa cagccatttc aacgagttcc tgacgacctt gggtttcgtg cgatcgaagc 3060 acgactattg cttgtacgtt cgagcctgcg gagatgccac tgtgtacctg gtggtatacg 3120 ttgacgacct cctgatcgcc gctgtggatg aggccgacat cgaacgtgtg aagaccacac 3180 tgatgcagaa attcgagatg acggacatga aggagctgca ccatttcctt ggaatcaaga 3240 tccacaggga tttggatcgt ggcctgatga aactgtccca gacaagccag atcgagacga 3300 taattgctcg ttttggaatg caaaactgca atcctgtgaa gacgccgccg aaccaagact 3360 gcagttgaag cgagaagccg gaaaatgcga gtacccctac cgcgagctca tcggtagtct 3420 catgtacgtc atgatgggtt ctcgaccaga cctgtgtttc gtcgttggat acttggcgag 3480 gttccaggat gctgctggtg aagaacattg gaagcatgcc aaacgcgtcc tacgatacct 3540 gcaagcaacg aagaagttag gattagttta cagaagcaag ccgaaggaac caacggtagc 3600 tgcttacgtg gactccgact tcgcaagcga cgaatgcgac cggaaatgcg tttccggatt 3660 tttactgaag gtgcacggaa acactgtcgc ctggtcatca aagaagcagt cgactgtggc 3720 gatgtcgtct accgaagccg agtacgttgc catgagttcc tgcgtgagcg agacgatttg 3780 gctaaccgga ttaatggccg atcttcgaca agatgcccta ctgtttccgg taccgctcta 3840 tgaagacaac caaggagcga tcgcgatggc agaacgagag gaaacgagga gagtcaaaca 3900 cattgacgtg aagtttcatt tcatccgaaa tgcagttgcc gaagggaagg tcaagctgat 3960 ttacattccg acccagaagc aacaagcaga catcctcacg aagtctttgc ccgctccaac 4020 gtttatagct ttaagaagta agttagggtt agaaggaaac aactgagagg ggg 4073 // ID DNA-2B_PPac repbase; DNA; INV; 586 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Non-autonomous DNA transposon from the Pristionchus pacificus DE genome. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2B_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-586 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 953-953 (2010). XX DR [1] (Consensus) XX CC ~84% identical to consensus. XX SQ Sequence 586 BP; 136 A; 154 C; 144 G; 143 T; 9 other; taggggccac aacgcggata gccgatttta cgcaatgggg tcaaatttgt gttcaaacna 60 tccctcgtgg ncccaataac atcatgcttt ttgccgtttg tcgatatctt gtctctgagc 120 cgcgctagag cccgagaagt acgcgcgcaa cgcactgggg aagaggaggg aaaggcgaga 180 cgagagggaa gggggcggag cctaatttgc gtgtgtctct ctccctcccc ccgccctctc 240 gcncgcctct ccccgcgttc caagcacacc ccggggaagc atgctaaaaa tctgattttt 300 tctctaaaaa tcctaaaata ttcnaattag tcagaaaatc gctgtccgag cagaattgtc 360 atactcggaa cacgtcgaan cgctactctc cccttcttta tcgattttga cctacttagt 420 gaagattntn caagatctgg acagcgctcg atcganaggg gatgctagta gctatccanc 480 gcgctatcga acgaccgatt tggtttagaa cgaaccttgt gcttctccgt ctcgctttta 540 ttggaggagc gtgtcacaca aaagtgggcg gagcttgtgg ccccta 586 // ID MERLIN2_SM repbase; DNA; INV; 1145 BP. XX AC . XX DT 07-OCT-2007 (Rel. 12.1, Created) DT 07-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; MERLIN2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1145 RA Jurka J.; RT "Merlin2_SM: Merlin-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 7(10), 1088-1088 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 170..1057 FT /product="MERLIN2_SM_1p" FT /translation="MFHLLFERIKLNLAIQPEEIIRFLQELNVFQKNVICD FT LCNKQMSFISVNSLVDKFAFQCSDCKSRHSIRKGSSFYNSKLSIQEILAAL FT IGFVRNESIKKIAKTLHLSERCLVKWYSIFRACITMHLENNFQQLGGVGRV FT VEIDESVVGRRKYNVGRSRDQQWLFGCIDRSTSKILIKCVDSRTKRELGAL FT IKDVIADNTLVMSDEWPAYMSFFSENTSNYSHLAVKHKENFVDPDSGAHTQ FT NIECVWRRLKDMLRAKNYVRRSKLSSYVDEFCFREKFKFFENHEYLMFIEI FT IKLM" XX SQ Sequence 1145 BP; 414 A; 147 C; 189 G; 395 T; 0 other; acgattgatt tatatgtaag taatggaaaa atatgtaagt caaatattaa atttaaaaat 60 tataataata atagaatttt aagctcctat aaggacaaca aatttaaaaa aatataaatt 120 tgaatttgaa taatctttaa aagtgttata ctataaaaat catttgccca tgtttcatct 180 actttttgaa aggattaagc taaatttggc aattcaaccc gaagaaatta taagatttct 240 tcaagagcta aatgtgtttc aaaaaaatgt gatctgtgat ctatgcaata agcaaatgtc 300 atttatatct gttaattcac tagttgataa atttgcattc caatgttctg attgtaagtc 360 aagacattca attcgtaaag gttcgtcatt ttataactcg aaactttcaa ttcaagagat 420 tttagcggct ctcataggat ttgtaaggaa tgaatctatt aaaaaaattg cgaaaactct 480 ccacttatca gaaagatgtc ttgtaaagtg gtattcaata ttccgtgcct gtatcactat 540 gcatcttgaa aacaattttc aacaactcgg aggagtgggc agagttgttg aaattgatga 600 atctgttgtt ggaagacgaa agtataatgt agggagatca agagaccaac aatggttgtt 660 tggatgtatt gacaggtcaa catccaaaat tttaattaaa tgtgttgata gtagaactaa 720 aagggagttg ggtgcattaa tcaaagatgt aatcgctgat aatacattgg tgatgtctga 780 tgaatggccc gcttatatga gtttcttttc agaaaatacg tctaattatt cccatctagc 840 tgtcaaacac aaagaaaact ttgtggatcc tgattcaggt gcacatacgc agaacattga 900 gtgtgtttgg agaagactaa aagatatgct tcgtgcaaaa aattacgttc gaagatcaaa 960 actttcatca tacgttgatg aattctgttt ccgcgaaaag tttaaattct ttgaaaatca 1020 tgagtattta atgtttattg aaataattaa attgatgtga tgcttaaaat tctattatta 1080 ttataatttt taaatttaat atttgactta catatttttc cattacttac atataaatca 1140 atcgt 1145 // ID Merlin6_SM repbase; DNA; INV; 1233 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; Merlin6_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1233 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1896-1896 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(67..195,199..1107) FT /product="Merlin6_SM_1p" FT /translation="MLNNFNIMYYRNLIPITLNLKLFIYRSXYCLICLQYI FT KAMKAFPMTNRTSRIEMLLSVRNMTVTQVIAILRAKELLKRAMKCSCCGEL FT MVERKRKDNIDGVSWVCYSKECPKKKTTISIRNGSFFSDFRLSLADVWTLV FT LMWTESVQVCDAAHRYGISRKSVSAVYAKLRGLASEYLLADPIRLGGPGIV FT CQIDESLFCHKQKHNRGRTAETQSWVFGIADTSTTPAKYYVEVVPDRSANT FT LLPIISRVCRPGTVIYSDKWTAYKRINQNLGFEHGTVNHFLNFVDPDSGVH FT TQNIESLWAQLKMKIKAMKGIRGNQLPLFLNELVWKSNAQANILMYLVCVM FT KTQ" XX SQ Sequence 1233 BP; 399 A; 188 C; 255 G; 390 T; 1 other; ggatcatgtc cattatctgc cacctaggtg gcagataata cccataacat tgaatttaaa 60 attattatgt taaataattt taatattatg tattatcgga atttaatacc cataacattg 120 aatttaaaat tatttattta tagatctatn tactgcttaa tttgtttgca atacataaaa 180 gctatgaaag ctttttagcc catgacaaat cgaacttcca gaatagagat gttgctttct 240 gtgagaaata tgacagtaac tcaggttatt gccattcttc gtgccaaaga attactgaaa 300 agagcaatga aatgttcttg ttgtggtgaa ttaatggtag aaagaaaaag aaaagacaat 360 attgatggtg tttcttgggt ttgctacagc aaagaatgcc ctaaaaagaa gaccacaata 420 agcataagaa atggctcttt tttctcggat tttcgtttat cactagctga tgtatggact 480 ttggtgctaa tgtggactga gtctgttcag gtttgtgatg ccgcccacag gtatggaata 540 agcaggaagt ctgtgagcgc ggtatatgca aagcttcgag ggcttgcatc ggaatattta 600 cttgcagatc ctataagact tggtgggcct ggtattgtct gccagattga tgaaagcctc 660 ttttgccaca aacaaaagca taatagggga aggacagctg agactcaaag ctgggtgttt 720 ggaatagccg atacaagcac aaccccagca aaatattatg tggaagtggt gcctgacaga 780 tctgcgaaca ctctcttgcc aattatttct agagtatgcc gcccgggaac agttatttat 840 agtgataaat ggacagcata caagagaata aatcaaaatt taggttttga gcacggaaca 900 gttaatcatt ttttgaattt tgttgatcct gattcgggtg tgcataccca aaatatagaa 960 tctctatggg cgcagctgaa aatgaagata aaggccatga aagggattcg tggaaatcaa 1020 cttcctcttt tcttaaatga gcttgtgtgg aaaagtaatg cacaagcaaa tattttaatg 1080 tatcttgttt gtgtgatgaa aactcaataa atcatgattt agtttattta tgtttttata 1140 attgaataaa accaatagaa ttcttattta ttattattta ggtggcagat aatacccatg 1200 tatcttgtag gtggcggata atggacatga tcc 1233 // ID Copia-13_DPu-LTR repbase; DNA; INV; 218 BP. XX AC scaffold_22; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_DPu_; KW Copia-13_DPu-LTR; Copia-13_DPu-I. XX NM Copia-13_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 690-690 (2010). XX DR Genome; scaffold_22; Positions 1125225 1125442. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 218 BP; 53 A; 51 C; 42 G; 72 T; 0 other; tgttggagat gttgccaaac cgtggtggtt ggtgaacccc acccaaccgt ttagtactac 60 ctcctttgtg ttcccagccc ttttagtgct gtctatgtct gtcaagtgtc actgtcaatg 120 gtttcacttg ataaaagaga tttgtgctga caggtatgtc acccccgaat aaaagttact 180 tgcatacaat attgtttatt taacacttca ctccaaca 218 // ID CR1-25_CQ repbase; DNA; INV; 4236 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-25_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4236 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 29-29 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 37..4170 FT /product="CR1-25_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="RALRRIFLGADKRXQPDSAXSASSSCPTCTNKTVNAE FT SHAKVNAETNTKVNVEVNPEIYAGHGADPKGYGTNFACESQWFTYGSRDRL FT TNTLRSTPSLPQQSSVQHLPASSPGRAVESIMRAPRLPDATAPSPSYRTRS FT PASRSHQRRPGAGVGVGGGVSQTAPLGKYPSQLQVSLPDACPTYSSSLITS FT SSTRTANASPPAQAIQSSPGRTVESIMRAPRLPDATTSSFSYRTRSPAGCS FT HQSRPGAGVGVGGGVSQPVLFGKYPFQSQRVLPDASPTHRSCPSTSTSTRT FT ANASPLPAQAIQSSPGRTVESIMRAPRLPDATTSSFSYRTRSPAGCSHQSR FT PGAGVGVGGGVSQPVLFGKYQFSTPHTSPDARSDFSATVPPSTFVTLAAPG FT CVYHGTMEDLGSANSGMPAALTASDPASASSERGFLNVYYQNVRGLRTKTN FT SVLLALSSCDYDVIIFTETWLVPGIKNSEFAAGYDIFRCDRSAETSSLQRG FT GGVLVAVKTGRNCKDVELVGCGSLEQIAVRIVLPHQALFICCIYIRPASNI FT DVYQLHTGAVQQILGMAGVRDLVLVVGDYNLPALTWFFDDDTHSFLPINAS FT AEQEVLLLESMLSCGLLQINNLASAHGNLLDLAFVSDATAAELIDPPCPLL FT KPDAHHKPILLKLDYDHGGNDDLDDADEIFDFNQCDYALLNERISAVDWDT FT LLDTSSVDESTANFYERLNSILHEVVPKKIRRMSHRYKLPWWNDELRTLRN FT RLRKATSRYMESKTAWDKVTVQNIEDEYNALNQESFRSYINNTQRSLKSDP FT SKFWSYVNSRKNANRTPTDVSYRDLNSTSPAIAANFFADFFKSVHSSDAIP FT PRDHEFEYVPTHYLPLPLPVLNDEETLKALSGVDGAKGPGPDGIPPSLIKA FT CADSLTVPVKRIFNDSLRKGVYPAMWKVASITPIHKSGSTKKVENYRPISI FT LNCLAKVLEKIVYDRLYAVVRPIISDDQHGFMQKRSTTTNLLTFVHSTVNA FT LESGEQVDAIYIDFEKAFDKVPHALTVRKLRKLGLPAWITNWLLSYLTNRK FT AFVNLRIARSSTFDIPSGVPQGSHLGPLIFILFVNEICALTNSNTLMYADD FT LKLFRSVKNNVDCCSLQMDVNALLHWCESNGMKVHVKKCKVISFSRSRNPL FT RFDYTMGDGCPLDRVNSIRDLGVTVDRKMKFNEHIALTTGKAFAMLGFLKR FT NTAHFDDPFALKVLYASLVRSVLEYAAVVWAPYHVTLAARIERVQRAFVRF FT ALRKLPWNNPLILPSYEGRCLLFRLQSLAERRTLMQRLFVFDLLRGNIDCR FT SIRNNVRFNEPARQGLRGERPLLWIPRHIFDYGYNNPLDVCFRQFNEICSC FT FDYVVSKTVFKTRIS" XX SQ Sequence 4236 BP; 1041 A; 1186 C; 1014 G; 992 T; 3 other; cacgtggcct cgcggtcttg tgatccgaga atttgaagag cgctccggag aatttttttg 60 ggagccgaca aacgcgwtca accagactca gcwgmatccg cctcctcctc ctgcccaacc 120 tgcaccaaca aaaccgttaa cgccgaaagc cacgccaaag tcaacgccga aaccaacacc 180 aaagttaacg tcgaagtcaa ccccgaaatc tacgccggac acggagctga cccaaaagga 240 tacggaacca acttcgcctg cgagtcacag tggttcacct atggatcacg agacagacta 300 acgaacacac tcagatcaac accctcactt ccgcaacagt cttcagttca acatctacca 360 gcttcatcac caggacgcgc cgtagaaagc attatgagag cccctcgact gcccgacgcc 420 accgcgccct cgcccagcta ccgcacgaga agcccagcta gtcgcagcca tcagagacgt 480 cctggtgctg gtgtcggtgt tgggggaggg gtctctcaaa cggcgcccct cggcaagtat 540 ccttctcaat tgcaagtttc tctacctgat gcctgcccaa cttacagttc gagtctcatc 600 acgtcgtcct cgactcggac tgcaaacgct tcaccgccag ctcaagccat ccagtcttcg 660 ccaggacgca ccgtagaaag cattatgaga gcccctcgcc tgcccgacgc caccacgtcc 720 tcattcagct accgcacgag aagtccagct ggttgcagcc accagagccg tcctggtgct 780 ggtgtcggtg ttgggggagg ggtctctcaa ccggtgctct tcggcaagta cccttttcaa 840 tcgcaacgtg tcctacctga tgccagtccc actcacagat cttgtccatc cacgtcgacc 900 tcgacgcgga ctgcaaacgc ttcaccgctg ccagcccaag ccatccagtc ttcgccagga 960 cgcaccgtag aaagcattat gagagcccct cgcctgcccg acgccaccac gtcctcattc 1020 agctaccgca cgagaagtcc agctggttgc agccaccaga gccgtcctgg tgctggtgtc 1080 ggtgttgggg gaggggtctc tcaaccggtg ctcttcggca agtaccaatt cagcacgcct 1140 cataccagcc ctgacgctcg ttcagatttc agcgctactg taccgccgtc gacttttgta 1200 actcttgctg caccggggtg cgtgtaccac ggcactatgg aagaccttgg ttccgcgaat 1260 tccggcatgc ctgctgcatt gactgcatct gatccagctt ctgcgtcgtc agagcgtggt 1320 tttttgaacg tgtactatca aaacgtgcga ggcttaagga caaagacaaa ttccgtgctg 1380 ttagccctta gttcctgtga ttacgacgtc atcatcttta cggagacgtg gctggtaccg 1440 gggattaaaa actccgagtt tgctgcggga tatgacattt ttcgttgcga ccgaagtgca 1500 gaaacatcat cacttcaacg cgggggaggt gtactggttg cggtcaaaac aggacggaac 1560 tgcaaagacg tcgaactcgt gggttgtggc agcctggaac aaatcgctgt ccgtattgtt 1620 ctaccacatc aggcactttt catctgctgt atctacattc gaccagcaag taacatcgat 1680 gtgtaccagc tccacacggg cgctgtgcag caaattctcg gcatggcggg agtaagagat 1740 ttggttctcg tcgtcggcga ttataacctt ccggcgctaa cctggttctt tgatgacgat 1800 acgcacagct ttctcccaat caacgcatcg gcagaacagg aggttttgct gttggaatcg 1860 atgctctcct gcggacttct gcaaatcaac aatctggcga gtgcacacgg aaatcttctt 1920 gacttggcat tcgtgagtga tgcgactgct gctgaactca tcgatccgcc atgtcctctt 1980 ttaaaaccag acgctcatca taagccaata cttttaaaac tggattatga tcacggtggc 2040 aatgacgacc ttgatgatgc agatgaaatc ttcgatttca accagtgtga ctatgccctc 2100 ctgaacgaga gaatatccgc cgtggactgg gacactctgc ttgacacttc ctcggtagat 2160 gaatctactg caaacttcta tgaaaggctc aattccatcc tacacgaagt cgttccgaag 2220 aaaatccgtc gaatgagcca ccgatacaag ctgccgtggt ggaacgatga actacgaacg 2280 ctcagaaacc gcttacgcaa ggcaacatct cgatacatgg aatcaaagac agcatgggac 2340 aaggtgacgg tgcaaaacat cgaggacgaa tacaacgcgc tgaaccaaga aagctttcga 2400 tcctacatca acaacacaca aaggagtcta aagagcgatc catcgaagtt ttggagttac 2460 gtgaacagcc ggaaaaacgc caatcgaact ccaacagacg tttcgtaccg cgatttgaac 2520 tcaacatctc cagccatcgc cgccaacttc ttcgctgatt tcttcaagtc ggtgcacagt 2580 tctgatgcaa ttccaccccg cgaccacgaa tttgaatatg taccaactca ctatctgccg 2640 ctgccgcttc cggtgctcaa cgacgaggaa actctgaagg cgctttcggg agtggacgga 2700 gctaaaggac caggtccgga tggaattcca ccgtcgctga tcaaagcctg tgctgattct 2760 ctcaccgttc cagttaagag gattttcaac gattctcttc gaaaaggcgt ttatcccgcg 2820 atgtggaaag tggcgtcaat cactccgatt cataaatcag ggagcaccaa gaaagtcgag 2880 aactaccgac cgatatcgat cctgaactgt cttgccaagg ttctggagaa gattgtgtat 2940 gataggctgt acgcggtggt gcgtccgatc atttcggatg atcagcacgg attcatgcag 3000 aaacgttcaa caacaaccaa cctcctaaca tttgtacact caactgtaaa tgcccttgaa 3060 tccggtgaac aggttgacgc catctacata gatttcgaga aagccttcga caaggtacct 3120 cacgcactca ccgttcgaaa gctccgcaag ctcggtcttc ctgcttggat caccaactgg 3180 ctgctttcgt acctgacgaa tcggaaagct tttgtcaatc ttcgaatcgc ccgttccagc 3240 acatttgaca ttccctctgg cgtcccacaa ggcagccacc tcggcccact cattttcatt 3300 ctctttgtga acgaaatatg tgcgcttaca aattctaaca cattgatgta cgctgatgac 3360 ctgaaactgt tccgctcggt gaagaacaac gtcgactgct gctcgctaca gatggacgtc 3420 aatgcgctcc tgcactggtg cgagtcaaat ggaatgaagg tacacgtgaa gaagtgcaag 3480 gttatctcgt tcagtcgttc aaggaaccct ctgcgatttg actacacgat gggcgatggg 3540 tgtcctttag atcgtgtaaa ctctatccgg gacctaggcg tgaccgtgga tcgtaaaatg 3600 aagttcaacg aacacatcgc tttgacaact ggaaaggctt ttgcgatgct aggcttcctg 3660 aagcgcaaca ccgcccactt cgacgacccg tttgctttga aagtgctgta cgcgtcgctg 3720 gtcaggagcg tcctggagta tgctgcggtt gtttgggcac cgtaccacgt aacgctggct 3780 gcgcggattg aacgagtcca gcgtgccttc gtgaggtttg ccttgcgcaa gctgccatgg 3840 aataacccgc tgatcctgcc atcgtacgaa ggacgctgct tgctgtttag gctgcagtca 3900 cttgccgagc gtcggaccct gatgcagcgt cttttcgtat tcgatctgct gcgcggcaac 3960 atcgactgcc gcagcatccg caacaacgta cgcttcaatg aaccggctcg ccaaggactc 4020 agaggggaac gccccctgct gtggattccg agacatatat ttgactatgg ttataataac 4080 ccgctagatg tttgtttccg tcaatttaat gaaatctgct cttgttttga ttatgtcgtg 4140 tccaaaactg tgtttaaaac tagaataagt taatgttagt tttttaagat tatacagcct 4200 gagcatttta gatggaggcg gtgaaataaa taaata 4236 // ID Kiri-21_AAe repbase; DNA; INV; 4554 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-21_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4554 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 716-716 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >94% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 264..1064 FT /product="Kiri-21_AAe_1p" FT /translation="MIRHTCQLDVAIKTMDTPIYNDLLNNVHKFDELCEKM FT QLMFQKVQDNIASKINAYQHELSERINCIEARISTIRNESIANVEQMIESV FT KKVRSDSLFINDKLQFVNRSKELIISGVPNDVNKNLYEIFRSIAKRLGYKD FT EDVPVVDLDRLPSTSSDKSFIVCRFALRTSRYKFFKSYLSDLSLCLKDIGY FT SPNPNDTNGSLSRIFINESLSKRVRDIRSTAMRMKKAGLIEKVSIRDGEVL FT VKLHKMDPFLPCRTKQALLETLNLSK" FT CDS 1557..4376 FT /product="Kiri-21_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MASSPIDNTSSSSIITKAVMDSVFRTDKLNICHINVQ FT SVCARGFSKFDELKAVFFNSKAHIVCMSETWLNESINDSMIRIEGYNLIRN FT DRNRHGGGLCVYFRQNLSLKLLKKSTFSPYEPSHITEYLLFEVATSNRKFF FT LGVYYNPPNNDCSNLIFEHLEEFKMKYDLTFITGDFNTDLKKVTPRTNRFT FT DVLSNLSYVCLNQEPTYFHTTGCSLLDLFIIDSPEIVFRINQVSMPGISRH FT DMILAVLDIFSDSSEQGFFHRDYKNFDSQGLLTAFNNIDWNYFHSISDSDM FT LIHILNEHFQYLHEEFFPLKFSKYRKNPWYNEDIEKAIIDRDLAYRNWKAS FT RLQSHNLLFKTLRNRVTTIIRNAKRSYYNQRINTNVPSKQLWRNIKSLGIA FT NKKQSSVNCDTTADEINAFFSENYSSDENPRLNLNYGPFGFRCVEEFEIVN FT ALFSIKSNASGLDNLPIEFFKIMLPLALPLYTHLFNTIITTSKFPQAWKYV FT KVIPIKKKPRSTSISNLRPISILNALSKAFEKIVSAQISDYVNRNNFLSPF FT QSGFRRNHSTETALMKVHDDIASSIDKKGIAVLLLIDFAKAFDRVSHRKLL FT NKLGALFGFSRPAVKLIETYLTNRYQSVFFNSQFSSLRPIDSGVPQGSILG FT PLLFSLFINDLPXALKFCSVHMFADDVQIYFCADSRTSTMEMSRLINYDLQ FT QVFQWSQENLLPINTTKTKAVFINRHRRSHFSMPELIMNNEQIQFADQVNN FT LGVIFNSKLDWEPQINSQVGKIYGILKQLSLTTNHLSYQMKIKLFKSLIYP FT HFIYNDFIYLNATANSLNKLRVALNSCIRFVFNIPRFGSVTHLQTSLISCP FT FDQFLKYRSCCHIYKILTSGNPGYLSSKLNPCRNTRTRSLLVPLHHTSYYS FT HSLFVRGIINWNYLPLNLKLSVNFSSFKRDLLLELN" XX SQ Sequence 4554 BP; 1399 A; 899 C; 773 G; 1482 T; 1 other; aagtttctga agggatgtgt gattgctagt gcacagtgaa gtggttctca tcttccggct 60 agtttcctga ttttaatcgg cttcatctat ggttgcatta atctgaagcc tgaagacgag 120 caattgtggt agctgttcca tctgaagttt cacttcaatt cgtcgccgat ttttctacaa 180 accggttcgc aaaatcttgt agaacctact actagcaaca attgtttctg tgaaacccct 240 tcagcaagtt cagtattggt tgtatgatac gtcatacctg tcaacttgac gttgccataa 300 aaacaatgga tactcctatt tacaatgatt tgctgaacaa tgtacacaag tttgacgagc 360 tctgcgaaaa gatgcaactg atgttccaga aagttcaaga caacattgca tctaaaatca 420 atgcttacca gcatgaactc agtgagagga tcaactgcat tgaggcccgt atttcgacta 480 tccgtaatga gagcatagcc aacgtcgaac aaatgatcga gagtgtgaaa aaagttcgct 540 cggattcatt gtttatcaac gataagctac agttcgtcaa cagaagcaag gaactgatca 600 tttcgggggt tccaaacgat gtaaacaaga acctgtatga aattttccgc agtatcgcca 660 aacgtctggg atataaggat gaagatgtac ctgttgtcga tttggatcga ttaccttcaa 720 catcatcaga taaatccttc atagtctgcc gtttcgcact ccgtactagt agatacaagt 780 ttttcaaaag ctatctgtcg gatttgtccc tttgcttaaa ggatattggc tactctccga 840 accccaacga taccaatgga agtctgtccc gtatattcat caatgaaagc ctatcaaaac 900 gagtacgaga tattagatct acagcaatga ggatgaagaa agctggactc atcgagaagg 960 tgtcgatcag agacggcgag gttctggtga agctgcacaa gatggatccg tttcttccct 1020 gtcgtaccaa gcaggcgctt ctagaaactc taaacctttc caaataagtt tctcttctcc 1080 tgcccagttt cttcccatgt tgcctctcct ttcattccat gttatcttcc cctcctgaaa 1140 gttatagaat actgatttgt acctttcctc attatgctct ttatccttat caaaatttct 1200 tcctgtgttt cccatcctgg ttttcccatg attattcctt cctgaaagtc aaatcatatc 1260 aacggaaatg catggcaact gttgatggcc tatctggttt gtgatctgct gctgctgctg 1320 ctgttgctgt tcggtcagtt gctgtgctgt tgttgctgtt gcttgccgat ctatatattt 1380 acactttctg ccggaatacg atgacaacca ttcattcaga agctttgcta tggattcaat 1440 tgatttgtta attcttggta gttataagtt cctgcgtgct gttgtgtttg ttatgcgttc 1500 tctcatctct tggataggtg ttcgtttttg gctcggtgtt tgtactatag aagattatgg 1560 ctagttcacc aattgacaat acctcttcaa gcagcatcat tactaaagcg gtgatggatt 1620 cagtttttag gactgataaa ttaaatatat gccatattaa tgttcagagc gtttgcgctc 1680 gagggttttc caagtttgat gaattgaaag cggtgttttt caacagtaaa gctcacatcg 1740 tctgtatgtc tgaaacatgg ctcaatgaat caatcaatga ttcgatgatt cgaattgagg 1800 gctacaatct aattcgtaac gatcgaaacc gccatggtgg tggactatgc gtctacttta 1860 ggcaaaatct aagcttgaag ctcctaaaaa agtctacttt tagtccgtat gaacctagtc 1920 atattaccga atatttgcta tttgaagttg ctacaagcaa tcgcaagttt tttcttggtg 1980 tatactacaa cccgccgaat aacgattgca gtaatttgat ttttgaacat ctcgaggagt 2040 tcaaaatgaa atacgatttg acgttcatta cgggggattt taacacagac ttaaaaaaag 2100 ttacacctag gaccaacaga tttacagatg ttttatctaa cctgtcatat gtttgcttaa 2160 atcaagaacc aacttatttc catacaaccg gttgctcttt acttgacctt ttcattattg 2220 attctccgga gattgtcttc agaattaacc aggtatctat gcctggtata tccagacatg 2280 atatgattct tgcagttcta gatattttct ccgattcatc cgaacaaggt ttcttccatc 2340 gtgattacaa aaactttgac agtcaaggat tattaactgc ttttaataac atagattgga 2400 attatttcca tagtataagt gattctgata tgttgattca cattctgaat gaacacttcc 2460 aatatttaca tgaagaattt ttccctctta agttcagtaa atacagaaaa aatccttggt 2520 ataatgaaga cattgaaaag gctataattg atagggattt agcttacaga aactggaagg 2580 ccagtcgctt acaatctcat aatcttcttt tcaaaacact gagaaacagg gtaactacaa 2640 ttatcaggaa tgcaaagaga agttattaca atcagcggat caatacaaat gttcccagta 2700 agcaactctg gagaaacatt aaaagtttag gaatagccaa caaaaaacaa tcttctgtga 2760 attgtgatac aacagctgat gaaatcaatg ctttcttttc agaaaattac tcttctgatg 2820 aaaatcctcg tttaaactta aactacggtc cctttggttt tagatgtgta gaagaatttg 2880 aaatagttaa tgcactgttt tcaattaagt ctaatgccag tggtctagac aatcttccaa 2940 tagaattttt caaaattatg cttccactag cccttccact ttacactcat ttgttcaata 3000 caatcattac aacttccaaa tttccacaag cttggaaata cgtaaaagtg attccaataa 3060 aaaagaagcc tcgttcaact agcatttcaa acctcagacc cattagcata ttaaatgcac 3120 tttccaaagc ctttgaaaaa atagtttctg ctcaaatctc tgattatgtg aacagaaata 3180 atttcttgag tccctttcaa tcaggttttc gccgaaacca cagtactgag acagctttga 3240 tgaaggtgca tgatgatatt gcttcgtcaa ttgataaaaa agggatagca gttctcttgt 3300 tgatagattt tgcgaaagcg tttgatcgtg tttcacatag aaaactgttg aataaacttg 3360 gtgctctttt tggattctcc cgtcctgctg tcaaattaat agaaacttat cttacaaata 3420 gataccaatc cgtttttttc aatagtcaat tctcttcttt acgtccaatt gattcagggg 3480 taccacaagg atctatttta ggaccccttc ttttttccct tttcataaat gaccttccak 3540 ccgcattaaa attctgctca gtacacatgt tcgccgacga cgtacaaatt tatttctgtg 3600 ctgattctag aacaagtacc atggaaatgt ctaggttaat caactacgat ctacaacagg 3660 tgtttcagtg gtctcaagag aatctactgc ctattaacac aactaaaact aaagcagtat 3720 ttattaatcg gcatcgacga tcccactttt ctatgcctga acttatcatg aataatgagc 3780 aaattcaatt cgcagaccaa gtgaacaatt taggagtcat ttttaattct aaattagatt 3840 gggaacctca aataaattca caagttggta aaatatacgg tattctaaaa cagctctcac 3900 ttactactaa tcatctcagc tatcaaatga aaatcaaatt gttcaaatcc cttatttatc 3960 cacattttat ctataatgac ttcatttatt tgaatgctac cgcaaacagc ttaaataagc 4020 ttcgtgttgc actcaattcc tgcattcgat ttgtttttaa cataccaaga tttggtagtg 4080 tcacccatct tcaaacatct ttaattagct gtccttttga tcagtttctc aaatatagat 4140 cgtgctgtca tatttacaaa atactcacat ctggtaatcc tggttatcta tcctcaaagt 4200 tgaatccatg tagaaacacc agaacaagaa gtcttcttgt tcctttacat cataccagtt 4260 attatagtca ctcgctgttt gtaagaggta taattaattg gaattaccta ccattgaatc 4320 taaaattatc cgttaatttt tcgagcttta aacgggacct actcttggag ttgaattgag 4380 cgcactaaac aagcacacag caattaatta atagtaaaat caatagaact attacatgaa 4440 taagaaatag cacatcctac tcaatatcta caaattgtaa catttaaaaa gattgtaaat 4500 cttgcgttac atgaatgaag tataaataaa taaataaata aataaataaa taaa 4554 // ID Ci000012 repbase; DNA; INV; 1886 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Interspersed Repeat from Ciona savignyi. XX KW Transposable Element; Ci000012; Interspersed repeat. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-1886 RA Smit A.F.; RT "Ci000012 - Interspersed Repeat from Ciona savignyi."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC Contains a fourfold repeated unit. gt10000 copies in genome. XX SQ Sequence 1886 BP; 663 A; 296 C; 364 G; 562 T; 1 other; ttatgggtcc caattgaaaa attgggacac atattgtttt tgtccgtttt ttttttttct 60 tttttttttt tttcttcgtc aaaggtttga caggcatggc atggcgcccc gacatctacc 120 agaaggaata atggaatcgt aatgaaactt agccatttgt tagacaacat ggtctagata 180 tgcagaaacc aaaattttgg gtgataaatc aaggggaatt cccctgggag gttaaaaacc 240 aaaacggtat tttttataaa aaaaatagct ttttgtggca tagcgcttaa acagctgctc 300 caatttgaat aagatttggt agaatgtaag tgcattagac actgatttaa cacttggaaa 360 aaataaaaaa tttacctagg gaaataccct aaaaataaga ttttgttctc aaatcatgat 420 tgtgacaggc atggcatggc gccccgacat ctaccagaag gaataatgga atcgtaataa 480 aattwagcca tttgttagac aacatggtct agatatgcag aaaccaaaat tttgggtgat 540 aaatcaaggg gaattcccct gggaggttaa aaaccaaaac ggtatttttt ataaaaaaaa 600 ttagcttttt gtggcatagc gcttaaacag ctgctccaat ttgaataaga tttggtagaa 660 tgtaaatgca ttagacactg atttaacact tggaaaaaat aaaaaattta cctagggaaa 720 taccctaaaa ataagatttt gttctcaaat catgattgtg acaggcatgg catgtcgccc 780 cgacatctac caaaaggaat aatgaaatcg taatgaaact tagccatttg ttagacaaca 840 tggtctagat atgcagaaac caaaatgttg agtgataaat caaggggaat tcccctggga 900 ggttaaaaac caaaacggta ttttttatta aaaaaattag ctttttgtgg catagcgctt 960 aaacagctgc tccaatttga ataagatttg gtagaatgta agtgcattag tcactgattt 1020 aacacttaga agaaataaaa aaagtactta gggaaatacc ctaaaaataa gattttgttc 1080 tcaaatcatg attgtgacaa gcatgccatg tcgccccgac atctgccaaa aggaataatg 1140 gaatcgtaat gaaaatttga catttgtaag agaacatggt ttagatacga agaaaccaaa 1200 attttgggta atgaatccag gggaattccc ctgagacgtt aaaatccaaa aggttttttt 1260 tccaataaat tagctttttg gggcatatag cgctggaacg gctgctctaa ttttgatcag 1320 atttggcgaa atgttggact attagacact ttgtgccaat taaaggaaag aaaacattta 1380 cttagggtaa ttacgttaaa aacgacgata tgttcctgaa ttacgtttgt gactggcatg 1440 ccatatcgca ctgacaaata agacaagcag tcgacaaatc agattaaaac tttaagaatt 1500 gttacataac atgacccaca tatgtaaaaa cagaaactta tctactaaac ctaggggaat 1560 tccctgacac atgaaaaact aaatcgggct gttttggcaa tttagcttca tttcacttca 1620 gcgcttgaaa agaaggttta atttaaaata tttgtggtga aaatgttcgt aaatttaaga 1680 cgtttattct tttttaacta cacatggaaa agaagtttgg cttgataact ctgggaccca 1740 tgcgtttatg cttgcagaaa caaactacat atactcagtg tatgaaatac aaattgtggt 1800 tagggaccct gtagcatgag tatatggcct acaacatgtg gtttacaaat agtggtggtt 1860 aaacacgtgg ttgacactct ctagtt 1886 // ID Gypsy8-NVi_I repbase; DNA; INV; 8264 BP. XX AC AAZX01004364; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8-NVi; KW Gypsy8-NVi_I; Gypsy8-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8264 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1126-1126 (2007). XX DR Genome; AAZX01004364; Positions 14204 5941. XX CC Positions [6550-6975] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 601..1941 FT /product="Gypsy8-NVi_I_1p" FT /translation="MRKRACYCSGDECHDRGKINSINKSRDVKMRHEVVKI FT CNKCCIAKSESGERRKSDSGYRNARLAKENKQHGDSKRSSYVHRSRRPPSD FT DDSYDESNDSDDESAEILEPCYIEKGTASQHKNHGSATIRRMTVHQAVNHI FT PYFDGDLDSLNLFCNAVREVLVTYGPEHERFLLLHIANRLKDKATEGYRAR FT TANYSSVEVLLRDLTLHYANIGIADQIYDEISTVEQNPGEFAGEYGLRVGK FT LYNRLRTFVGSAPDLSTADRESRLRQAEQDVLEQFLFGLKQPLDHLVRSKQ FT PQNIDVAIRFVIEFEGKRSARVATFSASPIVKPAAQVRLATATETKLIEAP FT ADGQMNPGTADPANQSAENKFCEYCNSNTHTLIECRTLISHAAKKIIRRPH FT YTRVSGTRRRQNDNRNADENQNKYDNKSNPNSDSRDNRDDEKRKNNSSQGN FT LN" FT CDS 4393..6975 FT /product="Gypsy8-NVi_I_2p" FT /translation="MLEDSAKQAESGAREERVEKKKLDVDALPPSLLAAWP FT WSGANRKAKLAVNFERREIKGKEGLRCEIYERELSVAAKSENTKNQRKPRK FT ATPSLTDRGNASSSECDDAQAKEARVADKTSDKTRMREESLNRPKPRIIEN FT IADPPGLRLHEIAGRGMSPDPEDIGPPENFDDIPRKYERKVDSSDVDSSAS FT SGNPLIQTYLRKKRCAEPCPTLRRSDDIPGFPPGNCLFFSLLKLSGPELSA FT LQLRQLLASSPLLKSCGEPEETERILESESEYGNVDCAFLFANMFDTDICI FT HFEPPKGVRTFLHIIVRNPKKQIHLNLRGCHFTPYIKTDAPVPSESTLERP FT RDDPPSDNGENAGSKSPILGKNKRALQKQSKLGKHPEVKETDVTNGENPKA FT QNNSEGEVSEPLRPYDSINVSPPQSPPVNTPSITSKLSADAPLSVGSQLTG FT GISGASAGENAEQSNIAKAQAAVEAALANNKLKRCVVARTRPLREKPPWAI FT PEGQEPPLFVHLQALNEHPFRFQENLLYLVSADNYLESEVQNALVERGYLN FT VEVTLEKACKLGEVNITEHKGCDLIGLYIKEYIDTRPLKSDILKCLKVLKN FT VLLKKRDTKPSDNSRLLTLSECITFTELFDSVFMNKPLVAVIYKNNLPVPP FT VRNRIKLIREYHEAAIGGHRGRTKTFSRIAADFYWKNMRDDVNQFVARCAT FT CIGNKLIRIKTRQPLLISNTPSLPFAQIEMDFYGPLETTERGNKYILSIQD FT MLSKYIILLPTKHANAKEVARALTERVICVFGPAAAILTDQGTHFQNKVLA FT TLAKIFGIDKFSTTAYHPQSNGSIERMHHTLTEYLRKYVKIASKWDEWTAI FT RQHAYNTT" XX SQ Sequence 8264 BP; 2589 A; 1916 C; 1940 G; 1819 T; 0 other; tttacgcaac atttagccct tcggtccacc gctcgctatc atcgaaatta taatctttca 60 attaaagctc atttaagcct gttttttttc ggctacgggg acgctcgcta gcacggcccg 120 gacgtgtcac ggatgcgagc ttatcgctac ccgatgacag tttgcgccgc tcatgacgca 180 ggcatcgcca cgacgcagga gcgggcagac aaccggtggc agtcgagcgg agaagagctc 240 cccagaagaa tacgaataaa gttccaatta accatactga acagcctagc tcctccaggc 300 gtgacccacc ctcaactctg ttacaactgg catccctcgg ggaaactgga cgcgaagcga 360 cgcgcagtca actggatttt ggatttttca ccaaattttt ttttcggaag tgccaattta 420 caaatcgact gccaccagcc aagaacgtat cacctccaaa attgggaaaa agtggatcga 480 gactgcaaaa aaaattagta atcttcagtc gaaaagaatc tcacgaaaat cgaaagccga 540 aaggaaagga atgaaagcga gtgttacgag tcgtccgata ggaacaaggt taatttagtc 600 atgcgtaaac gcgcgtgcta ctgcagtgga gatgagtgcc atgatcgcgg aaaaataaat 660 tcaattaaca aatctagaga cgttaaaatg agacacgaag tagtcaagat ctgcaacaaa 720 tgctgtatcg ccaaaagcga aagcggcgag cgtcgtaaaa gcgatagcgg ttatcgcaat 780 gcgcgcttag ctaaagaaaa taagcagcac ggcgacagca agcgtagttc ttatgtgcat 840 cgaagcaggc ggccgccgtc ggacgacgat tcttacgacg agtcgaacga ttcggacgac 900 gagtccgcgg agatcctcga gccttgctac atcgaaaagg gcacagcgag ccaacataaa 960 aaccatggga gcgcaactat taggcgtatg acggttcacc aagcggttaa ccatattcca 1020 tattttgacg gggatctgga tagtctaaat ttgttttgta acgcagtgcg ggaagttctg 1080 gtaacatacg gcccggagca cgaacgtttt ctgttattac acattgctaa ccgactaaaa 1140 gacaaggcca cggaaggcta ccgggctcgt acagctaact acagctcggt cgaagtactc 1200 ttacgcgact taacactgca ttatgctaac attggtatcg ctgatcaaat ttacgacgaa 1260 attagcacgg tcgagcaaaa tccgggagaa ttcgccggag agtacggact acgagtcggc 1320 aagctttaca atcggctacg cacgttcgtg gggtcggcgc cggatctgtc aaccgcggat 1380 agagagtcaa ggttacgcca agcggagcaa gacgttcttg agcaattttt atttggccta 1440 aaacagccgc tcgaccatct ggtccgcagc aagcaacccc aaaatataga cgttgcaata 1500 agattcgtaa tagaattcga aggaaagcgt agcgctaggg tggcaacatt tagcgcaagc 1560 ccgattgtaa aaccggcggc tcaagtacgt ctagcgactg ctacagagac caaacttatc 1620 gaagcacccg ccgacggtca gatgaacccg ggcacggcag atccggcaaa ccagagcgcg 1680 gaaaataaat tctgcgagta ttgcaattcg aacacccata cgctcataga atgtcgcact 1740 ctgattagcc atgcggccaa aaagataatt aggcgaccgc attatactcg agttagcggc 1800 acccggcgta ggcaaaacga taatcgcaac gcggacgaaa atcagaacaa gtacgataat 1860 aagagcaatc ctaacagcga tagccgcgat aaccgcgatg atgaaaagcg aaagaataat 1920 tcatctcaag gcaatttaaa ctagagggac gcccgccgta tccctcgagc atcgggcgtc 1980 aagtagaact gagagaaaca acaaccgcga tagaaacagc aaccgcgagc gaatcagcaa 2040 atatcagatt cttgctggcc ggcagacaac cgtcgatcgt gaacgtatct tgcccccagt 2100 tgaggaagag gcaagggaaa ttttacgcgg attcaggggt cgatatttcg gtagtaaaaa 2160 agagcgaatt agccccgtgc tatccattcg atgcgagccg tattataaaa atccagggag 2220 ttacacccgg atgttcgtac acgttggggg aggcaatcgt caagcttcac gggctggagt 2280 gtaaagtcca cgtagtgcca gatgatttcc cggtagaaaa ttcggggata atcggctggg 2340 atataataga cgcacataaa ggatgcgtcg acgcatcaag ccaaagttta aagttaggcg 2400 atgaaaattt gctcttcgat acgaacgaaa aagttactat accccaaggg taaaaatgat 2460 aatcggtgcg cgcgttcgta gtagtaacgt gagcgtcggg tgggttcctc tcatggactt 2520 gcacccggac ttgctttttg gaaactttgt ggcagaaaat agaaacggcc gagtattcgc 2580 cgaatgtatc aacataaacg aaaaggaagt cacgattgcc agtccatggg tatctattcg 2640 agtgcgagac agtagttgat aatccgcttt atcaagctag ggatgataat tctcccgata 2700 atgtcggtga attcgcggtg aatttaaggc gcttatttgg taacaaccaa agcaaaaaat 2760 ataaagaagt tctgactctc aatgaaaaat tacttctaga cgataaaatg cgacgcgaac 2820 gtgttcaaaa aatcttaggg ctcgctgatc tcgagggatg caacgaagaa gaaatcgagt 2880 acgtacgcga gattatcaac gatttcccgg gagtattcgg ccttgacggt gaaccacttc 2940 tggctacgca cttgctaaag cataaaattg tcttaaaatc ggataggcca attaaaaatg 3000 gtagattcag attccctcca gcattaagag agcatttgtc tcgcgagttg caaaaattac 3060 aggatcaggg tattattgtt ccctccaatt cgaactattc ctcgtcctta tggattgtcc 3120 cgaaaaaacc cgacgcacag ggaaataaac gctttcgcct tgtcacagat tttcgcggtc 3180 ttaatgaagt gacagaggga agctgccacc cgctcccaat caccagcgac ataatcgaac 3240 acctagccgg cgcggaatat ataacgtgct tagatctcag atagggttat cataagatag 3300 aaatggaccc cgattcggcg cacttaaccg cgttttatgc ccctgatggg aactatggga 3360 atcagttaat gcagtttaac cgcatggcta tggggctaaa agaggctacg attacattca 3420 caaaagccat gtcgctggct atgaaagggc tacaagggga cgaggtagaa atttacctag 3480 atgatctaat ggtattcagc cagacccttg acgagcacag agtccctctc cgtcgcgtac 3540 taaaacgctt aatcgacgct aatttatccg ttgaaccaaa aaaaatgtca atttctaaaa 3600 aaagaagctc atgtgctcgg acatatagtc ggaggcggaa tgataaagac agacccggaa 3660 aaagtaaaag caatggccga gttcccggta ccgaccaatg ctcataagct gaaacaagct 3720 ctaggcctat tcagttacta ccaaaggttc attaagaatt tctcgaaaat tgcgtatccc 3780 ttacattctc tccttcggaa ggatgtcgag tttatttggg gggaagagca gcaggctgcc 3840 ttcaacgagc tgccgaaatt aatggcggaa gagcccgtgc tgaaatcgcc ggatctctca 3900 gccgtttatc gtcactactg atagcagcga ctgggccctc ggcgctatcc taagtcaggg 3960 caagctgggc gcagactagc cgtgcgccta cgcgtcgcgc tgcctcaaag gtagtgaact 4020 taaatatcct acttacgata aggagctgct tgctatgttc ttcgccaaag agcagtttcg 4080 acattatatt tacgatagga aatttactat cgtaacagac cacgagagtc taaagcactt 4140 ccataacaca aagaaacccg atttgcgatt taatcgtctt aaggctgcgt taaccgggta 4200 cgactttggt atcgtgtatc gtcctggcgt aaaaaatgcg aacgcggacg cattatcgcg 4260 aagtcccgta atagaaaagg gagaaatcaa cccagagttg ccgcgcgccg aattgagaaa 4320 ataaatcaga tccgacggaa tttgacgccg ccgtagagct ttctcaagtc ggcgtgcgaa 4380 agtcctctgg cgatgctaga agattccgcg aagcaagcag agtcaggagc tcgcgaagaa 4440 agggtcgaga aaaagaaact cgatgtcgac gcgttaccac caagcttact cgcagcatgg 4500 ccgtggtcag gcgctaatcg taaagcgaaa ctagccgtga atttcgagag gcgagagatt 4560 aaggggaaag aaggcctacg gtgcgaaatt tacgagcgag aattatccgt cgctgcaaaa 4620 tccgagaata ctaagaatca gcgcaagccg cggaaagcaa caccaagtct gacggaccga 4680 ggtaacgcga gctctagcga gtgcgacgac gcacaagcta aagaagcaag agtcgctgat 4740 aaaacttctg ataagacacg tatgcgcgaa gaatctctga accgtccgaa gccccggata 4800 attgaaaaca tagcggaccc tcccggattg agacttcatg aaatcgctgg taggggtatg 4860 tctcccgatc ctgaggatat cggtccaccg gaaaactttg acgatattcc cagaaagtat 4920 gaaaggaaag tagattcatc ggacgtagac tcctccgcta gctcgggaaa ccctctaata 4980 caaacatacc tgcgtaagaa gcgctgtgcc gagccatgcc ccacactcag acgatcggat 5040 gatataccgg gatttccacc aggcaattgc ttgttttttt cgcttttaaa actgtcgggc 5100 cccgagttat cagcgttgca attaagacaa cttttagctt cctcacctct gttaaaatcg 5160 tgcggtgaac cggaggaaac agaacgaatt ttagaatcag aatcggaata tggaaatgta 5220 gactgcgcgt ttcttttcgc taacatgttc gacacagaca tatgcataca cttcgagccc 5280 ccgaaaggtg tacgtacatt tctgcatata attgtaagga atcctaaaaa acaaattcac 5340 ttaaatctta gaggttgcca cttcacgcct tatattaaaa cagatgctcc ggtcccttcc 5400 gaaagcacat tggagcgccc tcgagacgac cctccttcgg ataacggtga gaacgcgggc 5460 tcaaaatcgc cgatcctagg taaaaataaa cgagcgctgc agaaacaatc caagctcggc 5520 aaacaccccg aggttaagga aactgacgtg actaatgggg aaaatccaaa agcgcaaaat 5580 aattctgagg gagaagtaag cgagccgttg cgaccatatg actctattaa tgtcagccca 5640 cctcagtccc cgcccgtcaa cacccctagt atcacaagca agctgagcgc agacgcacca 5700 ttatcagtag gctcacagct gacagggggg ataagcggcg ccagcgctgg cgagaacgct 5760 gagcagagca atattgctaa agcgcaggca gcggtcgagg ccgctttagc caacaacaaa 5820 ctaaagcgtt gcgttgtcgc taggactaga ccgcttcggg aaaagccccc ttgggcgata 5880 ccagagggac aggaaccacc actttttgtt cacttacaag cgctgaacga acacccgttc 5940 cgcttccaag aaaacctgtt gtatctggta tccgccgaca attacctaga atccgaagtg 6000 caaaacgctt tagtagaacg cgggtattta aacgtagaag tgactttaga aaaagcctgt 6060 aagctaggcg aagtaaatat aacagagcat aagggttgtg acttgatcgg cctatatatt 6120 aaagaatata tcgatacgcg ccctctaaaa tcagatatcc ttaaatgcct aaaagtccta 6180 aaaaacgttt tactaaaaaa aagagatacg aagcctagcg ataattcgag acttttgaca 6240 ctgtcggaat gtataacttt tacggagctt tttgattccg tttttatgaa caaaccgttg 6300 gtagctgtaa tatacaaaaa caatctaccg gttccacccg taagaaatcg cattaagtta 6360 attcgcgaat atcatgaggc ggccatcggc ggccaccgtg gtagaacaaa aactttcagt 6420 agaatagccg cagattttta ttggaaaaac atgagagacg acgtaaatca gtttgttgct 6480 cgatgcgcaa cctgtatagg taataaattg attcgtatta aaacgcgtca accgctgcta 6540 atcagtaaca cgccgagttt gccattcgct cagatagaaa tggacttcta tgggccgctg 6600 gaaacaacag agcgcggtaa caaatatatt ctatcaattc aagacatgct atcgaagtat 6660 attattctac tccctactaa gcacgcgaat gcaaaagagg tcgcccgcgc tttgacagaa 6720 agagtaatct gcgtattcgg gccggcggca gcgatcctga cggaccaggg aacgcacttc 6780 caaaacaaag tactagcaac ccttgcaaaa atttttggaa tagacaaatt cagcacgacg 6840 gcctaccatc cccaatccaa cggatcgatt gaaaggatgc accacacatt aactgaatat 6900 cttcgtaaat atgtaaaaat cgcgtccaag tgggatgagt ggacggcaat acgtcagcat 6960 gcctataaca ccacctagca cgagagcacg cagtacacac cgcacgaaat cgtgtttggc 7020 gtaaaaccgc gcacgccatc tagctttcca ctggttagcg acaacatctt atataatgac 7080 tatattaaaa atatggtagg aaatttaacc gatctacaaa cagccgcggc gttgaactta 7140 gtacaatcta aatatcgttc caaattttat tatgatataa ataaacctaa gaaaggcaag 7200 ttcagacaga gtaccgaggc cctgttgaaa ttataagcat taatagaaaa actaataata 7260 tcacgataca gcatgacaca gcaatcgaaa cagtgcacgc taataaattg aaaatgccat 7320 gtgaaattgc gaaactggct gccgaatcat ccggcgaccc cgaagaataa gcgtttcctt 7380 tttactgttt ttttttgcac ggcaccggac gtgccgcgca aaagagtaac aaaaaaaaaa 7440 cgggtaaatt actaacattt gaaaaatcgt gacgcgtgtc gtattccacc ggagagccga 7500 gcattcgcaa ttagctaaat tgtaaacaag gttgtgtaat aaattttcag atgtatctgc 7560 gcagcagcag cgctagaaga tcagacagtg tccgaaagtt ccataagagc gatccatgtt 7620 ctagatcaaa atttaggctt aattacagaa aaaatcgcac cattagctac atcgagtacc 7680 aattggaaaa ttatacaaaa aattgactta acaccgtatt ttagagcaag cgcagaacta 7740 cctagtcgca tagccaaatt tggcagggcg tgtgggtgga gatgcacaaa tgaaaattta 7800 tttgaggaaa cgacagcagc ggctcgtcag gccgaaagag tattagatct actcctcgtt 7860 cacagcccag tagaagagcc gcgccgcgcc cgccgatccc tgttgccatt cgttggcact 7920 ctgcataaat ttctttacgg aactctaaat gaggacgacg agcgcgagat ccagctggct 7980 ataaagacag tggccagcga cgcacgcata acagcagcat tacttgcgaa tcaaacggaa 8040 atagttgaac gcaaattatt tgacctaaac gagaaaatag taaagttaca ggcagccact 8100 gacatcctag gcaataagac ggcgaacttg gacaaagaga ttagcattca aagcgcaatt 8160 tcaagcacta aagatggcct tattcagttc aggcaagaaa ccgaaacgct aacggatgcc 8220 attctttttg ccgccaaggg cctgattcac ccgcgaataa tacc 8264 // ID hAT-25_HM repbase; DNA; INV; 4876 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-25_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4876 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2014-2014 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 902..3979 FT /product="hAT-25_HM_1p" FT /translation="LNRSMENGNCLYNSAALLINSSDKTHEILRLLTSVEL FT FEKATFYSRYPIILNQVESKSFSSLSSVFMACVSFESSDSFTELNDINIAE FT LIKKEAYNNCRNNKFASFLCLIALSTVLNKQIRSIYPEFGLEKYKKLFNVC FT IYPRGDHKTNKEILSILWCNTDCRSISKDKDMWSPNHFVPIVKREVLCSKK FT FVKSDLKRGLCSVLPSVLSHSSVNTKVLNLSCQPPKNLFSKKKSLVQSHIK FT FGISNETKIIDVNIDITPNPILLNCIPCSSDQSLSLPFYDVSTYYERGRQL FT SAGKDDSMLLDLVNKIFIPNSSFVFPISGEKKKRSFLHKWLLEYSWLAYSQ FT IEDGAYCLPCTLLGSRIPNNTSIINFIRKPFKIWGNAIRAYMDHDNNCRLH FT QMSMHCLNALQSRSNSKKNLIEVDIDNQRKKLIISNRNKLIPIIKTIIFLG FT RNDIAFRGHRDDSKYHPEIGETCKNNIGIGNFVELLNFRVEAGDKVLEHHI FT RSAPKNATYISKTTQNELIECCGKTIEEVLINKIKNSGYFSILCDGASDCS FT NVEQLSLVIRYVDCDNVICEDFLRFIECKSGTTGLSLAQNVVSAIDDLGLD FT IQKCRGQGYDGAGAMSGKLKGISSRIKFINSKAIFVHCACHKLNLVVSKSC FT NVQSVRNVLDQIKDISYFFNLSPKRASCLNKFIFPGQAKLIDTCRTRWVQK FT LKGLDVFFDNYISVFHSMEEMAYNEGKSYNIDTSSKASCFLNLMTNFSFIV FT SLVLTKQIMDYFYAITVVLQTKAFDISQQCNEINCLKTQILDLKKNIDVYH FT NEWYLIALDLAKTLDVSEVKPRLCGGQIYRDNYPSDTVSDYFKYSITSPLL FT DHLINELEDRFDKGDMVVYKGLSGIPATVVKKNKEKCFWKLDFMEFLHFYI FT SDMPHPTSIHAELDLWETFWNNQSVIPSTITETLKSIDMRGFPNIREGFLI FT MGTIPITTCECERSISVIRRLKTYMRSRMTESRFNSLALMSIHQEIIPDVE FT RVLNIFSVLGERRLELVFT*" XX SQ Sequence 4876 BP; 1709 A; 751 C; 763 G; 1651 T; 2 other; caggggcgga tctagggggt atcctgtgtt gcccaggatt cacccataat attctaagaa 60 attawaaaaa attccgtttc ataatctaaa aaatactaaa cttaaattta tagtatacta 120 atatgtatat atgagttcta tataaaaaaa aaaacattgc ctttatagaa aaacatacac 180 ggccgtaaca atgtttcaga cttagctctc ctccgtcatg ttcaaaacgc cgctattaaa 240 tcaggattct ttttttaaag taaacggtac cgaaagaacc cttttaatca atcgttattg 300 caatgtattt attttgcttg ccgtaataaa ggccaaaaaa atccctattg tttcaggatt 360 catttgaccc gaatttccgg tagacaacaa aaaaagttgc aggaacgttt gatcaaaagg 420 gttctctcgg ttccgtttat atataaaaat gaatccttaa ttattctaat aacgccttaa 480 ttattcgcat aaagccttaa gtattcgaat aaattgaagt ttatcaaaaa ttagctaata 540 agtagttttt attacaaatt ttatttttgt ttacaattat ggatttgttg ctatccaatg 600 ccattaaagc caagcatttg cctaatatta caacttacaa agcaaagctt cgtaataatc 660 ttaaagaatt aaattcaaac tttcctgttg taaacaaaga tcttctttct ttaagaaatt 720 ctgaaatgga agcttttatt cctatgcaaa aagaaaatga agactatatt gttttaaagt 780 atgagtttaa acacgttttt ttttaatgtt ttattatgtt tataaaacaa taattcttga 840 caaataatgt ttagttaaat atgtttgttt tagttatatt tagtatatat atatgttatg 900 acttaacagg tcaatggaga atggtaactg tctttacaac tctgctgctt tattaataaa 960 ttcatcagat aaaacacatg aaattcttcg actattaact tctgttgagt tgtttgagaa 1020 agcaaccttt tattcgagat acccaataat cttaaaccaa gtggaatcta aaagtttttc 1080 ctcgctctct agtgttttca tggcatgtgt ttcatttgag tcttctgatt cttttacaga 1140 attaaatgat ataaatattg ctgaacttat aaaaaaagaa gcctataaca attgcagaaa 1200 caataaattt gcttcttttc tttgtttaat tgcattatct acagtgttaa ataaacaaat 1260 tcggtctatt tatcctgaat ttggacttga aaaatataaa aagttattta atgtatgtat 1320 atacccacgt ggtgatcata aaacaaataa agaaattctg agtattttat ggtgtaacac 1380 tgattgtcgc tctatttcca aggataaaga tatgtggtca ccaaatcatt ttgtccctat 1440 tgtgaaaaga gaagttttat gctcaaagaa atttgtcaaa agtgatttaa aaagaggatt 1500 gtgttcagtt ttgccaagtg ttttatcgca ttcttctgtt aacactaagg ttcttaattt 1560 atcttgtcaa cctcctaaaa atctattttc aaaaaaaaaa tcattagttc aatctcacat 1620 aaagtttggc ataagtaatg aaactaaaat tatagatgta aatattgata ttactcctaa 1680 tccaatctta cttaattgta ttccttgttc atcggatcag tctttaagcc ttccttttta 1740 tgatgtttct acttactatg agcgcggtag acaactatct gctggcaaag atgattcaat 1800 gttattagat ctagttaata aaatatttat tcctaactcg tcttttgttt ttcctatttc 1860 tggtgaaaaa aagaaacgat catttttaca taaatggcta cttgaatatt cctggttagc 1920 ttactcacag attgaagatg gagcatactg ccttccatgt acactgttgg gaagcagaat 1980 tccaaacaat acctcaataa tcaacttcat tcgaaaacct tttaaaatat ggggtaatgc 2040 aattcgtgct tacatggatc acgataataa ttgtcgactt catcaaatgt caatgcattg 2100 tcttaacgcg ctgcaatcta gatctaattc aaaaaaaaat ctgattgaag ttgacatcga 2160 taatcagaga aaaaaactta ttataagtaa tcgaaataaa ttaattccaa ttataaagac 2220 aattattttt cttggccgca atgatattgc ttttcgaggt catcgtgatg acagtaagta 2280 tcatccagag atcggagaaa catgtaaaaa caacattggt ataggtaact tcgtagagct 2340 tttaaatttt cgcgttgaag caggtgataa agttcttgag caccatattc gttctgctcc 2400 taaaaatgca acctatatat ctaaaaccac acaaaacgaa cttattgaat gctgtggaaa 2460 aacaatcgaa gaagttctaa ttaataaaat taaaaattca ggttattttt ctatcttgtg 2520 tgatggagcc tctgattgtt ctaatgttga acagctatct ctagttataa gatacgttga 2580 ctgtgataat gttatttgtg aagattttct acggttcatt gaatgtaaat ctggtacaac 2640 tggtcttagt cttgcgcaaa acgtagtttc tgcaattgat gatcttggtc ttgatatcca 2700 aaaatgtaga ggtcaaggat atgatggtgc tggggcaatg tctggtaaat taaaaggtat 2760 ttcatcaaga attaaattta tcaattcaaa ggctattttt gttcactgtg catgccataa 2820 acttaattta gttgtaagta aatcatgcaa tgttcagagt gttagaaatg ttcttgatca 2880 aatcaaagat atatcatatt tttttaactt atcacctaaa cgtgccagct gtcttaataa 2940 attcattttt cctggtcaag caaaattaat cgatacatgt cgaactagat gggttcagaa 3000 acttaaaggt ctggatgttt tttttgataa ctacatatct gtttttcatt caatggaaga 3060 aatggcatac aatgaaggta agagttataa tattgatact tcttcaaaag cttcatgttt 3120 tttaaatctt atgactaatt tttcttttat tgtaagttta gttttaacca aacaaataat 3180 ggactatttc tatgccatca ctgttgttct tcaaactaaa gcgtttgata tatcacaaca 3240 gtgtaatgaa ataaattgtt taaaaactca aattttagat ttaaaaaaaa atattgatgt 3300 ctatcataac gaatggtact taattgctct tgatctagct aagactcttg atgtttcaga 3360 agttaaacca aggctctgtg gcggacaaat ttatagagat aattatccct ctgacacagt 3420 ttcagattat ttcaaatatt ccatcacatc tcctcttcta gatcatttaa ttaatgaatt 3480 agaagataga tttgataaag gtgatatggt tgtttacaaa ggtttatctg gtattcctgc 3540 aactgttgta aaaaaaaata aagaaaaatg tttttggaag ttagatttta tggaattttt 3600 gcatttttat atatcggata tgcctcatcc tacatcaatt catgctgaat tagatttgtg 3660 ggaaacattt tggaacaatc aatctgtgat cccatccact attacagaaa ctttaaaatc 3720 tattgatatg aggggtttcc caaacattag ggaaggtttt ttaataatgg gaacaattcc 3780 aattactaca tgtgaatgcg agaggagcat ttctgttata cgcagactga aaacttatat 3840 gcgaagccgc atgacagagt caagatttaa ttctttagct ctgatgtcta ttcatcaaga 3900 aataattcct gatgtagaaa gagttttgaa tattttttct gttttagggg aacgacgctt 3960 agagttggtt tttacctaat attaatttat taatttcttt tttatttatt gactatattc 4020 ttgtatttta gtttgtttta tatttaagtt tattttatat ttcatgaaaa ttagaaaaaa 4080 caaaataaaa aattaaaaaa aaactttaaa aaaaaaaatt ggcctaactg cctatcaaga 4140 aaaggaaaga atccagaaat ggagggctaa cctaaggcaa caatattaat actttacata 4200 aactgccttt caagaaaagg aaagaatcca gaaagggatg gctaacctaa ggtaataata 4260 ttaataattt acattaactg cctatcaaga agaggtaaga atccagaaat ggaggactaa 4320 cctaaggcaa caatacaatt actttacacw aactgcctat caagaaaagg aaagaatcca 4380 gaaatggagg gctaacctaa ggcaacaata caattacttt acactaactg cctatcaaga 4440 aaaggaaaga atccagaaat ggagggctaa cctaaggcaa caatacaatt actttacact 4500 aactgcctat caagaaaagg aaagaatcca gaaatggagg gctaacctta ggccacatga 4560 cagtttacag atggagtgca aagtcctatg gcttaccttc tcgttggtgc acttcgttaa 4620 ataggttaga caagtaaaaa atgtctaatt tatgaacaga atagccaagg caatattgtt 4680 gttagttttg ttttatatgg gtaattgaga acaatgcccg gtaagtaata ataaagtact 4740 gatgccgctt tgaactcccc ccctcccccc ccctcgctag cccggggaag gggcttacaa 4800 aaaaaatatt gcaaactatt ttggcaactt gaaaaaaaac aggcaacacc cacaaaaaat 4860 cctgcatccg cccctg 4876 // ID Gypsy-35_DWil-LTR repbase; DNA; INV; 155 BP. XX AC scaffold_181096; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_DWil_; KW Gypsy-35_DWil-I; Gypsy-35_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-155 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181096; Positions 547735 547581. XX SQ Sequence 155 BP; 68 A; 30 C; 16 G; 41 T; 0 other; tgcgtacaga cgacgaaaag acgaaagaca ttaaacaata attaattata tctactatta 60 actaaaaacc aatacttatc ttaacattaa ataaaactat tataaaacta taaccgggac 120 tgtgtttatt cgtaagacaa taccccagtc ccaca 155 // ID Rehavkus-1N1_HR repbase; DNA; INV; 4361 BP. XX AC . XX DT 31-MAR-2008 (Rel. 13.03, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed Rehavkus-1N1_HR DNA transposon - a DE consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW Rehavkus group; Rehavkus-1N1_HR; Rehavkus-1_HR. XX NM Rehavkus-1N1_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-4361 RA Kapitonov V.V. and Jurka J.; RT "Rehavkus DNA transposons from the leech genome."; RL Repbase Reports 8(3), 376-376 (2008). XX DR [1] (Consensus) XX CC Rehavkus-1N1_HR is a nonautonomous family derived from the CC Rehavkus-1_HR autonomous transposon. CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX SQ Sequence 4361 BP; 1463 A; 628 C; 697 G; 1566 T; 7 other; acgaaggtaa tgttcgcgac tgtttcggcg acggtgaagc atatctaaca ggcggaacca 60 atattgtcaa attggctgaa cagtttgaac taatatattt tcgtttaata ttacacggaa 120 taattaaaaa ttgttacata aacatctgcg ccccgttttt attttgaaac gatacgattt 180 tggctacggt gaagcatact taacaggcgg acttagtcat gtcaaaacgg ctaaacagtt 240 tgatctaata aattttcgtt taataaaaat acacggaatt attaaaaatt aactttgttg 300 cataagaatc tgcgtcccgt tattattttg aaacgataca taacgatacg ataacgaggt 360 attgaagaat atcatcatgt gcacgagtat gtgcacaagt tgtatgattt ttgagttttt 420 gaaaattgta tttgaaccaa gacatcgcgc aataacaatg acgcttggta catttacagt 480 ctgccctaag ttgttttccc gccaactccg ctttacaaaa tgtactgttt taaaacagaa 540 aaaccccgtt actgcgaaac agcaccaggc gaccttaacc attagaccat cacgccgccc 600 agtcattccg cccagaaaac agtcattcaa ttaaaaaatt aaacaaaagt tacctctctt 660 tttgctaata gttttttatg caattatatc gattcgacac attgtaataa cgatacgaga 720 caaaaatgtt tttttcgtat tttttatttg atacgagact ttttggagaa tgtctcgttt 780 taataacgat acgatatttt tagagtcttt attaaaccaa agtcattcaa ttggaaaaaa 840 ttaaataagt ttcaattttt tataacactt tattttttta acaatttcca cacattgaaa 900 taatggtacg agacgaaaat tgtgtcttgt attttttatt cattatacat gtcttgttta 960 aataacgata ggagatttct ggtattgtta ttaaaggata cgagattttt cgtatcgttt 1020 tataacgata cgagatttct tttatcgtta ttattattaa atctctgtcg attgataacg 1080 aaaataacgg ttcgagacgc aggtgcatag tataatagat tctatatatt attttgttga 1140 cattataata caatttaaaa attgtacagt atattatcga atattttaaa attgtgttat 1200 acttacttta atttataaat taattttaat gaactcggtg aattaaacag gaactgcaaa 1260 gaagtgttgg acttaattat ggaagttttt ggtccttgtt attatctcgc aaacaaccta 1320 agtgcaccac agttttgaag tttcctgcat ctgatgtgtt acaacgtcat tcctgttctt 1380 gcttgtagta tagtttataa tttgattaac acaaaatggc aataccagaa tagaataccg 1440 tgattgtact ttttggattt ttttaaattt tagatgtttt ttcagattat gctttgaaat 1500 ggaataaaat taaaaaaaac ttttaaaatt atgctgcatg tttaaattct aatagaactt 1560 tctaaagaaa tgttaatgtt ttaattttcg tgttttaatt cgcggcattt ttcctttttt 1620 taaacgagtt atttgctaca aatttgtgta ccagatccca gtttttttgt tggcttctct 1680 gaaaagaatt acatttatac taatataaat tgttttaatt tcacacgtag tggtagttga 1740 tgactcaact tcattcattg ttttttggcc caaggtgaca taaaaattac atcacaaaac 1800 ggaccggatt cgcgtacatc cagatgttta tgtttcggta gtaaataccg tcagtgaaac 1860 aggacgttgc tcgtcaatag ttttattttt atgattctgc gtcaaatcaa aattagtcaa 1920 atggaggcat tagtccattg ttaaaagctc agcgtaatcg acaaattttt actgttaatt 1980 tggaacaaga aatagcgaaa ggtataggca gagaaaagtt actcaaggtt tttttataac 2040 taaaaattgt acgaagccgc aaaaataaat gttacgataa ctatatctgt tagcaatatt 2100 gatacctgta gcgtgcaatt accaattcat acacataaca ttatgatttt cagcacacac 2160 cacacacaca tatatatagc tatcataatc cgtgggcwag taatacttgg tamatatata 2220 tatatatata tatttattta ttcaatcgct tgggtttgca cttttttggt gctatacgtt 2280 tttcttttat tattataacg tattattata tttttttgtg tgttttcata catttaaatt 2340 aaagctaaaa cttatatttt ataaatatgc tatattaatt tgtcgcaaaa acactacagt 2400 ttattatatt ttacctcgta tatatagtta aaatttatac gcttgtcaat ctgtactatt 2460 cacaattata gtcatctaca tggtcatcac aaactttttt tcaatttaaa atgtttaaat 2520 ttcacttttt gctgcaattg cattagtatt tgttaatttt tagctaaaat aaaaaacata 2580 ttttgttttt ttttgtaaat ttaaatgttc acgcgttaaa gaactatagt tcaattactg 2640 tgcaatgcat attattcatt aaacgaaata acatttttat taaattacag tctagtttga 2700 atgaacaggg ttggtttggc caagcttgca ggagcctaaa gcagagaaat aacttcacca 2760 atcgaaagcg gtaaatgcac tgtcgaaaat gtaaaaaacc agtgataaca ataaaaaatt 2820 gaaattcaca gattgcatga cttattgcgg aagaataaat gctttttagg gttaactgaa 2880 aatggtccca aaggcaacga tgaatcttta caaaacctac agagcataat aacatgcaac 2940 aacaatgcct tagccaagtg gttggttgat aacaaaacaa tcgtatcctt aagatactgc 3000 aagataatat aactgacata caatgtttgt tgcagcaagt agataacact gatgtcatca 3060 atgaagcaca acggcaagcc actcagacat cacatgatga aaatgttgca attgttaaaa 3120 acaattcgcc ggcaggaggt attgatatat catttataat gtattataat ttgcatgtgc 3180 tacatcaatt tatttgtaat aaacattgcc tcaaatctaa tattcccttt ttgcatgatg 3240 gtaatttttc gtcatctatt tctaattccc aaagggtact agaactgaac gaagaaagaa 3300 aagaggttag ttgcatgatg taaaccaact gcacggtgca tcttttaagt agtgcagttg 3360 aagaagaggt gtaccaaggt taccgatgac gcaaatttag aaatgtatgg ataacgcgtt 3420 caaaccaata tttggggatg tgcacgaaac gggagaaagt tttgttatat atgrtatttt 3480 tgggaataaa attgtctatt agaatatgca aatatttttc agatgcacac ataaatggat 3540 ttttggcgat tttttaactt tctcgttacc gatgacgcaa atttggaaat gtatggttaa 3600 cgcgttcaat craatttttg agaatgtgca caaaacggga gaaagttttg ttatatatgg 3660 tatttttggg aataaaattg tctattagaa tatgcaaata ttttccagat gcacacataa 3720 atggattttt ggcgattttt taactttctc gttaccgatg acgcaaattt ggaaatgtat 3780 ggttaacgcg ttcaatcgaa tttttgagaa tgtgcacaaa acgggagaaa gttttgttat 3840 atatggtatt tttgggaata aaattgtcta ttagaanatg caaatatttt ccagatgcac 3900 acataaatgg atttttggcg attttttaac tttctcgtta ccgatgacgc aaatttggaa 3960 atgtakggtt aacgcgttca atcgaatttt tgagaatgtg cacaaaacgg gagaaagttt 4020 tgttatatat ggtatttttg gaaataaaat tgtctattag aagatgcaaa tatttttcag 4080 atgcacacat aaatarattt ttggcgattt tttaactttc tccttatcgt tggtatatta 4140 atatatagcg attttatgat gaagtttttt ttattgtttt atgcaatttt taatctgaat 4200 aaagttaaaa tgttataaat aacaaataaa aaacaataat aaatcttttt tcatttcatt 4260 aaacctttaa atatttatta tttaatcggt tttttcataa tccgcctttt acatatgctt 4320 aatcgtcgct gagttaaaac cgtcgcgaac attaccttcg t 4361 // ID Gypsy-20-LTR_NVi repbase; DNA; INV; 403 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-20-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-403 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 778-778 (2009). XX DR [1] (Consensus) XX SQ Sequence 403 BP; 138 A; 82 C; 71 G; 109 T; 3 other; tgttacgtcc tgcgacgtat tgcagttaag caatattgcg aagccgaaag cwcaaaattg 60 atgagtcaga ataataagta gatcagcgaa gttaatgagt ataccctgta tacgccgcgc 120 gatcagttgt aattatacac tcaatctgga ctctttgtac gacgctccta tccgtcgcta 180 ccamagagct accagtactt aagtcgcgat atcgatctaa cattataaag aactttatct 240 traataaatc gtgaaaatca gtcaattaca agtgtttttt attcgaatac acccatcctg 300 tataagcctg actacagcac agcagcagcg attcagccca agcagttaaa cgcaagtaag 360 taaaaataaa taattaataa tagatcggat ttgtgactta aca 403 // ID SMAR4 repbase; DNA; INV; 1295 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR4. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1295 RA Jurka J.; RT "SMAR4: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 993-993 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 165..1196 FT /product="SMAR4_1p" FT /translation="MEAVEYRSIIKFLVFKKKTGKEIFKELKDTYGKDAPS FT YSTVKYWVREFQGGRKSVFDEERSGRPVEIPEKITEKLVEIVQEERRISVK FT SLSKRLNISDGKVHTIMKGLGIRKLCSRFVPVFLTSEMMERRLQCCSNNLQ FT IYSEYGRNFLSNIVTEDETPLSLYIPESKRSSSEWKLSAEKPTKKLRSGTT FT HRKSMMLSVFWDSLGIIMLDFAATGVKLNAQYYCHLLEIVRTKRRKSRNVP FT IWYLHDNAPIHTAAASASTMDNVGFSVLSHPPYSPDLAPSDYYLFRHLKKH FT LAGKIFENNDELKEAVEDFFDNWGPDFFENAFSELVIRWGKCVGNFGGYIE FT K" XX SQ Sequence 1295 BP; 428 A; 221 C; 262 G; 384 T; 0 other; tacgtggttg gttcattaag tactcggcct gacatagaaa tagtcgtcta aaattaacga 60 atttttgact atatatagct acataagaat aggtaaatat aaagatcttg ttttcatgta 120 ttaattaatt tataattatt ttcagcaatt ttttgaaaaa aagcatggaa gctgttgaat 180 acagaagtat aataaaattt ttagttttta agaagaaaac cggcaaagaa attttcaagg 240 aattaaagga tacatatggt aaggatgccc catcatatag cactgttaaa tattgggtgc 300 gcgaattcca aggtggccga aagtcagttt ttgatgagga aaggtccggt cgaccagttg 360 aaattcctga aaaaattaca gaaaaattag tagaaatcgt acaggaagaa cgcagaatta 420 gtgtaaagtc attatccaag aggctcaata ttagtgatgg taaagttcac actattatga 480 aaggtcttgg gattcgaaag ttgtgttcca gatttgtacc agtgttccta acatctgaaa 540 tgatggaaag acgtttgcag tgctgcagta acaatttaca gatttatagt gaatatggac 600 gcaatttcct atctaacatt gttactgaag atgaaacccc attgtctctt tacatcccgg 660 agtcaaagag atcatccagt gagtggaagt tatccgccga aaagcctacg aagaaactca 720 gaagtggtac tacccatcga aaatcaatga tgctgtcagt tttttgggat tctctcggga 780 taatcatgtt ggatttcgcc gccacgggtg tcaaattgaa cgcacaatat tactgccact 840 tattagaaat tgtacggacc aaaagacgaa aatccagaaa tgttccaata tggtatctcc 900 atgataatgc ccctattcac actgcggctg cttctgcatc gacaatggac aatgttggat 960 ttagtgtact ttcacatccc ccatacagcc cagatctggc tccaagtgac tattaccttt 1020 tccgccacct taagaagcac ttagctggaa aaatatttga aaataacgat gaactgaagg 1080 aggcagtgga agatttcttc gacaattggg gcccagattt cttcgaaaat gcattttcag 1140 aacttgtgat ccgctgggga aaatgtgtcg gtaatttcgg cggatatatc gaaaaatgat 1200 gcctatgtta catgaatttt atgtattatg aatgaaaatt tagtgttctc aattatttct 1260 atgtcaggcc gagtacttaa tgaaccaacc acgta 1295 // ID I-7_AAe repbase; DNA; INV; 6293 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6293 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1361-1361 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 698..2278 FT /product="I-7_AAe_1p" FT /translation="MDVTDDGGGPKRFDISDDDVESVESPSFELLSPTIPG FT KLQESMDTSVEDNAVLPSTSGMNNTGLKMHSVVPQAISLPHSQTNGSSSAS FT STATPRLKAYPPGSKGPFLVFFRPKGKPLNKLQIQKDLAKSFRDIVEVSSP FT NRNKLRVTVSDREQANRIAAYELFLREYHVYLPSQEIECSGVVTEPFLTCE FT DIRSGTGGFKNRAVPPVQIIDVKQMNHVSSDGTKKPSNSFRVTFSGSALPD FT VLVIGLLRLPVRLYKPTVMHCEKCQQLGHTATYCCNKPRCSKCGEQHVEGP FT CPSEPKCTCCGQAPHELVACPKFIEREKHTIRSLQQRSKRSYAEILKKIAP FT TNDPPAQSSNSNNLFTSLPIDDLGSGTEDGEEYTVINTGTKRKRAIAKRLK FT QRQHDPQNVSVGQVLQPSLVKSRRDRNSTNAVPPGFKFSVGDFPSLPGTSK FT TPDAPIFCSENQQSCHQESRQEQVDASGKMTLSGLVDIIFQMFEVSPAIRN FT LINMILPLVKPLLKQLASNWPILDSFISFDG" FT CDS 2274..5948 FT /product="I-7_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANLTNEVGDMIEVLQWNCRSILKNIDEFKFLVHSTR FT CDVFALCETWLTSDKDVPFHDFNIIRLDRGDGYGGVLLGINKLHSFYRIDF FT PPMVGTEVVACHVTIRGKSCSIASVYIPPNARVSRRDLSAICCAMPAPWLI FT LGDFNSHGTAWGSPRDDNRASLIYDLCDYFNLTILNSGEATRLKPPDPPSM FT LDLSICSSSLSLDCTWKVIQDPHGSDHLPIRISVTNNSNPIRQINRPYDLT FT KHIDWEKYAEVVSDGEQLVDVLPPMEECQFLSGLIIESALQAQRRPFPGSA FT VRSRPPTLWWDDECTKVYRDKSAAFKEFRKRGTLENYDRFTKLERKFKSLV FT KAKKRGYWRNFVNGLSRETSMRTLWTVGKRMRNASKVNEDRESSPRWIFDF FT AKKVCPDSVPTQQIIRDTQTNRNEMDAPFTMVEFSLALLSCNNSAPGMDGI FT KFSMLKNLPDVMKRRMLNLFNRFLEGNIVPDDWRQVRVIAIQKPGKPASDY FT NSYRPIAMLSCIRKLLEKMILFRLDKWVESNGLLSDTQFGFRRGKGTSDCL FT ALLSTEIQLAYAQKEQMGSVFLDIKGAFDSVNVDVLSDKLHECGLSPILNN FT YLHNLLSEKRMSFSHGTSTTSRISYMGLPQGSCLSPLLYNFYVRDIDDCLV FT ENCSLRQLADDAVVSVTGPGADDLQRPLQDTLDNLSTWAVKLGIEFSPEKT FT ELVVFSRKRDPVEIKLQFMGKELTQGLSHMYLGVWFDSKCTWGKHIKYLHQ FT KCQQRINFMRTLTGTWWGAHPEDLIKLYRTTILSVLEYGSFCFQSAAKTHI FT LKLQRVQYRCIRIALGCMNSTHTMSLEVLAGVLPLSDRFAELSLRFLIRCE FT VLNPLVIENFEKLIERNPQTKFMTLYYRYMTLEINPSSNVSHGCCFPVFSS FT STVAFDLTMKREIHGIPDPLRSEYIPLIFTDKFGQVSSDRTFYTDGSKIND FT STGFGVFNEFHSAAHKLQNPCSVYVAELAAIHYALERIASLPSDQYFIFTD FT SLSSIEAIRSMRPVKHSSYFLREIRSILSALSNHXITLVWVPSHCSIPGNE FT KADSLAKVGAMEGDIYERRITFNEFFTIARQQAMISWQQKWNEGDMGRWLH FT SILPQVSNRPWFKGLDMSRDFIKVMCRLMSNHYSLGSHFYRIGLADSNRCS FT CGAGYQDINHIVWYCPEYEIARTNLCASLRAQGKPDKEDIRDVLGRLDFDY FT MFLLFKFLKKIDVFV" XX SQ Sequence 6293 BP; 1721 A; 1476 C; 1340 G; 1753 T; 3 other; tcattccctg tcgagctttc ggacgagtca gacgcattgt tccgtttatc atcgcgcgat 60 aatctttcma caagtggagt ttttgttttt cattcctcac gctgaggttt ttattttata 120 tccttgccat cctgcttgtg caataattcg caaggtgatt cctgccgacg acgggaagcg 180 agctggcatt ggcgatgcca ttgcagcatc gttcatcttt aaccaaagac gatcatcgaa 240 gtccatcatc gtcatcatct ggtcgccaat aacaacaacg acgacaacgg cgacatagac 300 gagtacggct gctgtattgg aaattggcgc agcaacaaaa aaactaccta caccccgaac 360 ggatagataa gtagctttta ctctttattg cttttcgcct tattgttttc ccatccaccg 420 tctatcacgt gatagttgtt ttcgggtaca tatcagtggt tgccaacttg cggcccacgg 480 cccacgttag taagtttaaa aagttttttt tttttttttt ttattattat tattaatatt 540 tggattgatt catcagttat cattatcatt actactagta tccttcttct tttttttttt 600 ttttgttttg ttttgtaaat attaggttag ttgttgtgat agttttacta ttgtttttgg 660 tttcggctgt ccactagggt ggtgctaagc catctcaatg gacgtaaccg atgacggcgg 720 ggggccgaaa aggttcgata tctctgatga tgatgttgaa tcagtagaat ctccctcatt 780 tgaactattg agtccaacca tccctggaaa gctccaagaa tcgatggata cgtccgtcga 840 ggataatgca gtgcttccct ccacctcagg gatgaacaat actggtctga aaatgcattc 900 tgtggttccc caagcaataa gtcttcctca ttcccagaca aatggttctt ctagcgcttc 960 ttccactgcg accccccgcc ttaaagctta tccgcccgga tctaagggtc cctttctggt 1020 tttttttcgg cccaaaggca aaccgttaaa caagcttcaa attcagaagg atctggcaaa 1080 gtcgtttcga gacattgtcg aggtgtcttc gcccaaccgt aataaattgc gggtcaccgt 1140 cagtgaccgc gaacaagcaa atcggattgc tgcctacgag ctttttctga gggagtacca 1200 tgtctacttg cctagtcaag agatcgagtg ctcaggagta gttaccgaac cgtttttgac 1260 atgtgaggac atcagatctg gtactggagg ttttaaaaat cgtgccgttc ctccagtaca 1320 gataattgat gtcaaacaaa tgaaccacgt gtcatctgat ggcaccaaaa aaccttcaaa 1380 ttcgtttcga gtgacttttt ctgggtcagc cctcccggac gtcctcgtga ttgggcttct 1440 tcgtttacct gtccgtctct ataaaccgac ggtcatgcac tgcgaaaaat gccaacaatt 1500 agggcacacc gcaacgtact gctgtaacaa acctcgttgc agcaagtgcg gagaacagca 1560 tgtggagggc ccttgtccaa gcgaaccaaa atgtacmtgc tgtggacaag ctccacatga 1620 actagtcgca tgcccgaagt ttatagagcg ggagaaacac acaattcgat ccttgcaaca 1680 acgatcgaag cgatcctacg cagaaattct gaaaaagatc gctccgacca atgatccacc 1740 agcccaatcg agtaacagca ataacctctt cacttccctg cctattgatg atctgggctc 1800 tggcactgag gatggagaag agtacactgt cattaacaca ggaaccaaga ggaagcgagc 1860 cattgcaaag cggctcaaac aacgccagca tgatcctcaa aacgtttctg ttggacaggt 1920 tctccaacca tctctggtaa aatcaagaag ggacagaaac tctactaacg cagtacctcc 1980 gggttttaaa ttctcagttg gagactttcc gtcacttcca ggaacatcca aaaccccgga 2040 tgcccccatt ttttgctcag aaaaccaaca atcttgtcac caggaaagtc gacaagaaca 2100 ggtcgacgct tccggaaaaa tgactctttc tgggttagtg gatatcatct tccaaatgtt 2160 cgaagtctcg cccgcaataa ggaaccttat caatatgatt cttcctttgg taaaacctct 2220 tctgaagcaa ctggcttcaa attggccaat tcttgactcg ttcatatctt tcgatggcta 2280 atttaaccaa cgaggtcggg gatatgatcg aagtgctaca gtggaattgt agaagcattt 2340 taaaaaatat agacgagttc aaatttttag ttcacagtac gcgctgtgac gtatttgccc 2400 tttgtgaaac atggctaact tccgataaag atgtcccttt ccacgatttt aatattattc 2460 gtctagatcg aggggatggg tatggaggag tgcttttagg gatcaataaa ctccactcct 2520 tttatagaat tgatttcccc ccgatggtag gcactgaagt agttgcttgt catgttacta 2580 tacgaggtaa aagttgcagc atagccagcg tgtacatacc acctaatgcc agagtatctc 2640 gcagagatct ttcggccata tgctgcgcta tgcctgcacc atggttgatc ctaggggatt 2700 tcaattctca cggtacagcc tgggggtcac cgagggacga caaccgcgct tccttgatat 2760 atgacctttg tgactacttc aacttgacaa ttttgaactc gggggaagca acacgattaa 2820 aacctccaga tcctccaagc atgttagacc tctcaatctg ttcgagttca ctatcattgg 2880 attgcacgtg gaaagtaatt caagatcccc atggtagtga tcacttgcct atcagaattt 2940 cagttaccaa taattcgaat ccaattcgtc agataaaccg cccgtatgat ctcaccaagc 3000 acattgactg ggaaaaatac gccgaggtgg tctccgatgg cgaacagtta gtggatgtcc 3060 ttcctccgat ggaagaatgt caatttctct ccgggttaat tattgagagt gctcttcaag 3120 cccagcgtcg cccttttcca ggatctgcag tacgaagccg gccacccact ctgtggtggg 3180 acgacgagtg tactaaggtc tatcgcgata aatccgccgc gttcaaagaa tttcgcaaac 3240 gcggtacact agaaaattac gaccggttta ccaagctcga gcgaaaattt aaaagccttg 3300 taaaggcgaa aaaacgcgga tattggcgta attttgttaa cgggctatcg cgcgaaacgt 3360 caatgagaac actttggacc gtcgggaaaa gaatgcggaa cgcttcaaaa gttaatgagg 3420 atcgcgaaag ctctcctcgg tggatcttcg acttcgccaa gaaggtctgc ccggattccg 3480 ttccaaccca acaaatcatt cgcgacacgc aaaccaacag aaacgagatg gatgcaccat 3540 ttacgatggt tgaattttca cttgctctcc tctcatgtaa caactctgcc ccaggcatgg 3600 atggaattaa attcagcatg cttaaaaacc tcccagacgt catgaagagg cgcatgctga 3660 atcttttcaa tcgattcttg gagggcaaca ttgttccgga tgactggaga caagtgagag 3720 tgatagccat ccaaaaaccc ggtaaacccg cgtcggatta taactcgtat cgacctatcg 3780 cgatgttatc atgcattcga aagctattag agaaaatgat tctctttcgg cttgacaaat 3840 gggttgaatc gaatggcctt ctatcagata cgcaatttgg tttccgcaga ggtaagggaa 3900 cgagcgactg tctagcgttg ctttctacag aaatccaact ggcctatgct caaaaggaac 3960 aaatgggttc agtattcttg gacattaaag gggcttttga ttcagtaaat gtagacgtcc 4020 tttcagacaa actccacgag tgtggtcttt ccccaatttt aaataactac ttgcataact 4080 tgttgtctga gaaacgtatg agcttttctc acggaacctc aacaacttca cgaattagtt 4140 acatgggtct cccccagggc tcatgcctaa gcccccttct ttacaatttc tatgtcagag 4200 acatcgatga ctgtctcgtg gaaaattgct cgttaagaca gcttgcagat gatgccgttg 4260 tttccgtaac aggaccaggg gcggatgatc tgcaaagacc actgcaagat actttagaca 4320 atttgtctac ttgggctgta aagctgggta tcgaattctc tccggagaaa actgagttag 4380 ttgtcttttc taggaagcgt gacccagtag agatcaagct tcaattcatg ggtaaggaac 4440 tcactcaagg cctttcgcac atgtatctag gggtctggtt cgactccaaa tgcacctggg 4500 gaaagcacat caagtatctg catcagaagt gccagcaacg aatcaatttc atgcgtacac 4560 tcacaggaac atggtgggga gctcatccgg aagatctgat aaagctatat cgtacaacaa 4620 tactctcggt cctcgaatac ggtagctttt gcttccaatc cgccgcgaaa actcacatct 4680 tgaagctgca gcgtgttcag tatcgctgta ttcgaatcgc tttaggatgc atgaactcaa 4740 ctcatacaat gagtttagaa gtattagcag gagttctccc attatcagat cgtttcgcgg 4800 aattgtcgct ccggttcctc atccgctgtg aagtgttaaa cccattggta attgaaaact 4860 ttgaaaagct aatcgaacga aatcctcaaa caaaattcat gacactgtac taccggtaca 4920 tgactctgga gattaatcct tcctcgaatg tatcccacgg ttgttgcttc ccagtattct 4980 ccagttcaac tgtagctttt gatctgacca tgaagcgaga aatccatgga attccagatc 5040 cactccgctc ggagtatata ccattaattt tcacagacaa gttcggccaa gtcagcagcg 5100 acagaacgtt ttacacagac gggtcaaaaa taaatgattc cactggtttt ggtgtattta 5160 atgaatttca tagcgccgcc cataaacttc agaatccttg ttccgtatat gttgctgaat 5220 tagcggcaat acactacgca ttagagcgaa ttgcctctct tccctctgat caatacttca 5280 ttttcacgga tagtcttagc tccatcgagg ctattcgttc aatgaggccg gtaaagcact 5340 cgtcgtattt cttacgggaa atacgatcca ttttgagtgc tctatcgaat cacaamatca 5400 ccttggtttg ggtcccttct cattgctcga ttccgggcaa tgagaaagcg gactcactcg 5460 ccaaggtggg cgctatggaa ggtgatattt acgaacggcg tattaccttc aatgaatttt 5520 tcacaattgc tcgtcagcaa gccatgatca gttggcaaca aaaatggaat gaaggggata 5580 tgggcaggtg gttacattct attctcccac aggtatcgaa tagaccgtgg tttaaggggt 5640 tggacatgag ccgagatttt atcaaggtaa tgtgtcgtct gatgtccaat cactactccc 5700 tgggctcaca cttttatcga atagggctcg cagacagcaa tcgttgtagt tgcggcgcag 5760 gctaccaaga catcaatcat attgtttggt actgtcccga atacgaaatt gccagaacca 5820 atttatgtgc atccctcagg gcccaaggaa aaccagataa ggaagacatt agagatgtgt 5880 tgggtaggtt agatttcgat tacatgttcc tcctcttcaa atttttgaag aaaatcgatg 5940 tttttgtttg attacccatg tctgtcgttc cgcttgtccg cctcgttccc cttgaaacca 6000 ttttgttgta tcgttacagg tcgtttaagt ccgccccatg attgacgaac agcatcatac 6060 cgccactttg ctgctatctg tgatcaatca aacccttaaa tcccatcctt tccccaaaga 6120 taattgtatt ccctaacctc gaccaaaccg cgagttttac ggttccccaa aactaacata 6180 gatgtttaag aagcaaacaa gattttgtaa accaaatcaa atgaattcgg ctccgttatg 6240 cctgtcggcg catgagccta ccaaataaac gaataagtaa aaaaaaaaaa aaa 6293 // ID Chapaev-15_HM repbase; DNA; INV; 3013 BP. XX AC . XX DT 27-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3013 RA Jurka J.; RT "Chapaev transposons from the hydra genome."; RL Repbase Reports 9(2), 361-361 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 3013 BP; 1000 A; 540 C; 513 G; 958 T; 2 other; aatatgcctg tgaaagctta tgactcctgg attccaaaaa tataaaaaat caaatcatcc 60 tattacgtcc gagacagaaa atcattttga aaatttttat tttgcacata ttcatgaaat 120 gacagcattc tcgaaatttt acatttttac agcatatctt aacattcaaa aagctctttg 180 aaaataataa aacctaccta acataataaa ggaacaaaaa atatgaaaat gtgaaagatt 240 ttatgttata aaattggtag taatccaaat aggtaaaaaa tcacttgggt aaaatactgc 300 ttttctgaaa gttttttcta ataagttaac tctagaataa tttcaataat agtttttttt 360 aatttgcact ttttgaataa tattggtaat tttaaattat ttttaagaat ttaccaaaaa 420 attccgggaa aaggcagttt actacaaatc aaagcttttg aaggtagatt ttttaacctt 480 ttctcctgaa ataaattgaa aatgcctaat catgcaagaa gtcatgatga gaacagaaaa 540 gttgtttgta tcatgtgtct tagaaaagga aattgtcaat tgacaccatt tttggtcagc 600 aaaattcaaa acaactacaa aaagcagttt aactttgatg atttaagaat tccttaaggt 660 ctttgtgatc cgtgtagacc ggcccttgga aagtgtgatc aagggcagaa ggttgttttg 720 cctaagctgt ttcaatttga aaccattgat ataaaagttc acacacgtgg tcaagcttgt 780 gaatgcttga tttgtgacat agggcractt aaactgacag agaaacaccc tcttgaaaga 840 acaaaaactg gagagaaaac atcagacaga agatgttcta catgtctctc tcttattggt 900 caaggtttac cccatcattg ctcttcatca cagttccatg aaaacttaag ggagcttgca 960 aaatctgatg gaattgtggc agaacaaatt gccagctcaa caattgtcag cagagaatct 1020 tctcctcgtg gaactattag acttgtgcag ccaaaaggag gaggaccgct cccaatcaca 1080 caaggtttgt atataattct tcattaaggt cgaccatcac atttaagtaa ctgaaatatt 1140 atattttact ctttaacagg tccttcttgc tctaagcagt tgttcgaaga ttcagctgta 1200 gtcaccgcaa aggatcttgc tcaagttcaa ctaaatacag gattgtcaaa cagtggaata 1260 aaaaaactga cttccaccat taggcatgtt tcgtctgcca aagttctgga accaaatata 1320 ttacacaagt tccaggatct cggaaaaagt ctagaagaac actttacaca aaccccaaca 1380 acattcatta gttcagatca aaagaagtct caaagtgttg ttgtccattg caaaaatctc 1440 caagcactca ctgaagaagt tctactttca aggacaagta ctttcaaaac acatcgttaa 1500 acttggaata gatggtggcg gaggttttct taaagtctgc cttggagtta tacaaaatga 1560 agcagatagt gaagctgtgt ctccgccaaa caaaagattt ctgacatctt caacgtccaa 1620 ggattctgga gtaaaacagc aacttctggt tgcagttgct gaaggagttc aagagaatta 1680 ctccaatgta aagctcattc tctctctcat ttcaatcagt gaaatcaact ttgttgtctc 1740 gtgtgatatg aagttggcaa atataatttg tggattgcaa tcacatgctt cagctcatcc 1800 atgcacatgg tgcgctgtgg aaagctctaa ccttgcaaac cggggtactc tgaggacttt 1860 tagatctttg aagcaagcac acgaaacata cgcagcagct ggatccgacg tcgagaaagc 1920 aaaattgttt ggaaaygtaa ttcatgaccc tattctctca cttgataatg atgcgcttgt 1980 gcttgatctc attccaccaa tggagcttca tctactgctt ggggttgtta accatctgtt 2040 caagatttta acagaccttt ggcccgaatc agctgaatgg ctacaagctt tgcacattca 2100 gccccaacct tttcatggtg gtcaatttgc tgggaatgat tgccacaagc tcctcaaaaa 2160 tttcaacctt ctccaaacac tggctgaaaa tgacagtgcc tttcaagttt tcagcatcat 2220 tgacagcttt cgaaagttta gtcccattgt ttctgcaagc tttggattgc acttatgcag 2280 aaaaaattga ttcattcaga gattcatatt tggcgttgcc aagaggaagc gtcactccaa 2340 aagtacatgc agtgtttttc cacgttaagg acttcatcca aaaaacaagg tctcctcttg 2400 gaatctacag tgagcaagca acagaatcta ttcatcatcg ttttcagaac cattggcaac 2460 gatacaaacg acctatcagc catccagatt acggaaaaag gctagaaatg tgcctagctg 2520 attttaacag caaaagtctg tgaaaagtta ggaatttgta aagaattttt tttgtcaaga 2580 atgttgttgc tttaaatgtg gaatttattt taatatttct tttgtattta cctacatgat 2640 cttttaccta tttggattac taccaaacta ttaatgaaat tattctaaag ttaacttatt 2700 agaaaaaact ttcagaaaag caatatttta cccaagtgat ctttaaccta tttggattac 2760 taccaatttt ataacataaa atctttcaca ttttcatatt ttttgtttct ttattatgtt 2820 aagtaggttt tattattttc aaagagcttt ttgaatgtta agatatgctg taaaaatgta 2880 aaatttcgag aatgctgtca tttcatgaat atgtgcaaaa taaaaatttt caaaagtaat 2940 tttctttctc ggacataata ggatgatttg attttttata tttttggaat ccaggagctt 3000 tcacaggcat att 3013 // ID PIGLET1_EI repbase; DNA; INV; 1920 BP. XX AC Piglet-Ei1; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Piglet-Ei1 (PIGLET1_EI), a pogo-like DNA transposon from the DE single-celled eukaryotic reptilian parasite Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; Piglet-Ei1; KW pogo-like; PIGLET1_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-1920 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; Piglet-Ei1; Positions 1 1920. XX CC Piglet-Ei1 is a representative copy of a small family of CC pogo-like elements from the genome of E. invadens. The TIRs are CC 23-bp, have some similarities with pogo-like elements from other CC species and are flanked by TA TSD. This copy contains an ORF CC potentially encoding a 354-aa protein that is 51% similar to CC those encoded the pogo transposase from D. melanogaster. There CC are several closely related elements in the genome of E. CC invadens, E. moshkovskii and E. terripinae. XX SQ Sequence 1920 BP; 605 A; 306 C; 244 G; 765 T; 0 other; caataaaacc tcccaagagc gggcattttt agccaatcaa accttaaaac cgctcgatag 60 agaactgact gtttattgtt ttatgttata ataaatcggg attctttttt ttttggtaaa 120 catttttttg aataaccaga aaaacaaaaa aaataattac aaaaaatcaa aaataaaaaa 180 tttattaaat aaaatagttg tcaagttgac agttaacaaa attttttttc aaagttggtt 240 ttagcataaa ttacgaaatt ttaaattaat ttaaatctat ttccacgtaa acttttttct 300 gtcacactgt tccttgtacc attgtttagt ttcttttagt tcatcgtttt tactcaccat 360 cattttcaac tcttcgattc tcaacaaata tcttttggtt ttttaggttc ttcgaaattg 420 gaaagttcgt cgttctcttc gatcaaatct tcaataaaat aggttacatc tcccaaaaat 480 ccttacacct ttctctctaa aaaatcgtgc ttccagcagc caattattgt ttatgatgaa 540 acatgattgt tccagcagtc gacagcatat aatatagcat ctcccatatt cattttgagc 600 caaactgtgt tttcccaaga atgggcttca atgatttatt ctgcctttaa attaacaatt 660 tttctttgat atgcaacctt aaatgagtgt attatttttg cgtcacatgg ctggaatacc 720 gatgtcatgt tttcggtata aattccaacc ttatgtgtgc aaggtttagt tcatacacat 780 gcaatccttt ttcgttttgt cctttgtaat tataactggg ttcttcttta atagtagtaa 840 ttatgatatt cttatagctt ggggaattat ctactatgag aagaatgata atgttttctc 900 ttgtaaatgc ttcttcccat ttgaggagcc atcaacgaaa caattctatg tttatacaag 960 cattagtttg aaagtagtaa atcgtttaac gttaatgtaa tttttatcaa agaaccacat 1020 tggattacga tactttccaa caaccatagg attttctttt tctctattta gtgaacagca 1080 tatcataatt gtaattcttt ctttttcctc cttccctcca tggttatcag aatcacctga 1140 caatgtttgg aaaataacaa cttataaaaa agacctgttt catccacatt ttacatatta 1200 ctaaaagcat actatgtaat aattataacc agcttatcaa tttcatcctt ttgtttaaac 1260 aaataaactg aattactttt ttgcataatt tctttgattt aagttggttt cttgagcaaa 1320 acgatgaaat ccgaccattt caaagcatta tatcatcata tacattttgc aaagcagtat 1380 ttagaatcta agctcttgac aacattgtaa caaactagcg tattatatga cttcacagtt 1440 tctataattt tgagcacgtt ttcttctgag tctgaatgtt taacttttct gaaaactttg 1500 gcattccctt gtattgaaag ctcaattaaa ctttcctaga taccatactt taacatcctt 1560 agacacttgc gttttggaca ttattgtaaa ttatagttta ttatttcttt tagttttttt 1620 attgaaggtt tcggggaagc tattgtccac tctattattt gcttttttat caattgataa 1680 aaatgttttt gattttggta aagacatgaa taagtaataa gattatatat ttttttataa 1740 aatgaccttt gtgagccaaa ttaaacattg tttaaaatca tcttaaggtt aacacgtggt 1800 aatttttttc tatttgaatt atttttcttt tttcttattt tgaattattt tacattctaa 1860 aaaaaatgtt ttttttaaat tgattaaaaa taaaacgccc acttttggga ggtgttattg 1920 // ID RTE-15_BF repbase; DNA; INV; 3307 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-15_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-15_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3307 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3307 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1713-1713 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 290..3295 FT /product="RTE-15_BF_1p" FT /translation="KNDRRSCDPCITTRNRKSQNMTSFPNLENSSCGIVKE FT RLLRCKKTLLVSTFNTRTLNSASKLGETAALANEYSIDIVCIQEHRLCHDD FT VIIQHKDMGNGWKLLISSAEKGSNNATIRGVGLLLSPKAYKCLNSVESISP FT RIIVANFEGNPATTVISCYSPTNATDEEEVREFYSALADTVKEIPKHNVCI FT VGGDMNAKIGQPDSKGSSYHSETNRNGKYLLEFINECNMLNLSINYTKKKG FT KLWSFSYPNGERAQIDYLLINKKWRNSALNCEAYNSFSNIKSDHRIVTAKF FT RLSLRAPKRQSERGTRYDWGKLLNDAEVNVAYTNEAKRNFVELNDLDETKN FT ANSIYANVIRAHEEAADLHVPIKPKVRKRVPWAYDKNILDKREALRKATVE FT QKSEVEINKMKDELDKAYEQELESYIKQKTTVIQSAHEEHRGALAWATVNE FT ITSRKKTKQGMLKAKSQQERVNLWHDHFSKLLGQPPNIVNIPVVPVIDYIL FT PINTSDFTREELMDCIKTFNNGKTPGLDNIPIEVWKTEALIDPLLEICNRT FT FHQDRADVWVSSGILPLPKKGDLGYTTNYRGISLSATAAKIYNKMLLLRIR FT PHVEPLLRDNQNGFRPGRSTLSQILVLRRLIEGIEDQNLKAVITFIDFSKA FT FDSIHRQKLMEILRGYGIPDIIVEAINILYVDTKAKVLSPDGETEFFQITA FT GVLQGDTLAPFLFIVALDYAMRQATEQPELTGFVLKPRKSSRHPAELLTDT FT DFADDIALTSNIVLEAQSLLHKVENASKIVGLHANDTKTEYMVFNQPEGTL FT ISTKGQALACVKDFKYLGSWIRNSSKDVDVRIALAWGAMSKMSVIWKTDLS FT PTTKISFFRAAVESILLYGSECWTLTKALKRRLNGTYTRMLRTVLKISWRE FT KIKNKVLYKDIPPIAETIRQRRLRFAGHCWRKKDEVVHKLLLWNPNHGNRG FT RGRPRKSYTKQLAEDTGLDLQQLPAAMDDKEGWRKLVMWRLRASSPE" XX SQ Sequence 3307 BP; 1137 A; 650 C; 727 G; 793 T; 0 other; tccaccccgg acatcaagcc acccctgcta ctgtctgccg aggagtaaac ctcgttaata 60 aatcctgctt tatttcacgg gaagtggagg ttcaagcggc attcatcgac agactggtga 120 taaaatggcg ctggacctgt gactacgcag aggattggac ctgctcggcg acggtagttc 180 ctatgacaac catcaacaaa ctgaaggttt ttcacgcacc gaacatatcg ttcttgaact 240 ggtacaacaa tgggtaggca cacctgcccc agttagaggg gaactgtgaa aaaatgacag 300 gaggagttgc gacccctgca ttacaactcg caatagaaag agtcaaaaca tgactagctt 360 tcccaacctc gaaaattcgt catgcggtat tgtaaaggaa agactgctca gatgcaagaa 420 aaccttgcta gtatcaactt ttaacactag aaccctgaac tcagcttcca aattgggtga 480 aactgctgca ttggcaaacg agtatagcat tgatattgtg tgtatacagg agcacagatt 540 atgtcatgat gatgttatta ttcagcacaa ggatatgggt aatggctgga aattactcat 600 aagttctgca gaaaaagggt caaataatgc aacaattaga ggtgttggct tattacttag 660 cccgaaagct tataaatgtt taaactctgt tgaaagcatt agccctagga ttattgttgc 720 caattttgaa ggaaatccag ccactaccgt gatatcatgt tatagcccca ctaatgcgac 780 agatgaagaa gaggtcagag agttttactc ggctcttgct gacacagtaa aagaaattcc 840 taaacataat gtttgcattg tgggagggga catgaatgca aagataggtc aaccagattc 900 taaaggatct agttatcata gcgaaactaa tagaaatggc aaatacctcc tagaattcat 960 aaatgaatgc aacatgttaa acctcagcat taactacacc aagaaaaaag ggaaactttg 1020 gagtttttcc taccccaacg gtgagcgtgc ccagattgac taccttctga ttaacaaaaa 1080 atggaggaac agtgcactta actgcgaagc atataatagc ttttcaaata ttaaatctga 1140 ccataggata gtaacagcta aattcagact gagtttgaga gcacctaaac gacaatctga 1200 aaggggcaca agatacgact ggggtaaact actaaatgac gctgaagtta atgtagcata 1260 tacaaatgaa gccaagagga attttgttga gcttaacgac cttgatgaga ccaaaaatgc 1320 taactcaatc tatgcaaatg tcataagagc acacgaagaa gcagcagatt tacatgtccc 1380 aattaagcca aaagtaagga agcgtgtgcc ctgggcttat gataagaaca ttttggataa 1440 acgggaagct ctaaggaagg caactgtaga acaaaagagt gaagtcgaga taaacaaaat 1500 gaaggatgag cttgacaaag catatgagca ggaacttgaa tcttacataa agcaaaagac 1560 tacagtgatc caaagtgccc atgaagaaca tagaggagca ctggcatggg ccactgtaaa 1620 tgaaataacc tcgagaaaga aaaccaaaca agggatgtta aaggcaaaaa gccaacaaga 1680 gagagttaat ctatggcatg atcatttttc caaacttctc ggtcaacctc ccaacattgt 1740 aaatatacct gttgtaccag ttattgatta catattacca atcaacacaa gtgacttcac 1800 tagggaggaa cttatggact gtatcaagac ttttaacaat ggtaaaaccc ctggtcttga 1860 caatatacca atagaagtct ggaagactga agccctgatt gaccctctac tcgaaatttg 1920 caatagaaca tttcatcagg atagagcaga tgtctgggta agtagtggaa tacttccact 1980 accaaaaaag ggtgatcttg gctatacaac aaactatagg ggaattagtt tgtcagctac 2040 agcagctaaa atctacaaca aaatgctttt gctgaggatc agacctcatg tggaaccact 2100 acttagagac aaccaaaatg gctttaggcc tggtaggtca accttatccc aaatactagt 2160 attgagaagg cttattgaag gtattgagga tcaaaatctg aaagcagtta taacatttat 2220 cgacttcagt aaagctttcg attccattca ccgtcagaag ctaatggaaa ttttgagagg 2280 gtatggtata ccagacataa tcgttgaggc tataaatatt ctttatgtag atacaaaagc 2340 taaagtgctc tcaccagacg gtgaaacaga gtttttccaa ataaccgctg gtgtcctaca 2400 aggtgatacc ttagctccct tcctttttat agtggctctg gattatgcaa tgagacaggc 2460 cacagagcaa cctgagctca cgggttttgt acttaaacca agaaagagca gtagacaccc 2520 agctgagtta ctaactgaca cagactttgc agatgatata gcccttactt cgaacatagt 2580 actagaagca cagtcattgt tgcacaaggt tgaaaatgcc tccaaaattg ttggcctaca 2640 tgccaacgat acaaaaactg agtatatggt gttcaatcaa cctgaaggaa ccctgatctc 2700 aacaaaaggt caggcacttg cctgtgtgaa agacttcaag tacttgggtt catggataag 2760 aaactctagt aaagatgtgg atgtcagaat tgctctagct tggggagcca tgagtaaaat 2820 gtcagttata tggaaaaccg atttaagtcc aacaactaaa atcagtttct ttagagcagc 2880 agtagagtca attctcctct atgggtcaga atgttggact ctgacaaaag ccctaaagag 2940 aagactcaac ggcacatata cgagaatgct gagaactgtc ctaaagatct catggaggga 3000 aaaaattaaa aacaaagtcc tttacaagga tataccgccg atagccgaaa ctattcggca 3060 aaggagactg cgttttgcag gacactgctg gaggaaaaag gatgaagtag tgcacaaact 3120 gttactgtgg aatccgaacc acggaaacag gggcagggga agaccgcgta agtcatacac 3180 taagcagctt gctgaagaca ccggtctgga cctgcaacaa cttcccgcag cgatggatga 3240 caaagagggg tggaggaaac tggttatgtg gagactccga gcgagctcgc cggagtaagt 3300 aagtaag 3307 // ID Gypsy-199_AA-LTR repbase; DNA; INV; 197 BP. XX AC supercont1.68; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-199_AA_; KW Gypsy-199_AA-I; Gypsy-199_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.68; Positions 1932729 1932925. XX SQ Sequence 197 BP; 57 A; 28 C; 56 G; 56 T; 0 other; tgtagggttt tccccttgaa tgcagccctg ataaaatata gaagtgaact cctaaatagg 60 gccctgggtg gattgacagg gagacgggag aatgtgtatc gtgccggtga gaaacggatg 120 tgggaaatcg ttgtgcatat tttgaacagt gaaataaagt gataattgtt gcagatattg 180 agtgtttcat tcctaca 197 // ID BEL-609_AA-LTR repbase; DNA; INV; 702 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-609_AA_; KW Pao_Bel_Ele71; BEL-609_AA-I; BEL-609_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-702 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 702 BP; 199 A; 166 C; 190 G; 147 T; 0 other; tgccgcatca gaccatcaat ttcggtgagc tgcaaggaaa atttccgtat ctgaaggggc 60 ttcctgtgac aagttatgaa gacgctgtcc caggaatatt gattggtttg gacaatacta 120 tgctgaaaac caccctgaaa ctgcgcgagg gtagcggaga tcaaccagtt gcggcgaaaa 180 cacgtctggg ctggatgcta tacggaagat ctggagagtc agaccccact ctgttgcgac 240 gagtattgca cctgtgcaat acaccctcaa atgaagatct gcacgactta gttaaatcgt 300 ttttcaccat cgaaggcgct ggtgttgggt cgaacgaact tgtcgaatcc actagtgaaa 360 agcgggctcg cgaaatcatg gaagctacca cagttcgcac ggagtccgga aaatttcaaa 420 caggtctgtt atggaggtac gacctggtaa agtttccgga tagtaagcct atggcggaga 480 ggaggcttca atgcctcgaa aagcggcttt cgaaagatcc agccttgtat gagaaagtgc 540 gacagcaaat tgctgattac ctctccaaag gctacgccca caaagcgacg atgcaggagc 600 taatggagac cgacccaagc caaacctggt atctgccact gggggtcgta accaacccga 660 aaaaaccgga gaaagtccga gtggtgtggg atgcagccgc ca 702 // ID CR1-3_TCa repbase; DNA; INV; 4800 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.04, Created) DT 04-APR-2009 (Rel. 14.04, Last updated, Version 3) XX DE CR1-type retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW CR1-3_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4800 RA Jurka J.; RT "CR1-type retrotransposons from Tribolium castaneum."; RL Repbase Reports 9(4), 737-737 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS join(948..2114,2102..4798) FT /product="CR1-3_TCa_1p" FT /translation="MSVCYKCKSSVVNSIACLNCKKAYHPSCLRAFKNYSA FT IGQDLTSISACPCCDKLKSLQNPTSPTLNISTNTIFEDLGILKQETDLIKK FT TLDDLKTFITESPTTVCTFDKIDEYVRKSFDDLESKLISLLKDTLNREIEK FT LKNSIKNEVIAEISTNLNRTNDNIKTANSSDLTLKSTNSKNVSTQSVSKTN FT RYKVRIFADQKGRHCIDLLETYIPVSKYVIQSVVKPYSSLQELLKTENNNE FT QLGNNDFVVIYAGNHEAQKGKIPDENTLNELINHKFLNTNIILIGCPYFVN FT RPILNNFVYELNCNFYNVAKNNKNIYYLDSNDKIFDQFKTDNFVINKAVKR FT CIIKNLGDLITECQDHTNRSAVEQNPVATFLDTSLPSVDKSDSQSFLSVFL FT NRFVCLQPSHSKISISINKKTVNSFHGIYQNVRGLKSKSVTFFNSFLSTSD FT EFDLIVITESWLDDTVLDGELFSDNYYVFRRDRDLRHLGIHRGGGVLIAVN FT NKYNCRQLDISNISRRSSPAIDLVAIKINVTSYQHLFVFVVYIPPSISFSD FT FDNFFDLLNNFDEINNDNVLFLGDFNTPLFSGFSKNNFSISLKHFASFHNF FT NQFNNIRNNRNNILDLIFSNMDCAVSKSSFSFVPEDPFHPSLEFTLDIVAK FT TPDKFYMNRDHVAFNFKKANFDCLYTTLQVVDWSELEKFSDVNEACRVFYE FT TLYNCFELSVPKLFCRKNYRTFPTWFNTDIINCLRRKSNILKKYRRTKNMH FT FYEEFKKLRSLSKSLITTAYSQYIGHMENTISKDPKKFWSFVQAKKRNSRI FT PGVLYHNGNAISNPNDIVNSFAKFFKSVFSDMTLTSDSPSMMTNLNSFITV FT NEINEHEIIRVLKRSKDTLTAGCDGIPSFLLRDCAHIFSKPLVYIFNLILQ FT TSTFPNIWKIAHVCPVFKTGDITQIQNYRPISILCNFSKVFESIIYDRIYS FT SVKRFISPSQHGFVEKRSTVTNLACFTQFISDALDRKGQVDVIYMDFKKAF FT DQIDHHLLLLKLDQYGFSGSLLSLIKSYLANREQRVKYRNYISDSYVATSG FT VPQGSNLGPLLFLLFINDICGSLSTCAKLLFADDLKIYTEIKSIEDCLTLQ FT NNINAVVKWCNENRLYLNPSKCNVMSYTKKREFLEFVYDISSVTLHRTFII FT KDLGVIFDTELAFSEHIRDVTARAIKSYGFIYRNCRDFKNLSVMKTLFFSL FT VRSKLEYGALIWHPIYKIHIDQLENIQRRFLKFLVFIIDGNYPIRGYDQNL FT LLNRFGLQSLQFRRICIIIKFLYNLIN" XX SQ Sequence 4800 BP; 1561 A; 807 C; 735 G; 1697 T; 0 other; ataatcttta acaaaaattt gctttaaaat aaaatgtttg aaaaaccgaa attaaactaa 60 attaattgtg tctaacactt tagtagagga aaaaggagaa gacataaaag tcaccaaagt 120 cttcagagtc gtttttagta ctaaatttgc ttatatgcaa gcgcttaatt aaggtaacag 180 ttttttggtt tcttgcttat atctcgaaaa cagttactcc tatcaatttt tatcttttta 240 ctaaaattaa agctgataaa atttcctaca aaatagttat ttgcattttt catgtaggac 300 taaccataag cgagatataa gtttgaaaat aattaatatt taaaaaaagt gctttaacag 360 aaaatgcctt gtgatttaaa atacacttaa tttattgggt ccaatacttt attatttact 420 ttattttttt ggtgctatgc taggcttgtg ctaatgtgct ataaataagg tgttacaatt 480 ttgtgctgta aataaggtgt tgtgcatttt gtgctataaa cgtcagtttt ttaatttttc 540 gttaattatt tgtggtaata acttaataag tgactaaggt ttaaggttat gttaaactgt 600 agtcaatagc gttttgcatt tcttttggac tttcaaacat ttatacttta cacaaagagt 660 tatactgata taaatgacgt gatcaatgct cgtactataa ccaataatcc aattttgtgc 720 tactttttat gaacattagt ttttacatgt aaatttcata tcagaggcgt acgcttctag 780 aattgctgat aataattatt attctatcct ctgttccttt tgtgtccttt ccgtggccta 840 cctgtgcttg tttttttatt ttgagtcact acgaacggcc gcgtagtgtg aagcgtgtgt 900 ccttgtcttt attcccttgc ctgattgtat ttatcaactc actcataatg agtgtctgct 960 ataaatgcaa aagtagcgtc gtgaattcta tcgcttgctt aaattgcaaa aaagcgtatc 1020 accccagttg cttaagagcc tttaaaaact attctgcgat tggtcaagat ctgacatcaa 1080 taagtgcatg cccttgttgc gataaattaa aatctttaca aaatcccact tctccgactc 1140 taaatatctc tactaatact atatttgaag accttggtat cctaaagcaa gaaaccgact 1200 taataaaaaa aactcttgat gatttaaaaa cgtttataac tgaatcaccg actactgttt 1260 gtacttttga taaaatagat gaatatgtac gcaaaagttt tgatgatctt gaaagcaaat 1320 taatttcttt gttaaaggat acccttaatc gtgaaattga gaaattaaaa aattcaataa 1380 aaaacgaagt aatagcagaa atctctacta atttgaaccg taccaatgat aatattaaaa 1440 ctgccaactc ttctgattta actttgaaat ctacaaattc aaagaatgta agtacccagt 1500 ccgtgtctaa gacgaacaga tataaagttc gtatatttgc tgatcaaaaa ggacgtcatt 1560 gtatagactt attagaaact tatattccag ttagcaaata tgtgatccaa tccgtagtaa 1620 aaccttattc ctcgttacaa gagcttctta aaactgaaaa caacaacgag caacttggca 1680 ataatgattt tgttgtaata tacgctggaa atcatgaagc acaaaaagga aaaatccctg 1740 atgaaaatac tctaaatgaa cttatcaatc ataaattttt aaatactaat atcattctta 1800 tcggctgtcc atattttgtt aatcgtccca ttttgaataa ttttgtttat gaattgaatt 1860 gtaattttta taatgtggct aaaaataata agaacatata ttacttagac tcaaacgata 1920 aaatttttga tcaattcaag acggataact ttgtaataaa taaagctgtt aaaaggtgca 1980 ttataaaaaa cctgggcgac ctaatcactg agtgccaaga ccatacaaat agatctgcag 2040 ttgaacaaaa tccagtagca acgtttcttg acacatctct gccatcagtt gataaatctg 2100 actctcagtc tttttaaatc gcttcgtctg tttacagccc tcacattcta aaatatctat 2160 atcaatcaat aaaaaaactg tcaatagttt tcatggcatc taccaaaatg ttaggggatt 2220 aaaatcaaaa tcagttacat tttttaattc atttttatct acgtctgatg aatttgattt 2280 aattgttatt actgagtctt ggttagatga caccgtcttg gatggtgaat tgttttctga 2340 taattattat gtttttcgtc gggacagaga tctccgccac ctgggtatcc atagaggtgg 2400 tggtgtgctt atagcagtta ataacaaata taactgtcgt caactcgata tatctaatat 2460 tagtcgtaga agttctcctg ctattgatct ggttgcaata aaaataaatg taacaagtta 2520 tcaacatctc ttcgtgtttg tagtctacat tcctccttct atttccttta gtgattttga 2580 caattttttt gatcttctta ataattttga cgagattaat aatgacaatg ttttattctt 2640 aggagacttt aacactcctt tattcagtgg attttcaaaa aacaatttta gtatttctct 2700 caaacacttc gcttcttttc ataattttaa tcaatttaat aacatacgta ataataggaa 2760 taatatattg gatctcatct tttccaacat ggactgtgcc gttagcaagt cctctttctc 2820 gtttgttcct gaagatcctt tccatccgag tttggagttt accctcgata ttgtagcaaa 2880 aactcctgat aagttctata tgaataggga ccacgtcgca tttaatttta aaaaagcaaa 2940 ttttgattgt ctatacacta ctttacaggt agttgattgg tccgaactgg agaagtttag 3000 cgatgtgaat gaggcatgtc gagtttttta tgaaaccctc tacaattgtt ttgagttatc 3060 tgttccaaaa ctattttgtc gtaaaaacta ccgcacattt cctacttggt tcaacactga 3120 cattatcaat tgtctacgtc gtaaatctaa cattttaaag aaatatcgtc gaactaaaaa 3180 tatgcatttc tatgaggagt ttaagaaact tcgatctctt tcaaaatccc taataactac 3240 tgcctattct caatacatcg gccacatgga aaacactatc tccaaagatc caaagaaatt 3300 ctggtctttt gttcaagcta aaaagcgtaa ctctagaatt ccgggtgtgt tgtatcataa 3360 tggtaatgca atatcaaatc ctaatgatat agtaaatagt ttcgctaaat tcttcaaaag 3420 tgttttttct gacatgactt tgacttcaga tagtccctcg atgatgacga atctaaatag 3480 ttttattact gtaaacgaga ttaatgagca tgaaataatc cgggtattaa agcgatctaa 3540 agacacactt actgctggtt gtgatggtat ccccagcttt ttattaaggg attgtgcaca 3600 tattttttct aaaccactcg tatacatttt taatttaata ttgcaaactt cgacgtttcc 3660 taacatatgg aaaatagcac acgtttgtcc tgtcttcaaa acgggtgata taacacaaat 3720 tcaaaactac aggccaattt caatcttgtg taatttctcg aaagtttttg agtcaattat 3780 ttacgaccgc atctactctt ctgtaaaaag atttatatca ccttcgcagc atggatttgt 3840 tgaaaaaaga tcaactgtta caaatctagc ctgttttact cagttcatat ctgacgcttt 3900 ggacagaaaa ggtcaggttg atgtcattta tatggacttc aaaaaagcat ttgatcaaat 3960 tgatcatcat ctgttattgt taaaacttga tcagtatggt ttctccggtt ctttgctatc 4020 cctaataaaa tcttatcttg ctaatcgaga gcaaagggtt aaatatcgta actacatttc 4080 ggatagttac gttgcaactt cgggtgtgcc gcaaggttcc aatcttggac ccttattatt 4140 tttgcttttc attaatgata tatgtggttc tttatcaaca tgcgctaaac ttctctttgc 4200 cgatgatctt aaaatttaca cagaaataaa atctatagag gactgcctca ctttacagaa 4260 caatattaac gcagtagtga aatggtgtaa tgaaaatcgc ctttatctaa acccatccaa 4320 atgtaacgtt atgtcttaca ctaaaaagcg cgaattcctt gaatttgtat atgatatttc 4380 gtccgtaaca ctgcatcgta cctttattat taaagatttg ggagttattt tcgatactga 4440 actcgcattc agtgagcata ttagggatgt tacggctaga gcgattaagt catatggatt 4500 catctataga aattgtcgag attttaagaa cttatcagtt atgaagaccc tattcttctc 4560 gttagttaga agcaaactgg aatacggcgc acttatttgg cacccgattt ataaaatcca 4620 tatagatcaa ctcgaaaaca tacagcgacg attcctaaaa ttcctagttt ttataataga 4680 tggaaattat cctataaggg gctatgacca gaacttgctt ttgaatagat tcggtctcca 4740 atctttacag tttcgccgta tttgtattat aattaaattt ctgtataatt taattaataa 4800 // ID L1-47_AAe repbase; DNA; INV; 4420 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE L1-type non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-47_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4420 RA Jurka J.; RT "L1 non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1402-1402 (2011). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 399..1163 FT /product="L1-47_AAe_1p" FT /translation="MMEDGAVEVRLFDLSNDISNEQIAEFLSEYGDVLNVR FT DLLWDERSAFAGVKTGVRIARMVVKKNIPSLVTIRGEDSAVGYKGQRQTCL FT HCQEFVHVGIPCVQNKKLLVQKLSADLSYANVAKQSALPKPAKPNNPPARA FT PESRVQQNPPRPTTIKPSESRSVTQPTTVQTINAMPPPAVKATGAPATIQF FT TPFLNPSGNINPLRKSDGNETDSSQASLGSNSSRRLRSKRSPPGKKMRHSD FT SNEHIRGSGDDMQQ" FT CDS 1166..4342 FT /product="L1-47_AAe_2p" FT /translation="MEHTSVNIGTININTITNTTKINALRTFIQTTDLDIV FT FLQEVENEQLALPGFNIVCNVDHARRGTAIALKQHIKFSHVEKSLDGRLIA FT LRVHNTTLCNVYAPSGSAQRAERERFFNNTVAYYLRYRTDHIVLGGDFNSV FT IRQRDATGSNASPSLQSTVQQLRLLDVWQQLRQHESGYTFITHNSSSRLDR FT LYVSSGLREHLRATAVHVVCFSDHKAVTTRLCLPLPDRAPGRGFWSLRPHL FT LSDENIEELQTSWQYWTRQRRYYTSWMQWWLSCAKPKIKSFFRWKSKIVYD FT AFYREQQRLYEVLRHAYDNYHDDRSMLIRINRAKAEMLSHQRNFTEMFVRV FT NETYVAGEPISTYQLGERTRRRTTIEQLRNERDEIIDDCDAIRDHMTEYFT FT NLYARGRVDEIDGTGFESGRVIPENDPVNEACMSDITTVEILSAIRTSASK FT KSPGPDGLPKEFYLRAFDVIHRELNLVLNEAINSNFPTQFVDGVIVLVKKR FT DAGDSARSYRPISLVNYDYKILARILKARLENVMRTHNVLTESQKCSNGKR FT NIFQATLAIKDKIAQLKACRQTAKLIGFDLDHAFDRVDQSFLFNTMRGMGF FT NTSLVDLLSNIAAASSSRLLINGHLSASFPIQRSVRQGDPLSMHLFVIYLH FT PLLRRLEHVCGGDLIVAYADDISAIVSSVEKLNAMRDLFRCFGRVAGAVLN FT ENKTTAIDVGLIDYPITVPWLRTENTIKILGVYFCNSVRGMVTLNWDKIVT FT NFSRQVWLHSMRCLTMHQKVTLLNTFISARMWYMAAHLTPTAAHIAKLTAT FT MRRYIFRGVPATVPMQQLARSKVAGGVKLHLPTFKCKSLLINRHLNEIESL FT PFYNSYINPTNPPQPFSISDLPCLKLILDNLANFPFQIRQHPSADLIHRFY FT VEQTERPKVETEYPNANWCRVWANIWMRGLSSSQKSALYMWTNQKIPHRRL FT LFKMRRTDGEQCLYCAERSEFLQHKFFSCPRVNDAWRVLQRRLLAITGRRS FT FVCDELLRPSLEWTSVTTRRTVLKTLVNYISFIENSNLRIDVDALRFALEV FT EV" XX SQ Sequence 4420 BP; 1240 A; 1112 C; 1008 G; 1054 T; 6 other; cagttagcgc tcaacttccg agccgatcag tcgatctatc gaggagcgaa tttcggaagt 60 cgttcacgga ctatctcgta cgtttttgtc gatcgctatc accgccacaa gtatcacgtg 120 cwgtgacgmg tgccgtgttc cgcgatgccg cgccgcgaaa acacgtttcg cgtcgattac 180 tcgcagtttc cgaagcagct ttcccacgat gaaatmcaca aattcgtcgg gaaagaactc 240 ggtctcacgc gagaaaacgt kcttctcctc cagccgagcm ggcggctggg tgcaccttcg 300 tggaggtcaa caamctcgag ctcgccgagg cgatcgtcca gcagcacgac aacaagcacg 360 agttcgtgtt tgatggtaag atctacaagc tgcgcatcat gatggaggac ggagctgtgg 420 aagtacgctt gtttgatctg tccaatgata tttcaaacga acagatcgcc gaattccttt 480 ccgagtatgg tgacgttctt aacgttcgtg atttgctgtg ggacgaacga tccgcctttg 540 caggcgtcaa aactggcgta cgcatcgctc ggatggtagt aaagaaaaac attccttcac 600 ttgttacgat tcgtggtgag gactcggcgg tgggctacaa ggggcagcgg cagacgtgtt 660 tgcactgcca ggagtttgtg catgttggta taccatgtgt gcaaaacaag aaactgctgg 720 tgcagaagct ctcggcagac ctctcgtatg caaatgtagc gaaacagtca gcgttgccga 780 aaccggcaaa gccgaacaat ccaccagcgc gagcaccgga gtcgcgagta caacaaaacc 840 caccacgtcc tacgacaatc aaaccgagcg aatcccgatc ggtgacccag ccgacgacag 900 tgcaaacgat caacgcaatg cctcctcctg ccgtaaaagc gaccggagca ccggctacta 960 ttcaattcac accgttcttg aatccgagcg gcaacatcaa tcctctacga aaatctgatg 1020 gcaacgaaac cgacagctct caagcatcac tcggctccaa tagcagccga cgattacgca 1080 gcaaacgatc gccaccgggg aagaagatgc gacacagcga ctctaacgag cacattcgag 1140 gttccggaga tgatatgcaa cagtaatgga gcacaccagc gtaaatatag gcactatcaa 1200 tatcaacacg ataaccaaca caacgaaaat caatgccctc cgtacattca tccaaacaac 1260 cgatttggac atcgttttcc tgcaggaggt ggagaacgaa cagcttgcct tgcccggttt 1320 caacatcgtt tgcaacgtcg atcatgcgcg gagaggaacc gcaatcgctc tcaaacagca 1380 tataaaattc tcgcacgtcg aaaaaagcct ggatggaaga ttgatcgctt tgcgtgtgca 1440 caataccact ctgtgcaatg tgtatgcccc atcgggttca gctcagcgtg cggagcgaga 1500 acgctttttc aacaacactg tagcatacta cctacgatat cgcaccgatc acatcgtact 1560 cgggggcgac ttcaattccg taattcgtca gcgtgacgcg acgggatcca acgcaagccc 1620 ctctctccag tccaccgtac agcagcttcg tttgctcgat gtgtggcaac agcttcgaca 1680 acacgaatcg ggatacacat ttatcacgca caattcttcg tccaggctcg atcggcttta 1740 cgtcagctcc ggattacgag aacatctgag agcaacagct gttcatgtcg tttgcttctc 1800 ggatcacaaa gccgttacta cgagactatg tcttcctctc cccgatagag cacccggcag 1860 gggtttctgg tccctccgtc cccatcttct gtcagacgaa aatattgaag aactgcaaac 1920 aagttggcaa tattggaccc gccaacgccg atactacacg tcatggatgc aatggtggtt 1980 atcatgcgca aaaccgaaaa ttaaatcatt cttccgatgg aaatcaaaaa tcgtttatga 2040 tgcgttctat cgcgagcaac agcgcctcta cgaggtattg cggcatgcgt atgacaatta 2100 ccacgatgat cgttccatgt tgatcagaat caatcgtgcg aaagcagaaa tgctctccca 2160 tcaacgtaat ttcaccgaaa tgtttgtgcg tgtcaacgaa acgtacgtgg cgggagaacc 2220 aatatctacc taccaacttg gagaaagaac acgacgaaga acgaccatcg agcagctacg 2280 aaacgagagg gacgaaatta tcgacgattg cgatgcgata cgtgatcata tgactgagta 2340 ttttaccaac ttgtacgcac ggggccgtgt agacgaaata gatggaacgg gcttcgaatc 2400 tggaagagtc atccccgaaa atgacccggt caatgaagcg tgcatgagcg acatcacgac 2460 tgttgaaata ctctctgcaa tccgaacaag tgcatcgaaa aaatccccag gtcccgacgg 2520 gcttcctaag gaattttact tgcgtgcgtt cgatgtaatt caccgtgaat tgaatttagt 2580 attgaatgaa gcgatcaatt caaacttccc gacacaattt gttgatggag taatagtgtt 2640 ggtgaagaag cgcgatgctg gggactccgc tcgatcatac agaccaatca gtttggtcaa 2700 ctatgattat aaaattctag ctcgaatact gaaagcaagg cttgaaaatg tgatgcgcac 2760 ccacaacgtc ctaaccgaat cacaaaaatg ctccaacggt aagagaaata tatttcaagc 2820 cactcttgca ataaaagaca aaatcgcaca gttgaaagca tgtcgacaga cggccaaatt 2880 aattggcttc gatctcgacc acgcatttga ccgtgttgat caaagctttt tgttcaacac 2940 catgcgtgga atgggattca acacatcatt ggtggatctt ttatccaata tagctgcagc 3000 ttcttcgtct cgtttgctaa ttaatggcca tctctcagca agcttcccga tccaacggtc 3060 ggtccgccaa ggggatcccc tatcaatgca cttgtttgtg atctacctac atcccctcct 3120 acggcggctg gaacacgttt gtggtggtga tttgatcgtt gcttacgcag acgatatcag 3180 tgcgattgtt agcagtgtgg agaagttaaa tgcgatgcgt gatctgtttc gttgcttcgg 3240 acgtgtagcc ggtgccgttt tgaatgaaaa caaaacgact gcaatcgatg tcggcctgat 3300 tgactaccct atcactgtgc cgtggttacg cactgaaaat acaatcaaaa ttcttggtgt 3360 gtacttttgc aactcggtac ggggaatggt cacgttaaat tgggacaaaa tcgttaccaa 3420 cttctcaagg caggtttggc tccattcgat gcgctgtttg accatgcacc aaaaagtgac 3480 cctgttaaat acgtttatat cagcgaggat gtggtacatg gctgcccact tgactccaac 3540 agctgcgcat attgcaaaac tgactgctac gatgcgaaga tacatttttc gaggtgtgcc 3600 tgcgaccgta ccgatgcagc aactggcgcg cagcaaagtg gccggtggag tgaaactgca 3660 tctcccaacg tttaaatgca agtctttgct gattaacagg catctaaacg agattgagtc 3720 ccttcctttc tacaattcct acatcaaccc aacaaatccc ccccaaccct tttccatctc 3780 agatcttccc tgcctcaaat taatcctcga caatctagca aattttccct tccaaatccg 3840 ccaacacccc tccgccgatc tcatccatcg cttctatgtg gaacaaacgg aaaggcccaa 3900 agtggaaact gaatacccaa acgctaactg gtgtcgtgta tgggcaaaca tctggatgcg 3960 tggactctca tcatcgcaga agtctgctct atatatgtgg acaaaccaga aaataccaca 4020 ccgacggcta ctgttcaaaa tgcgacggac agacggggag caatgcttgt actgtgctga 4080 gcgaagtgaa ttcctgcagc acaaattttt ctcttgtcct agagttaatg atgcgtggag 4140 agtactgcag agaaggctgt tggcgataac cggtcgacgc tcttttgttt gtgacgaatt 4200 gctgcgacca tcactcgaat ggacatcggt aaccacgaga cgaactgtat tgaaaacttt 4260 agtgaattac atttccttca ttgaaaattc caatttaagg atagatgtag atgcgttaag 4320 atttgcatta gaagtcgaag tttaatcaat tatgtgaata atttttacta agcatatcga 4380 ccaataaaca gctttttata acaaaaaaaa aaaaaaaaaa 4420 // ID DNA4-9B_AP repbase; DNA; INV; 185 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-9B_AP. XX NM DNA4-9B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-185 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1956-1956 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 185 BP; 41 A; 51 C; 55 G; 38 T; 0 other; ctatactcta ttcggaatga gatcgtacct cggcggcgac cagttttcgt cgggcggcgg 60 tcagacatat ggtgagacgc ttcccccacc tagaaagctc gtggggctag cgctgcggaa 120 cgaagccacg cccatgcgca gaagttggcc gccgaggtac gatctcattc cgaatagagt 180 atagt 185 // ID Copia8-NVi_LTR repbase; DNA; INV; 209 BP. XX AC AAZX01003863; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia8-NVi; KW Copia8-NVi_I; Copia8-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-209 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1115-1115 (2007). XX DR Genome; AAZX01003863; Positions 9247 9039. XX SQ Sequence 209 BP; 45 A; 62 C; 43 G; 59 T; 0 other; tgttgaagct agagatgacc acttggaact ctcccacgtt ctactcatgc gcaagctcgc 60 cgagctgcgc cgctcgactc ggcgcgtgag ttcgtcgctt agccctccga gagagcgctc 120 gtactttccc gaagtttaat aaattacttt tttcacgaga ccttttgcat tttgattatt 180 tacgagactc accctacctg atcccaaca 209 // ID CR1-67_HM repbase; DNA; INV; 4598 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 24-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-67_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4598 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1894-1894 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(144..1445,1433..3412,3327..4472) FT /product="CR1-67_HM_1p" FT /translation="MAPRSIEEMFIYLKCDIYTFEENFLKNIEEICSHPKD FT LVIASMNSISESTIRNLRERLFAEIVETFCADEFSEANITLDLTNPVEKNL FT RKRYKTSLCLEDIYILSITLCEKQLHKEIVKVIISTKNVDTLPNDKFLTKS FT VKELLHISKEIQKENKEIKEELILLKEKIIFQNKLIEKLKIENRQSELDIS FT LLQPCQIPKLKIDGYTQCENKINNPAATKNQTQTTHKSFLNVEYKPQLLKT FT FANVVAQTHSNTNQPQIDAMLTTKINKKTNIDQQLHASNTNNDENNFTLVC FT RNSKSNHKTQNRSLVKKHRISEPVFGTKISENKTIAGERIVRKFDIFVGGV FT SNQINEELFKNYLETEIGISPLSVILNKENEYNRSYKVTVNNTEKNIIFNP FT ALWDNNIIVKPYRKKRLYTNISQNLETNGNQEDFHRSKWIIKMDLGNIFKE FT DKLPTTFNLCTFNCHGLKSNFEYTKSLILSHDITFICEHWLSNLEYFIIKN FT IYKNTHSSFFHQANKHEQGRPFGGNAFFIRKHLFQNIIILYEDDHIFAINL FT KKNQTNIIIIGIYLSSSRNNQTSLEDYKSQLDIISGVINNYEGVAEFIVLG FT DFQSFPHGIYDSLKRASSTKNNYSVVLSNFIKSNKLELVDVTKGSGPNITY FT HHNTLPNASYIDHIAFSKYTSLLYSNCFVIPFLSQNMSDHLPVSIEVGIIG FT QSSQENNHKNSDKYSIPNYAWNNNDFLQLYNNRLNTNFNFHNFTNDNYDEE FT LIKVYTVITKSASEALNQYLSMKTPSLYSKSWWTPELSRSKNVLSFYFKKW FT RETGFVKNLNSQIFIRYRMARKNFRNAVKNAQNKNLYKEYIKIETLKNTNP FT KYFWKTFRNIKKDANSKLYIINNKKDKDSITKEFAYSFENRLNSKAITYTS FT TNLKIPPCTKFDEVIISDENIKTVISRLKLNKSKDAFGISAEHLKNASCEA FT LTEWLRKFFNFSINHGQTPKSMSTSLIIPLTKSYKKSLTDPNNYRGISIIP FT IFTKLIEYLILLICPEIKETHPLQFGFTKNSSTLHAEFVISETIKHYNNNN FT SPVYLCSLDAEKAFDSCNWDILFERLYNDKKTPSVYCMLKKRLTAAIGTFC FT LRDCTMIKKLPLYIVNTISSLYYESSASVSYLDCKSYPFYLTQGVRQGSIL FT SPHLYNIYTQNLLETLQNESVVGTSINGNYTGVVAYADDIILLSSTLSGLQ FT KLINICNMYTHKNCIKLNADKTEFLLTGKKQIKNCTITLNHQQIKLHDKLS FT HLGFTWDTKSSPFASLNKSNIDHRISNFRTVIQTLIQSGIRFAHPNSIVQL FT YKTLAVPTLTYGLELCEHKKMLMQKLDIVGRNALKSLLDVSKHSKNYIHTL FT FFIEDISIIIQQNKLNLFIRLMNNKMTYDIIRSQLKNKTAPQHFLDSIKGL FT CNNHELNLETIMESKKKIKIVGLKNLIPETDLQSLKCAVEYWNLKEQRTIF FT KDILEKHIPR*" XX SQ Sequence 4598 BP; 1827 A; 800 C; 618 G; 1353 T; 0 other; aaagttttat acgcaagaga gcaagacgtg ttttttaaag agaaaaaata actggtaata 60 aataaagttt ttttttatat atataatatg gttctttatt tataaaactt gaaattgaaa 120 aagaatttgt caataaaaac aatatggcac cacgttcaat cgaggagatg tttatctatt 180 taaagtgtga catttataca tttgaagaaa attttttgaa aaatattgag gaaatttgtt 240 ctcatccgaa agacttagtt atcgcaagca tgaactcaat ctcagaaagc acaattcgaa 300 accttcgaga acgactattt gctgaaatcg ttgagacatt ttgtgcagat gaatttagtg 360 aggcgaacat aactctagac cttacaaatc ctgttgaaaa gaatcttcgt aaaagataca 420 aaacaagttt atgtttagaa gatatttata tactttcaat cacattatgt gaaaaacaac 480 ttcataaaga aattgttaag gtaattatct caacaaaaaa tgtggacaca ctgcccaacg 540 ataaattttt aactaaatct gtaaaggagt tgttacatat ctcaaaggaa attcaaaaag 600 aaaataaaga aataaaagaa gaactcatac ttttgaaaga aaaaataatt ttccaaaata 660 aacttattga aaaacttaag atagaaaaca gacaatcgga actggatata agtctcttac 720 agccgtgtca aataccgaaa ttaaagatcg atggttatac gcaatgtgaa aataaaatta 780 acaacccagc tgctacaaaa aatcaaactc aaacaacgca taagtctttt ttaaacgttg 840 aatacaaacc gcaactatta aaaacattcg caaatgtagt agctcaaact cactcaaaca 900 caaatcaacc acaaatcgac gccatgttga ccacaaaaat aaataaaaaa accaatatcg 960 atcaacaact acacgcatca aacaccaaca atgatgaaaa taattttact ttagtgtgcc 1020 gaaatagtaa atctaatcac aaaacacaaa atcgttcatt agtaaaaaaa catagaattt 1080 cagagcccgt gttcggaaca aaaatttctg aaaataaaac aattgcaggt gaaagaatag 1140 ttcgtaaatt tgacattttt gtcggagggg taagcaatca aatcaatgaa gaacttttta 1200 aaaactatct agaaacagaa ataggaatct cccccttatc tgttatattg aataaagaaa 1260 atgaatacaa tagatcatac aaagttaccg taaacaacac agaaaaaaat ataatattta 1320 accctgcttt atgggataac aatataatag ttaaaccata ccgaaaaaaa cgtttatata 1380 ctaacatatc tcaaaatctg gaaactaacg gaaaccagga agattttcat agatcaaaat 1440 ggatttaggg aatattttta aggaagataa acttccaaca acatttaatc tatgtacttt 1500 caactgccac ggtctaaagt ctaattttga gtataccaaa tcgttgattc tctctcatga 1560 cataactttt atatgtgaac actggttatc gaatttagaa tatttcataa taaaaaatat 1620 ctataaaaat acccactctt cattcttcca ccaggctaac aaacatgagc agggtcgacc 1680 gtttggcgga aacgcttttt ttattcggaa acatttgttt caaaacatta ttattctata 1740 tgaagatgat catatcttcg caataaatct taaaaaaaat cagactaaca ttattattat 1800 tggcatttat ctttcatcgt cgcgcaataa tcaaacatca ctggaggatt acaagagtca 1860 gttggatata atttcaggtg ttattaataa ctacgagggc gttgctgaat ttattgtcct 1920 tggtgacttt caatcttttc ctcacggaat atatgattca ttaaagagag ctagctctac 1980 gaaaaataac tactccgtag ttttatcaaa ttttattaag tcgaataaac ttgaattagt 2040 ggacgtaaca aaaggctctg gacctaacat aacgtatcac cataatacat taccaaacgc 2100 atcttacata gatcacatag ctttctcaaa gtatacgtct cttctttatt ccaattgttt 2160 cgtaattcct tttttatctc aaaacatgag tgatcattta ccagtatcga ttgaagttgg 2220 aattattgga caatcttctc aagaaaacaa tcataaaaac tcagacaagt acagtattcc 2280 aaattatgcc tggaataaca atgatttttt acaattatat aacaatcgct taaatactaa 2340 ttttaatttt cacaatttta cgaatgacaa ttacgatgag gaacttatca aagtatatac 2400 tgttattaca aagtcggcat ctgaggcact taatcaatat ttatccatga aaacaccgtc 2460 attgtattca aaatcctggt ggacacccga attaagccgt agtaaaaatg ttctttcatt 2520 ttattttaaa aaatggcggg aaactgggtt cgttaaaaac ttaaactcgc aaatttttat 2580 tcgataccgg atggcacgaa aaaatttccg caatgctgta aaaaatgctc aaaataaaaa 2640 cctgtacaaa gaatacataa aaatcgaaac gttaaaaaat acaaatccaa aatatttttg 2700 gaaaactttc agaaatataa aaaaagatgc aaattcgaaa ttgtatatca taaataataa 2760 aaaagataaa gactcgatca cgaaagaatt tgcctacagt tttgaaaatc gattaaactc 2820 taaagcaatc acatatacct ctacaaattt aaaaattcca ccttgcacaa aattcgacga 2880 ggtaataata tcggatgaaa atataaaaac ggttatttct cgccttaaat taaacaaatc 2940 caaagacgcc ttcggaatat cagctgaaca tttaaagaat gcgagctgtg aagcactaac 3000 agaatggctc agaaaatttt ttaacttttc tattaatcat ggacagaccc caaaatcgat 3060 gtctacatca cttataatac cacttactaa atcgtataaa aaatccttaa ctgatccaaa 3120 taactaccgc ggcattagca tcattccaat ctttacgaaa ctaatagaat atcttatcct 3180 actaatctgt ccagaaataa aagaaactca tcctcttcaa ttcggtttca ctaaaaatag 3240 ctcaacccta cacgctgaat ttgttattag tgaaacaatt aaacactaca acaacaacaa 3300 ctcacctgtc tatctctgtt ccctagatgc tgaaaaagcg tttgacagct gcaattggga 3360 cattctgttt gagagactgt acaatgataa aaaaactccc tctgtatatt gttaatacca 3420 tatcgtcatt atactatgaa agcagtgctt ctgtttctta tctagattgc aaatcatatc 3480 ctttctatct aacgcaagga gtgagacagg gatctattct ctcgcctcat ttgtataaca 3540 tttatactca aaacctacta gaaaccttac aaaacgaaag cgtcgtaggt acatcaatca 3600 atggaaacta tacgggtgtt gtagcatatg ctgatgacat tatactcctt agctccaccc 3660 tctctggcct acaaaagctt attaatatat gtaatatgta cacacacaaa aactgcataa 3720 aattgaatgc tgacaaaacc gagttcctgc tcaccggaaa aaagcaaatt aaaaactgca 3780 caataactct caaccaccaa caaataaaac ttcatgataa acttagtcat ctaggcttta 3840 catgggatac aaaaagttcc ccttttgcct cacttaataa atcaaacatt gatcaccgaa 3900 tatctaattt ccggacggta atccaaacct taattcaatc aggaatccgt tttgctcacc 3960 caaactccat tgttcagtta tataaaactt tagctgttcc aacactaacg tatggtcttg 4020 agctgtgtga acataaaaaa atgttaatgc aaaagcttga tatagttgga agaaatgcac 4080 ttaagtcact cctcgatgtt tcaaaacata gcaaaaatta tattcatacc ttatttttta 4140 ttgaagatat atcaattatt atacaacaaa acaagctaaa cttatttatt cgtttgatga 4200 acaataaaat gacatatgat attattaggt cgcagctgaa aaacaaaaca gccccacaac 4260 attttttaga tagtatcaaa gggttgtgca ataatcatga actaaatctg gagacaataa 4320 tggaaagtaa aaaaaaaata aaaattgtcg gcctcaaaaa tttaattcct gaaacggatc 4380 ttcaatctct aaaatgtgca gttgagtact ggaacttaaa agagcagaga actatcttca 4440 aggacattct tgaaaaacat attccaagat gatcacgttg aaaaaaaaaa aaaaaaaaaa 4500 aaaaatttcc aagatttttt atatttatct tgtatgaaac tcttaattta ccttacttgt 4560 atagtgggtg tttaataaat ctaaaataaa aaaataaa 4598 // ID Gypsy-15_OD-LTR repbase; DNA; INV; 1039 BP. XX AC CABV01004379; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_OD_; KW Gypsy-15_OD-I; Gypsy-15_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-1039 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004379; Positions 1167 129. XX SQ Sequence 1039 BP; 336 A; 182 C; 190 G; 331 T; 0 other; tgtataattt atgtgaggca cgatagcaac tacagcagca actactggta cggaaccgac 60 aacgatcttg gactatggac ttgcttttag tctttttcgg tcacatagtg agaagattaa 120 gactaattgc acttatttat tttgtttgtt tgttttattt atttcttcgt tatttatgtt 180 aaaaactaat taattaatta aacctgacca atttaatcaa attaattaat tttatttgtc 240 atttaaaaaa tattttataa caaaaatctt tcaaaattcc tcaaaattct tcatttgtga 300 aaatgaaatt actaaactat tgttctcttc ctaaaaagcc gcaggcctgg ccagcgcatt 360 cagctattgt aactacggtt aaaaaactta ttctcgaatc ttcgcgaagc ttttttcaaa 420 aatgcgcgcg tttcagtttc ttccttgatt tcaacttgct caaaaacaga tgcaatctgt 480 tctgcataaa agcgcccgac atgcaacgaa aatcattcat ttcacaaaat aaaccaccgg 540 tcgaattttg aatcctgact cgttctttgc ttggcattct agtactagta aattggcaag 600 agtagagacg cacgattcgg tatcatggtg aacacgacgg tcaaaacgcc tgaagaggta 660 atttattcga ataatctctt ctcgcttatt tcgaatcaaa tgatccgccc gcgggaatcg 720 atgaaaaatt ttgaatttaa ttttgtaaaa actgcaattg aagacaaaat tcagcgtgca 780 aaatggattt ttgacaaaag tttggaaagt catgaaagtt caaatgagtc cgatttgaga 840 gcgcaattta accgactagc cgctaatttt gtttacaatt cttgcagcga ggatttgctt 900 aaagattacg gggaggatct ctgtggtttc gaaacagcgg aggaaagatt aacgtttctt 960 tcgggatttt acggggctga aacgaaaact gaaaagataa ggcgtaacaa agacaaactg 1020 agtaaattgg ctcggggca 1039 // ID P-36_HMa repbase; DNA; INV; 3256 BP. XX AC . XX DT 15-SEP-2009 (Rel. 14.09, Created) DT 15-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE P-type DNA transposon - a consensus. XX KW P; DNA transposon; Transposable Element; P-36_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3256 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1920-1920 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 830..2836 FT /product="P-36_HMa_1p" FT /translation="MLKKCDGSIEHIINLSKKYLTNTQLQFFESQLRMNNR FT IKQGKRWTIEDKILALRIMYRSPQAFKMFRKVFSLPSRSTILMFLEKSFGT FT LESGFSNKVIALLKMRVNTMKKLERNCSLVFDEMSLKQHLDYDKNNDKIIG FT IQSNGKPVNQVLVLMVRGLSTKWKQPIAYFYSNTTMCSTNLAKIVNNAMIH FT LHKIGLAVRCLVCDQSSSNIRALKFLGFSLLNQQISHPTTQAKVYIIFDPP FT HLIKNVRNNLIKHDILSDGKIISWKHLEELYNLDKVNAVRLVPKLTDRHLD FT PGSLLAMRVKLATQIFSSQVATALTEYASAKLLPENVLSTAFFIKNMDILF FT DILNSRKLKADKSTRSALTLNSSFIQKLEDLRVWIKGWHFSGARSQKSISS FT HWGLDVTISNILCLANELLHEDFFYVCTAQFNQDCLENFFALIRSKGGWND FT RPTAMQFKSAYKNVLVLLSIEQSNSNSNCLPESDFSAAVTIDSITNIVFEN FT QISANEQLIFKHPTALCKKKQKPYTLQTVSTDSVSQSIKQVLKLIADDLIK FT KVQVCQKCVQVLSSVAPCESNLFNSDSFLQNLLHEMETVFITQMKLILTKK FT NIIQQIIKVIKNDCNFTFLYDEHIDHAFHLEESLIRHYVYMRISYYIRFYN FT RDLAPRNKTCKNKLARIIHA" XX SQ Sequence 3256 BP; 1222 A; 463 C; 452 G; 1119 T; 0 other; gggatgtctt gcgggaccat attataaaca acataaattt actatacatc tttgttgtct 60 gctaaatata tattatatat tataatatta ttagattttg cttttcaaaa attaagttgc 120 tttttctata aatacaatat taaatatgcc agccttttgc tcagccatca gttgtgttaa 180 caagcgtgga aagaatgttg gaaataatat atcttttttt agatttccaa aagataaaaa 240 taggtaagta attattagtc aaatcaagtt tattaaataa ctagtatatc aagaatttaa 300 agaaactttt aaaacattgt tttactttaa attttatgaa acaaaaaata tttatacata 360 atatgatgta aataattcca ctaaattaat taggtttaac ctattattta gatgcaagct 420 ttgggttcgt aattgtcgtc gtcaagatct cgatacaaaa tcatgtgaag aacttaatct 480 taactatact atttgtagtg aacattttga ttcttcccaa tactatcttg ccaataatgg 540 tagtaaaaga gcacttgtaa ctgctgtacc aacaattttt aactttccaa atcaaccttc 600 taccttaaat ttaaagagaa aacagcctac aaacagactt catttacaag ataaaataca 660 aaaaaaagaa atcaacgata aagtcaaatt aaataaaata aatagtgttg agaatagcag 720 tacaaataat gtaactgtca aaactacttt taaaaataaa tatttatcat ctcaagttaa 780 gatttgtcga cttaaaaaaa aacttaaaaa cctgaagata aagcatgata tgttaaaaaa 840 atgcgatggt tcaatagaac atattattaa tttgtctaaa aaatatttaa caaatactca 900 acttcaattt tttgaaagtc aattacgtat gaataatcgt atcaaacagg gaaaacgttg 960 gactattgaa gataaaattt tagctttaag aatcatgtat cgcagtccac aagctttcaa 1020 aatgtttaga aaagtttttt cgttaccttc tagaagcact attttaatgt ttcttgaaaa 1080 atcttttggg acccttgaat ctggtttttc caataaggtt attgctttac taaaaatgcg 1140 agtaaatacc atgaaaaaat tggaacgaaa ctgtagcctt gtttttgatg aaatgtcttt 1200 aaagcaacat cttgattatg acaaaaacaa tgataagatc attgggattc aatctaatgg 1260 caaaccagta aaccaagtac ttgtattgat ggtccgtgga ttatcaacaa aatggaaaca 1320 accaattgca tatttttaca gtaatactac tatgtgttca accaatttag ctaagatagt 1380 aaataatgcc atgattcatc tacataaaat aggtttagct gtaagatgtc ttgtttgtga 1440 tcaaagctcg agtaatataa gagctctcaa atttttagga ttttcattat taaatcaaca 1500 aatttcacat ccaactacac aagcaaaagt ttatataata tttgatccac cacatcttat 1560 aaaaaatgtt agaaacaatt taataaaaca tgacatttta tctgatggta aaataatatc 1620 gtggaagcat ctggaagaac tttataattt agacaaagtt aatgctgtgc gacttgtgcc 1680 aaagttgaca gatcgtcatt tagatccagg atcactatta gccatgcgag taaaattggc 1740 aacacaaatt tttagtagtc aagttgctac tgctttgact gaatacgctt ctgcaaaact 1800 attgcctgaa aatgttttgt caacagcctt ttttattaaa aatatggata ttttatttga 1860 catcttgaac tctcggaaac taaaagcaga taaatcaaca cgctctgctt taactctaaa 1920 tagtagtttt atacaaaagc ttgaagattt aagagtttgg attaagggtt ggcatttttc 1980 tggtgctcgc agccaaaaaa gtatatccag tcattggggt ttagatgtca ctatttctaa 2040 tattctttgt cttgcaaacg agttgcttca tgaggatttt ttttatgttt gcacagctca 2100 atttaatcag gactgtttag aaaatttttt tgcattaata cgaagcaaag gaggctggaa 2160 tgacagacca acagcaatgc agtttaaatc agcatataaa aatgtcttag tactcctttc 2220 aatagaacaa tctaactcta acagtaattg tctccctgaa tcagatttct cagctgcagt 2280 taccattgac tcaatcacca atatagtatt tgagaatcaa atttctgcaa atgaacaatt 2340 gattttcaaa catccaactg cattatgtaa aaagaaacaa aagccttaca cattacaaac 2400 tgtttcgact gattcagttt cacagtcaat taaacaggtt ttgaaattaa ttgcagatga 2460 tttgataaag aaagttcaag tctgtcaaaa atgtgtacaa gttctttcaa gtgtggcacc 2520 ctgtgaaagt aacttattta attctgattc ttttttacaa aacctacttc acgaaatgga 2580 aactgttttt ataacccaaa tgaaacttat attaactaaa aaaaatatta tacaacaaat 2640 tattaaagta ataaaaaacg attgcaattt tacatttctt tatgatgaac atattgatca 2700 tgcttttcat ttagaagaaa gtttaatcag acactatgtt tatatgagaa tatcttatta 2760 tataaggttt tataatcgtg atctggcacc acgaaacaag acatgtaaaa acaagttagc 2820 tcgcattatc catgcttaat tttattttaa tgataattct tgaaacaata tatttcttat 2880 ttattattgt tgttttttct ctcagcatta ttaactatat atttagcatc taattaaggg 2940 gtatatattt aaactaatat ggagtattta tttaaaataa tatgaaacga atacagtgat 3000 aacatgatta aaatgactta tgcaaaacat gctaaattta catcaaattt acaactgcgt 3060 cattaatgtt aacatattaa tgttagatga tatataaatg ttatataagc aatgttatga 3120 aatagtttta aaattatttt aatttataaa agaaaagttt tttaatggga agccctctgg 3180 taagaataat gcaaaggtat aaatatttca ctaagagttg cgttgtttat gattacttgg 3240 tcccgcaaga catccc 3256 // ID BEL-3_AA-LTR repbase; DNA; INV; 703 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_AA_; KW BEL-3_AA-I; BEL-3_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-703 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 856-856 (2011). XX DR [2] (Consensus) XX SQ Sequence 703 BP; 244 A; 116 C; 152 G; 191 T; 0 other; tgttgcgacg ccgcttgaac aggtctactg cctgaacagt cctaacggcg cagcataaac 60 acaacatgac agctcccgta aagtggtagt gtagacagag agggcatatg tttatgtagt 120 agaagttgac cagtcggata ttggaaaaaa gtagtttcgc catcgcaata aatcgttgtg 180 attacagtga gcatttagtt tattccatag ggccttatat taagtgatta ttaagcaagg 240 aagactagat tctgcaacat ctgtgacacg attctattgg taaggacctg aaatgaaata 300 acgaagttag tatattaaaa tgtaaatact tactagccct actctgcagc cgtggtgatt 360 gcaatcaata tccggtaaat tacctgaaaa agagaaggat tatccccatc aagtttagaa 420 gattcgctga taaactgtaa gtagaaaagt atagagaaga atattgtgtt aataagagtt 480 tgtatgcatt agaaaggcat ccgcagagaa gtagttcagt tagggtagcg gtacccaaca 540 tcatcacaca gtggaaaagg aagtacaagg aaaccacttc ttgtgagtaa cattttagtc 600 ttataagatg aataattata ataaaatgaa ataaatttca gttttaagcg ttgctgaaaa 660 ccaatcagcc gctgccgaaa aatttggttt cccttcggga aca 703 // ID Chapaev3-2_AA repbase; DNA; INV; 3850 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-2_AA is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-2_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3850 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 51-51 (2008). XX DR [1] (Consensus) XX CC Chapaev3-2_AA belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-2_AA is a very young family of mosquito Chapaev3 CC transposons: genomic copies of Chapae3-2_AA elements are ~99.4% CC identical to their consensus sequence, which was derived from CC multiple alignment of 16 Chapaev3-2_AA elements. Chapaev3-2_AA CC contains imperfect 520-bp terminal inverted repeats and encodes a CC 551-aa transposase (3 exons). XX FH Key Location/Qualifiers FT CDS join(1249..1475,1554..2746) FT /product="Chapaev3-2_AAp" FT /note="transposase." FT /translation="MCAAKREFKCKNDPQKFCFVCGNYIFTTPVNFTSSLQ FT TAYRLYFKTDPKNLDKTWTPNVVCSSCKTCLARWMSGESRHLSFVVPTIWR FT QPFCHAADCYFCLTKVMQHGRHKAVEYPQVQSMTKPIPHEDSSPYPVCPKR FT KREPSVDDSDEPPFDESDVEYERKPMLFTQAELNDLIRDLDLSKEKSELLA FT SRLMERNFLSADTKVTSYRKRHEKFGKYFSKKDSACFCHDIRGLFEEFGEP FT YDPSEWRLFIDSSKLSLKAVLLHQGNNKPSIPLVHAVNMKESYESMAFILD FT LICYNQHNWKICSDLKVVAMLTGLQQGYTKFMCFLCKWDSRARKQHYVREN FT WPERKSFTIGQENVRCHPLVAKEQIILPPLHIKLGLFKNFVKALNKEGPAF FT GYLKSIFPNLSDAKIKEGVFVGPQIKKMLNDEDFYTVLNNHEAEAWKSFQR FT VVAEYLGNTRSPDYEEIVADLLKNYKRMX" XX SQ Sequence 3850 BP; 1292 A; 627 C; 698 G; 1233 T; 0 other; cactagttta caaataaaaa tgaaagtcat gaactttaat agctgagctt cattttcaat 60 gtaaaatcgt acgctgaatc caaaaaaaat gtcaaaaaaa ttcacagtag aaccgttttt 120 gagatacgct caatatttga tttttttggt ttttcaaaaa agtacatttg ctaaaattga 180 tgtatctccg gatttagagc accaaatcta aacttttttt tagcaaaata aagctaatta 240 aatgtgcttt ctaccgtctg aacatcactt tgttgtgcaa attacggttt tctagatatt 300 tatggcataa tcaaattttt tcgatatttt tgagtaaaat ttaaccaaat tttcaacttt 360 tagctaagtt ttaagcacga tgctgatgtt ttaaaattcg tttatcatgt tgaactcaag 420 gtatgtaaaa aaccaaaaga aaaaagtacc taatgactga aatcggttga aaattgaaga 480 tgttatggca ttttttgtag aacacttgaa ttttgaagtt ttgacctttt aaaacaacaa 540 gacgaaatcc atataaattt cactaaatcc tttcactttg aatgactctt agtacagatt 600 ccatgctttg ttaaccgttt tatcaattat taattttgga tcacgactaa ttaatgaaaa 660 gggcgtatgt aaaagcagaa tttatgtgtt tgcgtatata ttaaaaaagg ccagttttac 720 taacatggag tcttgatata tttcaagcta ataatgatga tgatgaactt tttatgattc 780 aatattaaat ccagcaaaca aactacaatg aatacgtgta taaaataagg ccatgggctc 840 atttgagaac gtgaaagatg tatccgcctt ggctacatac tgtatgcaac ttaaaagaaa 900 aatttccgaa gccatccgtg ttcaattatg gcatacgaaa gttcaggcta ctgcgatgtc 960 aagctttctt cagtgttgct ggctcaacgg ctgaatgatc atggcgatta atgtgataca 1020 agaaatataa attcctgact gattcctgaa atgtttggtc ctttaactgt tttgatgtgg 1080 ttataactat ttttgatgtg gtttcaagtt taatgataag tttatgctgc ttgtctgttt 1140 ggcgtttgaa cgttataact ttgttatgac aaaatgattt tggcagcact ttcgtcgcag 1200 ggaaaacatt gttgacaggt agtactgttt gttgaacata gtgaaacaat gtgtgcggca 1260 aagcgtgaat ttaagtgcaa aaatgatccg caaaaatttt gtttcgtctg cggaaactac 1320 atttttacga caccggtaaa ttttactagt tcattgcaaa ctgcatatcg gctttatttc 1380 aaaactgatc ccaaaaacct cgacaaaacg tggactccaa acgttgtgtg tagttcgtgc 1440 aaaacatgct tggctagatg gatgtctgga gaaaggtaat tatttaattt tctacattca 1500 aatgactaac aagtgctatt tactaactaa ctaactaaca aatcttgtac cagccgccat 1560 ttgtcctttg ttgtaccgac aatatggcgc caaccatttt gtcatgcagc ggattgttat 1620 ttctgcttaa ccaaagtcat gcaacacgga cgtcacaaag cagttgaata tccacaagtt 1680 caatcaatga caaaaccgat tccacatgaa gattcatctc cttatcctgt ttgtccaaag 1740 cgaaaacgag agccttcagt ggatgacagt gacgaaccac cctttgatga aagcgacgtg 1800 gaatacgaaa gaaagccgat gctgtttaca caagcggaac tcaatgactt gataagggac 1860 ttggacttat caaaagaaaa atcagaattg ttggcatcgc gcttgatgga acgaaacttt 1920 ctttctgctg atacaaaagt tacttcctac agaaagcgtc atgaaaaatt tggaaaatat 1980 ttttccaaaa aagacagtgc ttgcttttgc catgacataa gaggcttgtt tgaagaattc 2040 ggtgaaccat atgatccttc tgaatggcgt ttgtttatcg acagtagtaa attaagttta 2100 aaagctgttc tgctgcatca agggaacaat aagccttcga ttccgctagt tcatgccgtg 2160 aacatgaagg aatcatacga atcgatggct tttatacttg acctgatatg ctacaaccaa 2220 cataattgga aaatatgctc agatctcaaa gttgttgcaa tgttgactgg cttgcaacag 2280 ggatatacta agttcatgtg ttttctttgc aaatgggata gccgagcacg taaacaacat 2340 tacgttcggg aaaattggcc cgaacgaaag tcttttacaa ttggtcaaga gaatgtaagg 2400 tgccatcctc tcgtagcgaa ggaacagata atactcccac cactgcacat aaagcttggg 2460 ttgttcaaga attttgtaaa ggcgcttaac aaggaaggac cagctttcgg atacctgaaa 2520 tccatttttc caaacttgtc ggatgccaag attaaggaag gtgtgttcgt tggtcctcaa 2580 attaaaaaaa tgttaaacga tgaagatttc tacaccgtat tgaacaatca tgaagccgaa 2640 gcatggaaat cgttccagcg tgtagttgcg gagtatttgg ggaacaccag gagcccagac 2700 tatgaagaaa ttgtagccga tttactcaaa aactacaaga gaatgggtaa attttaaagt 2760 tttgttttat ttactttcag ctgtaaaatt ttacatttta ttttaacagg tgtgaatatg 2820 tctctgaaga tacatttcct acactcgcat ttgaactttt tccctgaaaa cctcggcgac 2880 gaaagcgatg agcacgggga acgttttcat cagcaaatga aaataatgga acgtcgttat 2940 caaggatttt gggatgaagc gatgatgggc gattattgtt ggtttctctt tcgagaaaca 3000 aattgcaagc ataacagacg gagtgattca aacaaccatt tttaaacata tattttttac 3060 atgtgaattt atagcttaat ttaaggaagc tttgaaatat atgaaatgaa ctgattgttg 3120 tagagataat aagtagagat aaaacttatg ttctgagtga agtagaagct ctgttaatac 3180 acacgccctt tttacagatt agtcgtgatc taaaattatg gattgataac acaacattaa 3240 atcttaacta agacttatta actgttgaaa gtttaagcaa atctaaatga aaatgcatta 3300 aattttcttc atgttgttga gatcgcttga aaatcgtaaa aacttcaaaa ttcaagtgtt 3360 ctacaaaaaa tgccataaca tcttcaattt tcaaccgatt tcagtcatta ggtacttttt 3420 tcttttggtt ttttacatac cttgagttca acatgataaa cgaattttaa aacatcagca 3480 tcgtgcttaa aacttagcta aaagttgaaa atttggttaa attttactca aaaatatcga 3540 aaaaatttga ttatgccata aatatctaga aaaccgtaat ttgcacaaca aagtgatgtt 3600 cagacggtag aaagcacatt taattagctt tattttgcta aaaaaaagtt tagatttggt 3660 gctctaaatc cggagataca tcaattttag caaatgtact tttttgaaaa accaaaaaaa 3720 tcaaatattg agcgtatctc aaaaacggtt ctactgtgaa tttttttgac attttttttg 3780 gattcagcgt acgattttac attgaaaatg gagctcagct tactaagttc agaaatgttg 3840 taaactagtg 3850 // ID Gypsy-248_AA-I repbase; DNA; INV; 4258 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-248_AA_; KW Gypsy-248_AA-LTR; Gypsy-248_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4258 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1097-1097 (2011). XX DR [1] (Consensus) XX CC Positions [3230-3688] - Integrase core CC 'CAAC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1172..2032,2036..4096) FT /product="Gypsy-248_AA-I_1p" FT /translation="MRSCSTNLVSYCGTSIDVLGVLDARVEYGSGSVTLPL FT YVVDSEKHPLLGREWLNAIPVDWSSVLQGPDAVNAISNAALSHTAALKEVL FT GRFPKVFDDSIGKICDVQASLPLKKNAVPVFLKARKIPFNLQKTVEDELDK FT LEAEGVLTKVNQSNWATPIVPVKKSQGRVRICGDYKQTVNPNLIVDRHPLP FT TVDELFASLAGGRKFTKIDLVQAYLQLEVAPEDREILTLSTHRGLYRPNRL FT MYGVASAPAIWQRQMEVILQGIEGVSVFLDDIKVTGPDDATHLQRLEVLRR FT LDHRGIRVNKGKCEFFADQIDYCGYRIDKDGVHKMPNKVAAIQNMPQPKNE FT DEVRSFVGLINYYGRFFQNLSTVLYPLNNLLKNDVPFEWTRQCDQSFKIVK FT SQMQSEKCLVHYSPELPLLLATDASPYGVGAVLSHLMPDGTERPIQFASQT FT LSRVQQKYMQVDKEAYAIIFGVKKFFQFLYGRKFTLITDNQAISKIFGEHK FT GLPVMSAIRMQHYATYLQSFDYQIRFRKSADHANADAMSRIPLSQADPENM FT IEEADVVELNQIETLPLTAAELAQATAEDQSVRNLIQGIKHGQPVDPRDRF FT GVEQNEFSLQKGCLLRGIRVYVPPTLRRKVLQELHSTHFGTTRTKSLARGY FT CWWVGLDRDIEEMVSNCAECQSVRPEPAKSRFHCWETPTQPFQRVHVDFAG FT PFHDTYFFILVDAYSKWPEIKVCKSTTAENTVNMCREIFATFGIPSVLVSD FT HGVQFTSETFQQFLRMNGVVHKMGAPYHPSTNGQAERYVQTFKQKLKSLKC FT PKSEFNLEISNILLTYRKMLHPSTGQSPSMLMLGRQIRSRIDLMLPKNEPK FT PVGDFAVRTFLDGDRVRVRDFLSRDKWKFGKIVEKVGKLRYAVRLDDGRIW FT ERHIDHIVGVGANLREDLVNTPREEVNTERDVPTVPVPVSVATAAEPAGEV FT ATGARPVVRPQPATPSTET" XX SQ Sequence 4258 BP; 1091 A; 1048 C; 1201 G; 912 T; 6 other; caacagttct ggcgacgagg atgcggaatg cagaaaaccc cgaccgaaat gcatgctgcc 60 gctgccgttg ctgccgccgg agaagggaag accctcctgc cgctgctgga gctccgccac 120 caaactttgc gatcgacccc ttcgacaaga gaaagttgaa gtggatgcgg tgggtggagc 180 ggctggagaa cgctttcscg atctacggag ttgccgaccc ggcgatgcgg aagaattttc 240 ttctgcatta catggggtcc gagacgtacg acgtcgtgtg cgaccgggtt gcacccgaaa 300 ccccccggca gaaaaactac gatgaaatcg ttacggcgtt ttggaaggat acttcagccc 360 ccagccgctg gaaatcagcg agaatttccg gtttaagtgc cgccggcagg gcgacaaaga 420 tgccgcttct gccgacgaga cggttgatca gtacctggtg gcgctgcggc ggatagccgt 480 cacctgcaac ttcggtgcgt acctcgagac agccctgcgg aatcagctgg tgttcggcat 540 caagcggaac gatatccgaa gccggctgct agagaggaga caactgaccc ttcaagatgc 600 ccgggacata gccgtgagta tggagctatc ccgaaaagga ggtgccgaaa tcgaagggag 660 ctcgggcaga caagaggtaa acgccgtgca cgatcgacag gacaaaaagg taacccagag 720 aagaaaaata aaataaacac tgggggaaag tctttcacta aagtggggma aagtgcaagc 780 gattcttctt gttttcgctg cggggaaaag tcgcactttg caaatacgtg caggcacaaa 840 gatacagtgt gttcgttctg caaactaaaa ggacatctcg cgaaagtttg catgaagaaa 900 gccgcctccg ggaagtccag taccagccgt gccgggaaca aatcattcgg ttcagacgaa 960 ctatttgcac caatccggcg acggtgacgg ccccggacgt gtggaagtgc gggaagtgtg 1020 taccgtggac acgaagggcg gtagcgctgc aaaacgcttc tgactggacg ttcgcgtgaa 1080 cggtaaaaac attcgtttcg aagtggatac tggatcgccg gtcagtatca ttaacgcgaa 1140 gtgccgggac agatacttcc ccgaatcaca gatgcgmagt tgcagtacaa atctcgtgag 1200 ttactgcggt accagtatcg atgtactggg agtcctcgac gcgcgtgtcg agtatggaag 1260 tggaagtgtt acgttaccgt tgtatgtcgt ggactctgaa aaacacccac ttctcgggcg 1320 cgagtggctt aatgcgattc cggtcgattg gagtagcgtg ctccagggtc cggatgcggt 1380 taacgcgatt tcgaatgctg ctctctccca cactgctgcg ttgaaggagg ttttgggacg 1440 gttcccgaag gtgtttgatg attccatcgg gaaaatttgc gatgttcaag ccagcctgcc 1500 gttgaagaag aatgccgtac cggtgttcct gaaggcgagg aagattccgt tcaacttgca 1560 gaaaactgtg gaggacgagc ttgacaagct cgaagcagaa ggcgtcctta cgaaggtgaa 1620 ccagagcaac tgggctacac ccatcgttcc ggtgaaaaag tcccaaggcc gtgtgcggat 1680 ttgcggggac tataaacaaa cggtcaatcc aaacctgatt gtggacaggc atccccttcc 1740 tacggtggat gaactttttg cgtcgttggc cggagggaga aagtttacca aaatcgacct 1800 cgttcaggcc tacctacagc tggaagtggc tccggaggac agagagatat taacgttgag 1860 tacgcaccgt ggcttgtatc gtccaaatcg gctaatgtac ggcgtcgcat cggcgccagc 1920 aatctggcaa cgccagatgg aagtgatcct gcaagggatm gaaggagtca gcgtatttct 1980 ggacgatatc aaggtcacgg gccccgacga tgccacgcat cttcaacggt tgamagaggt 2040 tctacggagg ctagaccacc gtggtatccg ggtcaacaag ggaaaatgcg agttttttgc 2100 ggatcaaatt gactactgtg gttatcgcat cgataaggat ggcgtccaca aaatgccgaa 2160 caaggttgca gccatccaga atatgcctca gcccaagaac gaggatgaag tccggtcgtt 2220 cgtcggactg atcaactatt acgggagatt tttccagaac ctcagtacag tactctaccc 2280 tctaaacaac cttctcaaga acgatgtacc gttcgagtgg accaggcagt gcgaccaatc 2340 gttcaagatt gtgaagagcc agatgcagtc ggagaaatgt cttgtccact actcgccgga 2400 gctaccgttg ctgcttgcta cagatgcgtc accctacggt gtgggtgcgg tgctgagtca 2460 cctgatgccg gacggaacgg agcgtccgat acaatttgct tcccaaacgc ttagccgagt 2520 tcaacaaaag tacatgcagg tcgataagga agcatacgcg atcattttcg gagtgaagaa 2580 gtttttccag ttcttgtacg gccggaagtt cacgctgatc accgacaacc aggcaatatc 2640 caaaatattc ggagagcaca agggactgcc ggtgatgtca gccattcgaa tgcaacatta 2700 tgcaacatac ctgcagtcgt tcgactacca gatccgtttt aggaagtctg cagatcacgc 2760 taacgcggat gcaatgtccc gtattcctct gtctcaagct gacccggaaa atatgattga 2820 agaggccgat gtcgtcgaac taaaccagat cgagacgttg ccgttgactg ctgcagagct 2880 ggctcaggct acggctgaag atcagtcggt gcggaatctc atccaaggaa ttaagcacgg 2940 tcagccggtg gatcccagag atcgcttcgg ggtggaacaa aacgagtttt cgctccagaa 3000 aggatgtttg ctacgaggaa tacgagtata cgtgccaccg actcttcgga ggaaagtatt 3060 gcaagaactc cactctaccc attttggaac tacccgtacg aagtccttgg ccagaggata 3120 ctgctggtgg gtagggttgg atcgagatat cgaggaaatg gtttccaact gtgccgagtg 3180 ccaatcggtc agaccggaac cagcgaaatc tagattccac tgctgggaga cgccgacgca 3240 gccgttccag agggtccacg ttgactttgc gggcccattc catgacacgt atttcttcat 3300 tctggtcgac gcttacagca aatggccgga gataaaggtg tgcaagtcga ccacagcgga 3360 gaataccgtg aacatgtgcc gggagatatt cgctacattc ggaataccat cggtgctggt 3420 tagtgaccat ggtgttcaat tcacgtcgga gaccttccaa caattcctgc gaatgaacgg 3480 cgtcgtccat aagatgggag cgccgtatca tccatccacg aacgggcaag cagaacgcta 3540 cgtgcaaacg tttaagcaaa agctgaagtc cttgaagtgc ccaaaatcgg agtttaatct 3600 ggagatatcg aacatcttgc tgacctaccg gaagatgctg cacccttcca cgggtcaatc 3660 gccgtccatg ctgatgctcg gccgtcaaat ccggtcgaga attgatttga tgctgcccaa 3720 gaacgagccg aaaccggttg gagattttgc tgtgcgaacg tttttggacg gagatcgtgt 3780 tcgagtgcga gactttcttt cgcgcgacaa atggaagttc gggaagatcg tcgagaaagt 3840 gggcaaactg cgatacgccg tccgtctgga tgacggaaga atatgggagc gtcacatcga 3900 ccacattgtc ggagttggtg cgaacttacg tgaagatttg gtgaacaccc ctagagagga 3960 ggtgaacact gaacgagatg tgccaacagt tcctgttcct gtttcggtgg caaccgccgc 4020 tgagcctgct ggtgaggtgg ctactggtgc aagaccggtg gtgcgccccc aacctgctac 4080 gccatcaacg gaaactktcc ctgagtcgaa ccaggcgttt tcgaccccaa ggaccccaca 4140 gtcggtccaa gaagtgccaa ctcagcccct caggcgttca agtcgagtga tcaaacctcc 4200 ccaaaaattg aatttgtaat tttatcctat gaactttatt ttcaaaaaga gggagagt 4258 // ID SINE-2_CQ repbase; DNA; INV; 797 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A putative SINE family from Culex quinquefasciatus - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-797 RA Kojima K.K. and Jurka J.; RT "SINEs from the southern house mosquito."; RL Repbase Reports 11(1), 621-621 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >97% CC identity. CC 14-20bp TSDs. Long TSDs and frequent 5'-truncated indicate it CC is CC a SINE. XX SQ Sequence 797 BP; 232 A; 170 C; 189 G; 206 T; 0 other; gtgctagagc cattttagtt cgaaatagtg aaattatggt gaaactcaac tcgttcctgg 60 aaatgaagaa ttcgaggaaa agttagaaaa attgtgttgt gtaaaagtta catgaagaaa 120 actttttata atagagtgtg cgacatgctc cactcactgc acagctttgt ttttgttaat 180 aatcgtagtg atttttgtat tactgaaatt tggttgttga agatgacaca ccgcaagatc 240 gtgtccgcag ctgccaaggc aaactttcca gaatacactc gccaggatat gtgtcggatc 300 ggcaagagga agatcaagct gtccgtgatt tgcgacacgc agctggccgt ccggatccgt 360 acgactaaaa gtagctggca acgtccggat gtttagaagc agccgataaa gaaggctacc 420 gggattccag aaacttggtg agatctatcc aaactgaaac tgtaaaattc tgcctgctta 480 tcaacaacca ctaaatacgc gctgatctca tttttgcctt atgggattgg aagtctaaag 540 ataggtcgag gactcttctg gttcttggtc tatgtttggg cggggccaga caatgcccaa 600 tgacctcgga attggaccgc aaggagctct gtcattctga aacgggtcat tggaagcagc 660 gcgagggctg tcaccaccta atacccggga ctgagataac atgtatcagc ataacccaag 720 actgatacaa attatacttt cccataattt cgctgccggc cagcgggtat tggattggac 780 cacacacaca cacacac 797 // ID Helitron-1_DVir repbase; DNA; INV; 8816 BP. XX AC . XX DT 31-MAR-2007 (Rel. 12.03, Created) DT 31-MAR-2007 (Rel. 12.03, Last updated, Version 1) XX DE Autonomous family of Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; KW Interspersed repeat; Helitron-1N1_DVir; Helitron-1_DVir. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-8816 RA Kapitonov V.V. and Jurka J.; RT "Helitrons in fruit flies."; RL Repbase Reports 7(3), 130-130 (2007). XX DR [1] (Consensus) XX CC This is a consensus sequence of a family of autonomous Helitron CC transposons transposed in the Drosophila virilis genome a few CC million years ago (copies are less than 5% divergent from the CC consensus sequence). The Helitron-1_DVir consensus sequence CC encodes the 2118 Hel-1_DVirp protein composed of the REP, HEL, CC and apurinic endonuclease domains. Helitron-1_DVir elements and CC non-autonomous elements transposed by the Helitron-1_DVir CC enzymatic machinery (e.g. Helitron-1N1_DVir) are usually inserted CC in the TTT|TTT target sites without the target site duplications CC (the insertion site is marked by "|"). Different families of CC Helitrons constitute ~5% of the D. virilis genome. XX FH Key Location/Qualifiers FT CDS 2285..8638 FT /product="Helitron-1_DVirp" FT /translation="MPRTKKLSVLNRARRQQARNELNRSNSEYAEHERINN FT IESQRERRENEGFSQTERNRNRQRNQVRRQYIVYRANEQRQNTVDHSSRRA FT NSPVREQEQLANTSRRSTRRQDPVYRANEQRQNTVDHSTRRANSPVRQQEQ FT LANTSRRSTRRQDPVYRANEQRQNTVEHSTRRANSPVRQQEQLANTSRRST FT RRQDPVYRANEQRQNTVDHSTRRANSPVREQEQLANTSRRSTRRQDPVYRA FT NEQRQNTVDHSTRRANSPVRQQEQLANTLRRSRRRENPVYRADEQEQDNIR FT RARHRQDIQRRLLERARDRNQRSQVRENPIRRIIEQARNTSVRRSARIHAR FT QEAFDANRSLSQRVSQYRGNTGNRIAENRRNATRNRGERAINFSEDRISPA FT IRSSLDGLIAGFNRTVKQGPDQVCLCCKGLWFPHQVEKISREVIESKCSAE FT VSSDILYLSSIFPSDNNKYDFCKTCSRSIKLGKKPKTSITNGLDFPDLPDC FT LKGLTPLEERLISPRLPFMIIKSLGYGRQNAIKGAVVNVPIPVANVVTSLP FT RAFNESEVVQIHLKRRMEYIHDFMAETIRPSKVAEAIKFLVNTELYRKHQI FT TINEQWLSSFSSDEVPFVASADDRDLVSQLVARSAQASIDEQNNIEEDINP FT GGQETLLENVPVENIAMQRIVIAPGEGQRPLDMIEDCDSEELAFPTIFAGI FT KRKCAESFTTIVRSELRNFDRRGCRTDKLFLNFKKLEMINIRNSTSICLRK FT HSATRGITAAQVLNEEYLENLIRHDDCYRVLKNIRTSPAHWEGEKKKVMAM FT IRQIGLPTFFITFSAAETRWAELLVLLSKNVDKLEISEELAANLSFANKAR FT LIRADPVTCARYFDFRYRQVLKLMKKEGGVFGSHRVIKYYWRVEFQQRGSP FT HVHGMFWLKDAPIVDLADESSIPSVKSFIDQFITTETNNPDVAPFLEYQTH FT NHGHSCRREVRAQNICRFGIPYPPMPQTEILFPLEEDAENKEVHRRNFEKI FT QGVLEYPFDRESIDYLSDFNNFLTHDQLNLSYECYINALRSSIKKPKIYLK FT RSFAAIKLNAYNTHLLLLQRANIDIQFILDPYACCSYVVNYINKSQRGISK FT LMREAVSEISRGNVTLKQRLQHIGHKFISGTEISAQEAVYCCLGMSLSESS FT NAVVFINTSLPENRVGLRKSKQQLQNLPDDSVAILEIGLIDHYVQRPDALD FT SLCLADFAAFYSFSKRHSGTRQNIDDADDIETSRASQQLVYALRDGSGFVK FT KRNTPIIIRYRRFNINTDRSNYFRELVMLYVPWRDENVELVARDCESFCGQ FT NHELIESNRRKYNSLNDSELQHALENAVQAENEENVENQEVVDDRFRVLGL FT PEINPHLNVLNINGNDENLDSEDNNVRLIKLPAIVSAESLADIVRSLNLKQ FT KTYFTHVLHNAKNKSVFYEFVGGGAGVGKSRLILALFQSLSVMYNSRPGSD FT PNSPKIILCAPTGKAAFGISGSTLHSMFSLPVNQSGSDLRALSNDLVNTLH FT SRLIDLKLIIIDEISMVGSKMFSQLDSRLKQIFKNTSPFGGISVIVFGDLR FT QLSPVGDRWIFESNSNDPYSAIYASLLWDLFKYFELTEIMRQREDQPFASA FT LNNMASGNMTADDISLVRSRICQQSDVPDDAIHLFSTNAATDSYNTIKLNS FT IPTDQFISTADDFIKATHLTAACRQRILEAVSSFRSSETQGLCMSLLLKTT FT AKYMITVNVDTGDGLVNGATGMLMQIDFDASSTPIILWMKFSDDTVGVLAR FT SKCTHRTEPFWTPIQKIVKSFQYRRNEQVSIDRRQFPVVPAEGITIHKSQG FT ATYSKVVVHTCSRMQRAAIYVACSRATTASGLFIIGDFVPPKPPKNDDKVQ FT NEIEKLRTGKLLRTHYDHLVNSSALDIFFHNTESIHCHIEDIRSDKLMLQS FT SVLCFVEASTYSNENIEIPGFTVVVRLDCEVIPNAPRPKRGIVIYIKNAIL FT NNISNCFSGRQISGSFVFEHAIFRYKSVCFLVIYKSPSYPLGLFKTEFEVL FT FQQYLFLRERCIVLGDFNLCLSKFDRCRALYDELLRKGLRCLLDLKTPTTN FT CDSHIDWAFSNLSSEEVSARTLETVYSYHSGISLTVQDRVN" XX SQ Sequence 8816 BP; 2831 A; 1575 C; 1758 G; 2652 T; 0 other; ttataccctg aacccattaa aaatgggtac aagggtatat tgtatttgtg caaaatccaa 60 atgtatgtaa caggcagaag gaagcttcac cgaccccata aagtatatat attcttgatc 120 agcatcaata gccgagtcga tctagccatg tccgtctgtc tgtccgtccg tccgtccgtc 180 cgtccgtccg tatgtatgaa cgcacggatc tcagaaccta taagagctag agacttgaaa 240 tttgaaagtt ttagatgtag atgctcctag ctcccgcgca gatcgagttt gtttccgata 300 atcgataact tgctccgttt ccaagcaatc gataaaaatc gatatcgata tcctgttttt 360 tgggcaattt tggtaaataa taagagctgg agtcaccaaa catgatatgt tgcttctaga 420 atattatata tatgtcaagt atctttcatt ttatacctat cgccaccccc ccgctaccac 480 cccagagcta taaatcaagt taataaccca acttgtatcg ccaaccaata taagctagaa 540 tttaaatgca attgactttt tcagaataca caaatattct atggtacaat aagatatact 600 gtataaaatt tcatcaagat cggttaagat acatagaagt tattaaagaa accatgtggt 660 tatatgtata tgtacatata tattggcagg ccacgagtca gcggcagctg gcacaaattt 720 ctgtatacat gtgcactaat acacacaaaa agaattacaa ttgcgattgc atcgtatgct 780 ctatgtacat atgtggtcct attttagttc agtttttttt caattagtag caactttcgg 840 tatggctgca aggtaaaaag ccacgggaag cagttcaatt gttatatgtc tggaataagc 900 ggcgcaagtt caattccaat ggggaaaaag aatttttttg actatttatt atttgtctaa 960 taatgttagt tttattcctt cacccattaa aaacggaaat aagtcaagag aatgcttttt 1020 ttttgcttaa agccctagta agcttggcaa gtgcgtagtg cgagctattt caggtaaggt 1080 atgcataaat atccatgcat cttaatcact aaaatgtcaa tttacgcaaa atttaatttc 1140 ctcattcaga aatcatattc tttattaaat cttacaaaag aatgtaatca tatgtattta 1200 tatctatttt catacataca tacgttttag gcttaggcct aaccagtgcg tatacacata 1260 cgtatataca ttataataat ttacatatgt acatatgtgt gtttgtacac attattttct 1320 tcccgcaagt agtcgaagac cacgaagtgg aacttttcaa cactgttact ataaatctgc 1380 gcgcgtaatt tgagaatttt atttcgtaaa tatttattat agttacatga ttttaataat 1440 tcataagaca gtcttaagga ggttgactga tgcattaata aatattatca ttagattaat 1500 gttattatat atgtgattat aataataaat aaagtagcgc gcgcatttcg atatgtatac 1560 taaatcgtgg cgttaggaga gtgaatattt cccaacactc tctcttcaaa tgtgctcaga 1620 agaagcagta atcgatttag tctgtagcct gcgattgcca tcaaactacc actaaccaaa 1680 cagcatatag atggaacgtg ccgtagaagc aaaatgggca caaaaagtgc ctgttttggt 1740 tttgtgattt gtgatttttg cgtttcttac tgatctaaca acgattttta ttcgaataaa 1800 ctatattcta tcggttccga gcggttgaaa ataaataaat tgcagtgttt tgtgaccatt 1860 ttaataaaaa cagtgaatta tcgatgaatt accgataaaa atagttttcg gccgttacaa 1920 cttggcgtgt atgtgcaagt gtgtgattga cacttattag gaatcgtatt tgtgcgaatg 1980 caattcacac agctggtgtt gtaagggaag ttttggagaa ccattttagc tgaaatcgca 2040 aaaatcgcaa gtttttaaat ataattataa ttttaaagtc tttttaaaga ctatacaaat 2100 attacctact ttttatttat tccagacata tatatatatc tatatataca tataatatat 2160 ataaattata taaattaatt ttaataaatt ttattgcaaa taatttattg gaattaatta 2220 atttaaatta cgaattttac tatattacat ttttattttt attaaagttt taaagttaag 2280 taaaatgcct aggacaaaaa agttgtcagt tttgaatagg gctcgtagac aacaagctag 2340 aaatgagcta aataggtcta attctgaata tgctgagcat gaacgtatca acaatataga 2400 gtcccaacga gagcgtaggg aaaatgaagg ttttagccaa actgagagaa ataggaatag 2460 gcaaagaaac caagttcgca gacaatacat agtttacagg gcaaacgaac aaagacaaaa 2520 tactgtagac cactcgagtc gacgagctaa ttctccagtt agagaacagg agcaattggc 2580 taatacttcg cgtcgatcga cacgcagaca agacccagtc taccgggcaa acgaacaaag 2640 acaaaatact gtagaccact cgactcgacg agctaattct cctgttagac aacaggaaca 2700 attggctaat acttcgcgtc gatcgacacg cagacaagac ccagtctacc gggcaaacga 2760 acaaagacaa aatactgtgg aacactcgac tcgacgagct aattctccag ttagacaaca 2820 ggagcaattg gctaatactt cgcgtcgatc gacacgcaga caagacccag tctaccgggc 2880 aaacgaacaa agacaaaata ctgtagacca ctcgactcga cgagctaatt ctccagttag 2940 agaacaggag caattggcta atacttcgcg tcgatcgaca cgcagacaag acccagtcta 3000 ccgggcaaac gaacaaagac aaaatactgt agaccactcg actcgacgag ctaattctcc 3060 agttagacaa caggagcaat tggctaatac tttgcgtcga tcgagacgca gagaaaaccc 3120 agtttatagg gcagacgaac aagaacaaga caatattcgt cgagctcgcc atcgacaaga 3180 catacaacga cgacttttag aaagagctag ggataggaat caaagatcac aggttagaga 3240 gaatccgatt cgtaggataa ttgagcaggc acgtaacaca agtgtgaggc gaagtgctcg 3300 cattcatgca cgacaggagg cttttgacgc caatagaagt ttgtcccagc gagttagcca 3360 gtataggggt aatacaggta ataggatagc agagaatagg cgcaatgcta cgaggaacag 3420 gggtgaacgt gcgattaatt tttcagagga tagaatctct ccagctatta ggagtagctt 3480 agatggtctg atagctggat tcaatagaac agttaagcag ggtcccgatc aggtttgcct 3540 ttgttgcaaa ggcttatggt ttccacatca agtggaaaaa atttctcgtg aggttattga 3600 aagcaagtgt agtgcagaag taagctccga tattttatat ctttcctcta tatttccctc 3660 tgataataat aaatatgatt tttgcaaaac gtgtagtcgt agcattaaat taggcaaaaa 3720 acctaaaact tctattacaa atggcctaga ttttcccgat ctgccagatt gtttgaaagg 3780 cttaacgcct ttagaggaaa ggctgatttc ccctcgcctt cctttcatga ttattaagtc 3840 attaggatat ggtcgacaaa atgcaattaa aggagcagtg gtcaatgtac ccattcctgt 3900 ggccaatgtt gtaacgtccc ttccgagggc gtttaatgag agcgaggtag tgcaaattca 3960 tctcaaacgt aggatggaat atatccatga tttcatggct gaaaccatta ggccttctaa 4020 agttgctgaa gcaataaaat ttttagtcaa caccgagctt tatagaaagc atcaaattac 4080 tattaatgaa cagtggttgt cgtctttttc gtctgacgag gttccgtttg ttgcatctgc 4140 tgatgacagg gacttagtat cgcagttagt ggcaagatct gctcaagcct ccatagatga 4200 gcagaataat atagaagaag atattaatcc aggaggccag gaaacattat tagaaaacgt 4260 ccctgtcgaa aatattgcca tgcaaagaat agttatagcc cctggagagg gtcagcggcc 4320 tttagacatg atcgaagatt gtgattcaga ggagctggcc ttcccaacta tttttgctgg 4380 cattaagaga aagtgtgcag aaagttttac aaccatagtt aggtcagaac ttagaaactt 4440 tgataggaga ggttgtagaa cagataaatt atttctcaat tttaaaaaat tggaaatgat 4500 taatataagg aacagtactt ccatttgctt acgtaagcac agcgccactc gaggcattac 4560 ggcagcacaa gtccttaatg aggagtattt agaaaattta atacgtcacg atgattgcta 4620 tagggttttg aaaaatattc gcacttcacc tgctcattgg gaaggagaaa agaagaaggt 4680 catggcgatg ataaggcaga ttggtcttcc gacatttttt ataacctttt cggcagcaga 4740 gactagatgg gcagagctgt tagttttact ttctaagaat gtagataagc tagaaatttc 4800 agaagaattg gcagctaatt taagttttgc caataaagcc cgtctaatta gagccgaccc 4860 tgttacttgc gctaggtatt ttgattttcg ttacaggcag gttttaaaac tgatgaaaaa 4920 ggaaggcggt gtttttggta gtcacagagt tattaagtat tattggcgag ttgagtttca 4980 gcaaagaggc tccccccacg ttcatggcat gttttggctg aaagatgctc cgatagttga 5040 cttggctgat gagtcaagca ttccgtctgt aaagtccttt attgatcagt ttatcacaac 5100 tgaaacaaat aacccggatg tggctccatt tttggagtat caaactcaca atcacgggca 5160 ctcatgtagg cgagaagtta gggcacaaaa tatatgtaga tttggtattc cataccctcc 5220 catgccccaa acagaaattc tattcccatt agaagaagat gctgaaaata aagaagtgca 5280 tcgcagaaat ttcgaaaaga ttcagggagt tttagaatat ccgtttgatc gagaaagtat 5340 agattacctt agcgatttca ataacttttt gacccacgat caacttaact taagttatga 5400 gtgctatata aatgcactca ggtcgagtat aaagaagccc aaaatatatt taaagcgatc 5460 attcgcagca attaagctta acgcttataa tactcacctc cttctactgc agcgcgcgaa 5520 tatagatatc cagtttatat tagatcccta tgcttgctgc agttacgtcg ttaattatat 5580 taacaagtcc cagagaggca tttcaaagct aatgcgtgaa gctgtaagtg agataagcag 5640 gggcaatgta actttaaagc aaaggttaca gcatataggg cacaagttta tttcgggtac 5700 ggaaatatcc gcccaggaag ccgtttactg ctgtttaggt atgtctttgt ccgaaagcag 5760 taacgcagtg gtttttatta atacctcatt accagagaat cgggttgggt taaggaagtc 5820 taagcaacag ttgcaaaact tacccgatga ctcagtagca atattggaaa ttggacttat 5880 agaccattat gttcagcgcc ccgacgcttt agatagttta tgcctagctg attttgctgc 5940 tttttacagc tttagcaagc ggcatagtgg tactcgccaa aatatagatg atgctgatga 6000 catcgaaaca tccagggctt cccagcaatt agtttacgca cttcgcgatg gtagtgggtt 6060 cgtaaagaag aggaacactc ctattataat taggtatagg agatttaata ttaatactga 6120 taggagcaat tattttcgag aattagttat gttgtatgtc ccatggaggg atgaaaatgt 6180 agaactagta gcaagagact gcgaatcgtt ttgcggccag aatcatgagt taattgagag 6240 caatcgcagg aaatacaata gcttaaatga cagtgagctt cagcatgctc ttgaaaatgc 6300 agtgcaggca gaaaatgaag aaaatgttga aaatcaagaa gtagtagacg atagatttcg 6360 cgtattagga ttaccagaaa taaatcctca tttaaatgtc ttaaacatta atggaaatga 6420 cgagaatcta gattcagaag ataacaatgt ccgattaata aaacttccag ctatagtttc 6480 agctgaaagc ttagcagata tagtcaggtc acttaattta aagcagaaaa cttattttac 6540 acatgtttta cacaatgcca aaaacaaatc cgtcttctac gaatttgttg gaggtggtgc 6600 gggtgtcgga aagagccgac ttattttagc tctttttcaa tctttgtccg ttatgtacaa 6660 tagtaggcca ggaagtgatc caaattctcc taaaataata ttgtgtgctc caacaggtaa 6720 agctgctttc ggaatcagtg gctcaacact gcattccatg ttttccttac cagtaaatca 6780 atctggttcc gatttgaggg ctttaagtaa tgatttagta aatactttgc attcaaggtt 6840 aatcgacctt aaactaatta ttatagatga aatatctatg gtaggctcca aaatgttttc 6900 ccagttggac tcccgtttaa aacaaatttt taaaaatact tctccattcg gtggtatttc 6960 tgtcatagtt ttcggggatc tacgacaatt atctccagta ggtgaccgtt ggattttcga 7020 atccaactct aatgatccct acagtgcaat ttatgcttca ttgctgtggg accttttcaa 7080 gtattttgaa ctcacagaaa taatgcgtca acgggaggac caaccgtttg cttctgcttt 7140 aaataatatg gcatcgggaa acatgacagc cgatgatatt agtcttgtcc gatctcgcat 7200 ttgtcagcaa tcggatgtgc ccgatgatgc cattcatcta tttagtacaa atgcggcgac 7260 agatagttat aacacgataa agctaaactc aattccgact gatcaattta tttctaccgc 7320 tgatgatttt ataaaagcaa cacatttaac tgcggcttgt agacagagaa ttttagaagc 7380 cgttagtagc tttaggtcgt ccgaaacaca gggattgtgc atgtcccttc ttctaaaaac 7440 cactgcgaaa tatatgatta cggtcaatgt agatactggg gatggattag ttaatggggc 7500 aacaggcatg ttaatgcaaa tagatttcga tgcatccagc actcccatta ttttatggat 7560 gaaattttca gatgataccg taggagtact tgctcgctca aagtgtacac atcgtacaga 7620 acctttctgg accccaattc agaaaatagt taaaagtttt caatatagaa gaaatgagca 7680 agtttctatt gaccgcagac agtttccagt tgtacctgca gaaggtataa ctattcacaa 7740 aagtcaggga gctacgtata gcaaagtggt ggttcacact tgctctcgta tgcagcgtgc 7800 tgcaatatac gtagcttgta gtagagcaac aactgcttcc gggttgttta taataggaga 7860 ttttgttcct cctaagcctc cgaaaaatga cgataaggtc caaaatgaga tagagaaatt 7920 gcggactgga aaactacttc gaacacatta tgatcactta gttaacagtt ccgccctaga 7980 cattttcttt cataatactg aaagcattca ctgtcatatt gaagatattc gatcagataa 8040 gcttatgctt cagtcgtctg ttctatgttt cgttgaggcc tcaacatatt caaatgaaaa 8100 tattgaaatt ccagggttta ctgttgttgt ccgtcttgat tgcgaggtaa taccgaatgc 8160 accgcgacct aagaggggaa tcgtcattta tattaaaaat gcaattttaa acaatatatc 8220 taattgcttt agcggtaggc aaatttctgg gtcattcgtt tttgagcatg caatatttcg 8280 ttataagtct gtatgttttt tagttattta caaaagtccg tcatatccgc ttggtttgtt 8340 taaaacagaa tttgaggttt tatttcagca atatttattt ctaagggagc gttgcattgt 8400 tttaggagac tttaatttat gcttgagcaa atttgataga tgtagggctc tttacgatga 8460 attgctcaga aagggattaa ggtgcctatt agatttaaaa acgcccacta ccaattgtga 8520 ctctcacatt gattgggctt tttcaaatct ctccagtgag gaagttagcg ctaggacact 8580 cgagacagtg tatagttacc acagtggtat ctctctaacg gttcaggata gggtaaatta 8640 agtttctata tatatgtcta tataattaca atatgtttaa tttaaactta agaatttttt 8700 tatgaatata aataaatata ttacctatat ttatatttaa aagttgtttt gtggtcttgt 8760 acttgggttc agggtatttt ctagtcgggc acccccgact agagcactct tacttg 8816 // ID Transib1_DP repbase; DNA; INV; 2983 BP. XX AC . XX DT 21-MAR-2005 (Rel. 10.03, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Transib1_DP is a family of autonomous Transib transposons - a DE consensus sequence. XX KW Transib; DNA transposon; Transposable Element; KW Interspersed repeat; transposase; Transib1_DP. XX NM Transib1_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-2983 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC Transib1_DP is a family of autonomous Transib transposons. The CC consensus sequence encodes the 698-aa Transib1_DPp transposase CC (pos. CC 270-288, 542-2616. Transib1_DP elements are characterized by 5-bp CC target site duplications and 41-bp terminal inverted repeats. XX FH Key Location/Qualifiers FT CDS join(270..288,542..2616) FT /product="Transib1_DPp" FT /translation="MDFKGPDQLEFCHATLLDVWIDNGRNSTALANWIVGQ FT LTECSLTNENISEINKKVKSLAIYISSNLQKCKRNMRTFRKKTFTENKEET FT HSLQLGRPRLTYDEAGPRLKRKLAADVAIGYKNDTSLLIHAATMSARKSYA FT NATAFLLKNVGTSESEATEAKGKLDIVEPILLTTNDALEFLLENSLSKRLY FT NEIRQISKQHNFDIYPSYQNVQEAKLQLRPAGITATETMAKVALQHLLNHT FT ASRIILLQEELFANFENVSSITLIASYGYDGSTGQSMYKKRFETKEPDTFD FT QSLFVTTLIPLKLIDEAGTIFWNNRTPQSVRFCRPLKMEFAKETKDHILAE FT KNDLDTQMKNLTPCVVTISNEKKITVSYDLHMTLIDGKVLNVLTGTKSCQL FT CSICGAGPKQFMETYDTNSFKPNPRRLHYGISPLHAWLRIFELLLKISYRM FT GFKKWQVRNENDKLEMIAKKKKHIQRRLWQEMGLHVDKPKQNGSGNSNDGN FT TARKAFSNTKLFASILELDLELLESLHTILIAINCEFAIDALKFQNFSDKT FT IRSYMLHYSWYPMSPTLHKILVHGSQIITASVMPVGCLGENASEAKNKFYK FT RDRLMHARHNSRVNNMMDVFQRSMDSSDPLISSISIKKRAARHKKQTLPRD FT VIELLETPHCNFPKNIQPLANDYSFDYDSNESELEDINPYSFELSVQNN" XX SQ Sequence 2983 BP; 1049 A; 513 C; 533 G; 888 T; 0 other; cacactgggc ggaattccca aataagtggc ataaagtcga aaattttgtc gaagcaaata 60 ttaattatat ttcaatatat tgaaagtaga atccctatat attacaaatc aataaatttt 120 gttgctaaaa atgcaaccct gtcattctga tcaaacttca aagtcagttg attagtcagc 180 tgattcctgc gacaaaattg gcgggaaatt tttattcact ttgagcgctt ggcaaattca 240 cacattgcaa aaataaaata ttgtgaaata tggattttaa aggcccgggt atgtagattg 300 aataacaaaa aacattttgt gttaaaatgt gttatatgtt atataggtta attgttttgt 360 attgtccaaa aaaaacgaaa accaaaccaa tgtgttcgtc ttgtgtggag taaaaactaa 420 ccttctgcgt gtttgtgcgt tatgtaatgt atgttaaaat aaagtagaaa gtgtgtatta 480 aaagtaagta ataatccggt ttgcggcttt ttgcggctct tgcggtactt tgaactttta 540 gatcagcttg agttctgtca tgcaacgctg ctggacgtat ggatcgacaa cggccgcaac 600 tctaccgccc ttgcaaactg gatagttggc caattgacgg aatgttcact aaccaatgaa 660 aacattagtg aaattaacaa gaaagttaaa agccttgcaa tttatatttc ttcgaatctg 720 caaaagtgca agcgaaatat gcgcacattt agaaaaaaaa cattcacaga aaataaagaa 780 gagacgcaca gtctacaatt gggtagacca aggctaactt atgatgaagc agggccccgt 840 ctaaaacgaa aactagctgc cgatgtagca atcgggtata aaaacgatac cagtctttta 900 attcatgcag caacaatgtc agccaggaaa tcatatgcaa atgctactgc atttttactt 960 aaaaatgttg gcacttctga aagcgaagct actgaagcta aaggaaagct cgatattgta 1020 gaaccaatac ttttaacaac aaatgatgct ttggaattct tgctggaaaa ttctttaagc 1080 aaacgattat ataatgaaat tcgtcaaata agcaaacagc ataattttga catataccca 1140 agctaccaaa atgtccagga agccaaattg caattgagac ctgcgggaat aactgctaca 1200 gaaactatgg caaaggttgc cttgcaacat ttgttaaacc acactgccag ccgcataata 1260 ttgttacagg aagagttatt cgctaacttt gaaaacgttt ctagtattac attaattgcg 1320 agttacggat acgatggatc aactgggcaa agtatgtata aaaaaagatt tgaaacgaag 1380 gaaccggata cttttgatca atctttgttt gttactaccc ttatacctct gaagcttatc 1440 gatgaagcag gaacaatttt ttggaataat agaactccgc agtctgtaag attttgccga 1500 ccactcaaaa tggaattcgc caaagaaaca aaagatcata ttcttgcaga aaaaaacgat 1560 ttggacacac aaatgaaaaa cttgactcct tgtgtagtaa caatttcaaa tgaaaagaag 1620 atcactgtct catacgacct tcacatgact cttatagatg ggaaagtttt aaacgtttta 1680 acgggtacaa aatcttgcca attgtgttca atatgtggtg caggcccaaa acaatttatg 1740 gaaacttatg atacaaattc ctttaagcca aatcctcggc gtcttcatta tgggataagt 1800 ccacttcatg cttggttacg catttttgaa ttgctgctta aaatatcata tagaatgggg 1860 tttaaaaaat ggcaagtaag aaacgaaaat gacaaattgg agatgattgc caaaaaaaaa 1920 aaacatatac aaagaagatt gtggcaagaa atgggattgc atgtggataa gccaaaacag 1980 aatggaagcg gaaactcgaa cgatggcaac actgcgagaa aagctttttc aaatacgaaa 2040 ttattcgcat caattttaga actcgatttg gagttacttg agagtctgca tactatttta 2100 atagctatta actgcgaatt tgcaatcgac gctttaaaat ttcaaaattt ctcagataaa 2160 accattagat catacatgtt gcattactct tggtatccta tgtcaccaac tctccataaa 2220 attttagttc atggatcaca aataataact gcatctgtga tgccggtagg ctgtcttggg 2280 gaaaatgcgt ctgaagcgaa aaataaattt tacaagcgtg accggctaat gcacgctcga 2340 cacaattcgc gggtaaacaa tatgatggat gtattccaaa gatctatgga ctcttcagat 2400 cccctcatct caagtatatc cattaagaag agggcagctc gacataaaaa acaaaccctt 2460 ccgcgagatg tcattgagct tttagaaaca ccccattgta attttccaaa aaatattcag 2520 cctttagcta atgactacag tttcgattat gactcgaatg agtcggaatt agaggatata 2580 aatccatatt cgtttgaatt atctgtacaa aacaactaaa tgataattta acatcaattc 2640 ttagaaatag ttaaatagta gaataagaat ttgatcataa taagaattgt ttatcaaaaa 2700 atttgtttta tatttttata tttaccacta ttacatcaaa attacaacag aataataaat 2760 tcccttaaaa ttgtagtact tttttatagc tctcatgact ttcctttgtg atcacgtata 2820 cgccttgttg ccatatatgg gctattgcgg taggcgtggt cagatgaact taaaattttg 2880 ttcaaatatt tagagtgcta tatttattaa atacaccaag tttcagctcg atatctgctt 2940 tattcgactt tatgccactt atttgggaat tccgcccagt gtg 2983 // ID JAM1C_AAe repbase; DNA; INV; 3425 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An RTE non-LTR retrotransposon from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; JAM1C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3425 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1441-1441 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. The consensus is ~80% identical to JAM1 and CC JAM1B_AAe. XX FH Key Location/Qualifiers FT CDS 403..3390 FT /product="JAM1C_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="KSSVQEAYNVNSDRNNRHRPRHRKRTNDWKLGSWNCK FT SLNFLGSTRILSELLRVRKFDIVALQEVCWKGSTVHTYRDGYTIYQSCGNR FT HELGTAFIVMGEMQRRVIGWWPIDNRMCKLRIKGRFFNISIINVHSPHLAS FT DDDDKDAFYAQLEREYDGCPSHDVKIVIGDLNAQVGQEEEFRPIIGKFSAH FT QLTNENGLRLIDFAASKNMAIRSTYFQHSLPYRYTWRSPQQTESQIDHVLI FT DGRHFSDIIDVRTYRGANIDSDHYLVTVKVRQRLSVVNNIRYRRPPRYNLE FT RLKQPEVATEYAQSLEAALPEEGELTEAPLEDCWSSLKAAINNAAEGAIGF FT VEANRRNGWFDEECQTVLDEKNAARAMMLQQGTRQNVERYKQKRRQQTHLF FT RDKKRRLEELECEEMEQLYRSQETRKFYKKLNASRKGFVPRAEMCRDKDGG FT ILTDEREVIERWKQHYDEHLNGAEEEDQDSRRNGFISTADEGDVPTPTIGE FT VKDAIKQLKNNKAAGKDGIGAELIKMGPDRLATCLHRLIARIWDTEQLPEE FT WKEGIIYPIYKKGDKLECENYRAITILNAAYKVLSQIIFRRLSPLASRFVG FT SYQAGFVDGRSTTDQIFMLRQILQKCREYQVPTHHLFIDFKAAYDTIDREE FT LWKIMDENGFPGKLTRLIKATMDSVQCCVKISGALSDPFETRKGLRQGDGL FT SCLLFNIALEGVMKRAGFNMRGTIFNKSSQFICFADDVDIVGRTFQVVAEQ FT YTRLKREADRVGLKVNTSKTKYLLAGGTERDRARIGRRVTIDGDEFEVVDE FT FVYLGSLITSDNNCSREIRRRIIAGSRAYYGLHKTLRSGKLHFRTKCTMYK FT TLIRPVVLYGHETWTMLEEDLQALGVFERRVLRTIFGGVCENGVWRRRMNH FT ELAQLYGEPSITKVAKAGRVRWAGHVVRMPDNNPAKMVFNSNPAGTRRRGA FT QRARWFDQVEQDLGSVGRSRNWRLAAMDRVSWRNIVAQVMS" XX SQ Sequence 3425 BP; 957 A; 779 C; 971 G; 718 T; 0 other; gggtggtaaa tgactggaca atgcgtacca ttggtacttc gcgtacctgc aggtataaaa 60 tagaccccat ttgtggtcct tagcctcttg tccagtaact cctatcccta cctccccgtg 120 gtgccgcctg ggatacgagt aaccgtaggg aagatcgggt aaccaaccct ggtggaacct 180 tggtcgtatg ctgacaggga aggggggctc ctctcttctg agggtgtagc ttatcagagc 240 gtctgttctc catgttaggg gcggctcaaa acagcgtctg ttctccatgt taggagcggc 300 tgatcatcgt cctagtgcca gcgtgggact ctaaacagtg ctgtgcacga tgatcctccg 360 gcgagacagg gggttggtgc aggccttaca agccagccgt aaaaatcatc agtacaggaa 420 gcatacaatg taaattcgga ccggaacaat cggcatagac ccaggcatcg aaaacggact 480 aacgattgga aactcggatc atggaactgt aagtctctca atttcttggg aagtacccgc 540 attctttccg aattattgag ggtccgcaag ttcgacatcg tagcgctgca ggaggtttgc 600 tggaaagggt cgacggtaca tacgtatagg gatggttata ccatctacca gagctgcggc 660 aatagacatg agctgggcac agcttttatc gtgatgggcg aaatgcagag gcgcgtgatt 720 gggtggtggc caatcgacaa cagaatgtgc aagttgagga tcaaaggccg tttcttcaat 780 atcagcataa tcaacgtgca cagccctcac ctagcaagtg acgatgacga taaggacgct 840 ttctacgcgc agctggaacg tgaatacgac ggctgcccaa gccatgatgt caaaatcgtt 900 atcggagatc tcaacgctca ggttggccag gaggaggaat ttagaccgat tatagggaag 960 ttcagcgctc accagcttac gaacgaaaac ggccttagac tgattgattt cgccgcctcc 1020 aaaaacatgg ccatacgtag tacctacttc cagcacagcc tcccgtatcg gtacacctgg 1080 agatcacccc aacagactga atcgcaaatc gaccacgttt tgattgatgg aaggcacttc 1140 tcggacatta tcgacgtcag aacctatcgc ggcgcaaaca tcgattcgga ccactacctt 1200 gtgacggtta aagtgcgcca acgactctcc gttgtgaaca acattcggta ccgacgcccg 1260 ccccggtaca atctggagcg actcaagcaa cccgaagtcg caactgaata cgcgcaaagc 1320 cttgaggcag cgttgccgga agagggagag ctcaccgaag cccctcttga ggactgctgg 1380 agtagtctca aagcagccat aaacaacgca gcggaaggtg ccattgggtt cgtggaagca 1440 aatcgacgga acggttggtt cgacgaggag tgtcagacgg ttttggacga gaagaatgca 1500 gcgcgggcga tgatgctgca gcaaggcacc cgtcaaaacg tggaacgata caaacagaag 1560 cgaagacagc aaacccatct attccgggat aaaaagcgcc gcctggaaga gttggagtgc 1620 gaagagatgg agcagttgta tcgttctcaa gaaacacgta agttctacaa gaaactaaat 1680 gcatcccgca aaggctttgt gccgcgagcc gaaatgtgcc gggataagga tggtggtatc 1740 ttgacggacg aacgtgaggt gattgaaagg tggaagcagc actacgatga acacctaaac 1800 ggcgcagagg aggaagatca agacagcagg aggaatggct tcatcagtac ggcggatgag 1860 ggagacgtgc caactcccac aataggtgaa gttaaggatg ctatcaaaca gctcaagaac 1920 aacaaagcag ctggaaagga tggtattgga gcggaactta ttaaaatggg cccggacagg 1980 ttggccactt gtctgcaccg attgatagcc aggatctggg atacagaaca gctaccggag 2040 gagtggaagg agggaataat atacccaata tacaaaaagg gtgacaagtt agaatgtgag 2100 aactatcgag cgatcaccat tcttaatgca gcctataaag tgctttccca gatcatcttc 2160 cgccgtctat cgccactggc aagcagattt gttggaagtt atcaagccgg atttgtggac 2220 gggcgatcga cgacagacca aatctttatg ttgcggcaga tcctccaaaa gtgtcgcgaa 2280 tatcaagtcc ctacgcacca cctattcatc gatttcaaag cggcctatga taccatcgac 2340 cgcgaagagc tatggaagat tatggacgag aacggttttc ccgggaaact gactagactg 2400 atcaaagcta cgatggatag tgtacagtgc tgtgtgaaga tatcgggtgc tttatcggac 2460 ccgtttgaaa cacgcaaagg acttcgacaa ggcgatggtc tttcctgcct cctgttcaat 2520 attgcgctag aaggtgttat gaaacgggcg ggcttcaaca tgcggggcac gatcttcaat 2580 aaatccagcc agttcatttg tttcgctgac gacgtggaca ttgtcggaag aacgttccag 2640 gtggttgctg aacagtatac caggctgaaa cgtgaagcag atcgggttgg attgaaggta 2700 aatacgtcga agacgaaata tctgctggct ggaggaaccg agcgcgatag agctcgcata 2760 ggcagacgcg tgacgatcga cggggatgag ttcgaggtgg tggacgaatt cgtctacctc 2820 ggatcattga taacgtcgga taacaactgc agcagagaaa ttcgaagacg tatcatcgcc 2880 ggaagtcgtg cttactatgg actccacaag accttgcggt ctggtaaact tcacttccgt 2940 actaagtgta ccatgtacaa gacgctaata agaccggtag tcctctacgg gcatgagacg 3000 tggacaatgc tcgaagagga cctgcaagcg ctaggagttt ttgaacgacg tgtgcttagg 3060 acgatcttcg gcggagtatg tgagaacggc gtatggagga gaagaatgaa ccacgagctt 3120 gcgcaactct acggtgaacc cagtatcacg aaagtcgcca aagctggaag ggtacgatgg 3180 gcgggacacg ttgtgagaat gccggacaac aatcccgcaa aaatggtgtt caactcaaat 3240 ccggccggta caagacgaag gggagcgcaa cgagctaggt ggtttgacca agtggagcag 3300 gatcttggaa gtgtggggcg atcgaggaat tggaggttag cagccatgga ccgagttagt 3360 tggcgtaaca ttgtggcgca ggtcatgtct tgaaggacgt agcgccagca aaagtaaagt 3420 aagta 3425 // ID Gypsy-72_AA-I repbase; DNA; INV; 4372 BP. XX AC supercont1.159; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-72_AA_; KW Gypsy-72_AA-LTR; Gypsy-72_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.159; Positions 467681 472052. XX CC Positions [3416-3892] - Integrase core CC 'CATG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 110..4333 FT /product="Gypsy-72_AA-I_1p" FT /translation="MSDHDKKLEEISREVAEQVISDTDASGEHSAGAVGLI FT RTLFAADQSDREQFEQLNAQDPRYKRRVSDTTIDAMDNEAVKFLVDALRSI FT SVDNVRRFDVRDVKDILVPFDSDVPTTPTAEQWIESIEKAAALYKGDDAWM FT LQCGIINLQGAAKICFTGATVTNWAEFKAKLVQDFPTSVDVVSIHQTMMSR FT KKQPGESLETYFYSHVALGRKGKLPDQATIKYIVSGLDGRYGTIAQVDTLP FT ELLKQLKWLAEVNQLKPIDTSRGSSTKVITKGEAVSGIKCYRCNGVGHVAA FT TCSEKKSTRSAANIECYRCSERGHYAKNCNRSLVKKSTPRVMQEIRQKSNY FT VKSINIDGNSVDALYDCGSAVTTIKESCRGILSNIEMCNMELIGFGGNKVQ FT VRERSSEKIELDGLFMEVTLLVVPNKVQANSVIIGQDILDRDDVRFVKERG FT SVRIEKIDDDRAQKTTLESDPLVQSSSVQPNMYIIRAYEPILPCEINVDGG FT EDERKMVLDVIQRYRHCFAKSYKEMGTAKNCEMEIELVDENPVYVKQYPLE FT YSREKIVEDTVEDLLEADIIQLSKSPYNSPTVLAKKKSGEWRMVVDYRAVN FT AKMVKDKWPMPTIEDCLNRLVGGSMFVAIDLFRGYHQIPLSLDSRKYTAFS FT TPIGHYEYKKMPFGLSNGIAVFQRMIDEVIAPLRRKGFVAYLDDVTFGGKD FT IKEVMGKFELFLNALSENGLTVNLEKTQFLKKKVNLLGYEISEGEVKPGLE FT KICAIKDFPIPTSVRNVREFLGLAYYFRRFVKKFSILVEPLTRLTKKDVEF FT QWKTDQQSAFERLKVVLVSRPVIVMYDPKREIELHTDACSHGLSGILLQRM FT DDGLHPISYFSRKTSSTECNKYSYELEALAVVESVERFRKYVLGRHVKIVT FT DCEAVKKTMAKKQMLPVVGKWLLKLLDYDYEFEHRKGEKMKHADCLSRNPV FT EDFVQEEPEPAMASVMQINIGEDELLKLLQREDEHLYGLMNDLSAPPVDNR FT GRQLHKEYVLQDEGVMRRCNDGLKWVVPTRARWRIARCYHDELGHKGVDKV FT YEAVRRNFWFRRMKNYLKKYIDSCVYCAYMKKKPGHKEGLLHPIEKIPVPF FT DTIHVDHLGPFVRSTLQNEHIIVLVDGFTKYVVLKAVRSTKTKPVIQMLSD FT VFATFGKPRRIITDRGTAYTSKDFEEFCAQLGVTHVKVAVGTPRANGQVER FT ENRNILHSVRCMVKHDDKSWDKQLRLIQWGLNTMVNDTTKVSPHSLLFSYN FT PRDIMQNQLLMMFSVETPLGHSEELNRVVRARIEKEQQRQKMYFDKHRRAA FT RRYQEGDLVLVENDVQATGQSKKLEPRYKGPFIIQKAVGNDRYLIADVPGI FT KLSQKRTSTIFAAERMKPWAVNAAIETSDDEDANNEDYLSDSNED" XX SQ Sequence 4372 BP; 1337 A; 761 C; 1195 G; 1079 T; 0 other; attctgtaga caggattctt acaaattgcg cgttaaagca aagtcgcgaa atatcgtttt 60 cgcggtggca aatgagccaa ttggatggtt tcggaatttg ccgattgtga tgagtgatca 120 cgataagaag ttggaagaga tttccagaga agttgctgaa caagtgatta gtgataccga 180 cgcgagtgga gaacacagcg caggggctgt ggggttaata agaactttgt ttgctgcaga 240 ccagtccgat cgcgaacaat ttgaacagtt gaacgcgcaa gatccacgat ataagcgacg 300 tgtaagcgat actacgattg acgctatgga taatgaagca gtaaagtttt tggtcgatgc 360 attaagaagt atatctgtgg ataatgtgcg ccggttcgat gtgcgcgatg taaaggacat 420 cttggtacca tttgactctg atgttcccac gacgccaacg gcagaacaat ggatcgagtc 480 tattgaaaag gctgccgctt tgtataaggg tgatgatgca tggatgctac aatgtggcat 540 cataaatctc caaggagcag ctaaaatttg tttcactggc gctacggtga caaattgggc 600 ggaatttaaa gcgaaattgg tacaagattt tccgacctcg gtggacgtag tgagtattca 660 ccaaaccatg atgagcagga agaagcagcc tggagaaagt ttggagacgt atttctacag 720 ccatgtagcg ctgggaagaa aaggaaagtt acccgatcag gccaccatca agtacatagt 780 atcgggtctt gatggacgtt atgggactat cgcgcaggtc gatacactgc cggagcttct 840 aaagcaactg aagtggttgg ccgaagtaaa ccaactaaag ccgatagaca catcacgtgg 900 atcgtcgacg aaagtgataa cgaaagggga agctgtgagt ggtataaaat gctaccgctg 960 caatggtgta ggtcatgtag cagcaacgtg tagtgagaag aagagtactc gctcagctgc 1020 gaatatagag tgctatcgct gtagtgaaag ggggcattat gcgaaaaatt gtaacagatc 1080 tcttgtcaag aaatcgacgc ctagagtgat gcaagagatt cgccaaaaaa gcaattacgt 1140 taaatcaatt aatattgatg gtaacagcgt ggatgctttg tatgattgtg gctctgcagt 1200 taccacgata aaggaaagct gtcgtggaat tttgagcaat attgaaatgt gcaacatgga 1260 gttgattgga tttggtggaa acaaagtgca agtgagagaa cgtagttccg aaaaaataga 1320 actggacgga ctatttatgg aagtaacatt gctagttgta ccgaacaaag tgcaagcgaa 1380 ttcagtgata attgggcaag atatcctgga tcgagacgac gtgcggttcg ttaaagaaag 1440 gggaagtgtc cgaatagaga agattgacga tgatagagct caaaagacta cgcttgagag 1500 tgatccgttg gtgcaatcca gcagtgtgca gccaaatatg tatattattc gtgcatatga 1560 gcctatattg ccatgtgaga taaatgtgga tggtggtgag gatgagcgca agatggtatt 1620 ggacgtaata cagcgatatc gccactgctt tgcgaagagc tataaagaga tgggaacagc 1680 taaaaattgt gaaatggaaa ttgagttggt agatgaaaat ccagtgtacg ttaagcaata 1740 tccgctagaa tactctcgtg aaaaaattgt tgaagacaca gttgaagact tactggaggc 1800 agatattatc cagttgtcca agtccccgta caatagcccg actgtgctgg cgaagaaaaa 1860 aagtggtgag tggcgtatgg ttgtagatta ccgggcagtt aatgcgaaga tggttaaaga 1920 taagtggccg atgccaacca ttgaagattg cttgaataga ctggtgggtg gaagtatgtt 1980 cgtggcgatc gatctgttcc gcggatatca ccagatcccg ttatcgttgg atagccggaa 2040 atacaccgcc ttttccacgc cgataggcca ttacgagtac aaaaaaatgc cattcggact 2100 gtccaatggc attgcggtgt tccagcgaat gatcgatgaa gtgattgcgc cactccgtcg 2160 aaaaggattc gtagcatacc tggatgatgt aacctttggg ggaaaagata tcaaggaggt 2220 gatggggaag tttgagctgt ttctgaatgc gctgtcggag aatggattaa cagtgaatct 2280 ggagaaaaca caattcctga aaaagaaagt taatctgttg ggttacgaaa tcagtgaagg 2340 tgaagtgaag ccaggattgg agaagatttg tgctattaag gattttccaa tcccaactag 2400 tgtccgaaat gtaagagaat ttctaggctt ggcatattat tttagacgtt ttgtgaagaa 2460 gtttagcatt ctggttgaac cgctaacacg gctgacaaaa aaggacgttg agttccagtg 2520 gaaaactgac cagcaatcgg cttttgagag actaaaagtg gtgttagtaa gccgaccagt 2580 gattgtgatg tatgacccaa agcgagagat tgagttacat acggatgctt gctcacacgg 2640 tttgtcgggt atactattgc agagaatgga cgacggtcta cacccgatta gttactttag 2700 tcgtaaaaca tcatcaacgg agtgtaataa atacagctat gagctggaag ctttagctgt 2760 tgtggaatcg gtagaacgtt ttagaaagta tgtgttgggc cggcatgtga aaatcgtaac 2820 tgactgtgag gcagtaaaga agactatggc taagaagcag atgttacccg tggttgggaa 2880 gtggctgtta aaattattgg actacgacta cgaatttgag caccgcaagg gagagaagat 2940 gaagcatgct gactgtttga gtcgtaatcc agtcgaagat ttcgtgcaag aagaaccgga 3000 gcctgcaatg gctagtgtga tgcagataaa catcggtgaa gatgagttgt taaagctttt 3060 gcagagagaa gatgaacact tgtatggtct aatgaacgat ttgtcggccc caccggtcga 3120 taaccgtggc aggcaattgc acaaggagta cgtcctgcag gatgaaggtg taatgcgacg 3180 ctgcaatgac gggctgaagt gggtggttcc aacacgtgcg cgatggagaa tcgcacgttg 3240 ctatcatgac gagttaggcc acaaaggtgt cgataaggta tatgaagcag ttcgtagaaa 3300 tttttggttt cgccgaatga agaactattt gaagaagtat atcgattcgt gtgtttactg 3360 tgcgtacatg aagaaaaaac ctggacacaa ggaagggctg ttacatccaa tagagaaaat 3420 accagtgcct ttcgatacaa ttcatgtgga tcacctcggt ccgtttgtac gctctactct 3480 ccaaaacgag catataattg tattggttga tggatttact aaatacgtgg tgttgaaagc 3540 cgtccgaagc actaagacca aaccagtgat acaaatgttg agtgacgtgt ttgctacatt 3600 tggtaaacct aggcgaataa tcaccgatag aggaacggca tacacatcga aggattttga 3660 agaattttgc gcgcagttgg gggtcacaca cgtaaaagta gcggttggaa ctccccgtgc 3720 taacggtcag gttgagcgtg agaaccgaaa tattctgcac tcagtgagat gtatggtgaa 3780 acatgatgat aagagttggg acaaacaact ccggctaatt cagtggggat taaataccat 3840 ggtgaacgac acgacgaagg tatctccaca ttcgctactg tttagttaca acccgcgaga 3900 tatcatgcaa aaccaattgc taatgatgtt ctcggtcgaa acacctctgg gtcactctga 3960 ggaactcaat cgtgttgtac gtgcaagaat tgagaaagag cagcaacggc aaaagatgta 4020 tttcgacaaa catcggcggg cggcaagacg ttatcaggaa ggtgatttag tgctggtgga 4080 aaacgacgtt caagctactg gacagagcaa gaagctggag ccacgctaca aaggaccgtt 4140 catcattcaa aaggctgtcg gtaacgatcg atatttgatt gccgatgtgc caggaattaa 4200 actaagccag aagcgtacgt caactatttt tgcggccgag aggatgaaac cgtgggcggt 4260 aaatgctgcc atcgagacta gtgatgatga agatgccaac aacgaagact atttgagtga 4320 ttcgaatgaa gattagttat cgccgggacg ggataagggc tgtagtgtcc gt 4372 // ID Copia-20_SI-LTR repbase; DNA; INV; 264 BP. XX AC AEAQ01023503; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_SI_; KW Copia-20_SI-I; Copia-20_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-264 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023503; Positions 2518 2255. XX SQ Sequence 264 BP; 62 A; 67 C; 50 G; 85 T; 0 other; tgttggagat gaatgttcct cgccgtagtt atctgttgtg atggcactac ttcatagagg 60 cttagatcga cgggtcgata tgtttcacgc tgcgcgcctg cgcccgagca agtctctctt 120 gcttcgtctc cttacttacg gacacacaca cacgctgctg agtgcacgtt aaaaacctct 180 ttgtatatca ttgccaaata tatattctct gtaccaataa tcgtgtttct cactctgcac 240 taccaaagtt aacaattttt atca 264 // ID MSAT-3_AAe repbase; DNA; INV; 276 BP. XX AC . XX DT 11-APR-2011 (Rel. 16.04, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Minisatellite-type sequence: consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-276 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1449-1449 (2011). XX DR [1] (Consensus) XX CC 23-bp unit. XX SQ Sequence 276 BP; 84 A; 60 C; 60 G; 72 T; 0 other; acagaatctc gtctagagat tcgacagaat ctcgtctaga gattcgacag aatctcgtct 60 agagattcga cagaatctcg tctagagatt cgacagaatc tcgtctagag attcgacaga 120 atctcgtcta gagattcgac agaatctcgt ctagagattc gacagaatct cgtctagaga 180 ttcgacagaa tctcgtctag agattcgaca gaatctcgtc tagagattcg acagaatctc 240 gtctagagat tcgacagaat ctcgtctaga gattcg 276 // ID Gypsy-1_AA-I repbase; DNA; INV; 5335 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_AA_; KW Gypsy-1_AA-LTR; Gypsy-1_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5335 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 971-971 (2011). XX DR [2] (Consensus) XX CC Positions [4338-4820] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 384..5177 FT /product="Gypsy-1_AA-I_1p" FT /translation="MEGVNNDYSWLSFPTSQPMMPRIHRVQFASTPRSTVG FT ENDGNYHVGQPRQQANNNLPDGDDRAPQQAEPSVASIPEAAEISEARGQGI FT SGGSENNTAELLERNIESQNVLVDMIRTLNRRIEALERNVVDSRPGPSEQP FT ALGPRNERPSSAAMHSEGRCDPTILHAAPTFVNNPPEPENFMRFTAPNFTA FT TVPPRLPQAYSTAFGDPTYAIYHPIGAPASRFAGLATGFSTSGHALNAGWP FT GGSVPTGDNGLQPSFFQPMQGPHGIGSKHNTGTGGMAPTSLNEGQTSRIGL FT FPRPTFRGENDDRHPIEFLTEFDRFCYHQGLVELEKLPVVLSCLTGEAKVW FT ARGFEYLFRDYTTFVHHFKENFWGQTAQRQVSEKITRGNYVQRNGSRMAEY FT FLGIVAKARYLTPPPTERELIQHLSRHFPRNVQHLLGSCMDIGSGYNILQS FT EDRFINGSLRYNNGLARENFNQPNWRNREAPSVRSSRGAPNLNVGGIAGTN FT NVRMAMVEEEESVEGIDPAVLTVFVGREELLESESDTKVKENRKFKRSSYI FT TVDLEGMQFNALVDSGCELSCISIDLYDQLKKANITMPELPMKTISLRGAF FT GKKSKNVNLQVLLEISIGQRTFDVPFVVVRELINSMILGEDFQDNYAVWIH FT QDEKKIRLLHNNVELWVDFSVGGKRSRCIEHVVLCETMINFDGEEDRQLEQ FT DAAQISSLVLTEEERQVWKELQKEFEDIFSDQKPGLTSEYCHEILVEDDTP FT FNQKQYPIPFSRVSKVDEKLHLMEQWGVISRSATAYVNPLVTTVKKNGDVR FT VCLDARRLNKVLVKDHEKPPNIQEILQKFHGSKYFSTIDLTASYWQIPIKH FT EHRKYTGFMHNSKTYVFNVLPFGLNTAVASFSRAMDLILGPEILEFVEKYI FT DDLMVHSATFSEHITHLRRLFLRLRDAGLKVSISKSTFFQEQVDFLGHIIG FT QNGISIDPAKTAAIRNFPTPTSVQDLRSFMGLASYVARFVPNFAMKARPLY FT DLLQNHRTWSWGTSQARAFLELKDLFLEHTMLNHPNFNQQFVVQTDSSIRG FT IGAVLLQKDEFDNNRIISYASRALKPAEANYTVTELEALAIIWALDKWRQY FT LLGRPIKIMTDHKALIFLKQCKLLNGRLTRWILFLQQYDFEIEHVKGSTNT FT IADVLSRYPPGTRSQEVKDDESLFTIARLSASERELRASMKDLEELQDQDL FT QIKKIKQSFSNTDVVTKGKNRYKMGDKILFMRTIKDGDWRIWLPKALIPNL FT LKVYHEEKGHFGVYKTTNSIKRSVYWRGMRKDIKNFIKQCLVCQLAKNQNF FT KLVGKCRSILPREKNQIISVDLYGPLPIGTAGVRHIFVILDVFTKFVTLCS FT IRKPTAPVLWRRIEKYIKDHGKPEALLCDQGTQFTSKVWAANCELQGIRLI FT HTCVRHPQANPVERVMREISRLCRTYCQENHKTWANKIGNFAELLNAVCHE FT STNCSPYELQFGVKPTDTLRNLIKYPSGDAACVNLEAVRATMRRKANARNS FT KAPPRTKVFRVGELVLLRANPMSSESEGEIKKFFMIYEGPYKVEKICQDDV FT YILTHPHHGKERGRFHVSSLKPYFGPNGPPQ" XX SQ Sequence 5335 BP; 1629 A; 1107 C; 1271 G; 1328 T; 0 other; tctgggggct caaccgggat cagaagccat cccgaaatat atctttctcg gttaagacgc 60 gacgacgacg ccagcaatcc attgttcgcg gtggctaata taaacgatat ccatgtgaga 120 taagggagaa cgtggatggg tccaagtgaa tagcaaggag tggtcacatc taaaacagtt 180 cggcgaggtg gtgagaaagt gagattgttc gggtggagat attcagaagc gatgagtgaa 240 acagcagcat gtgctggagt gcttttcaaa tgtcggtgtt agtggcctat gagatgagtt 300 gaaaatagtt atcgtttgga agagagtatt aggaaattta agttagtgat agtgttgcat 360 cagaatatcg tagagataca cagatggaag gggtaaataa cgattattcg tggttgagct 420 ttccaacaag tcaacccatg atgccgcgga ttcatagagt gcaatttgct tcaacgcctc 480 gatccacagt gggcgagaat gatggaaatt accatgttgg acaaccccga cagcaagcaa 540 ataataattt gcctgatggg gacgatcgag ctccacagca ggcggaaccg agtgttgcgt 600 caattcccga agcggctgaa atatcagagg cacgtgggca aggaatatct ggaggatcag 660 aaaataatac tgcggagctt ttggagagaa acatagagag ccaaaatgtg ttggttgata 720 tgattagaac attaaaccgt cgtattgagg ctctggaaag aaatgtagtt gacagtagac 780 ctggacctag tgagcaacca gcgttaggac cacgcaacga aagaccatcg tcagcggcaa 840 tgcattcaga aggaagatgc gatccaacaa tacttcacgc agcaccaacg tttgtcaaca 900 acccgccgga accagaaaat tttatgagat ttacggcacc aaactttacc gctactgtac 960 caccaagact acctcaggct tacagtacag cttttggaga tccaacgtat gctatatatc 1020 atccgatcgg agctcccgct agtagatttg cagggcttgc gactggattt tcaactagcg 1080 gacatgctct aaatgcagga tggccaggtg gtagcgtacc aacaggagat aatggattgc 1140 aaccaagttt ttttcaacca atgcagggcc ctcacggaat cgggtcaaaa cacaacaccg 1200 ggaccggcgg tatggccccc acctctctca atgaaggaca aacctcccgg attggacttt 1260 ttccgagacc tacctttcga ggtgaaaatg acgaccgcca tccaatagaa tttctcacag 1320 aattcgatcg cttctgttac caccaaggat tagttgaact agaaaagctt cctgtagtgt 1380 tatcttgtct aaccggagaa gctaaggttt gggcacgagg atttgagtac ttgttccgag 1440 attacaccac gttcgtacac catttcaaag aaaatttttg gggacaaacg gcgcagcgtc 1500 aggtttccga gaaaattacc cgaggaaatt atgtacaaag gaatggatcg cgaatggcag 1560 aatacttttt gggaattgtg gctaaggctc gatatttaac accgccacct acagaaaggg 1620 aacttatcca acatttatca cgacactttc cgcgaaatgt tcagcatttg ttgggatcat 1680 gtatggatat tgggtctggc tacaacattc tgcaatcaga ggatcgattc atcaatggat 1740 cgttacgata caacaatggg cttgcgagag aaaattttaa tcaaccaaac tggcgaaatc 1800 gagaagctcc ctccgtaaga agctcgagag gagcgcctaa tctgaatgta ggtgggatcg 1860 ctggtaccaa caacgtaaga atggctatgg tggaagagga ggagtctgtc gaaggaatag 1920 accctgcagt attaacagtg tttgtcggtc gtgaagaact tctagaatca gaaagtgaca 1980 caaaagtcaa agaaaacagg aagttcaaac gatcctccta cattactgta gacctggagg 2040 gtatgcaatt taatgcactc gttgacagcg gatgcgagtt atcttgcatc tcgatcgatc 2100 tatacgatca attgaagaag gctaatatta ctatgccaga gttgccaatg aaaactatct 2160 ctttacgcgg agcatttggg aagaaaagca aaaacgtcaa tttgcaagtc ttgttggaaa 2220 taagtattgg ccaacgtacg tttgacgttc cttttgtcgt cgttcgagag ttgatcaact 2280 ccatgatcct aggagaggat tttcaagata actatgctgt ttggatacac caagatgaaa 2340 agaagattcg gttattacat aataacgtcg agctgtgggt ggacttcagt gtaggtggta 2400 agagatcgcg ctgtattgaa catgttgtat tatgtgaaac catgataaac tttgatggtg 2460 aggaggatag acaattggag caagatgccg ctcagatcag ctccttggta ctcactgaag 2520 aagaaaggca ggtctggaaa gagctacaga aggagtttga ggatatattt tcagatcaga 2580 aacctggtct gaccagcgag tactgccacg aaattttggt ggaagatgac acgcctttca 2640 accagaaaca gtatcctatt cctttctccc gggtcagcaa ggtcgatgag aaattgcatc 2700 taatggagca gtggggtgtg atttcccgta gcgcaacggc ctatgtgaat cctcttgtta 2760 ccaccgttaa gaaaaacgga gatgtcagag tgtgcctaga tgcgcgcaga cttaacaagg 2820 tgctagtcaa agaccatgaa aagccaccca atatacagga gattttgcaa aaattccatg 2880 gctctaagta cttttccaca atcgatttga ccgcctctta ctggcaaatt cctataaaac 2940 atgagcatcg gaaatatact ggattcatgc ataattcaaa aacatacgtt ttcaacgtct 3000 taccatttgg gttgaacacg gcagtcgcca gcttttctag agcgatggac ctcatccttg 3060 gtcctgagat attggagttc gtagaaaagt acatagatga tttgatggta cattcggcta 3120 ctttttcgga gcacataact catctacgcc gtctcttttt gcgacttaga gatgctggcc 3180 tcaaagttag catatctaaa tccacttttt tccaagagca agtggatttt ttggggcata 3240 taataggaca gaatggtatt tctattgatc ccgctaaaac agcagcgata cgaaatttcc 3300 caactccaac atctgtacaa gatcttcgca gcttcatggg actagccagt tacgttgcac 3360 gttttgttcc caatttcgct atgaaagcca gaccccttta cgacctgctc caaaatcatc 3420 gaacatggag ttggggtaca agccaagccc gagcattcct ggagttgaaa gatttgttcc 3480 tcgaacatac catgttgaac catccgaact tcaatcaaca gtttgttgta caaacagaca 3540 gttctattag aggcatagga gctgttttgt tgcagaagga tgagtttgat aataatcgaa 3600 ttatttctta tgccagtcgg gccctcaaac ccgctgaagc aaattacacc gttacggaat 3660 tggaagcgct ggctattatt tgggcgctag ataaatggcg acaatatctt ttagggcggc 3720 caattaaaat aatgactgat cataaggcgc ttatttttct aaaacaatgt aagttgctca 3780 acggacggtt gactcgctgg atactatttt tgcaacaata cgactttgag atcgagcatg 3840 tcaagggttc aaccaacacc atcgcagacg tgctttcacg gtatcctccg ggtacgcgaa 3900 gccaggaagt aaaagatgac gaatcattat tcacgatagc ccgattgtca gcatccgaac 3960 gagagttgcg tgcctccatg aaagatctgg aagaacttca agatcaggat ctacagatta 4020 agaaaatcaa gcaaagcttt tccaataccg acgtggtaac gaaaggaaag aaccggtaca 4080 aaatggggga taaaatacta ttcatgcgca ccattaaaga tggagattgg cgaatctggt 4140 taccaaaagc attgattccg aacttgctca aagtctacca tgaggagaag ggtcatttcg 4200 gcgtctataa aaccacaaat tccatcaaac gatcagttta ttggagagga atgcgtaagg 4260 atattaaaaa cttcatcaag caatgtttgg tttgccagct tgccaaaaac cagaatttca 4320 aactggtagg caaatgtcgg agtatccttc cacgagaaaa gaaccaaatc atttcagtgg 4380 atctatatgg tccacttccc atcggcaccg caggcgtgag acacatcttt gttattttgg 4440 atgttttcac caagtttgta acgctttgta gcatcagaaa acccacagcc cctgtactgt 4500 ggcggcgtat tgagaaatat atcaaagacc atgggaaacc agaagcgcta ctttgtgatc 4560 agggcaccca gtttacgtcg aaagtttggg cagcaaactg tgaacttcaa ggaataagac 4620 tgattcacac ttgcgtacgt cacccacaag ccaacccggt cgaaagagtt atgcgagaga 4680 tttcacgctt gtgccgcact tactgccagg aaaaccacaa aacatgggcc aacaagatag 4740 gaaattttgc cgaattgctg aacgcggtat gccacgaatc aacaaattgt tcgccgtatg 4800 agctgcagtt tggagtgaag cctacagata ctcttcgaaa cctgattaaa tacccatccg 4860 gggatgctgc gtgcgtaaat ttggaagccg taagagcaac tatgagaaga aaagccaatg 4920 ctcgaaatag taaggcccca ccccgaacaa aggtgtttcg cgttggtgaa ctggtacttc 4980 ttcgagccaa tccgatgtct tctgaatcgg aaggagagat taaaaagttt ttcatgatct 5040 acgagggacc atataaagtt gaaaagattt gccaggacga cgtatatatc ttaacccatc 5100 cacaccatgg aaaggaaaga ggacggtttc atgttagctc actgaagccg tacttcggcc 5160 caaatgggcc acctcagtaa agaaaaattt atcagactag gagtggctga aaccgcgaaa 5220 ctgcagaatg tgccattcct agtgatcgct ggaggggggt gttgtaacgg accaccctaa 5280 cagatggagc tgagtttgat cagcatagtg gaaagagagg cagtttactc cttat 5335 // ID Gypsy4-I_AP repbase; DNA; INV; 4863 BP. XX AC Contig7751; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4AP; KW Gypsy4-I_AP; Gypsy4-LTR_AP. XX NM Gypsy4-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4863 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 443-443 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [3714-4187] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 279..3059 FT /product="Gypsy4-I_AP_1p" FT /translation="MGNSDPNNPNGEEFNANQLVRALLEQNRLREEQMDKI FT INKLMDQRSVTQQSSPMLPIISQTVPVFNGETGDTDVAKEWLNALKAVALI FT NKWSDTCTLETGRSHLDGAAKNWYLSHMSELDSFAKFTTAFEETFTSKESV FT TETWKRMNERVQKRDETVFAYFHDKVRMCRRLGLSAIEIKKMICVGLYSRE FT LSSALLSNGHTLETELLADIRMFNDVDLNRSERFRNTSQPKKGPKPITEKE FT QASTKPRGVSKSVDTVPPRDGPRCYNCQMFGHVARDCTQPRKPFKCGKCKQ FT EGHSSKYCQSGKTDVNLVSTRPKNSAVLYIKEVRINDHKDLIHGLVDTGSA FT VSIIKKSIAERFGLTINTKKINMWVYGKAQPVISKGETEANICVDEVKEHV FT KLVVVNDELQQYDIIIGRSFTELDNVTFIKTSDQLIFGYGMKFPYQETDVP FT GQIVHRHAVRIKCETEIMPAESAKVVEVLADNETIEVMIINDGESELRLHR FT GDHVGVLREQHPTRQSIIAPRTPITAKMVQHGPDFNQEEVNELVKLLNDYR FT CCFAFNLGELGCTTEIQMDIVDNGQPVVCKPYRASASERETISRIVREWKD FT EGIVTETKSSYASPVLLVSKKDGDARLVVDYRKLNSQTVRKVFPTPNLDEH FT LEQLHGAKMFTTLDLASGYLQVPLTEAAKEKTAFITPSESGQFERMVFGLI FT NAPYEFSRLMQRVLHPLKDKVAMWYLDDILIPSTSFKDMLHRLQQVLDALK FT GAKLTLKLSKCYFGYPEVTYLGFMLSVEGVRPGEQKIVAVKEFPKPNNKHE FT VRRFLSLCGFFRRFIPHYAMLAQPMSDLLKDSVSFEWTTTQEAAFNELKNR FT LISKPVFQVFNPSAETELHCDASSMGLSGMLLQRGRDKKLHLVHAVSKKTT FT SAERNYHSSKQELMAVVWSMSRL" FT CDS 3516..4694 FT /product="Gypsy4-I_AP_2p" FT /translation="MRKGIVIRFHDLAGHFAVDRTVNKIKERYYFPRMRRY FT VQVHIRCCPECVLTKIPRGRQPGELHPIKPGKRPFEIINIDHVGPFIKSTK FT GNSYILVLIDNLTKYVQLFPVKSCGTEGVIHSLQQFIVTFGIPKQIISDRG FT TAFTSKAFDAFCTQYGVRHILNSVRHPQSNGQVERVNSTLVPVMQSTMENN FT RTWDKTVKDIECHLNNAHNKTIGDTPFHVLYGYYPSFKDGTLRHATTVETW FT DSSTQIQAKVRERIAREHDMWKLRYDSKHVKPVQYQVGDVVFIRRPTEATG FT ESTKLQVKYRGPLVITKVLPNDVYRVSELHTEDKRCYSTTVHVTHIKGYNI FT PEDEEEPNETESIPTEEEEDQQEESTTNHQQVEKSKRDRRPPGWHATYQM" XX SQ Sequence 4863 BP; 1649 A; 894 C; 1131 G; 1189 T; 0 other; tttcagaagt gggatttgac cgtaaatccg cctaaagtcg agtgtgttcg aaccgccgag 60 tcacgaggtt ctagcatcta gttacgaaca taacctcaaa gtagagtaac gttaaccact 120 aaccgtcaac ctgataacgt tagagaacgt aatgattgcg gtatcataag ttgttgtgta 180 atcaagaaga tagagataaa tgataagtaa acaaaccgcg tgtgtgtatc gtcatcgcgt 240 agagaattgt gatctgtgat attttcgaaa acaatacaat ggggaatagc gatccaaaca 300 acccgaacgg ggaggagttc aatgcgaacc aacttgtgcg tgcgcttctt gagcagaaca 360 ggctgcgaga agagcagatg gacaaaatca ttaataagtt gatggatcag agaagcgtga 420 cacaacagtc ttcaccgatg ctaccaatca tttcgcaaac ggtaccagta tttaatggag 480 aaaccggtga tactgacgta gctaaggaat ggcttaacgc gcttaaagcc gttgcattaa 540 taaacaaatg gtctgataca tgcacactgg aaactggtcg ctcgcatttg gatggagccg 600 ccaaaaattg gtatttaagt catatgagtg agttagatag ttttgcgaag tttacgactg 660 cgtttgaaga aacattcact agtaaagaaa gtgtaaccga gacctggaag agaatgaatg 720 aacgagtaca aaaacgggat gaaacggtat ttgcgtactt ccatgacaag gtcagaatgt 780 gtaggcgatt aggattatca gccatagaga taaagaaaat gatttgtgtt ggactatatt 840 cgcgcgaatt atcatccgca ttacttagca atggacacac acttgagact gaattattgg 900 ctgatatccg aatgttcaac gacgtggatc tcaaccgaag tgagaggttt cgcaacacgt 960 cacaaccaaa aaaaggtcca aaacctataa ctgagaagga acaagcgtcg acgaagccca 1020 ggggagtcag taaatcggta gacaccgtac ccccacgtga cggtcctcgc tgctataatt 1080 gtcaaatgtt tggacacgtg gcccgagact gcactcaacc acgtaaacca tttaagtgtg 1140 gtaaatgtaa gcaagaaggg cacagtagta agtactgtca aagcggaaaa actgatgtta 1200 atttagtcag tactcgacca aagaatagtg cagtacttta tattaaggaa gtacgaataa 1260 atgatcacaa agacctgata cacggattgg tggatacagg tagtgcggtt tcaataataa 1320 agaaatctat tgctgaacga tttggtctga caatcaacac caagaaaata aacatgtggg 1380 tatatggtaa ggctcaaccg gtaattagta aaggcgaaac tgaagcgaac atatgtgttg 1440 atgaagtgaa ggaacatgtc aagttggtcg tagttaatga tgaattacaa caatacgaca 1500 taataatcgg acgatccttt accgaattgg ataacgtcac ctttattaag acgagcgatc 1560 agctgatttt cggctatggt atgaaatttc cgtatcaaga aactgatgtt ccaggacaga 1620 tagtacatcg acatgctgtg agaataaaat gtgaaaccga gataatgcca gcggagagtg 1680 ccaaggtagt agaagtactt gctgacaacg aaacaatcga agtaatgatt ataaatgatg 1740 gtgaaagcga gttgaggtta caccgcggcg atcatgttgg ggtactgcgt gaacaacatc 1800 cgacaagaca atcaattatt gcaccgagaa caccaataac tgcaaaaatg gtgcagcatg 1860 gtccagactt caatcaggag gaagtgaatg aactggttaa attattaaat gattatcgtt 1920 gttgttttgc gtttaattta ggtgaactgg gctgtaccac agagatccag atggacatag 1980 tcgacaacgg acaaccggtg gtgtgcaaac cgtatcgtgc cagtgcttcg gaacgcgaaa 2040 ctataagtcg tatagtacgt gagtggaagg acgagggcat agtaaccgaa actaaatctt 2100 catacgcgtc acctgtgcta ttggtcagta aaaaggatgg tgacgctaga cttgttgtgg 2160 actaccggaa gttaaattca caaactgtgc gtaaggtatt cccaactcct aatctggatg 2220 aacatcttga gcagctgcat ggtgccaaaa tgtttacgac tttggatctt gcgtcaggat 2280 acctccaagt tccacttact gaagcggcaa aagaaaagac agcattcatc acacccagtg 2340 aatctggaca attcgagaga atggtgtttg gcctaatcaa cgccccatat gagttttcga 2400 ggttgatgca acgagtcctg caccctttga aggacaaggt agcaatgtgg tatcttgatg 2460 acattcttat accatctaca tcgtttaagg atatgttaca tcgattgcaa caagtgctgg 2520 atgcactgaa aggggcaaaa ttaacactca agttaagtaa gtgttacttt ggctatccag 2580 aggtaacata tctcgggttc atgctgtcag ttgaaggcgt acggccagga gaacaaaaga 2640 ttgtggcagt taaggagttc ccaaagccga ataacaaaca tgaagtacgt cggttcctca 2700 gtttgtgtgg gttcttccga agatttatac cacattatgc gatgctagca caacctatga 2760 gcgatttact aaaggatagt gtgtcatttg agtggacaac gacacaagaa gcagcgttca 2820 atgaactgaa gaacaggttg atcagcaagc ctgtatttca agtgtttaac ccaagtgcag 2880 aaacggaact acactgcgat gcaagtagta tgggtctaag cgggatgtta ctacaacgag 2940 gcagagataa aaaattacat ttagtacacg cggtatctaa gaagacaact tcagcagaac 3000 ggaactacca ctcgagtaag caagaattga tggctgtagt atggagtatg agtagactat 3060 gaccatacct gatcggaatt aagttcttag tgattacaga ttgccaagca atcgttcacc 3120 tcaacactca gaaaaccctg aatccacaga tagcacgatg ggcaacattg ttgagtgaat 3180 ataactttga catcaagcat cgcccaggtg cgaaaatgaa tcatattgat gctcttagtc 3240 gggccccagc tagtatgtct caagatacag aaacagaact attggatgaa catttagagg 3300 tgtttatcac aatgaccgaa gaagagcagg tgatatcaat gcaacgcacc gacaccagac 3360 ttaagggaat aatggagatt cttagtcgag aaccgtcggg acgttctgca gttgataatg 3420 aaattgttaa aaactatcac atggaaaagg ggatattata tagaacagtt atcgtcgaag 3480 gtgaagctag acagctgtga gtagtaccca atgctatgag aaaaggaata gtcataagat 3540 ttcatgatct agcaggtcac tttgcagtag accgaactgt gaacaaaatc aaagagcgtt 3600 actattttcc acggatgcgt cggtatgtgc aggtgcatat caggtgctgt cctgagtgtg 3660 tgttaaccaa aatacctcga ggtagacaac caggtgagct gcaccccata aaaccgggta 3720 agcgtccgtt cgagataatt aatatagacc acgtagggcc ctttatcaaa tcaacaaagg 3780 gaaacagtta tatcttagtg ttaattgata atttaacaaa gtatgtgcag ttatttccag 3840 tcaaaagttg cggtacagag ggagtgatcc atagccttca acaatttata gtgacgtttg 3900 gtataccgaa acaaataatt agtgacagag ggacagcatt tacatctaaa gcatttgacg 3960 ctttctgtac acagtatggt gtgagacaca tactaaattc ggtgagacat ccacaatcga 4020 atggtcaggt ggaacgtgta aacagtacac tagttccagt tatgcaatca acgatggaga 4080 acaacagaac ctgggataaa acagtcaagg atatagagtg tcatttaaac aacgcgcata 4140 acaaaacaat aggtgatacc ccatttcatg tactttacgg atattatcct agttttaaag 4200 atggaacgtt gcgccacgct acaacagttg agacgtggga ctcaagtaca cagattcaag 4260 ccaaagtaag agagcgcata gccagagaac atgacatgtg gaaactaaga tatgattcta 4320 aacatgtcaa accagtgcaa tatcaagtcg gagacgtggt gttcatccgc cgacctactg 4380 aagctacagg agaatcaact aagctacaag tgaagtatcg aggaccattg gtgatcacta 4440 aggtgctacc aaatgatgta tatagagtgt ccgaactaca tactgaggat aagaggtgtt 4500 actcgaccac tgttcatgtg acacatatca agggttacaa tattccagag gatgaagagg 4560 aaccaaatga gaccgagtca ataccaacgg aggaggagga agaccaacaa gaagaatcaa 4620 ccacaaatca ccagcaagtg gaaaaatcca aaagagatag aagaccacca gggtggcatg 4680 caacatacca gatgtaaata taatcactat gaacagaaag aactgtctta attgttacta 4740 ctcactaggt tatataaata ttgtaatatt gttatgttaa gagtttaatg aaaatttgtt 4800 tttttttaaa ttgatggaaa tgttaatatt aagtggggac ccttaattgt caagataccc 4860 gaa 4863 // ID Gypsy-36_DPu-I repbase; DNA; INV; 4609 BP. XX AC scaffold_66; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_DP_; KW Gypsy-36_DPu-LTR; Gypsy-36_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4609 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_66; Positions 413697 418305. XX CC Positions [3539-4000] - Integrase core CC 'GTAAC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 423..1427 FT /product="Gypsy-36_DPu-I_2p" FT /translation="MPKKPFNPYGSPPPFDFDEYKDAFELWQRQWTIFLAL FT STIDTALPQADRVSYKANILLSCLSKPTLQAICTMGLSDIELGDPDIIIDK FT LRERCNAGRNCHVWRQQFALRIQREKESVDSWVCELRDLAQKCEFNTDCCN FT KCQDTRLLGQIIYGVNSDEVRRKLLQEGATLSLNNALTTLRTAEATLMQAA FT NLRKNDQPSIQQIKNNTAKTDPKSAARSPHPAFQRNFRRGPPAGAPPHGCW FT NCGADSFHPKSECPANGKECGGCGIIDPRQREVSSSDPFYQGNLLKPASRP FT PTATRPQQCSSCPTRAPRLMPYHMHFTNRDSPRLHFAPSQREP" FT CDS 1181..4462 FT /product="Gypsy-36_DPu-I_1p" FT /translation="MPSQWQRMWRLWHNRPETAGSIVIGSILPGELIETSV FT SPADSDTPTAVLFLPDTGASIDAVPHALHQSRFAATALCPITTRAVTATGD FT AITSFGTFQATISVGPVNGGTPVLTTIHVLKNLQQPVLSKETQKKLGILPL FT DYPHWRVQQITAHPTDLEKANKLKNLMAAHPLIFDGICHPMVGPPCHFQLA FT DGAIPSAMRGSRPVSVPLLPKLKTELDSLEEQQIICKVTEPTAWVHPMVIA FT PKKDGGIRVCVDFTPLNAFIIRPTFETATLFQAVRSIPAGMRFFTVIDALK FT GYHQVPLDDESATMTTFSTPFGHYMYRRLPFGVCHAGDDYSRRVSAVFDDI FT PNCRRIIEDILIFSETYEEHISLVQQVFQRAADNQIAINTGKIQFAQPSVL FT FGGYTLNNTGFQPNPELLSAISRFPTPTNITEMRAFHGLCQQMGNFSDNLA FT AALAPLSPLLKKNFQWEWTTQHDAAFNAARSELSSSAELSFYNPALPTVLH FT VDASRLRGLGFILRQQNPDGKWNVVQAGSRFLSEAETRYAMIELEWLGAAW FT AMQKCRQFLEGLPSFLLLTDHRPLIPILNDYNLDKLDNPRILRLRLKMQRY FT AFTARWIPGKENLMADALSRSPIRPTYPVDELAEGPTSFSTRVALISTIDG FT SDPCTSDPLLDKVSAAAAVDPVMIALRDVIRTGFPNEKCNLPLPLRPFWCV FT RSLLAIDKDDDIIVMGCRVVIPEAVRREVLKNLILMHQGATKLRQRARQTM FT YWPSMDSEIITAARACPTWTEHLPSHPPEPLLPHQKASRPFEFIHADLGCH FT NGRDFLISADQFSGWPHVVPFPDKNTSARKVVDAVRSFFAFGAGAPIKFWS FT DGGPQFTADEFQSFLREWGISHGTSSAGYPQSNGFAEVSVKSMKKLIRGSW FT SAGSFDMDQFSKGLLLYRNAPLSGGASPAQLVFNRPVRDCLPAHRRAFAPQ FT WQKSAVQLEKQALRARTLRAVHFNQRAHPLLPFSIGDAVVIQNHLTKRWST FT PGIIVETGPFRDYLVKTPAGRLFRRNRHLLRRHYPLATPAPSVHGPPDRPA FT QPSTTSTPNSPTPDGPATPTAEAPRRSARIRNRSNKN" XX SQ Sequence 4609 BP; 1075 A; 1499 C; 1006 G; 1029 T; 0 other; tggcgcagtt gattatcgtc cccacccctc ctcccactcg tgaaccgtga ctgtgtcttt 60 tgtaagttga ctaaacattt tcatatcgcc ttgtgaagct ctcgtgcaca ctctacctac 120 cgactcccac cacgtggttg cgcacggcac taccagcctc ccactttaac tcaaccgtaa 180 ccacacggca ctaccccgtg tttcgccatt tacactgtat acccttacct gaaatgtgga 240 tgtgatttat attacccttt gtgttgtgac gcatcacagg ctacatcgta ctcaaactca 300 gccatatccc gcagcacatt tttcatccgc cattttgttt ttgtgtgatc ggtttccttc 360 tttgcgcgtc cccgcctgcg gtcattctcc gtgttgggcg catcagatct tcgtcgtcca 420 ccatgcccaa gaagccattt aatccctacg gctccccgcc accgttcgat tttgacgaat 480 ataaggacgc ctttgagctc tggcagcgcc agtggactat atttttggcc ttatctacca 540 tcgatacagc ccttcctcaa gcggaccgcg tgtcctataa ggcaaacatc ctgctgtcct 600 gcttatcgaa gcctacttta caggccatct gcacgatggg cctgtctgat atcgaacttg 660 gtgaccccga tattatcatc gacaaacttc gtgaacggtg taacgctggg aggaattgcc 720 acgtttggcg ccaacaattt gcccttcgta tccaacgtga aaaagaatcc gtggacagtt 780 gggtttgcga gctgcgcgat ctcgcacaaa aatgcgaatt caacacggat tgttgcaaca 840 aatgccagga cacccggttg ttgggccaaa tcatttacgg cgtcaacagt gatgaagttc 900 ggcgtaagtt gttgcaggaa ggcgctacac tatccctcaa caacgcttta acaacactgc 960 gcacggccga agcaacacta atgcaggcag ctaacctgcg gaaaaacgat caaccatcta 1020 tccagcaaat caaaaacaat accgccaaaa ccgacccaaa gtccgctgct cgttctccgc 1080 acccagcttt tcaacgcaac ttccggcgtg gcccaccagc aggtgcccca ccacacggat 1140 gttggaactg cggagccgat tcgttccacc caaagtctga atgcccagcc aatggcaaag 1200 aatgtggcgg ctgtggcata atagacccga gacagcggga agtatcgtca tcggatccat 1260 tttaccaggg gaacttattg aaaccagcgt ctcgcccgcc gacagcgaca cgcccacagc 1320 agtgctcttc ctgcccgaca cgggcgcctc gattgatgcc gtaccacatg cacttcacca 1380 atcgcgattc gccgcgactg cactttgccc catcacaacg cgagccgtaa cggcgactgg 1440 cgatgccata acgtcatttg gcaccttcca ggcaaccatt agcgtaggac ccgtcaacgg 1500 aggcacgcca gtacttacca ccattcacgt acttaaaaat ctccagcaac cagttctatc 1560 aaaagaaacg cagaaaaaac tcggcatcct acccctggat tacccccatt ggcgcgttca 1620 gcaaatcaca gcccatccta cggatttaga gaaagctaat aagctgaaaa atttaatggc 1680 tgcccatccc ctcatcttcg acggtatatg tcatcccatg gtaggccctc catgccactt 1740 ccagttagcc gacggtgcca taccatccgc gatgagaggt tcgcgaccag tctccgtccc 1800 gttgctccca aaactcaaaa cagagctgga ttcgctcgag gagcaacaaa tcatctgcaa 1860 ggtcaccgag ccgacagcat gggtccaccc tatggtcatc gctccgaaaa aggacggcgg 1920 catacgcgta tgcgtcgatt ttacccccct taacgccttc atcatccggc caacattcga 1980 aacggccacc ctgttccagg ccgtccgctc gattccggcc gggatgcgct ttttcaccgt 2040 catcgacgcg ttaaagggat accaccaggt gccattagac gacgagtcag cgacgatgac 2100 caccttctct accccgttcg gccactacat gtaccgccgg ttgcctttcg gagtttgcca 2160 cgccggcgac gactacagcc gacgcgtttc ggctgtattc gatgatattc caaattgccg 2220 tcggataatt gaagacatcc tcatattctc cgagacatac gaggagcaca tttctctcgt 2280 ccagcaagtg ttccagcgtg ctgccgacaa ccaaatcgcc atcaacaccg gtaaaatcca 2340 gtttgcccag ccgtccgttc tgtttggcgg ctataccctg aacaacaccg gcttccagcc 2400 caacccggaa ctcttatcgg ccatcagtcg atttcctacc cccaccaaca tcacggagat 2460 gagagcattc cacggtctat gccagcagat gggcaatttc tccgacaact tagcagcggc 2520 actcgccccc ctttcccccc tcctaaaaaa gaatttccag tgggaatgga caacacagca 2580 cgacgccgcc ttcaacgctg cccgctccga attgtcgtct tccgccgagc tgtcctttta 2640 caacccagca ctacccacgg tcctccatgt cgatgcctcc cgcctgagag gtcttggttt 2700 catcctccgt caacaaaatc ccgatgggaa gtggaacgtg gtccaggccg gatcacgttt 2760 tctatccgaa gcggagacac gctacgccat gatcgagctg gagtggctgg gtgcagcctg 2820 ggccatgcaa aagtgccggc agtttttgga aggcctccca tcatttctcc tcctcacgga 2880 tcatcggccg ctcatcccga tcttaaacga ttacaacttg gacaaactcg acaacccacg 2940 gatcctgcgc ctacgcctaa aaatgcagcg ttatgcattc acagcccgct ggatccctgg 3000 aaaggagaat ctaatggccg acgccctctc gcgttcgcct atccgcccca catatccagt 3060 ggatgaactg gcggaaggcc caacatcatt ctccacccgt gtggcgctga tatccaccat 3120 cgacggatca gacccctgca cgagtgaccc gctcctcgac aaggtgtcag ctgccgccgc 3180 ggttgatcct gtcatgattg ctttacgtga cgtcatccga accggttttc ccaacgagaa 3240 atgcaactta ccgctgcccc ttcgtccttt ctggtgcgtc agaagcctcc tcgccattga 3300 caaagacgac gatatcatcg tgatgggctg ccgtgtcgtc attccagagg ccgttcgtcg 3360 tgaggtcctc aaaaatctta tcttgatgca ccagggcgca acaaagttac ggcaacgggc 3420 ccgtcaaacc atgtattggc cgtccatgga cagcgagatc atcaccgcgg cccgtgcttg 3480 tcccacttgg actgaacatc tcccgtcaca cccgccagaa cctttgctgc cccaccagaa 3540 agcatcgcgg ccgttcgagt ttatccatgc cgacctcgga tgccacaacg gtcgcgattt 3600 tttaattagc gcagaccaat tcagtggatg gccccacgtc gtcccatttc cggacaaaaa 3660 cacctctgca cgcaaggtgg tcgacgctgt ccgctccttc tttgcctttg gcgcgggtgc 3720 accaatcaaa ttttggtcgg acggcgggcc ccagttcact gcagacgaat ttcagtcctt 3780 ccttcgtgaa tggggcatca gccacggcac ttcctcggca gggtacccgc aatctaacgg 3840 ctttgcggag gtttccgtaa agagcatgaa aaagctcatc cgcggatcgt ggtcggcggg 3900 ctccttcgac atggaccaat tcagcaaagg cctcctactc taccgcaacg caccactgtc 3960 tggtggagct tcacccgccc agctagtttt taatcggccc gtcagagatt gccttccagc 4020 ccaccgccgc gcattcgcac cccaatggca aaaatcggcc gtacaacttg agaagcaggc 4080 cttgcgcgcc cgcactcttc gtgccgttca tttcaaccag cgcgcgcacc ctctccttcc 4140 gttttccatc ggcgatgccg tcgtcatcca gaaccactta actaaacgct ggtcaacccc 4200 tggtatcatc gtcgaaaccg gcccattccg cgattattta gtcaagaccc cagctggtcg 4260 gctcttccgc cgcaatcgtc accttttgcg tcgtcactac cctttggcga caccagcacc 4320 atcggttcat ggcccaccgg atcgcccagc acaaccttca acgacttcca cccccaattc 4380 gccgacgccg gatggtccag ccacaccgac agccgaagcc ccccgtcgct ctgcccgaat 4440 ccgcaacaga tccaacaaaa attaactcgc attatgttta tcaaccccta agtttctttc 4500 tttccatcct ctgttataat gtcatgcttt ccccatttgc catgtcagaa cccagcccgt 4560 aatcaaatca caatgcaaaa gaaaaaaaaa aatccgaaaa caaagaaca 4609 // ID Gypsy-12_TCa-I repbase; DNA; INV; 3726 BP. XX AC chrUn_5; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_TCa_; KW Gypsy-12_TCa-LTR; Gypsy-12_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-3726 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_5; Positions 417350 421075. XX CC Positions [1802-2308] - Reverse transcriptase CC 'AATC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 156..1346 FT /product="Gypsy-12_TCa-I_2p" FT /translation="MNGQVDENIHNQDNNEDRDHQDREERPEEGGAGEGVP FT LRIEDIIAVTVRTMQNQRDYSVRQNDIINLIPEFAGTEEEDVLLWLNRIRA FT VRRNYGPNYNELLLAAISKLSGFAKKWFDSRPEHLDLSLAELMEMISTFGN FT REDRLSCMKRFEARKWRRNEKFLEYYQDKLLLGNQLQLSEQEILKYMIDGL FT DNCILQTQAKMARFTSLSEFMKVMNEISGSERQAGANLGGSSGSSRTCGRQ FT FTTQTKASSAVSGGSHRVMLGAHVRCFNCNITGHKATECRRPRREPGTCFE FT CGRGDHRIRDCPVAANKKRTTVQAQADSTTNVIDSVSIAPYTISTHVTTKD FT KYDTICNYHLNCLVDTGSPISLIKSRYVTDINFNESLKNEEFSGINGSKLT FT ILQ" FT CDS 1406..3466 FT /product="Gypsy-12_TCa-I_1p" FT /translation="MTFSVILGRDFLSFQNLKISFGKTVQIDCVENREQFS FT EQTTFHEILHINIDDDSKEPKFDLNVNPCLPLHIKLKLNEMYEESYRNHSN FT SVGLEPVSDFEMNIALKNDQPIAFRPRRLSFFEKEKLRVILNDLEKDGVIR FT PSKSPYCSPIVLVRKKNGEIRLCVDYRELNKITVRDNYPTPLIDDHLDMLK FT NKRYFSCLDLKNGFHHVRMAPSSIKYTSFVTPLGQYEFLKMPFGLTNAPHV FT FQRYIHEVFSDLINSHKILVYLDDIMIATEDLDEHLSLLREVFDLAATHKL FT EFRLDKCMFLNSETIYLGYLINEKGIRPNPENVKAMVEYPIPRNPKELHRF FT VGLASYFRRFIPQFSIIAKPLYDLLRKNAVFQFDADALNAFETLKSKLISA FT PVLAIYSPSAETELHCDASSYGFGAVLAQRQDSGKFHPIFYFSQRTTDTES FT RYHSFELECLATIYAIRRFHVYLLGIPFKIITDCNSFRLTLSKKDINPRIS FT RWALFLENDNYQVVHRSADKMAHVDALSRCNSILILEENLFEQTLNNTYSR FT SVGNTPSVLLFGVNQTDPYDSLPSYFESQNELDRTIEDIRADAKLKNSKVQ FT ESSKIAYDRKHKPPKQYTIGDFVVIKNIDVTPGTNKKLLPKFRGPYQVKKV FT LGNDRYLVTDVEDFQLTQIPFEGICSPEHMRPWLRDYN" XX SQ Sequence 3726 BP; 1154 A; 641 C; 770 G; 1161 T; 0 other; acttcagaag tgggatagaa aagtggattc agtgatgtta gtagtgaagc agaaatacag 60 tgatatgcta gattttgaat aaccacgagg tgctacgatg gttttgctcg tcgtccattt 120 ttaattgtga aggaaaaacg aaaaacacct gaaaaatgaa cggacaagtc gacgaaaaca 180 ttcataacca agacaataac gaagaccgtg accatcagga tcgcgaagaa cgtcccgaag 240 aaggaggagc gggtgaaggt gtacctttga gaattgaaga catcatagca gttaccgtaa 300 gaactatgca aaatcagcga gactatagcg tcagacaaaa tgatattatt aaccttatac 360 ccgagtttgc tggaaccgaa gaagaagatg tgttgttgtg gctaaatcgc attcgtgctg 420 ttaggaggaa ttatggtccg aattataatg agttattgtt ggctgcgatt agcaaactct 480 caggatttgc gaagaaatgg ttcgactcga ggccagaaca tttggattta agtcttgccg 540 aattgatgga aatgatttca acttttggaa acagggaaga cagattgagt tgtatgaaga 600 gatttgaagc gagaaagtgg agaagaaacg agaaattttt ggaatattac caagacaaac 660 ttctattggg caatcagttg cagttgagcg aacaagaaat tttgaagtac atgattgatg 720 gtctggataa ttgcatactc caaacgcagg cgaagatggc aagatttact tctttgtccg 780 agttcatgaa agtgatgaat gaaatttcgg gtagtgagag acaggcaggt gcaaatttgg 840 gtggtagctc tggttcctca agaacatgtg gaaggcaatt cacaactcaa acgaaagcgt 900 catcagcagt cagtggtggc agtcataggg tgatgttggg ggcacatgtg cgttgtttca 960 attgcaatat tacagggcac aaagctactg aatgccgcag accaagaaga gaaccaggaa 1020 catgtttcga atgtggacga ggagatcata gaatacgaga ctgtccagtt gcagcaaata 1080 agaagcgtac caccgtacag gcgcaagcag attccaccac caacgtcatt gactcagtat 1140 cgatagcgcc ctatactatc tcgacacacg ttacaaccaa agacaagtat gatacaattt 1200 gtaattacca tttaaattgt cttgttgaca cgggaagccc cattagttta attaagtcta 1260 ggtatgttac agacattaac tttaacgaat ctcttaagaa tgaggaattc tctggtatta 1320 acggatccaa attgactatc ctccagtaat tactgtaaat tgcgttgaag ttcctataaa 1380 attttacgta gtgcctgaca ataccatgac ttttagcgtc atacttggta gagactttct 1440 ttcctttcag aatttaaaaa tttcattcgg taaaacagtt caaattgatt gtgttgagaa 1500 tcgtgagcaa ttttctgaac aaacaacttt tcatgaaata ctccatatca atattgacga 1560 tgactcaaaa gagcctaaat ttgatttgaa tgtcaatcca tgtttaccct tacatattaa 1620 attgaaactc aatgaaatgt acgaggagag ttacaggaat cattctaatt cggtaggttt 1680 ggagcctgta tcagattttg aaatgaatat cgcactcaaa aatgaccaac ctattgcgtt 1740 tcgacctcga cgattatctt tttttgagaa ggagaagctt cgcgtcattt tgaacgacct 1800 tgagaaagac ggtgtaattc gccctagtaa atcaccttat tgtagcccga tagttctagt 1860 acgtaaaaag aatggagaaa ttaggctttg cgtggattat cgagaactca ataaaataac 1920 tgtgcgtgac aactatccta cccctctgat tgatgatcat ttggatatgt tgaaaaataa 1980 aaggtatttt agttgccttg atttaaaaaa tggttttcat catgtccgca tggctccatc 2040 ctctataaag tatacatctt ttgttacgcc cttgggccag tacgagtttt tgaagatgcc 2100 attcggtctt actaatgcgc cacatgtttt tcaaagatat attcatgagg ttttttcgga 2160 tttgattaat agtcataaga ttcttgtata tcttgacgac ataatgatag caaccgagga 2220 tttggacgaa catctttcgc ttctgcgtga ggtttttgat cttgctgcta cacataaatt 2280 ggaatttcgg ttagataaat gcatgttctt gaacagcgag actatttact taggatattt 2340 aataaacgaa aagggtatcc ggccaaaccc agaaaacgtt aaggcgatgg ttgaatatcc 2400 aatacctcga aatcctaaag aattacatag gtttgtcgga cttgctagtt attttcggcg 2460 tttcattcct cagttttcta ttatagctaa acctctatat gacttacttc ggaaaaatgc 2520 agtctttcaa tttgatgcag acgctttaaa cgcttttgaa actttgaaat ccaaattaat 2580 ctcggctccc gtattagcta tctattctcc ttcagccgaa acagaattac attgtgatgc 2640 gagttcttat gggtttggag cagtactcgc ccaacggcag gattcaggaa agtttcatcc 2700 aattttctat tttagtcaga ggacaaccga taccgaatcg cgttaccata gctttgagtt 2760 agaatgctta gccacaattt acgctattcg ccgatttcat gtttatcttt tgggtattcc 2820 ctttaaaatt ataactgact gtaatagttt tcgattaact ttaagtaaaa aggatataaa 2880 cccccgaatt tcgcgttggg cattgttttt agagaatgat aattaccaag ttgttcatcg 2940 atccgcagat aaaatggctc atgtggatgc cttaagtcgt tgtaacagta tacttatcct 3000 ggaagagaac ttgttcgaac aaactttgaa taatacgtac agtcgttcgg taggaaacac 3060 tccttctgtt ttgctttttg gtgttaatca gactgaccct tatgattctc ttccgtctta 3120 ttttgaatca cagaatgagt tagacagaac cattgaagat attagagctg atgcgaaact 3180 taaaaatagt aaggttcaag aatccagtaa gattgcttat gatcggaaac acaaaccccc 3240 aaagcaatac accattggtg attttgttgt tatcaaaaat attgatgtta ctcctggaac 3300 gaataagaaa cttttgccaa aattccgagg tccttatcaa gtaaagaaag ttttgggtaa 3360 tgacaggtat ctagttactg atgttgaaga ttttcagtta actcaaattc cttttgaagg 3420 catttgtagt cctgagcata tgagaccttg gcttagagac tataattagt cataacatta 3480 attattgaag tcatttgaga atttctgttc ctttttttgt ttgttttcag attatgcccg 3540 attatgtcct tgttatgtct tatgttttat gtcgggattt actcttgaat ttctttttga 3600 ttattgtttt aagatcagtt atgttattat gtgaactctt atggttatta tgtgtacctt 3660 ttaagttatg agcttattat actgatgcga tgatcgagga cgatcatccc gtcaggacag 3720 ccgagt 3726 // ID SAL_CL repbase; DNA; INV; 651 BP. XX AC X95597; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE C.luridus Sal(lur) tandem repeat. XX KW SAL_CL; Sal(lur) element; tandem repeat. XX OS Chironomus luridus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RA Ross R., Hankeln T. and Schmidt R.E.; RT "Complex evolution of tandem-repetitive DNA in Chironomus RT luridus."; RL Unpublished. XX DR Genbank; X95597; Positions 1 651. XX SQ Sequence 651 BP; 231 A; 95 C; 107 G; 218 T; 0 other; gtcgacttac taaaaccgtc gtaacttttt attgtggaac tatattggac aatgtttata 60 cagttttaaa aactacaagc caagagctac atgctagtat ggttctaccc cctagaactc 120 ggcgatctaa gacacttgcg gcatgtttga agatattttt cctagaattt tcacacatat 180 cctaccatat cagtaccaaa actataaagt ttgagccttt cgaaaagctt gtaaagaact 240 tttggttctt attcttcgaa atcgaaaaat aagaacgagc ttttttctgt ttgccaacat 300 tggattgtaa tataattaag catttttgtg gttctaaaat gctaaaatat tgtattgtca 360 tcgggaatac acatgtaaaa aaaatttcag ctcaattgga gtagtagagt ggtcgaaaaa 420 tcaatcgcaa gatttgacca agaaagacag aaagacagaa agacaagaaa gcaagttaag 480 aaaaacgtgg taataatgct tataaagata aaatttcgat tttttttgaa aattcttcac 540 agtttgtttt taaatgctat ttgtgaatgg aaaacatcaa aaaacatgtt tgaagttatt 600 aatttgtatt ttttataaga aattttagtt tctgcctata tattcgtcga c 651 // ID Copia-126_AA-I repbase; DNA; INV; 4284 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-126_AA_; KW Copia-126_AA-LTR; Ty1_copia_Ele160; Copia-126_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4284 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1390-1926] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 76..4266 FT /product="Copia-126_AA-I_1p" FT /translation="MNGGGNGLASSGLPAIERLVGRENWHTWKFALRTFLE FT VEELWEAVEPIPDEQGNIPAVNAAKDRKARGKIILGLDPTLYVHVQDTKTA FT SEAWKKLEKVFEDSGLTRKWGLLHKLITTNLDSCGSMEAYVTRMVGTAHQL FT NGVGLKIDEEWLGMLLLAGLPETYKPMVMALENSGAAITGDVVKTKLLQEV FT QQSSTTVDPAFVSKGPGRSKHSGSENFSTKGPKCRCCGRFGHIARFCPSKD FT STKKGVKKGDAFCTILSVAAMEEAADWYFDSGSCRHLTRNESLLSDVHKAD FT GQIFAANKGVMKVVAEGSVALHPRCSESSIDVNGVQLIPELAANLLSVSKI FT VDRGHTVIFRQKGREVINPEGKRIATGHRSNGLFKLDQQPQSALACQQMKS FT IDSWHKRTGHLSIKGLKRLKNGLASGINFVESESGDCKTCAIGRQARLPFS FT KEGRRANEILELVHSDICGPMEEVSLGGSRYYITFIDDKTRRMFVYFLRTK FT SEEEVVEAFKAFYSMAERQTGKKLKVLRTDNGKEFKNRGFENLLRSLGICH FT QTTVEYTPEQNGMAERANRTIVEKARCMLHEANLPKTFWAEATATAVYIIN FT RSPSKGIRTTPEEAWSGQKPDLSNLRIFGTTTMSHIPKQKRRKWDPKAEEC FT ILIGFDEESKAYRLYNKKTKKVFKSRDVTFINEGRSAVCQDSQNKVQDNEA FT DRGANRRQPTMVRLDFEDCETQEVLEVNRTNPPEQDDDSSSEDNYDSADDS FT VTIPSALPSRTVSNPPESHEASRRSGRERFIPGKYKQFHIKCKGLPASKFS FT DGASGTSEDMTGEVNSDLADNPGYAQHSASPSYPNQRPVEDVQQGLMLSDR FT GRFPQGKYCRYNSEDSCTAYDSPQCIADDPVTPEEALSRDDAELWKSAMQE FT EYEAQMRNGTWTLTDLPAGRKAIRSKWVYKTKLDAEGRPARHKARLVIKGY FT SQRKGVDYEETYSPVVRHSSLRYLFAIAARLNLDIDQMDAVTAFLQGELSE FT EIYMEQPAYFQDTQNRTKVCRLNKALYGLKQSSRVWNHKLDSALKKFGLIS FT TDYDPCVYYKIVGNKVLFVAIYVDDVLIFTNCRQWRKKLKDDLAKEFLMKD FT IGPAKHVLGMRITRSQGKISIDQEAYVESILDRFGMSKSNPVATPLNPNDK FT LTKEMQPIKDDEAERMKRVPYKEAVGCLMYLAQCTRPDICHAVNVLCRFNE FT NPGEKHWNAVKHLLRYLRGTSKFRLTYKKEADPTITGYSDADWATSSEDRK FT SITGFVFIAQGGAISWCCKRQQTVALSTCEAEYMALSAAVQEALWWKRLRA FT TFDIDEAVKINCDNQSTIAVAKNGGYYPRTKHIDIRHCFIRDAVQRGDVDI FT VYISTDKQLADCLTKSLPKPKIELQRAAMGIQSQ" XX SQ Sequence 4284 BP; 1237 A; 975 C; 1163 G; 909 T; 0 other; ataggttatg ggcccagaag tggtaagagg ttgaaaattc gaagcttcca gaactcaaga 60 ttttttcaac tgaagatgaa cggaggtggt aacggacttg cttcttccgg acttccagca 120 atcgaacggc tggtaggccg cgaaaactgg catacctgga agtttgcgct gcggactttt 180 ctagaagttg aagagttgtg ggaagctgtg gaacctattc cggatgagca aggaaacata 240 ccagcagtca acgccgccaa ggacagaaaa gctcgaggaa aaatcattct tggcctggat 300 ccaacgctgt acgtccatgt tcaagacaca aaaacggcca gtgaagcttg gaagaagctg 360 gaaaaggtat tcgaagacag cggcctcacg cgaaaatggg gacttctgca caagctgatc 420 acgaccaacc tggacagctg cgggtcgatg gaggcgtatg tgacgaggat ggttggtaca 480 gcccaccaac tgaatggagt cggcctgaag atcgacgaag aatggctcgg aatgctgctt 540 ctcgctggat taccggaaac ctacaagccc atggtcatgg cattagagaa ttctggagct 600 gcgataactg gcgatgtagt gaaaacgaag ctattgcagg aggttcagca gtcatcgaca 660 accgtcgatc cagcgttcgt gtccaagggg ccaggaagaa gcaagcatag cggatctgaa 720 aatttctcga cgaagggtcc gaaatgccgt tgctgtggaa gattcgggca tatcgctcga 780 ttttgccctt cgaaggattc aacgaagaaa ggtgtcaaaa agggtgacgc tttttgcaca 840 attttgtcgg ttgctgcgat ggaagaagcc gcagactggt acttcgattc gggatcctgt 900 cgccatctca caaggaatga gtcgttgcta agcgacgtac acaaggccga tggacaaatt 960 ttcgctgcga acaagggtgt gatgaaggta gtcgcggaag gctccgtggc attgcatcca 1020 aggtgttcgg aatcatcgat cgacgtcaac ggagtgcaac tgataccgga actggccgcc 1080 aatttgctct cggtgagtaa aatcgtcgac agaggtcata cggttatttt caggcagaaa 1140 ggtcgcgagg taatcaaccc agagggaaaa cggatcgcaa ccggacatcg atcaaatgga 1200 ctttttaaac tggatcagca acctcagagt gcattagcgt gccagcaaat gaagtcgatt 1260 gattcgtggc acaagcgaac gggacatctg tcgatcaaag gtctgaaacg attgaaaaat 1320 ggattagcgt caggaatcaa ttttgtcgaa tcggaatctg gtgactgcaa gacttgcgct 1380 attggtaggc aagccaggtt gccgttcagt aaggaaggaa gaagggccaa tgaaattctc 1440 gagttggtcc actccgacat atgcggtccc atggaggagg tttctcttgg agggagccgc 1500 tattacatca ctttcatcga cgacaaaacc cgcaggatgt ttgtgtactt tttacgtacg 1560 aaatccgagg aagaagtagt tgaagcgttc aaggcgttct actccatggc tgaacgacag 1620 acaggtaaaa agttgaaagt tttgaggaca gacaacggaa aggagttcaa gaaccgaggc 1680 tttgagaatc tactacgaag tttgggaatt tgccatcaga caaccgtgga atatacaccc 1740 gagcaaaacg ggatggcaga acgtgcaaac agaacgattg tcgagaaagc ccgttgcatg 1800 ttgcatgaag caaacttacc gaagacgttt tgggcggaag ctactgccac cgccgtgtac 1860 atcatcaacc gttcaccgtc gaaaggaatt cgcacgactc ccgaagaagc ctggtcgggg 1920 cagaaaccag atctgtccaa cctgcggatt ttcggaacaa caacgatgtc gcacattcca 1980 aaacagaagc gacgaaagtg ggacccgaag gcggaagaat gcattttgat cggttttgat 2040 gaagaaagca aagcataccg gctctacaac aagaaaacga aaaaggtgtt caaatccagg 2100 gacgtgacgt tcatcaacga aggaagatca gcagtatgtc aagactcaca gaacaaggtt 2160 caagataatg aagccgaccg aggagcgaac cgacgacagc caacgatggt ccggctggat 2220 tttgaggatt gcgaaaccca agaagtgctg gaagtgaacc gtacgaatcc acccgaacaa 2280 gatgacgata gtagctccga agacaactac gatagtgctg atgactctgt gacgatcccc 2340 tctgcgctcc catcgcgaac agtttcaaat ccgcccgagt cacacgaggc gtcgaggcga 2400 agcggtaggg agcgcttcat ccctggcaag tataagcaat tccatattaa gtgcaaaggt 2460 ctacctgctt cgaagttttc agatggtgca tccggtacga gcgaagacat gaccggtgag 2520 gtcaacagcg atttggcaga caatcctgga tacgctcaac attctgcgtc cccatcgtac 2580 cccaatcaaa gaccagttga agatgttcaa caagggttga tgctcagcga cagggggcgc 2640 ttcccccaag gcaagtattg ccgttacaat tctgaagaca gttgtaccgc gtatgattct 2700 ccacagtgca ttgccgatga tcccgttact ccggaagaag cattgtcccg agacgacgcc 2760 gagctctgga agtccgcgat gcaggaggag tatgaggcac aaatgagaaa tggaacctgg 2820 actctgacgg acctcccggc aggaaggaag gccattcgca gtaaatgggt ttataaaacc 2880 aagctcgacg cagaaggaag accggctcgc cacaaggccc ggctggtaat caagggatac 2940 tcgcagcgaa agggggtaga ctatgaagaa acctactctc ctgtggtacg tcatagttcc 3000 ctgcgatatc tctttgcgat agctgcccgt ttgaatctcg atatcgatca aatggatgct 3060 gtcactgctt tcctccaggg agaactgtcc gaagaaatct atatggaaca accagcctac 3120 ttccaagaca cccagaacag gaccaaggtt tgccggctca ataaggctct ctatggatta 3180 aaacaatcga gccgcgtgtg gaaccataag ttggactccg cattgaaaaa gtttggactg 3240 atttcaaccg actacgaccc ctgcgtgtat tacaagatcg tcggcaacaa agttttgttc 3300 gtggcaatct acgttgacga tgtactcatc ttcacgaact gtcgtcaatg gcgtaagaag 3360 ctgaaggatg atctggccaa ggaattcctt atgaaagaca tcggtccggc gaaacacgta 3420 ctgggaatgc gtatcacgag aagccaagga aagatttcta tcgatcaaga agcgtacgtt 3480 gagagcatac tcgatcggtt cggaatgtcg aaaagcaatc cggtggctac tccactgaac 3540 ccgaacgaca aactcacgaa ggaaatgcaa ccaatcaaag atgacgaggc cgagcgaatg 3600 aagcgagttc cgtacaagga ggctgtagga tgtcttatgt acctagcgca gtgtacacgt 3660 cccgacatct gccatgccgt caatgtcctg tgtcgattca atgagaatcc tggcgaaaaa 3720 cactggaatg ccgtcaagca tcttctgagg tatttgcgag gtacttcgaa attccgtttg 3780 acctacaaga aagaagcgga tccgacgatc accgggtact ccgatgctga ttgggctacg 3840 agcagtgagg atcggaagtc cattactgga ttcgttttca tcgcgcaagg tggggcaatt 3900 tcatggtgct gcaaacgtca gcagacagtg gcactttcga catgtgaagc cgagtatatg 3960 gcactctcgg cagccgtcca agaagcgctt tggtggaagc gcctgagagc aacgttcgac 4020 atcgatgaag ccgtgaagat caattgcgat aaccaaagca ctattgcagt tgctaaaaac 4080 ggtgggtact atccaaggac caagcatatt gacatccgtc attgttttat tcgggatgct 4140 gtgcagcgtg gtgatgtaga catcgtttat atcagtaccg ataaacaact ggcggactgc 4200 ttgacgaagt cgcttccgaa accgaagatc gagcttcagc gagcggctat gggaatccag 4260 agtcagtagg ttgaggagga gtat 4284 // ID Mariner-1_DPu repbase; DNA; INV; 4858 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE Mariner DNA transposon from Daphnia: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; nonautonomous; KW Mariner-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4858 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC >97% identity to consensus. TA TSD. XX FH Key Location/Qualifiers FT CDS join(859..1605,1554..2228,2243..3046) FT /product="Mariner-1_DPu_1p" FT /translation="MVRNYHRKTEKVPREILLSALRQLKIHHDLKWGQSSY FT RRVADSHGISKTTLFNHFQKVKHLASIPDNYFSNQAHHRQILKYEEEEELV FT GYLKLAMESNHSLTPTEVRKLVYSYVLGNGIICPAMWHEYHTAGEDWFTSF FT MKRNQCLSLRKAEATSQARAAGFNYPVVHCYQEKLGGVFARHQYPPDRIIN FT IDESSNQSVMAPETVVAIRGTKQVNIVICFIFESIAAIYFFNLLRFVKPPV FT QSEELMFLFVKVRQTTGAERGTNVSMICFGNAAGGFIPPAYVFPLKNVNRK FT SMNNSVPGSIAFANGTGWFDGNIMVDVIEHVQKYVQSSKENPLLIIWDNFS FT AHLDYLVVKKANDYGMELLTLPPHTSHELQPLDVSVFVAMKKFIKAAHMEW FT YRNNPGKRITIHEIAGLSKDPFYKAMTPANLISGFKRAGIYPFTLLQPDDP FT RFSSSLVTDLPGTKQSSYSCFSYYSCYLFCFFTFPLLATSQQQVAEALSDH FT EIHPPQQFIEIEGSNNIQTVEMEMDELDGMDISQEEENVPTRDCSVLCNTV FT ASTEIDNRQQMANSARETVQGASSQSTSGNCETPLRTNVTVNRVRLAELMP FT IPKVFQNGPRQTNRTRTGRSRLLTDPEEMAQLETDYLKKKKKEMEKENSKK FT IRAVKSRLSNKENITENGKVVATKRNDNAVHAKGPLSVQNRQPRNAVHEKE FT PLSVQNRQPRNAVQEKEPLSVQNRQPRNARKPAFLTQNYFFD" XX SQ Sequence 4858 BP; 1573 A; 902 C; 915 G; 1460 T; 8 other; ggggagaccg gggctatttg ggacattttt tttttacgct gccatttttt ttaaagcaaa 60 attattgaaa aataaaaata tgggagtgat tccctgtcct ttctttacat tttgtaattt 120 tttcaatttt ttcggtcgaa gtatgtaann gttgttaaat aaaatgcgaa gaggaaaaaa 180 tgacgaaaaa aaaaaatagg gttaaacgga acacagccag gggctacatg gaacagcatt 240 tttctctgca atggctgaat tttatttatt aaaagacgca ctggtgaaag gtaaattcct 300 tcatctttca gaagtgctgt ctttttatta gttatattaa gtataagtgg agatagcatc 360 ataaaagtga agatgacttt tttcccatct gtttttcctt gtttttgtcg gtgggaggtc 420 gtttatctgg ctgtagctcc acccttgtga tcacgtgact ttttttactt ctgtttagtc 480 aaaccctggc cccagtaaaa aggaaaagaa aatcgtagac tgccattatt gacataagaa 540 ttgaagctga aatcttgatg aataattcca ttcatagaca gtttagtgtc tttattgtga 600 gtcgtaattt ctccaactgg taattattat ttgccctaca taatttggtt tataacgttt 660 taagacgagt aaacaaattc taaaacatag agttgcacac ctgggattgc caaacanacc 720 aagacantct tcatttgtaa aagtcacgaa ttatataggc ctgctggtga gtggtgatct 780 aactatattg ttaaatcaac tatactgaaa tactgaaata ctgaaattaa cccatgtttg 840 tacattacag gtttaaatat ggttcgcaac taccacagga agacggaaaa ggtgccaaga 900 gaaatattgc tcagtgcact gagacagttg aaaatacatc acgatcttaa atgggggcag 960 agttcttata gaagagtagc agatagtcat ggcatctcta agactacgct cttcaatcac 1020 ttccaaaaag tgaagcatct tgcctctatc cctgacaact atttctcaaa tcaagcgcat 1080 cacaggcaaa tccttaagta tgaagaagag gaagagctgg ttggttatct caagcttgca 1140 atggagagca atcattcgtt gactcctact gaagtaagaa agctcgttta cagctatgtg 1200 ctaggtaatg gaattatatg tcctgcaatg tggcacgaat atcatacagc tggagaggat 1260 tggttcactt cgtttatgaa gaggaatcag tgcctatctc tgagaaaagc tgaagctaca 1320 agccaagcta gagctgccgg attcaattac ccagttgttc actgttatca agagaaattg 1380 ggtggtgttt ttgcccgtca tcagtatcca ccagaccgaa tcattaatat tgatgaaagt 1440 agcaatcaat cggtgatggc gcctgaaaca gttgttgcta ttcgtggtac taagcaggta 1500 aatatcgtaa tatgttttat ttttgaatca attgcggcta tttatttttt taatttgtta 1560 aggttcgtca aaccaccggt gcagagcgag gaactaatgt ttctatgata tgctttggaa 1620 atgctgcggg gggatttatt cccccagctt atgtctttcc gctgaagaac gtcaaccgaa 1680 aaagtatgaa taactcagtt cctggttcta ttgcttttgc caacggaact ggttggtttg 1740 atggaaacat catggtggat gtcattgagc atgtacagaa gtatgtgcaa agcagcaaag 1800 aaaatccact cctaatcatt tgggataatt tctccgctca cttggattac ctagtagtaa 1860 agaaagcaaa cgattacggt atggagcttt taactttgcc tccccacacc tcccatgagc 1920 tacaaccgtt ggatgtgtcg gtgtttgtag cgatgaaaaa gtttataaag gcagctcaca 1980 tggaatggta ccgtaataat cctggaaaga gaatcaccat ccatgaaata gcaggattat 2040 caaaggatcc tttttataag gcaatgactc ctgctaacct tatttctggt ttcaaaagag 2100 ctgggatcta ccctttcacg ttgttacaac cagacgaccc ccgtttctca tcatctttgg 2160 ttactgatct accaggtacg aaacaaagtt cttattcttg tttttcttat tattcttgct 2220 atttatttta ataatccaat aatgtttttt cacctttcct ttattagcaa catcgcaaca 2280 acaggttgca gaagctctaa gtgatcatga aatacatcct ccgcagcagt tcatagagat 2340 agaaggttcc aacaacattc agacggtgga gatggagatg gatgaattgg atggaatgga 2400 tatttcacag gaagaagaaa acgttccgac tcgtgactgt agtgttttat gcaacacagt 2460 agcttcaaca gaaattgaca atcgtcagca aatggcaaac agtgctagag aaacagttca 2520 aggagcttca agtcaatcaa catctggcaa ctgtgaaact cccttaagaa cgaatgtgac 2580 ggtaaatcgt gtacgtttag cagagcttat gcccattcca aaagtctttc aaaacggacc 2640 aaggcaaacg aacagaacac gtactggaag aagccgcttg ttaaccgatc ccgaggaaat 2700 ggctcaacta gaaacggact atttaaaaaa aaagaaaaag gagatggaaa aggagaactc 2760 taagaaaata cgagcggtta agtctcgctt gtcgaacaaa gaaaatatca cagaaaatgg 2820 caaagtcgtc gctactaaac gcaatgataa tgcagttcac gcaaagggac cattatccgt 2880 ccaaaaccgt cagcctcgaa atgcagttca cgaaaaggaa cctttatccg tccaaaaccg 2940 tcagcctcga aatgcagttc aggaaaagga acctttatcc gtccaaaacc gtcagcctcg 3000 aaatgcaaga aaaccagctt ttttaacaca aaattatttc tttgattgag tcagtttcac 3060 tagaaattaa tcattcttct gtgtacgatt tacaatgtgt ctaatttatt aatcaaggat 3120 tctctatttg tgcgatagca aagcttaccc aacctaattt gctacattga cggttaaaat 3180 cgagtctcga gtaaagtgat aacatttgtc atctcactgt ttcttaatga agaactggat 3240 attaatttaa acactgaaat aaacaaacaa aggtttctcc tatctattaa aaatcttttg 3300 tgaattgaga aatgcaaaac actcctatgt aagataatgg gtttttgtgt aaaagtgtca 3360 ataattaagc aatgaatttg agggattgtt tgtattaact gttaagaaag ttccaacaaa 3420 cagaatacta gtcaacataa gcaccttggt tgttgaccac ttgctcaaga ctaactgaaa 3480 tttggtttag acgacgtagc acaaagactg cactgcagac tttgtgccaa actttgtaaa 3540 tttcttccca tagttgttcc tcactaaaaa ctttaatttt gttttcagtt agctcgtcga 3600 gcatctgctt ccaaagcgat tccacaggca tcacgtctcc aaaacatctc ggccagtcaa 3660 gaagaatcaa cgcatttttg tgttcactat aatacttttt cattgcagta gaataatgga 3720 ccggaaacct gaataaagaa cgatgttatc atgaaacgtc acaacaactt tgtaatttac 3780 aaatttgtga gttaccagtc atgcacgtat ttaatcatgt gcgatgtagt ctgtggtgcc 3840 aatggaagca catgctcttc tagaagtttt ctgtatttag ctgagtccag tctaccatca 3900 actcgtgaaa ttgttgagtt gattgtagac atacagctcc aaacaagaac agtgctggga 3960 atcttctcct tttctgattt gtgatacaaa gatattgttc cgttgtcttc aaaactaaaa 4020 atagaagcaa gaggtgtaag taagtaacat gaataatgag aacataatta aatagctgag 4080 gaaacttttt actgtacctg aaccgttgac tgtctccgta actgacaccc gtggtgtaat 4140 tttctacatt cttcatgatt gtttttgcng ctgaaactct agaaattttt tgatcgtagg 4200 tcaattcagg acgaaacata tttgttttac atgagacgaa acctaggaaa taggaaagtg 4260 ttgtttcaat ggacaaaatg cgattaaatg gattccgatc tgtattatta ccagaaatca 4320 aattaatgga aggaacgtag attgggccta taatgtcaag tcgacaattc gtttgtgttg 4380 tcagttcgga aataggacta aacagcaatg tcttggcccc tcctacagct ccacccacag 4440 gaggtgccaa gatgctctgt tttaccccgt tttccatgtc ccagttaacc cgaacccaga 4500 caacattaaa taaactaaat tattcaatac ttaggaaccc attacaaaac aaaaaatata 4560 ccaccttaaa ctagaatgaa aatgctgtca aatagctatc tgggttttac aaaannttga 4620 attttagtat caataaaata aaaaactaaa cgtctgaaat attttttcag atccccgttt 4680 ttttattttc ccatagaaca agaaaaataa atcgaaattt caagaaataa agtttgcatg 4740 ataaataaaa cgtaatttan catactcttt atttatttca acatttttaa acgaatttta 4800 catattttaa ttgccttgta cattattgcc ccgtgtccca aatagccccg gtctcccc 4858 // ID Mariner-18_SM repbase; DNA; INV; 1189 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-18_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1189 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1867-1867 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 398..916 FT /product="Mariner-18_SM_1p" FT /translation="MEQIGKEEIRNMMIKNVLPAIREKFPKSEKSRTVYIQ FT QDNAKPHLTNNDSALQEELVKEGWSLQLKSQPPNSPDLNVLDLGFFNSIQA FT LQHQMVTKNIDDLISAVETAFQQLDRDTLDSVFLTLQTCMENIMISNGNNF FT YKITHINKEKLRRESRLPINLKCNAEAVENTKNF" XX SQ Sequence 1189 BP; 458 A; 182 C; 191 G; 358 T; 0 other; ctacctccgt tccacaatag ttgtcacatt tggatatttt tggtccgatt attgaaaatg 60 tatgctattt gactaatttt gttaaagatt tgctaaattt gtactgtttt aattattttc 120 taaaagtgca aatttaagaa aaaaaaatta gtagtattta gttaaaaaat taaagaaata 180 aagtactagt agtattactc acaacacaat actagtagta ttacagtgtg aatgtgcctt 240 gtttaaaatt tgtacgcgcg cacacgcaag accaagatgg ggttcatcta aaaacgaata 300 tttcgatgga aaaattggaa tctggccatt cactttcaaa gaaaatgcca aaaggaactc 360 taaaaatcgt ctaaagggaa cttcggtaac gaaagttatg gaacagatcg gcaaagaaga 420 aattcgaaat atgatgatta aaaatgtttt gccagctata cgcgaaaaat ttccaaaaag 480 tgaaaaatca cgtactgttt acatacagca agacaatgcc aagcctcatt taacaaataa 540 tgattctgct ttacaagaag aacttgtaaa agaaggttgg agtctgcaat tgaaatctca 600 acctccgaat agtccagacc taaatgtttt ggatcttggg ttttttaatt caatacaagc 660 attgcaacat cagatggtca ctaaaaatat tgatgattta attagtgcag tagaaacagc 720 tttccaacaa ttggatcgag acacacttga tagtgtattc ttaacactac aaacatgtat 780 ggaaaatatc atgatatcaa acggaaacaa cttttataag attactcata taaataaaga 840 aaaacttcga agagaaagtc gactccccat aaatttaaaa tgtaatgccg aagccgtgga 900 gaatacaaaa aatttttaaa tattgaataa ctgaaataaa tgttactact ataaaattaa 960 taaagaactt atttctgact cgcaattttt tattcaccta ttgatcacct atttttaagt 1020 gcccaagctt gttagaaagc cttgagtaac ccttttcaac aagctattac ttttgaaaat 1080 cggtctaaca caagcgaagc tatagctata aaagttttat caaatgggac aagtattatg 1140 gaacaaaatt tatgcatcaa atgtgacaac tattgtggaa cagaggtag 1189 // ID I-50_AAe repbase; DNA; INV; 5806 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-50_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5806 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1321-1321 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 19 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 574..1950 FT /product="I-50_AAe_1p" FT /translation="MAASSGWDPGLPPTAFNGGPTVPEWMGGVGIGQLQIL FT SLXATDGETKLCPFIVGKSMQDLVGEIENTTTEANGMKYVLRVRDAGQVRK FT LLSMKKLFDGTAVTVELHPVFNKRRCVISCREIQNKTEQELMEWLAKDGVV FT GVKRITRMQDGKPVNTPTVILTLNGTAVPDHIKVGPLRIKTRMYIPDPMIC FT YKCFNYGHSKLRCKGAAKCRNCSKTHDLEGECNAAPFCQHCQGAHGPANRS FT CPVYAMEKEIVRLRFTKGISQEEAKKQIQSGGGSYAAVSSXVQQRLVNART FT STGQSDQLKAKDDLIKQLTETITKLTSRIEELEKKCTSKKEKKRSRKIQIM FT KDEGSGSEMETDSSAKQSAPGQPSVGVSNQVSAVLPKPSVQKHKRHPTTEI FT HAPIVKKASADQSQQSSDLAYPPLNNKSPPNHPGILDQLQTLMDSTRLNSP FT HSQSHNGQHTKPHK" FT CDS 1925..5605 FT /product="I-50_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANTPNPINSQSSSNQPTQSCLALQWNLFGLKGKLSE FT LQLLISQFNPIALALQETMIPNSTSINFIKGYDLYICENPDNPYKTGTSLA FT IKSDIPHRRISLPSYDSLFAVAIEIDFPLKITLVSIYISPSHNLPIKASLA FT KILDQISSPVLLMGDFNGHSTLWGCHRNDARGSILCQLFDEYGMTLLNDGS FT HTRISLSTGQSSALDLSVSSHSLNSQLSWMVHNDCCGSDHLPIVIQLNRAP FT PVSSCRPRWKYELADWPSYQNEVFSVFSNRAPSSAEEFVQQLFAIATHHIP FT RTNGSPGKKSVPWWNPEVRSAVKLRRKKLRALQKIAKEDRKDSTTFSEFKA FT ARNAARSVIKQAKQDSWDEFISSINPDSTSKELWDKVHRLNGSKSRQPIKL FT KISNQITDNPSIITEHLADHFSQSSSSSNYSDNFISHKNTIEASPPNFNTD FT ENLEYNCDFSYEELDWALHRVHGSSAGPDDVGYPLLKNLPLIGKSILLTIF FT NDIWNRGEIPNSWKEGLVVPIPKPDKDRNNADSFRPITLLNCIGKILEKMV FT NRRLITLLESRGLLDNRQFAFRPDKSTDDYLTELEDIINSHLERGMHGDVV FT SLDLSKAYDRAWRFPILKSFEEWGIKGRMGHYVQSFLQDRRFRVILGNNRS FT ELRCQENGIPQGSVIAPTLFLICIQSLFKDIPPNIFILVYADDITILTFHS FT IKSLSRKRIQSAVLSVSKWADEHGFTISPEKSQLLHISRNRKKMSKLPDIT FT LNNTIIKSQISLKILGIHLDRTLSFRQHLNSVRKSINQRLNLIKVIGSRIP FT CAHRTSILQIVNSWILPKMFYGVGLFSRGGDLVSNKLSPLYNKAVRLASGA FT FATSPILSIMAECGQLPFDYSLTTNVVSKAIRWLSLGGRNDVPLVTRASDL FT FQSLTNQSLPEIAVLPRSKTKSWNQPTPKIDLSLLDQVKAGDPPSIIQSHF FT NKLRSEKYSNHLTIYTDGSVCNGYVGYGIASFSTLPNISSALPPSCSIFSA FT EAYALLKATHLCSTLSDRPIVIFSDSASCLREMDSYSLKHPWLQEAERLAL FT VNRATFCWVPGHSGIHGNEEADRLAGVGRSMNPEDVSIPSIDAQRWTKEKI FT HDSWSTKWYNSRDIALRRLKSSTFPWPDNINPKHRRILTRLRIGHTRLTHS FT YRIDKIDPPTCTSCGTPLTVXHILIDCQCYNTERLACNLDGTLEETLSPPN FT EEALIAFLTTTGLLDHL" XX SQ Sequence 5806 BP; 1658 A; 1516 C; 1158 G; 1470 T; 4 other; tttttttttg acgtaggact acgtctttgt tttctatatt ggggtgcact ttaaaagtac 60 ggaaaatgag gcatgtaacg aaattggggg agattttgaa cgctaataac tcagttgttt 120 atgaaccaat tattacgatt ttagtatcat tcgatcagaa atgtatctac gcatcgttag 180 taatatatcc gataagtgaa ttttgttgtt aaactattga aaatttgata aacgttgacc 240 cctttcttat ccaaccaatc acgttcgagc acttttcgac ggtgtcgctt cccactataa 300 aagcaaacgg gtccctgctt cttccctcag tcattcagtt ttcaaccacc agcgcgcgcg 360 gacgtgtatt cgagagcgag tattttccga gcgattcaac agtgagcagt gcagtgacat 420 tccaaccgat cgcggttgaa gaaacagtta accacttttc ggtggtacag tgacagtgac 480 agtaacagtg atacggtaaa gacttgatac cgttagtgtg aaaaaagtga tacagtgcaa 540 agagagcctt tgcgcgaaga agtgcgagcg agcatggcgg cgagttcggg gtgggaccct 600 gggcttcccc ccacggcgtt caatggcggt ccaacggttc cggaatggat gggcggcgta 660 ggcatcggcc agctgcaaat cctttccctg twtgcaacgg acggtgaaac gaagctgtgc 720 ccatttatcg tgggcaaatc aatgcaggac ctcgtcggcg aaatcgaaaa caccacaacg 780 gaagcgaacg ggatgaagta tgtgctgcga gtacgtgacg ctgggcaggt gaggaaactg 840 ctcagcatga agaagctttt cgacggaacg gcggtgacgg tggaactcca ccccgtcttc 900 aacaagcggc gctgcgtgat ttcatgtcgc gaaatccaaa acaagacgga acaagagctg 960 atggaatggc tggccaagga cggagtggtc ggtgtaaagc ggatcacccg aatgcaggat 1020 ggcaagccgg ttaacacccc aactgtcatt ctgacgctga acggaacagc agtgcccgac 1080 cacatcaagg tcggaccact gcgtatcaag acacgcatgt acatcccgga cccgatgata 1140 tgctacaagt gcttcaacta tggccactcc aaactccggt gcaaaggcgc tgcaaagtgc 1200 cgaaattgct ccaagaccca cgacctcgaa ggggagtgca acgctgcccc tttctgccag 1260 cactgccaag gtgcccacgg acctgcgaac cgatcctgcc cggtgtacgc gatggagaag 1320 gaaatcgtta ggctgcgatt caccaaaggc atttcccagg aagaagcgaa gaagcagatc 1380 caatccggcg gcggttccta tgctgccgtc agctcakcag ttcagcagag gttggtcaac 1440 gcccgtacat ctacgggaca atcggatcaa ctgaaggcca aagacgacct gatcaagcaa 1500 ctgaccgaga ccatcacgaa gctaaccagc aggatcgagg aacttgaaaa gaagtgcacg 1560 agtaagaagg agaagaagcg atcccgcaag atccagatca tgaaggatga aggtagcggc 1620 tccgaaatgg aaacggattc cagtgccaag cagagtgccc ccggacaacc cagcgtaggc 1680 gtctcgaacc aggtaagtgc tgtccttcca aagccaagcg ttcagaaaca caaacgtcac 1740 ccaaccaccg aaattcatgc ccctattgtt aaaaaagcaa gtgctgacca atcgcaacaa 1800 tcttctgatc ttgcttatcc cccattgaac aacaaatctc ccccaaacca ccctggcatt 1860 ctcgaccaac tgcaaactct aatggattcc actcgtctca actcccctca ctcacaatct 1920 cacaatggcc aacacaccaa accccataaa tagtcaatcg tcctctaacc aaccaactca 1980 atcctgccta gcattacaat ggaatctctt cggacttaaa ggtaaactct cagaattaca 2040 attattaatt tctcaattca accctattgc tttagcgtta caggaaacaa tgattcccaa 2100 ttcaacttca ataaacttca tcaaaggtta tgatctgtac atttgcgaaa accctgataa 2160 tccctacaaa accggtacaa gtttagccat aaaatctgat attcctcacc gtagaatttc 2220 tctcccttca tatgattctt tattcgcagt agccatagaa attgatttcc ccttgaaaat 2280 cacactggtt tccatttata tctctccctc acataacctc cccattaaag ctagtcttgc 2340 caaaatactt gaccaaatat catcccctgt tcttctgatg ggtgatttta acggtcattc 2400 aactctttgg ggctgccatc gtaatgatgc tagaggatcg attttatgcc aactctttga 2460 cgaatatggc atgactctgc tgaacgatgg tagtcacact agaatcagtc tctcaactgg 2520 tcaatcatca gctttagacc tttcagtatc ctcccatagc ctcaattccc agctgtcttg 2580 gatggtccac aatgactgtt gtggtagtga ccatcttcct attgttatcc aactgaatag 2640 agctcctcct gtgtcttcct gccgtcctcg ttggaaatat gagctggctg actggcctag 2700 ctatcaaaac gaggtcttct cggtattttc caacagagcc ccttcctcag ctgaggaatt 2760 cgtccaacaa ttgtttgcca tagctacgca ccacattccc agaaccaatg gttctcccgg 2820 taaaaagtcg gttccatggt ggaaccccga ggttcgttct gcagtcaaac tccgtcgcaa 2880 aaaactacgt gctcttcaga agattgcaaa ggaagatcgt aaagactcta ccactttttc 2940 tgaattcaaa gctgctagga atgcagccag atcggtcatt aaacaggcca aacaggacag 3000 ctgggacgaa ttcatctcct caattaaccc tgacagtact tcaaaagaac tgtgggacaa 3060 agtccatcga ttaaacggta gtaaaagtag acagcctatt aaactaaaaa tcagtaatca 3120 aatcactgac aatccttcaa ttataaccga acatcttgct gatcactttt cccaatcatc 3180 ttcctcctcc aattactctg ataacttcat ttcccacaaa aacaccatag aagcatctcc 3240 tcccaatttc aatactgatg aaaatctaga gtataactgt gatttttcat atgaagagtt 3300 agattgggct ttacacagag tccacgggtc atcagctggc ccggatgatg taggttatcc 3360 acttctcaag aaccttcccc taataggtaa atccatcctc cttacaatat tcaacgacat 3420 ttggaaccgc ggtgaaatcc ctaactcttg gaaggaaggt ttagttgttc ccattcccaa 3480 acccgataaa gatagaaata atgccgacag tttccgcccc attactttat taaactgtat 3540 tgggaagatt cttgagaaaa tggttaacag acgactcatc actcttctgg aatctcgtgg 3600 tttactagat aatcgacagt ttgctttccg cccagataaa agcactgatg actatcttac 3660 cgaacttgag gatatcatta actctcacct ggaaagggga atgcatggtg acgttgtatc 3720 ccttgacctg tcgaaagcat acgaccgtgc gtggcgtttt cccattctca aatcttttga 3780 agaatgggga atcaaaggtc gcatgggtca ttacgtccaa agcttccttc aggataggag 3840 atttagggtc atcttaggca acaaccgatc tgagttgaga tgtcaagaaa atggtattcc 3900 tcaaggttcg gtaattgccc caacactctt cctcatctgc atccaatcgc tattcaagga 3960 cattccgccc aatatcttca tactagtcta tgctgacgac atcaccatac tcaccttcca 4020 cagcataaaa tctctgtcca gaaaacgaat tcaatcagca gttctgagtg tttcaaaatg 4080 ggctgacgaa cacggtttca ctatatcccc tgaaaaatcc caacttctac acatcagtcg 4140 caatagaaaa aagatgtcaa aactccccga catcacttta aataatacaa ttataaaatc 4200 ccaaatatcc ctcaaaatcc ttggtattca tctcgaccga accctcagct tcaggcagca 4260 ccttaatagc gtccgcaaat ccatcaacca aagacttaat ctcataaaag taatcggctc 4320 ccgtatacct tgcgcccacc gaacatctat cctacaaatc gtcaacagtt ggattctacc 4380 taagatgttc tatggtgttg gcctcttcag cagaggtggt gatttggtaa gcaacaaatt 4440 aagcccactg tataataaag ctgtccgtct agcttcagga gcttttgcta ccagtcccat 4500 cttatccatt atggctgaat gcggacaact cccttttgac tactccttaa caactaacgt 4560 tgtttccaaa gctatcagat ggctgagtct ggggggtcga aatgacgtcc ccctggtcac 4620 acgagcctct gacttattcc aatccctcac aaaccaatcg cttccagaaa tagccgtatt 4680 acctagatcc aaaaccaagt catggaatca acccaccccc aaaatagatc tctctcttct 4740 agatcaggtc aaagctggtg atccaccttc tataattcaa tcacacttca acaaactccg 4800 ctctgaaaaa tactccaacc acctaacaat atatacagat ggttctgtat gtaatggtta 4860 tgttggttat ggtattgcta gcttctctac actcccaaac atcagttctg ccctcccccc 4920 atcttgttcc atatttagtg ccgaagcata tgcccttcta aaagccactc acctctgttc 4980 caccctatcg gatcgtccca tagtaatctt cagcgattct gctagttgtc ttcgagaaat 5040 ggattcttac tccttgaaac acccctggct tcaggaggcc gaaagacttg ctttggtcaa 5100 cagagccacc ttctgttggg tgccaggtca ctctggcatc cacggtaatg aggaggctga 5160 ccggttagca ggtgtaggtc gcagcatgaa cccagaggat gtttctattc catccatcga 5220 cgctcagaga tggacaaagg agaagatcca tgactcctgg agcactaaat ggtacaacag 5280 cagggatatc gcwctcagaa gattaaaatc atccactttc ccctggcccg ataacatcaa 5340 ccccaaacat cgtcgcattt taacccgttt acgaattggt cacacccgtt taacccatag 5400 ttaccgtatt gacaaaatag atccccctac ttgcacatct tgcggtaccc ccttaactgt 5460 ccmccatatt ttgattgact gtcaatgtta caacactgaa cgcttggctt gtaatttgga 5520 cggcaccctg gaagagacgt tatcgccacc aaacgaggaa gctttaatcg catttttaac 5580 caccaccgga ttactggacc acctttaaaa acctgcgata agggacgaat gaccttcggg 5640 ttaaagtccc tctcaaacaa caacaacaac aacttcttcc ctcagtcttc tttttgcggt 5700 cggactgtga acacggtcgg gcgagtcatt ctcgctttgc ttttgcaccc gcgtcacagt 5760 ccaatttgtg tcgtcggaag aggaagacgc aagattgatt tgcggc 5806 // ID Sola1-1_HM repbase; DNA; INV; 3181 BP. XX AC . XX DT 10-SEP-2009 (Rel. 14.09, Created) DT 11-FEB-2011 (Rel. 16.02, Last updated, Version 3) XX DE Sola-type family: consensus sequence. XX KW Sola; DNA transposon; Transposable Element; Sola1-1_HM. XX NM Sola1-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3181 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 9(9), 1928-1928 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 659..2713 FT /product="Sola1-1_HM_1p" FT /translation="MHSRAAHMVSLALIKCKKVQRSLATDTTIADPNLAAD FT IQTERSTLAAESPIVADNSPVVAVESPIVTTESPIVTTESPIVTTESPIVT FT DQRPRKRKRCVTNWKKNVIKRLRNTGSEYVKHGGTLQKAKKLNEHYLINHS FT CKNKCAENLTLEQTKTLFENFWKLGDFDAQNLFLAGCVKQSTVERRRPRVR FT KNKITKNEKKHAKDFARVYSVFADNKPVTVCKPYFLAIFDISSGRLNRALV FT NQRENGGVSGKDGRGKYDHKTQRIHEEHITRIKDHIRMFPTYESHYTRSHS FT ASRRFLPAHLTVSTMYDMYREKCIQDSVEPQKEWKFRDIFNHEFNLTFHQP FT LKDTCKKCDIFKTDIEIEKTEHKKSQLKAAHEIHLRKAEKARQSLKLEKEN FT EDNKHASFCFDLQKVMSLPCLTTNEVYYCRQLSVYNLGIHDLKNDNAVMNV FT WHEGTASRGAGEIGSCLLAYCQNLANRDITSITAFSDSCGGQNRNIKIALL FT WMHLTQTTSIELVNHKFMVSGHSYLPNDADFGIIERAKPKSTEIFVPEQWY FT KLIENCSMKKRFSVARLSTDNFKSVEPLLKLITNRKVDESDQKVNWLDMQW FT LQLRKNEPYKLFFKYSLDDDASFSFISLAKRGRQTNLSEVSLAPLYCGATK FT LSKEKYSDLMKLLKFLPSIMTSIKSWNTAVTTRMY*" XX SQ Sequence 3181 BP; 1126 A; 529 C; 601 G; 925 T; 0 other; gagcctatga gagtaaaagt ggaccaagtc caattttttt caaatgaccc ttatgtatgt 60 aaaactgcta aaaatttgct atatttttag agcaaaaatg gaccaagtcc ataacacatt 120 tatttcttat tcaacagagg gcagcacatt tttactctaa caataaatca ccaggtatag 180 agagtaatag tggaccaagt ccattgatag tgcattaatg taagaaatca agattcaaga 240 ccacataaac tacaaatttt cttagaatta ttattattat attagaacaa gaataacatt 300 tgtacaggta agatgcttta aatgaatatc tgattatggt tttatattaa tatagtgaag 360 tatacagcac tttcacactt tataaagtga caaaactgaa atagtttggt gagcgataat 420 ataagattta ttcaattgct ctcagcagga atataaatat tgatttgtta gacagtagat 480 ataatggtct gtcactacca aagttacagc ctctttagaa aggtcaattt tgtggcttta 540 ataacatcta gctaggggag cacttgcatg catgttgtgt acatgtacat tattctttct 600 aataatattc tgaatgtatg taatatttaa aaaatattta ttgtagaata gtgtaaacat 660 gcattcacga gccgcacaca tggtgtcact ggcattgata aagtgcaaga aggttcagag 720 atctctagct acagatacaa caattgcaga tcctaactta gcagcagata tacagacaga 780 acgttctaca ttagcagctg aatctccgat tgtagccgat aattcaccgg ttgtagcagt 840 tgaatctccg attgtaacaa ctgaatctcc gattgtaaca actgaatctc cgattgtaac 900 aactgaatct ccgattgtaa ccgatcaacg ccctcgaaaa cgaaaacgtt gtgtaacgaa 960 ctggaagaaa aatgtgatta agcgcttacg aaatacaggt agcgaatatg taaagcatgg 1020 cggaacgtta caaaaagcta agaaattaaa cgaacattat ttaataaatc acagttgcaa 1080 aaacaaatgt gctgaaaatt tgactttaga gcagactaaa actctgtttg aaaatttctg 1140 gaaattggga gattttgatg cacaaaatct gtttcttgct ggctgcgtga aacaatcaac 1200 agtagaacgt agacgaccaa gagtaagaaa aaacaaaatt acgaaaaatg aaaaaaaaca 1260 tgcaaaggat tttgcacgag tatactctgt gtttgcagat aataagcccg ttactgtttg 1320 caaaccatac ttcttggcga tatttgacat tagcagcggt cgactaaatc gtgcgttagt 1380 caaccaacgg gagaatggtg gtgtgtcagg caaagatgga cgaggaaaat atgatcacaa 1440 gacacaacgc attcatgagg aacacatcac tcgcattaaa gatcatattc gcatgtttcc 1500 cacatacgaa agtcactata ctagaagtca cagtgcatct cgccgctttc tacctgctca 1560 tttgacggta tccactatgt acgacatgta tagagaaaaa tgcatacagg attcagtcga 1620 accccaaaaa gagtggaaat ttcgggacat ttttaatcat gaatttaact tgacgttcca 1680 tcagccacta aaagatacgt gcaagaaatg tgacattttt aaaacagaca ttgaaataga 1740 aaaaactgaa cacaaaaaat ctcaactaaa agcagcgcat gagattcatt tgcgcaaagc 1800 agagaaggcg cgtcaatcct taaaactaga aaaggaaaat gaagacaata aacatgcatc 1860 tttttgtttt gacctgcaaa aggtcatgtc attgccttgc cttacaacca atgaagttta 1920 ttattgccgg caattatcag tgtataatct aggcattcat gaccttaaaa atgacaatgc 1980 agtgatgaat gtatggcatg agggaacagc ctcacgaggt gcaggtgaga tcggttcatg 2040 cttattggca tattgtcaaa atctggcaaa tagggatatt acgtctatta cagcattcag 2100 cgactcgtgc ggcggccaaa atcgaaatat aaaaattgca ctgttatgga tgcatttgac 2160 gcagacgacg agtattgaac tggtgaatca taagtttatg gtctctgggc attcttattt 2220 gcctaacgat gcagattttg gaataattga gagagcaaaa ccaaaatcca cagagatctt 2280 cgtgccagaa cagtggtaca aattgatcga gaactgctct atgaagaagc gcttcagtgt 2340 tgcaagatta agcactgata atttcaagag tgtggagccc ttattgaaat taattacaaa 2400 tcggaaagtg gatgagagtg atcagaaggt caactggctg gatatgcaat ggctgcaact 2460 tagaaagaac gagccataca aactgttctt taaatactct ttagatgatg acgcatcatt 2520 cagtttcatt tcgctagcga aacgcggacg tcaaacgaat cttagtgaag tgagtttggc 2580 tccactgtac tgtggagcaa caaaactgag caaagaaaaa tactcagatc taatgaaact 2640 gttaaagttc ctcccatcta tcatgacttc tatcaaaagt tggaacacag cggtcacaac 2700 aaggatgtac tagaagatga actgctcgat gacagcggcg atgatgatca gtaaattcgt 2760 gtggttttta ttgtactgtt tgtaataatg aggactattt ttgccgataa ttagtaaata 2820 gtaattgtgt ttttttacaa catagtatga atttgtacta cttgtcttaa tttgaattgc 2880 tattgtaaac gaataatatg aaatttgttc atgattttac actaaaatga tttagcgatt 2940 tgttgattat gtttagagaa aaaatggacc aagtccataa ctattattat attattaata 3000 attattcaaa aatttatttg tgcatttcac atatcttgtt gtctaagaaa aaagatttaa 3060 aatgatacta gacatcaata atattacata tggctgattt ataatatatg caaacactaa 3120 gaacgcttat ttctaacaat atagaaatat ggacttggtc cacttttact cttatgggct 3180 c 3181 // ID Copia-32_DPu-I repbase; DNA; INV; 4329 BP. XX AC scaffold_96; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-32_DP_; KW Copia-32_DPu-LTR; Copia-32_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4329 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_96; Positions 232297 227969. XX CC 'TTACT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 411..4274 FT /product="Copia-32_DPu-I_1p" FT /translation="MWTRLSVQYLQNAAEDQHALRQRFYDYKFQPEHDVMA FT HITAIETMATHLNDAGATVDPIQVMTKIVCTLPRSFENFLSAWDNVPANEK FT TMQLLTSRLLREESVKKRWSSEDNVVTENAFAARNPPTSSRTPNPTGRNAS FT RRGRGRRGGYNKLADRECTYCTGPSRFSHTYEVCRVRMRDERRKRDEAETS FT KGESSFTSIFQQDDCYLSSPRTMALMWSHWIADSGATAHMTDQRQAFSKFE FT PFLSFSSYPVKGIGGTKLFAHGQGSIDIFTIVSGVQKKVTIKNVLFVPNLG FT ASLISITAATSNGMTVIFSGEKVIFSRLNKIEMTGSRVDKLYLMDIEVHTE FT HQNNQEETAYVADPKKSTIEIWHQRLAHLNYKTILEMSSLDLADGLSLPRE FT CSIPEDICHGCAYEKMQRKSFTTGRSRATYIGQLIHSDLCGPMQTPSLTGA FT RYFLLFTDDYSGWRQVYFLRQKSETPDKFKEYVTLLRSETSNFVHALRTDN FT GGEYCSINFRGWLSKKGIRHESSAPHCPEQNGVAERANRTVVEAARSLIHA FT KGLPIKFWAEAVACSVYTLNRVPSKASKSTPHQIWHKAKPDLTNLRVFGST FT AFVHIPAAERRKIDPKSVKCIFIGYCSTQKAHRFWDPVACKVKVSRDAIFD FT ENRQFDFFHPVANPIEGEPNLDNPPICNEMETASSIPPIHNEKESTQSNQS FT GMYMAPPETNHPSEDLLQPPEPAPLATQKRVHFHPSANPMSDDTMSIPATP FT PEEPNVRRSRRLQGHKALHSQMASHDSPYEPSSYADAISSPDAPLWKAAML FT EEYQSLIKNDTWTLSKLPQDRNTIKTRWVFTVKPSSFGNPPRYKARMVAKG FT FTQRPGIDYSETFSPVVKMDSLRTILSLSAARNLDMTQLDVKTAFLYGEIS FT EEIYLNQPEGFVTPGKEREVCKLNKCIYGLKQASRVWNQHFNKFLQDFGLN FT PSASDPCVYSYQKGEEFTIVAIWVDDGLVCSNNKESTTRILHYLSTHFEMR FT FGVVDYFVGLKITRDWNKMSVYLSQPEYINKVIHRFNMEECQPKSLPADPN FT SRLMSKNKSSPEENTDGENFPYREAVGSLIYLAVTSRPDISYAVNQVAQHS FT ENPNRSHWAAVKRIISYLKGTSKLGIKFDGRCSKTITGFTDADYAGDLDTR FT RSTTGFVFLLNSGPVSWSSRRQPCVSLSTTEAEFIAASETVKEAIWLKRLL FT TEIGNNGDKPIPILCDNDSTIKLVKNNQFHQRTKHIEVRHYFVREKQEAGE FT IEVSYVPTQDNLADIFTNPFPIHGSVI" XX SQ Sequence 4329 BP; 1378 A; 1103 C; 920 G; 928 T; 0 other; ggttatggga ccagattctc gcataaccgc aaatcaagag aatgactagc cctacactga 60 aagatgttgg acatattgca aaatttgacg gaaagaattt ccctttgtgg aaatttggat 120 gttgggttat acttgaacat cacaacctcg taccaattgt agatggcacc gagaagaaac 180 ctgttgaggt atgataaagt ctttttctat catacaactt ttccgacaga tgtaaaacaa 240 aatccctttt tgctgtccca tcacaggtga aaaatgctga acgtgtggtc accaaccagg 300 ctcaaataga tgcctgggtg aaacaagata tattggctcg atactaccta actgccacca 360 ttgagaccca gcagcaaagg agcctgatca attgccgaac agcacatgaa atgtggacaa 420 ggctttctgt ccaataccta cagaatgcag ctgaagatca acatgcccta cgccaacgct 480 tctacgacta caaatttcag cccgaacacg acgtaatggc acacatcacg gcaattgaaa 540 ctatggccac acatcttaat gacgctggag caacggtcga tccaatccag gtgatgacaa 600 aaatagtttg cacactccca cgatcctttg aaaactttct ctccgcctgg gacaacgtac 660 ctgcaaatga gaaaaccatg cagctattaa catcacggtt actgagagaa gagagcgtga 720 agaaacgatg gtcgagtgaa gacaacgtgg tcaccgagaa cgcatttgca gccaggaatc 780 cacctacgtc cagccgtaca ccaaatccta ccggtagaaa tgcctccaga agaggaagag 840 gaagaagagg cgggtataac aaattggccg acagagaatg tacctactgc actggaccgt 900 cacgcttttc acacacctat gaggtatgca gagtacggat gagagacgaa aggaggaaac 960 gagacgaagc tgaaacaagc aaaggcgaat catccttcac ctccatcttc caacaagacg 1020 actgctatct ttcatcacca cgtaccatgg ctctcatgtg gtcacactgg atcgctgatt 1080 caggggcgac ggcccatatg acagatcaac gacaagcatt ctcaaaattt gaaccatttc 1140 tctccttctc atcttatcca gtaaaaggta tcggaggaac aaaactcttc gcccacggac 1200 aaggcagtat cgacattttc acaattgtca gcggtgttca aaaaaaggtg acaatcaaga 1260 acgtcctttt tgtccctaac cttggagcca gcctaatatc aatcacagca gccacgagca 1320 acggcatgac agtcattttc tctggagaga aagtaatctt ttcccgactc aacaaaatcg 1380 agatgaccgg tagccgtgtg gacaagctat acctaatgga catcgaagtc cacacagaac 1440 atcaaaacaa ccaggaagag acagcatacg ttgcagaccc aaagaaatcc accattgaaa 1500 tatggcacca gcgactcgct catctaaact acaaaacgat cctggaaatg tcgagtttag 1560 atctagctga cggcctatcc cttccaagag agtgcagcat tccagaagac atttgccatg 1620 gttgtgcata tgaaaaaatg caaaggaaat ctttcaccac tggtaggtca agagcaacct 1680 atattggcca actcattcat tcagacttgt gtgggccaat gcagacaccg tccctaacag 1740 gagcgagata ctttctttta ttcacagatg attacagtgg atggagacaa gtgtactttc 1800 ttcggcagaa atctgaaacc ccagacaaat tcaaagaata cgtcacattg cttcgaagtg 1860 aaaccagcaa ctttgtccat gcactacgaa ctgacaacgg tggcgaatac tgcagcatca 1920 actttcgcgg atggctgtca aagaaaggta taagacacga atcaagcgca cctcattgtc 1980 ccgaacaaaa tggagtcgcg gaacgtgcaa accgcacagt cgtagaagcc gccagaagtt 2040 taatccacgc caaaggtctt ccaataaagt tctgggcaga ggcagtagcg tgttcagtct 2100 atacgcttaa tcgagttcca tcaaaagcaa gcaaatcaac acctcatcaa atctggcaca 2160 aggctaaacc agacctgaca aaccttcgtg tcttcggatc caccgccttc gtacacatcc 2220 cagcagctga aagaaggaaa atagatccga agagcgtcaa atgcatcttc attggatact 2280 gctcaactca aaaagcccac cgcttctggg atccagtcgc atgcaaagta aaagtaagtc 2340 gagatgcaat tttcgatgag aacagacaat ttgatttctt ccatcctgtg gccaatccaa 2400 ttgagggaga gcctaacctt gacaatcctc ccatctgcaa tgaaatggag actgcctcat 2460 caattccacc catccacaat gaaaaagaat caacccaatc caatcaatct ggtatgtaca 2520 tggctcctcc ggaaacgaac catccatctg aagatctact ccaacctcca gaacctgcac 2580 cattagccac gcagaaaaga gttcatttcc atccttctgc caatccaatg tcggatgaca 2640 cgatgtcaat tccagccaca ccaccagaag agccaaatgt acgtcgatcg agacgtctgc 2700 aagggcataa agcacttcac tcacaaatgg catcgcacga cagcccatac gaaccaagca 2760 gctacgcaga tgcgatctca tcacctgacg cgcccctttg gaaagctgcc atgttagaag 2820 aatatcagtc tctgataaaa aacgacacct ggacactaag taagctgcca caagaccgca 2880 acaccataaa aaccagatgg gtattcacgg taaaaccaag ctccttcgga aacccaccta 2940 gatacaaagc cagaatggta gcgaaaggat tcactcagcg accggggatt gactatagtg 3000 agactttttc accggtagtg aagatggact cccttcgcac catcctatcc ctgtcggcag 3060 cgcggaatct ggatatgaca cagctggacg tgaaaacggc attcctgtat ggggaaatca 3120 gcgaggagat atatctgaat cagcccgaag gattcgtcac tcccggaaag gagagagaag 3180 tatgcaagct aaataaatgt atctatggtt taaagcaggc ttcccgagtc tggaatcagc 3240 attttaataa atttctccaa gactttggcc tgaatcccag cgcttcagat ccctgcgtct 3300 actcctacca aaaaggggaa gaatttacca tcgttgcgat ctgggtagac gacgggcttg 3360 tttgtagtaa caacaaagaa tcaaccacca gaatccttca ttatctcagc actcattttg 3420 agatgagatt cggcgtggtc gactactttg taggacttaa aatcacccga gattggaaca 3480 agatgagcgt gtacctatca caacccgaat atatcaacaa ggtcatccac agattcaaca 3540 tggaggagtg ccagcctaaa agtttgccgg cagaccccaa ctcccgcctg atgtcaaaga 3600 acaaatcatc acccgaagaa aacaccgacg gagaaaactt cccatatcgc gaagctgtag 3660 gcagccttat ctacctggcc gtcacttctc gaccggacat ctcctatgcc gtaaatcagg 3720 tggcacaaca ttccgaaaat cccaacagat cacattgggc agcagtaaag cgcataatct 3780 cctacctcaa aggaacgtct aagcttggaa ttaaatttga cggaagatgc agcaaaacca 3840 tcactggatt caccgacgca gattatgccg gggacctgga cactcgtcgt tcgacaacgg 3900 gcttcgtctt tcttctaaac agtgggccgg tgtcgtggag cagccgaagg caaccctgtg 3960 tctccttatc aacgacagaa gctgaattta tcgcggcaag tgaaactgtc aaagaagcta 4020 tctggctgaa gcgcctattg acggagatag gcaacaatgg agacaagcca ataccaatcc 4080 tatgcgacaa cgacagcacc atcaagttgg tgaagaacaa tcaatttcac caaagaacga 4140 agcacatcga agtccggcac tacttcgttc gggaaaagca agaagccggt gagatcgagg 4200 tttcctatgt gccaacacaa gataatctag cggacatctt cactaacccc ttcccaatcc 4260 acggttctgt gatctgagag ctagactggg aattacggag gttccgcaca tctaatgttt 4320 gagggagag 4329 // ID DNA8-18B_AP repbase; DNA; INV; 433 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-18B_AP. XX NM DNA8-18B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-433 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1964-1964 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 433 BP; 156 A; 56 C; 74 G; 147 T; 0 other; caagcccgga ttaagggggg tccaggaggg gcacggtccc ggggcccgtt gatttcgagg 60 gcccattttt atccccaaaa tatttatttt gtaatgcgta caaaagtttt caaaaaaatt 120 aaataattta aataattaaa taattttagt tataatagta attataaaat aaaaatgttc 180 ttaattttat atcatgaaaa atttttaaat tataataaat ataaaaatgt ttataaaatg 240 ggagttgtga actacatcaa ctacaaactt tattatccat gattgacgat tgtaagtggt 300 aatttagtac taaattatac aattttatac agtcgtaaaa atttagagat acaatttttt 360 gaagttatat ttgaaagggg gcccactaag tgtatggtat cccggggccc aattaatcct 420 taatccgggc ttg 433 // ID Gypsy-10_AC-I repbase; DNA; INV; 3691 BP. XX AC AASC02060368; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_AC_; KW Gypsy-10_AC-LTR; Gypsy-10_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3691 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02060368; Positions 4536 8226. XX CC Positions [2092-2592] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 538..2736 FT /product="Gypsy-10_AC-I_1p" FT /translation="MPIFPNKEENIRDRLVLGIQDKELSEKLQLKSDLTLT FT EALTMSRQQEMVKKELEEQRKIGVDSVQARNGKSSAQTQKSRSKTPEHSYS FT RTADCSKCGRIHPQGKCPAWGKECLKCGKKNHFAKVCRSRTQKYTPRMSRA FT VHEVDLDLDQVQKQGNESQCKTEETNAFFIDAVNDSNSEPWHVNLELQGST FT VKFKIDTGADVNVLSSDTFNSLKFKPILETVTNVRLSSPGRNLKMEGKFCS FT KVNSRERATFYVVSNKVESLLSRGTSSALGLVKRVESATFSVVKCNPVKIK FT LKEGAIPYCFATARRVSIPLPDKVKKELDRMKALNVIEEVTEPTEWVSPMV FT PVVKLMGTSDTLSRSPTASSDDKDTVRLQEDVNYHVSIVTSSWPVADAKLA FT EIRRETQLDVSLRTALKYTIDGWPEYKEDVMLAARHYYDIRNELSISDGLL FT LRGNRIVFPWSMREFFLERIHDGHLGITKCRERAIQGVWWPGVSKDIKDRI FT AKCRFCAEKRPAQPSEPLLPATLPERPFQKVGVDICTFRREHYLVFVDYYS FT RYIDVLKLESLSSNAVITKMKRIFSQHGIAEIVVSDNGPQYSSQQFADFGK FT EWNFHHVTVSPHYPQANGEAERAVKSAKEFLRQQDPYLALLSYRATPIVGL FT GASSAELAYGRKIRTTLPIVPSILNPHPVDQGRIRERDAASKANQKRYFDR FT TSHPLSDIQPGDYVLLNLKREGGKAWDRPGEII" XX SQ Sequence 3691 BP; 1159 A; 760 C; 836 G; 936 T; 0 other; tggtgtcaga agccaaacaa aactgtacct cctttttaca cctagaagtg taaacctatg 60 atactgtggc atctagtata ctagatctac tggtattttg agcctttgtt atcattcact 120 tttttttttt tgctagcagt tagatctaga ccaagagaaa tctagatcta gatctagacc 180 tagacatgtc gttcaaacca ccgaacgaat tcaattttag agagccagcg gcttggccca 240 gctggaaaga gaggttccag aattttagac tagcgtctaa actgtacaag gatgatgggg 300 aggtccaggt ggctgcattg ttgtattcga tgggccccga tgcagacaca attctaaaga 360 cattctctct gactcaggaa gaaaggaaaa gctttgtcac agtgatagac aagtttacag 420 agtatttcat tcctcagagg aatgtaattc atgtcagagc ccagtttttc aggcgtgagc 480 aaaaccaaag ggagagtgta gaggagtaaa tcagaaacct ctacgacctg gcagaacatg 540 ccaatttttc caaacaaaga agaaaatatc cgggatagac tagtccttgg aatccaagac 600 aaagaactct cagaaaaact tcaactgaag tcagatttaa ccctcacaga agctctaacc 660 atgtctcgtc agcaagaaat ggttaagaag gagttagagg agcaaaggaa aataggagtt 720 gacagtgttc aagctcgaaa tgggaaatct tccgctcaga ctcagaagtc tagatcaaaa 780 acccctgaac attcatattc tagaactgca gattgttcta agtgtggtag gatacatcca 840 caggggaagt gcccagcctg gggtaaagaa tgtttgaagt gtgggaagaa aaatcacttt 900 gcaaaagtat gtagatctag gacacagaaa tacacaccta gaatgtctag agcagtacac 960 gaagtcgact tagatttaga tcaggttcag aagcaaggta atgaatcaca atgtaaaact 1020 gaagaaacaa atgccttctt tattgatgct gtgaatgact ctaattctga accatggcat 1080 gtgaatttag aactccaggg atccacagtg aaattcaaaa tcgacactgg agcagacgtg 1140 aatgtgttga gcagtgacac tttcaactca ctgaaattca aaccaattct ggaaactgtg 1200 acaaatgtca gactgtctag cccaggacga aatttgaaaa tggaaggcaa attctgttca 1260 aaagtgaaca gccgggaacg agctactttc tatgtagtga gcaacaaggt tgaatctctt 1320 ctgagccgtg gcacatcctc agcactaggc ttagtgaaaa gagtagaaag tgcaacattc 1380 agcgttgtga agtgcaaccc tgtgaaaata aaattgaaag aaggcgcaat cccatactgt 1440 tttgctacag caaggagagt gtctataccc cttccagata aagtgaagaa agagcttgat 1500 agaatgaaag cactaaatgt gattgaagag gtgactgagc caacagaatg ggtttcacct 1560 atggttcccg tagtaaagct aatggggaca tcagacactc tgtctcgcag tcctacagcg 1620 agttcagatg acaaggacac agtgaggtta caagaagatg tgaactatca tgtatctata 1680 gtgacctcat cttggccagt cgcagatgca aagttggctg agattcgaag agaaactcaa 1740 cttgatgtta gtctaagaac agctcttaaa tacaccattg atggttggcc agaatacaag 1800 gaagacgtga tgttggcagc aaggcactac tatgacattc gaaatgaact gagtatctct 1860 gatggactcc ttttaagagg taatagaatt gtttttcctt ggtccatgag agaattcttt 1920 ctggaacgta tccatgatgg acacttggga atcaccaagt gccgtgaaag ggctatccag 1980 ggagtgtggt ggccaggagt ttcaaaggac attaaagatc gcatagcaaa atgcagattc 2040 tgcgccgaaa agagaccagc acagcccagt gagcccctgc ttcctgctac tttgcctgaa 2100 agacccttcc agaaggtggg agtggacatt tgtacattca gaagggagca ctaccttgtg 2160 tttgtagact attactccag gtatattgat gtactgaaac tggagtcact gtcatcgaat 2220 gctgtgatca caaaaatgaa acgcatattc tcacaacatg gcatcgcgga aatcgtcgta 2280 tcagacaatg gtccccagta ctcgtcacaa cagtttgcag actttggtaa agaatggaat 2340 ttccatcacg tgacagtcag cccccattac cctcaagcca acggagaggc tgaacgagca 2400 gtgaagtcag ccaaggaatt tctgagacag caggatcctt acctggcact actctcatac 2460 agggctacgc caatagtcgg actaggggca agttcagctg aacttgcgta tggtaggaaa 2520 atcaggacaa ccctacctat agtcccaagt attctcaatc cccacccagt agaccaagga 2580 aggatacggg aacgggatgc tgcgagcaaa gccaaccaga agagatattt tgacagaacg 2640 tcacaccctc tgtcagacat tcagcctggt gattatgtcc tcttaaactt aaagagagaa 2700 ggaggaaagg cgtgggaccg accaggagaa atcatatgaa aatgtgcacc aatgtcgtac 2760 atcgtaaaca ctccaggagg tgagctgagg cgaaacagga aacacattag ggtgcaccaa 2820 cgggactgag accgtgacac ccgatagacc tcgactgcaa tctgaagaat ctgctgttcc 2880 ggtgccagca actctcagta tgcctactac tgcgacaact cccagtcaca gtacatcgac 2940 tcaatcgact cgggagccta cagtcgtgac tgattcagag tcacctgcag aatcgccacc 3000 tcatcacctg tcttcaccag tacctgcacg gcaacagtgc acaaccagat cgggccgcac 3060 ggttgtgaag ccttccagat atcaagttca taatatcacg tattaatgtg ttagataaag 3120 cctccttgtt gtgaagactt ggggaaggct gctcaagcag cggttaggtt gtgtccatgg 3180 ctcaaggata gagcctcatt atctttagtt tgttcaagtt caagttcgag gctcaaggtt 3240 agagccatag ttccaagaaa tgtgtgccat attttcagct ttatatattt cagcagcttc 3300 atagttaaaa cgtttttata aatattattc ggttctctcg gatgtcctct gcgcatggag 3360 actaggaacg taggaacaga ttgtgaactt ttggtccaga ttcaaggaac ttttatttta 3420 gttcaaactt ctgaaagaca attctgaaga gaaactgcag aacattgaca ttccttcgtc 3480 agttgaatca cgaacacttt attgctattt actatcatat tgtgtttgtg attttagaat 3540 aagtttaagg agtaacattc tccactatca tcatcatata cattattttg atttggtttc 3600 tatgtccatg ctatatgggc tacttcatta ttaaaagttt agcttaccgc ttcttctaag 3660 aaagaagttt acttaaaagg ggaaggtgta a 3691 // ID Sola1-9_AP repbase; DNA; INV; 3907 BP. XX AC ABLF01024328.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-9_AP. XX NM Sola1-9_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3907 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(455..637,592..975,979..2340,2328..2828) FT /product="Sola1-9_AP_1p" FT /translation="MELHRSKRASRLVNLVQYNVLPVNTPRNYVVDINLEG FT KKQYKCLDKYLLFFSVIIVSTQSNIFIVFFSYYSIYTIKLIKLFFKLEIQN FT ENIHLRNEEDLVDDPVEDNDIFDDSDEDPTFDPHCDILELIDEIQKENIHL FT PNEEDFVDDPVEDNDIFDDSDGDPTFDPHCNIVEPSDEDEQVSTHSSEINN FT NGGRPKKGRKRKHIHQSRSNIKVLKNSNLGYYNQKNIKVIPKKFKDYDCKC FT PMKCPQKIPLNKRKEHFENFWKLGEYTAQNAYIATLVHELPVKRRYGSSNN FT SNFNKQFTRKYNLDNTQVCRKMFCETTNISTFRVNTALKKMHGRSPILDQR FT GLKNGGLNKISEEKLKTIKDQINKIPKYTSHYCREQSNFQCLPPEMTIEKM FT YLAYREEENEPVSFSTYKRYFYNCFNLKFKNLKKDTCNTCDSLKVQINNEQ FT NVMKKEELNIKHTEHLNLAENAQTLLKIDLDNAKQNEHFQCLTYYMEKTLP FT LPRLPTNIIFYKRQLWLYNTGIFSGKDNQGYCYVWLEGQAERGAQEVGSCL FT RKHIKNNLNNSIKELVLWSDSFGGQNRNIKIVLMMKTLFNNTELETITLKY FT LYPGHSFLPNDRNFSDIESALKHQQRLYTPDDYIHIMKTCKKKILKKNPLT FT VIKMNKEDFVSTEILEKKITNRKVSESGDKMNWLAIRQIKLYRSNPLSIFM FT NNTLSVDNFVEINIKKRCRGRQSTTPFPAEILLTQLWPDGKEINKLQLEDI FT KSMMHLIPADAHAFYSNLIGRNSEIDEDVGGFNELLDFEMETNQIIMLTTS FT QVVTLLV*" XX SQ Sequence 3907 BP; 1488 A; 494 C; 549 G; 1376 T; 0 other; aagttatgca gaaaacaacc aaccctcaaa aacataaata ttgagctatg tatactacta 60 gattggaaat tttatattac gcattttact aaatgcaacc aacccacaag tcaataggag 120 ctctatctat caatgttcaa agatggaact gaatgtgtta tcttacaaaa ataatattgc 180 acaatacatc catgccacat ggttttgtaa tggtggttat actgtgaaaa acgtactatc 240 acacatcaat ccaagcagct tggttgtttt ctgcagattt gataaagata taattttaca 300 ttatcatgaa aaattaattt gtacttcgtt ggcattttta tttattttat aattttatat 360 aatatattgt tattcgtttt aacttcatat aagtaaattt aattttgtta gttaaacatt 420 acgtaatatg cgtcttagtc gttaagtagt ggttatggaa ttacatagaa gtaagagggc 480 cagtcgtttg gtaaatctag ttcaatataa tgtattgcct gtgaatacgc cgagaaatta 540 tgtggtagat attaatttag aaggtaagaa acaatataaa tgtcttgata aatatttatt 600 gtttttttca gttattatag tatctacaca atcaaattaa taaaactttt ctttaaatta 660 gaaattcaga acgaaaatat acatttgcgt aatgaagaag accttgttga tgaccctgtt 720 gaagacaacg atatttttga tgattctgac gaagatccaa cttttgatcc gcattgtgat 780 attttagaac taattgatga aattcagaaa gaaaatatac atttgcctaa tgaagaagac 840 tttgttgatg acccagttga agacaacgac atttttgatg attcggatgg agatccaact 900 tttgatccac attgtaatat tgtagaacca agtgatgaag atgaacaagt gagtacacat 960 tcttcagaaa taaattaaaa taatggtggc aggccaaaaa aaggcagaaa aagaaaacat 1020 atacatcaat ctcgctcaaa cataaaagtt ttaaagaatt caaacttagg ctattataat 1080 caaaaaaata taaaagttat ccccaaaaaa tttaaagatt atgattgtaa atgccctatg 1140 aaatgtccac aaaagattcc actaaacaaa aggaaagaac attttgaaaa tttttggaaa 1200 ctaggcgaat atactgctca aaatgcatac attgccactt tagttcatga gttaccagta 1260 aaaagaaggt atggttctag caataattcc aatttcaata agcagttcac aagaaaatat 1320 aatttagata atactcaagt gtgtcgcaaa atgttttgtg aaacaacaaa tatatctaca 1380 tttagagtta atactgccct gaaaaaaatg catggtaggt cacctatcct tgatcaaaga 1440 ggattaaaaa atgggggcct aaataaaatt tctgaagaaa aactcaaaac tataaaagat 1500 caaataaaca agattccaaa atacacatcg cattattgca gagaacaatc aaattttcaa 1560 tgtttacctc cagaaatgac tattgaaaaa atgtatttag cttatagaga agaagaaaat 1620 gaacctgtaa gcttttcaac ttataaacga tatttttata attgttttaa tttaaaattt 1680 aagaacctta aaaaagacac ctgcaatacc tgtgattcat taaaagttca aattaataat 1740 gaacaaaatg tgatgaaaaa ggaagagtta aacatcaaac atacagagca cttgaattta 1800 gcagaaaatg cacagacctt attaaaaata gacttagata atgcaaaaca aaatgaacac 1860 tttcaatgtt taacatatta tatggaaaaa actttaccac tgcctagatt acctacaaat 1920 atcatatttt ataagagaca gctttggctt tataataccg gaatcttcag tggtaaggat 1980 aatcaaggct attgttatgt atggctcgaa ggtcaagctg aaaggggtgc tcaggaagtt 2040 ggttcatgtt taagaaaaca cataaaaaat aatttaaata attctattaa agagctagtt 2100 ttgtggtcag attcttttgg agggcaaaat agaaatatta aaatagtttt aatgatgaaa 2160 acacttttta ataacaccga actcgaaaca ataacactga aatatttata tcctggtcat 2220 agttttttac ctaatgatag aaattttagt gacatagagt cagcactgaa gcatcaacag 2280 agactttaca ctcctgatga ttacatacat attatgaaga cttgtaaaaa aaaaatcctc 2340 taacagttat taaaatgaac aaggaagact ttgtaagcac tgaaatttta gaaaaaaaaa 2400 taactaatag aaaagtatca gagagtggtg ataaaatgaa ttggcttgct attagacaaa 2460 taaaattata tcgtagtaat ccattatcta tttttatgaa taatacatta tcagttgaca 2520 actttgtaga aataaatata aaaaagcgtt gtcgaggacg acaatccaca accccctttc 2580 cagcggaaat attattaacc cagttatggc ctgatggcaa agaaattaac aaattacaac 2640 ttgaagacat aaaatccatg atgcatctaa ttccggcaga tgcacatgca ttttattcta 2700 atttaattgg aagaaatagt gagattgatg aagatgtggg tggatttaat gaattattgg 2760 attttgaaat ggaaactaat caaattatta tgctaactac tagtcaagtt gttactttac 2820 tagtatgatt attttattat cttttttaag aattttataa tattgtttgt tttctattta 2880 attttttttt tttaagattc aaaaagtggt ttattattaa ttattttgat gttactaatg 2940 aagtagtttt tttaatatta tcaatatttt actatccttt ttaggatttt tttactatta 3000 taactcaaga acagatttca tttttatgtt aactactagt caagtaagtt gttacattac 3060 tattagatac attaactcaa aagactgaga taattactta atgttgaagt aaagtttata 3120 aatattatat tttaccatcc tatttagaaa ttgtatacat tttatgactc aagaacagat 3180 ttattatgaa ttattataat gttaactact agtcaagttg ttactttact agtatgatta 3240 ttttattatc ttttttaaga attttataat attgtttttt tattttattt ttttttttaa 3300 gattcaaaaa gtggtttatt accttaatta ttttgatgtt gctaatcaaa tagtttttta 3360 ctattatcaa tattttacta tcctttttag gaattgttta ctattataac tctagaacag 3420 gtttaattat tatgttaact actagtcaag tagttgttac attactttga ttagtatttt 3480 tttattgttt ttttttaatt ttataataat ttgactcaaa agactgagat aataaatatt 3540 taatgttttt tgttgtaata tacatcttat gttttaatca cataaaatgt ttttttttct 3600 tatttaagta taataatatt attaaagttt tcattttaaa aacaaccaac caacataaag 3660 caaccatgcc acatttttac attaatcata aaacaaccaa ccaacataag tcttaaaata 3720 atggtgagta ttaatttgta tcaggatttt attggtaagt cattgaaata catacatcaa 3780 attaataaga tatattggtt ttaggtcata tgaggtgcta accaatttaa aattaatatt 3840 tttctaaaac ttgaattatc tccaaataga acatgtgtgg gatggttgct ttctgtataa 3900 cgccgtt 3907 // ID AMARI repbase; DNA; INV; 897 BP. XX AC . XX DT 31-OCT-2005 (Rel. 10.11, Created) DT 31-OCT-2005 (Rel. 10.11, Last updated, Version 1) XX DE Mariner-type DNA transposon from Apis mellifera (a consensus). XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; AMARI. XX OS Apis mellifera OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Apis. XX RN [1] RP 1-897 RA Jurka J.; RT "AMARI: Mariner-type DNA transposon from honeybee."; RL Repbase Reports 5(11), 341-341 (2005). XX DR [1] (Consensus) XX CC Present in multiple copies >90% similar to consensus. The CC consensus sequence was derived from Genome Assembly Amel 3.0 CC available from BCM-HGSC. XX FH Key Location/Qualifiers FT CDS 114..809 FT /product="AMARI_1p" FT /translation="KWSVIRYNNVEWIINHEKVDAGFSSKIILSDEAHFHL FT DGFVNRQNCRVWGSEKPRVISEKQMHPQRVTVWCGFWAGGIIGPYFFENEA FT GQAATVNGARYRDTITRFFLPKLDDIDVADMWFQQDDATCHTANETIQLPH FT ETFPGRVLSRFGDQNWPPRSCDLTPLDFFLWGYLKSKVYVDNPTTTRALQE FT EIKRCIDEIQPQLCRKVMKNLDERVRMCPMCYSINNPILCTL" XX SQ Sequence 897 BP; 271 A; 175 C; 196 G; 255 T; 0 other; taaagggtgt cccaaaatta acgcaagatt tgaatttgcc gccatttttg catcaagttg 60 ttggcaagcc tgaaaaaaga acagtttgac agctgagagt ttagggttag taaaaatgga 120 gcgttatacg atacaacaac gtggaatgga ttattaatca tgaaaaagtg gatgctggtt 180 tttccagcaa aataatccta agcgacgaag cacattttca cctcgatggc tttgttaatc 240 gccaaaattg ccgtgtttgg ggttcggaga agccacgtgt gattagcgaa aaacaaatgc 300 acccacaacg tgtcactgtt tggtgcggat tttgggcagg aggcatcatc ggaccatact 360 tttttgagaa cgaggctggt caagcagcaa ctgttaatgg tgctcgatat cgcgacacga 420 taacacggtt ctttctgccg aaattggatg atattgatgt ggccgatatg tggtttcaac 480 aagacgatgc cacgtgccat acagccaatg aaacgattca attaccgcac gagacatttc 540 ctggtcgtgt actctctcgt ttcggtgatc agaattggcc ccctagatcg tgcgatttaa 600 cgccattaga tttcttctta tggggttatt tgaagtcaaa ggtctatgtc gacaatccca 660 caaccacacg tgcattacaa gaggaaatta aacgctgcat cgacgaaatt cagccacaat 720 tatgcagaaa ggtgatgaaa aatttggacg aaagggtgcg tatgtgcccg atgtgttatt 780 ccataaataa ccctatccta tgtactttat gattcgctta aaaaataaat atctaaagac 840 taaaaactct cttttatatt taattcaaat cttgcgttaa ttttgggaca cccttta 897 // ID BEL-7_AA-I repbase; DNA; INV; 5976 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_AA_; KW BEL-7_AA-LTR; BEL-7_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5976 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 863-863 (2011). XX DR [2] (Consensus) XX CC Positions [5004-5567] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2635..4416 FT /product="BEL-7_AA-I_1p" FT /translation="MSPQKLMLSTEDERAIKLMQTSTKFDDGRYQTSLLWK FT YDDVRLPNSKPMALRRHNCLMKRMKRDPILAAAIKSKMEDYLEKGYIRRLS FT PDEIQQPRNKVWYLPIFPVFNPNKPNKIRIVWDAAASVGDVSLNSVLMKGP FT DLLNDLTSVLYRFRERPIAVSGDIAEMFLRVKMNETDQHCQRILWCQDDGT FT TAEYAVTVMTFGVKCSPSCAQYIINLNASKFESQFPEAVEVIYKGHYVDDM FT LASVDSEDDAIKLAKDICFIHEQGGFTMRNWISNSSTVVQALGESGSAEKN FT LYADKEAVLEKVLGIWWDTQTDTFRFKLSLERHQELLSGAKHPTKRDILRV FT MMSVYDPLGLLSHYMIYLKLLFQEIWREGTLWDDEIAETQLQKWKCWLHLL FT PEAESARIPRCYDLSPSLGTNRLVELHTFVDASELGYAAVCYFRFIEENRI FT TCSLVASKIKVAPLKFVSIPRLELQAAVIGTRLAKSVELGHSISIARRFFW FT TDARDVICWLKSDHRKYSQFVAFRVGEILETTDVNEWNWIQGKDNVADEGT FT KWNGLPSFDGKCRWFAGPSFLSKPTVEWRTDMVNGYTVEELRPSLLY" FT CDS 4713..5975 FT /product="BEL-7_AA-I_2p" FT /translation="MRIRSRIDVCDFADNDTQFPIVLPRGHPLTKLVLQHI FT HEKYHHTGNETFINESRRKFYIPRVRSECEKVRRTCQHCKIRRAKPEPPMM FT ASLPKQRLAAYVRPFSYVGIDYFGPYQVIVGRHPEKRWGVLITCLTTRAIH FT IEIAHSLSTSSCIMALQNCFARRGTPLQIISDRGTNLVGASKELKEALRNL FT DTSKIAAEFTTASTAWSFNPPSSPHMGGAWERLIQTVKRILIQIKPSTRLP FT TDECLRNCLIEIENIVNSRPLTHIPLDDESSSALTPNHFLVGSSNGLKPLV FT SFDDNVISLRQSWKTSQILANYFWKRWVVEYMPMITRRSKWFYPVKPVAVG FT DIVIIVDPSFPRNCWPKGRVIAVKQSKDGQVRSATVQTASGIYDKPVVKLA FT VLDVGTNRSRPDEHPVTAGDCHERPSS" XX SQ Sequence 5976 BP; 1686 A; 1349 C; 1359 G; 1564 T; 18 other; ctgtattctw ccgttgcctt cgcaacattt taaagtttcg tttttaatcc ggacgaattt 60 gattggacga tgcatcggcg ggcgtatgtk ctagaagaca gtatcgwaca aaactgtcca 120 ctttgtaktc taccggatag cagtcaaatg gtcagctgcg atgamtgcag ccgttggttt 180 catttggctt gcgcaggcgt gactgagaca atcgcggatt atgattggag ctgccaggga 240 tgcgttaatg ctagaatctc tgctcctgtt ccgcagttgt cgtcttcaca catacaatgc 300 gacgctatta ccactggact gccaackgga ctacaaccga cctcaaattt gccaccaatc 360 tmtggacagg aagcactcgg agttccagtc tcgcttgttw gtggaccaag cgtacaagac 420 accatctcga ccagtgtctt ccccgtaaga caaaataaca ccgcccatgt agtatstgca 480 ccagaaaaca ctactgtttc gggtccagmc atggatacga ggcatcagtt ttctgccctg 540 cccttggtcc cagatgctgc tccaataatg acgactgcac cwttgacaca accgtcgagt 600 gctgattttg ctgcacaatt gccgatcgag ctgaggcaag catttaatgc aaaagtatca 660 gatgctgatg gaagatacta ggatcccatc aagatctgct aatgtaccac aattcaatgc 720 atcatctacg tcgttaattc agcaaccctt tatgtcggcc catcagtcac acataccgac 780 cgttggattc gcttcaacct cagccattcc taatccgcag caccattcat ccccaatgca 840 aaatttgcac gaaatcaaca ggcctatgcg taccagtctg ccgtaccaat cgttacccgt 900 ccagcaacct aacccgcaat atcatccaag catagatcca ttatcaacca ttgggcttgg 960 aaatgatcaa acagtgctga ataggagtca gattgccgca cgtcaagccg tggttgccac 1020 aatccaaaat ttcaatttgg aagaacatct ttacaatatt tcattgctac aaaaccttgt 1080 cgatcgatta ccacctatga tmcggctcaa ttgggcgact tatcgccaag gtttatatcg 1140 agtaacactc acgcaattca gcgmctggtt mtacaaccta gccgaagcag ctagtgcggt 1200 gactattccc ccgactctag gatcgataga aaaacaacgc tacactcgta aggacgacgg 1260 atttttaaat gcccacacgg aaacttttga agaacaagga ttatcttcta aagcatttcg 1320 tcgatgcctc gtgtgccaag atgactgtaa aactgtagag gattgcagca agtttgcggc 1380 aatggacctt ccatcccgtt ggaacacttt acgtgaactt aaactgtgtc gatgctgcct 1440 tggaaagcat ttcggttcct gtaagtccac gaaggtttgc agtataaatg gttgtgcctt 1500 caaacaccat cagttgctgc acaacgtktc taaggatagg cgcacggaaa ccgccacagg 1560 ccaccatgwa cctgcaaaca atttacctat tcaaaacagc gttgaacaca attgcaatac 1620 ccatagagga aacggccaat cggtatattt ccaatacgtt ccagtcgttc tccataatca 1680 aggaagaaaa attcgcactt acgcattctt agactgtggc tcgtcgttga ccctgattga 1740 agaaggtctt gcttctgaac tgcgactcaa aggagaaaag catccattgt gcttacgttg 1800 gactgccgat acgtgccgat acgaggattc agcaagaatt gtcacgttgg atatatccga 1860 ttcaaacgaa attagtccca aacatcggct cgacgaagtg tattctgtga aggagctcaa 1920 gctaccggtg caatcattat cgttcgagga gctatcagct aagttcgctt atcttcaagg 1980 ggtaccagta gattcttacg tggatataca gccgcgaatt ttgattggta tgaacaatgt 2040 tcgtctaacc cacccacccc ctagaatgtc gagagggtga tgcgaatgag cccgtggcgg 2100 caaggactag attaggatgg atggttttcg gaacatggcc cgtacaaagt acaacggtcc 2160 acactgctgt tccacatagc ttccacattt gtcatcactc tgctcccaat gactttgaac 2220 tcaatgtggc cgttaaacaa tatttcgctt tagaagggtc ctattatgga tcaaatcgac 2280 tcatattatg gatcagaaca ctataaagca atttatctgc gaggtaaacg aatggtgtgc 2340 tagtgaagca tacgttccgg cgttttttcg gaatcgtctt atcttgattc ataactgtgg 2400 tagggtttca atgccccaca gttatgaatc aacgtctgga atacgtgaca ttttatttcc 2460 tacggacgga aaaatatgag atgttacgat tgattgcgat gctgtacaga tttatggttc 2520 acgaggtaaa atgtgttctg gtaattgcag ctctattgat gagatattcc ctttacaatc 2580 tgatccatat ctgtgtgcta tccataactg tggggtgact gtagtatcgg aatcatgagt 2640 ccacaaaagc taatgctgtc aacagaagat gaacgcgcaa ttaaactaat gcagacatct 2700 acgaaattcg acgacggcag gtatcaaacg tcattactat ggaagtacga tgatgttcga 2760 cttcccaata gtaaaccaat ggcgttgagg cgccacaatt gcctgatgaa acgcatgaag 2820 cgtgatccaa ttctagctgc agccataaaa tctaaaatgg aggactactt agaaaaggga 2880 tacattcgta gactttcgcc cgatgaaatt cagcaaccaa gaaataaggt ttggtattta 2940 cctatattcc ccgtttttaa cccaaataag ccgaacaaga tccgcatcgt gtgggatgct 3000 gcagcttcag taggtgacgt ttcgctcaat tctgtcctca tgaaaggtcc agatcttctg 3060 aacgatttaa cgtcagtatt gtatcggttt cgagagcggc cgattgcggt ctcaggagat 3120 attgctgaaa tgttccttcg cgtgaagatg aacgaaacgg atcaacactg ccaacgcatt 3180 ctttggtgtc aagacgatgg tactaccgcc gagtacgcag taacggtgat gacctttggt 3240 gtaaaatgct cgcctagctg tgcccagtat atcatcaacc ttaacgcaag caagtttgaa 3300 tcgcagtttc cagaagccgt agaagttatc tataagggtc actacgtcga cgatatgttg 3360 gcgagcgtcg atagtgaaga tgatgcaata aaactagcaa aagatatctg ttttatacac 3420 gaacaaggag gattcactat gaggaactgg atatcgaatt cctcaaccgt cgttcaagcg 3480 cttggtgagt ccggatcagc tgaaaaaaac ttgtatgccg acaaagaagc ggtgttagag 3540 aaagttctcg gcatttggtg ggacactcaa acggacacat tccgtttcaa attatccctt 3600 gaacgacatc aagaactact ttctggtgca aaacacccca cgaagagaga tattctacgg 3660 gttatgatgt cagtctatga tcctctcggc ttactgtccc actatatgat atatttgaag 3720 cttttgttcc aagagatttg gagagaaggg acattatggg acgacgagat agcagaaacc 3780 caattgcaga agtggaagtg ctggttacat ttactccctg aagcagagtc agcgcgtatt 3840 cctcgctgtt atgatttgtc accttcattg gggacgaata gattggtgga acttcatacg 3900 ttcgtggatg caagcgagct cgggtatgct gcagtatgct acttccgttt catcgaagaa 3960 aaccgaataa catgttcgtt agtggcgtcg aaaattaagg tggcgccgct gaaattcgtc 4020 tctattcctc gcttagaact ccaggcggcc gtaattggaa cccgattagc gaaaagcgtt 4080 gagctaggac actctattag tatcgcacga cgtttttttt ggactgatgc ccgtgatgtg 4140 atctgttggc tcaaatctga ccaccgaaaa tattcccagt tcgtcgcctt ccgtgtcggg 4200 gaaatattgg agaccactga cgttaatgag tggaattgga tacagggaaa ggataatgta 4260 gccgacgaag gaacaaaatg gaatggactt ccgagttttg atgggaaatg caggtggttt 4320 gctggaccat cgttcttatc caaaccaacg gtcgaatggc gaacagacat ggtaaatggt 4380 tataccgttg aagagcttcg tccgagtttg ttgtacmaca gcgccgatgt agttgataaa 4440 attttggatc ctgaaaaata ttcgaattgg atgcgattac tacgtcacac tacttgggtg 4500 cttcggtttc ctgcaaatat cttaaggaaa ggtgaaaatc gtcttcgagg accgttaact 4560 tgcgaagaac tackggtggc ggagaatttc ttgtataaac gggcacagat ggatggttac 4620 atgcaggaaa tggaaacgtt ggccgcaaat aagcgtctga ctaaaactag cccttataca 4680 aaactaatcc gtacattgat gactgcggtg ttatgcgtat acggagtcgt atcgatgttt 4740 gtgactttgc tgacaatgac acacaattcc ctatagtttt accccgcgga catcctctca 4800 ccaaattggt attgcagcac atacatgaga aatatcacca caccggaaat gagaccttta 4860 tcaatgaatc aagacgtaaa ttttacatcc caagagtgcg atcggaatgc gaaaaagtgc 4920 gtcgaacatg tcagcattgc aaaatacgcc gagcaaaacc tgaacctccg atgatggcaa 4980 gtctaccaaa gcaacgattg gcagcatatg tacgcccatt ttcatacgta ggtattgatt 5040 actttggtcc ataccaggtc attgttggac gacaccctga gaaacgctgg ggggtgttaa 5100 taacttgcct gactacacgg gcgatccata ttgagattgc gcattcttta tccacgtcgt 5160 cgtgtattat ggctctacaa aactgttttg cccgcagagg gactccactg cagataatta 5220 gtgatcgtgg aaccaacctt gttggggcct ctaaggaact gaaagaagcg cttcgaaatc 5280 ttgatactag taaaatcgca gcagaattca caacagcttc aacagcatgg agcttcaacc 5340 caccctcttc tcctcatatg ggtggggcat gggagcggtt gatccagacg gttaaacgaa 5400 tcctcatcca aattaaaccg tctacacgtt tgcccaccga tgagtgtttg agaaactgtt 5460 tgattgaaat agagaacatt gtcaacagca gaccgttgac gcacatccca ttggatgatg 5520 aatcttcctc tgcgcttacc ccaaaccact ttctggtggg gtcttcaaac ggattgaagc 5580 ctctagtatc tttcgacgat aatgtcatct ctttaaggca atcatggaaa acatctcaaa 5640 ttttggcgaa ttatttctgg aagcgatggg ttgtagaata catgcctatg ataacacgac 5700 gatcaaagtg gttctaccca gttaaaccag ttgcggttgg agacatagtt attatagttg 5760 accctagctt ccctagaaac tgctggccca aagggagagt cattgctgtt aaacaatcca 5820 aggatggcca ggtgcgatcg gcaacggtgc agactgcatc cggaatttac gataaaccgg 5880 ttgtgaaatt ggcggtgcta gacgtcggga caaataggag taggccggat gagcatccgg 5940 ttactgcggg ggactgtcac gaacgcccct cgtcca 5976 // ID Crack-5_HM repbase; DNA; INV; 4353 BP. XX AC . XX DT 15-SEP-2009 (Rel. 14.09, Created) DT 15-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4353 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1936-1936 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 977..1609 FT /product="Crack-5_HM_1p" FT /translation="MAEISNFKHQAFNFSKMKTLIDPNYSHDLNNDCFYYY FT PSEIEGSLFNFNNSEQIKILHVNIRSISHNFEKLLYLLEETKNVFNIICLT FT ETWINANDLSNNANFYLPRFELISQERHTNKRGGGVLIYVKETIAHYVRND FT LSVSDGDKEILTIEILNNESKNVLLSCCYRPPDGLSENLSTFFEQNVIKIG FT IKEKKKIYYWRSKHEFFKLRE" FT CDS 2458..4047 FT /product="Crack-5_HM_2p" FT /translation="MDDCICSNNLSSELSFEEFQVSFKSIKKNKAPGADEI FT NGNIVLECFEQLKDILFKVFRASIQQGVFPDKLKIAKIIPIYKEGDRTNIC FT NYRPISILSTFSKILEKIIYNRLYTYFKEKNFFYKNQFGFKKNSSTEHAII FT QFVREISNSFKNSQFTLGIFLDLSKAFDTVNHGILIHKLKYYGVDGVILKW FT FKSYLTNRKQFVSFNDPYQKNVLDISCGVPQGSILGPLLFLIYINDLHKAS FT NLLSIMFADDTNLFLSDNNINKLFTTMNEELKIISSWFKCNKLTLNIKKTK FT WILFHSITKKRYLPPTLPDIFIDHIEIKRDCTTKFLGVFLDENITWKQHID FT YVGTKVSKNIGILYKARIYINKKVLSQLYYAFIHSYMNYANIAWASTEKSK FT LQSLYRRQKHAIRVINFADRFSHSKHLFSEMRILNIYELNIFNILCFVYIC FT KNNLSPAVFNDLITLKPTNKYSLRNNNLLYEPFCRTSFNQFCIDYRAPHLW FT NKIVLPNFNFDLPTSLHLFKVKLKNLFLSRIIKI" XX SQ Sequence 4353 BP; 1689 A; 595 C; 587 G; 1482 T; 0 other; gcgacctttt ccttaaagtt tgaaacttga aagatttgga aattgaaagt ttgttgttgt 60 ctttttattt tttttagaaa aagtgttttt tttttctttt aaatttgtat aagtgaaagt 120 tgcttttaaa gttgtttgcc ggatttgaaa ggtgattatt gagtgagtcg ttattgtttt 180 taattgtgcg aagatggatc tatcaatgaa aaatattgag aaactaataa tatctaagct 240 tgaagatcag aagcaaaaca ttttaaaaga aactgaaaaa ctattaaaaa aacaagaaga 300 aacttttaca aaaataatga gttccaacat aaaaataata tcagacagac tagataaact 360 tgaaaaagaa cttagtttta ataaatcccg agttgaaata ctagaaaaag atactagcga 420 cattaaagat agtattaatt tccaagagga aaacatcaag gaaatgctgt ctcaactaaa 480 tttaaagatc aatgaagaaa taaatacttt gaaaaaaaat atatagatct agaaaataga 540 tctcgccgca ataatttaag agttgatggt gttgcagaac tatcgtctga aacgtgggat 600 gactgtgcaa gagcagttaa aaagatcttt aaaaaccagc ttggaattgc cgatgatata 660 gtggttgagc gagcccatcg aataggaaga gttaaagaca agctaccaag aactatagtc 720 cttaaactcc taaattataa tgataaatgc aaaattttaa catctgtgaa aaaacttaag 780 ggcactggaa tttttatcaa tgaagattat gcaaaagaaa ccttagaaca tcgtaaaaaa 840 ttgtgggaag aagttaaaag acatcgtaaa gaaggtaagt atgcagtaat caaatttgat 900 aaaatttttg ttaaagatat gcataattaa aataatgtag tcgcggaaat gaaaggagat 960 tctaactctc actggaatgg ctgaaattag caattttaag catcaagctt ttaatttttc 1020 aaaaatgaaa actttaatag accctaatta ttctcatgat ttaaataacg attgctttta 1080 ctattatcct agtgaaatag aaggatctct ttttaatttc aataacagcg aacaaattaa 1140 aatccttcac gtaaatatta gaagcataag tcacaacttt gaaaagctcc tttacttatt 1200 agaagaaact aaaaatgttt ttaatattat ttgtttaact gaaacttgga taaacgcaaa 1260 cgatttaagt aataacgcaa acttttatct tccccgtttt gagttaatat cccaagagag 1320 acatacaaat aaacgcggtg gcggagttct tatttacgtg aaagaaacca ttgcgcatta 1380 cgttaggaat gatttaagtg tttctgacgg cgataaagaa attttaacta ttgaaatctt 1440 aaacaatgaa tctaaaaatg tattactaag ctgctgttat cgaccacctg acggtctgag 1500 tgagaatctg agcacgtttt ttgagcaaaa tgttatcaaa ataggcatta aagaaaaaaa 1560 aaaaatttat tattggcgat ctaaacatga attttttaaa ctacgagaat gactgtaaaa 1620 taaaaaaatt ttatgattta atatttgaaa taggagcaat tcctttaatt aatcgtccaa 1680 ctaattaatc gtccaactag agtatcagca aactcagcaa ctctcattga taacattatt 1740 acaactgatg tcttcaataa caatatacaa aaaggtatat tgaaagcaga tataactgat 1800 cattttccaa tatttctatc attcaactca aattctaaaa ttaaaaccaa aactaataat 1860 aaaataagta aacgcatata taatataaac aatattgaac agtttaaaaa acaattatca 1920 ctgcttcatt ggaagcatat aaattttaat gacgacgcaa ataaaattta tgaaaatttt 1980 tttaaaacat tttattctgt ctatgatgcc aattttccca tctgtgaaaa aacaataaat 2040 caaaaaagct taaatgttcc ttggataact aagggtttta aaaaatcatc taagatcaaa 2100 caaaagctat atataaatta cttaaaaaca aaaacatctt taaacgaaaa attatacaaa 2160 gattacaaac atctatttga aaaaattcgt aaaaatttaa aaaagaatta ctactcgaaa 2220 ttaattgaaa atgttaagaa cgactctaaa cgtacttgga aaatattaag agaaattggt 2280 ggaaaacaaa aaacttgctc aggttcccta ccgcaggtga ttagagtgga taacaaagat 2340 atttttgaac ctagattaat agctcatgag tttaacaaat tctttattga tatcggttcg 2400 aaactagcaa ataaaattcc taatacccaa gttgtgttca ataacttttt aacaccgatg 2460 gatgattgca tttgttcaaa taatttatct tctgagctat catttgagga gtttcaagtg 2520 tcgtttaaat ctattaaaaa aaacaaagcc cctggagctg atgaaataaa tggaaacata 2580 gtcttagaat gctttgaaca attaaaagat attctcttta aagttttcag agcgtcaata 2640 caacaaggag tttttcctga taagttaaaa atagctaaga taatcccaat atacaaagaa 2700 ggagacagaa ctaatatatg taactaccga cctatttcta ttttatcaac attttcaaaa 2760 atcttagaaa aaattattta caatagatta tatacttatt ttaaggagaa gaattttttt 2820 tataaaaatc agtttggctt taaaaaaaat agttctactg agcatgcaat cattcaattc 2880 gtccgtgaaa tctccaactc tttcaaaaat tcccaattca ctttaggtat ttttttggat 2940 ttatcaaaag catttgatac agtaaatcat ggcattttga ttcataaact taaatattat 3000 ggagtggatg gtgtaatttt gaagtggttt aaaagttatt taacaaaccg aaaacaattc 3060 gtttctttta acgatcctta tcaaaaaaat gttttagata tttcatgtgg tgttccgcaa 3120 ggatcaattt tagggccact tcttttttta atatacatta atgacttgca taaagcttct 3180 aatcttttaa gtattatgtt tgcggatgat actaatttat ttttatctga taacaatatt 3240 aataaactct tcactactat gaatgaagaa ctcaaaataa tttctagctg gttcaaatgc 3300 aataaactta ccctaaatat taaaaaaaca aaatggattc tttttcattc gataactaaa 3360 aaacgttatt tacccccaac tctacctgat atttttattg accatattga aataaaaaga 3420 gattgcacca caaaattttt aggcgtattc cttgatgaga acatcacatg gaagcaacat 3480 attgattatg ttggtactaa agtttcaaaa aatattggaa ttctttataa ggccagaatc 3540 tatataaata aaaaagtttt atcccaactt tattatgcat tcattcatag ctatatgaat 3600 tatgcaaata ttgcttgggc aagcactgaa aaaagtaagt tacaatctct ctatcgccgt 3660 cagaaacatg caatacgtgt aattaatttt gctgatcgtt tttctcactc taaacatctt 3720 ttttctgaaa tgagaatact taatatttac gaacttaata tttttaatat tttatgcttt 3780 gtatatatat gtaaaaacaa tctttctcct gccgttttta atgatttaat tactttgaaa 3840 cctaccaata aatactctct aagaaataac aaccttctat atgagccttt ttgtcgaaca 3900 agttttaacc aattttgcat tgattatcgt gcaccccatc tttggaacaa aattgttttg 3960 cctaatttta attttgatct acctacttct ttgcatcttt ttaaagttaa gctgaaaaat 4020 ctttttctct cccgtattat caaaatttga caataatatt tatattatgg aaatatttat 4080 taaactttat tatatatact acaaagtgat atatgtatgt atattttatt tttattttgt 4140 tgtacttata gttactttat aatataatat tttgtttaca ctagtataag gttctgacga 4200 taagatctta ctgatcttct ttcagatacc tagtttttca aatgttagta gtatttgctt 4260 actttataat ttgtattaat tttttattta ctctcattat attgtaaaaa cgaactaaca 4320 aatgtaaact aaaatattaa aaaaaaaaaa aaa 4353 // ID DNA8-2_AAe repbase; DNA; INV; 2744 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2744 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1282-1282 (2011). XX DR [2] (Consensus) XX CC >94% identical to consensus. 8-bp TSDs. XX SQ Sequence 2744 BP; 960 A; 474 C; 440 G; 870 T; 0 other; catgggcgta aatagccatc agacttgagg ggggccacgg ccccttctca tgagagataa 60 aagtcgtaaa ggatttagat aacttaacaa tgcttgtttc gataatattt gtgaaccgat 120 tatagtggcg cactttcata cagaggtaga tcaaaaccta aaaatcgtcc aaaaccgact 180 gatcaacttt cttagatacc aaatgtcact tttatggaaa aatagtaaaa atttatttaa 240 aaatctatgt tttcgtttta tattgccaag atgacgtatt caacccaatc ttatagatct 300 cgaatagaat acaaggattt aaaagcagat tataatcagg atttaccttt ccaattcgat 360 ttgaactaga tgtttgcaaa gttataaaac aagataaaga tttaaatgta tcacccaagc 420 ctttgaaaaa aatgcttgct agtttgaaaa tatggattat tgttgctcag catttctgta 480 gcaaacgatt catttctcta aacctctcga aaccaaaatt tgacatagga ttgaataata 540 ccttttttgt tagagttaaa aactttaaat cctaaagctg cttggctacc atcatattga 600 aggattgaag tttttttgta ttcaaataaa gctgtcgtca gcaaattcgt acgataacag 660 ttgtttgaag aaaacaggat ggaaggtatt ttgctcatgg atttattaat atccaatcat 720 agtaaggcag agcaaaagtt cgaccttagt ggtataatca aaatttttat aaaaacaata 780 gcagttgaaa ataaatacca tacagtgaac cttcaacata tttgctataa tttcgctgat 840 tgtcttaaac gaacttttgc cccatcggtg gggctcgaca cttttacgac agacatactt 900 tatataaact tatgtttaca gataaataca ctctaatttt tcatcaaaaa attgcccaaa 960 gcccaaaaaa aaacgttttt tcgcaaaact aggtgaaaat gtgaaatgtt ggtgactttt 1020 ttcgcacgat caggcaaatt tcgctaaact tgaagataat atgtgaattt ggggtaattt 1080 gtaaatttct ctgctagatt atatgggtcg aacttttggc ccgctgattt gaacttttgc 1140 cacactatgg gccaaaaatt gattccaagc aattatgtag taactaatac acctcaaagc 1200 atccttatga tgcctaaaaa cgcccttaca taaaaaatca aagaagattt tatccttatt 1260 tatttccatg caatgaaagt ttgatcaaaa attataatgt ttacgtcaaa aaacaacaaa 1320 tagccattat ttttccaaat ctcaatcgat tcttatgata tttgtagtga aagtctctta 1380 cttgaacact atggcattta taagcttgat ttgatttgag ctagaaatcc taaacattcg 1440 atatcttatc attaatctag aaacaagttt tatctatttt tcttacaaat aacgcaacgt 1500 cttctaatta cagaaatttt acatgaatgc atatatattt ataacgaaaa aaaaacatct 1560 ctgaaccgaa ttgatgtggt gacggacctg gtgtagaaca cacgtcgagg acctgggatc 1620 gaatcccatt ctggaggtag tcacttatga ctcaaaaagt aatagtggcg acttcatttg 1680 gaagggaagc gaagctgttg gtctcgaaat gaactagccc agggttaaaa atctcgttaa 1740 caaagacaaa caaaaagaat tgatgtaatg tccatataaa ctacgtaaga cgccaaaaaa 1800 attggcgcaa gaaaaaataa taaaaataaa taaaaaaagg gtcatattat aaacacttat 1860 agaagaatta aaagcgtgtc ttaaaacaat cttcagtatg aataaggttt tacgtttgtt 1920 ctacagaaac aaatagtaac atggtcttat tatcttttca atctaaaaac ttcagaaaat 1980 tggtgtcaac ataacattga aaaaaaaatt gttgcgacat ctacgatttt acacggttcg 2040 atgtggtttt tgtacgaatc cgccactatc ttactatacg taaaggaatt tcaatgttca 2100 cagttttgct gtcacatttg aatgcaactg cttcactgtc gtctacttac cagcagttca 2160 tatctgttcc cgacgtctcg gagacaaatt tatcgaatgg ctcataaatt tgaatatgaa 2220 acatttaaaa gttcgcgaaa aatgtataca aacaaagttc gcttgctgca ccattgttta 2280 ttttcgagat tttttttttg tagaacttta ccataactct catacacaaa catttcccag 2340 taacaacgat tcgttctaga taattgatca caaaatatcg cagcttgttt taaaatatga 2400 aaagactttg gttttctaat taacaatacc actacaaatg gatcaagcca gttcttctaa 2460 ttctaaaaca tgcttcaatg aatttcaata tcgggaaagg caaaaaggaa acagtatggc 2520 agtatgtgac gttttaaaat tacatattgc tattgcctgt caaacaatga tcatccataa 2580 atttatattc aaaatgcatt taatgcgtct agtatcataa gtgtttgata ctaaaaacta 2640 gttttaactg tattgtcatt tcttctccat tttttttcgg gggggccaag cccatggttc 2700 gggggggcca ggccccccct ggcccctccc tatttacgcc aatg 2744 // ID SMAR19 repbase; DNA; INV; 1511 BP. XX AC . XX DT 05-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR19. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1511 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1077-1077 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 272..1291 FT /product="SMAR19_1p" FT /translation="MSINETKIVRDSEVVRLHKENLSDSEISKRLDIPRST FT IYSIIKKYEKHDTVLRLSGSGRKKSLDDSDIQIILNEIEKNPFTSSESIKN FT EIEKTTKKSVTSRTVRNYIKDTSYRSRVPRKVPKLSKSNIKRRLDLANEWS FT SWEESDWDKVIWSDETKINLFSSDGRQRVYRKKGKALESKNCLPTIKHGGG FT SIMLWGCMSSKGVGRLHIIKGKMDGLMYKSILTKNLGPSARDMGLDDEFVF FT QQDNDPKHTSRVVGEYFIFKNVNVLDWPAQSPDLNPIEHLWDYVKREVRKR FT KPTNLRELEQYVIEIWENIDSELCKRLVKTMNDRVNCVIRAKGKHIPY" XX SQ Sequence 1511 BP; 554 A; 202 C; 278 G; 477 T; 0 other; tacactgttg gaaaaaaaat ttcggcaaat gtcttttttt cgaattagtc cttaataata 60 tcattattta ttaatgaaaa tttggcatgt attaaaacta acaagttcaa catggtttat 120 gctttttatt ttcgatatta taattgattt tctgtttaaa tctaaaaatt ttattatatg 180 cctggaaaat aaaaatcggc agatttctac aaacctctca aaaacttttc ccgccaaaaa 240 ataaaattgt ggaataattt attacccctt aatgagcata aatgaaacta aaattgtacg 300 tgattcagag gtagtaaggt tgcataaaga aaacctctcg gattctgaaa tttccaaaag 360 actggatatt ccaaggtcta caatatacag tatcataaag aaatatgaga aacatgatac 420 tgtattgaga ttgtctggtt ctggaagaaa aaagtcattg gatgattcag atatccagat 480 tattttaaat gaaattgaaa aaaatccgtt tacgtcatct gagagtatca aaaatgaaat 540 agaaaagact actaaaaaat cagttacaag cagaacagta agaaactata tcaaagatac 600 ctcttataga tctcgagtcc caaggaaggt accgaaatta tccaaatcca atattaagag 660 aagattggat ttggcaaatg aatggtcttc ttgggaagaa agtgattggg ataaggtaat 720 ttggagcgat gaaacaaaga ttaacctttt ttcatccgat gggcggcaga gagtctatag 780 aaagaaagga aaggccttag aatctaaaaa ttgtcttcca actataaagc atggaggggg 840 tagtataatg ctatggggat gcatgtcatc taaaggtgtc gggagactcc atattatcaa 900 gggaaagatg gatggtctaa tgtataagtc cattctaact aaaaatctag gtccttcggc 960 aagagatatg ggtctcgatg atgaatttgt ttttcagcag gataatgatc ctaaacatac 1020 ttccagagtt gtaggcgaat acttcatttt taagaatgta aatgtgctag attggccagc 1080 acaatctcca gatttgaatc ccattgaaca tttatgggat tacgtgaaga gagaagtaag 1140 gaaaagaaag ccgactaatt taagagaact tgagcagtat gttatagaaa tatgggaaaa 1200 tatcgattca gagctttgta aaagactcgt aaagactatg aacgatagag ttaattgtgt 1260 gattcgagcc aaagggaaac atatacctta ttaaatataa ttgccgaaat tttttttcca 1320 gtctgttttg aaaaagtata aattttattg attttaaaag catatactag accgttttcc 1380 tccttaatat gtttaaataa tataaaaaga gaatttactt tgtttaatta tgtttaaagg 1440 tgcttttctt tcggattagc gtacttttta agaaaattgg catttgccga aatttttttt 1500 ccaacagtgt a 1511 // ID Gypsy-12_AA-I repbase; DNA; INV; 4487 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_AA_; KW Gypsy-12_AA-LTR; Gypsy-12_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4487 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 993-993 (2011). XX DR [2] (Consensus) XX CC Positions [3403-3873] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 117..2249 FT /product="Gypsy-12_AA-I_2p" FT /translation="MEGEGVSGVAQTAMKREAVGAAPIVRAGMFGQIKQYV FT VGENFGEYVNRLEMFFLVNDTPDDKKVPVLVTVAGLSLYSIAARLSSPEDP FT RTKSYADLVDLLKKHLDPTTNIVAERYKFRRCEQMSSQSVTEFIISLKATA FT QSCKIGDFLSDALRHQFVAGIKDQSLRKKLLTEPELTFEKACSIGRSWEAA FT LSQNKEMSSQSNHVVAAMREKRNPVHKTKQNVTRAVESKSNRASTKQFKPC FT YRCSRLHDSATCPAHTWTCYSCGKEGHVSTCCRMKSSSSKKSSGNHQRVAE FT MAEAVEVLRLNLMTEPEKPNSIPKAKDGCSLSILEQTSTSNINSLDKADSP FT EFVTLDCQGRRLNFEVDCGACRSVISTDMYRKHFPDCKLVPTALEFVSVSG FT QRIMPKGIVGIRVPAPSGIQGQLELVVIPTDRAVHPLFGRDGLDLIFPGWR FT KAFSIKSVGKSEQSFDTELRTRFPNVFSEPPENAITGFTAEIVLKPGMTPV FT FHKAYAVPFALREKVDKSIDTLVQDGILVPVRSSPWASPIVVVPEKDGTVR FT ICLDGKAALNSRTTVDHYPLPRIDDILAKLANWKVFCKIDLSGAYLQVTLS FT KASQVLCTVNTQRGLYQYTRMPFGLSSAPAIIQSIIDQILVNTPGIPYLDD FT IIVGGRNREECHANLAIVLQKLNDHNVRINLSKSCFYVEKVNHLGSGTKIL FT NLPSLTLIFAG" XX SQ Sequence 4487 BP; 1310 A; 951 C; 990 G; 1236 T; 0 other; atactttaac ttggcgacga gtgtaaaaca gtgcaaattt cggtaagaac aatagtatcg 60 cgtgcgtaca aggtccgttg caatcacgtg gtgagtgaat agcctaacct caaataatgg 120 agggcgaagg agtaagtggt gtggcgcaaa ctgcgatgaa aagagaagca gtaggtgcag 180 ctccgatagt tcgtgctggc atgttcggcc agatcaaaca gtatgtcgtc ggagagaatt 240 tcggggaata tgtgaaccgc ttggaaatgt ttttcctcgt caatgacaca ccggacgaca 300 aaaaagttcc ggtgctggtc acggtggctg gtctcagctt atactcaatc gcagccagat 360 tgagttcacc ggaagatcca cgtacgaagt cgtacgcgga tctggttgat ctcctcaaaa 420 agcacctcga ccctaccaca aatatcgttg cggaaaggta caaattccgt cgatgtgagc 480 aaatgtcgtc acagagtgtt accgagttta ttataagtct caaggctact gcgcagtcct 540 gcaagattgg tgatttttta tccgacgcgc ttagacatca gtttgtcgcg ggaattaagg 600 atcaaagttt aaggaagaag cttctaacgg aaccagagct gactttcgaa aaagcgtgtt 660 caataggccg tagctgggag gcagcgctca gtcagaataa ggaaatgtcg tcgcaatcaa 720 accacgtggt tgctgcaatg agggagaaga gaaatccagt ccacaaaacg aagcaaaatg 780 tgaccagagc agtagaatcc aagtcaaatc gcgcgtctac gaagcaattc aagccgtgct 840 accgctgcag ccgtctacac gattctgcta cctgcccggc tcatacttgg acgtgctact 900 cgtgtggcaa agaaggccac gtatcaacat gctgcagaat gaagtccagt tcttcgaaaa 960 agtctagtgg aaaccatcaa cgtgtcgcag aaatggcgga agcagtggaa gttcttcggt 1020 tgaatctgat gacggagcca gagaagccga acagtatacc aaaagcgaag gatggttgca 1080 gtttgtcgat tttggagcag acatctacaa gcaacattaa ctcgctggac aaggccgatt 1140 ctccagagtt cgtgacattg gattgtcaag gtcgtcgtct gaattttgag gtcgactgtg 1200 gagcctgtcg gagtgttata tccacagata tgtatcgaaa gcattttcct gactgtaagt 1260 tagttcccac tgccttagaa tttgtttcag tttcaggtca gcgcattatg ccaaagggga 1320 tcgttggcat ccgagtcccg gcaccatcag gaattcaagg tcaactggag ttggtggtga 1380 tacccactga ccgtgcagtg catccgttat ttggtcgtga cggtcttgac ctgattttcc 1440 ccggatggag gaaggctttc tccatcaagt cggttggtaa gtcagaacaa tccttcgata 1500 ctgagcttag aacaagattt ccaaacgtat tttccgagcc accggaaaat gcgataacag 1560 gttttaccgc tgaaatagtg ttgaaacccg gtatgacacc tgtgtttcat aaggcatacg 1620 ccgttccttt cgcacttcga gaaaaagttg acaaaagtat cgatacacta gttcaggatg 1680 gcatcctagt tccagttcgt tcgtcacctt gggcaagtcc catagtggta gtgcctgaaa 1740 aagatggcac agtcagaatc tgtcttgacg gcaaagccgc actcaatagt cgtactaccg 1800 tggatcatta tccgttacct cggatagacg acattctagc taaattagcg aactggaaag 1860 ttttttgcaa gattgatctt tcgggagcat acttacaagt cacgctctcg aaagcttcgc 1920 aagtattgtg tactgtgaat actcaacgag gactttatca gtatacacgg atgccttttg 1980 gcctttcgtc cgccccggcg attatccaat cgataattga ccagatattg gtgaatacac 2040 caggaattcc ttatctggat gatattatag ttggtggtcg caatcgtgaa gagtgtcatg 2100 cgaatcttgc aattgttcta caaaagctta acgatcataa cgttagaatt aacctgagca 2160 aatcatgttt ttatgtagaa aaggtcaatc accttgggtc agggacgaaa atactcaatc 2220 taccctcgct tacattgata tttgctgggt agaacttcgt tgatttttat ttgtttttat 2280 cctcgccttg acaacacaac aaagattgac aaagtgctta cgcaagattg tcagtttgct 2340 tttaacctgc tgctactcaa ccgctttttg ataatcggaa acaaaaaatg atattcattc 2400 taaccagttt ggcttgttta cattccgctt cgggaagctg taatttttgc acgaggcaca 2460 ctaaagacgc gtcaatataa tccattccgt gcatcggatc cttcattact cgtatagcag 2520 catatttttt aattaatttg aacctcattg ataatcaatt ttctaaaact gacacttttc 2580 tgcttctgat attcaaaaac accaacgaca tacgggcatc gttgtttgtt ttgcccatct 2640 acttttgtgc catcttcatt ggtacaatca tattggtggt ggtgcggtta ctttgatggt 2700 ggcataaatt cagtaaattg agtatagtga gcatgatttt tttcgtccct gccttgggta 2760 taccatcaca gcagatggta tacggcctga tgagtcgaag gttaaagcaa tattaggagc 2820 tccgtcaccc aaaaatgtta ctcagctaca agcgtatctg ggactattaa actactatca 2880 cagattcctc cctaatctgt cgatagaact acgtcctttg tacaatttgc taaagaaaaa 2940 ttgcagattt gtttggtctt caaactgcca gacggcgttt gaaaacacga agtcgttgtt 3000 agtggaaaac gatctactag agccttatga tccttcaaaa ccgatcattt tggcagttga 3060 cgccagccct tatggcgtag gagctgtcct gtcgcatttg gtgaatagcg aggaaaaacc 3120 tgtgtgtttc gcatcctcca ccttgactcc ggctcaaata aattatgctc aagtgcataa 3180 ggaagctttg gctgttattt ttggaataaa caagttccat aaatacttgt atggtgccaa 3240 gtttaagctt gttacagata acagtgctat taaagaaatt ttcaatccag tcaaaagcac 3300 ttcggcaatc gcagcagcaa gattgcagag atgggctgtg attttgtcta attatgatta 3360 ctcgatagaa cgtcgaccag ggaagtttat gaatcatgct gatgctcttt cgcgtcttcc 3420 attgaacgag cccacagatg tggagcatat aagccttggc agactgcctt aatcattatt 3480 gacgcttact caaaatttat cgatgtccgt ttgttgaaag gctctacaag tgttcaattg 3540 attgaacaac tcgagtcgtt tttcgctgtc tttggaatta ccgaagaagt cgtttctgat 3600 aacggtccgc cctttaatgc agaacttttc attgcatttt tggaagctaa tagcgtcaag 3660 gtctccaaaa cgccacctta ccacccacaa tcaaatggat tggcggagag aggtgtgcgt 3720 acggtcaaag atgttttaaa gaaatatctt ttagatgaaa gatgtaagca gttgtcaata 3780 gcccgaaaaa ttaatcgatt tttaattaat tatcggaata ctccgacaac aacaacaaat 3840 cgttctccat cctccatgat tttctcttac acacctcgaa ctctaatgaa ttcgataaat 3900 ccccggaaag tagaaatcag tcctccaacg ccgccccctt cagttgctga gaaacctaag 3960 ttaattgcaa ctgaagttac ggttaataaa acgtataagc cgggagataa agttctgtat 4020 cgcaatcact tcaaggacat cgtacgatgg gtacctgcga tagttcttca aaagctcagt 4080 cctttaacat acttaatcag tatcgaagga aatgttcgta tggtacatgt taaccaaatc 4140 cgactttcaa atctgtccga taagtaccac ccttcgttgc ctgtcgctgt accgagttca 4200 tcgcagatca tcgatggtga cgatcgtttg aatcgtgaaa acggacctga tcctcaaccc 4260 gaggttcaga agccagtagg aacctgccaa caaagcaagc agggcaagaa aaacaagcat 4320 ataaaacgtc gaagaagtga atccaagtct ccaaaagtta gacgatcaga tcgactaaag 4380 ggacaaccaa gactaaaata ccctaaataa tttaaatatt gtttaaaacg aacaaatatg 4440 aaaactggat tgtaaaatat tgatgaattt gttaagccag gagagat 4487 // ID BEL-624_AA-I repbase; DNA; INV; 6115 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-624_AA_; KW BEL-624_AA-LTR; Pao_Bel_Ele61; BEL-624_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6115 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5170-5706] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 16..5448 FT /product="BEL-624_AA-I_1p" FT /translation="MSTSNPTRQDALSTSLAACIGCKILDSNQKTVTCSKC FT YALWHGSCAGIPDGFDGPWRCNHCSTFCGSVAGSISSTKSTSSARLRLKQL FT QELKALEDKIQLERAERERRFLLEKHSLEAEAEAEETGCGSIRSSASRLRN FT RHRDVNTWVEATNIAKQSLQQTEGFEQPPVGVHTSTPVSDAGAIPKTHPVD FT TRVDDKQQQQVDQSQQNQLNVLNTPLGLPKHMLPPKKPSGILKPTLPAIPS FT ANAQLDAYASQNVGHYTSQPSTHAIPGSVFPSGIGTQCKAMFPSATATDFQ FT SSQPTLPGLLTQQPQSNVIRSSIQLPHPTTTVQSQYRYVPPSSIPGAIPVT FT TTSMFPSLRPLNQSEYVPWNQQASMQYMPNLVPEHSSQSTGGLISAQQLAA FT RQVVSKELPKFSGDPLEWPMFLSAFESTTAICGIQPDENLARLQRSLVGKA FT REMVQSILTLPSAIPEIIATLRDECGRPEQLVHCLLSKVRNAPPPNVNKLE FT TIVTFGREVRNLVTYIEAANLQMHLANPMLLSELVAKLPPSMRLDWGLHSQ FT RTQDASIRAFSDYVSSLKTAACHVSLPSESIHADDSRRAKKEKSGFVNAHA FT LEQEPNNSVSRWKEKLEAKPCLGCHQTDHKVRFCNKFKSLSIAERWTFVEK FT NGMCQRCLVAHGKWPCRTKQPCGIDDCKDLHHKMLHPGKNTQASITTSPSP FT ALVTIHNQKQSSTLFKVLPVRLFGKNKSLNVFAFLDDGSNKTLLENCIANE FT LGLDGEKQSLVLQWTSNVTRREAASKRVELDIAGIGNAHKYHLTDVCTVEE FT LGLPRQSLDYPELSKHFPYLQGLPVSSYQEAKPQILIGLADARLKLALKSR FT ERRDGEPIASKTRLGWTVFGGRRSVPESVQVMVHEITESDENIHDVIKEYF FT DSENFGIIGARPPESPEDQRARKILQSTTKRTTTGRYETGLLWKTDNVEFP FT NSFSMAERRMKCLERRLEKDPDLKKIVNDQLLEYLERGYAHIATDDEIQSA FT DPRRVWYLPLGIVRNPRKPGKVRVVWDAAAKVKDVSLNSMPLKGPDLLVSL FT PSVICRFRQRKFAVAGDVKQMFHQLQIRKEDRHSQRFLYRADPADPPAIYI FT MDVATFGAACSPCSAQYAKNLNATEYQNTFPEAATAIIENTYVDDFLDSRD FT TADEAAQLVIDVREVFSKACFEIRNWQSNSEEVLSRVGEGNPENVKRFSVD FT KTMDSERVLGISWIPETDVFVFNLQVRDDLKHLLYGEIIPTKREVLRVVMS FT VFDPLGFIATYTIHGKILIQDIWRSGIGWDERILPRDFANWQRWIKLSSEL FT SNVSISRCYFPNYAPESLESTELVVFTDASEMAYCCVAYFRIMDRGVLRCA FT LVSAKSKVTPLKPQSIPRNELNAAVMGVRLANVINESHTLPITKRTFLTDS FT KCVISWLRSDPRKYRQYVSFRVAEILDTSRPSEWKWIPTQVNVADKGTKWG FT KCGPSLDPSSEWFTAPEFLFQPSSSWPTKYPDSDENGEELRAVFAYHSTTD FT DCIVKFENFSTFQGLLKRIAMVYHFIHGCRHRVRERTSSSTALLTQEDYMA FT AENGLWRMIQSEKFADEKVSLQANPGRSSNNQCKLGKNSWMFKFSPFLDDN FT GIVRMDSRIDPEAAYFAYDFRNPIIVPREHYVTNLLISHFHQRYGHANTET FT VLNELRLKYYIPKMRSTVKKVVNRCQWCRVYRAKPTPPRMAPLPQPRVSPY FT VRPFTFVGLDYFGPMIVKRGRTNIKRWVALFTCLTVRALHLEVVHTLSAES FT CKMAIRRFLARRGAPQKIYSDNGTNFRGAAKELSDEINAI" XX SQ Sequence 6115 BP; 1726 A; 1398 C; 1496 G; 1492 T; 3 other; aactcaaaaa ttacgatgag cacctcgaat cccacaagac aagatgccct atccacatcg 60 ttggctgcgt gcatcggttg caagatattg gattccaatc agaagaccgt tacctgcagc 120 aagtgctatg cattatggca tggtagctgt gctggtattc ccgacggttt cgacgggcca 180 tggcgatgca atcattgttc tacgttctgt ggttccgtag cgggttcaat atcgtcgaca 240 aagtctacat ccagtgctcg actgagactc aaacagctgc aagagcttaa agctttggaa 300 gataagattc agctcgagag agctgagaga gagcgcagat tcctattgga aaagcacagt 360 ttagaagccg aagctgaagc cgaagagacg ggatgcggca gtatcaggag cagcgccagt 420 cgtctccgaa accgtcatcg tgatgtgaat acatgggttg aagccacgaa cattgcgaag 480 caaagtctgc aacaaacgga gggttttgag caaccaccag taggtgtcca cacgtctaca 540 ccagtgagtg atgctggagc aattccaaaa actcatcctg tggatacaag agtggatgat 600 aaacagcagc agcaggtcga ccaaagtcag caaaatcagc tcaacgtgtt aaatacgccc 660 ctcggattgc cgaagcacat gctccccccg aagaagcctt cgggcatact gaagccaacg 720 cttcccgcca tcccatcggc gaacgctcag cttgatgcgt atgctagtca gaacgtcggt 780 cattatacct ctcagccatc aacacatgct attccggggt cagtttttcc atcggggatt 840 ggcactcaat gtaaagcgat gtttccatca gcgactgcca ctgatttcca atcatctcaa 900 ccaaccctgc ctggattact aacgcagcaa ccacagagca acgttattcg gtcttctata 960 caattacctc atccaactac aactgttcag tcacagtatc gatacgttcc accttcatcg 1020 attccagggg caatacctgt gacaacaacc tcgatgttcc cgagcttacg tccgttgaat 1080 caatcagagt atgtaccatg gaatcagcag gctagtatgc agtacatgcc taatttggta 1140 cccgaacatt cgtcccaatc aactggtgga ctcatcagcg cacagcagtt agcagcaaga 1200 caggttgtat cgaaagaact cccgaagttc tctggagatc cacttgagtg gccaatgttc 1260 ttgagtgcct tcgagtccac gacagcaatt tgcggaattc aacccgacga aaatctggct 1320 cggctccaaa gaagcctggt agggaaagct agagaaatgg tacaaagtat actcacccta 1380 ccatcagcga ttccagaaat tatagctacg ctgcgcgatg aatgcggccg cccagaacaa 1440 ttggtacatt gtttgttatc caaggtcagg aatgcgcctc cacctaatgt taataaactg 1500 gaaacaatag tcacgttcgg tcgagaagtt cgtaatctcg tgacgtacat cgaagctgca 1560 aatctgcaaa tgcatttagc gaaccctatg ctgctttcag agctcgtcgc caagctgcca 1620 ccgagtatgc gtctggattg gggcttgcac tcccagcgca cgcaggacgc ttcaataagg 1680 gcgttcagtg attatgtgtc gtctctgaag acagcagcat gccacgttag tctgccatca 1740 gagtcaattc atgcggatga ttcccgaagg gcgaagaaag agaagagtgg ttttgttaat 1800 gctcacgcat tggagcaaga accaaataat tctgtcagca gatggaagga gaagttagag 1860 gctaagccgt gtcttgggtg ccatcaaact gatcacaaag tacgattttg taacaaattc 1920 aaaagtctct caattgccga acggtggacc tttgttgaga aaaatggaat gtgtcagcga 1980 tgcctggttg cgcatggcaa atggccgtgc cgtacaaagc agccttgtgg gattgacgat 2040 tgcaaggatt tgcatcacaa gatgctgcat cccggcaaaa atacgcaagc cagcatcacc 2100 acttctccct cgcctgcttt ggtcaccatc cataatcaaa aacaatcttc cacacttttc 2160 aaagttctac ctgtaaggct ctttggaaag aacaaatcgt tgaatgtatt tgckttcctc 2220 gacgacggct caaataaaac actactggag aattgcattg ccaacgagtt aggactcgac 2280 ggcgagaaac agtctctcgt gcttcaatgg acctcaaacg ttactcggag agaagcagcg 2340 tcgaaaagag tagaactgga catcgctggg attgggaatg ctcataaata ccacctcacg 2400 gatgtgtgca ccgtggaaga acttggacta ccacggcaga gtttagacta cccggaactt 2460 tccaaacact tcccttatct tcaaggtctg ccggtcagta gctatcaaga agcgaagcca 2520 caaatactca tcggactagc ggatgccagg ttgaaattgg ctttgaaatc aagagagcgt 2580 cgggatggag agcctatcgc ctcaaagact cgattaggat ggacggtatt cggtgggcga 2640 cggtctgttc cagaaagtgt gcaagttatg gttcacgaaa tcaccgaatc agatgaaaac 2700 attcatgatg ttatcaaaga atatttcgac agtgagaatt ttggaataat cggagcacgt 2760 cctccagagt ctcctgagga tcagcgtgca cgcaaaatcc ttcagagcac cacaaaacga 2820 acgaccactg ggaggtacga gactggttta ttatggaaaa ccgacaatgt tgaatttcca 2880 aacagtttct caatggctga gcgacgaatg aagtgcctgg aacgtcgttt agaaaaggat 2940 ccggatttga agaaaattgt aaacgatcaa ctgctggaat acttagagcg cgggtatgcc 3000 catatagcta cggatgatga aattcaaagc gcagacccgc gacgcgtttg gtatttgccg 3060 ttgggaattg ttcgaaaccc acgcaaacca ggcaaggtac gagttgtttg ggatgcggcg 3120 gcaaaagtaa aggatgtctc gctaaactcc atgccgctga agggaccgga cctactggtt 3180 tcccttcctt cagtaatctg cagatttcgc cagagaaaat tcgccgttgc aggagacgtt 3240 aagcagatgt ttcatcagct tcagataagg aaggaagatc ggcattcgca aaggttcctt 3300 tatcgagcag atccagcaga tcctccagct atttatataa tggatgttgc tacatttgga 3360 gctgcgtgct ccccctgctc agcgcagtac gcgaagaatt taaatgcaac ggaataccaa 3420 aatacatttc cggaagccgc cacagcgatc atcgaaaata cgtatgtgga tgacttcctt 3480 gacagccgcg atactgcaga tgaagccgcc caattggtaa tcgatgttcg agaagtcttc 3540 agcaaagcgt gttttgaaat ccgaaactgg cagtccaact ctgaagaagt tttaagccgc 3600 gttggggaag gaaaccctga aaacgtcaaa agattttccg tggacaaaac catggattcc 3660 gaacgcgtat tgggaataag ctggattccc gagacagatg tattcgtttt caacttgcaa 3720 gtgcgtgatg atctcaaaca tttgttatac ggtgaaataa taccgaccaa gcgggaggtg 3780 ctcagggtgg taatgagtgt cttcgaccca ctcggattca ttgccaccta cacaatccat 3840 ggcaagatcc tcatccaaga catttggcgg tctggcattg gatgggatga acgaatcttg 3900 ccgagggact tcgccaactg gcagcgatgg atcaaactat cttcagagct gagcaatgtg 3960 agcatttcga ggtgttactt tcccaattat gctccggaga gtctcgaatc aacagagtta 4020 gttgtgttca ccgatgccag tgagatggct tactgttgcg tcgcttattt tcgcataatg 4080 gatcgagggg ttctgagatg tgcgttggta tctgcgaaat caaaagtaac ccctttgaaa 4140 ccgcagtcaa ttccacggaa tgaactaaat gctgctgtga tgggagtacg gttggcgaat 4200 gttataaatg agagtcacac tttgccaatt acaaaacgaa catttctgac cgattccaaa 4260 tgtgtaatat cttggttgag atctgatccg cgaaaatatc ggcaatatgt atcatttcgc 4320 gtcgcagaaa ttcttgatac tagtcgccct tcagaatgga aatggatacc aactcaagtg 4380 aatgtggcag ataaaggtac aaaatggggc aaatgcggac cgtcactcga ccctagtagc 4440 gaatggttta ccgctcctga atttttattc caaccatcga gtagttggcc aacaaaatac 4500 ccggattcag acgaaaatgg tgaggagctg agggcagttt ttgcttatca tagcacaact 4560 gacgattgca ttgttaagtt cgaaaacttt tcaacatttc aaggcttact taaaagaatt 4620 gctatggtat atcatttcat tcatggatgt cgccatcgtg tacgtgaacg tacatcatcg 4680 tccaccgcgc tgttaacaca agaggattac atggctgcgg agaacggtct ttggcgaatg 4740 attcagtcgg aaaaatttgc cgatgaaaaa gtttctcttc aagcgaatcc tgggcgttca 4800 tcaaacaacc agtgcaaact agggaaaaac agttggatgt tcaaattttc accatttctc 4860 gacgacaacg ggatcgttag gatggatagt cgaatcgatc ccgaggccgc ttattttgcg 4920 tatgactttc gtaatcctat tattgttcca agagaacact acgtcaccaa cctgttgatc 4980 agtcactttc atcaacgtta cggacatgcc aacacagaga ctgtgcttaa tgaactgcga 5040 ctgaagtatt acatccccaa gatgcgatca acggtgaaga aggtagtcaa ccgatgtcaa 5100 tggtgccgtg tgtatcgtgc taagccaaca cccccaagaa tggcaccgtt acctcagcca 5160 agggtaagtc cctacgtacg tccttttacc tttgtgggcc ttgattattt tggaccaatg 5220 atcgtaaagc gtggtcggac taacatcaaa cgttgggttg cactttttac gtgcttaact 5280 gtgagggcgt tgcatctgga agtggtgcat actttgtcgg ccgagtcgtg caaaatggct 5340 attcggcgct ttttggcacg acgcggtgca ccacaaaaga tttatagcga caacggcacc 5400 aactttcgcg gagctgccaa agagctttca gatgagatta acgctatcma ccgagagatt 5460 tcaggaacgt tcacgaatgc tgatactgaa tggcacttta accctccggc tgccccacat 5520 atgggcggag tgtgggagag gaaagttcgt tctgtgaagg agggattgaa agtgctttcg 5580 cataaaaggc atttggacga tgaaggcttc atcacattgc tggcagagat agaaatgatt 5640 gttaattcgc atcctttaac gttcgtgcct ctggagtcac caggacaacc cgttctgact 5700 ccaaatcatt tcttaatgat gagttccagt ggagttaatg ccgaatctag aatcccgata 5760 caagagtcca ctgctttgag aacgaattgg aagcttatgc tgcacttaac tgaccagttc 5820 tggaaacggt ggatcaaagc ttatctaccc acgatcgcca ggagaacgaa gtggtttgca 5880 ggggtgcgtc cattaaaaga aggagatctt gtwatcattg tggacgagtc cgtgcgaaat 5940 gggtggcaac gtggtcgtat tttgcgggct catttggcac cggatggaca ggtacgaaaa 6000 gtagatgttc aaacggagtc tggagttgta agccgtccgg ctataaaggt agctctgctt 6060 gagatattgg aggaaggtaa gaccacctaa cctggtggca ttacgggccg gggtg 6115 // ID Gypsy-32_CQ-I repbase; DNA; INV; 4451 BP. XX AC AAWU01031813; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_CQ_; KW Gypsy-32_CQ-LTR; Gypsy-32_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4451 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 443-443 (2011). XX DR Genome; AAWU01031813; Positions 6729 11179. XX CC Positions [3366-3830] - Integrase core CC 'GAAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 113..3334 FT /product="Gypsy-32_CQ-I_2p" FT /translation="MSNPPGINKSGSAGSGQASGEQPFKDTIIQLLANQHA FT LMEELTEQVKNLQAAPRNELVLDSLATNITEFVYDADKGCTFDTWYGRYAD FT LFDADASSLDDAAKVRLLLRKLSPAAHDRYTSFILPKVPKDFKFKETVDKL FT KSIFGTPVSIFHRRYQCLQTAKDESEDLISYSCRVNRACVDFQLKKLDEEQ FT LKCLVFVCGLTAAKDSDIRMRLLSKINESQGMTLEKIVEECKSLATLKQDT FT VLIGSHASSSVQLSTNAVRTETKQKPKGKFGGKQRDNIPKSPCWSCGGMHY FT MDECKFREHKCRDCNRVGHREGYCSCFSAKNRTKKEHKFQPKPQQKQRSSK FT SVTVGSIQQGRRYTTVGINGVPVELQLDSGSDITILSYQNWEKAGRPPTVP FT PDCSARAASGDKLNISGMFSATISIGGVHKQGKCYVSGPQLNLNVLGADAM FT DKFGLWDVPLSSICNLVASADEDADVAELKSRFADVFSDRAGVCKKTQVHL FT DLKPDAKPVFRQKRPVAYSMEATVEDELQRLQDLGIITPVTYADWAAPIVV FT VRKPDRSIRICADYSTGLNNALEPNSHPLPLPEDIFARMANCTIFANIDLS FT DAYLQVEVDEESRKLLVINTHKGLFQYNRLSLGIKNAPGAFQKIMDTMLCG FT LPRTCPYLDDIIVGGRTRQELKLNLQQVFQRLQEYGFTVKLSKCRFFMNQV FT KYLGQLLDAEGIRPDPEKIAAVVNMPAPHDVPTLRSYLGAINYYGKFIKEM FT RKFRQPLDDLLRKGAKFEWSADCQRSFDRFKEILQSPLLLTHYNPRLDIVV FT SADASNVGIGARIAHRFPDGTEKAIYHASRSLTPAEKNYSQIEKEALALVY FT AVTKFHRMIYGRNFLLQTDHKPLLSIFGSKQGIPAYTANRLQRWALTMLLY FT DFHIEHIATDHFGHADILSRLINAHVKPDEDFVIASIELERSICSIVSQSL FT GFLPVTYKVIAAETEGDETLQKVKHYVIHGWPENKKAVEGAEIQQFFARRE FT ALSIAHKCLMYGERIIIPKKLQKRVLQQLHKGHPGIERSRSLARNYVYWPN FT IDDHISNLISGGGGGQN" XX SQ Sequence 4451 BP; 1028 A; 1300 C; 1268 G; 855 T; 0 other; agtggcgacg ggtctgtgga gtttgtcggt ttgtcaattt ttttttcggc gaaaatcgca 60 aaatcgttgt tgccgtgctt ccacgtggac gtttcttccg tggcccgcag cgatgtccaa 120 cccaccggga atcaacaaat ccggaagcgc tggatccggt caagcgagcg gggaacaacc 180 gttcaaggac accatcatcc agctgctcgc caatcaacac gcgctgatgg aggagctcac 240 cgaacaggtg aagaacctgc aggcggcgcc ccggaacgag ctggtgctgg actccctggc 300 cacgaacatc acggagttcg tctacgacgc ggacaagggc tgcacgttcg acacctggta 360 cggacggtac gcagatttgt tcgacgccga cgcgtcgtcg ctcgatgacg ctgcgaaagt 420 ccggctgctt ctgcggaagt tgtccccggc cgcgcacgac cggtacacgt cgttcatcct 480 cccgaaggtt ccaaaggact tcaagttcaa ggagaccgtc gacaagctca aatccatttt 540 cggcacaccc gtgtcgatct tccatcgtcg ctatcaatgt ctgcagacag cgaaggacga 600 gtcagaggat ttgatcagct attcctgcag agtcaaccgc gcctgcgtgg acttccagct 660 gaagaagctc gacgaggaac aactgaagtg cctggtgttt gtgtgtggac ttacggctgc 720 caaggactca gacatccgta tgcgtctgtt gtccaagatc aacgaatctc agggcatgac 780 cctcgagaag atcgtagagg aatgcaaatc gctcgctact ctcaagcagg acaccgtcct 840 aattgggagc catgcatcct cctccgtgca gctgtccaca aacgcggtcc ggaccgagac 900 gaagcagaag cccaagggga agttcggagg caagcagcgc gacaacatcc cgaaatcacc 960 gtgctggtcg tgcggcggaa tgcactatat ggacgagtgc aagtttcggg agcacaagtg 1020 ccgggactgc aacagagtgg gacatcgcga aggctactgt tcctgcttct cagcaaagaa 1080 ccggaccaag aaggagcaca agttccagcc gaagccgcag cagaagcagc gctcgagcaa 1140 gagcgtgacc gtcggcagca tccagcaggg tcggcgctac acaaccgtcg gcatcaacgg 1200 cgtgccggtg gagcttcaac tagattccgg ctcggatatc accattctct catatcagaa 1260 ctgggagaaa gctggacgtc ctccaacggt tccaccggac tgctctgcga gggcagcctc 1320 cggggacaag ctgaacatct ctgggatgtt ttcggctacc atcagcatcg gcggagtcca 1380 caaacaaggc aagtgctacg tgagtggacc gcagcttaac ctcaacgtcc tcggagccga 1440 cgctatggac aagttcggcc tctgggatgt tccactgtcg tccatctgca acctggtggc 1500 cagtgcggac gaagacgctg acgttgccga gctgaagtcg cggtttgcgg acgtgttcag 1560 cgatcgcgct ggtgtctgca agaagacgca ggttcacctc gacctgaagc cagatgcgaa 1620 gcctgtgttc cggcaaaagc ggcctgtggc gtactccatg gaggccaccg tggaggatga 1680 gctgcagcga ctccaagatc tgggcatcat caccccggtc acgtacgcgg attgggccgc 1740 gcccatcgtg gtcgtgcgga aaccggatcg ctctatccgg atctgtgccg attattcgac 1800 cgggctgaac aacgctttgg agccgaacag ccacccacta cccctgcccg aggacatctt 1860 cgcgcgaatg gccaactgca ccatcttcgc gaacatcgat ttgtccgacg catacttgca 1920 agtggaggtc gatgaggaga gccgcaagct gctggtcatc aacacccaca aggggttgtt 1980 ccagtacaac cgcctgtctc tcgggatcaa gaacgctccg ggggcattcc agaagattat 2040 ggacacgatg ctgtgcggac tgccgcgcac ctgtccgtac ctggacgata tcatcgttgg 2100 cggtcgaacg cggcaggagc tcaagctaaa tcttcagcag gttttccagc gcctgcaaga 2160 gtacgggttc accgtgaagc tcagtaagtg ccgcttcttc atgaaccagg tcaagtacct 2220 gggccagctg ctggacgccg aaggtattcg ccccgacccg gagaagattg ccgccgtcgt 2280 gaacatgcca gcccctcatg acgttcccac acttcggtcg tacctgggtg cgataaacta 2340 ttatgggaag ttcatcaagg agatgcggaa gtttcggcag ccgctggacg acctgctccg 2400 gaagggtgcc aagttcgagt ggtcggccga ctgtcaacgc tcattcgacc gcttcaagga 2460 aatcctgcag tctcctctct tgctcaccca ctacaatccg cgtctggaca tcgtggtctc 2520 cgcggacgca tccaacgtgg gcatcggcgc ccgcatcgca caccgcttcc cggacggcac 2580 cgagaaggca atctaccacg cgtccagaag tctgactccg gcggaaaaga attacagcca 2640 gatcgaaaag gaagccctcg ctctggtgta cgccgtgacc aaatttcacc ggatgatcta 2700 cggacgcaac ttcctcctgc aaacggacca taaaccactt ctctcgattt ttggttcgaa 2760 gcaaggaata ccggcgtaca ctgctaaccg cctgcaacgg tgggctttaa cgatgcttct 2820 ctatgatttc cacatagagc atatcgctac cgatcacttt gggcacgctg acattctatc 2880 tcgcctgatt aatgctcatg ttaaaccgga tgaagatttt gttattgcgt cgatagagct 2940 tgagaggtcg atctgcagca ttgtgagcca gtctttggga ttcttgccgg tcacctacaa 3000 ggtgattgcc gcagagactg agggagacga aaccctgcag aaggtgaagc actacgtcat 3060 ccacggctgg ccggagaaca agaaagctgt cgaaggcgca gaaatccagc agttcttcgc 3120 tcgtcgtgag gcactcagca tcgctcacaa gtgtctcatg tatggagagc ggatcatcat 3180 ccccaagaag ctgcagaagc gggtgctcca gcagctgcac aagggccatc cgggcatcga 3240 gcgatcgcgg tcgctggcgc ggaactacgt gtattggccg aacatcgatg atcacatttc 3300 caacctgatc agtggcggtg gcggtggcca aaactgacac gaaaacagcg ctctcatcgt 3360 ggcccatccc ggagcggcca tggcagaggc tgcacgtcga ctacgccgga ccggtggacg 3420 gaacgtactt tctggtcctg gtcgacgccc tctcgaagtg gccggaagtg gttccaacgc 3480 ggcggatcac cacggagaag accctggcca tcctgcgtaa catcttctcg agattcggca 3540 tgccggaggt gctggtctcc gacaacggcc ggcagttttg cagcgagcac ttcgagaggt 3600 actgtgacgt caacggtatc atgcacctca agaccgcgcc gtaccacccg cagagcaacg 3660 gccaagctga gaggtttgtt gatactttta aacgtacact caagaaaatt caagcaggag 3720 gggaggaact agaagaagcg atcgacactt tcctccagtg ttaccggtcc acaccgtgcc 3780 gcagcgctcc agaggggaaa tctccagcgg aagttcttct cggcggtcgc atccgtacct 3840 cgctcgagct gatgaaacca ccaagcagct tctacaagga ctccagttcc aagcaagatc 3900 agcagttcga ccgtaaacac ggtgccaagg cgaagaactt cgagcccaaa gacggagtgt 3960 ttgcgaaggt tcaccacggc aacgagtgga gctgggtgcc aggtgaggtc gtggagcgga 4020 tcgggacggt catgtacaac gtgtggttac cagaccgtca gcagctgatc agatcccaca 4080 gcaaccagtt gcgcaaacgc cacggtggtg cttacgccac tgaagcggag tctcaaccaa 4140 ccattccact ggatctgcta ctcaacacgt ggggcatcaa acctcccggg ccatcggagt 4200 ctccggaacc ggcgggttca ccggaagctt cgctgccgta cccggacagc ccactgacgt 4260 atcctgagga tgagcagctg aacgagttgc agcaggagtt cctccgcgag ttgatgcagc 4320 cgcccgagcg acgtcgtcca agacgacccc aatcgccagc tgttcctgcg gtggctcctg 4380 ccggacgacc agtccgcaac cggaaagcgc cgagtcgcta cgagccgtac catctatact 4440 aaagggggag g 4451 // ID CR1_Ele32 repbase; DNA; INV; 2908 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele32. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2908 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2908 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 21 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 3..2837 FT /product="CR1_Ele32_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="LRTKIDALYCNAIDCNYDVIILIETGLDAAINSVQLF FT GDNFNVFRCDRNAKNSTKSSFGGVLIAVHRRFTSSSISITTGESLEQVCVS FT VLVGNQRILLCAIYIPPDKSKDVNVMDEHIESVRELRNLSSGNDYLLVFGD FT YNQPELCWKRCGDEIRPTNSGSISAASSSLIDGMDFCNLHQMNIERNHLNR FT TLDLVFTSLELSPEITCSVAPLLPTDQHHPPLELDLMLPNGVNNVPTVSNA FT TGELDFTKIDFEALSLFLSETVWHVLLDGLTADNMAETFCSVIANWLNTNV FT PTRRPAVSPVWGTPLLRNLKRRRNALQRRLRRHRSLENIRSFHRACDEYRE FT LNSSLYKSYVLRVQSNLQRNPKQFWNYVNSKRKSSTIPMSVFLNDSVSSSP FT LEKCELFAAHFSSVFDAKPASPSEVDSAVSDVPRDLLDLDIFQITPAMVSQ FT AAKKLKCTFSPGPDGIPSAVYCRCMSALAEPLCRIFNQSFVDSTFPAIWKQ FT SFLVPVHKKGEKRNVINYRGITSLSAASKLFEMIVSDVILAQAKGYISIDQ FT HGFMPGRSVTTNLLDFSTTCFEQLENGAQVDVIYTDLKAAFDTINHDILLA FT KISKLGASNRLSSWLLSYLTGRTLRVKVDSCMSSSFTSMSGVPQGSNLGPL FT LFLLFFNDVMLLLGHGCRLAYADDLKLYFVVRTVEDCVRLQALLDMFVDWC FT RRNRMTLSVPKCIVVSFYRITRPIMHNYVVNDVVLKRADSFCDLGVLLDHK FT LTFNLHRSNVIAKANRQLGFIAKISRDFTDPHCLKALYCSLIRPILESAAV FT VWTPYQLSWSLRLESVQRKFMRLALRNLPWRDPLNLPPYPDRCRLLDLDTL FT DRRRKTQQVAFIAKLMNGEVDCSRLLSMLNFRVAPRVLRGGALLQQRFHRT FT AFGSNEPFTSMIRLFSMFEDLYDFDGSSNQFVSRVRSSDLI" XX SQ Sequence 2908 BP; 729 A; 665 C; 655 G; 858 T; 1 other; ggctgcgcac taaaatcgat gcactgtact gtaacgccat tgactgcaat tatgacgtaa 60 tcattctgat cgaaacagga cttgacgcag ctatcaactc tgtacagctt tttggagata 120 attttaatgt gtttcgctgt gatcgaaatg ccaagaatag taccaaatcg agctttggag 180 gagttcttat tgccgttcat cgccgtttta ccagctccag catcagcatt accactggtg 240 aatcgcttga acaggtttgc gtgtctgtac tggtaggcaa ccaacggata ctcctgtgcg 300 caatttacat tccaccagac aaaagcaagg atgtcaacgt tatggatgag cacattgaat 360 cggttcgtga actacgaaac ttaagcagtg gtaatgacta tctgctggtt tttggcgact 420 acaatcagcc tgaattatgc tggaagcggt gtggcgatga gattcgtccc actaatagtg 480 gatctatatc agcggctagt tcatcgctaa ttgatggcat ggatttctgc aatcttcatc 540 agatgaacat cgagcgcaac cacctgaatc ggactctaga tttggttttc acctctttgg 600 aattatcacc cgaaattaca tgttccgtcg caccgctttt accaactgat cagcatcatc 660 ctccattgga gcttgaccta atgctaccta atggagtcaa taatgtgccc accgtcagca 720 atgcaactgg ggaattggat tttacgaaga tcgattttga ggctctctcg cttttcctgt 780 ccgaaactgt ttggcatgtt cttctggacg ggttgacggc cgacaatatg gctgaaacat 840 tctgctccgt catagcgaat tggttgaaca caaatgttcc aactcgtcgc cctgcggtct 900 ctccggtatg gggcactcca cttttgagaa acctgaaacg acgtcgcaat gctcttcaac 960 gtagactccg tcgccaccga tcgcttgaaa atatacgcag ctttcatcgc gcctgtgatg 1020 agtatcggga actcaattcg tctctttaca agtcatatgt gttgcgagta caatcaaacc 1080 ttcagcgaaa tccaaaacag ttttggaact acgtgaattc caaaaggaaa agctcgacca 1140 taccgatgag tgttttcctg aacgattccg tttcctcgtc acctttggag aaatgtgaac 1200 tttttgccgc ccacttctca tctgtattcg atgcgaaacc cgcatcccca tctgaggtgg 1260 attctgctgt atcagatgtc cctagggacc ttttagatct ggatattttc caaattacgc 1320 ctgccatggt ctcgcaagct gccaaaaaat tgaaatgcac mttttcacct ggtccggacg 1380 gtattccttc agccgtatat tgtcgatgca tgagtgcttt ggctgagcca ctatgccgga 1440 ttttcaatca atcttttgtt gattcgactt tccccgcgat ttggaaacag tcattcctgg 1500 ttccggtaca taaaaagggc gagaaacgga atgtaattaa ctatagaggt atcactagcc 1560 tttctgctgc atcgaagttg ttcgagatga tcgtgagcga tgttatattg gcgcaagcca 1620 aaggctatat ttccattgat cagcatggct tcatgcctgg acgctcagtg acgacgaatc 1680 tgcttgactt ttcaacgacc tgctttgaac aattggaaaa tggtgctcaa gtcgacgtga 1740 tttatactga tcttaaagca gcgttcgata caattaacca cgacattctg ctggcgaaaa 1800 tatccaaact aggcgcgtcg aatcgactat catcatggtt actctcctat ttaaccggaa 1860 gaacgctgcg agtcaaggtc gattcctgta tgtcgtcgag ctttacaagt atgtctggag 1920 tgccgcaggg cagtaacctg ggtccactcc tgtttctttt atttttcaac gacgtgatgc 1980 tgcttctagg gcatggttgc aggcttgcat atgctgacga tcttaagttg tactttgtcg 2040 tacgcacagt ggaagactgc gttcgtctgc aagccttgtt ggatatgttt gttgattggt 2100 gtagacggaa ccgtatgacg ctgagtgtac ccaaatgtat agttgtgtcg ttctatcgaa 2160 taactcgacc gattatgcac aattatgttg taaatgacgt cgttcttaag agagccgata 2220 gtttttgcga tttgggagtc ttgctggacc acaagctcac gtttaatctg catcgatcca 2280 atgtaattgc gaaagctaac cgtcagctgg gatttatagc aaaaatctcc cgggatttca 2340 ctgatccgca ttgtctgaaa gctctttatt gctcactcat acgacctatt ttagagagtg 2400 ccgctgtagt gtggacccct tatcaactat cgtggagctt gaggttagaa agcgtgcaac 2460 ggaagtttat gcgactagca ctgcggaacc ttccttggcg tgatccgctg aatctgccgc 2520 catatccgga tcgctgtagg ctgcttgatt tggatacgct ggatcgtcga cgcaaaaccc 2580 aacaagttgc ttttatagcc aaactcatga acggcgaagt tgactgctcc cgattgctct 2640 cgatgctgaa tttccgtgtc gctccaaggg ttctacgtgg cggcgccctt ctgcagcaaa 2700 gatttcatcg tactgctttt ggaagtaacg agccttttac gtcgatgatc cgtttatttt 2760 ctatgtttga agatctctat gacttcgatg gttcgtccaa ccaattcgtt tcgcgtgtga 2820 ggagctcgga cttaatatag gatttagttt caattcatgt agacaaaagt cagatggata 2880 atatacaaat atacaaatat acaaatat 2908 // ID hAT-39_HM repbase; DNA; INV; 3890 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-39_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3890 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2027-2027 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 637..3612 FT /product="hAT-39_HM_1p" FT /translation="MPHPGVKRYKVKCKVKNCSHSIFDNDYRMTHNRIYHL FT DYIRNHKGIPYEKYGAPRNPFETNSAINSQSCSKISLSKINFNAQELCPSN FT ITNEVEMNINENLTKISEAIENESSEIECEIINESKNIDITNDEILAVPEI FT LDFCDQTLLVESQPSDVNLLEKETTFSDSVGKFAKLCKLLSECELKLDEVK FT NTDNLHLEKLLIEMTELASEVKISAENVLDANDQALANILNKRKYDNIETL FT TTKNFFLSHDPGQRSKIENDNLRNYLIELGPHQVKLNNFPINKDIDKRKQN FT RFSSLWYKEYPHLEYSILKDSAFCYVCCLFPNGPGRENSEVAWVENGIRTW FT HKMKSRGKKGKRGKLSEHFSSKSHKASLLDYCNYCNSAQHINLLLDKKQRQ FT IVISEKEEMESNKCVIKVLIDLARTLGRQGIAFRGDDRDENGNFRQLVYFM FT SRHNLVLKRWLSNIKLRKYHVTYMSAKSQNDFIHLLGQDVQSKIVSEVNQA FT SMFSIMADTTPDVSHSDKLAIAVRYVDNNCNIKERIVQICEVADKTGDGIA FT DLIISTINNLQLDVNSVVFQGYDYASSMSSNIKGVHAKISEKLKRVVPYIP FT CQAHRTNTFVEHAVKRNKMASNVFEVLEMCYVFISSSTKRFSLLNSKLEFI FT ENSLKLRNLSKTRWSARSESIQAFWISMESIVECFYEISQSDLFDKKTKDQ FT ALAILKKIVNPDFIIMLMLIKGVIAKTKILTDELQKVELNILDAKMIVLST FT IQSLERNRKEEFEVHREIDAAIKICSKYEIDALKIYEDRHRRRVTPKRYDD FT NPDTLCSRNIYQYYTQQLNCVLDCLIYEMNENLLASWLSIEPFSLLQQSTK FT TFTIKDVEKLCLLMPNNGVNTDAHLFHAELEIFMNTFTKSDAPTNNEKLKL FT SWQRQNIFPGVFKAFKLMASAPVSVASDERTFSKLKIVKNILRSTMSDARL FT ESLILLASEKDLVDNIDLDVLVKKWSITTQRRIQIE*" XX SQ Sequence 3890 BP; 1401 A; 567 C; 672 G; 1250 T; 0 other; cagggccgta ccgatagggg gtgtgggggg gtgtccaccc ccccagattt ttaatatttt 60 gtcggttaaa cagtacaatg gtaaattgtc ggtaagatta caataaattg taatcttacc 120 gacaacttta ccattatact atttaaccga caaaatatta aaattattat tttcttgttg 180 tttctgtatt gcaaaaatgt ttttatatca tccttaacgt taattttcaa aggataaaaa 240 actaattctt tttaggaatt taacgcgtat aacaaataaa actcagctaa gcatgttttt 300 ggttgccgtt ttttattctc aaaaattgcg ttacgtaatt agtcaacaat ttctaattac 360 tttaaagcaa gaatcttaat caagagactt gtttacttga tgtttgcaca gaagaaatca 420 aagtttagtt agcggtgtgt ttttattatt agaaatttgg tactagtatt ggaagtttgg 480 cgaaagaagt aataaactat tttgtttata cttgtaagtt acgtatttac taataaaatt 540 ttaaaaagtt taaaatttat taaatcattt atatattttt ttaaggcgaa tattttgtta 600 gtttaatttg taattatctt agattttggg ttcaagatgc cacatcctgg tgttaaaaga 660 tacaaagtca aatgcaaagt caagaactgt tcacattcaa tttttgataa tgactatcgg 720 atgacacata atcgaattta tcatttggat tatatccgaa accataaagg gataccttat 780 gagaagtatg gcgcaccacg aaaccctttt gaaaccaatt cggcaataaa ttcccaatcg 840 tgcagcaaaa taagtttgtc taaaattaat ttcaatgccc aagaactttg cccaagcaac 900 attacaaatg aagtggaaat gaatatcaat gaaaatctga caaaaatctc tgaagcaatt 960 gaaaatgaat catctgaaat tgaatgtgag ataataaatg agagtaaaaa cattgatatt 1020 actaatgatg aaatactagc agtgcctgaa attttggatt tctgtgatca aactctgtta 1080 gtagaatccc aaccgagtga tgttaattta ttggaaaaag aaactacgtt ttcagattcg 1140 gtgggtaaat ttgcaaagct gtgtaaactt ttaagtgagt gcgaattaaa actagatgaa 1200 gtgaaaaata cagacaattt acatttggaa aaacttctca ttgaaatgac agagcttgct 1260 tctgaggtaa aaatttcagc agagaatgtt ttggatgcca atgaccaagc tcttgcaaac 1320 attttaaaca aaagaaaata tgataatatt gaaacactta ctacaaaaaa tttctttctt 1380 tcgcatgatc ctggccagcg ttcgaaaatt gaaaatgata atctaagaaa ctatctaatt 1440 gagctcggac cacatcaagt aaaacttaat aactttccta tcaataaaga tattgacaaa 1500 aggaagcaaa atcgtttttc atcattgtgg tacaaagaat accctcactt agagtacagc 1560 atacttaaag atagcgcatt ctgctatgta tgctgtctct ttccaaatgg accaggtaga 1620 gagaatagtg aagttgcttg ggttgaaaac ggtatccgca cgtggcataa aatgaagagt 1680 cgaggaaaga aaggtaaaag aggaaaacta tcagaacatt tttcatcaaa atctcataaa 1740 gcttcattat tagactattg caattattgt aatagtgctc aacatattaa tctcctcttg 1800 gacaagaaac aaagacaaat tgttattagt gaaaaagaag aaatggagtc taataagtgt 1860 gttataaaag ttcttattga tttggctcga acattgggta ggcaaggaat cgcgtttcgt 1920 ggagatgatc gggatgaaaa tggaaatttt agacagttgg tatattttat gtcgagacat 1980 aatttggtac ttaaacgatg gttgagcaac ataaaactta gaaaatatca tgtgacatat 2040 atgagtgcta aatcccaaaa cgacttcatt catttgcttg gacaagatgt tcaaagcaaa 2100 attgtttccg aagttaacca agcatcgatg ttttcaatta tggcagatac aactcctgat 2160 gtgtcacaca gtgataagtt agcaattgct gtaaggtatg tggataataa ctgtaatatt 2220 aaagaaagaa ttgtgcagat atgcgaggta gctgacaaaa cgggtgatgg aattgcagat 2280 ttaataatat cgacaatcaa taaccttcaa cttgatgtca atagtgttgt gtttcaagga 2340 tacgattacg ccagctctat gtccagcaac attaaaggag ttcatgcaaa aatttcagaa 2400 aagttaaaac gagttgtacc ttatattcca tgtcaagctc atcgaactaa tacttttgtt 2460 gaacatgcgg ttaaaagaaa taaaatggct tcaaatgtct ttgaagttct tgaaatgtgc 2520 tacgtattta tttcttcaag tactaaacga ttcagcttac ttaattctaa acttgagttt 2580 attgaaaatt ccttgaaact tcgcaatctt tcaaaaacta gatggtcagc gcgttcagaa 2640 tccattcaag cattttggat aagtatggaa tctattgttg aatgttttta tgaaataagt 2700 cagtccgatt tatttgataa gaaaacaaag gatcaagctt tagcaatact taaaaaaatc 2760 gtgaatccag actttatcat tatgcttatg cttatcaaag gagtcattgc aaaaacaaag 2820 attcttactg atgagttgca gaaagttgag ttgaacattt tagatgctaa aatgattgta 2880 ttgtctacaa tacaatctct tgaacgaaat agaaaagaag aatttgaagt acatcgtgaa 2940 attgatgcag caattaaaat atgtagcaag tatgaaatcg atgcgttaaa aatatacgaa 3000 gatcgtcata gacgaagagt aactccaaaa cgctatgatg acaatcctga tacgctatgt 3060 tctagaaata tctatcaata ctacacccaa caacttaatt gcgtgttaga ctgtcttatt 3120 tacgaaatga atgaaaattt actggcttca tggttgagca ttgaaccatt ttctcttctt 3180 cagcagtcaa caaaaacctt tacaataaaa gatgtcgaaa aactgtgcct attaatgcca 3240 aataatggag taaatacaga tgctcattta tttcatgcag agttagaaat ttttatgaac 3300 acatttacta aatctgatgc tccaacaaac aatgaaaaat taaaactgtc gtggcaaaga 3360 caaaacatat tcccaggagt tttcaaagct ttcaagctaa tggcttcagc accagtgtcc 3420 gttgcaagcg atgaaagaac atttagcaaa ttaaaaattg ttaaaaatat tttgcgttct 3480 actatgtcag atgcaagatt ggagtcatta attttgttgg cgtcagaaaa agatttagtg 3540 gacaatatag atcttgacgt gttggtaaaa aaatggtcta taacaacaca aagacgtatt 3600 caaattgagt aattgattat gcagttttgc gttttatacg ttaaatttgt tgttgtttgt 3660 atatgctttt gaaatactgt aaatttcatt tgcattgtaa aatttatata atctttcaga 3720 aaaaatatat tttatttctt ttaacgtgtt acaggtagta aattaagaat gcaattatga 3780 ttttgaaaat ttttccgaaa aaaatccttt tttcaaaaca agaaaaattt ttgtcggtta 3840 atttgacgat tgcacccccc cccagtcttt gtatgctaga tacggccctg 3890 // ID L1-N1_CQ repbase; DNA; INV; 1147 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A HAL1-like non-LTR retrotransposon family from Culex DE quinquefasciatus - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; nonautonomous; KW L1-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1147 RA Kojima K.K. and Jurka J.; RT "HAL1-like non-autonomous non-LTR retrotransposons from the RT southern house mosquito."; RL Repbase Reports 11(1), 100-100 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >91% CC identity. CC ~16 bp TSDs. This family encodes a protein similar to ORF1ps of CC L1 in Aedes aegypti. Thus it is likely a HAL1-type element. CC However, its 3' terminus is 92% identical to that of I-1_CQ and CC it is also possible that it is a composite non-LTR CC retrotransposon. XX FH Key Location/Qualifiers FT CDS 110..1012 FT /product="L1-N1_CQ_1p" FT /translation="MSAVRKNTLVVDFSVLPERPKLDMVQRFVEKSLGLNP FT TDLRSIQLHNIRHCVLIQMADPAAAVEVAVQHHLKHAFRISPTKKIAIPVY FT VDDDIVDVRIHDLPPDMSNMQIAEAMLQYGEVLTIRDEVWRDIFAGVPNGV FT RVLKMKISKPVPSYVTICGLSSLATHTGQISTCRRCGFQRHVSKSCSEAAK FT KQQPKPKPQKNPSPSGPVLPIADLAAVQSQVVVPHTHEDNGNDGFITVVKK FT GKTKRQLEEKQAPEEKRSCLDDDDDDKSPTLDNDQRNDNGHLIGLPTTTTP FT KRRVWKQTV" XX SQ Sequence 1147 BP; 357 A; 290 C; 263 G; 237 T; 0 other; cagtcggctc tgtactcgcg tgtgcaccag acgtagcttt gtgctccgcg ttcaaaaatt 60 cgctattgtt gctcgaacaa acaaacaacg cgttgtggaa gactctgaaa tgagtgctgt 120 ccgcaaaaat acgctcgtcg tggacttcag cgtcctccca gagcgtccca agctggacat 180 ggtgcagcgg tttgtggaaa aaagtctggg actaaatccc actgacttga gaagcatcca 240 gctgcacaac attcgccatt gcgtgctcat ccagatggcg gatccagcag ccgccgtcga 300 agtagcagtg cagcatcacc tcaagcacgc cttccggatc agcccgacga agaagatcgc 360 gattccggtc tacgtcgacg atgacatcgt cgacgttcgg atccacgatc tcccgccgga 420 catgtccaac atgcagatcg ccgaagcaat gctccagtac ggcgaagttt tgacgatcag 480 ggacgaggtt tggagagaca ttttcgccgg cgtacccaat ggtgtgcggg tgctaaaaat 540 gaagatctcc aagcccgttc cctcgtacgt cacgatttgc ggccttagca gtttggccac 600 acacacgggt cagatttcca cgtgccgacg atgtggattc cagcgacacg tgtcgaaaag 660 ctgctccgag gcggctaaaa aacagcaacc gaagccgaaa ccacaaaaaa atccatctcc 720 atctggccct gttttgccta tcgccgatct agcagcagta caatcccaag tcgttgttcc 780 tcacactcac gaggacaatg gtaatgacgg atttattact gttgtaaaga aaggcaaaac 840 aaagaggcag ttggaagaga agcaagcacc cgaagaaaaa agatcgtgct tggacgacga 900 cgacgacgac aaatcaccaa ctctcgacaa cgaccaacgc aatgataatg gacatctgat 960 tggactacca acaacaacca cgccaaaaag aagagtttgg aagcaaactg tttgacgtag 1020 aactacgtcg ataaaatgta ttattattat tattattgta ttatttttca aaacttggcc 1080 aaattatggc aacaaaaggc cagtacctct aataaaaaaa aaaaaaaaaa aaaaaaaaaa 1140 aaaaaaa 1147 // ID PERERE-10 repbase; DNA; INV; 2377 BP. XX AC BN000801; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni Penelope-like retrotransposon Perere-10 DE (EST). XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; Cercyon; KW PERERE-10. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-2377 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000801; Positions 1 2377. XX FH Key Location/Qualifiers FT CDS 51..1232 FT /product="PERERE-10_1p" FT /translation="MDVEIQFENLYAQTAELVSSSKDNVESFKSTLVDCCF FT RYLNHGHSSKGILTNKHKEALRKLKTNDNLLITKPDKGYGIVLMDKNNYIN FT KMKAFLNDQSKFQKLVVKNDLADKIEKQIIDSLKQIKQQGFISEKVFEMLK FT SIGTRTPRLYGLPKIHKSGLPLRPVLDMNNSAYHTIAKWLMQILKPLHKEI FT VKHSVKDSFEFVNNIKNLSLKNKFMISLDVTSLFTNIPLLETVDFICNELT FT ERHTETVIPVTAIKQLILRCTMNVQFRFDNEYYRQLDGVAMGSPLGPILAD FT IFLAKLENGPLKDTISHLTSYCRYIDDTFIVLEKEHEKENILNIFNNIHPS FT ITFTLEEEQNGSISFLDVQLTRRIDGTLKRGLHRKSTVGQYTHFYRAVSIK FT " XX SQ Sequence 2377 BP; 865 A; 408 C; 391 G; 712 T; 1 other; aancgttatc attaggccct aagttttgtg attttaagac ggttaataag atggacgtcg 60 aaattcagtt tgaaaatctg tatgcccaaa cggcagaatt agtatcttct tcaaaagata 120 atgtcgaaag tttcaaatca acactagtag actgttgctt tagatactta aatcatggac 180 atagttcgaa aggaattttg acgaataaac ataaggaagc tctaagaaaa cttaaaacta 240 atgacaattt attaatcact aaaccagaca aaggttatgg gatagtactc atggataaaa 300 acaactatat aaacaagatg aaagcctttt taaatgatca aagtaaattt cagaaactag 360 tagtgaagaa tgacttggca gataagattg aaaagcaaat aatagattcg ttgaaacaaa 420 tcaaacaaca gggttttatc tcagaaaaag tgttcgaaat gttaaaatct ataggaacga 480 gaacaccaag gctatatggc ttgccaaaaa tacataaaag tggcttgccc cttcgacccg 540 ttttagatat gaataactca gcctaccaca ctatagcaaa atggttgatg caaattctca 600 aaccactcca caaggaaatt gttaaacata gtgtcaagga ttcctttgaa tttgtcaata 660 atataaaaaa tctatcgctc aagaataaat ttatgatatc attagatgta acgtcccttt 720 tcaccaacat tcctctactt gaaacggtgg atttcatttg caatgaatta accgagaggc 780 acacagaaac cgtaattccc gtaaccgcaa taaaacagct aattcttaga tgcacaatga 840 atgttcaatt ccgatttgac aatgagtatt atagacaatt ggatggggta gccatggggt 900 caccattggg ccctatactg gcagacattt ttctggcaaa actagaaaat ggtccactca 960 aagacacaat tagccatcta acatcatatt gtcgctatat agatgatacc ttcatagtat 1020 tagaaaaaga acatgaaaag gagaacatcc ttaatatatt taataatatc cacccatcaa 1080 ttacgtttac actagaagaa gaacagaatg gtagtatatc gtttttagat gtccaactaa 1140 ccaggagaat agacggaaca ttaaaaagag gattacatag aaaatctaca gttggacagt 1200 acactcattt ctatagagca gtatcaatca aataacaatg aaacttgata acgatgctga 1260 atcatcgtgc tagaatgatt tgatcggatg acattatttt agatgaattg gttaatattc 1320 gcaagttgct tagtaggaat ggatatccaa tgaaattcat cgacaaacac atgaaggaga 1380 tgaaaagaag gacaaggata cctaccgtta tgaagaaggt attgttctta aaattgcaat 1440 tcttaaaaga tgctacggaa gaaatcgtga catggaggct aagaaaagca gcacagaaaa 1500 catttaatgc agtcaaactg agtaacatct tctacaaccg ccccacaata agaacggtta 1560 ataaagacaa attgtctgaa ttcaccatat ctatgtgtat ttaccaattc aactgcttct 1620 gtggagccaa ttataaaagg cgcacgattc ggcaagttcg tcaacgaata atagaacact 1680 atccgtcgtg gttaagcaaa taacaggtaa aagtgattaa gagttctatt cttgctcact 1740 tgatagacac aggtcatcaa gttgaactgg gataagcttt taagattatc caccatattc 1800 catcaaattt gccgcatgct ttacgagctc gtcttttgca catagctgaa gatatcgcaa 1860 tccatgttca caaaccgaac atttgtattc aaaagaagtt tgtacaacca ctctcactac 1920 cttggccatc ggtatagcgg agtttgtagc agaactttgg taaacaatta tcttattcct 1980 tccatttttt ttattttact tattctctac atttgtctct tgattcttac ataactacta 2040 tattactgac cactcatcaa aatattttgt tagttcttta ttctcctttt atatattatg 2100 caacacagca acagcttgat ttttgaacaa ttaatttgat ttaataacca aattccaatt 2160 aattaatgtt agcttataat cgacatgtca ttatgtaatt attacgtaac gtatgtactc 2220 gaagtgttat ttcattattg taaatcatat atatgtatta gtgtagagcc atgatgagaa 2280 actaattgaa aggaaatttc ttcgctcgat tcgtttcttc tttatttacc aaatgtcaaa 2340 atgggaattt tcactataaa taaacaatga ttacgga 2377 // ID DNA8-28B_AP repbase; DNA; INV; 859 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-28B_AP. XX NM DNA8-28B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-859 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1965-1965 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. Note the imperfect TIRs. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 859 BP; 268 A; 149 C; 169 G; 273 T; 0 other; ggctaaagtg cactttacca accccctcac gattttcaaa tttttttttt caaaagactt 60 gttttttatg cttaaaacga tggtgtaaat tttttttccc ggaaactatt attttagaag 120 ttatagcctt ccaaagttta cgattttacg tttatgcgca tgcgcgcaac gctacttgac 180 tgccacgcgc agcgcctata acaaatttta atttttttta cgtattttaa caagttttgg 240 gacatagtag ttttgggtaa taatattgtg aatcacgtca taatacatgt tttgaaccgg 300 tagaatagag aaaagtttag gtcacccgtt tagtaagttt agatatagaa tataggttaa 360 taggtaacca ttagtcggtc atctacctgc gaagataatt gctttttaac aactttgttt 420 aattccgagg cctacccaag gtggttttgc acagcaaatc gggggatatt ttcgttttta 480 attgtccgag cgagcgcagc gagcgagtga ccccgctcgg tgactcggtg caacaaaaat 540 gcaattttct taaccctgtg acccacttgg ggaggtgttg catagcaaat cggggataaa 600 ttattattgc accatcgttt taaacaggtt agaatacgta agtacaaaaa aataacaaaa 660 atggtcttcc gcgcgacatt atagtagtgt tgcgcgcatg cgaataaacg taaaatcgca 720 aactttggaa ggctataact tctaaaataa tagtttccgg gaaaaaaaat ttacaccatc 780 gttttaagca taaaaaacca agtcttttga aaaaaaaaat ttgaaaatcg tgagggggtt 840 ggtaaagtgc atttgttcc 859 // ID Gypsy-55_AA-LTR repbase; DNA; INV; 1731 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-55_AA_; KW Gypsy-55_AA-I; Gypsy-55_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1731 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 3598695 3596965. XX SQ Sequence 1731 BP; 494 A; 303 C; 373 G; 561 T; 0 other; tgtaaaatat gaatctgttc cgtttgtaaa aatctttaaa tattttgtgt catcttaaaa 60 atgacaacac atttctttcc gtgtagcaaa tacaactaca gctacaagtg tatctctctt 120 tttatgtgtg ctgccatctg ttagtcgttt tatctctgta gaaatatttc ggcatgtgca 180 aataaattga agctcgcaag taaaagaaat tgcataattg aaggcgcaac gatgaaattg 240 agcaaattgc gtcgtaggaa ttttccactg acttctatag aatttaggca gccatactat 300 ttggaaatgg aaaattactg gtaaaactct cgctctccag aactgaccaa tggaaaaaga 360 ggttttaaac ccattgtagt ccattgggta gtaattgcgt cttaatttag aggttctaac 420 cacttcgaat cgcgctcgtc atttcatatt ggtggagtac gtgatagtgt tcacatgtga 480 atttcactgt ggaaattatt attggaacgc aagttaagct ttggaaagta tttgaaattt 540 aatttaaagt gtgtaacagt taactaaatt cgaagtaagt ttcgcgagtg taatttaaac 600 catattttaa ttaatttgaa tgttttattt taaagctgaa tttaaaaagt tcgttaggaa 660 tttggaacaa attgataaaa tcgtttgaaa taggaaggac aaatcagtaa ggtaagcttt 720 aatgcatttt cggtggattt aagtgaatta atgaagataa ctgcgttgtt caggcgtgtc 780 aggcccaaca acgtgggcgg ccaatctagg cttaatcagg ttagtctagg gtcatcttgg 840 ctaatctagg ccacacgacg cgcaattgag ctcaggccca ctccgacaag tgggatagtt 900 tttccgatac tacggccaga gctttcgtgt ggaaatacgg tccaatcttc gagtggtggt 960 tgtcgaggat caccacgtga caactcggtt cgtcaagtgt gaagcataga gcgacgtcca 1020 taaagctctc tccaccgaag cccggggacg cctatacccc gatgattgag ctcacacacg 1080 agttgaggta atttgaattt cgaccgttca agcgtctgac gcctacccaa tcgacgtagc 1140 aaagacaccg tccagttggc atccgtcgtt ggagcatcgc ctaggaaaca ggttagatgt 1200 atcgctaggg tgtgtgaacc aattgtagac accttattaa attcgctacg aatgaacgtg 1260 acgtcacaaa aacagttagg gtttgccttg gctaatgaaa tagaatcgaa cgtgaataaa 1320 attctgaact aaggcttttg tgaaactttt taaattctat tttgttttct ttttatttat 1380 tatcgtataa attttgaatt tttggattac ttcgagttta ggtttgactt caccaaactc 1440 ggtcgcctat tgttggaatt tgtcggagtt gttgaatctt tagttgagat tacttttatt 1500 accatttagt tttggatagc gttttctgta ttttaaattc ttttttccat cttccattcg 1560 gttaattggc aattctgttg ggtcaaagtg gaatttcagt cctggattat aactgttggc 1620 ctgccctgag gaggcagtct catgccggga tatttgaaca gagttaacgg tcctttctcg 1680 aaaggtggcg cttaagctgt tagctttatc ggaacagagc agaacgttac a 1731 // ID MuDr-4_HM repbase; DNA; INV; 9392 BP. XX AC . XX DT 30-MAR-2009 (Rel. 14.03, Created) DT 30-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE MuDr-type DNA transposons from Hydra magnipapillata, consensus. XX KW MuDR; DNA transposon; Transposable Element; Ulp1; MuDr-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-9392 RA Bao W. and Jurka J.; RT "MuDr-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(3), 654-654 (2009). XX DR [1] (Consensus) XX CC TSD is 9-bp and TIR is 75-bp long. Transposase is encoded by 7 CC exons. The C-terminal is Ulp1 domain. XX FH Key Location/Qualifiers FT CDS join(4337..4369,4531..4677,4922..5143,5408..5824, FT 5925..6173,6321..6527,6817..8403) FT /product="MuDr-4_HM_1p" FT /translation="MKEVVYFTDFKLINSSNYRKKQQSKEIKYSLLQKKDV FT VISRKIIVEIDCLNDHKNHPLGKIELLSQKIDPRLSVKISEYIKDGVSNVR FT EMRRLLNITVNEMFGKKNLPSRTNRRFYPRTDIIRSHMVKERLKQRYSNID FT QDCLEIKIKEWKEADSSANIYFRPKERGNSAELLFVYQAFWQKKLLKKYGN FT EMLFLDATYKTTRYSLPLFFLVVKTNVDYQIVATFVSESETFESISEALVI FT LKSWNEDLQPLYCMTDYCNAEIRALETIFIGCKVFICDFHREQSWDRWLKK FT LSSGCSNRKDDILCILRRLAWSKSIDEEENAQRILEQSEFWSQESFTKFKQ FT YIEKYWLPIKKRWVWWYRQDRLLINCNTNNGVERQNESFKYTYLQRHKNSS FT LTGMLTLLIDDFFIDKLDRYHFFKVSLRLNFVTMNIISFSYTDSNFKMDAR FT YRRYARWVPSYLYNRPHSFVKHCLIKKSNVENLDLSGVIMIKHGVFAVLSS FT TKPATKYFVSFGDEITMPKCTCLNWISTAYPCKHFFLVFRKYPAWSWSALS FT SLYLSSPFLNLDADDYEASHEKPPFSRPESSILPKKELDVINDWNEKSKKI FT TSVCSGVNVREMLTTIQNLTYEFEENSEEIVLVYSTLASLIKKLEKNRTKE FT SGIPLSPLDIKLPCHLKRNIPIPVRKPMKLPTDRVGNHKDMVIASSKIFLE FT EKKHKTEPIEVDIIEHIDGNSIIIDDESVTINENDGFEIKRLNHLSPNCNL FT SSNDLNDILTNQMLSDIVIHTMQKMITGVGGLQDPVLGQNLSFRVQKASFV FT QVLHDGDAHWLAISNFGCSVGEVFLLDSFFRGKVKNHVVRQICAIMHCNED FT KLTVRVVPVQQQTNYVDCGLYALAFIKHITDTRSNPSYVAFDAFQMRNHLL FT KCVKGNQFTEFPKSETAMRFCKEKEFNFSLYCICRQVWLASDSCIKDR" XX SQ Sequence 9392 BP; 3307 A; 1284 C; 1360 G; 3441 T; 0 other; ggttatcgaa cgaaagtatc actcaaaagg cgatcaaagt ccacttgaaa attatacgta 60 attatgcgga aattacatat gcaaaattaa ataaaatatg ctaaaaaaag cttcgagagc 120 acatgttact cttacagccg tgaaggaaca aaagtgaact tcaatattgt tctaattgcc 180 actgttatat ctgttttgaa accgttaatc ttattgtgat tttataattg ttgttttgtc 240 aatatcacgg tgccttgttt tcataacatt caattcttca ttgtctttga tttagttaac 300 atttttgtgt aaatagaata atgtcaataa ggtaaataat tctataaaaa ggtttgtgtt 360 tttttaataa ttatgaacgg tttttttttt tatctattac aagatacagg acatttttta 420 gagctattta catttttaga gctatttaca tttttaattt aaagtttgaa atatgtggct 480 tcgttcttgg taccaagaac aaggactatt ttagtttttt cactagtttt tggccctgaa 540 catagatctt gttgtaacca tggcccatga cattgtcaca accatgccct aattatagtc 600 tgttatgata tgttcattat cttttaagac ttaaacatgt tggacaaatt tttttcaaat 660 aaatgcaact ttaacttaag ttggttaata atgcaccctc ctagagcttg tgtatctaac 720 tttgatgttt gttaaatgta ttaaatatat cagagtttta ttggcagtgt acagatgaaa 780 taaaccatac aacaactaac gagcactcta ataatgtgtt gaaacttgat aatttattag 840 taaatccaca catggaatat ttcgcatcac tctgtcatct ttttatgtcg aagattatta 900 aaaagataat ctaaaaagta catcatgggt ttagcaaatt gataatagtt gcaaaacatc 960 ttccatatac agacaatact taaattaggt ttatggacac ttgaacaaca atgacacaga 1020 tttgacctaa aagaagtata caagataatg tatgagtttt ataggtctat tctataagat 1080 attctttgaa tcagtgaaaa acaaaagaag aaatggcaac ctttttggtt tggtaaagca 1140 caattttctt aactcaaatc agataacatt ttttcttaga acatgtaatt gaccactgga 1200 tctcatttaa tatcaatact atctcagtca ttttccagaa tttttttata gcatgattca 1260 acaagttgtg tattaaaaaa tgggttttgt catggactaa catctacgta gtttgttatg 1320 tcttaatgac attattacat aactaatgat gccattagta actaataaaa aaataaataa 1380 tctttttata tatatgttgc tagttatatt tcattttcaa ctttcaaaag tatgttatta 1440 gccatgcttt atatttgcaa catggggttt agattttggt tttacagagc ctggaattcg 1500 gtatactagt aacacaatga tttgaatttg cagcctttta ctttagctct gccacgtgta 1560 tccttatcag cttctaaaac tgaacatcat ttaaatgatt agttttactt atttgctctc 1620 ctcatcacaa agtttattga tgaattttaa ttattaatca aaatcactgt aaaataagta 1680 aaatagcaaa agctattatg cattattcta gattattata ttttttttta gctctggttc 1740 agatttgtcc gattatgaag acattactga tacatataaa gtagaagaag acactacaac 1800 tttgaaaggt tttttttttt tattctttga tatgattatt aatataatat aatatctcaa 1860 tgagtttaaa gttttagaca acaccaagcc ttgcatttgc gattcagttg aggaagctac 1920 tcgatttctt catagttatg aaatcagtaa ttctgttagg tttgctgccc tttccactcc 1980 aaagagcttt ggaagttcag gtataatata tatttttaac tcaagcttac attatctgga 2040 aaagtgacaa tatttacttt tataaatatt aaaagttttg tgaactcggt gtagttatgg 2100 tgattttgaa aattcttttt tttccatgtt atttttcttt tcttttttta tttcagcaaa 2160 aatactgttg ataaatggat ttaatttagt aaatttacat gactggaatt tacttttttg 2220 accattaatt gacattggat tggaacatta aaacaaacat caccacaaac aaagtaagta 2280 gtaataaaac cacacataaa aaaatttaat ctgagttaag gtaagaagta actcaaaaaa 2340 tcttaaatgc taatgttctt aattgtacta tgacaattac tctgctctga ttggcaaaac 2400 aagttattgg tttgacatct ggttgtaaga agttgcttcc actcttctag aatgtcgtat 2460 ttatatttaa aaaattgcat tacgtcgatt catgggaaca acacattggt gtaattttca 2520 tcagttttta ttaatcatta actaacaacc ttgattttta tattcttgaa aacgcatttt 2580 tcattcctca acttattatt taatctggtg gatacacagg agatgcaccc acgtcttata 2640 gatgatcact gtaaataata taaactttta attataatca tttatgttta acctattgta 2700 cattttttat ttgcttttgt ttttctttaa tttctgatta atttgcatat tgagttgtca 2760 gaatatcagc atgtgcttca gaaaaaaaac tataaacgaa attatatatt aaacattttc 2820 agttggatgt acaatttaat agcctaataa tagccttttt tttttttttt ttttttcttt 2880 ttacaggaga cctactggat tcaatttgga aatgtaaacg atccaatttt agtaaaatat 2940 caataaacaa ggcttctttt tttttttaac gtaattggaa cctttgtttt gataaactta 3000 ctgttttttc taatatacct aacatgttta actaattact ttgacttgga caaaataaac 3060 ttaagctaga ttaaaaaaaa attgattcta taaactggtg cttatatacg ccgtattggt 3120 gcaatttatt ttgcatagta taaagatttt catgaagact tggtacctaa tttgacagcg 3180 tgtatctatt gaaatctgta atttttgtgt aactaaatgt tattagataa ataaacgagg 3240 aacttttatt ttaaaacgct gtaaaagcaa acaaattata cttgtatata ttttaagaag 3300 aattaatgtt atttgtaata taaaataatt atggggaaac aatgactgtt attttaattt 3360 ctgcatacgc gcgaaatata tgaatttata atatttaaat atttcttacg aaaacctaga 3420 ccaaaaattg actaagaaaa aatttggctt cctttttcag attttattat aattaaaaat 3480 aagtgcgtta tatagatgaa gtaattatca ttctttatta ttattagttt agctcagtct 3540 tttgaatttt tttttaatta ttggtttatt ctattcctaa atttgtttga aactttatag 3600 aaatgttata ttttaactaa tattaaatta aatgtgatat taaaatcaaa ataaagttaa 3660 attaaatgca cctgaataga tttgaacaag gatctgttgt gaaaagacaa attttttcca 3720 aaaattttat ttaacgctat tgaggaaaat caatatccct ttaaaatgat gttctttaca 3780 ctttgtgtta gttgtagtaa catttttaac taatgttaaa aaaagagaac aatgattgtt 3840 aaaatataat aagaggtgcc gactcacaca aaaaaaataa atggtgcgaa ttagaaatat 3900 aatgtatcaa tagttaataa atcaccttaa ctacatcgtt gttctaatgt gtgtgcataa 3960 taattctgca caatcacatg atttgaaatc atacacaaaa catgatatcc taatttcctt 4020 gcttagttat acatggaacc caacataaag tattatggga aacatctgaa gactatgcta 4080 ctccttatat tatattatct acaaaaagat acaattgtca ccatggtttc gatcggaacg 4140 ctgctgcaaa aaggaagcat aatgaagcaa aagaaaatca ggtaaagatt ctcctatttt 4200 tacaattcgc atcgttgtca aataaattta tgttcacttt ggctattttt ttttaaagaa 4260 agtggatcat ttatacataa ctgattatgt tatcatacaa gatactaaaa agtttaattg 4320 tcctgcaaaa gccattatga aagaagtggt atatttcact gacttcaagg taacaaatct 4380 actgacatta caatgttcta ttaaatagga aatcatttta taaattatta cgcactcaca 4440 atattttatt aagttcttaa ctgtttactg tcgatctatt acttttgaaa taatctttgt 4500 ttagttttct tatttaaata tttttttcag cttattaatt ccagtaatta tcgtaagaag 4560 caacagtcaa aagaaataaa atattcgttg ctacaaaaaa aggatgtggt aattagtcga 4620 aaaattattg ttgagataga ctgcttaaat gatcacaaaa atcaccctct tggcaaggta 4680 ttaatgctag agttttctta actgcgttta aatagtctta ttttaactat aaccaatagt 4740 ttttgtatat tcataaagta cctacacaca acggtgacga cattaaatcc taaattgggg 4800 catgtggact ttttaatcgt tgtgaaggcc actcacctat gaaaaaagaa gctctaatat 4860 aaaactctga tatttgtttt aattagcttt tataacagta aatcttttta acgtttctta 4920 gattgagcta ctctcacaaa aaattgatcc acgcttatca gttaaaattt ctgaatacat 4980 taaagatggt gtttctaatg tgcgtgagat gagaagactt ttaaatatta ctgtcaatga 5040 aatgtttgga aagaaaaatc ttccttctag aaccaacaga agattttatc caagaactga 5100 cataattcgt tcgcacatgg tcaaagagcg tctaaagcaa aggtaataga gtttaaaaaa 5160 tgcaaaataa ataaattttg atgttgcaaa tattataata tgtaggacta tttaagagag 5220 gttttaggtg tttattaaat attaaagata acttcagtca ttcattaatt taatcaatct 5280 ttttaatcaa ccaattcaat caattaatca ctcaatcata taattaaaaa caacaacaag 5340 gctatgttac taatttttat aatttaatat ttcattttat aaagcagaca ataaaattta 5400 acgttttaga tactcgaaca tcgaccaaga ctgcttagaa atcaaaataa aagaatggaa 5460 agaagctgac tcatctgcca atatctattt tcgacctaaa gaacggggta acagtgctga 5520 gctcttgttt gtataccaag cattttggca aaaaaaactg ctaaaaaaat atgggaatga 5580 gatgttgttt ttggatgcaa catacaaaac gacacgatat tctttacctt tgtttttttt 5640 ggttgtcaaa acaaatgtgg attatcaaat tgtagcaacc tttgtaagcg aaagtgaaac 5700 ttttgaatcg atatctgaag cacttgttat cttaaagtct tggaatgaag atttgcaacc 5760 tctttattgc atgactgatt attgtaatgc tgagatacgt gctctcgaaa ctatttttat 5820 tggtgcgtat ttgtaaaaca agtttagtat ttatttattt tacatgaagt taatttaact 5880 atacaaacat ccaatgaatc aactgatttt taatatattt tttaggatgt aaggtattta 5940 tatgtgactt tcacagagaa cagtcttggg accgttggct aaaaaagcta tcaagtggtt 6000 gctctaacag aaaagatgat attttatgta ttttaagacg actcgcttgg tcaaagtcaa 6060 ttgacgagga agaaaatgct caaagaatat tggaacagtc tgaattctgg agtcaagaaa 6120 gctttactaa gtttaaacaa tatatagaaa agtattggct cccaattaaa aaagtatgta 6180 aaatataaaa tttttgaagc aagtaaaacg tttaatatat atttttagcg aaatacagga 6240 agattcaata aaaagttatt taattttttt atatttaact gaacaaaaca aaataaaaaa 6300 aaaagctttt tttaaattag aggtgggttt ggtggtacag gcaggatcgt cttcttatca 6360 attgcaatac aaacaatggt gtggaaaggc agaatgagtc ttttaaatat acttatttac 6420 agcgacataa aaactcatca ttgactggaa tgttaacttt gcttattgat gacttcttca 6480 ttgataaact tgacaggtat cattttttta aagtttcgtt acgtttgtac caattttaaa 6540 aagtaataaa cacctaaata aaaacaacaa acactttctc tggttaaaaa aaaaaacaag 6600 ttcagaaaac ttaggtataa gtatgttaac agcaggtttt taccctatat atttaataat 6660 aattcattgt atagattgtt cacgtaacta atataaaatt ttttttcctt atcattacat 6720 tttgcatttt aaagatatcg tgtctatggt actacatacc aaatcattta cattcattta 6780 cacaatattg ggtttttcct aatactttat taaagaaact ttgttactat gaatattatt 6840 tcatttagct atactgattc caactttaaa atggatgcac gttacagaag gtatgcacgt 6900 tgggtgccaa gttatcttta caaccgtcct catagctttg ttaaacattg cctaataaaa 6960 aaaagtaacg ttgagaatct tgatttgtcc ggagtaatca tgattaaaca tggagttttt 7020 gctgtattaa gttctaccaa gccagctact aaatattttg tatcatttgg tgatgagatt 7080 acaatgccaa aatgtacttg tttaaattgg atatcaactg catatccatg caaacatttt 7140 tttttagtat ttagaaagta ccctgcttgg tcatggagtg ctttgtcatc attatatcta 7200 agttcaccat tcttaaactt agatgctgat gattacgaag catcacacga aaaaccacca 7260 tttagtaggc cggaaagttc aattctacct aaaaaagaat tagatgtcat taatgattgg 7320 aatgaaaaat ctaaaaaaat tacaagtgtt tgctctggag taaatgttag ggaaatgctt 7380 acaacaattc aaaacctaac atatgaattt gaggaaaatt cagaagaaat tgttcttgtt 7440 tatagcactc ttgcttctct cattaaaaaa cttgaaaaaa atagaactaa agaatcagga 7500 attccgcttt ctccattgga tatcaagttg ccttgtcatt taaaaaggaa cattccaatt 7560 cctgttcgta aaccaatgaa gcttcctact gatcgcgtag gtaatcacaa agatatggtt 7620 attgcatctt ctaaaatatt tctagaggaa aaaaaacaca aaaccgagcc gattgaggtc 7680 gacattattg aacatattga tggaaatagt ataattattg atgatgagtc agttactatt 7740 aatgaaaatg atggattcga gataaaaaga ctaaaccatt tatcacctaa ctgtaatctc 7800 tcttccaatg atttgaacga tattttaaca aaccaaatgt tatctgacat tgtaatccac 7860 accatgcaaa agatgatcac aggtgtaggc ggtctccaag acccagttct tggacaaaat 7920 ctatcgttcc gtgtacaaaa agcatctttt gtccaggtgt tgcatgatgg cgatgcccac 7980 tggttggcta taagtaattt tggatgttct gttggtgagg tttttctttt agacagtttt 8040 tttcgtggaa aagtaaaaaa ccatgtggtt aggcaaattt gtgctattat gcattgcaat 8100 gaagataaat taactgtccg cgttgtaccg gtgcaacaac aaacaaatta tgttgattgt 8160 ggtttgtacg cattagcctt tataaaacat attaccgata ctagatcaaa tccaagttat 8220 gttgcttttg acgcttttca aatgagaaat catttgctaa aatgtgtgaa aggcaatcag 8280 tttactgagt ttccaaagtc cgaaacagct atgcgattct gcaaagaaaa agaattcaat 8340 ttttccttgt actgcatttg tagacaagtc tggttagctt cggatagttg tattaaagat 8400 aggtaaatat aactcattgc attgtattct gtttacgtaa tttctttttg tctaactggg 8460 gattatattt agacatatgg tgcaatgcgg aatatgcgaa aattggtatc atcgcgcttg 8520 tgaacgtatc cctgattatg ttttggaaga caaatgtgca gattggtcat gttcaaaatg 8580 cagctctatg ttataaaatg tttttattat tttagttatt atttagtttc gtaaatatcc 8640 taatttaaaa aatggtttgt taataataca aacaaattat tatactgact ccatatttgt 8700 agatgtattt tatttatcct ttttctttgt tcgcgttgaa acgaaagttg aatataattc 8760 aacaacaaca gtttaacact ttattaattt tttattcgag tagaaagctt aaataagctt 8820 tgttaaaata taaccgaatg cataacactt agacctttgc ttttcagaaa tataaaataa 8880 tgtgaaaaag cgaaaggttg tacttattac ttcatacctc aaagatatta tttaataaat 8940 atttatgtaa ctagtagtat ttataaaaat agattattta gtaactattt atataagtag 9000 tgtttatgaa aagagattat ttaataacta tttatgtaac tagtagcatt atatagtatt 9060 tatataactt caatatagat aattatataa gatgtaaaaa attcatataa aaaaaaatta 9120 aaatcatttt ttcttttgca ttttattaat gttcaaaagt tctaaaaaaa ttatgttgta 9180 aataagaaac agtttattat tttataatat tccttgtaaa caaagtttat ttttcttata 9240 tagcgctaga ataaattgac ttcaaaattt acgcaacttt cttattgcgc tccattttgt 9300 ttttttctca aacaattaat ttacgcataa attatattaa attagcagtt ggactttgat 9360 cgccttttga gtgatacttt tatctgctaa cc 9392 // ID Sola1-N8_AAe repbase; DNA; INV; 77 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola1-N8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-77 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1298-1298 (2011). XX DR [2] (Consensus) XX CC >98% identical to consensus. XX SQ Sequence 77 BP; 23 A; 17 C; 17 G; 20 T; 0 other; ctgcccatac tcgcataaca gtcccattta tataggaaat cccatagaat atgggactgt 60 tatgcgagta tgggcag 77 // ID Gypsy-11_OD-I repbase; DNA; INV; 5169 BP. XX AC CABV01000403; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_OD_; KW Gypsy-11_OD-LTR; Gypsy-11_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000403; Positions 1466 6634. XX CC 'ACGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 204..1322 FT /product="Gypsy-11_OD-I_2p" FT /translation="MGRVMDISKEIEIFENLKKCDEVQNDAEKLEDIEKKI FT KSLRGQLIAAAKNLDTDEFSRKDQSDISFIRTQLGEIGIFEGGSFETTNFF FT IARCQKVYDLTVKSNKKLEKQFITDVKARCSVHVTHRMNESHGTWEELKKE FT IFANFSGGITAIQSLQRALNTPYDKRRGFKRLATQISLNLQAAKATIEKAV FT KKKVMKQEPGGEAAVGITSSTEPLTVDKLMEHLGASIMATIIQVNHNELYN FT SMAPAWKKIESAGELAEQAEYYQNQRSSGEVSTYYGRQSNGNSNRKQNGNA FT KKQGDYNEANENGNRKGGNSGGGGSNSRGNRKNWNNKKQFKKRAYIGNEEQ FT DEKETPNERENSSDDEQKETLFVSGRSDFR" FT CDS 1490..3814 FT /product="Gypsy-11_OD-I_1p" FT /translation="MISRKNIPKELLSQVRQYDGKVNGIGSGRLTGLLSTE FT MKLGNIWLANVRIFIVESDIPMLIGRDVIFERSDYKCLPDTDGLRMINKAL FT KKEEQHIQYVQRSAFIGNKLDQRLFSKLKEEKMIDLEKISTIGTEQRMELA FT LLINEFKEIFCIDGDPVGTFKEYGRIPTQPGKTAARRQHPIPAQFNDAVDK FT EILAMRKDGIIEDCPDSRGFNSPILCVKKKNGKVRICCNYKSTLNTILQEV FT DREVWTLPKIENIFSGIGTGVKYFTTLDVSRGYWHCQIHPDDRHKTAFKWK FT GRCYMFCRMPFGLFHSGDVFCKSLQVALNQVERSENVISYVDDVLLFDVSF FT ENHMETLRQVLTALGQSGVRLGASKCQFFTQTTTFMGRHISPDGISPDPKN FT IEAILNLNPPTNRKELLSCLGMLGWVSSWISSKISENVAQYCFSNVTRELN FT KLTKKATSKSFTWTDQANEAFVEAKKRLAEPKVLAYPDFALPFILTTDASD FT FCAGAALLQKQKEVVRMIGASSLTFSDTQQRWSTVEKEGFALLAGMERFKY FT YLEGSKPFIVLTDHKPLLNIDKKLSKNRKLQRWRERMSIFNFVVQYIPGSK FT NVVADYLSRPFGKIDKPVMENNDPAGQFYKVKNENLEIYVPSWACGGVFPK FT EILLSKASEAQECLLVSGLVSNSSNLYLHELVVITRAQEEDITLREVRENV FT RNDDKPDKWSLSSFGERDIFLRYRKQFKFEENSDVLLIIQNERYRIVIPYS FT LRDPLFETSARESAFRNKKDTRAT" XX SQ Sequence 5169 BP; 1673 A; 1221 C; 1141 G; 1134 T; 0 other; aattggtacc agttcgctcc actacgaatc acgccgcctg attggccata attcgtcatc 60 ctgaagagaa agccactcga aaatataaac gcatcaaaat cccgcgaggg tcacccaaat 120 tttaaacaaa taagggtgca cgactgaggt acgagaatcc gttatcagaa tatcagcaaa 180 cttcattttt tattgtcatc gtcatgggac gtgtaatgga catctcgaaa gagattgaaa 240 ttttcgaaaa cttgaagaaa tgcgacgaag tgcaaaacga cgcagagaag ctggaagata 300 tcgaaaaaaa aataaagtcg ctgcgcggac aacttatcgc ggcagcaaag aacttggaca 360 ctgatgaatt cagccgaaag gaccagagcg acatcagttt tatcaggacg caacttggcg 420 aaataggaat attcgagggt ggctcttttg aaaccacgaa tttttttatc gcgagatgcc 480 aaaaagtcta cgatctaact gttaaatcga ataagaagct cgaaaagcaa ttcatcacgg 540 acgtgaaagc acgctgctca gttcacgtaa ctcatcgtat gaatgagtct cacggcactt 600 gggaagagct caaaaaagaa attttcgcca atttttctgg cggaataaca gcaatacagt 660 cacttcaaag agccctcaac acaccgtatg ataaacgcag aggatttaaa agactcgcga 720 cacagatcag cttgaatctc caagcagcaa aagctactat cgagaaagcg gtcaagaaga 780 aagttatgaa gcaggaacca ggaggcgaag ctgctgttgg gatcacatct tctaccgaac 840 cgcttacagt cgacaagctc atggaacatc ttggcgcatc cattatggcc acgattattc 900 aagtaaacca caacgaatta tacaattcga tggcacccgc ctggaagaaa attgagagcg 960 ctggcgagct cgccgagcag gctgaatatt accaaaatca gcgcagcagc ggagaagtgt 1020 caacctatta cggccgacag agtaatggca actcgaatcg caaacaaaat ggaaacgcta 1080 agaaacaagg cgattacaac gaagcaaacg aaaacggaaa tcgcaaaggc ggaaattccg 1140 gcggcggagg cagtaattcc cgcggcaatc gaaaaaattg gaacaacaag aaacaattta 1200 aaaaacgagc ttatattgga aatgaggaac aagatgaaaa agaaacgcct aacgaaagag 1260 aaaactctag cgatgacgaa cagaaggaaa ccctcttcgt ctccggtcgg tcggattttc 1320 gctaagggct ggcgctacgt gcttcgcgct agacccgtat attagtaaaa gcacattcag 1380 ctcgtacggc tgcaactcta tacagccctg cgtagacatc aaattaaagt catccaattt 1440 ccgctcagca tacattaacg cgttaatcga tacaggcagc tccataacga tgatttcaag 1500 aaaaaacata ccaaaggagc tactcagtca agttcgtcaa tacgacggca aagtcaatgg 1560 aattggatcg ggccgactga ctggcctact cagcactgaa atgaaattag gaaatatttg 1620 gttggccaat gtacgtatat ttatcgtcga aagtgatatc cccatgctaa tcggccgcga 1680 tgtcatcttc gagaggtcgg attataaatg cctgcctgac acagacggat tacgaatgat 1740 aaacaaagcg ttaaaaaagg aagagcaaca tatccagtat gtgcagagaa gcgcgtttat 1800 cggcaacaaa ttagatcagc gcctgttttc aaagcttaaa gaagaaaaaa tgatcgattt 1860 ggagaaaatc agcacaatcg gcactgagca aagaatggaa ctcgcgctac tcataaacga 1920 gttcaaggag attttctgca tcgatggcga tccggttggt acttttaaag aatatggtag 1980 aataccaact cagcccggta aaacagcagc tcgacgtcag catccgatcc cagcacaatt 2040 caacgacgca gtcgataagg agatcttggc aatgagaaaa gatggaatta tcgaagattg 2100 cccagacagt cgcggcttta attctccgat tttatgcgtc aagaagaaaa atggaaaagt 2160 acggatctgc tgcaattaca aaagcacgct caatacgatt ttacaggagg tcgacagaga 2220 agtctggacc ttaccgaaaa tcgaaaatat cttctctgga atcggaacgg gtgtaaagta 2280 tttcacaact ctcgatgtgt cacgtggtta ttggcattgc caaattcacc ccgatgaccg 2340 tcacaaaacc gcgtttaagt ggaaaggtcg ctgctacatg ttctgcagaa tgccttttgg 2400 gctatttcat tccggcgacg ttttttgtaa gtctcttcaa gtcgcactta accaggtcga 2460 gcgctccgaa aatgtcattt catatgtgga cgacgtgcta ttattcgacg tgtcctttga 2520 gaatcacatg gaaactctac ggcaagttct gaccgcactc ggacagtcag gagtcagact 2580 gggcgcctcg aaatgtcagt tcttcacgca gacaacaaca ttcatgggtc ggcacatttc 2640 tccggatggt atatctcccg atccgaaaaa tattgaagct attctaaatc tgaatccgcc 2700 gacaaataga aaagagctct tgtcatgctt aggaatgctc ggctgggtat cttcctggat 2760 atcatctaaa atctcggaaa acgtcgcaca atactgtttc tcgaacgtaa cccgagaact 2820 gaacaagctc acaaaaaaag cgacttcgaa atctttcacg tggaccgatc aagctaacga 2880 agcttttgta gaagcaaaaa agagacttgc ggagcccaaa gttcttgcat accctgactt 2940 cgccctacca tttatactca ctactgacgc ttcggatttt tgcgcaggag ccgcattact 3000 tcagaaacag aaagaagttg tccggatgat cggagcgtca tcactgacgt tttcagatac 3060 tcaacagcga tggagcacag tcgaaaagga aggcttcgct cttcttgcgg gaatggaacg 3120 cttcaagtac tacctggaag gttcaaaacc gtttatcgtc ctaactgacc acaagccact 3180 actgaatatc gataagaagt tatcaaaaaa tcgaaaacta cagcgttgga gagagcggat 3240 gtcaattttt aacttcgttg tacagtacat ccccggaagc aaaaatgtag tcgccgacta 3300 tctcagccgc ccgtttggaa aaatcgacaa gcccgtcatg gaaaacaacg acccagccgg 3360 acaattctac aaggtaaaga acgaaaatct cgaaatttat gttcccagct gggcttgcgg 3420 cggtgtcttt ccgaaagaga tactcctatc aaaagcatct gaagctcaag aatgcttgct 3480 agtcagcgga ttggtgtcta actcttcaaa cctatatttg cacgaactcg tggtgataac 3540 cagagcacaa gaagaagaca tcacactcag ggaagttcgt gaaaatgtac gcaacgacga 3600 caaacccgat aaatggtcgc tgagctcatt tggggaacga gatatctttt tgcgatatcg 3660 gaagcaattc aagttcgaag aaaactcaga cgttctctta atcatccaaa acgaacggta 3720 tcgaattgtt attccgtaca gtctcaggga tccgctattt gaaacaagcg cacgagaaag 3780 cgcatttcgg aacaaaaaag acactcgagc tacttagctg ggcgtggtgg ccaggcatgt 3840 acgaagatat caccgattat tgtgcctcct gcgtaacttg cttgaaagta aaaggatcgg 3900 acactcaacc caacaaactg gaactcctta acctgccgcg ggggagcgcc ccgttcgaaa 3960 taataatgat cgattacatc gaatttgaaa gaagcaagtc cggaaagaaa tacgccatga 4020 cgctccagtg tcagtttacg cgatttgttc aggtttatcc gtgcgcaaag cacgacgcga 4080 tggccacagc tcgccacctg atgtcattca tatccacatt cggattccct cgagtgctgt 4140 caagcgacag gggacggcat ttctcagcag agttagttca aaacctctgc aaaatgctca 4200 acgtcaagca gctactccac gttccatcga ccggaagcct ctgggacgat cgagagactc 4260 caccgaactc tgaagtctgg aatctgggcg gtagcttacg atcaaggaag cgactgggaa 4320 gacgttatcg ccccagttgt cttcgcgata aatacagtaa aaaatcgggc cacaaaatgc 4380 accccgttcg aagcagtctt tggcagagta gctgcccctc ttaacataac cgtcccaagt 4440 tcaaagcccg accagcataa tccggaaact tatgctcatc aatgcgccga aactctccgg 4500 cgttcggcaa atataattaa aatcgcgcag acagaggcag acgaacgaca aaagcgcgca 4560 aatccgtcga caagacttcc cgaagaaata cacgaaggtg agttcgtatt tttaaaacgc 4620 ccggtaagcg cgcaagcaaa aagggaacat acccaaatga ttggacccta ccgcgtcacg 4680 gcatccgacg gtcacgtact ctgcatcgac gacaaccagc aaaacaaatg ggtccataga 4740 cagcatattg tcctcgcaaa acggcgcaga cccgacctca gcgaggaaat cgacatgttc 4800 ggaagcagcg acgaagaagt tgctcccatc tccgaaatca aaaagtccga cgaaatccaa 4860 ctccggcgga gtactcggac gagacgaaag ccagatcgct ttcagccttg acaaacactt 4920 caaatcttaa tcgtcgaggt agtcaccaaa ctttcatatt ttcaaatcta ttttacgcca 4980 tttcacctca aacgcgccta caaatcctgg acttccagct tacagtcgca aataagaaaa 5040 gaacgtccaa aagtttaaac gccgaggtag ttttcttatt tacattcact tacgccttta 5100 tttcacgaca acttaccgca gcaaccgaca catcttcaac aaaagaacta acgaaaaagt 5160 ggagggaca 5169 // ID Sola2-N1_AAe repbase; DNA; INV; 1394 BP. XX AC . XX DT 11-OCT-2010 (Rel. 15.1, Created) DT 11-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; KW m4bp_Ele13; Sola2-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1394 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1394 RA Kojima K.K. and Jurka J.; RT "Sola2-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [1] Named as m4bp_Ele13. CC [2] Consensus update and characterization as a non-autonomous CC Sola2. ~96% identical to consensus. This consensus is ~94% CC identical to the original sequence in [1]. 4 bp TSDs. TIRs are CC ~920 bp long. XX SQ Sequence 1394 BP; 529 A; 176 C; 190 G; 499 T; 0 other; gacgaattcc atgcagaatg agtcgaaaaa aataaaaatc gtgctcgata gtcttcgatt 60 tggattaaat tttgcacatg cttttggtat ggtagaataa gtgttttcca tagaaaaatt 120 gatcattttg actcaagtgg aacttttgaa aatggcctat gaatttttgc atgcaactta 180 tttgaaaaat tctaactccg aaactgttga ttttagagaa aaatgttcta tgaagaagtt 240 gtagtgaacc gtttggacta taagaaaaaa atatacactg aaaaaatatt ttattaattt 300 tcataaaaaa ttcaaaaata aaacttaaat tttaaattac acaaaaaccc catttttaat 360 attttttaaa ttttcaccat ataatcttga tagtaaaaag agttttggga caaagtttca 420 tgatggagaa atttttaata aaaaagtttt tctaagaaca acttttggac gattttcaaa 480 attttgatat tttgtcaaaa taaatacgtt ctgatgatac agtaagcaac ttgaaatgat 540 tctaagccat aatgattata tgaataattt ctctagaagt agccatttgt gaattatatc 600 gagtttaatg caaaaaatat tatttaaata ttaaaatagg ccattttcat tagttattcg 660 tgattctcca tcaaggaaaa taagtaaatg taaagaaata tggtctttac tctcgaaaac 720 gaattattat taatggagaa acgcgaataa ctaatgaaaa tggcctattt taatatttaa 780 acaatatttt ttgcattaaa ctcgaaataa ttcgaaaatg gctacttcta gagaaattat 840 tcatataatc attatggctt agaatcattt caagatgctt actgtatcat cagaacgtat 900 ttattttgac aaaaaatcaa aattttgaaa atcgtccaaa agttgttctt agaaaaactt 960 ttttattaaa aatttctcca tcatgaaact ttgtcccaaa cctgttttta ctatcaagat 1020 tatatggtga aaatttaaaa aatattaaaa atggggtttt tgtgtaattt aaaatttaag 1080 tttaatattt gaattttgta tgaaaataaa tgaaatattt ttccagtgta tatttttttc 1140 ttatagtcca aacggttcac tacaacttct tcatagaaca tttttctcta aaatcaacag 1200 tttcggagtt agaatttttc aaataagttg catgcaaaca tttataggcc attttcaaaa 1260 gttacgcttg agtcaaaatg atcaattttt ctatggaaaa cacttattct accataccaa 1320 aaacatgtgc aaaatttaat ccaaatcgaa gatggtcgag ttctgtgact gaccgatttg 1380 acatggaatc cgtc 1394 // ID Gypsy17-I_Dpse repbase; DNA; INV; 11390 BP. XX AC Unknown_group_151; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy17_Dpse; KW Gypsy17-LTR_Dpse; Gypsy17-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-11390 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1104-1104 (2009). XX DR Genome; Unknown_group_151; Positions 53483 42094. XX CC Positions [5030-5503] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3266..5812 FT /product="Gypsy17-I_Dpse_2p" FT /translation="MIRQGIIRKSNSPYCSPISIIPKKIDASGKQKYRMVV FT DYRSLNEITVKDKFPTPRMDEILDKLGKCQYFTTIDLAKGFHQIPMEPSSI FT PKTAFSTKHGHYEFTRMPFGLTTAPATFQRCMNNLLEELIFKDCFVYLDDI FT IIFSTSLEEHLLSLRRVFGKLRDANLKLQMDKCEFMKKETEFLGQVVTTEG FT IVPNPGKISAIVKFPIPETTKQIKSFLGLCGFYRKFIPNFANIAKPMTLKL FT KKGAVIDPKEEKYVEAFERLKVLITSDPILAFPDFDKMFTVTTDASNIALG FT AVLSQEHKPICYASRTLNEHEINYSAIEKELLAIVWATKYFRSYLFGRTFE FT IQSDHKPLIWLNNLKEPNMKLQRWKIKLNEFDFKMKYLPGKENYVADALSR FT VKINENLLGEASNSVGATVHSAQEDNLNHIAITDRPINYYNRQIELIKGNE FT DKVETIRYFHKLIIKITYKEMTNTLAKYIIKEYLCTKKSALYFHNETDFPP FT FQRAYLEIINSNNITKAIKSSTKLLDIQNYAEFKETILKNHKGLLHPGIEK FT TVNWFRGKYYYPDYQKLIQNIINECPTCNIAKTEHRNTKLAFETTPEILNI FT REKYVMDFYMVGNQQFLSCIDVYSKFATLVEVKSRDWLEAKRAIMKVFNEM FT GKPIEIKADKDSAFMCTALQAWLNAENVKIEITSSKNGISDVERFHKTVNE FT KLRIISSEDNTEDRFTKFELILYTYNHKTKHNTTNRTPADIFIYAGSPDYD FT TQLNKVNKITQLNKNRIEYEVDTRYKLSPLVKSKVTNPFKRTGEIRQVDEK FT HYEEKNRGRKITHYKSKFKKKKKSNRSKYNNSRETEEVNGIGELQHN" FT CDS 5787..7199 FT /product="Gypsy17-I_Dpse_4p" FT /translation="MESANFNTIKILIFLIITIVMAQGQNIEINSIKSQNG FT YMIFKTGSINIPINYEYHYLTVNVTKTEELYQNLLRQATKFQDIIQIKYLV FT DKLQREMNGLKITKRNKRGLINIVGTAYKYLFGTLDQEDKVDLEQKIENLA FT SHSIQMNELNLVIDAVNSGINVINKLNEEKDRNQQIEILIFNLQHFTEYIE FT DIELGMQLTRLGIFNPKLLKQDYLEQINSEKILNIKTCTWLKSDTNEILII FT SNIPKDITKVPIYKIVPYPDELNNMLTDMTYDKYYIKNEEVFVKETRLKTN FT DNCIKGILMQTPTKCPYSKTFQNFQINFIEPNILITWNLPKTTLNQNCVNQ FT EIIAEGNTIIKIYNCSIQLNEFTISNTMLDFAQNVYVNNNITKIKPLSYVQ FT TNEIIMQYTTYSNLLQISLLISFVIIILVLLSYVAVKFTKTSKRVTVKCIN FT PIIEAKEEPTSATALDTPSLYPRVIA" FT CDS 9131..10873 FT /product="Gypsy17-I_Dpse_3p" FT /translation="MKENYIDSEIKDDNLTVHTINGPVRLTKSIMRNANKT FT CPSEQKFYIHNFSENYDILLGREYLMASKAVIDYRNDTVTLGARSYPIIQS FT GEAIEAGKTAQKCTDPSTKEEKIAPKCLDPSPEGDQYFASALDSELRECNQ FT LRLEHLNDEEREELKKVLYEFKDVQYREGDNLTFTSTIKHEIKTQHEDPVY FT RRPYKYAQIHDQEVNQQIKDMIRQGIIRKSNSPYCSPISMIPKKIDASGKQ FT KYRMVVDYRSLNEITVKDKFPTPRMDEILDKLGKCQYFTTIDLAKGFHQIP FT MEPSSIPKTAFSTKHGHYEFTRMPFGLTTAPATFQRCMNNLLKESIFKDCF FT VYLDDIIIFSTSLEEHLLSLRRVFGKLRDANLKLQMDKCEFMKKETEFLGH FT VVTTEGIVPNPGKISAIVKFPIPETTKQIKSFLGLCGFYRKFIPNFANIAK FT PMTLKLKKGAVIDPKEEKYVEAFERLKVLITSDPILAFPDFDKMFTVTTDA FT SNIALGAVLSQEHKPICYASRTLNEHEINYSAIEKELLAIVWATKYFRSYL FT FGRTFEIQSHHKPLIWLNHLKEPNMKLQRWKKKI" XX SQ Sequence 11390 BP; 4470 A; 2294 C; 1913 G; 2713 T; 0 other; aataaattct gagaaaatac taaatataaa aaacctccac atggctaaaa tccgatacca 60 acgaaattct aataatttcc aacattccca aagacattac aaaagtacca atctataaaa 120 tcgttccgta tccagacgaa ttaaataaca tgttaacaga tatgacatac gacaagtact 180 acataaaaaa tgaagaagta tttgttaaag aaactagatt gaaaaccaat gacaattgta 240 taaaaggaat tttaatgcaa actcccacta aatgtccata ttccaaaact tttcaaaact 300 ttcaaataaa ctttattgaa cccaatatac taatcacatg gaatcttcca aaaacaactc 360 taaaccagaa ttgtgtaaac caagaaatca tagcggaagg aaataccatt ataaaaatct 420 acaactgttc aatccaatta aatgaattta caatatcaaa cacaatgtta gattttgccc 480 agaatgttta tgtcaacaac aacataactt aaataaaacc actgtcgtat gtccaaacta 540 atgaaataat tatgcaatac accaaatata gtaacctatt acaaataagt ctaatttcat 600 ttgtcataat tatattagta caactaagtt atgtagctgt taaatttaga aacttctaaa 660 agagtcacgg ttaaatgtat taacctataa tagaagccga agaggaaacc acctcagcca 720 ccgcacttga aaccccctcc ctatacccga gggttatcgc ctagggacag gctaaataca 780 aacgttgggg ggatggaata tccataattc cccacatccc atacacatta tgtttatgta 840 caattcatcc accccatcca cacaagtaac aggaccccac tcaagaatca ataactgttt 900 cttcccatac tccaagctag ggccccccat aatataaaca atggaccaca ttgtttcagc 960 aacattagaa tcacgccact ctcgagaaat cgattcttta gaatcacgcc actcatgagg 1020 accaatcaaa gctagacata aaaccaagga gtatcacatt cctagctccg tcatgtaatg 1080 caacaccgat cgccagcagt ggatgccttt catcaaagcc gacgcatccc caaatatcga 1140 tccaggcacg tacgccaccg acacgaagat gcagccgagg aaagcaacgc ccgaagccag 1200 agtcgggagt tccaagagaa gggctcatgg aaactagcca gcagcagaga gatcagttcc 1260 gtttgagaca agcagaccga aagtcaagtc ggggaattca ttcaaatctt gtgacataaa 1320 gatatttaag ttgtaaaata aaagtctttg ttaaaaactt aaaatcgagt tgcggatatt 1380 taactggcgc agtcggtagg atttcgttta aaattaacac ctatatttcg gtaatggtca 1440 gcttgtgtaa tttataatta ctgtgatcac gtaatcccgg aattgttaat aaaattgtga 1500 aatccgccaa agaacctatg tgctgcaagc acatacgatt tgcaaaacta atttaagtga 1560 actgaagtaa gcggagatag tgaatcacca aggacagtga aacacttaat acaaaataaa 1620 gttaaaaact acaacaaaac aaaacccaat caacaccaac aacatggcgt taccaccact 1680 atcattgagc gaaatccact tgaaccaggc actgctgtcg atccgacaaa tacctgccta 1740 tgacggttca tctggacagc taagctcatt tatcaaacga atcgaatatg tatgcggatt 1800 atatccaact gaagatcttc gccaacaaag gatcctattt ggagcgaccg agatccaact 1860 gtcaggagag gcacaacgca tatcgcaaat gatacaaccc aacacgtggc tcgaactgaa 1920 gacggcgctg atcaacgagt tcaaaaatca cacaccatat gaagaactct tacgacaact 1980 atacaccacc agattcaacg gcagtcttcg taagtttata gaagaattag agataaaagc 2040 attattaata aacaataagc taaacttaga aaacatacca gaaaataatg tcatttataa 2100 aaacgctttg aacaacacaa ttaaagatgt cattactcga aaacttcctg acagaatgtt 2160 catgacctta gcaagaaaag atattaccac attgtctaat cttaagaagg cagcacaaga 2220 agaaggattg tacgaatcta gtacaacaga tgcaggtaca aataaagatc aatcgaaatc 2280 gtctcaatca aatcgaaaga atgttggaaa ctattatcca aattataata ctactaacta 2340 tcacaatcaa actcagagtt catctggaac aggtcagacg aatcgaacaa atcaaaatca 2400 aaattcccaa aatcctggat tatacaatga gttcaaaatt tccttgaatc aagggagggc 2460 tcagaatcca ttcaatcagc aaatatttca aaaccaagct agtgggagtg gacccaacca 2520 acctactaaa agagggaggg agagccagag tggccaaacg aagatggacg taaattttca 2580 tcaagccgcc tcggaagaaa ccgaggagga ccctatataa acattaccat taataaaaga 2640 atattaaggt gtatcgctga ctcgggatca tcgatcaaca tcatgaaaga aaattacaca 2700 gattccgaga taaaagacga taaccttacg gtccatacca tcaatggacc tgtccgttta 2760 actaagagta taatgaggaa tgcgaacaaa acatgtccgt ccgaacaaaa gttctacata 2820 cataactttt ccgaaaatta tgatatttta ttaggaagag aatacttgat ggcaagtaaa 2880 gccgtcatcg attacagaaa tgacactgta acactaggag ctaggtcaac ccaagtgcac 2940 tgatccctct acaaaagaag agaagatcgc acctaagtgc cttgacccat ctccagaggg 3000 agatcaatac ttcgcttccg ctctagacag tgaattaagg gaatgtaatc aattaagatt 3060 ggaacactta aacgacgaag agagggaaga gttaaagaag gtcttatatg agtttaagga 3120 tgtgcagtac agagaaggtg acaatttgac tttcacaagt acaatcaagc acgagatcaa 3180 aactcaacat gaagacccgg tctacagacg accatacaaa tatgcccaga tacatgatca 3240 ggaggtaaac cagcaaatca aagatatgat caggcaggga ataattcgaa agtcgaattc 3300 tccttactgc tcgcccatat ctattattcc gaagaagata gatgcttcag gaaagcaaaa 3360 atatcgaatg gtagtagact atagaagtct aaatgaaata acagttaagg acaaattccc 3420 aactccaaga atggatgaaa tcctagacaa actaggtaaa tgtcagtatt ttacgacaat 3480 cgatttagca aaaggtttcc atcaaatacc catggaacct agctcaatcc ccaaaactgc 3540 gttctccact aagcatggac attacgagtt cactcgtatg ccctttgggc taaccactgc 3600 cccagctacc ttccaaagat gtatgaacaa tctgctagaa gagttaatct tcaaagattg 3660 ctttgtatac ttagatgaca tcattatatt ttccacttca ttggaagaac acttgttgtc 3720 actaaggaga gtatttggaa agttgagaga cgccaacttg aagctacaaa tggataaatg 3780 tgaattcatg aaaaaggaaa ctgaattcct aggtcaagta gtcactactg aaggaatagt 3840 accaaatcca ggcaaaatct ctgccattgt caaatttcca ataccagaaa cgactaaaca 3900 aataaagtca ttcttaggtc tgtgcgggtt ctacaggaaa ttcattccta attttgcaaa 3960 catagcaaaa cccatgacac tcaaactaaa gaaaggagct gtcatagacc ccaaggaaga 4020 aaagtacgtc gaggcatttg agagactaaa agtactcatc acctcagacc ctatccttgc 4080 gttcccagac tttgacaaaa tgttcacggt aacaaccgac gctagtaaca tagcattggg 4140 agcggttttg tcacaagaac acaaaccgat ctgctacgct agcagaaccc taaatgaaca 4200 tgaaataaac tattcagcca tcgaaaagga attattagcg atcgtatggg caaccaagta 4260 tttccgttca tacctgttcg gtagaacatt cgagatacaa agcgatcata aacccttgat 4320 ttggttgaac aacctcaagg agccaaatat gaaattacaa aggtggaaaa taaaactgaa 4380 tgagtttgat tttaaaatga aatatctacc aggcaaagaa aattatgtag ctgatgcctt 4440 atcaagggta aagattaatg aaaatctcct tggagaagct tccaatagcg ttggtgcaac 4500 tgtacatagc gcacaggaag acaatctaaa tcacattgct atcacggata gaccaattaa 4560 ctattacaat cggcaaattg agctaataaa aggcaatgaa gataaggtag aaacaattcg 4620 ttattttcac aagctaatca tcaagatcac gtataaggaa atgactaaca ccttagcgaa 4680 gtatataata aaggaatatc tatgcaccaa aaagagcgca ttatatttcc ataacgaaac 4740 agattttccg ccattccaaa gagcctattt agaaatcatt aattctaaca atataaccaa 4800 agctattaaa tcaagcacga aactcctaga tattcagaat tacgctgaat ttaaagaaac 4860 catattaaag aatcataaag gcttactcca tccagggata gagaaaaccg tcaattggtt 4920 taggggaaaa tactattacc cggattatca aaaactaatc caaaacatca taaacgaatg 4980 tcctacttgt aatattgcca aaacagagca tagaaatacg aagttggcat tcgaaactac 5040 accggaaata ctgaacatca gagagaaata cgtcatggat ttctatatgg ttggtaacca 5100 acagttccta tcatgtattg acgtatactc gaagtttgcc actcttgtag aagtaaaaag 5160 tcgtgactgg ttagaagcaa aaagagccat catgaaagtt tttaacgaga tgggcaaacc 5220 aattgaaatc aaagccgata aagattcagc ttttatgtgc acagcattgc aagcatggct 5280 gaatgcggaa aacgtcaaaa tagagatcac ttccagcaaa aatggaatat ctgacgtaga 5340 aagatttcat aaaacagtaa atgaaaaatt aagaataata tctagcgagg ataacacaga 5400 agatagattc acgaagtttg aattaattct ttatacttat aatcataaaa ccaaacataa 5460 taccacaaat aggacaccag ctgatatatt catttatgca ggctcacctg attatgacac 5520 acaactaaat aaagtgaata aaatcactca attaaataaa aaccgcatag agtatgaagt 5580 agatacccgc tataaactat cacctctagt taagtctaag gtaacaaatc ctttcaaaag 5640 gacgggtgaa attagacagg tagacgaaaa acattatgaa gaaaagaata ggggtaggaa 5700 aataactcat tataaatcaa aattcaagaa aaagaagaaa tctaatagaa gcaaatataa 5760 taattccaga gaaaccgagg aagttaatgg aatcggcgaa cttcaacaca attaaaatcc 5820 tcatcttcct cataattacc atcgtaatgg cacagggcca aaacatagaa ataaattcta 5880 taaaatccca aaacggatat atgatattca aaactggatc gatcaacata cctatcaatt 5940 acgaatacca ttatctaact gttaatgtaa ctaaaaccga agaactatac caaaatttgt 6000 taaggcaagc cactaaattc caggacataa tccaaataaa atatctagta gacaaattgc 6060 aaagagaaat gaacgggcta aaaataacca aaagaaataa aagaggccta atcaatatag 6120 taggaacagc ctacaaatat ctctttggaa cactagatca ggaagataaa gtcgatttgg 6180 aacaaaaaat agaaaattta gccagccata gcattcaaat gaatgaacta aatttggtaa 6240 tagatgcagt taatagcgga ataaacgtta taaacaaact aaatgaagaa aaagacagga 6300 atcaacaaat agaaattttg atattcaatc tacaacattt cacagaatat atcgaagata 6360 tagaactagg tatgcaatta actaggctcg gtatatttaa tccaaaatta ttaaaacaag 6420 actacctgga acaaataaat tctgagaaaa tactaaatat aaaaacctgc acatggctaa 6480 aatccgatac caacgaaatt ctaataattt ccaacattcc caaagacatt acaaaagtac 6540 caatctataa aatcgttccg tatccagacg aattaaataa catgttaaca gatatgacat 6600 acgacaagta ctacataaaa aatgaagaag tatttgttaa agaaactaga ttgaaaacca 6660 atgacaattg tataaaagga attttaatgc aaactcccac caaatgtcca tattccaaaa 6720 catttcaaaa ctttcaaata aactttattg aacccaatat actaatcacg tggaatcttc 6780 caaaaacaac gctaaatcag aattgtgtaa accaagaaat catagcggaa ggaaatacca 6840 ttataaaaat ctacaactgt tcaatccaat taaatgaatt tacaatatca aacacaatgt 6900 tagattttgc ccagaatgtt tatgtcaaca acaacataac taaaataaaa ccactgtcgt 6960 atgtccaaac taatgaaata attatgcaat acaccacata tagtaaccta ttacaaataa 7020 gtctactaat ttcatttgtc ataattatat tagtactact aagttatgta gctgttaaat 7080 ttacgaaaac ttctaaaaga gtcacggtta aatgtataaa ccctataata gaagcaaaag 7140 aggaaccaac ctcagccacc gcacttgaca ccccgtcact atacccgagg gtaatcgcct 7200 agggacaggc taataacaaa cgttggggga gttacatatc cataattccc cacatcccat 7260 acacattacg tttatgtaca attcatccac cccatccaca caagtaacag gaccccactc 7320 aagaatcaat aactgtttct ttccatactc caagctaggg ccccccataa tataaacaat 7380 ggaccccatt ggaccccaat cgagaaatcg attctttaga atcacgccac tcatgaggac 7440 caatcaaacc tagacataaa accaaggagt atcacattcc tagctccgtc atgtaatgca 7500 acaccgatcg ccagcagtgg atgcctttca tcaaatccga cgcatcccca aatatcgatc 7560 caggcacgta cgccaccgac acgaagatgc agccgaggaa agcaacgccc gaagccagag 7620 tcgggagttc caagagaagg gctcatggaa actagccagc agcagagaga tcagttccct 7680 ttgagacaag cagacccaca gtcaagtcgg ggaattcatt taaatcttgt gacataaaga 7740 tatttaagtt tgtaaaataa aagtattttt ttaaaaactt aaaatcgagt tgcggataat 7800 gaactggcgc agtcggtagg atttcgttta aaattaacac ctatatttcg gtaatggtca 7860 gcttgtgtaa tttataatta ctgtgatcac gtaatcccgg aattgttaat aaaattgtga 7920 aatccgccaa agaacctatg tgctgcaagc acatacgatt tgcaaaacta atttaagtga 7980 actgaagtaa gcggagatag tgaatcacca aggacagtga aacacttaat acaaaatcaa 8040 aaaaaactta caagtgaacc aaaataaagt taaaaactac aacaaaacaa aacccaatca 8100 acaccaacaa catggcgtta ccaccactat cattgagcga aatccacttg aaccaggcac 8160 tgctgtcgat ccgacaaata cctgcctatg acggttcatc tggacagcta agctcattta 8220 tcaaacgaat cgaatatgta tgcggattat atccaactga agatcttcgc caacaaagga 8280 tcctttttgg agcgaccgag atccaactgt caggagaggc acaacgcata tcgcaaatga 8340 tacaacccaa cacgtggctc gaactgaaga cggcgctgat caacgagttc aaaaatcaca 8400 caccatatga agaactctta cgacaactat acaccaccag attcaacggc agtcttcgta 8460 agtttataga agaattagag ataaaagcat tattaataaa caataagcta aacttagaaa 8520 acataccaga aaataatgtc atttataaaa acgctttgaa caacacaatt aaagatgtca 8580 ttactcgaaa acttcctgac agaatgttca tgaccttagc aagaaaagat attaccacat 8640 tgtctaatct taagaaggca gcacaagaag aaggattgta cgaatctagt acaacagatt 8700 caggtacaaa taaagatcaa tcgaaatcgt ctcaatcaaa tcgaaagaat gttggaaact 8760 attatccaaa ttataatact actaactatc acaatcaaac tcagagttca tctggaacag 8820 gtcagacgaa tcgaacaaat caaaatcaaa attcccaaaa tcctggatta tacaatgagt 8880 tcaaaaattc cttgaatcaa gggagggctc agaatccatt caatcagcaa atatttcaaa 8940 accaagctag tgggagtgga cccaaccaac ctactaaaag agggagggag agccagagtg 9000 gccaaacgaa gatggacgta aattttcatc aagccgcctc ggaagaaacc gaggaggacc 9060 ctatataaac attaccatta ataaaagaat attaaggtgt atcgctgact cgggatcatc 9120 gatcaacatc atgaaagaaa attatataga ttccgagata aaagacgata accttacggt 9180 ccataccatc aatggacctg tccgtttaac taagagtata atgaggaatg cgaacaaaac 9240 atgtccgtcc gaacaaaagt tctacataca taacttttcc gaaaattatg atattttatt 9300 aggaagagaa tacttgatgg caagtaaagc cgtcatcgat tacagaaatg acactgtaac 9360 actaggagct aggtcgtacc caatcattca gagtggagaa gctattgaag ccggcaagac 9420 cgcacagaag tgcactgatc cctctacaaa agaagagaag atcgcaccta agtgccttga 9480 cccatctcca gagggagatc aatacttcgc ttccgcgcta gacagtgaat taagggaatg 9540 taatcaatta agattggaac acttaaacga cgaagagagg gaagagttaa agaaggtctt 9600 atatgagttt aaggatgtgc agtacagaga aggtgacaat ttgactttca caagtacaat 9660 caagcacgag atcaaaactc aacatgaaga cccggtctac agacgaccat acaaatatgc 9720 ccagatacat gatcaggagg taaaccagca aatcaaagat atgatcaggc agggaataat 9780 tcgaaagtcg aattctcctt actgctcgcc catatctatg attccgaaga agatagatgc 9840 ttcaggaaag caaaaatatc gaatggtagt agactataga agtctaaatg aaataacagt 9900 taaggacaaa ttcccaactc caagaatgga tgaaatccta gacaaactag gtaaatgtca 9960 gtattttacg acaatcgatt tagcaaaagg tttccatcaa atacccatgg aacctagctc 10020 aatccccaaa actgcgttct ccactaagca tggacattac gagttcactc gtatgccctt 10080 tgggctaacc actgccccag ctaccttcca aagatgtatg aacaatctgc taaaagagtc 10140 aatcttcaaa gattgctttg tatacttaga tgacatcatt atattttcca cttcattgga 10200 agaacacttg ttgtcactaa ggagagtatt tggaaagttg agagacgcca acttgaagct 10260 acaaatggat aaatgtgaat tcatgaaaaa ggaaactgaa ttcctaggtc atgtagtcac 10320 tactgaagga atagtaccaa atccaggcaa aatctctgcc attgtcaaat ttccaatacc 10380 agaaacgact aaacaaataa agtcattctt aggtctgtgc gggttctaca ggaaattcat 10440 tcctaatttt gcaaacatag caaaacccat gacactcaaa ctaaagaaag gagctgtcat 10500 agaccccaag gaagaaaagt acgtcgaggc atttgagaga ctaaaagtac tcatcacctc 10560 agaccctatc cttgccttcc cagactttga caaaatgttc acggtaacaa ccgacgctag 10620 taacatagca ttgggagcgg ttttgtcaca agaacacaaa ccgatctgct acgctagcag 10680 aaccctaaat gaacatgaaa taaactattc agccatcgaa aaggaattat tagcgatcgt 10740 atgggcaacc aagtatttcc gttcatacct attcggcaga acattcgaga tacaaagcca 10800 tcataaaccc ttgatttggt tgaaccacct caaggagcca aatatgaaat tacaaagatg 10860 gaaaaaaaaa atttaatgag ttcgacttta aaataaaata tctaccaggc aaagaaaatc 10920 atgtagcgga tgccttatca agggtaaaga ttaatgaaaa tctccttgga gaagcttcca 10980 aaagcgttgg tgcaactgta catagcgcac aggaagacaa tctaaatcac attgccatca 11040 cggaaagacc aattaactat tacaatcggc taattgagtt aataagaggc aatgaagata 11100 aggtagaaac aaatcgctat ttccacaagc taatcatcaa ggtcacatat aagcaaatga 11160 ctaacacctt agcgaaggat ataataaggg aatatctgtg caccaaaaag aatgcattat 11220 atttccataa cgaaatagat tttcctccat tccaaaaggc ctatttagaa atcattaatt 11280 ctaacaatat aaccaaagct attaaatcaa gcaccaaact tataaatatt caaaattact 11340 ctgaatttaa agaaaccata ttaaagaaat actgaataca ccggaaatac 11390 // ID Gypsy-63_AA-LTR repbase; DNA; INV; 226 BP. XX AC AAGE02026010; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-63_AA_; KW Gypsy-63_AA-I; Gypsy-63_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026010; Positions 44738 44963. XX SQ Sequence 226 BP; 65 A; 54 C; 37 G; 70 T; 0 other; tgtcataagt tacactagat tttctagacc attgcgttat acctgttttg tattagacat 60 accttgcgac agctgtcaaa gacttcccgc tcttttgtaa aagcttttag ttgagaccga 120 gcaagcaaac tataaataca cgttttccaa catcagtgaa atctacttta cggataattg 180 aacggcgttc ctcttccgag acccacattc ggctaccgtt acaaca 226 // ID Gypsy-8_PPc-I repbase; DNA; INV; 4467 BP. XX AC . XX DT 08-JUL-2010 (Rel. 15.07, Created) DT 08-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_PPc_; KW Gypsy-8_PPc-LTR; Gypsy-8_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-4467 RA Jurka J.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1008-1008 (2010). XX DR [1] (Consensus) XX CC Positions [2630-3091] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 480..1958 FT /product="Gypsy-8_PPc-I_2p" FT /translation="MNSTLEQLFSIIDKTTWEYLGKPELRRVSYSASAYNG FT TKIAFKGVCRVPVCLDRSETLFDLHVLEDSTRKHPLMGRDLIDALRMDMGP FT FYNGTPRVNELSQQKSVLGQLDGVLKANAELFRPELGKFIRRQAELKFKED FT KPSPVFRRARPVPHALRPKVEATIEKMVEQKVVTPVEHSEWASPLVIVPKP FT GGKLRICADFKQTLNPLLDINIYPLPKPDDLFHLLNGGEKYSKVDLKDAYL FT QMELCDEAKQYLVINTHRGLLRYERLPFGLASAPAIFQKSMEELLAGIEGV FT VIYLDDVTITAPNDAEHLVRLAKVLERFRSAGLRLKREKCEFLKEQIEFLG FT HLVSKEGTRPNPEKVKAISEMPPPGDLKQVESFLGMIQYYGKFIPNLSATA FT APLNALRRKGVTFEWSKSQQIAFETLKRRLVQADRLTHYDPQLPIVLATDA FT SDYGLGAVIYPDGNERAIGFASRSLTKEEKNYAQIEKEGLGIISGVEK" FT CDS 2090..3670 FT /product="Gypsy-8_PPc-I_1p" FT /translation="MEYSFDIEYRNTLEFGNADGLSRLPLSNQSQTVARIE FT DARSVQILNQEELSQIFITTEEWVRATKGDPILQKVEANLKRKWPSGPQDK FT ELVPYQERLDELSIVDGCLRWEHRLVVPRTLQPRVLKMLHENHFGQDRMKA FT LARSRFWFPTMDKMIELLAKSCETCAVMGKKPTKVPLHSWEIPQKPWQRLH FT IDFCGPFHGFMWLIVVDAKSKWPEMLKMRTTTSLATINKLMDIFAIHGFPE FT QIVSDNGPQLASKEFEEFCRKYGIQRTLTPPYHPNSNGQAERYVQTLKTAV FT EKCTYATKGDLDSVVRKVLFEYRVTPHPATKISPAQALMGRELRSRLDTQI FT PDTVPMPEPTEPNTYYENAKRNHDKISATRAFVVGEEVYVYDTAGGPQCWN FT PGMVVQKLSECIYVVNLGNHEKKVHANHMKRKETGSAYERLRLEYGRDLNT FT EVRIPQATVVEKGKVQIEAPSTRNLQNPSLSKSRQLDSSNREPLASVPTSQ FT NGEEIGLRRSTRKRQLTERFSEYVTETKRRH" XX SQ Sequence 4467 BP; 1253 A; 1178 C; 1097 G; 937 T; 2 other; actggcgttc agaactcgcg cctttttgct aagtacggta tattgaaacc ccttgtctag 60 ctatcgaaat gcctgacgag gatactaagg cggtgaaagc cgagatctat ctggccacca 120 tggtccaccc aaccctgctc aaggaaaggg aggccaccat gataagtgga aaaacaagaa 180 ccccaagcca tcgggacctc atccctttgc cggaaaaggt ggaaaatctc gaaaccccgg 240 aaacggtaaa gggcaccact cgggttctca aaatcaaggt ccaagatgtt tcaactgtaa 300 caaaaatggc cacatggcca atgtgtgcag agccccacgt aatgcccacg cattaggcta 360 ctctgaaaca tcagacgatg acatgggtat cttccaggta accttaggat cggtggaaag 420 gaactcatcc atccccccga tcctggttgc ggttgagaca tacgggaaac tcatccagta 480 tgaactcgac actggagcag ctcttttcca tcattgacaa aaccacttgg gagtatctcg 540 gtaaacccga gctgcgccga gtctcatact cagcgtcagc ctataatgga accaagatcg 600 ctttcaaggg agtttgtcga gtccccgtgt gcctggatag atcggagacc ctgttcgacc 660 tccatgtcct tgaggattcc acgcgaaagc accccctcat ggggagggat ctaatcgatg 720 cgctgaggat ggacatggga cccttctaca atggtacacc tcgggtgaac gaattgagtc 780 agcagaagtc agtattaggc caactcgacg gagtactcaa ggctaatgct gaactgtttc 840 gcccggaact aggcaagttc atacggcgac aggctgaact gaaattcaag gaggataaac 900 ccagcccggt tttccggaga gccaggccgg tgccccacgc actacgaccc aaggtggagg 960 caaccatcga gaagatggtc gaacagaagg tggtgacccc cgtagaacac tcggaatggg 1020 cgtcacccct tgtaatcgtg ccgaaaccgg gcggcaaact tcggatatgc gcggatttca 1080 aacaaacatt gaatcctctc ttggatatca acatctaccc gctaccgaag cccgatgatc 1140 tctttcatct cctcaatgga ggtgaaaaat actccaaggt ggatctgaag gatgcctacc 1200 ttcagatgga gttgtgtgac gaggctaagc aatatctagt aatcaacacc cacagaggat 1260 tactaagata tgagaggctt ccgttcggac tagcctccgc cccggccatc ttccaaaagt 1320 ccatggaaga actgttggct ggcatcgaag gtgtggtgat ctacctcgat gacgtcacca 1380 taaccgctcc taacgacgct gaacatctcg ttcggctcgc caaggtgttg gaaagatttc 1440 gctccgcagg cttgcgcctg aaacgcgaaa aatgcgagtt cctgaaagaa cagatcgagt 1500 tcttgggaca cttagtgagt aaggagggta ctcgacctaa ccctgaaaaa gtaaaagcaa 1560 tcagcgagat gccacctcca ggggacctca aacaggtcga gtctttccta ggaatgatac 1620 aatattacgg gaagttcatt cctaatctat cagcaacggc cgctccccta aatgcactcc 1680 gcagaaaagg agtgactttc gagtggtcta aatctcagca aatcgctttc gagaccctca 1740 aaagacggct ggtacaggct gatagattga ctcactatga tccacaatta ccgatcgtcc 1800 tcgcaacaga tgcgtcagat tatggtctag gagccgtaat ctacccagat ggcaacgaac 1860 gggccattgg gtttgcgtcc cgatctctga ccaaagagga aaagaattat gcccaaatcg 1920 agaaagaagg tctcgggata atttctggag tggaaaaata aaccagtttc tatatggtcg 1980 gaaatttctc ctactcaccg atcaccaacc tttggtcagg agattcggtc ctaagcacga 2040 aattcctata gttgctcagc gtcggttaac mcgctgggaa ctccgtctca tggaatattc 2100 gttcgacata gagtaccgta atacactcga attcggtaac gcggatggtc tgtcgcgact 2160 gccgctgtcg aatcagtctc agactgttgc taggatcgaa gacgcccgat ctgtgcaaat 2220 tctgaatcag gaagaattat cacagatatt catcaccact gaagaatggg tcagagcgac 2280 caagggcgac cccattctgc agaaagtgga ggctaacttg aaacggaaat ggccttccgg 2340 tcctcaggac aaagagctgg taccgtatca agaacgccta gatgaattat ccattgtaga 2400 cggatgtcta cgatgggaac atcggctagt cgtccctaga acgttgcagc cgcgcgttct 2460 aaaaatgctc catgaaaacc acttcggaca ggatcgcatg aaagcactcg ctagatcccg 2520 attttggttt ccgaccatgg acaaaatgat cgaactcttg gccaagtcct gcgagacttg 2580 tgcagtcatg ggaaaaaagc cgacgaaagt ccctctgcac agctgggaaa taccgcaaaa 2640 gccgtggcaa cgcctgcata tcgatttttg tgggccgttt catggtttta tgtggctgat 2700 agttgtcgat gcgaagtcta aatggcctga aatgttaaaa atgagaacta caacatctct 2760 cgctacgatc aacaaactaa tggacatctt cgctattcac ggttttcccg aacagatcgt 2820 gagcgacaac ggccctcagt tggcttcaaa agagttcgaa gagttctgta ggaagtacgg 2880 tatccagagg acacttacac caccatatca ccccaactcg aacggccaag cggaacgata 2940 tgtgcaaacg ttgaaaactg ctgtggaaaa atgcacatac gctacaaagg gtgatttgga 3000 ttcggtggta aggaaggtct tgttcgagta tagagtaacc ccacatcccg cgactaagat 3060 tagtccggcc caagctctca tgggcaggga gttacgatct agactcgata cacagatacc 3120 ggacacagtc ccgatgcccg aacctacaga acccaacacc tactatgaaa acgctaagag 3180 aaatcacgac aaaatctcag ccactcgagc gtttgtagta ggtgaggagg tctacgtgta 3240 cgacactgct ggtggtccgc agtgctggaa cccaggaatg gtagtacaga aactcagcga 3300 atgcatctac gtggttaatc tcggaaacca tgagaaaaaa gtgcatgcga accacatgaa 3360 aaggaaagaa accggatcag cttacgagag gttgcgtctt gaatacggta gagacctgaa 3420 cacggaggtc agaattccgc aggctaccgt agtagagaaa ggaaaggtgc aaatcgaggc 3480 gccaagtacg agaaacctac aaaatccatc cctcagcaag tcgcgtcaac ttgattcttc 3540 gaatcgagaa ccgcttgcat cagttcccac gtctcagaat ggggaggaaa ttggattaag 3600 acgctccaca agaaagagac aattaacaga gagattctct gagtacgtga ctgagacaaa 3660 aaggcgtcat tagcatgtta gcgcgagtcc ctcgttgcct aatctctcaa attacataag 3720 ccaacgatac tcaaagcgca tttacatgtc tgccccaatg ggtatataag gaggtcaggt 3780 tgaggagcag acgcattctc aatccattcc cagtctcaaa tgggtaaagg aaagaggaag 3840 atcgtgtcta cgtccgcctc cggagccccg ccaacgaccc gaccccctcc tgcggttacg 3900 ccggaactgg taagcatggt agtctcaatt accataccct ctgaccggac taaccaatac 3960 cctccagatc tctcaattgg tggcgtcgct ggtgaacccc aamccgtcat cctccgtgcc 4020 ggtcaccact accaccactc cccctcctac cccccctgac tcttccggtg actccgcagc 4080 ctcctttccc acctgcatct caaggtggtg aaggcaccgc tgacggatcc cgaggatcag 4140 cggtacccga ggctcgtctc cagacggtcg gccagatcgc acaagcacgc acaccgaaag 4200 cctgcgcatc ggctgaacag aatcgaggcg aacatcgctc gatctctgac cgagacagcc 4260 agactcgatc aggatcgagt caatctcctg gcccaagtga ccgagttgag aactgaggtc 4320 cgggagctcc agtaccctcc gaaaccttcc atcctgtgct ctccaccgcg tcgtccactc 4380 ggcgcgagag tccactcgga tggaactacc gtacccaaca ctcctgcaaa gggactggga 4440 acggaggtga gtatctcggg gaggagt 4467 // ID Gypsy-135_AA-LTR repbase; DNA; INV; 165 BP. XX AC AAGE02025177; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-135_AA_; KW Gypsy-135_AA-I; Gypsy-135_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-165 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025177; Positions 34410 34574. XX SQ Sequence 165 BP; 53 A; 35 C; 28 G; 49 T; 0 other; tgtcgtgtcc gaagataact gttatactag catatttcga tatccgaatg taactgcgat 60 cgaatagcga agcttatcgc gatacgaatg tagaccaata aatcagttag acttttacct 120 ccaagtgaac cagtcgtttc atataatttc ccttatcaca gaaca 165 // ID CR1-99_AAe repbase; DNA; INV; 4886 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-99_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4886 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1187-1187 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 345..1178 FT /product="CR1-99_AAe_1p" FT /translation="MAKPCAKCCDAITGFDYVICRGYCGGVFHMNNCTNVT FT RALSSYFTTHKRNLFWMCDSCADLFENSHFRTISAKADAQSPLASLTTAIT FT ELRTEIKHLNAKPTAQAPTPGNNRWPPLEMRRATKRPRDIETVGLPSEQCR FT IGSKQASENVVTVPMCSDSTDKKFWLYLSRIRPDVTNEMICAMVKENLAMD FT DEPDVVKLVPKGKDISTLSFISFKIGLDPSMKTPAMDPTNWPEGLLFREFE FT DYGIPKFRMPLKTRKPNTPLILPQNPSSPATPVMDLS" FT CDS 1444..4827 FT /product="CR1-99_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAPNPLIAVEPFQPAFTSHSGPAFEFGGGVFQTSNT FT GKYACVMNHSLPELSTVCRFSSHLQATNQSATTASLTLDESRSSSSQLESI FT IPRREPGRNVAACTKEAPIPLNSVEPLLPATHSRPGPVVELEDGVFRNPTI FT GKYMYPEPLFVPDGISISSVTTSNFTQPVATIRGTQNTEAQHLLTLYYQNV FT RGLRTKLEDLRLALSSCDYDILVFTETWLRPDICDLELSSEYCFFRCDRNE FT LTSEFLRGGGVLVAVKTNLCCSSVSLSDFNQLEQVAVRVKLPNRSVYISCI FT YIRPNSSPDLYWSHCNAVLSIFDQLDDSDFLVVLGDYNLPHLRWRFDEDLA FT AYLPVNASSEQELALLESLPAHGLLQVNRLTNINERLLDLAFVNSSDVVEL FT VDPPQSLLPVDPHHKPFLMLLDMHTSMEEQRVTSMEMDFRHCNFDNLNDVI FT ASVDWQFGEGAVDYKMSCFYDQLNEIFDAHVPRARRVSHGPLRRPWWTAEL FT RNSRNMLRKSRKRFFKTRSSTNYTNLRNMESSYKNLLAVTYREYIQKLERD FT LKQNPTAFWKYVRSLKANPKIPTNIVFGGVTANSDAEAAGLFAKFFQSVYS FT ADAPTAHAGCFEKVPTYDINIPELSFSVENVRKALEDVDESKGAGVDGLPP FT LFLKKCASSLALPIASLFNQSLRDRTFPKAWKAARIVPIHKSGSSNHVENY FT RGISILCSLGKVFESMVRDALYRTVHSVISEHQHGFVEHRSTTTNLMCYSS FT VLFKEIEQRKQVDSIYVDFSKAFDVVPHVYAVEKLKHMGFPEHITGWLLSY FT LTHRKAAVFVNSASSDEFPIPSGVPQGSVLGPLIFTLFVNDVCYRLRSPKQ FT LFADDLKFYRVIESTIDCLALQNDIDDLLLWCNQNGMQINISKCRVISFTR FT RRNVMNYQYTIGNRSLERSDTIRDLGVTFDSKLRFHEHISNMTAKAFSVLG FT FIRRHTSNFTDVYTLKTLFCSLVRSILEYAVPIWAPYQATHIIRIERVQRK FT FIRFALRQLPWNDPVNLPDYQARCRLIGLETLSDRRKNSQRLFVFDVLEGN FT TDCPELLEQTNFQVPSRNLRSYPAISVPFHGTNYGQHSSYSSCLRAYNEVS FT RLHDFGMSKTHFKSRIRRLE" XX SQ Sequence 4886 BP; 1330 A; 1134 C; 1052 G; 1370 T; 0 other; aattttacat attttattgc tgaactttgt gttctaaaat tacttttgtg cgttctcgtg 60 ttgtatttta ccgacagttt gcgtgtttca ctacagtttc ggtcggtcgt gcccccgttt 120 cactaatttg atagtgtatt ttaaagtgtt caattcgcgt tgtgtttata aaattggaca 180 gctcactacc tccacgttaa ttgtttttcc gagaagcgac atctaccggt aaacgagtgg 240 aaacaacacg gcatattcag catattcagc gtttttttgt catcgaagaa gctttcttgc 300 atccaagtta ccactgtgta gtctacgtaa aggcgcaata agcaatggca aaaccttgcg 360 caaagtgctg tgatgccatc accggctttg attacgtcat ttgtcgtgga tactgtggtg 420 gagtttttca catgaataat tgtacgaatg tcacacgtgc actatcatca tactttacga 480 cgcataagag aaatttattc tggatgtgcg atagttgtgc tgacttgttc gaaaattcac 540 attttcgaac aatttcagcc aaggcagatg cgcaatcccc attggcctcg cttaccacgg 600 caatcacaga actgcgcacc gaaatcaaac atttgaacgc aaaaccaact gctcaagcac 660 caacacctgg aaataaccgt tggccgccac tcgaaatgcg tagagccaca aagcggccac 720 gtgatattga aactgttgga ttaccatccg aacaatgtag aattgggagc aaacaagcct 780 cggaaaatgt tgtcacggtg cctatgtgta gcgactctac ggacaagaaa ttttggcttt 840 atttatcaag aattaggccg gatgttacaa acgagatgat atgcgctatg gttaaagaaa 900 atctcgctat ggatgatgaa cccgacgtcg tgaagcttgt tcccaaaggg aaagatatta 960 gcacgctgag ctttatttcc tttaagattg gattggatcc atcgatgaaa actccagcta 1020 tggatccaac taattggccc gaaggactgt tgttccgtga gtttgaggat tatggaatcc 1080 caaaatttcg gatgccatta aaaaccagaa aaccaaatac accattgatt ttgccgcaaa 1140 atccatcgtc tcctgctact ccagtaatgg atctcagtta gataacttca tcccgggacg 1200 caccagttat tgcatgatgg aagccccgta tcctcccaac acagtcgagc cattcctgcc 1260 agcgaccaac agtcgtcccg gtcctgtgtt tgggtctgga gaaggggtct tccaccctca 1320 ttcgcgggca agtactcttt tcacatcaat ttttcgccac ttgatgaatc caccgcttct 1380 agtcaaccga catcgatatc gtttctcacc aacctgtcac cggaatgcaa gcctgtcagt 1440 tgcatggaag ccccaaatcc actcatcgca gtcgagccct tccagccagc gttcaccagt 1500 cattccggtc ctgcgtttga gttcggagga ggggtcttcc aaacttcgaa tacaggcaag 1560 tacgcatgcg ttatgaacca ttcgcttcct gagctttcca ccgtttgcag attttcctcc 1620 catctccaag caacaaacca atcagccact actgcgagcc ttactttaga cgaaagtcgg 1680 tcttcatcta gccaactgga atccatcatt cccagacgtg aaccgggacg caatgttgcc 1740 gcttgtacta aggaagcccc tattcctctc aactcagtcg agcctctcct gccagcgacc 1800 cacagtcgtc ccggtcctgt agttgagttg gaagacgggg tcttccgaaa tccaacaata 1860 ggcaagtaca tgtatccgga gcccttattt gtgcctgatg gaatttcaat ttccagcgtg 1920 actacttcca actttactca gccagttgcg acaatccgtg gcactcaaaa tacggaagca 1980 caacacctat tgacactata ttaccaaaat gtacgaggct tgcgaaccaa gttggaggac 2040 ctgcggttgg cgctctctag ctgcgattac gatatactag tctttaccga gacatggctc 2100 aggcccgaca tatgtgattt ggaattatca tctgaatatt gctttttccg ttgtgaccgc 2160 aatgagctta ctagtgaatt cctacgaggt ggaggagttc ttgttgctgt aaaaactaat 2220 ctgtgttgct catctgtatc gctgtctgac ttcaatcaac tggagcaagt tgctgttcga 2280 gtcaagcttc caaatcgctc ggtgtacatt agctgcatat acatacggcc aaactcaagc 2340 cccgatctgt actggtctca ttgtaacgca gttctttcca tctttgatca gcttgatgac 2400 tctgattttt tggttgtgct aggggactat aaccttccgc atctccggtg gcggtttgat 2460 gaagacctcg ctgcatatct acctgttaat gcttcgtctg agcaagagtt agcacttcta 2520 gagtcactcc cggcgcatgg tttgttacag gttaaccgac tcacaaacat taatgaacgg 2580 ttacttgact tggcgtttgt caactcatca gacgtagtgg agttagttga tccaccacag 2640 tcgctactac cagtggatcc tcaccataaa ccatttctta tgttgttgga tatgcacact 2700 agtatggaag aacaacgcgt aacttcaatg gagatggact ttaggcattg taattttgat 2760 aatctgaacg atgtgattgc ttccgttgat tggcagtttg gtgagggggc tgtcgactac 2820 aaaatgtcat gcttttacga tcagcttaat gaaatattcg atgcgcatgt cccacgagct 2880 cgtcgcgttt cccatggacc cttgcgaaga ccttggtgga cggcggaact cagaaactcg 2940 aggaacatgc ttcgtaaatc gaggaaacgg ttcttcaaga cgaggtcctc gaccaactat 3000 acaaatcttc gaaatatgga atcgtcttac aagaatttgc tagccgtcac ttacagagaa 3060 tatattcaga agttagagag agatctgaag caaaacccga cggcattctg gaaatatgtc 3120 aggtctttga aagcaaatcc taaaattcca acaaatattg tgtttggcgg agtgactgcg 3180 aattcggatg ctgaagcagc aggacttttc gccaagttct ttcagagtgt ctacagcgca 3240 gatgcgccta cagcccacgc gggatgcttt gaaaaagtac caacgtacga tatcaacatt 3300 cctgaactaa gtttttcagt agagaatgtg cggaaagctc tggaagatgt cgatgaatca 3360 aagggtgctg gtgtggatgg tttgcctcct ttgtttctga agaaatgtgc ctcctctttg 3420 gctttgccta tcgctagtct ctttaaccag tcgctcagag ataggacttt tccaaaagcc 3480 tggaaagcag ctcgaatagt acctatacat aaatcgggca gctcaaatca cgtcgaaaac 3540 tatcgaggaa tttctattct ttgctctctc ggaaaagttt ttgaatcaat ggttcgagat 3600 gcactgtata gaacagtcca ctcggtgatt tctgaacacc aacatggttt tgtagagcat 3660 agatcaacga cgacgaacct tatgtgctat tccagtgtgc ttttcaaaga gattgagcag 3720 agaaagcagg ttgattcaat ttatgttgac ttctctaagg cttttgatgt tgttccacac 3780 gtgtacgccg tcgaaaagct gaagcacatg ggatttcctg aacacatcac tgggtggctt 3840 ctctcatacc tgacgcatag aaaagcagca gtgttcgtga attcagcttc atcagatgaa 3900 tttcccatac cgtctggcgt accacaagga agtgtacttg gaccactcat cttcactctt 3960 ttcgtgaatg acgtgtgcta ccgactcagg tccccgaagc aattgtttgc agatgatctt 4020 aagttctaca gagtgatcga atcaacaata gactgtcttg ccctgcaaaa tgacattgat 4080 gaccttctac tgtggtgcaa tcagaacggg atgcagatca acatatcgaa atgccgagtg 4140 atttcgttca cgcgtcgtcg aaatgtaatg aattaccagt acaccatcgg taaccgctcg 4200 cttgaacggt ccgacaccat ccgtgactta ggtgttacat ttgactcgaa actacgtttt 4260 catgaacata tttcaaatat gacagccaaa gcattttcgg ttcttggatt cattcggaga 4320 cacacatcaa atttcactga tgtttacaca ttgaaaacgc tcttctgttc gttggttcga 4380 agcattctcg agtatgcagt accaatatgg gctccttacc aagcaacgca catcatccgc 4440 atagagcgag ttcagcggaa atttattcga ttcgccttgc gccaacttcc gtggaatgac 4500 cctgtaaatt taccggacta tcaagcgcgt tgcaggctta ttgggctgga gactttgtca 4560 gatcgacgga aaaacagcca acgattgttc gtttttgacg ttcttgaggg aaacaccgat 4620 tgtccagagc ttttggaaca aacaaacttt caagttccat cacgaaacct gcgcagctat 4680 cccgcgatta gtgtaccatt ccatggaact aactatggac aacacagtag ctacagttct 4740 tgtttaagag catacaatga agtaagcaga ttacatgact ttggtatgtc gaagacgcat 4800 tttaaatcta ggataaggag attagaataa gaacagtctg tacaattttt ataattgaag 4860 acgaagtaca aatacaaata caaata 4886 // ID CR1-83_HM repbase; DNA; INV; 3752 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-83_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3752 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 370-370 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(58..750,687..1892,1858..2409,2363..3625) FT /product="CR1-83_HM_1p" FT /translation="MVVSKDTTQAINTAIEVLRKELIERIIKVEKENERLN FT TENESLKKQLAEQCNQNNNKPLFSSLIKKTSEGAPPISDSETRLLNAISAE FT QKEKIKKEKNVLVFGLKESKRKSKDEIKEDDDESVGEVFDAIGLDKSCIVK FT HFRLNSRDKNRPGPLVIELVDPSYQKQVLSVARDLNKVEAYKNKVFINPDL FT TYRQRASLKLLLGERNSLNKIEEGQKFDISICYQKRYAAENNRGRSKVRHF FT DLLSETICCRKLKSKNTKKINAPSSLISKLSEITCLYMNATSLNNKFDELI FT EVINSNKANVVMICETWWNNESVTNINGYTLFRRDRGVGRGGGVCIYVQNN FT IKAYEVSEKCLNDPNIEQIWCVIELGSEKVLCGSMYRTGISNISNCINITK FT SINFAHSMNLLNKYTGILICGDFNFNAIKWTEDYYGTLTSETDTPAKIFLE FT CLNNCFMHQNVIKPTFQNNIGNETNVLDLIIAENKNRVYKLVHLPPLGGIN FT HGHHIIKFNYSHEECIDNIVLIKEKFNFNRGKFDKLNNYFKILNWDNEFKG FT LEINKCYEKWLSLYEEGCKLFIPIIKXGSIQINTKPLWINRELKALSKEKR FT NVWFECRRQKFRDNCLLAKYKELNKVYKKKSESKNTRFIKKKVNQRIKDFE FT YDLALNSKFNPKKVYSYINNKSKVKDNIRAIQLKNGSIETDLKVIANTLNG FT YFASVFNKPQTLSAPIIDIETQKICKNPDFNASVVEEYLNKLDTNKATGMD FT RVHPKVLKECKSTLAKPISLLFNKSFETGKLPKLWLCANIVPLFKNGDKLN FT PCNYRPTCIFNICSLTHATIDLPVSLTSVVCKVMEKIIKDKMMKHLVENKL FT INKNQHGFVNNKSCVTNLXESLDFITSSIDNGLDVIVLFLDLAKAFDSVDH FT ERLLLKLNALGFKSKLFDWCQGFLKNRLQRVVIGNNKSEWEEVVSGVPQGS FT VLGPLLFVVYINDISNKIKSSCKLFADDMKILRAIKNDNDVIELQQDIDLL FT MEWSNEWLMKFNQQKCKVMIIGNSQHKVLMNNTALAYTSMEKDLGVYISDD FT LEWKHHVNKAVNKANQKLGQIKHTFKYLDEKTMKLLFIALVRPYLEYAAPI FT WNPYRQYDKDKLECVQQRASRIKILKGYSYEERLKKLDLLSLENRRRRGDL FT IQMFKYLKGYDIINFYTKPEMLDNGYKTRGHNSKLRRQYTGNIKRHNFFTN FT RVVNDWNNLEQETIDAVSINQFKKKIG*" XX SQ Sequence 3752 BP; 1526 A; 494 C; 661 G; 1068 T; 3 other; atggcggcgt aaagatggca gcctagttga gcacgtgttt tgaaataatt tttaataatg 60 gtagtatcaa aagatacaac tcaagcaatt aatactgcta ttgaggtact tagaaaggaa 120 ttaatcgaac gcattattaa agtcgagaaa gaaaatgaaa gattaaatac cgaaaatgaa 180 agtttaaaaa aacagttagc cgaacaatgc aatcaaaata acaacaaacc actctttagt 240 agtcttatta agaaaacaag cgaaggagca ccccctatta gtgattcaga aacgagatta 300 ttaaatgcaa tttcagctga acaaaaagaa aagatcaaaa aagaaaaaaa tgtactagtg 360 tttggattaa aggaatcgaa acgaaaaagc aaggatgaaa ttaaagaaga tgatgatgaa 420 agtgttggag aagtatttga cgctattgga ttagataaat cttgtattgt caaacatttc 480 aggttaaaca gtagagacaa aaatagacca ggaccactag ttattgaact agttgatcca 540 tcataccaga aacaagtttt aagtgttgct agagatttaa ataaagttga ggcatataaa 600 aataaagtat ttatcaatcc agatctaacg tatagacaaa gagctagtct taaattgcta 660 cttggtgaaa gaaatagtct gaataaaata gaggaaggtc aaaagttcga catttcgatt 720 tgttatcaga aacgatatgc tgcagaaaat tgaaatcaaa aaacactaaa aagattaatg 780 caccatcaag tttaatatca aaactcagcg aaataacgtg cttatacatg aacgcaactt 840 cattaaacaa taagtttgac gaactaattg aagtgattaa ctctaacaaa gctaatgtag 900 ttatgatttg tgaaacttgg tggaacaatg agtcagtaac taatatcaat ggatacacct 960 tatttagaag agacagaggc gttggtagag gtggaggagt ttgtatatat gtacaaaaca 1020 atataaaagc gtatgaagtt tcagaaaaat gtttaaatga ccctaatata gaacagattt 1080 ggtgcgttat tgaattagga agtgaaaaag tattatgtgg cagtatgtac agaaccggaa 1140 ttagtaacat aagtaactgt atcaacatca caaagtctat taattttgct cattcaatga 1200 atttattaaa taaatatact ggaatcctta tttgtgggga ttttaatttc aacgcaatca 1260 agtggacaga agattactat ggcacactaa ctagtgagac tgatacacca gccaaaattt 1320 tcttagaatg tttaaacaat tgcttcatgc atcagaatgt gattaaacca acgtttcaaa 1380 ataacattgg gaatgaaacg aatgtactgg atttgattat agctgagaat aaaaatagag 1440 tttataaact ggtacacctt ccaccattgg gaggtattaa tcacggtcat cacattatta 1500 aatttaacta cagtcatgaa gagtgtattg acaacattgt acttataaaa gaaaaattta 1560 attttaatcg agggaaattt gataaattaa ataattattt taaaatctta aactgggaca 1620 atgaattcaa aggtttagaa ataaacaaat gttacgaaaa atggttaagc ctttatgaag 1680 aaggatgtaa actctttatt ccaatcataa aaycgggaag cattcaaatc aacacaaaac 1740 ctctctggat taatagagaa ttgaaagcgc tgtcaaaaga aaaaagaaac gtttggttcg 1800 aatgtcgaag gcagaaattc agagacaatt gtttacttgc aaaatacaaa gagttaaaca 1860 aggtttataa aaaaaaaagt gaatcaaaga attaaagatt ttgaatatga tcttgcttta 1920 aattcaaagt ttaaccccaa gaaagtttat tcatatatta acaataaaag taaagtcaaa 1980 gacaatataa gagcaatcca actaaaaaat ggtagcatcg aaactgattt aaaagtaatt 2040 gcaaatacat tgaatgggta ttttgcatca gtattcaata aaccacaaac tttatctgct 2100 ccaataatag atatagaaac acaaaaaatc tgtaaaaatc cagattttaa tgcctctgtt 2160 gttgaagaat acttaaacaa actggacact aataaagcaa ctggaatgga cagagttcat 2220 ccaaaagttc taaaagagtg caaatctact ttagccaaac caatatcgtt attgtttaat 2280 aaatcatttg aaacgggtaa acttccgaaa ttatggttgt gtgcaaatat cgtcccmctg 2340 tttaaaaatg gtgacaagtt gaacccatgc aactatagac ctacctgtat ctttaacatc 2400 tgtagtttgt aaagtcatgg aaaaaataat taaagataaa atgatgaaac atttagtaga 2460 aaataaatta atcaataaaa accaacatgg atttgtaaat aataaaagtt gtgtgaccaa 2520 cttawtggaa tctctagatt ttatcacaag ttcaattgat aatggtttgg atgtaatagt 2580 tttgtttctg gacttagcca aagcgtttga ttcagttgat cacgaaagat tattgttaaa 2640 attgaatgct cttggtttta aatcaaaact atttgactgg tgccagggat ttttaaagaa 2700 cagacttcaa agagtagtga taggaaataa taaatcagaa tgggaagagg tcgtgagtgg 2760 agtaccacaa ggatcagtgc taggaccact attatttgta gtgtatatta atgatatctc 2820 caacaaaata aaatcaagct gtaaactatt tgctgacgac atgaaaatcc ttagagcaat 2880 aaaaaacgat aatgatgtta ttgaacttca acaagatata gacttattaa tggaatggtc 2940 gaatgagtgg ttgatgaaat tcaaccagca aaaatgtaag gttatgataa taggaaatag 3000 ccagcacaaa gttttaatga ataatacagc tttagcctac acaagcatgg aaaaggattt 3060 aggagtgtac atttcagatg atttggaatg gaaacaccat gtaaataaag cagtaaataa 3120 agcaaatcaa aagctaggtc agatcaaaca cacttttaaa tatttagacg aaaaaacaat 3180 gaaactcttg tttatagcct tagtacgtcc atatttggaa tacgcagctc ctatctggaa 3240 tccttatcgt cagtatgata aagataaact agagtgtgtc caacaaagag cttctagaat 3300 taaaattttg aaaggataca gttatgaaga gaggttaaag aagttggatt tactatcgtt 3360 ggagaacaga agaagaaggg gtgatttgat acaaatgttc aaatatttga aagggtacga 3420 tattataaat ttttacacaa aacccgaaat gctagataac ggatataaaa ctagaggtca 3480 caatagtaaa ctgagaaggc aatatacagg aaatataaaa cgacataatt ttttcacaaa 3540 tagagtagta aacgactgga acaatcttga acaagaaaca atagacgcag tcagtatcaa 3600 ccagttcaaa aaaaaaattg gatgaacatt ttaaatacta acttttttta tgtttgtatg 3660 ttagtgtgtt acagctgtta aagttacaaa ctctacgtag tttggactca tcactcttta 3720 agtgtacagc actattatat tatatttttt ga 3752 // ID Gypsy-594_AA-I repbase; DNA; INV; 5764 BP. XX AC . XX DT 13-JAN-2011 (Rel. 15.1, Created) DT 13-JAN-2011 (Rel. 15.1, Last updated, Version 2) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-594_AA_; KW Gypsy-594_AA-LTR; Ty3_gypsy_Ele66; Gypsy-594_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5764 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX DR [2] (Consensus) XX CC Positions [4908-5318] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1860..2480,2484..3887,4449..5402) FT /product="Gypsy-594_AA-I_1p" FT /translation="MRLINKLNEAEDLTLQRVVEQCNSLVNLKQDTVLVEN FT PSTVNFVAHNKRNKPYRSNSNSPSRDQPRTPCWSCGGMHFSKDCRFRDHKC FT RDCGKQGHKEGYCSCFSSKYSSSAPSSKKPSTKNKKNQRKQSTKTVTVRNI FT SQGGSSSTSSLTVFLSGCNWTRDPMCRSFRTVRGSSLVDRKRKLQRVKLEQ FT RPESLWILSRSCGVASPTTSRNRVSATLPLLTSSLTFWGLNSWICLDCGTK FT TPVQLVLKGTPKPVFRPKRPVAYSMESAVEEELHRLESLGILKKVDFSDWA FT APIVVVRKPNGAVRICADFSTGLNSALEPNRYPLPLPEDIFTKMANCRYFS FT HIDLSDAYLQVPVDEASQQLLTINTHKGLYQFTRLSPGVKSAPGAFQQIMD FT TMLAGLKFTVGYLDDVLVGGRTKAEHQQNLQKVLARLQEFGFTVRIEKCTF FT SMRQVKYLGQVLDGDGIKPDPDKVAAIVNMPPPHDVPTLRSYLGAINYYGK FT YVKEMRTLRQPMDELLKVDTKFDWSPACQKSFDRFRELLQSPLLLTHYNPS FT VDIIVSADASSVGLGARIAHRFPDGSVKAIYHVSRSLTAAETNYSQVEKEG FT LALVFAVTRFHRMLFGRKFTLETDHKPLLQIFGSKKGIPTYTANRLQRWAL FT TLLLSTTSKFAMSPPTASVTRTYYQDFWSIPSQSGLGSSSNEGHHYSNNLA FT NSPWNLCKIRTYGQPETLVTDNGTQLTSDKFEAYCAANGIIHLKTAPYHPQ FT SNGLAERFVDTFKRTLKKIIAGGEALDEAIDSFLLCYRSTPCRSAPDGKSP FT GELLYGRPIRTSLELLRPPTPYNESTDEQEKQFNRKHGTKAREYHPHDLVW FT AKVYSANTRTWQPGRVIERIGSVMYNVWLSSKQNLIRSHCNRMRSRYESEG FT SARNISAPANTQVPLSILLDSWGLRSTVTDTEDESSQPTPLPAALLDDLQP FT IQRRQRTPQSNTQQPPIPTRQSSRIRRLPVRYEPYSLY" XX SQ Sequence 5764 BP; 1663 A; 1389 C; 1303 G; 1409 T; 0 other; gtggcgacga ggcggtagta aattcgtgct aaattcgcgc gttcatcccg agaaaaggcg 60 ttcttcgtgg atcggtagtg ggagacgaaa ttgtgcgatg gcgactaaca acgccaatag 120 tgcagcaggt tctggaggcc aacaacagtc gggcaacatc acggacacca ttgttcagat 180 cctccgtaac caacaagcgc ttatggggca aatgtcacaa cagctggcag caactcaatc 240 cgccatcaag agtttgtccc gcgatgaggt tgtgctcgat tccctctcaa gcaatatgca 300 ctgtaagact aaagtaccct ttaaagggta aaaaagtacc ctctttgaga gaaactagtt 360 taactcataa aaagagtaaa gggcctgtac ccattaaaga gtaaacggct tgtacgtcaa 420 actaacgagt aaattttacc cactattcga aataaattca gttaggaaag agagtaaatt 480 ttactctgtt tctagtctta cgatttataa taacaatttg aaatcaattt atgaataata 540 ataatgaata agggacaaaa atcatacagc actttattaa gatcaataaa tattgcattt 600 gtttaatatt agcattaagt tatatataaa agtattgtaa aaattctgct gcgtgggggg 660 atgctgcgtg ccacgaccgt ctctcgggca tttggccgca gtttcctctc tggtatcatg 720 ctcctaactg aacgaacagc aacgaagttt ttggttctcc aatccattct gtaaaaacaa 780 cagcgtttct tgaaatgaaa tgaataatat ttttcgattt tggactaacc tgcaaatcta 840 gacactcctc cattatcagc gacccagatg cgactccagt tacaattaag tataacggac 900 tgaagaacag catccgccaa aatattttaa gaactttttc actcattaaa gggtaaacca 960 ctggaactta caaatgggca aaatttaccc actatgcaaa attttgaact acccttcatt 1020 ttgacagctt catatatcag aaaatagtgg gtacttttta caccttaaag ggtaatgcca 1080 ggtttacagt gtggtggaat tcgtgtacaa caaggaaagc ggtaacacat tcgacgcatg 1140 gttctctcgc tatagtgatc tttttgatcg agacgcgtcc aagttggatg acgccgagaa 1200 agtgcgttta ctcctacgca agctcagccc accggatcac gagcgttata ccatttttat 1260 cttgccgaaa actactcggg aattcacttt caccgagact gttgaaaaca cgggccctcc 1320 aagcttgtca accacatatt gatgcaccgt ggccaacatg attatgacat ttttgggaat 1380 ctgacacttt tgggaaatct gacaattttg ggttgtacac gttagcttat aactactcat 1440 taacgtgggt ttaagctggg taaggcgcat aatgacggaa ttgattttag tgtgttattg 1500 cttaaacatt acacgattcc tagaaatacg caagtaaatg agtaaacaat tgtgtaagcc 1560 actaaataaa aggagaaaat atcttcaacg tgttcttgcc gaaaactact cgggaattca 1620 ctttcgccga gaccttggtt gaaaagctaa aggcgctctt cggtgccacc atttccatct 1680 ttcgccggcg ctataactgt ctacaaacaa ccaaagaaga cagtgaagat tatctcgcct 1740 attcatgcaa ggtgaataaa tcatgcgtcg atttcaagtt gtcggagttg accgaagaac 1800 aattcaagtg tttgacgttt gtttgtggac tcaaatcgaa gcaagatgcg gagatccgaa 1860 tgcggcttat caacaaactg aatgaagcag aggatctcac tctccagcga gtggttgaac 1920 agtgcaacag cctggtgaac ctcaaacagg acaccgtgct tgtggagaat ccgtcgacag 1980 tgaattttgt agcccacaac aagcgaaaca agccatatcg gtctaacagc aacagccctt 2040 ctcgagatca acctagaaca ccctgctggt cctgtggcgg gatgcatttc agcaaggatt 2100 gtcggttccg cgatcacaag tgtcgtgact gcgggaaaca gggccacaaa gaagggtact 2160 gctcttgttt ctcatcgaag tactcatcca gcgcacctag ttcgaaaaag ccaagtacga 2220 agaacaagaa aaatcagcgg aagcagtcga cgaaaacagt gacagtccga aacataagcc 2280 agggcggaag ttcatcaacg tcaagcttaa cggtgttcct ctccggctgc aactggacac 2340 gggatccgat gtgtcgatca tttcgcaccg ttcgtggatc aagcttggtc gaccgcaaac 2400 gaaaactgca acgtgtcaag ctagaacagc gtccggagag cctctggatc ttgtctcgga 2460 gttgcggtgt ggcatcaccc taaacaacgt cacgaaacag ggtaagtgct acgttgccgc 2520 tcctaacgtc cagcttgaca ttttggggat tgaactcatg gatatgtttg gattgtggaa 2580 ccaaaactcc tgtccagttg gtgctgaagg gaactccaaa accggtgttt cgacccaaac 2640 gaccggttgc gtacagcatg gagagcgcgg tggaggagga acttcatcgg ttggagagtt 2700 tgggaatcct gaaaaaggtc gacttttccg actgggcggc cccaatagtt gtcgtccgta 2760 aaccgaacgg ggcggttcgg atatgcgccg atttctcaac aggcctgaat tctgcgttag 2820 aaccaaaccg ctaccctctt ccattacccg aggatatttt cacgaagatg gccaattgtc 2880 ggtatttcag ccacattgat ttatccgacg cttatcttca ggttcctgtc gacgaggcaa 2940 gtcagcagct tctgaccatt aacacccaca aaggcctata tcagttcacc agactatcac 3000 ccggcgtgaa atcggccccg ggggcctttc aacaaataat ggacacaatg ctcgctggac 3060 tcaagttcac tgttggttat ttagatgacg ttctggtcgg aggacgcacc aaagcagaac 3120 atcaacaaaa tctccaaaaa gttctcgctc gcttacagga atttggcttc acggtgcgca 3180 ttgaaaaatg taccttcagt atgcgtcaag tgaagtacct gggccaagtt ttggatggag 3240 acggcatcaa acctgaccca gataaggtgg cagcaatagt caacatgccg cccccacatg 3300 acgtgcccac tcttcggtcg tatttgggtg caatcaatta ctatgggaag tatgttaagg 3360 agatgcgcac tcttcgacaa ccgatggacg agctcttgaa ggtggacacc aaattcgatt 3420 ggtcgcccgc ctgccagaag tcattcgacc gtttccgaga gctacttcaa tcgccattac 3480 tgctcaccca ttacaaccca tcagtggaca tcattgtttc agcagatgct tcatccgtcg 3540 ggctgggagc ccgcattgcg catagattcc ctgacggatc tgtgaaagcc atctaccatg 3600 tgtcccgtag cttgactgct gctgaaacaa attacagcca agtagagaaa gaaggcctgg 3660 cactagtgtt cgccgtgact cggttccacc ggatgttgtt cggtagaaaa tttaccctgg 3720 aaaccgatca caagccgtta ttacaaattt tcggctcgaa aaagggcatt cccacctaca 3780 cagccaacag gctccagcgg tgggccctta cgcttcttct ctctacgact tcgaaatttg 3840 ctatgtctcc accgacagct tcggtcacgc ggacgtacta tcaagactaa tcaatagcca 3900 cgtgcggccc gaagaagaat acgttattgc ctctctagag ttggaacaca gcgttcgagc 3960 caccatcaat gaatcgatgc aagctttccc actctcgttc aaggtcattc aaggggcaac 4020 caaaagagac gactgcttac ttcgaaccat acgttacgtc aatgaaggtt ggccatcgaa 4080 taaaaagaat atagctgatc cagctatcga gcaattttat ttacgtcgtg aatgtctttc 4140 cattgttgct ggatgcctca tgtatggtga aagattagtc atcccatcag tctgtcgcaa 4200 gaaagtgctc gatcaactgc acaaggggca ccctggcgtc gaacgtatgc ggtccgtatc 4260 tcgtcagtat gtttattggc cgaatataga cgacgacgtg tccaagtttg tccgttcctg 4320 caacgagtgc gcaagcgtag ccaagacgga tcgaaaaacg aacctggagt cctggcccaa 4380 accagcaaag ccgtggcagc gtctgcattt ggatttcgcc ggccctttag atgggagcta 4440 ctttttgatt ctggtcgatt ccttcacaaa gtggcctggg aagtagttcg aacgaaggac 4500 atcactacag caacaacctt gcgaattctc cgtggaatct ttgcaagata cggacatacg 4560 gacagccgga aacgttagtt accgataacg gaactcagct aaccagcgac aagttcgaag 4620 catactgcgc agcaaatgga atcatccacc tcaaaaccgc tccatatcac ccgcagtcca 4680 atggacttgc agagcggttt gtcgatacat ttaaaaggac cctcaaaaaa atcatcgccg 4740 ggggggaagc cctcgacgaa gccatcgata gttttctgct ttgctacaga tcaactcctt 4800 gtcgcagtgc tccggatgga aaatctcctg gtgaattact gtacggaagg ccgatacgaa 4860 catctttgga actgctccgg ccaccaacac cgtacaatga gtcaacagat gaacaagaga 4920 agcagttcaa tcggaagcat ggaacgaaag ctcgcgagta ccatcctcac gatctcgtct 4980 gggctaaagt ctactcagca aacacgcgga cctggcagcc tggaagagtg atcgagcgta 5040 tcggtagcgt catgtacaat gtttggctat catcgaagca gaatctaata cgatcccact 5100 gcaatcgaat gcggtcgcgc tatgagtctg aaggcagcgc gcgaaacatc agtgcaccag 5160 caaacaccca agttccgctc tcaatactcc tggacagttg gggtcttcga agtaccgtaa 5220 cggacaccga agatgaaagt agtcagccta caccactgcc ggctgcgcta cttgatgatc 5280 tgcaaccgat tcaacgtcgt caacgaactc cccaaagcaa cacacagcaa ccaccgattc 5340 ctactcgaca gtcttcaaga atccgaaggc tgcctgtgag gtacgaaccg tacagccttt 5400 attaacaggg ggaggtgttg ggaggacaac cctatcccaa taacggacgc aaccgaaaca 5460 accaccaagg ctcttacgct gacagccgtc agcttgtgac agatgtcatg ggaagtttcg 5520 gcattcgtca ttccattgtt ggccaccatc atcataaaat ttgataaaat atacttcaat 5580 ttttaggtaa acatcaataa aatcaatgtt ttatacaact ttggcgaccc gtaactaaaa 5640 attgtgacgt gctaggaaat ttctgagaac gacatcagat ttagcaaccc caaatctttt 5700 agagacacat aatttgatcc ttgagacacg caaaaatgtc atttttgttg cgctgtgtta 5760 caat 5764 // ID Saci-1_I repbase; DNA; INV; 5312 BP. XX AC BK004068; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 02-SEP-2005 (Rel. 10.1, Last updated, Version 3) XX DE Schistosoma mansoni Saci-1 LTR retrotransposon mRNA, internal DE sequence. XX KW LTR Retrotransposon; Transposable Element; Saci-1_INT; KW internal portion; Saci-1_I. XX NM Saci-1_INT. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX DR Genbank; BK004068; Positions 116 5427. XX CC Key Location/Qualifiers CC CDS 125..5167 CC /codon_start=1 CC /product="pol polyprotein" CC /protein_id="DAA04498.1" CC /db_xref="GI:44829169" CC /translation="MPNNRAKKSIPKLIGEGEDKVNDKSLSTKQVFSDVFTPHGSNVF CC KESYSDSDSDVMLDKVADLSLDKGERLRHAVSGTTPKVPIVGLELPKVELCYFDGKPK CC GYWKFIKQYDTYVAARVSDDSQRLLYLLHYCKGKAKAAIEGCVMMEATSGYKRARDIL CC KRLFGQAHVIARETLEDLFNDVRRGCHDAEQLSSLAIKMENCSMILEQMNYTADLNSL CC VTLERVVRFLPQPMQRQWAEVVDSLTEEDREPTFAELTQFVAAKARVAASRFGQLAER CC PRGGVTTKSCYHSVVRPKNTPIASAKCSMCSGDHAVYECSQFLALTTEERLSHVKGKS CC ICFVCLKQGHKAIECKVTRRCAIDGCSGRHHSLLHKGSAKSKSEDRLATVNHCGHDRP CC VDGHVCLGMIPVRLRSGNAEVVGYALLDNGSDVTLIRSGCLKLLGLNEEQSSVVVQTV CC SGNKATRVIKTPFEVYSLDQTEHVKIQGALVVSQIPGHKPTKTIMSSLVKWPHLSDVP CC LEVIDSGEVVLLIGCDVPEAHWVLDQRLGGRKNPYAVKTLLGWTVFGPTSFSGLKKKV CC SNCMSKLQTLEDQFRKLYDVEFADVYSSDKSPSVEDRAAIEIVERGTFYDGGRFVVPI CC PWKMYPNKKTGNYEVAASRLLSLKRRLLKDSNLYTKYAKSIESNLAKGYAQRVPEIQL CC RSDYLPRWYLPHHAVINPKKPEKLRVVLDCAAKFAGVSLNDMIYQGPDTTAELVCILL CC RFRKEAIAICADVEEMFMQVKVPESDQGALRFLWWQETDMSKEPSEFQMTVHPFGATS CC SPFCANFALIKTAQTFSDGFDSYIVEAVKNNFYVDDCLVSFSTSNQAKNFVKQVSELL CC CKGGFALKKWITNSVEVRSVLHGVCKEGPVMEMSRDCDVIHRTLGVQWDCERDVFQFH CC FDAPERPLTRRGILSVASSLFDPLGLISPVCLTVKLLLQELCKSQIGWDQPINEPYTS CC IWLNWVNFMRQISHVKIPRGIKNKLDEPNARVELHLFSDASEIGYGAVAYARVSYLNE CC PPYCILLYSKSRVAPIKPVTVPRLEMAAAVLSVRLSEVLQRSLPNFFCEVNFHTDSMI CC VLYYIRNTGNRYSTYIANRLAILHQHTKVEQWTYVKSSENPADWTSRGIQKLADLESW CC IKCPTLQQAEFGKTITDCPQTIPDSIEFRKTAVVNLSKLNYDKFPIISYYSDWLRLVR CC AVAWLRRFIEYWMILHSSSLEGSVHLGCLKFEELESAKRKVLMIVQKEVYGRLLSEFK CC NNSNVNSQNDLKRLSPVMRDGLLCVGGRLNYSDYPDDFKYPVILPSRHLVTEMIVRHY CC HKEEGHSGTSQVLAAIRKYYWIVKGTSTVRRVIGKCVICRRYTTNSGQQLMAPLPACR CC VKPGWHSFSIVGVDYFGPILVKRGRSLEKRYGCIFTCLQTRAVHIELAYSLNTDSFVM CC ALLRFIGRRGKPSEIYSDNGSNFVGAISELRKYVRQWDQQRISNELSAKQIQWHFNPP CC SSSHRGGVWERMIRSVRRLLLLITREQTLNDETLGTYLVEIERILNDRPLTPIVQDAN CC DKLALTPNSLLLLRECDSIVDESSIRVNYDKRWKQVNYLANVFWKRWLREYIPSLQAR CC QKWLVERRNFQPGDVVIVVSDIFTRGKWPLGVIESCETDKDGKVRTVSVRTNNGSIRR CC DIRKVCLLEGSN". XX SQ Sequence 5312 BP; 1580 A; 928 C; 1350 G; 1454 T; 0 other; gttctaaacg aatattggtc ggttggcagc ttggtccgtg ttcaacagtc ctgaacaaat 60 attggtcggt ttgcagcttg gtcctcgctc acagttctga aacaaatttg gtcgataatt 120 cgagatgcct aacaatcgtg caaagaagtc aatccctaag ctgatcggag agggtgaaga 180 caaggttaac gacaaatccc ttagtacaaa gcaagttttt tctgatgtct ttacgcctca 240 tggttctaat gtatttaaag agagctatag tgatagtgat agtgacgtaa tgttagataa 300 ggtagcggat cttagtttag ataaagggga aagacttagg catgcagtct cgggcacaac 360 acctaaggtt cctatagttg ggttagagct accgaaagtc gaattgtgtt attttgatgg 420 aaagccgaag ggatattgga agtttataaa acaatacgat acctacgttg cggcaagagt 480 atctgatgat agtcaacgtt tgctgtattt gctccattat tgcaagggta aggctaaagc 540 agccatcgaa ggatgtgtaa tgatggaggc cacgtctggg tataaaagag ctagagatat 600 tttgaaacga ttattcggtc aagctcatgt tatagcaaga gagactttgg aggatttatt 660 caatgatgta aggcgtggtt gtcatgacgc cgagcaactc tcaagtttag caataaagat 720 ggaaaattgt agtatgattc tagagcaaat gaattacact gccgacttga attcgctggt 780 tacgttagag agagtagtaa gatttctgcc ccaacctatg caaagacaat gggccgaagt 840 ggtagatagt ttgacagaag aggacaggga gccaacgttc gccgaactca ctcagttcgt 900 agcagctaaa gcgagagtag ctgcgagtag gtttggacaa ttggctgagc gtccaagagg 960 cggagttact actaaatcat gttatcattc cgtagtgaga ccgaagaata caccgatcgc 1020 aagtgctaag tgtagtatgt gctcaggcga tcacgctgtt tacgagtgta gtcagttctt 1080 agctctcacg acagaagaac gtctgtcgca cgtcaaaggt aagagtatct gtttcgtatg 1140 tcttaaacaa gggcataaag ctattgaatg caaggtgacg agacgttgcg ccattgacgg 1200 atgttctggg agacatcact ctctgttgca taagggttcg gcaaagagta aatcagagga 1260 tagactggct acggtgaacc attgtggaca tgataggcca gtggacggtc acgtgtgtct 1320 gggaatgatt cccgtgaggt tgagatcggg aaacgctgag gttgtggggt atgccctttt 1380 ggataatggc tctgacgtaa ccttgatcag gtcaggatgt ttgaagttgt tggggctaaa 1440 cgaggaacag tcgtcggtgg ttgtacaaac cgtgagcggc aataaggcaa cacgagtgat 1500 aaagacacct tttgaggtgt attccttaga tcaaactgag cacgttaaaa ttcaaggagc 1560 cctggtagtg tcgcaaatac ctgggcataa gccgacgaaa acaatcatga gcagcctagt 1620 gaagtggccg catttgagcg atgtaccttt agaagttata gattctggcg aagttgtgtt 1680 actgattggt tgtgatgtcc cggaagccca ctgggtactc gatcaacggt taggtggaag 1740 aaaaaatcca tatgcagtaa aaacgttgct cgggtggacc gtgtttggac ctacatcatt 1800 ttcgggattg aagaaaaaag ttagtaattg tatgagtaag ctacagacgt tagaagatca 1860 attccgtaag ctttatgacg tagaattcgc tgatgtatat tcaagcgata aatcgccgtc 1920 agtagaggat cgagcagcaa tagaaatagt ggaaaggggt accttctacg atggtggtcg 1980 cttcgtagtt ccaataccat ggaaaatgta tccaaacaaa aaaacgggaa attatgaggt 2040 agcggccagt agattactta gtttgaagcg taggttgttg aaggatagca acctatacac 2100 taaatatgct aaaagtattg aaagtaatct tgctaaagga tatgcacaaa gggtccctga 2160 aatccagtta aggtcagatt atctccctcg ctggtatctg cctcatcacg cggtaataaa 2220 tcccaaaaag ccagagaaac tgagggtggt gctggactgt gctgctaaat tcgctggggt 2280 ttcactgaat gatatgattt atcaaggtcc tgacactacc gcggagttag tttgtatatt 2340 attgcgattc cgcaaggagg caattgccat ttgtgccgac gttgaagaaa tgttcatgca 2400 agtaaaggtc cctgaatccg atcaaggagc cttacgattc ctatggtggc aagagacaga 2460 catgtcgaaa gaaccatcgg agtttcaaat gaccgtccat ccattcggag caacgtcttc 2520 gccattctgt gcaaactttg ccttgatcaa gaccgctcaa acattctccg atggatttga 2580 tagctacata gtagaggcgg ttaagaacaa tttctatgtg gatgactgcc tagtatcttt 2640 ttccactagt aatcaagcaa aaaactttgt caagcaagta agtgaattac tgtgtaaagg 2700 cggttttgca ttaaagaagt ggataacaaa ctcggtagaa gttaggtccg ttttgcatgg 2760 ggtatgcaag gaagggcctg tgatggaaat gtctagagat tgtgatgtca ttcatcgtac 2820 cttgggggta caatgggatt gtgaaagaga tgttttccag tttcactttg acgctccaga 2880 aagaccgttg actaggagag gtattctatc tgtggcatct tctttatttg accctttggg 2940 tctaatttct ccagtttgtc tgacggtcaa acttctgctg caagagttat gtaagtccca 3000 aataggctgg gaccagccga tcaatgagcc ttatacgtcg atttggctaa actgggtgaa 3060 ctttatgcga cagataagtc acgttaaaat acctcgaggg atcaagaata aattagatga 3120 acctaatgca cgggtggaat tgcatttgtt tagtgatgcc tccgagattg gttatggagc 3180 agtagcttat gcacgagtca gttatttgaa cgaaccgccg tattgtattt tgttgtacag 3240 caagtccagg gtcgcaccta taaaaccagt cactgttccg aggcttgaga tggcagcagc 3300 agttttaagc gtaaggctaa gtgaagtgct acaaagaagc ttacccaatt ttttctgcga 3360 agtaaatttt catactgatt ctatgatagt gttgtattat attagaaata caggaaaccg 3420 atatagtacc tatattgcta atcgtcttgc tatcttacat cagcacacta aggttgagca 3480 atggacttac gtcaaatcgt cggagaaccc agcggattgg acttcgagag gtatacaaaa 3540 attggccgac cttgagtcgt ggattaagtg ccctacttta cagcaagccg agttcggtaa 3600 aactattacc gattgtccac aaactattcc ggacagtatt gagttcagaa agactgctgt 3660 ggttaatttg agtaaactga attatgacaa gttccctatt atttcttatt actcagactg 3720 gttaagatta gtcagggcag tagcttggct acgaaggttt atagaatatt ggatgatact 3780 tcattcatca agtcttgaag gatccgttca tttgggatgc ctgaagttcg aagagcttga 3840 aagtgccaag cgtaaggtct taatgatagt ccagaaagag gtatatggga gattactcag 3900 tgaatttaag aacaacagta atgtgaatag tcagaatgat ctgaagcgtt tatcgcctgt 3960 aatgcgcgac ggtttactat gtgttggagg tcgtctaaac tactccgatt accctgatga 4020 cttcaagtat ccagtaatat tacctagtcg tcacttagtg acagaaatga tagtcagaca 4080 ttatcataaa gaagaaggac actcaggaac ttctcaagta cttgccgcaa tccgaaaata 4140 ctattggata gtaaaaggga ccagcacagt caggagagtg ataggtaagt gtgtaatttg 4200 tcggcggtat acgacgaact caggacagca attgatggca ccgcttcccg catgtagagt 4260 gaaaccaggc tggcatagtt tctcaattgt aggggtggat tacttcggac ccattttagt 4320 aaaaagggga agatcactag aaaagaggta tggttgtata tttacttgtt tgcaaacacg 4380 tgcagtgcat attgaattgg cttatagttt gaataccgat tcatttgtga tggccctgtt 4440 acgttttatt ggaagaagag ggaagccctc agagatatat agtgataacg ggtcgaactt 4500 cgtgggtgca atatctgagc taaggaaata tgttcggcag tgggaccaac aaaggataag 4560 taatgaattg tcagcgaagc agattcagtg gcattttaac cctccgtcat ccagccacag 4620 agggggtgtg tgggaaagaa tgataagatc cgtacgtaga ctattattat tgatcaccag 4680 agaacaaacg ttgaatgatg aaactttagg cacctatttg gttgaaattg aaaggatatt 4740 gaacgatcga cccttaaccc cgattgtaca agatgctaac gacaagctag cattaacacc 4800 taacagtttg cttttgttga gggaatgcga cagtatagtt gacgaaagta gtatcagagt 4860 taattatgat aaacgctgga aacaggtgaa ttatctagct aacgtgtttt ggaaaaggtg 4920 gctgcgtgag tacataccat ccttacaagc tcgtcaaaaa tggttggttg agcgtcgtaa 4980 tttccagcca ggtgatgtag taattgtggt ttctgatatt ttcacccgcg gtaagtggcc 5040 tctaggggta atagagagtt gtgaaacaga caaggacgga aaggttagga cggtgtcagt 5100 acgtactaat aacgggtcta ttagaaggga tatacgtaag gtatgtctcc tagaagggtc 5160 caattagtta attgtcaaac aatgtagaca gaacgagtag gcgtactgtt gtaagtccta 5220 cccgttcttt tagtgaggaa agcgattagg tgaaccatgg caaaaaagac atcagaatag 5280 cttgtgcagt atggagggat tttgggggcc gg 5312 // ID Copia1-NVi_LTR repbase; DNA; INV; 257 BP. XX AC AAZX01001609; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1-NVi; KW Copia1-NVi_I; Copia1-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-257 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1139-1139 (2007). XX DR Genome; AAZX01001609; Positions 11615 11871. XX SQ Sequence 257 BP; 62 A; 72 C; 50 G; 73 T; 0 other; tgttagaatt tgcgcgtagc aacagaactt tcgctcttcg tcagtcccta aagcctaagc 60 gcatgcgcgc cgtcagcgcg ctcggtgaga gagactctcg tctctcagcc gcttacttct 120 ccgccgtctc tcaaagagag agagagagca ctcttaaaga gtaatcgttg ttcttaagtt 180 acaataaagc ttcggtgcgt tttcttatac acacatgttt catcatttta tagtctacct 240 caccacctcc tccaaca 257 // ID hATw-3_HM repbase; DNA; INV; 6400 BP. XX AC . XX DT 13-JAN-2009 (Rel. 14.02, Created) DT 13-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6400 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 420-420 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 836..3925 FT /product="hATw-3_HM_1p" FT /translation="MINESIYDWYLALKDTSYDAIYCRYCLDFKVELFCCK FT KALAKKINRLVDTIRKLKKQRKNKKLALLLGEAFFPPVANFSDNTVSASEI FT PKETILISENKMLKKENKILKRKMESMMDLQMDYFTHLKDFEDMSNNLKIL FT ISKNSLIEAELNFIKKCYCEKEEKLNNIENKLELLKSKKESFNVRNLNKKI FT KYRDTQLEIKKNKINSLKNEEKKKILQLKNKITDINSILENSDINISELKN FT EKIKLQKKVSNLKIILQNKTNYINDNNMKDVISLEKEVVSLYSKTNILKEE FT NLELQKLVSLLDDDEVITFEDGRYSDDIRETIMKLLSMNVSMNSVNEVIKV FT VLNKLAKKNISRLPSAAVKCRLMQEASILGQFQVAEAMLENNLINKNSKIG FT NCLHGDGTQKYHKHYQNFQITTTSGKTLSFGLSEVVGKDAATVLQNFTNIV FT DDICDVVSNSATEKEINFAKLITSIKSTMSDLSSVNPLFNSQLKTLREKLL FT PIAISNWDFLSSNVKDNMKDMSNFFCKLHLLANFASETDKVLNSFEKILLQ FT NDFETVFAFSGKECGAVRLVRTTCKAFHSRGSEEAGVASWFNSFLSSRNDK FT SYFVPFIGNRFNILYYNAAALYYHKDSITEFLNAWPNPNNLLKAIKEDITN FT LVFIAEVRALGIIDKLLTGPMWRIIESLDSILFLNPYLLRLKVRLSSLCID FT ASSLLFANGKIFEDTNLLHQDVVYTKLFECTQNDEFDSLTVQSLEIIFHAI FT LVILERQCADQLPGGKYWNPSNSVSEAASNVPVTNKASESDFAILDLMVRT FT KPNANVQTIQALTMWSQNQTLEWLNGKSDEERKQLLEYSRTKVTLMKSKYD FT ERKEALKKDKAMHLLAKQEEKNREEIKLQHKKVAACNDLINLNVRAWISLE FT EAEKTVFNIEEPLKSKVLLAQLNFYRIILGVECDKKLFCKTKVEGSKRINL FT SSEELYQNLLIVIQANLTPNKQSLSNNISNNLKPRTMRDAAVENKKKRING FT KNSGCKTKQINRATKKALFIKIY*" XX SQ Sequence 6400 BP; 2399 A; 805 C; 987 G; 2207 T; 2 other; tagggagttt tgtcacgtga cattttatcg cggaaaaaaa tcgttaatat atttttgaaa 60 aacttcgtcc aaaaaacgcc agaaatgatt ttatgtaggt tttgaagaat aaaaacccac 120 ccatgaaaat aaatattata tacgtacatg tagatgttat tttttttaga aaaggttgta 180 ctacaataat ttgtttgggg acgacgtttt ggtttagaat tgataaagtt atttagtttt 240 aaatctttca gttttgttaa aatgcaagta aattaaaata actctttcgt ccccacgtaa 300 aaataagttc tgcaactatt tatttattaa gtgaacaagt tttttttttt tttttttttt 360 gtagtctaat agggtgggtt ttttgttttc taagtaaaga ttattataat ttcccccgtc 420 tttagatatt tttggtaaat ttgccgtttc tcttttactt gagaaaagat ttttgcacac 480 gttgaattct atattttata tagctgagat tttattgtaa attttaactt aaggtattct 540 aaagtgagtt tgttttgtgt tgttaatgtg gttgttgcta accaattagt aattgaatct 600 cagcagttta tcagtaagtc taatttgtaa cttagcaatt ttctttttat acaatttact 660 taatttttaa gtttatatat atatatatat atatatatat atatatatat atatatatat 720 atatatatat atatatatat atatatakta aaagactttg aaatatatat atttatatgt 780 aaattatata tttgttcgaa ttgatgactg gaaaatcttt atttatttag ataaaatgat 840 aaatgaaagt atttatgatt ggtatttggc tttaaaagat acatcttatg atgcaatata 900 ttgtagatat tgtttggatt ttaaagttga actgttctgc tgcaagaaag ctttagccaa 960 aaagataaat agattagttg acaccatcag aaaattaaag aaacaacgaa aaaataaaaa 1020 attagcattg ttattaggtg aagccttttt tccacctgtg gcaaattttt ctgataatac 1080 agtgagtgca tcagaaattc ctaaagaaac aatactgata agcgaaaaca aaatgcttaa 1140 aaaagaaaat aaaattttaa aaaggaaaat ggaatctatg atggatttac aaatggatta 1200 ttttacacat ctaaaagatt ttgaagatat gtcaaacaat ctaaaaatac ttatttcgaa 1260 aaactctcta atagaagctg aattaaactt tattaaaaag tgttattgtg aaaaagaaga 1320 aaagcttaat aatattgaaa acaaattaga gctattaaag tcaaaaaaag aatcatttaa 1380 tgttagaaat ttgaataaaa aaataaaata tcgtgatact cagctggaaa taaaaaagaa 1440 taaaataaat tcgttaaaaa atgaagaaaa aaaaaaaata ttgcaattaa aaaataaaat 1500 aacagatatt aattcaattc ttgaaaactc agatataaac atttctgaat taaaaaacga 1560 aaaaataaag ctgcaaaaaa aagtgagtaa tttaaaaatt attttgcaaa ataaaaccaa 1620 ctatattaat gacaacaata tgaaagatgt tataagttta gagaaggaag ttgtttcgtt 1680 atatagtaag acaaacattt taaaagagga gaatctagaa cttcaaaaat tggtttcttt 1740 gctggatgat gacgaagtaa taacatttga agatggcaga tacagtgatg acattcgaga 1800 aacaataatg aaacttcttt caatgaacgt gagtatgaat agtgtcaatg aagtcataaa 1860 agtagtttta aacaaactag ccaaaaagaa catatctaga ctgccatcag ccgctgtaaa 1920 atgtaggtta atgcaggaag cttcaatttt aggtcaattt caagttgcag aagcaatgct 1980 tgaaaataat ttgataaata aaaactctaa aataggcaat tgtttgcatg gtgatggtac 2040 tcagaaatac cataagcatt atcaaaactt tcaaattact acaactagtg ggaaaacttt 2100 atcttttggt ctgtcagaag tggttggcaa agatgcagca actgttttgc aaaactttac 2160 taatatagta gatgacattt gtgacgttgt tagcaactca gccactgaaa aagaaattaa 2220 ttttgctaaa ctaataacat caataaaatc aacaatgtct gatctttctt cggttaatcc 2280 tctttttaac tcacagttaa aaacattaag ggaaaaactt ttgcctattg caattagtaa 2340 ctgggacttt ctttcctcaa acgtcaaaga caatatgaag gacatgagta attttttctg 2400 taaacttcat ttgctcgcta attttgcttc tgaaactgat aaagttttaa attcttttga 2460 aaaaattctg cttcaaaatg attttgaaac tgtttttgct tttagtggta aagaatgtgg 2520 tgctgtacgg ttagttcgta ctacatgtaa ggcatttcat tcaaggggta gtgaggaagc 2580 tggtgttgca tcatggttta attcttttct ttcctctcgt aacgataaat cttattttgt 2640 tccatttatt ggaaatcgat ttaatatttt gtattacaat gcagctgccc tatactatca 2700 taaggattct ataacagaat ttcttaatgc ttggcctaac cctaataatt tattaaaagc 2760 aattaaggaa gatatcacaa atcttgtatt cattgctgaa gtccgtgcat taggtattat 2820 tgacaaacta cttactggtc caatgtggag aattatcgaa tcacttgata gtatattatt 2880 tttaaacccc tacctacttc gcttgaaagt tagactttca tctctttgta tcgatgcgtc 2940 ttctttattg tttgctaatg gaaaaatttt tgaggatacc aatcttcttc atcaagatgt 3000 ggtttacacc aagctgtttg aatgcactca aaatgatgag tttgattctt taactgttca 3060 atcacttgaa attattttcc atgcgattct tgttattttg gaacgtcagt gtgctgatca 3120 gttaccaggt ggtaaatatt ggaatccatc taattctgtt tctgaagctg cttctaatgt 3180 ccctgtcact aataaagcat cagaaagcga ctttgccatt ttggatttaa tggttcgcac 3240 aaaacccaat gcaaatgttc aaactatcca ggctttgaca atgtggtctc agaaccaaac 3300 attagaatgg cttaatggta aatcagatga agaaagaaaa caattgcttg aatattcaag 3360 aactaaagta acattaatga aaagcaaata tgatgaaaga aaagaagctc ttaaaaaaga 3420 taaagctatg caccttctag caaagcaaga agaaaaaaat cgagaagaaa ttaaactaca 3480 acataaaaag gttgcagctt gtaatgattt gataaattta aatgtaaggg cttggatatc 3540 attagaggaa gcagagaaaa cagtttttaa tatagaagag ccattgaaat caaaagtact 3600 cctagctcaa cttaattttt atcgcataat cttaggtgtc gaatgtgaca aaaaactttt 3660 ttgtaaaaca aaggtagaag gaagcaaaag aataaacttg tcttcagagg aattatatca 3720 aaatcttctt attgttattc aagcaaattt aacaccaaac aaacaatctt taagtaataa 3780 tataagtaat aatttaaagc ctcgtacaat gcgtgatgca gctgttgaaa acaaaaaaaa 3840 aagaattaat ggaaaaaatt caggatgcaa gactaaacaa attaatagag caacaaaaaa 3900 agcactcttt atcaaaattt attaacaatc catctgattt tgtaggtttt tctatacaac 3960 atcgtgttag agaagaagat gctcatgaag tatcttggga acgtgcgcaa gtggttcaaa 4020 ttgatcaatt aaacggtaga agaactacat accttgttaa gtatgataat gaacctgatg 4080 aagtatggtc atttccttta ttaatagatt ttgaaaaagg tgatcttatt ctttgtagct 4140 aggattgcga ggttttattt aatgtaaatt tagaattatg ttacagtata tattaatctt 4200 atttctttta ttgttatgtt ccaatagttt tttttactac atttgagttc agtggttttt 4260 atctttgtaa atagtatgaa tcatatctga tatgtttatc tgcaccaatt ttgtttattt 4320 gttgttgttt atttgcagct taactgcaca ttattattat tattaaaaaa aaaaaaaaag 4380 cttatgcaaa aatactatta cttttacatg aactgcctat caagtaaagg aaagtatcca 4440 gaaatggagg gctaacctta ggctaattta ctattacttt tacatgaact gcctatcaag 4500 taaaggaaag aatccagaaa tggagggcta accttaggct aatttactat tacttttaca 4560 tgaactgcct atcaagtaaa ggaaagaatc cagaaatgga gggctaacct taggctaatt 4620 tactattact tttacatgaa ctgcctatca agtaaaggaa agaatccaga aatggagggc 4680 taaccttagg ctaatttact attactttta catgaactgc ctatcaagta aaggaaagaa 4740 tccagaaatg gagggctaac cttaggcaac atgacagctc acggatggag tgcaaagtcc 4800 tatggcttac cttctcgttg gtgcacttcg ttaagtaagt tagacaattt tgtctaattt 4860 atgaacggaa tagccaaggc aaattatgta taaaattgtt tgttaacaat acacacacag 4920 cttactctga agtttgaatt gaaattcgat aattaaaata ttttttcagt aaatttaccg 4980 atatgaagta tgctaagttt taaatgaatt ttgaaaagtg aaatttctga gtttgcaaca 5040 tatagtcaaa tttgggtaca ttaatagtgc gataatttat ttcaaagaca aaacgattta 5100 agacatctat taaaaagttt aagacaaata gatagataag ataatccgta taagaatttc 5160 catgactgtt aatttccatg actgttaatc tatgttaatt ccgtatctgt taatctatgt 5220 aatctcaatt tcttaataca agagggatgg ggttaataaa agtggatgga ggagtttagt 5280 caagatttaa tgaaccgggg gaggggaggg tcagaattta cactgaaagt tttattactt 5340 atcagtaact agtatgtatt ttttgatttt taaagctgat tttcactaat ttttttttag 5400 tttttagata taattttttt gttaaaacaa gtaaaaatta ataaacggga ggggggtctt 5460 aataagctta gtgtgggtgg caaagtttcc ccaaaattaa taaacgcccc cccycccttg 5520 gaattaagga ctcgagagta cacaatcaaa taaagccaaa cacaatgaaa tagaacaatt 5580 caacgagttt ttcattcact gttttcaaaa ttttaaaaaa agttaaaaac cttaacccca 5640 ttaaaaaata gaacattttt tatattatga taggtgctga aaacaataaa ttagtcgaat 5700 cagtagacta taagtggtta aatctttttt taaatctata acaatactga aattattaac 5760 aagtctttga ctaatttctt tatttgtttg tgtaaaaata aattttagat ttatgtttat 5820 tttgtacata ttttgacgtt ttcggcgtgt tttatttggg tccacgttgt tcaattgata 5880 atttattaaa gaaaaataaa attaatttaa gttaatattt atcatcaaag ttttgttacg 5940 ctcaacgtat tataaattaa aacatttgaa aatttatttt gtgtcaaaaa agtttaaaac 6000 atatttaaac atattataca gttatttgac tatttcatgg tacgtctgcg ttaacccccc 6060 aaaaaaattc tacttatgaa ttgttcaaat aatttagatt tacaatttat tttctaataa 6120 aattttatta taagttagtt tacgcaactt tttacgcaaa aaaaaaaaaa aaaaaaaaaa 6180 aaaaaatttt tttttcggtg ccaaaaattg aacgagatac tttaagaaaa tatttcataa 6240 aaagtctatg aattgaaaat attaaaacac gtatatcttt ggaactaaat aagctacaga 6300 ggtggttgaa accattttga aaatgaaata tattaaggaa tataatgcat ttaatataaa 6360 aaatatttac atttgctata aatattgcct aaactgccta 6400 // ID I_Ele24 repbase; DNA; INV; 7504 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele24. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7504 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7504 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >91% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 883..2619 FT /product="I_Ele24_1p" FT /translation="MATGLGGPPWGDYPPLSQDQGDSYHAGPTLPGFMDRD FT GFYGELRFLRIQGIDGRPLPEAPFMIRKSVHKSAGGKIEGAFPESGGKTYA FT LKVRQLTFYQRLLDMKKLNDGTPIEVIEHPGFNQTRCVVSCREVVNVDVNV FT LLEEMQEQGVKEIRRILRRAGGKQINTPTLIVTTRGTVRPEFLDFGFIRCR FT TRPYYPSPMQCFSCWAFGHTRSRCNSSPICGRCAQNHAFEPEHPCIAAKYC FT KKCESSDHGVGDRSCPEYIKEGDIQRIRVDSNLSYAAARRKYEEAHGTRSY FT ANMTAVPGFTSLNNQNDLASLHRKIDSLIATIEKKDQRIEQLEAALAQRSS FT TAATTSVPNVSSQTGDDMPTFMKLFLDRQEQMFGNTVKKMWESNLRMQKDI FT LELQSRSELPSDEQIYLPAPLQANTASLSTLPTRMIAATKSTPTASAVLPN FT STMESSEKSSPTPTNIKDKSNTDPLNNVTPNPVSPATKTIAIDDSSHDLNS FT DVSSNSSSSPNQNLLSHNSTPRPSPKTPVQSENIPKAPLRAVSPGSANKRT FT LRDISPDRVINAQTVSKQQRRSLPRSGHVKKS" FT CDS 2598..7109 FT /product="I_Ele24_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="KRPCQKVLIAFPAKNITQTMTDPCTTPEHNSNTANND FT KPFSNFAPDSRGTFGPEVKPQPEPQDRPRRSSCETNENLIRNSASGSRSTG FT RPEVMPPPERPERLRCPMDEDAINAKKNPTISPGSWRPDSADVLPTPELAN FT HLQRRREVEGGTDKGQPTYSPQDDEGSEALVIVGTHKGSNPRTPKDSVGSK FT SYAAHSNPRNSTPGHPEHLNSGGKSHHAIPHPIVSNPPAESSPSSSIDGSL FT SFAAPATNPLPIHLPANQVASFQRQSRHRYGSRPYRQTSYNRSRGSSISET FT YHPTPALPKPVNILLQWNINGFFNNLANLELLTSSTTPWCLALQEVNRVTT FT EQLSRSLRGQYRWALFRGTNLRHSVAVGVLASIPFEVLRLETDLPAVGVRL FT HGPFNISVVNIYIPCGAQPNLQEQLHRLLEATPDPVLLVGDMNAHHPIWGG FT QRTDNRGTSLLHVFEEHDLIVLNNGSNTFLKGNSASAIDLAAASRSCTNRF FT HWNVDSDLHGSDHYPIHIATNVAPPATTRRPRWLYEKADWDNYEQTFRNSS FT RGRESAHLQELTDIINKAAKSSIPRSSNQPGRKALHWWTDHTRIAVKARRK FT ALRAVKRIPIGDPSRENALNLYRSRHSECKKIIADAKRNSWENFLDSINAS FT QTSSALWTRVNALSGKRKTKPLTLRIENSHISDPSTVARILGEYFASLASV FT DSYTDSFKRRMQPTISSVPTFPVPDDPENLLINQPFSISELLFALKCCTGK FT SAGPDGIGYPLIKNLPPSGKLQLLELMNKYWLTDTFPPEWRESLVVPIPKS FT NAETHDPSKYRPIALTCCMCKVMERMVNRRLKQFLESQDLLDHRQHAFRAG FT HGTSTYFAQLGDVLHTAQSEGLHTEIVSLDLSKAFNRTWTPLVLRQLVEWK FT LSGHILRFVQNFLSERSFRVIVGDYLSDSFPEETGVPQGSVIAVTLFLIAM FT NSVFLTLPKGIYVFVYADDILIVVSGKTPVRVRIKAQAAVNAIFKWTTATG FT FELSAPKSVRCHVCPTNHRISVPISINGQRIPTKKTVKVLGVTIDRALTFQ FT HHFDLVKQSCRTRLNLIRTISRPHRSNNRNIRFRVAHAIVDSRLTYGLELT FT CIARERLITTLAPLYHGYIRIISGLLPSTPADSACTEAGLLPFRFFISAVI FT CKKTAAIIEKTIGDDRICLLTEGNSILRTAANVDFPPVARIHWNGDICWLS FT PKPNVDSTIGNQFRAGDNSIALRTTVLDWLRTKYPNHDHRYTDGSLSMRGV FT GLGVFGANISVSLSLPPLCSIFSAEAAAVYLAATTPSDRPIIVLTDSASVV FT SALQSNRPSHPWIQRTITDARPNTTFAWIPGHCGIAGNTAADRLAGAGFSC FT PRYEDNVPFDDVKRWLTKQFRSTWSMEWNQQNSPYLRKVKSSTERWKDLPL FT LKEQQLVSRLRTGHTRFSHNMSGNGSFRRMCSYCHIRNTVEHVICVCPLYE FT FSRNIHGISRSIREALGDDVAALEALLRFLKDSGLYNMI" XX SQ Sequence 7504 BP; 2051 A; 2027 C; 1613 G; 1812 T; 1 other; cagttagcat caagcatcgc ttgcgtgcgg acgtttttcg tttatcgctg gtcgatttta 60 ttgcgtcgat attcttttaa tagtttttta ttaactgtta cttttgtgcg tgtgtttatt 120 tttactacgg ggttccgtat tttaaagcgg ttgtgtttta cttgagaaac ataacttgtt 180 acgcgtaacg tcgggagtgt gaaaactacc gcgaaagtgt ctagaccaag ccgtctcaaa 240 tagaccggct ctaggagacc tctccatata gaaaaacgac cctctgggtc gaacgatacc 300 ccctcaatcc cagtgcttct tagggaggag ggggtgtgtg taacagaatc aggctagctg 360 tagccgtatc attcactgcc agtgtggtat agtggcttaa aaagtgcaaa aatagcaaac 420 tgcttcaact ttgaccgata gtgaacttac agattgcgcg tgtgtatctt ttcgcacaaa 480 ggttgttttt acatatcggt ggctcatcat aaccaccgcc caaagaggaa tacgaaagaa 540 gaccagtgag tttgacgtgg atagatagag tttgctcact ctcaagccgt ggtgcgcctt 600 tccccagagg ggccactctg ataggggaaa aagtatcaaa acgattcgaa aagtgtgatt 660 ttgtgaagtg gtgaaaatct actttttccg aaggaaaata atccgttgaa aaagcgtttg 720 aaagaataca tctccgccaa ctttcaagct tgtgttgtgt tgtgcttatc ccagaaaagc 780 cttcaacgcc tacatcgagg aaagatcata cgggaacgcc gtcagaacat tcatcggtga 840 gttactgggg ctttatccgg gtttttcttg ggggtcgcct tgatggcgac gggtctgggg 900 ggccctcctt ggggggatta cccccctttg tcacaagacc agggcgatag ttatcacgct 960 gggccaactc tgcctggttt tatggatcgg gatggatttt atggggaatt acgtttcctt 1020 cgcattcagg gtatcgatgg gagaccttta ccagaggccc ctttcatgat aagaaaatct 1080 gtacacaaat cagcaggcgg aaagattgaa ggagcctttc ctgaatcagg aggtaaaaca 1140 tacgccctta aggttcggca actgaccttc taccaaaggc tgctcgacat gaaaaaactc 1200 aacgatggaa cacccataga agtaattgag cacccaggct ttaaccaaac gcgctgtgtt 1260 gtttcttgcc gtgaagtggt gaacgtggat gttaacgtgc ttctggaaga aatgcaggaa 1320 caaggcgtga aagaaattcg gaggattctc agaagggccg gcggaaaaca aatcaatacc 1380 cctactctta ttgtcacaac cagaggaaca gttcgacctg aattcttgga cttcggtttt 1440 attcgttgtc gtactcgccc ctattaccca tcacccatgc aatgtttttc ttgctgggct 1500 ttcggtcata ctcgatctcg ttgcaattcc tctcccatct gtggaagatg tgcacaaaat 1560 catgctttcg aacctgaaca cccgtgcata gctgcaaaat attgtaaaaa atgcgaatcc 1620 agtgatcatg gtgttggcga tcgttcatgt ccggagtata tcaaggaagg cgatattcaa 1680 cgcatacgtg tagactccaa cttatcctat gctgctgccc gtcggaagta tgaggaggcc 1740 catggtacgc gctcttacgc caacatgaca gcggttcccg gtttcaccag tctcaacaac 1800 caaaatgatc tagccagtct tcaccgcaaa atcgactctc ttatagctac catcgaaaaa 1860 aaggatcagc gcatagaaca gcttgaagct gcgcttgcgc aacgctcgtc tactgctgcc 1920 accacctcag taccaaatgt ttcttcccaa actggagacg atatgccaac ttttatgaag 1980 ctcttcctag atagacaaga gcagatgttc ggaaatacag tcaaaaaaat gtgggaaagt 2040 aatcttagaa tgcaaaagga catccttgaa ctacaatcac ggtctgaact tccttccgat 2100 gagcagattt atttgcctgc ccctctccaa gcaaacactg cttccctctc aacactacca 2160 actagaatga tcgctgcaac caaatccaca cctactgcct ccgcagttct gccaaatagc 2220 accatggagt ccagtgaaaa atcttcccca actccaacca acatcaagga caaatctaac 2280 accgaccctc tcaacaacgt aacccccaat ccagtaagcc ctgccactaa aaccatagca 2340 atcgatgatt cctctcacga tctaaactct gatgtctcgt ccaactcaag tagcagccct 2400 aatcaaaacc tactcagcca taacagtacc cccagaccct cccctaaaac ccctgtacag 2460 agcgaaaata tacccaaagc cccactgcgt gcggtatctc ctggttctgc caacaagcga 2520 accttaagag acatctcacc cgacagagtt ataaacgcac aaaccgtttc gaaacaacag 2580 cggcggtcac ttcctagaag cggccatgtc aaaaagtctt aattgctttc cccgcaaaaa 2640 acataactca gacgatgacc gatccctgta caacgcccga acacaacagc aataccgcca 2700 acaacgataa gcctttttcg aactttgcac cggacagtcg aggcaccttt ggaccggaag 2760 taaaacccca accggaacct caggaccgac ctcggcgttc gagctgcgag acgaacgaaa 2820 atctaatcag aaacagtgcc tcgggtagcc ggagcaccgg aagaccggaa gtcatgcccc 2880 caccggaacg gccggagcga cttcggtgcc cgatggacga agatgcaatt aatgcgaaga 2940 aaaatccaac gatatcccct ggtagctgga gacccgacag tgcggacgtc cttcccacac 3000 cggaactggc gaaccacctc cagcgccgga gggaagttga aggagggacg gacaaaggtc 3060 aaccaaccta ttcccctcag gacgacgagg gaagtgaagc cttagttatt gtcgggacgc 3120 acaaggggag taatcctcgt accccaaagg actccgtggg aagcaagtcg tatgcagctc 3180 acagcaaccc tcgaaacagc actcctgggc accccgagca cctcaactcg ggcggtaagt 3240 cccatcacgc tattccccat cccattgtat caaatccccc agcggaaagt tcaccgtcat 3300 cttccatcga tggatctctg tctttcgctg ctcccgcaac gaatccccta ccaattcatt 3360 tacctgcaaa tcaggtcgca tcgttccaac ggcagtccag gcaccgttat ggcagcagac 3420 cctacagaca aacttcgtac aatcgtagcc gtggctcatc gatttccgaa acctatcatc 3480 ctacgcctgc actcccaaag ccagtcaaca tcttacttca gtggaatatc aacggttttt 3540 ttaataacct cgcgaacctg gagcttctca caagttcaac tacaccatgg tgcttagctc 3600 tgcaagaagt gaacagggtc acaaccgaac agcttagtcg atctcttcgt ggccaatacc 3660 gttgggccct ctttcgaggt accaacctaa gacattccgt tgcagtcgga gtcttagcct 3720 caattccttt tgaagtcctt aggctcgaaa ccgaccttcc agccgttgga gtgcgtctcc 3780 atggaccatt caacatcagt gtagtaaaca tctatattcc ctgtggtgcc caaccaaatc 3840 ttcaagaaca gctccataga ctccttgaag caacaccaga tccagttctg ctagttggtg 3900 acatgaatgc ccaccatcca atatggggcg ggcaacgcac tgataatcga ggaacctctc 3960 tactacatgt cttcgaagaa cacgatctca ttgtcctcaa caatggatca aatacctttc 4020 tcaaaggaaa ctccgcatcg gctattgacc tggctgcggc tagccggtcc tgtaccaatc 4080 gattccattg gaatgttgac tccgaccttc atggtagtga tcattaccct attcacattg 4140 ctaccaacgt cgctccaccg gcgaccactc gtcgtccaag atggttatat gaaaaagccg 4200 attgggataa ctacgaacaa actttccgga attcatctag gggtcgcgaa tctgcccatt 4260 tgcaagagct cactgacatt attaacaaag ctgccaagtc ctccataccc cgcagcagta 4320 accaaccggg tcgcaaagca ctacattggt ggaccgatca cactcggata gcggtaaagg 4380 ccagaaggaa agcattacgc gccgtcaaac gcattcccat aggcgatcct agcagggaga 4440 atgctttgaa cttataccga agccggcaca gtgaatgcaa aaagatcatt gcagacgcca 4500 aacgcaatag ttgggagaac tttctggata gcataaacgc atcacaaact tcatctgccc 4560 tctggactcg agtaaatgcc ctcagtggca aaagaaagac aaagcctctg actctccgta 4620 ttgaaaactc gcacatctct gacccctcaa cggtagctag aatcctcggc gaatacttcg 4680 ccagcttggc tagtgtagat agctatactg atagcttcaa gcgtcgtatg cagcccacca 4740 tcagctcagt ccctaccttt cccgttcctg atgacccaga aaacttgttg ataaaccaac 4800 cattttcaat tagtgagctt cttttcgcac tcaaatgttg taccggaaaa tctgccggcc 4860 cggacggcat tggttaccca cttataaaaa accttcctcc ttcgggtaag ttgcagctat 4920 tagaacttat gaacaaatat tggctcactg acaccttccc cccggaatgg cgggaaagtc 4980 tcgtggttcc cattccaaag agtaatgcag aaacccacga cccatccaaa taccgaccga 5040 ttgctctaac ctgctgcatg tgcaaggtga tggaacggat ggttaaccgg aggctgaaac 5100 aatttttgga gtcccaagac ttgttggatc atcggcaaca cgccttccga gctgggcatg 5160 gaacaagcac ttactttgcc caacttggcg acgtcttaca cacagcgcag tcggaaggct 5220 tacacaccga aattgtctca ttggacctgt ccaaggcctt caatcgaaca tggaccccgc 5280 tagtgctacg tcagctagtt gaatggaaac tatcgggtca cattctgcga tttgtccaaa 5340 acttcttgtc tgagcgatct ttccgagtca ttgttggaga ctacctatcc gactcctttc 5400 ccgaagagac cggcgtgcca caggggtccg ttatagctgt cacactattc ttgatagcaa 5460 tgaacagcgt ttttcttacc ttaccgaaag gmatctatgt attcgtatac gctgacgaca 5520 tactcattgt cgtctccggc aaaaccccag tccgcgtcag gatcaaggca caggcggcag 5580 ttaacgcgat cttcaaatgg accactgcaa ctggtttcga gctgtccgcc cctaaaagtg 5640 taagatgcca cgtctgtccc actaaccacc gcatatccgt acccataagt atcaacggtc 5700 aacgcatccc cacgaagaaa acggtgaaag tgctcggcgt cactattgac cgagcgctca 5760 ctttccagca tcactttgat ttggtcaagc aaagctgccg cacgcgcctc aacttgattc 5820 gaacaatctc tcgccctcac cgctcgaaca accgtaatat ccgtttccgc gttgcgcatg 5880 ccattgttga cagccgatta acatatggcc ttgagctaac gtgcattgca agggaacgat 5940 taataacaac actcgcacca ctgtaccatg gatacattcg catcatatcc ggacttcttc 6000 catccacgcc tgctgactct gcttgcactg aagctggcct ccttcctttc cgcttcttta 6060 tatccgcagt gatttgcaaa aaaaccgctg ccattatcga aaaaacaatc ggggacgaca 6120 ggatctgcct cctcactgag gggaacagta tcctccgcac tgctgccaat gtagatttcc 6180 ccccagtggc caggatccac tggaatggag acatttgttg gctttcacct aaacctaacg 6240 tagatagcac gattggaaac cagtttcgtg ctggggacaa ctcaatagct cttcgaacaa 6300 ccgttctgga ctggctacgc acaaaatatc caaatcatga tcacagatac accgacggtt 6360 ctctttccat gagaggcgtc ggccttggcg tcttcggcgc taatatttca gttagcctta 6420 gtctgccacc actttgctcg atattctcag cggaagccgc agccgtgtat ttagcagcca 6480 ctacaccatc cgaccgacca ataattgttc ttacggattc agctagcgta gtatccgcat 6540 tgcagtcaaa caggccttca cacccttgga ttcagcgcac gatcaccgat gctcgaccca 6600 acaccacctt tgcatggatc cccgggcatt gtggtatagc cggcaacaca gctgccgatc 6660 gtctagctgg tgctggtttt tcctgccccc gctacgaaga taatgtaccc ttcgatgatg 6720 ttaaaaggtg gctaacaaag caattcagga gtacatggag tatggaatgg aatcagcaaa 6780 attcaccgta cctacgaaag gtgaaatcgt ccaccgaacg gtggaaagac cttcctcttt 6840 tgaaggaaca acagttggta tctcggctga ggacggggca cacgcgtttt tcacacaaca 6900 tgagtggcaa tggatctttc cgccggatgt gttcctattg tcacataagg aataccgtag 6960 aacacgtaat ctgcgtctgc ccattatacg agttttccag gaatattcat gggatctcaa 7020 gaagcattcg tgaagccctg ggagatgacg tagcagccct ggaagcccta ttgcgttttt 7080 tgaaagattc tggcctatac aacatgatct gacgcccacg ataacaaact gctacaagta 7140 tcactttttc gtgtccgtcc tattagtcca ccagctacac aaaccgccct tggtttaagg 7200 aaactttttt agaataacaa attaacagtg tacagtattt atactggtag cattcttaga 7260 ataacaaatt acaaaatgtt ttgacgtagg acttgacata ctggcaaacc ttggaagctg 7320 tggttcggcg agaccccttt ggttgggctc gagtagtgat cccttctggg tcacttctac 7380 ctttttagtg ttgcccctct ggcacacccc ttttcgaggc accgttctgg tgttctctaa 7440 tccgacaaag tgttgaacta gcgtcagtta aaaaacactc taataaagac aaaaaaaaaa 7500 aaaa 7504 // ID BEL-10_DPu-LTR repbase; DNA; INV; 500 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE BEL-type retrotransposon from Daphnia (long terminal repeat). XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_DPu-I; KW BEL-10_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-500 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC ~97% identical to consensus. XX SQ Sequence 500 BP; 99 A; 119 C; 106 G; 176 T; 0 other; tggttcggcg gatgccgttt agcctttttt gttaatttta atggcgccac ttgtctaccg 60 ctgtttagcc ctatttcccc ctttcccctg tcaatctaag tcgtcttctt tgtgatctcg 120 tcctttccaa tttttctgaa ctgaagtgac tcttgccctg acaggtattt taagttgtat 180 tcgcgttttg aatattgtga ccaaattgtt ctgacgtgca gtctcagacg tcgtcgacac 240 atgcaatttg tgacaaggtg gcaattccca cacctactcc caggttgtgt gtgtgttgtg 300 ccctatatgt gcatgagtgt gtccaggacg gaccactctt atttccattt tcccatcgaa 360 gactggtccc caggtattga tgatgacgga tgcctgatta tgtgtcgatg ttctgattct 420 gacagtgact tgaagttaca cgacggacat tgattgcttt aaatacagtc cacttctact 480 tgtccgacct cgagttatca 500 // ID Gypsy-30_DPu-LTR repbase; DNA; INV; 131 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_DP_; KW Gypsy-30_DPu-I; Gypsy-30_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-131 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 131 BP; 42 A; 29 C; 32 G; 28 T; 0 other; tgttgtaatg tatcgcactt cagctagcaa ccaccagatg tggccactat cggtgagtca 60 gcatgaggtg gtgatagcaa acacagagac tgaatacaga cagttacgga ctagtactcg 120 agactacaac a 131 // ID BEL-65_AA-LTR repbase; DNA; INV; 268 BP. XX AC supercont1.275; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-65_AA_; KW BEL-65_AA-I; BEL-65_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-268 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.275; Positions 742264 742531. XX SQ Sequence 268 BP; 72 A; 60 C; 52 G; 84 T; 0 other; tgttccggct tcgtatcgcc cctcgtacga tcgccgtatt attttctatg ctatgacttg 60 tcactaatat attttccact gggactgaaa aagcgcggta accaccgctt gtgaaaagaa 120 aaaaaagagt tagtttgtag tctgtcatca acgtgaataa acgtttttgt atttgctttc 180 cgaagtcgcg tgcaatttta cttatatttc cggaaaaccc gcagtccaag tcgcgtagtg 240 aatttacagt ccattccatg ccgaaaca 268 // ID hAT-10_SM repbase; DNA; INV; 3324 BP. XX AC . XX DT 23-OCT-2007 (Rel. 12.1, Created) DT 23-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-10_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3324 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1042-1042 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 768..3038 FT /product="hAT-10_SM_1p" FT /translation="MSSMAQATRSQNQLYLLGDSVTVLTGSKLPSLRMTLG FT FFLHHHLELKETIRQSSAVTITEISKFWVKARIPMREHQNCQTKLEKTFEE FT WRLLKKNKTRASPTQLSRESAFVSRLDDLFDVAHADTLNMKSVLQEDKDFL FT LAQREKGRRGSMAGVDEKLAGKEKRASERDKKMLAKRQQMIEMKQFADSTV FT ELVSSGSENSSTEDTEKMETENDACEEAVGSIAFRTPKRKRGRKTVVTPEL FT AAALDRTKVSDRKAVFVIAETAKSLGHNIDELALNRDSIRRHRVEHRVQRS FT ANIKAEFQGNVPLVVHWDGKLIPDLIGKEKVDRLPVLVSGKGVSQLLTVAK FT LPSGTGEAEAAAVFGAIEDWGIANSIRAMCFDTTSSNTGRISGACVLLEQK FT LKKELLSLACRHHVMELIIGAVFQVCMGASSSPVVQLFKRFQEYWAIIETD FT KYEPGIAADDVANLVEDIRQSTVDFAIKHLEQSQPRDDYKEFLELVIVFLG FT AAPARGVRFMLPGAMHHARWMSKVIYSLKIWMFKAQFKLTSAEERGLRDVC FT VFAVRIYLKAWISAPQASDAPYNDFLLLKSLLEYSSIHSAISKQTSQKFSN FT HLWYLSQELVSLAFFDDHVNSSTKRLMVSAMQNEEEQDKDHSKRITVALDS FT FKTRNLEDFVTAKSMTLLRMLELPHGFLAVDPDLWEDRDDFRQAKETVKSL FT KVVNDHAERGVALIQEYSGLLTHDESQLQFLLQVVEDHRRMYPDSRKQTMS FT GVPHKQ" XX SQ Sequence 3324 BP; 1026 A; 636 C; 702 G; 960 T; 0 other; gggtggagct tattttttaa cttttgaaat atgatcttct taccctccaa ttttgttcct 60 atttatgaaa taataattca gacaacatat gagcacaatt gaacaatatt tagaggtagc 120 tcaatgatgg tgaagtttcc agcactctgg tcgtcagact cggacgggtc cataattgta 180 ttcttccagt gtggttaact ggttatgggg ctttaatcat ttttaatcct tacatcataa 240 caatcaataa tagctggaat tgttcattca aacaaaataa aactaaactt atgatattac 300 ataattatca ttattatcac acaatttttg catcagctta gtgtttttta ttttctataa 360 cttttgccca tgatgcatgt aggtttgctt acttattcac aggttagagg cagtgcaata 420 gactaccaaa gctgacagcc aatcagccat tttgcagaag tttgctttta agcagaagaa 480 atacaatttt tgtttttatt gattattatt tattaatgat tgaaatagtg agaaattcgt 540 aaatttgatt actttatcat caacactaca ctacaacaaa acaggtattt gaaattattt 600 ttaaatttat acaatgtagt cagcagtatg gcaattcaga atgtttgctt gattgagggc 660 ggcattaatg tttcatgttt aaaatggtaa aattcgtgat tccaaatgcc aaatgattga 720 taaaaatgcc ttttctaatt ttttttcaga ctaattgctg ataaaaaatg tcatctatgg 780 cacaagcaac aagatctcag aaccagttgt atcttcttgg ggactctgtg actgtattaa 840 caggcagcaa gttgccgtca ttgcgtatga ctcttgggtt cttcctacat caccatctcg 900 aactgaagga aaccataaga caatcgtcgg ctgttaccat cactgaaata tcaaagtttt 960 gggtgaaagc tagaatccca atgagagaac atcaaaactg tcaaaccaag ttagagaaga 1020 catttgagga atggcgtctt ctcaagaaga acaaaacacg agcttcacca actcaactct 1080 caagagagtc ggcttttgtt tccagactcg atgatctttt tgatgtcgct cacgccgata 1140 ctctgaacat gaaatcagtt ttgcaagaag ataaagattt cttattagca cagcgggaga 1200 aaggaagaag agggtctatg gctggggtag atgaaaagct tgctggcaag gagaaaaggg 1260 cgtctgaaag agataagaaa atgcttgcca agcgacagca aatgattgaa atgaaacagt 1320 ttgcagattc tacagtggaa ttggtttcat ctggttcaga aaacagttca acagaagaca 1380 cagagaagat ggaaactgag aatgatgcat gtgaggaggc tgttggaagc atagcattta 1440 ggacaccaaa gagaaaaagg ggacgaaaaa ctgttgtcac tcctgaactc gcggcagcac 1500 tggatcgaac aaaagtttca gaccgcaagg ccgttttcgt tatagcagag accgccaaaa 1560 gcttgggaca taacattgat gaacttgctc taaacagaga ttcaattcga cgacacagag 1620 ttgaacacag agtccagaga tcagcaaaca taaaggctga gtttcagggc aacgttcccc 1680 ttgttgttca ctgggacggg aaacttatcc ctgacctaat tggaaaagag aaggtcgatc 1740 gcctgccagt gttggtgtca ggaaaaggag tttctcagct actcacagtt gccaagcttc 1800 catccggaac aggcgaggct gaggctgctg ctgtttttgg agccattgaa gactggggca 1860 tagccaacag tatccgagct atgtgttttg acacgaccag ctcaaacact ggtcggatat 1920 ctggtgcatg tgtcctccta gagcagaaac ttaagaagga gttgttgtcg ctggcttgca 1980 ggcatcacgt tatggaactg attattggtg cagtcttcca agtctgtatg ggagcctcct 2040 cctctcccgt ggttcaactc ttcaaacgct ttcaggaata ctgggcaatc atcgaaacgg 2100 acaaatatga accagggatt gcagcagacg acgtggcaaa tctagtggaa gacattaggc 2160 aaagcaccgt cgattttgcc atcaagcatc ttgaacagag ccagccgaga gacgattata 2220 aagaattctt ggaactcgtg attgtctttc tcggtgctgc tcctgccaga ggagtccgat 2280 tcatgttgcc cggagcaatg catcatgctc ggtggatgag caaagtcata tacagtttaa 2340 agatctggat gttcaaggct caattcaaac ttacttctgc agaagaaaga ggattacgcg 2400 atgtgtgtgt gtttgctgta cgcatctacc tgaaagcctg gatctctgca cctcaagcat 2460 ccgatgctcc ttacaatgac tttctgctgc tcaagtctct gcttgaatac tcgtctatcc 2520 attctgcaat ctcaaagcaa acatcacaaa agttttccaa ccatttgtgg tacctgtcac 2580 aggaacttgt tagcttggct ttcttcgacg atcacgttaa ttcgtcaacc aagagactga 2640 tggtgagtgc aatgcagaat gaagaagagc aagataagga ccattcaaag agaattacag 2700 ttgctctcga ttcctttaag accaggaact tggaagattt tgtcacagct aagtcaatga 2760 ccctgctccg aatgctagaa cttccacatg gattcctcgc ggttgatcct gatttgtggg 2820 aggacagaga tgacttcagg caagctaagg aaacagtcaa gtcactgaaa gtagtaaatg 2880 atcatgctga acgtggagtt gctttaattc aggaatacag cggcttgtta actcacgatg 2940 aatcgcagtt acaattcctc ctgcaagttg tcgaggatca tcgtcgaatg taccctgaca 3000 gcaggaaaca aactatgtct ggagtaccac ataaacaata aatttggaaa ttgttgaact 3060 gtctccctga actctgccca aaactgaact gagattgcag tctaattgtg ttgtaaaaat 3120 taatactggt ttgttataaa taaattgaga atttatgtaa ttttcataat ttttgtaaat 3180 ttgaagcact tgagctaccc ctaaacacaa gaatttatta ttcaaacttt ttatatagtg 3240 tttctatatt ataaggaaca gattaggagg gtaagacttt caaaaatacc tttttttatt 3300 tttttttatt tttaagctcc accc 3324 // ID DNA8-89_AP repbase; DNA; INV; 375 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-89_AP. XX NM DNA8-89_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-375 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2025-2025 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 375 BP; 120 A; 72 C; 60 G; 123 T; 0 other; cataggcgca atttgggtta aatttatggg ggtgctagat ttatgtcgac taaagtgatc 60 agtgggggct ttgccccccg ccccccccct ccattcatac aacttaattt tatacacctt 120 cgcattaatc ttaatataaa caattaccta taacataata cacttttatc tatattatta 180 ttctgatagg tacctaatat tatgtattat tattattatg aaaatgataa ggtcttagaa 240 ataaatgtat ccaaatacct atggcataat attatagtca ataaaaattc gatgatgcta 300 tcaaaaattt gggggtgcta atatgaattt tgggggtgct aagcaccccc aagcaccccc 360 caaattgcgc ctatg 375 // ID Gypsy-81_CQ-LTR repbase; DNA; INV; 1289 BP. XX AC AAWU01003442; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-81_CQ_; KW Gypsy-81_CQ-I; Gypsy-81_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 542-542 (2011). XX DR Genome; AAWU01003442; Positions 3200 4488. XX SQ Sequence 1289 BP; 313 A; 308 C; 390 G; 278 T; 0 other; tgtgagtgtt gtgtacaata attgatgtga ctcgtcgcga attttgtgtt aaattattta 60 aatgtaaatt attgtgccaa gaagacgttg tgtttgtcga cgggaagtta ccaggacgcc 120 gtccagcaag caagaaaccc gaagctggaa gtcccacacc gtgcgtcgtt accgctagtc 180 accaccagga gcgctgctgc ataacatcgc atgggcggcc attgggggtg gtgaatgtgc 240 gtaatccttg aggatgacag gatgaggagg agatgggttg gcgcgtaaga ggtcaggggt 300 tcgggatttt ggtctgaccg ttgctcagtc attaagtccg gatatcgaga agtggaaggt 360 tgtcggcaag tgcaaccgtt acccgcgaag accagtgttg gaaaagttaa tccttggagg 420 atcgtgaagt gctcggaaga gctacatccg tggccgtgac aagagagcag gagtttcccg 480 taggagattt ggacatccag gtatgaccgg aaaaccccag gaaaatcccc gaacaaaccc 540 ctagcctgac aataccctaa ccgccatttt gaatccctca tccaggaccc ctttgtgttc 600 ctttgtgttc ctttgttcac ttcgtgaaca ccgcttggtt tcaccccggc cgtacgacat 660 ccggcccggt gaaccatctg tacgagccgt cagcgtttcg ctggacagtg ctcaaccgcc 720 aggccgttac cgttataccc gcgaacccta accccggact gttccgacgg agtccccgga 780 caccgtcgaa gcctgttgca cgtcaccaag cacaggaagg gcgcgaacgg ctggcgaggg 840 tccgtgaagg ctgcagccga aacgtcggag ggtccatccg actgacgaag aagccacccc 900 cttggcagag cgccggagga gcgtcgacga cagcaggaag gagcagcgac gacggcagag 960 cggcgagaac ggggccccga acgcgtgcac cggaagtgcg aagagaggaa gaggacgtgt 1020 gaggtaggag tatttataag gaagtgggac gggacgtgtg agctagtgga agtttggaag 1080 ggattcccgg aataaatcgt cgactgtgca cgaaataaaa gtagtttccc ttaggtaaat 1140 gcagtgtttt cttggttctt gagtaatttc gagcgtcacg ccgaccctgg gtattataag 1200 ggagagtctc atcggttttg tacgccctga caccggaagc acatctctta gagtgtgtgt 1260 tcacttcgag tatggactgg gagtgaaca 1289 // ID Gypsy-12_OD-I repbase; DNA; INV; 14379 BP. XX AC CABV01000024; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_OD_; KW Gypsy-12_OD-LTR; Gypsy-12_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-14379 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000024; Positions 4200 18578. XX CC Positions [3062-3535] - Reverse transcriptase CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 158..5968 FT /product="Gypsy-12_OD-I_1p" FT /translation="MSGSDAIASAFARANARSYEPYFSLSDSQKPPKLNAD FT TPSALAKLIYDAWANHFERYVSKAISSRDSKNPYSQYSLDLPCLWEAPPLG FT REMPNWAGNCGIPSSLAEFQQWKDANPRRYKGKGLIEFLIVVDGIPRPNLF FT FNGTGDEEGAPDLQSDPLEIQNETNRIFLNFFFPGRVLFEGWNEPENLAYV FT RSAVKRAGETFITQCRPDHKIYVQRGELASPRAYIQDLLKGVDETEKKPEF FT FRWHALMRTLTDSTYDGFTVKSKMDLVRQVITKCLGNYSRPGENVTIGHRF FT DRSVFDAIETWTPFLELTSFLCSILCKSEKDLVEVLEALAAKVNCEFTSID FT LRTVLNNQKFIISKIERKEEEELALICHERWEMAYRVDPEKPMRAYGEVVA FT AEARAVKDSEKRDKFKSFGDRKFGDNKKQTKRFKPEGKSSKKKRFHANSVN FT LAEDLKVDANSGTVWQERKDLKMNQQEVARVAPNKVMWMNIQKPAQKFATR FT LQKSARDLHKKSWSSKKPRKGDYRVIELDEIKRELDFVARQFKEEEGGWYM FT TQSMGGLENESGPTGEDVDTTSETSSEDEEQEDSDSSSEQGLSVEKQAENN FT HAFRLIRSAKEKHAVIEETTKITIKFDYLSQSKNNSENKIRNRHSYTLVFD FT TGASSTLVPKRFFIQLQESSKVVCRKTVEHAGAAAGGGTIELLPFRVSFAL FT KVGALKLNITDAAVHANTNDQSTILLGLRDWYPNGIEFKKDKNGSHKILIE FT DVPLEEFWQFTEAVDSKLQSWRFLSESFLKITSESLIAGVSVKSDIDPNDE FT ISSLVFNMKPTDKFYPNGMSHFWRKIEKIHQENKETNTVEDVVIDEFGDVL FT DLSKPEDRSFKRKIKRLLLKYSTVVFSKTQGVVPGAEFVVKGTIKGNTTGK FT FSQEPYERMPPAQVKLITDKMNSELAEGILRPLGNEMTARNYVRIFPVAKK FT NLPGMTVEEKVANVRLVLDCSRNINQGTTFANHEMDSIRDVIQRMAPYTKT FT GLIGSLDISSMFFSFRLDRELWEDFCVRHPEIGDCAITRLPMGWVSSPSYA FT RDFLSRIYYPCNNFLVKYVDDFVFCADSREDFLKNLEMILQITHNMNLRFS FT GKKVSLLSKSLKVLGKRLKDGKITVDAHTVQKIVDEEVDKLKTVRQMRVIL FT GRIAFIADAIPSRVEITAKLAEMVGSKKSTERLEWTPQLIDDFELLKKTIS FT NALVDLYPVIPGLETILVVDSSYISTGGFLLQIKDNKKRLIRLFSRKRSDY FT NNQISMSSCLLEICGLVAAAKYFKVEMQIAAVTTRIYTDSISVEKLFNRLL FT KGESLCDDLRINTHLLDLINYDIEIIYLSNKSEYIVLADHLSRRESLSTKC FT EGQCKVCEAADAPLLNSKTLMLTQDTMKANEFVLINDLEPVGSVEQFELAQ FT TIKDEYLWWNEHKATFSHARFAPLEGELLSFKVGRSNPAILNDFPELKGMS FT LKQFLNNKELLNRIQMRCKKLRAVFKAKEDLILPNARNRPGETARWRNEVI FT DGVVVRKRNFGIRQNYVIVIPERMTNWVVQKIHDEKGCSSQTAMLNEAKAC FT MEVPLVRDAIKAITSRCRNCSFLRNVPNIATQLKEYNDFNPQFAGNVVSWD FT QLTRHSEKANKEFKFWLIVDHLTSFAKLIPVEGRSNTENNRKALIRAIGTI FT KSDMDGNTKVITDGATINMALVNDFELKSNNIEVVITDQLTRSKNNIAVLD FT SRSQKLTKFLVLALNQWTDPWQIAEKVNFDHNKLRSPHGFSPQELLSSRCQ FT STGDKITVNWNHLKDRIKTARAISRKANNKMVKKGFVRDPMLFTPFEESDK FT IYGGTIRTPIKLGDIIILSEAYNKNKSRIFWQVTKSPEMPEGIDFDNQIVS FT TVKMDLRKIQPSSRKVWSFNAIKLVIDGEKDFGNGKPEGRNELPEKFSCNF FT TQLNTTEIEAWRL" XX SQ Sequence 14379 BP; 4682 A; 3081 C; 3364 G; 3252 T; 0 other; cttatcttgt ggtgactgaa gtgcttctaa gcaaggaccg ataaggcctc cgaacttagt 60 tggtagtcgc agattctcag ctaaaagtcg ggatagcttg tcttctgaac aagcatcagt 120 ccaccaaaag ttgactacgc gtacaattga aaaacgcatg tctggcagcg acgcgatcgc 180 atctgcattc gcgcgagcga atgcgcggtc ctacgagcca tactttagtc tcagcgactc 240 tcagaagccg cccaagctta acgctgacac gcctagcgcg ctagcaaagc tcatttatga 300 cgcgtgggcg aaccatttcg aacgctacgt gtcgaaggcg atctcctcga gagactcgaa 360 aaacccgtac tcgcagtaca gcctggacct tccatgttta tgggaagccc cgccgttagg 420 tcgagagatg cctaactggg ccggaaactg cggaatcccg agctcacttg ccgaattcca 480 gcaatggaag gacgccaacc cccgccgcta caagggaaag ggcctgattg agttcttaat 540 cgtagtagac ggcattccca gacccaatct tttcttcaac ggaactgggg acgaagaggg 600 cgcgcccgat ttgcaaagcg acccgctgga aatccagaac gaaacgaatc ggattttttt 660 aaactttttc ttcccgggca gagtcctttt cgaaggatgg aacgaaccgg aaaacctggc 720 atacgtcaga agcgcggtaa aaagagctgg tgaaactttc atcacgcaat gccgccccga 780 ccacaagata tacgtccaac ggggcgaact cgcttctcct agagcctaca tccaggattt 840 gttgaaaggt gtagacgaaa cggagaagaa gcccgaattt tttcgatggc acgcgctcat 900 gaggacgctg accgacagca cctacgacgg atttaccgtt aagagcaaga tggacctggt 960 cagacaggta attaccaagt gcttggggaa ttactctcgg ccaggtgaaa atgtaaccat 1020 tggtcataga ttcgaccgat cggttttcga cgccattgaa acgtggacac cattcttgga 1080 gctcacgtcg ttcctctgca gcatcctctg caagtcggaa aaggatctcg tcgaagtcct 1140 cgaggcactt gctgcaaagg taaattgtga atttacctcg attgacctcc gaacggtctt 1200 aaataaccaa aagttcatca taagcaagat agaacgcaag gaggaagaag aactagcact 1260 catctgccac gaaagatggg aaatggccta tcgggtagat ccagaaaagc cgatgagagc 1320 ctacggtgag gtcgtagcag cagaagctcg cgctgtcaaa gacagtgaga agcgcgacaa 1380 gttcaaatct ttcggcgaca gaaaattcgg cgacaataag aagcaaacga aacgcttcaa 1440 gccggaagga aagtcctcca agaaaaagcg tttccacgcc aactccgtca acctcgccga 1500 agacctcaaa gtcgatgcga actccgggac ggtttggcaa gagagaaaag acctcaagat 1560 gaaccagcag gaggtagcca gggtcgcgcc caacaaggtc atgtggatga acatccaaaa 1620 gcctgcccag aagttcgcca cacggctgca aaagtcggcc agagatcttc acaagaagtc 1680 gtggtcgagc aagaaaccta gaaagggcga ttacagagta atcgagcttg acgaaatcaa 1740 acgtgaactc gactttgttg ctcgtcagtt caaagaagag gaaggaggct ggtacatgac 1800 gcagagcatg ggcggcctgg agaacgaaag cggccccact ggagaagatg tcgacacaac 1860 ttccgagacg tcatcagaag atgaagaaca agaagactct gacagctcat ctgagcaagg 1920 tttgtccgtt gaaaaacaag ctgagaataa ccatgcattt aggctaatcc gctcggccaa 1980 agaaaaacat gctgtcatag aggaaactac gaaaattaca attaaatttg attacttatc 2040 tcaaagtaaa aacaattcgg aaaataaaat taggaatagg cactcgtata ctctcgtttt 2100 tgacacagga gcctcatcta ctcttgttcc aaagagattt tttattcagc tgcaagaaag 2160 tagcaaagtt gtgtgcagga aaacggtaga acatgcaggc gctgcagcag gcggtgggac 2220 gattgagctt ctcccattcc gagtgtcgtt cgcactaaaa gttggcgcat taaaacttaa 2280 tattacggat gcagccgttc atgcgaatac gaatgaccag tcgacaattt tgttaggact 2340 gagagactgg tacccaaacg gtatcgaatt caaaaaagat aagaacgggt cgcataaaat 2400 cttgattgaa gacgtacctt tggaggaatt ctggcagttt acagaagctg tagactcgaa 2460 gctgcaatca tggaggtttt tgagcgaatc gttcctgaaa atcacgtcag aatccctaat 2520 tgcaggagtt tctgtaaagt cagatattga cccgaatgac gaaatatcgt cgttagtatt 2580 caacatgaag ccaacggata agttctaccc gaacggtatg tcgcatttct ggaggaaaat 2640 tgagaagatc caccaggaga acaaggagac aaacacggtc gaagacgtgg tcatcgatga 2700 gttcggagac gtacttgact tatccaagcc ggaagatcgt tcgttcaaaa gaaagatcaa 2760 acgattgttg ctaaagtaca gcactgtggt tttcagtaag actcaaggcg tcgtacctgg 2820 cgcagaattc gtcgtaaaag gcaccattaa gggtaacacc acgggaaagt tctcacagga 2880 accgtacgag aggatgccac cggcccaagt aaagcttatc acagataaga tgaactcgga 2940 gttagcagaa ggcatactca gaccactggg aaacgaaatg acggctagaa attatgttag 3000 aatcttccca gtcgcgaaga aaaacctacc aggtatgaca gtggaagaaa aggtagccaa 3060 tgtcaggctg gttctcgatt gtagccgaaa cattaatcaa ggcaccacgt tcgccaacca 3120 cgaaatggat tccattcgtg atgtaatcca acgcatggca ccgtacacca agacgggctt 3180 gataggatcg ttggacataa gcagcatgtt cttcagcttc aggctggaca gagaactatg 3240 ggaggatttc tgtgtgagac atcctgagat aggagactgc gcaattacga gattgccgat 3300 gggttgggtg tcatcaccga gctacgctcg agattttctc tcaagaattt attatccatg 3360 taacaatttt ctagtcaaat acgttgacga ttttgttttc tgcgccgatt ctcgagaaga 3420 cttcctaaag aatttagaaa tgattctgca aataactcat aatatgaacc tgagattttc 3480 aggcaaaaag gtcagtctgc tcagcaaaag cttgaaagtg ttgggaaagc gtttgaaaga 3540 cggaaagatc acggtagacg cgcacactgt ccagaagata gtcgatgaag aagtcgataa 3600 gctaaaaacg gttaggcaaa tgcgtgtaat tctgggaagg atagcgttta ttgcagacgc 3660 catcccgtcc agggtggaaa taaccgcgaa actggcagaa atggtcggta gtaagaagtc 3720 tacagaacga ctcgaatgga cgccgcagct catagatgac tttgagttgc tcaagaagac 3780 catatcgaac gcattggtcg acctgtaccc cgtgatccca ggcctcgaaa cgattttagt 3840 agtggactcc tcatatatta gcacaggagg ttttttgcta caaataaagg acaacaaaaa 3900 gaggctgatc agactgtttt ccaggaaaag atcggactac aacaaccaaa tcagtatgtc 3960 aagttgttta ttggaaatat gcggactagt agcggcagca aaatacttta aagtagaaat 4020 gcaaatcgcg gctgtcacta ctcgtattta tacagacagt atttctgtgg aaaaactgtt 4080 caatcgcctg ttgaaagggg aaagcttatg cgatgacctc aggataaata cgcacctatt 4140 agaccttatc aattatgata tcgagatcat ttacctgtcg aataaaagtg agtacattgt 4200 actagcggac catttatccc ggcgagaatc gctgagtaca aaatgcgaag gccagtgcaa 4260 agtgtgcgaa gcagctgacg ccccactgct taactccaag acactcatgc tgacgcagga 4320 tacgatgaag gcgaatgagt tcgtcctaat caacgattta gagccagtag gttcagtcga 4380 acagttcgag ttggcccaaa caattaaaga tgaatacctg tggtggaacg agcataaagc 4440 tacgttctcg cacgctcgtt tcgcaccgtt agagggagaa ttattgagct ttaaagtcgg 4500 gcgatcgaac ccggcgatat tgaatgactt ccctgagtta aaaggcatgt cgttgaagca 4560 attcctgaat aataaagagt tattaaacag gatacaaatg agatgcaaaa agctacgcgc 4620 ggttttcaaa gcgaaagaag atctgattct cccaaacgct agaaacagac ccggcgagac 4680 ggcgagatgg agaaacgaag ttatcgacgg tgtagtagtt cgaaagcgaa acttcggcat 4740 ccgccaaaat tacgtgatag tcataccgga aaggatgacg aattgggtcg tccagaagat 4800 ccatgatgaa aaaggttgct cttcccaaac ggcaatgcta aatgaggcaa aggcatgcat 4860 ggaagtacca ctcgtaagag acgcgatcaa ggcgataacc agcagatgta gaaattgctc 4920 cttccttcgc aatgtgccga acatcgcgac tcaactgaaa gaatacaatg actttaatcc 4980 tcagttcgca ggcaatgtgg taagctggga ccagttgacc agacactcgg aaaaagcaaa 5040 taaagaattc aaattttggc taatcgtaga ccatttgact agctttgcaa aattgatccc 5100 ggtcgaaggc cgatccaaca cggaaaataa ccgtaaagcc ttaattcgag ccattggcac 5160 gattaaaagc gatatggacg gaaacactaa ggtaatcacg gacggagcca caattaatat 5220 ggcattagtg aacgatttcg agctaaaatc gaacaatatt gaggtagtca taacggatca 5280 actcacacgt tcaaagaaca acatagcagt cctagactct agaagtcaaa aattaacaaa 5340 atttttagtg ttggctctga atcagtggac tgatccgtgg cagatcgcgg aaaaagtgaa 5400 ctttgaccac aataaactaa ggtccccgca cggtttctca cctcaagaat tgttatcgag 5460 cagatgccaa tccacgggag acaaaattac tgtcaactgg aaccacctga aagacagaat 5520 taaaactgca cgagctattt cccgaaaagc caataataag atggtaaaga aggggttcgt 5580 acgcgatcca atgttattca cacccttcga agaatccgat aaaatttacg gaggtacgat 5640 aaggacgcca attaagctcg gcgacataat cattctatct gaagcatata ataagaataa 5700 atcaagaata ttttggcagg tcacaaagtc gccagaaatg ccagaaggaa tcgatttcga 5760 caaccaaata gtaagtacgg taaaaatgga tttgaggaag attcagccgt cttcgagaaa 5820 ggtttggagt tttaacgcaa ttaagttggt catcgacggc gagaaagact tcggcaacgg 5880 caagccagaa ggaagaaacg aacttccaga aaagttctcc tgcaacttca cccaactcaa 5940 tacaaccgaa atagaagcgt ggcgcttata attaatttct ttaacctatt gtctaaaata 6000 tcgtatattt aaaaagctga acaagattac aatgcaacga ataaaatatg aaaacgtaca 6060 ttccgagtct caaagtgtat ttgtgaacat acagtattac taacaatata ttatatttat 6120 cgtaagacag attaagtaag gacgagtccg ccgagtttca tcacttcggg atattgagta 6180 gccaaaccag agacgagatc gtccaaagtt cggtcaaggg cagaaatgcc acaaaccaag 6240 gcaggatccc gatggaggtg accagcgcga gcaaaaaggt acatagatcg ggcaagagac 6300 tgcaggtagg tggctagata gtcgacaaaa agactggaag tcacaccggc aagcaatcgg 6360 ccgcgaagct catcggtttc agcacagcgg cgagatacct gaatataaaa tatcgaccaa 6420 atacgcgcaa tctaccgttg gagaaaaacc aagtcggatg acgtagttgg gactcagact 6480 ctcgctagcg aggaaatgtc cgaatcgcgt ggcaaaacga acgtgccgat cgaacaacgg 6540 catcggtaca gggtttaaaa attgaggagt tttgtcagcg cgcataagat gtccaaaatc 6600 ttgaggatag gccttttggt agcgcagtcg tccgttcagc tccgtctcac atatggtaac 6660 ctagaagaaa taatgttaaa gataggttat aatatacaat attaataata tttaatttaa 6720 tattgaaaga atgagaactc acgggatggt cagcgtagaa gttgaaaaat aaggcaaatt 6780 cgggtaagtg ccatccatta tgcaacaagt cgatgagcag atagcgatga gagtgacagg 6840 attgcccagg tcgagaataa gttcgctcaa ccgaacgatc agataattct atctgacgaa 6900 cccgtggaaa tcgaggcgaa aaatatcctt tgactcgagc tttagccaca gattcacgcg 6960 tttcgggact cagagcagca ggttgctcac tctccaactc cagcgtagtt gaccgtccag 7020 taggaaactc ttcttcggtg tcaaaaggtt ccacagtctc agtttcgccg atatgacgag 7080 aaaggctctc accggagggt aaatcaggag taacttcgga ttcatccaaa accttaacaa 7140 gggaagctaa tgtctttggt gtaccgaaaa gaaaaccgtg gaaggacgcg gaaacggaat 7200 cccgggacga gagttcaagg gagtcgaacc gacaagagta ccactagctt ggccaagagc 7260 gatgagcggg agaccataac cacggttgcg tgcgacaatg tcgtacatat tgcggtcaat 7320 gacgccgcgc tgccgctgac cgtacatcat gctcaaatac taaaaagaaa gcgaaaaatt 7380 gtaaaatcta tgaagaacaa attacagtca gtcgacttat ctgaagttga tcgtgctgat 7440 gctgtagctc gtgaggaggt agttccgttt tttcctgcag atcggaacga tcaatcagct 7500 gatactcacc cagctcgtca acagcagact cgacaggcac acgatcggaa tcagacttgg 7560 gaacaggcaa cgatacaggt ttcacatcaa gagtgaccat agtctcgtcg tctgaaaaat 7620 aaaacggtta gattgcgaag atcaaatcag taaaatctcg acgaaaagaa gacaaccgat 7680 gcaccgcaaa gacgaaaaca agtgtttgcg tggatcgaaa aatatgaagt gagttatgat 7740 tcgccaaaat aagctgtgac agcgacccac cggctcatga gacgtgttta acgcacatcc 7800 aacaaatccg agattatact gtactgagag aatttctgtg aaatattacg caaagaaagt 7860 ttaataaaat tatttacccg tgtcttgtat agtatggaag taccgggaaa ttggcccgaa 7920 aattggtcgc acgatcggcg ttcttgcgca tactgtgagg aaagaggcat gaccttctac 7980 ggtaactgcc aatcctgcga tggatcattt cgaaccagtg cttttgacga aacttggaat 8040 cggaagctgt gcgataattg cgtcgcgcaa cagaacacag aaattaaaaa caagagatag 8100 ctatccgttt gtgaacgaca tgttttcatt attaaataca aattcatcaa tgtcgtatct 8160 aataaactaa aattaagtaa aaattcagcg cgtagtaaat aaaatatttt aagccatgga 8220 tcgtatttgt ggagctatgt tgtctagatt taaaggcccc cacaacgaac ttacagttcc 8280 aggtcgtacg aagaaggagc tggcgaaacg actactcgta agagattacg aaggaagcga 8340 attggccaac cttctcagcg acagatatac agctggagca ggacttcgac tggctgattg 8400 ggacgtcgcg gtgaaagaac tggtgtcgtt aaatcctagg aaaaggatga agatgtcgcc 8460 aggtcaggag tctgtacgga aaagcctttt cgaacgatta gacttactgt taatggcaca 8520 caaggggcga cgtagggctg atgacgtact catcatttcg gagaaagata gtaattacga 8580 atctttcgtc acggaactct gcttaagcct cggctggaag tacagactgg aaagagttgg 8640 aaatcgcaca cgcgaaggaa gactcacttg gaacctgcta agaccgaatt cgtcggcaat 8700 ggaccacgcg tttcagcaaa agctcgccat cttccacatg gggtcgtcga tcgaaagtac 8760 gaccgaaagc atcgcttacg agtctaaaat taaccactct ccgtactcgt tcacactcgt 8820 cagcgagccg gttatgaaag aactcatcaa cgacgcgtac gccaagggtt tacgcccaaa 8880 gagcgccttc cattcatgct cggacttcta ctctctttgc gaagaagtcg agttaaacaa 8940 cgaccagttt gagtcagact catcgacgca ggtaaaacat cctaagtaaa tggtaatact 9000 gatatgaaaa atagatatca atttatttgg aaggtaaata cgaagtgtga gttatgggaa 9060 tgtaaatgaa tttttcaata attaatgggt gaacaaaaga aatctactga aaagtacagt 9120 aatcgcagag ctaagcacca cgtcgttcga gcccgtcttc aattttcaaa ttgacagctt 9180 cgtatgtaat acgaacttct cgatggcgaa tttgagcgcg agtaaactcg gtttcgtctc 9240 gctgatcgta aatttgtggg tcaccgcgga atcgcgcggg caagtcatca agggcttcga 9300 catcacgatg aaattgttgc taaaatatac attaaggaag gcagctccga cggtttaatg 9360 ttcgaaacga gccgagcagc caacgtagcg accacaccat cgtcgccact aacaactcta 9420 ccgtgctggt ctatggaagc acaacaacgc gtgctcaaac aggaattcgc ccgagcattc 9480 tcacctcaat ctgacgagag acgagagcag gagcccgcga ggtaggcatt ctatatcctg 9540 cgatacgtct tcgaagcatg tagtggggct tgaagtcagt cagctcaatc atgcaagtgc 9600 ccaagaaagc aatatgcata ccaggcaaaa caacgggagc aagacctgaa tgagcatgta 9660 aaaattgatt tttagtaatt tataattatc attagcagtt aataattact cacgtgattt 9720 gggtccgacc gggcaatcgt taaagcgcga gtcattgtga atcgtttgat tgcagggatc 9780 tcggatcaga gtatgctcaa atgatcttcg agacaatcgc tgtcgatcgc gaacaatttt 9840 attgtaaatc aagaatccaa cctaagataa aaataaaaat taaattaagg aaatcagtgc 9900 tatgactcac ttgatattgc aaaatactat gaagccctgt cccggttatt tgatagatat 9960 cgtgagtgcg agaaagtaat tccaaatgat gcgcattgta gaagcaggtc tccagactag 10020 tcgcaatcgg cgcattgtaa aacagcccgc agagatacag cgctctgctc aacaaagctc 10080 cccacaccaa gttgccaaca agacgagtgg gctggtagaa ttctgaatta acaaattaat 10140 ttagggacac tgaaccacgt ttggtgttta gaagccggcc aacgttcgat ccaccgattg 10200 agttaagcgc ggaaacgaac tccgaagcaa tggacacgtc tggcgaaatc gtgaatccgc 10260 cgaaaaacat ccctaagggg gtgaaaatac tgtatccggc tctagcatgc ggcattcaat 10320 tctgcccgca cgccgacccg tggccgctat ggaacgtgtt aactcacaac gtcgaagttt 10380 gggagaagct ttgttccgaa cgagaatcag atctgctaag atccgcacgg ctagaaaata 10440 tcaaaacgcc ggagctgcag atcagaaaga gcttccctgt ggccggattt actgactcat 10500 cagagtgtct cttctgggaa ttatcccaat ttaccgctct aaacgaatcg gcatgcccag 10560 atgaggacca cgtgttcgta cgcaaacctg gagatcttac catctcgcaa gaggtgtgtg 10620 gatggacacg cgatctgacc cgaaaaatgg tagatacctc cgatcgagtc gactacgaaa 10680 accccgacct agtaatggga acggtgagcc gcagaaagct agaaatctcg atcgagttcc 10740 ctgaactgaa cgagatgatt gcggccaaaa attggttgga gaaaatcatt tccacaataa 10800 aagagttgga atgtcgcgaa gagaaccgca aaatcttgct ggaaaaggaa aaaggtattt 10860 tccgatttca acaataaaat aaattgactt acgtccaaga ttggcgttgc gatgagcgcc 10920 cagcatccga tcattctcct caaacgagcg actctctccg agggactcca caagcagacg 10980 gagaatcatt cgtgttttct tcggacgcac gtctcggcca tcaagacgac cagtagacaa 11040 gccaattccg ccccgagcgg ttttacaatc agcaccagtt ttgccctctg cgcgaaatcg 11100 tttctcttgc tgaaattata aaattaactt agaaaatcta gcacttgatc ataagaataa 11160 atcagttaag attgacgata attacctgtt tgcagcgaaa gaggaataga agaaccgcga 11220 tacagggctc attgcaaaaa tgggaatcaa tgaacccttt tcgaaagcag ttattgacgc 11280 tgcggctaaa aggaatcgat gagtaccact cgtcgaccat ggcgagagga catctcagca 11340 gaagcatgag acgcagaaac tgtgtagtaa ggttagcaaa cttagaatca gttacgctgt 11400 gaacatcatc tgaaatacgg tagataaaat tgataaataa gagactaaac ttacgatcag 11460 gatcaaagtc gccttcggcc tctaaggact cccactctcg ctgatttcgc agttgataca 11520 aaaccgactg ccagaggtaa tccaggctgt aggagaagcg atctaaatga ccataggact 11580 cttcagagcc atcggaagaa gagctgtctc cgccgttatc gtcgggataa aaatcgacgt 11640 aaagaatgcc atgctcaaca ggagcgtcct cttgagcacg ttggccattt tcatactgac 11700 cgcgacggta aggctaaaat taaccatgaa ttaaagccac ttatattaca aattatatga 11760 gtatagtatg aataaaattc aataaaatct gttatataaa agtagaaaga aaatcaaaca 11820 aattcaaatt aagaattacc ttccaagagt aatgccattg caatacgaaa cgatcctgtc 11880 gctcaaaaca gccgtcctcc agttcgcctt gatgagctat ggtagcgccg acaagcgcat 11940 taatcgttaa agttgtattc ttcagcgtat gcgaatgcgg aggaggacaa gtcgtggtaa 12000 atgtcgacgg gtgctcgatc gtattgtttt cggcaagagt cagaacgggc gccgtgagct 12060 caatttcttg attgagatca acaacgagat tctgataatg ttcataacag aaatcctcca 12120 aatgacacgg gacactaaaa acatccattt aattaacaaa ttttaaaaga aaaaatatta 12180 aagattgtct aaaaacatca atcatgaccg attccgacaa catgagctgg aacagcgcgg 12240 atgaagaagc cgccttagca atgcctccgt tgagggaaaa tcgaaacgaa gccggtcaag 12300 agccagctcc ccaagagcag cttcctgtac cagcaccacg accagctcaa gctgagccgc 12360 cacaaggagg cctaatggac gccttggcac gaggagttca tcaaatgaac gctcgccaag 12420 aagaactggt taacttcgag tcttcggacg aggacgaggc cccggtgaac gagcacgagg 12480 acgaaccgct cgaagctcaa acggaagcac atcgtcgggc cgttcaagct aacgaatcga 12540 aaaaggacca ggataaatat agaagggaat tgtcggaagg tgcaacccct gaccgttttc 12600 tcgaaatttt cccgctcctc gatacgatcc gccaacagtg tatgcgcgac tccatcgcga 12660 aacgatacgt cgatgatata aacgacgtgg cgaaggaact catcaaagtt aacaccgcag 12720 tgcaaacaga gctggaccct gtaagttaaa acgaggaaaa taaattgaaa aattggggga 12780 taagctcata caattcgggt aacttacgcg ttgaagagtt tctggtagta ttcctcgtca 12840 agttcctcgg gatacgaaat atcgtaacga tcacagaagc ggcgcatatc gattctgaaa 12900 agtaaaaata tgttaattcc ataaaataaa atgcggggaa aaagtgacca gaccttagca 12960 aaataaagat tctagaaaat aaaacgagat tatctgtaat aaaaataaaa ttaataaaga 13020 gtattttagt accgcccggg aaacgcttgt ttcatgaccg agctggccct cgcaggagca 13080 tcgaaaaagc tcgtggacgc cattctagca gtgccagccg atgcaaaatg ctctcactgg 13140 cggatcgaac ttgaataccg gaggacgttg gtggtgaaga cttcaaaacc tattgtgatg 13200 aagttaatct tcacacaccg ctctttcaag aacaccctcc aattcatgga cgagcgccat 13260 gtaaactact cggttggcga catgatcgag catttcaaga acgagctcga agacgttttc 13320 gtctcactgg ggggacgcgg attcaaaatt ccgcaaatgc cgaaaacttt ggtaacgcac 13380 ccagattttg gcggccctgt aatgcagatg gttaattcga tgaaatcatt gaagaaattg 13440 gtaagaagtg aattgaattg tgaaaagaaa cggaagcaaa actataaatt tgtaaattaa 13500 tacaatatta tggaaaagtt taaaacagga aacaataagc tatcgagtga tcagaaagtt 13560 ttgtttttct ctgaataata tgaacacaat ataattttga aatgattttc tctttggaaa 13620 taaaatttct atttggaaaa gataattaag atctgacttg atatgaaaat gtgacgcgaa 13680 aaataaaagc gtatcagtcg cagcgcgcca atagtaagcc gccattcgaa atcgagccac 13740 aattagtaat ttcaatattt tggttcgata ttaagatctg taaaactctt tgaataatat 13800 aaataattcg ttaatagaaa ttaaaatcaa taagagaaat aagagaattt tgatagttat 13860 ttttaatata taataatttg tcaacgcgaa aatgaaatat taaagattca tttcagatgc 13920 gggaatgcaa aacctctctc gacggactga gaggtaaaag tctctctgag atcgccaatc 13980 tcacagagca ggaacgactc gctggcagaa tgaaggctct ccgaggctgc gcagaagcca 14040 gggaatgcac tatcgtagtg ttgagcccga ctaacaagct ggtggggtac cgccttccgg 14100 caccgggatc tgatgagcca gcgaaggagc tgacgaatct accatcatgg gcctcatctg 14160 cttttgttac ccaaagcaag aaattcttct ttggaaagcg caaaggaaac aacaaccgca 14220 acgcaaacaa caagcgcccg cggaacaaca acaattaggt aattaggcac aatataaagg 14280 gaaatatata taatcttatc ttgggactct actgttcctc attaataata ttaacaaggc 14340 catccagtct aaagatcaaa aaaggggacg cgacaatta 14379 // ID CR1-86_HM repbase; DNA; INV; 3073 BP. XX AC . XX DT 12-APR-2009 (Rel. 14.04, Created) DT 12-APR-2009 (Rel. 14.04, Last updated, Version 2) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-86_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3073 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(4), 733-733 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(3..827,831..2579) FT /product="CR1-86_HM_1p" FT /translation="EINSNFFLPGYNSIHQPRKNGIGGGVCIFILNSFTFR FT KLLSLSVNDANCESLSIEILNRKTKNIFITSLYRPPNGNFEAFEKHLKNIL FT PKVIKKHIYITGDFNIDIQKINNDSYAKRFVNTLLQNSLIPVINKPTRVTR FT NTSSIIDNIFTNVAMTCQIKSGILKTDISDHFPNFLINYSLISESKNSFVT FT YVRQINNQSIIKFRELLTEIDWNLLTECNDANDAYNLFLRLFSKQYERSFP FT KIKKTTNSIKINNPWMTNGLRKSSRKKQRLYEKFKNKTYNNEIKYKKYKNC FT FERIKKQIKKSYYSNLLEHSKGNSKKTWEVIKEVTGKKQLHKSTIPNRIKL FT NSNREIFDKKEIAENFNKFFINIGKKLAADITPGNKTFQSYLKKTHYVMDE FT SELSLNELQASFNALKINKSAGFDDINVNVLKAVFDAIKLPLLFIFNLSIT FT TGIFPCQLKIARVVPVYKNGDDSIPSNYRPISILSCFSKLLERIMYNRLYS FT YLEKHNILFSKQFGFRKGHSTHHAVTDLASQILDGFAKKSYTLGLFIDLSK FT AFDTVDHKILLYKLETYGVTNNYLKWLQSYLKNRKQGVSYDSTCTKLETIS FT CGVPQGSILGPLLFLIYVNDIYLSSNMLNFVLFADDTNLFYTHSNLKSIFF FT TVNRELENLNDWFKSNKLSLNISKTKYILFHNQSKSNDLPLQLPQLIINNN FT IVNRVSSLKILGIIFDEHLNWKNHITLIENKISKTIGIMYKTKHLLSKKCL FT IDLYYCFIHSYISYCNIAWASTYPSALKNIYIKQKQASRVIFHVHKYAHSK FT PLLREIKALNVYELNVFQNLLFMFKYQNDMLPKSFNNRFVKINHKYLTRCS FT TYFY*" XX SQ Sequence 3073 BP; 1196 A; 462 C; 382 G; 1032 T; 1 other; atgaaataaa ttctaatttt tttctaccag gttataattc aatacaccaa cccaggaaaa 60 acggcattgg cgggggcgta tgtattttta ttcttaattc ttttactttt agaaaattac 120 ttagcctcag tgtcaatgac gctaattgtg agtcgttatc aattgaaatt cttaatcgga 180 aaaccaaaaa tatttttata acttctcttt acagaccgcc taatggtaat tttgaagcat 240 ttgaaaaaca cttaaaaaat atattgccaa aagtcattaa aaaacacatt tatataactg 300 gtgattttaa catagatatt caaaaaataa ataatgattc ctacgctaaa cggtttgtaa 360 atactctact tcagaacagt ttaatacctg taatcaacaa accaacgcgc gtaacacgca 420 atacttcttc aattattgat aacattttta ctaatgttgc catgacttgc caaatcaaaa 480 gtggaatatt aaaaactgat attagcgatc attttccgaa ttttcttata aattactctt 540 taatttctga atctaaaaat tcctttgtaa catatgtaag acaaataaac aaccaatcta 600 ttataaaatt tagagaacta ttaacagaaa ttgattggaa tcttttaact gagtgcaatg 660 atgctaatga tgcatacaat ctgtttttac gtttgttttc taaacaatac gaaagatcct 720 ttccaaaaat taaaaaaaca actaattcca taaaaataaa taatccgtgg atgacaaatg 780 gtcttagaaa atcatccaga aaaaaacaaa ggttatatga aaaattttaa aaaaataaaa 840 cttataacaa tgaaattaaa tacaaaaagt ataaaaattg ttttgaaaga ataaaaaagc 900 aaataaaaaa aagttattat tctaacctct tagaacattc aaaaggaaat agtaaaaaga 960 catgggaagt aattaaagaa gttacaggga aaaagcaatt acataaaagc actattccaa 1020 atagaataaa gcttaactcg aacagagaaa tatttgataa gaaagaaata gctgaaaact 1080 tcaacaaatt ttttataaac ataggaaaaa aattagcagc tgatataact cctggaaata 1140 aaacattcca atcctattta aaaaaaactc actatgtaat ggatgagtcg gaactgtctc 1200 taaatgaact tcaggcatcc tttaatgcgc ttaaaataaa taaaagtgcc gggtttgacg 1260 atattaatgt taatgttctt aaagctgttt ttgatgctat taaattacct cttttgttta 1320 tcttcaatct ttcaattaca actggtattt ttccttgtca attaaaaata gcaagagtag 1380 tccccgtata caaaaatggt gacgattcta taccatctaa ctacagacct atatctattc 1440 tttcttgttt ttcaaaattg cttgagcgta ttatgtataa cagactgtat agttatcttg 1500 aaaaacataa tatcttattc agcaaacagt ttggtttccg gaaagggcat tcaacacatc 1560 atgcagtaac tgatttagcc agtcaaatac tggatgggtt tgctaaaaaa agttatacac 1620 tcggcttgtt cattgatttg tcaaaagctt ttgacactgt tgatcataaa attctcttgt 1680 acaaactaga aacatatgga gtaacaaata actacttaaa atggctacaa agctacctta 1740 aaaatagaaa acaaggagta tcttatgatt caacatgtac aaagttagaa acaataagct 1800 gcggagttcc tcaaggctct attcttggac ccttactgtt tctgatttac gtaaatgata 1860 tctatctatc ttcaaacatg cttaattttg ttctttttgc tgatgacact aatctttttt 1920 atacccattc caatttaaaa tcaattttct ttacagttaa tcgtgagctg gaaaacctta 1980 atgactggtt taaatcaaac aaactttctt tgaatatcag taaaactaag tacattttat 2040 tccataacca atctaaatcc aatgacttgc ccctgcaact tcctcagcta ataataaata 2100 acaatatagt aaacagagtt tcatcactta aaattttagg aataatattt gatgaacatc 2160 ttaattggaa aaatcatatt acgctaattg aaaacaaaat ttctaaaact ataggcatta 2220 tgtataaaac aaaacaccta ctaagtaaaa aatgtttaat agatttatat tattgtttta 2280 tccatagcta cattagctat tgtaatattg cttgggcaag tacttaccct tcagcattga 2340 aaaatatata tattaaacaa aaacaagcca gtagagttat atttcatgtg cataaatacg 2400 cccactccaa gccgttactc cgggaaatta aagctcttaa tgtctatgag ttaaacgtgt 2460 tccaaaactt gttattcatg tttaaatatc aaaacgacat gcttccaaaa tcctttaata 2520 accgatttgt aaaaattaat cataaatacc taacaaggtg ctctacctat ttttattaat 2580 aacytaactc ttaataacat ctgactactc aatttcatat agaggccctc gcctgtggaa 2640 catagttctt aatgaaaata tgaaaagtat aacttcaata aattgtttta aaaatatagt 2700 taaacagtat ttacttgaat taacagacgc aaacatattt ttgcattttt aagtaaaaaa 2760 aaataaaaaa aataaaaaat cttgtaatat ttcttaatat atctgattat ttatcttttg 2820 caatttatgg tattcttgtt gtaatgtgtt aacttgtttt tttatataac gagtatttgt 2880 aaatttatat aatgagaaat ttgttaaaag tactttacgc ttactatagg ggcttgatga 2940 caagacttct gtcttctact tgctcctgtc gttattaatt tttttttaaa ttttattgat 3000 gaaaattgta aaacaagaac gttgtaaaat gtttataaaa gacgaattat atataattaa 3060 ataaaaaaaa aaa 3073 // ID Gypsy-91_CQ-LTR repbase; DNA; INV; 1369 BP. XX AC AAWU01007361; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-91_CQ_; KW Gypsy-91_CQ-I; Gypsy-91_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1369 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 562-562 (2011). XX DR GenBank; AAWU01007361; Positions 18183 19551. XX SQ Sequence 1369 BP; 373 A; 363 C; 322 G; 311 T; 0 other; tgttaccgtt gtggcatttt accctattac tcaccaagag caacaagaga tagagagtga 60 gagcaaacgg tcatagtact cttcggagtg aactcactat gatgacccgt gccataatat 120 tcgtttacat agtttcgatg tttttcaata gtattttcta tatgttttgc gtaatatatg 180 ataggttagt cattgtatgt tattcgttcg tcctttaatc aatggttaat agaaggaaat 240 tgtttaggag cactagtttg aataaactcc acagtaggat tagcaacggt cattttgaat 300 acacataaca caatttgtta aacactcctc accaattatg tgtactgccg tgcgactaac 360 gatggaatga ataaaaaaag ggtcacgatg aacctggatg atcgtggggg ggaggttagc 420 tactgggtca tgtccactgc agcgatcttt tccacttgaa cctcggaacc aacaacaccc 480 atcaaagttc cagaatataa gtaactcggt ctaagaagtt cgtccggtta ccaaagccaa 540 ataaaagtcg accacggata gtttatcctc gcgagtggaa aagtgatttg gagcaacgtg 600 cagaattgcg tagaatcagg taggcgaagc tcgctaacct caaacccgaa ccccagtagc 660 aaacccaacc taaccctccc ccattgtgcg aaacccacgt gtaggaccca acagttttaa 720 gctgtagaac cgctatcgtc ggggaacgga gttctccccg gccagtccgg gctgtagaca 780 cgccaaacgt ctcagccaga agtcgagccc gcccaaaaag ctcgccaagc aggtcccgtt 840 gaccacgtgt cctgacgtgg cacccgcaga ccgccgtcct cggtccgcga ggtaaagtac 900 cccctttggt taccaccaag cccgttgtgg tcgttcatcc gccgacgttc ggtagcctgg 960 ctaggctccg tcggtccatc cgcacccgtc cagaagcaca cctgtgcgcg actcggtaag 1020 ccggacgagg cttcgccgag atcatcgtca gcagctcgca gcaaactgga gtgcagaaca 1080 ccagcagcag cacgcagtac cgccggccgg ccccgaacac acacttacac acacgaacac 1140 ttcgtaagta attaaaagca agtgaaaaat gcaaagaccg tgtttctttt gagtccctga 1200 tcagcttccc cctgttgcat ctcgacccag agctgaggag tcctctccga aagctcgccg 1260 cgtttggtaa aagaggccag tggagagccc accaccgatc cccgtggcta ttagaccagg 1320 gagaggtgga tttgtctgga ggcgttgacc tccaagttaa aaactaaca 1369 // ID Ginger2-1_TS repbase; DNA; INV; 2815 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger2 DNA transposon from Trichinella spiralis. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Ginger; KW Ginger2; integrase; Ginger2-1_TS. XX OS Trichinella spiralis OC Eukaryota; Metazoa; Nematoda; Enoplea; Trichocephalida; OC Trichinellidae; Trichinella. XX RN [1] RP 1-2815 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 112-bp long. XX FH Key Location/Qualifiers FT CDS 1064..2626 FT /product="Ginger2-1_TS_1p" FT /translation="MDRKAVFEEKLRALIKEKGQNGTIFPISQRDQIITDI FT LRIQSDGPKSARDYNLKNRYGVLKIGEENKLIRLGKNDAIRCIASIEEMFD FT IINDAHQKIGHGGEKKTFREAQNKWANVTQEACHLFLTFCEECHKKRARKL FT PKSLVVKPLVSTNLMSRAQVDLINFQTMPDGDFKYIMTYLNHFTKFCILSP FT LKSKRTEEVASKLLEIFLTFGAPSILQSDNGREFSNAIIAELKTCWPELKL FT VTGRPRHPQSQGAVERLNGVVQDKLAIWMRENGCKRWSMGLKFVQWQINVS FT VHETTGQSPFKVTFGEEPRIGLESYVLPKSLVDAAKTEEEIEEFLTSHEAN FT DEDSLNRDGKNYEENESNIMKHFPETFIKARKEAASGQTRAAAKMTRRSKK FT MLIPLQIGQNCTLRVPDVDRGPADPKNFLVVVMAECEGLYTVGCREGKLAS FT KFTAADLQVISENLLSIDEVPDAEIPLRTAVTKATGGQGYVKCMCLSGCSS FT GRCSCSRKRVLCNSRCHPGKSCNNI" XX SQ Sequence 2815 BP; 935 A; 511 C; 584 G; 785 T; 0 other; tgttacgctt aatgttgaaa attgcctatt attgataata ggcaataagc atttgcctaa 60 tattgaaaaa acggcttacg agtcacgctg tatgttgaaa tgcttttatt aaccctttag 120 tgactgaatt gttatccagt tgaagtgaat acttaatttt gcagatacag tccttgagag 180 ttagttaaga tataaatgaa aaatcattat gaaaaaataa tttgaaagtt gttttttatc 240 gtgtactaat ttgggaatat tgtacactgc agtgtcttca tgaaagcagt atacggtgtt 300 tatttccaca attcatcact gttgaacagt aaacatattg gcagcaaccg ttacacaata 360 ctatcatgat attgagctgc tgatgatata ttgggaaacc tatctggtga agccgtttgc 420 cacactgaac acaatgattt ctgcagtttt tttgtgaact gtgcaacgac cctgcgtact 480 tgactgcagg aaatgtccac gtgagcttcg caaactgctt ggcaaatatc attacgttta 540 ggccttaaga tctgtagcgg tcttgctcgc aggagatgag ctgcaatatc ccttctaaat 600 gccaaatgtg tcatcgctga tctgtcagca tcatgaagtt cacaaggaag tcgccaagca 660 gctatcactg ttagttgaca cttcttagaa cagttttctt cagtcgttac aaatagaaat 720 agaaatagac gcttttgata aaattccata taaaccattt gatgatgcca gcataaaaag 780 ttgaacggcg ccaagtcgaa ggacacgatt aaaatttttt ttatttcaat gggcattcga 840 catcgcagtc tttagcgtaa gtattggcgc catttgtgga taaattaaaa gtgttggcac 900 cacattccaa attttgcaga catattatcc acttttctga catcaaaccc agaagtctac 960 atcaaatcta gtagcaacaa gttcagctac aaatcgttgg agcatgattt tacgtataat 1020 caggaagtaa cattcaacca cttcctcatt tgttaaagtt ctgatggata gaaaggctgt 1080 gtttgaagaa aagctccgtg ctctgatcaa agaaaaagga cagaatggaa cgatttttcc 1140 aatttcacag cgagatcaaa ttattactga cattttgcga atccagtcag atggtccaaa 1200 atctgctcgt gattataatt taaaaaatcg gtatggagtt ttaaaaatcg gcgaagaaaa 1260 taaactaatc cgcttgggca agaatgatgc aataagatgc atcgcctcta ttgaagaaat 1320 gtttgatatt atcaatgatg ctcatcaaaa gattggtcat ggtggagaaa aaaagacatt 1380 tcgagaagca cagaataaat gggcgaacgt cacacaagaa gcttgccatt tgtttctcac 1440 gttttgtgaa gaatgccata agaagagggc cagaaagctt ccaaaaagcc tagttgttaa 1500 accactggtt agtacaaatt tgatgtcacg agcccaggtg gacttgataa attttcaaac 1560 catgcctgat ggtgatttta aatatatcat gacttatttg aatcacttta cgaagttctg 1620 cattttgagt ccgctaaagt caaaacggac ggaagaagtg gcatcgaagc tgctcgaaat 1680 ctttttaacg ttcggcgctc ccagcattct gcaatccgac aacggtcggg aattttcaaa 1740 tgccataatt gcggaactca aaacatgttg gccagagctg aagcttgtca cgggtagacc 1800 caggcatcct cagagccaag gtgcggttga gcgcctaaac ggcgttgtgc aagacaaact 1860 ggcaatatgg atgcgagaaa atggatgcaa aagatggtca atgggactga aatttgttca 1920 atggcaaata aatgttagcg ttcatgaaac aacaggacaa agtccattca aggtgacatt 1980 tggagaggag ccacgaattg gactggagtc ctacgtcctg ccaaaatcac tagttgatgc 2040 agcgaaaaca gaagaagaaa ttgaagaatt tttgacatct catgaagcca atgatgaaga 2100 tagtttgaac agagatggaa aaaactatga agaaaatgaa agcaatatta tgaaacattt 2160 tcctgagact tttataaaag cacgaaaaga agcagcttca gggcagacta gagcagcagc 2220 aaaaatgaca cgacgatcta agaaaatgtt aataccgctc caaataggtc agaattgtac 2280 actgagagtg ccagacgtgg atcgtggacc cgcggatccc aaaaacttct tagtagttgt 2340 catggcggaa tgtgaaggat tgtacactgt tggttgcaga gaaggaaaat tggcatctaa 2400 gtttacagct gcagatttac aagtgatatc ggaaaattta ctatcaattg atgaagtacc 2460 tgacgccgaa attcctctta gaactgcagt aactaaagct acgggcggcc aaggatatgt 2520 taagtgcatg tgcctatccg gctgctcatc tggtcgatgc agttgcagca gaaaaagagt 2580 actatgcaat tctcgttgtc accccgggaa gtcatgcaat aatatataaa cctcgtgcga 2640 tacatttgtt tctgataata aatacaaata tcattaataa aagcatttca acatacagcg 2700 tgactcgtaa gccgtttttt tcaatattag gcaaatgctt attgcctact atcaataata 2760 ggcaaaacgc ttgcctatta tcaataatag gcaattttca acattaagcg taaca 2815 // ID Gypsy-24_IS-LTR repbase; DNA; INV; 173 BP. XX AC ABJB010964315; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_IS_; KW Gypsy-24_IS-I; Gypsy-24_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-173 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010964315; Positions 4142 4314. XX SQ Sequence 173 BP; 36 A; 33 C; 50 G; 54 T; 0 other; tgtagtgggc gctactgaag tgatggtgtt tttttttatt gctcaccaga ggacaccatc 60 atttatagag gattatataa ggagcggcgc gtgagattaa agggcttttt tggttgtggt 120 cacgagctga gctgctgtca cttggccgtc gtccgttccg aacgttccga aca 173 // ID Gypsy-3_PPP-I repbase; DNA; INV; 6054 BP. XX AC ADBJ01000006; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PPP_; KW Gypsy-3_PPP-LTR; Gypsy-3_PPP-I. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-6054 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2162-2162 (2010). XX DR GenBank; ADBJ01000006; Positions 100832 94779. XX CC Positions [4459-4983] - Integrase core CC 'GTTTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 307..6045 FT /product="Gypsy-3_PPP-I_1p" FT /translation="MSQLNTSIPTTNDETNSGQMADISPTENQAETNVSDN FT TMSITNPLNATSMTNTNAIEQTPLSASSAGDIQSATSTNVTNSGSPRDTQS FT ATSTNTTNNGSPSLNQSATSTNATNGGSPRTTVTNGNLSTTSTINNVPNLP FT SISSTNKNATSISNHGTSPNRNVSTNSHDKGHQYHRSESEDSNSESEDDYD FT DDNNHRRKRSKKEIGRIAFQDPSDPESSDSDTSDQSDDETVKMPIDKIKYT FT RNTTPKSARSNRHRTVVEFSSQALRDARARIRAVSFNDMLLASNKDLIERA FT AQFLVSNPLLGDEGAFEAAVTERVDLLGIQSLVSSHLRDQVKNYTAKIPLF FT VPDMSDTQTFISQVENVVPEDRLRCVIALSKLSEEAITSVSSAEDAEILKR FT TMKWKEFASILTNTFGIKRESDEYVTDLKNLRLEQFETTHKFIVKFREIYS FT KSRLSQTDIATQLYSLLPSNIKADPELTKAHHKKSINALTKRLNDLGRKQL FT DDKLRLKREEKVMHFTTRNHQPVEYHTSNVQPNLLQKWSPNNTNKRSAPPS FT VGQNKTKFTEKKSKIGTCSICKKGNHLPNNCFYGKFGKFRDIDPATGQRKQ FT ANAFLCSMPGTINNMYVNIFIDSCSNINLLSFDLLQESDIKNLTDVNITLN FT GINKDSSTLSKQINKNITINNKVFKIPFFIINSKTNFILLGTETLQNLNLK FT IDFKNKIISLPNVDKENNIKELAIKFNTLNDSIQTNLIETRNCLISQSESD FT NDECCALVYLNDISVDENTNKSNTHEKLDESKNSNHTFESNLSKKIKNLIL FT NKYPDVVKPFDKELPPSRGDFDFKINLTVDQPPKPKNYPIPHSFLDELEKQ FT INTYLENGQITPSQSAFAAPLLFIKKKDGGWRLCVDYRSLNGITIKDTYPL FT PNITEVLNNTRDGVLFSKIDLLQGYHQIRVHENDQSKTAFRTSFGVFQYTV FT LPFGLTNAPACFQRLMDSIFQRHVIAKKLLVYLDDLLIKTNIDDEDKHIQD FT VLEIVDLLNQNKLKIKLTKCIFGQYSLEYLGHIIGHNKLIPINDKILAIKN FT WKQPITKRELRGFLGLTNYYRKFIPKLSEIEAPLIDITRKNKLFKWEDIHT FT ETFNLIKNQISDSSFLFIPDYKLTFHIDCDASNDGIGHVIYQYKDNIEQED FT NKQIVLYGSKKFNTTERDYHVFEQEVMAIKHALESNYHMLLGYKIVIHTDH FT QNILFINNKLNDNTKPKLIRWLQYIFSFNPTLIYKKGSDNVIADGLSRYTY FT STTISIDDNDLIAMIENGYIEEETQIKNKTIHPSKYYKSKSVKTEGKLKYF FT EDKIVIPQVRSIIKRILFIYHDSLLAGHHGIEQTLELISRHFIWEGIAKDV FT TNYVKACHTCNKATDARGQSIGKLQPLIVPTKCFESISMDFIPLIDTEYKG FT IKYNKVWVIVDRLSKMVRLIPVHSTYTSKDLAEIFMKEIFKHHGMPVEIVS FT DRDSKFTSKFWKDLMDILNCEIRTTTTENQQANGQVESIIRYLGNLFRKAI FT IATYNEENHDEWITLVDTIEFCVNNSIHHGCEYSPFKIYTGENPITPLSLI FT QYKLKMNSSNVDIENMINSKRNQLKIVRNWLFDNQYNMRIQYNKNKKEMDI FT KVGDSVYLRKFRSNPKNKIKLETRFDGPYEVTNVNGLNVTVANVESGTKRK FT RINPSFTRHAKYFKKHNSLDDEDDDDIEFGEVEEEDEIEIDSITQNEQNNQ FT NNNNDDTDDTEAMDIDSELDLVTNNNSTTNNNPPINTPDSNTLILDNIPNS FT TTTHPIIHNTLTFHNERGFSESSEITNFQRISAKEIKEAFEKKNKLTVLEE FT YIQYHNSHPNHPPTKQSAKILIALINANIIRKDTMFVTRKRTFNLTNKRDA FT KTEYLFNIGGVQGWILDNDVHQSYKSLTGDYRNWLKTNSN" XX SQ Sequence 6054 BP; 2355 A; 1156 C; 899 G; 1644 T; 0 other; tggtagcatc tgctctttaa agcaccgatc gaccatagca cctccaagtg cataatatat 60 atcaccacca agtgattttc accatcaagt gaaatacata tatatatatc accaccaagt 120 gattttcacc atcaagtgaa caatatatat atatacatca ccatcaagtg attttcacca 180 ccaagtgaac aatatataca catcaccacc aagtgattat aattagaaat atctataaat 240 atttactaat taaatttttc tttattctat ttatatagaa cataaaagaa acaaaccaac 300 taaaagatgt cccaattaaa cacatccatt ccaaccacta acgacgaaac taactcgggt 360 caaatggctg acatttctcc aactgaaaac caagcggaaa ccaatgtttc tgacaacact 420 atgtccataa caaacccact caatgcaaca agtatgacca ataccaatgc aattgaacaa 480 actccactaa gcgcgtccag cgccggagat atccaaagtg cgaccagcac caatgtaacc 540 aacagtggtt ctccaagaga tacccaaagt gcgaccagca ccaatacaac caataatggt 600 tctccaagtt taaaccaaag tgcgaccagc accaatgcaa ccaatggtgg ttctccaaga 660 acaactgtca ccaacggtaa tctaagtact acgagtacca ttaacaatgt tccaaacttg 720 ccaagtattt caagtaccaa caagaatgct actagcattt ccaatcacgg aacaagtccc 780 aaccgcaatg tgagcaccaa ctcccatgat aaaggacatc aataccaccg atccgaatca 840 gaagattcta attccgaatc cgaagacgac tatgatgatg ataacaatca tcgtaggaaa 900 agatcgaaga aggagattgg tagaatagct ttccaagatc catctgatcc agaatcatcc 960 gactcggaca cctctgatca gtcggatgat gaaactgtca aaatgccaat tgataagatt 1020 aaatatacac gtaacaccac tccaaagtct gcaagatcca atagacaccg aactgtggta 1080 gagttctcta gtcaagcttt gagagatgca agagcaagaa tcagagctgt ctccttcaac 1140 gacatgctgt tggcatcaaa caaagacttg attgaaagag ctgcccaatt cttggtatcg 1200 aacccactac ttggcgacga aggagcattc gaggcagcag tcaccgaaag agttgatcta 1260 ctcggaatcc aatctctggt ctctagtcat ttgagagacc aagtcaaaaa ctatactgcc 1320 aagattccat tattcgttcc agatatgagc gacacccaaa cattcatatc ccaagtggag 1380 aacgtagtac cagaagatag gttacgatgc gtcattgctt taagcaaact ctcagaggaa 1440 gccatcactt cggtttcatc agcagaagat gcagaaatcc taaaacgtac aatgaaatgg 1500 aaggaatttg cctctattct aaccaatacc ttcggtatca aacgagaatc cgatgaatac 1560 gttaccgatc ttaagaattt gagacttgaa caattcgaaa ccactcataa gttcattgtc 1620 aaattccgag agatctatag taaatcgcgt ctttctcaaa ctgacatcgc cacacagctt 1680 tacagtttgc ttcctagcaa catcaaagca gatccagaat taaccaaggc acatcataag 1740 aagagcatca atgctttgac caaaagatta aatgatttag gtcgaaagca actcgacgat 1800 aaacttcggt taaaaaggga ggagaaagtg atgcatttca ccactcgaaa tcatcaaccg 1860 gtagaatacc atacatctaa tgtacaacca aatcttctcc agaaatggtc accaaacaat 1920 accaataaaa gatccgcacc accaagtgtc ggacaaaata aaaccaaatt caccgaaaag 1980 aagtccaaga ttggtacttg ctctatctgt aaaaagggca accatcttcc aaacaattgt 2040 ttctacggga agtttggaaa attccgagat atcgatccag ccactggcca aagaaagcaa 2100 gctaatgcat tcttatgttc gatgcctgga actataaata acatgtatgt taatattttt 2160 attgacagtt gtagtaacat taacttactc tcatttgatc tattacagga aagtgacatc 2220 aagaatctaa cagatgtgaa cattaccttg aatggaatca acaaagacag ttctacatta 2280 tcaaaacaaa taaataaaaa tataacaatc aataataaag tatttaaaat accatttttt 2340 ataatcaatt ctaaaacaaa ctttattctt ttgggaacag agacgttaca aaatttaaat 2400 ttgaaaattg attttaaaaa taaaattata tctttaccta acgttgataa agaaaacaat 2460 attaaagaac tagctatcaa attcaacact ctaaacgatt ctattcaaac taatttaatt 2520 gaaactagaa attgtttaat ttcgcaaagc gagagtgata atgatgaatg ttgtgcttta 2580 gtatatttaa atgatattag tgtagatgag aatacaaata aatcaaacac acacgaaaag 2640 ttggatgaaa gtaaaaattc taaccacact tttgaatcaa atctttcaaa gaaaataaaa 2700 aatctcattt taaataaata ccctgatgtg gttaaacctt ttgacaaaga attaccccca 2760 tctcgcggag attttgattt taaaataaat ttgacggtgg accaaccccc aaaaccgaaa 2820 aattatccaa tcccacattc atttttagat gaattggaaa aacagataaa cacttatttg 2880 gaaaatggtc aaatcacccc atctcaatcc gcatttgcgg cacccctttt atttattaaa 2940 aagaaagatg gaggttggag actctgtgta gactacagat ctctaaatgg tataactatc 3000 aaagatacat acccattacc aaacatcact gaagttctta acaatacaag agatggtgtt 3060 ctcttctcta aaattgatct actccaaggt taccaccaga ttagagttca tgagaatgat 3120 caatctaaga cagcatttcg aacttcattc ggtgtattcc aatatactgt attaccattt 3180 ggactcacaa acgcaccagc gtgttttcag agactcatgg acagtatatt ccaaagacat 3240 gtaattgcca aaaaattact tgtttacctc gatgatttgt taatcaaaac aaacattgat 3300 gatgaagaca aacacattca agatgtttta gaaattgttg atttattaaa tcaaaacaaa 3360 ttaaaaataa aactaactaa atgtattttt ggtcaatact cacttgaata tctcggccat 3420 attattggtc ataataaatt gattcctata aatgataaaa tcttagcaat caaaaattgg 3480 aaacaaccta tcacaaagag agaattgaga ggattcttag gtttgacaaa ctactaccga 3540 aagtttatcc ccaaactctc tgaaatagaa gctccattaa ttgatataac tagaaagaat 3600 aaattattta aatgggaaga catacataca gagacattca atttgattaa aaaccaaata 3660 agcgatagta gttttctttt tatacctgat tataagttaa catttcacat cgattgtgac 3720 gcctccaatg atggaatagg tcatgtcata tatcaataca aagacaatat tgaacaagaa 3780 gataacaaac aaattgttct ttatggatcc aagaagttta acacaaccga aagggattac 3840 catgtctttg aacaagaggt tatggcaatc aaacatgctc tcgaatctaa ctaccatatg 3900 ctattaggtt ataagatagt tattcatacc gatcaccaaa atatactttt tataaataat 3960 aaacttaatg acaatacaaa acctaaatta attaggtggt tacaatacat tttttctttt 4020 aacccaactc ttatttataa aaaaggatct gataatgtaa ttgctgatgg tttaagtcgt 4080 tacacttact ccaccaccat ttccattgat gataatgatc taattgcaat gattgaaaat 4140 ggatatattg aagaagaaac gcaaataaag aataaaacaa tacatccttc aaaatattac 4200 aaatctaaat ctgtgaaaac agaaggtaaa cttaaatatt ttgaagataa aatagtaatt 4260 cctcaagttc gctctattat taaacgtatt cttttcatat accatgattc attacttgca 4320 ggtcatcatg gaatagaaca aactttagaa ttaataagta gacatttcat ttgggaagga 4380 atagccaaag atgtcactaa ttatgtcaaa gcttgtcata cttgtaacaa agcaactgat 4440 gctcgtggac aatcaattgg taaacttcaa ccacttattg ttccaactaa atgttttgag 4500 agcatcagta tggattttat tccattaatt gatacagaat acaaaggtat taaatataat 4560 aaagtctggg taattgttga tcgtctatca aagatggtac gactcatacc agttcattca 4620 acttacactt ctaaagattt agctgaaatc tttatgaaag aaatattcaa acatcatggc 4680 atgcctgttg aaattgtctc tgatagagat tcaaagttta ctagtaaatt ttggaaagat 4740 ctgatggata tattaaattg tgaaatcaga actaccacta cagaaaatca acaggccaat 4800 ggacaagtgg aaagtataat cagatactta ggtaacttgt ttcgtaaagc tatcattgca 4860 acttacaatg aagaaaacca tgatgaatgg ataactttag ttgatacaat tgaattttgt 4920 gtaaacaatt ccatccatca cggatgcgaa tactctcctt ttaaaatcta tacaggtgaa 4980 aatcctataa caccactctc tcttattcaa tacaaattga aaatgaactc atcaaacgtt 5040 gatatcgaaa acatgatcaa ctcaaagaga aatcaactca aaattgttag aaattggtta 5100 ttcgataatc aatataacat gagaattcaa tataataaaa ataagaaaga aatggatatt 5160 aaagtaggtg atagtgttta tctaagaaag tttagatcaa accctaaaaa caaaatcaaa 5220 ttagaaacaa gatttgatgg tccatacgaa gtaacgaatg tcaatggatt aaatgtcaca 5280 gttgccaacg ttgaatctgg tacaaagaga aagagaatca atccatcttt cactagacac 5340 gctaaatact tcaagaaaca caactcttta gacgatgagg atgatgatga tattgaattt 5400 ggcgaagttg aagaagaaga tgaaattgaa atcgattcaa ttactcaaaa tgaacaaaac 5460 aatcagaata acaacaacga cgacacagat gacactgaag ccatggacat tgattcagaa 5520 ttagatttgg taacaaacaa caattcaact acaaacaaca acccacctat caatacacca 5580 gattcaaata ctttaatttt ggataatatt ccaaactcaa ctactactca tccaatcatt 5640 cataatacat tgacattcca caatgaaaga ggattttcag aatcttctga aatcactaat 5700 ttccagagaa tatctgccaa agaaatcaaa gaagcattcg aaaagaagaa taagttaact 5760 gtccttgaag aatacatcca ataccacaat tcccatccga accatccacc aaccaaacaa 5820 agtgccaaaa tactcatagc cttgatcaac gccaatatca ttcgtaaaga tacaatgttc 5880 gtcacaagaa agagaacctt caatctcacc aacaagagag atgctaagac cgaatacctt 5940 ttcaatatag gaggagtcca aggatggata ctggataacg atgttcacca atcatataag 6000 tctttaaccg gagactatag aaattggttg aaaactaatt caaactaaag acag 6054 // ID hATm-3_HM repbase; DNA; INV; 3831 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3831 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 207-207 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(817..1335,1392..3263) FT /product="hATm-3_HM_1p" FT /translation="MDVLQHIVYRRSLLPHNVKKSAILSCPPYKIQGYKRC FT EVDDCECILGLIKRPWLKAGFQVMSDKSIIKNLMKMDKKYSDLSRNKCRGT FT ARDKVNQGEFISFLKKLFWIGSTDLKNQIRKDKKRSDKDKHDDLTFLEDQE FT GPRYFSLGSEDRKFSKKVLCCFYWVNLTFFIQICVSEGVRKRTRLEDESPK FT IDATSELDLSFTDTSADSDSEFEPEVWHHRRADDCNITAKVPKNIFDGNVS FT IIATANDISPNVLQKVTGAILQDCGVDIAKCKASASTALRKMKKTAKDTAE FT IAKLDIKKAVEKSRYPCIIHFDGKTLFEINQGKRLKNERLAVLVNIEGVSH FT LLGVPALPSSSGENMYIGIMKILEEYDLISKVCGVCFDTTSSNTGSKKGSL FT TRIAREVDKYLLLLACRHHIIELRMVHFCEAVIKENSVGPENPLFVKFKHM FT FENPNFKYDENNLTSLDWKTVEGTVLKEAARKTLDYCETYITKKCNMRNDR FT RELAELTMQYLSPSAHFKLKKTGAVHHARFLGKSLYYLKLQLLCKQLTFVQ FT ENDNLREDLKLICEFIVCFYTRWYLQAHKAIQSPASDLDAIFQMKEYKKVC FT SNPEAVDAVLASLYKHTWYLDSTIIPLALLDKNISMDKKTAIADALLSFQM FT PDPDFFKHRSKDRIDEINIINNMKVPTLSLLVNEFSYLIFSMIGLDNQRVR FT DWLSLPPQYWHTQSSFKNFENFAKMLIVVNDHSERAIGMMQQFVQRFENED FT DKQNRLLTVDKVRSTFKVFGVGKSNNTKKKLSESLMSLSKLKKRKL" XX SQ Sequence 3831 BP; 1359 A; 548 C; 678 G; 1246 T; 0 other; taggggtagt catttttagg gaatttttga aatcgaagta catacccagc taaagttgga 60 gcaaatatgc taaaactttg tcaaaatttt ttttaggtca atcgggtgac ttttaagtcc 120 ccctcaagac taaagttgtt aagaaattgt acttcaaaaa aggaaaaata gggtgtgttc 180 agggtgtttg agggggattt aaatgaaaaa attatttcca tttttttttt ttttttttgt 240 tatttgatat ctatattttc attttaaaag ggtatgtcat catgtttttt acatattctt 300 ggttaaaacg atatttacga aaggaaaaaa taaaaatttt atgttacgca agaggttatg 360 aaaatcggca acgataaaaa aattcttata aaataatgca gaagaaaatg tgctgaagat 420 cctgcactgt atttttaagt cactaagagt ttttataaag aaattgtacg tgaaattgcg 480 ttacattatt ttatttttgt aaataacttt cgtatttact tgattgttat taattgaatt 540 acaatataaa gtaaaattgt attgaagttt ttaattaatt acagaccatt aaatatattt 600 agattgaata caaaaatgga aaaaaaaatt gtaaaagata ttgggaagag gaagtttctc 660 agatcaagca aaaacaaact ttttatcata aatgaggcta aaccagctat tacatgtaca 720 tatctaattt aaaaaaattg ttttgaattt taaatttttt gttaaaaatt aaaatattga 780 tttttttttt cttaaggtct tcaactccct actggaatgg atgttcttca acacattgtc 840 tacagaagat cactacttcc tcataatgtt aagaagtcag caattttatc atgccctcct 900 tacaagattc agggttacaa gagatgtgaa gtagatgatt gtgaatgcat tttaggctta 960 attaaaagac catggcttaa agcgggattt caagtaatgt cagataaaag tattatcaag 1020 aaccttatga agatggacaa gaagtattca gatttgagta gaaataaatg cagaggaaca 1080 gctagagaca aggttaatca aggggagttt atttcatttt tgaaaaagct gttttggatt 1140 ggttctacag atttaaagaa ccaaattagg aaggataaaa aacgctcaga taaagataag 1200 cacgatgatt taactttttt ggaagaccag gaaggaccga gatattttag tttgggatca 1260 gaagatagaa aatttagcaa aaaggttttg tgttgttttt attgggttaa tttgacgttt 1320 tttattcaaa tttgctaatt taacatacat tatttgttaa tacacataca ctatgttaaa 1380 acaaatttta ggtttctgaa ggtgtgagga agaggacaag gctagaagac gaatctccta 1440 agattgatgc tacctcagaa ttagacttga gtttcactga cactagcgct gattctgatt 1500 cagaatttga acccgaagtt tggcatcata gaagagcaga tgattgcaac attactgcca 1560 aagtccctaa gaatattttt gatggaaatg tatctatcat agctacagca aatgatatct 1620 cccccaatgt cttacaaaaa gtaactggag caattcttca agattgtgga gttgatatcg 1680 caaaatgcaa agccagtgct tcaacagcat taaggaagat gaaaaaaact gccaaggata 1740 ctgcagaaat tgctaaatta gatataaaga aggccgtgga aaaaagccgt tatccctgta 1800 taattcattt tgacggaaag actttgtttg aaataaatca gggaaaaaga ctgaagaatg 1860 aaagattagc tgtgctcgtt aatattgagg gggtgtcgca tcttcttgga gtacctgcct 1920 tgccgtcttc ttcaggtgaa aatatgtata taggaatcat gaaaatacta gaagaatatg 1980 acctcatttc aaaagtatgc ggagtatgct ttgatacaac ttcaagtaat actggttcta 2040 agaaaggatc acttactagg attgcaagag aggttgacaa ataccttctt ctactagcat 2100 gtaggcatca tatcattgag cttagaatgg ttcatttctg tgaagcagtg ataaaagaga 2160 atagcgtagg gcctgaaaat cctttattcg taaagtttaa acatatgttt gagaacccta 2220 acttcaaata cgacgaaaat aacctgacct ctcttgattg gaaaaccgtg gaaggaacag 2280 ttttgaagga agctgcaagg aaaactttag attactgtga gacctatatt actaaaaaat 2340 gcaatatgag gaatgaccga agagagttgg ctgagctaac catgcagtat ttatcccctt 2400 ctgcacattt caaattaaag aaaactggag ctgttcatca tgctaggttt ttaggcaaga 2460 gtttatacta ccttaaactt cagcttttgt gcaagcaact gacatttgtg caagaaaatg 2520 ataatctacg cgaagattta aaactcatct gcgagttcat agtatgtttt tacacaaggt 2580 ggtaccttca agcacataaa gcaattcaat ctcctgcgtc tgatcttgat gcaatctttc 2640 agatgaaaga gtataagaaa gtttgctcca atccggaggc agtagacgca gtgttagcat 2700 ctctttataa gcatacatgg tatcttgatt caacgattat tcccttggct cttttggata 2760 aaaatatttc tatggataaa aaaacagcca ttgcagatgc tttgctttcc ttccaaatgc 2820 cagatccaga tttttttaaa catagaagca aagatcgaat agacgagatt aatatcataa 2880 ataatatgaa ggtaccaacc ctctctcttt tggttaatga gttttcatat ttaatttttt 2940 caatgatagg gttggataat cagagagtga gagactggtt gtcacttcca ccccaatatt 3000 ggcacaccca gtcctcgttt aagaattttg aaaactttgc caaaatgtta attgttgtca 3060 atgatcattc tgaacgagca atcggaatga tgcagcagtt tgtacaaaga ttcgaaaatg 3120 aggatgacaa gcagaacaga cttttaactg tcgacaaagt aaggtcaact ttcaaggttt 3180 ttggtgtggg aaaaagtaat aatacaaaaa aaaaactatc agaaagcctg atgtcactct 3240 ctaaattaaa gaaaaggaaa ttgtagactt aggaaattgt tattcatttt aaaatactta 3300 tttcaattaa tgcataattt ttgtcttgat agtatttgta aagattttaa ttaaatggta 3360 aattacttgt ttagttcaaa atagtttgtt tttttggttg atttaagcgg gaatttttac 3420 gtctgtccat ctgaatgaac tttaagtatg aaaatacata atttactgat ttcataacca 3480 cttgcgtaaa ttaaactttt cattttttga tttcttaaat atcgttttaa ccaagaatat 3540 gtaaaaaaca tgatgacata cccttttaaa atgaaaatat agatatcaaa taacaaaaaa 3600 aaaaaaaaaa atggaaataa ttttttcatt taaatccccc tcaaacaccc tgaacacacc 3660 ctatttttcc ttttttgaag tacaatttct taacaacttt agtcttgagg gggacttaaa 3720 agtcacccga ttgacctaaa aaaaattttg acaaagtttt agcatatttg ctccaacttt 3780 agctgggtat gtacttcgat ttcaaaaatt ccctaaaaat gactacccct a 3831 // ID Copia-12_CQ-LTR repbase; DNA; INV; 125 BP. XX AC AAWU01014235; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_CQ_; KW Copia-12_CQ-I; Copia-12_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-125 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 340-340 (2011). XX DR GenBank; AAWU01014235; Positions 28458 28334. XX SQ Sequence 125 BP; 41 A; 29 C; 21 G; 34 T; 0 other; tggagataat cgagtaggcc ggtgctaacg aagaagaggc tttcaagtcc atcaataatt 60 tatgtcatta cctgaataaa ctccaaaccc tacagacgtg tttctttcac actcttcaat 120 aacca 125 // ID hATm-13_HM repbase; DNA; INV; 3687 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3687 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 217-217 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(517..816,894..2147,2582..3088) FT /product="hATm-13_HM_1p" FT /translation="MPPKKKLKLTRNKAEVFLVKKVRENLSCYKLPTGIEV FT LQYYLYLRNLNPKVKKTVLIRCHFEINSFELRCPPQADCCVMSKLVRPWNL FT AGFPIITLENIRRKKIDKMVNNYYKLKKNSKRNSPTDTKRNEAFFEEMLKC FT FWISKKKEELIEDIIKDRRRTSEAKNEDIVFILDQIGNRIGRIGTNDKRFT FT YTIKRSHKKLCSMLNMEIKVNNGNNLYSLENSSSSDNNSDCEYVADIKKEQ FT STSVVLLPKNILKKTALTAIGEGLSVRQHTAIVASVIANSGGDVTNFAISS FT STAFRVAEEVVTKEGEKIKKHVTDLVKSSSLPIILHFDGKIVKEFTSGIEH FT QLDRLAVSIRIDGQSELLGVPHLSSSTGETQKNTIIKLLHQFDIFDKLQGC FT VFDTTSVNTGNKKGVCIRLAAELDRPLLLLACRHHIYERHIIHCWNIYPSS FT KINGPDNPLFKKLKDVWNSIDQNSIVLNKVKIEDISDSWLQVQFKEALSFS FT QNIIRMKTMKKVFKILNKVNTLDMKALWQMMRYSKINPVQAKPVIESLKRH FT TWYIDPNILVMSLVDKNVQERSDIAKKLYSTPRPTTYAIERHQVNLNILNS FT LMFNDDTPPSLTPLITDQSWLIFHILNHANAEVQWLKTPSETWDLNDYYLE FT FKQFVNNLEVVNDCSERAIKLVQVTILYVWSLMFMLKS" XX SQ Sequence 3687 BP; 1349 A; 518 C; 584 G; 1235 T; 1 other; aatttcaact aacattgctt ttcataaggt gccccaacct tgatattact aataattttt 60 ttttttttca tgattagatc actccatgac ttccctcaat gggtccaaag gtcatttttt 120 gcccgttttt tgcccatatt tcggactaat tgaggggagt gttaccatta tccaaattaa 180 tttttctttc tttttttttg ttattttatc tttttgaaga aaataaaaat ataatgttaa 240 gcttaatttg tttttttatt tgaacaatga agatcaaaaa attcaaaaac attgatttat 300 tttactatat ttcactacct tttactacct ttgtcctatg taccatttta atgtgttttc 360 tgttatattt taaatacgcg tataattgtt tcattttgtg aacaattata ttttaccttg 420 tgtatgcaat ttgattttta acttatttta tatttttggt taccatttat ctttttagta 480 actaattaac ttaaaaaaaa gttattttat agtaaaatgc caccaaaaaa gaagttgaag 540 cttactcgaa ataaggctga agtatttcta gtaaagaaag ttagagaaaa tctttcatgt 600 tataaacttc ctactggcat agaggttttg caatactacc tatatctaag aaacttgaat 660 cccaaagtaa agaaaacggt tttaattagg tgtcatttcg aaattaattc atttgagctc 720 agatgccctc cacaagctga ctgttgtgta atgtctaaat tggtaaggcc atggaatctt 780 gcagggttcc ctatcataac cttggaaaat atcaggtaaa ttatgaattc aaaatttatc 840 aacaaattta tttttatctt tattttaaac attgaataaa ttttgcattt taaagaaaga 900 agattgataa aatggtgaac aactattaca agttaaagaa gaatagcaaa aggaacagcc 960 caacagatac gaaaaggaat gaggcatttt ttgaagaaat gttaaaatgc ttttggatta 1020 gtaaaaaaaa agaagaacta attgaagata ttataaaaga caggagaagg acatccgaag 1080 ccaaaaatga agatattgtg tttatcttag accaaattgg aaataggata ggaagaatag 1140 gaacaaatga taaaaggttt acgtacacta ttaagagatc ccacaaaaaa ttgtgttcca 1200 tgttaaacat ggagattaaa gtaaacaatg gtaacaatct atacagttta gaaaactctt 1260 caagtagtga taataattct gactgtgaat atgtagcaga tataaaaaaa gaacaatcta 1320 cctctgttgt tttgttacca aaaaatattt taaaaaaaac agctctcaca gcaataggtg 1380 aaggactttc agttagacaa cacactgcaa tagtggcaag tgttattgct aattcaggag 1440 gtgatgttac taattttgca atatcttctt caaccgcatt tagagttgca gaagaagttg 1500 taactaaaga gggagaaaag attaaaaagc acgttactga tcttgttaaa tcttcatcat 1560 taccaattat tctccatttt gatggaaaaa tagttaagga gttcacaagc ggaatagaac 1620 atcaacttga cagacttgct gtttccatca ggatagatgg acagagtgag cttcttgggg 1680 ttcctcattt gagctctagt actggagaaa cacagaagaa tacaatcata aaacttcttc 1740 atcaatttga tatttttgat aaattacaag gatgcgtttt tgatacaact tctgtaaata 1800 ctggtaataa aaagggtgtg tgcattagat tagctgcaga gcttgataga ccactacttc 1860 tcctagcatg taggcatcac atctatgaaa ggcatattat tcactgctgg aacatttatc 1920 caagcagtaa aataaatggt ccagataatc cattgtttaa gaagcttaaa gatgtatgga 1980 actcaattga tcaaaattcc attgtgttaa acaaagtaaa aattgaagat atttctgata 2040 gctggctgca ggtacagttc aaagaagcct tatctttttc tcaaaatatt atcagaatga 2100 aaacaatgaa aaaggtattt aaaattttga ataaagtgaa tactctataa taaaataagt 2160 ttagtaaggt actgttttgt gtctaaataa ataatcttat ttaggatgga tgtaggtctg 2220 attatctaga gttagtggaa ttgacaacaa tggttttatc tggtgatgaa cagtacaggc 2280 taagaaaacc tgggcctgta caccatgcca gatttatggc aaagggaata tactttctga 2340 aaatgtatct ccttctaaac aacatttcta gcttgacaga ttttgagaag aaagaaataa 2400 atgatatggc attcttcaca gctgtatttt acactgagtg gtttataaag gctgaaattc 2460 cggctgtcgc tccatatcag gtgacatttc tatgttttaa aagtttttaa tcattttgct 2520 ttttattact tttttttctt tgaaaatttt gtgtaatatt tcatagtcta aaaatcttta 2580 ggatatgaaa gctctttggc agatgatgag atactcaaaa atcaatcctg tccaggcaaa 2640 acctgttata gagtctctaa agagacacac ctggtacata gacccaaata ttttagttat 2700 gtctcttgtt gacaaaaatg tacaagaacg gagtgacatt gccaagaagt tgtactcaac 2760 cccaagacct acaacttatg ccatagaaag acatcaagtt aatttaaata tactgaactc 2820 ccttatgttt aatgatgata ctcccccaag tttgaccccg cttataactg accagagttg 2880 gttaattttt catatcttaa accatgcaaa tgctgaagta cagtggctta agactccatc 2940 tgagacatgg gatctaaatg actattactt agaattcaaa caatttgtga ataatcttga 3000 agtagtaaat gactgtagtg agagggctat aaaactagtt caggtaacta ttttatatgt 3060 ttggtcctta atgtttatgt taaaatctta atttatttta caagctcatt acttttaaat 3120 attctttaat aggagttgat caataaatcg cataatgagg ataaaagaca aagtaccttc 3180 ctcgtatcaa acaaatataa gagtgaaaga acaattaaaa aaaaatgtga ttatgtaaag 3240 aattctttgt ttacaaaata aatttataaa catgaactat gtttgacaac taatttcttc 3300 agaaactaat cataaatatg attttattat gtaaatataa atattttaat acgtaaagta 3360 tcttgttaat aagcagaaca ctaatatagt gcaaaaaagc ttaaaaatgc ctttttttta 3420 attttttggt cttcattgtt taagtaaaaa aaaaaaatta agcttaacat tatattttta 3480 tttccttcaa aaatataaaa caacaaaaaa aaagaaagaa aaattaattt ggataatggt 3540 aacactcccc tcagctagtc cgaaatatgg gcaaaaaacg ggcaaaaaat gacctttggg 3600 ctcattgagg gaagtcatgg agtgatctaa tcatgaaaaa aaaatattar taatagcacc 3660 ttatgaaaag caatgttagt tgaaatt 3687 // ID Gypsy-4_DWil-I repbase; DNA; INV; 5204 BP. XX AC scaffold_181026; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_DWil_; KW Gypsy-4_DWil-LTR; Gypsy-4_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5204 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181026; Positions 158411 153208. XX CC Positions [2727-3314] - Reverse transcriptase CC Positions [4330-4806] - Integrase core CC 'TATATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1206..3059 FT /product="Gypsy-4_DWil-I_2p" FT /translation="MLYNEITELTVAGLKEKLKELNLSVKGNKPELRERLI FT SYFDLDTNEESLYSEANTSIQGEVEEVEREVTMAFTLKDIRDSLTEFDGSN FT KKDVLEWLSDFEGTAVTVQWNDLQKFIYGRQLLTGAAKLFVNSQEGLGRGR FT LKSALQDEFTVKLSAKEVHKQLESRKKKYNESLVEYFYLMKSIAKRGSLDE FT ESIVEYIIEGIPDSKMNKSVYYQARDLKELREKLTMYEKYASNQESKTKPR FT FEKLQSRNTLEDFKCFKCGGKNHVAKNCKDNFKCFKCNQMGHKANQCEARN FT TKETKKDNTNYQQMTLKFNRSFKNVPIGNGVLSALFDSGSDLSTISESTYK FT EINPVNLNRECKEVIGVGGKKIRTLGFFNIKTELDGNAVEIDFHVVRDKDT FT LYKGVIGNDVLENLNVTIGRKRIVFHKEDEIPKPISREVQTKTNISELQTT FT FERIMTMTLDEDAPLEIDLIHLDEGIKSDILEMLKSYKPVKPTISPVEMKI FT VVTDDIPISQRPRRLPYQDQVIVNQQIKEWMEPGIVRQSFSEYSSPVVLVS FT KKDGKKRLCCDYRKLNQKIIKDNFPTALIDDVLHKLQRGKVFTTLDLCNGF FT FHVPVDESSRKFTSFVPVRV" FT CDS 3352..5202 FT /product="Gypsy-4_DWil-I_1p" FT /translation="MAVESYPIPTDKKGLQRFLGLTSYFRRFVKDFATIAR FT PLTNMMKKDVPFMMNEEALASVRQLKVCLSNPPVLRLFDPMGITEVHCDAS FT MYGYGAMLLQFNSEDQNFYPVEYMSRKTTPAEEKYHSYELEVLAIIQALSK FT WRVYVLGKKIKIVTDCNAYAMTIKKKKVPLRVARWAIFLQDFDFEIEHRPG FT TKMKHVDALSRIHCLLLENSLRHKIQQAQLLDEWINAVRKVLENGTYDDFY FT LQYDILHKDLIKELIVIPSAMEREIITMAHRQGHFGIKKTIDVLGKDYYIS FT NVASKFEVVIKSCVECIVSEQKHGRKEGFLNVIDKGDEPLTTFHLDHIGPM FT ESTKKQYNHVLVIVDAFSKFVWLYLTKSTGSEEVVDRLQRQSEVFGNPKRF FT ITDRGTAFTSNMFKEYCTSQKIQHLLIATGVPRGNGQVERLNRTVSTLLTK FT LCAEEPKSWYKNVGRVQQFINSSPPRSTKISPFKILTGIEMRTTYDQELKS FT QLEEELLIELQREKDEIRRLAKDNIAKIQEENRKSYNKFRKPETEYKVGDM FT VAIKRTQFGAGLKLKKNYLGPYKIVRKLRHGRYSVEKIGDGEGPNRTNTVA FT EYLKMWNPSFGTNEAVEWP" XX SQ Sequence 5204 BP; 1951 A; 802 C; 1153 G; 1298 T; 0 other; tttttgaggg ctcgtccggg atagtaacaa agagcgagtg agagagaaaa ctttatgaaa 60 aaaccgcgcg taattgcaaa cgtgacgcca acataaggaa acatgagagc aagcgagaca 120 aaataaaatg cgtgctttaa tttcaccacc gaaagacaag aaaagagacg aacacatgag 180 aaataacaac aaagacgaaa agatgagtaa cgcttaaaag acgacaattt tgttcagaga 240 ggaacaaaat gaggaaaagc aagaacaaag acgaaaatat ttgaaagacg aacaacttcc 300 caaagacgaa aatttgttca gagacgaaca aaatacgaag agcatcaaaa gacgaaaacg 360 acgaccacat aagacgaaaa caggagaaaa attgaacttg cttggattcg gcgctatcca 420 attattcatg caagccaaga agtaacctgg agcagagatc ggactaggag tcacacggag 480 agtggctgct aacacgagga gtaccagatc aaaacgagac aaaccgattc accactacat 540 gcacagtcag aaggcatcat ccatccaaca ttgtgtgtgt gtgagtgaga agaagctaag 600 acgtaaagaa gacgacacgt aagaagacaa ctgcgtaagg aagactacac gtaagaagac 660 aactgcgtaa agaagatcac gagaaacgga caacatagac gtaaaataac cagggctaca 720 tcgcaagtac attttttttt aatactaaca atatttaaaa gttgtgaaat aagagaacaa 780 tgaattacaa tttttgaaaa taagaaaaga agatttaaaa ttttggaaaa agaaaagatg 840 atttaaaagt tgtgaaataa gagaacaatg aacaattttt gaaaataaga aaagaagatt 900 taaaattttg aaaaaaagaa aagatgattt aaaagttgtg gaataagaga acaatgaatt 960 acaattttta aaaataagaa aagaagattt aaaattttga aaaataagaa ggaagattta 1020 aaattttgaa aaataagaaa gaagatttaa aattttcgaa ttagagaaag agaaataaga 1080 aaagaccaac tacgagtttt ggaaaataag taagaacaaa gtggaattag aatatttgta 1140 atcatataaa ctaacatcta atttaaacac taatttaaat tattttgtaa tcaatatttt 1200 tacacatgtt gtataacgag atcaccgaac tgaccgtcgc agggttaaaa gaaaagttaa 1260 aagagctgaa tttaagtgta aaaggcaaca aacctgaatt aagagagaga ttaatttcat 1320 attttgattt agacacgaac gaggaatcct tatattcaga agctaatacc agtattcaag 1380 gcgaagttga agaagttgag agagaagtga caatggcgtt cacattaaaa gatattcgtg 1440 attcgttgac agagtttgat gggagcaaca aaaaggatgt attagaatgg ctttcagact 1500 ttgagggtac tgctgtaaca gtacaatgga atgatcttca gaaatttatc tatggtagac 1560 agctgttgac aggagcggct aagttgttcg taaatagtca agagggtcta ggtaggggta 1620 gacttaagag cgcgctacag gatgagttta ctgttaaatt gtcggcaaaa gaagttcaca 1680 agcaattaga gagtagaaag aagaaataca acgagagtct ggttgagtac ttttatctaa 1740 tgaagagtat agccaaaaga ggcagcttag atgaggaaag cattgtcgaa tacatcattg 1800 agggtatccc cgactctaaa atgaataaaa gcgtttatta ccaagcgaga gatttgaagg 1860 aattacgaga gaaactaacg atgtacgaga aatatgcttc caaccaagag tctaaaacaa 1920 aaccacgatt tgagaagctg caatcgagga atactttgga ggatttcaaa tgctttaaat 1980 gtggaggaaa gaatcacgtc gctaagaatt gtaaagacaa tttcaaatgt tttaaatgta 2040 atcagatggg ccacaaagct aatcagtgtg aggcaaggaa cacaaaagaa actaagaagg 2100 ataatacaaa ttatcaacaa atgacactta agtttaatcg ttcttttaag aacgttccaa 2160 taggcaatgg agtattatct gctctgtttg attcgggcag tgatctttcc accattagcg 2220 agagcactta taaggaaata aacccggtta acttaaaccg tgaatgcaag gaggtaattg 2280 gcgtaggagg gaaaaagatt cgtacactgg ggtttttcaa tataaaaaca gagttagatg 2340 gcaacgctgt agaaatagac tttcatgttg ttcgagataa agacacattg tataaaggag 2400 tgatcggcaa tgatgttctg gagaacttga atgttactat tggaagaaaa cgaatcgtgt 2460 tccacaaaga agacgaaata ccaaagccaa taagccgaga agtgcaaact aagacaaaca 2520 tttctgaatt gcagacaacg tttgagagaa ttatgacaat gaccttggat gaagacgctc 2580 cactggaaat tgatctcatc catttagacg aaggtataaa atcagatata ttagaaatgt 2640 tgaagtcata taaaccggta aagcccacta tttcgccagt ggaaatgaag attgtggtaa 2700 cagatgatat tccaatttct caaaggccaa gacgtttacc atatcaggat caagtgatag 2760 tgaaccagca gataaaggaa tggatggagc ctggaatcgt tcgtcaaagc ttttccgagt 2820 attcatctcc tgtggtgcta gtttcaaaaa aagatggcaa gaaaagatta tgctgtgatt 2880 acaggaagct gaatcagaaa ataataaaag acaattttcc gacagcatta attgacgacg 2940 ttttgcataa actgcagaga ggaaaggtgt ttacaaccct cgatttatgc aatggattct 3000 tccacgttcc cgtagacgaa agctcacgga agtttacatc gtttgttcca gttcgagttt 3060 aatttcgtgc catttgggat aaccaattcg ccagctgtat tcatgagata catatttgca 3120 gtcctaaggc cacttatcga tgaagggatt gttattttgt atatggatga cattatcata 3180 ccatcaggtg atgaggaaga tggattaaat aagctgaaat gagtattaaa tctagccgag 3240 acttcaggac tgaaaattaa atgggaaaaa tgtcaatttt tacaaaggcg agtcaatttt 3300 ctaggataca taattgaaaa ttcgactatt aaaccgtcca gagaaaagac tatggctgtg 3360 gaaagttacc caatcccaac agacaaaaag ggcttacaaa ggtttttagg tcttacttcg 3420 tattttcgcc gattcgtgaa agactttgca acaatcgcta ggccgttgac gaatatgatg 3480 aaaaaagatg taccattcat gatgaatgaa gaagcacttg catccgttag acagttaaaa 3540 gtatgtttgt caaatccgcc agtattacgt ttgtttgatc cgatgggtat taccgaagtt 3600 cactgtgacg ctagtatgta tggttatggc gctatgttac tacaatttaa ttctgaagac 3660 cagaattttt atcctgtgga gtacatgagt cgtaagacta caccagcaga ggagaagtat 3720 cactcatatg agttggaagt actcgccata attcaggctt tgtcaaagtg gagggtttac 3780 gtacttggga agaagataaa aatcgttaca gattgtaatg catacgcaat gacgattaaa 3840 aaaaaaaaag tgcctttaag agtggctcga tgggcgatat ttctacaaga tttcgacttc 3900 gagatcgaac atcgacctgg aactaaaatg aaacacgtgg atgcgttaag cagaattcat 3960 tgcttattgc tcgaaaattc tctgagacac aagatacaac aggctcagct tttggatgaa 4020 tggattaatg cggttagaaa ggtattggaa aacgggacat atgacgactt ttatttgcag 4080 tacgatattt tacacaaaga cctaattaaa gaacttatag ttataccgtc ggcaatggaa 4140 agagaaatca taactatggc gcaccgccag gggcattttg gaattaagaa gaccatagat 4200 gtcctaggaa aagattatta tatatcaaac gttgccagca aatttgaggt ggttatcaaa 4260 tcgtgtgtgg agtgcattgt gagcgagcag aagcatggaa ggaaggaagg atttttaaat 4320 gtcatcgaca aaggtgatga gccattgaca acgtttcacc tagatcacat aggccccatg 4380 gaatcgacaa aaaagcagta taaccatgtt ttggtcatag tagatgcttt ctccaagttc 4440 gtctggctat atctcactaa gagcacagga tctgaagaag ttgttgatcg tctgcagaga 4500 caatcggaag tattcgggaa cccgaaacgt tttataactg acagaggaac tgctttcact 4560 tccaacatgt ttaaagaata ctgtacctcc caaaagattc agcatttact tattgctaca 4620 ggagtacccc gtggaaatgg gcaagtcgag cgtcttaata ggactgtttc aacgctattg 4680 accaagttat gtgcagagga acccaagtcc tggtataaaa acgttggcag ggtacaacag 4740 ttcatcaact cttcgccacc acggagtact aagatctcgc cttttaaaat tttgactggt 4800 attgaaatga gaacgacata tgatcaagag ttaaagtccc agttagagga agaattactt 4860 atagagttgc aacgggagaa agatgaaatt cgtagattag caaaagataa tatagcaaaa 4920 atacaggaag aaaatcgtaa gtcatataac aagttcagaa agcccgaaac agaatacaaa 4980 gtgggcgata tggttgctat aaagcgtacc cagtttggag cagggctgaa gcttaagaag 5040 aattatctag ggccgtataa aattgttagg aagttgagac atggtagata ctcagtggag 5100 aagattgggg atggtgaggg cccaaatagg accaataccg tagcggagta cctgaagatg 5160 tggaatccat cattcgggac gaatgaggct gtagaatggc cgaa 5204 // ID BEL-167_AA-LTR repbase; DNA; INV; 532 BP. XX AC supercont1.348; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-167_AA_; KW BEL-167_AA-I; BEL-167_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-532 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.348; Positions 897658 897127. XX SQ Sequence 532 BP; 185 A; 89 C; 98 G; 160 T; 0 other; tgttggtcgc gccacgtcta ccagcaacct taatacaccc ctgactgtac caagagataa 60 cgctagccac accggtcgat cgatggagga tcgtccgtct aaggacggtc acagtagagg 120 tcataacgaa caataatgaa ttgtacatgc ataaaacaaa tgaacttcag tgaaattggt 180 tgctaaaaat ttgttgctcc tacttaaaac tagatttagt tctacataca atctaatttt 240 cgttatactg tttgcctaaa gggtaaaagt gataaactat tagcctagtg tgaattgaac 300 ttaattttct actatttaca aataggaatt gaactacgag tttgagaatt ctatgcagtt 360 tatattagtg ctttcttaaa taatctacga ggtaaaaaga gactggcaca acacaatgta 420 aaattactaa ttatgactaa atttattaag gttactacac atttattgga aaactacaaa 480 ctactgaatt gttgggtgat agcgagaagt actgaagtag gtattaggta ca 532 // ID Daraw1cons repbase; DNA; INV; 500 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Daraw1cons. XX OS Drosophila arawakana OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; cardini group; OC dunni subgroup. XX RN [1] RP 1-500 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with show less than eight percent divergence. CC Daraw1cons. XX SQ Sequence 500 BP; 124 A; 131 C; 136 G; 109 T; 0 other; tttgggtgcc gcacgagctg acgcaaaaaa acatttttgc ccgtatggat gcatgcgaat 60 cgcttctgaa tcgcaacaaa atcgacccgt ttttgaagcg gatggtgact ggcgatgaaa 120 agtgggtcac ttacgacaac gtgaagcgca aacggtcgtg gtcgaaaagc ggtgaagctg 180 cccagacggt ggccaagcct ggattgacgg ccaggaaggt tctatctgtg tgtttggtgg 240 gattggcagg gaatcatcca ctatgagctg ctcccctatg gccaaacgct taattcggac 300 ctgtactgcc aacaactgga ccgcttgaat gcagcactca tgcagaagag gccatctttg 360 atcaacagag gccgaattgt cttccatcag gacaacgccg cccaccttca cacatctttg 420 gtgacgcgcc aagaagctcc ggggaagctc ggatgggagg ttcttttgca tccaccgtaa 480 cagccccgac cttagctcca 500 // ID Polinton-1_DY repbase; DNA; INV; 14782 BP. XX AC . XX DT 15-MAR-2006 (Rel. 11.02, Created) DT 15-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Maverick; Tlr; Polinton-1_DY. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-14782 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC Termini are not included in the consensus sequence, which was CC built based on multiple alignment of several copies that are >90% CC identical to each other. It encodes a family B DNA polymerase CC (POLB-1_DY), retroviral integrase (INT-1_DY), ATPase (ATP-1_DY), CC cysteine protease (PRO-1_DY) and additional four unclassified CC proteins (PX-1_DY, PW-1_DY, PY-1_DY) conserved in Polintons from CC different species. XX FH Key Location/Qualifiers FT CDS 1642..4989 FT /product="POLB-1_DYp" FT /translation="MKHFKTAYKSVYINEDIIDELNSAIDCLLTLSSEFQE FT KDSGWAIKNFNYFETTIIKLENIPASGYIKAPQKIRARNALINVQNADVFC FT FKWCILAFIANQNHENRTFTTPQQRKYSRERMTKPQSYNININDETIVYGG FT MTLDFSGIKFPIEKIGVRHFEKNNVNFSINIYEIDEAGEKIVGPTIKTKER FT KRNHINILGIDNITQNIMHYAYITNLYTLCSSQFSKGKSGGYFCENCLQCY FT HVKSTTHNKLECGKVSSFYPDPNTTTSFKGYHKKLSPPVVIYADIEAVLEN FT YKTCLNSSASSSTTQVQKHTACAVSFYVAHKDYPNLNELWTYEGKQNNIFI FT KLYLTYIFKFYNFIGVDCIQTFCKTLKEKTLSLYYKYWVNSKKPRDDTFDE FT QLQKGNCCACEKEINASDLDKFFDQFSGEYVGPIHRNCKPKFRLSDPFFPV FT VFHNLSKYDIHLFITELEGDLSPIPCNKELYIALTQTIKINATSRYKIRYI FT DSVRFLNSSLDKLSSYMEDKDFKILSTKFQGEKFKQMRRKGVFPYDYLDSF FT EKFNDTQLPSIDSFYNSLSEENCSIDNFNFAQKVWETFNCRTIKDYLKLYL FT ESDVLILADVFENFRKICKKIYKLDPINYVTAPSISWDAMLKFTNVNLELI FT SDGDMYNFLKRAIRGGLTQCTQRISIANNKYLKNFDPKKPNNFLSYIDANN FT LYGWAMSQPLPLSGFQFLDKEEVDSFDYKNIACDGSIGYMLEVDLEYPADK FT HDLHNYLPFCPENKIPPGGKQRKLIADLTNKSEYIIHLKQLQLCLEHGLIV FT RKVHRVLSFTQSCWLKPYIDLNTQQRKLAENDFEKNFFKLMNNAVYGKTME FT NVAKRRQVALVKHYQSKQNSPGFRQRIARNDFHSVEIFGSDLAAIESTPSK FT IMYDKPIYIGVTVLELSKWLMYEFYYDFLLIKSPSTKIIYMDTDSFILSSE FT EDFYQTIKDNPQRFDTSNYKQENQFNIIQANNKEPGKMKDEHAGKVMTSFA FT GLKAKAYSCLVDGEGNDPECIKKIKGVKKSVIKRLNHVDYIECIKYKKTFY FT GSQRVIRSRSHILYTEIVNKISLNYKDDKRYISPDNCNTLAHGHYNIDNII FT SSNFTNTQ" FT CDS 6013..5093 FT /product="INT-1_DYp" FT /translation="MSKQQVVNEIHKPARKNFKRRKFVQKGINDTWQIDLV FT EMIPFSKENKGYKYLLTVIDTFSKYAFAEAVKTKSAPDVVQAMETVFRTSK FT CRPKNIQSDQGKEFFNSAFRELMTKFHINHYHTYTHLKASICERFNRTLKN FT RMWKMFSMKGDYKWIHKIKFLVDQYNNSYHRTIKMKPVDVGVKNEEFILKS FT VYNIPKIFPKHKFSVGDFVRISKYKGVFMKGYEPNWTTEVFKVLKVKSTRP FT VTYEIEDYMSNPIQGSFYEEELQKVKDKNAFLIEKIIKRKKNQVLVKWSGF FT DSSHNSWISVNDMLK" FT CDS 9296..9841 FT /product="PX-1_DYp" FT /translation="MNYVNDFFITLLSNDSLNVFPDNVKCAFTNIVNIPED FT MNDDWEVGITDMYLNNYISKTTAKETNLSEEAEFVSLVNAETFRYKNVAVE FT SKISECGLIFIYTDIIKPRSVGSMRPRCLRVMHYNGKEKNIHFKNIEYYPL FT DIWSPNEISILITDSSGYKVPFETSAVPIFVTLHFRKRRRSNL" FT CDS 9891..10357 FT /product="PW-1_DYp" FT /translation="MVKSYQDYYVNQCGAGLSNIGTLYVTPRIIQQGRGIG FT NFFSGLFNYIKPLFISGLNAIKNQAAETGKAVINEFGRKPLRNIIQEQSKI FT ALQNLSNKAINKMTRFQNGSGKRNKKRGGRKKRNHLTSKRQRKHTQPSHKX FT KKSSRKLNITDIFSNTX" FT CDS 7565..8281 FT /product="ATP-1_DYp" FT /translation="MFRQFIHPYTMLICGPSGSGKTTFLENLINKKNDLFN FT VSIDRIIWCYGEESAKPNFNNIEYFKGVPETIENETNEPILLILDDLMMGA FT FNKNVCELFTKGSHHRNLSVIVVTQNIFHKSSHTRDISLNTKYIIAFKNPR FT DKLQFQCLARQIYPENPMELFRIYKEVTEQPHGYLLIDLTQGINDLLRFRS FT DIFNSEYTVCYSDEYGLQKEPKSLESEQTYIVCSQKGQQQTTEGNYKQW" FT CDS 10358..11647 FT /product="PY-1_DYp" FT /translation="MSNHIECMKGELDLFAPHPTQSSILRTEEVSYNPIAS FT LDGASSIEFVCLGNGETYRDLSSVYLRLVVQLKKNDNSVIEGNAVGVVNNI FT LHSIFRSSSVYLNNILVSQSDNNYHYRAYLQTVLNYGSDASESHLASQGYF FT PNFGRLTSGKYVYPSSNETLKNIFQNSNKVELFGKIHGDIFNQTKLLVNNV FT DLRINFNIEKTAFYLMEKDSESNLKILEAQLFMNHVTVNPSILLAHHHVLQ FT TKNALYPFSKVEVKSFTIYPGNNTLSIDNAVIGQLPNFLAFCMIKNRSYSG FT NRGLDPFNFEHFKMQRFNLMVNGVQVPSQALEFDYSNSENVQSSRGYNMLF FT RSSGIKHYDRGLQITKEMFDTNSFILAFDLTADQSNTTICSNLMSQGTIRI FT EGRFSEPLSEAVTCLVYCEYDSMIDIDKHRNIRTLF" FT CDS 11652..12107 FT /product="PRO-1_DYp" FT /translation="MNTLQIHNLLTKHIYTKSIFKGVFPSDQLPKTISKYP FT ALIIANTDTSDQPGTHWIAFYFESRKSAEFFDSYGQFPQNKEFVTFLKSNA FT NKYCYNKQQLQGYFSNTCGHYCIMYGLFKCKKKTLKYLLTNFKRNDFSYND FT KLILKMFKSNFKK" XX SQ Sequence 14782 BP; 5333 A; 2405 C; 2369 G; 4674 T; 1 other; aaaaaaaaaa caagttgggt gtcattggaa aggatttttc aagccctttc caatggtatg 60 cttttcacga ttctgaaatt aaaaatacga cttttcataa aaactctcaa aattacaatt 120 gcacttcttc gcgcacaagc aatcttggaa gggggtgtac ctgcgcgcac aagcagtcat 180 aaggatgatc gcaagggtgg gggtgagatt tctgttcaag gctatgcacc ttactgtgtc 240 ctcaattttc tgggaaggta actggtggtt ggattattag ttgagaaata tttagaaata 300 tcaaacaaac tggttatagg gtcgtgtata cgtatatata aaaaatcata taatcataaa 360 accgttaaaa aaccgtcgtt cccgtatatt ccagctactt tcagctaaaa gctatggagc 420 cgttgagaag aaaatctaaa caatctgaag tcatagcgac tttcagctta aggctaccca 480 cgttttaaat gtatgccttg caaaatgtta cttataaagt aacaatgact agtagtcacc 540 aataatttca tgatccctga ctgcggtcat ccgaaataat tccctacaga gatatgtatt 600 tcagattatc tgtgatctgt ctccattttc attatgctag gagaccccat aaaatcaaat 660 atttatgact ctttgaacct attatcgaga aagtcaagcg gctatttacc tgcatccaaa 720 cccatagctc gccctccttt tctgacttcc ccattgcagt ctgtgcagtc tgcagtctgt 780 gaaggctcct taaatcgaga accaatcagg tggattctaa gagtagacac cccctgtaga 840 tgtcaaactc tctaatcctc gtgtataaaa atggatcccc ttaaattgct caggatcagt 900 tcttataata cctcagcact gcattcttat aatataattt cggtcgaact tgctcgggat 960 atcacgtcta accgttttaa ttgtttgtgt ggactcaaac catttactgt gaaaaatttt 1020 aaaaacgaat taaaatatat aaataaaata tatttataaa aaattcaaaa tggaatcatc 1080 agtggcttgt gatttgtgca accaatgcat cgaaattagt gagtttgtct ctcattacgc 1140 gaagtgctca gaacccagta aagcctgcga ggctttaatg ttaaactcta acattagatg 1200 tgacttatgc tctaatctag taaaaaattg tgaatttgat gatcactacc aacgctgtga 1260 aggaccccga aatgcattta gcattttaat gggtagagga aaactggatg aagaagaccg 1320 ccctatgaca tcagctaagt ctataaaacg attgaggatg gaagaggata aggaaatatc 1380 taacccccca aaaagaaaac aaattgactt aagaagggaa ctcttatgtc ctactcagag 1440 aacgtcaaca tattttgacc gatatacata aaaatgcgtc ggccagagaa gttaagaaca 1500 atgcgaatat catattgctt acatcagaga cgacatttcc agtaaaatca tatcgaattg 1560 atagtccatt atttaaaaga gtatataagt gtatatggta catatataaa gcaaacagat 1620 gaaggagtta cagtggaaac aatgaagcac tttaaaactg cctacaaatc tgtatacata 1680 aatgaagaca ttattgatga gctaaattcg gccatagatt gcttacttac attaagcagc 1740 gaatttcaag aaaaagactc aggctgggct attaaaaatt ttaactattt tgaaactact 1800 ataataaaac tggaaaacat tcctgccagc ggatacatta aagctcctca aaaaataaga 1860 gcccgtaatg cgctaataaa cgtgcagaat gccgacgtat tttgtttcaa gtggtgtata 1920 ctagcattta tagctaatca aaaccatgaa aataggactt ttacaacccc ccagcaaagg 1980 aaatattcga gagaaaggat gacaaagcct caaagctata acattaatat taatgatgaa 2040 actattgtgt atggcggaat gactttagat ttttcgggaa ttaaatttcc aatcgaaaaa 2100 ataggagttc gacattttga gaaaaacaat gtcaacttta gcataaatat ttatgaaatt 2160 gatgaagcag gtgaaaaaat tgttggtccc acgattaaaa caaaagaaag aaaacggaac 2220 cacattaata tattgggaat cgataatatt acccaaaaca ttatgcatta tgcgtatata 2280 acaaatcttt atacattatg ctcttcacaa ttttctaaag gaaaatcggg aggatatttt 2340 tgtgagaact gtcttcagtg ttatcacgtg aaaagcacta ctcacaacaa actggaatgc 2400 ggaaaagtgt cctcatttta tccggaccct aatacgacaa catcgtttaa aggatatcac 2460 aaaaaattat ctcctccagt ggtaatttat gcagacattg aggctgttct tgaaaattac 2520 aaaacatgct tgaattcttc tgcttcctcg tcaaccacgc aagtacaaaa gcatacggca 2580 tgtgcagtat ctttttatgt tgcccataaa gattatccta atcttaacga gctgtggacc 2640 tacgaaggta aacagaacaa catttttata aaattatatt taacttacat atttaaattt 2700 tataatttta taggcgttga ttgcatacaa acattttgta aaactcttaa agaaaaaact 2760 ttaagcctgt attacaaata ttgggttaac tccaagaaac cgagggatga cactttcgat 2820 gaacagcttc aaaagggaaa ttgttgcgcc tgcgagaaag aaataaatgc gagcgacctg 2880 gacaaattct ttgatcaatt ttctggagaa tacgtaggtc ctatccatag aaattgtaag 2940 cctaagttta gattaagtga ccctttcttt cctgtagtgt tccataattt atctaaatat 3000 gatattcatt tatttattac tgaattagag ggggacttaa gtcccatacc ttgcaataaa 3060 gagctctata tagcacttac gcaaactata aaaatcaatg ctacaagcag atacaaaata 3120 agatacatag attcagtccg atttttaaat tcaagtttag ataaactatc aagctatatg 3180 gaagataaag attttaaaat tctaagtacg aaatttcagg gagaaaaatt taagcaaatg 3240 aggaggaagg gggtgtttcc ctatgactac ttagatagtt ttgaaaagtt taatgatacc 3300 caacttccaa gcattgatag tttttataat tctcttagtg aagaaaattg tagtatagat 3360 aactttaatt tcgcacagaa agtctgggag acatttaact gtcgaactat taaagactat 3420 ttaaaactat atttagaaag cgatgtttta attttagcag atgtgtttga aaattttagg 3480 aaaatttgca aaaaaattta taagttagat cccattaatt atgttactgc gccatcaata 3540 tcatgggatg ctatgcttaa gttcactaat gtaaatttag aactaataag tgatggtgat 3600 atgtataatt ttttaaaaag agcaattcgg ggagggttga ctcaatgcac tcagcgcatt 3660 tctatagcaa ataataaata cttaaaaaac ttcgatccaa aaaaaccgaa caatttttta 3720 agttatattg atgccaataa tttatatggg tgggctatga gtcaaccctt accactatca 3780 gggtttcagt ttttggataa agaagaagtt gattcattcg actataaaaa tatagcatgt 3840 gatggtagta tagggtatat gctagaagtt gatctggagt atcctgccga taagcatgat 3900 ttacataatt atttaccatt ctgtcctgaa aataaaattc cacccggggg caagcaaaga 3960 aagcttatag ctgatttaac taataaaagt gaatacataa ttcacttaaa acagcttcaa 4020 ctttgtttag agcatggtct tattgtaaga aaggtacacc gagttttgtc ctttacgcaa 4080 tcatgctggc tgaagcctta tatcgaccta aacacacagc agagaaaatt ggctgaaaat 4140 gattttgaaa aaaatttttt taagctaatg aataatgctg tttacggaaa aacaatggag 4200 aatgtagcaa agcgtcgtca agtagcttta gttaaacatt atcaatcgaa acaaaattct 4260 ccaggtttta gacagcgcat agctagaaat gattttcata gcgtagaaat atttggttca 4320 gacttagcag caattgagtc aacaccgtcc aaaattatgt atgataaacc catatatata 4380 ggagtaacgg ttttagaact ctcaaaatgg ctaatgtatg aattttatta tgacttcctt 4440 ttaattaaaa gtccatctac gaaaataata tatatggata cagattcatt tatactatct 4500 tcagaagaag atttttatca aactataaaa gacaacccac aacgctttga tacttctaac 4560 tataagcaag aaaaccaatt taatataata caggctaata ataaagaacc aggaaaaatg 4620 aaagatgaac atgccggtaa ggttatgacc tcttttgcgg gacttaaggc taaggcatac 4680 tcttgtttag ttgatggaga aggaaatgac cccgaatgca taaaaaaaat taaaggtgta 4740 aaaaaatctg ttattaagag gctgaatcat gtagattata ttgaatgtat taagtataaa 4800 aagacatttt atggctcaca gagagtaatt cgtagtaggt ctcacattct ttataccgaa 4860 attgtaaata aaattagtct taactataag gatgataaga gatatatttc accagataat 4920 tgtaacactt tagcccacgg tcattataat atagataata taattagcag taactttact 4980 aatactcaat aacaaaaaat tattgtatac ccttgtaaat tacaaaacaa tacagaaatg 5040 taaataaaca aaaaccaaat tgaatttgaa attattttat tgtgtttgat tacttaagca 5100 tatcatttac agatatccat gagttgtggc tggagtcgaa accgctccac tttacaagca 5160 cctggttctt ttttctttta attattttct ctattagaaa cgcattttta tcttttactt 5220 tctgaagttc ctcttcataa aacgaacctt gaatagggtt actcatataa tcttctattt 5280 catacgttac aggccttgtt gacttaactt ttaaaacttt aaaaacttct gtagtccaat 5340 taggttcata tcccttcata aatactccct tgtatttact tattctaacg aaatccccta 5400 ctgaaaattt atgttttggg aatatttttg gtatattata tactgatttt aaaatgaact 5460 cttcattttt gacacccaca tccacaggct tcattttaat agttcgatga taagaattat 5520 tgtattgatc tacaagaaat ttgattttat gaatccattt atagtcacct ttcatactga 5580 acattttcca catacgattt tttagagttc gattgaagcg ttcacatata gaagccttta 5640 ggtgtgtata agtatggtaa tgatttatat gaaattttgt cattaattcc ctaaaagcgg 5700 aattaaaaaa ctcttttcct tgatctgact gaatgttctt aggtctacat ttactagttc 5760 taaaaacagt ttccattgcc tgtactacat ctggggcaga ctttgtctta accgcttcgg 5820 cgaacgcata ctttgaaaac gtgtcaataa cggttagtaa atatttatat cccttatttt 5880 ctttactaaa tggaatcatt tcaaccaaat caatttgcca tgtatcattt attccttttt 5940 gaacgaattt tcggcgttta aaatttttcc gagcaggctt atgaatttca ttaacaactt 6000 gttgtttgct catgatcagg atttgatttc tttgagttac tttttattat aagacgttgc 6060 ttgattagat aatagtgatt gcttatatac tagtacatcc cgttttattt caacacgcat 6120 ctttttagaa atttcaaaac ataatcgagt taaatctatt tgtacatctt tatcatagga 6180 accctccaaa tcagacagtt gaatatctaa cttttgtaag gatgcacact taggaattat 6240 atgtttgtga taaagataaa taacttgttg tcgtaatatg gaattccagc acaaaaatgt 6300 ggtaaaaaag atacccaatc ttaataaaga tctccaggac tcacaatcga actcaatttt 6360 gttaccatat tgattaatta ctcaataata gtatctggca tcaggaagtc atactggtca 6420 agcttcagct gtatacgagt atattcctta tatgtgagca cttgcatcca ttcgtcttta 6480 tcaaatccaa tgtaattgtt tttactttgc tgcatccaaa tcatgaccga aaaattacga 6540 gaagttgaca gaccacattt caactttagg ttatgaccaa cataatattc gaaaccaaaa 6600 atgacttcat gattctgggt ctttcgagaa gatttcattc tttttactta tgcacgtatc 6660 cattggctta gagtgcttta ttcaaactaa aagtatattc tgatcttggc ggtatataaa 6720 ggaggaaaac tcaccctcag ttgattaaaa agtcaaccaa tcatcatgaa cactgtagat 6780 cgaaattcaa gcattatagt actttcaata tttactggga aacaacaaac aagtttttgt 6840 aaaagagtta gcgtttatgc cggccggaac tgtcatacct aatttctttc attttaaacc 6900 accatttaac attaaggaat tggataatga aactcttaag caactacatt ttaatcatca 6960 gaaaattaac ggattggact ggtcagctgg agatattccg tatacatgtc ttgaagaaat 7020 actttcaccg ctaaacaaat accgtacagt tatcgttcgt ggtgaaatta agaaaatctt 7080 tttaagcaag tatttaacag caaacataat tgatatagat attggaaaaa gtttaactca 7140 gctacccaat ttttttacca attgtagaat acataagaaa aaacttccac ttcgctgttc 7200 attaaataat ttgtttaagt tatttgtatt cctagaaaac tcgaattatg aaatatacag 7260 taatggtgac acatatgtct aaacttaaat aaaaaataat ttgtaaacca tttagatcgt 7320 ctactatgga ttccgttaaa aagttgtttg gtaataatcc tgagaaaaaa ggatactctc 7380 gacttatata ttgtaattat tgcggatctg atacgcattt cgagaagaac tgcgccaaga 7440 aacgcgacca ggaaaatcat aaaaaagcat agtatatttc ataacaaata tacaataaat 7500 aacggaaaaa tatacatttt atagtttatc ttgtataaaa taaatcagtg aatatacatt 7560 taaaatgttt cgacaattta tacatcctta cacaatgtta atttgcggac cttcaggatc 7620 gggaaaaaca acatttctag aaaatctgat aaacaagaag aatgatcttt ttaatgtttc 7680 aattgatcga attatttggt gttatgggga ggagagtgct aagccaaatt ttaataacat 7740 tgaatacttt aaaggagttc ctgaaaccat cgaaaatgaa actaatgagc ctattctctt 7800 gatattggac gatttgatga tgggagcctt taacaaaaat gtatgtgaac tctttactaa 7860 aggttcccac caccgaaatt tatctgttat tgttgttact caaaatatat ttcacaaatc 7920 atcgcatacc agagatattt cacttaacac taaatatata attgcattta aaaatcctcg 7980 tgacaaactg caatttcaat gcctggcaag acaaatttat ccggaaaacc ccatggagtt 8040 atttagaata tataaagaag taacagaaca accacacggg tatttactta ttgatcttac 8100 tcagggaata aacgatctct tgagatttag atcagacatt tttaattcgg aatatacagt 8160 gtgctattct gacgaatatg gtttacaaaa agagcctaaa tcgcttgaaa gcgaacaaac 8220 atatattgta tgttctcaaa agggccaaca acaaactacg gaaggcaatt ataagcaatg 8280 gtgataatga gctggtaaaa acaatagtag aaatagtttt aaatacatta cgaggaaatc 8340 agacagtcaa taaagcgcat ctaagtgctt taaaaaagta taaaaatgca ttgcgatata 8400 tttcatgtcc aaagcgtaca ttaaactcta agagaaaagt cctaattcaa aaaggtggtt 8460 ttttaccaat tctaattagt tcgttgcttt caggaatttt tggaaaaatt ttaaaaaatg 8520 atcaaaaata agcataattt aaataaagta attttaatga acccgaatac cttacaaaaa 8580 ttacttaata atgatcgagg gaatgaatta ataattacaa aatttttacc aattttaaat 8640 aataacaaaa taagtgattt tgataaatgg attaagattc gtcaagaact aagtttctat 8700 caaaataaaa aacgtagtaa atcatggaat acttcaaatg aacaaaaatc atacttaaat 8760 tcaacgagaa cagctgaatc tcaaacaata gaatcgattg atataagtcc taacacttca 8820 aaattaaatt caaagttttc accgaacatg tcggagctct cggatttacc tcaaaaaact 8880 gatgaagaag attttaaatc aaaagaaact gctaagttat tgaatgttaa aaaaaaggta 8940 aaatccggta ttttactttc aagaaacgaa gagattcctg acttacctcc agaaagtgat 9000 gatgaaattc caaataaaat tatcaataaa cgaaagataa gcaacacttc gttttcagat 9060 gataaagaag atataaaaaa acaagcagta gtaagagtaa agcttactga aaaacaacgc 9120 aggttaagag aaaagttaaa taaaattaaa gcaacggaaa cattaccata tcaaaattta 9180 aattatataa caggtcgtga aaaattgcga agatcaacta gaaaaaattt agtgcaaata 9240 gggaaaaaca aacttaattg gatatcttac caatagcaga aatttaaata taaaaatgaa 9300 ttatgtaaat gatttcttta ttacattgct cagtaatgac tcattgaatg tatttcctga 9360 taatgtaaag tgtgcattta caaacattgt aaatatcccc gaagacatga atgatgactg 9420 ggaagtaggc attacagata tgtatctgaa taattatatt agcaaaacaa ctgcaaaaga 9480 aacaaattta tctgaagaag cagagtttgt gagtttagta aatgctgaaa catttagata 9540 taaaaacgtc gctgtcgaat cgaaaatatc tgaatgtgga ttgattttta tttacactga 9600 cataattaaa cccaggtctg ttggtagtat gagaccccga tgtcttcgtg taatgcatta 9660 taacggaaaa gaaaaaaata ttcattttaa aaatattgaa tattatcctt tggatatatg 9720 gtctccaaat gaaatctcaa tattaattac tgattcaagt ggatataaag tacctttcga 9780 gacttccgca gtgccgattt ttgttactct tcactttaga aaaaggagaa ggtcaaatct 9840 ataaataata catacccaaa taatgcttac ttagaacact aatagtcatc atggtgaaat 9900 cttatcagga ttattatgta aatcagtgtg gagcaggttt gtccaatata ggaaccctat 9960 atgtaacccc gcgaattatt caacaaggtc gaggaatcgg aaactttttt agtggacttt 10020 ttaattacat aaaaccactt ttcatttcgg gtttaaatgc tataaaaaat caagctgcag 10080 agactggaaa agctgtaatt aatgaatttg gtagaaagcc attaagaaat attattcaag 10140 aacagagtaa aattgcttta caaaacctat caaataaagc aataaataaa atgacacggt 10200 ttcaaaatgg cagtggaaaa agaaataaaa agcgaggtgg tagaaaaaaa cgaaatcatt 10260 taacatctaa acgacaacgt aaacataccc aaccgtctca taaaaraaaa aaatcatcaa 10320 ggaaattaaa tattacggat attttttcta acacaaaatg agtaaccata tagaatgtat 10380 gaaaggcgaa ttggatttgt ttgcaccaca ccctacgcaa agttcaattc tacgaaccga 10440 agaggtttct tataatccaa ttgcatcctt ggatggtgcc tcctcaatcg aatttgtttg 10500 tttaggaaat ggtgaaacat atcgagatct atcaagtgtt tatttacgac ttgtggtgca 10560 attgaaaaaa aatgataaca gcgtcattga aggaaatgct gtcggtgtag taaacaatat 10620 cttacattca atatttagaa gttcatcagt atatttaaat aacatattag tttcgcagag 10680 tgataataat tatcattata gagcatatct gcaaacagtt ttaaactatg ggagtgacgc 10740 ctcggaaagc cacttggcct cgcaaggtta ttttccaaat ttcggacgat taacatcagg 10800 aaaatacgtt tatccgagct ctaatgaaac tctgaaaaat atatttcaaa atagtaataa 10860 ggtagaacta tttggaaaaa ttcatggtga tatatttaat caaacgaaac ttcttgtcaa 10920 taatgttgac ttaagaatta attttaatat tgaaaaaact gcattttatt taatggaaaa 10980 ggatagcgaa tcaaatttaa agatattaga agcacaactt ttcatgaatc atgtaactgt 11040 gaacccaagc attttattgg ctcatcatca tgttttacaa acaaaaaatg ctctttaccc 11100 attcagtaaa gtagaagtaa aatcgtttac aatttaccca ggaaacaata cgctatcaat 11160 agacaatgct gtaattggac aattaccaaa ttttctagct ttttgtatga ttaaaaaccg 11220 ttcatactca ggcaacagag gactagatcc atttaatttt gaacatttta aaatgcaacg 11280 ctttaatcta atggtgaatg gagttcaagt tccttcgcaa gctctggaat ttgactactc 11340 gaactctgaa aacgttcaaa gctcaagagg ctataatatg ctattcagat caagcggaat 11400 aaaacattat gatcggggtc ttcaaattac caaggaaatg tttgatacga acagttttat 11460 attagccttt gatttaactg ctgatcagtc aaataccacc atatgttcaa atttgatgtc 11520 gcaaggcaca ataagaatcg aaggaagatt ttcagaacct cttagtgaag ccgttacttg 11580 tctggtatat tgtgaatacg attccatgat tgatattgat aagcatagaa atattcgaac 11640 ccttttttaa aatgaatact ttacaaattc acaatttgct tacaaaacat atatacacaa 11700 aatcaatttt taaaggagtt ttcccttcag accagcttcc aaaaactatt tcaaagtatc 11760 cggctttaat tattgcaaac accgacactt cagatcaacc aggaactcat tggattgcat 11820 tctatttcga aagtcgcaaa tcagcagaat tttttgattc ctatggacaa tttccccaaa 11880 acaaggaatt cgtgacattt ttaaaatcta acgcaaataa atattgctat aacaagcaac 11940 aactacaagg atatttttct aatacgtgtg gtcattattg cataatgtat ggtcttttta 12000 aatgtaaaaa gaaaacgcta aaatatttat taacaaattt taagagaaat gacttttcat 12060 ataacgataa attaatactt aaaatgttta agtctaattt taaaaaataa aatgcaagat 12120 atttaagaaa atcaatgtag tttatttatt aagtcgtttg tttagcttat aatattttaa 12180 cataatattg aaggcctcgt ctcttaaaat tggaagccta tgacactggc acgtatcggg 12240 tccttccgaa gcgtcatcat cactccaggg acaaagcttg tagtggaatc catatttaga 12300 atggatcagt tctttatcct ctaagtcctc cagagtgctc ttcaaatata taagctgcgt 12360 tacgtgctcc tcaaatatca tgtcttcgaa ctgcatcaag aagagctcaa tttgtttgga 12420 gtccattttc gttatggttc ttttaaggtt ttttaaagac tgatgttgga tttagaaaac 12480 tcaaaccttt atattggaaa atttcattta cttatttatt tactgataaa aattcaattc 12540 gtgcagaact atatctattt acatccttcc ccttatatac catatgggta ataccattct 12600 catttaattt cctaaaaggt gtcttagtac agaatttgtt tacataactc ttaggtagaa 12660 aaatccaaac atctttttcg tccttgtctg ctaaataaag agctacctta gttccataag 12720 gtgtcttttt cagcttacaa ccagtgatct tatactcaaa tccaatacat agatcttcag 12780 tttttgtata accaatcaca ggcttcgatt cattcagagt ctctaggaaa tcctagaaat 12840 tcaataaaat ttgaacagga tttatttatt aatgtcctat atgcatacca tttctgacca 12900 acgtgagatt tgattcgtta tgctcttgta actattcgta ttatactaat gcatttactt 12960 acaattacga gacttttata ctgttcagca ctacatatat atatatatat atatatatta 13020 caagaacaaa atactacgtg aaagagatac caatatatct gttatgctct ttatatacat 13080 aaactcagtt aagcctatgt agaatatttt tatatacttg agtatatcta tctcattttg 13140 tataaatata taacttcata tgaaataaaa ggttactaca aacctaatat aagacagcac 13200 ttttctcgaa aactcagatt caatttttac tctcagaatg acaaagttgt attgtgagtt 13260 gtgttcggaa acaaaatcgt ttgcttcacg cagcggttta aaaaaacata ccttaaaagt 13320 gcataacttc agctatcaaa aatcaaaatc ttcttattat tgttgctttt tctgtcccaa 13380 tgaacgaaaa ttctggttgc gatataatta caatgttcat ctcttgctga tgcacgatgc 13440 agaattaaag agtatgcgta aatcaatgga aagttatcaa ataaagataa aacaagtttg 13500 agaatttaat tttgtaaaat agttttattt aataaatttc tgtataatat agacaaatta 13560 aaaacctaga tcaaataaaa cgtctagtaa tgaatggtca tcttcatcct tcattattcg 13620 taaataaatg ttgggttgac ttggctcgat gaactgatca aaaatatgat catcatcagg 13680 aaactcatag attataggat ctgtgtgcag tatatccata tcgaaatgct gtgcacattg 13740 catttcttga tcgtctccat tgcctcgtat aagatcattt aaaacgtcct gcgttgccgc 13800 tccaaaagga aaatcgctcc actcaagaga cgcaaattca cttccaattt ttgtagtggg 13860 ggtagatgac atagaagggt caaatactcc tcttaagaag cgatctatcg aatcctctgt 13920 gtcgatagcc tccacataac cttccttcga gccttcaatt atttcgacat cagagtcttg 13980 agttaatgga acatttattc catttactga gtcaacaaac tctacagtaa gatccattgg 14040 tatgcagtgc tgaggtatta taagaactga tcctgagcaa tttaagggga tccattttta 14100 tacacgagga ttagagagtt tgacatctac agggggtgtc tactcttaga atccacctga 14160 ttggttctcg atttaaggag ccttcacaga ctgcagactg cacagactgc aatggggaag 14220 tcagaaaagg agggcgagct atgggtttgg atgcaggtaa atagccgctt gactttctcg 14280 ataataggtt caaagagtca taaatatttg attttatggg gtctcctagc ataatgaaaa 14340 tggagacaga tcacagataa tctgaaatac atatctctgt agggaattat ttcggatgac 14400 cgcagtcagg gatcatgaaa ttattggtga ctactagtca ttgttacttt ataagtaaca 14460 ttttgcaagg catacattta aaacgtgggt agccttaagc tgaaagtcgc tatgacttca 14520 gattgtttag attttcttct caacggctcc atagctttta gctgaaagta gctggaatat 14580 acgggaacga cggtttttta acggttttat gattatatga ttttttatat atacgtatac 14640 acgaccctat aaccagtttg tttgatattt ctaaatattt ctcaactaat aatccaacca 14700 ccagttacct tcccagaaaa ttgaggacac agtaaggtgc atagccttga acagaaatct 14760 cacccccacc cttgcgatca tc 14782 // ID BEL-619_AA-LTR repbase; DNA; INV; 643 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-619_AA_; KW Pao_Bel_Ele177; BEL-619_AA-I; BEL-619_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-643 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 643 BP; 200 A; 135 C; 110 G; 198 T; 0 other; tgttgcgacg ccaaacttat tttacgatcg gataggaaaa atccataaga gcaattagag 60 atcctgcttt gtatacagtc tagaatttgc ctgccacaga taagacaaat gaaaatgttt 120 tgtttttgtt ttgcttcgca tcattctagg gatgtaatag aatccaataa taattatgtt 180 cgtttacgcc caatggcacc gatacatcat acttaggtaa tgcgcatcga agacaaaata 240 tagacaaagg atgcaggcag tttaaatgac acgatgatag gaggtagcga ttaataacca 300 tttttgtatt gttagccaaa ttcagtcggg actttctgtc gcccgccttc ctttcgttct 360 ttccgtttac tttccattcg gtcaatagca gatactctct tttcattttt tgtacctact 420 agatttaagg acactatata tagcatttga caataaagga aaatagctca caactcaatt 480 cgacactcgt tcgacatgat ccgaactcca aacggttttt caaatcaata gttgctacaa 540 accattgacc aggcctttca cactgagaag gaaccagtct ctgctagttc ttctattctt 600 gaaaatcgcc ttaaaaaaaa ataacttcgg cctagccgaa cca 643 // ID Copia1-LTR_Dpse repbase; DNA; INV; 214 BP. XX AC Unknown_group_825; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1_Dpse; KW Copia1-I_Dpse; Copia1-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1021-1021 (2009). XX DR Genome; Unknown_group_825; Positions 5149 4936. XX SQ Sequence 214 BP; 66 A; 47 C; 34 G; 67 T; 0 other; tgttgaaatt tgaagtgtaa ccataatatg atctgtctgg caacgctaat taacataatg 60 tacatgaaca gcgagagcct ctattatctc tcgtctctcc tattcgtatg taagtacata 120 tgtacacatg tcatcgcttc agttctgaat aaaccactta tgagcacaga cgcgccgttc 180 tctatttaat acatttcaaa aggcgcaatt caca 214 // ID hAT-77_HM repbase; DNA; INV; 2840 BP. XX AC . XX DT 20-JAN-2009 (Rel. 14.02, Created) DT 20-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-77_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2840 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 417-417 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 335..2563 FT /product="hAT-77_HM_1p" FT /translation="MAVSTRKQSEIYLLGTVKEEFTGAKLPSIQDVLGVFF FT YHLKVVEETKHSAAKETIRQLFTYWSRARIPVRDERRAIKKLEDLVLTWDN FT LKKNKNRKTETQKRNEEAFIAEFPNLFDISHGDALQMINIEEDKQFLLAQR FT EPGRRGSMAGLDKNLENKEKEKEKRDNQMKKRKDASEEAKRKMFDTAVLED FT TSSDESQDADFSGPSTSSLTSPNPCLRKRGRKNILTPEVLGSLDRAGISDR FT HAVHTVASVISATGQNAAEFAVNRSSVRRYRLKHRELRSKELKVEFQSHGK FT PLTIHWDGKLMKDLTGDEKVDRLPIVVSSDGVDQLLAVPKLSSGTGHAICD FT AIVQIIDDWKIKDDIKAFSFDTTAANTGRRNGACVLLEGKLGHDVLYLACR FT HHIHEIMLEEVFSLCISPSSSPEIQLFKRFKQFWPNIQTADFHAGIQDDAV FT MAQLDSETRQRTLSYATEQLHTEHPRDDYRELLEITIIFLGGVPPRGVRFM FT KPGAMHRARFMSRLIYSMKIFIFRNSGFHLTDCELNGLKDLCVFAVKFYIR FT GWFSSRLGISAPKNDLLLCKELLASGNCTSLAALKKLKGHLWYLSEVLVGF FT SFFDDSISVEDKRKMVKALDNPGVQEPAKRISLLDQDIREKEIPDFVTTNT FT KKFFSALNISTSFFDEDPAIWRDLPAFTEAQTFVGKLRVTNDTAERGVALI FT QEYSGLRTKSEEQTQFILQVVAEHRKKLPEISKVAVVESLKT*" XX SQ Sequence 2840 BP; 876 A; 566 C; 642 G; 756 T; 0 other; tagggtggtc cttattttac aacttttgaa ttttaatacg ggcgccccct aggagggatc 60 caattgttca aaaacaaaat accctgaaag atttttacaa ttaaataata tttagggtat 120 gctcaatgac ccggaatttt ggtgtatgtg cccgagctgt ccagccacgc cataatgcca 180 aaagtatgaa ataaggacca ccgtaccgac tgtggcaggg tttgttttag cctgatctcc 240 tttgttgctt actttaacgt gcttttgtct acactagttg cttgtgaaca tgtgtttttt 300 atttgtgtta tagaaattaa ttaataaagt tgccatggca gtatcgacaa ggaagcagtc 360 tgagatatat ttacttggga cagtgaaaga agagtttaca ggcgctaaat tgccatcaat 420 acaggacgtc cttggtgtgt ttttttacca tctgaaggtc gttgaagaaa cgaagcattc 480 agctgccaaa gaaacaattc gacaactctt cacgtactgg agccgtgcaa gaataccagt 540 ccgggatgaa cgaagagcaa taaagaagtt ggaagatttg gttctgactt gggataacct 600 taagaaaaac aaaaacagaa aaactgaaac acagaagagg aatgaagaag cttttattgc 660 tgagtttccg aatctgtttg atatttctca tggcgatgct ctgcaaatga taaacattga 720 agaagataag caatttttgc ttgctcaacg ggaacctggt aggcgaggat ctatggcagg 780 attggacaag aacttggaga ataaggaaaa ggagaaagaa aaacgagata accaaatgaa 840 gaagagaaaa gatgctagtg aggaagcgaa aaggaaaatg tttgacacgg ctgtgctgga 900 agacacctcc tccgacgaat cacaggatgc agatttctca ggaccttcga catcaagcct 960 taccagtcca aatccatgcc ttcgaaaacg cgggagaaaa aacatcctta caccggaagt 1020 tcttggttcc cttgacagag ctgggatcag tgacaggcat gctgttcaca ctgtggcatc 1080 cgttatttct gcaacaggcc agaacgcggc agaatttgct gtgaaccgat cttccgttcg 1140 caggtaccgg ctgaagcatc gagaactgag atcaaaggaa ctgaaagtag agtttcaaag 1200 tcacggaaag cctctaacta tccactggga tggaaagctg atgaaggatt tgacaggaga 1260 tgaaaaagta gacagattgc cgattgttgt ttcttctgat ggtgttgacc agcttctagc 1320 tgttcctaag ctttcatcgg gaacaggaca tgccatatgc gacgccattg tccagataat 1380 tgatgattgg aagatcaaag acgatattaa ggcctttagt tttgacacaa cagcggctaa 1440 cactgggcga cgtaatggag cgtgtgttct tctggaagga aaactcggcc atgatgtact 1500 gtacttagca tgccgccacc acatccatga gattatgctc gaggaagtgt tttccttatg 1560 catcagccct tcatcatcac ccgagataca attattcaag aggtttaaac agttctggcc 1620 aaacattcaa actgctgact ttcatgcagg gatccaggat gacgctgtaa tggctcaact 1680 tgattcggaa accaggcaga ggactctttc ctatgccact gagcagctac acacggaaca 1740 ccccagagac gattacagag agctccttga aataacaatt attttcttgg gaggcgttcc 1800 tccgagaggt gtcagattca tgaaaccagg agcgatgcac agagccagat tcatgtcacg 1860 tctcatttac agcatgaaga tatttatctt cagaaacagt ggatttcatc ttactgactg 1920 tgagctcaat ggactcaagg atctctgtgt gtttgcagtg aagttctaca taagaggctg 1980 gttttcttct cgtcttggaa tctcagctcc aaagaacgac cttctgctct gtaaggaatt 2040 gttggcttct ggaaactgta catcgcttgc tgccttgaag aagttaaaag gacacctttg 2100 gtacttgtcg gaagtactcg taggcttttc tttttttgac gattccattt ctgttgaaga 2160 caagagaaag atggtcaagg cgttggacaa ccctggagta caagaaccag ccaaacgcat 2220 ctctctatta gaccaagata taagagaaaa ggagatacct gacttcgtga caacaaatac 2280 aaagaagttc tttagtgctc tgaacatttc aaccagcttc ttcgatgaag atcctgccat 2340 ctggcgtgat ctgcctgcct ttactgaagc acagacattt gttggtaaac ttcgagttac 2400 taacgacact gcagagcgag gtgtcgccct cattcaggag tatagcggac tccggacaaa 2460 atctgaagag cagacgcaat tcatccttca agtagtggct gagcatagaa agaagttacc 2520 agaaataagc aaagtagcag ttgtggaaag cctgaagaca taatttagtt tggaagatga 2580 ctaagtgagt tgctttcaga acattatgtt atacaaatat cttacatttg aaaaacattc 2640 atgtcacttt actgcgttga acactgagac tgagtgactg ctatatgcga aagtttgttt 2700 gaattgagca taccctaaat gttaataaaa ttcaacaata attaaaagca agtgttttta 2760 caccgaatgg aaacattttg aggggcgccc gccaaatttt ttccaaaata aattttcaac 2820 ctcctataag gaccacccta 2840 // ID Copia-24_CQ-LTR repbase; DNA; INV; 144 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-24_CQ_; KW Copia-24_CQ-I; Copia-24_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-144 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 364-364 (2011). XX DR [2] (Consensus) XX SQ Sequence 144 BP; 35 A; 40 C; 27 G; 42 T; 0 other; tgttggaagt caaccctaag tgcagccctg gtgattagat ttagttagct ttaggcgcag 60 taggctttgc cacgcgcatt cttttatata aacaccaccc actctgtgtt aaccctcttt 120 ccccgaataa acgccaactc gtca 144 // ID Copia-23_DPu-LTR repbase; DNA; INV; 216 BP. XX AC scaffold_34; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: long terminal DE repeat. XX KW LTR Retrotransposon; Transposable Element; Copia-23_DPu-LTR. XX NM Copia-23_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 710-710 (2010). XX DR Genome; scaffold_34; Positions 1007252 1007467. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 216 BP; 59 A; 50 C; 34 G; 73 T; 0 other; tgttgaattt atgttgtcaa tcctaaattg tgtaaatctg gcaactccct ccctaccctc 60 tcttctctct catcgaatgt tctgtgtgtg tgtgtaactc agtctgctac ttgagtgcgt 120 tctaatctcg atctaatctt gtgtcgcaat ctcaaataca aagaaacgta atcacagtga 180 acaaaacgcg agtctcatat tagctaatat tcaaca 216 // ID MuDR-4_TV repbase; DNA; INV; 2662 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE MuDR DNA transposon from Trichomonas vaginalis. XX KW MuDR; DNA transposon; Transposable Element; MuDR-4_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-2662 RA Kapitonov V.V. and Jurka J.; RT "MuDR DNA transposons from protozoans."; RL Repbase Reports 8(12), 1814-1814 (2008). XX DR [1] (Consensus) XX CC The MuDR-4_TV consensus sequence was derived from multiple CC alignment of 15 copies that are <1% divergent from it. MuDR-4_TV CC copies are usually flanked by 10-bp TSDs (several copies are not CC flanked by TSDs). MuDR-3_TV contains imperfect 20-bp TIRs (6 CC mismatches) and codes for a 489-aa MuDR transposase. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 275..1741 FT /product="MuDR-4_TVp" FT /note="MuDR transposase." FT /translation="MISVDLEELIAFGIKQEVKGGVPYRRRTGSPKQYVTY FT SCSYPLCKAKFRVEEQNGVFTISKVNEEHCHHIGVEMGSHYASKFYQKWIE FT LYIKNGGKQFLAQKAFFDKFGIDTNNKETMKLFAQSTDALKHKFYRLDEII FT AQRLSNDPYTSLEAFLNNLQEECPDDLVQFDYNGDFSEFTIIYAPFAAKEL FT VHSIDNPLHVDSTHSLIKGKIQLFAVTMKTSQNTIFPFCYFLVNPQTSEKI FT QECLVKFCEFAHLDPQHWSADCALNIARAIEDGFPLAQLSWCAVHVLRACA FT KVGSYFDNPQNFENFYLAMDFLTLRADATNAEKLAEESDETYEFLTTMLEE FT EEPRALKYFDKQWRRTKEHWMMLYRGEGDSTNNIAESHFHQLKQTFFLGKK FT NERIDDIVITLITVVIPNAIIKVQVANVLNEERAVRILTRAGNQIVEQRAS FT KHDECLKLIQHILEIVESRRIEPNVLIPSLKAILNLAQRSLKEE" XX SQ Sequence 2662 BP; 874 A; 399 C; 449 G; 939 T; 1 other; gagtaggtga caaaagtgca tactcttaaa ttaaccttaa atgttgattt ttcgatattt 60 tcggtattaa ccttattgta aatttttccg gtgataaatt tatcacactg aggttatttt 120 tccggtgaga tttttaacct tgatttaatg aaattaatat atataaattt tattattttt 180 aatttcgaat gagtataata aaatattttt tatgtgtttt tgtttttgat gtgatttgaa 240 ttttgttaac tcttttcaaa atcacttcat ttttatgatt tcggtggact tagaggagct 300 tattgctttt ggtattaaac aggaagtcaa gggcggtgtc ccatacagaa gaagaacggg 360 atctccaaag caatatgtca cttattcatg ttcgtatccc ctatgcaagg caaaatttag 420 ggttgaagaa caaaatggag tgtttactat aagcaaagtt aatgaagaac attgccatca 480 tataggagtt gaaatgggtt cacattacgc gagtaaattt tatcaaaaat ggatcgaact 540 ttatattaaa aacggtggaa agcaatttct tgctcaaaaa gcattttttg ataagttcgg 600 aattgatacg aataacaaag aaacgatgaa acttttcgcg cagtctactg atgctttaaa 660 gcacaaattt tatcgccttg atgagattat tgcacaaaga ttatcaaacg atccttacac 720 ttctttggaa gcatttttga ataatttaca ggaagagtgc ccagatgatc tcgttcaatt 780 tgattacaat ggagattttt ccgagtttac aatcatatat gcaccttttg ctgcaaaaga 840 attggtgcat agcattgata accctctcca cgtcgattcc actcattctt taattaaagg 900 aaagatccaa ttgtttgcag taacaatgaa aacaagtcaa aacacaattt tccccttttg 960 ctatttttta gtgaaccctc agacatctga aaaaattcaa gaatgtttgg tgaaattctg 1020 tgagtttgcg cacctggatc cacagcattg gagcgctgac tgtgcgctaa acattgctcg 1080 cgcaattgag gacggatttc ctcttgcgca gctcagttgg tgcgctgtgc atgttttacg 1140 cgcatgtgca aaagttggaa gctattttga caatccacag aattttgaga acttctattt 1200 agcaatggac tttttgacat tacgtgcaga tgcaacgaat gcagaaaagc tggctgaaga 1260 gtctgatgaa acatatgagt tcttgacaac aatgttagaa gaagaagaac cgcgcgcttt 1320 gaagtatttc gataaacaat ggagaagaac caaagaacat tggatgatgt tatatcgtgg 1380 agaaggtgat tccacgaata acattgcgga atcacacttt catcaattaa agcaaacatt 1440 tttcttgggt aagaaaaatg agcgcattga tgacatcgtc ataactctca tcactgttgt 1500 gattccaaat gcgattatta aggttcaagt cgctaatgtt ttgaatgaag aaagagcagt 1560 aagaattctt acaagggcag gtaatcagat tgttgagcaa agagcttcaa aacacgatga 1620 atgtttgaaa ttaattcaac acattttgga aattgtcgaa agccgaagga ttgaacctaa 1680 tgtccttatt ccatctttga aggcaatatt aaatttggca caaagaagtt taaaggaaga 1740 gtaagcaaga ttagataatt tggacttttt ttaaaattaa tgaataaaca gtatattttt 1800 gaagtgtaag cttatttacc tgaagctata ttacttaata taacattcaa agtattattg 1860 ctatttattg gagaacaaat ttatgtaatt ttcacatgaa aattgataaa tttgtatttc 1920 cttatgataa tcaatttttg tatttgtaga gagtctccct gcatgatatc aagacttggc 1980 tattagtctt tatccatttt tgaaaattgt attgtaaatt gtaagaattt ctgaatacta 2040 ctactataaa atataattgc ttatctatgt atattctccc tgcatgatat caagacttgg 2100 ctataagtct ttatccattt ttgaaaattg tattgtaaat tgtaagaatt tctgaatact 2160 actactataa aatataattg cttatctatg tatattctcc ctgcatgata tcaagacttg 2220 gctattagtc tttatccatt tttgaaaatt gtattgtaaa ttgtaagaat ttctgaatac 2280 tactactata aaatataatt gcttatctat gtatattctc cctgcatgat atcaagactt 2340 ggctatwagt ctttatccat ttttgaaaat tgtattgtaa attgtaagaa tttctgaata 2400 ctactactat aaaatataat tgcttatcta tgtatattct ccctgcatga tatcaagact 2460 tggctattag tctttatcca tttttgaaaa ttgtattgta aattgtaaga atttctgaat 2520 actactacta taaaatataa ttgcttatct atgtatattc tccctgcatg atatcaagac 2580 ttggctataa gtctctattc attatgagta tgaccccatt ttttttacaa aaagtgccga 2640 attgcaactt tgtcccaatc tc 2662 // ID Copia-15_AA-LTR repbase; DNA; INV; 226 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_AA_; KW Copia-15_AA-I; Copia-15_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-226 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 942-942 (2011). XX DR [1] (Consensus) XX SQ Sequence 226 BP; 62 A; 50 C; 47 G; 67 T; 0 other; tgaggagcaa tatctgcact gatcccactg tgcaacgtga cgctcaacgc ttacccccat 60 catcagtgtt gcgcccatga ggtgtgaagt gaaagaaaga gaacgatgac agttcgaaat 120 ggaatcttta ttaccgttgt agtttttgct aaataaacac gttttagttt gctttataaa 180 ccgtggtgcg attatcatta ttttcccggc cgaattccaa ttccca 226 // ID Dtripu2cons repbase; DNA; INV; 501 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of irritans DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dtripu2cons. XX OS Drosophila tripunctata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup IV. XX RN [1] RP 1-501 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones that show less than eight percent divergence. CC Dtripu2cons. XX SQ Sequence 501 BP; 161 A; 110 C; 113 G; 117 T; 0 other; tgggtaccgc acgagctcac agttgaccaa aagcaacaac gagtggatga ttctgagcag 60 tgcttaaagc tgttcaatcg tcaaaagtca gagtttttgc gtcgatatgt gacaatggat 120 gaaacgtggc ttcattattt cactccggag tccaatcgac agtcagccga gtggactgca 180 cgcgatgaac cgactccaaa gcgtggaaag acgcaacagt cagctggcaa ggttatggcg 240 tcagtatttt gggatgcgca tggtataatt ttcattgact acctcaaaaa aggaaaaacc 300 atcaatagtg actattatat agcattattg gagcgcttaa aggaagaaat cttggaaaaa 360 cggccccatt tgcaaaagaa aaaagtgctg tttcaccagg acaatgcacc gtgtcacaaa 420 tccatgaaaa cgatggcaaa attaaatgaa atgggcttcg aattgcttcc gcatcctcca 480 tatactcccg acctagcccc a 501 // ID Transib-N3_AAe repbase; DNA; INV; 2727 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Transib DNA transposon family from Aedes DE aegypti. XX KW Transib; DNA transposon; Transposable Element; nonautonomous; KW Transib-N3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2727 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1310-1310 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. 5-bp TSDs. TIRs are ~700 bp long. XX SQ Sequence 2727 BP; 918 A; 454 C; 419 G; 936 T; 0 other; cacagtgctt ccaagggcct atattcgtga tcgaaaatgt tttgcaccag aaaagttggt 60 tttagaggta tggtgtcttc agagaagttg ttcgggatgt tgaagccctt cttaagaaag 120 aattattttt gtgatgaatc cacctagcag tgagatagat tttcactttt ttgcaaaagt 180 ggagatacag caatggtgtc ttcggcaaag ttgtagataa tttcactaca tgaaaatttg 240 ctgaagacac tttagctcta tctattgaat tttaagagat atgggtcatt ttttgtgaac 300 gaccccttaa aactagtttt ttcgttatat ctctttctgg gtatttttga gatttttcac 360 atgttctaca aagttgtttt aattgataaa atacgcattt ctttagaaga cataaaaaat 420 gtatctctta tattttcgga gttatatgat attttatgtt gaaaaatcgg ctatttcacc 480 ttcctctaac atttacaggg gcaaattggg gcaacatttt tagcttgcat ataaaagagt 540 aactcaatag ctatcaaatc catggtaact gacacgtggc actgcgttcc ttgcctacaa 600 aatatgttct gtaaaatatt gaaaaattaa aattatgcag cttttaatac cgtttatttt 660 ttaaatacgc catttttcag ggctgagtga caattttcag gtcttttacg caatgatata 720 ttcggtttgg ctaatggtga tattgcctaa gataggatat cttgaaatca aacgtcatac 780 ttctcattta cagtgaaaac tagatgtatg aattcttaga gatagccttt attacgtaac 840 atcagctcta tactgaacgt agtataaagc acatagaata atgttagtaa tttatttgcc 900 ttaaaacgga agtcatccta tttaagctta aaataataat aataatagta ataataataa 960 taataacaat aataaaggaa agaatgtttt ctttagtttt ggtattttgc tcttttgtat 1020 tcacaagaga acataatctt tataaatcta tccgtctttt catataaata aaactaaagt 1080 ggcgtgacat acgaacagaa tggctttcaa acggactatc atttgttacc ataaacaagc 1140 tataatttcg ctgatatgtg tatttttaag attttagata ctactttttg agcatgtgaa 1200 gaagaaaatt taaaataaga ccctttttat atatttctgt gcgataatac agtcgactat 1260 ccacatctca atatttttgc ctatgtagat tatttctttg atcctctctc tcaacaattt 1320 ctttctccat aaaaagacta tcttttaact attattatta caaaagtgtt catatgatga 1380 tatgtagtct taggcgaatt gattatttac attatccact gtgcttaaag tgaaacatat 1440 cggaatctag agatggtcat cccaatggcc gacttatgtc atatctctgt ccgcattcta 1500 aaagcactta actgaacaaa tattgaatta acaagcatta tattcttatt ttatgctttc 1560 tactaatgca caaagctaaa ataaaaatca ctgtcgttcg atgcctactt cgccagattt 1620 gtgtcttaaa agttttagtg caatatttac ttaatataac taactatttc acgttaacaa 1680 tatccaaaaa tatttagtat ttgagagatc acgtttttac agtcataacc gaatatatca 1740 tatcctagaa gacctgataa ttttccactc agccatgaaa aatgggaacg gttcatctgg 1800 cttaaagcca tttagcatat ggtcatttgg cataacccca attgaaataa tggccattta 1860 tcataacggt catttcgcat aattagaata aaattctttc tttcaaaata tacctgttct 1920 ttataaatgt gtttatgtta aatgaccatt gtgccaaatg accgttatgc caaatgtctt 1980 ttataacaaa cggcttgatg ccagatgaac ttatgccaaa tggctttata ccaaacgacc 2040 cagacccatg aaagatggcg tatttaaaaa atacacggta ttaaaagctg catactttta 2100 atttttcaac attttacgga acatattttg tagtcaagga aagcagtgcc acgtgtcagt 2160 taccatggat ttgatagcta ttaagttact cttttatatg caagctaaaa atgttgcccc 2220 aatttgcccc tgtaaatgtt agaggaaggt gaaatagtcg atttttcaac ataaaatatc 2280 atataactcc gaaaatataa gagatacatt ttttgtgtct tctgaagaaa tgcgtatttt 2340 atcaattaaa acaactttgt agaacatgtg aaaaatctca aaaataccca gaaagagata 2400 taacgaaaaa actagtttta aggggtcgtt cacaaaaaat gacccatatc tcttaaaatt 2460 caatagatag agcaaaagtg tcttcagcaa attttcatgt agtgaaatta tctacaactt 2520 tgccgaagac accattgctg tatctccact tttgcaaaaa agtgaaaatc tatctcactg 2580 ctaggtggat tcatcacaaa aataattctt tcttaagaag ggcttcaaca tcccgaacaa 2640 cttctctgaa gacaccatac ctctaaaacc aacttttctg gtgcaaaaca ttttcgatca 2700 cgaatttagg cccttggaag cactgtg 2727 // ID Gypsy-1_DWil-I repbase; DNA; INV; 4671 BP. XX AC scaffold_179905; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DWil_; KW Gypsy-1_DWil-LTR; Gypsy-1_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4671 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_179905; Positions 14158 9488. XX CC Positions [1968-2513] - Reverse transcriptase CC Positions [3518-3982] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1881..3509 FT /product="Gypsy-1_DWil-I_1p" FT /translation="MKDDIPIKQRYYPKNPKIQGEINTKVDELLTMGCIEP FT SQSPYSSPIVMVRKKQTGKWRLCVDFRQINAKSIKDAYPMPRIDYILNQLR FT EAKFISSLDLKDGYWQIPLEENSRPMTAFTVPGKGLFQWKVMPFGLHSASA FT TFQRALDQVIGPEMIPNAFAYQDDIVVVSKTIKEHMVHLQEVFRRLKEANL FT RINAEKCQFFRKELRFLGHLVTENGIATDPDKVAAIAKLKPPTCTKELRQY FT LGVASWYRRFVPNFAMITHPLNALLKKKTKWHWSEEHQAAFESVKEKLTSD FT PVLACPDFTKTFVLQTDASNTGLGAILVQNTEEWERVVSYLSRELNQAEKN FT YSATEKECLAIVWAVRKLRPYLEGYRFKVVTDHMALKWLNSIESPSGRIAR FT WALELQQYDFEIAYRKGKLNIVADALSRQPMEECKRIVDNGARLRQANKEC FT EWIQDITKKIQQQPQKYPDYILENSQVYRNIPHRLGQEEVISWKLCVPKIL FT RNQVMEENHTSITAGHTGSRRTISRTASNARRVSIINPTNSKQQGKC" XX SQ Sequence 4671 BP; 1762 A; 948 C; 1119 G; 842 T; 0 other; actggcgccc aacgtgctat actggtgaaa ttttaaacaa aacaaaagtt tgaaaattgt 60 aaagaaaaca tcaaaatggg gaaaggatgg atttaccaca tgaagaagga ggatctgtta 120 cagacaggcc aaaggattgg gatagagatg accggcacag tggaggaaat gaggaaacag 180 ctaggcgagt gggtagaaag gaatgaaacg gacccagact ggaaggagga aataaataga 240 ttggagacaa agtactcgaa aaatctgaaa atgaaaacaa tagcaagcac atcaaaaaca 300 tgcgaaaatg gcgtagacaa ggaggctgag gaattccaga aaataaacat cgccacacta 360 gcaattcctc catatttacc accaagatgg ggcagcgaag gagccctaga tagcgcagag 420 acaaagaggg acataaggag acaaacgata gaccccaaga aaacgggacc acatacaaca 480 aacaggacca ccgaatacgc caaggtggca aaacaagtga gggaatagac attcagattc 540 gacggcaatt ccagaccgct agagttcata gaacagatcg agtggtcagc agacatgtat 600 ggcatcgagc tggacatgat cccaagagca atgccagaac tactgaaaga caaagcactc 660 aagtggtacg tcgcaaacaa taagcagtgg cgatcgtgga aggagttcgt tgagggcttc 720 aagaaatact tcctgccaag aggattcttc aacaaattgg cggatcaagt gaaaaatcgc 780 aaacaaaaca gaaacgagta ttttaaggac tatatgatcg acatgcaaac aatgatgaga 840 cccctcaaaa taccagagta cgaacaattg gaggttatat tagaaaattg cactccagat 900 ttgaagatat tcgttcggcc gcacagagtc agaaacctag atgagctgat gcagttggcg 960 gaagaacatg agatgcttga acaagaacga tcagatttcc atcgcccaaa caatgctcaa 1020 agaacacaca acaaccaatc aagccacaga ggagaaaata attacgggcc accagtcaca 1080 cagtgccgaa gatgccaaga caacagcatg gagaacaatt caagaccaag ccaagaaaat 1140 aaaaacatcc aagatggaaa agtcttgaat ccaagccagg catgccgcag atgcggagga 1200 gccggacact gggcaaggga ctgcaacaat cagagactca ttttctgttg gcagtgtgga 1260 aggattggga aaataacaaa agaatgttgt tggagttcgg gaaacgaaag gcggttccag 1320 cctatcaggg gcgagcaggg acccagccaa caagctcctc accactaata ggccaactaa 1380 tagaggaaca ggatcaattg tcggcaatta tcgaagtagc aggagtccag atgaaagcga 1440 caattgactc cggagccacg agtagcttca tcagcaagca gatgttggaa acagtgagga 1500 ataagggaaa atgggagaac acacacaaac aggtatacat ggcagacgga aaggtgcgat 1560 acattgaaga gcaatggaca ggatacgtga gattcgggga caaggaaatc cacattcctg 1620 gtcatgccag aaggaataga tgcgctgatc ttagggtgga atttcctccg gattatgaac 1680 acagagatat cgtgtggagg ccacaaagta cgaataccga cagtaaagcg aatacagggc 1740 aggctggtag aaagaatgtc catcatggtc actcagaata atgatcaaca agaaatacaa 1800 agttttcttt tctaagtcag gagttgaagg atttagggaa agttaaggga ctgtcaaagg 1860 tggggataca tcgcataacg atgaaggatg atatcccaat aaaacaaagg tattacccga 1920 aaaatccaaa gatacaggga gaaatcaaca caaaggtcga tgaactgttg acaatgggat 1980 gcatcgaacc atcacaaagc ccatatagct cacccatagt gatggtgaga aagaaacaga 2040 cgggcaaatg gagattatgc gtagatttta gacagattaa cgcaaaatct ataaaagatg 2100 catacccaat gccacgcatc gactacatct taaaccaact acgtgaagcg aaatttataa 2160 gcagcctaga cctgaaggac ggctattggc aaattccgtt ggaagaaaac agcaggccaa 2220 tgacagcatt cacagtccca gggaaaggtt tattccaatg gaaagtcatg ccattcgggt 2280 tacactctgc atcagcaacg ttccaacggg cactggacca agtcattggc ccagaaatga 2340 taccaaacgc gttcgcctac caagatgaca tagtggtagt aagcaaaaca atcaaagaac 2400 acatggtcca cctgcaagaa gtgttcagga gattgaaaga agcaaaccta agaataaatg 2460 ccgaaaagtg ccaattcttc agaaaggagc taaggtttct aggccacttg gtaacagaaa 2520 acggcattgc aacggaccca gacaaggtgg cagcaatagc taaattgaag ccgcccacgt 2580 gcacaaaaga attacggcaa tatctagggg tggcctcgtg gtacagacga ttcgtaccaa 2640 attttgctat gataacacat ccgttaaacg cactgctcaa gaaaaaaacc aaatggcatt 2700 ggagcgaaga gcatcaggca gcattcgaat cagtaaaaga aaaactcacg agtgacccag 2760 tattggcatg cccagatttc acaaaaacgt tcgtactcca gacagatgca agcaacacag 2820 gattaggggc catactagta caaaacaccg aggagtggga aagagtggtt tcatatctgt 2880 cgagagagtt gaatcaagca gagaagaact actcggcaac agaaaaggaa tgcctagcaa 2940 tagtgtgggc agtacgcaag ctgagaccat atctcgaggg ataccgattt aaggtggtga 3000 cagaccacat ggcgttgaag tggttgaata gcatagagag cccctcgggc cgcatagctc 3060 gatgggcact ggaattacaa caatacgatt tcgaaatagc atatcgaaag ggcaaattaa 3120 acattgtcgc cgatgcgcta tcaagacaac cgatggaaga gtgcaaaaga atagtagaca 3180 atggcgcccg attacgacaa gcaaacaaag aatgtgagtg gattcaagat ataacaaaga 3240 aaatacaaca acaaccacag aaatatccag attacatatt agaaaacagc caggtgtaca 3300 ggaatattcc gcatagatta ggtcaagaag aagtaatatc atggaaattg tgtgtgccca 3360 agatactacg gaaccaagtg atggaagaaa accacacttc cataacggca gggcatacag 3420 gaagcaggcg aacgatttct agaactgcgt caaatgcacg acgtgtcagc attataaacc 3480 caaccaacag caagcagcag ggaaaatgct gacgcaaata ccggatgaac catgggccac 3540 tgtttgtgct gatttcgtcg gaccattacc acggtcgaaa catggaaact caatgttact 3600 ggtaatcatt gaccgatttt caaaatggac agaactaata cccatgcgaa aggcaactgc 3660 ggagaatcta atcaaggcct ttagggaaag aattctgtcg agatttggag ttcccaaagt 3720 ggtaataact gacaatgggg tccaattcac tagcagaaca tttaagaaaa tgctagagga 3780 atggcgatgc caacaccaat taacagcacc atacacacca caagaaaacc caaccgaaag 3840 ggcaaacaga acagtaaaaa caatgattgc tcagtttacc ggtgaaaatc aacgaaattg 3900 ggatgagagc tggccggaaa aaacactggc tgtaaatact agtcaagtca atcagagtca 3960 actggattta ccccagcata tgtgatacaa ggacgcgaac ctagaatgcc aggcgcgata 4020 tacgatagtg tcacgacagg cacaggaaag caaagcgtac caccagaagc aaaggcaaaa 4080 caagtgcaag agctgctcga gctgataaga cgtaatctgg agacggcggc acaagatcag 4140 gcacgtcact ctaacctaag gaggagagag tggagaccaa caattgaaac cacagtatgg 4200 gcaaaagaac accatctctc gaatgcggcc gaagggtttg cagcaaagtt ggcaccaaga 4260 tttggaggcc cttatgaagt gacaggtttt ttgtctccag ttatatgtaa attacggcac 4320 aaagaaacag gaaaaaccag gcaagcacat atcagccaga tgaaacatca aatatgtgaa 4380 caaacagatg tcaataaact agatgaacaa tgagtatgga cgactctgat gacatggcag 4440 cgagaaatct gcacagaccg gccggctggg cagaaatctc aaaaacatga cacagagcca 4500 aaaatatcaa agtgcatcga aatcgtacaa aatcacataa cataaaaaaa aagaaaaccc 4560 caaagtgcgt cggaatcatg taaataagaa caagcagaat agcaaagtcg caaaccgcat 4620 accggaacaa gataaatata tgcggatatc atatatatac agtcaggcaa a 4671 // ID hATw-2_BF repbase; DNA; INV; 6205 BP. XX AC ABEP01041645.1; XX DT 12-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Branchiostoma floridae. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6205 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Branchiostoma floridae."; RL Repbase Reports 9(2), 514-514 (2009). XX DR EMBL/GenBank/DDBJ; ABEP01041645.1; Positions 63798 57594. XX CC TSD is 7-bp long. XX FH Key Location/Qualifiers FT CDS 1500..4916 FT /product="hATw-2_BF_1p" FT /translation="MASTSDGSVAADLGGMLASGVRLKTVDEFTNEFVIHL FT RKLQGNASMCLATDLKQLAPLEMMKDIEKCKEDSLRKRMDSVYLNYKKLSA FT SKNKEKLAIYCKTQFELPKSRESPTAAAAESTSTQYKLKQTRKELKDSKKR FT CAQSEHEVYELEQSKEVLTQEYFSVVSQLHENADSFRKIIEEKSDKVSDLS FT KALSAANKTNDELTAKLDSLEARLSRVGTVKNVNKKVKRRDETIQKKNELI FT EQLQGAVSAGNQELNELREAQEHQTSATEKVRALQTKVKSLQDSKRKLQVK FT LQKIEEDKVQVQTDLDATVDTLNGKIQDLENENKHLEAVQSLMDSDTVTTF FT HEGRYSNDLREVIMNLLTSGVSLKKVDHVITTVLSKLAGKKIDRFPSVGLK FT HRLVIEAKILGDRQVAEAIIAGAGNAEQGNCLHQDGTSKFFKKYQTFDVTT FT PTGDTLTMSMTEVPSGDAAGILTAFSESCRSLAEVLCEPGEDVAKKTAEII FT TSFTSTMSDRGATNPLFNKHLESLRRELLPVVKDEWDTLSDEVKDELGDMC FT NYFCKMHLLVNFATEANVTVKIFEDAVSEGKNPFAFAQQGESGAARLIRTA FT CSAFTDRGNEKSGAPQFFQSYLSQHGEQKNYMVTYRGNRFNILFYNAAAVY FT HHLKQMVTFVSSWPDANNLLKAVKADASQKVYVAGVRALGIIDKTITGPFF FT RVLGTEKNILGMNTHLHQMQIGLERWSRDASTLLAGEPLFNETEVERHKDA FT LFDSLFAATEDEDLDVLTQQALEVVCSALLILLERQAEEQLPGGKFWKPTE FT TEVTKSQHVPTTNVASERDFAILDNLLRAKPYGTSLSFEAHLLWINNQTSA FT WLRNLTPEERNKHMDYARTNASRVQQRFKERQSEIKEKRLQTLLERQRQKE FT EKRRKARLKTVALTDKVMNMGGVWKTPEEVEDKVKTLNKEQEKLEAIKFQL FT QFHKQVLKSEGSKELFQMTITKPGKKKHTFSSEEKILHLKEVVMKNVSGVT FT DIDLDSEDEDDQEYVTLKEPEARTVPEMRVKLVEKRDIAREKREIDRQKQS FT LPVYIKDPSKLVGCRVKHQCDSTDEQGNKEARWFEGAVTEIVNRTEEAKDT FT VYKIVYDDENESDEEFNLLSDLAKGWLIIL*" XX SQ Sequence 6205 BP; 1950 A; 1218 C; 1393 G; 1644 T; 0 other; tagggtgttc agtcacgtga cctgacccca gcgaaatcgg tgaaaacacc ctaccttgcc 60 actttctggg gcgatatctc aacaatttca acaccaatat acacgtatta tacatcaaaa 120 tgaagggaat ttagttgttg aaataacctg taaatggctt aaaaatacgt gctaagtaga 180 aagagatatt tttggaaacg tgcggcaaat gcccccgtct ccgggccggg gcggcgcggg 240 ccgggatttc gggcctgagc cgcgctgaca tcgtcccata tttgggcatt gccaaggggt 300 tgcccttccc atatatggca ttattcgtgt ttttccggca agaggaggcc tgtgggtgga 360 gctacgaaaa atcccgaaga ttttcgacca cggcaaagtc atactgcgag gtgagacagt 420 tttcaatcgt aaaaattttc aacaatttga aaagttacga atcttggatt tggatatatt 480 aaaatataca ttctaaggaa tagtttgatg tgtggtttgt gtagttttga cgagaattag 540 tgccttggct tggacgagat tgttgagagt attgcggtac gcgtcggctg ttgttggggt 600 gggagggggg cgcacgtgcg ctcacttacg ccggagcggc cagacagatt ctatcgtaaa 660 aattgcgtac aatttgtaaa tttggcgacc tggaattctg atacagtgaa aaatacattc 720 ttttgaacaa ttctatctat aatttgtgta gttttgagta aaatcagcgc gccactttca 780 cgaattttgt aaggctaaga atagcggtac accacggctg caccgttacg tatatcaaaa 840 cacaactgca ttcgtaattt ctatgtgcgt ttttactatt tttctgatca gttacccttt 900 ttttctgccg aaaatgttgt ttctaataga atagattttg tttatgactt atattagggc 960 ttcactgtat tatttatttt atttctttta ttcaaattca ttgacttcta tttctagaaa 1020 atctagacag tccatttttt ctaccaatta aaaatttaac caggttcttc atttgtcaaa 1080 aggtaaatct aggaaattta ttgacagttt catttccttt tctatgaaat tctaaattta 1140 gagaggattc ttttctaatc cacaaagtaa gattattaat attgccatta tttacatatt 1200 attctacagg aataccccac cagtcttcga gtgacgtcag agatccgggc taccgttgga 1260 caggagtacg gcctgcagct gtaggctgca gacggcggtg tggtcacctt cacttcagcg 1320 tcgtgcaggg accgggaact tcaacagcac cacacgtggg tcatcttcat tttttctatc 1380 aatctatgta ccaaacttgt acaggtatct attttgaatt cttagatttc attggacaat 1440 ttatattaca tacattacta attatcattt tcatttcttt tctttccttc tagaacacca 1500 tggcatctac atctgacggt tccgttgctg ctgaccttgg agggatgttg gcttcaggtg 1560 tcagattgaa gacagttgat gagttcacaa atgagtttgt aatccactta cgcaagctgc 1620 aaggcaacgc gtctatgtgt ttagctactg acttaaagca gctcgctccg ttggaaatga 1680 tgaaggacat tgagaagtgc aaggaggact cactccgaaa acgtatggac tcagtttatt 1740 taaactacaa gaagctctcc gcaagcaaga acaaggaaaa attggccatc tactgcaaaa 1800 cacagttcga actgccaaag tcaagggagt ctcccacggc tgcagcggca gagtcaacgt 1860 caacacagta caaactcaag caaaccagaa aagagctcaa ggacagcaag aagcgatgtg 1920 cccagtccga acacgaggtg tatgaactgg agcaaagtaa agaagtgcta acgcaggagt 1980 atttctccgt tgtatcacaa ctgcatgaaa atgctgacag tttcagaaag attattgagg 2040 agaagtcaga taaagtatct gacttaagca aagctctgtc tgctgctaac aaaacaaatg 2100 acgaactaac agcaaagctg gactcacttg aggcccgact gtcgagggtg ggaactgtaa 2160 aaaatgtcaa caaaaaagtg aaaaggagag atgagaccat ccaaaagaag aatgagttga 2220 ttgaacaact acaaggtgcg gtcagtgctg gcaaccaaga acttaacgag ttgagggaag 2280 cacaagagca ccagacaagt gctactgaga aagtgagagc actacagacg aaagtaaagt 2340 ctttgcaaga ctctaaacgc aaactacaag tgaaactgca gaaaatagag gaggacaaag 2400 tccaagtaca aacagacctt gatgccaccg tggacactct gaatggcaag attcaggatc 2460 tcgagaatga aaacaaacat ctagaggcag ttcaaagttt gatggacagc gacactgtaa 2520 ctacgttcca tgagggcaga tattctaatg atctcagaga agtcatcatg aacctcttga 2580 cttcgggagt atctctcaag aaggtggacc atgttattac tacagtttta agcaaactag 2640 ctgggaaaaa aatcgatcgt tttcccagcg ttggcctcaa gcacagactg gttatcgagg 2700 caaagattct gggcgatcgg caggttgcag aagcgattat tgcaggagct ggcaacgcag 2760 aacagggcaa ctgtcttcac caggatggca catcaaagtt ttttaagaaa tatcaaacat 2820 ttgatgtgac cacaccaact ggcgacacat taactatgtc aatgacagaa gttcccagtg 2880 gggatgcggc tgggattctc acagcattct cagagtcctg tcggagtcta gcagaggtgc 2940 tatgcgaacc tggagaagat gttgcaaaaa aaactgctga gataataaca tcctttacct 3000 ccacaatgtc ggatcgaggt gcaacaaatc ctctgttcaa taaacacctt gagagtttac 3060 gccgagaact tttgccagtt gtgaaagatg agtgggacac actatctgat gaagtgaaag 3120 atgaacttgg cgatatgtgt aactactttt gcaagatgca cctattggta aactttgcta 3180 cagaagccaa cgttacagtg aagatctttg aggatgcagt ttcggaaggc aagaatccat 3240 ttgcctttgc acaacagggt gagtcggggg ctgcgagact tatccggaca gcttgcagcg 3300 ctttcacgga ccgtggcaac gaaaagtctg gagcgccgca gttcttccaa agctacctgt 3360 cccaacatgg agaacagaag aactacatgg tgacgtatcg cgggaacagg tttaacattc 3420 tcttttacaa tgctgcagct gtctaccatc acctcaaaca gatggtcacc tttgtgtcat 3480 catggcctga cgcaaataat ctgctgaaag cggtcaaggc tgatgcttca cagaaagtgt 3540 atgttgctgg ggtgcgtgcc ttgggcataa ttgataagac tattactgga cccttcttcc 3600 gggtactagg tacagagaag aacatccttg gcatgaacac tcatttgcac caaatgcaga 3660 taggcctgga gcgatggtca cgtgatgcaa gcacactgct tgcaggggag ccacttttca 3720 atgagactga agttgaaagg cataaggatg ccttgtttga ctccttgttt gcagccacag 3780 aagatgagga tttggatgtg ctgacgcagc aggctttgga agtggtgtgc tctgcattgc 3840 tgattttgtt ggaaagacag gcagaagaac agctcccagg tggaaagttc tggaaaccaa 3900 ctgaaacaga ggtgacaaag tctcagcatg tgccgacaac aaatgtagca tcggaacgag 3960 actttgccat cttggacaac ttgctgaggg ccaagccata tggcacgtca ctatcatttg 4020 aggcgcatct gttatggatc aacaatcaga ccagtgcctg gctgcggaac ctgacgccag 4080 aagaaaggaa caaacacatg gattatgctc gaacaaatgc aagcagggtc cagcaaagat 4140 tcaaagaaag acagagtgaa atcaaagaaa aacgccttca aacactcctg gaaaggcagc 4200 gacaaaagga agagaagagg agaaaagctc gactgaagac agtggccttg accgacaaag 4260 taatgaacat gggaggagtg tggaagacac ctgaagaagt agaggacaag gtcaaaacac 4320 ttaacaaaga acaagagaaa ctagaagcaa tcaagtttca gcttcagttc cacaaacaag 4380 tcttaaagtc agagggaagt aaggaactct ttcagatgac aatcacaaaa ccaggaaaga 4440 agaaacacac cttcagcagt gaagaaaaga tcctccacct gaaagaagtg gtgatgaaga 4500 atgtgtcggg agtaacagat attgaccttg attctgaaga tgaggatgat caagagtacg 4560 tgaccttgaa ggaaccagag gccagaactg tgccagaaat gcgggtaaag ttagttgaga 4620 agcgtgacat tgcacgagag aaaagggaaa tagataggca aaagcaatcc ttgccggtgt 4680 acataaaaga cccttccaaa ctggttggct gtcgtgtaaa acatcagtgt gactctacag 4740 atgagcaggg gaacaaggaa gcgaggtggt ttgaaggagc cgtcacggaa attgtcaaca 4800 gaactgaaga agcaaaggac actgtgtaca aaatagtgta tgacgatgaa aatgaaagtg 4860 atgaagaatt taacctcctt tcagatttgg caaagggttg gctgattatt ttgtagctgt 4920 ccttcttgtg accttacacc atatcttacc tgctgtagca gtcctcgtgt gactttataa 4980 tgctcatgta actcttagtt atagttgtat actgcaagtg ttgtaccata tcttatctag 5040 cagtccttct aatgtagcct tgtatctact agttatagtt acatttgtat atgatatatc 5100 atacatactg caagtgcaag tgttgtacac catatcttaa ctagcagtcc ttctaatgta 5160 gccttttatc tactagctat agttacattt gtatatgatc atacattctg caagtgaagg 5220 agttgtacat tatatgttca ggtgtacatt tgcaaggata tgaacttaag aggtcaagtc 5280 attgaatgat gtacaattgc ttaatgatga agaacaaaca ctttgtactg atggctttta 5340 tcaagccagg actcctattg taaagcacct cccaagtccc atcttttcac acaattgcaa 5400 tgtgtgcagt ggctgccttg ggagtggaac attcacaagt tgtttttttg tattgacctt 5460 tgagtgtatg gtggtgctgt aattttccac agcacaggca tatccatgga agaaaaactg 5520 cccaactgtg tatctttagt aacctggaaa ataggagaag acatcatgtg agatttcaaa 5580 agaaattgta aatgtaaata gctgtaatca aataggatct gtaaacccta tgattcacat 5640 tccaatatat ctatgtacat atcaaagtca tgattgttgt ttcccttttg tccctttatt 5700 gatttaatca cactaaatac aataaggtag tcagtttaca ggtgagattc aatgggaaaa 5760 cgtgtctatt taaccaaaaa acagtcaaat actgtccctg cataaattat gctaatttgg 5820 cctaatgtat gctaattaag cataaattat gcaaattaat gaagatatgc ttcataccaa 5880 catacccacc aagtttgaac atcatatgac caagtgatgt taagatatat taatccaact 5940 gacaaagcaa atttcttgta attcttccca ttgaaaacaa tgggaagacg ggtccgtcct 6000 tgaacccatt gaccccatag ttaaggcctt tgtttcttgg tgacccaaat cctcgatttc 6060 caagaaatga aattcttgta actcttcaac catacatcat agaaagatga aataaaaacc 6120 attgtaattg tcagatctgg ctctatgata tgatataagt ttcagtaccc taggacaaag 6180 gacaattttt gacctgaaca cccta 6205 // ID Academ-3_BF repbase; DNA; INV; 8481 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 8481 BP; 2537 A; 1806 C; 1885 G; 2253 T; 0 other; tagtacagga gccagaccct gaaaaaaatg atttcgttgc acccacttta cagaaaaggt 60 ttgacctaat tttgtggtag aggggtcttt ggtccatcac ccagcatgat ttttatgccc 120 aaattgtggt atcacttttt tgctagcgaa gcattaaggt agaacactta aactgcataa 180 gcgaaaaatt catttttcaa agggggaggg gcaaaaaatt aaacatcaat gattttcgat 240 gaaaattggt aggcatataa aacttttcat tggaattacc ccaaaaaaat ttttttccca 300 gaattgtact ttggtgtaaa attatggccc taaaaagttc gatttttaag caattttggt 360 actaaaaaac gagatgttga cgttacataa tcaggcaatt atttgtgtca cttatggggc 420 cttgtttgat acaatatttg gaatggccta aatattcaca ttattctaac cttaaaatca 480 tcgaattccg acttttctat aaaaagatat gagcgtttga agaattactt accctcaagg 540 ctattgtcgg aaaagcgttg ctagggccgt tgctaaggat ggtctagaag gctttcccac 600 ataaatacga ctcctaagtc atagtttcag actttaatta aactgtttgt ccttaacata 660 atgaatatat agtatgtagt gactatcaag ctatcaattg tgaaagggtt ttataaattt 720 tagtttagaa accttatttc tcacaacatg actatgctat acataatgac aaaattattg 780 ctacggaggt cgctatgata aacgtggagt gtaatctgga cacgtttctg tctacttttc 840 tgttcagttg gagtttgaca attgtctttc acaactaagg aaacttgaat aaaactgtca 900 aggaaccaaa ccaaggacac catggccatg tatagcgtta cccccgattg atcgttgctg 960 tgccgtccac acttcgagtc tccgttatag cccgcttgct cgttcactga tgaaggcgca 1020 cgttaggagt tctttgaatt tcaatatgaa cagaccacaa tgatccaata taatacgatg 1080 ccaatacaaa agtagaacca atacgatgcc ccacaacgcc ggcattcctg tgattactgt 1140 ccttgagacc gccgtgaact ttgccctgaa tatttgcaca cacgcaaaat attacaatgt 1200 ctcaaatgtt gcgagacctt ggtaaatggt ataggttctg ctactggtga catcttccaa 1260 aatcttcgac atatttgtga tatgtaaaag tggcacatga ttgctatcct tccgaaatac 1320 attatgtaaa attttctgat aacgttttga tgctatggga caaggtgact gaacgcgtaa 1380 agggctcaac agctgcacag gagcagtatc gcgttacaca aggcggcgga tgcgaccaag 1440 ttttaggggt ggacataatg ctctatgttg tagagataaa acgcagctat tggctcatta 1500 cttgtgatgt tttgttacaa ttgacattga atatgacaat aaacaacaac agaatctact 1560 gtgtgtccca atttttaatt gatatactct ttgttttggg atcatgccag tggtatttga 1620 gtggaatagt ccgagtggtc gtcatatctc gcgccctttt caaagatttg cgtgttgcta 1680 tgacaggact tgtggcatca ttttgacttt cgaaataagg taagatttac gaagggtttt 1740 gcaaagcaaa atgggcattt ttgatcagta aggattgctt aacaattctg acaaagtgtt 1800 gtgtcctaac aatgggtatt cttggaggtg ttacaagatg aagtgttttg tatgtgtgat 1860 gacgcgttaa gggcgggaaa ccgccttcca tataaggcgg acgttttgac ataacgttac 1920 atgtatgctg cagtattgcg gtgtcatagc ttcgaacgca tatgaaaaat ggggagaaag 1980 ccggggaaat tgctcacttg aattacaaaa tgtagattat agatagcata tagcaacgtg 2040 atatctctat ttcctatttc aaataaaatt ttagctgccg gaaataatgc ggtttgtaat 2100 gttatcaaaa gaaaggtttc agcacacgtg tattacgtcg aagggcggtt taatatacag 2160 atacaggcta tatagataca gatacgtctg actatgaaga aaataactgt aaaagccatc 2220 aataaaattg attaagcact aactaattgg ttcagtaatt aattagaaag tagcaatatc 2280 taatgtacca caactcgttc attacaacat cttaaacatt taaacatgca ggtaatgcag 2340 gaaaatggaa agtgaagaag acgccgtttg gcttctcccg cctgaaaaaa ggaccaggac 2400 atctgtagcg acgagtaaaa cagtaagccc ttcctcttca gcaaccgctg tgccgccagt 2460 tccagtgtgc cgttcaaaga caatttcctt tgacttttca aagtgtattt tctgccgaaa 2520 gcaaacgcat aagaaagtaa aagatctcat caatgtgtca acctttgaag catgtcagcg 2580 tattgtagag gctgctcatg tcaaggagga cggggaatta cttcgtatcc tcggggcctt 2640 taacaacgac ttaattgcag ccgaagcgaa gtaccataag gcttgtcagg cgtcttacac 2700 tagtaagtca aatcttaaag ttgcctcagg taaggccagt gatggtttct cacaagcctt 2760 tcgagaccta ctcgacgaag tccagcccca tcttgattct ggacgagcct ttaccatgtc 2820 ggccctgttg gacaggtaca agaccaatct tgagaaaagg agtgaggata gcaaaagcta 2880 taccaagtat catcttaagg caaaacttca aggtcatttt ggggcaagca tagctttcca 2940 tgtcccctat gatcgatcca tgtctgagct ggtctactca agctccgtgt gtctggagga 3000 tgttcttgat gcagcctata ggacacctac agctgttccg actgctactg atgatgcgca 3060 acaaagcaag tctgcacaag caacagaagg catggacaag gacctgcttc tctaccacgc 3120 tgccatggca atcaagtctg acatcagcaa tgccaccggg attcagactc aaccaatcag 3180 tgtggcagac atgacgcttg ctacgtgcag gtcagttgta cccgacagtc tgttcaagct 3240 tttgcaacac ataataacag ttggacatga tgatacacct agtgatcagg aaaggaagat 3300 cttaatgttg gcccaggata tcatgcactg tgcaagctat ggaagaataa agacaccaaa 3360 gcacacgtgc ttgggactaa ctgtacggca cttaacacga tccaagcagc tcttaactat 3420 cctgaacaga atggggcact gtgcgtcata tgacgaaata gaagtagtag acactagcct 3480 tgcaacagaa tgtattgcta aggctgacat ctacggagtc gtcgtgcctt caaacattaa 3540 gccaggaaca tttgttcagg cggcagctga taataatgac cttagtgagg aaacgttgga 3600 cggcaagaac actacccatg ccaccacact cgtgctgttc cagcgaggca cctatggacc 3660 agcgcccaga agccagacac tgggagacca ctcagtaaag aagaaatcgc taacgtcctc 3720 ctgtgccaca gagattctag attttagtgc gtgtgggaaa aggccagccg ttaccacttt 3780 cctcaacaag gtaggcgagc taatcgggcg acaagaagac cacgacagta tctgtaccac 3840 agcgatggtc gacatggaat gggtactgat gcgaatgctg ccaacaaagc tggtagacgt 3900 atcccttcac cagaaccgac ccgaccagtc gattcctggc tggagcggat tccacgccgc 3960 cgtcacctcc actttctcac cacacatgct cagtacagtg attggctact gtcccatgat 4020 tcaaggatcg cccacagaat acagcacagt gtacacagtc atgaagcagg ttcaggccat 4080 gatgaaacac ctcgaccagg aagacagcgt aataacgttt gaccttgcca tctactctat 4140 ggcaaaagag attcaatggc ggctccccga ggagttctct gacactgtca tccgccttgg 4200 tggctttcac attgctctta actatctatc cctccttggc aagcagtaca aaggatctgg 4260 gttagatgac ctgctactcg agtcgggtgt gtacgggtca aacactgcca caattctcct 4320 tgaagggaag tcatacaata ggggagtgag ggcacacaag cttacaatgg aggtcatgct 4380 acgtctacag tggcaggcat acatttcctc ggtagttcct gaaggagggg agatcccacc 4440 tgacgttgaa gaggaggtca gcgcagtcca gcttgcatat gaacgggcag gtgatttgca 4500 cgagccaatg acgtccctgc atgaggccat gcctgccctg atgacgaagt tcgagcactt 4560 taagacttca atgaaagcaa agtcccacct cttctctttc tgggataact atgtctcaat 4620 ggtgctcctg ttactccagt tcataaaagc agagagaagt ggagattggt cactacattt 4680 ggcgtcaacg gcatgcatgg taccatgctt taactcaatg gaccgcacaa actatgccag 4740 atggcttcct gtgtatcttg ccgacatgcg acgcctaccg gagacacatc cccaggttca 4800 caatgccttc atggccggcg accacgccat ttcccggtct aaccaaccct tcgcaaaggt 4860 atggacggac atggccctcg aacaatctat taatctagac tccaagacca gcgggggcat 4920 cgtgggtatc agccaaaggc ctggcgccct tgagcgatgg tttgcaacgg tacacgaaag 4980 ggtggcagtg gtgacggcgc tcaaagaaat gtgcggcatt gccgattctg acctgattgc 5040 cacgcataaa gaagccaggg acggaagggt aaagagagat gaggacgacg tgcgaaagat 5100 gcgggctatg ttcgaagtgg ggctgatgac aaatccgttt tgtgccgatg tcagtcaaga 5160 cattcaccca ctgataaact tcgcgactgg cgtcgtgatg cctgaagacg cagccgtacg 5220 actgatcagt gcgtatgaga taggtctaca acaatgctcc acgtttgtga ggcaacgact 5280 tgattcaaac gagaaaggct tttgggagcc gttacctcac ctcaagatca agaccttcca 5340 ctccctttca caaaaaagga gaatgactac caatgagaag gtagttacaa tcagtgctga 5400 tcgcgatttg ttcggtcgtc ttgtaatcgc agccaggtct cgtcagattg acctgaaaga 5460 agtcctaacc tacgaactgt gcgcggtccc tctctccttg gtgcatcccg acggtacgat 5520 gaggaagacg tgcaaaagtg tgctgctgtc agagttggag aaagaagttc aagtacatgc 5580 acgcttacct gttacaaatg acatatctac tgcatacatc tctgatggaa tggccacgat 5640 tcaggcggta gggacaggag gtgcagcttt atttggtgat ctggctacgc tacattacag 5700 gatgctgacg tcgaacctcg gacagcaatg ccaaagagtg gacgtggtgt tcgaccaata 5760 caacaagacg agcatcaaag atggagaaag acagcgaaga gatcagcaaa gtacgcttga 5820 agtcaagatt cagggtcaca ccacaccgac accaaaacag tggaagaaat acgtggccaa 5880 tcccaagaac aaggagaatc tatctgagta tctgacacaa agtttctgtg aactaggaat 5940 gagccatctt caacctggtc agaagatagt cgtaggaggt ggcttggaag atggccaaag 6000 agctataatg gtaacaaaca atcacacgga agatctcaag tgcctctatt cagaccatga 6060 ggaggccgac acaaggatac tgctgcatgt tcaagacgct gcaagagagt gcagacgtat 6120 agttgtcaat tctcccgaca ccgatgtagc tgtcctctgc acgtcattct gcagtgagct 6180 gggttgtgac gagctatggt tcaggacagg ggttaaagat aaggcaaggt acataccagt 6240 acacagtttg agtcagtcgc tgggacctca actatgccag gcactaccag catttcatgc 6300 aataactggc tgtgactcaa ctgggagctt tcatgggatc gggaagaaga aagcgttgtc 6360 cgttctgcgc cagaatcctg agcatcagtc caacttggcg gtctttggtc aagaaccaaa 6420 gctgggagaa gagtgcttca ggagtagcga gaactttgtc tgtgatctct atgaatccgg 6480 aaaagctccg tgcacgacgg atgaattgcg gtacttcatc ttctgccaga agaaacaaag 6540 gaatgaagca cttccaccaa cttccaacag cctacgacat cacttacaaa ggtgtgcata 6600 ccaaacatgc ctgtggagga ggtcactgaa aggaatgcag aagatgccca aaccagatgg 6660 ccacggctgg gaagacaagg ggacagtgct agttccactc ctgatgtcta aggatccggc 6720 gccaaagagt ctgattgagt tgaacacatg cagttgtaca acttcatgta gtagagctaa 6780 ttgctcatgc agggtgcatg gtctagcatg cacggagtca tgtaaatgca tggggtgcga 6840 ctgccaaaat ccaaacaaac tagtggacat ggccagcagt gtttaaatga tgatgatgta 6900 cttagtttgg ttccttgcca gtttcatata tgttatttca tatatcacta actaagttgt 6960 gaaagacaat tgtcaaactc caagtgaaaa gaaaggtaga cgaaaatgtg ttcagattac 7020 acacaacatt gatcgtagag gccaccatag caatggcttt ggcaataaag aattagtgta 7080 aattatttac ccctgggaaa ccgattgttc acctgtgcag aatgtactgt atgtatgtac 7140 cgcagcttgt caatagcagc acaccagtac tacacagtac acccatagaa aagtacaatg 7200 tacgtgcatt cattctaaaa atactattgt cacacaaatg tacatattat gtgtattact 7260 accgtagtac tttgtgtatc tacatagatt ttcaacatgt tgtatatttc tgtatccatt 7320 tttcctatgt agtcaacgtt atcacctttg tggtgccctg aaaaccccca tgtagaacct 7380 tttcattgtg ttccacatag tagatataag cagttattta gtaaggtaca catgtacctg 7440 cataatggca ttagagagca acaatgaaaa tgagaaaaat cagcgtaatt ccttttcaag 7500 tatccttata tcccccatgt agagcccaaa ataaatttct gatcaatcaa tatcatcaat 7560 agatgatgat attttaagta atgtatgggt cacaatatga aatgagcaga atatgtcaga 7620 aatcttgatt ctctgctgta ttttgcttaa aacatgatga ctacccactt agcaacgggc 7680 ttagcaacaa ttttgctgat gattcttgtt tgtttgtttg tttgtttgtt tgtttgtgaa 7740 ataaatgaca ttcaaacgca atattcatgc atctgttaaa tgttttacat taccacaact 7800 ttcctaacac ttctactgag ataactacgg caaattacga cacaaaagtt agttctggag 7860 tcgtatttat gtgggaaagc cttctagacc atccttagca acggccctag caacgctttt 7920 ccgacaatag ccttgagggt aagtaattct tcaaacgctc atatcttttt atagaaaagt 7980 cggaattcga tgattttaag gttagaataa tgtgaatatt taggccattc caaatattgt 8040 atcaaacaag gccccataag tgacacaaat aattgcctga ttttgtaacg tcaacatctc 8100 gttttttagt accaaaattg cttaaaaatc gaacttttta gggccataat tttacaccaa 8160 agtacaattt tgggaaaaaa aatttttttg gggtaattcc aatgaaaagt tttatatgcc 8220 taccaatttt catcgaaaat cattgatgtt taattttttg cccctccccc tttgaaaaat 8280 gaatttttcg cttatgcagt ttaagtgttc taccttaatg cttcgctagc aaaaaagtga 8340 taccacaatt tgggcataaa aatcatgctg ggtgatggac caaagacccc tctaccacaa 8400 aattaggtca aaccttttct gtaaagtggg tgcaacgaaa tcacatttct aggcctcttt 8460 caggatttgg ctcctgtact a 8481 // ID BEL-9_CQ-LTR repbase; DNA; INV; 208 BP. XX AC AAWU01008465; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_CQ_; KW BEL-9_CQ-I; BEL-9_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-208 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 172-172 (2011). XX DR GenBank; AAWU01008465; Positions 392 185. XX SQ Sequence 208 BP; 65 A; 42 C; 52 G; 49 T; 0 other; tgttacggtg tagctaactg tcctaccccg cgagtagtga caggactggc ttgaattttt 60 cttgccgcat cgcagcggaa ctagatcgac gaagggaagg cgaaaaagga caagaacgaa 120 ctgacactcg cgcaacgcgg acgcgtttga attgaataaa aattagtttg taaaatacag 180 tccattgaat ttgattttga ccgaaaca 208 // ID BEL-112_AA-LTR repbase; DNA; INV; 703 BP. XX AC AAGE02020778; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-112_AA_; KW BEL-112_AA-I; BEL-112_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-703 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020778; Positions 122208 122910. XX SQ Sequence 703 BP; 252 A; 131 C; 122 G; 198 T; 0 other; tgttccgatc aacaccactg ggctcatcac gatatccctg ccggagtgtg cagagtcggt 60 gcgcacgtac aatacagcga gactggcaac tcaggacagt ataagtaacg tagttgtaag 120 atccagaaca aaacaaacca aaataattat tcataaaacg tggaggtgtg ctatattgaa 180 atcattaaaa tttactctat tttgttgcaa aatacactgc aaaaatttcc aataagcttg 240 aacatttact gaagaagtaa gttttattga attgggtaaa tatttatcct caaactaact 300 taaagttcaa taattaggca cggaacgata actcaggtta ggcttgcagc aattgcacca 360 aaggctattg caaactccag aaaattaaag taagttctta cacatttcct atcaactacc 420 tgaaatccta atactagata catttgcttg aaactattta caaattcctt attctactat 480 gtacaattta atcgtgaact atatggagta ggagatcgaa cattgccagt gttaacctag 540 cgaggcccat ttgcaagagg ctgacggatt gaaaaccaaa ttgtgagtag taaaatctta 600 actgtcaatt gaaatttact aaatacatac catattctag ctttgatcag ctgtaaaaga 660 agacggtgat tccaaagact ggttcctaga ataaatccga aca 703 // ID Sola1-N2_AAe repbase; DNA; INV; 424 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-424 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1292-1292 (2011). XX DR [2] (Consensus) XX CC ~93% identical to consensus. 4-bp TSDs. TIRs are 32 bp long. XX SQ Sequence 424 BP; 141 A; 71 C; 74 G; 138 T; 0 other; ctgcccataa aagcataact gtcccatatg gattttcgaa ccaacacctt ttttcgtcgc 60 aacattcatg ttttaatata tactttacta tgcatatgaa aatttacaca ctttgtttga 120 aaatgttgtg gaaaaatatg aagttggtgc agtcccatat tgaaaaataa aggcataaca 180 gtctctatat gaactttcaa cccaaatagc aactgcaaaa cgtgaattgt gttcaagttt 240 atattttttg ctgatcaata taagcaaaat gagttatctg tgcggtggtg ctaaatgaat 300 atggcatgca gaaatgtact agaaaattat gaacatttgt atgagaaggc aaacttcaaa 360 ctttgagcgc tgttttctca atgcctacta attccatatg ggacagttat gcctttatgg 420 gcag 424 // ID DNA8-97_AP repbase; DNA; INV; 698 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-97_AP. XX NM DNA8-97_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-698 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2034-2034 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 698 BP; 205 A; 75 C; 94 G; 324 T; 0 other; actagggcta ggattttgta gcatttgcat tttctttcat agtactacag gacatagcta 60 atcaaaaaca aaattcgatt cattcatcac agagttcaaa tatgcaattt tgattttgct 120 ttttttgcat tttcttatga tattcgtaaa taaatgcttt tttggtactt ttttgtcagt 180 ttattagctt tttaattagt ttttgttaat tatacgcata tttaacattt ttagaacatt 240 tacgtacata tttatctaaa ttattattta ttaatttatt ttaattgtat tattatatat 300 acttatttcg tgatttcgta gtgaaatatt atcgccttat cggaaaatat ttctgaaaat 360 agattcttag attttgtcaa agtgattacc gagtggggag tagtggttac gtaatacgtt 420 gacatttatc gacatttaat aaaacggttt ttaatttatt tttttcaata tttataaatt 480 gtgcttagtt gttctaagaa ttacttagtg tgaacatata aaaatgtgtt taataaatta 540 ttttttgata ttaaaaatat ttttcagcta aacttaaaaa tgttcaagtc ttttttagtt 600 gcattttctt ggttaaaatg cttttttttg tagcattatt ggcaattttt taattccttt 660 gttgtaggtt tttagtgctt tattttccta gccctagt 698 // ID Copia-36_AA-LTR repbase; DNA; INV; 211 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-36_AA_; KW Copia-36_AA-I; Copia-36_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-211 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 960-960 (2011). XX DR [2] (Consensus) XX SQ Sequence 211 BP; 52 A; 41 C; 44 G; 74 T; 0 other; tgttggcata taatttactt tgggtcgaca tgcagagaag taattctggc agcactgcac 60 attcggtgcg gtgttcgcgc tcaaaataaa aacattgttt tgtctgtttt caaagttgaa 120 ataaagcaca cgcgtttttt ctttcgttga tcgtatattt cgtcgtagta agttagctag 180 ttccgaaaat ccttttgctt ccacctggcc a 211 // ID Harb_Cis1 repbase; DNA; INV; 4252 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Harbinger DNA transposon from Ciona savignyi. XX KW Harbinger; DNA transposon; Transposable Element; Harb_Cis1. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-4252 RA Smit A.F.; RT "Harb_Cis1 - Harbinger DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000586 The ORF from bp 282-1580 encodes a protein 34% CC identical (48% similar) to the Danio rerio Harbinger1 CC transposase. Anopheles gambiae contains closer related Harbinger CC transposons. TWA target site preference. XX SQ Sequence 4252 BP; 1353 A; 823 C; 763 G; 1308 T; 5 other; agccgcctcc acacagaacg cggtttgcgt ttcacacggt agcggtaaaa ccgcccattt 60 taagagcggc tctaccgcgc ggttgccaag cggtaaacat gttcacacaa aacgcgttga 120 cgcaacacta attttcgcgg gaaaacttac atacagacca gcttttcagc attagaagtg 180 tccaaaatga aagtgtctaa tagaaaaaag ctattggtgt taagtcacaa ccagattttg 240 atggcataca tatcaagcct tgtaacaaaa tctcaaaggt aatgtatagg atacgttgcg 300 ttttgtaaaa actattttaa cttatcttta atttatttta cgcccagaaa gtggtgggtt 360 cgacctgttt ttgcgcacag agtaattcgt ggcgaagtga ttctcgtaga tgagataaga 420 acagatcccg aacttttttt cgagtatttt cgaatgagcc caacaacttt tgacgagctg 480 ctttcgcacg tggagccact tatatacaaa atcacaacta actggagaac accaatctcg 540 gctgataaac ggcttgcctt aactcttcgg tatgtagctc gaatttacta tcaaggccga 600 tttgcattan atactggcta tttttttaga tatctggcga ctggaaactc gatgcgttcg 660 ctgtctttca gttttcgcat cggatatagt acctgctgtg aaatcatcgg ggaaacatgt 720 gaagcaatat ggtctacact acgaccacta tacttgaaga cccctacatc acctcaagaa 780 tggcgattaa tcgcctctaa atttgaaagt aattggaatt ttccactgtg tatgggggcc 840 atcgacggta agcacgtgat gatccaagct cctgaaaagc agggctccga gtattacaat 900 tacaaaggat atcacagtat tattcttttg gtcgccttat gtgatgcaga atattgcttt 960 acggcagttg atatcggaga ttccggtcga cacagtgacg ggggggcatt ttccaattcc 1020 cagtttggaa aatgttttac ccagaaggaa aatgttctgt gcgttccaca agatgacctt 1080 ctcccaggtt cttctatccg agtaccgtac tgtatagtgg gggacgcggc ttttcctctc 1140 aggaaaaaca taatgagacc gtatcctggg cgatacttac cagacgacca acaaatattt 1200 aactatcgtt tgtcacgcgc acgccgagtt gttgaaaatg cattcggaat tctttcatca 1260 aggtggagaa tatttcgaag accaattatt gcggaaccca aaaaagtgat tgcgataacc 1320 aaagcctgct gttgtttaca caactttctc caaactcgtg gaaaaatatg cccaccaacg 1380 tttgttgacc gagaagatgc cgaaggtaat gttatactgg gagactggcg gtcaagcgga 1440 agcattctgc aacctgtagg gcgcacagga agtaacactt atcctcaatc agcttctgac 1500 atccggaaca aatttaaaga ctattttata tcaaacgttg gtgaagttga ctggcagtat 1560 agagtcatta aaaggcactg atttttaccg tagtaaagtc attgctgcat tttcgttgta 1620 agttgtgttg tttatagttt acttaaataa taacaaaaat tgttgcamaa agcctgagtt 1680 ttattattga ttatacttta gacgatataa atactccatt atgtgaagca ttgcttcatc 1740 gctctggtct gatggcaaac cgcgcagttt tttttcaact aaggcactaa aattagaaaa 1800 ctcgtctgag tcntgcccct catcaantcc tttggcacac attttctcaa tcgcatccgt 1860 gctcgattga acagcacata ttagagactc ctgtgaaaga agaaaaatat attaaaatca 1920 cagagattat gattaaaaaa ggacaataga ttatatttat atgcaataaa atttcatgcg 1980 gtacatatat acaatacaaa ctatatgcca catatatata tggattgcag tttatatgtt 2040 aagacaatgg ataaatcggg tcattcgtcc aaacaattta aataaaaaaa attatcaaac 2100 tgactacagg gtaaaactaa aaattctatt tacttgaaac ttgtttgaaa ttttttgcct 2160 tttagggtgt ggtgctggag aatcacaatc gacttctagc gtgatagggt tattatcata 2220 actggacgtc gattgaacat cactaagatt gaatacatct ttctctgcaa atgtttttaa 2280 gagcacagat aataataaat ataaaaaata aggctacaag acaaaaacaa agataatagg 2340 atgtatattc atattttata cccttgaatg ttaaaagaca aggggtgtta gcacacagcc 2400 aattgtcaga ttactggttt ccaacattgg gttaccaaag ctattgcata tgattgagac 2460 attttaacag cttgatcaat ttagttaggt aatcttcgct tagtggtcta agccagcaga 2520 tggtttcgat tataactttt agccaaaaac tgattctttt taaacaactg tgattatgca 2580 tccaaaaata aataaaattg tcaacatgta tatacatata atgtacgata ggtcataaat 2640 acaatatctc tagcaaggta aaaacttttt ctgattgtaa aattcagata attatctgac 2700 ctgtcacatc tgtctcactc atgtctgcaa cactaacagc tggcaatact actgaactac 2760 atgtcctgta agtcaaagtg aaataaattt acttataatc ctattaatat aaacattgta 2820 caacaggctt tgaaattaac tagtaaaaca agtaatcatt acaatgtacc ggtagaatgt 2880 aagatttatg tgtgtttaaa taatgtctag ctaatgttaa ctgacaattt acaggcctgg 2940 ctctgtactt catttaagtc tgttagatct atacccaact tcaaaactat tacaagtata 3000 aataagagct gataggctaa actctttagc ttgtagacaa ttaactgcca caccatttat 3060 aagcaaaaaa acttacactt tgtgtctcaa gtgatcattt aaaaactgca gagcctcgta 3120 gtgagtccag catgattgcc atgacgcagc tgctccactt ttttgagaat ccattttgcc 3180 ccggcgttct ttggcataac ggtcccttaa acttttccac cgtttagagc atacttcagc 3240 tttgcagaat aaatatgtta ttatctttgc aatactagct taaaattaca tgaacatcta 3300 tataataatt aaacctgaca atcaaacatt ntaaccatac caacattcat agttttccaa 3360 atattaaaat tttgcgtccc taggtggcaa aacaacaagt ttagtttaga ctaccagttt 3420 cttacgagat tgtttcaatt gagatccaat ttctttccaa gcattttctt ttcgaatgtt 3480 gtcctttttg ccgctgcatt gcgaactgta caggcaagga tattttctaa ccaattcaat 3540 caaaatttca ttaaagttca tcgtgccgtt ctgtaattta ggttacatgt aattaggctt 3600 accaaaagtc aaggtacttg atataacaat ctgcaaacca gtataattgt ataatttaaa 3660 atagtttagt agctaacaaa ttaggcatat gggcaaacga ctaagaatgg aatacggtac 3720 ctgtataccc gtagttattt gcattttgtc ctattttaag tgcatttacg aatattggta 3780 cgaaggtaat tagcctacta aaatacagtt agaaagaaat agcactatgc tctatggtgt 3840 caacttgtat ttttatacat tttacacgat actcgttttt tcaaaagtga tttcttactt 3900 acttcaagta tctttggact tggtctacac tccctgttgt actttggcca agtattttag 3960 gtctacctat tgctaacctg ccgacacaat tcaaatttaa cggtaatata tttgcttgct 4020 ttcaatatat tatacatcct actgtttact attatgggca aaacagaaat ggaatatttg 4080 ccgccactac cgcgcgtaaa agataaacta aatttatgtc tgccgctacc gcaaaaacca 4140 accgcacagc cgcacgaacg cagcaaaatc ggctgtgtgt atccgctagc cgctgccgcg 4200 aaccgccaac tgcgatattt taccgccaaa cgcgttctgt gtggaggcgg ct 4252 // ID Sat627_Cis repbase; DNA; INV; 147 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat627_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-147 RA Smit A.F.; RT "Sat627_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000627. XX SQ Sequence 147 BP; 42 A; 39 C; 39 G; 27 T; 0 other; taagggagcg ttaccacacg ggcaatcgtc agattacctc gcgaatacgt aagggagcgt 60 taccacacgg gcaatcgtca gattacctcg cgaatacgta agggagcgtt accacacggg 120 caatcgtcag attacctcgc gaatacg 147 // ID PONY_AA repbase; DNA; INV; 511 BP. XX AC . XX DT 29-OCT-2001 (Rel. 6.09, Created) DT 29-OCT-2001 (Rel. 6.09, Last updated, Version 1) XX DE Non-autonomous DNA transposon PONY - a consensus. XX KW DNA transposon; Transposable Element; PONY_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Tu Z.; RT "Molecular and evolutionary analysis of two divergent subfamilies RT of a novel miniature inverted repeat transposable element in the RT yellow fever mosquito, Aedes aegypti."; RL Mol. Biol. Evol 17(9), 1313-1325 (2000). XX DR [1] (Consensus) XX SQ Sequence 511 BP; 170 A; 96 C; 92 G; 150 T; 3 other; taccgttttg attcatatta cggacactta aggcctcagt gaagtataac ccagcataga 60 gcatacaaaa taaatcattc tgtatgattc cttagcgcta tcgagcgtcg gaaaccctta 120 actttcgaat ggtgggtgaa gaatacgctt ccgcaacttg aataatttta aataaatcaa 180 aatgtgtggc cttttcatga ttcttattcc ggacgcttcc tcacttttgc ctcatattcc 240 ggacactttg attcaaattc cggacagctc atgaaaatca ttaatagaac agtcaaatca 300 ttaattgaaa tcgttaaacc actaaagaga cgtctaargy agttgggcat tataaatttt 360 canagatatt tatggaaaat gcttattaaa acgagcctcg aaagtgagaa cttttgaacg 420 gcaaaaattg aaacatttcg tgtgaaatgt ttcccataca aagtagagtg tccggaatat 480 gaagctgtcc gtaatctgag ccaaaacggt a 511 // ID CASAT_HD repbase; DNA; INV; 280 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Haliotis discus discus DNA, CA repeat region. XX KW SAT; Satellite; Simple Repeat; CA repeat; CASAT_HD; KW satellite repeat. XX OS Haliotis discus OC Eukaryota; Metazoa; Mollusca; Gastropoda; Vetigastropoda; OC Haliotoidea; Haliotidae; Haliotis. XX RN [1] RA Sekino M., Takahashi H. and Hara M.; RT "Construction of a microsatellite repeat sequences-enriched RT library in Japanese abalone, Haliotis discus discus."; RL Unpublished. XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 280 BP; 86 A; 72 C; 63 G; 59 T; 0 other; caaaagacgt taaaagatgg tacttgttgt tgccctgtct ggcgctcggc attaaggggc 60 taaaacaagg actggtcggc tcggagtcag tatactgtgt ctgagtgggg tattcatgtt 120 aaactgcggc atggtatctc agtgagctag cactataaat cggcatcagt ccggactagt 180 acaagcaacc acacacacac acacacacac acacacacac acacacacac acacacacac 240 gtgcatgcac gtcgctatat aagtgcaata atcttggacg 280 // ID Gypsy-39_CQ-I repbase; DNA; INV; 4752 BP. XX AC AAWU01034372; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_CQ_; KW Gypsy-39_CQ-LTR; Gypsy-39_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4752 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 457-457 (2011). XX DR Genome; AAWU01034372; Positions 52270 47519. XX CC Positions [3592-4056] - Integrase core CC 'CAAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 133..4740 FT /product="Gypsy-39_CQ-I_1p" FT /translation="MSDSDAQLLYQEQLRQERLRQEQLQQEQLRQEQLREE FT QLRQEQLRQEQLQEQRRQEQLQEQRRQEQLQQEQLRQEQLRQQQLQNPDPM FT IRMMGMFEQLVASVQQTVQQQQQFMERFSRSATQPPPNPEQIIDSLAGNIK FT DFRYDADNSVTFAAWYSRYDDLFAQDAARLQDDAKVRLLLRKLGLPEHERY FT VSFILPAVPKDFSFADTVDTLKSLFGAKESVVSRRYRCLQISKQPTEDHVS FT YACRVNKLCVEFDLGKLTQDEFKCLVYVCGLKSGSDAEIRTRLLSKIEENT FT NVTLQQLSDECQRLLNLQHDNAMIETPSTSDVNKVEQRFDRNKSNRRDRDQ FT QPYAAGKSKSKPPRPCWFCGADHYVRDCTFRNHKCADCGVTGHREGYCSSA FT KPLKPFKRGGPRDSVSSNVVVINECTVQKRRRFVSVDFGGTPIRLQLDTAS FT DITVISKELWNEVGCPRLSPPSVRAKTASGTELSLNGEFRCDITIAGSTRN FT ELIRVTEKPLQLLGSDLVDSFGLASIPMDSYCCNVSSVPDPAPALKSAFPK FT VFSKNLGLCTKTKVKLELKENCRPVFCPKRPVAYAMVEAVDRELDRLQQLN FT IITPIDYSEWAAPIVVVRKANGSIRICGDYSTGLNAVLQPHQYPLPLPEDI FT FAKLANCKVFSQIDLSDAFLQVEVDEQDRHLLTINTHRGLYLYNRLAPGVK FT NAPGAFQQITSTMLAGIPAASCYLDDVIVGGATQEEHDRNLKAVLQRIQDY FT GFTIRLEKCSFGKTEIRYLGHIIDSRGLRPDSAKIEAIVNMPPPTDVSGVR FT SFLGAINYYGKFVPNMRMLRYPIDKLLKEDAKFVWDSECQKAFEKFKQILT FT SGLLLTHYDPRQEIIVSADASSVGLGATISHKFPDGSVKVVQHASRALTKA FT EQNYSQPDREGLAIVFAVTKFHKMIFGRRFLLQTDHAPLLRIFGSKKGIPV FT YTANRLQRFALNLLLYDFSIQYVSTDKFGDADVLSRLINQHVRPEEDYVIA FT SVRLEEDIRSVATESINALPLSFRAVAKSTQSDPQLRKIYRYVLEGWPRAK FT VADPEIRRYQTRADSLTVVDGCILFAERLVIPAQHRKRCLEQLHRGHPGMV FT RMKQIARSYVYWPSLDDEIVGYVKACQPCASVARSPPHSAPVPWPKAAGPW FT QRIHIDFAGPINGDYFLIAVDSFSKWPEVVRTQRISALATIAILRRMFSQL FT GMPTLIVSDNGTQFTSAEFAEFCASNGIHHTTTAPYHPQSNGQAERFVDTF FT KRAVKKISEGRGTIEEALDIFLLAYRSTPNRCAPEGKSPAEIMFGRKIRTC FT LELLRPPADPEPPQKEDQRRSFAKQDLVYAQVHSGNAWKWTPGVILERIGS FT VMYNVHVGNQRVIRSHINQLRSRSESGPAASLCAKPKALPLDVLLGEWKLP FT QPSLVPELPPLSPMQLTPRPSSLELPDPPETQTSASSTPRPSTPESPDQVE FT TPPCASSTPRLGSLPPQESGATPSSSAASSLSSSSSSSTSEPTIVPQLVPE FT LRRSDRNRRPPVRFDYYQLY" XX SQ Sequence 4752 BP; 1119 A; 1455 C; 1254 G; 924 T; 0 other; gtggcgacga aaattcgtcg aatcgccggg attttattcg tttttcgaag gtgggaatcg 60 tcctaaccta aaacaacaaa aaaaatttcg gcacccgcgt gcaaacgaag gaggagctcc 120 agcggatccg aaatgagcga ctcggatgcc cagctcctct accaggagca gcttcgccag 180 gaacgactgc ggcaggagca gctccagcag gaacaactac gccaagagca gctccgggaa 240 gaacagctcc ggcaggaaca gctgcgccaa gaacagctgc aggaacagcg ccggcaggaa 300 cagctgcagg aacagcgccg gcaggaacaa ctccagcagg aacaactccg tcaagaacaa 360 ctccgccagc agcagctgca gaatcctgat ccgatgatcc ggatgatggg gatgttcgag 420 cagctggtag cttcggtgca gcagactgtc cagcagcagc aacagttcat ggagcggttt 480 tcaagaagcg caacccagcc acctccaaac ccggaacaga tcatcgactc gcttgctggg 540 aacatcaagg acttccggta cgacgccgac aacagcgtaa ccttcgcggc gtggtactcc 600 cggtacgacg atctgttcgc ccaagacgct gcccgtctgc aggatgacgc caaggtccgg 660 ctattgctgc gaaagctcgg ccttcccgag cacgaacgat acgtgagttt catactcccc 720 gccgtcccca aggacttcag cttcgccgac accgttgaca cgctcaagag cctcttcggt 780 gcgaaggaat cggtcgtgag caggcgctat cggtgcctcc aaatctcgaa gcagccaacc 840 gaggatcacg tctcgtacgc ctgtagagtg aacaagcttt gcgtcgaatt cgacctaggc 900 aaactcaccc aagatgagtt caaatgcctt gtttacgttt gcggattgaa gtcggggagc 960 gacgccgaaa tccgtacgcg acttctttcc aagatcgagg agaacactaa cgtcacactg 1020 cagcagctgt cagacgagtg tcagcgtcta ctaaacctac agcatgacaa cgccatgatc 1080 gagacgccgt caacctccga cgtaaacaaa gtcgagcaac gattcgaccg caacaaaagc 1140 aacaggcgcg accgagatca acaaccttac gctgccggca agtcaaaaag caagccgccg 1200 cgtccgtgct ggttttgtgg agcggaccac tacgtccgcg actgcacctt ccgaaaccac 1260 aagtgcgccg attgcggagt caccggacat cgcgagggct attgcagttc tgccaagccg 1320 ctgaaaccct tcaaacgagg aggcccgagg gactcggtca gcagtaacgt ggtggttatc 1380 aacgagtgca cggtacagaa gcgacggaga ttcgtttcgg ttgatttcgg cgggacaccg 1440 atccggcttc aactggacac agcatccgac atcaccgtga tcagcaagga actgtggaac 1500 gaagttggct gcccaaggtt atcaccgccc tccgttcgcg ctaaaacagc ttctgggacg 1560 gaattgtcgc tgaacggcga gttccgctgc gacatcacca tcgccggaag cacccgcaac 1620 gagctgatcc gcgtaaccga gaaaccactc cagctgcttg gctcggatct tgtggacagc 1680 tttgggctag catccatccc gatggacagc tactgctgca acgtatccag cgttcccgac 1740 ccggcgccgg cactcaagtc ggccttcccc aaggtgttca gcaaaaacct cggcttgtgc 1800 accaaaacaa aggtgaagtt ggagctgaaa gaaaactgcc gaccggtttt ctgtcccaaa 1860 cgtccggtgg cctacgcgat ggttgaggcc gtcgaccgcg aactggacag gttgcagcag 1920 ctcaacatca tcactccgat cgactactct gagtgggccg ccccgattgt cgtcgtgcgc 1980 aaggccaacg gctccatccg gatttgcgga gactactcca ccggcctcaa cgctgttctt 2040 cagccacacc agtacccgct gccgcttccg gaggacatct tcgccaagct ggccaactgc 2100 aaggtattta gccagatcga cctttctgat gctttcttgc aggttgaggt agacgaacag 2160 gaccgccact tgctcacgat caacacccat cgtggcctct acctctacaa ccgccttgcg 2220 cccggcgtca agaatgcccc cggcgcattc cagcagatca cgtccaccat gctggccgga 2280 atcccagcag cgtcctgcta tctcgatgac gtcatcgtcg gcggcgcaac ccaagaagag 2340 cacgatcgca acctcaaagc cgttctgcaa cgtattcaag actacggctt cacaattcgt 2400 ctcgagaagt gctcgtttgg gaagactgag atccgctact tggggcacat catcgacagc 2460 cgcgggctgc gaccagactc tgcgaagatc gaggccattg tcaacatgcc gcctcctact 2520 gacgtgtctg gtgttcgatc cttcctgggc gcaataaatt actacgggaa gttcgtgcct 2580 aacatgcgga tgctgcgcta cccaattgac aagctgctga aggaggatgc gaagtttgtg 2640 tgggattcgg agtgccaaaa agcttttgag aagttcaagc aaatcctcac ctccggtctg 2700 cttcttacac attacgaccc aaggcaggag attattgtct ctgctgacgc ctcatctgtt 2760 ggactggggg cgacgatttc gcacaaattc cccgatggtt ctgtgaaggt ggtccaacac 2820 gcatcgcgag cgctaacgaa ggcggagcag aactacagtc agccggatcg cgagggtttg 2880 gccatcgtgt ttgcggttac aaagttccac aaaatgattt ttggtcggag atttttgctg 2940 cagaccgacc acgcgccact acttcgcatc ttcgggtcaa aaaagggaat tccggtctac 3000 acagcaaacc ggttgcaacg cttcgcactg aacctgctcc tctacgattt ctccatccag 3060 tacgtctcca ccgacaagtt tggcgacgcc gacgtgctat ctcgtctgat caatcagcac 3120 gtacgaccag aggaggacta cgttatcgcc agcgtccgcc tagaggagga catcaggtca 3180 gtagccacgg agtcgattaa tgcgttgcct ctcagtttta gagccgttgc caaaagcacc 3240 cagtccgatc cacaactccg caaaatctat cggtacgttc ttgagggttg gccaagagcg 3300 aaagtcgctg atccggagat ccggcgctac caaacccgtg ccgactcgct caccgttgtg 3360 gacgggtgca tcctgttcgc cgaacgacta gtcatccctg ctcagcatcg caagcggtgt 3420 ctggagcagc tgcaccgagg ccatcctggg atggtgcgca tgaagcagat tgccaggagc 3480 tacgtgtact ggcccagcct ggatgacgaa atcgtgggct acgtgaaagc atgccagcca 3540 tgcgcctcgg tagcacggtc cccgccgcac tctgctcctg tgccatggcc gaaagcagct 3600 ggcccgtggc aacggatcca tatcgatttt gcgggaccga tcaacggcga ctacttcctg 3660 atagcagtgg actccttttc gaagtggcct gaagtagttc gcacgcagcg catctctgcc 3720 ctcgcaacca tcgctatcct ccgccggatg ttttcccaac tcggcatgcc taccctcatc 3780 gtgagcgaca acggaacgca gttcacgagt gccgaatttg cagagttttg cgcttcgaac 3840 ggcatccacc acaccacgac cgccccgtac cacccgcagt ccaacgggca agcggaacga 3900 ttcgtggaca ctttcaagcg tgccgtcaag aagatttcgg aggggagagg caccatcgaa 3960 gaagctttag acatcttcct gttggcgtac cgcagtaccc caaaccgctg cgcacctgag 4020 ggcaagtctc cagccgagat aatgttcggt cgcaaaatcc gaacctgtct cgaactactt 4080 cgcccacctg ccgatcccga accgccccag aaggaagacc aacgtcgaag cttcgccaag 4140 caagacctcg tgtacgctca agttcactcc ggcaacgcat ggaaatggac acctggagtc 4200 atcctcgagc ggattggaag tgtgatgtac aacgtgcacg tcggcaacca acgcgtcatc 4260 cgctcgcaca tcaaccaact gcgaagtcgc tccgagagcg gaccagctgc ttcactgtgt 4320 gccaaaccga aagcgctgcc gttagacgtc ctactcggtg agtggaaact tccgcaaccg 4380 tcgttggtcc ctgagctgcc accgttgagc ccgatgcagt tgacgccgcg tccgtcttct 4440 ctcgagcttc cggaccctcc tgagacgcaa acgagcgcgt ccagtacacc acgcccgtca 4500 accccagaat ctccggacca agttgagacg ccgccgtgtg cgtcctctac gccgcgtctt 4560 ggatcgctgc ctccacaaga gagcggtgca acaccatctt cttcggctgc gtcgtcgttg 4620 tcttcctcgt cttcatcaag tacgtcggag ccaaccatcg tgccgcagct ggtacctgaa 4680 cttcggcgtt ccgataggaa tcgtagaccg cctgtaagat tcgactacta ccagctgtat 4740 taagagggga ga 4752 // ID Mariner-1N1A_BF repbase; DNA; INV; 1281 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-1N1A_BF DNA transposon DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TA TSDs; Tc1-1N1A_BF; Mariner-1N1A_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1281 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1281 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-1281 RA Kapitonov V. and Jurka J.; RT "A family of Mariner-1N1A_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX SQ Sequence 1281 BP; 364 A; 273 C; 283 G; 360 T; 1 other; aactcggcac aaaaagtttg gaatatcaac tttggctgat catatttccg ttgtttctgc 60 atcaatttta atgtgttata tatcaataga aagcgtgtgt gatttccttt catatggtat 120 caaactcctt atgattataa attcgtggaa cgagcaccag ggctacttac gtcagtgggt 180 cacgaaaaaa aagtgcccaa atttcaattt ttgtgttcaa ctctgtatca tcctgctggt 240 atgggtgcgg gtgtcaagga ttcaatctat ccttttttca cgtaatcttt ggtatacaga 300 acatccaagt ttatctttaa catttgatta gagtatatgt gccmttttgg ctgttggatg 360 accgaatcga ggcgtggagg cggcaaggag aatgccacgc tgactgtcgc acggatagag 420 cgtataacag cgcgtagcgc ttggtggagg cagtgtcatg gtgtggtgta tgtatggctg 480 ccacaggaaa aacaaggttc accatcatcc cagacaagac aacctcaatg ctgtgagata 540 cagggacacc attctccatg cagtcgcaat gtcatacctc gtcagcgtgt gtgggggacc 600 aaatgcaatt ctgcaagtgt tagactacaa cgcccgtcca cacagagcaa ggatcattgc 660 agactgtgtg cgtggacggg cggtgtaatc ttacagaatg tcattcgacc ccatattgaa 720 ggggtaagac attgcgactg aatggagaat ggtgtccctg tatctcacag cattgaggtt 780 gtcttgtctg ggatgatggt gaaccttgtt tttcctgtgg cagccataca cacaccacac 840 caagcgctac gcgctgttac actctatccg tgcgacagtc agcgtggcat tctccttgcc 900 gcctccacac ctcgattcgg tcatccaaca gccaaaatga cacatatact cttatcaaat 960 gttaaagata aacttggatg ttctgtatac caaatattac gtgaaaaaag gatagattga 1020 atccttgaca cccacaccca taccagcagg atgatacagt gttgaacaca aaaattgaaa 1080 tttgggcact tttttttcgt gacccactga cgtaagtagc cctggtgctc gtttcacgaa 1140 tttataatca taaggagttt gataccatat gaaaggaaat cacacacgct ttctattgat 1200 atataacaca ttaaaattga tgcagaaaca acggaaatat gatcagccaa agttgatatt 1260 ccaaactttt tgtgccgagt t 1281 // ID hAT-5_SM repbase; DNA; INV; 2648 BP. XX AC . XX DT 08-OCT-2007 (Rel. 12.1, Created) DT 08-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-5_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2648 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1030-1030 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 578..2488 FT /product="hAT-5_SM_1p" FT /translation="MSENKKRKTEASSNRPFQDWWTDKYGMMKSNNNALCA FT LCGNTVVCRTSSVKRHYETTHSFLLSKSDSEQKEYIARDVKNKNLQSNSFT FT TFLKHTNYLAAASFEVSKTIATHSKPFSDGDYIKEAFIKCAPHLFEDFQNK FT EQILKRIEELSISRNTVKDRILRMHKNINIKKINDIRSAKLFSICLDESTD FT VTSSARLSIIARYAKGEKIVEELLSLVSLSAKTTGKDISDAVIKTFQIAEI FT DTSTIVSITTDGAPSMIGRQAGFVSLFQEHVGHPIINFHCIIHQEVLCAKS FT GLAAFDDILSIVSKIINFIAARALNKRQFQLLLQEVGSNYHGLLMYNHVRW FT LSRGFVLQRFVECFDEIIMFLNDKKPEFPELFDNNWTTKLMFLTDISNHLN FT ELNIKLQGPGKSIDVMFDIIEGFESKLVIFKRDIDNNNFKYFPLLKDHLQK FT FSIHGVYEININFYSQVINDIQDSFAKRFQDFKKFQPLINFIKYPDKFDLE FT TAKMDQLDWLQLENLEMQLAEFQSSSIWKLKFIELRQEIENNEKKRLNEEE FT YENMNNMLLKTWCSLPETFNAMKKLSLAILTIFSSTYSCESLFSIMNNIKS FT DVRNRLTDECGEACISLKLSNYDPDIDALSRNIQQQKSH" XX SQ Sequence 2648 BP; 985 A; 354 C; 420 G; 889 T; 0 other; cagggatggc gaacctatgg cacgcgtgcc caaagtggca cgcagctaaa ttttcgctgg 60 cacgtataat atttttcatt ttcatacata tgtaatgtac tgactaatcc aaatgcattt 120 ttgatattgg aaaaattgtg aactcaattc aaatttaatg cataaatttt gatatttaaa 180 aacaaacttc cagatttctg gcgcctttat tcataacgtt ataatataat tatattattt 240 gcgtatgtag gtataggtat gtacaagtat gaacatgtaa ttttaaatat aaaaaaaaat 300 gttaaaatga acatagtatg ctttggattt tcaaatatat acatacatat atatatgaac 360 atctgcattc tcgccaaaat tgaaattcac taattctaat tcagttcagt taaaaacttt 420 gtaagtgcga ttgaaaattg tgttttctta ttttagtatt attttttgct aataaaaaaa 480 aatctattta tatttttaga agacaattat aattgatttc aaaaggtaat tcattaacaa 540 ttttaatgat aaataaataa tatgtttcaa ctttagaatg tcagaaaata agaaaaggaa 600 aacagaggcc agctctaaca gaccatttca agattggtgg actgataaat atggaatgat 660 gaaaagcaat aacaatgccc tgtgtgcgtt gtgtggaaat acagtagtgt gcagaacatc 720 ctcagtaaaa cgtcattatg aaactactca ttcatttctt ttatctaaaa gtgattcaga 780 acaaaaagaa tatattgcac gtgacgttaa aaataaaaat ttacaatcaa actcattcac 840 tacatttttg aaacatacaa attatttagc tgctgctagt tttgaggttt ccaagactat 900 tgctactcat agcaaaccat ttagcgatgg agattatatt aaagaagctt tcattaaatg 960 tgcaccacat ttatttgaag attttcaaaa taaagagcag atacttaagc gcattgaaga 1020 attaagtata agtcgaaata ctgttaaaga caggatatta agaatgcata aaaatattaa 1080 tattaaaaaa ataaatgata tacgttcagc aaagctattt tctatatgtc ttgatgaaag 1140 taccgacgta acatcttcag ctcgcttgtc cataattgct cgttatgcta aaggagaaaa 1200 aatagttgag gaattgcttt cactagtttc cttatcagca aaaacaactg ggaaagatat 1260 tagtgatgca gttataaaaa cgtttcaaat agctgagata gatacatcaa caatagtgtc 1320 aattacaact gatggagctc cctcaatgat aggacggcaa gctggttttg tatctctttt 1380 tcaagaacac gttgggcatc caattattaa ttttcattgt ataattcatc aagaagttct 1440 atgtgcgaag tctggtttgg cagcatttga tgatatttta agtattgttt ccaaaataat 1500 aaattttata gcagctcgag ctttgaataa gcggcaattt cagttgcttc tacaagaggt 1560 aggttcaaat tatcatggat tgttaatgta taatcacgta cgctggctta gtagaggatt 1620 tgtgttacaa cgttttgtag aatgttttga tgaaattatc atgtttctta atgataaaaa 1680 acccgaattt cctgaacttt ttgataataa ctggaccacc aaattaatgt ttttaacgga 1740 tatttccaac catttaaatg aattaaatat taaattgcaa ggtcccggaa aatcaattga 1800 tgtaatgttc gatattattg aaggttttga gtccaaatta gtaatattta aacgcgatat 1860 tgacaacaat aattttaagt attttccttt gttgaaagat cacctccaaa aattttctat 1920 tcatggagtc tatgaaatta atataaattt ttatagccaa gttataaatg acatacaaga 1980 cagttttgct aagagatttc aagatttcaa aaaatttcaa ccattaatta actttattaa 2040 atatccagac aagtttgatt tggaaactgc taaaatggat caattagact ggttacagtt 2100 ggagaattta gaaatgcagt tggcagagtt tcaatcaagt tctatttgga agctgaaatt 2160 tatagagtta aggcaagaaa tagaaaataa tgaaaaaaaa agattaaatg aagaagaata 2220 cgaaaacatg aataatatgt tgttgaagac atggtgttcg ctcccagaaa catttaatgc 2280 gatgaagaaa ttgtctttag ctattctaac aatattttca tcaacgtata gctgtgaaag 2340 tctattttcg attatgaata atataaaatc tgatgtacga aacagactta ccgatgaatg 2400 tggggaggct tgtatttcat taaaattgtc aaactatgat ccggatattg atgcattatc 2460 gagaaatatt cagcaacaaa aatcccatta aaaaatatgt cagcatatta tagaaatgaa 2520 tctatatata atttttaaaa gcatgtattt tcatttaata aaaataattt gtattggcac 2580 gcatgtattt tcattttata aaaataatat ttattggcac gcagtcaaaa aaaggttcgc 2640 catccctg 2648 // ID Copia-28_AA-LTR repbase; DNA; INV; 140 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-28_AA_; KW Copia-28_AA-I; Copia-28_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-140 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 955-955 (2011). XX DR [2] (Consensus) XX SQ Sequence 140 BP; 44 A; 33 C; 18 G; 45 T; 0 other; tgaccagtgt aactgtcgaa tttatttaac ccatttactc tgtatttacc caagcgacaa 60 taaacgtgta tctttgtcag ttcattaaat caaaccaaaa caaacgcacg cgtttttcat 120 gtatgctctg caaattccca 140 // ID Gypsy-11_TCa-I repbase; DNA; INV; 5108 BP. XX AC chrUn_5; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_TCa_; KW Gypsy-11_TCa-LTR; Gypsy-11_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-5108 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_5; Positions 96663 91556. XX CC 'GTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(222..1091,1095..4127) FT /product="Gypsy-11_TCa-I_1p" FT /translation="MSRASRSRGETPGRRRGPRGSEVTELSAALLQGMGQL FT VQAATKGKVTNDATRQLVSIPPIANNVIAEFNPLTDNIIQWINAVDEFASM FT YNWDDRTTSYLALQKLRGPAEVWYRGLPTRLFTWTEWRDMLSNNFRVKRDL FT HSTLKKMMACEAKDESRLYEYVFEKLALIHATNLPMTDEDRVNLIMGGVRD FT KNIKFAVETAGIKDPAELANHFKTLTKYKTSGNSSNSERAHYSSICFKCRK FT SGHKAKDCWAKKNEHLGNDTGHPTRHTKQRALEYNVNYISPTENNFTSSLN FT AKFFKNILINNVPIKCFIDFGSECSLISEFAVRTLALVPESLSGALLLTTL FT SDTKVVVTSSVQVTVTIDQIKKTLNLLIVSQCVMNVEVLIGQDFTEQADVN FT YIKAGSSLRFFPSKQIHTISHSVVNVGVTENETVQQLVTLLNTYSACIARN FT MSEVGNISISQMKILLTTEKPIFQRPRRLADAERAEVRKIVDELMKQGIIR FT ESNSPYASPVVLVTKKGGEKRLCVDYRALNKITVKDRYPLPLIEDCLRRLS FT GYKYFTSLDLIAGYYQIQMSPESIQYTSFITQDGQYEFLRMPFGLCNAPAV FT FQRVINTVLGQLRFTKVLVYLDDILIPAKSVEESFCVLKEVFNLLKHNGLT FT LKLSKCNFLQTEIDYLGYKISANGIEPSCHKIQAVEKFPEPSNVHSIRQFL FT GLTGYFRKFIKDYAIKSKPLTALLTKNADWEWGRKQLDSFTELKQCLTSRP FT ILAIFDPSLEIRLYTDACRIGVAGILIQVKEKKEQVVAYFSRCTTPAEQNY FT HSFELEALAVVASVKRFKQYLLGRHFTIFTDCAAVRGTFEKNEINQRVARW FT VIELSQYIFEIKHKSGQQMRHVDALSRHLPGSQLAVHALKISNFDWLQTAQ FT INDANIVQIKEQLETGERTNHPDVFNNYALKGGKVYRITTHGLRMVVPKFV FT RFQLLKMAHDESGHFGPAKTYELLESHYWFPKMRHFVTKYCRSCLNCLYFK FT SNSGKKPGLLHPIKKIPVPFHTVHLDHLGPFVNSSKKNAYVLVAIDAFTKF FT ILLYAVKDTKSKNVIAAIKDLIKTFGVPHRVIADRGTAFSSKEFQQYVTEI FT GSKLHLTAVAVPRANGQVERYNRTVLNSLATTGATQQDRHWDENLQNIQLG FT INGTINNAIGVTPSEALMGYRVVTNGLLNSDNQATKDVTTIREQMVEKMEE FT YQASQKQRFDAARCEGKHYSVGDLILLKITSNAPTGSSQKLLPKWRGPYRI FT TKILGNDRYEVQNIPGLSRSRGTFISVAASENMRPWIQFG" XX SQ Sequence 5108 BP; 1650 A; 968 C; 1096 G; 1394 T; 0 other; tcagaagtgg gattacgatc gtgaaaaaga cccggttata cttagtgata gtgtttaaac 60 tacgaaaaat taagtgtaag taaagttaag tggaatatgg aagggacaga agaaacaggg 120 ataacagtga acgtcggtat cgtcggtcag aaatcagaaa tacggatcgt ctacgaagtc 180 ggtccacaag tagcgaggat caacgtcggt accgtaggtc gatgtcacgg gcatcgagat 240 cacgtgggga aactccaggg cgccgacgag gaccccgtgg aagcgaagtg accgagctga 300 gtgcggcact tttacaaggg atgggacagt tggtgcaggc ggctacgaaa ggtaaagtga 360 cgaatgacgc tacccgccaa ctggtaagta ttccacctat tgcaaataat gtaattgcgg 420 aatttaaccc gctgaccgat aatataatac aatggattaa tgcggtcgat gaatttgcgt 480 ccatgtacaa ttgggatgac aggacaacca gctacctggc tctgcagaag ctgagaggtc 540 ctgccgaggt atggtacaga ggcctgccca cccggttatt tacatggacg gaatggcgag 600 atatgctatc aaataatttc agagtaaagc gcgatttaca ttcgactcta aaaaaaatga 660 tggcatgcga agcaaaagac gaatcacggc tgtacgaata cgttttcgag aagcttgcac 720 ttatacacgc tactaactta ccgatgacag atgaagatcg tgtaaattta ataatgggtg 780 gagtgcgtga taaaaatata aaatttgcgg tggagacggc gggcataaaa gacccagctg 840 agttggctaa tcattttaaa actctcacga aatacaaaac aagtggcaat tcctctaatt 900 ctgaaagagc tcattattca agtatttgtt ttaagtgtcg aaaatcgggc cataaggcaa 960 aagattgttg ggctaaaaaa aacgaacatc tgggcaatga caccggacac cccactcgac 1020 acacgaaaca gcgcgccctt gaatataatg taaattatat cagtccaacc gaaaataatt 1080 ttacatcttc atgactaaat gcaaaattct ttaaaaatat actaattaac aatgtcccaa 1140 ttaaatgttt tattgatttt ggtagtgaat gctcactaat atctgaattt gctgtccgga 1200 ctttagcatt ggtgcctgaa agtttgtcgg gagctttgtt gcttaccaca ctatctgaca 1260 caaaagttgt tgttacttct agcgttcagg ttacagtcac aattgatcaa ataaaaaaaa 1320 ctttgaattt gttaatagtc tctcagtgcg tgatgaacgt tgaagtctta atcggacaag 1380 actttaccga gcaggcggac gtgaattaca tcaaggctgg tagttcgttg cgctttttcc 1440 cttcaaaaca aatacacaca atatcgcact cggtggtcaa cgtaggcgtc accgaaaatg 1500 aaacagtgca acaactggta acactattaa atacatattc tgcttgcatt gcaaggaata 1560 tgtcggaggt cggaaatatc tcaatttcgc aaatgaaaat attactgacc acagaaaaac 1620 ccatttttca acgacctcgc cgattggcgg atgcagaaag ggcagaggtt cgtaaaatcg 1680 ttgatgaact aatgaaacaa ggtataatac gggaatctaa ttcaccgtat gcaagccctg 1740 ttgttctagt tacaaaaaag ggaggtgaaa aacgcctgtg tgttgattac cgagccctca 1800 ataaaataac cgtgaaggac cgttacccac tgccactcat tgaggattgt ttgcgtcgcc 1860 tttccggata taaatatttt acaagtttag atctcatcgc aggctattat cagattcaaa 1920 tgtctccaga atcaattcag tacacatctt ttatcactca ggatgggcaa tatgagttcc 1980 tacgaatgcc attcgggctg tgcaatgcac cggcagtttt tcagcgtgtg attaacactg 2040 tcctcggaca attacgtttt acaaaagttt tagtctatct ggacgatata ttaattccag 2100 caaaaagtgt cgaagaatct ttttgtgtgt taaaagaagt tttcaattta ttaaaacata 2160 acggactcac attaaaatta tcaaaatgca atttcttaca aacagagata gattatttag 2220 gatacaaaat tagcgccaac ggcattgagc caagctgtca taaaatccaa gcagttgaaa 2280 aatttccgga accatcgaat gtccactcaa tcagacaatt tcttggactg acagggtatt 2340 ttagaaaatt tattaaagat tatgcaataa aatcaaaacc attaaccgct ttattaacca 2400 aaaatgccga ctgggagtgg ggtagaaaac aacttgattc atttactgaa ttaaaacaat 2460 gtttaacttc tcgccctatt ctagcaattt tcgatcctag tcttgaaatt cgactctata 2520 cagacgcgtg ccgaatcggg gtggctggta ttttaattca agtgaaagaa aagaaagaac 2580 aagttgtcgc atattttagc aggtgtacaa cccctgctga acaaaattat cactctttcg 2640 aattggaggc attagctgtt gttgcatccg tcaaacgctt taaacaatac ttattgggac 2700 gccattttac aatttttacg gattgtgccg cggtacgcgg aacgttcgag aaaaacgaaa 2760 taaaccaaag ggtcgcaaga tgggtcattg aactgtcgca atatattttc gaaatcaaac 2820 ataaaagcgg acaacaaatg cgccacgtcg atgcgttaag tagacattta ccgggtagcc 2880 agctggcagt ccatgcgttg aagatttcaa attttgattg gcttcaaacg gcccagatta 2940 acgacgctaa tatcgttcaa atcaaagagc agctagagac gggtgaacgt acaaatcatc 3000 ccgacgtttt taacaactat gcgttaaagg gcggcaaggt atatcgaatt acaacacacg 3060 gtttacgaat ggttgtccca aaatttgttc gatttcaact attaaaaatg gcacatgacg 3120 aatctggaca ttttgggccc gcaaagacgt atgaattgct cgaaagtcac tactggttcc 3180 ctaaaatgcg acactttgtt acaaaatatt gcagaagctg tttaaattgt ttatatttta 3240 aaagtaattc cggcaagaaa ccaggtttac tacatcccat taaaaaaatt ccggttccat 3300 ttcatactgt ccatctagat catttaggtc catttgttaa tagttcgaag aaaaatgctt 3360 atgtccttgt tgcaattgac gcgtttacaa aatttatctt actttacgcg gttaaagaca 3420 caaaatctaa aaatgttatc gccgccatca aagatttaat taaaactttc ggggtacccc 3480 accgtgtcat agctgatcgt ggcacagcgt tctcgtccaa agagttccaa caatatgtca 3540 cagaaattgg aagcaaactt cacttaacag cagtagccgt tccccgagca aacggacaag 3600 ttgaacgata caacaggaca gttctaaatt cattggccac gacgggagct acgcaacaag 3660 accgacattg ggacgagaat ttgcaaaata ttcaactggg aattaacggt accatcaaca 3720 acgctatagg agtcacacct agtgaggctt tgatgggtta ccgagtagtt acaaacggat 3780 tgttaaactc cgacaaccaa gcaaccaagg acgtaactac aattcgggaa caaatggtgg 3840 agaagatgga agaataccag gcctcccaaa aacaacgatt tgatgcagcg agatgtgagg 3900 gcaagcatta ttccgttggc gacctaattc tactcaagat aaccagtaat gcccctaccg 3960 gaagcagcca gaaattatta cccaagtgga gaggacctta caggattaca aaaatcttag 4020 gaaatgaccg ctacgaagtg caaaacatcc ctgggttgag ccgatctaga ggcacgttta 4080 tcagcgtggc cgcttccgag aacatgaggc cttggattca atttggatag tcaagtaaga 4140 ggtgagacgc taattgcaaa atactgcggg tcgcacatag acctgtgtct ttattttgct 4200 ttgattagtt aaaatggtgg agcactgcgg gagtgggtaa acgtaataac tcctgtgtct 4260 ttgctctatc cataaaggtt tatcacaatt cagaaattag aaattctagc tcctgtgtct 4320 ttgcttcagc attgcgggag gttaaacaga agtaccattc ctgtgtcttt gctgatatgc 4380 attgcgggag gctaaataaa aatactactc ctgtgtcttt gctaaaatgc attgcgggag 4440 gtgagataag gacaccactc ctgtgtcttt gctgagatgc actgcgggag cagcgaggtt 4500 tgtccaagtg ttcggggccc ctaaattaag taagttgcaa ttgcgaattt agcgtctagt 4560 tttaagttgc ggatgcgaat ttagcgtcta gttttaagtt gcggatgcga attcagcgtc 4620 tagttttaag ttgcgtttgc gaactgagcg tctagtttta agttaaggat gcgaataaag 4680 cgtttagttt taagatgcgg ttgcggaatt aagcgtctta tttaaattac gttcataaag 4740 atttaaggtt ttaaagtaag cgtctatgtt taagttacac caaaaaaaaa aaaaaaaaaa 4800 gtttcgaatt aagcgtctag ttttaagttg tattcattaa aacttaaggt ttcagagtaa 4860 gcgtaatctt agcttgtatc tattatcgcg tttagtttaa gtccaaataa ataatataat 4920 atataatata tataataaaa taaactaata atgataatcc attcttagac caacaggccc 4980 taccgttctg gcgtttatga gtaactgtgg attgcttctt aaattatcta aatcttggtt 5040 gagtttcaga gacaaattgg ttattagagc aagtggagtg aggacactct ggatgtcagg 5100 aaaggccg 5108 // ID DNA-2_DPu repbase; DNA; INV; 983 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE DNA transposon from Daphnia: consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-2_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-983 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (22-MAY-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC ~95% identity to consensus. TTAA TSD. XX SQ Sequence 983 BP; 285 A; 209 C; 207 G; 280 T; 2 other; caagcaatcc accactcaas cgcaccaaat ccatggaccc cccctttttt tcccattccg 60 ctatgggacc taaaaatctg aaataattca ggcatatcca gttttctact ccggattcac 120 ctgtttttgc agaatttaaa tattccctgc cgttcgggag atatttaatg ttaaagtttg 180 ggttttttca aattttaaca attttggggg gtctttggca tcgctttttg ggtaaatgct 240 aacctttaaa aaaaaacaaa ttctcccaaa agtttgctcg gccatccctc aataccaatc 300 caaaaggaat ttacattccg gattagattg gccgagttat tcccaatcta gtaaagtgta 360 attttgacaa gtttcccacc cagcctagtc gccatttttt atcactctcc ctcgcgttcc 420 tggcggatga cagccgaagc atacgagttc tctgcacctc gggcttcata ttgccggcca 480 ctagactata tgaaagggcc tacagtatgt cccccggtct aaagtgaagg ccaggaaagc 540 gtagcttcgg ctgtcatccg ccaggaacgc gagggagagt gataaaaaat ggcgactagg 600 ctgggtggga aactggtcaa aattacactt tactagattg ggaataactc ggccaatcta 660 atccggaatg taaattcctt ttggattggt attgagggat ggccgagcaa acttttggaa 720 gaatttgttt ttttttaaag gttagcattt acccaaaaag cgatgccaaa gaccccccaa 780 aattgttaaa atttgaaaaa acccaaactt taacattaaa tatctcccga acggcaggga 840 atatttaaat tctgcaaaaa caggtgaatc cggagtagaa aactggatat gcctgaatta 900 tttcagattt ttaggtccca tagcggaatg ggaaaaaaag gtggggtcca tggatttggt 960 gcgsttgagt ggtggattgc ttg 983 // ID RTE-19_BF repbase; DNA; INV; 2285 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTE-19_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-19_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2285 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2285 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1717-1717 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..2271 FT /product="RTE-19_BF_1p" FT /translation="MVNKKWQRSVLDVRAYRGADVGSDHHLVVTKVRLKLR FT ASPPTKQRCKVFDTAKLRKPEIRREFALELRNRFKALENLVDEEEPNVVET FT SWDTITKVYSETAKKVLGHRKRKDKEWLTQETWRKIEERKVAKQKLLTSKT FT QQAKEAYRNKDKQVKRSARRDKRAFVEDLATEAEQAATRGELSTVYRITKK FT LCNQSSASSVPIKDKQGKLLTSEREQTARWAQHFEEVLNRPEPDVPADPTP FT SVDIPINIDPPTEEEVETAIKALKNGKAPGIDSIQAELLKADCATATLLLT FT DLFAKIWKHEVIPQDWSKGLIVKIPKKGVLSNCDNWRGITLLSIPSKVFCR FT ILLKRIDSAIDSRLREEQAGFRKGRGCSDQIFALRNIIEQCIEWNSPLLIN FT FIDFQKAFDSVHRESLWKILRAYGIPHKIVTLIETFYKHFECSVITENSLS FT ESFPVRSGVRQGCILSPILFLITIDWVMKETTSDRPRGIRWTPSKYLEDLD FT FADDLAVLASKRTLLQEKTDRLSERGKQTGLYINKKKTQVMYINTPTTPPI FT SLDGEVLESVGVFTYLGSVMSTQDGARKDIKSRLDKARASFSRLQTIWKSK FT QYSRKTKIRLYNSNVKSVLLYGAESWRATKTDMRKLDAFHNNCLRKICNIF FT WPNKITNEELHRKTECRRITTEITHRRLKWLGHVLRMTDNRIPKVALTWRP FT DGKRKRGRPKTTWRRTVSDDLVSMGLSWKSVQTAAQDRPRWRKKVIALCLT FT RDEEDK" XX SQ Sequence 2285 BP; 774 A; 516 C; 547 G; 448 T; 0 other; atggtgaaca agaagtggca gaggtccgtg ctggacgtga gagcctaccg cggagcagat 60 gttggcagcg accaccatct ggtagttacc aaggtcaggc tgaagctgag agcaagccct 120 cccaccaagc aacgatgcaa ggtgtttgac actgccaaac tgcgtaaacc tgaaatcaga 180 cgtgaatttg ctctagaact aagaaaccgc tttaaagccc ttgaaaacct agtagacgaa 240 gaagaaccta acgttgtaga aacatcctgg gataccataa cgaaagtgta cagtgaaact 300 gccaagaaag tcctaggcca tagaaagagg aaggacaaag aatggctgac acaagagact 360 tggaggaaaa ttgaagagcg gaaagtagcc aaacagaaat tactgacctc taagacccaa 420 caagcaaagg aagcatacag gaacaaagac aaacaggtaa aacgaagtgc ccgccgtgac 480 aagagagcat tcgtagaaga tttagcaacc gaagcagaac aggctgcaac tagaggggag 540 ttaagcacag tgtacagaat caccaagaaa ctctgcaacc agagcagtgc cagctctgta 600 cccattaagg acaaacaggg gaaattactg acttccgaaa gggaacagac cgcaagatgg 660 gcgcaacatt ttgaagaagt gctcaacaga ccggaacctg atgtgccggc tgaccccaca 720 ccttcagtgg atattccaat caacattgac ccacctacag aggaagaggt agaaacagca 780 ataaaggcac taaagaatgg taaggcccca ggcatagact ccattcaagc agaacttctg 840 aaggctgatt gtgcgacagc tacccttctc ctcactgacc tgtttgctaa aatctggaaa 900 cacgaagtca taccgcaaga ctggagtaaa gggctaatcg tcaagattcc taagaaggga 960 gttcttagta actgcgacaa ctggaggggt ataacacttt tgtccatccc cagtaaagtg 1020 ttctgcagaa tcctcctgaa aaggattgac agtgccatag actccagact aagggaagaa 1080 caagcaggat tccgaaaagg ccgcggctgc tcggaccaaa tctttgcact gaggaacatc 1140 atcgaacagt gcattgaatg gaactcccca ctgctaataa acttcatcga cttccagaaa 1200 gcatttgaca gtgtgcatag ggaaagcctc tggaagattc tgagggctta tggtattcca 1260 cacaagatag ttaccttaat agaaaccttt tacaagcatt ttgaatgcag tgtcatcaca 1320 gagaacagcc tgtccgaatc gttccctgtc aggtctggtg tcagacaggg gtgcattctt 1380 tcgccaatac tgttcctaat cactattgac tgggtaatga aagaaacaac ttctgaccga 1440 ccacggggta tcagatggac tccttctaag tacctcgagg accttgactt tgccgatgat 1500 ttggctgtct tagcatccaa gcgtaccctt cttcaggaaa agactgaccg gttaagcgaa 1560 aggggtaaac aaactggact gtacattaac aaaaagaaga ctcaagttat gtacataaac 1620 actcctacaa cccctcccat cagtttagac ggagaggtcc tggagagcgt aggggtcttc 1680 acttaccttg gaagtgtgat gagtacgcag gatggtgcac ggaaagacat taaaagccga 1740 ctcgacaaag ccagggcgtc cttcagccgt ctgcaaacca tttggaagtc caaacaatac 1800 agccgcaaaa ccaaaatcag gctatacaat agtaatgtca aatcagtcct tctctacggt 1860 gctgagagtt ggagggccac caagacagat atgaggaagt tagacgcatt ccacaacaac 1920 tgtcttagga aaatatgcaa catcttctgg cctaacaaga taacaaacga agagctgcac 1980 cgaaaaaccg aatgtagaag aataactaca gaaattacac acaggaggct aaaatggctt 2040 ggccatgttc tgagaatgac tgacaacaga attccaaaag ttgcactaac ttggagacct 2100 gacggtaaaa gaaaaagagg aagacccaaa accacgtgga ggagaacagt gtcagatgac 2160 ctggtgtcca tgggtctctc ctggaagtca gtacagacag cagcacagga ccgacccaga 2220 tggaggaaga aagtcatagc cttatgtctc accagggacg aagaggataa gtaagtaagt 2280 aagta 2285 // ID Gypsy-8_DPu-LTR repbase; DNA; INV; 128 BP. XX AC scaffold_126; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_DPu_; KW Gypsy-8_DPu-LTR; Gypsy-8_DPu-I. XX NM Gypsy-8_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-128 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 732-732 (2010). XX DR Genome; scaffold_126; Positions 190555 190682. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 128 BP; 26 A; 39 C; 26 G; 37 T; 0 other; tggaatgact tggcaaggtc gctcccctcc cctggcagca tagagacggc agttggacct 60 agtacgttga cgagcgaata catcctgttt ctctcctctc aattctctca tcgtttcact 120 tcattcca 128 // ID Homo1 repbase; DNA; INV; 2817 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo1 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo1. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2817 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 426..2240 FT /product="Homo1_1p" FT /translation="MTSEEIKIKINRGVYKIAAKHKGKSIIWSILYDIYKE FT DESVLEGWVFCNKCGKVLKFVANQTSNLSRHKCCLSLRQPTEAKKVSNLDK FT ITATKKCLEWVVQDCRPFSAVNGIGFRSLVEFFVKIGATYGANVDIDDLLP FT DATTLSRNAENDAEEKRAIISNDVKEAVDNNAASATIDMWTDRYVNRNFLG FT VTLHYLKDFKLNDIILGLKSMDFEKSTGENILKKLKSIFTQFNVENINNIK FT FVTDRGTNVKKALENNIRLNCSSHLFSNVLEKSFDEACELKDILHACKKIV FT KYLKKASLQHKLSTTLKSPCPTRWNSNYNMVNSIVNNWSTINNILSETEGG FT QKLIVVNISTLKIILLLLEDFERIFKELQTCSSPSLCYVLPSIAKIKLLCE FT PNAEDVSSISVLKSRILVNVNTIWKENLSIWHKAAFFLYPPAAKMEHEELL FT EIKNFCIEQMYNYVRFSNQPSQNNSCTLESNPASPNCPLEAKIPKRDMFVP FT KTTFFFSQLVAQANNNLKSPPEEFESYSGERVTLTEDFEPIEWWKTKENCY FT PLLSKLALQLLAIPSSSAAAERVFSLAGNIITEKRNRLGPKTVDNLLFLHS FT FFKNNNL" XX SQ Sequence 2817 BP; 990 A; 481 C; 480 G; 866 T; 0 other; cagagagctg caagtgtgcc aatttttgcc accttactca cactcaataa aatttgtgtg 60 ccggtgctaa tcaacactgg ttcagcaaat cataaacaca cacttataaa aaattgtgtg 120 agtgctgctc atgcaggcaa tatcttgctg tcactcacaa caatgtaccg ttttcttttt 180 gcttgtcact caagtatttt tcgtcaacgc ttgccggtcc cacttacact tacagcgaac 240 gtagaaaaaa agcgagagtg tgaaaagagc aatgtgtaac aaacacaagt agacaaaaca 300 catgtatgac acccaagtgt tttacaggca ccatggtgta agtgtcaaag taacggttgt 360 tttgattttg ctttagctcc tttcttagct tcctagttta cttcgtgttt gattttggca 420 tcgaaatgac atctgaggaa attaaaatta aaataaatcg tggtgtttat aaaatagccg 480 caaaacataa aggcaaaagc ataatctggt caattttgta tgatatatat aaagaagatg 540 aaagcgtcct cgaaggttgg gttttttgca acaaatgcgg aaaagtctta aaatttgttg 600 ccaaccaaac atcaaatttg tcgcggcaca aatgttgtct ctcattacga cagccaactg 660 aagcaaaaaa agtttcaaac ttggacaaaa tcacggcaac aaagaagtgt ctggaatggg 720 tagttcaaga ttgtcggcct ttttctgcag tgaacgggat tggttttcgc agcctggtgg 780 aattttttgt aaaaatcggt gctacttatg gagcaaacgt cgacattgac gacctactgc 840 ccgacgctac aacattgagt cgaaatgcgg aaaatgatgc agaagagaag cgagctataa 900 tatccaatga tgttaaagaa gcagtagata ataatgcagc atcggctacc attgatatgt 960 ggactgaccg atatgtaaat agaaattttt taggcgtcac tctgcattat ttaaaagatt 1020 ttaagctgaa tgacatcatt ttgggcctaa aatcaatgga ttttgaaaag tcgacagggg 1080 aaaatatttt aaaaaaattg aaatctattt ttacacaatt taatgtcgaa aacataaaca 1140 acattaaatt cgtgacagat aggggcacca atgtaaaaaa agctttagaa aataatatta 1200 gattaaattg tagtagtcat ttattttcaa atgttttaga aaaatcgttt gacgaagcat 1260 gtgagcttaa agacatatta cacgcctgca aaaaaatagt taagtattta aaaaaagcaa 1320 gtttgcagca caagttaagc acaacattaa aaagcccatg ccctactcga tggaattcca 1380 attacaatat ggtcaattca atagtaaata actggtcaac aattaacaat atattaagtg 1440 agacagaggg aggtcaaaaa cttatagtcg taaatatatc cactttaaaa ataatattat 1500 tactacttga agattttgag agaatattta aggaattaca aacgtgtagc tctccttcat 1560 tgtgctatgt tttaccctca atagccaaaa taaaacttct ttgtgagcca aatgctgaag 1620 atgtttcgtc gatatctgtt cttaagtcaa gaatattagt aaatgtgaac acgatttgga 1680 aagaaaattt aagtatttgg cacaaggcgg cttttttttt atatcctcca gcagcaaaaa 1740 tggaacatga agaattatta gaaattaaaa acttttgcat tgaacaaatg tacaattatg 1800 ttcgtttttc aaatcaacct tcgcaaaata attcttgtac attagaatca aatccagcaa 1860 gtccaaattg tccacttgaa gcgaagattc ctaaaagaga tatgtttgtg ccaaaaacaa 1920 cttttttctt ctcccaactg gtagctcagg ccaataacaa tttaaaatcg cccccagaag 1980 aatttgaaag ttattctggt gaaagagtaa ctttaactga ggatttcgaa cccattgagt 2040 ggtggaaaac caaagaaaat tgctatccgt tattatctaa gttagcattg caacttcttg 2100 caataccttc aagtagtgca gcagctgaaa gagtattttc tttagcgggc aacataataa 2160 cagaaaaaag aaatagatta ggtccaaaaa cggtagacaa tttgctcttc ttacactctt 2220 tttttaaaaa caataatttg taatttaatt tttattttaa acttgactga aaaaaaaaaa 2280 aaacagaata ttttgttatt tgcgatataa tgtttatatt tctcttatgt taagctttaa 2340 tatatttagt ttgttaatta aatgaagtat tataatgttt atttaatata tttaatgaat 2400 ataaaaaaat atatttttta tttaaatatt gcttttttgt tcactatttt ttcctattgt 2460 gcatgcattt ccaacagcaa gtacgtaaac atgtacatac gtaaaaaaaa aaacacatct 2520 caaccgaagg cctgcacaaa agaaacctca acacacacac acattgtatg cttcaatttt 2580 tgtgtatctc ttttcaccca actttcttgc gacgctcaag cgtcggcata tgtcacacac 2640 acatatgcaa aagaaatacc tactcacttt tgcagctagt gtgcacaaac actcaaatat 2700 ttttgtgagg cgagcgattg tactcatttg tacggtgttt gaaaacaccc gggtgtcaaa 2760 atcacccaac atttacccgg tgcaccttca gtaccggtgg cttgttgcag ttctctg 2817 // ID MuDR1x_SM repbase; DNA; INV; 2406 BP. XX AC . XX DT 26-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; KW Autonomous DNA transposon; MuDR1x_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2406 RA Jurka J.; RT "MuDR-type element from Schmidtea mediterranea."; RL Repbase Reports 7(10), 1091-1091 (2007). XX DR [1] (Consensus) XX CC Analogously to hATx elements reported in this issue, MuDR1x_SM CC and MuDR2x_SM are highly diverged from other MuDR elements. XX FH Key Location/Qualifiers FT CDS 747..2099 FT /product="MuDR1x_SM_1p" FT /translation="MLEFVTSQKGNRKLIVQGYLFTKHSENSDGKSLWRCE FT KRTCTARVHTKDDEIVHEVATHNHTVTHGQVEVEKARAGMKRRGETTDETT FT RSIVQNELMNVPVSAAHLLPKRTTLARDVRRHRQKVGPNDQDDIMRYSLTQ FT SNRPFIRIRRDDMIIFAAQDDLEFLSRCEHWFADGTFRVSPNGYDQLYTIH FT GFINGEVFPVVYALLSSRTEEAYQHLLQEILILKAGLNPASIMVDFELAAI FT RAFQSTFPTATITGCMFHFGQCVWRKLQAEGFSERYRNEPDFALLVKRLLA FT LAFVPPQDVIDLFEHLIEDPAYRDIEVICDYMEDNFIGRLRRRRRGPPRFS FT IQLWSQFSRVIDNLPRSNNAIEGWHNAFNNVVGFAHPTTTKLARKLQQEQH FT SNQLLRRQLELGTTAGKKKKTYIRVNEALHTMVTDYNNRDSITYLGDIARV FT LNINVV" XX SQ Sequence 2406 BP; 786 A; 457 C; 432 G; 731 T; 0 other; acaaaatgtc acggacaaaa tgtcatgggg acaaaatata agatgtttgg gtatttactt 60 gtcgttgatc agttgggaac aagtaccaaa acactgtact gtgtagcata cggtgccatt 120 tcagcataac gagccgtttg ctcatcttaa actcattgaa tattaatgtc ctttcgtatt 180 gaaatataag aatccttaaa gccttttcaa ctgagctaaa acttaaatga aataaagcat 240 tcttccgtga aataatcaag ggcacctaac cattgaaaac agtattaatg aaaccaatat 300 tcgctcttct tcttcactat ctccgcctac tttcatctgc gctcatgatt tcgtgtaatt 360 ttaattgcct tacttgtcgc agcacgtcag tgttatgtgc caaaacatgt ttggtttcgg 420 tccgttgagt gcatgttggc ctcaaattaa aaacacagct atgtaaatta aaattaactt 480 tgccagatta ttattaataa cttaagtccc agcgctataa ctgtagcatg agctggattt 540 ccaatttttg gtttgtttta tgaatatcgt tagactatat ataaaaattt ccagaaattc 600 aaatttaaaa tattatatga aaaaaataga aataacttat aattcatatt agttggaaat 660 ataattttat tttacaattt attgcgttta ctgtagacta ttatagctac ttagtattta 720 caaataaatt ttgaaaagtt accacaatgc tagaattcgt tacatcacag aaaggaaatc 780 gtaagcttat cgttcaaggt tacttgttta cgaaacattc ggagaacagt gacggaaaaa 840 gcctttggcg ctgtgaaaaa agaacgtgca cagcgcgagt gcacacaaaa gatgatgaaa 900 ttgttcatga agttgcaact cacaatcaca cggtaacgca tggtcaagta gaggtcgaaa 960 aagcacgagc cggcatgaaa agaagaggtg aaactactga cgaaacaaca cgatcaattg 1020 ttcagaatga attgatgaat gttccagttt cagcagcgca tctacttcca aaacgaacaa 1080 cattggcacg agacgtacga cgtcatcgtc agaaggtggg accaaacgat caagatgaca 1140 tcatgagata ttctttgacg cagagtaaca gacctttcat ccgtattagg agagacgaca 1200 tgatcatctt cgcagctcag gatgatttag aattcctttc cagatgtgag cactggtttg 1260 cagatggaac atttcgagtg agtccaaatg gatatgacca actatacacg attcatggct 1320 tcattaatgg cgaagttttc ccggtagtat atgcactact atcttcaaga acagaagaag 1380 catatcaaca tctgctccag gaaattctga tattgaaagc aggtctgaat ccagcttcaa 1440 ttatggttga tttcgaacta gcagcaatta gagcattcca gagcaccttc ccgacagcaa 1500 caattactgg atgcatgttt cacttcggcc aatgtgtttg gaggaaattg caagcggaag 1560 gattctcgga acgttacaga aatgaaccag atttcgcgct tctcgtcaaa cgcctacttg 1620 ccttagcatt cgtcccaccg caggatgtta ttgatttgtt tgaacattta atcgaagatc 1680 cagcttatcg tgacatcgag gttatctgtg attacatgga agataatttc attggtagat 1740 taagaaggcg aagaagagga ccaccaagat tctcaattca actttggagt caattttcta 1800 gagtaatcga caacctgcct cgcagcaaca acgcaatcga gggttggcat aatgcattca 1860 acaatgttgt cggttttgct catccgacca caaccaaact tgcacgcaaa ctacaacaag 1920 aacaacacag caatcaactt cttcgtcgtc aattggaact gggaacaacg gctgggaaga 1980 aaaagaaaac atacatccgc gtcaacgagg cgttgcacac tatggtgacc gactacaaca 2040 accgagattc aatcacatat ttaggagaca ttgcacgtgt tttaaacatt aatgttgttt 2100 aacttctaat tgttgtttaa ttttaattgc taaaagactc gattatttat ttttagatgc 2160 gtttttaaaa taacctaata atgttaataa tactgtgctt ttcattaact gaatatctca 2220 gactattatt gatataggct aaatattatt acaataataa attacattag gacttaattt 2280 taataaatat attatttcat ttcttcattt gttattccat cttatatttt ttcccctacc 2340 attttgtccg catgacattt tatccctgac attttgtccc catgacattt tgtccatgcc 2400 attttg 2406 // ID NAUT1_NVi repbase; DNA; INV; 546 BP. XX AC . XX DT 06-NOV-2007 (Rel. 12.11, Created) DT 06-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE Putative non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; NAUT1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-546 RA Jurka J.; RT "NAUT1_NVi: Putative non-autonomous transposon from Nasonia RT parasitic wasp."; RL Repbase Reports 7(11), 1171-1171 (2007). XX DR [1] (Consensus) XX CC 42 bp TIRs; >3800 copies/genome. XX SQ Sequence 546 BP; 173 A; 113 C; 75 G; 185 T; 0 other; catatcgtcg ctccctacga agagtagcac tgttcaaatt tcccgtcata agtagcacaa 60 ctcatttgac gcacgctaga ctagcacaac tcaaaatttc ccgcgaagac tagcacaact 120 caaactttaa acaacgaatt tctataaaaa tttcccgatt tttgcgaaac tgtccctaat 180 gaacattttt aaatattcac aatttacaac tttttaccga atttaaaatt gttttcacaa 240 atacaataat tttttacaaa catttttttt taattattca ctttttttag aaaccttcta 300 aaatttccga aaccttttaa aactttgagg tttgtctgtc tttgtccaag gttttcatga 360 acttaaccaa ccaaataacc caataaaaac gtaaaatact cacattttaa gtaatataaa 420 gtcgcgagtt gtgctactct tggcggcatt tctgaactgt gctactcttc gcggcaaatc 480 gtgagttgtg ctagttttcg tttgaaaatt tgaactgtgc tagtcttcgt acggagcgac 540 gatatg 546 // ID BEL-52_AA-LTR repbase; DNA; INV; 657 BP. XX AC supercont1.337; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-52_AA_; KW BEL-52_AA-I; BEL-52_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-657 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.337; Positions 583588 584244. XX SQ Sequence 657 BP; 228 A; 106 C; 109 G; 214 T; 0 other; tgttgcggaa tacgagtgat aggcccttta gatctgctcc acactaaggc gccgcagcgt 60 aatactgtta agctgttgaa tgtcaaactt gacttgtcgg tgacagacct atagcaatga 120 catcagcaaa atacgcactg aagcaagcta ttgtaaagtt aaaatttaat gaaaacacag 180 ttctatttgc atttcttatt aactattcga tttatatact gaattcaatc gtctaaatct 240 tctagttgag gtttgtactt ttcttagtta tctattttaa cgaatcgtgc atttgataat 300 tatttctgaa catgcaatgt aaacatgtga tctcatgaat tattcctaaa attacagtga 360 gtgcaataca tattatctcg cctaaaacga tcttacatcc taaattataa gctaaatagt 420 acggatatgt aagtaaagat tgttaattga taaggattcc aacaataatt gaaattttat 480 gtgagcagat aacttaacct aaacagctca aactggagaa taattggatt gctcggacag 540 ttaggaaact aatttgtaag tgaattatga atttattgca ttaataagac acctaataaa 600 actattattt ccagtcgagt tagcggaaaa accgaactgc gaaaccgttt tttaaca 657 // ID SMAR31 repbase; DNA; INV; 1263 BP. XX AC . XX DT 22-JAN-2008 (Rel. 13.01, Created) DT 22-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR31. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1263 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(1), 19-19 (2008). XX DR [1] (Consensus) XX CC Youngest copies are >10% divergent from consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 137..1195 FT /product="SMAR31_1p" FT /translation="MNNFVEQRVCLKFCVSNEISCVESLKMLQKAFGESAL FT SKTRAYKWYKEFKSGHEVVEDLPRSGRPSTSNTDENVXIIKKTVLENRHVS FT LREMASDLNISYGTVQHIVVDILGMQRVAARLVPKELNFVQKDHRKKVAED FT MISEAQKDPTFMKRIITGDETWVYEYDVETSQQSSKWRLKGEPKPKKPRQS FT RSKIKVMLTVFFDYRGVVHSEYLPPGQTVNKDYYLAVMKRLRDAVRRKRPE FT LWAENNWILHHDNAPSHKAKIVVDYLTKHSVNLLEQAPYSPDMAPCDFFLF FT SKLKMPLRGKRFESIEAIKENATKELKAIPPVAYEKCMDDWVKRWHMCVAS FT NGCYFEGGTK" XX SQ Sequence 1263 BP; 372 A; 263 C; 296 G; 331 T; 1 other; cagtatgatc aaaaagtacc ggcactttgt aatttaaatg aatcgctctc gaccgacccg 60 gttatttttt ttcttaagtt ggtacatgtt taactgacat ctgtcagaaa tttgagcgcg 120 atcgaagcat ttcaccatga ataattttgt tgagcaaaga gtttgtttga aattttgtgt 180 ttccaacgaa atttcttgtg tggaatcgtt gaaaatgttg cagaaagcct ttggggaatc 240 tgctctttcg aaaaccagag catacaaatg gtataaagag ttcaaaagtg gtcatgaagt 300 ggtcgaagac ttgcctcgtt ccggacgccc ttcaacatcg aataccgacg aaaacgtgrc 360 aattattaag aaaactgtgc ttgaaaatcg tcatgtgagc cttagagaga tggctagtga 420 tctcaacatc tcatatggaa ccgttcagca cattgttgtt gatattttgg gcatgcaacg 480 cgttgccgca cggcttgttc cgaaagagct gaatttcgtg caaaaagacc atcgaaagaa 540 ggttgctgaa gacatgattt ccgaggccca aaaggatcca accttcatga aacgcatcat 600 aacgggggac gagacatggg tctatgaata cgatgtcgaa acaagccagc agtcgtccaa 660 atggcgcctc aaaggcgagc cgaagccgaa aaaaccgcgc caaagtcgat cgaaaatcaa 720 agtcatgctc accgttttct ttgattatcg tggtgtggtg cattcagagt acctgccacc 780 tggtcaaaca gtcaataaag actattacct ggccgtcatg aagcgtttgc gtgatgccgt 840 gcgtcgtaaa cgaccagaat tgtgggccga aaacaattgg attttgcacc acgataacgc 900 gccgtcacac aaggccaaga ttgtcgtgga ttatttgacc aaacacagcg tgaatttgct 960 cgagcaggcc ccgtattcac cagatatggc cccgtgcgac tttttccttt tctccaaact 1020 caaaatgcca cttcggggaa agagattcga gtccattgag gccataaaag agaatgcgac 1080 gaaggagctg aaggccatcc ctccggtggc ctatgaaaag tgcatggatg actgggtcaa 1140 gcgttggcac atgtgcgtcg cttcaaatgg gtgttatttt gaaggaggaa caaaataaat 1200 ttgcaagaaa attgagaaaa ttttgtttta tttacaaatt cccggtactt tttgatcata 1260 ctg 1263 // ID Gypsy-128_AA-LTR repbase; DNA; INV; 1018 BP. XX AC supercont1.366; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-128_AA_; KW Gypsy-128_AA-I; Gypsy-128_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1018 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.366; Positions 584898 585915. XX SQ Sequence 1018 BP; 321 A; 236 C; 216 G; 245 T; 0 other; tgttaccgtt gcgaagatct taatttgttt atttattttt cagaataatt gttagtattt 60 gttgttagtt tgttgtttag ttatattaaa ttatttatta gaattataaa gaaatgttag 120 taaaattgtt atatggttgg tattaggctg ttaataattg ttaattaaat gtattactcc 180 ggaattagtt gttaggaaca aaatttgtaa atacactaaa cacacattga acacgtctaa 240 caataatcat gcacgaaaaa ggggaaaacg cgtcagccat aaggtatgcc acgtcaaaaa 300 gggaataaaa gaaagggtat ggaaaaggga acaggatgag caggctactg ggttggaaag 360 cgctaagtaa aaggtctatt ccaatctaga tctcaacgag accacacgga aatattccaa 420 acttaaagtg accgttaccc gaagaaagtt taaagtacat aagtgcataa gtgcaaccgt 480 tcgaatcagg tacgagtagc ttcgatatga acggtatcta gaataccaaa aagccgtcct 540 aacgccctac ccataaccca tactaggacc cgttatagtt tgcctgaaga ttttggcaga 600 ggctcccacg acccgtctaa acccgtggtg aacccctgta gctgccctga aggcccgcct 660 aaggtcccag gatcaccaag gatacctgaa gccgaccatt tgggaacccc gacgccctag 720 gaagctggcg caacccgtta aggaaccaac ttcaacgcgg agaccgttag gccacaccaa 780 gccatccaag ccccgtagag ttcgcgttca gcctagaacg tgccgatcaa cccccacgtg 840 ctccacttcc ggccggccgc atcgagacac gacacacctc acacctacac cacacccagt 900 aagtccaata aaaagaatta aagaagttga gtgcgtttta cttggaccct tgacgagctt 960 acgccgaccc taagagtctc ggaatttgag cctggtccct ccgaccagcc gggtaaca 1018 // ID Gypsy-19_RP-I repbase; DNA; INV; 4217 BP. XX AC ACPB02043044; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_RP_; KW Gypsy-19_RP-LTR; Gypsy-19_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02043044; Positions 337 4553. XX CC Positions [3414-3890] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 37..981 FT /product="Gypsy-19_RP-I_1p" FT /translation="MQPVRVRIRFEVGEERNNVVIRVASIRLMSEDTTYVF FT PTHLQHSDLHEELFSIGTVKNVCKALIKVGQFRNIMVTLPSEIVPLYLDDN FT LNFVFRSTYLEELSMEQTLISGKKIPITAVENQTELIKIIEKLTTKLDAKE FT NKQVDLTHIHKQFILQKFQGKENAIEWITLFEKECNRCEVTTDHTKIQILR FT LFLEQNAVEWYSSTLCKLTLTGEWKDWRESFIQTYGDKSWRSVHYAYTFRF FT IKGSLLDYALKKERLLLETEPKMSDRSRINHIVVGLPLYIQDRLDKEEIQA FT TESLMYQLRRYSQEYPRVKKGEV" FT CDS 1317..4205 FT /product="Gypsy-19_RP-I_2p" FT /translation="MLQNEQTFKTISGVQKSCCTIKLPMQIHKIREEIEAH FT VIKNDAFSYDVLLGLDAIQKFHLLQDDKLNILQRVKENNIEIVDYEQRKLT FT EEISYCEYEQGGYLEKLQHLDNLKKTKLMSVISEDREIFAKHKYDVGKVEN FT HEAHIKLLENKYISKKPYRCSIPDRQEIESQITKLLEAGLIEESESPFAAP FT VTLAYKKEDGRRSRLCIDFRDLNKILVPESQPFPIIEDIIVKTRNCKWYSV FT FDISSAFWSVPIRPKDRHKTAFVSQTGHYQWKRLPFGLKISPAIFQRVLAN FT TIRRHGLHEFCTNYIDDILVFSKTFDEHISHIKKILQAIKLEGFRLKLSKC FT NFAQSSVKYLGHIIGCNSVSPHQDNIRSIKDFPRPRNKKNIRQFLGKINFY FT HKYIPDYTRLMEPLHNLLRKNVDFQWTSECNTTFVQIKELLCQSPILSIFD FT PERETFLFTDASTLGIGAVLKQIQDNGELHPVAYFSKKVSPAQAKKKAIYL FT ECLAIKEAIRYWQHWLIGHKFQVVSDHKPLENIKVKARTDEELGDLVYYLS FT QYDFTIRYAPGRTNQEADALSRNPVLEHFENKDDVLQIVNFVSIEKIKLDQ FT NSNKNELNTDRNTHKKGDLLYKNLRGRERIFVSQDFGLKLIENVHTYYGHI FT GVSHVAAKIRPFYYFRNMDKHIKEYCDHCDICKKNKSRRCREIGLLSQLGP FT PKEPYEIMSLDTIGGFAGNRSPNRYLHLLADHFTRFAYTCPSKGQKAKDLI FT RFLRPILEKNKVKLLLADRYTGINSREFKAFLRQYGVTLMFTAVDCPFSNG FT LNERLNQTLINRIRCKINEQTQRPWSQIAIECTEEYNRTIHSVTKFSPMYL FT LTGNESSILPPTLTQPQDLGRNRICAFENSQRHHVFNKSRVDQKRKEHDFK FT VGDFVYVETGNRLNRHKLDPIRSGPFKIVRRLSPTIYEVASDRRRLHANIV FT HSSKLVPF" XX SQ Sequence 4217 BP; 1534 A; 695 C; 797 G; 1191 T; 0 other; ttataaatcg gtttttcgtc aacaagcatc cgcaagatgc agccagtcag agtaagaatt 60 cgttttgaag ttggagaaga gaggaataat gtggtgatta gagtggcttc tattcgatta 120 atgtctgagg acactaccta cgtttttcca acgcatttac aacactcaga tctgcatgag 180 gagcttttta gtattggtac agttaagaat gtatgtaaag cattaataaa ggtaggtcag 240 tttcgaaata taatggttac gttaccgtct gaaatagttc ctttatactt ggatgataat 300 ctaaacttcg tttttagaag tacttatttg gaagaattat ctatggaaca aacattaatt 360 tcaggaaaaa agataccaat aacagcggtc gagaaccaaa cagaattaat aaaaattatt 420 gaaaaactga ctacaaaact agacgctaaa gaaaataaac aagtagatct aactcacatt 480 cacaaacagt ttattttgca gaaattccag ggaaaagaaa atgccatcga atggataacc 540 ctttttgaaa aagaatgtaa tagatgtgaa gtaactacgg atcatacaaa aatacaaatt 600 ttgcgactat ttttggaaca aaacgctgta gagtggtaca gttcaacatt gtgtaaattg 660 acattaactg gtgaatggaa ggattggcgt gaatctttca tacaaacgta tggtgataaa 720 agttggcgat ccgtacatta cgcttacact tttagattca ttaagggatc attactggac 780 tatgcattga agaaagaacg tttattgtta gaaacagaac ctaaaatgtc agacaggtca 840 cgtattaatc atattgttgt tgggcttccg ttatatatcc aggatcgttt ggataaagaa 900 gagatacaag ctacagagtc tcttatgtac cagctacgcc gatattcaca agaatatcct 960 agagtaaaga aaggtgaggt atgaagacta gcaaatgtcc aaccttatga cactgaaagt 1020 cagaagatca taaaaaaaca accgtgtaca atttgtgagt cactgggcta tcaaggacgt 1080 tttcacccac cggattattg tagaaataag ggaaagactg ctaatgggaa acaagtaaat 1140 ttcactgcga taactgatgt agatgtcgat actcacaatg agacaaaaaa cggagataca 1200 caccacttat tcgattaaat gtaacaatta acaataacaa agttacggga atttacgaca 1260 gtggttccaa tgtgacatta ataaattcaa aaattgtaga agacataaaa cataagatgt 1320 tacaaaatga acaaacattt aagacaatta gtggtgtgca aaaaagttgt tgtacgataa 1380 aattgcctat gcaaatccac aaaataagag aagaaataga agcccatgta ataaaaaatg 1440 atgctttttc gtatgatgta ttgttgggat tagatgcaat tcagaaattt catttacttc 1500 aagatgacaa actgaatata cttcaaaggg ttaaagaaaa taatatagaa atagtagatt 1560 atgaacaacg taaacttaca gaagagataa gttattgtga atatgaacaa ggtggatatt 1620 tagagaaatt gcagcatcta gataatctca agaaaacaaa acttatgagc gtgataagtg 1680 aagacagaga aatatttgct aaacataaat acgatgttgg taaagtagaa aatcatgaag 1740 ctcatattaa acttttagaa aataaatata tttcaaaaaa accttataga tgttcgatcc 1800 cagataggca ggaaatagaa agccaaataa ccaagttact tgaggcaggc ttgattgaag 1860 aatctgaatc tccttttgca gcgccagtaa ctttagctta caagaaagaa gatgggagac 1920 gttcgcggtt atgcattgat ttccgagatc taaacaaaat tttagtacca gaatctcaac 1980 cgttcccaat aattgaagat atcattgtaa aaacacgaaa ctgtaaatgg tactcagtgt 2040 ttgatattag ttctgctttt tggtctgtac caataagacc aaaagatcga cacaaaacgg 2100 cttttgtttc tcaaactggt cattatcagt ggaagagact tcctttcggt ctgaaaattt 2160 caccagcaat ttttcagcga gttctggcaa ataccattcg acgtcatggt cttcacgagt 2220 tttgcacgaa ttatattgat gacatcttgg tattctctaa aacttttgat gaacatataa 2280 gtcatattaa aaaaatattg caggcgatta aattggaagg atttcggttg aaactatcta 2340 aatgtaattt tgctcagtct tcggtaaagt atttaggaca tataattgga tgtaattctg 2400 tcagtcctca tcaggataac ataagatcaa ttaaagattt tcctagacca cggaacaaaa 2460 aaaacattcg ccagttctta ggtaaaataa atttctacca caagtacatt ccagattaca 2520 ctaggctaat ggaacctctt cacaacctcc taaggaaaaa cgtagacttt caatggacat 2580 ctgaatgtaa tactacgttt gtacaaataa aagaattgtt atgccaaagc ccaattctat 2640 caatttttga tccggaaaga gagacatttc tttttacaga tgctagcact cttggaattg 2700 gagcagtttt gaaacaaatt caagataatg gtgagttaca tccagtagca tatttttcga 2760 aaaaagtgtc tccagcacag gcaaagaaaa aggcaattta tcttgagtgt ctagcaataa 2820 aagaggcaat acgatattgg caacactggt taatagggca caagttccaa gttgtgtcag 2880 atcacaaacc attggaaaac ataaaagtta aagctagaac tgatgaagag cttggagact 2940 tggtatatta cttatctcaa tatgatttca ccatacgtta tgctccagga aggacaaacc 3000 aagaagcaga tgccctttcc cgtaatccgg tattggaaca tttcgaaaat aaagatgacg 3060 tgcttcaaat tgttaacttc gtctcgattg agaaaataaa acttgaccaa aatagcaata 3120 aaaatgaact caatactgat aggaacaccc ataaaaaagg tgatctcctc tataaaaatc 3180 tacgaggcag agaacgtatt tttgtgtcgc aagactttgg gttaaaatta attgagaacg 3240 tacacacata ttatggtcac attggggtaa gtcatgtagc agcaaaaata cgtccgttct 3300 attatttccg aaacatggac aagcatataa aagaatattg tgatcactgt gatatttgta 3360 agaaaaataa atctagacga tgtcgagaga ttggattatt gtcccaatta ggtccaccaa 3420 aggaacctta tgaaataatg tcactagata cgattggggg ttttgcaggc aacaggtctc 3480 ctaatcgtta cctacatctt ctggctgacc attttactcg ttttgcttat acatgtcctt 3540 cgaagggtca gaaagctaag gacttgattc gcttcttgcg gcctatttta gagaaaaata 3600 aggtaaaact tctccttgca gatagatata caggtataaa ttcaagagaa tttaaggctt 3660 ttctgcgtca atacggcgtt actttaatgt ttacagcagt agattgcccc ttctcaaatg 3720 gtctcaatga acggctaaat caaacactga ttaatcgtat tcgttgcaag attaatgaac 3780 aaactcaacg accatggtct caaattgcca tagaatgtac tgaggagtac aaccgtacta 3840 ttcacagtgt gacaaagttc tctccaatgt atctacttac gggaaatgaa tcttcaatat 3900 taccgccgac tcttacccaa ccacaagact taggaagaaa tagaatatgt gcatttgaaa 3960 attctcagag acatcatgtt ttcaataaat ctagagtgga tcagaagaga aaagaacatg 4020 atttcaaagt gggagatttc gtttatgtgg aaacaggtaa tcgtctaaac aggcacaagt 4080 tggatccaat acggtcagga ccatttaaga tagtccgtcg attgtctccc acaatatacg 4140 aggtagccag tgaccgccgc agattacatg ccaatattgt gcattcgagc aagctcgtac 4200 ctttctaagg ggggaga 4217 // ID Copia-46_AA-LTR repbase; DNA; INV; 238 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-46_AA_; KW Copia-46_AA-I; Copia-46_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-238 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 967-967 (2011). XX DR [2] (Consensus) XX SQ Sequence 238 BP; 56 A; 55 C; 43 G; 83 T; 1 other; tgatggagaa ggatgaccac ctcattatta tgtaccttgc agaagagttt gaattacctc 60 atktattttt atgcaccttg cgtgcagttg tcatttcatt gttttaacct aacgaacatc 120 atcacaagta aactcgattc actagagatc ggccgcgtta tttctttcag ttgtccgaaa 180 tactttacgg tccgttttac tccggttctg cgcatctgct ggtcgttcca atccttca 238 // ID Gypsy-25-I_NVi repbase; DNA; INV; 5574 BP. XX AC . XX DT 08-MAY-2009 (Rel. 14.05, Created) DT 08-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-25-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5574 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 986-986 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(325..1269,921..2150,1589..2815,2586..5066) FT /product="Gypsy-25-I_NVi_1p" FT /translation="HQVSYCLSFFSTDGARRRVALFRQSRAREGSNNRRSE FT SLSLRSVRNTWCTRPHIIAATRVPLSTLINLSLRARSRPCSERSRSRPFEP FT RNRSPRILSAIFLSARSLSRALRIAAQSGGHLVFALSPAFARCLAPRAATE FT VRLSKRTPRRFVVYDAAILFFPRYHRARSPPPNARVVEAARAGVRRGRFAL FT VTSGRPLRRVWSSAPRASRWLHSLLSCASGKPAQPATPRCSERTTPVRVAN FT LVPLCGQPHAAHQISPRVARPTIRCCTAQCLVSSPPTAKICCSLSRTSSLT FT LRRTARLITSSSSLTTGSSKIARSVEQRPARVSLAPLPSVVRQRQARTARH FT TAVLRAHNTSPRSESRAALRAAARRAPNFSARSSPDDSMLHSPVPGQQSAN FT GQNMLLAQQNFELNAPPHGAANNEQQQPHNRLQQNCALIEDDDANGVEAIS FT KCSLPNYWSFNPRLWFAQTESAFQSNRIIKDSSRYNLLVSNLPPEVAQEVS FT DIILAPPAEHRYETLKAAVISRLAASADQQLHQLLNEVQLGDRTPSQLLRY FT MRRLAGNAISDEALRIRWLDLLPSQASRMCRVLRTSTLDELATLADELVAP FT TPSVSAVSRPSSPASSGIVSGTTTPLPQHEVVSLRLAMSQISATLQQQSLV FT LQSILTALATKQQQQQQQPQTQRRSRSRTPVSQRQPRSASQPSSTANPAWC FT WYHRRFGSEATQCRPPCSYPTQGNCPHAFPAASVHAPPGRKRHLRRGATHQ FT MARPASIPGQPHVPRAQDVHAGRARHARRRTRRANPISQRSLAPQLASIER FT HRFWYHHAAPTTRGCLAPSRHVANFGHAAAAVARPAVNTDGARNETTTATT FT ATADAEAQPLAHARQPAPATERVATIVHRKPSMVLVSSEIRKRGNAVQASM FT LLPDAGKLLAPPPPQAAAVGANDEQRLHVLDRSSGLRFLVDTGSAVSLLPR FT NFLKRTLQRGPLKLSAANATSIDTFGAHHMDVDLRLHRPLSWKFIVADVSN FT PILGADFLAHYGLVVDVKRKRILDTDTSRHATCSLKSPEIHSVEVCISADV FT AEGHLRRPDAAILGPRNPRQHNHRPPGPRRAASHHHNRPSRSSQGSSTPWP FT STRSRTCGIQGIVGDGDSAPVRQLLGQKGIFADLMRQYSDLATPGSTTIAL FT PGLDAQHHITTTGPPAAAKARRLLGPRLEAARAEFKVLLEMGIVRLSDSSW FT ASPLRLVLKPDGTYRITGDYRQLNSRTVPDRYPLPIIEDLLLALGGTIFSV FT VDLKKAFYQLPIAPEDAHKTAIITPFGLFEFTRSSLGLRNAAQSLQRAMDQ FT LLRDLPFARAYLDDIIVASNNEEEHVGHLRTLFDVLRAANIKVNPEKCVLG FT KKQVTYLGYLVSAEGSRPPQDRIDAIQAFPKPGNSAQLRRLLGLINYYRRC FT IPGAARLLAPLNDLLRHIPPKKKAIPITWTPAAEEAFSACKRALANAASTT FT FLRDDAPRRLLADASDHAIGAALEQQDPDGSWKPLGFFSRKLSNAERNYST FT YDRELLAAFAAIKHFKGILEGGPFTLVTDHKPLTFAAQQPSEKASPRQARQ FT LDYILQFQVTLAYTKGPDNVVADALSRVETIGMPANLDLATLAQHQATDPD FT LPHILEAPTPSLVLRPLDIDQTTLHCAIDGNQVRPYIPRQLRRTAFDVTHG FT LSHPSVRASIRLIAQKYVWPGMRKDIARWARNCVQCQQSKVHRHNRAALGD FT FAAPDARFNHIHLDIVKMPLHSGFQYCLTIIDRFTRWPEAIPMPDQQAITT FT ARAYFGRWIAVYGTPLTITTDQGAQFEAALFSELAKLIGASRVRTTPYHPQ FT SNGMVERFHRTLKGALMCCAPTPWPDALPAVLLGLRTTFKEDLQASPAEML FT FGTTLRIPGDFFVPSSHPGANAPAFVAELRALMQRLRAVPGSRHAPPLAPF FT FHPNLRTCTHVFRRVETVQADAAATLHRAHIESWSA*" XX SQ Sequence 5574 BP; 1132 A; 2036 C; 1430 G; 976 T; 0 other; gtggtgaccc cgagaagaac atcaaccgac gccaagcgga gcagccatct tgggactttg 60 gcgtccctca ccaacgcagc cgaaaaccgg agcccacctg cagtacggcc ccggagatct 120 gcagcgcttc ctggcccaac ctcgccgact ccaaatcatc gctacacgcc ccatctacag 180 ccagcaaccc gaggacgccc agcaccagcc cctttacagc gaacgtgcct gggcctctcc 240 tccagctcca gcggcagcag cacgagcagc gcgcccgcag ccttcgccga gcaacgacat 300 cgcatcagcg taaggtttcg ctagcaccag gtatcgtatt gcctttcctt tttttccacc 360 gacggcgcgc gacgacgagt cgctcttttt cgccagtccc gcgcgcgtga aggaagcaat 420 aatcgccgga gcgagtcgct ctctcttcgc tccgtgcgca atacctggtg cacgcggccg 480 cacatcatcg ccgcgacgcg cgtgccgctg agcacactca taaatctctc tctccgcgcc 540 cgctcacgac cttgctccga acgctcaagg tcaaggccgt tcgagccgcg caacagaagc 600 ccgcgcattt tatccgccat ttttctttcc gctcgctctc tctcgcgcgc gctccgcata 660 gccgcgcaga gcggcggcca tcttgttttc gctctctctc cagcgttcgc tcgctgcctc 720 gccccgcgcg ccgcgaccga ggtccgcttg agcaaacgca cgccgcgtcg gttcgttgtg 780 tacgacgcgg ccatcctttt ttttccacgt taccatcgtg cgcgctcgcc gcctcccaac 840 gcgcgcgtgg tcgaggccgc tagagcaggc gtgcgtcgcg gtcgcttcgc cctcgtcacg 900 agtggccgtc cgcttcgtag agtgtggagc agcgccccgc gcgcgtctcg ctggctccac 960 tcccttctgt cgtgcgccag cggcaagccc gcacagcccg ccacaccgcg gtgctccgag 1020 cgcacaacac cagtccgcgt agcgaatctc gtgccgctct gcgggcagcc gcacgccgcg 1080 caccaaattt ctccgcgcgt agctcgcccg acgattcgat gctgcacagc ccagtgcctg 1140 gtcagcagtc cgccaacggc caaaatatgt tgctcgctca gcagaacttc gagcttaacg 1200 ctccgccgca cggcgcggct aataacgagc agcagcagcc tcacaacagg ctccagcaaa 1260 attgcgcgtt gatcgaggac gacgatgcga acggcgtcga ggcgatctct aaatgctccc 1320 ttcctaatta ttggagtttt aaccccagat tgtggtttgc acagacggag tcggcgttcc 1380 agtccaacag aatcatcaaa gacagcagcc gctacaacct cctcgtctcg aacctgccac 1440 cagaagtagc gcaggaggtc tccgacatta tcctggcacc accagcagag caccgctatg 1500 agaccttgaa ggcagcggta atctcacgcc tcgcagcatc ggccgaccag caactccatc 1560 agctcctcaa cgaggtgcaa ctcggtgacc gcacgccttc ccagctgctt cggtacatgc 1620 gccgcctggc cggaaacgcc atctccgacg aggcgctacg catcagatgg ctcgacctgc 1680 ttccatccca ggccagccgc atgtgccgcg tgctcaggac gtccacgctg gacgagctcg 1740 ccacgctcgc cgacgaactc gtcgcgccaa ccccatcagt cagcgcagtc tcgcgcccca 1800 gctcgccagc atcgagcggc atcgtttctg gtaccaccac gccgctccca caacacgagg 1860 ttgtctcgct ccgtctcgcc atgtcgcaaa tttcggccac gctgcagcag cagtcgctcg 1920 tcctgcagtc aatactgacg gcgctcgcaa cgaaacaaca acagcaacaa cagcaaccgc 1980 agacgcagag gcgcagccgc tcgcgcacgc ccgtcagcca gcgccagcca cggagcgcgt 2040 cgcaaccatc gtccaccgca aacccagcat ggtgctggta tcatcggaga ttcggaagcg 2100 aggcaacgca gtgcaggcct ccatgctcct acccgacgca gggaaactgt tagcgccgcc 2160 gcccccgcag gcagccgcgg ttggtgctaa cgacgaacag cgcctgcacg tcctcgaccg 2220 ctcgtctgga cttcgcttcc tcgtcgacac gggctcagcc gtttcgctgt tgccccgcaa 2280 ctttctcaag agaacactac aacgaggccc tctcaagctg agcgcagcta acgccacgtc 2340 catcgacacg ttcggcgcac accacatgga cgtggacctt cgcctgcacc ggcctctctc 2400 gtggaagttc atcgtagcag acgtctcgaa tccgatcctc ggcgccgact ttctcgccca 2460 ctacgggctc gtcgtcgacg ttaaaaggaa acgtatcctc gacaccgaca cctcaagaca 2520 cgcaacatgc tcgctcaaat cgccagagat ccactccgta gaagtgtgta tatcagcgga 2580 tgtagcagaa gggcatcttc gccgacctga tgcggcaata ctcggacctc gcaaccccag 2640 gcagcacaac catcgccctc ccgggcctcg acgcgcagca tcacatcacc acaaccggcc 2700 ctcccgcagc agccaaggct cgtcgactcc ttggccctcg actcgaagcc gcacgtgcgg 2760 aattcaaggt attgttggag atggggatag tgcgcctgtc agacagctcc tgggctagcc 2820 cgctacgcct cgtccttaag ccagatggca cctaccgcat cactggcgac tacaggcagc 2880 taaacagccg caccgtgccg gatcgctacc cgctcccaat aattgaagat ctcctgctcg 2940 cgctaggagg cacaattttc tctgtcgtcg atttgaagaa agctttttac cagcttccga 3000 ttgcgcccga ggacgcgcac aaaaccgcca tcatcacccc attcgggcta tttgaattca 3060 ccagatcgtc gctgggccta cgcaacgccg cccagtctct gcagcgagcc atggaccagc 3120 ttcttcgaga cctgccattc gctcgtgcgt acctcgacga tataatcgtc gcctccaaca 3180 acgaagaaga gcacgttggt cacctcagaa ccctgttcga cgttcttcga gcggccaaca 3240 tcaaagtaaa cccagagaaa tgtgtcctgg ggaagaaaca agttacgtac ctgggctact 3300 tggtctccgc cgagggctca aggccaccac aagacagaat cgacgctatt caagcctttc 3360 cgaagcctgg caactccgcc cagcttcgcc gattgctggg cctcatcaat tactatcggc 3420 gctgcatccc aggcgcagct cgcctcctcg caccgctgaa cgacctcctg cggcacattc 3480 caccgaagaa gaaggccatt ccgatcacct ggacgcccgc cgcggaggaa gccttcagcg 3540 cgtgcaagcg cgcactcgct aacgccgcca gcacgacatt cttgcgcgac gacgcgccga 3600 ggaggctgct cgccgacgcc tctgaccacg caatcggcgc ggctctcgag cagcaagacc 3660 cggatggctc gtggaaaccc ctgggcttct tctcaaggaa gctctccaac gcagaaagaa 3720 actacagcac gtacgaccgc gagctcctgg cagccttcgc ggctatcaag catttcaagg 3780 ggattctcga gggcggccct ttcacgctcg taaccgacca taagccgctc accttcgccg 3840 cacagcagcc atcggagaaa gcatcacctc gccaagcacg ccagcttgac tatatcctgc 3900 aattccaggt cacactcgca tacacgaagg gtccagacaa cgtggtcgcc gatgctctct 3960 ccagggtcga gacgattggc atgccagcca atttggacct ggcgacgctc gcccaacacc 4020 aagcaaccga cccagacctg ccacacatcc tcgaagctcc aacaccgtct ttagtgctac 4080 gcccgctcga catcgaccag acgacgctcc actgcgcgat agacggcaac caggtgagac 4140 cgtacatccc tcgacagctc cggaggaccg cgttcgacgt cacacacggc ctttcgcacc 4200 cgagcgttcg agcgtccatc cgccttatcg cgcagaagta tgtgtggcca ggtatgcgca 4260 aggacatcgc gcgctgggct cgcaactgcg tccagtgcca gcaatccaag gtgcatcgcc 4320 acaaccgtgc tgctctgggg gacttcgcag ctccagacgc tagattcaac cacatccact 4380 tagacatcgt caagatgccg ctgcactccg gcttccaata ttgtctgacc atcatcgacc 4440 gcttcacgag gtggcctgaa gccatcccaa tgccggacca gcaggccata acgaccgcca 4500 gggcttactt cggacgctgg atcgctgtct acggcacacc tttaacgatt acgacagacc 4560 agggagcaca atttgaagcc gctctgtttt cagaattggc aaagctcatc ggcgccagca 4620 gggttcgcac gacaccgtat catccccagt caaacggcat ggtcgagcgc ttccaccgta 4680 cgctcaaggg cgccttaatg tgctgcgcac ccacgccttg gcccgacgct cttcccgctg 4740 tacttctcgg actgcgcacg accttcaagg aggatctcca ggcatcgcca gccgagatgt 4800 tgttcggcac gacactccgc atcccaggcg acttcttcgt cccgtctagt catcctggag 4860 ccaacgcacc cgctttcgtc gccgagctga gagcgctgat gcagaggctg cgcgccgtac 4920 ctggatccag gcacgcaccg cctctcgcgc ctttctttca tcccaacctt cgcacgtgca 4980 cgcacgtctt tagacgtgtc gagacggttc aggcagacgc tgcagccacc ctacaccggg 5040 cccatatcga gtcctggagc gcctgagcga ccaggtttac cgagtcgaga tcaacggcca 5100 gtccaaggcc atctccactg catcgctgaa gccagctttc ctcgagtcgg cggatcgaga 5160 ggactctccg ccggctccac agcctccagc tcaacctggc tcgcagcctg cccaggcgcc 5220 ggccccacca tctccagcat cccagtcgtc ggcggatcga gaggactctc cgccggcccc 5280 gcagcctgcc cgggagcagg cctttccacc tgcgactcca cagcggtcgg cggatcgaga 5340 ggtctctccg tcggcctcac cgcctgcccc agagcaggcc tcaccgcctc cggtgccgcg 5400 gccagagccg ccgccggccc tccagccgtc ccagcagcag gccgcaccac caacgcccgg 5460 cctgccaccc tcggcagtgc gaccagccca gaaccacacc ctcaacaggc aaccacggcg 5520 acatcgtgta tcgttcctcc tggcacctga cgtcgtcact ggtgggggag tagc 5574 // ID Crack-9_CQ repbase; DNA; INV; 2335 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2335 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 40-40 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 177..2333 FT /product="Crack-9_CQ_1p" FT /note="reverse transcriptase." FT /translation="RPSNETSLQQRAGSCGLQRRASRERNQRDHLHLSDHN FT LIVSFFKSSTPYLKQTHQKQIVDHNKLNEMFIQSMAGLPRSLPAEEKLIHI FT IDRYNSLRQQCTKTVTIKAKVKGHCPWMTFNIWKLIQIKESILKRRRRNPD FT DQHLADLLAHVSRKLQNEKSASKRNYYGNLLANGDQKTAWRVINDALGKQA FT SRTHPNELCVNGRSTKDLNEICQLFNDFFCDVGPNLAATINSDRNINKFGT FT ITPLRWNIYLRPTTAAETTILINELNSKKASGPDNIPVTFVKTHFNIFSLL FT LADVFNEMLETGNFPQCLKIAKVIPVYKSGDTKDPSNYRPISCLSVLDKII FT EKLLVTRIMDYANHLKIIYEHQYGFQKAKSTLHATCDLVEDIYDSLDRREI FT DGAVFIDLKKAFDTIDHQLLLEKLSSYGFRGVARSLIQSYLSDRLQFVAIG FT ESRSSPGLVTTGVPQGSNLGPILFLLFINDLAKLGLRGKVRLFADDTSLFY FT KGKDCTTIQQHIREDLEKLCSFFQTNLLSLNLAKTKYMLIHSPRRPLPARQ FT PILINNHTIEEVSNYPFLGLTLDNTMSWDAHIDQLKRKLSSIGGVLRKVSS FT FLPPSALKALYFSLVHSRLIYLITIWGHANCHRLRELQVTQNRCLKTVLRK FT PHLFPTRQLYEDPDNSFLPVRALHELQMVTHIWKNPSLGQGRMHVNRASRQ FT EGHFYLPRPRSEFGRKKIK" XX SQ Sequence 2335 BP; 683 A; 615 C; 491 G; 546 T; 0 other; cataccacca gcattttggt agtcatcaat gaactggacc aaatcttatc gagcacaacc 60 agcagccaag aatgtatcct gcttggcgat atgaacattc ctgtgaacct tcctgctgtc 120 ccttccgttc gtgactacac gcacctactt gcgtcttaca gcttagctgt gactaacgac 180 cgagtaacga gaccagcttg caacaacgtg ctggatcatg tggtttgcag cggagagcta 240 gccgtgaacg taaccaacga gaccatctac acctgagcga ccacaaccta attgtctcct 300 tcttcaagag ttcgactccg tacctgaagc aaacacatca gaagcaaata gttgaccaca 360 acaagctgaa cgagatgttc atccaatcca tggcagggct accgcgatcg ctgccagctg 420 aggaaaaatt aattcacata attgaccgct ataactcact caggcaacaa tgcactaaaa 480 cagtaacaat taaggctaaa gtaaaagggc attgtccatg gatgaccttc aacatatgga 540 aactcattca aattaaggaa agcatcctta aaagacgccg gcggaatcca gatgatcaac 600 accttgcgga tcttttggcg catgtttcaa gaaaactgca gaacgagaaa tcagctagca 660 agagaaatta ctacggcaat cttctcgcta atggtgacca gaaaacagca tggagggtta 720 tcaacgacgc tctagggaaa caagcctcgc ggacccaccc gaacgagcta tgcgtcaatg 780 gaagatcgac taaagatctg aatgagatct gtcagttgtt caatgatttc ttctgcgacg 840 tcggtcccaa cttagcagct acgattaaca gcgacaggaa catcaacaaa tttggaacta 900 tcacccctct tcgttggaat atttaccttc gacctacgac tgcagcagaa acaacgattc 960 ttatcaacga acttaactcc aagaaagcat ctggtccgga caatattcca gtcacattcg 1020 tcaaaacaca cttcaacatt tttagtcttc ttctggctga cgtcttcaac gagatgctcg 1080 aaactggcaa cttcccccaa tgtttgaaga ttgccaaagt aattcctgtc tacaaatccg 1140 gagacaccaa agacccaagc aactaccgtc caatatcctg tctctctgtt ctggacaaga 1200 tcatcgagaa gctgctagtt accaggatta tggattatgc taaccatctc aagataatct 1260 acgagcacca gtatgggttc cagaaagcga agagcacctt acacgcaacc tgtgatcttg 1320 tagaagacat ctacgactcg ctggacagac gggaaattga cggagctgtg ttcattgatc 1380 ttaaaaaggc ttttgacacc attgaccacc aactgctact ggaaaagctg agttcttatg 1440 gattcagggg cgtggcgcga tcgcttatcc agagctacct gtcagaccga ctacagtttg 1500 tcgcgatagg tgaatctcgg agttcaccag gcctcgtgac aaccggtgtt ccgcagggta 1560 gtaatctcgg tccaatcctg ttcctgctgt tcataaacga cctggcgaaa ctgggtctgc 1620 gaggaaaagt tcgtttattc gctgacgata catctctgtt ctataaaggc aaagattgta 1680 caaccatcca gcagcacatc cgagaggacc tcgagaaact ctgcagcttc tttcagacga 1740 acctcctctc gttgaacctg gccaagacga aatacatgct gatacactcc ccccgaagac 1800 cgctcccagc acgtcagccg attctgatca acaatcacac aattgaagaa gtttccaatt 1860 acccattcct cggacttacc ctggataaca caatgagctg ggacgctcat atagatcaac 1920 tgaagcggaa gctatcttca ataggtggag tcctacgaaa ggtgtcgtcg ttcctgccac 1980 catctgcact aaaagctctg tatttctcgc ttgtccactc tcggctgatc tacctgataa 2040 ctatctgggg acatgcaaat tgtcatcgtc tccgggagct gcaagttaca caaaataggt 2100 gccttaagac cgttctccgc aagccgcatc tgttccctac gcgacaacta tacgaggatc 2160 ccgacaactc ctttctacca gttcgagctc tgcacgagtt gcagatggtg actcatatat 2220 ggaagaaccc ctccctaggc cagggtagga tgcacgtgaa ccgcgccagt cgacaggaag 2280 gacatttcta ccttccgagg ccgagatcag aattcgggcg gaaaaaaatc aaatt 2335 // ID Gypsy2-I_Dpse repbase; DNA; INV; 4676 BP. XX AC Unknown_singleton_20; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_Dpse; KW Gypsy2-LTR_Dpse; Gypsy2-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4676 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1033-1033 (2009). XX DR Genome; Unknown_singleton_20; Positions 16472 11797. XX CC Positions [2189-2788] - Reverse transcriptase CC Positions [3803-4279] - Integrase core CC LTRs are 83% similar to each other. XX FH Key Location/Qualifiers FT CDS 2636..4330 FT /product="Gypsy2-I_Dpse_1p" FT /translation="MDDIIIPAESELEGINKLKRVLSLAEASGLKMKWSKC FT QFLQRRVIFLGYIIEDGTIIPSREKTGAVENFPVPRDRKGVQRYLGLTSYF FT RRFVKDFATIAKPLTNLLKKEVSFKMGMEELASFEQLKLRLTNPPVLLLFN FT PKRVTEIHCDASKFGYLAILLQRSSEDQQFHPVEYMSRKTTPTEEKYHSYE FT LEVLAIIQALKKWRVYVLGMKIKIVTDCNAFAMTMKKREVPLRVARWAIFL FT QDFEFDIEHRSGVRMKHVDALSRVHCLLLEDTIRIKIQTAQKLDEWTSVIV FT KVLEQGTYDDFYVQHGILHKDPTKELIVIPSSMEREIILMAHRQGHFGVKK FT TTDLVQRDYYIPSLPSKVEVIVRSCLECIVSESKHGRKEGFLNLIDKGDEP FT LNTFHVDHIGPMELTKKRYNHILVVVDAFSKYVWLYPTKSTGAEEVVDKFQ FT RQSELFGNPRRIVTDRGTAFTSNIFKEYCDSQNIQHLLIATGVPRGNGQVE FT RLNRTIISLLTKLCAEDPKAWYKNVGRVQQFINSSPPRSTKVSPFKILTGI FT DMRIGYDIELKDMWKKSC" XX SQ Sequence 4676 BP; 1649 A; 736 C; 1172 G; 1119 T; 0 other; acaattattg cgagaaagta gcctcaaact cgatacaacc gggaggatta tacgcgcata 60 tgtataagtc cccccacatt aatgggggct cgtccagtaa aaaaaaaaaa agcaatagtg 120 tacatcaatc attaaatcgt gtgtacatta tcgccacaaa acataaaaca cccaggagaa 180 gcagaatttt gaacgtctag aaatttctgc aaaggagaga acgagacagc acattgagca 240 gcacacacac agaaatggct atgtcacgac aggcctaaat ttcacgagag cgagtgagac 300 agagtatgac agcaaaggca agcagaagag cagttacggc attatccgtt aagttcgtgc 360 atgtctgagc gacggtagaa ccacagacga caacgaaaat tcgagaagaa gcacataaat 420 agtgcacaag agacgagaaa attatgtgaa actgtgagag aggcagtgga acagatggaa 480 aagaggcaca cagagagtgc ataaaagaga gttgaaaact tcgatgtcga actgttagag 540 agagatacag agaagcaagg agtaacaaca acaacagaat ccagagagcg gcattcacct 600 acacaggcga cgacaatatt catcagatgt gtgtataagt gcagaacgcg acgaaagggg 660 tccagaaaga gcagagaaca tcagaattca gagagcgggc ttcacacaca acgacgacgg 720 catcaaccag cagataggaa caaggcgtga cgagacgatc gagagcagag gaaaagaaga 780 aaatcaaagt cccgagtgcg gcagccacat acacagacga cggcgttatc catcagacgt 840 gtgtatgggt gcagagcaaa gaaacgacgt aagaaacaat tagcgtaata gcgacagatg 900 tcaaagagag acagcagagt aagagtaaca gagataaagt ataagaaatt taaagagtga 960 gataaaacat attattcgta aacattaatc aaaattctta aacattattt taagtttata 1020 tcatattaat tacaatacta atgttgcaca acgagatcac acaactcacc gtagtcgagc 1080 taaaagaaaa gttaaaagag ttgaagctga aaactaaggg gaacaaagca gagttacaag 1140 acagactaat atcatatttc aagtcggagg ttaatgaaga gtcaatgtat gaggatacaa 1200 cagattttag tttgaatatc cagagtgaac aattagaagc tccaagtaaa atggcgttta 1260 cattgaaaga tattcgtgat tctttagttg agtttgatgg cagccagcat aaagatatta 1320 acgattggct tacagatttt gagagcacag cagagacagt acagtggaac gcgctgcaaa 1380 agttcgtata tggtaggcaa ttgctgaaag gagctgcgaa gttgtttgta aacagtcaag 1440 atggtcttac taattgggat acacttaaag aagctttaga ggaagagttt acagagaaac 1500 tgtcggcaaa gcaagtacac aagcaactcg agaataggaa gaagaagccg aatgaaagtt 1560 tggtagaata tttttatcaa atgaagagtc tggccaaaaa gagtaattta gatgaagata 1620 gcgtcataga atacataata gaaggtatac ctgcaaggcg caagtgaaag aattagtggg 1680 aatcggagga aaaagaattg gtaccctcgg atttttcctt gttgatactg agctagatgg 1740 catcccgatt gagatgtgtt ttcatgtggt caaagataga gacactttgt atagcgcggt 1800 aatcggtagt gacgtattag aggtagtgag cgttacactt gggaaaaaag gagtagtttt 1860 tcataaggtt gaagaggtga gggataatga aaagagatcc gagatagagg atagcgctca 1920 aaccagtatt gctcattata gtgatattaa gagctctatc gatgtttcac cggcgaatcg 1980 cgtagcagag ttaaaagaag cattcgagca gataatgacg ataaaagaga tagaagattt 2040 ttcgtcagag gtcgatcttt cgcatttgag tgttatgacg aagggaatag tgaaggatat 2100 gattacccag tataatccag taaagtcgac taattgtcca gtggaaatga agattatact 2160 catggacgaa gtgccagtat atcagagacc gagacggttg ccttatgacg accaagaaag 2220 ggtagatcag caaattaagg aatggttgga gaaaggaatc attcgacaca gtgtttcgga 2280 atactcttcg cctattgtgc tagttccgaa gaaagacggt aagaaaaggt tatgttgtga 2340 ctataggaag ctcaaccaga agattatacg agataatttt ccgacagcag ttatcgatga 2400 tgtgttacat aaattgcaaa gaggaaagat attcacaaca ctcgatttgt gtgacggata 2460 gtttcacgtc ccagtagagg agaactcgag gaaatttacg tcgtttgtca cccagaacgg 2520 ccaatacgag tttaattttg taccatttgg gattaacaat tcggcagcgg tgttcacgcg 2580 ctatatattc gcggtgctga ggccacttat tagcgaaggc gttttgatat tgtatatgga 2640 tgacattatt ataccagctg agagtgagtt agaaggaatc aataaactga agagagtttt 2700 gagcttggca gaggcgtcag ggttgaaaat gaaatggagt aagtgtcagt ttttacagcg 2760 cagagtcatt ttccttgggt acattataga agacgggacg attataccgt caagagagaa 2820 aacgggagca gttgagaatt ttccagttcc acgagataga aagggagtcc aacgatattt 2880 gggtctcaca tcatatttcc gtaggttcgt gaaggatttc gccacgattg ctaaaccact 2940 tacaaatcta ttgaaaaagg aggtttcttt taagatgggg atggaagaac tggcctcgtt 3000 tgaacagcta aagcttcgat taaccaatcc accggttttg ctacttttca atccaaagag 3060 agttacagaa atccattgtg atgctagtaa atttggttat ctagccattt tgttgcaaag 3120 aagctcagag gatcaacagt ttcacccagt agagtatatg agtaggaaga ctacgccaac 3180 tgaagagaaa tatcattcat atgagttgga ggtcctggct ataatccaag cattgaagaa 3240 atggagagta tacgtattgg ggatgaagat aaagatcgtt acagattgca acgcttttgc 3300 aatgacgatg aaaaaacgag aagttccctt aagggtagcc cgctgggcaa tatttttgca 3360 agattttgag tttgatattg aacatcggtc gggcgtcaga atgaaacacg tcgatgcttt 3420 aagcagagta cattgtcttc tattggaaga tacgattagg attaaaatac aaactgcaca 3480 aaaactagat gaatggacaa gcgtaatagt taaggtgtta gaacaaggaa catacgatga 3540 cttttacgtg cagcatggta ttttgcataa agatcctact aaggagctca ttgtgatacc 3600 atccagcatg gaaagggaaa ttattctaat ggcccatcgc caagggcatt ttggggtaaa 3660 aaagactact gatctagtac agagagatta ttatataccg agtttgccga gtaaagtaga 3720 agtaattgtt aggtcgtgct tggaatgtat tgtcagcgag tcgaaacatg gtagaaaaga 3780 aggatttctg aatttaattg ataagggtga cgaaccgttg aatacttttc atgttgatca 3840 tataggccct atggagttaa cgaaaaaacg gtacaatcat attttggtag tagtagatgc 3900 tttctcgaaa tatgtgtggt tgtatccgac gaagagcact ggtgcagaag aagtggttga 3960 caagtttcaa aggcaatcag agttgtttgg aaatccaagg cggatcgtta ctgaccgcgg 4020 tactgctttt acttcaaaca tctttaagga gtactgtgac tcacaaaata tacagcattt 4080 gcttattgcc acgggagtgc cgaggggaaa cgggcaagtg gagcgtctaa atcggacgat 4140 catctcgttg ttaacaaaat tgtgtgcgga agatcctaag gcatggtata agaatgtcgg 4200 tagagtacag caattcataa attcgtcacc cccgagaagt acgaaagtgt cgccatttaa 4260 gatcctgaca ggcatcgata tgagaatagg gtatgatatt gagttgaagg acatgtggaa 4320 gaagagttgc tagccgagct acagagtgaa aaagaagaga tacgaaaaac agtgaaagga 4380 aacatagcta aaatgcaaga agaaaatagt agaacgtata acaggaatag gagagaggag 4440 cgcgagtatg aggtggatga attggttgcc atcaaaagaa cgcagtttgg agcgggtttg 4500 aagttaagga aaaaatattt aggaccttat aaaatagtcc gcatattgag gcatggtaga 4560 tatttagtcg aaaaagtagg agaggaagag ggacctaacc ggactaacac agtggcagag 4620 cacatgaaaa agtggtatcc atcattcggg acgaatgagg atgtcggatg gccgaa 4676 // ID BEL-8_CQ-I repbase; DNA; INV; 5685 BP. XX AC AAWU01032143; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_CQ_; KW BEL-8_CQ-LTR; BEL-8_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5685 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 169-169 (2011). XX DR Genome; AAWU01032143; Positions 7446 13130. XX CC Positions [4630-5220] - Integrase core CC 'CCGGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 363..1706 FT /product="BEL-8_CQ-I_3p" FT /translation="MSKQLKQHLIRRNNILGSLKLIHHYDDNFVFDRDFPQ FT LKFRLEKLDHLWDEFNEVQAEIEFSQPLTDEPDAVRTQFENDYFELKGALS FT SKLEIATQNTHAPPSPIAPPAFHSSVRLPELKIPDFHGDYDEWLNFHDLFN FT SLIHSNHHLSSVQKFQYLKSVLKGDAARRIQNLDVNTANYTLAWNIVKKKY FT DDKNLLLKQHYMALLSIPSVHKESSTALSELADEFEKRVGVLNKLEESKDH FT SNSFLVTLLSIKLDPGTQTAWEHSLEDDQKPTYTDLVAYILKRSRTLQSLK FT LSQTSQHGTKPEPKVEHKSTRSKTSAYHSTNTSDTLSQCKLCKQEHMLHQC FT DEFKKLSPQNRFDVAKKHGLCLNCLKANHMMKACTAGSCRTCNKRHHTLLH FT LNTAAADKPPASTVQVAHCQQSSRLASVVESPSSVTYTAEQCHVVPREHAD FT RLR" FT CDS 3448..5631 FT /product="BEL-8_CQ-I_2p" FT /translation="MLKIPRRVLIDHSVVIELHCFSDASDKAYGACVYVKS FT TNSYGRSKVNLLISKSRVCPRRRLTIPRLELCAALLGAQLTAAVLEATNLS FT CPVVYWSDSAIVLHWIASCSSTWKVFVSNRIAEIQRLTQGMPWRHIPTHLN FT PADLISRGLLPSELVGCVLWEHGPEVIELPSSHWPQNCIELSPLHMQARDD FT EARQTVSVLTTSSLEDRIQIQDHSALLPLLRSVSWIRRFTDNCRSRPEERR FT NGFLKFREIDDTLKLLITLTQRTHFAQEIKHLLKNPGPVRKDFDLKSQIKT FT LNPFIDPEGLLRVHGRLENSSLPFDTKYPIILPAKAHLTHLIAKNTHWDTL FT HSGPQLLLSTLRQRYWPVRGRDVARRVVNECLDCFHCKPRSIDQMMGPLPA FT ARLVPSRAFASSGVDYCGPFAVRPPNRRGASVKVFVAIVVCMWSKAVHIDM FT VYDLTSASFINVLKRLVGRRGRVTDLYCDNARTFVGANRHLEELRRAFDAM FT HSSDAVAAHCAEKGITFHFNPARSPHHGGLWEAAVKSFKHHLYRVMKDTLL FT TIDDFNTLIVQIEGVLNSRPLTPLSSDPTDVVALTPGHFLVGEPLFSIPEP FT DLCDISINRLNSFQKMQQKLQHFWRAWSRDYIGHLQNRKKWPVVRPNIEVD FT TMVLLKQDNAPPMRWKLGRIEAVFPGKDGLVRVVDVRTAHGTYRRAITEVC FT PLPIEQPSEEPSEQPSDLPTDAAVQPSH" XX SQ Sequence 5685 BP; 1375 A; 1663 C; 1385 G; 1262 T; 0 other; ttttttggtg ccgaaacccg ggattgcctg gtcattacca tcgtcggccg gcctgaggag 60 aacatcgtgg gaatcttgga ctccggcaac aaacgtcgtc tgtcggagag tagctgattt 120 cgtcctacga gggatcttcg gtcggaacaa tctgcccacg gcagtcttct cgacctggac 180 atttgtacga ggcaaggaca tcgttggact gtgcacgtgg aaaccgcgtg gacgaggaag 240 caggtctagc caagcgagca aaggtaacct cacagcagca gataaggtga gttacaagtg 300 aaccacaatt gcgacaaata aagtgtgaat tcacggggtt tgaggatttt gcgctcatca 360 caatgtcgaa gcaactgaag cagcatttga ttcgtcggaa caacattttg ggttccctta 420 agttgattca ccattacgac gataattttg tgttcgatag ggatttcccc caactcaaat 480 ttcgcctcga gaaactggat catctttggg atgaattcaa cgaggtgcaa gctgaaattg 540 agtttagcca acccttgacc gacgagcctg acgctgttag gactcaattt gaaaacgact 600 actttgagct gaagggagct ctttcgtcga aattggaaat cgccacccag aacactcacg 660 cgcctccctc ccccattgct ccacctgcat ttcattcctc cgttcggctc cccgaattga 720 aaatccctga tttccacggg gattatgatg agtggttgaa ctttcacgat ctgttcaact 780 cacttattca ttcgaaccac catttgtcgt cggtgcagaa gtttcaatat ttgaaatctg 840 tgctgaaagg ggatgctgcg cggcgcatcc agaatttgga tgtgaacacg gctaactaca 900 cgctcgcgtg gaacattgtg aagaaaaagt acgatgataa gaatctgctc ctgaaacagc 960 actacatggc gctgttatct attccttctg ttcacaaaga gtcgtcgaca gcgctctcgg 1020 aactcgctga cgaatttgag aagcgcgttg gcgtgcttaa caagcttgaa gagtccaaag 1080 atcactcgaa ctcgttcctc gtcacccttc tcagcatcaa gcttgatcct ggaacccaga 1140 cggcgtggga gcacagcttg gaggacgacc agaaacctac ttacaccgac ctggttgcat 1200 acatcctcaa gcgctctcgc actttacaat ctctaaagct ctcccaaact tcgcaacacg 1260 gcacaaaacc cgaacccaag gtcgaacaca aatccactcg ctcgaagacc tcagcgtacc 1320 actctaccaa cactagtgat acgctatccc agtgcaagct gtgcaagcaa gagcacatgc 1380 tgcaccagtg cgatgaattt aagaagcttt cgccacaaaa ccgcttcgat gttgccaaaa 1440 agcacggtct ctgcttgaac tgcttaaagg caaaccatat gatgaaagca tgcactgcgg 1500 gttcgtgccg aacgtgtaac aaacgccacc acacactgtt acacctgaac acagctgccg 1560 ccgacaaacc accggcctca actgtacaag ttgcccactg tcaacaatcc tcacgattgg 1620 cttcggtcgt cgagtctccg tcgtcggtta cgtacactgc cgagcagtgt cacgtggtcc 1680 ctcgggaaca cgctgaccgc ctacgttaga gtgatgtgtg cagatggcac acacatccat 1740 gctcgagccc tgttggattg cgcctccgaa gctaatttcg taaccgaatc tctcgctcag 1800 gctctgcggt cgaaacgcaa acccgcaaac gtcgatgtgt acggtatcag tcaaactgtc 1860 aagaaagtga agcaccaaac cacaatcact gtttcgtctc gtgtgggccc gtacactaca 1920 agcatggact ttctgattct gccctcccta accagaatcc ttcccactac caacgtggat 1980 gtttcgaagt gggtcattcc tcgccacctc ccacttgccg atccaaaatt caacatcgct 2040 catgacgtcg acatgatcat tggcgtcaag cactttttcg ccatcctgca gggtgaacaa 2100 ctatcgattg gatctggcct accaacacta cgcaacactg tctttggata cgtcgtcgct 2160 ggcgacacgg atgcacctgc acaacaatcc gtaacttgca acgttactgc tgtcgaacgg 2220 ctggaagctg cagttcgcaa gttctgggag gtggagagct ttgagacagg gaaagccctg 2280 tccctcgagg aacggtattg cgagaatcat tttgtgaaga cccacactag agcccctgac 2340 ggtcggtacg tggttcgtct gccaattcgt gaggagttgc gatctagttt gggagagtcg 2400 attgcggtcg ctcaacgcag attctttagt actgagagga agttcacgtc gaacccggag 2460 ctgaacaagg agtactcaaa gttcatggcg gagtgcgatt agctcgggca tatggaagag 2520 gtcaccccag atctcagcaa gcctcacttc taccttcctc accacgcgat ccagcgacca 2580 gagagcacaa caacaaaaac cagagtggtc tttgatgctt cgtgtagatc gtcaaacaac 2640 atctcgctca acgatctatg ctacattgga cctacggtgc aaccaccact cttggacacc 2700 attctgcgat ttcgtctgcc aaagtacgtc gtcagcgtca ctgccgacgc ggagaaaatg 2760 tacaggcagg tactcgtcca tccggacgat cggccgttgc agcagattct atggcgatca 2820 accccgaacg aggacctgaa gacgtatcag ttgaatactg ttacctacgg taccgccccc 2880 gcgccgtacc tcgcaacccg cgtcctgaac cagctggccg acgacgaggc agaaaactac 2940 cccctcgctg ctcctcaagt taagcgatcg ttttacgtcg acgactacct atcgggtgac 3000 aacgacgaga accgcttgat ggaaactaat cgtcaactca tcggacttct tggatctggc 3060 ggtttcacca tgcgcaagtg gtgcagcaac agttctcgcg ttctgtctca cattcccgag 3120 aaactccgag atccccgaac cgagttggag ctcagcgaat ccggatcgat caagacgctc 3180 tgccttctgt ggcagcctgt ggcggtgcct gaccacctca gctgcaaggt tcccgaatac 3240 aactcgacgg agccaatcac gaaggcactc attctctcgg aattgtcgca gctgttcgac 3300 ccaactggta tggcaggacc agtcattgtt cgagcaaaaa tgtttctcca atcgctgtgg 3360 gaaaagaact tcgggtgaac ccaagagctg ccagaggagt atcaagaatg gtggaagcag 3420 tacagaatcg aaattcgcgt cctcagcatg ctcaaaatcc ctcgccgagt tctaatcgat 3480 cactcagtgg tcatcgagct gcactgtttc tcggacgcgt ccgacaaagc gtacggcgct 3540 tgtgtctacg tgaaatcaac caactcgtac ggcaggtcga aggtcaacct gctcatttcg 3600 aaatctcgcg tctgtcctag gcgcagatta acgatccccc gtctcgagct gtgcgcagct 3660 ctgctggggg cccaactgac cgcagctgtc ctggaagcta ccaacctgag ttgtccagtg 3720 gtctactggt ccgattctgc gatcgtcctg cattggattg cgtcgtgctc atcaacctgg 3780 aaggtgttcg tctcgaaccg gatcgcagaa atccaacgcc tcacgcaggg catgccctgg 3840 cgtcacatcc caacccacct gaatccagca gatcttattt ctcggggact tcttccttca 3900 gagctggttg gatgtgtatt gtgggagcac ggaccagagg tcatcgagct gccctcttcg 3960 cattggcccc aaaactgcat tgaactctca ccattacaca tgcaggctcg tgacgacgaa 4020 gcacgccaaa ctgtgtcggt gctcacaaca agctctctcg aagatcgcat ccaaatccaa 4080 gatcattccg cactcttgcc cctgctccgc tcagtctctt ggattcggag attcacggac 4140 aactgtcgca gccgtcccga agaacgacga aatggttttc taaagtttcg agaaattgac 4200 gacacactca agctgctcat tacactgacg cagcgcaccc acttcgctca ggagatcaag 4260 catcttctca aaaaccctgg cccagtccgt aaagacttcg atttgaaatc ccaaatcaag 4320 accttgaatc ccttcatcga cccggagggt ctgctcagag tgcacggccg gttggaaaac 4380 agcagcttac cgttcgacac caagtacccc ataattttgc ccgccaaagc acatctgaca 4440 catctgatcg ccaaaaacac ccactgggat actctacact cgggtccaca actgctgcta 4500 tcgacgctac gtcagcgcta ctggccggtc agggggcgag atgttgcgag gagagtcgtc 4560 aacgagtgtc tggactgctt ccactgcaag ccaagaagca ttgaccagat gatggggcct 4620 ttgccagcag ctcgtctcgt tccgtcgcgc gcgttcgcga gcagtggtgt cgactactgc 4680 gggccgttcg ccgtaaggcc accgaatcgt cgcggcgcgt ccgtcaaagt attcgttgcg 4740 atcgtcgtct gtatgtggtc gaaagcggta cacatcgaca tggtgtacga cctcacgtcc 4800 gcgtcgttca taaacgtcct aaagcgcctc gttggacgtc gcggccgcgt gaccgacctc 4860 tactgcgata acgcccgaac gttcgtcggg gctaatcgcc acctggagga actgcgccga 4920 gcattcgatg cgatgcactc gtcggatgca gtcgccgctc actgcgcaga gaaggggatc 4980 accttccact tcaaccctgc ccgatcgccg caccacggag ggctgtggga ggccgctgtt 5040 aaaagcttca aacatcacct ctaccgcgtg atgaaggata cgctgctcac cattgacgat 5100 ttcaacaccc tgatcgtcca aatcgaaggc gtccttaatt cccgacccct tactcctctt 5160 tcctccgatc ccactgacgt ggttgccctc actcctgggc acttcctcgt cggagaaccg 5220 ctgttctcca tccctgagcc cgatctgtgc gacatttcta ttaatcggct caactctttc 5280 cagaagatgc aacaaaagct ccagcatttc tggagagcat ggtcgcgcga ctacatcggc 5340 cacctccaaa accgcaagaa atggcccgtg gtccgcccaa acatcgaagt cgacacgatg 5400 gttctcctaa agcaggacaa cgctcctcca atgcgctgga agctgggccg gatcgaggcg 5460 gtgttccccg gcaaagatgg gctcgttcgt gtggtggatg tccgcacggc gcacggcacg 5520 tatcggcgtg caatcacgga ggtctgtccg ctgcccatcg agcaacccag cgaggaacca 5580 tccgagcagc catccgacct accaaccgac gcagctgtcc aaccatccca ctgagcctgg 5640 aggcgcttgg tggtttcttg aaagtgcctt tcaacggggc cggca 5685 // ID Gypsy-91_CQ-I repbase; DNA; INV; 6656 BP. XX AC AAWU01007361; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-91_CQ_; KW Gypsy-91_CQ-LTR; Gypsy-91_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6656 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 561-561 (2011). XX DR GenBank; AAWU01007361; Positions 19552 26207. XX CC Positions [4901-5377] - Integrase core CC 'TTAT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 465..2588 FT /product="Gypsy-91_CQ-I_2p" FT /translation="MDFDNLYKTMDVSHLSVEEVEHELLIRNILYDFKTHE FT SVKRRKLKDRMKEEKEMGVQPAALARTWRSTPDEIKITSTAIKLIHKMLNA FT KHDKEVRRRLRTRMIHYRVRLEHLATSADAHKYVTKINEIQFEIAEIFENF FT FVKASAKESQSEAEKKAIRDKISEVLKTVQNELDVLNETASALEIDKEIEQ FT DESKEIQAKEGGNDKLQDVIAKLSDYEQGKVADVDSLLKLFKDFVVDTAVD FT EKQRQKAEIEREKARKQEIAVNIEQKEKLEEILLQVTNELQKMEALQSKDP FT ASEKNLKTDKKKTPNSSDSDSESFDEEEEERGTRRLKNKNSKRKKKISFSG FT LSSTEISSSSSSSSSSDTSSSSSSSESSSDSEEERKKKKKRKRKNKKNNRN FT QHMKRMPVAEWKLKYDGKDGGKILTEFLKEIKMRCRSEKVSDKEIFRSAIH FT LFTGRAKDWFIDGTECRDFRNWKELKKELKREFRPHDLDFQMEVQATNRKQ FT ARGEKFADYYYEMKKIFQNMNEQISDKKKFRLIWRNMRHDYKNALTGAKIG FT SLKRLRKLGKIVDENNCNLYQKPTEFVNRPRSAQINEVVSSGKAKSKQNPY FT SSGNQSKSFTTGKNNNSKPKVENQGKEGNEKEQKGEEKQDCMEGTAKSTFN FT QLVAQYKRPPLGVCYNCRESGHHYAECPKPKDKFCRLCGFADVSTTNCPAC FT QKNADSSA" FT CDS 2768..5767 FT /product="Gypsy-91_CQ-I_1p" FT /translation="MGREMLGLLDSGAQCSVLGNGSENFVKSMKLTLYPSL FT VKVKTAGGNCVPVKGYVHLPITFNNTTKIINTLVAPELKRKLILGYGDFWK FT AFEVRPSVPMEVSIEEMECMLGEEEGNRNELTVEQRKQLEEVKKLFKVAIE FT GECLETTSLMSHNIELKDEFKRAPPVRINPYPTSPEMQRKINVEIDNLLAQ FT QVIERSKSDWSLSTVPVVKPTGEVRLCLDARRLNDRTVRDAYPLPHQDRIL FT SRLGASKYLSTIDLTKAFLQIPLDPSSRKYTAFSVLGRGLFQFTRLPFGLV FT NSPASLARLMDEVLGFGELEPNVFVYLDDIVVVNDTFEAHLDSLREVAKRL FT QKANLSINIDKSKFCLSELPYLGYILSPEGLRPNPDRISAIVNYERPQSIR FT ALRRFLGMANYYRRFISNFSEKTAPLTNLLRKKPKSIVWNDQAEESFNKIK FT ESLISAPVLSNPDFYLQFTIQSDASDTAIAAILTQQHEEGEKIIAYFSRKL FT TPAEQSYAATEKEGLAVLSAIEKFRPYIEGSYFVVITDASALTYIKNGKWK FT TSSRLSRWSIELQGLHCEIRHRRGRDNIIPDALSRAVEMIEVEELVDDWYS FT GLFRKVQNSPDEHLDYKIESGKLYKFVPTKSDVFDYRYEWKLCVPEALRSD FT ILRREHDNSFHIGYEKLLEKVRQKFFWPNMASIIKKYAQNCKSCKEFKPTN FT VSQHPEMGKQRLTTKPFQILALDFIQSLPRSKSGNQHLLVLMDLFSKFTIL FT VPVRRIGTDLITKILQEQWFLRYSVPEIIISDNASCFLSKDFKAFLDRFNV FT QHWTNARHHSQANPVERLNRSINQCIRTYVKNNQRLWDTRIAEVEFTINNT FT KHSSTGFSPYRIIYGHEIVASGEDHKLDVETKELTEQERIEKKLKIDRTIY FT DTVYSNLQKAHEKSKKNYNLRFQKPAPVYEIGQKVYKRNFTLSSAGDAYNA FT KLGPAYVSCTVRARRGTNSYELEDEDGKNLGVFSSADLKPGPPDS" XX SQ Sequence 6656 BP; 2254 A; 1132 C; 1486 G; 1784 T; 0 other; attggcgccc aactataatc aactaagctg atcacgggtt cagggaaaca agctagagtg 60 gagagtgtta gggccaaggt gtagatctgt tgtgtgctct acagagtaaa gggtggaatg 120 ttcaattgtg acactattct catcgtcact acgaattgat aagatttgtg ttttattgcc 180 ggtgcgcgta cggtcattag agagaatatt gaagggttaa aagtaccaaa tagttgctcg 240 cgactgattt acgaccgaaa gggtcaattt tgctttcgat tctcatcctt ttttgggtcc 300 tgattcataa gaatcttgat tgtaaattgg tagattttaa gccatgtttt attacaatag 360 ttttaatatg tttagttttt ttttggcatg actgaagatt tcatgtaaat attagaaatt 420 tggcatattt ttgaataaaa gtgtatattt tgtagatttt tgaaatggat tttgataatt 480 tgtataaaac tatggatgtt tcacatctat cagttgaaga agttgaacat gaattgctta 540 ttagaaacat tttgtatgat ttcaaaacgc acgaaagtgt caaaagaaga aagctcaaag 600 atcggatgaa agaagaaaaa gaaatgggag ttcaaccagc agctttggct cgtacttgga 660 gatcgacacc agatgagatc aaaataacca gtacagctat caaacttatt cataaaatgt 720 taaatgctaa gcatgacaaa gaagttagac gcagattaag aacacgaatg attcactatc 780 gtgtccgact tgagcactta gcgacttcag ctgatgctca caaatatgta acaaagatta 840 atgaaataca atttgaaatt gctgaaattt ttgagaattt tttcgttaaa gcttctgcaa 900 aagagagcca aagtgaagct gaaaagaaag cgataagaga taaaatttca gaggttttga 960 aaacagttca aaacgaattg gacgttttga atgaaactgc ttctgctcta gaaattgaca 1020 aggaaataga acaagatgaa agcaaagaaa ttcaagctaa agagggagga aatgataaac 1080 ttcaggatgt gattgcaaaa ctatccgatt atgaacaagg taaagtagca gatgttgatt 1140 cgctattaaa acttttcaaa gatttcgttg ttgacacagc tgttgacgaa aagcaaagac 1200 agaaagcgga aattgaacgt gagaaagcta ggaaacaaga aattgctgta aatattgaac 1260 aaaaagaaaa gctcgaggaa attctcctgc aggtaacaaa tgagctgcaa aagatggagg 1320 cgctgcagtc gaaagaccca gcgtcggaga aaaatcttaa aactgataaa aagaaaactc 1380 caaattcttc tgattctgac tctgaatctt ttgatgagga agaagaagaa agggggactc 1440 gcagactcaa gaataagaac agtaagagga agaagaaaat ttcttttagt ggtttatctt 1500 cgaccgaaat atcatcaagt tcgtcgagtt ctagttcgtc agatactagc tcgtcgtctt 1560 cgagttctga gagctcgtcc gactcagaag aggagagaaa gaagaagaaa aaaaggaaac 1620 gaaagaacaa gaagaataat aggaatcagc acatgaagcg tatgcctgtc gctgagtgga 1680 aactgaaata tgacggaaaa gatggtggaa aaattttaac agagtttttg aaagaaatca 1740 aaatgaggtg tagatcagag aaagtgtcag ataaagagat ttttcgttcc gcgatccacc 1800 ttttcacggg tcgtgccaaa gattggttca ttgatggtac cgaatgtagg gattttcgta 1860 actggaaaga gttgaaaaaa gaattgaagc gtgaatttcg tccgcacgat ttagattttc 1920 agatggaagt tcaagcaaca aaccgtaagc aagctcgcgg tgaaaaattt gcagattatt 1980 attatgaaat gaagaaaatt ttccaaaaca tgaatgagca gatctccgac aaaaagaaat 2040 ttcgtctaat ctggagaaac atgcgtcatg attacaaaaa tgctttaacc ggtgcaaaaa 2100 tcggttcgct taagcgactg aggaagctcg ggaaaattgt tgatgaaaat aattgtaatt 2160 tgtatcagaa accgaccgag tttgtgaata gaccgcgctc agcccagatt aatgaagttg 2220 tatcttcagg gaaagctaag tcgaagcaaa atccttatag ctcgggaaat caatcgaagt 2280 ctttcacgac tgggaaaaac aacaacagca aaccgaaagt agaaaatcaa ggaaaagaag 2340 gaaacgaaaa ggaacaaaag ggagaggaaa agcaggattg tatggaaggc actgcaaagt 2400 cgacgtttaa ccagcttgtt gcacagtaca aaagaccacc acttggggtg tgttataact 2460 gtcgcgagtc tggtcatcac tacgcagaat gtcctaaacc gaaagataaa ttttgtcgtt 2520 tgtgcggatt tgcagatgta agcacaacaa attgtccagc ttgtcaaaaa aacgcggaca 2580 gttcagcttg aggggggttg ctgaatccgt gaaaaacaca cctcccactc gtacaaggat 2640 agaagttgaa ctgaacagca gcggatttga acctgttaga agtaccgaat atgtcgaaga 2700 tcgagtgatc gatgaactct ttgttcaagt gcagggagat gagcgacctt ttgcgagagt 2760 aagtgtgatg gggagagaaa tgcttgggct tctggatagt ggagctcagt gctcagtgtt 2820 gggaaatgga tcggaaaact ttgtaaagtc gatgaaacta acactctatc cctcattggt 2880 gaaggttaag accgccggtg gaaattgcgt accagtcaag ggttatgttc atttgccgat 2940 cacgtttaac aacactacga aaattatcaa cacacttgtc gcacctgagt tgaaaagaaa 3000 gttaattttg ggttacggcg atttttggaa ggcgtttgaa gtccgtccgt cagtgcctat 3060 ggaggtatca attgaggaaa tggaatgtat gctcggagaa gaggagggga atagaaacga 3120 acttacggtg gaacagagaa aacaattaga agaagtaaaa aaactgttta aggtggctat 3180 cgagggggag tgtttagaaa cgacctcttt gatgtcccac aacatagaac tgaaagatga 3240 gtttaaacgt gcaccgcctg ttcgcattaa tccctatcct acttcaccgg agatgcaaag 3300 gaaaattaat gttgaaattg ataatcttct agcacaacag gtcattgaga ggagtaaaag 3360 cgattggtca ctgagcactg tgccagtggt gaagccaacc ggagaagttc gattgtgcct 3420 ggacgctcgt cgtttgaacg atcgtactgt aagggacgcc tatcccttgc cacaccaaga 3480 ccgtatacta agccgactgg gggcaagcaa gtacttatcg acaattgatt taactaaagc 3540 gttcttgcaa attccccttg accccagttc gcgcaagtac acggcctttt cggtgttggg 3600 aagaggattg ttccagttca ccaggttacc cttcggctta gtcaacagtc cggctagttt 3660 ggctcggctg atggacgaag ttttaggatt tggtgaactg gaaccgaatg tattcgtata 3720 tctcgatgac attgtggtcg taaacgatac attcgaggca caccttgaca gtctgcgtga 3780 agtagcgaaa agacttcaaa aggccaattt gtcgatcaac atcgataaat ccaaattttg 3840 cttatctgag ttaccttacc tcgggtacat tttgtcaccc gaagggttga gacccaatcc 3900 agaccgaatt tctgcaattg tgaattatga gagaccgcaa tcgattcgtg cgttgcgtcg 3960 ttttttaggc atggcaaatt actacagaag atttatttct aactttagtg aaaaaacggc 4020 tccgcttaca aatcttttac ggaaaaagcc caaatctatt gtctggaatg atcaagccga 4080 agaatcgttc aataaaatca aagaaagttt aatttcagca cctgtactga gcaatcctga 4140 tttctatttg caatttacaa tacagtcaga cgccagcgat acggcgatcg ccgccatttt 4200 gacgcagcaa catgaagagg gggagaaaat catagcatac ttttcgagaa aattgactcc 4260 tgccgagcaa tcttatgcgg cgacggagaa ggagggctta gcggtccttt cagcgatcga 4320 gaagttccgt ccttacatag aaggttcata ctttgtagtc attacggatg cttcggcatt 4380 gacttacatc aagaatggaa agtggaaaac ttcgtcccgt ttaagcaggt ggagcataga 4440 acttcaaggt ttgcattgcg agattagaca caggcggggg agggacaaca taatccccga 4500 cgcactgtcc agagcggttg aaatgatcga ggtcgaggag ttggttgatg attggtactc 4560 agggttgttt aggaaagttc aaaatagtcc agatgagcat ttagattata aaattgagag 4620 tgggaaactt tacaagttcg tgccgacgaa gagcgacgtt ttcgattatc ggtatgagtg 4680 gaagctttgt gtccccgaag ctctgcgctc ggacattttg aggagagaac acgacaattc 4740 ttttcatatc ggttatgaga aattgttaga aaaagtaaga caaaagtttt tctggccaaa 4800 catggcgtcc attatcaaga agtacgcaca aaattgcaaa tcatgtaaag aatttaaacc 4860 aactaatgtt tcacagcacc ctgaaatggg aaagcaaaga ttaacgacaa aaccttttca 4920 aattttggct ttagatttta ttcaatcctt gccaagaagt aagtctggta atcagcatct 4980 attagtgttg atggacctgt tctcaaagtt cacgattttg gtccccgtgc gaagaattgg 5040 aacggattta ataaccaaaa ttttacagga acagtggttt ttaaggtatt ccgtaccaga 5100 aatcataatt agcgataatg ctagctgttt tctgagtaaa gattttaaag cctttcttga 5160 ccgattcaac gtgcagcatt ggactaatgc acgccatcac agtcaagcaa atccggtcga 5220 aagactaaat aggagcataa atcaatgtat tagaacttat gtaaagaaca accaacgact 5280 gtgggataca agaatcgctg aagtggaatt tactatcaat aacaccaaac actcgtccac 5340 tgggtttagt ccgtaccgaa tcatttacgg tcacgagatc gtggcgagtg gggaagatca 5400 caaactagat gttgaaacaa aagaattaac tgaacaagag cgaattgaaa agaaattgaa 5460 gatagatcgt accatttacg ataccgtcta tagtaacctt caaaaagcac acgaaaagag 5520 taagaagaac tacaatttac gattccaaaa accagcgccc gtttacgaaa ttggtcaaaa 5580 agtttataaa cgaaatttta cactgtcctc ggcaggtgat gcatacaatg caaaacttgg 5640 cccagcatac gtgtcctgta cagttagagc gagacgaggt acgaactcgt acgagctcga 5700 agatgaagat ggcaaaaatt tgggagtttt ctcatccgcc gatctgaagc ccggtccgcc 5760 ggattcttga gcatgaattt agaatatttc aataaatttt agcatgtttg tagcgtaact 5820 tttaaggtgt ttgtagaggt tataattttg tccaagcatt atttattgcc attagtgttg 5880 tgaaaagtaa aaaaatcatt gtttgtgagt ttaaaaatgg ctgagtctcc atgttgaatt 5940 gagctctgca agaatataaa tttggaaagc cacagaaaaa caaagaaaac agactagata 6000 acagaatcga tgcgaattca tgaaatagcc caattttttt aaatttaaat ttacgcattt 6060 gtaggcgaat gcaagcataa ttaactcatg cggaataaaa ccggcaaagg ccatcaaaaa 6120 ttgattaatg caaaatttcc gatgaagcgt agaaaatttt agaactagtt cattttcgtt 6180 atgcaaacac tagccgaatc aaatgttatc gaatttcact aatcactttc aaaattacca 6240 tagttttttc caataaatca ctacgttttt aaaacacaat tttgccaatc acacaatccc 6300 aaccttccaa cccaatagat ggatcccatc tgattctaga gagttatctg cttaagtttg 6360 tagcaaagtt ttgaaaaaga gaggaactta tctcttccac tcaaacaact ttgagattca 6420 tggatgagag atgttcaaaa attgatgaga agattaatgg cgttattaaa catagacttg 6480 tttaatttga acaagattat ctatttggtt attttaacgt tttagtatta attcttttga 6540 aatgtcaaat ttgcgaaagg catagccaaa acccggatac aacagacttt aagtcaagat 6600 ggaatataaa ttgtcaacta aagttgtcaa tttattttag agggggtgta gggtag 6656 // ID Gypsy-26-I_NVi repbase; DNA; INV; 15180 BP. XX AC . XX DT 08-MAY-2009 (Rel. 14.05, Created) DT 08-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-26-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-15180 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 988-988 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 2313..6836 FT /product="Gypsy-26-I_NVi_2p" FT /translation="REYSKSRAKSNRNQGKQGAKSRTKTRQNRRNKKSKSR FT RKWREYSKSRAKSNRNQGKQESKSRTKARQNRRNNNLNQEENGENILNQGR FT XQVEIKENKELNQEQIRDKIGEIKNRNQGKNRHDISETSNRSQGRIQVEIG FT KNKNQNQEQKQVEIGEDKNLNQERIRDKISKNENQNREQNRELIREQNQAR FT IEGLNPEEKLNQERNHGTAKNLNRSEDEIGYIKTILEQNRSKKLNELEKTG FT RENSTEKESALKKLEAILRDAERVIKKEVQVEEIIFNMHTKAISVILSLNS FT LTGYSSEGTRIALEKILDAIRLGTWREAKFHGILEKAHEQLSKTRRDLQNE FT RVEACVYCQPVPTINNDRIKKVSEQLRLEHISEGEKTRVRRLVEEFHDIFH FT VEGEKLSKTPLLKLTIPTVDEVPVSTKPYRRSPEEREEQERQIAELLKDGI FT IEKSTSSFCSPCFLVPKKPDSKGVVKFRLVIDYSKVNAKTLNDQYPMQNIL FT DILDQLGGAKYFSVFDLKSGYHQIEIAEEYRHKTAFATQSGLYHFTRGSFG FT LRTMPATFSRALAIALAGLTGSELFIYMDDIIVHSDSFEQHLERIRHLFER FT LREHVLKLQADKCEFLKKEVAYLGHIISEAGVKPDPRKIEAVKEYPAPRNK FT KNIKQFLGFVGYYRRFIDKFANIAVPLTEMLKNDAEFVWTEKAQKAFEILR FT DKLCEEPILTFPDFNKPFTLVTDSSGYALGAVLLNGEPGKEHPVAYMSRTL FT TEAERKWDTYNKEANALVSAIKHFRPYLFGRRFDLITDNIALQWLRTHRDP FT NSRVNRWRLLLAEFEFNIIYKPGKTNIADALSRNPVEKAINIVAAEDINQS FT QSDNSADDAAGGKPARGKDTAPTHTMQTRGRGGRLQGTKYKEAIAALAPRK FT RRPKENRSKERVEKDISGESQEESPEIKQTTARGQNDDEQLGSEELGIETT FT GTRQDSNEGEVQSESDIRETQQGTDESESGDPGEGAEPLNTTKNGSQRDTQ FT NSEEAITCREQMFMRKDNFMYFISTKGEPCDEGARQLKIMGKLPMFARLQV FT GIPASIRVNNKTHIALPLRDEARIGPTLTRHNIEILCTNLTPFLKQADLKS FT ISIAKTERIEELEWRDAIKPLKDSFARSHTKLIICLGLIKYPLAEDRETII FT KAAHASIVGGHRGVTKTYKRIKQNYYWENLKEEIQAFVRKCLDCQLKKLVR FT IKTKLPMVITDTPTVSMEKLAMDVVGPLPPTDSGNEYVLTFQDNLTKFAIA FT EPLGDITAATVANVLIRKVICVFGAPRVILTDQGTNFLSKLMKRVAKRFRI FT KQVKTTAFHPQSNGSLERSHSSLMEYVKFFVQKNKRDWDEYIDLATFNFNT FT SVSEATWHTPFELVFGRLARTPEEGELEEEDLLPTFNGYMRELVTRLNSLQ FT ATARENSIKAKERAKQYYDRRVNIKTLQIGNHVWLLSGPKPHKFEDQYKGP FT YTVLDVSPNGNVKIQISATKEKIVHANRLRMSHVENKNDKAK*" FT CDS 11105..12601 FT /product="Gypsy-26-I_NVi_3p" FT /translation="MNQMYLMQDKILQLNKMAITFESIKGPYDIRTHKHGG FT KDFELLEGEGINNLILLEDMRRNRKEELSQPADRKTVIENELNEEINTLER FT KKHTLMTRLGLNEEDETNTVEXEQEPSIDETIQVLNENLELASKLITIMQI FT KEYGLSTRTMRNALARLQRTEGQALATPFESINQGTIREYGTVEMYLLQRQ FT LIIQVNIPLVTEQVYRTYELVSIPVFDPLELPRHTAIKIVPRGDYLVINPD FT NNEHFFMSKEERGECLKVHRVRICKANRELKRGMDCESTLKADPESYRAHN FT LCKRTLIHGPEYYLIRTQNPARWIFSTAGRVNGTGMCQEEKQSVTLEGSGE FT VKIPEGCHLRIADSIFYGGADQDEREIFIPHVHLTAASTLWADKRPEAIPQ FT VVRTGTTGVAGNTTITIMTVIIGIFGTMVSLLIVVGIRSLKGRNRQEEPEA FT IYVPMRGLDPTIRCEFHGPHARAEKVYDVPRVPAMQVRENALSAIIEERTE FT SG*" FT CDS join(69..989,955..1716,1602..2273) FT /product="Gypsy-26-I_NVi_1p" FT /translation="MALGGPNKVDEALRILEVNAKALEPDVQRDDWDIKKY FT ATYFRELKXKCSTKKPQSTMIQTRAAICDALEAELDREEIEKTVNCSTLVN FT VVGDFIAARVNPTSLIKFLAGDDEEDLTYIPELILRAEITEDYQTLGLPSN FT TMSFYENLNKTGHSSNKQINPDAVEVKKFQFSTPRINTFRPRENLLQSSAD FT AYNNASVSVLHSKAAEKVMRDLPRFDGKNIPVHVFVLRLQQAANALTTEGE FT LELAKVIYSKLDGDAYNSTINKTFRSVREISKHLESTFGSGKSTDELLGEL FT AKIRQKKGRKSRYLFGKKKGEKVVTYSNRIRILDVNIKAAANMERKATPSF FT NAELSRKLIYYFKKGLDWPISTRLGEFNDVNEAIIHAIEIERDFACYQENQ FT DQKNNTAQICVVNPQAIEKCQICSNKGHTAQKCWYRNQIPNNETVRSPVRP FT QTNPREGPNNSAMQDARQTPWRSQPPPRTTPQNQGEQYVHRQRVTFPNENA FT RAENKPTCNYCKKEGHFIAECRKREYNNKMRETQGNEAGQSYQGAGRTTES FT RPRPTQGFNRREQQQQDARNAGKRSGSIVSGCGTNDRKQAAPDAGIQPPRA FT TISISEVKLSLCAKSPSIKLINPLNGREIVLILDSGAGPNLLKESECIEGV FT KINENAIIKLRGITAGVTQTLGTITLYFAHFKIEFHLVNNDFPIKENGLLG FT CDFFAQSGASIDYKNNCLKIGEHMIAFTNEVREEITEIELSEKTETEHKET FT ERSALITKNSNQITLGGIKNLNQGENRENIGSX*" XX SQ Sequence 15180 BP; 5595 A; 3148 C; 3450 G; 2954 T; 33 other; gaaattgggg gccccttccg gtcttggttt ttactattag aagcaagcgc gattagatca 60 acggaatcat ggcgttaggc gggccgaata aagtagacga agcattacga attttagaag 120 taaacgcgaa ggcactagaa cctgacgtac agagggacga ttgggacatc aagaaatacg 180 caacatactt tcgcgaacta aaartaaagt gttcaaccaa aaaaccgcag tcaacgatga 240 tacaaacgcg cgccgcgata tgcgacgctt tagaagcaga actcgacagg gaggaaatcg 300 aaaaaacagt caattgcagc accttagtta acgtcgtagg ggatttcata gcagctcgcg 360 taaatccgac gtcgctaatt aaattcctgg cgggcgacga cgaggaagac ttaacgtata 420 taccggaatt aattctcaga gcagaaatta cggaggatta tcaaacatta ggccttccaa 480 gcaatactat gtctttctac gagaatttaa acaaaacggg acattcatcg aataaacaaa 540 taaatccgga cgcagttgaa gtaaaaaaat ttcagttttc gacaccgcga ataaacacct 600 tcagaccgag agaaaactta ctacaatcga gcgcagacgc atataacaat gcgtcggtaa 660 gcgtcttaca tagtaaagcg gcagaaaaag tgatgcgaga tttaccgagg tttgacggaa 720 aaaacatacc wgtgcacgta ttcgtgttgc gcttacaaca agccgcgaay gcgcttacca 780 cggaaggaga actcgaacty gcaaaagtaa tttactcaaa attagacggc gacgcataca 840 actcaacaat taataaaaca ttcagatcag taagagaaat tagcaaacac ctcgaatcaa 900 cattcggwtc gggaaaatcr acggacgaac tactcggaga actcgcgaaa ataaggcaaa 960 aaaaagggag aaaaagtcgt tacttattct aatagaatcc gaattttgga cgtgaacatc 1020 aaggcggcag cgaatatgga aagaaaagcg acgccaagtt ttaacgcgga acttagtcga 1080 aaattaatat attatttcaa gaaaggacta gattggccaa taagcacgcg gcttggcgaa 1140 tttaatgacg taaacgaagc aattatccat gcaatagaaa tagagcgtga tttcgcatgc 1200 tatcaagaaa atcaggatca aaaaaataat acggctcaaa tttgcgtagt aaaccctcaa 1260 gctatagaaa aatgccaaat ctgtagtaat aaaggccata cagcacaaaa atgttggtac 1320 agaaatcaga ttccgaataa cgaaaccgta agatcaccag tccgaccgca aacgaacccg 1380 agggaggggc ctaacaatag cgcgatgcag gatgcgagac agacgccatg gcgaagccaa 1440 cccccgccgc gaactacgcc ccaaaaccaa ggggagcaat acgttcacag acaaagagta 1500 actttcccga acgaaaatgc gcgcgccgag aacaaaccca catgcaatta ctgcaagaaa 1560 gagggccact tcatagcaga atgtagaaaa agagaatata acaacaagat gcgcgaaacg 1620 cagggaaacg aagcgggtca atcgtatcag ggtgcgggac gaacgaccga aagcaggccg 1680 cgcccgacgc agggattcaa ccgccgcgag caacaataag tatcagcgaa gtaaaactca 1740 gcttgtgcgc gaaatcacct tcaatcaaat taataaatcc actaaacggc agagagatcg 1800 tgttaatatt agactcagga gcaggaccga acttattaaa agaaagcgaa tgcatagaag 1860 gcgtcaaaat taatgaaaac gctataatta aattaagagg catcacagca ggagtaacgc 1920 agacgctcgg cacaatcaca ttatacttcg cgcattttaa aatagaattt catttggtga 1980 acaacgattt tccaataaaa gaaaacggac tgttggggtg cgactttttc gcacaatcgg 2040 gagcaagtat cgattacaaa aataattgct taaaaatcgg agaacatatg atcgcgttca 2100 ctaatgaggt aagggaagaa atcacggaaa tcgagttgag cgaaaaaacg gagaccgaac 2160 acaaagaaac ggaacggagc gcgctaataa caaaaaattc aaatcaaatt actctagggg 2220 ggataaaaaa tctaaatcaa ggagaaaata gagagaatat tggrtcaara taatattcta 2280 aatcaagggc gaaatcaagt cgaaatcgat agagagaata ttctaaatca agggcgaaat 2340 caaatcgaaa tcaaggaaaa caaggagcta aatcaagaac aaaaacgaga caaaatcggc 2400 gaaataaaaa atctaaatca agragaaaat ggagagaata ttctaaatca agggcgaaat 2460 caaatcgaaa tcaaggaaaa caagaatcta aatcaagaac aaaagcaaga caaaatcggc 2520 gtaataayaa tctaaatcaa gaagaaaatg gagagaatat tctaaatcaa gggcgaawtc 2580 aagtcgaaat caaggaaaac aaggagctaa atcaagaaca aatacgagac aaaatcggcg 2640 aaataaaaaa tcgaaatcaa ggaaaaaatc gacatgatat cagcgaaacc agtaatcgga 2700 gccaagggcg aattcaagtc gaaatcggaa aaaataaaaa tcaaaatcaa gaacaaaagc 2760 aagtcgaaat yggagaagat aaaaatctaa atcaagaacg aattcgggac aaaatcagca 2820 aaaacgaaaa tcaaaatcga gaacaaaatc gggagctaat ccgagaacaa aatcaggcga 2880 gaatcgaggg actaaatccg gaagaaaaac taaatcagga acgaaatcat ggcacagcca 2940 aaaatctaaa tcgaagtgag gacgaaatcg gatatataaa aacaattcta gaacaaaatc 3000 gaagtaaaaa attaaacgaa ctcgagaaga cggggagaga aaattcaaca gaaaaagaaa 3060 gcgcgcttaa aaaactggaa gcaatcctga gagacgcgga acgggtcata aaaaaagaag 3120 tgcaggtaga ggagattatt tttaatatgc acactaaagc aataagcgta atcctaagcc 3180 tcaacagcct aaccgggtac agctcggaag gtacaaggat agcactagaa aagatattgg 3240 acgcaattag gctgggcacc tggcgagagg cgaaatttca cggtatttta gaaaaggcgc 3300 acgagcaact gtcaaaaacg cggagggacc tgcagaacga acgggtagaa gcatgcgtat 3360 actgtcaacc cgtcccgacc ataaataacg acaggataaa gaaggtgtcc gagcaactac 3420 gcttggagca tatttccgaa ggagaaaaaa ccagggtacg gaggttagtg gaagaattcc 3480 acgacatatt tcatgtagaa ggggaaaaac tgtcaaaaac accgctactt aaattaacaa 3540 tacctacagt cgacgaggta ccggtatcga ctaagcctta tcgacgctca cccgaggaga 3600 gagaagaaca ggaaagacaa atagccgaac tactcaagga cggcattatc gaaaaatcca 3660 cgtcatcttt ttgcagcccg tgctttctag ttccaaaaaa gcctgactcc aaaggcgttg 3720 ttaaattcag actagttatc gattacagta aagtcaatgc taaaacactg aatgatcaat 3780 atccgatgca aaatatactt gacatcctcg atcaactcgg gggagccaaa tatttttcgg 3840 ttttcgattt aaagtcgggt tatcaycaaa tagaaatagc agaagagtac aggcacaaaa 3900 cggcatttgc tacgcagagc ggactatacc attttacgcg cggcagtttc ggtctacgca 3960 caatgccggc gacattttcg cgcgcgctcg caatagcact ggcaggcctt acaggtagcg 4020 aactcttcat ttacatggac gacataatag tacattcgga ttcatttgag caacatttag 4080 aacgaatcag gcacttgttc gaaagattaa gagaacacgt attaaaatta caggcagata 4140 aatgcgaatt tttaaagaaa gaggtagcgt atctcggaca cataatctcg gaagcgggtg 4200 taaaaccgga ccctaggaaa atagaagctg taaaagaata tcccgctcct cgcaataaaa 4260 agaatataaa acagttcttg ggctttgtgg gctactacag gaggttcatt gacaaattcg 4320 caaatatagc agtaccgtta acagaaatgc tgaaaaacga cgcagaattt gtatggaccg 4380 aaaaagcaca gaaggcgttc gaaattctgc gcgataaact gtgcgaagaa cctattttaa 4440 cattcccgga ttttaataaa cctttcacgc tagtaactga ctcatcgggg tatgcactcg 4500 gagcagtgtt gctaaacgga gaaccgggaa aagaacaccc ggtcgcatat atgtcacgca 4560 cattgacaga ggcggaaagg aaatgggaca catataataa agaagcgaac gcgctagtat 4620 cggcaatcaa acatttccgg ccgtacctat tcgggagacg tttcgacttg ataacggata 4680 atatagcgct acaatggctg cgcactcata gggacccgaa ttcgcgagtt aacagatgga 4740 gattattact cgcggaattc gaatttaaca taatctataa accaggaaaa acgaacatag 4800 ctgacgcgct ttcaagaaac ccggtagaga aggctataaa tatagtggcg gcggaagata 4860 taaaccagtc acagagtgac aattcggcgg acgacgcggc aggaggcaaa ccagcccgcg 4920 gtaaagacac cgcgccaacg cacacaatgc agacgcgcgg tcgaggcgga agactacagg 4980 ggacaaaata taaagaagcc atagcagccc tagcaccaag aaaacgcagg ccaaaagaaa 5040 accgctcaaa agaacgggta gaaaaagata tttcaggaga aagtcaggaa gaaagcccgg 5100 agataaagca aaccaccgcg aggggtcaga acgatgacga gcagctggga tcggaggaac 5160 tcggaatcga gacaacgggg acgcgacagg actcaaacga gggggaggta caatcggaga 5220 gcgacattag ggaaacgcaa caggggaccg atgagagtga aagcggcgac ccgggagaag 5280 gggcggaacc cctgaacacg actaaaaatg gaagccaacg ggatactcaa aactcggagg 5340 aagccattac gtgccgagaa caaatgttta tgcggaaaga caacttcatg tactttatct 5400 caacaaaggg agagccttgc gatgaagggg caagacaatt gaaaattatg ggaaaattac 5460 cgatgttcgc acgattacag gtcgggattc cagcgtcgat acgcgtaaat aataaaacgc 5520 atatagcgct tccactacga gatgaggcgc gaatcgggcc gaccctgaca cgacacaaca 5580 tcgagatttt gtgcaccaat ctaacaccat ttctaaaaca agcagactta aaatcaataa 5640 gtatagcaaa aaccgaaaga atcgaagaat tggaatggcg cgacgcgata aaaccgttaa 5700 aagactcatt cgctcgctcg cacactaaac tgataatttg tttgggactt ataaaatatc 5760 cattggcgga ggatcgcgaa acgataataa aagcggcaca cgcgtcaata gtcgggggac 5820 accgcggcgt gacaaaaact tataaaagaa ttaaacaaaa ctattattgg gaaaatttaa 5880 aggaagaaat acaagctttc gtgcgaaaat gtttagactg tcaattaaag aaattagtta 5940 gaataaaaac aaaacttcct atggttatca cggacacacc aacagtaagc atggaaaaat 6000 tagccatgga cgttgtaggc cctctacctc ctacggatag tggcaacgag tacgtactaa 6060 cgtttcagga taatctgaca aaattcgcga tagcagaacc actgggagac attacggcag 6120 caacggtcgc gaacgtacta atacggaagg tcatttgygt ttttggagca ccgagagtaa 6180 tattaacgga ccaaggaaca aattttctta gcaaactaat gaaaagagta gcaaaacgat 6240 ttagaattaa acaagtaaaa acgacagcgt ttcacccgca gtcgaacggg tcactcgaac 6300 gctcgcattc cagcttgatg gaatacgtga aatttttcgt gcaaaagaac aaaagagact 6360 gggacgagta tatagactta gcgacattta attttaacac cagtgttagt gaggccactt 6420 ggcacacgcc gttcgagctc gtattcggaa gattagcgcg cacgccggaa gaaggcgaac 6480 tcgaagagga agatttacta ccgaccttta atggttatat gagggagctg gtgactcggc 6540 ttaatagtct ccaagcaacg gcgagagaaa actcgattaa agcgaaggaa agggctaaac 6600 aatattatga cagacgagta aacattaaaa cgctgcaaat cggaaatcac gtgtggctat 6660 taagcgggcc gaaaccgcat aaattcgagg atcaatataa aggaccgtat acagtgctcg 6720 acgtgagccc caacgggaat gtaaaaatac aaataagcgc gactaaagaa aaaatcgtac 6780 acgcgaacag gctacgcatg tcgcatgtag aaaataaaaa tgataaggct aagtagaatt 6840 agaaccacga catagtcgta tgcacctcgc aagtaatgca tggaaaagca cagtaaaatt 6900 catccacatt cagcgcaaac cataacaaaa tcaaacacgc tatacacgcg ccataatggc 6960 ggacgcgcat acgcttactt aagaaaatat tcgactacgg aagaacacaa gcttcaaaat 7020 ttttctttgc tgccaactta cacacaraca gacaaaaaac aacaaacaaa atgaacacag 7080 aaaccagcaa gccgttcctg caaacaacaa gcatgcatgc aattcgtttt acgtaaatgt 7140 aataaaaaaa aacaagcatg tcgtttaatg aaccaattac aatgattaca gatgataaac 7200 atttggattt tgtggctaac agcaatttct caacttttca tcgggggtgc cgcgataata 7260 gcatacgact gttcgcagca agtctctaat ttaacaacct tctcgctaat taacgtcgga 7320 gaatgcgata tagaaatgcc ggtagtaaac acaacgcgag tattcgtaaa actattgcaa 7380 ctaaatgaat ttacgaacac gcacgtaaga ttctgtaaag tagaaataaa gagaacggca 7440 agaacctgcg gcgcattttc gcactcaagc gacgtgttta gaggcgaggt caagttcctc 7500 gacgagataa cgggagaaga gtgcaaaaca atgcataggc atcgaacaca caggtttcgc 7560 gatgagatga ttactaactt aaaaataaac gcgactacgg tgcacccgat gaccttcgcg 7620 ggaaaaatag atcacgcagg gaattgcgaa caagcaggca cgtacatgga cccatacggg 7680 gtatacgaca aagtagtagt ttcaggatay ctcaaaataa cgttaaaaga atacgaagca 7740 tcagttaatt taaatacgaa cgaaatagtt ttgagatcag gaacgaggtg cacacttacg 7800 caagaccaat gtatagactc ggaagacgga tacgcgtttt gggacccggt accnccaacg 7860 acatgcgagt tcaataaata cgcggagtta tacgcaggcg aagtaaataa agttacggag 7920 gaaggaaacg cgcaaagacc ggtgtatatt atggaatcat cgcaagtcac gttcgcgttt 7980 acagcggtcg gcacagtaaa cgtatgcgga tacgaccttg tacgtacaga acacccaaaa 8040 ttattatttt tagaaggtct gcaaacagaa tactttcgaa ctgtaaaagc tccgtcagta 8100 aataatttag acatattcgc gtatgttaat tcgaaaaaat aaaaaaaaga gcggctacat 8160 agggttagta gcgggcgaag tagtacactt aatgaaatgc gtgcaagtgg aagtacaccg 8220 gagggaagaa agcacttgct atcaggaact acctgtaatg aaagacgaca aaaaatactt 8280 tttaactccg cgaacgcaca ttcttaaaac tttcgggaca gaaactacat gtaatccatt 8340 aattccgcct atgtttcaac tagagaatga atggtacacg gtgctaccag ggtcggtaac 8400 aaagggaata acaccaaaca cgctaaaacc aaaaacaaaa ccgacatgga attacacgga 8460 ccccgcgaac ctcgcgaaaa gcggaatata cacgccagcg caaatacaag cactgcgaca 8520 tcacctttga ctcattgcac tctttgcaac agttttacga tcacttaatg ttcccggtcg 8580 aaaagccagc agtgcttaat tcaatcgcga tgagcatgtc gggcagaagc tctagaatgg 8640 acggactatc agtcacgaga ctgttcgatg agaacgctat ggcgagactc gcggaatcaa 8700 catggggtag aatatgggga aagttttttg acgttcggct cggcttcggc tgggataatt 8760 ggcttgttta tgatagcgag actcattaaa ttcgtggctg acacaataat tcacggttat 8820 gctctacatt ctatatacgg attctcaatt tacattttag gaagtttctg gaactcggta 8880 acacaattac tgttgcattt gggatcgcga aaggagaaat cagaaaaaca aggcgaagag 8940 gggaaaaagg acccggaacc tcagccgcag ccgaacgaat accaaacaca ggccccgccc 9000 cctccgaagg acgacggcat cgcttccgca gcgattttaa ccgagggggg agagcaacca 9060 ttacggtatc aagcggtagc gaaccaatta tacccggtgc taccaaacgc caacggtgta 9120 tttccgccgc gaaagagttt taaacgctca ttcgagacag aagtctaatt aaacaaaaaa 9180 ttatcgtctc aacaacgctg ttataattgt accggtacgc gtattgtagt atgtatgcac 9240 ttgacgacaa agcaaagaaa acattagtaa atcaattaat cgaagtcaga aaagaaatta 9300 cgtggtcagc acaaagacgc ggtgagtcgg cgaacgaata cggmgcacga gtagcggaac 9360 tcctcctcgc ctacaatagt aagggcaatt tagcaaaagc agaagtattc gaaatcaccg 9420 attcacccga agatagggaa agattggcta tttattcgtt cttgaaggga atccgtgggg 9480 aagttaagaa atatatgtgc gaactcgcgg tttataaaga cttaacacac gcgagaagac 9540 acgcacaaca gatcgcgcgt cgcctggaag atattaaaca aatcaaaaaa actatgatga 9600 cctttgtcta gaaatagcaa ayagaaaaca aactaagcaa ggacgaattg aaccttcgag 9660 aacaataaaa agccaatgga ctactactca gtctgtaacg cgttagaagg ctkgtactcg 9720 ttccaaacta aagcgggagt acacggatac tacgctaacg atcacaaaaa actatattgt 9780 caactctgtt atcatcagct tgtacaagac actcggggcg attacgtccg atttctagac 9840 acgcatatac tctcaaagat cacgttaccg atttcttgca acatttgcgg ggcgacactg 9900 cttgaggtag cgccagctcg cgagtgcgaa aaatgtcaca agatttatac cgggcaggca 9960 gcaatcagga ggccgaccgc ctcacatttc gtgatcggtt tcgtaaaact gaatccgaac 10020 taattagaaa ttacgaacgt gttagctacc aatacggatt aaattttctc tgcgaacggt 10080 gctattcgga aacggcattc gaattacgaa atttagcttc acttcacatc actcggacgg 10140 tggaygaact ccggaacgaa atcccgccca tctggtgtgg atactgcaag aaaatcatcg 10200 gattcgatag atgccattta gatcaggcga tcaaaaatag acggcgaaaa ttctcaatag 10260 tgggcagagt gaaagaaaag gttttagcca cgcatatcgg agtcaggtgg gagtcaatct 10320 ccgaaagaac aatagataag ataaacagca cagcagtgca agaagccaar cgattacggg 10380 aaaaaagagt aaacgagtac ggaaagctaa tagctgaaca gcgggcgcag gcagccgacc 10440 aggtacgcgc cggaacaata atcccggccg aggcgcaggt agtacggtca ctgaccctag 10500 accctcaawa ctacggagac tgtaaaaatt atacgatcac taccgaaggg ctcacgaacg 10560 acccactctc gggaacagca cacgcaaggg gaaatgaatc ggacggagta atcgagatca 10620 aggaggaagt cgaagaaaat atagatcact acatgataac gcgcagcgac ggcgctatag 10680 gaatttgttt tgtagacaag aactcggaaa acgcgacgcg taacgatgaa aatagttaga 10740 gactakatga catattcaat caaaaccact ataatctgac acaatcaaag atcwtatgat 10800 gaccagcgga attaaataag aacctcgcat tacagatgaa gtggagcaca tatttggggg 10860 caggactcct cctaaccctg ggaacagtaa tggcgggaac caacatcaaa atatccggga 10920 ctaaaaacaa ccccggactt atgtaccaac gaaacgacga tatacgcctc tcacctcggg 10980 agaatggaag ttcgtaaccg gaatcgacac gacccctgtt ttcgacagcg caggattatt 11040 ccaacgcgca gcgatattcc tgacacaaac agaagcatat cgaaacgcat ccaacggagt 11100 attaatgaat cagatgtacc taatgcagga yaaaatctta caactaaata aaatggctat 11160 tacgttcgaa tcaataaaag gaccgtacga tattcggacg cacaaacacg gcggcaagga 11220 cttcgaactg ttggaaggcg aaggaatcaa taacctaata ctactcgagg atatgcggcg 11280 aaacaggaaa gaggaattgt ctcaaccggc cgaccgcaag acagtcatag aaaatgaact 11340 gaatgaagaa ataaatactc tagaacgcaa aaaacacact ctgatgacta ggcttggatt 11400 gaacgaagag gatgaaacta acacagtgga argagagcaa gagccatcca ttgacgaaac 11460 tatccaagtc ctcaacgaaa acttagaatt ggcaagcaaa ttgatcacca tcatgcagat 11520 taaggagtat gggctgtcga cgagaacaat gagaaacgcg ctggctcgac tccaacgaac 11580 rgagggacag gcactggcga caccgtttga atcaataaac caagggacaa tacgcgagta 11640 cggcactgtg gagatgtacc tgctacagag gcagctaata atccaggtaa acattccgct 11700 agtaaccgaa caggtataca ggacctacga acttgtgtca attcctgttt tcgacccact 11760 cgaactacct cgacacaccg caataaaaat cgtcccaagg ggcgattatc tagtaataaa 11820 cccggataac aacgaacact tcttcatgag caaggaggaa agaggagagt gtttgaaagt 11880 acaccgcgtt cgaatatgta aggccaaccg agaactaaaa agggggatgg attgcgagag 11940 cactctgaaa gcggatccgg aatcatacag ggcccacaac ttgtgcaaac gaaccttgat 12000 acacgggccc gagtattact taatcagaac acaaaaccct gcacgatgga tcttttcaac 12060 agccggaaga gtgaacggga ccgggatgtg ccaggaagaa aaacagagcg tcaccttaga 12120 aggaagcggc gaggtaaaaa tacccgaagg atgtcatcta agaatagcgg acagcatttt 12180 ttatggggga gcggaccaag acgaacgcga aatcttcatc ccgcacgtac atctcaccgc 12240 ggcaagcacc ctatgggccg acaaacgccc ggaggcaatt ccgcaggtag tacgcaccgg 12300 cacgacgggg gtcgcaggaa acacgaccat tactattatg acggtaataa tcggtatatt 12360 cggaactatg gtgagtctct taatcgtcgt agggatccga agcctaaaag ggcggaatag 12420 acaagaagag ccggaggcaa tttacgtgcc tatgagagga ttagacccaa caatcaggtg 12480 cgaattccac ggaccccacg cgagggctga aaaagtttat gatgtaccga gagtgccagc 12540 aatgcaggta cgagaaaacg cactgtcagc aataatcgag gagagaacag agtctggata 12600 aacraayaag agaaaaacaa acaaccacga gtcggcaggt agagagaatg caaccaacta 12660 gagtctaaac accaatacgc gactacccgc aaagaaagta agcatttgaa aatgcatatc 12720 atatcaaata aataaaacct ataatcaagt gtggacatta tattcatgtg tgaaattata 12780 taatccttgt gtgaacaaac ttataactaa tctcagcata gacagggagc mactctctaa 12840 ctatgcatat atatatatat atttaaccaa aacaaaacat atttaatcac gtgtgcgacc 12900 atataatcaa tgagtgtgaa cataaatgaa aagaaacaaa aagattttga ttgcaaagtt 12960 aattttttta ttaatgctga caaaaaaact agttatagag atcttaggta cacataggat 13020 aaaactagct gacgtaaatt aattcatcta caaataactt aggttgtaaa atttacatga 13080 caaagattct tctaacttac taggaattaa gactaaaact atgtagaagt acgaactatt 13140 cgattatatg tgcaaggaaa gctttaggat aaatcaagag gaatgtacgt tttaagacat 13200 attaggctaa aactatactt aagaatttgt gtaaatcaag cgactaggag acgggcaaag 13260 ataaacaaaa taacatccgc actcatccac ttcgcgcgtc gcctactcgc ttgagatata 13320 aaataagcca aacaaaaaat tataactcta gaaatattaa tactcgtata agaatatagt 13380 tagtaaaaat atgtaaaccg gcagcaagaa aagcggagac gagactcctc gactcgagaa 13440 ggctcgtact gtttcgcttg cccaagggar acgtaaaatc gatgtatgtg acaaaaaaaa 13500 taaataaaga aaagggcact tgcccaccaa aaaaaaacga ataataaccg gaacatcatt 13560 tatcggaccg ggtgtatccc taaaggacag tcgagtttgg ggaggagggg atacggacga 13620 caaaaccaac agacgagaaa attccttttc tatacgtata tatttattta tttatttgtt 13680 tatttataca tcaacgggca ggagtaccct tacgaaaata taaaaaagca cgcagtacaa 13740 aatacattat gatcacacgc agaacgcaat tacaaaaatc tcaaaacaag tcggcgtcgc 13800 tcgtcgctgg acagcacttg tgaccgggaa tgcggatcca attctgtaaa caaaccaaga 13860 aatttttttt tattaaacac aatcatcgaa acagctgaca caaaatagag aaatacttac 13920 gcccttgccc cggagacgca tcatgtgggg tggatgatca aggcggcaac ggcagatcga 13980 gcagatcctc caggccgcgt gatcctccca ggactcgtgg tgacattcaa tccaggagtc 14040 ggggaaggag tcccgaacaa caggggaccg ggaatcgaag acgaactgca aaagaaaaat 14100 aagattaaac ataccaaacg acaatgagag ggcctttata tatatatata tatatataca 14160 tatatataca aaaaaacaca ctacatctta ccctgtcata atcacggtag tactcccgct 14220 cgatggtcgg aggagtgtgg cgcggctgac ggcataaggg gcaaaaagac ccgatgtcaa 14280 tccaccaggg agtctgggta tggcaaggta gagtgaacaa aggggactct tcctccggga 14340 agtcctcaaa atcagaagcc ggagagggcg gtggcgtaac agggacctga cggtcggcct 14400 acaaaaaaaa aaacaaagag gaggaattaa gacccaaaaa gggaacatta cacattgggt 14460 aatttaaaca aaaacaaaac ttacccaacc tacaggcttc gatgggaccc aggcaggacg 14520 atgccaacta aggacacgaa ccactggatc aaagatcgga gccagagtag agatcacggc 14580 ccgcgcaaac cgaaccagga caccagcaca aagcactact gagacgaaca tctcgaacag 14640 gtaggacgat cacggaggag cagaaggcga gaaactgagt gcggaatgag cagtccgggg 14700 tccttttaaa gggacacagg taggcgaacc ycgagaagga ccgagggggc gaagaaaagt 14760 gagaaaccaa tcgggcgaca aattttccga gtagacccaa aggaagggac tcacccccgc 14820 aaagagaacc aattagagga aaacgccacg gtcaagaagt gagaacaagg aggtacgccc 14880 accgcaacga ccaaaaatcg gagagggaga gaccccgcga caacgggacc gaccgcgtag 14940 gacaaccatc aacacgccga acggacctcg gacaccaaaa tcaagattaa cacgactcaa 15000 gaatcacaca ttgtaaacaa aaaaaagggg ggaaccaaaa acacggcgaa cgacgatttt 15060 gtataaatat atgaacttac ctcctcccgc tctctatcct taaatcttct catcccaatt 15120 cagcaaaaat ttaaaattca gcgtcctacg ctgaacaaca atcagggggg ggggggggag 15180 // ID Gypsy-158_AA-I repbase; DNA; INV; 5357 BP. XX AC AAGE02017317; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-158_AA_; KW Gypsy-158_AA-LTR; Gypsy-158_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5357 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017317; Positions 55457 50101. XX CC 'AGAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1223..2929 FT /product="Gypsy-158_AA-I_1p" FT /translation="MALQSRIKPFDVSIETSRLPTEWETWKLDLESFFLAH FT GIDKQSDKRAQLAYLGGPGLQELLRHLPGINQVPHVTIDPPYYDVAIKCLD FT QYFEPFRRKTYERHLFHQIVQQPGERFTDFVMRMRKQIARCSYDPSVVDEL FT IADRIAQGCASEELRTKLLQKDRTLDEVLTLGTSLAESHQQSKKLGRPITR FT HEPEVYAVSNRPFRNDQQPMKRQFGFQPRYQPKYNQTNRFSCYGCGRRGHI FT HGSSECPAKHTNCAACGKVGHWAKRCYSSGGVKRKMNRPVSYPKAKRIRAV FT AEEIEKEQSKDYVFYAMGGNVFTFKVGGIQIPMTIDSGSDANIITKKVWEE FT LKEAKVEAMEMTTQADRSLVGYASKQPMQIAGTFNAQIEAGENKTVAKFYV FT VEDGQRCLLGDRTAKELNVLKVGFDINAVGTNKNKPFPKIRGVVVELPIDP FT DVQPVQQAYRRPPIAWEAKIEAKLKTLLDLDIIEPVSGPSPWVSPVVPVMK FT DSGEVRLCIDMRRANQAVLRESHPLPLVDELLGSVCGAVRFSKIDIKDAYH FT QVEISERSRPITTFITKQGLFR" FT CDS 3066..5333 FT /product="Gypsy-158_AA-I_2p" FT /translation="MFGVSCAPELFQKVMESVVAGLDGVIVYLDDAVVYGT FT TVDEHDRRLAAVLERFAEYEILLNEKKCVYNTDTLEFLGHQLSVEGVKPTE FT SRILAIQSFREPHNLSELRSFLGLVCYVGRFVPNLASLTDPLRQLLRSKDQ FT FEWTLKHNSAFQRIKDEISQIHHLGFFNPKDQTKLMTDASPTGLGAVLMQV FT DCKGRCRVIAYASKALTELERKYFQTEREALSLVWGVEKFRLYLLGIRFTL FT VTDCKALKFLFNPRSRPCARIERWVLRLQGYTFKVEHVPGSENIADALSRL FT SMPVNETFDEPTEGYIRNIVAESIPMAVTFEKVIEDSKNDASIQKAIEALQ FT EGIRENMPKEFKPYVDELCSVDGVLLRGNRLVIPTQLQERIVTLAHEAHPG FT MAAMKRRLRQKVWWPQMDKQVESFVKRCKECTIVSCLGAPEPLKRTRMPEK FT PWKDIAVDFMGPLPSGHSLFVIVDYFSRFTEAVVMRTITARRTIEALHETF FT SRFGVPESIRSDNGPQFISEEFKTYCEEYGITLLRTTPYWPQANGEVERAN FT KTILKHLKISQEIGSDDWMWDLRSFLLMYNSTPHASTGVAPSTLMFGRILR FT DKLPAFGDVSLRMDQEAIRDRDWERKLKDAEYSNERRQAKHSDLKEGDVVL FT CKRMVKENKLSSTFAPEEYEVIRLEGSDASLRSINSGRKVHRNVAHLRRLT FT SDKPDERKQTEEPPPTGMNENKSINDDNVAGESNTTRPKRLHRIPAYLNEY FT ETDAV" XX SQ Sequence 5357 BP; 1716 A; 952 C; 1315 G; 1374 T; 0 other; tatttggcga cgaagatggg ataaaacttg atttcaaaag tgcgaatttc aactaaatcg 60 ggattcccga gatttttttg tgaatcgaac caagacgttt ttttggaaaa aattctgtgt 120 cagtagacgc gagtgaacgc cattttgatt tgcggtgaag aacggttgga tgagcgacaa 180 cgagttgggc ctgttgtgcc taagctgtga gcgtaagtga gataaaaaga gaaacgaata 240 aaatgaaggg gaatgctgtg atgtgctttc ccagtttgtc tgtttaatga gaaaaaaatc 300 gcatggcaga aaggaatgga gaatggatgt gaaaaaaata tgcatgtgtt tagtgttcta 360 ggttgtggta ccagaaattt gtatacatta tcagtgcgag tacgaaacac gtaagaaaga 420 gagagcaata taccttattg tactgtgaaa agagatgtgt tcttatgaaa aaatggggaa 480 tttgcgatgt ttttcttcgt tgacctgaag atacaaagtg tttacgaaaa tggaagagag 540 cggcggtggt tttgtaccat ggaaaggata tgttccaata ggaaaaaata agagagagca 600 gctgaaaaaa aaacaaatgt gacaaagata cacaatattt atgaaaatga gagagagtgg 660 tggtggtttt gtaccatgga aagagtgtgt ttcaaaataa aaaaaaaatg cagagaaggc 720 ggctgaaaac aaaagtgaca aagtaatcat ggtatgtgca aggaatgaag ttgtgggaaa 780 aaaatgttga aaaatacaga gattactaca tgattgtggc tgtgtatctg cattgaatgc 840 tacatcaatt gggtgaaaac ggttttgtgc tgaagataga aaataataat tattgtagtg 900 gtgatttggt tttgtaccat tgaatgagtt gatttagaaa gagatggtta tgtaccgctg 960 gtgtcgttgt attatggttt tgtaccaaaa gaaagaaata atgatagatg tgtgccgaac 1020 aaatgattac gtaatggtta tgtaccaatt tgaagactgt tgcgtaaact ggtgatatgt 1080 tgacatgata acattttgaa taactgaatt ggtaagtgtt acttaactga tgtcggaatg 1140 aattaccgtt aaaaaaaaaa aaaaaaaaag atactggtaa ttcagcacag aatatgtaca 1200 atcgttgcac gctcaactat ggatggcttt gcaaagcaga atcaagcctt ttgatgtttc 1260 cattgaaacc agtcgtttac caaccgagtg ggagacctgg aaactagatt tggagtcatt 1320 cttcttggca catggcatag acaaacaatc ggacaagcga gcgcaactgg cataccttgg 1380 aggtcctggt ttacaggagt tacttcgcca tcttcctgga atcaatcagg ttccacacgt 1440 aacaatcgat ccaccatact atgatgtcgc gatcaaatgt cttgatcaat attttgagcc 1500 ttttcgtcgc aagacatacg aaaggcatct gtttcatcag attgtacaac aacccggcga 1560 aaggttcacg gattttgtga tgcgtatgcg taagcagatt gcaaggtgca gttatgatcc 1620 tagtgtggtg gatgaactca ttgcggatcg tattgctcaa ggatgtgctt cggaggagct 1680 acggacgaag ttgttacaga aagaccgtac tctggatgaa gtactaacgt tgggaacaag 1740 cttggcagag tcacaccagc aatccaagaa acttggccgt cccattacac gtcacgaacc 1800 ggaagtgtat gctgtatcaa atcgaccatt ccgaaatgac caacagccaa tgaagcgcca 1860 gttcggtttt caacctcgtt atcagcctaa atataatcaa accaaccgtt tcagctgcta 1920 cggttgtggt cgtcgaggac atattcacgg aagtagcgaa tgtccagcca agcatactaa 1980 ttgtgctgct tgcgggaaag ttgggcattg ggctaaacgt tgttatagtt ccggtggagt 2040 gaaacgtaaa atgaaccgtc ctgtatcgta tcctaaggca aagcgaatcc gtgcggtagc 2100 tgaagaaatc gagaaggagc aatcgaagga ttacgtattc tatgcgatgg gaggcaacgt 2160 gtttacgttc aaagttggag gaattcaaat acctatgacc atcgattcag gatcggatgc 2220 gaacatcatc accaagaagg tatgggagga attgaaggaa gctaaggtgg aagcaatgga 2280 aatgacaaca caggcagacc gatcgcttgt tggatatgcc agtaagcagc caatgcagat 2340 tgctggaaca ttcaatgcac aaatcgaggc cggagagaac aagactgttg caaagttcta 2400 tgtggtcgag gatggtcagc gctgtttact tggcgaccga acagcaaaag aactaaacgt 2460 tctaaaagtc ggatttgaca taaatgcagt tggcacaaac aaaaacaaac cgtttccgaa 2520 aattcgtgga gttgtcgttg aattaccgat tgaccctgat gttcaaccag ttcaacaagc 2580 ctacaggagg ccaccaatcg cgtgggaagc aaaaatagaa gcaaaactga aaactctatt 2640 ggatctagac atcatcgaac cagtttcagg gccatctcca tgggtatcac cagtggttcc 2700 agttatgaaa gattcaggtg aagttcgttt gtgcatcgac atgcgcagag ccaaccaagc 2760 agtgttgcgc gaatcgcatc ctttgcctct agttgacgag ttgcttggat cggtttgcgg 2820 agccgtgcga ttttcgaaaa tcgatattaa agatgcctat catcaggtcg aaatttcgga 2880 acgctcgcgg ccaattacca cattcataac caaacagggt ctattcaggt agagagaagt 2940 tttaataaaa aagggttttc ttgttattag taattgattt cagcaaaaaa aaataaatat 3000 tgtaaaaaca agtttatttt caaatacgtt tttcctaact tttattattt agatacaaac 3060 gactcatgtt tggagtaagt tgtgcacccg aattattcca aaaggtaatg gagtcagttg 3120 ttgcggggtt ggatggagtc attgtatatt tggatgatgc agtcgtttac ggaactacag 3180 ttgatgaaca tgaccgtaga ctggcagcgg ttctcgaaag gtttgctgag tacgagattc 3240 tgcttaacga gaaaaagtgc gtgtataaca ctgacacttt agaatttctg ggtcatcagc 3300 tatccgttga aggtgttaaa ccaacggaaa gtagaattct agccatacag agtttccgag 3360 agcctcacaa tttatcagaa ttaagaagtt tcctgggttt agtttgctac gtgggaagat 3420 ttgtcccaaa cctggcgtca ttgacagatc ctctgagaca actgctgcgt tcgaaagatc 3480 aatttgaatg gacattgaag cacaattctg catttcaacg gattaaagat gaaatctctc 3540 aaattcatca tctcgggttt ttcaacccaa aagatcaaac caaattaatg accgatgcca 3600 gtccaaccgg actaggagcc gtgcttatgc aagtagactg taaaggaaga tgtcgggtga 3660 ttgcatacgc gagtaaggcc ttaacggagt tggagaggaa atattttcaa acagagcgag 3720 aagctttgtc gttagtctgg ggagtcgaaa agttccggtt gtaccttcta ggcatccggt 3780 tcactctagt aactgactgc aaagctctaa agtttctttt taaccctaga tcgaggccct 3840 gtgcccgcat tgagcgctgg gttctaaggc ttcagggata tacattcaag gtagaacacg 3900 ttccagggtc tgaaaacata gcagatgcac tatcaagatt aagtatgcca gtaaacgaga 3960 cgtttgatga accgactgaa ggatatatac ggaatatcgt tgcagaatca attccaatgg 4020 cagtaacgtt tgagaaggtc attgaagatt ctaaaaatga tgcaagtatt caaaaagcca 4080 ttgaagcatt acaggaggga atccgagaga atatgcccaa agaattcaaa ccgtatgttg 4140 atgaattatg ctcggttgat ggcgtactct tgagaggtaa tcggctagtc atccctacac 4200 agctacaaga acgaatagtt actttggccc atgaagctca cccaggaatg gcggctatga 4260 aacggcggct tcgccagaag gtatggtggc cacaaatgga caaacaggtg gagtcgtttg 4320 ttaaacggtg taaagagtgt actatagttt cctgtctagg agctccagaa ccattgaaac 4380 gcaccagaat gccggaaaag ccctggaaag atatcgcagt agattttatg ggtccactgc 4440 cctcaggtca ctcgctgttt gtaatagttg actattttag tcgttttacg gaagcagttg 4500 ttatgagaac gattaccgca aggcgtacaa tcgaggcact gcatgaaacc ttcagtagat 4560 tcggtgttcc cgagtccatt agatcagaca atggaccaca gttcattagc gaggaattta 4620 aaacatactg tgaagaatat gggatcacat tgctgaggac gacaccgtat tggccacagg 4680 ccaacggaga agtagagcgg gcaaacaaaa caatattgaa gcaccttaaa ataagtcaag 4740 agataggaag tgatgattgg atgtgggatc taagatcgtt cctattgatg tacaattcaa 4800 cccctcatgc atcaactgga gtggctccat caactttaat gtttggacgc atattacgtg 4860 acaaacttcc tgctttcgga gatgtatcat tacgaatgga tcaagaagca atacgtgata 4920 gagattggga gcggaaattg aaggatgcgg aatactcgaa tgaacgccgg caagcaaagc 4980 attctgacct aaaggagggc gatgtagtat tgtgcaagcg aatggtgaaa gagaataagc 5040 tgtcgagcac atttgcacca gaggagtatg aagtcattag attggaaggt tcagatgcct 5100 cattgcggtc aatcaactca ggtcggaaag tgcatcgaaa tgtggctcat ctcagacggc 5160 tgactagcga caaaccggat gaacgtaaac aaacggaaga accaccgcca acgggaatga 5220 acgagaataa atccataaat gacgacaacg ttgctggcga atccaacacc acacgaccga 5280 aaaggcttca tcgtattcca gcttatctta atgaatacga aacggatgca gtgtaagact 5340 ttaaaggaaa gggggga 5357 // ID hAT-N4_AP repbase; DNA; INV; 808 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N4_AP. XX NM hAT-N4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-808 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2103-2103 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 808 BP; 297 A; 84 C; 120 G; 307 T; 0 other; actagggccc ggaagttcat gcatttgcat gttttttttg aaccattaga tttattgcta 60 agtgagctag aaatgcattt caagacacta cgaaaaacat atcaaaagtt tatagtgcat 120 atttttgcat atttagtggt ttttaaattt tagagcatat ttcttaattt tacaatattt 180 tagagcatat tttataattt atgaagatat agtatgaaaa ttcgtaaata aaaaaaaata 240 aatttatatt aaaagaatac ttactagata ttcttatatt gtattaggta tatattgtgt 300 tataactaca gtaaaatgtt ccaaaggcat tacaatttat aaaaaatttg aacaagttat 360 tgaaaagaat ctgggcttta aaacactatc caaaatttcc aaaattatgc tgggagaaga 420 aatgacaatg gataatttaa ctgaagattt ttcatgtgat gatttaatat attttaaata 480 cgcaccaatt tcttcggttg acgtggagcg gagcttttca gtgtacaaaa atatgttggc 540 agataacagg cggtcgttta tgtttgagaa tttatcaaaa tcgttgattg taaattgtaa 600 tgtttaaaaa tatatataaa ttaatgaaat aatgtttttt ttttgtaaat tttaaatttt 660 gtaattgaaa taaaaatgtt tttttcctca tatttataaa tttaaaaaaa atatttaagc 720 atatttaagc gcatatatca ggatttttaa gcgcataagt gcatacatat ttccgtagtt 780 tttagtgcat aaacttccgg gccctagt 808 // ID RTEX-4_BF repbase; DNA; INV; 6317 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-4_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-6_BF; KW RTEX-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6317 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6317 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1720-1720 (2009). XX DR [2] (Consensus) XX CC The complete RTEX-4_BF consensus sequence contains two ORFs. The CC RTEX-4_BF ORF1 protein contains the DnaJ (5-84 aa), PHD (218-269) CC and esterase domains (507-675 aa). XX FH Key Location/Qualifiers FT CDS 220..2646 FT /product="RTEX-4_BF_1p" FT /note="DnaJ, PHD, and esterase domains." FT /translation="MASKMAATASPDNGPTIRQLLHLTESASNDEVHEAYS FT IFVQEFTKEQEYVKKLRPAEKKTYRHKISGKYFSEVSAAFLSSMDTRNKIS FT NPTGEVGNHRVTHNSCSITIDIPSNTLSSWKTVCASYYGVQGKDRVTAKGT FT KLSTQFSTTFKDSAQQPERELGSIHITVYSTHRLLIQGTCFALWLCSHYDT FT LQAMVLADAAERHPSEADYSGTVEQATVCAVCDKHEIHCDKFVGCDDCASW FT THFDCTGLSEETKSLLKNTDKKYHCINCHIPQDAIPSTEQSEADQRHFAPP FT PHCTQADKNAKQSNSTMKGPNHQDDEKTETLQTAIMDLEASLAQMKLQSNT FT FECNVSNQLDMIFKRLDSSCPPAKQETEFKKLKEDNKKLQEENKKLCNRVK FT HLESLVTSLQTETTSIQGDMSEVKVCHSTLKVKVANIKEALLDQAVAKNNK FT KPDQNLVLDYAVNTSNRFDALQVDDDPEPNPKELIRSITNRAPSAKPRSQN FT SRPKSRPETKRVVIVGDSNAQKLKPDLLSPTADMPKPIWAPTLPDTLSALN FT KLSKEQPTPDTVVFHVGTNDVMSKSKEAVINEYETVISTTQSLFPSANVVI FT SSVPPRRDTNKRPNVNEDIASINEHLKNTCEANTTLTFVDHPQLWLDRDHN FT VKMFVRDGYHLSSDGVKVMAYNLKKHATDPLGLQPKGQGQHRRMENQGRAT FT HHNDKRRSTPRAPPKSERHASPRHGYDRPNYAMPRPSQHLPTSPPFLGPDP FT HAMGPYHPPPAPWYNEWPSAMEAYGDSYRYNANPMWNVEPAYNRGWNGFWT FT QGKTSRF" FT CDS 2653..6204 FT /product="RTEX-4_BF_2p" FT /note="AP endonuclease and RT domains." FT /translation="MCQTRNSDSRNKLRQYSFRKDHGFSFGLWNIHGLGRK FT VDDSEFIDEIDGFDFFSLLETWHTGKLYFPNFLHFCSSRKKSKNAKRNSGG FT LVFLYKKQYSKFISKLESKTEDILWVKIDKQLLSTDRDIYLATVYISPNRS FT TVHSNRTYDIFDILEDEISIYSCHGDVIVGGDFNARIGTLKDYVDDDCPTD FT NLPEDYTFDNPLPRNNMDLSNPTPFGKNLIDLCINSKLRILNGRTPGDVLG FT KPTCHQPNGSSVVDYVLASENILLHKTSFFHVHPLTPLSDHCKLSLILHNN FT ANNCSEQTTNNLPTRPHCKFVWGADSKQKFIETLNRYEYSSAIDNFLNRTY FT ELTPNSIDSALKDLITPLQEAGKKSLVLHRSREGTHPRNKRPNKKKWFTES FT CAKARKDLKELGKLLSKNHRDPFIRGLYFRRKKEYTRLIRKTKISYKNSIL FT QQLNSHPTNNPSVFWDLVKQFKTSDSIPAEDSIDKQALHTHFQSLYGRDST FT PNVPETNNHFQTRIKNKLLELEEQSASFNSLDTVIQNQEIIQAITQLKNNK FT ASGSDLILNEMLKCASHMLVKPLTKLFNLLLSSRYFPQQWRVSHIVPIHKS FT GPKENPNNYRGISVISCLAKLFTTVINTRLTNYLKENNLISPNQAGFRKNF FT GTNDNLFVIDTLISKYTNANKRLFACFVDFRKAFDCVWREGLRYKLLSSGV FT GGKFYSLIKSMYSNPQTCVKTKAGLTPLFITDKGVRQGCNLSPTLFNMFIN FT DLVNDLDDTDCAPPSLHNLLVSCLLYADDVVMLSESETGLQRALDKLNAFC FT TKWNIHVNLKKTKVMVFNKTGRTVKNIKFSLGVESIQITNSYTYLGLILSS FT SGSYSLAKKQLSLKARKAAFSIKPLIRTSLSPKALLKIYDTCIKSILLYAC FT EIWGKDFSNDKCPVETTQNRFCKNILGVHRNSCNQACRAELGMFPTDIDIS FT VRVIKYWLRLKQLDSSTHPLQVDALLCQEYLMGNSYKKNWLHHVKEIVDNT FT GYSFLWNINYPTLDNKNICTLLKRRLTDMYVQTFFHQLSLDNGKLRFYKSV FT KSEYVMENYLYHNEFNERSAICKIRTSAHPLEIERGRYKQLPVHERLCTYC FT PMNAVENETHFISQCTHYETERKTLFQTVATISPSFYHLNDHQKTTFLLKA FT QNPTIIQNVGQFINHCLQRRSQTRQ" XX SQ Sequence 6317 BP; 2087 A; 1585 C; 1186 G; 1459 T; 0 other; gtacccggat gttgccagtt tcggacggta aaaccgacgc tccgacaatg ttgtaatttt 60 ttcttaccta gaggtcaccc tgggccctaa tttttgctcc agttattcct aacgttacag 120 tgtacccact gataactttt gataacctct gtaatatcct agaggttaca ttttttagaa 180 atctctactt cgaacttgac ggaagaccaa acggccaaca tggcgtccaa gatggcggcc 240 acggcgtccc ctgacaacgg acctacaatc cgacagctgc tccacctgac ggaatctgct 300 tcaaacgacg aagtacacga agcatacagt atcttcgttc aagagttcac caaggaacag 360 gagtacgtaa agaaactgag acccgcagaa aagaaaacct atcgtcacaa gatcagtgga 420 aagtactttt cggaggtaag cgctgccttt ttgtcttcaa tggacactcg gaacaagatc 480 tctaacccca ccggcgaagt agggaaccac agagtcaccc acaattcctg ttccataact 540 attgatatcc caagcaacac cttgtcctca tggaaaacag tatgtgcatc atattacggt 600 gtacaaggaa aagatcgggt cacagcgaaa ggtaccaagc ttagcactca gtttagcacc 660 acgttcaaag actctgctca gcagcccgag agagaactcg gctcaattca catcacagtg 720 tacagcacgc ataggctgtt gatacaaggt acctgcttcg ctttatggct atgctcacat 780 tatgataccc tgcaagccat ggtgctggca gacgcagctg agcgacaccc cagcgaggcc 840 gactacagtg gaaccgtaga gcaagccacg gtatgtgccg tctgtgacaa acacgaaata 900 cattgcgaca agttcgtcgg ttgtgacgat tgtgctagct ggacacactt tgactgcaca 960 ggcttgagtg aggaaaccaa atccctcctc aaaaacacag acaagaagta ccactgtata 1020 aactgccaca ttccacagga cgccattccc tcaacagaac aaagcgaagc tgaccaacgc 1080 catttcgcac ctccgccaca ctgcacacag gcagacaaaa atgcaaaaca aagcaactca 1140 acaatgaaag gcccaaacca tcaagacgac gagaaaactg aaacactcca aacagccata 1200 atggacctag aagcaagcct cgcacaaatg aaactccagt caaacacctt cgaatgtaat 1260 gtatcaaacc agctggacat gatcttcaag cgactagact cgtcctgccc cccagccaaa 1320 caggagacag aattcaagaa actgaaagag gataacaaaa aactacaaga agaaaacaaa 1380 aaactctgta accgagtcaa acacctggag agccttgtga catccctaca aaccgagaca 1440 acaagcatcc aaggtgacat gagcgaagtg aaagtctgcc acagcaccct gaaagtcaaa 1500 gtcgcgaaca taaaggaagc cttacttgat caggctgtag ccaagaacaa caagaaaccc 1560 gaccaaaacc tcgttctgga ctacgccgtc aacacttcta accgttttga tgctctacaa 1620 gttgatgacg acccggaacc gaaccccaaa gaactgatca gatctataac aaacagagca 1680 ccctccgcca aaccccgttc acagaactcc agaccaaagt caagaccaga gaccaagcga 1740 gtggtgatag tcggtgattc aaacgctcag aagctaaaac cagacctttt gagccctact 1800 gccgacatgc ccaagcctat ctgggccccc actctgccag acaccctctc agctctgaac 1860 aaactgtcaa aagagcagcc taccccagac acggtggttt tccacgtggg aaccaacgat 1920 gtcatgtcta aatcaaagga agccgtgatc aacgaatatg agacagtgat cagtacgacc 1980 cagtcactgt tcccctctgc taacgtagtg atatcgtctg tgcctccaag gagagacaca 2040 aacaaacgcc ctaacgttaa tgaagatatc gcatccatca acgaacactt aaagaacact 2100 tgcgaagcaa acactacgct aacgttcgta gaccacccac agttatggct agaccgtgac 2160 cacaacgtga agatgtttgt gcgcgacgga taccacctga gtagtgacgg cgtaaaggta 2220 atggcctata acctgaagaa acacgccaca gatccccttg gactccagcc taaaggacaa 2280 ggccaacacc gtcgcatgga gaaccaaggt cgcgctactc accacaatga caagcggcgc 2340 agtaccccac gtgcccctcc caaaagtgaa cggcatgcgt ctcctagaca tggatatgac 2400 cggcccaact atgccatgcc acggccgtca cagcacctgc caacttcacc ccccttcctc 2460 ggaccagacc cgcacgccat gggaccctac caccctcctc ccgccccatg gtacaacgaa 2520 tggccgtctg caatggaagc ttacggggac agctacaggt ataacgcaaa cccgatgtgg 2580 aatgtggagc cagcctacaa cagaggatgg aatggattct ggacccaggg aaagacctcg 2640 cggttttaag taatgtgtca gacaaggaac tcagatagca ggaacaaact aagacagtac 2700 agttttcgta aggaccatgg tttttcgttt ggactatgga acattcatgg tttaggtaga 2760 aaagtagatg atagtgaatt tatagatgaa atagatgggt ttgacttctt ctccctactc 2820 gagacatggc acacaggtaa gctttatttc cctaactttc tgcatttctg tagctccaga 2880 aagaagtcta aaaacgctaa acgcaactct ggtggtttag tattcctcta taaaaaacaa 2940 tatagtaaat ttatttcaaa gttagaaagc aaaactgaag atattctatg ggtaaaaatt 3000 gacaaacagc tgcttagtac tgacagagat atctatttag cgacggtgta catcagcccc 3060 aacagatcaa cagttcatag taataggacg tacgacatat tcgatatttt agaggacgaa 3120 atttctatat acagttgcca tggcgacgta atagtcggag gagattttaa tgcccgtatc 3180 ggcaccctaa aagattacgt agacgatgat tgccccacag ataatctccc cgaagactat 3240 acttttgaca atcctcttcc ccggaataac atggacttaa gtaacccaac cccatttgga 3300 aaaaacttaa ttgacctatg cataaatagc aaattaagaa tattaaatgg tagaaccccc 3360 ggcgacgttt tgggtaaacc tacatgtcac caacccaatg gttctagcgt cgtagactac 3420 gtcttagcaa gtgaaaatat tctcctccat aaaacctctt tcttccatgt acacccgctt 3480 accccactat ccgatcattg taaactctct ctaattttac ataataatgc taacaactgt 3540 tccgaacaga ctactaacaa tttgcccacc cgcccacatt gtaaatttgt atggggggcc 3600 gactccaaac aaaagttcat agaaacatta aatagatatg aatattcttc agctatagat 3660 aattttctca atagaactta tgagctaacc ccaaattcaa tcgactccgc actgaaagac 3720 ttgataacac ccctacaaga agccggtaaa aagtcactag tgctgcaccg ttcaagggaa 3780 ggtactcatc cccgcaataa gcgccccaat aaaaagaaat ggtttaccga gtcctgcgct 3840 aaagccagaa aggatctaaa agagcttggc aagctcctct ccaaaaacca tagggacccc 3900 ttcattaggg gactatattt taggaggaaa aaggaataca ctcgactgat caggaaaacc 3960 aaaatatcct acaaaaacag cattctacag cagctgaact ctcaccccac caacaaccct 4020 tctgtcttct gggatctcgt aaaacagttc aaaacaagtg attctatccc tgcagaagac 4080 tcgattgaca aacaagcgct ccacacgcac ttccaaagtt tatacggaag agacagcact 4140 cccaacgtcc cagaaacaaa caaccatttc cagactcgga ttaaaaacaa actacttgaa 4200 ttagaagagc agtccgcaag ttttaactcc ttagatactg ttatacaaaa ccaagaaatc 4260 atacaagcga ttacacaact aaagaataac aaagcctctg gcagtgactt gattctcaac 4320 gagatgctaa aatgtgcttc tcatatgctt gtcaaaccac tgaccaagct attcaatcta 4380 ctcctttcat cccgctattt tccccaacaa tggcgagtaa gccacattgt cccaatacat 4440 aaaagtgggc ccaaagagaa cccgaataat taccggggaa tttctgttat aagctgcctt 4500 gctaaattat tcaccactgt aattaacaca cggcttacta actacctaaa ggaaaataac 4560 ctcatttctc cgaaccaggc tggcttcaga aaaaactttg ggactaacga taacctcttt 4620 gtcatcgaca cactaatatc taaatacact aacgccaata aacgcctgtt tgcttgcttc 4680 gtagactttc ggaaagcctt cgattgtgtt tggcgtgaag gtttgcggta caaactactc 4740 agctcagggg taggcggtaa attctacagt ttaatcaaat ctatgtactc caacccacaa 4800 acctgtgtca aaacaaaagc tggacttaca cctttattca taaccgataa aggtgtccgg 4860 caagggtgta atcttagccc gacccttttt aatatgttca ttaatgactt agttaatgac 4920 ctagacgaca ccgattgcgc tccaccatcc ctacacaatt tactagtgtc ctgtctgctg 4980 tatgctgatg acgttgtgat gctctcagaa tcagagacag gattacaacg cgcacttgac 5040 aagcttaatg ccttttgtac gaaatggaat atacacgtga atctaaagaa aactaaagta 5100 atggttttca acaaaacagg aagaaccgtg aagaacatca aattttccct tggcgtagag 5160 agtatccaaa tcactaactc atacacttac ctaggcctga ttttgtcctc gtcaggaagt 5220 tactcattag ccaaaaagca attgtcttta aaggcacgca aagccgcctt tagcattaaa 5280 cccctcattc gcacctcact gtccccaaag gctctactta aaatctatga tacttgtatc 5340 aagtctatac tgttgtatgc ctgtgagatc tggggcaaag atttctcaaa tgacaaatgt 5400 cctgttgaaa caacccagaa ccgtttttgt aaaaacatcc ttggtgtgca caggaattca 5460 tgtaatcaag catgcagagc agagcttggg atgttcccaa cagatataga tatatccgta 5520 cgtgtaataa aatactggct aagactaaaa caactggaca gttccacaca ccccttacag 5580 gtggacgcac tactctgtca ggaatacctt atgggaaata gttacaagaa aaactggttg 5640 caccatgtta aagaaatagt agataataca ggctattctt tcttatggaa cattaactac 5700 cctacattag acaataagaa catatgtact ttgttaaaga gaagacttac cgatatgtac 5760 gttcaaacgt tttttcatca actgtcccta gataacggta agctaagatt ttacaaaagt 5820 gttaaatccg aatacgtgat ggaaaattat ctatatcata atgagtttaa tgaaagatca 5880 gccatatgta agataagaac aagcgcacat ccgttagaaa tagaaagggg ccgctataaa 5940 cagctccctg tccacgagag attgtgtaca tattgcccaa tgaatgcagt ggaaaatgaa 6000 acacatttta tcagtcagtg tacccactat gaaaccgaaa gaaaaacatt atttcaaaca 6060 gttgccacta tttcaccaag cttttaccac ctaaatgatc accaaaagac tacatttcta 6120 ctaaaagcac aaaatcccac catcatacag aacgtgggac agttcattaa ccactgcctt 6180 caaagaagaa gtcaaacccg acaatagtat ccatagtatc atgtagttga ttagacacta 6240 gttttatgta cttacctctc ttagttcttg agactgtgta tgcaattagc cctcgggcat 6300 gattttgcaa ataaaca 6317 // ID Penelope-16_HM repbase; DNA; INV; 2228 BP. XX AC . XX DT 17-SEP-2009 (Rel. 14.1, Created) DT 17-SEP-2009 (Rel. 14.1, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2228 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(10), 2803-2803 (2009). XX DR [1] (Consensus) XX CC ~89% identical to consensus. ORF is corrupted. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1269..1712 FT /product="Penelope-16_HM_1p" FT /translation="MLSKRISNISSNKDVFDKAAPFYDNALASNGYQETIC FT YKPNHTATKSKSRSRNILWFNPPFALNVRTNVARIFLNLVKKHFPKSHKFY FT KLFNKNNLKVSYSCFPNIKNIITAHNKKFYLVLTPQPTNNATADKLRNALW FT KENASKKI*" XX SQ Sequence 2228 BP; 873 A; 395 C; 318 G; 642 T; 0 other; ttagagcact ctctaaatta acgctcacat taaataatta ttgaaaatat ggaacaactt 60 tttttaaact attaaatgaa gaacatacct ataccaacaa agaagttata ctaatttact 120 taacaaaaaa agatgattaa ttaataaaga gaatgaggcg gaaagctatg ttttacgata 180 aaaaacttaa tcaagaacct actaaccaat tcgatgttaa attaagaaaa tgcccaccgc 240 agtcttgaac ttgtgagttt tgaatacaat ttacttaaaa tgataaaaaa tattaatttt 300 agtgcaacca aatgccattt ccaaaatcaa ttgcatagtg atattagcaa gatcaaatct 360 tctgaaaaag tcttcgtctt taaaggcata ttatgtaaag gcatcttata ataaacttta 420 caacgaaaac gtaacaaaaa catataaaaa atctgacgga aagcaatata atacctttga 480 taggaagcaa aaacaatcgc aaaatattta ggcatcgacg acggaatgaa gtgccttgcc 540 ataagtcagc atttataacc ataaaagacc ataaggaaga ttttgctaac aacacaaagc 600 gtcgattgat caatccatct aaaacgaaag gagaagaaaa tagaaaagcc cgtttaaacc 660 agtggcacag ctgatggttt aatattcaag acaaaaaaaa attgtacttt tatccagttt 720 gatattaacg agttttatcc gtcaatctca caaaaactac ttcatgaatc tttacatcac 780 gctaagcaat acgcagaaat atcagacaaa acacttggcg ttataaatca ctctagaaaa 840 tctttgcttt ttactgatca cgacaacatg tgggttaaga gacttggtga cccaaacttt 900 gacgtcacta tgggaagttt tgacggggca gagctgtgcg agcttgtcgg tcttatatcc 960 tgcatactat aagcaacata tatggattac agtctaatgg attgtataga gatgatggtt 1020 cgttctgttt tcataaggtc agtggaccag aatcagaaaa aataaagaaa aatcttgtca 1080 acctttttaa ggataaattc aatctagact taacagtaaa atcaaactta aagattgtaa 1140 attttttaga tgttacgttt aatttatcgg attgttcctt tcgaccatac agaaaaccag 1200 gtgaccaccc actgtacatt aacgttaatt ctaaccaccc tcctaatatt atcaaatcca 1260 taccaacgat gttatccaaa cgaattagca atatttcatc caacaaagac gtttttgaca 1320 aagctgcacc tttttatgat aacgccttgg cgtctaatgg ttatcaagaa acaatctgtt 1380 ataaaccaaa tcataccgca acaaaatcaa aatcaagatc acgtaacatt ttatggttta 1440 accctccttt tgcgcttaac gtaagaacaa atgttgcaag aattttctta aatttagtta 1500 aaaaacactt tccgaaatct cataaatttt ataaactctt caacaaaaac aacttaaaag 1560 taagctacag ctgttttcct aacatcaaaa atattattac tgctcacaat aaaaaatttt 1620 atctggtact aacacctcaa ccaaccaaca atgcaactgc cgacaaactt cgcaatgccc 1680 tttggaagga aaatgcctcc aaaaaaatat agtatactgc tgtaacgtaa aaacttcacc 1740 acaagatagc ggattcaatt atattggact aacagagaat acttttaaag accgttggta 1800 caagcacaaa aactcattcc aatacgaaag caaagcaaat tcaacagagc tttccaagtt 1860 tatgtgggaa attaaaaaaa agggatatac aaatcctatt attacatgga aaattatcga 1920 caaagcgcaa ccgtttaaac ccggaggaaa atcatgcaac ctttgcctaa ctgaaaagta 1980 tcacataata acatcaaaaa tgaaattatt aaacaaaaga aatgagctgg tatcaaaatg 2040 tcgccatgaa aacaaatttt ttataaacaa ctttaacgtc atcaaagcgg acactgcata 2100 gtatttttgt aattaatttt ttttgtcgtt aagattattt gtaatatctg atgatcgcca 2160 taggcgtgaa actcggagtc atgtttcttt aactaccagg tgttattatt taatttttaa 2220 agaatata 2228 // ID SAT_PF repbase; DNA; INV; 144 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Palorus ficicola major satellite repeat region - consensus. XX KW SAT; Satellite; Simple Repeat; SAT_PF; satellite repeat. XX OS Palorus ficicola OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Palorus. XX RN [1] RA Mestrovic N., Plohl M., Mravinac B. and Ugarkovic D.; RT "Evolution of satellite DNAs from the genus Palorus--experimental RT evidence for the "library" hypothesis."; RL Mol. Biol. Evol 15(8), 1062-1068 (1998). XX RN [2] RA Kohany O. and Jurka J.; RT "Palorus ficicola major satellite repeat region - consensus."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [1] (Consensus) XX CC 144 bp monomer unit of satellite repeat. XX SQ Sequence 144 BP; 54 A; 21 C; 26 G; 43 T; 0 other; gagatttcgt taaaatgatg cacaaataag ccgaaataaa tcgaattgag cctaaaatcg 60 tctgatttct tcgagatttt gttaaaatga tgcagaaata agccgaaata aatcgaattg 120 agcctgaaat cgtctgattt ctta 144 // ID R5-1_SM repbase; DNA; INV; 5465 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 04-AUG-2009 (Rel. 14.07, Last updated, Version 1) XX DE A family of planarian NeSL non-LTR retrotransposons - consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; R5-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5465 RA Kapitonov V.V. and Jurka J.; RT "NeSL/R5 retrotransposons from Schmidtea Mediterranea."; RL Repbase Reports 9(7), 1528-1528 (2009). XX DR [1] (Consensus) XX CC R5-1_SM belongs to the R5 group of the NeSL clade of non-LTR CC retrotransposons. The R5-1_SM consensus sequence was derived from CC copies that are ~95% identical to the consensus sequence. CC Analogously to the nematode NeSL-1, the R5-1_SM protein CC containing the reverse transcriptase domain is fused with the CC Ulp1 cysteine protease domain (aa pos. 17-162 in R5-1_SM_2p). CC Based on PSI-BLAST, the Ulp1 cysteine protease domain is fused CC with with the RT domain in other families of Schmidtea CC mediterranea NeSL/R5, including R5-2_SM, and LIN2_SM-LIN11_SM. XX FH Key Location/Qualifiers FT CDS 2..1309 FT /product="R5-1_SM_1p" FT /note="unknown." FT /translation="VQFTQTNQTQNKPDNNTPPQFHQPEFSNTTNSIPTND FT PPTDTLISNSTNQLSATQTPPPLNDTVLIEEEDVDLDESNKSSSSLFSPYN FT TQNDILNETLQSSPHPINTDKDDQLLTLEIQKPINLHPSSQSLFSPTQITD FT PLPKDGETLTNPEQYIHSESLDITQKPYNPHDLRDPFNFIHAIEVTNLYDV FT ANPHNAMTSQPIIPTPDTIIAKTPSNPIQNIIQTNETMAERSPIILRTSSP FT IFRLGAEGYVKCTVPKCKHGKIGPLMPLKQLDHLQSMHNRIIRKNDTFLCL FT LCERNKQKPFVIALNLIETHLTEKHPEKPIRGTREEYNDKKRVLTLTINHE FT GTLLCTYIAKKNPCDTHLTIHTPTAEIQKHILAKHKNKTFDEINIVKCFCG FT IMTSLNNAQSHYATHSPTPTTITIDNEPPTHIPVTTSDIGSD" FT CDS 1313..4618 FT /product="R5-1_SM_2p" FT /note="It contains the Ulp1 protease, RT and FT restriction enzyme-like nuclease domains." FT /translation="LSQHQISAYILTYAKTPNTIYIDSFIGTAHTDRNFTT FT FFAAFPFHQYPTWNDIIWPLYLDKNHWALFYVNKTNKSAVIIDPTCENSTI FT KHKTLAESIINTLTNLQNIIEKNTITLTDHQYPKCDLIIHSGLYICHFARC FT LATNSPITLPIIPELRNQLDHLITKIINNKRNPNDINKYKTFIRDIIIQLN FT TNHIEPDTSLAEMYRIHEVTCPQKYKPLHVVPNFLIQKDHKRMIFELRQSF FT DRKAKASIDKILYPITNISTTPTDDSLISYFESTVKHIDTIIHLDTYIPDP FT EFSSDTPMTSEDLVAAYSCLDTHSAPGSDKITFSDWRRLDPDFEFLAALFN FT NILRTGKSPNKWKTFRTKLIIKPGKEHSPHEVSSWRPLAILDTAYRFFATI FT LNNRLLSWIAKRNLLCSNQKAVGIPDGCAEHNTVISLAKEWAVINGKDINI FT VWLDLADAFGSVPHDLIWHSLSRLKLKNKTINLIKDMYNDCYTVYECEGKH FT TKTIKVNNGVKQGCPMSMTLFSLTINFILQNILKEYPLIIHNHNISIMAYA FT DDLVLIADTREKMRKMIKDITKYTDSATLKFRPSKCGYFQLKRNHNDPPIT FT LYDEQIPIIDENHLYKYLGVDFGQKGKHNIDTILNTALDDTTKILSSDLHP FT AQKLQAYKTFIHSRLIFPFRNCNINHMTLDTNRNRTVQHREKQLGFDQKIK FT RLFKTMLGDIHQNINNNFLYAHSKLGGLGITPSIDEYIIQSVTHLIKLLNS FT KDTKMRQFIIAELIEHTKMRFPFQQNTIDDALKWLHSETKGGTGGPKTIFT FT KFRSSANRLNEKFGIITKLTLDNDNFQLTIKGKNYKDTITHDNLKESSSKL FT HDLTSEWYADQWHTMTCQGHISKTIGSNRQSAFLIKHRALTDKQYLFLIKA FT RNNMITLNFSTNRNKNKMNIMCRLCHKEPETQAHIFNHCTQTQNARRNKHN FT NVMKTTSDYLISKGFSVDVECTPQGIDTRLRPDLIIKSKRNKKLHILDIKV FT PYDREESFNAAREDNYKKYRELALEMGKTNACKTTLSALVVGTLGSWDKGN FT DKALDQIGLNKIEQKKLAKLCTTTAVISSYTIYIDHITNRPKQTQ" XX SQ Sequence 5465 BP; 1933 A; 1399 C; 794 G; 1339 T; 0 other; tgtccaattc acacaaacta accaaacaca gaataaacct gacaacaata caccaccgca 60 atttcaccag ccagaattct ctaacaccac caactccatt cctacaaacg accccccgac 120 cgacactttg atatcaaact ccaccaacca actcagtgca acccaaaccc caccccctct 180 caatgacact gtcttaattg aagaagaaga tgttgacctt gatgaatcca acaaatcctc 240 gtcatctctc ttctcaccat acaacaccca gaatgacata ctgaacgaaa cccttcaatc 300 atcccctcac cccataaaca ccgataagga cgaccaacta ttaaccttag aaattcaaaa 360 gcccatcaac ctgcacccat catcacaatc tcttttctcc ccaacacaaa ttactgaccc 420 actcccaaaa gatggtgaaa cactcaccaa cccggagcaa tatatacact ccgaatcact 480 cgacattacc caaaaaccat acaaccctca tgacctccgc gatccattca actttatcca 540 cgcaattgaa gtcacaaacc tatacgacgt cgctaaccct cacaatgcca tgacatcaca 600 gcccatcata ccaacccctg acacaattat agccaaaacc ccatctaatc ccattcaaaa 660 cataatacag acaaacgaaa ccatggctga aagatcgcca ataatactac gaacatcgtc 720 ccccatcttt cgtcttggcg ctgagggcta tgttaaatgc acagtcccta aatgcaaaca 780 tggcaagata ggccccttga tgcccctgaa acaacttgac catctccaga gtatgcacaa 840 taggataatt aggaaaaatg acaccttctt gtgtctgtta tgcgaacgaa acaaacaaaa 900 gcccttcgtc attgcactaa accttattga aacacatctc actgaaaaac acccagaaaa 960 accaatccga ggcaccagag aagagtataa tgacaagaaa cgtgtcttga ctctcacaat 1020 caaccatgag ggcactctgc tatgcacata tattgcaaaa aagaacccat gtgacaccca 1080 ccttacaatc cacacaccaa ctgctgagat ccaaaaacac atcttagcaa agcataagaa 1140 caaaacattt gacgaaataa acatcgtcaa gtgcttttgt ggcataatga ccagcctcaa 1200 taatgcacaa tcccactacg ccacacactc tccaacaccg acaacaatca ctattgacaa 1260 tgagccacca acacacattc ccgtcaccac ttctgacatc ggatcagatt gactatctca 1320 gcaccaaata tccgcataca tcctgaccta tgccaaaaca cccaatacca tttacatcga 1380 tagctttatt ggcactgcac acactgatag aaacttcacc acattctttg ctgccttccc 1440 tttccaccaa tatcctacat ggaatgacat aatctggccc ctctatcttg ataagaatca 1500 ctgggcactc ttctacgtga ataaaaccaa taaatcagcc gtaattattg accccacctg 1560 tgaaaacagc acaattaaac acaagacact tgccgaatcc atcatcaaca ccttaaccaa 1620 cctccaaaat atcattgaaa aaaacacaat cacactaact gatcaccaat accctaaatg 1680 tgacttaatt attcattcag gtctctatat atgccatttc gccagatgcc tcgccactaa 1740 ctcacccatt actttgccaa tcataccaga gctgcgcaac caactagacc acttaattac 1800 taaaatcatc aacaataaaa gaaaccccaa tgacataaat aaatataaaa cattcattag 1860 ggacataata atacaactta acacaaatca catagaacct gacactagcc tcgctgaaat 1920 gtataggatt catgaagtca cctgccccca gaaatataaa cccttgcacg tagttccgaa 1980 tttcctcatc caaaaggatc acaagcgaat gatcttcgag ctacggcaga gttttgatcg 2040 aaaggctaag gcttcaatag ataaaatact ctaccctata actaacatat ctacaacacc 2100 aacggatgac agcctcatat cctactttga atccacagtt aagcacatcg acacaattat 2160 acacctagat acttacatcc ctgacccgga attctcatct gatacgccaa tgacatctga 2220 agatcttgtt gctgcatata gctgccttga tacacactct gcccccggca gtgacaaaat 2280 aaccttctct gactggcgcc gacttgaccc agacttcgaa tttcttgccg cactctttaa 2340 caatatcctc cgcactggta aaagcccaaa caaatggaaa acattccgaa caaaattaat 2400 aataaaacca ggcaaagagc attcaccgca tgaagtatcc tcttggaggc cacttgccat 2460 actcgacaca gcttatcgtt tctttgcaac tatcctcaat aataggctct tgtcctggat 2520 agcgaaaaga aacctgctgt gcagtaacca gaaggcggtt ggcatccctg atggatgcgc 2580 tgaacacaac acagttatca gtctggcaaa ggagtgggcg gtcataaatg ggaaggacat 2640 caacattgta tggttagatt tagctgatgc ctttggaagc gtcccccatg atctgatctg 2700 gcactccttg tcgagactga aactgaagaa caaaacaatc aacctcatca aagacatgta 2760 caacgactgt tacacagtgt atgaatgcga aggtaagcac actaagacga tcaaggtcaa 2820 taatggagta aaacagggat gtcccatgtc catgactctc tttagtctta caatcaactt 2880 catcttgcag aatatcctaa aagaataccc cctcatcatt cacaatcaca atattagcat 2940 aatggcgtac gctgatgacc ttgttctcat agctgacact agagaaaaaa tgagaaagat 3000 gattaaagac attaccaaat atactgactc cgcaactctc aaattccgac ctagcaaatg 3060 tggatacttc caattaaaaa gaaaccacaa tgacccacca ataacacttt acgacgaaca 3120 gatcccaata attgacgaga accacctata taaatacctt ggcgtcgact ttggccaaaa 3180 gggcaaacac aacatagata ccatacttaa tacagcactt gacgacacca caaaaatatt 3240 atcgtctgac ctacaccctg cccagaagct ccaagcatac aaaaccttta tacactccag 3300 actgatcttc ccctttagaa attgcaacat caaccatatg acactcgaca ccaaccgtaa 3360 cagaacagta caacaccgtg aaaaacaact gggatttgac cagaaaataa aaagattatt 3420 caaaactatg cttggagata tccaccagaa cataaacaat aacttcctct acgcccacag 3480 caaacttgga ggtctcggaa tcacaccaag catcgatgaa tatattatcc agagcgtcac 3540 ccacctcata aaactcctca actctaagga caccaaaatg cgccaattca taattgctga 3600 gctcattgaa cacactaaaa tgagattccc attccaacaa aataccattg atgatgctct 3660 gaaatggcta cactctgaaa caaaaggggg taccggaggc cccaaaacaa tcttcaccaa 3720 attccgctcc tcagcaaata ggctcaatga aaaatttgga atcattacca aactcacact 3780 agacaatgac aatttccaac tcaccatcaa aggtaaaaat tataaagaca ccatcacaca 3840 tgacaacctt aaagagtcct cctcgaaact ccatgacctg actagtgaat ggtacgctga 3900 ccaatggcac accatgactt gccaaggaca tatctccaaa acaattggca gtaatagaca 3960 atctgctttc ctcataaaac atagagccct gaccgacaaa caatacctct ttctcattaa 4020 agcacgtaac aacatgataa ccctcaactt cagcacaaac agaaacaaaa acaaaatgaa 4080 cattatgtgc cggctttgcc ataaagaacc tgaaacacaa gcacacatat ttaaccattg 4140 cacccagaca caaaatgcaa gacgcaacaa acacaacaat gtaatgaaaa caacaagcga 4200 ctacctcatt tcaaaaggat tctctgttga tgttgaatgt acgcctcagg gaattgacac 4260 ccgactccga cctgacctga tcattaaatc caaacgtaac aaaaaactac acattttaga 4320 catcaaagta ccttatgacc gtgaggagag tttcaacgcg gcaagagagg acaactacaa 4380 gaagtatcgc gagcttgccc tagaaatggg taaaaccaat gcttgcaaaa ctacgttgtc 4440 cgccttggtg gttggaactc ttggctcgtg ggacaaggga aatgataaag cactggacca 4500 aataggtcta aacaaaatag aacagaaaaa acttgcaaag ctatgcacta caactgcagt 4560 tatctcaagc tacacaatat acatagatca cattacaaat agacccaaac aaacacaata 4620 aggtgttact ggtggcatgg catgcttatt agtacactgt taattatgga gtactactct 4680 ccgagaactt gggttacccc taaaggctgt tgttgccgtt attgtgattg atattattat 4740 tactattatt attttcgtcg ttatccttat tatttttatc attattatta ctgatttatc 4800 agtattatta ttattaattt accattatta ttattatttt atcaatatta ttattttatc 4860 attattatta ttattaatat tattattatt attattatta ttattattat tattattatt 4920 attattatta ttattattat tattattatt attattatta ttattattat tattattatt 4980 attattatta ttattattat attattatta ttattattat tattattatt tctgttataa 5040 tgatggtatt agtgcctcac gggtacccca aatacataca gatccaaatg ttattggtag 5100 catatgatgc ttgccaatac attgacatag acagacaaca gtgtagtaac tcgccgatac 5160 aacactgtct agcagacata acctgaccca ggtatatctg tgaagcatcg cataatgatg 5220 ctcacaaagg cttgaactgg cacgtaccag aggggggatt ggtgggagtg ctgcccgatt 5280 aacacggagc gttgtgatgt gcgctgctga tccttaaggg gaacacagca atccaaacta 5340 cgagtagaaa atcaaaacca accgaaacct ttaaggtgga agttaatccg aaaggatttg 5400 cctactctta ggagaggctg gtaaaaatga gctcggtgca cgtcgtgtca aagccgttta 5460 ccagc 5465 // ID LYDIA_I repbase; DNA; INV; 6054 BP. XX AC AF177773; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 29-MAR-2007 (Rel. 12.04, Last updated, Version 2) XX DE LYDIA_I is an internal portion of LYDIA, a gypsy-like endogenous DE retrovirus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; LYDIA; LYDIA_I; LYDIA_LTR; KW internal portion. XX NM LYDIA_I. XX OS Lymantria dispar OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Noctuoidea; Lymantriidae; Lymantria. XX RN [1] RP 1-6054 RA Pfeifer A.T., Ring M. and Grigliatti A.T.; RT "Identification and analysis of Lydia, a LTR retrotransposon from RT Lymantria dispar."; RL Unpublished. XX RN [2] RP 1-6054 RA Pfeifer A.T., Ring M. and Grigliatti A.T.; RT "LYDIA_I."; RL Direct Submission to Genbank (16-AUG-1999)Zoology, University of RL British Columbia,. XX DR GenBank; AF177773; Positions 301 6354. XX SQ Sequence 6054 BP; 2085 A; 1394 C; 988 G; 1586 T; 1 other; ggcgcagtcg gtaggatcct cgcgggccgg ttcgcgtcct tagagtcctt tcaaatcgat 60 attcgcgtcg cgagtcgccc gggtggcttt caagcaccgt catcgaggtt ccagaatcta 120 cggtccagca gcagttccag aggagagtca agacctcgac tcacggaaga gaaaccagaa 180 tgaagtatat gtaagtacac ttgaattgtt ctcgtgttgc ttgctttctt tgtcctagtt 240 atatccattc tccaaaattt tccatcgcga aaattcgtac cgggaacctg catctcggca 300 ggctcgttgg aatacaatag atttaattag aaatttaggt agtagccctc cgcagagttc 360 agaaaatatg gcttaaggac aggctaacgt agtagtaata acccatgacc cggacacgat 420 ttttaaggta cttcgcctag tcccggattt cgatgggaac ccgaatattt taactagatt 480 gattaatata tgcgatagaa tagttgtaca atatttaagt gatcagccag gaaacgaatt 540 agcgaactta tgtcttttaa acggaatact aaataaaatt acgggatcag ccgctagcac 600 tattaattcc aacggtattc cagacagctg gttaggtatt agaacagcct taataaataa 660 cttttccgat cagcgtgacg aaactgctct ctatagtgac ctatctttag ccaaacaggg 720 tacgaagaca cctcaagagt tttacgacca gtgccagaca ctattcagta ccataatgac 780 ctatgtaact ttgcatgaac cgattagtac cacagtagaa gctaagagaa tattatataa 840 aaaaattgct atgcagtcat tcgtgagagg cttacaacga accattagga tcgcggataa 900 gatgtatgcg accagagact atagaaaaag cactcgaata tgttcaagag gagttgaatg 960 taatatattt acaacaacgt aacgataccc cgaaatcggt taacacccct agaacgttca 1020 actatgttcc accccctata aatttcaaca aaccacgtgt tcaaaacatc gttagtcatc 1080 cccctaattg gccggcacca gtaggtttta aatcacacca acttccgccg caacccttta 1140 aattcaaccc cctacagaac caacagttcc agcaacgtat ccctaaccgt acgcagcaaa 1200 tatttcgcgc tcaaccaccc ggttataacc cacatagtaa cgtctttcga ctaccaccca 1260 ggaacgttcc accacaaaat ctaggcccaa aacccatgag cggcatacag cattttacac 1320 ctaagcctat attatcaggt catgattggc gtaaatcagg caatccacca ccaagtaatt 1380 atttcaaacc gcgtgaaatg aacgtgaatg agtgcgcgta atacgacgac tcatattacc 1440 cagagtattg ttacgaacac gaagatagct atttttacag taattattac aacgattaca 1500 tcccatacgc tacagaagag tgcacctata atactactta ttatgatacg acaaatgata 1560 ttcagattac cgaagatata gaaccacaac ctggtacaag ttcatcagac ccggattttc 1620 agttggaccc ggtgtcaaaa agaccaaaat agaattaaat ctccaaactc agagacagct 1680 accatatatt tctatttcag atccaccttt aaaattatta attgacacag gtgcaaacca 1740 atcgtttatt agcccggaag cagtcgagtt atattttcca aactttccac taaattacga 1800 tccatttgaa gtcaccgatg tgcacgccgt tagtagaaac gattactcca ttaccttacc 1860 caggtttgct gaatttaacg accctaaccc ggttaaatta ttcgtatacc gttttcatga 1920 ctatttcgac ggattgatag gattagatct attaaccaga tgggaggcca aaatagactt 1980 aaaagaccgc aaattaatta cacaattcgc ctcgaaccca atacgcatgt acaattcttg 2040 taatgtcaac ctatgcgagg acatcatacc accaaaatcg ccaaaattaa ttaaaattcc 2100 cattaacacc cccgacggcg cagcattagt tagcgaacaa actatatgca attgcattat 2160 tcacgaatgc cttactaacg taaaaaacaa tcgcggttac attgaaatag aaaactctac 2220 acccaacgat ataatattgt ctctcgaccg acctgcacat gcagaattat ttaataccga 2280 gtacactcgc accgaanaac cacactcacg cgtatcagaa gttatatcgc gtttgcgcac 2340 cgaacaatta ggccccgaag aaaaagcgaa ccttaccacc ttatgttcta gctatgcgga 2400 cgttttttac attgaaggag aaacacttac ttttacaaat aaaatcaaac atcgtataag 2460 aacaaccgat gaagtccctg tttacacaaa aagctatcga taccccgttt atacaccgac 2520 aagaagttag tgaccaaatt tcgaaaatgc ttaaccaagg tataataaga ccatcagact 2580 cggcatggag ctcaccaatt tgggtcgtcc caaagaaaat ggacgcgtct ggtaagcaaa 2640 aatggcgtct agttgtcgac tttcggaaac tcaacgacaa aactatagat gataaatatc 2700 ccatccataa tattagtgac caacttgaca aattaggtcg ttgccaatat tttacaaccc 2760 tttatttagc cagtggtttt taccaagtgg aaattgaccc cctcgatatt tcgaaaaccg 2820 catttaacgt cgaacatggt catttcgaat ttcttcgaat gcccatggga cttaaaaact 2880 cgccttccac ttttcagcga gttatggaca atgttctcgg agacctccag aataacatct 2940 gcctaatcta tcttgatgga atagtagtct tcagcacttc tctccaagag cacatggtca 3000 acctcgagga aggtattcca aagacttcga gagtctaatt tcaaaataca aatggacaag 3060 tcagcatttc ttaagcttga aactgcttat ctcggacata tcatcagtgg agatggagta 3120 aaacccaatc ccgacaaaat ttcggctatt gaaaattatc ctattcccaa aactcctaaa 3180 gaaattaaac aatttttagg tcttattggc tattatcgaa aattcatacc agattttggt 3240 aatttaacta agccacttac acagtgctta aagaaaggta aaacgattac gttggacccc 3300 cagtacatta gttgtttcga gaaatgtaaa acactcttaa gtaatgatcc tattcttcaa 3360 tatcctgaca tcgaaaaaga cttcagcctg acgaccgacg cttcaaacgt cgctattgga 3420 gccgtgctat cccaaggacc aattggatct gataaacccg tatgttacgc ctcgcgaaca 3480 ttaaacgata gtgaacttaa ctatagcacc atagaaaagg aacttctagc catagtttgg 3540 gctactaagt attttcgccc atatctgttt ggacgaaaat ttaagataat aaccgaccat 3600 aaaccactgc agtggatggt aaaccttaaa gaacctaatt caagattaac tagatggcgt 3660 ttaaaactta gcgaatacga tttcaccgtg gtatataaaa aaggcagaca taacaccaac 3720 gccgacgcct tatctcgtat agaaatccac aacgaagatc taaatagtat agtcgacgaa 3780 ttatgttctc tcataccgaa ccccactgat gaaccgccca ctattgtaaa ctcttcaacc 3840 gaaaccgcac acactagttt agaacatcct atattagaaa ttccaatcac cgatgaaccc 3900 cttaataaat actataggca aattcatttc accatcgtaa atgacgtcaa aagaaaaccg 3960 cacgtaacaa aacctttcga aactcacacg cgcatagcca ttcagctctc aaaatcaaac 4020 ttagaaaacg acatcattaa tgcgattaaa gagtatgtta gtcctaaagt aaaaaccgcc 4080 cttttaataa ctccacctct tgccatgtat tctattatcc ctgtaattca aaaaacgttc 4140 agaagttctt cactaaacct cgtacttaca aaactagaaa tagaaaacat caaggattac 4200 cttcgtcaac aggaaatcat tcgcaactat catgatggaa agacgaatca ccgtggaatc 4260 acagaatgtt atctatctct atcacgaaaa tactattggc ccaaaatgaa agaccagatt 4320 acaaaataca tcaatgaatg tattatctgc ggccaagcga aatacgacag gaaccctatc 4380 aaacctcaat ttaacatcgt accacctgca acaaaaccac tcgaggtagt tcacatggac 4440 ttattcactg tacacaacga aaaatatatc accttcgtag atgtattcac aaaatatgga 4500 caggcgtacc atctcaatga tggcacagct gtaagcatat tacaagctct tcttaaattc 4560 tgtacccatc acggagtacc attaactcta ataacggata acggaaccga atttacgaac 4620 caactctttt cagaatttac gcggcttcac aaaataaacc atcaccgtat tttacctcac 4680 cgaacgataa tgggaacata gaacgctttc attccacaat acttgaacat cttcgaattt 4740 taaaattaca aaataaagat gaacctatag tctatctaat accatatgct ataatagctt 4800 ataacagttc tatccacagt ttcacaaaat gtaggccgtt tgacttatta aatggacatt 4860 ttgaccctag agatatatta gatatagatt ttacacatca tttactacaa caatatagcc 4920 aaaatcataa ggcccagatg aacgaagttt ataaaattat aaacgattca tctttaacta 4980 atagaaccgc tctcatagaa caaagaaata taaatcgtga gcccgaaata gaatacactc 5040 cccaacagca agtcttcatt aaaaatccgt ttgctaacag tcaaaaagtt gcaccacgat 5100 ttactcaaga caccgtctta gcagatttgc caatacacat ttacacctca aaaaaacgtg 5160 gtcctgtagc caagtctaga ctgaaaagag gtcataaagt tccccaatcg ttaaatgatt 5220 ctactgttac tgacccttcg cgagacaacg cttctcgaga caaatctttt agttatactt 5280 agtaataaat aagtttagaa ataactagaa tagaaaatgg ttttactcat acatcgagta 5340 tgcaccactc tagtatagat atcaatatgt tagaattaat gttaggtaag ctagaactca 5400 tttataacct agatgagata ttaagagtag aattaaaata gtactacgaa gcaatcatcc 5460 ccagtttaca cttctctgat aaattcatag taatggtttt taaattccct gtatttttaa 5520 tatatccaga atagctataa cacccgacag gcctttctac cttcctatcc ttacattgca 5580 atcaacacag gtacattcaa gctgtacatg gaggctgaat gcccgaaggt tgggagttgg 5640 tatctctgta agaaacttcg catcagatac gcgtcagccc aaactacatc ccacagctca 5700 ttataaaaca agctttgctc ataactcctg ccagaaccgt ctcatcagct actgaagccc 5760 tggaataatt agacgaccgg cattatgccg tatcttttcc acattccacc aaactaatgg 5820 acataccaag taacactgaa aaacaagtat cgttgtacca agacacgtag tcttgactca 5880 acagatcacc gaagtctgca cagcatccag aacaaagtct gtacagcatc ctttcatcaa 5940 aaggataacc acatgaccca catggaaata gcaaagaatc aggaagcccc agataatatt 6000 ccagtgacat tttcactcaa cgttcgaaaa tggtcgcaga tctacgaggg gagg 6054 // ID Gypsy-19_CQ-I repbase; DNA; INV; 7646 BP. XX AC AAWU01010823; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_CQ_; KW Gypsy-19_CQ-LTR; Gypsy-19_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7646 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 417-417 (2011). XX DR GenBank; AAWU01010823; Positions 25785 33430. XX CC Positions [4633-5112] - Integrase core CC 'GGTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 368..2251 FT /product="Gypsy-19_CQ-I_2p" FT /translation="MGSKFPYVVHLLNDDVDYELRVRGRKDDLNKPLEEKQ FT KILRRLYREDKLEKKTYASIYAIDQEYDWMVSHIEKLRAFLEKDPQPKYIS FT RLRYFYMRANRSNTTTEQSEGLKKSILATISELLTIYDVNESETESDKSDG FT ESNDKDKDKDNKLKLQELQAKEEELKKKEEDIKRKEQELKEKQKQQKPFDQ FT TQFETAVMEAVKKMFAGPLPFPNTKPLTIPSQQTTQEELARTELTKQVKTL FT EKQLQQMKDKMSQMGRQNNNQGVVTAELLAQALSTLQTSQNQIPNQTQGNL FT DPQNNPNQNPNGNQYPPIPPNPPNSTDIFLDPEEENSSEEDSQIDWTTNSG FT RESGRTHYDRRIEKWNIYFSADSRSITIEDFIYRIKVLANMNQISKQRLLS FT HVHMLLRGEASNWFFTYFHPSWDWDMFEIQIRFRFGNPNQEQGTRQSIYNC FT KQQKGEKFVAFVDEIERLNKLLTKPLSRQRKFETIWENMHAHYRLQLAPFT FT IQTLEQLKALNQRIDANDPSLNQKGTKHAVHNLETSSDHAGSDEEEVNAIY FT KKPHGRGSQSQPSNRNETTRLPICWNCRNQGHFWRQCSERKTTFCYICGNP FT GTVAATCDKHPKKEPTGQTGDPSGNQDRNA" FT CDS 2653..5520 FT /product="Gypsy-19_CQ-I_1p" FT /translation="MMEGEKGLEEITEITQSPATGEMVNEIFHFFIHPVEA FT LPVLEKPAPDESLDIPGLDLPEPSKTTPDTVETEHELTSEERSHLAEVIRT FT FPCTADGVLGRTTLLQHEINLREDAVPRRQPLYRCSPAIQAEVDKEIQRYK FT DLDAIEECTSEWANPLVPVRKSNGKLRVCLDSRRINALTKKDSYPMRDMKG FT IFHRLGSANYFSVIDLKDAYFQIPLKEECRDYTAFRTSKGLFRFKVCPFGL FT TNAPFTMCRLMDRVIGFDLEPQVFVYLDDIVVATKTLDEHLELLKIVAERL FT RNANLTISLDKSRFCRKKVTYLGYLLTGEGVSIDNSRIEPILNYSRPKCVK FT DVRRLLGLCGFYQRFIQNYSRIVSPISDLLKKEKKKFTWTEAAEEGFQELK FT TALISAPILANPDFSLPFEIESDASDNAVGAALVQKVDEETKIVAYFSKKL FT SSTQKRYASVEKECLGVLLAIQHFRHFIEGSRFKVVTDARSLLWLFTIGVE FT SGNSKLLRWALKIQSYDIELEYRKGKNNITADCLSRSLDAIGIASVDPEYQ FT ELTKKILSDPQGYPDFRVVEGQIYKLVKNEGRMEDTRFCWKIYPAQGERSS FT ILERIHGTAHLGFEKTLAALRERYFWPLMSSQTKRFCQNCLICQTSKATNV FT NTTAPLTTQRKIAEHPWQFVTMDYIGPLPASGKGRNTCLLVITDVFSKFVL FT IQPFRQATAESLCPFVENMVFQLFGVPEVLLTDNGTQFVSKLFQDLLNRYN FT VSHWKTPSYHPQINDSERVNRVITTGIRATIKRDHKEWSNNIQTIANAIRN FT SVHEATHYTPYFVMFGRNMISDGREYRHLRDNSAGSGNLDDAQRAKLYEEV FT RENLKKAFEKHSKYYNLRSNDKCPKYTLGEKVLKRNTELSDKGKGYCAKLA FT PKYVPAVVKRVVGEHCYELEDEKGKRIGVFNCKYLKKLNHSPSQLVKQSGV FT T" XX SQ Sequence 7646 BP; 2519 A; 1398 C; 1638 G; 2091 T; 0 other; ttttggcgcc caactacagt cacggagtca cggaaagttt gtttttggta ggaaataatg 60 tttttttttt ttaccatgtt taggttctat tttaaggttt tgaaaattaa ataaggttgt 120 ctaggaaaat acaatttcgt gggaggaatt gagataatgt tttaggttac tttttcgaat 180 attttggaat gttttggtaa aacattttgt ttatttcggg aaacactgtt tggaaagatg 240 ttttttgtgg aaaactttat gagtgggtac aagtaagatt aaagattatt agtatacgat 300 attatagtac ataaggcata tttcatacac tttctttgca aaacacttat attagttggt 360 ttcaaagatg ggttcaaaat ttccatacgt ggttcatctg ctgaacgacg atgtagatta 420 cgagcttaga gttagaggtc gcaaagatga tcttaataaa ccgttagaag agaaacagaa 480 aattttaagg cgtttgtatc gagaagacaa attggagaag aagacttatg cttcgattta 540 tgctatagat caagagtacg attggatggt ctcacacatt gaaaagctca gagcgttttt 600 agagaaagat ccacagccga agtacatttc gaggttaagg tatttctaca tgagagcaaa 660 tcgtagtaac acaacgacag aacaatctga gggtcttaag aaatctatac ttgcaacgat 720 ctcagagttg ttaacaattt acgacgtgaa cgaatcagaa actgagagcg ataagtcaga 780 tggagagtcc aacgataagg acaaggacaa ggataacaag cttaaattgc aagagcttca 840 ggcaaaagaa gaggaactaa agaagaaaga agaagatatt aaacgtaaag aacaagaact 900 caaggaaaag caaaaacaac aaaaaccatt cgatcaaaca cagtttgaaa cagcagtaat 960 ggaagcagtt aagaagatgt ttgcaggtcc tttgccattt ccaaatacca aaccgttgac 1020 cataccttct caacaaacta cacaagaaga gttagcaaga acagaactaa caaaacaagt 1080 caaaacttta gaaaaacaac tacagcagat gaaagataag atgagtcaaa tgggtaggca 1140 aaataataat cagggagtcg taacagcgga acttcttgcg caagctttga gtaccttaca 1200 aacttctcaa aatcagatac ctaatcaaac acaaggtaat ttagaccccc aaaacaaccc 1260 aaatcagaat ccaaatggaa atcaatatcc tccaatccct cctaacccgc caaattccac 1320 agatattttt ctagatccag aagaagaaaa tagttcagaa gaagactcgc aaattgactg 1380 gacaacaaat tccggaagag aaagtggtag aacacattat gataggcgaa tagaaaaatg 1440 gaacatttat ttctcggcgg attcgcgatc tataaccata gaagatttca tttacagaat 1500 aaaagttctc gcaaatatga accagatttc caaacaacgc ttacttagcc atgtacacat 1560 gctactacgc ggggaggctt cgaattggtt ttttacctat tttcatcctt cttgggactg 1620 ggacatgttt gagattcaga tccgtttccg gtttggaaac cctaaccaag agcagggtac 1680 ccgtcagagt atttacaatt gtaaacaaca aaaaggggag aaatttgtcg cctttgtgga 1740 tgaaatagaa cgattgaata aacttttgac gaaaccattg tctagacaaa gaaagtttga 1800 aacaatttgg gaaaatatgc atgctcatta tcgcttgcag ttagcacctt tcacaattca 1860 aacattagaa caacttaagg ctcttaacca gcgaattgat gctaatgacc caagtttgaa 1920 ccaaaagggc acgaagcacg ctgttcataa cctcgaaact tcatcagatc acgctggatc 1980 ggatgaagag gaagtgaacg caatctataa gaaaccacac ggaagagggt ctcaatctca 2040 gccctcgaac agaaacgaaa cgacccggtt gccaatctgc tggaattgtc gaaatcaggg 2100 ccatttttgg aggcaatgta gtgagaggaa aacaacattt tgctacattt gtggtaaccc 2160 gggaacggtc gcggctacgt gtgacaagca cccaaaaaaa gaaccgactg gacaaactgg 2220 ggacccgtcg ggaaaccaag accggaatgc ttaagtggga gcacgagtat tccaggtgaa 2280 tcagtggttc ccaactcgaa acctttcatt gatccttttc aaaacctttc agaaattaaa 2340 attcaaacta atcaatgccc tcaggtgaca attcgaattt ttgaaaccga aattaacgct 2400 ctcttagatt cgggtgcgag tatttgtgtg acgaattcta ctgatttagt cgaacgttac 2460 ggacttaaaa ttcttccatc gccgattcgt atttgcacgg cggaccaaac gcaatattcc 2520 tgcgagggat atgttaattt cccaataaca ttccgaaaga ttacgagagt tgtaccagtg 2580 gtagttgttc cacaggtcgc tagaaatttc attttaggaa taaacttctg gaatgcattt 2640 ggaatcaaac caatgatgga aggagaaaaa ggtctcgagg agattaccga gataacacaa 2700 tcaccagcca cgggagaaat ggttaatgaa atatttcatt ttttcattca cccggtcgag 2760 gcacttcccg ttcttgaaaa accagctcca gatgagtcgc tagatatacc tggattggat 2820 ttacctgaac catcaaagac gacaccggat accgttgaaa cagagcacga actaaccagc 2880 gaggagcgaa gccacctggc ggaggtcatt agaaccttcc cgtgtacagc ggatggagtt 2940 cttggccgga cgaccctgtt acagcacgag ataaacctac gagaagacgc agtgccacga 3000 cgacaaccac tctatcgctg ttcgccggcg atacaagcgg aagtggataa ggagattcaa 3060 cggtacaagg acctagacgc gattgaagaa tgtacgagcg aatgggctaa cccgttagtg 3120 ccggtgcgga agtccaatgg gaagttgagg gtgtgtctcg attcgaggag aatcaacgcc 3180 ctcacaaaaa aagattcgta tccaatgaga gatatgaaag ggattttcca ccgactaggg 3240 agtgcaaact atttttctgt aattgacctg aaggacgcat attttcaaat tccattaaag 3300 gaagaatgta gagattatac agcatttaga acgtccaaag gtcttttcag atttaaagtc 3360 tgtccctttg gtttaactaa cgccccattc accatgtgcc gcctcatgga tcgagtgatt 3420 ggcttcgatt tagaacctca agttttcgtg tacttagacg acattgtggt ggccacaaaa 3480 actttagatg aacatttaga acttttaaaa atagttgctg aacgtcttcg caacgctaac 3540 cttaccattt cattagacaa gtccagattt tgtcgaaaaa aagttaccta tttggggtac 3600 ttgttgacgg gagaaggcgt ctccattgac aactctagga ttgaaccgat tttgaattac 3660 tcaagaccaa aatgcgtcaa ggatgtacga cgactgctag gtctttgtgg attttatcaa 3720 cgattcattc aaaattatag tcgaattgtt tcacccattt ctgatctctt aaagaaagaa 3780 aagaagaaat ttacttggac ggaagcggcc gaagagggtt tccaggaact taagacagct 3840 ttgatatctg cgcccatcct ggctaaccca gatttctctt tgccatttga aattgagtcg 3900 gacgcctccg acaatgctgt tggggcagcc ctagttcaga aggtagatga ggaaactaaa 3960 attgtagcct attttagtaa aaagcttagt agcactcaaa agcgttacgc aagtgtcgaa 4020 aaagagtgtt taggagtttt actcgccatc caacattttc ggcattttat tgaaggatca 4080 agatttaagg tagtcacgga tgccagaagt ctattgtggc tttttacaat tggggtagaa 4140 tcgggaaatt caaaactttt gagatgggcg ttaaaaatac aaagttacga catagaacta 4200 gaatatcgaa aagggaaaaa caatataacg gcagattgtt tgtcacggtc cctagacgcg 4260 atcggaattg cctcggtcga tccggagtac caggagctta ccaagaaaat cctcagcgac 4320 ccgcagggtt acccagattt tcgcgtagtc gaaggacaga tctacaagtt ggtgaagaac 4380 gaagggagaa tggaggacac gagattctgt tggaaaattt acccagcaca aggagaacga 4440 agcagcattc tggagcgaat ccacggaaca gcacatctgg gttttgagaa gaccctcgcg 4500 gcactcaggg aaaggtattt ttggcccctg atgagctcgc agaccaagcg gttttgtcaa 4560 aactgtctga tttgccagac gagcaaggcg acgaatgtca acacgacagc accgttgacg 4620 acccagcgga agatagcaga acatccatgg cagttcgtca caatggacta catcggccca 4680 cttccggctt ctgggaaggg acgaaacact tgcctcctag taataacgga cgtgtttagc 4740 aaatttgtct taattcagcc ttttagacaa gccaccgctg aatcgctttg cccatttgta 4800 gaaaatatgg tttttcaact tttcggagta cctgaagtgt tgctcactga taatggaact 4860 caatttgttt caaaattgtt tcaagatctt ctgaatcggt acaacgtttc tcattggaag 4920 actcccagtt accaccccca aattaacgat tctgaaagag ttaatcgcgt tatcacaacg 4980 ggcatacgag cgacgataaa gcgggatcac aaggaatggt caaataatat tcaaactatt 5040 gctaatgcaa ttagaaactc agttcatgaa gccacccact acacgcccta ttttgttatg 5100 tttggtagga atatgatttc agatggtaga gaatatagac acttgagaga caattcagcg 5160 ggaagcggga atcttgacga tgcccagcgg gcgaaattgt acgaagaagt tcgagaaaat 5220 cttaaaaaag cttttgaaaa gcactccaaa tactataatt tgagatcaaa cgacaaatgt 5280 ccaaaataca cattagggga aaaagtttta aaacgaaaca cagaactttc agataaaggg 5340 aaaggttatt gcgcgaaact ggcccctaag tacgtccctg cggtagtcaa gcgagttgtg 5400 ggtgaacatt gttatgaatt agaggacgag aagggaaaac gaatcggggt gttcaattgt 5460 aaatatttga agaaattgaa ccattctccg tcacaacttg tcaagcagtc tggtgttaca 5520 tgagaaatta aatttttaag ctatgagccc tcaagggagg ttacaaaaca tttaacgtgt 5580 acaaaaatcc tacaatggga aataatcaca aattttgcaa aattctttgc aacaaagtat 5640 gatttaacga gctatgtcac tactaaaata gggtaggaac aatgtaaatt aacaattact 5700 acaaaaacaa cgaatttatg ttgaaattca tcaacctcgt tgaaaattca ccactccact 5760 cggaaatcga gtaatgtttg ataagctatg acaacttttt gaaagtttac gaagcaatgt 5820 ttactattgc atgaaaacat caatcgttcc aactaactcc tcagcaatga gtcgagagtc 5880 ccacaaaaaa atggacaaac tactggtgac aagttgttga acggccattg agaatcgaaa 5940 cagaaaccat gaatgaaatc gatgacgcat tcaagaccaa acggtaccat cgaaaccacg 6000 aaacaaagta gaagaccaac cggatttgac aaaaagaaac cctacctatc aaaacaaaaa 6060 gcacttaagt tagctagtta gtagtgtacc tagtttagag atatattagt acttatttag 6120 gattagttca tttcttcgtc gaagtatacg acacgcacgt gcgatcaaca tcgagcacgg 6180 tgcgttcatt gattagatga tatcgatgat caggaatccg cttgtgtaca ttgacgcagc 6240 tccaatttgt aggttatgct tcaatagtag tagtagtagt ataattcatt gatgattgtt 6300 gaggtcatcg gttgacatat tttggttgca gtagttagtt gaatatccat agtccggtct 6360 tgcaattcat ttaattcctt agtcagttga atcatcatct taatagtccc ataattagta 6420 ttgtaatccg tccaatccag tcaccttcag cctcattatt tacatttgct tttcattaca 6480 aacgcctaaa aataaatcac acaaagcact tagttctttc attttttcaa ttttgtgagt 6540 tactcccccc gtaaatccat cagtttaatt agagttcaac ctgctccagt cgcatagaat 6600 acttaaattt tagattttag aaattttctg gccgcgtgaa cagcgagata tggacgattt 6660 cctgtttact tttgagtttg tcagttgagg tttgacagtt cgacccattg ctaagcaaat 6720 ggtgcgtttg atttcagtag gaatgtgatg ttagctgtta atcagtagct cacgtatttg 6780 ggaaatcagt gttgtgaatt tagtcttttt tgaattttag ttgttcagtt aatgatactg 6840 atggaaaagt tcaaattagg gtttgtgtaa ttttgcgttt ggaacgacta gggacaagtt 6900 acctgaagag gagctttagg gtgggcctga gtctgaccag actgatcttt cggtgaatta 6960 agacacgatc atactgctca acgtaatgat tgaggtgatg atcaagttct gaatgagcca 7020 gttttttggg tgtataaatt tgttattttt tgcttttcaa taagttgaaa gacctgttga 7080 gaaattgttg aggtaatgtt agtttgattg ggatttatga gtgtttgcga agttaagttg 7140 tggaaattag gcgcaccgaa gccatgagct ggtgttgaac tgttaaataa tttgattgat 7200 tttttttctc tggatatttt tttttttgta ggttaattta gttaatgaat tgaagatgaa 7260 agttttagca atacctggta actcgaaaac ccattgaacg gggaaaacta gtggaactag 7320 tacaatttac acgaagtgaa acgagttgag gacgatgacc tgtgaaaagt agtgggattc 7380 cacgagagta taaaggtaga ccttaacaat gacttgctta aattagtgga tgtactaaga 7440 tgaaactagc gagtaccaca cagttaccgg tagagcaaag aacgggattc taaagtttga 7500 ggataatcat gtaaaaagta ttgataatta ggttaattca ttttccccct tatagttagt 7560 tgtgtaaata attttgagac gaaatttacc cttacgaaaa tttagttgtt tctcaactaa 7620 attttcgtaa attcagtgag gagtga 7646 // ID Gypsy-4_DPu-I repbase; DNA; INV; 4444 BP. XX AC scaffold_118; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_DPu_; KW Gypsy-4_DPu-LTR; Gypsy-4_DPu-I. XX NM Gypsy-4_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4444 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 723-723 (2010). XX DR Genome; scaffold_118; Positions 236387 231944. XX CC Positions [3044-3538] - Integrase core CC 'ATATG' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 759..3068 FT /product="Gypsy-4_DPu-I_1p" FT /translation="MLKAGARLLVNSALKCGKSNHLEKVCNSENVASLDEY FT VINSLSGQQQSQWSVDLFIAGYTVPFKVDTGASCNVIPLSCFNKFGHTKKL FT LPGPRVRTYNGQSLHVLGQKDVAVFFNSKRFQIRVIVIKEDNVPILGLPSC FT CELCLVHVPQQKVNEIVVDSSSPSLPPAFQNYTGVFIGIGKLPIEHEIKLK FT DNCVPVVRPPRRIPFKIRDLVKKKLDDMEQLKLICKVTEPTEWVNPMLAVQ FT KAGGDVRICLDPLDLNKVIKRQHYPVPTAQELFARIGKAKFFSTLDATSGF FT LQIPLSKESSFVTTFATPFGRYRFLRLPFGICSAPEIYQQTMIQLVGDLSG FT VEIYVDDFFVWGETREEHDDRLKAVFDRCAQVNLKLNASKCKFLQSEIPWI FT GHIISHQQLKPDPDKIEAIRAFPESASKEDLQRLLGMVNYLTKFCENLSST FT AKPLRDLLKHDVEWVWDAASMQTFQAVKNLVILAPTLKLFDPAATVTVSVD FT ASPFGLGAVLLQDGQPVEFASRTLTETQRRYAQIEKELLAVQFGMQHFHHY FT VYGSHVLVETDHKPLIGLVDKPIGLCTPRFQRMRLQLQVYSYQLCYKPGKE FT LYIADTLSRAPEPREYSGDQSQCHDEHVHAVLSYVIPEPTIQEKYARATEA FT DSTLQLIVGLVNKGWPEHKRDCPVPVKPYWSERANLSTARGLLLRGQQVVV FT PYSLQREILASVHDGHFGETKSLERAKSAVFWPGYVEQIRNLVAGCNREET FT TIRLSSIIRQKSRNIPFKK" XX SQ Sequence 4444 BP; 1064 A; 1092 C; 1115 G; 1173 T; 0 other; tggtgtcaga agtaaaccga accaattgtt caacatggcg agtgatcttc atccaccgac 60 atccttctcc tttgatggcg atctgccttc aaagtgggct ctttggaaaa agcaattcga 120 gtggtatctt aaggccacga agaaaaacaa ggaagacgaa gatgttcagg tcggcgtgct 180 gatcacgttg cttggccatg agggcctacg catttacgag accttcactt ggacaacggc 240 tggtgatgcg gctaagattg cgccagtgtt agctaagttt gatgctcatt ttcagccgag 300 aaagagccaa acttttgaac gatacaagtt tttaactcgc caccagcgtg atggtgagtc 360 gtgcgaaacc tccttgttgg aacttcagtc tttgatcgcc acctgtgaat acgacgccca 420 acttgactca attctgcgtg accaaatcgt tattggcgtg gccgacaaca aaacgcgtga 480 aaagttgttg tttgatcccc tgcttacttt ggccaaagca attgacatat tgcgtgcctg 540 tgaaacatcg tcgtcgatcg ctcaacagat gtcggctgaa ggcatcagta ggctgacctt 600 ggaaaaattc aaatcgggaa aaccagttct tggcaagtct ggtcatggcg gtgatcgtcg 660 tagttcaacg gaagtcaaca agcctgaaca ggacgacaag tacagacatt ccagaacacc 720 cacaatgatt acaagttgca agtggtgtgg tggcagtcat gttaaaggcc ggtgcccgtc 780 ttttggtaaa cagtgctctc aagtgtggca agtccaatca tttagaaaaa gtctgtaact 840 ctgaaaacgt tgccagtctc gacgagtatg tgattaattc gctgagtggt caacagcagt 900 cacagtggag tgttgatttg ttcatagctg gttatacggt gcctttcaaa gttgacactg 960 gagcgtcttg taacgtcatt ccgttgtcct gcttcaacaa atttggacac acgaagaagc 1020 tattgcctgg tcctcgagtt cgcacctaca atggacagtc cctgcatgtg ctgggccaaa 1080 aggatgttgc tgtgttcttc aacagcaagc ggttccagat tcgtgtaatc gtgatcaagg 1140 aagacaatgt ccctattctt gggctgccca gttgttgtga attgtgtcta gttcatgtgc 1200 cacagcagaa ggtcaacgaa atcgttgtgg actcgtcgtc gcccagtttg ccgcctgcgt 1260 tccaaaatta cacgggtgtg ttcattggca tcggaaaatt gcccattgaa cacgagatca 1320 agttgaaaga caattgtgtc ccagttgtgc gtcctccccg cagaattccg ttcaagattc 1380 gtgatttggt gaagaagaag ttggacgaca tggaacagct aaagttgata tgtaaagtga 1440 ctgaacctac ggagtgggta aacccaatgc tggctgttca aaaagcgggt ggtgacgtac 1500 gcatctgtct cgacccgttg gatcttaaca aagtgataaa gcgccaacac tatccggtgc 1560 ctacggcgca ggaattgttt gcccgaattg gaaaggcaaa gtttttctca acgctcgacg 1620 caacgtcggg gtttctgcaa attcccctgt cgaaagagtc cagttttgtt accaccttcg 1680 ccactccatt tggacgatat cgttttcttc gtctgccatt cggtatttgc tcggccccag 1740 agatttatca gcagacgatg atacagttgg ttggagacct cagcggtgtt gagatctacg 1800 ttgacgattt cttcgtttgg ggcgaaacac gagaagagca cgatgaccga ttgaaagccg 1860 tgtttgacag gtgtgctcaa gtcaatttga agttgaatgc ctccaagtgt aagtttttac 1920 agtcggaaat tccctggatc ggacacatca tctcccacca acagttgaag ccggatcccg 1980 acaaaattga agccattcgg gcctttccag agtctgcttc gaaggaagat ctgcagcgac 2040 tgttaggtat ggtcaattat ttgacgaagt tttgcgagaa tttatcgtcg acagccaaac 2100 cgttgcgaga tctgctcaaa catgatgttg agtgggtctg ggacgcagca agcatgcaaa 2160 cgttccaggc cgtcaagaat ttggtcattt tggcgcccac cctgaagctt tttgatccgg 2220 ctgctacagt gactgtgtcg gtggatgcgt ctccttttgg acttggggcg gtcttgctgc 2280 aagatggcca gccagtagag tttgcgtcac gaacattgac tgagactcag cgcaggtatg 2340 cgcaaattga aaaggagctt ctggccgtgc agtttggaat gcagcatttc catcattatg 2400 tgtatgggag tcacgtttta gttgaaaccg accacaagcc cctcattggt ctggttgaca 2460 aacccattgg cttatgcact cctcggtttc agcgaatgcg ccttcagctc caagtgtaca 2520 gctaccagtt gtgctacaag ccgggtaaag agttgtacat tgccgacacc ttaagcagag 2580 cgccggaacc tcgggaatat tcaggtgatc agtctcagtg tcatgatgaa catgtccacg 2640 ctgtcctcag ttatgtcatc ccggagccaa cgattcaaga aaagtacgct cgagccacgg 2700 aagctgattc gactcttcag ttgatcgtcg gactcgttaa caaaggctgg ccggagcaca 2760 agcgtgattg tccggttccg gtgaagccgt attggtccga acgggcaaat ttgtcgacgg 2820 cccgcggcct acttcttcga ggacagcaag tggttgtacc atacagcctt caacgtgaaa 2880 ttctagccag cgtccatgac ggacacttcg gtgaaaccaa gtctttggaa agagcaaaat 2940 ccgccgtctt ctggccggga tacgtggagc agattcgcaa tcttgtcgct ggatgcaaca 3000 gagaagaaac aacaatccgg ctcagcagta ttatccgaca gaagtcccgg aacatccctt 3060 tcaaaaagta gcaacggatt tcttccagct ttctggaaaa cattatcttt tgacagttga 3120 ttatttcagc aagtggccgt gtgtggtcga gatgtcgtcg actaccagct ctgccaccat 3180 tcgagagttg gagaaaattt tctctgattt tcgcgtcccg gaaactctag tgtctgataa 3240 cgggccgcag tttggcagtg ccgagtttcg tgtgttttcc cgccagcagc agttctccca 3300 cgtaacgtca agcccgtttt atcctgaatc caacggattc gttgaacggt cggttcagac 3360 cgtcaaatcg tcttttataa aagccattga gagcggccgc tctcttcaag ccgcagtccg 3420 tgctattcgc tccactccac ttggtggtgg cctcccgtct ccgtccgtcc tgcttcagtc 3480 tcgtcatttg cgggatagtc tgccttttgt gcctgccgca ctcaagcatc agagtatcaa 3540 cagctgtgcc gtagaggaac ttttaaatcg ccgacaggat aaaatgatgt ttcatcagtc 3600 gtccgccgtt tccaaatgat atccggtttt atcagttggt caacgtgttc gtgttcgcgt 3660 cgaaaaaaaa atggattcct ggtgtagtga agattgtgtg tcaacaaccg gattcatacg 3720 tcgtcagcac cagtgatgga ggggagtttc gccggaatcg cagggcgatc aacgtttgcc 3780 gtagtcagca aagcgaatgt ccagcaactc ctcccacagt caaccagcgg gcccctgcag 3840 tagagcccag cagaagacca agagcttcgt tcatgtttcc ggccctctct ctgtctaaat 3900 cattttctaa ttcagcagta tccgttcaag ctccgttggt ggttcctgtt cccggtccgg 3960 cgacaagcaa cggggtggca cccgtcttgt ctttaccgcc aactcgccca gcggtgtctg 4020 cgcctgctga gccgatcagt caggccgttt tccccggcat taccaggaaa tcacgcaagc 4080 ctagagagtg gccgtcatgc tcacgctcga gcgcgtggtt ggctgccaag aacaggcgga 4140 gtgtttcgca cagcccagcc aggtcgacgg caagtaatcc gatttcaacg ctcgatcccg 4200 cccagctagt gacgattcct ccttcggaag atcccagcaa ccccctggac cctgttcaag 4260 atactcaaga ggcggtggtt ccagctaaca acgactaagc tagttcgttt tactgttctc 4320 gtttgtgttg taacctcaga actcttgttt gtttgttttt gtgttccctt ttttcattct 4380 tacttcggtt tcacttcctt atctttgcct tgcccatcta tttggttatg ttaaaaagga 4440 aaga 4444 // ID CR1_Ele14 repbase; DNA; INV; 4747 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 19-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele14. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4747 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4747 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 18 CC sequences with >98% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 367..1194 FT /product="CR1_Ele14_1p" FT /translation="MAKQCGKCLEPITGIDVVVCRGYCGAFFHMTSCTNVT FT RALTSYFTTHKKNLFWMCDRCAELFENSHFRALTNHADERSPLNSLTSAIT FT DLRSEIKQLNSKPSTSISPATNIRWPTIDQQRSAKRRRENDSNVRVTDQCR FT TGSKKANENVVSVPICQTDVNQKFWLYLSRICPDVSVDSVSAMIKANLELT FT SDPTVVKLVPKDKDISTLTFVSFKIGLDPSLKSKALDPETWPEGLLFREFE FT DFGIPKFRKPLPLKLSIPPLQQQPLQSPMTPTMEC" FT CDS 1089..4637 FT /product="CR1_Ele14_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="GFWDSKISETTPAEIVDTAVTTATIAITDDSNNGVLT FT LVNPGYNRCTSFGETQLFSCSPASSSSPPGCTPASLWEAPDPLGTVEPLLP FT ATCSHPGPVSECGDGVFHHENAGKYNCTKDDSLPLTLIASSNLVETHSRLY FT NRRFVDVDDFRSSRSISSSSPGCMPASFLGAPYPLGTVEPLLPATRSHPGP FT VSECGDGVFQHVDAGKYKCIKNNILPSTSIVSSRDDNDDAIWIYYQNVRGL FT RTKIDDLLLATTDCNFDVIIMTETGLDNCINGQQLFGSGFNVYRCDRSPVN FT SSKSRFGGVLIAIAQRYVSSVERTVSAQSLEQICVSSTIKGRKVFMCAIYI FT PPDRSQDVNVINDHLSALNELCEKCSLGDAVLVCGDYNQPRMNWCLVDNTV FT QCNSRQLPLASSTLLDGMDYLCLAQRNLVRNQLDRTLDLVFCLSECVADVD FT CSLAPMLPVDSHHPPLEISLPACLVRDERIAQPQENRPLNFRQIDFDALLD FT YLLIIDWNEIFLSNDIDQVAENFCSILNNWLALNVPRIRPSVSPAWSTPRL FT RELKRARNACQRKLRRQRTPSNKRKFQRASNAYRFKNASLYKLYILRVQRD FT LRRNPRGFWRFVNSKRKSSAIPTNVFLNEQCARTAVESSQLFARHFEAIFA FT ANTASIQEVADATRDVPADVVDLSAFTINPDMVLLAAKKLKRSYVPGPDGL FT PAIIICRCISALVQPLCDIFNQSLQQAKFPRIWKQSFITPVFKRGDRHDVT FT NYRGITSLSAVSKLFEIIMCGIIFETTKCYISVEQHGFMPSRSVTTNLLTF FT TSKCMKSMEEKAQVDVIYTDLKAAFDKIDHRILLHKLSRLGVSTHLVSWME FT SYLTGRELRVKIDGCVSLPFSNKSGVPQGSNLGPLLFIIFFNDAGLVLGEG FT FKLIYADDLKLYIVVRTENDCMRLQNSLTLFADWCRRNKLIISVEKCQVIT FT FHRKMHPIVFHYEIDGIILNRVDHVTDLGVQLDEKMSFELHRSAIISKATR FT QLGFISKVAKDFSDPHCWKSLYCSLVRPILENASVVWHPHQLTWSLRIERI FT QKRFIRLALRNLPWRDPDNLPPYPDRCRLLGLETLDRRRKVQQSLLIAKLL FT NGEVDAPELLSLLEFRVPNRVLRSTTLLQSSFHRTVFGYNEPISACIRAFS FT AVEEFFEFDEPTRHFANRIRSVVQ" XX SQ Sequence 4747 BP; 1273 A; 1092 C; 1036 G; 1346 T; 0 other; tttggcatca ctgcttggtg tatgttatgc aaacaaaatc cgctcgtgat tattatttta 60 tttttgttgt gatatcgtaa ccattaaatc gttattttgt tcccccgtgc taatattgcg 120 tcttgtgaaa caataccgtg tatgtttatg catttgtgct gtgccgtggt gataaatatc 180 aatatttgag tgaaggttgt tgctgttcgt ggttcacacc aaacgaaaag ccttcacaat 240 tcctgtgcac ttttttcaac cggagcgaca tatactggca aataacggaa acatcgacca 300 gaagctcaac gtctagcttt atccgttcag cattatcgtt ggaagtctac gcgcaggcga 360 tttacgatgg cgaaacagtg tggaaagtgt ttggaaccca taactggcat cgatgtggta 420 gtctgccgtg gctattgtgg agcttttttc catatgacat catgtacaaa cgtaacccgc 480 gcattgacat catatttcac tacccataaa aagaatcttt tttggatgtg cgatagatgc 540 gcggagttgt tcgaaaattc tcattttcgt gcgctcacca atcatgccga tgaaagatca 600 cctctcaact cgcttacatc agccatcacg gatctacgca gcgaaattaa gcaactgaat 660 tcgaagccct caacttcaat ttctccagca accaacattc gctggccaac aatcgaccag 720 caaaggagtg ctaaacgacg acgcgaaaat gattcaaacg tacgcgtgac tgatcagtgt 780 cgcacgggta gcaaaaaagc aaatgaaaac gtagtatccg ttcccatttg ccaaacggat 840 gtgaatcaaa agttttggct ttatctgtcc agaatttgcc ctgacgtttc cgttgattct 900 gtctccgcta tgattaaagc taatctagag ttaacttctg atccaacagt agtaaaactc 960 gttccaaaag acaaagatat cagcacgctt actttcgtgt cttttaaaat tgggctggat 1020 ccgtcgctta aaagtaaagc acttgaccct gaaacttggc cagaaggact tctgtttcga 1080 gaatttgagg attttgggat tccaaaattt cggaaaccac tcccgctgaa attgtcgata 1140 ccgccgttac aacagcaacc attgcaatca ccgatgactc caacaatgga gtgctaactt 1200 tagtcaaccc tggctacaat cgatgcacca gcttcggcga aacccagttg tttagctgct 1260 caccagcatc aagcagttca ccaccgggat gcacgcctgc cagcctttgg gaagcccctg 1320 atcccctcgg cacagtcgag cctctcctgc cagcgacctg cagtcatccc ggtcctgtgt 1380 ccgagtgtgg tgacggggtc ttccatcatg aaaatgcagg caagtataat tgtactaagg 1440 acgactctct gccgttaacg ctcattgctt ccagtaacct tgtggagacg cattcacgct 1500 tgtataatcg tcgctttgtg gatgtagacg atttccgatc gagccgttcg atttccagtt 1560 cttctccggg atgcatgcct gctagctttt tgggagcccc ttatcccctc ggcacagtcg 1620 agcccctcct gccagcgacc cgcagccatc ccggtcctgt gtccgagtgt ggcgacgggg 1680 tcttccaaca cgtagatgca ggcaagtaca aatgcattaa gaacaatatt cttccctcaa 1740 catcgattgt ttccagtcgt gacgataatg atgatgctat ttggatttat taccaaaacg 1800 tgcgtggcct gaggacgaaa atagatgacc tgcttttggc gacaaccgac tgcaatttcg 1860 acgtgatcat catgaccgag accggattgg acaactgcat caacgggcaa cagctttttg 1920 gttcaggctt caatgtgtat cgctgtgatc ggagccccgt caacagtagt aaatctcgat 1980 ttgggggtgt tctaattgct attgctcagc gttatgtaag cagcgttgag cgaaccgtaa 2040 gtgcgcagag tttggagcag atctgtgttt cgtcgacaat caaaggtagg aaggttttca 2100 tgtgcgcgat ctatattcca ccggatagaa gtcaagatgt taatgtgatc aatgaccacc 2160 tttcggcatt gaatgaactt tgcgagaaat gttcacttgg cgacgctgtt ttggtgtgtg 2220 gcgattataa tcaacctcgt atgaactggt gcttggttga taacactgtt caatgcaaca 2280 gtcgtcaact accgttagcc agcagcactc tgctggacgg aatggattat ttgtgcctcg 2340 cccaaaggaa tcttgttcgc aaccagctgg atcgcactct tgacttggtt ttttgccttt 2400 ctgaatgtgt ggccgatgtt gactgttcgt tagctccgat gttacccgtt gattctcatc 2460 atcccccgct cgaaatctct ttacctgcgt gtttggtacg cgacgaaagg attgcccaac 2520 ctcaagaaaa ccgaccactg aattttcgtc aaatcgactt tgacgctctt ctggactatt 2580 tgctaatcat agactggaac gaaatatttc ttagcaatga cattgatcaa gtggccgaaa 2640 atttttgctc cattttgaac aattggcttg ctttgaatgt cccacgaata cgaccgtcag 2700 tttcgcccgc ctggagtacg cctcgcttgc gggaactaaa acgcgcgcgt aatgcttgcc 2760 aacgtaaact acgtcgtcaa cgcacgccaa gcaataaacg aaaatttcaa cgagccagca 2820 atgcataccg ttttaaaaat gcctctcttt acaagttgta catattgcgt gtacaaagag 2880 atttgagaag gaatcctcgt ggtttttggc gcttcgtcaa ctccaaacga aaaagttctg 2940 caattccaac gaacgtcttt ttgaacgagc aatgtgctag gactgccgta gaatcaagcc 3000 aattattcgc caggcacttt gaagctatat ttgctgcaaa caccgcatca atacaggaag 3060 tagctgacgc aactcgagat gtccctgcag atgttgttga tcttagtgca tttaccataa 3120 atcccgatat ggtcttattg gccgcgaaaa agctaaaacg gtcgtatgtt cctggtcctg 3180 atggattacc ggcaattatt atttgtcgtt gtatttctgc attggtacaa ccattgtgcg 3240 atattttcaa tcagtcactc cagcaagcaa aattcccacg catctggaaa caatcgttca 3300 taacacccgt tttcaaacgc ggtgatcgcc atgatgttac aaactatcgg ggtattacaa 3360 gcctgtcagc tgtttcgaaa ctgttcgaaa taataatgtg tggaattatc ttcgaaacaa 3420 caaaatgcta catctccgtt gagcaacacg gattcatgcc tagccgatcg gttactacaa 3480 acttgttgac ttttacgtca aaatgcatga agagcatgga agaaaaggca caagttgacg 3540 taatttatac cgacttaaaa gctgcgttcg acaagattga ccaccgaata cttttgcaca 3600 aactgtcccg cctcggagta tctacgcatt tggtttcttg gatggagtcg taccttacgg 3660 gacgagagct gcgagtaaaa atcgatggat gtgtatcatt acccttttcg aacaagtctg 3720 gtgtcccaca aggaagcaac cttggaccgt tgctatttat tattttcttt aacgatgctg 3780 gcttggttct cggtgaagga tttaagctga tttatgccga tgatctcaaa ctgtacattg 3840 tagtacggac tgaaaatgat tgtatgcgtc ttcagaactc gttaacactg tttgccgatt 3900 ggtgccgcag aaacaagcta attattagcg tagagaaatg tcaggtgatc acttttcacc 3960 gcaagatgca tccaattgtt tttcactacg aaattgatgg gatcattctc aacagagttg 4020 atcatgtgac tgacttaggg gttcagcttg atgagaaaat gtcgtttgag ctgcatcgtt 4080 ctgcaatcat ctcaaaagca acgcgtcagc taggctttat ttccaaggtt gccaaggatt 4140 tttcagaccc gcactgctgg aaatcattgt attgttccct ggtacgccct atcttggaaa 4200 atgcatccgt cgtttggcat ccgcaccaac tcacgtggag tctaagaatt gaacggattc 4260 aaaagaggtt cattcgtttg gcactaagaa acttgccatg gagggacccc gataacctgc 4320 caccgtatcc ggatagatgc cgactcctgg gactagaaac attggatcga cgaagaaagg 4380 tacagcaatc gttgcttata gcgaagttgc taaatggcga agtcgatgct ccagaactgc 4440 ttagtttgct agaatttcgt gttcctaacc gagttctccg gagtacaaca ctgctccaat 4500 caagtttcca tcgcaccgta tttggataca acgaacccat ctcagcgtgc atccgagcat 4560 tctcagctgt cgaagagttt ttcgagttcg atgaaccgac gaggcatttt gctaatagaa 4620 tacgatctgt agttcagtaa tctgtgttaa gtgtatgtta tgttcaagtt tttattcatt 4680 aagactaatt tgtcagatgg attattttaa atacaaatac aaatacaaat acgaatacga 4740 aataaaa 4747 // ID BEL-6_DPu-I repbase; DNA; INV; 5762 BP. XX AC scaffold_290; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_DPu_; KW BEL-6_DPu-LTR; BEL-6_DPu-I. XX NM BEL-6_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5762 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 659-659 (2010). XX DR Genome; scaffold_290; Positions 49910 55671. XX CC Positions [4644-5195] - Integrase core CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 991..3534 FT /product="BEL-6_DPu-I_1p" FT /translation="MRTLYTLSKPEGHLKSLRLFYDSLESYVRGLEALGKA FT PDTYGDLLVCILMDKLPIEIRKNVARQHDQDEWTLEQLRKALRGEIRVMEA FT GQSSFLPHQQQPSSRNQQQYHGGKQTVNLFSGNSRPIKKFPCVFCSGDHAV FT TQCQQVTSVEETKKLVGNKKLCSNCFSPKHQSKDCQSLASCRICGARHHTS FT LHSGPAKPSNPSTLPTAVSGNHCQTTALSSSVCDIYPFVFLKTAIIRAQSQ FT QHEEKVNALFDEGAQRSWMTRDAAKRLGLRVKSKELLILSGFANSPTAPKY FT YDLVELVIVTIENQPIVVRAIVIDFLVNPLEDGYRKNLQDLAHLKDLRLAH FT PFTGQETFHVDILIGADFYWQFIGDEAPIRGQGPTAVNSKLGYLVSGPLDS FT VRVSKPKQSINLHIQAVDCDDLSFLWSLEKMGIFPHQENVKEAAEYADKCI FT EFEDGQYTARFPWKSEHQELPANFNMVQNMTRAIIKRLAKQPKMLAVFSRL FT IEEQLNHNFIEKVPAEELAKNCHYIPYHYVKRESATTPIRIVYNCSCKGWN FT GVSFNDCVEAVAPLHNDQTQILINFRSHAIGIVADVEKAFHHINLHKADRD FT FLRWFWLEDPTNPESPLAVYRFRVVPFGAKSSPFILNAVMMHHLKKSTSEI FT AADMLQSVFVDNIITGCESPSAALNYFTEANRIMNEAHLPLQAWGFSDGAV FT EKSLAEGGSTDPSRLSKTLGLIWNRENDTLNVQPPHLSAADVVTKRDVLKG FT NGAFYDPLGFYTPLGTSSKILIQDICIADLKIDEELSADHLKRWKEIVDSI FT NGAVDQQLMSLPRSYFGTVDAVQELHVFCDASRRAYGAVAYLCYGS" FT CDS 3576..5735 FT /product="BEL-6_DPu-I_2p" FT /translation="MSKARITPIKDSQREGEREISIPEAELMAAYLGTLLA FT TTIIAALKKNGIRMKIFLWSDSQIIHFWISKPEGHPRQFITNRVKKIREFT FT QQSAATWRYLPSEYNPADILSRGATLSEFKKSSLWRSGPHWLIDRKKLPIW FT SVSQFKTSTSVHLTVSSKKENAEEEIGNIIDPSIYSWEMLLRITGQIYRLL FT SNLKKKDASRKSWNLQPLNALELQDAENVWIRFFQRKHLEEELKYLQRKKE FT AWRTSLVSQLDLFLDGSGIIRCKGRLQNAGISESAKHPVLIPKKTVLARLI FT ITSVHQRIAHYGVDSTIAHLMQKYWIPSARAQVKAVCRNCPKCRRESGPSY FT RYPDPAPLPADRVQENYPFAVTGVDYTGAIQVTSKGERISVYILLFTCGVT FT CAIHLEVVEDMTAGSFINALKWFTGHHPIPRLIYSDNASTFVNASNHLLEL FT FNHRKVQEELAAIRILWKFIPKAASWYGGWWERLIALTKTALLKMVGRTIL FT SFTQLQTVTTQIEAILNDRSLTKISTDENCIQPLTPSHLLYGQRLTTLPYH FT YDAEEELLDPSYGGPSQQPQVMTKAYLRSQNILRSFRRIWNSTYVPSLREH FT HQKTKGPMKSTIRVGDVVQIESESKRANWKLAIIESVNRGGDGKVRSAELR FT TATGRTSRPINKLYPVEVSENSSPTTQDANIQPRCIINESRSPSGRIPRQA FT ARQASEKIKNIAYLESIIEE" XX SQ Sequence 5762 BP; 1676 A; 1344 C; 1248 G; 1494 T; 0 other; tctttttcac tcttgttaat acaaagtttg aattgtagaa aaaaagtctc tcaatcactt 60 atcgcatagt tgcgacgaca atcttatttt cttaaactag cgttgcatag ttgcgacgac 120 gttttctctt tcttaactag tgtcgaagat agtattttcg acatttttgg tgccgtgacc 180 aggataaaag aagattctgg ttcatcactg ctacacaacg tgtggcgact caacgcaggc 240 tgttcaacgc atcagtcatc gcatctttaa tttcatcaaa aatcaacaca acgtagggta 300 acaaactgtt tcatcgaaaa tggccaacag agcacatctt gtatcgcttc gtggaggcag 360 cagaggtcaa gctacgacgc tcgtccgacg actcgaagct gtttttgcta atgcagaact 420 cgatgcaatc cacaaacttc acgagctagt aactaaacat caagcgctgt acgaacgata 480 ccaaataatc gaagacttgg atcagcaaat ttatgctttg actcaagcgg acgcattgga 540 ggcatacatt gcagcggtgg acgaggtgaa cattgtatat caagacgcgc ttagtcttta 600 ccaacaccga atcaacgtta tcaggcgcga aatcgaagca gcagatccag atcgtcgacg 660 gaatgatgca gcagcaccag ggcagcaagg caacgtacca gcgcatcaag tcattgtgcc 720 agcaagagca acgagtcgac cgaaaatcac gcttccgcgt ttcaatggag agattctcca 780 atggcgacag ttttggcagg ctttccaagc ggagatccat tctgatgaca ctctggccaa 840 cattaataaa tttaatacct tgtgggtcaa ttggagccca acgtactcgg cacggttgcg 900 ggtctcacgc cgtctaatga taattatccc gtgctcgtaa atttgctgac agagcgtttc 960 ggcagtattc ccaagattac ggcggcttac atgcgaactc tttacactct ttcgaaaccg 1020 gaaggacatc tcaaaagtct tcgtctgttt tacgattctt tggaatcata cgttcgtggt 1080 ttggaggctt tgggcaaagc ccccgacacc tatggcgatc tacttgtttg tattctcatg 1140 gacaagcttc ccattgaaat acggaaaaat gtggctcgtc aacacgatca agatgaatgg 1200 acattggagc agttgcgaaa agctttaaga ggagaaattc gcgtgatgga agcaggccag 1260 tcatcttttc ttcctcatca acaacaacca tccagcagaa atcaacagca gtatcacggc 1320 ggaaagcaga ctgtcaattt attcagtggt aactcgagac caatcaaaaa attcccgtgc 1380 gttttctgca gtggagatca tgctgtgact caatgtcagc aagtgacgtc agttgaagag 1440 acgaaaaaac ttgttggaaa taagaagctg tgttccaatt gtttcagccc gaaacatcaa 1500 agcaaagatt gtcaatcact cgcatcctgc agaatttgtg gagctcgtca tcacaccagc 1560 ctccattccg gtcctgcaaa gcccagtaat ccttcgacgt tgccgacggc agtcagcggg 1620 aaccattgtc aaacgacagc tctaagttcc agtgtgtgtg acatctatcc atttgtcttc 1680 ctgaaaacag ctatcatcag agctcaatcc caacaacacg aagagaaagt caatgcactt 1740 tttgacgaag gagctcaacg ttcatggatg acgagagacg cagccaaacg gttaggcctt 1800 cgtgtgaaga gcaaagagtt gctgatttta tcgggattcg ccaattcccc aactgcgccg 1860 aaatattacg acctagttga gctcgttata gtcaccattg aaaatcaacc gatcgtcgtt 1920 cgagccatag tcatcgattt tctggttaat ccattggaag acggatatcg gaagaatctc 1980 caagatctcg cgcatcttaa agatttacgt ctcgctcatc cctttactgg acaagagact 2040 ttccatgtgg acatccttat aggagctgat ttttattggc aattcatcgg agatgaagcc 2100 ccgattcgcg gtcaaggacc aacagcagtc aactcaaaat tgggctatct tgtatcagga 2160 ccactggaca gtgttcgcgt ctcaaaaccg aaacagtcaa tcaatctcca tattcaagcg 2220 gtagattgcg acgacttgtc tttcctctgg tcccttgaaa aaatgggaat ttttcctcat 2280 caagagaatg tcaaagaagc cgctgagtat gctgacaaat gcattgagtt cgaagatggc 2340 caatatacgg cccgtttccc atggaaatca gaacatcaag agcttcctgc taactttaat 2400 atggtacaga acatgactcg agccattatt aaacgtttgg cgaaacaacc aaaaatgtta 2460 gccgtgtttt cccgtctaat cgaagagcaa ctcaaccaca actttattga aaaagttcct 2520 gcagaagagc tcgcgaaaaa ttgccattac atcccttatc attatgtcaa gagggaatca 2580 gctactactc cgattcgcat tgtttacaat tgttcatgta aaggatggaa tggagttagt 2640 tttaacgact gtgtcgaagc tgttgcacca ttgcacaacg accaaactca aattttaatc 2700 aacttccgct ctcatgccat cggcattgtt gccgatgtgg aaaaggcatt tcatcatatc 2760 aatctgcaca aggccgatcg cgatttcctt cgttggtttt ggttagaaga tccaaccaac 2820 cctgagtctc cactagccgt ctacaggttc agagttgtgc cgtttggagc taagtcatca 2880 cctttcatcc tcaacgcggt gatgatgcat catctcaaga agtcgacctc tgaaatcgct 2940 gctgacatgc tgcaaagtgt attcgtcgac aacatcataa ctggctgtga gagccccagc 3000 gctgctctta attatttcac tgaagccaat cgcatcatga atgaagctca tcttcccctg 3060 caagcttggg ggttcagtga cggtgccgtt gaaaaaagtt tggcggaggg agggtcaact 3120 gacccatcgc gtctatccaa aactcttggt ttgatttgga atcgagaaaa tgatacgtta 3180 aacgttcaac cgcctcatct ctcagcagcc gacgttgtga caaaaagaga cgtccttaaa 3240 ggaaatggag ccttctacga tccccttggt ttttacacac cattgggtac ttcatcaaaa 3300 attcttattc aagatatctg catagcagat ctcaaaatag acgaagaatt gtctgccgat 3360 catctgaaac gttggaagga gatcgtggat tcaataaacg gcgcagtcga ccagcagttg 3420 atgtcattgc caaggtccta ttttggaaca gtggacgctg ttcaagagct gcatgtattc 3480 tgcgatgcga gtcggcgcgc ctatggagcc gtcgcatatc tttgctacgg gagctaagga 3540 gctaaggact tgctaaggag caaaccgcat ttgtgatgtc gaaagcacgc attactccta 3600 tcaaggatag ccagcgggaa ggagagagag agatctcaat tccagaagcg gaattgatgg 3660 ctgcctatct cggcactctt ctagcaacga ccatcatagc agctttaaag aagaacggca 3720 tccggatgaa aatatttctt tggagcgata gccaaatcat tcatttttgg atctccaagc 3780 ctgaaggcca tcctcgtcaa ttcattacaa atcgggtcaa aaagatcaga gagtttactc 3840 aacagagtgc cgcaacatgg agatatttgc catcggagta taatccagct gatattctgt 3900 cgcgaggagc gactttatcc gaattcaaga aatctagtct ctggcgttcc ggtccgcatt 3960 ggctgatcga tcgaaagaag ttgcctattt ggtcagtgag tcagtttaaa acgtcaacaa 4020 gtgttcatct cactgtctca tcaaagaaag agaatgctga ggaagaaatc ggcaatatta 4080 ttgacccatc tatatatagc tgggagatgt tgcttcggat cactggtcaa atttatcgtc 4140 ttctctcaaa tctcaaaaag aaagacgcat ctcgaaaatc atggaatctc cagcctctaa 4200 atgctttgga acttcaagat gcggaaaatg tctggattcg ttttttccag cgcaagcatc 4260 tggaggaaga attaaaatat ctccaacgaa agaaagaagc ttggcgaacc tccttagtat 4320 cccaacttga tctgtttcta gatggaagtg gcatcattcg gtgtaaaggt cgactccaga 4380 acgcgggaat atcggaatct gcgaaacatc cagtactgat acccaagaag acagtcctgg 4440 ctcgacttat catcacttct gtccatcaac gcatagctca ctatggagtt gattctacta 4500 ttgctcatct catgcaaaag tattggattc cgtcagctcg ggctcaagtt aaagctgtgt 4560 gtcgaaattg ccccaagtgt cgtcgagagt ccggcccatc atatcggtat cccgatccgg 4620 ctcccttgcc agccgatcgc gttcaagaaa attatccctt tgcagtcacc ggagtggatt 4680 acaccggagc catccaagtc actagtaaag gagaaagaat ttcagtctac atattattgt 4740 tcacctgtgg tgtaacttgc gccattcacc tcgaagtcgt cgaagatatg acagctggat 4800 cgtttatcaa cgcactcaag tggttcactg gtcatcatcc gattccccgt ttgatttatt 4860 cagataacgc ttcgactttc gtgaacgcgt ccaatcatct actagaatta ttcaatcatc 4920 gaaaggttca agaggagtta gccgcaatac gtattctctg gaaatttatc ccaaaagccg 4980 cttcatggta cggcgggtgg tgggagagac tcattgcact cacaaaaact gctctcctta 5040 aaatggttgg tcgaactatt ctaagtttta ctcagctgca aacagtcaca acacaaatag 5100 aagctatctt aaacgatcgc tccttgacta aaatctctac ggatgagaac tgcatccagc 5160 cactaactcc gtctcattta ttgtatggcc agcgattgac tacattgcca tatcattatg 5220 atgcggaaga agaattgtta gatccttctt atggtggacc atctcagcaa ccacaagtga 5280 tgactaaggc ctatcttcgc tctcaaaaca tccttcgttc ctttcggaga atttggaatt 5340 cgacatatgt gccctcgcta agggaacatc atcaaaaaac taaaggaccg atgaaaagca 5400 ccatcagagt aggagatgtc gtccagatcg agagtgaatc taaacgagcc aactggaaat 5460 tagccatcat tgaaagtgtc aatcgcggag gagatggaaa agtacgatct gctgaattga 5520 gaacggccac cgggcgtaca agccgtccta tcaataagct atatccagta gaagtttcag 5580 aaaattcatc tccaactacg caagatgcca acatccaacc tagatgcatc atcaacgaaa 5640 gccgatcacc atccggacgg attccgcgcc aagctgcacg tcaagcgtcg gagaaaatta 5700 agaacatcgc atatttagag agcataattg aagaataatt tgttttcttt ccgccgggtg 5760 ta 5762 // ID BOTMAR1 repbase; DNA; INV; 1259 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version 3) XX DE Bombus terrestris repetitive sequence mariner-like element DE Botmar1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Repetitive sequence; mariner-like element; transposase domain; KW BOTMAR1. XX NM BOTMAR1. XX OS Bombus terrestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Bombus; Bombus. XX RN [1] RA Bigot Y., Hamelin H.M., Capy P. and Periquet G.; RT "Mariner-like elements in hymenopteran species: insertion site RT and distribution."; RL Proc. Natl. Acad. Sci. U.S.A 91(8), 3408-3412 (1994). XX RN [2] RA Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (31-MAY-2004). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(161..529,537..1193) FT /product="BOTMAR1_1p" FT /translation="MSNFVPGNYDLRTALIFCYHLKKTAAESHRMLVEAYG FT EHALGKSQCFEWFKKFRSGNFDARNEERGRPPKKFRDSELQASLDEDDAQT FT QQQLADQLNVTREAVSIRLKAMGRSRRWENGFHMNKTAGKPKNHLRNAARQ FT IQKKSFLHRIVTGDEKWIYFENPKRKRSWVAPGEPPTSTTRPNRYGRKTML FT CVWWDQKGVIYYELLKPGETVNTERYRQQMIDLNQALREKRPEYQKRQHKV FT ILLHDNAPSHTAKPVKETIEAFSWEILSHAAYSPDLAPSDYYLFASMGHAL FT SDQHFTSYENVRKCLDDWFASKERQFFWRGIHQLPDRWEKCIASDGQYFE" XX SQ Sequence 1259 BP; 412 A; 255 C; 274 G; 317 T; 1 other; tatgaaaccg gaattggcct atagatggtc ctagctgata atgtagttat cactaaactg 60 cgtcrttgat gccaaaagtc tttgttgaca tctcacaaac attttcgact cagaacaata 120 caagtttcat acaacagcat agtttgtaat agcgttgaac atgtcgaatt ttgtgcctgg 180 aaactacgat ttgcggacag cattgatttt ctgttaccat ttgaagaaaa ctgctgcaga 240 atcgcatcga atgcttgtcg aagcttacgg tgagcatgct cttggtaaat cacagtgctt 300 tgagtggttt aaaaaattca gaagtggcaa ttttgacgcg aggaacgaag aacgtggaag 360 accaccgaaa aagtttcgag acagcgaatt gcaagcatcg ttggatgagg atgacgctca 420 aacgcaacaa caactcgctg atcaattaaa cgtgacacga gaagccgtct ccatacgttt 480 gaaagccatg ggaagatcca gaaggtggga aaatgggttc cacatgaact gaatgaaaga 540 cagcaggaaa accgaaaaac cacttgcgaa atgctgctcg ccagatacaa aaaaagtcat 600 ttctccaccg aattgtgact ggcgatgaaa agtggatata ttttgagaat cctaagcgta 660 aaagatcatg ggtagctcca ggcgaaccac cgacatcgac tacaagacca aatcgctatg 720 gacggaagac aatgctctgt gtttggtggg atcagaaggg tgtgatctat tatgagctgt 780 taaaacctgg cgaaaccgtt aatactgagc gctaccgaca acaaatgatc gatttgaatc 840 aagctttgcg tgaaaaacga ccagaatatc aaaaaaggca acacaaagta attttgcttc 900 atgataatgc accatcacat acagcaaaac cggtcaagga aacgatcgaa gcgttcagtt 960 gggaaatact ttcgcacgcg gcttactcac cagacttggc tccgtccgat tactatttat 1020 ttgcatcgat gggacacgca ctttctgacc agcacttcac ttcttacgaa aatgtacgaa 1080 aatgcctcga tgactggttt gcctcaaaag agcgacagtt tttttggcgt ggcatccacc 1140 aattgccaga caggtgggaa aaatgtatag ctagcgatgg gcaatacttc gaataaaata 1200 tttttaatca ttttcataca ataaacgtgt attttctata caaaaattcc ggtttcata 1259 // ID Copia-1_DWil-I repbase; DNA; INV; 4043 BP. XX AC scaffold_180634; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DWil_; KW Copia-1_DWil-LTR; Copia-1_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4043 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180634; Positions 10964 6922. XX CC Positions [1445-1954] - Integrase core CC 'AATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 127..1596 FT /product="Copia-1_DWil-I_1p" FT /translation="MSSFYQTEKLEEGNYDSWSVQMRSVLIHAELWNSIGM FT TRPTEANAVAAWDAQNQKALATITLSVRPSQLVYLKNCATAKEAWDKLKET FT YEPSGPVRRVSLYKKLLGLQLKEDKMSDYLNMFFSTKDKLAEVNIKLEDEL FT AVIILLSSLPKEYENFVIAMETRDSLPSLDSLRIKLLEEGDRRSNERMSNP FT EEQRAYQANVRKNPIRKKRSLVDIECYSCGQRGHYKRDCPKFQEKPKENRQ FT SQHRSYTILAAAQAGNLDASSWCVDSGATAHLCGRRSMFANYVAHEEDVVL FT ANNHVIKAVGRGDVLVQTGFCELKLENVLHLPGLETNFMSVDRAVDRRCTV FT TFDKNSATVRQDDEMVLIAERQGKLFVFKETGNRCFGAKQISAEDWHYRYG FT HLNYSSLMNIVKRRMVTGMENVDFSEKMPCVTCMRSKIHTQPFPSASEHRA FT QHLLELVHTDVCGPFKQRSRRFFIFFDIHRRHVETNICLLSKIEG" FT CDS 1490..3997 FT /product="Copia-1_DWil-I_2p" FT /translation="MYVDLLSNVLGGSSYFLTFIDDMSRRIFAYCLKSKDE FT VFEKFVEFKEMAERQTGQKLRAIRSDNGREFINRRFDRYLKDPGIVRQLTV FT PYSPQQNGVAERANRTLVEMARCLLVHADMQEFLWAEAIMTACYIRNRPPT FT AALVGMTPYEAWTGRKPNVQGMHIFGSIAVALDKTRRSKFQAKGREYRLVG FT YSLVSKAYRLYDKEKREVVEKRDVMFKDPNDVVSIVESPGPAQTVTGLEMR FT WDESSKDDLENEEFVSAEDFEDESSGQSDDESCQAEEQEPEQEIRQEGQIG FT PGRPRLNRTGKPGRPKKIYNRVNKLHVKELDLPQTFDDAMQGKQSSFWKTA FT MDKEYEALLANKTWTLADLPKGQKAIGSKWVYTIKRSGDGSVERYKSRLVA FT KGCSQIYGVDYTETFSPVCRYETIRLVLAIAAELKLYLHQMDVCTAYLNSD FT LDEVVYMRQPQGYVDKTNSQQVLRLNKAIYGLKQAGRKWNAKLNGALCDLG FT FTSSKNEPCLYQQDIKGRLCLVLIYVDDLLIACQKMDDMIAIKAKIAMKFE FT CVDKGPLREFLGMQVDRNDDTGSIMMNQAQYIKGMLQRHSMEVCRAVATPL FT DAGFQVACSDEKCVIQDVTMYQSAIGELMWLALTSRPDIYHSVIKLAQRNK FT EPHSEHVAAVKHVMRYLAATKDLKLHYRSCGDALTGYADADWGGDVSNRKS FT YTGYVFFLAGGPISWKSEKQCSVALSSTEAEYMAMSSAAKEALHLRRLLIE FT IGCGDAGTSTKLFGYNLSAQSLAKNPVFHARSKHIDIRHHFVREVVTRGEI FT DLDYISTKDMVADILTKNLSKQKHNHCVKLLNLN" XX SQ Sequence 4043 BP; 1250 A; 701 C; 1096 G; 996 T; 0 other; ggttatgggc ccagatacac gcgagtgtgg gcaggaggat ttgtgttaaa agtttaagta 60 gatatttcgc ggctggcgaa aagtaattgt gcggaaagac acacggcata aggcgcgtgg 120 taaaatatga gttcttttta ccaaacagaa aagttggagg aggggaatta tgattcctgg 180 tccgttcaaa tgcgcagcgt gttgatccat gcggaattat ggaactccat aggcatgacg 240 aggcctacgg aagcgaatgc agttgctgcg tgggatgcac agaatcagaa agcgttggcg 300 acgataacgc tgagtgtaag accgtctcag ttggtttacc tcaagaattg tgctacagca 360 aaggaagcat gggataagct caaggaaacg tacgagccaa gcggacccgt ccggagggtg 420 tcgctatata aaaagttgct tgggttgcag ctgaaagagg acaagatgag cgactatctc 480 aatatgttct tctctacaaa ggacaaactt gctgaagtga atattaagct agaagacgaa 540 ctagcggtga tcatattact ttcgagtttg ccaaaggaat acgaaaactt cgttattgcg 600 atggagactc gtgatagtct tccttctttg gacagtctaa ggattaagtt gctggaagaa 660 ggagatcgca ggagtaacga gagaatgtca aacccagaag agcaacgggc ataccaggca 720 aatgtgcgaa aaaatccaat tcgtaagaag aggtctctgg tggatataga gtgctacagt 780 tgtgggcaac gaggtcacta taagagggac tgcccaaagt ttcaagaaaa gccaaaggaa 840 aatcggcaat cacaacatcg atcgtacact attctagctg cggcacaggc aggaaatcta 900 gatgcttcga gctggtgtgt ggattcggga gcaacggctc atttatgtgg caggcgcagt 960 atgtttgcga attatgttgc acatgaggaa gatgttgtat tggccaacaa ccacgtcata 1020 aaagcagttg gcagaggtga cgtgcttgta caaacaggat tttgtgagct gaaattggag 1080 aatgttttgc acttaccagg gttggagacg aattttatgt cggttgatcg agcagtcgat 1140 aggcgatgca ctgtgacttt cgacaaaaat tcagcgacgg ttcgccaaga cgacgagatg 1200 gtgctaatag cagagaggca aggaaagctc tttgtgttta aggagacagg caatcgatgc 1260 tttggagcga aacaaataag tgcagaagat tggcactaca ggtatggtca tttgaattat 1320 tcaagtctga tgaatattgt taaacgcagg atggtgactg gcatggaaaa tgtggatttt 1380 tccgagaaga tgccatgtgt gacttgtatg agaagcaaga ttcatactca acctttcccg 1440 tcagcgtctg aacatagagc tcagcatctt ttggaattgg tacataccga tgtatgtgga 1500 ccttttaagc aacgttctag gcggttcttc atattttttg acattcatag acgacatgtc 1560 gagacgaata tttgcttatt gtctaaaatc gaaggatgaa gtattcgaga aattcgtgga 1620 atttaaagag atggctgaaa gacagactgg ccaaaaattg agagcaataa gaagtgacaa 1680 cggccgagaa ttcatcaaca ggcgttttga taggtactta aaggatcctg gaattgtgcg 1740 tcagttgacg gttccatata gccctcaaca gaatggagtg gctgaacgag caaaccggac 1800 gctagtcgag atggctagat gtttgcttgt acacgcggat atgcaggagt ttctctgggc 1860 agaggcaata atgacagcgt gttacattcg aaacagacca ccaacagcag ctttagttgg 1920 tatgacacct tatgaggcat ggactggaag gaaacccaat gtgcagggta tgcacatttt 1980 tggatcgata gcagttgcat tggataagac acgacgaagc aagtttcagg ccaagggcag 2040 agaatatcga ctcgtaggat attcactggt gtcaaaagca tacaggttgt atgacaaaga 2100 aaaacgagag gtcgttgaga aacgggatgt aatgttcaag gatccgaatg atgtggtttc 2160 gattgttgaa tccccaggtc cagcacaaac agtcacaggt ttggaaatgc ggtgggacga 2220 atcaagtaaa gacgatctgg aaaatgagga atttgttagc gctgaagatt tcgaagatga 2280 gagttcaggg cagagcgacg atgaatcttg ccaagcggaa gaacaagaac cagaacaaga 2340 gatacgacag gaaggacaga tcggtcctgg aaggccacgt ttgaatcgca ccggtaagcc 2400 aggacgtccc aagaaaatat acaatcgtgt aaacaagtta catgtaaagg aactggattt 2460 gccacagaca tttgatgatg cgatgcaagg taagcaatct tcattctgga agactgctat 2520 ggataaggaa tatgaggctc tgttggcaaa taagacatgg acactggcag atttacccaa 2580 gggacaaaaa gcgattggtt caaagtgggt atatactatc aaaaggagtg gcgatggaag 2640 cgtggagcgg tataagtctc ggttggtggc taaagggtgc tcacagatat acggcgtcga 2700 ttataccgaa accttctcac cagtgtgtcg ttacgaaacg atacggctag ttttggctat 2760 agcagcggag ctgaaactat atctgcacca aatggatgtg tgcacagcct atcttaacag 2820 tgacctagac gaagtcgtct atatgaggca gcctcaggga tatgtggata agacaaattc 2880 acaacaggtg ttgaggctga ataaagctat atatggattg aaacaagcag gaaggaagtg 2940 gaatgccaag ttgaatggag ctctatgtga tcttggattc acttcttcca agaacgagcc 3000 gtgcctttat caacaggata taaaaggtag actttgtttg gtactaatat atgtagatga 3060 tttacttata gcttgccaaa agatggatga catgatcgca atcaaggcaa agattgcgat 3120 gaaattcgaa tgcgtcgata agggtccgtt acgagagttt ttgggcatgc aagtagaccg 3180 gaatgatgat acaggcagta tcatgatgaa tcaggctcag tacataaaag gcatgcttca 3240 aagacatagt atggaagtgt gtcgagcagt tgctactcca ttggatgctg gtttccaagt 3300 ggcgtgcagt gacgagaaat gtgtcataca ggatgtgaca atgtatcagt cagcaatcgg 3360 agagttgatg tggcttgcat taacatcaag acctgatatc taccattcag ttatcaaatt 3420 ggcgcaaagg aataaggaac ctcacagtga acatgttgca gcagttaagc atgttatgag 3480 gtatttggcg gcaacgaagg atcttaagtt gcactaccga tcgtgtggag atgctcttac 3540 gggatacgct gacgcagatt ggggcggtga tgtgagcaac cgcaaatcgt acacagggta 3600 cgtgtttttc ttggcaggtg gccctatctc ctggaagtca gaaaaacaat gcagtgtggc 3660 gttgagcagc acagaggccg aatacatggc tatgtcttca gcggcgaaag aggctttgca 3720 tttgaggcga ttgttgatcg agattggatg tggagacgct gggacatcta cgaaattgtt 3780 tggatacaat ttgagtgctc aatctctggc caagaatcct gtgtttcatg ctcgatcgaa 3840 acacattgac atcagacatc atttcgttcg agaagttgtt acaagaggcg agattgatct 3900 ggactatata tcaactaagg acatggtagc tgatatatta acgaaaaatt tgtctaagca 3960 aaagcataat cattgtgtca agttattgaa tttaaactaa gaaagtaatt gtaattggca 4020 cacagaatca tattgaggaa ggg 4043 // ID L2-9_NVi repbase; DNA; INV; 5547 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-9_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5547 RA Bao W. and Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(4), 759-759 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 151..1572 FT /product="L2-9_NVi_1p" FT /translation="MNLSTPKVRLPRNSGTNSTKSNNLAVTTPAPKPAASE FT GKSTPEEVEEAISTKQSTTPDAGHHCALPSSMLNLFSSQFELLERAVEDLR FT SLISRKFDAVAGATVGAPASSGHQEDFTARIASLEGKIDAQSTKLDGLHND FT RMLLLQENTVLKEQLNGLVRKIEGMADSVASKCRAHDVSHSDSDADDSRIM FT ISLRDIDVSASCMRDAGSPVIDVANDFKQIRAAARSRVGGRRRRDAGAQFV FT SRAVDGESGSVSGDCGSDSRGIESESGGAGSSQTTKHKRLSPLSSNTSELG FT GMRGSENEADLIISNVSDGTDAQRLNVAHAVLAAIIPSISRDDVVSLRMLR FT REVDDGHGSRQRRSPWVVRLSRRDIANSIMRAKSSFVAFSTKDINVSLLNE FT ETRNSLISSKIFINELLSNDNFIKFNNLKRIARGLNFKYIWHRGGRFLIKR FT GDGDRTHEFTSAADLHALAASYNNNNTC*" FT CDS join(1787..3541,3442..4554) FT /product="L2-9_NVi_2p" FT /translation="MGMFRQCLANARPSYHLFGVAETRFGSEVDDVFAQVD FT GYSVLRQDRNKRGGGVALFISNVFKATILCSSQSEVEGKPGIPEYLMCGVQ FT QGNKPPIFVAVIYRPPGVSFSDNSELANNLRRYSAGYDYRLVMGDLNANML FT STSHDAEFVRDLACELNLKLVNHGATHHVRDSHTWIDMIFTDDDNVVLSAS FT NSPANFRSSHTIIDVEIYFQTTEPPALNSLTYRDFKSIKSDELLSLLAACD FT WSSVSCSDSGVDTRLEHLSQKLLTVIDQLAPLKEFKPRQKDYPPWVDAELR FT HLYSRRDALKRRHKHARRGSRRRDDLWTEYQALAAEAERCTNQAREEFIQN FT RIFDALENNKNVWNELRSLGLISKAKEDLHGFTTDELNTHFAGVSISDAER FT EVDLSGILAEANEDGFTFREVTFADVVLAVAHFSSQAKGEDGIPQSVIAKS FT LPVLGHLLVTLFNASLSCGVFPGAWKNAHLVPLKKKPIPSTVSDFRPIALL FT SFLSKVLEKIVHEQISEFLASKNILDPFQTGFRHNHSTQTALLKLTEDIRT FT GIDSDKQLLTILLLVSLGRWSCGSCLIYLDVTKEWLLNQTATHHFATGFSR FT SVVLWVMSYLSGRNQRVVTKLNGESSWLTTNLGVPQGSILGPLLFSIYIND FT LQEVLAGFRGPKGLLTDSVAHLLYADDLQTYTQVTRDNLREGVDRMSAAAR FT AVSDWASHNALHLNTGKTKAIIFGSEYNVNKLQGLNLPGVEVQTGVFVPFV FT DAVTNLGVVMDSKLTWKPQVDAVSRKVNRALYGLRSFRSCTNEALRKQLAG FT ALVISHLDYCSVVYLDVSGELETRLQRLQNSCVRYICGVGRYEHISPYRRK FT LGWMNIKERRTYFMAVLMYKAHSMGQPPYLSALFEKNQCRTSGRSSRDITV FT PGTRTDTGLKSYRVQGARLWNSLPRGMRTLPSLSRFKLAMRNYLLPTLSET FT *" XX SQ Sequence 5547 BP; 1421 A; 1258 C; 1436 G; 1432 T; 0 other; cctgccaaaa atgtgctgcc acaaaagaag gaaaaagaag aaactgagaa agagaaaggg 60 aaagacaata aaggcaagca tatcacgccg ggcaccaaga gctcgacgaa gggctcgttg 120 tcactgtcat catctcgcaa atctgataag atgaacctca gcacgccgaa ggttcgtctt 180 cctcgcaact cgggcacgaa ttctacgaaa tccaataatc tcgcagtgac aacacctgct 240 cctaaaccgg ctgcctctga aggcaaatcg acaccagaag aagtagaaga agcgatttcc 300 accaaacaga gcactactcc ggatgccggg catcattgcg cactgccatc gagcatgctt 360 aatttattca gctcgcaatt tgagctgctg gagagagcgg ttgaggatct tcgatccctc 420 atatcaagaa aattcgatgc cgtggctgga gccacagtcg gtgcaccagc gagcagtgga 480 catcaggaag attttacagc tcgtatagca tccctagagg ggaaaatcga cgcccagtct 540 accaagctcg atggcctcca taacgacagg atgctacttc tacaggagaa caccgtcttg 600 aaagagcaac tcaacggcct cgtgaggaag atcgaaggta tggcggacag tgtagcgtcg 660 aaatgtcgtg cgcatgacgt gagtcactct gactctgacg cggatgattc gcgtattatg 720 atctccctca gagacatcga tgtctcagcg agttgtatga gggacgcggg ttctccggtg 780 attgacgtgg cgaacgattt taagcagatt cgtgctgctg cgaggtctcg ggtggggggt 840 cgacgtcgtc gcgacgcagg tgcgcagttt gtctctcggg cagtcgatgg cgagtcggga 900 tcggtttcgg gagattgcgg gtctgattct cgcgggatcg agtctgagtc tggaggagct 960 ggctcctcgc agacgacaaa acataaaagg ttatcacctc taagctcaaa tacttccgaa 1020 ttgggtggga tgcgtggtag tgagaacgag gcagatttaa taatttctaa cgttagcgat 1080 gggacagacg cgcagcgtct aaatgtggct catgccgttc tcgctgcgat catcccatcc 1140 atatctaggg atgacgtagt ttcgttgcgc atgctcaggc gcgaggttga tgacgggcat 1200 ggatctcgtc agcgtcgctc cccatgggta gtgcgtctct ctcgccggga tattgcgaat 1260 agcataatgc gtgcgaagag tagctttgta gcttttagta ctaaggacat caacgtatcc 1320 ctgcttaatg aggaaacgcg taatagcttg ataagtagta agatttttat taatgagctt 1380 ttgagtaatg acaattttat aaaatttaac aaccttaaac gaattgcacg agggttgaac 1440 tttaaataca tttggcatcg aggcgggcga ttccttatca agcggggaga cggcgacagg 1500 actcatgagt ttacgtcggc agcggattta catgcactag ctgcatctta taacaataac 1560 aatacttgtt gatactaatg gtaacatcga caatgtcgtt agaaatgcca ctcccgctac 1620 gagggcaact aataaaactg ctgaatccac tggaccgacg cagacggaag caaatgtggc 1680 tccgactgcg caggatcagg attgactggg ccggggcaat gcgagagagg ccgggagcga 1740 ggcactgaag gcgggattcc tcaatgcgac ctcattgtat gcgcacatgg ggatgttcag 1800 gcagtgtctt gcgaatgctc gtccttccta ccacctcttc ggcgtggctg agacgcgctt 1860 tgggagtgag gttgatgacg tgtttgcgca ggttgacggg tactcggttt tgcgacagga 1920 caggaacaaa cgtggaggag gcgtagccct gtttattagt aatgttttca aagcgactat 1980 actatgctcg tcgcagtcag aggtggaggg gaagccgggt atcccggagt atcttatgtg 2040 cggcgttcag cagggtaata aaccaccaat cttcgttgcc gtaatctatc gtccaccggg 2100 tgtatctttc tctgataatt cagagctcgc gaacaacttg cgtagatact ctgcggggta 2160 tgattatagg ctcgtaatgg gcgacctgaa cgccaacatg ctgtcaacct cacatgatgc 2220 ggaatttgtt agagacctcg cctgtgaatt gaatctaaaa ctggtaaatc acggcgcgac 2280 tcaccacgtc agagattctc acacatggat cgacatgatc tttactgacg acgacaatgt 2340 agtgctgagt gcaagcaact cgccagcgaa tttcagaagc agccatacca tcattgacgt 2400 tgagatttat ttccagacga cagagccccc tgctctcaac agtcttacat acagggactt 2460 taaatcgata aaatctgacg agctcttatc cctgctggct gcttgtgatt ggtcgtcggt 2520 tagttgttcc gacagcggcg ttgacacccg actagaacac cttagtcaaa aactcttgac 2580 agtcattgac caactagctc cgctaaaaga atttaaaccg agacagaaag actatcctcc 2640 atgggtcgac gccgaattga gacacctgta cagccgacga gacgctttaa aaagacgaca 2700 caaacatgcc cgacgaggct ctagacgacg tgacgaccta tggaccgaat atcaggctct 2760 cgccgctgag gctgaacgct gtactaatca agcgcgagaa gaatttattc aaaatcgaat 2820 tttcgacgcg ctggaaaaca ataaaaacgt ctggaatgaa ttacggagtc ttggccttat 2880 ttcaaaggcc aaagaggatt tacatggctt caccacggat gaattaaaca cgcatttcgc 2940 tggggtgtcc atatctgatg ccgagcgcga ggttgatctg agtggcatcc tggccgaggc 3000 caacgaggat ggtttcactt ttcgtgaggt cacctttgcg gacgtggttc tggctgtcgc 3060 tcacttttca tctcaagcaa agggggagga tgggatacct caaagcgtta tagcgaaatc 3120 tctcccagtc ctaggacatc tcctggtgac cttatttaat gcctcactct cctgtggggt 3180 ctttcctggt gcctggaaga acgctcattt ggtgcccctt aaaaagaaac ctattccatc 3240 tactgtttcg gacttccgtc caatagccct tttgagcttt ctctctaagg ttctggagaa 3300 gatcgtacat gagcagatct ccgaattttt agcttcaaaa aacattcttg atcctttcca 3360 gactggtttt cgacacaacc attcgacgca gacagcacta ctaaaactga ctgaggacat 3420 caggactggc atcgacagtg acaaacagct actcaccatt ttgctactgg tttctctagg 3480 tcggtggtcc tgtgggtcat gtcttattta tctggacgta accaaagagt ggttactaaa 3540 ctaaacggtg aatcttcttg gctaacaacc aaccttggcg tcccacaggg ctcgatcctg 3600 gggcccttac tgttcagcat ttacatcaac gatctccagg aggtgttggc cggttttcgg 3660 gggcccaagg ggttactgac cgacagtgtc gcccatttac tctacgcgga cgatctgcaa 3720 acctacactc aggtcacaag agataatctt cgtgagggtg tggatcgcat gtcagcagca 3780 gcgcgggctg tgtcggattg ggcatctcac aatgctctgc acctcaacac tggcaagact 3840 aaagctatca tttttggatc tgagtataat gtaaataaac tacaggggtt gaacttgccc 3900 ggtgtcgagg tgcagacggg tgtttttgtt ccttttgtcg atgccgtaac taacctcgga 3960 gtggtcatgg attcaaagtt gacgtggaaa ccgcaggtgg atgcagttag ccgaaaggtt 4020 aacagagctc tttatgggct cagatctttt agatcctgta ccaacgaggc gttacgtaag 4080 cagctggccg gcgctctggt catttctcac ctggattact gctctgtagt gtatcttgac 4140 gtgtcggggg agctcgagac aagactgcag aggctacaga actcatgtgt gaggtacatt 4200 tgcggggtgg ggaggtacga gcatatctct ccttacagga gaaagctggg ctggatgaat 4260 attaaagaga ggagaactta tttcatggcg gtgttaatgt acaaggcgca cagcatgggg 4320 caaccgccgt acctttcggc cctctttgag aaaaatcaat gtaggacctc gggcaggtcg 4380 tcgcgagata tcactgtccc gggaacaaga accgatactg gcttgaaatc gtacagggtg 4440 caaggcgctc gcctttggaa ttcgctcccg cgaggtatgc ggacactgcc ttcgctttcg 4500 aggtttaaat tagcgatgcg gaactacctc ctgccgactc tcagcgaaac ataatttgtt 4560 attgtatgcg cctgcgcctt gtatatttgt atattcgtga tctagtctgt gttaatgtat 4620 gtatttatga aagattatgt aattacgcga catacactta atatatttaa gttgaaagta 4680 ctgaacaata taagtaaggt agcactgcgt gacgtgtacc tgattcgata cctggtatta 4740 tttaagaatt tcgattatga ccttgcacga attgtgatat ttgacggctg tgataaacag 4800 caaatattta ttttattttt ataaagtatg aaacttttat atttactctg tactgtgcta 4860 tgactgacat gatgtaattt tttgaaaatg ttttcgaata aataccttac ttacttactt 4920 acttacttac ttacttactt acttacttac tataacggag cggcatcgtt attacagttt 4980 ggtgtcgggg atttgatatt aggtttgcat ccggctgcga caaagtagta tgcgcacagc 5040 tccgttggtc cgaatgcgtc gtgtgatagc tatttcaatt gccctgtgta cgcacagcgg 5100 cgaaaggagt cgagagacgt cggcgtaaag gttggttcgg agtagtaagg ctaaccttag 5160 attcacagag acgtgagcgc acacctctgc gcatcgtcgg tgactgtctc gctactttct 5220 caaccctgtt tgcgcacatt ggcagagaga gcagagatac gtcggagtgt gagaggtagg 5280 ttttcggaga ctctgggtat cgtccacgta tgctctagag gacgagactc gttgtcgttt 5340 tagcttcctg attatatcgt gtgctcacga cgattatccg gaggatcgga gtaaggcgtg 5400 tgaactagcc gtcacgcgtg ctcacaagac ggttttgtcg gcggtggtct aactctggct 5460 ctccttgcag tatcgtgtgc tcacggtatt gcccggatcg aaagctcgag cagaactgct 5520 tacggaatat atatatatat atatata 5547 // ID hAT-N1_AAe repbase; DNA; INV; 491 BP. XX AC . XX DT 31-MAR-2011 (Rel. 16.04, Created) DT 31-MAR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Non-autonomous hAT DNA transposon from Aedes aegypti. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-491 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1313-1313 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% identity. CC 8-bp TSDs. 52-bp TIRs. XX SQ Sequence 491 BP; 101 A; 88 C; 152 G; 149 T; 1 other; tagagatggt cgggtctcgg gttttcaaac ccgaaacccg acccgaaccc gaacccgacg 60 ggttcgggtc gggttcgggt ttgaaaattt ttattttttc gggttcgggt tcgggtcggg 120 tttgaaggct aaaaaattat cgggtacggg tcgggttcgg gcttgaaaaa wgtcgggttt 180 agtcgggttt gggtcgggtt tggtgagaat tgttcacaaa gtaacaacct gaaagcttcc 240 caatgcatca gaaacgatga atttatcatc ttttcgaacc cgggttctat tcgggttttt 300 cttagaaaac tttttcgggt ttcgggtcgg gttcgggttt gaacagcgaa aaattttcgg 360 gttcgggtcg ggtccgggtt tgcttttcaa tttttgtctc gggttcgggt cgggtccggg 420 tttgaaggaa taaaataagt cgggtacggg tcgggttcgg gtttggaaaa tgtgaaaccc 480 gaccatctct a 491 // ID Gypsy-609_AA-LTR repbase; DNA; INV; 1638 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-609_AA_; KW Ty3_gypsy_Ele180; Gypsy-609_AA-I; Gypsy-609_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1638 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 1638 BP; 475 A; 297 C; 359 G; 507 T; 0 other; tgctgtcagc tatcgagctt ttgtatgctg tcagttatta tgctgccagc tactgtattt 60 tgtgttttgc tgtcagatct tataagcgaa ccgaagacaa gaaccaaacc tgcgccacct 120 aggttttgat aatgaaaaca aaacgaaata tatcaatttc aaatttattg cgcagtagag 180 aaattaattg cgtcgtagag aatcaaccca ggtagaaaac tctaaaacaa tgactataat 240 gatgctttag ttaaatgctc tctctataat cgaccaatgg gtggaaattg cgtcgcagaa 300 attaggttga aaccaatcta gggcgcgtcg gtcaaatcat tattgtggag tgcttcgaat 360 cgagtagttg atttttgtcg gagtgatatt actattgaaa agttaataag taatcattat 420 tataccaaat ttgaagtgag ttttcgggcg tttaaaatta atttaagtgt tgttaattaa 480 atttatgtgc ttttttgtgt ttagcgtttg aatttgtcga gcgtaaatag ttatttagag 540 aaatcctcta atttattaaa gtggaatatt gagttcaggt aagcctttct atttaattag 600 tgagcaaatt agtgaattta atccgcgttg tgcaggagag acaagcccaa gcaacgtggg 660 cggcaagtat agggccgttt tgggccagcc tagagtccgg attcacgttc tttcggactt 720 cgagcatcgt ggacgcaaac cgattcctac agtatcgtcg tttggcctac ccaacggaaa 780 ttggttgtcg tcgttgttgt gaaacacctt ctacgaaatt cgaaagtggg atcgccagtc 840 aataagtcgc gtggcgtagc cgagacacct agaacggctt cactccacct cgctagggga 900 catctatacc ccgacacaca cgccagcgcg cgaaccactt aggttgttcc gacacaccac 960 ttggtgacga cgaaggtagg ttttgagcca caatacacct cttcgctgac gggttacgtc 1020 gtcgagggac gtccgacgcg ggaaaatcgt tgcgcaagga aacgataagg ttagaccggt 1080 ccatatttgc tcttgaagac taaaattata aaaccaatta gcataattag aaaacaccaa 1140 gcctaaagaa attacgtgac gtcacgaata ggaaatagta ggttttcctg aatttaataa 1200 gaagtagttt tcttagagtt ttatcaataa actcgataaa ttcgaccgct tttgttatcg 1260 cgatcgttaa ttacttttct aaaaataaac ctacttcaaa tttagttaaa gccttcacaa 1320 tttcgagtca cgtaataaat atttattgat tttagattgg tttatacttt tttagtgatt 1380 cttttccctt tttggtgtct atcgagagat agttagtttt ttttagtaaa tttagtagga 1440 atttcatatt ggtaagtacg aattgggttg attggctacc gacgtttgcg agtgttgaag 1500 tgagacttcg gttcttttcg aacattgacc tgccctgagt caggaggtct catccgggta 1560 atgagcagag ctaacggtcc ttctttatag aaggtggcgc taaagccgtc agctttagga 1620 acagaacgga acgttaca 1638 // ID CR1-35_BF repbase; DNA; INV; 1255 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-35_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-35_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1255 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1255 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1606-1606 (2009). XX DR [2] (Consensus) XX SQ Sequence 1255 BP; 363 A; 279 C; 329 G; 284 T; 0 other; ttacagacgc tcaacatggg ttcgtaccca aaagatcttg taccactcag ctactggtag 60 tcatgaacga ttggtcacag tgtctagaga ggggggaacc agtggacagc atctacttag 120 acttcaaaaa agcttttgat tcagttcccc acaagaggtt actcgtcaag ctgcactcat 180 atgggataag gggccataca cttaagtgga tacaggactt tctcagcaac aggaggcaga 240 aagtagttat caatggagcg cactcctcct ggcgtaacgt caagagtgga atcccgcagg 300 gaagtgttct gggtcctacg ttatttgttg tttttataaa cgacctgccg gaggttgtga 360 ccagtgcagt gaagatcttt gctgacgata gcaaaatcta cagaccgatc ctatgcaagg 420 aggatcagga agcgcttcag cgtgacctcc aagctgtgga aaagtggtcg gaggtgtggc 480 aactcccctt caacgctggg aaatgcaaag tactccactt gggaagaggc aaccaaagag 540 cagagtacag actgggtggg ctaattctgg aagaaaccaa agtcgagaaa gacctcgggg 600 tgtctgtcga cgagcagtta aagttccacg ttaacaccac agtctcagcg caaaaggcaa 660 accaaatatt aggcttggta aaaagaacct tcagtaactt ggatgaagta acagtgcccc 720 tcctctacaa atcgatggtt cgcccccatt tagagtatgc gaacgctgtt tgggggccgc 780 atttcaaagt cgaccagaac actgtggaaa aagtacaacg tagagcaaca aggctggtcc 840 caacactcaa gtcctttccg tatagtgtta ggctggagag tttgaagcta ccatcactcc 900 actatagaag agccagaggg gatatgatac aagtctttaa gttcctcacg ggaagggaga 960 gggttagtgc acagagtttc ttcaccgaga gtcaacatca gtcaactaga gggcatcgct 1020 tcaaacttac ggtgccactg gcaaagtcct tggtacgtcg ccagtcgttt gcggttcgag 1080 tagtacaaaa ttggaactcg ctgccggctg aggtcgtaga ggcagagtct gttaacgcgt 1140 ttaaaacgag gttggactta tgttggagta agaagagata ccacgatcgt caggagggtg 1200 actacacagg cgcagcctac ttcacctaga catcatcaag gtatcatcaa ggtat 1255 // ID Helitron-2_DVir repbase; DNA; INV; 9141 BP. XX AC . XX DT 31-MAR-2007 (Rel. 12.03, Created) DT 31-MAR-2007 (Rel. 12.03, Last updated, Version 1) XX DE Autonomous family of Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; KW Interspersed repeat; Helitron-2N1_DVir; Helitron-2_DVir. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-9141 RA Kapitonov V.V. and Jurka J.; RT "Helitrons in fruit flies."; RL Repbase Reports 7(3), 132-132 (2007). XX DR [1] (Consensus) XX CC This is a consensus sequence of a family of autonomous Helitron CC transposons transposed in the Drosophila virilis genome a few CC million years ago (copies are less than 5% divergent from the CC consensus sequence). The Helitron-1_DVir consensus sequence CC encodes the 2059-aa Hel-2_DVirp protein composed of the REP, HEL, CC and apurinic endonuclease domains. Helitron-1_DVir elements and CC non-autonomous elements transposed by the Helitron-1_DVir CC enzymatic machinery (e.g. Helitron-2N1_DVir) are usually inserted CC in the TTT|TTT target sites without the target site duplications CC (the insertion site is marked by "|"). Different families of CC Helitrons constitute ~5% of the D. virilis genome. XX FH Key Location/Qualifiers FT CDS 2729..8905 FT /product="Hel-2_DVirp" FT /translation="MPRASKKSANQRRQDAINRSRVYYIENTESVCLRNSQ FT RLADTRECNGDIREVERNTDALARSKRRSARFRQDERDLDAAARATQRQDE FT DVRALEQTQNTADHAVRRQNLEYRTLERERDAEAHATRRQDLEIRLQEQEL FT NTVEHTVRRQNVEYRESERQRDAAAHVTRRQDLEIRLQEQEINTVEHTVRR FT QNEEYRASERIRDASAHVARRLNVNVRELEQDANTVQHNIRRQNEAYRTRE FT RERDVLNRRNARQNIIARRREQVVNSSQRRVTRNQDRIRENELRNMAEARV FT INFRNSFQNRAIQSAQQSQRNRDAREEMSTEQREQQRDLQRQRRVFSARDK FT YLENIKKGPTEICISCGGTWFPFQVKLLNKINLSAKFPNMDISKAFYLSNK FT FPSLSEDYLFCSTCRRHISSGKIPSLCLSEGFNFPEIPDCLKDLTCLEERL FT ISPRIPFMRIISLGYERQCAVRGAVVNVPISVSETVTALPRTFENSHVVQI FT HLKRKLEYAHNFMTETIRPARVLEALRYLINRELYLKHNISVASNWLGVNA FT HNEEVPFVADPADITAVDSLLRCRMETDSEGHDTSDLNPGGQETLLDNESI FT ENNAIIGRIAMAPGEGRRPIDMTTDEDAEELSFPTISCGEICKSKSTYCRK FT AKSQIRYYDRRCAKVPKILYMYKCYELTRIKSSISICLRKKSRNANITVGN FT LRDEGFVSNLVQHDDGYHVLKGLRSATAHWEAEKKKVLAMIRQFGLPTFFI FT TLSAAETKWSELLVILSLILDNRVLSEEEAANLSSSEKARLIRSDPVTCSR FT YFDYRFRQLLKLFKTDIFGEYQLSHFYWRIEFQHRGSPHSHGMYWLSSAPK FT LNENSSQSSRQVIEFIDRFITTNGDNPDLEEVIKYQQHRHSFSCQREINRH FT KICRFNMPYPPMPQTEILYPLEEGDSNLRRHQDMWKKIKELLDSSVSDAEA FT LSLNNFDSFLAHINTDYIGYKFALRSSLKKPQIFLRRKYCDKMLNAYNEDI FT LKLYRANMDIQYILDAFACCSYIINYINKSNRGVSQLLREAMNEISSNNIS FT VKQKLQHIGNKFINGTEISAQEASYNILGLHMSGCSHGEIFVNTSHPDQRV FT GMVKAQRDLETLPSDSKDYFLPNLLDHYVQRPDELNNLCLADFAAKFKYSK FT SLNAHSTGDYEEENDDEEREDNANDGSPMVLKNRSGILRERRHSLILRWRR FT YSINISRSDYFRELVMLFFPWRNEQVDILSTDNEQTFNAQFQIIESNRKRY FT EYLDEAHFQEVLDEIQEVNNDNNDGQEISASQLLDEEFRGFAVPEVARNIN FT VFEDNVHEGNSDENAIRMIKLPAQVPENELNLMVRSLNSKQRQYYAHFMHN FT MNVMQIFYDYVGGGAGVGKSRLISTIYQSLTLRFNTRVGCDSDSAKILLCA FT PTGKAAFGIGGLTLHSVFSLPVNQYSMELRPLSNDAINTMHSKLIDLKLII FT IDEISMVGAKMLSYLDLRLKQIFRNNTYFGGISILAFGDLKQLPPVGDRWI FT FAPNSRDPYSIILGAPLWDLFQYFELTEIMRQREDCAFAIALNNMSEGKMT FT DTDKLLIQQRIVLPSEINNIPNDAIRLFYSNQKASDYNNIRLSQIIIEEYT FT SKSKDMLKTKSISERNKLNILENIKAFKTSETQGLPYLLKLKTTAKYMITV FT NINTNDGLVNGASGQLMHIDYTASVSDVLVSTLWLKFTEATVGAQARAKKT FT NPLHSDWTPIERVLRTFQYRKNDQVTIERLQFPLVPAEAITIHKSQGATYK FT KVAVHLQDGISRAALYVACSTATNASGLFLIGKFNPPREIQATDPVYIELE FT KLRGEKKLTTHHQIYFERVEGNILTYFHNIEGLHNHSLDISNDFIIKAADI FT LCFVETWSREGQVYNIDGFDILGRIDGSMCTGSTRPPGGIIVYAKSYIAQS FT AEILSVVNINLEHNKVCQILIFQHLDTKYVVVYRNPSFSFSEFRKLFLEHL FT EQELDRHSKIVILGDFNSCRARVRDQLEIRLNELGISSLLRGVPTTKQLTE FT IDWVFSTNGLDNKRAQVYETAYSYHDGIFVFNS" XX SQ Sequence 9141 BP; 3007 A; 1615 C; 1778 G; 2739 T; 2 other; ttataccctt gcagagggta ttataatttt gtcgtgaaat gtgtaacgca tagaaggaga 60 catctccgac cccataaagt atatatattc ttaatcagca tcaacagccg agtcgatata 120 gccatgtccg tctgtcgttt ctatgcgaac tagtccctca gttttaaagc tatcttaatg 180 aaactttgca gaagtcccta tttctgttgc acgcagcaca tatgtcaaaa ccagctggat 240 cggaccacta tatcatatag ctgccatagg aacgatcggt cgaaaattaa gttgttgtat 300 gaaaaaaatg ttttttttca agatatctcg aacaaactcg gcatttatta atgttactat 360 actactcata tatatgcaaa atyctattaa gatcggacca ctatatcata tagctgccat 420 aggaacgatc ggtcgaaaat taagtttttg tatgaaaaaa cattttgttt ttcaagatat 480 cttgaccaaa ctcggcattt attagtttta ctaagctcct catatatatg caaaatccta 540 ttaagatcgg accactatat catatagctg ccataggaat gatcggtcga aaattaagtt 600 tttgtatgaa aaaagatttt gtttgtcaag atatcttgac caaactcggc atttattagt 660 tttactaagc tcctcatata tatgcaaaat cctattaaga tcggaccact atatcatata 720 gctgccatag gaacgatcgg tcgaaaatta agtttttgta tgaaaaaacc ttttgtttat 780 caagatatct tgaccaaact cgacatttat tagttttact atgctcctca tatatatgca 840 aaattctatt aagatcggac cactatatca tatagctgcc ataggaacga tcggtcgaaa 900 attaagttgt tgaatgaaaa aattgtttgg ctttcacgat attttgaaca aacaaatttk 960 tttcctgtgc ctcttagata tgagcattat gagtcatcag tggaatgata attaacgtcg 1020 gagtaatata agctcttttt gttatttcta caagtggttg atttagaatg tgttcagaaa 1080 ggaggctaaa aagcatacgt ttttcagcaa aaaagttgcg cttaagtcaa atattcgcaa 1140 acgtttgtga aaattcgggg tttgtagtta tattgcagaa gaataattta attgtaatac 1200 aaaagatcgg tatttattat gttaccttaa gcatttcaaa tataaatagt tacgcgcgca 1260 ctattcagtg ttacaatgag ctttttcaat aggtagtacc ttgcccgaaa tgaaaaattg 1320 agagtgcatg ctttgcaaca tattttaagt aatcgatttt cgattctcgc tgtaaaaagt 1380 ttcactgaga agccctaaaa ataattgtcc catggaatta aattttaaaa tggtaatcaa 1440 ggaaggaatc gcttctgacc ccataaagta tatatattct tgattaggac ggcaagctga 1500 atcgactaag taatatccgc ctacctgttt ttctgtctgg cggtatgtca tatgtattat 1560 ataagttatg ttcacatata tatataaatt atatatgttc aaaaattggt caaattacct 1620 atacatggaa gtcctatcgt aaatctattt ttaatctata taaacaaatt aaaatatgtt 1680 gttttatgtt ttaatattat ttaatttaat attatttaca tttatgaccg aagcgcaaat 1740 acattaagtt gtatattcgt aatatatata aatatatatc tacacttacg cataagtatg 1800 tatgctttta gttatacaac cgtaatataa aactattata ttattgattc gataatgatc 1860 ttttgcattc aaatgtcgct ggaagtgcgc gcaatgaaat gttcagtgaa ctttcaaagg 1920 gagcttttga aaaagcgaat tcgcctatta tcaggcgaga caaacgaaat tgccattcac 1980 tcttcaacca accatcgatt tgtgcggtcg tcgattctcg attgcgatta cgattctaag 2040 aatttgcgtt tttcttagat ttaaaacaaa tcaaacttaa tattcgttat aataagatcc 2100 agtgcaactg aaattaaata attcttcggt tatttcagtt ctttgaagaa tatttaatat 2160 aacaatctat aatcgggttt gccgcgaaaa atcggcaatt attccccgcc agtgctagtg 2220 aacatttgtg tgtgtatgtg tgcgttgtga cctcacctcc cccttagact aacaacctgt 2280 gtgtgtgcgt gtgtaaacat tttgtgcaag agtgtgaggc gcacttgcta gcggtggacc 2340 aataatacgt gagtgttggt gttcgtgtaa gcgttgtgac tctggcattg ctttgcaata 2400 tattttctgg atgtttgcac tacacctgtt gtactggatt ttatctaact ggaaaaatcg 2460 caagtttatt cgaaaagctt agatagtctt tttaaagact tattgattta taatattttc 2520 ctttttcaat acatatttat atatatttaa atataataat atatatatat tcaaactaac 2580 aattaaatta tcaaataagt taatattata attattttct ttttcagtat ctatcttaag 2640 tttcgattca tatttaatta ttctttgtta tcttaacata actttaattt tcgaaatatt 2700 tgggaataat ttcttcagtt tttttaaaat gccgcgcgcg tctaaaaaat ccgctaatca 2760 aaggcgtcaa gatgccatta atcgtagtcg tgtctactac atagagaata ctgaaagcgt 2820 ttgccttcga aattctcaac gtttagctga cactcgtgag tgcaacggtg atattcgtga 2880 agtagagcgt aatactgacg ccctggcccg atctaaaaga agaagtgctc gatttcgaca 2940 ggatgaaagg gatcttgatg cagcagctcg cgcaacgcaa agacaggatg aggatgtgcg 3000 agctctggag cagacacaaa atacagcaga ccatgctgtt cggcgacaaa atctggagta 3060 ccgtacccta gagcgtgagc gcgatgcaga agcccatgct acaaggagac aggacttaga 3120 aattagactg caagagcagg aattgaacac agtggaacac actgtgcgtc gccaaaacgt 3180 ggagtaccgc gaatcagaac gccagcgaga tgcagcagcc cacgttacaa ggcgacagga 3240 tttagaaata agactgcaag agcaggaaat taacacagtg gaacacactg tgcgtcgaca 3300 aaatgaagaa tatcgtgcat ctgagcgtat acgtgatgct tctgctcatg tagcgcgaag 3360 actgaatgtt aatgttcgag agctagagca ggacgctaat acagttcaac ataacattcg 3420 tcggcaaaat gaagcatatc gcactagaga gcgtgaaaga gacgtcttaa atcggcgaaa 3480 tgctcgtcaa aatattattg caagacgcag agaacaagtc gtcaattcct ctcaaagacg 3540 agttactaga aaccaggatc gtataagaga gaacgaatta cgaaatatgg ctgaagctcg 3600 agttataaac tttagaaata gttttcaaaa tcgtgccata cagtcagctc agcaatctca 3660 gagaaataga gatgctagag aagaaatgtc aactgaacag cgcgagcaac agcgtgactt 3720 gcaaagacag cgtagagttt tttctgccag agataaatac ctggaaaata ttaagaaagg 3780 ccctacagaa atatgcatat catgcggagg aacttggttt cctttccaag taaagctttt 3840 gaataaaata aatctatcgg caaagtttcc aaacatggat atttcaaaag cattttatct 3900 gtccaataag ttcccaagtc tatctgaaga ctatttgttt tgctccactt gtcgccgaca 3960 catatcgtcc ggcaagattc cgagtctctg tctgtccgag gggttcaact ttcctgagat 4020 tccagactgt cttaaagact taacgtgtct tgaagagcgt ttaatttcgc caagaatccc 4080 atttatgcgg ataatttctc ttggctatga gcggcaatgt gcagttagag gagctgtggt 4140 caacgtacca atttctgttt cggaaactgt gacagctctt ccgcgcacct ttgaaaacag 4200 tcatgtagta caaattcatt tgaagcgtaa gctggaatat gcgcataatt tcatgactga 4260 gacaattagg cccgctcgtg tgctcgaagc gttgcggtat ttaataaaca gagaattata 4320 tcttaagcac aatatttcag ttgcttccaa ctggctaggt gtaaatgctc acaatgaaga 4380 agtccctttt gttgctgatc cagctgatat aacagctgtg gatagccttc ttcgctgccg 4440 aatggaaaca gattcagaag gacacgacac ttctgatctt aatccaggtg gacaggaaac 4500 tttgttggat aacgagtcaa tagaaaacaa tgctataatt ggtcgtattg caatggctcc 4560 aggtgagggt cgacgaccaa ttgatatgac gacagatgag gatgctgaag aattatcgtt 4620 cccaaccatt tcatgtggcg aaatttgtaa aagcaaaagt acttattgtc gaaaggccaa 4680 atcacagatt cggtattacg atcgtcgttg cgcgaaagta ccaaaaattc tgtatatgta 4740 taagtgttat gagctcacac gcataaaaag ttctatttct atctgccttc ggaaaaaatc 4800 aagaaacgcc aatatcaccg ttggtaactt gagagatgaa ggttttgtca gcaatcttgt 4860 gcagcatgat gacggatatc atgttcttaa gggcctgcga tctgctacag cacattggga 4920 ggctgagaag aaaaaagtgc tggcaatgat tcgtcagttt gggctaccaa cttttttcat 4980 aacgctttct gctgctgaaa ccaaatggag tgagctgttg gtcattctat ctctcatttt 5040 ggataataga gtccttagtg aggaagaagc agcgaatctt tcaagttccg aaaaagctcg 5100 actaattcgg tctgatcctg tgacttgctc aagatatttt gactataggt ttaggcagct 5160 gttgaaatta tttaaaacag atatttttgg agaatatcaa ctctctcact tttattggag 5220 aattgaattc caacataggg gatcgccaca ttcacatggc atgtattggc taagcagcgc 5280 tcctaaactg aatgagaata gttcccaaag tagccgtcag gttatagaat ttattgacag 5340 attcattaca acaaacggag ataatccaga tttggaagag gttattaagt accaacagca 5400 cagacatagc ttttcatgcc aaagagaaat aaataggcat aaaatttgtc gttttaacat 5460 gccataccct cccatgcctc agactgagat actatacccc ctagaagagg gagattcaaa 5520 tctgagaagg catcaagaca tgtggaagaa aatcaaagag ctactagata gttcagtttc 5580 ggacgctgaa gccttaagct taaataattt tgatagtttt cttgcacata taaatacgga 5640 ttatattgga tataaatttg cactgagatc gagtcttaaa aagccgcaaa tatttttgcg 5700 tagaaaatac tgtgacaaaa tgttgaatgc atataatgaa gatatattga agctatatag 5760 agccaacatg gatattcaat acatactaga tgcgtttgct tgttgtagtt atataataaa 5820 ctacataaac aaatcgaata gaggggtttc tcaattattg cgagaagcca tgaacgagat 5880 aagtagtaat aacatttcag ttaagcaaaa actacaacat atcggcaata agttcatcaa 5940 tggaacagaa atatctgcgc aggaggcttc atacaacatc cttggactcc atatgtcagg 6000 atgtagccat ggcgagattt ttgtaaacac ttcgcatcct gatcaaagag ttggtatggt 6060 gaaagcccag agagatttag aaacactacc ttcagattca aaagattatt tcttaccaaa 6120 cttgctagat cattatgttc aaagaccgga cgaattaaat aatttgtgcc tagctgactt 6180 tgctgccaaa tttaaatatt ctaagtcgct aaatgcccat tcaacaggag actacgagga 6240 ggaaaatgat gatgaagagc gggaggacaa tgccaatgac gggagtccca tggtactcaa 6300 aaatagatca ggaattttga gagaacgtag gcactctctt attcttcggt ggagaaggta 6360 tagcataaat atatcccgat ctgattattt tcgagaactt gtaatgctct ttttcccttg 6420 gcgaaatgag caagtggata ttcttagcac cgataatgag caaacgttca atgcacaatt 6480 tcaaatcatc gaatcaaata gaaaacgata tgaatatttg gatgaggctc attttcaaga 6540 agtcttagat gagatccaag aagtcaacaa tgacaataat gatggacaag aaatttcagc 6600 atctcagttg cttgatgaag aattcagggg atttgcagta ccagaagttg ctcgaaacat 6660 taacgtattt gaagataatg tgcatgaagg caactctgat gaaaatgcaa tccgaatgat 6720 taaattacct gcacaagttc cagaaaatga gcttaattta atggtgcgat cacttaacag 6780 taaacaacga caatattacg ctcactttat gcataatatg aatgttatgc aaatatttta 6840 tgattacgtt ggaggtgggg cgggtgttgg aaaaagtagg ctgatatcaa ctatatatca 6900 gtcactaact ttaagattta acaccagagt tggttgtgac tctgattctg caaagatatt 6960 gctgtgtgcc ccaactggca aagcagcctt tggaattggt ggactcactc tccattccgt 7020 gttctctttg cctgttaacc aatattcaat ggaattaaga ccccttagca acgacgcaat 7080 caacactatg cactcaaagc ttatagactt aaagcttatt attattgatg aaatttcaat 7140 ggtgggtgca aaaatgttgt cgtacttgga tcttcgtcta aagcaaattt ttagaaataa 7200 tacatacttt ggcggaattt caatattggc atttggtgat ctaaagcagc ttccaccagt 7260 tggcgataga tggatttttg ctccaaactc gagagaccca tatagcataa ttttgggagc 7320 accactttgg gatttatttc aatattttga attgaccgaa ataatgcggc aacgtgagga 7380 ttgcgccttt gccatcgcat taaataatat gtcagaagga aaaatgactg acacggacaa 7440 gctgcttatt caacagagaa ttgttttgcc ttccgaaata aataatattc ccaacgatgc 7500 tataagattg ttctactcaa atcagaaagc gagcgactac aataatatcc gacttagcca 7560 aattattatt gaagaatata cttcaaaatc taaagatatg ctgaagacca aatcaatatc 7620 tgaaaggaac aagctaaata ttttagaaaa tattaaggca ttcaagactt ccgaaaccca 7680 aggccttccc tatctgctta agctcaaaac cacagctaaa tatatgatta ccgtcaatat 7740 aaataccaat gatggtttgg tcaatggagc aagtggacag cttatgcata ttgactatac 7800 agccagcgta tctgatgtgt tagtttcgac gctctggctt aaatttacag aagcaactgt 7860 gggagcgcaa gcaagagcaa aaaaaacaaa tcctttgcac tctgattgga caccaattga 7920 aagagtttta agaacatttc aatataggaa aaatgaccaa gtaacaattg aacgcttgca 7980 atttccattg gttcctgctg aagccataac catccacaag agtcagggag caacctacaa 8040 aaaagttgca gtgcatttgc aagatggcat aagccgtgca gcattatacg tagcttgtag 8100 cacagcgact aatgcatctg ggcttttcct tataggaaag tttaatccgc ctagagaaat 8160 acaagctacc gaccccgtct atattgagtt ggaaaaactc cggggcgaga aaaaattgac 8220 cacccaccat cagatatatt ttgagagagt agaaggaaat attttgacat atttccataa 8280 tattgagggc ttgcataatc atagtttaga catttcaaat gattttataa taaaagcagc 8340 agatatttta tgtttcgtag agacttggag tagagaaggc caagtatata atatagacgg 8400 ctttgacata ctcggtcgaa tcgatggtag catgtgtaca ggtagcacta gaccaccagg 8460 tggcataatt gtttatgcaa agtcgtacat tgcacaaagt gctgaaatat tgtctgtggt 8520 aaatattaat ctggaacata ataaagtttg ccagatcctt atatttcaac acttagatac 8580 caaatatgta gttgtttata gaaacccgtc cttctccttt tccgaatttc gtaaattatt 8640 tttagaacac ttagagcagg agcttgatag gcacagtaaa atagttattt tgggagattt 8700 caattcatgc cgggctagag ttcgagacca gttagaaatt agattgaacg agcttggtat 8760 atcatctctt ttgcgtgggg tacctacaac aaaacagcta actgaaatag attgggtatt 8820 ttcgactaac ggactggaca ataaaagagc ccaggtgtat gaaacggcat atagctacca 8880 cgatggtatt ttcgttttca atagctagga aacttttgat aaatttatct tttgtgtgat 8940 ggatcaaaaa catatatata tatatatata tatatatata tataaatatt cttcggctgt 9000 tcgctctcga tcacatattt ttttttgaca tggacatgga gggaagccag accaaaaagc 9060 ttaaattcgc acttaataga aaagcttcgg tctgcaaggg tattaaatct tcggcgtgcc 9120 gaagatagcc cttctttctc g 9141 // ID YAMATO_BM repbase; DNA; INV; 2314 BP. XX AC AB042121; XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Bombyx mori Pao-like retrotransposon Yamato, putative polyprotein DE region. XX KW BEL; LTR Retrotransposon; Transposable Element; YAMATO_BM; KW pao-like retrotransposon; putative pol domain; KW reverse transcriptase. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Abe H., Ohbayashi F., Sugasaki T., Kanehara M., Terada T., RA Shimada T., Kawai S., Mita K., Kanamori Y. et al.; RT "Two novel Pao-like retrotransposons (Kamikaze and Yamato) from RT the silkworm species Bombyx mori and B. mandarina: common RT structural features of Pao-like elements."; RL Mol. Genet. Genomics 265(2), 375-385 (2001). XX DR Genbank; AB042121; Positions 1 2314. XX SQ Sequence 2314 BP; 612 A; 702 C; 670 G; 330 T; 0 other; ggaagtcctc gcgccgtgcc atcacctcac cgacctccgc catcacttca aaacgagcta 60 cgcctcgccg aagatactga tcggccaaga caactggcac ctgctggtga cggaggagat 120 gcggacggga cgacgcgatc aaccagtggc gtctcgtact ccgctcgggt gggttgtgca 180 cggagcacgc ccgggaggca agagacagcg cgtcaacttc gtaggacacg cgacgacggc 240 cgacgcgaac atggacgaag ccctcaagca ctacgtcgcg atcgagggac tcaacgtcgc 300 cgctaagacg ccgaagaacg acccggacga gcgtgcgctg aagatcctgc gagagaccac 360 tcaacagcaa cccgacggcc gctacgagac cgcactgctg tggcgtgagg agagcctgaa 420 gatgcccaac aatcttgaag ccgcaatgaa tcgcctgacg tccgtagaga agaaactgga 480 gaaggatcca aacttgaaag agcgctacaa gcaacgaaca aacgcactgg tagcgaaggg 540 atacgccgaa gtaactccgt cgacgggtac gaaggatcga acccggtact taccgcactt 600 cgacgtgacg catcccaaga agcaggagaa gattcgcgtc gtgcacgacg ccgatgcgaa 660 aaatagaggg aagaagccca aacgacatcc tgctcaccga cccggacctg ctgcagtcgc 720 tacccggagc gatgatgcgc ttcaggcagc acgccgtcgc cgtctccacg gacatcgcgg 780 agatgtttgt acagataggc gtccgaagcg aggaccgcga cgcgctccgg tacttgtgga 840 gggaggaccc ttcacgagag cctacagaat accggatgag gtccatcatc ttcggcgcga 900 caagctcacc tgcaaccgcc atctgcgtga agaaccgact gcagtacccg gacgccgccg 960 acctcaagct ccacacgtaa gtggacacca acaagtacgc ccacgacgcc gcgttgtact 1020 ggcgcgtgga gacgcccgac ggggagatga ggacctccct agccaaagcc cgagtcgcac 1080 ccatgaagcc aacatcgata ctgcgtctcg agctacaagc cgacgtgatg ggaagccgca 1140 tagccgcggc cgtcacggaa aaacactata ggaagccaga cacgagtgtt ctggaccgac 1200 agcgccacca tagaggaaaa cagcactgtg gaggaaaggc gctaggtccc cacaagagag 1260 aatgtcgccg acgacgctac gagagaaata cccgtcggct ttgatggaaa ccaccgctgg 1320 ttcataggcc cggaatacct gagacggcat ccggatgatc ggcccgtaca acggactgca 1380 agaagggcag aagagacggg tgaagagaag tgcgcaatgc tcgccgtgag cagagagagc 1440 ctcggcgaag cgatcccgga cccaagacgc ttctccaagt gggagaagta cctgcgtgca 1500 acggggcgaa tactacaatt cgtcgaccta tgccgcagaa gtcgcgagcg cactcattac 1560 aagaggacca ggcggaaccc gcgttcggac ccgacgtggg agaagaatag aaagaagacg 1620 acttcgaaga ccccgctgaa cacaccgcgg acgccagagc aagcagtcat tcaatggaag 1680 actttagacg ccgacacgct tcgacgcgcc gagacgctca tttggcgaca gagccagcgg 1740 gcggccttcg aggaagagat tttgacgctc cagcgaggga aacaaatatc cccagcgagc 1800 cgactgcgaa acttgtccgt aacttacgcg gacgggatac tgaaaataaa cggacgcatc 1860 ggcaacatag agggagctga cgtcatcacc tcgcctctcg tgctagacgg atcccgacgc 1920 gaaacgagac tgatgataga cttcatccat agaaagatgc accacgccgg aaccgaagcc 1980 acgatcgccg agtgccgaca gtcgtactgg gtcttacgcc tacgcccagt gacgcggagc 2040 gtaatccaca actgtcttcc gtgtcgtatc cgcaaggaca ctccgccgcg tccagccaca 2100 ggcgatcacc caccgagcag actggcacac catcggcgac catttacata cgtgggcctc 2160 gattacttcg ggccttatca agtgaccacc ggcagaagca cccagaagca ctacgtggcc 2220 atcttcacat gtctcactac gcgtgcggtg catttagaac cggcagcgag cctcagcacg 2280 gactcagcgg tgatggcacg gcgcgaggac ttcc 2314 // ID hAT-1_SM repbase; DNA; INV; 2690 BP. XX AC . XX DT 23-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 2) XX DE Consensus sequence of a hAT-type DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2690 RA Jurka J.; RT "hAT-1_SM: DNA transposon from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 842-842 (2007). XX DR [1] (Consensus) XX CC The youngest copies are ~98% identical to consensus sequences. XX FH Key Location/Qualifiers FT CDS 500..2293 FT /product="hAT-1_SM_1p" FT /translation="MEKKRKIDSECRKFKDQWNIQYFVIESSNKALCLICN FT ESIAVLKEYNIKRHYETKHFQNYSKYTGSLRTEKFEAMKRGLKSQQSSFTK FT LKTEQEAATRASFRVALEIAKRGKPFTDGEMIKECIIAVAEEMCPEKVNLL FT KTVSMSANTVARRVENIAENISSQLFDKNGHVECFSLALDESTDVSDTAQV FT LIYIRGVDKSYEVHEELLDMYSIHGTTTGTDIFKGVEMAINQKNLRWKNLK FT CITTDGGKNMSGKDKGVVALVSKAVENDGGSKPLVLHCIIHQQSLCGKCLD FT MSEVLKPVISTVNFIRSFGLNHRQFRQFIEEIGENDLPYHTAVRWLSCGKV FT LQRFFELRAVIEIFLNEKHRPLTELQNNAWLWKLAFYVDLTKHVNELNLRL FT QGENQHLPDLYTNIKSFRKKLILFQSQLRSKCFSHFKTCEIFSHTTETEFP FT IDFAIETLSALKINFDTRFSDFDAIANQIKIFQNPFDTDIETLAPELQMEM FT IDLQCSDIIKNKYQNSSLLEFYKGLPLTQFNNLHKFARGLFSVFGSTYLCE FT KTFSKMKYTKNVYRSKLTDEHLKSLLIIGTSKISPQLQTIVSGKSQLHKSH FT " XX SQ Sequence 2690 BP; 937 A; 430 C; 477 G; 846 T; 0 other; caggggtctc caaactttcc agccttaggg ccatactgat tactccagta agttccgagg 60 gccaaaacca tattatcatt gttgccggtg gtaaaaagtg aaatagtcca ctgatataaa 120 aaaaagtctg aaaaaatacg caacgtttaa aaaaggctgc aaatcggact aaagtccgca 180 aaacggcaac actgcatatt attaattttc ctgtcattat ctgtaatcgg taaatattaa 240 ttaacattaa aatcagttat ttatcattaa aattaattaa ttatcattaa aattaggaat 300 tatcattaaa attaattaat ttgcacagta aaatccaatt aaaattaatg aaaatccaat 360 taatcgaagc aaacatggtg tcaggacaga tagaaaaaaa caaaaaataa tttaaacgtg 420 tttgtgcgtg tttaagttta agacatcttt gtaacagtat tttgatttga attgttgctt 480 ttgtgtgtga tttagtagta tggaaaagaa gcgaaaaatt gatagtgaat gcagaaaatt 540 caaagatcag tggaacattc agtatttcgt aattgagtcc agtaacaagg cgctatgttt 600 gatttgcaat gaaagcatag ccgttctcaa ggaatataac attaaacgtc attatgaaac 660 aaaacatttt caaaattact caaaatatac aggaagtttg cggacagaaa aattcgaagc 720 tatgaagcgc ggattgaaat cgcagcaatc ttcatttaca aaactcaaaa ctgaacaaga 780 ggctgcaact cgtgccagct ttcgtgtggc tcttgaaatt gcaaaacgtg gaaaaccatt 840 caccgatgga gaaatgatca aagaatgcat aattgcagta gccgaagaaa tgtgccctga 900 aaaggtaaat ttattaaaaa ctgtcagtat gtcagcaaac actgtggctc gaagggtaga 960 aaacatcgct gaaaatatat cctctcaact gttcgacaaa aatggacatg ttgagtgttt 1020 ttctttggcc ttggatgagt caacggatgt gtcagatact gctcaggtgt tgatttacat 1080 tcgaggagta gataaaagct atgaagtgca tgaagaactt cttgatatgt atagtattca 1140 tggcacaact actggtacag atatttttaa aggagttgaa atggccatta atcaaaagaa 1200 ccttcgatgg aaaaacttga aatgtattac aactgatgga ggcaaaaaca tgagtgggaa 1260 agataaagga gtggtcgctc ttgtgtcaaa ggctgtagaa aatgacggtg gttcaaaacc 1320 attagtctta cattgtatca ttcatcaaca gtctttgtgc ggaaaatgtt tggatatgtc 1380 tgaagttctg aaaccagtca tatcaactgt taattttatc agatcttttg ggctgaatca 1440 ccgacaattc cgacaattta ttgaagagat tggagaaaat gacttacctt atcatactgc 1500 cgtacgttgg cttagttgtg ggaaagtcct tcagcgcttt tttgaacttc gagcagtgat 1560 cgaaattttt ttgaatgaaa agcatcgccc tcttactgaa ttacaaaaca acgcatggct 1620 gtggaagtta gcattctatg ttgatttgac aaaacatgtg aacgaactga atttgagatt 1680 gcaaggagaa aaccagcatc ttcctgattt atacactaat atcaaatcat tccggaagaa 1740 attgatactg tttcaatcac aactacgaag taaatgtttt tcacatttta aaacatgtga 1800 aatattcagc cacaccactg agactgagtt tcctatcgat tttgcaatcg aaactttgag 1860 tgctttgaaa ataaattttg atactcgttt ctcggacttt gatgccattg cgaatcaaat 1920 taagattttt cagaatcctt ttgatactga cattgaaacc ctagccccgg aacttcaaat 1980 ggaaatgatt gatctacagt gcagtgatat aattaagaac aaatatcaaa actcatcctt 2040 gttggaattt tataagggtc ttccactgac acaatttaat aatttgcata aatttgctcg 2100 tgggctgttt tctgtttttg gctctactta tttgtgtgag aagaccttct ccaaaatgaa 2160 gtacacaaaa aatgtttaca gatctaaatt aactgatgag catcttaaat ctcttttaat 2220 aattggtaca agtaaaatta gtccacaact acaaactatt gtcagtggaa aatctcaatt 2280 acataaatct cattagtatt tcatatgtca ttatgcttgt tataagttat gtttttattt 2340 aaaatgtata aaatgttagt tggattcatt tcaataattt gttagttatt atatactgat 2400 gttaaatttt ctgcatatta tacatgtctt tactcattac aattctgtgt ttctttatcc 2460 aacttatcac aaaaacaata acaataagcc tacaacaatg ttatgagtag ttagcccatg 2520 accatatcaa tgtagatgtc atattaataa ttattcaaca taaaattaga atacttcagt 2580 aaactaaatt taatttgcta taaaaaatat gttctctaaa aggagttttc aagtatgcta 2640 gctctccgcg ggccggatgt ggccctcggg ccgtagtttg gagacccctg 2690 // ID Gypsy-609_AA-I repbase; DNA; INV; 7094 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-609_AA_; KW Gypsy-609_AA-LTR; Ty3_gypsy_Ele180; Gypsy-609_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7094 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5083-5565] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 443..2665 FT /product="Gypsy-609_AA-I_2p" FT /translation="MDWYRIYADDLDEEELDYELAIRACPLVGGQETRRRE FT LRNLLRDPDSSKALVVSDRMMQANLEIVPYKLQEIERIMIEEPNRSMLSRL FT VHYHQRIRRCVPRNPVEFDNRRLLLETVADLARRYFHLDFRAMETPGPQDD FT ILVGGTANSPRVTIQATSGPRPGEDGQVNPVRSNTSEASNVEGAVGGDFGP FT VTANRQWAANNPGQESDAFHLWRTTASQQQSTGAIPRAARPSVPEFVQSPP FT QLVSSLNFNFQNPQQGGIPTFSRAPEVLDNQQNFTRPPQTVQSRPVGTQYA FT QRSDEVRQSPRITAENVNMNEYVHCSQIEAYVKRCVEQMTQQGTRYSMGQD FT NLVNNLADEVAQVRFADYEPVHTQRNRANQPQQSILREPSPPLQLSGPRPD FT ARSTPRQNLANTYRVFPAEGNRFDPIQFGNIPNHRSTIDSPGLRRYQADVD FT SYRGNSYSRRQPHQQCAIIEKWPKFTGDTNSVPVTDFLKQIDILCRSYDIN FT KQELRMHAHLLFKDKAYVWYTTYEEKFNSWESLEVYLKMRYDNPNRDRLIR FT EEMRNRKQRPNELFSAFLADMEMLAQRMIRKMSEAEKFEMIVENMKLSYKR FT RLALEPIQSIDHLAQMCYKFDALESNLYQVYSQSKSVHHVALEDEGNEDYL FT DTTEADVCALRSKMLQNRNNIPGKINSETKSSNDQKIAEMTCWNCNATGHL FT WRDCDKRKRIFCHICGMMDTTAFRCPNHHNLRSEDEESKNE" XX SQ Sequence 7094 BP; 2096 A; 1427 C; 1564 G; 1978 T; 29 other; ttttggcgcc caacgtgggg cctgagagtt gattaggatt tgccaatttt gatatttcgt 60 atttgggtcg aatcagttag tttgtaaatt ggattgaaga cccttagtta ggataaattt 120 ttcgaattgg ttgattgaat aatcagtctg attggaattt aggaacaaat ttgatttagt 180 ttattagttt tggttggttt gtttagagat aattcttgaa ttcggatttt gtatataata 240 ttgattagaa tttataggtt agttgtatat actttgaatt agattagtct tacttcattt 300 ttttttgaac tatattactt ttgaattgaa ttacttattt ttctgctgtc atttttgcgg 360 ttgtcaaaat attggtgaag ggttgattgg aagtgcctta ttaaattgaa tttgatctga 420 taagtgataa ttgtgttttg aaatggattg gtaccgaatt tatgccgacg acctcgatga 480 ggaagagctg gattacgagc tggctattcg agcgtgtcct ctcgtaggcg gccaggagac 540 tcgtcgaagg gaactgagga accttcttcg tgatccagat tcgtcgaaag ccttggtggt 600 ttctgaccgc atgatgcagg cgaacctcga gattgttccg tacaagctgc aagaaattga 660 aaggattatg attgaagaac ctaacaggag tatgctttcg agattggttc attaccatca 720 gagaatccgt cgttgcgtgc cacgtaatcc ggttgaattc gacaatcgta ggctcttgtt 780 ggagacggtt gctgatttag cccggaggta ctttcatttg gatttccgcg cgatggaaac 840 tccaggaccg caggatgata ttctggttgg agggacagca aactcgccac gcgttacaat 900 acaagcgacc agtggccctc gtccaggtga agacggacag gtgaatccgg ttcggtcgaa 960 cacctccgaa gcgtcgaatg tagaaggagc agttggcggt gatttcggac cagttacagc 1020 taatcgtcag tgggctgcga ataatcccgg acaagaaagt gatgcattcc atctttggcg 1080 tacgactgcg tcccagcagc aatctacagg ggcaattcct cgtgcggcaa ggccatccgt 1140 tccggagttc gtgcagtcac caccgcagtt agtttcatct ctcaacttca acttccagaa 1200 tccgcaacag ggtggtattc cgactttcag tcgagctcca gaggtactgg acaatcagca 1260 gaatttcaca cgtccaccgc agacagttca atcgcgtccg gtcggcactc agtatgctca 1320 gaggagcgac gaagtacgtc agtcgccaag aataacagct gaaaatgtaa atatgaatga 1380 atacgtgcat tgttcacaaa ttgaggctta tgtgaagaga tgtgttgaac agatgacgca 1440 acaaggtacc agatactcga tgggacagga caatttggtc aacaatttgg ctgatgaagt 1500 cgcgcaagtg cgtttcgcag attatgaacc agtacacacc cagcgaaaca gggcaaacca 1560 accacagcag tccattttgc gtgaacctag tccgcctctt caactcagtg gtcctcgtcc 1620 agatgctaga agcacaccga gacaaaatct cgcgaataca taccgagtat tcccagcaga 1680 aggaaatcgc tttgatccaa tacagtttgg aaatatccct aatcacagat cgactataga 1740 ttcaccgggt ttacggagat accaagctga cgttgatagt tatcgtggta attcgtattc 1800 tcgacgacaa cctcatcaac agtgtgcgat aatcgagaag tggcctaaat ttaccggcga 1860 tacgaattct gtaccggtga ccgacttctt gaagcaaata gatattctct gtaggtcgta 1920 tgacattaac aaacaggaac ttcgtatgca tgcccatctc ttgtttaagg acaaggccta 1980 cgtttggtac actacgtatg aagagaaatt caactcgtgg gaatcactgg aagtttacct 2040 caaaatgaga tacgacaatc caaatcggga cagactgata cgcgaagaaa tgcgtaatcg 2100 taagcagcgg ccgaacgagc ttttcagtgc tttcctggcc gatatggaga tgttggctca 2160 gcgtatgatt cgaaaaatgt cagaagctga gaaattcgag atgatcgttg aaaatatgaa 2220 actctcgtac aagcggcgtt tagcattgga gccaattcag tccatagacc atttggctca 2280 gatgtgctac aagtttgacg cgcttgaaag caatttgtat caggtctaca gtcaatccaa 2340 gtctgtacac catgtcgcct tagaagacga aggaaatgag gattatttgg atacgacaga 2400 agccgatgta tgtgcgctgc gatcaaagat gctgcagaac cgaaacaaca ttcctgggaa 2460 aatcaattcg gagacgaagt cttcaaatga tcagaagatt gcagaaatga cttgttggaa 2520 ctgtaatgct acaggtcatc tgtggcgaga ctgtgacaaa aggaaaagaa ttttctgtca 2580 tatttgcgga atgatggaca ccacggcgtt taggtgtcca aaccatcata atcttcggtc 2640 cgaggacgaa gagtcaaaaa acgagtagtt gaggacgatt cagggaatca ttgtcctcct 2700 acgagtgatt taaggattcc caacaccaac ttttcgtatt ttcgacgagt ctttcagata 2760 aacacttccg tttctagatg tccacattta aaagttcgaa tcctttccga agaacttgaa 2820 ggtttggcag ataccggagc cagtatttca ataattagcg cactagactt gatagaaaag 2880 ctaggactgg aaattcatcc gattccactg caagttaaaa ctgctgatgg aaccggttat 2940 cactgccaag gatacgccaa cgtacctttt tcaataggga atgtaactca tgtacttccc 3000 acgatcattg ttccagagat taccaaaaag ctaatattgg gtacggactt tcttgataag 3060 tttggctttc gcttgacagc ttctggaaat gattcaactc tctgccgtct cccagatgaa 3120 aacgagccaa aagttaatca aattgatttg tgcttcgtgg aagaatactt tggagatacc 3180 gctgagacaa tatgctttga actacaaccg tgtgaactga gcgagaagcc cagtattgag 3240 ttagacgaaa gtttagaaat gcccacagtg gagattcctg aatctcatat tacaaaagca 3300 aatgatttac aaactgaaca caatttgagc gatccagaac gtcaactctt gttcgaagcg 3360 gttcagactc ttccagcaac caaagaaggt caattgggaa gaacctctct gttaaaacac 3420 aaaattgagt tagtccccgg agccaaacca aagaaatttc catcctatcg ttggtctcca 3480 gcagttgaag gcgtaattga tgctgaagtt gacagaatga aaaagttagg cgtcattgaa 3540 gaatgtcctc aagccgtaga tttcctgaac cctctgttgc ctattaaaaa ggccaatgga 3600 aaatggcgta tttgccttga ttcgcgaagg ttgaaccaat tcaccaaaag agatgagttc 3660 ccttttccga atatgatggc catactacag agaattccga aatccaaata tttcactgta 3720 atcgatctta gtgagtccta ttatcaggtt tcactagacg aatcwtcaaa gaacatgaca 3780 gcatttcgga ctgcgaagaa tctataccgg tttaacgtaa tgccgtttgg cttatccaat 3840 gctccggcga ccatggctcg ccttatgacg cgagtattgg gccacgattt ggaaccaaaa 3900 gtttatgtct acctcgatga cataatcatc gtttctaata gttttgagga acaccttgaa 3960 ctgattagag aagtagctcg gagacttcgt aatgcgggtc tcacgataaa tttacaaaaa 4020 agtaaatttt gtcagacgaa gatccgatwt ttggggtacg tgctttcaga agatggctta 4080 tccatggaca taagtaagat tcagccwgtt ctggaatatc ccgtgccaaa gacgatwaag 4140 gacatmaggc gactcctcgg attggctgga ttttatcaaa aattcatacc caattactca 4200 gaagtcacca stcccatctc cgatctcttg aaaaagaatc gaaaaaagtt cacctggact 4260 gaagaggctg atgcagcttt gmgaaawctg aaaactcttc tcgtaacagc accagttctc 4320 tcgaacccgg atttcacgaa aacgtttgtt atcgagaccg atagttcgga tcttgcgata 4380 ggagcggtac tcactcagaa cgttgatggt gagaggcggc cggtcgctta cttttctaag 4440 aagctttcaa gtacwcagcg tcggtatagt gcgacggaaa gagagtgcct cgctgtcctc 4500 ttaagcatag aaaacttcaa gcatttcgtt gaaggttcgc agtttgtcgt acagacggat 4560 gcgatgagcc tcactttcct acgcaacatg tccattgaga gcaaatcgcc cagaattgca 4620 cgctgggcat tgaaactgtc gaaatacgat ctgttattgc agtatcgcaa agggagcgaa 4680 aatattccgg cagatgctct gtcaagagcg gttaataccg tagaaataac cttagctgat 4740 ccttatataa cccagctgaa gcaaatgatc gagaagtttc ccgaaaagta tcgagacttc 4800 actatcaaag acggaaaact gtacaaattt gtcactaata cttcatttat agaagacaac 4860 mgtttcagat ggaaatacgt ggtkccgttc aatgaaagac ccgaaatwat acstagggtt 4920 catgaagaag cccaccttgg tccagtgaaa acgttagcaa aaattcggtt ctattggcca 4980 cgwatggcat ctgaagtaaa aaggttctgt ttccgatgtg aagtctgtcg tgaatcgaaa 5040 gtccctaact tgaacgtgac tcctcgatgc ggaaaaccaa aactgtgttc gcgtccgtgg 5100 gagctaatct cattagactt tctcggacca tatccacgca gcaggagagg aaacgtttgg 5160 atattggtcg tcagcgactt cttttcaaaa tttgtaatgg tacaatgtat gcgaacagca 5220 actgcgcaat cagtgtgtgc cttcgtcgaa aacatggttt tcaacctatt tggggcacct 5280 gccatatgta tcaccgataa tgcacaggtg tttaaaggcg aactgttcac aaaactattg 5340 caaaaatatt cggttactca ctggaatctt tccgtgtacc awccagcacc aaatccgacc 5400 gaacgtgtta atcgggtgat cgtcaccgcg attcgttgtt cgcttaacag caaaaaggat 5460 caccgsgact gggatgaatc cgtgcaccag atcgcgaaag cgattaggac gaacgtacac 5520 gacagtacgg gatacacgcc gtacttcgtc aactttggtc gcaatatggt gagcagcgga 5580 gatgaatatg agctcctaag acgcatccag gacaaccgaa acwctgagga cctaaaacga 5640 gaaacaagca atttatttaa tattgtacga gaaaawctca tgaaagccta taagaggtac 5700 agtacgccgt ataatttacg agctaatgcc aagcatcact tctcagttgg agatgttgtg 5760 tataaaaagg aaatgcacct ttccaataag cagaaaaact tcgtaggcaa atttggaaac 5820 aagtttagca aggttcgagt gcgagaagta ctcggaacca acacttatgt ggtagaggat 5880 ctaactggaa acaagatccc tggcagctac cacggatctt ttctgaaacg agcataagaa 5940 aaaccagcta tgacggtgca tccgccgatg cacaaaaaca agccacaatg aacaaaaata 6000 atcagagagg tgtaaggcaa taatccaacg tcgagatgtc ctatagtgtt tcctccgttg 6060 ttgataagtt aggtaccatt agataaaatc attcaattag atagaatatc ctttagtttt 6120 cgtcttgagt cacttttgtc gcaaattgkt tggatgagga atcatcaggt tgaccacttt 6180 ccttttcaca cccaaacttt ctccgaatat aaacaacctc taatagcact atttcacttg 6240 atacatacct cattcaatag tccagtcgct tgcctccttc aatcgatcca tagatttccc 6300 tttgttttta ccaaatttca cmgmtttctc cataaatttc atgattttgt tcgattcact 6360 atccaaattt gttttgatta cgtttcactt tccctttcgt ttgacaatag cccggccaat 6420 catcaccccc acccgattta ttttgacaga tccgtttaac agcattctca taacgtcaaa 6480 taactcacag catagattaa catatcattt ggtccaaakt atgtacgtgg caaactagtg 6540 tacagtttag ggtttttttc agattggaat ctgagtgaaa gcagagttag tttaatgaat 6600 gctcagatgt ttctgaggta attcggttgw atgtttaact ttgatgatta ggattctcag 6660 acgtttctka ggtagattta gtaagacata tagtaaggat atggttctca gatctttctg 6720 aggtaaattg gatktagtag ataaagtttg ttctcagata tttctgaggt gttacattgg 6780 agttagtcta gatagtttgg tagatatgat atgaatagga taaaagccac tacatatggw 6840 tcagagattc cgaaatagtt ttcatctaat ttgatacagt ttcattctga ttcttcattc 6900 aatcaacgtt tgttatgtcg tgttagtgwt cttggtgaat attagamtag ttttgtttag 6960 tataaaaatt tcgaaatctt caatttcaaa atttttattt ggttaggatg ggcgagaatg 7020 taagatcatt gtgaacccat catgaagcgt ttgtttttat ccgtgtctgt tttgaatktt 7080 gctgtcagtc aata 7094 // ID Gypsy-22_AA-I repbase; DNA; INV; 5063 BP. XX AC AAGE02020536; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_AA_; KW Gypsy-22_AA-LTR; Gypsy-22_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5063 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020536; Positions 5789 10851. XX CC Positions [4093-4563] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1242..2834 FT /product="Gypsy-22_AA-I_2p" FT /translation="MIWIPRFLLSYSRWKKAGRSRCSDANKSKSGKLAKEW FT QEWKSSLEYYFEACQVHDQKMMRSKMLHLGGPQLQKVFSTLEGTEDFPLVL FT LEKRWYDVAVACLDAYFKPRKQDVLERHRLRSMKQGSNERFSHYVLRLRQQ FT LKDCGLEKFSPEVKSVLEEIMLIDVIVEGCNSHELRRKILEKDQSLRDIET FT LGELIESVRVQEQELKIEKKVIESSYGEVCKVQNTMKSRKERREERIVNRF FT PTAASTWNKDARICFACGRQGHMSKSPLCPAKDQVCRRCKKPGHFENVCRK FT HPSDSRFPIPPKKVQVVEEVCDGASETPQPEPLATLQSQSQPMPYRTAAER FT KMYYTFYAGGETNVFQCTIGGVKSEIFIDSGSDVNLITSDAWENLKAQQVV FT VSKCERGSNKILKAYASCMPLTILGSFHAVIEIGKRSIEAEFMVVQNGQRN FT LLGDTTSKQLGILKIGLETNHVSSTHRGLYCFKRLPFGLVSAPEIFQRKMD FT EMLADCEGTYWYLDDVGVEGSSVEEHDLRLKKVYI" FT CDS 3028..5016 FT /product="Gypsy-22_AA-I_1p" FT /translation="MPSDSKRDAILSFRQPLNESEVRSFLGLANYMGKFIP FT DLAALDEPLRRLTQKGVKFQWSDREESAFRAVKQSLANAQRLGFYNQNDTT FT SVIADTSPHALGGVLIQTNTSGDSRVISYASKSLTDTERRYCQTEKEALAL FT VWSVEKFQVYLIGREFNLVTDCKALVFLFTPVSRPCARIERWVLRLQSFQY FT KIVHISGHTNIADVLSRLSTFVPTPFDETEELVVHDVASAAAQMVALKWKE FT IEDQSCDDPEILEIADALKSGVFDNLPLPYKVISSELCLVNDVLLRGDKIV FT IPNNLRGRVLQLAHEGHPGIRIMKANLRGNVWWPKMDLDVERFVKACRGCS FT LVSAPNAPEPMIRKELPSSVWEQIAVDFLGPLPDGEHLLVCVDYYSRYIEV FT VEMLEITTPCTIKELLKIFSRYGIPEVLRADNGPQFSSEEFRNFCDEFGIH FT LESTIPYWPQMNGEVERQNRSILKRLRIAQELGQDWRHQLSKYLMMYHSSS FT HTTTGRSPAELMFGRKMRTKLPYVPATTVENEGVRDRDRLEKEKGREYADN FT KRRAKSSSIAVGDCVLAKRMKKDNKLSSEFSPEEFSVVQRKGTEVVIRSTK FT SGKEYRRSVTHLKAIPGACNDKNQDEEVVEESGEKLAGDASSEEVDRASKR FT VRKESKLLKDYITY" XX SQ Sequence 5063 BP; 1567 A; 859 C; 1307 G; 1330 T; 0 other; tttggcgacg agcgaaaagg aattaataag gtatttatct ttaaaatatg tgatttaact 60 gattattatc catttcgttg aaaacgggag gagatcagtt tatcgtcttt ggaattcaaa 120 gatggcggaa ttaccgcgtg tagtgattgc gttacagctt tgaaccagtt tgactagagc 180 cggccatttt gaagcatgtt tgaaattatt gaaagtggaa agataaattc aatttggaaa 240 atgaatggat gggtgtggaa ttgaaacttt tttttactgt taattaaatg gcagtagata 300 tcgaagtgag tttagagaga gttttaattt tataagaaat ttggttgtat ggccttcatt 360 gatggaaaaa aactgaccgt ggtaggtgac ccgacaaata ttacgaatac aagaaatgat 420 gtggttgtgt gcccagtgtg attcgagaaa caatggaatc gagaggaata atcgcggcag 480 ggggccggat tcgaattgaa tgtgctaaga aaataggagt ttgccgtggt agggtacccg 540 gtaaatgtaa ctagtgccag ggggcctagg agagaaccct tacggggtgc ctagttagag 600 cggttaagct tgacagcaaa gctaagggtg agcactgtgg tagggtgccc ggttttgtac 660 ctagggtcag aggacctcct ttagggaact gtggtagggc ccagttgctg cgggtagcct 720 ttatgaagtt tatgggaaaa tctatgcagg atgcacgatt gaagtttcta gagctagggg 780 gcctagaaag gtgacgatta gagctgtaat tggagctttg attgataatg cttatgacaa 840 aggtataaag acaaaaaaaa aaatgtcaat cggagcaggg tgcacgattg aagtttctag 900 agctaggggg cctagaaagg tgaagattag agcttcaata gaagatggta atgacaaagg 960 tataaagata aaatttgcgt atcggagcag gggcacgatt gaagttccta gagctaggga 1020 gcctagaaag gtgagattca gagccatatt tggagtttcg attgaaaata cttatgacaa 1080 attgaagagg ggttagaaaa gtgagctatg attagagctt tcatttgaaa atgccaatga 1140 tgaaagtaga aagatggaat ttgcccagtg cttaattggg agtcattttt aatgaaattt 1200 gaagttctag tatatgatcc ttgacggtaa ttggtgtgaa aatgatttgg attccgcgtt 1260 ttcttttatc ttattctaga tggaagaaag ccggccgatc ccgatgttca gatgcgaaca 1320 aatcgaaaag tggcaaactt gccaaagagt ggcaagagtg gaagagctca ctagagtatt 1380 attttgaggc atgtcaagtc catgatcaga aaatgatgcg ttcgaagatg ctacatttgg 1440 gcgggccgca gctacagaag gttttcagta cgcttgaagg caccgaagat tttccgctag 1500 tgttgctaga gaagcgttgg tacgatgttg ccgtggcttg cttggacgcg tatttcaaac 1560 cgcgtaaaca agatgttctt gaaaggcacc ggctccgaag catgaaacaa ggatctaacg 1620 aacgtttttc tcattatgtc ttacgtttgc gtcagcagtt gaaggactgt ggactggaaa 1680 agttttcacc agaagttaag agcgtattgg aggaaataat gttgattgat gtaatcgttg 1740 aaggttgtaa ctctcatgaa cttcggcgga aaattcttga gaaagatcaa tcgttacgtg 1800 atattgaaac acttggagaa ttgattgaga gcgttcgagt ccaagaacag gaactgaaaa 1860 ttgagaaaaa agtcattgaa tcatcatatg gggaagtttg caaagtgcaa aatacaatga 1920 aatcacgaaa ggagcgcagg gaagagcgaa ttgttaatcg attcccaaca gcggcgtcga 1980 cttggaataa agatgcaagg atatgtttcg cctgcggtcg acaaggtcac atgtccaaat 2040 caccgttgtg tcccgcgaaa gatcaagttt gtcgacggtg caagaagccc ggacattttg 2100 aaaatgtctg tcgtaaacat ccgtccgatt cacgatttcc aatccctcct aagaaagtgc 2160 aagtcgttga agaggtatgc gatggagctt cagaaactcc gcagccggaa ccgctggcta 2220 cgctacagtc acaatctcaa ccgatgcctt accgtactgc agcagaaagg aaaatgtatt 2280 acaccttcta cgccggagga gagacgaatg tattccagtg tactatcgga ggagtgaaga 2340 gcgagatatt catcgactct ggatcagatg tcaatttgat cacatcggat gcctgggaga 2400 atcttaaggc acagcaggtt gttgtatcga aatgtgaaag gggcagcaat aagatattga 2460 aagcgtacgc atcatgcatg cctctgacca ttctgggaag ttttcatgcc gtaatcgaaa 2520 tcggaaagcg ctctatcgaa gcagagttta tggtagtcca aaacggacag cggaacttac 2580 taggcgacac gacttcgaag caactgggaa tcctgaagat tggcttggag acgaaccatg 2640 tttcgtccac gcatcgtgga ttgtactgtt tcaaacggct accttttggt ctcgtgtcgg 2700 cgccagaaat atttcagcgg aagatggacg agatgctggc cgactgcgag ggcacttatt 2760 ggtatctaga cgatgtcggg gtagaaggca gctctgtgga agagcatgat ttgagactca 2820 aaaaggtata tatctgaaaa gctggtgaaa taaatcatga agtattgaac taatgactgt 2880 tttttttact gtgtacattt ttttataaca tttctatagg ttcttgaaag atttgaggaa 2940 aaaggcgttg ttcttaattg ggaaaagtgc aaaatacggg tgacggaatt tgaattcctt 3000 ggatataaca taaacccaga aggtatcatg ccttccgatt caaaacgtga tgccatcctg 3060 tcgtttcgtc agccactgaa tgaaagtgag gttagaagtt tccttgggct tgctaattat 3120 atgggaaaat tcatcccaga tttggcagct ttagatgaac ctcttaggcg acttactcag 3180 aaaggcgtaa agtttcaatg gagtgataga gaagaatctg cgtttcgtgc ggtgaaacaa 3240 agtttggcga acgcgcaaag gctaggattt tataatcaaa acgatacaac atctgtcatt 3300 gccgacacta gtccacatgc tctaggtggc gtactcatcc aaacaaatac aagtggggat 3360 tcacgtgtta tttcttatgc ttccaaatcg ctgacagaca ctgagaggag atactgtcag 3420 acagaaaagg aagcactggc ccttgtttgg agtgtagaga aatttcaggt ctatctgatt 3480 ggcagggaat ttaacttagt aacagattgc aaagccttgg tttttctatt cactcccgta 3540 tctcgtccgt gtgcacgcat cgaacgatgg gtactgcgcc ttcaaagttt ccaatacaaa 3600 atcgtacata tttcaggaca tactaacatt gctgatgttc tgtcacgcct atcgactttt 3660 gtcccaaccc cgttcgatga aacagaggag ttagtagtgc acgatgttgc atcagctgct 3720 gcacaaatgg ttgctcttaa atggaaggaa atagaagacc aaagctgtga cgaccctgag 3780 atcttggaga ttgccgatgc attaaaatca ggggtctttg ataatctacc attaccgtat 3840 aaagtcatat catcggaatt gtgtttggta aacgacgtgt tgttacgagg tgataagatt 3900 gtaataccca ataacctgcg tggaagagtt ttacaactgg cccacgaagg tcatccgggt 3960 atcagaatta tgaaagcaaa tctacgagga aacgtctggt ggcctaaaat ggatttagat 4020 gttgaacgat tcgtcaaagc atgccgagga tgttcattgg tatcagcacc taatgcaccg 4080 gaaccaatga taaggaagga acttccatcg agtgtatggg aacaaatagc agtggatttt 4140 cttggcccgc taccagacgg agagcattta ctcgtttgtg tagactatta cagcagatat 4200 atcgaggtag tagaaatgct ggaaataacc acaccttgta cgatcaaaga actgctgaaa 4260 atattttcac gttatggtat acctgaagtt ctacgagctg acaacggacc acaattttca 4320 tctgaagaat tccggaattt ttgtgatgag tttggaatcc acctggagag cactattcct 4380 tactggccgc agatgaatgg agaggtggaa cgccaaaacc gatccattct gaagcgtttg 4440 cgcatagctc aagagcttgg gcaggattgg agacaccagc tgagcaaata tctgatgatg 4500 taccactcat ccagtcatac cacaaccgga agatcccctg cggagctgat gtttggccgc 4560 aaaatgcgaa caaagttacc ttatgttcca gcgacaacag ttgaaaatga aggagttaga 4620 gatcgggacc gattggaaaa agagaaggga agggaatatg cggataacaa acgtagagca 4680 aaatctagtt ctattgcagt gggcgattgt gtgctagcaa aacggatgaa aaaggacaat 4740 aagcttagtt cggaatttag tccggaagag tttagcgtgg tacagaggaa aggcactgaa 4800 gtggtgattc gttcgactaa atcgggcaaa gaatatagaa ggagtgttac tcatttgaaa 4860 gctatacctg gagcatgtaa tgataaaaat caagatgaag aagttgtaga agaatcagga 4920 gagaaacttg caggagatgc atcgtctgaa gaagtagata gggcttcgaa gagagttagg 4980 aaagaatcga agttgctaaa agactatatt acatattgat gataacgagc tatttgtgtt 5040 tttaaaataa agaagaggag agg 5063 // ID Gypsy21-I_Dpse repbase; DNA; INV; 3995 BP. XX AC Unknown_group_247; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy21_Dpse; KW Gypsy21-LTR_Dpse; Gypsy21-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3995 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1120-1120 (2009). XX DR Genome; Unknown_group_247; Positions 34322 30328. XX CC Positions [1621-2043] - Reverse transcriptase CC Positions [3097-3573] - Integrase core CC LTRs are 87% similar to each other. XX FH Key Location/Qualifiers FT CDS 25..3978 FT /product="Gypsy21-I_Dpse_1p" FT /translation="MENTNGPPAGNMQKNEGPSGSEENRINRQQNANEEAM FT RSTLLEENTQTLQQLVQLLSMKINADEIDKKNDAKLVVENFAKIIPEFNGE FT NMSVQQWFINFELNAEAYGLNDKQKYVQARAKMTGTAALFLESTAVYEYGQ FT LRHQLLQEFECERLCSAQIHNQLSMRVKIAGESFHEYILQMKRIAARGTTD FT TESVIRYIVDGLNLKTDYKYTLYGCSSYKQLREKYEIYERTLAVDGGFAPK FT QKDSQPFGKKSNFTKYERKQHCYNCGSLEHLRKDCRAALKCFRCNQTGHMS FT KDCPSAAAVNVAYEEKRLKSLKINDVQVKGLIDTGADVSLLKMSVFGKMSG FT IHLQASSSTLRGLGNNITRSAGQFTAEVEIDGMRLAHKFLVVADHAVGCEL FT LLGHDFISKFTMTSTQGGYKFSPLPGEETEDIKQISIYNVVEENTDINIPS FT QFRDAVQSMIQDFTEKKQTALCPIMLKIVPDEKIAPFRHSPSRVAISEADV FT VKQQIDEWANAGIIRRSSSNFASRTVIVKKKDGCSRVCVDYRQLNKMVLKD FT CFPVPIVEEVLEKLENAKVFTIMDLENGFFHVPVEESSKKYTSFIMKEGLF FT EFNRAPFGFCNSPAVFIRFISFVFQNLRNENILDLYMDDIVIHAETADECL FT GKLKKVFDVAAEYGLKMKWKKCRFLQSTITFLGHQVGGVEIRPGLEKTKAV FT SKFPMPKNIKAVQSFLGLTGFFRKFIRNYSLIAKPLTNLLRKDVPFKMSLE FT EQQAFATLKEALVKEPVLELYRRNAKTEIHTDASKDGFAAALLQWHEGQLH FT PILFWSKKTSESEARQHSYIQEAKAIFLACKKFRQYILGTNFKLVTDCAAF FT KQTLSKKDVPREVAQWVLYLQDYDFEVEHRPGERLKHVDCLSRYPIDVMMV FT SSEVTARIKKAQQEDTMIKAVSEILKSKPYDNFKLKAGLVYKVVQGTDLLA FT IPKSLEKEIITDAHNVGHFAAQKTMHTVQQSYWIPHLEGKVIQIIGNCMKC FT IIYNKKLGKKEGFLHQIDKGSQPLYTIHVDHLGPMDATSKQYKYIFAMVDS FT FSKFVWMYPTKTTGADEVLKKLREWSDVFGNPARIVSDRGAAFTSSSFEEF FT VKEKNIEHIWSTTGVPRGNGQIERVNRSILSIISKLSSEEPGRWYKFVPRV FT QRAINSTVHVSTKRCPFELMIGVKMRCGPDSDILQLIEKEMVDNFEDERQK FT MRQEAKENIQQAQDRYKEQFDKKRKCEYGYKVGDLVAIRRTQFVTGRKMAS FT EFLGPYEITNVKRGGRYNVRKAADVEGPSNTTTSNDNMKLWSFYVENEDAL FT SSGTDD" XX SQ Sequence 3995 BP; 1310 A; 808 C; 982 G; 895 T; 0 other; agagtattgc gcagtagaat tataatggaa aatacaaatg gaccaccagc cgggaatatg 60 cagaaaaacg aaggcccaag tggatcagaa gaaaacagga tcaatcgcca gcaaaacgcg 120 aacgaggaag cgatgaggtc aacactccta gaagaaaaca cacagacgct ccaacagctg 180 gttcaactgc tctcgatgaa aataaatgca gatgaaatag acaagaaaaa tgatgcaaag 240 ctggtagtgg aaaattttgc caaaattatt ccagaattca acggagaaaa tatgtcggta 300 cagcagtggt ttattaactt cgagctaaac gctgaagcat acggactaaa tgacaaacaa 360 aaatacgtgc aagcccgagc aaaaatgaca ggtacagccg ctctttttct cgagtccaca 420 gcagtgtatg agtacggcca attgcgtcat caacttctac aggagtttga gtgtgagcgt 480 ttatgcagtg cgcaaattca caatcaattg tctatgcgag tgaagattgc cggcgaaagt 540 tttcacgaat atatactaca gatgaaacgc attgctgcac gtgggacaac agacacggag 600 tcggtgattc gatacattgt cgacggactg aatttaaaaa cggattataa gtacaccttg 660 tatggatgca gctcatacaa gcagctgcga gagaagtacg aaatctacga aagaacgctc 720 gcggttgatg ggggttttgc cccgaaacaa aaagactcac agccgttcgg aaagaagtcc 780 aatttcacca aatatgaaag gaagcaacac tgctacaatt gtggatccct ggagcatcta 840 cgcaaggatt gccgagctgc attaaagtgt ttccgctgca atcagactgg acacatgtct 900 aaggattgcc cgagtgctgc tgccgtaaac gttgcatacg aggaaaaacg cctgaagtct 960 cttaaaatta acgacgtgca agtcaagggg ctcatagaca ccggtgcaga tgtttcccta 1020 ttaaagatgt cagtgttcgg caaaatgagt ggtatccatc tgcaagccag ttcgtcaact 1080 ttgaggggac ttggaaacaa catcacacgc tccgctgggc agtttacggc agaagtggag 1140 atcgacggaa tgcgactggc acacaaattc cttgtggtgg ccgatcatgc cgtcggatgc 1200 gaattgttgc ttggacacga ctttatatcc aagttcacaa tgacatcaac gcaaggagga 1260 tataagtttt ctcccctgcc gggggaggaa actgaggaca tcaaacagat aagtatttac 1320 aacgtggtcg aagaaaatac tgacataaac attccgtcgc agttcagaga tgccgtccaa 1380 tcaatgattc aggactttac tgaaaagaag cagactgcac tgtgcccgat catgctgaag 1440 atcgtaccgg acgagaaaat agcgccgttt cgacattcgc caagtcgagt ggccatcagc 1500 gaagctgacg ttgtcaagca gcagatcgat gaatgggcca acgctgggat aatacgacgc 1560 tcatcatcaa atttcgccag tcggactgta attgtcaaga aaaaggatgg ttgcagccgg 1620 gtctgcgtgg actatcgtca gttgaacaag atggtcttga aagattgctt tcccgttcca 1680 atcgttgaag aggtgctgga aaaactggaa aatgctaagg tgttcaccat catggactta 1740 gaaaatgggt ttttccacgt tcctgtggaa gaaagcagca aaaaatatac gtcgtttata 1800 atgaaggaag gcctctttga gttcaaccgt gcgcctttcg gtttttgcaa ttcgccagca 1860 gttttcatcc gttttataag ttttgtattt caaaacctta gaaatgaaaa cattcttgac 1920 ctgtacatgg acgatatcgt tatacacgct gaaaccgccg acgaatgcct gggaaagctg 1980 aagaaggttt ttgatgttgc agctgaatac gggctgaaga tgaagtggaa gaagtgccga 2040 tttctgcagt cgaccattac atttttgggc caccaagttg gaggagttga aattaggcct 2100 ggattggaga aaaccaaagc cgtcagcaag ttcccgatgc cgaaaaacat aaaggctgtg 2160 cagtcgtttt taggattaac tgggtttttc cgtaaattca taagaaacta ttcgctaatc 2220 gccaaaccac tgaccaatct gctgcgaaaa gatgtcccat ttaaaatgag tctggaggag 2280 cagcaggcgt tcgctacatt aaaggaagca ttggtgaaag agccggtctt ggaactttat 2340 cgcaggaatg caaaaaccga gatacacact gatgcgtcaa aggacgggtt tgcagcggcg 2400 ttgttacaat ggcacgaagg acaattgcat ccgatcttat tctggagtaa gaaaacctcg 2460 gagtctgaag cacgacagca tagctatata caggaagcca aagcaatttt cctagcgtgc 2520 aaaaaatttc gtcagtacat acttggaacc aactttaagc tcgtaacgga ttgcgccgct 2580 ttcaagcaaa cgttaagcaa gaaggacgtt ccgcgggagg tagcccaatg ggtcctgtac 2640 ttgcaggact acgactttga agtggaacac agaccaggag aacggctgaa gcatgtggat 2700 tgcctgagta ggtacccaat tgatgtcatg atggtatctt cggaagtcac agcaagaatt 2760 aagaaagccc agcaggagga tacgatgatc aaagcagttt ccgaaatttt gaaaagtaaa 2820 ccatatgaca actttaagtt gaaagctggt ctcgtataca aagtggtgca aggcacggac 2880 ttgctggcaa taccgaagtc attagaaaaa gaaataataa ccgacgctca caacgtagga 2940 cactttgccg ctcaaaaaac aatgcacacc gtacagcaga gctattggat tccacatctg 3000 gaaggaaaag taattcagat cataggaaac tgtatgaaat gtataatcta caacaaaaag 3060 cttggcaaaa aagagggatt tctgcatcaa attgacaaag gatcacagcc gctgtatact 3120 atacacgtgg atcaccttgg gcctatggat gccacttcaa agcagtacaa gtacatcttc 3180 gcaatggttg atagctttag taagttcgta tggatgtatc ccacaaaaac aacaggagca 3240 gacgaagttc tcaagaagtt aagggagtgg tctgatgtat ttggtaatcc ggcacgcata 3300 gtaagcgatc gtggagcagc atttacatct tcaagctttg aggagtttgt caaggaaaag 3360 aacatcgaac acatatggag caccacaggg gtaccaagag gaaacggaca aattgaacgc 3420 gtaaaccgat ctattctcag catcatttcc aaattgtcgt cggaggaacc tggaagatgg 3480 tataagttcg taccgcgagt tcaaagggct atcaacagca ctgttcacgt gtctacaaag 3540 cgttgtccat ttgagctgat gatcggggta aaaatgcgtt gtggaccgga tagcgacata 3600 cttcagctaa tagaaaaaga gatggttgat aattttgaag acgagaggca aaagatgcga 3660 caagaggcaa aagaaaatat tcagcaggcc caggacagat acaaggaaca gttcgataaa 3720 aagcggaaat gcgaatacgg atacaaggtc ggagatctcg tagccatacg tagaacacaa 3780 tttgtaaccg gaagaaagat ggccagcgaa ttcttagggc catacgaaat aaccaacgtg 3840 aaacgaggcg gacgctataa tgttcgaaag gcagcagacg tcgaaggacc aagcaacacg 3900 accacaagca atgataacat gaaactatgg agcttttatg tagaaaacga ggacgcattg 3960 tcatcaggga ctgatgacta atcaggatga ccgag 3995 // ID Gypsy-237_AA-I repbase; DNA; INV; 4794 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-237_AA_; KW Gypsy-237_AA-LTR; Gypsy-237_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4794 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1075-1075 (2011). XX DR [1] (Consensus) XX CC Positions [2125-2505] - Reverse transcriptase CC Positions [3761-4228] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 676..2397 FT /product="Gypsy-237_AA-I_1p" FT /translation="MPLIGAIEAYIPGSTSFAQYVEQIEWIFKINKIPEEE FT KLAYFLGLCGRETYSELKLLHPGVDLATLTYDAMIDSLKRRFDKAETDMFQ FT RYKFYNMIQSESEAAEDFILRVKLQAESCEFGAFKETAIRDKLVMGVQDKQ FT IQQRLIEEEDLTLAKAEKLIVNRESAGVRTRMMNGNNLTVVAKIDDNKRKV FT YSSSRSRSKERRSRNNRNFSNSHSRSRSRSSPTPRKSFYCNYCNKKGHTKK FT YCYELKKQKKLHVKFVDAPPVQIPSTSDYFKRLREDPKDDDSESEENCMKI FT TSVNRRNEPCFVRPIVEQVEMRMEVDCGSAVSVIDERDFLDYFGNLSLERY FT DNRLVVVDGASLEVLGCVRVTVQLNGIREHQLELVVLKCAKRLRRIVPLLG FT RKWLDVFFPEWRLPFSNERVNKISENLIENTVEDVKRKFDHIFKKTFLDPI FT VGFEGDLILKEDRPIFKKAYEVPLRLRQKVIDHIDALEKDGVITPIEASEW FT ASPVVVVVKKNQDIRLVIDCKVSINKLIVANTYPLPNAQDLFASLAGAKVF FT CSLDLAGAYTQLKLSKKIPKVHGDQHP" FT CDS 2381..4795 FT /product="Gypsy-237_AA-I_2p" FT /translation="MVINTLKGLYQYNRLPQGASSSASIFQKIMDQILSGL FT PLVVCYQDDVLIAGVDFEDCKKNLYLVLDRLSKANIKVNLSKCKFFVDSLP FT YLGHVVSDKGLMPCPEKVDTIRAAKAPQNVSELKAFLGLLNFYGKFIPHLS FT SRLNCLHKLLKKDARFVWSHECQKTFDECKTYLLCSNLLELFDPKKPVVVV FT SDASSYGLGGVIAHEIDGVEKPISFTSFSLNDAQRKYPIIHLEALAVVSTI FT KKFHKFLYGQKFTVYTDHKPLIGIFGKEGRNSMYVTRLQRYVLELSIYDFD FT IVYRPSSKMGNADFCSRFPLPIEVPKSLQREYIKNLNFSSDFPADYQQIAS FT ETLKDNFLQQVQSFINFGWPNKPKPEFKDVHTHYQDLEIVDGCILFQDRIL FT IPRTMQQTILKMLHKNHTGTQKMKQLARRSVYWFGLNKDIESHVKACRTCN FT EMYIAQKPKTSNEWIPTTRPFSRLHADFFYFQQKVFLIIVDSNSKWLELEL FT MPHGTDSKKVIRKFTALFARFGFPDVVVTDGGPPFNSTPFIKFLENQGIKV FT LKSPPYNPQSNGQAERMVRLIKEVFKKYLLDPQLQSLDTEERIYNFLFNYR FT NTCLDNGDFPSERVFCFKPKTDLDLINPKLHYKKQLTIPKHDKTNSQKRTV FT IPAHDPLIKLTAGDPVYYKNHNPTDIRRWVRVKFIKKLSTNIFQISFGGRL FT LSAHRQQLRLVENCSDRNFVPVLFDRDAPDRKHDPDPASTTNAEQTESEEE FT DFYGFAAESCIYRAPVQSSTLPLSLNDASLSRKRKNVDTPVWRSKRLKHSK FT KNS" XX SQ Sequence 4794 BP; 1473 A; 868 C; 1035 G; 1418 T; 0 other; agttaaagtg gcgtacgagg ataaatcttt ggaaaattat tgtgatttcg agtgattagt 60 gcctagtaat tggcagtttg cggttcgcta gtgtttgcgc ggcgttgaga aacagaaagt 120 tcgtcgtgtg ctggtgagta acagtgtaac agccagctgc taattttgtt tgcggtgtta 180 ggctttttgt tacctgttgt gttgctatcg tggtttggcc atcgcgtatt ttgtgaatgg 240 ttggcagtgt tttgtcaaca agacggagtg atttttcgat tgttttgttt gcccggcagg 300 ttcggtgtag ttatttaccg gccaactctt ttgcaaaggc agcaaacgaa tcgatcgtgg 360 agtttggaag aaacggcggt tgtttatacc aagctgtgtg gagcatattt gggctgcggc 420 tgaataagtg acggattttc gggctgtttt tggtgctgcc tgctggtgtg ctgttgctgt 480 tactgttgct gttgtggaga agttgtgctg ctgattggac tgagccagta ttttttggtc 540 ggttgtacag cgttgaaaat aaggtatgtg ttttgtttta ttttattgat tttcttgctg 600 gaaatttgct ccaatttttg agattttttc tctgttgttt ttttttctaa ttattttcaa 660 tcagaagtgc taaatatgcc gctgattggt gcaatcgaag cttacatccc tggttctact 720 agctttgcgc agtatgtgga gcaaattgaa tggatcttca agatcaataa aattccagag 780 gaggaaaagc tagcttattt tcttgggcta tgtggtcgag agacttatag tgaacttaaa 840 ctgttgcatc ctggcgtcga ccttgcgacg ttgacctatg acgcaatgat tgactcttta 900 aaaagacgtt ttgataaagc agaaactgac atgtttcagc gctataaatt ttacaatatg 960 attcaaagtg aatcagaggc tgctgaagat tttatattaa gggtcaagct ccaggcggaa 1020 agctgcgaat ttggcgcttt caaggaaaca gctatcagag ataagctagt aatgggggtc 1080 caggataagc aaattcaaca gagattaata gaagaggagg atttaacgtt ggccaaagca 1140 gaaaagctca tcgtaaaccg agaatccgcg ggcgttcgaa ccaggatgat gaacggtaat 1200 aacttaacag ttgttgctaa gattgatgac aacaagcgga aggtttacag ttccagtcga 1260 tccaggagca aggaaagaag aagtaggaac aatagaaatt ttagcaatag tcatagcaga 1320 agccgcagta gatcatcacc aactccaagg aagtcatttt attgtaacta ttgcaacaaa 1380 aagggccaca ctaaaaagta ttgctatgaa ctaaagaaac agaaaaagtt gcatgttaaa 1440 ttcgttgatg ctcctccagt tcaaattccc agcacttcgg attacttcaa acgcctgaga 1500 gaagatccaa aagatgacga ttccgaatct gaggaaaact gcatgaaaat tacatcggtc 1560 aatcgtagga atgagccatg ttttgtaaga ccaatagttg aacaagtgga aatgcgtatg 1620 gaagtcgact gcggttctgc agtcagcgtc attgatgagc gagattttct tgattacttt 1680 ggaaatttat ctctggaacg gtatgataac aggcttgtag tagtagatgg agcgagtttg 1740 gaagttttag gatgcgttcg agttacagtc cagttaaatg gcattcgaga gcatcaactg 1800 gagctagttg tgttgaaatg cgccaagcgg ttgcgtagaa ttgtccctct tcttggaaga 1860 aaatggctgg atgttttctt tccagaatgg cgtctgcctt tttctaatga aagagtcaac 1920 aaaatttcgg aaaatctaat cgaaaatact gttgaagatg tgaagcgtaa gtttgatcat 1980 atttttaaga aaactttcct agatccaatt gttggttttg aaggagattt gattctgaag 2040 gaagatagac ccattttcaa aaaggcctat gaagtgcctt tgagactgag acaaaaggtt 2100 attgatcaca ttgatgcttt ggagaaagat ggagtaatta cacccataga ggcgagcgag 2160 tgggcttctc cagtcgtggt agtagttaaa aagaaccagg atatccgctt agtaatcgac 2220 tgcaaagtgt caataaataa gctcattgtt gcaaatacct accctttgcc aaatgctcaa 2280 gatctttttg catctttggc aggagctaag gtattttgtt ctttagacct tgctggagct 2340 tacactcaat tgaaattgtc aaaaaaaatc ccaaaagttc atggtgatca acacccttaa 2400 aggtctttat caatacaata ggctgcctca gggtgcatct tccagtgcgt caatatttca 2460 aaagatcatg gaccaaattt tgagcggttt acctttggta gtttgttatc aagatgacgt 2520 tctaattgca ggagttgatt ttgaagattg caaaaagaat ctttacttag ttctagacag 2580 gctttctaaa gcaaatatta aggtgaatct tagcaaatgc aaattttttg tagattcttt 2640 accttatctt gggcatgtag tttccgataa aggtcttatg ccgtgtccag aaaaagtaga 2700 cactatccgt gcagcgaaag ctccccagaa tgtgtctgaa ctgaaagctt tcctaggact 2760 tttgaacttt tacggaaagt tcattccaca tttgtcatct cgtttaaact gtctccataa 2820 attactaaaa aaagatgctc gttttgtttg gtcccatgaa tgtcagaaaa cttttgacga 2880 atgtaaaaca taccttttgt gttcaaacct tttggaactt ttcgatccta agaaaccagt 2940 ggtagtagtt tctgacgctt caagttacgg acttggagga gtgattgccc atgaaattga 3000 tggtgttgaa aaaccaattt catttacgtc attttcattg aatgacgctc aacgaaagta 3060 tccgataata catttggaag ctttggcggt agtaagtaca ataaaaaagt tccataaatt 3120 tttgtatgga caaaaattta cggtatacac cgaccataaa ccactcattg gtattttcgg 3180 aaaagaaggg cgtaactcta tgtacgtaac acgacttcaa cgctatgtat tggaactttc 3240 gatatacgat tttgatattg tgtatcgacc ctcttctaaa atgggaaacg cggatttttg 3300 ttcccgattc cctctaccta ttgaggttcc aaaatcgttg caaagggaat acataaaaaa 3360 cttgaatttt tcgtcagatt tcccagccga ttatcaacaa atagccagtg aaacattgaa 3420 agataatttt cttcagcaag ttcaatcctt cataaatttt ggctggccga acaaacctaa 3480 accagaattt aaagacgttc atacacacta tcaagatctt gaaatagttg atgggtgcat 3540 tttattccaa gacagaatac tcataccgcg cacaatgcaa caaacaattc ttaaaatgtt 3600 gcacaagaat cacacaggaa cacagaaaat gaaacaacta gcgcgaagga gtgtatattg 3660 gtttggttta aacaaggata ttgagagtca cgttaaagcg tgtcgaactt gcaacgaaat 3720 gtacattgca caaaaaccaa aaacctccaa cgaatggatc ccgactaccc gaccattcag 3780 tcgtctgcat gctgattttt tctattttca acaaaaagta tttttaatta tagttgatag 3840 taattctaaa tggttagaat tagagctgat gccacatggg acagacagca aaaaggtgat 3900 aagaaaattc actgcattat ttgctagatt tggattccca gatgtcgtgg ttactgacgg 3960 aggtcctcct ttcaattcaa ctccattcat caaatttctg gaaaaccaag gtatcaaagt 4020 gcttaagagc ccgccttata atccacagag caacggacaa gcagaaagga tggtaagact 4080 gataaaggaa gtgtttaaaa agtacttact tgacccgcaa ttacaatctt tggacacaga 4140 ggaaaggatc tacaatttct tgtttaatta cagaaacact tgcctagaca acggtgattt 4200 tccttcagaa cgagtcttct gtttcaagcc aaaaacagat ttggatctaa ttaacccaaa 4260 attacattat aaaaaacagc taactattcc aaagcatgac aaaactaaca gtcagaaaag 4320 aacagttata cctgcacatg accctttaat caaactgact gcaggtgatc ctgtttacta 4380 caaaaaccac aaccctacag acatacgtcg ctgggtaaga gtaaagttta ttaagaagtt 4440 atctaccaac attttccaga tatcttttgg tggtcgtctg ctatccgcac accggcaaca 4500 acttcgattg gtggagaact gttctgatcg gaattttgtc ccggtgttgt tcgatcggga 4560 cgcgccggat cgtaaacatg acccagaccc agccagcaca accaatgcag aacaaaccga 4620 gtcagaagaa gaagattttt acggttttgc tgcggaatca tgcatttacc gtgcacccgt 4680 ccaatcttct accttacctt tgagtctgaa cgatgcaagc ttatcgagaa agcgcaagaa 4740 cgtggacaca cctgtttggc gatcgaaaag actcaagcat tccaagaaaa actc 4794 // ID Poseidon_Ap repbase; DNA; INV; 2066 BP. XX AC . XX DT 27-DEC-2006 (Rel. 11.12, Created) DT 27-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Poseidon_Ap is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Poseidon_Ap. XX OS Acropora palmata OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; OC Scleractinia; Astrocoeniina; Acroporidae; Acropora. XX RN [1] RP 1-2066 RA Arkhipova I.R.; RT "Distribution and phylogeny of Penelope-like elements in RT eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Poseidon_Ap is a Penelope-like element (PLE) from the elkhorn CC coral, Acropora palmata. It belongs to the Poseidon group of CC PLEs. Its 5' truncated ORF contains regions homologous to reverse CC transcriptases and to GIY-YIG endonucleases. The element is CC apparently inactive, its copies are 80-95% identical. Consensus CC sequence was assembled from trace archives. XX FH Key Location/Qualifiers FT CDS 1..1767 FT /product="Poseidon_Ap_1p" FT /translation="TRQRELPESNKVDIRSRIASTIQSASLTDCNLTKDEL FT HALRRLRNDKDIVILPSYKGCATVGMNKKDYSHKMDSLVNDKQTYEPVKRN FT PTPALQRRVSGKLLDFKKKETMDIQLYYRLRCRVPQSTKLYRLPKLHKPNI FT PMRPIVSFCGSPTYQLSKHLIFLKPLTDKSRHKLQSTDNFIDAIKTVQIPD FT DHKLVSFDVKSLFTSIPLQLALDRTKTAIKKSHYQPPLPTDDLMDLLHLCL FT TSTYFQYNGKHYKQLHGTAMGSPVAVVVAEIVMQNIEEQALATYSETLPLW FT LRYVDDTITAVHESKIDEFHEHLNEQNTSIQFTKEIEENGKIPFLDCLVTR FT ENNTLRTTVYRKPTHTDRLLDQTSYNPTSHKATTVRTLTRRAQIVCDSDDS FT LTDEIKHLNTVFIKNNYNTDFIERNTYIRPNDSSNNSYTTTATIPYVRGTS FT ETIARILRPYNIRVAHKPIFTLRRLLTHVKGKDKPEDRPGAVYKIHCSDCQ FT ATYIGETGRNLTTRLTEHKRATKKGDLNNNIAEHHLKTNHAIDWDSATCLT FT YSTDYYQRITLESWFTNLEQTALNRCQPLPAPYKRLLNRKQ*" XX SQ Sequence 2066 BP; 699 A; 587 C; 326 G; 454 T; 0 other; acccgtcaga gggagctacc agaatcgaat aaggtcgaca tcaggagcag aatagcttcc 60 actatacaat cagcctcact caccgactgc aacttgacga aagacgaatt acacgcattg 120 agacggttaa gaaacgacaa ggacatagtc atacttccct cgtacaaagg atgtgctaca 180 gtgggtatga acaagaagga ctatagccac aaaatggact cactagttaa tgacaaacag 240 acatacgaac cagtgaagcg taaccccaca ccagcactcc aacgaagagt aagtggcaaa 300 ctacttgact ttaaaaagaa agagactatg gacattcaac tatactacag actcagatgc 360 cgcgtaccac aatcgactaa actttacaga ctacctaaac tacacaagcc taacataccg 420 atgcgaccca tagtctcatt ctgtgggtct cccacttacc aactttcaaa acacttaatc 480 tttctcaagc ccttaactga caagtcacga cacaaactac aatccacgga caacttcatt 540 gacgctatca aaacggtaca aataccagac gaccacaagc ttgtatcctt cgacgtcaaa 600 tcacttttca ccagcatacc acttcaactt gcccttgacc gtactaagac cgccatcaag 660 aaatcacact accaaccacc attacccaca gacgacctta tggacctcct gcacctttgt 720 ctgacctcaa cctactttca atacaacggt aaacactaca agcaactaca cggaacagct 780 atgggatcac ctgttgccgt tgttgtggct gaaatagtca tgcaaaacat cgaggaacag 840 gccctagcaa cttacagtga aacactccct ctctggctac gctacgttga cgatacgatc 900 actgctgtac acgaaagcaa aatcgacgaa ttccacgaac acttgaacga acagaatact 960 agcatccagt ttactaagga gatcgaggag aacggtaaga tacctttcct cgactgcttg 1020 gtaacacgcg aaaacaacac cctacgaacc actgtttaca ggaaaccaac acacactgac 1080 agactacttg accaaacgtc ctacaatcct acttcacaca aagcgactac ggtacgaacc 1140 ttgacaagaa gagcacaaat tgtttgcgac tcagacgaca gtttgactga cgaaatcaag 1200 cacttaaaca ctgtttttat taagaacaac tacaacacag atttcatcga acgcaatact 1260 tacatcagac cgaacgacag ctctaacaac tcatacacca ccacagccac tataccttac 1320 gtacgaggca cctccgaaac catagcacgc atacttcgac cttacaacat tcgagttgca 1380 cacaaaccca ttttcacttt acgacgctta ctcactcatg ttaagggaaa agacaaaccc 1440 gaagacagac caggagcagt ttataagatc cactgctccg actgccaggc cacttatatc 1500 ggtgagaccg gcagaaactt aaccacgcga ctaaccgaac acaaacgagc tacgaaaaag 1560 ggtgacctca acaataacat cgccgaacac cacttaaaaa caaaccacgc tatcgactgg 1620 gactctgcta cgtgtttaac ctacagtacc gactactatc aacgaattac actcgaaagc 1680 tggtttacca acttagaaca aactgcccta aatcgttgtc aacctcttcc cgcaccttac 1740 aaacgtttac tcaacaggaa acaataacac cttgtttatt attcatttta caatccatct 1800 atttgcataa cctatctccg cacatgtctt cacagccaat cacatcacgt acttaccaac 1860 ggctactcta tctattgacc aatcaaatcg ctcaccaggg tttttgaatt ttcaactgac 1920 taacactacc acttgactct gaagatggct tccgcacagg ttgtcgaaac gtcagtcact 1980 aacaacagtc cttctcagga ctccaatcac ccagatgatc tttttcaatc aaggtatgtt 2040 actcctgggt tcaaaccatt ttctta 2066 // ID Chapaev-1_HM repbase; DNA; INV; 2733 BP. XX AC . XX DT 26-FEB-2008 (Rel. 13.02, Created) DT 26-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2733 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 27-27 (2008). XX DR [1] (Consensus) XX CC Chapaev-1_HM is a young family of autonomous Chapaev DNA CC transposons that were active in the hydra genome just a few CC million years ago (they are ~1.6% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of 15 copies; it codes for a 654-aa Chapaev CC transposase (two exons). Chapaev-1_HM is characterized by 4-bp CC target site duplications and 211-bp terminal inverted repeats. CC Based on the TPase identities, Chapaev-1_HM forms a distinctive CC group together with Chapaev-2_HM and Chapaev-5_HM. The N-terminal CC portion of this group TPase contains a Chapa-like zinc finger CC (H-X7-C-X2-C-X35-C-X2-C-x36/38-C-X2-C) but is free of the RING CC finger motif. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(377..962,1058..2433) FT /product="Chapaev-1_HMp" FT /note="Transposase." FT /translation="MPNKAKTHEENRKCVCFLCLKKANREITSFLVEKIRT FT VLKIELDFSNFQIPCGICERCRVAIRRREEGEDAPIPRLFDFSTISVQRAA FT TNIPCNCLICQTARTNMNQRHPLEPPKPKENSSIEKRCSDCFSVIGRGLPH FT NCTTGTLRQNLVEVASRDRIAAERVASLTIANKTPSPHGTVRLSQPLGGNR FT FPVVPGPSSARELFPAVPKLTAQDMVGVQIGTRLSNRGMSKLASSLNQATP FT LRIVEKNFREKFDSFGKSLSEHFETKAIRDSTKNDDQHPSRLLVFCPDVGS FT LANHVIKVRNVSGDPLMKIGIDGGGGFLKVSLGIIARNTDSNSPPPKRGLK FT DTGVKRQLLVAISEDLPESYDNLKSIISSLQLHKLSYIISCDMKVANLVCG FT LQSHASAHPCSWCDAESKDLSRSGSLRTLGSIRENFLQFQSSGANARRAKD FT FKNVIHEPIMTLPDDTLILDIIPPMELHLLLGVVNHLYKSLCQIWPAAKEW FT PAALHIQEQSFHGGQFAGNDCRKLLKNTDRLQSLAEQSSFFLAVPFVETLR FT HFDSVVNACFGNHLSPDFEQKILQFQESYLKLPNTSVTPKAHAVFFHLKDF FT ILRHNSSLGIFSEQATESLHHDFSSHWQRYKRARNHPEYESKLLNCVIDFN FT SKHSF" XX SQ Sequence 2733 BP; 847 A; 530 C; 528 G; 828 T; 0 other; cacagtgtca cgaaatggcc atcaacctta acctcatgtc ccgagaaaac cctttggtgg 60 gatgatagcc cacatatagt agatcactta gatataaatt tggaaatcga actatgacgt 120 ccgagagaga tattttagtt tttattttct tcaattctgg catatcaaaa tttaaatgag 180 atgataggaa tttggcattt ttacagcatt tttgttgtta cgttgcttag agtactattt 240 aagacattat ataactaatt ggagcaggtt ttttgaaatc attattattt tttgtttgtt 300 tttcaatttt gctaattccg ggaaaagggt gcacattcag tttttttcaa ataagacaca 360 ttgaaagttt gataaaatgc ctaacaaagc taaaactcat gaagaaaacc gaaaatgtgt 420 ttgtttccta tgtttgaaga aggcaaatcg agaaatcaca agttttttag tggagaaaat 480 tcgcacagtc ctcaaaattg agcttgattt ttcaaacttt caaatccctt gtggaatttg 540 tgaaagatgt cgtgtggcta ttcgcagacg agaagaaggg gaggatgcac ctatcccaag 600 gctctttgat ttcagcacca tctcagttca gcgtgctgca acaaatattc cttgtaattg 660 tttgatttgt caaactgcaa ggacaaatat gaatcagcgt catcctttgg aaccaccgaa 720 acccaaagaa aattcaagca ttgaaaaaag gtgctctgat tgcttctctg tgattggtcg 780 tggattgcct cacaattgca caacaggaac attaaggcaa aatttagtgg aagttgcttc 840 gagggatcgc attgcggcag agagggtagc atcattaaca attgctaata aaactccatc 900 acctcatggc actgttcggc taagtcaacc acttggagga aatcgttttc ccgttgttcc 960 aggtaaagca atatagattt aagcataaac cacttataaa tttcttttaa aatgtaaatg 1020 tagaaaattt taacctcggc cattaaatgt tctttaggac catcaagtgc aagagaattg 1080 tttcctgctg tgcccaagct cacggcacaa gacatggttg gagtccaaat tggaactaga 1140 ttgtcaaatc gtggcatgtc caaattggca tcatctttga accaagctac acctctccgg 1200 attgtagaaa aaaatttcag agaaaaattc gacagtttcg ggaaatctct ttctgagcat 1260 tttgagacca aggccatacg agattctaca aagaatgatg accagcaccc ttcacgactg 1320 cttgtttttt gcccagatgt aggaagtctt gcaaatcacg ttatcaaagt aaggaatgtt 1380 tcaggagatc cgctgatgaa gattggcatt gatggaggtg gtgggttttt gaaagtatct 1440 cttggaatca ttgccagaaa cacagattca aattctccac caccaaaacg agggcttaaa 1500 gacacaggag tcaaacggca acttctggtg gcaatttcag aagatctgcc agagagttat 1560 gacaacttga aaagcataat ttcttctctt caacttcaca aactttccta cataatctct 1620 tgtgacatga aagtcgccaa tcttgtttgt ggtcttcagt cccatgcaag tgcacatcct 1680 tgttcatggt gtgatgcaga atcaaaagat ttgagtcgct ctggaagtct cagaacactt 1740 ggttcaattc gagaaaactt tctgcagttt caaagcagtg gtgccaatgc aaggagagca 1800 aaagacttca aaaacgtcat ccatgaaccc ataatgactc tgccagatga cacactgatt 1860 ttggacatca ttcctcccat ggaacttcat ctgcttcttg gagtcgttaa ccatctctac 1920 aaaagtttat gccagatttg gcccgcagca aaggaatggc cagcagccct tcatattcaa 1980 gagcagtctt ttcatggtgg ccaatttgca gggaatgact gccgcaagct cttgaaaaac 2040 actgatcgcc ttcaaagttt agctgagcaa agctcatttt ttcttgctgt gccatttgtt 2100 gaaacattga gacattttga ctcagtggtg aatgcttgct ttggaaatca cttgagcccc 2160 gattttgaac aaaaaatttt acagtttcag gagtcgtacc tgaagttgcc aaatacttca 2220 gtcactccca aagcacatgc ggtgtttttc cacctgaagg atttcatcct tagacacaat 2280 tctagtcttg gcattttctc agagcaagct acagaatctc ttcatcacga cttcagttcc 2340 cattggcagc gttacaaacg ggcaagaaac catccagaat atgaatcaaa gcttctgaat 2400 tgtgtgattg atttcaacag caaacattca ttttagaata aaaaggatca attttattgt 2460 aaaatatgtt tttgtttata tgtcttatag aaatatatgg gaaaattaac tttttgctct 2520 ataaatactg aaaaaatgcc aaattcctat catctcgttt aaattttgat atgccagaat 2580 tgaagaaaat aaaaactaaa atatctctct cggacgtcat agttcgattt ccaaatttat 2640 atctaagtga tctactatat gtgggctatc atcccaccaa agggttttct cgggacatga 2700 ggttaaggtt gatgaccatt tcgtgacact gtg 2733 // ID Gypsy-1-I_MI repbase; DNA; INV; 10685 BP. XX AC CABB01000278; XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon from Meloidogyne DE incognita. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_MI. XX OS Meloidogyne incognita OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne; OC Meloidogyne incognita group. XX RN [1] RP 1-10685 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Meloidogyne incognita."; RL Repbase Reports 9(2), 463-463 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(1526..4078,3997..6447,6396..9998) FT /product="Gypsy-1-I_MI_1p" FT /translation="MEEEVGTDDDMHTADEEGKTPIKKVARQLNERELQQK FT LATLEKEIAKLGINRMREGLPLPEFYSGIEDFEAYLRRFNKIATAHQWSGT FT RCTQILPLYLLEEARAIYDSLGADSKNTWKGLTDALAIKLKKLNTKESARR FT MLTNRKQKSNESIIEFAQVIRGLINKAFPENSFKKVVLETEEARTQRIKDW FT RDDLAKDYFKNGIKLEIKEKLAFITTDTLEDTINQAKQIEEVQNALKEDRM FT RSYQNKLSEKAILEVNAVRDQVNELKEQVDRRMPTQPVRGNYNNWRGNFNT FT RGNWNQRNNYNREQYSNNRNNFQNYNNWRGANFNRGRGTVRWGQRRGAGSF FT NSNKIPINNPRNFGAGASNSARINSLAMPYITIMMLLMFCQGICGQYQICP FT KDKIGESIDFPSPQDCTLSLTHQMEIKNVTLFLPKLVPRYFPIYRCWMERQ FT VKCTDAILFYREDLPVKKERFSLSPESCWNLYKSTKLKRINEVLWSIIFYN FT DLKYAWFGQQCVMDRHYFLEEGEGALMNKRFETSFGVSKSVEAHTINGSFT FT DKEKLFSVIVWNAPSEDYIYTHYSFGPVQAEIFYKNNSINGLVKINELQYV FT FAPSENITKSSDVLGVPDEAWPMDNDVYILILNTTERIIQKRQVTKKQNFR FT RTTTTTTTTTLIPPTIKPIKTTTRPILTTKKPILSTTTTPIHNKRMHTTPT FT IDPNYRLPVQQQIKNNYLKGDRGLPGPDGRPGAPGEKGDMGYTGPRGMKGA FT PGEKGDMGYTGPRGSKGASGQDAAPGLKGDIGIPGRPGKEGLPGLNGKDGL FT DGLQGPQGPIGPKGDKGSSGINGKMDYLDQKESLEIGMKILKFYRLCLPWE FT NGLPGPKGEPGNWDENFKILSTLSPLNKRELFDPEEAQKERDNKIKNNHEK FT SRLNYLSWEEAQAREQIMREQWKSTCQNRNQQLEIAKIIAETNPTKAARYL FT YGRTDLIAFLESGIKLKWNLGLCTPVNVEEIIWDQKVEDKCFELLPVKIRG FT KLLFAAPGSPDLLMESKTKSCEEQPIILNKTKLLNFDQEQHSLKIKPIIDV FT KTGILFKAGNKFEEEKAEVKDIQKFSDLIHQSMTLPEDIKQPEVEDVLSEI FT FGLPKILKEKGSNVTGEVFENLKENLEAGLEKGKEVAKDITEKGSEIISDA FT KEFIEDKWTFMQRLYWILMAFLILSALLVNTYLFWKFRALFSLIWKGIMIG FT RTVIKFFFGGKEGSHERKEIKVNAIEMEELNPKPSAPTLNDLEEKAYILDY FT IPSVCSVKSRKRCYVEVLFGGKIQKALIDCGADISYCGESVASRCGLKINS FT KDVPMAWAANSTPITFLGSAMVTIEIGGSTMRWPFLVSEDKSCPGGLVIGT FT DLMEEKEEIKLNFKNKTIQLGEDILPMIAAMEYVELPKRQIEVRLLENYVL FT PPLSDSLLWGTVNRIFDPTQQFLLEEWSQHEYWPVKIGRTLSQPGSSRLIP FT LRILNFGNSHVQIYGKSRVGILEPVYDNEKKANSVEVFMRKDEYVSPEVNW FT EDELPLLPNFEKKSEKISDKLNLEGTSLSKQGIVSLKNMVDEKSEAFVKSD FT GIIGLYKGNIVHQIELEPGTRPVQQRPYTIPHALREEVEKQIKEMLKQNII FT KPSSSAWASPIVLVKKADGKSWRFAVDYRALNKCTKNKLTYYREFKIYLMM FT HEKQTYLLPRIQDLLDVVGGKSLFSIFDLQSGFHQVKMNKKHTDRTAFITH FT CGLYEFLRLPFGLAGAPHTFQKVMEEMRQQLSRSFLVYLDDVILGSETENQ FT HLNDLNAFLNVLCEVGMKLRAEKCRWGCSEIRYLGFLISEKGVRLDDSDLK FT PILKIKRPENLAELRSLIGMFSYFRRFIRGFAGIMAPIYDLTKKESTRDWN FT EKHDKILEEMKKRLTSAPILATPKFGRPFILETDASGTAIGACLLQESVQK FT EIHPIAFYSRTLNKHEKNYSVVELEALALVSALKQFRVYLEGAGTSTVITD FT NSALTSLFRRKDLQGRLARYQIVLQAFDVNIIYRPGKFNKVGDHLSRYPPE FT ADIEIKSIEVEKKISIDELSKAQKCDREILLKRQEKEENIKEINGVIWEKI FT KNDWKILIPQAIKNKIIKEFHEDPLQGAHLGINRTLEKIKRSMHWKGLAKD FT VADRIRICEVCQKRKVVGVHMSREEICPIEPASRPFDRVHLDLLGPIQKSF FT RGHQYIFVAVDSFSKWAIAVPIRNQTATTISEIFLNEIICRFGIPSLVVTD FT QGTQFMSSTFTDLAQAMNFCHKPTTAYHQSANGMVERFNRTLADMIATCTE FT QGKKWMDVLPQIIFAYNTSYNQQINNSPFYVIHGFVPKLPTEATLGIEKEK FT FNDMNLYVKNLVENLELVRSDVKCRLQGNMDIMRKQQIKINKKNWEKGEKV FT LVKKTENIKKFGDKYEGPFTIVELQCPNLLISDSGLADEAWFVHMDRCKPF FT YSEEILNKKDRRVAGDTNLEQSDSDEEDQIQINSVTNSLGLTAPNKTNAIT FT HNSSFEEDTKKFTNTHATTETNLNINFQIKKDLIKSKNNIILKMPNSNSTS FT SSIIIEQQNKLPWHFEDVSSPELSEDEERIMPYSKEDKGKEKDKKIAKNIE FT DNKSRSAGVKRKKEEQEIANSVDWEMAYRLKRIRSKYEDDMLTKHNDIWKE FT ILKKGDQFHSKIQDKLDKERKALEKERKEKKKGRKNEKKQTRKEAETEAEI FT VKKKLDDGKKKPREAEMKNEGDTKPKEKKTGVGDFAKFASRVVQRAVVDKI FT DDVVPEELPEDKEKIIKKKEEEEKLLKLGKYQEKMKSRRKDFDEEFVARGY FT KTREYSMSPVRRSVVGYIDEKDKKDKMGKIKAKENKKVEITTKEIGVNTVF FT NTLVFNRILENLDEVQEFLTLKPTAK*" XX SQ Sequence 10685 BP; 4158 A; 1472 C; 2044 G; 3011 T; 0 other; tggaggtcct agaataccgt atcgctactg gattcaacac tcaacaatat tttatttcaa 60 tttattaaat ttattattcc aacttataaa tttcaactta ttatatttct attttaacgt 120 atctaatttc aacattatta atataattat tttattacaa catatattta ctggctctta 180 cctggatcaa gtatgttatt acctttattt caaacaggtg tttattactc ccactcaaaa 240 aaatgtttac gttcattaac tttaatattt gttaaataat actggaatta aatcaacaaa 300 tttatttatt gtattaaaat taattatttt aacaattgaa tgatcaacaa gcacataatt 360 tcaacaaatt aattttaaat taacaaaaat tttattgtgc tgtggtggtg agtgcggtga 420 tttattaagg agttatactg ccgatataaa aatcggtttg ggtttaagtt tagtggtagt 480 tggagtgctt tttggaatgg gacgtccagg aattgtagga cgatgtttag agatataata 540 agccaagata aggatagaga ccaggattaa accaaggagg gcgtaaatta tataaaagtg 600 aagtgtatgt gagaattgtg gttcttgttt atcttccgcc tcagttgtaa ctacaggccg 660 agtttttgaa caaataatcg gctcattcca ttctactggt ccaaaacatc ggtggacaca 720 ttttaagtct gcgggacagg caacctcttc aaattccact taaaaattat aattaaaata 780 aaacacaaaa atgggaattt taatacaaaa atcagaattt attaaaaaca aaatacaaaa 840 atcagaattt attaaaaaca aaatacaaaa atcagaatta tttaaaattt aatagtcaga 900 attacataac acaaaataca aattttaata taaaataagt ccaaagaaca tccaataaat 960 ctaaaaaggg agatcacacc aaaaaatcat atgggcccaa aaataaatat agaaagaaaa 1020 aataaaatta attagttgat caacaaaaat aaaagaaata taaaagatta attaacgtgg 1080 ctcttacacc ttttaactac ggccggagtt atctggtgtc ggaccacgag tttattaaaa 1140 ctataaatag ttttaattaa aagaagtgat ttaattagaa tttattatgg ctgtgactat 1200 tggattaatt ttataaattt aattttgtat cttaaattgt ttatgtaatt aattattcta 1260 acaccaacat atatttaaca acagataaaa gatattttaa actacagctt ttattgaaac 1320 acattatttg aacaacagtt acaacataaa aattttattt taaatttaac aacacaataa 1380 aataaaaatt atttaattaa atttatttaa atacagactt cgaattacta attgaattaa 1440 ttataagtat tgaataaagt ggaaatactt attggaatac tccattcaaa ttattttaaa 1500 ttatttttgg aaaattaaat acaaaatgga ggaggaagtt ggaacggatg atgacatgca 1560 cactgccgat gaagaaggga aaacaccgat aaaaaaagtg gctcgccaat taaatgagcg 1620 tgagttacag caaaagctcg cgacacttga aaaggaaatc gccaaacttg gaataaatag 1680 aatgcgtgaa ggattacctc tgccagaatt ttattcggga atagaggatt ttgaggctta 1740 ccttcgtaga ttcaataaaa ttgccaccgc gcatcagtgg agtgggacgc gttgcactca 1800 aatattacct ttatacttgt tggaggaagc tcgcgcgata tatgattctt tgggcgcgga 1860 ctctaaaaat acgtggaaag gactgacaga cgccttggca ataaaattga agaaattaaa 1920 tacaaaggaa tcagcgcgta ggatgttgac taacagaaaa caaaaatcta atgaatctat 1980 aattgaattt gcgcaggtta ttcgcggtct aataaacaag gcttttccag aaaattcctt 2040 taaaaaagtt gttcttgaga cagaagaggc gcgaacccag agaattaaag attggagaga 2100 tgatcttgcc aaagactatt tcaaaaatgg aattaaattg gaaattaagg aaaaattggc 2160 atttattaca acggatacac tcgaagatac tattaaccaa gctaaacaga ttgaggaggt 2220 gcaaaatgct ttgaaagaag atagaatgag gtcataccaa aataaattat ctgaaaaggc 2280 tatacttgaa gttaacgctg tgagagacca ggtaaatgaa cttaaagagc aggttgatcg 2340 tcgcatgcca acacagcctg ttaggggaaa ttataacaac tggagaggaa actttaatac 2400 tcgaggcaat tggaaccaac ggaataatta taatcgggag caatactcta ataatagaaa 2460 taattttcaa aattataaca attggagagg agcaaatttt aaccgaggtc gtggtactgt 2520 aagatgggga caacgtcgag gagccgggtc ttttaactcg aataaaattc caataaataa 2580 tccaagaaat tttggagctg gagcttcaaa ttcagcacga attaattctt tagcaatgcc 2640 ttatattact ataatgatgc ttttaatgtt ttgccaagga atttgtggac aataccaaat 2700 ttgtccaaaa gataaaattg gtgaatcaat tgattttcca tcgcctcaag actgtacttt 2760 atcactgact catcagatgg aaattaaaaa tgttacttta tttttgccta aacttgtccc 2820 aaggtatttt ccgatttatc gatgttggat ggagaggcaa gttaaatgta cggatgctat 2880 tttgttttat cgggaagatt taccagttaa aaaagaaaga ttttcattaa gtccggaaag 2940 ttgttggaat ttatataaaa gcacaaaatt aaaaagaatt aatgaagtct tatggtcaat 3000 aatattttat aatgacttaa aatatgcgtg gtttggacag caatgtgtta tggatagaca 3060 ttattttctg gaagaaggag aaggtgcatt aatgaataaa agattcgaga ctagttttgg 3120 agtgtcgaaa tcggttgagg ctcatactat aaatggttct tttacagata aagaaaaatt 3180 attttcggtt attgtttgga acgctccatc tgaagattat atttatacac attattcatt 3240 tggcccagta caagcagaaa tattttacaa aaataattct ataaatggat tagttaaaat 3300 taatgaatta caatatgttt tcgcaccgtc cgagaacatt acaaaaagtt cagatgtgtt 3360 aggagttcct gatgaagctt ggccaatgga caatgacgtt tatatattaa tattaaatac 3420 aacagaaaga attattcaaa aaaggcaagt tacaaaaaaa caaaatttta gacgtactac 3480 taccacgact actacaacaa cactaattcc accaactatt aaaccaataa agacaacaac 3540 tcgtccaata ttaaccacaa aaaaaccaat attgagtacc acaacaacac ctattcataa 3600 taaaagaatg catacaacac ctaccattga tcctaattat cgtctcccag tacaacaaca 3660 aattaaaaat aattatttga aaggagatcg aggtttaccg ggcccagatg gaagacctgg 3720 agcacccgga gagaaaggag atatgggtta tactggacct agaggaatga aaggagcacc 3780 tggagaaaaa ggagatatgg gttatactgg accgagagga tcaaaaggag catcgggcca 3840 ggatgctgct ccagggctta aaggcgatat tggaatacct ggaaggccag gaaaagaagg 3900 attacccgga ctaaatggga aagatggact ggatggttta caaggacctc agggcccgat 3960 tggacctaaa ggtgacaaag gaagctcagg tattaatggg aaaatggatt acctggacca 4020 aaaggagagc ctggaaattg ggatgaaaat tttaaaattt tatcgacttt gtctccctta 4080 aataaaagag aattgtttga tcccgaggaa gcacaaaaag aacgagataa taaaattaaa 4140 aataatcatg aaaaaagtcg attaaattat ttatcatggg aagaagcaca agcgcgagaa 4200 caaataatga gagaacagtg gaaaagtact tgtcaaaata ggaatcaaca attggaaatt 4260 gctaaaatta tagctgaaac taatccaacc aaagcagcaa gataccttta tggaagaacg 4320 gatttaatag catttttgga gagtggaata aaattaaaat ggaatttggg attatgtacc 4380 ccagtcaatg ttgaagaaat aatatgggat cagaaagttg aagataaatg ttttgaatta 4440 cttcctgtaa aaataagagg taaattatta tttgctgcac caggaagccc tgatttgtta 4500 atggagtcta aaacaaaatc ttgcgaggaa caaccaatta tattaaataa aacaaaatta 4560 ttgaattttg atcaagaaca acattcatta aaaattaaac ccataattga tgtaaagaca 4620 ggaattcttt ttaaagctgg aaataaattt gaggaggaga aagcagaggt gaaagatata 4680 caaaaatttt cagatttaat tcaccaatct atgacattgc cggaagacat taaacaacca 4740 gaagttgagg atgtactctc agaaattttt ggcttaccaa aaatattgaa agagaaagga 4800 agtaatgtta ctggtgaagt ttttgagaat ttgaaggaaa atcttgaggc gggtttggaa 4860 aaagggaaag aagttgctaa ggatataaca gaaaaaggaa gtgaaataat aagtgatgca 4920 aaagaattta ttgaggataa gtggacattt atgcagagac tatattggat tttaatggct 4980 tttctaattt tatcagcctt attagtaaat acttatttat tttggaaatt ccgcgcatta 5040 ttctcgctca tatggaaagg aataatgatc ggaagaacgg ttattaaatt tttctttgga 5100 ggaaaagagg gcagccatga aaggaaagaa attaaagtaa atgctataga aatggaagaa 5160 ctgaacccca aaccttcagc ccccacttta aatgatctgg aagaaaaggc ttacattttg 5220 gattatattc cttctgtttg ttctgtgaaa agtcgaaaaa gatgctatgt tgaagtatta 5280 tttggaggaa aaattcagaa agcactcatt gattgtggcg ctgatatatc ttattgtgga 5340 gaatcagttg cttcaagatg tggtttaaaa ataaattcca aagatgtacc aatggcgtgg 5400 gctgcaaatt caacaccaat tacattttta ggttctgcaa tggtaactat tgaaattgga 5460 ggatcaacaa tgagatggcc attcttggta tcagaggata aatcatgtcc tggaggttta 5520 gtaattggaa cagacttaat ggaagagaag gaggaaatca aattaaattt taaaaataaa 5580 acaattcagt tgggagaaga tattttacca atgattgcag caatggaata cgttgaatta 5640 ccaaaaagac aaatagaagt aagactgttg gaaaattatg ttttgcctcc attatctgat 5700 tcactcttgt ggggaactgt taatagaatt tttgatccaa cacaacagtt cctacttgaa 5760 gaatggagcc agcatgagta ttggccagta aaaattggaa gaactctctc tcaaccagga 5820 tcttcgcgtt taattccttt aagaatatta aatttcggaa attcgcatgt tcaaatttat 5880 gggaaatcac gtgttggaat tttggaacct gtttatgata atgaaaagaa agccaattca 5940 gttgaagtat ttatgagaaa agatgaatat gtttccccag aagttaattg ggaagatgaa 6000 ttacctctat tacccaattt tgaaaagaaa agtgaaaaga tttcagataa attaaattta 6060 gaaggtactt ctttgtctaa acaaggaatt gtctctttaa aaaatatggt tgatgaaaaa 6120 agtgaagcat ttgtaaaatc ggatggtatt attggtctat acaaagggaa catagtacac 6180 caaattgagc tggaaccagg aacacgtcca gttcaacaaa gaccatatac cataccacat 6240 gccttgaggg aagaggtcga aaaacaaatt aaagaaatgt taaaacagaa tataattaaa 6300 ccatcttcct cagcctgggc atcaccaatt gttcttgtaa aaaaagcgga tggcaaatct 6360 tggagatttg cagttgatta tagagctctg aataaatgca cgaaaaacaa acttacttat 6420 taccgcgaat tcaagattta cttgatgtag ttgggggaaa gagtctcttt tctatttttg 6480 atctacaaag tggattccat caggtaaaaa tgaataaaaa gcacacggac agaacagctt 6540 ttattaccca ttgtggcctc tacgaatttt tgcggctccc ttttggtctt gctggagcac 6600 cacacacgtt tcagaaagta atggaggaaa tgagacaaca attgtctcga agtttcttgg 6660 tatatctgga tgatgtaata ctaggaagcg aaacggaaaa tcaacattta aatgatttaa 6720 atgcattttt gaatgttttg tgtgaggttg gaatgaagct cagagcagaa aaatgcagat 6780 ggggttgttc tgaaattcga tatctaggat tcttgatatc cgaaaaagga gtgcgtctgg 6840 atgattctga tttaaaacca atattaaaaa ttaaaagacc ggaaaattta gctgaattac 6900 gatcactgat agggatgttc agttacttta gaagatttat tcgtggattc gcaggaatta 6960 tggcaccaat atatgacctt acgaaaaagg aatctactag ggattggaat gaaaaacatg 7020 ataaaatttt ggaggaaatg aaaaagagat taacatcggc accgattttg gccaccccta 7080 aatttggaag accttttata ttagaaactg atgccagtgg aactgctatt ggagcctgcc 7140 ttttacaaga aagtgtccaa aaggaaattc accccattgc tttttattct agaactctaa 7200 ataaacacga gaaaaattat agtgttgtgg aattggaagc cttggcttta gtatctgcac 7260 taaaacaatt tcgagtatat ttagaaggag cggggacttc gacagtaatt accgataact 7320 ctgctttgac ctcactattt cgaaggaagg atcttcaagg acgtttagct cgctatcaga 7380 ttgtattaca agcttttgat gtgaatatta tttatcgccc agggaaattt aataaggtcg 7440 gagatcatct ttcgagatac ccgcctgagg cagatataga aattaaatca atagaagttg 7500 agaagaaaat ttcgattgat gaattaagta aagcacaaaa atgtgatcgt gaaattttat 7560 taaaacgcca agaaaaggaa gaaaatatta aggaaataaa tggagttatt tgggaaaaga 7620 taaaaaatga ttggaagatt ttaatacccc aggcaataaa aaataaaata attaaagaat 7680 ttcatgagga tccactgcaa ggagctcatc ttggaattaa tcgtacgtta gagaaaatta 7740 aaagatcgat gcattggaaa ggattagcaa aggatgtcgc cgacagaata cgaatttgtg 7800 aggtctgcca aaagaggaaa gtcgtgggtg tacacatgag tagagaggag atttgtccaa 7860 tagaaccagc ttcaagaccc ttcgatagag tacatttgga cctgttgggt ccaattcaga 7920 aatcatttcg agggcatcaa tatatttttg tggcggtgga ctcattcagt aaatgggcga 7980 ttgctgtgcc cattaggaat caaaccgcga ccacaatatc cgagatattt ctaaacgaaa 8040 ttatttgtcg atttggaata ccttcattag tagtcacgga tcaagggacc caatttatgt 8100 catccacgtt tacagactta gcacaagcaa tgaatttttg ccacaaacct acaaccgctt 8160 atcatcaatc ggctaatgga atggttgaac gttttaacag aacattagca gatatgatag 8220 ctacctgcac tgaacagggt aaaaagtgga tggacgtatt gccacaaatt attttcgcat 8280 ataatacctc gtataatcaa caaataaata attctccatt ttatgttatt catggattcg 8340 ttcccaagct acccacagaa gctactcttg ggattgagaa agaaaaattt aatgatatga 8400 atttatatgt gaaaaatctt gtagaaaatt tggaattggt aagatcagat gtcaagtgcc 8460 ggttacaggg gaatatggat atcatgagaa agcaacaaat taagattaat aagaaaaatt 8520 gggagaaggg agagaaggtc ttggtgaaga aaacggaaaa tattaagaag tttggagata 8580 aatatgaggg cccgtttaca attgttgagc tccaatgccc aaatttatta atttctgact 8640 cggggttggc agatgaggca tggtttgttc atatggatag atgcaaacca ttttattctg 8700 aagaaatttt aaataaaaaa gataggagag tggctggtga tactaatttg gagcagtcag 8760 atagtgatga ggaggatcaa atacaaatta attctgtaac taacagtttg ggattaactg 8820 cacctaacaa aacaaatgct attactcata actcaagctt tgaagaggat actaaaaaat 8880 ttactaacac acatgcgaca acggaaacta atttgaatat aaattttcag ataaaaaaag 8940 atttaattaa atctaaaaat aatattatcc taaaaatgcc taattctaat tcaacttcat 9000 catcaataat tattgaacaa cagaataaac tcccatggca ctttgaggat gtgtcatcac 9060 ctgaactttc tgaggatgag gaaagaatta tgccatattc aaaagaagat aagggaaaag 9120 aaaaagacaa aaagattgcc aaaaatattg aggacaataa aagtagaagt gccggagtga 9180 agaggaagaa ggaggaacag gaaatcgcta attctgttga ctgggaaatg gcataccgtt 9240 tgaagagaat tagaagcaaa tacgaagacg acatgctgac gaaacataat gatatatgga 9300 aagagattct gaaaaaaggc gatcagttcc attctaagat acaggataag ctggacaaag 9360 aaagaaaagc gttggaaaag gagcggaaag agaagaagaa agggagaaaa aacgaaaaga 9420 agcaaacaag aaaggaggct gaaactgaag ctgaaatagt aaagaagaaa ctggatgatg 9480 ggaaaaagaa gcctagagaa gccgaaatga aaaatgaagg ggatactaag ccaaaagaga 9540 agaagactgg tgttggtgat tttgcaaaat ttgctagccg agtggtgcaa agagcagtgg 9600 ttgataaaat agacgatgtc gttcctgaag agttacctga agataaggag aaaataataa 9660 agaagaagga agaagaagaa aaattgttaa aattggggaa atatcaagaa aaaatgaaat 9720 ctcgacgaaa ggattttgat gaggaattcg tcgctagagg ttataaaacg cgagaatatt 9780 ctatgtcacc agttagacgt tcggttgtgg gatatataga tgaaaaggac aaaaaagata 9840 aaatgggaaa aattaaagct aaagaaaata agaaggtgga aataactaca aaggagattg 9900 gagtgaacac tgtttttaat actttggttt ttaaccgaat tttggaaaac cttgatgagg 9960 tccaggaatt tttaacctta aagccaaccg cgaaatgatg ggacaacaac aatcaataaa 10020 taaaaataaa acaacaaaaa aaattaataa aaaaattaaa aatttcaaaa aaaaaaatat 10080 aataatcaca aatattcata attacatcag atattttcat caaactatta ataaatattt 10140 ctttgtcata ttaatcacaa tattcaacaa cacaattact atcacaatta ttttaattca 10200 taccaaaatt attttaaaat atgtttatta aaattaccca cataaggaat gaccattgga 10260 agaacacaca aaaacaaaaa ttaataataa agtgaacatt gatgtttaat aataaaatta 10320 attttaaata aattaaatat tttaattcaa tttgatgaaa tatttaataa atgtttaaaa 10380 ttaaaaggaa attcttcgat tggtttatgg aatgctttcg aattttaaat aacttgtgtc 10440 ttccttttgg ccactcaaat ctttctgcca cctggcccct actaaattga aaatacaaaa 10500 gaaagaaaaa tttaataaaa gatgaaaaag aaaaaaaaat tgattaaaaa taaaaataga 10560 gaaaataaat tttaagccaa atttaatatg tttgtagggc cctactcctc caagttcacc 10620 ttaaagctac caaaacatct ttaaacttta aaagttcaag gatgaacttg gagggaggag 10680 gggaa 10685 // ID Penelope-4_CQ repbase; DNA; INV; 4524 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE Penelope-like element from Culex quinquefasciatus - consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4524 RA Jurka J.; RT "Penelope-like elements from the southern house mosquito."; RL Repbase Reports 11(1), 603-603 (2011). XX DR [2] (Consensus) XX CC >98% identical to consensus. XX FH Key Location/Qualifiers FT CDS 1880..4426 FT /product="Penelope-4_CQ_1p" FT /translation="MHSAKTLNQTTITMATKPSEHNKFIGLKKIIAGIHKD FT IAFLKKCVEHRVTPTSHKIRTKGHTDSKIIRRVELELVKESIKKLYGRLSI FT KTLECYSLHLKLAKEFPIEFEAFLTKVKVAEKCESERKRKILDNKFRQLIS FT HIAPTMKPVVKPLEEFVVNRSSETFSEEQLTLLNKGLKYAVSSKPNLENIV FT IDIETVINANSIDEKQTKPPPLLPAQINSVRMTVSKLIREATTKHQDNAEV FT RVVKELKEKPVYYLKADKGNRVVIMDRDEYDKELLEKLGNREKYSLQRGYT FT LIDMVKKVENTIKECAGLLKSVGKTAKSLRVPNPTLPRIKALPKVHKPGNE FT MREIISAVDAPTSRIAKWLVEEFKSMPKPFPSRSIISSQVFTKELLESGPI FT AEDEIMVSFDVTALFPSIPVKNAIGTLRDWLQSQYEGEPWRQKTIQYITFV FT KLCMEQSYFQFRDCIYQQRMGASMGNPLSPFMSDVYMAALEHELQTKNLLP FT ERWWRYVDDVFCIIKRDSLTTVLDTINSARRSIKFTYEMEVEGKLPFLDIL FT ILREPGCPSSFSFEIYRKPTNTRRTIPATSNHSFQHKMAAYHYFIHRMTTL FT PLSEAGKQKELEYIFETAEINGYSRTTIQAIIDKKERKLHRTSLTTLTPSA FT EPLRRAAVEFDYRITRPLRQKLAKFGIDLVFSSRNSQLESILGSTKDPIPT FT LGKPGIYEVSCGHCDMSYIGQTKRLLQTRLDEHIHTYVKIAMEEKRKGFVP FT HFKSAVTEHIIEEKHNITASDAVILRHITNPSKLAVAESLEIFKAGNKRLL FT NKDSGNGSTWLFKLLPIQAQKKNEEVVPTVTTNTDTPAMNLQSRTEH" XX SQ Sequence 4524 BP; 1390 A; 1000 C; 977 G; 1157 T; 0 other; gaattgtggt ttggttgagc gttttagcag aatgcattac tgtgaataca aagagacgag 60 ttgttccttt taattccgct atatatttta atatcttgac agatacgtat ttcgtctact 120 acttgcagac ttcatcagtg ttctgttctc gactgaaggt tcatcgctgg tgtatccgta 180 tttgtagtta ctgtaggtac tacctcctcg ttcttctttt gtgcctgtat aggtaacagc 240 ttaaacagcc acgttgagcc gttacctgaa tccttgttga gtaaacgttt gttgcctgcc 300 ttgaagattt ccaaactttc ggcgacagcc agcttcgatg ggttagtgat gtgtcttaag 360 atgaccgcgt cgctagctgt gatgttatgt ttttcctcga tgatgtgttc ggttactgct 420 gacttgaaat gggggacaaa tcctttccgt ttttcctcca tggcaatttt gacatatgtg 480 tgtatatgct catccaacct ggtttgcagc aatctcttag tttgtccaat gtagctcata 540 tcacagtggc cgcaggatac ctcgtagatg ccgggtttgc ctaaagtcgg tatggggtct 600 ttcgttgaac ctaagatgga ttccaactgg ctgtttctac tgctgaacac taaatcgatg 660 ccaaatttag ccagtttttg tcgtaatggg cgtgttatcc tgtagtcgaa ttctactgct 720 gctcttcgca gtggctctgc tgatggcgta agggttgtga gtgaagttct gtgcagtttt 780 ctttccttct tatcgatgat tgcttggata gttgttctgc tgtaaccgtt gatttccgct 840 gtttcaaata tgtactccaa ttctttctgt tttcctgctt cgctgagggg taaagttgtc 900 atcctatgta taaaataatg atatgctgcc atcttatgtt ggaatgaatg gtttgacgtg 960 gctgggattg ttcttctggt gtttgttggc ttcctgtaga tctcgaaaga aaacgaagat 1020 gggcaacctg gttctcttag gatcaggatg tctaagaaag gtaactttcc ttcgacttcc 1080 atttcgtaag tgaacttgat gctcctgcgt gcactgttta tggtgtccag taccgttgtt 1140 agtgagtccc ttttgatgat gcagaagacg tcatctacgt atctccacca acgttctggt 1200 agtaggtttt tggtttgtag ttcgtgttcc aaagctgcca tataaacatc gctcatgaac 1260 ggtgacagtg ggttacccat cgatgctccc atcctctgtt ggtagatgca atccctgaac 1320 tggaagtagc tctgttccat gcataatttg acgaatgtta tatactggat cgttttttgt 1380 ctccatggtt ctccctcgta ttgtgattgc agccaatccc gtagagttcc tattgcgttt 1440 tttactggta tgcttgggaa cagtgctgtt acgtcgaagg aaaccattat ttcatcctca 1500 gctatcggtc ccgactctaa gagctccttt gtgaacacct gcgaactgat gatcgaacgg 1560 ctcgggatca ctaacccatc gaagctggct gtcgccgaaa gtttggaaat cttcaaggca 1620 ggcaacaaac gtttactcaa caaggattca ggtaacggct caacgtggct gtttaagctg 1680 ttacctatac aggcacaaaa gaagaacgag gaggtagtac ctacagtaac tacaaatacg 1740 gatacaccag cgatgaacct tcagtcgaga acagaacact gatgaagtct gcaagtagta 1800 gacgaaatac gtatctgtca agatattaaa atatatagcg gaattaaaag gaacaactcg 1860 tctctttgta ttcacagtaa tgcattctgc taaaacgctc aaccaaacca caattaccat 1920 ggcaaccaag ccttcggagc acaacaagtt catcggactg aagaaaatca tcgcaggtat 1980 ccacaaagac atcgccttct tgaaaaaatg cgtcgaacac cgagttacac caactagcca 2040 caagattagg acgaagggtc atacggacag caaaatcatc cggagggtgg agttagagct 2100 tgtcaaagag agcatcaaaa aactctacgg tagactgagt atcaaaactc tcgaatgcta 2160 ttcactccat ctgaaactag ccaaagaatt ccccatcgaa ttcgaagcct ttttgacgaa 2220 ggttaaggta gccgagaagt gcgaatcaga aaggaaacgc aagatactgg acaacaaatt 2280 ccgacaactg atttcccaca ttgcccctac aatgaaacca gtagttaaac ccttagaaga 2340 gttcgttgtt aatcgttctt ctgaaacttt ttcagaagaa cagttaactt tactcaacaa 2400 aggtcttaag tacgcggtat catcaaaacc aaacctggaa aacattgtta tcgacatcga 2460 aactgttatc aatgctaaca gcatcgacga aaaacagaca aaacctcccc cgctccttcc 2520 agcacaaatc aacagcgtca ggatgacggt atccaaatta atacgagaag caacaacaaa 2580 acaccaggac aacgccgaag tacgagtagt gaaggaactc aaggagaaac ccgtgtacta 2640 cctaaaagcg gacaagggaa accgggtagt aatcatggac cgggacgaat acgacaagga 2700 acttcttgag aaactcggaa acagggaaaa atattcacta caacgaggat acacgctaat 2760 cgatatggtc aaaaaagtag agaacaccat caaagaatgt gcaggccttt tgaaatcggt 2820 tgggaaaact gcaaaatcgt tgagagtacc gaacccaact ctaccaagga ttaaagcact 2880 acctaaagtg cataaaccag gcaacgagat gcgagaaatc atttcagcag tggatgcccc 2940 aaccagtcgg atcgcaaagt ggcttgtcga ggaattcaag agtatgccga aacccttccc 3000 gagccgttcg atcatcagtt cgcaggtgtt cacaaaggag ctcttagagt cgggaccgat 3060 agctgaggat gaaataatgg tttccttcga cgtaacagca ctgttcccaa gcataccagt 3120 aaaaaacgca ataggaactc tacgggattg gctgcaatca caatacgagg gagaaccatg 3180 gagacaaaaa acgatccagt atataacatt cgtcaaatta tgcatggaac agagctactt 3240 ccagttcagg gattgcatct accaacagag gatgggagca tcgatgggta acccactgtc 3300 accgttcatg agcgatgttt atatggcagc tttggaacac gaactacaaa ccaaaaacct 3360 actaccagaa cgttggtgga gatacgtaga tgacgtcttc tgcatcatca aaagggactc 3420 actaacaacg gtactggaca ccataaacag tgcacgcagg agcatcaagt tcacttacga 3480 aatggaagtc gaaggaaagt tacctttctt agacatcctg atcctaagag aaccaggttg 3540 cccatcttcg ttttctttcg agatctacag gaagccaaca aacaccagaa gaacaatccc 3600 agccacgtca aaccattcat tccaacataa gatggcagca tatcattatt ttatacatag 3660 gatgacaact ttacccctca gcgaagcagg aaaacagaaa gaattggagt acatatttga 3720 aacagcggaa atcaacggtt acagcagaac aactatccaa gcaatcatcg ataagaagga 3780 aagaaaactg cacagaactt cactcacaac ccttacgcca tcagcagagc cactgcgaag 3840 agcagcagta gaattcgact acaggataac acgcccatta cgacaaaaac tggctaaatt 3900 tggcatcgat ttagtgttca gcagtagaaa cagccagttg gaatccatct taggttcaac 3960 gaaagacccc ataccgactt taggcaaacc cggcatctac gaggtatcct gcggccactg 4020 tgatatgagc tacattggac aaactaagag attgctgcaa accaggttgg atgagcatat 4080 acacacatat gtcaaaattg ccatggagga aaaacggaaa ggatttgtcc cccatttcaa 4140 gtcagcagta accgaacaca tcatcgagga aaaacataac atcacagcta gcgacgcggt 4200 catcttaaga cacatcacta acccatcgaa gctggctgtc gccgaaagtt tggaaatctt 4260 caaggcaggc aacaaacgtt tactcaacaa ggattcaggt aacggctcaa cgtggctgtt 4320 taagctgtta cctatacagg cacaaaagaa gaacgaggag gtagtaccta cagtaactac 4380 aaatacggat acaccagcga tgaaccttca gtcgagaaca gaacactgat gaagtctgca 4440 agtagtagac gaaatacgta tctgtcaaga tattaaaaat atatagcgga attaaaagga 4500 acaactcgtc tctttgtatt caca 4524 // ID L2-1_NVi repbase; DNA; INV; 3238 BP. XX AC . XX DT 15-FEB-2009 (Rel. 14.02, Created) DT 15-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3238 RA Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(2), 478-478 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Nasonia CC Genome Project. XX FH Key Location/Qualifiers FT CDS 491..2803 FT /product="L2-1_NVi_1p" FT /translation="MIRLFLSTRSPFHVIAVTETWLXDKITSIPSLDDYXL FT YRRDRNRNGGGVALYIHKSLTVSVISSSDGEWSGKPGKPEYLFCEVSAKGV FT SPIFVGVVYRPPHAPFFQGSNFIDQLTTHMHNYSTKVIMGDFNSDQLSSSE FT DANFIRAFIDENSLSSVPYGATHHKQGSDTWLDLCLIDEQDRLLSRWKTDT FT PFINGHDLITATLDVQIPRYVPNTYSYRNYKGISAEKLRDFLSACDWSSLT FT TSSLDECISILNANLTNAINHLAPLRTVTPRRKRHPWFTTALRDLVSERDG FT LYRRFRDSRLDSDLRXYRLARDDAHRQVEEARLNYYYSRLSTLTDVAEIWR FT ELEKLGISASKAPSPSRFSTDELNKHFSSISNDPPAPAVEDYLLTLESLDL FT PEHFKFSTITESDVLAAVSHFDTQARGSDGIPQVVISKALPVLAPLLCQIF FT NLSLSESRFPSAWKLSLVRALNKVSSPTALTDYRPISLLCFLSKALEWLVH FT RQVSEYLESRLFLDNXQTGFRTGHSTQSGLIKLTDDVRVGINKKKVTLLLF FT DFSKAFDTVCHVRLLRKLSXFGFSKQVIRWFASYLTGREQAVVGDNSERSS FT SRPLNIGVPQGSVLGPLLFALYINDIGFCLDSDVSHLIYADDLQIYSQCHL FT EELDSXSNKMSANAERIMGWAAQNRLKLNVIKTKAIVLGSPYYINALPSVA FT NTFINIGGARVNXESSVRNLGLVLDSKLTWKEHVTQXCKRAHSLMYRLYFF FT RKSTNLRLRKHLVQALLFQ*" XX SQ Sequence 3238 BP; 826 A; 898 C; 692 G; 793 T; 29 other; gtgacggcag accgccacca ytagccgtca ccctctcttc aagtgcgctt gcacgctcga 60 tcgtcgtcgc caaagcccgg aaacgcaagc tcaccarcga aytggacgct accttgctgg 120 aggaggcaaa agctctgagc cctgaccatc aagggctcat aaacatcaac gagctgcttc 180 cttcagacgt ccacaagctg cgtacaargg ctaggctgga ggccaagaag aggcagggct 240 gccgaacttt cgtsagagac gggagactat acatgcgctg cgacgatgac agcgagcgca 300 ccgacgccga gctggagact tttttagccc ggattccgcc agctgccaac atcatccaac 360 aagctaacac acacacacgt cattccacca caaccaccat caarctcatc cgccacgtct 420 tcatctatct ctgtccgaag gtctaagagt ctgtcatttc aatgcgaact ctcttacggg 480 tcacattgag atgatcaggc ttttcttgtc cactcgctct ccatttcacg taatagctgt 540 aactgagacc tggctaarcg acaagataac atccatccct tcactggatg actacmtact 600 gtacagacga gacagaaaca gaaacggagg aggtgtggcc ctctacatac ataaatcact 660 gacagttagc gttatttcat catctgacgg tgaatggtcg ggcaagccag gcaagccgga 720 atatcttttc tgtgaggtat cggcaaaggg agtctctccc atcttcgtgg gggttgtgta 780 tcgtccacct catgctcctt ttttccaagg ctctaacttt atagaccaac taacaaccca 840 catgcacaac tattccacga aggtcataat gggwgacttc aactctgacc aactttcctc 900 atctgaagat gccaatttca tcagggcctt cattgatgaa aactcccttt cgtctgttcc 960 ctatggtgcc acgcaccaca aacagggttc tgacacctgg cttgacctgt gcctaattga 1020 tgagcaggat cgcctgctgt cacgctggaa gacagacaca cctttcatca acggacatga 1080 ccttatcacg gccactctcg acgtacagat tccacgctac gtacctaaca catactctta 1140 cagaaactat aaaggaatca gcgccgagaa gctaagggac tttcttagcg catgtgactg 1200 gtcatccctc accacgtcat cactcgacga atgcatatct atacttaacg ctaacctaac 1260 gaacgccatt aatcatctcg ccccattacg gactgtgaca ccaagacgaa aacgtcaccc 1320 gtggttcacc acggctcttc gtgaccttgt atctgagagg gacggacttt acaggcgttt 1380 cagggactct aggctcgatt cggatctccg cwtttataga ctagctagag acgatgctca 1440 cagacaggtc gaggaagcca ggctgaatta ttattattca cgcctgtcaa ccttgactga 1500 tgttgccgag atctggagag agctggagaa acttggaatt tccgcctcta aggccccttc 1560 accatctcgc ttctccacag atgaactcaa caagcatttc agctcgatct ccaatgatcc 1620 gccggctcct gctgttgagg attatcttct caccctggaa agtctagacc tcccggaaca 1680 tttcaagttc agcactataa cggaatcgga cgtgttggct gcggtatcgc actttgacac 1740 ccaggccagg ggaagtgacg gcatcccaca ggttgttatt tcaaaagcac tgccagtact 1800 cgctccttta ctatgtcaaa ttttcaacct gtccctgagc gagtcacgct tcccatctgc 1860 ctggaaattg tcgcttgtgc gagcactcaa caaagtcagt tcaccaacag ccctgactga 1920 ctaccgtccg atttctcttc tctgctttct atccaaggcg ctggagtggc tggtgcacag 1980 gcaagtctca gaatatcttg aatcaaggct cttccttgac aatyttcaaa caggcttccg 2040 cactggccac agcacwcagt ctggcctaat taagctaact gatgatgtca gggtygggat 2100 aaacaaaaag aaagtaacac ttcttctttt tgattttagc aaggcgtttg acactgtgtg 2160 tcacgtcagg ctcctgagaa agctatccwc tttcggcttc tcaaagcagg tcatccgctg 2220 gtttgcctcc tacctcactg ggagagagca ggccgtcgtt ggtgacaata gcgaacgttc 2280 ctcttctcgg cctcttaaca tcggtgtccc gcaggggtcc gttttgggtc ccytgctgtt 2340 cgcattgtac atcaatgaca tcggtttctg cctagattcc gatgtktccc atctmatcta 2400 tgcggatgac ttgcaaattt acagccaatg ccaccttgag gagctcgatt cttkatctaa 2460 caagatgagt gctaacgccg agaggataat gggctgggct gcacaaaacc gtctaaaact 2520 taatgttatt aaaactaaag caattgtcct gggctccccc tactacataa atgctttacc 2580 ttcagtagct aacaccttta ttaatatagg gggwgcccgg gtcaactwtg aatcwtctgt 2640 gcgcaatctg ggattggtgc ttgactctaa actcacgtgg aaagagcacg ttacacaawt 2700 gtgtaaacgt gctcactcac taatgtacag gctttacttt ttcagaaaga gtaccaatct 2760 cagrctgcgc aarcaccttg tgcaagcgct ccttttccaa taatcgacta ctgctcactt 2820 gtgtactgtg acctgactca agaacttgac acaaaactac agagacttgt gaacgcagga 2880 attcgttaca tctatggtgt aaggagggac aagcacatct ccccgtacag gcgtgagttg 2940 caatggctta ccaccgccgg acgcaggaag taactgttta attcagctgt accatcatac 3000 gtactggcct aytttgactt ccgcgtcgca ctccggcctg tgagggggga ggtgacacct 3060 ctggatatcc ckaccttcgc gacggagacg ctgaggaact cgtttcatat cagcgcctca 3120 tacctctgga acgryctacc atcacacatt cgcaacacga catccatcac cagcttcgar 3180 aaacttgcta aagaatactt tttcgaactc gaaaacacat cttgcacaca cacacaca 3238 // ID DNA7-1_AAe repbase; DNA; INV; 3206 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA7-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3206 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1280-1280 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. 7-bp TSDs. TIRs are ~500 bp long. CC The region 1700-1413 is an inserted FEILAI_AA (~96% identical to CC consensus). XX SQ Sequence 3206 BP; 1039 A; 554 C; 540 G; 1072 T; 1 other; cacataacat atgtggaact cagaagtgaa agcgagttga aaagttgtac ccacttccag 60 ctgctggcaa gtggtatatc tagtatagaa gacatcgaac tgtcattacc aatagtaaac 120 aaaaaacata tgattcgagc agtagaccag ttgctgcgaa gattacagac attaagatcg 180 ggggacggtc gacaacgttg ggaagcaatg tctaaagata aaacatgttt attttttcat 240 aaaatgtaat tttttttctg tcatactata taatattcat gtttcataac aacagaatgt 300 gatcgtttcg gaaatattat ttcctatcaa aaaacttcct gtttcataac gtgaaaaagc 360 ttttgctttc caaaatcctt ccattcattc acctaaaaaa tacaatatgc tagaaacaac 420 acagatatct aatacattga ctaaattatc tcggatacga gtggtgtaaa tctaattcta 480 aacttgtacc ataaatagtt attttcattc ccttcgttgc cactctggat atatgttatc 540 tttggttcat ttcaatacaa tgccacattg ggccagaccg caaaattaga tgaaaaaaaa 600 attttgatta tgaagatgat tttttagaac aaaaatggct gaggcaaaaa tgttacatat 660 gatgaggact acattttgac attaagtgac attagggtgg tccaacaaaa tacggaaata 720 tttttgccaa cttttatgct ttgcaattta acgcttttaa gtgttctatc aagttgatct 780 ttgtttaatt ttgaacaact ttgccgaaga cgccaatctt gtaagtctac tgtagccgtt 840 gctatggcgt tttcaatgat gcagggtagg gtggtccatg aaaatttgat attttgckca 900 ttactttttt atttcaaatt ctaccaaaat tacgccttct acaaagatct agaactaatt 960 gaaacgcgta ttttggtgaa agaattaagt tattctattc attcgtttat gagatatgac 1020 catttttatt ccaataaata atatttttcg aacttcaata tcttcaaaag ggggaaaaac 1080 ggggatgcaa tttgtgcagg atttgaaagg tataacttag aggttcaaat ggtatatggt 1140 atctccggag gatattggag gatttacagt ataattttga tgtgaaaaca aatagttttt 1200 acaaactcat gattttagtt gaaatattga acattacggc aaagttctaa acataatttg 1260 ggtgaaaact gttcaaccga atatttttta gcatattata cttatttata taatgtttag 1320 attcatattc cgaacacttg cggtttacgc tgacttgaaa tgcaacttat aggcgaaaat 1380 cagcctgagt tggtgataaa aatattaata tcttcttctt cttcttggca ttaacgtccc 1440 cactgggaca gagccggctt ctcagcttag tgttcttatg agcacttcca cagttattaa 1500 ctgagagctt tctttgccaa agttgccatt ttcgcattcg tatatcgtgt ggcaggtacg 1560 atgataatct atgcccaggg aagtcaagga aattttcttt tcgaaaagat cctggaccga 1620 ccgggaatcg aacccagaca ccttcagcat ggcgttgctt tgtagccgcg gactctaacc 1680 actcggctaa ggaaggcccc gaaaaactta caataaaagg agatattaat atatctcctt 1740 ttattgtaag tttttcgcaa aagcattgaa caacattttg tttccattgt caatcactgt 1800 gattattctg ccacatattc cgatcactgt gattcatatt ccgaacagca cgaataaatc 1860 ctattcatat aaataattta gcatacaaat taatctaagc tggtttatct gacatcgaac 1920 tagaaagtca ttactactcc cagggtataa aataggttca aacacaataa atttgagttg 1980 caattgactg ccacttcctg ggaatttgat gacatatatc ggtgaaatat ttcaaccaac 2040 acgccataca aaatccgcgt gtttggaata tgatactgat tgtttaatgt tttaaatcat 2100 caacgaaaat gttttgattc gttgtatatt tccaatgagc aggctgtgta tggttgaaag 2160 tgattaaatt ttatttaaaa aaaaattgga ctatgaaatt ggtaatgctt tttggttttg 2220 tagaaaaatt actttgcaat ggattacctg cttcgctctt gtccacagtt gatcagcaga 2280 agttttctcg ttttattttg tatggggaaa ggcatttaac ttcaaatttc tcgtaaatga 2340 aatcattagt caaaaaatat ttggtaggcg tgatcaccat caccatcaca cacaatcggt 2400 accaaatatt ttttcgaaaa tagtttccaa tttggagaaa taattcgtta atgtatttcg 2460 ccatacaaat attttaaaat tatacggtaa atcctcaaat attctccagc gatacaatat 2520 accatttgaa cctccaagtt atacctttca gatcctgtgc aaattgcatc cccgttgttc 2580 cccctttttg agatactgac gtttgaaaaa tatgattttc gaattaaaat ggtcatatct 2640 cataaacgaa tgaatagaat aacttaactc tttccccaaa atacgcgttt caactagttc 2700 taaatctttg tagaatgcgt aattttgata gaattggaaa taaaaaaagt tatgggcaaa 2760 atatcaattt ttcattgacc accctaccct gcatcattgg aaacgccata gcaacggcta 2820 cagtagactt acaagattgg cgtcttcgga aaagttgttc aaaatcgaac aaagatcaac 2880 ttgatagaac acttcacagc gttaaattgc agagcataaa agttggcaaa aatatttccg 2940 tattttgttg gaccacccta atgtcactta atgtcaaaat gtagtcctca tcatatgtaa 3000 catttttgtc taagccattt ttgttctaaa aaatcatctt catattcaaa atattttttt 3060 tatctaattt cgcggtctgg cccaagtgca atgtcgattg atgtattggt ggtataactc 3120 ttttgacatt tcgatgtcct tataccagca tgcaaatatt aaagtgggtg ccactttttg 3180 agctcgagtt ccacttatgt tatgtg 3206 // ID piggyBac-9_SM repbase; DNA; INV; 2774 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-9_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2774 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 528-528 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-9_SM is a young family of piggyBac transposons, CC characterized by 12-bp TIRs and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of 26 copies, which are ~98% identical to the consensus CC sequence. XX FH Key Location/Qualifiers FT CDS join(920..1316,1374..2671) FT /product="piggyBac-9_SMp" FT /note="piggyBac transposase." FT /translation="MSSHPRAAKAFALRAILGESSSSSSDNEVVLEADDVH FT EIKSNDEEQLSYDDTESEDADNLQVEINPADLHEQDDINRFTDSNGNVWVT FT PNEGMSGRVNAANILRPSPGITRWATQRIGENSVLDCFNLLFIKSTMETIL FT IETNREGSKLYNNQFTALSLNELNAFFGILILRGVYRAAGESIDELWSAEH FT GRFIFRNTMSLNRFKLIRRILRFDNPDTRKGRLLRDKLAAVRLLIDNFVEN FT SQKCYMPGESVTIDEQLYPYRGRCKFVQYMPSKPSKYGLKFWAVCDSQNYY FT CWNLRFYMGKEDDRDMQVSYGEYITMDMLQGLYGSGRNVTCDNFFTSLNLS FT KRLFQNGITMVGTIRASRRDIPKHLTRYQGREVFSTIELITTSTADRRDKA FT LLVSYVAKKNKVVNILSTAHNKITGYVGSKHKPDVLLFYNATKGGVDAMDE FT RVATYTTKFKCRRWHVAFFCNILDISAFNAFALHYLVEPQWNANKSHRRRL FT FLIELGNSLISAYVEDRRRTAAADSLSSTPCKRGDCYMCIDRKRCTVTCVK FT CNRFICKEHLHNICKNCS" XX SQ Sequence 2774 BP; 853 A; 497 C; 533 G; 891 T; 0 other; cccgcaaacg attttttgga caaaaatgtc cattttgttt ttttagtgac gttttcttgg 60 aaactgaaaa acaccctgct ttgtgaatct atgactttgt caattttttt atgctttatt 120 caagtaaaac tagcagacat tcataactac taaaataaaa aagttatgac aagttattct 180 gttttgagaa aaaatggaca tttttgtcca aaattcattg tgtattagtt tttagctttt 240 tcagttgaca attcatgtca tattttctgg cataggcgtt aactcatttc aacaaatact 300 ggcgttaaaa tgaagctgat cagacttttt taaacaatta ttgtgactgg ctgacactat 360 gatatgttta tcacatgatg gcggatggag ttcttacaat agagttcatt gattacatat 420 ctctccatct ctgctttggc aggaatggtc ttccattaac cactctctga ccataagttg 480 ctttagccta cacacaagag aaatttaatg tagtatccgt ctcaccaaca tcttcaaaac 540 agttctgcat tcatattagt tagcttattc gctctcgtct attttgtgaa gtaggcagac 600 tataccattt tacactcatg acattggtac tctgtctgtc tctctctact catcccccac 660 acacagacgc acttagaata cctcatttca ggtatcaagt aataatgcac gtgacactta 720 caattgacaa ttgatattgg aaccaatgaa tgtctcacat aataatattc tctcacgatc 780 tttctcacct ggatttgttc atgcgtttgt ctgcactttg tcatactctc attataagtg 840 tgtgagtgtc gtgttagaat acaacattta tactagtagg gtagagggtt tgctactaca 900 taaacatatc acaattgaca tgagttctca tccacgagcg gcaaaagcat ttgctctacg 960 tgctatccta ggagaaagtt ctagtagcag tagtgataat gaagttgtgc tggaagctga 1020 cgatgtacat gaaatcaaaa gtaatgacga agaacagctg tcttatgacg atactgaaag 1080 tgaggatgct gacaatctac aagttgaaat caatccagct gatcttcatg aacaagatga 1140 tatcaataga ttcaccgatt ctaatggaaa tgtatgggtt acacctaatg aaggcatgtc 1200 aggtcgagta aacgcagcaa acattctccg cccatcccct ggtataaccc gttgggctac 1260 tcaaaggatt ggggaaaatt ccgtgttaga ctgctttaat cttctattca taaaatgtaa 1320 gtcaatttat tatgtacact aaataaaagt tataatattc ttttgacttt tagcaacaat 1380 ggaaacaatt ttgattgaga caaatcgaga aggatcaaag ttgtataaca accaatttac 1440 agcgctctcg ttaaatgagt tgaatgcctt ttttggcatt ctcattcttc gaggagtata 1500 tcgtgctgct ggagaaagta tagatgaatt gtggtcagct gaacatggac gttttatctt 1560 ccggaatact atgtctctca atcgatttaa acttatccga cgtattctgc ggtttgataa 1620 tcccgacaca agaaaaggaa ggctgttacg tgataaattg gcggctgtta gattgctgat 1680 cgataatttt gttgaaaata gccaaaagtg ttatatgccc ggtgaatctg tcactataga 1740 cgagcaactt tatccatacc gtggacgttg caaatttgtt caatatatgc cctctaaacc 1800 atccaaatac ggcttgaaat tttgggctgt ttgtgactca caaaactatt attgctggaa 1860 tctacgtttt tatatgggaa aagaagatga tcgggatatg caagtgtcgt acggtgagta 1920 tataacaatg gatatgttgc agggactata tggtagtgga agaaatgtaa cgtgtgacaa 1980 ctttttcaca tcattgaatt tatcaaagcg attgttccaa aatggtatca ccatggtagg 2040 caccattcgt gcatcgaggc gtgacatacc aaagcatttg acgaggtacc aggggcgtga 2100 agttttttca actattgaat tgataacaac aagtacagca gatcgtagag ataaagcctt 2160 gttggttagt tatgttgcta aaaagaacaa agttgtcaat atcttatcta ctgcccataa 2220 caaaataact ggatatgtcg gttcaaagca taaaccggac gtacttctct tttataatgc 2280 tacaaaaggt ggagttgacg caatggacga aagagttgca acatatacta caaagtttaa 2340 atgtcgaaga tggcatgtgg cattcttttg taatatatta gacatctcag cattcaatgc 2400 atttgccttg cactatcttg tggagcctca atggaatgcg aacaagtcac atagacgaag 2460 gctattcctg attgaacttg gtaacagttt aataagtgcg tatgtagaag acagaagaag 2520 gactgctgcc gccgactctc tatcgtcgac tccatgtaag cgtggtgact gctacatgtg 2580 tattgatcga aaacggtgca ctgtcacatg tgttaaatgt aacagattta tatgcaaaga 2640 acatttacac aacatttgta agaattgctc ataaattata gctttatttc ttggtattct 2700 gtcatttttg tgtacttgga catttttgtc caaaatgaac aaaatcgttg atttttattt 2760 ccatcgtttg cggg 2774 // ID Gypsy-71_CQ-LTR repbase; DNA; INV; 2055 BP. XX AC AAWU01040029; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-71_CQ_; KW Gypsy-71_CQ-I; Gypsy-71_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2055 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 522-522 (2011). XX DR Genome; AAWU01040029; Positions 2936 882. XX SQ Sequence 2055 BP; 490 A; 584 C; 526 G; 455 T; 0 other; tgtaaccata ttaagaacga tcatgggcga tcatcaattt cccctaaatg agcgagcgcc 60 aactgagtga ggtggtgagg ttcggaacta aaaatttaaa aaaataggtc aactctacat 120 ctcccgtaga cgatgatcgt gtgatatatt gacctggccg aaactctctc aattttccgc 180 taagtgctta gcaaccaggg caccttgcga gtaggactcg gccttggcca atcagagccg 240 gccacgagtg agccattttt cgcggtcaag gtctcgacct agtgtcgtca agggctctct 300 gccatttgct ccgagatcgt gcgctagtga agatcaccag catttaggga agtgtagtag 360 ttgcgaaaat taataagtct gccaattagc ccagccgttg aatttaagtc cgcgtgtgca 420 aagcgaagtg tatttggacg tgagtttacc cccgagttat tgtagttcct ttaattaatt 480 tagatcactt taccaatttt gtgaaattag gcccaagtca ctagactagc gtttttccgc 540 ttcccccgcg ggcgtgttgt tacaaaccag aaccaaaggt aattaaatac attaaattag 600 gccagtgatc taagttgccc cgtttcgccc gtagaacctt caggcgaagg ttcttttggt 660 aggttctaca tcccaccagc acctgcgaac gtttcacaac gcggtgcagc actgtgaggt 720 aagcaccacg ccaccgcgtg cttcacaccg gagcagccca agcaagcccg cacagcgaac 780 gaaccgtgct atccgccagg gtccgccgag agtccgtagg aggccacacc atcccctccg 840 agccacagga cgcgatcgtc gacgaggctc cagcatagct ggcccgctga gccgcccacg 900 cggccacgtg gtgtcgtccg tagaggttgc cgccatcaac ctcgagccac tggaccacca 960 tcgactttcg tccacatccg ttcgaaccgg cgattgaccc gcgacacgtg gccagcaact 1020 ttagctcgac gcacgagtcc gtagcgtgcc tgccaacgca cgtgagtcat tggacccagc 1080 cgttttccac ccagcctcgc ttgcgagtat tccgacaacg atcggcaggt gagttgtccg 1140 taggagacgc gtcaaatctc tgagcgagtg gacctcaccg gcgcctggaa gtgccaagct 1200 cgagtgccac gtgtcccagc gaagtcccac tagcgtggga gtcctgccca gtagcagcgt 1260 tcgtaccagg cggcccgaga gcccgcccag acgtagcagt agcagatcag cacctgtgat 1320 cgcgagaacc ggccggcagc gtagtcggca gcggcgttca tccgccgagc aacagtagtg 1380 ataccgcgag tcagcagccg cagcagcaga gcaacgtgtc cgtagcgtgc ccgccaacgc 1440 acgtgagcca gaggacccag tcgttcacgc cagcccagct ggccgacttc cgatcgagaa 1500 tcggacaggc gtgcgagtcc gtaggggacg cgctaaatcc tcgagccagt ggaccgcgcc 1560 ggcgactgag agtgcagcaa caaaaaaaaa aaacagcctt aggctgtagc aggtcagact 1620 aggtttaaga gatctctccc cttacacgtg cacgtgcccc aaactgtaaa ccctgccccc 1680 cctagagaat aaacaagtgg atgttaagtt taattctagg cttgattctt ttacgaaacg 1740 cactttcgac agacttgtgg tttttgagtt ccttgttcga tatctatctt cgtttttttg 1800 atgcgctttt aataagtggt tctcacttga cctgccctga gaagctaagc cgggttctcc 1860 aagtaacacg ccctttcgca gtgtagcctt cagctacact ggcgcttaag cgagagggtt 1920 tccaaaggga cggtctttcc gggtcttcgg ttggcggaaa cagttgtggt tgttgaccaa 1980 aaaaaaacta agttttggtt ccgttccatc acctcactag tcgctaatcg catccccccc 2040 aaaagaactg ttaca 2055 // ID Gypsy-48_CQ-I repbase; DNA; INV; 5708 BP. XX AC AAWU01034932; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-48_CQ_; KW Gypsy-48_CQ-LTR; Gypsy-48_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5708 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 475-475 (2011). XX DR Genome; AAWU01034932; Positions 8766 14473. XX CC Positions [4704-5165] - Integrase core CC 'GCTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1422..3398 FT /product="Gypsy-48_CQ-I_2p" FT /translation="MMDDIYVPPVSTAETGSAAPPGSTAPPASPAVLFSPA FT SYNLPQFKFRHLPTSEVSNAWIQWIRWFENIMIASGISNSDARKAQMLAMG FT GIELQGVFYGLPGAEEWGSNAYGAAKQKLTDHFSPKQHDSFERFQFWSMKM FT ENDEPIEKFLLRVQQKAEKCVFGKTELECRQYAIVDKIIQNAPEDLRRKLL FT EKDALTLDTTVKIVNAHQAVKYQAAQMKPMANQQVEVHRMVVKDKGPKDAD FT SSDTGSCSRCGYPSHRSGQRCPALDQKCRGCHQFGHFWVMCSHRKRKSNFN FT QVDRKYNNQKRYRTVRNIATEPSHETEEYPVNLIHDDDKDEYITCRVGGVE FT IPMLIDSGSTHNLIDDATWEMMKLEDVTVHSERFDNSKRFLAYGRVPLELL FT TVFDAILEIQDGPDRIQAGATFYVIKGGQQALLGKITAQRLGLLRIGLPST FT IKEDVNQVKAAKQFAKIKDYKLTLPINRDIPPVIQPLRRCPIALQSQVKAK FT LDELLEMDIIERVDSPTSWVSPLVPIMKDNGELRLCVDMRRANQAIQRLNH FT PLPVFEDMLPRFSGSKALHNVGYKASISIKLSLAEDSRDITTFVTNWGLYR FT YKRLSFGINCAPEFYQFLMESILSSCPQTAVFIDDIIIWGVTQQDHDRAVK FT KVLKVKKSGY" FT CDS 3479..4633 FT /product="Gypsy-48_CQ-I_1p" FT /translation="MQKCKFNQLEVSFLGHKLSGKGVLPTDDKVKSLLQCR FT PPRSKEELRSFLGLATYVSRFIPDLASTNHPLRELIKTSVNFRWEKIHQDS FT FEELKKRMASMQHLAFFDPKDRTLLVTDASGVGLGAVLIQFKNGVPRVISY FT ASKSLTDTEKHYPPIEKETLAIVWGVERFQMYLLGVSFTVETDHRPLETLF FT TAKSRPTARIEKWLLRLLAFRFTVIYRKGSSNIADPISRLAAQGYDDNWTE FT ETEVFIRRVMVESLAALVDERIAEDFNNDTDLHIRTVQEAAAIDISEVVDA FT TDKDTELQFVKEAIMNGDWSKDVLRPYSAFRSELSYVNELIIRGSKLVIPA FT ALRARMLSLAHEGHPGQTCMKRRLRDRCWWPNMDKEISYRDV" FT CDS 4689..5591 FT /product="Gypsy-48_CQ-I_3p" FT /translation="MERRPLPEKPWLDLAIDFLGPMPTGEHILVVIDYYSR FT FVELVVMKNITAKDTINSLTKIFRVWGVPKTITLDNAKQFVSSEFSDFCKL FT KGIHLNHTSPYWPQANGEVERQNRSLLKRLKIANALYGSWKAEMDRYLEMY FT NNTPHSVTGKAPSELLQNRRLRFKFPDLEDLSSSPLESEVADRDKISKTKG FT KEKEDARRKAKYSDIEVGDEVLMRNLHPTNKLSTDFLKEKFVVEDRNRSNV FT RVKSLESNKSYDRNVSHLKKVLGSHTEQEEHNQDVDGATESKVDQRRSVRT FT RQIPSRYQI" XX SQ Sequence 5708 BP; 1735 A; 1018 C; 1425 G; 1530 T; 0 other; taaatggcga cgaagatggt ggagtgaaga aagcgaaatt taatttggaa tacagatttt 60 tatttggttt gtaattgagt gtgaaaaaaa ggtaatttat gtgtttaaat tggaaatagt 120 tatcgaaaga acgtgctctc aactgctgtg tgtaaatcag gaaaatttgt tgggaaactg 180 ggcgtcaatg ttgtgttaaa tgttgtgata aaaaaaataa gaaataatta taaataaaat 240 catggctgca aatttaacca aagaaaaaac gtggtgaaat tattattgct ctctaaaaga 300 aaaaaaaata tatgtcgtaa agtgaaagca gaaaaaaatc aagagcaata aaatttgtgt 360 taaatgtgcg cagcagtgat gatcgacgat ggatagtgac cttggaaggg gattaagatc 420 agcttggtgt gcaaaagaaa agaaaaaaaa aaatgatttg ctcgcttgag gctgcgtgac 480 cttggaaagg gattacaact gatcgacgca caaaacagga atgtgtgtcg ggatcgtgat 540 tacttattgg acattcaagg tgcgtatgta ttgtagacga gtgagcaaca atttattttc 600 gtttgctaat attgtgcaga gaaactttga tgatgcttga ttcctctata gggttgacgc 660 tgaacctata gattttttca ttgatagatt ttgacgatga aaatataaat atatatatat 720 ttttgataat gaatggacct agtaaatgac atttccggtt gctacgaaaa gattaaaatt 780 ttcggtagtt acggaaaacg ttaaatttcc ggttgttacg gaaaatgttg gattttcggt 840 tgttacggaa agaacgaaat ttccggtagt tacggaaaaa agtaaaattt ccggttgtta 900 cggatttgaa atttccggtt gttacggaaa gactgaaatt tccggtagtt acggaaaaaa 960 aagtggaatt tccggttgtt acggatttga attttccgtt tgttacggaa agactgaaat 1020 ttccggtagt tacggaaaaa agtgaaattt ccggttgtta cggatttgaa aattccggtt 1080 gttacggaaa tactgaaatt tccggtagtt acggaaaatg ttgaatatcc ggttgttacg 1140 gaatgaaaga aatttccggt agttaaggga aaaagtgaaa tttccggttg ttacggattt 1200 gaaatttccg gattttttac ggaaagactg aaatttccgg tagttacgga aaatgttgat 1260 attccggtta ttatggaaag aaagaaattt ccggtagtta cggaaagagt gaaatttgta 1320 tatgcatgtg tgtgtttcaa tgataaatat tggcagttga gagttcgtta ttgttggttt 1380 tgatttgtct ttctaaatga gtgtaaaatt taaaataggc aatgatggat gacatttacg 1440 tcccaccggt ttcaaccgct gaaacgggtt ccgctgctcc gccggggtca actgcgccac 1500 cggcatctcc agctgtactt ttcagtccag catcctacaa tttgccacaa tttaagttca 1560 gacaccttcc tacctcagaa gtgagcaatg cgtggattca gtggattcgc tggtttgaaa 1620 atatcatgat tgcgagtggt atttctaaca gcgatgctcg gaaagctcaa atgctggcca 1680 tgggaggcat tgagcttcaa ggagtattct acggtttgcc aggagcggaa gagtggggct 1740 cgaatgcgta tggagcggca aagcagaaac tgactgatca tttctcaccc aaacaacatg 1800 actcgttcga aagattccag ttctggtcga tgaaaatgga gaatgatgag ccgatcgaga 1860 agtttctgtt acgagttcaa cagaaagctg aaaagtgcgt gttcggaaaa actgagcttg 1920 aatgtcgcca gtatgcgatc gtcgacaaaa tcatccaaaa tgctccagaa gatctccgtc 1980 gaaagctgct tgaaaaagat gccttgacgt tggacaccac agtcaagatc gtaaacgcgc 2040 atcaagcagt taaataccaa gcagctcaaa tgaaaccgat ggcaaatcag caggtagagg 2100 ttcaccgtat ggtggtcaag gacaaaggac cgaaggatgc ggactcaagc gacacgggta 2160 gttgcagccg gtgtggttac ccgtcgcatc ggtccgggca gagatgtccg gctctcgatc 2220 agaaatgcag aggatgtcac caatttggtc atttctgggt catgtgcagt catcgtaaac 2280 ggaagtcgaa ctttaatcaa gtggatcgga agtacaacaa ccagaaacga taccggactg 2340 tacgaaacat tgctactgaa ccgagccatg agactgaaga atatccagtc aacttgattc 2400 acgacgacga taaggacgaa tacattacgt gtcgcgtagg aggagtggaa attccaatgt 2460 tgattgattc cggttctact cacaacttga ttgatgacgc tacctgggaa atgatgaaac 2520 tggaggatgt aacggtgcat tccgaacgtt tcgataactc gaagcggttc ctagcgtacg 2580 gaagagtacc attggaacta ctgaccgtgt ttgacgccat tcttgaaatc caggatggtc 2640 cggatcgtat tcaagctgga gctacattct acgtgatcaa gggaggtcag caagcactgc 2700 ttggaaaaat aactgctcaa cgattgggtc tgctgcgaat cggacttcca agcactatca 2760 aagaggatgt gaaccaggtc aaggcggcga aacagttcgc aaagatcaaa gactacaagc 2820 tgacgttgcc catcaaccga gatataccac ctgtaattca accacttcga cggtgcccaa 2880 ttgctcttca gagccaagtc aaggcgaaac tggatgaatt actggaaatg gacataatag 2940 agcgagtgga cagtccaact tcgtgggtat ctccgctggt gccaataatg aaggacaacg 3000 gggaattgcg cctttgcgtt gacatgaggc gggccaatca agcaatacaa cgtttgaatc 3060 atccacttcc ggtttttgaa gatatgctgc caaggttcag cgggagcaaa gcacttcaca 3120 acgttggata taaagcaagc atttccatca agttgagcct tgcggaggac agcagggaca 3180 ttactacatt tgtgaccaat tgggggttgt atcggtacaa aaggttgtct tttggaatca 3240 attgtgctcc ggagttttac caattcttga tggaaagcat cctatcgagt tgtcctcaaa 3300 ctgcagtatt cattgacgat attatcatct ggggagtcac acaacaagat catgatcgag 3360 cagttaagaa agttctgaag gtaaaaaaaa gtggatatta attgtttttt tttttctctc 3420 taaactataa attacattaa atctttaggt attgaacgag cgaaacattt tgcttaacat 3480 gcagaaatgt aaattcaatc agttggaagt ttcgtttctg ggtcacaagt tgtcaggaaa 3540 gggcgttctt ccgacggacg ataaagtgaa gtcgttattg cagtgccggc cgcccaggtc 3600 gaaagaagag ttacgcagtt ttctcggtct cgctacgtat gtctctcgtt tcattccgga 3660 cctggcttct acaaatcatc ctttgcgaga attgatcaag acttcggtga atttccgctg 3720 ggagaagatt catcaggatt cattcgagga attgaagaaa cgtatggcat cgatgcaaca 3780 tctcgctttc tttgacccaa aggatcgtac cttgctggtg actgatgctt ctggtgtggg 3840 cttaggagct gttttgattc aattcaagaa tggagttcct cgggtgatta gttatgcatc 3900 gaagagctta acggatactg agaagcatta tcctcccatt gagaaagaaa cgctagcaat 3960 tgtttgggga gttgaacgat tccagatgta ccttcttgga gtatctttca cggtcgagac 4020 agatcatcgt ccgttggaaa cactgttcac agcaaagtcg cgaccaacag ctcggattga 4080 aaagtggctg ctgagacttc tagctttccg attcactgtc atctatagaa aaggttcatc 4140 aaacatcgct gatccaattt cacgattagc tgctcaaggg tacgacgata attggactga 4200 ggaaacggaa gtgttcattc gacgcgtgat ggtggaatct ttggctgcat tagtggacga 4260 gcggatagct gaggatttca acaatgacac agatttgcac ataagaacgg tgcaagaagc 4320 agcagcaatc gacatttccg aggtggtgga cgctacagac aaagatacag agctacagtt 4380 cgtgaaggag gcgatcatga atggggattg gtcgaaggac gttttgagac cgtactcggc 4440 gtttcgtagt gagttgtcct acgtcaacga gctgattatt cgtgggtcca aacttgttat 4500 cccagctgca ttacgtgcaa gaatgttaag tttggcacat gagggacatc cgggacaaac 4560 gtgtatgaaa aggcggctca gggaccgatg ttggtggccc aacatggata aggagatatc 4620 gtatcgtgac gtgtgagacg cgtgtgtaag gatgccgcct tgttcaaata cccagtccac 4680 ctgagcccat ggagcggaga ccactaccgg agaagccttg gttggactta gctatcgatt 4740 ttctgggacc gatgccaacc ggtgaacaca ttctggtggt gattgattat tacagtcggt 4800 ttgttgagtt ggtagtgatg aaaaatatta ccgctaaaga cacgatcaac agcctgacca 4860 agattttccg tgtttggggt gtaccgaaaa cgatcacgct cgataatgcc aaacagtttg 4920 tgtcatcaga gtttagtgat ttttgcaaac tgaaagggat tcatctgaat catacttctc 4980 catactggcc gcaggccaat ggggaggtcg agcgacagaa tcggtcactt ctcaaacgat 5040 tgaagattgc aaacgctctg tacggtagct ggaaagcgga aatggatcgt taccttgaga 5100 tgtacaataa cactccgcat tcggtaactg ggaaggctcc gagtgaattg ttgcaaaatc 5160 gaagactcag gttcaagttt ccggatttgg aggatctttc ttcttctccg ttggagtcag 5220 aagtagcgga cagggacaaa atttccaaaa cgaaggggaa agaaaaggaa gatgcgagac 5280 ggaaggctaa gtacagtgac atagaagtag gagacgaggt tctaatgcga aatttacatc 5340 caactaacaa gctttcaacc gatttcctta aagagaaatt tgttgttgaa gacagaaaca 5400 gatcgaatgt tcgtgtgaag tcactggagt caaacaaaag ctatgaccga aatgtttctc 5460 acttgaagaa agttttgggt agtcacaccg agcaagaaga acataatcag gatgtagatg 5520 gcgccacaga gagcaaagtg gaccagcgaa gatcagttag aactcggcag attccgtcac 5580 ggtatcagat ttgattgttt gaacgaagga ttattgattt tgttaacacc tgaatgactc 5640 aatgtataaa gtgattaaag attgataaca actaatgatt tttgaacttt taattataga 5700 aagggagt 5708 // ID Gypsy-205_AA-I repbase; DNA; INV; 2496 BP. XX AC supercont1.58; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-205_AA_; KW Gypsy-205_AA-LTR; Gypsy-205_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2496 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.58; Positions 2886800 2884305. XX CC 'AAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 54..2081 FT /product="Gypsy-205_AA-I_1p" FT /translation="MADRMNKSELMAWLKQRNVEFPPSATVRHLRTLYESQ FT PREGKIPDNSIDEDSTTSDDEEELDAEIRVLEKRKRIAELRRELAATDAAM FT SKQPNFQDIKHSVPKFANSDSYDASKWITDFERACDAVNGDDLFKLKCIRR FT MMEPGTEAEWYLRIDKLETYKQFRDHFLENFGHVYSVAEVIDKLRKTTFVA FT GKTSVMGYILQMQEIASRANIDEAQTVQIIIDGFRDRSANIAVLYTANNIA FT QLKQLARRYSHLREMNSVPSPSATSARAKNIKTTTTPAKSSDSVQRCYNCS FT GTGHLSAECLKPRREIGSCFRCGSMQHKFKDCPKPAPRNQDRVALVDLVRR FT QPKESIAEDDVAAALSEINMVSVAFLCNENVQPFDTLLSIFDTGSPINLIQ FT RSSVPENLIPTKEVFSGYRSVGGFPLCSYGIVSVLIMYRNRSHILKTYVVP FT DHFISLPLLLGREFLKKFGIILFDTHVSDSPMVTKPTIEITDKNKIKISGE FT ELHCLFRSYQAEYVDCTSSCVQCCHNELSQFVENGPEFVELADQSVPEVYA FT IECESPESYDINPCLGTETQNAIRAIINEDYLNLSNIPAKQHNYEMKLKLT FT SDTPISFAPRRLSYSDKKEVDRIVEQLLSENIIRPSCSPYSFPIVLRTKKS FT GEKRMCVDYRSLNKITVRRRICCYKER" XX SQ Sequence 2496 BP; 754 A; 507 C; 552 G; 683 T; 0 other; gcatcagaag tgggatagcg accgcgattt tcataaaagt gtgtaagcgg aaaatggctg 60 atcgaatgaa taaatccgag ttgatggcgt ggctgaaaca acgcaatgtt gagtttcctc 120 catctgcaac agtgcgccat ttgcgaactc tgtacgaaag tcaaccacgt gaaggaaaga 180 ttccagataa ttctattgat gaggattcca ctacgagcga cgatgaagaa gaattggacg 240 ccgaaatccg tgtactggaa aagcgaaaga gaattgcaga acttcgtcga gagctggctg 300 caaccgatgc agcgatgagc aagcaaccca atttccagga catcaaacat tccgtgccca 360 agttcgctaa cagtgattcc tacgatgcaa gcaagtggat tacagatttc gagcgagctt 420 gcgacgcagt caatggagac gacttattca agctgaagtg tattcgtcga atgatggaac 480 cggggaccga ggccgagtgg tacttgcgga tcgataaatt ggagacgtat aagcagtttc 540 gtgatcactt tttggagaat ttcggccacg tatattcggt tgccgaagtt attgataagt 600 tgaggaagac aacgtttgta gccgggaaaa cttcagtcat gggctacata ctgcaaatgc 660 aggagatcgc ttccagagca aatatcgatg aagcgcagac agtgcagata atcatcgatg 720 gattccgtga ccgttccgct aacattgctg tgctgtatac agccaacaac atcgctcaac 780 tcaagcaact ggctcgacgc tactcacatc ttcgtgaaat gaattccgtt ccgtctccat 840 cagcaacaag tgccagagcg aagaacatca agactacaac aaccccggca aaatcatccg 900 attccgttca acgatgctac aattgttccg gaacagggca tctttccgct gaatgcctca 960 aaccacgtcg tgaaattggt tcgtgcttcc gatgcggctc catgcagcac aaatttaagg 1020 attgtccgaa gccagcaccc cgtaaccagg atcgagtagc gctggttgat ctcgtacgcc 1080 gtcagccaaa agaatccatt gccgaagatg atgtggctgc cgctctttcg gaaattaaca 1140 tggtaagtgt tgcattctta tgtaacgaaa atgtgcagcc ctttgatact cttctttcta 1200 ttttcgatac gggcagtccg attaatttga ttcaaagatc gtcggttccc gaaaatttga 1260 ttccgacgaa agaagttttt tctggataca ggagtgtggg cggctttccg ttatgttctt 1320 atggcattgt ttcagtgcta ataatgtacc gtaatagatc acacatattg aaaacatatg 1380 ttgttcctga tcattttatt tctcttcctt tgttgcttgg acgcgaattt ttgaagaagt 1440 ttggaataat tttgtttgac actcatgttt ccgattcacc tatggttaca aaaccaacaa 1500 ttgaaatcac cgataaaaat aaaattaaaa tttcaggaga agaactgcat tgtttgtttc 1560 gttcatacca agccgaatat gttgattgta catcctcctg tgttcaatgc tgccataatg 1620 agttgagtca gtttgttgag aatggccctg aatttgtcga acttgctgac caatccgtac 1680 ctgaagttta tgctattgaa tgtgaatctc ctgaatccta cgacataaac ccttgtttgg 1740 gaacagaaac ccaaaacgcc atccgtgcta taatcaatga ggattattta aacctttcaa 1800 atatccctgc taaacagcat aattacgaaa tgaagctaaa attgacatca gacacaccta 1860 ttagttttgc tccgcgtagg ctttcatatt ctgataaaaa ggaagttgat cgtattgtcg 1920 agcaactgtt atcagagaat ataataaggc caagttgttc accgtactcg tttcccattg 1980 tcctaagaac taaaaaatct ggagaaaaac gtatgtgcgt ggactatcga tcgttaaata 2040 agataaccgt ccgaaggaga atttgttgtt ataaagaacg ttgataacac gattggtact 2100 aataaaaagc ttgttcccaa atttaaaggg ccatatcgaa tccataaagt ccttccccat 2160 gaccgatatg taattagaga cattgagaac ggtcaaatat ctcaactacc ttacgatggc 2220 atagtcgagg ctgcccgaat aaaacgctgg gcggactggc gtgattcaga tgataagaaa 2280 ttggaagata tagagctgca catcaacaca tagatgataa gtaatgcacc acttttgggt 2340 ttgggtgtac aatttagttt actaatctcg agaagttaat agagcacaat ggttactaac 2400 agtacttata tgtaagtgaa aataagctgc attataattt gtgcttgtat gtatggttgt 2460 atgatcggga cgatcagtag tcaggaatgg ccgagt 2496 // ID I-66_AAe repbase; DNA; INV; 6707 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-66_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6707 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1337-1337 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 505..1863 FT /product="I-66_AAe_1p" FT /translation="MASTSGGDPGGSNGRTLPVYLDGRNEFGAVTTLLMTG FT KDGKNLPIEPWIIGKSLELCAGPLESAKSENKCTRYVIKTRKPAQVEKLLK FT MTQLIDGTEVSIIPHPTLNVSRCVISAYDLLEKEETEIVREMSSQGVIGAR FT RIMRNNKERTPALILTFNRSVYPENVKVGVLNFKTRPYYPNPLLCFGCYEY FT GHPRSSCTNPRRCYNCSQDHEENEMCENAAFCRNCKKDHRPSSRQCPIYKI FT ETDIIRTKIDLNISSAEARKRVAAGNGTYAQVAAQPRLDQSRMDALTAQIA FT EKNKKIEKLEADLERNSREMLAKLNEVLERIEEKDNEIKDLLMHIQHRDEK FT ITKIEADNKNMKKHIEDMQHRTRTNSQSSEPSVTKSKSKRPTSQNPAEHHH FT RSNMSPPPKKQPNSNRSPIMTRSTSSQNQRDAVTPTDIDSLHDTNLLNQID FT YGPSKSQH" FT CDS 1811..6607 FT /product="I-66_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTRIFSIKSIMDHPKANTNQASLTPRKVAVHTSQNIK FT SYRPKHSTDKFAHRFEEWESAVPPSLPLEALQTGELVRDVLPQPVDNAPYC FT TAGCSNPSRITNAIAINNTSTELQDLPTKNTPSTSISCTSQFLTPNSDPAS FT ICSIRESIKHTIASSYTKNSPPSDESFVNEKLKQPLRRKYRRENQNIPELC FT GETDSNPNHHLLGEGSVNPRQILAPALTPDVPSSWIEPVAAAGAGNLSVSQ FT ASIQSIHPQRSNDAPVISLEINSQQPSTCSSSYTKNPPPSDESVGSEKFKQ FT RQPQKYQRGNQNIPALYGESDSNPNHHLLGEGSVNPRQVLAPALTPDVPSS FT WTEPAAAAGAGNLSEQQASIPSLHPQRSNDAPEIPTENNSQQPSSSSVSSI FT SSGNCTQTFALHWNVRSVWRNYAEFRKIVDDNNPVAICLQEMMTNRTNGLL FT NDNYKWITHSRPESMGNGICGLGIRKDVPHAFINVTSTLHVCIARIGKPYD FT LTIISIYVPHATSGDELCHKLTDLIQSLEPPFLLCGDFNASHEVWGSTQSN FT NRGRLLLEWAVDNNFVALNNGSPTHYQASSGSFSAIDISFSTNYLASKFSW FT HCDDDLHGSDHFPLFVRFNNPLPIFPTTRRWLYKDADWKKFEKIVALKIPR FT DGNVSIEEITNAIYEAAVESIPRTKGLPAKRNQIWWNEEINQVVKARRKAL FT RKLRKLKNDGPDKEEAKQHFNTIKHEARKLIEEAKKKSWSSFQEIFTEKSS FT TSQLWQSFNKLSGKRRSRAPGLTIDGEYNDDPSVIADHLADFFAGTCSTAA FT YPEDFLQKKQAREEDNLSFLSDRVTPIDENFSVHELFQALESANGTSVGVD FT NIGYLMIKNLPFHCKMTLLNAFNRIWESGNVPSVWKESIIIPIPKPGENTC FT ASNSFRPISLISCVAKTMERMVNRRLMNHLEVNGILNQQQYAFRKGKGTTH FT YLADLYETIAEAAERGEHCEMVTLDISKAYDRVWHRHIMETVIDCDLGTNM FT NKFINNFLQQRSARVSFSGVLSKLIHLENGVPQGSVLSVSLFLLAMNSVFK FT FAPKNVKVFLYADDIVIIASGKRVSYLRRRLQKAVESVENWASSIGFQLSP FT VKSSTMHCCKLKKHRNWHLEGPKILLDGKEIAKNKTTRILGVTFDRKCNFN FT QHTKNLKEDCRSRLNLLRAISRRADRKTLLYIGNSIVTSKLFYGLEILREE FT NIEKLAPIYNQIIRIASGALRTTPILSLVVESGCLPFELMAIMTISKKACA FT LIEKSDDDSNHIWRRAQDAFQNLTGEEIPKICKLATIHPRAWNKAVPCIDW FT SVKAKVRAGQPPMLGLIAFREVTERKYQGYAHWYTDGSLAEGKVGYGVVGP FT DTRLEASLAKQCSVFTAEAKALSQATKHARNRSVIFSDSASCLAALESGKS FT KNCYIQLIEADIEGKDIRFCWIPGHSGIPGNEKADDAARCGRNRSIATEEV FT PCQDTSKWIQKMIWKAHQNTWSQNPRTTLKIVKSGVGKLTDRTDRKEQIVL FT TRLRLGHSIFNKKHIFEKKEPEVCTLCNRKVTVHHVLAACQRYDSERQELG FT INNNLAEILSNEKDAEDKVIRFVKRIGLFSSI" XX SQ Sequence 6707 BP; 2206 A; 1540 C; 1411 G; 1550 T; 0 other; cagttagcgt tgagcatccg aagtaatcgg tcgttctacc gcagatactg agcggatatt 60 ttttcccgat tttcgttcta catccgagag cgctagcgct acgcctcccg gtggaatagt 120 tttttttttt ctcaaccgag tgaagtgaca aaaacgcgaa aatattcgcg ggttaccacg 180 ctcattacac caggggcgca tacgagtgaa gaagagtatt cccggttacg tgcgcgggtg 240 gatgaccaag cgtctcctcc cgcatagtgc aagtgctggg agaaggtacg aagtaaacta 300 ccgtgcttag tgtcgaagtg tatgcaacaa ggcgacatca acagccaata aagtgactaa 360 ccgagtggaa ttgacgcgtt gtaaggtagg gtcaggtttc ctttttcctc ttcctttccc 420 accgtaccgg ggtaggtagt gggtgaccac tacccttatt ctacggttct gatcccaccg 480 gtttgaagaa acctctagca ttgcatggcc tccactagcg gaggcgaccc tggagggtcc 540 aacggaagaa cgctacccgt gtatctcgat ggacgtaacg agtttggagc cgtaacgacg 600 ttgttaatga caggaaaaga tggcaaaaac ttacccattg aaccatggat catcggaaag 660 agcctcgagc tctgcgctgg tccgcttgaa agtgcgaaaa gtgagaacaa gtgtacacga 720 tacgttatca aaacgagaaa gcccgcacaa gtggaaaagc tgttgaaaat gactcaactg 780 atagacggaa cggaagtatc aatcatacct catccaacac ttaacgtaag ccgatgtgtt 840 atttctgcat acgatctttt ggaaaaagag gagactgaaa ttgttcggga aatgagctcg 900 caaggggtta ttggtgctcg taggattatg agaaacaaca aagaaagaac accagcattg 960 atccttacgt tcaatcgaag cgtgtatccc gaaaatgtga aagttggagt actcaacttt 1020 aaaacgcgcc catactaccc taatccattg ctctgctttg gttgctacga atacggacat 1080 cctcgctctt cctgtaccaa tccaaggcgc tgctacaact gctcacaaga ccacgaggag 1140 aatgaaatgt gcgaaaatgc agcgttttgc agaaattgca agaaagatca tcggccttct 1200 agccgtcagt gtcctatcta caaaatcgag acagatataa tcagaaccaa gatcgacctc 1260 aacatttcga gtgcagaagc gagaaagcgt gttgctgctg gaaacggaac ctacgcacaa 1320 gtggcagctc aaccacggtt ggatcaatcg agaatggatg cacttacggc tcaaatagcc 1380 gaaaagaaca aaaaaatcga gaaacttgag gctgacctcg agcgtaattc tcgcgagatg 1440 cttgccaagt tgaatgaagt gttggaaaga attgaagaga aagacaacga gatcaaggac 1500 ttgttgatgc atatccagca tcgtgatgag aagatcacta aaatcgaggc tgacaacaaa 1560 aacatgaaaa aacacatcga ggacatgcaa cacagaacaa ggaccaactc gcaaagcagt 1620 gagccatctg tcactaaatc gaaatcaaaa cgaccaacat cacagaaccc cgctgaacac 1680 catcatcgct ctaacatgtc gcccccgcca aaaaagcaac cgaattccaa cagaagtccc 1740 attatgacta gatcaacatc aagccaaaac caaagggacg cggtaactcc aactgacatc 1800 gactctttgc atgacacgaa tcttctcaat caaatcgatt atggaccatc caaaagccaa 1860 cactaatcaa gcgtcgttga cccctagaaa agttgccgta cacaccagtc aaaacatcaa 1920 aagttatcgc ccaaagcaca gcacagataa attcgctcac cggtttgagg agtgggaaag 1980 tgcagtacca ccctcacttc ctttagaagc gttgcagact ggggaacttg tccgggacgt 2040 tctaccccaa cctgttgaca atgcacccta ctgcacggcc ggttgtagta atccatctag 2100 aataacaaat gcaattgcta tcaacaacac ctcaactgaa cttcaagatc tccctaccaa 2160 gaacactcct agtacatcaa tttcctgcac ctcacagttc cttactccca attctgaccc 2220 tgcaagtatc tgttcaatac gtgaatctat aaaacataca atcgcttctt cgtatacaaa 2280 aaactcacct ccctccgacg aatcttttgt gaacgaaaaa ttgaaacaac cgctacgtcg 2340 aaaatatcgt cgagaaaacc aaaacatccc agaattgtgc ggcgaaactg acagcaatcc 2400 aaaccatcat cttctcggag aggggtccgt aaaccccagg caaatccttg caccggccct 2460 gaccccagat gttcccagtt cctggattga gcctgtggcg gcagccggtg cagggaactt 2520 gtctgtatca caggcaagta tacaaagtat ccatcctcaa agatcaaatg atgccccagt 2580 aatttcccta gaaatcaaca gtcagcagcc atcaacctgc tcttcttcgt acactaaaaa 2640 cccacctccc tccgacgaat ctgttggaag cgaaaaattt aaacaacggc agccccaaaa 2700 ataccagcga ggaaaccaaa acatcccagc attgtacggc gaaagtgaca gcaatccaaa 2760 ccatcatctt ctcggagagg ggtccgtaaa ccccaggcaa gtccttgcac cggccctgac 2820 cccagatgtt cccagttcct ggaccgagcc tgcggcggca gccggtgcag ggaacttgtc 2880 tgaacaacag gcaagtatac caagtctcca tccccaaaga tcaaatgatg ccccagaaat 2940 tcccacagaa aacaacagtc agcagccatc ttccagctct gtttcatcga tatcctccgg 3000 aaattgcact caaaccttcg ctttgcactg gaacgtgcga agtgtgtggc gaaactatgc 3060 agaatttcgt aaaattgtgg acgacaacaa tcccgtagcg atttgtcttc aagaaatgat 3120 gacgaatagg acaaacggcc ttcttaacga caactacaag tggatcaccc acagccgacc 3180 agagagcatg gggaacggta tttgcggcct cggcatccga aaggatgttc cccatgcatt 3240 tatcaacgtc acttctactc tacatgtatg tattgctagg ataggtaaac catatgacct 3300 aacaatcata tcgatctacg taccacatgc cactagtggc gatgagctat gccacaagct 3360 aactgatctc atacaatccc tcgaaccacc atttcttctt tgtggggact tcaacgcgtc 3420 acacgaggta tggggcagta ctcaatccaa taatcgtggt cgtctcttgt tggaatgggc 3480 agttgacaac aatttcgtag ctcttaacaa tggttcccct acacactatc aagcgtcgtc 3540 tggatctttc tcggctattg atataagctt ttctacaaat tatctcgcat ccaaattctc 3600 atggcattgt gatgatgatt tacacggcag cgatcacttc ccactttttg tacgttttaa 3660 caatccgctc ccaatcttcc ccacaacaag aagatggttg tataaggatg cagattggaa 3720 aaaattcgaa aaaattgtcg ccttaaaaat tccacgcgac ggtaatgtat ctatcgagga 3780 aattactaac gccatttatg aggcagctgt agaatcaatt ccgagaacaa aaggtcttcc 3840 ggcaaagagg aatcaaatat ggtggaatga agaaatcaat caagtagtga aggcgcgaag 3900 aaaggctttg aggaaactca gaaagctcaa aaatgatggt ccagataaag aagaagcgaa 3960 gcaacatttc aatacaatta aacatgaagc aaggaagttg atcgaagaag caaaaaagaa 4020 gtcgtggtct agtttccaag aaattttcac agagaaatcc agcacctctc aattgtggca 4080 aagtttcaac aagttatccg ggaaacgacg aagtagagcc ccaggtctca ccattgatgg 4140 agaatacaac gacgatccat ccgttatagc ggatcacttg gcggactttt tcgcaggcac 4200 ttgttctacg gcagcatacc cagaagactt ccttcaaaaa aagcaagcac gagaagaaga 4260 caatctttca tttttgtcgg atagagtaac cccgattgat gagaattttt ctgttcacga 4320 attatttcaa gcattagaga gcgcgaatgg aacatcagtt ggagtcgata acattgggta 4380 tcttatgata aaaaatctac cattccactg caaaatgact ctactaaatg ccttcaaccg 4440 catttgggaa agtggcaatg tcccttctgt ttggaaagag agtatcataa ttccgattcc 4500 aaaacctggg gaaaacactt gtgcatccaa tagcttccga ccaatatctt taattagctg 4560 cgttgccaaa acaatggagc gaatggtaaa tcgccgactg atgaaccatc tagaagtcaa 4620 tggaatactc aaccaacaac aatacgcgtt ccggaaagga aaaggaacaa cacactatct 4680 agcagatcta tatgaaacta tcgccgaggc agccgaaaga ggagaacatt gtgagatggt 4740 tacccttgat atcagcaaag cttatgatcg ggtatggcat aggcacatca tggaaaccgt 4800 cattgactgt gatctcggta cgaacatgaa taaattcatc aataacttcc tgcaacaacg 4860 atcggcgagg gttagctttt ctggagtcct ctcaaaattg atacatctag aaaacggtgt 4920 cccacagggc tccgtgctct cagtgtctct ttttcttcta gctatgaatt cagtcttcaa 4980 gtttgccccg aaaaatgtaa aagtttttct ctatgccgac gatattgtca tcattgcatc 5040 tggcaaacgc gtaagctacc tgcgacgtag acttcagaag gctgtagaga gtgtagaaaa 5100 ttgggcctca agtattggtt ttcaattatc accggtcaaa tcatccacta tgcattgctg 5160 caagctcaaa aagcatcgta attggcacct cgaaggacct aaaattcttc tagatggaaa 5220 ggaaatagcc aaaaataaga caacccgaat cctcggagtc acctttgaca gaaaatgcaa 5280 cttcaaccaa catactaaaa atctgaaaga agactgtaga agcagattaa atcttcttag 5340 agcaatctct agaagagcag accgtaagac cctattgtat ataggaaact ccatagtaac 5400 ttcaaaactt ttctacgggt tagagatact tcgcgaagaa aacatcgaga aacttgcacc 5460 aatatacaac caaattatta gaatagcttc tggtgcactt cgaactaccc caattctttc 5520 cttagtggta gaatctggat gtctaccatt tgaactcatg gctataatga caattagcaa 5580 aaaagcatgc gcattgatag aaaaatctga tgacgattct aatcatattt ggaggagagc 5640 tcaagatgcg tttcaaaacc taacaggcga agaaatacca aaaatatgca agctcgcaac 5700 aatacaccca agagcgtgga ataaagcggt cccttgcatc gattggtccg taaaagcgaa 5760 ggtaagagct ggacagcccc ctatgttagg acttattgct ttcagagagg tgacagaaag 5820 aaaataccaa ggatatgctc actggtacac ggacggatcc ctcgccgaag gaaaagttgg 5880 gtacggagtt gtaggaccag acacgagatt agaagccagc ctagcgaaac aatgttcggt 5940 gttcactgcg gaagctaagg cactatcaca agcaaccaaa catgccagaa acagatcagt 6000 catctttagt gattcagcta gttgcctagc agcattggaa tctggaaaat cgaagaactg 6060 ttacatccaa cttattgaag ctgatattga agggaaggac attcgttttt gttggatacc 6120 cggacattca ggaatcccag gaaatgaaaa agccgacgac gcagcacgat gtgggagaaa 6180 ccgatcaatt gcaactgagg aagttccttg tcaggatacc agcaaatgga tacaaaagat 6240 gatctggaag gcccaccaaa acacatggag tcaaaatccc aggactacct tgaaaatagt 6300 taaaagcggt gttggaaaat tgacagatag aacggacaga aaggagcaaa tagtattaac 6360 tagattacga ttaggtcatt caatttttaa taagaagcat atttttgaga aaaaagaacc 6420 tgaagtgtgt acgttatgta atagaaaagt aacagttcat catgtacttg ctgcgtgtca 6480 gcgatacgat tctgaaagac aagagctagg aataaacaac aatttagctg aaattcttag 6540 caatgaaaaa gacgcagaag acaaggtaat aagatttgta aaaaggattg gattgttcag 6600 tagtatataa gtcaaattgt aaaacaaaat aacttttgat aataaaaccc ttttaattaa 6660 gagacgaatg ccccttctgg gtaaagtctc tataaacaaa aaaaaaa 6707 // ID BEL-156_AA-LTR repbase; DNA; INV; 592 BP. XX AC AAGE02017627; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-156_AA_; KW BEL-156_AA-I; BEL-156_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-592 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017627; Positions 611 20. XX SQ Sequence 592 BP; 195 A; 122 C; 107 G; 168 T; 0 other; tgttgcagac gctgcaataa acgattttag aaataacaga catataggaa aatccattgc 60 aacagaaaaa ccctttgttc agaaaaacgt agcgattgta aaatcattag atgaaaagaa 120 tttctacccc ttaatgagcg tggccaaaat ccccgagcaa taaaatcatc cctcaaatca 180 ggtaacgcac gcatctcggt tttaaacaaa agagcaatta tctattccat gcgcaagggt 240 agaataataa aacatagaca gtagccgaga agggcgccgc gaccaaaagg gccaaccgtt 300 ttttgtgctc aggtaatcgt aggtaaattt agtcgtaagt aatttcttgt atatatatct 360 catacatttt tgaaaatata tgtcattcat agtttagcga tcacacgccg tagtttttat 420 tcgttgtctt tctcgacaat accttggtcg aactcacatc cgaacgcttt gttttcagta 480 ctgttttaga attttggaat agcttctcga atgagtggaa ggactagcca aagatcaagc 540 tttacttgaa atcattgcca tattctcaga ccctttaacc ccggagccca ca 592 // ID BEL-11_DWil-LTR repbase; DNA; INV; 210 BP. XX AC scaffold_181148; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_DWil_; KW BEL-11_DWil-I; BEL-11_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-210 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181148; Positions 836211 836002. XX SQ Sequence 210 BP; 93 A; 33 C; 24 G; 60 T; 0 other; tgtcgcagaa aatactactt tatttatatt taagctatat taccgaaaaa taccattacc 60 gttcatgaat ctagtatata agtcaattat catttttgtt gtaaaaagag atggtcactc 120 tataaaaaaa acgatggcga agcctaaaga ataaaacaat aaagctctct aaaaaatata 180 taataataaa aaaaaccacg gccaattcca 210 // ID Gypsy-61_CQ-I repbase; DNA; INV; 4546 BP. XX AC AAWU01037281; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-61_CQ_; KW Gypsy-61_CQ-LTR; Gypsy-61_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4546 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 501-501 (2011). XX DR Genome; AAWU01037281; Positions 11501 16046. XX CC Positions [1853-2458] - Reverse transcriptase CC Positions [3548-4009] - Integrase core CC 'ACCTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(455..2140,2144..4531) FT /product="Gypsy-61_CQ-I_1p" FT /translation="MDKWDIPPFKFRTLPRNEIRSEWNKYKRHFQYVVAAT FT GEKDKTRIKNIFLAKAGPDLQEVFSSIPGADVEGSRTVDPFEVAIQGLDSY FT FAPKQHESFERNLFWTLRPSTGESLEKFMIRCTEQASKCNFGRTAAESRAI FT SVIDKMILFAPSDLKEKLLQVEDLNIDEAAKIISSHESIKLQAQVMGQAQL FT EDMDATAGVNKIQHRPHRGPPPKCTRCGYTDHASTDRNCPARNRQCSKCGA FT NGHFAVMCLTGPKRKFRDEAGPSKRWKTERIREVRTDESKPPESFIFNIGD FT QNEMIWLRVGGVLLHVLVDSGCKKNIVDESSWKYLKANGVQVTNQQVNCEE FT IFLPYGSQAKPLTTLGKFDAIVTIDDGGRKIEETATFYVIKEGQQCLLGRV FT TATSLGVLRIGLPSTHGINAIEPKEKHAFPKISGVQVEIPIDDSVTPICQH FT PRRPPIALQSRIEDKINALLASDIIEPVEGGCQWVSPLVTVVKDNGDLRLC FT VDMRRANTAILRERHIMPTIEDFLPRFTTAKYFSRLDVKEAFHQVRNISGK FT WLYCTGHKNIKTIIYVELKEESRYITTFITHVGLFRYKRLMYGIVIASEVF FT QRIMEQILCPYSKNLVNYIDDILIFGSTEKEHDDVLRAVLNTLHDRGILLN FT QEKCLFKACKLQFLGHEISSEGIEPCGSKVEALQNFRAPSTPEEVRSFLGL FT VTYIGRFLPDLATVTAPLRQLTHSGVKFVWGKEQQEAFLRLKDMISNVKLL FT YFFDNSLRTRVIADASPVALGAVLIQFGDETDDSPRPIAYASKSLTETERR FT YCQTEKEALALVWSVERFTVYLIGRSFELETDHKPLEAIFQPTSRPCARIE FT RWLLRLQSFRFHVKYRKGAGNIADPLSRLVQHSSSENFDTDNQFMILAVCQ FT SVAVDIHELDQATKSDSVLEAVKQCIRTGNWDPPEAKPFHPFRSEMCVLDD FT LLVRHDKLVVPDKLRARMLDLAHEGHPGESVMKRRLRDRVWWPGIDRDVTR FT RVVSCEGCRLVGLPNRPEPMCRRPLPCKPWVDIAIDFLGPLPCGVYLLVVI FT DYYSRYKEVELMTRITAKETVQRLDKIFTRLGYPQTITLDNAKQFVGIEIQ FT EYCKTHGIYLNHSAPYWPQENGLVEKQNRSFLKRLKISHALNRDWKQDLRE FT YLVMYYTTPHSTTGKTPTEMLYGRTIRSKIPALSDIEGAPSNTEEADRDRI FT LKQKGKENEDARRKARESSIGTGDTVLMQNLLPGNKLSTTFNPTEYVVLAR FT DGPRATIRDPNSGKSFKRNVAHLKRIEKPAADEVSTSEGAAMEGPAWSQAE FT NNGNASPNRSDETQGIEDHEDVEPEQPKKPRRSLKRPARFADYVSS" XX SQ Sequence 4546 BP; 1286 A; 1108 C; 1224 G; 928 T; 0 other; aattggcgac gaggtaaagt atgagttact gattcaagga aggggaatgg aagaaacacg 60 gatttctttt gtttttctgt gttgtgggaa ggttaaacta agcccactaa aagtgaggtt 120 aagacaatat atcaaggcaa atcttggtca ttagccaaag gtatattttt cattacagtg 180 gaggccacca gggtggaccg tgtctgagtt acaactcatt cggaaagcaa ccgcaaaggt 240 ggttgacagg acaagtagct accagggtag cggaggattg aatagccacc aagttaggca 300 agtccaggga catagttcgc aagggcgatc ggatcgataa tcggaacgga gcacaacccg 360 caagacggat ctggagacgt ggtaaacaaa gaaacggcaa tttgataagt caatctaaat 420 tctaattgga gttttttttc aatcgctcta gagtatggac aagtgggaca taccaccgtt 480 caagtttaga acgctaccaa ggaatgagat ccgttccgag tggaacaagt acaagcgcca 540 tttccagtac gtggtggcgg caactggtga aaaagacaaa actagaatca aaaacatctt 600 cctggcgaaa gccggtccgg acttacaaga ggtattcagt tcaatcccag gagcggatgt 660 tgaggggtca cggacggtcg atccatttga ggtcgcaatt cagggactgg acagctactt 720 cgcgcccaaa cagcacgaat catttgagag gaacctcttc tggacactca ggccgtcgac 780 cggagagtcg ctcgaaaagt tcatgatacg ttgcactgag caggccagca agtgcaattt 840 cggaagaacg gcggctgaaa gccgcgcgat cagtgtcata gacaaaatga tactctttgc 900 ccccagcgac ctgaaagaga aattgctaca agtggaggac ctgaacatcg acgaagccgc 960 gaagatcatt agctcacatg aatcgattaa actacaagcc caagtgatgg gccaggctca 1020 gctggaggac atggatgcga ccgcaggtgt caacaagatc cagcaccgac cgcaccgtgg 1080 gccaccaccg aaatgcaccc gctgtggtta cacagatcat gcgagtacgg atcgtaactg 1140 tccggctcgg aacagacagt gctcaaaatg cggagcgaat ggacatttcg cggtcatgtg 1200 cctcacgggg ccgaaacgaa agttcagaga tgaagccggc ccgagcaagc gctggaagac 1260 agagcggatc cgagaagtta ggactgacga gagcaaacca cctgagagct tcattttcaa 1320 catcggcgat caaaatgaaa tgatctggct gagagtgggt ggagttttgc ttcatgtttt 1380 ggttgattca ggatgcaaga aaaatatcgt cgacgagagc tcttggaagt acctcaaagc 1440 gaacggtgtc caagtcacca accaacaagt gaactgcgaa gaaattttct tgccgtacgg 1500 atcccaagct aaaccgctga cgaccctggg taaatttgat gcgatcgtta cgattgatga 1560 cggcgggcgg aagatcgagg aaacggcgac gttttacgtt atcaaagaag gtcaacagtg 1620 cctacttggc cgtgtcacag caacaagtct gggagtgcta cgaatcggac tgccgagtac 1680 gcatggcatc aacgcaattg aaccgaaaga aaaacacgct tttccgaaga tcagcggggt 1740 acaggtggaa attccgatcg acgactcagt gacacccatc tgccaacacc cacgtcggcc 1800 gcccatcgcc ctgcaatctc ggatcgagga caagataaac gcactactgg ccagcgacat 1860 catcgaacca gtcgaagggg gttgtcagtg ggtctcgccg ctggtaacag tcgtcaagga 1920 caacggggat cttcgattgt gtgtcgacat gcgcagggcg aatacggcaa tactgcggga 1980 acgacacatc atgccaacca tcgaagattt cctcccaaga ttcacgaccg caaagtactt 2040 cagtcgcctc gatgtcaaag aagcgttcca tcaggtaaga aatatttcgg gaaaatggtt 2100 atactgtaca ggtcataaaa acataaaaac aattatatat taggtggagc tcaaagagga 2160 gagcaggtac ataaccacgt tcatcactca cgtggggctt tttcggtaca agcggctcat 2220 gtacggaatc gtgatcgcat ctgaagtatt ccaacgcatc atggagcaga ttctgtgtcc 2280 ctacagcaaa aacttggtca actatatcga cgacattctc atttttggtt cgaccgaaaa 2340 ggaacacgac gacgttctgc gggcagtgtt gaacacactg cacgatcgcg gaatccttct 2400 gaaccaagaa aagtgcttgt tcaaagcttg caagctccag tttcttggcc acgagatctc 2460 ttcggaaggc attgaaccgt gtggaagtaa agtggaagcc ctgcagaact tccgggcacc 2520 gtcgacgcca gaggaagtcc ggagtttttt gggattggta acatacattg gccgcttcct 2580 cccggacctc gcaacggtta ccgctccact tcgccagctg acacattccg gggttaaatt 2640 cgtttggggc aaggagcaac aggaagcctt cctgcgtctg aaggatatga tctcgaacgt 2700 gaaattgctc tacttttttg acaactcgct gcggacaagg gtaatcgcgg acgcgtcgcc 2760 ggtcgctctg ggcgcggttt tgatccagtt cggcgacgaa acagacgact ccccgcgacc 2820 aattgcttat gcaagcaaga gcctgaccga aaccgagcgc agatattgtc aaaccgaaaa 2880 agaagcgctc gcactggtct ggagtgtgga gaggttcacc gtgtatctga tcggccgaag 2940 ttttgaatta gaaaccgatc acaaacccct ggaagcgatt ttccaaccta cctccagacc 3000 atgtgccaga atcgagcgtt ggctgcttcg actccaatca ttcaggttcc acgtcaagta 3060 tcggaagggc gcgggaaata ttgccgatcc gctgtcgcgt ctagttcagc actcctcatc 3120 cgagaacttc gacactgaca atcagttcat gatacttgct gtatgccagt cagtcgcagt 3180 cgacatccat gaactcgatc aagctaccaa gtcggactcg gtactagaag cagtcaaaca 3240 gtgcatccgc accggaaact gggacccgcc ggaggccaaa ccgttccatc ctttcagaag 3300 cgagatgtgc gtgttggacg atctgcttgt tcgacacgat aagctggttg ttcctgataa 3360 actgagggca aggatgctgg atttggctca cgaaggtcac ccgggtgagt ctgttatgaa 3420 acgtcggctc agagaccgag tatggtggcc gggaatcgac cgagacgtca cgcgtcgggt 3480 tgtctcgtgc gagggttgtc gattggttgg actgcccaat agaccggagc ccatgtgtcg 3540 tcggcctctt ccgtgtaagc catgggttga catagctata gactttctgg gaccgctgcc 3600 atgtggagtg tacctgctgg tcgtcatcga ttattacagt cgatataagg aagtagaact 3660 gatgacgagg ataactgcga aggaaacggt gcaaagactc gacaagatct ttacacgact 3720 ggggtaccca caaacgataa cgctggacaa cgccaagcag ttcgtcggaa tagaaatcca 3780 agagtactgc aaaacgcacg gcatctacct gaatcattcg gctccgtatt ggccacagga 3840 gaacggacta gtggagaagc agaaccgatc gttcctgaag aggttgaaga tcagccacgc 3900 tctgaacaga gactggaagc aggatctacg ggagtatctg gtcatgtatt acactacacc 3960 gcactcgacc accggcaaga caccaaccga gatgctgtat ggccggacca tccggtcgaa 4020 gattccggcg ctcagtgaca tcgagggagc tccgtcaaac actgaagaag ccgatcggga 4080 ccgcattctg aaacaaaaag ggaaggagaa tgaagatgcc cgacgcaaag ctcgggagtc 4140 atccatcggg accggagaca ctgttcttat gcagaatctt ctacctggta acaagctatc 4200 tacaacgttc aacccgacgg agtacgtagt gctggcgcgc gacggacctc gtgcaacgat 4260 ccgcgacccg aacagcggca aatctttcaa gcgaaacgtt gcgcacctta agaggataga 4320 aaaaccagcc gctgatgagg tgtccacaag cgaaggcgct gccatggagg gtcccgcgtg 4380 gtctcaagct gaaaacaacg gcaacgccag cccaaaccgc agcgacgaaa cccaaggaat 4440 cgaggatcac gaggacgttg aaccggagca accaaagaag ccaagacgat ctttgaaaag 4500 gccagccaga ttcgccgatt acgtttcttc gtgaaaaaag gggaga 4546 // ID Copia-26_AA-I repbase; DNA; INV; 3111 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-26_AA_; KW Copia-26_AA-LTR; Copia-26_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3111 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 952-952 (2011). XX DR [2] (Consensus) XX CC Positions [1984-2511] - Integrase core CC 'ATTGA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1129..3027 FT /product="Copia-26_AA-I_1p" FT /translation="MTDPAPLTMALVKQKLLDEHQRRKDRWADSGETAMKS FT ERKVFQKQKVCFHCGKPGHFRRNCVELLQKKSEDDSRRRKADKVKVKDCSN FT GKPICFSIAERRLKSLWYIDSGCSCHMTNNRSFFTKLDGSKCVDVVLADGS FT STKSQGVGEGLVKCINAKGHIVDIKLTEVLYIPSLDSGLLSVQKLTLRGFE FT VDFNESGCVIKCETGETVVMGEVYGSLFVLKTVEFSKLSKVRHFKNCQHIW FT HRRFGHRDPAVLDRMKAESFVREFEMQDCGLRQTCEICLEGKLPRKPFPKA FT SQSRANRVLDLVHTDVCGPMANVTPGGNRYLMTLIDDFSRYTVVRLLKRKS FT DVVGCVKQYVSYVKCLFGRAPCVIRSDGGGEYVNNELRKFYKQEGIQAQYT FT VAYSPQQNGVAERKNRTLQEMATCMLIDANLPKRYWGEAVITAAYVQNRLF FT SRVVDRTPYEKWFGRVSAVGHIRVFGSLANAHIPGIEVQQLVFVGYNEEKE FT TYRLLDTKTNRITFSQDVQFFEQQVGYHDVTQKGSDSTENIGDEGLIPASL FT QKESTEAELLHEEPVQEERIDRENLLRKEQTDQEEDVESEDEFYGFEDEPE FT PEQRAEKRATRGVRPRRFDDYIVDVGIVVEESPMF" XX SQ Sequence 3111 BP; 896 A; 553 C; 851 G; 811 T; 0 other; ggttatcggc ccagaaaagt ggtaaagcgc gaagtgattc cggaacgcgg atattgtttc 60 cattggcgtc ctcgggctaa ttagtaatcg ttgctgtcgc gctgggcttg agcagttgtg 120 tgaagagttc tttgcaatgc tctgagagtg ggaccgaagt gaacgcaaaa cggtttgagt 180 gtagtgtgta tcaaccactc actgggtgtc cgggagcata aactgctcga aaaatttgtt 240 ttgctcagtc ggaagcctgt atagtagtgc cagtgcagca tcgagagaga agtaaacaat 300 ctctgcactt gtattttggt gggacgagaa agtttctgtt gttgttgctg ctgctggcgt 360 tgtcacagtt gttgctgttg ttcggcagac tggataccgt gctttggtgc gattgtgtac 420 ggattgtgac taaatatggc agagaaatac ttgtttgctc ggcttagcaa ccaaaattat 480 tcggtgtgga agacccgaat ggaaatgttg ctgaagagag aagaactgag gagtgttatt 540 gactctgcaa gaccagaacc agtgacggac acatggacta aaagtgatca gaagtgtcat 600 gcaacgattg tcttgtatat agatgacaat cagttgaatg tagtgaaaga tgccagaagt 660 gccaaggcgg tttgggatca attaaaacag tatcacgaga agacgtccat gacctcgcgc 720 gtgtcactgt taaagaaact gtgtagcctg aatatgtcgg aaggttctga tgttgagaag 780 catttattcg aactggaaga gctgtatgac cgactagcgt aagctggcca gtcattggaa 840 gatccattga aaattgctat gctcttaagg agtttgccgg aatcgtatgg agatcttgtc 900 acggcactgg agagtcggcc ggaggcggat ttaacgatca acgctccgtc atcgtaatca 960 tcgtcatcat cggaacttga aaagcagttt ttctgttcaa aactaaaaat tattcgatga 1020 aaatgtgttc attgtgtttc ggtaggggaa cagaatacag tgaacctatt ttcctgtaaa 1080 ttttcttgat tttgaacgaa aaaactgcta ttaaaaattc cgaaattaat gacggatcct 1140 gcccccttaa cgatggcgct agtgaaacaa aagttgttgg atgaacacca gcgacgaaag 1200 gatcgatggg ccgattccgg cgaaaccgcc atgaaatcag agcgaaaagt gttccagaag 1260 cagaaagtct gctttcattg cggaaaacca ggtcattttc gtcgaaattg cgtagagctg 1320 ttacagaaga aaagtgaaga tgattctagg agaagaaaag cagataaagt gaaagtgaag 1380 gactgttcga atggaaaacc tatatgtttt tcgattgctg agcgtcgatt gaagagtttg 1440 tggtatattg atagtggttg ttcatgtcac atgacgaaca accgctcttt ctttactaag 1500 ctggatggca gcaagtgcgt cgatgttgtt ttggcagatg ggtcttccac gaagtcccaa 1560 ggagtgggcg aaggcctagt gaagtgcatt aacgccaaag gacacattgt tgatataaag 1620 ctcacagaag tgttgtatat accgtctttg gacagtggac tgctttctgt gcagaagctc 1680 acgctgagag gatttgaagt ggattttaac gaatcaggtt gtgtgatcaa atgtgaaaca 1740 ggagaaacgg tagttatggg tgaagtttac gggagtttat ttgtgttgaa gaccgtcgag 1800 ttttccaaac tgagtaaggt aaggcatttc aagaactgcc aacacatttg gcacaggcgc 1860 tttggacaca gagatcctgc ggtgcttgat cgtatgaaag ctgaaagttt cgtaagagaa 1920 ttcgagatgc aggattgcgg attgagacag acttgtgaaa tctgtttgga aggtaagctc 1980 cctcgtaaac cgtttccaaa agcttcacag agcagagcta accgtgtgct tgatctagtg 2040 cacactgacg tttgtggacc gatggcgaat gtcacccctg gaggaaaccg gtatcttatg 2100 acgctaatcg atgacttcag ccggtacact gttgttcgcc tgttaaagcg gaagtcagat 2160 gtcgtcggtt gtgttaaaca gtacgtttcc tatgttaaat gtctgtttgg aagagctcct 2220 tgtgtcatca ggtcggatgg aggtggcgag tatgtgaaca acgagttgcg gaaattctac 2280 aagcaagaag gcatccaggc gcagtacact gtagcttact cgccccagca gaatggcgtg 2340 gcggagcgca aaaaccgcac gctacaagaa atggctacct gtatgctgat cgatgcgaat 2400 cttcccaaac gatattgggg agaagctgtg ataacagcag cttatgtaca gaatcgatta 2460 ttttcccgtg tggttgacag aactccatat gagaaatggt tcggaagagt atctgctgtg 2520 ggacatattc gtgtatttgg aagcttggcg aatgcccaca taccgggcat tgaagttcaa 2580 cagcttgttt tcgtcggata caatgaagag aaggaaacct atcgattgtt ggatacgaag 2640 acaaacagaa taacgttcag ccaagatgta cagttcttcg aacagcaggt tggataccat 2700 gatgtgactc aaaagggatc ggattcaacc gaaaacattg gtgacgaagg tttgatacca 2760 gcttcgcttc agaaggagtc aacagaagca gagctgttgc atgaggagcc agttcaggag 2820 gagcgaatcg atcgtgaaaa tttattgcgg aaggagcaaa cggatcaaga agaagatgtg 2880 gaatccgaag atgagttcta cggatttgaa gatgagccag aacctgagca acgagctgaa 2940 aaacgagcaa ctagaggcgt tcgtccacgg aggtttgacg actatatagt cgatgtgggt 3000 atagtcgttg aagaaagtcc gatgttctga gcgagtacgt aaggaatcta tattatatgt 3060 aagtaatttt gtgtaataga ctgtatagcc gctgacaatg tcgaggagga g 3111 // ID W2 repbase; DNA; INV; 715 BP. XX AC U10109; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE W2 repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; KW Repetitive sequence; W2. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-715 RA Drew C.A. and Brindley J.P.; RT "Identification of female-specific genomic sequences from RT Schistosoma mansoni by representational difference analysis."; RL Unpublished. XX RN [2] RP 1-715 RA Drew C.A.; RT "W2."; RL Direct Submission to Genbank (27-MAY-1994)Alexander C. Drew, RL Tropical Health Unit, Queensland Institute of Medical Research, RL 300 Herston Road, Herston, Brisbane, Queensland 4029, Australia. XX DR GenBank; U10109; Positions 1 715. XX SQ Sequence 715 BP; 196 A; 123 C; 200 G; 196 T; 0 other; aagcttgctg atgtgcagtt tgccgatgtc tattcaagtg atcaatcagt atgagtagaa 60 gacaatcatg cacaacgatt gaacagtcaa cagctcaaat gatgagccga gtgccgagaa 120 tcgttgcgca tttcctcaat caataaacag tgaagtgtgt tcacgaacga ttcccgttcg 180 gttgagatcg ggaaacgctg aaattgtgtg ttatgctctt tcggacatca gtttccacgt 240 gacattgatg aagtgagact gtgttgaggt cggtcgggtt gaaggaaatc taagtgtcgg 300 tggaaatgca cactgtgtgt ggtaatagaa cgacgaaaat tacgaatgcg tctattgagc 360 tatgttctgc atattggtct gaacaagtga tgatcgaagg acctgtgatt gtgcatgtgt 420 atgagtgcat aagtcaacaa agttgataac gaacaacctg gtgaattgac cgcatctgtg 480 tgatgtacaa ttagatgatg tagattgtgg agaggttcta ctggtgattg gttgtgacgt 540 gtcggaagaa cgttgggtgt cagcccacga aatgtgtgta aggcgaaacg cgtacgccgt 600 cagaacgttg ttgtggtggc ctgttttcgg acctgcatca tactcggaag atagcaaatc 660 agttgttcat cgtagtagag gagtacacac gttggacgac caagtgcgta agctt 715 // ID BEL-176_AA-I repbase; DNA; INV; 5680 BP. XX AC supercont1.6; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-176_AA_; KW BEL-176_AA-LTR; BEL-176_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5680 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.6; Positions 2939314 2944993. XX CC 'CATAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 637..4845 FT /product="BEL-176_AA-I_1p" FT /translation="MKEEEARLKKLQAEEELKLKQVEEAAKLKQLDEEQEY FT LKQRYNLLMQIEDEKEPSSRRSGISQKSRRTGVNVKSKRDQLEKWLKGAAV FT DEVGQSKPVASGYQAVPQVQTVSESVQSPVVINELQSPALTVPNSLSQVSA FT SSNMNHLCLPASGKVPTLAKSYEQFVSGQVVSSVSTVPVSIPFRSADPTAS FT DTLLSGMQGLRICNEPNAAVLPTVTLPARDAVAHSELNATAQVSVYSSGLM FT PAVYGSHPYYTAAPMGLRASINAAVPGEVAQVSQSQSSFGRVPPPEFAVSS FT TVGHGNGPFSNPAAPVVTNMFRPPLASIGLPDSGLYEVRASSAVGPSSAQL FT AARQVMPRELPKFNGDPQEWPIFYSSFKNTTEVCGYTDAENLARLQRSLGG FT SALEAVRSRLLLPASVPYVMDTLHKLYGRPEILISSLLKKVRNVPPPKAEN FT LSSIVAYGLAIQNLVDHIVLADQQAHLSNPMLLQELIDKLLTSLKMQWGTY FT KQSVGYINLATFNSFMAGLVNLASELPVDVDSAQNHYKQIRAEKSKQKEKL FT FTHANESSDFPKKEKEASIERTTAKACSYCGNENHQILNCSSFKSLDIGAR FT WKAMRQKNLCRLCLVPHRKWPCHSKKECGVDGCRIRHHKLLHSSQTSRNEP FT VETTKPTEAVHHNFHHTKSFSLFRYLPVTLHGDGKSVETYAFLDDGSSSTL FT LEEAIAAQLGIEGELDNFWLGWTGKIGRHEKSSRRVCLEISGAGKQNTFQL FT SNVRTVRELGLPSQTLNYTDLSRVYPHLQGLRVSSYIDAKPGIIIGIEHVH FT LITSLKIREGGRSDPVATKTRLGWCIYGRNSGTEASVEQLNLHVGQEMSNS FT DLYDSMKKFFAVEEAVVTSTIESEEDKRARSIMEATTVRRGSRIETGLLWR FT NNDVQFPESYKMAMNRLLGLEKCLSRDPELQRRVNEQIDSYEQKQYISKAS FT QEEVDSLNPHRVWYLPLGVVINPKKPNKTRMVWDAAAKVGGVCFNDMLLKG FT PDLLVPLMNVLLRFRQGKVAVCSDIREMFLRILIRDEDKWSQCFLWRTSPD FT EEVQVYVINVAMFGATSSPCTAQFMKNWNASEFSEQYPRAVEAVTKNHYVD FT DFLDSVDSVEEAVTLVEQVQQIHAAAGFQFGKIVSNEQEVLDRLGESSSST FT SKPLPLDKDRTYERVLGVMWSPSEDIFTFSQQAVEEVLDTSWAVPTKRQIL FT RIVMKLYDPLGFVAHFVVHGKILMQEIWRTGTNWDEPIAEQLHELWTRWIE FT QYRNINEVNVPRCFFKSFWPQQISNIQIHVFTDASVSACACVAYVRMTVNG FT ESQCSLIAAKTKVAPLRSLSIPRLELQAAMMGSRLLQNMCSALTISIQRRF FT LWTDSATVLSWLRSDSRRFHQFVSFRVGEILALTSVDEST" XX SQ Sequence 5680 BP; 1612 A; 1236 C; 1491 G; 1341 T; 0 other; tttctacaaa gaatttcatc cgaagttgga tgattgttgc ccgtcgggtg tcgctaccaa 60 gttgtgctgc aaccgggaag acgtccgcca accgaacgac ccacagttga agactgaaca 120 tcatcatccc agccagtacc ggacccggag gtttgattgc atcgaaacac aagtcagtag 180 tcagcttttc tatttcgtgc ctttaccaca gtcgtggtta ctgcagtacg gcagttaagt 240 tagcttgtcc tatagatagc ttagtaggcg aattattaga ttttattatg ccaaaaaagg 300 gcaaaaacgt taccaatacg gaagggatat ataagccatg caaaatatgc aatcagaaag 360 agggtgatga taactgggtc tgttgcgatg ggtgggtcgt gggaacactt tcaatgcgcg 420 ggtgtcaatg agtcaatcgc ggaaaaaagt tgaaaatgcc gtgactgtct cgccgattgg 480 gacggtactg atattacgtt aaaatcggga gtgtccttta atgaatcttc agtgagtacg 540 agcagtagtc gtgcgtctca aagattgcaa cttagtctgc agcacctgga ggagcaacgg 600 aagctgagca aactgcgcgc agcggaagaa ctgaagatga aagaagaaga agctaggctg 660 aaaaagctgc aagcggaaga agaactgaag ctgaaacaag tagaagaagc agccaaatta 720 aagcagctgg atgaagagca ggagtatctg aagcaaaggt acaatttgtt gatgcagatt 780 gaggatgaaa aggaaccatc gagtcgacgt agcggaatta gccagaaaag tagacggact 840 ggagtgaacg tgaaaagtaa gcgtgatcag ttggagaaat ggttgaaggg agctgctgtg 900 gacgaagttg gtcagtcgaa accagttgca tcgggatatc aagctgttcc acaggttcaa 960 acagttagtg aatcggttca gtctcctgtt gtgattaacg agttgcaatc accagctctt 1020 acggttccaa atagtttgtc gcaggtttcc gcttcgtcta acatgaatca cttgtgtctt 1080 cctgcatcgg gaaaggtccc tacattggcg aaatcctatg aacagtttgt atcagggcaa 1140 gtagtctcct cggtgtcaac tgttccagta tcgattccgt ttaggtcagc agatccaaca 1200 gcttcggata cgctgctttc gggaatgcaa ggtcttcgaa tttgtaatga gccgaatgca 1260 gcagttttgc ccactgtcac attaccggcc agagatgcag tagctcattc ggaattgaat 1320 gcaacagcgc aggtttctgt ttattcgagt ggtttgatgc cagcagtata tggaagtcat 1380 ccgtactaca cagccgctcc aatgggactt cgtgcatcaa taaatgcggc agtaccggga 1440 gaagttgctc aagttagcca gtcgcaaagt agtttcggtc gtgttccacc tccagaattt 1500 gcagtcagtt caacagttgg gcatggaaat ggaccttttt caaatccagc ggcaccggta 1560 gtgacaaata tgtttcgacc accattagct tcgattggat tgccagattc tggtttgtat 1620 gaagtacgag ctagttctgc agtagggcca tcaagcgctc aactggccgc taggcaggta 1680 atgccccgtg agctgccaaa gttcaatggc gatcctcagg agtggccaat cttctacagt 1740 tcattcaaga acacgacgga ggtttgcggg tatacagacg ctgaaaattt ggctcgtctg 1800 cagcgaagtt tgggtggctc tgctttagaa gcagtacgaa gtcgtctgct gctaccggct 1860 tccgtgccat acgtgatgga tacgcttcat aagctgtacg gaagaccgga gatactgatc 1920 agttcgcttc tgaagaaagt tcgcaacgta cctccaccta aagcagagaa tctcagttcg 1980 attgttgcct acggcctggc gattcagaat ttggttgatc atatcgtact ggccgaccaa 2040 caggcgcatt tatcgaatcc aatgctgctc caagaattga tcgataagct gctaacctcg 2100 ctgaagatgc agtggggaac ctacaagcaa tcagttggtt acatcaacct ggcgaccttc 2160 aacagcttca tggctggact cgtgaatcta gcgtcagaac tcccagtcga tgttgattcc 2220 gcccaaaatc actacaaaca aattcgggca gaaaagtcaa agcaaaagga gaagttattc 2280 actcacgcta acgagtcttc tgattttccc aagaaggaga aagaggcgtc aattgaacga 2340 acaacagcga aagcctgctc gtattgtgga aacgaaaatc atcagattct aaactgttcc 2400 agtttcaagt cattggatat cggagcgaga tggaaagcga tgcgtcagaa gaatttgtgt 2460 cgtttgtgct tagttcccca tcggaaatgg ccttgtcatt caaaaaaaga gtgcggcgtg 2520 gatggttgtc gtattcggca tcacaagctt ctacacagca gtcagaccag ccggaacgaa 2580 cctgtggaga caacaaagcc aaccgaagca gtgcatcaca acttccacca cacgaaatcc 2640 ttctcgctgt ttcgatatct accggtcaca ctccatggtg acggaaagag tgtagaaaca 2700 tacgcattcc tggatgatgg atcgtcgtcg acactgttgg aggaggcgat cgcggctcag 2760 ctgggaattg aaggagagct ggacaacttt tggctaggat ggaccggaaa gattggcagg 2820 catgagaaga gttctagaag agtctgtctg gaaatttctg gtgctggcaa gcaaaatacc 2880 ttccagttga gcaacgtacg aacggtgcgt gagcttgggc ttcccagtca aaccttaaac 2940 tacaccgatc tgtcgagagt ttatccacat ctacagggtc ttcgagtcag cagttatatc 3000 gatgcaaaac ctggaataat catcggtatc gagcacgtgc atctcataac cagtctgaag 3060 attcgtgaag gaggcagaag tgatccagtg gctacgaaga ctcgcctcgg ctggtgcatc 3120 tacggaagga attccggaac cgaagcatcc gtggaacagt taaacctaca cgtcggacaa 3180 gaaatgagca acagtgactt gtacgattcc atgaaaaagt ttttcgccgt cgaggaagct 3240 gtagtgacat cgacaatcga atccgaagaa gacaaacgag ctcgcagtat catggaagca 3300 acaacagtac gtcgtggatc aagaatcgag acaggtttgc tatggcgcaa taatgacgtc 3360 cagtttccgg aaagctataa gatggcaatg aacagactct tgggattgga gaagtgtttg 3420 tccagagacc cagagcttca gagaagggtg aatgagcaga tcgacagtta tgagcaaaag 3480 cagtacatca gtaaagcatc gcaggaagaa gtcgacagtc tgaaccctca ccgagtttgg 3540 tatttacctc tgggtgtagt gatcaatccc aagaaaccaa ataaaacccg tatggtttgg 3600 gatgcggctg caaaggttgg aggagtctgc ttcaacgata tgctgcttaa ggggcctgat 3660 ctcctcgtac cgcttatgaa cgttctgcta cgattcagac agggcaaagt cgccgtatgc 3720 tctgatatcc gggagatgtt tctgcggatt ctcatccgcg atgaagacaa atggtcgcaa 3780 tgctttctgt ggagaacaag tcctgatgaa gaagttcaag tttacgtgat caacgttgcg 3840 atgtttggtg ctacaagctc tccatgtaca gcgcagttca tgaaaaactg gaatgcgtca 3900 gaattcagtg aacagtatcc tagggcagta gaagcagtaa ctaagaacca ctacgtagat 3960 gatttcctcg acagcgttga ctccgtggaa gaggcagtaa ctctagtgga acaagtgcaa 4020 caaatccatg cagctgctgg tttccaattc gggaaaatag tgtccaacga gcaggaagta 4080 ctagaccgac taggagaatc gagttcatcg accagtaagc cactgccctt ggacaaagac 4140 cgaacatacg aacgcgtcct gggcgtgatg tggagtccat cagaagacat ctttacgttc 4200 agccagcaag cagttgaaga ggttctggat accagttggg cagttccgac aaaacgacaa 4260 attctgcgga ttgtcatgaa actgtatgac ccattaggat ttgtcgcaca cttcgttgta 4320 cacggcaaaa ttctgatgca ggagatttgg cgcacaggga caaactggga tgaaccaatt 4380 gcggagcagt tacacgagct gtggactaga tggatcgaac agtaccgaaa cataaacgaa 4440 gtgaacgttc cgcgctgttt cttcaaaagc ttctggccgc agcaaatcag caatattcag 4500 atccatgtct tcacggacgc tagtgtgtca gcctgtgcgt gtgtggccta tgtaaggatg 4560 acagttaacg gtgagagcca atgctcgtta atagctgcga aaacaaaagt tgcacctctt 4620 cgatcgcttt ctattccgcg tttggaatta caagctgcca tgatggggtc tcgtctgcta 4680 caaaacatgt gttcggctct gactatcagc attcaaaggc gcttcctgtg gacagattca 4740 gcaacagtac tttcatggct tcgctccgac agtcgtcgat ttcatcagtt cgtttcattt 4800 cgtgttggcg aaatcttggc cctcaccagt gtagatgaat ccacctagag ttggcacaca 4860 gtctgtcaac caagtcatgc attatggcct ttaggcgatt cgtcaaccga cgaggggctc 4920 cgctagaggt cttctcagac aatggtacaa attttgtagg agccagtcgc cagttgtctg 4980 aagaaatgca gaaaatccag gcaatcaacg aagactgtgc atctacattt accaacgctc 5040 gcacacagtg gcatttcaac gttcctgcag ctccacacat gggaggacct tgggagcgga 5100 tggtgaaatc agtaaaagtg gcaatggcag caatatcgga cagtccacac cacaccagcg 5160 acgaggtatt cgaaacaata atgctggaag ctgaaggaat cgtaaactct cgcccgctga 5220 cttatgtacc tttggaagca gcagatcaag aagctcttac tccgaatcac ttcttgttgt 5280 acggttcaag cggagtcaaa cagccaacag atcagtcagt cagcctccga gacagttgga 5340 cagtcactaa gaacattgtg gacgagtttt ggcgcagatg ggtgttggag tacttgccga 5400 tgttgactag aaggtcgaag tggttcgaga aagtgaagcc gttggagcca ggtgatctag 5460 tcgtaatcat cgacgagaag acaaggaata gctgggaaag aggacggatt ctggaagtat 5520 cttcggataa atcaggtcaa gtgcgacgtg cagtagtgca gacagctagg ggagtgtttg 5580 ccagaccggc tgtaaagctg gcagttttgg acgttgcagg tggcaacaaa gggtcagagg 5640 aaatcgctcc ggaaacggaa gtggttcacg ggtcggggaa 5680 // ID Copia-1_RP-I repbase; DNA; INV; 2352 BP. XX AC ACPB02045731; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_RP_; KW Copia-1_RP-LTR; Copia-1_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-2352 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02045731; Positions 3563 1212. XX CC 'TTAT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 91..1833 FT /product="Copia-1_RP-I_1p" FT /translation="MNLLNEVMEIDKLKKPEDFQCWKFKITILLKANGLYG FT IVSGEEASPPQDHKDYNLFVQRDAKAQKLIILSIDKIPLTHVMTCNNSKEM FT FKKLKEVYERDSTQQKFDLMQEFFNFKFNNSVDISTNISILENLVFRLKAL FT NNEINDSMVMSKILTSLDSKYRHFWSAWESTPNEEKTLSNLITRLANEEKR FT FSCSEDVAVAFKSVIKNKTNQTYSTPVKCFNCSKPGHISKFCPNKRQSKFC FT KICKKNNHEEKNCYFRKNKHPEKHDISFLANEASNSKWILDSGSSSHITND FT EIVLQGKRNSSGLYSVDLDIKERVEANFAENLNPRTWHRRLGHVSQECINK FT LSGMVDGLNLKNERPQFDCETGMQANLNKIISLEDEVKQMYVELRASKDEL FT LKKIEEESLMTNKYMKGKGELKTDVKPLENHILELNEEKRNLECKLKKIQS FT ELHYNQVSEDKIQTDIITAPNGNSPVKGREITGKNRKERRKKLLQLLTEKE FT KSERNLEKKEKKLRFADEPENTPVETYAKELEQLSETSDREVKEEEEDTVG FT DEKSATTYQMLKIKGLTPKRKKEVRNPRVRYKNK" XX SQ Sequence 2352 BP; 968 A; 327 C; 466 G; 591 T; 0 other; ggttatgggc ccaggccgtt gtccatctta aaaactacta aaggaatata aaggttagtt 60 gcacgttgtt acaataaaaa tccgtaaaaa atgaatctat taaacgaagt gatggagata 120 gacaaattga agaaacccga ggacttccag tgttggaaat ttaaaataac aattttattg 180 aaggccaatg ggttatatgg cattgtgtct ggggaggaag cgtcacctcc acaagaccat 240 aaggactata atttgtttgt tcagagggac gcaaaagcgc aaaaacttat tatcctctca 300 atagacaaaa taccactgac acatgttatg acttgcaata actctaaaga aatgttcaaa 360 aaactaaaag aagtgtatga gcgtgattca acgcaacaaa aattcgattt aatgcaagaa 420 ttttttaact tcaagtttaa taatagcgtt gatatatcaa ctaatattag cattttagaa 480 aacctggttt ttcggttaaa agccctgaat aatgaaataa acgacagcat ggttatgtca 540 aaaattttaa ctagtcttga cagtaaatat agacattttt ggagtgcctg ggagagtacc 600 cccaacgagg aaaaaacact gtcaaattta atcacaagac tagcaaatga ggagaaacgt 660 ttctcatgct cagaggatgt tgcagtagca ttcaaaagcg tgattaaaaa taaaacaaat 720 caaacttatt caacaccggt aaaatgtttt aattgttcga aaccaggaca catatctaag 780 ttttgcccca ataagaggca aagtaaattc tgtaaaatat gcaaaaagaa taaccatgaa 840 gagaagaact gctattttag aaaaaataaa caccctgaaa aacacgacat atcttttctg 900 gctaacgaag catcaaactc caagtggata ttggattcag gatcttcgag tcacataacg 960 aatgatgaaa tagtgttaca aggaaaacgg aattccagtg gcctatactc ggtagatctg 1020 gatattaagg aaagagtaga agcgaacttt gcagagaatt tgaatcccag aacatggcac 1080 agacgtttgg gacatgtaag tcaagagtgt attaataaac tcagtggaat ggtggatgga 1140 ttgaatttaa aaaatgaaag acctcagttc gactgcgaga caggtatgca agctaattta 1200 aataaaataa tttctttaga agatgaagta aaacaaatgt acgtagaatt acgagcgtct 1260 aaagatgaat tgctaaaaaa gatcgaagaa gaatctttga tgactaataa gtatatgaaa 1320 ggaaaaggtg aattaaaaac ggatgttaaa ccactagaaa atcatatttt agaattgaat 1380 gaagaaaaaa gaaatctcga atgtaaactt aaaaaaattc aaagtgaatt acattataat 1440 caagttagtg aggataaaat ccagactgat ataataactg caccaaatgg aaattcgcct 1500 gttaagggta gagaaatcac tgggaaaaat agaaaagaga gacggaagaa attactgcaa 1560 ttacttactg agaaggagaa atcggagaga aatttagaga aaaaggagaa gaaattaaga 1620 tttgctgatg aaccagaaaa cacgcctgta gaaacttatg ccaaagagct ggaacagctg 1680 agtgaaacca gtgacagaga agtcaaagaa gaagaagaag atacagtagg agatgaaaaa 1740 agtgctacaa cctaccaaat gctcaaaatt aaaggattaa cccctaaaag aaagaaagaa 1800 gtgagaaatc cgcgagtaag gtataaaaat aaatgaagaa gagcttatag cttactcaga 1860 cgccgactat gcgggagacc ctgagactcg tcgctcaaca actggattcg tgatatacta 1920 cggaggcggg ccggttagct ggtgctcgcg aaagcaaaat atagtggcat tatcaagtgc 1980 ggaaagtgaa tatattagta gtgccgaatg ttgtaaagag ttaattttta taaaagaact 2040 catatatgaa ctaactggta aaaatttgaa ttgtacatta tttatagata ataaaagtgc 2100 tattgatata ataaagaatg gtaacataaa taaaagatca aaacacatag atgtaagata 2160 ccattatata aaagaaaaat atgacaactg tgaaattaat gtaaaacatt gtagttcgaa 2220 tgatcaagtc gcagacattt tcacaaaacc tttaggtgga gtaaaattca ataaatttaa 2280 agacttaatt gtttcataaa taaatgcata cctgttatag ataaagataa aattaggggg 2340 tgtgttaaat aa 2352 // ID Sola1-5_AA repbase; DNA; INV; 2811 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-5_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2811 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(364..894,936..1463,1414..2346) FT /product="Sola1-5_AA_1p" FT /translation="MALQLLKDYGSGSSSEEEFEGFPAEAQVVSTFPLSSS FT ATSTDCNDSDGSFTNEPVLKKRKLTKKERMKRSVAKGRRNTHRAKLTMCRC FT HRNCNDHVSHDEIKAVNHAFWILDLEGQKNWIRQQVTRREISRRSTNWFEN FT DLRKNTSFLFHLPTSNGTQVEVCRKKFLNAIGYGEDCGTYFCSNMVYRCFT FT EGEDGLPVPSRRGKHKRTTPKRDAVRAHILSYNPTISHYRRKHAPKRLYLP FT SDLTERAMYNDFVASSDAVSYALYCSVFKDLNISLVKLGHEQCEACVGFEQ FT HDRQTGHGNPPVDQPNCSICQVQLEHLKQSRTSRDEYRKDGEMLKKGVVVY FT SVDLQKVSNRRRGSWSILWTCRKLVTDNDRKYVYYLIILQFKVIQLPRLEG FT LKTVVFSQRLVAYNETFAPVGSYAETQPVLPCLWTEATAGRASENILSTFI FT KFLKHHKTVREITLWMDNCSAQNKNWNLFGFLILFLNSAETDTKTLNIKYF FT ESGHTFMAADSFHAAVEKQMKSSPPLTYPDFEEVVHRSKRNVEVLSMGVED FT FIELDFNISQYTLNQLKPRPYIDNIRLVQLTKGSLSFRYSDHVDACAEDLI FT SCELFSKKQRKTITAPAYTLEKAFRRQQNPVGISSERKDTLINTISPLIEE FT SKLIYWTSLATKQT*" XX SQ Sequence 2811 BP; 841 A; 608 C; 625 G; 737 T; 0 other; ctgccgctct aatctagtcc gtcctatatg taaaaagtga gtgctgagat aaagtcagaa 60 aaagtcaaaa acattgtcca ccattttttc actatttaac attcattcaa aaacaaactt 120 atacccacat catcttcttt gcggcacaaa ctttaataat ttttattaaa aatagaaaaa 180 tattttcaat aaattcatta ttttcaattt tttgaatgcg agacagactt gactttattg 240 aatcccaaat acaggcagat cagtctttgt gatcaaagaa ggcgaatttc tacaatctgt 300 cgagaatcca aacttcgatt atcacgtcgc gtttttcatt gccagttccg ggtgtgatac 360 aaaatggccc tacagctgct caaagattac ggaagcgggt ccagctcgga agaggagttc 420 gaaggatttc cggcggaagc acaagttgtg tcgacgttcc cattgtcttc ctcagccaca 480 tcgacggatt gcaacgattc cgatggaagt ttcacgaacg agccggtttt gaagaagcgc 540 aagctgacca agaaggaaag aatgaagcga tctgtggcca agggtcggcg caacactcat 600 cgtgctaaat tgaccatgtg ccgatgccac cggaactgca atgatcatgt ttcccacgac 660 gaaatcaaag ctgtcaacca cgcgttctgg atccttgatt tggaaggaca aaagaactgg 720 attcggcagc aggtgacgcg tcgcgagatc tcacgtcgtt ccacaaattg gtttgagaat 780 gatctgcgta agaacacttc gttcttgttc caccttccaa catctaacgg aacacaagtt 840 gaagtttgcc gcaaaaagtt cctgaacgcg atcggttatg gcgaagattg cgggtaaagg 900 ttttcgatca gctggacatc cttagtttta actagactta tttttgtagc aacatggttt 960 atcgatgctt cacggaagga gaagatggat tgccggtgcc atcacgacgt ggtaaacaca 1020 aacggaccac gccaaagcgt gacgcagtcc gggcacacat tttaagctac aacccgacca 1080 tctcccacta ccgtcgcaaa catgcaccaa agcgattgta tcttccatcg gatttaaccg 1140 agagggcgat gtacaacgac ttcgtggcca gtagcgatgc ggtaagctac gctctgtact 1200 gcagcgtgtt caaagacttg aacatttcgc tggtcaaact tgggcacgag cagtgcgagg 1260 cctgcgttgg attcgaacaa catgatcggc aaactggtca cggaaacccg ccggtggatc 1320 agccgaattg ctcgatttgt caggtgcagc tggaacatct gaagcaatcc cgaacctccc 1380 gggacgagta caggaaggat ggggaaatgt tgaagaaggg ggtcgtggtc tattctgtgg 1440 acctgcagaa agttagtaac agataatgat agaaaatacg tttattacct aataattttg 1500 caatttaagg tgatccagct accgcgtctc gaggggttga agacagttgt gttcagccag 1560 agattggtcg cgtacaacga gacgtttgcc ccagtgggaa gctacgccga aacgcagcca 1620 gtgctgcctt gtctttggac cgaggcgaca gctggccgtg cttctgaaaa tatattgagc 1680 acgttcatca aattcttgaa gcaccacaaa accgttcgcg aaatcacgct gtggatggac 1740 aactgttcag cgcaaaataa gaactggaat ctttttgggt tcctgatttt gtttctgaat 1800 tccgctgaaa ccgacacgaa gacgctgaac atcaagtatt tcgagtcagg gcacaccttt 1860 atggcggcgg actcgtttca tgctgcagtg gagaagcaga tgaagtcgtc gccaccatta 1920 acgtaccctg acttcgaaga agtggtgcat cgctccaagc ggaacgtgga ggtattgtcg 1980 atgggggtcg aggattttat cgaacttgat ttcaacatct cacaatacac cctaaaccaa 2040 ctgaagcctc ggccctacat cgacaacatc cggctggttc agttgacgaa agggagcctg 2100 agcttcaggt acagcgacca cgtggacgca tgcgctgaag atctaatcag ctgtgagctg 2160 ttttcaaaaa aacagcgcaa gacgatcact gcaccagcct acacgttgga gaaagcattc 2220 cggcgacagc aaaacccggt cggaataagt tccgaaagga aagatacgct gatcaacaca 2280 atctctccgt tgatcgaaga gagcaaattg atttactgga cgagcttggc gaccaaacaa 2340 acttaatttt tgttttaact gcaatattct gtactatccc atcataacat catttccgtt 2400 aaaaataaac gagcgatttg taaaaaatta taaaatgatt gagttcattc ataagtgacg 2460 gaaataatta gcatttaggt gtctaaactc aaacaaactg ttgaaaaaca gtcaaaaatc 2520 actaaatgtt ttccgatttg ttcttaatta gttaatatat atttaattta tattatttta 2580 ccattttgtt ctaactttga atgatcttgc aaagtgaatt tattgctgtg actgttctgc 2640 cttgattttg catatgggac agcaccaaag tgatttttgt tttcgaaatt atagaacaaa 2700 ctactatttt ttattcaatt atataataaa tcaacataaa aaataacgtt tcccaggttt 2760 atctcaaaat cgatcaaaat acatatggga cggactagat tagagcggca g 2811 // ID Crack-8_BF repbase; DNA; INV; 2840 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-8_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-8_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2840 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2840 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 813-813 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 3..2588 FT /product="Crack-8_BF_2p" FT /translation="TERTDMTQPGLEALFCQIQLPFTKPIVVGSIYRPPNA FT TVDFYSLLRDSLETWSTCTPNSELFLLGDMNIDLSNTDTSGARHITNLSQE FT YQLQQVIDKPTRVLQQSSTIIDHIYCSDLQQVCEHGVVHSTLSDHYAVFCV FT RKAKRPKSAPRYITSRKFSNFDENNFLTHLRTLXWDSVLQSTCVEEAWGLF FT RSLFITVSDLHAPYISKRTKANQPKWLTSDLQRLMHDRDSTKAKARRTGAT FT VDWEGYKAKRNHVNKLVKQAKAMYCQQKLKENMSNTSKLWATIKEVLPNKN FT PVVTKSLHWGGQFYKDPSSIVSCFNNFFVSIGNKLAEKFTKQTGNLVCPDK FT YKKLTSTFAFKEVTEVTVYDKLRVLGTNKATGLDKIHSRLLKAAAPCICKP FT MTHIFNLSLSSGIIPKEWKTAKVTPIHKGGEKEDPNNYRPISVLPVIMKAF FT EKEVHGQFLEYLHHHSILSEQQSGFRPGHSTATTLLDVTDHVYQNIDSGNL FT VGAVFLDLKKAFDTVDHCLLLKKLSWIGVKDVELKWFTNYLSDRKQTVSLD FT GCVSDYLPVTVGVPQGSILGPLLFIMFINDLPDCISNKVCLYADDTAIFCS FT SNDTRYIEATLNSELSHIATWFQLNHLTLNASKTKWMLFGSSGKLKKAHPV FT SIKIGLESIERVVSFKYLGLILDDKLLWNEHVDKLCSKVTQRVGLLRRLRP FT CFTVNIADMLYKAMVLPLLDYCDTVWDSCGVGRQQQLQVLQNRAARVVLQL FT NLQSSSVLNLHEKLSWQYLAGRRRDHVCIMVYKCINGLAPTYLSSIFSHNH FT NLHNYQTRQTSLLYKPFFKTTTGQRTFSYRGAKHFNSLPAYIKQVQSLNSF FT KSALKDLT*" XX SQ Sequence 2840 BP; 882 A; 639 C; 569 G; 749 T; 1 other; acactgagag gacagatatg acacaacctg gattagaggc acttttctgc caaatccagc 60 tgccctttac caaacctatt gttgttggtt ctatctacag acctccaaat gccacagtgg 120 atttctactc actgctaaga gacagtctag agacctggag cacctgcact ccaaactcag 180 aactgttcct gcttggggac atgaatattg acctgagtaa cacagacacg tcaggagcca 240 gacacatcac aaaccttagc caggaatacc agctacagca ggtaatagat aagcctacaa 300 gagtgctcca acaatcgagt actataattg atcacattta ctgtagtgac ttacaacagg 360 tgtgtgagca tggtgtagtg cactccaccc tttcggacca ctatgcagtg ttctgtgtca 420 ggaaggcaaa gcgacctaaa tcagcaccta gatatatcac ctcacgtaag ttctccaact 480 tcgatgagaa caatttcctg acccatctta gaacactaar ctgggacagt gtcctacagt 540 ccacctgtgt ggaggaagcc tggggactgt ttaggtctct cttcatcact gttagtgatt 600 tacatgcccc atacatatca aaacgcacta aggcaaatca gcccaaatgg ctcacatctg 660 accttcagcg attgatgcat gacagagata gcactaaggc gaaagcccgt aggacaggtg 720 ctactgtaga ctgggaagga tataaagcca aaagaaacca tgtcaacaaa ttggtcaaac 780 aggccaaagc aatgtactgt caacaaaaac taaaagaaaa tatgtcaaac acaagtaagt 840 tatgggcaac aataaaggaa gtacttccaa acaaaaaccc tgttgttacc aagtcacttc 900 attggggagg gcagttctac aaggacccct ctagcatagt ctcctgtttc aacaacttct 960 ttgtttccat tggaaacaag ttagctgaga agttcacgaa acagacaggt aacttggttt 1020 gcccagataa gtacaagaag ctaacatcca ctttcgcctt caaggaggta acagaagtta 1080 ctgtatacga caagctcaga gtcttaggga ctaacaaggc cacagggttg gacaagatac 1140 actcaagact gctaaaagca gctgcaccgt gtatatgcaa acctatgacc cacatattta 1200 atctgtctct ctcatctggt atcattccta aggaatggaa gacagctaag gtgaccccaa 1260 tacacaaagg gggggaaaaa gaggatccaa ataattatag gcctatatct gttcttccag 1320 ttatcatgaa agcttttgaa aaagaggtac atggtcagtt cctggaatat ctccaccatc 1380 acagcattct ctctgaacag cagtctgggt tccggccagg tcattccact gctactacat 1440 tactggatgt cactgaccat gtgtatcaaa acattgacag tggaaaccta gtgggggctg 1500 tgtttttaga cctgaaaaag gcttttgata cagtagatca ttgcctgtta ctgaagaaac 1560 tgtcctggat tggggttaaa gatgtagaac tcaaatggtt cacaaattac ctatctgata 1620 gaaagcagac agtgtctcta gacggttgtg tatctgatta tctcccagtg acagtaggtg 1680 tgccacaagg gtcaatcctt gggccattgt tatttattat gtttataaat gatctacctg 1740 actgtatctc caacaaagta tgtctctatg ctgatgatac ggcaattttt tgctccagta 1800 atgacactag gtacattgaa gccactctga attcagaact ctcccatata gctacatggt 1860 tccagttaaa ccatttgacc ctaaatgctt caaagacaaa gtggatgctc tttgggagca 1920 gtggtaagct taagaaagca catccagtgt caatcaaaat tggactagag tcaatagaaa 1980 gagtggtttc atttaaatac ctagggctta ttcttgatga caaattgttg tggaatgaac 2040 atgttgacaa actctgctca aaggtaaccc aacgagtagg tttactcaga cgtcttagac 2100 cttgcttcac tgtaaatata gctgacatgt tgtataaagc aatggtactc ccccttctgg 2160 attactgtga caccgtgtgg gatagctgcg gagtggggag acaacaacaa ctccaagtgc 2220 tacagaaccg ggctgccagg gtagttctgc aactcaacct acagtccagc agtgtcctca 2280 accttcatga gaagctcagc tggcagtacc tcgcagggag aaggcgggat catgtgtgca 2340 ttatggttta caagtgtatc aatgggctag caccaacata cctatcatcc atcttctctc 2400 acaaccacaa cttacacaat taccaaacca gacagacatc actcctatac aaaccctttt 2460 tcaagaccac aacaggacag cgtacatttt catacagagg tgctaaacac tttaattcac 2520 ttccagccta cattaagcaa gttcaaagcc taaattcatt taagtcagct ttgaaagacc 2580 taacatagtc tcagtactga gtagtgacct ctgacctccc aaactgttga ccttggatcg 2640 gattatgatt ttgatttatc tatgtctttc atgtttcagt tgtatttata tgttccgacg 2700 tatatgtcta cttttctctc tgtaagttgt ttttatttcc ttgtatgtat atgtgaatac 2760 tgggccctat tgaaaaccag tgtactagca ctgaataggc tacccaggtg tatgaaaata 2820 aacaaacaaa caaacaaaca 2840 // ID Gypsy-8_OD-LTR repbase; DNA; INV; 269 BP. XX AC CABV01000161; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_OD_; KW Gypsy-8_OD-I; Gypsy-8_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-269 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000161; Positions 12992 12724. XX SQ Sequence 269 BP; 54 A; 83 C; 44 G; 88 T; 0 other; tgtagtggtt attctaccgt gacattaacg cttcacgtac gccaacccta gttggcgtcc 60 gacaacccta gtcggcgtcc gcctcgtccg cctcgtccac ctcgtccgcc ttgtccgcct 120 tatccgcctt tgcggctata taagcgaccg ctttttcata tttcctcact ctcaatatta 180 cactgagtca tttacaaaac tatacctttt tgtaataaaa ctccgacaaa cttttgttgt 240 ctttttgttt gtggtctcca gccactaca 269 // ID hAT-N11_AP repbase; DNA; INV; 631 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N11_AP. XX NM hAT-N11_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-631 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2111-2111 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 631 BP; 234 A; 103 C; 98 G; 196 T; 0 other; cagcgtttct taaactgtgt tccgcggaac cccagggttc cgcagagatc acttcagggt 60 tccgcgaaac ttaagtgctt tttttcaaaa ttttaaatcg atctctgtgt taaatgttgg 120 tgcatttcaa aagatgaata tcctcaattg tcacaaaagg ccgtgttggc acttttacca 180 tttgtaacca cttatatgtg tgaaacgggg ttcaccacct atgtaccaac aaaaacaaaa 240 taccgcaaca gacttgatgc cgaacctaat atgagaatac aactttcatc aataaagccc 300 aatattaaga atatttgtaa taataaaaag taatttcact catctcatta aaattaaaat 360 ttagaagtcc tgttattgtc tgttatttaa aatatttttt gaattaaata cagaatagtg 420 tgtacacatt attattaaac gaaattatta taatttttat tgcaaataat ccttacataa 480 aatgacacat taaaactatt gaaacaaagt atgggttcta attgtcgagc tataataact 540 tactgttgaa agctaaaaaa aaagtagggg ttccacgata aaagtggaca taaaaaaggg 600 ttccacggat aaaaaagttt aagaaacgct g 631 // ID Dneoca19 repbase; DNA; INV; 430 BP. XX AC GU229947; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mauritiana subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dneoca19. XX OS Drosophila neocardini OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; cardini group; OC cardini subgroup. XX RN [1] RP 1-430 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229947; Positions 1 430. XX CC Clone Dneoca19. XX SQ Sequence 430 BP; 137 A; 102 C; 96 G; 95 T; 0 other; gattcttcgg taaaaaagaa agggtgtttt gataaataga gagacctgcg atgaaaagtg 60 gatatactat aacaacccga agcggaaaat atcgtggcga cttcatggcc atgcatcaac 120 ttcaaaagca aaacctaata ttcatggctc aaagctgatg ctgtgtatct ggtgggacca 180 gctgggcata gtgtactatg aacttttgaa accaggccaa accatcaccg gagacctcta 240 tcggactcaa ctgatgcgtt tgagtcgagc gttgagagaa aaacgcccgc aatactcgga 300 aatgcacgac aaggtcatac ttctgcatga caacgctcgg ccacatgttt ccaaggtagt 360 aaaaacatac ctggaaactt tgaaatggga catcttaccc cacccgccgt acagtcctga 420 cctagcccca 430 // ID Gypsy-37_AA-LTR repbase; DNA; INV; 839 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_AA_; KW Gypsy-37_AA-I; Gypsy-37_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-839 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 2718393 2719231. XX SQ Sequence 839 BP; 265 A; 163 C; 184 G; 227 T; 0 other; tgtagaataa gagagcgatg caataaatag atatgaaaaa tgataaatta tgttgaaata 60 cttttcctgg tgaaagaatc ataaatcatc ttgaattaga aaccctcatg aatatgcctt 120 tactcgggaa cgaaaacggt caattatttc tccaatttgt ttcttcttca ttgcttcatt 180 ttgcttgctt ctgattggtc gtaataactt gttttctgat tggccgatgt aactcgtttc 240 ttattggttt tcgctgcttg tttctgattg gccgttgttc atccgccatg ctatataaag 300 cgatggcatt gctcggtcaa gctcattcat cagtgtaacg ttgtgagtga cacatatcag 360 cagcagcagt agcagagaac gggagaaaag cagagaaacc agtcggcgag aaaccagcag 420 cagcggagcg gcgaaaagtg agctgagcca cacagtgaca cattgtcgga gcgagctggt 480 ggcaaagcgg tacagtgagg catagcgata tagtgacgca cactgataca gtgagtattg 540 cggtaaacta aacggccctt ttttagggcc acaagttatt aagagaaaat agttcaaaat 600 gcaacagttc tttttagagc taccagttgt taagaaaaca atcctaaacg gccctttttc 660 agggccacga gttattaaga gaaaccagtt cacaatgcaa cagttctttt tagagctaca 720 agttgttaag aaaaaaaaaa aataaaaatc ctaaacggcc ctttttcagg gccacgatca 780 agcggattca acagtagcgg ataccaaaag gagcctatcc accatttccg gttacgaca 839 // ID P-14_HM repbase; DNA; INV; 4966 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4966 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 360-360 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 895..2940 FT /product="P-14_HM_1p" FT /translation="MALALHGSKKVDKKLIYNDFEQFLKKIKVAAFTTWTV FT ETSQNYVLFKVIDGKHFIPEIELHVDESLAYTIRAYGWLLPDTHQIYKSNL FT RSVRYTMISQFLNHIRDFYLCNGISSNEIENFLLVKRHVVSKLFSIEDEKC FT DDGPTNETVYFRSSTCSILQIESSICKECKILTINSSKAINAKKKILKTPA FT SIKAPVKYTAPERLLLTLQNQRSENRYLKKQILELRQAIEKNAVSVGENIH FT QDLQKIINSFSEKVNLNRGVSSFMKLFLSEQLKYLTEHPSQTRYHPMIIKY FT SLALYAKSPAAYDLLRLQKGGKGILVLPSQRTLRDYKNYIKPQRGFNSQYV FT QELAKLVSNFSALEQYIILSFDEMKIQNDLVWDKHTGDLIGFVDLGDDDLT FT HATFEKTDTLASHILVFFIKSAMNPLSYSFATFATDSIMSFQIFPIFWRAV FT AILERTCKLKVIAVTCDGASPNHAFFKMHMNLNDNKGNDIIYRTKNIYADD FT KRYICFFSDAPHLIKTSRNCLSNSGSGRCSRYMWNNGKYLLWSHIAKFYYE FT DMECDLNYFPKLTRDHIVLNAYSVMNVKLAAQILSDTVGNTLLEFGPDEAS FT GTANFCLLMDKFFDCCNVTNTKEYLKNRKVFLKPYTDCSDERFYWLENTFL FT KYFVDWRESIEKRPEKFSKKELNIKNVHIKSNI" XX SQ Sequence 4966 BP; 1767 A; 723 C; 721 G; 1755 T; 0 other; catagtcaca agaatagata gtctggaagc gcacaaaagc ttattatatt atctctttgt 60 acccgaaaaa actagaaccc agatcctttg aagtgcaaaa cattaaagaa ataaaaataa 120 actcgttcag gtcatagtca caagaaagat ctattcttgt tacaaagttc aaataagttt 180 agttttgcgt caaataatcg tgcttgttac aatttataaa ttcttttcat tattctagtt 240 gtatcgttgg tttaactttt tagagactga aatttttgtc ataagtttaa acttctagag 300 gtaaaaaaaa aatattattt cagttacaat ttagtttatt aaacgtgttt aatatactag 360 agtggagctg aaatcagttg tttgaatcta gatttatcat gcctggagca aactgcgcag 420 tttatggctg tggaagtagt aggtataaaa aacaaggaga agatacagtt agtattttta 480 aacttccaaa cccaaataga agcgaacttt tctcaaattg gaataaagaa atactgagaa 540 ttatcacaag agatagagtt gtatcccttg attttaaaaa acaaatcaaa aagaatagtg 600 tttatatatg tgaaaagcat tttgaatatt cagatatata cacctgtaag tgtgcttaat 660 aagtgttgat atttattgag tacgcgtata cattactttt attataattt tatgataggt 720 tgaattacag acagtaaagc aaatactatt taacaaattc tttttaatta tttttcagat 780 caacacaaaa aagaattaaa agaatttgca cttccatcaa aaaacttacc aaaaaaaact 840 ttatctgaag atctcactgt gcatcgtaaa aaaacagcac ttgacaaacg agctatggct 900 ttagcattac acggtagtaa aaaagtagat aaaaaattaa tttataatga ttttgaacag 960 tttcttaaaa aaattaaagt agcagcattt actacatgga ctgttgaaac ttcacaaaac 1020 tatgtgttat ttaaagttat tgatgggaaa catttcatac cagaaattga gctacatgtg 1080 gatgagagtc tagcttatac tattcgagct tatggttggc ttttaccaga cacccatcaa 1140 atttataaga gtaatttacg atctgtgcga tacaccatga tatcacaatt tctcaatcac 1200 attcgtgatt tctatctgtg caatggaata tcatccaatg aaatagaaaa ttttttgctt 1260 gtaaagagac atgtcgttag caaactattt tcaatcgaag atgaaaaatg tgatgatggt 1320 cctacaaatg aaacagtgta tttcagatct tctacatgca gtattctaca aattgagtca 1380 agtatttgta aagaatgtaa aatattaact ataaatagta gcaaagccat taatgcaaaa 1440 aagaaaatct taaaaactcc agctagtatt aaagctcctg ttaaatatac tgcgcctgaa 1500 agattacttc ttactctcca aaatcaaaga tctgaaaata gatatttaaa aaagcaaatt 1560 ttggaactca ggcaagccat tgaaaaaaat gcagtatcag ttggagaaaa catccatcaa 1620 gatcttcaaa aaattatcaa tagtttctca gaaaaggtaa atttaaatag aggagtttcg 1680 tctttcatga aattattcct ttcagaacag ttaaaatatt taacagaaca tccatctcaa 1740 acacgatatc atccaatgat aataaaatac tctttagctt tgtatgcaaa atctccagct 1800 gcatatgatc ttctgcgttt acaaaaaggt ggaaaaggta tacttgtttt gccttctcaa 1860 cgcactttaa gagactataa aaactacatc aaaccacaaa ggggttttaa tagtcaatat 1920 gttcaggagc ttgctaaatt agtttcaaac ttttcagcac ttgaacaata cattatactt 1980 tcttttgatg aaatgaaaat tcaaaatgac cttgtgtggg ataagcacac aggtgattta 2040 attggttttg ttgatcttgg tgatgatgat cttactcatg caacttttga aaaaactgat 2100 actttagctt cacatatttt agtttttttt attaaaagtg caatgaatcc cttatcttat 2160 agttttgcaa cttttgctac tgatagtatt atgtcattcc agatttttcc aatattttgg 2220 cgtgctgttg caatattaga aagaacatgt aaattaaaag ttatagctgt gacttgcgat 2280 ggagcttcac caaatcatgc attttttaaa atgcatatga acttaaatga caacaaagga 2340 aatgatataa tatatcgcac aaaaaatatt tatgcagatg ataaacgata tatatgtttt 2400 ttttctgatg ctcctcatct tattaaaaca tcaagaaact gtttgagcaa ttctggatct 2460 ggccgttgtt caagatatat gtggaataat ggaaaatatc ttctctggag tcatattgct 2520 aagttttact atgaagatat ggaatgtgat ttaaattatt ttcctaaact tactcgagat 2580 catatagtat taaatgcata ctctgttatg aacgtaaaat tagcagctca aattttaagt 2640 gatacagttg gcaatacatt gcttgaattt gggccagatg aagcttctgg gacagcaaat 2700 ttttgtctgt taatggataa attttttgat tgctgcaatg tcacaaacac taaagagtac 2760 ctcaaaaatc gtaaagtatt tttgaaacct tataccgatt gttcagatga aaggttttac 2820 tggttagaaa acacttttct aaaatatttt gtagactgga gggaaagcat agaaaaaagg 2880 ccagaaaaat tttctaaaaa agaacttaat atcaaaaatg ttcatatcaa atcaaacata 2940 tgaaggtcta tctagaacgg taaactcctt agttgaatgt gttcaattct tactaagcaa 3000 aggattttta tatgtattaa ctgaaatatt ttgtcaagac gcattggaaa actactttgg 3060 taagcagaga actataggtt gcagaaggga taatcctaac cttaaagata ctggatacaa 3120 tgataatatc ataaaatctc aatttacaat tcaacccctt ggtggaaatg ttaggcagca 3180 tactaatgaa tgggtaattg atgatacacc ggttcccaag aaacttagaa aaaataaatt 3240 aacttaattg aattatgatc tttttttttg gaaaacctta tttcataaaa cattttttta 3300 attagtttat cagtagatat tatttgtctt ccccaaaatt actgttagtt tatttaaaaa 3360 tcaaatttaa cctaaaccta ttataattta actctgattg ctgataaaag cctatacttt 3420 taaaatttta aagaatatca tacattattt tgtaaagaga aaaaacacaa catttgatta 3480 aaataaataa gttggttaaa ataataaatt aataaaagta tttgttctaa ccaataatgt 3540 aattatttta atttgtagct atgaaaatag ctgtttgata agtattataa actggaacta 3600 taatactctt ataaatatgg tttttctata catctgcata gattgataaa atacggatga 3660 gaggaaatat gattgtaaat ctcagctatc gcaaatattt ttcttgatct tatttgattt 3720 tttttaatga tgatcgtaaa gattctgctt ttgttatttt ttttttgttt ttgaattttt 3780 cttttatttg ctttgtgtgt gaatgtgaac gaacacgaat aaaaagcatc actatatgat 3840 ataataaagt tttagcaact tctttgtcta tttctgtttc aggaaggtca catatttgat 3900 aaaagcagga cttgacaaaa cagttattta ataattgtga aaccaatact ttactgtcaa 3960 tttttctaaa atttacgcta gtatgtattc taaatttttt ttcacattca ctaaaaatat 4020 ttattactag tgtattcatt ttccacaatc cacctctatc tctactatct acaagttttt 4080 gttcctcttg agtttttgat gaaagtataa tatcaagcca ttgtccagac attgcaccat 4140 cgttaatttt tttatttctc aacttagagt acagtttttg aaatacataa cctgcaatat 4200 attctaagca tcttatgtct ttttcagaga gatttatttc tagattcgaa tcttcaccaa 4260 taagtttaga ttcttttgta atcaaaaaat ttagacataa tgttgttaat tctgctaaca 4320 ataaataata tagttccttt ggtattgttt ttatcgcaac tgactcagtt gtatacttat 4380 ggtggagtga acaataaaat ttttcggcat tgcctttaaa ttcatgcatt tcttttaaaa 4440 aaaaagattc aacttgtaaa tactcagatt cagtgacaac catgttttta aaacattctc 4500 gatcttcatt acaaaaacaa tcatctgttg acagtttatg tcttgcgtta ttcaatagat 4560 ctttgaattg tgtactttga atacataatt gtatattttt atttaaatca actgattcac 4620 taccgacaac ctcgacaagg ttgtcagtag gcacaagttc tttgtgctta gagtttatat 4680 gccttttcag accactagtt gttttgaata ccttcgagca atcactacaa tttaaagtct 4740 ctttacattc ttctaaagct gctatttcga tttcttgata aaatgaatcg ctttcaaact 4800 gatctgtttc tatcaaactc aaaagaactt gcaactctcc actttccatg attcacttta 4860 atgtttgttt tgcactttca gcgggccgaa agtacaattt tgacataaaa gttcaaaatc 4920 aattgttacg gcccgtgttt ccagactatc tattcttgtg actatg 4966 // ID BEL-62_CQ-LTR repbase; DNA; INV; 491 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-62_CQ_; KW BEL-62_CQ-I; BEL-62_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-491 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 278-278 (2011). XX DR [2] (Consensus) XX SQ Sequence 491 BP; 176 A; 90 C; 96 G; 129 T; 0 other; tgttgcggtt gctggcagta cgggaaaaag caaacgacac ttctttagcg agattagacc 60 ctttgggaga gtattctaca actgccaaac gtctaacccg cttaagaaac atgggaaaag 120 gaaaacaaac aaataccgga gacacataga cattaaaagt ggattccaga taaacgtaga 180 ttaatttaat taatcctgat gtgctcaaac aagtaaatgt aagataattt ccacataatt 240 tgatcctcaa aggcaattat ataacaatat ttgtgtgaca gggcaaacca gcaccagcag 300 caattgttgt ggtcgaacga taaggattgc acctacagag gaagactaaa tgtgagaatt 360 tgattattaa cctattttgc tctacaataa taacaataaa ttgtttcagc tttgagctgc 420 ggctaagaac agaatcgatg tcgctgctgc aaaaactttc tttagttgat ccgaaattcg 480 tcgtcacaac a 491 // ID BEL-614_AA-I repbase; DNA; INV; 6961 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-614_AA_; KW BEL-614_AA-LTR; Pao_Bel_Ele18; BEL-614_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6961 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5938-6516] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1570..6879 FT /product="BEL-614_AA-I_1p" FT /translation="MAPTKKSPAKKTPNLKQLMTKLKQIEAAFHDIRRFSE FT RIQECTSTTQILLRLEKIDELWEKYGATLVDIESHEDFDDIEEDGDPLDKQ FT RQEFSDSYYEVKSVLTDQVRERQEPASLNQSIRGDPQTSATFDHVRLPQIK FT LQTFNGEIDEWLSFRDLFSSLIHWKADLPDVEKLHYLKGCLQGEPKTLIDS FT LQITAANYQVAWGMLTKRYNNSKLLKKRQIQSLLTLPVIAKESASELHSLV FT ENFERIVQTLDQIIQPGDYKDLLLVNLLTTRLDPSTRRSWEEFSSTKEQDT FT LKELTEFLQRRVRILESLPNKTSEVNKPSNQPPSRQRNSAVKNSFNTVQAS FT GGRCFACTGNHLLFQCGTFQRMSVTDRDSLLRSNSLCRNCFRPGHQAKDCQ FT SKYSCRNCKGRHHTLVCFKPERDSGARAATKNSNVAKEAQQSTSTSTQSNS FT NPPQVANMATTDSLVAGSVHQCSSKVLLATAVIVVEDNDGNRIPARALLDS FT GSESNFITERLSQRLKVSRSKVDISVLGIGQAVTKVKQRIVVTIRSRLSEF FT SRQMGFLVLPKVTANLPTANINIAGWTIPKGVKLADPSFCVSSDVDLVLGI FT EAFFDFFESSRKISLGERLPSLHDSVFGWIISGGFNEEHQGLHINCNISTT FT DRLEDLLARFWSMEEVDSTNNYSPEETRCETIFSSTVQRTADGRYSVALPK FT DENVISRLGESKEIAFRRFLGTERRLTRDANLREQYVAFMDEYLTLGHMRK FT IEYDPEPTKRCFLPHHPVVKEASTTTKVRVVFDASCKTAAGLSLNDALLVG FT PVIQQDLRSIILRCCTKQILLVADVEKMFRQILVHHEDRPYQSILWRPSPL FT EEVGVYELNTVTYGTKPAPFLATRTLNQLALDEGDRYPLAVKAITEDTYMD FT DVITGCDNLEEAKNLQNQLDGMTKGGGFRLRKWASNCAEVLQGISEDNLAI FT RPVDGINLDPDSAVKALGLTWLPGSDTFRFQFDVPPVDSDEVLTKRKVLSI FT IATLFDPLGFIGATTTAAKVFMQLLWTLEGENGKKLDWDQALPLTVGESWR FT KLYIQLPILNEIRIDRCVIVPNAVSIEIHCFSDASKKAYGACLYIRSQGQA FT GNVKVQLLSSKSKVAPIKSQSIPRLELCGALLAAQLFDKVRQATHLQVRTT FT FWVDSTCALRWIEATPSTWSIFVANRVAKIQAITEGCEWRHVAGSENPADL FT ISRGVGPKEIVNNSFWWHGPSWLTADRSNWPTSDLNTIMEGEEERRRTAVV FT GTASAVADFIDSFISTTNSYTTLVRRAAIWQRLIRLLRIPEEPITGFLRAE FT ELQIAEFTLIRKVQEEVFSEELRALRKKESVSSKSPLRWYSPYICENGVLR FT VGGRLEHSLETYETKHPMILPARHRLTKMIFEYYHLKLLHAGPQLLLATVR FT QRYWPLGGRNVARQVVQQCVMCFRSKPRQVRQFMGELPSSRVTVSRPFSKT FT GVDYFGPVYIRPAPRKTAVKAYGAIFVCMCTKAVHLELVSDLSTERFLQAL FT RRFIARRGRCTDLYSDNGTNFVGARNQLQELFRRWKDADHQQDLARFCTNE FT GINWHFNPPGSPHFGGLWEAAVRSAKHHILRVIGTNSVSHEDMFTLFAQVE FT GCLNSRPLTPMSEDPDDLEPLTPAHFLVGSSLQSVPEPDWRLVPTNRLKQY FT QVVQQRLQHFWDRWKREYICQLQGRSKRWSPPVKFEIGKLVVIQDKNQPPM FT RWRMGRIIDVHPGTDGIIRVVTLKTTSGELKRAVENVCVLPVPANENLVE" XX SQ Sequence 6961 BP; 1852 A; 1604 C; 1668 G; 1837 T; 0 other; taattggtcc ttcgagccgg atacctgagg accgtacctg gccaggaatc aacgcgccaa 60 aaccaggaat ttgcgccaaa agggtttgga aatagcctta catcatccaa caattgttcc 120 attgtgaccc acggaataaa ggagtgtcgc catcattccg cggaataatt agcggagaat 180 cgccatagct gctggtgaca aagcccaaag ggatcctttg tcaaggactt accgcgtcgc 240 ttgggttagc gccatttgga ttccagaaac tgctttcgga cttctggaca ttgttggaca 300 cttcccggaa cagcttttgg tttggactgg acactaaatt cggattggat tcctgtttct 360 ccaggtaatt atatatttag tgtatgtcct accgaagcct atacaatcgt gattctaaca 420 ccacttttca acctctactt ggacttaacg ctcttcgacc acggaatttg gaccattgat 480 gaccaaccgc ttctggggaa gccaatccca atcggaggtt tgcgtaatac tgcactactt 540 ggaaccggcc atcggtatct ctggctgtga gtgaggatcg aatgcattgc atttatgtgc 600 cactccagtc cccattttca caccacctta tcaacggttt gactgcctcc atcatcggta 660 ccaccaaggg ctgccatcgc ccttcaattg actaaccggg ttagcagcaa tgtgctgtcg 720 ttggtttgtt gcgagtattc atcgccacta gtgccccatc actgcgagaa ccattgctgc 780 aaaagatcat cgaatcaaac tagcaccgat tgtggccttt gaagcatcca acccaacacg 840 tccgactaag ggaacccaca ggaagaggag gcgccatcgt atattgccca ggttagtgaa 900 ctagagctag atatatgtct gccgaagctt gggcaacaga gttacacatt ttaattgaat 960 atatatcctc ttcggttgct gtggacgaat tcctaatttg aataccgagt atttcggtgt 1020 ttcatgcctt ttgaaaattt tggactgctg gatttattgc cgggttttga gccgctgctg 1080 gtttctccgg gtgagttgac cgcagtatat gcccggccga agccgaatac ttttgatgct 1140 acatttggat tttccttttt tccccaacca atccctggat atcactttga gctacgacga 1200 ctggtgacgt tgcagttgaa attcgactgc taatcaatta atttcgattc actatcgatt 1260 ggacaatagt gaagctactg tttcctccag gtgagcgcca cagtatatgt ccagtcgaag 1320 ctggaccatt tatgattact acatgctggt ttcctctcct cttcgttcct tccaactgta 1380 atcgagattg tttactcctc tgatgctgac tgactggaat cgctgtgtat ttcaactaga 1440 tttggcgaaa gtttcgttgt gaaggtcagt taatttagaa tagtgtgtgt ccgccgtagc 1500 aaaggactct gcagaatact acatcacctc tctttctcta ccgtacaact attgtcgagt 1560 tgaggaacga tggctccaac caagaagtcg cctgccaaga agacaccaaa tctgaagcag 1620 ctgatgacca aactgaagca gattgaagca gcttttcacg atattcgtcg attttctgaa 1680 aggattcagg aatgtacttc caccacccaa atcctccttc ggttggagaa aattgacgag 1740 ctttgggaaa aatatggcgc aactctcgtg gatattgagt ctcacgagga cttcgacgat 1800 attgaagagg acggtgaccc actagataag cagaggcaag agtttagtga tagttactat 1860 gaagttaagt cggtcttgac agatcaggtt agggaaaggc aagaaccagc tagtttgaac 1920 caatctatcc gcggtgatcc tcaaacgtca gccacatttg atcacgtgcg tcttccgcag 1980 attaagctgc aaacctttaa tggggagata gatgaatggt tgagttttag ggatctgttc 2040 tcttccttga ttcattggaa ggcagatcta cccgatgtag aaaaacttca ttacttgaag 2100 ggatgtctac aaggagaacc aaaaactctg atagactccc tccaaatcac agctgctaat 2160 tatcaggtcg catggggtat gttgaccaag cgttacaaca atagtaagtt attgaagaaa 2220 cggcaaattc aatctcttct cacgttaccc gttatcgcca aggagtctgc ctctgaactg 2280 cacagcttag ttgaaaattt cgaaaggatc gtgcaaactt tggatcagat tatacaaccg 2340 ggtgactata aggacctttt actggtgaat ttgttgacga ctcgattgga tccatccact 2400 cgtcgcagct gggaagagtt ctcttctact aaggaacagg acacgctgaa ggaactcacg 2460 gaatttctcc aacgtcgagt acgcattctg gaatccctac caaacaagac ttcagaggtc 2520 aacaagccgt ccaaccaacc tccatccagg caacggaact ctgcagtgaa gaatagcttc 2580 aacacggttc aagcgtccgg ggggcgctgc tttgcatgta ccggaaatca cctacttttc 2640 cagtgcggca cttttcaacg aatgtcagtg acggataggg attcgttgct acggtcgaat 2700 tcgttgtgca ggaactgctt cagaccagga caccaagcca aggattgtca atctaagtat 2760 tcctgtagga attgcaaagg tcgccatcac acgttagttt gttttaaacc ggaaagggac 2820 tccggtgcaa gggcagcaac caaaaacagc aacgtagcca aggaagctca gcaatccacg 2880 agtacttcca ctcaatcgaa ttcgaatcct ccacaagtag ctaacatggc aactaccgac 2940 agcttggtgg caggctcagt gcaccaatgc tcgtcgaagg ttttattggc aacagcagtt 3000 atcgtggttg aggataatga cggtaaccgc attccagctc gcgctcttct cgattcagga 3060 tcggagagca atttcattac cgaacgtctc agccaacgtt tgaaggtttc tcggagtaag 3120 gtagatattt cggttcttgg cattggtcag gcagtaacca aggtcaaaca aaggatcgtg 3180 gtcacgatcc gttcccggct gtcggagttc tcacgccaga tgggttttct ggtacttccg 3240 aaggtaacag ccaatctccc taccgctaac atcaacattg ctggctggac gattccaaag 3300 ggagtcaaac ttgctgatcc atcgttctgc gtgtcatctg acgttgacct cgtcctgggg 3360 atagaggcgt ttttcgactt tttcgagtct agcaggaaga tttcactggg agaacgtctt 3420 ccatctctac acgactcagt tttcggttgg attattagtg gaggattcaa cgaagaacat 3480 caaggcttgc atatcaactg caacatctca actactgatc gtttggagga tttactagct 3540 cggttctggt ccatggagga ggtggattcc acaaacaact attctcccga ggaaactcgc 3600 tgtgagacta ttttttcgag taccgtacag cgaacggcgg acggtcgcta ttccgtagca 3660 cttcctaagg atgaaaacgt catttcacgt ttgggcgaat ctaaggagat cgcatttaga 3720 cgttttcttg gcaccgagcg gaggctaaca agggacgcta acctccgtga acaatacgtc 3780 gccttcatgg atgaatacct gactctaggt cacatgcgga aaatcgaata tgatccagag 3840 ccaacaaaac gatgctttct tccgcatcat ccagtggtca aggaggcaag caccactacc 3900 aaggtccgcg tagtgtttga cgcatcctgc aaaacggctg cgggattgtc gttgaacgat 3960 gcgttgctgg tgggacctgt aattcaacaa gatctccgtt ccatcatcct tcgctgttgt 4020 acaaagcaga ttcttctggt agctgatgtg gagaaaatgt ttcggcagat tttggtgcat 4080 cacgaagata gaccatatca atcgatcctg tggcgtccgt caccattgga agaggttggt 4140 gtgtacgagt tgaacacggt tacatatgga acgaaaccag ctccgttctt ggcgactcgt 4200 acattaaatc aactggcctt ggatgaagga gatcgatacc cgctagcagt gaaggcaatt 4260 acagaagaca cctacatgga tgatgtcata actggctgcg ataacttgga ggaagccaag 4320 aatttgcaga atcaactgga tggaatgaca aagggtggcg gttttcggct caggaaatgg 4380 gcatccaatt gtgcagaggt cctgcaaggt atttccgagg ataatttagc gattcgtcca 4440 gttgatggta tcaatctcga tcccgattcg gcagtgaagg ctttaggatt aacctggttg 4500 ccagggagtg acaccttccg cttccaattc gatgtccccc ctgtagattc cgacgaagtt 4560 cttaccaaac gaaaggtcct gtcgattatc gctactctgt ttgatccgct gggtttcatc 4620 ggcgctacaa caacagctgc aaaggtgttt atgcagcttt tgtggacttt ggaaggtgaa 4680 aacggcaaaa aattggactg ggatcaagcg ctgcctttga cggtgggtga gtcctggaga 4740 aaactgtata tccagctacc catccttaac gaaatacgca tagatcgatg tgtaatcgtt 4800 cctaatgcgg tgtcaataga gatccattgt ttttcggatg cgtcaaagaa ggcatatggt 4860 gcatgcctct acattcgtag tcaaggccaa gctggaaatg taaaggtaca actgctgtct 4920 tcaaaatcca aggtcgctcc catcaagtcc cagtctatac cgaggttgga gctttgcggg 4980 gcattgcttg ccgctcaact tttcgataag gttcgtcagg ctacgcatct tcaagttcgc 5040 actactttct gggtggattc tacgtgtgct ctgcgatgga tcgaagctac tccgtcgacg 5100 tggtcgatat tcgtcgcaaa cagggtggct aaaatccaag cgatcacgga aggttgcgaa 5160 tggagacacg ttgcaggatc tgaaaatcca gccgatctta tatctagagg agttggaccg 5220 aaggagattg tcaacaatag cttctggtgg cacggtccca gctggttaac agcagatcga 5280 agtaattggc caacttcaga tttaaacaca attatggaag gcgaggaaga aagaaggcga 5340 actgcagttg tcggaacagc atcggcagtt gcagatttca ttgatagctt tatctccaca 5400 actaattcct acacaacatt ggtccgtcgt gctgccattt ggcagcgatt aatacgattg 5460 cttcggatac cggaggaacc tataacagga tttctaagag cagaagagtt acagatagca 5520 gaatttacct tgatacgaaa agtgcaggaa gaagtattct ccgaggaact tcgagctctt 5580 cgaaagaagg aatcagtcag tagcaaatcg ccactgcgtt ggtactctcc atatatttgc 5640 gagaatggag ttttacgagt cggcggtcga ttggaacatt cgttggagac atacgagact 5700 aaacatccca tgattcttcc ggcacgacac cgcctaacaa aaatgatttt cgagtattac 5760 catctcaaat tgctccacgc tggtccacaa cttctgcttg caacagtccg gcagcgctat 5820 tggcctctag gaggaaggaa cgttgctcga caggtcgttc aacaatgcgt tatgtgtttc 5880 aggtcgaaac caagacaagt acgacaattc atgggtgagt taccgtcatc aagggttaca 5940 gtatctcgtc ccttttccaa aactggagtg gactactttg gcccagtcta cattcgtcca 6000 gcaccgcgta aaactgcagt taaggcgtac ggagccattt tcgtgtgcat gtgtacaaag 6060 gcggtgcatc tggaactggt gtctgacttg tctacagaga ggttccttca agcactccga 6120 cgttttatcg cacgtagagg aaggtgtacc gatttgtact cggacaacgg tacaaatttc 6180 gttggagccc gaaaccagct acaagaatta tttcgtcgat ggaaggacgc cgatcaccaa 6240 caggatctag ctagattctg taccaatgaa gggattaatt ggcactttaa ccctccaggt 6300 tccccacatt ttggaggatt gtgggaagct gcggttcgat cggcaaaaca ccacattctt 6360 cgagttatcg gtacaaattc tgtttcgcat gaagatatgt ttacgctgtt tgctcaagta 6420 gagggctgct tgaattcaag gccattgact ccaatgtctg aggatccaga tgatctagag 6480 cctcttactc cagcacattt tttagttgga agttcccttc agtcggtccc tgaacccgat 6540 tggcgcttag tgcctacaaa ccgcttgaag caatatcagg tggttcagca aaggcttcag 6600 catttctggg atagatggaa gcgagaatat atctgccagc ttcaaggtag atctaagcgt 6660 tggagtccac cggttaagtt tgaaattgga aaactggttg ttattcagga caaaaaccaa 6720 ccaccgatgc gttggcgtat gggccgtatt attgatgtac accccggcac tgatggaatc 6780 attagagttg taactctgaa aacaacttct ggggaattaa aacgagcagt cgaaaatgtt 6840 tgcgtgctac cagttcctgc caatgaaaat ttggtcgagt aacccacccc gtcccatccc 6900 taccgaagag gattttatct attttcagat gcagaaatca agatttctgg gtgggtgaga 6960 a 6961 // ID CR1-75_AAe repbase; DNA; INV; 5464 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-75_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5464 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1163-1163 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 355..1431 FT /product="CR1-75_AAe_1p" FT /translation="MEKCCVCSNNLDSGRTIICSGSCGRFFHFDCVGLNKS FT QFTAWTSKLGFFWFCDSCRRSFDAADYDREKTIMKALRELLIRIDSMDTRI FT GSFGENLRRINKTLFDVKQNRRSSTSSQHQASFLRSIDEITLDDVIDDPIN FT RSRSCEDTSFFEVLDEVNSSIALPSDKFVVGANKRVQIIANPCGSGNNRSS FT VDVSTPAAPARQSNSQNKRVPASRQLTELDSNHADFASDRNLVANTERTHE FT SSSKTRPQSMSLKVANSQTPHDQESFYVTPFAPDQNEEEIKQYVCEISNVN FT SALVHVVKLVPRGKNANDLSFVSFKVTVCKTVSTVVGDPWYWPDGITVRTF FT EPNVKNSTATRLPINT" FT CDS 1347..5243 FT /product="CR1-75_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MVLARWNNRSHIRTQCKKQHSYSSPNQHIIEPPALSV FT DRLSDEFASLVVSSEFQACHFAALPVSSGKTNSGICPAEAGKDAAPILNRE FT PFFQSLNRSAANIRCCEDYSIQRSSELVSFDNITLPTDRSNIELHSEYRAF FT SVDCQTRHFAALPVSSGKFNSGVCLAEAGRNTAPNFNRDPPFQPIGRSAIH FT VNRTDDHSRYPVETPTYHSAALPVLSGKFVPSGICPAEAGKDATPFLNRVP FT LFQPSDHRYIEPVHNFVPSEDPRICHFAALPVLSGKFDSGICLAEAGRDAT FT QNLNRGPNFQLNTCSTAHANLPXTVLSGPSINDYDITTPNSRETANRGLLV FT YYQNIGGINTRLSDYRLACSDSCYDAYAFTETWLNSDTISSQIFDPSYSVF FT RLDRNSQNSRKNSGGGVLLAVKAKLTARQLSVPNSESVEQIWVAITFQTHT FT LFICVIYIPPDRSQDMEVIDQHIGSLKWISDKMKIKDSLMILGDFNFPGIQ FT WKLSTSNYLYPDVTRSSISLSGNKLIDNYSLERVSQLNFVVNDNGRLLDLC FT FGSIDGGANFSVMKAPSDLVKPTIHHPSLLIEIVSVSPCIFIDPVETLFYD FT YKNGDFCGMNNFFSNINWLDYINQNLDSSLSAFSSIVLYAIDQFIPTRSRK FT APINPPWSNNHLKQLKRLKRSTLRKYTRFKTQRCKRNYHAANCRYKRLNKK FT LFYSHQMKVQQNLRQNPKSFWNYVNEQRKESGLPTVMFKGNIERSTTDGIC FT DLFLNQFSSVFTTEILDESQVFLAAENVPVHSPVGDHPFIEAEAINKICSS FT LKASTSCGPDGIPAMVLKKCSANLSMPLSCLFNLSLQVGKFPDGWKKSYVF FT PVFKKGNKRDVSNYRGIAALCAVSKLFEKVVYDFLLHNCHNFISEHQHGFM FT PKRSTNTNLLTYTTFIAQALQEGKQVDSIYTDFSAAFDKINHQITIAKFER FT LGFAGSFLCWLKSYISGREMAIKVGDSISPYFHVTSGVPQGSHLGPLIFLV FT YLNDVHFTLRSLKLSFADDFKLFYVVNDVCDAHFLQSQLDVFTVWCEKNRM FT DLNAAKCSVISFSRKRSTLLFNYKVGTTVLKREYVVKDLGVLMDSRLTFKE FT HVAFVTSKASKILGFIFRVAKHFKNIQCLKSLYCSLVRSVLEYSSVVWAPY FT YQNSIMRIESIQHKFVRFSLRHLPWNNPLNLPSYEQRCNLIGLELLSVRRE FT ISKALFISDLLQTQIDCPHLLSLLNLNIPYRPLRSSAFLYIRGARTNYGHH FT EPFTSMCRSFNRCFHDFDFHLSRRILRNRFSSILSSPT" XX SQ Sequence 5464 BP; 1507 A; 1227 C; 1063 G; 1666 T; 1 other; tggcaacact ggtttgcacc gtttttgttt gtcttctagc tctcgatatt ttctatgttt 60 acaccgtgta aattgcatcg ttaattgccg ttgcccattg attacgtcgg attgcagaag 120 tttgtttgtg tattcattgt gtgtgctcgc gttccggtgc cgcaaagtga caatattgtg 180 atagtgtgat tgcagttgtt attattcaca tctactacgc ccccattcgc tcgatttttc 240 ttgaagcgac atctggtaga aaaaatcaga aacttatttg gatacaagcg acacgcaagt 300 tgcatgcaag tttccaacca acgtttttac ccagtgggac attagacgac caagatggaa 360 aaatgttgtg tctgctcaaa caacttggac tctggaagga cgataatatg cagtggatca 420 tgtggtcggt tttttcattt cgattgcgtg ggcttgaaca aatcgcaatt cacggcgtgg 480 acttcgaaac ttggattctt ttggttctgt gactcttgcc gcagaagttt tgatgcggct 540 gattatgaca gagagaaaac cataatgaag gctttacgcg aattgttaat tcgtatcgat 600 tccatggaca ctcgaattgg aagcttcggc gaaaatctcc gtagaatcaa caaaacgctg 660 tttgacgtca agcaaaatcg gagatcatca accagttcac aacatcaagc cagtttttta 720 cggagtattg atgaaatcac tttggatgat gtgatagacg atccaattaa tcgatcgaga 780 tcatgcgaag atacctcttt tttcgaggtt ttggacgagg tcaacagctc tattgctctt 840 ccttcagata agttcgtcgt tggagcaaat aaaagagtgc agataattgc aaatccgtgt 900 ggatccggca ataataggag ttcagttgat gtttccactc ctgctgctcc tgcaaggcaa 960 tcaaattctc aaaacaaacg agtacctgct tcacgtcagc tcactgaact tgatagtaat 1020 catgctgact tcgccagtga taggaatcta gtcgcgaata ctgagcgcac gcatgaatct 1080 tcgtctaaaa ctagaccaca gtccatgtct ctgaaggttg caaactcgca gacacctcac 1140 gatcaggaat cgttctacgt tactccattc gcaccagacc aaaacgaaga agaaattaag 1200 cagtatgtct gtgaaatttc taatgtcaac tctgcattag ttcatgtggt taaattagta 1260 ccacgcggaa agaatgccaa cgacctttcc ttcgtatcat tcaaagtgac cgtctgtaaa 1320 acggtttcaa cagtggtcgg tgatccatgg tactggccag atggaataac cgttcgcaca 1380 ttcgaaccca atgtaaaaaa cagcacagct actcgtctcc caatcaacac ataatcgagc 1440 cacctgcttt atcagtcgat cgtttgagcg atgaatttgc gtcgttagtt gtttccagcg 1500 aatttcaagc atgccacttt gcagcacttc ctgtctcgtc aggtaagact aacagtggta 1560 tatgtcctgc cgaagcaggg aaggatgctg caccaatcct gaaccgagaa ccattttttc 1620 agtccttaaa ccgctctgcc gccaatatcc gttgttgcga agactactcc atccagcgct 1680 catccgagct tgtgtcattt gataacatca cccttcctac tgatcgttca aacatcgagt 1740 tgcattctga ataccgtgca ttttccgttg actgccaaac acgccacttt gctgcgcttc 1800 ctgtttcgtc aggtaagttt aacagtggtg tatgtctcgc cgaagcaggg aggaatactg 1860 caccaaattt caatcgagat ccacctttcc agccaattgg tcgttctgcc attcacgtaa 1920 atcgaaccga tgaccattca agatatcctg ttgaaactcc cacgtaccac tctgctgcac 1980 ttcctgtttt atcaggtaag tttgtgccca gtggtatatg tcctgccgaa gcagggaagg 2040 atgctacccc ttttctgaat cgagtaccgc ttttccagcc ttctgatcat cgctacattg 2100 aacctgttca taactttgta ccgtctgagg acccaagaat atgccacttt gcagcacttc 2160 ctgtcttgtc aggtaagttc gacagtggta tatgtcttgc cgaagccggc agggatgcta 2220 cacaaaattt gaatagaggt ccaaatttcc agctcaacac atgctctact gcccacgcaa 2280 atctacccak cactgtgctt tccggaccat cgataaatga ttatgacatc acgacaccga 2340 attccagaga aactgctaac cgtggcttgt tggtatatta ccaaaatatc ggcggaatca 2400 acactcgttt atccgactat cgtctcgcct gctccgattc ctgttacgat gcgtatgcct 2460 tcacagaaac atggttaaac agtgatacca tctctagtca gatctttgat ccatcttata 2520 gcgtttttcg tttagatcgt aactcccaga acagtaggaa aaactccggc ggtggtgtac 2580 tattggcagt caaagcaaaa cttactgctc gacagttatc ggttcctaat tctgaatctg 2640 ttgagcaaat ttgggtagca ataactttcc aaactcatac cttgttcatc tgtgtcattt 2700 acattcctcc cgatcgctcg caggatatgg aagtaattga tcaacacatc ggctcgctca 2760 agtggatctc agataagatg aaaattaagg atagtttgat gatacttggt gattttaatt 2820 ttcctggcat ccaatggaaa ctcagtacgt caaattatct atatccagac gtcactcgtt 2880 cttcaatcag cttatccggt aacaagctaa tcgataacta tagcttagaa cgtgtttctc 2940 agttgaactt cgttgttaat gataatggtc ggctattgga tctttgtttt ggaagcatcg 3000 atggaggcgc aaatttttca gtcatgaaag ccccttctga tcttgtgaaa ccaacgattc 3060 atcatccctc tctgctaatc gaaatagtta gcgtctcacc atgcatattc atagatccag 3120 ttgaaacgct tttctatgac tacaaaaatg gtgacttttg cggtatgaac aactttttct 3180 ccaacatcaa ctggcttgat tacatcaatc aaaacctcga ttcatctttg agtgcgttct 3240 ctagtattgt attgtatgct atcgaccaat tcattccgac gcgttctcgt aaagccccaa 3300 tcaacccacc gtggagtaac aatcatctaa aacagctgaa aaggttgaaa agatcaactc 3360 tgcgaaagta tacgagattc aaaacccaac gttgtaaaag gaactatcac gctgcaaatt 3420 gtcgctacaa acgcttgaat aagaagctgt tctattcgca ccaaatgaaa gttcaacaaa 3480 atctacgtca aaatcctaaa tcgttctgga actatgtcaa cgagcaacgg aaggaatccg 3540 gtctacccac ggtcatgttt aagggtaata ttgaacgctc cactaccgat ggtatttgtg 3600 atcttttctt aaaccagttt tcaagcgttt tcaccacgga aattttggac gaatcacaag 3660 ttttcttagc cgctgaaaat gtacccgttc attcaccagt aggtgatcac ccattcatcg 3720 aagccgaagc aattaacaaa atttgctcct ctttgaaagc atcaactagc tgcggtccag 3780 acggtattcc tgcgatggtt ctgaaaaaat gctctgccaa tctttcgatg ccattatcgt 3840 gtctcttcaa cttatcactt caggttggta aattcccaga tggttggaag aagtcgtatg 3900 tcttcccggt ttttaaaaaa ggaaacaaac gtgatgtcag caactatcgt ggtattgcag 3960 ctctgtgtgc cgtttcgaaa ttgttcgaga aagtggttta cgacttttta ctccacaatt 4020 gtcataattt tatttcggag catcaacatg ggttcatgcc gaaacgctcg acgaacacca 4080 acttgcttac ctacacgact ttcattgcac aagcacttca agaaggcaaa caagtagact 4140 cgatatatac ggatttttcg gcggccttcg ataaaatcaa tcaccagata accatcgcca 4200 aatttgaacg tcttggattt gctggttctt ttctttgctg gttgaaatct tatatcagcg 4260 gccgtgaaat ggctattaaa gtcggtgatt ccatttcacc ctacttccat gtaacttcgg 4320 gggtgccaca aggtagccac ttagggccac tgatattttt agtgtactta aacgatgtac 4380 attttacact caggtcattg aagctttctt tcgccgatga tttcaaattg ttttacgttg 4440 tgaatgatgt ctgcgatgct cacttcctgc agtcccaact tgatgtgttc actgtttggt 4500 gtgaaaagaa tcgcatggat ctcaacgccg ctaaatgctc tgtgatatcg ttctctcgga 4560 aacgctctac attattattt aactataaag ttggtactac tgttctcaaa agagaatatg 4620 tggttaagga tttaggtgtg ttaatggact caagattaac tttcaaggag catgtcgcct 4680 tcgttacctc caaagcctcg aaaatacttg gtttcatatt ccgtgtagca aaacatttca 4740 aaaacattca atgtttaaaa tctttatact gttcattagt gaggtcagtt ttagaatact 4800 catcagttgt ttgggctcct tattatcaga atagtatcat gcgtatcgaa tcaatccaac 4860 acaagtttgt acggtttagc ttgcgtcatc tgccatggaa caatccactg aatttgccca 4920 gttacgaaca gcgttgcaat cttattggtc tagagttact gagcgtacgc cgtgaaatct 4980 caaaagcact attcatttct gatcttttac aaacacaaat tgattgtcca catttgttgt 5040 cgctccttaa tcttaatata ccttaccgcc ctcttcgctc tagtgcattt ctttatattc 5100 gtggtgctcg cactaattat gggcaccacg aacctttcac cagcatgtgt cgttctttta 5160 atcgttgttt tcatgatttt gattttcatc tctctcgccg cattcttcgt aatagattca 5220 gtagtattct ctccagtcca acttaacata ttgcatcaga tgaattttgt agccgtacat 5280 tagttttact atagaaataa gaataattta tattactagt attaagcaaa aattgtatca 5340 tttggagtgt aactttctgt tggtactaaa agatgaggag gttttgcgcc catttgagga 5400 agagttatct cagctcaact caagtgggct tttccctgct ccaaaaaaga aataaagaaa 5460 taaa 5464 // ID BEL-62_AA-I repbase; DNA; INV; 5784 BP. XX AC supercont1.350; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-62_AA_; KW BEL-62_AA-LTR; BEL-62_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5784 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.350; Positions 66610 72393. XX CC 'GCATC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 28..5751 FT /product="BEL-62_AA-I_1p" FT /translation="MQTGNTSGYNCQSCNEHDGVDTMVACDACSSWHHFKC FT VGVDHTVKNRRWVCHKCEQGTFSGLLNPPQTKDKTAKSGGSKTSKSRSKKP FT EKTVGSKMSITSSARAAALEAQMRLLEEEELLKEKELKEQEELQRLEFAEE FT QRKLEIKKQLMEEESKLREAELSKQKDLQEKMMLLRRESMEKKKELMRQQA FT ELSESSSSSHVSKSERVRDWITSQQHTKGDKVDITQQRISWPTSHIANSAS FT PQQREGSLVTPRQQFSNLSLHDDDRCAVNPELPTAYMQIAARQVTGKDLPA FT FSGNPEDWPMFIRTYEETTAACGFSDVENLVRLQKCLRGSALETVRSRLMM FT PAGVPHVIKTLQMRFGRPELIIRSLLERIRRVAAPKPERLDTLIDFGLAVE FT NLVVHLQAAKQENHLTNPVLLQELVAKLPAQLRLDWARYKLLRQDATLATF FT GEFMNELIQAASEVSFDLPISLSSKAEKPKDREVFIHAHDAPDMESKNTGA FT IRKTPKPCIVCSAIGHRVAECEEFKAKNVEERMQIVRQHNLCRTCLNFHQK FT WPCRTWNGCNIHGCRDKHHFLLHPPTTSSPAHFSTNHSSVIREGCGRYPYF FT RILPVLVSMGNKRQTIYAFIDEGSSSTLLDRSVAEQLGLDGPTEPLTLQWT FT ANVARQESRSKRVNFQISGTGKAETFQITEAHTVERLLLPKQSLPYETLAN FT QYPHLHGLPIAEYDSAEPKLLIGLDNLSLCVPLKIREGQNKDPIAAKCRLG FT WTIYGYNARTSGPTVTVNFHVPAAQDADHELNEQLKEFISLDQSAANPTYE FT MPESDADKRARKLLEETTRRVSGRFETGLLWKSDHIEFPNSFPMAVRRLEL FT LEKKLDKDPTLRCRVHQKMEEYLAKGYCHRASREELKASNSKRVWYLPLCV FT VVNPKKPNKVRVVWDAAAKVNGVSFNSALLKGPDMLTCLVSVLYHFREHRI FT AVTGDIEEMFLRLLIRPQDSQYQRFLWRANAGDAPSVYIIDVATFGSTCSP FT SSAQFVKNANAREYMEEFPRASTAIIKYHYVDDYLDSFETVQEAIEVVKQV FT KMVHSQGGFNLRNFLSNSNEVLAAISDTTIEDTKQLNLVRAEKVESVLGMK FT WNPSKDVFVYTVSLRDDLVEVIDSTHIPTKREMLKLVMSLFDPLGFAAFYL FT IHGRVLIQDAWATGIGWDAPVNSDLSERWLQWISLLPLLNDLQIPRCYFRG FT KIEETKQLHIFVDASDAAYACVAYMRASGSEGIELALVGAKSKVAPLKVLS FT VPRLELMAAVIGARMAESIVSSHSFNIADIYFWSDSSTVLAWINSDHRRYT FT KFVGVRIGEILALTKINQWRWLPTKQNPADEATKWGKGPSFNISGRWFCGP FT NFLYHPESEWPRNQNLTSTEEESVSVHNVHYDTTSTSMIEISRFQKWERLL FT RTQAYVIRFINNIRCRCKGQTADVGALKQEEIRDAERHLWKQAQKEYYGQE FT QKLLLETQGSPTARHNVVQKSSTIYKLWPYIDQHGVIRMRGRIGAAWYATD FT EAKYPVILPKTHLITSLLVDWYHRRFRHANHETVVNEIRQRYEISNLRTLV FT KQVARRCAKCHIANAVPRPPPMAPLPEQRITPFVRPFTFVGLDYFGPLTVK FT VGRSEVKRWVALFTCLTVRAIHLEVVHSLSSESCVMAIRRFIARRGAPAEI FT FSDNGTNFVGANKQLRQEIAARDQVLASTFTNTNTRWNFNPPGAPHMGGAW FT ERLVRSVKSAIGTIVDAPRRPNEEALQTILFDAEAMINSRPLTYVPLETAD FT EESLTPNHFLLGNSSGVKQPPSELTNLRPNLRSSWTLVQNITDAFWKRWIK FT EYLPVITRRCRWFENVRDFNEGDLVLIVDTTMRNQYIRGRVEKVFFGRDGR FT VRQALVRTSTGLYRRPAVKLALLDVGLPGKGGQDS" XX SQ Sequence 5784 BP; 1693 A; 1310 C; 1409 G; 1372 T; 0 other; aattctttaa gaattttgtc gctaaagatg cagacgggaa atacctccgg ctacaactgt 60 cagtcgtgta acgagcatga cggcgtagat acgatggtcg cctgcgacgc ttgtagcagc 120 tggcaccact tcaagtgtgt aggtgtcgat cataccgtca agaaccgccg ttgggtgtgc 180 cataaatgtg aacagggaac cttttcgggt ctactcaacc ctccgcaaac gaaggataag 240 acagcgaaaa gcggcggcag taagacgtca aaatcacgga gcaagaagcc agagaagacc 300 gtcggatcta aaatgagtat aacgtccagt gcccgcgctg ctgctttaga agctcaaatg 360 cgcctactcg aagaagaaga actgctaaaa gaaaaggaat tgaaggagca agaggagcta 420 caacgcctgg agtttgctga agaacaacgg aagcttgaaa tcaagaaaca actaatggaa 480 gaagaatcca agctcaggga agcggagctg tcgaaacaaa aggatttgca ggagaaaatg 540 atgctattgc gacgggaatc catggaaaaa aagaaggaat tgatgcgaca acaggcagag 600 cttagcgaat cgtcttcatc gtcgcacgtt tcgaagtcag agagggtgag agattggatc 660 acgtcacaac agcacaccaa aggagacaag gttgacatca cacaacaacg gatctcttgg 720 ccgacttctc atatagcaaa ctcagcatca ccacaacagc gagagggatc cttggtaaca 780 cctagacaac aattttcgaa tctttcgctg catgacgatg atcgctgcgc agtgaatccg 840 gaattaccaa cggcctacat gcaaattgca gccagacaag taactggcaa ggatcttccg 900 gcctttagcg gcaatccgga agactggccg atgttcattc ggacctatga agaaacaacg 960 gcagcgtgcg gattctcgga tgtagagaat ttggtacgac ttcagaagtg cctacgtggt 1020 agtgcgctgg aaacggtccg cagtcggctt atgatgcccg caggagtgcc ccatgtaatc 1080 aaaacccttc aaatgcgttt tggtagacca gagcttatca tccgttcgct tttggaaaga 1140 attcgtcgag ttgctgcccc taaacctgag cgacttgata ctctcatcga ttttgggtta 1200 gccgtggaga atttggtggt tcatttgcaa gccgcaaaac aagaaaacca cctgaccaat 1260 cctgtcttgc tacaagaact tgtagcgaag cttccagcgc aacttagatt ggattgggcc 1320 agatacaagc ttcttcgcca agatgccaca cttgcgacct ttggtgaatt tatgaacgag 1380 ttgattcaag ccgccagcga agtgtcgttt gacttaccta ttagcctgtc ctcgaaagcc 1440 gaaaaaccta aggacagaga ggttttcata cacgctcatg acgccccgga tatggaatca 1500 aaaaacaccg gtgcgataag aaagacacct aagccgtgta tcgtttgcag cgccataggg 1560 catcgagttg cggagtgcga ggaattcaaa gccaaaaacg tcgaggaacg aatgcaaata 1620 gtgcggcagc acaatttgtg tcgcacgtgt ttgaatttcc atcaaaagtg gccctgtcga 1680 acctggaatg gctgtaacat ccatggttgt cgcgataaac accactttct tctgcatcct 1740 ccaacgacat caagtcctgc ccatttctcc accaaccata gttctgtaat cagggagggc 1800 tgtggtcgct atccatattt tcgcatactt cccgtactcg tatcaatggg aaataagcgc 1860 caaacaatat atgcgttcat tgatgaaggc tcctcttcga cgcttttgga ccgatcagtg 1920 gcagaacaat tgggattaga cggacctaca gagcctctga cgctgcaatg gaccgcaaat 1980 gttgcgcgcc aagaatcccg gtctaaaaga gtaaattttc aaatctcagg cactggaaag 2040 gctgaaacat tccaaatcac agaagcacat accgttgaaa ggcttttgtt gccgaaacaa 2100 tcgcttccat atgaaacgtt agcgaaccaa tatccgcatt tacatgggct acccatcgcc 2160 gaatacgaca gtgccgagcc aaaactatta attggactcg acaatctgag cctgtgcgtc 2220 ccactgaaaa tacgggaagg gcaaaacaaa gaccctatcg ccgcgaaatg tagattggga 2280 tggacaattt acgggtataa cgcacgtaca tcagggccaa cagtgaccgt taattttcat 2340 gtacctgccg cccaagacgc ggatcatgag ttgaatgaac aacttaaaga gttcatctca 2400 ctggatcaat cggccgccaa tccaacgtac gagatgcctg aatccgacgc tgataagaga 2460 gctagaaagt tgctagagga aaccacgcga agagtgtccg gcagatttga aaccggtttg 2520 ctgtggaaat cggaccacat cgaatttccc aacagctttc ccatggcggt gagacgactt 2580 gagttgttgg agaaaaaatt ggataaggat ccaactcttc gttgtcgagt acatcagaaa 2640 atggaagaat atcttgctaa gggatattgt caccgagcta gtcgcgagga gctgaaggct 2700 tccaacagca aaagagtttg gtatcttccg ttgtgcgtag tagtcaaccc gaaaaaacca 2760 aacaaggtac gggttgtgtg ggatgcggca gcgaaggtta atggcgtatc ctttaattca 2820 gctttactga aaggaccaga tatgctgaca tgccttgttt ctgttttgta tcactttcgt 2880 gagcatagaa tcgccgtaac cggtgacata gaggaaatgt ttcttcggct tcttattcga 2940 cctcaagaca gccaatatca gcgattcctg tggagggcaa atgctggaga cgctccttca 3000 gtgtatataa tcgatgttgc gacatttggg tcaacttgtt ccccaagctc ggcccaattt 3060 gtgaagaacg caaatgccag agaatacatg gaagaattcc cccgagcatc cacagctata 3120 attaaatacc actacgtgga tgattacctc gatagctttg aaacggtgca agaagctata 3180 gaggtagtta aacaagtgaa gatggttcat tcccaaggag gattcaatct gcgcaacttt 3240 ttatccaact ccaatgaagt gttggcggcg atcagcgaca ctactattga agataccaag 3300 cagctgaacc tggtgcgagc tgaaaaggtg gaatctgtgt taggcatgaa atggaatcca 3360 tcaaaagacg tatttgttta cacagtatcc cttcgagatg atctcgtcga agttatcgat 3420 tcgacccaca ttccaaccaa gcgagaaatg ctcaaactcg ttatgagcct ttttgatcct 3480 ttggggtttg cagcctttta cttgatccat ggaagagttc tgattcaaga tgcctgggcc 3540 actggaattg gttgggatgc accggtcaat agtgacttat ccgaaagatg gttgcaatgg 3600 ataagtctgt tacctttgct aaacgactta cagatccctc gatgctattt tcggggtaaa 3660 attgaagaga cgaaacaact ccatatcttc gtcgacgcca gcgatgcagc atacgcttgt 3720 gtggcttata tgcgcgccag cggatctgaa ggaatagaac tagcgcttgt gggtgctaag 3780 agcaaggtgg caccactcaa agttttatct gtccctcgtt tggaactgat ggcagcggta 3840 ataggtgcac gcatggccga atccatcgtt agctcccatt cattcaacat agctgacata 3900 tacttctggt ctgattcttc tacggttttg gcgtggatca actcggatca cagacggtac 3960 actaagtttg taggagtgag aattggagaa atccttgctt tgacgaagat taaccaatgg 4020 agatggctgc ccacaaaaca aaatccagca gatgaggcga ctaagtgggg aaaggggcca 4080 agcttcaata tcagtggccg ttggttttgt ggacccaact ttctatatca tcctgaatct 4140 gagtggccac gaaatcaaaa cttaacttcc accgaagaag aatccgttag tgttcataat 4200 gttcattatg atactacatc cacgtcaatg atagaaattt cccgtttcca aaaatgggaa 4260 cgcctactga ggacacaagc gtacgtgata agattcatta ataacatcag gtgccgttgt 4320 aaaggccaaa ctgcagatgt cggggcgctg aaacaagagg agataaggga tgcagagcgt 4380 catctttgga aacaagctca aaaggaatat tacggccagg aacagaaatt gttgctggaa 4440 acacaaggaa gcccgactgc acgccacaat gtagtgcaaa agtcgagcac catatacaag 4500 ttgtggccat acatcgacca gcatggagta atcaggatgc gtggaagaat tggagccgca 4560 tggtatgcta ctgacgaagc aaaatacccc gtcatattgc ccaaaacgca tttaattaca 4620 tcacttctgg tggattggta tcatcgacgt tttcgccacg caaatcacga aacggtggtg 4680 aatgaaatta gacaacgcta cgaaatttca aatcttcgaa cgctggtaaa gcaagtagcg 4740 agaagatgtg ccaagtgtca tatagcaaat gcagttcctc gcccgcctcc aatggctcca 4800 ctccccgaac aacgtataac gccttttgta agacccttta cgtttgtcgg gttggattat 4860 ttcggccctt taacggtgaa agtcggacgc tccgaagtaa aacgctgggt agcacttttt 4920 acgtgcctta cggtgcgtgc tatacaccta gaagtggtac atagcctatc cagtgaatcg 4980 tgcgtgatgg ctattcgacg cttcattgca cgacgcggtg caccggccga gatctttagt 5040 gataacggaa ccaattttgt gggcgcaaat aagcaactgc ggcaggaaat tgcggctaga 5100 gatcaagtgc tagccagtac attcactaac accaataccc gatggaattt caatccaccg 5160 ggtgctccac atatgggcgg ggcatgggag cgtttagtgc gctccgttaa aagtgctatt 5220 ggaacaatcg tcgatgctcc acgacgcccc aatgaagaag cacttcaaac aatcttattt 5280 gatgcagaag cgatgattaa ttcgcgtccc cttacttatg tgccactaga gactgcagat 5340 gaagagtcct taaccccaaa tcattttttg ttggggaatt cctctggagt gaagcaacca 5400 ccctcggaac tgacaaacct tagacctaat ttgagaagca gttggacatt agttcaaaac 5460 atcactgatg ccttttggaa gcgatggatc aaagaatatc ttccggtcat tacccggcgg 5520 tgtcgatggt tcgagaatgt ccgagacttc aacgaaggag acttggtatt aattgttgat 5580 actactatga gaaaccaata cataagagga cgtgtagaaa aggtattctt tggacgagat 5640 ggtcgtgtgc gtcaagctct tgtacgcacc tctaccggat tatacagaag accagcagtt 5700 aagcttgcat tactcgacgt tgggctacct ggtaaaggtg gtcaggactc ctagaaccgt 5760 caccatcctt tacgggtggg ggga 5784 // ID BEL-1_CQ-LTR repbase; DNA; INV; 620 BP. XX AC AAWU01033868; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_CQ_; KW BEL-1_CQ-I; BEL-1_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-620 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 156-156 (2011). XX DR Genome; AAWU01033868; Positions 30010 30629. XX SQ Sequence 620 BP; 151 A; 157 C; 152 G; 160 T; 0 other; tgttggagtg gtcacgatta acaaccctgc atctactgtc gggtcaccac tgtcgggtca 60 actgtgacca tccaacgcga gtgaccactc ttatcacggc tgcacgcgac ctacagcagc 120 tgatcgcggt gatcctctcg cgagatggac gatcgtgatc gctgatagga ccaacaccaa 180 cagcaggatg cacttttgcg gggtcgatga ccgccgggcc gggttgatga cctgacctgc 240 ctcaacgaac gacgggagca gctgacctgc tgcgattttt agtttttaag ttttacaacc 300 tctcgaaccg gcagatcgaa acggtcgatt ctgcgcctat caggcaattt aaatgtgtta 360 ggcttaatat taataaatgt agtccttttt tttagttcgt gtatttaaag taaaaattgt 420 gtttaaatta aagaattgtg ttttactgtg atttcgtgtt gttttaattg tgaaacgcgt 480 gaattccgga agccacccta cgcacggact acccacggcc aacgaaccag gaccggacca 540 gccagcccaa tccgatccgg tcgatgctgg gagccaccag agctgtcgaa ggctaaggtt 600 tttattccgg ctccccaaca 620 // ID Shinagawa-10_AAe repbase; DNA; INV; 2154 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-10_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2154 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 847-847 (2011). XX DR [2] (Consensus) XX CC ~90% identical to consensus. TIRs are 122 bp long and composed CC by degenerate repeats. Related non-autonomous elements, named CC Shinagawa, are found in Aedes aegypti and Culex CC quinquefasciatus. XX SQ Sequence 2154 BP; 648 A; 426 C; 382 G; 696 T; 2 other; gggctctacc ccatttggca taatgccatt tggcataatg ccgtttggca taatgccatt 60 tggcataatg gccatttggc ataatggtca tttggcataa cggccatttg gcataatttg 120 aagaaaacat ttcctcaaat tattgttctc ttttggcctg ataatgatct gmctatatga 180 aatgtagggt aggtatacca gttatggaca taatggttcc ctatttcgcc atacgtgata 240 atttgattac tttcaaatgt ttaatcattc tgtgtgttct agtaatttga tatcgaatga 300 atctcgatgt taaaacattc aaaattattc aacatgtaaa agttttccag aaaacatata 360 tggccaaata gggaaccact atagccataa ctggtacact ttccctatgt catcttcgat 420 tctagacaat ataatagaat ggttttcaac ttccgaccaa cgccctccaa gtcacgcttg 480 tagctgtgag aagttattcg cgtaaattca taaagagcaa tatgaggagt atagatttgg 540 cccatggagt gagctcactt ttatcagtct tataatgctt tttgaacgcc tgtctttact 600 tttttgtaaa gcattagctc aatttttgac cgtgggttgg gtgccacgat tttgtcgtaa 660 ctgcaactgc agccatctag cctagcctca aaaataaatt gatttttttt ttaattatga 720 gggtacagaa gttaagtatg aagaacatcc aatgttccaa agaaggaaga tctcaagaga 780 gacgctttgc attaagtgct acaacattca gttgatacac gattaataat tcgcctattg 840 ttgaataata tgtgtgttcc taatgaagtg ttcaacaatt agcgagcatt taaatgcatt 900 caataattgg cgaattacca cccaattaga aattttacct tctttcaaac aatagctgtt 960 cttttagttt kcagcgtggc gaatgatagg ctcgccatga tcgatgttgt attacatcgg 1020 attgccctgg taccatgttg attccgaatg taggtcaatc tggagatggg agggaaatga 1080 tgacgtgttt ctagctttag ggaaaccgga aaacttccac aagaaaacat tgggacacta 1140 ctcaatcccg gacaatgccc tttattcacc caaaagcctt tctactaagt ccaaacatat 1200 cgtgaaaaat accatcgtct ggctttcaag gcattaaaca attcaaaaca gcacgatatg 1260 aaagtaatga ttcgctaaac tgcaaactca aaagaacagc tcatgtttga aagaaggtga 1320 atattactaa ggcgctgtcc attaaccatg tggtcatttt tcggaatttc tgacccagcc 1380 cctccccctc ggggttattt gtacatacaa aatttttaaa atttatatgg accgtaccgt 1440 ggtctttgag cagacccttt ccttcccccc ataaatgacc acgtggttca tggacgagca 1500 cctaatctaa attaattatc ccacatgcta tggatcttca tctactcgat accttttcag 1560 caatatgctc aacctcagat gtgacaagta atggctttac gttggtgatg ttattatcat 1620 cagcctcaaa cgcacacttt gtgacttttt tcagtgcgag aatagctcta catattttga 1680 attttataaa ttaataaaac tcagtcttga tagtttatat tgacttgaaa aagtatcact 1740 gtacgcgcta acatgcataa agtatgctga tactttttca gctgtgtcag tgcaaaacca 1800 actgattttc tttgattcga aatcgtgaga tgaattagca acaatcatca acgacgcgta 1860 caaatttcaa tgacggccta cttcgcctta aagtatttcc ttgtacttac ttcacattaa 1920 tttctttggg ctaaaaatca ttatcaatat tttcccattc ttccagtaaa tttcacttgt 1980 ccacttaaat gaacgagcac ttaaataaac gttttgaaat gtgatttttt ttttcaaatt 2040 atgccagatg gccgttatgc caaatggcta ttatgccaaa tggccattat gcctaatggc 2100 tttatgccaa acggcattat gccaaatggc tttatgccaa atggggtaga gccc 2154 // ID TBVSG repbase; DNA; INV; 191 BP. XX AC K00617; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.brucei repetitive sequence. XX KW Repetitive sequence; TBVSG. XX OS Trypanosoma brucei OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma. XX RN [1] RP 1-191 RA De Lange T., Liu Y.A., Van der Ploeg H.L., Borst A.P., Tromp C.M. RA and van Boom H.J.; RT "tandem repetition of the 5' mini-exon of variant surface RT glycoprotein genes: a multiple promoter for vsg gene RT transcription?."; RL Cell 34, 891-900 (1983). XX DR GenBank; K00617; Positions 1 191. XX SQ Sequence 191 BP; 49 A; 46 C; 37 G; 55 T; 4 other; acggcttgyg cacaggcccc tttgtttccc ataggtctac cgacacattt ctggcacgac 60 agtaaaatat ggcaagtgtc tcaaaactgc ctgtacagct tatttttggg acacagccat 120 gctttcaact aacgctatta ttagaacagt ttctgtacta tattggtatg agaagctccc 180 artrrcagct g 191 // ID BEL-130_AA-LTR repbase; DNA; INV; 398 BP. XX AC supercont1.259; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-130_AA_; KW BEL-130_AA-I; BEL-130_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-398 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.259; Positions 1321193 1321590. XX SQ Sequence 398 BP; 138 A; 91 C; 58 G; 111 T; 0 other; tgttggggtc aaccctacgc cactggcttt atagcgcaca agcctaaata tcgaaacgaa 60 cacctttcat tatgacagct gtcaatccct gaagtgttga cgtaatatcc gtttatcctt 120 cctcaaacca ttctttattc agtataaaaa aggcagagtc agaactacca gtgaattaaa 180 agtaattaaa gctaaatatc agaattatta aagaccgtca tcctgtttat taacaccgag 240 attaagacaa ggagaccaaa tgtaagtcta cttactttat tgaattcttt cgcctaaaac 300 ttataattat ccccttaata aatagctttg agctcgccta accccaaaga cagcgtgctt 360 catagaactc cgaaaataca tccgaaaacg ttccaaca 398 // ID ISL2EU-3_HM repbase; DNA; INV; 3439 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A family of autonomous ISL2EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3439 RA Jurka J.; RT "ISL2EU-type transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2060-2060 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(121..690,617..1618) FT /product="ISL2EU-3_HM_1p" FT /translation="MISNDTISCDKLITHSIKENITVESTTTTKYRGGITC FT CVPLCNNNSLKNKNLSFYIIPKESNLRKKWLSLISRKNFVPSISHRVCSVH FT FTGGKKTYMNNCPSLLPKQVKDNIKKPRITVNSSIGYKRKIDEIENEKNDV FT NFSFEETEPEKLKKEIIILKDKLNIIKTKKKRNEKYYYEPKSHYQSIKIYS FT KLKKKEMKNIIMNQNHTINQLKFTVDRFKHNQAHFKFYTGFESYNLFKVLL FT EYLEPAASKLIYWGSNTNIEKTTDFNYNKKGRGRIMSSESELFLVLTRFRL FT GLLVEDMALRFDISSSHVSRIIVTWTDFLHSQMRMLPIWATKQTVKETMPK FT CFKEKYESTRVILDCTELFIEMPTSFRSQSATFSNYKHKNTGKGLIGIAPN FT GAITFVSDLYCGRFSDKQITKDCGIYNLLEPGDSVMADRGFDIADDLPENV FT SLNIPPFLNGKAQLSLEDENETRKIAAVRIHVERAIQRIKNYHILQTPFKL FT SMAPEINKTWIVCCYLANFLPQLVSDK*" XX SQ Sequence 3439 BP; 1240 A; 527 C; 485 G; 1185 T; 2 other; ggaccgttcc ctctatagaa aaaaaccgca atgcattttg gtctttgtat tcaccgaaaa 60 taaacaaatt agagttaaag ctttttaaaa taaacagtta atttaaaaag acttttcaaa 120 atgatttcaa atgatacaat atcttgtgat aaactaataa ctcattctat taaagaaaat 180 attactgttg agtcaacaac aacaacaaaa taccgcggtg gcataacttg ttgtgttcca 240 ttatgcaaca ataattcgct caaaaacaaa aacttatcat tttacatcat cccaaaggaa 300 agtaatttga gaaaaaaatg gctatcttta attagtagga aaaactttgt tccttctatt 360 tctcatcggg tttgttcagt ccactttaca ggaggaaaaa agacttatat gaacaactgt 420 ccatctttac tgccaaaaca agttaaagac aacataaaaa aaccaagaat cacagtaaat 480 agttctattg gatataaaag aaaaatagac gaaattgaaa acgaaaaaaa tgatgttaat 540 ttttcatttg aagaaaccga gcctgaaaag ttaaaaaaag aaattattat cctaaaggac 600 aaactgaaca ttataaaaac taaaaaaaaa agaaatgaaa aatattatta tgaaccaaaa 660 tcacactatc aatcaattaa aatttacagt tgacagattt aaacataatc aagcccactt 720 taaattttat actggctttg aatcatacaa tctatttaaa gttctactag aatatttaga 780 acctgctgca agcaaactta tatattgggg gtcaaataca aacattgaaa aaacaacaga 840 ttttaattac aataaaaagg gaagaggacg cataatgagt tcagagtctg aattattttt 900 agttttaaca agatttcgac ttggcctttt agttgaagac atggcgctgc gctttgacat 960 atcctcaagt catgttagta gaatcatagt gacatggact gattttttgc attctcaaat 1020 gcgtatgcta ccaatttggg caacaaaaca aactgtaaaa gaaacaatgc caaaatgttt 1080 taaagaaaaa tatgaatcaa caagggtaat tttagactgc actgaactat ttattgaaat 1140 gcctacatct tttcgaagcc agtctgcaac tttttctaat tataaacata aaaatacagg 1200 aaaaggatta attggaatag caccaaatgg agccataact tttgtttctg atttatactg 1260 tggtcgtttt tcggacaaac aaattactaa agattgtggc atatacaacc tgcttgaacc 1320 aggagacagt gtaatggcag atagaggttt tgatattgct gatgatttac cagaaaatgt 1380 ttcattaaac ataccaccat ttctaaatgg aaaagctcaa ctcagtctag aagatgagaa 1440 tgaaactaga aaaattgctg ctgttcgtat tcatgtagaa agagcaatwc agcgaataaa 1500 aaattatcat attttacaaa caccatttaa attgtctatg gcgccagaaa ttaataaaac 1560 ttggatagtt tgttgttatt tagcaaactt tttgccacaa cttgtttcag ataaataaaa 1620 tgtttgtaaa aaatataaaa acttttaatc taaactgcaa atatattttc ttgcataaac 1680 tcaaagtaaa atttagttaa ttttggtaaa gtagtttttt cccattgctc tttaagaaat 1740 ttaatactct caataaaaac atctttttca gtataaacta taaaatctag ttctttaatc 1800 tctaataatg ccataacacc ttgacattga taaaaataaa catgattttg tttaagcatt 1860 gttttgccat caacaacagc aagaaaaaat gttttatcta ggcaagcttc agctatagtt 1920 aaatcttttt ttgaataagg acatttgatt tcaatagccc ttattttttc atttactaat 1980 acaattccat cagggcttgt accgagccaa ggccattttg ggttaatgat aaatccgcat 2040 cttgttactg atacattttt tatattttca tactttttta ttgcagtgtt ttcatttatt 2100 ttaccccaat tacaagcttt gttgcaaaag ttatttttat tgtttttcaa aattctattc 2160 aaaatagagg caggaaatat acttttttgt ctattaataa caaaaccaaa gtttgaagat 2220 gtcaagcgtt tttttctttc ttcaagccat aaacttgtag attgtgataa agtagctctt 2280 tccaaagtct ttattttctc ttttgataga actaaatatt ttgaaataaa cattaattga 2340 gtttcatcaa attcaaatga gtcactgaat tctacttcaa atccattaat tatggtatca 2400 acaagattaa atataatatc agaatttaaa tccagttctt ttacttttag ctgatattca 2460 gatatcgatg tttcatagta ggacgtggaa tgataattgt ttccttttaa aagatttgca 2520 aatgctattc cttctcccag attatataaa ctatcagaca gttttttaat tttttcatcg 2580 cttggattat gtgcaaaaat tggggttgca caaaagtttc tatttcctga tacaaaagga 2640 cgctttcgtg agttgtttag atccttttct aaatcactct tttcaaagga aactttagaa 2700 agcttaattg gttctgagtt ttttccttca gcaggaacat gccattgctg taacacttct 2760 gtgcatgttt tatcgaatgg tacagagttt aaatttaatt gtttatattc atttaattga 2820 aataatactg cagcaacatg cttgcagcat cctcccatac cagctttaca attacattta 2880 gcatacaaaa tatctccttc tttttgacaa agatgtacat aaataaggta ctgatttttt 2940 ttcattgaag ctgaaacgtt agctctaata agaaaagaat ttatttcaat gttgacatta 3000 ggtttaacaa ctaaattttt aacgaagttc tccttaaata gctgataccc agagatttta 3060 tgctttgaag cacctctagc tccaaaatca gtcgaactat cgtctataat taagtatttg 3120 tgtatttttt cttggcttat attaggtagt aaggaaagac atcttgacca agagcttatt 3180 tgttcctgta caacttcttc agcatctatt tgattaaact gactagatat ttctcgaaat 3240 actggttgag tagcgattga aatgttttta aaaaaacatg gatcaatatc aatakcattt 3300 gaatcatgtt ccaaactaat ttcatagcta tcaactttgt ttattgctga catgataata 3360 tgttttgata atttgtttat tttcggtgaa tacaaagacc aaaatgcact gcggtttttt 3420 tctatagagg gaacggtcc 3439 // ID BEL-71_CQ-LTR repbase; DNA; INV; 497 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-71_CQ_; KW BEL-71_CQ-I; BEL-71_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-497 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX SQ Sequence 497 BP; 153 A; 102 C; 114 G; 127 T; 1 other; tgtagacgcg tagtgaacat ttataaagat ccggaaatat cttcaaaccg gacatttcaa 60 taagatttat tctcatagga aataccaggt agattcacct gaccacaggg gggctaccaa 120 gaacaaagga aatatcaaag ggttaatgag attagtgaga aggtagacac catgtggcga 180 ggtagttaaa aggattacgt gacaaggaag gaagtgtgta aatctgatta atccgtcccg 240 cttactctta tgctgatcaa taatttaaac tatgtactcg gtagtggaac tcgtaggctt 300 aaggttacat tgtacaaatt ttccaaataa aggtcaagtt cggaggtcac tcccaaaaac 360 ctgtgtatta atcacastca gtattgaact cctcgtcctc gatttttggg aaacctgcgg 420 tctcgactca tccagcagtc ggcagcggct tcacccggga tcatcgccag caggactgct 480 gcagtgttcg gcagcca 497 // ID Howilli1 repbase; DNA; INV; 2816 BP. XX AC . XX DT 07-OCT-2009 (Rel. 14.1, Created) DT 07-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Howilli1 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Howilli1. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2816 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 394..2154 FT /product="Howilli1_1p" FT /translation="MSLIRKHSEIWTHFEDIGNQQAKCNYCKCLLSWKSQS FT NLSRHLKCKHPAAMEPVVRQNEESSVVVQSAQPRITTFIHRPVPVSKSQQI FT DRQLVKMIAKGHHALRIVEETEFQKLIHDVSQTPGYKLPTRKTLTNALIPK FT IKDECTGKIMKMLSASTAVCLTTDGWTSVTNESYLAVTVHFIETEFTVLTS FT YTLACQGFEESHTSTNLCGFLQNIVSEWGLENKVAAIASDNAHNIAGAIRI FT GNWKHVRCFAHCLNLIVQKALVKMSRVRTKAKAISQYFNRSTSGLKKLREM FT QALFKLPELKITQDVPTRWNSTYKMFQRLTVLKEAIVAALSTRTDLILSPE FT DWDLIDGVLPILQPFFQITEEISAENNVTLSKIIVLVGILQKKMVSISSNI FT KNPTLNELITEIIVDMDERFRDYEGNFLYAESTILDPRFKGRVFKSVEAFK FT KSVADIKTNLLRSVASSSENISPNNESLNFKTQDKDDIWTDYDTNFKQVSQ FT PTNNTAAAIREMDKYLAEEYINRKDDPLVWWNHRKSQYPLLYAYMLKRLCK FT VATSVPCERIFSSAGETVRKRRSLLKPKNVENLMLLHNNM" XX SQ Sequence 2816 BP; 921 A; 516 C; 542 G; 837 T; 0 other; tagtgttggg ttgctcatga gtgagtgaac aaaaaagagt tgttcagtga agtgagcaac 60 tgaacatgtt ctcttccaat tgttcttttt tgttcacttg ttcttttttg ttcacttgtt 120 cttttttgtt cacttgttct tttttgctca cttgttcttt ttggttcact tgttcttttt 180 ggttcactag atctttttgg ttcactagat ctttttggtt cacttgttct ttttggttca 240 ctagatcttt ttggttcact agatcttttt ggttcacttg ttctttttgg ttcactagat 300 ctttttggtt cacttgctcg atctacgttt cttttgtctc aagttctttt ttcatattag 360 cgatcatatc agtttcactt ttaaaaattc agaatgtcgc taataagaaa acatagcgaa 420 atttggactc attttgagga cattggcaat cagcaggcga aatgcaatta ttgcaaatgc 480 cttctgagtt ggaagtccca aagcaattta agccgacatt taaaatgtaa acacccggca 540 gctatggagc ccgttgtccg ccaaaatgag gaaagctcgg ttgttgtgca aagcgcccaa 600 ccaaggatta cgacatttat tcacaggcct gtcccagtga gcaaatctca gcagattgat 660 cgacaattgg tcaaaatgat tgccaagggt catcatgctt tgcggatagt ggaggaaacc 720 gagttccaaa aattaattca cgacgtatca caaactccag gctataagtt gcccacaagg 780 aagacattga cgaatgcatt gattccaaaa attaaagatg aatgcaccgg aaaaattatg 840 aagatgctca gtgcgtctac agcggtctgc ttaactaccg acggttggac atccgttacc 900 aacgagagct atttagcagt cactgtgcac tttatagaga cggagtttac agtgctaaca 960 tcatacacgt tagcttgcca gggatttgaa gaatctcaca cctccacaaa tctgtgcgga 1020 tttcttcaaa atattgtttc agaatgggga ctcgaaaaca aagttgcggc aatagcttca 1080 gacaacgcac acaatattgc tggtgctatt cggataggca actggaaaca tgtccgttgt 1140 ttcgcccatt gtctcaattt gatcgttcaa aaggctttgg tgaagatgtc cagagtgcga 1200 actaaagcta aagctatatc acaatatttt aatcgcagca catccggatt aaaaaagtta 1260 agggagatgc aagctctgtt taagttgccc gaacttaaga taacgcaaga tgttcccact 1320 cggtggaact ccacttataa aatgttccaa agactgacag tgttaaagga agcgattgtg 1380 gcagccctgt ctactcggac agatttaata ttgtcacctg aggattggga cctaattgat 1440 ggagttcttc ccatactgca gccgtttttc caaataacgg aagagatttc agcagaaaat 1500 aatgtgactc tttccaaaat aatcgttctg gttggcatac tgcaaaaaaa aatggtcagc 1560 ataagttcaa acattaaaaa tcctacttta aatgaattaa taactgaaat tatcgtggac 1620 atggacgaac gcttcagaga ctatgaggga aactttttgt atgcggagag caccatactg 1680 gatccaagat tcaagggacg ggtttttaaa tctgtggagg cctttaaaaa gtccgttgcg 1740 gacattaaaa caaatttatt gcgatcagtg gctagctcgt cagaaaatat ctcaccgaac 1800 aacgaaagtc taaatttcaa aacgcaagac aaggacgaca tatggactga ttatgacaca 1860 aattttaaac aagtcagcca gcccaccaac aacacggcag cagcaattcg tgagatggac 1920 aaatatctgg ccgaggaata cataaataga aaggatgatc cattggtatg gtggaatcat 1980 cgaaaatcgc agtatcctct tctatacgca tatatgctaa agcggctctg caaagttgcc 2040 acatctgtgc catgcgagcg catattttca agcgcaggcg aaacggtgcg caaaagacgt 2100 tcgcttttga aaccaaaaaa tgttgaaaat ctaatgcttt tgcataataa catgtaaatt 2160 tcaagtaagg atatcataat ttttgaagtt ttcccaaata agaccctttt ttcttttagc 2220 aatgcaagca gctatattaa catctaataa tcataagaat caaagcattc caactgaatg 2280 gaattataaa acatgttacc ttaagttttt gttttgaaat tgttttaatg ttgttttgaa 2340 atataaatat gttatttagc atgaaatgtg atatatttta tttcagcagg ggtcgaaaaa 2400 aaatatataa aactgtactt caaatatatc ggtaactgca ggacgttact tttatttata 2460 aaaaggacat ctagttgaaa aaaaaagaca ttttttttat tgttaaataa aaatcataaa 2520 gtatcactaa taatattacc aaaatcgaca acaatgctta agatgtaatt ttcaaaataa 2580 aactcagaat gtttatgctg agcaactgaa caactgagca atgtagagaa caactgaaca 2640 actgagcaat gtagagaaca actgaacaaa tgagcaatgt agagaacaac tgagcaactg 2700 tagtgaacaa ctgagcaatg aagtgagcaa gtgaacaaaa tgagcgagtt gactcacttc 2760 gaaaaaaaga acaagtgaac atgttcacca aaatgagcaa tctttaccca acacta 2816 // ID LOA_Ele2 repbase; DNA; INV; 4172 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A LOA clade non-LTR retrotransposon family from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; Lian; KW LOA_Ele2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4172 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4172 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. The 5' 400 bp shows similarity to those of other CC Lian elements, and therefore this family likely encodes only a CC single protein. XX FH Key Location/Qualifiers FT CDS 402..4055 FT /product="LOA_Ele2_1p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MNRIKIIQVNLHHAKGASAVLSRRFTKENINVGLIQE FT PWVNNGRVLGCPSSSCRVLYDETQSKPRTAILVNRDTKFVPITEFIRRDIV FT AIKVEVPTIRGKTEVCIASAYFPGDVEDVPPCSVVHFVNYCKKINAQFIIG FT CDANAHHTIWSSTDINKRGEDLLNYMSSNNIDICNRGDSPTFVNSIRQEVL FT DLTICSDLLSEKIVNWHVSDEESLSDHKQIRFDFKAGSEITESYRNPKKTN FT WDLYRFHLTNKNSYNGEQFTTVSQLENASDGIIDQMVSSYHASCPIQQRSS FT NRDVPWWNDKLAGLRKKSRRLFNRAKITLDWVEYKKALTEYNRELRHAKRK FT DWRRVCETINNAPAAARLHKVLSKDHSNGLGCLKKEDGSYTVDSSETLEVM FT MRTHFPESIPYSDEEQVSDEARHVWSINAYQKACEIFTPSRVEWALSSFQP FT FKSAGKDDIFPALLQQGKETLCPLLTEIFKASVALSYIPKAWQKVRVVFIP FT KAGRKDKTTPKAFRPISLSSVLLKTMEKIIDDFIKSTSLIEMPLCKYQFAY FT QSGKSTITALQSLVNKIEKSLEAKEIAVVAFLDIEGAFDNASYSSMSSAME FT ARGLDKSVIEWVMTMLKGRTISADLGGAQISIRSTKGCPQGGVLSPLLWSL FT VVDELLRNLIDQGFEVIGYADDVAILVRGKFDNTISDRLQTALNITLKWCR FT KEGLNVNPSKTTIVPFTRRKKVKLIQPSLNGVQIQFSEEVKHLGVTLDKKL FT NWNSHLSKVISKGTNALWVCNKALGKTWGLRPNMVNWIYSAIVRPKISYAS FT LVWWPKTNEVTAQKKLAKLQRLACISMTGAMKSTPSVALDALLNILPLHQF FT IKLQAAKNALQFIRYNKILDGDLIGHMKIIKEFNLNSDNKTIEDWMTTKTN FT YDVPFKVVKPSRYTWDSGGPSLRAGSIVFYTDGSKMGDNTGAGVFGPGISK FT VIAMGRSPTVFQAEIQAIIDCTNVCLKRNYRFAKICIFSDSQAALNALKAF FT TCRSKLVWECILSLKQLASRNEVTLYWVPGHCGIEGNEIADNLARQGAASS FT FVGPEPFCGIPECTLRMKLKNWEMSMVESNWNATDTSKQAKRFIKPSLAKA FT RAILNLNKGNLRVITGLMTGHCPSRYHLKNIGKIQSSECRFCQAEDETAEH FT LLCNCGALLNQRTLTFGKGLLEPLEIWQGSPNRVIDFIKRVVPSWDSVTHQ FT PMSITSQL" XX SQ Sequence 4172 BP; 1327 A; 801 C; 933 G; 1111 T; 0 other; ggctattcgg cctcaatgga agttgatcct caatatcagc ttctatattc gaatagccta 60 gatagccgtg tagtgtcggt agccgattgc aacaaggcta gaaataacac tacggattgc 120 ctgttccggt ggtataagtc catcatacag gtggccccta attcatggtg tcatgcggct 180 aaaagcatac cgtgcctagg aatgaatggt taggggggtc aaaagaaacc taaccgtgaa 240 cggtgcctgt ggagtaccag ggcaccctcc acagtattat gcccttactg tgctacccgg 300 agcaatggcg cagttgacct tgtacttctc cgaaataatc ggctactcat atttagcctc 360 aaatcagagg cttgatacga gtgggagcat gttaatgtaa tatgaatcga atcaaaatta 420 tccaggtgaa ccttcatcat gcaaagggcg catcagcagt gctcagcaga aggttcacca 480 aggaaaatat caatgtggga ctaattcaag agccctgggt aaataatgga agggtacttg 540 gttgcccttc gagcagttgt agagttttgt acgacgaaac tcagtccaaa ccacggacgg 600 ctattcttgt aaatagggac acaaaatttg tcccaatcac agaatttatc cgtagagaca 660 ttgttgcgat caaagtagag gtgccaacta tccgtgggaa aacggaggta tgtattgctt 720 ctgcttactt tcctggagat gttgaagatg tgcctccatg ctctgtcgtt cattttgtca 780 actactgtaa gaaaattaat gcccaattta taatagggtg cgatgcaaat gcccatcata 840 caatttggag cagcaccgat attaataaaa gaggagaaga tcttttaaac tacatgtcgt 900 ctaacaacat agacatatgt aatagaggtg attctcctac ttttgtaaac tctatccgac 960 aagaggttct tgatctcacg atctgtagcg acttgttatc ggagaagatt gtgaactggc 1020 atgtttctga tgaagaatca ttgtcagatc ataaacaaat ccggttcgat tttaaagctg 1080 gatcagaaat aacagaaagc tatagaaacc cgaagaaaac caattgggat ctctatcgtt 1140 ttcatttaac gaataagaat tcgtataacg gtgaacagtt cacgactgtt tcacaactgg 1200 aaaatgcctc tgatgggatc attgaccaaa tggtctcatc ttatcatgct agctgtccta 1260 ttcaacaaag gtcatctaac agggatgtgc cttggtggaa tgacaagctg gcaggattga 1320 ggaagaaatc caggcggttg ttcaatagag caaagatcac tttagattgg gttgaataca 1380 aaaaagccct aacagaatac aacagagaac taaggcacgc caaacgtaag gactggagac 1440 gggtatgtga aaccatcaac aacgctcctg cagctgcaag gctgcacaaa gtcctttcaa 1500 aagaccattc aaatggtcta ggttgcctta aaaaagaaga cggcagttat actgttgact 1560 cttctgaaac actggaagta atgatgagaa ctcatttccc tgaatcgatc ccatattcgg 1620 atgaagaaca ggtctcagat gaggcaagac atgtatggtc gataaatgcc tatcagaaag 1680 cttgtgaaat cttcacgcct tcgagggtcg aatgggcatt gagttctttt cagcctttca 1740 aatctgccgg gaaagatgat attttcccag ccttactgca acaaggaaaa gagacgcttt 1800 gtccgctctt aaccgaaata ttcaaagcaa gtgttgcgct ttcatatatt ccaaaagcgt 1860 ggcaaaaggt tcgagttgtc ttcattccaa aagctggaag aaaggataaa acaactccca 1920 aagcctttag acctataagc ctttcctctg ttcttttaaa gacaatggaa aaaataattg 1980 atgacttcat caagtcaaca agtttaatag aaatgcctct ttgtaaatac caatttgcgt 2040 atcaatcagg taaatctacc attacggcgc tgcagtcgct ggtaaataaa atcgagaaat 2100 ctcttgaagc caaagaaata gccgtggttg cttttctcga tattgaagga gcattcgata 2160 atgcatccta tagctctatg agttcagcca tggaagcaag gggcttggat aaaagtgtta 2220 tcgaatgggt tatgaccatg ctcaagggtc gcacaatttc tgctgatcta ggaggtgcgc 2280 aaatatctat aagatctacg aagggatgtc ctcaaggagg agtgttatcg cccttactgt 2340 ggtctcttgt agtagacgaa ctccttagaa atttaataga tcaaggtttt gaggtaattg 2400 gatatgccga cgatgtggcc atcttagttc gtggaaagtt tgataacacg atttccgatc 2460 ggttgcaaac tgcgttaaac attactctca aatggtgtag gaaagagggg ttaaacgtaa 2520 acccttcaaa aactactatc gtgcctttta ccagaaggaa aaaggtaaag cttatacagc 2580 cttctttgaa tggagttcaa attcaatttt cggaagaagt taaacacctt ggtgttactt 2640 tggataagaa attgaactgg aactctcatc tgagcaaggt cataagtaag ggtactaatg 2700 ccctatgggt ctgcaataaa gccctaggaa aaacttgggg actgcgacct aatatggtaa 2760 actggattta ttcagcaata gtccgaccaa aaattagtta tgcttcactg gtatggtggc 2820 ccaagacaaa tgaggtaacc gctcagaaga agctggctaa gttgcagaga cttgcttgta 2880 tatcaatgac aggggcaatg aaaagtacac catcggttgc cttggatgcc cttcttaata 2940 tactgccttt gcatcaattc attaaactgc aagctgcaaa aaatgccttg cagtttattc 3000 gttataataa aatactagat ggtgatttga ttggtcacat gaaaatcatc aaggaattca 3060 atttgaattc agataacaaa acaatagaag attggatgac aacgaagacc aactatgatg 3120 ttccctttaa agtggttaaa ccaagccgct atacttggga ttctggtggg ccaagtttac 3180 gtgcaggttc tattgtgttt tataccgacg ggtcaaagat gggcgacaat accggagctg 3240 gagttttcgg ccctggtatc agcaaggtaa tcgctatggg gcgcagtccc actgtgttcc 3300 aagctgaaat acaagccatc attgactgta caaatgtttg tctcaaaaga aactataggt 3360 ttgcaaagat ctgtattttc tcagacagtc aagcagcttt gaatgctcta aaggcattca 3420 catgtagatc aaagttagtg tgggaatgca ttctttcttt gaagcaattg gccagtagga 3480 acgaagttac attgtactgg gtgcccggcc attgtggaat tgaggggaat gaaatcgccg 3540 acaatctagc aagacagggt gcagcttcga gctttgtcgg ccctgaaccg ttctgtggta 3600 ttcctgagtg cactctcagg atgaaactga aaaattggga aatgtccatg gtagaatcca 3660 attggaacgc cacggataca tccaagcaag cgaaaagatt cataaaaccc agtctagcta 3720 aagcccgggc tatactgaat cttaataaag gaaatcttag ggtaataact ggcctgatga 3780 ccggtcactg cccaagtagg tatcatctta aaaatatagg aaaaatacaa tcctctgagt 3840 gtcgtttttg tcaagctgaa gacgaaactg ctgagcattt actctgcaac tgcggagcat 3900 tgcttaatca aagaacgtta acatttggaa aagggttatt agagcccttg gaaatttggc 3960 aaggtagtcc taatagggta atagacttta taaaacgagt tgtgcccagt tgggacagtg 4020 tgacacatca accaatgtct atcacatctc aattgtgaaa ggatatttga tgagcaccaa 4080 ataaaatatg ggtcatacca caatattcct aaataatgga agcagtggtt aaaaggctct 4140 acagatgagg gaaaaaaaaa aaaaaaaaaa aa 4172 // ID Copia-25_SI-I repbase; DNA; INV; 4233 BP. XX AC AEAQ01023844; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-25_SI_; KW Copia-25_SI-LTR; Copia-25_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023844; Positions 209 4441. XX CC Positions [1534-2070] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 127..2229 FT /product="Copia-25_SI-I_1p" FT /translation="MATHEAKTETRITKLSDCNYRTWKTEMKWYLRGKGLL FT DYVLGTVALNENATEAERKVFRTNDDKAMAAIGLHIEPNQQIHIEDCNNAY FT EAWSSLEQVHQPINRIRIMQLKKEFYHIKMKDDETMSSYVARTKIAAANLK FT QAGAEVKDEDLAYAVLAGLPDTYENLNMALASLPDDRFTSVEVIRVLLAEH FT DRRKSRNDDTSAEAMEALQINKGSKGTKSDKTSNASAAEKTATCFNCKKPG FT HYARNCRSKTGNKQAQRDNKSKKNLDAFLVSLNNLDVEESWLLDSGCTHHV FT CKRKDWFQNFHEIEGETINTAANPEKQKGTQLCAKGKGDIVLKTFVGNAQK FT AIVLRNVYYVPHIRKNLMSVSQIERKGKELTIKDGKVRIRNTHTGKIMCEA FT YRKNDLYVVRAEVDLKTEAPAETNFASVKDSSIWHRRFCHVNNRAIEELAT FT HNRVRGLNDTKMDKYKCDACYIGKSTKSPCKKIKRRQSNDVCELIHSDLCG FT PMPVKSWSGNRYFITFTDDYSRKTTVMCIQSKDEVTDCVKKYIARVEREKG FT KKVKRFRTDNGLEYCNRQLTDFFEDTGIKHERSNVETPQMNGIAERINRTL FT MDLVRSMLKSAKLPQRFWAEAVVTAAYIRDRVGHSSIKGDVPLAIWTGRTP FT SVQHLKVYGCLAYANLPRQGRKKLDDRATECFLVGYASQTKGYRLWCQINQ FT T" FT CDS 2100..4223 FT /product="Copia-25_SI-I_2p" FT /translation="MSSIRKLTKTRSKETRRQSNRMFPSWLCIANQGIPSL FT VSDKSDVITTKHVRFAEDKIGYEWLYRDNTLHRYRYNNTWSESDSESEPEP FT TTTRRHSRSSTKKEESSSSPEDDSENSEESPTKKAISQTKACRIVGETDEP FT PRKVGRPKKKKVIRNPYGRKGKPKSNTDDQSTCDTEDNSEVEIHHVEVVEP FT RDLNDALASPQATEWKRAIREEIKSLEKQGTWEVTELPRGKRCIGCRWTFK FT LKTDSDGKICRFKARLVAQGFSQEKGVDYHETYAPVANFSIIRFMLAATAN FT FSWYTRHIDIKCAYLNGRLDEELYMKVPPLYKVEEGKVVKLKRPIYGLKQS FT GRSWNTEINNFLIENGFKRLRSASCVYCKGRWTILVIYVDDIFIFSRKKAP FT LLEAVQLIKNGYETRDLGDITYALGVKIQRNESGDLQLSQRAYIESLLSRF FT NMEECRAASTPLEHGLKLSKEDSSKSPDEKARMINVPYRQLIGSLMHLALY FT TRPDIMHAVTKLSQYNTDPGQLHWNQAKHILRYLSGTRDYALSYRAEKAPA FT IQIYSDADWAGDLDDRHSYSGTVVALGQSVIQWRSAKQKSITTSSMEAEYV FT ALSTGVKEAIWIQMFITELGMSELLPQTSELRCDNRAAIDFSKNCVEKGHT FT KHIDIAHHFVREKLDEKTITLSYIASNENPADIMTKPLRRVAHQGGVHKLG FT LNVAKVGD" XX SQ Sequence 4233 BP; 1470 A; 937 C; 1019 G; 807 T; 0 other; tggttatggg cccaggtacg taagaaatag gaagcgaatc gtgcaagcgc cagtccggca 60 cacgagaaga tatagattca tagcattcgc gcgtagacga tcgcccgaca agaacataac 120 ctcaacatgg ccacccacga agcaaagacc gagacccgta taacgaagct atcggattgc 180 aattatagaa cgtggaaaac agaaatgaag tggtacttga gaggtaaagg ccttctagat 240 tacgttttag gcaccgttgc gctgaacgag aatgccaccg aggcggaaag aaaagtcttt 300 cgcaccaacg atgacaaagc catggctgct atcggactgc acatagagcc gaaccaacag 360 atacacatcg aggactgcaa caacgcctac gaggcgtggt catcgctaga acaggtacat 420 caacctataa atagaattcg cataatgcag ttaaagaaag aattttatca cattaaaatg 480 aaagacgatg aaacgatgtc atcatacgta gccagaacta agattgcggc ggctaacctc 540 aaacaggcgg gtgcagaggt aaaagatgaa gacctcgcgt acgctgtact cgccggcctg 600 cccgatacat acgaaaattt aaatatggca ctagcgagtc tgcccgatga cagattcaca 660 tctgttgagg taataagagt cttactagcc gagcacgaca gacgaaagtc acgcaacgac 720 gatacctccg cagaagcgat ggaagcatta caaataaaca aaggatcgaa aggtactaaa 780 agcgacaaga catcgaacgc gtctgcagca gaaaagacgg caacctgctt caattgcaag 840 aagccaggcc attacgcaag aaattgccga tcaaaaacag gtaacaaaca agcccaacga 900 gacaacaaaa gtaaaaagaa tcttgatgca tttttagtgt ctctgaacaa cctcgacgtt 960 gaagaaagct ggttgttgga cagcggatgt acacatcacg tgtgcaaacg gaaagactgg 1020 tttcagaact tccatgagat agaaggagag accatcaaca ccgcagcgaa tcccgagaag 1080 caaaagggca ctcaactatg cgcgaagggc aaaggcgaca tcgtattgaa gacattcgtc 1140 ggcaatgcac agaaagctat agtactacgc aatgtttact acgttccgca cattagaaaa 1200 aatttaatgt ctgtttcaca gattgaaaga aaagggaagg agctcacaat aaaagatggc 1260 aaagtgagaa tacgcaacac acatacagga aaaataatgt gcgaagccta cagaaagaac 1320 gacctgtacg tcgtccgagc tgaagtcgat ttgaaaactg aggctccagc cgaaacaaac 1380 tttgcttcag tcaaagacag cagcatatgg catcgtcgat tctgccatgt aaataacaga 1440 gcaatcgaag aactcgctac acacaataga gtgagaggtc tgaacgatac caaaatggat 1500 aaatacaaat gcgacgcgtg ttacattgga aaatcaacga agagtccttg taaaaagatt 1560 aaaaggagac aaagcaacga cgtatgcgaa ctgatacact cggatctctg tggtccgatg 1620 cccgtcaagt cttggagcgg caacagatac tttatcacct tcaccgatga ttactcgagg 1680 aaaacgaccg tcatgtgtat acaatccaag gatgaggtca cagactgcgt caagaaatac 1740 atagctagag tggagcgtga gaaaggaaaa aaggtaaaga gattccgaac agataacgga 1800 ctcgagtact gcaacagaca acttacagat ttctttgagg acacaggcat aaaacacgaa 1860 agatccaacg tagaaactcc tcagatgaat gggatcgccg agcgaattaa tagaacgctt 1920 atggatctgg tgagatcaat gttgaaatct gcaaagctcc cccaaaggtt ctgggcggaa 1980 gctgtcgtga cagccgctta cattcgagac agagtcggac attcgtcaat caagggagac 2040 gtgccactcg ccatttggac aggacgaacg cctagcgttc aacatctcaa agtgtatgga 2100 tgtctagcat acgcaaactt accaagacaa ggtcgaaaga aactcgacga cagagcaaca 2160 gaatgtttcc tagttggtta tgcatcgcaa accaagggat accgtctttg gtgtcagata 2220 aatcagacgt gatcacaacg aaacacgtac gtttcgctga agataaaata ggctacgaat 2280 ggctctacag agataacacg ctacacaggt acagatacaa caacacttgg tcagagagcg 2340 acagcgaatc ggaaccagaa ccgacgacaa cgcgtcgaca cagccgatca agtacgaaga 2400 aagaagaatc ttcgtcatcc ccagaagacg actcggaaaa tagtgaggaa agtccaacga 2460 agaaagcaat aagccagact aaagcatgtc ggatagtggg ggaaaccgat gaacctccaa 2520 ggaaagttgg cagaccgaag aagaagaagg taataagaaa cccatacggc agaaaaggga 2580 agcctaaatc caatacagat gaccaaagta cgtgcgacac agaggataat agcgaggtag 2640 agattcatca tgtcgaagtt gtcgagcctc gtgatctaaa cgacgcactt gcctcaccac 2700 aggctacaga gtggaagcga gccatccgcg aagaaatcaa gagtctagaa aaacaaggaa 2760 cttgggaagt gacagaacta ccacgaggta aacgatgcat aggatgcaga tggactttca 2820 agctcaagac ggactccgat ggcaagatat gtagatttaa agcaaggtta gtagcgcaag 2880 ggttctcaca ggagaagggt gtcgactatc acgaaacata cgcacccgtt gctaacttct 2940 caattatacg cttcatgtta gctgcaacag caaatttttc atggtacact cgccacatag 3000 acataaagtg cgcctacttg aacggaagac tagacgaaga gctgtacatg aaggtgccac 3060 ccctatacaa agttgaagaa ggaaaggtag tcaaactcaa gcgaccaatc tacggattga 3120 aacagtcagg aagaagctgg aatactgaga tcaacaattt ccttatcgaa aatggattta 3180 aaagactacg ttctgcaagc tgtgtatact gcaaggggcg ctggacaata ctggtaattt 3240 atgtcgatga catttttata ttctcccgaa agaaggctcc gttactagaa gcagtgcaac 3300 taataaagaa tggctacgag acaagagatc ttggtgatat tacatacgct ctaggagtga 3360 aaattcaacg aaacgaatct ggcgatctac agttaagtca gagagcatac atcgagtcat 3420 tgttgtccag gttcaacatg gaagaatgtc gagctgcctc gacaccactc gaacacgggc 3480 taaaattgtc aaaagaagat agctccaagt caccagacga aaaggcaagg atgatcaacg 3540 tgccataccg gcaactcata ggatcgctga tgcacttggc gctgtatacg cgaccagaca 3600 tcatgcacgc agtaacgaag ctatcgcaat acaacacaga tcctggccaa ctacattgga 3660 atcaagccaa gcacatattg cgctacctga gcggaaccag agactacgcc ctatcatacc 3720 gtgccgagaa ggcacctgca attcagatct acagcgacgc tgactgggca ggcgacctcg 3780 acgatagaca ttcgtactca ggaacggtcg tcgccttggg ccagagcgtg atccagtgga 3840 gatcggcgaa acaaaagagt atcaccactt caagcatgga ggcggaatac gtcgcattat 3900 ctacaggagt gaaggaagcc atttggatcc agatgttcat caccgaactg ggtatgtcag 3960 aactacttcc tcaaaccagt gaattgcgat gcgacaaccg agcggcgata gacttctcga 4020 agaactgcgt cgagaagggg catacgaagc atatcgatat tgcccatcat ttcgtacgcg 4080 aaaaactaga cgaaaagaca atcacattgt cttacatcgc atcgaacgag aatccggcgg 4140 acatcatgac gaagccacta cgacgggttg cacatcaagg aggagtgcat aaacttggat 4200 tgaacgtcgc aaaagtgggg gattgaagag gtt 4233 // ID Boudicca_I repbase; DNA; INV; 4279 BP. XX AC BK004066; XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version 3) XX DE Schistosoma mansoni Boudicca LTR retrotransposon mRNA, complete DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Schistosoma mansoni; Boudicca; Boudicca_INT; KW Boudicca_I. XX NM Boudicca_I; Boudicca_INT. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Boudicca_INT."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX DR Genbank; BK004066; Positions 1 4279. XX CC Key Location/Qualifiers CC CDS 236..1069 CC /codon_start=1 CC /product="gag protein" CC /protein_id="DAA04496.1" CC /db_xref="GI:44829165" CC /translation="MTEHSPKVFKLKSIPPISIQLTPFWPDNIESWFCYAEADFCMHG CC ITDSRTRFLAVVKALPREFNRYVTPSMFTSDVSEPYESLKLSILKRGDLTDRQRLDQL CC LNNIDLQHGSATDMLLRMREVIGQRTFDDGLFRQLFLSKLPQQVQAVLVSFQNNAIDE CC LAASADRILEITKFSKAEVFTIKEKPQSTSTDMAELCHTLTRCLRVRNDRSRSKTPRR CC SASRKRSGSRPRETDNPDWCWYHNQYGKFSRNCRKPCNYPTPKSSDTKNSSGNFPAGT CC R" CC CDS <1027..4200 CC /note="start codon not determined" CC /codon_start=1 CC /product="pol polyprotein" CC /protein_id="DAA04495.1" CC /db_xref="GI:44829164" CC /translation="HEKQFGKLPSRHALTATVAGEHSRLSYVTDVTTKVRYLVDTGAE CC VSVLPANLDDRLRESVLSLQAANGKPIATYGKRYVYLNVGLRKPIHWIFVVADVSMPI CC IGIDLLQHHNLLIDTRARRLIDGNTKLSVCVTPCSGCRLSPVTIKHTIDPLYQAVLDK CC YPGICQSQSKLPCVTSNVTHHISTTGPPVFSKARRLAPEKLRLAKNEFDHMMDLGIIR CC PSNSPWASPLHMVPKKDSNDWRPTGDYRRLNAKTIPDRYPLPHIHDLTATLKGTTVFS CC KIDLVKAYNQIPMAPDDIPKTAIITPFGLYEFLRMPFGLRNAAQTFQRFIDDVFRGLN CC FVHAYVDDCLIASPDRESHLKHLDIVFDRLQRHGITVNIQKCQIGTNCLDFLGHTIDA CC QGIRPLRSKVAAILDYPEPTTIKQLRTFNGLVSYYRRFIPKCASLMKPLTDQLRGNAK CC SISLDDIARKAFSPVKEHIVKAPLLAHQDTQAPISIAVDASDSAIGAVLQQWVNNSWQ CC PLGFFSRRLLDAESRYSTFGRELLAMYCAVRHFQHSIEGREFTLFTDHKPLTFSLNSS CC SDKYSPRESRQLDYISQFTSDIQHISGANNVVADALSRISSLNSFQGIDLLKLAQLQK CC EDIDLQHELSSTTLKLSIRQMGTGKETLLCDTSTGRDRPIVPKHYRRNVFNTLHNLSH CC PGARATVKLIAERFCWPGMNKDVREWARSCVSCQKSKVIRHNKCPLGYFKTPDARYDH CC VHLDLVGPLPDSNGYSYLLTCVDRFTRWPEAVPIKDITAETVARAFVERWVENFGCPS CC TITTDRGRQFESGLFRCLTSLLGIMRFRTTAYHPQANGLVERFHRRLKASLSASNVSQ CC WTDALPLVLLGIRNAVKADIGYTASQLVYGTTLRLPGEFVDPSASSLNMDLNSYTSRL CC TNAMRSVKPAHTRPQSTDVFVQPELRHSTHVFVRRDSHRRPLESAYEGPFKVLQREPK CC YYVVDKNGTKDSISIDRLKAAYLEGNPSYVDFPSVQANDTATTLVVPYTTTDTQLDTS CC STSIEKPKTTRFGRRVRFPEHLNEYCT". XX SQ Sequence 4279 BP; 1225 A; 1080 C; 842 G; 1132 T; 0 other; atagaaggtg ctactggtat ccagttgtaa aactagaata tccgttgcat catcattggt 60 gagaccgacg tgatctggta caccaagtca atatactgtg agtctgttcc attttggaat 120 attattattc attgttattc tttatttgaa ttttatatat tgtgatatac cttttttcgc 180 gaattttttt tctcggatat attttaatta gacatttttc gttagctata ttactatgac 240 ggaacactca cccaaggtct ttaagttaaa gtctattccc cctatttcga ttcagttaac 300 gccattttgg cccgacaata tcgagtcctg gttctgctat gcagaagccg atttttgcat 360 gcacggaatc acggactcgc gcacacgttt cctcgcggta gttaaggcgt taccgcgcga 420 gtttaacagg tacgtaacac ctagtatgtt tactagtgat gtttccgaac cttacgaatc 480 gcttaagcta tcgattttaa aacgaggaga cctaactgat cgacaacggt tagaccaact 540 ccttaacaat atcgacttgc aacatggttc cgcgacggac atgttgctta gaatgagaga 600 agtcatcggc caacgaactt tcgatgatgg cctatttcga caacttttct tgtctaaact 660 ccctcaacag gtgcaggcag tgcttgtttc atttcaaaac aatgccatag atgaactagc 720 tgcatctgcc gatcgaatct tggaaatcac caaattttct aaggccgagg tttttactat 780 caaagaaaaa ccccaatcga cctcgactga catggccgaa ctatgtcaca cgcttacacg 840 atgtcttaga gttcgcaatg accgtagtcg gtcaaaaacc ccgcgtagaa gtgcctcacg 900 taagcgatct ggctcccgac cacgagagac agacaatccc gactggtgct ggtatcataa 960 ccagtacggt aagttttcca gaaactgcag gaaaccttgc aattatccga ccccgaaatc 1020 gagtgacacg aaaaacagtt cgggaaactt cccagccggc acgcgttaac ggcaaccgta 1080 gccggcgaac acagccgtct ctcatatgtc acggatgtga ctacgaaagt tcgctacctc 1140 gtcgacactg gcgcagaagt tagcgttctt ccagcaaatc tcgacgacag acttcgcgag 1200 tctgttttaa gtttacaggc ggcaaacgga aagccgatcg ctacttacgg taaaaggtac 1260 gtctacctta acgtgggttt acgcaaaccc atacactgga ttttcgtggt tgcagatgtc 1320 tctatgccaa tcattggtat agacctgtta caacaccaca atctactcat cgacacacgc 1380 gcacgaaggt taatagacgg aaacactaag ttatccgttt gcgtaactcc ttgttctggt 1440 tgcaggttat ccccagtcac gattaagcac accatagacc ccttgtacca agcagtactc 1500 gacaaatacc ccgggatatg ccaatcgcaa tcaaaattac cttgtgtaac cagcaatgtt 1560 acacatcaca tctcgactac aggaccacct gtattttcga aagcccgaag actcgctccc 1620 gagaagctta ggttagctaa aaacgaattc gaccacatga tggacctagg gataattcga 1680 ccatcgaata gtccatgggc atccccattg cacatggtcc ctaaaaagga cagcaatgat 1740 tggcggccaa ctggtgatta tcggcgtctg aatgctaaaa ccattcccga tcgttacccg 1800 ctgcctcata ttcacgattt gacagctacc ttgaaaggta caactgtctt ttcgaaaatc 1860 gacttggtta aagcttataa ccaaatcccg atggcacctg acgacattcc taaaaccgcc 1920 atcatcactc ccttcggtct ttacgaattt ctacgaatgc cttttggttt aagaaatgcg 1980 gctcaaacct tccaaaggtt tatagacgac gtcttccgag gtctcaactt cgtacatgcg 2040 tatgttgatg actgtctgat agcaagtccg gacagagaat cacatctcaa gcatctggac 2100 attgttttcg atcgactcca aaggcatggc attactgtca acattcagaa atgccaaatc 2160 ggaactaact gcttagactt cttaggacac actattgatg ctcaaggcat ccgaccactt 2220 aggagcaaag tggcggccat tctggattac ccggaaccga ccacgatcaa gcagttacga 2280 actttcaacg gactcgtgag ttactataga cggttcatac ccaagtgcgc atccctaatg 2340 aaaccgttaa ccgatcagct tcgtggaaat gcgaaatcaa ttagtttaga cgacatagcc 2400 cgtaaagcat tctccccagt taaggaacac atcgttaaag cacccctgct tgctcatcag 2460 gacactcaag cgcccattag tatcgcagta gacgcatccg actcagcaat cggagcagtc 2520 ttacaacaat gggttaacaa ctcatggcaa cctctaggat ttttctcgag acggttgtta 2580 gacgcggaat ctaggtacag cacgttcggt agggaactcc tagctatgta ttgtgctgta 2640 cggcattttc aacattccat cgaaggaaga gaattcacgc tttttaccga tcacaaacct 2700 cttaccttct ctttaaattc atcttcagac aagtactcac cgcgagagtc tcgacaactc 2760 gactacattt cgcagtttac ttcagacata caacatattt ctggagcaaa caatgtagtc 2820 gcagacgcct tgtctcgaat tagctccttg aacagtttcc aaggaatcga ccttcttaaa 2880 ctcgctcagc ttcaaaaaga agacattgat cttcagcatg agttatcctc gaccactctt 2940 aaactaagta ttaggcagat gggaacaggt aaggaaacct tactgtgtga tacatctaca 3000 ggtagggatc gtccaatagt accaaaacat tatcgacgca acgtcttcaa tacgttgcac 3060 aatctttctc acccaggtgc tcgtgcaaca gtcaaactta tagccgaacg cttttgttgg 3120 cctggcatga ataaagacgt gagggagtgg gcacgctcct gtgtaagctg tcagaaatct 3180 aaggtaatta gacacaataa atgtccctta ggctatttca aaaccccgga tgctcgttac 3240 gaccatgttc atctagattt ggtaggaccc ttacccgact cgaacggata ctcatatctc 3300 ctaacatgcg tagaccgttt tactcgatgg ccagaagcag taccgattaa ggacattact 3360 gctgaaaccg tggctcgcgc tttcgtcgaa cgatgggtag aaaatttcgg ctgcccttca 3420 accatcacta cagaccgcgg acgccagttc gaatctggac ttttccgttg tctgacctca 3480 ctattaggaa tcatgcgctt ccgaacgacc gcctaccacc cacaagcaaa cgggttggta 3540 gaacgttttc atcgacgact aaaagcttca ttatcagctt caaacgtttc acagtggaca 3600 gacgctcttc cactcgtctt gctaggtatc cgcaatgcag tgaaagctga cattggatac 3660 acggcatctc aactcgttta cggaacgaca ctccgactcc caggagaatt cgtggatcct 3720 tcagcttctt cattgaacat ggatctaaac tcttacacga gtaggcttac aaacgctatg 3780 cgttcagtta aacctgctca cactcgacct caatcaactg atgttttcgt tcaacccgaa 3840 ttacgacata gtacacacgt cttcgttcgt cgagactcac atcgacgacc cctcgaatca 3900 gcatacgaag gacccttcaa agttcttcaa cgcgaaccta agtactatgt agtcgataag 3960 aacggtacga aagatagcat cagtatcgat cgactcaaag ctgcgtacct agaaggaaac 4020 cctagctacg tcgattttcc ttcggtacaa gcaaacgaca cagctactac actcgtagta 4080 ccctacacga caaccgacac acaacttgat acttcgtcaa cgtcgataga aaaacctaaa 4140 acaacgcgtt ttggaagacg agtaagattt ccagaacatc ttaacgaata ttgcacgtag 4200 gacataagct ttatcttgaa ttaccctttc aaagtttatt ataatttttt tttaatatgc 4260 tcattttttt tatttatat 4279 // ID Gypsy-98_CQ-LTR repbase; DNA; INV; 740 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-98_CQ_; KW Gypsy-98_CQ-I; Gypsy-98_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-740 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 576-576 (2011). XX DR [2] (Consensus) XX SQ Sequence 740 BP; 163 A; 175 C; 184 G; 217 T; 1 other; tgagtacaac tccccccttt gagactatct ccatgcgcat gtgaatttgg ggttttagcg 60 aagacacgca ccactaagag ctcggccgag ccaagaaacg aacgaaaaac gcacacgtag 120 ggaaaatcca gctagctatg tttcgggaac attcakatta gaggcctcaa ctataactat 180 gctaatccta ggttaatttg tagtttacga tcgtaccgct gtaataaaat catgctttgc 240 agtaccgtgt ttcctctaaa catgaagtag ttcgttcttt aagcgatttc ccgagaattg 300 agttgtcctg gcgtaggggt tttctcacat ccgtaccgtg gtagttttag cttcttgtag 360 tactccgagt tggactgctc gggaatccta gtccctccct tccgatttct agcacgttac 420 ggggtcttgg ttgggaacct tctaccggtg atgatgatcg tgagtgtgag ttaggttgct 480 ggtaagttcg tgaggaccta tcagatgatt tttggtatcg gttatcgggt gtgtcgaacc 540 ctctaccgcc tctttatctc tccggattgc tctgtctgat ggtaggacga tttttgatta 600 cccgccctga ggtctgggtt tcatgctggg caagttaacc tgcacgagtg gtcctttttc 660 caaaggtggc gcttaagcca ctcgttacag tcctgcgaca atccgaagac ggtcacgtgg 720 ccgctaccca gcgggctaca 740 // ID CR1-21_BF repbase; DNA; INV; 2441 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-21_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-21_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2441 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2441 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1592-1592 (2009). XX DR [2] (Consensus) XX SQ Sequence 2441 BP; 872 A; 505 C; 532 G; 532 T; 0 other; ttagaggcaa taagagatgt gtacctcttt caacatgtag actttgcaac aagacaccag 60 gctaaccaga acagcaatgt gttagacctg atacttacaa acgagatgca tatgatagat 120 acggtcactt cgaggccacc tgttgggaaa agcgaccaca tcatgctgga atggtcatac 180 aaatgctatg tggaaaggag tcaaaacggc agggaattcc gcttgtacca caaggggcac 240 tatgatgaca tgagagaaga tttggaaaaa gactgggata caatgttaga agatttagat 300 acaaacgagt gttgggaagt attgtcagaa acagtaagga aatcttcaga caaaaacata 360 cccaaagtta atgcaacaca gagaagaaga agactttcat ggaaagacaa agagacaaca 420 aacaaactaa acaaaaagcc tggaagagat acatccacac aagaagtcat gttgactatg 480 ttaaatatac aaaactgcgt aaccaagcac ggtgggcagc aagaaaagcg gtcaagaaac 540 atgaaaagga gatcgccaga aatattaaag ggaatacaaa gctattttgg aaatacgtaa 600 ataaaaagtc aaaaacaaga cataacatcc cagacctgat ggatggagat gagaaagcaa 660 gaactgatgt ggaaaaagca gaaatactaa acaaattctt cgttagtact tttactaaag 720 aagacacaag gtgtatacca caacctgaag agtcaacata caaacaggag ttgaaaggca 780 ttgaactaaa tatcgaagat attcaaaaac gtctagagaa cctcaacccc aacaagtcca 840 tgggaccaga ccaaattcac cccagagtcc taaaagagct tgctggaacg ctggctccac 900 ctctgtacat tctctttcaa aagtcactga acgagggaga ggtgccaaca gactggaaaa 960 ttggaaacat tactccactc ttcaagaaag gctgtagaaa aactccaggc aactacagac 1020 ccgtgtgtct aacttccgtg gttggcaaag ttctggaggg tataattagg gatgagatta 1080 taaaacacat gagaaggaat gaacttttta cacgacaaca gcatgggttc ctccctggaa 1140 gatccacagt aacacaaatc ctggaatgtt tggaaagctg gacgacctgg ctagaccaag 1200 gtatacctgt ggatgtaatc tacatggatt tccgcaaggc cttcgactca gttcccatac 1260 aaaggctttt gaggaaagta gaaagttatg ggatcaaggg gagccttctc aaatggatag 1320 cctcgtttct cgaaaataga aaacaaaggg tgtgtgtcaa cgatagtcac tcacggtggt 1380 cggaggtaca cagtggagtc ccgcaaggga gtgttctcgg accagtgtta ttcacaatat 1440 tcgtgaatga tatgcctaat gcagttacct gtcacttaaa attgtttgcg gacgacacaa 1500 aaatctaccg aacagtacca aacatacgtg actgtgaagc actacaacaa gacctggaca 1560 gtctgcagtt atgggcttca aggtggcaac tccaatttca cccagataaa tgcaccgtaa 1620 tcagaattgg cgggggacac ccagacttct catacaaaat gaaagacgaa gcaaatccct 1680 tgcctctgtc tctagaattc acggaggaag agaaagacct cggaattatt gtagacaagg 1740 acctgggttt taacaaacac atccttacca taacaaagaa ggcaaatcag ataatgggac 1800 tggtatggag aacatttgag tttatggaca aatgtatgtt tttaaccctg tacaagtcta 1860 tcatccgtcc tcatctagag tatggagcaa gtgtatggtc tcctcaccta tggtcactgg 1920 caatcgaaat tgagaaagtg caaaaaaggg caaccaaaag agtaccgggt ctggcggatc 1980 taccctaccc agagaggctt aggagtcttc acctccccac attggtatat agaagactaa 2040 gaggcgactt aataaacacc ttcaaataca cgaaaggact atacgatact ccatgtttag 2100 tccacttgtc tgtagaaaca agaacaagag gtcaccagtt gaaactaaaa agatccttgg 2160 cgaaaacaaa tacacggcta tactcctata caaatcggat tgtcagctgg tggaacaatc 2220 tcccagagga agtagtgatg gcaccaactg tgaatggttt taaagcaaga ttcgacaaac 2280 atatgaaaga ccatcctgtc gtatacaacc acagggcact ggaccatccc ttacgtccga 2340 aaatgtcagt atcctgagta ttccgaattg aagagcaggc caaactggag tggatactcc 2400 taccggactg aaaaactcta ctctactcta ctctactcta c 2441 // ID BEL-1_DWil-LTR repbase; DNA; INV; 381 BP. XX AC scaffold_180576; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_DWil_; KW BEL-1_DWil-I; BEL-1_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-381 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180576; Positions 25473 25853. XX SQ Sequence 381 BP; 98 A; 95 C; 85 G; 103 T; 0 other; tgttcacgcc accgggtgaa cagcagtgag cggcgaagag cagtgaagag tgaggagggt 60 tttttgatcc tcggcatcta cggtcacacc aggaggggtg actgggtttt ttccaatggg 120 catcgccatt atactgaatc gagattttca cccgaacacc cgttcaccgc ctgtatatta 180 tactttgcga tttgtcactt tttccaaccg aaccatttcg tgtgtgtgtc ggtatatctt 240 gtgaactaaa taaagcaaac atagaattaa aacgaagttt tcgcgcgttt taatcattct 300 actcacgtac gagaccctga agatattggc acccgccaac cacttgatac tatatcgggt 360 ccttccgccc cgccacaaac a 381 // ID Gypsy-62_AA-I repbase; DNA; INV; 4647 BP. XX AC supercont1.345; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-62_AA_; KW Gypsy-62_AA-LTR; Gypsy-62_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4647 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.345; Positions 291 4937. XX CC 'GACCG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1505..4045 FT /product="Gypsy-62_AA-I_1p" FT /translation="MCKSKSVVHAVQLSEEDDIDSDEESEELFIGSVESSE FT SVNDWFEVVKVNKKKFSVKLDTGAHCNVVPLWLVKQLGISVRPSKTKWLVS FT FSNHRLKVLGEINPVCRIKNKNSSITFKVVEESVVPVLGKATCIAQQLIAR FT VDTVQVLDDSVYDGLGCLKDYVYDIDLIEDPQWEVRPARHVPHSIRSAVKK FT ELDSMERLGVVEKIHAPTPVVNAMVLVKRNEKLRICIDPSQVNQNLLRRHH FT PLTTIEEIATRLKNSKYFTILDCKKGFWQIRVSERTSKFLAFGTPWGRYVC FT KRLPFGLASAPEVFQKVMHDTLEGIEGVEFSMDDILIHADSSDKLNKITGK FT VVKRLHSAGLKLNRDKCLFNQTSVKFLGHIVTDGGLKADPDKLEAVIKLKT FT PSNRSELQRALGMITYMGKFVPNLSDVTAPLRSLLAKDVAWEWGYEQDKSF FT QKIKALMQSPPILKYYDLNAPVTLSVDASSHALGAVLLQQGRPVAYASRAL FT TSSEMNYPQIEKEATAIRYACSKFHQYIYGKRLTIETNHKPLESIFRKPLD FT RAPPRLRRIRLEVAQYNPTVVYVKGANIPIPDILSRDVANTAIAEDDEQLE FT VHIVLQMTKSASAELKQHTDKDYEMQSLKSVILQGWPETINQLIPELRKYW FT NFRDEMAVYDGLIFKSNQVLIPQPLRKAMLVKIHSGHAEIQSCIRRAKQVL FT FWIGMGNDIQEMVETCSICQRHQRSNTKSTIILKDIPRLPFERVASDLFHF FT KGKEYLLIVDSYSGYFDIKQLSETTSKRIIKHLKEWFSIHGIPQVLESDNG FT PQYASAEFRQFSRQWGFEHQTSSPHFPRANGLAERYVKVAKSMLK" XX SQ Sequence 4647 BP; 1449 A; 903 C; 1112 G; 1183 T; 0 other; tggtgtcaga agaaaccaag ctcgcgaaaa tattgaattt taccggaaaa aatcgataaa 60 atcgcgatga aatatcctgg cgagacagtt aagtcactaa tcattcagtg cgaatagaag 120 ataagtgtag tggcgtgaat attaaagctg aatcccgtca aacaaaatcg atcttcatag 180 cgatccgata aaatggcgtc gtcagcaggc ttgaaaccgc caaagcccat agtgctgggg 240 gaaaatatgg ccgcccagtg gaagagttgg gttcggcaat attcgtggtt cgcgattgca 300 acgcaacttt cggaaaaatc gaatgaggtg caggcagcaa cattcatgag tgcgataggg 360 gaagactgtg tgcgaatttt cgatactttc tgcttgcggc cggaagaaga aaacgatgta 420 gaagttataa aaacaaattt tgacgagtat tttctaccga agtcctgtat tacttttgaa 480 cggtacaact tcaatcaaat cgtgcagcag gaggacgaac agtttgacag tttcgttact 540 cgggttagag aacaagcgaa aaaatgctct ttcagtgtgc ttcacgattc acttgtaaaa 600 gatcgaatca taattggagt gagatataca agcttagtgc cacagctgtt aaacgacgat 660 ttaactttgc agaaaacgat cgagttgtgt cgtaatttcg aggccacgac actgcagtca 720 aaagcgttgg ccggcgaaag caaaattgac acagtgaaag taatgaagaa gaagatgaat 780 gcttgccggt ttgaagaaaa ggatgatgta aggtaaaaag gtgtaatacg ccccccctta 840 aggaaaaact gctctatcgc agttgcaaat gcatcacttc taaagttttt catgtcaatg 900 tgttgctcat ttagttggtc atgctctgca tgcaactttc tctttccaga ttgctgctaa 960 aatgtttact agatgttttt ttgtaaaccg tcccaatgtt tacatttcaa aacatcacgg 1020 gcaatatgcc cctcccatag aaaaaaggtc cacagcattg taaaaatgtt tccaacgaga 1080 attccaccac ttgctaagct taaaggggtc cattagcagt cgtcttccgg taaaagtttt 1140 tgcaataagt acaatagtaa aggaagggct aaagcggcaa ggcttgctta ttggacgcaa 1200 aattttacgc caaaaacctg attttcgtta tttttagaat ttctattgat tgttaatact 1260 tctcaaagaa catatatgcg aaacatcatt tattcgcgaa tagtgaagaa atgtaatcat 1320 atttacgtgt aaaagaagcc tcaatcccgt gaaacttcag aggggcgtat tgccccggtg 1380 gggcgtatta caccgagtta ccctattcca gtgcaaaaac tgtgggcgga agcatgcaaa 1440 gcgttcttgt cccgcgtttg gtaaaacgtg tcggaaatgt ggtcagccaa atcactttgc 1500 tgtgatgtgc aaatcgaaaa gtgttgtaca tgccgttcaa ttatcggaag aggatgatat 1560 tgattccgat gaagaatcag aagaactgtt cattgggtca gtggaaagtt cagaaagtgt 1620 gaatgattgg tttgaggtgg tgaaagttaa caagaagaag ttttcggtga agcttgacac 1680 aggtgcccat tgtaacgtag tgccgttgtg gttggtcaaa caactgggaa tcagtgtacg 1740 cccctctaaa acgaaatggt tagtttcgtt ttcaaatcat cgattgaaag tgcttggtga 1800 aataaaccct gtgtgcagaa tcaaaaataa gaacagcagc atcacattca aagtagttga 1860 agagtcggtt gtacctgtgc taggtaaggc aacctgtatt gcgcagcagt tgatagcgcg 1920 ggtggatacg gtgcaagtgt tggatgattc ggtgtatgat ggtcttggct gtttgaagga 1980 ctatgtttac gacatcgatc tcatcgagga cccacagtgg gaagtgaggc cggcacgtca 2040 cgtcccgcat agcatcagat cagctgtaaa aaaggagttg gattccatgg agcgtcttgg 2100 agtggttgaa aagattcatg cacctactcc ggtggtgaac gcaatggttc tggtcaaacg 2160 aaatgagaag ttacggatct gcatcgaccc atcacaggta aaccaaaatt tattaagacg 2220 tcaccatccc ctgaccacca tagaagagat agcgacccgg ttgaaaaatt cgaaatattt 2280 tacaattctt gattgtaaga aagggttttg gcagatcagg gtttcagaaa ggacgagcaa 2340 atttctggct ttcggtactc cctggggaag atacgtctgc aagcggttac cttttggact 2400 tgcctcagct ccagaggtct ttcaaaaagt tatgcacgac acgctagagg gaattgaagg 2460 agtagagttt tccatggatg acatcctgat ccatgctgac agctctgata agttgaacaa 2520 gattacgggc aaagttgtta aaagactgca ttcagctggg ctcaaattaa atagagataa 2580 atgcttgttc aatcagacta gtgtaaagtt tttggggcat attgtcactg atggcggttt 2640 gaaggcagat cccgacaaac tggaggccgt tattaaactt aagacaccat caaataggtc 2700 tgagctccaa agagcattag gaatgataac gtacatgggg aaattcgtac ctaatttgtc 2760 agacgtgaca gccccattga gaagtctgtt ggctaaggat gtggcttggg agtggggtta 2820 tgagcaagac aaatcatttc agaaaataaa agctcttatg caatctcctc caatactgaa 2880 gtattatgat ttgaatgcgc ctgtaacgct atcagtggat gcaagctcac acgcacttgg 2940 ggctgttttg ctacaacagg gtcgcccagt agcgtacgca tccagagcgt taacttcttc 3000 ggagatgaat tatccccaaa ttgaaaaaga agctaccgca atacggtatg cttgctcgaa 3060 gtttcaccaa tacatatatg gtaaaaggct aacaattgaa acaaaccaca aaccattaga 3120 gtcaattttt cgtaagccac ttgatcgagc tcccccacgt ctccgtagaa tcagattaga 3180 ggttgcccaa tataacccaa cagttgtgta cgtaaaagga gctaacattc ctatccctga 3240 tatcttaagc cgcgatgttg ccaatactgc aatagctgag gatgacgagc agttagaagt 3300 tcatattgta ttacagatga ccaaaagcgc aagcgcggag ttgaagcaac acaccgataa 3360 ggattacgaa atgcagtctc tgaagtccgt aatattgcaa ggttggccag aaactataaa 3420 tcagttgatt cccgaacttc gtaaatactg gaattttcga gacgagatgg cagtttatga 3480 tggactaatt ttcaagtcaa accaagtact gattccgcaa ccgttgagga aggctatgtt 3540 ggtgaagata catagtggtc atgcggaaat tcaaagttgc attcgaaggg ccaagcaggt 3600 gttgttctgg ataggcatgg gtaacgatat tcaggaaatg gtggaaactt gctcaatttg 3660 tcagcgtcat caacggagta atacaaaaag cactatcatt ctgaaggaca ttcctagact 3720 tccatttgaa agggttgcgt ccgacctgtt ccattttaag gggaaagagt acctccttat 3780 cgtcgacagt tattctggct atttcgacat aaagcagcta tccgaaacaa cgtctaagcg 3840 aatcatcaaa catttgaaag aatggttctc gatccacggt attcctcaag tcttggagtc 3900 ggacaatggc cctcaatacg cctcggctga gtttcgtcaa ttcagtcgtc agtggggatt 3960 cgaacatcaa acgtcaagtc ctcattttcc cagggcgaac ggattagctg aacgatatgt 4020 aaaggttgct aaatcaatgc tgaagtagtg cagcgaggac caatcagata tccatttggc 4080 cttacttcac atgcggaata caccacgtag tagctgcata ccatcaccta acgaaagact 4140 catgggtcga ttggtacgtt ccaacatgcc aatgacgttg gaagcactca ggccaagagc 4200 agctattgga acgtgaacga ggaatccaca aggattatgc agatagagga agcaaagaac 4260 caccaaaatt cgttggaagt gagaaagttc tcattcagga tcaagcgtcg caaagatgga 4320 cacatggaac cgtattgaaa ccacttgaag aaaaccgttc gtatctagtg accgatggcg 4380 agagaacact gaggaggaat gctcatcatt tgcgcaagct acgaggagga atggaagaaa 4440 gtgtgcaaga aaccgaacaa gaagcggcaa acgactccaa tgatactcga cgtgaatcat 4500 cagctgggac cagcatgaac cgaagtcaag ttttgttaag cgaaacatca agtggaaccg 4560 caccttcggc aacatgtgta actcggtctg ggcgtacagt gaagccaaag aggttcgatg 4620 aatttcaata ttattaagaa gggaaga 4647 // ID Transib-18_HM repbase; DNA; INV; 2999 BP. XX AC . XX DT 02-JAN-2009 (Rel. 14.02, Created) DT 02-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2999 RA Bao W. and Jurka J.; RT "Transib transposons from the hydra genome."; RL Repbase Reports 9(2), 458-458 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(720..2174,2202..2768) FT /product="Transib-18_HM_1p" FT /translation="MKISDVHNRIRHDQLNPSDKSFSLSVEAILVSMNLGS FT TGCAAFLKVYQKKWKDANRGNIKFESKNKNWLTQDFKSTYSNNISKPDNSS FT QLHKIGGRPSCSFKDASDKTKRRRLNDIINNFSSDELLHSARLKLRNEYNY FT EAAKQVAEISEPSSQFKKYTPDEGLALIIDGGMSKSTYQLMRNGAEERDCH FT IYPSYNDVRLSKAKCYPENIFVDDYSAHVPLQELLKHTVKRLCEVQAPVIS FT TMLDGLTRLTLRCKAGFDGATGQSMYKQISSEEAGDRNLKKEESLFITCLA FT PLELSGFCNNKKVLLWRNEKPSSTAYCRPIRFSFQNESKAVVIEEAKYLES FT EIKKCGVFNFTFNGKELVVKSIIELTMIDGKIQTILSHATNSTQCCSVCGV FT SPKNMNNLEMVLKLDNSNNLELKYGLSSLHAWIRFFEMVLHIGYKLETQKW FT QSRAIEDKENVMQVKKRIQTEFMNQMGLVVDFPKSGGSGKIIRLTITYIQF FT VGTSNDGNTARRAFANYQSTAKILKVDETLLFYFYIILVTLSSGFEIDTVK FT FKEFCITALKLYVSKYSWYYMAQSVHKILVHGHAIIARQYLPIGLMSEEAQ FT EACNKDFKKFREDFSRKTSRIDTNRDLVNRLLVSSDPVITSLRKCHKKNGK FT SLKKYPSEVLALLRESAPPSSFSL*" XX SQ Sequence 2999 BP; 1050 A; 434 C; 518 G; 997 T; 0 other; cacagtgggc caggtggtag acaaaacctc aaaaaaaaaa tttcgaaata cttccaacta 60 cttagtacat tatatttata agagcataat gtatggttta aaacattatt tgtagtttga 120 ttgtacatca taaatgtaca attattattt acaattgaat ataatgatta attaattttt 180 tttttttttt aatataccat aacaacataa ctataacaac aaataacttt taaattattt 240 aacatatgcg aaattaattg acatattatt aaaatttaat ttgatagctt ttcaaaaaat 300 accataacaa attccaaggt cttgtattaa agtcacagtt ttattttgaa atataaaggt 360 aagattcttg tttatagtat aaaaatatta tgttaaaagt ggctcatata gtttagaacc 420 tgctggaaca gcatattaat gttagaataa gttaatacaa tatatgtaca gcctgttggt 480 gctgttttat tctgctcata gctgcccctt taggtttggt aaacatttaa aaaatatata 540 ttgttgtaaa atgctattcg cgacaaacct attttttggt ttgttgtcat tattaggtgt 600 tgtgcttatt gtactggcct aattaaatat atctgaaata aaaataatgg ccatattcat 660 gttatattag tttatagtgg ttgcactaat aattttaaat taattttttt gtagttgata 720 tgaagataag tgatgtccat aaccgtatac ggcatgacca actaaatccg agtgataaat 780 cattttcttt gtctgtagaa gctattttag tctctatgaa tttgggttct actgggtgtg 840 ctgcattttt aaaagtttat caaaagaaat ggaaggatgc aaaccgagga aacataaagt 900 ttgagagtaa aaataaaaat tggttgactc aggactttaa atctacatac agtaataata 960 tatctaagcc tgataattct agtcaattac ataagattgg tggaagacca agttgttcct 1020 ttaaagatgc ttctgacaaa accaaaagga gaagacttaa tgatattata aataattttt 1080 ctagtgatga gttgctccac tcagcaaggc ttaaattaag aaatgagtat aattatgagg 1140 ctgccaagca agttgctgag atatcagaac cttctagtca atttaaaaag tatacacctg 1200 atgaaggtct ggcattaatt atagatggtg gcatgtcaaa atcaacctac caacttatga 1260 gaaacggagc tgaagaacgt gactgccata tatatccttc atacaacgac gtaagattgt 1320 ccaaagctaa gtgctatcca gagaacatct ttgttgatga ttattcagca catgtaccac 1380 tacaagaatt attaaaacat acagtaaaaa gactttgtga ggtgcaagct cctgtgatca 1440 gcacaatgct agatggtttg acccggttga ctttgcgttg caaggctggg tttgatgggg 1500 ctactgggca aagcatgtac aagcaaatta gtagtgagga agctggagat cgtaacttaa 1560 aaaaagaaga atctctgttc attacctgct tagcgccact agaactcagt gggttctgta 1620 ataacaaaaa agtactatta tggcgaaatg aaaagccatc atctactgca tattgtcgtc 1680 caataagatt ttcttttcaa aatgaaagca aagcagttgt tatagaagaa gcaaaatatt 1740 tagaatctga aatcaaaaaa tgtggtgttt ttaattttac ttttaatgga aaggaactgg 1800 ttgtcaaaag tattattgaa ctgaccatga ttgatggcaa aattcaaact attctgtctc 1860 atgcgactaa ctcaactcaa tgttgctctg tgtgtggtgt gtctccaaaa aacatgaata 1920 acttagaaat ggtgctgaaa ttggacaatt ctaataattt agaattgaaa tatggtcttt 1980 ctagcctgca tgcctggatt aggttttttg agatggtatt gcatattgga tataaactag 2040 aaactcaaaa gtggcagtct agagctatag aagataagga aaatgttatg caagtgaaga 2100 aacgtataca gacagaattt atgaaccaaa tgggactagt tgtggatttc ccaaaaagtg 2160 gaggatctgg taagtgaaat tacataaata cttttatata aatcatcagg ttaactatca 2220 catatataca atttgtaggt accagtaatg atggtaacac tgcccgacga gcctttgcaa 2280 actatcaatc aactgccaaa attttgaaag tagatgaaac tcttttattt tatttttaca 2340 tcatactagt cactttatca tctggatttg aaattgacac tgtcaaattt aaagagttct 2400 gtattaccgc tcttaaactt tacgtatcta agtattcctg gtattacatg gcacaatcag 2460 tacacaaaat tttagttcat ggccatgcca tcattgctag acagtacttg ccaattggac 2520 tgatgtcaga ggaggcccaa gaggcctgta acaaggattt taaaaagttt agggaagatt 2580 tcagtagaaa gacttccaga atcgacacaa atcgtgacct agttaacagg ctacttgtat 2640 ctagtgatcc tgtcataaca tcattaagaa agtgtcataa aaaaaatggt aagagtttaa 2700 aaaaataccc cagtgaagtt ttggctttgc tgagggaatc agctccacct tcttcttttt 2760 ctttgtagct aaacacaatt tgttgtttga tttttaaaaa aggttgatat tgaaaaaaat 2820 atttttttac tacctaagcc tattaaatgt cttttttttt attaaaaatt aaaatattct 2880 tgttaaaaat aaatattatt tcatatataa tgattattta tatatctaaa tgggtttgtg 2940 gaccaattgg gtaataatta ggttttgtcc accacttccc atatacctgg cccactgtg 2999 // ID Sola2-N2_AAe repbase; DNA; INV; 2373 BP. XX AC . XX DT 11-OCT-2010 (Rel. 15.1, Created) DT 11-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Nonautonomous; KW otherMITEs_Ele3a; Sola2-N2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2373 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2373 RA Kojima K.K. and Jurka J.; RT "Sola2-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [1] Named as otherMITEs_Ele3a. CC [2] Consensus update and characterization as a non-autonomous CC Sola2. ~98% identical to consensus. This consensus is ~95% CC identical to the original sequence in [1]. 4-bp TSDs. TIRs are CC ~780 bp long. Both terminal 800 bp are ~86% identical to those of CC Sola2-4_AAe. XX SQ Sequence 2373 BP; 768 A; 416 C; 429 G; 760 T; 0 other; tgagcaattc tcgctgaaac caggccgcca tcggcaccca tcgttagaat tccaatttta 60 tgtcactgat cgctagtttt cgataaaact taagggtggt cctttctgtt ttctcaaatt 120 ggtggacccc tcgtacgcca gctagctgaa cagtttgcga aaaagcccat tttttgataa 180 atcttgggta tttcttcacg ggatatgtct catatttcgc ttggaacaca tagcaaactt 240 agtggcatcg tatagagaaa ggatatagct ttcatttaaa ctcgaaaaaa ttttggcggc 300 cattttgaat ttggccgcca tcttgaattt tgttagaaaa atcgattttt caccattagc 360 gcaccgctag ttttgaattc tgagatcacc atcagaaagc tgaggaaaaa ttgcgtaaga 420 taggctacag aaactaggtg agcaatggta tttaccctat caaatgaacg attttctaaa 480 tcatgttata cgattttgac gtatatggcg agtgcaatca atacaaacat cattttttgt 540 acaacaaaga aacaagtttt caacctttgg tgttttattc gagtcgagta aagaataaga 600 atattatttt tagtgaaaaa atctggcggc catcttggat tttgacgcca tcttgatttt 660 aggtagtaga attaattttt caccttgata gcactcagca tgtcgaattt agaggctacc 720 aataaaaaaa ggttacgtat actcttcttc ttattattct tcattttact gacgttacgt 780 ccctactgga accgagcctg atactcagct ttattagtta taaatccagg ttattaaaca 840 ggaattttct tttttaatgc gatttctgac aacagaataa ttcttcgaca tatctttgaa 900 ttcagtaata atgaataaat aaattcaata aatgaaaacc gtcaaaaaac attgattgta 960 ttaaacaaac atttttttac cctatgcatt gttgttcatt gtatgaagtt tatagattgc 1020 gtgtcgggac attagtaagc cttgtggtaa tatctgtagt tctaacgtaa aacattgaca 1080 aaagtgcatt aatttataat gaaagtctta aattttgaag caaaaaacat ccttgatttt 1140 ctaagtcata aagtatcgtt tttagtaccg cccaacatgc atgtgaaaaa tagcgaaatt 1200 gaatgatgag aagcaggttt tgttctaatg tggacgtaat gccgaaaggc aggtgtataa 1260 tttcgaggtt agaaaaactg gcggccatct tggctttcga cgccatcttg gtttttagca 1320 gttgcatgat tttttaccat ctcagtgctc atcatgttga attataaggc ctccgttaaa 1380 aaaaaaacaa tcagattcta tctttcttat cttatctcag ctgttcaact gattgttcat 1440 ttgtatcaat ggttttagac caaataaatg aaagaaataa tgctatttta caactcgaag 1500 atatgcagaa gaatcattca ggtagcaaat aagcgttaaa aaaatcctat ttgataattt 1560 gtttttaaaa ctagttaagc tgaagggctg gctctgttcc agtagggacg taacgtgagg 1620 aaaaagaaga acaagaagaa gaataagtgt aacggtagca ttgaaattca acatgttgag 1680 tgctaacaag gtgaaaactc attctactac ttaaaaccaa gatggcgtca aaatccaaga 1740 tggccgccag attttttcac tcaaattaat actcctattc ttcatctgtc tcaaataata 1800 caccaacgat tgaaaacttg tttctttgtt gtacaaaaaa ttatgtttgc attgattgca 1860 ctcgccatat acgtcaaaat agtagaacat gatttagaaa atcgttcata tcatagggta 1920 aatacctttg ctcacctagt ttttgtagcc tatcttacgc aatttttcct cagctttctg 1980 atggtgatct cagaattcaa aactagcggt gcgctaatgg tgaaaaatcg atttttctaa 2040 caaaattcaa gatggcggcc aaattcaaaa tggccgccaa aatttttttc aagtttaaat 2100 gaaagctata tcctttctct atacgatgcc actaagtttg ctatgtgttc caagcgaaat 2160 atgagacata tcccgtgaag aaatacccaa gatttatcaa aaaatgggct ttttcgcaaa 2220 ctgttcagct agctggcgta cgaggggtcc accaatttga gaaaacagaa aggaccaccc 2280 ttaagtttta tcgaaaacta gcgatcagtg acataaaatt ggaattctaa cgatgggtgc 2340 cgatggcggc ctggtttcag cgagaattgc tca 2373 // ID Copia-118_AA-LTR repbase; DNA; INV; 219 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-118_AA_; KW Copia-118_AA-I; Copia-118_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-219 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 219 BP; 58 A; 58 C; 35 G; 68 T; 0 other; tgaaaaacga agcaatcgtg aaccctatct caacaaccct gctacgtccc tgacaaccgt 60 atcctggagt aggttggacc tagcaactaa gggttgttac acttttttct tattcctgtt 120 tcaaccacca atcgcatcag acgtgtgttt caataaacgt tattgttcct gttacattcg 180 tatcttgttc ttcgcctcca aagttctgca acatcaaca 219 // ID BEL-73_AA-I repbase; DNA; INV; 5242 BP. XX AC supercont1.277; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-73_AA_; KW BEL-73_AA-LTR; BEL-73_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5242 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.277; Positions 1014332 1019573. XX CC Positions [4273-4860] - Integrase core CC 'CCAGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 19..5214 FT /product="BEL-73_AA-I_1p" FT /translation="MSPSMTLEQLEIRRSAVDGKLKRTNVRVNKLETAKVS FT VDDLNAELENLHVLWQDFREVHQAILDLCPDVQAIEPHLDMEAALEKRYLT FT LKATITQFIRVTANRDNPPNAGGSSSGPLATPETALVPPELHLPKGILPTF FT SGDYGEWNSFHDLFISSVHNNPRLTNAQRLFYLKTYTTGRAAALLKYIKVE FT DNAYAGALDILKKRFDRKDQIVNHHIQRFCEIPNTSVPSVAGLRKLHETAD FT DVKRALQAIQREDRDCWLIYLLLAKVDSESRQQWSDKIATKQAPPTLEEFL FT EFLEQRTYSLETAHAPAPKSPAKYTGRPQPRSSSFLSTNEDLKTPSKCSVC FT NEAHHKLYYCKRFQEIPVAERIRVVNQLGLCKNCLCKSHGRNSCSAGDCRK FT CKQRHHTLLHEGPSIDSIPATGSKLSFGVINQLSYNDCFTSVFLATAVVTL FT TDSNGQQHAARALLDSASQANFITANLSKRLGLKPHRINLPLKGISGLATR FT ITEAVKIDMKSRVSDTEIRIDCAVLPKIADPIPHQLADISDWELHPSRPLA FT DERFNIPGNVDLLIGASVFYNLLRAKRISLGPTKPVLQETALGWVVAGFYE FT STNATKASPLCFVSQVEPEAEASLSQLVAKFWEIEDFKPTKYLTKEEKFCE FT DHYVRTTVRDPSGRYVVKLPFIRDPATIGDTGYSVLSQFTTMQRKLQRDPP FT KYVLYHQYMDAFIAANHISRIEPDPNHSKIIYLAHHGVVKSESTTTKLRVV FT FNAAQKSSNGLSLNNLLATGPVVQDTLYNILINFRLHAIVLTGDIEKMYLQ FT IRVTEEDSKLIRLLWQEPGQPLAEYCVNTVTFGMSCAPFLATRTLNQLAED FT EESKFPLAKEAIKDFYVDDALTGAATKEEAIEKRQQLTDMLKRGGFTIRKW FT ASNDPDVLAEVPPEDRAVNPVQAVDSQDTIKTLGIQWQYREDLFTFKATCD FT PLKEYFTKREVLSTIAKIYDPLGLVGPVVVIAKMMMQELWKMKIDWSEPLA FT NQFNHRWRTYLTHLKDIWLVKVPRRCISVQNPVQYHLHGFCDASLEVYGAA FT VYLRSVDESGTVSSHLLGSKSRVAPINRPPLQRLELCAAALLADLIKAIKS FT TIRIPIHQTLGYSDSTTALAWISGDPARWKVFVANRVALINSIIPASDWRH FT VPTKKNPADLLTHGAAPCVLAQSELWWSGPSWHPSVMAESHPRPTITEEEI FT IDSEVRRNPPILSYLLTTSDVIEDLLHKHSRYLKLVRVVAWIIRFGHNCEM FT ERSRRTVGPLTVEELIGARNMLIRYTQHQAYGAEITALESGLPLPDHSKLL FT PLSPFIDSCLVRVGGRLARSNLPYNAKHPLIIPANSRLATIVFEHEHRRNH FT HLGPTSLLSAVRATYWVPQGRQLARNTVWHCVPCHKHNPNRGMLHQIMGQL FT PAHRLNQAPPFYICGVDYAGPITLVERRTRGTPSTKGYIAIFVCFVTKAVH FT IEAVSKLSTRAFLAAMRRFVARRGHCGHMYSDNGTNFVGAVNEMRHWYTKI FT STSDHNHRVADFLAMGGTQWHFNPPGSPHMGGLWEAAVKSAKHHFNIVAHN FT ARLTFEEFTTLLVEIEGILNSRPISPASSDPNDLQPLTPGHFLIGRPLTSS FT NEQDYNPSPDEPYDVRFRYIQELRQHFWDRWSKEYVPELQTKGKWHRATNL FT LHENDLVLIKQAKLPVNQWLIGRIIKVHQAPDGNPRLASVKTSSGEIVQSV FT HNLSKLPIQSGN" XX SQ Sequence 5242 BP; 1426 A; 1468 C; 1274 G; 1074 T; 0 other; tttggcgctg tagacaggat gtctccttct atgactttgg aacagttgga aattcgacga 60 tccgcggtcg acggaaaact aaagcggacc aacgttcgag tgaacaaact cgaaactgcg 120 aaagtgtcgg ttgacgattt gaatgccgag ctagaaaacc tacacgtgtt gtggcaagat 180 tttcgtgaag tgcaccaagc cattctggac ctctgcccgg atgtccaagc tatcgagcct 240 catttggaca tggaagctgc cctggagaag cgttacctca cgctgaaggc cacaattacc 300 cagttcattc gagtgacagc caatcgggac aatccaccta atgccggtgg atcgagttcg 360 ggtccattgg caactccgga aactgcgttg gttccccccg agctccacct tccgaaggga 420 attctcccca ccttttcggg agactacggg gaatggaact cattccacga tttgttcatc 480 agctcagtgc ataacaaccc acggttgacg aatgcacaac gtctattcta tctgaaaacg 540 tatacgacgg gaagagccgc tgctctgctc aagtacatca aggtcgagga caatgcttac 600 gctggtgccc tagacatact gaagaagcgc ttcgatcgga aggaccaaat cgtcaatcac 660 catattcaac ggttctgtga gattccgaac acgagtgttc catcggtagc gggtctacgc 720 aagctccacg aaacggccga cgatgtcaag cgggcactac aggccattca acgcgaggat 780 cgagactgtt ggttaatcta cctcctgctg gccaaggtag attccgaatc cagacaacag 840 tggtccgaca agatcgctac aaagcaagct ccacctacgt tagaggaatt cttggagttc 900 ctcgaacaac ggacatattc tctggagacc gcccacgctc cagctccgaa atcccccgca 960 aagtataccg gtagaccgca accgcggagt tctagcttcc tgagcaccaa cgaagacctg 1020 aaaactccgt caaagtgttc cgtctgcaac gaagctcatc acaagctgta ctactgtaag 1080 cgattccaag aaattcccgt cgcggagcga atccgggtgg taaaccagct cggcctgtgc 1140 aagaattgcc tctgcaagtc tcacggaaga aactcctgtt ccgcgggcga ctgccgcaag 1200 tgcaagcaac gtcaccacac gctactccac gaaggtccgt caatcgattc tattccagct 1260 acgggcagta aactctcgtt cggagtcatc aaccaattgt cctacaatga ctgtttcaca 1320 tccgtattcc tcgccaccgc cgtcgttact ctcaccgatt ccaatgggca gcaacacgcc 1380 gcccgtgcat tattggattc cgcgtcgcag gcaaacttca tcacagccaa tctctcgaaa 1440 aggctgggac tgaaaccgca ccggatcaat ctgccgctca aaggaatctc aggactggcg 1500 acgaggatca ccgaagcggt caaaatcgac atgaagtcac gagtttctga taccgagatc 1560 cgaatcgact gtgcagtact accaaaaatc gccgatccca tcccgcacca gttagcagac 1620 atctccgact gggaacttca ccccagcagg ccgctggcgg acgagcggtt caatattcct 1680 ggaaacgtgg atctgctcat cggggccagc gtcttctaca acctgttgcg tgccaaacgc 1740 atctcgctgg gtccaaccaa gccagttctc caagaaaccg ctttaggctg ggtagtagcc 1800 ggattctacg agtcaaccaa cgccacgaag gcgtctccat tgtgctttgt ctctcaagtc 1860 gaaccagaag ctgaagcatc cctgagtcag ctcgtcgcca agttctggga gatcgaggat 1920 ttcaagccga ctaaatacct tacgaaggag gagaagtttt gcgaggatca ctatgttaga 1980 accacggtcc gcgatcccag cgggcgatac gtggtgaaac ttccgttcat tcgagaccca 2040 gcgacgatcg gcgacaccgg ctactctgta ttgtcccagt tcaccacgat gcagcgaaaa 2100 ttacaacgcg atccaccaaa gtacgtactt taccatcagt acatggacgc gttcatcgct 2160 gcaaaccaca tctcgagaat tgaaccggac ccgaatcatt ccaagatcat ctacctggca 2220 catcatggag tcgtcaagtc cgaaagcact acgacgaagc ttagagtagt cttcaacgca 2280 gcccagaagt catcaaatgg cttgtcgctc aacaatctgc tggccacggg gccagttgtc 2340 caggatacgc tctacaacat cctcatcaat ttccgtctcc acgctatcgt ccttactgga 2400 gacatcgaaa aaatgtacct gcaaattcga gtcacagaag aagacagcaa gctcatccga 2460 ttgctgtggc aagaacccgg gcaaccgctg gcggagtact gtgtcaatac ggtcacgttc 2520 ggaatgtcgt gtgcaccgtt tctcgccacg aggaccttga atcagctagc ggaagacgaa 2580 gaaagcaagt ttccactggc taaggaagcg atcaaggact tctacgtcga cgatgcacta 2640 acaggcgcgg ctacgaagga agaagcgatc gaaaagagac agcagctcac cgacatgctg 2700 aagcgcggag gtttcaccat ccgaaaatgg gcgtcgaatg atccagacgt actggccgaa 2760 gtccccccag aggatcgagc ggtgaacccc gtacaggctg tggacagtca agacaccatc 2820 aaaactctcg gcatccagtg gcagtatcga gaagatcttt tcacgttcaa ggctacatgc 2880 gaccccctaa aggagtactt cacgaagcga gaggtactat ccacgatcgc caaaatctac 2940 gacccgctag gactggttgg cccggtagtc gtcatagcca aaatgatgat gcaagagctg 3000 tggaagatga agatcgactg gtccgaacca ctggcgaacc aattcaacca tcgttggcga 3060 acctacctaa ctcatcttaa ggacatctgg ctggtcaagg tacctcgacg gtgtataagc 3120 gttcagaacc cagtgcaata ccacctacat ggattctgcg acgcatcgct tgaggtttat 3180 ggcgcggccg tctacctacg atcagttgac gagagcggca cggtcagttc ccatctcctc 3240 gggtccaagt cacgagtcgc acccatcaac cggccgccat tgcagcgttt ggagctctgt 3300 gcagcagcat tgctggccga cctcatcaag gcgattaaat ctaccatccg catccccatt 3360 catcaaacgc tgggctacag tgactcaaca acggccctgg cgtggatttc cggcgatcca 3420 gcgcgatgga aagtgttcgt cgccaaccgc gtagcactca tcaactccat cattccagcg 3480 tccgactgga gacacgtccc cacgaagaaa aatccagcag accttctgac gcacggagca 3540 gcaccctgcg tattagctca atcggagctg tggtggtctg gaccgagttg gcatcccagc 3600 gtgatggcag agtcacaccc aaggccaacg atcaccgagg aagaaatcat cgacagcgaa 3660 gtccggagaa atccaccaat cttatcatac ctcctaacaa cgagcgacgt catagaggat 3720 ctactacaca agcactcaag gtatttgaag ctcgtcagag tggtggcgtg gatcattcgt 3780 ttcggtcaca attgtgaaat ggaacgtagc cgccgaacgg taggaccact aaccgtcgag 3840 gagctgattg gggctagaaa catgctgatt cgctatacgc aacatcaagc ctacggagcc 3900 gaaattactg cactcgaaag cggtcttcca ttacccgatc acagcaagct tctaccgctg 3960 agtcccttca tcgacagctg tctcgtcagg gtgggcggcc ggcttgctcg atcaaatttg 4020 ccgtacaatg cgaagcatcc gctgatcatc ccagcgaaca gcagactggc aacaatcgtg 4080 ttcgagcacg agcataggcg gaaccatcat ctgggaccga cgtcactact ttccgcagtc 4140 agagcgactt actgggtacc gcaaggaaga cagctagcga ggaacaccgt ctggcattgc 4200 gttccatgcc ataagcacaa cccaaatcgt ggaatgctgc accaaataat gggacaactt 4260 ccggcgcatc ggctgaacca ggccccaccg ttctacatct gcggagtgga ttatgcggga 4320 ccaataacgt tggtggagag acgcacccgc ggaacaccat cgaccaaagg ctatatcgca 4380 atcttcgtgt gcttcgtgac aaaagcggtg cacatcgagg cggtgtcaaa gctttccaca 4440 cgagcattct tggcggccat gcgaagattt gtggccagaa gaggacattg tggtcatatg 4500 tatagtgaca atggcaccaa cttcgtaggt gccgtcaacg aaatgcgaca ttggtacacc 4560 aagataagca cctccgatca caatcaccga gtggcagatt ttttggcaat gggcggaaca 4620 cagtggcatt tcaatccacc tggatccccc catatgggag gactatggga agcggccgtg 4680 aaatccgcca aacaccattt caacatcgta gcgcataacg ctcgtctcac gttcgaggaa 4740 ttcactactc tgctcgttga aatagaagga attcttaatt cacgacccat ctcaccagct 4800 tcgagtgacc caaacgatct gcagccctta acaccgggac actttcttat tggacgaccg 4860 ttaacatcca gcaacgagca agattataac ccaagcccag acgaaccata tgacgttcga 4920 ttccgttaca tccaagaact gcgacagcac ttttgggacc gctggagtaa ggagtacgtc 4980 cccgaactgc aaaccaaggg taagtggcat cgggcgacca acctgttaca tgaaaacgat 5040 ttagtactga taaagcaagc aaaactacca gtaaaccagt ggctaatcgg aagaattatt 5100 aaggttcacc aagcaccaga tggaaaccca cgattagcat cagtgaaaac atcgagcgga 5160 gaaattgtac agtcggtaca caacttaagt aaattgccaa tccaatctgg aaattaaaaa 5220 cattcgtttt tggcgggagg ca 5242 // ID CR1-14B_CQ repbase; DNA; INV; 4709 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-14_CQ; KW CR1-14B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4709 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 16-16 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >97% CC identity. CC The consensus is ~77% identical to that of CR1-14_CQ. XX FH Key Location/Qualifiers FT CDS 25..1515 FT /product="CR1-14B_CQ_1p" FT /translation="MRVDVPVDQTLDSNNSVPTVSDAEAALGAAALAVERA FT AAARTTNSSIVQAAQLDGTTPPIAPRNAVATSAAAAQSATAHPDAVHPAAS FT HAAPAAHSTTALLPPTDPGVQQAVLPHSQVSQQLLITPQAPPQPQTMPSSQ FT PLVVPPQSMLPSQPLVSAQPPPQTMPLLQPLVLPSPQRQAPPPPLITQPLQ FT TAPQALTHAASTSSSSSTATNRIVPDARFSVESYQQSKAPEVNFIPFKTSF FT IVSEGNRINVHGPSLTQCQQLLDSNISEPNCQIHEQNCTPHTTQATNYCET FT TNTISNLQHPSLLVAPPEQSTSDDSAELKWFYVSRFLPSETCDNLIRYIQN FT KTXCDSARIICQKLVRKNRNSARPLTFLSFKLSVPESIESLIVATDFWPEG FT VTIKPFLVKRQAPELIGPSSHNPQPLNNLVSPRSSRTTTPRRRPAPKMIPN FT QPQHTARLYPYNLPLPFYHPSQLAVPHMNQYLPLYGQMPEAVHHRQQWSTM FT V" FT CDS 1500..4478 FT /product="CR1-14B_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MVNNGLRSAISPAPPPTDPRTTNQAALHREEATNLLL FT YYQNVGGMNTTIANYYLAISSASYDFYAFTETWLSTTTLSSQIFGTEYEVF FT RCDRSAANSCKDSGGGVLLAVRSSLKPRQLIPPNCTCPEQVWVSVPLASST FT LFVCVVYIPPKFDNDATLFEQHRRSLTWILSKTKVNDSIMIVGDFNFPGIR FT WTRTPTNKLLPNLALTPSNKLKHDLLDEYSTANLSQLNDLCNSSNNVLDLC FT FASSXVPVNCALLPAPSPLVKDVRHHLPFLVSISCTVFTFSDTNGSTFIDY FT RNGNYEGMNEFLATVNWTRLMANLDANEAAVTWTEILTQAINVFIPKKQRQ FT PPQHPPWSTHRLKKLKTVKRAALKKYAKHPTDRWKNHYRAKNRKYSLLNNQ FT LFRRHLNRIQSRLKREPKKFWNHVNEQRKETGLPTSMVLDGDEATTTESIC FT DLFRRQFCSVFNNETVADSQVARAASNVPLRPPIGPHPVISSESVRRACAR FT LKSSNSCGPDGIPAVVLKKCCDALAEPLAQLFNTSLSTGVFPCFWKKSFVF FT PVHKKGPKRDVRNYRGIAALCAVSKLFEVIVLDFIKFNCCDYIAQEQHGFM FT AKRSTSSNLVTYSSFILRTMQKRKQIDAIYTDLSAAFDKLNHRIAVAKLER FT LGFSGSLLEWLRSYLTGREMSVKVGDVISAIFAVFSGVPQGSHLGPLIFLL FT YMNDVHPLLKCHKLSYADDIKLFTVIEETGDCQFLQEQLNRFANWCSDNRM FT VLNATKCSAITFTRKRNKISFDYTLSNTTIPRTSCVKDLGVMLDSKMTFTD FT HITYMVSKASKTLGFVFRIAKHFRDLGCLKALYCSLVRSTLEYCSTVWAPF FT YQNAIQRVESVQRKFVKYAQRHITWPDPLNPPSYAERCKMLNLDPLSVRRD FT VAKAVFVADLLQSSIDCPAVLQLININTRRRVLRNHSFLTVRRALTNYGHN FT EPVSSMCRIFNLCSDHFDFDLSRDKIKIRFLNFLKSLP" XX SQ Sequence 4709 BP; 1170 A; 1381 C; 965 G; 1189 T; 4 other; tctttacctc agccagacaa aataatgaga gttgacgttc cggttgacca aacgctcgat 60 tcaaataatt cagtccctac cgtttcagac gctgaggctg ctctgggagc ggctgccttg 120 gctgttgaac gcgcagctgc cgctcgtacc accaacagct ccatcgttca agccgcccag 180 cttgacggga ccactccacc gattgcccct cggaatgcag tcgccacctc cgccgccgcc 240 gctcaatcag cgaccgctca cccagacgcc gttcatccag ctgcgtctca cgcggcccct 300 gctgctcact cgaccaccgc attactgcct ccaactgatc ctggggtaca acaggcggtt 360 ttgccgcatt cgcaggtctc gcagcagctg ctaatcacgc ctcaggcccc gcctcaaccg 420 caaaccatgc cctcgtccca gcctctggtt gtgccaccgc agtccatgct tccctcgcag 480 cctctggtct cagcgcagcc gccaccgcaa accatgccct tgctccagcc tctggttttg 540 ccgtcgccgc aaagacaggc cccgcctccg cctctgatta cgcagccgct gcagactgcg 600 cctcaggctt taacccacgc cgcatcaaca tcttcctcgt catctaccgc taccaaccgc 660 attgttcccg acgcacgatt ttctgtggaa agctaccagc aatcaaaagc tcctgaagtt 720 aacttcattc cattcaaaac tagttttatt gttagcgaag ggaataggat taatgttcat 780 ggacccagtc ttacgcaatg tcaacagctg ttagactcaa acatctcgga acccaattgc 840 cagattcacg aacaaaactg tacacctcac actacacaag ctacaaatta ttgtgaaacc 900 accaacacca tcagcaacct acaacatcct tcacttttgg tagcccctcc agaacagtcc 960 acctctgacg attcggctga gctaaagtgg ttttacgtct cccggttcct tcctagcgag 1020 acttgtgata atctgatacg ctacatacaa aacaaaacga amtgcgattc cgcccgcatc 1080 atttgtcaaa agttagttcg caaaaatcgt aactcagcca ggcccctcac gtttctctcg 1140 ttcaaactaa gcgttccgga atcaattgag agtttgattg tcgctacaga cttttggcca 1200 gaaggtgtta caatcaagcc ttttttagtg aagcgacaag ctcccgagct cattggtcct 1260 tcctcacaca acccgcaacc actgaacaac ctcgtttcgc cacgcagttc acgaacaacg 1320 actcctcgtc gccggcccgc tccgaaaatg attcccaatc agcctcagca cacagcacgt 1380 ctgtacccat acaatctacc gctaccgttt tatcatccca gtcagttggc ggtgccccat 1440 atgaaccagt atcttccgct gtacggtcag atgccagaag ctgtccacca caggcagcaa 1500 tggtcaacaa tggtttaagg tctgccattt cacccgctcc accaccaacc gaccctcgta 1560 cgactaacca agctgcgctg catcgtgaag aagcgacaaa tcttctgctc tactatcaaa 1620 atgtaggggg tatgaacact accatcgcaa actactatct cgctatctct tccgcttctt 1680 acgactttta tgcatttacc gaaacctggc tgtcaacaac taccctctcg agtcaaatct 1740 tcggtaccga atacgaggtt ttccgctgcg atcgttctgc cgcgaacagc tgcaaagatt 1800 caggcggagg tgtccttctt gccgtccgct ccagtctcaa gccacgccaa ctgatcccac 1860 cgaattgcac atgtccagaa caggtgtggg tttcagtccc tctcgcctca tccacattgt 1920 ttgtgtgtgt tgtttatatt ccaccaaaat ttgacaacga tgcgactctc tttgagcagc 1980 acagacgatc tttgacgtgg attttatcca agacgaaagt caatgacagc ataatgatcg 2040 tcggcgattt caactttccg ggtatccgtt ggacacgcac cccgacgaac aaattgctac 2100 caaatcttgc tctcaccccc tcgaacaagc ttaaacacga tctcctggac gaatattcga 2160 ctgcgaacct gagccagtta aacgacctgt gcaacagctc caacaacgtg ctcgaccttt 2220 gctttgccag ctcgaawgta cctgtcaatt gtgccctcct cccagcacct tcacctctcg 2280 tgaaagacgt tcgtcaccac ttgcccttcc tggtttctat ctcgtgtaca gtgttcacct 2340 ttagtgacac gaacggtagt accttcatag actaccgcaa tggtaactac gaaggtatga 2400 acgaattctt ggcaaccgtt aactggactc gactgatggc caatctcgac gccaacgaag 2460 ctgctgttac ttggacggaa attttgacgc aagctatcaa cgtcttcatc ccgaagaaac 2520 agcggcaacc tcctcaacat ccaccatggt ctactcaccg cctgaagaag ctkaagaccg 2580 tgaaacgtgc cgcactcaag aaatatgcca agcatccgac agatcgttgg aaaaaccact 2640 acagagctaa gaaccggaag tacagtttgt taaacaacca actttttcgc cgccacctga 2700 accgcatcca gagccgcttg aaacgtgaac ccaaaaagtt ttggaaccac gtcaacgagc 2760 agcggaaaga aactggtctc ccaacctcga tggtactcga cggtgatgag gctacaacca 2820 ccgagagtat ttgtgacctt ttccggcgcc agttctgcag tgtcttcaat aacgaaactg 2880 tagcagactc gcaagttgct agggctgcta gtaacgttcc actgcgacct cccatcggac 2940 ctcatccggt gataagctcc gagtccgttc gccgtgcctg cgcccgcctg aagagttcca 3000 acagctgtgg accggatggc atccccgcgg ttgtgctcaa aaagtgttgc gatgcactcg 3060 cggaaccgct ggctcaactc ttcaacacct cgctttccac cggagttttc ccatgtttct 3120 ggaagaagtc gtttgtgttc cctgttcaca aaaaggggcc taaacgtgac gtccggaact 3180 accgcggaat cgctgccctc tgcgctgtaa gcaagctctt cgaagtcatt gtgctagact 3240 tcatcaagtt caactgttgt gattatatcg cccaggaaca gcatggcttt atggcgaaac 3300 gttctaccag ttccaatctg gtcacctatt cgtccttcat cctacgaacc atgcagaaac 3360 ggaaacaaat tgacgccatc tacacggatc tatcggcagc attcgacaag ctaaaccacc 3420 gtattgctgt tgctaaactc gagcgactgg gcttcagcgg ttctctgctc gaatggcttc 3480 gctcctacct caccggacga gaaatgagcg tgaaagtggg cgacgtaatt tctgccatat 3540 ttgctgtttt ttcaggcgtc ccacaaggca gccacctggg tcccctgatt tttctcctct 3600 atatgaacga cgtgcaccct ctgcttaaat gtcacaagct gtcttacgcg gacgacataa 3660 agcttttcac tgttatcgag gagaccggag actgccagtt tcttcaagag cagctcaacc 3720 ggtttgccaa ctggtgctcc gataacagaa tggttctwaa tgcaactaag tgttcagcaa 3780 tcactttcac ccgcaaacgc aacaaaattt cctttgacta tactctttca aacaccacca 3840 tacctcggac gtcctgtgtg aaagatttag gcgtgatgct ggatagcaaa atgacgttta 3900 ctgaccatat tacgtatatg gtttccaaag cttccaaaac attaggattc gtctttagga 3960 tagctaagca ttttcgagat ttaggctgtc tcaaagctct ttattgctcg ctggttcgtt 4020 ctacgctcga gtactgttcc acggtttggg ctcccttcta ccagaacgcc attcaacgcg 4080 tggagtcggt gcagcgaaag ttcgtcaaat atgcccaacg tcacatcacc tggcctgatc 4140 ctttaaatcc tccgagctat gctgagcgtt gcaaaatgct aaaccttgat cctctctcgg 4200 taagacgtga tgttgcaaag gcagtgtttg tagctgacct cctgcagtcg tccattgatt 4260 gtcccgctgt tttgcaactg atcaatatta acacccgccg gcgtgtactt cgcaaccact 4320 cttttttgac ggttcgccga gctctgacta attacggaca taacgaacca gtttctagta 4380 tgtgtcgtat ttttaacttg tgctctgacc atttcgactt tgacctgtcc cgcgacaaaa 4440 tcaaaatccg tttccttaat ttcctcaagt cccttccctg acaatacaca cgtagatatt 4500 agaacactgt gatatttttg ttaatttatt gagttagttt taaagtgaac ccgtcttgta 4560 tcatttgagt tttgtgtact tgttgatgcg aaaagacgag gtggttttgt gcctctttga 4620 gagagtgtct tggaatatgc tagacacatc tcaagggggc ttttgtccac ctcctaataa 4680 agaaaatgaa aatgaaaaaa tgaaaatga 4709 // ID BEL-31_CQ-I repbase; DNA; INV; 5772 BP. XX AC AAWU01011368; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-31_CQ_; KW BEL-31_CQ-LTR; BEL-31_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5772 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 215-215 (2011). XX DR GenBank; AAWU01011368; Positions 2310 8081. XX CC Positions [4829-5377] - Integrase core CC 'GGCTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 41..5772 FT /product="BEL-31_CQ-I_1p" FT /translation="MADENPDFTETPCSTCKEVSTEDELMVGCDSCKLWFH FT ARCVNLNPADKKSGKKWYCPDKDCQKKKKTKKSVEVPLPLPPELQEKLKAL FT EETRKRMEAEQEMERILKEKELEMAVYLQERQMKIDEELREREFKQREANL FT EAEMQKKEEHLKRIQKLETTFRDKSASIDKLLQNSSTPLRNPEVKPLTTEN FT LGKHDNGENSGGTPSKPASKKTSSRSSSGSSTTSSGSGKGEADTKGDGNDE FT KSVKPGKAEDSGEKPDGLGQHGSGPTKAQRAARQGLTRKLPDFHGKPEEWP FT LFFAAYQASNEACGYTDVENLVRLQDCLKGRALEEVRGQLILPKSVPRVIA FT KLRQIYGRPEVLLQSHLQRVRKLEPPRADKLGSFIPFGNAVEQLCEHIEAA FT ELTAHLVNPILIQELVDKLPDGEKRSWVHYKRKKREVNLRTLTNFLSKIVE FT DACEATVNLDYKPDVRAASGPSGRSRNKEKAALFNHSVAEECTERREQKPQ FT KLCRVCQRMDHRLWTCEDFKKLPYEDRAKMATKWKLCQRCLNEHGGQCKLK FT LRCNVGECREPHNPLLHPETRAVGMNAHITSTSPVLFRMLPVKVYCGERSM FT TVLAFLDEGSSLTLIESGLADRLGLLGTKEKLTIKWTANITRVEQHSRRTN FT IWASAINGTNGEKLLLQSVRTVDKLMLPRQTLNAAEVSSHYDHLRGLPMDS FT YDGRPGMLIGLNNIHSFAPLETKVGTIVDPVAVRCKLGWTVYGPRDAESAG FT ASCFLGCHQEVTNEELHDLLKSHYSLEEAVVTVQRESVEDQRARKIMEETT FT RRVGDRFETGLLWKTDDVKFPNSYPMAVKRMKQLEKKLERSPELYDNVKKQ FT IEDYQAKGYAHEATAEELNGTNNNKAFYLPLNVVVNPKKPGKVRLVWDAAA FT TVDGVSLNSMLMKGPDLLVPLVSVICGFRERRVAFGGDIREMYHQLKIISG FT DKQAQRFLFRDNIKEKPKVYVMDVATFGSKSSPASAQYVKNRNAEEHAGQY FT PDAVAAIINRHYVDDYFDSVDTVEEAVKRAKQVSHVHKKGGFEIRNWVSNS FT PEVLTALGEEKPMSPVLLNQDKQTPSERVLGVRWDPELDEFAFAVMHREEL FT MKYLKEGKRPTKRIVLSCVMGFFDPLGLLSPFTIHGKIIIQHLWRTGCRWD FT QDIDDEAWILWKRWTALLPEVEALRFPRSYFGDAMSTSVESLELHIFSDAS FT ELAYGCAAYLRAVIDGQVHCSLVMARSKVAPLKRQSIPRLELMAACMGARM FT SQQILGTHTLQIDRTVFWTDSRTVLAWLHADPYKYKQFVAFRVGEILELTR FT LDDWRWVPSKKNIADVLTKWGPGPPLQNDSEWRNGPAILFEQDDDSPPEQI FT EETNEDARGLVLFHGTVNVEAVSTWTKLVRVTATAVRFIENCRRRKDGRPV FT LTAKATSRLAKLVKAQHETEVQPLQQDELQKAERILWKQAQFESFPDEMSV FT LTKNLQLKQGELPEKIERSSPLYKLLPVLDEDGVLRVRGRLEKNESIPFDK FT RFPIILGRKHAVTKKIIQHYHEKYGHANRETVFNELRQKFWIPNARAAILE FT VVKECVWCKVNRCVPFVPTMAPLPVERTAATMRPFSAVGVDYLGPVEVTVG FT RRREKRWIAVFTCLAVRAVHLEVVHSLTTQSCLMALRRFACKRGVSEKVFS FT DNATCFRGADAAITKEINKECAEQMTSPTTSWSFIPPGTPHMGGAWERMVR FT SVKEALRALDDGRKLTDEILSTSLAEAEDMINTRPLTYVPQDSAEEEAITP FT NHFLRGTVTSADLKVDETVDTAAALRDVYKRSQQLAGKMWERWSKEYLPSL FT NRRPKWFEDRKPLEAGDLVFVVDGKNRKSWVRGRVEEVFKGSDGRVRQADV FT RSADGKVNRRAVANLAVLEIMDCKSDVLECPTDVTGWG" XX SQ Sequence 5772 BP; 1449 A; 1488 C; 1837 G; 998 T; 0 other; atcttcaaaa ttaattgtgg gttcggtaat cctctccacg atggccgacg aaaatccgga 60 cttcaccgag actccctgct caacctgtaa agaggtgtcg accgaggatg agctgatggt 120 tggctgcgat agctgcaaac tgtggttcca cgctcggtgc gtcaacctca atccggctga 180 caagaagtcg gggaagaagt ggtactgtcc ggacaaggac tgccaaaaga agaagaagac 240 taagaagtct gtggaagtac cgctcccatt gccgcccgag ttgcaagaaa agttgaaagc 300 tttggaagaa actcggaagc gcatggaagc ggaacaagag atggagcgaa tcctgaagga 360 gaaggagctc gaaatggctg tctaccttca ggagcgccag atgaagatcg acgaggaact 420 ccgagagagg gagttcaagc agcgagaggc gaacctagag gcggagatgc agaagaagga 480 ggaacacctg aagcggatcc agaagctgga gactacattt cgggacaaaa gcgcctccat 540 cgacaagctg ttgcaaaaca gcagcactcc gctgcgaaac ccagaagtca aaccattgac 600 aacggaaaac ctgggaaaac acgacaatgg cgaaaactct gggggcacac cgagcaaacc 660 ggcttccaag aagacgtcca gccgctcttc gtcggggtcc agcactacca gttctggcag 720 tggcaaaggc gaggccgata cgaaaggtga tggaaacgac gagaagtcgg tcaagccggg 780 aaaagctgag gattctggag agaagccaga cgggctgggg cagcacggtt cgggtccaac 840 gaaggcccag cgagctgcaa gacaagggct gacaaggaag ctgccagact tccacggcaa 900 gccagaagag tggccattgt tctttgcggc ctaccaagcg tcgaacgaag cctgcgggta 960 cacggacgtg gagaatctcg tgcgcctaca agactgcctg aaaggcagag cgctcgagga 1020 ggtgcgtggc cagctcattc tgcccaagtc ggttccgcgt gtgatcgcca agcttcgcca 1080 gatctacggt cgcccagaag ttctgctgca aagccacctg caacgagtgc gcaagctgga 1140 gccgccgaga gcggacaagt tgggatcgtt catcccgttt ggcaatgcag tggaacaatt 1200 gtgcgagcac atcgaggccg cagaactgac ggcgcacctg gtcaatccga tcttgatcca 1260 ggaactcgtc gacaagcttc cggacgggga gaagaggagt tgggtgcact acaagcgcaa 1320 gaagcgcgaa gtaaatctgc gcacgctcac caactttctc tcaaagatcg tcgaggatgc 1380 gtgcgaggcg acggtgaacc tggactacaa gccggacgtc agagcggcgt cgggaccgag 1440 cggccgaagc cgaaacaagg aaaaagccgc actgttcaac cacagcgtag ccgaggaatg 1500 tacagaacgg cgagagcaga agccacaaaa gctgtgcagg gtgtgccagc gcatggatca 1560 ccgactgtgg acatgcgagg acttcaagaa gctgccatat gaggatcgag cgaagatggc 1620 gacgaagtgg aaattatgcc aacgctgcct gaatgaacac ggcggtcaat gcaagctcaa 1680 acttcgctgc aacgtcggag agtgccgcga gccacacaat cctctgcttc acccggagac 1740 gagggcagtt ggcatgaacg cacacattac gtccacgagt cccgtgctgt tccggatgct 1800 cccggtaaaa gtctactgcg gtgagagatc aatgaccgtg ctcgctttcc ttgacgaggg 1860 ttcgtccctc accctgatcg aaagtggtct tgccgaccgt ttggggctat tggggacgaa 1920 ggagaagctc accatcaagt ggacggccaa catcacgcga gtggaacaac attcgaggcg 1980 gacaaatatt tgggcgtcgg cgatcaacgg gacgaatggc gagaagctgc tgctgcaatc 2040 tgtccgcacg gtggacaagc tgatgttgcc acgccaaaca cttaatgctg ctgaagtttc 2100 gtcacactac gatcacttgc gcgggcttcc gatggattcg tacgacggtc ggccaggaat 2160 gctgatcggg ttgaataaca ttcattcgtt cgcaccgctg gaaacgaagg tcggcacgat 2220 tgtcgatccg gtggcagtcc ggtgcaagct cggctggacg gtctacggac cgcgggatgc 2280 agagtcggcg ggcgcgagct gtttcctcgg atgccaccaa gaagtgacga acgaggagct 2340 gcacgatctt ctcaaaagcc actattcgct ggaagaagcg gtggtgactg tgcagcggga 2400 gtcggtcgag gatcagcggg cgaggaagat catggaggaa actacacgtc gcgtgggtga 2460 ccgcttcgag accggactgc tgtggaagac cgacgacgtc aagttcccga acagctatcc 2520 gatggctgta aagcggatga agcagctgga gaagaaactt gagcgttcgc cagagctgta 2580 cgacaacgtc aagaagcaga ttgaagacta ccaggccaag ggatacgcgc acgaagccac 2640 cgcggaggaa ctgaacggaa caaacaacaa caaagcgttc tacctgccgc tcaacgtggt 2700 ggtgaacccg aagaagccgg ggaaagttcg gctggtctgg gatgcggccg cgacggtaga 2760 tggagtctct ctcaattcga tgctgatgaa gggaccagat ctgctcgtgc cactggtgtc 2820 ggtgatctgt gggttccgcg aacgacgggt tgctttcggc ggggacatcc gtgagatgta 2880 ccaccagctg aagattatct ccggggacaa gcaagctcaa cgcttcctct tccgggacaa 2940 catcaaagaa aagccgaaag tgtacgtgat ggacgtcgct acattcgggt cgaagagttc 3000 accggcgtcg gcgcagtacg tgaagaatcg caatgcggaa gagcatgctg gtcagtatcc 3060 ggacgctgtg gcagcgatca tcaatcggca ctacgtcgat gactactttg acagcgtcga 3120 caccgttgag gaagctgtca agcgagcgaa gcaggtgagc cacgtccaca aaaaaggtgg 3180 cttcgagatc cgcaactggg tgtccaactc gccggaagtg ctgactgctc tgggcgaaga 3240 gaagccgatg agtcctgtgc tgctgaacca ggacaagcaa acacccagcg agcgcgtact 3300 tggcgtgcgg tgggacccgg agctggacga attcgccttc gccgtgatgc accgcgaaga 3360 actgatgaag tacctgaaag aggggaaaag accaactaaa cggatcgtac tgagctgcgt 3420 gatgggattt ttcgatccgc tcggcctgct gtcaccattc acaatccacg ggaagatcat 3480 catccaacac ttgtggcgga ccggctgcag atgggaccaa gacatcgacg acgaagcctg 3540 gattctgtgg aagcggtgga cggcactgct gccggaagtc gaagcacttc ggtttccccg 3600 gagctacttc ggcgatgcga tgtccacgtc ggtcgagagt ctcgagctgc acatcttcag 3660 tgacgcgagc gagctggcgt acggttgtgc ggcgtatcta cgcgcggtca tcgacggtca 3720 agttcactgc agtctcgtca tggcgcgatc gaaggtggca ccgctgaaac gacaatcgat 3780 tccacgactg gagttgatgg cggcgtgcat gggagcgcgc atgagccagc agatcctcgg 3840 aactcacacg ctgcagatcg accgaaccgt tttctggacc gactcgagga ccgtccttgc 3900 gtggctgcac gcagatccct acaagtacaa gcagttcgtc gcctttcggg tcggggagat 3960 tctggagctg acgagactgg acgattggag atgggtgcct tccaagaaga acatcgcgga 4020 cgttctcaca aagtggggac ctggaccgcc actgcagaac gattcggaat ggcggaacgg 4080 tccggcgata ctgttcgagc aggatgacga ctcaccacct gagcagatcg aggagacgaa 4140 cgaagacgca cgaggcttgg tactgttcca cgggacagtc aacgtagaag cggtctcgac 4200 gtggacaaag ctggtgaggg tgactgcaac ggcggtacgc ttcatcgaaa attgtcgtcg 4260 cagaaaagac ggccgtccgg tactgactgc aaaggccacg agtcgcctag ccaagctggt 4320 taaagcgcag cacgagacgg aggtccaacc gctgcaacaa gacgagctgc agaaggccga 4380 gaggattctg tggaagcaag cgcagttcga gagcttcccc gacgagatga gcgtgctgac 4440 gaagaaccta caactgaagc agggtgagtt gccagagaag atcgagcggt caagtccact 4500 ctacaagctg ctgccggtgc tcgacgagga tggggttctg cgcgttcgtg ggaggctgga 4560 gaagaacgaa tcgatcccgt tcgacaagcg attcccgatc atcctgggtc gaaaacacgc 4620 cgtcaccaag aagatcatcc agcactacca cgagaagtac ggtcacgcga atcgggagac 4680 cgtgttcaac gagctgcgcc agaagttctg gattccaaac gcacgggctg cgatcctgga 4740 ggtcgtaaag gagtgtgtgt ggtgcaaggt gaaccgttgc gtgccgttcg tcccaacaat 4800 ggcgccgctt ccggtcgaac gcactgcagc aaccatgcga ccgttcagcg cagtaggcgt 4860 ggattacttg ggaccggtcg aagtcacggt ggggcgccgc agagaaaaac gctggatcgc 4920 agtcttcacg tgcttagcgg tcagggcggt gcatctcgaa gtggtccata gcctcaccac 4980 gcaatcgtgc ctgatggccc tccggaggtt cgcgtgcaaa cgcggcgttt cagagaaagt 5040 gttctccgac aacgctacgt gctttcgggg agcagacgcg gcgataacga aggagatcaa 5100 caaggagtgc gcggagcaga tgacgtcacc gaccacttca tggtccttta tcccacctgg 5160 gacgccccac atgggcggcg cctgggaacg gatggtgcga tcggtgaagg aggcgttgcg 5220 cgcacttgac gacggacgaa agctgaccga cgaaattctc tcgacatctt tggccgaggc 5280 cgaagacatg atcaacactc ggccgctgac gtacgtccct caagattccg ctgaagaaga 5340 agccatcacg ccaaaccact tcctgcgcgg gacggtaacg agtgcggacc tcaaagtgga 5400 cgagacggtg gacaccgctg ctgcactgcg agatgtctac aagcggagtc agcagctggc 5460 gggcaagatg tgggagcgat ggtcgaagga gtacctgccc tcgctgaacc ggcgtccaaa 5520 gtggttcgag gatcggaagc ccctcgaggc gggtgacttg gtgttcgtgg tcgatggcaa 5580 gaaccggaag agctgggtac gaggtcgagt ggaggaggtc ttcaaggggt cggacgggcg 5640 agttcggcaa gcggacgtga ggagtgcgga cggcaaggtc aacaggagag ccgtggcgaa 5700 cttagcggtg ttggagataa tggactgtaa atccgacgtg ctggaatgcc cgacggatgt 5760 tacgggctgg gg 5772 // ID TcVIPER repbase; DNA; INV; 4390 BP. XX AC . XX DT 17-JUN-2010 (Rel. 15.06, Created) DT 17-JUN-2010 (Rel. 15.06, Last updated, Version 2) XX DE TcVIPER - a family of tyrosine recombinase-encoding DE retrotransposons. XX KW DIRS; LTR Retrotransposon; Transposable Element; TcVIPER. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-4390 RA Lorenzi H.A., Robledo G. and Levin M.; RT "The VIPER elements of trypanosomes constitute a novel group of RT tyrosine recombinase-encoding retrotransposons."; RL Molecular and Biochemical Parasitology 145(2), 184-194 (2006). XX RN [2] RP 1-4390 RA Kojima K.K. and Jurka J.; RT "TcVIPER consensus sequence."; RL Direct Submission to Repbase Update (07-JUN-2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 169..1506 FT /product="TcVIPER_1p" FT /translation="MALVPYTPSEWCDIGARLWDMVCQRGETQVQMTFIDG FT NQMSDETLTVAASSTGLCLFHGPISKAWQDSTLIGHIANLRAPQIPNFETQ FT ALLPAMGAARELGHQQGLITAERERLNIDYANAVADVQNRNAQLNSQVEEA FT HINLSVRESNIAGAGNQVLELRQQLEADRAQLREREFNLGXLTGQLQEREA FT RLNALQTARDAETAGRDESPATRHVANDGVTTLPHQTDILMAGGVPSSAAR FT RSSAAYRPPHGNAGRPHFHPSTPLPSRTSPLEAQLPMEEDRSFCAFDVTTW FT PPRVWNEGASGVVVDLRCAYQCWQQEDEQVKLAFDNLVEWIGALEXGQDPT FT DRFMNLGRSLLNTFRMQLTIASDPGIRLSKLRARLYTAVHQTDAYARAAQL FT FVDRRETQRTLRCQRCHIYGHEASTCNVRPRGNYSQYRRPSKNGRGAAARR FT S" FT CDS 1386..4211 FT /product="TcVIPER_3p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MPTVPYLWTRGIYMQRAPQRELLTIQAPIKKRQRGCR FT TPLMNAGDGSPTTSHQSSQAAHRTADAPPDLSNCPTQSSGRAMELIRKRRS FT TVVRRITDIPALGQLYEESERGRSRDVGAVNVGAENVPSGAIHDLLPHART FT ADERGELCGLFDGNRXGAIDPTAVRQDAAVHVGNESNPAGHGDSGVTKNCG FT AIGDEASAPLDKGRDESSNPQPDRLEGTCCLTTCVDNGESLVRDCGLDXQK FT FHTGGGRDLNFGLVRGAEDGEGGSPPRLAVREDTRAGRVRHYKAMQDTSRK FT RKAHEYYXCPSRTSSGSLECHGAFHKTRCVEARCSNRGGVQFGSTRDLAVG FT EARRPVRPSPEYCSIFGEVHHNADPGVVAGRIDVKGETAEGQEPWLRAEER FT FFHDFNHNHAATITRRATRTRLATPQDEHAIPLQQVNVPVLNLEWIKSRLN FT PATLERLMQVWGLVGRSPFPPSSSGKAREGSRRIPTADARLLRKAGIIEDA FT SSTITGGWIIPFSVVEEKTTGLRRRWIAWPRDKNRDDPYEANVPLLHISHY FT LPPVMAEAASCLDLKASFFQVSLPRETRHLFRCRVEDGTLVELTRLPMGYK FT ASPEILQIITSAIAGVTTVVHPLWAAPPLVRVDVWIDNIRIAGSKSDATLW FT EAQVLRNADSRHATMGEDRESGATQYTFLGVQFDHTHRAVSLSDKFVRSVR FT AMPALNSLTIAEMEVMASRFLYAAAILGTRLCDHYFFIKAVRRRLSALNRG FT IVLETSPANLPPXAVGLGERLRHIIENNRKRIIKPTEKASAAIITDASLHG FT WGAVFIPDSGDVKIAGGKWERKPFLIMQAEARAVRLALSAFSAILPSTMDV FT WVDNTSLQGAANKGSSKSHAMTWELQRIYEFLDSRGIQATFAYVRSAENPA FT DGISRGRVFTLQDLAKGWNLRRGAAGSCGWRTPKSATS" FT CDS 1510..2514 FT /product="TcVIPER_2p" FT /note="tyrosine recombinase." FT /translation="MLATAAPPPPTNPHRQRTAPPTLHPIFRIARRNRREG FT RWSLSVNDARLWYDESPTSPLWASYMRRANEVAREMWAPSTWEQRMSLAGQ FT FTTFCRTHEQPMNEESCAAFLMAIXVAPSTRLQYARMLRSMLEMNRTPLDM FT VILGLQKIAARSETKQARPLTKEEMNQVIRSRTDWKERVVLRLAWITASRW FT SEIAALTXKNFTLEADGTLILDWSVAPKTARADPHRASRFVRIRGQDAFDT FT IKLCRTLQENEKLTNITXAQVERALAPWNATAHSIKRGALRHAAQIVEAYN FT LDPHVISQLAKHVDPFDLPQNTVRYLGRYTTMLTQVSSLVALM" XX SQ Sequence 4390 BP; 1029 A; 1184 C; 1235 G; 934 T; 8 other; taattaatgt atgtgtatat cctgataaat gaatgcattc tttatgatac tttctaccgt 60 atgaatcttt tgggaagaac gcgactttgt aggggcaggg agccgataga ggccggataa 120 tatttgtttt tattttttac agacacacag ccaggtaagt ttaaggaaat ggctttggtt 180 ccctacacac cctccgagtg gtgtgatatt ggcgcccgtc tctgggacat ggtgtgccaa 240 cgtggcgaga cccaggtgca gatgacgttc attgatggta accagatgtc ggacgagacg 300 ttaaccgtgg ccgcatcatc gacaggctta tgcttgttcc atggacctat ctctaaagca 360 tggcaggact ctactcttat tggacatatt gcgaatctcc gggcgcctca gatccctaat 420 ttcgagacac aagcgctcct gcccgctatg ggggcagccc gggagttggg gcaccaacaa 480 gggctcatca ctgctgaaag ggagcgcctg aatattgatt acgcaaacgc cgtcgcggac 540 gtgcagaatc ggaatgctca gcttaactcg caggttgagg aggcgcatat caacctatcg 600 gtgagggaat ccaatatcgc cggggccggc aaccaggtcc tagaactacg acagcagttg 660 gaggccgatc gcgcccaatt acgggagcgc gaattcaact tgggagkcct cactggtcaa 720 ctccaggaac gggaggcacg ccttaatgca ctacaaaccg cgcgggatgc ggagacggcg 780 gggcgggacg agtcccccgc tactcgacac gttgccaacg atggcgtaac tacactaccg 840 catcaaacgg atatcttgat ggccggaggc gtgcctagca gcgccgcccg acgcagttcg 900 gcggcttaca gaccccctca cggcaatgct gggagaccac atttccaccc ctccacgccg 960 ttgccttccc gcacttctcc cctggaagcc caattaccta tggaagagga ccgcagtttc 1020 tgcgcattcg acgtcaccac gtggcccccg cgtgtatgga atgagggcgc ctccggagtc 1080 gttgtcgacc tccggtgcgc ataccaatgc tggcaacagg aggacgagca ggttaaactt 1140 gcctttgaca atttggtcga gtggatcggc gcgctggaag kcgggcagga tcccaccgat 1200 cgcttcatga atttggggag gtccctactg aatacattcc ggatgcagct cacgattgca 1260 tccgatcccg gcatccggct ctccaaactg cgcgcccgtc tttacaccgc tgttcaccag 1320 acggacgctt atgcgagggc agcccagctt ttcgtcgatc ggcgagaaac ccaacgcact 1380 ttgcgatgcc aacggtgcca tatctatgga cacgaggcat ctacatgcaa cgtgcgcccc 1440 agagggaatt actcacaata caggcgccca tcaaaaaacg gcagaggggc tgccgcacgc 1500 cgctcatgaa tgctggcgac ggcagcccca ccacctccca ccaatcctca caggcagcgc 1560 accgcaccgc cgacgctcca cccgatcttt cgaattgccc gacgcaatcg tcgggaaggg 1620 cgatggagct tatccgtaaa cgacgctcga ctgtggtacg acgaatcacc gacatccccg 1680 ctttgggcca gttatatgag gagagcgaac gaggtcgctc gagagatgtg ggcgccgtca 1740 acgtgggagc agagaatgtc cctagcgggg caattcacga ccttttgccg cacgcacgaa 1800 cagccgatga acgaggagag ctgtgcggcc tttttgatgg caatcgawgt ggcgccatcg 1860 acccgactgc agtacgccag gatgctgcgg tccatgttgg aaatgaatcg aaccccgctg 1920 gacatggtga ttctggggtt acaaaaaatt gcggcgcgat cggagacgaa gcaagcgcgc 1980 cccttgacaa aggaagagat gaatcaagta atccgcagcc ggaccgattg gaaggaacgt 2040 gttgtcttac gacttgcgtg gataacggcg agtcgttggt ccgagattgc ggccttgaca 2100 sccaaaaatt tcacactgga ggcggacggg accttaattt tggattggtc cgtggcgccg 2160 aagacggcga gggcggatcc ccaccgcgcc tcgcggttcg tgaggatacg agggcaggac 2220 gcgttcgaca ctataaagct atgcaggaca cttcaagaaa acgaaaagct cacgaatatt 2280 acgamtgccc aagtcgaacg agctctggct ccttggaatg ccacggcgca ttccataaaa 2340 cgcggtgcgt tgaggcacgc tgctcaaatc gtggaggcgt acaatttgga tccacacgtg 2400 atctcgcagt tggcgaagca cgtcgacccg ttcgaccttc cccagaatac tgttcgatat 2460 ttggggaggt acaccacaat gctgacccag gtgtcgtcgc tggtcgcatt gatgtgaagg 2520 gagaaactgc ggaagggcag gaaccctggt tgagagcgga ggagcgtttt ttccatgatt 2580 ttaatcacaa tcatgccgct acaataacga gacgcgcgac aaggacgcgt ctggccactc 2640 cacaagatga acacgctatc ccgctacagc aagtcaacgt gcccgtattg aacctagagt 2700 ggatcaagag tcggttgaac ccggccacct tggagaggct catgcaagtt tggggactcg 2760 tcgggcggtc ccctttccct ccgtcttcct ccggtaaagc gcgcgaaggc tcacggagga 2820 ttccaaccgc tgacgcgcga ttgttacgaa aagcaggaat watcgaggac gcttcgtcca 2880 caataacagg cggatggata atacctttct cggttgtgga ggagaaaacc accggtttac 2940 gacgacgatg gatcgcgtgg ccacgcgaca agaacagaga cgacccttac gaggctaatg 3000 ttcctctttt acacatttcc cattatttgc cgcctgtgat ggctgaggcc gcttcctgcc 3060 tggatttaaa ggcatccttt tttcaagtct ctttaccgcg ggagactcgg catctctttc 3120 gatgccgcgt ggaggacggc acgctggtgg agctcacacg gctcccaatg ggatacaagg 3180 ccagcccaga aattctccag attattactt cggcaattgc gggggtgacg acggtggttc 3240 accccctctg ggccgcacct ccattggtgc gtgtcgacgt ttggatcgat aatatacgca 3300 ttgcagggtc gaaaagcgat gcgacattgt gggaagccca agtgcttcgt aatgcggaca 3360 gccgtcacgc cacgatgggg gaggaccgcg aatcgggcgc cacgcagtac accttcctgg 3420 gggtgcagtt tgatcacact caccgggcgg tatccctgag tgacaagttt gtccggtctg 3480 tgcgcgccat gccggcgctc aattctttaa ccatcgcgga aatggaggtt atggcgtcac 3540 gctttttgta cgcggctgcc attttgggca cgcgtttatg cgaccactac tttttcatca 3600 aggcagtgcg acgacgatta tcsgcactta accgggggat tgtgctggag acatccccgg 3660 cgaaccttcc gccgkcagcg gttggtttgg gcgagagatt gcgacacatc atcgagaaca 3720 atcgtaagcg aatcatcaag cccacggaga aggcatcggc cgccatcatc acggatgcat 3780 cgctccatgg atggggagcc gtttttattc cagactccgg cgacgttaaa attgccggag 3840 ggaaatggga gaggaagcct tttcttatca tgcaggccga ggcacgcgcg gtacgcttag 3900 ccttatcggc cttttccgcc atcttgccat ccaccatgga cgtttgggtg gacaacactt 3960 cgctgcaagg agcggcgaat aaaggcagct cgaaatcgca cgccatgacg tgggagctgc 4020 aacggatata tgagtttttg gactctcgcg gaatacaggc aacttttgcc tacgtgcggt 4080 ctgcagaaaa ccccgcagac ggcatatcac gcggtcgtgt ttttacactt caggacttgg 4140 cgaaggggtg gaacttgcga aggggagcgg cggggtcttg tggttggagg accccaaagt 4200 ctgccacttc gtaagtaata atatttcaaa tcctaactga ggacaaagga ccatgctaat 4260 ggtccacaga attctatata ttgaatgaaa ataaaacatt gaagcgatta actaacgaac 4320 cttttttcct aacttgtttg gtttccatag ataatttcag gatccggcca gctgcccgga 4380 agattattct 4390 // ID Gypsy-113_AA-I repbase; DNA; INV; 5063 BP. XX AC AAGE02027370; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-113_AA_; KW Gypsy-113_AA-LTR; Gypsy-113_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5063 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027370; Positions 98851 93789. XX CC Positions [2217-2753] - Reverse transcriptase CC Positions [3852-4319] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 657..4850 FT /product="Gypsy-113_AA-I_1p" FT /translation="MAQPSMATVIEPFRRGVSFADWVDRLEFFFQANGIGD FT DLKKAHFITLGGSVVYKEIKLLYPNMNVSEISYADLIARLKTRLDKTESDL FT IQRFKFNNRIQQPDESLEDFVLSVKLQAEFCSFGTFKDMAIRDRIVAGVRE FT VALQQRLLNEENLTLAITEKIITTWEMAGANARSLGVAGNNLQTPGKTFQK FT LLTTYAIANQNVEQVASVRGPVKNRLGRRPYDNYDNGRTNGRRIDNKWGES FT SHSPEWKRNQIKKRPNYSGFTCDFCGAKGHIKRKCFRLKNLRRDAVNLVES FT YKPGPTADSHLNEMLNRMTTHDTDSEEDDDNDAGEFSCMLVSSINKVSEPC FT LLDLHIEGQLITMEVDCGSAVTVMSKGQYSSIINKPLRQCRKQLVVVNGSK FT LQIEGEATVIVRYREVEAKLKILVLNCTHNFIPLLGRTWLDNFFPNWRQNF FT LNPSKINSVTESHQGDQIEEIKQRFSNVFKKNFSMPIIGFEAELVLKDNTP FT IFKKAYDVPYRLRDKVLNYLDKLEKENVITPVKTSEWASPVVVVMKKNNEI FT RLVIDCKVSINQVIIPNTYPLPTAQDLFANLAGSKYFCALDLEGAYTQLAL FT SRRSKKFMVINTIKGLYTYNRLPQGASSSASIFQQIMEQVLRGIVGVYVYL FT DDVLIAGKDLNDCKRKLFLVLERLVEANIKVNWEKCKFFVKELIFLGHIIG FT EKGLSPCVDKISTIQNASVPKNVTELKSFLGLINYYNKFIPHLSSRLYCLY FT NLLKKDVKFVWSEQCQGAFERSKAALLETDFLEFYDPLKPIVVVSDASGYG FT LGGVIAHIVDDIEKPISFTSFSLTDAQKSYPILHLEALALVCTVKKFHKYL FT YGKKFSVFTDHKPLVGIFGKAGKNSIFVTRLQRYILELSIYDFDIYYRPSA FT KMGNADFCSRFPLKQNIPKKYDVDRVCSINFGQEFPMDFAVIAKSTLDDGF FT LQKVISFMKNGWPKRLERQFNDVFSNQHELEIFGDCLLYKDRVVIPQQLQI FT DVLKILHASHLGIVKMKQQARRHVYWFGINSDIEKFVARCEVCSSMAIVPK FT AKILSKWIPTTRPFSRIHVDFFHFNHRTFLLIVDSFSKWIEVEWMKNGTEC FT SQVLLKLVELFSRFGLPDVLVSDGGPPFNSYGFKHFLEKQGIIVLKSPPYH FT PPSNGQAERSVKTVKEVLKKFLLEPDLAELKLEDQINLFLFSYRNTALSDG FT VYPSEKLFSYRPKTLIDLLNPKKDLKCFLSTPQANDEPVSSNETKPLRRID FT QLDTLTEGDEVWYKNHNPNVSSRWLKAIYLKRFSINTFQIMIGNARIMAHR FT DQLKARGSRDSSAPMRSNVLVPARVTEELPMGERRTEGDHGQLPVESYRSS FT RKKKRDAQAAGLPDEQLRRSKRIKIVKRDNDFVYE" XX SQ Sequence 5063 BP; 1615 A; 848 C; 1096 G; 1504 T; 0 other; taaaagttgg cgacgagaaa aagtgactta gtgaatttaa agtggctgaa gttatttgga 60 atcgtacagt gaaactaatt aattgcgaaa ctagttcgag tgcatgcagg actgtattgg 120 ttcagtgatt tcgcaccatt ttggtttaaa acgtgttctt ttgtacacca attgtagccg 180 cgaataatag ctatgtgaac acacctattg ttcgcatcag aacccatata aattcctttc 240 ttaaacgaca tatttgaaga ctgattggct tccaacaaat tattgtgtgg tattaagaga 300 gtttcctttc tttcctacta tataggtttt ttaggcttgc atttccttcg tttgatgctt 360 ggggactttt agaccatcaa ccgacagatt agtgacttga acctcgctgg tttttctgcg 420 tgcatatcac catcatcgag ggtgcagcga ccggtacctg tcgtagattt tggtacggat 480 acgtccggga agcggtgagc cgataacgtt gcatcaagaa gcgtccaata ggtttcaagg 540 tcggccaagg tcgaagtgtg agctctggtg agtttttgaa tttcttattc atcctttgtt 600 acggttcaat accttttgtt tgatacgtat tgtgggatat atagtatcca aagaaaatgg 660 cccaaccaag catggctact gtaatagaac catttcgcag gggagtttcc tttgcggatt 720 gggtcgatcg tttagaattt ttctttcaag caaatggtat tggcgatgac ctaaagaaag 780 ctcacttcat cactctagga ggatctgtag tgtataaaga aattaaacta ttatacccaa 840 atatgaatgt gagtgaaatt tcttatgcag atctgattgc caggctaaaa actagattag 900 ataaaacaga atctgatttg atccaacgat ttaaattcaa taatcgtata cagcaaccag 960 atgagtcgct ggaagatttt gttttatccg tgaaattaca agcggaattt tgttcatttg 1020 gtaccttcaa agatatggca attagagacc gcatagtggc tggggtcagg gaagtggcct 1080 tgcaacaaag attgcttaat gaagagaatt tgactttagc tatcacggaa aaaatcataa 1140 ctacctggga aatggccggc gcgaatgcca ggagcttagg tgtggctggt aataatttgc 1200 agactcctgg aaaaactttc cagaaattat tgacgacgta cgctattgca aatcaaaatg 1260 tagaacaagt tgcttcagtc agagggcccg tgaaaaatag attaggacgg agaccttatg 1320 ataattacga caatggtcgc actaatggca gaagaattga caataagtgg ggggaaagct 1380 ctcattcacc agaatggaaa cggaaccaaa ttaaaaagcg tccaaattat tcgggtttca 1440 cttgcgattt ttgcggtgcc aaagggcata taaaacgaaa atgttttagg ttgaaaaacc 1500 ttaggcgtga cgcggttaat ctcgtggaaa gctacaagcc aggccccact gcggacagtc 1560 atttgaacga aatgctgaac cgcatgacga cacacgacac agatagtgag gaggacgacg 1620 ataatgatgc aggtgaattt tcgtgtatgt tagtgtcctc aatcaacaaa gttagtgagc 1680 cttgtttatt agatttacat attgaagggc aattgattac gatggaggtg gactgtggtt 1740 cagcagtcac tgttatgagc aaaggccaat attcttccat tattaataaa cctctgagac 1800 agtgtagaaa acagctagtt gtcgttaatg gttcaaagtt acaaattgag ggggaagcaa 1860 ctgtgatagt tcgctataga gaagtagagg ctaaattgaa aattttagtt ctaaactgta 1920 ctcataattt cattccccta ttaggacgaa catggttaga caattttttt ccgaattgga 1980 gacaaaactt tttgaatcca tcaaagataa acagtgttac tgagtctcac caaggtgatc 2040 aaattgaaga aattaaacaa aggtttagca atgtttttaa aaagaatttt tccatgccta 2100 ttattggttt tgaggcagag cttgtgctta aagataatac accaattttc aaaaaagcct 2160 atgacgttcc ctacaggctg agggataaag ttttaaacta tttagacaag ttagaaaagg 2220 aaaatgttat aacgccggta aaaacaagcg agtgggcttc tcctgtagtt gtggtgatga 2280 aaaagaacaa cgaaataaga cttgttatag attgtaaggt atcgattaat caagttatta 2340 ttccaaatac gtatccgctt cctacggctc aggatttatt tgcaaacctg gcagggagca 2400 agtatttttg cgctcttgac ctggaaggag cttatacaca attagctttg tctagaaggt 2460 caaagaaatt tatggtaata aacacaatta aaggtttata tacgtacaat agactgccac 2520 aaggggcttc gtctagtgca tcaatctttc agcaaattat ggaacaagtt ttgagaggaa 2580 tcgttggcgt atatgtgtat ttagatgatg tgttaattgc aggtaaagac ttgaacgact 2640 gcaagagaaa gctctttttg gtattggaaa ggctcgttga ggcaaacatt aaagtaaact 2700 gggaaaagtg caaatttttt gtgaaggaat tgatattctt gggacatatc ataggagaga 2760 agggtttgtc tccatgtgtt gataaaattt caacaattca aaatgcctca gtgcctaaaa 2820 atgtaacgga attaaagtca tttttggggt tgatcaacta ttataacaag tttattccac 2880 atttgtcatc cagattgtat tgcttgtaca atttactgaa aaaagatgtg aaatttgttt 2940 ggagtgaaca atgccaaggg gcttttgaac gaagcaaagc agcactttta gaaactgatt 3000 ttcttgaatt ttatgaccca ttgaaaccaa tagtggttgt atctgatgcc tcaggatacg 3060 gtctaggagg ggtgatagct catatagttg acgatataga aaaacccatt tcatttactt 3120 ctttttctct gaccgatgca caaaaatcct atcctattct ccatttggaa gctttggctc 3180 tggtttgtac cgtaaaaaaa ttccataaat atttgtacgg aaaaaaattt tcggttttta 3240 ccgatcacaa gcctttggta ggcatttttg gcaaagcggg taaaaactcg atatttgtta 3300 ctagattgca acggtatatt ttagaactct caatttatga tttcgatatt tactacagac 3360 cttcagcaaa aatgggaaac gcggactttt gttccagatt tcctttgaaa caaaatatac 3420 ctaagaagta tgatgtggac cgcgtttgta gcattaattt tggacaagag ttcccaatgg 3480 acttcgctgt cattgcaaag agcacattag acgatggttt tcttcaaaaa gtaatttctt 3540 ttatgaaaaa tggatggcca aaacgcttgg aaaggcaatt caatgacgtt ttctcaaatc 3600 aacatgagct agaaattttt ggtgattgct tgctatacaa agacagggtg gttatacctc 3660 aacagcttca aattgatgtg cttaaaatac tacatgcaag tcatttggga attgttaaaa 3720 tgaaacagca ggctagacga catgtttatt ggtttggaat caactccgac attgagaaat 3780 ttgtagcaag atgcgaagtt tgcagtagca tggcaatagt accgaaagct aagatattgt 3840 ctaaatggat accaacgacc agaccattca gccgcattca cgtcgatttc ttccatttca 3900 atcatcgcac attcttacta atagtagaca gtttttcaaa atggatagaa gtggagtgga 3960 tgaagaatgg aactgaatgc tctcaagtat tgctaaaact ggtagaattg ttttcaaggt 4020 ttggccttcc agacgttttg gtctctgacg ggggtcctcc cttcaattca tatggtttca 4080 agcatttttt ggagaaacag ggaataattg ttctcaaaag tccgccatat catccgccaa 4140 gcaatggaca agcggaaaga tcagttaaga ctgttaaaga agtactcaaa aaatttttgt 4200 tggaaccaga tttggcggaa ttgaagttgg aggaccagat taatttattt ttattcagtt 4260 atcgcaacac agctttgagt gacggtgttt acccctcaga gaagctattt agttatagac 4320 caaagacctt gattgatctc ttaaatccta aaaaggacct gaaatgtttc ctatccacac 4380 ctcaggctaa tgatgaacca gtttctagta atgaaaccaa accgttgaga cggattgatc 4440 agttggatac attaactgaa ggagatgagg tgtggtataa gaaccacaat ccgaatgtat 4500 cctcaagatg gttaaaagca atatacttaa aaagattttc aatcaatact ttccagatta 4560 tgattggaaa cgctcgaatc atggcgcatc gcgaccagct caaggctaga ggctctagag 4620 attcatcagc accaatgcga tccaatgttc ttgttccggc acgagtgaca gaagaattgc 4680 caatgggcga acgacgaacg gaaggggacc acgggcaatt acctgttgaa tcctaccgaa 4740 gtagcagaaa aaagaaacgt gatgctcagg ccgcgggact accggatgaa caactacgca 4800 ggtcgaagcg aatcaagata gtgaaaagag ataatgattt tgtttatgaa taagtcgatt 4860 gagccttcag ggaaggtgtt atgaatcgca aattataatt tcagatgatc tgatatcgaa 4920 tagcataaaa tgaaatcgaa gttaattatc ctatattttg tattgcatat tgtattgatt 4980 atttttgaca caagtcaatt cgaaatgtaa tttcgatact agtttgaaaa tctgattatt 5040 gtttctcttc aaggggggag aac 5063 // ID BEL-155_AA-I repbase; DNA; INV; 5958 BP. XX AC AAGE02018864; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-155_AA_; KW BEL-155_AA-LTR; BEL-155_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5958 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018864; Positions 20387 14430. XX CC 'GTACC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2105..3826,3830..5446) FT /product="BEL-155_AA-I_1p" FT /translation="MKIDMIVVPQLIDDQPSIHLEARDVQMPANVELTDPT FT FYKRRCVEIILGARVLFQILGPRQMRCDGGPNLQESALGWLVGGLASLRTP FT RTATMAIATVSADEQTVPDDEENAYEHLDTLFKRFWALEEVTSAEATSTSN FT KVNKCEEHFLQTTTIGADGKYVVRLPFKDTLVNLGNSYEHAKRRLFSLERR FT LARTPELYDQYREFLNEYLDLKHMEVVDSKDFSRIGYFIPHSCVIKPDSTS FT TKLRVVFDASAKSSNGYSLNDLQLSEPAAQRDLFELLLDFRRHDKVVTADI FT AKMYRQVNVHEQDSWLQCILWRDNPSHKIQAYRLKTVTYGEAAASFLACRV FT LHQVGEEIRPQQPNISKIIQQCFYVDNLMMGGDSAEVLLNQRKAVEAALIK FT RGFPLRKWASNDPSIISDVATEDLEKEIHVGDHDIIKTLGVAWPPQNDSFR FT FLVDDRNSSRTTMTKRQLASEVLRLYDPLGIMQPIIITAKILLQGMWKTRL FT KWEDHIPDETLHEWQQLKKTLPKLAELEFPRQAIPSNATDLELHGFSDAST FT RAYGGAIYAFATDSNGNKTMNLLCAKAVVPLKDLKVTSEKDTSTEPTDDIT FT LPRKELIGAKLLAELMNRVISIIPQHIDKVHYWCDSQVVLAWIHSEEQHKE FT VYVRNRVKIIQGLTNKRNWNYIPTDQNPADTISRGISVRKLLTTKRQLWLH FT GTTYVLAAPPNEHMPALKQTPIYVAVTTHATDIDLEDYVQRYKYSNSLPKT FT RRHFALVRRAMNNFMVRSETLKNKRIHLKNATGPLTTEELDSGMRLIIRNM FT QATCLKDELLNIRAGGQHNKQGPLQHLNPIIDDGLIRVTGRLNLADLTENQ FT RNPILIPREHPFSRIMIEHFHKANNHAGLEFVLAEFQRNYWMRGLRKTAQA FT VLQKCILCVRARPRRFEQQMGQLPRPRVNPSTAFTHTGVDLCGPFQVLPSQ FT RAKTRLTVYACLFVCFSTKAVHIEVVENQSTAAFIAAMIRFVSLRGRPEVV FT YSDNGRIFVGASRELTELRKIYNNEVFQNEIVGIAAEQGINFSFIPPRSPN FT FGGLWEANIKVAKKLFTAAARGASFNILEIQTVFYQIAAIMNSRPLTSVLS FT NVGAP" XX SQ Sequence 5958 BP; 1850 A; 1415 C; 1390 G; 1303 T; 0 other; ttggtgaccc cgacgtgata agattcttat cactcctaaa tcgcgttcgt gtgaacaaga 60 gaaaagtgtt ccatgtcttg ccggcggtgt caaaattgaa cggccattta gtgctaaccc 120 aagtgacttt ccgagtagaa agaaatcaac aacccaactg ccgacggtgt caaaattgaa 180 tggcgtctag tgcaactcaa gtgattttcc gaaagaaaag tgatcaataa atcaaccatc 240 gacggtggca aagtgcaacc caagtgactt tccgagggaa aagtaattaa taaatcaatc 300 gccgacggtg gcaaaggtga acggtgttct atcgcaaacc caagggactg tttgagtgga 360 aagtgatcaa taaatcatcc gccgacgggg gttgagttga gcggtaatca aacgtaaatc 420 taagtgacct ctcgcgcaga agtgacaaag aaaccaagaa attagctgcc gacggcggca 480 aggtgtaggg aaattaagtg aaaaccgtag tgagcacctg cgggcatcaa tcatgaaatc 540 agagtgcctg gttaacttgc cgaagcgaaa tcccaaatta ttgtgtcctg catcgtgatt 600 ttacaagtga ctgaaatcta gtaagtaact gactaaagtg gtccaccatg gacgaagaag 660 aactaaatga ggtaaaaaat cagcttgctg ctctagagag cggtatcaag aaaatgaaca 720 ccgatctaga caaaataact tcagcaaacc ggcccagtcg gtcggaagcg aagactaaag 780 actaaagtgc acgtaaccgc gaagtgcaag ctcgacaacc tgcagcagga gctggagcct 840 gtagatcgca cagcccctaa tatgatggag caaacattta tgcaagctcc aaaccgagca 900 gatcacttac cacgaatcga attacctcga tttaacggtt ctccaaccga gtggttagcc 960 tttaaggctc ggttcgagaa gaggattgcg acaatcggtg acgacgccga taaatacgca 1020 ttcctctcaa agtgcctaga gcactttgag cccgcccgca actcaattga tgcccttgag 1080 aatgcagggt cgagtttcgc agaagcttgg gcaaagcttg aaactcgttt ttacaagaaa 1140 cgaatagcct acgaaggcca tttcgtcaaa ctcataaagt tcaaaaaaat taattcgccc 1200 aacgcaaagg ccatcttggc actgatcgac gccgtggata cgacggtcca cgcagcgcaa 1260 caaatagaaa acgaagtagg acaagatttg gactgcgtgg cgaatggcct gttagtaagc 1320 ctggctaagg ctcgattgga cggggaaaca gcgtccaaaa tcgaagaacg catggacatt 1380 caccgcgttt atacatgggt gtaattcaag gaagaattgg agaaaagggc taaccaacta 1440 gcatgtcact ccgatttcga ggagcataaa cctcgaaatg tcaaaacggt agcggccgca 1500 tccgtcgctc agccacagag aaataacaac gaaggtaaat catcaaaagg tctacaggat 1560 aagcctcaac aacaatcctg ttttatgtgt ggggaaaaag gacaccccat ttggttctgt 1620 gtccgattta aggagctctc tcctgcacag aggtgggata aaacaattaa gtcccgccgg 1680 tgtttcaatt gtatttccag ggggcattcc taccaaaaat gcccctcttc taaacgatgt 1740 catgaatgtg gtgagccgca tcacacgctg ctgcataggg aagacaacgc cacggagaag 1800 aaacctgttg ctgattccgt ggtccaggct tcaacaagca gtaaccagtg actgaaggag 1860 gaatacgttt tcttagcaac tgctgagcta aacgtcaaag gtgcttctaa ttggtaccaa 1920 acttctaatt ggcacctgcc tgttggattc tggaagtcaa atcgaatcga ttactgaaga 1980 ggccgcccaa accctggggc taccttactc ccggagcaac ttacagatca atggaattgg 2040 tggaagagtc aacgcttcaa ggcggattac gaccaccata tcatcgcgat gcggtaagtt 2100 tactatgaaa attgatatga tagttgtacc acagctgatt gatgaccaac cgagcattca 2160 tctcgaagct agggatgtgc aaatgccagc aaatgttgag ctgaccgacc cgacatttta 2220 caaacggcga tgcgtcgaaa taatacttgg tgccagagtg ttattccaaa tattgggccc 2280 caggcaaatg agatgcgatg gaggtccgaa cctgcaagaa tctgcgctag gatggctagt 2340 aggtgggctc gcttctctgc gcacaccaag gacggccacc atggccattg ccactgtttc 2400 cgctgacgag caaacagttc cagatgatga ggagaatgcc tacgagcact tggacactct 2460 cttcaagcga ttttgggcct tagaagaggt aacgtctgcg gaagcgacat ctacttccaa 2520 caaggtcaac aagtgtgagg agcactttct gcagacaacg acgatagggg cagatggaaa 2580 atacgtcgta cgtctcccat ttaaggatac gcttgttaac ctaggcaatt cctatgaaca 2640 cgcaaagaga cgcttgttct cactagaaag gaggctcgca aggacacctg aattatacga 2700 ccagtataga gaattcctga acgagtacct ggacctaaag catatggaag tcgtagactc 2760 caaggacttc agtagaatcg gctatttcat accgcactct tgcgtaatca aacctgattc 2820 aacttctacc aaattgcgcg tcgtcttcga tgccagcgcc aagtcatcca acggatactc 2880 cttgaatgac ttgcaattaa gcgaaccagc agctcaaaga gaccttttcg agctacttct 2940 agactttcgc cgccatgaca aggtagttac ggcggacatt gcaaaaatgt accgtcaagt 3000 aaatgtccac gaacaagatt cgtggcttca atgcatcctc tggcgagata acccatcgca 3060 taaaattcaa gcctaccgtc tgaaaacggt cacttatggc gaggcggccg catcattcct 3120 agcatgccgc gtcttacatc aagtagggga agaaatccgg ccccaacagc caaacatttc 3180 gaaaataatt cagcaatgtt tctacgtgga taatctaatg atggggggag attcagcgga 3240 agtcctacta aatcaacgta aggctgtgga agctgcacta atcaaacgag gattcccctt 3300 acggaaatgg gcatccaacg atcctagcat catatcagac gttgcaacag aggatctaga 3360 aaaggaaatc cacgtaggag accatgatat catcaagaca ctaggagtgg catggcctcc 3420 acaaaacgac tcattccggt ttctggtaga cgatcggaat tccagcagga caactatgac 3480 caaaaggcaa cttgcctcgg aagtacttcg gctatacgat ccactgggga ttatgcagcc 3540 catcatcata acggcgaaaa ttctactgca aggaatgtgg aagactcgac ttaagtggga 3600 agaccacata ccggacgaaa cactgcatga atggcagcag ctgaagaaaa ctctgcccaa 3660 actggctgaa ttggagtttc cacgccaggc aattccgagt aacgcaacag atctggaatt 3720 gcatggattt tcagacgcgt caacccgcgc ctacggaggc gcgatctacg cattcgcaac 3780 ggattcaaat ggaaataaga caatgaatct gctctgtgcc aaagcatgag ttgtaccctt 3840 aaaggacttg aaagtgacat ctgaaaagga cactagtacg gagccgacag atgatattac 3900 tcttccgcga aaggagctga ttggtgccaa actgctagcg gaattgatga accgagtgat 3960 cagcattatc cctcaacaca ttgacaaggt tcactattgg tgcgattcac aagtggtgct 4020 agcttggata cattcggaag agcagcacaa ggaagtgtac gtacgaaatc gagtgaaaat 4080 cattcaagga ttgacaaaca aaagaaactg gaactacatt cctactgatc aaaatccagc 4140 cgacaccata tctcgaggta tttcagtgag gaagctactc actacgaaaa ggcagctgtg 4200 gcttcatggc accacatatg tattagcagc acctccaaat gaacatatgc cagccctcaa 4260 acaaacgcca atttacgtag cagtaaccac tcatgctact gacattgact tggaagacta 4320 cgtacagaga tacaaatatt ccaactcgct acccaaaact agaaggcatt tcgctcttgt 4380 acgacgagcg atgaacaact ttatggtccg atccgaaact ttgaaaaaca agcgaattca 4440 cttgaaaaac gcgaccggtc ctctaacgac cgaagaatta gactcaggaa tgcgcctgat 4500 aatcagaaac atgcaagcca catgtttaaa ggacgaattg ttgaacatcc gcgcaggtgg 4560 tcaacacaac aagcaaggac cattacagca cctcaatccc atcattgacg atggattgat 4620 tcgagtgact ggtaggctca atctagccga tttgaccgaa aaccagagaa atccgatact 4680 aattccgagg gaacacccct tctccagaat aatgatagag cattttcaca aggcgaataa 4740 tcatgccgga ttggaattcg ttctagctga attccaacgc aactactgga tgcgaggact 4800 taggaaaacg gctcaagccg tcctccagaa atgcatactt tgcgtacgag caaggccacg 4860 tagatttgaa caacaaatgg gtcaattacc acgacctagg gtaaacccat ccacagcttt 4920 tacacatact ggagtggatc tgtgtggccc cttccaagtt ctgccctctc aaagagcgaa 4980 aacgagactt accgtctacg cctgcctttt tgtatgcttc tcaacaaaag ctgtacatat 5040 tgaagtggtg gaaaaccagt ccaccgcagc attcattgcc gctatgattc gctttgtatc 5100 attacgtggt agacccgaag tggtctactc cgataacggc cgtatcttcg tgggagcaag 5160 ccgtgaacta actgaactaa ggaagatcta caacaacgag gtatttcaaa acgaaatcgt 5220 agggatagcg gctgaacagg gaataaactt ttccttcatc ccacccagaa gcccaaactt 5280 tggagggctc tgggaggcaa atataaaggt ggccaagaag ctctttaccg cagccgcgcg 5340 cggtgcaagc ttcaacatac tcgaaataca aacggtattt taccagatag cggcgataat 5400 gaactcccgg ccgcttacat cggtattgtc caacgtggga gccccttaac ccttgactcc 5460 gggacatttt ttgataggta gagctatgac agctgtacct ataccaatga gtcacttgga 5520 agaaggcagt ctgcccatgc gctggaggcg cattcatacc caaaccgcac atttttggcg 5580 taagtggcaa catgagtatc tgcaacactt gcgctgctta tccaagtgga ccaagaaaca 5640 acctaatgta caaccaggcc agatagtcct gataggagat gacaacaatc cggtagctaa 5700 atggcctatg ggaattgtga tccatacaca aagaggaccc gatggaattg tacgtgtagc 5760 cacagttcgc gtcggatcga acgtatacaa acgaaacgtt aggctgctgg cacctctgcc 5820 tatcgaagtg tcagcaatca atcactgcgc aaacgaggtc agctcagaaa atgtacaaga 5880 aaacaactat gattcatcct ccctcactgc tgtcaggaat tcttggagcg acagacttcg 5940 ctccaagggg ggagaaaa 5958 // ID Outcast_Ele3 repbase; DNA; INV; 5441 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Outcast clade non-LTR retrotransposon family from Aedes DE aegypti. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; KW Outcast_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5441 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5441 RA Kojima K.K. and Jurka J.; RT "Outcast clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 5 CC sequences with >96% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 295..1626 FT /product="Outcast_Ele3_1p" FT /translation="MAFANSNGPPEGGTHRAQHNLNMDTTLNTPKPKINYQ FT YERTDQGPYRVMVELIDTQNGTLRINKLTLANVLRKMIIYKAHILDMKNIG FT RNKVMVYLNNYQLANRLTSDEDMKAKNYRAYIPRHLISVTGVIAGVPLDIT FT EEEVMEDIICEYPIMQVYRMNRFVNNQKEPTQRMSITFRAAKLPETIKIFH FT CSLRVRAFFKKAVLCLKCLRYNHKQENCKGTRRCQQCSRMHENDEEYQNCQ FT KPQRCLHCRKDHRTSDQACPERARQNNIQAILARTSLTAVEAVEQFPIHTQ FT NYYEALVESAQEPTPAESFAGVTSQLFRPRNNVQRRKREGISPCSRNIGEQ FT VTVYNEKKAKISRDGQHNGVALFNKYKTTEAEKWKTQLRQAQQAKAQEGIN FT IQPTPSSSNSTPATTGAPEIENEDFSMFVSNMINKGRINANSLELNRNQN" FT CDS 1670..5257 FT /product="Outcast_Ele3_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MKILQSNVQSFRKNKEEISRLLSRDNYDVAILSETWS FT KIEEESSRAYNLSGYHKVLNSRADGYGGIGIYIRNHLNFQQIQIXSQADLI FT QVAAVKILRTNTVVAGIYVAPSITTNQFEAELHAIVMQLNRYPKVIFGGDV FT NCHHFARGDEKCDPKGTLLMDIINGSNLVLLNTGEKTFIPLELGRRSTAID FT ISCCSTTMFGELEWKVKPETIGGSEHQLIEISGKEKVEYTRYFVNHKKIKE FT EIEKFSCEEVRSIDELQERIQLSYQKNRKKSKHTPKVWWSDTVEEAWKNKM FT EAIRKFNTDSTIENAIMVRRTKAIFLRRKKEGIRKQIEELASTIDPQTSSV FT VIWDKIGRITGKRVNRKVNNPVQEDSQLAEKFFDIHFGKTDVDIDAPLSYG FT PLCQYNLMDREKWDRILSRKNDKSAPSNDKITYGMLKILKPDVTSMIIRQI FT NDMFIAGALTYTLKEIRVVAIPKQGKDQSTVEGKRPISLIPTITKVTNTAV FT LDKIQSHLYKTRNLPELSFGFRKNMSTSTCLNFVVDSIKHNKREGLITATV FT FIDLSNAYNSVKADILEAIMYKLRFPREVVTWVVSFMKNRKVTMQVGTQTV FT SRMVSNGLPQGDVMSPTLFNIYTSELHELTNGEVTLVQFADDFSLIITGKN FT VEEVRRKTQQALDTFQERTKQLELDINPAKTKVVLFHGGNHTFDVKLDGIT FT VELVKSHKYLGLIIDRFLSFGEQTRTVKKKIDERLKMLKIISSVRHGGHPQ FT TMNMLFTALIRNYVEYGCSIMNNASKTNKQALQVAINGCLRKVTGCSKTTP FT LNTLLAIAAQEPYGIRSKFVTAKEIAKCLAYQGPVYEQLSKLGVDVLDRDK FT LSYSERLYLEHVQMFKNISPIVYVELTDTKVNINSSVGTKFKKENVNPKVL FT KQMVLCLLNGKYRNLTKVYTDASKHNSRCGIGVYMEPGNRRLTWKLKHETS FT IMTAEVIALSMATKEILKTDLRDVVILTDSLSSCMVLENSQYQEWRSAMID FT EILQACLSRNITIQWIPSHIDLEGNDIADYLAKKGTEHSNEIDHQVLLKDA FT FLYFQNLKEEETDKWYKDYALEKGQKYYAIQQSFPNKPWYHQKQLDNFETR FT TLNRIMAGHDYSKYWLHKMKIEDDPNCEICDVPETAEHIILHCVRYGQSRM FT QFSFDCKFRNMKEVLETKDVNIFKEIVNFLKQTKNKI" XX SQ Sequence 5441 BP; 2006 A; 1099 C; 1175 G; 1160 T; 1 other; cattccagtt caagcgtcgg atacgcgcgc gtgtgtttcc aagcgaaggc aaattttcga 60 gcgttttttc caaataaagt tccggaaaag ttatcacaac ttgaaatttg aagagaaata 120 acattttttg aagttaagtg aataagtgcg agagaaaaaa aactaaataa agtagtgaaa 180 aaagcatatt ttgatcgagc cgaaaaggcg aaagggacgc cattgtgcta gtgaggttac 240 ataaagaaaa aaagaaaatg gcgacatcta agagagaaga agaataagag aagaatggct 300 ttcgcaaaca gtaatgggcc tcccgaaggg ggcacacatc gagcacaaca taacctcaac 360 atggacacaa ctctaaacac accgaaaccg aaaattaact accagtacga gcgaacggat 420 caaggcccat atcgagtgat ggttgaactc atcgataccc aaaatggaac cctcagaatc 480 aacaagctca cgctagcaaa tgttctgcgt aagatgatta tctacaaggc tcatatcttg 540 gatatgaaaa atatcggtcg caacaaggta atggtgtacc ttaacaacta ccagttggca 600 aaccgcctta ccagtgatga agacatgaaa gcgaagaatt atcgtgctta catcccacgt 660 cacctgattt cggttaccgg ggtgattgca ggagtccctt tggacataac cgaggaagag 720 gtcatggaag acataatctg cgaatacccg atcatgcagg tatacagaat gaaccgtttc 780 gttaacaatc aaaaagaacc tacgcaaagg atgagtatca ccttcagagc ggcgaaatta 840 ccggaaacca tcaaaatctt ccactgctct ctcagagtga gggctttctt caagaaagcg 900 gttctatgtt tgaagtgcct acgctacaat cacaagcaag aaaattgcaa gggaacgcgc 960 agatgtcaac aatgctccag gatgcacgaa aacgacgaag agtaccaaaa ctgccaaaaa 1020 ccacagcgat gtctacactg cagaaaggat cacagaacat cagatcaagc gtgtccagaa 1080 cgagcccgtc aaaataacat tcaagccatt ttggctcgca ccagcttgac agcggttgaa 1140 gcagtcgaac aattcccgat tcatacgcag aactactatg aagccttagt agagagtgct 1200 caggaaccga cgccagcaga atcctttgcc ggtgttacat cacagctgtt cagaccacga 1260 aacaatgtgc aacgacggaa acgagaagga atcagcccat gctcaagaaa tattggagaa 1320 caggtgactg tatacaacga gaaaaaagcg aagatatcac gagatggcca gcataacgga 1380 gttgcactgt tcaataaata caaaacaacg gaagcagaga aatggaaaac acaattacgt 1440 caagcacaac aagcgaaagc ccaagaagga atcaacattc agcccacccc ctcgtcgagt 1500 aacagtaccc cggcgacgac tggagctcca gaaatagaaa atgaagactt ctcgatgttc 1560 gtctcgaaca tgataaataa aggtaggatc aatgctaata gtttggaatt aaatagaaat 1620 caaaactaga taaataaaga aaacattaaa cttatcaatg taaacaaata tgaaaatttt 1680 gcagtcaaat gtacaaagtt tcaggaaaaa taaagaagaa atctctagat tactaagtcg 1740 ggataattat gatgtagcaa ttttgtcgga aacgtggtca aaaattgagg aagaatcatc 1800 cagagcgtac aacttaagcg gataccataa agtcctaaat agccgtgcgg acggatatgg 1860 aggtataggt atctacatac gaaatcattt gaatttccaa caaatccaga tcakaagtca 1920 agcggatctc atccaggtag cagcagtcaa aatattgcgg acgaacacag ttgtagccgg 1980 aatctacgtg gctccgtcta tcacaactaa ccagttcgaa gcagagttac atgcaattgt 2040 gatgcagttg aacaggtacc ccaaagtaat ttttggagga gatgtaaatt gccaccattt 2100 tgcacggggc gatgagaaat gtgatccaaa gggaacattg ctgatggaca taatcaacgg 2160 atccaatctg gttctactga atactggaga aaaaacgttt attcctttgg aactaggtag 2220 gaggtctact gctatcgata tatcctgctg ctcaacgaca atgtttggag aactcgaatg 2280 gaaggtgaaa ccagaaacga ttggaggcag cgaacatcaa ctgattgaaa tatctggaaa 2340 ggagaaagtc gagtacacga ggtacttcgt caaccacaaa aaaattaagg aggaaataga 2400 aaagttcagt tgtgaagaag tcagaagcat cgatgaattg caggagagga tacaactgag 2460 ctaccaaaaa aataggaaaa aaagtaaaca tacaccgaaa gtatggtgga gcgacacggt 2520 agaagaagcc tggaagaata aaatggaagc catcaggaaa ttcaacacag acagcactat 2580 agaaaatgca atcatggtga ggcgaaccaa agcgattttc ctacgaagga aaaaagaggg 2640 aatacgaaaa cagatcgaag agctggcatc gactatcgac ccgcagacca gctctgtcgt 2700 aatatgggac aaaataggaa gaattaccgg aaagagagtc aaccgtaaag tcaacaaccc 2760 tgtacaggag gacagtcaac tagcagaaaa gtttttcgat attcatttcg gaaaaacgga 2820 tgtagatatt gatgcgcctt tgtcatacgg accattatgc cagtacaacc tcatggatag 2880 agaaaaatgg gatagaatac tttcgcggaa aaacgataaa tcagccccgt cgaatgacaa 2940 aatcacgtat ggcatgctta aaatccttaa gccagatgtg acgagcatga taatcagaca 3000 gatcaatgat atgttcattg ccggtgcatt gacatatacg ctcaaagaga taagagtcgt 3060 agctattcct aaacaaggaa aggatcagtc cacagttgaa ggtaagcgac cgatttcatt 3120 gattcccacc attacgaaag taacaaatac tgcggtatta gataaaatac aatcacactt 3180 gtacaaaact agaaatctcc cagaactttc ctttggattt cgcaagaata tgtccacatc 3240 aacctgtcta aattttgtgg ttgactcaat aaagcacaat aagagagaag ggttgataac 3300 agccactgtg ttcatcgact taagcaacgc ctataattca gttaaagccg atatccttga 3360 agcaattatg tataagcttc gatttccacg agaagttgtt acgtgggtgg tttcattcat 3420 gaaaaacagg aaagtgacga tgcaagttgg aacccaaaca gtatcaagaa tggtgagtaa 3480 cggcctcccc caaggggacg tcatgtcacc gacacttttc aacatataca caagcgaatt 3540 gcatgaattg accaatggag aggtaacgtt ggtacaattt gccgacgatt tctcacttat 3600 aataaccgga aaaaatgttg aagaggtaag aagaaagacg caacaggcac tcgatacgtt 3660 ccaggaacga acgaaacaat tagaactaga catcaaccct gctaagacga aagttgtgct 3720 gttccacgga gggaaccaca cgtttgatgt gaaacttgat gggatcacgg tagagcttgt 3780 taaaagccac aaatatcttg gactgataat tgaccgcttc ctcagctttg gtgaacaaac 3840 caggacagtc aaaaagaaaa tagatgaacg cctcaaaatg ctaaaaataa tatcgagtgt 3900 gagacacgga ggacaccccc aaactatgaa catgctcttc accgctttaa tcaggaacta 3960 cgttgaatac gggtgctcaa taatgaataa tgccagcaaa acgaacaaac aagccctcca 4020 ggtagcaatc aacggttgtc tgcggaaggt aaccggttgc agcaagacaa cacctctgaa 4080 tacgctgttg gctatagcag cacaggaacc atatggaatt cgatcgaaat ttgtaacagc 4140 gaaggagatc gccaaatgct tggcctatca aggaccggta tacgaacagc tgagtaaact 4200 cggagtagac gtcttggaca gagataagct atcgtattca gaaagactat atctagaaca 4260 tgttcaaatg tttaaaaaca ttagtccgat agtttacgtg gaactaaccg acacaaaggt 4320 taacatcaac tcgagtgttg gaacaaagtt caaaaaagag aacgttaacc ccaaggttct 4380 aaagcaaatg gttctgtgtc tcctgaatgg aaagtacaga aacctaacta aagtgtacac 4440 agacgcatcg aaacacaaca gccgttgtgg aattggagtg tatatggagc caggcaatag 4500 aagattaact tggaagctga aacacgaaac atcgataatg actgcggaag tcatagcact 4560 cagcatggct actaaggaaa tcttgaagac tgatctacgc gacgtagtaa tactcaccga 4620 ctcactatca tcgtgcatgg tactggaaaa cagtcagtac caggaatgga gaagcgctat 4680 gatcgatgaa atactacaag catgtttgtc ccgcaacatc actatacagt ggatacctag 4740 ccatatcgat ctggaaggaa atgacatcgc tgactacctg gcgaaaaaag gaacggaaca 4800 ttccaacgag atagatcatc aagtcctact taaggatgcc ttcctttatt tccaaaacct 4860 gaaagaggaa gagacggaca aatggtataa agattacgct cttgaaaaag gccaaaaata 4920 ctacgctata caacaatcgt ttcctaataa accatggtac caccaaaaac aactcgacaa 4980 tttcgaaaca cgaacactca accgaataat ggctggtcac gattacagca agtactggct 5040 acataaaatg aaaatcgaag acgatcccaa ctgtgaaata tgcgacgtcc cggaaacggc 5100 tgaacatatt atactgcact gtgtacgata cggacagagc agaatgcaat tttcctttga 5160 ctgcaaattt agaaacatga aagaagtact agagacaaaa gatgtaaata ttttcaaaga 5220 aatcgtaaat ttcctgaaac aaacgaaaaa caaaatataa accaattcca aacaaaaaga 5280 gcattgaaac gttgtgtaac gagacgctag gtcgactttg tcgactcacc ccgccgtaac 5340 gtcaaaatgg ggtgggctgt caacaacgaa aaacagctgg atatatggtc cttgccacca 5400 atccaagtgt ttacactaca ccagaagaga agaagaagaa g 5441 // ID Gypsy-27_CQ-I repbase; DNA; INV; 4327 BP. XX AC AAWU01011305; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_CQ_; KW Gypsy-27_CQ-LTR; Gypsy-27_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4327 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 433-433 (2011). XX DR GenBank; AAWU01011305; Positions 102974 107300. XX CC Positions [3400-3864] - Integrase core CC 'GTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 127..4317 FT /product="Gypsy-27_CQ-I_1p" FT /translation="MPDTPPAALAPAPVTSGNQLLLNLLQGQQKLLEELVK FT NSVTQKDTKNSNEFVMESFSNTMSEFAYDPENDVTFASWYSRYEDLFSKDA FT SALDGASKVRLLLRKLSPAAHQRFVDYILPALPKHKTFEQTVDILKAIFGE FT GISTFRKRFRCLQTVKSGQEDFISYSATVNRRCEEFNIGATKADQFKCLVY FT VCGLQSPNDAEIRMRLINRMEMEPEKVTLASLVDECNRLVNLRKDTSMVET FT QEIKAVYQPTNQGNRFKPTDLPKTPCWLCGGVHFVRDCGYSSHRCSMCHQI FT GHKDGYCACVRPSKQNGGHADKPQRKPAAQGQQGRPWKKDNRGSRSQKPYS FT TRIVCCRQVQASSGRKFVEVAINGQQAHLQIDSASDITVISTATWRKIGSP FT LVSSTSARASTASGSQLQLTSQFEAYVTLQGVDREGTIYVTNANLNILGID FT WIERFGLWDVPISSVCQAIKMLKGTEIDELKVKYADVFSEQPSVCSCVDVQ FT LSVKEGAKPVFRPKRPVAYGSLQLVEEELDRLEKRGIITPVTYSEWAAPIV FT VVRKADKSVRICGDYSTGLNDVLEPHQYPLPTPDEIFASLAGGQVYSILDM FT RDAYLLLNVAEPSRKYLTVNTHRGLYQYNRVPFGIKPAPGAFQQVIDTALA FT GLPGVKAYLDDIIVTGRTIEEHKRNLEEVLRRLQKFGLRLKLEKCKFFERS FT VKYLGHIVDQNGTRPDPEKTKAIAEMPAPTDVSTLRAFLGAINYYGKYIPH FT MRNLRYPLDNLLKSSQTKFLWTAECQRSFETFKRILLSDLALTHYDPSLPI FT VVSADASNVGLGACIQHVFRNGSRKTVYHVARSLTKAEANYGQIEKEGLAL FT IFAVTRFHRMIFGRHFTLQTDHKPLLAIFGSKKGIPVYAANRLQRWAVTLL FT NYDFDIEYVRTTEFGNADVLSRLINRCAKPEEDFVIAALTLEADLDEVLRV FT TLNSCNLPLSFRTMQVATVKDAILQQVQKHLRSGWPQSKKQLRAEIRPYFE FT ARESLSLVHDCILFGDRLVVPTIYRAKVLKLLHTAHPGIERMKALGRSYVY FT WPNMSADIRDLVLRCSQCAAAAKAPARAIPQPWPKSTKPFQRVHIDYAGPY FT HDQYFLVVVDSFSKWPEIVPTGTITTTATIAMLREIFCRFGMPEKLISDNG FT RQFTADAFLDFCTQNGIEHVRTPPYHPQSNGQAERFVDTFKRALAKIHQRG FT GNLRADLDTFLLNYRITPNRSAPDGKAPAELLFGNKLRTTLDLLLPPTNVN FT NTLQLDQDKLRAFKAGDLVYAQQHTAGTTRWTPGVVTGRRGKVLYEVELDN FT GRTISSHLNQLRKRLDQCQPKNRGPLPLDILLDTYELKRTKPAPVPGRPVV FT APALPALPQQEDAGPRRSERTRKQPDRYGDAVYC" XX SQ Sequence 4327 BP; 1001 A; 1305 C; 1206 G; 815 T; 0 other; ttttggcgac gaggccgtcc cgcgcgaata aaagttcggc ccgaaggtgc ggcgcgctag 60 cgcaaaaatc gaggtgaact gtggattgag gttacgacac ctttctcgcc acgaggattc 120 tgcacgatgc cagatactcc acctgcggcg ctggccccgg cacctgtaac ctcggggaac 180 cagctcctgt tgaacctgct gcaggggcag cagaagctgc tggaagagct ggtcaaaaac 240 tccgtcaccc agaaagacac gaaaaactcc aacgagttcg tcatggaatc cttctcgaac 300 accatgagcg aattcgccta cgatccggaa aacgacgtca cttttgccag ctggtacagt 360 cgctacgagg atctcttcag caaggacgcg tcagctctcg acggcgcaag caaagtccga 420 cttctgctgc gaaaactgtc tcctgcggcc caccagcggt tcgtggacta catccttccg 480 gctttgccga agcacaagac gttcgagcag acagtcgaca tcctgaaggc gatcttcggc 540 gaggggatct ctacgttccg gaagcggttc cgttgtctcc agacggtgaa aagcggccaa 600 gaggacttca tcagctactc cgcgacggtg aaccgccgtt gcgaggagtt caacatcggc 660 gcgacgaagg cggaccagtt caagtgtttg gtctacgtct gtggcctaca gtcaccgaac 720 gacgcggaga tccggatgcg gctgatcaac cggatggaga tggagccgga gaaggtgacc 780 ctggcgtcgc tggtcgacga gtgcaaccga ctcgtgaacc tccggaaaga cacgtcgatg 840 gtggaaacgc aggagatcaa ggccgtctac caacccacca accaaggtaa ccggttcaag 900 ccgacggatc tacccaagac gccctgctgg ttgtgcggag gagtccattt tgttcgcgat 960 tgtggctact cttcgcatcg ctgctcgatg tgccatcaga tcggccacaa agatggctac 1020 tgcgcgtgtg ttcgaccgtc gaaacaaaat ggtggacacg ccgacaagcc ccagcggaag 1080 ccagcagccc agggccagca aggacgaccc tggaagaagg acaaccgcgg atccagaagc 1140 cagaagccgt attcgacgag gatcgtgtgt tgccggcagg ttcaagcaag ttctgggagg 1200 aaattcgtcg aagtcgctat caacggacag caagcacacc tccaaatcga ttctgcctcc 1260 gacatcaccg taatctcgac ggcgacgtgg cggaaaatcg gcagccccct ggtaagcagc 1320 acaagcgctc gagcatcaac tgcctcggga tctcagctgc agctgacgtc acagttcgaa 1380 gcctacgtga cactccaggg cgtcgatcga gagggaacca tctacgtcac gaacgccaac 1440 ctcaacatcc tcggcatcga ctggatcgaa cggttcggtc tctgggacgt gccgatcagc 1500 agcgtgtgcc aggccatcaa gatgctcaag gggaccgaga tcgacgagtt gaaggtgaag 1560 tatgcggacg tcttctctga acaaccgagc gtgtgcagct gtgttgacgt gcagctcagc 1620 gtcaaagaag gcgccaaacc ggtatttcgc cccaaacggc ctgtggcgta cgggtcgctg 1680 cagctcgtcg aagaggaact ggatcgcctt gagaaacgtg ggatcatcac gccggtcacg 1740 tactccgaat gggccgctcc gatagtggtg gttcggaagg cagacaagtc tgtgcggatt 1800 tgtggcgact actcaactgg gctcaacgac gtcctcgagc cccaccagta cccattgccc 1860 acaccggacg agatcttcgc cagcctggcc ggaggtcaag tttacagcat tctggacatg 1920 cgcgacgcct acctgctgct caacgtggca gaaccgtcca ggaagtacct gaccgtgaac 1980 acccacagag gcctgtacca gtacaaccgt gttcctttcg gcataaaacc ggcgccgggc 2040 gcgttccagc aggtcatcga cacagcgctc gccggcctgc ccggtgtcaa ggcctacctg 2100 gacgacatca tcgtcaccgg cagaaccatc gaggagcaca agcgaaactt ggaggaggtt 2160 ctgcgtcgtc ttcagaagtt cggcctgcgc ttgaagctgg aaaagtgcaa gttcttcgaa 2220 cggtcggtga agtacttggg tcacatcgtg gatcagaacg gcactcgacc tgatccagag 2280 aagacgaagg ccatcgcgga aatgccggcg cccactgacg tctccacctt gcgggcgttc 2340 ttgggggcaa tcaactatta cggcaagtac atcccgcaca tgaggaacct gcgatacccc 2400 ctggacaacc ttctgaagtc gtcgcagacc aagttcctgt ggaccgccga gtgtcagcgc 2460 agcttcgaga ccttcaagcg gattttgctg tccgacctgg ccctcaccca ctacgatccg 2520 tcactgccga tcgtggtgtc cgccgatgcg tctaacgtcg gcctgggtgc ctgcatccaa 2580 cacgtgttcc gcaacggttc aaggaagacc gtgtaccacg tggccagatc gttgacgaag 2640 gctgaggcga actacggcca aatcgagaag gagggtcttg ccctcatctt tgcggtaacc 2700 cggttccacc gcatgatctt cggccgtcac tttacgctcc aaaccgacca caagccgctg 2760 ctggctatct tcggttccaa gaagggaatc cccgtctacg ctgcgaaccg tctccagagg 2820 tgggcggtga cactgctcaa ctacgatttc gacatcgagt acgtgcgcac gacggaattc 2880 ggcaacgcag acgtcctctc aaggctgatc aaccgctgtg ccaagcctga ggaggatttc 2940 gtgatcgcag ccctgacgct ggaagcagac ctggacgagg tacttcgtgt gacgctaaat 3000 tcctgtaacc ttcctctcag tttcaggacc atgcaagtcg caaccgtaaa agacgcgatt 3060 ttgcagcaag tccagaagca tctgcgttcc ggttggccac agtcgaagaa gcagctccgc 3120 gctgaaattc gaccctactt cgaggcccgc gaatcactct cgctggtcca tgattgcatc 3180 ctcttcggag atcgcctggt agtgccaacc atctaccgtg ccaaggtcct caaactgctg 3240 cacaccgcac accccggcat cgagcgcatg aaggctttgg gaagaagtta cgtgtactgg 3300 cctaacatga gcgctgacat ccgtgatctg gtcctgcggt gctcccagtg cgctgccgct 3360 gccaaagcac cagcccgtgc catccctcaa ccgtggccga agtcgacgaa gccgttccag 3420 cgagtgcaca tcgactacgc tgggccttac cacgaccagt acttccttgt cgtcgtggat 3480 tcgttctcca agtggccgga gatcgtacca accgggacca tcacaaccac tgcgacgatt 3540 gcgatgctcc gagagatttt ttgtcgcttc ggaatgccgg agaagctcat ctctgacaac 3600 ggccgccagt tcaccgctga cgcattcctg gacttctgta cgcagaacgg gattgagcat 3660 gttcgtacgc cgccctacca cccacaatca aacgggcagg cagaaaggtt cgtcgacacc 3720 ttcaaacgtg ccctggccaa aatccatcag cggggaggaa atctacgggc cgacctggac 3780 acgtttctgt tgaactacag gatcaccccg aatcgaagcg ctcctgacgg caaagcacct 3840 gctgagcttt tgtttggcaa caaactccgc accactttgg acctgttgct gccgcccaca 3900 aacgtaaaca acacgctcca actggaccag gacaagcttc gggcgttcaa ggcgggtgac 3960 ctcgtgtacg cacaacaaca taccgctggc acaacccggt ggacacctgg tgtcgtcact 4020 ggcagacgcg gaaaagtgct gtacgaagtg gagctggaca acggtcgcac gatttcgtca 4080 cacctcaacc agttgcgcaa gcggctcgac cagtgtcagc cgaagaatcg tggacctctt 4140 ccgctggaca tcctgctgga cacgtacgag ttgaaaagga ccaaaccagc gccggtgccc 4200 ggacgtcctg tggttgcacc tgcacttcca gccttgccac aacaagaaga tgctggacct 4260 cggcgttctg aaaggacgcg caagcaacct gacagatacg gcgatgccgt gtactgttga 4320 ggggaga 4327 // ID Gypsy-38_DPu-I repbase; DNA; INV; 5199 BP. XX AC ACJG01004157; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_DPu_; KW Gypsy-38_DPu-LTR; Gypsy-38_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004157; Positions 16807 11609. XX CC Positions [4038-4502] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 836..3037 FT /product="Gypsy-38_DPu-I_1p" FT /translation="MSGRQDKLHGDASLAQLRTWRNHWSDFCQLNQLITYP FT VSEQMAAFRMVLDPDMQQIVEVALGILPTAATSPEDVLDQINTYIRSKRNI FT ALDRVAFEDCHQSATESFDDFYIRLRNLADAADLCAACRDTRFTTRIMAGV FT RDTETKRKLLALSPFPTAQQTINICRSEESAKANEKSLSNPPAISHVQTRS FT HRPPRQTDGGRCGSCGRLAHRTGETCPAIGKQCHNCGANNHFSPCCPKKPK FT VETRDSGGDSHSSRPRVHMKRIVVGNVQTNRRRRPAPTISLQLCDLTGKVI FT ANIPRTTPDGGAEATVGGMDVLHALGFSEKDLSSSTFDLVMADKSTPLLVV FT GEKEVTANYEGISAKITITFSPDITGLLLSWYNCVSLGILHEDYPRPWRKQ FT KTCTKINSVTSAQRGEEPFVNTGFVPLNPSPDDIQRIGDDIANAFTAVFDQ FT TGELNCMEGPDMIIELTDDAVPFYVNGSRPLPFADRPLVKQLLDEYVEKKI FT MVPVTEPSDWAAPLVVTRKADGSLRIFVDHTRLNRYVRRPTHPTRTLHDAV FT AEISGDAKFFSTFDAANGYYQIPLSPESQHLTIFMTPWGRYKYLRAPMGLC FT SSSDEYNRRADLAFEGVTNTVRVVDDILRFDSSFPDHVAGVCAVLSAARSA FT GITFRLNKFQFARSQVQGVGFQIQPGGVSVDPEKLRAISDFPRLTNITELR FT SFMGLVEQLAGFSTAVATAKAPLRPLLSRRTPFL" FT CDS 3069..5036 FT /product="Gypsy-38_DPu-I_2p" FT /translation="MSRKLSWSHLSWRSLTRHWKFLFRWTRRAKMEWGTHC FT SSCTDLRGNSSTRTHAVCTDTGSRHAIVELELAAVEWAMRKCRLYLLDLHN FT FQLVVDHQALVTILDKYTLDAVENPKLQRLKERLSPFVFTTVWRKGRAQSI FT PDALSQAPVNDPGHDDEATNSDIQSFARRVVIGQIHTMQSASDDDAVAKVT FT AEQPHLQDPFLDELRAAAASDPEYAALMTAVATGFSVPRHHTAQHIRQYWS FT IREELSMDDGLVLFGRRIIIPLSARKDVLLKLHAAHQGIVRMKRRARQTVF FT WLGITNEITLAVEKCQAYQERLPRQPQEPLLRDPLPSRVFEDVSADLFQVG FT SLHFLVYADRLSGWPVVHQWRHDPSAREVTQAVIENFVDLGVPVRMRSDNG FT PQFNAHSFQTKLCQWGVDWGNSTPNYPQSNSHAEAAVAAMKDLVTKISPRG FT DVTSDEFASGMLEFHNTSRENGLSPEEMVFGHPLRSIVPAHRTSYATRWQA FT VMEGRDRQAELDAAVKFKYDEHARPLAPLSLGTHVRVRDPPSKLWDKVGVV FT VSIGRYRSYRIKFESGSVLWRNRRLLRPMVAIPDAGEPSPEADVQNTDGEG FT GDERSSVDHPMAAGSADGVSVPSPLPTDTENTEDADVSNTQPLRRSQRVRK FT RRVMFNV" XX SQ Sequence 5199 BP; 1142 A; 1538 C; 1404 G; 1115 T; 0 other; tggcgcagtt ggacttgcct taacttccaa attactcaga ggtaattgtg cagattctcg 60 cttttgtgta cggtgaacta cgttacggct tgtgtgtcca tgcccccctg tacggcggcc 120 atcttgacgc cattgtcgtc ttcctcaccg tttcgctcga gttcctgcga gcgcctcgtg 180 tgtccgagca tcttcacgag tgcgtgtgtc gttttttcgg tgatcgtggt cgtagtatcg 240 cgtgccaccc tatcgatatc tttcccgctc gttgttcatt ttacattctc tattcgcatt 300 gtgtgcattc ggtcttatcg gttgaaccac gtgtgtcggt actctcttcc cccccttcac 360 gaacaatcgc tgtgggcggc gtggggccag tgtagaacaa ccaaaacgat ccgttgcatc 420 ggccagtgtc tgtgtcattc gtcggcgagt ttatgtccac ttccggacac cattaccatc 480 tgcgatcctc tccccgtcaa agcccgcagc ccgtttcgac gaacgtaacg cacacgtcga 540 gaactctcgc catggcaacc gtggatgacg cattggacgc cgcggcggcg cccgcagcag 600 cggctacagc agcgaccgga cgcctcgacg cggtcaaacg aaaccagctc acgatgacga 660 cccaattagc cggtattgcc cagcagctac aggcattgct caacgctagt gaaggtggcg 720 gcggcggtgg cgctggtgga ggtggcgctg gtggaggtgg cgctggtgga ggtggcgctg 780 gcggaggtgg cgctggtggc ggcggcgtcg tccaacgacg ccgcatcgat ccatcatgtc 840 tggacgacaa gacaagctac acggcgacgc gtcgcttgct cagcttcgca cctggaggaa 900 tcactggagc gatttctgcc agcttaacca gctgattacg tatcctgtca gtgagcagat 960 ggccgccttc cggatggttc tcgatccgga tatgcaacaa atcgtggaag tggcgctggg 1020 aattctaccg acggccgcga catcaccgga ggacgtcctc gaccagatca acacctacat 1080 tcgctcgaag cgaaacatcg cactggacag agtcgctttc gaggactgcc accagagcgc 1140 cacggaatcg ttcgacgact tttacatccg tctccgcaat ctggcagatg ccgccgattt 1200 atgcgctgcg tgcagggaca cgcggtttac gacgcgcatc atggcaggcg tgcgggacac 1260 cgagacaaag aggaaattgc tcgctctgag tccttttccg acagcacagc agacaataaa 1320 catctgccgt agcgaagaat cggccaaagc gaatgaaaaa tcgctgagca accccccggc 1380 catttctcac gttcaaacca ggagccatcg accaccgcgg cagacggacg gcggtaggtg 1440 cggctcatgc ggtcggctgg cccatcgtac aggagaaaca tgtccggcca tcggaaaaca 1500 gtgtcataac tgcggcgcca acaatcactt ctccccatgc tgcccgaaga agccgaaagt 1560 agagacgcgg gacagcggtg gcgacagcca cagcagccgc ccgcgcgttc acatgaagcg 1620 cattgtggtg ggtaacgtgc aaactaatcg gcgtcgtcgc ccggcaccaa cgatttcttt 1680 acagctttgc gatctgaccg ggaaggtgat cgccaatatt ccaaggacaa ctccagacgg 1740 tggcgcagag gctactgttg gtggcatgga cgtgctacac gcgctgggat tcagtgaaaa 1800 agatctttca tcgtccacct tcgatttagt catggcggac aagtcgaccc cactattagt 1860 ggtgggagag aaggaggtca cggccaacta cgaaggaata tcagccaaaa tcaccatcac 1920 gttcagccca gacatcaccg gactactgct ctcctggtac aactgtgtca gcctaggaat 1980 cctgcacgaa gattacccgc gcccatggag aaagcagaaa acgtgtacaa aaattaattc 2040 cgtcacatca gcccaacggg gcgaagagcc gtttgttaac accggttttg ttcctttgaa 2100 tccctcccca gacgacattc agcgcatcgg ggacgacatc gccaacgcgt tcacagcggt 2160 gttcgaccag accggagagc tcaactgcat ggagggtcca gacatgatca tcgagttgac 2220 ggacgacgcc gtcccttttt acgtcaacgg gtctcgccca cttcctttcg ccgaccgccc 2280 gctggtcaaa cagttattgg acgaatacgt cgagaaaaaa atcatggtgc cggtgaccga 2340 gccatccgat tgggccgcgc cattggtcgt cacccggaaa gctgatgggt cgttacgtat 2400 atttgtggac catacacgat tgaatcgtta cgtccgacgt cccacgcatc ccacgagaac 2460 tcttcacgat gcagtggccg aaatttccgg cgatgctaaa ttcttctcta cgtttgacgc 2520 ggccaacggg tattatcaaa tccccctgtc acctgaatcg caacacctca ccatttttat 2580 gactccatgg ggcaggtaca aatacttgcg cgcgcctatg gggttatgca gctctagtga 2640 cgaatacaat cgtcgggccg acttggcttt cgagggcgtc acaaacactg tcagggtggt 2700 cgatgacatc ctacgtttcg actcttcgtt cccggaccac gtagctggag tctgcgccgt 2760 cctatcggcc gccaggagcg ctggcatcac cttcaggttg aataaattcc agtttgctcg 2820 ctctcaagtc caaggggtgg gatttcagat ccagccaggt ggcgtctccg tcgacccgga 2880 aaagttgcgg gccatctccg atttcccacg tctgaccaac atcaccgagc tccgttcttt 2940 catggggctc gtcgagcaac tagccggatt ttctacggcc gtcgccacgg caaaggcgcc 3000 cttacgcccg ctgctaagca gaagaacacc ttttctgtga acgtcggacc aaaaccgggc 3060 ctttaccgat gtcaaggaag ctctcgtgga gccacctatc ttggcgcagt ttgacccgtc 3120 actggaaatt tctcttcagg tggacgcgtc gcgcaaaaat ggaatggggt acgcattgct 3180 ccagctgcac ggatctacgt ggaaactcgt cgacgcgaac tcacgctgtg tgcaccgaca 3240 ccggatctcg acacgccatt gtggagctgg agctcgcggc tgtcgaatgg gccatgcgga 3300 agtgtcgcct atatctcctc gaccttcaca attttcagct tgtcgtggac catcaggcgt 3360 tagtcaccat tctggacaag tatacattgg acgccgttga aaatcccaag ctgcagcgcc 3420 tcaaggagcg cctatcgcca tttgttttca ccacggtgtg gcgaaagggc cgcgctcaat 3480 ccatacccga tgctctctct caagctccgg tgaatgatcc tggccatgac gacgaggcca 3540 ccaactcgga catccagtca ttcgccagac gcgtggtgat cggccaaatt cataccatgc 3600 aatcggcctc cgacgacgat gcagttgcga aagttacggc ggaacagcct cacctacaag 3660 atccattctt ggacgagcta agagcagcag ccgcatcaga tcccgaatat gcggcgctca 3720 tgacggcggt cgcaaccgga ttctccgtcc ctcggcatca cacagcccag catatccgtc 3780 agtattggtc cattcgggag gaattatcaa tggacgacgg gctggtattg ttcggccgac 3840 gcatcatcat tccactatcg gcccgcaagg atgtcctact caaacttcac gcggctcacc 3900 aaggtatcgt ccggatgaaa cgacgggcgc gccagacggt tttctggctt gggatcacca 3960 acgaaatcac cctcgcggta gaaaaatgcc aggcgtacca ggaacgactc ccgcgccaac 4020 cacaggagcc ccttttgcgc gatcccctcc cgtcgcgagt attcgaagat gtatccgccg 4080 atctgtttca ggtgggatca ttacacttcc ttgtttacgc ggaccgtctc tccggttggc 4140 ccgtggtcca ccaatggcga cacgatccat cggcccgcga agtcacccag gctgtcatcg 4200 aaaatttcgt cgacctgggc gttccggtac gcatgcgatc cgacaacggc ccccaattca 4260 acgctcacag ctttcaaacc aagctatgcc aatggggcgt ggattgggga aattcgacgc 4320 caaattaccc ccagagcaac agccatgcgg aggcggcggt ggccgccatg aaagatttgg 4380 tgaccaaaat ttccccccgc ggtgatgtga cgtcggacga attcgccagc ggcatgctgg 4440 aattccacaa cacctccaga gagaacggcc tatctccgga ggaaatggtt ttcggccacc 4500 ccctacgttc catcgtcccg gcgcaccgga catcgtacgc cacccgttgg caagcggtca 4560 tggaaggacg cgaccggcaa gctgagctcg atgccgcggt caaatttaaa tacgacgaac 4620 atgctaggcc cctcgctcca ctatcgctgg gcacccatgt ccgcgtgcgc gatccaccct 4680 caaagctgtg ggacaaagtc ggcgtcgtgg tcagcatcgg gcgctaccga tcttatcgta 4740 tcaagttcga gagtggcagc gtattgtggc gtaatcggag actgcttcgg ccaatggtgg 4800 cgatcccgga tgctggcgag ccgtcaccag aggcggatgt gcaaaacacc gacggtgaag 4860 gcggagacga gagaagcagc gtcgatcatc ccatggcagc cggatcggca gacggcgttt 4920 ctgtgcccag tccactccca accgacaccg aaaacaccga ggatgcggac gtctctaaca 4980 cgcagccact tcgtcgcagc cagcgcgttc ggaagcgcag agtgatgttc aatgtttaat 5040 ctggttttca gaagcttgtg ttatttgtgc attgtccccc cactcctaac gcatatgcag 5100 tatccttatc gcattgtgtt ccgccattgt gtgtcatgtc attttcttgt gtgtgttaag 5160 ctgttgtcgt ttgttgtagc gcaacagctt gggaagggt 5199 // ID TTAA3B_AP repbase; DNA; INV; 436 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA3B_AP. XX NM TTAA3B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-436 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1781-1781 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 436 BP; 147 A; 70 C; 68 G; 149 T; 2 other; gaggacgtta caccgacatt tgttgtctcc gtcttacaag tgcgtaacat agcaaatttt 60 acgctcagca gatcacgttt agctccgtta gtttaaaaat tagagtgaat tgacctctta 120 taaaatttaa aggtaagatt attatctagg gnatctcata ggctttttat tatattttaa 180 ttttaaagcg agttatgagt attttaaaat tgtaaatcgt ttgtacatct taaaatactc 240 ataactcgct ttaaaattaa aatataataa aaagcctatg agatncccta gataataatc 300 ttacctttaa attttataag aggtcaattc actctaattt ttaaactaac ggagctaaac 360 gtgatctgct gagcgtaaaa tttgctatgt tacgcacttg taagacggag acaacaaatg 420 tccgtgtaac gtcctc 436 // ID GIZMO4_EI repbase; DNA; INV; 2190 BP. XX AC GIZMO4_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Gizmo-Ei4 (GIZMO4_EI), a new member of the Tc1/mariner DNA DE transposon superfamily from the single-celled eukaryotic DE reptilian parasite Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; Gizmo-Ei4; KW GIZMO4_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-2190 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; GIZMO4_EI; Positions 1 2190. XX CC The TIRs of Gizmo-Ei4 are 382-bp long and are flanked by TA CC putative TSD. After correction for stop codons and frameshifts, CC the element may encode a 444-aa protein similar to other Gizmo CC putative transposases and to various IS630-like prokaryotic CC transposases and with a DD32E motif. It seems that Gizmo elements CC belong to a distinct clade of Tc1/mariner elements most closely CC related to the IS630 group of bacterial insertion sequences than CC to established eukaryotic clades of the superfamily (e.g. CC mariner, Tc1, pogo?). XX SQ Sequence 2190 BP; 839 A; 292 C; 342 G; 717 T; 0 other; cagtcggagt ttgtttaaaa aggatgtttt ttcgactgaa ttagtcattt tttataatat 60 tagttagatt tttttgttaa aaaggatatt tttttaataa aatagattga ttataataaa 120 ttagatataa tttttgtcaa ttaggaataa ttaccgtaat gaggtaatat ttttgttaaa 180 taagatttta ttattattat tcgaggtatt attataataa aaataggttg aattaccaaa 240 acattaggta tatttacatt aaataggtac atttttcgaa ctttagattt atttacaata 300 attttaggaa tgatttattt gaaatattat acatttattt ttaaattagg cactttttaa 360 cgtagtagga attaacaata aaaatatata acccatttac aataaaacaa cttttaacgt 420 atttaacatc atttaattta acattttttc attacataaa catcatgtct actttacaat 480 acattgatac atgagatccg gaagcgaatt taaattacat tcaaaacgcg ccaactccaa 540 caagtggtca accagttgtt ccatttgtaa taccacatca aagcattcca cctattgaaa 600 agaaaaaacg aggaagacct ccaaaaaaca caaaattgca aattcctact acattatcaa 660 agaagcgtga gtacacacat ttgacaaatg cacaaagaaa tcatttgttt aaagaatggc 720 aaactcatgg agatgattgg tcattcagtg aatatacttc aatggttggt attaaaaaga 780 aaatacttga aaatatgttg actttgctgc gtcaagggaa gtctatatac acaaagaatc 840 actataaacg ccaaagaaga acactcccct ttcaagaact cgtttataaa ggaattgaaa 900 gggatggaac aatttcagta gcatctcttc gcaaaacaat tgcacaagaa gaaaaattga 960 aaatggaaag acaacaagaa gacgtaatta taaacttgga aataattgat gaagttgaag 1020 tgactaaaga agacgttaac aatttagcac cctcaaaaaa ttcgataacg aaattcatgt 1080 ctggaaagac aacaaaagga gaagaaagaa cggttcctat attttcattt aaaagagtca 1140 aaacatgagg agtcccagct aacacgccag aaaatatgga taaaaagata gccgcagtaa 1200 atgagttaag aggtttgatc gggggaggtg caaagtgggt ttgcatagat gaaacaagtt 1260 ggagcattgc ttcaactgct gcttatggat ggtcagctag aggtaaagag tgttttatta 1320 ctaaagccag aggaggtatt cgcctcacat ccatttgcgc tatcgactta gatgggtttt 1380 cttattgtaa tatcactcct ggaaacaaca ctctggcaag tttcgaagtg tattttaaat 1440 atgttctcaa tgaatatgat aagttgggtc gctattgtgt attttggtgt aattattgtg 1500 atatacacaa ttcaatgaat cgaatagtta atggtacaaa tcattcagtg gtgtttaatg 1560 cagcttattc accagctctt aacccaatcg agaacatttt ggcttatgga aaagaagtat 1620 aaaaaaggat gtcaggattt ggaatggcct tcaagagcta ttggataaaa tcgcagaagg 1680 atttaaaaca atatctggag aagagactca ggcttcattt gaaagggtcc gagaagaaat 1740 ctggccaatt ttttttgcaa gagaagctat ttagttattc gttcgttaat aattatttat 1800 ttttgtttta tttattgtta attttgacta agttaaaaag tgactaattt gaaaataaac 1860 gtgtaatatt tcaaataaat catgcctaaa atttttgtaa ataaatctaa agttcgaaaa 1920 atgtacctat ttaccgtaaa tatacctaat gttttggtaa ttcaacctat ttttattata 1980 ataatacctt gaataataat aataaaatat tatttaacaa aaatattata tcattatggt 2040 gattattcct aattggcaaa aattatatct aatttattat aatcaatcta ttttattcaa 2100 aaaacatcat ttttaacaaa aaatctaact aatattataa aaatgactaa ttcagtcgaa 2160 aaaacatcct ttttaaacaa actccgactc 2190 // ID Gypsy-11-LTR_HM repbase; DNA; INV; 211 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-11-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-211 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1989-1989 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 211 BP; 74 A; 20 C; 28 G; 88 T; 1 other; tgttatattc ttaagaacta tgtatttgaa ctaaataatt gacgtcaatc acgtcattat 60 agaatacggt traatgaact taagttattc taattatatt taattgatgt gttataaaat 120 ctgttttgta gatcttgaat tgtgtttgat ttaatatttg ataaaagccg aaaatattta 180 atatagactt atttattacg gtttcacaac a 211 // ID Zator-2_Hrobusta repbase; DNA; INV; 3053 BP. XX AC . XX DT 10-MAY-2011 (Rel. 16.05, Created) DT 10-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Zator; DNA transposon; Transposable Element; Zator-2_Hrobusta. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3053 BP; 1078 A; 475 C; 533 G; 967 T; 0 other; ggggtcatcc ataaaaaacg tccacactag gagggggagg gaggtttagc taaaagtgga 60 caaggaggga gggggggggg gtaaatgcca aagtggatgt ccaccagtat agatagtttt 120 aagctataaa tatggtgttt ctcattacat agaccttaca atgttatgaa aatgataatt 180 cttccatatt taatttaaat aacaaaacaa atgacatatt aactgtctaa acgacagtgg 240 acactggaca gtcttgatgt tagtcttata attaaaaaaa tatataatct atttttatgt 300 tttcttcaac tcaccgttat ctacctacat aaaccgttct ctctaacatc caaaagaaat 360 gtcatctaga atgccagtca atgaaacact tttatatagc gctttttcat atattcaaac 420 ttcttcttca ttatctttaa gtactaaaag tccctgatgg tttagttttc tatcgtgaac 480 ctaatagatt ttagtaattt aacaagaata tggtcaataa ttcaaaatat ttaaaaactg 540 aatttcactt tttattactt tatatattac caatgatttt agtagttgtt taaaaaaaca 600 tttaaaattt tatttccaaa tagattattt cgtatattaa gtttttagtg atcataaaaa 660 actttcagtc accagttttt attgaactgg cgttctattg cgatgacttt ccataaaaag 720 cggacaagca gaaactattt tgtgtagttt tttgaatttg aacaattttg aatatttgca 780 aactatattt ttaaataatg gagaaattgt atttagaatt taaagataat tatattgccg 840 ctaatcctaa tataaaaaaa gaattagctt ttaaaaattt acaaaaactt tggaatgagt 900 gtaaaagtaa tcttgatgag atcaagagca atatcatcaa attcaaggct atcacagcta 960 gtagacgtgg aaaaatgatg tccatgtgga gcaaagctgc agcaactaaa agctcgagag 1020 attcggctac cacgtctagc gactcacaac ctcctgacgt tattgtgata ccaatgaagt 1080 tatcagtaga tacaaaaaag taagtccaat atttaaaaaa gttttaaaaa cttcacattt 1140 ttgttattat ataatttatt atattagttt tttttataca taaattaata aaaacatgtt 1200 tgaatttaac acaattttta atttttatgt gatgaaataa acagtagttc tctataatga 1260 catttttata tttcataaaa aatgcagcat taatttgtaa tgtgtaactt tttatactat 1320 tacttagtac attgtgagat aacatgtaaa attaaatttg attttcttta gaattgacaa 1380 acgttctttg ttttctcttc agtgtcaaag atgatacaga ggtccaaaag cacatcgagt 1440 cgtctaacat aaagccagtg caacaagaaa tagatgaagg aaatttttct aaagtggaaa 1500 aaataaacca gcagagcaga aagcacaaga agagttaaat cttatcaatt caaaacaaac 1560 tagcttacta gctattcgaa aaactggtct ctggacagat gaaatgaaaa aaaatgcaga 1620 ggaacttgaa gtacagaaga aaaaactaaa gcagaagttg aatcgactta aatttgatcg 1680 aaatcgtaaa cgtttggcac gagcagtatt aaaaagtgta ttagaaagtg taatcacaga 1740 aaatccagat gtggcaaaaa agctaaaaaa atttaaaaga aaaacattag gacgaccaag 1800 cattgagact gatcaatctg acttgttaaa atcaataatt gatattgtaa cagccagtgg 1860 agccgctgat gatcgtcgac gtactgaatt gattcgctca tgcaagactc tggatgatct 1920 tcattcggaa ctaacgaatt tgggttataa tctgagtcga tctgcaatat acatccgtct 1980 tctgccacgc aactatctga ctactgaggg caagtggcat atatctactg tccctgtaaa 2040 acttataaaa gctgctaaca gcttgcgtaa agagcacccg gatgctcatt ttgcagcatc 2100 gaccatatct tatttgaaag atttggcggt attgatggga agccattgca cattctttct 2160 cagccaagat gacaaagcta gagtgccctt gggtcttcca gctgcctcaa agcaggcacc 2220 aattttaatg catttacagt tgcctgacca tgactaggtt gtggcagata gacataaact 2280 aataccttct gtctacgctg gatgcgttat caaagatggt cgtgccactt attctggacc 2340 tacaatgata tccataagga gtagtaaaca tgacaaaagc agtgctgcca ctcatgttgc 2400 agactttaac acgttagtta ggctggacac tttcaaggat gtagcatgta ctaatgctgg 2460 cgtactgaag tttatagtta ttataactgt tgatggaggc ccagatgaaa atccacgcta 2520 tccaaaaaca ttgtgcgctg cttattaact atttacagaa catgatgttg atgcattatt 2580 cattgccagc catgcaccag gtcacagtgc atataatgca gtagagcgac gcatggctcc 2640 actatctcac gatttgtctg gcctcattct tccctatgac tattttggat ctcatttgga 2700 cactaatagt aaaacaatag atccggaact gaaaaaggtt aattttaaaa aatcggattt 2760 tcaatattaa gttgttagaa tattatgttt taaaataaat tttttaatat ttttgattct 2820 ttatacattt aattttttat atctttgttt attgtttaga aaaaaaacac aaaattataa 2880 caaatatttt aatattaaac tttgtcagtt tttgatgaat aaaaaacact aattgtggac 2940 gtccacaacg gggggagggg ggttcatcaa aatgtggaca aagttagaca agggaggagg 3000 ggggggggta aaaatttgcc aaaaaattgt ggacgtcttt tatggatgaa ccc 3053 // ID BEL-21_CQ-I repbase; DNA; INV; 7569 BP. XX AC AAWU01039820; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-21_CQ_; KW BEL-21_CQ-LTR; BEL-21_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7569 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 195-195 (2011). XX DR Genome; AAWU01039820; Positions 9659 17227. XX CC 'ACACC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 752..5257 FT /product="BEL-21_CQ-I_1p" FT /translation="MLFSWRSVVTMGDLQTLRKKQEILLNKLEVLNQFVEH FT YKAEEHECQLEVRLGMLNDVYQEFTDLRTKLELLLEEKDAAKFANAEPKVK FT QEVVSHREEANLQVVQEFDNKFCKIKAMLIAKRPVKDVAQTVPSVGDADTS FT FPLRVKLPDIHLPNFSGNLREWVTFRDTFKSLIHRNSKLTSMDKFTYLQSS FT LSGPALLEISGIDLSEENYSVAWSALEEEYGNKKLIVKAHLDVILDLEPLT FT KESYDGLSHLLGEFEKNLQMLDKMGEKTADWSTVMAHVLCSKLDSATLRNW FT ETHHNSKEVPTYKALLVYLRGHCSVLQSIKRAKAKSSEQRPPKAAVCHTAV FT RSSNQCHFCSGPWHTPFRCFKFQKMTISERNDAVSRNKLCRNCLKPGHYPR FT TCEGGTCHHCHQKHHSMLHNDQMRSSVPQQQSRPTATTSVQRQQPRPQNTN FT QTTHTPANNAPANQTNLQTTDSQTTQPQTTSQNYVALPVTPTHNIILSTAL FT IRIKDRFGNTLLARALLDSCSQHCLMTREFSRRLKFERQSSYLPIQGIGTS FT RCVSTQLVRADVGPRSDRISAYESEMQFYVLPKLNISLPTSYIDPSTIQLP FT DWMFLADPEFHKTGPVDVIIGAEFYMDLLTDERVKPAADGPTLHNTVFGWI FT ISGRLPGSVPESTSLVSVAAIDELLTRFWELETCRTKSTHSIEESTCEQLF FT EETTVRDETGRFVVTLPKKKYAVQRLGESRSTAIKRFLGLEKRLSANPDLK FT QQYSDFIHEYEAMGHMKRVAGDTAGGELAYYLPHHCVLRLDSTTTKLRVVF FT DASCRMSTGVALNDALMVGPVVQDDLLDIALRFRLHAVAIVADIAKMYRMI FT RVQPDDQRLQRIVWRDNVDEPIRIYELTTVTYGTASAPYLATKCLQKLGEI FT GEKTHPSAAKVLKRDFYVDDMLAGAHTVAEGKELVAEMVDLMESGGFSLRK FT WHSNSRDILLDVPEHLRDERTLLKLDTSDATVKTLGLVWEPSTDCFRFRSP FT KWNDVAVITKRVVASDMAMIFDPYGLIGPVFVQAKIFVQKLWRMELDWDAA FT LPEDLQEYWREYRRNLAGLDSLSIPRWIGTGADDQNVQLHGFCDASVNAYG FT ACIYIRTVSANGDVTVRLLASKSRIAPLENLKKKKRKQSIPRLELASALLL FT AHLYEKVANAINFRGKAFFWTDSMIVRCWLASLPSRWNQFVANRVSEIQHL FT TESGSWDHVPGIENPADIISRGMTPMQLQYSKLWFNGPDWLSLDHQHWPHA FT QVPNAADFDHEELEEHNAVAAVARDTEPCKLFTANSSYTVTIRTTAVICRF FT CFNSRAANKHCRRVGPLTAEELEVALKKLVRLAQRECFPEEYDALSRDRAI FT PSNSRIAALNPRIVDGILCVGGRLQHAAVSDNRKHPYILDHRHPFTKLIVT FT HYHETMFHAGQQLLISAVRERFWPINIRNLVREVIHKCVDCFRVKPKVLDQ FT LMADLPPERHRVHRSHESELTTADLFRSRTRSVELAQ" XX SQ Sequence 7569 BP; 1877 A; 2105 C; 2046 G; 1541 T; 0 other; tttggtcctt cgaaccggat ccgaaggatc cggtctggcc aggttcggga cacttcgcga 60 aacggacagt tcagtgcgcg aagtgttggc ctgctcgcga agagcagtgc aaagtgtacg 120 gtgcagaagc gcgcctggac agttttggtg cgtgctgaaa acggacaaaa agtgcacgga 180 aaatccgtga aaaaagtgca aaaaaaaacg ggaaaaaatc ccaaaaagcg gagctgaaca 240 gtggcacaat agtgcattga aaagcagctg aacaaaggca gtgacgatca gtgcagtgcg 300 gtgaagaaga tttcgccatc gcggaactca cggcgactca cggtgtgcgc gagctttggc 360 aagcgtcgcg acgtcaccat ctcaaccggt tggctgacgt catcgttcga gtcggtgatt 420 gattggcgcc tggagctttc gggacggatt tggcggattt cgaagcagat tcaggagagt 480 attttgcacg taaacggtgg ctcggctagt ttggctcttt gcggagaaat aaagtggctt 540 ttgaaaataa aattgcacga gagtttttat tagtctggtc aggttgcatg agtgttgtgt 600 tgttctgtct ggtcctctgg aattcccttg tcgattcccc tgtcgctgct gtcaatactg 660 ggtcgtacag ttcgcaatct gtcttgcgcg gtcgcgtcgt ggaacgtcga gtcgatctgt 720 ctgtttccgg tgaatctgaa agtgaaagtg gatgttattc agttggcgat cagtggtcac 780 gatgggcgat ctgcagactt tgcggaagaa gcaggagatc ctgctgaaca aactggaggt 840 gctgaatcag ttcgtcgagc actacaaggc ggaagaacac gagtgccagc tggaagttcg 900 actcggaatg ctgaacgacg tgtaccagga gttcacggac ctacggacga agctggagtt 960 gctgctggaa gaaaaggatg cggccaagtt cgcgaatgcg gagccgaagg tcaagcagga 1020 ggtcgtgtca catcgtgaag aggccaatct acaggtggtg caggagttcg acaacaaatt 1080 ctgcaagatc aaggcgatgc tgattgcgaa gcggccggtc aaggacgtcg cgcagactgt 1140 tccaagtgtt ggtgatgcgg atacctcgtt tccgttgcgc gtcaagctgc ctgacattca 1200 tctgccgaac ttcagcggaa atctgcgcga gtgggtgacc ttccgtgaca ccttcaaaag 1260 tctcattcac cggaactcga agctgacatc gatggacaag ttcacctatc tccaatcgtc 1320 tctttctggt cctgcgttgc tggagatcag tggcatcgat ctgtcggaag aaaactactc 1380 cgtcgcgtgg tccgcgctgg aggaggagta cggaaacaag aagctgatcg tgaaagccca 1440 cctcgatgtc attctggatc tggaaccgct gaccaaggag tcgtacgatg gtctcagcca 1500 cctgctcggt gagtttgaga aaaatctgca gatgttggac aagatgggcg aaaagactgc 1560 cgactggagt acggttatgg cgcatgtcct gtgctcaaag ctggactcgg ccacgcttag 1620 gaattgggag acccaccaca acagcaagga agtcccaacc tacaaggctc tactggtgta 1680 cctgcgcggc cactgttcgg ttcttcagtc gatcaaacga gcgaaagcga aatcgtcgga 1740 acagcgccct ccgaaagcag cagtctgtca cactgctgtg cggagcagca accagtgtca 1800 cttttgcagc ggcccgtggc acaccccgtt ccggtgcttc aagttccaga agatgacaat 1860 ctcggagcgc aacgacgctg tttcgaggaa caagctgtgc agaaactgcc tgaagcctgg 1920 acattacccg cggacgtgcg aaggaggaac ttgtcaccac tgtcaccaga aacaccactc 1980 gatgctgcac aatgatcaga tgagatcctc cgttccacaa cagcagtcga gaccgaccgc 2040 gacgacctca gtacagcgac aacaacccag accacagaac acgaatcaga caacacacac 2100 tccagccaac aatgcacctg ctaatcagac taacctacag actacagact ctcaaaccac 2160 acagccacaa accactagcc aaaactacgt tgcactaccc gtcacgccca cacacaacat 2220 catcctgtca accgcgctca tccgcattaa agaccgcttc ggaaacacgc tgctagcgcg 2280 tgcgcttctt gactcgtgct cacagcactg tctgatgacc agagagttct cgcggcgact 2340 caaattcgag cgacaatcgt cgtatttgcc gatccaaggg atcggaactt cccgttgcgt 2400 gtcgacgcag ctcgtgcgtg cagatgtcgg tccgcgttct gatcggattt cagcgtacga 2460 gtcggagatg cagttctacg tcctgcccaa gctcaacatc tcgttgccga cgtcgtacat 2520 cgacccgtct acaattcagc tgcccgactg gatgttcctg gctgatccgg agttccacaa 2580 aaccggtcca gtggacgtca ttatcggtgc cgagttttac atggacttgc tgacggacga 2640 acgggtgaaa ccagctgcag acggtcccac gctgcacaac acggtgttcg gttggatcat 2700 ttccggccgg cttcccggca gcgttccaga atcgacgtct ctcgtatccg tcgcggcaat 2760 cgacgagctg ctgaccagat tttgggaact ggaaacgtgc cgcaccaaga gcacgcactc 2820 gatcgaagaa tccacgtgcg agcagctgtt cgaggagacg acggttcgtg acgaaactgg 2880 cagatttgtt gtgacgctgc ccaagaaaaa gtacgctgtc cagcgtctcg gtgagtccag 2940 atcaacagcc atcaaacgct tcctgggact ggagaagcgg ctgtcagcga atccagacct 3000 gaaacaacag tacagtgatt ttattcacga gtacgaagcc atgggacaca tgaaacgggt 3060 tgccggcgac acggccgggg gagagttagc gtattacctg ccacaccact gcgtcttgag 3120 gctggacagc accactacga agctccgcgt ggtgtttgat gcttcttgtc gcatgtctac 3180 cggagttgct ctgaacgatg cgctcatggt gggaccagtt gtgcaggacg atttactgga 3240 cattgcgcta cgctttcgac tccacgctgt tgccatcgtc gctgacatcg ccaagatgta 3300 ccgtatgatt cgcgtccagc ctgacgacca gagactgcag aggattgttt ggagagacaa 3360 tgtggacgaa cccatccgaa tctacgagct gaccactgtc acgtacggaa cggcgtctgc 3420 gccgtattta gcgaccaagt gcttgcaaaa actcggtgag atcggagaaa agacgcaccc 3480 atccgctgcc aaggtcctca aacgtgactt ctacgttgac gacatgttgg caggtgcgca 3540 cacggtcgca gaaggaaaag agctagtcgc cgaaatggtc gatctgatgg aatctggcgg 3600 attctcgttg cgaaagtggc actcaaactc ccgagacatc ctgctcgacg tcccggaaca 3660 tcttcgcgac gagcggaccc tgctaaaact ggacacgtct gacgcaaccg tcaagacgct 3720 cgggctcgtc tgggagccca gtacggactg tttccgtttc agatcgccaa agtggaacga 3780 cgttgccgtg atcacgaagc gtgtcgtggc ctcagacatg gccatgatct tcgacccgta 3840 cgggctgatc ggtcctgtct tcgtccaagc gaaaatcttc gtgcagaagc tgtggcgcat 3900 ggaactggat tgggacgccg ctctgccaga agatctgcag gagtactggc gtgaataccg 3960 cagaaatcta gccggtctag acagcctgtc gataccccgc tggattggca caggtgcaga 4020 tgaccagaat gtgcagcttc acgggttctg cgacgcttca gtcaacgctt acggtgcgtg 4080 catctacatt cgaaccgttt cggccaacgg cgacgtcacc gtccgcttgc tggcctccaa 4140 atcccgcatc gcaccgctcg agaacctgaa gaagaagaag cgtaagcagt cgattccccg 4200 cctggagctg gcgtccgctc tgctgctggc gcatctgtac gagaaagtgg ccaacgccat 4260 caacttccga ggaaaagcgt tcttctggac ggattccatg atcgtacgct gctggcttgc 4320 gtcactgccg tccagatgga accagttcgt ggccaaccgt gtgtccgaga ttcaacatct 4380 gacggagagc ggatcttggg accatgtgcc cggaatagag aacccagctg acatcatttc 4440 tcgcggcatg acaccgatgc agctgcagta ctcaaagctt tggttcaacg gacccgattg 4500 gctgagtctt gaccaccagc actggccgca cgctcaagtg ccgaacgcag cagacttcga 4560 ccacgaagaa ctggaggaac acaacgctgt cgctgcagtt gcacgtgaca ccgagccctg 4620 caaactgttc acagcgaatt cgtcgtacac cgtgacaata cggacgaccg ctgtaatctg 4680 ccgcttctgt ttcaacagcc gtgcggcgaa caagcactgt cgcagagttg gtccgttgac 4740 ggctgaagag ctcgaggtcg ctttgaagaa gctggtgcgc ctcgctcaac gagaatgctt 4800 cccggaagag tacgatgctt tgtccagaga tcgcgcgatt ccaagcaact cccgcattgc 4860 tgcgctgaac ccaagaatcg tcgatggaat cctgtgcgtc ggcggccggt tacagcacgc 4920 cgcagtctcg gacaatcgga agcatccgta cattctcgac catcgtcatc cgttcacgaa 4980 gcttatcgtc acacactatc acgagacaat gtttcacgct ggacaacaac tgttgatctc 5040 tgctgtgcgt gagcgtttct ggccgattaa catccgcaac ctcgttcgcg aggtgatcca 5100 caagtgtgtc gattgcttcc gtgtcaagcc gaaggttctg gaccagctca tggcggatct 5160 gccgcccgaa cgacaccgtg tacaccgttc gcacgagtcg gagttgacta ctgcggacct 5220 tttcaggtcg cgtacccgca gcgtcgagct cgcccagtga agtgcttcgt tgccatcttc 5280 gtctgcctgg tcaccaaggc cgttcaccta gaactagcag cagacctaac gacgcagcat 5340 tctgcgccct caaacggttc actgcccggc gtgcaaacga agctggtcat gtgcgacaac 5400 gccaagaact tcgtcggcgc aagacgtgag ctgagcgaac tcgccaagct gttctaagtc 5460 agcagttcga agaggagatc attcgcgaaa cagcgaacga cagcatcgag ttcaagttca 5520 ttcccgctcg ctcgccgaac ttcggcggcc tctgggagtc cgcagtgaag agcttcaagt 5580 tgctgttcaa acgtacgatc gggctgcaca ccctgctgta cgacgagttc cagactgtgc 5640 tggtccagat cgaagcgatc ctcaactcgc gaccgctcac gccgctcagc aacgaccccg 5700 cagacttcga agcgctgacc ccaggacact tcctgatcca gcgtccattc actgcgattc 5760 cagaaccaaa cctcgaccac attcccgaga acagactgtc ggcctggcaa accgtgcagc 5820 ggtacacgca acagctctgg aagaagtggt ccaacctcta cctgtcggac ttgcacaacc 5880 gcacgaagtg gacaaagcaa aaagacaacg ttgctgtcgg aaccatggtc ctgctgaagg 5940 acgagaacct cccgccgcta aagtggcagc tcggacgcgt ttccgatatc cacccgggag 6000 ctgacggcaa catccgtgtc gtcacggtgc gcacaaagga cggtagctat cagcgtgcga 6060 tctctaagat ctgcattctg ccgatccgtg acaacctttt taccgcgcaa ggggagaact 6120 aggactcctc caacggcgga ggtcgaaagg cctccgcaca ccagttaagt tatgtattgt 6180 tttgaatttc aaaaagttca ccgaatcttt ttcctccaga gtcgccaccc tgtctgcgcc 6240 cagacctgcg ggtcgttctg cgctgggagg ccgtcaaaaa gatcggaact ttccgaaatg 6300 ctctgaggta acctgttttg tttcggttgt ggtttgcatg aagatgtttg cttgcgatct 6360 agctggcaaa agcccagcat aattacggtt acggataaca gcggcgcaca gcgccttttc 6420 ggtcgaaaag actgccccca ctgtttgaac tcagctaaat tctgttccac gtttggaact 6480 cttcgaggta cacgcgctca cgatgacagc agcggaggag acgtcctccc ggtcccatct 6540 gccacgacca gcagcgagct tggccagctc gttcccaatc gcagcccgat ggagccaacc 6600 cccgtcatcg cagcatcgta cacccggaga accatcgtgt cggtaaccgc tgagggtcaa 6660 cagcgacgcc aggaggcgtc gggggaggcc acggaggtct cccccagttc aggcaagtcg 6720 tttttgaata taatttaact gattaaaaag caccctctca tctgattagg gagtctcaca 6780 tcgcatcgac gccgacgccg gcgtcaccga tcgcatcaga caacggagca gcagaacacg 6840 cacggcgcca cccacgcacg gtggctttgc gttcagcaca gaccagcaca gatcaacagc 6900 aacagcagca gacagacgaa ccgacaactg acgtcggcgg cacgccgtcg aacaacaacc 6960 gcagcacatc gaagcatcca cccccgcgac aaacagaaga tcaaccgatc gagccaagca 7020 gcagtatttc aaatctacac cagcaaacgg tagcagttca agcgctacaa gtagcagatc 7080 cagctcaagt tcagggtcaa cctgctgtcg ccatgatgaa acagcacctg aaagaacgac 7140 aaatcagcat tttgcagcgc accacgacac caatcaaccc cagaagtgtc cccagcagca 7200 gcgcggcatc caggttgatc gtttgatgtc cagtccggtc cagaaagcag caaatctgac 7260 gcaaaattgg tgcattttcg actgcagcga aatgcacgat ccagccagcc cagaagaaga 7320 aaaatatctg gctattgttc gtgttcgatc gacggccgca gccgcgatcg ctagtacata 7380 aggagaacaa tgaccaatct tgacttaggg taatagcaga aacagtagaa gtagataaga 7440 gacgaagata acggataaga gataaggaac agaatcaaaa caaatcagtg ggatagcagt 7500 gggaattgta gtagtagttg taacagcgag aagaatgttt gaatcagagt gattccaagg 7560 cggccggta 7569 // ID Mariner-2_BF repbase; DNA; INV; 1297 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-2_BF DNA transposon DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner/Pogo; KW Pogo-1_BF; Mariner-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1297 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1297 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-1297 RA Kapitonov V. and Jurka J.; RT "A family of Mariner-2_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC It belongs to the Pogo group of Mariners. XX FH Key Location/Qualifiers FT CDS 169..1221 FT /product="Mariner-2_BF_1p" FT /note="Mariner/Pogo TPase." FT /translation="MEPVECRAVIRFLYLKGRTPKETFDEMKETYGDDAPS FT YGLVKRWHREFKHGRKSVETAPRPGRPCSAMIDESSVRQVEAAILEDRRIT FT VRQIAQEVKISTGSVETIIHKHLHMQKVSARWIPRLLTPFQKQERVECSRM FT NLEMCQEDESNFFKRLITQDETWVHHYDPETKAQSKQWKHSDSPPPKKARV FT QPSAGKVMLTVFWDQDGVVMTDFLAKGTTITGAYYASLLTKLREAIKIKRR FT GKISKGILLLQDNAPVHNSHVARSEARACGYEILPHPPYSPDLAPSDFHLF FT PAMKLFLKGKRFQDDAALISEVTTWLEDQPRVFYKNGVQSCIKRWEKCITL FT GGSYVEKD" XX SQ Sequence 1297 BP; 363 A; 292 C; 318 G; 324 T; 0 other; cgaggggcgt gcaataagta atggtcactg accaacttcc agttgtctga tctacatgaa 60 attttgcatg tgtaatgatt tatatctcta tgggttatgt tgcaaaatac agctgtgaac 120 taattgtggt tactgattta catgtgtttg aactgagtca gaagtaaaat ggagccagtt 180 gagtgtcgcg cagtgatccg gtttttgtat ttgaaaggac gcacaccaaa ggagactttt 240 gatgaaatga aagaaacgta tggtgatgat gccccatcat atggccttgt aaaacgctgg 300 catcgtgagt tcaaacatgg ccggaaatct gtggaaacag ctcccagacc tggtcgtccc 360 tgttctgcca tgattgatga gtcatctgtc cgtcaagttg aggcagccat tttggaagat 420 cgtcgcatta ctgttcgcca aatagcccaa gaggtcaaga ttagtacagg gtctgtggaa 480 actatcattc ataagcattt gcatatgcaa aaggtgtctg ccagatggat tcccaggttg 540 ctgacacctt tccagaagca agaacgcgtc gagtgctcga ggatgaattt ggagatgtgc 600 caagaagatg agtcaaactt tttcaaaaga ctgattacac aggatgaaac ttgggtccat 660 cactatgatc cagagaccaa agcccagtca aagcagtgga agcactcgga ctcaccacct 720 ccaaagaagg caagggtgca gccctctgct ggcaaggtca tgctcacagt cttctgggac 780 caggacggag tggtgatgac agatttccta gcaaagggta ccacaattac aggggcctac 840 tacgcttcac ttttgacgaa attaagggaa gctatcaaaa tcaagaggcg gggcaagatc 900 agcaaaggta tcctcctcct gcaggacaac gctccggtcc acaactcgca tgtcgccaga 960 tcagaagcac gggcgtgtgg ctatgaaatc ctcccccacc ccccttactc tcctgacctt 1020 gcaccctctg actttcatct cttcccagcc atgaagttgt ttttgaaagg aaagcgtttc 1080 caagatgatg cagccttgat ttctgaagtt acgacgtggt tggaggacca acctagggtg 1140 ttctacaaaa acggtgtcca gagctgcatc aaacgatggg agaaatgcat aactctgggt 1200 ggttcctatg tagaaaagga ctaacaactg tgccaagttt aatgaatctc ctactatggg 1260 aaatgggtca ggaccattac ttattgcacg cccctcg 1297 // ID Gypsy-1_DPu-LTR repbase; DNA; INV; 212 BP. XX AC scaffold_50; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DPu_; KW Gypsy-1_DPu-LTR; Gypsy-1_DPu-I. XX NM Gypsy-1_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 718-718 (2010). XX DR Genome; scaffold_50; Positions 510787 510576. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 212 BP; 34 A; 76 C; 41 G; 61 T; 0 other; tgttatagtc tccgcctagg ccctgccccc cctccgtctt agctcgacac atgagccccc 60 ctgcggctca tgtcggccct acttctctcc ggccacgggc cagtcagtct tgccttagtg 120 ttctgcctgt acagtactcc catcccgtcg ttcaatacag cttggttaac tcggacctca 180 gtttgttctt caatcctcgt tagatcataa ca 212 // ID hAT-46_HM repbase; DNA; INV; 2931 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-46_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2931 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2034-2034 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1060..2787 FT /product="hAT-46_HM_1p" FT /translation="MCICCHSAISTADHLCEIISKFGVGSPLEHLKLHRTK FT CTKVIQNVISSTLEEEISKEIKSKPFSILIDESTDVSSMKHLAICVRYFSE FT KEKQIVDDFLGLIQVISTTGEDLFNALNYKLHSIGLTLKNCIGYSSDGASN FT VIGIHNSVWSRVLTKSPNCIMMRCICHSLNLVVQNAFNEIPSPVGFLLSEV FT PRFFNKSILRRNEYEKLFELMNPNNERVGTPSPFQKFSATRWLVRGKCFYN FT LLVNWQELKAFFICAEMNANSDVRYKISSISKMFQDSTLYLYVVFLTPVIQ FT EFERLNSLFQGSNVDPEDLMSELNMHLKSLKQRVFDTKGFVLPLQCVDFGA FT KFSSECQIYIQEQPPNAKLNAEESVRNVKIRCHXFLQKAITELEKRLPDNR FT SIFHGLSNLSPMKVLSQVEKVPFSKLPFPHLMGTKSQEIENQYRKVSTVEW FT TESAIFKEKEVPKNTVEFWIALRDYVDIFGHPAFRDISDYCLACLCVPISN FT CFVERVFSQVTYVKSKQRNRMSNQVLDAVIRIKSYLMSRKICCKDFNIPEL FT MISKVGNEIYATEACENEDELHVLAELY*" XX SQ Sequence 2931 BP; 964 A; 489 C; 481 G; 996 T; 1 other; cagggttgcc atctggtttt ttttcaagcc agatttctca aatctggttt gtttttttta 60 ggttttgctt gaaatttttt tcctggttta ttttagaaaa tctggtttaa atctgttttt 120 ttttaaaagt gttaggtgtt tttttttctt tttggattat tttattaaat ctggtttaat 180 tctggtttat tttttgttaa atcaatttaa tcaagctgtt ttgtttcata aaaaaagtga 240 aatagagcga aacaaacaaa aatgaatgaa aattcgattg aaattatgaa taaaaagaaa 300 aaaaagaacg aaaatgagta aaaaaatttt ttgtttacca cggtttatta aagaaattca 360 aagaaattta aagaatgaat aaactctcac tcgcgagttt caacaagttc aaacaattta 420 tttggattcg ttccattcat ttaaaacttg gtttttttct ttctttctaa aaattttttc 480 tttctgaaaa tttttttcta taaattattg catctgcgga aaatattaga aaaatctggc 540 atttaaaaag acgatttaaa gtgaaatact ttacgtaaaa aaggtaaaac gagtttttaa 600 tgtttctgcg gatattgaaa atttagtaac tgttactatt caaataacaa ctaattaatt 660 tttgctttct tcttaaaata tctaatttta aaattgatca gacctattgg ttattaagtt 720 taataatctg tagcttatac ttgatttttt gttttagatt agcatcttta aattgactat 780 gccaaacaaa aaatcatatg aaaactctcg tagttatcga agtgagtggg aaaaagaatt 840 tacctgggta aaaaaaaaag ctctgatgca tctgaacaag ctttttgtcg cctttgccgc 900 actaatctac gtccacacaa aggtagctta gagcaacatc aaaaagcagt ttcacacatt 960 aaaagagaaa caacattgaa tccatttcaa aagaagatta attttccggg tcaagtaaaa 1020 atttcaggtg caaacaaaaa agcagacatt cagctggcta tgtgtatctg ctgtcactca 1080 gcaatttcaa ccgctgacca tttgtgtgaa attatttcta agtttggtgt aggcagtccc 1140 ctggaacatc taaaactcca ccgcactaag tgtactaaag taatccagaa tgtcatctca 1200 tcgactctag aagaagaaat atctaaagaa atcaaatcaa aaccgttttc cattctcatt 1260 gatgaatcaa cggatgtgag ttcaatgaag caccttgcaa tatgcgtgag atatttttca 1320 gaaaaagaga aacagattgt cgatgacttt cttggcctca ttcaagtgat ttcaacaact 1380 ggcgaagatc tttttaatgc tttgaattat aaactccact ctattggatt aactcttaaa 1440 aactgcatcg gatacagctc agatggtgcc tcaaatgtta ttggcataca taactcagtc 1500 tggtctcgtg tactaactaa atcgcctaac tgtattatga tgaggtgcat ttgtcactca 1560 ctaaatcttg tggtgcaaaa tgcttttaat gaaattccat ctccagtcgg ctttcttctg 1620 agcgaggtac cccggttttt taacaagagc attcttcgtc gaaacgaata tgaaaaacta 1680 tttgaactca tgaatccaaa caatgaacgc gtgggaaccc catccccttt tcaaaagttt 1740 tctgcaactc gttggttggt tcgaggaaag tgtttttata acttgcttgt aaattggcaa 1800 gaactaaaag cattctttat ttgtgctgaa atgaatgcaa actctgatgt ccgttacaaa 1860 atctcttcaa taagcaaaat gttccaagac agcacccttt atctctatgt tgtttttctc 1920 actccagtca ttcaagagtt tgagcgtcta aactctctct ttcaagggtc aaatgttgat 1980 ccagaagatc taatgagtga gctaaatatg catctcaaaa gcttaaagca gcgtgttttt 2040 gataccaagg gctttgtact tcctctacag tgtgttgatt ttggagccaa attctcctca 2100 gagtgccaga tttacattca agagcaacct ccaaatgcaa aactgaatgc tgaagaatcg 2160 gtcagaaatg taaaaattcg ctgtcatgrt tttcttcaaa aggctataac cgagcttgaa 2220 aaaagactac cagataacag atcaattttt cacggcttgt caaatctgtc tccaatgaaa 2280 gttctttccc aagttgaaaa ggttccgttt tcaaagctac cttttcctca cctaatgggt 2340 acaaaaagtc aagaaattga aaaccaatac agaaaagtga gtaccgttga atggactgag 2400 tctgccattt ttaaagaaaa agaagttcct aaaaatactg ttgaattctg gatagcttta 2460 agggactatg tcgacatttt tggacatccc gcttttcgcg acatttccga ctactgcctt 2520 gcttgtctgt gtgttcctat ttctaactgt tttgttgaga gagtcttttc acaggtcacc 2580 tatgtgaaaa gcaaacagag aaacagaatg tcaaaccaag tgttggatgc agtcattcga 2640 ataaaatctt atttaatgag ccgaaagatt tgttgtaaag actttaacat tcccgaatta 2700 atgatttcaa aggtcggaaa tgaaatttac gcaacagaag catgcgaaaa tgaagacgaa 2760 cttcacgtat tagcggaact ttattaaatg ttttaaagat ttacacttaa caaaataaat 2820 tttttaataa aatagttcta ctttacttaa tttttttttt gttctggttt atttctggtt 2880 tattttttat atttttttgg cttgttgcca aaaaaaaacc tggcaaccct g 2931 // ID BEL-159_AA-LTR repbase; DNA; INV; 696 BP. XX AC supercont1.186; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-159_AA_; KW BEL-159_AA-I; BEL-159_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-696 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.186; Positions 537366 538061. XX SQ Sequence 696 BP; 220 A; 148 C; 131 G; 197 T; 0 other; tgttaaaaac agccagtctt ccaacactgc agatgacgga gtgaattccg tatgctccct 60 gtaaccgtct cgttaaccaa acgtcagaat gaggggaata aacaaaatgt gccaaatatc 120 ggtcaattca ttgttcatcg ttggaccgca ttcgagtgac cattgtacga tcgtgttcaa 180 cgccgatttg ccaatagtag gccagaaaat tatcatcacc tggacagcgt ccttccaaat 240 cccgaacgag ctacacgaga tcgacccagg aagctccgcc cagttcctcg aatttgccga 300 cgaagtcgga ctgataagtt cacaggcaag aaggtgtcca accttcttag atttgaaata 360 cgcgaagttc tgacccgtga gttgtgaatc taaattacct aatctctaac ctaaattttg 420 aattccaaat taaaattata tacatgcact tattctaatg ctcaatattg aactacatat 480 attcgtcagg cgaggtgaat tacgattatt atttgagatt gttggcgagt cgtggtttca 540 acgattaggc agtagaacac tgaatgtaag ttaatgaatt atgtttgtat taaaataatt 600 taactaaatc tccaataaat tacagcttta aagctgcttc tgtaaagaac tgctaacaga 660 agtgtttctg ctctaaaacc gttcgactcc acaaca 696 // ID Gypsy-12_SI-I repbase; DNA; INV; 4948 BP. XX AC AEAQ01024794; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_SI_; KW Gypsy-12_SI-LTR; Gypsy-12_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4948 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024794; Positions 865 5812. XX CC Positions [2342-2800] - Reverse transcriptase CC Positions [3818-4291] - Integrase core CC 'ATATGT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1262..3265,3269..4696) FT /product="Gypsy-12_SI-I_1p" FT /translation="MRKMRCIAEGVSESEKKPSASASAPKPKDPREATCRN FT CGKRGHVHTTCRAEATCFYCKAKGHRQFDCPLVKKKTEGQPARQRPAAVTA FT AAVSAEPEEREEVVAAVAQQQETRLELTGSSVRVRLLNVANCRTVALVDTG FT SPVSLVKLDLVANYIEEDKEKLLPPRKQLRSLSCDLLKVVGLIRLDLRIEA FT LGDSKFSNNFYVVENYACDAPMILGRDFLAHEKLSVAYREHEAGSCDSVAN FT LELFAKLPECEVEATLKSLEQILENIETDVGVEGKRKIVEIISEIERTDVA FT PIDDGYAVRVHLKDDSVYAFAPRRFAHGERVQMRDIIDDLLARGIVQPSIS FT PYCARVIPVRKKNGTIRMCVDLRPLNSRVLKQKYPFPRIEMCLSRLTGKVI FT FTLLDLKDGFHQIPVHPDSMKYFSFATPDGQFEYKYLPLGYSEAPAEFQKR FT LMQILNPLTRQEKLIVYMDDILIASETVEENLTTVKEVLSILKRYRFELNY FT EKCLFLKKKIEFLGYIISGEGITLSARHTEAIRNYKSPSNVHEVQRWLGLA FT SYFRKFIKDFAVKAKPIQDLLKKDVAFKFDENCVRSFETLRRELTEYPILK FT VYDPGAETELHTDASSHGLGAILLQKSGNGQWAAVAYYSRTTNNAEKNYHS FT YELEMLAVVRAVERFHIYLGINFNILTDCHALVYAVNKASLNPRIARWTLA FT LQGYQFELTHRQGTRMQHVDALSRCIAYISERPLEQELEIRQLTDPHVKEI FT AHELEFKDSDKFALVDGLIYRKEDDRLKFVVPEPMVASIVRAYHDDMAHVG FT HEKTYQGIRRNYWFPSMRKRIHNYLENCLTCIVSNESPNRLEGETALYPSP FT SAPMQQIHVDHFGPLTLTKDGWKHILVAVDAFTRFTWLFPTKSTTTRETVN FT ALEFIFATFGRPDELVSDRGTAYTSQEFTDFTELHKVKHRRVAVAAPWANG FT TVERVNRFIKSSLIKMIKDPNEWKDHLPRLQYVINNTYHAVTKSTPSMLML FT GYDQRNHLDSDLAHYVRELIAVDTNTDEQREVASKVARRATDSVRQYNKEY FT RDSSKRKPTMYNVGEFVMIRDTQTKPGTSGKLKPEYKGPYRIDKVLGSNRY FT VVKEIPDFNLAPRPLNTILSSDRMKRWIRLDAPTSV" XX SQ Sequence 4948 BP; 1393 A; 1180 C; 1381 G; 994 T; 0 other; acctcagaag tgggatcact cccagggaaa ttactcccag cgccgtcagg gacaggatca 60 gggattgagc atttcgcgcc gtccgcgaag gcaacgggct attccccatt gccgcggaaa 120 agcgaacagt gtcgtcgagt tgtaaaagcg cggcccgcag aaaaccgtta cggcggtagg 180 cgagcgcgcg cgaaatttgt cgtgtcgaca cgcgccggcc gcgtgatatc gggcgtggcg 240 cgacgcgtga agccggaata cgtcgagtca aagaaccgag agtgcgggag tgcgcgcgat 300 cccaagtcga aggaagtgta gcggagtcca ccgcaagaga ctagaaatgg attgcgagca 360 tctagagaat gcggcgctag aagtgctgca aaaggaagcg atagcctgcg gattgccgtt 420 cacaccagac cgagcggcct taatagagtc cattctctca tacagagaga ggaacctccg 480 agcgtcgggg cgggcgaccg ggccggaaat ccgggagagt cggcagctac ggagcagctc 540 cgacagacgg tcgagctgct agcgcagaat atgcgacaaa tgcagcagga aatgtgcagc 600 aagagttcag ggcggagata ttgaaccgcc tgacgccgtc ttgccggaga atcggagtcg 660 aagcggaagg agtcctggag gaacgacttc cggagaccgc ggcgaacaaa tcgcccgcga 720 cgatcgccgg atccactggg agtcaagagt ggggaagaca gcagaggctt cctcacggca 780 atttgacgca gtgactggcg acgcaaatcc ctgaattttc gggttcacgg gatgaggatg 840 tggccgtctg ggtacaccgt gtggatcgcg tggcgcgcat ccacgacgct cccgatgggg 900 tcacattgtt ggccgcttca tcgaaactca cgcgaaaagt gaaaacgtgg tacgaatcgc 960 tcgatggaga gggcatcgag tcgtggagag ctctgaaggc agagctcctc gcaatatatc 1020 agcgcaatga tctggcgtac agatcactgc aaaaaataga ggcgaggaaa tggcatcatc 1080 aaaaggagtc ctttgatgat tacgccatcg aaaaaatcgc tttgatgaat cgtctggact 1140 tgtctagaaa agagaagacg catctgctca taagcggaat cggccacagc gctaccaagt 1200 cggccgcgct gtcggtagcc gatcttccgc tggaacaatt catggttagg ttaggtgcga 1260 gatgcgcaag atgcgatgca tcgcggaggg cgtctcggag tccgagaaga agccgagcgc 1320 ctccgcgagc gcgccgaagc cgaaggatcc aagggaggcc acctgcagga actgtggcaa 1380 gcgggggcac gtacatacca catgccgagc ggaagcaacc tgtttctact gcaaggcgaa 1440 ggggcaccga caattcgatt gccccttggt gaagaagaag accgaaggac agccagctcg 1500 ccagaggccg gccgcagtca cggcagccgc ggtctccgcc gaaccagagg aacgggagga 1560 ggtggtagca gccgtggcac agcaacagga gaccaggttg gagctcaccg gatccagcgt 1620 gcgagtaaga ttgctgaacg tagcgaattg taggaccgta gcgttggtcg acacgggtag 1680 tccggtgtca ttagttaaat tagatctcgt ggcgaattat atcgaagaag acaaagaaaa 1740 attgcttcct ccgagaaagc agttgagaag cttatcttgt gacctcttaa aggtcgtcgg 1800 gctaataagg ctagaccttc gtatcgaggc gcttggagat agcaaatttt caaataactt 1860 ctatgttgtc gaaaactacg catgtgacgc gcccatgatt ctcggccgtg acttcttggc 1920 gcacgaaaag ctctcggtcg cgtaccgcga acacgaggcc gggtcgtgcg acagcgtagc 1980 aaatctagaa ttgtttgcga aactaccgga atgcgaggtc gaggccacgc tgaagagcct 2040 cgagcaaata ttagagaaca tagaaacaga tgtcggggta gaagggaaga ggaaaatagt 2100 cgagataatt tcggaaatcg agcgtacgga cgtagcaccg atcgacgacg ggtatgccgt 2160 gagagtgcat cttaaagatg actcggtcta tgcgttcgct ccccgccgct tcgcgcacgg 2220 cgagcgagtg cagatgcgag atatcattga cgatctgctg gcgcgcggta tcgtccagcc 2280 gagcatctcg ccgtattgcg ccagggtgat tcctgtccgg aaaaagaatg gcaccattag 2340 gatgtgcgtc gatttgcgac cgctgaatag tcgcgtactt aaacaaaaat atccttttcc 2400 gcggattgaa atgtgcttgt cccgactaac gggaaaggtt atctttacgc ttctggactt 2460 aaaggacgga ttccaccaga ttccggtcca cccggacagc atgaagtact ttagcttcgc 2520 gacgccggac gggcaatttg agtacaaata cctgccatta gggtactctg aagccccggc 2580 ggaattccag aagcgactga tgcagatcct taatccgtta acgcgacaag agaaactgat 2640 agtgtacatg gacgatatat tgatagcgtc cgagacagta gaagaaaatt taacaaccgt 2700 gaaggaagtt ttgtcgattt tgaaaagata tcggtttgag ctgaactacg aaaagtgcct 2760 tttcctgaag aaaaaaatcg aatttctcgg gtacattata tccggagagg gaatcacgct 2820 gagcgcgcgg cacacggagg cgatccgaaa ttacaaaagc ccgtcgaacg tgcacgaagt 2880 acagcgatgg ttagggctag cgagctactt tcggaaattc attaaagatt tcgcggtgaa 2940 agcaaagccg atacaggact tattgaaaaa ggacgtcgcc tttaaattcg acgagaattg 3000 cgtcagatcg tttgagacgt taagaaggga gctcacggag tatcccattt tgaaagtgta 3060 cgatccgggc gccgaaactg agctgcatac cgatgccagc agccacggcc tcggagcaat 3120 cctgctacaa aaatcgggta acggccaatg ggccgccgtc gcgtattata gccgtacgac 3180 aaacaatgcc gagaaaaatt accatagtta cgaattagaa atgctagcgg tcgtgcgggc 3240 agtagagagg tttcacattt acctataagg gataaatttc aatatactga ccgattgcca 3300 tgcgttagtg tacgcggtga acaaagcgag ccttaacccg cggattgctc gctggacctt 3360 ggcgctgcaa ggctaccaat ttgaattgac gcatcgacag gggacaagga tgcaacacgt 3420 cgatgcgctg agccgttgca tagcctacat tagtgaaagg ccgctagaac aagagttgga 3480 gatccgtcag ctcacggatc ctcatgttaa agagatagct cacgaattag aatttaaaga 3540 cagcgataaa tttgcccttg tcgatgggtt aatttaccgg aaagaagacg atcgcctgaa 3600 gtttgtcgtt ccggaaccca tggtcgcgag catagtaagg gcataccacg acgacatggc 3660 ccacgtaggg cacgaaaaaa cgtatcaggg aattcgaaga aattactggt ttccgagtat 3720 gcgtaaacgc atacacaatt accttgagaa ttgtctgacg tgcatcgtta gtaacgagtc 3780 tccgaatagg ttggagggcg aaaccgcgtt atatccttca ccttccgcac ccatgcagca 3840 gatacacgtc gaccatttcg gaccgttgac gctaacgaag gacggctgga agcacatcct 3900 agtagccgtc gacgcgttca cgcgattcac ttggttattt cccacaaagt ctaccacgac 3960 gcgcgaaacg gtcaatgcgc tcgaatttat tttcgccacg tttggtagac cggacgagct 4020 agtgagcgat cgcgggacag cgtacacgtc gcaagaattt accgatttta ccgagctaca 4080 caaggtgaag catcgtcgag tagccgtagc cgccccatgg gcaaacggga cggtggaacg 4140 ggtgaatcga ttcattaaaa gttcgctaat taaaatgatc aaagacccta acgaatggaa 4200 agaccattta ccgcgcttac agtacgtaat aaataatacc taccatgcgg taacaaagag 4260 taccccctcc atgttaatgc tcgggtacga tcaacgtaat caccttgatt ccgacttagc 4320 gcattatgta cgggaactga tcgccgttga tactaacacc gacgagcaac gagaagttgc 4380 atcgaaagta gcccgtcggg cgaccgacag tgtacgacaa tataacaaag aatatcggga 4440 tagcagcaag cgaaaaccga caatgtataa cgtaggggaa tttgtaatga taagagatac 4500 tcagacaaag ccgggtacga gcgggaaact taaaccggag tataaaggac cttatcgaat 4560 cgacaaggtc ctcggcagta accgctacgt ggtcaaagaa atacccgact ttaatctggc 4620 tccccgccca ttaaacacca tactctcgtc ggaccgtatg aaaagatgga tcaggctcga 4680 cgctccgaca agcgtatgat gtatttttgc aatgtaacct tagaatcaca acaattatcc 4740 actcgagtca ttagatataa gagagagagc ctgtatagta ttaagataga ttataagata 4800 atgcgacgaa ctagaattaa ttagcctcgc cgattgtaac tcgccaattc tcctgcggag 4860 gtatgcgatc gatatttata agacgtctgc aaaataaaaa tccaccctcc ccccacttcg 4920 ggacgaagca aggtgtgaat ggccgagc 4948 // ID Gypsy-146_AA-LTR repbase; DNA; INV; 229 BP. XX AC supercont1.6; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-146_AA_; KW Gypsy-146_AA-I; Gypsy-146_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-229 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.6; Positions 2258626 2258398. XX SQ Sequence 229 BP; 60 A; 52 C; 52 G; 65 T; 0 other; tgttgggagg cccctcaccg acatcgccac aagggctctt cagctgatag gtgtcagact 60 gacagatgcc aacacgatcg tcggtgtatg acccggaagg cgtcatcagt tagattttgc 120 gatagcaagt gattcacacg tttttagtaa ttgtaacctg tagcaaataa agttatattt 180 tatactgtcc gtgaatcgcg tgcgtatctt catttccaat atcgcaaca 229 // ID hATm-5_HM repbase; DNA; INV; 3237 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3237 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 209-209 (2008). XX DR [1] (Consensus) XX CC This family is very distant from most hATm elements and is closer CC to hATx-9_SM. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(370..681,678..2924) FT /product="hATm-5_HM_1p" FT /translation="MSLSFTSFMSLLCNYVFMSFSWFKNCIKRVLVFICNH FT INENSNNYIKLTMTTSKNTRLSMETKLSTYLGRSKDLIPSEVPTLRDILRK FT GLLIQEALLREEDVNRQVIIKKLKIYETNYTFCIILYIFVFRRNISVRDLS FT RLLADIVFQQWMKANALFKPPVVIGKKTLMEKIEFSWNKIKDIAHGKASQT FT HVSFWESKLDQLLDITCCKCEIMLCNELCLKTYSCVDQVHIKCTCVKEKKI FT PKVELPWLNAQRKKKGSISKYQMGFNDIKETARQISREEKKMKRKSGVEKK FT LEMEKKKYKVENDDVVECLIEDYSDLKKEKISEDPLQNKIWKKNYMSISHT FT AEVSIRYDISPAATSAIVTSFLLDLIEAKYLTPDMSYLAVDPMKVRRAREA FT IMERVIMSEEKLIDEEPIRGLFIDGRKDSTLVLEQNPLTNKFHMRTIKEEH FT ITVTCEPDGRYLTHYTPPKAIPPSKPAKEAAFALYNWMVPRAIHESLEVVG FT GDSTNSMTGSGKDGGLLSNLEQLVGRKLFWSICMLHCNELPLRHIIVELDG FT PTNSKDGFTGPIGKALSKVNSIKRLETFEPIEQLEPLIEIPEKVLKTMSTD FT AALCYKLVQCLSTGKMDINLSKAKCGNICHSRWLTTAEAIIFLYMSDHTFT FT GDVLQKFHLIVKWVAQVYFHMWYEIKVKHSIVDGPGHLVTLLRLLKKQDGA FT YFAHSESLILTLLSSSDEESRKFAIKTIKDIRKTSQFGDTAVRSRKKVFLN FT TEALSIKELIDWEKEKISEPVFTCQISTYDLDKYLVTRFEAPYFPLHTQSC FT ERAVKEVTEAALSVCGFEKRDGFVRARMNNRKTLPELKSKKNFSNLFITKL FT " XX SQ Sequence 3237 BP; 1186 A; 462 C; 569 G; 1020 T; 0 other; aaatgcaagg gggtcaaaat tttaaaatta ttaaaatatg aaactattta gattttttta 60 tatattacca tatgctgtga aacgtgaata attttttttt ctaaaaaaat aaatttttgc 120 ggggaaaaag gccttataag attgaaaaaa aatgttaaaa tataaaatat aatgttaaat 180 tttttaaata ataaggacat tcattggaat tttttcatta gaaaatttca gactttattt 240 ttatgaatga cgtctttata attcagacaa aatgactata attaaaagtg tcttaataat 300 gatgttattt tacttggggg ttgaattgta agtgatttca gactattacg actttgtaat 360 tatgttttta tgagtttatc ttttacgagt tttatgagtt tactttgtaa ttatgttttt 420 atgagttttt cgtggtttaa aaactgtata aaaagagttc ttgtttttat ttgcaatcat 480 ataaatgaaa attctaataa ttatattaaa ttaactatga ctacttcaaa aaacacaaga 540 ctctctatgg aaacaaagtt atcaacatat ttaggcagaa gtaaggatct tattccgtca 600 gaagtgccaa ctttaagaga tattttgaga aaaggtctgc ttatccaaga agctctactt 660 cgagaagaag atgttaacag gtaataatca agaaactcaa aatttatgaa acaaattata 720 cattttgcat tatattatat atttttgttt ttagaaggaa tatttcagtg agagaccttt 780 ctcgactact agctgatatt gtttttcagc agtggatgaa agcaaatgct ctttttaaac 840 ctccagtggt tattggaaag aaaactttga tggagaaaat tgaattttcc tggaacaaga 900 tcaaagatat agcacatggg aaagcttctc agacacacgt gtctttttgg gaatcaaaac 960 tggatcaatt gctagatata acatgttgca agtgtgaaat tatgttatgt aacgagttgt 1020 gtttgaaaac ttattcttgc gttgatcaag tacatattaa atgtacttgt gtaaaagaga 1080 aaaaaattcc aaaggttgaa cttccatggc ttaatgccca aaggaaaaaa aaaggatcta 1140 tttcaaaata ccagatgggt ttcaatgata ttaaagagac cgcgagacag atttcgcgag 1200 aggagaaaaa aatgaagagg aagtctggag tagagaaaaa attagaaatg gaaaaaaaaa 1260 aatataaggt agaaaatgac gacgttgttg aatgtttaat tgaggattat tcagatttaa 1320 aaaaggaaaa gatttcagaa gatccgcttc aaaacaagat ttggaaaaaa aactatatgt 1380 caataagtca tactgcagaa gtttcaataa gatatgatat ttcaccagca gcaacatcag 1440 ctattgttac atcctttctt ctagacctta ttgaagcaaa gtatttgacg cctgatatgt 1500 cttatttagc tgttgatcct atgaaagtaa gaagagcaag ggaagcaata atggagagag 1560 taataatgtc tgaagaaaag ttaattgacg aagaacctat tagaggtttg tttattgatg 1620 gcaggaagga ctcaacttta gtgttggaac agaatcctct aacaaataaa tttcacatga 1680 ggactataaa agaagaacat ataactgtga cctgtgagcc tgatggaaga tacctcactc 1740 attacacacc acctaaagct ataccaccaa gcaaaccagc caaagaagcc gcatttgctc 1800 tttacaattg gatggttcca agggcaatcc atgagtcctt agaggtcgta ggtggagatt 1860 ctacaaactc catgacaggt tcaggaaaag atggaggttt gctgtcaaat ctggaacaat 1920 tagtagggag aaaattattc tggtctattt gtatgctcca ttgtaatgaa ttaccattga 1980 ggcacataat tgttgaactg gatggcccaa ctaattcaaa ggatggtttt actggcccta 2040 ttggaaaagc tttatctaaa gtcaatagca taaaaagact tgaaacgttt gaacctatag 2100 aacagctgga acctttaatt gaaattcccg aaaaagttct caaaacaatg tcgacagatg 2160 cagctctctg ttataagttg gtacaatgtt tgtccactgg caagatggac ataaatcttt 2220 ctaaagcaaa gtgtggaaat atttgccaca gcagatggct taccacagca gaagctatca 2280 ttttcctcta tatgagtgat cacactttta ctggtgacgt tcttcaaaag tttcatttaa 2340 tcgttaaatg ggtagctcaa gtatatttcc acatgtggta tgaaatcaaa gtaaaacaca 2400 gtattgtaga tggcccaggt cacttagtaa cactgttgcg actgctgaaa aaacaagatg 2460 gagcctattt cgcccactcg gaatccctta tattaactct tttgtcaagt tcagatgaag 2520 aatcgaggaa gtttgccatt aaaacaatta aagatataag gaaaacctcg cagttcggag 2580 acacagctgt aaggtcaaga aaaaaggttt ttctcaatac agaagctttg tctatcaaag 2640 agctaataga ttgggagaag gaaaaaatat cagaaccagt gtttacgtgc caaatctcta 2700 catacgatct tgacaaatat ctggtcactc gctttgaagc tccgtacttt cctcttcaca 2760 cgcaatcttg tgaaagggca gtaaaagaag ttacagaagc agcactttcc gtttgtggtt 2820 ttgaaaaaag agacggtttt gtgagagcac gaatgaataa cagaaaaact ttgcccgagc 2880 ttaaatcaaa aaaaaatttt agtaaccttt ttataacaaa attgtaaaca aaaacatttt 2940 tatgacaata ttataactta aatttgtttt gtattaaatg ttgttaagct aaacgatgtt 3000 gttaagttga actataaaaa cgtcattcca acaaaaataa agtctgaaaa tttattatga 3060 aaaaaattcc aatttttaca ttttttttca atcttataag gcctttttcc ccgcgaaaat 3120 ttattttttt tagaaaaaaa aattattcac gtttcacagc atatggtaat atataaaaaa 3180 aaaatctaaa tagtttcata tttcaataat tttaaaattt tgacccccct tgcattt 3237 // ID Harbinger-N1B_BF repbase; DNA; INV; 412 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N1B_BF autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Harbinger-N1B_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-412 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-412 RA Kapitonov V. and Jurka J.; RT "Harbinger-N1_BF - a family of autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 813-813 (2008). XX DR [2] (Consensus) XX CC It is a subfamily of Harbinger-N1_BF. XX SQ Sequence 412 BP; 138 A; 81 C; 73 G; 120 T; 0 other; ggccatgttg atttgattat atggatgaca tccgcgcgcg catcaatttt cggccgtttc 60 caaaaataaa aaaaagtttt caaccatact gtaaatcgta caaagcagct gcaaaatgag 120 caaatatatg ccgtagattg caaaaacaaa tatcggaaaa cgtatactaa tgtcatagaa 180 tgccaacgtc aattccaagg tcaaagaaga agatgtgaaa gaacaaaaat gaagaatcta 240 ttgttcggta taccatactc tgtgtgttca gtcatttgtt ttcaaccttc ctgaccactt 300 agatttcata atgaaggcaa cttttttttg gccaaacttt ttttattcgc acgctcgcat 360 cagttttggg gttcccagag gatgtcatcc atataatcaa atcaacatgg cc 412 // ID Gypsy-14-LTR_HM repbase; DNA; INV; 223 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-14-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-223 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 401-401 (2009). XX DR [1] (Consensus) XX CC Both CA-3' and TA-3' is observed at the 3'-end of the LTR. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 223 BP; 74 A; 32 C; 39 G; 78 T; 0 other; tgttgtgttt gcggccctat tgtggcacgg catttttcta aagataaaac attttgaata 60 cgtcacaaaa caggagcgga aaaacgtgct caaaaataag atggcggttt cgtttcagaa 120 cttatatgta taaaatataa agtgctaatg actacaaaga gatttattgt ttaacatatc 180 tcttggtttt attttcattt acctgtatat tacgtacata ata 223 // ID BEL-81_CQ-I repbase; DNA; INV; 6326 BP. XX AC AAWU01022158; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-81_CQ_; KW BEL-81_CQ-LTR; BEL-81_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6326 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 303-303 (2011). XX DR GenBank; AAWU01022158; Positions 13557 19882. XX CC Positions [5144-5746] - Integrase core CC 'TTATC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 833..6151 FT /product="BEL-81_CQ-I_2p" FT /translation="MPKKVSYKLKIRNTILASLNRADKFLQNFVLEKHADQ FT VQSRLDAVNQAWAEFEALQLEIETIEEEDLKEEDDATAMEEEHERVREQFE FT GLYHKVRAGLKSRLPSGVASGSSACVARAGIQLPKITLPKFSGEYDEWLPF FT HDTFKSLIHENVDLTAIQKFHYLRDCLIGEARKMINSSEFSAKSYQVVWDL FT LVKRHANKYLQKKRYVNGLLQFPRVKGVSSSSIHDVMDHFSRCTQILDQLG FT EVTGGWGVMLTQLLVSKLDDVTQKEWEALAAKKEDPQFTDVIAFLEERTKV FT LDAVAVDQLLGSQRGSPSVSSVPAAQPKKSFPKLTVNAASDSSSPKCRMCS FT GPHSLSSCSDFKKLSLDGRNRLVLSKRLCRNCLWSGHFVRDCHVRSRCSSC FT SEKHNTLLHPVGRTGSGSTPSAAVTGDSQEGGSRSSTAPAAQPQAVTSSVA FT TIYAANVAAESRKVHVFLSTVLVAVKDCHGRLHTARALLDSGSQANLISER FT LCQILRLPRKKVSVPISGVGSSRMQVDSSVSAIVSSRVTNYTVPMECLVLK FT KVTEDQPSATIPIDQWNLPSDMVLADPGFHKRAPIDLLLGLEFFYEFLLLN FT GGRVQIQRPGEGLPLFVNTVFGWIAAGKTDLGSLNPVPSCHVSVNATLEEK FT IERFWTIEELQEAPKRSQEEQDCEEHFQATFSRDVTGRYVVRLPKRMGFEK FT MVGESRNMAVRRLMQLERRLGKDESLRKRYNEAIQAYLDQDHMKVVSEEEL FT KGDKRLECYLPHHPVFKESSTTTKIRPVFDGSAKTTSNFSLNEALMIGPVL FT QDPLFDLVLRFRKKQVALVADIEKMYLQVKVHPDDTPLQRILWRPSPSEPI FT RTYEMLRVTFGLAPSSYLATRCMQQLAQDEGEVYSRAKEALLRDFYVDDFI FT GGADSEEEALLLRQELEQLLPKGGFRLRKWVSNATGALAGLAADDLGTQTT FT LSFDQEQVKTLGINWQPGPDVLGIDVTGLSVNGQWTRRKVYSVIAQLWDPT FT GITAPVISWAKIRMQLLWVATQGWDDPLRPDLAGRWTEFYQQLPELSKISV FT DRCAFVENPVHVEFHVFADASEAAYGACIYARSISQSGLVKISLLAAKSRP FT AGLKKVTLARLELCAALMAARLYRKVVQALKMEGTESWFWSDSTVVLAWLR FT SPSYVWPTFVANRVSHIQELTKGHRWNHVKGTENPADLVSRGVMPKDLVGL FT RLWFYGPKWLESFEEYWSRVKHLEVDEPAEELLEKRKNVLVVSASPEPHPL FT IDRYGSYWKLLRITAYCVRFVRRCQRRKQPPRTPFLTVGDLKEAKFALVRG FT VQQEPFAAEIKALANHRLVPAHSSLKLLNPFLDQQGILRVGGRLGLAGEAY FT STRHPMILPSSNVFTRQMAVAYHETSLHSGPRMTLAQIRHEFWPLNGKQLA FT TYTFRNCVRCFRTNPAPVKQPPGQLPKPRTTLSRAFTVTGVDYCGPVYLKP FT VHRRAAAQKAYIAVFVCFSTKAVHLELVGDLTTAAFLAALRRFVSRRGLPS FT EIHSDNGLNFQGASNHLRELYELLRDNKVITEITSEASRRGIEWKFIPPRA FT PSFGGLWEAAVKSAKTSLVRVLGQRQLTFEDMATVLTQIEAAMNSRPLTPL FT SEDPDELDVLTPGHFLAGSSLLAIPDPNYTDVPTNRLQHYQQLQQLVQQHW FT KRWRREYISQLHNQNQKFPQATQLKAGQMVVLKEDNKAAIEWPLARIVEVY FT PGPDGVVRVVKLRLPNGAVYKRQSARICLLPFEKDSAVQLTSAFNPQPTQG FT AA" XX SQ Sequence 6326 BP; 1292 A; 1629 C; 1982 G; 1423 T; 0 other; ttggtgccgt gaccaggatt ggtacctcgg gtggcccatc gaacgccatc gcgatcatct 60 gcgcgtcgaa cgagttcgcc atcgctgctg cggagaaagt tcgcggccat cgcgattgtt 120 ttggtttggc gctcagcagt agcagagttc atcgacctga gaggaaagtt cagcgacccc 180 gcgcgggtga agaggacatc cattgatacg tgcacggcct ggtggacagg ctcacacggt 240 gagtacactt gtccccattt ttcccgtaac tggtgtcctc attgcgctgt tttgaacttt 300 ttcggtgctg cttttcgtga cttctgctgc tgctgaggtt ctgttggacg attgctggac 360 gatcacgagc tgtttggacc gaccgctggt tatttcggcg acgattggag tacggaaggt 420 acttgtgccg tactgctggc cattgtttgg ccattgttta cccgtgctag ctcggggcgc 480 gtatgttacc gggggcgcgt atttagttac ggttttttgc aaggcctgta cagccaggct 540 cacagggtga gtactcggct ggaattttcg ttgctcgacg gtgtttgcaa tattttgctt 600 ttctcggtgg acgacgcgct gctctcgagg gactttcgct gtgtttcggc gacgttatcg 660 ctgcggctgt acattggccg gcggagagtt ttggagagta tcctgttggg cgttaaatat 720 acaaggcctt gtttaatagg ctcacaggtg agcctctttc caactcccta gcgattgttt 780 gtgtggtgct tcgcttcgct ctctttttgc agttcagttg ccctgcgtca ccatgccgaa 840 gaaggtgtcg tacaagctga agattcgcaa caccatcctt gcgtctctca accgcgccga 900 caagttcctg cagaactttg tgctggaaaa gcacgccgat caggtgcagt cgcggctcga 960 tgcggtcaac caggcctggg cggagttcga ggcgctccag cttgagatcg agaccatcga 1020 ggaggaggac ctcaaggaag aggatgatgc gacagccatg gaggaggaac atgagcgagt 1080 ccgagagcag ttcgaggggt tgtaccacaa ggtaagggca gggttgaagt cgaggttgcc 1140 ttctggggtt gcgtctgggt ctagtgcatg cgttgcgcgt gccggcatac agttgccgaa 1200 gatcactttg ccaaagttta gtggggagta cgatgagtgg ttgcctttcc acgacacttt 1260 caagtccctc atccacgaaa acgttgattt gaccgcgatt cagaagttcc actacttgcg 1320 agactgtctt atcggtgagg ccagaaagat gatcaactcg agtgagttta gcgcgaagag 1380 ctatcaggtg gtctgggact tgttggtcaa gcggcatgct aacaagtatc tccagaagaa 1440 gcgttatgtc aacgggttgc ttcagtttcc gcgagtgaaa ggggtgtcgt cgagtagcat 1500 ccacgacgtg atggatcatt ttagtcgctg tacgcagatt ctggaccagt tgggcgaagt 1560 gaccggtggc tggggcgtaa tgctgacgca gttgttggtg tcgaagctgg atgacgtgac 1620 gcagaaggaa tgggaggcgt tggcagccaa gaaggaggac ccgcagttca ccgacgtgat 1680 agcgttcctg gaggagcgga cgaaggtctt ggatgcggtc gcagttgatc agctgttggg 1740 cagtcagcgt ggttctccct ccgtgtcttc ggttccagca gcccagccca agaaatcgtt 1800 cccgaagctg acagtgaacg ccgcttccga ctcttcctct ccgaagtgca ggatgtgtag 1860 cggcccacac tcgctttcga gctgttcgga cttcaagaag ttgtcgctgg atggtcggaa 1920 caggctggtt ttgtcgaaga ggctctgcag aaattgtttg tggagtggcc atttcgtgag 1980 ggactgccat gtcaggagtc gctgctccag ttgcagcgag aagcacaaca ccttgctgca 2040 tcctgttggt agaactggtt ccggttccac tccgagcgct gcggtaacgg gagattcaca 2100 agagggtggt tcgcgttcct cgacggcgcc ggcagctcaa ccgcaagcgg tgacgagttc 2160 ggtggccaca atttacgctg caaacgtggc cgcggaatcg cggaaagtcc acgtgttcct 2220 gtcgacggtg ctcgtggcgg tgaaggattg ccatggccgg ttacacaccg caagagcgtt 2280 gctcgatagc gggtcgcagg cgaatctgat cagtgagcga ctgtgtcaga tcttgagatt 2340 gccgcggaag aaggtgagtg tgccgatctc cggcgtcgga agttcccgta tgcaggttga 2400 tagctcggtg tcggcgattg tctcctcgcg tgtcaccaac tatacggtcc cgatggagtg 2460 tctggtgctg aagaaggtga cggaggacca gccgtcggcc acaattccga tcgaccagtg 2520 gaacttgccc tctgacatgg tgttggcgga tcccggattc cacaagcggg ctccaatcga 2580 tttgctgcta gggctggagt tcttctacga gttcctgttg ctgaatggtg gccgggtgca 2640 gatccagcgg ccgggagaag gtcttccgct gttcgtgaac acggtcttcg gttggatagc 2700 tgcgggaaag accgatctgg ggagtctaaa ccccgtcccg agctgtcacg tctccgtcaa 2760 tgcgactttg gaggagaaga tcgagcgctt ttggacgatt gaggagctgc aggaggcacc 2820 gaagcggtcg caagaggaac aggactgtga ggagcatttt caggccacgt tttcacgcga 2880 cgtcacgggt cgatacgtgg tgcgacttcc gaagcgcatg ggtttcgaga agatggtcgg 2940 cgaatcgcga aatatggcgg tgcgacggct gatgcagttg gagcggagac tgggcaagga 3000 cgaaagcttg cgaaagcggt acaacgaggc gattcaggcg tacttggatc aggaccacat 3060 gaaggttgtt tcggaggagg agctgaaggg cgacaagcgg cttgaatgct acctgcctca 3120 ccatccggtc ttcaaggagt ccagcacgac cacgaagatc cggcccgtgt tcgatggttc 3180 tgccaagacg acgtcgaatt tctcgctgaa cgaggcgctg atgattggtc cggttctcca 3240 ggacccgttg ttcgatttgg tgttgcgctt tcggaagaag caggtagctc tggttgcgga 3300 catcgagaag atgtacctgc aggttaaggt ccatccggac gatacccctc tgcagcggat 3360 tctgtggagg ccttcgccgt cggagccaat aaggacgtac gagatgctac gggtgacgtt 3420 tgggttggcc ccatcgtcat accttgccac gcgctgtatg cagcagctgg cgcaggacga 3480 aggagaggtg tattcacgtg cgaaggaagc actgttgcgt gacttctacg tggacgattt 3540 catcggcgga gcggactccg aagaagaagc gctgttgttg cggcaggagc tggagcagct 3600 gttgccgaag ggtggattta ggctgcgcaa gtgggtgtca aacgccacag gggcgttggc 3660 agggctggct gcggacgatt tgggtacgca gaccacgctc agtttcgacc aggagcaggt 3720 gaagacgctg ggaataaact ggcagccagg gccagacgtt ctcggcatcg atgtcacggg 3780 tttgtccgtg aatgggcaat ggaccagacg gaaggtctat tcggtcattg cccaactttg 3840 ggatccaact gggatcactg ctccggtcat ttcgtgggcc aagattcgaa tgcagctgct 3900 gtgggtcgcg actcaaggct gggacgaccc attgagaccg gatttggccg ggaggtggac 3960 tgagttctac cagcagttgc cggagttgtc gaagatttcg gtcgacaggt gcgctttcgt 4020 ggaaaatccg gtccatgtcg agtttcatgt gttcgcggat gcgtccgaag cggcctacgg 4080 agcgtgcatc tacgctcgtt cgatcagcca gtcggggctg gtcaaaattt cgctgttggc 4140 ggccaagtca cgccccgcag ggttgaaaaa ggtcaccctg gcgaggttgg agctgtgtgc 4200 agctttgatg gcagcaaggc tgtaccggaa ggtcgtgcaa gccctcaaga tggagggcac 4260 ggagagttgg ttctggtcgg actcgacagt cgtgctggcg tggctgaggt caccgtcgta 4320 cgtgtggccc acgttcgttg cgaaccgagt ctcgcacatc caggagttga cgaaagggca 4380 tcgctggaac cacgtcaagg ggactgaaaa cccagcggat ctcgtttccc gtggagtcat 4440 gccgaaagat ctcgttggat tgcggctgtg gttctacggt ccgaagtggt tggagtcgtt 4500 cgaggagtac tggagcagag tgaagcattt ggaagtcgac gagcctgctg aggagttgct 4560 ggagaagaga aagaacgtgc tggtcgtgtc tgcttccccg gaacctcacc cgctcatcga 4620 tcgttacggc agttactgga agctgctgag gattacggcg tactgcgtga ggtttgtccg 4680 gagatgtcag cgtcgcaagc agccccctag aaccccattt ctcaccgttg gagatttgaa 4740 ggaggcaaag ttcgctctcg tgcgcggcgt gcagcaggaa cccttcgccg cggagatcaa 4800 ggcgctcgca aaccatcgac tcgttccggc tcactcgtcg ctgaagctgc ttaacccgtt 4860 tctcgaccag cagggcattc tgcgcgttgg tggccggctt ggactcgcgg gcgaagcgta 4920 ctctacccga caccctatga tccttcccag cagcaacgtt ttcacgcgac aaatggcggt 4980 cgcataccac gaaacttcgc tccactctgg cccccgtatg acgctcgctc aaattcggca 5040 cgaattttgg ccattgaacg gcaaacaatt ggcaacttac acttttcgca actgtgtgcg 5100 ctgttttcgt accaaccctg cccctgtaaa gcaaccccct ggtcaactcc caaaacctcg 5160 aacaaccctc tctcgcgcat ttaccgtcac cggcgtagac tattgtggcc ctgtctacct 5220 gaagcccgta caccgtcgag cagctgcgca gaaggcgtac atcgcggtct tcgtgtgctt 5280 cagcacgaaa gcggtccacc tggagctggt gggggacttg accaccgctg cgtttcttgc 5340 tgccctgcgt cgttttgttt ctcgccgagg tctaccgtcc gagatccact cggacaacgg 5400 tctcaacttc cagggagcaa gcaaccacct gcgcgagctg tacgagctgc tacgagacaa 5460 caaggtgatc acggagatca ccagcgaagc gtcccgtcgt gggatcgaat ggaaattcat 5520 tccgccgaga gcgccgagct tcggcggcct atgggaggcg gccgtgaagt cggccaagac 5580 gtcactcgtc cgtgtgctgg gtcaacgaca actcaccttc gaggacatgg cgaccgtcct 5640 cacccagatc gaagcggcca tgaattcgcg cccgcttacc ccgttgtcgg aggacccaga 5700 cgaactggac gtgctcacgc ccgggcactt cctcgccgga tcgtcgctgc tagcgattcc 5760 ggatccaaac tacactgacg ttcccacgaa ccggctgcaa cactaccagc aattgcaaca 5820 gctggtgcag caacactgga agcggtggag gcgggagtac atctcgcagc tccacaacca 5880 gaaccagaaa ttcccgcaag caacccagct gaaggccggg cagatggtgg tcctcaagga 5940 ggacaacaag gcggcaatcg agtggcccct cgcgcggatc gttgaggtct acccaggccc 6000 ggacggcgtc gtccgcgtcg tgaagctgcg tttgccgaac ggtgcggtct acaagcggca 6060 gtccgcacgt atctgtttgc tgccgttcga gaaggactcg gcggtgcagc tcacgagcgc 6120 cttcaaccct caaccgacac aaggcgcagc gtaggaggaa gatcgccgtc aaccgtcaac 6180 cctacggcgc aacggagttg gtgaagaagg cgtcaagaag aaaaatttga attttttgaa 6240 gtgctagttt aagtaggttt aggttttgag tttagcgcta gattttgttt gtgtttaaat 6300 tcctagagaa tttaaggtgg ccggca 6326 // ID EhRLE2 repbase; DNA; INV; 2854 BP. XX AC AB097128; XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Entamoeba histolytica retrotransposon EhRLE2, complete sequence. XX KW R4; Non-LTR Retrotransposon; Transposable Element; EHRLE2. XX NM EHRLE2. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RA Kojima K.K. and Fujiwara H.; RT "Cross-Genome Screening of Novel Sequence-Specific Non-LTR RT Retrotransposons: Various Multicopy RNA Genes and Microsatellites RT Are Selected as Targets."; RL Mol. Biol. Evol 21(2), 207-217 (2004). XX DR Genbank; AB097128; Positions 1 2854. XX SQ Sequence 2854 BP; 1428 A; 219 C; 559 G; 648 T; 0 other; agtaatacaa aggaagataa ataaactaaa agacatgcta ataagaggaa actataaaaa 60 attaagaagg atattaagaa taagaagaat gatagaagaa aagataaggg gagaagtaaa 120 tgaagaaata aaacaattag aatatttaaa agaaatgagg aaaaagaagt agacttaaac 180 aaaagtgaaa gaacaagaat ctacagaaac aaaatcatga gtactactgg aaaaaaagaa 240 ataagaaagg aaataggaag agaagaacaa ccaagtgaag aagacacaaa agagtttttg 300 aataagatat ataacagagg aaaaagaata aagaaagaga atgaaagaat aaaagagata 360 attgaagaga gtaatagaag gaatttgaaa agaataataa tcacaaaaga aaacatagag 420 aatgtgataa caagaatacc atcatggaaa gcaccaggga ttgaaaatat gcatggatat 480 tactggaaaa gattagagag tgttagagac agattagtta tagtgtttaa tgaatggtta 540 gacactccag aaaacatccc aatagaaatg ctaacaggaa gaactattct aatacacaaa 600 ggaggagata gcaaaaaagc agagaattat agaccaataa catgtcttaa tgtaataatg 660 aaaatcttta cttcaatcat aaatagaaag atagaagaac aactagcaaa taatgaagag 720 aaataccaaa ttaataagag tcaatatggg aataagaaaa gaagtctagc agcaaaggaa 780 gtaaatatca acagtttatt agaaagaaat gaaagagaag tacagaagaa aagatatttg 840 gaatcattct atgacattag aaaagcatat gatacagtta atgatgaatg gctagaaaat 900 gtattaagat acttcaaaat accatttcaa ataactgaat taataaagag tatgagtagt 960 agatggagaa taagaatagg atatagatat aaagaagaaa taagagaagt taaaataaag 1020 aatggtatat tacaaggaga ttccatttct ccactcttat tcatattaca aatgaatatc 1080 atttctgatg ttattgatag aacaaatgag aaaagactaa aagtgaaaca cgtactatac 1140 atggatgata ttaaagtaac aagtgaaaca aagaaaggaa tggaaatggt gcataaaaat 1200 ataatagaaa caatagaagt aataggaatg gaagttaatg taaataaaag tggagtaatg 1260 aagaaaggaa atataggaat acaagataat atgaaagaca ttccaatagt agactcagaa 1320 catcaatata agtatttagg tgtttatcaa tttacaagaa atgatgatgt tggatgcata 1380 gaaagaataa gtaaaagtat agaagaagga ataaaagaaa acaataaagg gaatgacagt 1440 agtatgaatt atattaacag aattaataca gaaataattc ctaagttcac atatagtgca 1500 tctgtagtta aattgcctat aatgaagctg attgaaatag ataaaatgat aaggatagaa 1560 gtgaaagaag gaaagattat tggagcagga atgtgtacga gtagattata tgtaaacaga 1620 aataaaatgg gagatggatt gaaaagcatt agagatgaat atgcaattaa attaataaga 1680 agtattttat attacacatg gaaaagagga aatagaatag gagacatgat aggaataaat 1740 gaaaataaga atagaatgtt gaatagattg tataaaagca tgttgaaaaa aagaataaac 1800 aaagaagaat ttaggaaaat agtagaagaa gaaagaggaa aacgggataa aaacagaata 1860 attaagtatg tgagagagaa gttccaagaa gattatatca ataaatggag aaacaagaaa 1920 gtaaacagtg aaatattgaa aggatttgaa gataaaacga tggaacagaa gaaaacaatt 1980 aaagtgtgga aaggaattaa tgttataaga aaagcatata tacaaataaa gaagatggaa 2040 gagaaagcaa gaaatgtagg agtaaggaaa gcattaatga aacatgatat gagatatgca 2100 atatgtccaa tatgtaataa aatagcaaca ataaatcata ttcttttgaa ctgtatagtt 2160 actaagaaaa cacaaatgaa taaacatgat agaataggag aatatatatg gaggagttta 2220 aagattaaat accaattata tgaaaacaaa gaaaagatga agatgagaga agaagaaaat 2280 acaaagaaat taataataag gaaagaagaa ttaggaaaga atgaaagaag aatagagaaa 2340 gataaaatac ttatagtttg gaatcacgaa gtagtaaaca gaagtgaaga caaattccat 2400 aaaagaatag atatttatgt cagagactta aataagaaag aagcaatgat tatagatatg 2460 acaatagtaa gagataaaaa cataagtaaa gcatttacag ataagattaa tatgtattca 2520 aaactacacg aacatataag aaagatagaa agattagtac gagtaatagt tataccagta 2580 gtaataacag tgagtggatt aataaacaaa gaaacagtta tattactaaa tgcagaagga 2640 atagaaatac caagggaaat aatgacaaga gagattgtaa tatcaaatat gaaagatctt 2700 atgagatatt gtggagatca ttctggagaa tacagcaatg tagaaatacc agaagaagaa 2760 gactctcagg agagtgaagg aagggaagaa attctcacct gaacaataag ttgaaaggta 2820 ttaaacctta atttaatatt tattctttat tctt 2854 // ID BEL-105_AA-I repbase; DNA; INV; 5677 BP. XX AC supercont1.4; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-105_AA_; KW BEL-105_AA-LTR; BEL-105_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5677 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.4; Positions 2092295 2097971. XX CC Positions [4564-5157] - Integrase core CC 'AATTA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1296..2654 FT /product="BEL-105_AA-I_2p" FT /translation="MSSILQDVCPPAPRAGIPKAPMLELSRSGHQGRSCTS FT KFSYRSCHQRHHTLLHENPSTNVSAAPAVVSGQPSQQLRPATSTTIVESSG FT SANPNSQVSLSVQSCQSTVLLETVNLLVVDQNGAEHTARALLDSGSMCSFI FT TKKLANILNLRRTKVDVAVSGIGDSSKQIKRKLTATIKSKMLSYTTTLEFL FT ILKRPTVCLPTTPIDVTSWKFPNVPLADPHFHVPADIDLVVGGEVYHELHT FT GSKISFDGGQPVFVETLFGWTISGKVPTKSPEVPRVCHLTTVDRHLEQALQ FT KFWELEAVEQCSKFTAEENVCEELYSTTTTRESSGRYVVSLPLTRDPLVTL FT GESRSIAERHFLSLERRLARDPTTKEAYCRFMEEYEHLKHMVKLVDPVDES FT QPHCYLPHQTGIQRIQHNDQGPRCLRRFVQNFFWILRKRPTAGWARRSRRS FT TVDPPPLP" FT CDS 2470..5343 FT /product="BEL-105_AA-I_1p" FT /translation="MNPSHTATCRTKPVFKESSTTTKVRVVFDASCKTSSG FT FSVNDLQLVGPVVQEDLLSIHLRFRSHQIALVADVEKMYRQILVHPSFRRY FT QLILCRPDPSLPIATYELQTVTYGMASAPFLATRTVVKIAEDSTERYPAAA FT PKAKKDFYVDDCLTGAEDVESAIRVRQEMSDMLSSAGLPLKKWASNSPEAL FT ANVPAEDLALAPYYGLQDAQSVSTLGLIWEPRSDMMSFKVQLPLPAPVLRK FT RRIMSYIAQIFDPLGLVGPVIVVAKLFMQRLWALKTEKGDPYEWDRPLPAH FT LQKEWKTFHGMLDAIATLRIPRCVCLANPTSLQFHFFCDASQKAYGACCYV FT RTESAGAVRIRLLTSKSKVAPLAHPKIIARLELCGAVLASNLYKKVIQSIL FT KSAQVYLWTDSAIVLHWLDSSPARWKTFVANRVAHIQETTASCHWNHVSGT FT CNPADPLSRGVDPINIQKHSLWWNGPEWLSLPPSEWPISELPSLDPSWISE FT ARIHVAMVVTVDSEFSNQLFSRYSNFSMLRRCIAYWMRYFRVLKAASKNST FT IEPFETLTTTDFHEADVTLCRLAQRASFPNEMSSLLSGERFPISSPLKWLK FT PKLSKVGVIRVGGRLGRAVVSEDVKHPIVLSSKHPLSSLLAKYYHEMLLHA FT GPQLMLATIRQKFWIIGGQNLVRRTYHQCLKCFRSKPILVQQSIADLPRSR FT VTPTRAFAVCGVDYCGPVYIKSPVRNQAPTKAYIAIFVCFATRAVHIELVS FT DLSTPAFLAALRRFVARRGRMREIHSDNGTAFKGAANELRRIYEMLKSNEA FT DRKQILDWCAQNEIVWRFIPPRAPHFGGLWEAAVKSAKNHLLREIGTVNVA FT YEDMCALLTQVEMCLNSRPLVPIPTEPSDLEVLRPGHFLIGESFQAVPEDN FT LCDVPDNRLTHFQLTQKRFQRIWSRWIPEYLQQLQSRATKCKNPVTITPGC FT RRD" XX SQ Sequence 5677 BP; 1403 A; 1666 C; 1345 G; 1263 T; 0 other; tttggtcctt cgagccggct cgaaggttcc cccgaccaga aggacaggct cggtgagtac 60 accatttcat tttcttcccg cgcggctggc aatcaattat tacgacctgt taaggtcaat 120 ggtttgctgc tggctttgtt cttataccct agccaagcca aacgaaagtc cagcaaggca 180 aacaaagtct atactcaagc cagtaacaaa gcgcgcgcgc gcaaatagca cagtaggccc 240 cacaaaggga gcagaaattt caacaccttg ctttggcgga gcaacgctgc tggttgtacg 300 tgattccgtg gaagccttct tcaagaagta cgacgatcag cgagacatct accagatccc 360 aattcgcctt gaagccctgg accgcgtcta caaggaattc caagagctgc agacggaaat 420 tgaaaagtac gacaccgccg agaatttcga cgaccaccta gaagaacgag cagctttcga 480 gtcgagattc tgcagcgcca agggatttct attgatgaag cgtttcactg atccagccca 540 ggcactgaac acgtcaacaa caaaccacca aaatgcactt tcgaccggtt ttcacctacg 600 actaccgaag atcgaccttc caaagttcga tggagatgct tctcgatggc tttccttccg 660 tgatacgttc acctcgatgg tgcattccaa tgcggacatt cctacagtag cgaaactaca 720 gtacctccta agatccctag aaggagaagc gcacaagcca ttcgagtcga tcgatataga 780 agcggacaac tatgggtcga catgggacgc gctgctaaag cgctatgaca accgcgttac 840 ctaaagcgcc acctcttccg agcattatat gatctcccac cgctgaagga agagtcccca 900 caagagctgc acgatctggt tgacgattac cagaggcacg ttaaggctct ggcgaagctg 960 aatgaaccag tcgaatactg ggacaccccg ttgatcaacc tgctgagcta caagctggat 1020 cccacaaccc ttcgagcctg ggaggagaag acgagcaaag tcgacgacat cacctacaac 1080 gagttgatcg atttcttgta ctagcgggtc cgaatgttga ggtccgtagt caccgatctg 1140 cagcagcgtt ccaatcaacc cggtcaaacc aaggtgaccg gtgccgctca attccagaaa 1200 aggccgtaca aaatggtttc caattccgcc acgatcgaat ccaaatccta cgctccgagt 1260 tgtatagcct gtccagagag ccacttccta ttccaatgtc cagcattctc caagatgtct 1320 gtccgccagc gccgagagct ggtatcccaa aagcgcctat gctggaactt tcccgttctg 1380 gacaccaagg cagaagttgt acgtccaagt tcagttaccg aagctgccat caaagacacc 1440 acaccctttt gcacgagaac ccatccacga atgtttccgc cgctcccgcc gtcgtatctg 1500 gccaaccatc ccaacaactg cgcccagcaa cctcgacgac catcgtcgaa tcctccggat 1560 cagcgaatcc gaactcccaa gttagtttgt cggtacagtc gtgccaatcc accgtcctgc 1620 tggaaacggt taatctcctt gttgtggacc agaacggcgc ggagcatacc gctcgtgcgc 1680 tgctagattc gggatcgatg tgcagcttca ttaccaagaa actagcgaat atcctcaacc 1740 tccgtcgtac gaaggtagac gtcgccgtgt ctggtatcgg tgattcctcc aagcagatca 1800 aacgcaagtt gactgccact atcaagtcca agatgctttc gtacactacc acgctagagt 1860 tcctcatcct aaagagacca actgtctgcc tgccaaccac tccgatcgac gttacctcgt 1920 ggaagtttcc caacgttccc cttgctgatc cccacttcca cgttcctgcc gacatcgacc 1980 tagttgtcgg aggagaagta taccacgaac tgcacaccgg cagtaagata tcgtttgacg 2040 gaggacagcc tgtcttcgtc gaaaccctgt tcggatggac tatctcaggg aaagtaccaa 2100 ccaaatcacc cgaagtccct cgagtttgcc atctcactac cgtcgatcgc cacctggaac 2160 aagccctaca gaagttctgg gagctagaag ccgttgaaca gtgctccaag tttaccgccg 2220 aagaaaacgt gtgcgaagaa ttgtattcca ctaccaccac tcgcgaatcg tcgggtcgtt 2280 acgtcgtttc cttaccactc acccgcgatc cgctcgtcac tctcggtgaa tctcgatcaa 2340 tcgccgaacg ccacttcctg agcctcgaaa gaagacttgc gcgtgatcca accaccaagg 2400 aggcctactg tcgctttatg gaagaatacg aacacctgaa gcacatggtg aaactcgtgg 2460 atccagtaga tgaatcccag ccacactgct acctgccgca ccaaaccggt attcaaagaa 2520 tccagcacaa cgaccaaggt ccgcgttgtc ttcgacgctt cgtgcaaaac ttcttctgga 2580 ttctccgtaa acgacctaca gctggttggg cccgtcgttc aagaagatct actgtcgatc 2640 cacctccgct tccgtagtca ccagattgcc ctcgtagccg atgtcgagaa gatgtatcga 2700 caaatcctgg tgcatccctc tttccgtcgg taccaactca tcctctgtcg tcctgatcca 2760 agtttgccca ttgctacgta cgagctgcaa acggttacat atgggatggc ttccgcaccg 2820 ttcctagcga cacgcacggt cgtaaaaatt gcggaagatt caactgaaag gtatccagca 2880 gctgcgccga aagccaagaa ggacttctac gtggatgatt gcctcactgg ggctgaagat 2940 gttgagtccg caatccgcgt acgccaggag atgtctgata tgttatcatc agcgggacta 3000 ccgctgaaaa agtgggcgtc aaactcgcct gaagcccttg ccaacgttcc agccgaagat 3060 ttggcacttg cgccgtatta tggtctccaa gatgcacagt cggtgtcaac cctcgggctt 3120 atttgggaac ccaggtccga catgatgtcg tttaaggttc aactgccact accggcaccc 3180 gttctcagga aaaggaggat catgtcatac atagcgcaaa ttttcgatcc acttggcctg 3240 gtgggaccgg tcatagtcgt cgccaagttg ttcatgcagc gtttgtgggc gttgaagacc 3300 gagaaaggag atccatacga gtgggaccgt ccgcttccag cacatctgca aaaagagtgg 3360 aaaacgttcc acgggatgct cgacgccatc gccaccctcc gaattccacg ctgcgtatgc 3420 ttggccaatc caacctcgct ccagttccat tttttctgcg acgcctctca gaaggcgtat 3480 ggagcatgct gttatgtgcg gacagaatcc gctggagcag tgcgtatccg actgctgacg 3540 tccaaatcca aggttgcacc gttggcccat ccgaaaatca tcgctcgtct ggagttgtgt 3600 ggtgcagttc tggcgtccaa tctttacaaa aaggtaatcc aatccatcct gaagtcggct 3660 caagtgtacc tgtggactga ttcggccatt gtcctgcact ggttggactc gtcgcctgcc 3720 cgatggaaaa ctttcgtggc caatcgggta gctcacatcc aagaaactac cgcatcctgt 3780 cattggaacc atgtttccgg gacttgcaat ccagctgatc ccctttcccg tggcgttgat 3840 ccgatcaata tccagaaaca ttcgctgtgg tggaacggac cagaatggtt atccctgcca 3900 ccatctgagt ggcctataag tgaactgcca tcactcgatc cctcctggat ttcggaagcc 3960 agaatacatg ttgctatggt agttaccgtc gactccgagt tctccaatca actgttcagc 4020 cgatactcca acttcagcat gcttcgtcgt tgcatcgcat actggatgcg atattttcgt 4080 gtcctaaagg cagcctctaa gaattccacg atcgaaccgt tcgagaccct aacaacaacc 4140 gatttccacg aagcagacgt taccctttgc cgcctagctc agcgtgcgtc gttcccgaac 4200 gaaatgtcca gcctactttc cggagaacgc ttccctatat cgtcgccgct gaagtggctg 4260 aagccgaagc tcagcaaagt aggtgttatc cgtgtcggtg ggagactcgg cagggctgtc 4320 gtatccgaag atgttaaaca cccaattgtc ctctcatcca agcatccgtt atcctctctg 4380 ctggccaagt actaccacga aatgctgctc cacgctggtc cccagctcat gctcgccacc 4440 attcgccaga aattctggat catcggtggc caaaacctcg tccgccgtac gtaccatcaa 4500 tgcctcaagt gcttccgcag caaacccatc ttggtgcagc agagcatcgc cgatttgcca 4560 cggtcaaggg tcacgccgac cagagcgttc gccgtgtgcg gggtcgatta ctgcggtccc 4620 gtctacatca agtctccggt gcgcaaccag gctccaacta aagcatacat cgccatattt 4680 gtgtgcttcg ctacccgtgc cgtccacatc gagctcgttt ccgacttgtc cacgccggct 4740 ttcctagccg ccctccgccg tttcgtcgcc cgccgtggaa ggatgcgaga aatccatagc 4800 gacaacggca ctgctttcaa gggtgcagcc aacgagctga gaaggatcta cgagatgctg 4860 aagtccaacg aagccgatcg gaagcagata ctcgactggt gtgcgcagaa cgagatcgtc 4920 tggcggttta tacctccccg tgccccacac tttggtggac tgtgggaagc agccgtcaag 4980 tccgccaaga accatctatt gcgtgaaatc ggcaccgtca acgttgccta tgaggacatg 5040 tgcgctctgc tgacccaagt ggagatgtgc ctgaattcca gacccctggt tcccatccca 5100 accgaacctt cagacctgga agtcctcagg ccaggacact tcttgattgg ggaaagtttt 5160 caagccgttc cggaagataa cctctgtgac gttccggaca accgccttac ccacttccag 5220 ttgacccaga agcgcttcca gcgaatttgg tcccggtgga taccggaata tctacagcag 5280 ctccaatcgc gagctacgaa atgcaagaac cctgtcacca tcactccagg ctgtcgtcgt 5340 gattaaggac gaaaatcttc cgccgatcca atggccgctc ggaaagatca tcaaggtgca 5400 tccgggaaag gacggagtag tacgtgtcgt cactctgaag accgcaacat ccgaggctgt 5460 cgttcgaccc gtcgccaaga tcgcgctgct accagtaccg gacaaccaag ttccgcaatc 5520 caacgattga agctggtaga gcacgccgaa atttgctcga tgccttcgtg gaaacctagc 5580 agacgacaac gaatccatcg acgattagtg ttgaactagc caagcaagtt caaggtggcc 5640 ggaatgatga gaattgaatt gaaattgaac taaatta 5677 // ID BEL-89_CQ-I repbase; DNA; INV; 6838 BP. XX AC AAWU01006427; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-89_CQ_; KW BEL-89_CQ-LTR; BEL-89_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6838 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 311-311 (2011). XX DR GenBank; AAWU01006427; Positions 89953 96790. XX CC Positions [5816-6388] - Integrase core CC 'GAATG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1475..6745 FT /product="BEL-89_CQ-I_1p" FT /translation="MPRLPKSTVATATTSLKVLLAKLKSHQASFNIICQFK FT DDYPEDATANQINVRLERLDELWELMSEVTSDIMAHDDFEGTTATFDKERS FT SWENLYYEVKSFLLDKLKDFTQPNLNQSTLPADQSTSSGSMDHVRLPPITL FT QKFDGNIDEWLSFRDLYTSLIHWKTDLPDVEKFHYLRSSLIGEALSHIANI FT KISKANYALAWETLTKEYNNHKLLKKKQAQSFFELPVLVKESAKDLHKLVE FT SFNMIVQSLDQVVQPNDYKDLLLVELLSSRLDPATRRGWEEFTSTKETDSL FT KNLTDFLTRRVRILEALPVKTSEYKQDSNAAAKKKPFQVRVSHNAVQKQIS FT KCPACPESHGLYICPTFQKMSVASRESFIRNNNLCRNCLRSGHQAKKCTSR FT FSCRNCKARHHTMVCFQPEGGGNARSNSTEMVPTTSNNNQRQSTPGETAST FT AAGEQVSSHVANRRTSKILLATAVVLIENEHGLRVSARALLDSGSECNFIS FT EKLCQQLRIQREKVDVSITGIGNATTKAKHRVQATIKSRFSIFSRDLHFIV FT LPKVTVDLPMTQVDITEWEIPDGIDLADPSFFKPGSVDLVIGIQAFFSFFK FT TGKELSLGNGLPTLTESVFGWIISGEVVESSQPTTITCNMALTDRLDELLE FT RFWACEEVGDSNNYSVDETRCEEQYQRTVKRAPDGRYTVTLPKHEGGLEQL FT GDSEEIATKRLYGLERRLEKDLELRKQYNDFMTEYITLGHMEKVDDDSRAN FT VKRCFLPHHPVLRESSTTTKCRVVFDASCKTSTGVSLNDTLLVGPIVQQDL FT RTIMLRARVRQVMLVADVEKMFRQTDMDPEDRPLQSIKWRFGPDEEVASYE FT LSTVTYGTKPAPFLATRTLKQLATDERERFPLAAESVDDDIYMDDVICGAA FT DLDTAINLRQQYDGMMASGGLRLRKWASNWEDALKGVPNENLAFADEVSWD FT QDASVKTLGLTWLPRTDQFRFNFQIPEIKSNQPLTKRIVTSIVARLFDPLG FT LIGACVVGAKIYLQELWDLIDKETGKPIDWDTELPATVGEKIRRFLLKIPH FT LNELRIPRCVLARGATKFELHCFTDASEKAYGACIYLRSMDDEGNISVHLL FT TSKSKVAPRKLQTLPRLELCGAYLGALLVEKVLQALKVSVNVYYWTDSTCV FT LMWLRAVPSTWVTYVANRVAKIQALTAGHTWHHVPGTINPADLISRGIPPD FT MIGSNVIWWEGPIFLKLEKDQWPSQPTVSPDEAPERRKTACTATQEQTPFS FT VDFVSRYSSYMKLVRSTAYWLRLKAMLRHQTSPKQHKFLSATEFKRAELAI FT IGMVQREVFADELKALSEGKSVARNSPLRWFTPAVDKNGILRVGGRLSHSQ FT ETHDTKHPMVLPAKHPLTQLIFEDYHEQLLHAGPQQLLAAVRQRFWPLGGR FT NTARSVYHRCQRCFRAKPIVLQQQMGQLPSARVTACKPFLKVGLDFFGPVF FT TKAGRGRTAIKSYVAVFVCMCTKAVHMEHVSDLSTERFLQALRRFIARRGR FT PSDIYSDNGTNFVGARNQLLELFNILESDREVICREATKEGINWHFNPPSA FT PHFGGLWEAAVRSAKYHLLRVLGNSTVSAEDFNTLLVQVEGVLNSRPLTPL FT SDDPTDLEPLTPAHFLTGGPIHALPEPDYSNEKLNRLSRWQLVQRQLQDFW FT TRWRKEYLTQLQAKPKNWKSPVKVDVGMLVIIVDENQPPMRWKLGRIEELH FT PGEDGVVRVVTVRTATRSYKRPVVKLCILPTPDSESS" XX SQ Sequence 6838 BP; 1709 A; 1770 C; 1803 G; 1556 T; 0 other; ttgctggtcc ttcgaaccgg ataactccgg gaaggaccgt cgaggcaaca catccggatc 60 gtctgggccg cgtgaaccgc ggggacgacg ccatcgcagc agcagcgcgc gaaaccaggt 120 cgcgacgctg tacgacgcca ttgcaaccgg gcattcaacg ggactccgca ttgtcctgtc 180 gggatcaaaa ggaccttttc atcacggcga ggatcggcca tcgcggtagc ttactccatg 240 taactacctt tctggtgagg taagctaact ccgcactacg cctagtatat gtcttggccg 300 aagccgaact ccactctaac accaacttct tttacgtcct gctgtgactt tacgacttcc 360 gagtcaaccg gccatcggaa cggcccctgg caacttcaag cgctaccaat cgtcggtgtg 420 gtcaaccgct ggtgcggtca actaccgacg cggacaactc gagcgagaac gattgctgag 480 tttcgatttg gacaactacc ggtgcgggaa tccgccggtg aaaacgaaat cgactgtggc 540 atcaactaat cagcatcgtc aacgcgttcg tgtgctgctg gtgtagcttt ggacgccggc 600 aaggaacggc ggtactgaac gccggcctgg atcgccagta ccaatcgctg gtacggaacg 660 gcgacgctgg ttcgctggct tgggacgcta gcgcggaacg ctggtacggg acgacaactc 720 ggtggacgcc gaccgatcgg tgttggcagg ctggacgaag ccggccacgt gtacaaccgt 780 gtggatttgc agccgtcgac cagggcatcg cactcaaacc tgtttgcata ggtaacacac 840 tagtatatgc cctgccgggg caaactcatt tggttactac acaataaatc ccgttttcga 900 tgcacacaca aacaccgctt cgagttacca ttctttggcg cgcaatccct gggtgttaac 960 ccatccgctt caacgtgatt cttcgctact tcgtgttggt gttgctgtta gttctccagg 1020 tgaggcacag tatatgtcac gccgaagtaa gactttttgg tactacatct gccctctcac 1080 tctcctcggt ttggaacgat tcgatttggc tgctctgaga ctccgctttg ctgctcgttt 1140 tctggtcgtt ggacggtttc cttcggtatc ctgctaggct gttgtttgca caggtaaagc 1200 caacagtata tgtcggccga agtgaaaacg ttggatacta catgcactct tctcctcgtc 1260 gatttcccgt tgaactggca actggcattc attgacgacc tgcgtggacc gtttacggac 1320 tgtggatctg cgacgtttca ttcaaattta cccagcgcag taaactgttg ggcaggtaag 1380 ctgccacagc cacagtatat gtcttgtcgt tgctggacag acgtacggtc actacaactg 1440 catctcattt tcccggtcaa tcacgctcaa cgtcatgccg cgacttccga aatcaacggt 1500 ggcgactgcc acgacctcgc tgaaggtgct tctggcgaag ctcaaaagcc atcaagcttc 1560 gtttaacatc atttgccagt tcaaagacga ttacccggaa gatgcgaccg ccaaccaaat 1620 caatgttcgt ctcgagcgtc tggatgaact gtgggaattg atgagcgagg tcacaagtga 1680 catcatggcg cacgatgact tcgagggtac cactgcaaca tttgacaagg aacgatcaag 1740 ttgggaaaat ctgtactacg aggtcaaatc ttttctgctg gacaagttga aggatttcac 1800 gcaacccaac ctcaatcaat caacccttcc agctgatcaa tcaacgtcat caggctctat 1860 ggatcatgtt cgtcttccgc cgattactct ccaaaagttc gatggaaaca tcgacgaatg 1920 gttgagtttt agggatcttt acacctcact aatccattgg aagaccgatt tgccagatgt 1980 cgaaaaattt cattatttgc gcagtagtct gataggcgaa gctttgtcac acatcgcgaa 2040 cataaagatc tctaaggcca attacgcact agcttgggaa accctgacta aggagtacaa 2100 caaccacaaa ctgctgaaga agaaacaggc acagtccttc tttgagcttc ctgttctggt 2160 gaaggagtcc gcaaaggatc tgcataagct ggtggaatcg ttcaatatga tcgttcagtc 2220 cctggaccag gtggttcagc ccaacgatta caaggatcta cttttggtgg aacttctgag 2280 ctcaaggttg gaccctgcta cgcggcgtgg atgggaggag tttacatcta cgaaggaaac 2340 ggactctttg aaaaatctga cggattttct cacgcgtcga gtacggattc tcgaagcgtt 2400 acccgtgaag acgtcggagt acaaacagga ctctaacgca gccgcgaaga agaagccgtt 2460 tcaggttcgg gtgagccaca acgctgtcca gaagcagatc tcaaaatgcc cagcgtgtcc 2520 cgaatcgcat ggactctaca tctgtccgac tttccaaaaa atgtcggtcg cgagtaggga 2580 atcgttcatc cgtaacaaca acctctgccg caactgtctg aggtcaggtc atcaagccaa 2640 gaagtgcaca tctcgcttct cgtgcaggaa ctgcaaggcc agacatcata cgatggtatg 2700 tttccaaccg gaaggaggag gaaatgcgcg tagtaactcg accgagatgg tgccgactac 2760 gagcaacaac aatcaacgcc aatcaacgcc aggtgaaacc gcaagcacag ccgctggtga 2820 gcaggtgtcc tcccacgtgg ctaatcgtcg aacgtcgaaa atactattgg caaccgctgt 2880 ggtactgatc gagaacgagc atggtcttcg cgtatcggca cgtgcattgc tcgactccgg 2940 atctgaatgc aacttcatat cggagaaact ttgtcaacaa ctgaggatac aacgggaaaa 3000 ggtcgacgtt tcgatcactg gcatcggaaa tgccaccacc aaggctaaac accgggttca 3060 agccaccatc aagtcacggt tttcgatctt ctcacgggac cttcacttta tcgtgcttcc 3120 aaaggtcaca gttgatctcc caatgactca agtggacatc acggaatggg aaattccaga 3180 cggcatcgat ctggccgatc cgtcattctt caaaccaggg agtgtcgatc tggtgattgg 3240 cattcaggca ttcttcagct tcttcaagac tgggaaagag ctgtcgctcg gcaacggatt 3300 gccaactcta actgaatcgg ttttcggttg gattatatcc ggcgaggtgg tagagtcttc 3360 acaaccaacc acgatcacct gcaacatggc gctcactgat cggttggacg agttgctgga 3420 gaggttttgg gcctgcgagg aagtaggtga ctccaacaac tactcggtgg atgaaactcg 3480 ctgcgaggag cagtatcagc gcacggtgaa acgggcaccg gatgggcgct acacggtcac 3540 tctaccaaaa cacgaaggag gtttggaaca gctaggcgac tcggaggaga tcgcaaccaa 3600 acggctgtac ggattggaaa ggagactgga aaaggacttg gaattgcgta aacagtacaa 3660 cgacttcatg acggagtaca taactctagg gcacatggag aaggtggatg acgattcacg 3720 ggcgaacgtc aagcggtgct tccttccgca tcaccctgtt ctacgcgagt caagcactac 3780 aacaaagtgt agggtggtgt ttgatgcctc atgcaaaact tccacgggcg tctccttaaa 3840 cgacacactg ttggtagggc caatagtgca acaagacctg cgtacgatca tgctacgagc 3900 tcgcgttcgc caggtgatgc tggtcgccga tgtcgagaaa atgtttcgac agaccgacat 3960 ggacccagag gaccgaccac ttcaaagcat caaatggagg tttggaccgg acgaggaggt 4020 ggcctcgtac gagctatcca cggtgaccta cggcaccaag cctgcaccgt tcttagcaac 4080 acgcacactc aagcagctcg cgactgacga acgggaacgg ttcccgttgg cggcggaatc 4140 tgtcgacgat gacatttaca tggatgatgt tatttgtgga gctgctgatc tggacaccgc 4200 aatcaacctg cgccaacagt acgatggaat gatggccagc ggtggattga gattgcggaa 4260 atgggcgtcg aactgggaag acgcactcaa gggagttccg aacgagaatt tggctttcgc 4320 tgacgaggtc agctgggatc aggacgcaag cgttaagaca ctgggtctga cgtggctacc 4380 aagaactgat caattcaggt tcaacttcca aattcctgaa atcaaatcca atcaaccttt 4440 gacgaaacgg atcgtcacgt caattgtcgc acgacttttc gacccactcg gcctgatagg 4500 agcttgcgtc gttggggcga aaatctacct ccaggagttg tgggatctga tcgacaagga 4560 aactgggaaa ccgatcgact gggacacgga gttaccagcg acggtgggtg agaaaattcg 4620 acgttttttg ctaaaaattc ctcacttgaa cgagcttcgc attccgcgct gtgtgctggc 4680 acgaggagcc acaaagttcg agctgcactg tttcacggac gcgtctgaaa aggcgtacgg 4740 agcgtgcatt taccttcgca gcatggacga tgaaggcaac atttcggttc acctgttgac 4800 gtcgaaatcg aaggtcgcac cccgcaaact ccaaactctg ccccgccttg agctctgtgg 4860 tgcgtatctg ggagcactac ttgtggaaaa ggttctgcaa gctctgaagg tctcggtgaa 4920 cgtgtactac tggacggatt caacctgcgt tttgatgtgg ctgagggctg taccaagcac 4980 atgggtcacg tacgtggcca atcgcgtcgc caagattcag gcgttgacgg caggtcacac 5040 ttggcaccac gtaccaggaa cgatcaaccc agctgatctg atctcacgtg gaatcccgcc 5100 tgacatgatc ggaagtaacg tgatctggtg ggagggaccg attttcttga aattggagaa 5160 agaccaatgg ccatcgcaac caaccgtgtc cccggatgag gcaccggaga ggagaaagac 5220 ggcgtgcaca gcaactcaag aacaaacacc gttcagcgtt gacttcgttt caaggtattc 5280 ctcgtacatg aagctcgtca ggtcaacggc gtactggtta cggctgaaag ctatgctgcg 5340 tcaccaaaca tcacctaaac aacataaatt cctctctgcg acggagttca agcgtgctga 5400 actggccatc attggaatgg tccaacggga ggtgttcgcg gacgaactga aggctttgtc 5460 ggaagggaag tcggttgcac gaaactctcc actgcgctgg ttcacccctg ctgttgacaa 5520 aaacggtatt ctacgcgtgg gtggaagatt atcgcactcg caggagacac acgacaccaa 5580 gcatcccatg gttttgccag ctaagcatcc actgactcag ctgatcttcg aggactacca 5640 cgaacaactg ctgcatgctg gaccacaaca acttctcgca gcagttcggc agcgtttctg 5700 gccactagga ggtcggaata cagccagatc tgtttatcac cgttgtcagc gttgttttcg 5760 agctaagcct attgttctcc aacaacaaat gggacaattg ccatcagctc gtgttactgc 5820 ttgtaaaccg ttcctcaaag taggccttga tttctttgga cctgtgttta cgaaggctgg 5880 aagaggcaga actgcgatta aatcgtacgt tgcggtcttc gtttgtatgt gtacaaaggc 5940 cgtacacatg gagcacgtct ctgatctctc caccgagaga ttcctccaag cgttacgcag 6000 attcatcgca cggcgaggta ggccaagcga tatctactct gataacggca cgaactttgt 6060 aggggcaagg aaccaattgt tggaactgtt caacattttg gaatctgatc gagaggtcat 6120 atgtcgggaa gccaccaagg aaggcattaa ttggcatttc aatcccccca gcgccccaca 6180 cttcggtggt ttgtgggagg ctgctgtccg ctcggcgaag taccatcttc tacgagtctt 6240 ggggaacagc acggtatccg cggaggactt caacacactc ttggttcaag ttgaaggcgt 6300 tttgaactcc cgacccttga cacctctctc agatgaccct acagacctag aacccctgac 6360 accggcccat ttcttgactg gaggaccaat ccatgcgctg ccggaaccgg actacagcaa 6420 cgagaagctg aaccgcttga gtcgttggca gctggtacaa cgacagcttc aggacttctg 6480 gacacgctgg aggaaggagt acctgactca gctgcaagcc aagccaaaga attggaaatc 6540 gccggtcaag gtggacgtcg gcatgctggt aatcatcgtg gacgagaatc aacctccgat 6600 gcgttggaaa cttggacgga tcgaagaact acatcctgga gaagacggag ttgttcgagt 6660 ggtgaccgtt aggacagcta ctcggagcta caagcggcca gtcgtcaagc tttgcattct 6720 accaacacct gattccgagt cttcatgagc agtccagtcg ttctttggtt ccgtgtgcgt 6780 cgaaggggag tttctttttt cttttcagaa tcatcagctg cttcagggtg ggtgagaa 6838 // ID Tx1-7_CQ repbase; DNA; INV; 4984 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4984 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 639-639 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >98% CC identity. CC 3' end is uncertain. XX FH Key Location/Qualifiers FT CDS 155..1501 FT /product="Tx1-7_CQ_1p" FT /translation="MATFEEARANSLRIYFGPGKKEPLDHGIFNFMKQKMR FT LNPNSLLSMYKESKENCVYIKFKTEEMLTNTLLDLPESMDFEYNKYESTKV FT TFSSASTVFKYVRIFNLPPEISDQEISTVMSKFGVIQRMVRERYAPETGFP FT IWSSVRGVHMEIKQDIPATIHVRNFQARVYYDGLQNKCFICGSKDHVKVNC FT PKKSKVNXRMEQNGSLSYSAVTAQPPLWLGRGSTHKAGESGGEGQMVVLNN FT FFGPAKPSLLVPXKLPEVPSAVKPIDQAGLGSSSVEQIVLPMSEPDAEMKE FT TTTAEGVLTGGGVSSLAVPIVTDGAVKEAQGEEEFQLVQGRKKKRKEEMRS FT LSADAPAKPGSGMIFIPPSNLSAAQSMETRGRSKKKEDNKERSRSRSLRDR FT DTEGKGDTRGKGDVKGKGDVKGKGDTRGKGDVKGKNKLGGNGEGEGVDANL FT EENL" FT CDS 1638..4895 FT /product="Tx1-7_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNFTLKIASVNLNSTTTTINQNLLRDFIWNQDIDLVF FT VQELCYQNFSFIPSHFAIINLNQNGMGTGILLRKSFEFNNVVLDPNGRISS FT LVINGINYINIYAFSGTNKKKERDELFLNNLTVHLSKSGIEYSVLAGDFNC FT ILNSSDATGATKNVSAGLKRLVESLQWKDIFAEMKQNEFTFFRSGVASRLD FT RIYGPSRFVDLIIEAKTISVPFSDHCAIVLKIKTDSNSFCLKGRGYWKINN FT SLLFDENISHKLSVEYQSICNRSSYFNDKNQWWNSIAKVKFKQFFKGESWA FT INKYILDRKSYFHGVLNDLHRRRCLGEEVSKDMFLVKSRLMEIELDRLKHY FT SNKLNNFSLLQGEVMNIFQVSAQIKKFSTSNNLKLKDGNVITGDCSIMKQI FT IFDHFSEQFKKSLDNALDGITDPLEGVTSRLSDEQQRELTRPITVDEMFTV FT LKLCAHKKSPGPDGLNYEFYLKHFDLIKDDLVVLFNEYLSGTKKPPRDFTA FT GIITLIHKGNDKNDLNNYRPISLLNCDYKLFTKIIAMRIKLHLDDLLGCEQ FT SACKSDRSCSDNLKDIRRIMLRANDSKRFKGALVSIDLQKAFDKVDHTYLW FT KVLEKFNFPDSLISCIRNLYDSATSKVLFNGFLTNSIKIESSVRQGCPLSM FT ILFVLYIEPLLRKISDGIAGVLIYDKFIKTIAYADDINIFIRNDEEFDLLI FT QILNSFESFGKIKMNFNKSVYLRINRANFGPQMIREVNSFKILGIHFSNSI FT SESINLNYDKVILNVKHTLHLHGVRKLNLVERGWLLNAFILSKFWYLAQIF FT PPYKKHIDIINSLVGQFLWNGFIFKTARDQLYLDYDAGGIRLVNAEFKMKA FT LFIRNILYPGGNVSENYILVYSKQHQLPLCFRQWIEVARLCKNDSLLDSSK FT AFYKHFLNQENFIPKVKVDFPQFQWTTIWKNLNFNFIGSDEKSRLFAFLNN FT LIPNKEKLMAYNIGRLSNNLCDICNGIDNNGHRIMECPQSRIISNWTKNKV FT EKISGIKLVNLEDILSFEIDENKKSEIAALWLCILAISFNLKCYPGPSLFV FT FKKEIREMRWNNRLIFNKVFGTHLLRLGM" XX SQ Sequence 4984 BP; 1610 A; 745 C; 1106 G; 1516 T; 7 other; cagttgatgg ccagacttca agccgaacgg tcgatattgt ctttgcggtc taatagttga 60 gacaacgttt tgttgtagct caggctactc tgcccgggtg ttttgttctt cgagaacaat 120 agtatttagt tttaaaatcg gaacagtgcg gaaaatggcg acttttgagg aagcacgtgc 180 caactcgttg aggatctact ttggccccgg gaagaaggaa ccgctggatc atggaatttt 240 caatttcatg aagcagaaga tgagattgaa cccaaattca ctcttgtcga tgtacaagga 300 atcgaaggag aactgcgtgt acattaagtt caagacggag gaaatgctga cgaacacctt 360 gctggacctg ccggaatcga tggactttga gtacaacaag tacgagtcga caaaggtgac 420 cttctcatcg gcatcgacgg tttttaagta tgtccgaatt ttcaacctgc caccggagat 480 cagcgaccag gagatttcga ctgtgatgtc taagttcggm gtcattcagc ggatggtcag 540 ggaaaggtac gcccctgaga ctggcttccc aatctggtcc tcagtccggg gagttcatat 600 ggaaatcaag caggacattc cggccaccat acacgtgcga aacttccaag cgcgtgtgta 660 ctacgatgga cttcagaata agtgtttcat ttgcggcagc aaagatcacg tcaaggtcaa 720 ctgcccaaag aagtcgaagg tcaaccawcg gatggaacaa aacggaagct tgtcgtacag 780 tgcggtgact gcgcaaccac cgctctggtt gggtcgagga tcgacgcaca aggcaggaga 840 gagtggagga gaagggcaga tggtcgtgtt gaacaacttt ttcgggccgg cgaaaccgag 900 tttgttggtg ccgmcgaagt tgcccgaggt accgtcggct gtaaagccga tcgatcaggc 960 cggactcggc tcgagcagtg ttgagcagat cgttctccca atgagtgagc cggacgcaga 1020 aatgaaggag acaacaacag ccgaaggagt tcttaccggc ggcggcgtat cgtcgttggc 1080 agtgccgatc gtgacggatg gtgcggtgaa ggaggcgcag ggtgaagaag agttccagct 1140 agtgcaaggt cgtaagaaga agaggaagga ggaaatgcga tcgctttcag ctgacgctcc 1200 agcaaagccc ggatcaggta tgatttttat cccaccgtct aacttgagtg cagcgcagag 1260 catggagacg cggggtcgtt caaagaagaa agaggacaac aaagagagaa gccgatcacg 1320 tagtttgagg gacagggata cagaggggaa gggggataca cggggaaagg gggatgtaaa 1380 ggggaagggg gatgtaaagg ggaaggggga tacacgggga aagggggatg taaaggggaa 1440 gaataaattg ggggggaatg gggaggggga aggggttgat gccaatctgg aggagaatct 1500 ttgaaaaatg gagttagtca ctatgggttt gtttcggatt ttggtcatat aagtagctat 1560 attttacatt caatttagaa taggttagct ttatggtagg attagttaga tttattttgt 1620 tcttttaatt ttatacaatg aactttacac ttaaaatagc gtctgtcaat ttaaacagta 1680 caacaactac tatcaatcaa aatttattaa gagattttat ttggaatcag gatatagatt 1740 tagtttttgt tcaggagctt tgctatcaaa atttttcttt tattccttct cattttgcga 1800 ttatcaattt aaatcaaaat ggtatgggaa ctggcattct gttgcggaaa tcgttcgaat 1860 tcaacaacgt kgttcttgat ccgaatggta gaatatcttc tttagttatc aatggcatca 1920 attatattaa tatttacgca ttttcaggaa ctaacaaaaa gaaagaaaga gatgagctat 1980 ttttgaacaa tttaactgtc catttgagta aatccggtat tgaatattcc gttctggccg 2040 gggattttaa ttgtatttta aactcttccg atgctacagg agcaacaaaa aatgtgtctg 2100 cgggcctcaa acgactcgta gaatcacttc aatggaaaga tatttttgca gaaatgaagc 2160 aaaatgaatt cacctttttt cgatctggag tcgcttcgcg acttgatagg atttatggac 2220 cgtcgcgttt tgttgatttg attatagaag ctaaaacaat atcagttcct ttttctgatc 2280 attgtgctat tgtgttaaaa attaaaacag attcaaactc attttgtttg aaaggaaggg 2340 gctattggaa aattaataac agtcttcttt ttgatgaaaa tatctcacat aaacttagtg 2400 ttgaatatca atcaatctgt aacagatcat cttatttcaa tgataaaaat cagtggtgga 2460 actcgatagc taaagttaaa ttcaaacaat tttttaaagg ggagagctgg gcaattaata 2520 aatatatttt agatcgtaaa agttattttc atggagtttt gaacgattta catcgtagaa 2580 gatgtcttgg ggaagaggta tccaaagata tgtttttagt taagtcacgt ttaatggaaa 2640 ttgaattaga tagacttaaa cattatagta ataaattgaa taatttctca ttgttacaag 2700 gggaggttat gaatattttt caagtttctg ctcaaattaa gaaattcagt acgtcaaaca 2760 atctcaaatt aaaggatggt aatgtgataa caggggactg ctcgattatg aaacaaatta 2820 tttttgatca tttttctgaa cagttcaaaa aatctctgga taatgcacta gatgggatca 2880 ctgatccttt agaaggtgta accagtcgct tatcagatga acagcaaaga gagctcacaa 2940 gaccgattac agttgatgaa atgtttaccg ttttaaaatt atgtgcacat aaaaaatcac 3000 cagggcctga tggtttgaat tacgaatttt atttaaaaca ttttgattta attaaggatg 3060 atttggtagt attgtttaac gaatatttat caggaacaaa aaaacctcct agagatttta 3120 ctgctggtat tattactctt atacataaag gtaatgataa gaatgattta aataattata 3180 gaccaattag tttacttaac tgtgattata aattattcac taaaattata gcaatgcgca 3240 taaagttaca tcttgatgat ttactaggct gtgaacagtc tgcatgtaaa tcagatcgat 3300 cttgctcgga caatttaaaa gatatcagac gaattatgct tcgtgcaaat gattcaaaac 3360 gttttaaagg agcattagtt agtattgatc ttcaaaaagc tttcgataaa gtagatcata 3420 catatttatg gaaagttttg gaaaaattta attttccaga tagtttgatt agttgtattc 3480 gaaatttata tgattctgca acgtcaaagg ttctgtttaa tggttttttg actaatagta 3540 ttaagattga atcgtcggtg cggcaagggt gcccacttag catgattttg tttgtattgt 3600 acatcgagcc acttttgaga aaaattagtg acggcatcgc aggtgtactt atttatgata 3660 agttcataaa aaccattgct tacgctgacg acataaatat ttttataagg aatgatgagg 3720 aatttgattt gttaatccaa attcttaatt ccttcgagtc atttggaaaa attaagatga 3780 attttaataa gtcagtttac ttgagaataa accgtgcaaa ctttggwcca caaatgatca 3840 gggaagtaaa tagttttaaa attttaggaa ttcatttttc aaatagtata tctgaatcaa 3900 ttaatttaaa ttatgataaa gtgattttaa atgttaagca tacgctccat ctccatggtg 3960 ttagaaaatt aaatttagtg gaacgagggt ggttattaaa tgcttttatc ttatcaaagt 4020 tttggtatct tgcgcagata ttcccaccgt acaaaaaaca tattgatatt ataaattcac 4080 tagtagggca gtttttgtgg aatgggttta twtttaaaac agctcgcgat caactttatt 4140 tagattatga tgcaggaggt ataaggttgg ttaatgcgga gtttaaaatg aaagctttgt 4200 tcataagaaa cattctttac cctggaggaa atgtttctga aaattacatt ttagtatatt 4260 caaaacaaca ccaactgccc ttgtgtttta ggcagtggat cgaagttgct agactttgca 4320 aaaacgattc tttattggac agttcgaagg ctttttataa acattttttg aatcaggaaa 4380 acttcattcc aaaagttaaa gttgattttc cacaatttca atggacaaca atttggaaaa 4440 atttaaattt taatttcatt ggatctgatg aaaagtcaag gctttttgct tttttaaata 4500 atttaatacc aaacaaggaa aaactaatgg cctacaacat cggaagattg tcwaataatt 4560 tatgtgatat ttgtaatgga attgacaata acggacacag aataatggaa tgccctcaat 4620 ccaggattat ttcaaattgg acaaagaata aagtagaaaa gataagtggg ataaaattag 4680 tgaatctaga agatattctg tcatttgaga tcgatgagaa caaaaaatca gaaatagccg 4740 ctttgtggtt atgtattcta gcgatttctt tcaatttgaa gtgctaccct ggacctagct 4800 tgtttgtttt caaaaaggaa atacgtgaaa tgagatggaa taacagatta attttcaata 4860 aagtttttgg aacacatttg ctaaggttag ggatgtagga cgagagagtt aggaaagtaa 4920 aatattgtaa actgtaaact gtatttactg tgtttgacca aaatgaccgt aacaataaac 4980 gttt 4984 // ID Shinagawa-2_CQ repbase; DNA; INV; 1961 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A non-autonomous DNA transposon family from Culex DE quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Repetitive element; KW nonautonomous; Shinagawa-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1961 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 94-94 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >83% CC identity. ~7-bp TSDs. Related non-autonomous elements, named CC Shinagawa, are found in Aedes aegypti and Culex CC quinquefasciatus. XX SQ Sequence 1961 BP; 605 A; 414 C; 372 G; 570 T; 0 other; gaatctatga catttccccg aaacccacat tcccgaaaag acatttcccc gaatgccaca 60 tctccgaaaa gacactttcc cgaaatgtca tttacccgaa aatcatttcc ccgaacaatc 120 catttccccg aatggccaca tccccgaatg gcgacttccc ctaaaaacca ttttcccgaa 180 tggccacttc ccctaatttt ccacatcccc gaaaatcatt ttcccgaatg accatatccc 240 cgaacggcca tttccccgaa cgccacttcc ccgaacccgg tcaggcgggt atgcccgttg 300 cccagaaccg cgcgcgcggt tctggacaac gggcataccc gcctgaccgg gttcggggaa 360 gtggcgatca atgaagtatc atgtccacaa ttgaacgttc aaaaaattcc gagtttttta 420 cgagcgtttt attccagctt cacttaaatt ttgaatattt tccaagtctg tggagttaaa 480 ggattcgaat tccggattct gagatttagt tactccaact gcctttcaaa aagaactgac 540 tccgccgaat ccaacaaaaa ctccgactat gacaaatgat gtagagaagg agaacatctt 600 gagcagagta catatttcta atgaacaaat ctaaatagct gcgtgccgct taatacattt 660 gcctattttt ctcatttcgt cgcagtcgag ctaacttgtc gtacgtcgat tttgggccaa 720 attgaattaa gaaagccatt ttgtgcagct cacaatgcct catcttttga cctacataga 780 tacccaaaat tcgatttcaa ttctgagata ttcaataaaa accgaaaaat gtcgtctcgt 840 gctaacttga cacactctga aaattgctgt aagtgcgaca actggccaaa gaggtttcag 900 atcagaatgc gtttgacaca cgtacatgcc cgactaccgt gaacgtttgt aattataact 960 cgggacctca gcaaccaaat tcaaacaaat ttcaggacaa tgcacagaat ggtcaaccaa 1020 ataaaacgtg tttgttattg tttacatagc gtgctctatt tttcgtttat tcaaggtcaa 1080 acactaacac gctttttctc ggaacgtcaa aaagcgacaa acgacaagat agcacgacag 1140 cgtcgatttg atacattgat catttcgctt tttttgcatt tagttgaaaa tcaaaataca 1200 gtccagactc ggttatccaa aggccttgtc aaaagttcac ttcgggtaat aacccctaaa 1260 ctactttaaa gtgatttagt aattttaaac ccaaaatggc ggtgatgaaa tattaaaaag 1320 cattttgtaa ttcaataatc aactatttag atttactaaa aagttaaaca aaatctagaa 1380 caaactgatt cctgcgtttc gagtcgtcgc ttaaagatat cgttgctgtg caatccttac 1440 cacgtggata ccgttttgga aaattgtgct ttgctgaaaa ttgtccatac aaaaaaaatc 1500 ttttttattg aacgtggatg atctcaaaac ctcgctttct tgaagtttcc gcgtggttta 1560 tgaatggcct tctacggtca tttgtttttt tttcatattt aaaaacacac gtaggagaat 1620 ttaatattaa ggtcatactt ttgtaaaaat atccctaaaa aacaatttag tttatatcta 1680 agttaatatc attcgggaaa atggaacttt cggggaaata accattcggg tatatgtaaa 1740 ttcgggaaaa cgacagttcg gtaaaatggc cattcaggta aatgacattc ggggatatgg 1800 aattttcggg aaaatgattt tcggggatgt ggaaatttcg ggaaaatgat tttcggggaa 1860 atggcacttt cggggaaatg attttcggga atgtggaatt ttcggggaaa tgattttcgg 1920 ggatatggga ttcgggaaag tgggattcgg gaatatggga t 1961 // ID MARWOLEN1 repbase; DNA; INV; 1643 BP. XX AC AAGB01000014; XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 16-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Mariner-type transposon from Wolbachia endosymbiont of Drosophila DE ananassae. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARWOLEN1. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RA Salzberg S.L., Hotopp J.C., Delcher A.L., Pop M., Smith D.R., RA Eisen M.B. and Nelson W.C.; RT "Serendipitous discovery of Wolbachia genomes in multiple RT Drosophila species."; RL Direct Submission to EMBL/GenBank/DDBJ (03-FEB-2005). XX RN [2] RA Salzberg S.L., Dunning Hotopp J.C., Delcher A.L., Pop M., RA Smith D.R., Eisen M.B. and Nelson W.C.; RT "Correction: Serendipitous discovery of Wolbachia genomes in RT multiple Drosophila species."; RL Genome Biol 6(7), (2005). XX RN [3] RP 1-1643 RA Jurka J.; RT "MARWOLEN1: Mariner-type DNA transposon from Wolbachia RT endosymbiont of Drosophila ananassae (procaryote)."; RL Repbase Reports 5(9), 265-265 (2005). XX DR EMBL/GenBank/DDBJ; AAGB01000014; Positions 1 1643. XX CC The transposase homology was identified in ref. 1. The transposon CC has ~220 bp long TIRs. This transposon is classified by its host CC rather than the endosymbiont due to its similarity to other host CC transposons. XX FH Key Location/Qualifiers FT CDS 395..1396 FT /product="MARWOLEN1_1p" FT /translation="MRKLSGEIQNNIVSLTESGNSTRQIAERLGISQSAVV FT SIQKRRKLTPKPQVSGRKKLLKDSDARLMMSEMRKNKNLTPKGACLAINKN FT VSEWTARRALQDIGYMSIVKKNKPALSDKNVKARLKFAKDHKNWTIDDWKR FT VVWSDESKFNRFQSDGKQYCWIRPGDRVQRHHVKQTVKHGGGNIMVWGCFT FT WWHIGPLQLVEGIMKKEDYLRILQTNLPNYFDKCAYPEKDIIFQQDGDPKH FT TAKIVKEWIGKQHFQLMEWPAQSPDLNPIENLWSIVKRRLGQYDSAPKNMG FT DLWERVAVEWSRIPQDILRNLVESMPKRVTEVIVNKGLWTKY" XX SQ Sequence 1643 BP; 567 A; 265 C; 323 G; 488 T; 0 other; tacagtggcg agcaaaactg agtgcatgtt cacaactctc actttggcca tcaataaaaa 60 cacaaccgct tatgcaaatt caataatttt ttttatatta ttaatcttaa ttaatttata 120 acttatttta tgagaaaaaa caaatttagc atatatattc aaagacttag aataaaaaaa 180 ctaaataaac tcacataaac tcaattttgc acgttttaag tcttgtaact gtttttgatt 240 ttaacatctc ttatcagtat aaatcggcca gatttaccgt tatatttctt ttaatgctat 300 tttcggtggt cattgattag ttaaaagcgt gtttttgcag agcaaagttc tttagggtga 360 ataaatataa ataattagtg ataaaatcct taaaatgcga aaattatcgg gagaaattca 420 aaataatatt gtcagcttga ccgaaagtgg caattccact cgtcaaattg ccgaaaggct 480 tggaattagc caatctgcgg tggtctccat tcagaagagg cgaaagttga cccccaaacc 540 tcaagtctca ggccggaaaa aattattgaa ggactcagat gctcgactaa tgatgtccga 600 aatgcggaaa aataagaatt taaccccaaa gggtgcttgt ctggcaataa ataaaaatgt 660 cagcgaatgg actgctagga gagcattgca agacatcgga tatatgtcga tcgttaaaaa 720 aaataagcct gcgttatccg acaaaaatgt caaggctcgg ttaaaatttg caaaagatca 780 taaaaattgg accattgatg attggaaacg agttgtctgg tctgatgagt caaaatttaa 840 ccgttttcag agtgatggaa aacaatattg ttggattcga ccaggcgata gagtccaaag 900 acaccatgta aagcaaaccg tcaaacatgg cggtggaaac attatggttt ggggatgctt 960 tacttggtgg catattggac ctctacaatt agtcgaaggt atcatgaaaa aagaggacta 1020 ccttcgaatc cttcaaacaa atcttccgaa ctattttgat aaatgtgcct accccgagaa 1080 ggatatcatt ttccaacaag atggggatcc aaagcacaca gccaaaatag tcaaggaatg 1140 gattggaaaa cagcactttc aattgatgga gtggcctgca caaagtccag acctcaatcc 1200 aattgaaaat ctatggtcaa ttgtgaaaag acggcttggg caatacgatt ctgctccaaa 1260 aaacatgggc gacctctggg agcgtgttgc ggtggaatgg agtcggattc cgcaggacat 1320 ccttcgaaat ttggtcgaga gtatgccaaa gcgcgtaaca gaagtcattg taaataaagg 1380 gctctggact aaatattagc aataatgtag ttattaagtt ttgattgaaa cgtgcaaaat 1440 tgagtttatg tgagtttatt tagttttttt attctaagtc tttgaatata tatgctaaat 1500 ttgttttttc tcataaaata agttataaat taattaagat taataatata aaaaaaatta 1560 ttgaatttgc ataagcggtt gtgtttttat tgatggccaa agtgagagtt gtgaacatgc 1620 actcagtttt gctcgccact gta 1643 // ID Copia-120_AA-I repbase; DNA; INV; 4008 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-120_AA_; KW Copia-120_AA-LTR; Ty1_copia_Ele186; Copia-120_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4008 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1420-1944] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2038..2781,2785..4008) FT /product="Copia-120_AA-I_1p" FT /translation="MTFVGYASDSKAYRLLNTSTGQIVISRDVRFLELDDE FT DKNVKDGTVRSLPGECPNEDEQVAVRLYDCGQNAGRCDDSQDEEEFFGFDE FT DNDDPNEAVPVQEERVAVPVARDHLRRSQRENFGVPPGRYTDTIGLAVNTC FT TEPSTYREAIESQDSEKWKAAMDEEMASLRDNETWTLTKAPQRGNVVGCKW FT VFKCKPDESGKTVRYKARLVAQGFSQKYGTDYDQVFAPVVKQITFRTVPTI FT ASKRQMLKHIDIKTAYLYGVLQEEIHMKQPPGYSNGDPNTVCLLKRSLYGL FT KQAARVWNQRIDAALKSMGFHQSSADPCLYLRIANGMYSYILIYVDDIVVV FT CRTEQEYQKLVAVLSKSFKIVELGDLTFFLGIHVRRDADRFFLNQKSYVRK FT LLSRFNMVNCKASRVPMASGFVQQKEEDSDSLADGQQYQSLIGALLYIAVN FT TRPDIAISTSILGRRVTKPTTADWNEAKRVLRYLKGTENYELHLGGGDNLQ FT VECFVDADWAGEVDNRKSNTGYIFKFGGGLVGWGSRKQTCVSLSSTEAEYI FT ALAECLQELQWVRRLVDDLGEQLALPIVVNEDNQSCIALVAADRISRKSKH FT IDTKYCFVKDLASAGIVSIRYCPTEQMEADLLTKPLGAVKLQQLREAIGIL FT PHDVEEE" XX SQ Sequence 4008 BP; 1013 A; 908 C; 1249 G; 835 T; 3 other; acaggttatt ggcccagcaa acgtgacccg cggaattgag aagcgaagcc gttccagaaa 60 atatagccca ggtgaggagt tcgctgtttt ttcccgaagc tgtctgaaga tggagtgggc 120 gagaatcgtg aagctgamca actcgaacta ccaggcgtgg tcgttctccg tgaaagcgct 180 gctgcggagg gagcagctgt ggaagcacgt ggacccaggt actcctccgg aaccggtgac 240 ggacgcgtgg agcgatggcg atgagaggac gttggccacg attcacctga cggtggatga 300 gagccagtac gcgctgattc gtggcaagac gacggcgaag gacacgtggg cggctctgaa 360 ggcccaccac gagagggcca cgttggggca gcgggtgacc ctgctccgtc aaatcacgaa 420 ccagaatttc aaggtaggtg acgacatgga ggcctacctg gcggtaattg agaagcagta 480 tgcgcgtctc gaaaattccg gttttgagat gcaagagtgc ctcaaggtgg cgctgatact 540 gcggggactt ccggagtcgt tcaatcccct gacgacggca ctggaagcgc ggaacgaaga 600 tgagctcacc ctggacctcg tgaaggtaaa gctgctggat gaagcggaga agctacggaa 660 acgatccggt tccgacgagc aagtgctgcg aaccggcgga acgagtgcgt cgcagaaaag 720 tgtcgtctgc tatcactgcg gcaagccagg gcaccggaag cgaaattgta aggcgtacct 780 ggcagagtgt ggcggaaatg cggatccgaa gagagaaaag gctaaggcca aaacagtgcg 840 cgaaaacgac gcaaagtcgt tcacgtttat ggtgcggtcc ggtgagcatg atgcgagcgc 900 gtgggtggtc gattcgggcg cgacgtcaca tttggcgaac aaccgcaatt cgtttgtgcg 960 tctcgacgag agtgtgcgcc cggagatttc gaccgccggt ggtagagtgc tgcgtacggc 1020 aggtgtcggg gactgtgaga tcgaatgcgt acggtcgaac aaaaagtgtc ggttacgttg 1080 acggcgtttt gccccgcgcc gaaggacaat ttgatttcgg tgtcaaagct ggcagagaaa 1140 ggtgtgcgcg cggaattcga tgcggagcaa tgccggctgg tgtcgaggga gtattgcgac 1200 gctgatgcgt ggagtgtact ctgaacgttc gaagaatcgt tgttggtgag tgataagcag 1260 cacaacacgg attgtttgca cacgtggcat cgacgactag gccaccgaga tccagatgcg 1320 attggcgaag ttgagcggcg tggtttggct tctggaatga aaatcaagcc atgtggtatc 1380 cggatgactt gtgagagttg cgtcgaaggg aagatggcac gtccgccgtt tccacaagca 1440 gccgagaaga agtcgaaggc ggtgctagat atcgtgcaca gtgatgtatg cggtcctatg 1500 acgacatcgc cgggcgggtg ccgatattat atgacgatga ttgacgacca cagccgctat 1560 acggtcgtct attttttgaa ggagaagtcc gaggtcatcg aaaagatacg tgagtatgtc 1620 aggttcaccg aaactcaatt cggaaggaag ccgaaggtca tccgttcaga ccggggtggc 1680 gagtacacca gcaatgcgtt gcgtaagttt tacgcagagg aaggaatcaa ggcagaattc 1740 accgctggtt acgctccaca gcagaacggc gtggctgaaa ggagaaacag aacgttaaac 1800 gaaatgggtc tgtgtatgct gttggatgct ggcttacctc gtcgtttctg ggcggaagca 1860 gtgaacaccg ctgcttatmt tcagaaccgg ttgccgtcat cagcgatcgg atcgactccg 1920 cacgagatct ggttcggtac gaaaccggat ttgcaacatt tgaaggtgtt cggttgcagt 1980 gcctacgtgt ggactccgcc acaaaagagg aagaagtttg aagagaaggc gaccaaaatg 2040 accttcgtcg gatacgcgtc ggacagcaag gcttatcgtt tgctgaatac ctctactggc 2100 caaatagtca tcagccgtga cgtgcgtttc ttggagctcg acgatgagga caagaacgtg 2160 aaggatggaa ccgtccgtag tctaccaggt gagtgcccga acgaagacga acaggttgca 2220 gttcgactgt acgattgtgg ccagaatgcg ggaagatgcg atgattctca agatgaagaa 2280 gaatttttcg gattcgatga agacaacgat gaccccaacg aggcagtacc agtacaggag 2340 gagcgcgttg ccgttccagt tgctcgtgat catctgcggc gctcgcagag ggaaaatttc 2400 ggtgtacccc cagggcggta taccgacaca attggcctgg ctgtgaacac gtgtactgag 2460 ccaagtacat atcgagaagc gattgagagc caggacagcg aaaagtggaa agcagccatg 2520 gacgaggaga tggcatctct acgagacaac gagacgtgga cccttacaaa ggcgccccaa 2580 cggggcaacg ttgttggctg caagtgggta ttcaagtgta aaccggacga atcggggaaa 2640 actgtacgat acaaggccag gttggtcgca cagggttttt cccaaaaata cggaactgat 2700 tacgatcagg tgtttgctcc ggtggtgaag caaatcacat tccgtactgt gccgacaata 2760 gcaagcaaac ggcaaatgct cgwcaagcac attgatatca agaccgcgta tctgtatggc 2820 gtgctgcagg aggagatcca catgaagcag cctcctgggt actcgaatgg tgatccgaat 2880 accgtgtgtc tcttgaagcg aagcctctac ggactcaagc aggctgcacg ggtatggaac 2940 cagcggatcg atgcggcgtt gaagagcatg ggattccatc aatcttcggc tgatccgtgc 3000 ctctatttgc ggatcgcgaa tggcatgtac tcgtacatat tgatctatgt ggacgatatc 3060 gttgtcgtgt gcagaacgga acaagaatac cagaagctgg tcgcggtttt gagcaagagc 3120 ttcaagattg tcgagttagg tgacttgacg ttctttctgg gaatccacgt tcgacgcgat 3180 gctgaccgat tcttccttaa tcagaagtcg tacgtgagga agcttctatc gcggttcaac 3240 atggtgaact gcaaagcgtc ccgggttccg atggccagcg gatttgttca acaaaaggag 3300 gaggatagcg actcattggc agatggacaa cagtaccaaa gtttgatcgg tgctttgttg 3360 tacatcgctg tcaatacgcg cccggacatt gcaatctcga cttcgatact cgggcgtcgg 3420 gttacgaaac caaccacggc ggattggaac gaggcaaagc gggtgctacg ttacctcaaa 3480 ggcaccgaga attacgagct ccatctgggt ggtggtgata accttcaagt ggagtgcttc 3540 gtggatgcag actgggctgg cgaagtagac aacaggaagt ccaacactgg ctatattttc 3600 aagttcggtg gcggactcgt tggctgggga agcaggaagc aaacgtgcgt gtccctgtcc 3660 agcactgaag cggagtacat cgctctagcg gaatgcttgc aagagctgca gtgggtacgt 3720 cgactggtag acgacttggg cgagcaactg gcgttgccca tagtggtcaa tgaagacaac 3780 cagagctgca ttgcactcgt agcggcggat aggatctccc ggaaatccaa acacatcgac 3840 accaagtact gctttgtgaa ggatttggca tcagctggta tcgtatccat tcggtactgt 3900 cctacggagc agatggaagc ggatcttctt accaagcctt tgggagcggt gaagctacaa 3960 caactgcgag aagccatagg gattctacca catgatgttg aggaggag 4008 // ID I_Ele14 repbase; DNA; INV; 7294 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele14. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7294 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7294 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >86% identity, and ~92% identical to the original CC sequence in [1]. ORF1 is broken. XX FH Key Location/Qualifiers FT CDS 3200..7057 FT /product="I_Ele14_1p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MPGPALGGAPLESATRTLCPPNGFSQTYFSTAAPSPL FT PVVGSTPGPTLGGALSGSVSSTFLGSTNSKETSLSSFPVPALPTSHNFILQ FT WNLNGYRSRLCDLELIIQNNQPWILALQETNNISADEMGRTLGGQYAWTVK FT RLANQRHSAAIGILKGIPHKILDLPSDLPIVGIQLLGSPSVSVACAYLPGG FT NIPNLHGQIQNCLQALPEPRIFLGDLNSHHPVWGSPKADKRGSTLINAFEE FT EDMLILNDGSPTFFNGHFSSEIDVTAVSLSFARNIQWSVSSDLHGSDHFPI FT RIALTTGASIPKNTRRPRWKYDKADWAEYDLLIRSALRNSPPDNMADFIST FT IVESAKATIPRTSSTPGRKALRWWSDETKTAVKARRKALRQLRRLPLDHPD FT RQSTLENYRKLHLECRNIIGDAKKATWESFLESMNASQTTSELWNRVNALN FT GKRKTTPLTLQVQGNAISDPEAVASELGKYFASLSAIESYETHFXNSVKPS FT TSAINNFIIPPNTADPEMEQPFSARELSFALSRSNGRSAGPDEVGYPLLKN FT LPSVGKLALLDLYNKIWANNDFPPAWKESLVVPIPKANIPTRDPTKFRPIS FT LTCCAGKVMERMVNRRLRYHLEANGLLDFRQHAFRQGYGTSTYFAGLSDVL FT KEAYDKGNHAEVISLDISKAFNRTWTPLVLEQLAAWGIGGRTLHFVRNFLA FT DRTFQVVIGNTKSSTFPEETGVPQGSVLAVTLFLIAMNGVFSRLPKNIYVF FT VYADDIVIVVCGSTPTMTRIRAQTAVKSVAKWASDNGFQLSASKSIRCHIC FT PSGHRITGPDITIDGQPIPLRKTVRILGVIVDRALSFRQHFDTVKSACRSR FT LNLIKSISRPHRSNNREIRFRVARAIVDSRLVYGLELTSIAMDRLVEVLSP FT IYNSYIRIISGLLPSTPSDSACVEAGLLPFRIFIRSSICCKTAAFLSKTAG FT EDRVFLLDEGNRALSTAANLSLPPVTRVHWLGDRSWRSVPPKIDNKIKNSF FT SAGSNSAALRRSVAELLQTSYSDFALRYSDGSLTSAGVGIGVAGDIPDVSM FT SLPSQCSVFSAEAAAAFIAATTPSDRSILVLTDSASVISALQSDSPTHPWI FT QAILKYALPDTVFTWIPGHCGVPGNEMADHLARSGLSGQRYTSEVPFMDLK FT RWIKSTFRQHWEDSWYRTRTLFLRKIKNSTTTWSDLPILKDQRILSRLRTG FT HTLISHNMGGGPFHKECESCHIPASVDHVLCACPMYEHLRQIHGLSDNIGE FT VLRDDATTIAALLSFLRDANLYSSI" XX SQ Sequence 7294 BP; 1894 A; 2065 C; 1686 G; 1634 T; 15 other; cagttgacag ccgaccgccg tcgcgtacag tcgcctttcg cattgttccg tactgttaat 60 atcaattaac agcgcatcag ccaactcgat cgttgtctga cggggtgctc cacagcgggg 120 cgagtgtgca ttgttttatt ccggatcagt gcagcaccca gtggtgtggt gagaagtgag 180 gttaaataga ctcggtagcg tggttcgagt actgtatakt attatttcgg gcgcgagtgt 240 atgtgttagt aacttgtgag tggtataaat aaaattatct tcaaattaaa aagcggttca 300 gtggttttca actgtagtgt gactgaccaa ccaaaacagg tgagtcttcc acgagtgcct 360 ccttccggcc cgccggcgcc atgctggcga acggccatcc cctccaccca ggggacccag 420 ggggccccgg aggaggattt ggcaaagtag gcaacagagt tctcggcgac tacaccggtg 480 cattgctacc ggagtttatg gaccgtgaag gtactgcagg acaactgcag tatctaagaa 540 tggaagcgac tgctggaacc atgccccagg atccgtttct tcttcggcta tccgtcgaaa 600 aggccattgg aggtcccatt ttcggggcct tcaaggaaaa taaaggcatg tcctacgtgt 660 tgaaggtgcg cagtcaagac caggtmaata agctcctccg catgagccaa ctgcaagata 720 amaccccaat ccggatcgct gagcacccgc ggctmaatca aagggaatgt gtggtgacca 780 accatgacac aatcggacta tcggatgact acctgcttca acagctctcc tcccagggaa 840 tcaagggmst gcgaaggatc acgcgacgcg gaccggcatg gccaaagccg tggtaaacac 900 cgccaccatc gtactcacca tcaagggcac cgttattccg gagcatgtcg ttttcggttg 960 gtcacggtgc aagacgacgt cttttctacc catctcccat gctatgcttc cgctgctggg 1020 agtatgggca taccggtaag cgttgcacaa gcccccagca gcatctgcgg acgctgctgc 1080 gaggtacacc cggaagaaca aacccccgat cctacaaaca cccaggaagg agccgccagt 1140 gcccacgttg cgtcgaaaac gcggtaccag tgcaccgcag agccgttgct gcaagaccag 1200 tgcaaaacag gagacagcac tcagtctcaa gccgcaagtg tccggtgtac wwgaaggaac 1260 aggccataca acaccttcgc gtcgacgtgg acattccgta tccgcaggct cgacgtgaat 1320 acgaagccca acaagcgagc aagtcgaagt ggtgccaact tctgccggtg tggtcagtgc 1380 cagcaaggac gctgaaatcg acagtctgaa ggccatggtg gctgcaaaac tccgcggacg 1440 gagaagaaaa gaggatggct gaaatggaac gggctctgca gagccgcacc gtcagcgacc 1500 gcctcgaggc ggttaaagaa cacggcacca tcggagagct ggtcaagcaa gtggcaaccc 1560 taactgaaac ggttcagaac ctccaaaatg ctctagcagt aaggacaaac aaatcccgga 1620 tcaaagtcgc accatcgctg agcttctagg caggcaatca gctgccgtcc aacgacaaac 1680 ccgcgacaag ctttggctta ccgcgcaaga sgatcaaacg caaaccgccc caagtcgcca 1740 gaggttcccg tcatgaaacc gcccagtctg aaccccaaag tggacgtaca ggtgtccaaa 1800 tggatttcaa atataacacc ggacctcaag aaactctcac aggcgatgaa accaacgatg 1860 gaaatgcgtc tgatcatagc atgatgtcag ccacatccga agccactgga atctccaaca 1920 cgtctcacgt ttcgaaccct acgaaaagaa tgcactcaac ctcggatcac agttgcccat 1980 ccggcgattc aaatgcctca tccaccaact cgaagcggaa aaacaagcac cgcaaaagag 2040 gtggtagaac caaacagtaa aacagtaatt tacgctcagt cctcttcaag cacatggtct 2100 aaacaattcg acaatatttt cttcccgtca tcttcactcc gcaaattagc gttgagacag 2160 tcaaatcaca agagtcgact gagcaaatac cgatgaacac actacagcct ccgaacgaag 2220 tactcacgga caactcagcg acaacagggg tcacggacag ccggggcccc gtcagtgcgg 2280 acgctacacc ccaaccggaa ctggcggaca acccccggca tccgggcgta actgttgtta 2340 tcgggacgca aaagggacac agtgcctatc ccccgaaagt cccaactagc tgtatgggta 2400 cgccagggcc ccgtcggtgc ggaagttttg gaagaaccgg aaccggcgga caacctctgg 2460 cgcccatata cggtctggaa gagggacgca aaagggatca acatcctatt cccctkagga 2520 cagcgtggga agcaagccac aggaaacctt ttttctgccc ctgccaacac acagagggtg 2580 aaggtcggga cgcccaatac cccgaaggac aacgagggaa gcaagcctcc gtcgagtcga 2640 aacccccccg gaasagaaaa ccgccggacc actacacctc caactggcac aaccaccccg 2700 gtcagcgaca agctgaatcc cccgacttcc cgggcagcct tgggacccgg aagcaagcga 2760 cgccaggttg tatgacctgt tacacaacgc tcgttacaat cgctcagccc gccgacttca 2820 gcccgggtat tccggaaccg tttccagact tcaactccaa acgcggaagc atacgaaaac 2880 cgatcgccat ccgcgataat katagctcaa cccgctgctg gcccctcaag ccaaacagcc 2940 tccccagatc cctatcagga tcgccctggg cacctcggac tcaacaatcc gagcggtaag 3000 gccgaccctt tcccgctggg cctccacccg gtgaatcttc gtccgccaca acaacctatg 3060 tcaacccatt tttcgacgcg gctcccagtc ttcttcgctc tcaaactcac agcagcactc 3120 cttgataagg tttcctagca gccctgttgg aaacgactgt cttgccgcag ctcccgaagt 3180 tccggcagta gtaggtccca tgccgggtcc cgcgcttggt ggggctccgt tggagtcagc 3240 tactagaacc ttatgtcctc ccaatggctt ttcccaaacc tacttttcta ccgcagctcc 3300 ttcacctctg ccagttgtgg gttccacacc gggacctacg ctcggtggag ctctgtcggg 3360 gtctgtatcc agcactttcc tcggttcaac taattccaaa gaaacttctc tctcaagttt 3420 tccggttcct gctcttccta cgtcacataa tttcattctg caatggaatt tgaatggcta 3480 caggtctcgc ctatgcgatc tggaattgat tatccagaat aatcagccct ggatcttagc 3540 ccttcaggaa acaaataata tatccgccga tgaaatgggt cgcacacttg gtggccaata 3600 cgcctggact gttaaacgat tagcaaatca acgacattct gcggcaatag gaattctcaa 3660 aggaattccc cacaaaattc ttgacctacc ttccgacctt cctatagttg gaatacagtt 3720 actaggttcc ccgtccgtct ctgtagcctg cgcttatcta ccaggtggga acatcccaaa 3780 ccttcacggt caaatccaaa actgtcttca ggccctccct gaacctcgaa ttttccttgg 3840 tgatctcaat agtcaccatc cggtctgggg ctctcccaag gctgataaac gcggcagcac 3900 cttgattaac gccttcgaag aggaagacat gctgatttta aatgacggct caccaacttt 3960 tttcaatggc cacttttcca gcgaaattga cgtcaccgca gtctcgctct cttttgccag 4020 aaatattcag tggagtgtaa gttctgatct tcatggtagt gaccactttc caatccgaat 4080 tgcattgacc accggcgcgt caattccgaa gaacacgagg agacctagat ggaagtacga 4140 taaagctgat tgggctgagt acgatttgct tattcgttct gcccttcgaa acagtcctcc 4200 tgacaacatg gctgatttca tctcaaccat agttgaatcc gctaaggcca ccattccccg 4260 caccagcagc acccctggcc ggaaggccct tcgatggtgg tctgatgaaa ccaagactgc 4320 agtaaaagcc cgcagaaaag cgctgcgtca actacgaagg cttccacttg accatccgga 4380 caggcagtct accttagaga attatcgcaa gctccaccta gaatgtcgga acatcatcgg 4440 tgatgccaag aaggccacgt gggagagttt tttggagagc atgaatgcat cccaaactac 4500 cagtgaactg tggaatcgag taaatgctct gaatggaaag cgaaaaacta cacctctcac 4560 tcttcaagtg caagggaatg ccatatctga tccagaagca gttgcaagtg aacttggcaa 4620 atattttgcc agcctatcag ccattgaaag ctacgagact cactttttka attctgtcaa 4680 gccttccact tccgctatca acaattttat cattccacct aacactgccg atcctgaaat 4740 ggaacaacca ttttcagcca gggaactctc cttcgcgctt agtcggagca atggaaggtc 4800 agcgggtccc gatgaagtcg gatatccgct cctcaaaaac cttcccagtg tcggaaaact 4860 tgcgctgtta gatctctaca acaagatctg ggccaataat gatttccctc cwgcttggaa 4920 agaaagtcta gtagtcccca tacctaaggc taacatcccc acccgcgacc caactaaatt 4980 tcgaccgata tccttgacct gctgtgccgg aaaagtgatg gaaaggatgg taaatcgaag 5040 gctaagatat cacttggaag ccaacgggct actggacttc cgccagcatg cctttcgcca 5100 aggctatgga acatctactt acttcgcagg tctcagtgac gtcctcaagg aggcctacga 5160 taaaggtaat cacgccgaag taatttcact tgacatctcc aaggctttca acagaacgtg 5220 gaccccactt gttctggaac aactggctgc ttggggcata ggcggaagaa ccttgcattt 5280 tgtgcgcaat ttccttgccg ataggacctt ccaagtagtt attggaaaca caaagtcatc 5340 caccttccct gaagaaacag gcgttcccca gggttctgtg ctagctgtaa ctcttttttt 5400 gatcgcaatg aacggcgtct tctcccgtct gccaaagaac atatacgtat tcgtttacgc 5460 ggatgatata gttattgttg tttgtggatc aactcctact atgacgcgga tccgagctca 5520 aaccgctgtc aagtcggtag cgaaatgggc atcagataat ggtttccaac tgtccgccag 5580 caagagcatc cgttgccaca tttgtccatc cggacatagg ataactggcc cggatatcac 5640 gatcgacggt caaccaattc cgcttcgtaa gactgttcgc attctcggag taatagtaga 5700 tcgggctctc tccttccgtc aacattttga taccgtcaag tcagcgtgtc gatctaggtt 5760 gaacctgatt aaatccatct cccgcccgca tcgatcaaat aatcgagaaa tccgcttcag 5820 agtcgcccgt gccattgtag acagtcggct cgtatacgga ttggagctta ctagcatagc 5880 gatggataga ttggtcgaag ttctcagccc aatttacaac tcctacatca gaataatctc 5940 tggtctgctt ccatcaacac cgtcagattc cgcttgcgtg gaagctgggc ttcttccatt 6000 tcgaatcttt attcgctcct ccatctgctg caaaacggca gcattcctca gcaagacagc 6060 cggcgaggac agggtctttc tccttgatga ggggaacaga gccctaagca cggcagccaa 6120 cttgagtctc ccgccggtta ccagggttca ctggctcgga gacaggagct ggcgttccgt 6180 cccaccaaaa atcgacaata aaataaagaa cagtttttct gccggcagca actctgctgc 6240 tttgcgacga tcagtcgcag agctgctaca aacttcttat tccgattttg ctcttcggta 6300 ctccgatgga tcccttacaa gcgccggtgt cggcataggg gtggccggtg acatcccgga 6360 cgtaagtatg agcctgcctt cacagtgttc agttttttcc gccgaggcag cagcagcttt 6420 tatcgcggct accacccctt cggaccgttc gatcttagtc ctaaccgact cagccagcgt 6480 aatatctgcc ctacagtcag attcgcctac acacccttgg attcaagcaa tattgaagta 6540 cgcactacct gacacggttt tcacgtggat ccctgggcac tgcggagttc cagggaacga 6600 aatggctgat catcttgcca gatcgggcct ttcaggtcaa cggtatacct cagaggttcc 6660 tttcatggac cttaaacggt ggatcaaatc taccttccgc caacattggg aggactcatg 6720 gtaccgcaca agaacgcttt tccttcggaa gattaaaaac tcgaccacta cttggtcaga 6780 cctkccaatt ctcaaggacc aaaggattct atctcgctta cgaaccggac acaccctaat 6840 atcgcacaat atgggaggcg ggcccttcca caaagaatgc gagtcttgcc atatcccagc 6900 ctctgtggac catgtccttt gtgcctgccc catgtacgaa catctgcgac aaatacacgg 6960 cctttctgac aacatcggag aggtactacg cgacgatgca actactattg ctgctttact 7020 gagttttcta cgcgatgcca acttgtacag cagtatatga tcctgcccaa tatgacgacg 7080 acattgatgc cagcccacgg atacgaaact gttttactaa attgtactat gtatagatgt 7140 aagcttgaaa tgttagattg acagttatct ccttttcatc ggttcagtct tctgctgacc 7200 gtatttctcc agggtctcag ccttctgctg agccttctcc cgtgttgaac tagcataatg 7260 ttaaaaaaca cgttaataaa gatgaaaaaa aaaa 7294 // ID Copia5-NVi_I repbase; DNA; INV; 4159 BP. XX AC AAZX01004556; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia5-NVi; KW Copia5-NVi_I; Copia5-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4159 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1131-1131 (2007). XX DR Genome; AAZX01004556; Positions 5415 1257. XX CC Positions [1572-2072] - Integrase core CC 'TAAAG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 3107..4135 FT /product="Copia5-NVi_I_1p" FT /translation="MKRIGFKQADADNCVYIGLHENDTVYIALYVDDGLLL FT AKNQKTLDRLIAILQEEFKITSGALKFFVGMEISRTEDGIFIKQTNYIKRI FT LEKFHMEGANPIRTPADQHSKLVSPLEPQTNFPYREVVGSLLFLAMVSRPD FT IAYAVSVTSRFLDNHDQTHWRAVKRILRYLKGSENIGLFYPSNSESDTLVG FT YSDADYAADLDTRRSTTDYVFKLYDACVSWSSKRQATVSLSTTEAEFIAAS FT EATKEIIWLRKLLSNIEHGCAQPTILHIDNQSAIKLSRNPEFHRRSKHIDV FT RYHFICEKVKNNEVNTKYVNTHDQCTDMFTKPLCCEKLNKLLNKINVTNRI FT " FT CDS join(135..1325,1329..3062) FT /product="Copia5-NVi_I_2p" FT /translation="MAETSDLKNISKFNGQNYQLWKFQMRAAFIAHDLLEI FT VEGTDKKPASTTTNAADVFKWSKKDAKAMFFLSDAMEYSQLEYLITCSTSN FT NYRMSPGDSIPQHIARVENLASQLKDIDQAISDTMIMAKILSTLPAKYNAF FT VSAWESVVADDQTLHNLRERLLREESRMTTMDNLDNALATLSLPKEKNQGH FT RPQSGNNANSNQRKKSITCDFCKKPNHIARFCFARKRSKNSNVNSNGNDQN FT NSRKQMDGDSANLTAFTVTKNDSNSESASANVAENPAWLSAIGDRDSWILD FT SGASRHISCCREWFRELRMTGQDEFVYLGDETRLKVEGYGDVYIKRFINNA FT WLDGVIRNVLYIPSLARNLLSSGVCTFDGHSIVFDREQANIFSRENTLIAR FT GIKRNNLIKMLFIPESRLDANYTSVAPLSLWHERMGHINCNRIKHMLNNDV FT VSGVLVSDKKDFFCESCPPGKQFKLPFHKTEKNANVAPGEIIHSDLCGPMQ FT TPSVSGAKFFVLFKDEATGFRTVYFRRHKNDRYDRLKCFVNLIKNQFGKDM FT KILRSDNGLEYKNKRVTEFLESRGIKHQTSALYTPEQNGRAEREMRTIVEC FT ARTMLLSRNFPTRLWAEAVNTAVYILNRSLGAQSNDATPFELWSKQKPDLS FT HLRIFGCDAYAHVPNALRKKWDEKSKKLIMVGYDADSANYRLFDPRTGRVT FT ISRNVSFNENLHERKSRNGEEAYIDFDNEVCETLQNDDRIPRAKPHNENDE FT NRADDAAADEPALGEADVHEQLDDDDPGAANADAIAANDRNPRIAEIARPN FT LRPRDKLLRPQRYRADTVVCESPTTYKDALNRADSDLWKIAIQNELEAHAR FT NRTWDIVELPKGRKAIGYKWTFKIKHAIDESEQQCKARLCAKGYSQEAGTD FT YDEFFSPVARFESIRVLLAIAARENLCSMQFDVSTAYLNSELNELIYMRVP FT DGLNVSDKNLVLKLNKAISD" XX SQ Sequence 4159 BP; 1282 A; 1006 C; 948 G; 923 T; 0 other; ggttatgggc ccaggaaggt gtggcttttt ttccagtttg tccatcacca acaacgctag 60 tgaaagtttt ttttcttttt tcgtgtttcg tacgttttgt gttcactcgc ttttgagaaa 120 tatcgagaaa caagatggcg gaaacttcag acctcaagaa tatctcgaaa tttaacggtc 180 agaactacca gctctggaag ttccaaatga gagccgcctt tattgcgcac gaccttctgg 240 aaattgtgga aggaacagac aagaaaccag cgagcacgac caccaacgca gccgatgtat 300 ttaagtggtc taaaaaggac gcgaaagcca tgttttttct gtctgacgcc atggagtaca 360 gccaacttga atacctcatc acgtgctcaa cttctaacaa ctaccgcatg tcacccggag 420 actcgatacc gcaacacatc gcccgagtgg aaaaccttgc cagccaacta aaagatatcg 480 accaggcgat atcagacacc atgataatgg cgaagatcct tagtacgctg ccggccaagt 540 acaacgcctt cgtctcagca tgggagagcg tagtggccga tgaccagacg ctgcacaacc 600 taagggaacg actgctacgg gaggagagcc gcatgacgac catggacaac ttggacaacg 660 ctctcgccac cctcagccta cccaaggaga agaaccaagg tcatcgtcca caaagcggta 720 acaatgcgaa ttcaaatcaa cgtaaaaaat cgattacgtg tgacttttgc aaaaaaccga 780 atcatattgc gcgtttttgt tttgccagaa agcgatcaaa gaattccaac gtaaactcga 840 acggaaatga tcaaaacaat agccggaaac aaatggacgg agattcggcg aatctcaccg 900 cgttcacagt tactaagaac gactcgaact cggagagtgc gagcgcgaac gtcgcagaga 960 accctgcttg gctcagcgca ataggggaca gagattcgtg gatcttagac agcggcgcgt 1020 cgcgtcacat ctcgtgctgc cgcgaatggt tccgagaact tagaatgacc ggccaggacg 1080 agtttgttta cctcggcgac gagacgagat taaaagtcga aggttacgga gacgtataca 1140 taaagagatt tataaacaac gcgtggctcg acggtgtaat tcgcaacgtt ttatacatcc 1200 catcgctcgc gaggaaccta ctatcctcgg gcgtttgtac cttcgacgga cactcgatcg 1260 tttttgaccg cgaacaagct aacatttttt cgagagaaaa tactttgatc gctcgaggca 1320 ttaagtagcg aaacaatttg ataaaaatgt tgtttatacc tgaatcaaga ctagacgcta 1380 attacacttc tgttgctccc ctgagcctgt ggcacgaacg tatgggccac attaattgta 1440 atcgcatcaa acacatgtta aacaacgacg tagtttcggg cgtgcttgta agcgacaaaa 1500 aggatttttt ctgcgaaagc tgtccacccg gcaagcagtt caaactgcct ttccacaaga 1560 ccgaaaaaaa cgcgaacgtc gcgcccggag aaataataca ttccgatctc tgtggtccca 1620 tgcaaacacc ttcagtcagc ggtgcgaagt tttttgtatt gtttaaggac gaagcgaccg 1680 gctttcgcac cgtatatttt cgcaggcata aaaatgacag gtacgatagg ttgaagtgtt 1740 tcgttaatct gataaaaaat caattcggga aagacatgaa aattttacga tcggataacg 1800 gccttgagta taaaaataag cgcgtgacag aattcttaga atccagaggc atcaagcacc 1860 agacgtctgc tctgtacact cccgaacaga acggtcgcgc ggaacgtgag atgcgcacca 1920 tagtcgagtg cgcgcggacg atgttgttgt cgcggaattt tccgacgcgt ctgtgggccg 1980 aagccgtcaa tacggctgta tatatcctaa accgatcgtt gggggcgcag tcgaacgacg 2040 ctacgccttt cgaactctgg tctaaacaaa aacccgacct gtctcatttg cgcatttttg 2100 gatgcgatgc ttacgctcac gtacccaacg ctttacggaa gaaatgggac gaaaagtcaa 2160 aaaaattaat catggtaggt tacgacgcgg actcggcgaa ctataggcta ttcgatcctc 2220 gaaccggacg cgtaacgatc tcaagaaacg tctcctttaa cgaaaaccta catgaacgca 2280 agagccggaa cggcgaggaa gcttacatcg atttcgacaa tgaggtttgc gaaaccctgc 2340 aaaacgacga tcgaatacct cgcgcgaaac cccacaatga gaacgacgag aaccgagcag 2400 atgatgcagc cgcggatgaa ccggcattag gcgaagcgga cgtacatgaa cagctcgacg 2460 acgacgaccc cggtgccgca aacgccgacg ctatcgcggc gaacgatcga aatccgcgca 2520 tcgccgaaat cgcacgtcca aatctacgtc cgcgagataa attgctccga ccacagcgct 2580 atcgcgccga caccgtcgtc tgcgagtccc cgaccacata taaagacgcg ttgaaccgcg 2640 ccgactccga cctttggaaa atagccatcc aaaacgagct agaagcacac gccagaaatc 2700 gcacttggga tatagtcgaa ttaccaaaag gacgcaaggc gataggttat aaatggacgt 2760 tcaaaataaa acacgcgatc gatgaaagtg aacagcaatg caaagcgcga ttatgtgcga 2820 aaggatattc ccaagaagct ggaacagact acgacgaatt cttttctccc gtcgcacgtt 2880 tcgaatcaat ccgggtactt ttagcaatag ccgctcgaga aaatttatgc tctatgcaat 2940 ttgacgtaag cacggcttac ctcaacagcg aacttaacga gcttatctac atgcgtgttc 3000 ccgacggatt gaacgtcagc gacaaaaatc tcgttctaaa actgaacaag gccatttcgg 3060 actgaagcag tcaggccgtt gttggaacga aaaattcgac cgatttatga aacgtatagg 3120 ttttaaacaa gccgacgcag acaactgcgt atacataggc ttacacgaaa atgatactgt 3180 atacatcgct ctatacgtgg acgacggtct attattagcc aaaaatcaaa aaacattgga 3240 caggttaatc gcgattcttc aagaggaatt taaaataact tctggcgctt taaaattctt 3300 cgtcggtatg gaaatttcgc gtactgagga cggtatcttt ataaagcaaa cgaactatat 3360 aaagcgaatc ttagaaaaat tccatatgga aggtgcgaat cctatcagaa cgccagctga 3420 tcagcattcg aaacttgtta gcccgttaga acctcagacg aattttccgt atcgtgaagt 3480 cgtaggatcg ctattattct tggccatggt ttcccgacca gacatcgcgt acgccgtaag 3540 cgtgactagc cgcttcttag ataaccacga ccaaacccac tggcgcgctg tcaaacgtat 3600 tttgcgctac ctaaagggaa gcgaaaacat cggacttttc tacccttcca acagcgaatc 3660 cgatacacta gtcggatact ccgacgctga ttacgcagcg gatttggaca cgaggcgatc 3720 taccaccgac tacgtattta aattgtacga tgcatgcgta tcatggtctt cgaagcgtca 3780 agctaccgtc agtctaagca caacggaagc cgaattcata gcggccagcg aggcgaccaa 3840 ggaaatcata tggttaagaa aactattgtc aaatatcgaa catggatgtg cgcaaccgac 3900 aattttacac attgataatc aaagcgcgat aaagctatct cgaaatccag aattccaccg 3960 cagatcaaag cacatcgatg tacgatatca ttttatatgc gagaaagtaa aaaacaacga 4020 agtgaacaca aaatatgtaa acacgcacga tcaatgtacc gatatgttca cgaaaccatt 4080 atgttgcgaa aaactgaata agttacttaa caaaataaat gtaacgaacc gcatttaaac 4140 gacgctcaag cagtggggg 4159 // ID Jockey_Ele2 repbase; DNA; INV; 4422 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Jockey clade non-LTR retrotransposon family from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey_Ele2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4422 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4422 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 16 sequences with >97% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 178..1530 FT /product="Jockey_Ele2_1p" FT /translation="MIRKNYDKSKVRITSTQTEISTMVDNDLLTANVQQRR FT RHNSTDENSMRPRNQSSDSGHQFSSQPIAGCSNANNVLIAVPNVPTENPFD FT TLMDNEELQERVTPQQSASKIHCPPIFVQNGTVKDINKLMSSLEVGEKNYA FT QKIIKGGIRLHVKEKTKFTVVVAALKSENVKFFTHGTSDEVPIRIVLAGLP FT VLDLEEVREELKQANVLPVEVKLLYSSKDEDSALYLLKFPKGAVKLKELQK FT IKMLFNVVVSWRFFSRRIGEVIQCYRCQKFGHGMRNCNMDAKCVKCGELHL FT TKDCTLPPRRATDDRSKIRCANCSQNHTSSYKGCPARKNHIQENEEKKKMQ FT SSRRRDAPASSHAPGGRSFRSTFVTPSKSFADAIKDGSSATVVAAAAVAVD FT GAAGGGGYAGPDQSELFSLHEFMNLASDLFTRLSSCKTKAQQFLALSELMI FT KYVYNG" FT CDS 1517..4195 FT /product="Jockey_Ele2_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MCTMDNTTSLNILNWNSRSIRNKESEFFNLLDNQNID FT IAVVTETWLRPDISIYHHDYTCTRLDRSSSEATRGGGVAIFIRKGIKFTQS FT GGLNTNVIEALQVTVHTDATPIHLVAAYFPGSSNLSTLNKFKRDLRTITGF FT SEPFFVIGDFNARHRYWNCQKMNKAGEILFREYETSDFFIHFPAGFTRMAP FT NGRSTSTLDLVLSGNSMDMSVPETIQDLSSDHFPVMFTINADTPFKNEFRQ FT YRNYKRANWTVYVNSVSRSLDLTSDVINNLDTEDKIDTAIETLVTTLVEAE FT RVSVPVVNCNPQKRDLPDRIKLLIRLRNVRRRQYCRTRDPLLRTIVENLNL FT QIRNESATYKFRNFGTTIRNLANGSSKFWKISKNLRNKIKYQPPLRCGNRL FT MVSDAEKANTLAEHFASSHNNQLRSDPETTSEVHRAMEDLESHALHNIDVA FT TFTKPSEIKKIIQRLKSKKAPGQDGIRNAMLKRLPRKGLVYLTKVFNACLK FT LTYFPEKWKHANVVAIPKANKDVTLPTNYRPISLLSSLSKILERLILNRLN FT RHLEANPVVPNEQFGFKNGHSTNHQLARITRLVKQGFSAKKSSGMILLDVE FT KAYDSVWQEAIIFKLQKANCPLYLVKLIKSFLLGRSFAVSVNGCTSGVHRI FT PCGVPQGSVLSPTLYNIFTSDVLIVDGVTYAFFADDTAFLATDDDPKIVTC FT KLQHAQNKLQEFQHKWRIKTNATKSQAIFFTRRRSPRFLPTAEINVNGSNI FT PWKAEVEYLGLTLDQKLLFDKHINRAINKANCLSRSLYSLINRRSPLQIAN FT KLLLYKCVFRPVLTYGCPVWGNCALSHLRRLQVKQNKLLKMMLDLSPWYCT FT EEVHEIAEIETIFRFVEKAAERFRLSCEMSNNPLIVAIFP" XX SQ Sequence 4422 BP; 1386 A; 980 C; 906 G; 1150 T; 0 other; cagttcatgc cgaaacgtca acaaggctag atcgatttgt tcatacgtat actcgttact 60 gaataatttc tcaaaataca gtccgaaaaa tcaaaaactc tgggtaaacg taccccgggc 120 gtgtcgtcta acgtcgcgag ttcggaatgt agccccaatg ggggaccgtg ttccataatg 180 attcgtaaaa attacgataa aagtaaagtc cgcataacct caacgcaaac agagattagt 240 accatggttg ataatgatct cttgaccgcc aatgtccagc aaaggcgtcg tcataactcg 300 actgacgaaa acagtatgcg gcctcgaaac cagagcagcg atagtggtca ccagttttca 360 tctcaaccga tagcaggctg cagcaatgcc aacaatgttt tgattgcggt tccaaacgtt 420 ccgacagaaa atcctttcga tacgctcatg gacaatgaag agttgcaaga aagagtaaca 480 ccacaacaat cagcatcgaa gattcactgc cctccaattt tcgtgcaaaa cggtactgtg 540 aaggacataa ataaactaat gtcctcccta gaagtaggtg agaaaaatta cgcacagaaa 600 atcatcaaag gcggcattcg tcttcacgtt aaagaaaaga cgaaatttac cgttgttgtt 660 gctgccctca aatcggaaaa tgttaaattt ttcacccacg gtacaagcga cgaagttccg 720 atcagaatag tgctagctgg tctgccggtg ctcgatttgg aggaagtgcg agaagaactg 780 aagcaagcaa atgttctccc agtagaggtc aagttactgt actcatcgaa agatgaagat 840 tcagctctgt acctgctcaa attcccgaag ggtgccgtga aattgaagga actgcaaaaa 900 ataaaaatgc tgttcaatgt agttgtcagt tggcgctttt tctcaaggag gattggagaa 960 gttatccagt gttatcggtg tcaaaagttc ggccacggaa tgcgcaactg taatatggat 1020 gcaaagtgtg tcaaatgtgg ggaactgcat ctcacgaagg actgtacatt accgcctcgt 1080 agagcaacgg atgatcgttc aaaaattcgt tgcgccaatt gcagccagaa tcatacttct 1140 agttacaaag gatgtccagc aaggaaaaac cacatccagg aaaatgaaga gaagaaaaag 1200 atgcaaagct cacgaagaag ggatgcacct gcttcgtccc atgccccggg aggtcgcagc 1260 tttcgttcca cgtttgttac tccgagtaaa tcattcgctg acgctatcaa agatgggtct 1320 tctgctactg ttgttgctgc tgctgctgtt gctgttgatg gtgctgctgg tggtggtggt 1380 tatgctggtc ccgatcaatc ggaactattt tcgctacacg agtttatgaa tcttgctagc 1440 gatttattca ctcgtctctc ttcgtgtaaa acgaaggcac agcaatttct cgccttgtcg 1500 gaattgatga tcaagtatgt gtacaatgga taacacaacc agcttgaaca tacttaattg 1560 gaacagtcgt tctattcgta ataaggaaag cgaattcttc aatctactgg acaaccagaa 1620 tatcgatatt gcagttgtga cagaaacctg gttacgtcca gatatctcca tctaccatca 1680 tgactacaca tgcactcgct tagatcgttc ttcgtcggag gcaacccgag gaggaggcgt 1740 ggcgatattt attcgaaaag gaattaaatt cacccaatct ggtggactga atactaacgt 1800 catcgaggcc ttacaagtaa ccgttcacac cgacgctact ccaattcatc tagttgctgc 1860 ttatttccct ggatcatcca acctaagtac gctcaacaaa ttcaaacgtg atcttcgtac 1920 cataacaggg ttctctgagc catttttcgt aattggtgat tttaatgcca gacaccggta 1980 ctggaattgc caaaaaatga acaaggcggg tgaaatactt ttccgggaat acgaaacatc 2040 tgacttcttc attcatttcc cagctggttt tactcgaatg gctccaaacg gtagaagtac 2100 gtctacactt gatctagtgc tgtcgggaaa ttccatggat atgtcagtcc cagaaacaat 2160 tcaagatctt tcatcggatc attttccagt gatgttcact atcaatgctg atactccctt 2220 caaaaatgag ttccgtcagt acaggaacta taaacgcgca aactggaccg tttatgtgaa 2280 ctcagtctcc agaagtctgg atctaacatc tgacgtcata aataacctgg atactgaaga 2340 caaaatcgat actgccattg aaacgcttgt taccacatta gttgaggctg aaagagtttc 2400 ggtgccagtt gtcaactgta atcctcagaa gcgcgacctt ccagatcgga tcaaacttct 2460 gattcgtttg aggaacgtac gaagaaggca atattgccga actagagatc ccttgctgcg 2520 aacaattgtt gaaaatctca atctccaaat ccgcaacgaa agtgccacct acaagtttcg 2580 aaattttggt acaaccatcc gtaatctagc aaatggtagc agcaaatttt ggaaaatcag 2640 caaaaatctc cggaataaaa tcaaatacca accacctctt cgatgcggaa accggctcat 2700 ggtgtctgac gcggaaaagg cgaatacatt agcagagcac tttgcaagct cacacaacaa 2760 tcaacttcgc agtgatccag aaactacctc agaagtacat cgggcaatgg aggatttgga 2820 aagtcacgct ctccataata tcgacgttgc cactttcact aaacctagcg aaataaagaa 2880 aatcatccag agactaaaaa gcaaaaaagc accaggacaa gacgggattc gtaatgctat 2940 gctcaagcgg cttcctcgaa aaggtttggt ttatttaacc aaagtcttca acgcttgcct 3000 gaaactcacg tactttcctg agaagtggaa gcatgcaaat gttgtagcca ttccaaaagc 3060 caacaaagac gttacccttc ctacaaacta tcgtccaatt agtttgctga gcagcctgag 3120 taagatctta gaacggctaa ttctaaatcg cttgaatcgc cacctggaag caaatcctgt 3180 agttcccaac gaacaattcg gcttcaagaa cggtcattca actaaccatc aactcgccag 3240 aatcacgagg cttgttaaac aaggattttc ggctaaaaag tcatctggta tgatattgct 3300 agatgtagaa aaggcttacg actccgtttg gcaggaagcg ataattttca aactccaaaa 3360 agcaaactgc ccgctatacc tggtgaagtt gataaagtcg ttccttttgg ggaggtcctt 3420 tgctgtatct gtcaacggtt gcacatcagg tgtccatcgt attccatgtg gagtacctca 3480 aggatccgtc ctgagtccta cactctacaa catattcacc agtgatgtgc tgatagtgga 3540 cggcgtcacc tacgcatttt ttgccgacga cactgcgttt ctagcgactg atgatgaccc 3600 caaaattgtg acctgtaaac tccaacatgc ccagaataaa ttacaggaat ttcaacataa 3660 atggcgaatc aaaaccaatg ctacgaaatc tcaagcaata tttttcacca gaaggcgatc 3720 accgagattt ctaccaacag cagaaataaa cgtcaatgga tcgaacatcc cctggaaggc 3780 ggaagtagaa tatttggggc taaccctaga tcaaaaacta ttgttcgata agcacataaa 3840 tcgggccatc aacaaagcca actgtttatc tcgctccctt tactcattga taaatcgtcg 3900 atcaccactc cagatagcca acaaactcct tctctacaaa tgtgtgttta ggcctgtcct 3960 cacatatggc tgcccagttt ggggaaattg tgcgctgtcc cacctacgac ggcttcaagt 4020 taaacagaac aagcttctga aaatgatgct tgatttaagc ccttggtatt gtacagaaga 4080 ggtgcatgaa atagctgaaa ttgaaacaat tttcagattc gtagaaaaag ctgcagaaag 4140 attcagatta tcttgcgaaa tgtctaataa tccactaatt gttgccattt ttccttaagc 4200 atagctgtga tactattaga tgtaagaata gaataaaatt gaattcgagg gttttttttt 4260 taatttttcc ctgttagttg aattttggaa ataataatcg acctcctgaa atagtttttg 4320 acattactcg cagctgtaag tgacacaaat ctctgccaaa aaaaaaaacg aatttgtaac 4380 aacattaggt agaatcgaat gaataaataa ataaataaat aa 4422 // ID TE-X-4_NVi repbase; DNA; INV; 1080 BP. XX AC . XX DT 13-MAY-2009 (Rel. 14.06, Created) DT 13-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE nonautonomous transposable element from Nasonia vitripennis. XX KW Transposable Element; Nonautonomous; TE-X-4_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1080 RA Bao W. and Jurka J.; RT "Transposable elements from Nasonia vitripennis."; RL Repbase Reports 9(6), 1164-1164 (2009). XX DR [1] (Consensus) XX CC The feature of this element is like that of TE-X-3_NVi. XX SQ Sequence 1080 BP; 303 A; 290 C; 160 G; 327 T; 0 other; tgaagtgaat acttctatag gctcgctagg tacacatttt taccggggag tgaatcgaga 60 aaaaaccgac cgtcacgcag ttaccgaacg gcaggagggg aagcaggggc aactctctct 120 ctctctctca ctctcccctc caccgcgctc tcttcgccgc gcaaggcgta cgcactgcgc 180 gcatctctcc cttcaccgcg ctctccttct ctctctctct ctctctctct ctctctctct 240 ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctcccctc 300 caccgcgctc tccttaccgc atctttccgt tggctttcgg cgctctcggc gttctctaca 360 cactatactt ctctactaca cgctatactc tggacaatgc tatcccgcgt ccccacgcgc 420 tcggctttct ctacctagca taaactgaaa taatgctatc ccgcgtctaa gcgtcagttg 480 tacgtaaaca gaaatagctg aaatatgttc tgctcactgc gttttagaag acggtcaaat 540 atgtatcatg gctgcgattt acatttcacc caatcaaagc attacagaca taattcaatt 600 tatccataga tcactacttg tatacactca agtaggatca cgggagcttg gtacaaatga 660 ccatgaaatg ccactgattt tagcaggaga ttttaatgtt aactttgctg acgaaagtgc 720 acaaccatta attacattcc tcaaagataa atttcaatta gacatgaata atgatgcgca 780 acaatcaact actaagtatg gcacaacatt agatgcagta ttttcaagat ttttatataa 840 aatcgattct aaaacattta tatcatactt tagttaccac aaaccaattg tttcttttat 900 tgaattaaaa aatgaagata cctgtgattc taccatagat aaataaagtt tgtaagaaat 960 aaaataatgt atgaataacg ataaacatat ttttattatt tcacctatat aaaaaaaacg 1020 aagtctacaa cacacctaat tcagtattca cttctatcgg ctgtccccga gacagccaca 1080 // ID ERE2_EH repbase; DNA; INV; 1936 BP. XX AC . XX DT 25-JUN-2008 (Rel. 13.1, Created) DT 25-JUN-2008 (Rel. 13.1, Last updated, Version 1) XX DE Repetitive element ERE2 from Entamoeba histolytica - consensus DE sequence. XX KW Nonautonomous; Entamoeba; Transposable element; Eh_ERE2; ERE2_EH. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-1936 RA Lorenzi H.A. and Caler E.; RT "GenBank accession number EU099444."; RL Direct Submission to Genbank (09-AUG-2007). XX RN [2] RP 1-1936 RA Lorenzi H., Thiagarajan M., Haas B., Wortman J., Hall N. RA and Caler E.; RT "Genome wide survey and discovery of repetitive elements in three RT Entamoeba species."; RL Repbase Reports 8(10), 1199-1199 (2008). XX DR [2] (Consensus) XX CC Positions [1..22 ; 1915..1936] Imperfect inverted repeats. XX FH Key Location/Qualifiers FT CDS 227..745 FT /product="ERE2_EH_1p" FT /note="Some elements have Met16 as initial FT Methionine." FT /translation="MNHIYYFYSFYSFIYMLFIYRIFIYLFLCLYYLYHCI FT LLIIYILLYYSYFMFFMFILFRFSYCYSFIFRCICLFVYSSLSFLYYYHLY FT LFSFYIYSLLIYININFIFYTLILLLYHLLSFLVLFIIYSYYSFVLYSISY FT IHIIFIYILSLIICYNNITHMIDYIINIMMKLYI" XX SQ Sequence 1936 BP; 654 A; 166 C; 119 G; 997 T; 0 other; gggtacgaaa attagcaaag aaaaaatgca tctaaataat ttacaaaaaa gaaaaaagat 60 gcaaaataat ttgcatccat agaattttcc atttattata ttttgctcct aaaataaaaa 120 taaaaaaaaa aagtatgata aatacagaat atgtaaatat ttatgtttat attttataat 180 catttgtatt cattttgtat gaaatattaa tgaaatagat gatttgatga atcatattta 240 ttatttttat tctttttatt cttttattta tatgttattt atctatcgta tttttatata 300 tttatttctc tgtttatatt atttatatca ttgtattctt cttattatct atatattatt 360 atattattct tattttatgt tttttatgtt tatattattt agattttctt attgttattc 420 ttttatattt agatgtattt gtttatttgt ttattcatct ttatcatttt tatattatta 480 tcatttatat ctattttcat tttatattta ttctcttttg atttatatta atattaattt 540 tatcttttat actttgattt tattattata tcatttatta tcatttctcg ttttatttat 600 catttattct tattattctt ttgttttata ttctatatca tatattcata tcattttcat 660 ttatatttta tcacttatta tatgttataa taacataaca catatgattg attatataat 720 aaacatcatg atgaagttat acatataaat aataaacata aaaaagagta aaatattttt 780 cattaataaa ataatattaa tcattttaca tcattaaatt agacactttt attgaatatt 840 tctaattttt acaataaaaa tacacttttt atttcttctt ttttcttctt ttattatatc 900 tttgtgattt attattttta tattcttctt ctactttatt tttattcttt tatatattta 960 tattcttttt atattatttg tttttcgttg ttgtttctat ttctttcttc tattatttat 1020 caattttata ttcatttatt ctttttattt cttttttatt aattcttgag tattgtatat 1080 ttctttatta tttcatttgt gtattatcaa aatatatgtt ttattgattt tatgttgaat 1140 aaaatattga ataaatcaat attttcattc ttaattattc attatttatt ctgaaatgtt 1200 gtgtatattt attatattaa atgaatatga tgaatgaata tcaatattat tgtttatttg 1260 attttaaaaa ataataaatg aatatgaaat aatgaaataa aataatgaag acaatgaaat 1320 gataatgaaa tagaataatc taatattgaa caaatcaaat aaagaatata ttgaatatat 1380 cattattgtt acattattat ttgttatttc aattattaaa tattaataaa ttattattca 1440 tgaatattta attgattgat attttacaat ttcattttca ttttatttaa taaatgaaat 1500 ataaataata aatgaatgaa taataattta atatatgaaa taataaaaag aaaagataaa 1560 taataaagaa atatgatgta tttaatatat aatataattt attatcaaat ataataatat 1620 caaatgatga tttattttta tttcaaaata ttttgcatca ataaaaatgt tgctaaatat 1680 tttgcatcat taaattttgc atcattaaat tttgcgcttt ttttatttgt ttcaataaaa 1740 tttgcatcct ttattttttt ctttcttttc ttcttttatc atctcttttg ttttattaca 1800 ctttatcttc tttcttttta tattctctcc tttctacttt tattctttct tcatcttatt 1860 ctacttttta ctttcttttt attcttttat atcattcatt ctcttctttt tcaattattt 1920 gctttttttc gttgcc 1936 // ID Academ-3_Aplcal repbase; DNA; INV; 4223 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-3_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4223 BP; 1224 A; 988 C; 955 G; 1056 T; 0 other; tagtggggcc gcttagcaaa cctattgtgc attttgtgat acaagcacca aacttggcac 60 aaatgtacat taacatattg aaatcatttt cagatattgg gccactcgga tttctctctt 120 aatggccgcc atattggatt tcaaaatggc cgccatttga aatctacatt tgcgattatc 180 tctgggtcaa atgctgctat tgacttcatt ttggtttcta aatgtatgtt tttggggtca 240 agaaatccaa tggtagcaat ctgcattgcg tatttttact aatgggtcgc catattggat 300 ttcaaaatgg ctgctttaca ttgcatggct gcttctcgat ctaccttcgt cgttatctta 360 tcgccgtcac atgtttggag aagccaagcc cttcacagtg tgccctggat gatgccaaga 420 agaatctgtt gcaaccgttg ctgcttgcct tagtcagcat gatcctagag agacccagca 480 tcaagggcca gatggcagac actaaccctg cagcaattac cgcagcctag atactgaagt 540 tcaactgcgt aaaacacaag cgaacacgtg gcaccacatc atcctcttgt gtcaggcaca 600 gtgttgcaca ggagacccca cttccaacat acattgggat gatgttgcat gctcatacac 660 acaagaagga actggtcgac agactgtcac accttggcct gagtacctct tatgatagag 720 tgctccagct ctcagcagag atgggcaacc atgtctgcca acaattttac agagaacaag 780 tggtctgacc tccaaaagtg cgtggcgaag agttcacaat tgccgctgta gataacatag 840 atcacaatcc tagtgcaaca acaccaaaag actccttcca aggtgccgct atctccctcg 900 tacaacatcc ttcttacact ggtgaaggag taaatcggag catattcgtt gcaggagaat 960 ctggggatgc acggtccaag actgtcgccc ctttgccgca ctattatact gatgtacccc 1020 ctgtcactag cagcatcaag acgtcgcctg tcccagctgc tggagttgcg tcattgacca 1080 gaggagactt caaacagtag actgatggag aataccattg gttaggcaat gcaaaacacg 1140 ccctgggaga caacactggt acagtggaca ataaaaacac atcttgggct gcatttcatg 1200 ccagccgaca gccaccagag ggtagagtca tctgccccac gttgctgctt ccactcttcc 1260 tggaaagtgc ccacactgtg gcaatgatca ggcattcaat ggacgttgtg aagaaagctg 1320 tagaacatct gaatcctgga cagacaccag tggtcacctt ggatcatcca ctgtttgctt 1380 tggccaagca gatacagtgg aagtggccag agagttatgg tgaagaccat atagtggtga 1440 tgtttggtgg tctccacata gagatggcag tactgaaaac actgggggac tggctgcaag 1500 ggagtggttg ggtgacaagc acttgtgcaa gccgagattg cgacagcaag gacagcagac 1560 tcattcctgc gagcatctca tgtcttgcgc acaagaagag cacatcaagt ggctacagca 1620 gcactgtaca tcctgcaaca ccatgcctac taccactact gtctgggaga aaccaaggat 1680 gcagaggaca ttcctgcgtt tgaagactgg tgacgccaga gacgagacaa catctcccag 1740 tttcgctact gggcagctgt gatggaacaa gaactgttgg ttctagtgta cgtgcgttct 1800 ctctggcagg gatcattgat gatgcacctc gatgctctga cagagctggt cacctggttc 1860 cacgcactag atcataccca ctatgctagg tggataccag tgcacctgaa gaacatcgct 1920 gaactcacta ccaaacaccc agacgtagca aggaaattca gtgaaggcca tttcacagtt 1980 gagaaaacac agagggtgtt ctctttaatc ccgattgacc aggcacatga gcagaataat 2040 gcctgcatca aaggaaacgg tggcgcagtt gggctaactg acaatcctag tgcccttcgt 2100 cgctggatgg tttcaagacc agaaattgca agcagcagac acacgtcatc atgatcagac 2160 gccaagtgta catgcctcat ttgtgaaaga tgttcgctgt ctcgtcggta tcataaagga 2220 gatgggcaac ccattcgagg aggagagtca ggatctaagc atactggaca caaaggacat 2280 cgcaggtcat tctgccgtgg agaccgtgat gaaagccaag aggattggac aagagcagcc 2340 cgaggctttc accagagagt gtctgttgga cagaacaaag gcagtggacg accccattcc 2400 tcgtgacaag ctgaaggtat tcagcacttc cactccaaga aacaagagta aatgtcagga 2460 acagttcgcc tctatcaaaa atgaccgtga actcgttgca cgcctgtaca ttggctggca 2520 gatgagggat ggaaaccttg aggagttctt tcgtcacaag aaccaggcat gtcctcctgc 2580 actgtctgct ggtggaaacc tcttcacttg tactaagaat gatctcatca caggctttga 2640 agaaatcttc gacgccaaga cagagactcc tgtcactagc tgtatagttc ttgatggagc 2700 agccatcgcc cagattggct gcattaaatt aaagacattt ggagagtatg ctccaaagat 2760 cttcatccca tatggcaaga agactgcgtg ggcagtctgg acggtactgc cagaactcac 2820 taaggcgctg ctgctggtgt cctctgcacc acgtgacata ccaaacgatg caatatccat 2880 catcaagagg ttcgtgatcc tgttgtatga tcgaaccagc aaatgcacgg acaatgacac 2940 ggccaggagg aaactcttcg caaggaagaa cgatatgctg ctgatccctc caacgtaggc 3000 agctttagag gaacatgtca agaggggagt gtattgaggt ggaaatgtgt ggggtcagat 3060 acttttgcca gcaccagagc tccctccaca aaacaactat ggttggtcaa tgactggagg 3120 acagtacaca ccatactaga ccaggctacc cgaggcagct cacagctgca atgagctggt 3180 ttcttgcaag tgccatttat tgagtgtgag gctctgcaaa tgcaagaaga ctgccctctg 3240 agtgtgtgaa agggactgca cagaacgatg agtccacaac atacatgtac ttgtggcagt 3300 gctgaacttg ttaacatgac actgtttatg tacatgcatc ttcagtaccg atatctacct 3360 agcctcagat accagaccat tgaagaatgt cagtatacac ctagcctcag atatttgagc 3420 atgtagcatg tcagaattgt acccccgaca tttacagact taagactttt acacccattt 3480 aacatgattt caaaattgag tcgatagtgc cgtaaaacac acactcaatc aatcaatcaa 3540 catgctttca tataatcatt cttttcaaag tatctagttc aatctattat tttcctagat 3600 cattttataa aacagagtgt gaaagataac attactttga aattggtgcc attggattgt 3660 gcattgaatt gtgcattgga taaatttgtg cattagaatt ttagatttgt gtctttgatt 3720 tagatttgtg cattagaatt gtgcattaga ttgtgcattg aattgtgaaa ttggtatcat 3780 agaattcctt gccccccccc accccccccc caaaaaaaaa accacataaa tttagacacc 3840 aaaatgaagt caatagcagc atttgcccca gagatactcg taaatgtaga tttcaaatgg 3900 cggccatttt gaaatccaat atggcggccc attagtaaaa atacgcaatg cagattgtta 3960 ccattggatt ccttgccccc caaaacatac atttagaaac caaaatgaag tcaatagcag 4020 catttgaccc agagataatc gcaaatgtag atttcaaatg gcggccattt tgaaatccaa 4080 tatggcgacc ataaagagag aaatccgagt ggcccaatat ctaaaaatgt ttgctatata 4140 ttaatgtaca tctgtgccaa atttggtgct tgtatcacaa aatgcacaat ccttttaaat 4200 ttttgggcta aaccacccca cta 4223 // ID Gypsy-588_AA-LTR repbase; DNA; INV; 668 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-588_AA_; KW Ty3_gypsy_Ele61; Gypsy-588_AA-I; Gypsy-588_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-668 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 668 BP; 215 A; 160 C; 95 G; 198 T; 0 other; tgtagagtat ttagttacat catatcataa atcacctgac ttcttcacca catgttctgt 60 gattgtaact ttcccacagt cacagaaccc gacttgacaa aaacgtacaa ctttcatata 120 atcagctaat tgttgcttgg ttacaaaacg acaatcagca ccctaaaata atttatagct 180 tatactcata ctcaaataac atagaacgac gacccacgtc gttgtcatgt attctcaaat 240 gtcacatagc aatgacccat gtcattggca cttcagaaat cacttcaaat gcaaaaccca 300 catgaaccgt aactcgactc cttcagcata gcactccgtt cccaggcaat aatacttgca 360 tacacataaa tgacttagac tgctgccaag tgcatcacgc aacaagcgtt gatgccttat 420 tcgcttttac ctctctctct ctctctctcc ttcttcatta ttctatcctt gttgcaaacc 480 tcttattgat ccctgattgg tttgcctaaa tgaaaagtaa ctccaagcac ctcccactag 540 gataagcatt gtgtagagaa caaatctatg tatgtagaat taagtgattt gtaaacttga 600 agattgaaat aaagacagat tttattcaac agtagactcg agcagttgat atccggaaag 660 atatcaca 668 // ID BEL-6_DWil-I repbase; DNA; INV; 6110 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_DWil_; KW BEL-6_DWil-LTR; BEL-6_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6110 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 129854 135963. XX CC 'AATAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 4963..6108 FT /product="BEL-6_DWil-I_1p" FT /translation="MLLIKDAHKETLHGGINLMRSQIQRKFWIFGLRNALK FT KYLRECVTCARYRQVTAEQIMGNLPKNRVTVSSPFHSTGIDFAGPYHIRCS FT KNRGQKTFKGYIAVFVCLSTKAIHLEVVSDLTTDAFLAAFRRFVARRGKCA FT NIYSDNGTNFVGAARRLDEEFKKAVQDNVQIAPILEREQISWHFSLPAGPH FT FGGIWEAGVKSVKYHLKRVIGDSNLTYEELATLLCQIEAVLNSRPLYTTGE FT DMDNNEVLTPGHFLIGRPTLEAAESISEEKILVAHHSVQSESYIAIKTSHV FT GDISNIANIAWNPVEANIVNHSAELERIGSQLKSLKEHHIELLGLNFHHVT FT GHASLMLVILLMIILIIVCLRQLNKNKRIPIAPFGGPSQ" FT CDS join(828..3323,3327..4520) FT /product="BEL-6_DWil-I_2p" FT /translation="MFPTKTEKLLAKQHELHKRLKKLEELDASKFTSKIIN FT EVNEIQKLWISYHSQMLESGVTQHYYFTDGKYEEGLDMAAKLLAKGESSRQ FT STVGSETIEADATMSENVEVSNGESRGNATSSIENQEKFEEKVRHLQQYLE FT MVSKDLRTIDNRTGEMYIKEIKFLFRKVQVCQAYTHELYRDNETFKEVIEE FT LEFDLQNTLVALEEKINDQKSLKVENVTVRGELPVLPKIQVPTFSGEPREW FT DLFVELFTELIEKREDLSLALKFNYLKSALKGKARSMVAHLLSGSADNYSA FT AWELLNKRYENNRRVFSEQLSRIVDLPMINNESAKSVKGVIDIINESIHLI FT KIKANLSDEVDVIFAQLLLRKFDREALQLYESQIRKTREIQKLSDVMEYLE FT QRYCSLCAMRKDEKKPFQKTSVAVFKDSCPICKGPGQSTPLCYKFKALTPK FT ERVEIAKKVQLCTRCLKHSSNKRCTNEMVCSLCSKGHHTLLHFGEKQAKVN FT TCVTAEQKLLATALIQVKKINGECETLRALIDNGSQSTILSEEAAQMLKLP FT KIKKQTEISGVSSTKPRTSKYRVNIEMKTRDSEIELIKVEAIVLPQLMKAL FT PLKIANVDMSKWQTYVLADPTFNKPGRVDMIIGIDVYTHILKSGVTKINGL FT LGQNTQFGWIVSGCTKSKGDESIVATTIDITDMERFWEMEADEKDDTEASE FT CEENFNKTTMIDASGRYVVTIPLKQEKELGDSKAQATARFLNLEKKFQKNP FT KLKEEYEKFINEYLELGHMVETKVEGAYYLPHQAVIKEASLTTKLRVVFDA FT SAKTTNNRSLNDVMWIGPRLQKDIFDIIVKRKWQYVVSADIEKMYRQINID FT LNDQKYLHILWRISPQKKLRIYKLTTVTYGTASAPYLATRVLAEIANRCQD FT KVISEIIRDEFYMDDLMTGGNTVQESKRKVQQVSRELEKAGMNLRKWISNK FT SEIIEEVIDTGDNKVLKIDENEAIKTLGLQWEPIKDEVKFKVHCDENRNVN FT KRGVLSTLAKIYDPLGWLAPVTVIGKLFMQKLWLAEQNWDQELSDNETTEW FT KSFRENLVYLEEIRLPRWIKTQPNMLLQIHGFADASEKAYAAVVYAKIGEA FT VTLIASKNKVNPIKNRKTIPKLELCAAHLLSQLLQRVQQSTAAPSEKYAWS FT DSTITLCWIKNGASKEKFVRRRIEDILKIKEITWNHVRSEDNPADVASRGI FT YPNKLKDYELWWNGPKWLAR" XX SQ Sequence 6110 BP; 2259 A; 996 C; 1326 G; 1529 T; 0 other; ttttggtcct tcgagccgga tagatcattt tggtccttcg agaaaaagga actctgtagt 60 tttcccccac cagtggtaag agactcagag aatgaatcag ctccttaaag tctgaattct 120 tatgataaga tcagctgaag gtagcaaaat aaaaaactct tgttttcccc caccagtggt 180 aagagacaga gtataatata aaagcaaaat attgcttaat aaataaaaag tgcgtgcgac 240 cgagtatata agggcagttt ggctagtcga gcaactgcaa taatttcggc ggcggtaaaa 300 aaaaaacgaa cgatataaag ttaatgtgtg gctaattcaa agcccagtga aaaaaaaagc 360 aatataaaaa tataaagttg gtgtgtggcg tatttgaagc cctaatatca aaaaaaaaaa 420 agttaactag tgtatatata gactatatgt agttgacaac taccaacagc agcagagcaa 480 caagatagtt gcaaaggaga ggccgctgga gaacttaaat tgccaataga agaaagtctg 540 gtgcgatgac cgacaccggt accgcctgag cccataggcc ctggacaaca gcagctgatc 600 gagatataaa acaaaaaggc gagcaaacta tcgctacgca gcggcagaca agcagcggca 660 gataacttta cttgccggac aacagtggcg aacggtgtct acaacaggat catcagataa 720 gatataaatt aaaatagttt atattatatt atattattaa tattatatta ttatacatat 780 atacttatac tcttatacca atatcctaat atttgaataa gacaaaaatg ttccccacaa 840 aaactgaaaa actattagcg aagcaacacg agttgcataa aaggttaaaa aagctagaag 900 agttagatgc tagtaagttt actagcaaga ttattaacga agtaaatgag atacagaagc 960 tatggattag ttaccatagc caaatgctag agagtggtgt aactcaacac tactatttca 1020 cagatggaaa gtatgaggaa ggtttagata tggcagccaa attactagcc aaaggagagt 1080 ctagcaggca gagcactgtt ggttcggaaa ccattgaagc agatgcaaca atgtcagaaa 1140 atgttgaggt cagcaacgga gagtcgcggg gaaacgcgac ctccagtatt gagaaccagg 1200 aaaaatttga agaaaaggtt cgtcatctcc agcaatattt ggagatggtg agcaaagatt 1260 tacgtacgat agacaataga acaggcgaaa tgtatattaa ggaaataaaa ttcttgttca 1320 ggaaggtaca ggtgtgccaa gcttatacac atgaactata cagagataat gaaacgttta 1380 aagaagtcat agaagaattg gagttcgacc ttcagaatac cttggtagca ttagaagaga 1440 aaataaacga tcagaaaagt ttgaaagttg agaatgttac cgtgagaggc gaactaccgg 1500 tgttgccaaa aatacaggta cccacatttt ctggagaacc tagagaatgg gatctgtttg 1560 tggaactatt cacggaatta attgaaaaaa gagaggattt aagtctggct cttaagttca 1620 attatttaaa atccgcgtta aagggcaaag caaggagcat ggtagctcat ctattgtcag 1680 gttcagctga caattatagt gccgcttggg aactgttaaa caagcggtac gagaataatc 1740 ggagggtatt ttccgagcaa ctaagtcgga tcgttgattt accgatgatc aacaatgaat 1800 ccgcaaaaag tgtaaaggga gtgattgaca taattaatga gtcaatacac ttaattaaaa 1860 ttaaagcaaa tttatccgat gaagttgacg taatattcgc tcaactgctg ctaaggaaat 1920 ttgacagaga ggctctgcaa ttatatgaga gccaaattag aaaaacaaga gaaattcaaa 1980 agctatctga tgtcatggaa tacttggagc aaagatattg ctcgctatgt gccatgagaa 2040 aggacgagaa aaagccgttc caaaaaacgt cggttgcggt gtttaaagat agttgtccga 2100 tctgcaaagg tcccgggcaa agcacacctc tgtgctataa atttaaagcg ctaacaccca 2160 aagaaagggt ggaaatagcc aaaaaggttc aattgtgtac acgatgcttg aagcattcaa 2220 gcaacaagag atgcactaat gaaatggtct gttcattgtg cagcaaggga caccataccc 2280 ttcttcattt tggggaaaaa caggcgaaag taaatacttg tgttacggca gaacaaaaac 2340 tgttggccac agccctaata caagttaaaa agataaatgg agaatgtgaa acattaagag 2400 ccttgattga taatggatct caaagcacta ttttatctga agaagccgct caaatgttaa 2460 agttacccaa gatcaagaaa caaacggaaa taagtggcgt atcttcgaca aaacctagaa 2520 cctcgaagta tagggtaaat attgaaatga agacgcgtga ttctgaaata gaacttatta 2580 aggttgaagc aattgtattg cctcaattaa tgaaagctct tccgttaaaa atagcaaacg 2640 tggatatgtc aaaatggcaa acttacgtgc tagctgaccc tacctttaat aaacctggta 2700 gggtcgatat gataataggg atagacgtat acactcacat cttaaaaagt ggagttacca 2760 agattaatgg cctgctagga caaaatactc aatttggctg gatcgtgtcg ggatgtacga 2820 agtcaaaagg agatgagtct atagtagcta caacaataga tattacagat atggagcggt 2880 tttgggaaat ggaggcagat gaaaaagatg atacagaagc tagcgaatgt gaagaaaatt 2940 tcaacaagac gactatgatt gatgcctcag gaagatatgt agtaacgatt ccgcttaagc 3000 aagaaaagga attaggtgac tcgaaggcgc aggcaactgc gcgatttctt aacttggaga 3060 aaaagtttca gaaaaatcca aaacttaagg aggagtacga aaaattcatt aacgaatatt 3120 tggagttggg ccatatggtc gaaactaagg ttgaaggtgc gtattatctg ccacatcaag 3180 cagtaataaa agaagcaagt ctgacaacta aattaagagt tgtatttgac gcttccgcga 3240 aaacaacgaa taacagaagt ctcaacgatg ttatgtggat cgggccgcga cttcaaaaag 3300 acatattcga cataatagta aagtagcgaa aatggcagta cgtagtatct gcagatattg 3360 aaaaaatgta tcggcagata aacattgatc taaacgatca aaaatatttg catattttgt 3420 ggcgaatttc accgcagaaa aaattaagaa tatataaatt gacaacagta acgtacggta 3480 cggcatcagc accttattta gctacaaggg tacttgcaga gatagcaaac agatgtcaag 3540 ataaagtaat tagtgaaatt atccgagatg aattctacat ggatgattta atgactggag 3600 gtaataccgt acaagaatca aaacgcaaag ttcaacaggt ttccagagag ctggaaaaag 3660 caggaatgaa tttgagaaaa tggatttcga ataagtccga aattattgaa gaggttattg 3720 atacaggcga taataaagta ttaaaaattg atgagaacga ggcaattaaa accttggggt 3780 tgcagtggga accaattaaa gacgaagtta agtttaaagt acactgtgac gagaacagaa 3840 acgtgaataa aagaggtgtg ctatccacac tagccaagat ttatgatcca ctaggttggc 3900 tagcacctgt aacggtaata ggcaaattat ttatgcagaa gttgtggcta gcagagcaaa 3960 attgggatca agagttgtcc gataacgaaa caacagaatg gaaatccttc agggaaaatt 4020 tggtctatct ggaagaaatc agactacccc gatggatcaa gacacagcct aatatgttgt 4080 tacagattca tggttttgct gacgcttcag aaaaggcata tgctgcggtg gtatatgcta 4140 aaataggtga ggcagtcacc ttaatagcca gcaagaacaa agttaatcca atcaaaaata 4200 ggaagactat tcctaaattg gaattatgtg cagcacactt gctgtcgcaa ttgctacaaa 4260 gagtccagca atctactgct gctccatctg aaaaatatgc ttggagtgat tcgactatta 4320 cactgtgctg gataaagaac ggtgctagta aagaaaaatt cgtaaggcgt cgcatagagg 4380 atatcctaaa aatcaaagaa ataacatgga accatgtaag atctgaagat aaccctgcag 4440 acgtagcttc aaggggtatt tatccaaata aattaaagga ttatgaattg tggtggaatg 4500 ggccaaagtg gctcgctcgg tgaggcaaaa aacaaatgtc ctaaacaaaa atcagatgat 4560 gacaaggtta tggtaaattc cgtactatta aatacagaga gtagtattat gcaggagcta 4620 attgaaaagt attcctgtat gaataagctg gccagaatta cggcatatgt attaagattt 4680 attagcatta aaaaaagaga taaggtgtac ccatcatatc taaccgtcaa tgagcttaag 4740 aatgctaaaa attttataat taaacagcaa caggcatatc aatttagcag agagatatca 4800 tgtcttacgc aaaacaagca aattgacatg aaaagcaaaa ttttgagttt aaatccattt 4860 ttggataacg atggaatatt gcgcgtgggg ggaagattat aaaatgccaa catatgtttt 4920 gatactaaac atcctgtgat tttagataaa tcacatttaa caatgctgtt aattaaagat 4980 gctcataaag aaaccttaca cggaggaatt aaccttatga ggagccaaat tcaaaggaaa 5040 ttttggatat tcggtttaag aaacgccctt aaaaaatacc ttagggaatg cgttacttgt 5100 gccagatacc gccaagtaac tgcggagcaa attatgggta acctacccaa aaacagggta 5160 acggtgtcat ccccatttca tagcacgggg attgattttg ccggtccata tcatataaga 5220 tgctcaaaaa accggggaca aaaaacattc aaagggtaca tagcagtatt tgtgtgcttg 5280 tcaactaagg caattcattt agaagttgtt agtgacttga caactgatgc cttcctagca 5340 gcatttcgaa gatttgttgc aagacggggt aagtgcgcta acatatactc cgataatggg 5400 acgaattttg tgggagcagc taggagactc gatgaagagt tcaaaaaagc agttcaagat 5460 aatgttcaaa ttgctcctat tctagaacga gaacagataa gctggcattt tagcctcccg 5520 gcaggacctc acttcggagg tatttgggaa gctggagtaa aatcagtcaa gtatcattta 5580 aaaagagtaa taggagatag taacctcaca tatgaggaac tggcaacttt attgtgccag 5640 atagaagccg tactaaactc aaggccgttg tatacgacgg gtgaagatat ggacaataat 5700 gaggtgttaa ctcctggtca tttcttgatt ggaagaccaa cattagaagc agctgagtcc 5760 atatcagaag aaaaaattct agtagctcat catagcgttc agtcagagtc gtacatagct 5820 attaaaactt cgcacgtagg cgatattagt aacatagcaa atatagcatg gaatccagta 5880 gaagcaaaca tagtaaatca ttcagcagaa ttagaaagaa taggaagtca attaaaatcg 5940 ttaaaggaac atcatattga attactaggc ttaaatttcc atcatgtaac tggtcatgct 6000 tccttaatgt tagtcatctt actaatgata atattaataa tagtttgttt aagacaactt 6060 aataaaaata aacgaatccc aatcgcccca tttggcggcc cctcgcagaa 6110 // ID BEL-4_NVi-LTR repbase; DNA; INV; 497 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-497 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1341-1341 (2009). XX DR [1] (Consensus) XX SQ Sequence 497 BP; 148 A; 77 C; 120 G; 152 T; 0 other; tgttaccgtt tcaatgttaa taaatattgt attcacttaa ttgtttatta tgtctattcg 60 tttatttaaa aagatcaagt gttctgaacg aatggtgata gttgatacca tttcgtaggc 120 tgcgtgcacg gcgttcgtga aggatcgtcc ggggccgctt tcctccttcg gtaaggcaac 180 acgattatcg tgcaaaaaat tctttgaaaa ttaaaggagg gtacggggtg aaaaagcgta 240 gaggcttttc acggacgtga gccattggaa cacttggaag ttcgaggtcg gataaagatt 300 gaaaatcggg ttgcttagca atacaagaaa aggtttgggt aaaaacaatt gagtgagcgc 360 gtgaaaatgt aacgagcaga ttttggtgaa ctttcgaaga gtgcagatgt gtagtcagat 420 aattttgtgc aataaaaaaa cttaaaacct tcttgatttg tgttttcctc tgtgcgcgtt 480 tctaaacacc agctaca 497 // ID L1-37_AAe repbase; DNA; INV; 4760 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-37_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4760 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1390-1390 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >98% CC identity. CC ORF2 is broken. XX FH Key Location/Qualifiers FT CDS 166..1197 FT /product="L1-37_AAe_1p" FT /translation="MATKNRRENTFRIDYGNVPKKPSSEEVHQFVGETLKL FT KRDEVQRIQYSRNLGVAFVKTTSLEVAQKVVENNDNKHELIVDGKSFKLRL FT AMEDGAVEVKLFNLSEDVTNSKIARFLMSYGELLSIREEVWDEKHLFAGLP FT TGVRVVRMIVRKNIPSYVTIDSETTLVSYYGQQQSCRHCGESVHNGVSCVQ FT NKKLLVQKLATSGTSYADAAKNPLPSRTQIGSKQQRTKLALPKSTPPKDSL FT PTNAAPSPTPSTSTTMPPPASVIHRATEVAEMTNLPPDQQQDAEAEPWVRV FT TRRSAKNSDGNETDTSNSSRYSDRRPIGKKNETRQKRNQSCHRPETLSRSL FT H" FT CDS join(1280..2128,2598..4313,4317..4634) FT /product="L1-37_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MQLDIVFMQEVENEQLTLPGFNVIANVDHSRRGTAIA FT LKDHIQFSHVERSLDGRLTAVRILDTTLCCIYAPSGTAYRAQRERFFNSTI FT AYYLRHRTEHLLLAGDFNCVLRQSDATGSNSSPSLQSTVQQLQLNDAWIKL FT RSRDAGHTYITANSSSRLDRIYVSGGLCEQLRRIETHVCSFSDHMAVTLRV FT CLPHLGRPQGRGFWSLRPHLLTEENITELQIRWQFWTRQRRNFASWMDWWL FT EFAKSKIVSFFRWKSKQIYDEYHHAHENLYSQLRRAYTEKNITTRKRFKTK FT YCNTSPNYIEKRMVRRSDLTIFSVTRLSRKTTQRTNERCMEEITTADVWAA FT IKSSASKKSPGPDGIPKEFYQRAFDVIHREINLIMNEALNAQFSHKFVEGV FT IVLVKKSGTGNTLASYRPISLLNFDYKLFSRILKARLDSVMKANHILSNAQ FT KCSNGHNNIFQATLALKDRIAQLAATKRVGKLVSFDLDHAFDRVRHSFLYS FT NMRSLGIHPQLVDLLSRISALSSSRLLVNGYLTTAFPIQRSVRQGDPISMH FT LFVLYLHPLLTSLETVCGTDLIVAYADDVSVVATSAQKVHAMRDLFQRFGR FT VSGAVLNLRKTTSVDVGFINGNPIIVDWLKTENHIKILGVLFANSIRRMIT FT LNWDRLVGKIAQQVWLNSLRTLTLHQKIFLVNTYITAKAWYLASILPTYSV FT HSAKITATIGTFLWRGQPARIPMIQLARLKEQGGLKLQLISMKAKSLLINR FT HLREIGSIPYYNSVISQANPNVAADLPCLKTILQARQSLPTQILQNMSSDQ FT LHRYYVDQTDRPRVELRYPAVNWRRVWFNIGNNNFTPAQRSHWYLLVNEKV FT EHQLWCTIRRADNDQCVHCNAAIETLKHKFNECPRVAAAWEHLQNVLMNIL FT NGRRIPFEQMLHPELHNTNRTQRLYAMKHIINYVAFVNKCDNRVDVDELEF FT YLNVEL" XX SQ Sequence 4760 BP; 1464 A; 1132 C; 986 G; 1176 T; 2 other; cagtttgtgc tcaacttccg agccgatcag acgcattccg cgtttcataa gtcgtctgcg 60 gaagccaacc gtcggtttat atcgattttc agtgattctg tagaagcacc gttagtgaag 120 taccgcgagt ttctttagtt ttcgtttcgc tggcatcgtg tcacgatggc cacgaaaaac 180 cgtcgcgaga acactttccg tattgattat ggaaacgtgc cgaagaagcc atcctccgaa 240 gaggttcacc aattcgtcgg agagacgctt aagctgaagc gagatgaggt gcaacgtatc 300 caatacagtc gcaaccttgg agtcgccttc gtcaaaacta cttcgctcga agtggcacaa 360 aaggtagtcg aaaacaacga taataaacac gagctcatag tagatggaaa gtctttcaag 420 ctacgtctag ctatggaaga cggagcggtt gaggtcaagc tcttcaacct ctctgaggac 480 gtcacgaaca gcaaaattgc tagatttctg atgtcgtatg gagagttact ctcaattcga 540 gaggaggttt gggatgagaa gcatcttttt gctggactcc cgaccggtgt ccgagtcgta 600 cggatgatcg tccgcaaaaa tatcccatcc tacgtgacaa tcgactctga aaccacgctc 660 gtgtcttatt atgggcaaca acaaagctgt cgtcattgcg gcgaatctgt acataatggt 720 gtttcatgtg tacagaacaa aaaattgttg gtgcaaaagt tggcgacaag cggcacgtca 780 tacgctgatg ccgccaaaaa cccgctcccc tctcgaactc aaatcggttc gaaacaacag 840 cgcacgaagc tagccctccc gaaatctaca cctccaaaag attcgctccc tactaacgca 900 gcaccaagcc cgacaccctc gactagcacc accatgcctc ctcctgctag tgtaatccat 960 cgtgctaccg aagtcgccga aatgacaaat ctgccacccg atcaacaaca agatgcagaa 1020 gccgaaccgt gggtgagagt aacgcgccgt tcagcgaaga attctgatgg caacgagacc 1080 gacacttcca actccagtcg gtactctgat agacgcccga ttggtaaaaa aaatgagaca 1140 cgacaaaagc gaaaccaatc atgtcaccga cctgaaactc tgagccgctc tcttcactag 1200 ctacaatatt gctaccatca acctaaacac catcacgaac aacaacaaaa tcaacgcgct 1260 gagaacattc gtccaaacaa tgcaactgga cattgttttc atgcaagaag ttgagaacga 1320 gcaactcact ctccccggct tcaatgtaat tgccaacgtc gaccattcca gaagaggcac 1380 agcaatcgca ttaaaagacc atattcaatt ctctcatgtg gagcgaagtt tggatggccg 1440 tctaaccgca gtccgcatcc tcgatacaac cctttgttgt atttatgctc cgtctggcac 1500 agcgtaccgt gcgcagcgag aacgattttt caatagcact attgcatact atcttcgtca 1560 ccgaactgag catttgttac tagccggaga cttcaactgc gtactgcgac agagcgacgc 1620 aacagggtct aattcgagcc cctctctgca atctactgtc cagcagctac agctcaacga 1680 tgcttggata aaattacgct ctcgagacgc cggacacacc tatattacag cgaattcatc 1740 atcacgtcta gatcgcatat acgtaagcgg agggctatgc gaacagctac gtaggataga 1800 aactcacgtc tgctcattca gcgatcacat ggctgtgacc cttcgagtat gcctgcctca 1860 tctcggtcga cctcaggggc gaggattttg gtcgctccgt ccccaccttc tcaccgaaga 1920 aaatatcaca gagcttcaaa ttcgttggca gttctggacg cgccaaagaa gaaactttgc 1980 cagctggatg gattggtggt tggaatttgc caagtcgaaa attgtcagtt tcttccgctg 2040 gaaatctaaa cagatatacg atgagtatca tcatgcacac gaaaatctgt acagccagct 2100 gagacgtgcc tacacagaaa aaaatattta attttaaatg tggtgtaaac tgaagtgtag 2160 tgtaaaatta aatcaaattc gtgtattttt acagcatcat gtaaacttaa atgaatatac 2220 gctcaatttt caatcaaaac tggttaaatt ttacatgatc atgtaaattt aaagtggatt 2280 cgattgaaaa ataatggatt ggtcgttgaa atttaggttt attttgatgc tccaaatatg 2340 tgcatcaaaa taaacttaat tttacgacat ttttttagct gtgtacaacg ggtactatca 2400 aaatccaaga atgttgtcta cgataaaccg cccgaaaggg gagttactgg cactacaacg 2460 tcgtttcact catactttca cgcggataaa tgagccaata ctagcgggag agcctctctc 2520 gacatttcag ctgggcgata ggagaaagaa gcgtaccact attacacgac tcgatgatga 2580 acaaggtgga accataaacg actcgcaaac gattcaaaac gaaatactgc aatacttctc 2640 cgaactatat agagaaacga atggtgagac ggagcgactt gacgattttc agtgtgacca 2700 gattatcccg caaaacgacg caacgaacga acgaacgatg catggaagag attaccactg 2760 cggatgtttg ggccgcaatc aaatcaagtg cgtcaaagaa atcgcctgga ccggatggaa 2820 ttcccaaaga attttaccag cgagctttcg atgtcattca tcgagagata aatctgataa 2880 tgaacgaagc cctaaatgca caattttctc acaagtttgt tgaaggcgtt atagtacttg 2940 tgaaaaagtc aggaactggc aacactcttg cttcgtacag acccatttcg cttctaaact 3000 tcgactataa actgttttcc cgcatactga aagcgcgttt ggacagcgtg atgaaggcca 3060 accacatact aagcaatgct caaaagtgtt caaacggaca caacaatatc tttcaagcca 3120 ccctagcgct aaaggatagg attgcacagc tggcggcgac gaaacgggtg ggcaagcttg 3180 tgagtttcga tctagaccac gctttcgatc gggtgagaca ctcgtttctg tacagtaata 3240 tgcgttctct cggcatccat ccccaactcg tggatctgct atctagaatc tctgctctct 3300 catcatctcg tctgcttgta aacggatacc taacaactgc ctttcccatt caacgatcgg 3360 tgagacaggg agatccaatc tcgatgcatt tatttgtgtt gtatttgcat cctctcttga 3420 cctctctcga aacggtctgc ggcactgact tgatcgtcgc atatgccgat gatgtaagtg 3480 tggttgccac atcggcgcaa aaagttcatg ccatgcgtga cttatttcaa cgcttcggaa 3540 gagtttcagg tgccgtattg aaccttcgaa aaacaacttc tgtcgacgtc ggcttcatca 3600 acggcaaccc tatcattgtg gattggttga aaaccgaaaa ccacatcaaa atactaggcg 3660 ttttgtttgc aaactcaata cggcgcatga taacccttaa ctgggacaga ttagttggga 3720 aaatagcaca gcaagtttgg cttaattcgc ttcgcacact taccctacat cagaagattt 3780 ttcttgtcaa cacttacatc acggccaaag cttggtatct tgcatcgatt ctaccaacct 3840 actctgtaca ctcagctaag ataacggcaa ccatcggaac attcttgtgg agagggcaac 3900 ctgcacgcat accgatgata cagcttgcaa gactgaagga acagggtggg ctcaaactcc 3960 agctaatatc tatgaaagcg aaatctttgc taattaatcg tcaccttaga gagattggct 4020 caatcccata ttacaattcc gtmatttccc aagcaaatcc taatgtcgct gctgaccttc 4080 cttgcttaaa aactatcctg caggcgcgcc aaagcctgcc cacccaaatt ctacaaaaca 4140 tgtcctccga tcaattacat cgctattatg tcgatcaaac cgatcggcca agggtcgaat 4200 tgcggtatcc tgcagtaaac tggcgacgcg tgtggttcaa tataggaaat aacaacttta 4260 ctccagctca acgcagtcac tggtatctcc tagtgaacga gaaagtggaa cactaacagc 4320 tgtggtgtac tattcgtcga gcggacaacg accagtgcgt tcactgcaat gcagcgattg 4380 aaactctcaa gcataagttc aatgagtgtc ctcgagtggc tgcagcatgg gaacatctgc 4440 aaaacgtatt gatgaacatc ctgaatggcc ggagaattcc gtttgaacaa atgctacacc 4500 cagaattaca taacacgaac agaacacaaa gattatatgc aatgaaacac atmattaatt 4560 acgtcgcttt tgtcaataag tgtgataaca gagtagacgt agatgaatta gagttttact 4620 tgaatgttga attatgatta attgcaaaat attcaatgta attagttttt aaacaaatca 4680 actgaaataa acaatacttt tacaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4740 aaaaaaaaaa aaaaaaaaaa 4760 // ID HOPEBm2 repbase; DNA; INV; 4047 BP. XX AC . XX DT 30-JUN-2010 (Rel. 15.06, Created) DT 30-JUN-2010 (Rel. 15.06, Last updated, Version 2) XX DE Bombyx mori retrotransposon HOPEBm2 DNA, partial sequence. XX KW R1; Non-LTR Retrotransposon; Transposable Element; SART1; KW HOPEBm2. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4047 RA Kojima K.K. and Fujiwara H.; RT "Evolution of Target Specificity in R1 Clade Non-LTR RT Retrotransposons."; RL Mol Biol Evol 20(3), 351-361 (2003). XX RN [2] RP 1-4047 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (30-JUN-2010). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..384 FT /product="HOPEBm2_1p" FT /translation="RLLEPRAWRCLRCFSTGHCLARCVSAVDRSGLCFRCG FT QPGHKAAVCLAAPHCSLCAAAGRKADHRAGGKARPPASKNAERNRRRQVKR FT RQKKRLAGGAPAGTLSPAEAFAPDSGGNGVGEGAMDVVQS" FT CDS 339..3614 FT /product="HOPEBm2_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RGQWSRGGSDGCGSVLMDHTYRFLQGNMNHSAGAQDM FT LLQTMAEWSIDVAVVAEPYFVPPADDCWFGDVDGLAAIHIRRTAMVPPLAM FT VTRGPGIVAVQLENIVVIGVYFSPNRPTVEFERFLDGLEVIARHLAPRSVI FT LAGDFNAKSVAWGSPFTDARGRLLEEWAVAVGLCIVNRGSIATCVRWTGES FT IVDLTFASSPVARRILGWRVVVGAESLSDHRYIRYDLSAPAPAPAAGARGV FT DPPQSASRSFPRWALKLLNRELLVEASMVAAWAPKRPQPVDVETEAEWFRG FT AMHRICDASMPRVGSRHSGRRQSPWWSPEIARLRAVSIRASRRCTRHRRRR FT RLRRDEPVAFAEEEARLHREYRAAKDALRLAIRRAKEQNMERLLEALEADP FT WGSPYKMWADKLVRGKLRPWAPSMTERLQPQQLRDIVSALFPQEREGFVPP FT AMGGPSDTVDGQAPAEVPRITEAELRVAVVKMTAKDTAPGPDGVHGRVLAL FT ALGALGDRLLELYNGCLESGRFPSLWRTGRLVLLRKEGRPVDTAAGYRPIV FT LLDEAGKLLERILAARIVRHLVGVGPDLSAEQYGFREGRSTVDAILRVRSL FT SDEAVSRGGVALAVSLDIANAFNTLPWTVIGGALERHGVPLYLRRLVGSYL FT GARSVVCTGYGGTLHRFRSCRGVPQGSVLGPLLWNIGYDWVLRGALLPGLR FT VICYADDTLVVARGDDFRESARLATAGVALVVGRIRRLGLDVALNKSEALW FT FHGPRRAPPVDTHIVVGGVRIGVGVQLKYLGLVLDSRWAFRAHFAELVPRL FT MGTAGSLSRLLPNIGGPDQVVRRLYAGVVRSMALYGAPVWAEPRNRATMAR FT FLRRPQRTVAIRVIRGYRTVSFEAACVLAGTPPWELEAESLAADYRWRSEL FT RARGVARVPESELRARKAHSRRSVLESWSRRLANPTWGLRTVEAVYPVFDD FT WVNRGEGRLTFRLVQVLTGHGCFGKYLRRIGAEPTTRCHHCGHDLDTAEHT FT LAVCPAWEVQRRVLVAKIGPDLSLPGVVASMLGSDESWKAMLDFCECTISQ FT KEAAGRVRESSPHYAETRRRRAGGRDRGRIRDLAP" XX SQ Sequence 4047 BP; 620 A; 1166 C; 1456 G; 802 T; 3 other; cgcctkctmg aaccgcgcgc ctggcggtgc ctmaggtgct tcagcactgg gcactgcctc 60 gccaggtgcg tgagtgcggt tgaccgcagc ggactgtgct tccgctgtgg tcagcctgga 120 cacaaggcgg ccgtgtgctt ggctgcgccg cattgttcac tatgtgcggc ggccgggcgc 180 aaggctgacc accgggccgg gggcaaggcc cgccctccgg ccagcaagaa tgctgaacga 240 aatcggcgcc gccaagttaa gcggcgtcaa aagaagcggt tggccggggg agcaccggcc 300 ggcactttgt cccccgctga ggcgttcgcg cccgatagcg ggggcaatgg agtaggggag 360 ggagcgatgg atgtggttca gtcttaatgg atcacaccta tcgcttcctg cagggaaaca 420 tgaaccactc cgctggggct caggacatgt tgctccagac catggcggag tggtcaattg 480 acgtggctgt ggtcgccgag ccatacttcg ttcccccagc ggatgactgt tggtttgggg 540 acgttgatgg cttggcggcc atacatatca gaagaaccgc gatggtcccc cccctggcga 600 tggtgaccag gggccccggg atcgtcgcgg tacagttaga aaatatcgtc gtgatcgggg 660 tgtacttttc tccgaatcgg cctaccgtcg agttcgagcg gttcctggac gggctagagg 720 tgatcgctcg tcacctcgct ccccgttcag tgatactggc gggggacttc aatgcgaagt 780 ctgtcgcttg gggttcccca ttcacggacg ctcgtggtag gctgctggag gagtgggcgg 840 tcgcggtcgg tctctgtatc gttaataggg gctcgatcgc gacttgcgtg cggtggacgg 900 gcgagtctat cgtggacttg acgttcgcga gctcgcccgt cgcgcggcgc atcctcggct 960 ggagagtggt ggtgggggcg gaatcgttgt ccgatcaccg gtatatccgg tacgatcttt 1020 ctgcccctgc gccggcgccg gctgcgggtg cccgcggggt tgaccccccg cagagtgcat 1080 cccggtcatt ccctaggtgg gcactgaagc tcctgaacag ggagctcctg gtggaggcct 1140 ctatggtggc ggcgtgggcg cccaagcgcc cacaacctgt cgatgtggaa accgaggccg 1200 aatggttccg gggcgcgatg caccgcatat gtgatgcctc gatgccccgg gtcggctctc 1260 ggcactctgg acggcgacag tcgccctggt ggtcgcccga aattgcaaga ctgcgtgcgg 1320 tctccattcg ggcgagccgc cggtgcacta gacaccgccg tcgccgccgc ctgcggcgcg 1380 acgagcccgt cgcgttcgcg gaggaggaag ctcggctgca ccgcgaatat cgcgctgcga 1440 aggacgccct gcggctggcc atcaggcggg ccaaggagca gaacatggag agactcttgg 1500 aggcgctcga agcggatccg tgggggagcc catacaaaat gtgggcggac aaattggtac 1560 gcggcaagtt gcgcccgtgg gccccctcta tgactgagcg tctccagccc cagcagctgc 1620 gggacatcgt ttcggcgttg ttcccgcagg agcgggaggg ttttgtccct cccgctatgg 1680 gcgggccgtc tgacaccgtc gacggccaag ctcctgctga ggtgcctcgt attaccgagg 1740 cagagctccg ggtggccgtt gtcaagatga ctgcgaaaga cacggccccc ggcccggacg 1800 gagtccacgg ccgggtattg gccttggccc tcggtgccct gggggaccgg ctccttgagc 1860 tatataatgg ctgcttggag tcgggacggt ttccgtcgct ctggcggacg ggtagacttg 1920 tgttgttgag gaaggagggg cgcccggtgg atacagccgc cgggtatcgt cctatcgtgc 1980 tgctggacga ggcgggaaaa ttgctggaac gtattctggc tgcccgcatc gttcggcacc 2040 tggtcggggt ggggcctgac ctgtcggcgg agcagtacgg cttccgggag ggccgttcga 2100 ccgtggatgc aattcttcgc gtgcggtccc tctcggacga ggccgtttct cggggtgggg 2160 tggcgctggc ggtgtctctt gacatcgcca acgcatttaa cactctgccc tggaccgtga 2220 tagggggggc actggagagg catggagtgc ccctctacct ccgccggctg gttgggtcct 2280 atttgggggc caggtcggtc gtatgtaccg ggtacggtgg gacccttcat cgtttccggt 2340 cgtgccgtgg tgttccgcag gggtcggttc tcggccccct cttgtggaat atcgggtacg 2400 actgggtgct gaggggcgcc ctcctcccgg gcctccgcgt tatttgttac gcggacgaca 2460 cgttggtcgt ggcccggggg gatgatttta gggagtctgc ccgtctcgcc acagcgggag 2520 tggccctcgt cgtcggaagg ataaggaggc tgggtctcga cgtggcgctc aataaatccg 2580 aggctctgtg gtttcacggg ccgcggaggg cgccacccgt tgacacccac atcgtggttg 2640 gaggcgtccg gataggggtc ggggtgcagt tgaagtacct cggcctcgtg ttggacagcc 2700 ggtgggcctt tcgtgctcac tttgcggagc tggtcccccg attgatgggg acggccggtt 2760 ctttgagccg gctgctcccg aatattgggg gaccggatca ggttgtgcgc cgtctctacg 2820 cgggggtggt gcgatcgatg gccctgtacg gtgcacctgt gtgggctgag ccgcgcaacc 2880 gggccactat ggctcggttc ttgcgccggc cgcagcgcac cgttgccatc agggtcatcc 2940 gcggatatcg caccgtctcc ttcgaggcgg cgtgtgtttt ggcggggacg ccgccatggg 3000 agctggaggc ggagtcgctc gctgccgact atcggtggcg cagcgagctt cgtgctcggg 3060 gcgtggcgcg tgtccccgag agtgagctgc gcgcgcggaa ggcccattct cggcggtccg 3120 tgctcgagtc gtggtcgagg cgattggcca accccacgtg ggggctacgg accgtcgagg 3180 cggtttaccc ggtctttgat gactgggtga atcgtggcga gggacgtctc acctttcgtc 3240 tggtgcaggt gctgaccggg cacggatgct tcgggaagta cctgcgccgg ataggggctg 3300 agccgacgac gaggtgtcac cattgtggac acgacctgga cacggcggag catacgctcg 3360 ctgtctgccc cgcttgggag gtgcagcgcc gtgttctggt cgcaaagata ggacctgact 3420 tgtcgctgcc tggcgtcgtg gcgtcgatgc ttggcagcga tgagtcatgg aaggctatgc 3480 tcgacttctg cgagtgcacc atctcgcaga aggaggcggc ggggcgagtg agggaaagct 3540 ctcctcacta cgcagaaacc cgccgccgcc gagcaggggg tcgggaccgg ggtcgtatcc 3600 gtgacctggc cccctaagag ctaggggtcc cacccgtttt gtgcggggag ggacccagac 3660 gaggggtggc gtgctctgca cgctcacccg caataggaag ccgggtgatg gtagaccgcg 3720 ttcccccgac cgctctgggg gaaggtgtaa cgcggcgcca tcaaagcggg ctctcggccc 3780 gctgaacgag ggaaccggtg gtcgtttcgc tggcggccac cggtccggcg tccgtgggac 3840 ggtgggatgg atgtaatgtg ctctgcgtca accctgtccc gccgtttcaa tagccccgac 3900 tgggctccgg cccggtccga ggtagggcgc cggttgtgag cggcaggagt ttttagtgag 3960 gttcaactcc cacatacccc acctgccgcg cgggtgggga tccggcgatt ttctcctgta 4020 gaaaaaaaaa aaaaaaaaaa aaaaaaa 4047 // ID AVMAR1A repbase; DNA; INV; 1194 BP. XX AC DQ138264; XX DT 11-AUG-2005 (Rel. 10.09, Created) DT 06-OCT-2005 (Rel. 10.09, Last updated, Version 1) XX DE Mariner-type element from Bdelloidea. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; AVMAR1A. XX OS Adineta vaga OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Adinetida; Adinetidae; OC Adineta. XX RN [1] RP 1-1194 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR EMBL/GenBank/DDBJ; DQ138264; Positions 1 1194. XX FH Key Location/Qualifiers FT CDS 110..1144 FT /product="AVMAR1A_1p" FT /translation="MDADHELNMNLSRREILVLLLHEFRLGHKATEAANNI FT CSTMGEDVLSTRTAQHWFNRFKNGNLELNDLLRSGRPLEVDVDLLQQLIEQ FT DPRLTLRCLAEHLGCSYVAVEKHLKELGKTWKYGVWIPHELSSHQLQHRVD FT ACMDLMTSHRNYEWLRNLITGDEKWVLYVNYTHRRQWLSPGQTGVATPKPD FT LHPKKVMLSVWWGVNGIIHWEFLPNGCTITADLYCQQLDRVAQKLNGKQDR FT VYYLHDNARPRVAKLTCEKLLKLGWINVPHPPYSPDLAPTDYHLFRSLSHH FT LREKKFDDDNDMKMDISNFFGRKSKDFHERGILSLPERWRQVIDSGGAYIT FT ES" XX SQ Sequence 1194 BP; 355 A; 239 C; 266 G; 334 T; 0 other; gttttttttc aaaattttct aaaacattcg agaaagttct agaagtttcc atactaaggg 60 tatataaagg gaaagtgaat ctctttcgaa tgataataaa gctatcgata tggacgctga 120 tcatgagctg aatatgaatc tttctcgtag agagattctg gtacttttac ttcatgaatt 180 ccgtcttggc cacaaagcaa cagaagcagc taacaacata tgcagcacga tgggggagga 240 tgtgctctct actcgtacgg cacaacattg gttcaatcgc tttaagaacg gtaacttaga 300 actcaatgat ttacttcggt ccggtaggcc actggaagtc gatgtggatc tcttacagca 360 gcttattgaa caagatcctc gattgacttt acggtgcttg gcagagcacc ttgggtgctc 420 ttatgttgca gtggaaaaac atttgaaaga attaggcaag acatggaaat atggagtttg 480 gatacctcac gagttatcat cacatcagct acaacatcgg gttgatgctt gtatggattt 540 aatgacgtct catcgcaact acgaatggct tcgcaatctt attactggtg atgagaagtg 600 ggtgttatat gttaactaca cgcacagacg ccagtggctt agccctggtc agacaggcgt 660 agcaacacct aagcctgatc tccaccccaa gaaggtgatg ttaagtgtct ggtggggcgt 720 caatgggatt attcattggg aatttcttcc aaatggttgc accatcactg ctgatctcta 780 ctgtcaacaa ttggatcgag ttgcccaaaa actcaacgga aagcaggatc gagtttacta 840 tttacatgac aacgcccgac cacgtgttgc aaagttgacc tgcgaaaaat tattgaagct 900 tggatggatt aacgttcctc atccacctta ttctcctgac ttagcaccaa cggactacca 960 tttgtttcgt tctctttctc atcatctgcg tgagaaaaag ttcgacgacg ataacgacat 1020 gaaaatggac attagcaact tctttggtcg aaagtccaag gacttccatg aacgcgggat 1080 cctgtctcta ccagagcgtt ggcgacaagt catagatagt ggtggtgcat atataactga 1140 aagctagttg tactgttgaa gttaaaaaag aagaataaaa tttggaaaaa aaac 1194 // ID Copia-1_CQ-LTR repbase; DNA; INV; 218 BP. XX AC AAWU01000183; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CQ_; KW Copia-1_CQ-I; Copia-1_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 318-318 (2011). XX DR GenBank; AAWU01000183; Positions 18472 18689. XX SQ Sequence 218 BP; 48 A; 56 C; 64 G; 50 T; 0 other; tgtgaggtat tagattacca taataagata gacccttttg gaatgtctgt cgctagaacg 60 gtcggggctg aaccaaggct gaacccactg tgcgcaacgc taggtctagc tgtcatcgga 120 gtaaacaacc aagcagcgtg cggtccagtt ccgcagtcgc agcgtgcagg tgcatctccc 180 gtgttgggga ttctccctgg tggccagcgg gtctcaca 218 // ID DNA-11_AAe repbase; DNA; INV; 488 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-11_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-488 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1266-1266 (2011). XX DR [2] (Consensus) XX CC >98% identical to consensus. Present in >5000 copies in the CC genome. TA TSD. XX SQ Sequence 488 BP; 154 A; 107 C; 111 G; 115 T; 1 other; gggagactta cggcttcggc accattcgac tattatcggc aaccaaattt caaacgccaa 60 atacacagta tttttctatc aaaacacatc agtggaaaca ttcagatgat tctttgttct 120 gcccgcttcc cgctggcgag atttccgaga ctaaacaaac gatatcgccg gagatgtcat 180 caattcaaat gagcacgcct caaaatgcga ctgatttgga gctgccgaag agattcacca 240 gcagttgtta tgggaaagtt gccgaagaga ntgaggctgc cgacaatagt cacactgaca 300 agggatgttc gcagcgttgt aaaaacgttt ccaaaagcat tttgacaagt ctgtcggtgt 360 gaatcgatag aggattgagc aacgcataaa tgtaaataga caaacacgga atttgtctgc 420 tgatgtccga gaaaattaca aaagaagcac ggcatcggct taagctgccg agaatacgta 480 agttcccc 488 // ID Penelope-3_AAe repbase; DNA; INV; 3222 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Penelope-like element family from Aedes aegypti. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3222 RA Kojima K.K. and Jurka J.; RT "Penelope-like elements from the yellow fever mosquito."; RL Repbase Reports 11(4), 1437-1437 (2011). XX DR [2] (Consensus) XX CC >99% identical to consensus. Sequences 1-1111 and 2112-3222 are CC terminal inverted repeats. TSDs are 3 bp; usually TAA. XX FH Key Location/Qualifiers FT CDS 1161..259 FT /product="Penelope-3_AAe_2p" FT /note="reverse transcriptase and GIY-YIG FT endonuclease." FT /translation="VCRSLGTRRPYNRRGTQFLDLLVTREESSTYNFEIYR FT KPTNTQRVIPYTSNHSFQHKMAAFHHMIHRMQTLPLSEHGKTKELEYIYET FT ARINGYKERTIKAIIDKKERQRIRNALTTLTPITEPMKRVSIPYDVHISKQ FT LRPKLRNFGIDLVFSSRDNQLRTSLGSTKDPVNTLNKAGVYKISCSHCSKV FT YVGQTKRSLEVRFKEHLAEIGKAQKTIDKGMTYDFKSKVAEHIFSEGHTIT FT TADIKILRNVSSPWKLDVAESLEICKQVPTTLLNRDNGNGNTWLFNLVPTN FT RSRDVPFSI" FT CDS 1282..2964 FT /product="Penelope-3_AAe_1p" FT /note="reverse transcriptase and GIY-YIG FT endonuclease." FT /translation="RQRSRHHGQDGLRRTNGEKDQRRPLPTFESGSTTRTH FT QLTDKTLKDCKAIIGEARLKESNPILPRIKGLPKIHKPGKEMREIISADGS FT PTHKLAKWLVKEFQSMPNPFPTRSVKNTQEFSQKLLESGHIEDDEMMVSFD FT VAALFPSVPVKDSLNLLEDWLLPQRTDAAWKGKVRTYMRLARLCMDENYFQ FT FRGNFYKQTKGAPMGNPLSPFLCELFMANFEENLKKQGVLPDKWWRYVDDI FT FSVIKRNDLAKILDTINSVHKDIKFTHEEEKDGKLSFLDLLVTKEESSTYN FT FEIYRKPTNTQRVIPYTSNHSFQHKMAAFHHMIHRMQTLPLSEHGKTKELE FT YIYETARINGYKERTIKAIIDKKERQRIRNALTTLTPITEPMKRVSIPYDV FT HISKQLRPKLRNFGIDLVFSSRDNQLRTSLGSTKDPVNTLNKAGVYKISCS FT HCSKVYVGQTKRSLEVRFKEHLAEIGKAQKTIDKGMTYDFKSKVAEHIFSE FT GHTITTADIKILRNVSSPWKLDVAESLEICKQVPTTLLNRDNGNGNTWLFN FT LVPTNRSRDVPFSI" XX SQ Sequence 3222 BP; 970 A; 698 C; 704 G; 850 T; 0 other; gtagatgaaa aaaccgaatt tagtactata ccatttaatt ccactagagt ttgtatcctt 60 tgacagatac gcgtatttcg acctcaactg taaggccgtc ttcagtgtcg tgtactagac 120 tcgacttgaa gaaaaccatc gtatgcacac attatatatt aacactaggt ataaacgtat 180 ttccctaccc attattattt tttaacttaa aatttatgag ctttaggtac attccatccc 240 tcttttgttg aaactttaaa tgctgaaagg tacatcacgg gaacgattag taggtaccaa 300 gttgaatagc catgtattgc cgttgccgtt gtctctgttg agtagcgtcg taggtacctg 360 tttgcaaatt tccagactct ccgccacatc cagtttccaa ggagaagaaa catttctcaa 420 aattttgata tcggccgtcg taattgtatg tccttctgaa aagatatgtt ctgccacctt 480 agatttgaaa tcgtatgtca tccccttgtc tatcgtcttc tgagcttttc ctatttccgc 540 taagtgttcc ttgaatctaa cctcgagaga ccgctttgtt tgaccaacgt agaccttgct 600 gcagtgggaa cagctgattt tgtaaacacc agccttgttt agtgtgttta ccggatcctt 660 ggtagagcct aacgaagttc taagttggtt gtctctgctg gaaaatacca aatcgattcc 720 gaaattcctt agctttgggc ggagctgttt gctgatgtgt acgtcgtatg ggatggagac 780 tctcttcatg ggttcggtga taggggttag tgtcgtcaaa gcattccgaa ttcgctgtct 840 ttcctttttg tcgatgatag cttttatcgt cctttccttg tatccgttga tccttgccgt 900 ctcgtagata tattccagtt ccttggtctt tccgtgttcg ctgaggggta gagtctgcat 960 cctgtggatc atgtggtgaa acgctgccat cttatgctgg aacgaatgat tcgatgtgta 1020 agggatgact cgctgggtgt ttgtaggctt cctgtagatt tcgaagttgt aagttgaact 1080 ttcttccctg gtgacaagta gatccaaaaa ttgcgttcct ctccgattgt agggccttcg 1140 tgtaccgaga gaccggcaca ctcaacacta ggtatggacc ttccccacac catcaggaca 1200 acgaagcaag gcagacaagg acgagacgag gatcgtgaaa gaactcaagg ataaaccagt 1260 attctacatc aaggcggata aaggcaacgc agtcgtcatc atggacaaga cggattacga 1320 cgaacaaatg gcgaaaaaga tcaacgaagg cccctaccga catttgagag tggatccact 1380 acccggactc atcagctcac ggacaaaacc ctgaaggact gcaaggcaat tattggcgag 1440 gctcgtttga aagagtcaaa ccccattctt ccacggatta aaggactacc gaagattcac 1500 aaaccaggaa aggaaatgcg agaaatcatc tcggccgacg gatcccccac tcataaactg 1560 gcgaaatggt tagtcaagga attccagagt atgccgaatc cattccccac taggtcagtt 1620 aaaaacaccc aggagttctc ccagaaacta ttagagtcag gacacatcga ggatgacgag 1680 atgatggttt ctttcgacgt agcggctctt ttccccagcg ttccagtaaa ggattcccta 1740 aatcttctcg aggactggtt attaccccaa aggacggatg cagcatggaa aggaaaggtc 1800 aggacataca tgaggttggc aagattgtgc atggacgaaa actactttca atttcgagga 1860 aatttctaca aacagacgaa aggagccccc atgggaaacc ctctctcccc gtttttgtgc 1920 gaactattca tggcgaattt tgaggaaaat ttaaagaaac aaggagtatt accggataaa 1980 tggtggagat atgtcgacga cattttcagc gttatcaagc ggaacgatct ggctaaaatt 2040 ttggatacaa tcaacagcgt acacaaggat atcaagttca cccacgagga ggagaaggat 2100 ggaaaactat catttttgga tctacttgtc accaaggaag aaagttcaac ttacaatttc 2160 gaaatctaca ggaagcctac aaacacccag cgagttatcc cttacacatc gaatcattcg 2220 ttccagcata agatggcagc gtttcaccac atgatccaca ggatgcagac tctacctctc 2280 agcgaacacg gaaagaccaa ggaactggaa tatatctacg agacggcaag aatcaacgga 2340 tacaaggaaa ggacgataaa agctatcatc gacaaaaagg aaagacagcg aattcggaat 2400 gctttgacga cactaacccc tatcaccgaa cccatgaaga gagtctccat cccatacgac 2460 gtacacatca gcaaacagct ccgcccaaag ctaaggaatt ttggaatcga tttggtattt 2520 tccagcagag acaaccaact tagaacttcg ttaggctcta ccaaggatcc ggtaaacaca 2580 ctaaacaagg ctggtgttta caaaatcagc tgttcccact gcagcaaggt ctacgttggt 2640 caaacaaagc ggtctctcga ggttagattc aaggaacact tagcggaaat aggaaaagct 2700 cagaagacga tagacaaggg gatgacatac gatttcaaat ctaaggtggc agaacatatc 2760 ttttcagaag gacatacaat tacgacggcc gatatcaaaa ttttgagaaa tgtttcttct 2820 ccttggaaac tggatgtggc ggagagtctg gaaatttgca aacaggtacc tacgacgcta 2880 ctcaacagag acaacggcaa cggcaataca tggctattca acttggtacc tactaatcgt 2940 tcccgtgatg tacctttcag catttaaagt ttcaacaaaa gagggatgga atgtacctaa 3000 agctcataaa ttttaagtta aaaaataata atgggtaggg aaatacgttt atacctagtg 3060 ttaatatata atgtgtgcat acgatggttt tcttcaagtc gagtctagta cacgacactg 3120 aagacggcct tacagttgag gtcgaaatac gcgtatctgt caaaggatac aaactctagt 3180 ggaattaaat ggtatagtac taaattcggt tttttcatct ac 3222 // ID Gypsy-4_DPer-I repbase; DNA; INV; 3411 BP. XX AC super_55; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_DPer_; KW Gypsy-4_DPer-LTR; Gypsy-4_DPer-I. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3411 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_55; Positions 4969 8379. XX CC 'CAAC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 106..1680 FT /product="Gypsy-4_DPer-I_1p" FT /translation="MGSFVEERGLGPVCAGVRSSIGRHAGRDAPNLQGVDR FT EARRGRGRARSLGDEAATWAQMEDGTDPYERLTLPASAEQATLWRRPSETQ FT EELIASLTVPVKRRSPSRPRAESGERPTRPAQPEQGSARNPHGLEEGRAGE FT DRQQGLDEQTTRPPTSSSAHVAKQVREWSFRFDGTAKPLEFLEQIEWSANM FT YGLDWNLIPRAMPELLKGRALMWFVANNRQCQTWNAFARSFQAYFLPRGYF FT EKLLQVQRWGEPFKDYIVEMQTLMRPLKCTPEEQLQLIRDNSMPDLFIRPH FT RCRDLEHIMTLADEFEALERDRLEFQRECPASKYSVKSRMTADPVLACPNF FT SKTFVLQTDPSDYGLGAILKGGIESPKGRIARWALELQQYDFEVAYRKGQL FT NVVADALSRQPLAERWLRTIEEETESGHEAEAECEWIKKVKERMRTEPMKY FT PDYVEEASQIIPAHPAPSRRRGDSIMEVMRPKGSKRKSHKREQRLAGGGAR FT GRQKDDCKSSRPVLLAGDAPRCEDICEKV" XX SQ Sequence 3411 BP; 943 A; 791 C; 1102 G; 575 T; 0 other; actggcgccc aacgtggggc ctcacgacaa gtaaaagaca aagtgcgttt gggaagggcg 60 gtactgagtt tggtattgcg agggcgcaaa aatgaggagc cggcgatggg ttcattcgtt 120 gaggaaagag gacttggtcc agtgtgcgcg ggcgtgcggt cttcgattgg aaggcacgct 180 ggccgagatg cgccgaatct tcaaggcgtg gatagagaag cacggagagg acgcggacgc 240 gctcgaagcc tgggtgatga agccgcgacc tgggcccaga tggaggacgg gacggatcca 300 tatgagcgcc tgacgctgcc ggcgagcgcg gagcaggcaa cgctgtggag acggccgtcc 360 gaaacccaag aggagcttat cgcgagcttg acggtcccgg tgaaaaggcg gtcgcccagc 420 cgaccacggg cagagagtgg ggagaggcca accaggccag cacagccaga gcaagggagc 480 gcgagaaacc cacatgggtt ggaagagggg agagcagggg aggatcgtca gcagggactc 540 gatgaacaga ccacgagacc gcccacttcg agctctgccc acgtcgccaa gcaggtcaga 600 gagtggtcat tccggttcga cggaacggcg aaaccactcg agttcctgga acaaatcgag 660 tggtcagcca acatgtacgg tttggattgg aatttaattc cacgagcaat gcctgaactg 720 ctaaagggtc gggcgctcat gtggtttgta gccaacaaca ggcagtgtca gacgtggaac 780 gcatttgcgc gtagcttcca agcgtacttc ctgccgaggg ggtattttga aaaattactc 840 caggtgcaaa ggtggggaga gccattcaaa gactacatag tggagatgca gaccctcatg 900 cggccactaa aatgcacgcc cgaagagcag ttgcagctca tcagagacaa cagcatgccc 960 gatttgttca tccggccgca taggtgcaga gacctggagc acatcatgac actggccgac 1020 gaattcgagg ctctggagag ggaccggctg gagttccagc gggagtgtcc ggcgagtaag 1080 tactcagtaa agagtcggat gacagcagac ccggtgttgg catgcccgaa tttctcaaag 1140 acgttcgtgc tccagacgga ccccagcgac tacggactgg gggcgatcct gaaagggggc 1200 atcgaaagcc caaaggggag gatcgccagg tgggcgctgg agctacaaca atacgacttc 1260 gaagtggcat ataggaaagg gcagctgaac gtggtggcgg acgcactgtc ccgacagcca 1320 ttggcggagc gatggctgag aaccatcgag gaggaaacag agagtggaca cgaagctgag 1380 gcagagtgcg aatggattaa aaaggtcaaa gagaggatga ggacggaacc gatgaaatat 1440 cctgattacg tggaggaagc aagtcagata ataccggcac atcccgcacc gagccggaga 1500 agaggagata gcatcatgga agttatgcgt cccaaaggat ctaagagaaa gagtcataaa 1560 agagaacaac gactcgccgg aggcggggca cgcgggcggc agaaggacga ttgcaagagt 1620 agccgcccgg tactactggc cggggatgca ccgagatgtg aggacatatg tgagaaagtg 1680 tgaagtctgc atacgctaca aaccgagtca actgcagacg gccgggaaaa tgctgacgca 1740 agtgccagag gagccgtggg tgacggtgtg tgccgacttc gtaggccctc tgtccaggtc 1800 aaagcatggg aatgccatgt tattggtgat ggtggatcgg ttctcgaaat ggaccgaatc 1860 ggtccccttg agaaaggcca ctgcggaagc actaacaaaa gcgttccgag agcgcatcgt 1920 gtcccggttt ggcgtgccaa aggtggtgat cacggataat ggagtccagt tcaccagtag 1980 agtttttaag aggtttttgg agcagatggg cgtacggcat cagttcacgg ccccttacac 2040 gccgcaagag aacccgacgg agcgaactaa ccggacagtg aaaactatga tagctcagtt 2100 caccgaaggg aaccagagga actgggatga gaaatggtca gagataatgt tggccgtaag 2160 ttaaggggtg accgagtcca ccgggtactc gccggcgttt attgtctttt attcttttct 2220 tttagtattg tcttttattt atagagccaa ggctaccaaa cgctctatac gatgaagaag 2280 tgcttggcac agggcagggc acgcccacac cggacgagaa cgcggccaag ttgagagaat 2340 tatttcaatt tcaattattt cagtggcagt gtgctctccg gtgcagcgct gtggagcaga 2400 gggagatggc gcgtctgcgt aaggagacgg aggaggcgat ggcgaagcgg acggagagcg 2460 cggcagacgg tccgaggtcg gagcgaggtc cctggcaatg gccggaaccc ggcccgagct 2520 gcagcgtcag gcatcgtgcc cccagacggg gccggcggac aacccacggt ggcagcacat 2580 caaacgcgag gagtggccga ccgtcgtaac gcgctctctc gcgcggatga ggcaaacgtc 2640 gatgagggtg aagcgcctgg tcaacgacgg tggcgtccgg taccgtctat cggtgtccac 2700 cgatcggcag gaggtgttcc gggcaacaca ctgaatagaa gagaagagta gaaagaatta 2760 gaaacaaaaa cgaataaaaa aaaaaataaa gaagaagacc ttggtattta cctttcagca 2820 gaagaacaga ggggcatcgc tagaaataat cggtaaagta tgaatgaatg cggcggaagg 2880 cgtgaaagag agaggaaaac ttatccgaag catctagagg gccgccggaa ggcctgcgca 2940 ccagcgaagg gcagaaaggg aaaccgggca cgcacaagaa aaaaaaaaaa aaaaaccagg 3000 aacaacgaga agatagcact tttctcagtc cattgaggaa ttcgcgatgc ggaattagaa 3060 tacgcacgct gcgagatatg gagtatatca caaaagcgcc cctatgcttt atcgagtgag 3120 tcgtaaaggc aactctacgc catatcgagt aggtcagaat agcgacccta ccctatatcg 3180 agcggatcac aagagctcct atgcgaacta tcgacaaggt gaaaacgaat cgatcgctca 3240 cattggagag atacggcaac accgcactgg gcgaggccga ggaagggcgc cagggtgcac 3300 ccggttcggg aacaaagcgc gcgcgcgccg ttcgcgcggg ttttttttta aacgtcccaa 3360 acatgggacg cggaaaatcg gggttttctg ccaatgaaag aaagggggaa g 3411 // ID Copia-5_Cfl-I repbase; DNA; INV; 4047 BP. XX AC AEAB01013273; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_Cfl_; KW Copia-5_Cfl-LTR; Copia-5_Cfl-I. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-4047 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01013273; Positions 14120 10074. XX CC Positions [1434-1913] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 99..1913 FT /product="Copia-5_Cfl-I_2p" FT /translation="MAETPRVIVAKLNNDNYESWKYKVELLLIREGLWDVV FT HKEQPAQLDAAWLNRDAQARAMIGLLVEDNQLTHIRSKITAAETWKALRDY FT HQKASLSSKVILLKNLCGMKLAENGNIEEHIINMSVIKDKLEAIGENIKEE FT LFIAMLLGSLPDSYNSLINALESRPEKDLTISLVKGKLIDEYRRRSSNTAA FT MNEAQDKVLKVRDNRKYTRPEQDPMECFFCKKKGHMKRDCKKYKAWKEKTE FT KVNKATTDKDNICFNIDNKDFSKESWYIDSGATSHMTNSKEFFNVSFTTIE FT DRVTVANGKEVKVMGIGSGRISCSNGNINKIITLKNVLYIPGLHSNLLSVK FT KITENGFTIEFKETTCNILKDQEIIVTAESAGNLYKLNVKHEALSSTKMHS FT SKCIHSWHRKLGHRDPDAIRKLAREDMAYGITIQECGIKEVCEDCIQGKLS FT KKPFPKISNRQTKDILDIIHSDVCGPMQTTTPGGKRYVLTMIDDHSSYTEV FT FLLAHKSEVFRYVKEYIGAVKNKFNRKPKVLRSDRGKEYVNKQLTDYLKEE FT GIKIELTAPYSPQQNGKAERKNRYLMEMARCMIIDSGLAKKYWGEAVTTAN FT YLNKVRR" FT CDS 2426..4021 FT /product="Copia-5_Cfl-I_1p" FT /translation="MTMHEDDEPKTIEEALTGKDRINWRKALDEEYESLIA FT NKTWKLMKPPENVKIIGCKWVFKLKKDEEGRTVRYKARLVALGYSQEYGID FT HHEVFAPVVKQTTLRILLTIANRDKLQVLHYDIKNAFLNADLAETIYMQQP FT KGYTVKGKEQYALKLNKSLYGLKQSANLWNKCISDKLIKLGYTQGKTDTCL FT YIKHDKDIKNYVTIYVDDIIISSNSTEEIIKLERILSNSYNLTKLGNIKEY FT SGIQIQTDKEVYYLHQRKHICNILKTFGLKDAKGSKIPLSVGYENLEDHKI FT LQNNKNFQSLIGSLLYIATCTRPDIKAAVAILSRKLNNPTETDWIEAKRVA FT RYLKYTIDYKLKLGNDRLKEEGLIGYADANWAGSREDRKSNSGYLFKYCGS FT TISWACRRQSCVALSSAEAEYIALSDGCQEMLWLKKLLKDFQEDVKLPVKI FT YEDNQSCIKLARNQNFSKRLKHVDTKYHFVRDLNEGNIILLEYCASEEMLA FT DMLTKAIQHIRLRELSKEAGLETFHGNKDDVIEEEC" XX SQ Sequence 4047 BP; 1628 A; 661 C; 895 G; 863 T; 0 other; ggttatgggc ccagaatcgg gattaattaa gagaataaag gtagtataaa ggctgtgtaa 60 agtacaggtt aaggcgatac aagaagaaag cacgaaaaat ggcggaaaca cctcgtgtga 120 tagtcgcaaa gttgaacaac gacaattacg agtcatggaa gtataaggta gagttgttgc 180 tcataagaga aggcctatgg gatgttgtgc ataaagagca accagcacag ctggatgctg 240 cgtggctgaa tagagacgcg caagcaagag caatgatcgg actcctggtg gaagacaatc 300 agctcacaca catccgcagt aagataacgg cggcagaaac atggaaagca ttgcgagatt 360 atcaccagaa ggcttcatta tcaagtaagg ttatactact aaagaatcta tgcggcatga 420 aacttgcaga aaacggtaat attgaggaac atattataaa tatgtcagta attaaagata 480 aattagaagc aatcggtgag aatattaaag aagaattatt tattgcaatg ttacttggca 540 gtcttccaga ttcttataac agtctgataa atgcgcttga aagcaggcca gaaaaggatt 600 taactatcag tctagtgaag ggaaagctca ttgacgaata ccgcaggaga tcgagcaaca 660 cagctgcgat gaatgaagca caagataagg ttttaaaggt acgtgataac agaaaataca 720 caagaccaga acaggatccc atggaatgtt ttttctgtaa gaagaaaggc cacatgaaaa 780 gggactgcaa aaaatataaa gcttggaaag aaaaaaccga aaaggtgaat aaagcaacta 840 cggataagga caatatttgc ttcaatatag acaataaaga cttttcaaag gaatcgtggt 900 acatagattc gggtgccaca agtcacatga cgaatagcaa ggaattcttc aacgtaagtt 960 tcacaactat agaagataga gtcactgttg caaacggaaa ggaagtcaag gtaatgggta 1020 taggatcagg aagaatcagt tgttccaatg gaaatattaa caaaataatc acattaaaaa 1080 atgtcctata cataccaggc ttacacagta atttattgtc agtaaagaaa attacggaaa 1140 acggattcac aatagaattt aaagaaacaa catgcaacat actaaaggat caagaaataa 1200 tcgtcacagc agaaagcgca ggaaatttgt ataagctcaa tgtaaaacat gaagcgttat 1260 catcaacgaa aatgcattca agcaaatgta ttcactcatg gcatcgaaag ctaggacaca 1320 gggatccgga tgcaatcagg aaactagcca gagaagatat ggcatacggc attacgatac 1380 aagaatgtgg cattaaggaa gtatgtgagg actgtatcca aggaaaatta agtaaaaaac 1440 cgttcccaaa aatatcgaac agacaaacaa aggatattct ggatatcata cacagtgatg 1500 tttgcggccc catgcaaact actactcccg gtggcaaaag gtacgtatta accatgatcg 1560 atgaccatag cagctataca gaagtgttct tgctagcaca caaatcggag gttttcagat 1620 acgtcaaaga gtatatcgga gcagtcaaga ataagttcaa cagaaaacct aaggtattaa 1680 gatcagatcg cggcaaagaa tacgttaaca aacagctgac agattacctt aaggaggagg 1740 gtatcaagat agaacttacg gcaccttact cgccacagca aaatggcaag gcagagagaa 1800 aaaacagata cttaatggaa atggccaggt gtatgataat tgatagcgga ttagccaaaa 1860 aatactgggg tgaggcggta actacagcta attacctaaa taaggtgagg cggtaactac 1920 agctaattac ctacaaaata gattaccaac aaaatgtaat aatggtacgc catatgaaaa 1980 atggttttcg cagaaaccaa acttaaagaa tttacacgta tttggatgtg aggcatatgt 2040 gaagattcca gacgagttaa ggcgcaaatt agatagtaaa gcaaggaagc tacattttgt 2100 aggatacagt gatcagtcga aagcattcag gctactagat aaaaagacgg acaaaataac 2160 tatcagtaga gaagttatat ttctggatga aaaacagaac caggaggagc ctcgtaaaga 2220 aaagaagagt gaagaatcag aagcagaaat agaaatcagc gagaagaaag aagatacatc 2280 tgcagaagaa aagacacaaa ttgaaaaaga taagaagaag acgaagcaat caacagaaga 2340 actagatata acggcaacac cacgaagagc atctgaatgg acaaataaag gagtaccacc 2400 attaaggttt aatgaatttg cgggaatgac aatgcatgaa gacgacgaac caaaaacaat 2460 tgaagaagca ctgacaggaa aagacagaat taactggagg aaagctcttg atgaagaata 2520 cgagtcacta attgcgaaca agacatggaa attaatgaaa cctccggaaa acgtaaagat 2580 tataggatgc aagtgggtgt tcaagctcaa aaaggatgaa gaaggaagaa cagttcgtta 2640 taaagcgagg ttagtcgcac ttggctacag tcaggaatat ggtatagatc atcacgaagt 2700 ttttgcacct gtagttaagc aaacgacgct caggattcta ctaaccatcg caaacagaga 2760 caaattacaa gtactgcact atgacataaa aaatgcattt ctcaatgcgg atctagcaga 2820 aacaatttac atgcaacaac ctaaagggta tacagtcaaa ggaaaagagc aatacgcact 2880 aaaattgaac aaaagcctct atggattaaa acagtcagca aatctgtgga acaaatgtat 2940 ttcagataag ctaataaaac taggatatac acaaggtaaa actgatacat gcctttacat 3000 aaagcatgat aaggacatta aaaattatgt aacaatttac gtagacgaca taataatctc 3060 gtccaatagc acggaagaaa taatcaagct agaaaggata ttatcaaatt cgtataactt 3120 aacaaaactc ggaaatatta aagaatattc aggaatccag atacagacgg ataaagaagt 3180 ttactattta catcaaagaa agcacatatg taatatatta aaaacatttg gattaaagga 3240 tgccaaagga tcaaaaatac cactaagcgt aggatacgaa aatctagagg atcataagat 3300 tctgcaaaat aacaagaatt ttcagagttt aattggaagc ttgctgtaca tagctacatg 3360 tacgcgacca gacattaagg ctgcagttgc aatactgagc agaaaattga acaacccaac 3420 ggaaacggac tggatagaag caaagagagt cgcgagatac ctcaagtaca cgattgacta 3480 caaacttaaa ctcggaaatg acagactaaa ggaagaagga ctcatcggtt acgcggacgc 3540 caactgggca ggatcaagag aagacaggaa atcaaacagt ggatatttat tcaaatactg 3600 tgggagcacg atcagttggg cgtgcagaag gcaatcctgt gtggcattat catctgcaga 3660 agcagaatat atcgcgctga gtgacggatg tcaggagatg ttatggctga agaagttgct 3720 aaaggatttt caagaagacg tcaagctgcc agtgaagatc tatgaagaca atcaaagctg 3780 tattaagcta gccaggaatc aaaacttcag caagagatta aaacatgtgg acacgaagta 3840 tcattttgtg agagatctaa atgagggcaa cataatccta ctcgaatact gtgcgagtga 3900 agaaatgctt gcagatatgc ttacgaaagc aatacagcac ataagattac gcgaactgtc 3960 aaaggaagct ggattggaga catttcatgg caacaaggac gacgtcattg aggaggagtg 4020 ttaaaatcaa gaagacaaaa acggcaa 4047 // ID Crack-16_AAe repbase; DNA; INV; 5491 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-16_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5491 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1232-1232 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >97% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 1120..2172 FT /product="Crack-16_AAe_1p" FT /translation="MDNESIICVECNKIEKDASKIITCMYCFSEAHYKCRN FT IGANAARRFKDRMYFCSHNCSSIYQRITEMQNNKSSIIESLAVELKGAVAN FT AVSQEIQCVRSEVKQVTTAIEKSQDFLSNKFDAIVTDFQELKKENESLRRE FT VDKLKNVQQNLSQTVYKLEHQVDKTSRDANSKNAVILGVPFLPDENTQEIA FT QKMITCLGADVAADAISAAARINSKNKPKNSLVPIRVVFKDECVKETVFSQ FT KKECGKIMSTSIDPNFTINGNPTSVTIRDELTPLSMELLNEMRRHQEKLKI FT KYVWSSRGGNVLVKKNENSKPEIVKTRDDLIELVNRYTGNLSPKDTPSPKR FT KCSSNNFN" FT CDS 2224..5103 FT /product="Crack-16_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MANEINFYHESIDDFNLKYPYTNQNYIRIAQWNVRGI FT NNMQKFDEVLMFLDSIKIPIDVFCVGETWLKANNCPLFSIPNYDPIFSCRE FT TSSGGLAMYIRSGLSFNIVKTFSNEGMHLIHVEIKINGLLYDVVGVYRPPS FT YDFEKFHDELEFLLSTHGSRPLFVVGDVNIPMNLTNNNVVSRYKSLLDSYN FT FVCSNTHVTRPISNNVLDHFVSRKEDLPYVRNDTVYSDVSDHLIVVTSFKI FT NDPKERIVLTKKIVNKDLLNQRFKHFLDTFTCRQDVNDSIITLISAYNNIL FT QECTKIKSEKINAKAKHCPWLNYYIWQWIKLKNKYLKKVKNDPNNDNLKEM FT FRHVTKKTDEAKKRCKTEYYKSLLDHTCQSKMWKNLNELMGRKKSNTKIEL FT NNEGYKTSNSLEVCEIFNQYFSNIGIKLAKTIKSNNRNPLSTVTRVEQTIF FT LRETNVPEVTSILNDLNVRKSCGPDNFPSSIIKYNTNKFSHIFMELFNQIL FT IQGKYPDCLKVAKVTPIFKSGDKSDPCNYRPISTLSVFSKVFEKLLVSRIT FT DFLNKYNVLYKFQYGFRKDCSTSTAIVELVDFLIRKIDNKFTVGGLFIDLK FT KAFDTLNHDILLQKLECYGIRGLAYEIIRTYLCDRQQFVMLQDAQSSLQTV FT NIGVPQGSNIGPLLFLLYINDIGNLPLKGTPRLFADDTALFYPHFETSCII FT DFINHDMLLLVRYFDTNLLSLNLSKTKYILFHSPRKKIVSHNNPIVGTTAI FT EKVKNFKYLGLILDATLSWKEHIEHLRNKIASLCGLMHRVKTFVPKHTLLS FT FYYSCIHSHLLYLIIVWGHASKSNLQKIQVLQNRCLKIIFHLPQLFSTVQL FT YSNVSHNILPLRGLCKLQTCLFMYDILKKPNMHQNLSFTVSNHEHNTRYAN FT NLLRSRASTCLGQMRISFFGPSEYNALPESLKTLNNRSLFKTSLKRHYKSS FT ISDFL" XX SQ Sequence 5491 BP; 1794 A; 926 C; 984 G; 1786 T; 1 other; ctggcaacac tgttgtttac ctgatacgtg tctctaacga ttgacttttc aacttaagtt 60 taaagagtgt aaatcattta gaaaattatt ttgcacacct caacctgttt gtgagtaatg 120 ttcttatgat tgtcttgtaa tcagtgatat ccatattgcg ttaaaagtgg tttaaaagtt 180 tgttttttgt atgagatgat ttctccacta tagacatctt aattgtttgc tgctgaattg 240 ctgaattgat ccgttgactg ctgctgtttt tgttgctgta cagatgtttc ccgatactta 300 tgcgtgaata atcaatcgag ctgtgttggt gctgttataa ctctacttgt tgcacacaat 360 ttggttcttg tcaactctga ccgaagtgaa atacatgtct gagtaattat atgttggttt 420 aacagattaa gtgaaaatag agcgaacggg cgtcgtacac aatcaagccg ttggctcgcg 480 cttggtgtat gggtgctgaa gtagttctac tagtagcgca cactgtgttt ctgtttattc 540 taattatata accaaaagct gtacggtagc ctaggcagtg tagatagatt tagcgaacgg 600 gggtgaacgg gcgtcgtaca caatcaagcc gttggttcgt gcttggtgta tgggtgctga 660 agtagtcttc tctagtagca cattgcgtgg ttctgttgac atacataagg cgaaataact 720 tgcatgcaag tttgagtagt ttttggtcag cagatttagc tgcaataggg taaacagtaa 780 actagcaaac ctcgactcaa attaatgaac catttaatac caaagattag aatttaaaag 840 ttgaaacata agaagtaaat ggctgaacgg gkcactgaag cttatcagta ataattgttt 900 taagttgatt tcttacctaa gttatacttt ttttctattt ttattgcaat ctttcactgc 960 ttgttatttg tcaatgagtt atttgtctat gaggtccttg ttgtgtggtg agtaatattt 1020 attcattttt ctttgttttt ttttcttaca gttgtccttt atttatttac attttatttt 1080 atttgactcg gatttgtgtg ttattttaaa ctggtcggta tggataacga aagcattatt 1140 tgcgttgagt gcaataaaat tgagaaggat gctagtaaaa tcatcacttg tatgtattgc 1200 ttctctgagg cacattataa atgccggaac attggtgcaa atgcggcaag gcgtttcaaa 1260 gacaggatgt acttctgttc tcacaattgc tccagcatct atcaaaggat caccgagatg 1320 cagaacaaca agtcgtctat tatagaatct cttgcagttg aacttaaggg agctgtcgct 1380 aatgctgtct ctcaagaaat acaatgtgtt agaagtgagg tgaagcaggt tacaactgca 1440 attgaaaagt cccaggattt cctctcaaat aaatttgatg ccattgttac cgattttcaa 1500 gaattgaaaa aagagaatga atctttgcgg cgcgaagtag ataagctgaa aaatgtccaa 1560 caaaatctat ctcaaacagt ttacaaattg gagcatcaag tagacaaaac aagtcgtgat 1620 gcgaattcca aaaacgcagt tatcttaggt gtcccttttt tgccagatga aaatactcaa 1680 gaaatcgctc agaaaatgat tacttgtctc ggagctgatg tagctgctga tgctatctct 1740 gctgccgcta gaataaattc gaaaaataaa ccgaaaaatt cgttagttcc tattcgtgtt 1800 gtgttcaaag atgaatgtgt caaagaaact gtctttagtc aaaaaaagga atgtggtaaa 1860 attatgtcta cgtcaataga tcccaacttt actatcaatg gtaacccaac aagtgtcact 1920 atacgcgatg agttaacgcc tctgtctatg gaactcttga atgagatgag aagacatcaa 1980 gaaaagctta aaattaagta tgtttggtct agtagaggag gaaacgttct agttaagaaa 2040 aatgaaaact caaaacctga aattgtaaaa acaagggacg atttgattga attggtgaat 2100 cgttacacag gtaacctctc acctaaagat actccttctc ccaaaaggaa gtgtagtagc 2160 aataatttta attaataagc tttatatgtg tgtagttgta aatttttgat tcataaatgt 2220 aaaatggcta acgaaataaa cttttatcat gaaagcatcg atgatttcaa tttaaaatac 2280 ccttatacca accaaaatta tattcgtata gcacaatgga acgttcgagg aataaataat 2340 atgcaaaaat ttgatgaagt gcttatgttc ttagacagta taaaaatccc tatagatgtg 2400 ttctgcgttg gtgaaacttg gctgaaagct aataattgtc ctttgtttag tataccgaat 2460 tatgatccca tcttctcctg tcgtgaaacg tcctctggtg gtcttgcaat gtacataaga 2520 agtggtttga gttttaatat cgtcaaaacg tttagcaatg aaggaatgca ccttattcat 2580 gtagaaatca aaataaatgg tcttctctat gatgtagtag gtgtttacag acctccatca 2640 tatgattttg aaaaatttca cgatgagtta gagttcttgt tgtcaaccca cggttcacgt 2700 cctctttttg tagtaggtga tgtcaatatc ccaatgaatt tgacgaataa taatgttgtt 2760 tcacgctaca aaagtctctt ggactcgtat aattttgttt gctcaaatac gcatgtgact 2820 cgaccaatca gtaacaatgt cttggatcac tttgtcagta gaaaagagga ccttccttac 2880 gtcagaaacg acacagtgta ctcagatgtt agcgatcacc tgattgttgt aacatcattt 2940 aaaattaacg atcctaaaga gcgtattgtg ctaacaaaaa agatcgtaaa caaagaccta 3000 ttaaaccagc gcttcaaaca ttttctggat acttttactt gccgccaaga tgtgaacgat 3060 tcgattataa ccttaatttc tgcatataat aacattctgc aagaatgcac aaagattaaa 3120 agtgaaaaaa taaatgctaa ggcgaaacac tgcccatggc ttaattacta tatttggcaa 3180 tggataaaac tcaagaataa atatctgaaa aaagttaaaa atgaccctaa caatgataac 3240 ttaaaggaaa tgtttcgaca tgtaactaag aaaaccgatg aagcaaaaaa acgatgtaaa 3300 acggaatatt acaaaagtct tctagatcat acatgtcaat caaagatgtg gaaaaattta 3360 aatgaactta tgggtaggaa gaaatctaat actaaaattg aattgaacaa tgaaggttat 3420 aaaacaagta atagcttgga agtatgtgaa atattcaatc aatatttttc caacattggt 3480 ataaagcttg ctaaaacaat caaaagtaat aaccgtaatc ctttgagcac tgtaactcgc 3540 gtcgagcaaa caatttttct aagagaaact aatgtaccag aggtcacttc tattctaaat 3600 gatcttaatg tcaggaaaag ctgtggacca gataactttc cctccagtat cattaaatat 3660 aatactaaca agttttcaca tatcttcatg gaactcttca atcaaatact tatacaaggt 3720 aaatatccag attgtttaaa agtggctaaa gttacaccta ttttcaaatc cggtgacaaa 3780 tctgatcctt gtaattatcg tcctatttca acgttgtccg tatttagtaa agtttttgaa 3840 aagttgcttg tcagcagaat tacagatttt ttgaacaaat ataatgtgtt atataaattc 3900 caatatggat ttagaaaaga ctgcagtacg tctactgcca ttgtagagct tgttgatttt 3960 ttgatcagaa aaatcgataa taaattcact gttggaggct tattcatcga cctgaaaaaa 4020 gcatttgaca cgttgaatca tgatattctt ttacaaaaac tggaatgtta tggcatacga 4080 gggttggcat atgagataat tagaacctac ttgtgtgatc ggcaacaatt cgtaatgtta 4140 caagatgcgc aaagttctct tcaaactgtt aacattgggg ttccacaagg aagtaacatt 4200 gggccacttt tgtttctctt atacataaac gatattggaa acttgccact gaagggcact 4260 ccaaggctgt ttgctgatga cacagcactg ttttatcctc attttgaaac ttcatgcatt 4320 atagatttta tcaatcacga catgcttttg cttgtaagat atttcgatac aaatttactc 4380 tcccttaatt tgtcaaaaac aaaatacatt ttgtttcatt caccgaggaa aaaaattgta 4440 tcacacaaca accctattgt gggaacaacg gcgattgaaa aagtgaaaaa tttcaaatac 4500 cttggtctga tattagatgc caccctttcc tggaaagagc atatagagca tcttcgaaat 4560 aaaattgctt ccctttgtgg tctaatgcat cgcgtgaaaa catttgtacc gaagcataca 4620 ctattaagtt tttattacag ttgtatccac tcacatctgt tgtacctcat tattgtttgg 4680 ggtcatgcaa gcaaatcaaa ccttcaaaaa attcaagttt tgcagaatag atgcctcaaa 4740 atcatttttc atttaccgca actattttcc actgttcagc tctactccaa cgtttcgcac 4800 aacatccttc ctttacgtgg cttatgtaag ttacaaacat gtttgtttat gtatgatata 4860 ttgaaaaaac caaatatgca tcagaacctc agcttcactg tttctaatca tgaacacaat 4920 actcgttacg caaataactt attgcgctca cgagccagta cttgtctagg ccaaatgaga 4980 atttctttct tcggaccatc cgaatataat gcacttcctg aaagtttaaa aactctgaat 5040 aatcgatcgt tgttcaaaac cagtttaaaa cgtcactaca aatccagtat aagtgatttt 5100 ttgtagaacc gtatcagttg ccagtatgat ttagtatgat ttacactctg tcgtgtttta 5160 tttacaatct gtcgtgtttt cacatattta atcaatatta aaataaattc tcccaattta 5220 gttttaaata tcatgaaata gggctccctt aaaaggaatt tttattccac tgggaaaccc 5280 cctatatatt catagtgtat aaatgcctca tcactgttca tccattcaat gtaaactgtt 5340 taaatgttac gtgtttactg ttgtgcttgt tttttttttt tgtaaaatat aaataatacg 5400 atagatgcgt ccactaccag ggggctccga aaatgagtct cttggtgtgg gggatagtgg 5460 tgggccataa aaaaaaaaaa agaaaaaaaa a 5491 // ID Gypsy-74_CQ-I repbase; DNA; INV; 4635 BP. XX AC AAWU01042220; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-74_CQ_; KW Gypsy-74_CQ-LTR; Gypsy-74_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4635 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 527-527 (2011). XX DR Genome; AAWU01042220; Positions 14762 19396. XX CC Positions [3585-3935] - Integrase core CC 'GTCAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 345..4619 FT /product="Gypsy-74_CQ-I_1p" FT /translation="MDADQFQQFMAKQNELVTKLVAGLQQRAVPASNISSV FT VPPPPALCLDGDMEENYEFFLTNWRNYSQAVGMDSWPANQASKKVSFLMSV FT IGEAALKKYYNFDLTEDQKKDVAEALKAIKQKVVRSRNLFVDRLDFFAASQ FT VSSESIDDFAARLKSLAKPCKFALLEDEFILFKIVTSNKWGHLRTKMLSMQ FT DLTTGKAVDICRVEEIAEQRLRHLSLESPVVSDVKKINKKTKKKLQRCKFC FT GDEHEFVKGACPAYGKKCKRCGGRNHFEKACKIDQPWKQKRRHRRVKEVKD FT DESSSEERSESSSVDSDDEESEHEVQIAKIKSKSAVESSALAVLDFKLGRK FT WKPVTCEIDTGADATLIGYNCLTELLETPNPTLQPTSIKLKSFGGNPIPVL FT GQVMLPVKRKGKRFQLVLQVVDYDHRPLLSLKASQSLGFVKFCRSVKIRRP FT EAATEEEKLLSVYKVKAEAIVEKHSDIFAGYGRFPGVVSLEVDESVQPSIQ FT HPRRVPIALRPKLKQELDKLEEDGIIVKETRHTEWVSNIVLVRRGGQNESV FT RICLDPVPLNKALKRPNLQFDTIDELLPELGNAKIFTTVDTKKGFWHVELD FT TPSSKLTTFWTPFGRYRWVRMPFGISPAPELFQLKLQGAIQGLEGVTGLAD FT DLLVYGTGADLEEALDNHNRALEQLLVRLKENNVKLNRSKLKLCQTSVKFF FT GHVLTTKGLTADETKTAAIRNFPTPTNKKELLRFVGMVNYLSRYIRNLSAH FT FVNLRKLIAEKESWRWTAAEDEEFQKVKNLVADIHTLRYYDVTKPLVIECD FT ASGFGLGVAVFQQDGVIGYASRTLTATEKNYAQIEKELLAILFACLRFDQL FT IVGNPKTIIKTDHKPLVNIFRKPLLSAPRRLQHMLLNLQRYKPEIMFVAGK FT ENVVADAISRAPYDDDHAQAGDYQKLEIFKVMRDLEDVKLKHFLNISDDRL FT TEIMAETAADPVLQLVIRLIGEGWPETIGGVPDSVRVFFSYRNELTTQDGL FT VFRNDRILIPHKLRRKMIEKTHISHNGVEATLKLARANIFWPGMSAQIRDT FT VKECAVCAKFAGSQSKPPMPSHPVPVHPFQLVSLDVFFAEYRGIKRKFLVT FT VDHYSDLFEIDLLKDLTPQSAIAACKINFARHGVPQLLLTDNGTHFVGKEW FT RQFAGEWDFSHTTSAPHHQQANGKSEAAVKIAKQLIKKCEESGTDFWYALL FT HWRNVPNKIGSSPAARLFSRQTRCGVPTAAGNLLPRVVEGVPDAIKENRRK FT IKYYYDRKCRNLPQLETGAPVYVQVHPQKSTVWSPGTVAAKQTDRSYLVDV FT DGAFYRRDLVNLKSRKEPNTQPAVNHPVLQDNPLPSGPNPDVAVAEKPAEF FT PWTDSPSTPITKSSKPNKRLSSLSSTPQPKEKASPTETVTLERSRPKREVK FT LPTKLHDYCLE" XX SQ Sequence 4635 BP; 1255 A; 1123 C; 1268 G; 989 T; 0 other; tggtgtcaga agaatcgtga ccatagaaac tgatttcgtc gtcatttata tcgcgaatat 60 tgaattttgt gaaagttaaa aaaaaaaaaa actgaacaat cagtgaagtg ctacggaagt 120 ttcagcgttt tccccaaaag agactattgt tatctctttc tcgcattgtt ggctgtgcga 180 gtgcgatagt gtactgttta caccagtgta ttagtgcaaa cattgcacca gtgtgggaaa 240 acgtttgaga acgatccgct gaagtgacaa gtaaaacgta aacaaaacga taagttaact 300 gacagttgaa aagagactac aagtgaagaa aaatagtgga aaaaatggat gccgaccagt 360 ttcaacagtt tatggcgaaa caaaatgagc tggtgacgaa gctggttgct ggcctacaac 420 agcgtgctgt gccggcgtcg aacatcagtt ccgtggttcc gccaccaccg gctctctgtc 480 ttgacggaga catggaggag aactacgagt ttttcctaac caactggaga aactactcac 540 aagccgtggg catggacagt tggccggcga atcaggcctc aaaaaaggtc agttttctga 600 tgtccgtcat cggagaagct gcgctcaaaa agtattacaa cttcgacctg acagaggatc 660 aaaagaaaga cgtcgccgag gccctgaagg ctatcaagca gaaggtcgtg cgcagccgta 720 acttgttcgt ggaccgattg gattttttcg cggcgagtca agtgtccagc gaatccatcg 780 atgactttgc agcccggctg aaaagtttgg cgaaaccgtg caaatttgca ctcttggagg 840 acgagtttat cttgttcaaa atagtgacca gcaacaagtg gggccacctg cggaccaaaa 900 tgttgagcat gcaggatctc acaacgggaa aagcagtgga tatctgccgc gtggaagaga 960 ttgcagagca acgtttgcgg cacctctcct tggagtcgcc ggtggtcagt gatgtgaaga 1020 agattaacaa gaagaccaag aagaagctgc agcggtgtaa gttctgtgga gatgagcacg 1080 agtttgtgaa gggagcgtgt ccagcatacg gcaagaaatg caagcgttgc ggcggccgca 1140 accactttga gaaggcgtgt aaaatcgatc aaccctggaa gcagaagcga cgccaccgtc 1200 gtgtcaagga ggtgaaggac gacgagagca gttccgagga acgctctgaa tccagcagtg 1260 tggacagcga cgacgaagaa agtgagcacg aagtgcagat tgccaaaatc aagtccaagt 1320 cagctgtgga gagtagtgct ttggcggttc tggatttcaa gctaggccgg aagtggaaac 1380 cagttacgtg cgagatcgac acgggtgcgg atgccacatt gatcgggtat aactgtttga 1440 ctgagcttct tgaaactccg aacccgacac tgcaaccaac aagcatcaag ctgaagtctt 1500 tcggtggcaa cccgatacca gtgttgggac aggttatgct acctgtaaaa cgaaaaggca 1560 agcggttcca gctcgtcctg caagtcgttg actatgacca ccgcccgctc ttgtcgctta 1620 aggcttcaca gtcgctgggc tttgtgaagt tctgccgatc ggtgaagata cgccgaccag 1680 aagcagccac agaagaggag aagctgctca gtgtctacaa ggtgaaagcc gaagcaatcg 1740 tcgagaaaca cagtgacatc ttcgctggct acggccggtt tcccggagtg gtttccctgg 1800 aagtggacga aagtgtgcag ccatctattc agcatcctcg tcgggtccca atcgctttgc 1860 gcccgaaact gaagcaggag ttggacaaac tggaagagga cggcattata gtgaaggaaa 1920 cgcgacacac cgagtgggtt agcaacattg tgttagtccg ccgtggcggc cagaacgagt 1980 ccgttcggat ttgtctcgat cctgtcccgc tcaacaaagc gctgaagcgg cccaatttgc 2040 agtttgacac gattgacgag ttgcttcccg agttgggcaa cgccaagatt ttcacaactg 2100 tggacacgaa aaaaggattt tggcacgtgg aactggacac ccccagcagc aagctgacca 2160 ccttctggac accattcggt cgataccgct gggtcaggat gccgtttggt atttctccag 2220 cacctgaact gttccagctg aagctgcaag gtgcgataca aggactggaa ggagtgaccg 2280 ggttagcgga tgacttgctg gtgtacggca ccggagcgga tctggaggaa gctttggaca 2340 accacaatcg ggctctagaa caactgttgg tgcggctcaa agagaacaac gtgaaactca 2400 accggtccaa gctcaagttg tgtcagacat cggtgaagtt ctttggacac gtgctgacaa 2460 cgaagggact aacagcagac gaaaccaaga cagccgcaat ccgcaacttt cccacaccaa 2520 ccaacaagaa ggagctcctg cgatttgtcg gcatggtcaa ctatttgagc cggtacatcc 2580 ggaacctaag cgctcatttt gtcaacctgc ggaaactgat cgccgaaaag gaaagctggc 2640 gctggactgc tgctgaggac gaggagttcc agaaagtgaa aaatctcgta gccgacatcc 2700 acacgctgcg ctactacgac gtgaccaagc ccctggtaat cgagtgtgac gcgagtggtt 2760 tcggtctggg cgttgcagtg tttcaacaag acggtgtgat cggttacgcg tccagaaccc 2820 tcactgcgac ggagaagaac tatgcccaga tagagaagga gctgttggcg atcttgtttg 2880 cttgcctcag attcgatcaa ctaattgttg gcaacccgaa gaccattatc aaaaccgacc 2940 acaaaccgct ggtgaacatc ttccggaagc cgttgttgtc ggccccacga cgcttgcaac 3000 atatgctgtt gaacctgcaa cgatacaagc cggagattat gttcgtggcc ggcaaagaaa 3060 acgtcgtcgc tgatgccatt tcgcgagccc cttacgacga cgatcacgct caagctggag 3120 attaccagaa gctggagata ttcaaggtga tgcgcgacct ggaagacgtt aagctgaagc 3180 acttcttgaa catctcagat gatcgtctga cggagatcat ggcggagacg gcggcggatc 3240 cagtgttgca gctggtgatc cggttgattg gtgaaggttg gccagagacc atcggtggtg 3300 taccggacag cgtccgcgtt ttcttcagct acagaaacga acttaccacc caggatggat 3360 tagtgttccg taacgacaga atactgattc cccacaagct gcggcgcaaa atgatcgaga 3420 agactcacat cagccacaat ggtgttgagg cgaccttgaa gctggcacgt gcgaatattt 3480 tttggcctgg gatgagcgca caaattcgag acacagtcaa ggagtgtgcc gtctgcgcca 3540 aattcgccgg ttcgcagtcc aagcccccga tgccgagtca cccggttcca gttcacccgt 3600 ttcaactggt gtcgcttgat gtgttcttcg ctgagtaccg aggtattaag cggaagtttt 3660 tggtgacggt tgatcactat tccgatctgt tcgagattga tctgctgaag gatctcactc 3720 cacaatccgc catcgcggct tgcaagatca acttcgcgcg gcatggcgtt ccccagctct 3780 tgctgacaga taacggaacg cacttcgtgg gaaaagagtg gcgacaattt gcgggagaat 3840 gggacttcag tcacaccact tcagcgccac accatcaaca agcgaatggc aaatcggagg 3900 cagcggttaa aatcgctaaa cagttgatca agaagtgtga agaatccggc acggattttt 3960 ggtacgctct cctgcattgg cgtaacgtac caaacaagat cggatcgagt cctgcagctc 4020 gcttattctc tcgtcaaaca cgatgtggtg tcccaactgc tgctggcaac ttacttccgc 4080 gagtggtcga gggagttccg gatgctatca aggaaaacag gaggaagata aagtattatt 4140 acgatagaaa gtgtcgtaat ctgccgcagc tggaaaccgg tgcaccagtg tacgtccaag 4200 tccatccaca gaagtctacg gtgtggtcac ccggaacagt cgccgccaag caaactgacc 4260 gatcgtatct cgtggatgtg gacggggcat tctatcgacg ggatttggtt aacctaaagt 4320 cacgcaaaga accgaacacg cagcctgccg tcaaccatcc cgttctacag gacaatccgt 4380 tgccgtctgg tccgaatcca gatgtagcag tggccgagaa accagcggag tttccgtgga 4440 ccgattcacc gagcaccccg atcaccaaat cttcgaagcc gaacaaacga ttgtcttcgc 4500 tgtcatcgac accgcagccg aaggaaaaag catcgcctac cgagacggtt acactggaga 4560 gatcgcgccc caaacgagaa gtgaagctgc cgactaaatt acacgattat tgtctggaat 4620 aaaaagtggg gagga 4635 // ID Copia-104_AA-LTR repbase; DNA; INV; 154 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-104_AA_; KW Ty1_copia_Ele172; Copia-104_AA-I; Copia-104_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-154 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 154 BP; 49 A; 35 C; 21 G; 47 T; 2 other; tgagataacg aagttamcta agtagttsac cattcccgga gttactatag tcaaccctag 60 caacctgatt gtttagcagg tataagcagt tataataaat tttcattcca ccttcaagct 120 tacaccaaac cagacgttct ttcattaact ctca 154 // ID Copia-1_CQ-I repbase; DNA; INV; 6112 BP. XX AC AAWU01000183; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CQ_; KW Copia-1_CQ-LTR; Copia-1_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 317-317 (2011). XX DR GenBank; AAWU01000183; Positions 18690 24801. XX CC 'TAAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1790..4732 FT /product="Copia-1_CQ-I_1p" FT /translation="MAEKFAIQKLNGRNWQTWNIRIEALLSREDMWSVVTD FT AIPVGDARKPEWVAQDRKARSTILLLIEDNQFPIVKGSTHAKEVYDKLKAY FT HLKTTRSFRVSLLKRLCSTNLTERGDLEQHVLELDDLFDRLQEAGMDLASD FT VRVCMLLRSLPSSFDHLVAALDLQSDKDLTLDTVKSKLSDEYHRRQERDGG FT GELKVEKAMRSAEQCEKVCYQCKKPGHLMRNCRLGRRRSVTPAKKEENGRA FT KAAHSDAKAVAFTAGDPGDERAGWVIDSGASAHMTSDRSIFTSLREFAGGW FT ITLADGKKTQILGEGSVVLFGLDGEEDVVKIEVKEVKFVPGLSTNLISVSK FT LAHNGLKVFFDDVGCQISDPSGRVEATGDRRGGMYFLCLAEASAGQHTESC FT QHLWHRRLGHRDLAAEESINNEQLATGMKVRDCGNRSVCECCLEGKSAKAP FT FSPVTERNSTQILDIVRTDLCGPIQMSTQTVKRYVMHLIDDVSRCTSEAVD FT RIKVRWTENRTCRKMQINRLDGGNGFDSKELRKCDAKFIELSNRSSQVELP FT AAGPPAVGPPAVRETVAGPPAAERPAAETPAAEEEIELLPCGEKLEEVPEE FT ENEGNETVSDLQDAVPEEEPEDYPAPTGGADGAGGGRPKRYIQRPKHLDEY FT EVGVASCAFEDPENYKEAMKKPEWRKAMQELAALSSRTALGKEELVCRRRR FT SIYGLRQSARCWNRKLNEVLTKLGFKAADADPCLYRMQQGGTTVLLLVYVD FT DLHDIDRIFRCLQREFELTNLGGAKHFLGMEVKRDDGCYKVRVKNHIDKLL FT TKFGMEQAKTMKSPMDPGYLKAEEPSTPFGDQTKYRSLVGGLMYIACTARP FT DIAVGAAILGRKFSAPNEADWTAAKRVLRYLKATREYYLRLGGEPGEALVG FT HSEADWAGDPGNRRSTSGFVFTFGGGAISWASRQQLCVTKQIALEYCPTNR FT LVADLFTKPLGPLKLGQFCGTLGLLA" XX SQ Sequence 6112 BP; 1554 A; 1336 C; 1810 G; 1412 T; 0 other; tttggcgatc ctgccggtta attacgggga cctgaaggga aaattttgaa cctgcgaaaa 60 tgtttggaga atccccaact tgaggagatt ttgtgcttca attgaggttg gaagtgtttc 120 gtggaagtgt tttgatgtga ttgttttgac tttggtaagt tgcggtgtgc ttgctggtgc 180 cggatggaag gcaggccgat tttattccct ttaatttgtt tgggcatttg gaatgtcccg 240 gcgatcacgt gggttgaaat cggtctaatt ttgaagacct tcaaaagatg ttttactgtc 300 gtgtttggac aaatgtggtt gaaagaggca cagacataaa tggaattgtt gttgcgagcc 360 ggagtcgcaa aagcgggttg ttgctaaaag ggttgccagc aggagatgga acaggttgtt 420 gtaagatgag aagctcacca ggcatgagac ggattgtgac ggaatttggg accgccagat 480 gaaagagctt gcattgccta gtaacgcgca ggcagggaac gaattggatc attggcctgt 540 agccaagaga gttaaccttt tcatgataag acgcctgcag gagcgtaata tgttgttctg 600 ctctgttcca tgtaatgtca gtgtcactcg ttgagtactc cagcgtcgtc gtctacttcg 660 gtgccgtcgg cgcgtactcg tcactcgggt gtgcacacgt gcaaacaggg gagagagtgt 720 ggatctggtc agtatagatc gcgataccga gcgagcgtgc ggcaggttcg cctatgtccg 780 actacaaaca cgcttcgctg ccaccggttc gcgtgatcca cacagtgcga ctgatcattc 840 cattcaaact caaacaaaca tttatgaatt gcgaggtgtg acggcacggc accactggaa 900 attgtaacga atttacgaat tgcgaggatg gaggtgagac ggcacctttg agccgtttgg 960 gaagaaagtc ccactggaaa ttgtaacgca tttacgaatt gcgaggatgg agatgagacg 1020 gcacctttga gccgtttggg aagaaagtcc cactggaaat tgtaacgcat ttacgaattg 1080 cgaggatgga ggtgagacgg cacctttgag ccgtttgaga agaaagtccc actggaaact 1140 gtaacgcatt tacgaattgc gaggatggag gtgagacggc acctttgagc cgtttgggaa 1200 gaaagtccca ctggaaactg taacgcattt acgaattacg aggatggagg tgagacggca 1260 cctttgagcc gtttgggaag aaaatcccac tggaaattgg aacgcattta cgaattgcga 1320 ggatggaggt gagacggtac gtttgagccg tttgggaaga aaagctcaca ggaaatttgg 1380 tcgcatgtat tgcgatgttg gatgtgtgga aatctgcgct gtccacatca agtctgcgca 1440 catcctaggc ccttctggca accacgatga cgacgatgac gagctctcct tctttgacga 1500 ctcgtcgatc gtcctcagaa ccactttagc gatcaactcg atcagtcgtg ttttttacaa 1560 caaataaacg ttagttagtc ttaactgtaa attctcgcgt tttcattctg actttcgact 1620 tctgctttcc tctggtcgcc gataggttat gggcccagca gagtcggaat agaaggttcg 1680 gttgtaaaaa gaaaaaccgc gtggttttct ttgttcccgc agtggaaaaa gtttttttag 1740 agactattgt ttgtctgtgc ctttgtgttc cgtaccggaa gtgaacaaaa tggcggaaaa 1800 gttcgccatc cagaaactga atggccggaa ctggcaaacc tggaacatcc ggatcgaagc 1860 gctgctttcc cgcgaagaca tgtggtccgt cgtgacggac gccatcccgg tcggggacgc 1920 ccggaagccg gagtgggtcg cacaggaccg gaaggcgcgg tcaacgatcc tgctgttaat 1980 tgaggacaac cagttcccga tcgtgaaggg cagcacccac gcgaaggaag tgtacgacaa 2040 gctgaaagcg taccacctga aaacgactcg gtcgtttcga gtaagccttt tgaagaggtt 2100 atgttcgacg aacctcaccg aacgcggtga cctcgaacag cacgtgttgg agttggacga 2160 tttgttcgat cgtttacaag aggctggcat ggacctggcc agcgatgtcc gggtgtgcat 2220 gctgctccgc agtctgccgt catcgtttga ccacctggta gcggcgctgg acctgcagtc 2280 ggacaaggac ctgacgctgg acacggtcaa gtccaagctg tcggacgagt accatcgtcg 2340 gcaggagcgc gatggcggcg gtgagctgaa agtggaaaag gccatgcggt cggccgagca 2400 gtgtgaaaaa gtgtgctacc agtgcaagaa gcccggacat ttgatgcgca attgtcgatt 2460 gggcagaaga agaagtgtca cgccagcgaa gaaagaagaa aacggcagag ccaaggcggc 2520 tcacagtgac gcaaaagcgg tcgcgttcac ggccggtgac cccggtgacg aaagggctgg 2580 ctgggtgatc gatagcggcg cgagtgcaca catgacgagc gatcggtcga ttttcacctc 2640 gctccgagag tttgccggcg gctggatcac gctcgccgac gggaagaaga cgcaaattct 2700 aggtgaaggc agtgttgtgt tgtttggcct agacggcgaa gaagatgtgg tgaaaataga 2760 ggtgaaagag gttaagtttg tgcccggctt gtcaacgaac ttgatatccg tgtccaagct 2820 ggcgcacaac ggcctcaaag tgttcttcga cgacgtcggc tgccagattt cggaccccag 2880 cgggcgtgtg gaggcaaccg gagatcgtcg cggcgggatg tacttcctgt gcctggcaga 2940 agcatccgca gggcagcaca cggagagttg ccaacaccta tggcaccggc ggctgggcca 3000 tcgtgacttg gcagctgaag aaagtatcaa caatgaacag ctggccaccg ggatgaaagt 3060 gcgtgactgt ggcaatcggt ccgtttgcga gtgttgtctg gaggggaaat ctgccaaagc 3120 ccccttctcg ccagtgaccg aaagaaattc gacgcagatt ttggacatcg tgcgtacaga 3180 cttgtgcggc ccgatacaaa tgtcaacaca aactgtcaag cgttacgtca tgcacctcat 3240 cgacgacgtc agtcgatgca catcggaggc cgtggacagg atcaaggtac ggtggacgga 3300 gaaccggacc tgccggaaga tgcagatcaa ccgtttggac ggaggtaacg gattcgacag 3360 caaagaattg agaaagtgcg acgccaagtt catcgagttg agcaacagga gttctcaagt 3420 ggagttgcca gccgcggggc cgcctgccgt ggggccgccc gccgtcagag aaacagtggc 3480 gggaccgcct gccgcggaac gacctgccgc ggaaacgcca gctgctgagg aggagatcga 3540 gctgctgccg tgtggggaaa agctggaaga agtgcctgaa gaagaaaatg aagggaacga 3600 aacagtgagt gatcttcaag acgcagttcc tgaagaagag cccgaggatt atccggcacc 3660 aaccggcggc gcggacggcg ccggcggcgg tcgaccgaag cggtacatcc agcggccgaa 3720 acatcttgac gagtacgagg tcggcgtggc atcgtgtgcg ttcgaagatc cggagaacta 3780 caaggaagcc atgaagaaac ccgagtggcg gaaggcgatg caggagttgg cagcactgtc 3840 cagccgtaca gcgctgggaa aggaggagct cgtgtgcaga cgtcggcgca gcatttacgg 3900 tttgcgccaa tccgctcggt gctggaaccg gaaactcaac gaagtgctga cgaagctggg 3960 gttcaaggcc gcggacgcag atccatgtct gtaccggatg cagcaaggcg gaacaactgt 4020 gttgctgctt gtttacgtcg acgacttgca tgacatcgac cggatcttcc ggtgcctaca 4080 gagagagttc gagctgacca acttgggggg tgccaagcac ttcctgggca tggaggtgaa 4140 acgcgacgac gggtgctaca aggtgcgggt gaagaaccac attgacaaat tgctcacgaa 4200 gtttgggatg gagcaagcga aaaccatgaa gtcacccatg gaccccgggt acctcaaagc 4260 cgaagaaccg agtacgccct ttggagacca gacgaagtac cggagcttgg taggtggact 4320 gatgtacatc gcctgtactg ctagaccgga cattgcggtc ggtgcggcaa ttcttgggcg 4380 caagttcagc gcgcccaatg aagccgactg gaccgcggca aaacgtgtgc tccggtatct 4440 aaaagcgacg agggagtact atcttcgact gggcggtgag cctggggaag cacttgttgg 4500 gcactcggaa gcagactggg ccggcgaccc aggaaacaga cgctcaacgt ccggctttgt 4560 tttcaccttt ggaggaggag caatctcgtg ggcaagccgt cagcaactgt gcgtcacgaa 4620 gcagattgcg ttggagtact gtccgacgaa ccggttggta gccgatctct tcaccaagcc 4680 cctgggaccg ttgaagcttg ggcagttctg cggaacgctc gggctgctgg cgtaggcgtt 4740 gctgctgtcc atcgaggagg agtgtggaaa tctgcgctgt ccacatcaag tctgcgcaca 4800 tcctaggccc ttctggcaac cacgatgacg acgatgacga gctctccttc tttgacgact 4860 cgtcgatcgt cctcagaacc actttagcga tcaactcgat cagtcgtgtt ttttacaaca 4920 aataaacgtt agttagtctt aactgtaaat tctcgcgttt tcattctgac tttcgacttc 4980 tgctttcctc tggtcgccga taggatggca cctgtgcacc tggttgacca gtatgatcac 5040 atttacgagt tctgggggag gaatttgatt tagaaaatga ttacattttc gagcgacgat 5100 aagggaattt gtagcagcca acacgacaac aggtcctgaa tctgacgtgg agaaaattat 5160 ctcgtggaaa ttgaggcgat gcttacaatt gaaaagccga ataacgaaaa ctgttttaat 5220 gatagttcgg aaactgttat agaacaggcg agcgacggtc ttcgtttgga aatgttcgtg 5280 tgtgaatcga actgggacga tcgctgggaa atgtttgttt actaataaga cttgttgcgg 5340 accgggacgg ccgcttcata atgttcgttt acaaatcgaa ctgggacgac tgctaggaaa 5400 catcagacta cgaatcagac tggttgcaaa ccgggaaacc gcttggaaat gttcgtttat 5460 gaatcggact gggaaaaccg ctaggaaatg tctgtttaca agttggaatg attgtagacg 5520 ggaaaatatc tgttgacaaa ttcaactaat aagtcgaact gggtctgtcg ctagaaaacg 5580 tctacttaag aatcggaccg actgcagact gggatggccg ctggaaattt ggccaatttt 5640 ttgggagctc tcaaggaact tcgatgaagt ttatttttta tgtgtgaaga agcaaaaaga 5700 ttaggaaatc taagagtttt gctaaagcct agattaatag aattgttgaa ggctcgcgat 5760 ggatggcatg agaaatggtg ctttggttaa ggattaagat tagccccacc tcattagaag 5820 aaaggactat aacggtcaat gaatgggata gatgggaagt gtgattgagt tttagattta 5880 taaagattgc tatttgtcaa aatagaagag ttcacttata aaattactaa tttaatgagt 5940 atgtgatcat gtcgattttt gctgcggctt ttctaccgtt acctgcaagc gctggagttc 6000 taaaagctcg ttagaattag agtgatgttt gaagatactt gatttttttt ccctttatca 6060 ttcaataatt ttgtaaatct cctatgtggt agttttaaga gaggaaagag ga 6112 // ID Mariner-1_BM repbase; DNA; INV; 2386 BP. XX AC . XX DT 25-APR-2010 (Rel. 15.07, Created) DT 25-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-2386 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 936-936 (2010). XX DR [1] (Consensus) XX CC ~96% identical to consensus. XX FH Key Location/Qualifiers FT CDS 861..1859 FT /product="Mariner-1_BM_1p" FT /translation="MSESNEDIRYILKFYYKKGKKATQAVKKNCDVYGPSA FT VSVRVAQIWFKRFQSGNFDVKNARRSGRPITDKIDAIFEKVGQDRHISSYD FT VTEELGIDHKIVMAHLKKTGYTKKLTERNLMNRVLICDSLLRRNETEPFLK FT KLITGDEKWITYDKNVRERLWSKAGQASQTLAKPGLTRNKVMLCVWWDWKG FT IIHYELLPPGRTIDSELYCEQLMRLKQEVERKRPELINRRGVVFHHDNARP FT HTSLATQQKLRELGWEVLMHPPYSPDLAPSDFYLFRSLQNSLGSVRLTSRE FT DCQNQLSRYFDQKPQNFYSNGIMSLPTRWQKVIEQNGTDIL" XX SQ Sequence 2386 BP; 755 A; 440 C; 515 G; 676 T; 0 other; cgtaggatag accattctaa ccggcgccat ctaggcgggc ttcggacagc ctgccgaccg 60 agagggctgg tggtcgtggc gccgacgacc gccagtccgg cgtcccgagg gggagggtga 120 tgggagagat gtgctccgca ctaaacgctt cactttcccc cctttgcctt ttcatgagct 180 ctgtctcatg cgaggtttgg atgctggttg ttgagcgaca ggaggtttta gtcggttcga 240 ctccgacatg ccccgccctc catccccagt gaagggcgga agtccggcga tttcctcctg 300 acaaaaaaaa aacgtgtggc actcgggaac tgcggcggta aagctattgc atagcatttt 360 ttatcaactt atgacattat aattagacaa tgataattta atattaaaac aataataaaa 420 caagaccacg ctatattaaa ccttaataaa ggcaaaacct taactgtccc cttcacactc 480 ataagctaaa ccgcgcgaga gagagatggg cagacttttc atgatgcgca tgcagtgtga 540 cgtcacgcca cgcgcttatt cacaaacact acacaagcgc aacgtgtgaa tgtgttgaac 600 gcgagctaca tggtaggcgg agtgagggat gttaggtttt tttcgttacg gaatttcatg 660 attcggtcgc cgcgctcaag gcccgcgata aaagctatgc aatagcttaa aaatgtatga 720 acttgtaata aaatctcttt ggctatacta tttgtatctg gctggttttg gtatcattaa 780 aagtttaaat tcgaaagaag ataattccaa attaaaatta ggaagatgtg tgatttttat 840 ttatttttcg tactgtcaag atgagtgaat ctaatgaaga tattcgatac attttaaaat 900 tttactacaa aaaaggtaaa aaagcaacgc aagccgtgaa aaaaaattgc gatgtttatg 960 gacctagtgc agtgtctgtg agagtagcac aaatttggtt taagcgtttt caatccggaa 1020 attttgatgt caaaaatgca cgtcgctctg gtcgccctat tacggataaa atcgatgcca 1080 tttttgaaaa agtggggcaa gatcggcata tcagtagtta cgacgtaact gaagaactgg 1140 gaattgacca caaaatagtt atggcgcatt tgaaaaaaac tgggtacaca aaaaagctca 1200 ctgaaagaaa cctaatgaac cgtgtactca tttgtgattc tttattacga cgtaatgaaa 1260 ccgaaccatt tttgaagaag ctgataactg gtgatgaaaa gtggatcacg tacgacaaga 1320 acgtacgaga aaggttgtgg tcaaaggccg gtcaggcttc acagactctg gcgaaacccg 1380 ggttaactcg caacaaggtg atgctgtgtg tgtggtggga ttggaagggc attatacatt 1440 atgagctgtt accaccaggc aggaccatcg attctgaact ctactgcgaa caactgatga 1500 gattaaagca agaagttgag agaaagcggc cggaattaat caacagaagg ggtgtggttt 1560 ttcatcacga taacgctaga cctcacacat ctttagccac tcagcaaaaa ttaagagagc 1620 ttggctggga ggtgttaatg catccgccgt atagtcctga ccttgcacct tcagatttct 1680 acctgtttcg gtctcttcag aattctttag gcagtgtcag gttaacatca cgagaggact 1740 gccaaaacca attgtcgcgg tattttgatc agaagcccca aaatttctat agcaatggga 1800 tcatgtccct acctacaaga tggcaaaaag ttatcgaaca aaatggtacc gacatacttt 1860 agttaaatgt gaataaacta tattaaaaaa tgttgtgaat tttcttaaaa aatgcgaaga 1920 aactttttcc ccaacctatt attatataga aataaaatag taaacaatgt ttacaataag 1980 tgagctgttt aggcttacta tactatttga cattggacgg taatgggtaa attgtaaaaa 2040 ataataatct aagacaagtt tttgaacaat aattatatac gcatttccat tttatataaa 2100 aattatgatt cttacatcag taatttctta taaaatgcta ggaaaatgct gtaaaatgcc 2160 cttatgttaa aaacaagaaa gaaagtaaag aaaacctcaa tttgtgttag tttacgtatg 2220 tgtaatattt taaaactctg catggtagtt ttttttctaa tgtttaaaat aatttctggt 2280 gcaatgtaat taaggtaaaa ttgctctaat tttttaacta tcatcgatca tgttccgcag 2340 aaacttgtta caacttgatt gcgtacggtg tgtgggctat cctacg 2386 // ID LGRP1 repbase; DNA; INV; 213 BP. XX AC L42498; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 2.02, Last updated, Version 3) XX DE Leishmania gerbilli DNA repeat. XX KW LGRP1; Repetitive element. XX NM LGRP1. XX OS Leishmania gerbilli OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania. XX RN [1] RP 1-213 RA Piarroux R., Fontes M., Perasso R., Gambarelli F., Joblet C., RA Dumon H. and Quilici M.; RT "Phylogenetic relationships between Old World Leishmania strains RT revealed by analysis of a repetitive DNA sequence."; RL Mol Biochem Parasitol 73(1-2), 249-252 (1995). XX DR GenBank; L42498; Positions 1 213. XX SQ Sequence 213 BP; 49 A; 65 C; 68 G; 31 T; 0 other; gcaagaatca agaggccgtg tcagagatgg gtcgaagggg gacggtggga gcgggaaaga 60 gacgacgggc acgtggcgac gctggaaaga aagaaaagca gaagacgcgt attccctttt 120 gctgatgtgt gcccgcctct ctgccacaga ccacgagctc agctccactc caccctaacg 180 ccccctcgcc gtgcggccct gtcgcaggct ccc 213 // ID hATm-40_HM repbase; DNA; INV; 4337 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-40_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4337 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1934-1934 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2141..3931 FT /product="hATm-40_HM_1p" FT /translation="MESKKFKKVQSCKRRREEEKQMREKKHTNQKENNTQY FT EQDNEHENYEEKSNENEIFSIQKPCTKYLKKSRNYEDISNIALASIRYGVG FT LRATAAIATAAWVDGGLVSQSDTRLIIDHSKVRRARDKVMNEVAVQFDDYC FT NKEIIDCIFFDGRKDLTKVFLTVDGSDRQYPAKVKEEHYTVCSGDGKYLFH FT FTPAKEAKKNHAEIIADKIIEYTAMRNISTHLKAIGGDSTNVNTGWDGGVM FT HFMELKLGRKLNWIVCALHTNELPLKRLIKLLDGESKSGTKWSGPIGSLLD FT EATNLEINPFFSRVETGEPLINLSNDVVKDLSTDQYYGYQIVKAIRSGHVP FT QQLGLLEIGPVNMARWLTTANRICRIWVSHHGLQGDAAQNLLKLVEFIVGV FT YFPVWFNIKVKYSWTEGPKHVLYQLKLLQHQSLDIQNMVIPTIKRSAWYAF FT SESILQTMLCSENKEERDFSVNKILEIRGEGDENVQLGDLSVRPRRTPDIN FT TKAICLKDLIDWNEAFEPPLTCCMTTKDIKEYFFKPMIVPNWCCHTQAVER FT CVKKVTQACENVFTEDRRDGWIKGQELSQKLMSRNLSKQNLIGLTMFKN*" XX SQ Sequence 4337 BP; 1562 A; 599 C; 740 G; 1435 T; 1 other; ttagggtgtg tcatactttg gtcaaaactc aaaaattcat atttaccagt ggcatttttt 60 aaaaaagggt gaggtttaac atatgaaatt cagaaatatc aaaaaatttg aatttaaatg 120 agtttaagct accagggaac tgccttgaaa attgccgaaa agtgtgaatt tttgatattt 180 ttatactttc agggtcattt tcaatatttt agacttatat ctgttttaca acaaccttat 240 cttaagaaat taggaataag gaaaggaaga taagaaagca ggataagtaa aattcattta 300 atagttatac tgtttcagaa atttttatga atacttaaaa attattattt tttcaaagtt 360 ttgtggcgtt aatttacatc tcttcccccc tatttttatt tttgattatg atagaaaatt 420 ttatgctgat taagaatcgt ataaaaacat agctctgaaa attgaagcaa agtactctaa 480 tttgatcttg aagtttaaaa attacactaa aaacgctatt attcaccatc ttttttgtac 540 tctgttatga ttaaaattca ttcaaaagta aactttgaaa ataacataaa caacctggtt 600 tttattgtca acaaaaaacg aaactgtttt ttagtagcta ctgatttaga gataggcaag 660 ggcttcaagg aagaataatg atataaattg atacctgccc caaccctgac ccttcgatgt 720 gttatcaact ctcttgcgag ttctccctaa atcagtcgat gtatgtactg aagccctcgc 780 ctatccctaa atcagtaact aagggcaaga gggcaaaagt taacaaacag aattttacgc 840 cactattctg caacattacc tttaaaagtt taattaattt tttaaatata aagaataatt 900 tatgtaaaaa aaaaaagttt attttttaat tataattaat aattgtattt ttttttaaac 960 tatgtaaatt ttagtatcga tgaaaataaa aaccgggttg cgtaattaaa aataagattt 1020 ataaaaaagt gtttagttgc atttaaaaaa gatagcgaat attagcgttt tttgtgtaat 1080 attttaaact tcaagatcta attagagtac tttgctttag ttttcagagc tatgttttta 1140 tacgattctt aatcagcata aagttctcat aatcaaacat aaaaataggg ggtagagatg 1200 taaattagtg ccagttttgt atgtaattta agttatacat tttaattgat ggttctaaat 1260 ggctacacag aaaacatcat tctaaagttt tttgttttaa tatttgttat gcaagttata 1320 catatatata tatatatata tatatatata tatatatata tatatatata tatatatata 1380 tatataattt tcaaaattac tagatacaaa gtataattaa atttaaaatg gctgctggaa 1440 caagatggar aaatgattct tgtctacgcc aatatttggg atctggaaaa gagttgccaa 1500 cagctgaact gccaacactg agagatgtcc taagatatgg tattttttta agagaaacaa 1560 gcgaactaaa cagaagaaac tatagtaatt ctgatttgat ccaggatatt tataataaac 1620 tttcagaaaa atggcaaatg gccagtgttt tatttgtgcc tcccgtaatt agtttaaaac 1680 ataatttggt aacaagactt aaagttgcat ggaacaaagc taacctgatt tctcaaaata 1740 gagcaagtaa atttataaaa caaaagttta atttaattat tgacaaacta tttgatttgg 1800 tggagtgtaa ctgtaaaata tctctttgtt ctgagcaacg ctgttctact gactgtaaag 1860 ttggtgccca cattacctgc atatgtccta gggagagaaa aataccaata aatgaactgg 1920 tttatataaa aggtatgagt gaatatatga aagtcactat aaaagcgtta gtaaagcaga 1980 aaaccaaatt ttattatact atactgattt ctttgattat tttaaaaaga aaattttatt 2040 ttaatttgtc tggttgtaag caaattgttg tcacattttt ttagcccaac gagaaaagat 2100 tggatcttta agttcatatc aacttgggcc tgctgatttg atggagtcca aaaagttcaa 2160 aaaagtacaa agttgcaaac gaagaagaga agaagaaaaa caaatgagag aaaaaaaaca 2220 cactaatcaa aaagaaaata atacgcaata tgaacaagat aacgaacatg aaaattatga 2280 agagaaatct aatgaaaatg agattttttc tattcaaaag ccttgtacaa aatatttaaa 2340 aaaatctaga aattatgaag atatctctaa tattgcattg gccagtattc gatatggtgt 2400 tggtttaaga gcaactgcag ccatagctac tgcagcatgg gtagatggag gacttgtatc 2460 ccaaagtgac actagattaa taatagatca cagtaaagtt cggagagcta gggataaagt 2520 catgaatgaa gttgctgttc agtttgatga ttattgtaat aaagagatta ttgactgcat 2580 attttttgat gggcgaaaag atcttactaa agtattttta actgtagatg gatcagacag 2640 acagtatccg gctaaagtaa aagaagaaca ttacactgtt tgttctggtg atggtaagta 2700 tttatttcat ttcactccag ctaaagaagc aaaaaagaac catgctgaaa ttatagcaga 2760 taagattatt gagtatactg caatgagaaa cattagtaca cacttaaaag caattggcgg 2820 agactctacc aatgtaaaca ctggttggga tggaggagta atgcacttta tggagctgaa 2880 attagggcga aagttaaatt ggattgtctg cgctctacat accaatgaat taccactcaa 2940 acgtttaatt aaattgttag acggagaatc taaaagcggt acaaaatgga gtggacccat 3000 tggaagctta cttgatgaag caaccaatct tgaaattaac ccttttttta gtagagtgga 3060 aactggtgag cctctaatta acttgagtaa tgatgtggtt aaagaccttt caacagatca 3120 gtattatggc tatcaaatag ttaaagctat acgcagtggt catgttcctc agcaactagg 3180 acttctggaa attggccctg ttaacatggc cagatggcta acaacagcca atagaatatg 3240 tagaatttgg gtatctcatc atggtcttca aggcgatgca gcccaaaacc ttcttaaact 3300 agtcgagttt attgtaggag tgtactttcc tgtctggttt aacatcaaag ttaaatatag 3360 ttggacagaa ggaccaaaac atgttcttta ccagcttaag ttgcttcaac accagtcact 3420 cgatattcaa aatatggtca ttccaactat taagcgatct gcttggtatg ccttctctga 3480 atctattctt caaactatgc tttgttctga aaataaagag gagagagatt tcagtgtaaa 3540 caaaattctt gaaataagag gagaaggtga tgaaaatgta caacttggag atctatctgt 3600 tagaccaaga aggacaccag atatcaatac taaagcaatt tgtctcaagg accttattga 3660 ttggaatgaa gcttttgaac cacctttgac ttgttgcatg acaactaaag acatcaaaga 3720 gtattttttt aaaccaatga ttgtaccaaa ctggtgttgc cacactcaag ctgtcgaaag 3780 atgtgtaaaa aaagttaccc aggcttgcga aaatgttttt acagaagata gaagagatgg 3840 ttggattaaa ggacaagagt tgtcacagaa gttgatgtca cgaaatctat ctaaacaaaa 3900 ccttattggc ttgacaatgt ttaagaatta atctttattg tactttattt taatgtgcca 3960 gtagatgttt cgtggatcta tatgaaagtt ttatttttaa ttccaaaaca acctaatttt 4020 acagttttag tgggaggtag tgggaggggg gcaaaccact ctagttgttt ttatacaaat 4080 aattatattt ttttttaaat agagatgacg tttttttgcg ttttgtaagt tttttttaaa 4140 aaatacttca attttttaag ttcatacctc cccctccccc agggtttttg gtaaaatagt 4200 gtcactggtt ccctggtagc taaaactttt tttttttttt tttttgttga cggttctaaa 4260 cttgtaatat tatacctcac ccttttttaa attatgcgag tcaaaattca aaaaaaaaaa 4320 agtatgacac accctaa 4337 // ID U2_sat repbase; DNA; INV; 696 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; U2_sat. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-696 RA Smit A.F.; RT "U2_sat - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC This sequence starts with a U2 snRNA. Ci000131/Ci000039 The U2 CC snRNA appears the specific target for insertions of the LINE CC element R2_Cis1. XX SQ Sequence 696 BP; 168 A; 187 C; 150 G; 191 T; 0 other; atcgcttctc ggctcttttg agctaagatc aagtgtagta tctgttcttc tcagtttaat 60 acctgggacg gaaacgattc gtttcctcta tgtattaatc ggatttttga acttggagta 120 tggtgctgga gcttgctcca ccctgctcac gggttgtcct ggtattgcag tacctccagg 180 ttcggctcac gtttccacct tggtggaaat cattaactaa tcatctgtgc acgtgctaaa 240 gtaaaggaga acaccctcag atgatcgaac gttccaagat ttcactaaac cttagcaccc 300 caccccccct aaccccaaga aaatcatcga aatttttttg gcgcagtttt tcacgctgaa 360 ccgaccgcct gtggagcaag gcaattaaca ggcgccttgc tagaccgccc gtggtgatgc 420 taagcaatta aaaacgcggc ggcaaacccc tgccgagtct gcactgtcga aacatttgaa 480 acaccccccc tgagatctcg ccgggtggga gtttttcgtt tccgtttgcc gtaccggatg 540 ccgtaccgta ctcagcatgg tgccagcacc atgctgtgct cacagggcag attgctcgac 600 cacaaaacga ttgaaaaccc tgcaaaaagt tgaaaatttt tggaataaaa aatttttttt 660 ttcgtcctct gatcttccgg aggacccctc ctcttc 696 // ID TTAA26B_AP repbase; DNA; INV; 574 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA26B_AP. XX NM TTAA26B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-574 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2094-2094 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 574 BP; 169 A; 95 C; 135 G; 174 T; 1 other; ggggatcggt ggcggacagt ttttaaggag ttcgaaatag gcaattttca aattgcatgc 60 atgcgggtct tgtaaatgtg tgcgtagtaa gtgatcatta gtgtgtgtga ctgtgtgtaa 120 aagaaatagc ggagcgttcc cgcacgcgca tgtacaagtc accgccxcgg cgttgcgcgc 180 gtgaacggta aacaatttat gctcatttgt atgtactttt gttccatcct accgatcgac 240 ctaataatgc gataattttt cagaaaactg atctgaattc gttataatgt ctacttgaga 300 tcggtcagtc cacattttaa gatttctgat ttaaattccg aataatcaac gtttgaaaat 360 tccgaaattt caaaatattc aaactaaatt tatagttgtt ggtggtgggg gagggggggg 420 tgaaattggg aaaagtggac tgaccgatct caagtagaca taataacgaa ttcaaatcag 480 ttttaaaaag tattatcgca ttactaggtc gatcagtatg atggaataaa gattggtatg 540 ttttggctaa aataactgtc cgccaccgat cccc 574 // ID Gypsy-3_PPc-LTR repbase; DNA; INV; 316 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PPc_; KW Gypsy-3_PPc-I; Gypsy-3_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-316 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 999-999 (2010). XX DR Genome; chrUn; Positions 65608227 65608542. XX SQ Sequence 316 BP; 79 A; 72 C; 62 G; 103 T; 0 other; tgttgtatac acaagaaatg gtgttactaa atagtcagtt gtcgattgac atatttattg 60 tgtgacttac cgtaactcct agtgcgccga gtgcaacttg cctacgtgcg cttgctcaca 120 catttagcta cacattttgc gtcattctac gcatctttcg caaggagctc gttttcctga 180 gctctcctag ttagttcacg tttaggttcc cgactgtgct ctcgtttaga attgttgcct 240 tatttattgt ggataaactg atagtttata ccacaataat cgaccaccgg atcacgacac 300 ggaaggatac acaaca 316 // ID BEL-97_AA-LTR repbase; DNA; INV; 222 BP. XX AC AAGE02034155; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-97_AA_; KW BEL-97_AA-I; BEL-97_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02034155; Positions 249 28. XX SQ Sequence 222 BP; 73 A; 36 C; 54 G; 59 T; 0 other; tgttccggca tcactggact acgagcaggc agtgaactcg ccgatgaatg cgaactggct 60 gtcaagccac cacgacatag cgtataggta gcgaagtatg aaacatggtg agatgatttt 120 ctgaaacact gaatgttaac gttgaattat taatttgaga aaaagattag tttgctgatc 180 gagagaagaa actttgtaaa agtaagttcc gtgttaatat ca 222 // ID SGALPHA repbase; DNA; INV; 359 BP. XX AC X52936; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Alpha repetitive DNA. XX KW Satellite; Simple Repeat; Alpha repetitive sequence; SGALPHA; KW Repetitive DNA. XX OS Schistocerca gregaria OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; OC Acridoidea; Acrididae; Cyrtacanthacridinae; Schistocerca. XX RN [1] RP 1-359 RA Brown T.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (01-MAY-1990). Brown T., RL UMIST, Dept of Biochemistry & Applied Molecular Biology, RL Manchester M60 IQD, UK. XX RN [2] RP 1-359 RA Brown A.T. and Barker S.; RT "Direct submission."; RL Unpublished. XX DR GenBank; X52936; Positions 1 359. XX SQ Sequence 359 BP; 106 A; 66 C; 84 G; 103 T; 0 other; aagcttctaa gttaaagcaa tagccagtaa gaagtagtac aattcaagtt ttagttgtcg 60 attattcact ttttagccgg ctttggtggc ttggacatga cctaggacct taagattttg 120 catgaaagga ggactagcag ccgcgtggca cgggagaatg tgaaaagtgt tcatatgtag 180 taaaaacctg agaaatccgt ccgttttcgt cattttttta tcgaatcgac ttaaagccag 240 atcgtcacat tggccgcttt tgtctctgag atggttacat ttatcgtctg caaagctata 300 tcgcgtgcag cgctatgggg agacactaaa agcaccatag tagaaaacag aagaagctt 359 // ID DNAX-3_AP repbase; DNA; INV; 150 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-3_AP. XX NM DNAX-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-150 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2055-2055 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD duplication unclear (it could be TA or TATA) CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 150 BP; 27 A; 44 C; 44 G; 35 T; 0 other; ctatactctg tcccacgaga gaacatgccg gttctgcgca tccccctacg tcaccctgct 60 gtcgaaactg gttcccctat gacagggtgt cgcgcggggg aaccagtttt ggtcggtgcg 120 cggcatgatc tctcgtggga cagagtatag 150 // ID Gypsy-26_DWil-I repbase; DNA; INV; 4613 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_DWil_; KW Gypsy-26_DWil-LTR; Gypsy-26_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4613 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 903584 908196. XX CC Positions [2003-2578] - Reverse transcriptase CC Positions [3629-4090] - Integrase core CC 'CTGAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 515..4531 FT /product="Gypsy-26_DWil-I_1p" FT /translation="MCLVAYSIECVTHKNKHIIFVYIFRYWNFLLKIMSEI FT KPFLCDTIDKSLLRHEWEKWLRSFTLYLDAEDITSIPKKKSKLLHLGGPQL FT QEIIYNIPGALVDYDAQENNDVYKVLVGKLDEYFAPLRNSSFERHLFRNLS FT PLEGETFNRYVIRLRQQLAKCSYGTTKAEIEDICLADKIIDSWASNDLKKR FT FLEKEQKLSDLIDACQVHEQVNKHSKTMLAEEVSDSVNKIFARKRFGRDPP FT KACTRCGLEDHDANNGRCPARDKVCKRCNLLGHFARQCRTRNSKRSSDTGG FT YSNPKRRKVYANVQSVEDQVKSDDDEDKELECFRIDGSARNDDNIRCNIGG FT VSIGMIIDSGSPANLLCEKHWKMLCNSNAVMWNMRSQTTDRFKAYATEDPL FT RVLNVFEAPISVKTNQEMIATFYVIEKGSQSLLGKKTAIQLKVLTLGLDVC FT RIETSLPFSKIKGRVIKLAIDSSVKPVHQPMRRVPVAFEDKVADKIKDACD FT RDIIEPVRGPSAWISPVVIAFKESGKIRLCVDMRMANRAIQRENYPLPTFE FT TFMTKARGAKWFSRLDLKDAYHQLELHESSREITTFVTQQGLFRYKRLLFG FT VNSAPEIFQRVMAEILAPCKNAHNYLDDVIIFGRTEAEHDKALQEVLNLFE FT INCVLLNKDKCVYKAKELKFLGHILSSDGIAADPEKVKLILDFRPPETKEE FT LRSFLGLVTYVGKFIADLSHITEPLRVLLKVNSKFVWSSEQEKAFVNLKNR FT LARIPELSYFDPNRRTQIIADASPVALGGVLLQFDERNESVIISFASKSLS FT EVERRYSQTEKESLALVWAVEKFFYYVAGLEFELVTDHKPLEAIFKPTSKP FT PARIERWLLRLQAFKFKVTYRPGKENIADSISRLCKAQSCDSFDKGGDYNI FT CQIVSCATPKAIAIPEIAKTSVVDREISAAVSHLKDDNWNEGLSSCYYPFR FT AELSTVGSILLRGTRIVIPEVLRQRVLELGHEGHPGESAMKRRMRAKVWWP FT RLDRDVEQHVKMCRSCLLVSQPTRPPPMKRHVFPDGPWQCLATDLLGPLPN FT NDHVLVLIDYFSRYQEIKFIGSISSQTIITAFKEIFSRLGIPRSLRTDNGR FT QFVSAQFKDFCEASGITLITTPPYWPQANGEVENMNRSLGKRLKIASLNKG FT NFKEELQKFILTYNTTPHGTTGSTPSELMFNRTIRDKLPDIRDIIEEVVDT FT SARDNDLINKQKGKLTGDKVRGAKESDIKVGDKVLLKNVIFPSKLTPNFET FT TEYKVVERNGNEVELFGNGRKIRRNVSHCKKIPSNEGPPNNSKEDALTQTD FT SQRLIHSSSYSVIPDQPQVIPSDPSSE" XX SQ Sequence 4613 BP; 1535 A; 819 C; 1097 G; 1162 T; 0 other; ttggcgacga ctgggagaac tcggtggaaa agagaaacgt taaactaagt tgagtagatt 60 gaattgtaat tcaattttta ttgttaagcg actgtggtga aatagcgcga gtcgtacaaa 120 tgtacgagga ggctacaaga aaaatgatac aaaggtatca ggatagcgcg aatcgtacaa 180 atgtacgagg aggctacatg aaaatgatac aaaggtatca agatagcgcg aatcatacaa 240 atgtatgagg aggctacaag aaaaaaattg atacaaaggt atcaaaatag cgcgaatcat 300 acaaatgtat gaggaggcta caagaaattg atacaaaggt atcagaatgt cgcgaatagt 360 acaaaggtac aaggaagcta caagaaaatt gatacaaagg tatcaaaaga aaaattagca 420 cgaatcgtac aaatgtacga gaaggacaca ggaagatgat acaaatgtat caaagtagtg 480 gtacaaatgt gccaggagag cacgatggac gcaaatgtgt ctagtggcat attcaattga 540 atgtgttaca cataagaata aacatataat ttttgtttat atttttaggt attggaactt 600 tcttttgaaa atcatgtcgg aaataaaacc gttcctttgc gacacgattg acaaatcgct 660 actccggcat gagtgggaaa aatggctaag gtcttttaca ttatatcttg acgcggaaga 720 tatcacatca atacccaaga aaaagagcaa gcttttacat ttgggtggcc cacaactgca 780 agagataatt tacaacattc ccggagcgtt agttgattat gatgctcaag aaaacaatga 840 tgtctacaag gtcttagtgg gcaaactgga cgaatatttt gcaccgttaa ggaattcatc 900 gttcgagaga caccttttta ggaatttgtc accattggaa ggcgagacat tcaacagata 960 tgtaattcgt ctgagacaac aacttgcaaa gtgttcatat ggcacaacaa aggcagaaat 1020 tgaagatatt tgtcttgccg ataaaataat tgactcatgg gcttcaaatg acttgaagaa 1080 aagattttta gagaaagagc aaaagctgag tgatctaata gatgcttgtc aggtccatga 1140 gcaagtaaat aagcactcga aaactatgct ggctgaagag gtaagcgaca gcgtcaacaa 1200 aattttcgca aggaaacgtt tcgggcgaga tccacctaag gcttgcacca gatgtgggct 1260 tgaggatcat gatgctaata atggaagatg tccagcgcgg gataaagttt gcaaacgctg 1320 caatttgctt ggtcattttg ccaggcaatg tagaactcgg aactcaaaga ggtcttcgga 1380 cacgggagga tactccaatc caaagaggcg caaggtatat gcaaatgttc aaagcgtcga 1440 agatcaagtg aaaagcgatg atgatgagga caaagagttg gagtgtttta gaattgatgg 1500 ttcggcaaga aatgatgaca acattagatg taacattgga ggtgtgtcaa tcggaatgat 1560 tattgattca ggatcacctg cgaatttatt atgtgagaag cattggaaaa tgttgtgtaa 1620 tagcaacgct gttatgtgga atatgagatc ccagacgact gacagattta aagcatacgc 1680 gactgaagat ccgctcaggg ttcttaacgt ttttgaagca ccaataagtg tgaaaacgaa 1740 ccaagaaatg attgctacat tttacgtaat agaaaagggt agtcaatcgc ttctggggaa 1800 gaaaacggcc attcaactca aggtcctaac gttagggtta gatgtttgcc gtattgagac 1860 ttcattgccg ttttcaaaaa taaaaggcag agtgataaaa ttggccattg attccagcgt 1920 gaaaccagtg caccaaccta tgagacgggt tccagtggca tttgaagata aagtggcaga 1980 taagataaaa gacgcttgtg atcgtgacat aatagaacca gtaaggggtc cgagtgcatg 2040 gatctcgccg gtggtaattg cgttcaagga aagcggcaaa ataaggctat gcgttgatat 2100 gaggatggca aacagggcta tccaacggga aaattatcca ctaccaacct ttgaaacttt 2160 tatgacaaaa gccaggggtg cgaaatggtt ttcgagactg gatctcaagg atgcctatca 2220 tcagcttgaa cttcatgaat ctagccgcga aataacaact ttcgtgacac agcaagggct 2280 gtttcgatat aaacgtcttc tttttggcgt taattcggca ccagagattt ttcaacgggt 2340 gatggcggag atcttggcac catgcaaaaa tgcacataac tatttggacg atgtaataat 2400 ttttggaaga actgaagctg agcacgataa agcgttacaa gaagtgttga atctgtttga 2460 gatcaattgt gttttgttaa acaaagataa atgcgtgtat aaggcaaaag agcttaaatt 2520 tctgggacat atcttatcga gtgacggcat tgcggcagat ccagagaagg tgaaattgat 2580 tttggatttt cgaccacctg aaacaaagga ggaactgaga agctttctgg gactggttac 2640 atacgttgga aaattcatcg ccgatttgtc gcatattaca gagccattaa gagtactact 2700 taaagtcaat agcaaattcg tgtggtcgag cgaacaagaa aaagcgtttg ttaacttgaa 2760 aaatcggcta gctcgtatcc cggaactttc atattttgat ccaaatagac gaacacagat 2820 aattgcagac gcaagtccgg tagcacttgg tggtgttttg ttacaatttg atgagagaaa 2880 tgaatcggtt atcattagct tcgctagtaa gagcttatcg gaagtggaaa ggcgctactc 2940 gcaaaccgaa aaagaaagtc tggcgctagt ttgggctgtt gaaaaatttt tctattacgt 3000 tgcgggtttg gagtttgaac tcgtcactga tcataaacca ctggaggcga tcttcaaacc 3060 aacatcgaaa cccccggcga gaatagaacg ttggcttttg agactacagg cttttaaatt 3120 caaggttaca tatcgccctg gaaaggagaa cattgccgac tcaatatcaa gactttgtaa 3180 ggctcagtca tgcgattctt tcgataaagg aggtgactat aacatctgtc aaattgtttc 3240 ttgtgctaca ccaaaagcaa ttgctatacc agaaatagca aaaacaagcg ttgtggaccg 3300 agagatatca gctgctgttt cacatcttaa agacgataac tggaatgaag gtttatcaag 3360 ctgttattac ccgtttagag ctgagctatc aacagtgggt agcattctgt tgcgtggtac 3420 ccgtatagtg ataccagaag ttctacggca gagagtttta gagttagggc acgaaggaca 3480 ccctggcgaa tcggctatga agcgacgaat gagagcgaag gtgtggtggc cacgtttgga 3540 cagagatgtc gaacagcatg tgaaaatgtg cagaagctgt ttattggtgt cacaaccaac 3600 ccgtccacca ccgatgaagc ggcacgtttt tcctgacggc ccgtggcagt gcttagcaac 3660 cgacctgttg gggcctctac cgaacaatga ccatgtattg gtgctgatag attatttttc 3720 gcgttatcag gagataaaat tcatcggctc aatatcatcg caaacaatta ttacggcatt 3780 taaggaaata ttttccaggc ttggcatacc gagatcgtta aggacagata acggccggca 3840 gttcgttagc gcccaattca aggacttctg tgaagctagt ggcatcactt tgattacgac 3900 tcctccatat tggccgcaag ccaacggtga agtagaaaac atgaatcgtt cattgggcaa 3960 acgccttaaa attgcgtcgt taaataaggg aaattttaaa gaggagctac aaaaatttat 4020 attaacctac aatacgacac ctcatggaac tactggctct acaccttcgg aacttatgtt 4080 taataggacc attcgagaca agctgccgga cattcgagat attatagaag aagtagtaga 4140 cacctcggca agagataatg acctgatcaa taaacaaaaa ggaaagctta caggtgacaa 4200 ggttagaggt gctaaggaat ccgacataaa ggtgggcgac aaagttttac taaagaacgt 4260 tatttttccc tccaaattga cacccaactt tgaaacaacg gagtataaag ttgttgaaag 4320 gaatggaaat gaagtggagc tttttggaaa tgggaggaaa atcagacgga acgtgagcca 4380 ttgcaagaag ataccgtcga atgaggggcc gcctaataac tcgaaggagg acgcactaac 4440 gcagactgat tctcaacgct taatacactc atcatcatat tcagtgattc ctgatcaacc 4500 tcaagtgata ccctccgatc catcttcaga gtaaatgtaa ctttcgactc gacgttcaac 4560 tccagagaaa ccgaagcccg ggttaaaact gaagcttata agaaaaggag agg 4613 // ID Gypsy-225_AA-I repbase; DNA; INV; 2138 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-225_AA_; KW Gypsy-225_AA-LTR; Gypsy-225_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2138 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1053-1053 (2011). XX DR [2] (Consensus) XX CC 'TACA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 420..2138 FT /product="Gypsy-225_AA-I_1p" FT /translation="MQDGNISSVQPGSGEASLSPNPSQNSSSRNGLSLEEV FT ENIIVRLINQLNFSNATVKNLMQECNAMRSEIVSMRAETSRLNELVEKTRL FT SSLLPQQTRAPNGDSPFRCDRDSQTRTNATKSGTIINDASMCVRVDNKLSD FT DDGRVQVDAWIRNSGHEIVNGEANNYLLGGCPNNNIIRETNIKNLIDGTCE FT KDFVGDDLHSIPRVSESYCGFEAYGGLRGDCHRTTHAHNEIRVVESLLSVS FT EAEGAFSEFCGTEFYPVRKWIAEFEELSDSLGLSEFRKFVIAKRKLSGLAK FT LSLNTTNKVFNWRSLKDFLIAEFEYSENPAIVHGRLRNRKRHMGENVLAYF FT LQMREIGAKANVDISSIITYTINGINDNGPDKILLYGCKSIEEFKEKLRLY FT QGIKDSKYAKNVKLFNSNSEKKMFSRKPKGYDNIVKCFNCGIDGHMSSTCS FT KKLGRDYINLCRSEPNHLTYSKNTKKYLGPCGVKTRINKANNSKLDVIMTE FT VVEKSNGHQPEPIPKRSWRQHQDTMIIRRTDTTASKRDEIDRKQMNECRRS FT KRGESRLRVGQSDGTIRAGWWSGWPI" XX SQ Sequence 2138 BP; 713 A; 342 C; 493 G; 590 T; 0 other; aatttggggg ctcgtccggg aacgcgggaa gaaacttatg ggatattcgc cagtgcgcgc 60 cattgggcga acgaagaatc aacgtgaaac attgaaataa caggaaagaa atttgtagct 120 tatgtgaact gaattttagc tgttaagtga tttgaatttt taatgttaaa gtagtgttaa 180 gaccttgaat tcaaattatt agtgtaagca aagtgatttt tttttttaaa ctttttttta 240 aaccattcaa tcttcgatga cattgtacag tgaaaagaac ataattgcaa tctttttcgc 300 aaagtgtaca tcactcattt gacctggggt taagggtcat tgtacggtgc ttaaaacaaa 360 caacaaaaca agaagacagc gagtgttcga attcatcgtg gcaacttgat tacagtgcaa 420 tgcaggacgg gaacatttcg tcagttcagc ctggtagtgg cgaagcttct ctttctccta 480 acccatcgca aaattcttct tcaaggaatg ggctttcact ggaagaggtt gaaaatatta 540 ttgtgcgttt aataaaccag ttaaactttt ctaatgcgac tgttaaaaat ttaatgcaag 600 agtgtaatgc gatgcgtagt gaaattgttt ccatgcgggc agaaacatcc cgattgaatg 660 aattggttga aaaaactcgc ttaagttcct tgttgcctca acagacgcgt gctccaaatg 720 gagatagtcc ttttaggtgc gatagagaca gtcaaacaag aacaaatgca acaaaaagcg 780 gaacgattat caatgacgca tcgatgtgtg ttagggtgga caataaattg agtgacgacg 840 atggtcgtgt tcaggttgat gcatggatta gaaactccgg tcacgaaatt gtgaatggtg 900 aagccaataa ttacttgttg ggtggttgcc cgaataataa tataattaga gaaacaaata 960 taaaaaatct gattgatggt acatgtgaaa aagattttgt tggtgatgac ctacatagta 1020 tacctagagt gagtgagagc tattgtggct ttgaggcgta cggcggtttg cgtggtgact 1080 gccacaggac aacccatgcg cataacgaaa tcagagtagt ggaatcgctg ctatcggtaa 1140 gtgaagctga gggagctttt tctgaatttt gtggaacaga attttatcct gtgcgaaaat 1200 ggatagctga gttcgaagaa ctatcagatt ctcttgggtt gtcagaattt cgtaagttcg 1260 ttattgcaaa gcgcaaattg tctgggttgg ctaaattatc gttgaatact accaacaagg 1320 tctttaactg gagaagcttg aaagactttt taatagcaga gtttgaatat tcagagaacc 1380 ctgctattgt tcacggaagg ttgcgaaatc gcaaaagaca tatgggcgaa aacgttttag 1440 cgtatttttt gcaaatgcga gagattgggg ccaaggccaa tgtagacatt tcatcaatta 1500 tcacttatac aatcaatgga attaacgaca atggtcctga taaaatttta ttgtatggtt 1560 gtaaatcgat tgaagagttt aaagaaaaac ttcgtttata ccaaggcata aaagatagca 1620 aatatgcgaa aaatgtaaag ttattcaatt caaattcgga aaaaaagatg tttagtagga 1680 agccaaaggg ctacgacaat attgtaaaat gtttcaattg tggaatagat ggtcatatgt 1740 caagcacatg ctcgaaaaag cttggacgtg attatattaa tctttgccgc tctgaaccca 1800 atcatttaac atattcgaaa aacacaaaaa aatatctcgg accttgtggt gtcaaaacta 1860 ggattaacaa ggcaaataac tcgaaattag acgttattat gactgaagtc gtagagaaat 1920 ctaatggaca tcaaccagaa ccgattccaa aacgaagttg gcgtcagcat caggacacga 1980 tgatcataag aaggactgac acaacagcat cgaaacgaga cgagatagat cgaaagcaaa 2040 tgaatgaatg caggagatcc aagcgaggcg aatcaagact acgtgtgggg caatcagatg 2100 gcaccatccg agccggatgg tggtcaggat ggccgatt 2138 // ID Jockey-N5_CQ repbase; DNA; INV; 1793 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1793 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 588-588 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. This family encodes a protein similar to Jockey ORF1p CC but does not encode ORF2p. Thus it is a non-autonomous non-LTR CC retrotransposon derived from Jockey, like HeT-A. XX FH Key Location/Qualifiers FT CDS 115..1566 FT /product="Jockey-N5_CQ_1p" FT /translation="MPKRGARHGNSSAAPGTSRNPSPGRITKNNPANSQIK FT PSTAATALEGIDKTKLHPHYRSPYQLRVKTAKAGRQASGVPSVTPVPTRNE FT FEPLSGDDEDNEDDCSTSSSDDDDDPARGSKKAETSKTAAKERRPPPIFVV FT DTLADHVDELLEGQTYCLKIGKKNVQVITLLRANYDKVLMVLKTHNVKYFT FT FDPAETVPVKIVLQGYTDRPVEDLAGHLADVNVHPRDIKVLSRTTTVTGTH FT VLYLLYFDRGTVKLQDLRRKKTLDGFWVTWRYFSKNSTDAAQCHRCQKFGH FT GSRNCNLPPRCVKCGEMHFTEKCKLPQKARLSESDAQQHRSRVKCANCDGN FT HTANFRGCAARKAYLEEQAKKKKKLPATRPPPRLSPTVPVAGQGAADRITP FT AIPPGWGGSYASVTAASGTVPEQAAVSGDDLFTLPEFFALAGEMLTRFRAC FT RNKAEQFMALGELMIKYVIQRISCLVQRCSSASTDKIIL" XX SQ Sequence 1793 BP; 478 A; 476 C; 455 G; 383 T; 1 other; atcatcgggc tacctacgtt ggagccagcc acaacacaat tttagtgctc cgttaaattt 60 tcgatttttt cacgccgcgg gtgaagtgtt taatcgcgaa cttgccgccc ggagatgccc 120 aagcgtggcg ctaggcatgg caactcgagt gcagcaccgg ggacctcgcg gaacccgtct 180 cctggccgga ttactaaaaa caatccggcc aatagtcaaa tcaagccttc caccgcggcg 240 acagcgttgg aaggcattga caaaacaaag ctgcatccac actatcgcag cccgtaccag 300 ctacgtgtga agacagctaa ggcaggccgg caggcctccg gcgtcccctc cgtcaccccc 360 gtccccacac gcaacgagtt cgagccgctg agtggtgacg atgaggacaa tgaagacgac 420 tgcagcacca gcagtagcga cgacgacgac gaccctgcac gcgggtcgaa gaaagcagaa 480 accagcaaaa ctgcggcgaa ggaaagaaga cctccgccga tatttgtcgt cgacacacta 540 gccgatcatg tggacgagct actggagggt caaacctact gtttgaagat cggtaagaag 600 aatgtgcaag tgatcacgtt gttaagggcg aattacgata aagtgttgat ggtcctcaaa 660 acacacaacg tgaaatactt cacattcgat ccggctgaaa cggtgccggt taaaatagtc 720 ctgcaagggt acacggaccg gccagtggag gacctggctg ggcaccttgc cgacgtwaac 780 gttcaccccc gggacatcaa ggtgctctca agaacgacta cggtcacggg tactcacgtg 840 ctgtacctac tctacttcga tcgtggtacc gtcaagctcc aggatctccg acggaaaaaa 900 acgttggacg gtttctgggt gacttggagg tatttctcaa agaattcgac agacgcagcc 960 caatgccacc gttgccaaaa attcggccac ggctcaagaa attgcaacct cccgccgcgc 1020 tgcgttaagt gcggtgaaat gcatttcacg gagaagtgca aacttcccca gaaagcgagg 1080 ctgagtgaaa gcgacgctca gcagcaccgg tcgcgtgtca agtgcgcgaa ctgcgatggc 1140 aaccatacag cgaacttccg cggctgtgcc gcgcgtaaag catacctcga ggagcaggct 1200 aagaagaaaa agaagcttcc agcgacccgg ccgcctccaa gattaagccc aaccgttcct 1260 gtagctggac aaggcgcggc agatcggatt actccagcga ttcctcccgg ctggggaggt 1320 tcctacgcaa gcgtgaccgc ggcgagcggt acagttccgg aacaagcagc tgtgtccgga 1380 gatgacctat tcactctgcc agagtttttc gccctcgcgg gggaaatgtt aactcgtttt 1440 cgtgcctgcc ggaacaaggc agaacagttc atggctctgg gagagctcat gatcaaatac 1500 gttatacaac ggataagctg cctcgtgcag cgctgctctt cggcgtcaac agacaaaatc 1560 attctgtgat ctagctttaa gttttcctct aactatcccc tttctatagc aattttgtag 1620 gttttttttc ttgaactttt tcctacttgt gattagcctt tagaataacc aattgtgtcg 1680 aaagacaaat ttatcaacac atagctgaaa ggacctccaa atctctatta ggtttaattg 1740 caaagaaaat tgtgaattga ttatttactt aataaaaact gaattgaatt gaa 1793 // ID Copia-19_AA-I repbase; DNA; INV; 2676 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_AA_; KW Copia-19_AA-LTR; Copia-19_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2676 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 947-947 (2011). XX DR [2] (Consensus) XX CC Positions [1494-2021] - Integrase core CC 'ACCAC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 141..2630 FT /product="Copia-19_AA-I_1p" FT /translation="MADLSKLSFVRLNNENWQSWSFRMRMLLIREGLWRVV FT SAEQPEDVDDDDWQRDDERAVATIGLCLEDSQFSIIKKKKSAKDVWESLKT FT YHEKPNMTSRVSLLKRLCSINLAEGGDMEKHLVVLDELFERLDNVGQQLDD FT TLKVAMILRSLPDSFDNLVMALESRADADITLDFVKSKLSDAYQRRVDRSV FT DGMVQEKAMKMHTKRDKDRVCFFCKKPGHVKKFCRKFLATQQSDSDSDSNS FT KQPLQKAKQAQKSSEPLCFVAGQAVKNAWILDSGASCHMTSDKSFFTSLVE FT RAGADVVLANGKKTKTAGSGEGLIQGVDSDGKPVKIRLLEVLFFPGLDGNL FT LSVGKMASKGFVVHFTDVGCDIVNSSGCVVVVGDRFGALYKLKSVEQAMQS FT VGNCHTKMCHDLWHRRFGHRDANAVNKVIKEKLGSGMIVTECRRDEVCDTC FT LEGKLARKPFPSVVERKAKQILDLVHTDLCGPFNTATPSGNRYFLSLIDDY FT SRFTVIYLLKEKSEAKEKVKEFVHFCKNYFGRKPKVIRADGGGEYVNKDLC FT NFYAKEGIVAQFTTAYSPQSNGVAERKNRSLQEMASCMLIDAGMEKKYWGE FT AIRTAAFLQNRLPSRVIEGTPYEKFYGSKPKLDRLRVFGCDAYVHIPDVKR FT RKLDPRSRKLTFVGYAEDRKGYRFLDRATDTITVSRDATFIEWKNGSTQIE FT IIHTRNDSKREPQRKHGLPEAEVSSDDEYSDANVDGDFQGLDDAAVQEEQV FT NGNSKKQLPYRKTRNMMPTKLKDYVVGIAMCAAKESTDNREVKKQVNDGVT FT VTKPGISKTVLDAVMENHEGTGRFKKKW" XX SQ Sequence 2676 BP; 752 A; 484 C; 788 G; 651 T; 1 other; ggttatggtg cccagtggat gtttggaaaa gcgatcgttc gttttccaaa gtggtagagt 60 tttsgtcgga agatttttcg gtgtggttga gcgttccggg tagtgctttg tgacatcggt 120 gcaggaggag aagcaacaaa atggcggatc tctcgaagct ctcgtttgtg cggctgaata 180 atgaaaattg gcagtcgtgg agttttcgaa tgcgtatgtt gctaattcgc gaaggactgt 240 ggcgtgtggt cagtgcagag cagccggaag atgtggatga tgatgactgg cagcgagatg 300 atgagcgagc cgttgcaacg atcggattat gcttggaaga ctcgcagttc agtatcatca 360 agaagaagaa aagtgcgaaa gatgtgtggg agtcattgaa gacataccac gaaaaaccga 420 atatgacgtc gcgagtgtcc cttttgaaaa ggttgtgttc gataaatcta gccgaggggg 480 gtgatatgga aaaacatctt gttgtgctag atgagctgtt tgagcgcctc gataatgtag 540 gccagcagct cgatgatacg ctcaaagtgg caatgatttt gagaagtttg ccggacagtt 600 tcgataatct cgtgatggct ttagaaagcc gggccgacgc ggatattacg cttgattttg 660 tgaagtcgaa gttgtcggat gcgtaccagc gtcgcgtcga tcgctcggtt gatggaatgg 720 tgcaagaaaa agcgatgaaa atgcacacaa agcgtgacaa ggacagagtg tgtttctttt 780 gtaagaaacc gggacacgtg aagaaatttt gtcgcaagtt tttggcaaca cagcagagtg 840 atagtgattc ggattcgaac agtaaacagc cactgcagaa agcaaagcaa gcgcaaaagt 900 cgagtgaacc gttatgtttt gtcgcggggc aagctgtgaa aaatgcctgg attctggaca 960 gtggtgcgtc atgtcatatg accagtgata aaagtttttt cacctcgctg gtggaacgtg 1020 ccggagcgga tgtagtgcta gctaacggga agaaaacaaa aaccgctggc agtggagaag 1080 gattaattca aggtgtcgac agtgatggta agccggttaa aatacggttg ctcgaagtgc 1140 ttttttttcc gggactggat ggtaatttgc tgtctgttgg aaaaatggca agtaaaggct 1200 tcgtagtgca cttcacggat gtcggttgcg atattgtgaa cagttcgggt tgcgtcgttg 1260 tcgtgggtga ccgtttcggt gcactgtata aactgaaatc agtggaacag gcaatgcagt 1320 cggtcggtaa ttgtcatacg aagatgtgtc atgacttgtg gcaccggcgt ttcggccaca 1380 gagatgcaaa tgctgtgaac aaagtaatta aagaaaaact tggaagtggt atgatcgtaa 1440 cggagtgtcg aagagatgag gtgtgtgata cctgcctcga agggaaattg gcgcgcaagc 1500 cgttcccgag tgtagtggaa agaaaggcga agcagatttt ggatctcgtt cacacagact 1560 tgtgtgggcc tttcaacacc gccacgccga gtggcaatcg ttatttcctg agcctaatcg 1620 atgactacag tcgatttact gttatctatt tgctgaaaga gaagtcggag gccaaggaga 1680 aagtgaagga atttgtccac ttctgcaaaa actatttcgg ccggaaaccg aaggtcatca 1740 gagctgatgg aggcggtgag tacgtaaata aagatctctg taatttttat gccaaggaag 1800 gcattgttgc tcagtttacc accgcgtatt caccccaatc caatggagta gcagaacgca 1860 agaataggtc actccaggag atggcctctt gtatgctcat agatgctgga atggagaaaa 1920 agtattgggg tgaggctatt cgcacagcag cgttcttgca gaacaggttg ccgtctcgtg 1980 tgattgaagg tacaccgtac gaaaagttct atggttcgaa accaaagttg gatcggctac 2040 gtgtgtttgg ttgtgatgcc tacgtccaca ttccagacgt gaagcggcgc aagctggatc 2100 ccagatcgag aaagctaact tttgtgggat acgctgaaga cagaaaagga tatcggtttt 2160 tggatcgcgc cacagatacg atcactgtaa gcagagatgc aaccttcatc gagtggaaga 2220 atggttctac tcaaattgag atcatccata cccgtaatga ttcgaaacga gagcctcagc 2280 gtaagcacgg attgccggaa gcagaagttt cttcggatga cgagtattct gatgctaacg 2340 tggatggaga tttccaagga ttagacgatg cagcagttca ggaggagcaa gtgaatggaa 2400 attctaagaa gcagttacca tatcggaaga cgcggaacat gatgcctacc aagctaaagg 2460 attacgtggt cggcattgcc atgtgcgccg ccaaagagtc aactgataat cgcgaagtta 2520 agaagcaggt aaacgatggt gtgaccgtta cgaaaccggg gatatccaag acagtgctgg 2580 atgcagtgat ggaaaatcac gaaggcaccg gcagattcaa gaagaagtgg taaagaggaa 2640 gccgtaccaa actgtgagtt ggagattgag gaggag 2676 // ID BEL-27_CQ-LTR repbase; DNA; INV; 393 BP. XX AC AAWU01040532; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-27_CQ_; KW BEL-27_CQ-I; BEL-27_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-393 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 208-208 (2011). XX DR Genome; AAWU01040532; Positions 20537 20929. XX SQ Sequence 393 BP; 102 A; 79 C; 107 G; 105 T; 0 other; tgtttagaaa ttaagtttat ttgtgcattg cgccgtttac cctatgaatt agatcacatt 60 gaagatgaac cgtgagagaa gagagtgagt gagagagaga gaaagaggga ccgtacggga 120 gtgcccgacg ggtcagtttg tttatcattt gcgaataaag tttaagttga ttgccgatac 180 tcgaccggct cggttcttaa gccaccccgc gtacttgtac cgttagttcg tcgcgcaata 240 aagttgtttc gcacaacaac ttgtgttttt attcggccta cgggtgaaga aggtttggtg 300 cgaggtcgtt gacctcttga gaagtgcgct ctggacgagc agaagaactc ccccgcaata 360 cagtccacaa tttgaagggc tcgatccgga aca 393 // ID hAT-18_HM repbase; DNA; INV; 3376 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3376 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2007-2007 (2008). XX DR [1] (Consensus) XX CC Average divergence from consensus >7%. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 657..2942 FT /product="hAT-18_HM_1p" FT /translation="MSKQLSIFSMLKKDGTEERSIKRKGEEERETKHAKKK FT CNDLDCAMTLGKLDIWPKDIEFMFWNTVNGLPPSNKKHGQQLSWIHIDKSN FT ECFSCWVCQKYPNISNATNSVTMGCKFWKRTYLLRHNKDDGHKACISKYRE FT NLHNIAPSLEKTVSTSILRTTENNQEELNRLFRISYFVVKEDLAFKKFPQL FT CELQKLNGIKLGKNYLSDKSCKNFVSSISSDLKRELSLDVSNARFIAILSD FT GSTDKSISEQETVFLRFVKNGNPITIFAGINSPLTVDADGILNAIEFVLKS FT LTTDKMNQDLYLNNIYAKTVNVNFDGASVMSGNKAGVQTKMRNKNPSIVYT FT HCVAHRLELSVLDSIKFDIYLKKFDDNINSLFRFYYYSSQRRKELKEIASF FT LDEEFKQLGRLKNIRWLASRERALRLLETDYKVIIYDLESKANEKDETAQK FT AKGFLCFMKDPQFLFYLHFIQDIVKSLTTLSLEFQKEEILICEILRKVDAR FT IAVLDALSVNPGSNIGRLLDNLEVKEDQLIYKGIDLTALKKRNLKRIDELV FT RQQPNDYFDYYNKKYFIKIIDGAKEYIDKRFRRDFNQEPLKSLTTIFDCRN FT WPKSFTGSLEIKKWGFNEIKSVCNFYENNNMISRETSNLCVLQWPLFREKV FT SKLRTNSLLSIYKDILLENEPELASINVLLEIMLTFSASTASCERSFSCMN FT LQKNILRTSLNDDTLEDIMRIKLDGQLIEDFLPQKHVKNWIDAAVGIRHLE FT GHKLKSSSDF*" XX SQ Sequence 3376 BP; 1267 A; 465 C; 539 G; 1104 T; 1 other; cagggcgcga aaaatcgacg aacgtgcgaa cgtctgcaca aaaaaagaca gaaaaacgaa 60 cgtcacgttc ctttactttt cctaacgcaa acgttcgcct aaaatttttc aaaatgaacg 120 cgttactata gcaatttttt tttccaaaca attaactatt gatcgttttt tatttattat 180 aaaaaacaca aaaaaacaac taattacgaa aatttcaaac gcattttcaa agactttaaa 240 agaaaccacg ccgaagctat tttcttcaat ggaacgaaga aagtaattag tttgagcatg 300 gtttttgcat gcgtttaaaa aagcatttaa atttttattt aaaatgactg catttgcaat 360 tactttaaaa aaaaccatgc ttaaataacg attgcttttt ttaatagcgt gctttttttt 420 ttttcttatt ttctttaaag tgaagtaatg ttagtttgca ttttaaaact tataaaagtt 480 ttaaaataat gttcaagttt tgctaaaaaa tacataatct aagaatatat aaagttatta 540 tgtttgttaa aattttaatt tttttattgt gcgtttaaca acaccaataa tatgatattt 600 ttattaactt attattaaat acttaatata ctattatttt taaagttttt aaaaaaatgt 660 caaaacaact aagcatcttt tcgatgttaa agaaagatgg tacagaggaa cgtagcatta 720 aacgtaaagg agaagaagag agagaaacta aacatgcgaa aaaaaagtgc aatgacttgg 780 attgtgctat gactttggga aaattggata tatggccaaa agatattgaa tttatgtttt 840 ggaacaccgt aaatggattg ccgccaagca ataaaaaaca cgggcaacaa ctaagctgga 900 ttcatattga caaatcaaat gagtgtttct cttgctgggt ttgccaaaag tatccaaata 960 ttagcaatgc aaccaattca gtaacaatgg gatgtaaatt ttggaaaaga acatatttgt 1020 tacgccacaa taaagatgat gggcataaag cctgtatttc taaatatcga gagaaccttc 1080 acaatatagc tccaagtctt gagaaaactg tatcgacatc aattttaaga actactgaaa 1140 ataaccaaga agaattgaac agactttttc ggatatcgta ttttgttgtt aaagaggatc 1200 ttgcatttaa aaaatttcct cagctgtgcg agctacaaaa gctaaatgga attaagttgg 1260 gcaagaacta cttaagtgac aaatcttgca aaaactttgt ttcgtcaatt tcttctgact 1320 taaaacgcga attaagttta gacgtwtcaa atgctcgttt tattgcaatt ttatctgacg 1380 gttctactga taagagtatt agtgaacaag aaacagtttt tttaaggttt gttaaaaatg 1440 gtaatccaat aactatattt gctggtatta actcaccact gacagttgat gccgatggta 1500 tattaaatgc aatagaattt gttttaaaaa gtttaacaac agacaaaatg aatcaagatt 1560 tatatttaaa caatatctat gcaaaaactg taaatgttaa ctttgatggt gcatctgtaa 1620 tgtctggcaa taaagctggt gttcaaacta aaatgcgaaa taaaaacccc agtatagtat 1680 atacccactg tgtggcgcat agattagaac tttctgtgtt agactcaata aagtttgata 1740 tttatttaaa gaagtttgat gacaacatca atagtttgtt tcgtttttac tactattcat 1800 cgcagcgaag aaaggaactg aaagaaatag cttctttttt agatgaggag tttaaacagc 1860 ttgggagatt gaaaaatatt cgttggttag ctagccggga aagagcgtta aggttgcttg 1920 aaactgacta taaagttatt atttatgatt tagaaagcaa ggcaaacgaa aaagacgaaa 1980 ctgcacagaa agctaaagga tttttgtgtt tcatgaaaga cccacaattt ttattttatt 2040 tacattttat tcaagatatt gttaaatcac taacaacatt atcattggag tttcaaaaag 2100 aagaaatttt aatttgtgag attttgcgta aagttgatgc aagaattgct gtactagacg 2160 cgttatcagt taatccaggc tcgaatattg gacgtttact agataattta gaagtaaaag 2220 aagatcagtt aatctacaaa ggtattgatt taacagcatt aaaaaagcgt aatttaaaga 2280 gaattgatga gttagttaga caacaaccaa atgattattt tgattattat aataaaaaat 2340 attttattaa aattattgac ggagcaaaag aatatataga caagcgattc cgccgcgatt 2400 tcaatcaaga accattaaaa tctttaacca caatctttga ttgtcgaaac tggccaaaat 2460 cctttacagg ttcgctagag ataaaaaaat ggggatttaa tgaaataaaa agtgtttgca 2520 acttttatga gaacaataat atgataagca gagaaacttc taacttatgt gtattgcagt 2580 ggcctttatt cagggagaaa gtctctaaac taaggacaaa ttcactttta agcatttata 2640 aagatatctt gctggaaaat gaacctgaat tggcaagtat aaatgtcttg ttagagatta 2700 tgctaacttt cagcgcttca acggcaagtt gcgaacgcag tttttcttgt atgaacttac 2760 aaaaaaatat tttgcgaact tcattaaatg acgacacctt agaagatatt atgcgtataa 2820 agcttgatgg ccagttaatt gaagactttt tgccccaaaa acatgtcaaa aattggattg 2880 acgcagcggt tggtattaga catttggaag gccataaatt aaaaagttct agcgattttt 2940 aaatccattt atataaatat aaaaattaat atataaatat taatgagacg tcattttgcg 3000 atttatttct ttagttcctt ttattttaat ttcttcacgt ttatacgtat attttatgag 3060 taaataaatt taacaaacta aattaaaaaa aaaaaagttt tgcatcgttg aataacagtt 3120 tacaatcact tgcatattaa aaatcaataa tttgcgcgca ctttgcaaat gcgtgaaaaa 3180 ttgtttcgaa aatatgacaa ttgattaaat tcaataacca tctttcatat atccaaaagc 3240 ttattaacta gaataaaaat ttatttgaat ataaaactcc cttatttgac acgtatcact 3300 taaattagac gttcgtcaat aatttaagat acgaacgtca acgtccgtga aaaaaaaaac 3360 gatttgtcgc gccctg 3376 // ID Gypsy-19_IS-I repbase; DNA; INV; 4338 BP. XX AC ABJB010862880; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_IS_; KW Gypsy-19_IS-LTR; Gypsy-19_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4338 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010862880; Positions 706 5043. XX CC Positions [3443-3922] - Integrase core CC 'ATAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 49..1008 FT /product="Gypsy-19_IS-I_1p" FT /translation="MEPPTSPEGDHTATTGAENLASSDPTASRLMALFKSW FT LQQAEASYARRESLLLQTMTSSIQDFTVSMTERFQRLESRFPELEEQRTAS FT HPVQNASRLSTLRLRPSTFDGTVPWAPYVSQFNLITDANQWSPAQKAQALA FT AHLTGPALTILHYLLPDTRHSYEVLVAALNERYGEGPHRFLHDAQLRNRRQ FT SATETVSHVADDIERLVHLAFTGCSQDTIDAVATMAFIDAISSVEVQDAVR FT LARSKTLRAAHTGALEIEAARQASRSTVLRPPTQPQAPPTATHAPPAPVCW FT NCHQQGHTRAQCPTYHTAAGTQQQHRRG" FT CDS 957..2873 FT /product="Gypsy-19_IS-I_3p" FT /translation="MSHVSHSSRNPTATSPRIITSTRVLTAPLTSIYLQGV FT VDGVSVGALVDTGATTSVIRPEVLMKANRAVPQSSDCYALQTATGDMVAVL FT GSRTVPFTVGSLHASLPVVVAAIEEQCILGVDFLKAHKCVLDFNAGLLHLG FT TNSYPLGAAKVTVLYDTGFPINDTLPYMCPRTSSIKLAAHVLPLYEGASAS FT LDKLDAERFCQLLEKFPSLFSTGDADIGLTSVATHRIVTGQASPILQPPRH FT IPLTQRSHVDDLVKTMLQQGVIEPASSPRSSPVTLATKKDATTRFCIDYRK FT LNDVTKKDSFTLPRTNDICDALAGCTWFSTLDLKSGYWQVPVQPEDCEDRE FT NTAFTTGIGWYQFRVMPFGLCNAPATFQRLMESVLNELIGTICLVYLDDII FT VFGKTVTDHLANLVFSRLQKANLKLTPKKCSLFTRQVEYLGHIISMDGVST FT DPRKLSALVEWPTPRSTTDVQSFIGLESYYRRFVSDFAHIAKPLYALTEKK FT AKFVWSSECQRAFETLKIALTSPPTLAYPKKTGRFILDTDASASASAIRAV FT LSQEQDGQEHVIAYYSKVFTKPERNYCATRRELLAIIKAITHFHNYLLGRK FT FVIRTDHASLTLMLSFKTPDGQVARWNTCMNMTSTYNIVPV" XX SQ Sequence 4338 BP; 1054 A; 1287 C; 1041 G; 956 T; 0 other; ctggtgtcag aagtggagct ccgttcgggc cgacgccttc cccgggtaat ggaacctcct 60 accagtccgg aaggggacca caccgcaact accggcgccg aaaatctcgc ttcttcagac 120 ccgacagcgt cccgactcat ggccctcttc aaaagctggc tccaacaagc cgaagcgtcc 180 tacgctcgac gagagagcct gcttctccaa accatgacat catccattca ggacttcact 240 gtctccatga cagaacgttt tcaacgcttg gagtctcgat tccctgaact cgaggagcag 300 cgcaccgcaa gccacccggt tcagaacgct agccgacttt caaccctgcg tctcaggcca 360 tcgacatttg acggtaccgt tccttgggcg ccttacgtat cccagttcaa cctcatcact 420 gacgctaacc aatggtctcc ggctcaaaag gcacaagccc tggccgccca tctaaccgga 480 ccagctttga caatactaca ctacctacta cccgacaccc gccattccta cgaagtcttg 540 gtcgcggcac tgaacgagcg ctacggggaa ggaccccacc gctttcttca cgacgcccaa 600 cttcgcaacc gtcgtcagtc tgcaaccgaa acagtttctc acgtcgccga cgacatcgag 660 cgcctcgtcc atcttgcatt taccggttgc tcacaagaca caattgacgc tgtggcaacc 720 atggcgttca ttgacgccat ctcatccgtt gaggtgcagg atgcggttcg cctagcacgc 780 tccaagacac tccgagcagc acacacgggc gcgctggaga tagaagctgc tcgacaagca 840 tcacgatcta cggttctgag gcctcctacg cagccgcaag ccccgccaac agcaacacat 900 gcgcctccag ccccagtttg ctggaactgc catcagcagg gccacacgcg ggcacaatgt 960 cccacgtatc acacagcagc cggaacccaa cagcaacatc gccgcggata atcacatcaa 1020 cacgtgtcct gacagcacca ctcacatcta tctaccttca gggcgtcgtc gacggagttt 1080 cggtaggcgc acttgttgat accggagcta caacctcggt aattagaccg gaagttttga 1140 tgaaggcgaa tcgtgctgtt ccacaaagct ctgactgcta tgctctgcaa accgcaacgg 1200 gagacatggt agctgtacta ggctccagaa ctgtaccctt caccgtgggt agccttcacg 1260 cttcgctgcc tgttgtcgtt gccgctatcg aggagcagtg catcctgggc gtcgatttcc 1320 tcaaagcaca caagtgcgtg ctcgatttca acgctggatt gctgcatctc ggaactaact 1380 cgtatcccct aggtgccgcc aaggtaacgg tcctctacga cactgggttt ccaataaacg 1440 acactctccc gtatatgtgt cctcgcacat cctctatcaa actagctgcg cacgtattac 1500 cattatacga aggcgcgtct gcttctctcg ataaactcga tgctgagagg ttttgccagc 1560 tgcttgaaaa attcccaagc cttttcagca cgggagacgc agatataggg ctaacatcgg 1620 tggcgacaca ccggattgtg acaggacagg cttctcccat tctacaaccc cctcggcaca 1680 ttcctctcac acagaggtct cacgtcgacg acttggtgaa gacaatgctt caacagggcg 1740 tgattgaacc agcaagttcc cccaggtcat caccagttac gctagcgacg aagaaggacg 1800 ctacaactcg attttgcata gactatcgaa aactcaacga cgtcactaag aaagactctt 1860 ttacacttcc tcggactaat gacatctgtg atgccctcgc tgggtgcacg tggttttcga 1920 cactcgacct caaaagcgga tattggcagg tcccggtgca accagaagac tgcgaagacc 1980 gcgagaacac cgccttcacc acaggtatcg gctggtatca gttcagagta atgccatttg 2040 gactctgcaa tgctccagct acatttcaga ggctaatgga atctgtcttg aacgagctca 2100 taggcactat ttgccttgta taccttgacg acatcatcgt cttcggcaag acagtcacag 2160 accatctggc gaacttggtt ttctcacgcc tgcaaaaggc gaacctgaag ctcacaccga 2220 agaaatgttc gctgtttaca aggcaagtgg aatacctcgg tcacatcatt tcgatggatg 2280 gcgtttccac agaccctcgg aaactctctg ctctcgttga atggccaacc ccgagatcca 2340 ccacggacgt gcagagcttc attgggctgg aatcgtacta tcgaagattc gtctccgatt 2400 tcgctcacat cgccaagcct ttgtacgctc tgactgagaa aaaagcgaag ttcgtctggt 2460 cctcggaatg ccagcgcgcg tttgagactc tcaaaatagc cctcacgtct ccgccaacac 2520 ttgcgtatcc gaagaagact ggacgcttca tacttgacac cgacgccagc gcctcggcct 2580 cggcaattcg cgcagtttta tcacaagagc aagacggaca agaacacgtc attgcttact 2640 atagcaaggt cttcacgaag cccgaacgca attattgcgc cacccgacgg gagttgctgg 2700 ccatcatcaa agccatcacc cattttcaca actatctact gggccggaag tttgttattc 2760 gaaccgacca cgcatcgctg acactgatgc tgagcttcaa gacacctgat ggtcaggtcg 2820 cccgctggaa cacctgcatg aatatgactt cgacatacaa catcgttccg gtttaaatca 2880 caccaatgcc gatggtctct ctcgtaggcc attaagtgaa aagtgcaagc agtgcttgag 2940 cattgaaaag cgcagtttcg acaggcttaa cgcactaacc aacctaccct gcgactacga 3000 agcgtggaca ccagaagagc tccgaaacgc tcaacttgac gacccggaca tcggcccatt 3060 agtaagatgg aagagacaga ataacaggcc ccctcctaag gaagtatctc ggctgtctcc 3120 ggccctcaag agttacttgg cgcaatggga ctctctgctg ctgcgcgacg gtgttcttta 3180 ccgacactgg gagtccgctg cgggcaactc tgttgtatgg caactggtgc tgccgcgaag 3240 actctgctct caagcgtttc atcagctgca cataagccct gaaggaggac atttcggcgt 3300 gaaccgcacc ctgaaaaaga tttggcagcg cttctactgg ctgcagtgcc ataatgacgt 3360 aaagcaatgg tgccgtctat gtgacatgtg cgtctcacgg gaaggtcctc acgtaaaaca 3420 gcgagcaccg cttcaagatt atgtcacggg agcacctttt gaacgcgtcg gcattgatgt 3480 gcttggaccg tttcctcgat ctaagcttgg caacaggttc ctgctggtca ttgttgacta 3540 tttcacgaag tggccggagg gcattcctgt gccgagccag accgcctccg atatcgccga 3600 cgctctggtg aggaactgga ttacacggta tggagctccc ctgagcatcc actcggacca 3660 aggcaggagc ttcgaatctt gggtgttcca agaagcctgc gaatcgttca gcatcaaaaa 3720 gacaagaacg acgcctctcc actcgcagtc aaacggttta gtcgagagga tgaatcgaac 3780 aatcctacaa catctgtccg tgttcgtcag tgatcagcaa gatgactggg atagcatcgt 3840 gcctttgttt ttgtatgcct atcgcacgtc tgctcatgag gtcactggct atacgcctgc 3900 acaatgctct tcggtcatga ccttcgcatg ccctgtgatt tagcccttcc acgagcatca 3960 taggcgacca caaaccctgc gtcctacgta ggtgaactga aaacaaacct cgagcggatg 4020 cacgcatttg ctcgacagcg tttggcgagc ggcttggatc gcatgaaagc ccggtacgac 4080 gcgcatgtca caacacgccc gttctctcca ggagaaagag tatggcttta caacctgatt 4140 cgcaagaaag gactgtctcc gaaactgcag cgtaactggc aagaaccttt catcgtcatc 4200 aaacgtctca gtgatgtggt ctatcgcata cgtcgcgggg atagattacg ggccctcgtg 4260 gtccaccgcg accggctatg tttgtatcag atacttctgt aggacttgct cggtcttcaa 4320 aggacgggag ggggatag 4338 // ID Gypsy-3_DWil-LTR repbase; DNA; INV; 245 BP. XX AC scaffold_180632; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_DWil_; KW Gypsy-3_DWil-I; Gypsy-3_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180632; Positions 9258 9502. XX SQ Sequence 245 BP; 103 A; 40 C; 53 G; 49 T; 0 other; tgtgggattg ggtctggcaa ctctgctgag cggggcgaat tgagctaaag acgagcgaaa 60 gacgggcagt cgaaccgaag agacgagcga aagcggttgc aaaaagacga aagacgtaac 120 gacgcgaagc agtaaagacg actaatttcc aactaaatct aattaaaata atcaaataaa 180 tagttttaca ttaaaataaa ctaaaatact acaaataaat atattaaata ggacaaattc 240 ctaca 245 // ID L1-7_CQ repbase; DNA; INV; 3200 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3200 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 137-137 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 16..3129 FT /product="L1-7_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MGLDIVFLQEVYDDQLSIPGFNTITNIDHAKRGTAIL FT LKQHIKYTHIERSLDSRILAVRIHESVTLCNVYAPSGTQHRAEREKLFNST FT IAHYLRHNAHYTLLAGDFNAVIRSRDSTGSSNFSRSLANLTQQLRFVDVWA FT TLRRNDDGFTYITHNSAARLDRLYVSVGMKEQLRTAEIHACSFTNHHALTV FT RLCLPSLGNDHGRGYWSLHPRLLDAETLDELRTKWTYWTRQRRHYTSWLNW FT WQGYVKHRLKAFFRWKSTESHLASQQQHQHLYSQLSSAYDEYQCNPAALSK FT INKIKGQMLSLQRKHSEAFFRNNETRVAGESLSTYHLGERTRKRTSIDQLQ FT TDDHRTLENQTDIQNHVVSYFRDLYSAEERLPEQDFVCNRVVPCDSESNNS FT SMNEITTPEIYQAIKSSASRKAAGPDGIPKEFYLYAFDIIHRELNLIMNEA FT LNGDFSADFVAGVVVLVKKKNTPNTVHGFRPISLLNYDYKLLARILKSRID FT RILRSHGVLSSAQKCSNSERTIFQATLAIKDKIAKLNKEKKSAKLISFDLD FT HAFDRVDRSFLYTTMQDLGFKPSLIALLKRIGDLSSSRLLINGHLSESFAI FT ERSVRQGDPLAMHLFVLFLHSLICKLEAVCSGDDDLIAVYADDISVVVSNN FT SKVDELRRLFQSFESCSGAKLNTNKTFAMDIGWCNQRNRWDIPWLQTTDQM FT KILGIIFTNSIRRMITINWDKLVGKFAQLLWLQKLRLLSLHQKIAVLNTFA FT TSKLWYVSSILTINTAHIAKITSLMGSFLWDGAALRVPIHQLALPIETGGL FT KLQLPFFKCKALLVYRHLAERQSSPFYASYLASTLNPPNLSAIPSSLPCLK FT TVCQEFAYLPQRLRDSTQLSSSAVHGHLLCSAEKPKIMQQHPDLLWTRIWK FT NISAKELNTAEKTCLYLTVNQKISHQELLFRTNRAPSPNCEHCGFVGTLQH FT KLADCSRVSGAWALLQGKLQGITTINLPYEALARPELRRIGRRTRAAILKT FT FACYVIFIEKVNGLVDINALDFFLQTEVHLNC" XX SQ Sequence 3200 BP; 952 A; 828 C; 654 G; 766 T; 0 other; aattttatac gcacaatggg tctggatatc gtgttcctcc aagaggtgta cgatgatcag 60 cttagtatac ccggtttcaa caccataaca aacattgacc acgctaagcg aggcacagct 120 atcttactca aacaacacat caaatacaca cacatcgaga gaagtcttga ttcccggatt 180 ctagctgtac gtatacatga gtctgtaacc ctatgtaacg tatatgcacc atcgggtaca 240 caacaccgtg ctgagagaga aaagctcttc aactcaacaa tagcacacta tcttcgacac 300 aacgctcact acacgctcct tgctggagat tttaacgctg ttatcaggtc gagagactcc 360 actggtagta gtaactttag cagatctctc gccaacctga cacagcagct gcgctttgtt 420 gatgtatggg caaccttacg gagaaacgat gacggcttta cctacatcac ccataactct 480 gcggccaggc tggaccggct gtacgttagt gtgggaatga aagaacaact ccgtactgcc 540 gaaatacatg cttgttcttt cacaaatcac catgcgctca cagtacgcct ctgtttacca 600 tctcttggta acgaccatgg ccggggttac tggtctcttc acccacggct gcttgatgcg 660 gaaaccctgg atgaactgcg cactaaatgg acatactgga ctagacaacg tcgacactac 720 acttcgtggt tgaactggtg gcaaggctac gtgaaacacc gtttgaaagc cttcttccgc 780 tggaaatcaa cagagtctca cctggcttct caacaacaac atcagcacct gtacagccag 840 ctctcctctg cgtacgacga ataccaatgt aatccagcag ctttgtccaa aatcaacaaa 900 atcaaagggc aaatgttgtc actccaacgc aaacattctg aagcattctt ccgaaacaac 960 gaaacccgag tcgccggaga gtctctctcc acataccatc tcggagaacg cactagaaag 1020 agaaccagca tcgatcaact acaaaccgat gaccacagaa ccctggagaa ccagaccgac 1080 atccaaaacc acgtggtgag ttactttagg gacctttact ctgcagagga gaggctccct 1140 gaacaagact tcgtttgtaa cagggttgta ccctgtgaca gcgaaagcaa caacagtagc 1200 atgaacgaaa ttaccacgcc tgagatctat caagcgatca aatcaagtgc ttcgcggaaa 1260 gcagccgggc cagatgggat tccaaaagaa ttctatttgt acgcctttga catcatacac 1320 agagaactga acctaatcat gaatgaagct ttgaatggag atttctcagc tgattttgta 1380 gctggagtgg ttgttcttgt gaagaagaaa aacactccca acactgtcca cggcttccgc 1440 cccatatctc tactgaacta cgactataag ctactagcac gaattctcaa aagtcggatt 1500 gatcgaattt tgagatccca cggagttctg agctctgcac aaaaatgctc caactctgaa 1560 aggacgatat ttcaagccac cctcgccata aaagataaaa tcgccaaact gaacaaagag 1620 aagaagtcgg cgaagctcat atctttcgat ttggaccacg cgtttgatcg agttgatcga 1680 tcttttctct acaccaccat gcaagatctc ggtttcaaac cctcactgat cgctctacta 1740 aaaagaatcg gcgatctctc atcctctcgc ctgctcatta acggtcacct ctctgagtcc 1800 tttgcgattg agagatctgt ccggcagggt gacccactgg ccatgcatct gttcgtgctt 1860 ttcctgcatt cccttatctg caagcttgaa gcggtatgta gtggagacga tgaccttatt 1920 gcagtttacg ctgacgacat ctctgtagtt gtttccaaca attcgaaagt ggacgaactg 1980 agaaggttgt tccagagttt tgagagctgc tcaggggcca agctgaacac aaacaaaaca 2040 ttcgcgatgg acatcggatg gtgcaaccag cggaaccgat gggacatccc ttggttgcag 2100 acaacagacc agatgaaaat tctcggaatc atctttacaa actccattcg acggatgatt 2160 accatcaact gggataaact cgtgggaaaa ttcgctcaac tgctgtggtt gcagaaactg 2220 cgacttctca gtctacacca gaagattgca gtactcaaca cattcgcaac ttcgaagctg 2280 tggtacgtat catctatcct caccatcaac acagcacaca tagcaaagat tacgtcgctc 2340 atggggtcct tcttgtggga tggagctgcc cttcgagtac cgattcatca attagctctc 2400 ccgattgaga cgggtgggct gaaacttcag cttccttttt tcaaatgcaa agcacttctg 2460 gtatacagac acctcgctga aaggcaaagc tcacctttct acgcttcgta cctggcttca 2520 actttgaacc cacccaacct ctccgcaatc ccgtccagct tgccctgtct aaaaacggtg 2580 tgtcaggagt tcgcatatct tccacaacga cttcgagact cgacacagtt atccagctca 2640 gcagttcatg gacaccttct gtgcagtgct gaaaaaccga agatcatgca acaacatccc 2700 gacctgctgt ggactagaat atggaagaac atatctgcaa aggaattaaa cactgccgag 2760 aagacctgcc tctacctgac agtcaaccag aaaatatcgc accaagaact actcttccgg 2820 actaacaggg ctccttcgcc aaactgcgag cactgtggat tcgttggcac cttacaacac 2880 aagctagcag attgcagccg agttagtgga gcctgggctc tactgcaagg gaaactgcaa 2940 ggaataacaa caatcaacct tccctacgaa gcgttagcac gaccagaact ccggcgtatc 3000 gggagacgaa cgagagcagc aatacttaaa acatttgctt gttatgtcat tttcattgaa 3060 aaagttaatg gattggtaga cattaacgct ctagatttct ttttacaaac tgaagttcac 3120 ttaaactgtt aaataacact tagatttaat atttgcattg actgaataaa acaactatac 3180 aaaaaaaaaa aaaaaaaaaa 3200 // ID BEL-1_Cfl-I repbase; DNA; INV; 5694 BP. XX AC AEAB01030939; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_Cfl_; KW BEL-1_Cfl-LTR; BEL-1_Cfl-I. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-5694 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01030939; Positions 7598 13291. XX CC Positions [4277-4855] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 113..5227 FT /product="BEL-1_Cfl-I_2p" FT /translation="MSVSSETKVLLEAQYDIHGRISRTVDNLRKMGVSNIT FT RDAIQARTGILDSLWQKMETQHELIRTALKDRYKESEYAVSNFIEVAENTY FT VTQRSTLTGYAEKLKGEPSAAPKSEPSQDQAPRTSLPRIKLPMFSGSYEEW FT PSFRDLFLSVIGENSSVSDIERLHYLRSCVKGSAEKLIKSLTVTGDNYQRA FT WVILCRHFENKRELIRSNFFAFTSVSKMKCESAEELGRIHNAVTTAVNAQE FT SIGRPIDTHGMDLLNFLTIELFDSRTRMEWESSISDSTDPPEHDTLTNFIA FT KRILTLNAAKPRVPTKSFEGASRTAKAQHARTGSEHLKCALCQGKHTLMLC FT GDFKAKTASDRKSFVEQNRLCYNCLGNHLASKCQSVKTCPTCKNKHHTTLH FT HAYTLPTSNEVSVLSATHSSSERKAILLATARLHIADRAGDLHPVRAMLDQ FT GSEVSIVSEALVQRLRLPRTSSSVSIIGIGGARSGSTRGRVALSLSSTTTG FT TKLKAVAFVLPRLSAYQGSSVRTLAPWPHIRGLELADPQYQEHDPVELLLG FT AEVCAHILEAGLRKGGPHEPVAQKTVLGWILSGGSSSAPFQGPRNSLQCSV FT DKELDQLVHRFWEQERESPAPVALTPEEEECETLFTTTHRRTASGRYIVRL FT PFSSSPTPLSDTRKPAERLLAAMDLRGRRDPEFGARYREFMQEYENLEHMS FT PVETVSTAGCYLPHHGVLRQTNSGTKLRVVFNGSQRTASGDSLNQRLMVGA FT NLLPALADVLLRWRRHRFVFVTDIEKMFRQILVHPEDRRFQRILRRTEASE FT NARECELNTGTYGRACAPFLAARTLRQLAVDEKARGPRGAPALLHDCYMDD FT IITGADSLHHAVALQTELRELCTAGGFPLRKWAANFEAVLEGVPSDHRLQQ FT AHHSWENEVHSTLGLRWHTASDNFSFAIQPPVASEFTKRRVLAETARLFDP FT LGWLAPVVIRAKILIQSAWLKRLGWDTPLPTEDAKSWQRLLSELPLLENLR FT IKRWLGVDHPAPRIEIHGFADASERGFAAVVYLRVTSGDETAIHLLAAKSK FT VAPIKPVSLPRLELCAAALLTNLTFHLRKTLELVSAPITLWSDSRVTLQWI FT QGHASRWKTFVANRVSHIQEKLPEARWRHIPGRDNPADCASRGVEPRDLIS FT HSLWWTGPSWLPKDPLLWPQSDHAEHEVDVPETRAVVSNLTTQEKTEPEML FT LRFSDLHRLLRVTAWCCRWRGNPPSSREHLPLAPDELEAALLRWIRLVQAH FT HFPKEIAALKRGNASVKGQLSKLTPFLDNHGIIRVGGRLKHAVLSPDERHP FT IIMPPESWLTQLLVRAHHRRTLHGGVQLTLGLLRLRFWIPRGRSLVKGIIH FT RCVTCARWKAAAPQPIMGNLPTARVTPARPFLRTGIDYAGPILIRTAKGRG FT HKAHKGFIAVFVCLATKAVHLEVASDYSTEAFLAAFRRFTSRRGLCEEVYS FT DCGTNFVGANRELRLLFQASSSDGRRIAHSAASDGIRWKFNPPAAPLFGGL FT WEAAVKSTKHHLRRVLGDTTLTFEEMSTFLAQVEACLNSRPLQALSDDHDD FT ITALTPGHLLIGAPLLAVPEPSLSDCNPNLLARWKLLQRMRDHFWERWSRE FT FINSLASRPKWLKDTVSPSVGDLCLLRSDVTPPTRWPLARIVQLRPGGDGV FT TRVVAVRTASSELPRPLAKTVLRPGANTTAPTTPVS" FT CDS 4069..4971 FT /product="BEL-1_Cfl-I_1p" FT /translation="MVDSAPRQGTPPANAPRRGPAHSWTAPPTLLDSSRPL FT PRQGHNPPVCDVREMEGCSTAAYNGQSPDGTSHSSTPVSAHRDRLRRTDLD FT TNRKGERSQGAQGVYRRFRLSRYQGRAPGGRVRLLHRSLSRCVPAIHFPSG FT PLRGSLLGLRHKFCRRQSGAAPFVPGVVIRRPAHRSQCRIRRDTVEVQSAG FT GTSLRRALGGSGEINQTPSAEGPRRHDPHLRGNVYLPRASGGVSQLPSSSS FT LVRRPRRHHGTHSRSPFDRGPAAGRARALAVRLQSKPLGPLETPPTDARPL FT LGEMVSRIH" XX SQ Sequence 5694 BP; 1178 A; 1855 C; 1523 G; 1138 T; 0 other; tttctggtcc ttcgagccgg atcccaacat cctccgcgtc cgctgtctcg tgagtgatcc 60 tcccagtgct gtgcagtttg tggcgtgtag ttacctctga aacagaagga caatgagtgt 120 tagttccgag acaaaggtgc tgctggaggc ccagtatgac atccacggtc ggatctcccg 180 caccgtagac aatctgagga agatgggcgt gagcaacatc actcgtgatg ccatccaagc 240 ccgtaccggc atcctggaca gcctctggca aaagatggag actcagcatg agctcattcg 300 aacagcgctg aaagacagat acaaggagag tgagtacgct gtatctaatt tcattgaagt 360 cgcggagaat acgtacgtga cgcaacgtag cacgttgact ggctacgctg aaaagctcaa 420 gggcgagccg tccgccgcgc ccaaaagcga gcctagccag gatcaggctc ctaggacctc 480 gttgccgcgc atcaaactcc cgatgttttc gggatcctac gaagagtggc cctcgttccg 540 cgacctcttc ctgtcggtca tcggggagaa ttcgtccgtc tccgacatcg agcgcctcca 600 ctatctccgg tcttgcgtca agggatccgc tgaaaaatta atcaaatcgt tgactgtgac 660 cggcgacaat tatcagcgcg cgtgggtgat tctgtgcaga cactttgaaa acaagagaga 720 gctcattcgc tcgaatttct tcgcgtttac ctcggtttcc aaaatgaagt gcgagtccgc 780 ggaggaactc ggtcgcattc acaacgccgt tacgaccgcc gtaaacgcgc aggaaagtat 840 aggaaggccc atcgacactc acggtatgga cctccttaac ttcctgacga tcgagctctt 900 cgattcgcgc acccggatgg agtgggagtc ctcgattagc gactccaccg atccgcccga 960 gcacgacact ctcactaact tcatcgccaa gcgtattctc acgctgaatg ccgccaagcc 1020 gagagtaccg acaaagagct ttgaaggtgc gtcgcgaacc gcgaaagctc agcacgcaag 1080 gaccggttcc gaacatttaa agtgcgcgct gtgccaaggg aagcacacgc ttatgctgtg 1140 tggtgacttc aaggcgaaga ccgcgagcga ccgtaagtcc ttcgtagaac agaacagact 1200 gtgctacaac tgcctcggga atcacttggc ctcaaagtgt cagtctgtca aaacctgccc 1260 aacgtgcaaa aacaagcacc acactacact acaccacgcg tacaccctgc ccacttccaa 1320 cgaggttagt gtcctgtctg cgacgcactc ttcgtccgag cgcaaggcga tcctcctcgc 1380 gacagcccgc ctgcacatag cggatcgcgc cggggatctt catcccgttc gagcgatgct 1440 cgatcagggc tccgaagtgt ccatcgtctc cgaagcactg gtccaacgtt taaggcttcc 1500 ccgcacaagc tcctcggtgt ccatcatcgg catcggcggg gctcgatccg ggtcaacccg 1560 cggcagagtg gcgctcagcc tctcctcgac gactacgggg accaagctca aggccgtcgc 1620 tttcgtgcta ccacgcctgt cggcgtatca ggggtcgtct gtgaggaccc tcgccccctg 1680 gcctcacatc cgcggcctcg agctcgccga tccgcaatac caggagcacg atccggtcga 1740 gctgctgttg ggtgcggagg tctgcgccca catcctggag gccggccttc gaaaaggtgg 1800 acctcacgag ccggtcgccc agaaaacggt cctaggatgg atcctgtcgg gaggatccag 1860 ctcagcgccc ttccaaggtc cccggaactc cttgcagtgc tctgtggaca aagagttgga 1920 tcagctcgta caccgttttt gggagcagga aagggagtcg ccagcccccg tggcgctcac 1980 gccggaggaa gaggagtgcg agactctctt cacaaccact caccggcgaa cggcgtccgg 2040 acgatacatc gttcgtctcc ccttctcgtc ctcgccaacg ccgctcagcg atacccggaa 2100 gcctgccgaa cgcctcttag ccgccatgga cctccggggc agaagagacc ctgagttcgg 2160 tgctcgatat cgggagttta tgcaggagta cgagaacctg gaacacatga gtcctgtgga 2220 aacagtcagc acggctggct gttacctacc tcaccacgga gtcctccgtc agactaactc 2280 cggtaccaag ctccgcgttg tattcaacgg gtcgcaacgc accgcctccg gggactctct 2340 caaccagcgc ttgatggtcg gcgccaactt gttaccggcc ttagctgacg tcctccttcg 2400 ctggcgtcgg caccgcttcg tgttcgtcac ggacatcgaa aagatgttca gacaaatcct 2460 cgtccaccca gaggatcgcc gcttccagcg gatactccgg cgcaccgaag catctgaaaa 2520 tgcaagagaa tgcgagctta atactggcac gtatgggcgc gcctgcgctc cgttcctcgc 2580 cgcccgcacg ctccgccaac tcgccgtcga cgagaaggca cggggtccga ggggagcgcc 2640 cgctctactc catgactgtt acatggatga cataattacc ggggctgact cgttgcacca 2700 cgccgttgca ctgcaaacgg agcttcgaga gctttgcacg gcgggcgggt ttccacttcg 2760 caagtgggcc gccaactttg aggccgtcct agagggtgtt ccctccgacc accggctcca 2820 gcaggcgcac cactcctggg agaatgaggt acactccacg ttgggcctgc gctggcatac 2880 cgcctccgac aacttctctt tcgcgatcca gcccccggtc gctagcgagt tcaccaagag 2940 gcgagtattg gccgagacag cgcggttgtt cgacccctta ggatggctcg cccctgtcgt 3000 tatccgcgcc aagatcctca tccagtcggc gtggctgaag agactaggtt gggacactcc 3060 gctcccaacc gaggacgcta agtcttggca gcgcctcctc tccgaactcc ctttgctgga 3120 gaacctccgg attaagcgtt ggctgggcgt ggaccacccg gccccgcgga tcgagatcca 3180 cgggttcgcg gacgcgtctg agcgcggttt tgcggccgtc gtctatctcc gcgtaaccag 3240 tggcgacgag acggccatcc acctcctagc tgcaaaaagc aaagttgcgc ctattaaacc 3300 agtgtccctg cctcgcttag aattgtgcgc cgcagcccta ctcaccaatc tgacgtttca 3360 ccttcgcaag accttagaat tggtttcagc tccgataact ctgtggtcag actcccgggt 3420 gacgctccag tggatccagg gacacgcttc cagatggaag accttcgtcg ctaatagggt 3480 gtcccacatc caggagaagc ttccggaagc gcgatggcga cacatcccgg gacgagacaa 3540 ccccgccgac tgcgcgtctc ggggagtaga gccccgggac ctgatcagcc actccctctg 3600 gtggacagga ccgtcatggc tccccaagga cccattgctc tggccacaga gcgaccacgc 3660 cgagcacgag gtcgacgtgc cggagacgag agccgtcgtc tccaacctga cgacccagga 3720 gaagaccgaa cctgagatgc ttctcaggtt ctccgacctc catcgcctcc taagagttac 3780 tgcgtggtgt tgccgatggc gaggcaatcc accgagctcc agggagcacc tccccctcgc 3840 gcctgacgag ctagaggcag ccctgctccg ttggatccgc ctggtgcagg cacatcactt 3900 ccccaaggag atcgccgctc tgaaaagggg caacgcgtca gtcaagggtc aactgtcgaa 3960 actgaccccc ttcctcgaca atcatggcat tattcgagtc gggggccggc tgaagcacgc 4020 tgtgctctca cctgatgagc gccatccgat catcatgcct ccagagtcat ggttgactca 4080 gctcctcgtc agggcacacc accggcgaac gctccacggc ggggtccagc tcactcttgg 4140 actgctccgc ctacgcttct ggattcctcg aggccgctcc ctcgtcaagg gcataatcca 4200 ccggtgtgtg acgtgcgcga gatggaaggc tgcagcaccg cagcctataa tgggcaatct 4260 cccgacggca cgagtcactc cagcacgccc gtttctgcgc accgggatcg attacgccgg 4320 accgatcttg atacgaaccg caaaggggag aggtcacaag gcgcacaagg ggtttatcgc 4380 cgttttcgtc tgtctcgcta ccaaggccgt gcacctggag gtcgcgtcag actactccac 4440 cgaagccttt ctcgctgcgt tccggcgatt cacttcccgt cggggcctct gcgaggaagt 4500 ctactcggat tgcggcacaa attttgtcgg cgccaatcgg gagctgcgcc ttttgttcca 4560 ggcgtcgtca tccgacggcc ggcgcatcgc tcacagtgcc gcatccgacg ggatacggtg 4620 gaagttcaat ccgccggcgg cacctctctt cggcgggctt tgggaggcag cggtgaaatc 4680 aaccaaacac catctgcgga gggtcctcgg agacacgacc ctcaccttcg aggaaatgtc 4740 taccttcctc gcgcaagtgg aggcgtgtct caactcccgt cctcttcaag ccttgtccga 4800 cgaccacgac gacatcacgg cactcactcc cggtcacctt ttgatcgggg ccccgctgct 4860 ggccgtgccc gagccctcgc tgtcagactg caatccaaac ctcttggccc gctggaaact 4920 cctccaacgg atgcgcgacc acttttggga gagatggtct cgagaattca ttaactcact 4980 ggcttcccgg ccaaagtggc tgaaggacac cgtcagcccc tccgtaggcg acctgtgcct 5040 actgcggtcg gacgtcacac cgccgaccag atggccactc gcccggatcg tccagctgcg 5100 ccctgggggt gacggcgtga cacgagtggt ggcagtgcgc accgcgtcct cggagctccc 5160 ccggccgctt gccaaaaccg tcctccggcc cggagccaac accacagcac caacaacccc 5220 ggtctcctga gcgtgaaaga tcgcattttt tttttcatcc tccttttttt ctattcatta 5280 taatcactgg tacttaccgc gacatcaccg ataagagtac cgcgcgtaat caccgctttc 5340 tcacatgcga taatcacacg cgaaccgcgc taccactctt cattcttttc tcaccgacac 5400 ttcacctcac ttgaatcgtt catcacccgc tgtcactcaa gcatcactgc actcgcgaga 5460 tcatccgcta aaaccccttg caatcagctc accgcattca ctgtatattc ttcattaccg 5520 caccacttca tattacggag gattctatcc gactgcacaa cacgtgctct cgcgtgacgt 5580 cacacattcc attgattgat cgcgatcaat ggctcgccga tagtatcagt gcgacatctg 5640 tccgtacccg cgcccaccca ggaatttcgt cacgcactga cgaggcgggc ggga 5694 // ID Gypsy-10_DWil-I repbase; DNA; INV; 4435 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_DWil_; KW Gypsy-10_DWil-LTR; Gypsy-10_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4435 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 8960380 8964814. XX CC Positions [1975-2517] - Reverse transcriptase CC Positions [3535-4011] - Integrase core CC 'TGTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 400..4434 FT /product="Gypsy-10_DWil-I_1p" FT /translation="MSTPKTKKTVAKPVVKRASQKEIGDIKHKTTSMSEKA FT EHTTPIPIQSEMGDELKVCPNEYHMEEAEQSAEDKMEQIIQLLSLKIVSDE FT KRDGQMKISADNFGKVVGEYDGKSIQVEKWLEIFERNAEVYDLSEKQKYCY FT ARAKMSGAAQLFLETENICNYEELKSALVEEFTCNYNSADIHQLLQERKRK FT NSETLHEYLLQMKKIASLGYVEDTAVISHIVNGLDIKNEYKVAMYRCKSFK FT ALKEEFDIYDRLNISEKKVNKEQQKSFFKQNNGQMAKKEYCFNCGSGNHKR FT NKCDAETKCFKCNQNGHISRDCPKNADKVRVISNRSRLKKIRINGVEVECL FT VDTGSDVSVIKESIFKRLKNVALIEDIRVLHGLGNGTAKPIGHFTADVVVD FT KLNTNQKFIVLPNNQIDYDGLLGDDLIKKFRLVGSKEGYTFLEENSDEMPL FT NEFALIYNVTEESSFIAPPEYRKEVEQMIESSYHGSPEVVKQCPIQLKIIP FT DGVIKPFHHSPSRLAADEASEVKKQVEQWIKQGIVRKSSSNIASRVVVVKK FT KDGTLRVCVDYRKLNKMVLVDCFPVPIIEEVLEKLQPAKWFTIMDLENGFF FT HVPIEEQSKPYTSFVTKEGLFEFNQAPFGFKNSPAAFIRFVNHIFQELINS FT GVMQLYMDDIIVYAETPDVCMRRTKMVLETAAQFGLKMKWKKCSFMQTRIS FT FLGHIVENGRIWPGKEKTAAVSRFATPKDIKSVQAFLGLTGFFRKFIPGYA FT QIARPLTNLLRKEAVFYIGEAEQQSLQTLKNLLVMDPVLHIYSREAPTELH FT TDASKDGFGAVLLQHFGGYLHPVYYWSRKSTEAESKRHSYYLEVKAAYLAL FT KKFRHYLLGLKFDLVTDCAAFKQTTTKSDVPREVSQWILYMEDFTFNAVHR FT PGDKMRHVDFFSRFPLTSMVVTTELTTRIRKAQLEDNFIKAVLEILKQQSY FT QDFKLKGGLLFKVVNGNDLLAVPKLMERDVIQGAHEVGHFSTAKTMHSIQQ FT QYWIPHLERKVANVIANCISCIIFSKKLGKQEGYLHCIEKGDTPLHTLHID FT HLGPMDATAKQYKYVLTAVDAFSKFVWLFPTKSTGYEEVIKKITDWSAVFG FT FPRRIISDRGVAFTSNAFAEFLNENKVEHVWTTTGVARGNGQVERVNRSIL FT SIIAKLSAHDSAKWYKFVPQVQKAINSHVHASLKLSPFEVMFGTKMNSNAD FT GRLLEVLNEELITQFNSEREELRSRAKENIENAQQAYKKNFDKRRRAEYGY FT RVGDMVAVKRTQFVAGRKLASEYLGPYEVTKIKRNGRYDVRKAASVEGPNI FT TSTSCDNMKLWRFISENDELLSSGTDEDEQEGR" XX SQ Sequence 4435 BP; 1483 A; 786 C; 1073 G; 1093 T; 0 other; tttgggggct cgcccgtgtc aaatacagag acttaaaatg catatgtgat cgcgaaaagt 60 cgagtgttgt tttgttaaaa gttgctaagt tgttatatgc tgctaaaaag ttgttaaagt 120 tgttacaagt agctaaaaac attaagttgt gctgttcaaa acaaaacatt cttgcagacg 180 cgtggtctta aaaatcagtt gcataaagtt gtcaaattcg aagtcgccct agcgtcgcca 240 gaggagtcca aaagttaggc gaaagaaaga gacagaagaa aaaaggcgcg aaaaaaaaaa 300 aaaaagaaag ttgtgaaatt cgaagtcgcc ctagcatcgc cagaaaagtc ccgaagttag 360 gctaaagaaa gagacagaag aaaaaaggcg cgaagtacta tgtcgacgcc taagacgaag 420 aaaacggttg caaagccggt agtaaaacgt gcaagtcaaa aagagatagg agatattaaa 480 cataaaacaa caagtatgag cgagaaagca gaacacacga cgccgattcc aattcaaagc 540 gagatgggag atgaactcaa agtgtgtcca aacgaatatc atatggagga agctgaacag 600 tctgcagagg ataaaatgga acaaataata cagctgctga gtctaaaaat tgtttctgat 660 gagaagcgtg atggtcaaat gaagatttct gcggacaatt ttggaaaagt tgtgggcgaa 720 tatgatggaa aatctataca agtggaaaag tggttggaaa tatttgaaag aaatgccgaa 780 gtttatgacc ttagtgagaa gcagaaatat tgctatgcca gagctaaaat gagcggtgca 840 gcccaactgt ttttagaaac cgaaaatatt tgcaattatg aagaacttaa aagcgcccta 900 gtggaggagt ttacatgtaa ctacaatagt gctgacattc atcagctgtt gcaagagaga 960 aagagaaaga acagcgaaac actgcatgag tacttacttc aaatgaaaaa gatagcttca 1020 cttggttatg ttgaagacac agctgttata agtcacatag tcaatggtct ggatataaaa 1080 aatgaatata aggttgcaat gtatcgttgt aagtctttta aggctttgaa agaagagttc 1140 gatatctatg atcgtttgaa catatcggaa aagaaggtca ataaagaaca acagaaatca 1200 tttttcaagc agaataacgg acaaatggct aaaaaagagt attgcttcaa ctgcggatcg 1260 ggaaaccata agcgcaataa atgtgacgcc gaaacgaaat gcttcaagtg caaccaaaac 1320 ggacatattt cccgtgactg tcctaagaat gctgataaag ttcgtgttat ttccaatcgc 1380 agtcgtttga agaaaatccg tataaatggt gttgaagtcg aatgtctggt agatacagga 1440 tcagacgtct cagtgatcaa ggagagtatt ttcaaaagat taaaaaacgt ggcgctaatt 1500 gaagatatac gtgtgttgca cggtctaggc aatggtacag cgaagcctat tggacacttc 1560 actgcagatg tggtcgtgga taagttgaac acaaaccaaa agttcatagt acttccgaat 1620 aaccagattg attacgatgg actgctgggg gatgatttga tcaaaaagtt ccgtttggtg 1680 ggcagtaagg aaggatatac gttcctggag gagaactcag atgaaatgcc attgaacgag 1740 tttgcactta tatataatgt cacagaagaa tcttccttta tagctccgcc agaatatcgc 1800 aaagaggttg aacagatgat tgaaagcagt taccatgggt caccagaggt tgtaaagcag 1860 tgcccgattc aattgaagat tattcctgat ggagtaatca aaccgtttca tcactcaccg 1920 agtcgcctag cagcagatga agccagtgag gtaaagaagc aagtcgagca atggatcaaa 1980 caagggatcg ttcgaaagtc atcttcaaat atcgccagca gagtagtcgt cgtgaagaaa 2040 aaagatggaa cacttcgagt ttgcgtggac tacagaaagc tgaacaaaat ggtattggtt 2100 gattgtttcc ccgtgccgat tatagaggag gtgttggaaa agctacaacc ggcaaaatgg 2160 tttaccataa tggatctgga aaatggattt tttcatgtcc ctattgaaga gcaaagcaag 2220 ccgtatacat catttgtcac aaaagagggg ttgttcgagt tcaatcaagc gccttttggt 2280 tttaaaaact ctccagccgc gtttataagg ttcgttaatc atatttttca agaactaatt 2340 aactctggtg taatgcagct ctacatggat gacataattg tatatgccga gacgcccgat 2400 gtctgtatgc gaaggacaaa gatggtactc gagacggcag cccaattcgg cttaaagatg 2460 aaatggaaaa agtgcagttt catgcagaca cgcataagtt ttcttggtca tattgttgaa 2520 aacggacgaa tctggcctgg caaggagaaa acagcagctg tcagtcgatt tgctacgccg 2580 aaggacataa aatcagttca agcttttctg ggacttacag ggttttttcg aaaatttatc 2640 cctgggtacg cacaaatcgc acgtccgctg acgaacttac tcaggaagga agctgttttt 2700 tatattggag aagcagagca gcaatcgctg caaacactaa agaatctact ggtaatggat 2760 ccggtgttgc atatttactc aagggaagcg ccaaccgaac tccacacgga tgcatccaaa 2820 gacggatttg gagcagtgtt gctgcagcac tttggtggct atcttcatcc agtttattat 2880 tggagcagaa aaagcacaga agcagagtcc aagcgtcata gctattatct ggaggttaaa 2940 gcagcctacc tcgctctcaa gaagtttcga cactatctct taggcttgaa atttgatctt 3000 gttactgact gtgcggcgtt taagcagact actactaaaa gcgacgtacc aagagaagtc 3060 tcgcaatgga ttttgtatat ggaagacttc acttttaatg cagttcaccg gccgggtgat 3120 aagatgagac acgtggactt cttcagccgt tttccgctca caagcatggt cgtaacaacg 3180 gaattaacta ctcgtataag gaaggcccag ctagaagaca acttcatcaa ggcagtattg 3240 gaaattctaa aacaacaatc atatcaagat ttcaagctga aaggtggtct cctttttaaa 3300 gtcgtaaatg gcaatgattt gttggctgtt ccaaagttga tggaacgaga cgtgattcaa 3360 ggggcgcatg aagtggggca tttttcgacg gcgaagacga tgcactcaat ccagcagcaa 3420 tattggatcc cacatcttga aaggaaagta gctaatgtga ttgctaattg tataagttgt 3480 atcattttca gtaaaaagtt gggcaaacaa gagggctatc ttcactgcat agagaaaggg 3540 gacacgccac tgcatacgct tcacatagat catctgggtc cgatggacgc gacagcgaaa 3600 caatacaaat atgttctgac ggctgtggat gctttttcta aatttgtgtg gcttttccct 3660 actaagtcaa ccggctatga agaagtgatt aaaaagatca ccgactggtc tgctgtgttt 3720 ggcttcccaa ggcgtataat cagcgacaga ggagtagcgt ttacatctaa tgcttttgct 3780 gagtttctta atgaaaacaa agtggagcat gtgtggacaa caacaggtgt tgccagaggt 3840 aacggccaag ttgaaagagt aaaccgttcg atcctgagta tcatcgcaaa actgtcggct 3900 catgactcgg caaaatggta caagtttgtg ccacaggtcc agaaagcgat taattcccat 3960 gtgcatgcat cgttgaagtt atcaccgttt gaagtaatgt ttgggaccaa aatgaacagc 4020 aacgcggatg gaagattatt agaggtccta aatgaagagc tcattacgca gtttaacagc 4080 gagcgagaag agttgcgtag tcgagcaaaa gaaaacattg aaaacgctca acaagcttat 4140 aaaaagaact tcgataagag acgtcgagct gagtatggct atagagtggg agatatggta 4200 gcggtcaaga ggacacaatt cgtcgcagga cgcaagctgg caagcgagta tctaggccca 4260 tatgaggtca cgaagataaa acgaaatggc cgctatgacg tgagaaaagc ggcaagcgtc 4320 gaaggaccaa atatcacatc aacgagttgc gataatatga agttatggcg ttttatttct 4380 gaaaatgacg agctgttatc atctgggaca gatgaggatg aacaggaggg ccgaa 4435 // ID Daphne-1_TCa repbase; DNA; INV; 3970 BP. XX AC . XX DT 14-JUL-2009 (Rel. 14.07, Created) DT 14-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Daphne non-LTR retrotransposon: a consensus sequence. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Daphne; KW nonautonomous; Daphne-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-3970 RA Kapitonov V.V. and Jurka J.; RT "Daphne non-LTR retrotransposons from Tribolium castaneum."; RL Repbase Reports 9(7), 1347-1347 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 377..961 FT /product="Daphne-1_TCa_1p" FT /note="ORF1." FT /translation="MIAGQSPPAHTVVDNKSAGRNKNEIKNKDKLNKVVFE FT KDQLQAQSNTTATKIVENNKKSMPRHETPKNEDSVIEERKNDWKTITHKKL FT DRKQQRNAVIGTHACDKIKTAPKKAFLYVSRLYPETKSEDLVSLLQNTFPE FT VTCMKLDSKYPEYYSSYKVTINLNNLEHAMKSEIWPEGSYVTRYFHPRRSK FT TSSQ*" FT CDS 904..3741 FT /product="Daphne-1_TCa_2p" FT /note="ORF2." FT /translation="RIICNAVFSSTSVEDQQSITPDSLQQCKGSLKILHLN FT IQCLSNKLNEILLFLENHRDYDVLGFSEHWLSEYEIQSINLLNFKLATCFC FT RRILRHGGTVLFVRDGIKFKHLDLSCFSIECDAEFSGIYLNEVKTVVICLY FT RSGLGNVDIFLEQLGLLLDHIQNKTTDIVLMGDFNIDFKMHKSNNKLKELL FT TLIMSHGLKVTIDTMTRISSSSESCIDNILTNINDGLYESEVFNPCLSDHY FT AQHITLFNKNTLKPRSTTVTIRNKSNRNMRLFIEALETVNLNFQLKDVEES FT TNFIINTYSYLIEKFFPYKEIECKDSYVPLKWFSDELRNLRNNVSILKTVS FT LVSKDPIDIKAYKSLHSQYKKNIATTKKNAYNQFLKNSVNKSRDVWKLINF FT ERNCNPHRINNEISSDEFNNFFCSIAEKIVNSIQPNSFPNTSQSFNRTINT FT QSFFLEPVSQKEVSNVINKLNNSKCCDIYDLDSSILKSSEFIVTPFLTELI FT NKCFVSGVFPNKLKCSKIIPIFKKGNKMNVENYRPIAIIPILGKIIEILVK FT ERLIRFFEKYNLLSNSQFGFRKGRCTITALRDMVEHVVECLDGGHAIGAVL FT CDLSRAFDCVSHSILLQKLEHYGIRGTPLKFFESYLQNRKQMVLSNNKYSN FT IGKIEHGVPQGSVLGPLLFIIYINDLSAHLSPIKNILYADDATLLFASKNR FT NEINLQFEDGLRKAEQWFNNNCLKLNKDKTQKIIFSFNNNHIRGVQVKLLG FT LIVDDGLKWNYHIDYLCRKISSQLFVLRQLKKVLNFNNLVTTYFSLIHSVL FT SYGILLWGNSCHSIRVFKLQKKAVRILAGITQRESCRPWFHKFQIMSLPSM FT FIYSTLIETHHSNNFQTHGELHGYNTRGAHLLVTTKHNHVTTDKNCLNLNI FT FNKLKRETKLLQHKQFKQTVKNYLKDNVFYSITEFLTSNITI" XX SQ Sequence 3970 BP; 1437 A; 656 C; 648 G; 1229 T; 0 other; gctgtgatgg ctggcgacgc tacacaacaa gccaatgtgc tctgtaagaa atgtggctca 60 aaagtggtga attttgtaaa gtgcaagctt tgtcaacacc agtttcatcc gagttgtgct 120 aagattttga atgcaacttt ccacgacaac aaatctatca cgtgttgtga aaaacacgag 180 atatttgaag atgcatacga tgggaaaaca tctgacgacg agataaattg ggaaaatgta 240 agtccgcaac ttttctcgta tattattaaa gaaaaggacg ccttaatcga agaattaaga 300 ggtaaaattg cattattgca aatttgcatt caagaaaaaa aataaatcgt tgtctccaaa 360 gaaacattct cctccgatga tcgccgggca gtcgccgcca gcgcatacag tcgttgataa 420 caaatcggca ggtagaaaca aaaacgaaat caaaaataaa gataaactaa acaaagtcgt 480 attcgaaaag gaccagctac aagcccaaag taatacgact gctacaaaaa tagttgagaa 540 caataagaaa agcatgcctc gacatgaaac tccgaaaaat gaagactctg tcattgaaga 600 aagaaaaaat gattggaaaa ctataactca caagaaatta gacagaaagc aacaaaggaa 660 tgctgttata gggacacacg cttgtgacaa aataaaaaca gcacccaaaa aggcttttct 720 ctatgtttca agactctatc cagaaaccaa aagtgaagac ttggtgagct tacttcagaa 780 tacttttcca gaagtaacat gtatgaagtt ggactccaaa tatccagagt actactcatc 840 ctacaaggtg acaattaacc taaacaatct agaacacgca atgaaatcag aaatttggcc 900 tgaaggatca tatgtaacgc ggtattttca tccacgtcgg tcgaagacca gcagtcaata 960 acacccgact ctttgcagca atgcaaaggg tcgcttaaaa tattgcattt aaatattcag 1020 tgcttgtcaa ataaattaaa tgaaattctt ttattcttgg aaaaccatag agattatgat 1080 gttttaggtt ttagcgaaca ttggttaagt gaatatgaaa tacagtcaat caacttgctc 1140 aatttcaaat tagcgacttg tttctgcagg cgcatattac gtcatggagg aactgttctt 1200 ttcgtaaggg atggtataaa attcaaacat ttggatcttt cttgcttctc catagagtgt 1260 gatgcagagt ttagtgggat ctatctaaat gaggtaaaaa ctgttgttat atgtttgtat 1320 aggtctggtc tgggcaatgt agatatattt ctagagcaac ttggtctgtt gctagatcat 1380 attcaaaata aaactacaga tattgttttg atgggtgatt ttaacataga cttcaaaatg 1440 cataaatcga ataataaact aaaagagttg ctaactctaa taatgtctca tggtttaaaa 1500 gttactatag ataccatgac aagaattagc tcttcatccg agagctgcat tgataatatt 1560 ttaacaaata taaatgatgg gttatatgag tcagaggtat ttaatccttg cttatctgat 1620 cactatgctc aacacattac attatttaac aaaaataccc taaaaccaag atctaccact 1680 gtgacaatcc gtaataaaag taaccgcaat atgaggcttt ttattgaagc gctagaaacc 1740 gtaaatttaa attttcaatt gaaagatgtt gaagaatcaa ctaactttat aattaacaca 1800 tactcttatt tgattgagaa attctttccc tataaagaga tagaatgtaa agattcctat 1860 gttccgctaa aatggttttc tgatgagttg cgtaacctca gaaataacgt ctcaatacta 1920 aaaactgtaa gtctcgtgtc taaagatccc attgatatta aagcttataa aagtttgcat 1980 tcacaatata aaaaaaatat tgcaacgacc aaaaaaaatg cttataatca gtttctaaaa 2040 aattcggtta ataaatccag ggatgtatgg aagctaataa attttgaaag aaattgtaat 2100 ccacacagaa ttaataacga gatttccagt gatgagttta ataacttttt ttgttctata 2160 gcagaaaaaa ttgtaaactc aatacaacca aattcatttc caaatactag tcaatccttc 2220 aatcgtacta taaatacaca atcatttttt ttagaaccgg tgtcacaaaa ggaagtcagc 2280 aatgttatta ataaactaaa taattcgaaa tgctgtgata tatatgacct agattcctca 2340 atacttaaat cttccgagtt tatagtaacc cccttcttaa cggagcttat aaacaaatgt 2400 tttgtttcag gagtttttcc caataaacta aaatgtagca aaatcatccc aatttttaaa 2460 aaaggtaata aaatgaacgt tgaaaactat aggcccattg caatcattcc cattttaggc 2520 aaaattattg aaatcttagt gaaagaaaga ttaattcgat tttttgaaaa atataatcta 2580 ctaagtaaca gtcaattcgg tttcagaaaa ggccgttgca caatcactgc tcttcgagat 2640 atggtggaac acgtagttga atgtttggat gggggacatg caataggtgc agttttgtgc 2700 gaccttagta gggcttttga ctgcgtgtct cactctattc ttttacaaaa gctagagcat 2760 tacggtataa ggggtacgcc attaaaattc tttgaatctt atcttcagaa tcgaaaacaa 2820 atggtactct caaacaacaa atactccaat atagggaaaa ttgaacatgg cgtaccccaa 2880 ggatccgtac ttggaccgct attgtttatc atttatataa atgatttgtc tgctcactta 2940 tcacctatta aaaatatttt gtatgctgat gacgcaacgc tactctttgc ttcaaaaaat 3000 cgtaatgaaa tcaatcttca atttgaagat ggattgcgta aagcagaaca gtggttcaac 3060 aataattgtc tgaaattgaa caaagataaa actcaaaaaa ttatcttttc gtttaacaac 3120 aatcatataa ggggtgtcca agtgaaactt ttaggtttaa tagttgatga tggtttaaaa 3180 tggaattatc atattgacta tttgtgcaga aagatctcga gccaactgtt tgtacttcga 3240 caacttaaaa aggtactcaa ctttaacaat ctagttacca cctatttttc actaatacac 3300 agtgttctca gttacggaat ccttctttgg ggaaattcat gtcactcaat aagagtattt 3360 aaacttcaga aaaaagcagt gagaatttta gctgggataa cacaacgcga aagctgtaga 3420 ccttggtttc acaaatttca aataatgtcg ttaccctcaa tgtttattta ttctacttta 3480 attgaaaccc atcattcaaa taattttcaa actcatggtg aactgcatgg ttacaataca 3540 agaggagctc atctcttggt tacgacgaaa cacaaccatg taacaacaga taaaaattgt 3600 ttaaatctaa atatatttaa taaactcaaa cgcgagacca aactcttgca acacaaacaa 3660 tttaagcaaa cagtcaaaaa ctatcttaaa gataatgtat tttattccat cactgaattt 3720 cttacgtcaa atattactat ataactaagt attcattgaa accagtagat atatttgtta 3780 aaatttaaat tgtaggtcac tgtttttaac atttaaatat tgtctgtatt gtcttattat 3840 ttacaacttt gtatgttgag ctgttgactt gtatacgtat gttttttgtt atactgacgt 3900 gtctgtaagt tcttttgtaa ctgttttgac gaataaatta ttattattat tattattatt 3960 attattatta 3970 // ID Chapaev-12_HM repbase; DNA; INV; 5680 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5680 RA Bao W. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(12), 1827-1827 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1037..1639,1612..3288,3263..4222) FT /product="Chapaev-12_HM_1p" FT /translation="MYSTEREILKLYIGWEDGWENECNFPFVKRLIXTDKI FT TNNVISILMTVFHLRSDLFSFISFLLPIXFNMPSIDEFDKMCRKLKRXKKD FT SKKNFEASKNEAFFKMQLSPAPQTKKQKKSADADDFFILNRNKELSSQVDQ FT LTSENKTNVEQIDKLKMQVFKLEAEITDLNSKNKALNTHCNSLSSQLETMS FT IRLKQEEQFQLAKTGGAVPTLKKKKKLKTAQRKFASARAKNVRLLKKIKFE FT KKNAKISDINQNSCQICKCFASSDKDLVVVDSYLDYDNIGSSELFCGFKLS FT NPQTISVKKPIVGKKYTYIKLGNGKPYSKLGRHAAARRSKLAFIFLRLLAC FT SDLESDLSMLFCDSMSCEKSIFRCVANKLGLVRQLSVSDAVELQSQLRMPT FT AEFRRLRRILCNLGARILPSEPKIRQEQEMRTMHVNKEHVTVKSMLLYPSA FT NEAPCLVSVLMVKNLTTYIEEVFNSLHVRDHLNSDVGFGEEVWLLFAGDKG FT GKYMKFHFEVVNSKSSGSVYDVHLFCMYQGSDCRENIALVLGHYANDIKRI FT QSSDFKLKGKIVRLFLGGDFHFIDDVLGHQGSAASFPSSTDLVKLNSLRNH FT ASMPHTPANCNILRRTIQSLESAYNENLCENRQGGNLRALGKFHNSIIAPV FT IFPIITLDNVVPPVLHIMLGVVLKLYKLLLKECKTLDCQAVSSLSHNENVR FT TNELWAAKSVECSKTAEALRLLGNQFVDVANLQARWLAFNKNISELECISL FT NSSNKKKTLTPPIKKKPCKQQEMCASIRCLITKYDCNIDWVQCGTCQKLFH FT QFCEIIGDSEKSVLCDVEKYECLACQGEDIECLDLYISEKISFIRAQREVL FT DVEYIRLQMECEDLEKISKSSMGTHERSLYKALEDMKVERQAYHGNVFVGN FT HCKIVLRNHYKLCSVIDDEAIKNRFVRVFSVFNKLQPILFKRKFLSDAEIT FT ELHQYINLFAIEFHACFPKESITRKMHELIFNVPLFVKLHKTIGLLSEEEG FT ECLHNSVNKELRQLHSVRNQEQKLHLVLKRFELHSKADRKLQASKVRKCVV FT CSERGENSFYRQGLCLVCGHQI*" XX SQ Sequence 5680 BP; 1937 A; 742 C; 951 G; 2035 T; 15 other; cacggctgtt tcactcaata taaaagcgag cctgacgtca cgagtttatc aaaaaaaaaa 60 gtatttccgc ataaaaagta taaaaattca ctaactttaa aataatagta ttttttttct 120 ataaatgctt tcatcttcaa ataaaaaaaa tataaaagat ctatctttcc ttgttaaata 180 aatatttaca ttaaacggaa taaaaataaa aattgtaaaa atattgttaa taatttggca 240 tcacttttct gatttttgcg atttttacgs aaaacctgtc ttgggccaac taagatttga 300 taacatagct acatgcttga tggttgaaat ttagaacact tattaatgaa tgtccgttct 360 atctagattt ccgcagactt tgcaagtttt tatatagaaa gttagttttt aatgcattta 420 tatttttgtt tttattcgtt ttaaatgaaa aacataacat ttttgaaaag tctgcgaatt 480 taktgaaatt tcaacctttt atgcatatga aaatcaaaaa attattttct gtacctctag 540 ctatgttatc aaatatgttt tcaaacttgg catgtttagc aacaattttt ttagtttttt 600 ttgcataaat ttttgtttac attttaaagc gatgacgtaa cctttaatgc gttacgtccc 660 aaackattat ccgatgaatt ttaagaaaac aaacgwttca aacgagacaa attttatagt 720 ataaattaaa caaagtttcg cggattaatt tctcttggat tatttgacta atatcagata 780 agttgwttaa tccgatttta tctagtgcat ctatttgtca atattattyt atttaagttg 840 tttgtattat aattttatta ctatatcaat aaagattata aagcaaagtt tcgtggatta 900 tttcctcgtg gatattcgac taatctcaga taagttgctt aattctttta tttttagtgt 960 atttatttgt aaatattatt ttatttaatt tcttttatta tattattatt ataatatcat 1020 tatcattatt tatgtgatgt attcaactga gagggaaatt ttgaaactat atattggatg 1080 ggaagatggt tgggaaaatg agtgtaactt tccttttgtt aagcgattaa ttarcactga 1140 taagatcact aacaatgtga tttcaatttt aatgacagtt tttcatctaa gaagcgattt 1200 gtttagtttt ataagttttc ttttgccaat tgaktttaat atgcctagta tcgatgagtt 1260 tgataaaatg tgccgaaaat taaaacgama aaaaaaagat tctaaaaaaa attttgaagc 1320 ttcaaaaaac gaagcatttt ttaaaatgca actgtctcca gctccacaaa caaaaaaaca 1380 aaaaaaatca gcagatgcag atgatttttt tattttaaat aggaataagg agctaagtag 1440 tcaagttgat caattaactt ctgaaaataa aacaaatgtg gagcaaatag ataaattaaa 1500 aatgcaggta tttaagttgg aagctgaaat tacagattta aattctaaaa acaaggctct 1560 caacacacat tgtaactcat taagttctca attagagact atgagtatta ggctaaaaca 1620 ggaggagcag ttccaacttt aaaaaaaaaa aaaaaattaa aaacagcgca gcgtaaattt 1680 gcatctgctc gtgccaaaaa tgtaagattg ctaaaaaaaa ttaaatttga aaaaaaaaat 1740 gcaaaaatta gtgatataaa ccagaacagt tgccaaattt gcaaatgttt tgctagcagt 1800 gataaggatt tagttgtcgt tgattcttat ttagactatg acaacattgg ttcctctgaa 1860 ttattctgtg ggtttaaatt aagtaatcca cagactatat cagttaaaaa gccaatagtt 1920 ggtaaaaaat atacttacat taagttagga aatggtaaac cttattctaa gttaggtagg 1980 catgctgcag ctagacgatc taaacttgct tttatattcc ttcgattatt agcttgttca 2040 gatttagaaa gtgatttgtc tatgttgttt tgtgattcta tgtcttgtga aaagtcaata 2100 tttcgttgtg ttgcaaataa gttgggttta gttaggcagt taagtgtttc tgatgctgta 2160 gagctgcaga gccaattaag gatgcccacg gctgaattcc gtagrctaag acgtattctg 2220 tgtaatcttg gtgcaaggat tctcccatct gagcccaaga taagacaaga acaagaaatg 2280 cggacaatgc atgtcaataa agaacatgtc actgtgaagt ccatgttact ttatccatct 2340 gcgaatgaag caccttgttt agtatctgta ctgatggtca agaatttgac aacttatatt 2400 gaagaggttt ttaatagtct tcatgtacgc gaccacctta attctgatgt cggttttggc 2460 gaagaggtat ggctcctttt tgctggggat aagggtggga agtatatgaa attccatttt 2520 gaagttgtca attctaaatc atcagggtct gtatatgatg ttcatttgtt ttgtatgtat 2580 cagggttcag actgtcgtga gaatattgcc ttagttctcg ggcattatgc caacgacatt 2640 aaaagaattc agtcgtcaga ttttaagcta aaaggcaaaa tagtcaggtt gttcttaggt 2700 ggagattttc attttatcga tgatgttctg ggacaccagg gttcagctgc tagctttcct 2760 agttctacag acctagttaa gttgaattct ctaagaaacc atgcttctat gcctcacaca 2820 cctgctaatt gtaacatctt aaggaggaca atacaatctc tggaaagtgc ttacaatgag 2880 aacttgtgtg aaaatagaca aggtggaaac ctaagagctc taggtaagtt tcataattca 2940 attattgctc ctgtgatttt ccccataata actttagata atgtagtgcc acctgttttg 3000 catattatgt taggagttgt gttaaaactg tataagttat tgttgaagga gtgtaagact 3060 ttagattgtc aagctgtttc gtctttgtcg cataatgaga atgtgagaac gaatgaattg 3120 tgggctgcta aaagcgttga atgttctaaa acagcagaag ctttacgttt attaggaaat 3180 caatttgtag acgttgcaaa tctacaggcg cgttggttag cttttaataa aaatatttct 3240 gaattggaat gcatatctct aaactcctcc aataaaaaaa aaaccctgta aacagcaaga 3300 aatgtgtgct agtatcaggt gtttaattac waaatatgat tgtaacattg attgggtgca 3360 gtgtggtacg tgccaaaaat tgtttcatca attttgtgaa attattggtg attcggaaaa 3420 gagtgtattg tgtgacgtag agaagtatga gtgtttagca tgtcagggtg aagatataga 3480 atgtcttgac ctttatattt ctgaaaaaat ttcttttata cgcgcacaac gcgaagtttt 3540 agacgttgag tatattcgat tgcagatgga gtgtgaagac ctagaaaaaa tatctaaaag 3600 tagcatgggg actcatgaga ggtctcttta taaggccctt gaagatatga aggttgagcg 3660 acaagcttac catggtaatg tatttgtagg taatcattgc aaaatagtat taagaaacca 3720 ctataaactt tgtagtgtta ttgatgatga agcaattaaa aatagatttg tacgtgtgtt 3780 tagtgttttt aataagttgc aacctattct ttttaagcga aaatttcttt ctgatgccga 3840 aattactgag cttcatcaat atataaattt atttgctatt gagtttcatg cttgctttcc 3900 aaaagaaagt atcacgcgaa aaatgcatga actcattttt aatgtgccac tatttgtaaa 3960 attgcataaa actattgggt tgctgagtga agaggaggga gagtgtttgc ataatagcgt 4020 aaataaggaa ttaagacagt tgcattcagt aagaaatcag gagcaaaagc tgcatcttgt 4080 tttaaaacgc tttgaattgc acagcaaagc tgataggaag cttcaagcct ctaaggtgag 4140 aaagtgtgtt gtgtgttctg agcgtggcga aaattcattt tatagacaag gtctttgtct 4200 tgtttgtggt catcaaattt agtttaaata tttaaaaagt taaaaaaaaa aataaacttt 4260 ttacatctaa gataaattat ataagatcct taatttataa ataagagaat tttattctgt 4320 tttaatgtct gtgtaatgtt wgtttaaaat attaataatt ttttattata ttgattttat 4380 atattaggta ggtgcccatt ctgttcctat cttcaraagt taattatacc tatgcaatac 4440 cttcgctcag gttaaagaat ggaataaaag tctagcatca aatgtatgta aaaatctagc 4500 tattatatat taacttgtga tcagagcaac tggtatgttt gttggatatg atcttgtatg 4560 tttgtgtagt taatttaaaa aatgtattgt ccttcgttat gtaatatgta acatttttta 4620 tatatattag atctccattc gatttttatc atcagaagtt agttctacct ttgcggtacc 4680 tctgctctac atacctcctc tctggttaaa gaatggaata taagtctcat atataacaag 4740 ttggtgtaaa aatttagttc gataaattaa gaatttgtaa actttattat cacaattgtc 4800 gagtcgagca aatgttcttc tggtttaaaa tgaattatgt atgaatctag tatcaactgg 4860 gtgtacaaag atagtttact acttaaagaa cttatcaatg gaccaactga ttataaaaca 4920 gtttttaatg agaataaaaa caaattatca aaagatttaa attagctatt tgtgtatatt 4980 taaacacttt attttttgtg agaagagatt attgatacaa gtaccataaa ataattctac 5040 taccaaaagc taggttaatt caaaaccttt aactcaattt tggtcatgaa ttattttatt 5100 agaattgatg agtttttatt tattaactac ttaacaccat attattagaa ctttattttg 5160 taatgtaatg attttatcgg cgttcgaaaa taattttcta aatcaagaac ttctatattt 5220 aaaaaagtac tcatttaaca gctaatcccc ctttataacg acatttttca taattttgcg 5280 aaaaatgcgg tttgctttcg gaattcaata aagtctgttt tagttaacct taaacaatta 5340 agtaaggttt ttttgttttt ataatagtca aacaatgagt gtatttgata ataactatca 5400 gaaatatatt tttctgattt ttttttgata aaaagtaaca tcaaattact aaaatttagc 5460 caarttcaac atttgtcgtg tttattctaa actaaaagaa gtttttttac taatgcgtta 5520 aatgtcataa gataaaacat tttattggtg tttttattct ttttacaaaa caaaattatc 5580 tctaatattt caaaagttat aaaagtttta gtaacacccc ttwagtcaat attttcaaaa 5640 aataacaaga agccgttaag ctcggtttga aacagccgtg 5680 // ID BEL-222_AA-I repbase; DNA; INV; 6604 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-222_AA_; KW BEL-222_AA-LTR; BEL-222_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6604 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 901-901 (2011). XX DR [2] (Consensus) XX CC Positions [5546-6127] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1191..2252 FT /product="BEL-222_AA-I_2p" FT /translation="MAERLIKEKIKKRERMFASLKRHQQFLESYNQEIQSG FT QVQSRLDKLDDKLAEFEQLQEEIADLDDNEQYEEESNMVAERFENQYYAIR FT GALLMKIAPPVITPPPLDATVLRNTTAAGFHSGVRLPQISLPDFNGDYRNW FT LSFKSTYESLIHESTELSDVQKFHYLKSALKGEAAKLIDSLTITSGNYVIA FT WETITKRYSNEYLLKKRHLQALMEYPKIQQESATAIHGLVDEFEQRLKILK FT QLGEKTDTWGALIVHWMCSKLNDQSLQLWEDHAASLHEPTFTDLVNFLEKR FT TRVLEAVSSNVSESSKSSQKASTKYSKQMVHAATRNGGERSSSSVACPCCG FT ENHYMARCAKF" FT CDS 4274..5887 FT /product="BEL-222_AA-I_1p" FT /translation="MQQLALLQTNWDDPVPIQLENKWTEYYGQLNRLSELR FT IERFAFIPQWVDVQIHCFADASELAYGTCLYVRTIDQAGNIYVEMLSSRSR FT VAPLKRLTLPRLELCAAKEAANLHSKVIKALDLGQIRSTFWSDSTIVLHWL FT EAPPNNWKTFVANRVSAIQALSQGHRWKHVAGKENPADLVSRGMTVHDFLQ FT SQIWRNGPPWLRTPEETWPSEAQEINFTDEELESRKVVSYATNVSLEPNVI FT FSLRSSLETLLRIVAYCLRFCHNCRLPDQRNRTPYLTVEEISSAKLALVKL FT AQNECFYTDLQDLHKHGNVSRKSSLRRLSPFLDKDHVIRVGGRLRHSDQEY FT TAKHPAILPSKHPFTNLLVNYYHRQTIHGGRQLTLSTMRQDFWPIHGKRVV FT DGVLRKCYRCFRVNPTPVQQPTGQLPSTRVRPSRPFSITGVDYCGPFYLKP FT PHRRAAPPKAYISVFVCFATKAMHLEIVSDLSTTGFLAALHRFVGHHGIPL FT EIHSDNAKNFTGAKNELNELFNILNDKSSQERIGNELSLQGI" XX SQ Sequence 6604 BP; 1846 A; 1445 C; 1536 G; 1754 T; 23 other; gagaacatta tggtgccgtg accaggatgc gtggttgaga aaggagccat tgtacaggag 60 atcattgttc ggaatttggc ggtttcgtgg catagagttc gtcattttgc tggcagtcgg 120 agatagccat tccatcgcca taagtatcac tcgaaatatt tcaagcacca atccaacatc 180 gagcgaagcc atcacatcat ctatctcatc tgacgtcact ccagagaaga acgactagca 240 atcagttgta acacagagtg gtgattgtac ccaatcaatc cagcgttatt ggaccaacat 300 acaaggcctt atccctcagg ccaccaggtg agtggcattc caaagggttt actgtacgtt 360 tcggtgctac ttggatatct ttgtttcttt ctgtgttcaa cacgatcgat gcgaggagag 420 caacgtcgat gattggattg gactctcgga ccagagaaat tcatcggtcg taaatagcgt 480 gattaggaca caagggacgt ttatatgccg acaaggagtg tccgtattcg aactgaggct 540 cgtttggtgg atttaaggat atacaaggtc tcatattgac agacctcagg tgagtgtctg 600 tccatcactt ttcattcgtc tttgaggtgt tacgtggttc ttctttttcc cgtcctatgc 660 attgctgtac tgctacggag tcgattcagg aggactactt ctgacgcaat cacggattgg 720 cggctgtata cgctcggaga acgctgtgta caattaaacg acgcaaaatt tcggtgttat 780 ttgatctacg acatcaacgg tgacaamatt acaaggctgg tgagctgctc gttgaaagtc 840 tgctgaacgg aatttggata aatatacaag gcttacattc ggtaggccac aggtgagtgg 900 ccttccaatc catacmtatt ttctactggt acgcttatgc gtttattcct tttcggtttc 960 tgctgagaac gacatttcgc cacacgaagc gttaattaaa ccacacgtcg aggccgattt 1020 tccggaacca gaagctgcat ttattctwta caaggcctat tttggacagg cctcaggtga 1080 gtacctgtcc aaaatataat acttcggttk gcggtacttc ctggttccgt gtggtttctt 1140 tctttcccgg ttacggagtt tcgacgtgtt gatttggtca ggtgttaggc atggcagagc 1200 ggttgattaa ggaaaagata aaaaagcgag agcgaatgtt tgcatcgctc aaacggcatc 1260 aacaatttct agagagctat aatcaagaaa ttcaaagtgg gcaggtacaa agcaggctgg 1320 acaaattgga tgacaaactg gcagagttcg agcaacttca agaggaaata gcagatctgg 1380 acgataatga gcaatacgag gaggagagca acatggttgc tgaacgattc gaaaatcagt 1440 attatgcaat ccgtggagcc ctactgatga agattgcacc accggttatt acaccaccgc 1500 cgttggatgc gaccgtcttg aggaacacaa cagcagcagg tttccattcc ggtgttcgtt 1560 taccacagat atcgttgcca gacttcaatg gagattatcg gaattggctc tcgttcaaat 1620 cgacgtacga atcattgatc catgagtcga cggaactgag tgacgttcag aagttccatt 1680 atctaaaatc agcactcaag ggagaagcag cgaaactgat tgactcatta acgatcacta 1740 gcggaaacta cgtgattgct tgggaaacta tcacaaagcg atactckaat gaatatcttt 1800 tgaaaaagcg acatctgcaa gcactcatgg aatatcccaa aatccagcag gaatcggcaa 1860 ctgcaatcca cggattagta gatgagtttg agcaacgttt gaaaattctt aagcagcttg 1920 gagaaaagac cgacacctgg ggagcgctga tagtacactg gatgtgttca aagctcaatg 1980 atcaatcgtt gcaactatgg gaggatcatg ctgcttcgtt acatgaacca actttcacgg 2040 atttggtgaa cttcttggag aaaaggacac gggtgttgga ggctgtatca tcgaacgtat 2100 cggaatcgag caaatcatct cagaaagcgt caacaaaata ttcgaaacaa atggttcatg 2160 ctgctaccag aaacggagga gaacgatcat cttcatcggt ggcgtgtccg tgctgtggag 2220 aaaaccacta catggctcga tgtgctaaat ttmtgaacat ggacttgaaa gaaaaactgg 2280 agttggtgaa caataagcgg ctgtgcagca attgctttcg gccaggtcat tgggtacgcg 2340 attgtaactc aactttcagc tgtcgtacat gtagtaggaa acatcattcg ttgattcacc 2400 ctggctttcc tcctggcaat ggaacwactt cgaatggcaa gtctggtggc ctacaagcga 2460 aaaatwccga gmctccatgt gcatcaacaa acgtcgtwac caatggatct gaagtcgakc 2520 agtgcgaaat agaacaagaa ggagcagtag gagcttacaa tgcaggaacc aaaggagcaa 2580 actctagcgt ttttctktcg acagtggtgt tgacaattct ggatcaaaat gggaacaagc 2640 agttggcgag ggcgttgttg gataatggat cagaagcaaa tataatgact gaacggatga 2700 gtcaaatcct caagttgaag cgacgttctg taaatgtacc tgtttgtgga gttggcgaat 2760 ctgaaactag agcaagacat gcagttagta cgatcatcag ttcgagagtg acagacttct 2820 cagttggtgt ggacttctta attctgcaac gggtgacatc tgatctaccc tctacaactg 2880 tatcaataac acaatggaaa attccggaaa atttacgctt agcggatccg aatttcaaca 2940 actccaaccg gatcgatctg ctaattggag cggaacactt ttatcgattt ttgtacgagc 3000 gtgaaatgaa gcgagtctcc ttgggtccag ggcttcccat cttggtagaa acagtttttg 3060 gttgggttgt cacggggaaa catacggtag caaaaaatca accgatcaac tgctgtttag 3120 caacgtcaac ggaaaatttg gaagtagcgc ttgaaagatt ttggaaggtg gaaaccactg 3180 aagaccttta ctctaaggag gaacaggact ctgaaacaca cttcaaggaa actcacactc 3240 gtttggagga cggaagatat gttgtgtgtc ttccgaaaca cgtcaacttc gacaggatgg 3300 taggcgattc acgctccagg gccttggaca ggtttttgaa tatggagaaa cggttggaac 3360 gcaacacaga gatgaaaatc caatatcatg cctttangca ggaatacctg gatctcggcc 3420 acatgagaaa gctcgatgac gacgaaatac aagcaacatg tcattcgaaa gcacggaaaa 3480 ccttttntct accgcatcat gcagtcctaa aggagtcgag caccactact aaagtgcgcg 3540 tcgtgtttga tgggtcagca cgcaccgata gnggctattc attgaatgac gcactattga 3600 aaggcccaat aattcaggat gagcttctta gtctcttact acgttttcgt aaatacgagg 3660 tggccctagt tggagacgta gaaaaaatgt atagacaggt tcgactacac actgatgata 3720 ctccccttca acggattttt tggagatttt ccacggtcga tccagttagc gaatatgaac 3780 tgcttactgt tacntatggt ttaacaccgt cgtccttcct atccatccgg gctttacatc 3840 agctggcagc agacgagggg actgactttc ctgttgctag tgcttctttg gttcatgatt 3900 tttatgtaga cgattacatt ggtggagctc ccaatgtcaa cgaagctatc cagctccggc 3960 aagacttgac gcgcttatgt cgaaaggagn attcgtgcta cggaaatggt gctccaatcg 4020 tcccgaggtg ctgttagacg ttccgccaga tcaattgggt accaatcttt cgattacttt 4080 tcaattaaat ccagatgagg aaatcaaaac attaggcatt acctgggaac ccagctctga 4140 caagcttcgg tttagtgttg atattgaaga caggagaaca gtttggacca ggagatgcat 4200 tctatcctgt atcgccaagc tttttgatcc cttgggattg atttccccag ttgtcgtata 4260 cgctaagatt atcatgcaac aacttgctct gctgcagact aattgggatg atccggtacc 4320 aattcaattg gagaacaaat ggactgagta ttacggacag ctgaacaggc tctcagagct 4380 tcgaattgag cgattcgctt tcattcctca atgggttgat gtacagatac attgttttgc 4440 tgatgcgtcg gaactggcct atggaacatg tttgtatgtt cgcacaattg atcaagctgg 4500 taatatttac gtcgagatgt tgtcatcaag atcccgtgta gctccattga aaaggttgac 4560 tcttccacgc ttagagctat gtgcagctaa ggaagccgct aatctgcatt cgaaggttat 4620 caaggcactt gatctaggcc agattcgctc taccttttgg tcggattcaa ckattgtcct 4680 gcattggcta gaagctcctc ccaataattg gaaaacattc gttgccaatc gagtctccgc 4740 tattcaagct ctatctcaag gtcatcgttg gaaacacgtt gctggaaagg aaaatccggc 4800 agacctagtc tctcgaggta tgaccgtcca cgactttctt cagagtcaga tatggagaaa 4860 tggtccacct tggttgcgaa ctccagaaga aacatggcca tctgaagcgc aagagataaa 4920 tttcaccgat gaagaattag aatccagaaa agttgtgtca tacgccacaa atgttagttt 4980 agaacccaac gttatttttt ctttgcgctc ttctttggag acactgctgc gtatcgtcgc 5040 ttactgctta cgattttgtc acaattgtcg attgcctgat caacgaaatc gtacaccata 5100 tctgaccgta gaagagatct cgtccgctaa attagcctta gtkaaactgg ctcagaatga 5160 atgtttctac accgatcttc aagatttgca caagcatgga aatgtgagtc gtaaatcgtc 5220 actccgacgc ctttctccat tcctcgataa ggatcacgtt attagagttg gaggtcgatt 5280 acgtcattcg gatcaagaat acactgctaa acaccctgca attcttccaa gcaagcaccc 5340 gtttaccaat cttctcgtca attattacca tcgccaaact atccacggtg gacgacagtt 5400 gacattatcc actatgcggc aagatttttg gccaattcat ggcaaacgag tggtcgacgg 5460 agttcttcga aagtgttatc gctgttttcg cgtcaaccct acaccagttc aacagccaac 5520 tggtcaacta ccaagcacac gtgttcgtcc gagcagaccg ttctctataa cgggagtgga 5580 ttactgtggt ccgttttatc tgaagccccc tcaccgccga gctgcaccgc caaaggccta 5640 tatctcagtg tttgtctgtt ttgcgacgaa ggctatgcac ttagaaattg taagcgattt 5700 gtcaaccact ggatttctcg ctgcactcca ccgctttgtt gggcatcatg gaataccgct 5760 cgagattcat tcagataatg ccaaaaactt tactggtgca aaaaatgagc taaacgagct 5820 tttcaacatc ctgaatgata agtccagcca agaacgtatt gggaacgaat tgtcccttca 5880 aggaattwcg tggaagttca tcccccctag agctcccaat tttggaggac tttgggaagc 5940 tgcggttaag tctgtcaaag cagctcttaa gaaggaaatt ggaattcatc agctaagtta 6000 tgaaagcttc gccacattgc ttgtccaaat agccgcagcc ttgaactcga ggccsttagt 6060 cccactctcc gatgatccgt cggatttaaa tgcgctcact ccatcccatt tcttgatcgg 6120 tgccccaatg aatgccttgc ccgaaagaga tctcctcaac atccccgcaa atcgccttag 6180 ccaacaccaa caacagcagc aaatgttcca aaaatattgg gaccgttgga gcagagatta 6240 tctcgccgag ctacaaacca ccagcaagaa ccttcaaccc agccccatac gcattggaag 6300 tttggtagta ttgcgtgaag acaacatgcc tccgttgtac tggccgttgg ctagaatcgt 6360 agatgttcat ccgagtgctg aaggagtcgt acgtgtagtc actgttaaaa cagttagcgg 6420 aatatataag cgacccgttt gccgaatatg tccattacca ccagataagg agaccgtcat 6480 cgaggaggga tcctcaagta gtatgtaaga ctgawtcgat gttagatgta akattataat 6540 tttattattt gaaccatttg tatgttggta aattgttgaa actagttcaa gggggccggt 6600 atgt 6604 // ID Copia-1-LTR_DY repbase; DNA; INV; 205 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon Ty1-copia like, long terminal repeat portion DE from Drosophila yakuba- consensus. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1-LTR_DY. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-205 RA Bao W. and Jurka J.; RT "Copia LTR-retrotransposons from Drosophila yakuba."; RL Repbase Reports 9(2), 476-476 (2009). XX DR [1] (Consensus) XX CC The terminal is 5-TG, TA-3'. XX SQ Sequence 205 BP; 64 A; 40 C; 32 G; 68 T; 1 other; tgtagaaaat agaggccctg tatatgccct gtatgtatac aatgcacgct tgtattaccg 60 atcatagatt tatcccgctc taacgaatta ggaataagaa aatatgtata ataattttgt 120 catttctttc gttcatcrtt tagtcgcaaa taaaattccg cagcgaacac acggtgtttc 180 tcttataaaa ttctgccgct caata 205 // ID BEL-18_DPu-I repbase; DNA; INV; 5640 BP. XX AC ACJG01001526; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-18_DPu_; KW BEL-18_DPu-LTR; BEL-18_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5640 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (09-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01001526; Positions 19269 13630. XX CC Positions [4691-5272] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(50..2530,2534..5638) FT /product="BEL-18_DPu-I_1p" FT /translation="MARGRKSKAATASQNDNTVASRLRSRTVPANGEDEST FT PPADPTQQVKNISRVPQPETIERDATRKQIQMTCSCIDKLIKERGSRRACL FT GLLNKLDDMLATVERLNLLVVDPSDADEFSKQAQIQIDLFTLIESKKEDVE FT AYVTLRADDESSVGSCMQSIKGIPLATRPSAPDNEKSKKLLAEAQKRAAEA FT QQEAAQLQKRLEEVNKKADLANKEASKLSDAAMRRGSAPTIISNPFHGWGD FT GGIDDWHRRSTHEPTKEDGEEEAPDDWIDRYCNGLEEANWSYSSKSAARGE FT LPIYSGRATDWFWWSDLLRSMVHDQRIAPGEKLAVLKSRLRGDQFEIVQGL FT GGGESAYKAALFRLKQSAGRRDVMRIALLGELDRMELGRENSSFRRFAERA FT RTIFFDLTRIGEKFNADLIERLSRKLHPADRLEWNSGRRTGLERRPLKEFG FT EWMCERAAAYQNAYSLAAEQILPSGENKFPRRQQFPTTGRTNHVSLEPGTT FT EKPKSRCFCCEGEHRLMACPTFKEMPTKEKVRFCVKRRLCMKCFGTRHSAA FT DCKFGKGCGFSGCPFIHHPLLHDAEEKHDKGSSHHTSLSAQVDRVKVALGV FT IRMEAYTAEGQLIPVSVLIDEGSDTTLFREDFLQRLKIIGKGTTLDLVGVT FT GAESYKSQKAPVRFLLPDGEEALIKGTTIPQISRPTPVIDWRKLKNRWPHL FT ADVPVQESGGKIDVLVGLDHSNLLAVLESRVGGEHEPFASRTRLGWVVRGL FT LGSDIGPMTTAHIHHISATSESSLDEEFQKFCTTENFGTEFKGDGLSESDR FT IAEKIVDEGLKKLDIGYETPLTWLEEPTFENNRKLAEHRWQDLKERFKRDP FT DFEADYRAAIKKYVDEGYASRVKEEDLYSNNQYYLPHHGVYKKVYGKKKKK FT LRVVFDAAAKWKKKCLNDGMRQGRKLQNDLAKVLIRFQLGEIAFAADVTAM FT YSRIRLRPQDARFHRFLWQEKDSDVIITYQMDRLPFGSNCAPFLALKTIHR FT AASDATKGREECMEAVEKNMYMDDLLKAVDNEEQAIMKAKLIRDGLAEGDF FT HLTNWISNSPEFIKALQPEQAVAAAHHDLASDDVEKLLGAFYEPTTDEMTY FT RVTGVEDLKWTRAGLLSKVASIYDPLGRAAPALIKAKIKLRELGTRGLNWN FT EAISGDDKEWWQTWFKTLERLNDLSMPRNLQPDKEKIIQSDLLTFGDASEE FT AYAAAVYLRSVYQDGTSICRLVMAKTKLAPRQTQSIPKLELNAAVLGARLA FT TYVADALQISGLNRIFFTDSSTTRNWIRAVASHYMPFVSHRIGEIQSLTNA FT NEWRFIPGKMNIADAATRSLLTDGEPIPPRWLEGPSFIALPMEQWPKDLPW FT VAVAAEKRTAHIHHAISEPPLQDWTKVIIDSKNISTFLKVEGDAKEIILKC FT QKEGFREEISLLSKGKQISKKSSLIQLTPFLDKDGILRLGGRIHRAKLPYE FT VLHPPILPGKHPLAEKIVTAVHQESHHAGTDFLHAKIRQHLWILQGRELAK FT KIRFNCAVCVKSRAKLATQKMGDLPSARLDSYTAPFTHIALDYFGPIETSA FT YRNRVIKRYGLLITCLVTRNVYVEMVQSMSTPDFLYGLRRFIGEFTKPTEI FT FCDNGTNFVGAEKELATAFRELQKDEKLKEFARERSIIWKFQPPSAPHFGG FT SHESMVKSTKRALYRALDTEKAGLRYPTDEMLRTLLKEVAGMLNGRPLTLA FT SNDPEDFRPITPKDFLNLPPTSDLPAGDIRRALPRDHIRYVKKMANLFWDL FT WTKIFLPTLVPRRKWVAEERNFEVGDSIMLSDNNLLPNQWKTGRIIEVYPG FT SDGFVRVVKVKTDCGEYLRPIHRLVLLEPYSAVLPSASGE" XX SQ Sequence 5640 BP; 1635 A; 1379 C; 1444 G; 1182 T; 0 other; ttggtgatcg tagcaggact ttcttgacct gcaagtttgg agaattgtaa tggcaagagg 60 caggaagtca aaagcagcca cggcttctca aaacgacaac acggttgcgt ccagacttcg 120 ttctcgaaca gtaccagcaa atggtgaaga tgaatcgaca ccccctgctg atcccactca 180 acaagtcaag aacatttcac gagttcctca acctgagacg attgaacgtg atgccacacg 240 taagcagatc cagatgacct gtagctgtat tgacaagctc atcaaggaaa gaggatcaag 300 aagagcctgt ctaggcctac tcaacaaact cgacgacatg ttggccactg tcgaaagact 360 caatctattg gtggtagacc cgtccgacgc tgatgaattc tcgaagcagg ctcaaatcca 420 aatcgacctg ttcacgctca tcgagtcgaa gaaagaagac gttgaagctt atgtcactct 480 aagagctgat gacgaatctt cagtcgggtc ctgcatgcag tcgatcaagg gcataccact 540 ggcgactcgc ccctctgcac ccgacaacga gaagtcgaag aaattgcttg ctgaagcgca 600 gaaaagagca gcggaggccc aacaggaagc tgcccagctg caaaaacgac ttgaagaagt 660 gaacaagaaa gctgatttgg caaataagga agcaagcaaa ctatccgatg ctgccatgcg 720 acgcggttca gcaccaacca tcatcagcaa cccattccac ggttggggag acggagggat 780 cgacgattgg caccgccgtt ctactcacga accgaccaaa gaggatggag aagaagaagc 840 accagacgac tggatcgacc ggtactgcaa cggcctagaa gaagccaatt ggtcatattc 900 tagcaaatcg gcagcacgcg gggagcttcc catctactct ggaagggcaa ccgattggtt 960 ttggtggtcg gatttgctac gttcgatggt tcatgatcaa cgaatcgcac ctggagaaaa 1020 gcttgctgtc ttgaagagcc gactacgggg agatcaattc gaaatcgtgc aaggccttgg 1080 cggaggagaa tctgcttata aggccgcttt attccgactg aagcagtcgg ctggacgtcg 1140 cgacgtaatg cgtatagcac tactcggtga actcgatcgt atggagctgg gcagagaaaa 1200 ttcatcattc cgtcgctttg ccgaaagagc tcgaaccatc ttctttgatt taacccgaat 1260 tggggagaaa tttaatgccg acctcatcga acgactgagc cgtaaattgc atccggcgga 1320 tcgactcgaa tggaattctg gacggcgtac tggattagaa cggcgccctc tgaaggaatt 1380 tggagagtgg atgtgcgaga gagctgcagc atatcaaaat gcttatagtc tggcagcgga 1440 gcaaattttg ccgtctgggg agaacaagtt tcctcgtcgc caacaatttc caacaacagg 1500 gcgcaccaat cacgttagcc tggagcctgg cactacagaa aaaccgaaga gtcgctgctt 1560 ttgctgtgag ggggagcacc ggttgatggc ttgcccaaca ttcaaagaaa tgcctacgaa 1620 agagaaggta cgattttgcg tcaagcgccg tttatgcatg aaatgtttcg ggacgagaca 1680 ctctgcagcc gattgcaaat tcggaaaagg ttgcggattc agcggatgcc cttttattca 1740 tcacccattg cttcatgatg ccgaagagaa gcacgacaaa ggtagcagcc atcacacatc 1800 actttcagcc caggtggatc gcgtcaaggt ggcgctaggc gttatccgga tggaagctta 1860 cacagccgaa ggccagctca tcccagtttc ggtgctgata gacgagggca gcgacacgac 1920 gttgtttcga gaggactttc ttcaacggct taagatcatc gggaaaggta ccaccctaga 1980 tctcgtagga gttactggcg cagaaagcta caagtcgcag aaggccccgg ttcgatttct 2040 tttaccggac ggagaggaag ccctcatcaa aggaacaacg atcccgcaga tatctaggcc 2100 gactccggta atcgactgga gaaaattgaa aaatcgatgg ccacatttag ccgacgtccc 2160 agtgcaagag agtggcggaa aaatcgacgt cttggtggga ctcgaccact caaatctact 2220 tgcagtattg gaatcgcgcg ttggaggcga acacgaacct ttcgcatctc gcactagact 2280 cggttgggtc gtccgcggcc tgcttgggag tgacatcggt ccgatgacaa cggcccacat 2340 ccatcacatc tcagcaacat ccgagtcctc gctcgatgaa gagttccaga aattttgtac 2400 aaccgaaaat ttcggcactg aatttaaagg agatggcctt tcagaatcag accgaatcgc 2460 agagaagatc gttgatgaag gcctgaagaa actcgacatt gggtatgaaa ccccactcac 2520 gtggctcgaa tgagagccga catttgaaaa taacaggaag ctggcggagc atcgttggca 2580 agacctgaag gaacgattca agagagatcc agatttcgag gctgattaca gagccgcgat 2640 caagaaatac gtcgacgaag gctatgcatc gcgagtaaaa gaggaagatt tatactcgaa 2700 caaccagtac tacttgccac atcacggtgt ctacaagaaa gtgtatggta agaagaagaa 2760 gaagcttcga gtcgtcttcg atgccgctgc caaatggaag aagaaatgcc tcaacgacgg 2820 aatgcgacag ggtcgaaaat tgcaaaacga cttagccaag gtcctcattc gattccagct 2880 gggggagatt gcctttgcag cagacgtgac cgccatgtac agcaggattc gacttcgacc 2940 acaagatgcc cgctttcatc gttttctctg gcaagagaaa gattctgacg tcatcattac 3000 gtaccaaatg gatcgcttgc catttggatc aaactgtgcc ccatttctcg ccctaaaaac 3060 catccacaga gccgcttcag atgccactaa agggagagaa gaatgtatgg aagcagtaga 3120 gaagaatatg tacatggatg acctattgaa agcagtcgac aatgaagaac aagcaattat 3180 gaaagcgaaa ctcatccgcg atggactagc agaaggagat tttcatctaa caaattggat 3240 ctcgaactcc ccggaattca tcaaagcgtt gcaaccagag caagcagtag cagcagcaca 3300 tcatgattta gcgtcggatg acgtcgaaaa gctactggga gccttctacg agccaacgac 3360 agacgagatg acgtaccgag tgactggagt cgaagatctt aaatggacgc gcgctggatt 3420 gctgagcaag gtggcgagta tctacgaccc cctaggacga gcagcaccag cgctgataaa 3480 agcaaaaatt aaattgcgtg agctaggtac cagaggacta aactggaacg aagccatcag 3540 cggtgacgac aaggagtggt ggcaaacctg gttcaagacg ctggagagac tcaacgacct 3600 ttcgatgcca cgaaatttgc agccggacaa agaaaaaatc atccagagcg atcttctaac 3660 ctttggagat gcatctgaag aggcctatgc agcagccgta tatctacgca gtgtctacca 3720 ggatggcaca tcaatatgca gattggtgat ggccaaaacc aaattggccc ctcggcagac 3780 acagtcgatt ccaaaattgg agctgaacgc agcagtgcta ggcgctcgac tggccacata 3840 cgttgcagac gcgctacaaa tctctggact caacaggatt ttcttcactg attccagtac 3900 taccaggaac tggatcagag ccgtcgcttc gcattacatg ccgttcgtca gccacagaat 3960 cggagagatt caatctctca ccaacgctaa cgagtggcgc ttcattcccg gaaaaatgaa 4020 catagcagat gctgcgacgc ggtccctatt gacagacgga gagccgatcc caccgagatg 4080 gctggaagga ccatctttca tcgccttacc aatggagcaa tggccgaagg atttgccttg 4140 ggtggcagta gcagcagaga aaagaactgc ccacatccac cacgcaattt ctgagccccc 4200 tcttcaagat tggacaaagg tgatcatcga ttcgaagaac atctcaacat tcctgaaggt 4260 ggaaggagat gccaaggaga ttattttaaa gtgccaaaag gaaggattcc gtgaagaaat 4320 cagtttattg tcaaagggca agcaaatctc aaagaagtca agcctcattc aattaactcc 4380 tttcttagac aaagatggaa ttctccgact gggtggacga atccaccgcg caaagctacc 4440 atatgaggtt ctacaccctc caattctgcc cggaaagcac ccacttgcag aaaagatagt 4500 gacagcagtg catcaagagt cgcatcacgc aggcactgac tttctccatg ccaaaattcg 4560 gcaacatctt tggatcctgc aaggacgaga gctggcaaag aagatccgtt tcaactgtgc 4620 cgtctgtgtg aaatcgcgtg ccaaactcgc cacgcaaaag atgggtgact tgccatctgc 4680 tcgtctcgat tcttacacgg caccattcac gcacatagct ctggattatt tcggaccgat 4740 cgaaacgtca gcttacagaa accgagtaat caaacggtat ggcctgttaa tcacctgtct 4800 cgtcacccgc aatgtttatg tagaaatggt gcagtcgatg tcaacaccgg attttctcta 4860 cggtctacgg cgtttcatcg gagaattcac gaagccaact gaaatcttct gtgataacgg 4920 gacaaacttt gtgggagctg agaaggagct ggccacagca tttcgtgagt tgcaaaagga 4980 cgaaaagttg aaggagtttg cgcgagagcg ctcaatcatc tggaaattcc agccacctag 5040 tgctccacat ttcggcggat cacacgaatc aatggtcaaa tcaacaaaga gggcactcta 5100 ccgggcgctg gatacggaaa aagccggatt gcgctacccg acggatgaaa tgctcagaac 5160 gctactgaag gaagtggcgg gcatgctgaa cgggagacct ctgacgctgg ccagcaacga 5220 cccagaagac ttcagaccaa ttacaccaaa ggatttcttg aatctgccac caacgtcgga 5280 tcttccggcc ggagacatcc gtcgcgctct gccccgcgac catattcgtt atgtgaagaa 5340 gatggccaat ttattttggg acctgtggac caagattttc ctcccgacgc tggttccaag 5400 aaggaagtgg gtcgcggaag aaagaaattt tgaagtcgga gattcgatca tgctatcgga 5460 caacaatttg ctgccaaatc agtggaagac gggccgaatc atcgaagttt atcccggttc 5520 cgacggcttt gttcgagtcg taaaagtgaa aacggactgc ggagaatatt tgagaccaat 5580 ccatcgtcta gttctactgg aaccttactc ggccgtcctt ccgtcggcct cgggggagaa 5640 // ID Gypsy-21_CQ-LTR repbase; DNA; INV; 207 BP. XX AC AAWU01010825; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_CQ_; KW Gypsy-21_CQ-I; Gypsy-21_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 422-422 (2011). XX DR GenBank; AAWU01010825; Positions 48502 48708. XX SQ Sequence 207 BP; 55 A; 51 C; 58 G; 43 T; 0 other; tgtcggtggc gagagcgaga ccgatgccga cgaggattag agcgagactg cagacccgag 60 gtcggccgat cgcttgacga cgagcagcga gcagagttcc ttcgaccgcc gcgagaatcg 120 gacgtcaaag tgttgtcgcg tgtactcagc cggaaatctt aatattaaac tatcttaaac 180 taaacaaacg tgttttgctt taccaca 207 // ID Gypsy-16_DPu-LTR repbase; DNA; INV; 221 BP. XX AC scaffold_847; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_DPu_; KW Gypsy-16_DPu-LTR; Gypsy-16_DPu-I. XX NM Gypsy-16_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 748-748 (2010). XX DR Genome; scaffold_847; Positions 4096 4316. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 221 BP; 45 A; 52 C; 58 G; 66 T; 0 other; tgttgtaaat gtggcgtcgt tgtattcgtc gcggtccgac aacgtcgccc ctacctgtgg 60 tgtgggggaa ggctcagttc ggtccaggcg tagctcgagt tcgagtgtct tttcgtgttt 120 gctcaataca aacctagtta aagtcgaaca cggtctgatt tatcgattta ttctaacgaa 180 aacgttcgtt ccctccgatc gagtcaggcg atttcgcaac a 221 // ID Vingi-1_CJ repbase; DNA; INV; 3196 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-1_CJ. XX OS Caenorhabditis japonica OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3196 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 3..3146 FT /product="Vingi-1_CJ_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="RPAPPPPSQHTTKCRAKPAKTRILSWNVNGWSKRRSE FT LATIIAKEKPDIVCLQETKLDKTNLGYTPTGYKVALASRNKFGGGVCTMVR FT DDWIFTISKTTSKKTAEILSITIDKLMVTNVYFPPRSSDSPTAEVREDVME FT ILDMASEKHIITGDWNAHAKTWHSHKTDERGAMIEDELDTRGVHVVLNEDS FT FTRAASGVLSSPDITISTSDLAACSSWRTCGVLNTDHLPVIIDTMAEIPKL FT VRPXRIMANFKKANWTDFSDSIETKVACYAGPRDAYSLENFLSATITRAAS FT KHIPHGKFTPTREHTGDTGEIAELTRERERIQKRNPLDSNLKALNDKIEAA FT RRELRRLXWHQKIEDMEDKRKIDCNALWRVIKGLKPKTKSTGSIRLPNGTT FT TTNPKTIANAMASSLVAVGKGRDCREERIAWRRTKRRLKKLRPQLPASSIS FT HAEVKLAILKGKPSKALGPDGICHLHLKHVPDNCINLMAELFNASIDENKV FT PDSWKKANVFMLPKPGKDPSEIKSFRPVSLLSPVAKVLERIILERIRREIE FT SPKDQHAFKKNHSTTTAITEATSCIVGGLNMEKPACRTIMTCLDMKAAFDT FT ISIAKLLCGLEDAIADKKVCGWLGNYLHKRRINVQYDDVKSSWLTLRGGVP FT QGAVLSPNLFNFFIRDVPAIPDTKIITYADDMTLLVQDPLLSKAAEKSQDA FT LDSLQKWFKDKHLDISAGKSTVTIFTADTKEFKYDPGLKWEGSVIPMQNRT FT RLLGVELDTMLRMTGHVEQVSARLAKANNILRALAGAKWGSSKETMLRTFK FT AIIKPLATYAGPAWHQLMSETQKTKLERQYVGGLKACCGLTKDTPRQLVYH FT ETRMLPLSEELTLGSEQLAIAAIKTPGHPTSDLSRRRPKERTCGNRPRTPP FT LGCLEDRGDEWRRMRGDTRAVQKRNHTSFVKSYLSNHGIHPILGRQPPALS FT EEESTLPRNTRVELARLRAERSLLLEKYKAKVENRPVVCCIKCNDDVGDLK FT HFLKCYPVKPLPMSKLWKDPVAAATALGLAVTPFDPGGDADS" XX SQ Sequence 3196 BP; 948 A; 878 C; 851 G; 513 T; 6 other; taagaccagc accccctcct ccctcacaac acaccacaaa gtgcagggcc aaacctgcca 60 agacgagaat actctcgtgg aatgtgaacg gctggagcaa gagacgcagc gaacttgcca 120 ccatcatcgc caaggagaaa ccggacatcg tatgcctaca ggagaccaaa ctcgataaaa 180 ccaaccttgg ttacacaccc acaggctaca aagtggcgct cgcatccagg aacaagttcg 240 gcggtggtgt ctgcacgatg gtccgagacg actggatctt cacaatctcg aagacaacgt 300 ccaagaaaac ggcggaaatt ctgtcgataa ctatcgacaa gctgatggtg acgaacgtct 360 acttcccacc cagatcatcc gactcgccaa cggccgaggt ccgcgaagac gtcatggaga 420 ttctggacat ggcgagcgag aaacacatca tcacgggcga ctggaatgcg catgccaaga 480 cgtggcacag ccacaagacg gatgagagag gtgcgatgat agaagacgaa ctggacacgc 540 gcggtgtaca cgtggtgttg aacgaggatt ctttcaccag agcagcaagc ggagtgctca 600 gctctcccga catcaccatc tccacctcag acctggctgc atgctcctcg tggcgcacgt 660 gtggtgtmtt aaacacagac catctccccg tcatcatcga cacaatggca gaaatcccca 720 agctcgtccg gccaaawcgt attatggcga atttcaaaaa ggcaaactgg acagacttca 780 gcgactcgat cgaaaccaag gtggcatgct acgccggacc aagggacgcc tactctctcg 840 aaaacttcct gtccgccacg atcactcgmg cggccagcaa gcatattccg catggaaagt 900 ttacaccaac gagagagcac acgggcgaca ccggggaaat cgcggagctt accagggaaa 960 gagagcgaat ccaaaagaga aatccgctgg acagcaatct gaaagcgctg aacgacaaga 1020 ttgaagctgc tcgaagggaa ctgcgaagac tgwgctggca ccagaagatc gaagacatgg 1080 aggacaaacg caagatcgac tgcaacgcgt tatggcgtgt gatcaaagga ctgaagccta 1140 agaccaaatc cactggctct atcagacttc caaacggcac caccacaaca aatccgaaaa 1200 ctatcgcgaa tgcgatggcg agcagccttg tagctgttgg aaaaggcaga gactgccggg 1260 aggaacgcat tgcatggagg aggaccaaaa ggaggctgaa gaaattgcgc ccccaactac 1320 cagcgagctc gatcagtcat gcggaggtga agttggcgat tctgaaaggg aagccctcaa 1380 aagctctcgg accggacggg atatgccacc tgcacctcaa acacgtcccg gacaactgca 1440 tcaatctgat ggctgaactg ttcaacgcat ccatcgacga gaacaaggtc ccggactcct 1500 ggaagaaagc caacgtattt atgctgccga aacctggaaa agacccgtcc gagatcaaaa 1560 gcttccgccc agtgtctcta ctgtccccag tcgcaaaagt gctcgaaagg ataatcctcg 1620 agagaattcg ccgcgaaatt gagtcgccga aggaccagca cgccttcaag aaaaaccact 1680 ccacgacgac agcaatcact gaagccacaa gctgcatagt gggaggcctg aacatggaga 1740 agccggcatg ccgtacgatc atgacatgtc tggacatgaa ggcagccttt gacacgatct 1800 caatcgccaa gctgctgtgc ggactggaag atgccatcgc cgacaagaaa gtgtgtggmt 1860 ggctgggaaa ctacctccac aaacgcagga tcaacgtcca atatgatgat gtcaaatcca 1920 gctggctcac actgagagga ggagtaccac agggagccgt gctatcaccg aacctcttca 1980 atttcttcat cagggacgtg ccggcgattc cagacaccaa gatcatcacc tacgccgacg 2040 acatgactct gctggtgcaa gatccgcttc taagcaaggc cgccgagaaa tcccaggatg 2100 cgctcgactc cctccaaaaa tggttcaagg acaaacactt ggatatctct gcaggcaagt 2160 ccaccgtgac cattttcacg gcagatacga aggagttcaa gtacgacccg ggactgaagt 2220 gggaaggctc ggtcatccca atgcagaaca ggaccaggct gcttggagtg gaattggaca 2280 ccatgctacg gatgacgggt catgtggagc aagtcagcgc caggctggcc aaggcgaaca 2340 acatcctgag agctctggca ggtgcaaaat ggggatcgag caaggagaca atgctgcgca 2400 cgttcaaggc aatcatcaag cccctggcga catacgccgg accagcgtgg caccagctca 2460 tgtcggagac ccagaagacg aagctcgaga gacaatatgt tggkgggctc aaagcctgct 2520 gtggcctcac caaggacacc cccagacaac tggtctacca cgaaacgagg atgctaccgc 2580 tgagcgaaga gctgacactg ggaagcgagc aacttgcaat cgcggccata aagacgcctg 2640 gccacccaac cagtgaccta agccgcagga gaccaaagga aagaacgtgt gggaacaggc 2700 cacgaacacc gccacttggc tgcttggagg acagaggaga tgaatggaga aggatgcgcg 2760 gcgacacgag agccgtacaa aagaggaacc acacaagttt cgtgaagagc tacctcagca 2820 accacggcat ccacccaata ctcggcagac aaccaccagc actgagcgag gaagagtcca 2880 cgctaccaag gaacacaagg gtggaacttg cgcggttgag ggcggagaga tctctgctac 2940 tcgaaaaata caaggcaaaa gtggaaaatc gcccagtggt atgctgcatc aaatgcaatg 3000 acgatgtcgg agatctgaag cacttcttga aatgctaccc tgtcaagccg ctaccaatgt 3060 cgaaactatg gaaggaccca gtggcagcag ctacagccct tggcttggct gtcacgccat 3120 ttgatccagg aggagatgcc gattcgtgac cgaagatcgt caaggaatga gtagagttgc 3180 acaacaacaa caacaa 3196 // ID MARINER2_GT repbase; DNA; INV; 1287 BP. XX AC X80776; XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 10-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE G. tigrina mariner-2 DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINER2_GT. XX OS Girardia tigrina OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Girardia. XX RN [1] RP 1-1287 RA Garcia-Fernàndez J., Bayascas-Ramírez J.R., Marfany G., RA Muñoz-Mármol A.M., Casali A., Baguñà J. et al.; RT "High copy number of highly similar mariner-like transposons in RT planarian (Platyhelminthe): evidence for a trans-phyla horizontal RT transfer."; RL Mol Biol Evol 12(3), 421-431 (1995). XX DR EMBL/GenBank/DDBJ; X80776; Positions 1 1287. XX CC 32 bp TIRs. XX FH Key Location/Qualifiers FT CDS 191..1285 FT /product="MARINER2_GT_1p" FT /translation="MEISEIRILMKYEFHRGATTRQAVGNINSVYPTQAVT FT QTTVAHWFKRFRSGDFDLSNQPRGRPEIKVDNDALKADVEADSSQSALELA FT SKFGVAKSTILIHLKQINKVKKLDKWVPHELKDEHKQQRLDACLSLLSRNK FT ADPFLHRIMTCDEKWIMYDNRKRSSQWLDPDEPPKHCPKRKVHQKKLMVTV FT WWSSYGVIHYDFMVPGTSITSDVYCSQLDDMMEKLAIKQPKMFNRLTPILL FT HDNARPHSAKNTVAKLQQLGLETLRHPTYSPDLAPTDCHFFQSLDNFLSGK FT NFTSSGAVKTAFQEFIDSRESVFYTKGLNVLPLKWQQCVDNMGDILIKNKM FT LLFKKNRFTFYFNFDPFHIEQP" XX SQ Sequence 1287 BP; 415 A; 254 C; 251 G; 367 T; 0 other; ttaggttgtt cgatatgaaa cgggtcaaac ttcacacctt acatttcaat cgcgtataac 60 ttaactttga actatcgtat gaaaataaat aaggtatttc ttgacagata tattattcct 120 ctttaaaaat caattattta atttcaaata tattgaatag tttcttgtca acaaacaatc 180 aaacagcgca atggaaatca gtgaaatcag aattttaatg aaatatgagt tccaccgtgg 240 agccacaaca cgtcaggcag ttggaaatat caacagtgtg tatcctactc aagccgttac 300 gcaaacgaca gtagctcact ggttcaagag atttcggtct ggggattttg acctgtccaa 360 tcagccgcgt ggtcgacctg aaattaaagt ggataatgat gccttgaaag ccgatgtgga 420 agctgattca agccaaagtg ctctagaatt agcgtcaaaa ttcggtgttg caaagtcaac 480 aatcttgatt cacttgaagc aaatcaacaa ggtgaaaaag ctggataagt gggtaccgca 540 cgaattgaag gatgagcata aacaacaacg acttgatgcg tgcctttccc ttttgtctcg 600 caacaaagct gatccatttc tccatcgaat catgacttgc gatgagaagt ggataatgta 660 cgacaatcga aaacgttcat cgcagtggtt ggacccagat gaaccaccaa aacactgtcc 720 taaaagaaaa gttcatcaga agaagctgat ggtgactgtt tggtggtcta gttatggtgt 780 catccattat gattttatgg tacccggtac atcaatcact tcggatgtct actgtagcca 840 actagacgat atgatggaaa aacttgcaat taagcagccg aaaatgttca atagattgac 900 tccaatttta ttgcacgata acgctcgccc tcattcggca aaaaataccg tggcaaagct 960 acagcagttg ggtctcgaaa ctcttcgcca cccaacatat tcaccggatc tcgcacccac 1020 ggactgccac tttttccaaa gcttggacaa cttcttgtcg gggaaaaatt tcacttcttc 1080 gggggctgtg aaaacggctt ttcaagaatt cattgactct cgtgaatcag tgttttatac 1140 taaagggtta aatgtattac cattgaaatg gcaacagtgt gttgacaaca tgggggatat 1200 tttgattaaa aataaaatgt tattatttaa aaaaaatcga tttacatttt actttaattt 1260 tgacccgttt catatcgaac aacctaa 1287 // ID hAT-14_HM repbase; DNA; INV; 2832 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2832 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2003-2003 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(712..1845,1912..2622) FT /product="hAT-14_HM_1p" FT /translation="MEKGTEILNGHFYFKPLQQGVDRTKAICKHCKAEFSY FT HRSTSSLKYHLNAMHTVDANKSSTQTNSGGNLRQTTLDAARGRTTEKQKKD FT KITNAIAKWVATNCRPTSIVEDVGLKNVIRIATNDCTYEPPPRRTIVRKIH FT ELYENERTIKATALQRAQTVALTGDYWTSLGNHNYLGVTVHYIDEQWKLHS FT HALTVMKTQERHFAETCAEHFIHVAQQWDILNKVSTLSTDCARNIVAAARH FT LPFERVPCVAHSLQRSVTVSLHNSAFDNALAKCRKVVGHFKHSPASAAELE FT KKQIELGQKKESLIQDIPTRWNSTLDMIKSVRRNEQPLSDVLTTHNTKVAM FT PTPAEMDKLQRLEKLLEPCRYILHHIFLTDWFDCMVNAKHLSEMGFIFCRY FT VTELLGGEKYVSCSVVLPALCHLSRVMESSDDDPAYVVKFKSMFRIDLETR FT KENANIAYLKIATALDPRFKDLKCIPRVERGEVWVSITNLLKEQRVIADEP FT VEAATSKPSKRKFALLVASSESDSEQEEDSIENGVRRYKAEPTISTEACPL FT QWWSKHAGSHNRLASIAQKYLATPATSVPCERLFSLAGHIVQKKRVSLSSE FT NVNYLVCLSNWLGADE*" XX SQ Sequence 2832 BP; 935 A; 486 C; 539 G; 872 T; 0 other; ttagggctgg gcaagttaac gcgttattat cgcgttaacg cattaattaa ttaacgccga 60 caataatttt atcgcgcgtt aacgtacttt ttattttgaa agtctaaaaa gtgaaaagta 120 aactttttaa tttaaattta atagttcatt aaactaaaaa gtgacagggc tgcacaaaat 180 tcatcaaaac agacctgtca atctttagtt tacagaacta ttaaatttaa attaaaaaat 240 taatctccag attttattat tgtaaaaaaa tatatttttt tgctaagttt ttatgaaatt 300 tttttttcca cctattagtc tttacaatga gttcattatg agtttctgcc gctcgttttt 360 ttttaagatt cacaagctta actttatttt tcttttgtga aactgtctgt caaagttttc 420 gtttctcatt ttatattttt tgcttattat tgcttataat tattattata ttaatttata 480 ttagtattaa tatattaaaa aagttatttt gcattttgta tttttatgtt tatttatagt 540 tttgtttatt tattttgtat tgtttcaaaa ttatgtaata atcgaattat aatttaatta 600 tttataattc gataataatc gaatcataaa agaattgaaa ctttcgtgcg tatttatgtt 660 attgcttatg ctcaaacaac aagcagtaag taaattgaga aatatttaaa aatggaaaaa 720 ggtaccgaaa ttttaaacgg acacttttat tttaaacctc ttcaacaagg agtagataga 780 actaaagcta tttgtaaaca ctgcaaagcg gaattttctt atcatcggag tacttcaagt 840 ttgaagtatc acttaaatgc aatgcacacc gttgatgcta acaaatcatc cacccaaaca 900 aacagtggag gaaaccttcg gcagactacg ttagatgcag cacgagggag aactacagag 960 aaacaaaaga aagacaagat aacaaatgcc attgcaaagt gggtagctac aaattgcagg 1020 ccaactagta ttgtggaaga tgtcggtttg aagaatgtta ttcggatcgc aacgaatgac 1080 tgcacgtatg agcctccgcc aagacgcacc attgtaagaa aaatacatga actgtatgaa 1140 aacgagagga ctataaaggc aacggcatta caacgtgcac aaactgttgc cctcaccgga 1200 gactactgga catcgctggg taaccataat tatctcggag taactgtgca ttacattgat 1260 gaacaatgga aactgcattc acatgctttg acagtaatga agacacaaga aagacatttt 1320 gctgaaacgt gcgcagaaca tttcattcat gttgcacagc agtgggacat tttaaataaa 1380 gtcagcacac ttagcacaga ttgtgctaga aatattgtcg ctgctgcaag acatctgcct 1440 ttcgaacgcg tcccttgtgt cgctcacagt cttcagcgtt ctgtcacagt atctttgcac 1500 aacagcgcgt tcgacaatgc cctggctaaa tgcagaaagg tggtgggcca ttttaaacac 1560 agtccagcaa gtgctgcaga attagaaaaa aaacaaattg agcttggaca aaagaaggag 1620 tcgcttatac aagacattcc aaccagatgg aactcgactc tggacatgat aaaaagtgtt 1680 cgcagaaacg aacagccact gagcgatgtc ctcaccacac acaacacaaa agttgctatg 1740 ccaactccag ctgaaatgga caaattgcag aggttggaaa agctcctaga accctgcagg 1800 tatattttac atcatatatt tcttactgac tggtttgatt gcatgtgaga gaattgatgt 1860 gtctgtattt gtgcgtaagc atgaacatga gaacactttt tgtgtgtata agtgaatgca 1920 aaacatttga gtgaaatggg ttttattttt tgcaggtatg tgactgaact tctgggtgga 1980 gaaaagtacg tttcatgctc tgtggttctg cctgctttat gccatctctc tcgggtgatg 2040 gagagctcag atgatgaccc ggcctatgta gtcaaattca aatctatgtt cagaatagat 2100 ctggaaacac gcaaggaaaa tgccaacatt gcatatctaa agattgcaac cgcattggac 2160 ccgagattca aagacctcaa gtgtataccc agagttgaga gaggtgaggt gtgggtctca 2220 atcactaact tactcaagga acagagggtt attgcagatg aacctgtgga agcagcaaca 2280 tcaaagccat caaagagaaa atttgctcta ttggttgctt catcagagtc tgattctgaa 2340 caagaggaag actcaattga aaacggtgtg cgtcggtaca aagcagagcc caccatcagt 2400 acagaggcct gtccattaca gtggtggtcc aaacatgcag gctcacacaa caggctggcc 2460 tcaattgcac agaagtactt ggcaacacct gcaacatctg ttccttgtga aagattgttt 2520 tctcttgctg gtcatattgt acaaaagaag agagtttctt tatcatctga aaatgtgaat 2580 taccttgttt gtcttagtaa ctggcttggt gcagatgagt aagtaaagtt ggaagaatgg 2640 ataaaattgt ttaaagtttt gacaaaccaa aagttatgtt aatttaacta tgaaaaaata 2700 aatttacgct atatatatac tacttgtggt tttattattg aatttttcgt atttctgcga 2760 ttaatcgcga ttaatcgcaa aaaatcatgc gattaatcgc gattaaaata tttaatcgtt 2820 gcccagcact aa 2832 // ID DNA2-1_CQ repbase; DNA; INV; 1805 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA2-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1805 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 68-68 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 22 sequences with >92% CC identity. 2-bp TSDs. 11-bp TIR. XX SQ Sequence 1805 BP; 597 A; 327 C; 310 G; 571 T; 0 other; cccaagtaac attttcagtt ttgtacctat cttgaggaga gattgaagtt atcttagcta 60 aaaatgacca taaagccatt ttatctccag gacaacaata aatcagcatt aaacctgagt 120 taagagcaaa gtggactatt ttaggaggtt atcaagacca tttattagga gtcatctaaa 180 actcatcagc ttagacattg atttttaaaa cactctgcag gtgtatttaa ttgctaaaat 240 aaaacttatg ctcaagcttc ttaaatttat aaccgcacag aagaggtctt caagactgaa 300 ataaactttc attaaagatg ttttctctcg ttgaagataa aattgatttg tcgaaggaaa 360 atcagttcta attataatct cgttaaattc aatcattctt tatgaaataa atcacaaatg 420 gccaatgaaa aaagataaaa ttatattgat aactataaaa taatatcaaa ataataaatt 480 actgtgactt tacgcagccg ttctaaccga aaaattcttg ccatcaaaat ttcactggtc 540 agttttctta gatacctcga aaaaatatgc atcgaaggca tgaatcgttc aaaattgcat 600 cgattttttt attttgatgg accgattaat tatttttttt tctgaaaaat caacatggct 660 tccatttttg gccgttgctt tttcgtgtta atatggctga tctcataaaa actattctct 720 cttactcaaa gcagttatta gggccatttt tcttaagaag ctcttgtaca agaaacttta 780 agagctatgt ttcttgcgaa gctcttgttc aagattaagc aagaacatgt agactgcagt 840 gagctttaag gcgattttaa ccaagagcaa aaggcgtagt actgcgaccg tagcgcatgc 900 gaattttagt tagcaggagg tagatccaaa acgtttttta tggttatttt ttcatcgttg 960 gataaaaaac aaaatacatt ttttttcata ttctttattc atttattcga ttgggatcat 1020 cataagtttt cctcgcatga aggaatttac cgtcgacaca acaacactga ttttaataaa 1080 caacactaga ccaacgttaa aacactagca ctccgcgacg aatttatccg cttctcggaa 1140 cgccttatcc agcttgtcct cgaaaggctc tgacggacaa acggatggcc taactgatga 1200 tttgtagctt gtcgattctg cgcgtcggag agagcttcac ggtagctacg gaatcggcgc 1260 ccatgatgat tctcgtgtta ctcgatcagc acgtgccgga aaaaaacatc tactgcccat 1320 gcatgccaat ggaccaaatc ggcatgtact cgatcaccct cggataggtg tactccgtta 1380 catccgcgct ggagccggaa acgggcgaaa agtatcactt gtttaccggg atcacaaatt 1440 ttggtaaaag tttctttgac actaaaccga ccgactgtac aaaaaaatgt gatttgacag 1500 atcaagttca aataatctgg aaatatgctt tcatagaaca gagtagtttt aatttagttt 1560 cgaagcagtt ttgacaatgc ttaaaagtct tataaataac ctctctggga tagctttaag 1620 agaaacaatg aagtgtgata taaaactatg ttccatgctc ttaaagtatt cttaaagata 1680 tttttaaaac tttcgtatca taaacccagc aagttttgat caatgttttc ataaaacaga 1740 aaagcaaatt tcaaaccttt tataaaacct gttaagaagt aaaggcattt tgcttgttac 1800 ttggg 1805 // ID TE-X-2_NVi repbase; DNA; INV; 714 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE nonautonomous transposable element from Nasonia vitripennis - a DE consensus. XX KW Transposable Element; Nonautonomous; AA targeting; TE-X-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-714 RA Bao W. and Jurka J.; RT "Transposable elements from Nasonia vitripennis."; RL Repbase Reports 9(4), 803-803 (2009). XX DR [1] (Consensus) XX CC The exaxt ends of the consensus is unclear, because this element CC lacks both obvious TSD and TIRs. However, this element seems CC targeting specifically AA and inserting between A and A. XX SQ Sequence 714 BP; 239 A; 108 C; 136 G; 230 T; 1 other; atagtatatt gtgtaccaag ggcgtaaagt gggctttttt aggccgagtg tggattttgc 60 agtacgagtc aaggcgagtt agcccgagcc gaaggcgagg gcgtttacga gccgcagacg 120 agtactgcaa aatccacgag gccaaaaagc ccacttagcc cttggtgcac accatatttt 180 atgtaacgcg cggacgtatt ttcgcgtttt ttcgtacgtg ttaagaggga ctgatgtgac 240 tatttttgac acggcaaccg tgctcttgtc tgttcaaaca ttatataaaa gtcaattttt 300 kttttttttt gtaaataaat aattgatttt aacagtaccg aatttataat gagaaaattg 360 gaaaaaaatg aaataaagta ttatgtaaag ggttttaatg aaaaaaataa gaaatttcga 420 caatgttaat ttatatttag ttataaatta tatattgttc aagaacataa cagtttgtac 480 attttttatt taataataaa taatgcaaat ttatcaaaat caactttttt tttaaattac 540 aaaaaaaaaa tttagcgggc aataaaggcc attattgccc gctgtttgaa tatgcgggca 600 ataaaggttt ttattgcccg ctaaaataac ttttttgccc gccggtcaaa aaagagtcac 660 tttagtccct attaataggt acgaaaaacg cgaaaatcgt tcgcgtgtta cata 714 // ID BEL-174_AA-LTR repbase; DNA; INV; 668 BP. XX AC AAGE02031694; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-174_AA_; KW BEL-174_AA-I; BEL-174_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-668 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02031694; Positions 2909 3576. XX SQ Sequence 668 BP; 223 A; 98 C; 133 G; 214 T; 0 other; tgttgcaacg ctttagataa aagcccctcg acagaagccc tgctaatgta caatagggat 60 tgcgctaact gacagtatca gtgatagagc ggtagatgag ataaagcgat acgttgacgt 120 ctgtcatatt agatgattag tagctgactc atggtaaacc caaattttct aagtgaaatt 180 gttctgctga taattattat atttcgcgat tattctatat gaaaatacgc tgtttacttc 240 gatagtttga atttgagtga gtagctaaca tgaattcgtt atttgcacaa aacgttatgt 300 atatttccat tggaataggt acgacagaga attcttcatt ccattgatcg ggataataaa 360 tcacgtttaa gaatttgata agtttgtgct cagattcggt aaaagtttag gtaacgtatt 420 gattgaattg tgcattatgc agtgaattga tttatagtta ttaatagata gtttaacgtt 480 ataacagatt aaaaacacta ctcgaggtaa ggtgagaggt aacctatgtt tacaccagag 540 cggaaaccaa gattgtgagt agaattgcgc ttacatatta gttgttaaac ctaatcaaat 600 gttacgaatt ctagctaaaa gcaaaataaa ctccaaatat ttgctaaacg gacttggctg 660 ctcgaaca 668 // ID CR1-9_HM repbase; DNA; INV; 3451 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 17-SEP-2009 (Rel. 14.1, Last updated, Version 2) XX DE CR1-type family: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; CR1-9_HM. XX NM CR1-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3451 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1837-1837 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 407..2968 FT /product="CR1-9_HM_1p" FT /translation="MCPTEFVDYYLNPLLEKISLDKKKFMLMGDFNFDLLK FT YDTSNEVSYFLESMFSNSLFPFIIQPTRITTHSKTIIDNIFLNFYSSEIIS FT GNLTASISDHLVQFMLIPHTKTQTSSEPIVRRCFKNFKEAEFIQEISNVEW FT DDHIKDDSDVNDSIKKLIEIFDHILNKHAPYKKLTKNQVVKRNQPWVTKGI FT LKSITIKNKLHKKYLKSKNCKIKNIYFKKFKHYRNSISNLLKFSKKKYYTN FT FFNHNINSIKNTWKGIKELINVKRNNSNSLTSILINNKCETNPKTMSNHFN FT NFFTTIASKLSYEIPNSKFNYHEYLTNPTSNSFYMQPITKEETLDYINSTI FT IEGRSVGPNSLPTKLFKLVSGTLCKPISIILNNSFRKGTFPDAFKLAQIIP FT IHKKGSTIDCTNYRPISLLSNVSKIFEKVMHNKLYNFLNQNQCLYKHQFGF FT RKKYSTTHALIEITESIRVALDNGNFACGVFIDLQKAFDTVDHEILLSKLN FT YYGVRGISLQWFKSYLSDRQQFVTINGTSSDIKNVSIGIPQGSILGPLLFI FT IYINDLNLSIKHSKTYHFADDTNLQLITNSLKKLNKYINQDMASLVQWLRA FT NKISLNTKKSEIIIFKTQNTNFSKKTKKNIPKYLNFRLSGQKMYLSSNITY FT LGVVLDEKLSFKTHISDLTLKLSRSNGMLAKVRHYVNFETLLSIYHSIFGS FT HLRYACQVWGQSKTVCLSRVVSLQNRAVRIIHFCPRNFPTDILYLTSKILR FT FYDLIQFLNCSFVWDQQHGVLPTIFHNYFNNRTTCGHNLRSVTYNNLTLPL FT KQTSKHGINSITYQCILSWNSLPNTLKSGLILRFKNLFLKSLHHFYLKGYA FT *" XX SQ Sequence 3451 BP; 1216 A; 574 C; 400 G; 1254 T; 7 other; tcaattaact gtgaatatgt agatgttcct atgtacaata aattgtccca aaattcttta 60 arttatttcc atataaacat atcttcatta ccttaccata ttgatgactt taaaatacta 120 ttagcatctt ttgataattt acctgatatc attgggatta ctgaatctaa cttgtacgat 180 aacgatctct ctataactaa cgttgaatta gaaggatatg tatctgttca cacgccaact 240 aaagctaaaa aaggtggtgt tttaatgtat attaartcta acttaaattt taaaattaga 300 tctgaattag tccggcttaa aagtaargaa ctcgaatcta tatttattga agttataaat 360 ttaagaaaaa agtctaatag tgggttgttt atatcgccat ccatcaatgt gtccaactga 420 attcgttgat tactacctaa atccacttct tgagaaaatt agcttagaca aaaaaaagtt 480 tatgctaatg ggagatttta actttgactt gctcaaatac gatacctcta atgaggtatc 540 ttatttttta gaatcaatgt tctcaaattc tctttttccw tttattatac agcctacacg 600 cataaccacw cattctaaga caataattga taatattttt ttaaactttt attcttctga 660 aataatttca ggsaacctta ccgcttcaat atctgatcat cttgttcagt ttatgttaat 720 tccacatacc aaaactcaaa cttcatctga acctattgtt aggcgatgct ttaaaaattt 780 taaagaagct gaatttattc aagaaatttc aaatgttgaa tgggatgacc atataaaaga 840 cgacagtgat gttaacgatt ctattaaaaa attaattgaa atatttgatc atatcttaaa 900 caaacatgct ccttataaaa agttaacaaa aaatcaagta gttaaacgta atcaaccttg 960 ggtaacgaaa ggcattctca agtcaataac aattaaaaat aaacttcata aaaaatattt 1020 aaaatcaaaa aactgtaaaa taaaaaatat atattttaaa aaatttaaac actatcgtaa 1080 tagtataagc aatcttctaa agtttagcaa aaagaaatat tacacaaact tttttaacca 1140 caacatcaat agcataaaaa acacctggaa aggtataaaa gagttgataa acgttaaacg 1200 aaataatagt aattccctaa caagcatact cattaacaac aaatgcgaaa ccaatcctaa 1260 aactatgtca aaccatttta ataacttctt taccacgatt gcttcaaaat tatcctacga 1320 aattccaaat tctaaattca actatcatga gtacctgacc aatcctacta gcaactcttt 1380 ttatatgcaa ccaataacga aagaggaaac tcttgattat ataaattcca ctattataga 1440 agggaggagt gttggaccaa acagtcttcc aacaaaactt tttaaactgg tgtcaggaac 1500 tctatgtaaa ccaattagta tcattttaaa caattcattt cgcaaaggaa catttcccga 1560 tgcctttaag ctagctcaaa ttattccgat tcacaaaaaa ggatctacaa ttgactgcac 1620 taactatcgt ccaatatctc ttctttctaa tgtaagcaaa atatttgaaa aagttatgca 1680 taataaacta tacaactttc taaaccaaaa tcaatgctta tataaacacc agtttggttt 1740 tcgtaaaaag tattcgacaa ctcacgctct tattgaaatt acagaaagta taagagttgc 1800 tcttgataat ggcaactttg cttgtggrgt ttttatcgat ttgcaaaaag cttttgacac 1860 cgtagaccat gaaattctcc tctcaaagct taactattat ggtgtacgtg gcatatcctt 1920 gcagtggttc aaatcatacc tttctgatcg tcaacagttt gtaactataa atggaaccag 1980 ctcagatatt aaaaacgttt caattggtat acctcaggga tccatattag gtcctcttct 2040 ctttattatc tatattaatg atctcaatct ttctatcaaa cattcaaaaa cgtatcattt 2100 tgctgatgat actaatcttc aattaataac aaactctctc aaaaaactta ataaatatat 2160 taatcaggat atggccagtc ttgttcaatg gcttagagca aataaaatct cattgaatac 2220 taaaaaatca gaaataatta tatttaagac tcaaaatact aacttttcta aaaaaactaa 2280 gaaaaatatt ccaaaatact tgaactttcg attaagtggt caaaaaatgt acttgtctag 2340 taacataact tatcttggtg ttgtgttaga tgaaaaactt tcattcaaaa cacacatatc 2400 cgaccttact ctgaagttaa gccgttccaa cggcatgtta gcaaaagttc gacattacgt 2460 taactttgaa acattgctct ctatatatca ctcaattttt ggttcacatt tacgatacgc 2520 atgtcaagtt tggggacaat ctaaaaccgt atgtctctct agagttgttt cacttcaaaa 2580 tagagcggta cggattattc atttttgtcc gagaaatttt cctactgaca tcctatatct 2640 tacatccaaa attcttcgtt tctatgactt aattcagttc ttaaattgtt catttgtctg 2700 ggatcaacaa catggtgtcc ttcctactat atttcacaac tactttaata atagaacaac 2760 atgtgggcat aacctaagat cagtcactta taacaactta actcttccac taaaacagac 2820 aagtaaacat ggaattaatt ccattacata ccagtgtatc ctttcttgga actcccttcc 2880 gaacacccta aaatcaggtc tcatccttag gttcaagaat ctgttcttaa agtcactcca 2940 tcatttctac ctaaaaggtt acgcttaact ttacttttta cttttctttc tttctacctt 3000 ttgacttatt tttttatttt tattatagtt tattatttta ttataatttt atttttaata 3060 ttttatttta aataattatt tttattataa tttacttatt ttattataat tcttgtttaa 3120 gtatatgtgc atatataaat gtacattata ttaattatgt tatatatatt caaaaagtct 3180 gatttttttg gtataattaa ataaattttt atgcaatttt acttttatta gttaatatta 3240 tgttaactta tttaatttaa cttcctttta taacctactt ttttatgtgg ttttacttat 3300 tacatattat tattattatt gttattatta ttttttatct tatttaggtc attgcgcttt 3360 attagtgaat cactatttgc aatgacctgt gtttttgtaa catctgtaac acgaaatata 3420 ttgttattgt tgttgtttgt tgttattgtt g 3451 // ID hATx-1_HM repbase; DNA; INV; 3451 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hATx-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3451 RA Jurka J.; RT "A distinct, diverse family of hAT transposons from Hydra RT magnipapillata."; RL Repbase Reports 8(12), 1820-1820 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 796..2820 FT /product="hATx-11_HM_1p" FT /translation="MNENSEKGTSFFLHKKDKDPTSVWSHYLKSADNLAAK FT CKKCGVILKTLGGSTKGLHTHLSTKHNLKVTSADKNLPSSSQQLNDARTDE FT KPPTKRRITSYFASEKKKTLDETLARMTALDGLTFNVFITSKDLRQLLITS FT GFENLPKSANSIKRIVVSYSLNIRQLISKEIYQYKALGTRFSLSFDEWTST FT TNKRYMNINVHIPNKHWNLGLIRVYGSMNAEKCIDVLKTRLKEHNVNIDTD FT IVAIVTDGPNVMVRVGKIIEAEHQLCLVHGIHLAVCDVLYKKSSTPMSDAE FT APLKETQDIVDETDDHEDEENIDEEMGLGFLPPTNDGIANMDLLTNEQNIN FT ELVKKARKVVVLFKRSPTKNDAVLQKYVKEEKGSELSLILDCKTRWNSLLS FT MLERFLALKTCIQKALIDLNHPVSLNESDFVVLSEIIEVLAPVKLAVDALC FT RRDANLCTADAVLIFLFQQIAQKQSPFANKMFQAIKTRMEKRRCEEISGVL FT KYLQNPKYDAVNDETNVCKIFGVPPKTLIRKQIKKLIERLAAFKNDNLEKS FT SIDVDENIMAVVRNEESQVRASLSLREKLQEQIAQSMKAIEPETQTSNLMS FT IIKKEMHLFENGGNRGHNLELVYQYLLSIPPTSIEPERAFSAAGYLGSKIR FT SRLNDQTLNALLLLRAYFQTKKESL*" XX SQ Sequence 3451 BP; 1240 A; 527 C; 595 G; 1089 T; 0 other; gggattgcaa tcccgggatc ccggatcccg agatcccggg atcccggacg tttttgggtc 60 ccgaaaatcc cgggattcta aacacaaaat cccgggattt tcgggattat agcaggattg 120 aactaattaa attattttgt aaaacaaaat gttcaaagtt ggtaactata ctgaattata 180 acgagttata taatgctgct ttgcaggttt atcctgttta tgctttatcc ttacgataaa 240 tggaacaaat ttttgaatag accatggagt tcgggatttt atcattgaaa aaagtttcac 300 aaacacaaat taaggcaggt ttgtttgact attgtataaa aaacaatcgt taagtttgtt 360 ggcaagactt ctcgcgttaa aaagataaat tttaaaattg ttgagcgaat ttgtggttaa 420 gtgattgctt gagtttatta agtcgttttg ttcatttcaa aaaatatttt tagagaatga 480 aaaaaataaa aatgaacaag taataaaaaa aaaattatat ctaaaaaaat ttaatgatta 540 ctacataaca attccctttg ggatatttct agcgatttgt ataattatat ttgtacaatt 600 attagttatt gcaacattat aatgcccttg acaattaatt ccgcttgaag gaattctagt 660 gtgaatacat acaaaatttg actgtgtaat gaatcgtttt gaacgttttg aactgataac 720 ctttttaatg cttttgttgt ttaccgcaat aatatgttgt ttactaatgc accttaaaaa 780 aatatccttc taaacatgaa tgaaaacagt gaaaagggaa catcattttt tctacacaaa 840 aaagataaag acccaacttc agtgtggtca cattatttga agtctgcgga taatttggca 900 gcaaaatgta agaaatgtgg tgtaattttg aaaacattgg gtggttctac aaaaggcctt 960 catacgcact tatcaactaa gcataatttg aaagtaactt cagccgacaa aaatttacct 1020 agttcgagtc aacagttgaa tgatgccaga accgacgaga agcctcctac gaaacgaaga 1080 ataacgagtt actttgcatc cgaaaaaaaa aaaaccttag acgaaacact ggctcgaatg 1140 acagctttag atgggctaac atttaatgta tttataacat caaaagattt aaggcaatta 1200 ctgattacca gtggtttcga aaatttgccc aagtcagcaa actcaattaa aaggatagtg 1260 gtgagttaca gcttaaatat tagacagtta atttccaaag aaatttatca atataaagct 1320 ttgggaacac gattttctct gagctttgat gaatggacct ctacaacaaa taaaagatac 1380 atgaatatta atgtacatat cccaaataag cactggaatt tgggtcttat tcgtgtatat 1440 ggttctatga atgctgagaa atgtattgat gttttaaaaa cacgtctaaa ggagcataat 1500 gttaatattg atacagacat tgttgctatt gttacggatg gtccaaatgt aatggttcgt 1560 gtgggaaaaa ttattgaggc tgaacaccag ttgtgtttag tccatggaat tcaccttgca 1620 gtttgtgatg ttctttataa aaaaagttct acgccaatgt ccgacgccga agcaccattg 1680 aaggaaactc aagatattgt tgatgaaaca gatgatcacg aagatgaaga aaatatcgac 1740 gaggaaatgg gattaggctt tcttccccca acaaatgatg ggatagcaaa tatggattta 1800 ctcaccaatg aacaaaatat caatgaacta gtgaaaaaag cccgtaaggt agttgttctt 1860 tttaaacgat ccccgacaaa aaatgatgca gtcttgcaaa aatatgtcaa agaagagaaa 1920 ggcagcgagc tttcgctcat tttggattgc aaaacaagat ggaactctct tctatcgatg 1980 ctagagcgtt tccttgcttt aaaaacatgt atacagaaag ctttaattga cttaaaccac 2040 ccagtttcct taaacgaaag tgattttgtg gttctttcgg aaataataga agtacttgca 2100 cctgtcaaat tagcagtaga tgcgttatgc agacgagatg ctaatctatg cactgcagat 2160 gcagtactta tatttttatt ccagcagatt gctcaaaaac aatctccctt tgctaataaa 2220 atgttccaag ctataaaaac aagaatggaa aaaagacgat gtgaagaaat ttctggtgtt 2280 ttaaaatacc ttcaaaatcc aaaatatgat gctgtgaatg atgaaacaaa tgtctgcaaa 2340 atttttggag ttcctcctaa aacccttatt aggaagcaga taaaaaaact tattgagaga 2400 ctagctgctt ttaaaaatga taatttagaa aaatcaagta ttgatgttga tgaaaatata 2460 atggctgtcg ttagaaatga agaatcacaa gttcgagcct ccctgtcttt aagagaaaag 2520 ttacaagaac aaattgcaca aagcatgaaa gccattgaac cagaaacaca aacttcaaat 2580 ttaatgtcaa tcataaagaa agaaatgcat ctgtttgaaa atggtggaaa ccgggggcat 2640 aatttggaat tagtatacca atatttgtta tctattccac cgacgagcat cgaacctgag 2700 cgggcatttt ctgcagctgg gtatttggga agtaaaataa ggagtaggct aaatgaccaa 2760 accttaaatg ctctactatt gctccgggca tattttcaga caaaaaaaga aagtttgtaa 2820 ataaaatatg catttaatat gaattgtttt tttgttttac atttattatt tagctaccat 2880 ttatttataa atataaaata aagattttct ttaatttttg tctctcaata ttaacaaaca 2940 acttattatg ttatttttta caaagtagct tgttgctaca gcagctacat cggtacccaa 3000 atttaagaaa atttatgttt acaaatttat tttgaaattt tgattgttta gttactttag 3060 tcaaatacac tttaaatgtt caaatgttat ttacatatcc aattatttga tgaaggttat 3120 tctgtaaagt ctgtaatctt aaaacataat atggaaggta ccgtatatat atttctgcat 3180 tattttaaga tctttatata cacctgctgc ctgttacaca attaaaacaa tttaaggcgc 3240 cgtagaaaat taatgtaagt cattaaaaca taatgaaatt aaataaagaa ttgataacat 3300 caaaaagtat attcatatat gcagtattaa gttctgtttt ttagcaaata actcacatta 3360 tataaaatcc cgggataccg ggatcccggg attttattta cttcaatccc gaaatcccgg 3420 gatcgtaatt catgcccggg attgcaatcc c 3451 // ID Gypsy-47_AA-LTR repbase; DNA; INV; 1157 BP. XX AC AAGE02018863; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-47_AA_; KW Gypsy-47_AA-I; Gypsy-47_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1157 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018863; Positions 90518 89362. XX SQ Sequence 1157 BP; 336 A; 271 C; 263 G; 287 T; 0 other; tgtaacctct cagagcttcc aacctaaaat aattaatttg atgatagctt tgcaaaaaaa 60 aatacatata ttcaattgtc agtaaatgta aatgacatgc agagaaatgt ttgagttata 120 taacaagtac caacgacgag tttgaatttt cccatctcat taatttcgcc tgagataccc 180 cacttcgaaa actagatagg tagctcaacc gactcaactg ctcgaacgat aaagcttcaa 240 cctgcaccaa aacccaactg cttgatcacc cgaagcgcgc aaaagccgaa tgctgtcggg 300 gggcacgtga ttatacgagg gggtggagga gccgatcacc cgacgaaacc gaatctgcgc 360 ccgctgagct ataaaagcag ccccatagca cctataagct ctctttgatc ctgtgaaggc 420 gaaagttata gttcggcaaa ttttggagaa aatttagtgc tcaaagttaa ggagtgatta 480 ctcgcacgat tggacagttg aagtgtaagg ctaaacgtta aggagcagtg gctcgcatag 540 ctcaagtgtc cggtaattaa atcgcgatag gtaaaacgac taattttgaa gatagtgttt 600 ccctcgcgtt caccgccttc atagaggccg aaccgaaccg tccgtcacca tacggttcta 660 gtgaatttga ccgttcgtca ccatacggtt tagtgaatct cgtgaagatt tctacccagt 720 ccagtgcggg taccacgtgt cagcccgagc ctcccaaacc cccgtcgatc cggtgttggc 780 gctggagtcc ccatttccgg ctcagtccta ggagaacccc tattccggca agcacgtgct 840 ggaatccctg cagcgacaac tgaacaggcc gttcggtact ccatcgagca tcaaaagcgt 900 gagttgagtg aagcccaaaa tttcagtgag caaatttcaa taaaaagttc atagttaaaa 960 attacattca tttcctttag tcgcgtactt tctttgcatg catcaactta atttgcaggt 1020 tgagaatatt taaatacaaa agggtaagtg ttctttagtg caggttgcta gcggccctga 1080 gatcaaaaaa gaggtgagct ggcctagggc gtacggacgt ccacgccaca gatgcatggg 1140 cgtcgtcggt agttaca 1157 // ID SMAR28 repbase; DNA; INV; 2237 BP. XX AC . XX DT 22-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR28. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2237 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1067-1067 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 345..2027 FT /product="SMAR27_1p" FT /translation="MDKRKRTMLSINQKLEIINKLASGVPVSRICEQYDIK FT KQTVSDIKKNAERINKFALLCDVGNMNNERKKMRFAKNALLEEALMKWYIQ FT QRSCGIPVRALELKTAVIKLGKYMNINIKASDGWLWRFRKRHGISNKRIYG FT DAMSCAVEEIEPFRQMFKDFMTAKNFSKFQIYNADETGLLWRTLPENTQAY FT NFDKNTPGRKSCKDRISALLCANADGSHRLTPVIVGKSCKPRVLKDVMHSL FT PVVYYNSKKAWFTSEIFKNWFFNHFVPDVIKFQHEKLGILPAEATGILLLD FT NCPAHPSESILRTPDGKFTCMFLPKNTTALIQPMDQGIILATKRIYRKKFL FT DEVMVVCLNQKEEEDTRGQRTVQNLKNYNIKSAIFNFASAWKEVKELTLTN FT GWKNLFNGIDEDIDIVLDGIEVRDFYEVIQNNGETKVAEDDVYEWLEIDED FT DPGYNIMTEGEIADLLINEQNSEKSESTTEENTPRRVKISTIASHLADLIA FT FIDTSPDSDFQQYSMHFRLFRELIIKKQRALLKQTTLDSFLITSSCPKQLD FT SEKTNFQDEKRDNN" XX SQ Sequence 2237 BP; 810 A; 347 C; 400 G; 680 T; 0 other; tacagtagaa cctcgttata acggaccccg ttataacgga cttcggttat aacggacaaa 60 aaaaatcgca gaaatatatt aattaaaatt ttcttatttt gacttaatct cttaacaaat 120 ccgttcaatg cattacaaat gatatttatt tctatcaatt aaacaatatt gattgtattt 180 ttaagtgaat agccttcata taaaatagtg gcctgtttta agaataagaa ataaaaccct 240 atttttttac attagattat tttatacatt gcggtatatt ttttttagtg caattaaaga 300 atcttttatt tctactaata gtttgctgtt ccttaaccgt caaaatggat aaacgaaaaa 360 gaacaatgct cagtatcaac caaaaattag aaataattaa caaattagca tcaggtgttc 420 ctgtttctcg catatgtgaa cagtatgata taaagaagca aactgtatca gatataaaga 480 aaaatgccga gagaataaat aaatttgcac tactctgtga tgtagggaat atgaacaatg 540 aacgtaaaaa aatgagattt gcaaaaaatg cattgctcga agaggcgtta atgaagtggt 600 acattcaaca acgctcctgt ggaattccag taagagctct tgagctaaaa actgcagtaa 660 tcaagttagg aaagtatatg aatataaata ttaaagcaag tgatggatgg ttgtggcgct 720 tccgaaaaag acacggaata tcaaacaaac ggatttatgg cgatgcaatg agttgtgcag 780 tggaagaaat tgaaccattc agacaaatgt ttaaagattt tatgactgct aaaaacttct 840 caaagtttca gatttataat gctgacgaga ctggtctact ctggcgcact ttacctgaga 900 atactcaagc ttataatttc gataaaaata caccaggaag aaaaagttgt aaggatagaa 960 tttcagcgtt gctttgtgca aacgcagatg gcagccatag attaacccct gtgattgttg 1020 gcaaatcttg caaacctcgg gttttaaaag atgttatgca ttctttgccg gtagtatatt 1080 acaattctaa aaaagcatgg tttacttcag aaattttcaa aaattggttt tttaatcatt 1140 ttgttccaga tgttataaaa tttcaacatg aaaagcttgg aattcttcct gcagaagcta 1200 ctggtatttt gttgttagat aattgtcctg cgcatccgtc agaaagtatc ttaagaacac 1260 cagatggcaa atttacttgc atgttcttgc caaaaaatac aacagctctc attcaaccaa 1320 tggatcaagg aattatatta gcaacaaaac gaatctatcg caagaagttt ttagatgaag 1380 tgatggttgt gtgcttgaat caaaaagagg aagaagatac aagaggtcaa cgcacagtac 1440 aaaatttgaa aaattacaat attaaatctg caatttttaa ttttgccagt gcgtggaaag 1500 aagtgaaaga gctgacacta acaaacggtt ggaaaaatct ttttaatgga attgatgaag 1560 atattgatat tgttcttgat ggtattgaag ttcgtgactt ttatgaggtg atccaaaata 1620 acggggaaac taaagtagca gaggatgatg tttacgaatg gttggaaata gacgaagatg 1680 atcctggcta taatattatg actgagggtg aaatagcaga tctgcttata aatgagcaaa 1740 acagcgaaaa aagtgaatca acaactgaag aaaacacacc tcgaagggta aaaatatcaa 1800 caatcgcttc tcaccttgcc gatctgatcg cgtttattga cacttcgccg gacagtgatt 1860 tccaacaata ttctatgcac tttcgccttt ttcgcgagtt aattatcaag aaacaacgag 1920 cgctattaaa acagacaacc ctcgattcat ttttgattac atcttcatgc ccgaaacaat 1980 tagatagtga aaaaacgaac ttccaagatg aaaaacggga taataattaa acaactttaa 2040 gttaattttt ttatcttgta aaaaactttt atatgcgcat tttaaatgca taagtaacaa 2100 ttgtgacatt gtttttaacc cagagtcttt ttaagaagat ttatggagtc gatacgaaaa 2160 tttacttata acggactttc ggttataacg gacaacccct tccccgatat agtccgttat 2220 aacgaggttc tactgta 2237 // ID Copia-10_DPu-LTR repbase; DNA; INV; 260 BP. XX AC scaffold_260; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_DPu_; KW Copia-10_DPu-LTR; Copia-10_DPu-I. XX NM Copia-10_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-260 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 684-684 (2010). XX DR Genome; scaffold_260; Positions 73342 73083. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 260 BP; 76 A; 55 C; 40 G; 89 T; 0 other; tgttggagat aaaggcaagc tgaacatcct aataatgtca ccaggtgtca ctatacttaa 60 cctgtcctac cgataatgta gtctgctagc cagactatct gttcttacct gtcaagtgta 120 tctacttatc tctgtctcac tctctatact cgtgtcagct agtaaggtgc aataaaagta 180 tggtaactct agtataattg tgtctgtcta ctttacttat tcattactat aaactactag 240 ttaataagtt aacctcaaca 260 // ID BEL-242_AA-I repbase; DNA; INV; 6405 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-242_AA_; KW BEL-242_AA-LTR; BEL-242_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6405 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [5466-6023] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 668..2167 FT /product="BEL-242_AA-I_3p" FT /translation="MQHLSSIEERNEGNVSQEVQNPLDWSLTWDLPETSQP FT LPKGQLAVDLQEIQRRFTDVQLHRKPNHPPGCSFTTIEAMRKNNEQNTEIP FT ATMSAHGRSFTQAALEPTIDQPVQLSGLEASHPRQQMSAASFSVSQSCSNY FT TSTQIRPLQSNVTFSLPNRTSFIQHTHYPVTEPCNPPHSFQNVAAFNPLHK FT STPHVRPATTPSVRFADYPLVSSSNLPSVPQVHTVQSTPLPIASAVAAPSG FT MPSAIPPFDSTLAPASVPQPWISAPTAQQLAARHIVPKELPNFSGDPVEWP FT LFISCFQNTTQLCGFSHGENLMRLQRSLEGNALEAVRSFLLEPSSVPMIIS FT TLQTLYGRPDLVINSLLQKVRITPAPKPEKLESLVSFGLACQNLCGHLRAV FT GQRAHFSNPALLQELVNKLPANIKLDWALFKQKCPVVDLGTFGDYMAQLVV FT AASDVAPFQPSDAAIASSEKRKGKEKLYVNTHSTGNPEDSAGKRYQSGSTG FT TKQF" FT CDS 5181..6302 FT /product="BEL-242_AA-I_4p" FT /translation="MEGRIDGFCEAAYEFKYPAVLPKNHYATTLIVDSYHR FT RFNHCNGETAVNEMRQRFHLSEMRAAFRKVRKLCMWCRVFRATPAVPRMAP FT LPDARVTPRVKPFSFVGIDYFGPLLVVQGRHEVKKWVALFTCLTIRAIHLE FT IASSLSTECCKMAIRRFISRRGAPSEIYSDRGTNFVGVSGELREQLRSINT FT ELATTFTNTVTQWRFNPPAAPHMGGSWERMVRSVKCAFASLSVEREPSEEV FT LVTLLVEAESVVNSRPLTYMPIESAEHEALTPNCFIMLSTSGVNQPPTQLV FT DDKLVLRSSWFLYQRLLDQFWTRWVKEYLPTITKRTKWFVDTKPVSAGDLV FT IIVEDRLETAGFEGSYYVYSRDEMAGAAVQT" FT CDS 4276..5268 FT /product="BEL-242_AA-I_2p" FT /translation="MLFPRISSRKLRNSAATRTCGCQRGGVRGGSLFSDSG FT PDSSSLCLSFLENQGGSTKPLSVPRLELQAALLGARLAKSVTENHTLCIKQ FT RFFWSDSTTVLSWLQSDQRKYRQFVACRVSEILDSTKVDEWQYVPTRMNVA FT DDATKWKEGLQIDTGHRWFSGPRFLYDPPSMWPQQKKLSCPTAEELRPVHV FT HKEVINEALVQFNRFSKWERCLRAVAYVHRFIDQLKRKKHGDVPEVQLLLT FT REELQRAEHTIIRLAQMEAFADEIATMQGNQSCHQINVIGWKKLVGSTSYR FT LFWIIKESFAWRVESTGSVKLHTSSNILPCCRRTTMRLR" XX SQ Sequence 6405 BP; 1677 A; 1620 C; 1626 G; 1480 T; 2 other; aaacttttag atttttatcc caccatcgcc aaattcgacg ccaacaaagt acaatggcgg 60 ccagacacaa ttgtacatgt gatcaaccga tcgtcgatgg tgataagata gaccattgta 120 gcgaatgttt tcgctctatt cattccgcgt gtgctgtgat cggtcgttgt atcggtggtg 180 agcgtgtgtt gtgtgttctt tgccagcgtc gggctccacc accgcctcga tcgcattcga 240 gaagatcgac gagctcttcc gttcgagcca gaattcaact ggatctacaa cgactcgaag 300 aagaaaatcg attgcaggag agagcagcag aggaaaggct ccggcgtgag agggaatatt 360 tagccaaaaa atatgacctt atgcacgccg agttgctcga ggaaggcgaa gcaagcagcc 420 gtcgttcaat gtcaagcatc gaaaaggttc agcagtggct gcaggataag cctaccacga 480 ctaccgggtc cggcatcggg tgtaatgaaa tccccgccgg tgtatcgtca cggacaggaa 540 caatccgcaa gcccagccca tcagtgccaa caaccattgc tgcagcttca acaccgaagt 600 cagggcagcc cgccgagctt acttttgggc tcaatccgag ccaaccaatt tgcccagatg 660 cacgaggatg cagcatttga gttcgattga ggagcgcaac gaaggtaacg tatcacagga 720 agttcaaaat ccgttggatt ggagccttac ttgggatctt cccgagacat cgcaacctct 780 accaaaagga cagcttgcgg tggatctaca ggagattcag cggaggttta ctgatgtaca 840 actacatcga aagccaaatc atcccccggg gtgttcgttt acaacgattg aagcaatgcg 900 aaaaaataac gaacaaaaca ccgagattcc cgcaacaatg tcagcccatg gccgaagctt 960 cacccaggct gcactggagc ccaccattga ccagccggtt cagttgtccg gcctcgaagc 1020 atcacatccc aggcagcaga tgtcagcagc tagtttttcg gtgagtcaat cgtgttcgaa 1080 ttatacttct acacaaataa gaccgttaca aagcaatgtg acattcagtc taccaaatcg 1140 tacgtcattc attcaacaca ctcattaccc tgtgaccgaa ccatgcaatc ccccccattc 1200 ttttcaaaat gtggctgcct tcaatccgct gcataaaagc actcctcatg tccgccccgc 1260 aacaaccccc tccgtgcgtt ttgctgacta tccgcttgtt tcatcatcca atttgccatc 1320 ggtgccccaa gtacacacgg ttcaaagtac tccattgcca atagcgtcag cagttgctgc 1380 acccagtggt atgccatccg cgattccccc ctttgactca acccttgcac cagcatcagt 1440 acctcaaccg tggattagtg ctccgacagc tcaacaactc gccgccaggc acatcgttcc 1500 caaagagctt cccaactttt cgggcgatcc tgtcgagtgg ccgctgttca taagctgctt 1560 ccagaacaca acccaactat gtggattctc acacggcgag aacttgatgc gtctgcaacg 1620 aagtctcgaa ggtaacgccc ttgaagcagt gagaagcttt ctgctggaac catcgtcggt 1680 tccgatgatt atctcgactc tccaaacact ctatggacgt cctgacttgg tcatcaactc 1740 tcttctgcaa aaagtgcgta tcactccggc cccaaaaccg gaaaagctgg aatcattggt 1800 atctttcggt ctcgcatgcc agaatctttg tggacaccta cgggcagtcg gtcagcgggc 1860 ccatttctcc aacccagcgc tgctgcagga gttggttaac aagttgccgg cgaatattaa 1920 gttggattgg gctctcttta agcagaaatg tccagtcgtc gaccttggga ctttcggaga 1980 ctacatggct cagttagttg tcgccgcaag tgacgtggcg cctttccaac catctgatgc 2040 tgctatcgct agttcggaga agaggaaggg aaaagagaag ctgtatgtga atacacattc 2100 caccggtaac ccggaggaca gcgcaggaaa aaggtaccaa tccggcagca ccggaaccaa 2160 acagttctts ggaaaaccgt gcccagtttg tggaaaacta gggcacaaag ttagagaatg 2220 cgacgatttc aagaactgca atttggaaga ccgttggaag cgtgtgcagg aacaccacct 2280 gtgcagacgg tgcttgatag cacatgggaa gtttccttgc aaagcgacgt cgtgtgggtc 2340 ggaaggatgt gaagaacgac accacaagct gttgcatcca ggcaaaccgc aatcggctgc 2400 tccagctagg aatccgccag cgatcgaaac aatcaacgtc atagcacacg caaagtagcc 2460 acgctcttcc gtatcgttcc gtaacgcttt tggcaatcga aaattcgttc atacttttgc 2520 gctttggacg atctaccctt gtcgacgcag gtgttgctcg gagctgggag tgcaggtgaa 2580 gcatatccgc tttgccttca gtggaccggc gaaatcgaaa gggcagagga caactcacag 2640 ctgattcata ttgaaatctc aggacgtggg tcgcagaaac gacatgtgct gaaagcagct 2700 catactgtgg aaaaattgtg ccttcctaaa caaagtctgc cttttgaacg attgtctgaa 2760 gaattcccgt atctccgtgg tttaccggtc gatgggtata gcaacgcagt tccaaccatt 2820 ctgattggtc tagacaatac gcatttgaaa attcccctca aaattcggga aggtagaatt 2880 ggccagccgg ctgcagcgaa aaccagatta ggttggacag tctacggtcc gatatccaat 2940 gcgaattccc cgttggaaca ttgccacttt catctgagcg aagaagctcg gagttctgat 3000 gaagttttgc acgatctggt gaaggatttt ttctcaatag agagtgtcgg cgtggctgtt 3060 gctccattgc tcgaaggctc cgatgaaatg cgttctagga aaattttgga agatactaca 3120 atccgtctgc catccggacg gttccaaaca gggctactgt ggaagcacga tcacataaat 3180 tttccggata gcaggcccat ggcggagaac cggatgaaac ccttggagcg tcgtttatta 3240 cggaaccctg agcttttcga caacttaaaa catcagattg tggaatacgt gaagaaggga 3300 tatgcgcata aaattaccac cgaagaaatg ctcaacacag acccaaggag agtctggtat 3360 ctccctttgg gggtagtcgt gaaccctaag aagcctggga aagttcgcat cgtatgggat 3420 gcagcggcga aagttaaagg acagtcactt aactccgctc tgcttgctgg accagatctt 3480 ctaacgtctc ttccatcggt tctatcccgt tatcgccaac gtcaagttgc aataagtggc 3540 gatatacggg agatgttcca ccagttccaa atcagaccgg aggacagaca ggctcaacga 3600 ttcttgttta gaagcgactt atccaaggcg ccagacacct acgtcatgaa tgttgcacct 3660 tcggcgcgac ctgctcaccc tctgcggcac agttcataaa gaaccgaaat gcgaaggact 3720 tcgagactga ataccctgct gcagctaccg caatcgtcca caaacattat gtggacgatt 3780 atttggatag tctggatacc gtcgaggaag ccgtggatat ggcactccag gtgaaagagg 3840 ttcacgctaa ggctgggttc cacatacgga actggatgtc cagttctaaa gaagttctgt 3900 cacgggtcgg ggactcggca gaagagcaag tgaaaagctt ctcagcacat gccacttcgt 3960 catcagaacg cgttcttgga atgagctgtt tccggaaaca gacgagttcg cattcagcgg 4020 tttgtttcga gatgagttga tgccattact ccaaggagac gtcataccga cgaaaagaca 4080 gcttctgcag gtggtaatga gtatttttga tccgcttggc ctaatatcgc tcgtcgtggt 4140 gcatggaaaa atcctcattc aaaacgtttg gcgagcaaac atcgcgtggg acgaaaaaat 4200 caacagtgaa cagctcagtg aatggcgtcg ttggatcaag ctattgggcc aacttgatgc 4260 cgttcgaata ccacgatgct atttcccaga atatcttcca gaaagcttcg gaactctgca 4320 gctacacgta cttgtggatg ccagcgagga ggcgtacgtg gcggcagcct attttcggat 4380 agtggaccgg actcaagttc gttgtgtctt agtttcctcg aaaaccaagg tggctccact 4440 aaaccattgt cagttccccg tctcgaacta caagctgcat tgcttggagc aagattggcg 4500 aagtcggtca cagaaaacca cacattgtgc atcaagcaac ggtttttctg gagtgattca 4560 accactgtcc tttcctggtt gcaatccgat cagcgaaaat atcggcagtt tgtagcgtgc 4620 cgtgtatcgg aaatcctaga ttccacaaag gttgacgaat ggcaatacgt tcccacacga 4680 atgaatgtgg ccgatgacgc cacaaagtgg aaagaagggc tccaaatcga caccggtcat 4740 cgctggttca gtggaccgag attcttgtat gatcccccaa gtatgtggcc gcaacagaag 4800 aaactttctt gccctactgc ggaggagctt cgacccgtac acgtccataa ggaagtcatc 4860 aacgaggcgt tggtacaatt taaccgcttc tccaaatggg aacgatgtct acgtgctgta 4920 gcttatgtcc atcggttcat cgatcaacta aagcggaaga agcatggaga tgtaccggag 4980 gttcagcttc tgctcacccg tgaggaacta cagcgagcag aacatacgat tatcaggctg 5040 gcacagatgg aagccttcgc ggacgagatc gcaacgatgc aaggaaatca atcctgccac 5100 cagatcaacg tcatcggctg gaaaaaacta gtaggctcta caagctatcg ccttttctgg 5160 ataatcaagg aatcgttcgc atggagggtc gaatcgacgg gttctgtgaa gctgcatacg 5220 agttcaaata tcctgccgtg ctgccgaaga accactatgc gactacgctg atcgtagatt 5280 cgtaccatcg gcggttcaac cactgcaacg gagagacggc cgtgaatgaa atgcggcaaa 5340 gatttcactt gtcggaaatg cgggcagctt tccggaaggt tcgcaagcta tgtatgtggt 5400 gcagggtgtt cagggcaacg ccagcagttc cacgaatggc accactaccg gacgctcggg 5460 tcactccacg cgttaaaccg ttcagcttcg tcggaataga ctacttcggc ccgcttttag 5520 tcgtacaagg ccgtcatgag gtgaagaaat gggtcgcact cttcacatgt ttaaccatta 5580 gagctatcca tctcgagatt gcgtccagcc tgtcaactga atgctgcaag atggctatac 5640 ggcggtttat atcgcggagg ggggcacctt cggagatcta cagtgatcgg gggacgaatt 5700 tcgtcggagt cagcggagaa ctgcgagagc agctgagatc catcaatacg gaactagcta 5760 cgactttcac caacaccgtt acmcaatggc gcttcaatcc accagctgca ccgcatatgg 5820 gggggtcttg ggaacgtatg gttcggtccg ttaagtgtgc atttgcttcg ctatcagttg 5880 agcgcgaacc tagcgaggaa gtattggtca cgttgctcgt agaggcggaa tcggtggtga 5940 attcgaggcc gttgacctac atgcctatcg agtctgcgga acacgaagct ctaactccga 6000 attgcttcat catgctgagt acgagcgggg tgaatcaacc tccgacgcaa ctcgtagacg 6060 ataagttggt cttgcgttcg agttggttct tatatcaacg tcttttggat cagttctgga 6120 cacgctgggt gaaggaatac cttcccacca ttaccaaacg aaccaaatgg ttcgttgaca 6180 caaagccggt ctctgccggc gatctagtga tcatcgttga agatcggctc gaaacggctg 6240 gattcgaggg atcgtactac gtgtattccc gggacgagat ggcaggtgcc gcagtgcaaa 6300 cgtgaagaca tcaaccggag ttctacgtcg cccggtgtca aaattggccg tcctggaagt 6360 ggctgataca gctcgcgacg acacggagca atacgggtcg gggaa 6405 // ID Gypsy-5_TCa-I repbase; DNA; INV; 3961 BP. XX AC ChLG7; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_TCa_; KW Gypsy-5_TCa-LTR; Gypsy-5_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-3961 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG7; Positions 16910859 16914819. XX CC Positions [1459-1980] - Reverse transcriptase CC Positions [2995-3531] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1444..3909 FT /product="Gypsy-5_TCa-I_1p" FT /translation="MMALGICRPSNSPWASPLHMVPKKDSNWRPVGDYRRL FT NAVTKEDRYPIPHLHDFAHLLAGKTIFSTVDLVRAYHQIPVEASSIPKTAI FT TTPFGLFEFTRMQFGLRNAAQSFQRFIHEVLNGLDFCFPYLDDILIASTSE FT REHTDHLRKVFERLLKYGLTINPEKCSFGNSKVNFLGYEVSADGTKPLTDR FT VKVILNYPLPKTSKDARRFLGTINFYRRFVPHAAAHQAPIHNLVKNSKKND FT KTPLSWTPESKAAFELCKHDLAQATLLVHPTTTDTISLTVDASDFAMRAVL FT EQNQGGSWKPLSFFSKKLTPAQQKYSTYDRELLAIYSAVKAFQHFLEGRHF FT VIYTDHKPLVYAFTQKSDKATPRQARHLDYISQFTTVINHISGKSNVVADT FT LSRISEIDTPTPIDYQTLRESQDVDPELQTLLAQPDSTALKLKLFTVPGSE FT KQIYCDDSNRRIRPFIPKTSRYELFKHFHGMAHPGIKATTKLLTDRFVWPG FT INKDVSRWTRACINCQRSKVQRHTQPPITEIGTSDERFAVINIDLIGPLPP FT SNGFTYCLTCIDRYTSWTEAIPLTDITAESVAKALYSGWISRFGTPLKIIT FT DQGRQFESSLFASLSTLMGFRRARCTPYHPATNGKIERWHRTLKTAIKAHA FT SPNWTEHLPTVLLGLRTVIRDDTPVSASEMVYGSTIRLPGEFFQDSTNTID FT PATFVGQLKANIANVRPASAPHHDNRQIFVPQNLDTCTHVFVRRDAVRKPF FT DPPYDGPCKVLNRTEKYFTVDLNNKSTNISLSRLKPAFLLNDNPIRHDHTY FT ANWAITPEKTDLKKTVRFKL" XX SQ Sequence 3961 BP; 1155 A; 1108 C; 757 G; 941 T; 0 other; ttggtgaccc cgcacaacac gcgcatcgaa aacaccacgc aacagtgcac gttacaacat 60 gcccgataac ccaccagatt tcaacgccgc tgtccaggtc gagatcgaga gggttactgt 120 taaaccgcca cccttttgga aaggtgatcc caagctgtgg tttatccaac ttgaagccca 180 attcaacctc ggcaagataa caagcgatac aactaagtat cactacgtgg tcagtgctat 240 cgacacatca gttcttcagc aaattgccgc tttcgtgacc aatcctcccg ctgtcaacca 300 atatgaaggt atcaaacaac ggcttatttc tacattttcg gattctcagg aacgccaatt 360 gcgtaaacta ctgtcggaaa tagacttagg cgacaagaaa ccttcgcaat tgcttaacga 420 aatgaagcgt ctgggtgatt cggctgtatc cgacgaatta ctgaaaactt tgtggttaca 480 aagattgccg acccacgtcc aatccgtgct cgcaaccagc tccgacccga ttcagaacct 540 tgcgcaaatg gcagacaaaa tcgtggagat acaacacatg tcctcagttt gtacagttac 600 gaccccaacc gaagatatgt ctgcagctat cctgcaactc actaaggaag tcgcagcatt 660 aaaggctgcc tacaatactg aaattaagcc gcgacgatca cgatcacgac cacagacgcc 720 agaccgttca agcagggact cttgttggta ccaccgaaca tttggcaata aagcaaagaa 780 gtgtatcact ccctgcaact tcaacaagtc ggaaaactcc acgccgcgcc agtaacggcg 840 gcaagcgacg cggcgtgcgc cagccgcact ataaatgcaa tccatgccag tcgacgcctc 900 accataaccg atccaaccac cggtatttcg tttttgattg actctggttc ggatatttca 960 atcatcccca agaaagcacg acagtcaaca gtaagtgacc tcgtacttta cgctgccaat 1020 ggcaccgcca ttcaaaccta cggttacgag agacttacca ttactttggg attacgacgt 1080 cgattaattt ggaacttcat catcgctgat accaaacgag caataattgg cgccgattta 1140 ttacattatt tcgatctttt aatcgatatt agacgggcca gactaattga tagcacgaca 1200 ggaatctcca gcgcaggcac atgcatttcc gcctcgcaca ctgcagtgtt gtcagtttca 1260 ccgaacgctc ggtatcgcga cctgatcagg gaatttccag ctctgacaac ttttcacaat 1320 gtcgaaccaa tccttccaca tgcgacccag catctaatcg aaaccaccgg acctccagta 1380 gcttcaaaac cacggagact ccctccagac aaactacaaa tagccaaaag agagttcgaa 1440 catatgatgg cgctagggat ttgcagaccc tccaatagtc cctgggcaag tcccctccat 1500 atggtcccta aaaaggacag caactggcgt ccggtaggtg actatagacg actcaacgcg 1560 gtcactaaag aggatcgtta cccaattcct catttacatg acttcgctca cctgcttgct 1620 ggtaagacca ttttttcaac cgttgacctc gttagagctt accaccagat cccggtcgaa 1680 gcgagctcta ttccgaaaac cgccattacc acaccttttg gccttttcga atttacaaga 1740 atgcaatttg gtcttcgtaa cgccgctcaa tcctttcaaa gattcatcca cgaagtcttg 1800 aatggtctcg acttttgctt cccctacctg gatgacattc tcatcgcctc cactagcgaa 1860 agggaacaca ctgaccacct tcgcaaagtc tttgaaagat tattaaagta tggccttact 1920 attaatcccg aaaaatgttc ttttggaaat tccaaagtga atttcctggg gtacgaagtt 1980 tcagcagatg gcaccaagcc cttaaccgac cgagtcaagg tcattttgaa ctacccgcta 2040 ccaaagacat ccaaagacgc tagaaggttc ctaggtacaa ttaactttta ccgacgtttt 2100 gtacctcatg cagccgctca tcaagccccg atacacaatc tagtcaaaaa cagcaaaaag 2160 aacgataaaa caccgctatc ttggaccccc gaatcgaaag cagcattcga gctgtgtaaa 2220 cacgacctcg ctcaagcgac gctcttagta caccctacaa ctactgatac tatttcgctc 2280 acagtcgacg cttctgactt cgcaatgagg gcagtgctcg aacaaaatca aggaggtagt 2340 tggaaaccct taagtttctt ttcgaaaaaa ttgacaccgg cccaacagaa atatagcacg 2400 tatgataggg aactgttggc aatttattcc gcggtcaaag cgttccaaca ctttcttgaa 2460 ggccgtcatt tcgtaattta cacagatcac aaaccccttg tgtatgcttt cacgcaaaaa 2520 tctgacaagg caactccgcg acaagcacga catctcgact acataagcca gtttacaacc 2580 gtcatcaacc acatttctgg caaatccaat gtagttgccg acaccttgtc cagaatcagc 2640 gaaattgaca caccaactcc tatcgattac cagacactac gcgaatcaca agacgtagat 2700 cccgaactgc aaacacttct ggcacaaccc gattctacag cactgaaact gaaattgttc 2760 accgtccctg gttcagagaa acaaatttat tgcgacgatt ctaaccgcag aatccgtccg 2820 ttcatcccca aaacctcacg gtatgaactt ttcaagcact ttcatggtat ggctcacccc 2880 ggtatcaaag ccaccacaaa actcctcact gacagatttg tctggcctgg tattaacaag 2940 gacgtcagcc gctggactag agcttgcata aactgccaga gatccaaagt gcagaggcat 3000 acacaaccac cgatcacaga aattggtact tctgacgaac gattcgcagt tatcaacata 3060 gatttgattg gaccattacc accttcaaac ggctttacct attgcctaac ttgtatcgac 3120 cgctacactt cttggaccga agccattccg ctaaccgata tcacagccga atccgttgct 3180 aaagccctgt attccggctg gatctcccgg tttggtacgc cgctgaaaat tatcacggat 3240 caaggcaggc aattcgaaag ttcgctattt gcttcacttt ccacattaat gggttttcgc 3300 cgcgcgaggt gcacaccgta ccatccagcc accaacggca aaattgaacg ttggcataga 3360 acactaaaga ctgccattaa agcccacgct tcaccaaact ggacggaaca tttacccacc 3420 gttctcctcg ggctgaggac agtcatccgc gatgacaccc cagtctcagc ttcggaaatg 3480 gtttatgggt caaccattcg gctccccggt gaattttttc aagattcaac caacaccatc 3540 gaccctgcaa ctttcgtggg acaactcaaa gcaaatatcg caaatgttcg accagcgtca 3600 gctccacatc atgacaacag gcagatattt gttcctcaaa atcttgacac ctgcactcac 3660 gtttttgtcc gaagagacgc agtcaggaaa ccttttgacc caccttatga tggaccctgt 3720 aaagttctca atcgaaccga aaagtatttt acggttgatc tcaacaacaa gtcaaccaac 3780 atttctttgt ccagactgaa accagctttt ctgctcaacg acaaccccat tcggcacgac 3840 catacttatg ccaattgggc tattacaccc gaaaagactg atttgaagaa gactgtcaga 3900 ttcaaattat aaatagtttt ttctattact catcgtggct ccaccactgg caggggagta 3960 c 3961 // ID Chapaev-1_BM repbase; DNA; INV; 1147 BP. XX AC . XX DT 28-APR-2010 (Rel. 15.07, Created) DT 28-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE DNA transposon - consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1147 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 934-934 (2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. XX FH Key Location/Qualifiers FT CDS 461..1009 FT /product="Chapaev-1_BM_1p" FT /translation="MMKQFVKALDKNNASFEYLCKKIPRLSDAKIKEGVFD FT GPQIRSLMADEKFDATMNNTESDAWLAFRDVVNNFLGNNKHPDYKNKVANL FT LDKYQKLGCNMSIKLHFLDSHVDFFPDNLGDYSEEQGERFHQDIKTMETRY FT QGRWNVNMMADYCWSLTRDITEDTHKKTTPRRNFVAKRKRCHSK" XX SQ Sequence 1147 BP; 409 A; 173 C; 192 G; 373 T; 0 other; tatcacatca ggttttaaaa aaaattaaga cattggctat tttaaatttt ttgattaaaa 60 taaattaaat tttagatttt ggcgctcttt tctcgaatcg tgcgacctct cgtgaggaaa 120 tatattagtt gcaacaacga cacacccatt cgaaacaggt tacatgcaaa taaacctcta 180 cagttacttt gttcataaac tttttactga aattagtgtg tttatttaaa aaaaaatctt 240 acgttacgta cttatatata tttttttaat ttttttttac ttatatacaa aatttccatg 300 cttcttgtgc gagtgggaca gtagagcacg agacaagcac tacgttcaaa aataatggaa 360 tcctcgtaaa aatctcaaag taggaacaaa aaacgttata cgaaaaaaat taattagtcc 420 tgaaaaaatt ttactacctc cgcttcatat caaactcggt atgatgaaac agttcgttaa 480 agcattggat aaaaacaatg ctagttttga atacctgtgt aaaaaaattc caagattatc 540 tgatgcaaaa attaaagaag gagtgttcga tggaccgcaa atcagatctt tgatggctga 600 tgaaaaattt gatgccacaa tgaataacac tgaatcagat gcttggctgg cgttcagaga 660 tgttgtcaac aattttctag gaaataataa acatcccgac tataaaaata aggttgcaaa 720 cttattggat aagtatcaaa aattggggtg taatatgagc ataaaactcc actttctaga 780 ttcacatgtg gacttttttc ctgataatct tggtgattac agtgaagagc agggagaaag 840 gtttcatcag gacataaaaa ctatggagac aaggtaccaa ggtcgatgga atgtgaatat 900 gatggctgat tactgctggt cactgacacg tgatattact gaagatactc acaaaaaaac 960 tacacccaga cgtaattttg tcgcgaaacg taaaagatgt cactctaaat agtcctgcag 1020 cattgtttag actcaatgtt ttctttgtaa atgttcaaat atatgttttt gtaatatgat 1080 tattgatttt ttcgttaaat tttgaaaaaa atgctttttc cttcttttaa aaaaatctga 1140 tgtgata 1147 // ID R1B_DS repbase; DNA; INV; 1467 BP. XX AC AF015813; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Dugesiella sp. retrotransposon R1 reverse transcriptase gene, DE partial cds. XX KW R1B_DS. XX OS Dugesiella sp. OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Araneae; OC Mygalomorphae; Theraphosidae; Dugesiella. XX RN [1] RP 1-1467 RA Burke D.W., Malik S.H. and Eickbush H.T.; RT "R1 and R2 Provide an Estimate of the Age and Stability of RT Retrotransposons."; RL Unpublished. XX RN [2] RP 1-1467 RA Burke D.W. and Eickbush H.T.; RT "R1B_DS."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015813; Positions 1 1467. XX SQ Sequence 1467 BP; 464 A; 325 C; 376 G; 302 T; 0 other; gcatacgcgg atgacctgct ggttctaatt ggcggtaaca acagagatcg actaaaagaa 60 gacgtaagct ccatcttcac agatatcaac gattgggccc tggctaacaa actaagtatc 120 aacaaagaaa aaacaagggc catttggtta ccggggagag atcaaggact actcctacgc 180 aatgtaaaac tccctgaggt gaaattagaa cagcatgtaa aatatctagg gatcacctac 240 agctctgatg gcacattcaa ggagcaccag cgaagaatag acagcaagat aaggaagcta 300 aacaataaaa taagatcaat caggatcgga aatcgtggat taccaggtga aaagatgaga 360 agaatatata agacggtgat cgagaaaatt atatcatatg gagccaacac ttggacaaaa 420 gcacttggaa gtgtagagaa gaagaaactg aacagcctac aacgaaccat gttgttaagc 480 gtaacagagg cataccgtac aacttcaact gaagccatgc aggtcatcgc gaatataaca 540 cctttagatc tgattcttcg agcagagagt aagagggcgg acattctgga gtgtgggggg 600 agcgccacag tctcgggcag gccgataggt gcttcaggga ttgagatgcg gccgcacaag 660 tttgcgaccc atcccaagaa ctggacgcgt gtcggctggg ccgagtggca cgatcgggat 720 gactatattc tgcagatcta tacggatgga tctaagactt ccgatggtac gggggctgcc 780 ttcgtggtcc tggacagggg caggcaaatt ttctctgccg gattctctct gtccaagcat 840 catacccatt accaggcaga ggcagtggca attttgaagg ccacgatgtg gttcgcagag 900 gaatgccaag ggaataaagt tgctattatt tcagacagcc agtcagcact aaaagcccta 960 tatcgcaccc aggaagtttc tcccaccatt cgagacatta aaagaaccat cacaacaatc 1020 aaaagacaag gaagacaaat cgatctttat tggacgaaag gtcacgcggg tcaagctggg 1080 aatgaaatgg cagacagagc ggcgaaggag gcagttattt caggtcaacg atacgtcatg 1140 ccgcgaccgg tttcctgggt caaggcactg atcaaaaaag aaactctgag agaatgggcc 1200 acaaggtggg ccggctctga caagggacga cacacacaca acatcatctc tgccccgggg 1260 tttaaggagt ggatcttctc caaagaaatc actcaaatct taacgaatca cggtaggaca 1320 ccttcgtaca tgcatcgctt tggtttaaca agttctccac tctgctcctg tggcggggtg 1380 ggggactggg aacattactt ctttttttgc agatcaacag aagcagtagc cggagtgtcc 1440 gcagattaca gaacctacaa caactag 1467 // ID Gypsy-170_AA-I repbase; DNA; INV; 7335 BP. XX AC supercont1.294; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-170_AA_; KW Gypsy-170_AA-LTR; Gypsy-170_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7335 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.294; Positions 675181 667847. XX CC Positions [5194-5670] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 725..2905 FT /product="Gypsy-170_AA-I_2p" FT /translation="MASAKLTYDQAAELLKGLYYELRIEFLFEDELDFELE FT SRGVILPRGCDVTRKRKMLRDTMKAEKDGTAEKWQQIIKGDTEREVKICMQ FT KFGELQTLVRQNSEEDTKKMCMSRLVHVGVRIMTILTGSRSSPNLFGSLQN FT ALVDLVEFMEESAMINSTIETTEINPPEKSPEAEVENVNNIPLIEHFQTEA FT TEDSRQPMSIFTQEDLEWVHSLQRRIVDLEYELKRQKEVREIATQTVDEED FT PLIQYPNSQFQTMNSWSVSSFPSQNLPPSMTTFQPPRQNSTFTIQNHPNQP FT GLSTAQTQQVLRTEIPAQASMLNTQGSFIPSYPNISNPQPSYLPHNPSANT FT QPVNGNAAVPTFLNRPHQATHHLSNHSLLPTYQPRHTLPVSKWNITKYSGD FT DQGLQLNEFLEHVRALSFAEHVSERELFESAVHLFTGSALKWYMSQRATNR FT LLDWQHLVFELRRTYMHPDLDALIKMKIYQRRQQKQESFHEYYFEIEKLFR FT SMSMQIPDYEKVQILQQNMRIDYKRQMTFLPITDLETLVAAGQKLDALNFS FT AYNKVFGSDKTVHAVEDNFKPKGTNGPNAPQPTRSRQPNPVVPHPFQKGFN FT RNNNSNKQNPQNQLNSTSNRSTANPPSTEAQNAGVSTSTSSSSSSSQTLAG FT PSTYPLRPQLTLEELVNRHTPPPNNTCFNCRRIGHEAPMCRQQRAIFCSNC FT GLHGFPTNYCPYCVKNGRSASENRRSQ" FT CDS 3304..6048 FT /product="Gypsy-170_AA-I_1p" FT /translation="MDFWEKFHIIPTLLQKDIVQKKETFAVLDSKQTKEDA FT ETNLTEEQIHEIEKIKQLFLTAQADNLTLTNRAIHQIVLKEEWKHKPPVRQ FT FPYVMSPKTQSLVAVELKRLLDLGIIERSNSEWSLNCVPVIKPNKVRLCLD FT ARKINERTVRDAYPLPHPGRILGQLPKARFLSTIDLSEAFLQIPLEKESRK FT FTAFSVQGKGLFQYTRMPFGLINSPATLARLMDQVLGHGELEPFVFVYLDD FT IVVVTDSFEHHLELLQEIAQRLKLANLSINLSKSKFGVPEIPFLGYLLSVD FT GLQTNPEKIRPIVEYERPTTVTKLRRFIGMINYYRRFIPDFSGVTAALTDL FT LQTKHKNIHWNEAAERAFCLIKEKLISSPILGSPDFNREFVIQTDASDVAV FT AAVLSQRQEDGEKVISYYSHKLTTPQRNYHAAEKEALAAIMAIEAFRGYVE FT GYHFTLITDSSALTHILSTKWKVGSRCSRWALNLQQFDMTVVHRKGKDNVV FT PDALSRSIATVQENKTLSWYTSLMQKVKDRPDAFVDFRIENGVLKKFVTAR FT EKGHDCRFEWKIIPTPESIDAILHSSHDDCFHLGFEKTLNRIKQRYYWPKM FT SSQIRKYVQQCSTCKEVKGSSIPVVPEMGKQRLASQPWKIISADYVGPLPR FT SRRGNQYLLVIADYFSKWILLQPVKSIQSNSLCTILKEQWFLKNSVPEIII FT TDNASCFVSKEFKHLLDQYKIVHWLNSRYHSQANPVERVNRTINAAIRTYV FT KEDQRLWDTRTTEIEMVLNTSVHSSTGYTPYYITHGHELSEYGPDYQRARH FT QEPLNEAERVERRKTQFSRIYDIVQRNLAKSHDASRQRYNLRHKHFAKAFQ FT PGQLVYRRNMKQSNAGEWYSAKYGSQYLPCTVSRKIGTSSYELVSSDGKNL FT GIWPAAHLKPG" XX SQ Sequence 7335 BP; 2309 A; 1610 C; 1469 G; 1947 T; 0 other; ttttgaaaag tattgtttta tatttgtcca gcattcacgg gaagataaag tcctcttaga 60 gaccatttta aaacgaccct gctagtcgga acaaagaact ctatagttac aattggcgcc 120 caacgaaaaa taaaagtaaa aaaaagtgtt ttgcctcaca aaacactttc ccttggtctc 180 tgggagagat tttattcatt ttcattctaa aattcacgtc tcggaacaga tatccaccat 240 tatcacattt aatacgaaac aaaaacctaa atttcattga tttgaactgt gcggatgatg 300 ataaaacttc acccttatat aatctcaaaa agcatagata tacagattgg gaaatactgt 360 tagcacccag agcagaattt ccattctaaa atagtctcag catagttctg gcgctcaact 420 tcaattgctt caattcgttt caatttttca tttttttttt ttcaaatgta ttattacttt 480 acttggtgtc tctagagtga tatattcata tttttttctt atttttttct ctccgtgcac 540 taaatacctt tacactattt gcaattttga aatttaattt tgaagtcctc caagacgcaa 600 cgtcgttttt tgtacttact tctaagttgg tagaattttc ctgcaaaatt tctttcattt 660 tcgagcatta aaataatcag tgataaatag tttattttta atttacctat aagattaaaa 720 caaaatggcg agtgctaagc ttacttatga tcaggcagca gagttattga aaggattata 780 ttatgagctc cgaattgaat ttttattcga agatgaactc gattttgagc tcgaatcccg 840 tggcgttatt ctacctagag ggtgtgacgt aacacgtaaa cgcaaaatgc tgcgcgatac 900 aatgaaagcg gaaaaggatg gtacggcaga aaaatggcag caaataataa aaggcgatac 960 ggaaagggag gtgaaaattt gcatgcaaaa attcggcgag cttcagactt tagttcggca 1020 gaacagtgaa gaggacacca aaaagatgtg tatgagtcga ttagtacacg ttggagtacg 1080 gataatgacc attctaacag gttcccgtag ttctcctaat ctttttggaa gtttacaaaa 1140 tgcccttgta gatttggtgg agtttatgga agagtctgcg atgattaact ctactatcga 1200 gactacggaa ataaatccac cggagaagtc accggaggcc gaagttgaaa acgttaacaa 1260 tataccgctt atagagcatt ttcaaacaga agcaacggag gatagtcgac aaccgatgtc 1320 aattttcacc caagaggacc ttgaatgggt tcattcatta caaaggcgaa tcgtcgatct 1380 agaatacgaa ctaaagcgtc agaaggaggt gcgtgaaata gccacgcaga ccgttgacga 1440 agaggatcca cttattcaat atccaaattc acaatttcaa acgatgaatt cttggagtgt 1500 gtcgtctttt ccatctcaaa atcttcctcc cagtatgacc accttccagc cccccaggca 1560 aaactcaacc tttacaattc aaaatcaccc aaatcaacct ggtttgagta cagctcaaac 1620 ccagcaagtc cttcgtaccg agattcccgc acaagcttcc atgttaaata ctcaagggtc 1680 ctttattccc tcctatccaa atatctctaa tcctcagcct tcttatcttc ctcacaatcc 1740 atctgcaaac acacaaccag taaatgggaa tgcagcggtt ccaacattct tgaacagacc 1800 acatcaagcc acgcaccatc tttcaaatca ttccttattg ccaacatacc agcctcgcca 1860 cacgttaccg gtttcaaaat ggaacataac caaatacagt ggcgacgatc aaggcttaca 1920 gctgaacgag ttcttggaac acgtccgtgc gctgtcgttc gccgagcacg tatctgaacg 1980 cgaattgttc gaatcagccg ttcacttgtt taccggctca gctctcaaat ggtatatgag 2040 ccaaagagcc acaaatagat tgttggactg gcagcaccta gtatttgagc taagaaggac 2100 atatatgcac ccggatttag acgcacttat caaaatgaaa atctaccagc gtagacaaca 2160 gaaacaggaa agcttccatg aatactattt cgaaattgaa aaactttttc gttcaatgag 2220 catgcagatt ccggactacg agaaagtcca gattttgcaa caaaatatgc gaatagatta 2280 taaacgccaa atgacgtttc ttccaatcac cgatcttgaa actttggtgg cggctggaca 2340 aaaacttgat gcactgaact tttcggccta taataaagtg tttgggtccg acaaaaccgt 2400 gcatgctgta gaagataatt tcaaacctaa gggaacaaac gggccaaacg caccacaacc 2460 cacaaggtcg cgtcaaccga atcctgtggt tccgcaccca tttcagaaag gatttaaccg 2520 aaataacaat tcaaataagc aaaatccaca aaatcaactc aattccactt ccaaccgttc 2580 cactgcaaat cctccgtcca cggaggcgca aaatgctggg gttagcacgt ccacttcttc 2640 ttcttcatca tcctcgcaga ctcttgcagg tccttcaaca taccccctta gaccacaatt 2700 gacattagaa gagttagtga acagacacac tccgcctcct aacaacacat gcttcaactg 2760 ccgaaggatt ggtcatgaag cacctatgtg tcgacaacaa agagcaattt tctgttcaaa 2820 ctgtgggctg catggtttcc ccaccaatta ctgtccgtat tgtgtaaaaa acggcagaag 2880 tgcgagcgaa aatcgccgtt cgcagtaacc tcaggcgtaa atacatcaat ccatcagcca 2940 cagcccccac tatactggga gtacgctgat ggaagtatat accaagaacc cactgaaatt 3000 tgcgaaatta catttcctta tgtcaatgat aaacgtcctt atgttgcgat taacctctac 3060 catcttacac tcaatgcctt gttggacagt gggagcaatt acacgttgat aagtggcaaa 3120 attttcaaca aattcagaaa aatcaaactc actcgcccca cgaaggacat tcaacttcga 3180 tcagcttgtg gaaatcgact gactgtcctt ggtcaagcgc aattcccttt taaattcaga 3240 aatcaagtga aattgatttc gaccatagtg gtcgaggatt tgtgcattga ttgtatatgt 3300 gggatggatt tttgggaaaa atttcatatt atacctactt tgctacaaaa ggatatagtt 3360 caaaaaaagg aaacatttgc cgtgttggat tcgaaacaga ctaaagagga cgctgagacc 3420 aatttgacag aggaacaaat ccacgagata gaaaagatca agcagttgtt cttaactgct 3480 caggcagaca acttaactct cacaaatcgg gcgatccacc aaatagtctt aaaggaagaa 3540 tggaaacaca aaccaccggt tcgccaattt ccttacgtta tgtcccccaa gacacaatca 3600 ctggtagctg tggaactgaa gcgattactg gatttgggaa taatcgaaag gagtaattcc 3660 gagtggtcat tgaattgtgt gccggttatt aaaccgaaca aggtacggct atgcctcgac 3720 gcacgcaaga tcaatgagag aacggtgcgg gatgcgtacc ctttgcccca tccgggtcgt 3780 atcttaggac agctccctaa agccagattc ctctcgacta tcgatctgtc cgaagccttt 3840 ctccaaattc ctttggaaaa agaatccagg aaattcacag cttttagtgt ccaaggaaaa 3900 ggtttatttc aatacactcg aatgccattc ggactgataa atagcccggc aactttagct 3960 aggctaatgg atcaggtcct tggtcatgga gaactagaac ctttcgtatt tgtatacctc 4020 gacgacatcg ttgtggtaac agactcgttc gaacatcacc tagagttgct ccaagaaatt 4080 gcgcagcgat taaagctagc taaccttagc atcaacctta gcaaatccaa atttggagtg 4140 cctgagatcc cgtttttagg atacctcctc agtgttgatg gtctccaaac aaatccggag 4200 aaaattagac ctatcgttga gtatgaacgt cctaccacgg tgaccaaact ccgaagattt 4260 atcggcatga taaattatta tcgaagattc attccggact tcagtggagt caccgcggca 4320 ctaacagacc tacttcaaac caagcataaa aatattcatt ggaatgaagc agcggaaaga 4380 gcattctgtc tgattaaaga aaagctcata agttctccga ttcttggaag cccagacttc 4440 aatcgggaat tcgtaattca aactgacgcc agtgacgtag ccgtggcggc tgttctgtcc 4500 caacgtcagg aagatggaga aaaagtaata tcatactact ctcacaagct taccactcca 4560 cagagaaact accacgcagc cgagaaggaa gctctagccg caattatggc tattgaggca 4620 ttccgtggct acgtggaagg gtaccatttt acattgatta ccgattcatc tgcgcttacc 4680 cacatcttaa gtaccaaatg gaaggtaggg tcacggtgca gtaggtgggc actaaattta 4740 caacaatttg acatgactgt cgtccaccgc aaaggaaaag acaatgttgt accagatgca 4800 ttgtctcgca gcatcgccac agtccaggag aataaaaccc tgtcttggta tacctctctc 4860 atgcagaaag tcaaagatcg gccagacgcc tttgtcgatt ttcgcatcga aaatggggtg 4920 ttgaaaaaat ttgtcacggc tcgtgagaaa gggcacgatt gccgattcga atggaaaatc 4980 atacctactc cagaatccat cgatgccata ctgcattcca gccacgatga ctgcttccac 5040 ctcggttttg agaagaccct caatcgaatc aaacagcgat attattggcc caaaatgtcg 5100 tcacagattc gtaaatacgt tcagcagtgc agcacatgca aagaagtaaa aggctcgtcc 5160 attcccgttg tacctgaaat gggcaaacag agattagcat cacaaccatg gaaaattatc 5220 tctgcagatt acgtaggtcc ccttccacgc agtcgtcgcg gtaatcagta tttgttggtg 5280 atagcggatt acttctctaa atggatacta ttgcaaccgg taaagagtat ccaaagtaac 5340 agcctgtgta cgattctgaa agagcaatgg ttcttgaaaa actcagttcc agagattatc 5400 attactgaca acgcaagttg ttttgtctct aaagaattca agcacctttt agatcaatac 5460 aaaatcgtac actggttgaa ctcgcgatac cattcccaag caaatccggt tgagagagta 5520 aaccgtacca ttaatgcagc tatccggaca tacgtgaaag aagaccaacg cttatgggac 5580 accagaacca ctgaaattga aatggtgttg aatacgagtg ttcactcatc tacaggctac 5640 acaccttact atatcaccca cgggcacgag ctatcggaat atggtcccga ttatcagcga 5700 gctcgacatc aggagcctct gaacgaagcg gaaagggttg aacgacgaaa gacacagttt 5760 tctcgtatct atgacatagt tcagagaaat cttgcaaaat cccacgatgc ttctcgccag 5820 cggtataatc tacggcataa acatttcgca aaggcgttcc aacccgggca gctggtatat 5880 cgtcgtaaca tgaagcaatc caacgcaggc gagtggtata gcgctaaata tggatcgcaa 5940 tatctgccgt gtaccgtaag ccggaaaatt ggtacatcct cgtatgaatt ggtttcatcg 6000 gatggcaaaa accttggaat ttggccagcc gcccacctca agccaggata accaaacttt 6060 ccaatccaaa tctccgccac gtgacggtag gagtcacctt gttcatgcca tcggcccgaa 6120 ttttgtttta ccctctgcaa cgaaattcta atgggatcga gccgaaactg catcgaatcc 6180 gccttcatgg tgctattact cctggaaata cataaaaaaa ccaacaacct aaatcgttag 6240 ctcatactca cccaaaatgc cacgattcac atcctgtcca ctgtctatca ttccgaaaaa 6300 tatcaaatct gcgtagtcat tgtcttcact gtcgtttatt aaccggagaa aaatgacaga 6360 acgaggaact agcacgcagc gaggtatgct catttgggct gcagtcagta cgcagccatg 6420 gttaccaaaa gtgttacacg gtaaatgcat actaggccac tctgcgttat ttcaaagtat 6480 ttcctcgtat agcgaggaag taaataagaa aggcgtatgt tgagtggtaa tgaaaatatg 6540 attatcatga aaacaaattc gatcatttta ggaaataaaa tgtaccgatg atatgttttc 6600 tagtgaatag attaggggcc ccgatcaaca atagttaagt taatgataag ttaggaatgt 6660 ttgtgaaaat atcaatggaa atgagtgaat agtacgactg atgaaatgaa cataaatgga 6720 catgagtgaa tggacacttt gtatgtgtga gcaagcgtgt atggatgttt gattgatcaa 6780 agcatgaatg atacaaatat gaatattttg cccaagaaga tgtcttgtgg tagtaaacag 6840 taaaaatgag gatgaacgaa aattgaatat gatctcctcg gattaagtgc gattttggta 6900 aggtgaattc cgcatgcctg caatatttcc taatctgtaa atctattctt cggttgagtt 6960 tacgactgag agggtgttta ttctgacgca aaagtttgga agtggacaga ctcacttcat 7020 tattgtcgta cgattctgtg taaaagagtt cctctgccaa gtatgaaaaa tcattgcatc 7080 gaatggaacg gataatgatg aagcagcagc cgcacattgc aaacgaatgg acaggctcac 7140 tccatcagtg ccatacaatt catgggaaag agctcctctg cttgcaatga aaaatcattt 7200 catgaattgt gactggtact gatggagcag tagtcgcatt caaaacagtg ttatctggca 7260 aaatttatat ttttttttat attatctggg ataatggtgg atgctttcct tttctctccc 7320 cggcgacggg aagat 7335 // ID BEL-17_CQ-LTR repbase; DNA; INV; 235 BP. XX AC AAWU01036025; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-17_CQ_; KW BEL-17_CQ-I; BEL-17_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-235 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 188-188 (2011). XX DR Genome; AAWU01036025; Positions 17061 17295. XX SQ Sequence 235 BP; 73 A; 45 C; 52 G; 65 T; 0 other; tgtttgtagt aacacgaatt gctgttgaat aacaataatt gtttttgatt aggaatgcaa 60 atcatctcgg cgcagcccta ccacaatgaa tttggtagcg gtgggctgcg tttaggtgtt 120 tccgccgtag gttaggacaa ttacgatagg tatgatcggg gaataagtac aaaaataaaa 180 tcacttggaa ttcgaccgca cacaggacta catctcttca aaatccatcc gaaca 235 // ID CR1_Ele9 repbase; DNA; INV; 4924 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele9. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4924 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4924 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >99% identity, and ~99% identical to the original CC sequence in [1]. This family is closely related to CR1-1_BM. XX FH Key Location/Qualifiers FT CDS 160..1824 FT /product="CR1_Ele9_1p" FT /translation="MKCGVVNCTQNDNRFLWRCQGNCKRTYHAACIGVQRH FT HEEILRTFMLPLCQDCQDKFIPEFNLNELSTSFKHISESIGHSLTANHKLA FT TNFNSLSAMHDETLAHFEQMLSEIKRSISIVNSSSKTSAAEIKHQLSALLD FT TPPTDPCAAVEKAVKSAATAAVEEIIISTSAAAHSIDQLPQLINDLVDKNA FT ILASDVKSHIEEMAKTFSATHTHTCEPIMANNLANELGDIIKPSPSWRMLG FT NKKVWKHDWTEYDAKQRSRRTQEKKEKKARRRRKQRQEIYARNQNHRNINN FT NNNNNNSNNYNYNYNNNNNGNNKNNNNRSISNINNNSNSRNNSTISNINDN FT NHNIDKSTTNNTEITSTHMNSVRGNSSLPLDKELLAAARVQFSQPPKKDNG FT PKFINFQKGETINPYRPEKPARLENPPLFDPMRPPIVRLTEESAAGDQRFL FT KARLRDPKIMDIVRTYLSFLHDQTPSTCIRGLTKTSASLTLAAEGLPADTN FT LLREIFLDVHEELGISRSDALEDLKSYRAFLSSERIQRLQLTRESANKFYS FT AGLQNFRK" FT CDS 1791..4844 FT /product="CR1_Ele9_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="ILLSRLTKFSKIKTHSLEADITDVSTTSTLHTLPDAH FT SNVSITNNPKISLSSQQNATEILVYCQNFNRMKSSAKMKEIHQKLLSSSFN FT IILGTETSWDESVRSEEIFGNNFNVFRHDRNLQHCNKKSGGGVLIAINADF FT TSEEIETFKHKEFENVWAKALINGETHIFSSVYFPPEHANKHSFELFFKIV FT ETIISNMEPEVKLHIYGDFNQRNADFITDNENESLLLPVVGEDSTLQFLFD FT KTSYFGLNQINHVKNQQNSYLDLLFTNCTEDFCVYTSLSPLWKNEVFHTAI FT EYSIFIHKTSLPSDWEYEEVPEYNKANYXEAKRRLCTINWQHIISIDGNVD FT TEVDKFHSIIQQIINETVPTRRKRRIQNNKLPVWFSPHLKNLKNKKQKAHK FT LYKKENTATNLQNYHDTCNQLNSAINSAHEEYNRKVESEVKSCPKNFFNYV FT NTQLKSSNFPSRMHFDGNVGENSSDICNLFADFFQEVYTTYSEEDRDRNFF FT SFFPEISNSISIEKLSQQEILSALKNLDATKGAGPDGIAPVFFKNLSEELT FT LPLERLFNMSLNNGKFPVAWKSSYLVPVFKSGTKSDIRNYRGIAIISCIPK FT LFESLVNDKIFQQVKNQITCRQHGFFKGRSTSSNLLEFVSFILNAMDNGKH FT VEALYTDFSKAFDRIDIPLLIFKLQKIGMEPGLLSWIQSYLSHRKQIVRFQ FT NKLSAPISVTSGVPQGSHLGPLLFILYVNDISFLLRKIKFLIYADDMKLFM FT EVSNSSEAEEFQNDINLFFTWCQKSLLQLNIKKCNSIAFSRKYVTPPIEVF FT LGNQVVQKCKIVRDLGVILDSKLTFVEHYNSIINKANSMLGFIKRFSNNFQ FT DPYTIKLLYTTYVRPILEYCHLVWNPYHVVHEERIESVQKQFLLYALRKLN FT WTVLPLPSYEARCMLINLETLKQRREFAMILFINNIISNRVDSAALLSQLN FT FYTPSRHLRTRKLFSEKSCRTHYAQNGPINRMMRQYNVHCEFIGITMSKEQ FT LKTQLKNNRSNP" XX SQ Sequence 4924 BP; 1745 A; 1087 C; 807 G; 1284 T; 1 other; tgcattacga acctccgttg tgttcggacg cgctgtctcg gcgctcttgt atatattttt 60 accttttaac ttttttttaa taaacatttt tatttgaccc gcgcgcgcac ctcgcgtaag 120 cttttaattt acattttttt ttatcgacgc gtgtgcaaaa tgaagtgtgg tgttgtgaac 180 tgtacacaaa acgacaatcg ttttctttgg cgctgccaag gcaactgtaa gcggacctat 240 cacgctgcct gcattggcgt acaacgacac cacgaagaaa ttttacgtac cttcatgctt 300 cctttgtgcc aagactgtca ggataaattc attcctgaat ttaatcttaa tgaactgtcc 360 acatctttca aacatatctc ggaaagtatt ggacattcat tgacagcaaa ccacaaatta 420 gcaactaatt ttaatagcct tagtgctatg cacgacgaaa cgctggcaca ttttgaacaa 480 atgctgagcg aaattaaacg gagcatttcc atcgtcaaca gtagcagcaa gacaagtgcc 540 gcagaaataa aacatcaact atcggcactt cttgacacac ctccaacaga cccatgcgct 600 gctgtagaaa aagcggtcaa atcagcagca accgcagcag tagaagaaat aataatatca 660 acatcagctg cagcgcactc cattgaccaa ctgccccaat taattaatga cttggtcgac 720 aaaaatgcca ttctagcttc cgatgtaaaa tcgcacatcg aagaaatggc aaaaacattt 780 tcggcgacac atacacatac atgtgaacca ataatggcaa acaacttagc caatgaattg 840 ggggacatca tcaaaccctc ccccagctgg cgcatgctag gcaataaaaa agtatggaag 900 catgactgga ctgaatatga cgctaaacag cgttcacgtc gtacacaaga aaagaaggaa 960 aagaaggcca gacgaaggag gaaacaacgc caggaaattt acgccaggaa tcaaaaccac 1020 cgaaacatca acaacaacaa taacaacaac aacagcaaca actacaacta caactacaac 1080 aacaacaaca atggcaataa caaaaacaac aacaacagaa gcatcagtaa catcaacaac 1140 aatagcaata gcagaaataa cagtaccatc agcaacatca acgacaacaa ccacaatatc 1200 gacaaaagta ctactaacaa cacagaaatc acttcaacgc acatgaattc tgtgcgtgga 1260 aattcgtcat taccattgga caaagaactt ttggcagcgg cacgtgtaca attttctcag 1320 ccgccaaaga aggacaatgg accgaaattc ataaatttcc aaaaagggga aaccatcaac 1380 ccttatcgcc ctgaaaaacc tgccaggctt gaaaaccctc cactttttga ccccatgcgt 1440 ccacctatag tcagactcac ggaagaatct gcagccgggg accaacgctt cctgaaagca 1500 cgtctacggg acccgaaaat catggacata gtgcggacct atctttcatt cttgcatgac 1560 cagacccctt caacatgtat tcgtggactt accaaaacta gcgcatcttt gacattagca 1620 gcagaaggac taccagcgga cactaatctt ttgcgcgaaa ttttcttgga cgttcacgaa 1680 gaactcggaa tttcgcgaag cgacgcctta gaagacttga agtcgtaccg ggcattctta 1740 tccagcgagc gcatacagcg actgcagctt accagggaga gtgctaataa attttactca 1800 gccggcttac aaaattttcg aaaataaaga cgcattcctt ggaagcagac atcactgacg 1860 tctccacaac atccacccta cacacgctac ctgacgcaca tagtaatgta agtattacaa 1920 ataatccaaa aatttctttg tcttctcaac agaatgcgac tgaaattctg gtatattgcc 1980 aaaatttcaa tcgcatgaaa agctcagcca aaatgaagga aattcatcaa aaacttttat 2040 cttcttcttt caatataatt cttggaaccg aaactagctg ggacgaaagc gtaagaagcg 2100 aagaaatttt tggaaacaat tttaatgtat ttcggcacga ccgaaatttg caacactgta 2160 ataagaagtc cggtggtgga gttctcatag ccattaatgc agactttact tctgaagaaa 2220 ttgaaacttt caaacacaaa gaattcgaaa atgtttgggc taaagctcta attaatggag 2280 aaacgcatat tttttcatcc gtgtattttc cacctgaaca cgctaacaaa cattcttttg 2340 aattattttt caaaattgtc gaaactatta tttcaaatat ggaacctgaa gtaaaactgc 2400 atatttatgg cgacttcaac caacgcaatg ccgacttcat tacagacaat gaaaatgaat 2460 cacttttact cccagtcgta ggcgaagaca gtactttaca atttttattc gacaaaacat 2520 catattttgg cctaaatcaa attaatcacg taaaaaacca gcaaaattct tacttggacc 2580 tcttatttac aaattgcact gaagatttct gtgtgtatac atcattatca cctctatgga 2640 aaaatgaagt atttcacaca gcaattgaat attcaatctt catacataaa acttctctcc 2700 ccagcgactg ggagtacgag gaagtgccgg aatacaataa agcaaattac aamgaagcca 2760 aacgtagact atgtacaatt aattggcaac atatcataag tattgatgga aatgtcgaca 2820 ccgaagtaga caagttccat tcaattattc aacaaataat taatgaaact gtgccaacaa 2880 gaagaaaaag acgaatacaa aataacaaat taccggtgtg gtttagccca catctgaaaa 2940 acctaaagaa caagaaacag aaagcgcata aattatacaa aaaagaaaat actgccacca 3000 acttacaaaa ttatcatgac acatgtaacc aactcaactc agccattaac tccgcacatg 3060 aagaatataa tcgtaaggtc gaatccgaag tcaaatcatg tcctaaaaac ttttttaatt 3120 atgtaaacac acaattaaaa agtagcaatt tcccttcgcg aatgcatttt gacggaaacg 3180 taggtgaaaa ctcttccgat atctgcaacc tctttgcaga tttctttcaa gaagtgtata 3240 ccacctactc tgaagaagac cgcgaccgca atttcttttc ttttttccca gaaatttcta 3300 atagtatatc tattgaaaaa ctatcacaac aagaaatctt atcggcacta aaaaacctgg 3360 acgcaactaa gggagcagga ccggatggca ttgcacccgt gttcttcaaa aatctctcag 3420 aagaacttac ccttccctta gaacgccttt tcaacatgtc tctaaataat ggaaaattcc 3480 cagtagcttg gaaatcctca tatttggtgc ctgttttcaa atcaggtaca aaatctgaca 3540 taaggaacta tcgcggaatt gccattatct catgtattcc caaactcttc gaatctctcg 3600 taaacgacaa aattttccag caagtaaaaa accaaattac ttgtaggcag catggctttt 3660 ttaaaggccg ctcaacatct tccaatctgc tagaatttgt atcttttatt ctcaacgcaa 3720 tggacaatgg caagcacgta gaagccctct acactgactt cagtaaggca tttgaccgaa 3780 tcgacatacc attattaatc ttcaaattgc agaaaatagg aatggagcca ggactcttga 3840 gttggatcca gtcatatcta tcacatagga aacaaattgt gcgctttcaa aataaattat 3900 cagcacccat ttctgtaacc tctggagtac ctcaaggttc tcatttaggc ccacttcttt 3960 ttattttgta tgtaaacgac atttcctttt tacttagaaa aataaaattt cttatatatg 4020 cagacgacat gaagctcttc atggaagtaa gtaattctag cgaagcagaa gaattccaga 4080 atgacattaa cttattcttc acttggtgcc aaaaaagttt acttcagctc aatatcaaaa 4140 aatgtaattc aatagcattt agtcgaaaat atgtaacacc accaattgaa gtatttttag 4200 gaaatcaagt agtgcagaag tgtaaaattg taagggattt gggcgtaatc ctagactcta 4260 aactcacttt tgtagaacat tataattcca taataaacaa agcaaacagc atgcttggct 4320 tcattaagag gttcagcaac aattttcaag acccatatac tattaaacta ttgtatacta 4380 catatgtaag accaattctg gaatactgtc acctagtatg gaacccgtat catgtcgtac 4440 acgaagaacg catagaatca gtacagaagc aattcctttt atacgccctt cggaaattga 4500 attggactgt gctgccactt ccgtcgtatg aagcacgctg catgctcatc aacttggaaa 4560 cacttaagca acgccgggaa ttcgcaatga ttctctttat aaacaacatt atttcgaacc 4620 gtgtagactc cgctgcactt ctttctcaac taaacttcta cacaccctct cggcatttaa 4680 gaacaagaaa attattttca gaaaaatcat gcagaacgca ttatgctcaa aacggaccaa 4740 taaaccgtat gatgcgtcag tataatgtac attgcgaatt cattggcatt actatgtcca 4800 aagagcagct taaaacacaa ctaaagaaca atagaagtaa tccatagcat gtaagaaagt 4860 attgtaatta ttgtatgtag tctacctatg cttgacgaaa taaataaata aataaataaa 4920 taaa 4924 // ID BEL-13_CQ-LTR repbase; DNA; INV; 673 BP. XX AC AAWU01008684; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-13_CQ_; KW BEL-13_CQ-I; BEL-13_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-673 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 180-180 (2011). XX DR GenBank; AAWU01008684; Positions 3009 2337. XX SQ Sequence 673 BP; 220 A; 127 C; 153 G; 173 T; 0 other; tgttccgacg cggcgcaaga aatgtcatca ccgaaaaccc cccggcgtga tcatatttgc 60 aacactgcgg cggccaacag cgctcaggcg gacagggaca gaaccagatc ccaaacagtc 120 ggtgacgtcg ttgctacgaa acgtcaaagc gaatgattgg taaaaagttt gttaaccaac 180 gaaaacgaaa gtggtgcgaa aagttcggaa agtggaattt gtgcgattaa gggagaaaaa 240 cgtctgcatt tagttgattt ttattctttt gtgcaactac actcgtgagt aggtgttgga 300 gagataactt atctaaaact aatcctaact taatctaatc aggaagatcg caggggaact 360 gctctcctca tctgctgctg taaaaagggg ccttaatttg taggttatct gtgcgtgagt 420 agaaagaaac ataaattagt tggaataatt aaattagcta aaaaatactt accttcgata 480 gttgcccgca cgtccggcga taagtgtcat tgggagttga attcggagtt aaactagggt 540 gaccgatgta agtaaaaatc tactgaaata ttgtatttag aaactgacat gaaattatag 600 cttttagcaa tacttacact gcaatcaact gtgtttaaat cgctgaaaag aagcccgaaa 660 actcgtccca aca 673 // ID Mariner-35_SM repbase; DNA; INV; 2233 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-35_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2233 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1884-1884 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 303..1745 FT /product="Mariner-35_SM_1p" FT /translation="MDLNKKKRKTYTKRIKREIAKAYKKDKKALCDIEKEF FT NVSQKVIRTAIKSSEELLNLESDDPTLDTFNLCKAAEIKLLNIDQYMLDGI FT RQIRNAGGNVLMGNLISLAKKYKSITPSCRVNITPYFIRCFKKRYNVAFKS FT LCGEAKSAMTDKILDFFDEFEAIKKYYDEKDIYNLDETGLYIKNFGRRSYI FT LRNEDDRKSIKTDKTRITLLLCFNKYGDEITPLIIGKSKNPRAFKHEKPDN FT YNLKYANNTSSWLTKDIFLKYINDLNDEMIIQDRKILILCDNFKGHEVGVF FT SNVKLLFLPPNTTSILQPLDLGIINDIKSKYSEYVNNFVTNMIIDNSADIK FT SCFSSLTLLEVCYWIRESINKVLPNTYLNCWEKFEKMKVKLQNTDLENEEN FT EVLFDTLIAPDCVVKDDPIYDESLFCLDEIICDISEASRIESVNYHLKKLD FT NLLHIEEIDVRKSYIEFKNKYYESRSRRSTVSYIID" XX SQ Sequence 2233 BP; 860 A; 272 C; 347 G; 754 T; 0 other; cagtagtgtg cgatatttag ggcacaaaaa tagactgtaa atgttaaatt tttaaggcat 60 ttatcttgta agaaaagaat ttcactatta aaattttgta tgatgtcaaa aagaataaaa 120 taaatttata tatcgtttct tttttatttt ttaaaagcta tattttcgtt caattcagca 180 atttataata aaattttagg gcatgttaaa aatttaaggc attattttac ttaaaattaa 240 acagagtata tatttaccta taaaaaatca atttttgtta aaaatttaag gcaatctata 300 ttatggactt aaataagaaa aaaagaaaaa cttacaccaa aagaataaaa agagaaattg 360 ccaaagcata taaaaaggat aaaaaggctt tgtgtgatat cgaaaaagaa tttaatgtat 420 cacaaaaggt cattagaaca gcaatcaaat cttcagagga gcttttaaac ctggaaagtg 480 atgatccaac attggatact tttaatttat gtaaagcagc agaaatcaag ctattgaaca 540 ttgatcaata catgcttgat ggcatcaggc agattcgaaa tgcaggtggc aatgttttaa 600 tggggaactt gatatcattg gcaaaaaaat ataaatcaat aacaccatct tgccgtgtga 660 acatcacacc atactttata agatgtttta aaaagcgata caatgtggca tttaaatccc 720 tttgtggaga ggcaaaatct gctatgacgg acaagattct agattttttt gatgagtttg 780 aagctataaa gaaatattat gatgaaaaag atatttacaa tttagatgag actggtctat 840 atatcaagaa ttttggtaga agaagttaca ttctgaggaa tgaagatgac agaaaaagta 900 taaaaaccga taaaacaaga attactcttt tattgtgttt taataaatac ggcgatgaaa 960 ttacgccatt aatcatagga aaaagcaaaa accccagggc ttttaaacac gaaaaaccag 1020 ataattacaa ccttaaatat gctaacaaca catcttcgtg gcttaccaaa gatatttttt 1080 tgaaatatat aaacgattta aatgacgaaa tgatcataca ggatagaaaa atcctaatat 1140 tatgcgataa ttttaaaggt cacgaggttg gtgtattttc gaacgtgaaa ctgttatttt 1200 tgccacctaa tacaacatcg atcttacagc ctctagatct aggtataatt aacgatatca 1260 agtctaagta ttctgaatat gtcaataact ttgttacaaa tatgataatt gataatagtg 1320 cggatataaa atcttgtttt agttccttga cattgctaga agtatgttac tggataagag 1380 agtctatcaa taaggttctt cctaatacgt atttgaattg ttgggaaaag tttgaaaaaa 1440 tgaaagtaaa gcttcagaat acagatttag aaaacgaaga aaatgaggtt ttgtttgata 1500 ctttaattgc tccagactgt gtggtgaaag atgatcctat ttatgatgaa tccttatttt 1560 gtctcgatga aatcatttgc gatatatcgg aagcatctag gattgaatct gtaaattatc 1620 atttaaaaaa actggataat ttgttgcata ttgaagaaat tgatgttaga aagtcgtaca 1680 ttgagtttaa aaataagtac tatgaaagta gatcaagaag atccacggtg tcttacatta 1740 tcgattaaag gccaattgta attttttata gccatatctt tgtaatatgt tacgagaatt 1800 ttatcataaa gaccaaattt attttttcct tataaataat tattttagta taggaattcg 1860 gtttaaagca tcttttaact ggatttaatt atattttttc gatcacaacc gtatattaaa 1920 accgcgcagt gctgtattca attttaaaga aaacaacagg tttttacagc aattatcaag 1980 taaaatatcc atgattcatt tttataatca attaataaat tttttaaata tgtaattaga 2040 atttattctg tgaattgtct tacgcaattc atgctattga aatagacctt ttgttgtaaa 2100 aaaaccatat acataaataa taataatttt aaataacgat tttaagtaat tttaaacgaa 2160 ataatttttt tcgaaattta gggcaaataa gatttttagg gcatgtttgt gccctaaatt 2220 tcacacacta ctg 2233 // ID Gypsy-598_AA-LTR repbase; DNA; INV; 2157 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-598_AA_; KW Ty3_gypsy_Ele194; Gypsy-598_AA-I; Gypsy-598_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2157 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 2157 BP; 580 A; 484 C; 547 G; 528 T; 18 other; tgtaacgaac tttcagttcg atacattttg tgtaaatatg tagatataaa tttagcttaa 60 gtwaatgaat tawtaaaatg aaattgaatt tatattaaag tttccccgtc acatacaccg 120 cagtttgaat tccctataaa agctctttta gctataaatc caggcgcgct cacacagaaa 180 cgagcaagcg aattctccaa gaataagcgg atgagtacca tagaagattg ggaccatgtg 240 agaagcatag gcacgcgcat atgtccctaa caatgtagcg tcggatggtg gccatcgacc 300 tctgggaaac ccctttcagg atwtattgac cctttggctc gtaattgaga gaatttcggt 360 taggctcggt tggtgacatt cgttgcaacc cgaatgatct ttctctttgc cacaatagac 420 tgttagtcgg ggcaaaggca atttagagac tcgggaggtc aawtccggta gcgaagackt 480 tcagaatwat cgcaccttat ctgagggttt ctggctgtta cwcgttcact tcgwtctgta 540 cagtggaaga ggcaggagta gtccagtgtt agttscgcgt cgtgaggtga ttttgcaggt 600 ggaaaatcaa gtgacaagat atctccttga aattactaat ttaagttcgt ctctctctta 660 gtgaaatagt tgttaggaaa tcgtcacgca aaacggcgat tgcgtgcgta gtaggaagtg 720 tgcgagtgag gataaagaat tcagmaggta aatcacgtgc ttctttggtg tgataatcca 780 agcttttaac gcgtcggaaa ttggcattta tttcaattag atttgtggcg agcgtgccaa 840 gcccgccacc tcacaatagt tcaggaaact attcctgacc ccactctgca gcttcgaaga 900 agcccggtgg tacacgtggc cggaaggggg ccgcgtccgt caatttcgcg ccccggacag 960 ggccaaggca gcagcatatc ggttttcawc aagagtcagt ggagccagca accagcgttt 1020 cccgtttcgg tgacgaccgt aagtacaacg gcagtacccc gttgaagccc acaaatcatc 1080 ggtgaggcgc cgtcggcgca tgaagcagtg aggcgtagaa gtcggaagga gcgcgcatcg 1140 cgcgcgcttt gaaggagccg ggcatcgcag agatgtgccg tgactaagcg ttaacccacc 1200 aaaccacccg ttgaacgagc gatcagtgtc aaccgatgcc gtaaccgtcg acacggatca 1260 agccacccac cgatcagcag cgccccaatc acaccaacaa gaccccagcg tcgtcgtccg 1320 gcgaccaccc actttcgtcc tgctgatatg gtaccgtaag taaatgcatg caaatcagtg 1380 aaagtcccat gcgcatggtg tagagatttt tttgcccgct cccgattgat ggttggagac 1440 gcgtggcggc ggctaagctt ttttgtcgga cactagccgt tcccccaatg tgaatgccca 1500 caccgcacaa aagcacgtga agagatagta gaawkagagg tagawgawga aatggaaatg 1560 aacttgtaaa tagctagagt aaggaaggaa gagagaatgt cggtaaagta ggcctamgaa 1620 ctaaatatag aaatgaactc gaataatgaa gttgcgttac gtttkgaata aatkcattag 1680 agtaccacag acaaaccgac ctgtgttgaa gttattagcg aacgagaaga ggtaatgtag 1740 gggaggtttc atgtcactcg ggttcgttta gtagttcctt gttaatttac tttattttac 1800 tggggaggtt ggctccgaat ccaaccaagg aacactctct agtctttcgg gttcagtttg 1860 aggtttacgt gtacagccag ccagtgatga gaataggtgc gaacgtagtc tggactatcg 1920 gctaggtttt cttttctcgg actaacagtc tagggtaatc taaagaggag tcttcttgtc 1980 gcaacaacgt agccaaccgt gcttcgtgct ttccttccac tattggaccc caaaaccctt 2040 tcccttgaga ggtcttaggc cgggcaaccc tgaaaacagg tggatggtct ccaaccggag 2100 tggcgctcaa gccatccgct aatcttaccg gaaagcacgt tcggacgtcc ggttaca 2157 // ID L1-3_HM repbase; DNA; INV; 5234 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5234 RA Bao W. and Jurka J.; RT "L1-like retrotransposon from Hydra magnipapillata."; RL Repbase Reports 8(12), 2072-2072 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(18..305,301..1002,1019..4588,4543..5043) FT /product="L1-3_HM_1p" FT /translation="MASYAGTVSNKINRMECVLDLNVFMNQSEIFNRKYQE FT NRAHEIYQKVSSIVGNNKISSLTYNAKGNWVIEMKQKEDAIILNDSTFLLS FT GSNQNIMFCSKIRSDIGLLITVKCDPLIDDHEILAEMKPFISELYSITHTT FT YRFDRNLKDGRRLFRIKPSVKLNEIPHTIEIEGIKIALHFAGKSFLCKTCQ FT YSHPPQQRCIAQKPIEKATEIVLEKLNRKNRDESVDSITIQNASESKHKHV FT SKPSDSKIPLLLPIEPVFEIVKSKKHKRDERKQNQTQNEQNELTENETETD FT IEDEEIQSKSDKKINAKNASPTTPQKEGPRRKPRQRAVFYLYDYICLFIYF FT LFHEKCHCILFHLPFYFYTFPYSFLPHPSLSYHSIVKNRIQTKMEAIKIFS FT VNICGLADVKKQSRIIQKFRKMNFDFFLLQETHLNSQAAKKFINLWNNPGY FT YSNGTSHTGGTLILIKNLNYNLKVIEHFEDPNGHYNYIIVENDYHIFLIIN FT VYATSGSSYTDQIKRKKLFKNISKKIKIKNSNNQFTILGGDFNMVLDDKDR FT YPQINRRKCLSTCNLKTLLTTLNIEDTWRILNLNALEYTYKSTNNISFSRL FT DRLYISKHMRQHXFIEHEPMTFSDHQNAICATISINNIDIGKDVWVLNNKN FT LSKYNYTDNVSKIIKIGCQKIYYSENKTSDWDLLKQEIKLFSQKFSXSEAK FT KTRVEEYRIKKQLKNSLKKSHYNPRMKYLSEILKVKLQSFEINKIKGAEIR FT AKVKWRTDGXKSSKFFFQLEKAKTSKQIITLIKDKNGNEKTNTREITKCFE FT QFYKKLYKNTNNDVTLQKFLLNNTDIKKITKQQQRSLDEPIMLYEVENAIK FT TMKNNKSPGNDGLTAEFYKSNFNSLGESLVKILNEIIKKNEMSITQKQAII FT TCLFKKGDKSDISNWRPISLLNTDYKILTKIISNRLTKILPTIINENQTSS FT IPHRTVYQNLAYTRDIIYIANKKNLDASIISIDQIKAFDRVDRNFLYDCLS FT KFGFSVYFINLIKTLYYDISSNLKINGFLSDKINIYRGVRQGCPLSMILYI FT IQAEVFAGYIRKNPDIKGITLDNKETKLQQYADDTQFYLSSDNSIKALGNA FT INIYKLATGTKLNTAKCQGLWLGKNRSIIKENFLNFSWHDTTLRSLGIIFS FT NCSQHTNHQWDEGIAKINKKIENWKRFHLSIKGKAIIINQILLSSLWHLAF FT TLPIPGDQKIKQIEQEIERFLWNNSTIKTPKQISKLPVDKGGLNIIDVKEK FT LQSIQLSWISKXFNENEKGAWKDSAFYIFNNYRDANQGNLIFFTTHSTESL FT KTLPLYYHQTIKKWSRLYRKTNNHTNLTIEEILKQPIFYNPYIRHRNKTLK FT PTQESKNNDVIKLGDIAKVFIPGFLVNQLIGIKQLLLTKIINSLPPQWKQL FT INTQSQMFDHNKSLTYIHLKNKPPLCIIDLNSKTIYTISKNYKTTNIPEKY FT FRWQNHFKSMKNTAAKEWSQLFCSIHKNNTNNKADEIRYKLIHFALPTNLK FT IEDTTYTLCFTYQLKNRGYIKNDLCPQCKKKKEDLPHMMYKCKKVQPLIKY FT TFALIRKIYPNHPPLKNIFKSFLFALSPSVNSFYIGKIILNELLYKLYCNR FT MRSFHXKSMFARVTLLEEFQNKIKKILFQEFEISKLTNKKDFFLLELAQIW FT SNENKINIKMNNNLFLN*" XX SQ Sequence 5234 BP; 2167 A; 963 C; 671 G; 1426 T; 7 other; gtgttaataa gtttgatatg gctagttacg ctggaacggt gtcgaataaa ataaatcgaa 60 tggaatgcgt actcgattta aatgtcttca tgaatcaatc tgaaattttt aatagaaaat 120 atcaagaaaa cagagcgcat gaaatttatc aaaaggtttc atctattgtg ggtaataata 180 aaattagtag cttgacctac aatgcaaaag gaaattgggt aattgagatg aaacaaaaag 240 aagatgcaat tatacttaac gattcaactt ttctattatc tggtagcaac caaaacataa 300 tgttctaaaa tacgaagcga cattggtctc cttataaccg taaaatgtga ccctcttatt 360 gatgaccatg aaattctagc agagatgaaa ccctttatat ctgagttata ttcaataact 420 catacaacat atcgattcga cagaaacctg aaagacggtc gaaggctctt tagaattaag 480 ccaagcgtca aactaaacga aattcctcac acgattgaaa tcgaaggtat aaaaattgct 540 ttacattttg caggaaaatc cttcttatgt aaaacatgcc agtattctca cccacctcaa 600 caacgatgca ttgcccaaaa accaattgaa aaagccactg aaattgtttt agaaaaactg 660 aataggaaaa acagagacga atcagttgac tctataacta tccaaaacgc gtccgaaagt 720 aaacacaaac acgtttcaaa accatccgat agtaaaatac cattgcttct tccaatcgaa 780 ccagtttttg agattgttaa aagcaaaaag cataaaagag atgaacgaaa acaaaatcaa 840 acgcaaaatg aacaaaatga attaaccgaa aacgaaactg aaactgatat agaagatgaa 900 gaaatacaat cgaaaagcga taaaaaaatc aatgcaaaaa atgcttcacc cacgactcct 960 caaaaggaag gcccgagaag aaaaccgagg caaagggcgg tatgaataaa aaatttaatt 1020 ttatctctat gactatatat gtttatttat ttatttttta tttcacgaaa aatgtcattg 1080 tatactattc catctgccat tttattttta cacattccct tactcttttt tgccacatcc 1140 gtcattatcg taccactcaa tcgttaaaaa ccgcattcaa acaaaaatgg aagcaatcaa 1200 aattttttca gtcaatattt gcggtcttgc tgacgtaaaa aaacaaagca gaattattca 1260 aaaatttaga aaaatgaact ttgatttttt tcttttacaa gaaacgcatc ttaactctca 1320 agccgcaaaa aaatttataa atctttggaa taatccaggc tattactcaa acggtacatc 1380 tcacacagga ggaactttaa tattaatcaa aaatctaaat tataatttaa aagtaattga 1440 acatttcgaa gaccctaatg gtcattataa ttatattata gtggaaaacg attaccacat 1500 ttttttaata ataaacgttt acgcaacatc tggttcgagc tacactgatc aaatcaaaag 1560 aaaaaagtta tttaaaaaca tatctaaaaa aataaaaatt aaaaactcaa ataatcagtt 1620 tactatactc gggggtgatt tcaacatggt tttagacgat aaagatcgct acccgcaaat 1680 caacagaaga aaatgcctgt caacatgtaa tttaaaaact ttattaacta cacttaatat 1740 cgaagatacc tggcgcattt taaacctaaa tgccctagaa tacacataca aatcaacaaa 1800 caatatatct ttttctcgcc ttgatagact atatataagc aaacacatgc gccaacataw 1860 attcattgaa catgaaccaa tgactttttc agaccatcaa aatgctatat gcgccaccat 1920 ttcgataaac aacatagaca ttggaaaaga tgtttgggtt ctaaacaata aaaacctaag 1980 caaatataat tacactgaca acgtctcaaa aattattaaa ataggatgcc aaaaaattta 2040 ctattctgaa aacaaaacat ctgactggga tcttctaaaa caagaaataa aacttttttc 2100 acaaaaatty agtwaatctg aagctaaaaa aacacgagtt gaggaataca gaatcaaaaa 2160 acaactgaaa aactcgctta aaaagtccca ctataaccct agaatgaaat acctcagcga 2220 aattctgaag gtaaaacttc aaagctttga aataaataaa ataaaaggcg cagaaattag 2280 agcaaaagta aaatggagaa cggatggtra aaaatccagc aaattctttt ttcagctcga 2340 aaaagcaaaa acatctaaac aaatcatcac tctaataaaa gacaaaaacg gaaatgaaaa 2400 aacaaacact cgagagatta ctaaatgttt tgaacaattt tataaaaaac tgtataaaaa 2460 cactaataat gacgtcacac tacaaaagtt tttactcaat aacacagaca taaaaaaaat 2520 caccaagcaa caacaacgct cattagatga accgattatg ctttatgaag tcgaaaatgc 2580 aataaaaact atgaaaaata ataaatcacc tggaaatgat ggcttaacgg ccgaattcta 2640 caaatctaat tttaacagtt taggtgaaag tttagttaaa attctaaacg aaataataaa 2700 aaaaaatgag atgtctatta cacaaaaaca agccataata acatgccttt ttaaaaaagg 2760 tgataaaagt gacatttcaa actggcgacc aatatctcta ttaaacacgg actacaaaat 2820 tcttaccaaa ataatatcaa accgactcac aaaaattctc ccaaccatca taaacgaaaa 2880 ccaaacatca agtatccccc atagaacagt ttaccaaaac ctggcatata ccagagacat 2940 aatatatatt gcaaataaaa aaaatttaga tgcctctatt atatcaatcg accaaatcaa 3000 agccttcgac cgtgtcgaca gaaatttttt atatgactgt ttatctaaat ttggttttag 3060 tgtttacttt attaatctta ttaaaacttt atactatgac attagctcca accttaaaat 3120 aaacggcttt ttatccgata aaataaatat atacagagga gtacgacaag gatgccccct 3180 ctcaatgata ctgtacataa tccaagctga agtctttgca ggatacatta gaaaaaatcc 3240 cgatataaaa ggaataaccy tagacaacaa agaaactaaa ctacaacaat acgcagatga 3300 cacacaattt tatttatcat ctgacaactc cataaaagcc ctaggcaatg ccataaatat 3360 ttataaatta gccacaggta ctaaattgaa cacagccaaa tgccaagggt tgtggcttgg 3420 aaagaacaga tcgataataa aagaaaactt tttaaacttt tcgtggcacg acaccaccct 3480 tcggagttta ggtattattt tctcaaattg tagtcaacat accaaccatc aatgggacga 3540 gggtatcgct aaaatcaaca aaaaaataga aaactggaaa agatttcatt tatcgataaa 3600 gggaaaagct attattatta accaaatact cttaagcagc ttatggcacc ttgccttcac 3660 gcttcctatt ccaggagacc aaaaaattaa acaaatagaa caagaaattg aaagatttct 3720 ttggaataat agcactataa aaactccaaa acaaatatct aaactccctg ttgataaggg 3780 aggactaaat atcatagacg ttaaagaaaa gcttcaatct atacaactaa gttggatttc 3840 aaaawatttt aatgaaaatg aaaagggagc gtggaaagat tcagcttttt acatattcaa 3900 caattacaga gacgcaaacc aaggaaacct tatctttttc accacacact caacagaatc 3960 cttaaaaact ctccctcttt actatcacca aacaataaaa aaatggagtc gcttatacag 4020 aaaaacaaat aatcacacaa atctcactat cgaagaaata ctaaaacaac ctatatttta 4080 taacccatac atacgccaca gaaataaaac tctcaaacca actcaagaat ccaaaaacaa 4140 cgacgtaata aagctaggcg atattgcaaa agtatttatc cctggatttc tcgtaaatca 4200 gctaatcggt ataaaacaac tcctcctaac aaaaattata aattccttac cgcctcaatg 4260 gaaacagctc ataaatactc aaagtcaaat gttcgatcac aacaaatcac tgacatatat 4320 tcacctaaag aataaacctc ccctttgtat tattgacctc aattcaaaaa ctatatacac 4380 catctcaaag aactacaaaa ccacaaatat acccgaaaaa tattttaggt ggcaaaatca 4440 tttcaaaagc atgaaaaaca ccgcagcaaa agaatggtct caattattct gctcaatcca 4500 caaaaacaac accaacaata aagcagatga aataagatat aaacttatac actttgcttt 4560 acctaccaac ttaaaaatag aggatacata aaaaatgatc tctgccccca atgtaagaag 4620 aaaaaagagg atctacccca catgatgtat aaatgcaaaa aagtacagcc actaataaaa 4680 tacacatttg ctcttatccg aaaaatctac ccaaatcatc caccccttaa aaatatcttt 4740 aaatcttttc tttttgcttt atcaccttct gtaaatagtt tttatattgg taaaataata 4800 ttgaatgagc ttttatataa actatattgt aatagaatga ggtcattcca craaaagtcc 4860 atgtttgcga gagtgactct cttagaagaa ttccaaaaca aaatcaaaaa gattttgttt 4920 caagaattcg aaatttcgaa gctaacaaat aaaaaggatt tttttctatt agaacttgca 4980 caaatctgga gcaacgaaaa taaaattaat ataaaaatga acaacaatct ttttttaaac 5040 tgaaacgccc agattatttt tttcgaattc tttttttact ttttcgtttt tttcttcgtt 5100 tttttttttc tcgtgtagtc tctagagatt atgtcttttt ttgacctagt agtggaacag 5160 tagaccacct tgtaaatata actactgtat tctttattaa aactctgcac ggcaaatgcc 5220 tgaggagaat aaac 5234 // ID DNAX-3_Tad repbase; DNA; INV; 279 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 3) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-3_Tad. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-279 RA Jurka J.; RT "DNA transposons from Trichoplax adhaerens."; RL Repbase Reports 9(10), 2145-2145 (2009). XX DR [1] (Consensus) XX SQ Sequence 279 BP; 92 A; 53 C; 51 G; 83 T; 0 other; ggatcaacag taatatccga ctcctacagt tttatagctt gaagtcaaga ttgcaacaaa 60 gcagtaggca atgtgccttt gcgatccagc tcttactgat gtctgctcta taactacagt 120 aggtagaaaa gtgaccacta ataagctttg gacaaccaat tttcattgac tttccattat 180 gattgaatgg cgacgaattg atactctcat agtatcaaca gtacgaaaat actataaaag 240 tctatcaaaa ttgtaggagt cggatattac tgttgatcc 279 // ID R4-1_TCa repbase; DNA; INV; 1721 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.04, Created) DT 05-APR-2009 (Rel. 14.04, Last updated, Version 3) XX DE R4-type retrotransposon: consensus. XX KW R4; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW R4-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1721 RA Jurka J.; RT "Non-LTR retrotransposons from Tribolium castaneum."; RL Repbase Reports 9(4), 740-740 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 415..1608 FT /product="R4-1_TCa_1p" FT /translation="MKAINTFAVPVLVYSFGIIKWTNTDINNINIKIRTLF FT TRFRKHHPKSAIERFSLSREYGGRGFLNLQNLHTRQISSLKKYFLNRSNKN FT ILFKAICLSDHNFTPLNLSNPNINEIISDENSIIETLKEKPLHGRFFKELD FT NSNYIDKKASHLWLTKTNIFSETEGFIFAIQDQVINTKNYQKFIXKKNNDV FT CRLCGTHSETIQHLISSCQVLVASDYKVRHDNVAKIIHSELAFNSGLIQTK FT SPYYKYEPDSFIENDNFKLYWDRTVLTNLTITANRPDIIFISKLERHAFII FT DIAVPNTHNIQETYNTKISKYMILASEIKNIWNLEKIKIVPIIISAVGVVP FT KNLKQNLELLKLSSNLIVEIQKSVLLKTCNIVRNFQHHYLIAISVMVISCL FT TXV*" XX SQ Sequence 1721 BP; 681 A; 226 C; 229 G; 582 T; 3 other; agctataaat ccactttctg atatttcgaa ttcatctaat tatggtttca gtttaaaaaa 60 taataaaaca gttgttgcca aattgaatca tctaatgtat atggatgata ttaaattata 120 tgctgcaacc gagaaacaat tgaatttatt acttaaacta acagaaaatt tttctttaga 180 tattggtatg tcatttggag tagataaatg caaaattcta gcaattacga aggaaatttc 240 aaaaaattaa attttatctg aggtcaagaa attgaacaat aatggtggta taaatactta 300 ggattcaaac aatcaaatag aatatcacat agtgatacaa aagawaaact agttgatagt 360 tttatcacga gatgtagaca attagttaaa acacatttaa attcaaagaa tcttatgaag 420 gcaataaata catttgctgt acctgtttta gtttattcat ttgggattat taaatggaca 480 aatactgata ttaataatat taatattaaa atcagaacac tatttactcg ttttaggaaa 540 caccatccaa aatcagctat tgaaagattt tctttatcta gagaatatgg aggtcgaggc 600 tttctgaatc tacaaaattt gcatacaagg caaatttctt ctcttaaaaa atacttttta 660 aatcgtagta acaaaaatat tctattcaaa gccatttgct taagtgacca taattttact 720 ccattaaatt taagtaatcc caatataaat gaaataattt ctgatgaaaa ttcaattatt 780 gaaactttaa aagaaaaacc tttacatggt agatttttca aagagttaga taacagtaat 840 tatatagaca aaaaagcttc acacttatgg ttaacaaaaa ctaatatttt ctcagaaact 900 gaaggtttta tttttgcaat tcaagatcaa gtaattaata caaaaaatta ccaaaaattt 960 attawaaaaa aaaataatga tgtttgcaga ttgtgtggaa ctcattctga aacaatacaa 1020 catctaattt cttcttgtca ggttctagtt gcatcagatt ataaagtaag acatgataac 1080 gtagctaaaa taattcattc tgagttagct tttaattcgg gtttaattca aaccaaatct 1140 ccttattata aatatgagcc tgactcattt atagaaaatg acaattttaa actctattgg 1200 gataggacag ttttaactaa tctaactatt actgctaata gaccagatat tatctttatt 1260 tctaaattag aaagacatgc atttatcata gatattgctg ttccaaatac acacaacata 1320 caagaaacgt acaatacaaa aatatctaaa tacatgattt tagctagtga gataaagaat 1380 atttggaatc ttgaaaaaat aaaaattgtg ccaataatta tttctgcagt aggagtggtg 1440 ccaaaaaatt taaagcaaaa tcttgaactt ttaaaattaa gttcaaattt aattgttgaa 1500 attcaaaaat cagttctttt aaaaacgtgt aatattgtta gaaattttca acaccattac 1560 ttgatagcaa tatccgtaat ggtgatttct tgcctgacta rggtttaaac ccccgggtca 1620 gtggagtgta aaagttgcca gataattatg atcaaagtaa ttgtctgtca agtttttctc 1680 atacttttgc cgggtcgaaa cgataataat aataataata a 1721 // ID Gypsy-2_CQ-LTR repbase; DNA; INV; 289 BP. XX AC AAWU01034370; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_CQ_; KW Gypsy-2_CQ-I; Gypsy-2_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 384-384 (2011). XX DR Genome; AAWU01034370; Positions 1475 1187. XX SQ Sequence 289 BP; 58 A; 82 C; 65 G; 84 T; 0 other; tgttgtggac agaaccagtg cgcggagtgc ccccactttc cgcacgtcgc tggctcacaa 60 gtggctgcca gtctgccacc ttgtgacgca agacagcgcg tcaattttta tgctagctgt 120 caagattatt tgttgtgccc tttcactcta tttccggatc ttaacaagaa tacacgcgtt 180 tttacatttt gttataaatt accgtgtttt atttgcgtat cgtgcgctga ttccgtggcc 240 ctgccatagg ttcccacgcg acatcacgtc ccgcgcaccg gacacgaca 289 // ID Gypsy4-LTR_Dya repbase; DNA; INV; 364 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_Dya; KW Gypsy4-I_Dya; Gypsy4-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-364 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1049-1049 (2009). XX DR Genome; chrU; Positions 13091921 13091558. XX SQ Sequence 364 BP; 96 A; 92 C; 101 G; 75 T; 0 other; tgaacaagcg tctggccttt ccggctggga cagagcaagg gacaagaggt tgtggctagt 60 gaggcgtatg cctttgagag cccgccaatt gttcacccta gtctgagaac ggtgacgctt 120 ccctaagccc tcgaccttcc ttctcgcgcg tcagcggcaa caccaggcga aggtccgacg 180 ctacgacgag agagcagagg cgtccggaca attaaagtgc attgtaaact gagctcaaat 240 aaagacccga gaacgaaact gcgagtgtta ttactggtaa ccgggtgatc acgttattat 300 ataaatttgg tgggacgaac gcacaaactc tactgagcta gccgcacgta ttccgtagcg 360 agca 364 // ID Gypsy-3_AA-I repbase; DNA; INV; 7315 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_AA_; KW Gypsy-3_AA-LTR; Gypsy-3_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7315 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 975-975 (2011). XX DR [2] (Consensus) XX CC Positions [3578-4081] - Reverse transcriptase CC Positions [5129-5605] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1467..2729 FT /product="Gypsy-3_AA-I_2p" FT /translation="MAGPGDNWKEEINRYIQQALTEQMGALMDKLGPMLAN FT QRTSTPAAPPSPPLPPPPPLTPPQVTASAQQYHPAPPISSYPVGEPSREQG FT QLGNMQPNYRQRFYPNSIASHVSMAGEIEPTRGSQPSHDSRYRWQVPVSKW FT RINFSGDTRGPTVTQFLNKVEILARNNRISDQELLSQANFFFRENSEAEEW FT YFMFCSKFTTWAGFKHLLRLRFEQPNKDMVIERQILDRRQQANETFNAFLS FT AIEKLAQQLTKPLSEERKLNILTENMRDCYRPFLTIYRIERIDELTTICHS FT LDKSMYRTYSTHTRNRPPQVYNVEDTEQEWEEEVDQEEELNAIGQVIRKNR FT ITEKSKDVPNTLPKTSPNSENNVLCWNCRQYGHFWRNCDKAKKIFCHFCGQ FT TNYIAANCPNNHFFPNSQQGNENSGRL" FT CDS 2774..5980 FT /product="Gypsy-3_AA-I_1p" FT /translation="MAEEHKQNEPPYSKYATTCYINTKPNRCPYIKVEILG FT SPIVALLDSGASVSILSSRDVVERHGFKIKPINLMIKTADETAHNCEGVVQ FT IPYTYNGTTNVVPTLLVPKISKKLILGMDFWEAFKIAPALVEGNAIQPIIG FT LNETTINLMENFYADVEESIVFTVEPQGQPVVREEEVVEDTSLDLPFVEDP FT ENKNVKDITTEHDLSVEDRQELNRIIDLFKKSNGGKLGRTNLIRHKIELVE FT GAQPKRPPQYRCSPHIQKEVDKEVARMLELDVIEESTSDWCNPLLPVKKAS FT GEWRICLDCRRINDVTKNEAYPYPDMLGILGRIERAKFFTVIDLSKAYWQI FT PLEESSRDYTSFRAGKQLFRFKVMPFGLKGAPTTQTKLMNKVLGIDLEPYV FT YVYLDDIIITSNSLKEHFRLLRKVAERLEAANLTISLEKSKFCQKQICYLG FT YTLSDQGLAVDSTKIQPILDYPTPRTPKEVRRFIGMVSFYKQFIDRFSDLT FT TPITDLLKGTKGKISWSKEADSAFLKIKSVLTSPKVLANPDFRLPFIIESD FT ASDVAVGAVLVQIFEGVRRPIAFFSKKLSATQRKYAPTEKECLGVILAIQK FT FRHYVEGSRFTVVTDAQSLIWLRKISAEGGSAKLVRWALKLQQYDFELLYR FT KGSLNITADALSRGVLTVDLVDPEYEELKKKIISNNVKYKDFRVVQDKIYK FT IVKSRFNDPRFEWKYVPTNRERIKLIKEIHDSMHFGRSKTLKKLQERYYWP FT LMENEVRKYCQGCDTCKRTKYPNTNQKPLMGRQKVASMPWQTISVDFVGPF FT PRSKSGNSVLLVVTDLFSKFVIIQPLREAKTKPLVSFLENMVFLLFGVPEI FT LISDNGVQFKSKEFEKFLNNYHVTHWKNANYHPANNPTERVNRVIGAAIRT FT YLKDDHKEWDRDIQKVAMAIRTSVHESTSFTPYFVNYGRNFISSGAEYKRI FT RDTNTDISYEPVEINENLKTIFDTVKINLKKAYERYAKHYNLRSNQTLPSY FT EIGEVVLKKNFFQSSKSKQFSAKLANPFSPAKVVGKIGSSCYDLEDLQGNR FT LGVYHASHLQKK" XX SQ Sequence 7315 BP; 2410 A; 1388 C; 1525 G; 1992 T; 0 other; attacaagtc aatcgtatca gttgggagtg agaggagttt caattcggtt tctttatttc 60 gtcggattag aataggctct ggaaagggaa tccttaatgg taacctgccc accctgtaac 120 ttgtatatat tttctgtaaa ctattctttt ggtttggttt gagtgtgctt aggtttcgat 180 agttgggtta aggcttgaag aaaggactcg ccctgaggat tcctggccgg gtacgggatt 240 ttcctgaaga gattttggac acgcatcttc gcggcggttt cctagccgcc tggcgcttaa 300 gtgaaggttg tttttgccga accttatctt cccttcccag tcccgttaca tttggcgccc 360 aaccaacgga ttggtttaac gaagttctgt aaatagtaag ataatttgag taataataac 420 tagttgaatt aatgattttt agttggattt ggtttagtgc ttaagtgatg aattagtggt 480 gaacacaatt atagataaat aaggtaattt tcggattttg ttttgaattg cttttcttag 540 tgtacataat tcagctttat cctttattta aattatccat ttgatggttt ttcagatatt 600 cctagtctaa aattaatttt tcattttttt tttctatatt ttcctcatta ttcatgtcgt 660 tgtggaactt ttaagaatct gcaaagagag tgaaaaaaaa aagattagca aaatgcagtc 720 gcaagaattc gatacactag attactatat aaatcccaac gacctattaa acgatgaagt 780 tgactatgaa ttgaaaatca gatgcctatc agtagaaggg tcgcaagaaa cgaaaaggca 840 aaggttaaga catgctcttt tggaagaagt tcgaaaacca aaaattttac gtacaaaatt 900 gtcgattata caagagtttg aagcgatcaa ggacaaggtt caagagatca aacatgctat 960 tcaattttat ccggagccaa agtttttttc tcgcctaaga cactgtagat tgagagtgat 1020 gcgagcaaat gcagtaacta gagagcaaaa aaggctgaag gaagattttc tttcagagat 1080 tgaaaatttg cttgaccgtt acggcaaccc aattgataga agaggtagtc aattgccttc 1140 gaatctaaat tgtaatgttt ccgaggtatt gggacagaat gttgagcaaa atgcaggtga 1200 caaatctttg ctaaattttt cccagatacc aggagtaaag cactcagcag aaattcggac 1260 caaaattctg tagccttaga caccaatgaa caaaatgagg tgggttttct agatgaggca 1320 gcaggagggc atgaatccgt aggaagcttg gacgaagaac taaatttgct atttactgaa 1380 ctccgtcaca atacacaaag acaagagaac agacttcaag aaccatatta ccatacagaa 1440 gaaatcagag tttcccaaca aaccccatgg cagggccagg ggacaattgg aaagaagaaa 1500 tcaaccggta tattcaacaa gcattgacag aacagatggg agcattgatg gataaattgg 1560 gtccgatgtt agccaatcaa agaacgtcga ctccagcagc tcctccatct cctccactac 1620 ctccaccacc accattaacc cctccccaag ttaccgcttc agctcaacaa tatcatcctg 1680 cacctccgat atcgtcatac cctgtaggtg aaccatcacg ggaacaagga cagttgggta 1740 atatgcagcc aaattacaga caacgtttct atcctaattc gattgcttca cacgtctcaa 1800 tggcaggaga aattgaacct acacggggta gtcaaccaag ccatgattcg cgctatcgct 1860 ggcaggtacc agtaagtaaa tggcgaataa actttagcgg ggatactaga ggcccaacgg 1920 ttactcagtt cctcaataaa gtagaaatat tggcgaggaa taatagaatt agcgatcagg 1980 aattgttgag ccaagccaac tttttcttta gagaaaactc agaggccgaa gagtggtact 2040 ttatgttctg tagtaaattc accacttggg cgggttttaa gcatcttttg agactacgtt 2100 tcgagcaacc taacaaggat atggttattg aacgccagat tcttgataga agacagcaag 2160 cgaacgaaac tttcaatgca ttcttgtcag caatagagaa attggctcaa cagctgacaa 2220 agccactatc agaggaaaga aaactaaata ttttgaccga aaacatgagg gattgttaca 2280 ggcccttttt gaccatctac cgcattgaac gtattgacga attaactact atctgccata 2340 gcttagataa gtctatgtat aggacctaca gtacccacac tagaaacaga cctccacagg 2400 tgtacaatgt agaagacact gaacaagagt gggaagaaga agtagatcag gaagaagaat 2460 tgaacgccat tggacaggta attcgtaaaa acagaattac tgaaaagtct aaagacgtcc 2520 caaacactct gcccaaaact agcccaaatt ctgagaacaa cgttctatgc tggaactgtc 2580 gtcagtatgg ccacttctgg cgtaactgcg ataaagcaaa gaaaattttt tgtcatttct 2640 gcggacagac taattacatt gcagcaaatt gtccaaacaa tcattttttc ccgaacagcc 2700 aacagggaaa cgaaaattca ggaagattgt agggagcgat ctttctgatt ctctacaaga 2760 ctccccggaa aaaatggcgg aagaacataa gcaaaatgag ccaccctatt ctaaatatgc 2820 aactacgtgt tacataaata caaagcccaa caggtgtcca tacatcaaag ttgaaatcct 2880 tggaagtcca attgttgctc tattagattc cggagctagt gtaagtattc taagctcacg 2940 agacgtggtc gagagacatg gtttcaaaat caagccgatt aatttgatga tcaaaacggc 3000 tgatgaaaca gctcataact gtgaaggagt agttcaaata ccttacacgt acaatggcac 3060 cacgaacgtt gttcccacac tacttgtccc caaaatttca aaaaaactta tcctcgggat 3120 ggatttttgg gaagcgttta aaattgcccc agcattggtg gagggcaatg caatccaacc 3180 aatcattggt ttgaacgaaa ccacaattaa tctgatggag aatttctatg ctgacgtaga 3240 agaatcaatt gtcttcacgg ttgaacccca gggacagcca gtcgtcagag aggaagaagt 3300 ggttgaggat acaagcttag atctcccatt tgtcgaagac cccgaaaaca aaaatgtaaa 3360 agatattaca accgaacatg acttatctgt cgaagaccga caagaattga accggattat 3420 cgatctgttc aaaaagtcta atggaggaaa attgggaaga acaaatttaa ttcgccataa 3480 aattgagcta gtagagggag cacagccaaa aagacctcct caatatcgtt gttcccctca 3540 tatccaaaag gaagttgaca aagaggtagc gagaatgcta gaactcgatg tcattgagga 3600 atcgacctcg gattggtgca atccattatt gcccgtcaaa aaggcctcag gagaatggcg 3660 gatatgtctt gattgccgtc gaatcaatga cgttactaag aacgaagcct acccatatcc 3720 ggacatgcta ggaattttgg gccgtatcga aagagcaaaa ttcttcacgg taatagatct 3780 ttccaaggca tattggcaaa ttccgctgga ggaatccagt cgtgactaca cgtcgtttag 3840 agcaggcaaa caactatttc gatttaaagt aatgccgttc gggctcaagg gagcacccac 3900 gactcaaacg aaacttatga ataaggtttt gggtattgat ttagaaccgt acgtctacgt 3960 ctatttagac gacattatca taacatcaaa cagtctaaaa gaacactttc gcttgttgcg 4020 gaaagttgct gaaaggctag aagccgcaaa tttgacaata agtttggaga aatctaagtt 4080 ttgtcagaaa cagatatgct atttaggcta cacgctttct gaccaagggt tagcggttga 4140 cagtaccaaa atacaaccaa ttttggacta cccaactccg agaacaccga aggaagtacg 4200 ccgattcatc ggcatggtta gtttctacaa gcaattcatt gatcgtttta gtgacttaac 4260 gactccaata acagacttgt taaaaggaac aaagggtaaa attagttgga gcaaagaagc 4320 agattcagcc tttttaaaga ttaagtcagt actcacctcc ccaaaggtgt tagccaaccc 4380 tgatttcaga ctaccgttca ttattgagtc ggacgcctcc gatgttgcgg tgggggcggt 4440 tctagttcaa attttcgaag gggtcagaag gccgatagcg ttcttttcca agaagctttc 4500 cgcaacccaa cgaaaatatg ccccaacgga gaaggaatgc ctgggggtca tattggccat 4560 tcaaaaattc cggcattacg tagagggttc gcgattcact gtagtgaccg atgcgcaaag 4620 cctcatttgg cttcgtaaaa tcagtgcgga aggaggctcg gccaaattag tccgatgggc 4680 cttgaagctc caacagtatg attttgagct tttatatagg aaaggatcgt tgaacattac 4740 tgcggacgcc ctctcgcggg gagtgttgac cgtagaccta gtggatcctg aatatgaaga 4800 gctgaaaaag aaaataattt caaataacgt caaatacaaa gatttcagag tagtccaaga 4860 caaaatctac aagatagtaa agtcaaggtt caatgatcca agatttgagt ggaaatatgt 4920 tcccacaaat cgggagagga taaaattgat taaggaaatt catgattcaa tgcatttcgg 4980 tcgtagcaaa acgttaaaga agttgcagga gcgatactat tggccgttaa tggaaaatga 5040 agtgagaaag tactgtcaag gatgcgatac gtgtaaaagg actaaatatc caaacactaa 5100 tcaaaaacct ttgatgggta gacaaaaagt tgcctcaatg ccttggcaga cgatttcggt 5160 cgactttgta ggtccgtttc cacgatccaa atctgggaac tcagttttgt tggtggtgac 5220 agatttgttt tccaaatttg ttattatcca gcccttgagg gaagccaaaa caaagccgct 5280 agtatcattc ttggaaaaca tggtgtttct cctctttggt gtgccggaaa tactaatttc 5340 ggataacgga gtccagttca aatccaagga attcgagaaa tttttaaaca attaccatgt 5400 cactcactgg aagaacgcca attatcatcc ggcgaataac ccaacagaga gagttaatcg 5460 ggtaattgga gctgcgatta gaacatactt gaaagatgac cataaggaat gggatcggga 5520 cattcagaag gtcgccatgg ctatcaggac atcagtccac gaatcaacgt cattcacacc 5580 ctatttcgtg aactatgggc ggaactttat aagttctggg gcagaatata aaagaattag 5640 ggatacgaac actgatatca gctacgaacc ggtagaaatc aatgaaaacc ttaaaacaat 5700 atttgacaca gtcaaaatca atttgaaaaa ggcttacgag cgttatgcaa agcattataa 5760 tctgcgttct aaccaaaccc ttccttcata tgaaatagga gaagtggtgt taaagaagaa 5820 cttctttcaa tcgagtaaga gcaaacagtt ctcagcaaag ttagcgaatc cattttcacc 5880 agcaaaagtt gtcggtaaaa tagggtcttc gtgttatgat ttagaggatt tgcaaggaaa 5940 tcggttaggc gtttatcatg cctcacattt acaaaagaaa taatcggcca accaattaca 6000 agctatgtct gatgctaaag gcacagttac taagagattt ttccattaaa acacgtactt 6060 acggttcaac cattgcaggg ggactagctt caaaaacaac atgcatacta caagcagtcc 6120 catcctcgcg tttcacggtt atccaattag aattcatagt tccaaaccta aaaaaaaaaa 6180 caactcgtta gtaatgtcgt tggtatacag ctatgtacag gatcagctga agattctatg 6240 tttacatcct agacaatgaa taatgctaaa acaccaagaa cagggaaaac cgttcacaat 6300 attaagctat gtatttgaca acaacgaaac aaaacactgt gagtttacgg caggctcttg 6360 aacgtaaaca ctgtaaatat taattgtaaa tagtccagtt aactagtcta taaagatgaa 6420 ttgaaatcta gttcattttt gcatcaggtg attgatttag taggtacact tgatgtgacg 6480 ctgggcacaa taattggttt agcttcgttt agttcactag cactttttcg acagtaattt 6540 tttttcacac ttacaacatt tgcagtaact ctggtagcag cttgtagttg gttcatccgt 6600 tcctcgcaat acatcctcaa aattcgcctt agtttctggt tattttttaa tttgactccc 6660 actttcttgc aataacttta gttacagagc acagaaccaa aacaaaacaa ccgattattc 6720 agtgacaggt caaaccgcaa cttatcgttt aatgagcatc ggccaaaatt acagtagagg 6780 ctgttgaagt agggatacca tgatagtaaa tacgatgtgg agcggtgatg ttccagtagt 6840 gataatttgg agttgagttt tgttactgat tttatgtcag tagtatttgt tcggataaat 6900 ctgaaacgta cataaagtaa gttggttaga atgatgtgtc gagaaaattc ttgacggctc 6960 cttagggaga tcatggtgcc gccttcgcgc acaggtgttt attctagtga tcctagaatt 7020 ggtctgaata gacgttgata ttagacgtgg taggtctgtt aagttgaatg aaaactataa 7080 tggtatcccg atataagaga attaaattaa ctgtaaataa taaatcaaca aaatatatac 7140 actaggaata tctatatcta tatctatatc tatatatgca ctattttgat cagttgagtc 7200 gaaccaaatc aactaacaac aatagaaaaa taaattgcta ctgttttatt ttttacaaaa 7260 aaaaaataaa ttcagatatc ttcatttatt ttttttaacc gttggtaggg ggtaa 7315 // ID hATm-45_HM repbase; DNA; INV; 3725 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-45_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3725 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1939-1939 (2008). XX DR [1] (Consensus) XX CC ~15% divergence from consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(444..905,1149..2153,2135..3028) FT /product="hATm-45_HM_1p" FT /translation="MNNKCESGQPTRKRIHNIITRHESERYAGQPNELPVS FT DLPTYKQIIQYAYKVEKEMITTEKIDSAAIVLKVSKEILKIWKKVNPNLPL FT MKEKPLRNKLQRTFQKVIAAERSKASKHKNSVCWLENNLLKLFDISACRCE FT LPLLNCDDRFEVIHFTFYKLLNVYFFKFKVPVLEREYLKDQKVKKGPFGTW FT QIGQIDKKETAXIARKRNKEKINIKSDTTLHSDNDCISETYDMEIDNNDDY FT KNQNHEFFAKPLCEKNYTDLTNIAKAAIRFEVSDAATSAIATATLIDYQII FT TQNDRSQIITEKKIFDAKKRVSTMFSEKHCSEVFHLKVIGVDSKKDKNTLA FT HILGYDHNAEPIVNRTVKDEHHLTFTAESGPFSKKYLTHIEIKNGTGITMA FT TETLKILEKYNSVSSLEAIVLDNTSANTGADNGLVVTLEKLLNRRIHLIGC FT VLHQNELPLRHVIAEVDGKSNDPKKYKGPIGQKASADDSCMIYQFMHDLPI FT VDFIPIHSEIDSYVNSDILSDLSTDQRKLYEYCIGISKGFISPKYSMKKPG FT PVYHARWLTLALRIMMVYARVKNPTDELCTITKYIVQVYCPMWFAHKKSGL FT YKDAPRLLHKTIELVKNQADNIQQIVFQNLQGNSYCCLQENFLYSMLQDDE FT KDIRAQAIKQILYIRQSREVNPEVKPKIKAKSIMPINFNADTWSGLVDVSL FT IEVEPPTSILISSFELENALQTSKKPILTDLPNNSQSVERSVKLVSEASKI FT VYGQVKRHNFISTKNQSRAENKCNSSKTDYFIPFS*" XX SQ Sequence 3725 BP; 1403 A; 544 C; 586 G; 1181 T; 11 other; ttaagccact accaagttaa agtcaacact cggctcactt tgggaaatgt tctatttata 60 ttcaataaaa ttattatcaa tttaaaaaat ttcaaaaatt tgtcattttt ttgggaaaaa 120 caaaacaagt gacattttta tggttccatt caattattgc aaaattgrac agaacgacac 180 ttaaaatgtt attttcgcgt cataaattgg tacaaaaaca acaagtttgc catttttaaa 240 tgcgtttcca ttttttttgt tgggaaaaag ttacttttta caatgtttta agatataaaa 300 tattatataa aatctaacat agataaagta tagaaatatt agctatatat ttagcttctg 360 tattttatag tattttcact tgatattatc ttatataaca agtttataaa aaatcttttt 420 cagactgaat ttaaatatta aaaatgaata acaaatgtga atctgggcaa ccaacaagaa 480 aaagaataca taatattata actagacatg aaagtgaaag atatgctgga caaccaaacg 540 aactacctgt tagtgacctc ccaacatata aacagattat tcaatatgcc tataaggtcg 600 aaaaagaaat gattacaact gaaaaaattg actctgcagc catagtttta aaagtaagta 660 aagaaatttt aaaaatttgg aaaaaagtta atccaaatct tccactaatg aaagagaaac 720 ctctaagaaa taagttacaa agaacatttc aaaaagtaat tgcagctgaa cgatctaaag 780 cttcaaaaca taaaaacagt gtttgttggt tggaaaataa ccttttaaag ttatttgata 840 tatcagcatg cagatgtgaa cttcctcttt taaattgtga tgacaggttt gaagttatac 900 atttttaaaa aaccacaata ttaaacaata tgtttgttta aacaaatttg cgactaaact 960 gccatatatt ttactgttga acagtttttt tatttcatat tttattgtac atattactct 1020 ttagaaatgt tcgatgcaag ggttgctcta acaagcacta tgtatgtctc tgtgaagaaa 1080 gtaaacaggt taaattcgaa ttttttttgt attattattt ttttaattta tgcttaataa 1140 tattttaaac tttctacaag ttattaaatg tatatttttt taaatttaag gttccagttc 1200 tagaacgaga gtatttaaag gaccagaaag ttaagaaagg gccctttgga acctggcaga 1260 taggccaaat tgataagaaa gagacagcca ngatcgcaag aaaaagaaat aaggagaaaa 1320 ttaatataaa aagtgacaca acactycatt cagataatga ttgtattagt gaaacttatg 1380 atatggaaat agataacaat gatgattata aaaatcagaa tcatgagttt tttgcaaaac 1440 cactgtgtga aaaaaattac acagatttaa ctaacattgc caaagctgcw attagattcg 1500 aagttagtga tgctgcaaca tctgccatag caactgctac cttgatagac tatcaaatta 1560 taactcagaa tgacagatca caaattatca ctgaaaaaaa gatatttgac gctaaaaagc 1620 gtgtttcaac catgtttagc gaaaaacact gttcagaagt ttttcatttg aaagtcattg 1680 gagtagatag taaaaaagac aaaaatacat tagcacatat attaggatac gatcataatg 1740 cagaacctat tgttaataga acagtaaagg atgaacatca cttaactttt actgcagaaa 1800 gtggaccttt ttctaagaaa tatttaactc atattgaaat aaaaaatgga acaggaatta 1860 caatggcaac cgaaacacta aaaatacttg aaaaatacaa cagcgttagt agtttggaag 1920 ccattgttct tgataatact tctgctaata ctggtgcaga taatggctta gttgtgacat 1980 tagaaaagct tttaaatcga agaatccatt taataggatg tgtacttcac caaaatgagt 2040 taccacttcg ccatgtcata gctgaagtag atggtaaaag taacgatcca aagaaatata 2100 agggccctat tggtcaaaaa gcttctgcag atgattcatg catgatttac caatagttga 2160 ttttattcca attcattcag agattgattc atatgtaaac tctgatattt tatctgatct 2220 tagtacagat caacgtaaac tttatgaata ctgtattgga atatctaaag gttttatatc 2280 tccgaaatat tcaatgaaaa aaccaggacc tgtttaccat gccagatggc taactctagc 2340 tttaagaata atgatggttt atgcgagagt gaagaatcct acagatgagt tgtgcacaat 2400 tacaaagtat atagtccaag tatattgccc catgtggttt gcccataaaa agtccggtct 2460 ttataaagat gctccaagac ttcttcataa aacaatagag cttgttaaaa atcaggcaga 2520 caatatacaa caaattgttt ttcaaaatct tcaaggaaac tcttattgct gtcttcaaga 2580 aaacttttta tattctatgt tgcaagatga tgaaaaagat ataagagctc aagctattaa 2640 gcaaattttg tacataaggc aatcaagaga agtaaatcct gaagttaaac caaaaatcaa 2700 agcaaaatca attatgccta ttaactttaa cgctgataca tggagtggtt tagttgatgt 2760 ttctttaatt gaggttgaac ctccaacttc tatcctaatt tcatcatttg aattagagaa 2820 tgctttgcaa acttctaaaa aacctattct tactgattta cccaacaatt ctcagtcagt 2880 ggaaaggtct gtgaaacttg tatcagaggc atcaaagata gtttatggtc aagtaaagag 2940 acataacttt atttcgacta aaaaccaaag cagagcagaa aataaatgta attcgtctaa 3000 aactgattac tttatacctt tttcttgaaa aatgtttttt ttttttaatt ttaaatttgg 3060 ttcacatacc taagtccgar ggtatacttg acttgactac tgtcaagaat tatttttttt 3120 aatkgcaagt ctctcaaccc ttaacttcga aatcaccctt gtgcagaaaa acatgttgaa 3180 cggggtaatt tcagaagcat caggggctta agccagaagc ataaggcgag cgcttaccac 3240 aagtctatca gaacgtttat tacttgttat taaagtaaat tacgacgcat aaaatattta 3300 agatcagtcg gaaggtcgag aaagttgggg gttgaacgga tagaaacagg ggctaaggca 3360 gccgtaatcg gaatttgagt aggtctttac atagttgggg gggggttgct ttctccgccc 3420 cctgttcyta taccggtgtt taagattatg tggtaaatat tcattctata atattcaaat 3480 attctaatgg aaagaatatt tgtatcgaag tttgtactaa agtatataaa tatcgagaat 3540 gccataaaaa tgaatawttc atkgaaaaca agaaaatact taaaaacttt aaaaccattt 3600 cccaaaatta cccttactca aatcgrtctg aaattttcag gattgtctta aaatsatatg 3660 tacaacaaat cccaaagtga gccaatatga tctgagaaaa aaagtatatt ttggtagtgg 3720 cttaa 3725 // ID Copia-137_AA-I repbase; DNA; INV; 4108 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-137_AA_; KW Copia-137_AA-LTR; Ty1_copia_Ele138; Copia-137_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4108 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1500-1997] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 72..4085 FT /product="Copia-137_AA-I_1p" FT /translation="MIPRDNSSSTSQSGGMTGGNSRQIAGGDGSTASGSGV FT RSTATVRTFGSAVSLPVMEKLKGRQNYASWAFSMKMALIREGTWRAVKPLE FT NVAVDPDMSDRALAAICLSLEKHNFSLVKNADTAKQAWDKLQQAFQDDGLI FT RRFGLLDKLTTIKLEDCESVEDYVDQLVTTANDLSEIGFEVNDQWLASLLL FT KGLPEYYNPMIMGLQASGIQLTADVVKTKIIQDVKWPLKSSSSEGALYSKQ FT KIKRTKGGAEKKNGTCFTCKKPGHYAANCPQKQSSSFSKPKGKALCAMLAT FT GEGREEDWFFDSGATSHMTRTVEGFIKHTDWVHSIDTASSQSIKSVARGTV FT NLELEEGPIQVKDVWMVPDLTTNLLSISKICAKNMTVLFKFDGCEVRDEVG FT DIVVTGTQENGMYKLNTKKSEKAFLTNNSSLWHRRLGHLNQQYMSKLTTMV FT DGMPRQFGVTADCVACVQGKHTRSSFHCSSSRAQNPLDLVHSDVCGPVEVS FT SIGGSRYFVTFVDDASRKVFVYFIESKSQVKEVYEKFKALVERQTGRKLKI FT LRTDNGTEYINSTMKKSFERDGVIHQTTCPYTPEQNGTAERMNRTLVEKAR FT SMLNDVGLPKKFWAEAVSTAAYLVNRSPTRSLETTPEEAWCQKKPDLKHLR FT IFGSQVMVHCPKQKRQKFDSKSVKGVFVGYAENSKGFRVYDQERDEVIISR FT DVVFINEEQKFQPKDVGEQYQDQPVEFIELLDWSETNAHPENEGLNVQQPA FT RNQEMSSSVEIAEASTALPPQSGRSADEQQGFRRSGRERQPPGKYSDFMCF FT SSITGPDDFTDPTPMASEMADDPSNYTEAIERSDRERWITAMREEIQALTE FT NETWELTDLPNGRKAIKNKWIYKTKRGSDGTVERFKARLVVKGCSQRYGVD FT FDEVYSPVVRYSTIRYLMSLAAQHNLDVEQMDAVTAFLQGKLADEVIFMEQ FT PEGFVQNPKKVCRLKKALYGLKQSSRVWNLQLNEALREFGLERSLMDTCLY FT FKIDGESMTFVTIYVDDFLLFTNDSEMKKKLKTFLHSRFKMKDLGEAKFCI FT GLKITRDRVNGKIWLDQQQYVKDVLERFNMANCSPVSTPVDPGAKLNKSMC FT PSEPKEVQSMQAIPYKEAVGSLMFIAQATRPDISFAVNMVSKFSSNPGRRH FT WEAVKRIMRYLKGTSNNRLEYSVTGNPHFTGYTDADWGGDEDRRSTTGYVF FT SKMGGAISWNCKRQDVVALSTCEAEYIALSRTTQEALWWQQMLQQINGDLQ FT VPILCDNQSAICVARNQGYNPRTKHISIRYHFVREALDRGDITLDYVSTKQ FT QPADGFTKPLTRQNHEAFKKLLGIVG" XX SQ Sequence 4108 BP; 1234 A; 818 C; 1076 G; 980 T; 0 other; agtggttatg ggcccagtac aggctagaat ttgaacaaat tcgacgaccg caattcgtgc 60 tagttcctga gatgattccg agggataatt caagttcaac ttctcaaagc ggaggaatga 120 ctggcggaaa ctcgagacag atagctggag gtgatggttc aactgcaagt ggaagcggtg 180 tgcggtcaac agcaactgtt cggactttcg gaagcgcagt atccctaccg gtaatggaga 240 aattgaaggg aaggcagaat tatgcatcgt gggctttttc aatgaaaatg gcacttatac 300 gcgaaggtac gtggcgtgca gtcaaaccgt tggagaatgt agcagtggat ccggatatga 360 gcgatcgagc acttgcggcc atttgtttga gcttggagaa gcacaacttt agtttggtca 420 agaacgctga taccgctaaa caggcatggg acaaactgca acaggcattc caggacgatg 480 gactaatccg gcggttcggt cttctggaca aattgacgac tatcaagttg gaagattgcg 540 aatctgttga agactacgtg gaccagttgg tgactacagc aaatgatctg agcgagatcg 600 gttttgaagt caatgatcaa tggctggctt cattactgct gaaaggttta cctgaatatt 660 acaatccgat gataatgggt cttcaagctt caggaattca attgactgcg gatgtagtaa 720 agacgaagat tatccaagat gttaaatggc ctttgaagag ttccagcagt gaaggtgcac 780 tctattcaaa gcagaagata aagcgaacga agggtggtgc agaaaagaaa aatggtacct 840 gttttacttg taagaaaccg gggcattacg cagcaaactg tccccagaag caaagctcaa 900 gtttttcaaa accgaaaggt aaggcattgt gtgctatgtt agctacagga gaaggaagag 960 aagaagattg gtttttcgat tccggcgcaa catcacacat gacgcggacg gtagaaggtt 1020 tcatcaagca tactgactgg gtacattcga tcgacactgc cagtagtcaa agcatcaaat 1080 ccgttgcaag aggtacagtg aatctggagt tagaagaagg accaatacag gtgaaggacg 1140 tttggatggt tccagatctg acgacaaacc tgttgtctat tagtaaaata tgtgctaaaa 1200 acatgacagt cctgttcaag tttgacggat gtgaagttcg agacgaagtt ggcgatatcg 1260 ttgttaccgg tactcaagag aatgggatgt acaagctgaa tacgaagaaa tcggaaaaag 1320 cgtttctgac gaacaattca agcttgtggc accgcagact aggacatctg aatcagcaat 1380 acatgagcaa acttacaacg atggtcgacg ggatgccgcg acaatttggc gtcacagcgg 1440 attgtgtggc atgcgttcag ggtaagcaca ctcgaagttc ttttcattgt agttcctcgc 1500 gtgctcaaaa cccgttggat ctggttcact ctgatgtctg cggaccggta gaagtttcgt 1560 cgattggtgg aagccgttac ttcgtgacct ttgtggacga tgccagtcgc aaggttttcg 1620 tgtatttcat cgagtccaag agtcaagtaa aagaagtgta cgagaaattc aaggcactag 1680 ttgaacgaca aacgggacga aaattgaaga tactacggac ggataacggt acagagtata 1740 tcaactccac gatgaagaaa agcttcgaac gcgatggggt gattcaccaa accacgtgtc 1800 cttacacacc tgaacaaaac gggactgcgg agaggatgaa tagaacactt gtagagaagg 1860 cgaggtcaat gttgaacgat gtcggactcc ccaagaaatt ttgggcggag gccgtatcca 1920 cggcagcata tttggtgaac agatctccaa ctcgatcact agaaacaacg ccagaagagg 1980 cgtggtgtca gaagaaaccc gacctgaagc acttacgaat atttggatct caagtaatgg 2040 ttcattgtcc gaagcaaaaa cggcagaagt tcgattcaaa atcggtcaag ggcgtattcg 2100 ttggatacgc tgagaattcc aaaggcttcc gggtatacga tcaagagagg gacgaggtga 2160 taataagtcg agatgtcgtg ttcattaatg aggagcagaa gtttcagccg aaggatgtag 2220 gagaacagta tcaagatcaa ccggtggagt tcatcgagtt gcttgactgg tcagaaacaa 2280 atgctcatcc agaaaatgaa ggcttaaatg ttcaacaacc tgcacgcaat caagaaatgt 2340 cttcttccgt ggaaattgcc gaggccagca ctgcgctccc accgcaatct ggtagatccg 2400 ctgatgagca gcaaggattt aggcgcagcg gtcgggagcg ccagccccca ggcaagtatt 2460 ccgatttcat gtgttttagt tcaatcactg gcccagatga ttttacagac ccaaccccga 2520 tggcttccga gatggcagat gatccaagca actacaccga agcaattgag cgatccgatc 2580 gagaaaggtg gataactgcc atgcgagagg agattcaagc gttgactgaa aacgaaacct 2640 gggagttgac tgatttaccg aatggacgaa aggcaattaa gaacaaatgg atatacaaaa 2700 cgaaacgtgg atcagatgga acagtggagc ggtttaaggc caggctcgtc gtgaaagggt 2760 gttcccagcg ttacggagtc gatttcgatg aggtttactc ccctgtcgta cgttactcga 2820 caattcgtta tctgatgtcc ttggcggctc aacacaacct agacgtagaa caaatggatg 2880 cagtaaccgc ttttctgcaa ggaaaattgg ccgatgaagt cattttcatg gaacagcctg 2940 aaggttttgt tcaaaatcca aagaaggtat gccgtttgaa gaaagcgttg tacggtttga 3000 aacaatcgag ccgtgtgtgg aatcttcaac tgaatgaagc tttacgtgag tttggattgg 3060 aacgttcttt gatggatacc tgcctatatt tcaagattga cggcgaatcg atgacatttg 3120 taacaattta tgtggacgat ttccttttgt tcaccaacga ctcggaaatg aagaagaagc 3180 tgaagacttt tctccacagc cgcttcaaaa tgaaagatct tggagaagca aagttttgca 3240 tagggctgaa aatcacacgt gatagagtga atggcaagat ttggttggac caacagcagt 3300 acgttaagga tgttctcgaa cggttcaaca tggcgaactg tagccccgtt tctactccag 3360 ttgatcccgg cgcgaaactc aacaaatcga tgtgtccgtc cgaaccaaaa gaagtgcaga 3420 gcatgcaggc catcccatac aaagaggcag tcgggtcatt gatgttcatc gctcaagcaa 3480 cacgaccaga tatttctttc gctgtaaaca tggtgagtaa gtttagcagt aaccctggtc 3540 gtaggcattg ggaggcagta aaacgaatca tgcgttacct gaagggcact tcaaataatc 3600 gtttggaata ctcggtgact ggcaatccac attttactgg ctatacggat gcggattggg 3660 gaggagatga agatcgaagg tctaccactg gttatgtttt tagcaagatg ggaggagcga 3720 tctcgtggaa ctgcaaacga caagatgtag tggcattatc aacttgtgaa gccgaatata 3780 ttgctttatc aaggactaca caagaagctc tctggtggca gcagatgttg cagcaaatta 3840 atggagatct acaagtacca attctatgcg acaaccaatc tgccatatgc gttgcaagaa 3900 atcaaggtta taatccccgt acaaaacaca tctcaattcg gtatcatttc gttcgtgaag 3960 ctctggatcg tggcgacatt actttggact atgtttctac aaagcagcag ccagcggacg 4020 gatttaccaa gccactaacc aggcagaatc atgaggcatt caagaaactg ctcggtatcg 4080 taggttaagg aggagtgttg cggtatgg 4108 // ID MuDR5x_AP repbase; DNA; INV; 2680 BP. XX AC Contig271; XX DT 25-JUN-2009 (Rel. 14.07, Created) DT 25-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR5x_AP. XX NM MuDR5x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2680 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1354-1354 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(368..532,818..1579) FT /product="MuDR5x_AP_1p" FT /translation="MINDFIDVTTVSIRQCVSIDTQLNLVPVSLKYNGFRF FT KNASTIYCKSKRQQVIIAISSFLIFTLILGDIIKESVHYSHAPDIAKVKIK FT EIITNIKEEAKTSVLTPRNLLSRAVETLPTSIIGQLPNIEKISKTIRRERI FT KQQKPPANPVNVGDLIIPGEYSVTNKGDMFLFYDNKIQKRILIFSTLENLN FT MLKECSSWFGDGTFRSVPTFFSQLYTIHGTKNKQSFPLVYILMVDRSKDSY FT IEVLKALKSAVSSLTPQRIMDDFEMAFISACKDEFPTTEIKGCFFHFEQCL FT WRKIQSCGLQQMYLWL*" XX SQ Sequence 2680 BP; 1000 A; 382 C; 355 G; 943 T; 0 other; caaaatatct ccgatacaaa atatctctag tcaaatatat ctccgattta aaatatctcc 60 agttcaaaat atcttctatt taaaatatct ccagtacaaa atatctcctc ttaaaaatat 120 ctttaatgga aatatctctc atgaaaatat tttataataa tttttttttt tacaacgaca 180 attattaaaa gtagataaaa aactgatgtc taaagtgatt accgaagtaa gtacattttc 240 tagtaatacc taatgtttaa aaaataatcc gaaaggcaaa aataactcat cacaatatta 300 cactgtcacg ctgagcaatt tgtattatct gatatatttt tcacgattac gagtgcgatt 360 acaaaccatg ataaacgatt ttatcgacgt aacgactgta tcgatacgac agtgtgtcag 420 tatagacaca cagcttaacc tagttccagt ttcactaaaa tacaacggat ttagatttaa 480 aaatgcctct acaatttatt gtaagtcaaa aaggcaacaa gttattattg cataatggat 540 accttcatac gttctataaa gaaggtgtga acaaattcat ttggcgttgt tctgagtaca 600 aaacatataa atgtacttca tgctatacaa ccactcgaga agaaataggt aattattata 660 ttactaatta ttaaggactg gatttaattg tatcatttac caaaaattaa ccatagaatc 720 tacacattac aaaacgttga catttataat aaattttaaa ttttataacg tcttaacata 780 atgtacaact caatataaat taatacaact taactaaatt agctcttttt taatttttac 840 tttaatttta ggggatataa taaaagaatc tgttcattac tcacacgcac cagatattgc 900 taaagtaaaa attaaagaaa ttattacaaa tataaaagaa gaagcaaaaa catctgtatt 960 aactcccaga aatttattaa gtcgtgcagt tgaaactctc ccgacatcaa taattggtca 1020 attaccgaac atagaaaaaa tatctaaaac aattcgacga gaaaggatta aacaacaaaa 1080 acctccagct aatccagtaa atgttggtga cttaataatt cctggtgaat attctgtaac 1140 aaacaaaggt gacatgtttt tattttatga taataaaatt caaaagcgta tattaatatt 1200 ttccacttta gaaaatttaa atatgttaaa agaatgctcg agttggtttg gtgatggcac 1260 atttcgttca gtgccaacat ttttttcaca actttacact atacatggta caaaaaataa 1320 acaaagtttt cctctggtat acatattaat ggtagatcgt tctaaagatt catatattga 1380 agttctgaaa gcgttaaaat ctgcagtttc cagtttaacg ccccaacgaa taatggatga 1440 ttttgaaatg gccttcatat cagcttgtaa agatgaattt cctacaacag aaattaaagg 1500 ttgttttttc catttcgaac aatgtttatg gcggaaaatt caaagttgtg gtttacaaca 1560 aatgtattta tggttatgac gtagaatttt ctatgcaaat acgatcgttg agtgctttgg 1620 cgtttattcc aacaattaat gttattcgaa ctttcgaaga acttatagat tcaacttatt 1680 ttttggaaaa tgaagaacta ttatctccaa taatcaatta ttttgaagat acatggatag 1740 ggcgaccaaa tcgtaacaac cggagacgct cgccccagtt tgatttttga atgtggaact 1800 gttacgattc agtacttaaa catcaaccac ggaccaataa cgctgttgaa ggttggcatc 1860 atgcgtttaa cagtgcactt gcagcaaatc atgttacgat ttggaaattc ataaattttc 1920 ttaaacaaga acagtcatta caagaagtat tttatataat atttatgtat ttaataatat 1980 attattaaat caccttatgt ttaggtcaac atggaacaat ctttgtcggg tgaaccaagt 2040 ccgaaaaaaa gaaaaaagta caatgattat gatgaacgac tgtttaaaat cgtaagtcaa 2100 tacaacgaga ttacaccaat gaattattta aggggaattg cccacaatct taaattttaa 2160 cttatatctt gatcggattt tatatttatt tatttaactt tttaaatttt tattaataca 2220 ttaattacct ggtactcaat ttatttaaat ttttttaatt gtattacatc actcaatcgt 2280 ctacaacctt aaattataat ttgatatata ttttttgtga taattttgtt ttaaaaatac 2340 agtcgattat attacaatga aaaatttttt tgtacttgta ataaataata attacataat 2400 attatgatta caataacata tttttaaata ctaatgtact tactttggta atcattccag 2460 ccatcagttt tttatctact tttaataatt gtcgtgttaa aaaaattaat attaatataa 2520 aatataaata ttaaaatatt ttcatgagag ttatttccat taaagatatt tttaacagga 2580 gatattttgt actggagata ttttaaatag aagatatttt gtatcggaga tattttgtcg 2640 gagatatttt gaccggagat atttttgcgg agatattttg 2680 // ID Gypsy11-NVi_I repbase; DNA; INV; 5478 BP. XX AC AAZX01023279; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy11-NV; KW Gypsy11-NVi_LTR; internal portion; Gypsy11-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5478 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1144-1144 (2007). XX DR Genome; AAZX01023279; Positions 25260 19783. XX CC Positions [2876-3331] - Reverse transcriptase CC Positions [4256-4768] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 865..2196 FT /product="Gypsy11-NV_I_1p" FT /translation="MSASGHSASNSQSPSKTRSGTIFGSNMPLVHSDSEDE FT PLGTGSGKDNPSLGQPISDAQAFLEATLSSLQLELPNLIKGAVAPLQTQID FT QLSDRLRLQTSLASGVENDRASALSTSNQAFSLNPTAPSFSASARVPVSSA FT SLSLDNFKSNPIHTDTQTTMNQPATNAPGSLVPADLIPTHTDASRPHSDAF FT LAQKINPGPFWTHNPEGWFVGLEINFEVLNITDDKLKFATLWRSLSRDVAL FT RIDSTIRNLPLESRYEVVKQALLKEFSDTVEQRLDKLFERSALGSKKPSDL FT LKEMYIHAGPNIDKDGVRSYWRKLLPKDVQFAIASLTDLPDDKLFELADRI FT HHIHNSVPDQRLNAVETDRSISSVDLQDLNKRLASLEKSLRESRNSSRSNN FT NSNRSRSKSRNTKSRKSASGLCFAHSKYPDNPTSCRDWCSKNAEWKAKNQ" FT CDS 2852..5203 FT /product="Gypsy11-NV_I_2p" FT /translation="MAQKKDGSPRPCGDYTGLNAITIPDKYPTSHLYDCNN FT NLHGKKIFSALDLHKAFNQIPIAPEDIEKTAIITPFGLFEFLFMPFGLRNA FT SQTFQSYINRALGDLDFVYIYIDDILIASDSREQHQKHLRTVFERLKKFHL FT RLNVDKSVFMVEELEFLGYLINSQGIRPTRAKIDAVLNFPKPRTIVELRRF FT LRMVNFYHRNLPHAATSQAPLNAFFRDSRKNDKRLIQWTPEADEAFEKVRS FT EFAEAALLVHPRSSPSQQRYSAYDRELTAVYEAIKYFAHIVEGCDFSVLTD FT HKPLIYAFLQKTDKAPPRRVRQLNYISQFTTRIEHVRGIDNTVADSLSRIE FT SVRFPLEFDLEELAAAQEADKQLQEIRESPEHSLSLKRIQFGPNHTTICCD FT LTGETLRPFVPVSMRESVFAFFHKPAHPGPKVTDRLIRQRYVWPAMHRDIA FT NWCKLCLDCQQSKISRHVKLTPSQFVAPDGRFDHVHIDIVGPLPIKDGLQY FT VLTMIDRFSRWIEAVPLPETSAQTVARAFFDTWVARYGAPKVITSDQGSQF FT ESQLFSALLSLIGCERIRTTAYHPAANGMIERWHRCMKAAIMCHNDTDWPR FT TLSTVLLGLRCHLRADTNASPAEFMFGTTLRLPGEFFIPEDQTPDPNFFLE FT EYREYMRQIRPIPVAHNYKKRAFYFKDLYKCTHVFMRNMAKKSLERPYSGP FT FKIVKRVSDRVFDIDVNGTTKSVTVELLKPAYYIPDDLGELVPSDSTNGNQ FT PLSNPSDKPSCNPSNTSRPALKTYARKKVSFAS" XX SQ Sequence 5478 BP; 1349 A; 1457 C; 1196 G; 1476 T; 0 other; gcctgtccag gcgaccgcct ttcggcgcag cagcagtatc atcagcagct ctcaacgcag 60 cagcgtcatc ggcagcgggg cggcccggca acctggacat cgtctccgcg gcagttcttc 120 accacctgga cgtcgtcacc ggagcctcta ggtctgcccc gctctcctca tcgcatcatc 180 gagcgcccca ggtatttcgc gggaatttct cgagtgcgcg tcaagcataa aaaaagcgcg 240 cgtcaattag taaaaagcgt gcggcggttc gtaggaaagc gcaccaacgg aattcgtttc 300 cgagaattcg gcggccatct tgccatgacg caattgtttc gttctctcag cgcgcgcgtt 360 tttctggctc agtagtgcca accttagctt cgcgcatcga tttacaacta gctagcgcgt 420 ttcggtgatt cattgatgcc aaccttgatt cggtataacc agccagcacg ttttcgtggt 480 tcagtgatgc caaccttcgc ttcgcgcagt agttcgcgag caactagtgt tcttatggct 540 cattgatgcc aacttcagct tcgcacgata gttcgcgact agctagcgcg cttattcaaa 600 ttttcttttc ttttctttca tacacgtact ctgcacgagg tatgcgtaat tttttattgt 660 ttgtatacag tgcctgcgtg cacacttgcg ctagtcacac tgtaaacctc gtagcagaaa 720 tctagttagt agcgtttagt tttatgtccc tgcgtatgcg tgcggctcaa gatccgtata 780 caattagtag cgtttatatt gtgaggtgcc tgcgtgcacc caataggcaa ttttgtcaat 840 cttgagcagc ttttcccagc gttcatgagc gccagcgggc actctgcgag caacagccag 900 tctcctagta agacccgatc gggcacaatt ttcggctcca acatgccttt agtacacagt 960 gattctgagg atgaacccct gggcacgggg tctggcaagg acaatccttc cttgggccaa 1020 ccaatctctg atgctcaggc ttttttagaa gctacgctaa gcagtcttca gctcgagctt 1080 cccaatttga ttaagggagc agttgctcca ttgcagactc agatcgacca gctgtcggat 1140 aggcttcgtt tacaaacttc tctcgcttcc ggtgtagaaa atgacagagc ctctgctctc 1200 agcacttcaa accaagcgtt ttctctcaat ccgactgctc cttctttttc ggcatcagct 1260 agagttcctg tttccagtgc atcattgtct ctcgataatt ttaagagtaa tccgattcac 1320 acggatactc aaaccactat gaaccaacct gcaaccaatg ctcctggcag cttggtccct 1380 gcagatttga ttccaacaca cacggacgct tctcgtcctc actcagacgc ctttctggct 1440 cagaaaatca atccaggtcc tttctggact cacaatccag aaggctggtt cgttggtctc 1500 gaaatcaact tcgaggttct caacatcact gatgacaagc ttaaattcgc cactctctgg 1560 cgcagcttga gcagagatgt tgcccttaga atagacagca ccattcgtaa ccttcccctt 1620 gaaagcaggt acgaggtagt caaacaagct ctgctaaagg agttttccga tactgttgaa 1680 cagcgtcttg acaaactttt cgagagaagc gctctaggta gcaagaagcc ctctgatctt 1740 cttaaagaga tgtacattca cgccggtcct aacatcgata aggacggtgt gcgtagctat 1800 tggaggaaac ttctccccaa ggacgtgcag tttgcgattg cgtcgttaac ggatcttcct 1860 gacgacaagt tattcgagct tgcggacagg attcatcaca ttcataactc cgtcccagat 1920 caacgtctca atgcggtaga gacggataga tctatttctt cagtcgacct tcaggatctc 1980 aacaagcgtc tagctagctt agagaaatct ctacgcgaat ctcgcaactc ttctcgctct 2040 aataataact ccaacagatc taggagcaag agccgtaaca ctaagtcccg caagagtgct 2100 tcagggttgt gttttgcgca ctctaaatat ccagacaatc ctacctcctg cagggattgg 2160 tgtagtaaga acgcagaatg gaaagcaaaa aaccagtaag acctgcggac ttggaggccg 2220 tctgtcaagg tcattcctct aagcgtctcc acatccaaga ccttatttca ggtcaattct 2280 ttctaatcga tacaggagca gacatctctc tgcttcctgc tgtaaataat attagttgta 2340 agcccgatag tcttaagtta tctgcggcca atggcaccaa aattaatacg tatggcgaat 2400 cttatcgcac gctagatctc ggtcttcgcc gtccatttgc gtggaatttc tgtatcgccg 2460 aggttccgtc cgctataatt ggtgctgact ttctgtcaca ccatagtctc acagtcgacc 2520 tggctagccg gcgactggtt gacacccaga cgaacatttc ttcggctgcg actctcagag 2580 cagcacccct cgttgctatc tctactatct cgcctaatag cgaacatgct cagctgctag 2640 cggagtttca gagatcactg gtcccgctag agctggtctc agtggcaaag ctgatgtgta 2700 tcatcacatc tacacctcgg gtcctcccgt ctctcaacgc ccgcgtcggc ttcatcccga 2760 caagctcaga gccgctaagg ccgagttccg tgcgtggcaa gaagcaggta tctgcagacc 2820 cggtagcggt ccctgggcca gccctttgca catggcgcaa aaaaaggatg gttcacccag 2880 accctgcggt gattacacgg gtcttaatgc cattactatt ccggacaaat atccgacttc 2940 gcatctttat gattgcaaca acaatcttca cggcaaaaag attttttccg cgctcgatct 3000 tcacaaggca ttcaatcaaa ttccaattgc gcctgaggac attgaaaaga cggccattat 3060 aacgcccttc ggcttatttg aatttttatt catgccgttt ggtcttcgca atgcgagcca 3120 aacgtttcag agctatatca atcgtgcttt gggggattta gacttcgtct acatatatat 3180 tgacgatatc ctcatcgctt ctgactctcg tgagcaacat cagaagcatc tgcgcaccgt 3240 tttcgagcgc ctgaagaaat tccatcttcg gctcaacgtt gacaaaagcg tctttatggt 3300 agaggaactc gagtttcttg gttacctcat caactctcag ggaatcaggc ctactcgagc 3360 caaaatcgat gccgtcttga attttcccaa acctcgcacg attgtagaac ttcgtcgctt 3420 cttaaggatg gtgaattttt atcaccgaaa tcttccgcac gcagccacct ctcaggctcc 3480 tctcaacgct ttctttcgcg actctcgcaa aaatgacaag cgtttgattc aatggacccc 3540 tgaagctgac gaagccttcg agaaagtcag atctgaattt gcagaagctg cactcctggt 3600 gcatcctcgc tcatcccctt cgcaacagcg ctacagcgcg tatgatcgcg agctcaccgc 3660 tgtttacgaa gccatcaaat attttgcaca catcgtggaa ggttgtgact tctctgtgct 3720 tacagatcac aaaccactta tttacgcatt cctgcaaaaa actgacaagg ctccacctcg 3780 tcgtgtgcga caattaaact acatatcgca attcactacg cgtatagaac acgttagagg 3840 tatcgataac accgtagctg attctctttc gagaatcgaa tcagtgcgtt tccctctgga 3900 attcgatctt gaggaacttg cagcggcgca agaggctgat aaacagcttc aggagatacg 3960 cgagtcacct gagcactcac tatccttgaa gcgcattcaa ttcggtccga atcacacaac 4020 gatatgttgt gatttaacag gtgaaactct tcgacctttt gtaccagttt ccatgcgaga 4080 atccgtcttt gcgttctttc acaaaccagc tcatcctggg cccaaggtca ctgatcgtct 4140 gatcaggcag aggtacgtct ggccagcgat gcatcgcgac attgctaact ggtgcaagct 4200 ttgcctcgac tgtcaacaat caaaaatctc acgacatgtc aagctcactc ccagtcaatt 4260 tgtagcacca gatggtcgat ttgaccacgt gcatattgac attgttggtc cgttaccaat 4320 caaggatggc ttgcagtacg ttttgaccat gattgacagg ttttccaggt ggatcgaggc 4380 tgttcctctc cccgaaacct cggctcaaac ggtcgcacga gctttcttcg atacttgggt 4440 tgcgaggtat ggcgcgccca aggtcatcac ttccgatcag ggttcacaat ttgagtctca 4500 gcttttttct gctcttctct cgctcatagg atgcgaacgg attcgcacca ccgcctatca 4560 tcccgccgct aacggcatga tagagcgttg gcaccgctgt atgaaggctg caatcatgtg 4620 ccacaacgac acagattggc ctcgtactct ttccacagtc cttctcggac tacgttgtca 4680 tcttcgggct gataccaacg cttcccctgc ggaattcatg tttggcacta ccctacgcct 4740 tccgggcgaa tttttcatac ccgaagatca gacaccggat cccaatttct tcttagaaga 4800 atatcgggaa tacatgcgtc aaatccgacc tatacctgtc gctcataatt ataagaagcg 4860 cgcgttttat tttaaggacc tttacaaatg tacccatgtg ttcatgcgta acatggctaa 4920 aaaatcatta gaaaggcctt attcaggacc attcaagatt gttaagcgag tttccgatcg 4980 cgtcttcgac atagatgtaa acggtaccac gaaaagcgtt acggttgagt tgttaaaacc 5040 agcttattat attccagatg atctcggcga actagtcccg tcagatagta ctaatggcaa 5100 tcagccacta tctaatccta gtgataagcc tagttgcaat ccgtctaaca ctagtcgtcc 5160 agctttaaaa acttatgcgc gcaaaaaagt atcttttgct tcttagtttg cgtcagttct 5220 ctttagtttt caagaaatac attttgttgt tgttctccgt ttaaggctgc ctcccagcca 5280 ataaaaaaaa aaaaaaaaaa aaaaatgtta aaacttatta tttaaataaa gaatcatttg 5340 cgttaacact cgggggggag tgtgtaggga ttctcccctc ttccgccgag tgggggtaag 5400 ccgccatgtt ggcgaggaga attcgccgcc tgcgcgcggc cagaaagtcg ttggctctcg 5460 agtcgaacgg ataggaag 5478 // ID Gypsy-12_RP-LTR repbase; DNA; INV; 636 BP. XX AC ACPB02012584; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_RP_; KW Gypsy-12_RP-I; Gypsy-12_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-636 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02012584; Positions 1616 2251. XX SQ Sequence 636 BP; 181 A; 97 C; 174 G; 184 T; 0 other; tgtatgtaac ggtggtggtg aggagaagaa gcagcagtag agtggggagc gctcgatagt 60 agtggagtgg ggcaagcggc cggttggtct gtgaggtagt atcggccgaa gcgtggcctg 120 gaatgtagct gcattaaaca gacagttaat acgattacag atcagtaaat ataccgctgc 180 ttataccgca tcacctggag caccttgctc ggtccgagct aagagagaga gagaggtgca 240 ggcgaggaga ggagcagagg tgtgaggcac tgcttgattg ggcccagacg gaaggaagcg 300 acagcgagga ttcttgcgtg aggcacttgt gtactttcat gtgttaaccg ctaacctggg 360 gagttatttg aagggacttt aattttgtag acttacgtta aattagaatt aaaatattaa 420 attaagtatt agttcggtag ttattattta gtgttaatag tgtattgggg atcatttggt 480 aattattatt gttctgtacg cattcagctg aatggcaggg gaattttatt agttaaaaag 540 aagcgtgttt attttatatt gtaattataa aaatatatgt atataacctc tgaaccatgt 600 tgtactctaa acacaaccca attccctctc cttaca 636 // ID SINE-5_CQ repbase; DNA; INV; 247 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-LTR retrotransposon from Culex quinquefasciatus - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-247 RA Jurka J.; RT "Non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 597-597 (2011). XX DR [2] (Consensus) XX CC >98% identity to consensus. Putative SINE. Present in ~10,000 CC copies in the genome. XX SQ Sequence 247 BP; 52 A; 62 C; 71 G; 62 T; 0 other; gcctcgtggc gcggtggtta gcggcttcgg ctgccgatcc ctaagttgct atggggcgcg 60 ggttcgattc ccgccttatc ctcctggcct tctatcggat ggggaagtaa aacgtcggtc 120 catttgcgta aaagaggttt tgggtgactc accacacata accttcggac gcctagaaat 180 gagcagaaac ttgcaacaga gcccacaaaa gacccggggg tcgttaaagt ggattgcttt 240 gcttttt 247 // ID DNA4-3_AP repbase; DNA; INV; 637 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-3_AP. XX NM DNA4-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-637 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1740-1740 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4 bp TSD (TATA). A variant of this transposon is inserted in CC DNA8-1_AP. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 637 BP; 234 A; 77 C; 82 G; 241 T; 3 other; ccgggtgatt cttttatcat ngaacactca ttatttcaaa aagtgtaaat ttttttgaaa 60 atattttttt acatagtttc aagtcgctta taaaacaacg tttttcttaa aaaattatat 120 ttttaaatat tttttatcct tataattttt taagtttttt acttttttga atgacaacat 180 agggttttaa ttttatattc caaagcagaa tatttttctt agtattttga tacatgaaaa 240 tcgaatttgg ggcgagtagt ttatgagtta taaatattca aagtttagat gagcggagtg 300 gagtggtacg gggttacccc gcgaaatgtt tgtccacttc tccgctcgtc taaactttga 360 atatttataa ctcataaact actcgcccca aattcgattt tcatgtatca aaatactaag 420 aaaaatattc tgctttggaa tataaaatta aaaccctatg ttgtcattca aaaaagtaaa 480 aaacttaaaa aattataagg ataaaaaata tttaaaaata taatttttta anaaaaacgt 540 tgttttataa gcgacttgaa actatgtaaa aaaatatttt caaaaaaatt tacacttttt 600 gaaataatga gtgttntatg ataaaagaat cacccgg 637 // ID piggyBac-18_SM repbase; DNA; INV; 2465 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-18_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2465 RA Jurka J.; RT "Families of autonomous piggyBac elements from planaria."; RL Repbase Reports 9(8), 1828-1828 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 445..1324 FT /product="piggyBac-18_SM_1p" FT /translation="MLHMTLMLTFFLLVHYLYHIIAFGLFLLSSRLLLVLY FT SSHVSTSTTSSSQQALKILMNINCDESGEENSDEEMSNESSDDFPLPLPLA FT QRIGNKPCSSTSVTQQKSDMTDRAIGKDGTEWRTFSQLRSGGRKTIQNVRR FT VNGGPTTYAIRRVDDSPYSSFKLFFDEPMLRHIQKCTEAEARRQGENDWFV FT PLHEMETFIGLCYARGVLAKHISIAELWSKEWGAPIFRDSMSRDRFTAIMK FT FLRFDMKSERASRLQEDRFALASSLWFPFIENCKKAFVPNENLTIDEQLYP FT SX" FT CDS 1326..2219 FT /product="piggyBac-18_SM_2p" FT /translation="MSVSIYSVHCFKTRQVWHQILFLVDLQTKFVCNGFPY FT LGKNHSRPANQALPSYVVHTLLQGYERNGHNVTCDNYFTSLNLVEELQCKG FT VSMIGTIRNNRRELPDVESLMKGKALFDSSFFISENKTILTVYKCKKNKVV FT SLLSSTHDTVSIDASSKKKSNIINFYNSTKCGVDCVDQMVRLYSTRCQTRR FT WPVCIFYNVLDICLLNSWIIYKSVNKSRISRRDFQMSLVKHLCSQTSLQLH FT THIIVPNVRKRCHCFVSQCPNLSLMLCMKCNKSVCGNHKNGQKYVYTICAS FT CSSSHT" XX SQ Sequence 2465 BP; 746 A; 462 C; 476 G; 781 T; 0 other; cactagaagc acggatgtgt caaaattgac acatggcaaa cttaatatct aattttcttc 60 ataactataa atcatttcac aatgaatttg tatgacaatt atttatttca tgcgctcatt 120 cagatgaata tgaaatagtt catttattct tattgatata agcaatataa caattctacc 180 tataactacg gatgtgtcag attcgacaca acggttaaat tgaagaaaat tgagcgtctc 240 tactatgtga atgcgtgtgt gtagtattga attagacaca gtacgtatga aataattgtt 300 attggctgct gtgaggagtt agaacgtctc tactatctga atgcgtgtgt gttatattgc 360 attagacgca gtacatatga aataattgtt attggctgct gtgtgaagtt tgccaaagac 420 taatagtgtg acaaggtcat agacatgctt cacatgaccc taatgctcac attctttttg 480 ttagttcact atctttatca cattatcgcc ttcggattgt tcctattatc atcacgatta 540 ctgttagtcc tttattcgtc acatgtctcc acctccacga cgtcaagcag ccagcaggca 600 ctgaaaattc taatgaacat caactgcgat gaatctggtg aagaaaattc cgatgaggaa 660 atgtcgaatg aatcaagcga cgattttcca cttcccttac cacttgctca acgtattgga 720 aacaagccat gcagctctac ttctgtgaca caacaaaagt cagatatgac tgatcgtgca 780 attgggaaag atggcacaga atggcgaaca tttagtcaac tgcgttctgg cggaagaaag 840 acaattcaaa atgtgcgacg tgtcaatggg ggtccaacta cttatgcaat tcgccgagtg 900 gatgattcac catatagttc attcaagttg ttttttgacg agccaatgct tcgccacatt 960 caaaagtgta cagaagcaga agctcgtcgc caaggtgaaa atgactggtt cgttccgttg 1020 catgaaatgg agactttcat aggtttgtgc tatgcaagag gggtgctcgc taagcatata 1080 tcaattgccg aattgtggtc taaagaatgg ggtgcgccca tttttcgtga ttctatgtct 1140 agagataggt ttacagctat catgaaattt ttacgatttg atatgaaatc agagagagct 1200 agccgattac aagaagatcg atttgcctta gcgtcttcat tatggtttcc atttattgaa 1260 aactgcaaga aagcttttgt cccaaacgag aatctcacca ttgatgagca gctttatcca 1320 tctaaatgtc ggtgtccatt tattcagtac attgcttcaa aacccgacaa gtttggcatc 1380 aaattctttt tttagttgat ctccagacga agtttgtctg caacggattt ccgtatttgg 1440 gcaaaaatca tagcagacca gccaaccaag cgttgccttc atacgttgtc cacactctat 1500 tacaaggata tgaacgaaat ggtcacaacg tcacttgtga caattatttt acatcactga 1560 atttggtcga agagttacaa tgtaaaggtg tgagtatgat cggaacgatt cgtaataatc 1620 ggcgtgagtt gcccgacgtg gagtcgctca tgaagggtaa agcgttattt gattccagct 1680 tcttcatttc tgaaaataaa actattctaa cagtctataa atgcaagaaa aataaagtgg 1740 tgtccctcct cagttctacg catgacacag tgtcaataga cgcctcaagc aagaagaaat 1800 ccaacattat taatttctat aattccacaa aatgtggagt ggattgtgtt gaccaaatgg 1860 tgcgcctata ttctacgaga tgtcaaactc gaagatggcc tgtgtgtata ttttataatg 1920 tgctcgatat atgtttgctc aattcttgga tcatttataa atcggtgaat aagagccgaa 1980 tttcaaggcg tgattttcaa atgtcgcttg taaaacatct ttgctcacaa acttcattgc 2040 aactgcacac ccatattatt gttcccaatg tacgaaaacg atgtcattgt ttcgtttcac 2100 aatgtcctaa cttgtctctt atgctatgta tgaagtgtaa taagagtgtt tgtggaaacc 2160 ataagaatgg acaaaagtat gtttacacta tatgtgcttc gtgttcatcg tcccacacgt 2220 aactattact tgttttgtaa tactgggtat tgtctatgcc gtcaatatat gattgttttc 2280 ttttgtaaca gaaggctaca agggatgaaa actctctgtt tttcacttta cataaatttt 2340 aattaacatg acctttgact aaaacacaca ttgtcttaaa cgcagccata tctaataaaa 2400 agagattgtg tcaaaactga cacatccgta gttataggta cattgtaatt tccgtgcttc 2460 tagtg 2465 // ID EnSpm-2N1_HM repbase; DNA; INV; 4695 BP. XX AC . XX DT 14-AUG-2008 (Rel. 13.08, Created) DT 14-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE EnSpm-type family - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-2N1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4695 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 8(8), 786-786 (2008). XX DR [1] (Consensus) XX CC The consensus is built from several copies which are ~90% CC identical to the consensus. This is a non-autonomous transposon, CC because it only contains partial coding sequences. The TSD is CC 2-bp. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1773..2387 FT /product="EnSpm-2N1_HM_1p" FT /note="transposase, fragment." FT /translation="FDERIRILTLQSLGNIRILTLLKKFLPETTKIYVYFR FT IIKMNKQEKNQYLREWRQSNQALNNILVNKEIHIKSNINEVKSCIDFKNIK FT IENYSDFYFVSDHKDTSETESSDLDSEVPMNDDNEDLNDGVLVTEIAKWIL FT KHRITRNASDELLFLILHQYGHHVDLPKCTRTILNTPRNVPSELKCGGEYV FT YLGISNSISRTLAVN" XX SQ Sequence 4695 BP; 1710 A; 629 C; 658 G; 1695 T; 3 other; cccagcaggc acacaacgtt gtgctaacgt tgtatttatg ttaaaattgc gttttacgtt 60 acttaatgtt aattcaacgt taagtcaacg taatcttttt aacgtaatat tcttacgttg 120 agctaacgtt gacacaacgt taaaaaaaca taaaagtatt aacctaacat tctaacgttg 180 agctaacata cggacaacgt tattaacgta gttctgctaa ctataaaaaa atctaccgta 240 attaattatt tttttcttaa agatggcgta tgagaatata gaaaataagt ttagttaaaa 300 gataattagc taagtttgaa ggtttgcatt gtcttttaaa agacattcga ggaattttca 360 aaaaatattt gaaaataaac agcgcacact tatttaaaac tttgaattgc tgataaaaga 420 tttaataaat gtttgattaa aaaggtttat tttagaaaaa taaatatatt attggttagc 480 ctctaaatat taccctgctt attgaataaa tatttaacag taagtattgt atttctgaat 540 aaatatttta tagcagtgtt tacttacagt tcttttcaga ttaaccataa atactaatct 600 caataccatt taaatgtgtt caaattttaa ttttttgaat tttgataata caaatatgat 660 aattaatgct gagataatta cacagaaaaa tattttcata ttattatcag tgatggagtg 720 aaacatttgt attagggtat tcattattat aatttttctt cttttaatct cattttaagc 780 tttaaagggt tgtgcaacta ttttatattg ctttaacaga atcaaaattt attgctaagt 840 ttagattttt taataataag aaggagtaat atgtcgaata ttttacttta gatgcaaggg 900 aattacaaac gaatatcaag tcggttagag cctaatgctc atgcaattaa agctgctatg 960 gcaacccttt gaatctgata gttactcact ctgagcaata taatatttta aatcaaatta 1020 actgyggaaa atgaatttca ataaactggt cactttgctt atatttcgac aaacaacgta 1080 agtatattat taatttgaaa ccggaaggtt aatttttcaa atatcttcam caagcatctg 1140 ctatgcattt tcatttatag aaagcataga gagtctttaa caaaagctca aataaatgat 1200 catattgcaa atattgaaaa ggcctctcca ttggtaaata aatgtcaaac aatatgtata 1260 gagaatgcaa gcaattgtta agcagttata tttattattg tttagtttct ttttgtatat 1320 ttatagtata tactatacct gtaatcttct tgtatattca catttttagt attaatttgt 1380 atatttattt agttatgttt tagttaacat aatgtattta acttaataat ttaatgttaa 1440 actaatgaac tttacaattg acattatttt gcaaactaaa gtttaacgtt aacatttcgt 1500 tgttgaaaaa cctgtaatac ctgcctaaat aaccatagct aataaaggct agcaaatgct 1560 cttactgtct ttccaaaaat tgttttcagc ttattttctg gcacaattat gactattaat 1620 tattactatt taataagact tttaacttta ggattttaat tatgatacaa acattcaact 1680 gtatgatttt aattatgaca attatataca aacatttaac tatatgagat taatgagcta 1740 gcaagtaaaa aacttacttt gatcaaacgt agtttgatga gaggataaga atattaactc 1800 ttcaaagttt aggaaatata agaatattaa ctcttctaaa aaagttttta cctgaaacta 1860 ctaaaattta tgtttatttc agaataataa aaatgaataa acaagagaaa aatcaatact 1920 tgagggaatg gcgccaaagt aatcaagcct taaataatat acttgtaaat aaagaaattc 1980 acatcaaatc caatattaat gaagtcaaga gttgcattga tttcaaaaac attaaaattg 2040 agaactatag tgacttttat tttgtgtctg atcataaaga tacttctgaa actgagagtt 2100 ctgatctaga ctctgaggtt cctatgaatg atgataatga agatcttaat gatggagtac 2160 ttgttacaga gattgccaaa tggattttaa aacacagaat aactagaaat gcaagtgatg 2220 aactgttatt tttaatttta caccagtatg gtcatcatgt tgatttacca aaatgcacaa 2280 ggacaatttt aaatactcct aggaatgttc caagtgagtt aaaatgtgga ggtgaatatg 2340 tttatcttgg tatttcaaac agcatatcaa gaactttagc agtcaactag cattatagat 2400 ttgagtcaaa tacaattttt ytagttgtca atgtagatgg ccttccctta ttcaagtcat 2460 caaacacaca ggcctatatt atttatgttt gacaaaggga caccatttat tgtatcatta 2520 tttgttggtg ataacaagcc aaataatcgt gatgattttt atatgatttt ttgcatgagt 2580 atgagttatt attggcaaat ggttttcagt ataattcata taacttcaat tacatcttga 2640 gaaaaattgt tgcaaatgtg ccaataaaac gtcaggaagt gtaagcgtaa ccattttcaa 2700 aagattgagt cccttgaaga attttatgaa cttgaaacca gattaaaaga caaccgggag 2760 tactgtaaat tggtaattta tatttataat ttctattttt aaagtttgtc tgtttgatag 2820 aatgctattt aatttcatca ttttcttgat gaacctattt aaattgctct cacattttac 2880 cagaatatgg tgtaaaaggt aaattcatac aatacataaa tcgagaatta ttttagtgac 2940 ctgatagcca tttacattct gtagagtttt ttcacagtct tgtatttttt gttaaataat 3000 tttgtcatta tatttggttg agcaactatt ttgttgcttt tatagaacaa ctgtgacaaa 3060 acaaccaaat gaatctaatc ctgtcttatt cgaacatgaa cttttcacta gacttttatt 3120 tatgttttag ctttgtcacc tcaagaaagc aggtgtgtta tggtcaggta tgtatttttg 3180 tttaactctg gataatcaaa ccgtttaaat tgtgagtaaa tttaatagaa aaatctagcc 3240 caaggaaagt aatttagaaa aacaaacaat tttaatatta atatttgggg agaaccttaa 3300 tgaatggaac gctgattgaa tgggacaaaa tcaaatttgc atacaagcat ctaaggttaa 3360 ttggtaaata aactcctgtg gtaagaaatg ttaaacacaa caattcaaca gaccaaatca 3420 ttattcctat catggcctaa acatgatggg ccaaacataa cattaaaact gttgcattaa 3480 tgggaattaa tagcacattt ctcctggcgg aacaaacaat tttcaatgta taaattagac 3540 aactttatca acttttaaaa agtcttttac gaaaactgcc ccattcattt aggttctccc 3600 ctacaggttt cagctttaat tcaatgtcga aactttgaat ataaaacatt tataaaaaag 3660 taaagaacct ttcaaaatta tactttgtga attattactc taatgtgtgt ttacaattaa 3720 ttaaataagt tcgctggatc gattgatgta gtataacaaa gtaagaatta tggagcatca 3780 tgttaagccc aagttgaata tgaaaagcaa aattcgtggt gatataaaga aggtatcatt 3840 cgaagataaa aagttgtgta aagcaataat tttggtacgt taataatagc tcaatattgt 3900 tttatctatc ccttcttcct tacatatgtt ttttattctt agatatatat tagtcttgaa 3960 ccttgaatgt tagcgttgaa aatcaaacaa atagtcaatt ataaacaaaa taaattttta 4020 actgttatcc cgtttacagt atcccatgtt gtattttaga agtcatccag ctaatgcacg 4080 agagttcgac ggaagcaaac aaaagatcaa ataggaagca ttttaaagtt ctctccattg 4140 aacaattatt tatttttgag tttttttgta aatttataat acaaaagaat atataataca 4200 tctttaaaaa aactctttaa aaacatttat tgtattttta gtttgaatat ccccaaaact 4260 atttttctaa ccctttgtaa ggggttaaaa acccttatat tatctttgaa tattgattgt 4320 aattgtgttt aacatttaat gttaatatat acggttatat tcatataata accatgacac 4380 gaagccgcgt ataaagggct tcggctcatg gttatatgaa tataactata gctatattaa 4440 cattaaatat tacacaatta caatcaatat tttacgttaa tataaaacgc caatgttttt 4500 ccaacttgta tttacattag tataacgtta atctaacttt agaattttac gttaatataa 4560 cgttaatatc taacgtaaat ctaacgtaag aattttacgt tacaaaacct ttccattttg 4620 ccgaaccggc aacgttgtac taacataaaa tctaacgtaa ataaaacgtt aatacaacgt 4680 tgtgttcccg ctggg 4695 // ID BEL-126_AA-I repbase; DNA; INV; 5335 BP. XX AC AAGE02017920; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-126_AA_; KW BEL-126_AA-LTR; BEL-126_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5335 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017920; Positions 7688 13022. XX CC 'CTGAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 939..5335 FT /product="BEL-126_AA-I_1p" FT /translation="MQTSSNTIRNRWTKSDGFINVHSDGDYNKPEVNEPTV FT HGYSSTESPECFVCRSNCQSVAKCKRFIELTLDAKRDVVRETKLCRKCLRK FT HNGSCRQQKPCGIRGCTFLHHPLLHNEKPHTVNEHVGTSTDGIHSCNIHMT FT QTNEILFRIVPVLIHGPSKVIRTYAFIDDGSELTLMEQSLADELGARGPKS FT SLCLKWTGGTNRVEDESQRVDVQISGLGIESKQHQMINVQTIQELQLRPQT FT MVLSELQERFAYLKGLPVESYTNACPRILIGLDNAFLGHVLKSREGKPHEP FT IATKTQLGWIIYGSCYQRKRSSNYVNVHTIKLCECNSGTDENLHQAMKQYF FT ALDSLGILNPDRIIRSSEEQRAVELLEKHTTPKAGRYESGLLWKYDNVRLP FT NSKDMAYKRWLCLDRRMNRDIQFAEIVQHKMADYVSNGYVRKLTPAELEVS FT QNREWYLPVFPVINPNKPGKIRLVWDAAASVYGTSLNSLLLKGPDLLTSLL FT TVLVQFREFRIGICGDIREMYLQILLKQEVRLRFFWKDNKHDLHPQVFEML FT VLPFGVSCAPSIAQHVKNVNAKRFEHEHPTAVRAIVDQHYVDDMLASVETE FT EEAVSLVNDVKRIHAAAGFEMRNWISNCSTIMEALDEPRTEVKNLNLAEAG FT VTEKVLGLWWNTTTDCFTFKVSSRYDPELISGMRKPTKREVLRTLMMIFDP FT LGLIGHLLMFLKVVLQEIWRTSVSWDDPIEEAQFERWLQWLKVLPEVETVE FT VPRCYRSTVSLESSTEVQMHIFVDASENGFAAVVYLRFQQGDIIETALVGS FT KTRVAPLKFLSIPRSELQACVIGARFANTLQKSLTVKVNRKFFWTDSSDVI FT SWLNCDHRRYSQFVAFRVSEILDTTEVHEWRWIRTQSNVADEGTKWKRIPD FT MKKTGRWFTGPAFLKKPICEWPIEVAAPKATSEELRPHLLVHVVIPDPIID FT PRNFSQWNPLVRQVAYVFRFICNIRTSHPEQRATGLLTKSELARAERFLYR FT FAQCSSYADEIAVLSTKRVTNECTMSIPKSSSIYQLCPFLDEHDVMRMRGR FT TAACPYIDKEAINPIILPRDNQVTKLIVDHYHRMYHHQNHTTVLNELRQRY FT VISRLKATLNKIRRDCQQCKNDRVRPLAPIMSDLPLARLAAFSRPFTHMGV FT DYFGPLLVSTGRRSEKRWVLLATCLTTRAIHLQIVHSMTTNSCIMAIRNVM FT ARRGTPAVIYSDRGTNFQGACSELETAMANLDNDKLAKEFTTSRTSWKFIP FT PASPHMGGAWERLVRTVKQNLDRIKPSESMSYEVLENMLTEVENIINSRPL FT TSIPIDSDLSPVLTPNHFLLGSSNGMRSCVPLDDRPTTLRSSSQYSQVLAN FT RFWKLWLQDYLPSITRRTKWFSESQPIKVNDIVIIVDPKYARNCWPKGRVI FT GVKKSTDGQVRSATVQTSSGIYERPTVKLAVLDVGVETNAHLEDQERIPGG FT A" XX SQ Sequence 5335 BP; 1616 A; 1168 C; 1234 G; 1317 T; 0 other; tcttacaaaa ttcttgtaac ggtcaactaa gcattaaacc acaactaccg tcatgagtac 60 caccagcaaa caatcaggaa agggaacgga aaatgagact actgatgcag ctgtagttac 120 cggaatcaac cgccttacta tcgtggatat agcgaaaagc ggagcgatgg atgcagagaa 180 tcaggtaact ggcgcaaaat gcgcagcagc tgaagtgctt acaaaaacga acctaaattc 240 cgagatcaat gaagctagcg ataacactaa agcggtacta gttacgggaa ccaaaccgaa 300 gcatgtgctt ccaagtgctt tggcaacgac cgatcaagag cagctactgg caggtccagc 360 cagtgcgagg aatatgtctg tcttcttaac cgcagccagt tggcggcacg gcaggtcgtg 420 cctaaagact tgccagagtt taatggaaat tctgaagact ggccgttgtt ttattcgact 480 tataattcgt caacgcatat gtgtggttat tccagtgaag agaacatgct tcgtttgcga 540 aaatgcctga aaggcaaagc attagaagcc gtaagatgtc gacttctcca tccatctaac 600 gtcggcggag taatgtcgac gctgaaaatg ctgtatggta gacccgaagc tattgtgaac 660 gcatcgataa agaaaattag agcattacca tcaccgcaag tggaaaaact ggactccttg 720 atcaatttcg ctttaactat tgagaatctg gtggcaacta tcgaagcatg tggagtccaa 780 gattttgtat ataatgcttc tttaaagttc gaacttatcg atcggctccc cccagcattg 840 aaattagatt gggcgaagca ctccaggaat aatccagccc caaaccttac agatttagtg 900 cctggttata tgctatcgca gaagatgcta gtaccgtaat gcaaacgtcc agtaacacga 960 tcaggaatcg ctggacaaag agtgatggat ttataaacgt tcattccgat ggtgactaca 1020 acaaaccaga agtgaatgaa ccaactgtgc acggatattc ctcgactgaa tcgccggaat 1080 gtttcgtttg tagaagcaac tgccaatcag tagccaaatg caaacggttc atagagctta 1140 cattggatgc taagagggac gtagtaagag aaacaaagct atgccgaaaa tgcttacgca 1200 agcataacgg atcatgtcgt cagcaaaaac catgtggtat taggggctgt acttttctcc 1260 atcatccgtt attacacaac gagaagccgc ataccgttaa cgaacacgtc ggaacatcca 1320 cagatggtat tcatagctgc aacattcaca tgacccaaac caatgagata ctattccgaa 1380 ttgtgccggt tttgatccat ggtccttcga aggtaatacg aacctacgcg ttcattgatg 1440 acggttcaga attgactcta atggaacaaa gtttagcgga tgaattagga gcacggggtc 1500 caaaatcgtc gctatgtctt aaatggacag gaggaaccaa cagggtggaa gacgagtctc 1560 aaagggttga cgtgcaaata tctggactag gaattgagtc caaacaacac caaatgataa 1620 atgtgcaaac tattcaggag ttacaactgc gtccacaaac tatggtactt tctgagctac 1680 aagaaagatt cgcttacctt aaaggattgc ctgtagaatc atacactaat gcctgtcccc 1740 gaatactaat tgggttagac aacgcattcc tcgggcatgt gctgaaaagt cgcgaaggaa 1800 agccacacga gccaatcgcc actaaaacgc aacttggatg gataatctac ggcagttgtt 1860 atcaacggaa acggagttcg aattacgtaa atgttcatac tataaagtta tgtgagtgca 1920 attcaggaac cgatgaaaac ctgcatcaag caatgaaaca atattttgcc ctcgatagtc 1980 taggtatctt aaacccagac aggattattc gatcgtcaga agaacagcgg gccgtagaac 2040 ttttggaaaa gcatacaact ccaaaggcag ggcgatatga atcggggcta ctttggaagt 2100 atgacaatgt acgtctccca aacagtaaag acatggcata caagcgatgg ctatgtctcg 2160 atagacgtat gaatcgggac atacagtttg cggaaatcgt ccagcacaaa atggcggatt 2220 atgtgtctaa cggatacgtg cgaaaattga caccggcgga actcgaagta agtcagaatc 2280 gagagtggta cttacccgtc tttccagtca ttaatccaaa caagccgggc aagataaggc 2340 tggtatggga tgcggcagcc agcgtatatg gaacatctct caattccctt ttattgaagg 2400 gtccagatct tctaacgtcg ctcctgacgg tactcgtcca gttccgtgaa ttccgcattg 2460 ggatatgcgg agacattcga gaaatgtatt tgcagatatt gcttaaacaa gaggtacgat 2520 tacgtttttt ctggaaagac aataagcacg atctacatcc acaagtgttc gaaatgttgg 2580 tcttgccatt tggagtctcc tgtgctccga gtatagcaca acatgtgaag aatgtaaacg 2640 ccaagcgatt cgaacatgaa caccctactg cagttcgggc tatcgttgat caacactacg 2700 ttgatgatat gctcgcaagt gtcgaaacag aggaagaagc cgtgagttta gtaaacgatg 2760 tgaaacgaat tcatgcagca gccggtttcg aaatgcggaa ttggatttcg aactgttcaa 2820 ctattatgga agcgttagac gaaccaagaa cagaagtgaa gaacctcaac ttggctgaag 2880 caggcgtcac cgaaaaagtc cttgggttat ggtggaatac aaccaccgac tgtttcacct 2940 tcaaagtatc ttcgcgatac gatcctgaat tgatatcagg gatgcggaag ccaaccaaac 3000 gagaggtttt gcgaacccta atgatgattt ttgacccttt aggtctcata ggacaccttc 3060 tgatgttcct gaaggtagtt ctacaagaaa tatggagaac atctgtctcc tgggatgatc 3120 ctattgagga agctcaattt gagaggtggc tgcagtggtt gaaggttctg ccagaggtag 3180 aaacagtgga agttccacga tgctatagat caacggtatc gttggaaagt tctaccgagg 3240 tacaaatgca tatattcgtt gatgcgagtg aaaatgggtt tgcagccgtc gtatacctta 3300 ggtttcaaca gggtgatata atcgaaactg cattggttgg atctaaaaca agagttgctc 3360 cactgaaatt cttatcgata ccacgttctg aacttcaagc ttgcgtcatc ggagcccgat 3420 tcgccaatac tttacagaaa tctcttactg taaaggtcaa tagaaaattt ttctggacag 3480 actccagtga tgtcattagt tggctcaatt gtgaccatcg ccgatacagc cagtttgtgg 3540 cgttccgagt cagcgagatt ctcgatacca ccgaagttca cgaatggaga tggatacgga 3600 cgcaatcaaa tgtagcagat gaagggacaa agtggaaacg aattcctgac atgaaaaaaa 3660 cgggcaggtg gttcacaggc cctgcatttt tgaagaaacc aatttgtgaa tggccaatag 3720 aagtcgcagc tcctaaagca accagtgaag aacttcgccc acacttgctc gtccatgtag 3780 taattccgga ccctattatc gacccacgaa acttttctca gtggaaccca ttagtacgcc 3840 aagttgctta cgtgtttcgc tttatttgca acatcagaac ttctcaccct gagcagcgag 3900 ctaccgggtt gctaacgaaa tccgagttag ctcgtgcgga aagatttttg taccgttttg 3960 cacaatgcag ttcatatgcg gatgaaatcg cagtgctttc aaccaagcgc gttacgaatg 4020 aatgtacaat gtcaattccg aaaagcagtt ctatttatca gctttgtccc ttccttgatg 4080 aacatgatgt catgcggatg aggggcagaa cggctgcttg tccgtatatt gataaggaag 4140 ctattaaccc aatcattctc ccaagagata atcaagtcac caaactgata gtggatcatt 4200 atcatcgcat gtatcaccac cagaaccata ccactgttct caacgaattg cggcagcgat 4260 acgtcatatc acgcttgaaa gcaactctga acaagattcg ccgagattgc caacagtgta 4320 aaaatgatcg tgtacgtcca cttgcaccca taatgagtga cttaccttta gcccgccttg 4380 ctgccttttc ccgaccattt acgcatatgg gagtagacta ctttgggcct ttattagtga 4440 gtacagggcg cagatctgaa aaacgctggg ttcttctagc cacctgtctg actacgcgtg 4500 ccattcattt acaaattgta cactccatga ctactaattc atgtataatg gccatcagaa 4560 acgttatggc cagaagaggt acgcctgctg ttatatacag tgacagaggt acaaactttc 4620 aaggagcttg cagcgaactg gaaacggcaa tggcgaactt ggataatgat aaattggcaa 4680 aggagttcac cacatcgcga acatcctgga agtttatccc tcccgcgtca cctcatatgg 4740 gtggagcgtg ggagcgcctg gtgagaacag ttaaacaaaa cctcgacagg attaaaccta 4800 gcgaaagcat gtcttacgaa gtcctcgaga acatgctaac tgaagtagag aatattatca 4860 attctcggcc tttaacttca attcctattg acagtgatct gtccccggtg ttaacaccca 4920 accattttct gttgggatca agcaatggaa tgagatcatg cgtcccatta gacgatcgac 4980 caacaacact gagatctagt tcgcagtact ctcaagtatt ggcgaaccgt ttctggaagc 5040 tttggctaca ggactacctt ccatctatta ctcgccgcac aaaatggttt tccgagtctc 5100 aaccgataaa agtcaacgat attgttataa tagtggatcc aaagtacgcc aggaactgct 5160 ggcctaaggg ccgggtaatt ggtgtcaaga agagtacaga tggacaagtt cggagtgcta 5220 cagtgcagac atccagcggt atatacgaac ggccaacagt taagctcgcc gtgcttgacg 5280 taggcgttga aacgaatgct catctggaag accaggagcg cattccggga ggcgc 5335 // ID Gypsy-175_AA-I repbase; DNA; INV; 5100 BP. XX AC AAGE02026183; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-175_AA_; KW Gypsy-175_AA-LTR; Gypsy-175_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5100 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026183; Positions 52974 58073. XX CC Positions [2325-2825] - Reverse transcriptase CC Positions [3960-4430] - Integrase core CC 'CCATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 825..4952 FT /product="Gypsy-175_AA-I_1p" FT /translation="MTTANLVGVIEPYVVGNSFTEYAERFDEFFEFNQIDD FT NRKRSTFCTLSGPAIYSEIKLLFPGKKVKELTFTEIIQKLKSRYDKVDSDV FT IQCFKFNTRVQRADETTESFILDLKLQASLCDFGTYKDKAIRDRILVGVYD FT KNLQKQLLKEEKLTLADAERIITNYELANTRAAAINTESQQVASVRQRLGR FT RATFDQNRNNNYYNRRDRSRSRSTSFQRRVRFSEDGRRNDQYANYYCNGCG FT MKGHIKRQCRNTGGLNTRVHNASKVQSVVVPSAGVEYSLKQAMNRLKMQMS FT DDSGDDSSGEYMCMNVTSINKVSEPCLVKVTIEEKSVLMEIDCGSAVSVMG FT IRTFIELFNTPLRKSNRKLVVVDGGDLKVFGETKVLVCFQNRTEYVSLVVI FT DNSESGCQHRFTPLLGREWLDVFLPEWRYTFQKPNNVYNITTNDQASCMSD FT IENKFANVFKKDFSSPIVGYEAELVLKDEQPIFKRAYEVPYRLRERVCEYL FT DKLENENVITPIKTSEWASPVIVLIKKNNQIRLVVDCKVSLNKVIIPNTYP FT LPVAQDLFARLSGCKVFCALDLEGAYTQLSLSKKSRKFMVINTIKGLYTYN FT RLPQGASSSAAIFQQIMEQILLGIENVTVYLDDVLIAGKDLEDCKKKLQIV FT LERLSAANVKVNWEKCKFFVSNLPYLGHIVTDKGLLPSPDKLEIIRAAKVP FT SNTTELKAYLGLVNYYGKFVPHLSTKLSELYNLLRKNVNYVWTNACQRAFE FT NSKHELVNADLLEYFDPKKPLVVVSDACNYGLGGVLAHVIDEVEKPICFTS FT FSLNEAQKKYPILHLEALALVCSIKKFHKFVYGQKFVVYTDHKPLVGIFGK FT AGGNSIYVTRLQRYVLELSIYDFEIVYRPSTKMGNADFCSRFPLQQNVPLE FT CDVEYVKSLNLTHELPINFSKIACETANDDFLVKLMSFLQNGWPEKIDKTF FT LNIHTNQHDLEIVEGCVMFQDRVVIPFKLRKQILSILHVNHSGIIKMKQLA FT RRTVYWPGVNTDIENFVKDCDTCTKMSVVPKPSSTTEWIPTTRPFSRLHAD FT FFNFEQKVFLLIVDSHSKWLEIEWMRNGTDASRVIRRFTKMFSCFGLPDVV FT VTDGGPPFNSTLFVSFLEKQGIKVLKSPPYNPSSNGQAERMVRLVKDVFKK FT FLLEPEVRNWNTEDQIDYFLINYRNTCVTEDGSFPSEKIYSYKPKMLIDLI FT NPKKHYSRQLVESPSSYKDHQGTLTTSTQTLVKNDPFDSLMSGDKVWLKNN FT HVNRVEKWIEAKFVRRLSINVFQVAIGNVTSNAHRNQLKIPRIKSPTMNVM FT VPVTSHKRSRVVTVAEEDTPTTSASFQTNEPNSKPIIDPAVILPTLRRSTR FT TKKQKINDDFIYS" XX SQ Sequence 5100 BP; 1742 A; 841 C; 1032 G; 1485 T; 0 other; actgtcgacg aggtgaagga ttttttttac gtcgcgtgtg atataactaa acaaattgaa 60 aagttagtaa aatatttatg tggtgaatcg aacgctttaa aataaaagca taattccact 120 caaatcacca tttatgtggt aaaaaaaaag aaaaaaggct tctgctaaga caagtgaagt 180 gaaaagccat tttgaagagg caattattac aattgtgaca aaaggctatt cccacgatag 240 ataatagaag ttcattgacc tcacaactga gggtgaataa ttgatctaaa ggtggtttaa 300 ttaagcaaga taaataaatt caacaacaaa gtgatagcta tattacttta ttgcacctga 360 ataccaagta tgatatatgc gttacatgtg atcggctgag ctataggcaa gagcgctgat 420 agtaaaaaaa aaataaataa ataatacgag cagttccatt tagtgagctc cagctaagat 480 ttgagtatac ggaggtgcat gatttcccgt attgagtgat aaaaaaatag aaaataaaat 540 ttagtagtta ccgaagttca tatccagaga ttcttcagta cagtgagtgt caacgagcag 600 ccgagagagc gagcgtaaca accgacaaca gagtcttctc ttagagtcat agataaacta 660 caaccacaac agcacataca ctggtgagag agctgatttg tctcagctga acaagtgtga 720 gaggtaaaac aacaaacaat cggtgagtcg tcgtattact accgttcttc tattgatttt 780 acttttattt tgattcctca ttcgtatcca tccgtcttgt agatatgact actgcaaacc 840 tggtcggtgt tattgaacca tatgttgttg gtaattcctt tacagaatat gccgaacgtt 900 ttgatgagtt ttttgaattt aatcaaattg atgataatag aaaaagatct actttttgta 960 cccttagtgg tccggcaatt tactcggaaa tcaaactttt gtttccgggt aaaaaagtaa 1020 aagaattaac ctttactgaa ataattcaga agttgaaatc acgttacgat aaagtagatt 1080 cggatgtgat tcagtgtttt aaattcaaca cgcgggttca aagggcagat gaaaccactg 1140 aaagtttcat tcttgacctt aaactgcaag catctttgtg tgattttggc acttacaaag 1200 ataaagcaat aagagatcga attttagtag gcgtttacga taaaaattta caaaagcaac 1260 ttcttaaaga ggaaaaactt acactagcag atgcagaacg tataattacg aattatgaac 1320 ttgccaatac acgtgcggct gctatcaaca ccgaaagtca gcaagtagct tctgttaggc 1380 agcgacttgg tcgaagggcg acctttgatc aaaatagaaa caataattat tacaaccgaa 1440 gagatcgaag tcgcagcaga agcacgagct ttcaaagaag agtcaggttt agtgaagatg 1500 gaagaaggaa tgatcaatac gccaattatt attgtaatgg ttgcggtatg aaaggtcata 1560 ttaagcgcca atgccgcaac actggaggct taaatacaag agtacacaat gcaagcaagg 1620 ttcaaagcgt tgttgttcct tctgcaggtg ttgagtatag cctgaaacaa gcgatgaatc 1680 gattgaagat gcagatgagt gacgattctg gtgatgattc ctcaggtgaa tatatgtgta 1740 tgaatgtaac gtctattaac aaagtaagtg aaccttgcct ggtaaaagtt actatagagg 1800 aaaaatcagt gcttatggaa attgattgtg ggtctgctgt ctctgtcatg ggaattagaa 1860 ctttcattga attattcaat acacctctcc gcaagagtaa cagaaaactt gtcgttgtag 1920 atggaggaga cttaaaagtt tttggagaaa ctaaagttct tgtgtgtttt cagaacagaa 1980 cagaatacgt atctctggtt gttattgata atagtgaatc tggctgtcaa catcgtttta 2040 ctcccttatt aggtcgagag tggttagatg tttttttgcc agaatggaga tatacgttcc 2100 aaaaaccaaa taatgtatac aatattacaa ccaacgacca agcttcctgt atgagcgata 2160 ttgaaaacaa atttgctaat gtttttaaaa aagatttttc atccccgatt gttggttatg 2220 aggctgagct ggtactaaaa gacgagcaac caatattcaa acgtgcgtat gaagtaccat 2280 atagattacg agaaagagtt tgcgaatatt tagataaatt agaaaatgag aatgtaatca 2340 ctcctatcaa aaccagtgaa tgggcatcac ctgtaatcgt tttgattaaa aaaaataatc 2400 agatccgatt agtggtagac tgtaaagttt cacttaacaa agtgattata ccgaacacat 2460 acccattgcc agttgcacaa gatctttttg cccgcttgtc tggatgtaag gtattttgtg 2520 ccctagacct tgaaggagcc tacacacaat tatcactttc aaaaaaatct agaaaattca 2580 tggttatcaa tacaataaaa gggctttata cgtacaaccg cttgcctcag ggagcttctt 2640 cgagcgctgc aatttttcag caaataatgg agcaaatatt gttggggatt gaaaatgtaa 2700 ccgtttattt agatgatgta ttgatagctg gaaaagatct ggaagactgt aagaaaaagt 2760 tgcaaatagt tttggaaaga ctatcagcag caaatgttaa agtaaattgg gaaaagtgta 2820 agttttttgt ctccaatctc ccttatctgg gtcatatagt tacagacaaa gggttgctac 2880 ctagcccaga taaattagag ataataagag cggctaaagt tccatccaat actacagagc 2940 tgaaagcata tttgggtcta gtaaattact atggaaaatt tgtacctcat ttgtctacaa 3000 agctgagtga gttgtacaat cttttgcgaa aaaatgtaaa ttatgtgtgg actaatgctt 3060 gtcaaagagc ttttgagaat agcaaacatg aactagttaa tgccgatctt ttagaatatt 3120 ttgacccaaa gaaaccatta gtggtggttt ccgacgcttg caattatggt ttgggagggg 3180 ttcttgcaca tgtgatcgat gaagtcgaga aaccaatatg ttttacatcc ttttccttaa 3240 acgaagctca aaagaaatac ccaattttac acttagaggc tttagcactt gtatgtagta 3300 tcaaaaaatt tcataagttt gtctatggcc agaaatttgt agtctatact gaccacaaac 3360 cattggtagg aatatttggc aaagctggag ggaattctat ttatgttact cggctacaga 3420 gatacgtgtt agagctgtcg atatacgatt ttgagatagt atatcgacct tccacaaaaa 3480 tgggaaacgc tgatttttgt tcacgttttc ctttgcagca gaatgttccc ttggagtgcg 3540 acgtagaata cgtcaagagt ctcaacctta cccatgaatt acccataaat ttttcaaaaa 3600 ttgcctgcga aacagcaaat gatgactttt tggtcaaatt aatgtctttc ctgcagaatg 3660 gttggccaga gaagattgat aaaacatttt tgaatattca tacgaaccag cacgatcttg 3720 aaatagtaga aggatgtgta atgtttcagg atcgtgtagt tattccgttt aaattgagaa 3780 aacaaatact tagtatctta catgttaatc attccggtat aattaaaatg aagcaacttg 3840 cgagacgaac agtctactgg cctggagtta atacagatat agagaacttt gtgaaagatt 3900 gtgatacctg tacaaaaatg tctgtagtac caaagcctag ctcaaccaca gagtggatac 3960 caacgacgag accatttagc agattacacg cagatttctt taattttgaa caaaaagtgt 4020 tccttcttat agtggatagc cattcgaaat ggctggaaat tgaatggatg aggaatggaa 4080 cagacgcttc aagagttatt aggagattca caaaaatgtt ttcttgtttt ggactaccgg 4140 acgtggtagt taccgacggc ggtccacctt ttaattccac cttatttgtg tcgttccttg 4200 agaagcaagg tataaaagtg ttaaaaagtc caccttacaa cccatctagc aatggtcagg 4260 cagagcggat ggtgaggctt gttaaggacg ttttcaaaaa atttttgctt gaaccagagg 4320 tgcggaattg gaacacagag gatcagattg actacttttt aatcaactac cgaaacactt 4380 gtgtgacgga agatggatct tttccatccg agaaaatata cagctataaa ccaaagatgt 4440 tgatagatct tatcaatcct aaaaagcatt attcaagaca gcttgtagaa agtccctcta 4500 gttacaaaga ccaccaagga acactaacaa cttcaacgca gacattagtt aagaatgatc 4560 catttgacag tttaatgtct ggagataaag tgtggttgaa gaacaatcat gtgaatcgtg 4620 tagaaaaatg gatagaggca aaattcgtaa gacgactttc catcaatgtt ttccaggtgg 4680 caattggaaa cgtaacttca aacgcgcatc gcaaccaact gaagataccg aggattaaat 4740 cgcccaccat gaacgtaatg gtcccagtta cctcacataa gaggagccgg gtggtaacgg 4800 tggcggagga agatactcca acaacatcag cttcgtttca aactaatgaa ccgaattcga 4860 aaccaataat cgatccagct gtaattctcc ctacgctccg aagatcaacc aggacaaaaa 4920 agcaaaaaat taatgatgat tttatatatt cttgagctcg attaagaaaa tgcatgaatt 4980 atttattgtt ttgtttttag cctatttgaa ttaatttctg ttgaacttaa aatgtaaatt 5040 ataatttaaa gtatgaattt tctgtgtata gcttgtagaa taatcttaag ggaaaagaat 5100 // ID Copia-14_AA-I repbase; DNA; INV; 4169 BP. XX AC supercont1.71; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_AA_; KW Copia-14_AA-LTR; Copia-14_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.71; Positions 1307922 1312090. XX CC 'AACC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 77..4159 FT /product="Copia-14_AA-I_1p" FT /translation="MEDEKKWIFDGRNYDQWAYRMRVLLEEKGLVKCIEET FT INEEDYAELPEDSMEVKTRKKQQLEEKWKLDRKCKSLIVHRVSDSHLEYVM FT GRKTAKECWDSIQGTFQRRGLANRLYYRRRLGLLKLKSGASMQNHLLEFDK FT LLRGLKMTGAKVEEDDAITQLLLTLPEQYDGLCTALETMSFDKLSMEFVKN FT RLLEEEAKRFVKNDCLDEDGNTESAFAGKRFSFKCYNCGKLGHKRSDCRVK FT MTEDNGRKYSEASGNSRNLTNHNGPRRGKANVHCATDIAFVATCETELAAA FT VTQKTDNSFQWFLDSGASDHMVRDRSYFEELHRLPREISIAVAKSSQSLTA FT KYAGTIRTCTEVEGRIISSIVEEVLYVPGLHLNLFSISRLEDKGMSITFPG FT GKVKIVKDGQTVATGRKCGRLYQLHVLLKKEAEALLTKDDEAGLWHRRYGH FT LGEANLQKLWKNGMVSTKATLLECVKQNPCEVCLSGKQTRQPFGVAEGQRS FT TRVLELVHSDVSGPFTPGTYDSKRFFVTFIDDFSHFTVVYLLKSKDKVLEC FT FEEYNAKVAAYFGVKIAKIRCDNGGEYTSKAFRSLCRREGISLEYTAPYTP FT QQNGVAERMNRTLLDKARSMIFDAGLPKSLWGEAVLTAAYLSNRSPTSALE FT ILKTPYEVWYNKKPDIDNLRVFGAVAHTHVPKQCRGKLDPRSERNYMVGYA FT TNGYRIFDPRKKKIFVSRDVVCDENSRKPPTRSRDFNDELEEEEPQTDNEP FT VIDDQNEAKVQDENQIRQIQNNDENEQNVIEAAGVAGNVEDVVSDPRRKRV FT HAVDESFSDQPVSKRLTNPPVWMNDYHVSYSSVETPMAYALNAEAFVEEIP FT NNIAELKKRDDWDRWKEAIESELQSLKKNGTWTLHPKPKEKNIVDCKWVFR FT IKRDDNGNIERYKARLVAKGYSQKKNYDYTETYAPVARITTFRIMLSIANQ FT FKYTIHQMDVKTAFLNGNLKEVIYMYQPEGFEEGDRNMVCRLQISLYGLKQ FT APRSWNEAFNSFVLSLGFKRSNYDCCLYWKRTGGVVVYLLLYVDNILLMTN FT NLEEIQLIKNQLSERFEMTDMGEVKQFLGIKVERYENQIKISQPKYIDGML FT KRFGMEECRPTSTPMEPKPQFEHEGNIELTKQPYKELIGCLSYLMLSTRPD FT IAAAVNTFSRYQAAPTDKHWSHLKRVLRYLKGTRDYGLVYRRRADSSPLIG FT YADADWGNDGEDRRSISGFTFKIHDATVSWTTRKQNTVALSSTEAEYVSLS FT QAACEAIWLRNVLQEFGVDISEPTRIFEDNQSCIRIAEEPRDHRRMKHVDI FT RFHFIRECIQNKIIRPVYVSTKEQVADIFTKGLPAGPFAHLREKLNLFG" XX SQ Sequence 4169 BP; 1327 A; 776 C; 1085 G; 981 T; 0 other; aaccgctgcg cctaatcaat aggttatagg cccagatacg ttattcgagt ttgaggaaga 60 tttgcgaacc aaggtaatgg aggacgagaa gaagtggatt ttcgacggaa ggaattatga 120 ccagtgggcg taccggatgc gtgtgttgct ggaggagaaa ggccttgtga agtgcatcga 180 ggaaaccatt aacgaggaag attacgctga actccctgag gactctatgg aagtcaagac 240 gaggaagaag caacaactcg aggaaaaatg gaagctcgat cggaagtgca agagcctgat 300 cgtccaccgc gtctccgata gccacttgga atacgttatg ggaaggaaaa ctgcgaaaga 360 gtgctgggat agcatccaag gaactttcca acgtcgaggt cttgccaacc ggctgtacta 420 ccgtcgtaga cttggtttgc tgaagttgaa gagcggtgcg agtatgcaga atcatttgct 480 ggagttcgac aagctactgc gaggactgaa gatgaccggt gccaaggttg aagaagatga 540 tgcgattacc caacttcttc ttaccctgcc agaacaatac gatgggctgt gtacggcgct 600 ggaaacgatg tcattcgata agttgtccat ggagtttgtc aagaaccgtt tgttggaaga 660 agaagctaag cgtttcgtaa agaacgattg tctcgatgaa gacggaaata ccgaatcggc 720 gtttgctgga aaaaggttct cgttcaagtg ctacaactgt ggaaagttag ggcacaagcg 780 ctcggactgt cgagttaaga tgacggaaga caacggtaga aagtattcag aagctagcgg 840 aaactcacga aacttgacga atcacaacgg accacgtaga ggtaaagcga acgtgcactg 900 tgctacggat attgcgtttg tggcgacctg tgaaacggag ctggccgcag cagtcacaca 960 gaagaccgac aattcgttcc agtggtttct ggattcagga gcatcggacc acatggtgag 1020 agatagaagt tatttcgagg agctacatcg tttgccaagg gagatttcga ttgctgttgc 1080 gaagagtagc caatcgctga ccgccaaata tgctggaact atccggacct gtaccgaggt 1140 ggaaggaaga ataatatcat ccattgttga agaagtgctg tatgtccctg ggttacacct 1200 aaatctattc tccatcagtc gtcttgaaga taaaggtatg tcaatcactt ttccgggcgg 1260 gaaagtgaag atagtcaaag atggtcaaac tgttgcaacg ggacggaaat gtggaaggct 1320 ctaccaatta cacgttttat tgaagaagga agctgaggcg ttattgacga aagatgatga 1380 agctggtctg tggcatcgac gatatggtca tcttggagaa gcgaatttgc aaaagctgtg 1440 gaaaaacgga atggtgagta cgaaagctac gttactggaa tgtgtgaagc aaaacccgtg 1500 cgaagtatgt ctgagcggaa aacaaacccg acaaccgttt ggagttgcag aaggacaacg 1560 ttcgactagg gtactggagc tggtacatag tgacgtcagt ggaccgttca cgcctggaac 1620 gtacgatagc aagagattct ttgtaacatt cattgatgac ttcagccact ttactgttgt 1680 gtacctgctg aaatcaaaag ataaagtttt agaatgcttt gaagaatata acgcaaaggt 1740 agcagcttat tttggtgtga agattgcaaa aattcgatgc gacaatggag gagaatatac 1800 tagcaaagcg tttcgatccc tatgccgtag agaaggaata tcgttggaat atacagctcc 1860 gtatacacct caacagaatg gagtggcgga gcgaatgaat aggaccctct tggataaagc 1920 tcgttcaatg atatttgatg ctggattacc gaaaagtttg tggggagaag ctgtcctaac 1980 ggctgcatat ttgtcaaatc gtagtccaac cagtgcgctg gaaattttga agacacccta 2040 cgaagtttgg tacaacaaga aacccgatat cgacaatttg agagtgtttg gtgcagttgc 2100 acacacccat gtgccgaagc aatgcagagg aaagcttgat ccaaggtctg agaggaatta 2160 catggttggc tatgcaacga acggctacag aatctttgat ccccggaaga agaaaatatt 2220 tgtgagcaga gatgttgtgt gtgatgagaa tagcaggaaa cctccgacac gttcaagaga 2280 tttcaatgat gaactggaag aagaagaacc gcaaaccgat aatgaacccg tgatagatga 2340 tcaaaatgaa gctaaagtac aagatgaaaa tcaaatacga caaatacaga ataacgatga 2400 aaatgaacag aatgttatag aagctgcagg tgtcgctgga aatgttgaag atgttgtatc 2460 tgatccgaga cggaagagag tacatgctgt tgatgaatcg ttcagtgatc aaccagtatc 2520 gaagcgattg acgaatcctc ctgtctggat gaatgactac cacgtgtcct atagcagtgt 2580 tgaaactccg atggcctatg cgttgaatgc agaagctttt gttgaagaaa taccgaataa 2640 cattgctgag ctgaaaaaga gagatgattg ggaccgatgg aaggaagcga ttgaaagcga 2700 gttgcagagt ttaaagaaaa acggaacttg gactttgcat ccgaagccaa aagagaaaaa 2760 catcgtggat tgtaaatggg tattccggat caaacgtgat gacaacggca acattgaaag 2820 gtataaagcg cggcttgttg ccaaaggata ttcgcagaag aagaattatg attatacaga 2880 gacttacgca ccagtggcac gaataacaac gttccggatt atgctgagta tagcgaatca 2940 attcaagtac accatacatc aaatggacgt aaaaacggcg tttctgaatg gaaatttgaa 3000 ggaagtaatt tacatgtacc agccagaggg attcgaggaa ggagatcgga acatggtgtg 3060 ccgtttgcaa atatcgcttt atgggcttaa gcaagcgcca cgaagctgga atgaggcgtt 3120 caattcgttt gttttgagct tgggattcaa acgatccaac tacgactgct gtttatattg 3180 gaagcgtact ggaggagtgg tcgtatactt gttgctgtat gtagataata ttttgctgat 3240 gacaaacaac cttgaagaaa ttcaacttat caagaatcaa ctttccgaaa gatttgaaat 3300 gacagatatg ggagaagtta agcagttcct cggaatcaaa gttgaacgat atgagaatca 3360 aatcaagatt agtcagccca aatacattga tggaatgctg aaacgtttcg gaatggaaga 3420 atgtcgacca acatcaaccc caatggaacc aaagccgcag tttgaacatg aaggaaacat 3480 agaactgacc aagcagcctt acaaggagtt gataggatgc ttatcatact tgatgctatc 3540 aaccagacca gacatcgccg ccgcagtcaa cactttcagc cgatatcaag cagcacctac 3600 agataagcac tggagccacc tgaaaagggt tttacgctac ctaaaaggga ccagagatta 3660 tggattagtg tatcgccgca gagctgattc cagtccgctg ataggatacg cagatgcaga 3720 ttggggaaac gatggtgagg acaggcgatc tatttctggc tttacattca aaatccatga 3780 tgcaaccgtt tcgtggacta caagaaaaca gaacaccgta gctctctcat cgacagaagc 3840 agagtacgtt tcattaagtc aagcagcatg tgaagcaatt tggctaagaa atgtgctaca 3900 agaatttggc gttgatattt ccgaaccgac tagaatattt gaggacaatc aatcgtgcat 3960 acgaattgct gaggagcctc gggatcatcg aagaatgaag catgtggata ttcgattcca 4020 ctttattcga gaatgcatcc agaacaagat tattcggcca gtttatgtat ctacaaaaga 4080 acaagttgct gatatattta ccaaaggcct accagcagga ccatttgcac atcttcggga 4140 gaagctgaat ctcttcggtt gagcggggg 4169 // ID RTE-5_BF repbase; DNA; INV; 1551 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-5_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1551 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1551 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1703-1703 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 2..1537 FT /product="RTE-5_BF_2p" FT /translation="NLASEVEEAIRTLKPGKSPGVDNIPGELIKHGGPTMV FT KVLSALCQKIWEKKEWPKEWTQSLIIPLPKKGNLRACQNYRTISLISHLSK FT VMLRVILNRLKPKVEEILTEEQAGFRAKRSTTEQIFNIRLLVEKHIDHQRE FT IRHNFIDFKKAFDRVWHEGLWKVMRDYNIDESLVAIIKSLYDNATSAVLHN FT NQLGEFFRTTVGVRQGCLLSPALFNVFLENIMQEALSGFESSISIGGRQLS FT NLRFADDIDLMAGSEDEHQELSTRVENSSAAYGMEVSTSKSKGMINSTLSD FT ARTSVTMAGEVLEDVDFFKYLGSILNRDGTSTQEIKVRIAQASAALAKLSP FT ILQNKNISLPVKIQLYRTLIVSIFLYGCESWTTTAETERRIEAFEMKCLRK FT ILGISYLQRQTNRSVRHQAEAACGKQESLLSTVKRRKLNWFGHICRHNSLA FT KTILQGTIAGGRKRGRPRKTWLDNIKTWTGLSTEQLIRGAEDREKWARITT FT TGCRGAPTTTEVTGRE" XX SQ Sequence 1551 BP; 489 A; 380 C; 372 G; 310 T; 0 other; caatcttgca agtgaggttg aagaagcaat tcgcaccttg aaaccaggga aatcgccagg 60 ggttgataat atcccgggag aactcataaa acatggcgga ccgaccatgg taaaggtgtt 120 aagcgctctc tgccaaaaga tttgggagaa gaaagaatgg ccaaaggagt ggacccaatc 180 cctgatcatt ccactaccta aaaaaggaaa cctgcgagcc tgtcagaact accgcaccat 240 tagcttaatc agtcatctga gcaaggtcat gctacgtgtc atcctcaata gactcaaacc 300 caaagtagag gagatactca cagaagaaca ggcaggcttt agggccaaaa ggagcacaac 360 tgaacaaatc tttaacatca ggttgctggt agaaaaacac atagaccacc aaagggagat 420 acggcacaat ttcatagact tcaaaaaggc gttcgatcgc gtgtggcacg agggcctctg 480 gaaggtcatg agagactaca acatagatga gtcactagtg gccatcatca agtcacttta 540 tgacaacgca accagtgcgg tcctccacaa caaccaactt ggtgagttct tcagaaccac 600 agttggtgtt cgacaaggat gtctcctctc cccggctctt ttcaatgttt tcctagagaa 660 catcatgcaa gaagccctgt caggttttga gtcttccatc tcaattgggg gaagacaact 720 ttctaacttg cgttttgctg atgacataga cttgatggcc ggatctgagg atgaacatca 780 agagttgtca acccgagtgg aaaacagcag cgctgcatat ggcatggaag ttagcacatc 840 taaaagtaaa ggtatgatta acagcacact atctgatgcc agaacttcag tgaccatggc 900 tggtgaagtc ctcgaagacg tggacttttt caaatacctt ggctcgatcc ttaacaggga 960 tggcacctcc acgcaagaaa tcaaagtaag aattgcacaa gcatcagcag cactcgctaa 1020 actcagcccc atcctccaga acaagaacat cagtctgcca gtcaagatcc aactatacag 1080 aacacttata gtctccatct tcctctatgg ctgcgaaagc tggaccacca cggcagaaac 1140 agagcgccgc atagaagcct tcgagatgaa atgcctgcga aaaatcctcg gcatttccta 1200 cctgcaacgc cagacaaaca ggtctgtacg ccatcaagca gaggcagcat gtggcaagca 1260 agaatcactc ctgtcgacag tcaagcgcag gaagctgaac tggttcggcc acatctgccg 1320 acacaactcc ctcgctaaaa ccatactaca aggcacaata gcaggaggga ggaaacgagg 1380 taggccaagg aagacatggc tggacaatat taagacatgg actggcctaa gcactgaaca 1440 gctcatcaga ggggcagagg acagagagaa atgggcacga atcactacta ctggatgtcg 1500 gggagcccct acgaccactg aggttacggg ccgtgagtga gtgagtgagt g 1551 // ID DNA8-12_AP repbase; DNA; INV; 208 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-12_AP. XX NM DNA8-12_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-208 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1754-1754 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 208 BP; 60 A; 54 C; 46 G; 47 T; 1 other; cagtggcgtc atttggggtg gggcaactgg ggcatntgcc ccagtctagt atttacatca 60 accttaagtc ggcacattta ccacacgccc ccccccaccc gttaaccact atggcactat 120 gctacgagaa tctgagtgaa aaaaattttg agtcggaaaa gccgtgaaaa attgccctag 180 tctaatgaaa actgaaatga cgccactg 208 // ID CR1-34_HM repbase; DNA; INV; 4902 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 21-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE CR1-type family - consensus. XX KW L2A; Non-LTR Retrotransposon; Transposable Element; CR1-34_HM. XX NM CR1-34_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4902 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1862-1862 (2008). XX RN [2] RP 1-4902 RA Kapitonov V.V. and Jurka J.; RT "CR1-34_HM belongs to the L2A clade of non-LTR RT retrotransposons."; RL Direct Submission to Repbase Update (21-JUL-2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, J CC Craig Venter Institute. This family was classified originally as CC a member of the CR1 clade [1]. Later, it was reclassified as a CC member of the L2A clade of non-LTR retrotransposons [2]. XX FH Key Location/Qualifiers FT CDS 19..1086 FT /product="CR1-34_HM_1p" FT /translation="MAAKFTEDLLLQESKTKNNNNANQTETISYLNDFESF FT LLDFAEPQANLRMEHSKIFMNAINNIHKRKKRADTESIYEEVLKHFENRLE FT KLHVSEFLNLLVSKNLLSCSIKNGKESYKVLIVPETKLEPDLCIINHKKVK FT TLESEIKILNYELKNLKSSEISLSILKDENLFLKEELSEXRKIIAVLIDKL FT EVKSTVTDLKSTQNLNFLNDFDFNSPLNQNSINSPKAETINAFRQQECGIK FT IIDSKSKEIKDRLNNDLTDVRRQLNIKYLNFKKYNIPELAVSSNSNNNKTM FT RNSINNFENVTNQLVSVRKNYQSDFLKYKNLSTNGVVSTSSSLSTKAVGNV FT KNTKKSNKKSKQ*" FT CDS join(1197..1619,1612..2211,2147..4576) FT /product="CR1-34_HM_2p" FT /translation="MHKLGKVKVRVFSGATIEDMHFNLVPLLKKEPSKIIL FT HVGTNNCTNDSSESALAKLKDLVSFISKSNPSCSIILSTPIRRFDNAKAQL FT TSLHLAEKLKLTKFNIIDNSNLSNEHLGLRGLHLNTRGSGRLAMNFIAYLK FT SISLFDRHSPIDSYHQSTLPSTIEEISLNSDLDNLRAKYPKNISLAFLNIN FT SIRNKFSDIITYINKTVDILILGETKLDDSFPINQFLYPGYKTPYRLDISQ FT HSGGLLVYINENIISKRLAKKDFPRDIQIITFEIKLRMRKWLIIATYKPPN FT QNCDYFFNHLKGVLDFYLQSYLDFXLIGDFNIQPSEKKNEGIPSTLSMYEX FT LIYNLQKKKMKEFLQLYQCINLIKQPTCFKSVNNPSCIDLILTNRKLSFQH FT SNTFGTGVSDHHHLIYTMLRSTFCKLPPKKLIYRSFKNFSSEIFAKILRFN FT LKSCYDISTFNEIFSNLLNIHAPLKSKILRGNNKPFITKDLRKAIMLRSSL FT YKKFLVDKSDFSALMYKKQRNLVVKLNRKNKKKYFEQINTLSXSNTSLWKL FT TKPFFSDKSTNSNERITLTENGNIISDSSLLTNIFNKYFINVALPSSMADH FT IKWSFSPCTSDPVESAIKKYVDHPSIRNIKNKFKMNAHNFKFHCITPDDIT FT KIIGSMNEKKKTSGEIPTFMLKSYLIYFRDSLTDCINNSILDGTFPSFLKM FT ADVLPCFKKNDPTDKANYRPISILPALSKVFEMIVYEQILLFMEPKFDKLL FT CGFRRGYSTQHTLIFLLNKWHECINNGGIVGTLLMDLSKAYDFIPHDLLIA FT KLEAYGFSKESLKFLISYISGRKQRVRIGPSVSNWVDILSGVPQGSILGPI FT LFNIFINDIFFFIKDTELCNFADDNTLYACDYSLEKVIFRLQKEATNTIAW FT FKFNSMVANPDKFQLLLVGLKNTKNQYLKIGNITVFASDRVKLLGVTIDKN FT LNFGEHIKYLCSKANGKCFALSRIRSFLSRNKATLLFNAFIMSNFSYCPLI FT WMFCSKYYDNLINKIHKKALCIVHKRFDLCLDELLSFDNSPRIHLRNLRFL FT MIEVFKSINCLNPQFMKDLFNTKNLPFKLRCSKKMILPSNGSKYKDQCCTI FT FRAISLWNTLDNKTMSSKSSEVFKKHIKNWYAKNCYCKICRF*" XX SQ Sequence 4902 BP; 1804 A; 743 C; 672 G; 1674 T; 9 other; tgatatcaaa aaaaaaagat ggcagctaaa tttacggagg atctattatt acaagaaagt 60 aaaactaaga acaacaataa tgcaaatcaa acagagacta ttagttacct aaatgatttt 120 gaaagctttt tactagattt cgccgaacca caagcaaact tgagaatgga acactcaaaa 180 atctttatga atgccataaa caacatccat aaaagaaaaa aaagagcaga tacggaaagt 240 atttacgaag aagtattgaa acattttgaa aatcggcttg aaaaattaca tgtaagtgag 300 tttcttaatt tacttgtatc aaaaaacctt ttaagttgct caataaaaaa tggtaaggag 360 tcgtataaag tacttatagt tcccgaaaca aaattagaac ctgatctttg tataattaat 420 cacaaaaaag ttaaaacctt agaatctgaa attaaaattt tgaattacga attaaaaaac 480 ttaaaaagca gtgaaattag tctttcaatt ttaaaagatg aaaatttatt tctaaaagag 540 gagttatcgg agttmaggaa aattattgcc gttttaatcg ataagctaga agtcaaatca 600 accgtgactg atttaaaatc aactcaaaac ttaaactttt taaacgattt tgattttaat 660 agcccattaa accaaaactc aattaattca ccaaaagctg aaaccattaa tgccttccgg 720 caacaagaat gtggtattaa aataattgat agtaaaagta aagagattaa agaccgttta 780 aataatgact taacagatgt tcgccgtcaa ttaaacatta aatatttaaa ttttaaaaaa 840 tataatattc ctgaactagc tgtatcctca aactcaaaca ataataaaac aatgcgtaat 900 tcaataaata attttgaaaa tgtaacaaat caactcgtct cagtcagaaa aaattatcag 960 tcagactttt taaagtataa aaatctttca acaaatggtg ttgtttcgac ttcaagttcc 1020 cttagtacaa aagcggttgg aaatgtaaaa aataccaaaa aatcaaataa aaaaagtaaa 1080 caatgacact aaaataatca atcctcgtga taaccaaaca gatgaatata aatggccagt 1140 aggaacgacg ctaatatgtg gagactctat tttgatgggg attaacgaaa aaaaaaatgc 1200 ataagcttgg aaaagtaaaa gtaagagtat ttagtggtgc tactattgaa gacatgcatt 1260 ttaacctagt tccacttttg aaaaaagaac cttcaaaaat tatcctgcat gttggaacaa 1320 ataattgtac aaatgattcg tctgaatctg ctttggcaaa attaaaagat cttgtttcct 1380 tcatttcaaa atcyaaccct tcttgttcaa taatcttatc tacaccaatt cgtcgtttcg 1440 ataatgctaa agctcaayta acgtcgctac acttagccga aaaattaaaa ctaactaaat 1500 ttaatattat tgataactct aatttgtcta atgaacacct tgggctacgt ggtcttcacc 1560 taaacacacg aggatcaggt agacttgcca tgaactttat tgcatatctg aagtctattt 1620 gatagacatt cccctataga ttcttatcac caatctacct taccaagtac tattgaagaa 1680 attagtctaa attctgattt ggataatcta agggccaaat atcccaaaaa tatcagttta 1740 gcatttttaa atattaattc aattcgtaat aaattttcag atattattac ctatataaat 1800 aaaactgttg atatacttat attaggtgaa accaagttag atgactcctt tccaataaat 1860 caatttttat atccaggtta taaaacgcct tatcggcttg atatctctca acatagtgga 1920 ggtcttcttg tatacataaa tgaaaatatc atttctaaaa gattagctaa aaaggatttc 1980 cccagagaca ttcaaattat cacttttgaa attaaattaa gaatgcggaa atggctcata 2040 atagccactt ataaaccccc aaatcagaat tgtgactatt tcttcaatca ccttaaagga 2100 gtgcttgact tttatttgca atcatatctt gattttrttc taataggaga ytttaatata 2160 caaccttcag aaaaaaaaaa tgaaggaatt ccttcaactt tatcaatgta ttaatttaat 2220 taaacaaccg acttgtttta aatcggtcaa taacccttcc tgtatagacc ttatccttac 2280 taacagaaag ctgagcttcc aacattcaaa tacctttggt acgggtgtga gtgatcatca 2340 ccatttaata tacactatgc taagatccac tttttgtaag ttaccaccta aaaaacttat 2400 atatagatcg ttcaaaaatt tctcaagtga gatttttgca aaaatactcc gtttcaactt 2460 aaaatcttgt tatgatatta gtacttttaa tgagatattc agcaatttgc ttaatattca 2520 tgcccctctt aaatcaaaaa ttctaagagg taacaataaa ccttttataa ctaaagactt 2580 acgaaaagca attatgctcc gctcttcatt atataaaaaa tttttagttg ataaatcaga 2640 tttttcagct ctaatgtaca aaaaacaaag gaatttggtg gttaagctwa accgaaaaaa 2700 taagaagaag tactttgagc aaatcaatac actttcgamt tcaaatacat ctctctggaa 2760 gctaactaag cctttctttt cggacaaatc tacaaattct aacgagcgta tcactytgac 2820 tgaaaatggc aacattatct cagacagttc tttgttgact aatattttta ataaatattt 2880 cataaatgtt gcattgccat caagtatggc cgaccatatt aaatggtctt tttctccttg 2940 cacttcagac ccagttgagt cagctattaa aaagtatgtt gaccatccta gtataagaaa 3000 cattaaaaat aaattcaaaa tgaatgcaca taatttcaag ttccattgta ttacacctga 3060 tgatattaca aaaattatag ggtctatgaa tgaaaaaaaa aaaacaagtg gtgaaatacc 3120 aacttttatg ttaaaatctt acttaatata ctttagggat agtcttacag attgtatcaa 3180 taatagcatc ttagatggaa catttccgtc atttctaaaa atggcggatg ttttaccgtg 3240 ttttaaaaaa aatgacccaa cagataaggc taactatcgc cctataagta ttctacctgc 3300 kctctccaaa gtttttgaaa tgatcgttta tgaacagatt ttgttattta tggagccaaa 3360 atttgacaaa cttttgtgtg gttttcgtag aggttatagt actcagcata cacttatttt 3420 tttacttaat aagtggcatg aatgcattaa caatggaggt attgttggaa cattattaat 3480 ggatctatct aaagcgtatg atttcattcc gcatgaccta cttattgcta aattagaggc 3540 gtatggtttc tctaaagaaa gtcttaaatt tcttatatcg tatattagtg gtcgtaaaca 3600 aagggtaagg ataggacctt ctgtttctaa ttgggttgat attttgtcgg gagtgcctca 3660 aggttccatt cttggcccca tcctctttaa tatctttata aatgatatat tcttctttat 3720 taaagatact gaattgtgca actttgcaga tgataatacg ctgtatgcct gtgattacag 3780 cttagaaaag gttatttttc gattgcaaaa agaagcaact aataccattg cctggtttaa 3840 gtttaactcg atggttgcca accctgataa attccaatta cttttagttg gtttaaaaaa 3900 cacaaaaaat caatacctaa aaataggtaa tataactgtt tttgcctctg atagggtgaa 3960 actacttggt gttacaattg ataagaatct aaattttgga gaacacatta aatacttatg 4020 cagtaaagct aatggtaaat gttttgctct ttcacgcata agaagttttt tatcccgtaa 4080 taaagctaca ttgttgttta acgccttcat aatgtcaaat ttttcttatt gtcccctaat 4140 atggatgttt tgttctaaat actatgacaa ccttataaat aaaattcata aaaaagctct 4200 ttgtattgtc cataaaaggt ttgacttatg tcttgatgaa ttgttaagtt ttgataatag 4260 tccgaggatc cacttaagaa atttgcgctt tcttatgata gaagttttta aatcaattaa 4320 ttgtctaaat ccacaattta tgaaggattt attcaataca aaaaaccttc cttttaaact 4380 tcgttgtagc aaaaaaatga ttctaccttc aaatggctcc aaatataagg atcaatgttg 4440 tactatattt agagccatct cgttatggaa cactctagat aacaagacca tgtcctctaa 4500 aagttctgag gtgttcaaaa aacatattaa aaattggtat gcaaaaaatt gttattgtaa 4560 aatctgtcgt ttttaatttc tatttttatt tatcctgaca ttataataca atatttactc 4620 atttttctgt aatgaaacgg ttgtaatttg aatttgaatt tgaatatttt aatgttattt 4680 gtaatttata ttataatttt gtaaattata ttttgctttt tttttttaaa tactttttta 4740 aaattgattc tatttattta tttatttttt ttttaacgat tttcatgtga gtgtgtattg 4800 tctataattg ttaaatgtgt tgattttaat tagttttatt tttgttaaaa ctattgtaac 4860 catcaacaaa aaataaagtt ttttaaatta aattaaaatt aa 4902 // ID Copia-52_AA-LTR repbase; DNA; INV; 220 BP. XX AC supercont1.135; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-52_AA_; KW Copia-52_AA-I; Copia-52_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-220 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.135; Positions 887173 887392. XX SQ Sequence 220 BP; 54 A; 51 C; 41 G; 74 T; 0 other; tgttggagga gaataaccac cgtgcaaccc ctcgttgcta ccaatgttgc taccagcagc 60 acgatgatga gagagagcaa atacattttt cattcgttcg ttttcctttg tacgagtaac 120 acgttaataa atcattgttc gtagtttaat cgcgtttata accctttttc tccgattatt 180 ccgagttatc ccactgtatt ctgctagtgg acttccgtca 220 // ID Gypsy-97_AA-I repbase; DNA; INV; 4614 BP. XX AC AAGE02017326; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-97_AA_; KW Gypsy-97_AA-LTR; Gypsy-97_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4614 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017326; Positions 9162 13775. XX CC Positions [1918-2490] - Reverse transcriptase CC Positions [3616-4077] - Integrase core CC 'TGACC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 478..2148 FT /product="Gypsy-97_AA-I_1p" FT /translation="MENWNINPFKFNHLPDTQIRKEWLRWKRNFEVIIAAS FT EEKNATKLKNILLAKSGLELQDLFYSIDGADVEEDAEKEIDPFKTAIGKLD FT EHFTPKQHDSYERSEFCKMVPAITSDGTREPLTKFLMRCNEQAKKCDFGKT FT EAESRELRVIDKIIYHALVELREKLLHEETLTLAQVSRMVNSFESIKQQVQ FT AIAGNGIGEAPVSQPSDQDKVNRIFGSGKPTSGACYRCGQKTHYGNDQQCP FT ARNKQCEKCRKFGHFARVCRSMPKRKYDDHIPYSARKKAKTDNVRAIKCSE FT NENSPSFICNIGDGDEFLWVDVGGVLIKMLIDSGTNQNIIDDCTWRNMLIQ FT GVRCWKPIHVPNVVLRTYGRNATPLDVAHVLETTFTIGNGNNQRQETAIFY FT VIDGGSQPLLGKDTAKRLGVLKIGFQEPNMSVNAIVPAGKRPFPKMRNVQL FT SIAINKNVTPIAQRVRRPPIALLSRIEEKLNQLLMADIIEPVSGAAPWVSP FT LVTIVKDNGDLRLCVDMRRANQAILREHHMMPTFESFLPRLKSAKYFSRLD FT IKDAFHQVRQ" FT CDS 2290..4581 FT /product="Gypsy-97_AA-I_2p" FT /translation="MFGLSCAPEMFQKTMEQILAGCENVIDYIDDIFIFGE FT TEEEHNTALAKVLSVLKENNIMLNQDKCIFKAQQLDFLGHVISANGVRPTE FT QKIQALQKFREPRTSDEVRSFLRLVTYVGKFIPDMATITAPLRELICGANP FT FKWTDEHQRSFEKLKQLISNANTLAFFNNSLRTRVVPDASPVALGAVLLQF FT ADNTDNCPRIISYASKSLTTTEKRYCQTEKEALALVWAVEKFSEYLIGREF FT ELETDHKPLEVIFSPSSTPCARIERWVLRLQSFRFTVKYRKGSGNIADSLS FT RLVCSDNANDFEKDSHFMVLAIQESVAVDICEIEEASRDDPEIAAVKECLR FT GGQWKNPLTKPYEPFQNELGFIGDTMIRGNKLVVPKKLREQMLQLAHKGHP FT GESVMKRILRDRAWWPSMDRETKEHCTSCEGCQLVSFPSRPEPMNRRELPT FT KPWIDVAIDFMGPLPSGEYLLVVIDYFSRYKEVEIMNRITASETVERLGKI FT FTRLGYPRTITLDNAKQFIGKEFENYCKIHGITLNHTAPYWPQENGLVERQ FT NRSLLKRLQISHGLGRDWKRDLKDYLIMYYTTPHSTTGKTPTELCYGRTIR FT SKLPAIQDIESIPPASDFRDQDFLSKQKGKEREDIRRHAKHSDIDEGDTVL FT MKNLMPKNKLSTVFNKTKYTVVDRSGPRATVEDTDTGAMYERNVAHLKKIN FT DDTRVATEQRCDETMVDDRSINEEDNFAGFPIEDEEVPNHGFIGDGERQRR FT GRKVPEKYKDFVM" XX SQ Sequence 4614 BP; 1524 A; 898 C; 1044 G; 1148 T; 0 other; tctggcgacg agcttgtgaa acgtaccctc tgcatttagt aggtaagaaa aaataaataa 60 aaataaaata cttacaacaa acatccacgt gtgtcttttg caaggtgagc gcacgtgggg 120 caggcaaaag ccgttgcaaa acgcaaaaaa aagtgaccat ttccgctgcc gtagagcagg 180 gacaggcaac aaccgctgta ttgagcagaa accgatcatt tccgctgcca tagagcaggg 240 tcaggcaccg agccgcttcg agaaaacaag gcgtgtagcc gctgcgtttg ggcagaaaaa 300 aggcgagaag ccgctaccgg gagtagaaat aggcattagc cgctacaaat ttgcagaaat 360 tataaaaaaa aaaaaaacga accctagggc ttgttgtaca gtcgctatgg taacgaggtc 420 aaattgaaaa tttcaaatat ttttagctaa cgttccagcg cctactgttc gcgtaagatg 480 gaaaattgga acatcaatcc atttaaattc aatcaccttc cagacacgca aattcgaaag 540 gaatggttgc gttggaagag aaatttcgaa gtaatcattg cagcaagtga agagaaaaat 600 gcaaccaaat tgaagaatat actactagcg aagagtgggc tggaactaca ggacttgttt 660 tactcgattg atggcgcgga cgtggaagaa gatgctgaaa aggagattga cccattcaaa 720 acggctattg gtaaactgga tgaacatttt acaccaaaac aacacgattc gtatgaaagg 780 agcgaatttt gtaaaatggt tccagcaata acatctgatg gaacacgtga gcctttgacg 840 aagtttttaa tgaggtgcaa cgaacaagcg aagaaatgtg acttcggtaa aacagaagcg 900 gaaagtcggg agttgcgcgt aatcgataaa atcatctacc acgctctggt tgaattgagg 960 gaaaaactac tgcatgagga aacactcaca ctcgcacaag tatcccgaat ggtgaattct 1020 ttcgaatcaa tcaaacagca agttcaagca attgctggca acggtattgg cgaggcaccg 1080 gtatctcaac catcagatca ggacaaagtt aaccgcatat ttggaagtgg taaaccaaca 1140 agtggtgctt gttaccgatg tggacaaaaa acccattacg gaaacgacca gcaatgccca 1200 gctcggaata agcaatgcga aaaatgtcgt aagtttggtc attttgctcg agtatgccgc 1260 tcgatgccga aacgtaagta tgatgatcat attccatact ccgctcgtaa gaaggcaaaa 1320 accgataatg tgcgagcgat aaagtgttca gaaaacgaaa acagcccaag cttcatttgc 1380 aacattggag acggcgacga atttctatgg gttgacgtcg ggggtgttct gatcaaaatg 1440 ttgattgatt ctggcaccaa tcagaatatt atcgatgatt gtacctggcg aaacatgctt 1500 atccaagggg tcagatgttg gaagccaatc catgtgccga acgtggttct acgaacatac 1560 ggccgaaatg cgacgccttt ggatgtagca catgttcttg agacgacatt tacgattggg 1620 aatggaaata accagcgaca ggaaacagct attttctatg tcatcgatgg aggttcccag 1680 ccattactgg gaaaagatac agcaaagcgt cttggggtat taaaaattgg tttccaggag 1740 ccgaacatgt cagtgaacgc cattgtcccg gctggaaaaa gaccttttcc aaagatgagg 1800 aacgtacagc tgagtattgc gatcaataaa aatgttactc caatagccca acgagtcaga 1860 cgtccgccaa ttgcgcttct cagccggatt gaagagaagc tgaatcaatt actgatggcg 1920 gatatcatcg aacctgtatc gggagcagca ccatgggttt cgccattggt aaccattgtc 1980 aaggacaacg gagatcttcg tctgtgtgtg gatatgcgac gtgcgaatca ggctatctta 2040 agagaacatc atatgatgcc gacatttgaa agcttcctcc cacgtttgaa gtccgccaaa 2100 tacttcagtc gtctggacat caaagatgca tttcatcagg taagacagta gcctgaaaat 2160 ataatttttt attaaattgt ctattttttc taaacatgca ttatcgaaat atagattgaa 2220 ttagatgaat caagtcggta tattacgacc tttatttgcc ataaaggtct gtatagatac 2280 aaaaggctga tgtttggcct ttcttgtgct ccagaaatgt ttcaaaaaac aatggaacaa 2340 atcttagccg gatgtgaaaa cgttattgat tatatcgacg acattttcat cttcggagaa 2400 accgaggaag aacataatac tgctttggct aaagttctct ctgtgctgaa agaaaacaat 2460 atcatgttga atcaggataa atgcatcttc aaggcacaac aacttgattt ccttggacat 2520 gttatctcag ctaatggagt acgaccaact gaacagaaaa ttcaagcgtt acaaaaattc 2580 cgtgaacctc gcacttcgga cgaagtaaga agttttctta gattagtaac ttacgttggg 2640 aaatttattc ctgatatggc aacaattacc gcgccgttac gagaattaat ctgtggtgct 2700 aacccattca aatggacaga tgaacatcaa agaagttttg aaaaattgaa acaattgatt 2760 tcgaatgcaa acactcttgc ctttttcaat aattctctcc gcaccagggt tgtacccgat 2820 gcctccccag tcgcattggg tgcggttctg ctacaatttg cagataatac tgacaattgc 2880 ccccgtataa tcagttatgc cagcaagagt ttaactacta ctgaaaaacg ttattgccaa 2940 actgagaaag aggcactagc attggtatgg gcagtcgaaa aattctctga atatttgatt 3000 ggcagagagt ttgaattgga aactgaccat aaaccacttg aagtgatttt cagcccatca 3060 tcaacaccct gtgccagaat cgaacgctgg gtcttgcgat tacaatcgtt tagatttaca 3120 gtaaaatatc gtaaaggaag cggaaacata gcagattctt tgtccagatt agtttgttcg 3180 gacaacgcaa atgatttcga gaaggatagt catttcatgg ttttggcgat acaagaatct 3240 gtggcagtag acatttgtga aatcgaggaa gcatcgcgtg atgatccaga aattgctgcc 3300 gtgaaagagt gccttcgagg agggcaatgg aaaaatcctc tgacaaaacc gtatgaacct 3360 tttcaaaatg aattaggatt tatcggtgac accatgatcc ggggcaacaa attagtggtt 3420 ccaaagaaac tacgcgaaca aatgctgcaa cttgcacaca aaggtcatcc aggagaatcc 3480 gtgatgaagc gaatactgcg agatagggcg tggtggcctt ctatggatcg agaaacgaag 3540 gagcattgta catcatgcga aggttgtcaa ttggtaagtt ttccaagtcg accagagcca 3600 atgaatagaa gagagctgcc taccaagccg tggattgatg ttgccattga ttttatgggt 3660 cctcttcctt ctggtgagta tctactagta gttatagatt actttagtag atacaaggaa 3720 gttgaaataa tgaatcgcat aactgcctct gaaactgttg agcgacttgg caagattttt 3780 acacgtctcg gttatcctag aaccattaca cttgacaatg ccaagcagtt tattggcaaa 3840 gaattcgaga attattgtaa gatccatgga attactctta accatactgc cccgtactgg 3900 ccgcaggaaa atggtttagt tgaacgacaa aaccggtcgc tattgaaacg attgcaaatc 3960 agtcacggcc ttggcagaga ttggaaaagg gatctgaaag attatctaat aatgtactac 4020 acaacaccac actccacaac ggggaaaaca ccaacagaat tgtgttacgg aagaacaatc 4080 agatccaaat taccagcaat ccaggacata gaatcaattc cacctgcatc cgacttccgt 4140 gaccaagatt ttttgagcaa acagaaagga aaagaacgtg aagatattcg acgtcatgcg 4200 aaacattccg atattgacga aggggatacg gtactgatga agaaccttat gcccaaaaat 4260 aaactttcaa cagttttcaa caaaacaaag tatacagtag tggatcgttc aggtcccaga 4320 gcaaccgttg aagatacaga taccggagcg atgtatgaaa gaaacgtcgc gcatcttaaa 4380 aagatcaatg atgatactag agttgcgaca gaacagcgtt gcgatgaaac tatggtcgat 4440 gatagatcta tcaatgaaga ggacaacttt gctggatttc ctattgagga tgaagaagtg 4500 ccaaaccatg gtttcattgg tgatggcgaa cgtcaacgca gaggacggaa agttcctgag 4560 aaatacaaag attttgtaat gtaaatagtt taataattcg tataaaaagg gaga 4614 // ID I-59_AAe repbase; DNA; INV; 5490 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-59_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5490 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1330-1330 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 346..1002 FT /product="I-59_AAe_1p" FT /translation="MEAKNKRIESLEVTLNGNIHPGNNGMIAELLQKVDHL FT TSEMRKKDERIQILEAALQSGSRMEIVRKHGTIEDLVTKVNDLELQLKQKE FT REVYTFQSLFQKRSQSHVQHSSNVAQQSTDNPAAHSQAQNQIDIAPTNSKK FT NKNRKQKVKESQLTPMYESDTSPIPSEPTSVERTPKRDRAVTDSDESSGQP FT KNKVYTPNCDLIPNVNISTSSGDDDMSDS" FT CDS 1071..5189 FT /product="I-59_AAe_2p" FT /note="endonuclease, reverse transcriptase, and FT ribonuclease H." FT /translation="MGPKPIESGQSTKKEETTGVSITSSTTSINDTTEEDD FT VSKIPRPNATFFQAVPTSLLWEGPSTMPSPSEALLGSATSESGKNTIETKT FT SQPRRSARIKARYLNETSKSNADAQLGHANSCSVADYDRVQALRYGPAAVP FT IASSSSASAPPTFSRSFTNPGTSTAVIDTIKQTNFVLQWNINGFHNNISDL FT ECLIHDNLPVVIALQEAHRTTVNRMNRMLRGRYKWIYKSNANVYHSVAIGV FT LESVTFSLLQLDTDLPIVAVKLDYPFPVSIVSMYLPCGKIQDLKNRLLRAL FT EPIQGAKLILGDVNGHHTTWGSARVNARGSILMELAEVMDLISLNDGSITF FT TSGQRESTVDVSFASSSIVNRLLWSVGADPLGSDHHPISISINQRPPETSR FT RPRWIYNQADWASLQAMIDESLGSSEPDNLSTFLSTIHQAAAASIPKTSPK FT PGRKALPWWSPETKKAIKARRKALRLIKRLPDCHPDKDAAMALYRAKRNEC FT RQIIRDAKNQCWESFLDSINSSQSSTELWSKVNSLSGKRKAHGIAIHQDGV FT LTRDPSTIAQALGEYFASLSAFEKYSNTFKRRNDASSLEKMIIPEDKTHQP FT INEPFRLIELELALAKCNGKSAGVDEIGYPILKHLPPVGKAALLRLLNKEW FT TGNTLPKEWNHSLVIPIPKHGSVNKAPTDFRPISLTCCISKVMERMVNRRL FT VHFLESNGHLDHRQHAFRSGHGTGTYFATLGQVLHDAKAQEQHVELATLDL FT AKAYNRAWTPGVINQLSKWGLTGHVLHFLRNFLRNRTFQVCIGNHRSNTFS FT EETGIPQGSVIAVTIFLIAMNGVFSSLPKEIFIFVYADDIMLVVIGSTEKA FT LRRKLQASVNAVAKWGNEVGFELSAEKSAIMHLCTSRHRPIRSLVTANGQP FT IPLKKSVRVLGIHLDRHLTFVEHFNEVKRNCQTRLNLLRTLSRRHNTSNRD FT VRLRVAAAIIDSRLLYGIELSCTTTDAMVTSLAPVYHQSIRIASGLLPSTP FT SDAVCAETGRLPFTYLITETVCKRAISFLEKTTPCTGGVLLLDEANRLLRG FT HVNQVLPPVAEVHWNGARSWDSPRMNIETSIKNAFRAGDNSPALTRSVAEL FT LRTKYPDYTHRYTDGSKAQDGVGIGVTDPEISRFLRLPDPCSVFSAEAAAI FT LIASTMPSPKPVLILTDSASVLAALSSEVTRHPWIQAIQLKAHPGTVFAWI FT PGHCGIRGNEEADRLAAHGRHSTFYTKEIPGQDLKRWITTTIRDSWALKWL FT NTRNLFLRKIKYDVTRWVDPKNHLEQKIMSRLRTGHTKASHNMGGEGPFRR FT ICPACNVLYSVEHFIINCPQHQAARTLHDIPDSIRCALNNDPDNITRLMNY FT LKDIELYHNI" XX SQ Sequence 5490 BP; 1635 A; 1490 C; 1142 G; 1222 T; 1 other; tgctgggggt ttggacatac cagtaagcgt tgccaacaat cacatgcaac ctgtggcact 60 tgctcaaaag accacacgat tgataaggag aatccctgtc aagcggaggt atattgcaag 120 cggtgcgaca gacataacca ctctctggcc agccgaaaat gccctgtcta cggcaatgaa 180 aacgaaatcc aaaagattcg tgttaatcaa ggcatttcgt acccagcagc ccgaaggcta 240 tttgaacaag ccaataacca gcacaccttc tccagcgtca cagtagctag caaggatcag 300 caaatcgccg acctgtcttc taaagttgac aaactccttc aggaaatgga agcgaagaat 360 aagcgaatcg aatctcttga agtaacccta aatggcaaca tccacccggg caacaacggc 420 atgattgcgg agcttctgca aaaagtagat cacctgacca gtgaaatgag aaagaaggat 480 gaaagaatac agatcctgga agcagccctg cagagcggtt cccgcatgga aattgtccgg 540 aaacacggca caattgagga cctagtaact aaggtcaacg accttgaact ccaactgaaa 600 cagaaagaaa gagaggtcta caccttccaa agtctcttcc aaaaacgctc ccaatcccac 660 gtccaacact cctcaaacgt tgctcaacaa tctaccgaca accctgcagc tcattcgcaa 720 gcccaaaacc agatcgatat agcgcctaca aactcgaaaa agaataagaa taggaaacag 780 aaggtgaaag aatcccagtt gactcccatg tatgaaagcg acaccagccc aattccctca 840 gagccaactt cggtagagcg aacccccaaa agagatcgtg cagtcacaga ctctgatgaa 900 tcatccggcc agccaaaaaa taaggtctat actcctaact gtgacctcat tcccaacgtt 960 aacatctcga cttcaagtgg agacgatgac atgtcagatt cctagaaatt catccagtac 1020 tcctcgctaa agcactctac aacagggaaa ttacatgtgt tataccagtc atgggcccta 1080 aacccataga aagtggtcag tccaccaaga aagaggagac tactggagtg tctatcacaa 1140 gttcaaccac ttccatcaac gatacaacag aagaagacga tgtttccaaa atcccacgtc 1200 caaatgccac cttttttcaa gcagttccaa catctttgct gtgggagggc ccatcaacga 1260 tgccttcccc cagtgaggct ctgttgggat ctgcaaccag cgagtcggga aaaaacacca 1320 tagaaaccaa gacttcccaa ccaagacgct ctgcacggat caaagcaaga tatctaaacg 1380 agacttcgaa gagcaatgcg gatgctcagc taggtcatgc aaacagctgc agcgtggcag 1440 attatgatcg cgtccaggct ctccggtacg gccctgctgc ggtacccatt gcatccagtt 1500 cctctgcctc tgcaccacca actttctcta gatcgtttac caaccctggt acgtctacgg 1560 ccgtcatcga tacgataaag caaaccaatt tcgtgctgca gtggaatata aacggatttc 1620 acaacaacat aagtgaccta gaatgcctga tccacgacaa cctgcctgtc gtaatagcct 1680 tacaagaagc tcatcgcacc actgtgaaca gaatgaatcg aatgttgaga ggaaggtaca 1740 aatggatata caaaagcaat gcaaatgtat accattctgt agcgatcggg gtactagaat 1800 cggtcacttt ctccctgtta caactagata ccgaccttcc cattgtcgct gtcaaactcg 1860 attatccatt ccctgtatcg atagtcagta tgtatctgcc atgtggtaaa atacaggacc 1920 tcaaaaaccg actactccgt gcactagaac ccatacaagg tgcaaaattg atccttggcg 1980 acgttaacgg gcaccatacc acatggggaa gtgccagggt taatgctcga ggttccatct 2040 tgatggaact tgcagaggta atggacttga tttcacttaa cgatgggtcc attactttca 2100 ccagcggtca acgtgaatca accgtcgacg tatcctttgc tagctccagc atcgtcaaca 2160 gattgctctg gagtgtcggt gcagaccccc ttggaagcga ccatcacccc atctcgatat 2220 ctatcaacca acgacctcca gaaacatcgc gccgccctcg atggatttac aatcaagcag 2280 actgggcatc cttacaagca atgatcgacg agtcactggg ttcatccgaa cccgacaacc 2340 tttctacctt tctgagtacc atccaccagg cagccgctgc atctataccg aaaaccagtc 2400 ccaaacctgg ccgtaaggct ctcccatggt ggtcaccaga aacgaaaaaa gcaataaaag 2460 ccaggagaaa ggccctacga ttgataaagc gactccctga ttgtcaccca gacaaagatg 2520 cggccatggc cctctacaga gcaaaacgga atgaatgccg acaaatcata cgtgacgcaa 2580 aaaatcaatg ctgggaaagc ttcctggaca gtataaactc cagccaatcc tcgaccgagc 2640 tatggagtaa agtaaattct cttagcggaa aacggaaggc tcatggtatc gccattcacc 2700 aagacggcgt ccttaccagg gatccaagta caattgctca ggccctggga gaatatttcg 2760 cctctctttc ggcatttgag aaatacagca acaccttcaa gcgacgaaac gatgctagca 2820 gcctcgaaaa aatgatcatt ccagaagaca aaactcacca accaatcaac gaacccttcc 2880 gtcttatcga actagaactg gccttagcca agtgtaacgg taaatctgcc ggagtcgatg 2940 agataggtta cccaatattg aagcatctcc ctcccgttgg caaagcagcc ctgttacgac 3000 tgctaaacaa ggaatggact ggaaacacct taccgaaaga gtggaatcat agcctcgtta 3060 ttcctattcc taaacacggt tctgttaaca aagcacctac agattttcgt ccgatttcgc 3120 taacatgctg catcagcaag gtgatggaaa gaatggttaa ccgccgcctg gttcactttc 3180 ttgagtctaa tggtcatctc gaccaccgac agcacgcgtt tcgttccggt cacggaacgg 3240 ggacatactt tgcaactctt ggacaagtac tccacgatgc taaagcacag gagcaacacg 3300 ttgaactggc tacactagac ttggcgaaag cctacaatcg ggcctggaca ccaggcgtca 3360 ttaaccagct ttccaaatgg ggtctgacag gacatgtcct tcacttcctg agaaattttc 3420 tgcgtaacag aactttccaa gtttgcattg ggaatcaccg ctcaaatact ttttcagaag 3480 agaccggcat accgcaaggc tcggttattg ccgtcactat cttcctaatt gccatgaacg 3540 gggtcttcag tagtcttccg aaagagatct ttattttcgt ttatgcggac gacataatgc 3600 ttgtggttat tgggtccacc gaaaaagcgc tccgccggaa actccaggca tcagtgaacg 3660 ccgtcgccaa gtggggcaat gaagtaggat tcgagttatc agctgagaaa agtgccataa 3720 tgcacctctg tacatcccgc catcgtccca ttcgttctct agtcactgcc aacggtcaac 3780 caataccatt aaaaaaatcc gttcgagtac ttggtattca cttggacagg catctgacct 3840 tcgtcgaaca cttcaatgaa gttaaacgga attgtcaaac tcgcctaaac cttcttcgta 3900 cattatccag acgccataac accagcaacc gtgacgtccg ccttcgtgta gcagcagcaa 3960 tcattgatag cagactactc tacggtatag aacttagctg tacaacaacc gacgcaatgg 4020 tcacatcact cgctcctgtt tatcatcaat cgatacgaat cgcttccggt ctactgccat 4080 ccacgccatc agacgctgtc tgtgcagaaa ccggtagatt gccgttcacg tatctcatca 4140 cagagactgt atgcaaaaga gcaatcagtt tcctagagaa gacaacacct tgtaccggcg 4200 gagtccttct ccttgatgag gcgaaccggc tcctgcgtgg acacgtcaac caagtgctcc 4260 cccctgtggc cgaggtccac tggaatggag ccaggagttg ggactcgccc cgtatgaaca 4320 ttgaaacatc gataaagaat gccttccggg cgggagacaa ctcccctgcg ttgactagat 4380 ctgtcgccga gctacttcga acgaaatatc ccgactatac acatcggtat acagatggct 4440 ccaaggccca ggacggagtt ggtatcggcg ttaccgaccc tgaaatcagc cgttttctcc 4500 gccttccaga tccctgctca gtattttctg cagaagcagc agccatcctt atcgcctcta 4560 caatgccgtc accgaagcca gtcctgatac tgacagactc agccagtgtt ctcgcagcgt 4620 tatcatcgga agtgactcgc catccatgga tccaagcaat ccagcttaag gcgcatccgg 4680 gtacagtatt tgcttggata cctggacact gcgggatccg tggcaacgaa gaggctgacc 4740 gtttagcagc tcacggtaga cattcaactt tctacacaaa ggaaattcct ggccaagatt 4800 tgaaacgttg gatcactacg accattaggg actcctgggc gcttaagtgg ctgaacacac 4860 gwaacctctt cttacgaaaa attaaatatg atgttacacg atgggttgac cccaaaaacc 4920 atctcgagca aaaaattatg tccagacttc gcacgggtca cacaaaggct tctcacaata 4980 tgggaggtga gggacctttc cgaagaatct gtcccgcttg caatgtatta tactcggtgg 5040 agcacttcat cattaattgt cctcaacatc aggctgctcg cacgctgcat gacattcctg 5100 acagtatccg ctgtgctctc aacaacgacc ctgacaacat cacacgactt atgaattatt 5160 taaaagacat cgaactatac cacaacatct agccatcctc cccgacagaa taattcaact 5220 tgaccacgct tcgaaacttg acaacgaatt cgtttaatgg aaccacccta gaatacagcc 5280 tggaacagat cacgcacccg cacctggact atccaacgga aatagctttt atattacgaa 5340 tgattgtctt tccttaacca aaaatgttaa cttaacacat tactctgtat aagtattact 5400 gcccaggagg gccaatgtaa cgccctcttt tttcacggag acgaaccagc cactggctga 5460 aagtctcctt aataaagata aaaaaaaaaa 5490 // ID hAT-79_HM repbase; DNA; INV; 3352 BP. XX AC . XX DT 15-SEP-2009 (Rel. 14.09, Created) DT 15-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-79_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3352 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1921-1921 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 775..2976 FT /product="hAT-79_HM_1p" FT /translation="MAHVIKEIYETSDEVIELNAILNEQISSGSVLLKAKT FT TGVGKSKVWMNFSLLFKLHATKPVKNIAVCNICKKAVHQKLSSTTTLIRHL FT KTHAPTAQDPSQSILPVIKKENSTFKTSVQVHKNHKNAMLNSCLALFSTDL FT RPFVTIYGEGFESVINTAIHLGATYRGHITAKDILPSKHKVAEAIKTSVSK FT SRITISRILKAKLERSGVGITTDIWTDDFHRNSYLSMTLHWIENNQVQEAA FT MAFSVLPPISKTATVVSRYLIDNFKKVDAYTAGGLRRIIFTTDQGSNILSA FT FKDSGFVVAIEDIDIRPEILQSTRISCSSHILNNVLTKLFKPSELKIHCLS FT LHDLLHNCYQVVTFVKSGNVNAKLSRTLKKYVATRWNSHLALFKGVLLMYD FT ELLIHLRGTKKMVAIEAIDKNQLEVLVKFLKVFKAASDDLEGSTYPTIYMP FT ILLNFSMQDYFKSFSSSFAQIPLNITPESDNPESYVIDDDTDLANTLPVEN FT FLRWVNDSDFETQYDSQVFKQDYENQRDIITYLSSFATDILSDKFKINHIH FT CIGLFLIPFCKSFKVFGQNASAKLIETKENLRRLIQYCNFEKTNSSPTNSP FT PQKKQHSSTKPFSNHILPSLSSFIDSTAPVENNEIEQYSRIEVVAPNVQXL FT KNLGNEALKWWGKNKETFPVLFIVSSFVLAIPATSAPSERMNSIAKLITND FT RRANLAPETIEALMLLRMDKELQNSHFNAKDKH*" XX SQ Sequence 3352 BP; 1060 A; 595 C; 576 G; 1120 T; 1 other; tagagatggt cggacatggg ccacattgtg tccgatgtgc gaccgagtca aaaaaatttt 60 acaagatgat tgatcggttt gaattgaaaa attttcaacg gtgtccgtga tcgaaatgtt 120 cggttctgat tgaagaatgt tttgtccgat gtccgccatt gtccgacttt tttatggccg 180 ccatgttcgg cgtccgaccg ggccggcatt tctgggccaa atcgattaaa attgattgac 240 ttttcagaaa gtgttcgact gttcggttgt acgacttgtt cagcattatt atctctcttg 300 aatttattcc attttaatcc aaattattca ccacagcttt aaattatttc aagactttta 360 aaagtttcat tctccttgga ctacaggata ttggtgaatc gaagcattta atttttttta 420 acactgatgc agtggttttc ataatttaaa gtttagtttt ttgaaatcag cgtaaaaaac 480 tgcataagaa aaacttattt tgattatcaa attcgaacat gaccaaaagc aaagaagata 540 tgataaatat cctatttgct ttggttgtga acaaaaaatt aatttttttt aaatttttta 600 attgaatttt tcttttataa catattaaat atatatatac aaaaaattta tacatttatt 660 gaatttatta aaaaactttt gttttgtaaa tatctacttt gttttttaac tttgcatgtt 720 gtcaaattcg atttagttac acattttcga gaacttatta cacaaaaata caaaatggcg 780 catgttataa aagagatcta tgagacttca gatgaagtca ttgaacttaa tgcaatttta 840 aatgaacaaa tttccagtgg ttctgttctt ttgaaagcga aaactactgg tgtaggcaag 900 tcaaaagttt ggatgaattt ttctttgctc ttcaaattac atgccaccaa acctgtcaaa 960 aatattgcag tttgtaacat ttgtaaaaaa gctgttcacc aaaagctaag tagcactact 1020 actttgatca gacatttaaa aacacatgcc ccaacagctc aagatccttc acagtctatt 1080 ttgccagtaa taaaaaagga aaattctacc ttcaaaacat ctgtgcaagt tcataaaaac 1140 cacaagaatg cgatgttgaa tagttgtctt gcgttattta gtactgatct tcgtccattt 1200 gtaacaattt atggggaagg atttgaatct gtcataaaca ctgctattca tcttggtgct 1260 acttatcggg gacatattac tgctaaagac attcttcctt ccaaacacaa agttgctgaa 1320 gccattaaaa ctagtgtttc aaagtctaga atcacaatca gccgaatact caaagcgaaa 1380 ctagaaagaa gtggtgtagg cataactact gacatttgga ctgatgattt tcacagaaac 1440 tcatatctat cgatgacgct gcattggatt gaaaataatc aagtacagga ggcggcaatg 1500 gctttttctg tgttacctcc tatttcaaaa actgccactg tcgtctctcg atatctcatt 1560 gacaatttta aaaaggtaga tgcctatact gctggcggac tgagacgaat aattttcact 1620 actgatcaag gttctaatat tttgagtgct ttcaaagata gtggctttgt agtagcaata 1680 gaagatatag acattcgtcc agaaatcttg caatcgacaa gaatttcatg tagcagccac 1740 atcttgaaca atgttttgac aaaacttttt aaaccatctg aactgaaaat tcactgctta 1800 tctttgcatg atcttcttca caattgttat caagttgtta cattcgtgaa gagtggaaat 1860 gtgaatgcaa aattaagccg gactctaaaa aaatatgtag ctacacgttg gaattcacat 1920 ctggcactct ttaaaggagt ccttctaatg tatgatgaat tgttgattca tttacgagga 1980 actaagaaaa tggtagcaat tgaggcaatc gacaaaaatc agttggaggt tttggttaaa 2040 tttttaaaag ttttcaaagc tgcaagtgat gatttggaag ggtcaaccta tccaacaatt 2100 tatatgccaa ttttgttaaa ttttagtatg caagattact tcaagtcttt ctcctcatct 2160 tttgcacaga ttcctttgaa cattactcct gaatcagaca atccagagtc ttatgtaatt 2220 gatgatgaca ctgacttggc aaacactctt ccagttgaaa acttcttgcg ttgggttaat 2280 gattctgact ttgaaactca atatgattct caggtcttca aacaggacta tgaaaaccaa 2340 agagatatta ttacgtactt gtcaagtttt gcaactgaca ttttgtctga taaattcaaa 2400 ataaatcata ttcattgcat tggtttgttt cttattcctt tttgcaagtc tttcaaagtc 2460 ttcggacaaa atgcttctgc caagcttata gaaacaaagg aaaacctgcg acgtcttata 2520 caatattgta attttgaaaa gacaaactca tcaccgacca attctccacc tcagaaaaag 2580 cagcactcct ctaccaagcc tttttctaac cacattttgc cttcactttc ttcctttatt 2640 gactcaacag cccctgttga gaataatgaa attgaacaat atagtagaat tgaagttgtg 2700 gcaccaaatg ttcaarattt aaaaaatttg gggaatgaag cactaaagtg gtggggaaaa 2760 aataaagaga catttccagt attgttcatt gtatctagct ttgttcttgc cattcctgct 2820 acctcagccc caagtgaacg aatgaattcc attgcaaagc tgattactaa tgacaggaga 2880 gccaatcttg caccagaaac aattgaagcc ttgatgctac ttcgaatgga caaggaactg 2940 caaaattccc atttcaatgc caaggacaaa cactaagaaa acttattgaa aacttactga 3000 actttttttt taaatttact acctgtatta tacaggctgt atatactgca cttaaatctg 3060 tactagaaat tactacgaac ctgtaaatat gaaaaattgt ccgattcttg tgtcattttt 3120 tatttttcat ttttttgcat gttcgacgtc cgaccgcgtt cgagattttt agttcaatga 3180 ctgttcggtg atcgatcggt gttcggttcg acattagttt ttctgtccgg tgtttggggt 3240 gtccgagttg tccgggttgt ttgacaaaat tattgttgct cgtgttcgac cggttcggtg 3300 gttgtacgaa cacgggcgaa cagaaaaacg tcaatcagtc cgaccatctc ta 3352 // ID Gypsy-41_CQ-LTR repbase; DNA; INV; 1224 BP. XX AC AAWU01014757; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_CQ_; KW Gypsy-41_CQ-I; Gypsy-41_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1224 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 462-462 (2011). XX DR GenBank; AAWU01014757; Positions 5397 4174. XX SQ Sequence 1224 BP; 301 A; 325 C; 378 G; 220 T; 0 other; tgtaacctat ggtatttgca tcccaagatt agtgcgatcg tttgttccgc ggcacgttac 60 ggagggagca cttgagcagg cttgcgggca gaacgaccta ctagcgtagc tgtccagccc 120 ctacggacgc taccaaagct gcaccgctag ttgcgctagt caccgccagg agcgcacgag 180 atcatcgcaa acgcccgcct tgggtagcgg cgatgaccag gacgatggga cgatgggacg 240 aaggaggcca gaagggggag gttgaggatc gctgggaaaa ccccgccatt gcaacggtca 300 ctttgcctgg tatcttgaag agaacggtcg atatccgtga aaatcaccgt cacgaaagag 360 tgaagtttag aagtgaagtg gaagatccgg tagacgccat cccggagaaa agccgcccga 420 cgatctagag atccggacgc aagcaggagg agcctgcgtt gatttgtgga catccagaac 480 ccctcggtgt tcgtacgctt cgcgtacagg ccggcacaga cgcatacgtc cgccgtaagc 540 cacccgacac ccgcggtatt ccccgcgcgg tacatcgtac cgccgagcac gatcgctcac 600 gcacgaggat cgcccgcgac accgttccgg tgtgggaacc gccctcgcga cgccgccatt 660 ttggtcttct gaccaaccgt cgcgagataa gacgccagcc tcgccagccg ttataccaac 720 cgacgccgga aaccaggcag accggtagat tcatctaccg aggtcccaga gtccgtcacc 780 atactcggga gcgtcgaggc aggtggcgca gaagagggtc gagtacgaga gcagcagctg 840 accgcccata gtggtcgcac aacgagagag gtagacgtgt cccgccccgg tagaccgagt 900 cccggaggtc ggtggaggcg tgtagccgtt gttcgaggaa cgcgagctga gagtgagcgg 960 aagcagcagt tccggaaagg gcccgaagaa gaaggaagaa ggacgtgtga ggtaggatta 1020 gctagtaagg aagtgggaga actaggaagt gaggtactta cgcagcttga gtctgccaat 1080 aaatcgctag taacctgaat aaagtgggtt tgttttgtct aatagtctaa ttaagtgttt 1140 ccctaccgat tccggattaa catgccgacc ctgaggcaaa gaggggggtc tcattaatgc 1200 tggagtagcc caccattagt taca 1224 // ID Gypsy8-I_Dpse repbase; DNA; INV; 7193 BP. XX AC Unknown_singleton_95; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8_Dpse; KW Gypsy8-LTR_Dpse; Gypsy8-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-7193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1068-1068 (2009). XX DR Genome; Unknown_singleton_95; Positions 38071 30879. XX CC Positions [2105-2662] - Reverse transcriptase CC Positions [3704-4204] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 81..1409 FT /product="Gypsy8-I_Dpse_4p" FT /translation="MGKGQWIHRLRKEELVQCGHAFGVRLEGTVDEMRRTF FT KEWMREHEDEAEWADLIEVWECRANRSSPTPRADDKQAAYEHLAPPPGAAA FT AALWRSPVDIQRELIASLSVPIPERSHTGTPSRPRHPGRDRSREAERERVE FT PATTRPAHLDYARVAKQVREWSFRFDGTTKPLEFLEQVEWSAETYGLDPDL FT IPRAMPELLKGRALMWFVANNRQWRTWKEFSSSFQAYFLPRGYFEKLLQEV FT RMRKQKWGEPFKEYMVEMQTLMRPLKCPQEEQTELIRENSMPDLRAYMRPH FT RCKDLDTMMELADEFEALERDRLEFQRENPTVKARAANPFHKPADTTACRR FT CKDGPLEEAAREERGAHGRLPTPTNGNIEDGYVKNPAQACRRCGSADHWSR FT ECNGRPLTYCWRCGKVGISAWQCCRKTGNAPRPTPRRAVPGSQETVPQA" FT CDS 1109..4204 FT /product="Gypsy8-I_Dpse_2p" FT /translation="MQRRPAGGGSTGGKGSAWPLADTDQRQYRGRICQESG FT PGVPKVWERGSLEPGVQRSTAHLLLEMRKSRHLGVAVLQEDGKRPATHAAE FT GRARVARNGPSSLVGRLICEEEQLSAVVEVAGVELQATVDTGATSSFVSRE FT LADRLKGEGREVAAKKRVRLADGHSQEITHQIETKVGFGNKEIPMVLLVLP FT GVIDELVLGWDYLRAVGAVVTCAGHRVVIPAQDRERGGQEERLSVAKAEAG FT ETGPEYANKTDTTLRTAAREVAESTNGNEGAREEPVEDFLARELEAFAKIE FT GVSNVAVHTITMSDPQPIKQRYYPKNPKMQAEINQKVDELVEKGCIEPSKS FT PYSSPIVMVKKKNGKWRLCVDFRQVNARSVKDAYPMPRIDYILDQLREAKF FT ISSLDLKDGYWQIPLAESSRPITAFTVPGKGLYQWKVMPFGLHSASATFQR FT ALDQVIGPEMMPHAFAYQDDIVVIGRTEEEHRRNLKEVFRRLRLANLRLNA FT DKCEFFRKELRYLGHKVTGEGICTDPEKVAAIAELKPPTNVKELRQYLGVA FT SWYRRFVPDFATLVQPLTGLLKKKTEWVWTRERQEAFEEVKRRLVADPVLA FT CPDFSKKFILQTDASDYGLGAILTQETERGERVISYASRTLNGPERNYSAT FT EKECLAIVWAIRRLRPYLEGYRFKVVTDHMALKWLNSIESPSGRVARWALE FT LQQYDFEIAYRKGQLNVVADALSRQPVEERGRRIRNVEGTPPVGEPPCKWL FT EGMAEKIRKEAPKYPDYVEKGGNLYRHIPHRAGSEEVASWKLCVPKYARER FT VLKESHDNPEAGHAGGRRTAARVAARYYWPGMYRDVRAYVRKCELCLRFKP FT SQLQAAGEMLTQVPEEPWATVCADFVGPLPRSKHGNSMLLVLVDRFSKWTE FT LVPLRKATAEALIKACRERIIARFGAPKVFITDNGVQFAGRAFKRFLEQLG FT VRHQFTAPYTPQENPTERTNRTIKTMIAQFTRGRSKVLGRKMAGANAGHEF FT GSVGYNGIFTGVRGARKGTPAAQGSI" FT CDS 5669..7192 FT /product="Gypsy8-I_Dpse_3p" FT /translation="MQRGYICGWKTERRTLEGNYQQGENTARKKKEKKMET FT KRLNHEDGKKKMEDLRAFLEEQQAIVGEGGYRAEASRQRRASVLSRGDEED FT WCDGSQFLQLLASPLVSPAPSGHDSVSPTVSSDEGAGEIEDDVICVDGPRA FT PTAQEKASVRRPRTMRDRGEALKQKFQEATAEEKRRIRRVLEAARTIRRRR FT DERLEARRQQERKATRKEEPGHESDEAPPPPPFRVVRPPMQSPARGERASN FT PEAEWQRRLQQAEEEEDELWQPTPPAEDEEGSERRPPQAKEQQQPHQQYQQ FT QQQPQQQQRQHQHQQPQQQQQPQQQQQQQQQQQQQENGQWQQQPQWRGPEA FT AVIGPYVSQEVRTAVRQGMVWHHQTLHITWASGPAPCETQPETGARVWEES FT PLGSNDRDPRRRNHKSDPAPATAEAATAEAASAETATAEAATAEAATAEAA FT TAEAATAEAATAETATAEAATAEAATAEAATTEVTEHAEVEAWERGPWVWP FT APERSGAVERPT" XX SQ Sequence 7193 BP; 1924 A; 1773 C; 2515 G; 981 T; 0 other; ggttcgttac aactggcgcc caacgtgggg cctacaacga caaataatag gacaaaggga 60 gggagagtat agacgagaga atggggaagg gacaatggat ccatcggttg agaaaggagg 120 agcttgtcca atgcggccac gccttcggcg tgcggttgga gggtaccgtg gacgaaatga 180 gaagaacctt taaggaatgg atgagggaac acgaggatga ggcagagtgg gccgatctga 240 tagaggtgtg ggagtgtcgg gcgaacaggt cctcgccgac accccgggcg gacgataaac 300 aagccgcgta cgagcacctg gcgcccccac ccggcgcagc cgcggctgcg ctgtggaggt 360 cccccgtcga tatacaaaga gagttgatag cgagtctctc agtgcccata ccggaacgct 420 cacacacagg aacgcccagc aggccccgtc acccaggaag ggatcgaagc cgagaagccg 480 agagggagag ggtcgaacca gccaccacac ggccagcaca cctggactac gctagagtgg 540 cgaaacaggt gagagagtgg tcgttccggt tcgacggtac gacgaagcct ctcgagttct 600 tggagcaggt cgagtggtcc gcagagacat acggtctgga tccggatctg atccccaggg 660 cgatgccgga gctgctaaaa gggcgggcgt tgatgtggtt cgtggcgaac aatcggcagt 720 ggagaacgtg gaaagagttc tcttccagct tccaggctta ctttctgcct cggggatact 780 ttgagaagtt gctgcaggaa gtgaggatgc ggaagcagaa gtggggcgaa ccgttcaaag 840 aatacatggt agagatgcag acacttatgc gacccctgaa atgtccccaa gaggagcaaa 900 cggagttgat tcgggagaat agcatgcccg acttgagagc gtatatgaga ccacaccggt 960 gcaaggatct cgacaccatg atggaactgg cagacgagtt cgaggcgctg gaaagggacc 1020 gcctagagtt ccaacgggag aatccaaccg tgaaagcccg ggcagcgaat cccttccaca 1080 agccggccga cacgacggcg tgtcgacgat gcaaagacgg cccgctggag gaggcagcac 1140 gggaggaaag gggagcgcat ggccgcttgc cgacaccgac caacggcaat atcgaggacg 1200 gatatgtcaa gaatccggcc caggcgtgcc gaaggtgtgg gagcgcggat cattggagcc 1260 gggagtgcaa cggtcgaccg ctcacctact gctggagatg cggaaaagta ggcatctcgg 1320 cgtggcagtg ctgcaggaag acgggaaacg ccccgcgacc cacgccgcgg agggccgtgc 1380 cagggtcgca agaaacggtc cctcaagcct agtcggcagg ctgatctgcg aggaggagca 1440 gttatccgca gtagtggagg tcgcgggcgt ggagctgcaa gccacggtgg atacaggagc 1500 caccagcagt ttcgtgagcc gcgagctggc ggaccgactg aagggtgaag gccgggaggt 1560 ggcggcaaag aaaagggtga ggttggcaga tggccatagc caggagatca cacaccagat 1620 cgaaacgaag gtaggctttg ggaacaagga gatcccaatg gtgctactcg tattgccggg 1680 tgtgatcgac gagttagtgt tgggatggga ttacctccgc gcggtcgggg cagtggtgac 1740 ctgtgcagga cacagggtgg tgatccctgc ccaagatcgg gaacgaggag gccaggagga 1800 aagactgtca gtggcgaagg ccgaagctgg agagacagga ccggaatacg ccaacaagac 1860 ggacacgacc ctaagaacgg ccgcccgaga ggtggcggaa agcacaaacg ggaatgaggg 1920 ggcgagggaa gagccagtgg aggacttctt ggcccgagag ttggaggcgt ttgccaagat 1980 cgagggggtg tctaatgtgg cggtgcatac aatcaccatg agtgacccgc aaccgattaa 2040 acaaaggtat tacccaaaga accccaaaat gcaggccgag attaaccaga aagtggacga 2100 actggtagaa aagggatgca tagagccgtc aaagagcccc tacagttcgc ctatagtgat 2160 ggtgaagaag aagaatggca agtggagatt gtgcgtggac tttcgtcaag tgaacgcgag 2220 atcagtaaag gacgcatacc caatgcccag gatcgattac atactcgatc agctgcggga 2280 agcaaagttc ataagtagcc tggatctgaa ggatggatat tggcagatcc ccctggccga 2340 gagcagccgg ccgatcacgg cgttcacggt gcccggcaag gggctatacc aatggaaggt 2400 gatgccgttt gggttacact ccgcgtcggc caccttccaa cgggcattag accaagtgat 2460 agggcccgag atgatgccgc acgcattcgc gtaccaggat gacatagtgg tcatcgggcg 2520 aacagaagaa gaacaccgaa gaaacctgaa agaggtattt cgacggctcc gattggccaa 2580 cctgcggctg aatgcggaca agtgcgaatt cttcaggaaa gagctgcggt atctgggcca 2640 caaggtaacc ggcgaaggga tatgtacgga ccccgagaaa gtcgcagcca tcgccgaact 2700 gaaaccgccg accaatgtca aggagctaag gcagtacttg ggagtggcct cgtggtaccg 2760 gaggttcgtg cccgacttcg ccaccctagt gcagcctctc accgggctcc tcaagaagaa 2820 gacagaatgg gtctggaccc gggaacggca ggaagcgttc gaggaggtga agagaagact 2880 agtggcggac cccgtgctgg cgtgccccga cttctcgaag aaattcatcc tgcagacaga 2940 cgccagtgat tatgggctgg gcgcgattct cactcaggag acggaaaggg gagagcgggt 3000 aatatcgtat gctagcagga cgctaaacgg acccgaaagg aactattcgg cgacggagaa 3060 agaatgtctg gccattgtgt gggccatacg ccggttgagg ccatacttgg aagggtatcg 3120 cttcaaggtg gtcacggacc acatggctct gaaatggtta aatagcatag agagtccgtc 3180 gggaagagtg gcgagatggg cattagagtt gcagcagtac gactttgaaa tcgcatacag 3240 gaaaggacag ctcaacgtgg tagccgacgc actctcccga cagccggtgg aggagcgagg 3300 ccgtcgaata agaaatgtgg agggaacacc gccagtgggc gagccgccct gcaagtggtt 3360 agaaggcatg gcagagaaga tcaggaaaga agccccaaag taccccgact atgtggagaa 3420 gggggggaac ctgtaccggc acattccgca ccgggccggg agcgaggaag tcgcctcgtg 3480 gaagttatgt gtgccgaaat atgcccgaga aagggtttta aaggagagcc acgataaccc 3540 ggaggcaggc cacgccggcg gaaggagaac cgcggcacgg gtggcggcaa ggtattactg 3600 gccggggatg tacagagacg tgagggccta cgtgcggaag tgcgaactgt gtttgcgttt 3660 taagccaagt cagctacaag cggcagggga gatgttgaca caggtaccgg aagagccatg 3720 ggcaacggta tgcgcggact tcgtgggtcc tctgccgaga tcgaagcatg ggaactcgat 3780 gctgctggtg ctagtggatc gattctcaaa atggaccgag ttggtgccgc tgaggaaggc 3840 cacagccgag gcactaataa aggcctgtcg agaacgcata atagcccggt ttggggcacc 3900 caaagtcttc attacagaca atggggtgca gtttgcgggc agggcattta aacgatttct 3960 ggagcagcta ggagtccgcc accagttcac ggcgccatac acgccgcaag aaaacccgac 4020 agagcgaaca aacagaacaa taaagaccat gatcgctcag ttcacccgag ggcgatcaaa 4080 ggtgttggga cgaaaaatgg ccggagctaa tgctggccat gaattcggga gtgtcggata 4140 caacgggata ttcaccggcg ttcgtggtgc aaggaaggga accccggctg cccaaggctc 4200 tatatgatga ggagacggta ggcacggggc agggcgcaga gacgccggaa gaaaatgcga 4260 agaagttaaa agagctgttt cagttggtac gacgaaactt ggaaagggca gcccaggatc 4320 aggccaggca ctacaacctg agaagaagac cgtggagacc caaagtggga gaaaccgtat 4380 gggccaaaca gcatcacctg tccaatgcag cggagggatt cgccgcgaag ctggcaccaa 4440 ggtacgatgg gccgtaccag atagaggact tcatatcccc cgttatctgc agcctaagaa 4500 aggagggcga caggaaaaag agaacggccc acatacgcga cctgaaaccc cagcccgacg 4560 attgagggac gcaggggtga gagggaatgg atggaatggg tcgccctaaa agggtcgcgc 4620 aggaccggga acggtagggt gccacgcagg acacggggga agggtgccgc gcaggacacg 4680 cacgagactc gtcagcacgc agggtgacgg actggcgcac ttagatttaa taaaaaaaaa 4740 agggagagca ggaaagctgt ggttggaacg atctgaaaaa gaaaaggggt gagaattagt 4800 aggagtaaag aaaattaatg cgagaagggg gaaaagctca gcaaacctga acaggaaacg 4860 cgtccccgaa agctgggagg cgaatccggg gccggggagg gtaatctcag gcagcggatg 4920 accaaagaaa gcccaggggc caggctcacg cctcaacgcg gtctcacagg gcaactacaa 4980 gtgtcgggcc ggaacgacgg gtgccgctca tgcgtggtct accgggcgag gagtcggatg 5040 atcctcgggt ccgagccact gtgtgaggga gatgccggtc aaaagagtcc ggccgggggc 5100 gcgtagtagt tgccctggga tgtgagccaa agattctagc ccgggcggag ggtcgaagga 5160 cccgttgttc acgagcggaa tcgcggcaag cgacccgcag tgagggcaga gggcgagctg 5220 gaaaggaaga aagggttaag taaaaggcat ccgggtcggg agaactcacc caggccaagg 5280 agggtccaat cagcacggga ggaagaacag ggcgacggta gcagggaagt agatggtaat 5340 taaaataaaa ggagaatata tatatacaga aaacaaatgt acgtacgcac aaatgggaag 5400 ggggtggtcc atcaaagatg agaaaagaat attatccgtc agccaacaca gaacgcggcg 5460 agaacacaac agtatccaaa aaaaaagaca aggcgaagcc ggataaagag cggagagcgc 5520 gttggtactc acccaaccag gggaaagatg ccgttagaga gcgaggccgg tagagagcca 5580 agccatgagc acatacgtaa caaccgttag aaccaggcaa gagagaccga acccgacgca 5640 ccggcaccgt tagagggaac gactggtaat gcaacgcggg tatatatgcg ggtggaagac 5700 ggaacggcgg acacttgaag gtaattatca acaaggagaa aacacagcaa gaaaaaaaaa 5760 agaaaaaaaa atggagacga aacggctcaa ccacgaggac ggaaagaaga agatggaaga 5820 cctcagggcg ttcttggagg agcagcaggc gatagtgggg gaggggggct accgcgccga 5880 ggcttctcgt cagcgacggg cgtccgtgct ctcgcgtggc gacgaggagg attggtgcga 5940 tggctcccag ttcctccaac tgttggcctc gccattggta tctccggcgc cgtcgggcca 6000 cgactccgtg tcaccgacgg tatcctcaga cgagggggcc ggggagatcg aggacgacgt 6060 gatttgcgtc gacggtccac gcgccccaac cgcccaagag aaggccagcg tgagaaggcc 6120 gcggaccatg cgcgaccggg gcgaggcgtt gaaacagaag ttccaggagg cgacggcgga 6180 ggagaagcgc cgaatccgcc gggtgctgga ggcggcgcga acgatccgac ggcggaggga 6240 cgagcggctg gaggcgcggc ggcaacagga gagaaaggcc accaggaagg aggagcccgg 6300 ccacgagagc gacgaggcgc cgccaccccc gccattccga gtggtgcggc cgccgatgca 6360 gtccccggcc aggggcgagc gggcctcgaa tcccgaggcc gaatggcagc ggcgactgca 6420 acaggccgag gaggaggaag acgaattgtg gcagcctacg ccaccagccg aggacgaaga 6480 gggctccgag aggcggccac cgcaagccaa ggagcagcag cagccgcatc agcagtacca 6540 gcagcagcag cagccgcagc agcagcagcg gcagcaccaa catcagcagc cgcagcagca 6600 gcagcagccg cagcagcagc agcagcagca gcagcagcag cagcagcaag agaacgggca 6660 gtggcagcag cagccacaat ggaggggacc ggaggcggcc gtaatagggc catacgtgtc 6720 ccaggaggtg aggaccgccg tgcgccaggg gatggtatgg caccatcaga ccctgcacat 6780 cacgtgggcc tcggggcccg ccccgtgcga gactcagcca gagaccggcg cccgagtgtg 6840 ggaggagagc cccctcggca gcaacgacag ggacccgcgg aggagaaacc acaaatcgga 6900 tcccgcgccg gcgacggcag aggcggcgac ggcggaggcg gcgtcggcgg agacggcgac 6960 ggcagaggcg gcgacggcag aggcggcgac ggcagaggcg gcgacggcgg aggcggcgac 7020 ggcggaggcg gcgacggcgg agacggcgac ggcagaggcg gcgacggcag aggcggcgac 7080 ggcggaggcg gcgacgacgg aggtgacgga acacgcggag gtcgaggcgt gggaacgcgg 7140 tccgtgggtg tggcccgcgc cggagaggtc gggggcggtc gagcgcccaa cac 7193 // ID Gypsy-106_AA-LTR repbase; DNA; INV; 275 BP. XX AC supercont1.333; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-106_AA_; KW Gypsy-106_AA-I; Gypsy-106_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-275 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.333; Positions 562269 561995. XX SQ Sequence 275 BP; 85 A; 46 C; 63 G; 81 T; 0 other; tgttacggat acgccctttg catttgccgc agatggtaag gacaatagaa acgtcaacgg 60 ataacgggga gaagtgagaa ttacacagaa cagaattcag tcacatgtag aacgtggata 120 ataaaacaca tgaattacac atatatgtaa cgttatcgct atttattcgt aattattcga 180 aagtctcatt acttttatac gcgcggggta gtttcgttcg gtacttagcc ggtgctaaga 240 gtttcgtgag tgtttagtta gcgaatttcg taaca 275 // ID Harbinger-N15_BF repbase; DNA; INV; 446 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N15_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N15_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-446 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-446 RA Kapitonov V. and Jurka J.; RT "Harbinger-N15_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 807-807 (2008). XX DR [2] (Consensus) XX CC It is a very old family: copies are only 80% identical to the CC consensus. It is characterized by 35-bp TIRs and 3-bp TSDs. XX SQ Sequence 446 BP; 118 A; 99 C; 109 G; 120 T; 0 other; agcgggtatc tcactgggcc gcgcctgttg gcgactgttt gcgaaagaca ttcgctaaca 60 gtcgccaaca cgtcgccaac aggtgccaat tggtcgcctg aaatggcgaa tttttttaaa 120 attaaaaaaa tcgcaattgg caaaatgtcg cgaatgggtc actgcaaact agatgcgatc 180 actgcaaatc tctttttgat gaatcttaat cattgtacat tatattcagt ggcaacactg 240 cgatttccaa taaagttatc ttgtatctaa tttggggccg aaactgagga tgtgacattc 300 cgggtgcggc cattttgttt gtagaggtca cgtattttac ccgtgcgtcg cacgtcagtg 360 tgaccgcaag ggaaaattgg cgaaaatttc gcaaagttgc cgcaataatt tgtcgccgac 420 aggcgcggcc cagtgagata cccgct 446 // ID BEL-232_AA-I repbase; DNA; INV; 5876 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-232_AA_; KW BEL-232_AA-LTR; BEL-232_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5876 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 919-919 (2011). XX DR [1] (Consensus) XX CC Positions [4637-5209] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(806..2203,2207..5602) FT /product="BEL-232_AA-I_1p" FT /translation="MEWMIFETEFKTSTEEFRLTDRDNIRRLNKALQGKAR FT KTVESLLSSPDNVAQIMRMLKSNFGRTEWVVVNRLEALRNLESVKEGNIES FT FRTFYNAVVGTMVAMQNAKADNYLLNPELISTLAEKLPGFSKQMWIRHKAA FT LMKEDTVVDFNTFARWLEDEMDNQLASLNPSFGPRKVNYLAKPKPVVFNVN FT SRDEETPKKCPLCSVGSHFSLDKCEQFRGLSVAQRRTTARSCKVCYICLKS FT DHARRNCKSSKKCSVCGKNHHELVHSDESPRSFPPSMPNVCNVNGKNANTL FT LRVGKVRIKGPNCVEEVFALFDEGSSLSMMDATVAAKIGLQGPTSSVNYRW FT TNGISHKEEDSMMLSFDISGPAVQAKWYRANYIRSVQNLDLPPMNFDIEKV FT RQLYPMLDEDKLAAVQGARPCMLIGSNNAGLIVPLKTIQYSLHGLQLTRCH FT LGWTVHGLIEPSDTASFDNHALCAEDDDVELTDLVKQIYKVEDFGITGQSP FT KMSDEDQRAIDIMNRTITRRGERFEVGQIYKYNNFSFPDSKPQALRRLQII FT EKKMDSDPDFAEKYCSKIQDYINKGYARKLESDELTETPNTWYLPHFSVES FT AGKFRLVMDAKAKSHGFSLNDLLLKGPDLVPALIAILMRARKKKIGFVADI FT KEMFHQILIRRMDQDSQRFFFRGMDRSEPPSVYIMMVMIFGAVSSPSMAQF FT IKNFNAKEMEEEYPGVERAVIDQHYVDDYFDCADTDEEAIELVQRVVETHD FT HGGFQLVKFSSNSEAFLDSLDPALVADRVGSRTRVLGIDWDLQTDELVFSL FT EFPKLIEKFRTGESVPTKRQLLKFMMGIFDPLGFLSPITIHLKIIFQELWR FT LQSGWDDEIPDGLVPRWVEWLNETARLKEVRLPRYYFPEIPSFADAEFHAF FT CDASDKAFACVIYMVHRHKGTSHVALVYAKSRVAPLKSSTVPRLELQGCVL FT ASQMMKTLQDEMKIEVTKLFFWTDSKICLSWLKTSQKLTAYVGSRIMKIKE FT NGHSVDLWHWVPSQLNVADLGTKSSKFSNMQEWMKGPDFLLHVSTRWPIDG FT PLELTPAELVNFHSELTVDATADSLCCTVSEEYPIDLPDIERFSDFNRLIR FT STAYYLRMKKILALPKNEKPKRLFITVHDMTDARNAWYRKVQSEVFGDELR FT SLKQTGYVKNSSRLKTYSPFIHNGIVRMRGRVQDNSKPFDVNNPVILPDNH FT PFSMLLIRMYHIVNAHQGLETIVNNIKERHRILKIRSQVKKYTASCIKCRE FT LRALPTVPQMGNLPPERTTPFAFPFTNTGIDYFGPLVVKVGRRLEKRWVVL FT FTCLTSRAIHLEVVPSLDTNSCIMAIRCFMSIRGLPQCIMTDNGTNFTGAD FT QELKRLVKTLDQPQIEDSMSVRGVQWKFIPPGAPHFGGCWERLVRSVKNAL FT RAMLQERHPTDLVLRTTLCEVMNTVNNRPLTHVSSDPLDAEPLTPNMLLLG FT RNDSIQFDHDFDERDLDCRAAYKNAQIYADRFWRKWLSSYRPELIRSRKWP FT DNRNYHEYHVGDFVMIVDENLHRGCWPKGVVEKIYKGPDGKIRTVSVKTSK FT SSFTRPVSKIVLLQANISFPGSENVANTGSN" XX SQ Sequence 5876 BP; 1695 A; 1252 C; 1429 G; 1497 T; 3 other; ttggtccttc gaataccaaa cgccagttta tatggcacaa cagttgcgcc gctctagaag 60 agaaactaga ggcgtagtac caaagcgact agaaaattac tttgttcaac ttccacatca 120 accaaccatg tttcgtactc ctgtaaatga acagcagcag aacaaatcaa tccagcgaac 180 tgcagamcat gtctcagccc cgatcggaaa tgacgcggag aacgatcctg tttctggttc 240 tcgcatctcg aagggtgctg cccactctga aaggcgttca aagaaaagcc tatcttctcg 300 tatcagcgaa gctgacagcc aactgaaggc tcttttgttg gaagagcaga tcgaggagga 360 tgaactgaag gcgtctatcg aacgagacaa aattctggct gagaaacgat tagctgcgct 420 ccagatagaa caagacctgg cgttgaagac ggaagaaaga aaagcagagt tcctcaagcg 480 aaagcgtgag cgtgaactta agaaggaaga acttaagact ggttcaagtt acggctcgag 540 tgtctgcagc agcagaggta gcagctcgag tgcgtacacc agtgtatctc gacgtgtcac 600 caaatggctc accgagtcaa aaaccgtgga agcaagcaac acgaacaatc tggagtcaaa 660 tcgcataatt cgcccatcgt cccgcgagcc tgcagcgaat gatgtacctg ttcccgaacc 720 tggcgtgaat gcaatcctgt acgaagcatt caaggcactg caaggaagaa acgtcaagga 780 cttgccgtcg ttttctggtg acatcatgga gtggatgatt ttcgaaactg aattcaaaac 840 gtccactgag gaattccgtc taacagatcg ggacaatatc aggcgactga ataaagcgtt 900 acaaggtaag gcacgcaaaa ctgtggaatc gctgttgtct tcccctgaca acgtcgctca 960 gatcatgagg atgctgaaat cgaactttgg acgcactgaa tgggttgtgg tcaatcgatt 1020 ggaagcactg cgaaatttag agagcgtcaa ggaaggtaac atcgaatcat ttaggacttt 1080 ctacaatgct gtagtaggta ccatggtcgc gatgcagaac gccaaggcgg ataattatct 1140 gctcaacccc gaactcattt caacccttgc tgaaaaactg cctggtttca gtaagcaaat 1200 gtggattcgc cataaggcag cattgatgaa ggaagatacc gtggtggatt tcaacacttt 1260 cgctcgttgg ttggaggatg agatggacaa tcagttggcc agtttgaatc catcgttcgg 1320 tcccaggaaa gtgaactacc tagcgaagcc aaaaccggtt gtattcaatg tgaacagccg 1380 ggatgaggaa acaccgaaga agtgtccact gtgttcggta ggttcccact tcagtttgga 1440 caaatgcgag caatttcggg gactttctgt agctcaacgt agaaccaccg cccgatcgtg 1500 caaagtgtgc tacatttgct tgaaatcgga tcacgccaga agaaactgta aatcttcaaa 1560 gaagtgttca gtttgtggta aaaatcatca cgaattggta cacagcgatg agtcacctag 1620 aagcttcccg cctagtatgc cgaatgtgtg caacgtgaat ggtaaaaacg cgaacacatt 1680 acttcgtgtt ggaaaggtga ggatcaaagg tccaaattgc gtcgaagaag tcttcgcgct 1740 atttgatgaa ggttcatctt tgtcaatgat ggacgctacc gttgctgcga agattggact 1800 tcaagggcca acgtcgtctg tgaattatcg ttggacgaac ggtatatcac ataaggaaga 1860 agattcgatg atgctctcat tcgatatctc tggtcctgca gttcaagcaa aatggtacag 1920 agcaaattac attcgttctg tgcaaaatct cgacctacca ccaatgaatt tcgacatcga 1980 gaaggtgagg cagctgtacc caatgcttga tgaagacaag ctagctgctg tgcaaggtgc 2040 tcgtccctgt atgctgatag gctcgaacaa cgcaggtttg attgtaccac tgaaaacaat 2100 tcaatactca ctgcatggtc tacaactaac tcgctgccac ctgggttgga cggttcatgg 2160 cttgatcgaa ccaagcgata ctgcatcatt cgacaatcac gctttwttat gtgctgaaga 2220 tgacgacgtc gagctgacgg atctggtgaa acagatttac aaggtagaag attttggaat 2280 caccggacag tctccaaaaa tgtctgacga ggatcaacgt gccatcgata ttatgaaccg 2340 taccatcaca cgaaggggag agcgcttcga agtaggacaa atctacaagt acaacaactt 2400 ttcgttcccg gacagcaagc ctcaagcgtt aaggcgacta cagattatcg agaagaagat 2460 ggattctgac cccgatttcg ccgagaagta ttgcagcaaa attcaagatt acatcaataa 2520 aggatatgca cggaaactgg agtctgatga actcactgaa acaccgaaca catggtactt 2580 gccccacttc agcgtggaga gcgctggtaa attccgcttg gtcatggatg cgaaggccaa 2640 atcgcatggt ttttctctca atgatttgtt gttgaaaggt cctgacctcg tacctgcctt 2700 gattgcgatc ctgatgcgtg ctaggaagaa gaagattggc tttgtggcgg acatcaaaga 2760 gatgttccac caaatcctca tccgtagaat ggaccaggac tctcaacgat tcttttttcg 2820 tggcatggat cgctctgaac ctcctagtgt gtacatcatg atggtgatga tcttcggtgc 2880 tgtgtcatct ccctctatgg cacaattcat caagaatttc aacgcgaagg aaatggaaga 2940 ggagtatcct ggtgtggaac gcgccgtgat agatcaacat tacgttgatg actattttga 3000 ttgtgcggac accgatgaag aggccatcga actggtacag cgtgtggtcg agacccatga 3060 tcatggcgga tttcaattag tcaagttttc atcgaactct gaagcttttt tggattctct 3120 cgatcctgca ttggttgctg atcgagtcgg aagcagaacc cgtgtgcttg gtatcgactg 3180 ggaccttcag accgacgaac tggtgttttc cttggagttt ccgaaattga ttgaaaagtt 3240 ccgaaccggt gaaagtgtgc ctacaaaacg acaactcttg aagttcatga tgggcatatt 3300 tgatccacta ggtttcctga gtcccatcac catacacctg aagatcattt ttcaagaact 3360 gtggcgattg caatccggct gggatgacga gattccagat ggtttggttc cgagatgggt 3420 agagtggtta aatgaaactg ctaggctgaa agaagttcga ttgccgagat actattttcc 3480 ggagattccg tcgttcgctg atgctgagtt ccatgccttt tgtgacgcga gtgacaaggc 3540 tttcgcttgt gtaatttata tggttcatcg gcataagggt acgtcacatg ttgccttggt 3600 ttatgctaag tcaagggtcg ctccactgaa gtcttcaact gtgccacgtc tggaacttca 3660 aggttgtgtg cttgctagtc agatgatgaa gacgcttcaa gacgagatga aaatagaggt 3720 gacgaaattg ttcttctgga ctgactccaa aatttgcctg agttggttga agacgagtca 3780 gaagcttacg gcgtatgtcg gatctcgcat tatgaagatc aaggagaacg gtcacagcgt 3840 tgatttgtgg cattgggtac catcacagtt gaacgttgcc gatcttggaa caaagtcgag 3900 caaatttagt aatatgcaag agtggatgaa aggtccggat tttctgcttc atgtcagcac 3960 ccgctggccc atcgacggac cgcttgaatt gacgcctgcc gagttggtaa attttcactc 4020 cgaactgact gtcgatgcca ccgctgattc gttgtgctgc actgtatctg aggagtaccc 4080 tattgattta cccgatatcg agcgattctc cgatttcaat cgacttatta ggtcgacagc 4140 gtattacctg aggatgaaaa agatattagc attaccaaaa aatgaaaagc caaaacgttt 4200 attcatcact gtgcacgaca tgactgacgc aagaaatgca tggtatagga aagtacagtc 4260 ggaggtgttt ggagatgaat tgcgtagctt aaagcagacc gggtatgtta agaacagtag 4320 tcgcttgaaa acctattcgc cgtttatcca caacggaata gtccgtatgc gaggacgtgt 4380 gcaagacaac agtaagccgt ttgatgttaa caaccccgtc attctccccg ataatcatcc 4440 attttcgatg ctactgatcc gcatgtacca tatcgttaac gctcaccaag gacttgaaac 4500 gatcgtcaat aacatcaagg agcgtcatcg cattctgaag atccgttcac aggtgaagaa 4560 gtacactgca tcctgcataa agtgtcgaga gcttcgcgct cttccgaccg ttccgcaaat 4620 gggtaatctt cctccggagc gaacaacacc gtttgcattc ccgttcacca ataccggaat 4680 agattatttt ggaccgttgg ttgttaaggt tggacgtcgc ttagaaaaaa gatgggtggt 4740 tctttttacg tgtctgactt ctagagctat acatcttgaa gtagtacctt cgctggacac 4800 aaatagctgc attatggcca tccgttgctt tatgtcaatc cgtgggttac cgcagtgcat 4860 tatgacagac aatgggacaa attttactgg cgcagatcaa gaactaaaaa ggctagtaaa 4920 aacattagat caaccgcaga tcgaggattc gatgagtgtt agaggtgtac agtggaaatt 4980 catcccaccc ggtgctccgc actttggagg gtgttgggag cggttagtgc gctctgtaaa 5040 aaatgctcta cgtgcgatgc ttcaagaaag acacccgaca gatttggtgc tgcgtactac 5100 attatgtgaa gtaatgaata ctgtcaacaa taggccgttg acgcacgttt cgagtgatcc 5160 attagacgct gaaccgttga ctccaaatat gctacttttg gggaggaacg attcaataca 5220 atttgaccat gacttcgacg agagggacct agattgccga gcagcctaca aaaatgctca 5280 aatttacgct gatcgtttct ggcgtaaatg gttatcgtct taccgtccag aactgataag 5340 aagccgcaag tggcctgata acagaaacta ccacgaatat catgttggtg attttgtaat 5400 gatagtagat gagaatttac atcgaggatg ctggccgaaa ggagttgtcg agaagattta 5460 caaaggtcca gatggaaaaa ttagaacggt gtccgtgaag acttcaaaat catcgttcac 5520 aagacctgtt agcaaaattg tattgctaca agcgaacatc tcgtttccgg gctcggagaa 5580 tgttgccaac actggtagca actaattatg ttgctcggat atcgttggcg ccatagcaac 5640 cggataacaa atctaagtaa gctgcggctg aaaaaaaatk agactgaacg aaaaattcaa 5700 gcgatcaaga cgtgcgagaa gaaaaactta tcaaactagt taaaattaga aaacttaatg 5760 ttaaattagt tctaatccta aatgtaaaat aaaattagtt aaaattactt tgaagcattt 5820 cattcgagtt tgtctggtaa aatcctacaa atagtatttg tagcagctgt agcagg 5876 // ID NeSL-1_ACa repbase; DNA; INV; 4287 BP. XX AC . XX DT 09-JUN-2009 (Rel. 14.06, Created) DT 09-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE NeSL-1_ACa is a protozoan NeSL non-LTR retrotransposon - a DE complete sequence. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; U2 snRNA; KW NeSL-1_ACa. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-4287 RA Kapitonov V.V. and Jurka J.; RT "Non-LTR retrotransposons in the Acanthamoeba castellanii protist RT genome."; RL Repbase Reports 9(6), 1143-1143 (2009). XX DR [1] (Consensus) XX CC Two copies NeSL-1_ACa are present in the genome; they are CC inserted in the same target site into U2 snRNAs (GTAGTA|TCTGTT, CC where the insertion of the retrotransposon is marked by "|"). XX FH Key Location/Qualifiers FT CDS 108..4163 FT /product="NeSL-1_ACa_1p" FT /note="contains the reverse transcriptase and FT restriction enzyme-like domains." FT /translation="MAAKSVACPHDGCANKYASEASLRRHIKNKHATDEEG FT DETSHSCPHCHRPFSTARGLSVHIGKSHRQAPPEPTRPPPAPAPADPGLDP FT DPGPTVTPPSRDDEDREEPDDDPVEIADLSCPHCAQALPSAHGLANHLRAC FT KDHRVPAPGAPRSGPPSSRYWTAVEHHRYVEAMARFADHPDLLARAAAHIG FT TRTYKQVDSHRTKVIAAEREGRPVRTLDPTMDWRMRPYCASTTARWLAEQG FT RSPVAPRSPCPEPHAPPPAAALLYIPATPPAPTPRAPVAPPKLAPPAESTV FT PATPDGNPEAPAPPFSAPGPPTPKALPPPPPSRRNLRPHLVPKDAWQGVAD FT AVAPAASRLLRTPLAHLSTEQWATFEAALAGLEATLHHAARSAEAVPTRCA FT SRAREDAERQLREARKTREIFGKAAALYAAGKDPTATIERIPPEVRLHLPT FT PGSAEWPARAAAARRVIRRAVARADRLRKRMGILDSDRDLQRLFNANQKKA FT VRQILAPSTKAPRCQLDPAAVEEAYIQTLAKPPPIDPSPPWKNSVQWPRPP FT TAADDGGSPFSVAEVRAQLRRLPNGSAPGIDGIPYEAYKRTKLDATLAHVF FT EVVRLNARLPARWDVARTVLLYKKGDPNDTGNWRPISLQVTIYKIFTAALS FT KRLISWAGKHNTFSASQKGFLPAEGCHEHAFVLRSVLDDARRHKQNVYLAW FT YDLRNAFGSVSHDLIAWCAAMLGLPRYLRDAIGAIYRHSALFVQVGDQETT FT GVIPMRCGVKQGCPLSPLLFNLCVEPALRCLRRTTGYKFYGTSITVEGQAY FT ADDLLTAAPSAYHAARQVATIEEWANWAGVSFVVQALSLDAPAGKCAALAI FT NFEGGLMHSIDPALKVQGAAIPAMSRNNVYRYLGVHVGLTDALGQANELLE FT KASRDARTICASGLEPWQKVVAIKTFILSRLPFFFHNGKIQRGRCQQFDRE FT LRENLRAALRLPVCTTNAFFHSRVASGGLGILPIAEEQQVYLAAHVFKLLT FT SPDLSIRAIARHQLAEVTHARHTTPVQDGEASPFFGWLMRGQEVASTTPSG FT DVSSIWFAAAGAYSRMGWSVRDALHPTLTVGPGVQFEGRFQRANVIPALRA FT SAFSRHAVEWSALRTQGRAAAYQHAVHPATHHWVHNSAGLTTKEYRFAIKC FT RLGLLPTRAAPHHRNGPTACRACSYARETANHVLGHCPATKAEVIARHNRI FT CRALAQAAEASWTSVLEDVPIPGVDSPLRPDIYCSRPGQCAIIEVAVSYED FT AFNASMEGRAKQKTDKYAGLAATVEEQLRLQTRHAAFVVGFSGVVLPASVT FT ATATSLDLPPKTWNVLLKRCVAASIKGSYTAWRRFRRSTP" XX SQ Sequence 4287 BP; 781 A; 1658 C; 1173 G; 675 T; 0 other; cccgtcaagg gtgctccacg agatccctgt cgctagccga ccggttttac caccccaccc 60 cgcccggaca accacggacc ctgctccgca gcaggacccc acgcacgatg gccgctaaat 120 ccgtcgcctg ccctcacgat ggatgcgcca acaagtacgc gtcggaagcc tccctccgaa 180 gacacattaa gaacaaacac gctacagatg aggaaggaga tgagacctca cactcctgtc 240 cccactgcca ccgacctttc tccaccgccc gcgggctcag cgtccacatt ggcaaatcgc 300 accgtcaggc cccccctgag ccgacgcgcc cccccccggc cccggcccct gccgatcccg 360 gcctcgatcc cgaccccggc cccaccgtga cgccccccag ccgtgatgac gaagaccgcg 420 aggaacccga cgacgacccc gtggagatcg cggacctaag ctgccctcac tgcgcccagg 480 ccctcccgtc ggcccacggc ctcgccaacc accttcgcgc ctgcaaggac cacagggtcc 540 ccgcccctgg agcaccccgc tcgggtccgc ccagctccag gtactggact gctgtcgagc 600 accaccgcta tgtggaggcc atggcgcgct tcgcggatca ccccgaccta cttgcgcgcg 660 cggctgccca catcgggacc cgcacgtaca aacaggttga ctcccaccgc accaaggtga 720 tcgcggcgga gcgcgagggc cgccctgtcc gcacgctcga ccccacgatg gactggcgca 780 tgcggcccta ctgcgccagc accacggccc ggtggctggc tgagcagggg cgtagcccag 840 tagcgccccg ctcgccctgc cccgagcccc acgccccgcc gcctgcagcc gcgctgctgt 900 acatcccggc cacgcccccc gcgccaacgc cccgtgcccc agtggcgcct cccaagcttg 960 cgcctcccgc cgagagcacc gtgcccgcca cgcccgatgg gaatccggag gcgccagcac 1020 ccccgtttag cgcccccgga cctcccaccc ccaaggcatt gccgcccccg cccccgtccc 1080 gccgcaacct gcgccctcac ctcgtgccca aggatgcttg gcagggggtc gccgatgccg 1140 tcgcccctgc cgcctcgcgc ctcctgcgca cgccccttgc gcacctctcc accgagcagt 1200 gggccacgtt cgaagccgcc ctcgccggcc tcgaggctac gctccaccat gccgcccgca 1260 gtgcagaggc ggtgcccaca cgctgcgcta gccgagcaag ggaagacgcc gagcgccaac 1320 tccgtgaagc ccgaaagacg cgtgagatct ttggcaaggc cgctgccctc tacgcagccg 1380 gcaaggaccc cactgccacc atcgagcgca tccccccaga agtccgccta cacctgccaa 1440 cccctggctc ggctgaatgg cccgccaggg cggccgccgc ccgcagggtg atccgccgtg 1500 cagtcgcgcg agcggaccgg ttgcgcaagc gcatgggcat cctcgatagc gaccgcgacc 1560 tccaacgcct cttcaacgct aaccagaaga aggcagttcg gcagatcctc gccccgtcca 1620 ccaaggcgcc gcggtgccag ctagacccag ccgccgtcga ggaggcctac atccagaccc 1680 tcgccaagcc gccgccgatc gaccccagcc ccccgtggaa gaactccgtc cagtggcccc 1740 gcccgcccac tgccgccgat gacggaggca gccccttcag cgtcgccgag gtccgggccc 1800 agctccgccg actgcccaac gggtccgccc cagggatcga tggcataccg tacgaggcct 1860 acaagcgtac gaaactggac gccacgctcg cccatgtctt cgaggtcgtg cggctgaatg 1920 cgcgcctgcc agctcgatgg gatgtggcgc gcacggtcct gctctacaag aaaggcgacc 1980 ctaacgacac cggcaactgg cgaccgataa gcctccaggt caccatctat aagatcttca 2040 cggccgccct gtcgaagcgg ctcatctcct gggctggcaa gcacaacact ttctccgcat 2100 cgcagaaggg attcctaccg gccgaaggct gccacgagca cgcgtttgtc ttgcgaagcg 2160 tgcttgacga cgcccgtcgg cacaagcaga acgtgtacct tgcctggtac gatctgcgca 2220 acgccttcgg atcggtgtcg cacgacctca tcgcctggtg cgctgccatg ttgggcctgc 2280 cccgctacct ccgggatgcc atcggcgcaa tctatcggca ctcagcgctc ttcgtccaag 2340 ttggggatca ggagaccacc ggcgtcattc ctatgcgctg cggcgtcaag cagggctgcc 2400 ctctcagccc cctcctcttc aacctgtgcg tcgagccggc ccttcgctgc ctacgccgca 2460 ccaccgggta caagttctac ggcacgtcga tcaccgtcga gggccaggcc tacgccgacg 2520 acctgctcac tgccgcgccc tccgcctacc atgcggcccg gcaggtggcc acgatcgagg 2580 aatgggccaa ctgggcggga gtctccttcg tcgtccaagc cctctccctg gatgcgccgg 2640 ccggcaagtg tgccgccctc gcgatcaact tcgaaggtgg tctaatgcac tctatcgacc 2700 ctgccctcaa ggtccaaggc gcagccatcc cggccatgtc aagaaacaac gtgtaccgct 2760 acctcggagt acatgtcggt ctcacagatg cgctcggcca agcgaacgag ctcctcgaga 2820 aggcctcacg cgatgcacgc acgatctgtg cctctggcct cgaaccctgg cagaaggtgg 2880 tcgcaatcaa gaccttcatc ctctcccggc tccccttctt cttccacaac gggaagatcc 2940 agaggggccg atgccagcaa ttcgaccgcg agcttcgaga aaacctgcgg gccgccctcc 3000 gactccccgt ctgcaccacg aacgccttct tccattcccg cgtggcctca ggcggccttg 3060 gcatcctgcc catcgcggaa gaacaacaag tctacctggc agcccacgtg ttcaagctcc 3120 tgacttcgcc agatctgtcg atccgcgcca tcgcccgaca ccaacttgcc gaggtcaccc 3180 acgcgcgaca caccacgcca gtccaggacg gcgaagcgtc acccttcttc ggatggctca 3240 tgcgggggca ggaggtcgca tcaactaccc cctcgggtga cgtcagttca atctggttcg 3300 cagctgcagg cgcctactcg aggatgggat ggtcagtccg cgatgcactc cacccgacgc 3360 tgacagttgg tccgggcgtc caattcgagg gccgattcca acgtgccaac gtcatcccag 3420 ctctccgggc tagcgccttt tcccgccatg ctgtggaatg gagtgccctc cgcacccagg 3480 gtcgagcagc agcctaccaa catgccgtcc accctgcaac gcaccactgg gtccacaaca 3540 gcgctggcct gacgaccaag gagtaccgat tcgcgatcaa gtgtcgattg ggtctcctgc 3600 cgacgcgagc agctccacac caccgcaatg ggccaacagc gtgcagggcg tgctcctacg 3660 cccgcgagac ggccaaccat gttctcggac actgcccggc gaccaaggcc gaagtcatcg 3720 cgcgccacaa caggatatgc cgagctctgg cccaggcggc tgaagcctca tggacgtctg 3780 tccttgaaga cgtcccgatc ccgggggtgg actcccccct acgacccgac atctactgct 3840 ctcggccggg ccagtgtgcc atcatcgagg tcgcggtctc ctacgaggac gccttcaacg 3900 cttcgatgga gggccgggcg aagcagaaga ccgacaagta cgctggcctg gctgctaccg 3960 tcgaggagca gctgcggctc caaacccggc acgcggcttt cgtggtgggc ttctctggcg 4020 tcgtgctccc agcctcggta accgctacgg ccacctccct tgatctcccc cccaaaactt 4080 ggaatgtgct tcttaaacgt tgtgttgctg cctcaatcaa aggcagttac acagcgtgga 4140 gaagattccg gcgctctact ccataacaac catgtatggt gaaccacacc tctctcgatc 4200 ttgtattctg tgattggaca tcagagttcc tgcgaaggga tacactctgc caatctcgtg 4260 ggttgtaata aatccacacc ttcaaca 4287 // ID Gypsy-18_IS-LTR repbase; DNA; INV; 236 BP. XX AC ABJB010623531; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_IS_; KW Gypsy-18_IS-I; Gypsy-18_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-236 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010623531; Positions 237 2. XX SQ Sequence 236 BP; 43 A; 62 C; 70 G; 61 T; 0 other; tgatatcgtg ccaagtgacg tcgtgcggtg acgcgcagga ccagcgacaa gccggcggtg 60 tgatatcgta ccaagtgacg tcgtgcggtg acgtgcagga cgggcaggcg agcgtgcgcc 120 tttccttctc tttccttctc ttaagcgtcg ataagcatcg gcgctgaact atctaagccg 180 cgctatcgct tgaaataaac cttcttgttg ttggagctgc tgtctctgct tggtca 236 // ID Copia6-NVi_LTR repbase; DNA; INV; 226 BP. XX AC AAZX01000798; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia6-NVi; KW Copia6-NVi_I; Copia6-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-226 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1134-1134 (2007). XX DR Genome; AAZX01000798; Positions 15307 15532. XX SQ Sequence 226 BP; 66 A; 53 C; 37 G; 70 T; 0 other; tgttggaaat gagtgttcga attcgctaca taaccgctac taccaccact actacttccg 60 tgtataggag agtacaggga tactgtataa ataggtctct cgaggcgtac gagagtcatt 120 cgccgttcac accccgctta aataaaacgt taagtatatt atttaagcta ttgatatttc 180 attatcccat ctacctttcc acatacactc gatatcattt gctaca 226 // ID Gypsy-166_AA-I repbase; DNA; INV; 4505 BP. XX AC supercont1.387; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-166_AA_; KW Gypsy-166_AA-LTR; Gypsy-166_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4505 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.387; Positions 239216 234712. XX CC Positions [3457-3807] - Integrase core CC 'ACTCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 994..4482 FT /product="Gypsy-166_AA-I_1p" FT /translation="MSCVHKCKRRNHFERVCKLNKDKRKKSSKVKEIKDEN FT SDPEESTSSSNEEPSEESEDEYEIGKIIDNSAKGGSVLAELELKFSKKWEA FT VSCELDTGANTSLIGYDWLVKLTGKENPQLLPSTYRLQSFGGNPIKVLGEV FT KIPCRRRKRRYRLVLQVVDVSHCPLLSAKASRVLGFIKFCKAVKFGRSEAN FT ESDSVLRVHRMSAEKIIEDHKQIFVGYGKFDGKVSLEVDSSISPSIQSPRR FT VPISMRSKLKTELELLEKEGIIIREVMHTEWVSNLVLVQRGDPESTSIRIC FT LDPIPLNKALKRPNLQFATLDEILPELGRAKVFSTVDVKKGFWHVELDEAS FT SKLTTFWTPFGRYRWTRLPFGISSAPEIFQMKLQEVIQGLEGVECLADDIL FT VFGVGETMEEALINHNKCLTNLLVRLESQNVKLNRTKLKLCQTSVKFFGHV FT LSSEGLQPDKSKVATIQNFPTPLNRKEVHRFIGMVNYLSRFLPNLSCNFTN FT LRKLISESVPWLWTNIEEAEFNRVKAMVADVETLKYYNPRESLTMECDASC FT FGLGVAVYQNEGVIGYASRTLSATERNYAQIEKELLAILFGCIRFDQLVVG FT NPKTIVKTDHKPLINIFQKPLLSAPKRLQHMLLNLQRYNLVIEFVTGKDNV FT VADTLSRAPFDDKAVNDEYSKLQIYKVFGQVEDLKLSKFLNISSSCLNEIV FT KETRSDVTMQNIITLIQRGWPGSSDRVPDGVRIYFGFRDELSFQDGIVFRK FT DRILVPHALRRRLIDKCHVSHNGTEATLKLARANLFWPGMSNQIKDAVQSC FT PTCAKFASSQPNPPMLSHQIPVHPFQFVSMDVFFSEFRGRNQRFLVTVDHY FT SDFFEVDILKDLTPESVIDVCRKNFARHGIPQTMLTDNGTNFINQKMKMFT FT MEWDIEHITSAPHHQQANGKSEAAVKIAKRLLKKAEETGTDFWYALLHWRN FT IPNKIGSSPVARLFSRATRCGIPTSGINLMPKVVERVPAAIEENRRKAKLN FT YDRKARNLPELQTGSPVYVQVDPEVSKQWTPGIVTNRLSERSYTVNVRGGN FT YRRDLVHLKPRKESSEEIVSTPISTVPTPEDENNASCLTTLGSHSQRVPNQ FT QPSLSQAVVPVSSSPIRAPSLCQSPATTEAPITGGRPKRTIRIPEKLKDYV FT VDFD" XX SQ Sequence 4505 BP; 1354 A; 909 C; 1106 G; 1136 T; 0 other; tggtgtcaga agtgaacgaa tggttttaaa gtgtattttc gcgtcatcga aatcgcgatt 60 agtaaatcat ttgtgaaaca cgaaaattga agaaactagt ttacgagtcg cgaagcgtcg 120 gcggccattt tggaattttc tcgcgagtgt cgcgtagtgg catcgaaatt ttcgttgaaa 180 tttactgtga agtatttatc ctcattaaaa tggatgcagc ccagtttaaa gcgctgatgg 240 aacagcaaat gtacatgttt tctaagataa tggaaggagt gcagatgcgg tcgcataatc 300 aagaggcaat agcgccgaag cccagttcgt caaacgtgca agttccgcag ccatcgcccc 360 ttgttttgga aggcgatatg gacgaaaata tggatttctt tgagaagagt tggaaagatt 420 acgctaaggc tattggaatg gatcggtggc cacaggagga aaatggtcag aaagtgagtt 480 ttttgctctc cgtcataggg gagccggcga gaaagaaata cttcaacttt gagttgacag 540 ctgaacaaag tgttgatcca gatactgcat tggctgccat aagggaaaag gtggtagcta 600 agaggaacat tattgtggac cgacttgatt ttttttcgtc gatgcagcag actcgagagt 660 ctatcgatga atttgtttct cgattgaaga ccctagcaaa gatggctaag ctcggagatt 720 tgcaagaaga gttgatcgcc tataaagtgg tcacgtcgaa caagtggcca agtctgcgat 780 ccaaaatgtt accgttactg atataacatt gtcgaaggcg gtggacatgt gcagagcaga 840 ggaaattact gcgaaacgtt tccatgagtt gtcgataccg aatccggaag gggaagtgaa 900 caagatagca aaagctaaat atcgtcaaaa ttacaaatct caagtgcaaa agtgtaagtt 960 ctgtggtgat tatcacgact tttcacgtgg ttcatgtcct gcgttcataa atgtaaaaga 1020 agaaatcatt ttgaaagagt gtgtaaacta aacaaggaca agcgcaagaa gtcaagcaag 1080 gtgaaggaaa ttaaggacga gaacagtgat cctgaggaat ctacatcgtc ttccaatgaa 1140 gaaccttcgg aggaaagcga ggacgaatac gagatcggca aaataatcga taactcggcc 1200 aaaggaggca gcgtattggc tgagttggag cttaagtttt ccaaaaaatg ggaggcagtt 1260 agctgtgagt tggacacagg agctaataca agcctaatcg gttacgattg gttagtcaaa 1320 ctgaccggca aagagaatcc acagttactg ccgtctacat accgtttgca aagctttgga 1380 ggaaacccga tcaaagtgct gggagaagtg aagattccgt gcaggcgtag gaagcggcga 1440 taccgtttgg ttctccaggt tgtggacgtt agtcattgcc cgctcctttc ggcaaaggct 1500 tcacgtgtgt tgggattcat caagttctgc aaggcagtta agtttgggag atctgaagct 1560 aacgaatcgg atagtgtact gcgtgtgcac aggatgtctg ccgaaaaaat catcgaggat 1620 cacaagcaga tttttgtggg ctatggaaaa ttcgatggga aagtttcgtt ggaagtcgat 1680 agttccatat caccatcaat ccaatctcca cgacgagttc cgatatccat gagaagcaag 1740 ctgaaaacag aattggaact gttagagaag gagggtataa ttatccgcga agtgatgcac 1800 actgagtggg tcagcaactt ggtactagta caacgaggtg atccagaatc tactagcatc 1860 agaatctgtc ttgatcccat tccactaaac aaagcgttga agagaccaaa cctacaattt 1920 gccaccctcg acgaaatact accggagttg ggcagagcta aagtgttttc caccgtcgac 1980 gtgaagaaag gattttggca cgtagagctc gatgaagcta gcagcaagct cacgacattt 2040 tggacgccct ttggcagata ccgttggaca aggttaccgt tcggaatatc ttcggctcct 2100 gaaatttttc agatgaagtt gcaagaggtt attcagggtc tcgaaggtgt ggagtgtttg 2160 gccgatgata ttttggtgtt tggagtcggc gaaacaatgg aggaagcact catcaaccac 2220 aacaagtgct tgacaaattt acttgttcgg ctggaatctc aaaacgtaaa actgaacagg 2280 acaaaactca agctttgcca aacctccgtg aaattttttg gacacgtgct gtcaagtgaa 2340 ggtcttcagc cggacaaatc aaaggttgct acgattcaaa actttcccac accactcaac 2400 cgaaaggaag tgcaccgttt cattggaatg gttaactatt tgagccgttt cctcccaaat 2460 ctaagctgca actttacaaa cctgcgaaag ttaatttcgg aatcggttcc atggctatgg 2520 acgaacatcg aggaagcgga gttcaatcga gtgaaagcta tggttgccga tgtggaaact 2580 ctgaaatact acaatcccag ggaatcattg actatggaat gtgatgcaag ttgcttcgga 2640 ttaggagtcg cagtttacca gaatgaagga gtaattggtt atgcatcaag aacgttatcc 2700 gcaacggaac gaaactatgc gcagattgag aaggaattgt tggcgattct gtttggatgt 2760 attcgtttcg atcagctcgt agtgggaaat cctaaaacaa tagtaaagac ggatcacaaa 2820 cctctcatca acatcttcca aaagcctcta ttatctgcgc caaagagact ccagcatatg 2880 ctcctgaatc tgcaacgtta taacctggtg attgaatttg taaccgggaa agacaatgtt 2940 gtagcggata ccctttcacg tgctcctttc gatgacaagg cagttaatga cgaatatagc 3000 aagctgcaga tctacaaagt ttttggacaa gtcgaagatc tgaaactatc caagtttctc 3060 aacatatcga gcagctgtct gaacgagatc gtgaaagaaa cgaggagtga tgtcacaatg 3120 cagaacatta tcactctaat tcaacgaggt tggcctggtt catcagatcg tgttcctgat 3180 ggagtacgga tatacttcgg ttttcgagat gaactttcct ttcaagatgg catcgtgttc 3240 agaaaagatc gcattctggt tcctcatgct ttgcgaaggc ggctgatcga taagtgtcac 3300 gttagtcata acggaaccga agcaacgttg aagttggcca gagcaaacct gttttggcca 3360 ggaatgagta atcaaataaa agatgcggtc caaagttgtc ctacgtgcgc gaaatttgct 3420 tcgtcacagc caaatccacc gatgttgagc caccaaatac cagtacatcc tttccaattt 3480 gtctcgatgg atgttttctt cagtgagttt cgaggtcgaa atcaacggtt cctggttaca 3540 gtcgaccact attcagattt cttcgaggta gatatcttga aagatctaac gccagaatcg 3600 gtgattgatg tatgtagaaa aaacttcgca cgtcatggaa taccccagac catgttaacc 3660 gataatggaa caaatttcat taatcaaaaa atgaaaatgt tcactatgga atgggacatc 3720 gagcacatca cgtcagctcc tcatcatcaa caggccaatg gtaaatctga ggcagctgta 3780 aagatagcaa aacgcttact caaaaaggct gaagaaaccg gaacagattt ctggtacgcg 3840 cttttgcact ggaggaacat tcctaataaa ataggatcta gtcccgtcgc gcggttgttt 3900 tctcgcgcca cccgctgtgg aattcctaca tctggaatta atctgatgcc gaaggtggtg 3960 gaacgtgttc cagcagcaat cgaagagaac aggcgaaagg cgaaactgaa ttacgatcgg 4020 aaggcaagaa acttacccga attgcagacc gggtctccgg tatatgtgca agtggatcct 4080 gaagtctcca agcaatggac tcccggtatc gtcacaaacc gtctaagcga aaggtcctac 4140 acagtgaacg tacgtggtgg taattatcgt cgagatttgg tccatttgaa acctcgcaag 4200 gaaagttcgg aggaaattgt atccactccc atcagtactg tgccgacacc ggaggatgaa 4260 aacaacgctt cttgtttgac aacattgggt tctcattccc agcgagttcc caatcagcag 4320 ccgtcgctct ctcaggcagt ggtaccagtt tcttcctcac cgataagagc gccatcgttg 4380 tgtcagtcac cagcaactac agaagcacca ataactggag gtcgtcccaa gcgcactatt 4440 cgcattccag aaaagttgaa ggattatgtt gtggactttg attaatttca ttgaaacagg 4500 gagga 4505 // ID Copia-24_SI-I repbase; DNA; INV; 4240 BP. XX AC AEAQ01023605; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-24_SI_; KW Copia-24_SI-LTR; Copia-24_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4240 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023605; Positions 5040 801. XX CC Positions [1645-2181] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 217..4170 FT /product="Copia-24_SI-I_1p" FT /translation="MADSTIDLKNITKFDGTNFQLWKFQMKIVLTASGLMD FT IADGTTPKPEPATDNYAAWNTKNAKAMCILSSAVEYSQLEYLVTCEDAAEM FT WAKLSSIHEQKSATNKLTLMTKFHEYKMASADSVAQHVAKIENMARQLKDV FT GEELSEVMIMAKILGTLPQKFSPLITAWDSVSQPNQTKDNLIERLLIEEKR FT LSNFEEATTALATIKIHKKSGAAHASRSRKETQTTKDSRYEKEEISCRYCR FT KRGHPEKRCYKKRDDIRNKRDSDNSAKEKDDNTANFGAFTISNDDFASRML FT KHDSKEVWLLDSGASKHMCFRREWFSELDNSYRESVSLGDNSTCEVMGRGT FT IHIKMYVNGKWLNGKMDDVLYIPSLRKNLFSTGICTSKGYILKFESNNVKI FT IRDKSVMACGIKQHNNLFRLLIKVDIVNEANAVSSNSLKTWHERLGHINCQ FT SLREMIKKNLINGVSLSNSDKFFCESCQFGKQHRLPFKSEKRFRDRRIGEF FT IHTDVCGPMSEASIGGSRYFLLFKDDKSSFRRIYFLTYKSETFEKFKEFEE FT LIFNKFGTRIKTVRADNGTEFCNNNMRNYFARHGITLQTSAPYTHEQNGRS FT EREIRTITECARTMLQAKNLPTRFWAEAVNTAVYILNRCISSQTREITPFE FT LWHKEKPNLSHIRKFGSDAFAHVTKEHRKKWDPKSKKLILVGYEGESTNCR FT LFDPLTNKIIITRDVIINENSDNDSNIKSDNASIKITSQDENIFEPLQNIQ FT NDSTEPINEIDNSDRIKPVANTYNLRNRQSVKPPERYEASIAILDEPATFI FT EATTGENAIQWKSAIKKELEAHEKNNTWTLVSPPKDYNIIGCKWVFKIKEN FT PSENVIRFKARLCAKGFSQKAGIDFTETFSPVVRYDSIRTLLAVAAHENLE FT IGQFDIKTAFINGILNEEIYMRLPEGIETDDHNVVCKLNKALYGLKQAARC FT WNKQFDEFLQKFNFQQSNADQCVYLGQFENEKVYLALYVDDGLILTSSKDV FT LNKILQTLSSAFETTTGNGKTFIGMEIERDRLNNTIFIHQKHYINRLLARF FT GMFDANPVSIPADHNVNLRSADTEDMTCCNIPYREAVGALLFLASVSRPDI FT SYAVGIVSRYLNNYDATHWNAVKRIFKYLLKTPEFGIVYNNSDNKFAIKLT FT GYCDSDYAADIDTRRSTSGFIFKITNGPVTWSSKRQATVSLSTTEAEYIAA FT SMATKEAIWLRKFLSDIGRPCDNATTLMIDNQSAIKLIKNAEFHSRTKHID FT VRYHFIREKLKNLDINVEYISSQDQLADILTKGLPRDRFKYLRNKIGIIDR FT QDALNS" XX SQ Sequence 4240 BP; 1454 A; 908 C; 847 G; 1031 T; 0 other; acaggttatg ggcccaggcg acgcaatcgg tgaatatcag tatctataaa cgagagagtc 60 agcaaatcta accgattcga gagccgaggc gcagcaacgc catcgtcctt tcgcacgaca 120 ctgcacacgt gatcgaagaa ttttttttta cgctctcaga agccttccag tgatcttttc 180 tataaacgac actttgtgtg aatcctcttt attacaatgg cagacagtac aatcgatctg 240 aaaaacataa ccaaattcga cggtacaaat ttccaattat ggaaattcca aatgaagatc 300 gtcctcacgg caagcggctt gatggacatc gcagatggaa cgaccccaaa acccgagcct 360 gcgacagaca attacgcagc atggaataca aagaatgcca aggccatgtg cattctatca 420 tcggcggtag aatattcgca gctcgaatat ttagtcacgt gcgaggacgc cgccgaaatg 480 tgggctaaac tcagttccat ccacgagcaa aaaagtgcca cgaacaagtt aacgctaatg 540 acaaaattcc acgaatacaa gatggcatcg gccgactctg tcgcacagca cgttgccaaa 600 atagaaaata tggctagaca attaaaggac gtcggtgagg aactatccga agtcatgatt 660 atggcaaaaa ttctcggcac cctcccacaa aaattcagtc cgttgataac cgcctgggat 720 agcgtcagtc agcctaatca aaccaaagac aatctcatag aacgtctcct catagaggag 780 aaacgtttat caaatttcga ggaagcgacg actgccttag ccacgatcaa aattcacaag 840 aaaagcggcg ctgcacacgc cagcagatca aggaaggaaa cgcaaacaac aaaagactcg 900 aggtacgaga aagaagaaat ctcctgccgg tattgtcgca aacgaggtca tccagagaag 960 cgatgctaca agaagcgcga cgacatcaga aacaaaagag atagcgacaa ctcagcgaag 1020 gagaaggatg acaacacggc caatttcggt gcgttcacga tatccaatga cgacttcgcg 1080 tcacgcatgc tcaaacacga ttcaaaggag gtatggttat tagatagcgg tgcttcaaag 1140 cacatgtgtt tccgccgcga atggttcagc gaattagaca atagttaccg cgaatctgta 1200 agtctcggag acaactcgac atgcgaggta atgggacgcg gtacgattca tattaaaatg 1260 tatgtcaacg gcaagtggct taacggcaaa atggatgatg tactatacat accatcatta 1320 cgcaaaaatc ttttttcaac cggtatctgt acttcaaaag gttatatttt aaaatttgaa 1380 tcgaacaatg taaaaattat tcgcgataaa tcagtgatgg cttgcggaat caaacaacat 1440 aataatttat tccgattatt aattaaagtc gacatcgtca atgaagcaaa tgcagtctca 1500 agtaatagtt taaaaacttg gcacgaacga ttaggccaca taaattgtca atcacttcgc 1560 gaaatgatta agaaaaattt aattaacggc gtttcattat ccaatagtga taagttcttc 1620 tgtgaaagct gtcaattcgg taagcaacat cgtttaccat tcaagtccga aaagagattc 1680 cgtgacagga gaataggtga atttatccat acggacgtat gcggtccaat gtccgaagca 1740 tctatcggag gctctagata ttttttactt ttcaaagacg acaagtcaag tttccgtcgt 1800 atttattttc taacgtataa aagcgaaact ttcgaaaaat tcaaagaatt cgaagaatta 1860 atattcaata aattcggaac acgaataaag accgtcagag ccgataatgg taccgaattt 1920 tgcaacaaca atatgcgcaa ttattttgcc cgtcacggaa tcacccttca gacatccgct 1980 ccgtacactc acgagcaaaa cggtcgttca gagagggaaa ttaggacaat tacagaatgt 2040 gcccggacaa tgctacaagc caaaaatttg cccacgcgtt tttgggctga ggcagtcaat 2100 acagccgttt acattttaaa tcgttgcatc tcttcacaaa cgcgcgagat cactcctttc 2160 gagttatggc acaaagaaaa accgaactta tcacacatac gaaaattcgg tagcgacgca 2220 tttgcgcatg taactaagga acatagaaag aaatgggacc ccaagtccaa gaaattaatt 2280 ctagtaggct atgaaggcga atctactaac tgtcggcttt tcgatccatt aacaaataaa 2340 attattataa caagggatgt aataattaac gaaaattcgg ataacgattc aaatataaaa 2400 tctgacaatg catctataaa aattacatct caagatgaaa atatttttga acctcttcaa 2460 aatatacaga atgactcaac agagccaatt aatgaaattg acaactcgga tcgaattaaa 2520 cccgtcgcga acacttataa cttaagaaac cgacagtcag ttaaaccgcc agaacgttat 2580 gaggcgagca ttgcaatact tgacgaacca gcaacgttta tcgaggcaac tacaggggaa 2640 aacgcgattc aatggaaatc agcaattaaa aaggagcttg aagctcacga aaagaataac 2700 acgtggactt tagtttctcc gcctaaagat tataatataa tcggatgcaa atgggtattc 2760 aaaataaaag aaaatccctc agaaaatgtc attcgtttca aggcacgcct atgtgcaaaa 2820 ggcttttcac aaaaagccgg gatagacttt accgaaactt tctcccctgt agttcgctac 2880 gactcaattc gtacgctcct ggcggttgcc gcgcatgaaa atttagaaat cggtcaattc 2940 gacattaaaa cagccttcat taacggaatt ctaaacgagg aaatttatat gcgactaccc 3000 gaaggtatag aaacagacga tcacaatgta gtatgcaagt tgaacaaagc cttatacggt 3060 ctcaaacaag cagcaagatg ctggaacaaa cagttcgacg aatttctaca aaaatttaac 3120 tttcaacaaa gcaacgccga tcagtgtgtg tatcttggtc aattcgagaa cgaaaaggta 3180 taccttgcac tttacgtgga cgatggactc attctaacct catcaaaaga cgtattgaat 3240 aaaattttgc aaaccttaag ttccgctttc gaaacgacca cagggaacgg gaaaacattt 3300 atcggaatgg agatcgagcg cgatcgactg aataacacta tttttatcca tcagaaacat 3360 tatattaatc gtctgttggc gcgttttggc atgttcgacg cgaaccctgt atccattcct 3420 gccgaccata atgtaaatct gcgatccgca gataccgagg atatgacttg ctgtaacata 3480 ccatatcgtg aagctgtcgg agccttactg tttttagcgt ccgtatcgag gcccgatatc 3540 tcttacgcag tcggaatagt cagtagatac ttaaataatt acgacgcaac tcactggaac 3600 gcagtgaagc gtattttcaa atacttactg aaaactccag aattcggcat tgtatacaat 3660 aatagtgata acaaatttgc tataaagtta acaggctatt gcgactccga ttatgccgcg 3720 gatattgaca cacgccgttc gaccagcggg tttattttta aaattacgaa cggtccagtc 3780 acttggagtt ccaagcgtca agcaacagta agtctcagca caaccgaagc cgagtatatc 3840 gccgcaagca tggcgaccaa agaggcgatt tggttaagaa aattcctgtc tgatatcgga 3900 cgtccttgtg acaatgcgac gacactcatg atagacaatc aaagcgcaat aaagcttatc 3960 aaaaacgcag aatttcacag tcgcactaaa cacattgacg tgcgttatca ttttatcaga 4020 gaaaagctca aaaatctcga tatcaatgtg gaatatattt ccagtcaaga ccaattagcg 4080 gatattctca cgaaaggtct accacgagat agattcaaat acttgcgaaa caaaatcggc 4140 attatagacc gacaggacgc tttaaactca taaaccatag ccgatttttt tttgtttgtt 4200 gttcaattat taataccata aaatgtgcaa gcggggggag 4240 // ID P-10_HM repbase; DNA; INV; 2908 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2908 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 356-356 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 109..2625 FT /product="P-10_HM_1p" FT /translation="MGKKCCIYGCTTNYLSAKKKSDNLKISVYRFPKNEDE FT KQMWIKVIPNANLKVTNDTVVCELHWPINFEKQQVRGKFRPKFPPSVWPGI FT PSCQIPTQPPPLRSTNSASSCIRNFAQDELSAFLEKDRVTFAEMKEILLCN FT ERDLGVPSVCFMINSIIHLQSQAYSNGVPYFLIKIRDDLHFETYHYGVKIK FT IPTISKNRVSKINAWSILEEVLRYLKSLIPGRKEIVIQEHLASMGTQCVGN FT KIYSKEIIIRSFQYFATSRSLYNQLRTDYQLPSVRTLTRITSKVSKLNEKL FT FFTNIFQSIEEKQKICLLLHDEVYVKKMLLYHGGTLFGKAMDDSTSLAKTV FT LGLMVNCLFGGPSFISKMIPISKLNSQFLFEQISITIQSIKESSGQVKAII FT CDGNRINQAFFKIFEFTVPSKPWLTNDGVFLIFDYVHLLKNIRNNWLTEKT FT GQLIFYDKGVQKVAQWDHLKKLYELESKNLVKLSNLNEVSIFPKPIERQSV FT STCLRIFCDKTYHALINHPGLQHIKERQDTADFINIVVSWWKVLNVKGTGA FT DVRFNDELQAVIRDPNDNRLENILDFGNMALKMAGKQGKRKKQLTCDTAQS FT IYHTCNGVVELCRSLLNSSHQYVILSEFSTDPLEKEFSKLRQGSGGTYFIT FT VQQVIEKLNISRAKLLLSLNNPAVDICLDTTHSCPNCGFLLDESSAEIFDN FT LPNLESFISSDTNMALIYISGYLTRKDNELSENELLEKTTFYHQKFGQYID FT SIDRGGLNIPTDNTCQWVIFSFIIFNTIKDKVCRVSLSNILMMISEFYNFE FT MHRHHGNILSNIFLNNLCKLSTPRSNKEPAQKILKLS" XX SQ Sequence 2908 BP; 1014 A; 425 C; 473 G; 996 T; 0 other; cacggcgatc tatactgcag gccttgccaa cgcgcaaaaa acgtatttca ctgggggcct 60 attttaaact gtacttaact ataatcgcca ttttagtgat attgaaatat gggtaaaaag 120 tgttgtattt atggctgtac cacaaattat ttatcggcta aaaagaaatc tgataattta 180 aagatctcag tatatcgatt tcctaaaaat gaagatgaaa aacaaatgtg gattaaagta 240 attccaaatg caaatctaaa ggtaactaat gacactgtag tttgtgagtt gcattggcca 300 ataaattttg aaaaacaaca agttagagga aaatttcgtc ctaaatttcc accatctgtt 360 tggccaggca taccgtcttg tcaaattccc actcaacctc ctccattacg ttcaacaaat 420 agtgcctctt catgtattcg aaattttgct caagatgaat tatcagcctt tttagaaaaa 480 gatagagtaa cttttgctga aatgaaagaa attcttttat gtaatgaaag agatttgggt 540 gttccatcag tatgttttat gattaatagc attattcatt tgcaatcaca agcttattca 600 aatggggttc catatttttt aatcaagatt agagacgatt tacattttga aacttatcac 660 tatggagtca aaattaaaat tcctacaatt tctaaaaaca gagtttcaaa aattaatgca 720 tggtcaatat tagaagaagt cctacgttat ttaaagagtt tgattcctgg aagaaaagaa 780 attgtaattc aagaacattt agcttctatg ggtacacagt gtgttggaaa taagatttat 840 tctaaagaaa ttattattag atcttttcag tattttgcta catccagaag tttgtataat 900 caattgcgaa cagattacca attgccttct gtaagaactt tgacaagaat aacttcaaaa 960 gtttctaaat taaacgaaaa actttttttc actaatattt ttcaatctat tgaagaaaaa 1020 cagaaaattt gtttgttatt gcatgacgag gtttatgtta aaaaaatgtt gttatatcat 1080 ggtggtacat tgttcggaaa agcaatggat gattcaactt ctcttgcaaa aactgtattg 1140 ggtttaatgg ttaattgttt attcggtgga ccatcgttca tttccaaaat gatacctata 1200 agtaaattaa actctcaatt tcttttcgaa caaattagca ttactatcca atcaataaag 1260 gaatcatcgg gtcaagtcaa agccataata tgtgatggaa atagaattaa tcaggctttt 1320 tttaaaatct tcgaattcac agttccaagt aaaccttggt tgactaatga tggtgttttc 1380 ttaatctttg attatgtaca tttgcttaaa aacattagaa ataattggct tactgaaaaa 1440 actggtcaac ttatatttta tgataaaggg gtacaaaaag tagcacaatg ggatcacctg 1500 aaaaaacttt atgaacttga atctaaaaac ttagttaaat tgtctaattt aaacgaagtt 1560 tcaatatttc cgaagccaat cgagaggcag tcagtgtcta cctgtttaag gatattttgt 1620 gacaaaactt accatgcttt gattaatcat ccaggactgc agcatataaa agaaagacaa 1680 gacacagcag attttattaa cattgtcgta tcatggtgga aagttcttaa tgttaaagga 1740 acaggtgcag atgttaggtt taatgatgaa cttcaagcag ttattagaga tcctaatgat 1800 aatagattag aaaatatttt agatttcgga aatatggctt taaagatggc tggaaagcaa 1860 ggtaaaagga agaaacaact tacttgtgac actgcacaat caatttacca tacgtgcaat 1920 ggtgttgttg aattatgtcg ttcattattg aatagctcac accaatatgt tattcttagt 1980 gagttttcta ctgatccatt ggaaaaagag tttagcaaat tgcgacaagg ttccggaggt 2040 acatatttta taacagtaca gcaggtaatt gaaaaactga atatttcaag agccaagctt 2100 ttactatctc ttaataaccc agctgttgac atatgtcttg atacaaccca ttcttgtcca 2160 aattgtggtt ttttattgga tgagtcctct gctgagatat ttgacaacct gccaaatctt 2220 gaatctttta tttcttctga tacaaatatg gcacttatat acatatcagg atatttaact 2280 cgtaaagata atgaactatc agaaaatgag ctgctagaaa aaacaacttt ctaccaccaa 2340 aagtttggac aatacataga ctcaattgac cgtggtggac ttaatattcc cacagataat 2400 acctgtcaat gggtaatttt ttcttttata atttttaata cgattaaaga taaagtatgc 2460 agagtttcat tgagcaatat tctaatgatg atatcagagt tttacaactt tgagatgcat 2520 cgacatcacg gaaatattct ttcaaacatt tttttaaaca atttatgtaa actttcaact 2580 cctcgttcaa ataaagaacc agctcaaaaa atattaaaat tgtcatgaat tcttaaaata 2640 ttattgtatt cttatcacaa tggttttttg tttttgaatt tgagtttttt tgtttttgaa 2700 tttgaaaaaa tttataagtt aagaaatatt ttgtttgaat tttagtttgt ttgttataaa 2760 atgaaacagt ttgagagtaa aaagttcatt gagttaatat aaaattactg aacacttttt 2820 tgttttgttt ttgccgttaa aataaaatag gcccccagtc aactaagctg tttgcgcgat 2880 ggcaaggcct tcagtataga tcgccgtg 2908 // ID hATm-46_HM repbase; DNA; INV; 3884 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-46_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3884 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1940-1940 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(626..1273,2433..3590) FT /product="hATm-46_HM_1p" FT /translation="MGKAKAKKGAVKPINYNPRRPVYLLETPLSSLPSRQL FT PTNGHVIRYFLYVISPRVKRFRPIKTNFACAQKTSSGDMKCKGTDGPCNAV FT TGEYCLLRSIVNIWASCGFQNYVLSEVSILNKFLEIYKQYKAVCNEKTLKM FT KPNASQKQQEKFLIMTDNLFDISSQQFEYTMRQDTSMPRDSATIKEDLDFL FT QDQRQDRKMFISACVDEELEARVINIQSLVRVLGQSVLWLLCRHHIAEIHI FT KWGIKAIFKEQTVGPSKQMYKDLKHAWMNSYYPIVSKAGAEKKFKKFPESE FT LVVGSDLHSLYLKSKKFIAYSLEYNVFPRNDYNHLVKYLAFYMNINSPKLE FT KFSLYQPGANHNARFMCDSIYLGLLDITSPIINFLSDDQKTLVSKANFIVA FT LYFAPSFLKSSKAEHAVVNDLEAYKAAKIIQKDWDYDVGSALLKSLQNHTW FT YLSPKCVTMALADPDLDRSTKMSVIGKLLTFPVPNKKDLSLEAPERVSISE FT STKLEDLITEESWLLFIITDTTSVVREWYQTEGDLEELASYKDFCSFVCGL FT SVTNDCAERNIGLIQEYVDTSHNEDQRQNILLVVREHRKLVSKKMPQSQLM FT LNK*" XX SQ Sequence 3884 BP; 1350 A; 626 C; 665 G; 1239 T; 4 other; ttagggtgac ccaaaaatgc aaaaaaaaaa aaaaaattta gctaaggtgt cctatggatt 60 ttgctacata atatgagttc acaggcaaaa ttttaagtat gctggatgct tgtaacagct 120 acaagtaaaa atccattttt acaaggaatt agcattttta ttttataaac aagtatattt 180 tacatgtttg tacaataaaa taatatcttt tacattgtat aagatatttg atatgattat 240 ttcagacatt ataaataata catgtattat aatgttaaaa atcaatttat ttgacgaatt 300 tcagcagata tcattttcct gttcaagctt acgtaacaga ggaaatagta gattaatgct 360 gatatacaat aaaaaaaact tttgtttatt tttattcata ttattttatt cataatcata 420 ttgacaatgc ccaccaactc ctctgcccct ttatccttac tctaaatcct aggaccccac 480 caccccaccg ttactactgt ctgccattaa gttcttatta tattattact caataygctt 540 aaaagcacag tttattaatt tttatattta actttataaa tgtttaaatc aattctttgt 600 tcaaacattt gaatatttca gaaaaatggg aaaagctaaa gctaaaaaag gagctgtaaa 660 gccaatcaat tacaacccaa gaagacctgt ctatcttctt gaaactccac tgagctctct 720 acctagcagg caattgccta ctaatggtca tgttattaga tattttctgt atgttatttc 780 ccccagggtt aaaagattca gaccaatcaa gactaacttt gcttgtgcac aaaagacatc 840 ttctggagat atgaagtgca aaggaactga tggaccttgc aacgctgtga ctggagaata 900 ctgtttgctc cggagtattg taaatatttg ggcctcttgt ggtttccaga actatgttct 960 ttcagaagta agcatactaa acaagtttct agaaatttac aagcaatata aagctgtttg 1020 taatgaaaaa acacttaaaa tgaaaccaaa tgcatctcaa aagcagcaag aaaagttcct 1080 gataatgaca gataacctgt ttgacatcag ttctcaacag tttgagtaca ccatgcgtca 1140 ggatacttct atgccaagag acagtgctac aataaaagaa gacctagact ttcttcaaga 1200 tcaaagacaa gacagaaaaa tgtttatcag tgcttgcgtt gatgaagagt tagaggccag 1260 ggtaataaac atttaatgaa actaaaactg attaagattt gtcaaatatt taaacatttt 1320 tatatactaa atttcaaaag tttttattta tttagatcga gagaagaatt gaaagataca 1380 ctagacacga agagctacgg aagaaagctg aaacagatcg gttggttcct aataatccag 1440 tgagaacaga caataaagat caggactata taagtgaaga tgaaccacct tttaaaaaat 1500 gtaaaagaaa atcagtatta ggtataatgt tttttagtat cttttaatta tttcctagtc 1560 atttctctct cttccccatc ctttttaata tcatgtcatg caagagataa aaatttttga 1620 tttcttcatt ctaaaaaact tctttgatag gtcctcccca gctgtaccct gatagagaag 1680 attatcaccc tgacttaagc aatgaaccaa ctttggctga taagcctgat agacctaaat 1740 caagacacag atctggtccc tcagccttag cagtaaacaa ggagatcact agggagggtt 1800 tgattgcagc aaccacacca acttgtatga gagctggtat cagtataaga gatcaaactc 1860 tattyatctc tagcattgta aattatcttg aaggagatat cacagagtta aggataagca 1920 gagagtcgat tagaagaggc agatacaaac taatccattc tgaaggtaac aaattatact 1980 gttttaacat tttagctttt agattgtaat aatataaaaa tctgttttag attgctgatt 2040 gataaagtta ttaaatttaa agataaaaat ctgtttaggt tgtaaaatcc gtaatgagta 2100 tatggaaaca atgaagaaca aaccattggt gctacacttt gatggaaaaa ttgtaaagca 2160 catagaagaa gaatcgagga aaagagttct tattgataga ctagcagtat cagttacaag 2220 tcctgagttt tcgtcaaaca atgatttact tcttggagtt gttcctgtgg acaatggaaa 2280 agctaatgat atggctattg tactccagaa tctggtagaa tatttcaaaa tatcagagaa 2340 tatcattgct gtgtgtactg acacaacagt tataaacaca ggaaaaaaat caggagcaat 2400 tgtaacttta gtcagagttt tgggacaatt agcaatcttt agtcagagtt ttgggacaat 2460 cagttctctg gctcttgtgt cggcaccata ttgcagagat ccatataaaa tggggcatca 2520 aagctatttt taaagagcaa actgtgggac catctaaaca gatgtataaa gatttaaagc 2580 acgcttggat gaatagttat taccctattg ttagtaaagc aggagctgag aaaaaattca 2640 agaagtttcc ygaaagtgag ctggttgttg gaagtgatct tcacagttta tatctcaagt 2700 ctaagaaatt cattgcttat tcgcttgagt ataatgtttt tcctaggaat gattataatc 2760 acttggtaaa ataccttgca ttctatatga atatcaattc acccaagctt gaaaagttta 2820 gtttgtacca accaggtgct aaccataatg ccagatttat gtgtgacagc atctacttgg 2880 ggcttctgga tattacgtct cctattatta acttcctgag tgatgatcag aagactttag 2940 tctctaaggc taactttatt gttgccctat actttgctcc ttcatttctc aaatccagca 3000 aagctgaaca tgcagtagta aatgaccttg aggcatacaa agctgctaaa atcatccaaa 3060 aggattggga ttatgatgtt ggtagtgctt tattaaaaag ccttcaaaac catacttggt 3120 atctttcccc aaagtgtgtt actatggcgt tggctgatcc tgatcttgac aggtctacca 3180 aaatgtctgt cataggaaag ctacttacat tccctgtacc taacaagaaa gatttaagct 3240 tagaggctcc agagagggta tccatctcag agtctactaa attagaagac ttaattacag 3300 aggaaagttg gcttctgttt attattaccg acaccactag tgttgtcagg gagtggtatc 3360 agactgaggg agacttggag gaacttgcat cttataaaga cttttgtagc tttgtatgtg 3420 ggttaagtgt cactaatgat tgtgcagaaa gaaacattgg tcttatccaa gaatatgttg 3480 atacatccca caatgaagat caacgccaga atatattgtt ggttgtccga gagcatagaa 3540 agcttgtctc aaaaaaaatg ccacagtctc aactcatgtt aaataaatga tgttagaaaa 3600 taattgtatt gatttttttt taatcagttt attatcttaa ataaagacaa tttcaaaaaa 3660 cagtttcaaa tttaattttc ttcataatya agttaaaaat gtatcatttt caccaaaatt 3720 catgtttgga attaaaactc aatttttact tgtagctgtt cccaacacat ttaaaaggaa 3780 atatttttgg gagggactct ttatatggcg tataatggat aggaaacctt agcatgttca 3840 aaatctaaaa aaaaattttt ttttgcattt ttgggtcacc ctaa 3884 // ID nimbus repbase; DNA; INV; 5845 BP. XX AC EF413180; XX DT 28-SEP-1995 (Rel. 14.06, Created) DT 25-MAR-1997 (Rel. 14.06, Last updated, Version 1) XX DE A transcriptionally active non-LTR retrotransposon - complete DE sequence. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; BGLINE; KW LINE_BG; nimbus. XX OS Biomphalaria glabrata OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Panpulmonata; Hygrophila; Planorboidea; Planorbidae; OC Biomphalaria. XX RN [1] RP 2375-4346 RA Knight M.; RT "LINE_BG."; RL Direct Submission to Genbank (20-AUG-1991)M. Knight, Biomedical RL Research Institute, Schistosomiasis Group, 12111 Parklawn Drive, RL Rockville MD 20852, USA. XX RN [2] RP 2375-4346 RA Knight M., Miller A., Raghavan N., Richards C. and Lewis F.; RT "Identification of a repetitive element in the snail Biomphalaria RT glabrata: relationship to the reverse transcriptase-encoding RT sequence in LINE-1 transposons."; RL Gene 118(2), 181-187 (1992). XX RN [3] RP 1-5845 RA Raghavan N., Tettelin H., Miller A., Hostetler J., Tallon L. and. RA and Knight M.; RT "Nimbus (BgI): An active non-LTR retrotransposon of the RT Schistosoma mansoni snail host Biomphalaria glabrata."; RL Int. J. Parasitol 37(12), 1307-1318 (2007). XX RN [4] RP 1-5845 RA Kapitonov V. and Jurka J.; RT "Nimb - a novel clade of non-LTR retrotransposons."; RL Direct Submission to Repbase Update (04-JUN-2009). XX DR GenBank; EF413180; Positions 13 5857. XX CC Nimbus was originally assigned to the I clade [3]. However, it CC was recently reassigned to a novel Nimb clade, which includes CC retrotransposons from insects, molluscs and fish [4]. XX FH Key Location/Qualifiers FT CDS 554..1936 FT /product="nimbus_1p" FT /translation="MDKKTHTKLLAPDLCLGKKRRIDDKKEEVPKSCATWP FT SFLVVRCTEEGKSLTKLSPFIIYKGLKSVVGEPKSVSKLHKTMELLVEVDS FT KTHSDGLLRCRRLCDLQVEVMPHKSLNTSRGVISSRDLLECSEKEIVEGIE FT GVTHARRITRRREGEEIKTATIILTFGTRTPPEYVKAGYLRVPVRPYIPNP FT MRCFKCQGYGHGAAVCKRNTVCARCAGEGHEDKGCTAQFKCPNCQAGHSAY FT SKDCPVWKQEVAVQEYKARNGCTFSQAKSAVLALPKGQFGLTKTYAQAVAK FT KGRSIATQTDTLTPPPPPPPPTQKKTPPKNTPLKKQESPVVMLKNRFSVLA FT DESMETEQHVKTNTPGEPGDIMSDTGSPQRTQPDTPTSTELEVDQLPKGRN FT QSGEDARGERGGNNLRSNQRDLPSPERKTSPKRGLSSSPSKPKNKNTKVTG FT IKGNPSGGLPRPNSSK" FT CDS 1957..5622 FT /product="nimbus_2p" FT /note="contains the APE endonuclease, reverse FT transcriptase and ribonuclease H domains." FT /translation="MDSRIVQWNCRGLKANYEEMQLLMDSETPVAVCLQET FT FLKDCISFRSYRAYTKNVEDAERASGGVCILVKDSIPHERVELQTTLQAVA FT ARITLHKVITCCSLYLPPGAPLNRTDMEDLLKQLPRPYLILGDFNAHNTMW FT GSNNTDTRGRMLEDIFLQHDLCILNDASPTYLHPGTGSFTCIDLTVCVPGL FT LDDFKWSVSNDLRGSDHFPIIITNNLPSLGRPQRWKLKKADWEQFQKRCSE FT DITENILHEQNPADTFASKLLSIARKVVPLTSANPKRPSKPWFDTACKSAI FT GDRKKRLAAFIKNPSQENLKLFRIARAKARQTIRSAKRNSWRSFVGSLDAK FT TSARTVWKAVRRIKGKESNAIGHLKNQGRTVTSPREIADCLASSIAEKSST FT AHYTPEFQKVKTREERHPIDFRSENNEDYNKPFSLEELRESLDKSHDTAPG FT EDEIHYQFLKHLPEPSLAVLLGVYICVWQTGAFPNSWRKATVIPIPKPGRD FT GSDPANYRPIALTSCICKTMERMINSRLVWYLESNKVISNYQCGFRQGRTT FT TDHLVRLEAYIRNALLRREHLVAVFFDIEKAYDTTWKHGILRDLALMGLKG FT HLPRFVEEFLKDRKFQVRVGNSASDTHDQEMGVPQGSILSVTLFNIKINSI FT INALSPGIECSLYVDDFVILTYGKNMNTLERKLQLCLNKIQGWANYNGFKF FT SDSKTVSMHFCNLRGLHPDPELFIHKKKIPVVKTTKFLGLTLDSKFNFLPH FT IKELKKKCQKSLNILRVLSHTDWGADRDTLLLLYRSLIRSKLDYGSIIYGA FT ARKSYLKILEPIQNAALRLCLGAFRTSPIPSLHVEAGELPMDIRMKKLAMQ FT YIVKLKSNPTNPAFDSIFNPTEVELYNRRPNVIQPLGLRMREPIQNLTQPI FT DQISKIETPQNPPWLMNKPKLNLSLLNFKKENTDPSILQVHFRELQESYGD FT CGTIYTDGSKMEGKVACACSFRNKTISRRLPDGCSIFTAELHAILLALMAV FT KASERSKFIICSDSKSALQALGRMKTDIPLVHKSLKLLDLITADRRDVTFI FT WVPSHVGIEGNEAADREAKRALNHAVSGTQIPYSDLRQSIASATYREWQNR FT WEAETHSKLRQIVADVRWRPTSKGLTRRGSTTMSRLRIGHTYITHSFVLKR FT EEPPLCEYCDSRLTVEHILVDCPRYQDVRAKHFRATNLKTLFNNVDPGKVL FT GFILEVGLSTKI" XX SQ Sequence 5845 BP; 1771 A; 1439 C; 1325 G; 1310 T; 0 other; cctcgtcccg agacatgacg ttaaactgcg ctcctctttc cttagcactt ttggagctca 60 aaagtgccct acttcttttc tatcattgtg tctgcctcct cttttcctaa gtctgccttc 120 attgatcatc acactgtctc cttaacttta aattctcact acgtcctaga ccagtccgtt 180 tgcagtggac tgcgacgggt aagaggggat aaagagatcc tggtgtttga gtggtggcta 240 tctgaggata aaccacaaca acacaatact ttggctccaa gttcctctag acctgagacc 300 cagatggccc gctctgggtc aaccggctgg tcatgccgag ctaccagcat aggtcgtcga 360 cctatgctgg agggcggtgg ctggagggtc aatcagggag ctgctgtatc ggacacatgt 420 ccgatgtagc agctgactga gtatagcacc tgtgtagccc tggcgggctc attctggaca 480 tccgaatggc ccgtcccctt gttggactcg atggcgggtg gggagcacag gtgccgaatt 540 aaccaaaata tatatggaca aaaaaacaca cacaaaacta cttgccccag acctatgtct 600 gggcaagaaa cgacgaattg acgataaaaa agaggaggtc ccaaaaagct gtgccacctg 660 gccatcattt ctggtggtta gatgcacaga ggagggaaaa agcctgacca agttgagccc 720 ttttattatc tataagggac tcaagtcagt agtgggagag cccaagagtg tgtctaagct 780 acacaaaaca atggaactcc ttgtagaagt agacagcaag acacactctg atgggctttt 840 aagatgcaga aggctctgtg atctccaagt ggaagtcatg ccacacaaga gtctgaacac 900 cagcagaggt gtaatcagct ctagggacct actggaatgt tccgagaagg aaatagtgga 960 gggcattgaa ggagtcaccc atgcccggcg cattaccagg cgcagggagg gtgaggagat 1020 caaaaccgcc actattatcc tcacattcgg aactaggaca ccgccagagt atgtgaaggc 1080 aggataccta cgagttccag tgaggcccta catacctaac cccatgaggt gcttcaagtg 1140 ccagggttat ggacacggcg cggcagtctg taagaggaac actgtgtgtg ccagatgtgc 1200 tggagagggt catgaggaca agggctgcac agcccagttc aaatgcccaa actgtcaagc 1260 tggccactca gcctactcca aggactgccc tgtgtggaaa caggaggttg ccgtgcagga 1320 gtacaaggca agaaacggat gtacctttag ccaggcgaaa tcggctgtac tggccctacc 1380 caagggccaa ttcggcttga caaagaccta cgcccaggct gttgctaaga aaggaagatc 1440 catagccaca cagacggaca cattgacccc accaccccct cctcctcccc caactcagaa 1500 gaaaacaccg ccaaagaaca cacctctgaa gaaacaagaa agccctgttg tcatgctaaa 1560 gaacaggttc tctgtgctcg ctgatgagag catggagact gagcagcatg tcaaaaccaa 1620 tactccggga gaacccggag acatcatgtc tgacacaggt tctcctcaga gaacccagcc 1680 agacacccca acctctactg agttagaggt tgaccaactc cctaagggga gaaatcaaag 1740 cggagaggat gcccgaggtg agagaggagg aaataacctc cgctctaacc agagggatct 1800 cccctctcca gagagaaaga cttcccccaa aagggggttg tcctcctctc catccaaacc 1860 caagaataaa aataccaagg tcactggaat aaagggaaac ccctccgggg gcctcccaag 1920 gccaaatagc tccaagtagg tcatctagaa taagccatgg attccagaat tgtacagtgg 1980 aattgtagag gcctcaaggc caattacgag gaaatgcagc tactgatgga ctccgagact 2040 cctgtagctg tctgcttaca ggaaacattt ctaaaagatt gcatcagctt ccgaagctac 2100 cgtgcttaca ccaagaatgt tgaggatgca gagagagcat caggtggagt ctgtatcctt 2160 gtgaaggata gcatccccca tgagagggta gaactacaga ccacactaca ggccgtagca 2220 gctaggataa cccttcacaa ggttatcact tgctgcagcc tgtatctacc accaggcgct 2280 ccattaaacc gaacagacat ggaggaccta ttgaaacaac tcccccggcc ttatttaatc 2340 cttggggatt tcaatgccca taacaccatg tggggatcca acaataccga cacaagaggt 2400 cgtatgctgg aggatatctt cctccaacat gatttatgca tacttaatga tgcatcaccg 2460 acctacttgc acccaggaac tggatcattc acatgcattg acctcaccgt atgcgtcccc 2520 ggacttctgg acgattttaa atggtcagtt agcaatgatc tacggggaag tgatcacttc 2580 ccaatcatca ttactaacaa tctcccttca ttgggacgac ctcagcggtg gaaactgaaa 2640 aaggccgatt gggaacaatt ccaaaaaaga tgctctgagg atatcacaga aaatatcctc 2700 catgaacaga acccagctga tacctttgct agcaagctac taagtatagc caggaaagtt 2760 gtacccctaa cctctgcaaa tccaaaacgg cctagcaaac catggtttga tacagcttgc 2820 aaaagtgcta ttggtgatag gaaaaaacga ctggctgcat ttataaagaa cccttcccag 2880 gaaaacctaa agctattcag gatagctaga gcaaaggcta gacaaaccat acggtcagcc 2940 aaaaggaatt cctggagaag ttttgtcggc agtctggatg ccaaaacttc tgctagaacg 3000 gtttggaagg cagtaaggcg aatcaaaggg aaagaatcaa atgcaatagg gcatttgaaa 3060 aaccaaggac gaactgtcac gtcccccaga gaaatagctg actgccttgc atcctcaata 3120 gcagaaaaat catccactgc acactacacg ccagagttcc aaaaagtcaa aaccagggag 3180 gaaagacacc ccattgattt caggtcagag aacaatgaag actacaacaa accgttctcg 3240 cttgaggaac tgagggaatc gctggacaag tcacatgaca cagcgcctgg agaagacgaa 3300 atccattacc agttcctcaa gcaccttccc gaaccctcat tggcagtcct actaggggtc 3360 tatatttgtg tgtggcaaac aggcgctttc ccaaacagct ggaggaaagc cacagttata 3420 ccgataccta aaccgggaag agacggctct gacccagcta actatcgacc aatagcacta 3480 acaagctgca tctgcaaaac catggaaaga atgattaaca gtaggctggt ctggtacctg 3540 gaaagtaata aagtgatctc aaactaccag tgcggattcc ggcaggggcg gacaacaact 3600 gaccacctgg taaggctgga agcttatatt agaaatgcat tactcagaag agaacatcta 3660 gtagctgtat tcttcgatat agaaaaggcc tatgacacaa cctggaaaca tggcattcta 3720 cgtgacctgg cgcttatggg gcttaaggga cacctccccc gttttgtgga ggaattcctg 3780 aaagatcgaa aatttcaggt ccgagtgggc aactccgctt ctgacactca tgaccaggaa 3840 atgggtgtac cccagggcag cattctgtca gtcaccctgt tcaacattaa aataaatagc 3900 atcataaatg cgctgtcccc tggcatagag tgctctttgt atgttgatga ctttgtcatt 3960 ctcacttatg ggaaaaacat gaacacctta gaaaggaaat tacagttatg tttaaacaaa 4020 attcagggtt gggcaaacta taatggtttc aaattctctg actccaaaac agttagtatg 4080 catttttgta atctaagggg gctccaccca gaccctgaac tatttataca caaaaagaag 4140 atccctgtcg tgaaaactac aaaattttta ggcctcacct tagattccaa atttaatttt 4200 ctcccccaca ttaaggaact taagaagaaa tgccaaaagt cattaaacat actaagagta 4260 ctcagccata cggactgggg agctgacaga gataccttgc tgctgctcta tcggagtcta 4320 attcgatcca agctagacta cggatccata atatatggag cagcaaggaa gtcgtaccta 4380 aaaatactgg aaccaataca aaatgctgcc ctgcgtctct gtctcggcgc gtttcgtaca 4440 tcacctatcc caagtctcca tgtggaggct ggagaactcc ccatggatat aagaatgaaa 4500 aagcttgcaa tgcagtatat agtcaagcta aaatccaacc ccacgaaccc tgcttttgac 4560 tccatattta accccacaga ggtagaatta tacaatcgaa ggcctaacgt catacagccg 4620 ttgggccttc gaatgagaga acccatccaa aatttaaccc aacccattga ccaaatctct 4680 aaaatagaaa cccctcagaa tcctccttgg ctaatgaata aacctaaatt aaatttatcc 4740 ctccttaatt tcaaaaaaga aaatacagac ccaagcatac tacaagtcca ctttagggaa 4800 ctgcaggaga gctacggaga ttgtggcacc atctacacag atggatccaa aatggaggga 4860 aaggtcgcgt gtgcctgctc ctttcggaac aaaacaatct cccgtagact ccccgatggc 4920 tgctccatct ttacggccga attgcacgca atattgcttg cacttatggc cgtaaaagca 4980 tcagaaagga gtaaatttat aatctgctcc gactccaaat ctgcattgca agctttgggg 5040 cggatgaaga ctgacatccc attggtacat aagagcctga agctgttgga cctaataaca 5100 gccgaccgta gggatgtcac cttcatctgg gtcccctccc atgttggcat tgagggaaac 5160 gaagccgcag acagagaagc aaagagagcc ctaaatcatg cggtgtcagg aacccaaatt 5220 ccctactcgg acctgagaca aagtattgcc tctgccacct atcgagagtg gcagaaccga 5280 tgggaggctg agactcacag taaactcagg cagattgtgg cggatgtcag gtggcggccc 5340 acatctaagg gtctgacaag gcgtggtagc acaaccatgt ccagacttag gattggccac 5400 acctacatca cgcactcttt tgtactgaag agagaggagc ccccactttg cgagtattgt 5460 gactctcgcc tcaccgtgga acatatcctc gttgattgcc ccagatacca ggatgtcagg 5520 gcgaaacatt ttagagccac taatctaaaa acactattta ataatgtcga ccctgggaag 5580 gtactgggct ttattctgga agtggggcta tctacgaaga tctgatttct gaatttgtga 5640 acatgcacta tttacattag atttttacca aatatttaaa tttttactac ctttactatt 5700 ttaactgtga atagacctta attttaatta tattactgta tagaatctgg cccttgttgt 5760 ttagagagag agtagtcctt aagggactgc aggcacgaca tggcctaaat tgtgccgatg 5820 tgcctcaaat caacaaatca aatca 5845 // ID Crack-1_CP repbase; DNA; INV; 4843 BP. XX AC . XX DT 22-JUL-2009 (Rel. 14.07, Created) DT 22-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Culex pipiens Crack non-LTR retrotransposon. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-1_CP. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-4843 RA Kapitonov V.V. and Jurka J.; RT "A family of Crack retrotransposons from Culex pipiens."; RL Repbase Reports 9(7), 1332-1332 (2009). XX DR [1] (Consensus) XX CC Crack-1_CP is a non-LTR retrotransposon that belongs to the Crack CC clade, a sister clade of L2 and Daphne. Like in other Crack CC retrotransposons, the ORF1 protein in Crack-1_CP contains a CC domain similar to the central domain conserved in vertebrate L1 CC ORF1 protein. In Crack-1_CP-Crack3_CP retrotransposons, the ORF1 CC protein contains also in its N-terminal part the PHD finger CC (C4HC3). The Crack-1_CP consensus sequence was derived from CC multiple alignment of 5 copies of Crack-1_CP that are 99% CC identical to each other. XX FH Key Location/Qualifiers FT CDS 408..1475 FT /product="Crack-1_CP_1p" FT /note="ORF1." FT /translation="MSSDEMDAENEIYCAICETAEPNAAKVLECVNCHACH FT HFKCKKIIGNAIAKWKKKDYFCSVLCQEIHLKATSAANTESLLLAEFQKVV FT SEIKNLKEEQHSTRKYVSKAVGEIEKSQNFLAHKFDEVCDNIKRLENEQHT FT LKGSVGVVEGKYVQLSETVNRLEGEVDRYRRSALSCNAILLGIPEVKDENL FT GEAVTRFAAVLGLNITEDFFSDVKRLRDSKSESRSPPIRIVFASEPNKEQF FT FAKKKEVGQVQVSTLGGCYSGKSGRVMLRDEMTPHGLALLREVKEVQEQLD FT LKYVWPGRNGVILVKKTDKSKVEYIQNRNDLQKLNKLSSKRGLEMSLSGPI FT SSSSVIEQDPKRR" FT CDS 1525..4413 FT /product="Crack-1_CP_2p" FT /note="ORF2." FT /translation="MASCIVKNVSFDCIEDCICNKVNVSVALRVFQWNIRG FT MNRLDKFDNIKEFLHRYKSPVDVLIVGETWVKEGCTELYGIEGYSAVFSSR FT SDSHGGLVVYFRDKLKCFVTKNVNINGCHYICLHLQHAGKPIDLVAVYRPP FT SYPFGATLEIIDKILSELPSEHVILAGDFNVPVNITTSPAVQEYLRLLDSF FT NVQVTNTVATRPSSGNILDHMICTTTFADTLLNETIFTEASDHSILVSSFN FT WHATKKLRVLEKQIVDHQRLNTLFSSALENMPDGMNANEKIAFITDKFNEI FT KKSVTKTVSVEAKIKGHCPWMTLDLWKLLAIKEKLLKKSRTNPLDMQTKSL FT LKHASKKVQSKKEECKRNYYYRQLSSSSTKQCWRCLNEIMGKAAGSVKEMK FT LKIEGTEVTDPVTVSYELNKHFCTVGSNLASTIISDRNVLKFNTLRRIDTT FT MYLHPTTSEEVLLLINKLDCSKSPGPDNITAAALKKHKLAFSCILRDIFNE FT IILTGRYPDQLKSARVVPIFKSGDPMDVNNYRPISVLSVLNKIIEQLIASR FT IKSFLNSLELLYKNQYGFRRGSSTRTATSELLDDIYQDLDGKRFTGSLFLD FT LKKAFDVINHGLLLQKLERYGIRGAALSLLRSYLSNRMQFVNSNGINGDSA FT IIDTGVPQGSVLGPLLFIIFINDLSRVQLFGKVRLFADDTVITYSHKDAVH FT ITQMMQADLQTLNEYFTSNLLFLNLSKTVFMIFHSPQRTVPELPAVAINTT FT EIKKVSSFKYLGLVLDETLNWKPHIDQLKKKIAPACGVLRKISSFVPFRWL FT KSLYASLIHSRLQYLVINWGTANKVNLRELQTLQNRCIRTILRKPALYPRR FT SLYNDVNHSFLTIRALHMFQIYVHMYNIMHNPTTHHNFNPVRVEHARITRQ FT TGDLRTTLPHTEFGKRCLSYSGCYQYNKLPDFIKQSPSQITFKCQLKSHLK FT QKIEQYLQ" XX SQ Sequence 4843 BP; 1412 A; 1137 C; 1101 G; 1193 T; 0 other; aagaagaaga actacgttgt ttggtgctca tatctgcact gctgaaaaac aacaataact 60 gaaggataac acaaccgcac aactcaacaa acctgtgcat cttggtcctg ttgctgacgc 120 ggaactccgg gtggtattat ccagtggaaa tccaggggtg acaggccaga acttgtcggt 180 aaaggtgaag gtgtatttac ggagaacaag gaaaattcca tccatcttgg caatcgcaag 240 tattacccac tgataccgtt tcactgaatt gattgtttac agcggtacaa caccgttgcc 300 tgtcactgtc actgtcatca cagtttacca gtcgagttgt ttcaaccatt tgaacactga 360 gtgaactgat caactgtggt accttttcaa caaaataaac aaacgggatg tcgtcggatg 420 aaatggatgc ggaaaacgaa atttactgcg cgatctgtga gactgctgag ccgaatgcgg 480 caaaggtgct ggagtgcgtg aactgtcacg catgccatca cttcaagtgc aagaagatca 540 tcgggaacgc gatagccaag tggaagaaga aggattactt ctgctcggta ttatgtcagg 600 agatccacct gaaagcgacg agtgcagcga acacggaaag tctgctgttg gctgagttcc 660 agaaggttgt cagcgagatc aagaacctga aggaggagca gcacagcacc aggaaatacg 720 tgtcgaaggc tgtaggggag attgagaaga gtcagaattt cttggctcac aagttcgacg 780 aagtttgtga caacataaag aggttggaga acgagcaaca cacgctgaaa ggatctgttg 840 gcgttgtcga ggggaagtat gtccagctga gcgaaaccgt caaccgtctg gaaggtgagg 900 tggacagata ccgtcgttct gcactgtcgt gcaatgccat tctcctggga attccggagg 960 tgaaagacga gaatcttggg gaagctgtga ctcgttttgc tgctgtactg ggtctcaaca 1020 tcactgagga tttcttctct gatgtgaaga gattgcgtga ctcgaaatcg gagagccgaa 1080 gtccccccat cagaatcgtt tttgcatctg aacccaacaa ggaacagttc tttgctaaga 1140 agaaggaggt tggacaagtg caggtttcta cgctaggagg ctgctacagc ggcaaatctg 1200 gacgggttat gctgcgggat gaaatgaccc ctcacggatt ggctctgctt agggaggtga 1260 aggaagttca ggagcaactg gatctgaagt acgtctggcc ggggcggaat ggagtcatcc 1320 tggtgaagaa aacggacaaa tcgaaggtgg aatacatcca aaaccgtaac gacttgcaga 1380 aactgaacaa actcagctct aaacgtggct tggaaatgtc tctaagcggc ccaatttcat 1440 cgtcttctgt tatcgagcaa gatccaaagc gtcgctaggg aacgggtcgt tatttttact 1500 gttttacttg tttatttata aaaaatggct tcatgtattg taaaaaatgt gtcctttgat 1560 tgtattgaag attgtatttg taataaagta aacgtttcgg tggcactgag ggtttttcaa 1620 tggaatataa ggggtatgaa tcgactggat aaattcgaca acatcaaaga atttcttcat 1680 cggtataaat ctcctgttga cgtgctgatt gttggcgaaa cctgggtgaa agaaggctgc 1740 actgaactgt acggaattga aggctactct gctgtattct catcacgttc ggattcgcat 1800 ggaggtctgg tggtttactt tcgtgacaaa ttgaaatgtt tcgtcaccaa gaacgtgaac 1860 ataaatggct gccactacat ttgtttacac ctgcaacatg ctgggaaacc aatcgatctt 1920 gttgccgtgt atcgtccacc gagctatcct tttggagcga cccttgaaat cattgacaag 1980 attttgagtg aattaccaag tgagcatgtt atccttgctg gggatttcaa cgttcccgtg 2040 aacatcacga cgagtcctgc tgttcaggag tatctacgcc tactggattc gtttaatgtt 2100 caagtaacga atacagtggc gacgcgacca tcaagtggga acatcctgga tcacatgata 2160 tgcacaacta cctttgctga cacactcctt aatgaaacca tcttcaccga agcaagcgac 2220 cattctatcc ttgtttcttc attcaactgg catgcaacca agaagcttcg agtcttggag 2280 aagcagattg tcgaccacca acgtctcaac accttgttct cctccgcgct tgaaaacatg 2340 ccagatggca tgaatgcaaa tgaaaaaatt gcgttcatta ctgacaagtt caatgaaata 2400 aaaaaatcag tgacaaaaac ggtatcagtg gaagccaaga ttaaaggaca ctgcccttgg 2460 atgacgttgg acctatggaa actcttggcg attaaggaaa agctcctgaa gaaaagcaga 2520 accaacccgc tagacatgca gaccaaatcg ttgttgaaac atgcttccaa aaaggtgcaa 2580 tcgaagaaag aggagtgtaa gaggaactac tactatcgcc aactttcgag ttcatctacg 2640 aagcagtgtt ggagatgttt gaatgagatc atggggaaag cagcaggaag cgttaaggaa 2700 atgaagctga aaattgaagg caccgaagtt acagaccctg taacagtgag ctacgagctg 2760 aataaacact tctgtaccgt gggaagcaat ctggcctcaa cgatcatcag tgacagaaac 2820 gtccttaagt tcaacacgtt acgtcggatt gacactacga tgtatctcca cccaacaacg 2880 tctgaagagg tcctcctgct gatcaacaag ctagattgca gcaagtctcc aggaccagac 2940 aacatcaccg cagccgcact gaaaaaacac aagctggcgt tttcgtgcat tcttcgagat 3000 attttcaacg aaataatact gactggacgg tatccggatc aactgaaatc tgcacgtgtc 3060 gttccaatct tcaagtctgg tgatcccatg gacgtgaata actatagacc gatctcggtt 3120 ttgtctgtgc tgaacaaaat catcgaacag ctgattgcgt cgagaatcaa gagtttcttg 3180 aactcgctcg agctgctgta caaaaaccag tatggattcc gacgtgggag cagcacccgc 3240 actgctactt ctgagctact ggatgacatc tatcaggatc tggatggcaa gcgattcact 3300 ggaagtctgt ttctagactt aaagaaggcg ttcgatgtca ttaaccatgg tttgctgtta 3360 caaaaactgg agcgttacgg tattcgcggg gctgctctaa gtcttcttag atcctatctc 3420 tccaaccgga tgcagtttgt gaactcaaac ggaatcaatg gagattcagc aattatcgac 3480 actggagtac ctcaaggaag tgtactggga ccgttactgt tcatcatttt cattaatgat 3540 ctatcccgtg ttcaactgtt tgggaaagta cgcctctttg ctgatgacac agtgatcacc 3600 tacagtcaca aggacgccgt acatatcacc caaatgatgc aagctgatct acaaaccctc 3660 aacgaatact tcacgtccaa cctcttgttc ctgaatctct ccaaaacggt gttcatgatc 3720 ttccattcac cgcaacggac tgttccagag ctgcctgctg ttgccatcaa caccactgag 3780 atcaaaaaag tctcgagctt caagtactta ggactggtac ttgatgagac gcttaattgg 3840 aagcctcaca tcgaccaact gaagaagaag attgcacctg catgtggagt tctacgtaaa 3900 atttcgtcgt ttgtaccttt tcgctggttg aaaagcctct acgcctcgct gattcattct 3960 cgtctgcagt acttggtcat caattggggt actgcgaaca aggttaacct gagggaacta 4020 caaacgctgc aaaatcgatg cattagaacg attctgagga aacctgcttt atatccgaga 4080 cgctcgctgt acaacgatgt gaatcactca tttcttacca taagagctct tcacatgttc 4140 caaatctacg ttcacatgta caacataatg cacaacccta caacgcatca taacttcaac 4200 ccagtccgtg tggagcatgc acggataacg cgacagactg gagacctaag aaccacgttg 4260 ccgcatacag aattcgggaa aaggtgtctt agttattctg gttgctatca gtacaacaaa 4320 ctacctgatt tcatcaaaca atccccctcc caaataacct tcaaatgtca actgaagtca 4380 cacctaaagc aaaaaatcga gcaatatctg caataatatc tgccactaac cacccaaaaa 4440 ctgctttctc tccgccatgc cacaaccgcc aaccgcccgc cgcccaccgc ccgccgcccg 4500 ccgcctaccg ctcgccgcct accgctcgcc acccaccgct caagcccgcc gcccgccgcc 4560 cgccgccctt cgcccacgcc cgccgcccac cgctcaacgc acatcgcccg cccaccgacg 4620 atcatcctta tcgtcactta ttgttcaccg aacaacataa tattagtata taaataatta 4680 tctgtaattg ttgaagatgc acatccttaa cagagatata tatctcactg gattgtgcaa 4740 gccgtccgag cctcacctca aaataaacat agaggtaggt gaatgttccc agctttgctg 4800 tggagtgaga aattcggaat taattaaaaa ataaaaaaaa aaa 4843 // ID DNA8-54_AP repbase; DNA; INV; 588 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-54_AP. XX NM DNA8-54_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-588 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1986-1986 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 588 BP; 231 A; 69 C; 71 G; 213 T; 4 other; cagtcttgta aagaatactt ttattttgta ttcggaatac atattcgaat aaatgagaat 60 taaggtattc agaatacgta ttcgaatagt tcttaaaaaa tatattcaga atacatattc 120 gaatacttta aaaagtattc gaatactatc agaatacttt ttttcacaac tagtttttgn 180 cagtttttga atgattggta nttaacattt attaattaat aattaaacat acaacgtaat 240 cataagttat ttatacattt tgaccacata tgatncggcc gatacgggcc atacgggata 300 caacacattg gtgtttataa aaataatcat atcatgggcc aatattatat ttcgtttaaa 360 gaataaaatt aatgttcatg atttataata tgtcttgtca caaatttaaa actatttttc 420 tggatcggaa aaaagtattt ttttaatgaa taaaaaatac aaataaaagt attcagaata 480 cgtattcgaa tacttcttaa aaaaagtatt taaaatagta ttcgaataca aaaaaagtat 540 tcgaatactt ttattcgaat acntttattc gaatacttta caagactg 588 // ID Gypsy5-I_AP repbase; DNA; INV; 4553 BP. XX AC Contig832; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5AP; KW Gypsy5-I_AP; Gypsy5-LTR_AP. XX NM Gypsy5-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4553 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 445-445 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [3458-3931] - Integrase core CC 'GTAG' target site duplication CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 3281..4438 FT /product="Gypsy5-I_AP_3p" FT /translation="MAHDMAGHRALDQTLKKIQEHYWFSKMRRYVQMHIGG FT CLDCLVNKKPGGKQPGELHPIPPGKRPFAVVHLDHLGPFETSLKGNRFLLV FT LIDNLTKYVLLFPAKSTHTQAAIRSLQKFVDQWGLPDRLITDRGTCFTSKA FT FQDNCQHNGIRHTLNSSRHPQANGQVERTNRTLLPMLAIQAQDQKLWDQKL FT REVERDINNTESKTTRYTPFELIHGYRPQFHSSVLRLLDGVGTHQPLNSAE FT LQVKARDNILKTQVSMKERYDQKMNKSISYEVGEIVAMLRAPNPGQSTKLQ FT TKYRGPLQVIGKLPGNTYRVAEMARDGKHIYATTAHVSQLKAVSIMQEEQF FT EKKGSGSSADEDEENNKSNADLPRVEETVKRQRKPPKYLKDYV" FT CDS join(631..2082,2086..3111) FT /product="Gypsy5-I_AP_1p" FT /translation="MSDAARNWFLSKNFVSWSTFREKFKLVFVRELRMADK FT WDAMRSRVQHKDESLMSYFQDMVQLWALFVYCVLFVYAFNKTVLISKIRSS FT SSKPLSNIFRSGKLCKGVRLQFEELRDYVLQGILSRELAMYAFGRKHDDED FT QLLIDILEWQRQSDRRGDQPNTSVITKVDHSTMWSKKETTSNQSEPSKFTS FT ATRTITKTGSPTSDSRHSTIKCYNCSGFGHIARDCPRLKRPMKCSLCSAEG FT HTRGKCPSVKQEHPAQTCQVVVAGCAGKNPFGKQVKINQVEFNGLIDSGST FT VCLIRSSAAVRTKLPIVSSIKPLYAVGDMKTPRITSLAEILGELVIDGVKT FT DKVQFLVVEDKAIPVEVLIGQSWLNLPQITYHKQDDTFVVEYSDMDVEAIC FT GNINKHEHGVQVCLVQEQSTSKVSLSTDDVIIGSQVNSTQKEELTVLLNKY FT RTVFILNLQELGCTDLVTMEIVEEQNSKPVSMKPYRTSEERRQIAEITDEF FT ERMPFGLRNAPAEFCRLMDQVLGTLKREGVVRCYIDDVIIPATDWRDMCRK FT VERVFQALERAKLTLNPTKCAFGVTELDYLGFNIKEGQLRPGRKIAVIENY FT SRPNNVHELRRFLGLTGYFRRFITHYADISAPLTALTKKDLPWKWTDAQQG FT AFEQLRRILTSAPVLKLYNHESPVTEVHTDASAKGLSGMLMQGDTEKSLHL FT VYAVSKRTTEAESHYHSSRLELYAIVWTLIRLRPYLLGIRFTVVIDCQALV FT YLNTNKSEKPQVARWFDLLQEFEMHIKYRSGEQLAHADALSRLEGTTDSVQ FT AVEEALDNRSLVMTLMTLEERVRS" XX SQ Sequence 4553 BP; 1445 A; 776 C; 1095 G; 1237 T; 0 other; ttttcagaag tgggataata aatccatagt tcgcgagtgt agttgccgtt ttgatttcgt 60 tttaacacgg gaaacggttg gtgctcttta cattcattag tcgctactgg tgttgcgaaa 120 ccacgaggtt tagtggtaat agtgtacgga ggaagtaacc gtggcgcaca agttgacgtt 180 acggtgtgtc aatcgatgac tgcatcgcta gtggagataa gctggttatg tttacatgtg 240 ataagccgtt gttatcacaa agattttctt gtcagtattg agcgaatcgt ttgttgtaac 300 gatgtcgagt gatcgagtga cgttgttgaa attacagcac ctaattataa tgaattgtac 360 caattagtga acgagcaacg agttagactc aaacagctag aagatgaatt agcacgggta 420 aatgtgaaaa atgacgagac atcattgaac actaagacaa atgatgaatc gatagagttt 480 cgtgtcatcc cagacgtcaa taagtcagtt aaatgtttta ctggattgga accttctcac 540 gtagccgaag attggttgga aacgatagag ggtttagctg atttgaatcg gtggccatca 600 aaattttgtc tacattttgc acgttcgaat atgtcggatg cggcacgtaa ttggttttta 660 tcgaaaaatt ttgtgagttg gtccacgttc cgtgagaagt tcaaactagt gttcgtacgt 720 gaattacgca tggccgacaa gtgggacgct atgcgcagtc gtgtacaaca caaggatgaa 780 tcactgatgt catattttca agatatggtg cagttgtggg ctctgttcgt atattgtgtg 840 ttgttcgttt atgcgtttaa taaaactgtt ttaataagta aaattcgttc gtcttcgtct 900 aaacctttga gtaacatttt cagaagtggg aagttgtgca aaggtgtcag attacagttc 960 gaggaacttc gcgattatgt tctacagggc attttatctc gggaactcgc gatgtatgca 1020 tttggaagga agcacgatga cgaagaccaa ttactcatag atatattgga gtggcaacgt 1080 cagtctgatc gtcgtggaga tcagccaaac acttccgtaa ttactaaagt cgatcatagt 1140 acgatgtggt caaaaaaaga aacaacaagt aaccagtcag aaccaagtaa gttcacatcg 1200 gcgacacgaa ccattacgaa aacaggatca ccgacatctg actcacgaca cagtacaatt 1260 aagtgttaca attgttctgg ttttggacac atagcacgcg attgtccgag actaaaacga 1320 ccaatgaaat gttcattgtg tagcgccgaa ggacatactc gaggcaaatg tccttctgtt 1380 aagcaagaac atcctgccca aacgtgccag gttgtggtgg cggggtgtgc agggaagaat 1440 ccgtttggga aacaagtgaa aattaaccaa gttgaattca atggactcat tgattcaggt 1500 agtacagtat gtttaatccg atcgtcagca gctgtacgga ccaaattacc aatagtgtca 1560 tcaatcaaac cattgtatgc tgtgggtgat atgaagacac ctagaatcac ttcattagca 1620 gaaatattgg gagagctggt tatcgatggg gttaagactg acaaggtgca attcttagta 1680 gttgaagaca aagcaattcc agtagaagta ttgattggac agtcttggct caatttgcct 1740 cagattactt accacaaaca agatgacacg tttgtcgttg agtactcaga tatggacgtg 1800 gaagcaatat gtggcaatat taataagcat gaacatggag tccaggtatg cttggtgcaa 1860 gaacagtcga catcaaaagt gtcactgtca acagatgatg tgataatagg cagtcaggtt 1920 aactccacgc aaaaagaaga actaacagtg ttgttgaata agtatcgtac cgtgtttata 1980 ttaaatttac aagaattggg ttgcacggat ttggtgacaa tggagatagt ggaagaacaa 2040 aacagtaagc ctgtatcgat gaagccgtat cgcacatccg aataagagcg aagacagata 2100 gcagaaatta ctgacgagtt cgaacgtatg ccttttggtt tgagaaatgc tccagctgag 2160 ttttgccgat tgatggacca ggtgttggga actctcaaga gggaaggtgt tgttcgttgt 2220 tacatcgacg atgtcattat tccagcaaca gattggagag acatgtgtag gaaagtggaa 2280 agagtatttc aagctcttga acgtgctaag ttaacattga acccaaccaa gtgtgctttt 2340 ggagtcactg aattggatta cttaggtttc aatattaagg aaggccagct gcgtccaggg 2400 cggaaaatag ctgtaattga gaattactca agaccaaata atgtccatga actgagaaga 2460 tttctcggat tgacaggata cttccgtcgt ttcataactc attatgcgga catttctgca 2520 ccactgacgg ctttgacgaa gaaggatcta ccatggaaat ggacggatgc acaacaagga 2580 gcatttgaac aattgaggcg tatactgaca tctgctccag tgttgaaatt atacaatcac 2640 gaatcaccgg tgactgaagt acacacagat gctagcgcaa aaggtctatc gggcatgtta 2700 atgcagggtg acactgagaa gtcactacac ttagtgtatg cggtgagcaa aagaacaact 2760 gaagcggaat cacattatca ttccagtaga cttgaattat acgctattgt atggacgtta 2820 attcgattga ggccatatct gcttggaatc agattcacag tggtgattga ttgtcaggct 2880 ctggtatatt tgaacaccaa caagtcagaa aagccacagg tggctcgttg gttcgatcta 2940 ttacaagaat ttgaaatgca tatcaagtac cggtcaggcg agcaattggc tcacgctgat 3000 gcattgagtc gtttggaagg tactacggac agtgtacaag cagtagaaga ggcattggat 3060 aatcgatcct tggttatgac attaatgact ttggaagaac gtgtacgatc ctgatacatg 3120 tcaattaatc aacatattgg agaagggtga aggtgaccga actccatatg agaggaacgc 3180 tgtaaataaa ttcaagttgt tggatggtgt attgtatcga gaattagagg ggataagtcg 3240 ttttgtggta cctaagcctc tgagaaaggg gaaagtgata atggcccatg atatggcggg 3300 tcatagggca ttggatcaga cactaaagaa aattcaggaa cactattggt tttcaaaaat 3360 gcgtagatat gtacagatgc atatcggtgg atgtttggat tgtctggtca acaagaagcc 3420 tggaggaaaa caaccaggag agctacatcc tattccccct ggaaaaagac catttgcagt 3480 tgttcacttg gatcatttgg gaccttttga aaccagtttg aagggcaata gatttttatt 3540 agttttaata gataacctta ccaaatatgt attgttgttt ccagcaaaat ccactcacac 3600 tcaagcagct attagaagtt tacagaagtt tgttgatcaa tggggattgc cggatcgttt 3660 gataacagat agagggacat gttttacttc caaggcgttc caggataact gtcaacataa 3720 tgggatcagg catactctca attcgtctcg acatccacaa gcaaatggac aagtggagag 3780 gaccaataga accttattgc caatgcttgc aattcaagca caagatcaaa aattatggga 3840 tcagaagttg agagaagttg aacgtgacat aaacaacacg gaaagcaaga ctaccagata 3900 cacaccattt gagttgatcc acggataccg gcctcagttt cattcaagtg tgttgagatt 3960 actagatggt gttggtacac atcagccttt gaattcagca gagttacagg tcaaggcaag 4020 agataacatt ttgaaaactc aggtatcaat gaaagaaaga tatgatcaga aaatgaacaa 4080 atcaataagc tatgaagttg gagaaatagt ggcaatgctg cgtgcaccta acccaggtca 4140 atctaccaag cttcagacca agtatcgggg cccattacaa gttataggga aactacctgg 4200 aaacacatat cgagtcgctg agatggcaag ggatggtaaa cacatttatg ctacaacggc 4260 tcatgtgtct cagttgaagg cagtttcaat tatgcaagag gagcagtttg agaaaaaagg 4320 aagtgggtca tcagcagatg aagatgaaga gaataacaaa tcaaatgctg atctgccaag 4380 ggtggaagaa actgttaaac gtcaaagaaa accaccaaag tatttaaaag attatgttta 4440 gttaagtcta tctaaagaac tgatttgttt aattttgttt aaaatattaa tttagttaag 4500 tttgttttgt ataatgactt atcgaggtcg ataagtaagg gtggatggcc gag 4553 // ID CR1_Ele1 repbase; DNA; INV; 4874 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4874 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4874 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 28 CC sequences with >92% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 1..726 FT /product="CR1_Ele1_1p" FT /translation="MRNNQLFWFCPSCATLMKDMRLRNTARAAYEVGQGHA FT LNSHSDIMTNLKTEIMDELKAEIRNNFAKLINSSSCTPKSSKRVGIDPRFT FT RSRRLFSTAANPVSKNQPPLLLGTGSTPSPSIEIGTVPPNQPKFWLYLSRI FT AKDVSADQVCALAKKRLGTDDVQVVRLVAKGRDINTLSFVSFKIGMSLELK FT FKALSTSTWPKGVVYREFTNNNNREENFWRPGHVAASDDPLSLPTEEVVLM FT E" FT CDS 1530..4727 FT /product="CR1_Ele1_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MYDLWSSANGDAQMSEAGPSSMFSVDGALLEFARRSD FT ILDCWNGSRDHFTLPLPDVMDLRRVDEASDSRGPPGAEVMTQPELGNTLRR FT PKTDSKIRRVLTDSQRTSHDVMIYYQNAGGMNCDINSYLLATSDDCFDIIV FT LTETWLDSRTFSSQVFGPSYEVFRCDRSARNSTKSVGGGVLVAVKKKLKSK FT PIDNDSWASIEQVWVKIEFSGRNLFLCGIYIPPDRTRDMASIETHNQSIIH FT ISAAANPEDEIVVVGDFNLPGISWQTCGNGFMHPNLGRSSIHAAASRLLDC FT YSTATLRQINNVLNENNRCLDLCFVSARDVAPLITSAPVALVKNVAHHRPL FT IIALQHNEIRHRESISPVYYDFRNADQQSIAEFLSSVDWLHVLDSDDVDTA FT ALTLSHVIEHAIERHVPKKVHSSSGPPWQTRELRHLKTSKRAALRLYSKHR FT TPSLRDHYARVNNAYKNASRRCYKRYQQQVQLNLKSKPKSFWRYVNKQRKQ FT SGLPSVMKLDEEVASDWYDICTLFAKKFSSVFCEEELSEEHVANVANNVPL FT FGETFNMLTVDDNMIIEAAARLKSSLNPGPDGIPSVLLKAHIVNLLVPIRH FT VFNLSLNSGRFPAAWKTATMFPVHKKGDRKDVNNYRGISALCCISKLFELI FT VMKPMLFHCKHYISEDQHGFVSGRSTTTNLLCLTSHIADSFADRAQTDVIY FT TDLTAAFDKINHAITVAKLERLGVGGLMLRWFSSYLTDRRLTVELAGFSSE FT SFTATSGVPQGSHLGPLIFLLYFNDVNTVIKGPRLSYADDLKLFLRVRTIA FT DAVALQHDLDVFVVWCDMNRLTVNPEKCSIISFSRKLQPILFCYQLKGKVL FT QRVEDVKDLGVILDAQLTFKHHLSYVIEKASRTLGFIFRIAKDFTDPYCLK FT SLYCSLVRATLEYCSAVWHPYYRNGVERIESVQRRFIRYALRRLPWRDPFR FT LPSYQSRCQLIQLESLQTRRDVNRAMVVADLLSGRIDCPALLESINASAHS FT RTLRNNAMLRLPLRRTNYGQNSAMVGLQRVFNRVAEGFDFHLSRQVIRRNY FT VEILSRLSV" XX SQ Sequence 4874 BP; 1214 A; 1218 C; 1093 G; 1349 T; 0 other; atgagaaaca accaattgtt ctggttctgt ccttcatgtg ctacgctcat gaaagacatg 60 cgcctccgta acactgcacg tgccgcttat gaggtgggcc aaggacatgc gctcaactcc 120 catagcgaca tcatgacgaa cctgaaaacc gagattatgg atgaactgaa agccgaaatt 180 cgaaataatt ttgctaaact gattaattca agttcctgta ctccgaagtc ttccaaacgt 240 gttggcatcg acccaaggtt caccaggagc cggaggctgt ttagcacggc agccaatcca 300 gtatcgaaga accaaccgcc tttgctactt ggaactggta gcacaccatc tccgtcaatt 360 gaaatcggta cggtcccacc gaatcaacct aagttctggc tgtatctatc tcggatcgca 420 aaagatgtgt cagctgatca agtttgcgct ttggcaaaaa aacgcctcgg tactgatgac 480 gttcaagtag ttcggctcgt ggccaaggga agggacataa acacattgtc cttcgtttca 540 ttcaaaattg gcatgagttt ggaattaaaa ttcaaggctc tgtccacttc gacgtggccg 600 aaaggtgttg tctacaggga gttcaccaac aacaacaacc gtgaagaaaa tttttggcgc 660 ccaggacatg tggctgcatc cgacgatccg ctgagccttc caacagaaga agtggttcta 720 atggaataag tgatgacccg ggacgcactc ttcccctcag cctaaaggaa gcccctattc 780 ctctcaccgc agtcgagccc ctcccgccag cgaccagcag ccgtcccggt cctgcgtgtg 840 agttggaaga tggggtcttc cgatcatctt tcccaggcaa gtatgcgtac cctttgagaa 900 cttcaccggc tgtaccatct gttgttttca gtgtatcttc accaactcaa acatgttcag 960 cgtttcttaa tcggacaccg ggacgcaaat ttgccgcaag ccttaaggaa gcctctattt 1020 cactcaccgc agtcgagccc ctttcgccat cggccagcag ccgtcccggt cctgcgtgtg 1080 agctggaaga cgaggttctc cgaactccaa catcaggcaa gtatgctctc cacttaagaa 1140 cttcgtacgc tgtatcaccc gatgctttca gtgcaacttc aacgactcaa acgtgttctt 1200 catatctgga ccggataccg ggacgcaaat ttgccgcaag ccttaaggaa gcctctattt 1260 cactcaccgc agtcgagccc ctcccgccat cggccagcag ccgtcccggt cctgcgtgtg 1320 agttggaaga tggggtcttc cgaaatcgta tctcaggcaa gtacgacatc aatacgagca 1380 attcgcttcc tgaagttcct gttgttttca gcgtactcga agaactgtca ggcttagctt 1440 caaactccga ttcaccagga ccgtcggcag tacctgccgg aaacaatcat agaagccgca 1500 ttcccgagta tgctcttatt gattcacgga tgtacgacct ctggagttcc gcaaatggtg 1560 atgctcagat gtctgaagcc ggcccttctt caatgttttc tgtcgatgga gctcttttgg 1620 agtttgctcg tcgctctgac atcttagatt gctggaatgg gtcgcgtgat cacttcacac 1680 tccccctacc ggatgtcatg gatttaagac gtgtagatga ggcttcggat agtcggggcc 1740 cccctggtgc ggaagtcatg acccaaccgg aactagggaa caccctccgg cgcccgaaga 1800 ctgactctaa aattcgtcgt gtactaactg actcccaacg cacttctcac gatgttatga 1860 tctactacca aaacgctggc ggaatgaatt gcgacattaa tagttacctt ttggcgactt 1920 ccgacgattg tttcgacatt atcgttctaa cggaaacatg gctcgattcc cgtacttttt 1980 ccagtcaagt tttcggacct tcatatgagg tgttccgatg tgatcgtagc gctagaaaca 2040 gtaccaaatc tgttggtggt ggcgttcttg tagcggtcaa gaaaaaattg aaatctaagc 2100 ctattgataa cgactcttgg gctagtatcg agcaagtgtg ggtgaagatc gagttctccg 2160 gtcgtaacct atttttgtgt gggatttaca tccccccgga ccgaacgcga gatatggctt 2220 caatcgaaac acataaccag tccataattc atatttctgc cgcagcgaat cccgaagacg 2280 agattgttgt tgttggcgat tttaatcttc ccggaatatc ctggcaaact tgcggcaatg 2340 gtttcatgca cccaaatctt ggccgttctt ctatccatgc cgctgcgtcc cgacttcttg 2400 attgttacag tactgctacg ctgcgacaaa ttaacaatgt actcaacgaa aacaatcgct 2460 gtcttgacct ttgctttgtt agtgcgcggg atgtggcacc gctgataact tcggcccccg 2520 tcgccttggt gaagaatgtt gcccatcatc gtccactgat tatcgctctt caacacaacg 2580 aaattcgcca ccgtgaatcc atctctccag tttactacga ttttcgtaac gctgaccaac 2640 agagtatagc ggagtttctt tcatctgtgg attggcttca cgtactcgac tccgatgatg 2700 ttgacaccgc tgctttgaca ctatctcatg taatcgaaca cgctatcgaa aggcatgtgc 2760 ccaagaaagt tcattcttct tcgggtcctc cgtggcaaac ccgcgagcta cgccatctga 2820 agacttcaaa aagagctgct ctgagactct attccaaaca ccgcacgcct tcacttcgag 2880 atcactacgc aagagtcaat aatgcttaca agaacgctag tcgacgttgt tataaacggt 2940 atcagcagca ggttcagctg aatctaaaat cgaaaccgaa atcattctgg agatatgtaa 3000 acaaacaacg gaagcaatct ggtctcccct ccgttatgaa gctagatgaa gaggtagctt 3060 ccgattggta tgatatctgc accctttttg ctaaaaaatt ctccagcgtt ttctgcgaag 3120 aggagttatc cgaagaacac gttgcgaatg ttgcaaacaa cgtacctttg ttcggtgaaa 3180 cgttcaacat gctcactgtc gatgataata tgatcatcga ggcggctgca cggttaaaat 3240 catcgttgaa ccccggtccc gatggaattc catctgtttt gctgaaggcg cacatcgtca 3300 atttgctggt tcctatacgc catgtattca acctatcgtt aaatagtgga cgatttcctg 3360 cggcctggaa aactgctact atgtttcctg tgcataaaaa aggcgatcgt aaggatgtca 3420 ataattatcg aggaatttca gcactttgtt gtatttctaa gctcttcgag ttgatagtca 3480 tgaaaccaat gctgttccac tgtaaacact acattagtga ggatcaacac ggattcgtat 3540 cggggcgttc cacaacgacc aacctgttgt gccttacttc acacattgcc gacagctttg 3600 ctgatcgagc ccagacggat gtgatctata ccgacttgac cgctgcgttc gacaaaataa 3660 accacgcaat aactgtcgct aagttggaaa ggctcggtgt cgggggccta atgttgcgct 3720 ggttcagctc gtatctgact gatcgtcgct tgactgtgga gttggcagga ttctcttccg 3780 aatctttcac ggcaacttcc ggagtacctc aaggaagcca tttgggaccc ctgatttttt 3840 tgctgtactt caacgatgtc aacacagtaa ttaaaggacc tcgattatct tatgcagatg 3900 atctaaaact gtttctgcgg gtccgtacca tcgccgacgc cgttgctctt caacatgatt 3960 tagatgtttt tgtagtttgg tgcgacatga atcgattgac agttaatcct gagaaatgct 4020 cgattatctc attctcacgc aaactgcagc ccatcctgtt ctgttaccag ctcaagggaa 4080 aggtgcttca acgagtagaa gatgttaaag atcttggcgt tattctggat gctcagttga 4140 cgttcaaaca ccatctgtcc tatgtcatcg agaaagcctc caggacattg ggttttatct 4200 tccgtattgc caaggatttc acggatccat attgtctcaa atctttgtac tgctcattag 4260 ttcgggccac acttgaatat tgctccgcag tttggcatcc gtattaccgt aacggcgtgg 4320 agaggataga gtcagtgcaa cgtcgcttca tccgttatgc tctccgtcga cttccatggc 4380 gagacccgtt ccggttgcct agctaccaaa gtaggtgtca actcattcaa ctggaatcac 4440 tgcagacccg tagagacgtc aatagagcga tggtagtggc tgacttgctt tcaggaagaa 4500 tagactgtcc tgcgctcctg gaatctatca atgcgagtgc tcactccagg acgttgcgta 4560 acaatgctat gttgaggtta ccattgcgac gaactaacta tggccaaaac agtgctatgg 4620 tcggtcttca acgcgtgttc aatagggttg ctgaagggtt tgattttcac ctgtcccgtc 4680 aagtaatacg tcgtaactat gtagaaattt tatcccgact ctctgtgtga atatttgtga 4740 ctttcatttg ttttattttt tgacattttg ttgccttgtt ttgttaagtt tcaacttagt 4800 attttactgt attttttagt ttaaaacatc attggggctt actgtctgtt ggtgtataaa 4860 tataaataaa taaa 4874 // ID hATm-53_HM repbase; DNA; INV; 3587 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-53_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3587 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1947-1947 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1390..1884,1859..2593,2614..3363) FT /product="hATm-53_HM_1p" FT /translation="MYITEELDKVLQNSVKNKNIRLKRMEEARDKSSITIN FT AVKSNELEASFYSSGNDSESELSLDSSDDEKLPNRFSHSTKSKIMISISAE FT DLVDCTAAVSARFRVGIRPQTSLVAAICNKAGVNLEDINLSRSTVHRKRFK FT KIEILGDKLRQEIITTLKGKKLCVAFREKSYVLHFDGKRIKQIEEDLKITV FT SVERIAVSVTSPDMDDTDDILLGVVQTPSSKGNDQAEAILNLLEYYDIVDQ FT IFSVCCDTTASNTGIHSGAVVILSSILNIPLIWFLCRRHMLEIHISHFMEA FT LTGEKTKGPRKGLYVKLQKNWRFFKKEVDKMIDLVRFDWSQLQVESPLYEI FT AKKALEFGKEALASKTFVRNDYRKLCELFVFYLGGEVPGLCFHQPGACHEA FT RYSILKEKKLSIINNIININGDKILLFRFMADALYILTLRITDKITMIMSE FT VEKKMIETAAFFASVWYAPWFLKSYLVASSPSNDLAAFKNAFCIKEKYPNL FT GSALVASMQRHTWYLTEQLVLLSLADDDVEQEVKKEMLDRLVQFDVPDKFK FT IGKPELPIISESTELWELVGPESWLLLKIAEVPDGEVELWRMEKAPKSLDL FT FKKFVKNLTCVNDCAERNIRLIQDFVGGYKSDNMQQKLMLVARDNRKKLKK FT DLSKSQLKNI*" XX SQ Sequence 3587 BP; 1271 A; 518 C; 662 G; 1133 T; 3 other; ttagggtgtc ccaaaaaaaa atttttttgc acgcatgttt ttcccattcc tttatgtcta 60 ctgagttcat atatatgact aaaaagtttt tctgaggtca atttcagcta tatgtaaaat 120 ctcaatttct taatttttga taaaatttga gctttttttt accttttttt agattccaac 180 aattttacat ttttaatgtt ccttacccta gagaatttag aaaaccaatg tttttagact 240 atataagaat aagtctatct ggcctatttg gtacagaagg ttcattggag cctgatttac 300 agagggggtc gataatgatg actagttgar gaacctttgt gttttmtgta tactgttttg 360 ttttaacagg attttgtcta ttcttttgat tttaaccgct tatagtaata attagaagtt 420 aaatatgaac ttgttttttc aaatataatt atgtaatagt tatgaatacc ttcaatatac 480 atatatttac ttgaaaattg tatagttaaa tgtaattttt ttagggggac actagaataa 540 atttttaaaa atttgtttta gcattaacta ataataaaag tattaaataa aatgggtcgt 600 actcaaaaaa acaaaaaaat aaaggctgag aagagaaagt acagcatgtc aacaaatcca 660 agaaggctgc tgtttcttct aaagtacaca atcaatgagc ttccaataaa ccagctggca 720 accatcggaa atattcttag atacattcag tttctaaaat ctcccgggta raacaaataa 780 ttattatttt atatataaat atactataaa acaatgataa aatttaaaaa tgatctattt 840 tatgcaatac atatttaatc taatttatct cccccctaac gtttaaaatg agacacaatg 900 aggcaaaaat tttatgtaag aacaaaaaaa ccataacgat aaatacttgt ctatttttat 960 tttagatcaa taagatttca atctacaaag ttgatagtag cctgccctca aaagcacgag 1020 aacagggatt tgatctgcaa agatagtttc tgcaagaatt ctgaacaaag atgtgttgcc 1080 tccgctgtaa tagacgcttg gaagaaagca gggtttcaag attttatcat cagtggatcc 1140 agtgtcagag ataaaatact gaagctccat agtaagtaca taaatctgca tcatctcaaa 1200 tcattgaatt ggaactctag cgatggcaac ttctctaaaa tacagaatga gtttttagag 1260 gagagtaata cactctttga tatttcattg gcaaattttg aacaaaaagt aaatcaggat 1320 cgttttcgtt cggagaatgc taaaaaggag gatattaggt tttatctaga tcaaaaaaga 1380 gagcgaaaca tgtatattac agaagaattg gataaagtat tacaaaattc agttaaaaat 1440 aaaaatattc ggctcaagag gatggaggaa gcaagagata aaagtagtat cacaattaat 1500 gcagtaaaat ccaacgaact tgaagcttct ttttattcat ctggaaatga ttctgaatcg 1560 gaacttagtt tagattcttc agatgatgaa aaattaccta atagattcag tcattctact 1620 aaatctaaga taatgataag tataagtgct gaagatctgg tagactgtac tgctgcggtt 1680 tctgcccgat tcagagtagg tatacgccca caaacttcat tagtagctgc tatttgcaat 1740 aaagctggtg taaatctaga agatattaat ctatcaagaa gtaccgttca tagaaaaaga 1800 ttcaaaaaaa ttgagatatt aggagataag ctcaggcaag aaattataac tactctgaag 1860 ggaaaaaagc tatgtgttgc attttgatgg aaaaaggata aaacagattg aagaagatct 1920 aaaaataact gtttctgtag aaagaatagc ggttagtgtt acatcccctg atatggatga 1980 cacagatgac atactgctag gtgttgttca gacaccaagt tctaaaggta atgaccaggc 2040 tgaagcaatt ttgaatttgc tagaatacta cgacattgta gatcaaatat tttccgtttg 2100 ctgtgacact actgcatcca atactggaat acactctgga gccgttgtta tccttagttc 2160 cattcttaat attccactta tatggtttct ctgtagaaga catatgctag aaatacacat 2220 ctcacacttt atggaggctt tgactggaga gaagacgaag ggcccaagaa aagggctgta 2280 tgtcaaactc cagaaaaatt ggcgtttttt caaaaaagag gttgacaaga tgatagatct 2340 agttaggttt gattggagtc agcttcaggt tgagtcacct ctttacgaaa ttgctaagaa 2400 agctctggag tttggaaaag aggcccttgc ctcaaaaact tttgtgagga atgattacag 2460 gaaactatgc gagctttttg ttttctattt gggaggagaa gtacctggac tgtgttttca 2520 tcaacctgga gcctgccatg aagctaggta cagtatatta aaagaaaaaa agttaagtat 2580 aataaataac atataaataa aagaatataa taaataaata ttaatggtga caaaatctta 2640 ctctttaggt ttatggctga tgctttatac attcttaccc tgagaataac cgataagatc 2700 acaatgatta tgagcgaagt ggaaaaaaag atgatagaga cagcagcctt tttcgcatct 2760 gtctggtatg caccttggtt tctaaagtcg tacttggttg caagttcacc ctcaaatgat 2820 cttgcagctt ttaagaatgc tttttgtatc aaagaaaaat atcccaatct tggttcagca 2880 ttggttgcta gtatgcaaag acatacctgg tacttgactg agcagttggt gctgctatct 2940 ctggctgacg atgatgtaga gcaggaagtg aaaaaagaga tgttggatag attagttcag 3000 tttgatgttc ccgacaagtt caagattgga aaacctgagc taccaatcat atcagagtcc 3060 actgagcttt gggaactggt aggtccagaa agttggcttc tgttaaaaat tgcagaagtc 3120 cctgatggtg aagtagagct gtggaggatg gaaaaagccc cgaagtcatt agatttattc 3180 aaaaaatttg ttaaaaattt gacttgcgtt aacgattgtg ctgaaagaaa catacgcctc 3240 attcaggact ttgtaggtgg atataaatct gacaatatgc aacaaaaatt aatgctggtg 3300 gccagggaca atagaaagaa gcttaaaaaa gatctttcca agtcccagtt aaaaaatata 3360 taatttatta ttcttgaaat acatattttt tgacacactt tagtattttt attaggaaaa 3420 tgtgtaaaat tttggttttt aacaaaaaat ggaattttac atattgctga aattgtggtc 3480 tgacaatgtt tttttttata tatttgcaat caggggacac tagagaatgg gaaaaaattg 3540 cgtgcacata gaaattaaaa aaaaagtata aatttgggac accctaa 3587 // ID Gypsy-16_AA-LTR repbase; DNA; INV; 136 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_AA_; KW Gypsy-16_AA-I; Gypsy-16_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-136 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1002-1002 (2011). XX DR [2] (Consensus) XX SQ Sequence 136 BP; 37 A; 31 C; 30 G; 38 T; 0 other; tgtgggagga tcgggtagcc cctcggcaaa aaacttcacg aagtcagttc taacgatacc 60 gccgtaccgg tgaacacgcg tgtgcgcgac atattatatt ttccttgaat aaagttataa 120 tttcgatatt ctctca 136 // ID Copia-3_DWil-I repbase; DNA; INV; 4052 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_DWil_; KW Copia-3_DWil-LTR; Copia-3_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4052 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 1913987 1909936. XX CC 'GTCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 132..4019 FT /product="Copia-3_DWil-I_1p" FT /translation="MSNLYHIEKLDEKNYDSWNIQMRSVLVHSDLWDVTAG FT NLPTVEGVDLGGLDRKALASITLSVKPSQLAYIKNCLTAFEAWTRLRDVHQ FT PKGPVRKVSLYNKLLSLRMTDGQDMPSYINGFSDVLDQLAGAGVQINEELR FT TIILLSSLPPVYENFVVAIETRDDLPSFEILCIKLKEEAERKRLTVTKENG FT QEAFVAMQNRNRNQNARTQRNRSQITCYKCGRQGHVKAQCGKERVDKKEKE FT YKYTEKMENKQCSLINALEAENMKREKWCLDSGATSHMCCDKSMFSDFSVH FT DEKISLADAGYLRAEGKGKVTIRTGICKLTMNNVLYVPGLAGNFMSVARVI FT EYNSVVHFEKHMAKIIQNGECILKAKKIGNLFVFEAESENLFAAVGEDVSL FT WHKRFGHLNYKSLTQIASKGLVRGLSVTNFAPNTPCKTCMVSKIHVQPFPK FT MTESRSSELLQLVHSDVCGPFGTKSLGGSRYFLTFIDDKSRRIFVYFLKGK FT DEVFGKFLEFKSLVERQTGKKLKCIRSDNGREYVNNAFDDYLKKNGILRQL FT TIAYTPQQNGVAERANRTLVEMSRCLLAQSGLCEALWAEAIFTAVYLRNRS FT PTSALTNQTPMEAWTGKKPCINHLKVFGSVAVALSKGHQESKFRPKGKEYR FT MVGYSREAKGYRLYDGETRKVVERRDVLFDERDPIEQTNCTTVMFDLPGRY FT RQNEIADANPEINESESDASEGQSAESDADEESFHSAPEEKLAQDESEMRV FT GPGRPRVIRTGKPGRPRKQYNILASIVAGEVPTPSTYEDAISGPYASQWQE FT AMDKEFGALVKNQTWELIDLTNKQRTIGCKWVFSLKKNAQGDIERFKARLV FT AKGCSQQFGVNYTDTYAPVCRLESVRFVLAVAAELDLYLHQMDVCTAYLNS FT DLEDTVYMRQPMGYTDKNRPKAALHLKKAIYGLKQSGRAWNSKLDAVLRDL FT GFKPCSSEPCLYQNHGEEDLTLILVYVDDLLIACRSKQKMDAIKAAISNAF FT DCTDKGATELFLGMEIHRDGELGPITLGHSQYINDMLVRYGIENCRPAVTP FT LDAGHQVACDNKQCKRVDIGSYQTQIGELMWLALTTRPDILHSVAKLAQRN FT QDPHSEHEAGVKHILRYLASTMDKKLRYKRTGQAFSGYVDADWGGDKTDRK FT SYTGYVFFLAGGPISWKSEKQRSVALSSTEAEYIALSTACKEAIVLRRLII FT ELGCGNAETPTVVYGDNLSAQQLAKNPVHHSRTKHIDIRYHFVRQTVSEGF FT VELKYVPTDLMIADILTKNLPKKKHNDFVNMLNLS" XX SQ Sequence 4052 BP; 1291 A; 709 C; 1033 G; 1019 T; 0 other; ggttatgggc ccagggcacg gttttgaata gtaaagtaat gaagtgatat atttaacgat 60 ttatatcaga gagcattaaa gctacgaata agttacgaac tttattcggg agaagagact 120 caaaagacaa catgagtaat ctgtaccaca tcgagaagtt ggatgaaaag aattatgatt 180 cttggaatat acaaatgcgc agcgtgttgg tgcattcaga cctgtgggat gtaacggctg 240 gaaatttgcc cacggtggaa ggtgtcgatc tcggaggatt ggatcggaag gccctagcaa 300 gtattacgtt gagcgtaaaa ccatcgcaat tggcatacat aaaaaattgt ttaacggcat 360 ttgaggcgtg gacaagacta cgagatgtcc atcagccaaa gggaccggtg cggaaggtat 420 cgctgtacaa taagttgcta agcttacgca tgacagacgg tcaagatatg cccagttata 480 taaatggttt ctctgatgtg ttagatcagc ttgcaggcgc cggtgtacaa ataaatgagg 540 agttacgaac tatcattttg ttatcgagtt taccaccagt gtatgagaat ttcgttgtcg 600 ccatcgagac gcgcgacgat ttgcccagtt ttgaaattct ctgcataaaa ttaaaggagg 660 aagcagaacg aaaacgtcta actgtaacga aagagaatgg gcaagaggca tttgttgcta 720 tgcagaacag aaacagaaat caaaatgcac gcacacagag aaatcgttcg caaattactt 780 gttataaatg cggcagacaa gggcatgtca aagcacagtg cggcaaagag agagtcgata 840 aaaaagagaa agagtacaaa tacactgaga aaatggagaa caagcaatgt agcttgataa 900 atgctttgga agcagagaat atgaaaagag agaagtggtg cttggattct ggtgcgacga 960 gtcatatgtg ttgcgataag agtatgtttt cagatttctc tgtacatgat gagaagatca 1020 gtctggcaga tgcggggtat ttacgtgcag aaggcaaagg aaaagtcact attcggactg 1080 gaatatgtaa attaacaatg aacaacgtgc tttatgtacc agggctggca ggaaacttta 1140 tgtcagtggc gcgtgttatt gaatataaca gtgttgtaca tttcgaaaaa catatggcca 1200 agatcataca aaatggtgaa tgtattttaa aagcaaaaaa gataggcaac ttgtttgttt 1260 ttgaagctga atcggaaaat ctttttgctg ctgtcggaga agatgtatca ttgtggcaca 1320 aacgctttgg tcatcttaat tataaaagtc tgacgcaaat cgcaagtaag ggcttggtac 1380 gtgggcttag tgtgaccaac ttcgcgccca atactccatg taagacatgc atggtgagta 1440 aaatacatgt tcagccattt cctaagatga cagaaagtcg gtccagtgaa cttttacaat 1500 tggttcatag tgatgtctgc ggaccttttg gaacaaaatc actaggtggt tcacgttatt 1560 tcctaacatt tattgatgac aaatctaggc gaatttttgt gtatttcctc aagggcaaag 1620 atgaggtttt tggcaagttt ctggagttca agagtctagt ggagaggcag acagggaaga 1680 agctaaaatg tatccgcagt gacaatggac gagaatatgt gaacaatgca ttcgatgact 1740 acctgaagaa aaacgggata ttacgccagt tgaccatagc gtacactcca caacagaacg 1800 gcgttgcgga acgagctaac cgcactctgg tggaaatgtc taggtgtttg ttggcacagt 1860 ctggattatg tgaggcattg tgggcagagg ctatatttac agctgtatat ctgaggaacc 1920 gttcgcctac aagcgcactg acaaaccaaa cgccaatgga agcgtggact ggcaagaagc 1980 catgtattaa tcatcttaaa gtttttggtt ctgttgcagt tgcgctatcc aagggtcacc 2040 aggaaagtaa attccgacca aaaggaaagg aatatcgcat ggtgggatat tcaagagaag 2100 caaaaggata ccgattatat gatggagaaa ctcgcaaagt ggttgaacga agagatgttc 2160 tctttgatga gagagatccc atagagcaaa ctaattgtac gacggttatg tttgatttgc 2220 ctggacgcta taggcaaaat gagatagcag atgcaaatcc agaaataaat gaaagcgaga 2280 gcgacgcatc agagggacag tctgctgaat cagatgctga cgaggaatcg tttcatagtg 2340 cacctgaaga aaagcttgca caagatgaga gcgaaatgcg ggtaggacct ggcagaccac 2400 gagtaatcag aacgggaaaa cctggacggc cgagaaaaca atataatata cttgcttcga 2460 ttgtggctgg tgaggtacca actccatcga cctacgagga tgccataagt ggaccatatg 2520 catcgcagtg gcaagaggca atggacaaag agtttggagc tctggtaaaa aatcaaactt 2580 gggagctgat agacctaaca aacaaacaac gaacaatagg ctgcaaatgg gtttttagtc 2640 tgaaaaagaa tgcgcaggga gacattgaac gttttaaggc gcgattggtt gcgaaaggat 2700 gctcgcagca gttcggtgta aactacaccg atacatatgc acctgtatgt cgattagaaa 2760 gcgtaaggtt tgttttggcg gtggcagcag agttggatct ttatttacat caaatggacg 2820 tatgcacagc atacctcaat agcgatctgg aggacacggt gtatatgagg caacccatgg 2880 ggtacacaga caaaaatcgt ccaaaggctg cattacattt gaagaaggca atctacggtt 2940 tgaaacaatc aggaagggca tggaactcta agctggacgc tgtattacga gatttgggtt 3000 ttaaaccatg tagtagtgag ccgtgtctat accaaaacca tggtgaagaa gacttaacat 3060 taattcttgt ttatgttgat gatctcctta tagcttgtcg ttcaaaacag aagatggatg 3120 ccatcaaagc agcgatatca aacgcgttcg attgcacaga caagggtgca actgagctat 3180 tcctgggtat ggagattcat cgagacggtg agcttgggcc tattacttta ggccattccc 3240 aatatatcaa cgatatgttg gttcgatacg gtatcgaaaa ctgcagaccg gcggttactc 3300 ctcttgacgc agggcatcaa gtggcatgcg acaataagca gtgcaagagg gtggacatag 3360 ggtcgtacca gactcaaata ggcgaactga tgtggctggc tctaaccacc agaccggata 3420 tattgcattc agttgcaaag ttggctcagc gaaatcaaga tccgcattca gagcatgaag 3480 caggtgtgaa gcacatcctg cggtatttgg catcaacgat ggataaaaag ctacgctata 3540 aaaggaccgg ccaagctttt tctggatatg tagatgccga ttggggtggt gacaaaactg 3600 accggaagtc ctataccggc tatgttttct ttttggccgg aggcccaata tcttggaaat 3660 ctgaaaaaca gcgcagtgtg gcgctaagta gcactgaagc tgagtatata gctttgtcga 3720 ccgcatgcaa agaagctatc gttttgcgac gcttaatcat tgagctggga tgtggaaatg 3780 ccgagacccc aacagttgta tatggagata acttgagtgc ccaacagttg gcgaagaatc 3840 ccgttcatca ttctaggaca aaacatatag atataagata tcattttgta cgacaaactg 3900 tatctgaagg ttttgttgag ttaaagtatg ttccgactga tttaatgata gcggatattt 3960 taactaagaa tttgccaaag aaaaagcata atgactttgt gaatatgtta aatttgagtt 4020 aactttgtaa acatggttgc attgaggaag gg 4052 // ID Gypsy-33_OD-I repbase; DNA; INV; 5918 BP. XX AC CABV01002370; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_OD_; KW Gypsy-33_OD-LTR; Gypsy-33_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5918 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002370; Positions 10404 4487. XX CC Positions [3833-4300] - Integrase core CC 'GCGTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 71..1195 FT /product="Gypsy-33_OD-I_2p" FT /translation="MSKSPEELLFSPLLAGESNLIRPNLHAYELKIARETL FT DDRIKQVRNLFVKAVAEHGRQNTNELNEQFERRALGCIRKSCSVAFLDLYG FT RELAQLKTAKSALEYIVEIFGEESEEDQVSKAQNDLEKLTRRTDSNERFSA FT FLKRVQSLAGVVSAKQDARDHMAEMYFKRQIDPSNTAFLRDNCMLKSSASD FT IAKFLDERCRFLNANVSQIGLRKIDDFMDNTSDLLARQLDLIREENRKRQE FT ESDKKSADFAEAIKALTATVAQLKVEKSKVEPEQPKPVNQSTRNFSGFQPH FT FPQFSPQAQQFSPQFQQPQFFGQAPFNQQYRPRRPLFCTNCNQPGHGKSTC FT RLIKCYECQKQGHIARNCPNRQQSAQGQQPPN" FT CDS 2390..5863 FT /product="Gypsy-33_OD-I_3p" FT /translation="MKGFYQVALEPESAARTAFSCEYGQFSFKRMPMGCRN FT SPAVFCKIMDTALKDIDKSEVLSYMDDVVIHSVDEENHLINLKKFFQVCAV FT NNLRINIKKVNFFGRKLEFCGYEISDGYYKPSSTRIESIKSLAIPRNKDEC FT LSLFGALSYHRKFLPNFSAKALPITRTYRGPFLWTQEASRAFENLKREICE FT KVLDLKIPPLDDCLFVLETDASSEGFGGVLYVCYENNQSESHNHSENCLRP FT TAYYSLNFTESQLNYCTVEKELLAAKKCMERWAVFLKFRRFHWLTDNGNIS FT YCKTLKTNNNKIQRWVLELQAFDFRVIQKKSAQMQISDYLSRHNAKPIAEI FT AKLQAFKNEFSEMQILDPVLSQIIKFVKIDRWPNKPSPDQFFYALLRKNLN FT FYKNGELAVFYDEGPRICVPEALIPKIMAEYHNNCHTGIEQTYTKVRSKYI FT WPKMKECISEFVRTCVYCQSNKPNTNPNRPPIKSFPTPSGPYEVFGIDLIG FT PLSPTDSGNHYCLTMVDFFSKFGYAEPLMSKKAPEVLSQFKKILFRNPKFP FT KFVVLDNGLEFRAVADYLSQNNISPHFAPPRHPACNGAVEVFNRTLKSRLR FT ARTNFENWDNFLYEVLHEINSNEHSVTKLPPFTVQTGIKNCHSFFDPNFRF FT YGDKIEINYDEIRGRIDMEKQSRISKFDNVKFKEYQIGELVLIKNFRSGKP FT PFIGPFKITKKSAGGVNYECFELDTGKTFRRHAEAIKLFHQRESEAYSETS FT ESNDYEINEDCLTETFEQESIFPTLPFLAPKLTAKQNSNAAFLENVASGLI FT DNTVRKIVNEFLTKFYDQSKMEENHKFYFGDVLEELKSNVERAEIKNRGED FT FDRQTAPEGIPENFLRIFENSYTENSIPQNSPRNLEQFSEIVTPDHLQNIT FT VSDTESVEDANKLTSYDKLNLVTQSGAANLRQRKLVDYFETTEESSSEAES FT VISVSKIARKREREPSENSVPPVDSKNARLSLENDFSNLTMTLKLENEDKE FT FFNSIKRIEEIDICPDMIVENHCLLKLSTLPKDVLLFLAVKFNLPVQSCNT FT KPVLTLKIKRHLKREFPNWRKSETGEYLFFASFRVKEQISLYSLNVPELKT FT VCAAFKLDKIPGNSKTLLQQFIVEQFEIKYPRHPKIKHELIFLPDAPESP" XX SQ Sequence 5918 BP; 2004 A; 1124 C; 1148 G; 1642 T; 0 other; gtggtgaccg cttacggaca gagaatcgca aaaccaaaaa gaaccacaga agccttcttt 60 caaagttagt atgtcaaaat cgcccgagga gcttttattt agcccccttc tggcgggaga 120 atcgaattta ataaggccca atttacacgc atacgagctc aaaattgctc gggaaactct 180 tgatgacaga attaaacaag ttagaaacct tttcgtaaaa gccgtggcgg agcacggccg 240 gcaaaataca aatgagctta acgagcaatt tgagcgccgg gcactcggct gtattcgtaa 300 atcgtgcagc gtcgcttttt tggatttata tggtcgagaa cttgctcagc ttaaaacggc 360 caagagtgct ctcgaatata tcgtcgaaat ttttggggag gagtctgagg aagatcaagt 420 ttctaaagcc cagaatgatc tcgagaaact tacaaggaga acagattcta acgagcgttt 480 ttcggccttt ttgaagcgag ttcagagctt ggcaggcgtt gtttcggcaa aacaagacgc 540 tagagaccat atggctgaaa tgtattttaa aagacagatt gatccgtcaa atacagcctt 600 tttgcgagac aattgcatgc ttaaatcaag cgctagcgat atcgcaaaat ttcttgatga 660 gcgttgcaga tttttaaatg caaatgtatc gcagatcggc cttagaaaaa tcgacgattt 720 catggacaat acgtccgatc tccttgctcg acaattagat ttaattagag aagaaaatcg 780 caagcgccag gaggagagcg acaaaaaatc tgccgatttt gcggaagcta ttaaagcgtt 840 gacggccaca gtagctcagt tgaaagtcga gaaaagtaaa gtcgagccag aacagccgaa 900 acctgtaaac caaagtactc gtaatttctc gggattccag ccccattttc cgcagttttc 960 tccccaggcc cagcagtttt caccccaatt tcaacagcct caattttttg gacaagctcc 1020 atttaatcag cagtaccgcc ctcgtagacc tcttttctgc acaaattgca atcaacctgg 1080 ccacggtaaa tctacatgtc gccttattaa atgttacgaa tgccaaaagc aaggtcacat 1140 cgctcgtaat tgcccaaaca gacagcaaag cgcccaagga cagcagcccc caaactaatt 1200 actggagcgg cttctggtca aagcaccgct ccagtgatta gccacacaat agttaaatct 1260 attgggcggc agttttttgt aaatatatcg ctttttggcg atcaaattcg ttttttagta 1320 gatactggta gtcagttgaa cttaattcaa gaatgccata ttccaaaagg tacaataatt 1380 catcaaagcg atttgagagt gcagaactac agcggaggag acgtaaagat ttctggatat 1440 attgagagtg aatttaaaat tgataatatt tcttggggta aatcccgttt ttacgttgtg 1500 tcaaataatc tttattcgat ccttggtagc gcgtcattag aggagaacga aattattgta 1560 aacttgcaca gaggaaagct aatacagagc ggaccagtcg aaagatttgc gaaaataaat 1620 aaaattggaa tcaaggaaga agctgacttt gattttcaag ggatttcgac tcaaaactat 1680 acctttcggg gaaaatccga atgcatgatc gatttgacgg tcaataatct taccgacaca 1740 gcgtctctct tttttgaaac ttccgagctc agaaactcaa agctagaatt aattccgtca 1800 ttccaagtag tctcgcctga caatccaatt tttaggcttc ttgttttaaa cccaactgac 1860 cagacaagca aaataactgc taatacagtg tacataaaat tatttaaaat agcagaaatt 1920 gcgaatttaa agacagatga aaaatttgaa aaaataaaaa attccatacc atttggagcc 1980 ataaaaaatt taaaaatcca aaaagagttc gaagaactcg taaaaaaata cagtttcctg 2040 gttttgggag gatggagact tcctgccagc ttgtaatata acaaaatttt cgatagatac 2100 aaattgtaaa tatccaatag ccactgcccc atatagaacg ccttacgccc tgcgggggga 2160 attgaaatca ataataaagg attttcttga caacgatata atagagccat gtacatcttc 2220 ttggaattcg ccgatacttt tggtcaaaaa gcagaatgga aaatttagac ttgttgtcga 2280 ttttcgtaaa ctcaacaatg tgactgaaca agttaactac ccgcttccaa atttagagga 2340 cagtttatcg ttacttgaga agagcagaat attttcagca tgcgatttaa tgaaaggctt 2400 ttatcaagtt gcactagaac cggaatcggc tgcacgtaca gcatttagtt gcgaatatgg 2460 acagttttct tttaaacgaa tgcctatggg ctgtcgaaat tctcctgcgg tattctgcaa 2520 aattatggat acagcattaa aagacattga taaatcagaa gttttgtcct atatggatga 2580 cgtagttatt cactctgttg acgaagagaa tcatttaatt aatttaaaaa agttttttca 2640 ggtttgcgca gtgaataatt taagaattaa tattaaaaag gttaattttt tcggtcgaaa 2700 acttgagttt tgcggttatg aaatttcaga cggatattat aaaccatctt cgacacgtat 2760 agaatcaatc aaaagtctcg ctattccaag aaataaagat gagtgtttgt cattatttgg 2820 ggctctgtca tatcaccgaa agtttttgcc aaatttctca gctaaggctt taccgattac 2880 tagaacttac cgtggtccct ttttatggac acaagaagcg tcaagagctt ttgaaaattt 2940 aaaacgagag atttgcgaaa aagtacttga tttaaaaatt ccaccattag acgattgttt 3000 atttgtactc gagacggatg caagctccga gggtttcgga ggcgttttat atgtttgtta 3060 cgaaaataac cagtcagaat cgcataatca ctccgaaaat tgcttaaggc caacggctta 3120 ttactcactt aattttacgg aatcacaatt aaattattgc accgtagaaa aagagctcct 3180 tgctgctaaa aaatgtatgg agcgctgggc tgtttttctt aaattcagaa ggttccattg 3240 gcttacagat aacggaaata tttcatattg taaaacgcta aaaaccaata ataataaaat 3300 tcaaagatgg gtccttgaac ttcaagcatt tgattttcgg gttattcaga aaaaatcagc 3360 gcaaatgcaa atttcggatt atttatctcg tcataatgct aagccaattg cggaaatcgc 3420 taaattacag gcgttcaaaa acgagttttc tgaaatgcag attcttgatc ccgttctttc 3480 ccaaataata aaatttgtaa aaatcgaccg ttggccaaat aaaccatcgc cggaccagtt 3540 tttctatgcg cttcttcgaa aaaatttgaa tttttacaaa aacggagagc tggcggtatt 3600 ttacgacgag ggaccaagaa tatgcgttcc agaggctttg atcccgaaaa taatggcgga 3660 atatcacaac aattgtcata caggaataga acagacttat acgaaagttc gaagtaaata 3720 catctggcca aaaatgaagg aatgcatttc ggagtttgtt agaacttgcg tgtactgcca 3780 gtcaaacaag ccgaatacaa atccaaatag accccccata aaatcatttc caaccccctc 3840 gggaccatac gaagtctttg gtatcgacct tattggccca ctttctccaa ccgattcagg 3900 aaaccattat tgtttgacaa tggttgattt tttttcaaag tttggatatg cagaaccgct 3960 tatgtcgaaa aaggccccag aagttctcag tcaatttaaa aaaattttgt ttcgtaatcc 4020 taaatttcca aaatttgtag tactagacaa tggcttagaa ttccgagctg tagctgatta 4080 tttatcccag aataatattt caccacattt cgcacccccg agacacccag cctgtaatgg 4140 cgcggttgaa gtttttaaca gaacactgaa aagtagatta cgagcgcgga caaattttga 4200 gaactgggac aattttttat atgaagtcct acacgaaata aattcaaatg agcattccgt 4260 gacgaaatta ccaccattta cagtacagac aggtataaaa aattgtcaca gtttttttga 4320 tcctaatttc cgattttatg gcgataaaat agaaataaat tacgacgaaa tacgtgggcg 4380 catagatatg gaaaaacaat caagaatctc aaaattcgat aacgtgaagt ttaaggaata 4440 ccagataggt gaactcgttc taattaagaa ttttcggtcc gggaagcccc cttttattgg 4500 accattcaaa atcacaaaga aatcagcggg tggagtgaat tacgagtgct ttgagctaga 4560 tacgggcaaa acattcaggc gacatgcgga agccataaaa ctttttcacc agcgggaatc 4620 ggaagcttat tcagaaacaa gcgaaagtaa tgattatgaa atcaacgagg actgcctaac 4680 agaaactttt gaacaagaaa gtattttccc aacgttgcca tttttggcac caaaattgac 4740 ggcaaaacaa aattctaatg cagcttttct cgagaacgtc gcaagcgggt taatagataa 4800 tacagtcagg aaaattgtta atgaattcct tacaaaattt tatgatcaat caaaaatgga 4860 agaaaatcac aaattttatt tcggggatgt cctcgaggag ttaaaaagta atgttgagcg 4920 cgctgagata aaaaatcgtg gagaagattt tgacagacaa actgcgccgg aaggtattcc 4980 tgaaaatttt ttgagaattt ttgagaactc atatactgaa aactcgatac cacaaaattc 5040 accgcgaaat ctcgaacaat tttcagagat cgttacgcct gatcatctac aaaatataac 5100 agtttcggac acagagtccg tcgaagatgc taacaaatta acctcctacg acaagttaaa 5160 ccttgtaact caatcaggcg ctgcgaacct tagacaaaga aagttggtcg attattttga 5220 aacaaccgag gaatcctcta gcgaggcgga atcggtcatt tctgtatcta aaattgctcg 5280 gaagcgggaa cgcgagccaa gcgagaatag tgtaccacca gtagactcta aaaacgcgag 5340 attatcattg gaaaacgatt ttagcaattt gacaatgaca ctgaaattag aaaatgaaga 5400 caaagagttt tttaatagta tcaaaagaat agaagaaatc gatatctgcc cggatatgat 5460 tgttgagaat cattgcttgc ttaaactttc gactctacct aaggatgttt tattgttttt 5520 ggctgtcaaa tttaacttgc ccgttcagag ttgcaacact aaaccagttc taacgctaaa 5580 aataaaaaga catcttaaga gagagttccc aaattggcga aaatcagaaa caggtgaata 5640 tctcttcttt gcatctttta gagtgaaaga gcagatatcc ctgtatagct taaatgtccc 5700 ggagcttaaa actgtttgtg ctgcttttaa acttgacaaa attcctggta actcaaaaac 5760 gcttttgcaa caatttattg tcgaacaatt tgaaattaaa tatccacgtc atccgaaaat 5820 caagcatgag cttatatttc tgccagatgc tccagaatct ccttagaaaa aataaaaaga 5880 tgctctgaca aaaaaagaag ccccatgagt ggccacag 5918 // ID AACOPIA1_LTR repbase; DNA; INV; 270 BP. XX AC AF134899; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 30-MAY-2000 (Rel. 5.04, Last updated, Version 1) XX DE AACOPIA1_LTR is a long terminal repeat from AACOPIA1, a DE copia-like LTR retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; AACOPIA1; KW AACOPIA1_I; AACOPIA1_LTR; MOSCOPIA; endogenous retrovirus. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-270 RA Tu Z. and Hill J.J.; RT "MosquI, a novel family of mosquito retrotransposons distantly RT related to the Drosophila I factors, may consist of elements of RT more than one origin."; RL Mol. Biol. Evol 16(12), 1675-1686 (1999). XX RN [2] RP 1-270 RA Tu Z.; RT "Mosqcopia, a novel family of LTR retrotransposons in Aedes RT aegypti are associated with genes and other transposable RT elements."; RL Unpublished. XX RN [3] RP 1-270 RA Tu Z. and Hill J.J.; RT "AACOPIA1_LTR."; RL Direct Submission to Genbank (16-MAR-1999)Department of RL Entomology, The University of Arizona, Forbes 410, Tucson, AZ RL 85721, USA. XX DR GenBank; AF134899; Positions 3916 4185. XX SQ Sequence 270 BP; 63 A; 72 C; 49 G; 86 T; 0 other; tgtggagaat gcatcggtgt accccttcac tactgcacga atacccattc ccaccattct 60 ggcaaccgtg cactctttcg tgtacgacag ctaggagaag agaaaaaaat gttttttttc 120 agttcaaatc taacactcgc cgcgtgcata caaactttgt atttctttgc ttaataaaac 180 catcgttaaa gttatttccg cgtgtgttct cgttattccg ctggttctcc cgaatccccc 240 gatccgctgt gttgcctgtt tctgctaaca 270 // ID Harbinger-N4_BF repbase; DNA; INV; 381 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N4_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N4_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-381 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-381 RA Kapitonov V. and Jurka J.; RT "Harbinger-N4_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 817-817 (2008). XX DR [2] (Consensus) XX CC This is a relatively old family: copies are 90% identical to the CC consensus. It contains 34-bp TIRs and TWA TSDs. XX SQ Sequence 381 BP; 134 A; 51 C; 73 G; 123 T; 0 other; ggccacacca aatttatttg ttgcttctcg gattttttca gaaaaaaatt ggaccgacag 60 gtcgaaaaaa aaaaataaat tttttcaaaa ggtctggggt gaagatatac ggcttaaata 120 acacaaaaaa aaacagcctg aggagtttaa tggcacgttt tatacaatga atcccttcct 180 ttgatttgtt actaatgttc tgaataaaag gcattgtttt tagactgaaa ttatgtgtat 240 gtggctctga atggtaatat ttacaggttt gtgatagcaa aacctgtatt attttttctc 300 gggacttaat tttttttttt gaaaatgaaa aaaaaaatgg acaggcgaat ccgagaagca 360 acaaataaaa ttggtgtggc c 381 // ID Sola1-N1_CQ repbase; DNA; INV; 1003 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Sola1 DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1003 RA Kojima K.K. and Jurka J.; RT "Sola DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 625-625 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >79% CC identity. 4-bp TSDs are usually TATA. XX SQ Sequence 1003 BP; 316 A; 163 C; 179 G; 345 T; 0 other; ctgccgctcc aatctagtcc gtcccatatg taaaaagtga gtgctgagat aacgccgtga 60 gaagcccaaa aaatgtttca ccttttttta catttttaat caacaatttt tacatatttt 120 ttcaacaatt atttcaatat cggttcaaaa aataccccta ccccaaattt gactcgaaaa 180 agttaaaatt tataggtaaa tttggtaaat tttcgtgctg ggactgacct gactttattt 240 gtccatcaat aaaggcaagt ctgtcaggtt atcaaggtag gcttgaacat ttacaacaga 300 aatttgcctg tcatttgttg gtttgttttg gtagtttttc aagtattttt tattaaatat 360 ctaaaagttt gtttgtttca ggttaatcta agttttagta aaattcctgg gtacgaaact 420 ttaaccggcg acgagacagt caatttcttt tgctgcagag cgaggagata tttgaatgca 480 cgtctcagca tatggaccct tgaattgacc gcaatctgcg ttggaaggga acctcatcca 540 gaaccaggac catgggtatt cattcaagaa ggaacagaac tgtttttttt aaatattagt 600 tacaatggta tgtagtttct tttttgaccg tggttgtccc ttgcatgagg aaaaaaggca 660 gaatacgtat cgagtataac gcttcaaaat acataaaatt aattaaatat caacaactta 720 agtatttttt taagttcaac aggctcttag agacatagtc tgtgggttat ttaaataatt 780 acacatttat tttatcgatt tgtgagacat ttctaaccaa gaactgactt gcctttattt 840 tgcatatggg acagcacaaa agtcattttt attttggaat ttttagaaca aacatgactt 900 ttttattaaa taggcttgaa gtagatgtta aaacaagact tttgtcaggt ttatctcaaa 960 atcgacaaaa atacatatgg gacggacttg cttcgagcgg cag 1003 // ID Transib1_AA repbase; DNA; INV; 3683 BP. XX AC . XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 11-OCT-2010 (Rel. 15.11, Last updated, Version 2) XX DE Transib1_AA is a DNA transposon, a partial consensus sequence. XX KW Transib; DNA transposon; Transposable Element; KW Interspersed repeat; TRANSIB superfamily; DDE-class; TRANSIB4; KW Transib1_AAp transposase; Transib1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3683 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), e181-e181 (2005). XX RN [2] RP 1-3683 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [1] (Consensus) XX CC [1] Transib1_AA belongs to the Transib superfamily of DNA CC transposons. CC The consensus sequence is not complete; termini are not known. CC Transib1_AA encodes remnants of the Transib1_AAp transposase. CC The transposase is not perfectly recovered since the genome CC harbors only a small number of Transib1_AA copies, which are CC damaged by mutations. CC Conceptually translated Transib1_AAp transposase: CC DLAALLIREQKLXSLIDFVRQRFPNADTSNKNTRKQLRCFTLHFRNLWRKSSRSKERFFGANTVWLDGTV CC VLETKERKSEKRKLKAFSELCRRSKLRRVQALTENHTTQELVTASALSVARDGSRNTGKRIEHLAQSSKS CC ASEPTGNIPTVEEYSPEQALCFICRNNFSRMQYQDVRMTTLEKGISSYPSCNKVLLAKQDTYPEGITVNA CC SCAKVPLQNLIDHTIQRLIKFLNIDMLLDDVFQVVFKWGCDGSSNHSRYKQVFLNEDSDGELKEYNDSHI CC FAISLVPIQVLRRRPENNDQDVIWRKELPSSVSLCRPVKLLIAADLRRAEVREMDSQIQKLVPTQVNVKG CC VAVDVSAKLILSMVDTKVVNDLAGTSTQQCFICLRSGKRLNDSLGDQIFNAPDLLKCVFSPLHAQIRTME CC LLLNISYRLTLPKPTWRVSKSSSILKDRQVSIRKAFKERLGLRLNEPLPGGGNSNDGNTARRFFREVDTV CC AEITGLDKNLLFRFSIILKVVNSNREIDVEAFEHFLSETYCLYRQLYDWYVLSPTVHKLLLHGSDIVRHA CC ALPLGMLSEEAQETRNKSIRRFRENHARKFNRTVNFEDVFNATYVDFPPGDLAYE. CC [2] Consensus update. ~97% identical to consensus. 5-bp TSDs; CC usually CANTG. TIRs are ~690 bp long. XX FH Key Location/Qualifiers FT CDS join(847..1477,1569..1728,1788..2940) FT /product="Transib1_AA_1p" FT /note="transposase." FT /translation="MEVTRRDLAALLIREQKFQSLIDFVRQRFPKADTSNK FT NTRKQLRCFTLHFRNLWRKSSRSKERFFGANTVWLDGTVVLETKERKSEKR FT KLKAFSELCRRSKLRRVQALTENHTTQELVTASALSVARDGSRNTGKRIEH FT LAQSSKSASEPTGNIPTVEEYSPEQALCFICRNNFSRMQYQDVRMTTLEKG FT ISLYPSYNKVLLAKQDTYPEGITVNASCAKVPLQNLIDHTIQRLIKFLNID FT MLLDDVIQVVFKWGCDGSSNHSRYKQVFLNEDSDGELKEYNDSHIFAISLV FT PIQVLRRRPENNDQDVIWRNELPSSVSLCRPVKLLFRKENTDLTRAEVREM FT DSQIQKLVPTQVNVKGVAVDVSAKLILSMVDTKVVNDLAGTSTQQCFICLR FT SGKRLNDSLGDQIFNAPDLLKCVFSPLHAQIRTMELLLNISYRLTLPKPTW FT RVSKSSSILKDRQVSIRKAFKERLGLRLNEPLPGGGNSNDGNTARRFFREV FT DTVAEITGLDKNLLFRFSIILKVVNSNREIDVEAFEHFLSETYCLYRQLYD FT WYVLSPTVHKLLLHGSDIVRHAALPLGMLSEEAQETRNKSIRRFRENHARK FT FNRTVNFEDVFKRLMLTSDPVISLTNRRNTGRGKDNEFPAEASKLIKNN*" XX SQ Sequence 3683 BP; 1204 A; 655 C; 707 G; 1117 T; 0 other; cacactggtc caggaagcta atttaggcgg acatgaggtt ttcgtcaaga tttattgatt 60 ttagagccat agtttcttct gagaatatgt tcagaatttt cagtactaca aaatgtacgg 120 atcaaaaaaa tttttacggt gatcgcaaga gtgacgtata cgccaacttt tttattttaa 180 cgaatcgtaa gttggtatgt tcagcaaagt tgtagcaaat gatcatagaa acaactttgc 240 cgaacaaact ttttctcggt tttgcttaag agccgagata attaagtatt tttaataaaa 300 aatcgttttt tgctcctaag tgttttattt ttcagtttta ttggcctact atgttctaca 360 aagtttttac actcatcatt tgctacaact ttgctgaaca taccaaagtt gtatttctta 420 tggttttcat gttatttgag tttttcttct taaaattagc tattttgtat gccttgtatc 480 tcaactatga gcacctcaaa aaaatatatt tttccactaa atgattttgc aatctttcaa 540 gtacctgtga gcaaaatttg gagaagatga tatttttcta tctcgtttaa aatcgatttt 600 actaatagac gatatgagat aaatccatat ttattgagta ttcatacatg ttcaacacac 660 aaaagtcatg gctacctaag cacatcagac caaaaggaac acgtgctcta tgtgtcaaca 720 tttcaaaaac gaattatagc aaaagctctt agaatatatt tctctcttcg cattgattct 780 actattacct ggtaaaacgg catatcatag taaccattaa aaagcttgtc ttagtcagtt 840 accagcatgg aagtgacacg tcgggatttg gctgctcttc taatacgaga acaaaaattt 900 caaagtttga ttgattttgt tcgccaacga tttccgaagg ctgacactag taataaaaat 960 actagaaaac agctcagatg ttttacgcta cattttcgta atctgtggag aaaaagtagt 1020 cgttcaaaag aacgtttttt tggtgcaaat acagtttggt tggatggtac agtggtgctg 1080 gaaaccaaag aaagaaaaag tgaaaaaaga aaacttaagg cattttccga actgtgccgg 1140 aggagcaaac tgcggcgtgt acaagcattg acagaaaatc acacgactca agagctcgtt 1200 acggcatctg cgctgagcgt tgcacgcgat ggttctcgta acacaggtaa aagaatagaa 1260 catcttgcac aatcgtcaaa gtctgcatca gagcctaccg gaaatatacc tactgttgag 1320 gaatattcac cggaacaagc cttgtgcttc atatgccgga acaatttttc aagaatgcag 1380 tatcaagatg tccgaatgac tacgttagaa aaaggaattt ctttgtatcc tagttacaac 1440 aaagttctac tggccaaaca ggatacatat ccggaaggta atgtgtcaat ttgaatgatt 1500 gtaattgcca aaggcgtcta aattaagaaa taagttttgt tcattctttc attcttcaac 1560 catactaggt atcactgtga acgcatcttg tgctaaggtt ccattgcaaa atttgattga 1620 tcatacgatt caacgactca tcaagttcct aaacatcgac atgcttttgg atgatgtgat 1680 tcaggtggtt tttaaatggg gttgcgatgg cagtagcaat cattcaaggt attaaggtat 1740 aataaatttg taactcaaca ttttatacgt ttatttgatc acttaaggta caagcaagtg 1800 tttctaaacg aggacagcga tggtgagctg aaggaataca atgattcgca tatatttgcg 1860 atttcattgg ttcctataca agttctacgt cggcggccag aaaacaacga tcaagatgtg 1920 atctggagaa atgagcttcc atcgtcggtg tcactttgcc gacctgttaa gctgctgttc 1980 cgtaaggaaa acactgattt aacaagagct gaagtaaggg agatggacag tcaaattcag 2040 aaacttgtcc caactcaagt aaacgtaaag ggagttgctg tggatgtttc agccaaacta 2100 attttatcaa tggttgacac aaaggttgtc aatgacttag cagggacaag tacacaacaa 2160 tgttttattt gtctaagatc aggaaagcga ttgaatgatt cccttgggga tcaaatcttc 2220 aacgctcctg atttattgaa atgtgtattt tctccccttc atgctcaaat tcgtactatg 2280 gaattgttgc ttaacataag ctaccgattg acgttgccca agcctacgtg gcgtgtcagc 2340 aagagtagta gcattctaaa ggatcgccag gtttccatac gaaaagcttt caaagaacgg 2400 ttgggactga gattgaacga acctcttcca ggtggtggga actcaaacga tggaaacacg 2460 gcacgtcgct ttttccggga agtagataca gtagcggaaa taacgggact tgacaaaaac 2520 ctcctattcc ggttctcaat cattttgaaa gtagtcaact ctaatcggga gattgacgtg 2580 gaggcatttg aacatttcct gtcagaaacg tactgtttgt atcgccagtt gtacgattgg 2640 tacgtactct ctccaacggt gcataaacta cttttgcatg gaagtgatat tgtccggcat 2700 gccgcattac cattaggaat gctgagtgaa gaagcgcagg aaacgcggaa taaatccatc 2760 agaagattcc gggaaaatca tgccagaaaa ttcaaccgaa ccgttaattt tgaagatgta 2820 ttcaaacgac ttatgttgac ttccgacccg gtgatctcgc tcacgaatag gcgaaatact 2880 ggcagaggaa aagacaatga atttcccgca gaagcttcca agttaatcaa aaacaattga 2940 aaaactatgt acttttcctt tttttataaa tacacgtgtt atctctgggc tgatgtgctt 3000 aggtagccat gacttttgtg agttgaacat gtatgaatat tcaataaata tggatttttc 3060 tcatatcgtc tattagtaaa atcgatttta aacgagatag aaaaatatca tcttctccaa 3120 attttgctca caggtacttg aaagactgca aaatcattta gtggaaaaat atattttttt 3180 gaggtgctca tagttgagat acaaggcata caaaatagct aattttaaga agaaaaactc 3240 aaataacatg aaaaccataa gaaatacaac tttggtatgt tcagcaaagt tgtagcaaac 3300 gatgagtgta aaaactttgt agaacatagt aggccaataa aactaaaaaa taaaacactt 3360 aggagcaaaa aacgattttt tattaaaaat acttaattat ctcggctctt aagcaaaacc 3420 gagaaaaagt ttgttcggca aagttgtttc tatgatcatt tgctacaact ttgctgaaca 3480 taccaactta cgattcgtta aaataaaaaa gttggcgtat acgtcactct tgcgatcacc 3540 gtaaaaaatt ttttgatccg tacattttgt agtactgaaa attctgaaca tattctcaga 3600 agaaactatg gctctaaaat caataaatct tgacgaaaac ctcatgtccg cctaaattag 3660 cttcctggac cagtgtgcat gtg 3683 // ID AeHerves2 repbase; DNA; INV; 2855 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A hAT DNA transposon family from Aedes aegypti. XX KW hAT; DNA transposon; Transposable Element; AeHerves2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2855 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2855 RA Kojima K.K. and Jurka J.; RT "hAT-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 3 CC sequences with >98% identity, and is ~99% identical to the CC original sequence in [1]. 8-bp TSDs. 13-bp TIRs. XX FH Key Location/Qualifiers FT CDS 696..2615 FT /product="AeHerves2_1p" FT /note="transposase." FT /translation="MWLIFTRIMAGRGRVTSGIWYHFSKLENGKGKCRYCH FT VLIAMASGSSANLKRHLKTKHPMVPVEIGDKNDTSNTASETVPSSSQKPIA FT NIIQQTSTMNTEECARPFGDSNATEVKQPFAQKSMTNFVEIIKPLSVQKQR FT SIDVQLLKTICKEYHPLSLVEGTEFKKFVNLLCPSYTLPSRKTLTYSLMPT FT VYEDVLAKVKADLDTTLAVSITCDGWTSSNNEGYYAVTAHFFDKCVKLKSC FT LLECSEFKDRHTGENIATWITRVLKSFGLESKVVAIVTDNAANMKSAASIL FT KMNHLSCFAHTLNLLVQGAIASSIQSIVDKVKSIVQYFKKSSYALEKLHDM FT QRKLDEAALKLKQDVPTRWNSTYDMLDRALKNKNSIISTLALLNNKLALNA FT DDWKVIEASVEVLRIFNEVTVEISSEQNVSISKTLVLTQIMKRRVQKFVAD FT DILPSEVVKLVKNLKDGLIKRFEMRSKNELISQAVILDPRFKKQGFDNEDD FT YKSAYDALLSQIRDLQEVKSIGEECSGEADVSERTSKSSLWEEFDDSVSKI FT QFKQDPDAVCSIELDKYIAEPILPRTNDPLIWWMERKCIYPNLFSIMQKRL FT CIPGTSVPCERVFSKTGQICNEKRSRLKAKNVSKIVFIHQNL" XX SQ Sequence 2855 BP; 918 A; 550 C; 593 G; 794 T; 0 other; tagagatggg caatccgttc gcgaactgtt caaaagaact agttcttcca agagaatgag 60 tgaactatcg ttctttttga aggaacggta gttctccgct ctcccaaaaa caaacgagct 120 gaatggttta aggaaccgtt ctcagataat ttggtgcgta tgaacgggtt ctagaaggat 180 gaatgctagt agaataaact tcgtttcact cacattcgtc aacaagtttt cgaaaccgtt 240 cttcgcttgc tctcacaaag agtacagcga agaacgatac ataaatagat cgaaatagag 300 ctaatagcca tacgatcagc acacacacta caacgtaggt tgtcatgttt gtttggtgat 360 tataatgtca aagcgaaata attttgtttt gaaaacgatg cactactaca aacaactgtt 420 tgcaacctct attctgcctc tcactcgctc tcagagagat ttctgtattg aagagtgcac 480 ctctctgttt acttttctcc catttcgact tttgacatcc ttcgaccgtg attcaataaa 540 gttttctggt acaagttccc attgaaaaac ctgttatttt cataaataat ctgcgtattt 600 cactgatcct gctgaatttc ccaatctttg agagtgggcc ctcgacaggt aggtggaaaa 660 aacctatttt gagcgatgat ttgcttcaga ttaaaatgtg gcttattttc accagaatta 720 tggctggccg tggcagagtt accagtggta tatggtatca tttcagcaag ctcgagaacg 780 gaaaagggaa atgccgctat tgccatgtat taatagctat ggcctcggga tcaagcgcga 840 acctaaaaag acatttgaaa accaagcatc cgatggttcc tgtggagata ggggacaaaa 900 atgacacctc caacactgca tccgaaacgg ttccttcgtc gtcgcaaaag ccaatcgcga 960 atatcattca gcaaacatcc acaatgaaca cggaagagtg tgcaaggcct tttggtgatt 1020 cgaatgccac ggaagtgaag cagccatttg cccagaaatc aatgaccaac ttcgtagaga 1080 tcatcaaacc tctttcggta cagaagcaac gttcgatcga tgtgcagttg cttaagacga 1140 tttgtaagga ataccatccg ttatctctgg ttgaaggtac ggaattcaag aagttcgtga 1200 atttgctttg tccgtcatac actctcccga gccggaaaac attgacgtac agtttaatgc 1260 ctacagtgta cgaagatgtt cttgccaaag ttaaagccga tttggacacc acactagcag 1320 tgagcatcac ttgcgatgga tggaccagta gcaataacga aggatattac gctgtcacag 1380 cacacttttt tgacaaatgt gtgaaattga agtcatgtct tctggaatgc tctgaattta 1440 aagacaggca cacaggtgag aatattgcaa cttggatcac gcgggttttg aagtcgttcg 1500 gtttagaatc aaaagttgta gctatagtca cggataatgc ggcgaacatg aaatccgcag 1560 cttcgatttt gaaaatgaac cacttgtctt gctttgctca cacacttaat ctgctggttc 1620 agggtgcaat tgcgagctct attcaaagca tagtagacaa agtgaaaagt attgtacaat 1680 acttcaaaaa aagttcatat gctctcgaaa agttgcatga tatgcaacgt aagttggatg 1740 aagctgcatt aaaattaaag caagatgttc caacgcgttg gaactccacg tatgatatgc 1800 tagatcgcgc tttaaagaat aaaaattcta taatttcaac tcttgcactt ctaaataata 1860 agttggcttt gaacgctgac gactggaaag ttatcgaggc ttcggttgag gttctgagaa 1920 ttttcaatga ggtcactgtg gaaatatctt ctgaacaaaa tgtttcaatt tcgaaaactc 1980 tagttctaac tcagattatg aagcgacggg tccaaaagtt tgtagcagat gacatccttc 2040 catcggaggt agttaaactt gtaaagaacc ttaaggacgg cttgatcaaa cggtttgaaa 2100 tgagatctaa aaacgagttg atttctcaag cagtgatact tgatcctcgt tttaaaaagc 2160 aaggattcga taatgaagat gattacaaat cagcgtatga tgcactatta agccagattc 2220 gtgatctaca ggaagttaag tcaatcggcg aggaatgttc cggggaagct gatgtttcgg 2280 agcgcacatc gaaatcgtca ctttgggaag agtttgatga ttctgtcagc aagatacaat 2340 tcaaacaaga tcctgatgct gtctgtagta tcgagctgga caaatacatt gcagagccca 2400 tcttacccag aaccaacgat ccgttgattt ggtggatgga gcgtaaatgc atttacccaa 2460 accttttcag cattatgcag aagcggctgt gtattcctgg tacttccgtt ccgtgtgaac 2520 gagttttctc caagacggga caaatttgca acgaaaaaag aagcaggttg aaagcaaaga 2580 acgtttcaaa aattgtgttc atccatcaaa atctttgaac aagatgtatg aaaattataa 2640 caatactggt tgaattctta ataaaaagca aatataaaac aaataaaact attattattt 2700 taaaacgtta tatctgaaac aaaacagttt gaatatacaa aaaaaaaaaa gcaaaaatta 2760 atcgaacgga agaacggttc aaatgaacta gttcttttcg aagaactgct cagctaagaa 2820 cggttcaacg aaaagaactg ttttgcccat ctcta 2855 // ID BEL-25_CQ-LTR repbase; DNA; INV; 237 BP. XX AC AAWU01010219; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-25_CQ_; KW BEL-25_CQ-I; BEL-25_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-237 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 204-204 (2011). XX DR GenBank; AAWU01010219; Positions 725 489. XX SQ Sequence 237 BP; 76 A; 54 C; 48 G; 59 T; 0 other; tgttactgca aaaaccgtag agaaaaaccc aaaaaacaac caacagattt cttccagtgt 60 attttagttt ctggtttgac cgcaccgcct tcagcgctgc tcgacctgat ttgacagatg 120 ccatttccgg gaagagttga gaataaacgt gtaaagagaa cggacgcgtt tttatttctc 180 tccgactgaa aacaaaaaaa atacagtcca tttgcgcgcg agctatccgg agtaaca 237 // ID Sola1-3_AP repbase; DNA; INV; 3762 BP. XX AC ABLF01009116; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-3_AP. XX NM Sola1-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3762 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(769..927,885..1094,1107..3122) FT /product="Sola1-3_AP_1p" FT /translation="MFEIIVWSSERNEITSIKENDNIVKGSSNEGITLKLL FT FKSIYFVKNVIIIQFFINIFCQKCNNYTIFLGVANPFLEPHIEFMQNDNME FT SSNYEQGDVGNTLDSEISTNENYNILSRNSDPGKITANNTILQYCVKIIII FT LIENAECSIHSSESDSDTSSLSLSSISNEDDTDADPNFELPANNSIILYEE FT NIVSNIIEKKIPRKVKNTSEIKKNVRKRKANQENWRKNVVKKLRNSGKSYV FT SSSKSNKFVEAKKVLPPCGEKCKLKCFTKFTEDNRQKIFEDYWKLGDIDKQ FT RYFLASSMTIVKPKYRYVRENSHRRENNSFHFEVNQIRIRVCKNFFKVTLC FT INDRPLRTVIEKQDSFTGKIIANDYRGKHKNHVKVPDTVKEDIRQHIRSIP FT RIESHYCRANTSKDFIEGGKSIADLHRDYKSECEEKNILAANYVMYARIFN FT EEFNISFFTPKKDQCELCISYENSNTVEKVQLQEKYDTHLKEKELSRLEKE FT KDKQANCVTAVYDLQAVLPCPLGNASSFYYVSKLNVFNFTVYNLQTNDVQC FT YVWHEGQAHRGANDIGSCILNYLKVLQQNAIETEASAKLDVIFYSDNCAGQ FT QKNKFILAMYLYAVLNFSHINSITHKYLITGHTQNEGDNAHSVIERNIKKS FT KKSGPIYVPEQYVSFIRSAKKSGNPYKVQEMCYKDFFDLKDLTTKLGINSF FT SNAFKITETKMFKVTKDDPDTIYFKNSYEQTEFNSISIKNKKTRKTSNQLN FT NNLELKRAYDVKPGITEAKKSGLLSLIRKNCVPNYYYDFYNNL*" XX SQ Sequence 3762 BP; 1499 A; 449 C; 543 G; 1271 T; 0 other; cgctcatccg gaagaaaaac gacacaagtc aaaaattgac aattttgggt tttttacaca 60 tagtaataaa acaagatgca cattttattg tgtaaaaacg acacttttgt aatttaaaaa 120 ttgtccacct gagtgcacta gtgtcacttt tcctttgata agaaaatatc atatgtgtaa 180 caagaatagt gtcactttat cagttactcc tagaccgtta tcgggtatgc gtgagttgaa 240 atatataaaa acgtcatatt tcgaattatt tttttttata tagtggtaca gttcaagaaa 300 aacgacacat ttagacttct gccactttac ttgatcaggt aagatgaaaa tcaaatacaa 360 ctaaattaaa acacaaatta tatcaaaatg tgtcattctt cttttaaaaa atattactta 420 tattctgtta caaatcaaaa taagtcacta tagtaattac attttaatta aactttattg 480 aaaatacaat ggttgcagtg tttatttgaa gttgtgacgt tattgatgtt tattttattt 540 aaataattgg caataattat tagtggttta acacacatta ttatttgtat tatttttatt 600 tagctatgtc acagacaagt agatctcggt taattttgaa attgtcaaaa gaaaaaataa 660 ctaaagatgg tcaaatcggt tgcaaagaag gtaatcataa tttataatgt acctatataa 720 atattattat ccattagtat tgatcgtttt aagaaaagta attttgaaat gtttgaaatt 780 atagtttggt catcagaaag aaacgaaata actagcataa aagaaaatga taatattgtt 840 aaaggcagta gtaacgaagg tataacttta aaattattat ttaaatcaat atattttgtc 900 aaaaatgtaa taattataca atttttttag gagttgcaaa tccttttttg gaacctcaca 960 tagaatttat gcaaaatgac aatatggaat ctagtaatta tgaacaagga gatgtaggta 1020 atacgttaga ctctgaaata tctactaatg aaaactacaa tatcctatct agaaattctg 1080 atccaggtaa gatataaaat aactaaactg ctaataatac aatattacaa tattgtgtta 1140 aaataataat aattttaata gaaaatgcag agtgttccat ccattcaagt gaaagtgatt 1200 ctgatacaag ttcattgtca ttgtcaagta tcagcaatga agacgatact gatgctgatc 1260 caaattttga attaccagcc aacaatagta taatattata tgaagagaat attgtaagta 1320 atattattga aaaaaaaatt ccaagaaagg taaaaaatac aagtgaaata aagaaaaatg 1380 tgagaaaaag aaaagctaat caagaaaact ggagaaagaa tgtggttaaa aaattgagga 1440 attctggtaa gagttatgtt tcatcttcaa aaagtaataa atttgttgag gcaaaaaaag 1500 tattaccccc atgtggtgaa aaatgtaaat tgaaatgctt cactaaattt acagaagata 1560 acagacaaaa aatatttgaa gattactgga aacttggaga tatagataaa caacgttatt 1620 ttcttgcatc cagcatgaca attgtaaaac caaaatatag gtatgttaga gaaaatagtc 1680 atcgtcgtga aaacaactct tttcattttg aagtaaacca aattagaatc cgtgtatgta 1740 aaaacttttt taaggtgaca ttatgcataa atgatagacc attacggact gtaattgaga 1800 aacaagattc attcactgga aaaattattg ctaatgatta tagaggcaaa cataaaaatc 1860 atgtaaaggt acctgataca gtaaaagaag acattcgcca gcacatacga tccatcccaa 1920 ggattgaaag tcattactgc cgtgccaaca catcaaagga ttttatcgaa ggagggaaat 1980 ctatagccga tttgcataga gattataaat cagaatgtga agaaaaaaat attttagctg 2040 ccaactatgt tatgtatgcc cgtatattca acgaagagtt taatatatca ttctttactc 2100 caaaaaaaga tcaatgtgag ctttgtattt catatgaaaa ttctaatact gtagaaaaag 2160 tacaacttca agaaaaatat gatactcatt taaaagaaaa ggaattgtct cggttagaaa 2220 aagaaaaaga taaacaagca aattgtgtga cagcagtcta tgatcttcaa gcagttttac 2280 catgcccatt ggggaatgca tcttcgtttt attatgtttc aaaactcaat gtttttaact 2340 ttacggtgta caatttacaa acaaatgacg tccaatgtta tgtatggcat gaaggtcaag 2400 cacatagagg tgctaatgat attgggtctt gtatattaaa ttatcttaag gtattacagc 2460 aaaatgcaat tgaaactgaa gcaagtgcta aattagatgt gatattttat agtgataact 2520 gcgctggaca acagaaaaat aaatttattt tagctatgta cttgtatgca gttctaaatt 2580 tttcacatat aaatagtatc acacacaagt atttaattac tggtcacaca caaaatgagg 2640 gcgacaatgc acactctgtt atagaaagaa atattaaaaa atcaaagaaa tctggtccta 2700 tttatgttcc tgaacaatat gtaagtttta ttagatctgc taaaaaaagt ggcaatccgt 2760 ataaagttca ggaaatgtgc tataaagatt tttttgattt aaaagatctt acaactaaat 2820 tagggatcaa cagtttttca aatgcattca aaataacaga aacaaaaatg tttaaagtga 2880 caaaagatga tccagatact atttatttta aaaactctta tgaacagact gaattcaatt 2940 ctataagtat aaaaaataaa aaaactagga aaaccagtaa ccaattaaat aataatttag 3000 aattaaaaag agcttatgat gtgaaacctg gtataactga agcaaaaaaa agtgggttgt 3060 tatctttaat tagaaaaaat tgtgttccta attattatta tgacttttat aataatttat 3120 aagtctgaat tctacctatt agttattttt atatttaatt acattctata aaaagatttg 3180 ttatgttcat tttttaataa tttaaaacat ttttaatgtc ttagtttttg ataataatat 3240 acttacttaa taagaaaaat attatgttac tattttttta aattcaaaag actgcttatg 3300 tttgagtttt tgttatactt gcctacttaa taagaaaaat attatgtttc tcatttttta 3360 aaaatttaaa aactgtttat gttattttat tgttatactc aattatgaaa gtattatgtt 3420 actaattttt aataatttta agaaaactga taatgtgcta gcttctgtta tgcctagtaa 3480 gaaaaaaatg atgttactca ttgatataat aatgaaataa attgtcaata tcaaaaaagt 3540 aatttgttat ttatttattt ttttaacatg gtaataatta aaattatatt atgtaacttt 3600 aaattattaa gtacaatatg caaaatcaca taaggtctga tttatttgca ttgttaatga 3660 ataaatatca aaatgactta caaacaaaat aaaaaaatgt ttaatccact ctcctgaaaa 3720 atagcacttt ttgatttgtg tcgtttttct tccggatgag cg 3762 // ID ASSAT1 repbase; DNA; INV; 123 BP. XX AC V00084; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Ascaris suum satellite DNA variant. XX KW SAT; Satellite; Simple Repeat; ASSAT1; Repetitive sequence; KW satellite DNA. XX OS Ascaris lumbricoides OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-123 RA Streeck E.R., Moritz B.K. and Beer K.; RT "Chromatin diminution in Ascaris suum: nucleotide sequence of the RT eliminated satellite DNA."; RL [Nucleic Acids Res 10(11), 3495-3502 (1982). XX DR GenBank; V00084; Positions 1 123. XX SQ Sequence 123 BP; 45 A; 24 C; 18 G; 36 T; 0 other; tgatcaatta ttcccttcca gatgataatg aaataattct caaattgcat gaataatgca 60 ttataaatgt cctttatcgt gacacaagaa ccgatgatct ggatccaact cgacggcaaa 120 taa 123 // ID Copia-1_Cfl-I repbase; DNA; INV; 4232 BP. XX AC AEAB01029077; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_Cfl_; KW Copia-1_Cfl-LTR; Copia-1_Cfl-I. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-4232 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01029077; Positions 12262 8031. XX CC Positions [1589-2122] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 82..1605 FT /product="Copia-1_Cfl-I_1p" FT /translation="MAENHGIPVFDGEDYGSWKKRITMYLKMKKCDVVITR FT ERIGTDNTEDWEQNNLKAINYIYSAITNKQLEFVSDKESAYEIIKKFDEMY FT LKESTALQIVLRNKLEKLKLKDYSDSATFFSDFEKSINELKAAGAKVEEKE FT KLNYMLRTLPESYSYVGDLIDTLKTEDQTVEYVKSKIKMLELKNRESESAD FT SKSNAFVTESKQNREWRNPADQGCFNCGKFGHYKKDCTESGKGSWREPHRG FT STRGRGQQRGTWRGGGYNKRGSGSTNGRGGHYNSRGRGHHWRREQNGQEAA FT SGGGVFHTVVEVNSLNREVKENNQIHWLLDSGCTDHIINNEKYFSECTTLN FT EPINVKVGDGRILKATKVGKVNILFPIYNRKRKVTLFNVFYVKEMKQNLIS FT YSQVTERNKIVSTGNKSKIYDKFNRLLAIAWKEGRLYKMTSFPCDESEVNV FT TSSYNKAGNISLKEKWHRILGHVNFNYLTTLCKNQLLDGIPNEIDTEYIKC FT RICRKQDAQFTF" FT CDS 1589..2944 FT /product="Copia-1_Cfl-I_2p" FT /translation="MHNLPFENNRTRAKEILEIVHTDVNGPHKNLGMNGEK FT YFVTFIDDYSKAAKVFVIKSKDEVYKCLLEYINEVENKTGKTIKKLRCDNG FT KEYLNRNIYKLAKEKGIFIDTCPPYVHELNGTAERYNRSIMDIARCLLAEA FT KVNVKFWPEAILAAAYLKNRTLANTIEKKTPYEILFGKKPSAKYLRMYGSR FT VFVRVPEQNRTSKWDRKADLGILLGYNEVGYRVLINNRIVIARHVDIVEND FT IRCIGLDEYENSSDSETGDENLDDCLENFEDCQEINEQELRGKENETESNA FT GQKQKNNQVQEIRKSSRVKKKPVRYDENYVYNNCIYINYCSADSPNNFEEA FT INCDESECWKQAMNKEFNCLSKNKTWKLVDQPKNKKVLDVKWVYTKKSENK FT FKARLVVRGFQQKEVVDDIYSPVAKMQTLKILLSYCCQKGLIIEQMDVETA FT FLNGKVL" XX SQ Sequence 4232 BP; 1714 A; 470 C; 881 G; 1167 T; 0 other; tggtagcaga acgtggttga ataaaaagat cactgataat aagatctcga aattgacaaa 60 aaaaaataaa ataaaataaa aatggcggag aaccatggaa taccagtgtt cgacggggaa 120 gattatggca gttggaagaa gagaatcacg atgtacttga aaatgaaaaa gtgtgacgtg 180 gtaattacaa gagaaaggat aggaacagac aatactgagg actgggaaca aaataatttg 240 aaagccataa attacatcta cagtgcaatt acaaataagc agttggaatt tgtaagtgat 300 aaagaatcag cgtacgagat aataaagaag ttcgacgaga tgtacttgaa agagtcaacg 360 gctttgcaga ttgtgttaag aaataaatta gaaaaattaa agttaaaaga ctacagtgat 420 tctgcaacct tttttagtga ctttgaaaaa tccataaatg aactgaaggc ggcaggtgca 480 aaagttgaag aaaaagaaaa actgaattat atgttaagaa cgcttccgga atcttatagt 540 tacgttggag atttgatcga tacactcaag acagaagatc aaacagtgga atatgtgaaa 600 agcaagataa aaatgctcga actgaaaaat cgagaaagtg aaagtgcaga ttccaaatcg 660 aatgcatttg ttacggaatc aaaacaaaat cgagaatgga gaaatccggc ggatcaagga 720 tgcttcaatt gtggaaaatt tgggcattat aaaaaggact gtacggagag tggcaagggt 780 tcatggcgag agccacatcg tggcagcacg cgcggaagag ggcagcagcg aggcacgtgg 840 cgcggtggcg gttacaataa acgaggcagt ggcagcacga acggccgcgg aggacattac 900 aattcacgtg gtagagggca tcactggcgg cgagaacaga acggacaaga agcagcatct 960 ggaggaggag tattccatac cgtagtcgaa gtaaactcgt taaatagaga ggttaaggaa 1020 aacaatcaaa tacattggct tttagacagc gggtgtactg atcatataat taacaatgaa 1080 aaatatttta gcgaatgtac aacattgaat gaaccaatta atgttaaagt tggggatggt 1140 cgaatattaa aagcaacgaa ggtaggaaaa gtaaatattt tatttccaat ttataataga 1200 aaaagaaagg taacattatt taatgtattc tatgtaaaag agatgaaaca aaatctaatt 1260 agttattctc aggtgacaga aagaaataaa attgtatcta caggaaataa atcaaagatt 1320 tatgataaat ttaatagact tctcgcaata gcatggaagg aaggtaggtt atacaaaatg 1380 acaagttttc cgtgtgatga atcagaggtt aacgtaacga gcagttataa taaagctgga 1440 aacataagct tgaaagaaaa atggcatcgt atactaggcc atgtaaattt taattattta 1500 actactttat gcaaaaatca attgttagat ggtataccta atgagataga cactgaatat 1560 ataaaatgta gaatatgtag aaaacaagat gcacaattta ccttttgaga ataatagaac 1620 tagagctaaa gaaattttag aaattgttca tactgatgta aatgggccac ataaaaatct 1680 aggaatgaat ggtgaaaagt actttgttac gtttattgac gattacagta aagcagctaa 1740 agtttttgtg ataaaatcta aagatgaagt ttataaatgt ctgttagaat acataaatga 1800 ggtagaaaat aaaactggga aaactattaa aaaacttaga tgcgataacg ggaaagaata 1860 cttaaacaga aatatttata agttagccaa agaaaaagga atttttattg atacttgtcc 1920 accgtatgta catgaattaa atggaacagc tgaaagatat aataggtcaa ttatggacat 1980 agctagatgt ttattagcag aagcaaaagt aaatgttaaa ttttggccag aggcaatttt 2040 agcagcagct tatcttaaaa atagaacact agcaaatact atagaaaaaa agacaccgta 2100 tgaaatattg tttggtaaga aaccaagtgc aaaatatctt cgtatgtatg gtagtagggt 2160 ctttgtacgt gtaccagaac aaaatagaac ttccaaatgg gatagaaaag cggacttggg 2220 aattttatta ggctataatg aagttggata tagagtctta attaataata gaattgtgat 2280 tgctagacat gtagatatag tagaaaatga tataagatgt ataggtcttg acgaatatga 2340 aaattcaagt gattcagaaa caggcgatga aaatctagat gattgtttag aaaattttga 2400 agactgtcaa gagataaatg aacaggaatt aagaggaaaa gagaatgaaa ctgagtcaaa 2460 tgcaggacaa aaacaaaaga ataatcaggt tcaagaaata agaaagtcaa gtagagtaaa 2520 gaaaaagcca gttaggtatg atgaaaatta tgtttacaat aactgtattt atattaatta 2580 ttgtagtgca gatagcccga ataattttga ggaggcgata aattgtgatg aatctgaatg 2640 ttggaagcaa gcgatgaaca aagagtttaa ttgtctaagt aaaaataaaa cgtggaaact 2700 agtcgatcag ccaaagaata agaaagtctt agatgtaaag tgggtgtata caaagaaatc 2760 cgaaaataag tttaaagcta gattagtagt aagaggcttt caacaaaagg aagtagtaga 2820 tgatatttat tcaccagtgg ctaaaatgca aacgctaaaa attttattgt catactgttg 2880 ccaaaagggt ttaattatag aacaaatgga tgtagagaca gcatttttaa atggtaaagt 2940 tttatagaga cagcattttt aaatggtaaa gtttggtaaa gttttcagaa atttatgtta 3000 aacagccaag aggttacgaa gacgggtcaa ataaagtatg taagttagaa aaggcgctgt 3060 atggtttgcg agaaagtccg agagcttggt atgaatgtct tgatgaattt ttaaagaaac 3120 taggatttaa gcgtagtaaa tatgactatt gtttatatgt gttggctaat gacgataata 3180 taatatacct aataatattc gttgatgatc tattaatttg ctgtaaaaat aaaaagaaaa 3240 taattgaaat taagaggtcg ttgtcaaata gatttagaat gaaagatatg ggggaagtga 3300 agcactatct aggaattgac attgagtata attacgaaag aaatgttatg gctctaaatc 3360 agaaatctta tattgaatca ttagccacaa aatattatat aaaatcaaca ttttagaatg 3420 ttgattaaat gtttcttgct tactgggaca ccaatggaaa ctaacttaaa attggaacaa 3480 gcgagtgtcg tatctaatga tatacaattt agaaatataa ttggcgcact tttatacatc 3540 agttctggga ctaggcccga tataagtttt agtgtcaact acttgagtag atttcaaaat 3600 tgttacaatg caactcatta taagtatgcg ttaagaatgt taaaatatct ttacttaaca 3660 aaagatttaa agttaattta taaaagaaat gaaaatgtag agattataga ttgttatgta 3720 gacgcagatt gggcaggtga ttgtgaagac agaaagtcga caacaggcta tattgtaagg 3780 ttttttggaa atgtagtata ttggaaatct agaaaacagg gtagtgtaac aaaatcttct 3840 acttctgctg aatatgtagc tttatcagaa gctgtaagcg agatacaatt tgtaaaagat 3900 attttagagg attttgatat ttatattaag aagccgatta aaatatttga agataattca 3960 ggagcattaa gtattgcaaa atatggcaac tttacaaaga attctaaatt catagaagtt 4020 cactatcatt ttgtaaacga aaagtatatg aaaggtatta ttgacattgt aaaagtcgag 4080 actgaatata atactgctga tttacttaca aaaccattaa gcaaagaaaa atttgaaaaa 4140 tttagaaaca tgttaaacat taaggagttt attatataag cattttatca taagtactat 4200 aaaagaaaaa ctaatataaa tgtaaggagg cg 4232 // ID Gypsy-8_RP-LTR repbase; DNA; INV; 715 BP. XX AC ACPB02034718; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_RP_; KW Gypsy-8_RP-I; Gypsy-8_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-715 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02034718; Positions 13760 14474. XX SQ Sequence 715 BP; 187 A; 119 C; 180 G; 229 T; 0 other; tgtagtgaaa tacactacag tagcagcgag tggtgggggc ctctgtggag ggggaaggcc 60 ccaacaagca gtggagctag ctagagagta ggagtgagtg agtcagcgct gtgtgagaga 120 caggtgcgag tgtagaccgt ctcgcttagc taaattgatt tctaaagggc cctttactgg 180 actatcgcca tccctattca ggtgaaagga agtaccgcag gtgagctagc tttgtgtggc 240 ggtgttggtg ttcccttccc ctcccctaca cggggcatcg acggcctact tggtggaaga 300 cggggcggct cagctacaac taacaataca agagtgtgac cggctaaaac tcatatttat 360 gttaatagta attagtataa agaagattgt atataattgt attgagtgta agtgtgtgta 420 tatgtaccgt gtgtatgtgg gagtacgcat gtgagtgagt gtgctcgtgc gtatgtgtgt 480 atatagtata cttagcgtag aggaagttgt ttgatgttta ttttaaacag ttacttattt 540 tggtgatatt ttcctttgag aatcataaga caaagatagt tactttctgt tattatttgt 600 attatttcta ttacttgtat tatttctgtt atttgtaacg ataggaaata tatcttttat 660 tttgtgaacc taactctgtg attaaccacc ccctttccca taacctaaca taaca 715 // ID hATm-48_HM repbase; DNA; INV; 3898 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-48_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3898 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1942-1942 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1563..3524 FT /product="hATm-48_HM_1p" FT /translation="MIGCDATHAQREKNKSDRNERLENRVNAHSIAKRRKT FT ALSDQEVEEIIKKSEDESSSSEVEVFTPFQQKRKTRLSNVCIQIDPTTLTK FT SFTDVSDRLGLSVRDQMAVEAKVVKVCGASLHHFPISRSSVHRNRKVEREK FT QGAQIKEKYQEKRPLHPSGHWDSKLVSSLTGKWVINLLLAYCFKCLEMQIL FT KNTKIFKNKHFLYVGKKEEHVAILASGWDGSSNDANEKVLEIPVIINSTGH FT AQKDAFIKRFSSWKLLDNLIALVFDTTSSNTGWKAGCAALIEKEVGHALLW FT TACRHHVYELHIRHVWDAVNGSHNGPIEPILKMLQDIWDSISPEPTDITKF FT EWPEENHPIHRQATLVLNWASKCLEKNIFPRADYRELIELTVHYLGGDLPK FT GKMFNFKKPGAHHRARFMHNELYILKLVMLDQQIASLNPDQRHKLKRMANF FT IALFQAKAFLQSRISVIAPKVDRQYIADMQWYRDTDKIASDAAIQSCNRHL FT WYLTEELVIFAMFDEDISPIERSATALKLXDTQRSEEMKPGKPSFPKVDGD FT NIPSFVSLIGPHSWLLFNLIKMTANQVDWMQLPVEHWTKMKDYNYVKNVVK FT YCQVVNDCAERGVKLIQDFKNKVKDPAQLQLLLQIVEDHRKRASLQGRKEN FT LMNI*" XX SQ Sequence 3898 BP; 1308 A; 633 C; 720 G; 1235 T; 2 other; gggtgtccca ttcccgcaaa attttcgaat taccacacca acttttaaat tgaggtttag 60 actaagaaaa aaataatatt tactaagtat ttttaaaaaa tataaaaatt taagggtccc 120 ctcatttagt aaaagttgag ctaatccttt tttaccgtgt tttgtgagta tttttgttca 180 aaatgttaac aaatatattc cgcaacatct tcaacttttc acacagtata atttactatc 240 ttatttatac tgtataaaaa atttgagtta aaaaaaattc atttaagggt ccccccgaat 300 tgaataatca agtggttttg taatttggca tcgtcagaaa gtagttatta accgtaaagg 360 cttaaatttg gttcattata aaaatgtatt cataaaacat tatatataga aacgcatgtg 420 tcgaagtaaa aaagtttgat tgtcttattt taaccaatta attattctaa aacgatggtt 480 gcgtctaaat tggacgctca gtataaggaa aatttatttt cctgttgttt ctttagcagc 540 gtggtcaata tataattttg taagtttaag ttgtttgttt taagtttctt tgttttagca 600 gtaagtttaa tattaacatt ttaatattaa cattattact attatttagt tttaatatta 660 gtattattaa ataataagat ttagacataa tcaataatat aatttgttta aatagtggtt 720 ttaataaatt agtgttttaa atgttttatt tactttaaga aattagactt agagttttat 780 atagttaaaa cacaagcaat taaggttttt gtaattgtta ggcttaaaaa agttgttttt 840 ttttttactt tcagtaaagt aattcgcgcc agattttgtt aaacggcatc tcaattagta 900 actcatgctt gtgctgagtg gtaaccacaa aaatggatct acgttctcgt aaactaccga 960 aacctagtat aaaagagaaa aatcgttcag tggaaaaagc aagatccacg cgaaatcatc 1020 atcaagatta ctttattgga tttggactag agaaaagttt aagtgtcagt gtaaaacttc 1080 cccttaaaat tgctttgctc agaagatacc tctactttag agaaatcaac aagggaagtg 1140 gtgaaataac agtatcggca aaaaatattt ttaaagatat tttgattgaa ttacaagaaa 1200 tttggaaaag agctcactta ccagttaagc cagataacaa gtgcgtcgag tatttcattg 1260 cagtttttaa tgaatggcaa agtgtgttga agaagggaaa gaagtgcccc gagaagaaac 1320 aaacccagtt tttgcaatcg atgaatcaac tctgtgacat atcagctgtt gacacatatt 1380 acatattacg tgtatctcga cttgcaacct ggaaggagga ctgggagttt ttactgaacc 1440 aacggaagta tccacaggta atttcatgca ttacttcaac catgttgctg tactgtgttt 1500 acttaaggta atatttttca ttttacgtta ttgtattgtt tacctccaat taggttggtt 1560 cgatgattgg ttgcgacgca actcatgcac agcgagaaaa gaacaagtcg gatcgcaatg 1620 aacgccttga aaacagagtc aatgctcaca gcatagctaa aagaagaaaa acagctctaa 1680 gtgatcaaga agtagaagaa atcatcaaga agagtgagga tgagtcaagc agctcagaag 1740 tcgaagtttt cactcccttt cagcaaaaaa gaaaaaccag attgtcaaac gtttgtattc 1800 aaatcgatcc tacaacgtta acaaaatcat tcactgatgt ctccgaccga ttaggcctca 1860 gcgtaagaga tcaaatggct gttgaagcaa aagttgttaa agtttgcggt gcttcgttac 1920 atcattttcc aatctcgcga tcgtcagttc accgtaaccg taaagtagaa cgagaaaaac 1980 aaggtgccca aataaaggag aaatatcagg agaagcgccc tcttcatcca tccggtcact 2040 gggattcaaa acttgtttcg tcacttactg gtaaatgggt gattaattta ttacttgctt 2100 actgttttaa gtgtttggaa atgcaaatat tgaaaaatac taaaatattt aaaaacaaac 2160 attttttgta tgtaggaaag aaagaggagc atgttgctat tctggctagt ggttgggatg 2220 gatcgtcaaa cgatgcaaac gaaaaagttc tagaaatccc tgtaattatc aactctactg 2280 gtcacgccca gaaagatgcc tttattaagc gtttttcttc ctggaaactg ttagataatt 2340 tgattgcctt agtttttgac accacttcta gtaacactgg atggaaagct ggatgtgcag 2400 cactcattga gaaagaagta ggtcatgcgc tgttgtggac agcttgtcgc catcacgttt 2460 acgaactgca tattcgtcac gtctgggatg cagttaatgg aagtcataac ggacctatcg 2520 agccaatttt gaaaatgctt caagacattt gggattctat ttctcccgag cctacagata 2580 taacaaaatt tgagtggccg gaagagaacc atcctattca tcgccargca acactcgttc 2640 tgaactgggc tagtaagtgc cttgaaaaga atatctttcc aagggccgat tatcgagaat 2700 tgattgaact gacagttcat tatcttggag gtgacttgcc aaaaggtaag atgtttaact 2760 tcaaaaaacc gggtgcgcat catcgtgcaa gatttatgca caatgagctg tacatcttga 2820 aacttgtcat gttggatcaa cagattgcat cgctcaaccc ggatcaacgg cataaattga 2880 aaagaatggc taattttatc gctctctttc aagcaaaggc ttttttacaa agtcgcatct 2940 cagttattgc accaaaagtt gatcgtcagt acatcgccga tatgcagtgg taccgtgata 3000 cagacaaaat tgcttctgat gcagccattc agtcttgcaa cagacacctt tggtatttaa 3060 ctgaagagtt agttatcttc gcaatgtttg atgaggacat ttcgcctatt gaaagatctg 3120 ctacggcatt gaagttgtwc gacacccaaa gaagtgaaga aatgaagcct ggaaaaccgt 3180 ctttcccgaa agttgatggc gacaacatcc cttcctttgt ctctttaatt ggtccacatt 3240 cctggctgct atttaactta atcaagatga ctgcaaacca ggtagattgg atgcaattgc 3300 ctgttgagca ctggacgaag atgaaagatt acaactatgt gaaaaatgtt gttaagtact 3360 gtcaagttgt gaatgactgc gccgagagag gtgtaaaact tatacaagac tttaaaaata 3420 aagtcaaaga ccccgcgcag cttcaactac tacttcaaat agtggaggat catcgcaaac 3480 gagcttctct tcaaggacga aaagaaaatt taatgaatat ttgattattt ttttaaacag 3540 tgttaattgt atcaatctta cattaacaga agttaaaacg cttaaaacca ctttactatt 3600 taaatcagag gggggaccct taaattaatt tttttttaac tcaaattttt tataaagtat 3660 aaataagata gtaaattata ctgtgtgaaa agttgaagtt gttacgaaat atatttgtta 3720 acattttgaa caaaaatact aaaaaaaaca cggtaaaaaa gggtttgctc aactttttct 3780 aagtggggga cccttaaatt ttttaaaaat acttggtaaa tattattttt ttcttagtct 3840 aaacctcaat ttaaaagttg gtgtggtaat tcgaaaattt tgcgggaatg ggacaccc 3898 // ID Gypsy-1-LTR_MH repbase; DNA; INV; 226 BP. XX AC ABLG01001295.1; XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon from Meloidogyne DE hapla. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1-LTR_MH. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-226 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Meloidogyne hapla."; RL Repbase Reports 9(2), 462-462 (2009). XX DR [1] (Consensus) XX SQ Sequence 226 BP; 61 A; 36 C; 25 G; 104 T; 0 other; tgtgactcta ttgaccctaa tttaaataaa tgacttttta tcccaattta aatttgtttc 60 tccccctata aaataaaatc cttttcccct aaagttgttc attcgttcac tttgctttaa 120 ttttaagtgt gcgtatttta attgtggact ttaatttaaa taaataaatt cttttcttat 180 tattgtgtgt tcgactttaa ttattggagg ttctcgttat ttacca 226 // ID Copia-22_NVi-LTR repbase; DNA; INV; 339 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Copia-22_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-339 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(7), 1513-1513 (2009). XX DR [1] (Consensus) XX SQ Sequence 339 BP; 108 A; 63 C; 86 G; 82 T; 0 other; tgttgagata tgagtaattt atatgctact atccgtacac actatataga gtatgtgagc 60 aggcatgcgt gcgcgactat agcaagcagg atagactagc gcgcggtagc ggtaactata 120 ggtatagagt gagagcgtga gtatagagca gagaagcagc tcgaccgccg agagcgtaag 180 atggctatgt agtgagcccg agcacatgtt gtgcgtatcc gacgatataa gtgtaatctc 240 caacaatgag tattaaataa aggtacaaca ctaacccgaa gggtccactg gaataatggg 300 tgatattatt tattgcacca aacccctcat ttattaaca 339 // ID Gypsy_DG_LTR repbase; DNA; INV; 2446 BP. XX AC . XX DT 10-MAY-2009 (Rel. 14.05, Created) DT 10-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE The LTR sequence flanking Gypsy_DG_I in Drosophila grimshawi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy_DG_LTR. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-2446 RA Styles P.; RT "Gypsy_DG_LTR: The LTR sequence flanking Gypsy_DG_I in Drosophila RT grimshawi."; RL Repbase Reports 9(5), 963-963 (2009). XX DR [1] (Consensus) XX CC Gypsy_DG_LTR is the LTR sequence flanking Gypsy_DG_I in CC Drosophila grimshawi. The LTR is longer than most Gypsy LTRs, CC with a consensus sequence 2446bp in length. There are 40 copies CC of the LTR in the genome, 12 flanking Gypsy_DG_I and 28 solo CC LTRs. XX SQ Sequence 2446 BP; 819 A; 450 C; 423 G; 754 T; 0 other; tgtaatgatt cgcttgtcga gctcataaat ttagctttct cctaaaaagg ggcaataatt 60 tgtgcgtttc acacatagcg ccatctatga gccattgtgg tgagcttcca aaaaaaaaaa 120 aaaaataaaa gaaacaataa gaaaggaaat tccaaaattg ttgaaataaa gggtgaagca 180 aaattctgcc aaaacgtgtc ttcataagtt cttttgaccg cggacacatt cctttcattt 240 cctagccttt gagcgagcaa gttgtggacg ttgcaaacat tggaaaaaaa aaaaaaaaaa 300 tgttaataaa gttaataatt tgcggtacag tgtaatagaa aaattagtgc cgcacaaaac 360 ggtttttata tcagtgccta ttcagaaatt tgtcaagggt aaatgaatta acaaccattc 420 aaaaaatgac accaatttaa ataaatgtgt tgttttagaa tgggcactta atacggtgct 480 taatatggat tggtagttcg aagaagttaa taaatataac gcctttgcca cgcccatgtt 540 cacggtggag atgtctgtgc taagttgatt gcccaaattg ctgtatttaa ggtgtttaag 600 agagtctctc ttaaaactcg cccttggcgc acgccttaac cttcgatcgt acccaccaac 660 tcatgtatgt cgtgctaaga agaatatgta aaagggcatc gagcgcatct cattagcatt 720 cctagaccga acttaaacgg atgaacatcg tcaactgttg gcggaccttc tgctacattc 780 caagcctatc agctttaccc agctacctca tggagtcaac tttaataatt gcaaaagtag 840 caaggagtta gagcaatttc cttccttttt ttttatcttt atatacaaaa gtgaataatg 900 aaaaaaaaat taacctcagt gaaaaattca gtattcatat gtctagcaaa tagaaaaagc 960 tcgttcacgt gtgcactcgg cgaagtaaat gagtaaaagt taaattaaat gtaatttgaa 1020 aagaaatttt tttttgttta attaatcata aaaattatag tgataatgca agtaaaagaa 1080 aaaaaaatgg aaattgtgac gattctgtgt aattaagaaa aaaaatttat attttatata 1140 actaataagt accctctatt tagaattgca tgcggaatga aaaacacaca aaaatttgtc 1200 tcagcgtacg gcttatgctg acagcgagct ttcgcacgtg gcttgtgata atcgctcact 1260 cccatgatca ttatacgctg cgagctttcg cacgaggctt gaaatatttg cactctctct 1320 tcataactaa ttacagcgac acacacacac accaacacca ggaagtactt ccagcgtggg 1380 atgatgtatg taagttactc tctttcttta ttatgagtcg cgcttagata acctttcgca 1440 agctactctt taaacttgag caactgtact tttacaacag cagctgcgct aagcaactat 1500 atatctacct ttaaattatt tttgtttgtt tttcataacg caaggcgacc atgctctgaa 1560 tgcataatat tcatataaac taatatagcc cctaaaccta tatatatata taccccacac 1620 cgcgattaat attggattcg atcgtattac cattatacag aacacaagaa acaaagacaa 1680 gtgttagttt ttttttgatt ttgttcttac ttgcatatcg aatagagtta aaaaaaaaaa 1740 aaattgtttc atgaaatcat agattagaat atgtgcatgc ccgggttgaa ttgatgatgc 1800 atgtatctac tgcaggacca cttgcagaca gtaaccgcgc tgatcaactg gcattttttt 1860 ttttgttaaa taaatttttt gaaaatctct ctgcaaacat gcatataaac tataattatc 1920 acttccttgt atatataact ataatgttct aatggtatta atcaaactta catttctaaa 1980 tataaaatat ctatatcttt ctaagtttac tatttaaatg tattgaattc gttgttaagt 2040 ggataacatg cagttcctga tggaggtagg atagctgccg taggtcatcc caagggatcc 2100 tgatgtatat atcgggccgc aggatcaagg cagggtgggc gagtgcgcca aaaaggtagc 2160 accacaataa aaaaaaaagg atgacctgat tgcttgagtg ttccgatcct cctattactt 2220 ttgacctttt tcttctttct catgtccatg gtctgtcttc gtctgtcgtt ctttatgtaa 2280 aaatatagta cgcgataaat aaataaatat aactaataac taggaagcca ttttaatatg 2340 tcggttgacc aataataacc caccccaatt gccggcgagc atcggcccga gcaagcgcgt 2400 actccgaacg tattaatagc ccatctgctc cacgttgttc attaca 2446 // ID Gypsy-240_AA-LTR repbase; DNA; INV; 112 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-240_AA_; KW Gypsy-240_AA-I; Gypsy-240_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-112 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1082-1082 (2011). XX DR [1] (Consensus) XX SQ Sequence 112 BP; 38 A; 21 C; 20 G; 33 T; 0 other; tgtagtaggc taagaatcat aaattataat catagcaagc tggtgggaaa taaacactct 60 ctttcgatac cgacattcaa gtagtcaaca cgtgttttac tgagttccta ca 112 // ID Gypsy-606_AA-I repbase; DNA; INV; 4033 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-606_AA_; KW Gypsy-606_AA-LTR; Ty3_gypsy_Ele7; Gypsy-606_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4033 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3089-3601] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1487..4003 FT /product="Gypsy-606_AA-I_1p" FT /translation="MAARTEFENLMKLGICRPSSSSWASPLHMVKKADGTW FT RPCGDYRALNAQTVPDRYPLPYLQDFTSILHGKSIFSKIDLQKAFHQVPIH FT PEDIPKTAITTPFGLFEFTQMTFGLRNAAQTFQRLIHEVVRELDFVYPYID FT DIFIASSSPEEHRDHLRQLFKRLEEHNLAINVAKCEFGKTELTFVGHSVSP FT DGISPLPDRVEAVRNFRRPTTVKELKSFLAVINFYRRFIPNAVVAQIPLLD FT MTSGNKRNDRSLLEWTDSTVSAFEQCKLQLANAALLAHPARNAELSLWVDA FT SNTAAGAVLHQVINGHVQPLGFFSKRFDKAQLRYSTYDRELTAMYLAVRHF FT KYMLEGRNCHIYTDHKPIVFAFQQNLEKASDRQARQLDYIGQITTDIRHVA FT GKENFTADLLSRILAVHAIPTVDFHALATDQETDDELTTILEGKLKTSLNL FT KKFTMPGTAKQLYCDSSGDRIRPFITKRFRDQFLQATHNLSHPGTRATAKL FT MTQRFVWPSIRKDSIAYAKQCEQCQRSKVTRHTHSPLERYSVPDERFCHIN FT IDIVGPFPPSNGNRYCLTIVDRFSRWPEAIPVPDMTAPTIAQALVTGWISR FT FGVPKRITSDQGRQFESTLFAELLRTFGITHLRTTPYHPQSNGIIERWHRT FT LKAALLCHNADRWTEHLPMILLGLRTIHKEDIRASPAEMVYGTTLRIPCEF FT FGENSRNITTSEFANTLRDAMRKIHPPNTAWHNRGKVFVHPDLRSCKNVFV FT RNDSIRPSLSPPYDGPYPVLCRSDKHFKLNINGRSVNISIDRLKPAYTIEE FT QQQPQPPSNRSTSTTSLDQPTTVTRSGRRVRIPHRFR" XX SQ Sequence 4033 BP; 1142 A; 1028 C; 865 G; 996 T; 2 other; tagttggtga ccccgtgtgc gcgaaatgga agacgataaa acattacctg tgcaacccgg 60 aaccagcggt atgctgcagc ttgaaaccgc caatgtggac acaattaaca tgccacgact 120 aaatccgccg gtgatgtcag cttccaacat cgactcttac tttctgtctt tggagttctg 180 gtttgccgcc tcaggactgg gcaatcaaca cgatactaag aagtacaata tcgtcatggc 240 tcaggtgcca ccggaaaagt tgacggagct acgatccatt atagatgcca caccggcttc 300 tgagaagtat ccgtacgtga agaagaagct tattgagcat tttgcagaca gccagcaaaa 360 acgtctacag cgagtgcttt ccgatatgcc tctcggtgat atgaaaccca gccagttata 420 cagtgaaatg atgcgtgtgg ccggtaactc gttgggtgaa cccgtgcttc tcgacctgtg 480 gacttccaga cttccaccgc atgctcaagc agctgtaatt gcgtctaaag gtgatgcaag 540 cgaaaaaaca aacatcgccg acgcaatcat cgacgcaatg ggattccgta acattcacgc 600 catcgttaca ccttcgtctt cagcgccaat agctggaacg gaaaagccag ccgtgagcag 660 tattgaagat ctgcaacggg aaatcgccca actaactaaa aggttcgagc aagtgtttca 720 atccagaaga agcggaagag atcgatctcg ttctcgtagc agaagcagac cagacctacg 780 gtacggttct gaatcttcgt acgatatgtg ctggtatcat cgtacttttg gagcaaatgc 840 acgccaatgt cgaaaaccgt gctcattcga tcagccatca aaatccaatc aacaatgacg 900 ccaaatggct gattctgcgt tggaaaacgg cgaattatct tcctcagcaa ccatttttcg 960 cctgaaaatt gcagacacga ttacaaacag gagcttcctc atcgacacgg gagctgatgt 1020 gtctgtaata ccaaaagatt ctaatttagt tcgtgtcaaa ccaactacaa taagattgtt 1080 cgccgcaaat ggaaccccaa tcaaagtwta tggagaagca ctactaaaag tcagccttgg 1140 gcttcgtcga gaatttctat ggagcttcct catagctgat gtgacaacag gcataatcgg 1200 cgcagatttc gttagccatt atgatttgtt gatagatctg aagcgacatc gcctcatcga 1260 caatactaca aaactcgaaa caatggcagt gctgacacca acgagacatt tctccatcaa 1320 aacgtttagc aaawcatcac catttgcgga actgctgaac gattttccat ctatcacaca 1380 gctagcaccg cctggcacag taagtaaatc ctctatagtt catcgtatcg aaactactgg 1440 ccagcctact tttgctcgac caagacgact tgctcctgat aaacttatgg ctgcacgtac 1500 tgagttcgaa aacttgatga agttgggaat ctgtcggcca tcgtcaagta gttgggccag 1560 tcctctgcac atggtgaaga aggcggacgg tacttggcgg ccctgtggag attaccgtgc 1620 actcaacgct caaactgttc ctgatagata tcctcttcct tacctgcagg atttcaccag 1680 tattctgcat ggaaaaagta tcttctcaaa aattgatctt caaaaggcat tccatcaggt 1740 gcccatacac cctgaagaca tcccgaagac tgcaatcacc actccgtttg gattattcga 1800 gttcacccag atgacgttcg gcctgcgaaa cgctgctcaa accttccagc gactaatcca 1860 cgaggtggta cgcgaactgg atttcgtgta cccatacatc gatgacatct ttattgcttc 1920 atcgtcgcca gaagagcatc gagatcatct tcgacaattg ttcaagcgac tggaagagca 1980 caacctagcg atcaacgtag caaagtgtga gtttggaaag actgaattaa ctttcgtcgg 2040 tcactcagtt tctcctgacg gaataagccc gctgcccgac cgtgtcgaag cagtacgcaa 2100 ttttcgacgt ccaactacag tgaaggagct caaaagtttt ctcgctgtta tcaatttcta 2160 cagaaggttc atcccgaatg ccgtagtggc acaaatacca ttactcgaca tgacctctgg 2220 aaataaacgc aacgatcgat ctctactaga gtggacggat tcgactgtgt cggcatttga 2280 acaatgcaaa ctacaactag caaatgctgc tctactcgct caccctgcta gaaatgccga 2340 gttgtctttg tgggtggacg catccaatac agctgccgga gctgtactac atcaagtcat 2400 taacggtcat gttcaaccac tcggtttctt ctccaaaagg ttcgacaagg cacaactgag 2460 atacagtacg tatgaccgag agttaacggc aatgtatcta gcagtacggc acttcaaata 2520 tatgttagag ggacgaaatt gtcacatata caccgatcat aagccgatcg tgttcgcttt 2580 ccaacaaaat cttgaaaaag cttcggatcg tcaagcacga cagttggact acatcggtca 2640 aatcactacc gatatccgtc atgtagcagg taaagagaat ttcaccgcag atttgctctc 2700 tcgcattctt gctgtccatg ccatcccgac ggttgacttt catgcactcg ctaccgatca 2760 agagaccgac gacgagttga caacaatact ggaaggtaaa ttgaaaacat cgttgaatct 2820 caaaaaattc actatgccgg ggactgcaaa gcaattgtac tgtgatagtt ctggagatcg 2880 catcagacct tttataacaa aaaggttccg agaccaattt cttcaagcaa cccacaatct 2940 ctcacatcca ggtactcgcg cgacggccaa gctgatgact caaagattcg tttggccaag 3000 tatacggaaa gatagcattg cctacgcaaa gcaatgtgaa caatgtcagc gttccaaagt 3060 gactcgtcac acacattcac cacttgaacg atattcagtg cctgacgaaa ggttttgtca 3120 tatcaatatt gacattgttg gaccctttcc gccgagtaat gggaatcgtt attgcctcac 3180 catcgttgat aggttttccc gttggccaga ggcgatacct gttcctgata tgactgcacc 3240 taccatcgca caagctttag tgactgggtg gatttctcgc tttggcgtac ccaaacgaat 3300 cacatctgat caaggacgtc agttcgaatc gactctcttt gctgaattgc tacgtacgtt 3360 tgggatcact cacctccgta caacaccata tcatccccaa tctaatggta tcatcgagag 3420 atggcaccga acgctcaaag ccgctcttct ctgccacaac gctgatcgct ggaccgagca 3480 cctcccgatg atcttgcttg gtttgagaac catccacaaa gaagatatca gagcttcgcc 3540 ggctgaaatg gtttatggaa cgacgcttcg gattccttgt gagttctttg gtgaaaactc 3600 tcgcaatatc acgacatccg agttcgccaa cactctgcga gatgctatgc gaaagataca 3660 tccacccaat acggcatggc ataatagagg taaagtgttt gttcatcccg acctcaggtc 3720 gtgcaagaat gttttcgtcc gcaacgattc gattagacca tcactatcac caccatatga 3780 tggcccttat ccagtgctct gccgatccga taagcacttc aagctaaaca tcaatggtcg 3840 gtctgtgaat atttcgatag atcgactgaa acctgcctac acgatcgagg agcaacagca 3900 accacaacct ccctctaatc gttcgacatc aacaacttcg ctcgatcaac cgacaacagt 3960 aactcgttct gggagacgag tcagaattcc gcaccgattt cgttgactgt cgtctgttct 4020 agaaggaggg taa 4033 // ID DNA-TA-13_CQ repbase; DNA; INV; 3027 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA-TA-13_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3027 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 63-63 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. TA TSDs. 29-bp TIRs. XX SQ Sequence 3027 BP; 1049 A; 450 C; 461 G; 1067 T; 0 other; gggtatatga ccctattttg gacctagtac ctattttgga cctattttta ttaatcgagg 60 tagttcaggc ttttaaagta cgaattcact gaatccaacc aacagaaagt gtaggcagga 120 tcgttttgta tcttacctct tccaaaaatg ttaacatttt tgattatttc cgctttttca 180 gccacttttt ccaacgccat tcgaacttcc tttcattttc ctagcacatc gattaggtcc 240 aaaaattccg gcctcgttgc ttatcagatt tggctcgcga caccttgatg attttttctg 300 ttttgcactg acaccaagca atttcgtccg gtttccgagc aatgtaactg ttgtttattt 360 aattctttct tgttttccgt cactaaacca ttgaaataac tattctatta tcgaaaaaaa 420 ctcatttcaa aatatttttt caaatcagcc gcgttggatc agtttttgga gctactttgc 480 cgtcggttca aggctatcac caacacatgc ataacaaggt gttgccagtt ttgtttatca 540 atttatttcg atatttcatt gaaattgatt gaaatttttt aatttaatat tttgaattga 600 atttctttac tgtaattatt tgaatgaaat ctaagcttta aaaagcaaat gattaacaga 660 aacaatgaaa atatgttttt taaacgataa aaatgcaaat tgatgctaga gatttagatg 720 attgttagga ccgattgcaa agtatcccaa aatttaaact gatgttgatt tatttttgtc 780 ttaacatcat tatcaccaat ttagccgcat atatttttaa tttttgcttc aaaaaatgca 840 aactggctac cctgttggaa ttgaagggga aacgattgaa aaacctaaag ggtctagttc 900 attaaatcgt cagctgtcac cgtgacagga gctgtcaaca tcgcttacga gtggtttttg 960 aaaaagtcgt atgaattaaa tttaaaaaaa tcttgagtta gttttaaaaa gatcctaagc 1020 gctagttgta aacaaatcaa tttttcgatt agttctcgat caatttacga attataaata 1080 aaaaatgctt tgaattatca tctgcacgtc ttttaaatgc tctttactgg aaaacaaaca 1140 aaatttggtt tggttcagaa aaaaaatcaa atatgtgtcg gagatcgtga tggcttttaa 1200 gacgtattgc aaactaatca gagaaatata catatggaaa atatattttt tgtaagtttt 1260 atctgatttt tgcatattta tctatatata atttttaaaa atacaggtat aacgattttt 1320 ttcacaattg gcagcgttgg tttataataa attacacttt taaattttac cgtgtcaatc 1380 atgttagaaa cacaattaaa ccagctagaa aaataaaaat cttcgaaaaa aaataagcat 1440 aagaggttaa tgcgatgtta acattatgaa ttttatgatt tgagacatat agccggggtg 1500 actttgatag gtttgatatt tttccgcaaa atgaagagta aaaattgaat acgtacggaa 1560 tggtatggaa tcatactgac cgtggtagag aagtgctcaa agtacctcat gaagaacttt 1620 tcatgaaatt ttgaaaagtt taaaaagtta gttaactata actaagaaaa tgttgataaa 1680 tagcattatt ttcatcttct caacgtgtca tgattttttc aatgaacatg attttgaatc 1740 gtaaaacgga atgcattttc ggattcttcg gacaatcttc tactaggaaa aggtaaaata 1800 agtaatataa atattattaa atattatttg tttttgaaac ataatttaaa aaaatctcat 1860 aatttaaagg catttccagt aaaacaaatt tcatataaaa tgtgaaaact tttgattcat 1920 tctttaaatt cagttataca tgtaatataa atccataatt taataaacaa aactagtttc 1980 aacgaatttc aggcaaagtt ctgaattttt aacaatatta cctaaaattt atatgtatta 2040 tgttaaaaag cttataaaca tagttaacta agtattgaca taaatgattt tcttaaaaat 2100 tatatcagca acctaagtga tggtaagttc gaagtaaaaa taatgtttga aaacctcaaa 2160 tctgctttta acaagacaaa ctatgactat caaaggcacc ctggtattca aacaaagatt 2220 tttttcaaat attacattgt ttttaaagtg caacatgtag ttaaactttt tgtcaagcaa 2280 ctgtaaagtt taacatgaca gatgagaaag tttcacacca agttacaagc tataatctcc 2340 ctttaaaatt tcttgattgt ttaataatat ttgaacataa gctctgcgta aaatttagaa 2400 ccttttttca ttgtaacttg ttatgtatct gtgtcattca tttaatttta agataaggtg 2460 tttttattgc gagtaataac gggttgtctt ttatgttcta atgcttgaac aattaggtcg 2520 taagaatatg caaattgtaa ataggacaga gacctgctaa agatgcgtga gtttccattc 2580 aatatgatta aaacacgttt tcaaattcaa atccattgtt tcatcataat tgtttaattt 2640 ctgaaataaa tattttccaa attataatcg tggataggcc ctttggttta tcaaaccgag 2700 tttttgtaag tgtgcgtgtt gccgccgcac gccgcaattc gagttgtcca aatttcaacc 2760 aatcagagtg tgggtccaaa ataggttcag cgatgtctcc aaaatacatt caataagtcc 2820 aaaaagcatc caaaaaatgt tttacttgaa atgcttatta caatggaatg gttaaattat 2880 tttaaatttt caatagtaca ctttgttaag cataaattaa ggcacaaaac aatcctgaaa 2940 cgagccaaat tagttagacc gttcctgaga actatgcgaa atatgttgac gctttgcata 3000 gtaggtccaa aatagggcca tataccc 3027 // ID Gypsy-176_AA-I repbase; DNA; INV; 6005 BP. XX AC supercont1.156; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-176_AA_; KW Gypsy-176_AA-LTR; Gypsy-176_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6005 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.156; Positions 245943 239939. XX CC Positions [4432-4908] - Integrase core CC 'GGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 293..2128 FT /product="Gypsy-176_AA-I_2p" FT /translation="MDLEFSHLYKNLLIHHLERSEIEHELTIRAVQFEPTE FT TRAAIQRRLRDRLKEEREKSLADLDFIRCFKSVEAEIKEIDENLSQIRNFL FT ENKTRFEGIRESLKTRLVHYFERTRRVQQIADKEEDLRDLDELICTIRETM FT NTHFPSIGPNEVIQEQVIQQIVQTLSDLNLIPSNTNVGSVNQLREEANANK FT NDNLLDEAIGGSQQREPANPSGRRSLKSLRKTTNVQSGLNSLNRGSKSLLI FT SPVHSGGSSQVRSERLISKFSLEPISSRNKVPQRETSDSVSSVESDSSNNS FT VVPSARSRRHSRYSQKRRPVSEWNLKYDGKDNGQNLMRFIKEVQFYAKSEK FT VSDKELFRSAIYLFKDQAKVWFMSGIENEDFSTWKELLSELKREFLSPDHD FT HVNETRAISRKQGPKERFSDFLAEMQRIFNSLTRPMTEKKKFEIVFRNLRA FT DYKAHAIASNIDNLADLKKFGRQLDATFWFKFNANTQETNGSRNRAQVNEV FT SFGQKPRQNPPAEDSNKKFKSRQFYRSKPERSDEEPTKKKTEETAKPPTSK FT PQTSDKGLQVLVEKYKVPAPGTCFNCRLPGHHAADCEGPKHKFCRKCGFLN FT YDTNNCPFCAKNAQ" FT CDS 2494..5292 FT /product="Gypsy-176_AA-I_1p" FT /translation="MTFNGETKLIETLVVPELKRKLILGTDFWKAFQIVPS FT VSSNTVEELEVQPPTPDLTLEQLSELESVKAQFKVAEEGKLDTTPLICHRI FT ELSEEARKKSAVRINPFPTSPKRQEQINKELDKMLEAGIIERSYSDWALRL FT VPVDKPDDSVRLCLDARLLNERTVRDSYPLPHADRILSRLGPCKYISTIDL FT SKAFLQVPLHPKSKKYTAFSVLGRGLFHFTRMPFGLVNSPATLSRLMDRVL FT GGGELEPKVFIYLDDIIIISDTFEEHMKLLREVARRLGQANLSINIDKSRF FT CVLEVPYLGYILSTQGLRPNPNRVEAIVNFERPNSLKALRRFLGMCNYYRR FT FISGYSDLVRPLTDLLKDKPKSVRWSDPAEKAFIKVKELLISAPILTNPDF FT SKSFCIHCDASDYAIAGVLTQSHDGVDKPISYFSQKLTGAQQRYFATEKEA FT LAVLKSIEKFRCYIEGSKFTVYTDASALTYILRSTWRTSSRLCRWSIELQR FT HDMIIKHRRGVDNVVPDALSRSVEELSASPTSQECYSNMMQKVQTEPERFK FT DFRIDNGVLKKFVATSNDLLDYRFEWKLCLPSDMREKVLVEEHDDALHLGV FT DKTIAKVKRKYYWPQLTKDVRSHIQKCTICKQSKPSNRAQHPEMGRQRITT FT KPFQLIALDFIQSLPRSKTGNAHLLVIMDIFSKFCLLFPLRKISAPQVCQI FT LEQNWFRRYSTPEYLISDNATTFLSKDFKALLDKYGIKHWRNSRHHSQSNP FT VERLNRTINACIRTYVRTDQKMWDSRVSEIEFALNSTPHGSTGFSPYRILF FT GHEIIGKGDEHRMDRDVDEVSDEERVGRKLEIDRLIHNIVYKNLRKNYEKN FT ANIYNLRNKASTQSYAVGQKVLKRNFQQSSAIDFYNAKLGPMYLPCTVMAR FT IGTSSYELANEQGKSLGIFSAADIKPDDS" XX SQ Sequence 6005 BP; 1933 A; 1333 C; 1225 G; 1514 T; 0 other; ttttggcgcc cagcaaattc tacattacag tgtgttacac gtgattcaca acttcaatcc 60 cgtggaggga gagaaaaaaa aatctagttt ttttcaaatt tacatcgttc tccaaacgat 120 agtagttgct ttaggatttt caaattttgt agtgaacaaa tgatagtaca tcaattagtt 180 tttcaaattc aatttgtcgt ttacccacga tagcaaatca actagtttat tcaaattttg 240 tcgttaaaca acgattgtta ttacctagta ttcaaattta aaaactataa tcatggattt 300 agaatttagt cacctgtata aaaacttact tattcatcac ttagagagat cggagattga 360 gcatgaacta actattagag cagtacaatt cgaacctacc gaaacacgtg ccgcgattca 420 gagaagatta cgcgaccggt taaaagaaga aagagagaaa agtctagcag atttagactt 480 tatcagatgt ttcaaatccg tcgaggccga aatcaaagag atagatgaaa atctttctca 540 aattaggaat tttttggaaa acaaaacacg gttcgaagga attcgcgaga gtttaaaaac 600 gcgcttagta cactatttcg agagaacaag aagagtacag caaattgcag ataaagaaga 660 agatcttcga gacctcgatg aactaatttg cactattaga gaaacgatga acacgcactt 720 tccatccata ggtccaaatg aagtaattca agagcaagta attcaacaaa ttgtccagac 780 cctttccgat ttgaacctca ttccgtcgaa caccaatgta ggatccgtaa atcagttacg 840 cgaagaagca aatgctaata aaaatgataa tttgctagat gaagcaatag gaggtagcca 900 gcaaagggaa cctgcaaatc cttccggtcg tcgttcacta aaatctttac ggaaaactac 960 taacgttcag tcgggattga actcgttaaa tcgcgggtcg aaaagtctcc tgatctcccc 1020 ggtacattcc ggtggtagct ctcaggtcag atcagaacgg cttatcagta aattttcctt 1080 agaaccgatt tcgagtagga acaaagttcc acaacgcgaa acctccgatt ctgtttccag 1140 tgtggaatca gatagttcga ataactcggt ggttccctca gcgcgttcgc gtcgtcatag 1200 ccggtattcc caaaaacgca gaccggtatc tgagtggaat ctgaaatacg atggtaaaga 1260 taacggacaa aacctcatgc gcttcattaa agaggtgcaa ttctacgcga agtccgaaaa 1320 agtatccgat aaagaattgt ttcgttccgc gatatatttg tttaaggatc aggctaaagt 1380 ctggttcatg tcgggcattg agaacgagga tttttccacg tggaaagaac tcttatcgga 1440 attgaaacgc gaatttctca gccctgacca cgatcacgta aatgaaacgc gcgcaatatc 1500 acgcaaacaa ggaccaaaag aacgattttc tgatttcctt gcagaaatgc aacgaatctt 1560 caattcccta actagaccca tgacggaaaa gaagaaattt gaaattgtct tcaggaatct 1620 tcgcgccgat tataaggctc acgcgatcgc gtcgaacatt gataacctcg cggacctcaa 1680 gaaattcggt cgtcaattag acgcaacgtt ttggtttaaa tttaacgcga atacgcaaga 1740 aacgaatggg tcgcgaaacc gtgcgcaagt gaacgaggtg agcttcggcc aaaaacccag 1800 gcaaaaccct ccggctgaag attcgaataa aaaattcaaa tctcgtcaat tttaccgatc 1860 aaaacccgaa cgctcagatg aagaacccac taagaagaaa accgaagaaa ccgcgaaacc 1920 gccaacatca aaaccccaaa cgagcgacaa aggtttgcag gtcctggttg aaaagtataa 1980 agtacccgcc ccaggcactt gcttcaattg tagattacct ggtcatcacg cagctgattg 2040 cgaggggcca aaacacaaat tttgtcgaaa gtgtggtttc ttaaactacg acactaataa 2100 ttgtcctttt tgtgcaaaaa acgcacaata gacggccagc aggggtgggc tcgtcaggaa 2160 aggcatagaa atccccttaa cacgaaggac ctcagtaatg ctttacctca ttatgggttc 2220 cacccgtttg agaataatga ttacgaccat gaacctaaag agcttgaaga gttgttcatc 2280 caattacaga atgatgagcg gcccttcgtg caaataaaaa ttttcgaaat accaatcaca 2340 ggcctactcg acagcggtgc acaccgtagt atacttggag ccgaatcgca taaaatcata 2400 gacgccagta agctcaagct tttacccact aaagtcgatc ttgtgacggc tagtggccaa 2460 aaactcagcg tccagggatg tgtaaacctc cccatgacgt tcaatggcga aacgaaactg 2520 attgagactt tggtagtacc ggagttaaaa agaaagctta tcttaggtac tgatttttgg 2580 aaagcctttc aaattgtgcc ttcagtttcc tcaaatacgg tagaagaact tgaagtacaa 2640 ccaccaactc cagacttgac cctagaacaa ctctcagaat tagagtctgt gaaagctcaa 2700 tttaaagtgg ccgaagaagg aaagctagat actactcctc tcatttgcca ccgaatagag 2760 ttatctgaag aggccaggaa gaaatcagcc gttcgtataa atccattccc aacttcacct 2820 aaaagacaag aacagataaa taaggagctc gataagatgc tagaagcagg aataatagaa 2880 cgctcgtata gtgactgggc actacgatta gtgccagtcg acaaaccaga tgattcggtg 2940 cgattatgtc tcgatgcacg gctacttaat gagcggacag tccgtgattc atatccgttg 3000 ccacatgccg accggattct tagccggtta ggtccgtgta aatatatatc caccatcgat 3060 ctttcaaaag cctttctaca ggttccttta cacccaaaat caaagaaata caccgcattt 3120 tcggtactcg gccgtgggct tttccacttc acgaggatgc ctttcggact ggttaacagt 3180 ccggcaaccc tatccagact aatggaccgt gtccttggtg gaggagaact agaaccaaaa 3240 gtttttatct atttagatga tatcatcatc atcagcgaca cattcgaaga acatatgaag 3300 ttactccgag aagtggcaag gcgtctaggg caagctaact tgtccataaa tatcgataaa 3360 tccagattct gcgttttgga agtcccgtac ctcggatata ttctgagtac gcaaggactc 3420 cggccgaacc ctaaccgagt ggaggcgatt gtcaattttg aacgtccaaa ctccctcaag 3480 gccttgcgac gttttttagg aatgtgcaat tactatagac ggtttatatc cggctatagt 3540 gatttagtac gacccttaac tgacctcctg aaagataagc caaaatcggt acgctggagt 3600 gatccagctg aaaaggcgtt catcaaagtg aaggagttac tgatcagtgc cccaatccta 3660 acgaacccag attttagcaa atctttctgc attcattgcg acgcgagcga ctatgcgatc 3720 gccggcgtat taactcaatc acatgatgga gtagataaac ccatttcgta cttttctcaa 3780 aaattaacag gggcacaaca acgctatttc gccaccgaaa aagaggccct tgctgtgcta 3840 aaatccatcg aaaaatttcg gtgttatata gaaggttcta aatttacagt gtatactgac 3900 gcatctgccc tcacctacat tcttcgcagt acttggcgta cttcatctcg actgtgtcga 3960 tggagcatag agctacaaag acacgatatg atcatcaagc accgccgagg cgttgacaac 4020 gtggtgcctg atgcgctatc gaggtctgtt gaagagctgt ctgcttcacc gacaagtcaa 4080 gagtgctata gcaacatgat gcagaaggtt caaactgaac ccgaaagatt taaagacttt 4140 cgtattgata atggagtact taagaagttt gtagccacct caaacgacct cctcgattat 4200 cgcttcgagt ggaagttatg cctcccctcc gatatgcgag aaaaagtttt ggtggaagag 4260 catgacgatg ctctccactt aggagtcgat aaaaccatag ccaaggtcaa aaggaaatat 4320 tattggcctc agttgaccaa agatgtacgc tctcacatcc aaaaatgcac gatttgcaag 4380 cagagcaaac catcaaatcg cgcacaacac cccgaaatgg gtagacaaag aataaccacc 4440 aaaccgtttc aactaattgc cttggatttc atccaatcct tgcctcgaag caagaccggc 4500 aacgcacacc tactagtcat aatggatata ttttcaaaat tctgtcttct atttccgcta 4560 cgcaaaatat cagctccaca agtctgccag atattagaac aaaactggtt cagacgatac 4620 tccactccgg aatacctaat cagtgataac gctactacct tcttgtccaa ggactttaaa 4680 gcacttctcg acaaatatgg tatcaagcac tggagaaatt cgcgacatca tagtcagtcg 4740 aatccagtcg aacgacttaa ccgaacgata aacgcgtgca tccgtacgta cgttcgtaca 4800 gatcagaaaa tgtgggattc gagggtttct gaaatcgaat tcgctcttaa tagtacacca 4860 cacggttcca caggcttcag tccttataga attcttttcg gtcacgaaat tataggaaaa 4920 ggggatgagc atagaatgga cagggatgtg gatgaggtct cggatgagga aagggtaggc 4980 aggaaattag aaatcgatcg tttaatccac aatatcgttt acaaaaatct tcgaaaaaac 5040 tacgaaaaga acgcgaacat ctacaatctc aggaataagg cttcaacgca atcgtacgcg 5100 gttggacaga aggtcctgaa acggaatttc caacaatcgt ccgcgatcga cttctacaac 5160 gcgaaactcg gtccgatgta cctcccatgc acggtaatgg ctcgcatagg aacgtcgtcc 5220 tacgagctgg cgaacgaaca aggcaaatcc ctgggaattt tctcagctgc cgacatcaag 5280 cccgatgatt cctaaaacga aagaaaattt caaaaaaaaa acgttgtccg attcaattta 5340 aacgaatact tatcgtttcg attcccgaaa acagctgcgc agcataaagt tatgttgacc 5400 cgtcgcgacg ttcgcaattc ggtgcacctt gcgaatgtaa acaaacataa ttaggtcaaa 5460 caaccacgaa atttagtcag ctcacgttct tgctcaataa cttacctctc catagtgtat 5520 acagcttcgt caataaactt ggaaggcagt atccaacacc gatcaatgta aataacaccg 5580 gccaaggaaa acggaactgc cgagactagc gcgcgtaggt ccacctccaa gataaacgat 5640 gaaactgcca gcattctttc acgattcaat ttcacctaca gattagattt aatcacttaa 5700 ctattccgaa ttatgtttca ctttaagttt ctgtttactt tatgttgctg atgttgtaaa 5760 tgatgagttt gacagttgta atgtacgtgt gttggtaaat gacctctcac tttcttccat 5820 tcagtggaaa gagtaactca gaggctcagc agtgtagcca gattaatttc ctcacaattc 5880 gattatttat tcacgataaa tatttcgaaa ttgaaaacgg aaaagacgtg taatctctgt 5940 agtgtcccca ataccaactt ctaaaatgaa tttcatccat tttaagaaga gggagtgtgg 6000 ggtaa 6005 // ID Copia-36_CQ-I repbase; DNA; INV; 4173 BP. XX AC AAWU01006152; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-36_CQ_; KW Copia-36_CQ-LTR; Copia-36_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4173 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 375-375 (2011). XX DR GenBank; AAWU01006152; Positions 1086 5258. XX CC Positions [1396-1932] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 100..4170 FT /product="Copia-36_CQ-I_1p" FT /translation="MSQAGSFPALERLAGRENWADWKFAMQTYLEVEEFWT FT AVKPAKKADGTYEAVDAVKDRKARGKIILHLERHNFCHVKDTKTAKEAWEA FT LEAAFEDAGLTRRTGLIRKFGSTHLEDCSSMEEFVNTVMSTAHQLRGIGMV FT IPEELVGGLLLAGLPESYRPIIMALENSGTVISGDSIKTKLLQELPVTSAG FT AAYSSRKQERRPEQKRPPRPTGDSKGPKCRRCQKFGHIAKDCRSKDSNRRD FT GDAWCTVLSTVNEDDHWYFDSGASNHFAKSKETLDNLENYSGTVVAANRGA FT MKVVAKGSMKLWTTVSPDEKPIEVNDVQVIPDLSTNLLSVSQIVKRGNTVT FT FDKGGVKVINAAGAIVATGSMVKDIFRLNQAVPKPQALAVTAAESLELWHK FT RMGHLNFDGVRRLKNGLVSGVHYNESAVEKCKVCAMGKQTRLPFSKAGSRA FT SDLLELVHSDICGPMEVQSLGGRRYYLSFIDDKSRRIAVYFLKQKSADEVY FT DAFEDFRCKAERQTGRKMKILRTDNGKEFTNKKLQVLLSRLGIRHQTTVDY FT TPEQNGLAERGNRTIVERARCMLFDANLPKTFWAEATGTAVYLANRSPTKG FT HDKTPEEVWTGRKQNLAHIRVFGTPVMVHIPKQKRKKWDAKAHECILTGFE FT EDIKAYRLWDPIAKKLIKRRDVTFLREGGVPASTPVSAKQPMVIRLDFEEP FT IPCVQEEDEPPVQLEEAEPVVDDVPQDPEVIAELENDNAEVEVEVEDAEPL FT AGPSCDVTPIALPSRPLHPPGSPALLRSGRERSLPGKFKDFILSSKGLSLP FT SFTANSGYDEIVSDSETSGDENSDDTLVGLAARRQPPKGRDPVTPAEAMKR FT GDAKLWKDAMDDEYRSLMENGTWELVQLPPDRKAIGCKWLFKTKEDEKGNI FT VRHKARLVAQGFTQKFGVDFDEVFAPVARQETFRILLTIASRRKMVAKHVD FT VKTAYLHGKLEETIYMKQPAGYTTGDVNTVCRLKKSLYGLKQSARVWNHKI FT DAVFKQLGFSQAKADPCLYVRKTGRSTAYIIIYVDDMVIAAETEEEFEAIF FT NGLQQHFTVTNLGDLKHFLGMEVERAADGFKLNQQKYIDKMASRFGLEDAK FT KSKIPLDPAYLQQKEENDQLPNNTDYLSLIGGLLYVAVHTRPDIAVSTSIL FT AQKSSRPTQLDWQEAKRVLRYLKGTSDHKLHLGSTGAGLEMYVDADWAGDH FT RDRKSNSGYLVRFGGGLVSWGSRKQSCVALSSTEAELVALTEGCKELIWIQ FT MLLGEFGIKINRAVPVYEDNQSCIKLVDGNKIEKRTKHIETRYFYVRDLKE FT KKMIDLQYCPTEKMLADILTKPLQNLRIKMLREEIGLLPDHVEEE" XX SQ Sequence 4173 BP; 1093 A; 1019 C; 1260 G; 801 T; 0 other; aaaggttatg ggcccaggaa gatttcaaaa ctgaacactt agaagaattt tgagaagcag 60 ttattttttc aagaagttta cgtaaagttc cagagcaaga tgagtcaagc cggaagtttc 120 ccagccttgg aacggcttgc tggtcgcgaa aattgggccg actggaagtt tgcgatgcag 180 acgtaccttg aagtcgaaga attttggacc gctgtgaagc cggcaaagaa ggcggacgga 240 acttacgaag ctgtagatgc cgtcaaggat cgcaaggcaa gaggcaagat catcctgcac 300 ctggaacgtc acaatttttg tcacgtgaag gacacgaaga ccgccaagga agcgtgggaa 360 gcgctcgaag ccgctttcga ggacgcgggg ctgacccgac ggaccggcct gatcaggaag 420 ttcggttcga cgcacttgga ggactgttcg tccatggagg aattcgtgaa cacggtgatg 480 tcaaccgcgc accagctgcg tggtatcgga atggttattc cggaggaatt ggtcggtgga 540 ctgctgttgg cgggactgcc tgaatcgtac cgcccgataa ttatggcttt ggagaattct 600 ggaacggtga tttctggaga ttcgataaaa acgaagttgc tgcaggaact cccggtgaca 660 tcggcgggcg cggcctactc aagcaggaaa caggaacgca gacccgagca gaaacggcca 720 ccacgaccaa ccggcgacag caagggtccg aagtgcaggc gctgccagaa gttcggacac 780 attgcgaagg attgccgctc gaaggactca aatcggaggg acggagacgc ttggtgtacc 840 gtactgtcaa ccgtcaacga ggacgaccac tggtacttcg attcaggggc ttccaaccac 900 tttgcgaagt cgaaggaaac gttggacaac ctggagaact acagtggtac agtcgttgct 960 gcgaacagag gtgccatgaa ggtcgtcgca aagggaagca tgaagctgtg gaccacggtt 1020 tcacccgacg agaagccgat tgaagtcaac gacgtgcagg taattccgga cctatcgacg 1080 aacctactct cggtaagcca aatcgtgaag agagggaaca cggttacgtt tgacaagggt 1140 ggagtgaagg tgatcaacgc agccggggcc atcgttgcca ccggaagtat ggtcaaggac 1200 atcttcaggc tgaaccaggc cgtaccgaag cctcaagcgc tcgcggtgac tgctgcggaa 1260 tcgttggaac tttggcacaa gcggatgggc catctcaact tcgacggcgt gagacgtttg 1320 aagaatggtc tagtctccgg agtccactac aacgagtctg ctgtcgagaa gtgcaaggtt 1380 tgcgctatgg gtaagcaaac acgcctcccg tttagcaagg ccggttcgcg cgcaagcgac 1440 ctgctggagt tggttcactc cgacatctgt ggaccaatgg aggttcaatc cctcgggggt 1500 aggcggtact acctgagctt catcgacgac aagtcgcggc ggatcgctgt gtactttctg 1560 aagcaaaagt cggcggacga agtctacgac gcgttcgagg atttccgctg caaggcggag 1620 aggcagactg gtcggaagat gaagattctt cgcacggaca acggtaagga gttcaccaac 1680 aagaagttgc aggttttgct cagccgcttg ggaatacgcc atcaaacgac ggtggactac 1740 acgccggagc agaacgggct tgctgaacgc gggaaccgga ccattgtcga gcgtgcacgg 1800 tgcatgctgt ttgacgccaa cctaccaaag accttctggg cggaggcaac gggcacagcg 1860 gtgtacctgg cgaatcgatc tccaaccaaa ggccacgaca agacaccgga agaggtgtgg 1920 actggacgta aacaaaatct ggcacacatc cgtgtgttcg gaactccagt catggtgcac 1980 atcccgaaac agaagcggaa gaagtgggac gccaaagccc acgagtgtat tctgacaggt 2040 ttcgaggagg atatcaaggc ctaccgcctg tgggatccga tagcgaagaa gttgatcaag 2100 cgacgagacg taacttttct ccgtgaagga ggagtcccgg caagcactcc tgtctctgca 2160 aagcaaccga tggtgattcg tctggatttt gaagaaccca tcccgtgtgt gcaggaagaa 2220 gacgagcctc cggtgcagct ggaagaagct gaaccagtcg ttgatgatgt cccacaagat 2280 cctgaagtca tagctgaact cgaaaacgac aacgctgaag ttgaagttga agtagaggac 2340 gccgagcccc tggcaggtcc cagctgtgac gtgacaccta ttgcgctccc ctcgcgacca 2400 ttacatccgc ctgggtcacc ggcgttgttg cgcagcggtc gggagcgcag tcttccaggc 2460 aagttcaaag attttattct gtcgagcaaa ggcctgtccc ttccatcatt cacagccaac 2520 tctggatatg acgagatcgt ttcggattcg gaaacaagtg gagacgagaa ctcggacgat 2580 acactagttg gacttgcagc cagacgtcag cccccgaagg gtcgagatcc cgttacaccg 2640 gcagaggcga tgaaacgtgg cgatgcgaag ctgtggaagg acgcgatgga cgatgagtac 2700 cgatcactca tggagaacgg cacgtgggag ctggtccagc tgcctccgga tcgcaaggcg 2760 atcggatgta aatggctgtt caagaccaag gaggacgaga agggcaacat cgtgcggcac 2820 aaggccaggc tcgtggcgca gggcttcacc cagaagtttg gcgtggactt cgacgaagtt 2880 ttcgcacctg tggccagaca agaaacgttc cggatcctgc tgacaatcgc tagccgtcgg 2940 aagatggtgg caaagcacgt ggacgtcaag acagcttacc ttcatggcaa gttggaggag 3000 accatctaca tgaagcagcc tgcaggatac acaaccggag acgtcaacac tgtgtgtcga 3060 ttgaagaaga gtctgtacgg gctcaaacaa tccgctcgag tgtggaacca caagatcgac 3120 gcagttttca agcaactggg cttcagccag gcgaaggctg atccgtgcct ctacgttcgg 3180 aagaccggaa ggtcgacagc gtatatcatc atctacgtgg acgacatggt gatcgctgcc 3240 gagactgagg aggagtttga agctatattc aacggcctgc aacagcactt caccgttacg 3300 aacctcggag atctgaagca tttcctggga atggaagttg agcgagcggc cgatggattc 3360 aagttgaacc agcagaagta catcgacaag atggcaagtc gattcggtct ggaggatgcc 3420 aagaaatcga agattccctt ggatccagcc tacctacagc agaaggagga gaacgatcag 3480 ctacccaaca acaccgatta tttgagcttg atcggaggtc tgctgtacgt agctgttcac 3540 acccgaccag acattgccgt tagcacatcg attcttgctc agaagtcgag tcgtccgacg 3600 cagctggatt ggcaagaagc aaagcgagtg ctacgctacc tgaaggggac gagcgatcac 3660 aaactgcact tgggatctac cggtgcagga ctggagatgt acgtggatgc agactgggca 3720 ggcgaccacc gggaccggaa atctaactct ggatatctgg tgcgttttgg cggcggactc 3780 gtcagctggg gttcccggaa gcagagctgt gtagcgctgt catcaacgga agctgaactg 3840 gtggcgctga ctgaaggatg caaggagctg atctggatac agatgctgct gggcgagttc 3900 gggatcaaga tcaaccgagc tgtgccggtc tacgaggaca atcaaagctg catcaagctc 3960 gtggacggca acaagatcga gaagcggacg aagcacatcg aaacccgata cttctacgtt 4020 cgcgacctga aagagaagaa gatgatcgat ctgcagtact gcccgactga gaagatgttg 4080 gcagatattc ttacgaagcc gttgcagaac ctgcggatca agatgctgcg tgaagaaatc 4140 ggtctactac cggatcacgt cgaggaggag taa 4173 // ID SINEL_SM repbase; DNA; INV; 5191 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE SINE-like putative non-autonomous, non-LTR retrotransposon. XX KW Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW SINEL_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5191 RA Jurka J.; RT "SINEL_SM: SINE-like long non-autonomous transposable element RT (consensus)."; RL Repbase Reports 7(10), 1099-1099 (2007). XX DR [1] (Consensus) XX CC Individual copies are ~95% identical to consensus, and no open CC reading frame was identified. The 3'-end of SINEL_SM is similar CC to the 3' end of LIN2_SM. The overall structure of SINEL_SM CC resembles that of SINE elements, except that it is much longer CC than most SINEs. XX SQ Sequence 5191 BP; 1839 A; 476 C; 850 G; 2021 T; 5 other; gtacagattt taaataattt tatgggtaca tatacgcaaa tgatacttat ttagaaattt 60 gtgtaatcat ttagagggaa aatttgatta agaaaattaa atgaaccaaa attttggaca 120 ttaaaactat taagaaatta tatttcagaa ttaatactag tattcaattt gaaaatacta 180 cttctttggt tatttaagag ttagttatca acttttgata ttcgattaaa tattttttat 240 cgttaaattg gcttttatca atatggaaat tgtgtttaat tagactaaaa cttgtaataa 300 ggtattgaat aatagctagt atattaattt atcatatatg tccacttagt ataatataga 360 gcctgttaaa tattttagtt taaatttagg attctcgata tagttttcat gaattttata 420 tctaatcgaa tgtcaattga tattactaaa gaatttattg agaatttgat gtaaactttt 480 aagagaactg tatcaatttt agagagagta ttccaacgta tgaatatttc agagtgttag 540 atataatgga aaggttagat ttatcactat ggattgggat tttgatcaat tctagtatgc 600 tttacaagtt ggaatatcga gtttagatgt agtaagttac taggggaaat attgagtgag 660 aaatttaatc gattaattat gacgtaataa taatttggat atcagtatga gtaatttcat 720 attatacagt gttaaaagaa attaatcaaa ggtaaatatt gagtagaaat aatcatttaa 780 tgttagattt tacatgaaaa ttatcagtaa tatatcaatt ttaagggata aattatggct 840 aaatattgag tatgtgcaga taaatatagg gtttttaatg aaaatgatca gtaatatatc 900 aattttaaag tgagatattt tcgctaaata ttgagtttgt gtcgataaat atagggtttt 960 taatgaaaat gatcagtaat atatcaattt taagggataa attatggcta aatattgagt 1020 atgtgcagat aaatgtagag ttttacatga aatgatcagt aatatatcaa ttttaaggag 1080 gatattatgg ttaaatatta agtatgtaca gataaatgta gagttttacg tgaaatgatc 1140 agtaatttac caaataaagg tgaaaagttt aataattggt tacgattgag gagtcttaat 1200 caaatatttg ttgggatatc agggtattac tattgcgtgt aagagaaatt cattaaagat 1260 tcttactgca ttttaataaa tctagctatt gcaggattta tgtaaacaaa tttcaatgtg 1320 gtaattcatt aaattacagt aactttaggg gttaaatcca atgtaaagta gagaaatcaa 1380 taatttagct ttatatgtgg aattattgtc agtaaattgt agcttattaa tgcaatcttt 1440 actggtaaca tgatagtttt taggtgaatc gaagatgaat tcaaggattg agtagattta 1500 tcctttttgt cgatgatttg tggtatttat tgctactatg atagaaattc attaaaattt 1560 tatcgaagat tttaaggtgg ttggatatcc atttggacat taaagtatat caatttttat 1620 caatatgcta acaaatattg tttttcttat actgttaaat tcattttgta ttgaagaaaa 1680 gtatgtgtag ttcaataaat tgttaaactt ttaataggga taattagtat atttttatag 1740 aggcaaattc ttattgaacg gtaaccataa attacgacat ttgatagtca atttaaccaa 1800 tatttgttgt acatgaaaat tttctagtca atctatttgc ttgaatcgtt ataattacaa 1860 cttttttaga tttggaaaca ttggtgaggt attctttata aggaaacttt agagtaatga 1920 attrtggtaa ttgtagcgtt ttacaatgca tataattaga ttcggataag ttttaaataa 1980 aataagtcgg tgattagcaa caagttcaac attataataa ttttaaatga cttttgttga 2040 tttatttaaa aagattaatg aaattatgtt cgattgaata gtatgatttc ctatcgacta 2100 atcaatgcgg taccaataat atttagaaaa taatttgaag tgataaagta ctgttttaga 2160 acattaaaat ttgaattaat ttgagtcgtt gtcattggta ataaatagaa ttaaccaatc 2220 aaaatcaagt attctataat ttcattttca aattattatt ttcatgaaaa ttgtaattat 2280 tttaattgat aaatttatta gtattttcaa caaccggttt tagttaaata ataagtttat 2340 tgaaaaatag tttgtacgtt ttcaaattat tttcaagaaa actgtaataa ttttaatata 2400 ttgataaatt tattagtatt ttgagtttag cgattaaatt tatcgaaaaa taatgtcgtt 2460 gtcaatagca atagtaacag ccaatcaaaa tcaagtattt ggtttctact ttttcaaaat 2520 taatttaaag aaaagttatt gaaatatgat ttttatgttg tttcaagaat aactttctat 2580 taaaaaataa gttcagtcaa taatatttct aattaacaat tgcatttaat caatcaataa 2640 ttttacgagt ttatttaaat tgacgattaa caaatttata tactttttaa tctaatttca 2700 atagagattt tcaatttaat attcttaaat taaaatattt attcatattt tcaaaccatt 2760 tttaacaggt acttaaaaat tgtcaacgat ttttaaatat atgtatgttt tttaccaact 2820 actctttatt aaagacatgt gtttacttgg tatttagtaa ttattggttt acataaggtt 2880 ataactttta atcacatagg gttaaaagtt atctttcatc ttttagggtt aaaattgtgt 2940 yatctttcat tatttagggt tagaattatt aaatatttca ttactttggg ttaaaattgt 3000 tacctatttc ttcacatagg gttaaaattc ttaacctttt aaataacttt ataacaaatc 3060 atgtaatagt ttatccataa atttatcgaa aagtattata atatacctag tttgaatcat 3120 aaattattga aatatttaat aggtatcacc tgatgaattt ttaatggtta tttaatcaat 3180 ttttggttac ataaagcttt aatgtccgaa attatcggtt atcggttgat tagcatgttg 3240 ttaatgaatt taaatctaat tttacacgtt ttggtatcaa catttaatta gtatttagat 3300 attttcaaag ctcttattaa aaacaaaagt ttatttagat atcaatagta tccacatgac 3360 tgatattgac aatgtaatac gtgtttgtaa taaatttacg tgtgaaaaat tgtctttttt 3420 attgttgtga aagttaaatt attatcaata gcattattat tagcattaat taatccagta 3480 tgcgtgaatt tttaaaaaat attgatataa aactcatgga tcgattattc tttatcaatt 3540 tatatagtga gcccagtgac tttcatttgt aaacccagtg acttattttc ataaaaacac 3600 taaaatagtt gttacaacga atttagttga tggtatcgct aataatttat agagtcgaat 3660 gaacttgtaa attagtatcc gactaacttg gtgaattatt atcaaaatat tgaattttta 3720 ttagaaacag tatatttgct ataaatttat agaatttggt taagtattta catgtaaata 3780 atcctatgat gggttatttt cataaacttt ataaatccaa tgtttaaatc ctatcttgaa 3840 tgagttattt tcaaaattct tacatcagtg gtgtcggtaa taattcgttt gggggttcaa 3900 ctcctatcgt gggcgaatta ttttttattt tcaaaattaa attttttaat agaatataaa 3960 tctgtatatt ttatttgctg taaatttaaa attttataga aattggtaaa gtaaatcttg 4020 tataatttaa ttttttacat ttagttaagt tgaaatttta aacttgttga attattttct 4080 taaaatattg aattttgtaa atccagtgtt caaatttaag ttttgagtga cttattttta 4140 attaaataaa ctatcaatat actataatgg ttgttaaact taaattttca tgaaaattat 4200 taaagtttta tgattatttt gataaaattg ttatgtaata tactactaac aaaatagtca 4260 gtaagctaaa taatttagga ttgaacattt atttttaaga cgagtttgat gttaatttat 4320 tatgtttaat gattattttg ataaattttt tatgtaatat aytactaaca aaatagtcag 4380 taagctatat aattaacaat tgaaaattta tttttaagac gagtttgatg ttaatttatt 4440 atgtttaatg atcatgtaaa tataattttt taatatatca catgcaagta acatgaatca 4500 atgaggtaaa cgaaactctt acgtttatat gaagtaaaca gtataataat aataccaata 4560 gtactagtga gtgattttgc taagtctacg acttagtcgt agacgtagtt gtaggcgtgt 4620 tgggcgctga tgggtgctga tgggtatgga tgggtacgga tgggtacgga tgggtgygga 4680 tgggtgcgga tcggtgcgga tcggtgcgga tcgttggttg gtgtgtgtgt gtgtgaaatg 4740 tttaagaacg aaaggttcta gaaggtaggt gtgtggaagt ttctagaagg taggtgtgtg 4800 gaaggttcta gaaggtaggt gtgtggaagg ttctagaagg taggtgtgtg gaagkttcta 4860 gaaggtaggt gtgtggaagg tgtgttggtc cacaactacg tgacataata acagttaggt 4920 aactaagtgt gtctccgagt agtgaccagt gacataataa gtgtgaccag tgacataata 4980 agtgtgacca gtgacataat aagtgtgtga ctgaccagtg gtcaagtgac gtaatcatga 5040 ggtcactgac caaaggtcaa ttactaggta gcaactagtg gacaaaggtc aatctccagg 5100 tagcaacagc gtgtgcagag gtcaaatgtt acgtaataat tacgtaagcc aaggggggtt 5160 gtatctgacg ggctcacatc ctactactta t 5191 // ID Gypsy4-LTR_Dpse repbase; DNA; INV; 1616 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_Dpse; KW Gypsy4-I_Dpse; Gypsy4-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1616 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1048-1048 (2009). XX DR Genome; Unknown_singleton_87; Positions 13699 12084. XX SQ Sequence 1616 BP; 478 A; 331 C; 366 G; 441 T; 0 other; tggcactttg tgccaagcag atttccaagg gagaatagcg gaaaaagata attgggatcg 60 atgaccgagc cggccgcagc cacaaccagc tggccataga aagggaagag aagagactgc 120 gctcccgaaa aaaaacacaa gaaccgatcg aattcgccgc tggcgcagag aagcgcaagg 180 cagagagtcc gccgctccag ctgtgggtcc gactcagtgc atcagcagca tcgtcggcgc 240 cgggaagctg cgaaactaaa agcggaatat tgaacatttt ttttgcccca agtttttttt 300 gcgtaaatat cttgccctcg gccgacctgg cttagtgaaa aaaaaagtag aggaaataaa 360 aacccagatc aaaattatac ccaggagcca ataggcaagg tcttagtgag tacgaataca 420 aataccttgc tcccctttcg gcagtgcagc gccgttaatt cagtcttgct tttcttttcc 480 cttaaaccgt aagggctagt cctatgtgga gcccaaagaa aagaaaggca taagggaata 540 gaaatacaat atccatgctc aaaagcaaag aaaaaaagca gtcccccgaa agcctctgcg 600 cgccttgttc tttctcttct tttctctttt tgttgctgtg ctctcccaat aaaattacat 660 ttacagcaat tatagcaagc ggatcgtctt cgacaaaagc gcttttggag atggatggca 720 cgtaagtgaa agaatattgc tttactctaa ataaattgca tgcatgagcg tgagagaagg 780 agaggtatag gaaagaagga aggggagagg taaaacactt gggcagatcc acatgccgga 840 agagatagca agaacagtaa ccatgggttt tttgttgttg tcacaaatgc tctgttaaat 900 ttgtatgcca gtttagcggc tacctcctat atacatatac tgttttttta ttatatacac 960 atttaattct tttttttcta tttagttata attcttctat tttaattatt attagtatta 1020 ttttaatttc caacctggaa tattctcgaa tcctgttgtt cgctaaaata gttattgagt 1080 caattgggag tttaagtaga gtagggaaat gcccctgtgt caagtaataa ataagtttta 1140 aaatgaaatc gataatgaac taaattattt tagggcgcca tgccgagtgg attatggaaa 1200 tgaagtgagt ctgggggatt tagggatcat cggccactaa tatggctttg cggcccagat 1260 ggtcaaggta gggtgggcaa aacgctgcta cacctacagc tactcagcag cacaatccgt 1320 acttcagcat gttctgtttt ctttgcagag caacctctat gtgtgcatag cttgtcactc 1380 ttggtagttt cttttctgta tgtccagtcc gatcgatcat gtgttaggga ggaaaacagc 1440 acgcgaaatc aatattatta tatgtatgta tccaatgcga gcaggtagca cagtggttaa 1500 tcctcacccc ccccttgccg acgacaaata gaagccggaa ccagcgcgtg ctcctctggt 1560 cgctacaata agatcagctc gttatgatag cgtagcgcta tacgttgtat gctaca 1616 // ID Gypsy-616_AA-LTR repbase; DNA; INV; 259 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-616_AA_; KW Ty3_gypsy_Ele25; Gypsy-616_AA-I; Gypsy-616_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-259 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 259 BP; 74 A; 59 C; 52 G; 73 T; 1 other; tgtgggacta tcttgagcat ccccatatga tckaccagtt ccacacagca cccgaacatc 60 accccgcgag gagaagcata cgaaaatagt gtcagtatct gaaagtgaga gccatcgttc 120 ggtcacgttg ttagtgaata tacaatttga attcgatccc cgtatttgaa agtttttcga 180 tatccgaagt tgtcgtccga aacgcgtatc ccttcttttc gcgctagaaa agtgaagcat 240 ttaattaata tttgctaca 259 // ID Gypsy-259_AA-LTR repbase; DNA; INV; 120 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-259_AA_; KW Gypsy-259_AA-I; Gypsy-259_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-120 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1120-1120 (2011). XX DR [1] (Consensus) XX SQ Sequence 120 BP; 33 A; 25 C; 23 G; 39 T; 0 other; tgagtagtaa agctaaaccg tgtaagcacg cattgaaatt gttaccgcaa taaaatattg 60 ttttcgatag ttttcgcgta cgtttattgt ctccattaat tcggctgcac acgtccgcca 120 // ID Gypsy-124_AA-LTR repbase; DNA; INV; 270 BP. XX AC AAGE02024975; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-124_AA_; KW Gypsy-124_AA-I; Gypsy-124_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024975; Positions 11181 10912. XX SQ Sequence 270 BP; 61 A; 62 C; 70 G; 77 T; 0 other; tgtgtggcac accctactac gcaacagcat tgatgataat gatgatgata atgatgacga 60 gtatatcaca tatacacgta gaacgcttgc accgatgtac cgctcatgtt atctgagtgt 120 gagtgtgcgt ttattctaga gcgaggcttc acagcacatt cgggacttgg ttcgggctct 180 ttctgcatcc agcgctcgtc aaggtaagac gtgttttcca gcacagccgg ttgggtgatt 240 ccgcttgaga cgcgcttctg gttttgcaca 270 // ID BEL-9_DPu repbase; DNA; INV; 3169 BP. XX AC scaffold_967; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE BEL-type retrotransposon from Daphnia (internal portion and 3' DE LTR). XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-3169 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_967; Positions 9349 6181. XX CC The ORF extends to 3' LTR, which is included. XX FH Key Location/Qualifiers FT CDS 761..3166 FT /product="BEL-9_DPu_1p" FT /translation="MVSGFPRSSLVLHPSAVQAFVVSVNQRFRQRELCVYS FT THMPYLIHPTRSTEAQAGHAQSAQANSAMPNPGFGSSQEEDESRKVSPPAG FT DVELTKKRRTTVRSQITSTIRQIRASIDQCGSRGSIAGLVKHLQSLATTAT FT LLHTDLLIVEDASENERQEEKHLMYVQQIGEVIAEADEHLKSRADEAPSEV FT DGGRAARVTEQEIRAAKQLIEQTRTQAEQARKRAEELQIQPQQAEEAPQDL FT QQGDANPENFSSVSNRGGSFTKLAADWKRKQAQQNTASDDWIDGYANGTLK FT PIYTAGSRSSVKSDLEPFSGRSLDWFTWIDLFRALVHDTEKTPGEKLALLH FT RYVRGDCLDVIHGLGGGEGAYIEALIRLKESCGRRDVMRAAHIQAIEGLEL FT KNDPAIFKRYAEKVRTHLFDLSRIGETSSAGLIEKICLRLQLHDRLAWNDG FT RKGGLETRSLNTFGVWLCERAAAYQNAYSIAAEQTTVAQKPNVRFAARANP FT VSSKQSSTPQTSSKSASRPFFFKCVGDHKLEICGAFKSSSVGERVGFCAKH FT RLCFGCLKPRHAIRFCPQRKPCSQSGCTLFHHALLHDVNRANPDTSITARP FT ANLLSDIGRNRRVAMGMMRLKIQDADWHWITANVFVDEGSDSTLMRQGFAK FT LLKLRGAHHILTVVGAGNVIKHYPSQRISFGIRDSDGSVVAITCSTLPTVA FT SDTPVTDWPVLKKRWNHLADLPVTMTGGKVDILIGTDHSPLVTALESRIGG FT DYEPTAVRNRFGWLIRGVVQDGTTITAVRTNTIIGSTQLAQLTDVMRQFCE FT " XX SQ Sequence 3169 BP; 824 A; 786 C; 749 G; 810 T; 0 other; tatatttgcg ggcagaaatc gtaaaggtaa attgccaaac gtgttccggc gaaattatgt 60 ttgtgtcgat ttatcgcaat ctggcaacac tcttttccat ttctttcccg ctgtcagaaa 120 tacgccctag cattcgtctg cctctcgttt ctatgtcagg ttgtcagagt tttgagttct 180 tctaaatcac cggcatttac gtttctttct gtgatttggt tccgttttct gtctaacaga 240 ggtgagaaat gtctcatata tattttcata tgtcttgtac taattatgta tatctgcagc 300 tcaactgtct tgagatattc tgcattctgt cagtttattc cactgtgtga ttcgttctat 360 ggtctacaaa gcttgcaagc aacttatgtt ttgtccttgc ttcgcaggta tgattagtgc 420 tagtcttatt atatattatg ttacatgttc ttatactatc tttgcctctt tcaggcacca 480 tcgttaactc tcatagtcat actgccattc tggcagacaa atctttactc gcaatacatt 540 gaagagaaaa cccactgccg gcctatgctc taatttagct tctggacatt ttggtccttc 600 gaaccggagc tttagtcagg gaaggtggtg gccagcttat tcttcagcca acatcatcat 660 tgttggcccc tcatcattct gctacacgag tcctgtagcc agccttctac tgccagcatt 720 tacaacgttg gcctgtcata tcactgctgc ccagttcttc atggtcagcg gcttcccacg 780 ttcgtcattg gttcttcatc cttctgctgt ccaggccttc gtggtcagcg tcaaccaacg 840 cttccgtcaa cgtgagctgt gtgtttactc cacccatatg ccttatctga ttcatccaac 900 tagatcaacc gaagcccagg caggtcatgc ccaatcagct caagccaatt cagccatgcc 960 gaatcctggt ttcggatcca gccaagaaga agacgaaagt cgaaaggtga gcccacctgc 1020 tggagatgtc gagctcacga agaagaggag aaccaccgtc cgcagtcaga tcacctcgac 1080 gattcgacag attcgtgcca gtatcgatca gtgtggatcg cgtggaagta tagctggcct 1140 cgtcaaacac ctccagagct tagcaacaac agctacactc ttgcacactg acttgctaat 1200 agtcgaagat gcaagtgaga acgaaaggca agaggagaaa catttgatgt acgttcaaca 1260 gataggggag gtgatcgccg aagctgatga gcacctgaag tcgagagctg atgaagcccc 1320 ctcggaagtg gacggcgggc gcgctgcgag agtaacagaa caggagattc gcgcagcaaa 1380 gcaactgatc gagcagaccc gtacgcaagc agagcaggct cgaaagcgag cggaggaact 1440 tcagattcaa ccacaacaag cagaggaggc tccccaagat ctacagcaag gcgatgccaa 1500 ccccgaaaac ttctcttccg ttagcaatag gggcggatca tttacgaaat tggcagcaga 1560 ttggaaacga aaacaagctc agcagaatac cgcttccgac gattggattg acgggtatgc 1620 caacggaaca ttgaaaccta tctacacagc tggatcccga tcgtcagtga aatcagacct 1680 agagcccttt tccggaagat ccctggactg gttcacgtgg atcgaccttt tccgcgcttt 1740 agttcacgac acagaaaaaa caccagggga aaagttggcg ctgcttcatc gttacgtgag 1800 aggagattgt ctcgacgtga tccatggcct aggaggagga gaaggagctt atattgaggc 1860 tctgatacgc ctcaaggagt catgtggtcg gcgcgatgtg atgcgggccg ctcatataca 1920 agccatagaa ggcctcgaac tcaaaaacga tcccgccatc tttaaaaggt acgctgaaaa 1980 ggtacggact catctgtttg acctctctcg tattggggag acgtcttctg cgggattgat 2040 tgagaagatt tgcttaaggc ttcagctaca cgaccgtcta gcatggaacg atggccgaaa 2100 aggaggcttg gagacaagaa gcttaaatac gttcggagtt tggctttgtg agagagctgc 2160 tgcctatcaa aacgcttaca gtatagcggc tgaacagacg acagtggctc agaaacccaa 2220 cgtacgcttt gctgcccgtg ctaacccagt ctcttccaaa caatcttcca cgccacaaac 2280 ctcatcgaaa tccgcttcac gcccgttttt tttcaagtgt gttggtgatc ataaactcga 2340 gatttgcggc gcctttaaat catcttctgt aggagaacgt gtcggctttt gtgcaaagca 2400 tcgactgtgt tttggttgtc tgaaaccaag acatgcgatc cgtttctgtc ctcaacggaa 2460 gccctgtagt caatccggct gcaccttgtt tcatcacgcg ctactacacg acgtcaatcg 2520 agctaatccg gatacttcca tcactgcgcg tccagctaat ctactcagtg acatcggaag 2580 aaatcgaaga gtggctatgg gaatgatgcg gctcaagata caagatgcag attggcactg 2640 gataacagcg aatgttttcg tggatgaagg aagtgactcc accttgatgc ggcaaggatt 2700 cgccaaactc ttgaagcttc gtggtgctca tcacatcctt accgtcgtcg gagccgggaa 2760 cgtcatcaaa cactacccct cccagcgaat cagtttcggc attagagact cagatggatc 2820 tgtcgtcgcc attacctgct caacgttacc aaccgtggcc agtgatacac ctgtaacaga 2880 ttggccagtt ctaaagaaac gttggaatca tctagctgat ctaccagtga cgatgacagg 2940 tggtaaagtg gacatattaa tcggtactga tcattctcct ctcgtcacag cattggaatc 3000 aagaattggc ggtgattatg aacctacggc agtccgaaac aggttcgggt ggttgatccg 3060 aggagtggtg caggatggaa ccacaatcac agctgtcaga accaacacca taatcggatc 3120 gacccaattg gctcaactta ccgacgtgat gagacaattc tgcgagaca 3169 // ID LOA-7_CQ repbase; DNA; INV; 2742 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2742 RA Kojima K.K. and Jurka J.; RT "LOA non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 154-154 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 28..2646 FT /product="LOA-7_CQ_1p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="KGRKTPDAAKLQKVLSKDHSNGLGSLKKNDGSFSKSD FT EETLEIMMETHFPGSILAANEDLGTKSVDYGRDTTMEKSFVPKSIFTRAKV FT AWAVSCFEPFKSPGRDGIFPALIQKGGGKLIECLTNIFKSSLELGHIPKIW FT SQTRVVFIPKAGKRDKTAPKSFRPISLTSTMLKVMEKIIDEYIKSKFLHTK FT PLSRFQFAYQQNKSTVTALNELVSRLGKNMSAKEIALAAFLDIEGAFDNAS FT YKSIKKNMEKRGFEHCITRWVMAMLDSREVFAELGGSSVTVKTTRGCPQGG FT VLSPLLWSLVVDDLLKKLSELGFEVIGFADDVVILVRGKFEEIITERMQYA FT LNCTFEWCKNEGLNINPKKTVVVPFTRKRKILLKTLTLDGSALEFSKNVKY FT LGITLDAKLNWNLHLDQAVQKATTALWVSRKTFGMKWGLKPKMISWIYSAI FT IRPRLTYAALVWWPKTKQKTAQKKLEKLQRLICISITGAMRSTPTNALNAA FT LHLLPLYQYVQMEAGKSALRLRRNENLFKNCKEHLGILKELNIDPHVFMNE FT DWMEYTLNLDITYKVILTDRQVWESGGPSLPPGAIIFYTDGSKMGNKTGAG FT ITGPGLSISAPMGRWTTVFLAEIYAILECASACLKRNYRYSTICIFSDSQA FT ALNALNSSKCQSRLVWECVILLKKLGNKNRVFLYWVPGHCGVEGNEKADLL FT ARKGSDDLFVGPEPFCGVSKCVLKMEITKWENSMIQQNWIASHSAKQSKLF FT ITPSKQKTSRILGLNKRSLNIYIGLITGHCPSRYHLKKLTRSQTDICRLCD FT CEVETSRHLLCECGALIAKRIKFFNKGILTPSEIYSENPTSVVDFILEAMP FT NWGISHPQSVAVTLNGDLSP" XX SQ Sequence 2742 BP; 915 A; 497 C; 607 G; 723 T; 0 other; gaggaaaaac tggaggttct actgtgaaaa ggtagaaaaa ccccagatgc ggcaaaatta 60 cagaaagtgc tttcgaagga tcattcaaat ggtttaggaa gtttaaaaaa gaatgatgga 120 tcgttttcta agagtgatga agaaacactg gaaataatga tggaaaccca ctttccagga 180 tcaatattgg cagcaaacga agatctcgga acgaaaagcg ttgactatgg ccgtgacacg 240 accatggaaa aatcttttgt accgaaaagc atcttcacgc gggctaaagt tgcttgggca 300 gtgagttgtt ttgaaccatt caaatctcca ggcagagatg gaatatttcc cgcccttatt 360 caaaaaggag gtgggaaact gattgaatgc ttaacgaata ttttcaaatc aagtctagaa 420 ttaggacaca ttccaaagat atggagtcaa acacgcgttg tcttcatacc taaagctgga 480 aaacgagaca aaacggctcc taaatcattt aggccaataa gtctcacgtc cacaatgctg 540 aaagtaatgg aaaagataat tgatgaatat attaaatcaa agtttttgca cacgaaacct 600 ctcagtaggt ttcagtttgc ttaccagcag aacaaatcga cagttacagc attaaacgag 660 ctagtttcaa gattgggaaa gaatatgagt gcaaaagaaa ttgccctcgc tgcctttctc 720 gatattgaag gagcttttga taatgcatca tataagtcta taaaaaagaa tatggaaaag 780 cgaggctttg agcattgcat cacacgatgg gtcatggcca tgcttgatag cagagaagtg 840 ttcgctgaat taggaggatc ttctgtgact gtaaagacca caagaggctg tcctcaaggg 900 ggcgtgctgt ctccattgct gtggtcatta gttgtagacg accttctgaa gaaactatct 960 gaattggggt ttgaagtaat tgggtttgct gacgatgtgg taatactagt ccgagggaaa 1020 tttgaagaga taattaccga aagaatgcaa tatgccctca actgtacttt tgagtggtgc 1080 aaaaatgaag ggcttaacat aaaccctaag aagactgtag ttgtcccgtt tactcgtaag 1140 agaaagatcc ttcttaaaac tcttactcta gatgggtccg ctttggaatt ttctaagaat 1200 gtgaagtatc tggggatcac tctagatgca aaattaaact ggaacttaca cttagatcag 1260 gctgttcaaa aagccacaac tgcattatgg gtatctagga agacctttgg aatgaaatgg 1320 ggacttaaac cgaagatgat atcctggatc tattcggcca taataagacc cagattaact 1380 tacgctgcct tggtatggtg gccaaaaaca aaacaaaaaa cagcgcaaaa aaaactagaa 1440 aaattacagc gtttaatctg catatccatc actggagcaa tgcgaagtac accaacgaat 1500 gctctaaatg ctgctcttca tctcctcccg ttgtaccaat acgttcaaat ggaggctgga 1560 aagagtgcgc ttagactaag aagaaatgag aatttattca agaattgtaa ggaacatcta 1620 ggaattttaa aagaacttaa catagatcca catgtgttta tgaacgagga ctggatggaa 1680 tataccttga accttgatat aacatacaaa gtgattttaa ctgatcgcca agtatgggaa 1740 tcaggaggtc caagcctacc tccgggagca atcatattct acacggatgg atcaaaaatg 1800 ggtaataaaa caggagcggg gatcacgggg ccaggactaa gtatttcggc tcctatggga 1860 cgctggacta ctgtattttt ggccgagatc tatgctattt tagaatgtgc atcagcatgt 1920 ctgaagagaa actataggta ttctacaatc tgtattttct ctgatagtca agctgctttg 1980 aatgctttga attcttccaa atgccaatct agattggttt gggagtgtgt aattcttctg 2040 aaaaaactgg gtaataaaaa tcgagtattc ttgtactggg ttccaggcca ctgcggggtt 2100 gaagggaatg aaaaggccga tttgcttgcc aggaaaggct cagatgacct ttttgttggt 2160 ccggagccat tctgtggagt gtcgaaatgc gttctgaaaa tggaaattac aaaatgggaa 2220 aattccatga tacaacagaa ttggattgcc tcgcattctg caaagcaatc caagttgttt 2280 atcaccccta gcaaacagaa aacgtcgcga atacttggtc tgaataaacg gtcactaaac 2340 atatacattg gattaattac aggacattgc ccatcacggt atcacttgaa aaagctaacg 2400 cgaagtcaaa ctgatatctg taggctctgt gactgtgaag tggaaacttc aaggcactta 2460 ctgtgtgagt gtggtgcact gattgcaaaa agaattaaat ttttcaataa agggatttta 2520 actccctcag aaatttattc agaaaatccc acttcggttg ttgacttcat tctcgaagca 2580 atgccaaact ggggtatatc gcatcctcag tcagtggcgg tcaccctaaa tggtgacttg 2640 tcaccctgaa aatatgcgac agtaattagg ggtacactac aatagatcaa ctaaatggtc 2700 gcagtagtct taaatcccac aaaggaaaaa aaaaaaaaaa aa 2742 // ID Chapaev3-1_DW repbase; DNA; INV; 3148 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-1_DW is an autonomous DNA transposon - imperfect DE consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-1_DW. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3148 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 45-45 (2008). XX DR [1] (Consensus) XX CC Chapaev3-1_DW belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-1_DW is a relatively young family of fruit fly Chapaev3 CC transposons: genomic copies of Chapae3-1_DW elements are ~95% CC identical to their consensus sequence, which was derived from CC multiple alignment of a few Chapaev3-1_DW elements damaged by CC multiple deletions. The transposase-encoding region is corrupted CC by mutations accumulated in genomic copies. Chapaev3-1_DW CC contains 12-bp TIRs and ~550-bp subterminal inverted repeats. The CC genome contains numerous non-autonomous derivates of CC Chapaev3-1_DW. CC This sequence was derived from sequence data generated by J. CC Craig Venter Institute. XX SQ Sequence 3148 BP; 1082 A; 522 C; 582 G; 961 T; 1 other; cactgtgata gtcgaaaaaa aagacatggc aaaattcgat ggtttcaccc tatttcgggt 60 cctgcgaatc gaactgcatc attagttttt taagttatca cgatggtttc atacaaaatt 120 ttttcgggaa gtttttcatc aaagttgccg ttttttcgaa aagggtgttt ttttcatttg 180 cttctaactt ctaaaatatt aatttttttt acaaactttc ttcagcaacg tttcttagaa 240 ttaaaagagc tacaaatgtt gggaagacat ctagcctcta agtccgtccg ttcccgagct 300 ttgggacaaa atgtaaaaaa aatcgaatct tgacccaatt ttttttatgt tttctttctc 360 gcattttgct aatattgaaa attttccatt cgaatcatga tcataatatg agaatttatt 420 atgctcaatg gcagcagaag agaaaccaga cctatctaat ttacgattta taatataacc 480 atttaatcaa attgaatttt tttcagatat gttaacaaaa ttaaactaac tgtatttgtt 540 tagtttttga aaaatattta tacagtcggt atcataactg catattattt atttgacaac 600 aatattagtc aagggcgtag cccggtttga gcacgggggg tgggggaggg gtgcatttaa 660 atcaatttac ttaaaggtga agttgataaa actttttata ggttttcaaa tcaaaatgta 720 ttaaaaaagt taagttagga aaacaaagtg ttaccccgca cgaattaaaa tactggttac 780 gcccttgata taataaatac ttatacccca tgagctttaa ctactcaagc cttttaccgc 840 tattctactg atatcggtgg gtacttatgc gagcaccgtt aagaaattac ctctactgat 900 catgaaagca tagatttaaa tcataatttt gacttcaacg ttgcgcttag aagtgttctg 960 cattgagctt aaattaaaat gaacaaatgt ccgacaagtt ttgctgtatt ttgctacgta 1020 tgtggaaaat ttacgagccc tcgcagtaga cgcaaaatat ctcctggaac tgcagcaatt 1080 tatgaggagt actttaattt accagtgatt agagacgtaa attgggcacc aagcaatatc 1140 tgcacacaat gcaataacaa tttgcagaat tcgtcaaaaa ggttgcgtgc aaaaatgggt 1200 ttcggcattc caatgatttg gatagatcca caaggtcatc atacgggtaa ctgctatgca 1260 tgccagaata aactcgcaaa actaaagaag agttctatgc tttataaaag tgttcaatct 1320 gcacagttgc cggtacctca ctcggagaat attccaattc ctagacgtca aagcccaaca 1380 gaaagatata ttccgccaac atttttcacc gaacctgaag agctactatc aatgtatgag 1440 ccatcgcaaa tcgataaacc ttgccaacat gtcgagatat cccaagctcg tttaaatacg 1500 atggtaaggc aactgaaatt gtcgcaaagg caagcaatct gtttagcaca tcatttacga 1560 tctgttaaca ttttgtctaa ggatgtgaaa gtgtacggat atcggaagag acaagaagag 1620 cttttggact ttttcgaaat aaacggcaat aatacgtttg catactgcaa aaatatcgtt 1680 gggctaatgg aacatatgaa ttgcgagtac aaaccagagg aatggcgatt gtttattgat 1740 gcatccaaaa atagtttgaa ggcagtactg ctatatgtgg ataacacgaa aaatccagtt 1800 cccattgctt ttagttccaa cacgaaggaa accaacgatt caatgaaact aattttggat 1860 tgtgttcaat ataaaaagca tcaatggaag atatgtgctg acctaaaagt tgttgctcta 1920 ctcactggct tgcaagcggg ttatacaaaa aacatgtgct tcatttatca ttgggatacc 1980 agatatcgcg gtaaccaata tgataagcga gttggaaggc tcggacagag tacaatataa 2040 atgttgcaaa tgtaatccat accccactga taccagctaa taaaatttta ctgccaccac 2100 tgcacataaa tttgggaatt gtgaatcggg atggtaaagc gtttcaaagc ttgacacaaa 2160 tagttccaag attgagcgtg gcaaaaataa aatatgacca aaaatttcat tctaggtgta 2220 ttaaatggtc cagatataag aaagctaata aaaaatagtg aattttgcga gcttctaaga 2280 ccaaaagaaa agatggcgtg gaactgcgtt aaagctgctt gggagtgcgc cgtattgaag 2340 gctggcaagg gaatgtggat caaatgttag agtcctttaa agacatgggt gtaaatatgt 2400 ctttaaagat tcattttctc cactaccaca aggaccattt cgacggcaag ttccgacaga 2460 atctgatgaa cacggagaac gatttcatca ggttgctgca tcactagaac attggtatag 2520 cggaaaaagg cttgactcgt tgctcgcaga tttatgttgg aacttaataa atgaagctta 2580 tggtatataa tgcaaatatt tttgtcaaat aaaaaatatg cagctattat accgactgta 2640 taattattct tcaaaaacta aacaaataca gttagtttaa ttttgttaam atatctagga 2700 aaaattcaat ttaattatat ggttatattt tataaatcgt aaattagata ggtctggtgt 2760 ttcttctgct gtcattgagc ataataaatt ctcatattat gatcatgatt cgaatggaaa 2820 attttcaata ttagcaaaat gtaggaaaac ataaaaaaaa ccggacggat ttagaggcta 2880 gatgtgttcc caacatttgt agctcttatt ctaagaaacg ttgcttaaga aagtttgtaa 2940 aaaaaataat attttagaag ttagaagcaa atgaaaaaaa cacccttttc gaaaaaatgg 3000 caactttgat gaaaaacttc cggaaaaaat tttgtatgaa accatcgtga taacttaaaa 3060 aactaatgat gcagttcgat tcgcaggacc cgaaataggg tgaaaccatc gaattttgcc 3120 atgtcttttt ttttcgacta tcacagtg 3148 // ID Gypsy-33_DPu-LTR repbase; DNA; INV; 472 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_DP_; KW Gypsy-33_DPu-I; Gypsy-33_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-472 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 472 BP; 130 A; 113 C; 96 G; 132 T; 1 other; tgtcacgtag agatcacgtg acatacgcgt gagacacacc ccacactcat ctctttctta 60 gaaacaagta agggcgaatg tacgagtaca tttgacctta ctttgtctga atgtcgcgtt 120 agcaacatag gacgtttaca ccttctctag cgcttcgggg aaacaagcaa gggcgaatgt 180 actcaaacgt tcgaccttgc tttgtagtaa acggataagt gcatcttctc tttcgtagaa 240 acaagcaagg gcggatgtac tcstacaacc gtccttgctc agtaataagt cataagggtt 300 atcttgatta ttcgtagaag cagcaagggc gaaagcactc gatgattcaa ccttgctctt 360 cacctaccat aaataggcgt gtattgtctc ttgtattctc tctctcttac ttgacgtcct 420 aaagtgtgga cgtcctaata cactcgtgtt acacatcaac cccagtgtaa ca 472 // ID Gypsy-50_CQ-I repbase; DNA; INV; 10153 BP. XX AC AAWU01002291; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-50_CQ_; KW Gypsy-50_CQ-LTR; Gypsy-50_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-10153 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 479-479 (2011). XX DR Genome; AAWU01002291; Positions 30103 40255. XX CC Positions [5127-5636] - Integrase core CC 'CAGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1718..4300 FT /product="Gypsy-50_CQ-I_1p" FT /translation="MSIHFTKNNKKLNLLIDTGADISLLKASCLDTGDEIY FT LNEKVLVSGISNGILQTFGTMKSCFSFPNGEFHFKFHVMNNDIGLSKIDGI FT LGRDVMYPLKSVINMNDKTFKLKHNNDEIIFKIQERIEENTENTINENKKI FT IEEDKIDNKEGNEDIKKEENNDEFKDHKKNKNNVLNYYESDDSKLNVCDDT FT RPVSEENEIELNCHKHINMNIANEVVYPEDNFLLDDCSRTESILKNENIHS FT DENYAMRLSILKKELKIDPLLNEKEKISLKKIIVYYNDIFFVEGDKLSCAD FT VVRHSINLSSDKPIYVKQYPLPQIYRQEVNKQVSKMLNDGIIEPSTSAYNS FT PLLVVPKKSENGEKKYRVVVDFRKLNEKTVADTYPIPNISEIIDQLGNSSF FT FSSLDLASGYHQILMDEKDKEKTSFSTNFGKYQYVRMPFGLSNAPSTFMRY FT MNNIFVGLQGHKCFIYLDDIIVHGKTLEEHNKSLEEIFSVLRKHNLKLQPN FT KCHFLKSELNYLGHIISSNGILPNKENIKAIDNFPIPKTVKQLQSFLGMAN FT YYRKFVPEFAKICLPLYKILKKENNFKFDDNCLNAFKILKNKLITPPILQF FT PNYTKIFIITTDASQYALGAVLSQGDDVDLPIAYASTTLNKAEVNYSTIEK FT ELLGIIFGVRQFRTHVLGRKFIIRTDHKPLVYLFNHKNPCSRLIRWKVELM FT EYCYDITYKPGLTNVVADCLSRIPSETLIGNFKEDEFEENKIFKFNLHVHY FT TKVQKINLTNLIISKVGVITRNHCKKDFQTCIEDKSDALFAFTKVDKISES FT CVALTIKQVNKIDENIPILNLITNNLVTITVDKTYQKNPEQLFKKNSKFKR FT NSRKIKQK" FT CDS 4359..5984 FT /product="Gypsy-50_CQ-I_3p" FT /translation="MLKYIFRDTKVKLYLLNNSPEFNKVTLTSPNSIDEVP FT KESKKQETINNDRLNKLTNELTLQNFNELKNNFDINQLNYKVFQSPVTLEK FT VSNKCLKIFFKIVQIMNHSRHYKIICKNNEVEIEVMEGSQLSVEDIFEILL FT FIRNFCSTNYFKNIAIDSFPKCCFEHDSEIIKLLTKYIFNNSNISVILIQN FT TVKTLTNIDEIENILHLFHKSKLGAHTGTKRMLKRIQDLYIWKNMRKDIDT FT YVKDCVECARSKIRRKTKMPMAITTTAHKTFEKICIDIVGPLPVTEKGNKY FT ILTIQDDLSKFLFAFSIKDQEATTIAKVFVEKFICVFGVPKSVLTDNGSNF FT ISNLFKNVCKLLNIKKINISAYHPESNGALERTHRTVKEFLRAEISHKHKD FT WDTLLPFGCFAFNTTKHTSTNFSPYELVFGKKCILPQTLIREPEVYYNYED FT YLLDLKNTLRLTQNEARRQLIENKQSSKEYFDKKVNPINIEIGDKVYIDNI FT AVGVGQKLQPLRNGPYEVIGMNSNETVTLKIGNKEYKTHKNRLFKP" FT CDS 6204..7991 FT /product="Gypsy-50_CQ-I_4p" FT /translation="MWILLLLVGYTSALDLSSTKLDKSGAVLERIPNVAFV FT KQHLKMATYVDLDKLKEEMKFAQNMTVMTKDACDKMLAVNISKNCEYELSS FT LVSTWDSVSHSISIFNNKDKRSAPWLGVVGKIAKTIIGTLDADDGLFYEEQ FT FAKVKSNEDLLLKRINKHTIILDSIFDLLNKTNTPTSDFFKNFNSKLDHFE FT KSFSEFQTNVKNELKRISAELVFRDCYAITESVVLALKSKVDLLVTLLGGD FT KDHLNPFLFSPSILYNKLSEANSFIPKSSKLPLPVNWAHIFEYYRISTGSQ FT HRFGSYLIFTANIPLVHSESFNVFKLTPVPVRIENNKYSIFKFDNDILIAD FT NDMKKFSMMDTEHFEHCQHAENLTLCPHIFIMKSSPSCETEILKFSNNIEK FT ICRRSIIQIDNLLFIKLFKTNSWVVVNPQSEVAHLNCLDSSKPVQLRDSSI FT IEIKANCKLISEHINLYPAQMIESNITIYFNWNLNFSKINFNFSSDFNIDS FT IEFPKINVPIINMKDAKALANLGKDIEDLKKEKIELQNLNKQNTSDFIIQT FT SSILLIFAIFGYIIYDKCIRKKNTININTIEMQKEEIQEKTLVPFRRKK" XX SQ Sequence 10153 BP; 4182 A; 1404 C; 1456 G; 3111 T; 0 other; ctggtgtcaa agtaaaaatt tagggtgtta aagtgttaac cgaaatcaag tgataaatag 60 ttcaaaagga agagagcaag taaagttcta tcaaaagttt acaaaaggaa aaaagttaag 120 tgaacagtga aaaccaacaa aacatcggaa actgcagcaa caacaacaac aacatcaaca 180 ggttagtacc actctcaaat aattgaccaa attatttttt tttttgaatt tatgtgaatt 240 tatataaact tattgaaaaa ttgttttgaa tttaagtttt gttttgaaaa atatgatttt 300 taatgagtga tactacgcca agattactca gatctggtat agtaccaaaa aaactgtcat 360 taccaaaaag tgacgaaata aaattaaaat taaaaagccc tttagaaacc ttatttaaaa 420 ggaaagtaaa gaatcaggga actcaaacag atcctgaact agatctagaa accgatactt 480 caagtgaatc tggatcaagt gaaaaaagtt ttgtaattaa aacaccaaaa atgtcatcaa 540 ttaaactgga aagtgttcca atttttgatg gggtttattc aaaattaaaa acatttgtag 600 aactgattga cttaatttac gaacttgaag atggcaaaga taatgataaa atctttgcca 660 aagccgtttt ttatttacgt ttgtccgagt cagttcggaa caaaataaac gtaaattttg 720 aatcatggaa acagttgaaa gaattgttgg aaaatcaatt caagcatgat ccaacaatat 780 ctttcaaaca ggcaacaaaa tttaatcagt tggcaatcat ggagactgaa acactttcgc 840 aatttgtcga taggttaatc gacaatttag aagcaatcag atcgtcaagt gaaagtgaac 900 agaatttgtg gcggcttgct gaggagcaag ccatagaaaa aataaaagaa tattgtccga 960 ggaatagtcg gataatatta ggaaatccct cttctttaga agaggccaaa tttgttttag 1020 aaaccaaagg aatcggtcaa tctaaagtga caaagaattt caaaacagtt gaagaattag 1080 ctcgtttaga atggcaatca gagccagata gttctcaaaa ttgttggggt gctccaattc 1140 aacaaaattt taaaaataat tacagacctc aaggaaattt tcgatttaat aatcaaaaca 1200 attttcaagg taattccagt tactttccac gatatcaaaa tcaaagatat catgatcaaa 1260 gatatcaaaa tcatagatat caggatccag gatatcagtt tcaaagatat caaaatccta 1320 gatatgaaaa ttcaaacttt agaaataatt tcaatcataa caacaatcac aatcataata 1380 atcataataa tcaaaatttc agaccaccaa atcaaaattc ttacagaaat tacaataacc 1440 aaaaccaaaa cggcaattca tttagacaaa ataatcctaa tggtcgaaat tttcaaaata 1500 atgttatgaa cactcaaagt tttcctgcta taatgcctaa cagtgaaaat cctgagactg 1560 ttcaatttat tgacaaaaag gaatatgaag aattcttgca gttcaaattg caaaaaaact 1620 tgtaaataat agtgatttat tctgcgtcac taaaagagga attgataaaa gccagaaact 1680 caacagaata tattcattta atattgaaag ttctccaatg agcattcatt tcacaaaaaa 1740 taataaaaaa cttaatttat taattgatac aggtgcagat ataagtctgc ttaaagcaag 1800 ttgcttagac acaggtgacg aaatctattt gaatgaaaaa gttttagtat ctggaatttc 1860 taacggaatt ctccaaacat ttgggactat gaaaagttgt ttttctttcc ctaatggaga 1920 atttcatttc aaattccatg taatgaataa tgacatagga ctttctaaaa ttgatggaat 1980 tttaggaagg gatgtcatgt atccattaaa aagtgttata aatatgaatg ataaaacatt 2040 caaattaaaa cataacaatg atgaaatcat ttttaaaatt caagaaagaa tcgaagaaaa 2100 tacagaaaat actatcaatg aaaataaaaa gataattgaa gaagataaaa ttgataacaa 2160 agaaggtaat gaagatatta aaaaagaaga aaacaatgat gaattcaaag atcataaaaa 2220 aaataaaaat aatgttctca attattatga atctgacgat tccaagttaa acgtatgcga 2280 cgataccagg ccagtatccg aagaaaacga aattgaactg aattgtcata agcatattaa 2340 tatgaatata gctaatgaag ttgtctatcc ggaagacaat tttttattag atgattgttc 2400 gagaactgaa agcatattaa aaaatgaaaa tatacactca gatgaaaatt atgcaatgag 2460 actttctatc cttaaaaaag aattaaaaat cgatccacta ttgaatgaaa aagaaaaaat 2520 atcgttaaaa aagattattg tctattataa tgacatattt tttgtcgaag gagataagct 2580 atcatgtgca gatgtagtac gacatagtat aaacttgtcg tcagataaac caatttacgt 2640 aaaacaatat ccacttccac aaatttaccg tcaagaagtg aacaaacaag tttcaaaaat 2700 gctgaacgac ggaataattg aaccaagcac aagtgcttat aatagcccgt tgttggtagt 2760 tccaaagaag tcggaaaatg gagagaaaaa gtatcgcgta gttgtagatt ttcgtaaatt 2820 aaatgaaaaa acagttgcag atacttatcc aattccaaac atttcagaaa ttattgatca 2880 actaggaaat tcttcgtttt tctcatcttt agatttggca tctggttacc accagatact 2940 aatggacgag aaagataaag aaaaaaccag tttttctact aactttggaa aataccaata 3000 tgtacgaatg ccatttggtt tgagtaacgc tccgagtact tttatgcgtt atatgaacaa 3060 tatatttgtg ggtcttcaag gccacaaatg ttttatttac cttgatgata ttattgttca 3120 tggcaaaact ttagaggaac ataataaaag tttagaagaa attttctctg ttttaagaaa 3180 acataattta aaacttcagc caaacaaatg tcatttttta aaatctgaat taaattattt 3240 aggacatatt atttccagta atggtatctt accaaataaa gaaaatataa aagcaattga 3300 taattttccc attccaaaaa ctgtaaaaca attacaatca tttttgggta tggctaatta 3360 ctataggaaa tttgttccag aatttgcaaa aatttgtcta ccgctttata aaatcttaaa 3420 aaaggaaaat aattttaaat ttgacgataa ctgtttaaat gcatttaaaa tactgaaaaa 3480 taaattaatt actccaccaa ttttacaatt tccaaactat actaaaatat tcatcatcac 3540 tactgatgcg tctcagtatg cactgggagc agtgttatca caaggtgatg acgttgatct 3600 gccaattgcg tatgcaagta caaccttaaa taaggccgaa gtaaactatt ctactataga 3660 aaaagaactt ttaggcatta tttttggagt acggcaattt agaacgcacg tattaggaag 3720 aaaattcatt attagaaccg accataaacc attagtctac cttttcaatc ataaaaatcc 3780 ttgttctaga ttgataagat ggaaggtcga actgatggaa tattgctatg acataacata 3840 taaaccaggt ttaacaaacg tggtagctga ttgtctcagc aggattccat cagaaacatt 3900 aattgggaat ttcaaagaag atgaatttga agaaaataaa attttcaagt ttaacctaca 3960 tgttcactac acaaaagtac aaaaaataaa cctcacaaat ttaattatat ctaaagttgg 4020 tgtcataaca agaaatcact gtaaaaaaga ttttcaaaca tgcattgaag ataaatctga 4080 tgccctattt gcttttacca aagtggataa gatttctgaa tcttgtgttg ctttaactat 4140 taaacaagta aacaaaatag atgagaatat tccaatttta aatttgataa caaataactt 4200 ggtaactata accgttgata aaacatacca gaaaaacccg gaacaattat ttaaaaaaaa 4260 ttcaaaattt aaaagaaatt ctagaaaaat taaacaaaaa tgaattatgt tttgagaaga 4320 atgatcaaca tgtattgaaa ttttctttaa ttttgaaaat gcttaaatat atttttcgag 4380 atactaaagt taaactatat ttactaaata attcaccgga attcaataaa gtaactttaa 4440 catcccctaa ttccattgac gaagtaccaa aagaatctaa aaaacaagaa acaataaata 4500 atgacagatt aaataaatta acaaatgaat taactttgca aaattttaat gaattaaaaa 4560 ataattttga tataaatcaa ctaaactata aagtttttca aagcccagta acgctagaaa 4620 aagtttcaaa caaatgttta aaaatatttt ttaagattgt ccaaattatg aatcattcta 4680 gacattacaa aataatttgc aaaaacaatg aagtagaaat tgaagtgatg gaaggatcac 4740 aactttctgt ggaagatatt tttgaaattt tactatttat tcgtaatttt tgttcaacga 4800 actattttaa aaatattgct atagactctt tcccaaaatg ttgtttcgaa cacgattctg 4860 aaataatcaa attattaact aaatacattt ttaataacag caacattagt gttatattaa 4920 tacagaatac agtgaaaacc ttaactaaca tagatgagat agaaaatata ctccatttat 4980 ttcataaatc aaaactagga gctcatactg gtacaaagcg aatgcttaaa cgaattcaag 5040 acttatatat ttggaaaaat atgcgtaaag atattgatac atatgtgaaa gattgtgtag 5100 aatgtgcacg atcaaaaatt agaagaaaaa ctaaaatgcc aatggcgatt acgaccacgg 5160 cacataaaac atttgaaaaa atttgtattg acattgttgg acctttgcca gttacagaaa 5220 aaggcaataa atatatttta acaattcaag acgatcttag taaattttta tttgctttct 5280 ccattaaaga tcaagaagct actacgattg ctaaagtctt tgttgaaaaa tttatttgcg 5340 tgttcggagt acctaaatct gttttaacag ataatggttc gaattttata tcaaatcttt 5400 tcaaaaatgt ttgcaaactt ttaaatatta aaaaaataaa tataagtgct taccatcctg 5460 aatctaatgg ggcacttgaa agaacacatc gcactgttaa agaattttta agagctgaaa 5520 tctcccataa gcataaggat tgggatacac ttttaccgtt tggttgtttc gcttttaata 5580 caactaaaca caccagtact aatttttctc catatgaatt agtatttggg aaaaaatgca 5640 ttttacctca gacattaata agagaaccag aagtgtatta caattatgaa gattatttgc 5700 tagatttaaa aaacacatta cggctaactc aaaatgaagc cagacgccaa ttgatcgaga 5760 ataaacaatc ttcaaaagaa tattttgata agaaagttaa tccaattaat attgaaattg 5820 gggacaaagt gtatattgat aacattgcag tgggtgttgg tcagaagtta caacctttgc 5880 gtaacgggcc atatgaagtt atcggaatga actcaaatga aacggttact ttaaaaattg 5940 gaaataaaga gtacaaaact cacaaaaata gactatttaa accatagata gtggaaggtg 6000 ataaaaaaaa aaatatgtta aaacgctgtg cttacgaata agatgtaatc taaaatgtct 6060 tatccgtatg tattgcgaag tagaagtacc tgttatttat tttttttaaa ttactgatat 6120 tatttttttt ttctttctca tagcagcagt tgtatctttg tttttcttcc cttttgtttt 6180 gtgtcatatt tatttcagaa atcatgtgga ttctacttct cctagttgga tatacgtcag 6240 cattagacct tagttcaaca aaactggaca aatcaggagc agtgttggag agaattccta 6300 atgtcgcttt tgttaaacaa catctaaaaa tggccacgta tgtcgatctg gacaagctga 6360 aagaggaaat gaaatttgca caaaacatga ctgtcatgac gaaggatgct tgtgacaaaa 6420 tgttggctgt taatatatca aaaaactgtg agtatgagtt gtcatctttg gtgagtacat 6480 gggatagtgt atctcattca ataagtattt ttaacaataa agataaacgt tctgctcctt 6540 ggttgggtgt tgtcggtaaa atagcaaaaa ccataatcgg aactcttgat gctgatgatg 6600 gactgtttta cgaggaacag tttgcgaaag taaaaagcaa tgaagatctt ttattaaagc 6660 gtatcaataa acatacaata atcttagatt ccatttttga tttgttaaat aaaactaata 6720 caccaacctc tgattttttc aaaaatttta atagcaagct tgaccatttc gaaaaatcat 6780 tttctgaatt tcaaacaaat gttaaaaatg agttaaagag aattagcgca gaactcgtat 6840 ttagagattg ttacgcaatt acagaatctg ttgttttagc attaaagtca aaagtagatc 6900 ttttagttac acttcttgga ggagataaag atcatttgaa tccattttta ttttcaccga 6960 gtattttata taataaacta agtgaagcaa attcctttat tccaaaaagt tctaaactgc 7020 cattgccagt aaattgggcg cacatttttg aatactacag aatttcgaca ggatctcaac 7080 atagattcgg gtcttattta atttttacgg caaatattcc attggtacat agtgaatctt 7140 tcaacgtttt taaattaact cctgtcccag ttagaattga aaacaacaaa tattcaattt 7200 ttaagtttga caatgatatt ttgatcgcgg acaatgacat gaaaaaattt tcaatgatgg 7260 acacggaaca tttcgaacat tgccagcatg ctgagaattt aactttatgc ccacatatat 7320 ttattatgaa aagtagtccc tcttgtgaaa cggagatctt gaaattttca aataatatcg 7380 aaaaaatttg cagaagatcc attattcaaa ttgacaactt actatttata aaattattca 7440 aaacaaattc ttgggttgta gttaatccac aatcggaagt tgcgcatttg aactgcttag 7500 attccagcaa accagtccaa ttacgggatt catctatcat tgaaataaaa gccaattgta 7560 aattgatttc agagcatatt aatttatacc ctgctcagat gatagaatcc aatataacaa 7620 tttattttaa ttggaatcta aatttttcca aaattaattt caatttttct tctgatttta 7680 acattgattc aattgaattt ccaaaaatta atgtgccaat tattaacatg aaagatgcta 7740 aagcattagc aaatcttggt aaagatatcg aagatctaaa aaaagagaaa atcgaattac 7800 aaaatctgaa taaacaaaat actagcgatt ttattatcca gacttctagc attttgttaa 7860 tatttgccat ctttggctat ataatttatg ataaatgcat aagaaaaaaa aatacaatca 7920 atatcaatac aatagagatg caaaaagaag aaatacaaga gaaaacattg gtaccgttta 7980 gaagaaaaaa atagataatt gtttttgatt attatttctt gcaaaaagaa aagtacataa 8040 atatattagt taaggtaata ttaaaaagtt ttaaattttt tagacagagc tcactttaaa 8100 aaatttgaaa cttagaaata ttaaagtata ttaagtaatg gtatcaaatt gaaaaaaatg 8160 ataaattcac ttaaacaaaa aaaaaaaaaa agatcatttg gataaaaata aatagaatct 8220 gtaaaaaact gagattaaca aagaggacaa ttatttcgac tagatcaata aaaatgtttg 8280 taaaatgtat actcaaaatg attaaaaaaa aatgtattca ataaaactgg taatactatt 8340 ctcaaaacaa aacattgtaa ttaatactta gtgttaaata aacattagta tcaatgagga 8400 aaaagaagga aaagaaaatt ttcaacaaat taagatcaat gttcatcacg tggattacaa 8460 gatatatgat tttaaaatac atttattaca tatttgttac atttattaca gacaataata 8520 aaattatgaa attatatttt taacacaata tttatacatg atagtttata atcaataatg 8580 ataagataaa agcacacaaa aagaaaaaca ttcaataaat ggatttctaa ttatttgagt 8640 gattctatta aatagaaaag agaattcgaa tcaaatgtta catagatata tatatatata 8700 tatatatatt tttaatttta ttcccacaaa ctaaacataa tttcaaaaaa aaagacagga 8760 aaaaacgaat aaaacatgaa tatactttta tagaattata atctaataac ttgatatttt 8820 aaaacactat taaaaacaga ttgtaaaaaa gaagaaattc agtaaataaa ccatttatta 8880 taacagaaaa attgtttaaa aatatattaa aaataaaaca ttttttttgt attatttagc 8940 aagacgattt agtttttata attttatttc aatcaatgta actttcaaaa gtaataataa 9000 attgaagata aatatgatta tcataaaaca aaaatacagg aaattagtta gtaacataca 9060 tttataacgg taaaaaaaaa gagctgtaaa agagtttttt tgtttgttag ggggaaggga 9120 accccttcgt cgtcaaacgc atcaacagtg caacaataac acatccccgc aatcaacaaa 9180 acaataatag gaccaaggac aagaccaaca actgcaacaa aatcgtaata gtggataaaa 9240 tcatcgtcgg acgacttcgg agattcaacg cattttcaac gcctacatac agccatcctg 9300 atcaaccata aacattcaaa agtttcaaaa caaatacaat caaaacattt ttatgaataa 9360 gcacattacc agataaaaaa tacattctat agccaattta cacattacat ctaactcagc 9420 ttacgggaaa tatttatatt aaaagagaaa aaatgtataa tagaaaccac tgacaagtat 9480 aattgtttca taacttgaga atcaaaacaa tgaccatatt atttacacta tctataaatt 9540 ccataaaatg aagaaaaata agaagaagaa gaagaagaag ctgaacaaaa gaacacacac 9600 aagcatatat acatattcac taacacgaac tcaaatacaa tcaaatacaa catttatgat 9660 aatttaagga caatgcagaa taacaattag gttattaacc taattcactt taattccttt 9720 tccttagtat ccatcccgaa taaattgtgt cctactaacc ccgagtcaac cgcgggtaat 9780 cggtttccac cattaacatt aagttttagt ttagtacgta agtagtttta ggaaataaat 9840 tgaagaatgc agaaaaatgc ataagccagt aaagcgcata taagaatacc caaaaccaat 9900 ttaaactcag atgtttctct gaaaaaagtc atgtcaagat attgacccaa gtaaaagaaa 9960 attaaacttg taaatgcgta atattgaagt agaagacaaa tttcaaattt ggcatacaaa 10020 aggaatatga agacgtatta tgagataaag agcaaaacac tgtattaaga aaaatcagca 10080 aagcatagtg caaaaactaa tgtaacattt attaaagagt aaacatgcag ccttctttcg 10140 gagggggtag taa 10153 // ID BEL-647_AA-LTR repbase; DNA; INV; 732 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-647_AA_; KW Pao_Bel_Ele227; BEL-647_AA-I; BEL-647_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-732 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 732 BP; 284 A; 96 C; 175 G; 177 T; 0 other; tgacagcgac aggtgagaaa gaaagagaag gagagatgac agatgtcgta gaactgtcat 60 caggtgtaaa tgtaaataaa aacaaagaag agttagaagt aaggaaagga taaaagtata 120 gttgagcggg aaagtggaaa tagctgtaac atcaagagtt ggtttttgga gaaaagtgta 180 gagaagttct ttttatcgaa taggtcacaa ccgtggagtc gctgaagaca gtaagtttta 240 atttattaga aacagaacga gcaaaacaaa atctgcatat cactttagca tccaccgtac 300 ggatacggac gctagacgat acggacaaat tagacgagta aacgattcaa gggaccacag 360 gacggacagg taaaggacaa ttattactat acagacagac aaactagaat gtttaattta 420 taacagacaa aggacgcaga acgttgaaca caaacgacag ccggttatcg cgaatagtag 480 catcggaaac tgtaagttgt agtgattagg gtggaacaaa aatatgatac ataattataa 540 tttggtagga atttgtgcga aggacattgt tcggaaaagg agaaaggaag cctgattatg 600 aaggcaaact aaaattgtaa gtgggcttat ttatcctaaa tgttatgtcg taaatttgtg 660 cacatactaa aacgaaaatt tattatttga attgcagttt gatctgcctg aaataaaacg 720 ggctgattta ca 732 // ID Gypsy-29_DWil-LTR repbase; DNA; INV; 439 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_DWil_; KW Gypsy-29_DWil-I; Gypsy-29_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-439 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 1350222 1349784. XX SQ Sequence 439 BP; 194 A; 77 C; 60 G; 108 T; 0 other; tgaaaccaat caaagtcgca caaacaaata acataattat acaaaaagcc aaagctgaac 60 aaacaaataa acctatgcat gggaatgttg ataacgacga gatcagcata atcatgcaga 120 agtacttgaa ccaaaatcaa atttcaaaag cataagcaac agaagcgtcg ccgtcgcttc 180 cgttgcaaat gctcgatcag caatttatga aacaaacgaa atgggcattt gcatatgcca 240 aagaaaatca tattttttag tcttaagatt caattgtaac cagacagaag tcgaagttcc 300 aaactaataa agaaactcac aatatctaaa gatatccagt gaacaaaaat cgtttaattc 360 tgcaaaattt aaaatcaaaa atttaaataa ataaataaag atatctaaag atatctagtg 420 actaaaattc gttctttca 439 // ID MSAT-2_CQ repbase; DNA; INV; 142 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A satellite repetitive sequence family from Culex DE quinquefasciatus - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-142 RA Kojima K.K. and Jurka J.; RT "Satellite sequences from the southern house mosquito."; RL Repbase Reports 11(1), 614-614 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. XX SQ Sequence 142 BP; 37 A; 39 C; 32 G; 34 T; 0 other; cgacgtatca aacttcatga aattgaaagt tatgaccatc tgaggtcatg ccgatgtcct 60 ctgacccatc ctggtcactc cggaaccggt taccggtggc cactggggga cactatcgga 120 atatgcaatg aaccctatca tc 142 // ID Chapaev-3_ACa repbase; DNA; INV; 4336 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 30-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon - a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-2_ACa; KW Chapaev-3_ACa. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4336 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 780-780 (2007). XX DR [1] (Consensus) XX CC Chapaev-3_ACa is a young family of DNA transposons. The genome CC contains several copies of Chapaev-3_ACa that are less then 2% CC divergent from the consensus sequence. Chapaev-3_ACa belongs to CC the Chapaev superfamily. Hallmarks of the Chapaev transposons are CC 4-bp target-site duplications, terminal inverted repeats with the CC conserved '5-CAC and GTG-3' termini, and the Chapaev transposase. CC The Chapaev transposase is characterized by the conserved CC D-x(60-80)-D-x(220-290)-E catalytic triad. Chapaev transposons CC populate genomes of different animals, including sea urchin CC Strongylocentrotus purpuratus, amphioxus Branchiostoma floridae, CC starlet sea anemone Nematostella vectensis, sea hare mollusc CC Aplysia californica, mosquitoes Aedes aegypti and Culex pipiens, CC and nematode Caenorhabditis elegans. The N-terminal portion of CC Chapaev transposase in Chapaev-1_ACa, Chapaev-2_ACa, CC Chapaev-3_ACa, Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, CC Chapaev-2_NV, Chapaev-3_NV, and Chapaev-1_SP is similar to the CC N-terminal portion of RAG1 (100-370 aa in the human RAG1). It CC includes a novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS 545..3541 FT /product="Chapaev-3_ACap" FT /translation="MEHTPESHVSALNCLCRVCGRLNITGKQRKSFKKPYL FT CSEIASDLFLVFGLSIKDDTSDKHSKFVCLKCYKTIHDAKRRNSAVSLQTL FT RNTFDSSQHIWCAYSENCDAKTCTVCEQKRLLSLGCATVKPRDPQSHSSSH FT EKYTHQASPSTNISETDTTTQSTCTRALSTDSTQQNDPHTSTSQTFHPHPQ FT SVVPVSDQHVGVQHADTMVQDSHIVAHTDTISQSTSVTNSPIPHDSSCEHS FT VSTVRVSPSSSFTPTKGQTFSATPLASSTPAQIHETVRPTQDSTTSPMIKD FT IETISHAMSKTETHPLTKKEEQLTTRLIKRKLYSDPKGKTLKCKTKGQPLV FT FHKIVVPRKPTTSARTPTKKKRAQLVNKIRKGTSGTSGTSGTTADADSLHA FT TELGLVPRNRRQEIWKKAGTKSQITISAQQALIMTEELGLSFRKGRVHSQL FT LKDMGVRTESEASQKNLAQQLVQDFVTVKGKLFLKDNFIETETPFARIRDL FT PQFLSYLLDEYDAKDMLTWRDGIIPEEQIWVKVGGDHGKNSFKATLQVVNA FT HSPNSQHNTVVIAMASMKDTYKNIHRFLEGGLVEDIVSLQSHAWRNKTIKV FT FLNGDYDFLCKMFGLSGPQGTYPCLWCLTPRNRIHAQCEAFQLRSLDSLVA FT DNNAFLTEQGGDKKEASRYHNSLHPPLLPIDLDHVAPPYLHILLGITLKHH FT KLLEEAADKLDLKIVSQSDASATEAGAKVKKYGGQWRKVDELKAHLDILQG FT CIVLCHTDVDKAQYSQELEQTEHTLAQLSYLELTPRSGPIASSLDNILKAH FT RITPQAYHSRSFIGNHCHKYLNEVVYKHLTEEIVTQTRRLTCDPFIVDEAS FT IVQFTFDDLNKTYSDIHRAISHSRPINKDALPELQRLVDTYMATYRRTFPN FT KTIPKQHILEHHCIQFITQHSAGLGLLGEQGTESSHQTIAKLEKRACGIND FT EIEKLKFIMKKHLLQTAPSLRHTPKPKTTRTSPRTTEASTSSVY" XX SQ Sequence 4336 BP; 1396 A; 1021 C; 867 G; 1052 T; 0 other; cacggcgcac aaaatcagcc gaaatcgcgc cttttagttg taccgcggct cctgctgtcc 60 ggcggtgatt tatcacctcg cccggtaccc gttgaccttt tttattgggg tcatttgacc 120 ccgtcgcacc gttataaact ggggtgcgtt cttgataaaa tcgtctcatt tactgtctaa 180 attacagtgg tgccgtaaga cccaccaagt agttttaatt tatagacttt agcaacacca 240 tttctcaacc tttaaaagaa agtcccagtc ttctacgtgt tataccaagt gttctacacg 300 tgtctgtttg ctgcctgtgt aacacgcttt tcttatttca aactgaggat aaataatcat 360 ttattacaac aaaacggtgc attgaatttt gggggggatt cgatttgtga gttcattaat 420 aaatcatgtt ttgagtgtat aaatttcagt actaccgcag ttgtggttgt taagctacat 480 cggattatta tgttgggccc ccgaatttac taacgttatt gccgtataca acgaagacct 540 taaaatggag cacacccctg aaagtcatgt atccgcttta aactgtctgt gtagagtgtg 600 cggaagactg aacattacag gaaagcagag aaaatccttt aaaaagccgt atttgtgttc 660 ggaaatagcc agtgacctct tccttgtgtt tgggttgagt atcaaagatg acactagtga 720 caaacactcc aaatttgtgt gtttgaaatg ctataaaaca atccatgacg ctaaacgtcg 780 aaatagtgca gtgtccctac aaacactaag aaacactttt gacagcagtc aacacatctg 840 gtgtgcttac agtgaaaact gtgatgcaaa aacgtgcaca gtctgtgaac agaaaaggct 900 gctctctctt ggctgtgcaa cagtaaagcc ccgtgacccc cagtctcact ccagcagcca 960 tgaaaaatac acacaccaag ctagtcctag caccaacatt tctgagacag acacaaccac 1020 acagagtaca tgtacccgtg ctctcagcac agactcaaca caacaaaacg acccacacac 1080 aagtacaagt cagactttcc acccccaccc tcagtctgtt gtcccagtca gtgaccaaca 1140 cgtaggtgta caacatgcag acacaatggt tcaggacagt cacatcgtag ctcacaccga 1200 cacaatcagt cagagcacat cggtcacaaa ttctcccata ccgcatgata gcagttgtga 1260 gcacagcgta agcacagtcc gtgtttctcc atcgtcatca ttcacaccca ctaaaggtca 1320 aacattttct gcgactccct tggcttcatc aactccagcc cagatacatg aaacagttag 1380 gcctacccaa gacagcacaa catctcccat gatcaaagac atagaaacca tcagccatgc 1440 aatgagtaag acagagacgc atccactcac taaaaaagaa gaacaattaa caacaagact 1500 tataaagaga aagctctaca gcgatccaaa ggggaagacg ttaaaatgta agacaaaggg 1560 gcagccatta gtttttcaca aaatcgtcgt gccaaggaag ccgacaacca gtgctaggac 1620 tccaacaaag aagaaaagag cacagttggt aaataagatt aggaagggta catcaggcac 1680 atctggtaca tcaggtacaa ctgcagatgc tgactctctg catgcaacag agttaggctt 1740 ggtccctaga aatagaagac aagaaatttg gaaaaaagca gggaccaaaa gccaaattac 1800 aatatcggca cagcaggctt tgattatgac agaggaattg ggattgagtt ttaggaaagg 1860 gagagtgcac agccaactcc taaaagacat gggtgttagg actgaaagtg aagcttctca 1920 gaagaacctt gcccaacagc ttgtccaaga ttttgtaact gtaaagggaa agttgtttct 1980 taaggacaat ttcatagaaa ccgaaactcc cttcgctaga ataagagatc tcccacaatt 2040 tctaagctat ctcctcgatg agtatgatgc aaaagatatg ctcacttggc gcgatggtat 2100 cattccggaa gaacaaatat gggttaaggt tggtggggat cacggtaaaa attcgtttaa 2160 agcaactctc caagtagtaa atgcacattc accaaattct caacacaaca ctgtagtgat 2220 agcaatggca tccatgaaag acacatataa aaacattcac cgatttttgg aagggggtct 2280 agtcgaggac atcgtcagtt tgcagtccca cgcatggagg aataaaacaa tcaaagtctt 2340 cctcaatggt gactatgatt tcctctgtaa aatgtttggg ctttccggcc cacagggtac 2400 atacccatgc ctgtggtgcc taacaccaag gaacagaata catgcacagt gtgaggcgtt 2460 ccagctacgg tcactggatt cactcgtagc tgacaacaat gcatttttga ctgagcaagg 2520 tggggataaa aaggaagcaa gccgctacca caatagtttg cacccacctc ttttgcccat 2580 tgatctagat catgtggccc ctccctattt acacatcttg ttaggcataa ccttgaaaca 2640 ccacaagctg ttagaagaag cagctgacaa acttgattta aagatagtct cccagtcgga 2700 tgcctctgcc actgaagcag gagcaaaagt gaaaaaatac ggtggccagt ggaggaaagt 2760 tgacgagctt aaagctcatt tagacattct tcaaggttgt atcgttcttt gtcacacaga 2820 tgtggacaaa gcacagtact cacaggagtt agagcagaca gaacacacat tagctcaact 2880 ctcatacctt gaacttaccc ctagatctgg tcccattgct tccagcctag acaatattct 2940 caaagcacac agaatcacac cacaagcata tcacagtagg tctttcattg gaaatcattg 3000 ccacaagtac cttaacgaag tagtttacaa acatctcaca gaagaaatag tcactcaaac 3060 aagacgtctc acatgcgatc cattcatcgt tgatgaagca agcatagtgc aattcacatt 3120 tgacgatctg aacaaaacgt acagtgacat tcacagagcc atctcacaca gcagacccat 3180 caacaaagat gcactgccag aactccaaag gctggtagac acttacatgg ccacatacag 3240 gagaaccttc cccaacaaaa caatccccaa acagcacatc ttagaacacc actgcattca 3300 gttcataaca caacattccg cggggctcgg tctgctgggt gagcaaggca cagagagcag 3360 ccaccagact atagcaaaat tggagaaaag agcatgtggc atcaacgacg aaatcgagaa 3420 gctaaaattc attatgaaaa aacacctact ccaaacagcc ccgtctttgc ggcacacacc 3480 caaaccaaag acaaccagga catctccacg aaccactgaa gccagtactt cgtctgttta 3540 ttagaactag tttatatgtt tagctggggt ttaatggttc ttgcaacact ttaaggtcct 3600 acaacttata agtgccgatg atagagttac aagagagtag caaacataga aaagaaggac 3660 gatagtagtt atagtgatag aagaaacaaa ccagaacacc cgccagccac ccactacccg 3720 ggccttcaat agaaaattga attcatatag cagcactgca cataatgcgc atgtgtgtat 3780 ataaaaacat atagcctacg ccatgattcg tatggggtta ttgtcttcca agtgagggaa 3840 atatgtggat acttacgaca atgtgatgca tagatgaacc attttgaggt cacacaaact 3900 gcggtagcag tgaaatttgg tgcgtagata cccttatgtg ttacgtttgt gtcccttaaa 3960 gatgaactaa tcctgacgct cagttttcta attatcgacg ctcgttacgt cataactaga 4020 aacaataacg ggaatcagac aagaccatct tgcactgtct gttacaacta aacagaccat 4080 tactgattct ctttcttttt gaaatatcac gaaaaagact tggaactact aatgcccgca 4140 tgcagtgatg tctttaagtc aatgttattg tgttaattag ggctcaattc agaatgaaaa 4200 atacgatttt ctaaacctcc cgtgcggcca cgttgaccga actgtgccca tgcgatgacc 4260 ccgttcaccg tcacaggtca gcgcaccact ggttacctga gttctcaact aaaatatcct 4320 gtttctgtgc gccgtg 4336 // ID BEL-55_CQ-LTR repbase; DNA; INV; 305 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-55_CQ_; KW BEL-55_CQ-I; BEL-55_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-305 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 264-264 (2011). XX DR [2] (Consensus) XX SQ Sequence 305 BP; 93 A; 84 C; 72 G; 56 T; 0 other; tgttcggtac aaccgaaggc tgaaaccccg ccgcgtttcc gacccgaaga cccaccgtgt 60 tacgccaatc tgcttgaccg acgagcgaga aacgtcactc attgcgagag gccaaacaag 120 tccacgtcca accttcgatc gagaaggaca cacagcagca acaaaaacag ttttttagag 180 tagcaggagg aggaagagga aagtgggaaa aataaagtta gaaaattaag atctacggtg 240 ttttttcctc ctgcgtgcct acgctccata cagtccaaca ctcccatgtc ggccaggccc 300 gaaca 305 // ID BEL-618_AA-I repbase; DNA; INV; 5778 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-618_AA_; KW BEL-618_AA-LTR; Pao_Bel_Ele52; BEL-618_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5778 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4734-5342] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(471..1958,1962..5705) FT /product="BEL-618_AA-I_1p" FT /translation="MPDEKRLKAKELKRRNVIEALKRLDLFLSSYDPDQHQ FT HELPHRLDRLEKIWSAYEIVQDEYEEMDDSDEFVQSNLAMRGKVEELYFRV FT KAGLVSKVPAPQAPVAAAPPAPAAQSALSNVKLPTISLPEFDGDFNNWLTF FT HDTFVSMIHSSTEISHVQKFHYLRAALKGEAARLIQSITITANNYAVAWDT FT LVKRYSNKAILRKKHIRALLKHPKIPNNNVEALHKIVDEFQRHTKVLEQLG FT EPVDQFSSILIELLEDKLDDASLTAWEESIAADAHPTYANMVDFLQKRARI FT LETISINRPQIPASKPSAYLSAQKKPNQPRLSTNAATEVPSKSFPTCPACE FT KQKHSIFDCSVFNGLDPKGRMKVVTEKKLCSNCFRSDHFARNCRSKYNCKH FT CSRRHHSMIHPGPSEMEKVATEVGCENPLPGPSSVVTAVAAVPTPEAISTV FT KSSNASVLLTTVVLIVVDVYGQEHIARALLDTGSQPNAMSERLCQLLHLPR FT IVNVPIAGVDNTVTNAKHEVRAEIRSRVVNFSESLEFLVLRRVTSDIPSAS FT FSASRWAFPENLPLADPDFNTCRKVDMIIGAAHFYSFLKEGRLRLPEQGPM FT LIETVFGWIVAGKYDNPAEPSKCSAVTCHAATVAAVNEQLERFWRIEELQG FT SNYSVDEQKCEDYYRETVSRDPTGRYVVRMPKHPEHERMLGTSKVSAIKRL FT KWLEHRLAKDEGMRTQYHDFLREYISLGHMTPIQDEEECDLKVYYLPHHPV FT IKESSSTTKVRVVFDGSARTSSGFSLNDSLLVGPVVQDELLSLVLRFRKFP FT IALVADIEKMYRQVSMNPSDRPLQRILWRFDASEPIQAFELATVTYGLAPS FT SFLATRTLLQLVEDEGSSFPKASTAIKKNVYVDDLISGHNSIEEAIELRKE FT LETLLQKGGFRFRKWCSNSLPVLAGLPPELLGTQSSLKFDPEESIKTLGIR FT WEPESDVFRFDVSVVIKEAPPTKRNILSTIAQLYDPLGLIAPVVVQAKMLM FT QNLWLLALDWDDEVSPDLQHKWGQFCEQLPGLSKFRIERFAFASGFHSPEL FT HCFADASEVAYGACIYVRSVATDGHVQVTLLASKSKVAPLKPLSIPRLELC FT AALLASRLYEKIVGSLDMEFSGSFFWSDSTIVLQWMKAPPRTWKTFVANRI FT AEIQATTVGSQWQHVSGKENPADMISRGVVVEELISSTLWKYGPVWLRQDK FT STWPSQTVPDSKFSIEELELKKNVILTTQLIHPDPLFERFSSYQTLLNVVG FT FCLRFLHNCRFKNQRNSASVLSVAELQSSKTALVKLVQAEAFPDEIQRLKK FT GFMVSSKSSLRLLSPFLDAEGVIRVGGRLQLADTPYDVKHQIVLPGFHPFT FT RLLLKHFHRKLIHGGIMMTLSVVRDEFWPLNGRKAVRSAIRSCYECSRANP FT QPIQQPIGQLPIARVTANEAFACTGVDYCGPILLKPVHRKAAARKSYICVF FT VCLSTKAVHLELVGDLGTSAFLMALDRFVWRRNKPQHLYSDNGTNFIGAKN FT ALHKVYQMLQPGSDSDKINKHLAEDRIQWHLIPPRAPNFGGLWEAAVKVAK FT IHLVRQLGSSLLTFEELTTVLVKIEGCMNSRPLQPLSSDPNDLGALTPAHF FT LVRNMIRPLPEADIRTIPLNRLNQYEKLQKYSQNFWYRWRNEYLKELNQQY FT SANQKRYPMNVGDIVILKDESLAPARWPLARIIDTHPGPDGVARVATLRTA FT SGTLKRAVSKICPLECAMEQPAKY" XX SQ Sequence 5778 BP; 1453 A; 1487 C; 1375 G; 1461 T; 2 other; ttttggtgcc gtgaccagga tactggtctt cccgttcccg accgcacgac gtaacctcaa 60 tcgtttcgct gtgctgctat ccatatatat gttggctgtg tggattgctg tttgttccgg 120 ctatttcgat taaatacaag gcctcgttaa tagaggtaca aggtgagtac ctctccaacc 180 catttatcgt cggttttgcg gtacttctgg gtctgtttcg accgttttag tttgtgcgct 240 gattccaaat cccaccaaag gctccggtat tatcaatcat ccggaatttc gtcgaaagat 300 taccgctgtg catcggtgtt ccagctaatt gattttaata caaggcctcc tgagattgga 360 ggtaccaggt gagtaccatt ccaatcgtat tacgtcccgt ttttcggtac ttcctgggtc 420 ttttctgacc attgtagccc gactgcccga ctgcgttgga agttgaagct atgcccgatg 480 agaagcggtt gaaggcgaag gagctgaaga ggagaaacgt catcgaggcg ctcaagcgat 540 tggacctgtt ccttagcagc tacgaccccg atcaacatca gcatgagctg cctcatcgtt 600 tggatcgtct ggagaagatt tggagcgcgt acgaaattgt acaggacgag tacgaagaga 660 tggatgactc cgatgaattc gtccagagta atttagcgat gcgtggaaag gtcgaagagt 720 tgtatttccg tgtgaaggct ggtttggtat cgaaggtacc cgccccgcaa gctccagttg 780 ctgctgcgcc tcctgctcca gctgctcagt cggcgctttc caatgtaaaa ttgccgacga 840 tttcgctgcc ggaatttgac ggagatttca acaattggct gacgtttcac gatacgttcg 900 tatctatgat ccactcatca acggaaattt cccacgttca aaagttccat tacctccggg 960 cggctttgaa aggcgaagct gccaggctga tccagtccat cacgattacc gccaataact 1020 acgccgtcgc ttgggacacg ctggtgaagc gatattcgaa taaggctatt ctccgtaaaa 1080 agcacatacg agccctgttg aagcatccca aaatcccgaa caacaatgtc gaagccctgc 1140 ataaaattgt ggatgaattt caacggcaca cgaaggtcct cgagcaattg ggcgagccag 1200 ttgatcagtt cagctcaatc ctcatcgaat tgctggaaga caagttggac gatgcatcac 1260 tcactgcctg ggaagagtcc atagctgcgg atgctcatcc aacgtacgcc aatatggtcg 1320 atttcttgca gaaacgtgcc cgtatactcg aaaccatttc gattaatcgc ccacaaattc 1380 ccgcttctaa gccgtctgct tatctttccg ctcagaagaa gcccaatcag ccaagattga 1440 gtacgaatgc ggccactgaa gtcccgtcaa aatccttccc aacctgccct gcgtgtgaga 1500 agcagaaaca ttcaatcttc gattgctctg ttttcaacgg tctagaccct aaaggtcgaa 1560 tgaaggtggt aacggagaag aagctttgca gcaactgttt ccggagtgac cactttgcac 1620 gcaactgccg ctcgaaatac aattgcaagc actgttcaag gcgtcatcat tccatgattc 1680 atcctggacc gtctgagatg gaaaaagtag caacggaagt agggtgcgaa aaccctttgc 1740 ctggaccatc cagcgtagtg acagccgttg ccgcagtacc aacgcccgaa gccatttcaa 1800 ctgtcaagtc atccaacgct agcgtgctct tgactacagt ggtgctcata gtcgtcgacg 1860 tctacggcca agagcacata gcccgtgccc tgttagatac aggatcgcaa ccgaatgcaa 1920 tgagcgagcg cttatgccag ttgctccatc ttcctcgama aatcgttaac gtccctattg 1980 caggggttga taatactgtt accaacgcca agcacgaagt tagagcggaa atcaggtccc 2040 gagtggtcaa cttttctgaa tcccttgagt tcttggtgtt gcggagagtt accagtgata 2100 ttccgtccgc atcgttctct gcatcccggt gggcatttcc cgaaaaccta cctctagccg 2160 atcccgactt caacacttgc aggaaggttg atatgatcat cggtgctgcc cacttctatt 2220 cgtttctcaa agagggacga ttacgcctgc ccgaacaagg tccaatgctc atcgagacgg 2280 tctttggttg gattgttgcg ggaaaatacg ataatccagc tgaaccgagc aaatgttccg 2340 cagtaacttg ccatgcggca acggtcgctg ccgtaaacga gcaacttgaa cgtttttggc 2400 gcatagagga gctgcagggc tcgaattatt ctgttgatga gcagaaatgc gaggattatt 2460 atagagaaac cgtttcccgt gacccgaccg gtcgatacgt tgtacgaatg cctaaacatc 2520 ctgagcatga gcgaatgtta ggaacatcga aagtgtctgc aattaaaagg ctgaagtggt 2580 tagagcacag attggcgaag gatgaaggta tgaggactca ataccatgat ttcctgaggg 2640 aatacatctc tctgggtcac atgaccccga ttcaggatga agaagagtgt gacttgaaag 2700 tttattatct cccacaccat cccgtcatca aggagtccag ctccacaaca aaagtgagag 2760 tggtgtttga tgggtctgct agaackagtt ccggtttttc cctcaacgat tccttgctcg 2820 ttggaccagt agtgcaggac gaactactca gtctcgttct ccgattccgg aagttcccga 2880 ttgcgctggt ggcggacatc gagaaaatgt atcgccaagt ctcaatgaac ccatcagatc 2940 gtccactcca gaggatttta tggcgtttcg acgcctcaga acccattcaa gcgtttgagc 3000 tagccactgt cacttacggc ctcgccccat cgtccttcct cgccactcgg acactgctgc 3060 agctcgtcga agatgaaggt agctcattcc ccaaagcaag tacggccata aaaaagaacg 3120 tctacgtgga tgatcttatc tccggccaca acagcatcga agaagccatc gagctccgca 3180 aagagctgga gactttgcta cagaaaggcg gattccgctt ccgtaagtgg tgctccaatt 3240 cgcttccagt tttagctggt ctaccgcccg agctgctcgg gactcagtcg tctctgaaat 3300 ttgatcccga agaaagcatt aagaccctcg ggatacgttg ggaacccgag tcggacgttt 3360 ttcgcttcga cgtttctgtc gtaataaagg aagcaccgcc aaccaaacga aacatcctct 3420 ccacgatcgc tcaactttac gaccctctcg gtctaattgc tcccgtcgtc gttcaggcga 3480 aaatgcttat gcagaacctt tggctcctgg cgctggattg ggacgacgaa gtctcccctg 3540 atcttcagca caaatgggga cagttttgtg aacaactgcc cggcctctcc aaatttcgca 3600 ttgaaagatt tgccttcgct tctggatttc actctccgga gctacactgt tttgcggatg 3660 cgtccgaagt cgcatatggc gcatgtatat atgttcgctc cgtcgccact gatggacatg 3720 tgcaagtgac tcttctggct tcgaaatcaa aagtggctcc cctaaaacca ctcagtatac 3780 cgcgccttga gctgtgcgca gctctcctcg cttctcgtct ttacgaaaaa attgttggct 3840 cgctagacat ggaattcagc ggaagtttct tctggtccga ttcgacaatt gtcctccaat 3900 ggatgaaggc acccccgcga acgtggaaga cttttgtagc aaatcggatt gctgaaattc 3960 aagctacaac cgtgggatct caatggcagc atgtttcagg aaaggaaaat cccgcagata 4020 tgatatctcg tggggtagtt gtcgaagagc tcataagcag taccttgtgg aaatacggcc 4080 ccgtctggct gcgccaagat aaatctacgt ggccctcgca aaccgtccca gacagcaaat 4140 ttagcattga ggaattggag ttaaaaaaga atgtgattct aactactcag cttattcatc 4200 ccgatccgtt gtttgagcga ttttcttcgt atcaaacgtt actgaacgta gttggattct 4260 gtcttcgttt tcttcacaat tgccgtttca aaaaccaacg gaattccgca agcgtgctct 4320 ccgttgccga attgcaaagt tctaagactg ccttggtgaa gctcgtccaa gcagaagcat 4380 tccccgatga aatccagcgt ttgaagaaag gattcatggt atcaagcaaa tcttctcttc 4440 gtttattgag cccgttccta gatgccgagg gagtgatccg tgttggtggc cggttgcagc 4500 tggctgacac tccctacgat gtaaagcacc aaatcgtcct cccaggtttt catccgttca 4560 cccgactgct gctgaaacat tttcatcgca agctcatcca tggtggcata atgatgaccc 4620 tttccgttgt ccgtgatgaa ttttggccat tgaatggtcg caaggctgtt cggagtgcta 4680 ttcggagctg ttacgaatgc agtagagcga atccacagcc tattcagcaa ccgatcggac 4740 agctgcctat tgccagagtt accgcaaacg aagcatttgc ctgcactggg gtcgattact 4800 gcggtcccat tttgctgaaa cccgttcacc gcaaagctgc agctcgaaaa agttacattt 4860 gcgtttttgt atgcctaagc acaaaagccg ttcatctcga acttgtcggc gacttgggta 4920 cgtccgcctt cctgatggct ctcgaccgtt ttgtatggag aaggaacaag ccgcaacatc 4980 tgtactccga caacggtaca aatttcatcg gagcaaagaa tgctctacac aaggtgtacc 5040 aaatgttaca gccaggatcc gacagcgaca aaatcaacaa gcatcttgcc gaagaccgca 5100 tccagtggca tctgatccct ccacgagccc ccaactttgg tggcctatgg gaggccgctg 5160 taaaggtggc aaaaatccac ctggttcgtc aactcggatc atcactgtta acgtttgagg 5220 aattgacgac agtgttggtc aaaattgaag gctgtatgaa ttcccgaccg ttgcaaccgc 5280 tttccagtga tccaaacgat ctgggagctc ttactcctgc acattttctt gtgcggaaca 5340 tgattcgccc actccctgaa gccgacattc gaacgatacc cctcaacaga ctcaaccagt 5400 acgaaaagct ccagaagtac tcacaaaatt tctggtacag gtggaggaac gagtatttga 5460 aggaactgaa ccagcagtat tctgccaatc agaagcgtta ccctatgaat gttggcgaca 5520 tcgtcattct gaaggatgaa tccctcgcac ctgcacgctg gccgctcgcc cgtatcatcg 5580 atacacatcc tggccctgat ggtgtggcgc gcgtagctac actccgtaca gcttccggga 5640 ctctgaagcg agcggtttca aagatctgcc ccttggagtg cgcaatggaa caaccagcta 5700 agtattagta gtaactcaac atgaattcgt gaattacatt aaataatttg aaaatagatt 5760 ttcaaaggtg gccggtaa 5778 // ID SAT-1_NVi repbase; DNA; INV; 166 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Nasonia vitripennis satellite repeat. XX KW SAT; Satellite; Simple Repeat; SAT-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-166 RA Bao W. and Jurka J.; RT "Satellite repeats from Nasonia vitripennis."; RL Repbase Reports 9(4), 799-799 (2009). XX DR [1] (Consensus) XX SQ Sequence 166 BP; 51 A; 34 C; 33 G; 48 T; 0 other; ccgatatgaa ccgatgagcg gaacatgggt atagagggcg ctgaaagctg cccaaatcgt 60 gttttcgcac cttatatctt ctaccacggc cagaataatt gcaatatccc tgtttttctt 120 tcagaattta ttggaggaca catttaagaa taactaaaaa ttagtc 166 // ID hAT-N4_BF repbase; DNA; INV; 3492 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N4_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N4_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3492 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3492 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 914-914 (2008). XX DR [2] (Consensus) XX SQ Sequence 3492 BP; 1064 A; 719 C; 665 G; 1044 T; 0 other; caggaggagg tgggggggta tcacgtgaac cctcgtgaac cctccgggga cgggcttatc 60 caatcacgta gctggtactt tgtaaatgct ccgtaaggcg cgttatttgt tactcctcag 120 ttttgtgtat tttctcacaa ttggtcttgt tttgaaagac cagagttcta ccaaactgtt 180 acagagatat gtcttcatcg ttggagagag attcttgtga ccaggtacgt gttaacttgc 240 cgattttctc gtcgaaatgt ttgcgaccct ctgcatgctt atagcaggcc cttggagttt 300 gcccatttta cgtagttttc aggggtctag ttttcaattt tacagcgtgg ttcattgcaa 360 attttttatc tcagcgtaaa gtccgtgtct ttaacttctc aaaactcaag ttgaaccaca 420 ttgaaccacc aatagttaag ccactagaag cggtcaaact cggcgaggca agcggggccc 480 gccgtgaaat ccaagatggc ggccgggtta cgaagcaacg tgttattgtt ttcatgtacg 540 aaagtttcgg ggattccttc cttattgatc gattctatgg taacacaaaa atatcaaaat 600 ccgactggaa ccaaattggg tatcatgggg attttaggct ttatcataat tgtgaataaa 660 accggcagaa aatacaagat tttacatcgc ggcaaaatgg ggaggggggc tcaaaaattg 720 tacctatcgt agtcgccagt ttccttccga gaaaacttgt gtgaaaaatt cagatagcgt 780 tgtaaacaac aaggactatt gcttcttgga atacattggc tctgttccag atggaatgca 840 acattccatt acatgacagt acatgtacta ctactagtaa atgggagtgg acaaccacat 900 gcataaattt tgtgtagtta cattataagt tacatcaatc cacttgcatt aatttgtcaa 960 ttttttgtgt gctttaccta acagaacatg attcttcttc ctgacttagt ttgatgtcaa 1020 caaccgagca gtgcagtaac tgtaggagtc agagagtgga ggtttcaatg cagaccctgg 1080 gattggattc agatactttc atgacactag gaaatgtctg agacaaattc tacaatgctg 1140 ggttataata gatttagctc catttgtact tgtgaatgta agtttcatat gtgtcctagt 1200 gtaaacatga cagttcactg ctgtgtccca gatatctact agtaatgttc ccactcctat 1260 agtttagcat tttaccattt ccagtggtca taagttataa tatgttgact cccagtattt 1320 caaatgcaag ttatacaaat ttatattcta caatcttagg gttaggcaaa gtcattctct 1380 tgttgaaaag aatttctcca actccaagtt cttcttatat tcaagttttt attatcattg 1440 aatatcacac aagacaaaaa gtgttgtagc ttataacata agatatgttg actattagta 1500 tttcaaatac atatttcttg caacttcatg tgtccaacac tggctgtcaa caaatataac 1560 taaacgttaa agatacatga aattttttcc atcatcttct ttcatagatg tgttataatt 1620 ggctctttgc acaaagttaa aaggtaaagc aggattctgc gcagaagcgc aaactacaat 1680 ccactctaca tgtcatcttt agccgatact gcactacaag ctatatactc ccatcacaaa 1740 actgctgtct atttcacaat gtgatcaagg catacaggaa tacataatgg tgaactgttg 1800 taagaaatgg acttttggca aacctgtatc actcgatgaa tgtgaacaat atacaaaata 1860 tacatatgat ctgtatgcaa tctacagcta cagtcttcct gaagctacat tctgtacagc 1920 caaagtccat tagaacctct acacaacaat ccttgaagaa gcatataata ataacctgac 1980 ttgactatag caaagttcat agaacacaca catgatcaca acttacttca gggcaaatca 2040 cacaggtaag tacaatactg gacaattgtt cattcgtgta tttaatatct atgcttggaa 2100 tgagtattag aacctaaggg caacatgtgg acattaaaga ctattataca tattcatttc 2160 caagcaatga caccatctac tcagttctgc atagatacaa gacaacaaca tttaaaatac 2220 aggtaatctt caatgcagag actgagtgct gtgcaacatt acatcaaatg gtacataata 2280 cagtggatta atcctccttc actgtcaatt cagtcaaccc gaaaaatcct aaattgtttc 2340 agcagttttt ggtaacatgc acacaacata cttgtctaca aaaagtacaa tgtaataaag 2400 gatatacatg taccaccaag tatgaagtaa gcacatgctt agataggttt gctcatgccc 2460 agcagggctg gcaagtaaca tacatgtatc tgttttatca gtcatttcag aaagacttat 2520 tcagtgaatg tatgtgatga aataaactct tgatgttcac ctctatgttg gttgcttaaa 2580 aaaataattc tttgacaaat acaatgcatg ccatttaccc ttgtgtagca aacatgagac 2640 ttcaacagca tctttggtga tactgataac tctgtagatg aagcaaggac tggtagaagt 2700 aatgtacatt ctgtggatca taaaacaaaa aaggttttaa aatgggcatt ccatgtgcag 2760 ccggtttcaa ggtaagttta tattatattc aattcaaatg acaacgtgaa ctactcgaac 2820 tatgagtgat atccacaata gtgacacaat cgcacaaaat tacacaactg tagttacgaa 2880 ttcttatcgt aatttcgaac tgtactgtga gcactttcta tgttcgtgtg ccgatacaaa 2940 attgaacaac aacccacaag tctatgaccc ggggatatat atcgcccccc tcccccttag 3000 ctgccgagtg ccgagcgaaa aatttatgca cattcaccga cacatactag ttgcaacgaa 3060 aagaactaca gtatatcgtt acgcccccca cacatgtgca cctaacagta gttgcttgta 3120 catgtagtag gattccccat atcatctgca gtatgtcatg tagcttccga ccactgttgg 3180 tttgggagcg cgcggtggct cttttatttc cgtcggggta ctgacgttaa acgttaccca 3240 ttgcaaatga attacactgt taacaaagtc tacacatacc ttgtgttttg tggggacacg 3300 aagatgaccg gaaaatggct gttggtggcc tgaaacagca ctagaagaac acgatatcct 3360 ccggaggcag ccatttttgg cttgtttacg aagctccggg gacgtccgtc ctctgcagtg 3420 gtcacagttc gaatgaaccc gcttggccta attacccgtt acgctttgcg cggacccctc 3480 ctgccccccc tg 3492 // ID BEL-605_AA-I repbase; DNA; INV; 5967 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-605_AA_; KW BEL-605_AA-LTR; Pao_Bel_Ele189; BEL-605_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5967 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5021-5581] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 4949..5926 FT /product="BEL-605_AA-I_1p" FT /translation="MPKMQNPRCSAKSTRNGRSSGRLAAYTRPFSHVGVDY FT FGPYLITVGRRTEKRWGVLLTCLTTRVVHIEVAYSLTTSSCIIALRNFIAR FT RGTPLAFYSDRGTNFVGAQKELQKASELLEQNKIACEFTSPRTSWHFNPPS FT SPHMGGSWERLIQSVKRNLQEIQGTRRLSDEELRNALTEIEFTLNCRPLTH FT VPIDDESEPALTPNHFLLGSSDGSKPLTRNDESAATLRREYEVSQTLADYF FT WRRWLRDYLPDITRRTKWFQKVKPIEEGDIVVIADPDHPRNCWPMGRVIGT FT VNRDGQVRRATVQTSKGVYERPAVKLAVLDLASE" FT CDS join(24..977,981..4151) FT /product="BEL-605_AA-I_2p" FT /translation="MSGRSKSSARTSESIRSKQSTRNKEKPPSGSKNKFTG FT NSCHTCKSFDNSQMVQCDHCDDWHHFACVGVTQEIEKNSWCCPPCEKSKQQ FT SSMTDVESEQKAKTKTTGSTTSKKPAETISETVVKKATGAIKKVPGVSKKE FT SLVKTTKMQTRKAKARLAEQSERSKAASLVSASSIVSSRAQLEAEIKRVEA FT EQTLLEQETKRKRDLFIKKLSLLEELAETMGELEVAEEQGANEKVNSWLGS FT NNLQQEVGNGKHESNDDDSDEETDDSIDETATSDTSEEGDGDDGGSRSFVP FT LRKSTPTRQTLAKLPNADRTCLNRSQTLTQEQIAARQVVSRDLPKFGGNPE FT EWPMFISTFESTTKMCGYRDEENMIRLRNCLRDDALNAVRSFLMHPSTVMK FT AISVLKLRFGQPQIIINTLREKILTMPPVRPEAMDRLIDFALSVQNLCATI FT DACGRKEYKRDVTLLQELVGKLPPSIKLDWARYQRTHSKVNLFTFNNWVYS FT VAEDACLVSGATVYNAKPIENRGKKSNKAYVNSHIEWRSEKQNRQDTKPSN FT IKQSAAAPTEEKVPVNSSKGCPICKGWCKTLVKCERFVGLSYEARWSAVRE FT LKICRRCLRQHGGTCDRKPCGINGCTFKHHPLLHKNMSVATNIPTIHEKEP FT AVINEERSLNAHLSVKGARMFRNLPVTLYGTNGQVDCFAFLDDGSELTLID FT RGVADSLSSSGAPIPLCLKWTGGTHRVEASSRSIDIKVSGSSGKIFHLADV FT RTVEELRLPQQTLDVEQLVKSFPYLKGVPVTSYSMARPRILIGLKHANLSL FT VRRTREGSAGDPIAAKTLLGWTIYGGCAGNLARDAHHTYHICECNLRFDES FT LEKAVKQHFALESLGVSRPENPVLSTAEQRSQQLLQSLTRYVGDRYESGLL FT WRHDNVRLPDSEPMARRRFECLERRMAQDESLARSLRKKIEEYCDRGYIRK FT LTEEELAGQYERVWYLPIFPVKNLNKPNKFRIVWDAAASVHGVSLNSALLT FT GPDELVSLPSVLQTFRQFRVAVVGDIREMYHQVRMRLPDQHCQRFFFRNDK FT SEELSVYVMQVMTFGACCSPATAQYVKNRNAERFQIDYPRATDAIVNKHYV FT DDMLVSEETEEAAIDLAITVRGIHAKAGFEIRNWLSNSSKVAAALNGASTE FT EKDLNLTAELANEKVLGMWWNTTTDCFTFKLSNGRFDQSLIDGSRRPTKRE FT VLRVLMSIYDPLGLISHYMMFLKVTLQDVWRTGIHWDAAIEEKQFKNWKTW FT VKLLPNLEKLSIPRCYRQVTSACETTIVQMHTMVDASINGMAAVVYLRFEE FT KGKVECVLVTGKTRVAPLKYLSIPRLELQAAVLGCRLAESVSNNLSIKISR FT RLFWTDNRDVLC" XX SQ Sequence 5967 BP; 1745 A; 1398 C; 1475 G; 1347 T; 2 other; wctttcaaat tcgtttatac gaaatgtctg gtcgcagcaa atccagtgcc cgaacaagcg 60 aaagcattcg tagcaagcag tcaactcgta acaaagaaaa accaccttca ggtagcaaaa 120 ataagttcac cgggaatagc tgccacactt gcaaatcgtt cgataatagc cagatggttc 180 aatgcgacca ttgtgacgac tggcatcact ttgcgtgtgt tggtgttacc caagaaatcg 240 aaaaaaatag ttggtgctgc ccaccgtgtg aaaaatcaaa acaacaaagc tcgatgactg 300 atgtggaatc agaacaaaaa gcgaaaacca aaacaacagg ttcaacaaca tccaagaagc 360 cggcggaaac aataagcgag acggtggtga aaaaggctac cggagccata aaaaaggtcc 420 ctggtgtgtc caaaaaggaa tcattggtga agacaactaa gatgcagacg agaaaagcaa 480 aagcacgatt agcggagcaa tccgagaggt cgaaggccgc ctcgctagtg tcagcatctt 540 caatagtatc atcaagggcg caactggaag ccgaaattaa aagagtggag gcggagcaga 600 cactccttga gcaagaaact aagcgtaagc gggatctctt cataaaaaag ctcagtctgc 660 tggaagagtt agcggaaact atgggcgagt tggaggtcgc agaggaacaa ggtgcgaacg 720 aaaaggtcaa ttcatggctg ggcagtaata atctgcaaca ggaggtcggc aatggcaaac 780 atgagtccaa cgacgacgat tccgacgagg aaactgacga cagcattgat gaaaccgcca 840 ccagcgacac gtcagaagaa ggcgatggag acgatggtgg aagtaggtcc tttgttcccc 900 tacgaaaatc gacaccaacc agacaaacgt tggctaagct cccaaatgcg gaccgcacat 960 gtctaaaccg ttctcaaaam actctcacgc aagagcaaat agccgctcgt caggtggtct 1020 cgcgcgactt gccgaagttt ggtggcaacc cggaagagtg gccaatgttt atatccacgt 1080 tcgaaagcac cacgaagatg tgtggttatc gcgacgagga aaacatgatt cgcttgcgaa 1140 actgtcttcg agatgatgct ctcaatgccg tcagaagttt tttgatgcac ccttccacag 1200 taatgaaagc catcagtgta ctgaagttgc ggtttggtca accccaaata attataaaca 1260 cgctacggga gaaaatactc actatgccac cggttagacc cgaagccatg gaccggttaa 1320 tcgactttgc cctctccgta cagaacctat gtgcaacgat agacgcgtgt ggacggaagg 1380 aatataagcg ggacgtgacc ttattacaag aactcgtcgg aaagctgccg ccttccatca 1440 aactagactg ggcgagatac cagaggaccc actccaaggt taaccttttt acctttaaca 1500 actgggtgta ctcggtagct gaggatgctt gtttagtatc aggggcaaca gtctacaacg 1560 ctaaacctat cgagaatcgt gggaagaaaa gtaacaaagc ttacgtaaac tcacacatcg 1620 agtggcgctc ggagaagcag aatcgacaag acacgaagcc ttccaacatc aagcaatcag 1680 ccgctgcacc aacggaagag aaagtaccgg tcaactcaag caagggctgt ccaatctgca 1740 aaggatggtg caaaacttta gtcaagtgtg aacgctttgt cggattgtcc tatgaggcgc 1800 gatggtcagc cgttcgagaa ctgaaaatct gccgtagatg tctgcggcaa catggtggaa 1860 cttgtgaccg gaaaccttgt ggtatcaatg gctgtacctt taagcaccac cctctactcc 1920 ataagaatat gtcagttgct acgaatattc ccaccattca cgaaaaggaa ccagcagtca 1980 tcaatgaaga acgcagcttg aacgcccact tatcggtgaa aggagcccga atgttccgaa 2040 atttgcccgt tacattgtac ggtaccaatg gccaagttga ctgtttcgcc tttctcgacg 2100 acggatcgga attaactctg atcgaccgag gagttgcaga tagtttaagc tcgtcgggag 2160 caccgatacc actttgccta aagtggactg gaggaacgca tcgcgttgaa gcatcgtctc 2220 gcagcattga tataaaggtc agcggcagct ctggtaaaat atttcacctg gcagatgttc 2280 gaactgtaga agaactacgc ctcccgcaac agactttgga cgtagagcaa ctggtaaaat 2340 ctttccctta tctcaaaggt gtgcccgtta cttcatattc gatggcgcgc ccacgcattc 2400 ttatcggcct caagcacgct aacttatctc ttgtacgacg aactcgcgaa ggtagtgcag 2460 gcgatccaat agccgcaaaa actttgttag ggtggaccat atacggtggt tgtgcaggta 2520 accttgcgcg agatgcgcac cacacctatc acatatgcga atgtaacctt cgattcgacg 2580 aatctttaga aaaagctgtc aagcaacact ttgccctgga aagtctagga gtttcgcgtc 2640 cagaaaatcc agtgttatct acagcagagc agagatctca acaacttctg cagtcattaa 2700 caagatacgt gggagatcga tacgagtctg gtctactctg gcgacatgat aacgtgcggt 2760 tgccggatag tgaacctatg gctcgacgca gatttgaatg cctggagaga cggatggcac 2820 aagacgaaag cctcgcccgt tccctgagaa aaaaaattga agagtattgt gacagaggct 2880 atattcgcaa gctgacagag gaagagcttg ctggtcaata tgagcgtgta tggtacctcc 2940 caatttttcc ggtcaagaat ctgaataagc cgaacaaatt cagaatcgtg tgggacgctg 3000 ccgcatccgt tcatggagtc tccctcaatt cggccctgct cacaggacct gatgaactgg 3060 tttccttacc atccgtgttg caaacattca ggcaatttag agtggcagta gtcggtgata 3120 tccgggagat gtatcaccag gtgagaatgc gtcttccgga tcaacactgt caacgattct 3180 tcttccgcaa cgataagagc gaggagttaa gcgtttatgt catgcaagtg atgacttttg 3240 gcgcatgctg ttccccggca accgcgcaat acgtcaaaaa tagaaacgcg gagaggttcc 3300 aaatcgacta cccacgagca accgatgcca tcgtaaacaa acactacgtg gatgacatgt 3360 tggtaagtga ggagacggag gaagctgcta ttgatctggc tatcacagtt cgcgggatac 3420 atgccaaggc cggttttgaa atcagaaact ggctgtcgaa ctcgagcaag gtagcagcag 3480 ccttgaatgg agcttcaacg gaagaaaagg acctcaacct aactgctgaa ttggcaaacg 3540 aaaaggtcct ggggatgtgg tggaacacca caacggattg tttcacattc aagctctcca 3600 atggacgttt tgatcaatct ctcatcgacg gaagtcgtag accgaccaaa cgcgaagtac 3660 ttcgcgtttt aatgagtatc tacgaccctt tgggactgat ctctcactat atgatgtttc 3720 tcaaagttac gctacaggac gtttggagga ctggtatcca ttgggacgca gctatcgagg 3780 agaagcaatt caaaaattgg aagacttggg taaagctttt acccaacctg gaaaagctga 3840 gcattcctag atgctatcga caggtaacct cagcttgcga gacaactatc gtgcaaatgc 3900 atactatggt agatgccagt atcaacggaa tggcagccgt agtatacttg cggttcgaag 3960 agaagggaaa agttgagtgc gtgttggtta ctggtaaaac tcgagtggct ccacttaaat 4020 atttatcaat accgaggctt gaacttcaag ctgccgtgct gggctgtaga ttagccgaaa 4080 gcgtctccaa caacctctcc atcaaaatct ctaggcgtct attttggaca gacaatcgcg 4140 atgtactttg ctagttgcga tccgaacata ggcgctacag ccccttcgta gctgcacgcg 4200 taggagagat attagacaca actacagtcg acaattggag atatgttccg tcaggtaaaa 4260 atgctgccga tgatgggacc aaatggagtg gtcaaccaga cttaagttca acaagcagat 4320 gggttaaagg accgaaattt ctatgggaat ccgaggacag ctggcccagg atgcctttca 4380 taaactcaga aacggagact gaactacgat ccagtgttat gattcacctt gaagctccat 4440 tacctgtcat cgtagctgcg tcgttctcta gttggagaag aatgatcctg gtgacttctt 4500 acgtgcaacg gttcattgca aacactcgtc gtcgtcttca caagcagaat atacaagtgg 4560 gtccagtgac catcgaggag atgcacgccg ctgagaggta ccacttccgc actgcccaaa 4620 atgaagcgta ctcgcacgaa atttccatct tgagggcatc gagttcctca aagcagatcc 4680 ccaagtcaag tccactatcg aagcttagtc cattcctaga tatcaacaat gttcttcgaa 4740 tgagggggcg agtaagcgct tgcctctatg tgaacgaaga aacagcgaac cctattattc 4800 tacctgccga tcatgctgtg acgtctctta tgcttaattc gtaccaccag aggtttcatc 4860 accggaactt tgaaacggta gttaacgaag ttaggcagat gttccgaata cctaagctac 4920 gaagacttat atcttcttta cgacgcaaat gcccaaaatg caaaatccga gatgttcggc 4980 caaatccacc agaaatggca gatcttctgg gcgacttgca gcgtatactc ggccattctc 5040 acatgtaggc gtggactatt tcggcccata cctcatcact gtaggacgtc gaaccgaaaa 5100 gcgctggggg gtcctattga cctgcttgac gacgagggtg gtccacatag aggttgctta 5160 ttctttaacg actagctcct gtattatagc gcttcgaaat tttatcgcgc gacgaggaac 5220 gccgcttgcc ttttacagcg acagaggaac gaatttcgtc ggcgctcaaa aagaactaca 5280 gaaggcctcg gaattgttgg agcagaacaa gatagcgtgt gagtttacaa gtccccgaac 5340 ttcatggcac tttaaccctc ccagctctcc acatatgggt ggaagctggg aaagattaat 5400 tcaatctgtc aagcgaaact tgcaggaaat tcaaggcaca cgcagactca gtgacgagga 5460 acttcgtaac gcgttaactg aaattgagtt cactctcaac tgtcgtccat tgacgcatgt 5520 accgatagac gacgagtcag agccggcgtt gaccccgaat cattttcttt tggggtcgtc 5580 ggatggtagt aaaccgctta cccggaacga cgaaagtgca gcgacgttgc gacgtgagta 5640 tgaagtatca caaacattag ctgattactt ttggcgaaga tggctgcgcg attatcttcc 5700 tgacataacg cgacgaacca aatggttcca gaaggtaaag ccaatcgaag aaggggacat 5760 cgtcgttatt gctgaccctg atcatcccag aaactgttgg ccgatgggcc gcgttatcgg 5820 aacagttaat cgagacggac aggtacgacg agcgactgtg caaacatcaa aaggagttta 5880 tgaacgacca gccgtgaaac tagcggttct cgaccttgcc agcgagtgat tgtaacccaa 5940 atctgtgttt cggattaccg gggggac 5967 // ID Mariner-2_DYa repbase; DNA; INV; 1630 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.06, Created) DT 15-MAY-2009 (Rel. 14.06, Last updated, Version 2) XX DE Mariner-type sequence: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-2_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-1630 RA Jurka J.; RT "DNA transposon families from fruit fly."; RL Repbase Reports 9(6), 1153-1153 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 384..1388 FT /product="Mariner-2_DYa_1p" FT /translation="MKKLPINVEDSVINLTENGFSTRIIAGKLNISQSAAI FT RVQKRRNAVPKIQRRSPRRLLTDADARLMMQNIKKDNNCRPKDASAAIGKP FT VSQWTACRALKRIGFVSELKKKKPALSDKNIKARLKFAKAHKNWQVDDWKK FT VIWSDESKFNRYQSDGKQYCWKKPGESLQRRHLQETVKQGGGNVMVWSCFT FT WWSIGPIKKIEGTMKKEDYLDILQTHLCDFVDKCVYNESHITFQQDGDPKH FT TSKIVKEWIGNQDFKLMEWPAQSPDLNPIENLWSIVKRRLGQYESVAANQH FT ELWQRIQQEWKAIPKEIVQNLVESMPNRIKSVIRNKGLWTKY*" XX SQ Sequence 1630 BP; 606 A; 253 C; 318 G; 453 T; 0 other; cagtggcgtg caagtatgag tacatgtttg caacttttga ctgttcgcct aaataaaaaa 60 ggtattagtt aagcaaatga acccaaattt ttactgttga ttctttgatt atttatttac 120 aaattcgtaa aaaaaaaata acaataatgt aattcttttc gttaggaaaa aaataaaatt 180 taaattaact cgcatgtact caattatgct caatttgaat ttttcaacaa aagttggaca 240 gctcattcaa ctgaaagttt aattttagaa tttaccgttt gtttttaaga aagtttaata 300 taaaagtata aatataaaag ctttcaacca aaaaggtacg aatctccacg tgtttttggc 360 tcttaactat taaaacaaat aatatgaaga aattgccaat aaatgttgaa gattcagtga 420 taaaccttac tgaaaatgga ttttcgacgc gtataatagc tgggaagtta aatataagcc 480 aatcagcagc cattcgtgtt caaaaaaggc gaaacgcggt acccaaaatt caaagaagaa 540 gtccacgtcg attgttgacc gatgcagatg cgcggctaat gatgcaaaat attaaaaagg 600 ataacaattg tagacccaag gatgcgtctg cagctatagg caagcctgtc agccagtgga 660 cagcgtgtcg agcacttaag cgtattggat tcgtttccga attaaaaaaa aagaaaccgg 720 ccttgtccga taaaaatata aaagcacggc taaaatttgc gaaggcacat aaaaattggc 780 aagtggacga ctggaaaaaa gttatatggt ctgatgaatc aaaattcaat cgataccagt 840 cggacggaaa acaatattgt tggaaaaagc ctggagagtc gctgcaaaga cggcacctac 900 aggaaacagt aaaacaagga ggaggaaatg tcatggtctg gagctgtttt acctggtgga 960 gcattggacc aataaagaaa atagaaggaa ctatgaaaaa agaggactat ttagatattc 1020 tgcagaccca tctttgcgat tttgttgaca aatgcgttta taatgagagt catatcacat 1080 tccagcaaga tggagatccg aaacacacgt ccaaaatcgt gaaggaatgg atcggaaatc 1140 aagattttaa gttaatggag tggccagctc agagtcctga tctcaatcca atcgaaaatt 1200 tgtggtcgat tgtgaaaaga cgattgggac aatacgaatc ggttgcagca aaccaacacg 1260 agttgtggca acgaatccag caagaatgga aagctattcc gaaggaaatt gttcaaaatc 1320 tggtggaaag tatgccaaat cgaataaaaa gcgtcatcag aaataagggt ctttggacta 1380 agtactaaaa gataagcttt tgttgaaaaa ttcaaattga gcataattga gtacatgcga 1440 gttaatttaa attttatttt tttcctaacg aaaagaatta cattattgtt attttttttt 1500 tacgaatttg taaataaata atcaaagaat caacagtaaa aatttgggtt catttgctta 1560 actaatacct tttttattta ggcgaacagt caaaacgttg caaacatgta ctcatacttg 1620 cacgccactg 1630 // ID Gypsy8-LTR_Dpse repbase; DNA; INV; 1488 BP. XX AC Unknown_singleton_95; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8_Dpse; KW Gypsy8-I_Dpse; Gypsy8-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1488 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1070-1070 (2009). XX DR Genome; Unknown_singleton_95; Positions 30878 29391. XX SQ Sequence 1488 BP; 441 A; 381 C; 486 G; 180 T; 0 other; tgaagaggca ggcctcctgc ccggagcccc aggcaagacc caagcggccg gccctgcagc 60 ggcagagctc agcaccggca gccacgggcc actggaaggc gatcccgcgg gaggagtggc 120 cggcggcagt gagccgcgca caggcccaga tacgcctcgt gggtagacgt gtgaaaaaat 180 tggtaagcga gcgaggtacg cgttaccgca tcgccatgtc ggccacgcgc acggaggtat 240 tccgagagca aaaataaagg aagacggaga agagaagaaa aagggggggg ggcagcagaa 300 acacaaaaaa aaagaagggc agaacacgcg ggagaatatg aataaagaaa aaacaaagaa 360 aagaagaaaa cggaatatcg actagggggg gaagtatcga gtggatcgca gaagcgcccc 420 tacaccttat cgataggcgg gagtcggcgc cagccagaaa agtgcaaggg ctgacagcgc 480 ccccgagcca aggcgccaga gggcgccaga acgccggcga tcccaggtac aacgaaatgc 540 gcgcgcaact tcgggccgtt tgaatgagga aagcccaccc catcccgaga tcgcggaaaa 600 cggaagtatt ccgctacggg aagaaagagg ggggtgtgag gaaccatggt tcccacaggc 660 cgagtgcgac caaaagcaca gtcggataaa agctgacggc cggccggata aggcgaaggt 720 caggccagat gcccccggtt atccggacgc ggcgaagagg ccaagggcct cgagagggga 780 agtgccctgc tctcggagag agagacacaa gagaggcaaa ggaaacggcg ggatcgggcc 840 agcagtaatt tggcgcatga ccaaataagg gcacgggatc acgccgccgg ccaggccccg 900 ttacgaaggc cgtggaaacc aggagacggc gatggaaaac taccgacgcc gagccggggc 960 ctggcaaggg gctataaaag gaggcggcgg cgtcaaagcg ggtctctttt cccgcggata 1020 agtaagcgcg cgcgcacgtt gagacccggt cgaaacgcaa gagtgatact cggtacgaaa 1080 caaagaagtg tagtgaagtg tagcagcagg cagacaggcc ccgcctgccc ccccagagta 1140 agtctaagtt aagtgaagat ccccgtacgc cggcggaagc taggagggga gtgcgagacg 1200 tggagtgcgg attcgagtgg cggaccaaag caatagcgga ggtcaacttg ccccaatcaa 1260 tagcctcgac gctgccacac ccgacgagag cccccgcgtg gtggagatcg gtatttaaga 1320 gaaactgtaa cacctagccc taagtcctaa tagaattgcg aataaagcga aaatcaaact 1380 aaaaggaaag tcttatcatt tgtcgttggg ggcaacaata aaaatacgtt gctgcgaatc 1440 tgccgaccgc tgagcaagcc cagcaaactc aagcgagccc aaaaagca 1488 // ID L1_Ele13 repbase; DNA; INV; 4522 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele13. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4522 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4522 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 7 CC sequences with >97% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 148..945 FT /product="L1_Ele13_1p" FT /translation="MSRRENTFRIDYSSVPKKPSYEDLHRFIANDLGLKKE FT EVLRIQCSRSAGCAFVKVCSLDVAQKVVSDHDNVHEVIVDDKPYKLRIRLE FT DGAVEVKLFDLSEDVSKQKIEEFLSAYGEVLKASEQIWESKYEFSGVPTGL FT WVVRMMVKVNIPSYVTIDGEVTYVQYFGQHQTCRHCEDFVHNGASCVQNKK FT LLLQKLSADKSNKQSYAKVAKQKPSTGAIVRPRSTNTTPSAVARQWTCPIS FT TAAICSRQDVPGPIYLTSQSSPLSP" FT CDS 1278..4460 FT /product="L1_Ele13_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDHASYNIATININNISSPTKIDALRTFLRTMELDII FT FLQEVENDQLTLPGYNVICNIDHMRRGTAIALKEYIQFSNVERSLDGRLIA FT LRINNVTLCNVYAHSGTAFRAERERFFNSTISYYLRHNTPNVILAGDFNCV FT LRQCDATGSNDSPALKNTTQQLQLCDAWIKLCPRSSGHTYVTANSSSRLDR FT FYVSRGLQQHLRNSATHVCSFSDHQAVTLRICLPSLGRAPGRGFWTLRPHL FT LTQENIEELRYRWQFWTRERRNYASWMAWWLLCAKRKLKSFFRWKSKIAYD FT EFHCEHQRLYSQLRRAYDDFYGQPALLTTIHLLKAQMLALQRKFSKMFTRI FT NESFIAGEDISSFHLGERTRKRTTITRLQSVDGQILHSSEDIEQHLYQYYS FT SLYTEEAVVRNGLNNFFQSERAIPINDATNEACMSAICTEEIITAIKTSAS FT KKSPGPDGIPKEFYLRTFDVIHREINLILNEALSGQFPAEFVDGVIVLVKK FT KGGDETVRSYRPISLLNFDYKLLSRILKSRLETIMRTHDILSGAQKCANSS FT RNIFQATLSLKDRLAQVIKRKQRAKLISFDLDSAFDRVRRAFLHQTMHSLG FT FNGHLVDLLAQIADRASSRLLVNGHLSPPIDIQRSVRQGCPLSMHLFVIYL FT HPLVKELERICEEDLIVAYADDITIVCSSTQRVEMLRETFTRFELASGAKL FT NWRKTKAIDIGLIERDPLVVPWLQTGNTIKVLGVMFTNSVRLMVKLNWDAA FT INNFMHQIRCHSMRCLNLHQRVIIANIFIMSKIWYLSSILSPYAVHIAKIT FT STLGRFIWSGIPTRIPMTQLSRDRKDGGLKLQLPAMKCKALMVNRHVQDLE FT CLPFYKTFVESSDPTQTIPADLPDLKVLCQQYQQLPIQIVQNPSADQIHRM FT YIDQTDKPKVEQEHPEVDWRRIWSSLHWRSFTAAERSNLFMLVNEKGDYRK FT LLYTMRRVDGENCVYCNEPIETLQHKYSECQRVNAAWEFLQRRITAHLGGW FT RRITFNELIRPSLDGIERNRRIHILKLFIHYISFVNNVNNVVDVNSLDFYL FT NVNC" XX SQ Sequence 4522 BP; 1366 A; 1091 C; 932 G; 1133 T; 0 other; gtcagttatc gctcaacctc tgagccgaac agtcgtttgt tctaacgcta ttgtttcgct 60 acgagagtcg atcgcaagct atcctgattt cgtttcgcta tcgcctggcg agtttcgcct 120 tttttatctt cacgcgcgac aaccgcgatg agtcggcgcg agaacacgtt tcgaattgat 180 tattcgagtg tgcccaagaa accatcttac gaagacctac accgtttcat tgccaatgac 240 ttaggtttga agaaagaaga agttctgcgc atacaatgca gcagaagcgc tggctgtgct 300 tttgtgaaag tatgcagtct tgatgtggct caaaaagtgg tgtctgacca tgacaacgta 360 catgaagtca tagtagacga caaaccgtat aagctacgaa ttcgtctaga agacggagcc 420 gtcgaagtga aacttttcga tctttcagaa gacgtgtcta aacaaaaaat cgaagaattc 480 ctcagtgcat atggagaagt gttaaaagcc tccgaacaga tatgggagag caagtatgag 540 ttcagtggag ttcccaccgg cttatgggtt gtccgaatga tggtcaaagt gaacattcct 600 tcctacgtaa ccatcgatgg tgaagttaca tatgtccaat attttgggca gcatcaaacc 660 tgccgccatt gcgaagattt tgtacacaat ggagcatcct gtgtgcaaaa taagaaactt 720 cttcttcaaa aactgtccgc ggacaagtct aacaagcaat cgtatgcgaa agttgctaaa 780 caaaaaccct caaccggtgc tattgtgcgt ccgagatcta caaatacaac tccatcagct 840 gtggccaggc aatggacatg ccctatatca actgcagcga tttgcagtag acaggatgtc 900 ccgggcccga tctatctgac aagccaatcg agccctctgt caccatagaa gatggtgtct 960 ccatcatgtt ctgtggctgt ggctaacaca tcaagtcaat cgtcctccgc ttcgaattcc 1020 aacaccacga aatcactggc tccaccgact gctgtaaagc aattggttgt ggaactgcct 1080 ccactgcccg tcggcaaatt caaaaaaccg acattgcaac aacagccaat cgacaacgac 1140 gaaaacctaa ccagaagcaa tcgttcagat gaagaagaaa ccgatgattc ctcgagctca 1200 aaaccccgcg gccgtccacc tggtaaaaaa gctcgcacac atgacggaga aggttcgaga 1260 gaaaggaaac accgctaatg gatcacgcca gctacaacat cgccactatt aatataaaca 1320 acatttctag cccaacaaaa atcgatgcac ttcggacttt tcttcgcacc atggaactgg 1380 acataatctt tcttcaagag gttgaaaatg accaactcac tttgccgggc tataatgtca 1440 tctgcaacat cgaccacatg cggagaggca cagcaatagc tcttaaagag tatattcagt 1500 tctctaatgt ggagagaagt ttagatggcc gactgatagc acttcgaata aacaacgtca 1560 ctctgtgcaa cgtctacgca cactccggga cagccttcag agctgaacga gagaggtttt 1620 tcaatagcac gatttcgtat tacttacgtc acaatactcc gaacgttatt ctggccggag 1680 acttcaactg tgtcctccgg caatgcgacg caacaggcag caatgatagt cccgctctca 1740 aaaatacaac tcaacaactg cagctctgtg atgcttggat aaaactatgt ccccgctcta 1800 gtggccatac gtatgtaaca gcaaactctt cctctcggct cgatcgtttc tacgtgagtc 1860 gtggtctcca acagcacctt cggaactcag caactcacgt ttgctctttc tccgatcacc 1920 aagcagtcac tcttcgtatc tgcctcccgt ccctcggcag agctcccgga cgcggtttct 1980 ggaccctccg ccctcatcta cttacacaag aaaacattga agagttgcgc tatcgctggc 2040 aattctggac ccgggaacga cggaattatg catcctggat ggcatggtgg cttctctgtg 2100 ccaaacgaaa actcaagtcc ttcttccgct ggaaatcaaa aatagcgtac gacgaatttc 2160 attgcgaaca tcaaaggctc tacagtcaac tgcgtcgagc atacgatgat ttctatggac 2220 aaccggcact cctcaccacc atacacttgt taaaagcaca aatgcttgcg cttcagcgca 2280 aattctccaa gatgttcaca cgtatcaacg aatcgtttat cgccggggaa gacatctcat 2340 cttttcatct cggagaaaga acacgtaaac gcacgactat tacccggcta caaagcgtgg 2400 acggccaaat actacactca tcagaagata tagaacaaca tctctaccag tactattcat 2460 cgctatatac tgaagaggct gtagttagga acggcctaaa caactttttc caaagtgaga 2520 gggcaatccc aataaacgac gcaacaaatg aagcctgtat gagtgctata tgtacagaag 2580 agatcatcac agcaataaaa acaagtgctt cgaaaaaatc gccgggcccg gacgggatcc 2640 cgaaggaatt ttatctgaga acctttgacg tcatacaccg agagatcaac ttgatactca 2700 acgaagcact gtccggtcaa ttcccagctg aatttgtaga tggggtgatc gtgctggtca 2760 agaagaaagg aggagacgaa acggttcggt cgtatagacc gatctcacta ctgaacttcg 2820 attacaaact tctctctcga atcctaaaat cccgattaga gacaatcatg agaactcacg 2880 atatactctc cggcgcacag aaatgtgcta attcatcacg aaatatcttc caagcaactc 2940 tttcgttaaa agatcggttg gctcaggtga taaaacgtaa acagagagcc aaactaattt 3000 cattcgatct cgattcggcc ttcgatcgag ttcgtcgtgc cttcttacat caaaccatgc 3060 actcacttgg atttaacgga caccttgtgg atctcctcgc tcaaattgct gatcgggcat 3120 catctcgact attggttaac ggccatctat cgccacccat tgatatacaa cgctcagtgc 3180 gacaaggctg tccgctgagc atgcatctat tcgttatata tctacacccg ctcgtgaaag 3240 agctggagag gatatgtgag gaagatctca tcgttgcata tgccgacgat ataaccatcg 3300 tttgctcgtc tacccagaga gtagagatgt tgagagaaac tttcactcgt tttgagctgg 3360 cgtcaggagc aaaactgaac tggcgaaaaa ccaaagcgat cgacattggt ctcatcgaaa 3420 gagatccctt agtcgtacca tggctacaaa cgggcaacac catcaaagtt ctcggcgtga 3480 tgtttaccaa ttctgtacgt ttgatggtaa aactcaactg ggatgcagcg attaataatt 3540 ttatgcatca aataagatgc cactctatgc gatgcttgaa tttacaccag cgggtgatta 3600 ttgcgaacat attcattatg tctaaaatat ggtacctatc ttctattctc tcaccatatg 3660 cagtacatat agcaaaaatt acatctacgc taggtagatt tatttggagt gggattccca 3720 cacgcatccc tatgacacaa ctttcacggg accgtaaaga cgggggactg aagcttcaat 3780 taccggctat gaagtgtaaa gcacttatgg tgaaccgaca cgtacaagat ctggaatgtt 3840 tgccgtttta caaaactttt gttgaatcaa gtgatccaac ccaaactatt ccagctgatc 3900 tgcccgattt gaaagtatta tgtcaacaat atcaacaact tcctatacaa attgttcaaa 3960 atccatcggc cgaccaaatc catcgcatgt acatcgacca gactgacaaa cctaaagttg 4020 aacaggagca tcctgaagtt gactggagaa ggatctggtc aagtctccac tggcgaagct 4080 tcacagcagc agaacgatcc aatctattca tgcttgtcaa cgagaagggc gactatcgaa 4140 agcttcttta caccatgcgt cgtgtggatg gtgaaaattg tgtgtattgc aacgaaccaa 4200 ttgaaacgct gcaacacaaa tacagtgagt gtcaacgagt caacgcagct tgggaattcc 4260 ttcaacgtag aataacagct catctaggcg gatggagacg aatcactttc aatgaattaa 4320 ttcgaccatc attagatggt atagagagaa atcgtagaat acatatccta aaactattca 4380 ttcattacat ctcctttgtt aataacgtta ataatgttgt ggatgtaaac tctttggatt 4440 tctatttgaa tgtgaattgc tgatatgttg taatcaaatt ttaaacaaat cacgtcaata 4500 aatattttac aaaaaaaaaa aa 4522 // ID TTAAC_AP repbase; DNA; INV; 438 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAAC_AP. XX NM TTAAC_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-438 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2099-2099 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 438 BP; 153 A; 66 C; 63 G; 156 T; 0 other; gaggatgtca gcgcactatt tgttttctct ctctgaccca cgcgcaacat agcaaatttt 60 acgttcacaa taatgattat aatcatatta atttctcaaa ttagagtgaa attacctctt 120 ataaaattta aaggtaagat tattatctag ggcatctcat aggcgtttta ttatatttta 180 attttaaagc gagttatgag tattttaaaa ttgtaaattg tttgtacatc ttaaaatact 240 cataactcgc tttaaaatta aaatataata aaacgcctat gagatgccct agataataat 300 cttaccttta aattttataa gaggtcattt cactctaatt tgagaaatta atatgattat 360 aatcattatt gtgaacgtaa aatttgctat gtcgcgcgtg ggtcagagag agaaaacaaa 420 tactgcgctg acatcctc 438 // ID Cre-1_MB repbase; DNA; INV; 5033 BP. XX AC . XX DT 21-JUL-2009 (Rel. 14.07, Created) DT 21-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Cre-1_MB non-LTR retrotransposon - consensus. XX KW CRE; Non-LTR Retrotransposon; Transposable Element; Cre-1_MB. XX OS Monosiga brevicollis OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. XX RN [1] RP 1-5033 RA Kapitonov V.V. and Jurka J.; RT "A family of choanoflagellate CRE non-LTR retrotransposons."; RL Repbase Reports 9(7), 1330-1330 (2009). XX DR [1] (Consensus) XX CC Cre-1_MB belongs to the CRE clade of non-LTR retrotransposons. CC The genome contains 50-100 copies of Cre-1_MB that are ~97% CC identical to the consensus sequence. XX FH Key Location/Qualifiers FT CDS 123..4505 FT /product="Cre-1_MB_1p" FT /note="contains the RT and REL domains." FT /translation="MATESGGEDSWTQVRGAKRPSAESPPSNTTTSPSQTH FT RSAKHTKHGSARHDRNHVFPDPMTTPLRPHARHSVPTARASSHVPSTSPAA FT GATESSARAVVPAAEPVTRTSNGGGEQHPIIGNTSNASPRTPRTPSSPRSF FT AQVAAAMPAAATATSSAPMTEDLSASVPSEPNGSGEQQPSPESTGQTHHSI FT PNTPSDFLTMSSDESDSPPRSTALRAPTPIAPPAHDGDGDTNGSATPEPLV FT QSPTPAQMVLPYPSGTQQTHSDPSPPSASPPATTILPAAISHPVEHSEHAN FT SAPLGEVSESETHNTAGEHSESEQDVLLSDPAPPIAANVLDAQRKVLLKTS FT GHRQLLACPFGLCKCKGPRLDRKAWVNHVLREHPYDEQATDLVKQVMEAKL FT VAQCNKCHLFFEAAGISQHRSRCGANLKRATEALFHAAGHDLLEIMRGAWP FT QQCVGSRISVCELLKLARHPLMQRSRYPSNATETKLMAATLSQLYWSAVHS FT DYTAEEREMCWALILALPSMLLSAPSTALSTIDLRNMFHDRLRWLVTGQLG FT RVVDAMRKAVARKQSRRGQLNAGAGAHPNDAVDQSLRSLVRDPDLADEAWA FT NHVTNRLNRGQIAKAFDADKARAVIGNSEVQAVRDLLVPPGLTPYIASTPA FT STSTLAPATAVSSPTVSFTKGELPKALAATKGVTDPYGWSGELLASIYRIK FT EHFSQVLGPRQGSTSDPTAPSDGDAPQGPTTATGGPQVALNKIFHHIANNT FT VPESIRHALCSINYTILEKANGKFRPVGTDSIFNKVVNRALLEQQQPHIAH FT LLQASPELAVGVKDGISAAVGMAFGELQACESTPGWTMLSLDFKSAFNYTD FT RARLHEIVADKVPGLLRAFERHYARPTTHCIVDKFFKVIDIDVGQGIVQGN FT ELSPFFFALYSCEVLGLLDATTDYRCKVIKYLDDIVLMGPAEDVAADVEIV FT KARAESAGLHLQPSKSRFYMPRHHSASITAIKSVLPDAVRETANTGMTVLG FT TPIGRREWMKKQLNDKAKHIAGKLNDMLTTGVSLQALLTAMQYVPSLINHL FT YTLPPSLTSGLSELLNRACKDTFVKAFFAKVNLSAPAGAEGHDVTLEQLLE FT ARLFTRANTGGFGLHDLVERGPVAYVCNMAKLATRYPRVYDRLLEDASRAA FT DFEAHVQRAGFQMATVKDAATQRPAEIIALRSKAALDDLMAKCALDLQQAY FT LASREWGVSTVLTMRGRDKLRRLSDTTFAIAVVSMMGFGLHELINVKPTDK FT CPLCSSKTPQPRLTREHLLTCRPIKRHNALRDEMGRLLRYATLSHVWVEKS FT GYNANGQSCRIDLHCRNPFPGGALGPALPDLGIDVTVRTAQPPTTSQACIK FT VGAALRRAEKEKRDYYTGFNHGKTLIVPAAMTTTGGFASSFVDLLGQLARC FT AEARGVYQPGLDEAFVPRWKGRFAALVHQMNADHIQRHFGGVCLRSS" XX SQ Sequence 5033 BP; 935 A; 1620 C; 1340 G; 1138 T; 0 other; catcttggcg tgaaccacgt tgtcagacaa aatctgcaac cccgctcttt gcggcccgcg 60 ttttggcggc gccctcgctc ccaccgtgtc cgctcgcttg ctcgcttgct tgccccgcgg 120 acatggccac tgagtccggc ggcgaggatt cttggaccca ggtccgcggt gctaaacgcc 180 cgagtgccga atcacctcca agcaacacca ccacctcgcc ttcccaaact catcgttctg 240 caaaacacac aaaacatggc agcgctcgcc acgaccgtaa ccatgttttc cctgacccca 300 tgaccacccc gcttcgccct cacgcccgcc actctgtccc taccgcccgt gcctcgtctc 360 atgtgccctc gacgtccccc gctgccggtg cgaccgagtc ttcggcacgt gccgtcgtgc 420 ccgcggccga gccagtgacc aggacgtcaa acggcggcgg ggagcaacat cccatcatcg 480 gaaacacttc caatgcttct ccccgcaccc ctcgcacgcc atccagccct cgctcctttg 540 ctcaagttgc tgcggcaatg cctgccgccg ccactgccac atcttcggcc cctatgaccg 600 aggatttgtc agcatcggtg ccctctgagc caaatggcag cggggagcaa caaccctcgc 660 ccgagtccac agggcagaca catcattcga ttcctaacac accatcggat tttttgacca 720 tgtcttcgga tgaaagcgac tcccctcctc gctccaccgc actccgcgcg cccaccccta 780 tcgcccctcc cgcgcatgat ggtgacggtg acacaaacgg cagtgccacg cctgagccat 840 tggtgcaatc acctacaccc gctcaaatgg tgctgccata tccatcgggt acacaacaaa 900 cccattccga tccctctccg ccctctgctt caccccctgc cactaccatt ttgcccgctg 960 ccatttcaca tcctgtcgaa cacagtgagc atgcaaactc agccccactt ggcgaagtca 1020 gtgagagtga aacacacaat acagcgggcg aacacagtga gagtgagcaa gatgttcttc 1080 tcagcgatcc cgctccgccc atcgctgcca acgtgctgga tgcccagcgc aaggtcctgc 1140 tgaagacatc tggccacagg caactcctcg cctgcccatt tgggctttgc aaatgcaagg 1200 ggccccgcct tgaccgcaaa gcctgggtca atcatgtact acgcgagcac ccctacgacg 1260 agcaagccac tgatctggtc aagcaggtga tggaggccaa attggtcgcc cagtgcaaca 1320 agtgtcacct cttcttcgaa gctgctggta tcagtcaaca ccgctcccga tgtggtgcca 1380 atctgaagcg agcgaccgag gcgttgtttc atgcggctgg acacgacctg cttgagatta 1440 tgcgtggcgc ttggccccaa cagtgtgtag ggtctcgcat cagtgtctgc gagctgctca 1500 agctcgcccg gcatccactg atgcagcgca gccgctaccc atccaacgcc accgagacca 1560 agctgatggc tgccaccctg agccagctgt attggtctgc cgtccactcg gactataccg 1620 ctgaagagcg agagatgtgc tgggctttga ttttggcctt gcctagcatg ttgttgtctg 1680 ctccctcgac cgcactgtct acgattgacc tgcgcaatat gtttcacgat cgtctccgtt 1740 ggcttgtgac ggggcaacta ggtcgggtcg tggacgccat gcgcaaggca gtcgcacgca 1800 agcagagccg tcgaggacag ctgaacgccg gcgcgggcgc ccacccgaac gacgcagtcg 1860 accagagcct caggtcgctc gtccgcgacc cggacctggc ggacgaagcc tgggcaaacc 1920 acgtcacgaa ccgtctgaac cgaggtcaga ttgccaaagc atttgatgcc gacaaggctc 1980 gtgccgtgat tggtaattct gaggttcagg ccgtgcgcga cctcttggta ccgcccgggc 2040 tgaccccgta cattgcttcg acacccgcct ccacgtctac actggcacca gccacggctg 2100 tgagctcccc aaccgtgtcc ttcaccaagg gtgagctccc caaggcgttg gcggccacca 2160 agggtgtcac cgacccctat ggttggtctg gtgagcttct tgcctccatc taccgcatca 2220 aggagcactt cagtcaagtc ttgggcccac gccagggttc taccagcgac ccgactgctc 2280 cttctgatgg agacgcgcct cagggcccca ccaccgccac tggaggtcct caggttgcct 2340 tgaacaagat ctttcaccac attgccaaca acaccgtgcc cgagtcgatt cgacatgccc 2400 tttgctccat caactacact atcctggaga aggccaatgg caagtttcga cccgtgggca 2460 cggattccat cttcaacaag gttgtcaacc gcgctctgct cgagcaacag cagccccata 2520 ttgcccactt gctacaggcc agtccagagc tggccgtcgg agtcaaggac ggcatttcag 2580 cagcggttgg catggccttt ggtgagcttc aagcctgtga gtctaccccg ggctggacca 2640 tgctctccct cgatttcaag agtgccttca actacaccga ccgagcacgg ctgcacgaga 2700 ttgtggccga caaggtccct ggcctcttgc gcgcctttga acgacactat gctcgaccca 2760 ctacccactg cattgtcgac aagttcttca aggtgattga cattgatgtt ggccaaggca 2820 ttgtgcaggg caacgagcta tcgcccttct tctttgccct gtactcctgt gaggtcctgg 2880 gtctcctcga cgccaccact gactaccgct gcaaggtcat caagtacctc gacgacattg 2940 tactgatggg tcccgcggag gacgtggcgg ccgacgtgga gattgtcaag gctcgtgcag 3000 agtctgctgg ccttcatctg cagcccagta agagccgctt ctacatgcct cgccaccatt 3060 cggcttccat cactgctatc aagtctgtat tgccagatgc cgtgcgcgag acggccaaca 3120 cgggcatgac ggtcttggga acgccgattg gccgtcgcga gtggatgaag aagcagctga 3180 acgacaaggc aaagcacatt gctggcaagc tcaatgacat gctgacgacc ggtgtctcgc 3240 ttcaggccct cctcacggcc atgcagtacg tgcctagcct catcaaccac ctctacacgc 3300 tgcccccaag tctcacgtcg ggcttgtccg agctcttgaa ccgtgcttgc aaggacacct 3360 ttgtcaaagc cttttttgcc aaggtaaacc tgtctgcacc ggctggagct gaaggtcatg 3420 acgttacgct ggaacagctc cttgaggctc gcctcttcac acgggccaac accgggggct 3480 ttggcctgca cgacttggtt gagcgcggtc cggtggctta tgtctgcaac atggccaagc 3540 tggccactcg ctaccctcgg gtctacgatc gacttttgga ggatgcatcg agggctgccg 3600 actttgaggc ccacgtgcag cgagctggct tccagatggc cacggtcaag gacgcggcga 3660 cccagcgacc agctgagatc attgccctcc gctccaaggc ggcactggac gacctgatgg 3720 ccaagtgcgc gctggacctg cagcaggcat atctggcctc acgcgagtgg ggcgtcagca 3780 ctgtcttgac catgcggggt cgggacaagt tgcgtcgctt gagcgacacg acctttgcca 3840 ttgcggtcgt gtccatgatg ggttttggcc tccatgaact catcaacgtc aagccgacgg 3900 acaagtgccc gctctgcagc agcaagacac ctcagccgcg actgacccgc gagcacctgc 3960 tgacctgccg tcccatcaag cgtcacaacg cccttcgcga cgagatgggc cgcctgctca 4020 ggtacgccac cctctcccat gtctgggtgg aaaagtctgg ctacaacgcc aacggtcaga 4080 gctgccgcat cgacctgcac tgccgcaacc cctttcccgg cggtgctctg ggcccagctc 4140 tgcccgacct gggcattgac gtgactgtgc gcacagccca acccccgacc acctcgcaag 4200 cctgcatcaa ggtgggcgct gcccttcgcc gagccgaaaa ggagaagcgc gactactaca 4260 ccggtttcaa ccatggaaaa actctgatcg tccctgcggc gatgacgaca accggtgggt 4320 tcgcctcctc ctttgtggat ctgcttggtc agctcgcccg ctgcgccgag gcccgtggtg 4380 tgtaccagcc ggggctggat gaggcctttg ttcctcggtg gaagggtcgc tttgcggcgc 4440 tggtccatca gatgaacgct gaccacatcc agcgccactt tggcggtgtc tgcctgcgct 4500 cgtcgtaggt aggcaccgtc tcgggggtcc ctctgtgggg atccctgtgt gcacctgtcg 4560 ctccctaggt ggttcctcgt tgtgtctttt gatggcttga cttgtatttt tgttttaatt 4620 ttgctttaat ttttgctgta tttgtgtggt atttttgctg aatttttgta aggtcctttg 4680 tatgatgtct ttgtcttctg tggtcggttg ttttcctcaa tccgacgttg tgtctcgttg 4740 gatgtgagcg tgccgtggtg ttctttgtgt ttgtgctgtg atggcttgta gttgtgatgt 4800 gtgactgcct ttttgggtgt cttgtgtttg aaatggccgt atctctggtt atacttggtc 4860 gttttgtacg atttttgttt ctatgtgcgt gattcttcgc gcttgtactt cttggcatga 4920 tagaagccaa tgaatgtgtc ttgttctctt gtgttgtttt gcgtgccgtc gtgattttga 4980 tgtcggggtt gcacagcttt gctttcagct ctgaggttca aacacctaat tta 5033 // ID RTE-11_BF repbase; DNA; INV; 3598 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-11_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-11_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3598 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3598 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1709-1709 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 498..3578 FT /product="RTE-11_BF_1p" FT /translation="ELRPQYLFSNRKVMMANQYNTRVEVPNRTLPAELDSQ FT ESQDGLPPNLQGRGRLDYSSSRGNRMVNCRSITTISTLNVRTIREAQRREE FT LSTNLALYDIDALGIQEHRIAHEEEVRYESIHGNMLITTSASRNRSGATVG FT GVGILLSPKAYNSLAKVTPFNERILIANFQGNPATTIIVTYCPTNVVDEET FT TNEHYDSLRRAIQSIPAHNLLMVVGDFNARIGSEEAKYTFHSETNRNGKLL FT LELSKEQGLIISSTRFQKKAGKLWTYMSPGGNRYQLDYILIRKKWQNSMLN FT VEAYNTFASVGSDHRIVSARVRLSLRKKKSLPRRKQYDWGALRRSEVLQQQ FT YAIEVRNRFQPLQEENETATERYGRFIAANEGAAEKTIPIKKREKTAKYSS FT DPRVSQARDEVNRRYEVFVENTQEKNRQELKRARRRLEETYTRVAEEDLLQ FT KIQEIERTHENSKHGQSWKLINEITNREATKKSQLEGNSQEERVQSWYNHF FT KNLLGNPPEITCEEEEIPTILGDLGIKEGPFDIEEYEMARKSLVEGKSSGE FT DGIPPEVLKRCKDLDEIILDFCNKALLEGEKPEQWSTLNIVPIPKAGNLRL FT TENYRGISLSSIVAKTYNRMILNRIRPVLDKHLRRNQNGFRVGRSTVGHIL FT AIRRLIEGIKSYNLTAIITFIDFKKAFDTIHRGKMLKILRAYGIPDQLVEA FT IASMYKDTKAKVISPDGETELFELYAGVLQGDTLAPYLFVIVLDYALRTAV FT EGREEELGLQVERRRSRRIGPKVVTDMDFADDIALLSEAIQQAQELLGCVE FT SSVGKVGLRMNAGKTKYMTFNIPTPYCLKTGDGCTLEEVKDFKYLGSRMES FT SAKDITARKAAAWLACNKLKKIWRSNLSRELKVRLFLSTVESVLLYGCESW FT TLTTKLEKQLDGCYTRLLRTALNVHWSQHITNKDLYAGIPKLSDKIRTRRL FT KFAGHCLRSKEIVADLVLWTPRHGKRRPGRPTATYLDMLRKDTGLGVDNLH FT QAMTDRDIWNAIIARDQKDSS" XX SQ Sequence 3598 BP; 1170 A; 729 C; 933 G; 766 T; 0 other; cagcggggga gtcccaaatg gctagccttt taccattgct gaatagtgta gatagggttt 60 gaagttattt tatgttcctt ttttggtttt tgtttcatag ttaggtaaat tccaccacaa 120 acatctgcta cagtgtgcat accccccccc cacccccatt tgttattatg agttccatga 180 ctttttcttt tgtttgccct ctgttatttc gactaaatac tacaacgtga ccctagacca 240 agtaacatcc agttgaggta gatcactacc ttaggcgtag tagagtgcaa ctctttgata 300 cggcaagagg gcgcatgtca acatggacaa tggcggccag tgatcagaaa cagggagtcc 360 tgggacctgt gccgatctgc atcgcgggtt gaggataagg atgagacggc aaagggcgga 420 ggtaggtgct atgcagctaa cctacaacgg ccactggact atgtgaagga aaacacagaa 480 ggaaaaccat gctgtaggag ctgcgaccac agtatttatt tagcaacaga aaagttatga 540 tggcaaacca atacaacact agggttgaag tccctaatcg aaccctgcct gcggagctag 600 attcccagga atcacaggat ggcttacctc caaatctgca aggaaggggt aggcttgatt 660 atagctccag cagagggaat aggatggtga actgtagatc cataacaaca atatcaacac 720 tgaatgtaag gacaatcagg gaggcacaac gcagggagga actgtcgact aacttggctt 780 tatatgacat tgatgcactg ggaatacagg aacatcgtat agcacatgag gaagaggtcc 840 ggtatgaaag catacatggc aacatgctga taactacatc tgcaagtaga aatcggtctg 900 gtgcaactgt tggaggagtg ggtatactgc taagtcccaa agcgtacaac tcactggcaa 960 aggtgactcc attcaacgag cgcatactga ttgcaaactt ccagggaaat ccagctacaa 1020 caataatagt tacgtattgt cccaccaatg tcgtagatga agaaactaca aacgagcact 1080 atgacagcct taggagagca atacagtcta tcccagcaca caacctgcta atggttgtgg 1140 gagactttaa tgctaggata ggatctgagg aggcaaagta tactttccac agcgaaacga 1200 acagaaatgg taagctacta ctagagttgt caaaagagca aggtcttatc atctccagca 1260 cgcgtttcca gaagaaagct gggaaattgt ggacttacat gagcccaggt ggaaacagat 1320 accagttgga ttacatactg atcagaaaga agtggcaaaa cagcatgcta aacgtggagg 1380 cgtataacac attcgctagt gtgggctcgg accacagaat agtgtcagct cgagtaaggc 1440 tcagtctgag gaagaagaaa agtttaccga ggaggaagca gtatgactgg ggagctctta 1500 ggaggagtga agttctgcag cagcaatatg ccatagaagt acgcaacaga tttcaacccc 1560 ttcaagaaga aaatgaaacc gcaacagagc gatatgggag gtttatagca gcaaatgagg 1620 gagctgcgga gaaaacaatc ccaattaaga aacgggaaaa gacagcaaaa tactccagtg 1680 atccaagagt gtcacaggct agggatgagg tcaatagaag gtatgaagtc tttgtggaga 1740 acacacaaga gaagaacagg caagagctta aaagagccag aaggagactt gaggaaacat 1800 atacaagggt cgcagaggaa gatctcttac aaaagatcca ggagatagaa aggacacacg 1860 aaaacagcaa acatgggcag agctggaagc tgataaatga gataacaaac agggaagcga 1920 caaagaaaag tcagctggaa ggaaactcac aggaagaacg tgtacagagc tggtacaatc 1980 actttaaaaa cctccttggt aatcccccag aaataacatg cgaggaagag gaaattccaa 2040 ctatcttggg agatctcggt ataaaggaag gcccttttga cattgaagaa tatgaaatgg 2100 caaggaaatc actggtggaa gggaaaagta gtggtgagga tggaatacca cctgaggtgt 2160 tgaaaagatg caaggacctc gacgagatta tccttgattt ctgtaacaaa gcactgctgg 2220 aaggggaaaa gccagagcaa tggagcactc taaatattgt acccataccc aaagctggga 2280 acctcagact cacagagaac taccgcggca ttagcttgag ctctattgtt gccaagacat 2340 acaacagaat gatcctcaac agaataagac cagtactgga taaacacctg aggcggaacc 2400 aaaacggatt cagggtgggg aggtctacgg tagggcacat tcttgcaatt cgtaggctta 2460 tcgaaggcat caagtcatat aatttaactg caataattac atttatagac ttcaagaaag 2520 ccttcgatac tattcacaga ggcaagatgc tgaagattct cagggcatat ggtatacctg 2580 atcagctagt ggaagccata gccagcatgt acaaggacac aaaggcaaaa gtcatctccc 2640 ctgatggtga aaccgagctt tttgagttgt atgctggggt gctgcaaggg gatacactcg 2700 ctccatatct gtttgtgatt gtccttgact acgcactaag gacggctgtg gagggtaggg 2760 aggaagaact tggtctgcaa gtagagagga ggagaagcag aaggattggc ccaaaggtag 2820 tcactgacat ggatttcgct gatgacatag cgctgctatc agaggcaatc cagcaagcac 2880 aggaactgct tgggtgtgtg gagagttctg tgggcaaggt tggactccgc atgaatgctg 2940 ggaaaactaa gtacatgact ttcaacatac cgaccccata ctgcctcaaa acaggtgatg 3000 ggtgtaccct agaggaggta aaggatttta aatacctggg gtccagaatg gagagttcag 3060 caaaggacat aacagcaaga aaagcagcag cctggcttgc ctgcaacaag ttaaagaaga 3120 tctggagatc aaacctctca cgtgaactga aagtgaggct cttcctctcg actgtcgaat 3180 cggtactcct gtatgggtgc gagtcgtgga cactcaccac aaagctggaa aagcagttgg 3240 acggatgtta tacgaggttg ctgagaactg ctctcaacgt ccactggtca cagcatatca 3300 ctaacaagga cttgtatgct ggcataccaa agctgtccga caagattcga actaggagat 3360 taaagtttgc gggacactgc cttaggagca aggaaatagt tgctgacctg gtactctgga 3420 cgcccaggca tggtaagagg aggccaggta ggccgacagc aacatactta gacatgctca 3480 ggaaagacac tggccttgga gttgacaacc tacatcaagc tatgactgac agagacatat 3540 ggaatgctat catcgctcga gaccagaagg actcgtctta agcaagtaag taagcaag 3598 // ID Gypsy22-LTR_Dpse repbase; DNA; INV; 355 BP. XX AC Unknown_group_236; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy22_Dpse; KW Gypsy22-I_Dpse; Gypsy22-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-355 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1123-1123 (2009). XX DR Genome; Unknown_group_236; Positions 7390 7744. XX SQ Sequence 355 BP; 157 A; 9 C; 105 G; 84 T; 0 other; tgacatttgc aaggagaagg agagtgagac aaaaaacgag tgtgaaataa aaaatgagtg 60 tttaggagaa aatgagcgtt tagaagaaaa tgagtgttta ggagaaaatg agtgtttaga 120 aaaaaatgag tgtttagaag aaaatgagtg tttaggagaa aatgagtgtt tagaaaaaaa 180 tgagtgttta gaagaaaatt agtgtttagg agaaaatgag tgtttagaaa aaaatgagtg 240 gaaaatagag aatgagtgtt taggagaata tgaatgtgaa gtaaaagatg agggtgataa 300 agagaataag catgaagtag aaagtggagg caaggtagaa aatgagtgtt ctgca 355 // ID Mariner-5_HM repbase; DNA; INV; 2314 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2314 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 222-222 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 221..2104 FT /product="Mariner-5_HM_1p" FT /translation="MSKRKQTNLFNFGFTKKICHRGELVNVVPKEKEDNLF FT ECIVCKKSLKTSQALSMHEFWCKNKLSHTDIAQTDKDPKSMSLDQIIDGKN FT KYDNIEKTVKSVLEKLVTDVEANELPKIADKRKSYTLKEKVDALNDLNDGM FT LSRDVAKKYNVHKSMITRWKSKEKEIYNKYNEKKKNALFRKCRRGDKHKAL FT FNKLYDKFSEARKKGLKVNFEWLYVRANQIHLQQSPNADRLPKSIVTHFLR FT KNKIRMLCVQRKRQADVKKQTPILMKWHSNLREKLIKSGSGKPTYDKKWGR FT FPPKKRINVDQIPLPFVVESKRTYEIPMSKGKERRDHRVWVAQPGSGLDKR FT QATLQICFSPEGMVKPALIFRGKGKRISQDEVLAYDTDVDVYWQPNAWADT FT AFSVKWVKNTLSNAVANLDEYVLFCDNLTAQVSDEFLNEVRQHKGIVWFGV FT PNGTHFWQPVDAGPGKAFKSHIKAEQNIWLENDENIELWLGNDDRKLSAKD FT RRILITRWVGEAYRKTSKDEKFLRMLYRSFAKTGCLITADGSEDDKINPEG FT MPAYVVPPPVDIDNLVEITETPIPDEVKDNPIDVLPDESDGEDEELDGQVE FT KSDGQEDEESGDEENKDPAEERNICDLFFN" XX SQ Sequence 2314 BP; 832 A; 382 C; 503 G; 597 T; 0 other; cacgattttt tttgtataag caactcgttt ataagcaact gaactcgatg tgaaaaaata 60 ttaagcaact tcagttctga acaaatgtta acataagcaa ctcataaatt gcaatttttt 120 tctctgcaat gtaatgactt cttttagata aacgatttca aaagaaagta atcacagtag 180 tgttgttgtt gtgtatgttg ttgacaacgc tgttaagatc atgagtaaga gaaagcagac 240 aaatttgttc aactttggtt ttaccaaaaa gatttgtcat cgtggtgagt tggtaaatgt 300 agtgccgaag gaaaaagaag ataatttgtt tgaatgtatt gtttgcaaaa aatcactgaa 360 aacatcacaa gccctttcga tgcacgagtt ttggtgtaaa aataagttat cacatacgga 420 tattgctcaa actgacaaag acccgaaatc aatgagcttg gatcaaatca ttgatggaaa 480 gaacaagtac gataatattg aaaaaactgt gaaatcggtt ttagagaagc ttgttactga 540 tgttgaagct aacgaattac caaaaattgc tgacaaacgg aagagctata cgttaaaaga 600 aaaggttgat gcgctaaatg acttaaacga tggtatgtta agtagggatg tagcaaaaaa 660 gtataacgtt cataagagta tgattacgcg atggaaatca aaggaaaaag aaatctacaa 720 caagtacaac gaaaaaaaga aaaacgcgtt attcaggaaa tgccgaagag gcgacaaaca 780 caaagctttg tttaacaaac tttatgacaa attctccgaa gcaagaaaaa aggggctaaa 840 agttaacttt gagtggttgt atgtccgagc gaatcaaatt catctacaac aaagcccgaa 900 tgctgacaga ttgccaaaat caatagtcac tcatttctta aggaaaaata agatccgcat 960 gctgtgtgtt caacgtaagc gccaagctga cgtaaagaag caaaccccga ttctgatgaa 1020 atggcattca aatttgagag aaaagttgat taagtctgga agtggcaaac ccacatacga 1080 taaaaaatgg gggcgtttcc caccgaaaaa gagaatcaat gtagatcaaa ttccactccc 1140 gtttgtagtc gagagcaaac gaacatacga aattccaatg agtaagggaa aggaacgaag 1200 ggaccataga gtatgggtag ctcaacctgg cagtgggctt gataagcgac aagctaccct 1260 tcaaatatgt ttcagtcctg aaggaatggt gaagccggcg ttgattttca gaggtaaggg 1320 gaaaaggatt agtcaagatg aagttttagc ctatgacact gatgtagatg tgtactggca 1380 accaaatgca tgggcagata cagcgttttc cgtgaaatgg gtgaagaaca ctctctccaa 1440 tgctgtcgct aacctggatg agtatgtttt gttctgcgat aacctgacag cccaagtcag 1500 tgatgagttt ttgaatgaag tgcgacaaca taaaggtatt gtctggtttg gtgttcccaa 1560 cgggacacat ttttggcaac ctgttgatgc cggaccagga aaagcattca aatcacatat 1620 taaagcagaa caaaacatct ggctcgaaaa tgatgaaaac atcgaactct ggcttggcaa 1680 cgatgatcgg aaactgtcgg caaaagatcg ccgcattttg ataactcgtt gggtcggtga 1740 agcttaccgt aaaacatcca aagacgagaa atttcttcgt atgctttata ggagctttgc 1800 gaaaacagga tgtcttatca cggctgatgg tagcgaggac gataagataa atcctgaagg 1860 catgcctgct tatgttgttc caccccctgt tgacattgac aacctggtgg aaatcacaga 1920 gactcctatt ccagacgaag taaaggataa ccctattgat gttttgcctg acgaatcaga 1980 cggagaagac gaggaactag atggacaagt agaaaaatca gacggacaag aagacgaaga 2040 aagtggagat gaagaaaata aagatccagc agaagagaga aacatttgtg atcttttttt 2100 taattagaat ttaagtgatt ttagttgtaa agagcactct tctgttaaag acttgatatt 2160 aaaccgtatt ttgtctctta ttcgtaaaaa cactgaaatt taggctaaga aacttaagca 2220 accattaagc aactcgagat atagaaattg gctgaaaaat aagcaactcg agttctgacc 2280 aaaattctaa gttgcttata caaaaaaaat cgtg 2314 // ID hAT-N1_CQ repbase; DNA; INV; 1286 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous hAT DNA transposon from Culex quinquefasciatus - DE consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1286 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 98-98 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >92% identity. CC 8-bp TSD. XX SQ Sequence 1286 BP; 447 A; 202 C; 169 G; 467 T; 1 other; caggggtgac caaagtatgg cccgcgggcc aaacgtggcc cgcgaggtga tattttgtgg 60 cccgcggacc cattttgaat gatcatgtaa aatggcccgt tgaccacttg taaagtgatt 120 ttatactttt tcaaaattaa ggtttatttt aaacttttat atgctaatct tttttatttt 180 ttgttaataa aaaaacttat katttatcaa tattttgatc cccactttag gatacaaata 240 atttgttcaa aatattttat aattaaaaca atttcacacg tttcaatgtg agtgaaacta 300 gtaaatatcg tcaaaacttt catcaaacat ttttagaaat attaaaaaac gttcaagttt 360 gacttaggta gaattctgta atagttgcaa aaatataagt tttctttatt aatttgcaag 420 tcataatgac ccttttatga actgaccctc aaactataca ttgcacttcc ttcgggattc 480 gaactcatta cagttagata acgaatctga ttgactacca actgattcag gcagacaaaa 540 tgtggaaatc ggaggatcaa gtttgttgca aatattttac aaagctttcg tcgatcaacc 600 cacccctcca ccattataaa aaattggctc aaaaaccagg agcaaaaata tattttcaaa 660 aaacttaaaa aaatgaaatt taagtgcaat cagctgaaat ttattaaata tgcattcctt 720 tgcatttaga atcagttgag tttattaaaa ataatttgaa ttttagtgaa ttttcgatga 780 aataaataaa tgttttttcg ctaatttttt ttttgtcgaa tctttttttg aaaataatga 840 ttgcagttta actatacgga agcttaaaac attttctaac attttttttc attgaaatgt 900 tgaaattctg gctctcaacg atttttcatt agccacactt aacttataga taaaattacg 960 cctttagatg aattgttcaa catgtaatgt caccattttc gaaatataaa aacaaaactt 1020 tttttttttt gaaaacacaa gtacattcag ctaaaaactt tttttttcaa atacctgaaa 1080 caataaaatc aacaataccg atagaaaacc gttattttgt ggtttctatt attttgaatt 1140 agattttttt taaataacag aaattaaatc ttttaaatta aaataacgta atttaatttt 1200 tttaataacc ttttttgtaa atttggcccg caagctcatt tgagcttcaa atttggcccg 1260 gcctccaaaa actttgagca cccctg 1286 // ID BEL-621_AA-LTR repbase; DNA; INV; 692 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-621_AA_; KW Pao_Bel_Ele9; BEL-621_AA-I; BEL-621_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-692 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 692 BP; 247 A; 114 C; 103 G; 227 T; 1 other; tgttgctacg accgtggcac tagcaacctt gcccgctaca ccttcacaac aaattctctc 60 catcaacgct aggagcgcta ctgaacagac agacagattt aatacagaga aaattcaatc 120 accaatgttt tgatttggaa actattgtgc tcatgtgcta tccaacattc taaaattatt 180 ttatatttct attcaaaagt aaaattctta ttattaaatt gaattgctta aaagctaaca 240 tttgcttcta aaattacaga taagatattt ttccaaaaag cactagaagt gtcttaaatt 300 agttataaat aggcttaacg agttggtagg ttatacattg accaaaatta ttgaattatg 360 taatccttat gtaatgaatc ttctacagga agagctatgc gcctttatca accgaattat 420 tgaacttaaa ttaatcatcc tacaaaacgg taggtaaaaa tttaaaacta aatgtatgct 480 taagatgtat tatatttcta tawtgatcct agcagattgc ttcgataaaa actggattgt 540 tacctaacgt gttttccata atagcgcact attaaatgta agtagctcaa tattgtacaa 600 ggcataaata attatgaaga aatattttca gcttgaagca aattacaata aagtcgctat 660 caggattagt gcgaagtttg tttctggcaa ca 692 // ID Transib1N1_AA repbase; DNA; INV; 3520 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Transib DNA transposon family from Aedes DE aegypti. XX KW Transib; DNA transposon; Transposable Element; nonautonomous; KW Transib1N1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3520 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1312-1312 (2011). XX DR [2] (Consensus) XX CC >98% identical to consensus. 5-bp TSDs; usually CANTG. TIRs are CC 860 bp long. Both termini are >90% identical to those of CC Transib1_AA. The insertion of DNA-TA-8_AAe-like transposon is CC masked by x. XX SQ Sequence 3520 BP; 942 A; 538 C; 530 G; 969 T; 541 other; cacactggtc caggaagcta atttaggcgg acatgaggtt ttcgtcaaga tttattgatt 60 ttagagccat agtttattct gagaatatgt tcagaatttt cagtactaca aaatgtacgg 120 aacaaaaatt tttttacggt gatcgcaaga gtgacgcata cgccaacttt tttattttaa 180 cgaatcgtaa gttggtatgt tcagcaaagt tgtagcaaat gatcatagaa acaactttgc 240 cgaacaaact ttttctcggt tttgcttcag agccgagata attaagtatt tttaataaaa 300 aatcgttttt tgctcttaag tgttttattt tttagtttta ttggcctact atgttctaca 360 aagtttttat acttatcatt tgctacaact ttgctgaaca taccaaagtt gtatttctta 420 tggttttcat gttatttgag tttttcatct taaaattagc tattttgtat gccttgtatc 480 tcaactatga gcacctcaaa aaaatatatc tttccaccaa atgattttgc agtctttcaa 540 gtacctgtga gcaaaatttg gagaagatga tatttttcta tctcgtttaa aatcgatttt 600 actaatagac gatatgagat aaatccatat ttattgaata ttcatacatg ttcaactcac 660 aaaagtcatg gctacctaag catatcaaac caaaggtaac acgtgtattt atataggaat 720 ataaagtaca tcgtttttca gttgttttcg attaacttgg aagcttctgc aagaaattca 780 tcgtcttttc ctctacctgt atttaatcta ttcctaagca agatcaccgg gttggaagtc 840 aacataagcc gtataaaata ggatgtcaat gctagtaatg gaccccttag tagctgaaat 900 tcattctcga cgcctttccg caattatata tcaggcagat atacattttg tagagtctaa 960 taacaagttc aatgtcttta cttcacagta cacccatgca acatacaaaa tcatacgatt 1020 ttcccagaga aatatacatt taaaactgtt gtccgaatgc ctcgaatggg agggtccata 1080 ataggattgt ttacaaattt gacgttgcgt attatgggcc ctccatttga ttcccatgta 1140 atccggctcg ctgccgacga gcctattatg gacccacctg gaaatgcgta ttatggaccc 1200 tcttgtgtgg tttcatttac aaaactgttc aaatatctgg atttatgcgc ctcaaaccaa 1260 atattttgcc tgaaactgct gactaacaat gcaatcgaag ttcccccggt ggattttcat 1320 aggaataagg ttgctttgag tgttaattta tgtttttctg tagggggtcc ataatacgca 1380 aagggtccat aaccggcatc gactccctac atacacaaaa tttacagtcc gattgaattc 1440 tgtggcatga ttttcccaga atcttctgat ggattccacg tttcatgttt ttctttggta 1500 atgcggcatg ccggacaaca ccacttccgt gcaatagttg tagacgtatc agtaagtgta 1560 acagcattct aaaggatcga acctcttcca gctggtagga actcaaatgg tctaaccgga 1620 tgccgccttc taagggaagt agacattxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1680 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1740 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1800 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1860 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1920 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1980 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2040 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2100 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2160 xxxxxxxxxx xxxxxxxxxx xxxxgtgtat ttctttcact actggactgc gaagtgcggc 2220 ctcataagts cgaaagtgtg aataaaagtg agggattgca ttgaatcctg ttggtgtctt 2280 cagagcactt attcttcgat ttacgaagaa taagtccccc gaagacacct acccgatttg 2340 attcaattgc gcacttttgt tagcgtttcc gcacttgtaa taacttcttt tcacggatct 2400 caatcatttt aaaagtggtc aactttaatc ggaagattaa cgaggaggca tttgaacagt 2460 ttttgtcaga aacgtactgt tagtatcgcc agttctacga ttggtacgta ctctctccaa 2520 cggtgcaaaa actactttcg catggagtga tgttgtccgg catgccgccg aagaagcata 2580 aaaaatgcgg aataaatcca tcagaagatt ctgggaaaat catgccacag aattcaatcg 2640 gactgtaaat tttgtatatg tattttatac ggcttatgtt gacttccaac ccggtgatct 2700 tgcttaggaa tagattaaat acwggtagag gaaaagacga tgaatttctt gcagaagctt 2760 ccaagttaat cgaaaacaac tgaaaaacga tgtactttat attcctatat aaatacacgt 2820 gttacttttg gtttgatatg cttaggtagc catgactttt gtgagttgaa catgtatgaa 2880 tattcaataa atatggattt atctcatatc gtctattagt aaaatcgatt ttaaacgaga 2940 tagaaaaata tcatcttctc caaattttgc tcacaggtac ttgaaagact gcaaaatcat 3000 ttggtggaaa gatatatttt tttgaggtgc tcatagttga gatacaaggc atacaaaata 3060 gctaatttta agatgaaaaa ctcaaataac atgaaaacca taagaaatac aactttggta 3120 tgttcagcaa agttgtagca aatgataagt ataaaaactt tgtagaacat agtaggccaa 3180 taaaactaaa aaataaaaca cttaagagca aaaaacgatt ttttattaaa aatacttaat 3240 tatctcggcw ctgaagcaaa accgagaaaa agtttgttcg gcaaagttgt ttctatgatc 3300 atttgctaca actttgctga acataccaac ttacgattcg ttaaaataaa aaagttggcg 3360 tatgcgtcac tcttgcgatc accgtaaaaa aatttttgwt ccgtacattt tgtagtactg 3420 aaaattctga acatattctc agaataaact atggctctaa aatcaataaa tcttgacgaa 3480 aacctcatgt ccgcctaaat tagcttcctg gaccagtgtg 3520 // ID Gypsy-4_PPP-LTR repbase; DNA; INV; 213 BP. XX AC ADBJ01000042; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_PPP_; KW Gypsy-4_PPP-I; Gypsy-4_PPP-LTR. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-213 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2165-2165 (2010). XX DR GenBank; ADBJ01000042; Positions 87631 87843. XX SQ Sequence 213 BP; 79 A; 38 C; 22 G; 74 T; 0 other; tgttggaatc gtagttccaa gatatcagtc aaccgactac tgctaaccga tcaatgaaca 60 caatgtattt aagcgcaaac aagaactcaa taaacattat taatttcatt atattttaac 120 atcatcttta aatacataaa ctatttatta ttataacatt attatattat tacacgatgt 180 cgttattatc acggtttaac gacgtctcta aca 213 // ID Mariner-39_HM repbase; DNA; INV; 3232 BP. XX AC . XX DT 10-SEP-2009 (Rel. 14.09, Created) DT 10-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-39_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3232 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 9(9), 1927-1927 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1189..2700 FT /product="Mariner-39_HM_1p" FT /translation="MPHNWSANEAAGEDWFSAYLKRHKKLSIRKPEATSQA FT RVSSFNPTNVQKFYNNLQTILNRLKLESGDIWNMDETGITTVQTPDHIVAR FT KGFKQIGRVTSAERGNLVTVAVAVSASGNSIPPFFIFPRVKFKSYFLNGAP FT DGSAGAANPSGWMTEVQFLQFSHHFVKYARSTKERPVLLLLDNHDSHLSVE FT ALDYFKENGVSVCSFPPHCSHKLQPLDRSVFGPFKKYTNTACDAWMTMHPG FT STMSIYSIPGIVGNSFPLACTPNNIKAGFAKTGIYPLNVNIFGEHDFMPAY FT TTDRPPPKQDCAVESTALNISNVTDVDFTISENDMLNAGCSKDIEPRQDIS FT YSSSNDVHSASTESTVVLSPEDLRPFKKAEPRKRVRANKRSRTTSILTDSP FT NMNVLKIEKESAKKKKLAIEERKQAKKLNEDNIKMSKRSNSKGSLHLMEKS FT LNCPNSSVEEECFCLVCVGMFSKSKPEEVWVQCTQCHLWAHESCIIGKSLF FT FICNNCC*" XX SQ Sequence 3232 BP; 1074 A; 543 C; 584 G; 1031 T; 0 other; ggggagaccg gggctagttg gcagcagggg taagttgacg aattgcgttt atctgcaaag 60 tctttcatca aaaagtgcca ataatttcag gaaaccttct tagttgacca tttcatcttt 120 cactgtagta ttgtggggat ttgcgcatgc gcataaaaga tattgtgatt tatgtgtttt 180 ttccacattt ttgtaacttt tacaacttct ttattttttc gatttttaaa agcaatttca 240 tcaagtacac gtaaacttga tttgagttat atcttgcatg gttcagtaaa atctgtccaa 300 ccattatttt ttatttatat acgtttttca gggatcggta ggggagtttg tgaccaaata 360 gctgtcgggg ttagttggca aaattttggc ggggcaagtt ggccgtatat atcaactaaa 420 cccggttgcg attggacata aatttcaact ttttctataa cttttgattt atttttttaa 480 aaatcgatta gatagatcac atatattaca tgcgaaacta taaaagaaaa acaaagaggg 540 ctataactcc tcaaaatgtt ataaaaaatg ctgttgatgc tgttttatta gaaggaaaat 600 ctatacgaaa aacagccaaa gactttaata tccctgaaaa aagtttgtcc cgatactgca 660 aaaagcaaca acgtcatggt cagcaaatat caggttacat aaaatctaga caggtcagta 720 tatgtccgta acagtttttg tgcctttgag gtaagcactg aaccagtctt ctccagctgc 780 ttcgttgcac tccaattatg aggcattttt atgctattcg cttttccata ctgataagct 840 agtttcctca cttccttagg agaaagtcca taataaatat cagaagcttt agtcacatat 900 tgttccaaca gaccttcttg taaatcagta aagacctaaa tgaacaacac acacatatat 960 atatatacac acacacacac acacacacac gcacgcacac acacacgcgt atatatatat 1020 atagtgttat ttgttttaat attgatattg ttcatttagg tctttactga tttacaagaa 1080 ggtctgttgg aacaatatgt gactaaagct tctgatattt attatggact ttctcctaag 1140 gaagtgagga aactagctta tcagtatgga aaagcgaata gcataaaaat gcctcataat 1200 tggagtgcaa atgaagcagc tggagaagac tggttcagtg cttacctcaa aaggcacaaa 1260 aaactgtcaa taagaaaacc tgaggcaaca agtcaagcac gtgtttccag ttttaatcca 1320 acaaatgttc aaaagtttta caacaattta caaactattc ttaatcgttt aaagttggaa 1380 agtggagata tttggaatat ggatgaaact ggtattacca cagttcaaac tccagatcat 1440 attgtagcaa gaaagggttt taaacaaatt gggcgtgtta cttcggctga gagaggcaat 1500 ttagtaactg tggcagttgc tgtttctgcc agtggaaatt caattcctcc atttttcata 1560 tttcctcgtg tgaaatttaa aagctatttt ttaaatggtg ctcctgatgg aagtgcaggc 1620 gctgcaaacc catctggttg gatgactgaa gttcaatttt tgcagttttc ccatcatttt 1680 gtcaaatatg ccagaagtac aaaagaacga cctgttttgt tgctattaga caatcatgac 1740 tctcatcttt cagtagaggc tttagattac tttaaagaaa atggggtatc ggtttgttca 1800 tttcctcctc attgcagcca taaacttcaa cctctagatc gaagtgtatt tggacctttt 1860 aaaaaataca caaacacagc ttgcgatgcg tggatgacaa tgcaccctgg ttccacaatg 1920 tctatttaca gtataccagg tattgtgggt aactcatttc cattagcatg tacccctaat 1980 aatatcaagg ctggttttgc aaaaacagga atttacccac taaatgttaa catttttggt 2040 gaacacgatt ttatgccagc ctataccaca gatagaccac ctccgaaaca agactgtgct 2100 gtggaaagta cagcgctaaa tataagtaat gtaacggatg ttgactttac aatatctgag 2160 aatgacatgt taaatgctgg ttgctcaaaa gatatagaac cacgtcaaga tatttcatat 2220 tcaagcagta atgatgtaca cagtgcatct acagagtcaa ctgtagttct atcgcctgaa 2280 gatttaagac cattcaaaaa agctgaacca agaaaaagag ttagggctaa taaaagaagt 2340 agaaccacat caattttaac agattcacca aatatgaatg ttttaaaaat agagaaagaa 2400 tcagctaaaa agaaaaagtt agcaattgaa gagagaaaac aagcaaaaaa gttaaatgaa 2460 gataatatta agatgagtaa aagatcaaat agtaaaggaa gtttacactt aatggagaaa 2520 agtttgaatt gtccaaatag ctctgttgaa gaggagtgtt tttgtttggt ttgtgttgga 2580 atgttttcta aaagtaaacc agaggaagtt tgggttcaat gcacccaatg tcacctttgg 2640 gctcacgagt catgtataat cggaaaatct ttgtttttca tttgcaataa ttgctgctag 2700 ccgatttgaa actccattag ttaaaaatat tcatgtaata aaacaatata agttttaatt 2760 cttattgtat aaccataaat tgggaacact gttttgtttt gtcactgtat tttttaatta 2820 tatttacttt ctattgctta aaatgttaca atgacttgat ccaaaagttt gatgatattg 2880 ataaaagtat atttcagcta tgattccttt aaaaaaaagg ggcggggggt taaaatatta 2940 ttggaataca cttaaatagg acctttaact ttagctttta ttattgcgag tcaacttacc 3000 ccttgagtat aagcaactta ccccagggtg gggtaagttg gcgcaatttt tttttttttt 3060 tttgagggcc cagaaaggcc ccaaaatatt ttttttgtaa atgaatgcaa cttttataag 3120 ttacccaaat tcatactctt cgaaactaca cttttatatt ttcaaaaaaa tgttccgtta 3180 ggactacaga gctcatagag taaaaaccga gtcaactagc cccggtctcc cc 3232 // ID Copia-2_CQ-I repbase; DNA; INV; 4107 BP. XX AC AAWU01028584; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_CQ_; KW Copia-2_CQ-LTR; Copia-2_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4107 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 319-319 (2011). XX DR Genome; AAWU01028584; Positions 6466 10572. XX CC Positions [1512-2048] - Integrase core CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS 66..4097 FT /product="Copia-2_CQ-I_1p" FT /translation="MAESGQEERFFVPLLDGTNFNAWKYRMLVFLEEHELL FT ECIEKEVEEMVELVVKDEDTAQEKAQKQVKIDRRRKMDRKCKSKLVSRIND FT DQLEHIQDEKTPKLIWEKLHRVFLRESVARRMHLQRQLHSMKFDGGSLQSH FT FLRFDRLLRQLRATGAKVDELDAVCSLFLTFGTAFSMVITTLESMKPENLS FT LEYAKCRLLDEEVKQKEIGIESVPAGTSSNRDVAAFSGSAKTKKKLQCWAC FT KQEGHKADACPSKKESDRKKKKKSDNKPKAHLAEESGGVCFVGVGSGDGLR FT RVDWFIDSGCSDHIVNDKSLFESLSPLKCPIEIAIAKDGESIKAEFSGTVK FT LFADVGEKLIDCTVKNVLYVPDLRCNLFSVMRVDDVGMEVSYGNGKVVIQH FT GQRIVASGSRCGKVYRLNLFRATPGNSESLLTCGRIPKSLELWHRRFGHLN FT PKSLVKLIRDDMVVGLNQSVSDKDKDIVVCESCVIGKQTRKPFVTRETRAS FT RVLEVVHSDVCGPIAPVGVEGKKYFVSFTDDWSHFVVLFLMKSKDEVFESF FT QRYEAMVTAKFGRKISRLRCDNGGEYRGKDFEKFCQERGIQIDWTVPYTPE FT QNGVSERLNRSLVEKARAMLDDSGADKRFWGQAVQTAAFLLNRSPTRALKE FT NVTPYELWEGSKPNVAKLRAFGCPVYAHVPKELRKKLDPKAWKGVFVGYHQ FT NGYRVWNPELERIVHVRDVDFAEEKESGSHKSKSLHKISQPIVGDESSDED FT QSNDEQEAASVRSAETDGTSSEQPEDDFDSCEEEPLPQGGGVDQPQRPQRE FT KTAPAWLKDYEVDYAAYALNAMNYVENLPNSLSEARKRSDWKLWEAAINEE FT MESLLKNRTWTLEKLPAKRTPITSKWVFKIKRGTDDKPDRYKARLVARGFS FT QKYGLDYTETYSPVAKLDTLRTVLALANHEKMVIHQMDVRTAFLNGELTEE FT IFMAQPEGFEQGENLVCRLQKSIYGLKQASRAWNDRFHAFVEGRLKFVRSM FT NDQCLYTKQTKAGRLIVLLYVDDVLVIGPSQKAVDAVKSCMSKEFEMTDTG FT EVDCFLGMKIERDVEERKLRISQRQYFQDMLLRFGMQDCKPASTPMECRLK FT LLKGAEKDRTDAPYRELIGCLTYASITTRPDLAAAVNFLSQFQSCPTNTHW FT AHLKRILRYVKGTLDFGLVYRAKEGAPTVEVFTDADWANDPTDRRSISGCV FT YQVLGCTTGWMTRKQNCVSLSSTEAELTALCSAACYEMWLVRLLQDLELKI FT PEPVTFYEDNQSSIRIAEESKDFGRLKHVDVKYHFLRDLVKEDKICLQFLK FT SEDQLADMMTKGLPPGAFRKHRASIGLADGSA" XX SQ Sequence 4107 BP; 1017 A; 966 C; 1311 G; 813 T; 0 other; ctttaggtta tcggcccggt cgtattccga aagtctacgg gaatcgcgag tgagtcgggt 60 gaacgatggc ggaatccgga caagaggagc gtttcttcgt tccactgctc gatgggacga 120 atttcaatgc gtggaaatac cggatgctgg tgttcttgga ggagcacgaa ttactggaat 180 gcattgagaa ggaggtcgaa gaaatggtcg agctggtagt gaaggacgaa gacacggcgc 240 aagagaaagc gcaaaagcag gtcaagattg accggcggcg gaagatggac cgcaagtgca 300 agtcgaagtt ggtctcccgg atcaacgacg accagctgga acacatccag gacgagaaga 360 cgcccaagct gatctgggag aagctgcacc gtgttttctt gcgcgaaagt gttgccaggc 420 ggatgcactt gcaacggcag ttgcactcga tgaagttcga cggcggcagc ctgcagtctc 480 atttcctccg tttcgatcgg cttcttcggc agctccgggc gactggggcg aaggtggacg 540 agctggacgc tgtgtgcagt ttgttcctga cgtttggcac ggcattttcc atggtgatca 600 cgacgctcga atcgatgaag ccggaaaatt tgagcttgga gtatgcgaaa tgccggctgc 660 tcgacgagga ggtgaaacaa aaggagatag gtatcgaatc ggtgcctgct ggaacgtcat 720 cgaatcgtga cgtcgctgct ttttctggat cggcgaaaac aaagaagaag ctgcagtgct 780 gggcgtgcaa gcaggagggg cacaaggccg acgcttgccc gtcgaagaag gagagtgatc 840 gaaagaagaa gaagaaaagt gacaacaagc ccaaggcaca cttggcggaa gaaagtggtg 900 gcgtttgctt tgtcggtgtg ggaagtggag acggtttgcg aagagtggat tggttcatcg 960 actctggttg ttccgatcac atagtgaacg acaaatcgct gttcgaatcg ttgagtcccc 1020 tgaagtgtcc gattgagata gccatcgcga aggacggtga gtcgatcaag gccgagtttt 1080 ctggaacggt caagttgttt gctgatgtcg gtgaaaagtt gatcgattgt accgtgaaga 1140 atgtgctgta tgtcccggac ttgcgttgca atctattttc tgtgatgcgg gttgacgacg 1200 tgggcatgga agtgtcctac ggaaacggga aagtggtgat acagcacggg cagagaatcg 1260 tcgcgagtgg ttcccgctgc ggaaaagtgt accgactgaa cttgttcaga gctacgccgg 1320 ggaacagtga atctctgctg acgtgtggtc gcataccgaa aagtctggaa ctgtggcacc 1380 gtcgatttgg gcatctcaat ccgaagagtt tggtgaagtt gattcgggac gacatggtgg 1440 ttggactgaa ccagagtgtg agcgacaagg acaaggacat tgtcgtgtgc gagtcctgtg 1500 tgatcggaaa gcaaacccgg aagccgttcg taacgcgtga aacgagagcg tcgcgagtgc 1560 tagaagtggt ccactcggac gtatgtggtc cgatcgcacc ggtcggagtt gaaggcaaga 1620 agtacttcgt gagtttcacg gacgactgga gccatttcgt ggtgttgttc ctgatgaaga 1680 gcaaagacga agtgttcgag agcttccaac gttacgaggc gatggtgacg gcgaagtttg 1740 gtcggaagat cagccgcctt cgctgtgaca acggcggcga gtatcgagga aaggacttcg 1800 agaagttctg ccaggaacgt gggatccaga tcgactggac tgtgccgtac acgccggagc 1860 agaacggggt aagtgagcga ttgaatcgca gcctcgtcga aaaggcccgt gcgatgctgg 1920 acgactcagg agctgacaag cgcttctggg gtcaagcggt gcagacggcg gccttcctgt 1980 tgaaccggag tcccacgaga gctctcaagg aaaacgtcac gccctacgag ctgtgggagg 2040 gttcgaagcc gaacgtcgcc aagctgcgcg catttggctg cccggtgtac gctcacgtac 2100 ccaaggaact ccggaagaag ctcgacccga aggcgtggaa gggcgttttc gttggctacc 2160 accagaacgg ctaccgagtg tggaaccccg agctcgagcg gattgtccac gtacgagatg 2220 tcgacttcgc ggaggagaag gaatccggat cgcacaaaag taagtctcta cacaagattt 2280 ctcaaccgat tgttggagac gagtcgtcag acgaggacca gtcaaacgat gagcaagaag 2340 cggccagtgt gcgttcagct gaaacggatg gcacatcgag tgagcaaccg gaagacgact 2400 ttgacagttg cgaggaggaa cccctacctc aaggaggtgg cgtagaccaa ccgcagcgtc 2460 cgcaacgaga gaagaccgcg cctgcctggc tcaaggacta cgaggtcgac tatgcagcgt 2520 acgccttgaa cgcgatgaac tacgtcgaaa atcttccgaa ctcgctctcg gaggccagga 2580 agcgaagcga ctggaagctg tgggaagcgg ctatcaacga ggagatggag tctctgttga 2640 aaaaccgaac ctggacgctg gagaaacttc ctgcgaagcg gacaccaatc acaagtaagt 2700 gggtgttcaa gataaagcgg ggaaccgacg acaaacctga ccggtacaag gcccgcctgg 2760 tcgctagagg gtttagccag aagtacggcc tggactacac tgagacgtac tcgcccgttg 2820 ccaagctgga cacgttgcgc actgtcctag ccctggcaaa ccacgagaag atggtgatcc 2880 accagatgga cgttcgcaca gcctttctca acggcgaact gacggaggag atctttatgg 2940 ctcaacccga ggggttcgag cagggcgaga atctggtgtg tcggcttcag aagtccatct 3000 atgggttgaa gcaagcatca cgcgcgtgga acgaccgttt ccatgcgttc gtcgaaggcc 3060 ggctgaagtt cgtgcgcagc atgaacgacc agtgtctcta caccaagcag acgaaagctg 3120 gaaggttgat cgtacttctg tacgtcgacg atgtgttggt catcgggccg tcgcagaaag 3180 ctgttgacgc ggtgaagtcg tgcatgtcca aggagtttga gatgactgac accggagaag 3240 tggactgttt ccttggaatg aaaatcgagc gcgacgtgga ggaacgaaag ttgcggatca 3300 gccagcggca gtacttccag gacatgctgc tgcgttttgg aatgcaggac tgcaaaccag 3360 cgtcgacgcc gatggagtgt cgtttgaagc tactgaaggg tgccgagaag gaccgcacgg 3420 acgccccgta cagagagctg atagggtgtc taacctacgc ttcaatcacg acgagaccgg 3480 acctggctgc agcagttaac ttcctaagcc agttccagag ctgcccgacc aacacccact 3540 gggcgcacct gaaaaggatc ttgcgctacg tgaagggaac gctggacttc ggtctagtgt 3600 atcgtgcgaa ggaaggagcg ccaactgttg aggttttcac ggacgcggac tgggccaacg 3660 acccaacgga ccgacgttcg atcagcggat gtgtgtacca agttctggga tgcactactg 3720 ggtggatgac ccggaagcaa aactgcgtct cactctcatc caccgaagcg gagctgaccg 3780 cgctttgcag cgctgcgtgc tacgagatgt ggctcgtacg actgctgcag gatttggagc 3840 tcaagattcc tgaaccagtc accttctacg aagacaacca gtcgtcgata cggatcgcag 3900 aagaatcgaa ggattttgga cggctgaagc acgtggacgt gaaataccat ttcctgcggg 3960 acctggtgaa ggaggacaag atttgcctgc agttcctgaa gtcagaagat caacttgcgg 4020 atatgatgac caaggggctg ccacccggag ctttccggaa acatcgtgcc agcattggat 4080 tggcagacgg cagcgcttga gggggag 4107 // ID L2-3_Cis repbase; DNA; INV; 5035 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE CR1 Non-LTR Retrotransposon from Ciona savignyi. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-3_Cis. XX NM L2-3_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5035 RA Smit A.F.; RT "L2-3_Cis - CR1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000651 Ci000556 Ci000421 ORF1 (pos 36-896) 27% id/47% similar CC to those in other L2_Cis elements and more distantly to the ORF1 CC product of mammalian L1 elements. The ORF2 product is 37% CC identical (54% similar) to that of L2-2 and 42% similar to the CC mammalian L2 pol protein. XX FH Key Location/Qualifiers FT CDS 36..893 FT /product="L2-3_Cis_1p" FT /translation="MLNLMNKIDLSKKMNNNTAGNSGKEKTGKRSTSALTS FT SVELVLLEKLNDFQQSSEARFLTVINDLQEIKASQQFLEARYEEIKIEISD FT HNKQLKKLKNENGNLKERIRELENKSLENSNTIDALEQYGRRECLEFHGIS FT SHADESTDDLVVATVRKLGLSINKSEISVSHRLAKPSHDRPRPPIIAKFXS FT RKVRDNIYGNRSKLRQVNENLPTGQARIYINESLTKSNKDRFMKCRTYCKA FT NKIRYIWTRNGTTFIKEDDGSSTICIKSDKDLQHLLKQVPLLKTT" FT CDS 941..4201 FT /product="L2-3_Cis_2p" FT /note="PHD zinc finger, endonuclease and reverse FT transcriptase." FT /translation="MTVKYPCRICNLPCKSNQNSIACDICNYWLHAKCLNI FT NEHDLANFATDQLPYFCPVCMEENVPFQAIAQNTFIETISSHAMHSDYFQC FT IKNDIRNSYVENETIESISSSEYYTVDKMTELNRNNDFCIIHVNVRSLQKN FT YDKLYQCLLEASILPDVIAVTETKIRHSMSNGMLNINIPGYNFEHADTPTF FT AGGVGLYIKQSLSYTMLDTTMCIAPEQCEHLCIKVRNMDKSCIVGVFYRHP FT SAIFSLFQTSLSNLIDFLNPSKCNLYLCGDFNIDLIKCSTNHNINNYYNTL FT CSCACKSLITNPTRVTPHSATLIDHIYTNDYKHEIISGILITDISDHFPTF FT IIIKSKNQHVKPSKQYFRNMSNFNEENFLDELHNKLSREFPQNLLNIDGPS FT VNTIMASLLCELEELINKHAPLRPVSNRILKLKKKPWLTKGLLKSIKIKNH FT MYHKIMRMRKLNLGSEHDFSQYKVYRNKLNHLIASAKLMYYRNRLHLYNKN FT TAKQWETIRELVTFKSKRKCGPSVIIGENNEIISDSFQISNKFNNFFSTIG FT SAMASSIPKSNCPPQQHMKWASSSLFLYPIVPDEIHNLISNLDCKKAVKQS FT DIPTKFIKLSASVLSPILADIFNSCIQTGTFPDVLKIAEVIPIHKTGATDR FT CTNYRPISLLSQFSKLFERLLYNRLYDYLIKFNLLSPHQYGFREQCSTANA FT VCDVYNQLLLNSENKFYSCCVFLDLSKAFDTVDHQILLGKLNYFYGVRGTP FT LLLFKNYLINRYQYTYVNTVKSESSMITCGVPQGSILGPLLFLLYINDLPC FT ASNFKTKLYADDTVLFLKDKSLNELETKTNNELLKVTEWLHYNRLSLNHSK FT TNYMIINPFKYCSTTDFKVVLDGTNLKRVNSVKYLGVYIDDKLNWNEQIKC FT LEKRISKYCGIFYKIRRYLDIRTLRLLYFSFIHSHLQYAIICWGAANKTVL FT KPLNVLHNKIIRIITSSQRSCSVTPLYQKLYLLKLNDIYDLEVGKFMHKLF FT NNTLPQPLQLLFKKINTVHCYDTRQTKTTVYFHPSFTTNFGTKTILHNGIK FT VWKLISESLKHASYSVFCKQLKVQLISNYI" XX SQ Sequence 5035 BP; 1794 A; 803 C; 770 G; 1667 T; 1 other; gattctattt gaataagatt tcagatttca cgctgatgtt aaatttaatg aataagattg 60 atcttagtaa aaaaatgaat aataatactg ctggtaactc gggcaaagag aagactggca 120 aacgatcaac tagtgctttg acaagcagtg tggaactcgt tctgcttgaa aaactgaacg 180 atttccaaca atcaagtgaa gcgagatttc tcactgtaat aaacgatctt caagaaataa 240 aagccagtca acagtttttg gaagccagat atgaagaaat aaaaatcgaa atttctgacc 300 ataataaaca gctaaaaaag ctgaaaaatg aaaatggaaa tttaaaagaa cgaataagag 360 aactggaaaa caagtctctt gaaaattcca atacgattga tgcactggaa caatatggaa 420 gacgtgaatg cttagagttt catgggataa gttcacatgc tgatgagtca actgatgatc 480 tggtagttgc cactgtaagg aaacttggct tgtccatcaa caagagtgaa ataagtgtct 540 ctcatagatt ggctaagcca tcacatgacc gaccacgacc acctatcatt gcaaaatttn 600 taagtcggaa ggtacgagac aacatatatg gtaaccggtc caaactaaga caagtcaatg 660 aaaatttacc caccggccaa gcaagaatat atattaatga aagcctaacc aagagtaaca 720 aagacagatt catgaaatgt agaacatact gtaaagcaaa caaaataaga tacatctgga 780 caagaaatgg aacaaccttc ataaaagaag atgatgggag ctcaacgatt tgtatcaaaa 840 gtgacaaaga cctgcaacat ctattaaaac aagtaccact actaaaaaca acttaaaagt 900 tatataagta ttgttctgct ctgtgtaact tatactttaa atgactgtaa aatatccgtg 960 tagaatatgt aacttacctt gtaaatccaa ccagaatagt attgcatgtg atatttgcaa 1020 ttactggcta catgcaaaat gtttaaatat taatgagcat gatcttgcca attttgccac 1080 tgatcagctt ccatactttt gtccggtttg catggaggaa aatgtacctt ttcaagccat 1140 agctcagaat acatttattg aaactatatc aagtcatgca atgcacagtg actatttcca 1200 gtgcattaaa aatgacatta gaaattctta tgtagaaaat gaaaccattg aatcgataag 1260 tagtagtgaa tattacacag ttgataagat gactgaattg aatcgaaata atgacttttg 1320 tattattcat gtaaatgtaa ggagtttaca aaaaaattat gataaattgt atcaatgttt 1380 gcttgaagca tccattcttc ctgatgtaat tgctgtaact gagactaaaa taagacattc 1440 catgagtaat ggcatgttaa atattaatat acctgggtat aattttgaac atgcggatac 1500 accaaccttc gctggaggtg ttggtttata tattaaacaa agtttgtcat acactatgtt 1560 agatacaact atgtgtattg cccctgaaca atgtgaacac ctgtgtatta aggttagaaa 1620 catggataaa tcttgcatag ttggtgtgtt ctatagacac ccttcagcta tttttagctt 1680 gtttcaaact agtttaagca atttaataga ttttcttaat ccctcaaaat gtaatttata 1740 tttatgtggt gacttcaata tagatctgat taaatgctct actaaccata atatcaacaa 1800 ttactataat acattatgca gttgtgcctg taaatcatta atcactaatc ctacaagagt 1860 aacaccccat agtgctacac ttatagatca catatataca aatgattaca agcatgagat 1920 tattagtgga attttaatta ctgatataag cgaccacttt ccaactttta ttattataaa 1980 atcaaaaaat caacatgtta aacctagtaa acaatatttt aggaacatgt ccaattttaa 2040 tgaagaaaac tttttggatg agttacacaa caagttgtct cgggaattcc cccagaactt 2100 gctaaacatt gatggtccat cagtaaatac aattatggca tctcttctct gtgaactgga 2160 ggagttaatt aacaaacatg cccctttaag accagtatcc aaccgaattc taaagctaaa 2220 aaagaaaccc tggttaacta aggggttact aaaaagcata aaaattaaaa atcatatgta 2280 ccacaaaata atgcgaatga gaaaacttaa cctaggttca gagcatgact tttcccagta 2340 taaagtttat cgaaacaaac taaatcatct tattgcatca gctaaactta tgtattatag 2400 aaatcgtctt catctttaca ataaaaatac tgctaaacaa tgggaaacaa tcagggaatt 2460 agtcaccttt aaaagtaaac gtaaatgtgg cccttctgta ataattggtg aaaataatga 2520 gataataagc gattcatttc aaattagtaa taagtttaat aattttttta gtacaatcgg 2580 ttcagctatg gcctcctcca taccgaaatc aaactgccct ccccaacagc acatgaaatg 2640 ggcaagttct tcattatttt tatatccaat agtacctgat gaaatacata acctgataag 2700 taatctagac tgtaaaaagg cagtaaaaca aagtgatatt cccactaaat tcataaagct 2760 gtctgccagt gtactatccc caatacttgc agatatattt aatagctgca tacagacagg 2820 tacttttcct gatgttctta aaatagcaga agttatacca attcataaaa ctggtgcaac 2880 tgatagatgt acaaattatc gacctatctc gctattatcc caattttcta aactattcga 2940 aaggttgcta tataatcggt tatatgatta tctaattaaa tttaatctac tttccccaca 3000 tcaatacggc tttcgtgagc aatgctctac tgcaaatgct gtctgtgatg tttataatca 3060 acttctgtta aactctgaaa ataaatttta tagctgctgc gtgtttttag atttatcaaa 3120 ggcttttgat acggtggacc accaaatact gttagggaaa ctaaattatt tttatggtgt 3180 aagaggtaca cccctgcttt tatttaaaaa ctatttaata aataggtacc aatataccta 3240 tgtcaacaca gtgaaatctg aatcaagtat gattacatgt ggtgttccac agggatcaat 3300 tttaggtcca ttattattcc ttctctatat taatgacttg ccttgtgctt ccaactttaa 3360 aactaaacta tatgctgatg atactgtgtt atttttaaaa gataaaagtt tgaatgaact 3420 tgaaaccaaa acaaacaacg aactactaaa ggtaactgaa tggttgcatt ataatagact 3480 atccttaaac cattcaaaaa ctaactatat gataataaac ccattcaaat attgttcaac 3540 tactgatttt aaagtggtgt tggacggaac caacttaaag cgtgttaatt cggtaaaata 3600 tttgggagta tatatagatg ataagcttaa ctggaatgaa caaataaaat gtttagaaaa 3660 aagaatatct aaatattgcg gaatttttta caagatcaga cgttatctag acattcgtac 3720 tttgcgcctg ttatatttta gcttcattca ctcacatttg caatatgcta taatatgctg 3780 gggtgcagca aacaaaacag ttttaaaacc acttaatgta ttgcataata agattataag 3840 aataattacg tccagtcaac gtagttgttc ggttactcca ctctatcaga agctttacct 3900 actaaagtta aatgacatct atgacttaga agttggtaaa tttatgcata aattgtttaa 3960 taatacttta cctcaaccct tacaattatt gtttaaaaaa ataaacactg tgcattgtta 4020 tgatactcgt caaactaaaa ctacagtata tttccacccc tcgtttacta ctaactttgg 4080 aaccaaaacg attctgcata atgggattaa ggtgtggaag ctgatcagtg aatcccttaa 4140 acatgcctcc tattccgtgt tttgtaaaca acttaaagtg caattaatta gtaattatat 4200 ttaggtttag tatacggtac atcatacact cagttaatat gtaattattt attcggagta 4260 atattaatat ttgcatttga taaatacatt tactttgtaa ttgtgacctg taattacacc 4320 ccataagttt aagattaaaa tacccttatt tgtatttatt atttacttac ttcactgtgt 4380 ataatatgta acagtttatt aatcatttaa tataaatctc tgaattgtta gcccctactg 4440 ttgcagttat gtcacaagtt atattgtatt gtattgcgtg agagtgtagg ctattatttg 4500 acttccgttc tatataactg gagtattgtt cataacaaag ttaaactatt ttattgctta 4560 agcatctgta tgaaattgac ttccgttcta tataactgta gtactgttta tagcaaagtt 4620 aaactatttt aatggttcta tatgaaagcg tgtaaattat tttatataat atgtaacagc 4680 tttaatcttt taatataaat ccttctgaat tgctaatccc aactgttaca gttatgttac 4740 aagttaaccc ttatttaatc agataattat atatacattt tttttttatt tatttattta 4800 tttttttttt ttcacaggga ggcgccaaac tagataactt ttagtttttt ttggtgcctt 4860 cctgttttgc tgtagtattt tttatttctt tggttaatta ttggaatttt agttaacgcc 4920 atatttgttt tggtatacat atgtatatga catgtatttt taaattgttg gttataaacg 4980 ctgcagcgaa agaaacaaaa ataaaaattg tattgtattg tattgtattg tattg 5035 // ID Copia-17_SI-I repbase; DNA; INV; 4068 BP. XX AC AEAQ01022875; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_SI_; KW Copia-17_SI-LTR; Copia-17_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4068 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022875; Positions 5584 1517. XX CC Positions [1535-1987] - Integrase core CC 'AAAAC' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 95..1987 FT /product="Copia-17_SI-I_2p" FT /translation="MAQDNVKFNIPLLNGGNYVFWKTKVRAILTRDDLWDV FT VNEPKPAQPDEAWKKRNNKAMATITLSVEDSQLIHFAHLDNAFDTWQAVLR FT KYERSTFCSRLYLRRKLYSIHYRSGSMSNHIDAIMEVVGLLCGSGRPLENE FT EIVAVLLVSLPESYSRLVTALEGRDEADLTVEYVTGKMLDEYQKRMESNES FT SDKNSEVALQSAVANPSNYSRINKGEKDKRVVKRDNKETRTCFFCHKQGHV FT KANCRFFKKSSQQRGSSAESGKETAEARSGIQDNQITFSMKEEARAANIGV FT WYVDSGATSHMTNDRNFFTVLQDTNMKIFLADGSSVDVAGIGEGYLKCRTT FT DKLIQTIELKNVLYIPRLDGGLISVLKIIEHGFRVIFEKNDCTIYQGRQVV FT AKAENSGNMFRLKTIGLEMIARIANVETCIHQWHRRLGHRDLKAIKPLSKE FT GLATGIKITACSNEIICEHCIKGKLAQTKFPKSKRREKEPMKLIHSDLCGP FT MQTATPNGNKYFLTLIDDYSRFTVVCLLKTKDEVSNVIKEYITMMLTRFGR FT KPIALRTDNGREYVTSELTNFLRKEGIQHQLTVAYTPQQNGVAERKNRSLT FT ESAKCMLLDADLNNRFWGEAVLTAAYLQNRIVS" FT CDS 2683..3891 FT /product="Copia-17_SI-I_1p" FT /translation="MMNPTYKAQLVARGFSQIYGKDYDQAFSPVVKHETIR FT ILLLIAAQKKLHVRHLDVKSAYLNGELEEEIFMDQPQGFQKRGQEYKVLKL FT KKSLYGLKQSARVWNKRATEVLAQMGFLPGKADQCLYTRKEKDGTITYILL FT YVDDLLVASTTEKMIEKISRSIQKHFRIKYLGNVNHYLGIRIERKEDGSFL FT LSQKEKVIKLLEEYSLLESRPVATPMETSFLNSGSSARLPNNSKFRQAMGL FT LLYIATVTRPDIALAVRILCRRFENPTVQDWNAVKRIIRYLASTIDKKLEL FT SSTGKMELECFVDADWAGDRVDRKSTTGCVFRLGDGTVAWSSRKQTSVAMS FT STEAEYVAVSHASKELLWLRQLLRDMHILEKDPTIIYENNQGCICLIGSDR FT CGVRTDVNT" XX SQ Sequence 4068 BP; 1447 A; 730 C; 946 G; 945 T; 0 other; ggttatgggc ccagacttag aaggacaaac aaaataaagc aagatataaa gatataacac 60 aagtgaaaag gctgacgtac acataacctt taagatggcg caagacaacg tgaaattcaa 120 tatcccatta ttgaatggag gtaattatgt cttctggaaa accaaggtac gtgcaatatt 180 aacgcgcgat gatttatggg acgttgtaaa tgagcccaag ccggcacagc ctgacgaggc 240 gtggaaaaag cgcaataata aagcaatggc aacgatcacg ttaagtgtag aagacagtca 300 attaatacat ttcgcgcatc ttgacaacgc gtttgataca tggcaggcag tattgagaaa 360 atatgaacgt tcaacatttt gcagtaggtt atatctccgg agaaaattgt atagtatcca 420 ttaccgtagt ggatccatga gcaatcacat cgatgcaatt atggaagtgg ttggactgct 480 atgcggatca ggaaggcctc tcgaaaatga ggaaatagtt gccgttctat tggtaagttt 540 accagaatcg tactcgagac ttgttacagc gctcgaggga agagacgagg cagatttgac 600 agtggaatac gtaaccggaa aaatgttgga cgaatatcaa aaacgcatgg aaagcaatga 660 gtcaagtgac aagaattctg aggtggcact tcagtcagca gttgcaaatc caagtaatta 720 cagccgtatc aataaaggtg agaaagataa acgtgttgta aaaagggata acaaagaaac 780 gcgaacatgt ttcttctgtc ataaacaagg acatgttaag gcaaattgca gattttttaa 840 gaaatcttca cagcagagag gttcaagtgc agagtcggga aaagagaccg cagaagctag 900 gagtggaatc caagataatc aaataacgtt ctccatgaaa gaagaagccc gagcagctaa 960 tattggagta tggtatgttg attcaggggc cacgagccat atgacaaatg atcgaaattt 1020 tttcacagta ctacaagata caaatatgaa gatattcctt gctgacggtt catcagttga 1080 tgtagctgga attggagaag gttatttaaa gtgcagaact acagataaat taatacagac 1140 aatagaactt aaaaatgtgt tatacattcc acgacttgat ggcggattaa tctcggtact 1200 caagataata gaacacggat ttcgtgtaat attcgagaaa aacgactgca ctatttacca 1260 aggacgccaa gtagtggcaa aagcagaaaa cagcggaaat atgttcagat taaagacaat 1320 cggactagaa atgatagcta gaatcgcaaa tgtggaaaca tgcattcatc aatggcatcg 1380 ccgattaggc cacagggatt tgaaagcaat aaaaccctta tcaaaggaag gactagccac 1440 tggaataaaa ataaccgcat gttctaacga gataatttgt gaacactgta tcaaaggaaa 1500 attggctcaa acaaagtttc caaaaagtaa aagaagagag aaagagccta tgaaattgat 1560 ccactctgac ctatgtggcc cgatgcagac tgctacgcca aatggtaata aatatttttt 1620 aactctaatt gatgattatt cccgttttac agtggtatgt ctactgaaaa caaaggatga 1680 ggtctcgaac gtcataaagg agtacatcac tatgatgttg acaagatttg gaaggaagcc 1740 tattgccttg cgaacagata atggcaggga atatgtcaca tctgagctga ccaatttctt 1800 aagaaaagaa ggcattcaac accaacttac tgtagcatac acgccgcaac aaaacggagt 1860 tgctgagcgg aaaaatagat cattgactga atccgcgaag tgtatgttat tggatgcaga 1920 tttaaataac cgcttttggg gagaagcggt attaactgca gcgtacttac aaaatcgtat 1980 agtaagttga agcatcaata aaactcctgt agaaatgttc actggaaaaa ggccggacct 2040 tagacatatt cggatattcg gttcgaaagt ttattccttc gttcctaaac aaaaacgcaa 2100 gaaatgggac gacaaagcca atgaaggtgt attagttggc tatgatggaa acactaaagg 2160 atatagaatt ctaaacccaa ggacaaatcg gatttgaatc agccggtctg taagaataat 2220 agagcatgat accgagaaga caaatgtacg acacaatttg gatgaaaaag gagaaaagca 2280 tgataaaaca gcacgggacc tttggctacg aaatgttacc aacagaagat gaggaaaata 2340 gcggaactga ttcagaagaa gaatctgaga actgcgcatt ggatgaagaa gaatgtgaga 2400 taccagttgc taagcccgaa caaccacaga gacgggaatc acaacgaacg aataaaggag 2460 tacctcccct gagattggca tatagagcac aaacgaattt gatcacagag ccgggctcat 2520 ggactgaaat gatggaatta ccattgcgcg aacgagagcg atggatagca gcagctaaag 2580 aagaaataaa atccttgaac gaacacaaag catgggaact cactgaactt ccaccgggta 2640 agaaaacaat tacatgtaaa tgggtattca aggccaaatc agatgatgaa ccctacttac 2700 aaagcccaat tagtggctag aggtttctct caaatttacg gaaaggatta tgatcaagcg 2760 ttctcgcctg tggtcaagca tgaaactata cgcatcttac ttcttattgc agctcaaaag 2820 aaattacatg tacgtcacct ggatgtgaag agcgcttatc taaacggcga acttgaagag 2880 gagattttta tggatcaacc gcaaggtttc cagaaacgag gacaagaata caaggtttta 2940 aaattaaaga agagtctcta cggcctgaaa caatcggctc gagtttggaa taaacgagca 3000 acggaggtac ttgcccagat gggtttcttg cccggaaaag ctgaccaatg tctatataca 3060 agaaaggaaa aggatggaac cataacttat atactactgt atgtcgatga tcttctagtg 3120 gccagcacaa ctgagaagat gattgaaaag ataagtagaa gtattcaaaa acattttcgt 3180 atcaagtatt tgggaaacgt caatcattat ttagggatac gtatagaacg taaagaagac 3240 ggatcatttt tgctcagcca gaaggagaaa gttatcaaat tgctggaaga gtatagtcta 3300 ctggaatcaa gaccagtagc cacaccaatg gaaacaagtt ttttgaattc aggatcaagt 3360 gcaagactac cgaataattc aaaattcaga caagccatgg gattgttatt gtacatcgcc 3420 acggtaacaa gaccggatat tgccttggca gtaaggattt tatgtcgccg cttcgagaat 3480 ccgacagtac aagactggaa cgcagtcaag agaataatac gttacctagc atctacgata 3540 gacaagaaat tggagttgtc ttctacaggt aaaatggagc tggaatgctt cgtggatgca 3600 gactgggccg gagacagagt ggacaggaaa tccaccactg gatgtgtctt ccgtctggga 3660 gatggaactg tggcatggtc aagccgcaaa cagacttcag tagccatgtc atcgacagaa 3720 gcggaatatg ttgcagtatc acatgcaagc aaggaactac tttggctcag acaactgctt 3780 agggatatgc atatcctaga aaaggaccca actattatct acgagaacaa ccaaggatgc 3840 atatgtttaa tcggatcgga tcggtgcggt gtacgcactg atgtaaacac atagatgtct 3900 gccatcatca catccgggac ttgcgagaag agaaggtgaa ataaagtact gtccaactga 3960 ggctatgttg gctgacgtct tgacaaagcc gctaccaagg gaacgttttc tggaattgac 4020 cagatgcctg ggaatctgct gattcaaact tgccaacgcg agaagggg 4068 // ID Merlin1_CB repbase; DNA; INV; 1915 BP. XX AC . XX DT 16-JUN-2003 (Rel. 8.05, Created) DT 16-JUN-2003 (Rel. 8.05, Last updated, Version 1) XX DE DNA transposon Merlin1_CB - a consensus. XX KW Merlin; DNA transposon; Transposable Element; 8-bp TSD; KW Merlin/IS1016 superfamily; Merlin1_CB. XX OS Caenorhabditis briggsae OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1915 RA Feschotte C. and Wessler R.S.; RT "Merlin1_CB, a family of Merlin/IS1016-like DNA transposons from RT the nematode C. briggsae."; RL Repbase Reports 3(5), 92-92 (2003). XX DR [1] (Consensus) XX CC The Merlin1_CB family is one of the founding members of a new CC superfamily of eukaryotic DNA transposons related to the IS1016 CC group CC of bacterial insertion sequences. There are at least 6 copies of CC Merlin1_CB with partial or complete coding capacity in the C. CC briggsae CC draft genome sequence (Jim Mullikin's WGS assembly 6/24/02). The CC sequence below is a consensus constructed from these 6 copies. CC Merlin1_CB has been recently active because the 6 copies are CC 96-99% CC similar to the consensus. There are also a few short CC nonautonomous CC elements more distantly related to Merlin1_CB and more ancient. CC The Merlin1_CB consensus sequence is 1915-bp long with 141-bp CC TIRs CC (3 mismatches). It contains one predicted gene (positions CC 340-1402), CC interrupted by one intron (593-639), which can encode a 338-aa CC protein, CC Merlin1_CBp. The C-terminal half of this protein has strong CC similarity CC with the putative transposases of IS1016 from Haemophilus CC influenzae CC (and related IS) and with ORFs from a wide range of eukaryotes, CC including other nematodes, flat worms, mosquito (described CC previously CC in the AF293351 GenBank locus as an IS1016-like insertion), CC ascidians CC (annotated transposase, GenBank AAM76104), zebrafish, CC Xenopus and humans. Many of these ORFs are clearly part of CC transposable CC elements. Multiple alignments reveal that the most conserved CC domain of these proteins contains a putative DDE motif CC (D(60)D(36)E CC in Merlin1_CBp). A HTH DNA-binding domain is predicted in the CC N-terminal region of Merlin1_CBp. Thus, this protein is likely to CC represent the Merlin1_CB transposase. Finally, there are striking CC sequence similarities in the TIRs of Merlin/IS1016 elements; most CC of them (including Merlin1_CB) create an 8-bp TSD upon insertion. CC The putative Merlin1_CBp transposase: CC MLAVISCFCCYSLLLASILIFAMTAVAAILDLKSFNLIKTWPELANKTDDEFDEYLAERGLLWRENPCP CC SCHESRKISKQKNASGAFCKQKFECYKRACRNSRFNCKTGYLKGTFFEGLRGSRKKIFLCSFLYLNGKM CC VMKELAETLETEEKTVIQWCQWFRDIMAESLYQPAIMIGGVGETVQIDETNIVKRKYNVGRIVRNGWLI CC GGIQNNTRAVFIEIVDKRDQATCERIIQQYVAPGTTVITDCWRGYNGLAALGYDHKTVNHSQNFVDPAT CC GLHTQRVESLWSHLKRRIKPKCGLKGDLWDDHWFEALWHFKHHEEAKLYELWAEIARRYPLT. XX SQ Sequence 1915 BP; 577 A; 335 C; 401 G; 601 T; 1 other; ggggctaagg tagttggggg gggccacttt tttttcgttt tttgaggcta atttagtgtc 60 tctggggtag aaaagagcgg cgaagaagcc taagcctatt ttttagacaa attcccatct 120 aaaaacacaa aaaaaaatct tctaaaaaat tttttttacc ggtttttcgt attttttacc 180 gatttttcgc tcatttctcg gaagtttttg ttttctggga tccacgtttg ttcaaaattc 240 ctttttaaag tctgaaaatg gaattttaaa atcattcttc gtcctcgaga actacaaaat 300 ttcttgctca ctctaaattt agattactgt agctcgaaaa tgctagctgt tattagctgt 360 ttttgctgtt attcgctgtt attagctagt attctaatat tcgctatgac tgccgttgcg 420 gctatattag acttaaagtc ttttaaccta atcaaaacct ggccggagct agccaacaaa 480 accgacgacg aattcgatga gtatctcgct gaaagaggct tgctctggag agaaaatcca 540 tgtccgtctt gtcacgagtc gagaaaaatc tctaagcaaa agaatgcctc tggttagtag 600 attaaaagta tttatttcat gcgaattart tgtttatagg agcgttttgc aagcagaaat 660 ttgagtgtta caaacgagca tgtcgaaatt cacgtttcaa ctgcaaaact ggatatttga 720 agggaacttt ctttgaggga ttacgtggaa gccgtaaaaa gatttttctg tgtagcttcc 780 tctatttgaa tggaaagatg gtgatgaaag agttggcaga aactttggaa actgaggaaa 840 agacagtaat tcagtggtgc caatggttca gggatatcat ggcggagagt ttatatcagc 900 ccgcaattat gatcggaggt gttggagaga cggttcagat cgatgaaaca aacattgtca 960 agagaaagta taatgtggga aggatcgttc gtaatggatg gttaataggc ggtatccaga 1020 ataacactcg tgccgttttt atcgaaatcg tcgataaaag agatcaagca acgtgtgaaa 1080 gaatcattca acaatatgtt gctccaggca caacagtcat cactgactgt tggcgaggat 1140 acaacggact cgcggcgcta ggttatgatc ataaaacagt gaaccattct caaaattttg 1200 tggaccctgc cactggtctc catacgcaga gagttgagtc gctttggagc catttgaagc 1260 gtagaatcaa gccaaagtgc ggccttaaag gagatttatg ggatgatcac tggtttgaag 1320 ctttatggca cttcaagcac cacgaagagg cgaagctcta cgagctttgg gctgagatcg 1380 ctagaagata tccactcact taaaattctt cgatggttcc gaaaattgtt gttgttttta 1440 atgtcattaa tcaataaatg ttaagaaaaa gttttttttg aatttggcgc caaaatgcgt 1500 gtgtcccaga ggaagggaaa caaggggttt ttgcctaagc ctattattta aaacaaaaaa 1560 caaaaagttt tgaaaaaaat ttttggccaa aaatttgcaa atttcactga ttttcgccga 1620 tttcaacact tttttggctg aaattttttt caaaactttt tgttttttgt tttaaataat 1680 aggcttaggc aaaaacccct tgtttccctt cctctgggac acccgcattt tggcgccaaa 1740 ttcaaaaaaa aaattttaga agattttttt tttcagattt ttttttgtga ttttagatgg 1800 gaatttgtct aaaaaatagg cttaggcttc ttcgccgctc ttttctaccc cagagacact 1860 aaattagcct caaaaaacga aaaaaaaagt ggcccccccc aactacctta gcgcc 1915 // ID Gypsy-12_DPu-LTR repbase; DNA; INV; 167 BP. XX AC scaffold_221; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_DPu_; KW Gypsy-12_DPu-LTR; Gypsy-12_DPu-I. XX NM Gypsy-12_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-167 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 740-740 (2010). XX DR Genome; scaffold_221; Positions 128696 128862. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 167 BP; 31 A; 47 C; 43 G; 46 T; 0 other; tgttgtgtac gagtcatgat aggtggcgtg tcggttccta ccctccccgt ctggtgtggc 60 cggcaacggc agtctgctgt acgcttcccc tggatacggt tgttccttac gtgtcagccc 120 ttcaatacaa ttacggtggc ttcataccac cggaacagaa tctaaca 167 // ID Gypsy-186_AA-I repbase; DNA; INV; 4729 BP. XX AC supercont1.123; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-186_AA_; KW Gypsy-186_AA-LTR; Gypsy-186_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4729 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.123; Positions 2369108 2373836. XX CC Positions [3585-4055] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 323..2023 FT /product="Gypsy-186_AA-I_1p" FT /translation="MSYVAPNIDPYRKGQSFASWFKRLGYHFRVNKVADEN FT KKDQMFLLGGEYLFEVAQNLYLSEQLLDAAPLEELVEKLKQKLDKTDSTLI FT QRYKFGSRVQQAGETASDFLFSLKLQAEYCNFKDDKNSRILDRVLIGLSDD FT TLRQKLLTEDGENFNLAQAEKTITTWEMAASNAKALTKDDVSAQIASIGSL FT TGGRGAVLQRIADLAQGRSIAPVKSRLGYRPQYKSPARSVDRSTRPYNRFN FT DRAHRSPSRYRQPRHVRFEDSHRRYEDAPLYPAEMETKQQIVIDQRVCSYC FT GVPGHVRRKCFQLKQYKKDPINNINLGKPGSSTENTLSNMMNQMTWDDSTD FT DSDQGDLECMHISSINHLSTPCLLELTVAGKILQMEVDSGASVSVIGKRLF FT DVRLNVPLKESSKNLIVVNGSSLKVAGEAIVSVEFQDVRKNLKLLVLDCDH FT DFIPLLGRTWLDVFFPNWRNFFDKLLPVNNVTDYKVDDLMNDIKDKFSNVF FT IKDFSSPIREYEAELVLKTDTPIFRKAYDVPYRLRDKVLNYLDKLEKENVI FT TAIQTSEWASPVIVVMKKKW" FT CDS 2205..4643 FT /product="Gypsy-186_AA-I_2p" FT /translation="MVINTIKGLFTYNRLPQGASSSASIFQQVMDQILRGI FT ENVYVYLDDVLIAGKNLEDCRRKLHIVLDRLSEANIKVNWDECKFFVTSLE FT YLGHIISEKGLSPCASKIATIKNAQIPTNVTELKSFLGLINYYNRFIPGLS FT SKLHSLYNLLKNDVKFIWNDECDKAFTNSKNLLLETNFLEFYDSRKPIIVV FT SDASGYGLGGVIAHSIDGVEKPICFTSFSLNSAQKKYPILHLEALALVCTI FT KKFHKYLYGQKFTVFTDHKPLVGIFGKEGKNAIFVTRLQRYILELSIYEFD FT IQYRPSSKMGNADFCSRFPLEQSVPKDLDLDCIKHINFSSDCPIDFTNIAK FT NTKADSFLQRIISFMRHGWPKKIEKNFKDVFANQHDLEIFEGCLLYQDRVI FT VPQNLQIEILKLLHANHVGMVKMKHLARRSVYWYGINADIENYVSECETCA FT SLAVHPKQKIDSKWTPSTRPFGRVPIDFFYFKHHVFLLIVDSFSKWVEVEW FT MKNGTSCPKVIKKLLAYFARFGFPDILVSDNGPPFNSHDFKNFLERQGIRV FT LNSPPYNPASNGQAERLVRTVKDVLKKLLLDPEYMSLELEDLINMFLINFR FT NNSLTMEGFFPSEKVLSYTPKVLIDLINPKRHTNEHGIQANNNNNNNNRNV FT DIDNRPMDDRTKEVYEDPLDSLMAGDVLWYKNHDPHHQAKWLKAHFLKRMC FT KNILQITIGNGITTTAHRSQIKIANDTGRRACRRMVVHVDPIVDYPDPLPP FT RRTSDDEEPFAGFQECELEGASLSTVKPNRKRKIETGLALDGQPRRSKRTR FT RNNFRNDFEYF" XX SQ Sequence 4729 BP; 1478 A; 821 C; 1034 G; 1396 T; 0 other; tattatctgg cgacgagaaa aaacggcagc aatcagtgca gtgcatcacc cggacagtga 60 ttgatttccg gcgttagcag acaagttatc tgggtttttg cttccaccat ccaaacagca 120 gcggcggagt catcttcata ggtgcagctg gggtaaacgc aacggaacgg agtggacttt 180 ccggagattt ggtagagccg tgagtgtttt gcggcagctg aacgacagta aatccgacga 240 gtttattttt tctatttgga attaacggtg taatttgtat ttctgatcat attgtcggca 300 cttgggaggt gtgattgata aaatgtcgta cgtagctccg aatattgacc cctaccgcaa 360 agggcagtca tttgcatcgt ggtttaagcg actcgggtac cattttcgcg tcaataaggt 420 ggcggatgag aacaagaagg atcaaatgtt ccttttgggg ggcgaatacc tgtttgaagt 480 tgcccaaaat ttgtacctca gtgagcaatt acttgacgca gccccacttg aggaattagt 540 tgaaaaactt aaacaaaagc ttgacaagac ggattcaact ttgatccagc gttataaatt 600 tgggtctcgg gtacaacaag caggcgaaac agccagtgat ttcctatttt cgcttaaact 660 gcaagccgaa tattgcaatt tcaaagacga caaaaatagt cgcattcttg atcgtgtcct 720 cattggtctg tctgatgaca ccttgagaca aaaactcctt actgaggacg gggaaaattt 780 taacttagcc caagcagaga aaacaatcac tacctgggaa atggcggctt ctaatgccaa 840 agctttaaca aaggacgatg tttctgcaca aatcgcatcc ataggcagcc tcaccggtgg 900 acggggggcg gttttgcaac gtatagcgga tttggcgcag ggccgaagta tagctcccgt 960 gaaaagccgt ttaggttaca ggccacagta taaatcgccg gcaaggagtg tggaccgcag 1020 cacaaggccg tacaataggt tcaacgatcg cgctcatcgc agtccgtcac ggtacaggca 1080 gcccagacac gtacgcttcg aggattcgca tcgccgatac gaggatgcac cgctataccc 1140 tgcagaaatg gagactaaac agcaaatagt gatcgatcaa agagtatgca gttattgcgg 1200 agtgccggga cacgttagaa ggaagtgttt tcagctaaaa caatacaaga aagaccccat 1260 taataacatc aacctaggaa aacctggttc tagtacggag aatactttgt cgaatatgat 1320 gaaccagatg acgtgggatg actcaaccga tgattcagac caaggtgatt tagaatgtat 1380 gcatatttca tccattaacc atttaagcac tccatgttta ctagaactaa ccgttgcagg 1440 aaaaattttg cagatggaag ttgacagtgg agcttctgtt tctgtaattg ggaaaaggct 1500 atttgatgtt agacttaatg ttcctcttaa agaaagttcg aaaaacttga tagtggtaaa 1560 cggttccagt ttgaaagtag ctggggaggc tattgtttcg gttgaatttc aagatgtaag 1620 gaaaaatttg aaacttttgg tgttagactg tgatcatgac tttattcctc tattaggcag 1680 aacttggttg gatgttttct ttcccaactg gcgcaatttt ttcgacaaat tactacctgt 1740 aaataatgtg actgattata aggttgatga cttgatgaat gatatcaagg ataaattttc 1800 caatgttttc atcaaagact tttcgtctcc tattagagaa tatgaagccg aattggtttt 1860 aaaaacagac acacctattt ttaggaaagc ttatgacgtt ccctacaggc taagggacaa 1920 agttttgaat tatttagata agttggagaa ggagaatgtt ataactgcga tacaaacaag 1980 tgaatgggca tctcctgtta ttgttgtgat gaaaaaaaaa tggtgatatt cggttagtaa 2040 ttgactgtaa agtttcaata aataaactca ttgttcctaa tacatatcct ttacctgtag 2100 ctcaagatct ttttgctgga ttggcagggt gtaaagtttt ctgcgcgctt gatttggagg 2160 gcgcatatac ccaattgtct ttaacaaaac gctctagaaa atttatggta ataaacacta 2220 ttaaaggtct ttttacttac aatagattgc cacagggtgc ttcctctagt gcatccattt 2280 tccagcaggt aatggatcaa attttgcgag gaattgagaa tgtttatgtt tatttggatg 2340 atgttctcat agctggaaaa aatttagagg attgcagacg gaagcttcat attgttttgg 2400 atagattgtc cgaagcaaat attaaagtga attgggatga atgtaaattc tttgtgacta 2460 gtttggaata cctgggacac ataattagcg aaaaaggttt gtcgccatgt gcaagtaaaa 2520 tcgccacaat aaaaaatgca caaataccaa caaacgtaac tgagctcaaa tcatttttgg 2580 gactaatcaa ttactataac aggttcattc cagggctgtc ttccaagtta cacagtttgt 2640 ataatttatt gaagaacgat gtaaagttta tttggaatga tgagtgcgac aaagctttta 2700 caaactcaaa gaatttactt cttgaaacaa attttcttga attttatgat tccagaaaac 2760 caatcatagt agtctctgat gcttccggct atggacttgg tggagttatt gcacattcta 2820 tagacggtgt agagaaaccc atttgtttca cttcattttc actcaattct gctcaaaaga 2880 aatatcctat tctacatttg gaggcacttg cactagtatg cacaataaag aagtttcata 2940 agtatcttta tggacagaag ttcactgttt ttaccgatca taaacctctg gtaggaattt 3000 tcggaaaaga gggtaaaaat gctatttttg tgacaagact acagcgttat attctggaat 3060 tgtcaattta cgagtttgac atccagtaca ggccttcttc aaaaatgggt aatgctgatt 3120 tttgttcgcg ttttcctttg gagcagtcag tacctaaaga tctagattta gattgtatca 3180 aacacattaa ttttagcagt gattgtccaa ttgactttac aaatattgcc aaaaacacga 3240 aagctgattc ctttttacag cgtattataa gttttatgcg ccacggttgg cctaaaaaga 3300 tagaaaaaaa tttcaaggat gtttttgcaa accaacatga cttagaaata tttgaaggat 3360 gtttgttata ccaggataga gtaatcgtgc cgcagaattt gcaaattgag attttaaaac 3420 tgttgcatgc caatcatgtt ggtatggtga agatgaagca tttggcaaga cgttcagtgt 3480 attggtacgg gatcaatgct gatattgaaa attatgtcag tgaatgtgaa acttgtgcta 3540 gtttggcagt gcatccaaag cagaaaattg attcaaaatg gacaccttcc actagaccat 3600 ttggtagggt acccattgac tttttctact tcaagcacca tgtttttcta ttgatcgtag 3660 atagtttttc taaatgggtg gaagttgagt ggatgaaaaa cggtacatct tgtccaaagg 3720 ttattaaaaa gttattagca tattttgcta gatttgggtt tccagatatt ttagtatcag 3780 acaatggtcc tccattcaat tctcatgatt ttaagaactt tcttgaaagg caaggaataa 3840 gagtcctaaa cagtcctccg tacaatcctg ctagtaatgg ccaggctgag agacttgttc 3900 gtacggttaa agatgttttg aaaaagttgc ttcttgaccc cgaatacatg agtttggagt 3960 tggaagacct gataaacatg tttttgataa actttagaaa caatagtctg acaatggagg 4020 gtttcttccc ttccgaaaaa gttttatcgt atactccaaa agtgttaatc gatttgatta 4080 atccaaagcg acatactaac gaacacggta tacaggctaa taataataat aataataata 4140 atcgtaatgt tgatatagat aatagaccaa tggatgatcg taccaaagag gtttacgaag 4200 atcctttgga tagtctgatg gcgggagatg tactgtggta taaaaatcat gatccccacc 4260 accaggctaa atggttaaaa gcgcattttc taaaacgtat gtgtaaaaat attttacaga 4320 tcacgattgg aaacggaata acaacaacag ctcacagaag tcaaatcaaa atcgctaatg 4380 acaccggcag acgggcgtgt cggaggatgg tggtacatgt cgatcctatt gtcgattatc 4440 ccgatccatt accaccaaga agaacgagtg acgatgagga gcctttcgct ggtttccagg 4500 agtgcgaatt ggaaggagcc agcttatcaa cggtcaagcc gaatcgaaag agaaagattg 4560 agacaggttt ggcgctcgat ggccagccac ggcgttcgaa gagaacgcga agaaataatt 4620 tccgaaatga ttttgaatat ttttgattga tacgtttaat taatttgtga gttcataaat 4680 gatttttttt aacttcgaga ttaacgaata tctataaaaa gggaaggac 4729 // ID Nimb-1_CQ repbase; DNA; INV; 5402 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Nimb non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; Nimb-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5402 RA Kojima K.K. and Jurka J.; RT "Nimb non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 581-581 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 350..1714 FT /product="Nimb-1_CQ_1p" FT /translation="MASSGGGMYAFPPTLDPSWSLKGSSRYVTVQAPNKGS FT LATVNIFLVTKSIRACATVEDIRKNFKAGTYTVRVPNEAQAKQLKDKLTVL FT GDQITKVEVVDDEFRNTSRAVISCRDLIGVTEEQIVEELSEQGIKAARHLK FT KGVNKVETGSVVITFNKPHPPERVQVACHSVAVRPYYPPPALCYGCFEYGH FT ISKNCRGAKACANCSGPFHGDNIDCPNPAKCKHCGEGHAATNRQCPKRKTE FT EEIVRIKIDRNIPYHEARKLLESRTNAGNYAKVAAAGSMNTAEQVNIEEEK FT NKWLSEIKKKEESLKVLELKYEQKFEEMLKKQSEVDQSYLKLQKCFEKSSM FT QIEYLTKELENKQHYITLLEQKVKTLTSNKMTPNEERRPMPTKRDRSSEAK FT KNDELDSDDSNRSRSPAPKTLAIETTTTSKDDLISLSETIYVEVESSSSME FT FDPNSISPNS" FT CDS 1666..5325 FT /product="Nimb-1_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="IFQLYGVRPELHLAKFLIQWNCNGVRSKHAEIQLLIN FT KHKPTVIALQEVKLQHQNYSIKGFSIFSKLRSNHGGGVMLAVRDGVDAEQV FT TLQTTLEAVAVKIDIPVKATVCSIYIPPNEEITRSDIEDLLEQLPGPVILA FT GDLNAKNELWSNDDDARGRTLASTFEETNLVVLNDGADTYQPHQACQSPST FT PDITACSAPLASRLNWKLAEDVHSSDHFPILVGFSGVSVPEKRRQSWIIPS FT ADWNKFELDVYNSINPEQQYTIEELTAVIISAAEASIPKTKEDIHPRSVPW FT WNERVAEAVKQRRRALRQLQSVRKQRRVNETLIQAAHEEFKQARMKARKVI FT EEEKSRSWQEYVSTINEDTPPSEMWRKIRSISGKRSSTRTWSVKVGDRQTS FT DSKEIAEHLAEFFEQSSSDSSYPDDFRTRKTRVERRPFSFPGDRGANYLQT FT FSKDELLTQLNDLSGSSPGPDGVHNTMLQHLPAIGKDKLLDAYNSIWTSNT FT FPATWKETILVPIPKPGKNPNIPGNLRPIHLSSCVLKLFERMINRRLMDTL FT EERNVFGQHQAAFRKGRQTLDTLASIESYGKNAIATKKHAEFLFLDIEKAY FT DRTWRRLILEGLAKARIGGHMASFCSRFLEDRRFRVHFNGEVSSLKVLQNG FT VPQGSVLAVTFFLLAIDSIRQYIDRDVFLELFADDITLGVSDVNVRRARQK FT LQTVVNAIARWSKETGFKVAISKTAAMHVCFKRFHAKKQPTLKLENQEVEF FT VGSHKVLGVWLDYRLSFKRHLAKTKAECRRGLNLMRCLGKTKFGADRVTMI FT HLIRATLLPKLLYGIPITSGAGESEYMKLAPVYHEAIRLATGAYRSSPIDS FT ILADSGLLPFNFLVDEHLILYSTRQLARNRINRNYPVALRARTTATKLNVN FT MSQVIVADAPAPNDWVLKRGTCRADRFGCEPPAVQRAVFRQTVGELYPNHA FT QIFTDGSLNETGGVGCGVYSSSTSRSITLPSQLTIFSAEAYAILEAIELAK FT RNPHQTVIFSDSKSVISAVKGNKTHHPWVVKIRRELARLRDSVELCWVPAH FT CDIDGNEKADELAKTGASSTQNAELEEIPYPDFKLCVKKLLRQRWNQLWYS FT CDTKLRRIKESSTEWLSSRTLTRRDSRAITRLRIGHTHLTHGHLMANCDPE FT PCETCGETTTVQHILVDCRKFEHQREESGIANSLYVALGDNLDEIGKTLEF FT LRISNLYCSI" XX SQ Sequence 5402 BP; 1714 A; 1380 C; 1242 G; 1066 T; 0 other; aaagtgacgt tcggttgaat aaacaagcgt ctgcgttgct ctgccaattt cgagcgattt 60 tgtcagattt aaaagtgatt tagttatcga actaactgca aacttagcta aggacagtac 120 agtggtgttt aaatagtttc gcagtcccag tgggccacca aaaagggccc gggccgaaac 180 catccgtggt gtgctccgca aaggagaata cctaacctag aagagacaac aaaccaaagt 240 gtaaacaagt gaagaagaac tgtaattttt gccaaaggaa gtacacaatc tcagccttct 300 gagtgaattg tccgccaaag aggcgccctc cgagggagag gcggtcaaca tggcttcgag 360 tggaggggga atgtacgcat tcccacctac ccttgaccca agttggagct tgaaaggttc 420 atctcgttac gtgacagtac aagcaccgaa caaaggatct ctggccactg taaacatctt 480 cctggttacc aaatctatcc gggcatgtgc caccgtcgag gacatccgca agaacttcaa 540 agccggaacg tacacagtca gagtaccaaa cgaagcacag gctaaacagc tgaaggacaa 600 gctaactgta cttggcgatc aaataacaaa agtagaagtt gtggacgacg agttccgaaa 660 cacatcaaga gcagtgatct catgccggga tctaattggt gtaacggaag agcagatcgt 720 tgaagaactg tcggaacaag ggatcaaagc tgcacgacat ctgaaaaaag gtgtgaataa 780 agttgaaact ggatcggttg tgataacctt caacaaaccc cacccgccgg aaagagtaca 840 agtagcatgt cattcggtcg cggtccgtcc atactacccc ccaccagcac tgtgctacgg 900 atgctttgaa tatggccata tcagcaaaaa ttgtagaggt gcaaaggcct gtgccaactg 960 ctctgggcct ttccacggcg ataacatcga ctgcccaaac ccagcaaaat gtaaacactg 1020 cggagaaggc cacgctgcaa caaatcgaca gtgcccaaaa aggaaaaccg aggaggaaat 1080 agtgagaatc aaaatcgacc gaaacatccc ttaccacgaa gcgaggaaac tccttgagag 1140 caggacgaac gccggaaact acgccaaagt cgctgcagcc ggctctatga acacggctga 1200 gcaagtgaac attgaggaag aaaaaaacaa gtggttaagt gaaataaaga agaaggaaga 1260 atcgctgaaa gtgttagaac taaaatatga acagaaattc gaagaaatgc taaaaaagca 1320 aagtgaagtg gaccaaagtt acctaaaact acagaagtgc ttcgaaaaaa gttccatgca 1380 aatcgagtac ctgaccaaag agctggagaa caagcagcac tacatcaccc tcctggagca 1440 gaaagtgaaa actctcacca gtaataagat gacacctaac gaagaacgcc gccctatgcc 1500 caccaagcga gatagatctt cggaagccaa gaaaaacgac gaattagact ctgacgacag 1560 taaccgatcc cgatctccag cgccgaaaac gctggccatc gagacaacaa cgaccagcaa 1620 ggacgatctt atttctcttt ccgagaccat ctacgtggag gttgaatctt ccagctctat 1680 ggagttcgac ccgaactcca tctcgccaaa ttcctaatcc agtggaactg caacggagtg 1740 cgatccaaac atgcagaaat tcaacttctt attaacaaac acaaaccaac tgtaatcgca 1800 ctgcaagaag taaaactaca acaccaaaac tacagcataa aaggattcag catcttctct 1860 aagctgcgat caaatcacgg cggaggagtg atgctggcag tacgagatgg agtagacgca 1920 gaacaagtta ccctgcaaac gaccctagaa gcagtagcag taaaaattga catccccgtc 1980 aaagccacag tgtgttcaat ttacattcca ccgaacgagg agatcaccag atccgacatc 2040 gaggaccttc tagagcagct tccaggtcca gttatcttgg caggggattt gaacgcaaaa 2100 aacgaactct ggagcaatga cgacgacgca cgcgggcgca ctttggccag caccttcgaa 2160 gaaacgaacc tggttgtgtt aaatgacgga gctgacactt accaacctca tcaagcctgc 2220 caaagccctt ctacgccaga tatcacagca tgcagtgcac cattagccag ccgactaaac 2280 tggaagctcg ccgaagacgt gcacagcagt gaccattttc caatacttgt cggtttctca 2340 ggagtgtctg tccccgaaaa aagaagacaa agttggataa taccctcggc cgactggaac 2400 aagttcgaac tggacgtata caacagcatc aatccagagc agcagtatac catagaagaa 2460 ctgacagcag taatcatcag cgccgccgag gcctcaatcc caaaaaccaa agaggacatc 2520 cacccccgct ccgtcccgtg gtggaacgaa cgtgtagctg aggccgtcaa gcaaaggaga 2580 cgtgccctac gtcagctcca gagcgtaagg aagcaaaggc gcgtaaatga aactctcatt 2640 caagcagcac atgaagagtt taagcaggcg aggatgaagg caagaaaagt gatcgaagaa 2700 gagaagtctc gttcgtggca agaatatgta tcaacaatca atgaagacac gcccccatcg 2760 gaaatgtggc gaaaaattcg ttcaatatcg ggcaaaagat cgtccactcg cacctggtca 2820 gtcaaagtcg gagaccgaca aacatccgac tcaaaagaaa tcgcagaaca tctagcagag 2880 ttcttcgaac aaagctcgtc cgactcgtcg tatcctgatg acttccgtac ccgaaaaacc 2940 agagtcgaaa ggcggccgtt ctcatttccg ggagatcgag gtgcaaatta cttacaaaca 3000 ttctcaaagg acgagctact aactcagctg aacgacctct ctggatcttc tcctggaccc 3060 gatggagtgc acaacacaat gctacagcat ctgccagcaa tcggaaagga taagcttctc 3120 gacgcttata actcaatctg gacgagtaat actttccctg ccacgtggaa ggaaactata 3180 ctggtgccaa tcccaaagcc cggaaaaaat ccgaacatcc ctggaaatct ccgcccgatt 3240 cacctctcta gctgcgtcct gaaactcttc gagcgaatga taaatcggag gttgatggat 3300 acgctagaag agcgaaacgt cttcggccag caccaggctg cgttccgaaa gggaaggcag 3360 acattggaca cgctggcaag tatagaaagt tacggcaaga acgcgatcgc aacgaaaaaa 3420 cacgcagaat ttctctttct ggacattgaa aaggcgtatg atcgaacatg gcgtcggctc 3480 atcctcgaag gactcgctaa agctcgaatt ggcggacaca tggcaagttt ctgttcgaga 3540 ttcctcgaag acagacgttt ccgtgtgcac ttcaacggtg aagtatcttc attgaaggtg 3600 cttcagaatg gcgtacccca aggttcagta ctagccgtaa catttttcct gcttgccatc 3660 gacagcatta gacagtacat cgacagagac gtcttcctgg aactgttcgc cgacgacatc 3720 acgttgggag tctcggacgt gaacgtacgt cgagctcggc aaaaactaca aactgtagtc 3780 aacgcgatcg cccgttggag taaagaaact ggcttcaagg tagcaatctc caaaacagct 3840 gcgatgcacg tgtgcttcaa acgattccac gcgaagaaac aaccaaccct taagctagag 3900 aaccaagaag ttgagtttgt tggatcccac aaagtacttg gcgtatggct agactaccgt 3960 ttatcgttca aaaggcatct agccaaaacg aaggctgaat gcaggagagg cctaaatcta 4020 atgagatgtc ttggaaaaac aaagttcgga gctgaccgtg tcaccatgat tcacctgatt 4080 cgagccaccc ttctaccaaa acttttgtac ggtattccaa tcaccagcgg cgccggagag 4140 agcgagtaca tgaaactcgc accagtctac cacgaagcca tccgtctcgc aactggagcc 4200 taccgatcta gcccaataga cagcatccta gctgactctg ggctgctacc tttcaacttc 4260 ctggttgatg aacatctgat tctctactca actcgtcaac tcgctaggaa caggattaac 4320 agaaactacc cggtggccct gcgggccagg accaccgcca ccaaactgaa cgtcaacatg 4380 tcacaagtca tcgtcgctga cgccccggct ccaaacgact gggtcttgaa gcgcggtacc 4440 tgcagagctg accgttttgg atgtgaacca cccgcggttc aacgagcagt gtttcgccaa 4500 actgttggag agctgtatcc gaaccacgca caaatcttca ctgacggatc gttgaacgag 4560 actggcgggg taggctgcgg tgtctacagt tcgtcaacat ccagaagtat aacactgcct 4620 tcgcaactga ctatattctc cgctgaagcc tacgcaatcc tggaagccat cgaactcgcc 4680 aaacgcaacc cgcatcaaac cgttatattc agcgacagta agagcgtcat atcagcagtt 4740 aaggggaaca aaacgcatca cccgtgggtt gtcaaaatcc gtcgagaact cgctaggctt 4800 cgtgactctg tcgaactttg ctgggtccca gcacactgtg atattgacgg taatgaaaaa 4860 gcggacgagc tagcaaagac cggtgcctcc tcgacacaaa acgctgaact tgaggaaata 4920 ccatacccgg acttcaagct ttgcgtaaaa aagttgctgc gacaaagatg gaatcaactt 4980 tggtacagct gcgacacaaa acttcgcaga atcaaagaaa gcagtacgga atggctgtcc 5040 tccagaacac taacccgtcg agactccaga gctatcacca gactacgtat tggccatacg 5100 cacctcaccc atggacatct aatggcaaac tgcgatccag aaccctgtga gacttgtgga 5160 gaaacaacta cagttcaaca catcctagta gactgccgaa agttcgaaca ccaaagagaa 5220 gaaagtggca tcgcaaactc actctacgtc gccctaggag acaatctcga cgagatcgga 5280 aaaaccctgg agtttttaag gatatcaaat ttgtactgct caatttaacg caaacctcac 5340 ttcggacccg aatgactctt gagttaagtg gtccttaaac gcaaataaat aaaataaata 5400 aa 5402 // ID Sola1-3_AA repbase; DNA; INV; 3189 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-3_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3189 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(400..1197,1201..1794,2042..2854) FT /product="Sola1-3_AA_1p" FT /translation="MSLEYIKLYGGGSESEGSSDEDFAGFDLSAPASRYEK FT FGESIRNAWMLDLNQLGNRARFGQRDDGNPVGAHPNVKFSMVSSKPPRFSA FT SDVESNGNGRDPAVAQQANNSSPLEDEPTVAVSPDDNNSHSDSIDSEGSYF FT AEEHRGKRKRTKKEQRKAVVAKKRRVCHPPKLELCGCRGRCTDSLSKDDRI FT SIHRAFWKLNFAEQGEFFRERVVRVSVKKRSTERHYLKEPRKLHSYVYSLE FT MADKRVVKVCRKFFLNTLGYGEHCGLLIGSAKKLLHSYFEHSILFIDRNMV FT YRCLRTDNDGISGKPKMRGKYKRSRERKDAVKAHILLYNPTISHYRREHAP FT NRLYLPSDITEKSMYENYLKQRPNLNVSYTFFCRVVKEMNISLVKLGHEEC FT ETCVTAKQHEDQLGHGDENARHGCSVCERHQQHIHRATAAREQYRKDGETI FT PPNEIVLAVDLQKVRVYLRLQVIQLPRLDGFKTIVFSQRLVAFNETFAPIG FT EFAKSNAVIACLWNEAVAGRSAHDILSCFYRVISHLASRRKITFWLDNCAA FT QNKNWGLFLFLILLVNSPAIQVNEIVLKFFESGHTFMAADSFHAAVEKAMR FT QNPTITYPDFVDVVSKAKKRVDVFDMQVSDFFQTPFNVSQYTLNKCTKRPY FT IDKIKKIIVQKGCFELKYSESVELFSKSQMKQINAEGFNLGNSLKMQRNPV FT GIDAVRKDSLVSAILPLVAEEKKAFWENLPIKTE*" XX SQ Sequence 3189 BP; 982 A; 613 C; 680 G; 914 T; 0 other; actgcccata ctcgcaatac agtcccattc gaaaaatcat catgttgaga aaaacgcatc 60 tttacaattt cacaaaaaaa tacgttttgc tgctactggt tataataaat ataggtagca 120 ttgttcagca tattataatt acaatatgaa agagtcgctg gcagttaaga tacatttgaa 180 ttcgttgtat tttcatattt aaattagaat actggtaaat gggactcctt atccgagtat 240 ttctcacgcc aaatgcattt atacccgcat accagtccca ttagttggga ctcgcttact 300 ttgttatcgc gaacattgac acattgtccc aagtttacaa tcgattccta tcgattccaa 360 gacacttttg catttccaac cgttgtttga aatgtaataa tgtccctcga atacattaaa 420 ttgtatggtg gtggcagtga gagtgaaggt agcagtgatg aagattttgc tggattcgat 480 ttgtccgcac cggccagccg atacgaaaag tttggtgaat ccatccgtaa cgcgtggatg 540 ttggatttaa accagttagg aaaccgtgcc aggtttggtc agcgtgatga cggcaatcct 600 gttggcgccc acccaaatgt gaaattttct atggtgagtt caaaacctcc tcgtttctcc 660 gcatcagatg tcgagtccaa tggaaatggg cgcgatcctg cggtagccca acaagcgaat 720 aactcatcgc cattagaaga tgaacctacc gtggctgttt ctccggatga caataacagc 780 catagcgata gtatcgactc agagggaagc tattttgctg aggagcacag aggaaaacgg 840 aaaagaacca aaaaagaaca gcgtaaggca gtcgttgcaa agaaacgccg agtttgccat 900 ccgccgaaac tcgaactatg tggttgcagg ggacgatgta cggacagctt atctaaggac 960 gatcgcatca gtattcatcg agcattctgg aaactgaatt ttgcagaaca gggagagttt 1020 ttccgcgaga gggttgttcg agtttccgtt aaaaagcgaa gtacggaacg ccactatctc 1080 aaggaacccc gaaaactgca ctcatatgtc tattcattag aaatggctga taaacgagtc 1140 gtgaaagtgt gccggaagtt ttttctgaac acacttggtt atggagaaca ttgcgggtaa 1200 ttattaatag gttctgccaa aaagcttttg catagttatt ttgaacattc gattttattt 1260 attgacagaa atatggttta tcggtgtctt cgtactgaca acgatggtat atccggtaaa 1320 ccaaaaatgc gcggcaaata taagcggagc cgtgagcgta aagatgctgt gaaagcgcac 1380 atcttgctct acaacccgac aatatctcac tatcgccgcg agcatgctcc gaacaggctg 1440 tacttgcctt cggacattac agagaaaagc atgtacgaga actacctgaa acagcgaccc 1500 aatttgaacg tcagctatac attcttctgt cgagttgtga aggaaatgaa catttccttg 1560 gtaaaactgg gccatgaaga atgcgaaacg tgcgttactg ctaagcaaca tgaagatcaa 1620 ctgggacatg gcgatgaaaa cgcgcgccac ggatgttctg tttgcgagag acatcaacag 1680 catattcatc gtgcaacagc agcacgtgag caataccgca aggacggcga gaccatacca 1740 ccaaacgaaa tagtgcttgc tgtggatcta caaaaggtaa gggtgtactt acggtaacac 1800 acctaaacat ggcacaatgg ccggcttgtg cctctcttca aaggcactaa tctgctgaaa 1860 tgatgaacgt aaaaagttta tcttcttatt ttaagcttgc acaaagctaa attagaaatt 1920 gcattgaagt ttgatgcttt tttcgcgaga tttgtgcctt tgaagttttg gtacattatt 1980 gaaatttggt tccaagttgt ttcactttaa attaactgta ttaattaagt gaggctatta 2040 attacaggtc atccagctgc ctcgtctaga tggttttaaa acaatcgtgt tctcgcaacg 2100 tcttgtggcg tttaacgaaa catttgcgcc aatcggagaa tttgccaaat ccaatgcggt 2160 gatcgcctgt ctgtggaatg aagcagttgc gggtagatca gcacacgaca ttctcagttg 2220 tttttatcgt gttatttcgc accttgccag ccgacgcaaa ataacattct ggctcgataa 2280 ctgtgccgct cagaacaaaa attggggact atttctcttt ctgattttgt tggtgaattc 2340 gccagcaatt caagtgaatg aaatagttct caaatttttt gagtctgggc atacgtttat 2400 ggctgcagat tctttccacg ctgctgtaga aaaagcaatg cgccaaaatc cgacgattac 2460 ataccctgat ttcgttgatg ttgtatcaaa agcaaaaaag agggtcgatg tatttgatat 2520 gcaagtgtct gatttcttcc aaacgccatt taacgtcagt cagtacacac tcaataaatg 2580 tacaaagcga ccatatattg acaagattaa gaagattatt gtgcaaaaag gatgtttcga 2640 gttaaaatac agcgagtctg ttgagttgtt ttctaaatcc caaatgaaac agatcaacgc 2700 tgaaggattc aatttgggaa attcgttaaa aatgcaaaga aaccccgtgg gtattgacgc 2760 tgttcgcaag gattcacttg tcagtgcgat tctaccattg gttgcagaag agaaaaaagc 2820 attctgggaa aatttgccta tcaagacaga ataaacttct ttgtataaga atatttttaa 2880 atatagatga actgtaccac cttagcttgt ataatttaat attttataaa ataaaaacca 2940 taatgctaga tttcaacaat aattgaagga atatattggt tcattttctt cgttatcaac 3000 taatgggact gttatgcggg tactttcaat atgggacgca catggtttgg tatttttttc 3060 gcattttgcc catacaaaag taaaatttta aaaatgattt cgaaaagtat acttaaaata 3120 aacactattc atgaagttat ctcaaaagtg gtgaaaattc aaatgggact gttatgcgag 3180 tatgggcag 3189 // ID Gypsy-194_AA-LTR repbase; DNA; INV; 190 BP. XX AC supercont1.67; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-194_AA_; KW Gypsy-194_AA-I; Gypsy-194_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.67; Positions 1678253 1678064. XX SQ Sequence 190 BP; 57 A; 33 C; 36 G; 64 T; 0 other; tgttggggtt ttaccttaaa cgcacaatat tattgacctg tccatacaca ttgattgtca 60 ttgatatgaa gcaagagaaa aataaaggta ttgagttata agcaaacgaa caagcaacac 120 gcgtttaact ttgctcttgt tatccggaat ttctgcggcc attctgtttg tcttgcattt 180 gattatagca 190 // ID Copia13-NVi_LTR repbase; DNA; INV; 289 BP. XX AC AAZX01023252; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia13-NV; KW Copia13-NVi_I; Copia13-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-289 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1160-1160 (2007). XX DR Genome; AAZX01023252; Positions 1157 869. XX SQ Sequence 289 BP; 56 A; 96 C; 60 G; 77 T; 0 other; tgtgtacgat ctcccccgac accgacggca gatccctcgc gtcgcagcga ctctagaagc 60 gctcgccgcc aaaaccacgt gatacggcgc tgagctcggc ggcgctctcc gctcgccgcc 120 tcactctcta cgagatagtc gagctgcacg cacgccttcg gctcttgtcc cgctctgtct 180 aagtttaaac tttaataata aagttacgct acgctctatt acttgatact ctgtctttgc 240 ttctttattt cgtgcaacct tatcgcgtcc ggatccttat ataccgaca 289 // ID Copia-115_AA-LTR repbase; DNA; INV; 186 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-115_AA_; KW Ty1_copia_Ele6; Copia-115_AA-I; Copia-115_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-186 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 186 BP; 58 A; 41 C; 41 G; 45 T; 1 other; tgaagaaata ggaggccgtg agcgatcggc caaaccgtag gtggccatga aatgatgagg 60 catccgatcg tgtcagamac ttcagtcatt ccacaacaag ccttcaagca gacaagacat 120 gtttcttaag caatagttaa ataaagctat tttagtttaa ctcggagctc gcttactatc 180 tgccca 186 // ID I-71_AAe repbase; DNA; INV; 7344 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-71_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7344 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1342-1342 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 467..2188 FT /product="I-71_AAe_1p" FT /translation="MDLASGQNFRPGDPGGPGGGHKLGLNWRQGEYMGPTL FT PAFMDRDGTAGSLQYLRMQTTAGTMPQDPFLLRISVEKHLGARIEGAFKEN FT RGMSYVLKVRSSAQFDKLLRMEKLCDGTPVSISEHPQLNQRQCVVSNQDVA FT GLSDEYLKSQLAAQGVKDLRRIRRRLPDGSFMNTPTIILTISGTVIPEHID FT FGWTRCKTRNYYPSPMLCFRCWTFGHTGKRCTEPHRICGRCCKVHPEHQAP FT VSTQNGNEVSTSFRPEELPSENQDMNRFVCSEPVFCKNCKADDHSVSSRKC FT PIYLKEVEVQKIRVDCNISYPQARREFEARQNASTSAVTFSGVVAGSKDVE FT IGSLKAAIRQMQADAKAKDVRMAEMETALRGPSVGERLDMVREHGTIEQLI FT KQVADLTATVQKLQQTIELKDEIISKLMVERKCTTLNESPPTAFSIADSHG FT EMPVSQESIEVIPATANPKLNAKVQKWIDQTTSGSKKNCVLPKDTGTTVTK FT ADKRHKKPNKNSTEDSMSTDESIASVLSHRTNNTNISIPNKRNHEKTDSSN FT GSDNALNLRRNTKTRKQEEKTKSNRKK" FT CDS 2219..6943 FT /product="I-71_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="INGSHASGNTDTNTFNPSYYIEPDTIKDRTEQTYEEI FT PIIENSQTVQAPDNGVTDSTEAIPRSNIFTTSGTHKGVKTRSPQSPILRLD FT SQGPVGAEAKAVPEPTDNLWRPVSIGRGTHKGVKTCSPMDSAGVKSTDYIQ FT DTLAFRQRAKFETPNPSKDYEGVKLSTRRYPQRNRRPPDKFTMERPVSKSP FT LNGPFPALGPEASNAELYDILHKVRYGYSADIAPTTSHSSGWTFEASQRQF FT PKNLDRCPVGPRGPIASHETIEDNGKSDCLTDNTAFFRTLGTVQPLADNCH FT KPNHSSHPESQNPNEISQWIPTISPPFCAVGTSLPLAGGTLPPPTGGALSR FT SATPTLSHTGSEIPHNNLNRNQTTLAVQWNMNGYFHNLPDLEMIVDRLRPI FT TLAIQEIHRVTVARMNKTLRQQYKWYVKTGNNIYQSTAIGVLADIPTEELE FT IDTDLPIIGVRIPWPFPVSIISFYLPNGRIPDLENCLREILDQVPSPVILL FT GDANGHHMAWGSRHNDARGAIITGLAGQYDLTILNDGSSTFLRGNTESSID FT ISLVSARLTNRLLWAVEQDLHSSDHYPISLTLEGTSTPQTTRRPKWMFEQA FT DWPAFQSEIDVRLASSPPESLEEFADLLRSTASQTIPRTSPKPGRRALYWW FT NEDTRRAVKMRRKKLRKLRKLQKKLPEEHPDRTKAQEEYSASRNGCRQTIR FT NAKEASWTEFLDGINEEQTASELWRRINCLQGKRRMKGLALKVDGLLTRDP FT ASIADALADQFQKLSSFQRYPESFRNLLQSPETAIRDFPVPPNKGQSFNLE FT FTIDELEFALKKSKGKSAGPDEIGYPMIKYLPPSGKLALLKGINREWLYGT FT LPETWNHSLVIPIPKNSGPPSDAGSYRPIALTSCLSKVMERMVNRRLISFL FT EDKRLLDHRQHAFRSGFGTSTYLAALGQILDDSFQKGEHVEIASLDLAKAY FT NRAWSPAILQKLANWNISGNMLAFIKNFLNGRTFQVLIGNERSKISYEETG FT VPQGSVLAVTLFLVAMSDVFLALPKGVFILVYADDILLLVSGKHPKSTRRK FT LQAAVSAVERWAKKVGFDMAAEKCARIHICNSNHRPPGTVSINKTPIPIKN FT RVKILGVTLDRNLSFQAHFKNVKVSCRNRVSLMRSISTKRTRSNRTTLKNV FT MDAVVCSRLLYGIEVTCRNLDELIKHLAPIYNKCIRHVSGLLPSTPALSAC FT AELGVLPFRHKTILALCNRTIGYLEHTKNQGPVCFLAKQANLALASVADVQ FT LPPVAGLHRRGPRSWTAKAPKTDGTIQTSFRKKATPERVKAHFYQRISEAY FT KNHEIRFTDGSKLGDQVGFGVLSDQIELSYSLPNGSSVFSAEAAAIHQAII FT SPSDKPLLIVTDSASVLAAIKSPTNKHPYIQGIQVALDRTKREVVFMWVPG FT HSGIAGNEKADALAGTGRNRPRLTRKIPGDDIKTWTKNRVWNAWSQEWRQE FT RTLFCRKVKNTINPWKDLPNRREQVVLSRLRTGHSRISHDMSGETADFRRH FT CESCGERCTIEHVISNCPSLEDLRRIHDITNVVRALQNDPSFERQLINFLK FT DARMYDCV" XX SQ Sequence 7344 BP; 2162 A; 1833 C; 1695 G; 1654 T; 0 other; tcagtcgaaa gcataccgcc gtcgcgttcg aacgcttttc tgtatagctc tgaacttatc 60 gaatcgagtt tctttgttgc gctccgaaac aatcaactat cgtctagtgt tctacgctgg 120 ggccgatccg gtgtcggagc attttgcatt gtctcccgga agataacttg ggctgcacac 180 aggggcccat tgttctctgt tggttgaaag ggaatccacg tggtgttgct agtcgtaact 240 ttaaagcggt gttttttttg cctcttttgc ctcccagact aagtgcgact cgcgtgcagg 300 tgttcgataa tccactgagt tggacacagt ggtctccaat ttgctaagac ggactacagt 360 gacagtgtga gggtgttgtt cgagtgcagg cctaagtgag ttgtgtgaga agcaaccaat 420 ttggcgttag gtgggcgtgt aagttggtaa ccgccgctct gccagtatgg acctggcaag 480 cggccaaaat tttcgccctg gggatccagg agggcccgga ggtgggcaca agctcggttt 540 aaactggcgt caaggggagt acatgggacc cacactccca gcgtttatgg accgggatgg 600 tactgcgggt tcgctccagt accttaggat gcagaccacg gctggaacga tgcctcagga 660 tcctttcctg ctgcgcatat ctgttgaaaa acacctcgga gcacggattg agggagcgtt 720 caaggaaaat cggggaatgt cgtatgtgct aaaggtacgt agttcagccc agtttgataa 780 actgctccgt atggaaaaac tgtgtgacgg aaccccggta tccatcagtg agcatcccca 840 attgaaccaa agacagtgtg tcgtctctaa tcaggatgtt gctggtctca gcgacgaata 900 cctgaaaagc caacttgctg cacaaggagt gaaggacctt cgtcgaatcc ggcgccgtct 960 tccggatggt tcgtttatga acacaccgac gatcattcta acaatcagcg gcacggtcat 1020 cccagaacac atcgacttcg gatggacaag gtgcaaaacg agaaactact acccgtcccc 1080 gatgctgtgc tttcgttgct ggacttttgg gcacaccggt aaacgatgca ctgaaccgca 1140 tcgtatctgc ggccgatgct gtaaggttca cccggaacat caagcccctg tttctacgca 1200 aaacggaaat gaagtaagta catcgtttcg tccagaagaa ctccctagcg aaaaccagga 1260 catgaacaga tttgtttgct ccgaaccagt cttctgcaaa aattgcaagg ccgacgacca 1320 ttctgtatct agccggaaat gccctatcta cctaaaagag gtagaagttc agaagataag 1380 ggttgactgt aacatctctt accctcaagc tcgtcgcgaa ttcgaagcac gtcaaaacgc 1440 cagtactagt gccgttacct tctccggagt tgtcgctggt agtaaggacg tggagattgg 1500 aagtctcaaa gcagctatcc gacagatgca ggcagacgca aaagctaaag acgtgaggat 1560 ggcggaaatg gagaccgctc ttcgaggacc aagcgttggc gaacgcctag atatggttcg 1620 tgaacacgga actattgagc agctgatcaa gcaagtagcc gacctaactg ccactgttca 1680 aaaattgcag cagacgatag aacttaaaga cgaaatcatc tctaagttga tggtcgagcg 1740 aaagtgcaca accctcaacg aatcgccacc aacggccttc tcgattgcgg attcccatgg 1800 agaaatgcca gtttcgcagg aatcgattga agttatcccg gcaaccgcaa acccaaaatt 1860 gaatgcgaaa gttcaaaaat ggatcgacca aaccacctca ggcagcaaga agaactgtgt 1920 attgccgaaa gatacaggga ccactgtgac taaggctgac aaacggcata agaaaccaaa 1980 caaaaactcc actgaggaca gtatgtcaac cgacgaaagt attgcctctg tgttatcgca 2040 ccgaacaaac aacaccaaca tctccatacc taataaaagg aatcacgaaa agacggattc 2100 cagtaatggt tccgacaatg ctctcaacct gcggcggaac accaagaccc gtaagcagga 2160 agagaaaaca aagagtaata gaaagaagta aaacccccaa cctgctcagt tccactaaat 2220 taacggctcc cacgcaagtg gtaatactga taccaacact ttcaatcctt catattacat 2280 cgaaccagat acgataaagg atagaactga gcagacgtac gaagaaatac caattattga 2340 aaatagtcaa acagtacagg ctccggataa cggggtcacc gacagtacgg aagctatacc 2400 acgttcgaat atattcacta ccagtgggac gcacaagggc gtcaaaaccc gttccccaca 2460 atccccaatt ttgagattgg acagtcaggg ccccgtcggt gcggaagcca aagccgtacc 2520 ggaaccgacg gacaacctct ggcgtccagt ttcaattgga agagggacgc acaaaggtgt 2580 caaaacctgt tcccctatgg acagcgctgg agtcaagtca acagattata tacaggatac 2640 cctggccttt aggcaaaggg cgaagttcga gacgccaaat ccctcgaagg actacgaggg 2700 agtcaagcta tccacaagaa gatacccaca acgcaaccgt cgacccccag ataaatttac 2760 aatggaacgt cctgtcagca aatctccact caatggacca tttccagcac taggacctga 2820 ggccagcaat gcagaattgt acgacattct gcacaaagtg agatatggat attctgctga 2880 catcgcccct acaacctctc acagctcagg ctggaccttc gaagcttcgc aacgacaatt 2940 ccccaagaac ctggatcgat gccccgttgg acctcgaggc cctatcgcta gccacgagac 3000 gatagaagac aatggtaagt ccgactgctt gactgacaac accgcttttt tccgcaccct 3060 cggcacagtc cagcctttgg cagacaactg tcacaaaccg aaccattcta gtcatcccga 3120 gtcacaaaac cccaatgaaa tatcccaatg gatcccaacc atttctcccc ctttttgcgc 3180 tgtcgggaca tctctgcctt tggcgggtgg aacactacca cctcccactg gtggagctct 3240 atctcgttca gcaaccccca cgctctcaca cactggctct gaaatccctc acaacaatct 3300 gaaccgaaat caaactacac tcgcggttca atggaacatg aacggttatt tccacaatct 3360 gccggacctc gagatgatag tcgatcggct ccgtcctatc actctcgcga tacaggagat 3420 tcaccgtgta accgtggcac gaatgaacaa aacattgagg caacagtaca aatggtatgt 3480 taaaactggc aacaatatat accagtcaac ggcaattgga gtattggcag atattcctac 3540 agaagaactt gaaatcgaca cggatttgcc catcatagga gtccgcattc cgtggccctt 3600 tccggtttca atcatctcct tttacctccc aaacgggcga atcccggatt tggaaaactg 3660 cttgagagaa atactcgatc aagtgccaag tcctgtaatc cttctcggcg acgcaaatgg 3720 acatcacatg gcatggggca gccgccacaa cgatgcccgc ggggctatca ttactggact 3780 agccggccaa tacgatctta cgattttaaa cgatggttct tccacattct tacgtgggaa 3840 caccgaaagt tctattgata tatctctcgt ttcggcccga ttaacaaacc gtcttctttg 3900 ggccgtggag caagatctcc acagcagcga tcattatcca atttctctaa cattggaagg 3960 tacatcaacc ccgcaaacga cacgccgacc caaatggatg ttcgaacaag cggactggcc 4020 ggccttccaa agtgaaatcg atgtcagatt agcctcctct cccccagagt cgcttgagga 4080 attcgctgat ctattacgct ccaccgcatc tcaaacaatt ccgcgaacca gtccaaaacc 4140 cggtcgccgg gcactctatt ggtggaatga ggacacacgg agagctgtca agatgcgccg 4200 gaaaaaatta agaaaactac ggaaactaca gaaaaaactt ccagaggagc atcctgatcg 4260 caccaaggcg caggaggaat atagcgcatc cagaaatggc tgccgtcaaa caatacgaaa 4320 tgcaaaagag gcatcttgga cagaattttt agacgggata aacgaggagc aaacggcatc 4380 cgaactctgg agaagaataa actgtcttca ggggaaaaga cggatgaaag gcctggccct 4440 aaaggtagac ggactgctca cgcgagatcc tgctagcata gcagatgccc tggcggacca 4500 attccaaaaa ctttcatctt tccagcgata ccctgaaagc ttcagaaact tgctgcaatc 4560 cccggaaacc gcaatacgtg attttccagt gcctcctaac aaaggacaaa gtttcaatct 4620 tgagttcaca attgatgaat tggagttcgc tctgaaaaaa tctaaaggca aatcagcggg 4680 gccagacgaa ataggatacc ctatgatcaa gtatctgccc cctagcggta aactggcact 4740 actgaaaggc attaatcgtg aatggcttta cggaactctt ccggagacat ggaaccacag 4800 tctggttatc ccgattccaa agaactcggg tccaccgtcg gatgcaggta gttatcgacc 4860 aatcgcactc accagctgcc tatccaaggt aatggagcgg atggtaaatc ggagattaat 4920 tagcttcctg gaagacaaac gcttattgga ccaccgacaa cacgctttcc gatcgggttt 4980 tgggacaagt acctacctgg ctgcactagg gcaaattttg gatgattcat ttcaaaaggg 5040 cgaacatgtc gaaatcgcct cgttagatct ggcaaaggcc tataacaggg cctggagtcc 5100 agcaattctc caaaagctcg ctaattggaa catctctggc aatatgttgg ccttcattaa 5160 gaattttttg aacgggcgta cgtttcaagt tctgattggg aatgaacggt caaaaatttc 5220 ttatgaagaa acaggggtgc cacagggatc tgtccttgct gttactttat ttttggtggc 5280 aatgagcgat gtattcttgg cactccctaa aggggtcttc attctagtat acgccgatga 5340 tatccttctc ttagtaagcg gtaagcatcc taagagtaca agaaggaagt tacaggcagc 5400 agtctcagct gtggaaagat gggccaaaaa ggtaggattc gatatggcag cggaaaagtg 5460 cgctcgcatt catatttgca actctaacca ccggcccccg ggaacagtct ccataaataa 5520 gaccccgata ccgatcaaaa atcgagtaaa aatccttggt gttaccctcg atcggaatct 5580 ttcctttcag gcccatttca agaacgtaaa ggtgtcctgc agaaaccggg tcagtttaat 5640 gaggagtata tcaacgaagc gcactcgaag taatcgaact acgctgaaga acgtgatgga 5700 cgccgtagta tgtagtcggc tattatacgg cattgaagta acctgcagga atttggatga 5760 gctaatcaag cacctagcgc cgatctacaa taagtgcata cgtcacgtgt caggactgct 5820 tccgtcaact cctgctttgt cagcatgtgc tgaattggga gtccttccct ttcggcataa 5880 aactatcctc gctctctgta accgtacgat agggtatcta gaacacacta aaaatcaagg 5940 accggtttgc ttcctagcaa agcaagcaaa cctggccctt gcatcggtgg ccgacgtgca 6000 gcttccaccg gtggctgggc tccaccgaag gggaccacga agctggacgg ccaaagcccc 6060 taaaactgat ggaaccatac aaaccagttt tcgtaaaaag gccacacctg agagagtgaa 6120 agcacacttc taccagagga tttcggaagc ctataagaac cacgaaatta gattcaccga 6180 cggttcgaag ttaggagatc aagtcggttt tggggtactg agcgaccaaa ttgaactatc 6240 ctacagctta ccaaatggca gctcagtttt ttcagctgaa gcagcggcca ttcatcaggc 6300 aatcatatcc ccaagtgaca agccacttct tattgtaacg gactccgcca gtgtcctggc 6360 tgcaattaaa tcaccaacta ataaacatcc atacatccag ggcatccagg tcgcactgga 6420 tcggaccaag agagaagtag tttttatgtg ggtcccgggc cattccggaa tcgctgggaa 6480 cgaaaaggca gatgcactgg cggggactgg tcgaaataga ccacgtctaa caagaaaaat 6540 tcccggagac gacatcaaga catggactaa aaacagggtc tggaatgctt ggtcccaaga 6600 gtggcgacag gaacgaacgc tattttgtcg taaggtcaaa aatacaatca atccatggaa 6660 agacctacct aacagaagag agcaggttgt tctatcaaga ctgaggactg ggcacagcag 6720 aatatctcat gacatgagtg gagaaacagc cgatttccgt cgacattgcg aatcgtgcgg 6780 agaacgatgt actatcgagc atgtcataag caactgtcca tctctcgagg atttaagaag 6840 gatacacgac atcaccaacg tggtcagagc gcttcaaaat gatccgtcat ttgagagaca 6900 gctgataaat ttcttgaagg atgcaagaat gtacgattgt gtttaagttg gaattccaca 6960 atagcagact agcagttaat tgctcaattc tacggctaag tggtgttcag ccaatacctc 7020 gatcgaacac acgaatcctc tatactaaca actccccttt cctttaaaaa gcaccgtaca 7080 aaacaacaca attattgtaa ctatcacact atcataagac actttgacaa ttggtgtatt 7140 ttgtaaagcc tttagtttca cactagtttt cacaataagt atctagactt aagcgatgct 7200 ctgtaataac aaaaaaaata tctgtggaac tctgtaatct ggtggtaccc ttaaggtagc 7260 tccagaattt gttttttttt ttccatatgg agatgaacca gccacgggct gaaaatctcc 7320 ctaataaagc taataataat aata 7344 // ID Gypsy-4_OD-I repbase; DNA; INV; 7920 BP. XX AC CABV01000256; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_OD_; KW Gypsy-4_OD-LTR; Gypsy-4_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-7920 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000256; Positions 10368 2449. XX CC Positions [3769-4245] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 4676..6628 FT /product="Gypsy-4_OD-I_3p" FT /translation="MKIQRIVFLLKLVLTQEKTQVTNTTLEASTATTTLPK FT AGILIEQDSLVYITDGDFNVLFSRTHTSPCSWERIDKMFSNLPQSNCIRNS FT YKHTAQSTKDIKNLYFLRCNELVNQELTAIAELVNNRRKRDTDEEDVVPTD FT EDYDTIQNYDYDNFTSRERRSSTSAVSFFAPLLSSFIPVIPTLFALYSAKR FT DNDIMRKSINDNTLAVSTLARAQMRLAAETQRKFCDSNFLTNNQMRGIIMK FT SEVRTYIDLIETEVLMLRMGRFPPRADIIKNLISICASIESNTVPFCRSVL FT LTSYENIQINFKGVSIFESNLNLVTEITFPLLSPVKVNSKNLIVRNVGGFS FT NETYFKIDLDQTYIQRQDGKLTYNLNMSLCKQRCCPINAISHLPSATCMQN FT ILEGTTAGCLRIEKEAPDCYFERTGSGSYLIAARQGTFFPQAKMSKVSDLV FT NTTLHVNETGRLRCFHEESNEFTTHYLHATTSRHFDARDSVSVDLNLNNTI FT LLDDVHDDLEDKILEIAARDDNMTLGDTRIHWGFIILLIHASLLLVLVIIY FT YMYRAWIKFTCPFVFKRREVNSDHDSLDNHFGSPIINMLTRASESVRASFR FT RKNNRSETHNVYPSIPSIDTLRIDTPGTDHAARQINTTSFRENGCRTVPKN FT RP" FT CDS join(238..924,928..4800) FT /product="Gypsy-4_OD-I_1p" FT /translation="MSITTEGMQLQSSIEAQDLTSWQTYRERLKEITGHSR FT EYYEDEFSQFQIGDRRPGQALAALTLSYKRARLYGNDELTEKDKKQICDQF FT VRGLKNPLKGFVSTQLPNLDFRSLGAYAERVMRGFELNKSTVAALPIDHGI FT VSEKPAVQSTTEKALLNAIEELKKQMPAQRGYKQSRGHSLFDTKKLAGYCY FT RHHVQKNCKNDPCHYSHEKASGDVSKYIDSLIAERKNKQSFGSGNSSPTSS FT PSPTITHLCSCPNAFSAPTFLKYTRVRIGQNTLTCMLDTGCSKTVILKSAL FT PPSCELTPSNTQLLCANEQRLQAETTKHAITIVFNDNLSEKLTPLVTNKLS FT CDIIVGVDCLKSFSYKENARTANVNGFAVDLIQPHDSSRIARACGLDESMN FT HPRLINANSTISVPVRNPFFGSEHQTVKLVPFDVPFPQCKNLEIIPTVHNN FT TKNISVAIKNNSKTDVSLAHGIPVCRIKPLDIEKVNGLTVVSNLDAERERT FT RQFQQRRSQKAEEIGFQPDIQDYGDVTEKQKEQLQALIQKYRLCFSMDETD FT LGCLANYRFTLPQKDEHLDAYEPPRPAPIHLRDQVKNEISRWEELGIIAET FT QSSYNIPLIILRKGDGSIRVSLDARKLNSILVQDRFPLPSMSEAFVQISNR FT LAKSKGNCFVTALDWFRGYWALQVDEKHRHKLAFSFNNKHYEARRMLYGTS FT TGPAAFSRVMSELFAGHPSIYIYLDDCLVIDTSFEEHIKTLTFILEKCLEY FT GILMSAKKIKLCRNSFDFLGHKITSNGIQSTDKHIKSIRDFPAPTDRSSLK FT RFLGLTNFNLKFVRNGSKILSPLYRLTSKSVPFDWNDEHQAAFDEIKLQLT FT SSEGLGHFDKDKKLTLVTDASGHAVGGTLYQTDGDRMITIGYFSRYLTGPD FT SRRSMRQKELLAVAYAIRSFEYYLLGQRFDLVTDHKSLVYLYREKRRQELD FT CKFTNIFHYLLKFDFEILHHPGESAIMASSDCLSRLEKKTAAEIEEEADKE FT EIPETLFSMLHLPQNFKDDTPKNVQELIYTYLETKQKDEQPPLFRFGTKSF FT GAEQLRDLQQECNKTKNLLVKVKLNARKFQRFKLVNEVLYRVTKTTKRLVI FT PEPLDTEFISYAHASMGHPGYHQMAQLFKKVYIHGCDEKLRDFLTKCTTCI FT RVKPRKMNRPGALKHKYYESEPFSKTHIDLVDYNKKDCENKRYFLSFVDSL FT TGYADGCAISAKTDSIIAKHLLQLILRNGATNVIVSDNGGEFSGPAVRNLL FT TRFNLRQCKTSAYYSKSNSVAERTHRELNKHLRLMDSTSSKWSLHIPEALF FT FTNQLPRQSLDGLSAFECIFGRSFLYPFSIDLRPTEKPVPFQKALQSYIDE FT LHPALLAHQVARHKKLLEGETPHKVPVLEKGDRCLVFRPNLKSSKLSLAWD FT GPYRVEAKRCAHSYVLKHTQTGRVFLRHISHIRPLPKKDVFSNQETPEDED FT EDAKIRNEITHLENENPTDSIFAQTGPNPRENPSHEYNLRSLYRNNNSSES FT RNTH" XX SQ Sequence 7920 BP; 2453 A; 1892 C; 1516 G; 2059 T; 0 other; actctacaca gcgaccaaga agatctcatc cagcaactta taagcaaagt cgaaagcctg 60 gaaaaatcgc aaaatctgtc atctcggttc gatcatgata aatcgctgaa ctacaaaacg 120 aaaatcaagt ttgatccgga ggcacctatt gaagacttca ttcagggagt agaagcctat 180 ggacgcgcca acggcgttgt agatgctcaa aaatatattg gcgtcgccca agctgcaatg 240 tcaataacga ctgaaggtat gcaactccaa tcaagcatag aagctcagga tctcacatct 300 tggcagacat atcgcgaaag actcaaagag ataactggac attcgcgtga gtactacgag 360 gatgaatttt cgcaatttca aattggcgac cgtcgacctg gccaagctct ggcagcgctc 420 acactctcgt acaagcgcgc gagactgtat ggcaacgatg agcttacgga aaaggacaaa 480 aaacagattt gcgaccaatt cgtcagaggc ctgaagaatc cactcaaagg cttcgtatct 540 actcaactcc cgaatttgga cttcagatca cttggagcat atgcagagag agtcatgcga 600 ggattcgagc taaacaagtc tactgttgct gcgttaccga tcgatcatgg aattgtctct 660 gaaaagcctg cagtgcaatc gacgacagaa aaggcactcc taaacgcaat tgaggaactc 720 aagaagcaga tgccagcgca acgaggatat aagcaaagtc gagggcatag cctgtttgat 780 acgaagaaac tggccggcta ctgctacaga catcatgttc aaaagaattg caagaatgat 840 ccttgtcact actctcacga aaaggcatct ggtgatgttt caaaatacat tgacagccta 900 attgcggaac gcaaaaataa gcaatgaagt ttcggcagtg gcaactcttc tcctacatca 960 tcaccttctc ctacgattac tcatctctgt tcgtgtccta atgcattctc cgctcctact 1020 ttcctgaaat atacacgtgt acgcattgga caaaacacct taacctgcat gctggatact 1080 ggctgcagca aaactgttat tttaaaaagt gctcttcctc cttcatgtga acttactccg 1140 tcgaatacac agcttctctg cgcaaatgag caacgactac aagcggaaac tactaaacat 1200 gcaatcacga ttgttttcaa tgacaacttg tccgaaaaac tgacgccgct tgtaacaaac 1260 aaactctcat gtgacatcat tgttggcgtc gattgtctca aaagcttcag ctataaagaa 1320 aatgctcgaa cagcaaatgt aaatggcttc gccgtagacc tgattcaacc acatgacagc 1380 agccgaatcg cgagagcctg tggacttgac gaatcgatga atcacccaag actgattaat 1440 gcaaattcca cgatctctgt ccctgttcgt aatcctttct tcggttctga gcatcaaact 1500 gtaaaacttg tgccgttcga tgttccgttt ccacagtgca agaacttgga gatcatccca 1560 acagtacaca acaacacgaa aaatatttcc gtagcaataa agaacaactc aaaaacggat 1620 gtctccctgg cacatggcat tcctgtttgc agaataaagc cgctagatat cgaaaaggta 1680 aatggactga cagtagtgtc aaatttggat gctgaacgtg aacgaactcg acaattccag 1740 caaagaagaa gccaaaaagc ggaagaaatt ggatttcaac cggatattca ggattacggc 1800 gacgtcactg agaaacaaaa agaacaactt caagcgctta ttcaaaagta taggctatgt 1860 ttctctatgg acgaaactga tcttggatgt ctggcaaact atcgatttac tcttcctcag 1920 aaagatgagc atcttgatgc gtacgagcct ccccgacctg ctccgattca tctcagagat 1980 caagttaaaa atgaaatctc tcgctgggaa gaacttggaa tcatcgctga gacccaatcg 2040 tcctataata ttccgcttat aatcttacgc aagggagatg gaagtataag agtgtcactc 2100 gatgctcgaa aactcaactc catcctcgtc caagatcgat ttcctttgcc aagcatgtca 2160 gaggcttttg ttcaaatcag caaccggctc gcaaaatcaa aaggcaactg ttttgttact 2220 gcacttgact ggttcagagg ttactgggca cttcaagttg acgaaaaaca cagacataaa 2280 ctagcgttct ctttcaataa taaacattac gaggctcgaa gaatgctata tggtacgtca 2340 actggccccg ctgctttcag ccgtgtcatg tctgaactat tcgcaggaca tccgtctatt 2400 tatatctatt tggacgactg ccttgtcatc gacacctcat tcgaagaaca catcaaaacg 2460 ctgacgttca ttttggaaaa atgccttgaa tatggaatcc tgatgtcggc aaagaaaatt 2520 aaactctgtc gcaactcgtt tgacttcctc ggacataaaa taacatctaa cgggattcaa 2580 tcaacagata aacatatcaa gagcattcgt gactttccag cgccgactga ccgatcttca 2640 ctaaaacgct ttcttggact aacaaatttc aaccttaaat ttgtacgcaa tggatctaaa 2700 attctatcgc cactctatcg attgacttcc aaatctgtcc cgttcgactg gaatgatgag 2760 catcaagctg cattcgacga gataaaacta cagttaacaa gttcggaagg ccttggacac 2820 tttgacaaag acaaaaagct gactcttgtt acagatgcat ctggacatgc tgttggtgga 2880 actctctatc aaactgatgg cgatcgaatg attactattg gctacttttc gcgctattta 2940 acgggcccag actctagacg cagcatgagg caaaaagaac tgcttgctgt tgcgtatgca 3000 atacgctcgt ttgaatatta cctacttgga caacgcttcg atcttgttac ggatcacaag 3060 agtttagtct atctttatcg agaaaaacgc cggcaagagc tcgactgtaa atttacaaat 3120 attttccatt atcttctgaa gttcgacttc gaaatccttc atcatcctgg tgaatctgcg 3180 atcatggctt ctagcgactg cttgtcaaga cttgaaaaga agactgctgc ggaaattgaa 3240 gaagaagccg acaaggaaga aattccggaa acccttttca gtatgctcca cctgccgcaa 3300 aatttcaaag acgacactcc aaagaacgtt caagaactga tatacacata tttggaaaca 3360 aaacaaaaag atgaacaacc tcctctcttc cggttcggaa caaaatcttt cggagctgaa 3420 caactgcgcg atcttcaaca agaatgcaac aaaacgaaaa atctgctggt taaagtcaaa 3480 ttaaatgcac ggaagttcca acgattcaaa cttgttaatg aagttctgta tcgagtcacg 3540 aaaacgacta aacgactcgt cattccagag cctttggata ccgagtttat aagctacgca 3600 cacgcttcta tgggacatcc tggatatcat caaatggcgc aacttttcaa aaaagtatac 3660 attcatggct gtgacgagaa acttcgtgat tttctaacga aatgcaccac ttgtatacgc 3720 gtcaaaccgc gaaaaatgaa tcgccctggc gcgttaaaac acaaatatta cgaatcagag 3780 ccgttctcga aaactcacat tgatcttgta gactataaca aaaaggactg tgaaaacaag 3840 cgatatttcc tgtccttcgt cgacagtttg actggatacg ccgacggatg cgcgatttca 3900 gcaaagacag actcaataat agctaaacac ctactacaac ttatcttgcg aaatggcgct 3960 acaaacgtta tcgtatccga taatggaggt gaattctccg gtcctgctgt taggaactta 4020 ctgacacggt tcaatcttcg acaatgcaaa acctctgcat actacagtaa gagcaactcc 4080 gtggcggaac gtacacacag agagctcaac aagcatctac gtcttatgga ctcaactagc 4140 tcgaaatggt ccctgcatat tccagaggcg ctctttttca caaatcaact tccacggcaa 4200 agtttggacg gactgtctgc gtttgaatgc atcttcgggc gttctttcct ttacccattc 4260 agtatcgatc ttcgcccgac agaaaagcct gtgccgtttc aaaaagccct gcaatcctat 4320 attgacgagc ttcatcctgc cctactagcg catcaagtcg cgagacacaa aaagctgcta 4380 gaaggcgaga caccgcataa agtcccagta ctcgaaaaag gagaccggtg ccttgttttc 4440 cgaccgaacc tgaaaagttc caaattaagt ctggcttggg acggtccgta ccgcgtcgaa 4500 gctaaacgat gcgctcacag ctatgtgcta aagcacacgc aaacaggccg tgtttttcta 4560 aggcacatct ctcatatcag acctttgcct aaaaaggatg ttttctccaa ccaggaaacg 4620 ccggaagacg aagacgagga cgcaaaaatc agaaacgaaa tcactcactt agaaaatgaa 4680 aatccaacgg atagtatttt tgctcaaact ggtcctaacc caagagaaaa cccaagtcac 4740 gaatacaacc ttagaagcct ctaccgcaac aacaactctt ccgaaagcag gaatactcat 4800 tgagcaggac tcactagttt acattactga cggagacttt aatgttttgt tctcaagaac 4860 acacaccagc ccttgctcat gggaacgtat tgacaagatg tttagtaact taccacaatc 4920 caattgtatt cgaaactcgt acaaacatac cgctcaatca accaaggaca tcaagaatct 4980 atacttcctt cgctgcaacg aacttgtaaa tcaagaactt accgcgatcg ccgagctagt 5040 taacaacagg agaaaaagag acaccgacga agaagacgtc gtgccaaccg atgaagacta 5100 cgatacgatt caaaactacg actacgacaa cttcacatct cgcgaacgcc gatcttccac 5160 ctctgctgtt tcattcttcg ctccattgtt aagctccttt atcccggtaa tacctactct 5220 ttttgcactc tatagtgcta aacgtgataa tgacataatg cgtaaatcta ttaatgataa 5280 tactctggct gtttccacat tggcacgagc ccaaatgcgc ctcgcagcgg aaacgcagcg 5340 aaaattttgc gattcgaact ttttaactaa taaccaaatg cgcgggatta tcatgaaatc 5400 ggaagtacga acttacattg atctaattga aaccgaagtc ctaatgttac gcatgggtcg 5460 ttttcctcca cgggcggata taattaaaaa tcttatttca atttgtgcgt ctatcgagtc 5520 aaatacggtc ccgttttgtc gttctgtgtt actgacgagt tacgaaaaca ttcaaattaa 5580 cttcaaagga gtatcaattt tcgagtcaaa tttaaatttg gtaactgaaa taacatttcc 5640 tttgttgtcg cccgtcaaag tcaactcgaa aaatcttatt gttcgaaatg taggcgggtt 5700 ctcaaacgaa acatatttta aaatcgacct cgaccagact tacatacaac gacaggacgg 5760 caaacttacg tacaatttaa acatgagtct gtgcaagcaa agatgctgcc ctataaatgc 5820 catttcacat ttaccaagcg cgacatgtat gcaaaatata ttagaaggaa ccacagcagg 5880 ttgtcttcga attgaaaaag aggcgcctga ctgctatttc gaaagaactg gttcgggatc 5940 ataccttatc gctgcgcgcc agggtacctt tttcccacag gcaaaaatgt caaaggtgtc 6000 ggacttagta aacacgacct tgcacgttaa tgaaacaggt agactccgtt gtttccatga 6060 ggaatcaaac gaatttacaa ctcattatct gcacgcgact acgtccagac attttgacgc 6120 acgtgactct gtttctgttg atctaaatct aaataacact attttacttg atgatgtgca 6180 cgatgactta gaggacaaaa ttcttgaaat tgccgcccgg gatgacaata tgacccttgg 6240 tgacacgcga attcattggg gatttataat tttgttgatt cacgcatccc tattgcttgt 6300 actcgtgatt atttattata tgtatagagc ctggataaag ttcacatgcc cctttgtttt 6360 caaaaggcga gaagtgaact cagatcatga tagcctagat aatcactttg gtagcccaat 6420 cattaatatg ttgactcgcg cttctgaaag cgtgcgggct tcctttcggc gcaaaaataa 6480 tcggtcggaa acgcataatg tgtatccgtc aatcccttca attgacacgt tgagaataga 6540 cacaccgggc actgatcacg ccgcacgaca aataaacaca acctcttttc gcgaaaatgg 6600 ttgtcgtacc gtacccaaaa atcgaccgta aaaaaaaaca catcgaatca acaaaattat 6660 ttttctgcca aacgacgacc aaaaagatca cgcaagctta tctttgttgg cttatccctt 6720 attaatccat aagcacagcg tgtactagaa tataataaaa aaaaaatacg cggaaaatac 6780 aaacaccttg tgaaatctgc aatttgactc atattttcga aaaacgcact acctcggagc 6840 tgcgattggt cgagtgattt ttgtgtaatt tgaattcgcg caagcaaacc aatttttaca 6900 cagatcacct gccgcgggtg cagctatatc cggtcatgaa caccgtcacc gagttgacgc 6960 ccattacgga cgacaggtca ttactcaagt agatctcctc cttctaggat tagtagctgg 7020 gggcggaccg ttttccccta gggtcctaac tcgtaatgag tcgactacca tatcctccgg 7080 ataatcggaa atgcaaaaaa gagtaagaca aagtgtagtg cgttccccga aaaatgagtc 7140 acaaagcggg ccccacaaag aacttggaaa tttcgcgtat ccccttttta atagtttttc 7200 tacgctgagc ctattggttt attaaaacaa ataagcaatt caaaaattat gctcgtgatc 7260 ggatctactt tattatccct taccccaacc aaaacttacg tttttgagcc ggcgctcacc 7320 ctagcgcgct cttgaaccgc ggtccgcttc gcttcgctcc gctccgatct cttatcgcat 7380 ggcgcgcggc gctccgctgt ctctttgcga tcacctgctg tgatacgcgc gggctcgttg 7440 caccaccagc accaatagat ccttatcagt tggtgtacat cagcctccaa aatggcaaca 7500 taataattct aattctcaat ttcgcattat ttcaattttc gcgttcagca attttacttt 7560 atgctactca aaaatataat ttatcgctta aaatttacga ttttatctca aaactgtatg 7620 ttgccacatg cctgacctct tcccaacgta tattcctctc agcttctgcg tataatctct 7680 tataagcttc tgcgtatatt caattttctt ttaaactctc aagttttcaa aattaaaaat 7740 gtcaaaattt caaaatttga ccgtccgcaa cttttaattt ttcattctga cgggatctcc 7800 gtcaaatttc acccaccgac tgcacaacta ggtcaaccaa atgttgacct tccttcttat 7860 cgcccgctgc ttctcccgca gccggaaccg tcttcaaaag taaaggcggg atatatagag 7920 // ID BEL-81_AA-LTR repbase; DNA; INV; 428 BP. XX AC supercont1.280; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-81_AA_; KW BEL-81_AA-I; BEL-81_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-428 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.280; Positions 1388354 1387927. XX SQ Sequence 428 BP; 158 A; 61 C; 81 G; 128 T; 0 other; tgttgctggc aacgctgtca ctagagagtc ccccggccag cttacctagt gtgcgtataa 60 atggtagcac aggaatgtta aatgaactgt caaacacgaa atgtcataac ttcgtaattt 120 attgagtcat tggatttatg aaatttactt gaatactgtt tattacgtaa gacttacaaa 180 ataatgaatt tatatgctac aagttgaaat tatttaaatt attgcagaag aaaagcagaa 240 atagttattg gtagagatat ttgttggtag atttgaattg tgcgaaaact aaagggagaa 300 taaatgtaag gtaaatgaaa ttccatgtaa acaaaacaat aacctaaaat aaatttacag 360 ctttgagcta tcttgatcgg gaacgcagac gagttgctac taagatctcc gaaataattt 420 caccaaca 428 // ID BEL2-LTR_Dpse repbase; DNA; INV; 207 BP. XX AC Unknown_singleton_95; XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_Dpse; KW BEL2-I_Dpse; BEL2-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1014-1014 (2009). XX DR Genome; Unknown_singleton_95; Positions 60282 60488. XX SQ Sequence 207 BP; 65 A; 59 C; 44 G; 39 T; 0 other; tgaagcgttc gtattaccac agataatagc tgaccaacca gctcaggcac tcgatccaga 60 gagcttaaac ataccaagca ggatctcact ggcagatcca aacttccata cgccaggcaa 120 aatcgacatg ctaatcggag ccgagcatta ttactcgttg ctgctaccag atcaacagca 180 gctaaggaca ggaggcccgc tgctaca 207 // ID TANDREP_TG repbase; DNA; INV; 529 BP. XX AC AF146527; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Toxoplasma gondii repeat region. XX KW TANDREP_TG; tandem repeat. XX OS Toxoplasma gondii OC Eukaryota; Alveolata; Apicomplexa; Coccidia; Eucoccidiorida; OC Eimeriorina; Sarcocystidae; Toxoplasma. XX RN [1] RA Homan L.W., Vercammen M., De Braekeleer J. and Verschueren H.; RT "Identification of a 200- to 300-fold repetitive 529 bp DNA RT fragment in Toxoplasma gondii, and its use for diagnostic and RT quantitative PCR."; RL Int. J. Parasitol 30(1), 69-75 (2000). XX DR Genbank; AF146527; Positions 1 529. XX SQ Sequence 529 BP; 116 A; 88 C; 190 G; 135 T; 0 other; ctgcagggag gaagacgaaa gttgtttttt tatttttttt tctttttgtt tttctgattt 60 ttgttttttt tgactcgggc ccagctgcgt ctgtcgggat gagaccgcgg agccgaagtg 120 cgttttcttt ttttgacttt tttttgtttt ttcacaggca agctcgcctg tgcttggagc 180 cacagaaggg acagaagtcg aaggggacta cagacgcgat gccgctcctc cagccgtctt 240 ggaggagaga tatcaggact gtagatgaag gcgagggtga ggatgagggg gtggcgtggt 300 tgggaagcga cgagagtcgg agagggagaa gatgtttccg gcttggctgc ttttcctgga 360 gggtggaaaa agagacaccg gaatgcgatc cagacgagac gacgctttcc tcgtggtgat 420 ggcggagaga attgaagagt ggagaagagg gcgagggaga cagagtcgga ggcttggacg 480 aagggaggag gaggggtagg agaggaatcc agatgcactg tgtctgcag 529 // ID BEL-593_AA-I repbase; DNA; INV; 6814 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-593_AA_; KW BEL-593_AA-LTR; Pao_Bel_Ele54; BEL-593_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6814 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2674-3324] - Reverse transcriptase CC Positions [5785-6363] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1477..6732 FT /product="BEL-593_AA-I_1p" FT /translation="MLTRTTRSGDSLKALNSKLKGLKASLNNISQFVKNFK FT ETTTASQVNVRLDRLDLLWQQISETVWEIQAHDDYEEQEGFEKEQLWYENC FT FYDAKSFLTEKAQEFRDESASNHSSRSAGGSHSSMERVRLPQIKLQTFDGN FT IDDWLSFRDLYTSLIHQKPDLPEVEKFHYLKGCLAGEAKALVDPLKITGDN FT YKIAWELLLKRFNNSKLLKRKQVQALVKLPTLTRESVTDLHQPVDGFDRAI FT QTLDQVIEPAEYKDLLLVELLSSRLDPVTRRGWEEHTSAKEQDTLKDLQDF FT LQRRIQILESLPAKPASRNEQQPLKRNTSSSKVSHNAVQWNNAKCPSCAEI FT HGLHTCPAFLKMSLTSREAFLRAQSLCRNCLKRGHLARECSSKFSCRNCKA FT RHHTLLCFKGERKDSGNQQSEKENSRKNNDDTASGSKAANASSVETASSNS FT AQRSASQVLLATAVVVVEDNQGSRYSARALLDSGSECNFISEGLSQLMKVS FT SQKVDIAISGIAQTGTRVTRKIQAVVKSRNSIYSQPMDFLVLPKVTACLPT FT STVQVDGWNIPADVELADPEFFRSRRVDLVLGIQAFFSFFPTGRKITLGSG FT LPALTESVFGWIVTGEVAKSGHSAKYTCNLSVSGKLEELMARFWSCEEVGA FT VSKLSPDETRCEEQFERTFQRNSDGRYVVTLPKNNEVLAELGESRDIALRR FT LRSVERRLARDPDLQRQYYDFMAEYLELGHMKKVEVSEEETVKRCYLPHHP FT VIKESSTTTKVRVVFDASCKTSSGISLNDALYAGPVVQQDLRSIVLRSRIR FT QVMVVADVEKMFRQIWTNTSDTPLQCVMWFEPDGSVVTYELLTVTYGTKSA FT PYLATRTLKQLADDERERFPLAVSAVQEDVYMDDVISGADDVDSAVELRRQ FT IDAMTSSGGFKLRKWASNSPTVLEGIPKENLALPDGINWDQDAEVKTLGLT FT WLPNVDCFKFAFSLPQITDDQILTKRQVLCYIAQLFDPLGLLGATITAAKI FT FMQRLWAMKNEANQSLQWDDPLPVTVGEEWRTFHKQIPVLNEIRIQRCVVV FT PEAVSIEFHCFSDASILAYGTTIYVRSERQDGTVSVHLLTSKSKVAPLKVQ FT SLPRLELCGALLAAQLWEKVAESLKAKGSVWFWTDSTCVLQWIRSPPGTWT FT TFVANRVAKIQALTEGSEWNHVPGLSNPADLISRGIAPKDIVENRRWWHGP FT DWLMEAPEEWPKGVGYSPEEDLERRRNMLICASSEEKEFISDYVARFSSLS FT KLIRTTAFALRFLGNLRCSKDQKRTGFLTTEELEEAERVIIRKIQEGSFVE FT ELKNLSSGSVVGRKSPLRWFHPRIDGNGILRVGGRLEHSDETFQVKHPMVL FT PARHPFTELLFRYYHDKHLHAGPQLLLGTVRQRYWPLGGRNIARKVVRQCL FT RCFRIKPSAIQQQMGELPAARVTVSRPFSKTGVDYFGPVYLRAGRGRQPTK FT AYVAIFVCMATKAVHMELVSDLSTERFLQALRRFIARRGRCSDIFSDNGTN FT FVGASNQLRELAKLLKEKEHREKVSKECANEGIYWHFNPPNAPHFGGLWEA FT AVRSAKFHLLRVLGGNPVTHEDFVTLLAQVEACMNSRPLTALSNDPQDLEA FT LTPAHFLTGGSLLAMHDANLGDVQINRLSRWQLVQRQLQDFWKRWRKEYLS FT QLQGRMKNWQPAVKMQIGQLVVVVDANQPATQWKMGRIQELHPGDDGVTRV FT VTVRTSTGVLKRPVARLCLLPLKRSEDTSDRV" XX SQ Sequence 6814 BP; 1736 A; 1617 C; 1809 G; 1652 T; 0 other; taagttggtc cttcgagccg gatcctctac accctgtgga ccacggatat cgaacgattt 60 gaatatcggt ttgcgccatt gcgctaagca acgacctaca caaaggtcat aaggaagcca 120 tcgcaatctg gagacctcgt ctaaaggatt ctcgccattt cgtgattgaa tcaaatgatc 180 ctgtcgccat cgccgaggta gtgaaaggat cgcctttgcc gccaaataca ttcgccatca 240 tcgtctaacg aaatcaccgc cgagtgagaa gttttcgcca tcgcgcatca gcatcgagtt 300 acattaccat cagcagcacc atcgcaacag ctatcaaata acgccttgga taatcggaag 360 gattgtagga caaccagagc gttgttctac taccggtcag gtaggttaga acgagtatat 420 gtccaccgaa gcgtttggtt ttcgccacta acacccaccc tacgatttct gtgctttgct 480 ggtgaaccaa atcgtcgatc gctgacggca tcggagtggt gtatcagctc atggagagct 540 tgatttcggg caccttccct gccggtcgaa gcttatccag cctctgagag gacatcctgt 600 ggtcgtccag cgaccacgaa ggcgatttga gcaactacac gaccagacga agacgacaac 660 tgcaacgttc ggacttgggc atcgctcgac ggtcatcgtg gccaggtaaa tgcaatttta 720 cagtatatgc ccagccgaag ctggactcct tttgattcta cactgaataa attattcgtt 780 ggtgcacaat ttggactgcc tgagctcaaa ctcattcaac aaacgcaagc cctggggttt 840 tggtttcaat ttgctgtctt ttcgatccgc gcgacgttcg gcgacatcta aagttctcca 900 ggtgagacat aagtatatgc cctgccgaag caggcatttt tgcgatacta catttgcaat 960 ttcctccatc gctgcattgg actgatcaat ctactcgttg gctaccgtac cttgatgcga 1020 cgttgagacc tggttgctga aaatattgaa acgttaagtg atttgaactt ggctgttggt 1080 gcagtctgtc gttttacagg taaagcgaac atgtatatgt tcagccgaag ctcaaattta 1140 gattattaca cctggctttc catttcctcg ctattcactg gctccaattc aacttgaggc 1200 tgattgaaca accgattgga accgagtctg ccgattgaga gactgcgcgt atttggattt 1260 atcactttgg cttcggtggt accactgttt cttctcctgg gtgagtttga acagtatatg 1320 ttcagccgga gtagataatt tgcatactac atcgccgatt atcttcaatt ctcggatttc 1380 caacgttcat catcactgtg acgggagtga cgctgctcaa gtgccgactt ttcattggtt 1440 catcggtttt gtgaccagat tccaccatcc gctacgatgt taacgagaac cacaaggtcg 1500 ggcgattccc tgaaggctct gaattctaaa cttaagggtc ttaaggcatc gttaaacaac 1560 attagccagt ttgtaaagaa ttttaaggaa actactacag ctagtcaggt caatgtacgc 1620 ctcgatcgat tagatttgtt gtggcagcag attagcgaaa ctgtgtggga aattcaagcc 1680 cacgacgatt acgaggaaca ggaagggttc gaaaaggagc agctgtggta cgagaattgc 1740 ttctatgacg cgaaatcctt tctgacggag aaggcccagg aattccgaga tgaatctgct 1800 tcaaatcatt cttcgcggag tgctggtggc agccacagct caatggaacg tgtacgtctg 1860 ccacaaatca agctccagac cttcgacggg aacatcgatg attggcttag tttcagggac 1920 ttgtacacat ccctgattca ccaaaagcct gatctaccgg aagtggaaaa gttccactac 1980 cttaagggat gcttagcagg tgaagctaag gcgctagtgg atccgctaaa gatcactggt 2040 gacaactaca aaattgcttg ggaattgctg ctaaagcgat tcaacaacag caagctactg 2100 aagaggaagc aagtgcaagc actagtgaag ctgcccacgc tcaccagaga atcagtcacg 2160 gatctccacc agccggttga tggattcgac cgtgcgattc agaccttgga tcaagtcatc 2220 gaaccagcgg agtacaagga cttactgctg gtagagctgc tgagttctcg tctggatcca 2280 gtcactcgac gtggctggga ggaacatacg tccgctaagg agcaagacac gctgaaagat 2340 ttgcaggatt tccttcaacg gagaattcaa atccttgagt ccctgccagc gaagccagcg 2400 tctaggaatg agcagcagcc actcaagcgg aacacgtctt cttcgaaggt cagtcacaat 2460 gccgtgcagt ggaacaacgc taagtgcccc tcgtgtgcgg agattcatgg actacatacc 2520 tgtccagcct tcttgaagat gtcgctgacg agtagagagg catttttgcg ggcacaatca 2580 ctttgccgca actgcctcaa acgaggacat ctggctcgag aatgttcgtc caagttctcg 2640 tgtcggaatt gtaaggctcg tcatcatacg ctgctgtgct tcaaaggtga acgcaaggac 2700 agtggaaacc agcaatcgga gaaggaaaac tcccggaaga acaacgacga tacggcttct 2760 ggttccaaag cagcgaatgc atcatcggtt gaaacagcat cgtcgaattc ggcacaacgt 2820 tccgcatcgc aagttcttct ggctacagct gttgttgtgg ttgaggacaa ccagggttct 2880 cgctattcag cacgtgcact tctggattcc ggatcggaat gcaacttcat ctctgaaggt 2940 ttgagccagc tgatgaaggt ttctagccag aaggtggaca tcgcaatatc gggtatcgca 3000 cagactggaa caagagtcac gcgaaagata caagctgtcg tgaagtctcg aaattccatc 3060 tatagtcaac cgatggattt tctcgtgttg ccgaaggtca ccgcctgtct gccgacctct 3120 acagtgcagg tcgacgggtg gaacattcct gctgacgtag aattggcaga ccccgaattt 3180 ttccggtcaa gaagggtcga cttggttcta ggtatccaag ccttcttcag cttctttcca 3240 acgggaagga agatcacgct aggaagcggc ctaccagcac taaccgagtc tgttttcggg 3300 tggatagtga caggcgaagt cgctaaatct ggtcatagtg cgaagtacac atgcaacctg 3360 tccgtatcag gcaagctgga agagttaatg gcacggtttt ggtcctgcga ggaagtagga 3420 gccgtcagca agctctctcc ggacgagacg cgttgcgaag agcaattcga gcgcactttc 3480 cagcggaatt cggatggtcg ctatgtcgtt actcttccta agaacaacga agttcttgcg 3540 gagctaggcg aatcacgaga catcgctctg agacgtctgc gatcagtgga acggagactg 3600 gcacgggatc cagatcttca aaggcagtat tacgacttca tggccgaata tctcgagctt 3660 ggtcacatga agaaggtgga agtcagcgaa gaggagacgg tcaaacgatg ttatctaccg 3720 catcatcctg taatcaagga gtcgagcact acgacaaagg tgagggtcgt cttcgatgcc 3780 tcctgcaaga catcttccgg gatttcactc aacgatgctc tttacgctgg acctgtggta 3840 caacaggatc tacggtcgat tgtcctccgc agtcgtatac gtcaggtgat ggtggtagcc 3900 gacgttgaga aaatgttccg gcagatttgg acgaacacaa gcgacacacc ccttcaatgc 3960 gttatgtggt ttgaacccga tggaagcgtc gtgacgtatg agctgctaac ggtgacctat 4020 ggtactaaat cggcaccgta tctagctact cgtacgttga agcagttagc ggacgatgaa 4080 cgagaacgtt ttcccttggc agtatcggca gttcaggaag acgtttacat ggacgacgtt 4140 atttctggag cagatgacgt cgactctgca gtcgaattga ggcgacaaat cgacgcaatg 4200 acgagcagcg gagggttcaa gctacgaaaa tgggcttcca acagtcccac tgtgctggaa 4260 ggtattccca aggagaatct ggcactacca gacggaatca actgggacca agatgcggag 4320 gtgaagacgt tgggactgac ttggcttccc aacgtagact gcttcaagtt cgcgttctcg 4380 ctgccacaaa tcacggatga ccagattctg acgaagcgac aggtgctctg ttacatcgca 4440 caattgttcg atcctctggg actactcggt gcgacgatca cagcagcaaa aatctttatg 4500 cagcgacttt gggcaatgaa gaatgaagct aatcagagcc tgcaatggga tgacccgtta 4560 cctgtaacgg tgggcgagga atggcggacg ttccacaagc aaattccagt cttgaacgag 4620 attagaattc aacggtgtgt ggttgttcct gaagcggtgt ctatagagtt ccattgcttc 4680 tcggacgcat cgattttagc ctacggtaca accatctacg ttcggagtga gcggcaggat 4740 ggtacggtgt cagtgcacct tctgacatct aagtcgaagg tggcacctct caaagtgcaa 4800 tctctaccac ggttggagct atgtggtgca ctcctggcgg cgcagctgtg ggaaaaggta 4860 gcagaatcgc tgaaggcaaa aggcagcgtg tggttctgga cggattcaac gtgcgtcctt 4920 caatggattc ggtctcctcc tggaacgtgg accaccttcg tggcgaacag ggtcgccaaa 4980 attcaagctc taacggaagg aagcgagtgg aatcacgtac ccggactctc caatcccgca 5040 gatctaatct cgcgaggaat cgcaccaaag gatatcgttg aaaatcgcag atggtggcac 5100 ggtccggact ggctgatgga agcgcccgaa gaatggccga aaggggtagg atattcacca 5160 gaagaggacc tggaaagacg acgcaatatg ctgatttgtg cttcgtctga ggaaaaggag 5220 ttcatatcgg actacgtagc aagattttcg tcactttcga agctaattcg tacgactgcc 5280 ttcgcgctac gcttcctggg caatttgcgg tgctccaagg atcaaaaacg gactggattt 5340 ctcaccactg aggaattgga ggaagctgaa cgagtgatta tccgtaaaat ccaagagggg 5400 tcgtttgtgg aagagctcaa gaacttgtcg tcgggctccg tcgtgggtcg aaagtcccca 5460 ctaagatggt tccatcctcg catcgatgga aatggcatcc ttcgagttgg tggccgctta 5520 gagcattcgg atgaaacttt ccaggtcaaa catcccatgg ttttgccagc caggcatcca 5580 ttcacagagc tgttgtttcg ctactaccac gataagcatc tccacgctgg accgcaatta 5640 cttctgggta cggttcggca gcgatattgg ccgcttggag gcagaaatat cgctcgaaag 5700 gtagttcgtc agtgcctacg atgtttccga attaaaccat cggcaattca gcagcaaatg 5760 ggtgagctgc cagcggctag agtgacggtg tctaggccgt tttccaagac tggcgtagat 5820 tatttcggtc cggtgtacct acgagcagga cgcgggcgtc aaccgacgaa ggcctacgtc 5880 gcgatcttcg tttgcatggc aacgaaggca gtgcatatgg agctggtttc ggatttatcg 5940 accgaacggt tcctgcaagc gctgcggagg ttcattgctc gtcgaggaag gtgctccgat 6000 attttctcgg acaacgggac gaattttgta ggcgctagca atcaattgcg ggaactggcg 6060 aaattgctga aagaaaagga acatcgtgag aaagtgtcca aggaatgtgc gaacgaagga 6120 atttattggc attttaaccc gccgaatgcc ccacactttg gtggattgtg ggaagcagct 6180 gtccgttcgg cgaagtttca tcttctgcgg gttcttggtg gtaatccagt tacacacgag 6240 gactttgtga cgttgttggc acaggtggaa gcgtgcatga attctaggcc tctaacagcg 6300 ctttctaatg atcctcaaga tttggaggct ctgacgccgg cacacttcct tacaggagga 6360 tcgcttctcg cgatgcacga tgctaatttg ggagacgtcc agataaatcg actttccagg 6420 tggcagttgg tccagcggca gctgcaagac ttttggaaac gatggcggaa ggagtatttg 6480 tcgcagcttc agggccgaat gaaaaattgg cagccagcag ttaaaatgca gatcggccag 6540 ctcgttgttg ttgtcgatgc aaatcagcca gccacacaat ggaagatggg tcgcattcaa 6600 gagcttcatc ccggcgatga tggagtaact cgagtggtta cagtacgaac atcgactggg 6660 gtactcaaac gtcctgtggc aagattgtgt ttgttaccct tgaaacgttc ggaagacacc 6720 tcagatagag tttgatgcca gtagtccagt gcaccgcgag gattttattt atttactttc 6780 agaagtttcc cggcaacttc agggtgggcg agaa 6814 // ID Gypsy-17_RP-I repbase; DNA; INV; 4917 BP. XX AC ACPB02041249; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_RP_; KW Gypsy-17_RP-LTR; Gypsy-17_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4917 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02041249; Positions 6058 1142. XX CC Positions [2058-2564] - Reverse transcriptase CC Positions [3782-4093] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 180..3806 FT /product="Gypsy-17_RP-I_1p" FT /translation="MSKLGDYQAIAANISNAALSTIVSFHKFIFSEEGNRN FT KRKRIKEFTGFNFLKDSESYKDKFSWIKENLKISELYKICALLNLECEDDF FT DDMTITLLDCLIDLALLAPKKKVEEDSDSEHEETEAEGKSYNEYVDAEVTT FT TKKEKANKFVMSFRDVQDSIRSFSGENDYSIEAWASDFEELSDIMEWSDIQ FT KLVFAKRSLTGLAKLFVQSEKGLCTFNKLKAALLEEFSHRISSADLHRLLS FT ERRAKKNESLQQYYFTMREMASRGSVEPKALIQYIVDGIPDAVNNKITLYE FT AKSLKELKDKIIIYESIREKLENKQKPSALTFKTEGSKASKFTQNNKCYNC FT NGEGHRAAECPRPRRERGSCYTCGEKGHLQNSCPNRKRSAGTSVAASTSKV FT VGIVAEKPPLPPYYTSLVIDGSRVQSEVEAVIDSGSCISLIRNDMVESDRI FT LVAPSNIKQYEGINKSKLSIIGIFCTKVFLNSCNIFLDINFYVVNNNAIPC FT SCLLGRDFISNKQICITFSDTVSIVASKHNLDIKDESYDSNVQALMLINNN FT VVTSEKEIILNQNLSIENKLSVEKLIKNYLGADNTVNNKSVIHPEVEIELV FT PGHKPFNFSPRRLAYTEKIELQKLIDDLLSRGIIRESRSPYASPIVLVRKK FT TGELRLCVDYRELNKVTIKDRFPLPLIDDQLDRLQGKKYFSKLDLKSAFHQ FT IRVAEKSVKYTSFITPLGQYEYLRMPFGWCNSPAIFARFINQIFKYLILDS FT KVCVFLDDILIATETIDNHLTILKDVLGLLEDNKLELRLDKCVFLMTSINY FT LGYSVDANGISPCSEHIDAISNYPVPRNSKELHSYLGLISYFRRFVPNFSI FT LASPLYDLIKKGIEFKFNAEHIKCFNNLKKLLVTPPVLSIYSPTAETQLHC FT DASSLGFGSVLMQRQGDDGKFHPVAYFSKRTSDAEKRYHSFELECLCLIYS FT IKRFHIYLFGIPFKIFTDCQSLKLTLDKKDICPRIARWALMLQNYNYQIVH FT RSGKGMGHVDALSRCQNVLVLHENTLEQNLAFQQSQDKAIIKIRDYLEHNE FT HNKFELNNGLVYRKKDKKILFFVPESMENSVIKIYHDEMGHFGVDKVIELV FT TRSYWFPHMKDKIKNYVRNCLKCIVYSPNSGKREGELHNIDKGNVPFSHLH FT IDHYGPLQKTSAGFKYIFEVVCGFTNLSTIPLQNYHSKEAIKHLTTTLDI" XX SQ Sequence 4917 BP; 1659 A; 743 C; 1006 G; 1509 T; 0 other; attctcaggt gtggggttca caataaataa actataaata aaagacaaac aatttttggt 60 gatagtgagg tgaggttaag tggttaagtt aaactgttta aaatattttt tttagtaata 120 attagacatt tgttttgtta gaatagttct ttagtttatc tttaaaaaaa aaaaaaaaaa 180 tgagtaaatt aggagattat caggcaatag cggccaacat tagtaatgct gcactttcaa 240 ctattgtaag tttccataaa tttatttttt ctgaagaagg taataggaat aagaggaaaa 300 gaattaaaga atttacaggt tttaatttct taaaagattc tgagagctat aaagataaat 360 ttagttggat taaagaaaat ttgaaaataa gtgaactgta caagatttgt gctttactaa 420 atttggaatg tgaagatgat ttcgatgata tgactattac attattggat tgcttaatcg 480 atttagcact actagcacct aaaaaaaagg tggaagaaga ttccgatagt gagcatgaag 540 aaactgaggc agaaggaaag agttataacg aatatgttga tgcagaggtt acgaccacta 600 aaaaagagaa ggccaataaa tttgtaatgt cttttcgcga tgtgcaagat tccatacgtt 660 catttagcgg cgaaaatgac tattccatcg aagcttgggc ctcggatttc gaagaattat 720 ctgatataat ggaatggagt gacatacaga agctagtttt tgcaaagcgt tctttgacag 780 gtttagcaaa attgttcgta caaagtgaaa agggattgtg tacttttaat aaattaaaag 840 ctgctctgtt ggaggagttt tctcaccgta taagtagtgc agatttgcac cgccttctta 900 gtgagagacg ggctaagaaa aatgaaagtc tgcaacaata ttattttacc atgagagaaa 960 tggcatccag agggtcagtg gaaccgaagg cgttaatcca gtatattgta gacggaatac 1020 cagatgcggt caacaataaa attacgttgt atgaggccaa aagtttaaag gaacttaaag 1080 acaagataat tatttatgaa agcataagag agaaattgga aaataaacag aaaccttccg 1140 ctctgacttt caaaactgag ggtagcaagg cgtctaagtt tacgcagaac aacaagtgct 1200 acaactgcaa tggcgagggt caccgagcag cagaatgtcc tcgtcctagg cgggaaaggg 1260 gctcctgcta cacctgtgga gaaaagggcc acttacaaaa cagttgcccg aacagaaagc 1320 gctcagcagg aacatccgtt gcagcctcaa cttctaaggt ggtgggcatt gtggcagaaa 1380 aacctccact gccgccatat tatacttctt tggtgataga tggaagccga gtacagagtg 1440 aagtagaagc tgttatagat tccggatcat gcatttcatt aattaggaat gacatggtgg 1500 agtctgacag aattctagtc gccccatcca atattaagca gtacgaaggt ataaataagt 1560 ccaaattaag cattataggt atattttgta caaaagtatt tttaaattca tgtaatatat 1620 ttttagatat taatttttat gttgttaata ataatgcaat tccttgtagt tgccttttag 1680 gtagagactt catctcaaat aaacaaatat gcataacgtt ttctgatacc gttagtattg 1740 tcgctagtaa acataattta gatattaaag atgaaagtta tgactcgaac gtgcaagctt 1800 taatgctaat aaataataac gtagtaacga gtgaaaaaga aattattttg aatcagaatc 1860 taagcataga gaataagctt tctgtagaaa aacttattaa aaattattta ggtgccgata 1920 ataccgtgaa taacaagtca gtcattcatc cagaggtaga aatcgaatta gtgcctggtc 1980 acaaaccgtt taatttcagt ccgagaaggt tagcatatac ggagaaaatt gaacttcaaa 2040 aacttattga tgatctgttg tcgaggggaa ttattcgtga gagcagatcc ccttacgctt 2100 cgcctattgt acttgttcgg aaaaaaacgg gggagttaag gttatgcgtt gactacagag 2160 aactaaataa agtcacgatt aaagatagat ttccgctccc attaatcgat gaccaattgg 2220 ataggctcca aggaaagaaa tatttttcta aactcgatct taagagtgcg tttcatcaga 2280 ttagggtagc tgaaaagtcc gttaaataca cttcgtttat caccccgcta ggacagtacg 2340 agtatttgcg aatgccgttt gggtggtgca atagcccagc aattttcgct aggtttatta 2400 atcaaatatt caaatatctc attctcgatt caaaagtgtg cgtttttctc gatgatattt 2460 tgattgcgac ggaaacaatt gataaccatt tgacaattct taaagatgtt ttgggtttac 2520 tggaggataa taagctggaa ttacgtctgg acaagtgtgt gttcctgatg accagtataa 2580 attaccttgg gtattctgta gatgcaaatg ggatctctcc ttgcagtgaa catattgatg 2640 cgatatctaa ctatccagtt ccacggaact caaaggagct acatagttac ttggggttga 2700 ttagttattt tagaaggttc gtacccaatt tttcaatttt agcctctcct ttatatgatc 2760 ttattaagaa agggatcgaa ttcaaattta atgcagaaca tataaaatgt tttaataatc 2820 ttaaaaaact tctagtgaca cctcctgttt tgtctattta ttcacccacc gctgagaccc 2880 agctacattg tgatgccagc tctctaggat ttgggtcggt acttatgcag cgacaaggtg 2940 acgatgggaa atttcatccc gttgcttatt tcagtaagcg tactagtgat gcggaaaaaa 3000 ggtatcatag ctttgagcta gaatgtctgt gtttgatata ttctataaaa cgatttcaca 3060 tttatctttt tggaataccc tttaaaattt ttacagactg tcaaagtttg aaattgacct 3120 tagataaaaa agacatctgt ccaagaatag ctagatgggc actaatgctg caaaactata 3180 attatcaaat agttcatagg tctgggaagg gtatgggtca cgttgatgcc ttaagcagat 3240 gccagaatgt tttagtactg catgagaaca cgttggagca aaatttagct ttccagcagt 3300 ctcaagataa agctattatc aagataagag attatttaga acacaacgaa cataataagt 3360 ttgaattaaa taacggctta gtctatagaa agaaagataa aaagatttta tttttcgtgc 3420 ccgaaagtat ggagaatagt gttattaaga tttatcatga tgaaatggga catttcggtg 3480 ttgacaaagt aattgagtta gtgacaagat cttactggtt cccacacatg aaagataaaa 3540 tcaaaaatta cgttaggaat tgccttaaat gtattgtcta ttcacccaac tctggaaagc 3600 gtgaagggga actgcataat attgataaag ggaatgttcc atttagtcat ctacatatag 3660 atcattacgg gcccttgcaa aaaacatccg cgggatttaa atacatattc gaagtagttt 3720 gcgggtttac gaatttatca actataccct tgcaaaacta ccatagtaag gaagcgatta 3780 aacatttaac gactacttta gacatatagt caaccaatta attatttctg atcgaggaac 3840 tggctttacc tccaatttgt ttgaggaatt cttagagaag agggggattc aacatatcct 3900 aatagctact ggtaccccta gagccaatgg gcaaatagag agatttaatc gcgatattac 3960 gccagttcta gctaaaatta cgcctgagct acctaaatgg gacactgttt tagatcgtgt 4020 agaattcgcg tttaataata ccttctgcag atctattaaa aatactccta gtagattact 4080 gtttggtgtg gatcaacaag gggaaatact agacgcgttg aaattttact tagatgaaag 4140 tgataagaat agaaacttag ttgaaattag ggactccgct gctaaagcaa tcggggatac 4200 gcaagcacat aataaagaac tctttgatgc acggcataaa gcacctagga ggtataaaga 4260 aggtgactat gttatgatag ttaatacgga cgtaacaccg ggcattaaca aaaaattatt 4320 accaaagtat agggggccgt acgaggtcaa gaaagtgtta cccaatgata ggtacgtgat 4380 agcggacgtg gacggttacc aaattactgg cataccgttt gaaggagtct ttgaatcagg 4440 tagaatgcga ccatggcaag aaagttcgaa ctctggcatg gattttgatt ttttaactga 4500 caattagtta atcattttaa taaaatttga cacttatgtg atgattgcca caaattttat 4560 gagttgtgaa ttagcataga cttctgctat gttcaatatt atgtcatttt gtggttattg 4620 ttttaatgta ttatttatgt atcattttta attaatgtta ttatacttta gtttatgagt 4680 actgctcaag aacaagcttg caccgttttt atgattaagt tttaagatta tttgtcaata 4740 ataatttatg attttctttc atttaaggca aaagaaatgt attttccaat tattttccac 4800 tatgttagtt aaaggttgat tgatcgggtc gctcaatcgt caggttggac gagatgtagt 4860 agtaaataat aataatgcca tgtggagcgc tggagtgcgc gctagaaaaa gtcaatt 4917 // ID DNA-9_AAe repbase; DNA; INV; 1276 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-9_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1276 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1264-1264 (2011). XX DR [2] (Consensus) XX CC >99% identical to consensus. Present in >8000 copies in the CC genome. CC TA TSD. XX SQ Sequence 1276 BP; 440 A; 191 C; 171 G; 474 T; 0 other; ggactgttca ttttataaag tggacacctt gtttatgcta tatctttttt atttattgat 60 gaaatcgtaa acggttttct gtgtatcgtt caactattat tctacaatgt tatgaaaata 120 cagaaactta caaaatgctt tcggttgaag aactaaatag tttttgcaaa aactcctaag 180 aaaaactgtt cgtcaagttt aagcattatt tttcgcatga aaaaaatcct aaatttaatg 240 aacaaattgt atgttggtat cctttactat tcaacttaag gtagagcttt taaaaacatc 300 aataatattc catcagttcc tgacgctgag tcactttagt gatctattcc ttatggtcaa 360 atttgctgat acaccatctt tttactacca gcaaaaaagt aaagtaataa gcgtgttcaa 420 aactggttta gattactaaa gaaactatat atgtataaaa aaaagcaaaa tcttgagttt 480 taaccgcatt cttgggtacc aaatttactc tttatggtcg ttttattgaa aaatcccata 540 cttttaaaat tatatacgca tgtttatcga tctagagcaa actgtaagcg tttttctcaa 600 aatattttga taggtagtct cagaataaca cattctgaaa ataatacatg caaaataaat 660 ttgatttaat tcaagcagtg ttcccaatct tgatgattgt cattgtgctt tgagtggtct 720 tctgaaaata aattatttgt ttttctattg tgcttaaaat atgacgtggg gactaaatca 780 gcaactctat gaaggcgtaa gttattttta ttataaatac tatattatta cttccgtctg 840 gacgtacttt aaaccttaaa attctactca tatcagttcc atttaaattt aagatatttt 900 tctttaaaaa aaattgataa aaaatgattt tatgatatct gcaaaattgc ttctccaaaa 960 ataacttgat attttttagc aagcttctaa ttgatgatgc tttctaagta caaccatttt 1020 ttgaaaataa aaaatataaa ttgtcgaatt ttcctgccga tttttaataa tccttgattt 1080 tgataacctc caaaaatcga ccgttctaaa cgttgtgcct tattagtgtg aaatcatatt 1140 attttggtaa cttttttatt accacagcat tgtacaatat tagtcgaacg atgtacagaa 1200 aaccgatttc gatttcattg ataaattaaa aagatattgc atgtccaaag tgtccacttt 1260 ataaaatgaa cagtcc 1276 // ID Gypsy-6_CQ-I repbase; DNA; INV; 3908 BP. XX AC AAWU01000668; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_CQ_; KW Gypsy-6_CQ-LTR; Gypsy-6_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3908 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 391-391 (2011). XX DR GenBank; AAWU01000668; Positions 34623 30716. XX CC 'CACT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 447..2132 FT /product="Gypsy-6_CQ-I_1p" FT /translation="MVFDLKEAVICIPKFDGSHDHLHNFIAFIDLFAQGNE FT NNVNVNWEHQLLVVVRMKFHGKALDKADNIIKETWNETKLALKNAFEEKIS FT IEQIIMQVYSLKQLYHESFEDYKERADKIHASIKKLGNNVYANRQLKSNFI FT SGLRDYNVMKLAANIQADDYEDLVAELVKKCRYIEAINTSSSMKQNYTNNN FT YDDAIRYHSDDFNDECTSFNKRHYNYSSNSNSNSRSYYHGRKNNKFQNFEN FT RPHWQDRNQSQNFNDSNFNNQTNGSDWFDNRNTNFSQQQMDNQNNLLNYSD FT NCSVFVTEESELVHSFEDFNQNYESKNVIDISNSNYDEHIELAENSENLSI FT VLTTAAWKGDNLFEENTAVEPTTHSEIEVQVLNVNEELDQLTHINEINDDV FT ATTSDHLKEEYNKSDDKEIIYDLTKIYAETNDLNMIIPQWIFNQKTSEEIQ FT NGVHQEQVVVSEFDPIKDTAYESQFYPKDANEIIESDIAWVETTNPIALEQ FT VRTFNKIKQYHTDKFNSRSISSLSYGDRKENSNSIEEIKPLSLNCQNVLQS FT SQKLEDFKRFRMKGS" XX SQ Sequence 3908 BP; 1463 A; 596 C; 703 G; 1146 T; 0 other; tctggtgaca gcggaaacta accgctagga aaaaggaaaa cgccatttct gactgaccta 60 agtaataatt gttacagcaa tcgattttga ttagaagata caaggaagca aaaatatgtg 120 ggcctactac tcgtgaatcc tagaactacg cataactgag tgttgtttgt gcccgatttt 180 gtgtgttgct gttttgacga attctatcat cattgcccgc cctatctgcg acagccacca 240 cgagaggatg atcatcattc tattaataac atggtaagta ctttttagtt ttatgttctg 300 atttttgtga aaaaagtaat agtgtttgat gatttgatgg agaagtcagt acttcaaagc 360 tgtttcgttg agtaaaagtg tttaagtata atcttcattg ttatgcatca cttataaatt 420 tcaattttaa attttaaaat aacaaaatgg tgtttgatct gaaagaggca gtaatttgca 480 ttccaaaatt tgatggatct catgaccatc tacataattt tattgcattt atagaccttt 540 ttgcacaagg caatgaaaat aatgtcaacg ttaattggga acatcaacta cttgtagtag 600 ttcgcatgaa gtttcatggg aaagcgctag ataaagctga caatataatt aaagaaactt 660 ggaatgaaac caaactagct ttgaaaaatg cctttgaaga aaaaataagt attgagcaga 720 ttatcatgca agtttatagt ttaaagcaat tgtatcatga atcatttgaa gattataaag 780 agagggctga caaaattcat gcttctatta aaaagcttgg taataacgtc tatgcaaata 840 gacaacttaa atcaaatttt atctcaggtt tgagagatta caatgttatg aaacttgcag 900 caaatattca agctgatgat tatgaagacc tagttgcaga attagtcaaa aagtgcagat 960 atattgaagc aattaatacc agttcatcaa tgaaacaaaa ttatacaaat aacaattacg 1020 atgatgcaat tcgttatcac tcagatgatt ttaacgatga atgtacatca tttaacaaac 1080 gccattataa ctatagctct aattcaaact caaacagtag atcatattat catgggagga 1140 aaaacaacaa atttcaaaat tttgaaaatc gcccacactg gcaagacaga aaccagtctc 1200 aaaattttaa tgatagcaat tttaataacc agactaatgg ttcagactgg tttgataaca 1260 gaaatactaa tttcagtcaa caacaaatgg acaatcagaa taatttacta aattattcag 1320 ataattgttc agtttttgta acagaagagt cagaactagt gcatagtttt gaagatttta 1380 atcaaaacta tgagtctaaa aatgtaatag atatttcaaa ttcaaattac gatgagcaca 1440 tcgaacttgc tgaaaatagt gaaaatttga gtattgtgtt aactactgcg gcatggaaag 1500 gtgacaattt atttgaagag aatactgcgg ttgaaccaac tacacacagt gaaatcgaag 1560 ttcaagtttt gaatgttaat gaagaattag atcagttgac ccatataaat gaaataaatg 1620 atgatgttgc aactacgtct gatcatttaa aagaagaata taataaatca gatgacaaag 1680 aaattattta tgatttaaca aagatttatg ctgagactaa cgatttaaac atgattattc 1740 ctcaatggat atttaatcag aaaacgtctg aggaaattca aaatggagtt catcaagaac 1800 aagttgttgt atcggaattt gatcctataa aagataccgc ttatgaaagt caattttatc 1860 caaaagatgc caatgagatc atagaaagtg atattgcatg ggtcgaaaca acgaacccga 1920 ttgccttgga acaagtcaga acgtttaata aaataaaaca atatcataca gataaattca 1980 actctcggtc gatatcatct ttgagttatg gtgatagaaa agagaactca aattcaattg 2040 aagaaattaa acctttaagt ttaaattgtc aaaatgtatt gcaatcctca caaaagcttg 2100 aagatttcaa aagatttcgt atgaaaggaa gctaaatatg ttttgataat ttggactgga 2160 cacattatcg aaaaatgtcg aatgaaaatt aaaatgaaaa ggatatttta ttttaaaatg 2220 aagtgattga ttaaacataa tggtcgctca agcgcagtgc tctttataac ttaattcatg 2280 agtgtggact cttattatca cgttcacaat atggatttta aaaattaatc tgtattgtgg 2340 taactcaccc acataatgaa atatgccttt aaagatttgc ctatcgatgg tttcatcatt 2400 aggtttaatc ctttactaaa aaaaaggtta gcaaaatgct cttcaaagta tatttttact 2460 tgatgacaca cacgctaaca atgtgtacgc tgtcaatgaa aacgattacg atgtaaattc 2520 gtttagtact ttttagaaga gtttgagttt tatcatgtta aaaaaataat aataatataa 2580 aaaaaacatt gcattttata ctaatatgct tgcatgcgag attttcgctt tgcttgagac 2640 ttttcttttc agcacaccta ctaccaactt ttcggggtca tagggggaat gatcggtcgg 2700 atcatttaag taagtatata aattgaatga tggcagttaa tgtctcggac tattttatgc 2760 accaaaattt aaaatcattc aatttaaacc tggttttttt atgcatgaat gatgaatgca 2820 agagaagtct gtgggaagag cgataaataa aatgattgaa aaaataatat aaatatggct 2880 tgaaaataac caatcatttt gtttataaaa tattaaaaaa aaaaaaaaga ttgagattcg 2940 atcaacatta aatgccgcct agcaaataaa atatggggct tagctaattc gtagagataa 3000 tcattatatt taaaaaaaac ttctgcaaat taaaaaaaat gcagcttagc aattcaaaga 3060 ataaaaacaa aaaaaaaaac ttagggttta gaattttgag ccagtcctca gagagcggaa 3120 tccactctgc tagggagcag aggtaacttc aaacttcaac atccgcaatc aacgacgctg 3180 gggtcttcaa gataggacaa cgaagggtcc ttggcgttac tccagatgat gaaaattggt 3240 gatcttcaat gttactcaat gttgtatggt gttacgcttg atgttgacga agctgatggt 3300 ctaaactgtg gcaaggagaa tggcaagatg agagatcgat ggaatctcgt tggaaagaca 3360 agaatatacg gtatcaacat acggcgactg ggagatgatg ctcctagatt gcgccttatt 3420 cattaataac cggagtgttg ttagcggcta agcaatgaac ttagactgca tttattttgg 3480 caatggttgc ctgatccaga tctaatgaac taatggaagt tcaatcatac cgtatcaagc 3540 caacgagata cgaaaagttt aacaagagat ggcctcaaca acagattgaa cacttaacag 3600 cgaaaagtca acggttggaa caacagagtc aatatcgtca acacagccca gacgtacatc 3660 atcgaagcag ccaggacaac cgtgattatc ccagaagaca ccccagagtc atatcgcgag 3720 acagaagaca tccccgaaga caccccagag tcataacgcg agacggaaga catccaagaa 3780 gacatccctg aggtagcaag gaccaacgta aagttaggaa aaattataat attttaccaa 3840 aaacttacaa ctatctactt gaaatcaatc tcttcaggat taatttcaag ctgttaagga 3900 aggggaga 3908 // ID Gypsy3-NVi_I repbase; DNA; INV; 3345 BP. XX AC AAZX01003557; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3-NVi; KW Gypsy3-NVi_I; Gypsy3-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3345 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1120-1120 (2007). XX DR Genome; AAZX01003557; Positions 9787 13131. XX CC Positions [2336-2842] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1649..3346 FT /product="Gypsy3-NVi_I_1p" FT /translation="MHEIHTTGPPVHSRPRSLYGERLEAAKDYFNKLIQRG FT IIRPGSGQWSSPLHMVPKPKGGFRAAGDYRKLIASTIPDRYGLPRIEDLLQ FT DCRDKGEDNVVVDALSRPCSTISMPSLLDPATIAAAQDDGEELPHLLKTGI FT LDLQQLAVQQHNIYCNVLKNVVRPYLPGKLRRLAFDTVHRPAHPSVRATVR FT MLADKFVWPGMRKDANSWAQSYVSCQRAKVQRHNREAIQSFEVPDNRFEHV FT HMDIIVMPFVGDLRYCLTMIDRFSRWPVVVPIADIRAETIARSFFEHWVAY FT YGTPITITTDQGTQFESALFALAQMIGSRRIHTTPYHPQANRLIERFHRTL FT KAALMCEEHTPWPERLPIVMLGLRSCLKEDLQASPAEMLYGTTLRIPGDFF FT TSDSAPADPGTFAGKLRALFRDIKPVPAAHHGNYKPFKLKRLATCTHVYQR FT VDAVRKPLVPPYVGPFKVIRTTSDKVFIILVNGQEKAVTTDALIPAHQDIS FT DAPSTTPMEPAATSPPIDTPEAKEKSGDSTPSDAQSKEAEEEDFSVLEHPG FT AKKKVSFASQGKLTKGGVP" XX SQ Sequence 3345 BP; 797 A; 1043 C; 898 G; 607 T; 0 other; agaccccgac atgcgacgca tcttacctca cagtatccgc cacgtgatct caagaaaaga 60 ctctgtaaat aattagtgtt ttaataaacg tgaaactcga actataagtg agaaagtgag 120 tagaatttct taccgcgaaa tcccagacct cgctaaattg gtgaccccga cgtgatcgcg 180 gcggatttcg ggcgagtgac agacaaacac gtgcaaaagg tgaaagtgtc aattgtggac 240 ttggacctga aatgttggtg ccacggagac tccaccccga gagcccggta ttacgacccg 300 tgtactacgc tccagagcta aggagtttcg cacgagcctc gacgcagaac aaagcgacga 360 caaaaacgag gtcgacatga cgagtcaggc aggagagagc ggtgcaggtg gcggccttgg 420 caacattccc ggaggcgccg gcgataacag caacgcaaca ggtggtgccg gtaacagcag 480 cggaagcgcg ggaataggcg acgctaacct cgctgcacta cgacatgtgg cgaaacgaac 540 gccgccattt tggaagcatt gtcccgatgc ttggttcttg cagctcgaga gaatttttgc 600 gtgcaatcag attcaatctg atctgcaccg ctacaacacc ctcgtatcca gcctcgacca 660 ggagaccgtc caggaaaaac gtcgtaaaaa cgctgcgcat tctacgcaac agcccgttgg 720 acgagttcgc agaggtggcc gacggcatcg tcgagactgg gcccgccacc ttcgctgtac 780 atccaggcaa caggaaggca ttttcaccag ctcgggctcc agcatttgta cctgaggcgg 840 ccccgccatc ctcgtcagct gactctcggc tcctcgctga attcgccgcc atccgcatct 900 caatggctaa gatagcttct gcaacagcaa aaaccctcaa ggcggtccag aggatgagca 960 gcagcagcgg tcatcaagga ggacagcatc agagtaacca gcatggtgga cgtcagagcc 1020 gctcaaggtc acgggccaag tctccgacag gcaaccccag ccactgctac taccatcagc 1080 gttttggaca tccaaacaac cagggaaact aggccggctg ccaggagctt tggcggtctc 1140 cgacagccca gcttctgctg aaaaccgcct ccacgtcctt gacaggacga ccaacaccag 1200 cttcctagtg gactccggct ctgtgctgtc gcttctgcct cggtcagcgg tgcgagatcg 1260 aaagctcgcc gtccagccac ttcgccttgt ggcggcaaac ggcaccccgg tccacacctt 1320 cggcaggcgc ctggtgacgc tcaacctcgg actccggcga gccatccctt ggcctttcgt 1380 cgtggcggac gttccggttg cgatcctggg ggcggacttc cttcaccact cagggtatct 1440 ggtggacctc cagcatcatc gactcatcga ctcgaccacg acgctttcca ccccggcgga 1500 gatccgggcg acggccatcc acggtgtctc ggtggttgct ggctcaagca ccgccatctc 1560 ccaggacgaa taccagcggc tcctagctga gttcactgac ctggcgcagc cgaacgaggg 1620 gagagccacg ctcaacggcg aggtggtcat gcacgagatt cacaccacgg gcccgccggt 1680 ccactctcga cccaggtcgc tctacggcga gcgtctggag gcagccaagg actacttcaa 1740 caagctgatc caacgaggca tcatccgtcc aggatcaggg cagtggtcca gcccactcca 1800 catggtgcct aaaccgaagg gaggattccg cgctgcgggc gattatcgga agctgatcgc 1860 ttcaacgatc ccagaccgct acggattgcc gaggatagag gacctccttc aggactgtcg 1920 agacaagggt gaggacaacg tggtggttga cgcgctttcg aggccatgct caaccatcag 1980 catgccgtcg ctcctcgacc cagctaccat cgcagcagct caggacgacg gcgaggagct 2040 gccgcatctc ctgaagaccg gcatcctcga tctacagcaa ctcgccgtcc agcagcacaa 2100 catctactgc aacgtcctga aaaacgtagt ccgcccgtac ctgcctggca agctccgtcg 2160 gctagccttc gacaccgttc accgtccagc gcatcccagt gtacgagcca cggtgcgaat 2220 gctggcagac aaattcgttt ggcctggaat gcgcaaggac gccaactcct gggcgcagtc 2280 ctacgtctct tgtcaacgcg ctaaggtaca gcgacacaac agggaagcca tccaatcctt 2340 cgaggtgcca gataataggt ttgaacacgt ccacatggac atcatcgtga tgcctttcgt 2400 cggagacctc cgatactgtc tcacgatgat agacagattc tccaggtggc cggtggtggt 2460 gcctatcgca gatatccgag cagagaccat cgcaaggagc ttcttcgagc actgggtggc 2520 ttactacggc actccaatca ccataaccac cgaccagggc acccagttcg agtcagcgtt 2580 gttcgccttg gcccagatga taggctcgcg caggatacac accacgccgt accatccaca 2640 ggcaaacagg ctcatcgagc ggttccacag gacgctcaag gctgctctga tgtgcgagga 2700 gcacacgcca tggccagaga ggcttccaat tgtaatgctg ggactcagat catgcttaaa 2760 agaagatctc caggcgtcac cagcagagat gctttacggg accactctac gcatccctgg 2820 ggactttttc acctccgaca gtgctccagc agacccgggt acctttgcag ggaagctgag 2880 agcactcttc agggatatca aaccggtacc agcagctcac cacggtaact acaagccgtt 2940 caaactcaag cggctcgcta catgtacaca cgtctaccaa cgtgtcgacg cggtgaggaa 3000 gccactggtc ccaccgtacg tcggcccctt caaggtcatc aggacgacca gcgacaaggt 3060 cttcatcatc ctggtcaacg gtcaggagaa ggccgtcaca actgacgccc ttattccggc 3120 ccaccaggac atctccgatg caccatcgac cactccgatg gaaccagcgg caacctcgcc 3180 gcccatcgac acgccagagg ccaaagagaa gtcaggagac tccactccgt ccgacgccca 3240 gtccaaggaa gcagaagaag aagatttttc tgttttagag caccctggcg ccaagaagaa 3300 ggtgtcgttc gcttcccagg gcaagctcac taagggggga gtacc 3345 // ID Chapaev3-3_HR repbase; DNA; INV; 2226 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-3_HR is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-3_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-2226 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 58-58 (2008). XX DR [1] (Consensus) XX CC Chapaev3-3_HR belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-3_HR is a young family of leech Chapaev3 transposons: CC genomic copies of Chapaev3-3_HR elements are ~98.7% identical to CC their consensus sequence, which was derived from multiple CC alignment of four Chapaev3-3_HR elements. Chapaev3-3_HR contains CC 12-bp terminal inverted repeats and encodes a 566-aa transposase. CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX FH Key Location/Qualifiers FT CDS 214..1911 FT /product="Chapaev3-3_HRp" FT /note="transposase." FT /translation="MATRKCLNSPNLFCYVCGYFTDVDHRKTMTQLLKKAY FT ELYFDSKVDDAEKQWKPNNICSICANTLAGWLRKSPKHKSMPFGVPVIWRE FT PTNHATDCYFCMTVIKGFSFKTRKSISYPDIESVSKPIPHNPVNCPVPISP FT DSYSFHSDDSSESDKQANPESTNDSDDNYIPESDEIHLVNNANLSDLIRDL FT ALTKGQAELLGSRLKQWNLLAPDAKICNFRFRHKELMLYFTSDDSMCYCND FT VKNLMAHLGHEHEVKNWRLFIDASKTSLKFVLLHNGNLLPSIPIAYSTQLK FT ETYGNISNVLIKIKYNDYKWRVCGDLKVIAILMGLQLGYTKFCCFLCEWNS FT RDKVAHYTKKIWPIRDRMEPGHKNVLHEPLVLKKDIILPPLHIKLGLMKNF FT VKALDKTSDAFTYLRSMFPRISDAKIKERIFVGPQIRKVIGDKHFQDMLNG FT KDLEAWVAFKSLVVNFLGNNKSNNYKEIVEQCINAFQKMGCNMSLKIHLLD FT SHIDFFPECLGEVSDEHGERFHQEIAVMESRYQGRWSSAMLADYCWFLQRD FT APEAKHRRKSTCKKFKPLS" XX SQ Sequence 2226 BP; 739 A; 381 C; 383 G; 723 T; 0 other; cactgacata cataattttt ctttcccata aaataattag tgttcttata caagtttttt 60 acgctgattt caaatctgaa aaccgttttt ctctatcaca ttgcgttcca aagaaaacga 120 ctaaacaatt tttttaaaaa aaagctttta cctgtatagt ttatctatgc cctacgttat 180 ttgttttcat aataaaaatt taatttcgtt gccatggcta ctcgaaagtg tttaaattct 240 cctaatttgt tttgttatgt ttgtgggtat tttactgatg tcgaccatcg gaaaacgatg 300 acccaattat taaaaaaagc ttatgaactt tatttcgatt caaaagttga tgatgcggaa 360 aaacagtgga aaccaaacaa catttgctcc atttgtgcta atactttggc aggttggttg 420 cgaaaatcac caaagcacaa gtcaatgccg tttggagttc cagtcatatg gcgcgaaccc 480 acaaatcatg ccacagattg ttacttttgc atgacagtta taaaaggatt ttcatttaaa 540 actcgtaagt ctatttctta tcccgacatt gaatcagtat ccaagccgat tcctcacaac 600 cctgtcaact gtccagttcc aatttctcct gattcgtact cctttcattc agatgattct 660 tcagagagtg acaagcaagc aaatccagaa agtaccaatg attctgatga taactacatt 720 ccagaaagtg atgaaattca tttagttaat aatgcaaatc tttctgacct tatacgtgat 780 cttgctttga cgaaaggaca ggcggagctt cttggttcac gattaaaaca gtggaattta 840 ttagctcctg atgcaaaaat atgcaacttt cgcttcagac acaaggagct tatgctgtac 900 tttacatcag atgacagtat gtgctactgt aatgatgtta aaaatcttat ggcacatctt 960 ggtcatgaac atgaagtgaa gaactggcgg ctctttattg atgcatcaaa aacgagccta 1020 aaatttgtgc tgctgcataa tggaaattta ttgccatcaa ttccaattgc ttactccact 1080 cagttaaaag agacttatgg caatatcagt aatgttttaa taaaaataaa atataatgat 1140 tacaaatgga gagtgtgtgg agacttgaaa gtcattgcaa tcctcatggg tctacaactt 1200 ggttacacca aattctgctg tttcctctgc gaatggaata gtcgggataa agttgcacac 1260 tatacaaaaa agatttggcc tattagggat cgcatggagc ctggtcataa gaatgttttg 1320 catgagccac tggttctcaa aaaagatatc attttaccac cgttacatat caagttaggt 1380 ctgatgaaga attttgtgaa agcgttggat aagacaagtg atgcttttac atatcttcgt 1440 tccatgtttc cacgaatcag tgatgctaaa ataaaagaaa gaatttttgt gggccctcag 1500 attcgaaaag taataggtga taaacatttt caagatatgc taaatggaaa agatctcgag 1560 gcatgggtag cattcaaatc acttgttgtc aatttcttag gaaataataa gtcaaataat 1620 tacaaagaaa tcgttgaaca atgcatcaat gccttccaga agatgggatg taacatgtcc 1680 cttaaaatac atcttcttga ttcgcatatt gactttttcc ctgaatgtct tggtgaagtc 1740 agtgacgaac acggagagcg tttccaccaa gaaatcgctg taatggaatc acgctatcaa 1800 ggacgatgga gtagtgctat gctcgcagat tactgctggt ttctgcaacg tgatgcccca 1860 gaagctaagc accgtcgaaa gtctacatgc aagaaattta agcctctttc ttaaaattaa 1920 atttgaatca actgtcttat tttaatgaac aatcacaaaa tttttaatta ttaactattt 1980 gttttcaata gaatgacatt ttttgattaa taaaaatctt gaaagaaatt actttttata 2040 ttttaatgat tatattgtga gtatatttat taataaagtg atcattactt caactgaagt 2100 tttcttaaaa ttttgatgtg atacaagtaa atggattaca gatttgaaat cagcatgctg 2160 aaatgatcca gaaaaacata cactcactaa agaaagtaaa aaaaaaattt tttttgtatg 2220 tcagtg 2226 // ID CR1-23_HM repbase; DNA; INV; 4752 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-23_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4752 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1851-1851 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 769..1644 FT /product="CR1-23_HM_1p" FT /translation="MSAIEEIATNAFKAAVDLKDLNATLMKSYQNDIALMI FT KAALLVALAEEKRSSELREKALKDKIDLLVKQINELKEIKPTATPSFASLL FT KDKSDESKQQRQQLSTFIAAQNKNIRSRELNVIVVGLNKSSTVDDKKLVED FT FFVATGLEDATGIARVNRLKSSKNKTKLSNSNIIQVSLSNLSSRAEILKKC FT AHHKQEIYKGVYVREDRTPEQQTEFNTTRTQLKKLNDQLAAANILDNPFRN FT VIHRRTGRICCIDVIESTSNQCYVFSSPARALSDYASRTAASNSEISTAN* FT " FT CDS 1211..4612 FT /product="CR1-23_HM_2p" FT /translation="MLLVSHVLTASNRAKTKQNCQTRTLSRFRYQIFHLER FT KYSKNAHTTNKRYTRVSTYAKTERQSNKPSLTPHAPNSKSLTTNLPLLTFS FT IIRFGMLYIEEQVASVASTSSNQRPISATYFHPQPELCQTTLAAQPLLTQK FT SPLPTRRPRPITKAKIKPMQSDTSTVSPSHLCYWANNPCSLNNEKRQELFA FT RLSALHKLSRPHVIFFAETWFTDLSDVAVPGYQPHRKDRDNRGGGVAIYTR FT DDIIVSEVSSTQLNSTSIEQIWRKIKLGQESFLLGCLYRPHDENDQNLTQC FT ITSITTAQQILPKINCSAMLLYGDFNFSHTSYEPIEVNSGIATVAHVRGER FT QGDIRFQKCLDECHLTQLITFPTYRSSRHVEPTSTLDLIITNEPDRVIFIK FT QGDSLGDTPMGQAHCFIEGQFTVACLNNSPTLPLKPRLIWSRANYAAISAH FT IAAVDWETVFTGLNANEDYQLLVNVYNEATNLHIPSTTTPFTGKQEQWVTP FT ELIEAVKAKRTSWAKYINAGRDTHKFLKISHRTACKKVKKMVKAGRQKYEE FT LLVRDSESNPKRLHAYIRSKQSVSNNITSLETVNSNITTDTGDICASLNNY FT FQSVFVLEPEGSMPGFESRTEAVCVIDPAIFSIDIVQKCLNSLDERKSTGY FT DGLHPRVLSKCSATFAKPLSLIYKCSFATGVVPDLWKKLNVTPIFKKGSRL FT KASNYRPVSLTSIPCKIMESILQKRIMEHCVANGLISPNQHGFVHRKGCVS FT NLLETRDIMTEATHCRHAVDVIYTDFAKAFDKVPHKRLLHKLKAYGIQGAL FT VDWIAAWLNNRCQRVVINSITSEWKAVTSGVPQGSVLGPLLFVIFVNDLPD FT RITHHMKLFADDSKIIGIIKKESDSITLQDDVDRAVEWSHIWLLHFNVEKC FT KVMHVGRGCNKSTYKYSMSNVDGTRRPLEKTAVERDLGVMVSNDLKVRPQI FT ESAASVANRMLGRLKKTFRSRSFYLWRTLYLTYIRPHLEFAVQAWSPHLKS FT DIAILEQVQQRATKTISTIKHLSYEDRLQRLGLTTLEERRKRADLIEQFKI FT TRKLELVSFVVPQKFSKSKDKYILRGHNHKLERQMVKNCEERYNFFSNRIV FT APWNALSQEAVNAQSINAFKNNIRK*" XX SQ Sequence 4752 BP; 1558 A; 1003 C; 824 G; 1365 T; 2 other; tttttktttt ttcgtattct tttattcttt tcgctcttat tctttttttc tgacattatt 60 tctttctcta ttattgcttc atcttattag cttattaatt aatttctact tttaaatcat 120 agtatactat attgcctata ttcgtgtcta aaataacagt ttgtcgtact tcgcgcacct 180 atatttatat atacctatac ttatatttta ttatatatat ttttatttat atattattat 240 tataaaaata gttgtgacta agattttatt tatgctttag ttcgattata tttttacaga 300 gtttacaaat ttttttttct tttttcttgt tttaaagagt attgtaaacc tttcaatttt 360 tgctcgtagt tttaaccgcc aaagtttaag cgtgtaaacg atattttcta atctattgtt 420 ctattcaaga agttttgaga atattttttc aacttttttt tcttaatttt tcaacttttt 480 gtgccttgac aaaatccaaa atttattatc aattaataaa tatatttacg gaaaatttat 540 cactatatat ttactaatat ttgtaatcaa attactaata ttagtaagtt atatacgtat 600 aaatttatta tattaaagta tatattaagg tatatatata caatcatata gaatatcaat 660 tttattgaaa ttttaaattt taaattaaat ttttaaaatt tgtaataaat tgtaacaaat 720 attaaaaatt taaaaacttg caattattaa ttgcattcat tgctaattat gtcagcaatc 780 gaagaaatag ctaccaacgc ttttaaagcg gcagtagatc taaaggattt aaatgctact 840 ctcatgaaaa gttaccagaa cgatattgct cttatgatca aggcagctct acttgttgca 900 ctagcggaag agaaacgatc tagtgaatta agagaaaaag ccctgaaaga caaaattgat 960 ttactggtaa aacaaattaa tgagttgaaa gaaataaaac caacagccac accatcattt 1020 gcatctttgc ttaaggacaa atcggatgag agcaaacagc aacggcagca gttaagcact 1080 tttatcgctg ctcaaaataa aaatattcga agtcgtgagt taaatgtcat tgtagtcggt 1140 cttaataagt ctagtactgt agacgataaa aagcttgttg aggatttttt tgtagcaaca 1200 ggattggagg atgctactgg tatcgcacgt gttaaccgcc tcaaatcgag caaaaacaaa 1260 acaaaattgt caaactcgaa cattatccag gtttcgttat caaatctttc atctcgagcg 1320 gaaatactca aaaaatgcgc acaccacaaa caagagatat acaagggtgt ctacgtacgc 1380 gaagacagaa cgccagagca acaaaccgag tttaacacca cacgcaccca actcaaaaag 1440 cttaacgacc aacttgccgc tgctaacatt ctcgataatc cgtttcggaa tgttatacat 1500 cgaagaacag gtcgcatctg ttgcatcgac gtcatcgaat caacgtccaa tcagtgctac 1560 gtattttcat ccccagccag agctctgtca gactacgcta gccgcacagc cgcttctaac 1620 tcagaaatct ccactgccaa ctagacgtcc tagaccaata actaaagcaa aaataaaacc 1680 gatgcaatcc gacacttcta cagtttcacc tagccatcta tgttactggg caaacaaccc 1740 atgttcactc aacaatgaaa aacgccaaga gctctttgct agactatcag ctctacataa 1800 actcagccgg ccacacgtca ttttttttgc ggaaacttgg ttcacagatt tatcagacgt 1860 tgccgtaccc ggctaccaac cacatcgcaa agacagagac aataggggcg gtggagttgc 1920 aatttacaca agagacgata tcatagtcag cgaagttagt tctactcagc tcaattcaac 1980 atcaattgag caaatctggc gaaaaataaa acttggtcaa gaatcatttc tcctcggctg 2040 cttatatcgt ccacacgacg aaaatgatca aaatcttact cagtgtatca catcgatcac 2100 taccgctcaa caaatactgc ctaaaataaa ctgctctgca atgctactgt atggcgattt 2160 caattttagc catacatctt atgagcctat cgaagttaac agtggcattg caactgttgc 2220 tcacgtgaga ggtgagcgtc aaggcgacat caggttccag aagtgccttg atgaatgtca 2280 cttaactcag cttatcacat tcccaaccta ccgtagtagt cgacacgtcg agcctactag 2340 cacactcgac cttattataa caaatgagcc agaccgtgta atcttcatta aacaaggcga 2400 ttctcttggt gacactccaa tgggccaagc acactgcttc atcgaaggcc aatttactgt 2460 tgcatgcctc aacaattcgc caacwttgcc gctaaaacct cgactcattt ggagtcgagc 2520 aaactacgca gcaatatcag cacatattgc agctgttgac tgggaaacag tattcactgg 2580 actaaatgca aacgaagatt atcaacttct agtaaacgtg tacaatgaag ccactaactt 2640 acacattccg tcaactacca ctccctttac aggaaaacag gagcaatggg taacaccaga 2700 acttattgag gcggtcaaag ctaaacgcac ctcttgggcc aagtacatta atgcaggtcg 2760 agatacgcac aaatttctca aaatatcgca tcgtactgcc tgcaaaaaag ttaaaaaaat 2820 ggtcaaagct ggtaggcaga agtacgaaga gctactcgtc agggattccg aaagcaaccc 2880 taaacgctta catgcatata taagaagcaa gcagagcgtc agtaacaaca ttacatcgct 2940 tgaaacagtc aatagcaaca tcacaaccga taccggtgat atatgcgcat cgttaaataa 3000 ctattttcaa tcagtctttg ttttagagcc cgaaggttca atgccaggct ttgaaagtcg 3060 tacagaagca gtttgtgtca tcgatccagc aattttctct atcgacatag tacaaaaatg 3120 cctaaacagc cttgatgaaa gaaaatcaac cggctacgat ggcttacatc cacgggtcct 3180 cagtaaatgt tcagctacct ttgccaagcc tctttcgctt atctataaat gctcgtttgc 3240 aactggtgta gttcctgact tgtggaaaaa attgaatgta acacctattt tcaagaaggg 3300 gagtaggcta aaagcatcca actaccggcc agtctcgctc acttccatac cttgcaaaat 3360 tatggaaagt attttacaaa aaagaataat ggaacactgc gttgctaatg gtctaatttc 3420 tccaaaccag catggctttg ttcatcgtaa gggatgcgta tcaaaccttt tagaaactcg 3480 ggacatcatg acggaagcaa ctcactgcag acacgcagta gacgtcattt acacagactt 3540 tgcaaaggcg ttcgacaaag taccacataa acgcctcctt cataagctga aagcgtacgg 3600 tattcaaggc gctctcgttg attggattgc agcttggcta aacaatcgat gtcaacgagt 3660 ggttataaac agcattactt ctgaatggaa agcagtcact agtggagtgc ctcaaggctc 3720 ggtcttgggc ccattgctat tcgtaatctt tgtaaacgac ctgccggaca gaattactca 3780 tcatatgaaa ctgtttgccg acgacagcaa aataataggg ataattaaaa aggagtcgga 3840 cagcatcact ctccaagatg acgttgacag agcagtggaa tggtcacaca tttggctctt 3900 gcatttcaac gttgagaaat gcaaagtaat gcatgttggg cgcggatgta acaagtcgac 3960 gtacaaatac tcaatgtcta acgtcgacgg cacccgtcgt cctctcgaaa aaaccgcagt 4020 cgaaagagac cttggtgtga tggtctcgaa tgatcttaaa gtcagaccgc aaatagagtc 4080 agccgcatca gtagcaaata gaatgctagg gcgactcaaa aaaacatttc gcagtcgcag 4140 tttttattta tggcgcactc tgtatctcac ttacattcgc ccccatcttg agtttgctgt 4200 acaagcctgg tcaccgcatt taaaatcaga cattgccatt ctcgagcaag tccaacaacg 4260 agcaacaaaa acaatatcga ctattaaaca cctgtcctat gaagatcgac tccaacggct 4320 cggtttaaca actcttgaag aacgccgcaa aagagcagat cttatcgagc aattcaaaat 4380 cactcgtaaa ctcgagcttg tcagttttgt cgtcccgcaa aaattttcaa aaagtaaaga 4440 taagtatatt ctaagaggcc ataaccataa actagaacgt cagatggtaa aaaactgtga 4500 ggagcgatat aacttttttt caaacagaat tgtcgcccca tggaatgcgc tttcgcaaga 4560 agcagtgaac gctcaatcca tcaacgcttt caaaaacaac ataagaaagt aactttatag 4620 atacaactgc tgctttgtct gtagcgtctg tatctccatg ttcgtcagca tagaggggcc 4680 tggtgtagca cctcttcatc aaattataaa tggattattg tatttgtatt tgatcactga 4740 aataaataaa ta 4752 // ID DNA8-38_AP repbase; DNA; INV; 486 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-38_AP. XX NM DNA8-38_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-486 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1968-1968 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 486 BP; 210 A; 42 C; 58 G; 176 T; 0 other; cagtgttcgg attagataag tattttttta tctagataaa gataaagata atgcaactca 60 aatgtatcta gatagagata aaagataact taactcatat ttatctagat aaagataaag 120 ataaatactt ttttatctag ataaaaaaaa gatacaatta ttttttaaat ggttataaat 180 tttggtatat tttagttggg cactaggaac ataataataa taatgatata aataaaaata 240 attacattat ttataaacca taaactttgt ttgagaatct gtataaatat ttaaaaattt 300 agtacaattt cgcgattttt atcaataaaa aatgtatcta gatgaattta tttagattag 360 taaaaaaatt atctaagatg aacgtttaga taccttgtaa ctatttatct agataagata 420 aaagatgaac aaataattat ctagatattc atctagataa atttatctag ataagtccga 480 acactg 486 // ID TC1_HC repbase; DNA; INV; 1588 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Haemonchus contortus transposable element Hctc1 - a consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR transposon; TC1_HC; non-autonomous transposon; tc1-like; KW transposase. XX OS Haemonchus contortus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Strongylida; Trichostrongyloidea; Haemonchidae; Haemonchinae; OC Haemonchus. XX RN [1] RA Hoekstra R., Otsen M., Tibben J., Lenstra A.J. and Roos H.M.; RT "Non-autonomous transposable elements in the genome of the RT parasitic nematode Haemonchus contortus."; RL Mol. Biochem. Parasitol 106(1), 163-168 (2000). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX CC Homology to tc1a gene, encoding transposase. Possesses 53 bp CC inverted terminal repeats. XX SQ Sequence 1588 BP; 461 A; 373 C; 342 G; 412 T; 0 other; gtaccggtca gtgaagtatt cgttttcaat ttttcaaaca taacttccgc aaatttcaac 60 ctttcggtgg gaaactcaaa cagcaggtaa ctttaacact tagtaacatt atacttgtgc 120 agcagcctag cagcacactt ccaagcaaaa cttgcgctta attttcaaac agccccccta 180 cgcacatttt tgccttcgat aactcgtggg tttttacatt aagaaattcg aggtaattct 240 tgctcctttg tgtcacgtca tgcacctagt acatcaataa gacttttcca atgcactcgg 300 gaatctgtaa gtcacggcaa gatcaatttg acgccgtcat tcggggcttc caggaagcct 360 taacatccag gcgagtttct gaagttcaag gcgtgccagt tatgtgcgta caagcgattc 420 ggaaggaata caaggtcaca aggtcagccg aagctaaaat tcacccagag gcagcgagat 480 tgacgctttg tcttgtcgac aggaaaattg tgcagcgtgc aacaaatggc tcgcgtctca 540 ctgcagccaa gatttttcgc gaaatctcgc cgcccgaaag accgaatcct ttcgtgaaca 600 ctgttctacg tcgtctgaaa gaggtcggtc tgtttggacg tcgtccagca aagaaaccac 660 tgatttctgc caagaatcgc aaagtgcttc tcgattgggc tcatgctcac aaaaactggg 720 tagtccagca gtggcgtaag gccatcaaaa gcgacaaatc caagttttcg ctgtccagca 780 aggacgaaat catgtttgtg tgacgtccaa tcggtaccag atgccatgca agttaccagc 840 tgctaaccgt gaaatatggc gggggttcag tgatggttca cggtagcttc tgtggcaaag 900 gagtgtgccc cctctgaagt atcgagggta agatggattc caaaatgtgc accaatatta 960 cgaaagctgt tatctatccg ttcattcgca gtattggccg cggccactta attttccagc 1020 aagacaatga cccggagcac aaatcgaagc tgctcaccaa atgattccgt tacaacaacg 1080 tccctctgct accgtggccg ttgcaatccc cagatctgag tcctattgaa catctgtagg 1140 attgtcttta gcgtcaagtt aaaaaccttt gagcccaaaa caagcacgaa aagctccttc 1200 aacttaagac catatgggag aacatccctc aggaggagat cgacaaactc atcaagtcca 1260 tgccacgccg atgttaagcc atcattgacg ctaaaggtta tgcgacaaag cattcatgtc 1320 ttacttgtta tatttgctct tatgcaagtt gctgcgctga gaaatggaaa attaaggcag 1380 attgaagatt tcactggaat tatggggctt gaccatttct ttgactaaaa tctttcgaaa 1440 tattccacaa ttaacttaag ctactgaaaa cgcgctctct gttcacattc agtgatagga 1500 actcacagtg gaaaaaccag atgaaaagat gctgttttaa agaagctacg tttgaaaaag 1560 tgaaaacgaa tacttcactg accggtac 1588 // ID L1-4_Cis repbase; DNA; INV; 5907 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-4_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5907 RA Smit A.F.; RT "L1-4_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC 6% div. XX SQ Sequence 5907 BP; 2315 A; 1022 C; 865 G; 1658 T; 47 other; attctatttg aataaggtac gaattttgcn cggcaaaatc taaatcgacc agattcgaat 60 agaaccttgg attatttcat cgttgttaca tcgtttattg tggatttacc ttgtggattt 120 attttcttcg cttcaaacgg gagactcggt gagtttggta tagtactctg tctcactttg 180 cgcacatttt ttcttgcacc gtttttgtcg ttttttcggt aatctttttt gtaattacta 240 gtgttgccat ttttttactg ttgtatttct ttttttcccc ctctggcaac actaataatt 300 acttttgtgt ggcgtttaat tgtttgattg tttttgtgca cttttcccgt taacttagac 360 ggcgaagcgc tagccaaatt ttctgttaaa tagcgccacc ttacgaccac tacatttctc 420 atcacacgac ataataacat aatcataaaa tacacataac acttaaacaa tatgaactca 480 aaagaaatcc cggacattga tgttgagtta cctcgaacaa ttgtgataaa acttaccggt 540 aacgtggagc atatgagtat ctcctcgttt atggagatat tcgctgaaga tgggccgctt 600 gcatatcttg cacctctaat tgacggagtg ataatggaga gcttcaggga atgtgagttc 660 cttgtcactc tcaaggacga tgggcgagat catccgattt cgttggcnca gcaaaccata 720 gattctgtaa acgaaaaaac agatatgttt attgggccga aagggtcggc tatgacattt 780 atggcaatgc tgcctcagcc acccgtggag acggtgacgc tccatcccat tccatgcact 840 atcacggaac gtcacctaaa attactagtc aataaataca agtggggtac atacaaaagt 900 tacaattttg gaaaccacag aaattttccn acaattaaaa atggatggct ntcaattaat 960 ctcatggaca tnaatatcaa taatatcccc aaagtcataa aaataggggg naaatggata 1020 acngttacca aaccgggaga atcccacttg gagctttgcc gctactgcaa agaaaggggc 1080 cacttgcaga gcaaatgccc caaaaaaggt tactgtgctt tctgcaaaac gcatggccat 1140 ataaccagaa gatgcagatc ggcaattcca ccaccatttc ctcagggcan cccatcacca 1200 ccaaatccaa cattgggtga ctggattctt ccgaaaagaa ccttcaaaaa ccctctgcaa 1260 ccaacccttt tccaagacaa caatatcctg ccgacaacct caaaccagtt ttatgtattg 1320 gatacaccga gtcatgagga gaattcagac atgacaatnn tggaaaatat taaccaggaa 1380 gtgttgcccc ccaaaaaagt tacaaaatcc cctcgaccna agtccacccc caggaagcta 1440 aaagcaaata acattggagc accaaaccaa aattctacaa tggatagctt catgaataaa 1500 tcaaacacag atccccgaga aacaagccaa tataaaaccc cggatgaaac naaaacaaaa 1560 caacaancaa caaaaataga tctagacgac aacgttaagt ctgatataga ctcttcaccc 1620 cctgcaacaa cggattcaaa tcaatcgtca aaagatgttt cacctatcga ggtgattaaa 1680 aaagcccaca aaaagctcgt atacggtttg gccaacccca ttttaaaata cacggacaat 1740 aacccaccaa cnattcgacg ttcaatacta gatcgaaaaa aggacttgct tgggacaacc 1800 atgtctccaa ataaaagaag aagagaggaa aacgaaccaa gaacataatt tagtgtttaa 1860 acaccacgaa tcacaaaaca aattgatatc aatttcatan tgaagaatgg ataatacgat 1920 tgaagagagt attgaaatac tttcttatcc attaaacata gctacattga acataaatgg 1980 catttcaaca aagttgaacc atttggaaaa ttacatacaa attaatgaaa ttgatatcat 2040 ttgcatacaa gagacacaca gaattactcc caaaattata cataccttaa ccaataatct 2100 tcattataaa atcttcatca attgtcttgt ttctgatgta ccatcacgta acgcctatgg 2160 aaatataacg ttaattaaat cgcatttatt aaataaattt aagtttgatc ataatattat 2220 nttagataat cttatggata aaattacact naaatgtaat agtataataa taaacttata 2280 taacgtatat ttaaaatgta aaaataaaac ggtacgtaaa gatatgatac taaaattaaa 2340 agataacata gataatncaa taactcataa tattatactg ggngatttta acatggtcat 2400 ggatgaaatt gatgtaaatg gttattttaa ttcgaaaaat catatggata gagcggcgtg 2460 gaagcaatta gaaaaagact ataacataaa tgatgcgttt cgatatttta aaaaacaaca 2520 aataacttat agccgaataa cngaaaaatc tgcgacgcga atcgatagaa tctatactag 2580 taaaagtctn aataataaaa tcattaaatt atagtcataa acctaatcnt ttttcngacc 2640 acaataattg tccagtaata tccttaaaaa tagataatga caaaaaatgg ggtccatcct 2700 tttataaatt aaatacttct atattagaat ataatgacgt nattgatcat ttaatttttt 2760 tctggaaaaa cctaagaagg gaaaaaagaa aattttcaaa cacgttacct tggtgggata 2820 ctgcaaaact agcagttaaa aacgaattga tatatatttc atcggacata aaaaagggcg 2880 aaaaacaaag atatgaaacc gtgaataaaa aactaaaaac tcttcgtaag caaatatcga 2940 ctcctgaaat aaataaagaa atcgctttaa tagaaaataa tatcgaaata tacgaacgga 3000 aattacataa atggtgcgct aattagagcn aaattaccct tacttgaaaa tgaagaaaan 3060 cctacgaaag ccttttttgc gtttgaaaaa gaaaaacaga aaagagacgc aatatttcaa 3120 gtcaaaaccg ccgaaggaaa tttaaccgaa accccgatcc aaacnataaa agaaattcat 3180 aaattttact taaatctatg gggagtagcg ccaaatgtat gtaacgacga tattgataat 3240 tatttaaata cgatnaatcc tataactgac gacggagaat tcgaccaatt aaacaaacct 3300 attaattgtg atgaaataaa aattgccatc gaccaaatga acgaaacgag ttcccccggc 3360 atggacggcc taaacgtagt gatatataaa aaattattta atacaataaa atatgactta 3420 gaagaactat ataataacat ttttctnaaa gggcaactta ctaacagtat gaaaacggca 3480 atngttaaat taatattcaa aaagggcgat aaatcggaaa ttaaaaactg gcgtcctatt 3540 tcacttctna actgcgacta caaaatatta agcaaagtaa tagcnaatag attcaatgta 3600 ataataaata aagtaataag taaaaatcaa aaatgcggcc tcaaaggaag atcnatcaac 3660 gaagcacttt ataatataca agcgagccta caatccgcaa aacaattcaa taataaatta 3720 acgatcctag ctatagactt tgaaaaggcc ttcgatagaa cgaatcccaa atttataatt 3780 aanatctgtc aaaaattaaa cttacctaaa acattaatac aatgggtcaa tattatctat 3840 tctaatatta aaagtaaaat tgaaattaat ggaacgttta ccccggaaat cccaataaaa 3900 cgaggaatca gacaaggntg cccnttaagt atgattttat ttttaatagg aatggaatct 3960 ctnactcgna aaatatataa taacccacgc ataatcggat ataaacttaa tcaaattgag 4020 ctaaaaacng aacaatatgc cgacgacctg acaatattaa ccaatgataa gaattcaata 4080 caaccgatct ttaaagaact cgaattatat ggacgtatat cggaccaaaa aattaatatt 4140 acaaaaaccc aggtaattag taacgacgcg gaggcaatat cgaaaattca acaattatat 4200 ccggatataa atatcgtcga aaatataaaa atcctaggaa taacttttaa tttaanaaat 4260 caatgtaata accttaattg ggagaaatat atgaaacata taattaacac gctaaatata 4320 aacaaacgac gcaaactaac gttacgcggt aaaaaactnc ttattaatac nctaatactc 4380 cctcaaatta attcgattgg cgaaataatt caacttccaa aaatatttct gaataaaata 4440 aattcggctt tatttaactt tctatggttt ccgacaaaat gggaactcat gaaacgagaa 4500 aaattttacg catcaatgaa ccacggaggg ttggcgatcc caaatataaa aataagatta 4560 gatgcaatga aagctactag antatataaa cttaaacaaa tcgaacgggt tgcggaaatt 4620 tggcacgaat gggcgagatt taatcttggc tcaaccatga aaattattaa taaaaaccta 4680 tatagtaaca gtaaaccaaa tgcaacttac cccaacgact ttttccgaga attgcgccac 4740 atatttttca atctaaacaa acaggattac gactggaaaa acggtaaaac caaagacttt 4800 tactttggtt taattcngaa tatcgcgctt aaaacggaaa tagaaattaa tggagagacg 4860 gtcccatggc aaaatatatg gcttaaacaa taaaagtata agtgcgtttt ttaatcatat 4920 agaacgcgac aatagctata aaatagcnca aaatataatt cttcacgggg attggtttac 4980 gtctaaatcc ttttcaaact accataatgg aaaattacta ataagacgct gtaaattctg 5040 taaagggtct gatgataata taaaacacct cctaactaga tgtaatataa taaataaaat 5100 aatagataat atatatactt atataaataa tatatcttat ggaaactata ctaaatcaga 5160 taacttaatt ttatataacg gtaatgttct tctccaaaaa gaagaacttc tacccattaa 5220 atttattaat atattaaaaa gtcatatttt acaagaaaag aaacgattag atatagcaaa 5280 tatatatatc tggaataacg atgaatttca tagaaaaact ctatggatcg tcaaaacgaa 5340 attcaaaata tttttaagaa ggatcatgat aaaaaatgga ttaacttata cccagcaaaa 5400 atttaaatta aaagataatt ataatcccta tatatagctt atatcatacg gtataaataa 5460 tgtacagata tntgcatata tctatatata ggtttgtatt tttttgtgtt tttttttttt 5520 tgcaaagtga gacactgtaa catactaaac tccgctataa ccaaaaaaat aataatatta 5580 atactaacca atttccttta atagcagagt ctctcgtttg aaaattttca ccgcagattg 5640 cttcagcctt tcaccngtca acttgaaatc tcgtcgtctt taggacaacc tccttcaacc 5700 tacacatcat aaatatcgta attagtgaat ttgacgcaac aattaaaatt aaaaagttct 5760 actttgacat cttggtaaat aatgtaaata gtctttaaaa tttccgttct ttattattta 5820 tatattgtga atttaaatat tatgtatttc gtccggattg ccgggaaatg tgtttatatt 5880 gctaataaac tgaaaaagaa aaaaaaa 5907 // ID R1_DWi repbase; DNA; INV; 6120 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE willistoni. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DWi. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6120 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 364..1896 FT /product="R1_DWi_1p" FT /translation="HEKIFYEFFRDTNYLCRDVVMDGSMKRSGSTISVMSA FT SSVVESARSATSCVESAAESMNVTGSDTVDESGAEMTVVVGKREKKARKAK FT ARSARSSPVLEQVAPVAKKPHLDSVSMPAPAAPPVKPAKRKAAAAAASPEA FT VESPAAGPADMEIPRSGAWRIVAGGSSAILAEIGASGQAIRNAMIGMEPVA FT ASQLSSAIRRLEELATSLTLRNALLEAAASKPAAQPPTTPVAAHIAAYNAA FT FPAVTARAAAVVPVPGVPVAAPRISKPAETWSAVVTSRDPNVSSKQVAERV FT MKEVAPTLGVRVHEVRELRSGGAVIRTPSVVEIQRVVANKKFAEVGLNVQQ FT KKAARPSLKVMEVDTKLTPEVFMQQLYENNFKEMTVESFKKAVHMVSKPWV FT AQEGKRVNVSIEVDDSIASLLEGRERFYIQWFSYRFRWIVRTHVCHRCAGY FT DHKVAQCGATEATCFRCGQKGHKIWKCENPVDCRNCRFNGRPSGHLMLSLD FT CPMFAAKEARFAANH" FT CDS 1893..5024 FT /product="R1_DWi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="SLTMFSFIQANCGRGRVACIDIGVRMRGFGHSFALLQ FT EPHTDLDGRIIGFPAGMKIFGSRIFEGHRRNSAVIIDDPDAICLPVESLIT FT EAGVCVRVTGKFGSIYLTSLYCHAHAELMGAFQYMDAILLLASSTPVILCL FT DANAVSPMWFCKRYDSYRGQPNYRRGEQLEDWILASRAGVVNRFSEVYTFD FT NRRGQSDIDVTIVSEAASAWAAYDWSVDEWEMSDHNIITVVVTPSPDTVES FT TYAPVPSWKLRNANWRLFDSELVEEIREHLPLEHIRMSPLDSSVSALRLAV FT QSTCDRVLGRRTSRAAGKVKWWSTELSTKRHEVRRLRRRLQDARRRGNESA FT QFLATHLRTAVREFKKLILKTKEDHWRNFVGDHRDDPWGHAYKICRGRKRS FT TELGCLREDGRPVVTWHDCASLLLRNFFPVAAASDDIVIPSGATPVLEAFE FT IGVCVARLKSRRSPGLDGITGGICKAVWRAIPEHITALLSRCVAEGYFPLE FT WKCPRVVALLKGPDKERSDPASYRGICLLPVFGKVLEGIMVNRMKEVLPEG FT NRWQFGFRPGRCVEDAWHHVKASVAASQAVYVLGLFVDFKGAFDNVEWSAV FT MRRLIDAGCQEASLWRSFFSGRKARMTSRFGEVDVEVTRGCPQGSISGPFI FT WNTMMDVLLQRLEPLCVFSAYADDLLVLVEGRSRAELERRGKELMSVVGAW FT GAEVGVSISTSKTAMMLLKGRLAASRSPFVEFAGAGLRYVTQYRYLGITVG FT ERLSFQPHIAGLRGKLAGVSGALSRVLRVDWGLSPRSRRTVYAGLMVPCAL FT FGASVWYSAVKSTMVARRHLCSCQRVMLLGCLPVCRTVSTVALQVLAGAPP FT LDLVARQLAIRYKVKHSHPLEEGDWLFGLDVANLDRKQMKAMLDERMCCEW FT QRRWDDDEHGRPTHEFIPDACFVFSRKDFGFGLHAGFLLTGHGSLNAFLYE FT RSLSRTPTCQCGALCEDWQHVLVACPLYTDLRDLDGLGVRREDTRWDFSRV FT LDTPEQLQMLELFARAAFKRRRRMTQDQPVPGRTGQNW" XX SQ Sequence 6120 BP; 1372 A; 1490 C; 1895 G; 1363 T; 0 other; cagtcttgtt tggactgtcg ttgcgtgcgg ttgtgtcgcg gtcgtttgat ttacgaatct 60 tgaacgcaaa acagattgca taagtgcaaa acagagacac cgattgaaac ggtgaagtgc 120 tcacttgtcg taaacaagtg gattagtgtg ccaagcaacg gtaacttgtt taccagttga 180 gactgtgcgt gtgagtgaca gtgcgagtgt gtgtgtgagc acgtgctcaa gtacagtgga 240 cactcgagaa gagtgctagt acagtgcgcc ctcgctaaaa gggccaagtg cggttttcgg 300 cgtgcgtgat cggactcgtg cgtgagccga ttgttcgtgc gtgagccgat agttcgtgcg 360 tgacacgaaa aaatttttta cgaatttttt cgtgacacga actatttgtg tagagacgtt 420 gtcatggacg gatccatgaa acgcagtgga agcaccatca gcgtgatgag cgcttccagc 480 gttgttgaaa gcgcccgcag cgctacgtcg tgtgttgagt cggccgcgga gtcgatgaat 540 gtcactggta gtgacaccgt tgatgagtcg ggcgccgaga tgacggtcgt cgtcggtaag 600 cgggagaaga aggcgcggaa agcgaaggct cgctcggctc gttcgtcgcc ggtgctggag 660 caagtcgccc ccgtcgcgaa aaagccgcat ctggacagcg tgtccatgcc cgcccccgcc 720 gcgccccccg tgaagccagc caaacgtaag gctgccgccg ctgctgcttc tcctgaagct 780 gtcgaaagtc cagccgctgg accagccgat atggagatac cgcgaagcgg agcgtggcga 840 attgtcgccg gtggctcgtc tgcgattctc gcagagattg gcgcctcagg gcaagccatt 900 agaaatgcca tgattggcat ggagccagta gctgcatcgc agctatcgtc tgctatcaga 960 cgcttggagg agttggcgac atcgctgacc ctcaggaacg ctctgttgga agcggcagcg 1020 agcaaacctg ctgcacagcc gcccaccacc cctgttgctg cgcatattgc tgcgtacaat 1080 gctgccttcc ccgcggtcac cgccagagct gctgccgttg tccccgtccc cggagtgccg 1140 gtcgccgcgc cacggatcag caagccagct gagacgtggt ccgcggtggt cacgagtcgg 1200 gacccgaacg tgtccagcaa gcaagtggcc gagcgggtga tgaaggaggt cgcaccgacg 1260 ctgggagtgc gtgtacatga agtacgcgaa ctcaggagcg gaggcgctgt cattcgaaca 1320 ccgtcggttg tcgaaataca gcgggtggta gcgaataaga agttcgcaga agttggactg 1380 aatgtccaac agaagaaggc ggcaagacct tccctgaagg tgatggaggt tgacaccaag 1440 ctcaccccgg aggtcttcat gcagcagctc tacgaaaaca acttcaaaga gatgacggta 1500 gagtccttta aaaaggcagt gcacatggtg tcaaagccat gggtcgcgca agaaggcaag 1560 agagtgaacg tctccatcga ggtagacgac tcgattgcgt ccttgctgga aggccgtgag 1620 aggttctaca ttcagtggtt cagctacagg ttccgctgga tcgtaaggac ccacgtatgc 1680 cacagatgcg caggctacga ccataaggtg gcccagtgtg gtgcaacaga ggcgacatgc 1740 ttccgatgtg gtcagaaggg ccacaagatc tggaagtgcg aaaatccggt ggactgccgg 1800 aattgtcgct ttaatggaag gccatcgggg catttaatgc tctcgttgga ctgtccaatg 1860 ttcgctgcga aggaggcaag gtttgctgct aatcattaac catgtttagc ttcatccaag 1920 ccaattgtgg tcggggtcgc gttgcgtgca tcgatatcgg tgtgcgtatg cgtggtttcg 1980 gccactcctt tgcgttgctg caggagcccc acacagactt ggatggccga ataattggat 2040 tcccagccgg gatgaaaatt ttcggtagtc gaatcttcga gggacaccga aggaactctg 2100 ccgtcatcat cgacgatcca gatgccatct gcctaccggt cgagtcgcta atcaccgagg 2160 ctggagtatg cgtgagggtc acgggtaaat ttggctctat ttacctgacc tcactttact 2220 gccatgcaca tgctgagttg atgggcgctt tccagtacat ggatgccata ctgctactgg 2280 cgagcagtac tccggtcatc ctctgcctag atgcgaacgc tgtatccccg atgtggttct 2340 gtaaacgcta cgatagctat cgtggccagc cgaactacag acggggtgag cagctagagg 2400 attggattct cgcaagccga gccggagtcg tcaacagatt cagcgaggtg tacacgttcg 2460 ataatcgccg tggacagagc gatattgacg taacaatcgt tagcgaagca gcgtctgcgt 2520 gggccgcgta tgactggagt gtggacgagt gggagatgag cgaccacaac attatcactg 2580 ttgtagtgac gccttcccca gacacagttg agagcacata tgctcctgtg ccgtcctgga 2640 agctcaggaa tgctaactgg cggttgttcg actctgaact ggtagaggaa attcgggagc 2700 accttccgtt ggagcacatc aggatgtcgc cgttggactc ctcggtgtct gcgctacgcc 2760 ttgccgtaca gtctacgtgc gacagggtgc taggtcgcag aacgtcgcga gccgctggaa 2820 aggtgaagtg gtggtccact gagctgagta caaaacgtca tgaagtcagg agacttaggc 2880 ggaggctcca ggatgctagg cgcagaggga acgaatctgc gcagtttctt gcaactcatc 2940 tgcggacggc ggtccgcgag ttcaagaagc tcatcctgaa gacgaaggag gaccattggc 3000 ggaacttcgt tggagaccac agagatgacc catgggggca tgcgtacaag atttgccgag 3060 gacgaaagcg gtccacagag ttgggatgcc ttcgcgagga tggcaggccg gtcgtaacct 3120 ggcacgactg cgcgagtctc ctcctccgta acttcttccc agttgcggca gcgagtgacg 3180 acattgtcat tccaagcgga gctacaccgg ttctcgaagc cttcgagatc ggagtatgcg 3240 tcgcccggtt gaagagcagg cgctctcccg gattggacgg catcacaggt ggtatctgca 3300 aggcagtatg gcgtgctatc cccgagcaca taacagcgtt gttatctcgc tgtgtggcag 3360 agggatattt tccccttgag tggaaatgcc cgagagtagt ggcgctcctc aagggccccg 3420 ataaggaaag gagtgatcca gcttcctatc ggggcatctg cctgctgcca gtgtttggca 3480 aggtgctaga ggggatcatg gtgaatcgga tgaaagaggt gcttccggaa ggaaatagat 3540 ggcaattcgg ctttcgacct ggacgctgtg ttgaggatgc ctggcaccac gtaaaggcca 3600 gcgttgccgc cagccaggcg gtatacgtgt tgggtctttt cgtggatttc aagggtgcct 3660 tcgacaacgt tgagtggagc gctgtgatgc gccgcttgat cgacgcggga tgccaggaag 3720 ccagcttatg gagaagcttc ttcagtggca gaaaagcgag gatgaccagc agattcggag 3780 aggttgacgt ggaggtcact cgaggatgtc cgcagggatc catcagtgga ccatttattt 3840 ggaacactat gatggatgtg ctgctccaac gtttggagcc cctgtgtgta ttcagtgcgt 3900 atgcagacga tctgctggtc ctcgtcgagg gaaggtcgag agccgagctg gaacgtcgtg 3960 gaaaggagct gatgtccgtc gtgggagcct ggggagctga agtcggagtt tcgatttcga 4020 ccagcaagac ggcaatgatg ttacttaaag ggagattggc cgcttctagg tcaccctttg 4080 tagagtttgc aggagcaggc ctacggtatg taactcaata ccgatacctg ggcatcacag 4140 tcggcgagcg gcttagtttt cagccgcaca tcgcaggatt acgcggtaag ttggctggag 4200 tcagtggtgc actcagtcgc gttctgaggg ttgactgggg tctcagtccc cgatccagac 4260 gcactgtgta tgctggactc atggtgcctt gtgcgctatt tggtgcctca gtgtggtatt 4320 ccgcggttaa gagtacgatg gtcgccagga ggcacctctg ctcttgccag agagtcatgc 4380 tcttgggatg cctgccggta tgccgtacag tgtccactgt ggcactgcag gtgcttgctg 4440 gagccccccc cttggacttg gttgccaggc aactggcgat taggtacaag gtgaagcatt 4500 cccacccgtt ggaggaaggc gattggctgt tcggtctgga tgtggcaaat ctggatcgga 4560 agcagatgaa ggcgatgctg gatgaacgta tgtgctgcga gtggcaacgc aggtgggatg 4620 acgatgagca cggtcgaccg actcatgagt ttatcccgga cgcatgcttc gtcttcagcc 4680 ggaaggactt tggttttgga cttcacgctg gattcttgct gactggccat ggatcgctga 4740 atgcattcct gtatgagaga agtctgagta ggacgcccac atgtcaatgt ggtgcgcttt 4800 gcgaagactg gcagcacgtg ctagtggctt gcccactcta cacagatttg cgagacctcg 4860 atggacttgg tgttagaagg gaggataccc gttgggactt ttcccgggtc ctggacacac 4920 cggagcagct ccagatgttg gaattgtttg cgcgagcggc atttaaaagg cggcgacgaa 4980 tgacacagga tcaaccagtg ccgggtcgga cgggccaaaa ctggtagcag tagctcacac 5040 acactgcaaa gccgtaccag agcgctttaa tctggcagat gaacccccat cggggtgact 5100 cggggatgat ccattccttg ttcaggtgaa caaaaatggc tagtagtttc gactgcgaac 5160 agcccgccaa agctgcaaat ctggtagacg aaccccatga ggggggagtt gaggatgatc 5220 cattccctgt tcgtgttgaa caaaaatggc tagtagtctc tgactgcgaa cagcccgcca 5280 aagctgcaat ctggtagacg aaccccatga ggggggagtt gaggatgatc cattccatgt 5340 tcgtgttgaa ctaaaatggc tagtagtttc gactgcgaac agcccgccaa agctgcaaat 5400 ctggtagacg aaccccatga ggggcgagtc ggggatgatc cattccctgt tcgtgctgaa 5460 cagaaatggc tagtagtttc gactgcgaac agcccgccaa agctgcaaat ctggtagaca 5520 aaccccatga ggggcgagtc ggggatgatc cattccatgt tcgtgttgaa ctaaaatggc 5580 tagtagtctc tgactgcgaa cagcccgcca aagctgcaaa tctggtagac gaaccctagt 5640 tggggggagt cgaggacgat ccattccgtg ttgtcgagga cacatatggc tagtagtttc 5700 gactgcgaac agccgaccga agctgtgatt cggtactacg ggatgggcta gtggcccaag 5760 gctagcccaa ttagtggttg gcccccttgt gggagatcgt agtggctgtg gtttgatacc 5820 caaatgcggg gagagtcatt ggactcagcg tggagttgcg ttacacaacc gggtgctgta 5880 cccatagacc agcagaggtt ttagatgggc ctcgctcctt acccaggggg agtgtcatgt 5940 ccgacagcat ggcattcaat aggcactgac gagatgctgc aatagcagta tctccgaagg 6000 gcgttggttg acgcattgtt ttgacccgct gtactatatg tcacaagtgt ggcatattag 6060 agtaccgtgg ttgtaaaccc tattaatggg tacacgtcac gttaaataaa ctcgacttca 6120 // ID CR1-123_AAe repbase; DNA; INV; 4682 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-123_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4682 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1211-1211 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 363..1181 FT /product="CR1-123_AAe_1p" FT /translation="MVKHCTKCDXAIAGTDYVACRGYCGGFFHMNACSGVT FT RALLSYFSSHRKNLFWMCDGCANLFENSHFRMISEQXEQHSPLGSLTAAIN FT ELRTEIKQIRSKPIMQFSPASNTRWPAIEQQRAAKRPREMDSPVRAPXCCV FT GSRKAQENDVTISVTPNVVDEKFWLYLSRIRPDVTVESMTALVKDSLQIDV FT DPPVVKLVPKGKELGSLSFVSFKIGLNPSLKEKALDPSTWLEGLXFREFED FT CGVQKFRVPPKTMRPSTPLLVPHSSSPATPLT" FT CDS 1058..4597 FT /product="CR1-123_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="GFGFPRVRGLRSTKISGXAKNHETIDTITSAPLIISS FT NTTYINLSAPGRTTSSVLEAPYPPITVEQFPPALCSRPGPVYGSGDGVFHT FT LFTGKYXIDSYLQLCVVPDASNDYSQLTKPPTKSPSQATPGCTPASLMEAS FT NPFITVEPILPASSSHPGPVYDYGEEVFQTVNAGKYNDVLSNSLPVTSITS FT SELTTSRSTPDYTLQVASNQQRPSRSTNAGLNVQPTHRLLSVYYQNVRGLR FT TKISQLRLLLSSCDYDVLVFTETWLRPDINNAEISTEYEIFRCDRSDFTSS FT YSRGGGVLVAVKSHHKCESISLPNCTHLEQIAVRVALEHRSLYIVAIYLPP FT NSIAELYSAHADAVEHIAKLSSSSDVILSVGDFNLPNLRWQCDEDSNCCIP FT LNVSSESEQLLIETLFSIGLQQINNVVNVNSRLLDLSFTSLPEALDVIVPP FT VPLLPVDNHHAPFILLLDDSGENFSELDEWDEVPTFNFNACDFDVLNAAFA FT DTDWHFLQQCASVNEMLTAFYDKLYDICNVHVPRRRKKLNSMYNKPWWSNE FT LRNSRNILRKARKRFFHTRSESERENLHRVETAYKSLLKSSFENYISVVQS FT KVKQNPTFXWDFVGKQKSTKRVPSNVMLNGCQACSHEEAANLFASFFESVF FT STTTPTLPPNCFHHIPSFDFSFPVIQFSVIDVLNSFNDLDVTKGPGTDAIP FT PLLLKNCSTVLADPVSLIFNRSIHERTFPTIWKRSCIIPIHKSGNHAQVTN FT YRGVSILCCLGKVFEKLVHSALYNVAAPLISDSQHGFMKHRSTSTNLMVYV FT TALSRELEMGNQVDSIYIDFAKAFDTVPHKAIISKMTHIGFPNWITEWLLS FT YLSDRTAFVSVNSAKSRSFSITSGVPQGSVLGPLLFNIFVNDLSLLLKSFS FT LSFADDLKMYRTITSSVDCVTLQEDVNTVLIWCGNNGMSVNSSKCKVISFT FT RRSRSVSHDYFIGSVIIERVDSICDLGVTIDAKLKFNDHVGATVAKSFAAL FT GFIRRHAADFTDVYALKTLYCSLVRSIVEYAVPVWCPYYTTHILAIERVQK FT KFIRFALRTLPWNDPSNLPPYPDRCRLIDLQTLRSRRVEMQCLFIFDVLQG FT NIDSPALLEQVPLNIPPRQLRNYPFLAIPYHRTNYGQNNPFDACLRVFNVK FT SNVFDFNLSKNVFRGRIRVNI" XX SQ Sequence 4682 BP; 1270 A; 1135 C; 947 G; 1320 T; 10 other; catcactggc ctatatgtaa gcttggtgat aaccgtgcgc gaattttcaa gcttttaaac 60 cgtgtaatac aacctataaa ttcattacgt cgttctaatt gttatctgtc tgtgcagttc 120 tactagattt cgtttgtgat tactgaaatc gacgcstcgt agtgtataaa atcacatcaa 180 attttcggac attcagagtg tttttgttgc tgttgttaca atcttcatcc ctctatccac 240 cctctcgccg attttttcat ccgaagcaat atctgacgga atccaacgca aacttccaat 300 cgagcttgcg tgctttagta ctcgaattct catcaacact gcctacggtt aggcgcatca 360 acatggtcaa gcactgcaca aagtgtgatk aagcaattgc cggcaccgat tacgtagctt 420 gccgcggata ctgtggaggt ttcttccata tgaatgcctg ctcaggcgta acacgtgcac 480 tcctatcgta cttttcgtct catcgaaaaa atctgttctg gatgtgcgac ggatgtgcaa 540 atctttttga gaattcgcat ttccgtatga tctccgaaca agmtgagcag cactcacctc 600 tcggctcttt gaccgctgct atcaatgagt tacgaactga gattaagcaa attcgttcca 660 agccaattat gcaattttca ccggcctcaa acacacgttg gcctgcaatc gagcaacaaa 720 gggctgccaa acgacctcgt gaaatggata gtccagtaag agctcctaaw tgctgcgtag 780 ggagtagaaa agcacaggaa aacgacgtca caatttccgt tacccccaac gtggtggatg 840 aaaaattctg gttgtacctc tcaagaatcc gtcctgatgt caccgttgaa tcaatgactg 900 ctttagtcaa agatagcctt caaattgacg tggatccgcc cgtagtgaag cttgtcccta 960 aaggaaagga gttaggctca ctctcattcg tttcctttaa aattggcctc aacccctcgc 1020 tcaaagaaaa agcccttgat ccctccacgt ggcttgaggg tttggwtttc cgcgagttcg 1080 aggactgcgg agtacaaaaa tttcgggtwc cgccaaaaac catgagacca tcgacaccat 1140 tactagtgcc ccactcatca tctccagcaa caccacttac ataaatctst ctgccccggg 1200 acgcactaca tctagcgttt tggaagcccc ttatcctccc atcacagtcg agcaattccc 1260 gccagcgctc tgcagtcgtc ccggtcctgt gtatgggtct ggagacgggg tcttccacac 1320 tctttttact ggcaagtatc mcatcgatag ttatctccaa ctttgtgttg ttcctgatgc 1380 ttcaaacgat tacagccaat tgacaaaacc gccaactaaa tcaccgtcac aagctacacc 1440 gggatgcacg cctgcaagtc ttatggaagc ctccaatcct ttcatcacag tcgagcccat 1500 cctgccagcg tccagcagtc atcccggtcc tgtgtatgat tatggggagg aggtcttcca 1560 aactgtcaac gcaggcaagt acaatgatgt tttgagcaat tctcttccgg taacgtccat 1620 cacttccagt gaactgacaa cttcgcgatc gacacctgat tatactctac aagttgcatc 1680 gaaccaacag cggccttcac gttcaactaa cgccggactg aacgttcaac caacccaccg 1740 tcttctgtcc gtttattacc agaacgttag aggtttgcgt accaaaattt ctcagctacg 1800 cttgctttta tcgagctgtg actacgacgt gctagtcttt acggagacat ggctacgacc 1860 ggatataaac aacgcagaaa tctcgacaga atatgaaata tttcgttgcg accgtagtga 1920 tttcaccagt agctattcaa gagggggtgg ggtattggtg gctgtcaaaa gtcatcataa 1980 atgcgaatcc atttcattgc caaattgtac acatcttgag cagatagcgg ttcgtgttgc 2040 attggaacat cgctcattgt atatcgtggc tatatacctc ccgccaaact cgatcgccga 2100 actctattca gcccacgctg acgcagtaga acacatagcg aaattgtcat ctagttcgga 2160 tgttattctc tcggttggag atttcaatct tcctaatctg cgatggcaat gcgacgaaga 2220 ttccaactgc tgcattccat tgaacgtttc ttctgaatca gagcagttgc ttattgaaac 2280 tttattctca attggtttac aacaaataaa caatgttgtg aacgtgaata gtagattgct 2340 ggatttgtcg ttcaccagcc taccggaagc tctcgatgtc attgtacctc ctgttccact 2400 attacctgtg gacaatcatc atgcaccgtt tattttacta cttgatgata gtggagaaaa 2460 cttttctgaa cttgacgaat gggatgaagt ccccactttc aattttaacg cttgtgactt 2520 cgacgtattg aatgctgcat ttgccgatac cgactggcat tttcttcagc aatgtgcatc 2580 tgtaaatgaa atgttaacag cattctatga caagctwtac gatatatgta acgttcacgt 2640 ccctcggagg aggaaaaagc ttaattctat gtacaacaaa ccatggtgga gcaacgaact 2700 gcggaattca agaaatattc ttagaaaggc acggaaacga ttttttcata cgagatctga 2760 aagcgaacga gaaaatctac atagagttga gacagcgtac aaatcgcttc taaaatcttc 2820 cttcgaaaat tacatttccg tggttcaatc gaaagtgaaa caaaatccaa cgttcwtctg 2880 ggacttcgtc gggaaacaaa aaagcaccaa gcgtgttcct agcaacgtta tgctcaacgg 2940 atgtcaagca tgttcacacg aagaagctgc gaatctgttt gcgtctttct tcgaaagtgt 3000 tttcagtaca accacaccaa ctctccctcc caactgcttt caccacattc catcgtttga 3060 cttctcgttt cctgtaattc agttttctgt gatcgacgtg ctgaattcat tcaacgatct 3120 tgatgtcact aaaggaccgg gaactgatgc catacctcca cttctactga aaaattgttc 3180 taccgtatta gccgaccctg tttcgttgat attcaacaga tccatacatg aaagaacatt 3240 cccaacaata tggaagcggt cctgcattat tccgattcac aaatctggaa atcacgccca 3300 agtcaccaat taccgtggcg tctcgatttt gtgctgccta gggaaagtct ttgaaaaatt 3360 agtgcacagt gctctgtata atgtagcagc tcctctcatc tctgatagtc aacatggctt 3420 catgaaacat agatccactt cgacaaatct aatggtttat gtaactgcgc tttctcgtga 3480 gttagagatg ggaaatcaag ttgactccat atacatagat ttcgccaagg catttgatac 3540 tgtaccgcat aaagccatta tcagcaaaat gacgcacatc ggattcccta actggatcac 3600 ggaatggctg ctgtcgtatt tatctgatcg gacagcattc gtgagtgtta attcagcgaa 3660 atctcgctcg ttcagcatta cgtctggcgt accccaagga agtgtcctgg gtccgctgct 3720 tttcaacatc tttgtcaatg acttatctct actgctcaaa tcatttagtc tttccttcgc 3780 tgatgatctg aagatgtacc gcaccataac ttcatcagtc gattgtgtaa cgctgcaaga 3840 ggacgtgaac actgtgctga tctggtgtgg aaacaacgga atgagtgtca acagtagcaa 3900 gtgcaaggtc atatccttta ctcgccggag tcgctctgtc agtcacgact atttcatcgg 3960 atcggttata atagagcgag tggattcaat ctgcgatctg ggagtaacca tcgatgcgaa 4020 gctcaaattc aacgatcatg tgggagccac agtagcgaaa tccttcgctg ctctaggctt 4080 cattcgccgc cacgctgctg acttcaccga tgtctacgcc ttaaaaacgc tctattgttc 4140 cttggtgcgt agtattgtag agtatgctgt cccagtttgg tgcccctact acaccacaca 4200 tatccttgcc attgaacgtg tgcagaaaaa attcatacgg tttgcgctgc gcactcttcc 4260 gtggaatgat ccgtcaaact taccgcccta ccctgatcga tgtcgattga ttgacctgca 4320 aacattgagg agtagacgcg tagaaatgca gtgtctcttt attttcgacg tcttgcaagg 4380 gaacattgac agccctgcac tattggaaca ggtcccgctg aatattccac cacggcaact 4440 acggaactac ccgttcctgg ccataccgta tcacagaaca aattatggtc agaataatcc 4500 gttcgatgcc tgtcttcgtg tgtttaacgt caaaagcaat gtatttgatt ttaacttgtc 4560 taaaaatgta tttcgtggta gaattagagt taatatttag aatatgcagt acagtaattc 4620 agtctgtgtg acttcatgtc aaagacggtg gaaataaaat aaaataaaat aaaataaaat 4680 aa 4682 // ID hAT-4D_AP repbase; DNA; INV; 3411 BP. XX AC Contig1724; XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-4D_AP. XX NM hAT-4D_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3411 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1368-1368 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1225..2952 FT /product="hAT-4D_AP_1p" FT /translation="FILGRTVDSEIICSTSSDSDSDTGENISNKSLDIGNT FT SEFLDLHTFQSRGKKNFITPKLAIALDRCKISDRDAVHILTATVEAFGIHV FT KDLILNRTSINRIRQRLRKDRADQLRKEFNTSEVGPVVVHWDGKLLPDLTG FT KELVDRLPVIVSYKNSEKLLSVPNLISGTGQNQAEAVFQALEEWGLIDHVQ FT ALCCDTTASNTGRLKGACIILEQLLERDILYFPCRHHIYEIILRSAFEVNF FT VVTSGPDIQIFKRFQQFWPKVNINNYNTGLEDIIVSEKLNDITNVMLLFYM FT DQLRKPHNRDDYRELLELAVIFLGGTPTRGISFKYPGAMHHARWMSKAIYS FT LKIFIFRKQFKLKKNEEDSIRSICIFLIRLYIKAWFCAPIASLAPFQDLQF FT LKSLVDYENIHKKISQSTLKKLCGHLWYLAPETVALAFFDTNLTIETKIKM FT VDSIKLNNLTSEINKRIIVSPNEVTQIMKKEIYDFIYVESTSFFSRFGIST FT SFLEQHPSMWDENVDFQKGLEIVNTFRVTNDTAERGVKLMEEYNKVLTKNE FT EQKQYVLQVVEDYRRKYPNSLKTTVLNPF" XX SQ Sequence 3411 BP; 1289 A; 478 C; 508 G; 1136 T; 0 other; cagtcttacc ccctaatttg gttccaatat atataaaaac tatgtgtaca aaatttcatg 60 tcgataggtt aagtttaacc cgtgcccctg ggtccttaaa gttcgaaagc actattttat 120 tcaaataacc acgttttttt ttaaatattt tatttatatc tcgtttcaac tcactcatat 180 ataaatatgt gtaggacatt atttatagta aattaaattt tctataattc ttattctata 240 tagttttttg attaacataa tatttttata tataattaac tatgaaacat acaagtttat 300 aaaatgtata aaacgtaatt taataacgtg acaatttaac aagattacaa aatatctaac 360 atcattatat tattacattt gtaactgaac gaatttatta tctgagataa gcaaacagca 420 tttaataatt taataatttg atttgattat cttttaatct tttaatgatt cgaaacagtt 480 ttaaaaatag actactccat aaaactgata gtaaatacta acatatttgt atgtaatctg 540 acggtcgatt agtatactgc ggaccacgat agatagcaaa acaaaatatt acacatattt 600 tttttatcat ggcttcaaca tctacttcaa atattattct tcgtgagcaa gataaaattt 660 ttttaattgg tagtacttgc aaccaaatta ttggtagtaa attaccgtca aaaagacagg 720 cgttacgtgt actttttttt aatatgcgta aagtaaaact caacttacac gaaagtgcga 780 aacttgttat ccaagaaata gttgtatttt ggcaaaaagc tcgcattcca ataagacaag 840 aatacaataa cattaaggag ctggaatcat tatacgaaga atggagaacc ttacaaaaac 900 atgcaacaag aaaaactgaa ataaataaga aaaaacaaga atgttttgtg aatgagttag 960 atgacctttt tgatattgca catatgaatg cattggatat tattaaaata gaagtagatc 1020 gacaattctt actatctcaa agagaaaaag gcagaattgg ttgtatgctt ggtaaagata 1080 aaaatttaca aaaaactgaa gaaagagtaa ttactcggtt agaagcagtt acaaaacgga 1140 aaaagagagc ttatggagaa attgaacaag caagtaagac tgtaaatttc tatcaaacca 1200 ttttaaatta ttctataaat ttaatttatt ttaggacgta ctgtcgacag tgaaataatt 1260 tgttctacaa gttctgatag cgattctgat actggtgaaa atatttcaaa caaatcgtta 1320 gatattggta acacatccga attcctagat cttcatacgt ttcaaagtag aggtaagaaa 1380 aattttataa cgccgaaact tgcaatcgca ttggacaggt gtaaaataag tgatcgcgat 1440 gcagttcata tattgacagc tactgtagaa gcgtttggta tacacgtaaa ggatttaatt 1500 ttaaatcgga catctattaa ccgaatacgc cagcgcttac gtaaggatag agcagatcaa 1560 cttcgaaaag aattcaatac atcagaagta ggtccagttg ttgttcattg ggatggtaag 1620 ctactcccag atttaacagg taaagagctt gttgatcgat taccagttat tgtttcatat 1680 aaaaattccg aaaaactgtt aagtgtacct aatctcatat ctggaactgg tcaaaatcaa 1740 gcggaagcag tgtttcaagc acttgaagaa tggggcctta tcgatcatgt acaagctcta 1800 tgttgcgata caacagcttc gaacacgggc cgtctaaaag gagcatgtat aatcctggaa 1860 caattattag aacgtgatat tttatacttt ccttgcaggc atcacattta tgaaattata 1920 cttagatcgg catttgaagt aaactttgtg gttacgtctg gtccagatat tcaaattttt 1980 aaacgttttc aacaattttg gccaaaagtt aatatcaata attataatac tggattagaa 2040 gatataattg tcagtgaaaa attaaatgat ataactaatg ttatgttatt attttacatg 2100 gaccaattac gcaaacctca caacagagac gactacagag aattgttgga attagctgta 2160 atatttcttg gtggtacccc tactcgtggt atatctttta agtatccagg tgcaatgcac 2220 cacgcccgtt ggatgtctaa ggccatttat tctctaaaaa tatttatatt tcgtaagcaa 2280 tttaaattaa aaaaaaatga agaagattca attcgttcaa tttgtatttt tcttattcga 2340 ttgtatatta aggcatggtt ttgtgcacct attgcctctt tagctccatt tcaagactta 2400 caatttttga aaagtctcgt cgattatgaa aatatacata aaaaaatttc tcaatcaaca 2460 ttaaaaaaac tatgtggaca cttgtggtat ttagcacctg aaacagtagc attggcattt 2520 tttgatacta atttaacaat tgaaacaaaa attaaaatgg tagattctat aaaactaaat 2580 aatttaactt ctgaaattaa taaacgaatc attgtttcac caaatgaagt aactcaaatc 2640 atgaagaaag aaatatatga ttttatttac gttgaatcta catcattttt ttcacgattt 2700 ggaatatcaa catcattctt ggagcaacat ccatcaatgt gggatgaaaa tgtagacttt 2760 caaaaaggtc ttgaaattgt aaatacattt cgtgttacaa atgatacggc agaaagaggg 2820 gtaaagctca tggaggagta taataaggtc ttaacaaaaa atgaagagca aaaacaatac 2880 gtgcttcaag ttgtcgaaga ttatcgccga aaatacccaa acagtttaaa aactacggtt 2940 ttaaacccat tttaattatt tttctttgta tgctataaaa tataaatatt aaacaaatta 3000 attcatccat caaaccaatg aggctctttt ttaataacaa ttaaatgtat attatatata 3060 atacctacgt tttgtaataa tattgtttaa atgtcacatt atttataaac ttttatattt 3120 cataattaat taataattat atataaaaat attatgttaa tcaaaaaact acgtataaca 3180 agaattatag agaatttaat ttactataaa taatgtctta tacatattta tatatgagtg 3240 agttgaaaca agatataaat aaaacattaa aaaaaaatta tatttatttg aataaaatag 3300 tgctttcaaa ctttaaggac ccaggggcac gggttaaact taacctatcg acatgaaatt 3360 ttgtacacat agtttttata tatattggaa ccaaattagg gggtaagact g 3411 // ID Jockey-1_CQ repbase; DNA; INV; 4312 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4312 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 112-112 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 45..1418 FT /product="Jockey-1_CQ_1p" FT /translation="MTRRVPAQNAGNIALPAAQAGKKVGISKRKVEASTDG FT QKPGISKRKVLLEVTSNSKKTKKSDGTVPMDEEEENSSSHEEQLLKNNKFA FT GLPDEDQVAEAKENEVKQRKEKLPPFYVRQSAATIDFRAGLVELIKSGKVQ FT GNIRLCQDGFKVLVQSRQHYQLVKDYLTENEAEYFTHDVVMDKPYKIVVRG FT LYDMPVEELAAELKVLKLDVLAVHKMSRRNKDIKYRDQLYLLHLAKGSTTL FT PELKAIRAVFNIIVSWERYRPVHRDVTQCFNCLGFGHGGKNCHLKRRCAKC FT GTDAHITSQCIQDSLVKCLNCNGEHSSTDRKCPKRAEFVKIRQQASTKNQP FT QRRRTPPALVEQNFPPLQPRRQVPNLAPLPLDPRKRAEMNHPRPGSSQEPR FT PPPPGFSQEPRPTQEPAVEENGNDLYTSTELLNIFKQMSAALRGCKTKTQQ FT IEVLTSFVIQYGS" FT CDS 1411..4062 FT /product="Jockey-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MDRRSLKLLNWNACSIRRKNLELVDFLREKEIDVAAI FT TETHLKPGEKVYLQNYKIATQLDRTTSGGGGVLVAVHRDLKPRRLPHFKLD FT IIEAVGVEIPTSVGPVLFIAAYCPRQVNARDGSAAKLKSDIQKLTRRSAKY FT IIAGDLNAKHEVWGNSRRNRNGVILHNDLQNGYYNVVSPDRPTRVARSGNH FT SIIDFFITNMAENVAHPEVFEELSSDHYPVVVEVGASVTPQRQPTRKDYHN FT VDWQQFQQVVDSNIDYDQHPETSADIDRSLEVIQQAINQAEAANVREVPVN FT FKVTDIDVDTKHLIRLRNVYRRQYQRTGDYDKKISVNNLNKIIQDRLDNIR FT NQEFSKHVSQLGNYSKPFWKLSKVLKNKPKPIPPLIVEDSXLITSEEKANA FT LGLHFVSSHNLGASMTSRKETSVANSISTINDSTFEFPADSHVSGEEVKVA FT VKQMKNMKAPGFDNIFNLVLKKQSDQFFQHLANIFNKCLQLGYFPTNWKLG FT KVIPILKPQKDPTSPTSYRPISLLSSLSKLFEKVIYSRLLDFTNDNNIILN FT EQFGFRKGHNTAHQLTRVTKIIKQNKLESKSTAMALLDVEKAFDNVWHDGL FT IHKLYLYGFPMYLIKIIQHYLSERSFRVFLNGIASGLFNIDAGVPQGSILG FT PLLYNIFTSDLPTLPGNGVLSLFADDTAVIYKGKITRYLVGRLQKGLDVLS FT EYFGDWKIRINAAKTQTIIFPLSKSARFVPKDDVLIKMNDVSIPWSKEVVY FT LGLILDSKLLFRQHVDKILNKCSILIRCLYPLINRKSKLSLKNKLAVYKQI FT IYPVIEYAVPVWECCARTHKLKLQRVQNKVLKMVLNVPGWTRSSEVHELAE FT VKMLDQKIQEKCLKFREKCAISEYPLIQGLV" XX SQ Sequence 4312 BP; 1292 A; 913 C; 984 G; 1121 T; 2 other; cattgttatc cgagcattcg ttggtgaagg ttgtctgaat caacatgact cgtcgcgtcc 60 cggcgcaaaa cgccggcaat attgccctac ctgcggctca agcaggcaaa aaagtgggaa 120 tttccaaaag gaaggttgag gcctcaactg atggccaaaa accggggatt tccaaaagga 180 aggtgctttt ggaagtgaca tcgaacagca aaaagacgaa aaagagcgac ggtactgtac 240 cgatggatga ggaggaggag aactcttcgt cgcacgagga gcagctattg aagaacaaca 300 agttcgctgg actgcctgac gaggaccagg tagcggaagc aaaggagaat gaagtgaaac 360 agcgaaagga gaaactgcct cctttttatg tgcggcaatc agcagcaacg atcgactttc 420 gagcggggct ggttgaactc atcaagtctg ggaaagtcca aggcaacatt cgtctgtgtc 480 aggacggatt taaggtgctg gtgcaatcca ggcagcacta tcaactggtc aaggattact 540 tgaccgaaaa cgaggcggag tactttaccc atgatgtcgt catggataag ccgtacaaaa 600 tcgtcgtcag aggtctgtac gacatgccag tggaggaatt agctgccgag ctaaaagttt 660 tgaaactgga tgtgttggcc gtgcacaaaa tgagccgacg caacaaagac atcaagtacc 720 gtgaccagct ctacctgctg catttggcta agggatcgac gacgcttcct gagctgaagg 780 caatccgagc ggttttcaac attatcgtgt cgtgggagcg ataccgacca gtgcatcgtg 840 acgtcacaca atgcttcaac tgcctgggct tcgggcacgg aggtaagaac tgccacctga 900 agcgtcgttg cgccaaatgt ggtaccgatg cgcacatcac atcccagtgc atccaagatt 960 cgctggtgaa gtgcctcaac tgcaacggtg agcactcgtc aaccgaccga aagtgcccca 1020 agagagctga gttcgtgaaa attcggcagc aagcatcgac gaagaatcag cctcagcgtc 1080 gtagaactcc tccagccctg gtggagcaaa attttccacc tcttcaaccg cgacgccagg 1140 tcccgaactt ggcaccgttg ccgttggatc ccaggaagag agctgagatg aatcatccac 1200 ggccgggttc cagccaggag ccgagaccgc caccaccggg cttcagccag gagccaagac 1260 caacccaaga accagcagtt gaggaaaatg gtaatgatct ttacacctca accgaactcc 1320 tcaatatttt caaacagatg tccgctgcac tgcgtggatg caaaaccaag acccagcaga 1380 ttgaagtgct gacctcgttc gtcatccagt atggatcgta ggtccctcaa gctgctgaat 1440 tggaacgctt gctccattag gaggaagaat ttagagctgg tggattttct tcgcgagaag 1500 gagatcgacg tagctgccat cacggaaact catctgaagc ccggtgaaaa agtttatctg 1560 caaaactaca agatcgcgac gcagctcgat aggaccacct ctggaggagg aggtgtgctt 1620 gtcgctgttc atcgtgatct caagccacgc cggctgccac acttcaagct ggacatcatc 1680 gaggccgttg gggtggaaat tcccacttct gttggcccag tactcttcat tgctgcatac 1740 tgcccacgtc aggtgaatgc cagagatggt tcagcagcaa aactgaagag cgatatccag 1800 aagctgacac ggcggagcgc aaagtacatc atcgctggtg acctcaacgc gaagcatgaa 1860 gtttggggca acagcaggag gaatcggaac ggagtgattc tgcacaacga tctgcaaaac 1920 ggatactaca acgttgtgag tccggatcgt ccaacgaggg tggctcggtc tgggaatcac 1980 tcaatcatcg acttcttcat taccaatatg gctgagaacg tggctcatcc tgaggtgttt 2040 gaagagttga gttctgatca ctatccggtg gttgtggagg ttggagcttc cgttactccg 2100 cagcggcaac caacccggaa agattaccac aacgttgact ggcagcagtt tcagcaagtg 2160 gtmgacagca acatcgacta tgatcaacac ccggaaactt ctgctgatat tgatcgttcg 2220 ctggaggtga tccagcaggc tatcaaccaa gcagaggctg ccaacgttcg ggaggttcct 2280 gttaatttta aggtaactga tattgacgta gatactaaac acttgattag acttaggaat 2340 gtttatagga gacaatatca acggactggg gactatgaca aaaagatttc agttaacaat 2400 ttgaacaaaa tcatacaaga ccgacttgac aatatcagaa atcaagaatt ttcaaaacat 2460 gtaagccagc ttgggaatta ttcaaaacca ttttggaaac tttcaaaagt tcttaaaaac 2520 aaaccaaagc ctattcctcc tcttattgtt gaggattctc stttgattac atccgaggaa 2580 aaggcaaatg cacttgggct tcattttgtt agttcacata atcttggcgc ttccatgacc 2640 agccgtaaag aaacatcagt tgccaatagt atttcaacaa tcaatgactc tacctttgaa 2700 tttcctgcag attcccatgt ttctggtgag gaagtcaaag ttgcagttaa acaaatgaaa 2760 aatatgaaag ctccaggctt tgataacatt tttaatctag tgttgaaaaa acagagtgat 2820 cagttctttc aacatctagc caatattttc aataaatgtt tgcaacttgg ttacttcccc 2880 accaattgga aactgggcaa agtcatacca attttgaagc ctcaaaaaga tccaacatcg 2940 ccaacaagtt atcgtcccat tagtcttttg agtagtctgt ccaaactctt tgagaaggtc 3000 atctattcaa ggcttttgga ttttaccaac gataataata taattttgaa cgagcagttt 3060 ggcttccgaa agggacataa tactgctcat cagcttacga gagtaactaa aatcatcaag 3120 cagaacaagc ttgagtctaa atcaactgct atggctttgt tggatgttga gaaggctttt 3180 gacaatgttt ggcatgatgg tttgatacat aaactgtatt tatacggttt tccaatgtat 3240 cttatcaaaa ttatccagca ctatctttcg gagagatcgt tcagggtttt tctgaatggg 3300 attgcttctg gattattcaa cattgatgct ggggttcccc aaggaagtat tcttggccca 3360 cttctgtaca atatttttac atctgatttg cctactcttc ctggtaatgg tgtgttgtca 3420 ctttttgctg atgacactgc cgttatttat aagggtaaaa taaccagata tttagttggc 3480 cgtcttcaga agggtcttga cgttctttcc gaatactttg gcgactggaa aattcgcata 3540 aatgcagcca aaactcaaac catcattttt ccactttcca aatcggcccg atttgtccca 3600 aaggatgatg ttttgattaa aatgaatgat gtttcaatac cctggtcaaa ggaagttgtc 3660 tatttaggtc tcatacttga ctcgaaacta ttgtttcggc agcatgtaga taaaatattg 3720 aacaagtgca gcattctcat caggtgtttg tatcctttaa ttaacagaaa atcaaagcta 3780 tctttgaaaa ataagctagc agtttacaaa caaataattt accccgttat tgagtatgca 3840 gtacctgttt gggagtgttg cgctagaact cataaattga agctccagcg tgtccaaaac 3900 aaggtactca aaatggtttt gaatgttcct ggctggacaa gatcaagtga ggttcatgaa 3960 ttagcagaag taaaaatgtt ggatcagaag attcaagaaa aatgtttgaa attcagggaa 4020 aaatgtgcta tatctgaata ccccttgatt caaggattgg tttagtttat ggtaagatta 4080 agttagtttt aggttaggtt atgttttctt aatttttttt ttcctcattg tacctagtgt 4140 tacaaaaatg ataaatcata tacttttaaa gttatgaaaa tgaacagtgt ttaaatcacg 4200 aaaagcaaac taaagctgaa gagccacagc tgaccactta ttatgtaaac caaatgtaat 4260 tattataaga aagattcaat aaagtatatt taattcaaaa aaaaaaaaaa aa 4312 // ID Gypsy-139_AA-LTR repbase; DNA; INV; 429 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-139_AA_; KW Gypsy-139_AA-I; Gypsy-139_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-429 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1012-1012 (2011). XX DR [2] (Consensus) XX SQ Sequence 429 BP; 124 A; 107 C; 85 G; 113 T; 0 other; tgtggtgtac ttgtgaaaac atgtgcccat gcgcaatatt taaattcggt gctaacacat 60 agcacctacc aaatagttat gcttacgcgc gcgaaagact gcatcgaacc gtcgccccat 120 tgggggcaca gttcgattgc aatatgttcg tttgctatga gccatccact tgatggactt 180 taattgtcac gcgctgacca aagcacaaat cactccagaa gtcttccgat agaccctatg 240 taattgtaaa atgtagtgcg acacttttga agtgctggac gggtataaaa tacccctgac 300 cagccaaata caggcagttc cattcgatgg tctcagatca tcacttaatt gagcctttta 360 attaagtaaa atatcaccgc catcccggat ccaccactaa gtctactagt aggcaagtag 420 cctacctca 429 // ID Crack-1_CQ repbase; DNA; INV; 4315 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4315 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 32-32 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 212..991 FT /product="Crack-1_CQ_1p" FT /translation="MTGSDRRSINSKGGKNGKSANSETANSTVSQQLQEIS FT AKLQKLDDLEKLNKDLYGKLLQLAHRVDVVSAENRLLKREIEDLRQDRLQE FT EVLIKGFKFERPNQHLEVFRGVCERAGFDTPDDVKEVRPLVFKNNKRTDAV FT LVKFWRVEDKGRFIRCIRSLKKPIAPRDIDIRSPNKFILVQDHLTPEKHKI FT HTKAWDLCKLGLQRPWIYKGSTWITHPESNKRFRVADLGDLADIEFVLVTR FT EGTAQGSATVQSKVTVSEQ" FT CDS 1002..3986 FT /product="Crack-1_CQ_2p" FT /note="reverse transcriptase." FT /translation="MEDIETLPDFHSHNFLFDSINNYDRDFSNFHGLLIFQ FT VNARSICKISRFDQLKFVIEQLKRKPDVIVVSETWVSNDLIQLYRIDGYNS FT VFSCRKNGTGGGLVVFVLDNLTFTTNCSLELSSVPKPINFIQISISVSNNR FT SIVVSAYYRPPDDRNLTDFFNHLELILSASNSIHVLVGDINIDSSSSSNKF FT LDYLNILTSYGFLLTNTNITRPSSSSILDHVAFNHTDDFTVTNDTITNPDS FT DHNFILSLFDITPISTSAKISYKHQTNYVRLEQLLEMRFNPVYMGNFDDPN FT EFYNYFISTLTKSIQESTIITELKSKSKHICPWFSDGLKTLLNSSKNLRKK FT RNKLFAAGKSISEIDSKIHDLTNKIRTYSNTLQTQYYTNKFANATNAKDTW FT KNINDILGRKIHSKLPTEMLITDAFGNDEKINNAQLISNEFNNYFSSIGEK FT MANKIPSFPNDNINKFKTLKRPNSTFFLTPISQTEIFDIINSLKNNTACGN FT DHINAFTLKKINHIISPVLAEIFNLCIILGVYPDKLKVAKVTPIHKSGSTS FT SFNNYRPISVLSAINKVFEKLLFNRLTTFLNSNNFFCHQQYGFRERSSTFN FT AVVDLVNKIQHDLDNRDDVLGLFLDLSKAFDTVDRRILFEKLKFAGIHGVA FT LDLFKSYLTNRSQFVCFNGISSLMNLIDVGVPQGSVLGPLFFIIYLNDFSR FT LPLKGDLRLFADDSSLFYNSRLNSVNDLNLKHDLLIVSEYFRINKLTLNIN FT KTNIINIKNSSLSTPNPLIQTKIEFPDIDIVSEFKYLGILLDSRLNWSSHV FT NSLILKLNKITGIVFKIKHKLPRNVLILIYHSLFNSLLSYLTAVWGNTYDY FT LLNKLQIVQNKFLKSIFNLPLRSHTVDLYTKEKNIMPVRGIYIIQVCSFIY FT KSLSNQTHTNTKFELSSHQHGTRNHNLLKRPNVSTLSGERSITFKGANYFN FT FFFNKFGLCSSLSIFKDRLKLYLAQPEIVEKLIKSIDILS" XX SQ Sequence 4315 BP; 1339 A; 821 C; 714 G; 1439 T; 2 other; actggcagca ctgcaacctg aaaggtcggc attatttacg ctccgtaaaa taaattcgga 60 aaaattacta attmacttgt ttttcttgtg ttttgcatat ttccggtcgc tggaggacgt 120 tgcggaaccg ggaggattgg agttctggtc ctgtggtcac ttttcggtga aatcatcttc 180 aaatttcacc gtgtcgcaga ttgaggttag gatgaccgga tccgaccgac gctcgatcaa 240 ttccaaaggt gggaagaacg gaaagtcggc gaactccgag acggcgaatt ccaccgtctc 300 acagcagttg caggagattt ccgcgaagct gcaaaagctg gacgacctgg aaaagctaaa 360 caaggacctc tacgggaagc tgctgcagct tgcgcatcgg gtcgacgtcg tttccgcgga 420 aaaccgactg ctgaaaaggg aaatcgagga cctgcgccaa gatcggttgc aggaggaagt 480 ccttatcaag ggattcaagt ttgaacgtcc gaatcaacac ctggaggtgt tccgtggtgt 540 ctgcgagcgc gctgggttcg acacgccaga cgacgtgaag gaggtgcgtc cacttgtctt 600 caagaacaac aaacgtaccg acgccgtgct ggtgaagttc tggagggtcg aagacaaagg 660 tcggttcatt cggtgcatcc ggtccctcaa gaaaccgatt gctccacggg acatcgacat 720 ccggtcccca aacaagttca tcttggtcca ggatcatctc acgccggaga agcacaagat 780 acacaccaag gcctgggatc tgtgtaagct tggattgcag cgtccgtgga tctacaaggg 840 atctacctgg atcacgcacc cggagtccaa caaacgcttc cgagttgctg accttggtga 900 cttggctgac atcgagttcg tgttggttac gcgagaggga acagcccaag gcagcgcgac 960 agtccagtct aaggtaactg tatcggaaca gtaagttgtg aatggaagac attgagacat 1020 tacctgattt tcattcccat aatttccttt ttgatagcat aaataattac gaccgcgatt 1080 ttagtaactt tcatggattg ttaatctttc aagtgaatgc aaggagcatt tgtaagattt 1140 cgagatttga tcagcttaaa tttgtaatag aacagttaaa gcgtaaacct gacgtgatcg 1200 ttgtcagtga gacgtgggtt tcaaatgatc ttattcagtt ataccgtatt gatggttaca 1260 attcagtttt ttcttgtcgt aaaaatggta ctggtggggg tttggttgtg ttcgtcttgg 1320 ataatctaac ttttacaact aattgtagct tagaactttc ctctgttcct aaaccaataa 1380 actttattca aatttcaatt agtgttagta acaatagatc aatagtagta tcagcatatt 1440 atagaccacc agacgatcgt aatttgacag acttttttaa tcatttagaa ttaattttat 1500 ctgcaagtaa ttccatacat gtgctagtag gagacattaa cattgattcc agttcttctt 1560 caaataagtt ccttgattac cttaacatct tgacgtcata tggatttctc ttaaccaaca 1620 ccaacataac cagaccttcc agcagttcca ttttagacca tgttgctttt aatcataccg 1680 atgattttac tgttacaaac gatactataa ctaatcctga cagtgatcat aattttatct 1740 tatctttatt cgacattact cccatttcaa catcagctaa aatttcatac aaacatcaaa 1800 ccaactatgt ccgtttagaa cagttacttg aaatgagatt taatcctgtt tatatgggaa 1860 attttgatga tcccaatgaa ttttacaact attttatttc tactttaact aaatctattc 1920 aggagagtac cattataact gaattaaaat cgaaatctaa acatatttgt ccatggtttt 1980 cggacggtct caaaacatta ttaaattctt caaaaaattt gcgcaaaaaa agaaataagc 2040 tctttgctgc aggtaaaagt atttccgaaa ttgattcaaa aattcatgat ttaactaaca 2100 aaattcgcac ttactcaaac acattacaga ctcaatatta cactaataaa tttgctaacg 2160 ctactaatgc gaaagatact tggaaaaaca ttaatgatat tcttggacga aaaattcatt 2220 ccaaacttcc aaccgaaatg cttattactg atgcttttgg aaatgatgaa aaaataaata 2280 atgctcaatt aatatctaat gaattcaaca attatttttc ctctattgga gaaaaaatgg 2340 caaacaaaat tccttccttt cctaatgaca atatcaataa attcaaaact ttaaaacgtc 2400 ctaattctac tttctttctt acacccatta gtcaaactga aatctttgac attatcaatt 2460 cacttaaaaa taacactgct tgcggtaatg atcatattaa tgcttttaca ttaaaaaaaa 2520 taaatcatat catttctcct gtccttgctg aaatttttaa tttatgtatt atccttggtg 2580 tttaccctga caaactaaaa gttgcgaaag tgaccccaat tcataaatct ggttccactt 2640 cctcattcaa caactataga cctatttctg tgctctctgc aataaacaaa gttttcgaaa 2700 aattattatt caaccgttta actacctttt taaatagtaa taattttttc tgccatcagc 2760 aatatggatt tagggaaaga tcatcaactt ttaatgctgt tgttgatctt gtgaataaaa 2820 ttcaacatga tcttgataat cgtgacgacg tgttgggact gtttttggac ttgtccaaag 2880 cgttcgacac ggtagataga cgtattttgt ttgaaaaact taaatttgca ggtattcacg 2940 gtgttgcttt agacttgttc aaaagttatc ttacaaatcg ttctcaattt gtttgtttta 3000 atggaataag tagtttaatg aatttaattg atgttggagt ccctcaaggt tctgtacttg 3060 gtcctttgtt ttttattatt tatcttaatg acttttctcg tttacctctt aaaggggatc 3120 ttagattatt tgccgacgat tcttctttat tttataacag tagactaaat tctgtaaatg 3180 atttaaattt gaaacatgac cttttgattg tttctgaata ttttagaata aataaactta 3240 cattgaacat taataaaaca aatatcatca atatcaaaaa ttccagcctt tctacaccta 3300 acccattaat tcaaactaaa attgagtttc cagatattga tattgtatct gaatttaaat 3360 atttgggtat tcttttggac agcaggctta actggtcttc acatgttaac tcgttaattc 3420 taaaacttaa taaaataacc ggtattgttt ttaaaataaa gcataagctt ccaagaaatg 3480 ttcttattct gatctatcat tctttattca actctttact gtcttactta actgccgtct 3540 gggggaatac ttacgattat cttctaaata aacttcaaat agtacaaaac aaatttctta 3600 aatcgatttt caaccttcca cttagaagtc ataccgttga tctatatact aaagagaaaa 3660 acattatgcc tgtcagagga atttacatta ttcaagtttg ttcttttata tataaaagtt 3720 tatcwaacca aactcataca aatacaaaat tcgaactttc ttctcaccaa catggtacaa 3780 gaaatcataa tctgcttaaa cgtccaaatg tttccacttt gtctggtgaa cgtagtataa 3840 ccttcaaagg tgcaaattac ttcaatttct tttttaataa atttggctta tgttcatccc 3900 taagcatttt caaagatcgt ttgaaattgt atctagctca acctgaaata gttgagaaac 3960 ttattaaatc cattgacatt ttaagttaaa caatatttta taactattac ttcatgtttc 4020 ctttattgtt ccgatacagt tgtccattct atattttcat ttgccacatt catctcgtag 4080 gttctttttt tcttttattt tctttccagc ccgtaacagc cagttccgaa ctcctgtaaa 4140 ggtttccacc gatggagttc tgtcttcagc cgcagttcat tatttacttt ataattgtat 4200 ttatttattg tattttttta accgaaaaga agcaggtttt tatgccagca ttctagtggc 4260 ttttcctgtc caaggaaaat tgatcaaata aacaaacaat caatcaatca atcaa 4315 // ID I-9_AAe repbase; DNA; INV; 5782 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-9_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5782 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1364-1364 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 479..1774 FT /product="I-9_AAe_1p" FT /translation="METDGESSNSKEDDHSNLPPSVSKPFRVKIFPSTFLG FT PYPVYFRKKDKPINVLLISAEVYKRFKSVKEIKKISLDKLRVVFGSRGDAN FT TLLESNLFGSSYRVYAPCDSCEISGVIYDESLSCDDIKNFGLGFFKNKTIS FT PVKILDCNRLSKLTFNGNSSSYTHSNCIKIIFAGSVLPDFVSINNVSFSVR FT LYFPKLMHCTHCLLFGHTSSYCSNKPKCSKCGDSHSSSVCDKQSDVCIYCN FT QNHFSIKDCSVYKTNQQKFNQQIKFKNRCSYSEILKASGDFSSSNIYEALS FT DYDDDDLNEDSNQFIYKPPTKRKRANLLSNKHNANSDPQPSTSFDKNFPPL FT NVSSTCHTIPGFQRVENDNSFNNKNRDNVDNSSNNNDNKDILNILEQIINL FT LGFSDFWKNLIKKCLPFLAFIFEKLNSFGPLLSSLFSM" FT CDS 1777..5469 FT /product="I-9_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MATFNKLNILQWNCRSIIPKIDRLKALLFNHNIDIFC FT LNETWLVECKHFRIPTFNIIRKDRNVSAGGVLIGVRGNIQFKYLNLPFNFA FT IEYIAISVKHLGKEFSIICFYIPPNSPFSLSEIKTILDNVPSPFYILGDFN FT AHNSAWGSRNSDGRGNLIMDLVDELNLNILNDGSFTRIAVPPAHHTCIDLS FT LCSNSLTFLSSWKIINDPNGSDHLPILIEICNNDNKQTNNESLKPDLCRNV FT NWLKFSDLISFSLTHFDYSLSPLDNYKQFCEIIIKCLIKSQKYKISPISSR FT KKCPSFWWDSECSMALKNKSEAFKKFRRFGSRENYFLYCKAEAKFIRITKF FT KKRNYWRNFIENLDRESSLSSLWSVARNLRNCNSSSPNIQEYSEEWINKFA FT SKICPDFVPCSINFKNCQRYNYFPELSASFSLGELDLALSVTKNTSPGIDN FT IKFIVLQNLPHDGKIHLLELYNSFLFQNILPLEWRSIKVVSIVKSGKDPSL FT ADSRRPISLLSCLRKLMERMLLNRLELWAESNKIFSSSQYGFRKGCSTRDC FT TALLASQINLSFNKKQDMVSTFLDVSGAYDSVLLDLLFEKLKNFNIPNIIA FT NFLYNLFSFKIMHFFHNGSSKLIRYSYFGLPQGSCLSPFLYNLFTSDISSV FT IPNGCYFYQFADDKVISISGNNREIIRHFMQCALNNIEIWANNNGFSFSVS FT KTKFILFSRKRSIVNINLYLNGHEIEQVDDYKYLGIWFDSKLLWKKHIQYI FT QIICAKRINFLRTITGTWWGAHPTDLITLYKTTIRSIIEYGCFTFVNASQS FT HFCKLEKIQFRCLRICLKLMNSTHTQSIEVLAGVIPLKTRLHELNCKFMLN FT CFMKKHPIIDVLKCLNDINPTCKILDSYKYCSSLNIVPTTISTFNYHNFDI FT NVHSFHPMIDLSLYEELKQIPSDEYFRFAPLFFRRKFVGLDSHQLYFSDGS FT LIQNVAGFGVYNYYSAYFFKLQSPCSVFIAELTALYFTCSLIKKCSPNIFI FT ICSDSLSCLKALNSVNFNSKTHHIMLLLKKELCHLNSQGYIIKFIWVPAHS FT NIYGNEQADSLAKLGVRCGSMFNRQISSSEYFADLKQYSLNNWQLSWDFSD FT KGRWCHSILPNVSRFSWFKNFAVGRNFICTLSRLISNHYICNSYLYRININ FT DSNLCDCGVAYEDIDHIVFHCTRFAIPRIAFFRNLKIYHRLLPQSVRDILG FT SKFLPSLKILYRFLNDALYYV" XX SQ Sequence 5782 BP; 1812 A; 800 C; 878 G; 2292 T; 0 other; tcattgctgc agtaggcctt gactaggaaa cacgtatttt ttctgcacct gacatttttc 60 tctttttttc aggaggaaaa aaatctcgga gtacggatat tgaattggtt tcgtgtttca 120 tggtggcgaa tgtttttgga tgcgaggaaa gcgattcaag attcgattac gttcgttgat 180 tggtggggcg ggtttcatgg tggcgaatgt ttttggatgc gaggaaagcg attcaagatt 240 ctgattacgt tcgatgattg gatttgaaga gagcgatcga agttttctga atcaagttca 300 gattgtgggc tttgggaagt tttcaagctt gaggtgattt ctctcaagaa agaattcatc 360 aagttctcta ttgaatttga taagtataat tttatttact ttcttatttt ataagtttta 420 gttgttatta cttctcgatt gttgattatt tatttatatt ccccgttgtt ttcctaatat 480 ggaaactgac ggggaatcgt cgaattctaa agaggatgat cattctaatc ttcctccttc 540 agtttccaaa ccatttagag tcaaaatttt cccatctact tttcttggtc cttatcctgt 600 ttattttaga aaaaaggata agccaattaa tgttctcttg atttctgcag aagtttataa 660 aagatttaaa tccgttaaag aaatcaagaa aatttcgtta gataaattaa gggtagtttt 720 tggttctcgt ggagatgcca atactctcct ggaatccaac ttatttggta gttcataccg 780 tgtatatgcc ccttgtgact catgtgagat tagtggcgtc atttatgacg aatctttgag 840 ttgcgatgac attaaaaatt ttggtttagg ttttttcaaa aataaaacca tttctcccgt 900 caaaatttta gattgcaatc gtttatctaa attaacgttc aatggcaaca gttctagtta 960 tactcattct aattgtataa aaataatatt tgccggatct gttcttccgg attttgtttc 1020 aattaataac gtatcattta gtgtaagact ttattttcct aaacttatgc attgcactca 1080 ttgtctttta tttggacaca cttcttccta ttgttcaaat aaacctaaat gttccaaatg 1140 tggagattct cattcttcat ctgtttgcga taaacaatct gatgtttgta tttattgtaa 1200 tcagaatcat ttttcaatta aagactgttc agtttataaa actaatcaac aaaaattcaa 1260 ccaacaaatt aaatttaaaa atcgatgttc ttattcagag attttgaaag catctggtga 1320 ttttagttct tccaatattt atgaagcatt atctgattat gatgatgatg acttgaatga 1380 ggattctaat caatttatat ataaacctcc tactaaaaga aaaagggcta atttattatc 1440 taacaaacat aatgcaaatt ctgatcctca accatcaaca tcttttgata aaaactttcc 1500 tccacttaat gtttcttcca cttgccatac gattcctggg tttcaaagag tagagaatga 1560 caattccttt aataataaaa atagggataa tgttgataat tcatccaaca ataatgataa 1620 taaagacatt ttgaatattt tggaacaaat aataaattta ttaggattca gtgatttttg 1680 gaaaaatttg attaaaaaat gtttaccttt tttagcgttt atttttgaaa agttaaattc 1740 ttttggaccc ctcctttctt cattattttc aatgtaatgg ctactttcaa taaattaaat 1800 atattacaat ggaattgtcg tagcattata cccaaaattg atagattaaa agctctgtta 1860 tttaatcaca atatagatat attttgttta aatgaaactt ggttagtgga atgtaaacac 1920 tttcgtatcc ctacttttaa tattattaga aaagatagaa atgtttcagc tggaggagtg 1980 ttgattggtg ttcgtggaaa tattcaattt aaatatttga atttaccatt taattttgca 2040 attgaatata ttgcgatatc tgttaaacat ttgggtaagg aattttctat catatgtttt 2100 tatattcctc caaattctcc tttttcatta tcagaaataa agactatatt agacaatgta 2160 ccttctccat tttacatatt aggagatttt aatgcccata attctgcttg gggtagtagg 2220 aatagtgatg gtagaggtaa cttgatcatg gatttggttg atgaattgaa tttgaatatt 2280 ctcaatgatg gatcatttac cagaattgct gttcctcctg ctcatcatac ttgtattgat 2340 ttatctcttt gttctaatag tttaacattc ctttcttctt ggaaaataat taatgatcct 2400 aatggtagtg atcatttacc tattttgatc gaaatttgta ataatgacaa taaacaaaca 2460 aataatgaat ctcttaagcc tgatttatgt agaaatgtta attggttaaa attttcggat 2520 cttatttcat tttcgttaac tcattttgat tattcattat ctccactcga taattataaa 2580 caattttgcg agattataat taaatgttta ataaaatctc aaaaatacaa aatatctcca 2640 atttcttcta gaaaaaaatg tccttcattt tggtgggata gtgaatgttc tatggcttta 2700 aaaaataaat ctgaagcctt taaaaaattt cgtcgttttg gatcaaggga aaattatttt 2760 ttatattgta aagctgaagc taaatttatt cgaattacta aatttaaaaa aaggaattat 2820 tggagaaatt ttatagaaaa tcttgataga gaatcttctc tatcttctct ttggtctgtt 2880 gctagaaatt taagaaattg taattcttct tctccaaata ttcaggaata ttctgaagaa 2940 tggattaata aatttgcctc taaaatttgt cctgattttg ttccatgttc cataaatttt 3000 aaaaattgtc agagatataa ttattttcct gaacttagtg cttcattttc attaggagaa 3060 ttagatttag cattatctgt tacaaaaaac acatcaccgg gaattgataa tattaaattt 3120 attgttttgc aaaatttacc acatgatgga aaaatacatt tgttagagtt gtacaattca 3180 ttcctttttc aaaatattct acctttggaa tggcgttcta ttaaggttgt cagtattgta 3240 aaatcaggca aggatccttc attagctgat agtcgtagac caataagttt attgtcatgt 3300 cttcgtaagc ttatggaaag aatgttactt aatcgtttag aattatgggc tgaaagtaat 3360 aaaatttttt cttcttccca atatggtttc agaaaaggtt gtagcactcg tgattgcact 3420 gctcttctag cttcacaaat taatctttcg tttaataaga agcaggatat ggtttcaact 3480 tttcttgatg tttctggggc ttatgattcc gtgttattag atctactttt tgaaaaatta 3540 aaaaatttca atattccaaa tataattgcc aattttttat ataatttatt ttctttcaaa 3600 attatgcatt tttttcataa tggctcttcc aaattgatac gatatagtta ttttggtctt 3660 cctcaagggt cttgtttgag tccatttcta tataatttat ttaccagtga catatcatct 3720 gttattccaa atggatgtta tttttatcaa tttgccgatg ataaagttat ttctatcagc 3780 ggtaataata gagaaattat tcgtcatttt atgcaatgtg cattaaacaa tattgaaatt 3840 tgggcaaaca ataatggttt ttctttttct gtttctaaaa ctaaatttat tttattttct 3900 cgtaaacgtt ctattgttaa tataaattta tatcttaatg gtcacgaaat tgaacaagtt 3960 gatgattata aatatctagg aatatggttt gattcaaaat tattgtggaa aaaacatatt 4020 caatatatcc agataatttg tgcaaaaaga ataaatttcc ttagaactat aacaggtact 4080 tggtgggggg ctcatcctac tgatttaatt acactttaca aaacgaccat tagatctatt 4140 attgaatatg gatgttttac ttttgtaaat gcaagtcaat cgcatttttg taagctagaa 4200 aagattcaat ttcggtgtct aagaatttgt ttaaaattaa tgaattcaac tcacactcaa 4260 tctattgagg ttttagctgg agtgattcca cttaaaactc gcttacatga attaaattgt 4320 aaattcatgt tgaattgttt tatgaaaaaa cacccaataa ttgatgtttt gaaatgttta 4380 aatgatataa atccaacttg caaaatttta gattcataca aatattgttc ttctttgaat 4440 attgttccta caacaatttc tacattcaat tatcataatt ttgatattaa tgttcactcg 4500 ttccatccaa tgatcgattt atcgttatat gaagaattaa aacaaattcc tagtgatgag 4560 tattttcgct tcgctccttt attttttcgg cgtaaatttg ttggattaga ttcacatcaa 4620 ctttactttt ctgatggatc attaatccaa aatgttgctg gttttggagt ttataattat 4680 tattctgctt attttttcaa attacaatct ccttgctcag ttttcatagc tgaactaact 4740 gccttgtatt ttacttgttc tttgattaaa aaatgttctc caaatatttt tattatatgt 4800 tcagacagtt taagttgttt gaaagcactc aattccgtta atttcaattc taaaactcat 4860 cacattatgt tgttgctgaa aaaagaatta tgtcatttga attctcaggg ttacattatt 4920 aaatttattt gggttcctgc tcattctaat atttatggta atgagcaagc tgattcttta 4980 gcaaaattag gggttcgttg tggcagtatg ttcaacagac aaatttcttc gtctgaatat 5040 tttgcagatt taaaacaata ttctttaaat aattggcagc tttcatggga ctttagtgat 5100 aaaggacgat ggtgtcattc aattttacct aatgttagtc gatttagttg gtttaaaaat 5160 tttgctgtcg gaagaaattt tatttgtaca ttgtcgaggc ttatatccaa tcattatatt 5220 tgtaatagtt atttatatcg tatcaatatc aatgattcta atttgtgtga ttgtggtgtt 5280 gcttatgagg atattgacca tattgttttt cattgtacgc gttttgctat acctagaatt 5340 gctttcttta gaaatttgaa aatatatcat cgtttgctcc ctcaatctgt tcgtgatatt 5400 ctgggaagta aattcttacc ctcgttaaaa atactttata gatttttaaa tgatgctttg 5460 tattacgtat gattctgctt ctttcttttt tttctttttt caggtttgat ggaaacattt 5520 tcaacaaact catttaagga tggcatcaag tagttttcga gtcatggccc ttttttcaag 5580 atttttggct ccgtaatgga taaatgccgc ctgagcctat agatttatat tttattatta 5640 ttaccatttt gtaatgttat tttgaaaaga taaagaggtt ttgtgccttt ttgagaagaa 5700 tttcaaaagg aaatcactca aaggggcttt tccctctttc aaaattttta gttaaaaaat 5760 aaataaataa ataaataaat aa 5782 // ID Copia-4_DPu-LTR repbase; DNA; INV; 600 BP. XX AC scaffold_73; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_DPu_; KW Copia-4_DPu-LTR; Copia-4_DPu-I. XX NM Copia-4_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-600 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 672-672 (2010). XX DR Genome; scaffold_73; Positions 225900 226499. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 600 BP; 138 A; 150 C; 97 G; 215 T; 0 other; tgtcggaagt ccaaacactt ctacttttga gcgccagatg gcaccacgca gaaatgcgca 60 aatgtgaagt ccccggcctt accatttgtg ctgtaggagg ctcttcccct ttatctctct 120 ctctcccttt tcattgttct cactcacgcg acgaccacgt tgcgagaagg ttctctcagc 180 aatctgtatt ttagttcctt tcatctttag tttttctcat tcagaatcac tggatcaaat 240 ttattctttc ttgtgagaaa tcgctgtgta atactcaaca actggtaatt acttgtttat 300 catttcacgt atatgtacat gtgctacgtc tccatttccc ccagcaatag agtgttatca 360 cttgagctcc atttccccta gcaatagagt gttatcactt gagctccgtt tattcttgtt 420 acacatgtac ccgtcacgtg tttatacaaa taattttcca cgttgatctc gtgattcccc 480 ttatgtcgaa ttatcaccat tgtctaagtt gattcatttt tgtctccaca catgtggaaa 540 gcctgttctc aatacagaag cctctaagtc gattcttttg tttcacgtgt atttccaaca 600 // ID Copia-122_AA-I repbase; DNA; INV; 4095 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-122_AA_; KW Copia-122_AA-LTR; Ty1_copia_Ele101; Copia-122_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4095 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1472-2011] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 47..2356 FT /product="Copia-122_AA-I_2p" FT /translation="MEKEDFVRVPLFDGSNYPAWKYRMQVVLEEHDLIDCI FT EREMAEMDELEVKPEDNDAAQRAKAKEAEKRKKQERRCKSLLISRIHDSQL FT EYVQDHQTPKAIWLALQRVFERKSVASRLHLKRKMLTLRHEGGSLSEHFLI FT FDKIVREYKSTGAQLDDLDVVCHLLLTLGSSFATVVTALETMPEDTLTLEF FT VKCRLLDEEIKQQGGGVKASSEPAAFAGSSGKQRQRQQQQKKKWKCFGCQQ FT EGHKISECPEKNKKKNKDSKKSSAYFGEDGGGVCFLSGQSLPNKQQVAWVV FT DSGSSEHLTNDRALFHQLTPMQDPMTIAVAKEGESIVAKHQGEVRLFSVVH FT GKSIPVCLKNVLYIPDARVNLLSVRKMEMAGLKVTFADGKVVVLRGSEVVA FT VGERRGKLYELNFSKSDCANDSSFYSCGRVPKELETWHRRFGHLSASGLEQ FT LVRNEMVVGLKANVRKGSEEIICESCVAGKQTRRPFITGEGRQSRRVLELI FT HSDVCGPVTPVGVGGEKYFVTFIDDWSRFTVVFLISSKDQVLGAFRNYVAA FT VSAKFGSKISRLRCDNGGEFKNGAFLNFCKQKGIQVEFTVPYTPEQNGVSE FT RKNRTLVEMARAMLEDSEADRRFWGAAIQTAAYLTNRSPSRALDPKVTPFE FT IWEGRKPNVEKFRAFGVDVHVHIPKEHRQKLDAKSWKGVFVGYSPNGYRIW FT DPKRKRIVVARDVIFVEEKVVGAKPPGQRTGGCSDGDVVRVPVGTDTRTQC FT DESDTESDGRHGGAGSDRA" FT CDS 2359..4014 FT /product="Copia-122_AA-I_1p" FT /translation="MAWPEEDSFDSCVDDTIREESEPEPVGHSRPVRNRSA FT PAWHGDYELDMTGFALNAASFVEDLPCSIAEMRKRKDWPKWEVAVREEMDA FT LSRNHTWDLVKLPEGRVPISCKWIFKVKRADDGRGDHYKARLVARGFTQRH FT GFDYSETYSPTAKLDTLRTVLAVANHERMVIHQMDVKSAFLNGSIKEEIFM FT TQPEGFNAGNGLVCRLNRSLYGLKQASRAWNERFHVFAEKLGFRRSSSDKC FT LYIRESGGHKLFLIIYVDDVLLVGHQLKAIQVVKQCLSKEFEMKDIGEVNC FT FLGMKIERKVEQRVLRISQRAFLERLLQRFNMSDCKPVSTPIENRLRLQRG FT EESKRTDKPYRELVGCLVYVTLTTRPDLSAAVNFYSQFQSCPTEEHWRHLK FT RVLRYIRGTLDLGLQFEGDDNAPVIEAYSDADWGNDTTDRRSLTGYVFRVY FT GCTTSWLTRKQSTVSLSSTEAELIALCVAVCHGTWVMRLLEDLGIKPDGPV FT VYHEDNQSAIRVVEEERDSGRLKHIDIKFCFVRDLIQRGLIAMRYVPRRIC FT NRQIS" XX SQ Sequence 4095 BP; 1046 A; 828 C; 1282 G; 938 T; 1 other; gtgctacgcg gaacgtgaaa tccgggtgtg aattgttgtg aaaacgatgg agaaagaaga 60 ttttgtgcgg gtgcctttgt ttgacggcag caactatccg gcatggaagt accggatgca 120 ggtggtgcta gaagagcacg atttaatcga ctgcatcgag cgggaaatgg ccgagatgga 180 cgagctggaa gtcaaaccgg aagacaacga tgctgctcag cgggcgaagg caaaagaggc 240 tgagaagcgg aagaagcagg agaggcgctg caaatctttg ttgatctcca ggattcacga 300 cagccagctg gaatacgtgc aagatcacca gacaccgaaa gcaatttggt tggctctcca 360 acgggtgttc gagcggaaga gtgtggccag ccggctgcat ttaaaacgga agatgcttac 420 gttgcgccac gaaggtggat ccctaagtga gcatttcctg atcttcgata agattgtccg 480 ggaatacaaa agtaccggtg cgcagctgga tgatctcgat gtggtttgcc atctcctctt 540 gacgttgggg tcgtcttttg cgactgtcgt aacagctttg gaaaccatgc cggaggatac 600 cttgacgctg gaatttgtca agtgcagatt gcttgacgag gaaatcaagc agcaaggtgg 660 aggtgtgaag gccagctcgg agccggcagc gtttgctggt tcaagtggaa agcaacggca 720 gcggcagcag cagcagaaga aaaagtggaa gtgttttgga tgccaacaag agggccacaa 780 aataagtgag tgtccggaga agaataagaa gaagaacaaa gattcgaaga aaagcagtgc 840 gtatttcgga gaagatgggg gtggtgtgtg ctttttgagt gggcaatctt tgccaaacaa 900 gcagcaggtg gcatgggtcg ttgactccgg atcgtcggag catttgacaa acgatcgtgc 960 gctgttccac cagcttactc cgatgcagga cccgatgacg attgctgtgg ccaaagaagg 1020 cgaatctatt gttgcgaaac atcaaggtga ggtgagactg ttctccgtcg ttcatggtaa 1080 gtcaatccca gtttgcctta agaatgtact ttacatacct gacgctcgag tgaatctgct 1140 ttcggtgcgg aaaatggaaa tggcagggtt aaaagtgaca tttgcggatg ggaaagtggt 1200 agttttgcgc ggatcagagg tagttgcggt tggtgaacga cggggcaaac tatatgaact 1260 gaacttttcc aaaagtgact gcgcaaatga ttcgtccttc tattcgtgtg gccgggtgcc 1320 aaaagaactg gaaacatggc acaggcgttt tggccatctg agtgcgagtg gattggagca 1380 gcttgtgcga aacgagatgg tggttgggct aaaagcaaat gttcggaagg gtagcgaaga 1440 gattatctgt gagtcttgtg ttgctgggaa gcagacccgg cggccgttca ttactggtga 1500 gggtaggcag tcaaggagag tattggagct tatccactct gatgtctgtg gtcccgttac 1560 tccggtgggc gtcggcggcg agaagtattt cgttactttt atcgacgact ggagtcgatt 1620 cacagttgtg tttttgatta gctcaaagga ccaagttctc ggtgcgttcc gaaactatgt 1680 tgcggcggtg tctgcaaagt ttggcagcaa gatttctcgt ttgcgatgcg acaacggtgg 1740 agaatttaag aacggcgctt tcttgaactt ctgtaagcag aaaggtatac aggtcgagtt 1800 cacggttcca tacacgccgg aacagaacgg ggtcagtgag cgcaaaaacc gaacgctggt 1860 cgagatggcg cgtgccatgt tagaggattc cgaggccgac cgaagatttt ggggtgcagc 1920 aatccagaca gcagcttatt tgaccaaccg tagcccttcg cgggcgctcg atccaaaagt 1980 gacaccgttc gagatttggg aaggacgcaa accgaatgtg gagaagtttc gtgctttcgg 2040 agttgatgtg cacgttcata ttccgaagga acatcgacag aaacttgatg ccaagtcgtg 2100 gaaaggagtg ttcgtgggct attcccccaa cgggtacagg atctgggatc caaagaggaa 2160 gcgcatcgtt gtggctcgag acgtgatttt cgtcgaagag aaggtagtcg gagccaagcc 2220 gcccggtcag cgcactggcg gttgttctga cggtgatgtg gttcgtgtac cagttggtac 2280 ggacacgcgg acacagtgcg atgaaagtga cacggaaagc gacgggcggc atggaggcgc 2340 gggcagtgat cgggcataat ggcgtggcct gaggaagatt cgtttgacag ttgcgttgac 2400 gacacgatcc gtgaagagag tgaaccggag ccggtggggc attcgcgtcc agtgcgaaat 2460 cgttcagcgc cagcgtggca tggtgattac gagttagata tgactggatt cgccctgaat 2520 gcggcgagct tcgtcgagga tttgccgtgt tccatagcgg aaatgcggaa acgcaaagac 2580 tggccgaagt gggaagttgc ggttcgcgaa gaaatggacg ctttgagccg gaatcataca 2640 tgggatttgg tgaagcttcc ggagggacgt gtgcccattt cctgcaagtg gatttttaag 2700 gtgaagcgtg ctgacgacgg tcgtggagat cactacaagg cgaggctggt ggccagaggg 2760 ttcacccagc gacatggatt cgactattcc gaaacgtatt cgccgactgc gaagctggat 2820 acgttgcgca cggtgctggc agttgcgaat cacgagcgaa tggtaatcca tcaaatggat 2880 gttaaaagtg cgttcctgaa cggatcaata aaggaggaaa ttttcatgac ccagccagaa 2940 ggattcaatg cagggaacgg gctcgtgtgt cggctgaaca gatcgttgta cggcctcaag 3000 caggcgtcga gggcgtggaa cgagaggttc catgttttcg ctgagaagtt ggggtttagg 3060 agaagttcga gtgacaagtg cttatacata agagaatccg gtggacataa gctgttcctg 3120 atcatatacg ttgatgacgt gctgttggtt ggacatcagc tgaaggcaat ccaggtagtg 3180 aaacagtgct tatcgaagga atttgaaatg aaggacatcg gcgaagtgaa ctgtttcctg 3240 ggcatgaaga ttgagagaaa ggttgagcaa agagttctgc ggatcagtca acgtgcattc 3300 ctggagagac tactgcagcg gttcaacatg agtgactgca agcccgtatc aacaccaatc 3360 gaaaaccgtt tacgcctgca gcgtggcgaa gagtcaaagc gcacagataa gccataccgt 3420 gagttagtgg gatgtttggt ttatgtgact ttgacaaccc gaccggactt gagtgcggct 3480 gtgaattttt acagccaatt tcagagctgc ccgacagagg agcattggcg gcatctgaaa 3540 cgcgtattac gatacattcg tggtactttg gatttgggac tacaattcga aggcgacgac 3600 aatgctccgg tgattgaagc atactccgat gctgactggg gcaacgatac cacggacagg 3660 cgatcgttga ccggctatgt ttttcgagtc tacggatgta caacaagctg gttaacacgg 3720 aagcaatcga ccgtttcgct ctcatcaacc gaggcagagc taattgcgtt gtgtgtggca 3780 gtgtgtcatg gtacttgggt aatgcgtttg ttagaagatc ttggtatcaa accggatggt 3840 ccagtagtkt accatgagga caatcaatcg gccatacgtg tggtggaaga agaaagagat 3900 agtggccggc tgaagcacat tgacatcaag ttctgcttcg tacgagatct cattcagcgg 3960 gggctgatcg cgatgcgata tgtgccaagg cgaatctgca accggcagat atcatgacta 4020 aggggttacc agcgaagctg ttcctgcaac atcgcacggc tctggggatg cgagtttccg 4080 gaaattgagc ggggg 4095 // ID DNA8-91_AP repbase; DNA; INV; 687 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-91_AP. XX NM DNA8-91_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-687 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2028-2028 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 687 BP; 260 A; 81 C; 65 G; 274 T; 7 other; cagtcttgtt aagtattcga atacttgtat tcgaatactt tttttgtatt tcaaatactt 60 tttaaatact ttttgaccaa tgtattttaa atacttaaaa aatacttttt atttgtattt 120 xtattctaga atactaaata cttttttxca ttcttatgtt ctactccaaa ttccaatttc 180 ataatttgtc gtcataatac tccattgcct agcagtgctc ctattgaacg ggtatttagt 240 attagtaatg ccatactaac aaaacgtcga ggaaagatgg acgataacac attxgaxaaa 300 acaatttttt txaaatacaa ttcaaatata atgcaattca tatctattaa aatatacaat 360 ctgtctacta ttatattata atatttgtgt gctttattat tataattatt tattatatat 420 attaataggt atataaattg ctgaatttta acattccata attaggtaag taxtaactaa 480 acaaaactaa atatttcaat ataaaattta taaataacta aaaaaaaagt atttgttgaa 540 gtattttgaa tacttttaaa aagtatttgt atttatattt aaatactttt aaaagaaagt 600 attcgaatac gtatttcaaa tacaaxtgac accaagtatt taaatacgta ttctgaatac 660 atttgaaaag tattcttaac aagactg 687 // ID Gypsy19-I_Dpse repbase; DNA; INV; 5115 BP. XX AC Unknown_singleton_87; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy19_Dpse; KW Gypsy19-LTR_Dpse; Gypsy19-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5115 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1112-1112 (2009). XX DR Genome; Unknown_singleton_87; Positions 7924 2810. XX CC Positions [4126-4611] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 100..5025 FT /product="Gypsy19-I_Dpse_1p" FT /translation="MAEETMANIDKEKIVEWLLEKEIIFPASATLRQLRKL FT AISGGMDEVSESLVNKSDIESEEKVSENEQIDDMKEEELLDAAIRVAEKKK FT KLAALMVMDNHVTDDIRMVKQLIAPFSASENDEASEWVLNFERICGGVNES FT SNFKLRCVRMLMKAGTEADLFMRVDKSKTYDEFKGNFLKTFGRSNSTADVV FT LLLKETFFNPEKNSVMGYILRMEEIAMRADIEEKLTVQFIIDGFRDRSSSI FT ALLYSASTIGKLKELARQYEALRKSTQTNFRRTGMTASGNDRSQARCYNCS FT AHGHYASSCPAPKREKGSCFQCGSMQHQVKDCQQKPMSAQRSVGTASNPPM FT DDREENNIFVPIVNQVGVYFYTDIKRRYQIKFLSGMLDTGSAINLMQRSAI FT PYENFDGVLRPTEYFGVNGSRISIFGKIIVNLVFHNIEKELIFFVVPNNYI FT SYNVLLGRSFLENYGIKLIFENEIEIENRVLVNSDLDENRVLVNNDLDENR FT VLVNSDLDENRVLVNSDLDENRVLVNSDLDENRVLVNSDLDENRILVNSDL FT DENRVLVNSDLDENRVLVNIDLDENRVLVNSDLDENRVLVNSDLDENRVLV FT NSDLDENRVLVNSDLDENRILVNSDLDENRILVNSDLDENRVLVNIESDQI FT ENVVLTNRDSEMMLSDINNNGLLKDENYENKISVQLNCNNIDLLLKDNEQK FT HMASESLERTNLDIPYIYSIEITKEEDSYNIDPNRTDKERSKFMELIKLNY FT SDLTNIPAIQHNYEMNLKLKSEEPVRTVPRRLSYQEKKEVDTQIDELLKKG FT FIRESKSPYSSAIVLVKKKSGEKRMCVDYKQLNKICIRDGYPLPLIDDCLE FT RLEGNTIFTLLDLKHSYHQVKVAESSVQYTAFVIPSGQYEYTRMPFGLMNA FT PAVFMRFINFILQPLIREGNVVVYMDDIAIGSKSLNEHLITVGRVLRILAE FT YRLEIKVSKCQFGYESIEFLGYTLSGKGISPNCEHTATIKHLPLPKDRHEM FT QKVLGLFSYFRRFIPAYSRIAKPLQELTKPGTKFELQDKCIQIFEYLREKL FT ISSPVLALYNPNRETELHCDASSEGFGGILLQKQDDNKFHPTAYFSQRATK FT LEARYESFKLETLAVIYSLRKFRIYLEGIPFRIITDCNSVVLCLGKGRLHS FT NIARWAIELANYNYTLKHRSGKYMSHVDTLSRYPSALNQDFEYEPIVTIHE FT SDNKDRIVSVIECDDINLQLQITQNRDNHIGKLKEQLEHGPVKKFSLSDGI FT VYKIGENDVELFYVPAEMETQIIRRVHENLCHLGTEKCMKEILRHYWFPNM FT RSKIDLFIGNCLNCIMFSVPTHANNRTLHNIPKRPIPFDTLHVDHFGPLPA FT VISKKKHVLVIIDAFTKFVKLFSVNTTSTKEVNTCFSKYFQFYSRPRRIIS FT DRGTCFTSLEFSKFLLENNIEHVKVATASPQANGQVERVNRTMKAMLAKIT FT EPINHEDWSKLLIKAEYAINNSVHSTTKELPSVLLFGVQQRGCNIDALTEY FT LDDKQDRTQSDLESVRKKSLDKIEHCQQKSEEYFERNHKNHIEFLEGDFVV FT MRNIDTTIGTNKKFVPKYRGPYIIKKVLPNDRYVLNDIENCQISQIPYEGI FT IEACRLRKWADWRNQCESDPYLNS" XX SQ Sequence 5115 BP; 1834 A; 798 C; 1072 G; 1411 T; 0 other; ataacttcag aagtgggata gtagcccttt aaagaatctg cctacataag tatttgtcta 60 ctggcatccc cgacaacgag tattttccac attttataaa tggctgagga aaccatggca 120 aacattgaca aagaaaaaat tgtagagtgg ttgctggaga aagagatcat ttttccagcc 180 agcgcaacac taaggcaact tcgcaagctt gccataagtg gtggaatgga tgaagtttct 240 gaaagtctgg tgaataaatc ggatattgaa tcggaggaga aagtgtctga aaatgagcag 300 atcgacgata tgaaagaaga agaattactt gacgccgcaa taagggttgc tgagaaaaag 360 aagaaactgg ctgccttaat ggtaatggac aaccatgtga ccgatgacat ccgaatggtc 420 aagcaactga ttgccccttt ttctgccagt gaaaatgatg aagcctctga atgggtcttg 480 aattttgaac ggatttgcgg aggcgtgaac gaaagcagca attttaagct tcgttgtgtg 540 cgcatgttga tgaaggcggg aacagaggcg gacttgttca tgcgtgtcga caaatcaaag 600 acttacgacg agtttaaagg aaacttccta aaaacttttg gccgcagcaa ttcaactgcc 660 gacgtggtac tcctgttgaa ggaaaccttt ttcaacccgg agaaaaactc cgtaatggga 720 tatattcttc gcatggaaga aattgctatg cgtgccgaca tcgaagagaa attgacagtg 780 cagttcatca ttgatggttt tcgtgatcga tcttccagta tcgccttatt gtattcagcg 840 tcgaccatcg gaaaacttaa ggagctagct cgacaatacg aggctttgag gaagagtacc 900 cagacgaatt ttaggcgcac tgggatgact gcttccggaa atgaccggag ccaggcgcgc 960 tgctataatt gttcagcaca tggccattat gcttcatcgt gtcccgcacc taagcgggag 1020 aagggatcgt gtttccagtg tggatccatg cagcaccagg tgaaggattg ccaacagaag 1080 cctatgtcag cccagcgatc cgtgggtaca gccagcaacc cgccgatgga cgacagagag 1140 gagaacaata tttttgttcc tattgttaat caggtagggg tatattttta cactgatata 1200 aaacgcaggt atcaaattaa atttctttct ggcatgttgg acaccggtag cgctattaat 1260 ttgatgcagc gctcggcgat accttatgaa aattttgatg gtgtactaag accgacagag 1320 tattttggag ttaacggttc gagaataagc atttttggaa aaataattgt caatttagtt 1380 tttcataaca ttgaaaaaga acttatattt ttcgttgtac caaacaacta tatttcgtat 1440 aatgtccttc tggggcgcag tttccttgaa aattatggaa taaaattaat ttttgaaaat 1500 gaaattgaaa ttgaaaatag agtattagtg aacagtgact tagatgaaaa tagagtatta 1560 gtgaacaatg acttagatga aaatagagta ttagtgaaca gtgacttaga tgaaaataga 1620 gtattagtga acagtgactt agatgaaaat agagtattag tgaacagtga cttagatgaa 1680 aatagagtat tagtgaacag tgacttagat gaaaatagaa tattagtgaa cagtgactta 1740 gatgaaaata gagtattagt gaacagtgac ttagatgaaa atagagtatt agtgaacatt 1800 gacttagatg aaaatagagt attagtgaac agtgacttag atgaaaatag agtattagtg 1860 aacagtgact tagatgaaaa tagagtatta gtgaacagtg acttagatga aaatagagta 1920 ttagtgaaca gtgacttaga tgaaaataga atattagtga acagtgactt agatgaaaat 1980 agaatattag tgaacagtga cttagatgaa aatagagtat tagtgaacat tgagagcgac 2040 caaattgaaa atgtagtatt gacaaataga gacagtgaga tgatgttatc agatatcaat 2100 aataatggat tattgaagga tgaaaattat gagaataaga ttagtgtcca attgaattgt 2160 aataacattg accttttatt gaaagataac gagcaaaagc atatggcaag tgaatcatta 2220 gaaagaacca acttagatat accgtatata tatagtattg aaataacaaa agaagaggac 2280 agctataata tcgacccaaa tcgaactgat aaagaacgta gtaaattcat ggaattgata 2340 aaacttaact attcagactt aactaacatt ccagcaattc aacataacta tgaaatgaac 2400 ctcaaactta aatcagaaga acctgtacgc actgttccga gaagactttc atatcaggag 2460 aagaaagaag tagataccca aatagatgaa ctattgaaaa aaggcttcat aagagaaagt 2520 aaatcgcctt acagctctgc aatagtatta gttaagaaaa aatcagggga aaaaagaatg 2580 tgtgtagatt acaaacagct taataagatt tgcattcgtg atggatatcc attgccacta 2640 atagacgact gcttagagag attagaaggg aacacaattt ttacactttt agatttgaaa 2700 cacagttatc accaagtaaa agtagcagaa agttcagtgc agtatacagc atttgtcata 2760 cctagcggac aatatgaata cacaagaatg ccttttgggc ttatgaatgc accagcagta 2820 tttatgagat ttattaattt cattttacaa ccattaattc gcgaaggtaa tgtagtagtt 2880 tatatggacg atatcgctat aggttccaag tcgttgaacg aacatcttat aacagtaggt 2940 agagttttaa gaattctcgc agaatatcga ctagaaatta aggttagtaa atgtcaattt 3000 ggttatgaaa gtattgagtt cttgggatat accctttctg gtaagggaat aagcccaaac 3060 tgtgaacaca cagcaaccat taagcatctg ccattgccaa aagatcgcca tgaaatgcaa 3120 aaggttttgg gattattctc ttattttagg cgtttcattc ctgcttactc aagaatagca 3180 aaaccattac aagaacttac aaaacctgga actaagtttg agcttcaaga caaatgcata 3240 caaatttttg aatatcttcg agaaaaactg atttcgtcac cagtattagc tctgtataat 3300 ccaaatcgag aaacagaatt gcattgtgat gcaagtagtg aaggttttgg tggaattttg 3360 ttgcagaaac aagatgacaa taagtttcac ccaacagcat atttttcaca gagagcgaca 3420 aaactagagg caagatacga aagtttcaag ctggaaacgt tagctgtgat ctattcattg 3480 aggaaatttc gaatttattt agaagggatt ccgtttagaa ttataacaga ttgtaactca 3540 gtagttttat gtttaggaaa gggacgtctt cattcaaata tcgccagatg ggccatagag 3600 ctagcaaatt ataattacac tttaaaacat cgtagtggaa aatatatgtc ccatgtggac 3660 accttaagta gatatccttc agcgcttaat caagattttg aatatgaacc aatagtgaca 3720 attcatgaaa gtgataacaa agaccgaata gtttccgtga ttgagtgtga cgacattaat 3780 cttcagcttc agataacaca aaacagggac aaccatatag gaaaattaaa agaacaattg 3840 gagcatggac ctgttaagaa attttcattg agcgatggaa ttgtttataa aataggtgag 3900 aacgacgttg agctgtttta tgtcccagct gaaatggaaa cgcaaataat cagaagagtt 3960 cacgaaaact tatgtcactt aggaactgag aaatgtatga aagaaatctt aagacattat 4020 tggttcccga atatgagatc taaaatcgat ttgtttatag gaaattgttt aaattgtatc 4080 atgttttcag tccccacaca tgcgaacaat cgaacacttc acaatattcc taagcgacca 4140 atcccattcg acaccttaca cgtagaccat tttggaccac ttcccgctgt tatatcaaag 4200 aagaagcatg tattagttat cattgacgcc tttacgaaat ttgtaaaatt gttttctgta 4260 aatacgacaa gcactaaaga agtcaacaca tgtttttcaa agtattttca attttatagt 4320 cgtccacgta gaattattag cgatagagga acgtgtttta catcattaga atttagtaaa 4380 tttttgttag aaaataatat agagcacgtg aaagtggcaa ctgcttcacc ccaagcgaat 4440 ggtcaggtag aacgggttaa cagaactatg aaagctatgc tcgccaaaat aactgagcca 4500 ataaatcacg aggattggag caaactttta ataaaggctg aatatgcaat taataactca 4560 gtccattcga ccacgaaaga attgccttca gtgttgttat tcggagtcca acagaggggt 4620 tgtaatattg acgcgttaac agaatactta gatgataaac aagaccgaac tcaaagcgat 4680 ttagaatcag tacgtaagaa atctttggat aaaatcgagc attgtcagca gaagagtgaa 4740 gagtattttg aaaggaatca caaaaatcat attgagttct tggaaggcga tttcgttgta 4800 atgagaaaca ttgacaccac aataggtaca aataaaaaat ttgttccgaa gtacaggggt 4860 ccgtacatca tcaagaaggt attgccgaac gacagatatg tgttaaatga cattgaaaac 4920 tgccaaataa gccaaatccc gtatgagggc attatagagg catgccgatt aaggaaatgg 4980 gcagattggc gtaatcagtg tgaaagtgac ccctacttaa atagttaaca aaatatatat 5040 atacacatat aattaaaatt actcttaatt acatatttac gatcgagggc gatctaattg 5100 tcaggatggc cgagc 5115 // ID CR1-50_AAe repbase; DNA; INV; 4481 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-50_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4481 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1137-1137 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 852..1406 FT /product="CR1-50_AAe_1p" FT /translation="TSQLLAGVVDKLSKKVDVLASTTVAGGKTKQLLTPSF FT RAWPKPGAKRPRVEQYDSAALTIDRGTRDIDLSDLSIGSIMPTPTPPKFWL FT YLSGFQPLISCDDVQKIVARCLDLSAPCDVIRLVPKGKDVSNMSFVSFKVG FT LDPSARDQALLASTWLDGLVFREFVDQSKNSHRPSIKPVDFNQTPV" FT CDS 1310..4396 FT /product="CR1-50_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MAGWIGVSGVRGPIKKLPSSIDQTCGLQSDSCIMRSI FT SDGGNPRVPITGEYPYSISPPRTLNRFMLQTNTADVXLYYQNVRGVRTKID FT DLLLAAMDCNYDVIMLTETGLNDCINSLQIFGQTFNVFRCDRSCMNSNKTT FT FGGVLIAVNRRLASTQIRIRRGLTLEQVFVEVAVDRNHLMIGVVYIPPDRS FT RDVDLIEEHIASVRELCDHASFDDQILICGDYNQPRLQWLLNADGEVQLAG FT NSSLAPASAALIDGMDFLNMIQANSHRNQHDRTLDLVFRSNDCTPIISEAA FT IALLPVDLHHPPLEMQISIQCERQIDKRLPAADEPALDFSRIDYDALLDYF FT STFDWRCLNACRGVDDMASNFCFVIQQWLKNNVPHRRRPFSPAWSTSLLRK FT LKREKNACQRKYRKEQTAYSKLKFQRASNAYRSLNASLYKSHVLRVQANLR FT LNPKSFWKFVNSKRKDPAIPRNMVYDHVKASSSAECCELFANFFASVFDHT FT ASSTVEAESASSEVPLDFLSLTTFEVTPAMVITAARKLKSSLSPGPDGVPS FT VLLSRCAAVLADPLSKIFTRSLNDGIFPAIWKQSYMFPVFKAGDRENVRNY FT RGITSLSAVSKLFEIIVSEVILSQSKHYVSLDQHGFMPGRSVTTNLLCFTN FT TCITAMEEKAQVDVIYTDLKAAFDRIDHNMLLSKLIRLGATNKLASWLASY FT LIGRQLRVKIDSCVSTPFCNSSGVPQGSNLGPLLFTLFFNDVASTLGVDCK FT IIYADDMKIYLVIRSVEDCHRLQNLIDKFIDWCQLNKLIVSIPKCSVMTFH FT RSKRPLLFDYRMNGSRLRRVDEVIDLGVTLDSKLTFNNHYASIIAKANRQL FT GFISKIAKDFTDPYCLKSLYCALVRPILETASIVWTPNDVSWTLRIERVQR FT RFIRMALRHLPWRDPLNLPAYPARCKLLNMDTLEKRRKIQQAMFVAKLLNN FT EIDCPRLLSKLDIRAAQRPLRQRSLLQGRFHRTTFGYNEPVSSMIRSFTMV FT EDFFEFAETTVCFKNKITRSSIL" XX SQ Sequence 4481 BP; 1202 A; 961 C; 988 G; 1327 T; 3 other; tacgtcacta gctgtggtgc acactgtgtt tttatccgtt gttttttttt acgtttttct 60 gtttattaat tcggttttaa atactcatcc attatcgtac tagtagttta atcgaactgt 120 gatcctacca tacattggac aaatcgaaaa cgatctcgga attgcttaat ttctagagaa 180 actcttgtcg aatttaccgg cttaccagtt gtggttatga cgatggagaa gaaacagtgc 240 aaagaatgcc aactggacgt gaacgatatc gaaccgatac gttgtggctt ttgtgatgcc 300 gttttccaca ttagtcaaca atgctgtgga ttcaattacc gtaccaatcg agacgttttg 360 tcgcaaggga agctgatgtt tatctgcgcc gattgccgat ctgagttgga tggccgaagc 420 gtgaaacgct acctggagga taagatgcaa tcacaaatga tttctcctac tggtgtgact 480 gcgtctgata atttatcatc ccaagtacaa ttgggttgtt tagcgatgaa aaattcacgt 540 tttcaagtag cttgatcgaa agaccacatt tttcttagtg taacacgatt ttattgcgct 600 tcaatatcgt aattttgata tagtaaacaa tgaaaaaaga ttttattccg tcgactcaat 660 gacatttcaa tttacataat gacagctcgt cccgaagcaa aatttgccga ggggtgattc 720 aaacgcgatt tccatactaa gttcaaacgt gttttaaaaa tagttccagg gcacggacaa 780 ttacgaaact ttggattctg gctcagtttt taacgtagaa tcagattatg taaaaaacag 840 ggtaccccta aacaagccaa ttgcttgctg gagtggtgga caaactgagc aagaaagttg 900 atgttctcgc ttcgactaca gttgcwggag gtaaaactaa gcaattgcta acaccttcct 960 tccgtgcgtg gcccaaacca ggagccaaac gtcctcgtgt ggaacaatat gactccgctg 1020 ccttgactat cgatcgtggt acaagagaca tcgacctcag tgatctttcg attggttcga 1080 ttatgcctac tccaacgccg ccaaaattct ggttgtattt atccggattc cagccactca 1140 tatcttgcga tgatgtgcag aaaatagtgg ctcgctgtct ggacctatca gcaccgtgtg 1200 atgtaatccg tttggttccg aaaggcaaag atgtctcgaa catgtcgttc gtttcattta 1260 aagttggtct cgatccgtcg gcgcgcgatc aagcgttatt ggcatcaaca tggctggatg 1320 gattggtgtt tcgggagttc gtggaccaat caaaaaactc ccatcgtcca tcgatcaaac 1380 ctgtggactt caatcagact cctgtataat gcgctctatt tcggacggtg ggaacccwag 1440 ggttcccatc acaggcgagt atccgtattc aatatctccc ccacgtacac tcaaccgttt 1500 tatgttgcag acgaacaccg ccgatgtawc cctttactat cagaatgttc gaggtgttag 1560 aacaaaaatc gacgatttgc ttctggctgc tatggattgc aattacgacg tcattatgct 1620 gacggagact gggttgaacg attgcatcaa ttccctgcaa atatttggac agacgttcaa 1680 cgtttttcga tgtgaccgca gctgcatgaa tagcaataaa acgacctttg gtggtgtgct 1740 gatagcagtt aaccgtcggt tggctagtac acaaattcgc atacgtcgcg gtttaacgct 1800 ggaacaagta tttgttgagg ttgctgttga ccgtaatcat ctcatgattg gagtagttta 1860 catcccacca gaccgtagtc gagacgttga cttgattgaa gaacatatag cttccgttcg 1920 agaactttgt gaccacgcat cctttgatga ccagattttg atatgcggag attataatca 1980 gcctcgtctc cagtggttac tgaacgctga tggtgaggta caacttgctg gaaattcatc 2040 tttagctcct gcgagtgctg ctttgatcga tggtatggac tttttgaata tgatccaagc 2100 aaattcccat cgcaatcaac atgatcgaac actggacctt gtatttcgct cgaacgactg 2160 tactccaatc atcagtgaag ctgcgattgc attgttgcca gtagatttgc atcatcctcc 2220 gctggaaatg cagatttcca ttcaatgtga acggcagatt gataaacggt tgcctgctgc 2280 tgacgaaccg gctcttgatt ttagtaggat tgactacgac gcacttctgg attatttttc 2340 tacttttgat tggagatgcc taaatgcttg ccgaggtgtc gatgatatgg cttcaaactt 2400 ctgtttcgtc attcaacaat ggctaaaaaa taacgttcct caccgaagac gtccgttctc 2460 gcctgcctgg agtacgtcac tgcttcgaaa attgaaacgt gagaagaatg cgtgtcaacg 2520 taaatatcgg aaagagcaaa ctgcgtacag taagttgaag ttccaacgag ccagtaatgc 2580 gtaccgtagt ttgaatgcat cgctgtacaa atcccacgtg ctgagagtgc aagctaatct 2640 tcgtttgaat ccgaaaagtt tttggaaatt cgtgaacagc aaaagaaaag atccggccat 2700 ccctagaaat atggtatatg atcatgtgaa ggcaagctcc tctgctgagt gttgtgagct 2760 atttgcgaac ttttttgctt ctgttttcga tcatactgct agttccacgg tggaagctga 2820 gagtgcttcc tcggaggtcc cgctggattt ccttagtctg acgactttcg aagtaactcc 2880 tgcaatggtt atcactgctg ccagaaaatt gaagagttca ttgtcccccg gacccgatgg 2940 cgttccatca gtattgctca gtcgctgtgc agctgttctg gctgatccgc tttcgaaaat 3000 tttcactcgg tctctcaatg atggtatatt tccggccatt tggaaacaat cgtacatgtt 3060 tccggtcttc aaggctggag atagggaaaa tgttagaaat tacagaggaa ttacaagctt 3120 gtcagcagta tcgaagctgt tcgaaattat cgtcagtgaa gtcattctta gtcagtcgaa 3180 gcactacgtc tcactcgatc agcatggttt tatgcctgga agatctgtca ctacgaatct 3240 gctttgcttt actaatacat gtataactgc tatggaggag aaagcacagg ttgatgtgat 3300 ctacaccgat ttgaaggcgg cctttgaccg aatcgatcat aatatgctat tgtcgaagct 3360 tattcgcctt ggtgcgacta ataaactcgc atcatggctc gcctcctatt tgatcggaag 3420 gcagctacgt gttaaaatcg attcctgcgt ttcgacccca ttctgtaatt cgtcgggtgt 3480 accccaagga agcaatttgg gtcccctatt gtttacactg tttttcaacg acgtcgcttc 3540 gacattaggt gtggactgca aaatcatcta tgctgacgac atgaaaattt atctggtgat 3600 tcgatcagtg gaagactgcc atcgtctgca aaatctgata gacaagttta ttgactggtg 3660 ccaactgaac aagttaattg tcagcattcc aaagtgctct gtaatgacat ttcaccgctc 3720 gaaacgacca cttttgttcg actataggat gaatggatcc agacttcgta gggttgatga 3780 agtgattgat ttaggagtga cgttggactc gaaattgacc ttcaacaatc actatgcgtc 3840 aataatagcc aaggcaaatc gacagctggg cttcatatcg aaaatagcca aagatttcac 3900 agacccgtac tgtttgaaat cactatactg cgcgctggta cggccaattc tcgaaactgc 3960 ttcgattgtt tggacgccaa atgatgtttc ctggacattg aggatcgaac gtgtccaacg 4020 aaggttcatt cggatggccc tgcgtcatct accctggcgt gatccactga atcttcctgc 4080 atatcctgct cggtgcaagc tactcaacat ggacaccctt gagaaaaggc gtaaaattca 4140 acaggcgatg tttgtagcta aacttctgaa caatgaaatt gattgtccac gactgctttc 4200 caagctagat attcgggctg ctcaacgacc tctccggcag cgatcattgc tacaaggcag 4260 atttcaccga actacgtttg gatataatga accagtgtcg tcgatgattc gctcattcac 4320 tatggttgaa gatttctttg agttcgctga aacaactgtg tgttttaaaa acaagattac 4380 tcgctcgtcc attttgtaat ttctttgtta ttagttcatt cattaagacc ttgaagtcag 4440 atggataaat attggtaaat aaataaataa ataaataaat a 4481 // ID hATx-23_SM repbase; DNA; INV; 2702 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-23_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2702 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1858-1858 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 552..2447 FT /product="hATx-23_SM_1p" FT /translation="MSKRKSEAWNYFQKVNANEAKCNHCGTILSCKGSSTT FT GLINHLKLHDLLISRNSKCNDTDSEPSTSKTMKLSNSSIKSFLKSQISLSE FT ILAKCAAKDGFSFNAMIHSDAIKGYVTSRNYSMPKSCTSIQKHVLNFFEEK FT KQETKTELIRLLHSKKKFSIAVDEWTDVLMRRFLNVTLRSSNSKEYVLGLV FT EIKEKCTAEKTKELVENLLKEFGMNLETDIVASTNDGASVMVKYCSIITAE FT KQLCYNHAIHLAVLKILYKKVNLDDVVEDEDKTEYDDEELLYDEDDQDEEN FT QVDEEDDGNLVMEMEQNSECFLLVSDIKSALYAMRKVIKFFKNSSIRTEIL FT LKHVRQKEGKNLRLILDVKTRWNTLASAVNRFLKLIECVNNALEELGLQKF FT DEKYILVLKSISITLEPTMLAVQELSKKDANLITAEGTILYVMKKLKNIDT FT ELSLNLLEILKNEIGKRRNRDMVSLLIFLHQGTYVKPNENEFLNYSSKSSL FT KNYAKQLFLRLFPHNEFENREVAEDLETEIIHLEEDELTEYENLQNCIQSL FT ARPKQVKENEFNIEKEFKNLQGNRGKRSDDMDCLYNALLTVAPTSTSCERV FT FSVSGNKMTKIRSRLGVNTLNALVFLKYYFNQNV" XX SQ Sequence 2702 BP; 1043 A; 325 C; 438 G; 896 T; 0 other; gggtttccat tccgagtact cggttactcg acaaattcgc caatttatcg agttttcgaa 60 tactcggtta atggaatcga gtactcgagt tttaataaaa caattttttt tgtaaaatat 120 aacttaaatt tgatgcatat aaatgagacc atactaaaaa actacgatta aaaagggtat 180 cgaattttca taatatataa acactgattc gccatctgtt gaaagattga tttaacataa 240 ttcataaaac ttgtttagtg tttctataac aaataagatg gccgtaagaa cgtttaaaat 300 attttctggt tttcggtcaa tgttacctgg ccagtcaatg ttaggcaatc tgacggttat 360 tttattctta ttatgtcagt gtcaatgtct taaataagta aacaatattt ttattatatt 420 attataatta aaatcaaaca ataattttat tattattcga taaattttca actttaatag 480 tttttttata aggttaagtt ctaatagttt tatttaatca ttttttaaca tattaattac 540 ccataggcat aatgagtaaa agaaaatcag aagcttggaa ttattttcaa aaagtcaatg 600 caaatgaagc taaatgcaat cactgcggaa caattctaag ctgtaaggga agcagcacca 660 ctggattaat aaaccatttg aaattgcacg atttgcttat ttctagaaat agtaaatgca 720 atgatactga ttctgaaccg tcaacatcta aaacaatgaa gttatctaat tcttcaatta 780 agagcttttt aaaatctcaa ataagtttat cagaaatttt agcgaaatgt gcagcaaagg 840 atggattttc atttaatgct atgattcatt cagatgcaat taaaggctat gttaccagcc 900 gtaattatag catgccgaaa agctgtacta gtattcaaaa acatgtatta aacttttttg 960 aagaaaaaaa acaagagacc aaaactgaat taataagatt attgcattca aaaaagaaat 1020 ttagtatagc agttgacgag tggacagatg ttttaatgcg gagatttttg aatgtgactt 1080 tgcgctcttc aaattcaaag gaatatgtcc ttggactagt tgaaattaaa gaaaaatgta 1140 ctgctgaaaa aaccaaagag ttagtggaga accttttgaa agaatttgga atgaatttag 1200 aaacggatat tgttgcctca acaaatgatg gagcgagtgt aatggtgaag tattgttcaa 1260 ttattacagc agaaaaacaa ttatgctaca accacgcaat tcatctagct gtccttaaaa 1320 ttttatataa aaaagttaat ttagatgatg tagttgaaga tgaagataag actgaatatg 1380 atgatgaaga acttttatat gatgaagatg atcaagatga agaaaatcaa gttgatgaag 1440 aagatgacgg aaatttagta atggaaatgg aacaaaattc ggagtgtttt ttattggttt 1500 ctgatattaa atcagcgtta tatgcaatgc gaaaagtaat taaatttttt aaaaattcat 1560 cgattcgtac tgaaattcta ttaaagcatg tacggcaaaa agaaggcaag aacttaagat 1620 taattttaga tgtaaaaact cgttggaata cattggcatc agctgtaaat agatttttaa 1680 agttaattga atgtgtaaat aatgccttag aagaacttgg tttgcaaaag tttgatgaaa 1740 aatatattct tgttctaaaa agcatatcaa ttacgcttga acctacaatg ttagcagttc 1800 aagaacttag caaaaaagat gctaacttaa taacagcaga agggacaatc ttatatgtta 1860 tgaagaaact taagaatata gatactgaac tatcactaaa tttattagaa atattaaaaa 1920 atgaaattgg gaaacgtaga aatagagata tggtgtccct acttatattt ttacatcagg 1980 gaacttacgt taaaccaaac gagaatgagt ttttaaatta ttcatctaaa tctagcctta 2040 aaaattatgc caagcaacta tttctaagat tgtttcccca caatgaattt gaaaatcgtg 2100 aagttgcaga agatttagaa actgaaataa tacatttaga agaagatgaa ttaacggaat 2160 acgaaaattt acaaaattgt attcaaagtt tggctagacc taagcaagtt aaagaaaatg 2220 aatttaacat tgaaaaagaa tttaaaaatc ttcaaggaaa tagagggaaa agaagtgacg 2280 atatggattg cttatataat gcattgttga ctgtagctcc aacatcaacc tcttgcgaaa 2340 gagtattctc ggtatcagga aacaaaatga caaaaatacg aagtcgacta ggcgtaaata 2400 cgttaaatgc tctcgtattt ttgaaatatt atttcaatca aaatgtatag atatttaaat 2460 aattaatatt ttgactaaaa agtatttttg ttttgactaa aatgtttttc taatgtttaa 2520 tggattaatt taattttata ttatgttata cttgttataa tgaactgttt ttgtttttaa 2580 ataaataagt atttttgata atcttttgtt aagaaattat ttttcctaga aattatttcg 2640 agtactcgag taaccggtta attaaaaaaa ggtactcgac tactcgagta aaatggaaac 2700 cc 2702 // ID Gypsy-35_CQ-LTR repbase; DNA; INV; 489 BP. XX AC AAWU01032713; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_CQ_; KW Gypsy-35_CQ-I; Gypsy-35_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-489 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 450-450 (2011). XX DR Genome; AAWU01032713; Positions 19310 18822. XX SQ Sequence 489 BP; 150 A; 119 C; 102 G; 118 T; 0 other; tgcatagcta attagttgca acccatcatt tactgcttca cacttcaatt ttaatttatt 60 gccctgcccg gctgagtgag ccacacacag ccaggcccag gactcaaatt acataacaat 120 cggttcaata tgcgctggca taactgctgt gcttgcgcat gggactgttt cgggttgtac 180 aatttttcgg gttttacggc aaagtggcca tgcgctggat aatgctgatc cagcgcaaag 240 gacacttcgg gccataaaca aagagagcca actccagatg tctccaatga gtaaactcta 300 caaatacgat gtaagaaatg taaccaaact catataaagg gccaggagcc accaaataaa 360 gagattcatt cttagaacac agaccagaac tgtacagttt agggatcgct cccaagccag 420 gcttgaggac accgatctac ttcggatgac agcctatggt ccaaaattct aagacccttg 480 aagtaatca 489 // ID I-11_AAe repbase; DNA; INV; 5409 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-11_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5409 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1366-1366 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 210..1352 FT /product="I-11_AAe_1p" FT /translation="MDGDGEDDGTEKSEPQSRSPRIREYPVGAKGPFIVYF FT RKIAIPLKVLLISSELYSKFTSIKELKKINEFKVRVVLSKREEANEIVKMD FT RFEGVYRVYIPCEDVEVDGVIYDEVLTPEEVMQLGEGRFRNRALPSVPVLD FT CERLTMMNADKSNRVHSNALRITFAGTMIPDFVNVSNVFIPVRVYTPKVML FT CSRCNMFGHTDKYCSNKVKCVKCGQDHESSKCPRNTNTCILCKGSHDSLTT FT CKAYQEVKKQVRNKLVARSKASCAELLKAVNDPAGGARARGGGRPPGHSLR FT QICSPRSKTIVRPVHLRMTFKLKRIAVTDHRESVRLTLWRQIVNPRRLNKR FT KVSLRRQNIFHLCNPQKISQGSRRLTSLRLLSTLFRKF" FT CDS 1356..5195 FT /product="I-11_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="SHCVVCLASTQHGVSLWNLLPRYYRRSGKLLNLFCPS FT SRHSSNSSAVGMECNNTQRALSFLQWNARSIIRRLPGLNALLAEYEVDVFC FT ICETWLTEANFLNIPGYHILRKDRSEPYGGVLIGVRHGIEFHSLPVTTPCP FT IEVVVCSIKVKDFTCSVLSVYIPPNCSFSAESMRQIINSVPQPCIILGDFN FT AHGVAWGSLRDDGRSKIILDIIDDFRFSILNDGTGTRINLQSNNPSCLDLS FT LCSSSIALNCTWEVVNDPFGSDHLPIVIKYCKSGQVLQDRCSLYDLAKHVD FT WQTFSLKVTELINKTQDNRNLNETYSRFARGITNALRNSQTKPVPTVHTFK FT RSPTIWWDEECTNALQAKSRMFKTFRRTGYREDFIEYKRLEALFVRITKKK FT KRGYWKSFVENLDKDTALATLWSTARKLRNSSYCSSSVDSYSEEWVSRFAQ FT KICPDFVAPENEYISAIDTGTSVDTTAAPFTIEEFEFALGSSNNTCPGLDG FT IKFTVLRKLPLIAKTFLLDLFNKFVAYNVCPDEWREVKVVGILKPSKDPSS FT EDSYRPICLLPCPRKLLEKMILNRLEPLLEKSGVLSKTQYGFRRGRGTRDC FT LALLATNVQIAFNKKEELAAVFLDISGAYDSVLIDKLCSKLHFLGVSSLIS FT NFLLNLLSKKKLSFYLNNVFKTSRDSFYGLPQGSCLSPFLFNVYTNDMDDC FT LADKCFLVQYADDSVIWVAGKDKIVMQTSLQQTIDNLILWGDSHGFSFSND FT KTEIMIFSRKRSPTSIKVFLYDTEVKLVFESKYLGVWFDPKMTWKMHIHYI FT VKKCQKRVNFLRTITGTWWGAHPKLLLTLYKTTILSVLEYGSFVFDQAAKT FT HMLKLCRLQNRCIRISLGLMQSTHVQSLEVLAGVPPLPLRFFELGCRFVIQ FT SFESKNEVERVLDKLFEINPSSKMLASYRYAITMDMDKTPSHKYFDYAIEI FT HTFPLNVDTSLHAMVRDTPDWLMPKVAASALRVKLKNVKNVQVFYSDGSKS FT GIKAGFGIYHNQKGYSFSLNQPCSVFLSEVLAIFHCCQLIQQNAPSRYFIC FT SDSMSALSALGTNRFDSRASLFILKIKEFVYKLSRIGYTIIFLWVPAHSSI FT HGNEEADALAKEGASNGPLYQREIQASEYFAQIKEKIRLDWQKSWSEDQLG FT RWCFSIFPSVSFKPWFDNLPVTRNFIRNMSRIISNHYGCKSHLFRINILQE FT NKCSCEKDYEDIDHIVWSCERHTAMRNTLINSLSRNNLPIGLPIRDLLGSR FT NIPYLKYVNEFLQTIEQKI" XX SQ Sequence 5409 BP; 1564 A; 1090 C; 1128 G; 1625 T; 2 other; attcttcgct catcagttgt tgcaggcaga cgtgtttttt catttctccc gtgttcaata 60 gtgtgaagcc gttctattgt tagcagacca ttgttggagt tgcatcgatt tcaagctaag 120 tattgttcga tccatagttg atttgccgag acaacagaga gcctttttta ttagaactat 180 ttgctaaatt ccgctcctcc atttttgcaa tggacggaga cggggaagat gatggcactg 240 agaaatctga accacaatca aggtcgccta gaatccggga atatccggta ggcgcaaagg 300 gccctttcat agtctatttt cgaaagattg ccattccgct caaagttttg ttaatctcgt 360 ctgaattgta cagtaagttc acctctatca aagaattgaa gaagattaat gagttcaagg 420 ttagagtcgt tctctcaaaa cgagaggagg ccaacgaaat cgttaaaatg gatcgctttg 480 agggtgttta cagggtgtac ataccatgtg aagatgtcga agtcgatggc gtcatttatg 540 acgaagtact cacgcctgag gaagtcatgc agctcggtga aggcaggttc cgtaataggg 600 ccctsccttc cgtgcctgtt ttggactgtg aacggctcac catgatgaac gctgataaat 660 ctaacagagt tcactcgaat gcattgagaa tcacatttgc tggaactatg attccagatt 720 tcgtgaatgt cagcaacgtt ttcattccgg ttcgtgttta cactcccaaa gtcatgctct 780 gcagtcgctg caacatgttt ggtcatacag ataaatattg ctccaacaag gtaaagtgcg 840 ttaaatgtgg tcaagatcac gagtcctcca agtgccccag gaatacaaac acatgtattc 900 tttgcaaagg cagccatgat tctctcacaa cgtgcaaagc ataccaagaa gtgaagaaac 960 aggttaggaa caaactggtt gctcgcagta aggctagttg tgctgaattg ttaaaagcag 1020 taaacgatcc ggccgggggg gcccgggcgc gggggggcgg gcgccccccc ggccattccc 1080 tacgtcaaat atgttcaccc cgctcgaaaa cgattgtgcg tccggttcat ttgagaatga 1140 ctttcaaact caagcgcatt gcagttacag accaccgaga aagcgtaagg ttaactctgt 1200 ggcgacaaat agtaaatcca cgtcgtctga acaagcggaa ggtttcgctt cgtcgtcaaa 1260 acattttcca cctctgcaac ccacaaaaga tatcccaggg ttcaagaaga ttgaccagcc 1320 tgaggttgtt gtcgactctg tttcggaaat tttgaagtca ttgtgtggtc tgtttggcgt 1380 caacccaaca tggtgtaagc ttgtggaatc tgttgccccg gtattatcga agatctggga 1440 agctcttaaa tctcttctgc ccctcctctc gccactcctc gaattcttcc gctgtaggaa 1500 tggagtgtaa taatactcag cgcgcgttat cgtttcttca gtggaatgct agaagtatca 1560 tccgaaggct acctggctta aatgctttgt tagctgagta cgaagttgac gtattttgca 1620 tttgtgagac atggttgact gaggctaatt tcttgaatat tcctggttat catattcttc 1680 gtaaagatcg ttctgagccg tacggtggag ttttaatcgg cgttcgtcac ggtattgagt 1740 ttcattccct ccctgttacc accccttgtc caatcgaagt cgtcgtttgc tcaatcaaag 1800 tgaaagattt tacatgctcc gttctatctg tctatattcc tccgaattgc agtttcagtg 1860 cagaatcaat gagacaaatt attaattctg taccgcaacc atgcataatc ctgggcgatt 1920 ttaacgccca tggagttgcg tggggatcgt taagagatga tggtaggtca aagattattt 1980 tggacataat cgacgatttt cgcttcagca ttttgaatga cggcactggc actcgtatca 2040 atcttcaatc aaataaccca tcttgtttgg atctttcact atgttcatcc tcaatcgctc 2100 taaattgcac gtgggaagtc gtaaatgatc cgtttggtag tgatcatcta cctatcgtaa 2160 tcaaatactg taaatcaggg caggtactac aggatcggtg ctcattgtat gatttagcaa 2220 aacacgttga ttggcaaaca ttctcgctca aagtcactga attgattaat aagacccaag 2280 ataatagaaa tctcaatgaa acttatagta gatttgctcg aggcataact aacgcgctcc 2340 gtaactctca aacgaagcca gttccaactg tacacacatt caaaagatct cctacaatat 2400 ggtgggacga agaatgtact aacgctttgc aggccaaatc tagaatgttt aaaacatttc 2460 gtcgaactgg ttacagagaa gattttatcg agtataaacg gttggaagct ctatttgtaa 2520 gaattacgaa aaagaaaaag cgtggatatt ggaagtcttt tgttgaaaac ttagataaag 2580 atactgcctt agcaacattg tggtcgactg ctcgtaagct tagaaatagt agctactgct 2640 cttcttctgt agacagttac tcagaagaat gggtgtcacg ttttgctcaa aaaatttgcc 2700 ccgatttcgt cgcacctgaa aatgaataca tcagtgcgat tgacactgga acttctgtag 2760 atacgactgc cgcgccgttt acgattgaag aatttgagtt tgcactagga tcatcgaaca 2820 atacttgccc tggattagat ggtataaagt tcaccgtttt gcgaaaactg ccgttgatag 2880 cgaagacatt cttacttgat ttattcaaca aattcgttgc ctataatgtg tgcccagatg 2940 aatggcgtga agtcaaagtt gttggaattc tgaagcctag caaagatcca tcttccgaag 3000 attcttatcg cccgatttgt ttgttgccat gtccaaggaa attactggag aaaatgattt 3060 tgaatcgctt agaacctttg ttagaaaaat caggtgtact ctctaaaact caatatggat 3120 ttcggcgtgg tagaggaacc cgagactgcc ttgctcttct agcaacaaac gttcaaattg 3180 cgttcaataa gaaagaagaa ttagccgcgg tcttcctcga catttctggc gcttatgatt 3240 cggtactgat cgacaagctt tgttccaaat tgcattttct tggggtttct tcgctgattt 3300 caaatttttt gctcaatctt ctctcaaaaa agaaattgtc attttatttg aacaacgtct 3360 tcaaaacatc aagagatagc ttctatggtt tgccgcaggg atcatgcttg agcccgtttc 3420 tgttcaacgt atacacaaat gacatggacg attgtttagc agataaatgc tttctcgtgc 3480 agtacgccga tgatagtgtc atttgggtag ctgggaagga taagatagtc atgcaaacct 3540 cacttcagca aactatagat aatttaattc tctggggtga ttcgcacggt ttctcgtttt 3600 caaacgacaa aacggaaata atgatatttt cacgaaaacg ttcaccaaca agcataaaag 3660 ttttcctgta cgacacagaa gtgaagctag tgtttgaatc taaatatctc ggcgtctggt 3720 ttgatcctaa aatgacctgg aagatgcata tccattacat agtgaaaaaa tgtcaaaaac 3780 gggttaattt tctaagaacc ataacgggaa cttggtgggg tgcccatcct aaacttctgt 3840 tgacacttta taagactaca atattatctg tacttgagta cggtagcttt gttttcgacc 3900 aagctgcaaa gactcacatg ttgaagttgt gtagacttca gaaccgctgt atcagaattt 3960 ctctaggctt gatgcaatca actcatgtcc agtcgttaga agttctagcc ggagtccctc 4020 cgcttccgct tagatttttt gaattaggtt gtcgatttgt tatccaaagt ttcgaaagta 4080 aaaatgaagt tgaacgtgtt cttgacaaat tgtttgaaat aaatccttca agcaaaatgc 4140 tcgcatcata tagatatgct attactatgg acatggacaa aacgccttca cataagtatt 4200 ttgattacgc gatcgaaatt cacacttttc ctctaaacgt tgatacatct ttacatgcaa 4260 tggtacgtga tacacccgat tggttgatgc ctaaagtcgc ggcctctgcg ctacgagtca 4320 agctaaaaaa cgtgaaaaat gttcaagtct tctattcgga tggttctaaa tcaggaatca 4380 aagccggttt tggaatttac cataatcaaa aaggatattc gttcagtttg aaccagccat 4440 gttcggtgtt cttatccgaa gtactagcga tttttcactg ttgtcagctg attcagcaga 4500 atgcgccttc taggtacttc atttgttcag atagtatgag tgcgttaagc gcactcggta 4560 ctaatcgttt tgatagtagg gcgtcccttt ttattttgaa aatcaaagaa tttgtctaca 4620 aactatctcg catagggtac acaatcatat ttttgtgggt tccagcacac agttccatac 4680 atggaaatga agaagctgat gcacttgcaa aagaaggcgc ttcaaatgga ccgttatacc 4740 agagagaaat ccaagcatcg gagtatttcg cacaaattaa agaaaaaata cgtttagact 4800 ggcaaaaatc ttggagtgaa gatcaattag gaagatggtg cttctctatt tttccatcag 4860 tttcgttcaa gccatggttc gacaatcttc ccgtaactag aaattttata cgaaacatgt 4920 ctcgcattat ctccaaccat tacggatgca aatcacatct ttttcgtatt aatattctac 4980 aggaaaacaa atgctcgtgc gaaaaagact acgaagacat agaccacata gtttggtcct 5040 gcgaacgtca tactgctatg agaaatacgc ttatcaacag tctatcgagg aacaatttgc 5100 ctattggatt gccgattcga gacctgctag gatcccggaa cataccgtat ttgaaatatg 5160 tgaacgagtt tcttcaaact attgaacaaa agatttgatt ctcatattga tcaaatgatc 5220 aacgctgtta taatctattc taatattcca ctcgttttct aaacttacta ctttttccat 5280 aatcaataaa tattttatga agcacggctt tgtgatggtg ctaattgtcg ccgaatgagc 5340 cagttagatt aaggtttatg wttgtatatg taatgttttg ttttgtttca ataaaaaaaa 5400 aaaaaaaaa 5409 // ID Gyp1c_Cis_LTR repbase; DNA; INV; 532 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gyp1c_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-532 RA Smit A.F.; RT "Gyp1c_Cis_LTR - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000068. XX SQ Sequence 532 BP; 164 A; 116 C; 111 G; 141 T; 0 other; tgtatgtagc gacctcatat gtttacataa gatttatgat ggccgcagac gacatctgcg 60 ctgacattgc gtcatcactg aactattatt accgcagctg acgaataacc ttgtctatat 120 aagtttattt cactttggct ccagacgatg gaaggctcct aacggagagc tcctagagag 180 agtcctaacg cattaagtaa tattcttgta tgtgaagaat actgagagtt ttatctgaaa 240 tctatatcgt tgcaacatct aacgatttac tcgtaattgg tactaagtta atcgcctcaa 300 ttgctcaata caattgatgt tataaaacta ctgaggcgtt tatcgacgaa acacagcaga 360 tagtcaagtc aaagagtcgt tacgcagaaa acctcaagta attagcgcac ctccgagagt 420 cgagatagcg cgtagggaat cgcagatcgc atcaggtgtt gtaaccggct tacctggggc 480 ctacaacgaa caatctaaca aaataggcag cgacctgcaa cggctcccta ca 532 // ID EnSpm-12_HM repbase; DNA; INV; 5604 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5604 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 383-383 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 179..2134 FT /product="EnSpm-12_HM_1p" FT /translation="MNSTERVRKHRQRNAFLQSAIEIITTSSASENDIVSV FT QGSNPHEVSPDDYQHDTSELNETLSDDYYSTNIMEDSDSDSVISVELDEDP FT SDKNVEDDLQAELAVWSSSSMCKRDSLNKLLAILRKNGHPELPKDSRTLLN FT TPRSIASYSKCGGDFFHLGIESGIKRTLDSTVITNENTQNIALIVGIDGIP FT LFKSSSVELWPILCRFNSLKPFIVSVYTGNKKPSFINDFLKDFLVEYQLLN FT TVGFLYENHRYTVSIKAFVCDAPARAFIKCIKSHTGYYSCERCIIKGAWHD FT GRVVFQKNNCSARNDLDFSLLKYQHENDEHQNAKSPLIDYNLPCVSGFVLD FT YMHLVCLGVMKXILNFLTDGPRLCKLSIQQINXVSESMVSLNNEMPSEFAR FT QPRSLEYVKRWKATEFREFLLYSGLVVLRGVVKKEVYTHFLYLSIAIRIML FT SSNAEYRNSLLEYAHHLLVYFVSTTKNIYGNTFNTYNVHNLIHLQDDVAGY FT QLPLDAISAFAFENHLQILKKLVRNATNPVSQIIKRAHELESCGAMYSEKF FT LYTKISIKHKDSWFLLKCGKYANIKEINDNNIYMCETYRKDSMESFFLEPC FT DSKNFNIALLRERTLNLTRTLIERDDILYKVVVMPYGDNGLVFIPLVHNVN FT WK*" XX SQ Sequence 5604 BP; 1866 A; 779 C; 875 G; 2080 T; 4 other; cccagccggc acacgacgca atttcgacgt cgatataaag tcgatataaa gtcgcgacgt 60 cgagcaactt gatttcgacc cggtttagac caaaatcttc gaaattgtcc agaaatcatt 120 gcgccataaa aaagtatata accacgaagt gttaaaagtt tttaagtaag ttaaaaatat 180 gaattccaca gaaagagtta gaaaacacag gcagagaaat gcttttttac aatctgcaat 240 agaaattatt actacatcat cagcttctga aaatgatatt gtttcagttc agggcagtaa 300 tccgcatgaa gtttctcctg atgattatca acatgatacg agtgaattaa atgaaacact 360 aagtgatgat tattattcta ctaacataat ggaagattca gactctgatt ctgttatcag 420 tgttgaacta gacgaagatc caagcgataa aaatgttgaa gacgacctac aggctgaact 480 tgctgtttgg agttcttcaa gtatgtgtaa acgagattcc ctaaacaagc ttcttgcaat 540 tcttcgcaaa aatggacacc cagaacttcc aaaagattcg cgaacattgt taaacacacc 600 aaggtctata gcttcttatt ctaaatgtgg tggagatttt tttcatttag gaattgaatc 660 tggaattaaa aggacacttg atagtacagt tataactaat gaaaatactc aaaatattgc 720 attaattgta ggcatagatg ggattccttt gtttaagtct tcaagcgtag aattatggcc 780 aattctttgt cgttttaaca gtctaaaacc ttttattgtt tcagtataca caggtaacaa 840 gaagcctagt ttcattaatg attttttaaa agattttcta gttgaatatc agttattgaa 900 cactgtagga tttctctatg aaaaccatag gtacacagtc tctattaaag catttgtatg 960 tgacgcacct gctagagcgt tcattaagtg tatcaaaagt catactgggt attattcgtg 1020 tgagcgatgt ataattaaag gtgcatggca tgatggacga gttgtttttc aaaaaaataa 1080 ttgttctgca agaaatgatc ttgattttag tctgttaaaa tatcaacatg aaaatgatga 1140 acatcaaaat gctaaatctc ccctaattga ttataatttg ccttgtgttt ctgggtttgt 1200 cttagactat atgcatttgg tttgtttagg tgttatgaaa craattttaa actttttgac 1260 tgatggtcca agattgtgta aactatcaat tcagcaaatt aatygtgtat cagaaagtat 1320 ggtatcatta aataatgaaa tgcctagtga atttgctcgt caacctcgat ctctagaata 1380 tgtaaagcga tggaaagcaa ctgaatttag ggaattcctt ttatatagtg gtcttgttgt 1440 acttcgaggg gttgttaaaa aagaagtcta cacacatttt ctttatctgt caatagctat 1500 tagaattatg ctgagttcaa atgctgagta tcgaaattct ctgctggaat atgctcatca 1560 tttgcttgta tattttgttt ctaccacaaa aaatatttat ggcaatacat ttaatacata 1620 caatgttcat aatctgattc atctccaaga tgacgtagct ggttatcaac taccacttga 1680 tgcaatttct gcatttgctt ttgagaatca cctccaaatt ttaaagaagc ttgtccgtaa 1740 tgccacaaat cctgtaagtc aaattatcaa gcgagcacac gagttagaaa gttgtggagc 1800 tatgtattct gaaaaatttt tgtacactaa aataagtata aagcataaag atagctggtt 1860 tttactaaag tgtggcaaat atgctaatat aaaagaaatt aatgacaaca acatttatat 1920 gtgtgaaacc tatagaaaag atagcatgga aagtttcttt cttgaacctt gtgattcgaa 1980 gaatttcaat attgcacttt tgcgagagag aacccttaat cttacacgta cattaattga 2040 aagagatgac attttgtata aagttgttgt tatgccatat ggtgataatg gtcttgtatt 2100 tattcccctt gttcacaatg tcaactggaa ataaaataat attcattttt gtttctttaa 2160 taatttttta attaaaattt attgttttgt tgcttttaat tccactaact attaattaat 2220 aacctttttt ttttttttcc agacaattta ttatttttgg ctaaaaataa aattgacttt 2280 aacttttgca acacttgtat tttgaacaat agttttttta tagaaaaata agtttagtat 2340 aaataagatt ttgttaagct attataattt ttgccttagt aattaaaatt tagtatctaa 2400 ttatgtctta tgctcgtggt gtttggattg aagatggtat ggaaacagaa ggtgttgttc 2460 catgtaactg gataattgat aataaactgt attggcccaa tggtatatct gtgataagac 2520 actttcagct cctaagtata ccagatgtct tgaaatggaa acattataaa tatttgtcaa 2580 caaagtgtgt aggaggttgg tagtttgtca tattctagaa aacctcatta ctttaataat 2640 ttaacttatt ctaatttaaa taactaactg gtaatttatc cttagtaaat tctgaattaa 2700 agttttcaga tttttaattt acaattttga tttttttttt ttaataactt tcagatgaac 2760 aaacatgtca agactatgag tttgtaacaa cccaggagga aaaaagtgct tcagaagaat 2820 cgttaataga aagttctaat gaagaagaca cgggtaggtt ataatatgtg aaaacttaac 2880 ttaatgcttt ttaataattt gatgttatta aagtatattt aaaaaaaatg tatatttata 2940 ataatagggg ttagcttaaa ttatttaaat agctgtcaca aatatttatt tttcaactgg 3000 tttggtacat actttattat cttgttttta agttgtttat ccaactatct ttttcagttg 3060 ttgtacccaa atttccaaat gctatgtctc tgcaatcaat gatttcaaca ccacaactat 3120 ctgcatttaa atcatcttgc agcgttcaaa aattgagaac aaaaccatct attcacagtg 3180 gtgacagatt agaatcatca ggtttcaaaa gcacaaggaa gattgcaata ccctgtaagt 3240 aatacaaagt atagacacta ccttagaaaa aaaaggaact tcttttgtca tagatttttc 3300 aatttttgac atttctgatt actcttgtat gacatttgat ttaaaaaatc tactattata 3360 attgtcatca gcatgctatt atatcaatta cttgttttaa taatatattt ttttaatgtt 3420 ttgctttttt tattttgttt ttctcattta ttattgtttt tttcatttct taaatcgtct 3480 gtttatacta atcaataatt aaattctctg gatcttgtta ccactttttt ttattattta 3540 caattattat ttttgtatat ctattttgat ttatttttcc tattcattac ctttagcacg 3600 gaaactaaat caaacaaaac aacatgtggt gaagcgtagg catggttata actatcctct 3660 aacaaatgag agtaagtttt atactgtttt aattgtttag gactagggca aggtatatgt 3720 ggcaaccctt tttttaaaaa tctttcagga atatggatga atataaatta gcttgagttt 3780 atgctttcat tagaacaaca ttatcctaaa tatagctaat tatgttggtt gttatcattg 3840 cattttagac ataatcactt ttttatatta tttatatttt ttttagtatt tcaaaaaact 3900 gttatcaatc tgttaacaga aatttctatc aaacttgaaa acataacggt ttgtggttgc 3960 aacaactctg caacgactat actgttgcca tcaactaatg aaagctttaa attttcccat 4020 attacttcaa tggaagaact ttgtaaacta aatagtgatt tagataatga tgcaaatttt 4080 acatcttttg taagtaatac tgatttttta agtttgaaaa tttaaagttg atttgaaatt 4140 ggtgtacatt tattattaga catttatgtt aaatgtaaaa gtattaggtt acggtataag 4200 ttctatacaa tgcaataagt tacaaaatac aaataataat tatagttcta aaaataattt 4260 tataacatgt ctattgcata gtttatcagc tttaaaatta ggcataacag tcttcttagt 4320 ttgcttccaa gcagttaagt aacaattaac aaataacaac aacagcagaa agtaacaatt 4380 agcagaaaga aaattttttc tttttagaaa acacagttat cacgaatcgg tggaaagtcg 4440 attaaaacaa tcattaaaaa cattttgcaa aggtatgctg agtttttaaa tgttgcaata 4500 ttgttttgga aataatgttt tttgcagcac tttcttattt ttgcagaggg tatccagggg 4560 aatccaccta atgatattgt gattatgata ttttaaattt aatactattt aataaataaa 4620 taatttttaa cttaaaatat ataagtatta tgtatatagc tttcggtata tctgttttat 4680 tgcagttttt tacccaatgc tacacaagcg ttatgtaata tgtatrgatc tggtgaaaag 4740 tttggattct caaaatctaa catttacaga gcaatagttg gtaagctata aatttttctc 4800 ataattattg ttagtataat catttacact attttaagtt ttgttattat ttggacttct 4860 gtatctgatc ggccataatt aggtatcttg tagataattg tgatttaaca aacaaatttg 4920 tggcataatt tatatctgct tatctgagtt ttgttttaat gtattatggt ctgtgtcaat 4980 ataactcttt tattatgtat tatggtgtac accataatac atttgaaaaa tgagctatgt 5040 tgacacacac atgtgtgata cactcgtgtg tgtcaaaata gctttgatga tacatatatt 5100 ttttgtacca aaaatgttta aaattatttc ttagatgttg ttcttttcaa gaacaaagag 5160 gctactacag ctgaaatatg ccaaataatc atgcttcaac taaaacatgc gcctcggcga 5220 tccaatggtg gaggatcgat tattcgcaag tttcctaatg aagaataatc gttgtcagag 5280 atgctttttt atgctgattt ttatgtatag ctttgctgct aaaagtaatt taaaaagaaa 5340 saatgttaac gaatactttt tgttattttg ctttttcctc agattttcct caaaatacgt 5400 gtcaaatcaa tgtttcttgc tacgtcgaaa aaaagtcgaa atgaacgttt ttgtgttacg 5460 tcgaaacgac gtcgttagtt tcgacgtcgg catttgtttc attttcgacc aaatttcgac 5520 gtcgtttcga cgtcacgatg tttaattaaa ataaaacttt cgacgtcgtt tcgacgtcga 5580 aacgacgtcg atgtgccggc tggg 5604 // ID CR1-82_AAe repbase; DNA; INV; 4308 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-82_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4308 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1170-1170 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 248..934 FT /product="CR1-82_AAe_1p" FT /translation="MENCCAKCCQAITGIDSXTCRGYCGSVFHMNCSGVSR FT ALMNYFTSXRRNLFWMCDKCANLFENSHLRAITRHADEKTPLTSLTDAINN FT LQTEIKKISIKPASTAPIFNRWPLINEPNRPAKRLREIDPVGRIPLECQSG FT SKQHNLNVVSVPICEKPASKLWLYLSRIRTDVTNEAIAEMVKANLELETDP FT DAVKLVSKGADVSNMSFISFKVPLQNFGSLPRQLRFHHQ" FT CDS 855..4187 FT /product="CR1-82_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="ATCPSSRSRCRSKISEAFHVNSDSITNSXVTCSSDQL FT IVSHRFTVHPCYSTSLHSESGRSVAHTGQTDDSFVTARCCRVHLGRTAISP FT KGVLEPPDQVVHAVSSCQRSRPGPSVVVGDRISRQPLSGKYASGCNVNLPE FT ISSISSVSVDSHAIDSRKIWVYYQNVRGLRSKIDAFFLAARDSNFDIIVLT FT ETGLNDRINSIQLFGTSYNVYRCDRSDLNSVKTCFGGVLIAISKRYTSALA FT EISHGRYLEQICVACTVQGMQLFICAIYIPPDKSRLVDVMETHVASIRELC FT DLQQDDGAVLVCGDYNQPRLKWLRQEEETAVSESQITPSSSALIDGMNFLN FT MHQYNSITNQLGRTLDLVFGSDNLRLTVDESVTPLLRVDNHHPPLVISIPT FT AECTPLRNSTSNHRSHMLNYRRIDFDALQSFLEGFDWDSRLNNLEIDDMTT FT VFCNVITEWLDTNVPSKRKPVSPAWSTPLLRDLRRKKNSCQRKLRCNRNYA FT NKRNFNIASQAYRKLNSTLYRSYVQRIQFSLRSNPREFWKFVNSKRKNPSI FT PSNVYLDHSNANTNASACELFATHFSSVFADCPSTEDEVGASISAVPMNIV FT DLDSFRVTPEMVISATKKLKNSFTPGPDKIPAIVYVRCCDAIAQPLCRIFN FT KSFDQRVFPGIWKQSFMFPVHKKGDRHDVRNYRGITSLSAGSKLFEIIVGN FT FLINQTMNYISTDQHGFMPGRSVTTNLLEFTTICANQMESKAQVDVIYTDL FT KAAFDRIDHEILLRKLLKLGASERLVCWLRSYLVGRTLRVQFGNELSNTFT FT NKSGVPQGSNLGPLLFSLYINDAALLLGPNCKSIFADDAKIYVSVRTIEDC FT HHLQALLDRFVNWCRRNRLTISITKCSAMTFHRIERPIIFDYKIDGILLNR FT VDQVCDLGVQLDTKLTFIPHMTEIIAKANRQLGFISKIAQNFTDPYCLKSL FT YCALVRPILETASVVWTPHQLTWNLRIERVQKRFIRFALRSLPWRDPVNLP FT PYPDRCQLLNLDTLERRRNIAQISTAAKIISNELNTPGLLSRIDFRASRWT FT RNTAVIQHRFHRTTFGYNEPFSAMLRTFSIVEEEYEFGEPAKRFVNRIMRL FT RLV" XX SQ Sequence 4308 BP; 1161 A; 1012 C; 893 G; 1230 T; 12 other; atcgaagttt catagtctgc tgggmtamtt tgtgcttaat cagtgaagtg gcttttggtg 60 aagtggtgac aaagwcttac cttcttcatc ccccgtcatt tttttctgtg tgcagcgcca 120 tcttgcgtca aaaaattkaa actgcagttg cgtgmaatct ccagtccaag ttcctgaacg 180 tcgtcgacat agtgtttcat caccgcgtag tatactgcka ggcgctttcg attcgcawac 240 tasaatcatg gaaaattgtt gcgcaaaatg ctgtcaggca attactggca tcgattccgw 300 tacatgccgg ggatattgcg gcagtgtgtt tcatatgaat tgctctggcg tgtcacgcgc 360 tctgatgaat tatttcacat ccsaccgcag aaacctcttt tggatgtgtg acaaatgtgc 420 caatttgttt gagaattctc atctacgagc cattacacga catgcggacg aaaaaactcc 480 cttgacgtcg cttactgatg cwatcaataa tttgcaaaca gagataaaaa agatctcgat 540 taagcctgcc tcgacagctc cgatattcaa tcgatggccg ttgattaacg aaccgaatcg 600 tcctgctaaa cggcttcgtg agattgatcc tgttggacgc ataccattgg aatgtcaatc 660 tggatcgaag cagcataacc tgaatgttgt ttctgtacct atttgtgaaa aacctgctag 720 taagctttgg ctctatctgt cacgcatcag aacagatgta actaacgagg caattgctga 780 aatggtaaag gctaatctgg agctggaaac cgatcctgat gccgtaaagt tggtttccaa 840 gggtgccgac gtaagcaaca tgtccttcat ctcgttcaag gtgccgctcc aaaatttcgg 900 aagccttcca cgtcaactcc gattccatca ccaatagcmc tgtcacctgc tctagcgacc 960 aactaatcgt tagccaccgc ttcactgtac atccctgtta ttccacaagc cttcattccg 1020 aatcaggacg ttctgtagca catacaggtc aaaccgacga ttctttcgtt accgctagat 1080 gttgccgtgt acacctagga cgcactgcca tcagccctaa gggagtcctc gagccacccg 1140 accaagtcgt gcatgctgtc agttcctgcc aacgtagtcg tcctgggcct tcggtcgtgg 1200 ttggtgacag gatctcccga caacctctgt caggcaagta cgcgtctggt tgtaacgtaa 1260 acctgcctga aatttcatca atttccagcg tttctgtgga ttctcatgca atcgatagta 1320 ggaaaatctg ggtatattac cagaacgttc gtggcttgcg aagcaaaatt gacgccttct 1380 tccttgcggc tcgtgatagc aactttgaca taatagtctt gactgaaaca ggtctgaatg 1440 atcgaatcaa ctctattcag ttattcggta cctcgtacaa tgtctatcgc tgcgatagaa 1500 gtgatctcaa cagtgtcaag acgtgttttg gtggagtttt gatagctatt tcgaagcgtt 1560 atactagtgc actggccgaa atctcccatg gtcgatatct tgagcaaatt tgcgtagcat 1620 gcactgtgca aggaatgcaa ctattcattt gcgccattta catacccccc gataaaagca 1680 gattggttga cgtaatggaa acacacgttg catctattcg ggaactatgc gatttacaac 1740 aagacgatgg tgctgttctt gtgtgcggag actacaacca accccgatta aaatggctac 1800 gacaggaaga ggaaaccgct gtttctgaat cccagattac tccctctagc tccgcactga 1860 ttgatggaat gaatttctta aatatgcatc agtacaactc gattacgaat cagctcggtc 1920 gtactctgga tcttgttttt gggtctgata atcttcgcct cacagttgat gaatcggtca 1980 cacccttact ccgtgttgac aatcaccacc caccgcttgt tatttcaatt cctactgctg 2040 aatgcactcc actacgaaac agtacctcta accaccgttc tcatatgctg aactatcgaa 2100 ggattgattt cgatgcctta caatcatttc tcgaaggatt tgactgggat tctcggctga 2160 ataatttgga aatcgacgat atgacaactg ttttttgcaa cgtaataact gaatggttag 2220 atacgaatgt tccctcaaaa cgcaagcctg tctcacctgc gtggagcact ccgttactgc 2280 gcgatttacg tcgtaaaaag aactcatgtc aacggaaact gcgttgcaat cgcaactatg 2340 ccaacaaacg aaatttcaat atcgctagcc aagcctatcg caagcttaac tcgactcttt 2400 atcgctcata tgtgcaacgc atccagttta gtttacgaag caatccaaga gagttctgga 2460 agttcgttaa ttcgaaaagg aaaaatccta gcataccctc caacgtgtat cttgaccact 2520 ccaacgcaaa tacgaacgcc tctgcttgtg agttgtttgc cacacacttt tcatccgttt 2580 ttgctgattg tccttcaacg gaagatgagg ttggcgcttc aatttctgct gttccaatga 2640 acatcgtcga cctggattca ttcagggtta ctcctgagat ggtaatctct gcaactaaaa 2700 agctgaaaaa ctcgtttact cctggaccgg acaaaattcc tgccattgtc tacgtacgat 2760 gctgcgatgc cattgcccag ccactgtgcc ggattttcaa caagtcattt gatcagcgag 2820 tgttcccagg aatttggaaa caatctttta tgtttccagt acacaagaaa ggtgatcgtc 2880 acgacgtaag gaattatcga ggaattacga gcttgtctgc agggtccaaa ctgttcgaga 2940 ttattgttgg aaatttcctg attaaccaaa cgatgaacta catttcaact gatcagcacg 3000 gtttcatgcc cgggagatcg gtaacgacta atctacttga gttcacaacc atctgtgcga 3060 accaaatgga gagcaaggca caagttgatg ttatctacac agatctgaaa gccgcattcg 3120 atcgaattga ccatgagata cttcttcgca aactcctgaa attaggtgcc tccgaaaggc 3180 tagtttgttg gctgcgctcg tatctggttg gaagaacgct gcgtgtgcag ttcggtaatg 3240 agctttcaaa tacgtttact aacaagtcgg gtgtgccaca gggaagtaat ctaggtccct 3300 tactattctc tctgtatatc aatgatgctg cgttgctcct tggaccaaat tgtaaatcca 3360 tatttgcgga tgatgcaaaa atctacgtct cagttcgaac aattgaggac tgtcatcact 3420 tgcaagccct cttagatcga tttgtaaatt ggtgccgaag aaatcgattg acgataagca 3480 tcacaaaatg ctctgcaatg actttccatc gaattgaacg ccctatcatc tttgactaca 3540 aaatcgatgg tatactactg aacagagtag atcaagtatg cgaccttggt gttcaattgg 3600 acactaagct gaccttcata ccccacatga cggaaatcat tgccaaggca aatcgccagc 3660 ttgggtttat ttctaaaatt gcgcaaaact tcacggatcc ctactgtctt aagtcattgt 3720 attgtgccct tgtccgccca attttagaga ctgcctcggt tgtttggact cctcaccaac 3780 tcacttggaa cttgagaatt gagagggttc aaaaacggtt catacggttt gcgctgagaa 3840 gtttaccatg gcgtgaccct gtaaacctac ctccctatcc agataggtgc caattactca 3900 atctggacac gttggaacgt cggcgtaata ttgcacagat ttccactgca gctaaaatta 3960 tcagtaacga actgaacact ccagggttgc tatcccgaat cgattttcgc gcctccagat 4020 ggactcgcaa cacagctgtg atccaacatc gatttcatcg caccactttc ggttacaacg 4080 agcctttttc tgcaatgtta aggacctttt caattgtgga agaggaatat gagttcggtg 4140 aacctgctaa acgttttgtc aaccgtatta tgcgattacg actagtgtga ttctcacttt 4200 ttgaatgtta atgataatgt attgttagtt atcttaattg ctccttttat tatcatttag 4260 actaataagt ccgatgagtt acaataataa acaaataaac aaataaac 4308 // ID BEL-181_AA-LTR repbase; DNA; INV; 597 BP. XX AC supercont1.4; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-181_AA_; KW BEL-181_AA-I; BEL-181_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-597 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.4; Positions 2630542 2631138. XX SQ Sequence 597 BP; 207 A; 101 C; 101 G; 188 T; 0 other; tgttacggtg gatagcaacg tccctttgga tacccctcgg tctgacacga acgtagtgcg 60 ctagcttttg acatctgtca ttactcacga atctaggtag acgtatcaaa gaatgaaaat 120 agatatcaac acctctatga atcagttgta gctagcgaga agttatttca acgtttaatt 180 tagtttaaaa gcagttcttg gcaaataata ttgaatttcc tttattgtaa gtagtaatat 240 taaaaatatg aatattttga attcaattcc gaatcttatg aagatttttg agcacctata 300 aagacagcat aattctgcaa gttaacgagc ctggttgtaa cctaaaagta tatattgtac 360 acctattaaa tgaccagcaa ttaatagatt gaattttagg gcgaaactca gcagaataac 420 taaaactata ttttttgttg gccacaagtg ggatagcgga tcaaatatgt aagtttactg 480 cttaattatc ctaaaaatgt aaccaagaat taattaaatg aacttagctt ttagcgctta 540 tatcaccaac atcatcggtg ttattgctta aaagaagccg aataccctac gccaaca 597 // ID Sola3-1_CapOwc repbase; DNA; INV; 4331 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola3-1_CapOwc. XX OS Capsaspora owczarzaki OC Eukaryota; Ichthyosporea; Capsaspora. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4331 BP; 932 A; 1308 C; 1177 G; 914 T; 0 other; ggggctgttt ttttaaaaag gtagaatggc catgtctttc actcctcttg catcacctgg 60 ctctgtatgt gttcgcccct gccctgcaac cccccgcccg agggagtaga ccatgtttct 120 atgccactcg gtgtatcgga cggcgcgccc ggtcccactg gcagcggcag ggccgcgagc 180 agtgggtgac aatcagcgcg gcggcgtcac agcaggtccc tgacagcggg ctaccgttcc 240 acatcgaaca attcatctga tcatgtatca tagacaatgt ctggcagggg taggaggagc 300 tgggggatgg gggagcgcat gcgtgtcgac ctcctttgct cttcgcttgg tagaccatgc 360 tcgctcactt tcgcttcctc tctcgcgcgg acagttgatt ttacctacct gtgacgatgg 420 cagctcggcg cgttggcggc ggggacgata gcgctgcgga caaggcgtat gtgtcgatcg 480 cacccccagc cccgctcatc aaccctggtc cgcagccagg ctatgtctct ctgcagatca 540 acgtcgagtt tccttcgtcg ttcaagtatt cgccaatggc taacgctctg tccagccaag 600 aacttggcgc acatcgtgtc aagcagcggc gacgtgctga agctcttgag accgggcgca 660 acatcctgtc gactgtcaag aacaccgtcg tcaatcaatg gccgagcgat caaacgcttt 720 ggaacgagat cctttcggag acgaaggtca ccgacgataa tgacatgacg aaacgcgcgg 780 atgcattgct gacggatctt ggactacgat ggcagcgggc aacccgccct gccgaccgtc 840 gacgcatcct ctcaaccatt gcaccattct tccagctcaa gcagcttcaa ccacacttcc 900 ccggcctcga atatcgcgca ctgacggccg ctcgaaagca cggcaaacac aatggtcctg 960 gcgtcccggc ggcgcacgtc gtgattccca ggaacagtcg ggccagcgca catgatgagc 1020 gaatcctcgc gtttgccatg gcgaccggta cagcatcccc gaccgcgtgc actgtcgacg 1080 gtcagccagt catctacctc aacatgactc tcaatcagat ctacaagcgc ttccttgcca 1140 agtaccctgg cgaaggtacc aagccgattg tctcccggag caagttctat gacgtgctgc 1200 agtcatcggg cgtaacgttc ggcgagcgta aagtcaagac cggcctttgc gcgacctgca 1260 accgtggcgt gacgattttt cgcgaaatcc ttccgctcct cattcaccag gcgagctcgg 1320 ctggcgccat ttccttggaa atggacgagc gcctctgtgc ttttgccacg ctcgctgagg 1380 gtcatgcaag gactatagcg agctctgctg ggaagatgga gcatcacccg cacccgctac 1440 attgcgagct cttcgccttc ggcctctgtc acgaggagca caattgtttt gtcgacggtc 1500 acagctcttg ccccgagtgt gccccactgc atgagctgat tctacagctc aatggcgctc 1560 tctcgggccc gaacgctgac ctcctcaaac tggcctgcaa cgacctcgtc cttgtgaatg 1620 cccaccattg gcgcgcggcc gctgccaagg ccgatttcaa cgaggccctc gagaatcttt 1680 cggaaggtcg cctgatgatc gtggtggatt tcaaggtccg tttttttgcg agcgatgttg 1740 accatgatac ctcatttgca cgtgactcac caaccgcgct aaagatagat gaaggtgctc 1800 cagagttctg ctcgagaagt gcaagttgac ttctttggca agcatggctt tccgtatcat 1860 ggtgctgtgc tcctctggat gcatcagggg gtgcggcaca tgcatatgct tgatcactgg 1920 agcgatgtta atcggcagga ttgggtgttt gtcggccatg cgcttgaggc catcgctggc 1980 ttcactcggt caatgattca acacaagagg atgccgccgt aagtgcttat gtgaggcagg 2040 tgatggacga gtgtgtgtgc ggtgatttac ttgtgttttt ttccgagcga gcagaatcac 2100 atcggccgcg ctttacagcg acaacggcgc ttcgtattcc aacagcgctt tgcctgtgtg 2160 gattcagcgc cgcttcaagt tactgtccgg gtttccgatc acagatattc accactttga 2220 gcccggcgag ggtacgttca tgtgcaggtg tttattgact gcttctccgc tcattgattc 2280 cataggcaag tcctactgtg acacgcactt tgcagcaatc acgcacgcgc gcatgcactg 2340 gatccgtaac gggaacgacc tcatgaccgg gctggatgtg gaacaaggtt cgtgcatgac 2400 gcaatctcaa atgtgtgcct gcgtttctga ctctcttacg actcagccgc acagagtgtg 2460 accaatacaa ctgtcatggg catcaacttc tcgccctcag ttctcaagca ctacggcaac 2520 aagtcacccg tgtcgccaat ccccggtctc gatgcggcga tgcacattgt catcaccgac 2580 gctggccttg aagcataccg cattcgcgat taccgcttgc ttgccacatg gaacgcaacc 2640 gatcatcccg acccgtttct tcccacgttc ggcgcaagcg tgagccgggc tggcatcgtc 2700 ctttggagcg ccgattctga aactcacgaa gcgcttcgtg ccgaccgcga cgctgccctg 2760 gtcgaccact gcgactttca ctcgccgccg tcgtcgataa tcagccacga accgcataca 2820 ccaccacaca acagtcagca acagccaacc gcccctggat cgcctacgtt gcttcgggcg 2880 ttcacgcaag ctacaccgct gcatcggaac tgtgtttgtc catcagcatg cggcgcaaag 2940 ttcagctcgc ctgacgactt gagagcgcac acacggatgg cacatcgctc tgcgagcccc 3000 acgaccatcg ccgctgccgc ttttgcagcc gccgccgacg cagccaaacc cccttccgat 3060 gtcgaggacg cgaacgagga cgatgaggat gacgaggacg atgtcaacga ccacgacgaa 3120 gaggatgacg aggacgatgt caacgaccac gacgaagagg atgacgagga cgaggacgag 3180 gaggatgaat atgacgacga cgactccgat gacgatgatc acaaccacga cgaggaggac 3240 agccgtctgc gggcttcaat ctatccccgg gatgtcccgt tgtacgcaca ggccacgctc 3300 ctccagtaca cagctcgcac gggtcggtca ttatccgtgc agcgaagttg gacagcggaa 3360 gaacggacaa gcttggcagc gcgactcgaa gttactccgg gctacgcgct catcacggcc 3420 cctcgccgcc cgcgtctccc aaccgcgatg acgattttct tgcggcagcg attcgacgag 3480 ggtattgatg atcccaaccg ccggcacgac aacagtgtgt acaaagactt tgtccagcgt 3540 tttggagtcg tccagtctgc gttgcactct gtcacccagc agcgcatcaa gtcgctgttt 3600 gtttcctggg cgaatcgacg cgactctcaa gctgtcaaca accgcagtcc agcagaagta 3660 ctctcaccat cagcggcagc accgcgaggg cgcaaacgac ggcgtcagaa cgacgaggat 3720 atcttcttcc gcgctgaccc gcacaacgat ctcgatgagg cagcagacaa cgagcacacg 3780 tctctgccct cgtttgacga cgagggcgac tacgaacccc aagcaaggtt tgtctcagtc 3840 tctagttcgc aacgccgagt ccgaactcgc agcagtcaac aatctccacc cccagacgat 3900 agtgattaat tgtgcgaaga ataaaagagt tacatttggc ggcctcaggc caaggggagg 3960 ttgggaggtt gggtggttgg tgaaaaagtt ggtgtagtag ctgcgggtgg cggaatcgtc 4020 ccctatttcg gaccaattat tgttggacaa ggtgaatctg caacagaatg tagcagcggc 4080 gcagatcagg gggccggaaa atagcaccag cgagtcagtg agcaaaacga ccgcgacaca 4140 acggctcctt cactcgctcc cctctgccgc ccaacgcact tccggtgcgc ccccgaccct 4200 cgatcacact gatggcatgg tgtacacccc acagtcagcg cccccagcga cggaaagcga 4260 tgtaggacat gtgctggata tgaacgacat attttggggg gttcattcta cctttttaaa 4320 aaaacgaccc c 4331 // ID Mf3_MF repbase; DNA; INV; 700 BP. XX AC AF291758; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Metopeurum fuscoviride Mf-3 repeat region. XX KW Mf-3; Mf3_MF; tandem repeat. XX OS Metopeurum fuscoviride OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Metopeurum. XX RN [1] RA Massonnet B., Leterme N., Simon C.J. and Weisser W.W.; RT "Characterization of a (TG) repeat 'Mf-3' in the aphid species RT Metopeurum fuscoviride (Hemiptera, Aphididae)."; RL Unpublished. XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR Genbank; AF291758; Positions 1 700. XX SQ Sequence 700 BP; 238 A; 116 C; 133 G; 212 T; 1 other; gttataatat tatataatgt gtataagaac ccaaggtgta cggcgcgtaa aaatatatcg 60 aacgtgtgtg tgtgtgtgtg tgtgagagtg aatattaata ataaaaataa aatatatatt 120 taatattctc gcggctttct attcagccgt gtgcgntttc ctattgttct aaaaaaataa 180 taataatata atgtgcggcg cgcgagcagc ctctatgcct acggataacg acccgaaaat 240 aatgtattct aatttcgatg cctcacgcga acggcggcag cttactccca ccgaggtata 300 agacgacgta ctatattata tggtatatag gtttattatc ggcagcagca tatataataa 360 tataatatac gtctaataca acaacaacaa caacaacaac aataatagta acaataataa 420 tgatatatac gcggggcgat tgcgtataat atattataac gaaataatat aataacagcc 480 gccgcgaggt cgggacggtg gcgacaaaaa gtaggttaat aattctctga gtctgtgttg 540 ttcgagacga cgatgattgt cgtaacggga aatccgatca ttttttttag agttacacca 600 aaaaccaaaa tcgattttgt caaaaaccaa tttagcgtaa gaattcccag gtttccataa 660 tttttttttg tttttctcga ctttttgaac actattggga 700 // ID Gypsy-625_AA-LTR repbase; DNA; INV; 294 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-625_AA_; KW Ty3_gypsy_Ele63; Gypsy-625_AA-I; Gypsy-625_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-294 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 294 BP; 68 A; 56 C; 73 G; 97 T; 0 other; tgttgaggac aaccctatcg cggtgtacta tgaagtacta cgcacgccgc tgggctcgca 60 ctcggcagca taactgctaa tgtgtgatgc tggagaagcg catcaatgaa tgattttaat 120 gtggttgtca tcgtattgta tctgtgtact tcagttagtt tgtttacgtc ccgtatatcg 180 ttttgtactt ttagcgtgtc gcaataaatt atgttttatt agttacgtgc gcgtttaatt 240 ccctgatgta gggacgagta tagagaacga ccatcggggt atttccacgg taca 294 // ID Gypsy8-I_Dya repbase; DNA; INV; 10311 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8_Dya; KW Gypsy8-LTR_Dya; Gypsy8-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-10311 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1069-1069 (2009). XX DR Genome; chrU; Positions 830636 820326. XX CC Positions [7665-8054] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3409..4905 FT /product="Gypsy8-I_Dya_1p" FT /translation="MTPGSTLPYITLNLFDPPQAVKFLIDTGSSYSFIDPA FT ISPQESTRTLDREIKITTVMDSYTLTMATTLPMPKELGKQGEINLLLFKFH FT TYFNGLLGIDVLTKLKAKIDIKNLKLETQSASIDIITQINRGTQELHIAPH FT SKSRIKLPVDIDDGDFSCNEVGINKDLTITGGLYTAIDGHSYLEVANNSHD FT EQTLCLDQPIEVAPYDEDEHDQIYSMTLGPDPTEQFKRPIDQIRTEHLNPE FT ELQGLFAVCNQFRELFYDERETLTFSNTVKHSIPTTDDTPVHVKTYRYPYI FT HKEEVRNQIKKMLEQDIIRNSHSPWCAPVWVVPKKTDSDGNKKWRLVIDFR FT KLNEKTIADPYPIPNINEILDSLGRAKYFTTLDLASGFHQIQMNPEDRAKT FT AFSVENGHYEFTRMPFGLRNAPATFQRVMDNVLGTLIGDICLVYLDDIIIF FT SASLQKHLDDINSVFRKLRTANFIIQLTKSNFSCGENRFLGPRSPLKKESN FT PNR" FT CDS 5679..8054 FT /product="Gypsy8-I_Dya_2p" FT /translation="MTLGPDPTEQFKRPIDQIRTEHLNPEERQGLFAVCNQ FT FRELFYDERETLTFSNTVNHSIPTTDDTPVHVKTYRYPYIHKEEVRNQIKK FT MLEQDIIRNSHSPWGAPVWVVPKKTDSDGNKKWRLVIDFRKLNEKTIADRY FT PIPNIKEILDSVGRAKYFTTLDLASGFHQIQMNPEDRAKTVFSVENGHYEF FT TRMPFGLRNAPATFQRVMDNVLGTLIGDICLVYLDDIIIFSASLQKHLDDI FT NSVFRKLRTANLKFQLTKSNFLRKEIDFLGHVVTQEGIKPNPNKIEAIKGF FT PCPKTTRQIKSFLGLIGYYRKFIKDFARITKPLTNQLKGGKSVIINNEFRE FT AFEFCKTLLVNDPILRHPDFTLPFTLTTDASNVALGAVLSQGPKTCDRPIC FT FASRTLNEAETRYSTIEKEMLGIVWAVKYFRPYLYGRKFKLVTDHKPLLGL FT RNFNGQNAKLIRWRDALSEYTYELEHEKGSQNVVADALSRVEPNTHINIAM FT PNEPGDIPVSQSPLNAFNSQVTLTVSPDSACTICVRSKNKIRRQISEPLFT FT LETVTDILKETLEPNKTLAVFATDEIFEIIKNSYSEYFTGTNNYKILRCLT FT FLKEPSSTDELVSLIKTTHVNKNHRGVDEMYAHLKREIYVPHLKSLITQAI FT WDCETCLTLKYDRHPQKPPYILPEIPNEPLDILHTDIYTINKNYNLTIIDK FT FSRFAQAFPLPNRNSISVTKAFKTFFCQFGIPKKVIFDQGAEFAGTVFTDF FT LEQYGVEVHVTSFQQSSSNSTVERLHSTLTEIYRIILAKRK" XX SQ Sequence 10311 BP; 3691 A; 2541 C; 1845 G; 2224 T; 10 other; cgacgtaact ggcgcagccg gaaaaggaac ctaaaggaga tccagaacat ccagaaggag 60 gatatgaaga tcaccgggat attcttaaat attatacttt aatgcaggta agcagccggc 120 caagtgactt caataagtga gtgcaaatgt aaaataaaga aagaaacgaa attaaaaaga 180 atgcagtgca tccgaacaaa ttgtaaattc cgtgcgctta aaagaaaaac aaccgactcc 240 aaacataaat aaataccatt atacacaaga acaaatatgc agaatcagca aattacggtg 300 tgcggaagca aaataaacgg cattaacaat tgtgcattac acacgcaaat ctaaaggcat 360 attaataaat attaaaacga tcatcaatcg ctgatgactg taaggcatcg agctttaaaa 420 aaagtaaaca ctaaatatgc gacgaaacaa atatactatt aaaccctaaa attcacaatt 480 taaatttcgt actacataca tactacgtac tacgtacata tttttagcgc ctaatatgta 540 ccacaaaaaa ataaaataaa taatttttac acaaaaaatt tatagaaaca tatttttagc 600 aacgatcaat cgctgattgc cgctataaaa tgcatcgcgc attaaaatat gtacacgcct 660 aatatgtacc acaaaaaaat aaataaataa tttttacaca aaaaaaattt atagaaacat 720 atttttagca acgatcaatc gctgattgcc gctataaaat gcatcgcgca ttaaaatatg 780 tacacgccta atatgtacca caaaaaataa aacaaataat ttttacacaa acaattttta 840 tggaaacata tttttagcaa caatcaatcg ctgattgccg ctacataatg catcgcgcat 900 taaaatatgt aaacgcctaa tatgtgccgc aataaaaata ataataaata attttgcata 960 ttacacaaaa attttctatg gaaacatatt tttagcaaca atcaatcgct gattgccgct 1020 acataatgca tcgcgcatta aaatatgtaa acgcctaata tgtgccgcaa taaaaaataa 1080 taataaataa ttttgcatat tacacaaatt ttttttatga aaacatattt ttagcaacaa 1140 tcaatcgctg attgccgcta cataatgcat cgagcattaa aatatgtaac gccttatatg 1200 tgccgcaata aaaaaaataa tttcccatat tacacaaaga aaaattatag aaacatacat 1260 aggcggcaat caataataat taatagccgc tcatgcatcg cgcataaata tgtgaatata 1320 gccacactga aaaaaaaggg aaacgaaacc tatgtgcccc aacttaacaa atgctagtgc 1380 tccgccacct aaattaatgc ttgtgtgtat ataagtgcgg ccgatcattt ctttctgtgt 1440 gccggggaag aaaggggaaa ccggcaccga cagcagttcg tagggagccc ccctccaggc 1500 ctcggcaacc gcacttgaca gtcgaccact tgagtttgtt tatgtgggca cacatacttg 1560 catacacgcc gcagcgtccg gccgctacat tgtagacggt gtcgcatgac aacataaact 1620 atactcagta aaatcgacaa aacgaccgcc gcggagtcgg atgcttttgc gcatcggatg 1680 ctaaaataaa aatatacaaa taatcaaatt ttacgactgc agtgcacaat gtatcaagaa 1740 aacacttaga aaaaaaaaaa aagagaaaca ctgcactgaa accacttggc cgggggcttt 1800 gggaatttaa ttctccatta ataaacatac actaacaata tcaaattaca gatcccaacg 1860 tattcgttag gatcgacaac acatcccact atcgataagt acaagcaaaa cagatatacg 1920 tagctaagaa aaatacccaa cataatttaa gaaaagggga agagaaaaga atacgccaca 1980 cccctaaact atatattatc cgtactattt tttaacatat tcatacattc attatataca 2040 cattaaacaa aaagaaaatt tttctatacc ttatatatat atattttttg cgaattcaaa 2100 atttttatta tacccttatt acccaaaaat ttttataaac accttactag gatacaatac 2160 caataccaat accattacat atatacgtta ttctctcaga caatccacac ctataattcc 2220 tatatacatt ttactagaat acaacaccaa taccaatacc cttacatata tacgtcattc 2280 tctcagacaa tccacaccta tacatctcaa actctttagt actctcgcaa atgtccagaa 2340 aaacaatcgt ggataagaca aaatctaggc taggtcttag aagtagcagc agtttaccag 2400 agataaggga ggaagaaatt atagttatgg attccggaaa cgggtcaggg ggatcgagga 2460 cgactagtcc ggcaccccac gtaataaaca tggccaacgc tagcacaata ctaaacacct 2520 caatgaatac atcgattaat atgacactaa atccacaaga catactagcc tttgtaaaac 2580 aactgccgac ctttgagggt ataccgggga cactacaaaa attcatagtg agcgtggagg 2640 aagtgatcat gctcattagg ggcactgatc agactccgta cgggcagcta ctgctacgca 2700 cactatgaaa caaggtgata ggcaaggcgg acgaggccct gaatatgctg gatacaaacg 2760 catgtacgca tgcaagaaat cggagcctat tctaatctcc gaaatacaga atcaaccgga 2820 cggtcttccc ttcgggaaat tgtttcacga aattaacaga ttgagaagcg agctgctgac 2880 actggccacc gattccgaac aagggggctg tgcagcatcg aggtgagctc tttacaacgg 2940 catttgccta aatgccttcc tagtaggcat ccgcgacccg ctcaggacgg ccatccgggc 3000 taggaacccg tcaaccatag aacaagccca cgaatggtgc caggcaaaac aaagcttcat 3060 gtaccaaaga cagaataggc ccagaaataa ttacaataac aatttcgacc acaacagggc 3120 acccccaaac agaccagggg gacgcttcga acgataccgc aactatcccc aaaacaacta 3180 caggacggat aaccggaggg ataacccctt ccggcaaggg gaattccgca acaacgacaa 3240 taggggctgg aataacaatt tcagggcagg accaccggcc ccgagacaag ataacccaga 3300 ccgacgcaat ggaccccctc caagggctcc aacaaatttt aacaatatcg acgacaatgt 3360 aaattttcaa ccaggagcct cgaccgacaa gtcggctacc taagtaacat gacgcctggc 3420 tcgactcttc cctacataac actgaaccta tttgaccccc ctcaagccgt aaaattcctt 3480 atcgacaccg gttcctcata ctccttcatc gacccggcca tcagtccgca agaatcaacc 3540 cgcaccctcg accgagaaat caaaataacc actgtcatgg acagttatac cctgacaatg 3600 gcaacaacct tgcccatgcc aaaggaattg ggaaagcagg gcgaaataaa ccttctactc 3660 ttcaaattcc acacatactt caacgggctc ctgggcatag acgtcctcac aaaactcaag 3720 gcaaagattg atatcaagaa cctaaaactt gagacacaat ccgcctcaat cgatataata 3780 acacaaatta acaggggtac acaggaactc cacatcgcac cccacagcaa gtctcgaatt 3840 aaattaccag tagacataga cgatggggac ttttcctgca atgaagtagg catcaataaa 3900 gacctcacta tcacgggcgg actttacact gctatcgacg gacatagcta cctcgaggta 3960 gcaaacaact ctcatgacga acaaacactt tgcttagacc aaccaataga agttgcaccc 4020 tatgacgaag acgaacacga ccaaatctac agcatgacat tggggccaga ccccaccgaa 4080 caattcaaac ggcctatcga ccaaatccga actgagcacc taaacccaga agagctacaa 4140 ggactgttcg cggtctgcaa ccagttccgg gagctattct acgacgagag agagaccctc 4200 accttttcga atacagtcaa acactctatc ccgacgacag acgatacacc ggtgcacgta 4260 aagacttaca ggtacccgta catccataaa gaggaagtac ggaaccaaat aaaaaagatg 4320 ctggaacaag acataatccg aaatagccac tcaccttggt gcgcacccgt ctgggtagta 4380 cccaagaaaa ccgactcgga cggcaacaag aaatggcgac tggttataga tttcaggaaa 4440 ctgaacgaga aaaccatcgc cgacccatac cctatcccca acataaatga aatcctcgac 4500 agcctaggcc gggccaagta cttcaccacc ctggacctgg caagtgggtt ccatcaaatt 4560 caaatgaacc cggaagatag ggctaagaca gcattttcgg tcgaaaacgg ccattatgaa 4620 tttaccagaa tgcccttcgg gctgagaaac gcccccgcca catttcaacg tgtcatggat 4680 aacgtcttag gcacgcttat aggagatatc tgcctagtct acctagacga tatcataata 4740 ttctccgctt cactccagaa acatttggac gacataaact ccgtcttcag aaaactccgc 4800 acagcaaatt ttattatcca actcaccaag tcgaacttct cttgtggaga gaatagattt 4860 cttgggccac gtagtcccct caagaaggaa tcaaacccga accggtaata taatagnnnn 4920 nnnnnngatt gtaacacgat tcacttaata ataataataa aataacacaa ccctaaattt 4980 gaacaatggc aaaacagtct cttacccatt ccacgaatca tttgcaaagc cattttgcta 5040 agcattttca aatccaaatg gccagccata aatgttcaag gaaaaaacat catttcttca 5100 aaatgaaact agagccacat atgatatata tgggggggaa attatcgggc atttggattc 5160 caatttaagc aattatgttc aatcaaatga gcgcatgcgg aatgcaacgt acaatgcaag 5220 ataaacggag tgtgtcaaaa caatgatgaa tgtgcaataa ggaaacaatt gccaagaagg 5280 cacttcactc cttaaggtaa aatgttaaaa cgtacgaata ccgaggtcac atttgcacgc 5340 catgtccagt gcacagattt atagatttaa aattcaactt cacagaaaat cagaaacatg 5400 ggacaaaaga taacacatta acaagggtac acaggaacat cgcaccccac agcaagtctc 5460 gaattaaatt accagtagac atagacgatg gggacttttc ctgcaatgaa gtaggcatca 5520 ataaagacct cactatcacg ggcggacctt acactgctat cgacggacat agctacctcg 5580 aggtaacaaa caactcccat gacgaacaaa cactttgctt agaccaacca atagaagttg 5640 caccctatga cgaagacgaa cacgaccaaa tctacagcat gacattgggg ccagacccca 5700 ccgaacaatt caaacggcct atcgaccaaa tccgaactga gcacctaaac ccagaagagc 5760 gacaaggact gttcgcggtc tgcaaccagt tccgggagct attctacgac gagagagaga 5820 ccctcacctt ttcgaataca gtcaatcact ctatcccgac gacagacgat acaccggtgc 5880 acgtaaagac ttacaggtac ccgtacatcc ataaagagga agtacggaac caaataaaaa 5940 agatgctgga acaagacata atccgaaata gccactcacc ttggggcgca cccgtctggg 6000 tagtacccaa gaaaaccgac tcggacggca acaagaaatg gcgactggtt atagatttca 6060 ggaaactgaa cgagaaaacc atcgccgacc gataccctat ccccaacata aaagaaatcc 6120 tcgacagcgt aggccgggcc aagtacttca ccaccctgga cctggcaagt gggttccatc 6180 aaattcaaat gaacccggaa gatagggcta agacagtatt ttcggtcgaa aacggccatt 6240 atgaatttac cagaatgccc ttcgggctga gaaacgcccc cgccacattt caacgtgtca 6300 tggataacgt cttaggcacg cttataggag atatctgcct agtctaccta gacgatatca 6360 taatattctc cgcttcactc cagaaacatt tggacgacat aaactccgtc ttcaggaaac 6420 tccgcacagc aaatttgaaa ttccaactca ccaagtcgaa cttcctacgt aaggaaatag 6480 atttcttggg ccacgtagtc actcaagaag gaatcaaacc gaaccctaat aaaatagagg 6540 ccataaaagg atttccatgc cccaagacca ctcgacaaat taagtctttc ctgggactca 6600 taggctatta caggaaattc atcaaagatt tcgcgcgaat taccaaaccg ctcaccaacc 6660 aattaaaagg tggcaagtcg gtaataatca acaacgaatt ccgcgaagcc ttcgaattct 6720 gcaagacact acttgtcaac gacccaattt tgagacaccc agacttcact ttacccttca 6780 ccttaactac agacgcgagt aacgtcgcgt tgggagcggt gctgtcgcag ggaccaaaaa 6840 cctgtgacag accgatatgc ttcgcaagca ggaccctcaa cgaggctgaa actcgctact 6900 cgacgataga gaaggaaatg ttaggaattg tgtgggcagt aaagtatttc agaccatacc 6960 tctacggtcg gaaattcaaa ctagttaccg accacaaacc cctattagga cttaggaatt 7020 tcaatggaca gaacgcaaaa cttattcgct ggcgggatgc cttgagcgaa tacacgtacg 7080 aattggaaca cgaaaagggt tcccaaaacg tcgtcgcaga cgcactgagc agggtggaac 7140 caaatactca cataaacata gctatgccga acgaaccagg agacattccg gtatcccaga 7200 gcccactcaa cgcttttaat tcacaagtca cccttacagt ttcaccagac tcggcatgta 7260 cgatctgtgt ccgatccaaa aataaaatca gaagacagat ttctgaacct ctctttacct 7320 tagagacagt cacagatatc ttgaaagaaa cgttagaacc aaacaagacc ttagccgtgt 7380 tcgctactga cgaaattttc gaaataatta aaaattctta ttccgaatat ttcacaggaa 7440 ctaacaacta taaaatcctc aggtgcttaa cgttccttaa ggaaccttca agcaccgacg 7500 agttagtatc acttattaaa acaactcacg taaacaaaaa ccaccgaggg gtagacgaga 7560 tgtacgccca cctaaaaagg gaaatctacg taccccactt aaaaagcctc atcacacagg 7620 caatttggga ttgcgagacc tgcctcacac tcaaatatga caggcacccg caaaaaccac 7680 cctacatatt gcccgaaatc ccgaacgaac cattagacat cctgcataca gatatctata 7740 caatcaataa aaactacaac ctcacaatca tagacaaatt ctctagattc gcgcaagcat 7800 tccctctacc caatagaaat tccatcagtg taaccaaagc atttaaaaca ttcttctgcc 7860 aattcggtat acccaaaaag gtcatattcg accaaggggc cgaattcgcg ggaaccgttt 7920 tcacggattt cctagaacaa tacggcgtag aggtgcacgt gacatccttt caacagtcaa 7980 gcagtaactc gacagtcgaa aggctccatt caacactgac ggagatatac cgaatcatcc 8040 tcgcaaaaag aaagtagggg aaactggact tggaccatga agacatctta tcagagacag 8100 ttgtcacata caataatgcg atccactcga gcacgaacct aacaccgttc gaactcttca 8160 gcggccgtac ccacgttttc acaaaaacag tcaagttcaa caccggacac gacttcctaa 8220 ataaactcaa tgaatttcaa accaaaatct acccggaagt caaaaaacac ttggaagaca 8280 tcgctttaaa gcgcacgtct aaactcaacg aaacgaggga ggacccccct gccttaccta 8340 ccggaaacac agtcttccgg aaggagaaca ggaggaataa aataacaccc cgtttcacga 8400 gacaaacagt accacgggac agcggacgca cactccacac gacacggaac caacaaatac 8460 acaaacaaaa gattaggaag atttccttac ccaaatgacc accgacatgg gcttaacaca 8520 tagatagact tggattttgg ggctccgtgt tattttctta ccaccaatgg gttttatatt 8580 cttccactaa aatatttttt tggccaactt ctattaccaa ccttttgacc agggatttcg 8640 tcgcgttgat acctttatca gaaccttaca ttaggagtta ctagcctagg cttgttggcg 8700 gcgcatacta caccaccgat gactaactag aaacacttaa ctgacgttca ttacagattg 8760 acgatgacca acgccaagga actgacattc aaaccattga ccaacgacca agcaataata 8820 aaagtccaac taggaaaggc cagaacggta tcagcattca caacaataca gcatacaata 8880 cacctgaacg agtacgagat cgcagccaaa gggttgtaca agctcatagg ggatctcccg 8940 aacaaatcgg aggccactaa ccttaaatcc ctactactcg cgaaacttga ccttatcaac 9000 cataggctag cctctctgct acccaacacc aatagcggta gaattagaag aggcttgatc 9060 aacggtctag ggacaatagt gaagatagta agcggtaaca tggacgccaa cgacgaagac 9120 agaataaaca aacaactagc aaacctaaat aacaatgagg agagaatgaa tagattcaag 9180 aatgaaacaa tatttagatt cagggaaata gcaacacaca ttaataggga acaggagata 9240 atgactgata tcattaactc aaatcaaaat aaaattttta aagagctaaa cacaggacaa 9300 agggacctag agctagaccg tcttatccac aggttaaata tcaacttaga cctcctctac 9360 gctcatttga ctagcgtagc ggaaagcctt ttattctcta aattaaaaat aatccccaaa 9420 tacatcctaa ccaatcccga aataattaaa atcaggaact ctctaacgac acagggaata 9480 caaatcaacg cagaccacga gatctatgaa ttattatcgc ttcgtactag caccagtaaa 9540 aaaaaccgta acattcgaaa tcaagatccc aattttgtct actacaaatt ataaattgta 9600 ccaaattata gcaataccgt tcaacggcac gcagatagta aacctaccca actacatact 9660 aagaaataac gaagaattaa atattgtaca agacccatgt atcaaagtaa actccgaatt 9720 tgtatgcgaa acccccacca tccccggtag ccagtcctgc ctccacaacc tacttaacaa 9780 caaaccagcg acatgcgaca cggtggaaac cgcagggaac cagcaaatct acagcccaat 9840 ggaaggaata ctcatcatca ccaacgcagt aaacctcaac atatcaacaa actgcgcacc 9900 ggaaagaata gtaaacggct cgtttatcgt aagatacgat aactgctcta tcttggcaaa 9960 tggaacgaga tatgccgaca ccatcaccac gatcacggaa aaactacaga ttaacttgat 10020 caaagccgag gaggtaaaaa ccatacccgc aggagaacca ctcagcctgc ggaaactaca 10080 actaaataca ctaaaaaccg aagaagacat cgcagcaatt tacgacaacg caacgaccca 10140 ccttacagtc atctgcatcc tacttgccgg ctccacggca ttggccgccg ccgcctgggc 10200 gctaaggaag aataaaacga cgtacaccgc cacgcatgtc cttcctaccg gaactccggc 10260 cataccttcg ctatggccct cgttccgtct tgagggggga ggagttatcg g 10311 // ID EnSpm-3_HM repbase; DNA; INV; 6655 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE EnSpm-type family: consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6655 RA Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1904-1904 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 253..1530 FT /product="EnSpm-1_HM_1p" FT /translation="MFLCYLCNKGYNSISELRVHLQRHKDNRQLSIPIYCR FT QDKCISSFSVLFNFIRHLNTYHQINTERVSEAAAFIEENPENLFLCDNYAP FT CINXLKDSSPNLDVVSSVNSLDFLQDVKDEGASLVTQLRANSSIPYGIIPT FT IVQSYNNMAGSLSSYFKEEFNKSLLLAGIDCAVIDQISQEMQPKLELCNNP FT LDFLSTRYRIDSYFAGHPLMVQPESICYMPRLESHCGHSKFVYDQFQYVSI FT EKTXFNLLQNEAYVKAFLEKKCRPGVSTDFSDGSRFKEHYLFSDVSKISLM FT IQLFYDGLGVTNPLRSQGSVHNVGVFYYTIKNLPHAFNSCFGNVHLLAIGY FT THDIAVYGHGPILNKFVNEIQELSTIGLQGVFPVLGTRTIYASLCQVTCDN FT LALNGLLGFIESFSGRSFLHNLLCNKRRHSN*" FT CDS 1403..2830 FT /product="EnSpm-1_HM_2p" FT /translation="MQVFVKLLVIIWLSMACLDLLNLFLADRFCTICYATK FT EDIQIKFEDDLFEMRTVHSYKTDLINLVASRTSNKNNSNQLKNHYRGVKTE FT CPLNKIKGFHVTDNWCLDIMHTLLEGVVLVELGCILHGLCVLDKCLTLSDI FT NKAISLLWGKITIEKTHKPAEIFKIQEPGHVLTPSMKAVQWWALLKYLPMA FT VGKIVPLKNKNWKFLLHLSHLVDLIFAPXFTHEMXVYMKHVISDHLSMFSK FT LYCSDEVRLRPKHHFLVHLPNIIIKSGPLVGMSCMRYELKNSFFKRCAHIV FT CNFTNICRTLAHRHQRQALESQLSNKHIRKIITVVKYSIESTYSLPFYNAM FT QNILCFKSCEEIAIAKTLHAGSVEYKQGHFIAINIDMETGLPNFGKIVCFV FT SSANNNQVDVYKGEKWHFVLEIVKTNNFTYHLHSYEVIFLCQPEYQIFKLS FT NFVDYHPLYCHSLLATTEKKNFVRLPYHVF*" XX SQ Sequence 6655 BP; 2256 A; 896 C; 1056 G; 2433 T; 14 other; tacactctcg cggaaagtgt ggacataaaa ttccacacta agtgtagagg ccgtaatttg 60 cacaattttg tggagttatt atttcagaaa aattgatttg agtgaaaatt ctataaaaat 120 taaaagattg cacattagtg tggagtaaaa taatgtgcat gaaacgcgtg atataaatta 180 agccgccatc acgagcattt ataagagtta naaaagttat aattattttt aagaaattag 240 catattttaa raatgttttt atgttatctt tgcaataaag gttataatag tatttctgag 300 ttaagggttc atttgcagcg acataaagat aaccgacaat tatctatacc aatatattgt 360 cgtcaagata agtgcataag ttcgttttca gttttgttta attttattag acatttaaat 420 acttaccacc aaataaacac agagagagtc agtgaagcgg ctgcatttat agaagaaaat 480 ccagaaaatt tgtttctatg tgacaattat gccccttgta ttaaccamtt aaaagattca 540 tctcctaatc tagatgtggt ttcgtctgta aatagtttag attttttgca ggatgttaaa 600 gatgaaggag catcacttgt aacacaactt cgtgccaata gcagtatacc ttatggtatc 660 atacctacaa tagtccagtc atataataat atggctggat cactatcatc ttattttaaa 720 gaagaattta ataaaagttt attattagct ggtattgatt gtgcagttat tgaccaaatt 780 tcacaagaga tgcaacccaa attagaacta tgcaataatc ctcttgattt tttatcaaca 840 cgttatcgca ttgactctta ttttgcaggg catcctctta tggttcaacc tgaaagtatc 900 tgctatatgc cacggttaga gtctcattgt gggcatagta aatttgtata tgatcagttc 960 caatatgttt ctatcgaaaa aactwtattt aatttgttac agaatgaggc ttatgtaaaa 1020 gcgtttctgg agaaaaagtg tagacctggt gtctcgaccg atttttcaga tggaagtcgt 1080 ttcaaagagc attacctctt tagtgatgtc tctaaaattt cccttatgat ccaattgttt 1140 tatgatggtt taggtgtaac aaatccgttg agaagtcaag gctctgttca taacgttgga 1200 gttttttatt acactataaa aaatcttcca cacgcattta actcttgttt tggcaatgtg 1260 cacttgttag ccattggtta cactcatgat attgcagttt atggtcacgg gccaatttta 1320 aacaaatttg ttaatgaaat acaagagtta agcactattg gattacaagg agtatttcca 1380 gttttaggta ctcgtacaat ttatgcaagt ctttgtcaag ttacttgtga taatttggct 1440 ctcaatggct tgcttggatt tattgaatct ttttctggca gatcgttttt gcacaatttg 1500 ctatgcaaca aaagaagaca ttcaaattaa atttgaagat gatctgtttg agatgcgtac 1560 tgtgcatagc tataaaacag atctgattaa tttagtagca agtaggacaa gcaataagaa 1620 caattctaat caattgaaaa atcattacag aggtgtaaag actgaatgcc cactcaataa 1680 aataaaaggt tttcatgtga ctgataattg gtgtttagat attatgcata ctctattgga 1740 aggtgttgtt cttgttgagt tagggtgtat tcttcatggt ttgtgcgtat tagataagtg 1800 tctaactyta tcagatataa ataaagctat ttctttatta tggggaaaaa taactattga 1860 aaaaactcat aaacctgcag aaattttcaa aatacaagaa ccagggcatg tacttacacc 1920 ctcaatgaaa gcagtacaat ggtgggctct gcttaagtat ttaccaatgg ctgtaggaaa 1980 aattgtacct ctaaagaata aaaactggaa atttctgctt cacttgtctc atttagttga 2040 tttgatattt gcaccawgtt ttacacatga aatgrttgtt tatatgaagc atgttattag 2100 tgatcattta tctatgttta gcaagttata ctgcagcgat gaagtaaggt tacgtccaaa 2160 gcatcatttt cttgtgcatt tgcccaatat tataataaag tctggtccac ttgttggcat 2220 gagttgtatg cgatatgaac tgaaaaattc attttttaaa agatgtgcac atattgtatg 2280 caactttacc aatatttgtc gcactctagc acatagacat cagcgacagg ctttggagtc 2340 acaactctca aacaaacata ttcgcaaaat tatcacagtg gtaaaatata gtattgagtc 2400 tacttattcg cttccttttt ataacgctat gcaaaatata ctgtgtttta agtcatgtga 2460 agaaattgct attgcaaaaa cattgcatgc aggaagtgtt gaatacaaac aaggacattt 2520 cattgcaatt aatatagata tggagactgg cttgccaaac tttggaaaaa ttgtatgctt 2580 cgtttcaagc gcaaataaca atcaagttga tgtgtacaaa ggagagaagt ggcattttgt 2640 tttagaaatt gtgaaaacaa acaattttac ttatcacctt cattcttatg aagtaatatt 2700 tctttgtcaa ccagagtatc agattttcaa gctttctaat tttgttgatt atcatccact 2760 ttattgtcat agtttgcttg cgactactga aaagaagaac ttcgttcgat tgccctatca 2820 cgttttttaa ttattaattt cttttgtaaa tagtataaac tttagttgct ttatatagat 2880 caatagcaat gttttttcta attattttat tatttagcat tttttataat ataattttta 2940 aagtttaatc taaaatgttt tctttataga aaatataaaa attttgtaga aaaaacgttt 3000 ttacaaaatg tagagaaaag gtttctacaa agttttacag ttgtttaaaa aaaagtgttt 3060 tttaacaact gtacaactta ttcataacct aggccagaca aagaagttat tatttttttt 3120 gtttaataca ggtcatcagt ttataattgt ttgtaaatgt ccctaatgat tgtatatata 3180 gtgttattgt taaattggaa atatagtaat ttgattgatt tgattatgtt tagttgatat 3240 atttttaaac catgagtcta atcaacacac tcgccactgt attggaatta tctgttctcc 3300 ctgaggatat tcaacatgct ttatttggta agrttttctc tcttttagta tgtattttat 3360 atgtatccca aatcactttc tcttaaaata cacagaccta aaataacgat ttacycaaat 3420 atatgtctga aactttttct ttcacatcaa tacacatata aagaattact tatttattat 3480 attgttttat tttagaatct ggttttacac taaggtcatt tgtattagcc acagaagagg 3540 accttgtcga attaggattt aggatggcag aaagaaggtt actgcaggaa tggattaggg 3600 ctcaaccggt tgttaatgtt ccagytgkta atgaatatgt taccccacag tgtttattga 3660 gtattggtga gtcagttatt gcttcatcca gaataaatag ctgcaccact actagtgtaa 3720 tcagtttcaa ggttagtttt caaacaatta acaaatgatt gctgaaatgg catacatgga 3780 gtaaataaaa gagacataat aaaataaaaa ataatcataa taataaaata atgttgaagt 3840 gtatattttg tatagtatgt aaatgctaaa atgaaataat atgaaattat gttagttgca 3900 tatttccatt aattttatca gaatctgcga gttgataata ttatatcaga gcagaggcag 3960 cagcatggac ctatttgtgg tcaacgattg acacaaaagt taaagtcagg agagcggatt 4020 gaaaaagcag aaatgttatt tctaacaaaa gtttgtgggc gttacctgat gaacaattgt 4080 gctcagtaag tataaataaa ataaaaaatt tgataggatt ttaattagtt aaaataattt 4140 ttatatttct atatttcaga aaagataggc caaatgttac tgagaagcag gacttatcaa 4200 agtcaattat tgactgtttt cctatcttac atgatggaac agcaagtggc tacgtgagta 4260 atatatttaa tcaattaagt tagttaattg attaaatttt aaactctgag tgttattatt 4320 tttttgttat ctttagggat acttctttga tagaaaatca tttagtggaa agttagaata 4380 ctttgttcga agccggcgtt cagattgtaa agatataact acattgtgga aacattcacc 4440 atcaaatagg taaatattag atctaaaatg tccttcattt aataaatgat gataatgatc 4500 atgatgatga taagcgatta ttataattgt tagccctttt tttaaatttc aataaatttt 4560 tttattcctt accttttatc agtagtcttg ctactagcag aataggtctc actactagcg 4620 atgatgttga aagtgctgtt cgattactac gcgaacaaag acctaatgcc caaaacaaag 4680 aagagctaaa taaattaatg gagattacaa gaaatcatcg gcgatcatgg atcaacacat 4740 cgttcccatc aatcacagaa attgtaaacc agtacccgcg gttactagac atgccagaat 4800 cagtatatwt acatttattt atttaaatat ataaaacata tagtttgctc ccaatactat 4860 ttttaatgaa atttgcagtg ctgtaaattt tgtaaaaatt tgaatttttt ttgtgactta 4920 aaggcttctt cagtctaaat ataaatatat atatatatat atattagaaa cacgattgta 4980 cgttgagtgg cagaaataca aacttaaaat tctggagtat gccaaattga agccctcatt 5040 gaatactcta atgttttcct taacgaacga gttagatgaa ggtgaattat gaacttttaa 5100 cagtccaatt aaaaattgta gtttattgtt attaaattat aaatcgacta ttcttagttt 5160 ttactatttt taacactatt taagtaatgt attattaaat aataattaac tttttgtgat 5220 ttttattttg caaaatttat ataactttta atatacatat taagttatat atctgtttaa 5280 tatatctgtt attatatata tattgtttaa gtatacatat taggttatat atctgtttaa 5340 caagttataa atctgtttaa tatcagttca aatttttttc ttttatattg ttagacaaga 5400 aaagtgaact tgcggttgaa tgtcttgttt ggcttctccc taccccacca caaaaatcag 5460 ttaaaagggc tcgtttgcta ccgaatgtgg ttatgcaaaa tcttttcttt gacttcaagg 5520 tattgttgtt ttttttttga aatttttgaa ctaatatttt tgtagaagtt tttatattta 5580 tttttttata tatttttatc attattttat tattgttatt gctactctat ctgatagtaa 5640 aatataatgt aaaaaaatgt ttatacatta tgcaggatga cacagttgac tgtgcactaa 5700 agacactgga ggatcgcggg gttaaacaac cgatgctgtt gcgcatatct gatggatatt 5760 atataaaggt tgacaacaca gttataacac tgccttgttg cagctgtttt atggaggcat 5820 tggagttttt atttgtcttt ttatgttttc gccgtggaat atccgaccga attaagattt 5880 ttttatggmt ttttggagaa agtcatggga cttaaatgta ctgtgaaaag ttcaaccgtt 5940 gctgatttgt ttacaaacat aacttttctt acaaatgtat aaatctcttt aaatcccaga 6000 tgggtttaaa tatctttatt cgtttttttt tttaatcttt aagatgtctt ttatttgtat 6060 attaatatta tttataaagg gattttttag ttagagataa ctattaaaat ttatttccaa 6120 tattttttaa tttactaatt ataattgttt cttttttcat ctgcatctat taattgcttc 6180 agtttacttg acagtaatta gatcttatga taaaatgtta actaataaaa tttacggtca 6240 cttttaaatt tagtaataaa aaaaattaaa ttttattaat tactatttaa actatagtaa 6300 agaaatttta atattcttca aaatatttaa tgaaacttct cgaatccact ggagttcttt 6360 gaatgaaatg aaaattgtta acaataatgc gtaaaaaaaa caataacgca taaaaaaaag 6420 atttttaaag aaatttacag taatcaactg ctgctccttg aacgaaatgc aaattattcg 6480 caataatgcg taaaaaaaaa aatataataa acwtacccac aagcaattta atagctctta 6540 taaaataaaa ttgattccac actattgtgg aaaaaaaaga agtccacatt tgtgtgtagg 6600 agtgaagcct acacactgat gtggaaggaa actccacact ttccgcgaga gtgta 6655 // ID CR1-107_AAe repbase; DNA; INV; 5105 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-107_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5105 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1195-1195 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 358..1182 FT /product="CR1-107_AAe_1p" FT /translation="MVKNCGKCAKAINGIEFVVCRGYCGATFHINECSGAT FT RAMLSYFTTHKKNLFWMCDKCADLFENSHFRAISCQADETSPLASLTSAIT FT ELRTEIKQLNSKPHPGSTPASSVQWPIINQRRGSKRPRMAEPVARASETCQ FT VGSKKPQENDPSIPICKKEDIQRFWLYLSKIRPDVTVEAVSAMVKANLNID FT ADPPIVKLVPKGRTIESLSFVSFKIGLDPSLQQIALDPETWPEGLLFREFE FT DYGAPKFRFPLTTTPSLLTPQAASSPATPVMDLS" FT CDS 1080..5015 FT /product="CR1-107_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RLRSPKISVPADDNAFTVNSSSCFITCHPRNGSELDS FT SSQHSSSSAPGCTPASLMGALNPLVAVEPLLPASSSHPGPAYECGEEVSQL FT APAGKYSHPKNASRPVICTTFDSPTHTASNPSSEWIEFSSNWRNISSSTNE FT LRTSSKGCNVSSSLSKFSSAQLQSPGCAPASLMEAPNPFVAVEPLLPASSS FT HPGPAYECGEGVFQPVLAGKYNHVKNTSPSVIRTTFGSLPHIGYGSNSSKC FT IVTSTIDSIRSLTSSQPSSAQLSSQSTLPSSSPGCMPASLMGDPNSFVAVE FT PLLPASCSHPGPAYEIEEGVSQPVLAGKYSSIENNYLPVISTDSSSQAQRI FT DRHAPSTPPSSDISPRSSIAVYYQNVRGMRTKISRLRLILSSCDYDVLVFT FT ETWLRDDIDSTEISPDYTFFRCDRSRVNSQHARGGGVLIAVKHHLRCEMFP FT LSDCDQLEQVAVRMKLDNRSLYIITIYIPPNSSADRYSAHANAVQGLVNSV FT SADDIVLSVGDFNLPNLRWQMDDELNGYIPSNASSEQEKSLIETLFATGLR FT QVNNFENINNRLLDLVFVSLPELLDLVQPPAPLLPIDNHHAPFILLLDVSV FT SIPATLDDDADRVEYDFNVCDFDQLAIALGNVDWSILEQTGTVDELSAAFY FT NKLLEICDALVPRKRRTINSPSSKPWWTSELRRLRNLLRKARQHFFLSKSE FT GDKIALREAEATYKSVLQTTYDDYISGVQASVKQDPTRFWGFVKKLKSSTR FT IPCNVSYDETLASSNEEAANLFAEFFESVFSKSSPVPRQDCFAHVPSHNIN FT FPIMQFTADEVLNVIYTLDCKKGPGTDGIPPLLLRKCAAELVTPVTLLFNR FT SLSERTFPSVWKTAFIVPIHKSGNINRVRNYRGISILCCLSKIFEKLMHNA FT LYTVAAPIISEYQHGFMRHRSTTTNLMCYVTSLSREIESGNQVDAVYVDFA FT KAFDTVPHILIIDKLKRIGYPDWLTEWIFSYLSARTAQVVVNSSRSRQINV FT TSGVPQGSVLGPLLFNIFVNDLCLCLSSFKLSFADDLKFYRIIRSSLDCFA FT LQEDINTLLIWCSDNGMRVNNDKCKVITFTRSNNPVTHQYHMEFAHLERVA FT SICDLGVTIDAKLQFHEHVGITTAKAFSVLGLIRRHASEFTDIYALKTLYC FT SLVRSILEYAAPVWSPYYVAPILSIERVQKKFVRFVLRSLPWNDPDNLPPY FT PDRCQLISLEPLTVRRVKMQRLFVFDILNGAIDCPALLEQISLQIPSRRLR FT NTPMIAIPYHRTNYGFNNPFDSCLRSFNVVSDQFDFDMCKNVFKTRISRIV FT " XX SQ Sequence 5105 BP; 1322 A; 1347 C; 1049 G; 1387 T; 0 other; tacccttaat tcgcactgat ctgtttacgt ttcataccgg caattacttg cgtttttaat 60 cgggtaaaat cgatcttttt tagtcacttt ttgtgtttct cttgtgatta tcgttttgtg 120 taccgctgtg ttatcgaggt tttcgagtgg tcagcgtgaa tattgcgaaa aatctaccgt 180 atagtgtttc attatttcaa caatagctac aacccgctgt actcttattc cactttcgtg 240 acaaccgtac atttgagcag cgtttccaca catacgcgcc atctacatct ggaaaaagta 300 aaccggctta ttggcgatct aaatcgattg acgatctaaa aacaacgagt tgtcaagatg 360 gttaaaaatt gtggaaaatg cgcgaaggcc ataaatggca ttgaatttgt agtttgccgt 420 ggttattgtg gagctacgtt ccacattaac gaatgctcgg gtgctacaag agcgatgtta 480 tcgtatttca cgacgcacaa gaagaatctt ttctggatgt gcgacaaatg tgccgacctg 540 tttgagaatt cccacttccg tgccatttcg tgccaagccg atgagacttc acctcttgca 600 tctctgacgt ctgcaataac tgagcttcgc acggaaataa aacaactgaa ttcaaagccc 660 caccccggtt cgactccagc ttcaagtgta caatggccta ttattaacca acgaagaggt 720 tcaaagcgcc cacgcatggc cgagcctgtg gcacgtgcat ctgaaacctg tcaagtaggc 780 agtaaaaaac ctcaagaaaa tgacccatcg atccctatct gcaagaaaga agatattcaa 840 cggttttggt tgtacctctc aaagattcgc cctgacgtca ctgtggaagc tgtgagtgcc 900 atggttaaag caaacctcaa tattgacgca gatccaccca tcgtcaagct tgtaccgaag 960 ggaagaacaa tcgaatctct ttccttcgtc tcgtttaaaa ttggtttgga tccttcctta 1020 caacaaattg cgcttgatcc tgagacatgg ccagagggac tactgtttcg agaatttgaa 1080 gactacggag ccccaaaatt tcggttcccg ctgacgacaa cgccttcact gttaactcct 1140 caagctgctt catcacctgc cacccccgta atggatctga gctagattca tcaagtcaac 1200 actcgtcatc ttcagctccg ggatgcacgc ctgcaagcct tatgggagcc ctcaatcccc 1260 tcgtcgcagt cgagccactc ctgccagcgt ccagcagtca tcccggccct gcgtatgagt 1320 gtggagaaga ggtctcccaa ctcgctccag caggcaagta ctcccatcct aaaaacgctt 1380 ctcgtccagt aatttgcacc acttttgaca gccctacgca cactgcctcc aatccgtctt 1440 ctgaatggat cgaattttcc tcgaattggc gaaatatttc ttcatcaacg aacgaactca 1500 gaacttcttc gaaaggttgc aacgtttcat catcattgag caaattttct tcagcacaac 1560 ttcaatcccc gggatgcgcg cctgcaagcc tcatggaagc tcccaatccc ttcgtcgcag 1620 tcgagccact cctgccagcg tccagcagcc atcccggtcc tgcgtacgaa tgtggagagg 1680 gagtcttcca acccgttctc gcaggcaagt acaatcacgt taaaaacact tctccgtcag 1740 taattcgcac cacctttggc agtctacctc acattggtta cggcagtaac tcatcaaagt 1800 gtatcgtaac ctcaacaatc gacagcatca gatctctgac atcaagccaa ccatcatcgg 1860 cccaactttc atctcaatca acgcttccat cgtcgtcccc gggatgcatg cctgcaagcc 1920 ttatgggaga ccccaattcc ttcgtcgcag tcgagccact cctgccagcg tcctgcagtc 1980 atcccggtcc tgcgtacgaa atcgaagagg gggtctccca acccgtcctc gcaggcaagt 2040 attctagcat cgagaacaac tacctcccag taatttctac tgattccagc agccaagctc 2100 aacgcattga tcgtcacgct ccaagcacac caccatcgag cgacatttca ccgagatcca 2160 gcatagcagt atactaccaa aacgtacgag gtatgcgcac caagatttct cgactccgtt 2220 tgatactgtc tagttgtgat tatgacgtgc ttgttttcac tgaaacctgg ctccgtgatg 2280 acatagatag tacagaaatt tctcccgact acacgttctt ccgttgcgac cgcagtagag 2340 taaacagtca gcatgcacgt ggtggtggcg tgctgattgc cgttaagcat catcttcgtt 2400 gtgaaatgtt cccactgtcg gattgcgatc aattagagca ggttgcggtt cggatgaagt 2460 tggacaatcg ctcactctac attattacta tctatattcc tcctaactcg tccgccgacc 2520 gatattctgc ccacgcaaat gcagtacagg gcctcgtaaa cagtgtttca gcagatgata 2580 ttgttctctc cgtgggggac ttcaacctcc caaatttgcg atggcagatg gacgacgagc 2640 tgaacggcta cattccctcg aatgcttcat cggaacaaga gaaatctctc atcgaaaccc 2700 ttttcgctac aggcctgcga caagtgaaca atttcgagaa catcaacaat cgactccttg 2760 atcttgtttt cgtaagtctg ccagagctac tggatctagt tcaacctccc gccccactat 2820 tacccattga caaccaccat gcgccattca ttttacttct cgacgtgagt gttagtattc 2880 cagccacact cgatgacgat gctgacagag tggagtatga cttcaacgtg tgtgatttcg 2940 atcagctggc tatagcttta ggaaatgttg actggagtat tctcgagcaa accggtacgg 3000 tcgacgaatt gtctgcagcg ttttacaaca agcttctcga aatttgtgac gctttggtcc 3060 cgcgcaaacg acgtacgatt aactcacctt ccagcaaacc gtggtggact tccgagctgc 3120 gtcgtctccg taatttactg aggaaagcac gacaacactt cttcctctcg aagtctgaag 3180 gcgacaaaat cgccctacgt gaagccgagg caacatacaa gtctgttttg cagactacct 3240 acgacgacta catatccgga gtgcaggcca gcgtcaagca agatccaacg cgcttctggg 3300 gctttgtaaa gaaactgaag tcatccaccc gcatcccttg caatgtaagc tatgacgaaa 3360 ctcttgccag ttctaacgaa gaagccgcta atctctttgc tgaatttttc gaaagcgtgt 3420 tcagtaaatc gtctcctgta ccccgtcagg actgctttgc gcatgttcct tcgcacaaca 3480 ttaacttccc catcatgcag ttcaccgccg atgaggtgct gaatgtcatt tacactctgg 3540 actgtaagaa aggacccggg acggacggca ttccaccact gctcctgaga aaatgtgctg 3600 cagaacttgt cactcctgtc acgcttttat ttaatcgatc tctgagcgaa agaacatttc 3660 cttccgtgtg gaagacagct ttcatcgttc caatccacaa gtcagggaac atcaaccgag 3720 tcagaaacta ccgtggtatt tcaatactgt gttgcttgag caaaattttc gaaaaactga 3780 tgcacaatgc cctctacacc gtagctgcac cgataatatc cgagtatcaa cacggattca 3840 tgagacatcg ctcaacaaca acaaacctga tgtgctacgt tacctcgctg tcacgggaga 3900 tagagtcggg aaatcaagtt gacgcagtgt atgttgactt cgcaaaagcg ttcgatacgg 3960 ttccgcatat tttaatcatc gataaactca agcgaattgg ctacccggat tggctcacgg 4020 agtggatttt ttcgtaccta tccgctcgta ctgcgcaagt agtggtcaac tcatcaaggt 4080 cccgtcaaat caacgtaacg tctggagtgc cccagggtag cgtattgggt ccactgttat 4140 tcaatatctt tgtgaacgac ttgtgtttgt gcctttcatc tttcaaactc tcgttcgccg 4200 atgatctcaa attctaccgt atcatacgtt catcacttga ctgttttgcg ctgcaagagg 4260 acatcaacac tttgctgatc tggtgcagcg acaacggtat gcgtgtcaat aatgataaat 4320 gcaaagttat aactttcact cgctccaaca accccgttac ccaccagtac cacatggaat 4380 ttgcacatct agaacgtgta gcttcgatct gcgacctggg agttaccatc gacgctaagt 4440 tacaatttca tgaacacgtt ggaatcacga ctgctaaagc tttttcagtc ctgggactaa 4500 tccggcgcca cgcttcagaa ttcactgata tctatgcttt gaaaacactc tactgcagtt 4560 tagtgcggag cattcttgaa tacgcggccc ctgtctggtc accgtattat gttgcaccga 4620 tactctccat tgaacgcgtt caaaagaagt ttgttcgttt cgtacttcgc tcccttcctt 4680 ggaatgatcc ggacaatctt cctccgtatc cggatcgctg ccaactgata agtttagaac 4740 cgctgactgt caggcgtgta aaaatgcaac gtctatttgt atttgacatt cttaacggtg 4800 ctatagactg tcctgcgctt ctcgagcaaa tatctctgca aatcccttct cgacgactgc 4860 gaaacactcc catgattgcg attccttacc atcgcactaa ttatggtttc aataatccgt 4920 tcgactcttg tcttcgttcc ttcaatgttg tttccgatca atttgacttc gatatgtgca 4980 aaaatgtgtt taaaactcga ataagtagaa ttgtttagat gttaagtatg cgtaaattag 5040 tgctaagtgg ttcagtctgt acgattttaa ttcgaagacg gtgattataa ataaataaat 5100 aaata 5105 // ID BEL-109_AA-LTR repbase; DNA; INV; 305 BP. XX AC supercont1.257; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-109_AA_; KW BEL-109_AA-I; BEL-109_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.257; Positions 450295 450599. XX SQ Sequence 305 BP; 86 A; 79 C; 54 G; 86 T; 0 other; tgttcgagcg agaataggga gccaccctat cagcttcgat ccaaagtgat ctaaacgtag 60 tcgccgatat gaaggtgagc tctgatacca aaaacggcag aatctctctc tctctccctg 120 attctctatt ttatttgaat acatgtccaa ccttataaaa gcctatacaa agaactaaga 180 ataaccagta tcaattgaat tagcaacaag cgcgttttat tctaatcgtt ctcggaaagt 240 ttcccgttcg cctcctgttg agcatccgcc tcgttaaggt cttccttttg gaccccgcac 300 caaca 305 // ID BEL-84_AA-I repbase; DNA; INV; 5725 BP. XX AC supercont1.284; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-84_AA_; KW BEL-84_AA-LTR; BEL-84_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5725 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.284; Positions 100463 106187. XX CC Positions [4688-5269] - Integrase core CC 'TAATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 552..1805 FT /product="BEL-84_AA-I_3p" FT /translation="MAEERSRQQLINRRTTLIAALGRAEQFVEKYVAERDR FT GQVKLRLENLDTVWMGLEEVQTQLEDMELTNQGMAQNLTFRSHNETRYFQI FT KAALQSFLPNVPPTTNAIPQSSTSGLSGIKLPTISLPEFDGDYNQWLPFHD FT TFVALIHSNPEVHDIQKFHYLRAALKGEAAQLIESIGISSVNYAIAWQTLV FT SRYANDYLLKKRHLQALLDCPRMKKESATALHSVVDEYERHTKTLRQLGEP FT IDSWSTMLEHLLCLRLDDNTQKAWEDFATTATNPDYSCLIDFLQRRIRVLE FT SMSVNHQAQPKSNQQPTMFRRPPFHKTVSHAVTETTSRKCYSCDQHHPLFQ FT CPRFEKMSIADRLKVVNEHHLCLNCFRQDHLARNCQSKFSCRHCKKTTPFI FT ATPRLPPTKCRSTIHHLAHSSIHT" FT CDS 1630..2676 FT /product="BEL-84_AA-I_1p" FT /translation="MNITSASIAFVKTTSLVTANRNSPVVIVKKRHHSLLH FT PGYHPLNADQPSTTSHTAAYTPKPIVKQDAIINSTKRSPTSMNAVTAQTTH FT SSQTSNANVLLSTVVLMVLDCNGQKHPARALLDNGSQSNIISDRLCQLLRL FT KRRKINIPVFGVGESSSSVNHSVSATIQSRNSDFELGLDFLVMPRITIDLP FT VVSSPAEGWHVPNDLPLADPTFNKTGAIDMLLGAEHFFTYVNPGTRIEEGV FT NPALIQSVFGWIVTGRDQKPLRSPPVACHVSLSDPLHEALERFWRIEEVEN FT QPNYSVEEQQCETHYSSHVSRTPEGRYIVRLPRRENFDHMLGESKSAALRR FT FHLLEK" FT CDS 3026..5620 FT /product="BEL-84_AA-I_2p" FT /translation="MPHCNESSGDFIRTTQLKPMSWQQLTTGWLHPHSWPL FT EGTSNNPKVLDGLDPSQIGTQPTLKLDNNESTKALGVSWEPGTDKLRFDSA FT PELGNGPPTKRSILSAVSKHFDPLGLTAPIVIKAKMLLQELWLQPCGWDEE FT ISDSMRAKWDSYRTELPKIVSYRTDRFAFLPNASVQLHTFADASQQAYGAC FT IYARSTDSHGKTKIQLIAAKSKVAPLKRISLPRLELSAAVIAARLHQKVVI FT ALDMKISASFFWSDSTVTLEWLRSPPYTWNTFVANRVSEIQNTTQGSSWHH FT VAGKHNPADLITRGMKMDDFLNSDLWHCGPLWLKYLETAWPVTMGQLSAPE FT DVLERRKMVAAIQNKPVANPIFNTSSSYNRLIRVTAYCLRFVNACRRKVSM FT SSQPELDSIPKSITVGEMSIARMILIKLAQTDAFSEEIRYLGKGKPVSKHS FT SLRLLSPFVDPDGILRVGGRLRLSEQPYLTKHPILLPSSHPLTRLVAKHYH FT LRLFHGGGRVTLAAIRQEYWPIQGRRVVNTVMRNCYRCARASPVPAKQQTG FT QLPMQRVTPSRPFSITGIDYAGPVYLRAIHKRASPTKAYISVFVCFVTKAV FT HLELVSDLSTPAFITALRRFIARRGCPAHMHSDNGKNFEGARNELRELYQL FT LQEEESSGNVATFCANQGIQWHMTPPKAPHFGGLWEAAVKVAKRHLHRQFG FT DSRLSFEEMSTALAEIESAMNSRPLTPLTEDPNDLAVLTPAHFLIGTTPSA FT IPEDDCSQIPINRLDHYRGLRHRVQQFWHQWQAEYLHELQKESKHIIPNTE FT IQPGRMVVVVDEFLAPVKWPLARIVIPGPDGLIRVVDLRTSRGIIRRPITK FT ICLLPFEDTNDA" XX SQ Sequence 5725 BP; 1536 A; 1643 C; 1196 G; 1350 T; 0 other; acttggtgcc gtgaccagga tcaggtttct tcccagaatt cccgccatca tactacggac 60 gccgacgcta tctaccgcca tcgttccgca ccattgtgga ccccgacggc attcataaga 120 tcgccatctt ccccggatcg tattgtgccc acccgccatc gagaagcttc tccggagttc 180 ccgattgaat tctttaatac aaggcctaat tcaattaggc actcggtacg tataattccg 240 aagtgcgcag aagtttaccg cgggtgcctt gagatttttt cctgtttttc tcctattcgt 300 tccggagttt tcgccgaacc ttctggaacc gttcagctgt cgctgctagc aagtaagcta 360 cgccattgcc atccgtatac cgattttcat ccgaggacct actacgccgg attcgcaagc 420 agattcgaat aaaatacaag gcttaaatct agccttgaga aggttagtac cgttcctaca 480 tccctactcc gcttgtccgg ctctaccggg tctttctctg tatctcctat ccagtgcaac 540 ccgactccgc aatggcggaa gaacgttcga gacaacagct tatcaaccgc cgtaccactc 600 tcatcgctgc gctgggacga gcagaacagt ttgtcgagaa atacgtggcg gaacgggacc 660 gaggccaggt gaaactacgt ctcgagaacc tcgataccgt gtggatggga ctagaagaag 720 ttcaaaccca actcgaggac atggaattaa ccaatcaagg tatggcacaa aacctcactt 780 tccgttccca taacgaaacc cgctatttcc aaatcaaagc tgctctgcaa tcattcctcc 840 ccaatgtacc tcctacaacg aacgcaatcc cccaatcgtc tacttccgga ctgtccggta 900 tcaaattgcc aaccatatcc ctaccggagt ttgacggcga ttataatcag tggctaccat 960 tccatgatac attcgtcgcc ctgattcatt ccaatcccga agttcacgac attcaaaaat 1020 tccactatct ccgcgctgct cttaaaggag aagcggccca actcatagaa tcgatcggta 1080 ttagctccgt caactacgcc attgcatggc aaaccctcgt ctctcgctac gctaacgact 1140 accttcttaa gaaacgccac ttgcaagccc tcctcgattg cccgcgaatg aagaaagagt 1200 ccgccactgc acttcattca gtcgtcgacg aatatgagcg tcacactaaa accctccgtc 1260 aacttggtga accaatcgat tcgtggagca ctatgttgga gcatctccta tgcctccgac 1320 tcgacgacaa cactcaaaag gcttgggaag acttcgctac gacagccaca aaccccgact 1380 acagttgtct cattgatttt cttcagcgac gcatccgcgt acttgaatcc atgtcggtga 1440 atcaccaagc gcagccaaaa tccaatcaac aaccaactat gtttcgccga ccaccgtttc 1500 ataaaaccgt ttcccatgct gtaaccgaaa ccacttctcg aaaatgttac tcttgcgacc 1560 aacaccatcc actcttccaa tgtccacgat tcgagaaaat gtccatcgct gatcgcctca 1620 aagttgtcaa tgaacatcac ctctgcctca attgctttcg tcaagaccac ctcgctcgta 1680 actgccaatc gaaattctcc tgtcgtcatt gtaaaaaaac gacaccattc attgctacac 1740 cccggttacc acccactaaa tgccgatcaa ccatccacca cctcgcacac agcagcatac 1800 acacctaagc ccatcgtcaa gcaagatgct ataataaata gcaccaaacg atcacccaca 1860 tccatgaatg ctgtcactgc ccagacaacc cattcctcac aaacatcaaa cgcgaatgtt 1920 ttgctttcga ccgttgtgtt aatggtgctc gattgcaatg gacaaaaaca ccccgcacga 1980 gcgttattgg ataatggatc gcaatccaat atcatcagcg atcgtctatg ccaacttcta 2040 cgactaaagc ggagaaaaat caacatccct gtgttcggtg tcggtgagtc ctcctcgagc 2100 gtgaatcact ccgtcagtgc cactatccaa tcgagaaatt ctgacttcga actcggactc 2160 gacttcctcg ttatgccccg cattacgatt gacttgcctg ttgtgtcatc tcctgccgaa 2220 ggctggcatg tcccaaatga tctccctttg gctgacccta cctttaacaa gaccggcgct 2280 atcgatatgc tcctcggagc agagcatttt ttcacctatg tcaaccctgg tacccgtatc 2340 gaagaaggtg taaatcccgc cctcattcaa agcgtgtttg gttggattgt caccggtagg 2400 gatcagaaac ccctcagatc gccccccgta gcatgtcacg tctcactttc cgatccactc 2460 catgaagcac tagagcgctt ttggcgcatc gaagaagtcg aaaatcagcc aaattactcc 2520 gtagaagaac aacaatgcga aacccattac tcgtcccatg tttcccgcac tccagaggga 2580 agatatatcg tccgtctgcc tcgccgggag aatttcgatc atatgctcgg cgaatcgaaa 2640 tccgcagctc ttcgacgttt tcatttactc gaaaaatgac tgggcagaga accacatctg 2700 aaggatgaat atcactcttt cttgtcggag tacctgtttc tcggtcacat gcgacttgtc 2760 ccaccaagcg aaatcgaacc acatcaggtg cactatttac cccaccatgc cgtcctaaaa 2820 gaggccagta ctacgacaaa agtccgtgta gtgttcgatg gatcggaaaa aacatccacc 2880 ggatattccc tcaacgatgc cctccaagta ggtccaattg tgcaggacga actattgacc 2940 ctcattatca gattccgaaa atatccgata gcactggtcg cggacatagc aaaaatgtac 3000 cggcaggtgt tactccatcc tgacgatgcc ccattgcaac gaatcctctg gagatttcat 3060 ccgaacgacc caattgaaac ctatgagttg gcaacagtta actacgggtt ggctccatcc 3120 tcattcttgg ccactcgaag gaacatcgaa caatcccaaa gttttagatg gactcgaccc 3180 atcacaaatt ggtacccagc caaccctgaa attggacaac aacgaatcca cgaaggcact 3240 tggagtgagt tgggagccag gaaccgataa actgcgcttt gattccgctc ccgagcttgg 3300 aaacggcccg cccacaaaga ggtcaatttt atcagcagta tcgaagcatt tcgatccgct 3360 gggactcaca gctccgatag tcataaaagc caaaatgcta ctgcaggagc tctggctgca 3420 accctgcgga tgggacgagg agatttctga cagcatgcga gcaaaatggg acagctaccg 3480 caccgaatta ccgaaaatcg tctcctaccg aactgaccgt tttgccttct taccaaatgc 3540 ttccgtccaa ttgcacacat ttgcggatgc atcccagcaa gcctacggcg cttgcatcta 3600 tgcgcgatcc accgactccc atggcaaaac taagattcag ctgattgctg ccaaatctaa 3660 ggtggctccc ctaaaaagaa tttctctccc ccgattggaa ttgtccgctg ccgtcatcgc 3720 tgcacggctg catcaaaaag tggtcatagc actcgatatg aaaatctccg catccttctt 3780 ttggtccgat tcgacagtca ccctggaatg gctacgctca cctccgtaca cctggaacac 3840 ctttgttgcc aacagggtct ccgaaattca gaacaccaca cagggatcta gttggcatca 3900 cgtcgctggc aaacacaatc ctgctgatct tatcaccaga ggaatgaaaa tggacgattt 3960 ccttaacagc gatctgtggc attgtggacc attgtggctg aaatatcttg aaacagcatg 4020 gccagttaca atgggacagt tgagtgcacc tgaagacgtt ctggaacggc gaaagatggt 4080 ggcggcgatc caaaacaaac ccgtagcgaa tccaatcttc aacacgtcat cctcctacaa 4140 tcgcctaatt agagtgacag catactgtct tcgctttgtc aatgcttgcc gacgcaaagt 4200 gtcaatgtcg tctcaacccg aattggattc aataccaaaa tcgatcactg tgggagaaat 4260 gtcgatagcc cgaatgatac taatcaaact agcccaaact gatgcgtttt cagaagagat 4320 aaggtacttg gggaagggga agccggtatc caaacattca agcttgcggc tattgagtcc 4380 gttcgtcgat ccggatggaa tcctcagagt cggaggtaga ttgcgtctat ctgagcagcc 4440 ctatcttacc aaacacccca tcctccttcc tagttcccat ccacttacac ggttagttgc 4500 caaacattac cacctaagat tgttccacgg tggcggccgt gttacactag cagccatccg 4560 ccaagaatat tggcccatac aaggacgacg tgtagttaat accgtcatga gaaactgcta 4620 ccgctgtgct cgtgcatcgc ccgttcccgc caagcagcag actggacagc ttccaatgca 4680 gagagtcacc ccgagtcgtc cgttctccat cactggcata gactacgccg gaccagtcta 4740 cttaagagcg atccacaagc gagcatcacc gacgaaagct tacatcagcg tctttgtctg 4800 cttcgtcacc aaagcagtac accttgagct tgtcagtgat ttatccaccc cagcattcat 4860 caccgcactc cgaagattca tcgctcgtag aggttgtcct gcgcatatgc attccgacaa 4920 cggaaaaaac ttcgagggtg ctcgaaatga actgcgagag ttgtatcaac ttctacagga 4980 ggaagaatcg tccgggaatg tcgccacctt ctgcgcaaac cagggcatcc aatggcacat 5040 gactccaccg aaagccccac acttcggtgg attatgggag gcggcagtga aggttgccaa 5100 aagacacttg caccggcagt ttggcgattc aagactttcc ttcgaagaaa tgtcaacagc 5160 actagcagag atcgagtctg cgatgaattc acgtcctttg acaccgctca ccgaagatcc 5220 aaatgatttg gccgtcctca ccccggcaca ctttctcatt gggacaacgc cgtctgctat 5280 ccccgaagat gactgcagcc aaattcccat aaatcgcttg gatcattacc gaggtcttcg 5340 tcatcgtgtc caacaattct ggcaccaatg gcaagcggaa tatttacacg aactgcaaaa 5400 ggagagcaaa cacatcatcc cgaacacaga gatacaaccc ggaagaatgg tcgtcgtcgt 5460 cgatgagttt ctcgctcctg tgaagtggcc gttagcgaga atcgtcatcc ctggaccaga 5520 tggactcatc cgagttgtgg acctgcgaac cagcagaggg attatccgtc gccctatcac 5580 caagatttgc ttgctgccct ttgaagatac gaatgatgcc taaaattctg atgtcaacaa 5640 aattgtatca ttgaattcct tgaatgaatt ggtttgttat gataggagtc aggtagtgaa 5700 attcaggaat ttcaggtggc ggcga 5725 // ID Kiri-17_AAe repbase; DNA; INV; 4545 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-17_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4545 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 712-712 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 330..1115 FT /product="Kiri-17_AAe_1p" FT /translation="MSKHGKRKPNNNNISKRTWENMNKSDSQIDSYDDLVS FT RMQCMFDAANAKIEDKIDTCVEELKSEISDLRDEVQRLKDECGRDIDNLAE FT SVSQLKKDVNFNRGCIERAGRTNELLITGVPYTPDECLSVFVQKLSTVLGY FT QDVPLIQPKRLARSPIAAGSTPPILMQFAFNISRNEFYSRYLATRNLSLLH FT LGFNVNTRIYINENLSEDARRIKGAAIKLKKTGKLHNVFTRDGIVFIKLHE FT GAAALPVQTIDQLNATVNPSS" FT CDS 1551..4397 FT /product="Kiri-17_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MANAMGTNARMNEGIPRAVMNAALSNNNMNICHINIQ FT SLCARQMSKFSEFKDCFENSKIDIICLTESWLSDSVPDDIISIEGYKIVRN FT DRMYSRGGGICVYFKSYLNCQIIARSELLVGVGNVNRTEYMLIEVNLSSKK FT LLVGIFYCPPRIDCSEVIFNLLCDHSLQYDEILLVGDFNTDFLRDNSRSNR FT LRSTFDYFGLSCVNSEPTHFFAGGCSMIDLLLTVNQDFITKFHQIAAPGFS FT RHDMILASLNVSRSLGERHSWFRDYKSINYTALCEAVDCIDWSLLYSITNS FT DLALEFFNSHLRNLFERFVPLRYSQIKNNSQWFNNNIRMAIVERNIAYRVW FT KLSRTVEDHAQFKRLRNRVTSLIKLAKSDYVSQSVGPTDSQKTIWNKLRNL FT KVIADRDDDDCFNSSCDEINEHFCTNFTNDSSTIPTFPSNEEGFTFLLTNE FT LETGIAINSIKSDAIGLDNIPLKFVKLILPFVIQQITYIFNLIIRSGNFPI FT AWKETKIIPIKKKKRNNDLNNLRPISILCGLSKVFEKILKNQIQNFLHNYD FT LLSPFQSGFRAEHSTTSALLKVHDDIHKTVDKKGLAILLLIDFSKAFDRVS FT HCKLLKKLSCEFNFSTQALKLVKSYLCNRKQSVYVNDSRSQPSVILSGVPQ FT GSVLGPLLFSMFINDLPRILKFCKIHMFADDVQLYICSNNLDVHDMANLVN FT SDLSRLLKWSEKNLLLINGSKTKAMLITRTARPALLPSIHLGSELIGFSDR FT LKNLGVVFQNNLEWDDHINAQCGKIYAGLRRLRFTASMLPCNVKIMLFKSL FT LLPHFMYGLELTLNASARCLDRLRIALNCCVRWVFNLNRFSHVRHLQSRLL FT GCSFYQFFKLRSCLMFYKIINHTYPHYLKEKLQPFRSDRNRNYLIPQHNTS FT HYSGTFFVRGVVYWNQLPTTIKNINSFNVFRQNCTEWLNGGNQHM" XX SQ Sequence 4545 BP; 1359 A; 832 C; 860 G; 1494 T; 0 other; gagtttctga agggatgagc gtgagttgtg agtgaacttg tgacgggcag ttctaaacgg 60 aaaattctac atgaaccgtt gatatatctg cattgacttg cctgttgccg tgcgatttaa 120 tcggtgtttg ttcgttagtg actggagtac gattattaca tgatgtaccc gatcttttag 180 tgcaactaaa aagttgatat tgtggtccag cctcacagct cttcatatcc gccatcccca 240 tttcatatct gtcattggtg aaccaaatac actcacctat atccaccaca caatagtcac 300 tcagcaaatt tactgagctc aaccgaagca tgagtaaaca cggcaagcgc aaacctaaca 360 acaacaacat tagcaaacgt acgtgggaga atatgaataa atcggacagc caaattgact 420 cctacgatga tctggtctct cgaatgcagt gtatgtttga cgccgctaat gctaaaattg 480 aggataaaat cgatacctgt gtagaagaat taaaatcgga aatttccgat ctacgcgatg 540 aagtgcaacg cctgaaagat gagtgtggcc gggacatcga caacctagct gagtccgttt 600 cgcaactaaa aaaagatgtt aacttcaaca gaggatgcat tgaacgagct ggaagaacta 660 acgagctctt aataacgggc gttccgtaca cgccggatga atgtctctct gtttttgtgc 720 agaagttatc gaccgtgctt ggatatcagg atgttccatt aattcaaccc aaacggttgg 780 cccgatcacc aatagctgct gggtcgacac ctcctatttt gatgcagttc gccttcaaca 840 tttctcggaa cgagttctat tcacgttatc tggctaccag gaatctgtcg ctcttgcatc 900 tcgggttcaa cgtcaacacc aggatctata ttaacgaaaa tctctcagaa gacgcacgta 960 ggatcaaagg agcagctata aaacttaaaa agactgggaa gttacataac gttttcacca 1020 gagatggaat agttttcatc aagctgcatg agggagctgc cgccttgcct gttcaaacga 1080 tcgatcaact taatgcaacg gtaaaccctt cctcttaaga tcttctcttc catccaacgt 1140 tgatccatgt ttcccatcat catgtatcca tgattccagc cttcctgaaa gtttgacctc 1200 atccaatgaa gcttttcttt tttcccttga ttgcattcct gtgaatccga tccttgcatc 1260 ccatgtatta ttccgtccta aaagtctatt catcatacgg tttcataatc gtttgctgct 1320 gctgctgttg ttgcctgctg gatctgttgc tgctgttatt gttgccttgc tgattgatag 1380 aattgttgaa ttttccaaaa cgaactgtgc tttattgaat ttgaagaatt tgtataaaat 1440 tgtttagtag ttattaatta tgattcattt acatttattg ctgattgctg attcttttct 1500 tgggggacct tcagtatttt tctgtcgtat ttcgttcttt ttacgcgcta atggctaatg 1560 ccatgggaac gaacgctcgt atgaatgaag gtataccgag ggctgtaatg aatgctgcat 1620 tgtctaacaa taatatgaac atatgtcaca taaacataca aagtctatgc gcacgtcaaa 1680 tgagtaaatt tagtgagttc aaagattgtt tcgagaatag taaaattgac ataatttgct 1740 taaccgaatc gtggctgtct gatagtgtac ctgacgatat aatttcgatt gaaggctata 1800 aaattgttcg taatgataga atgtacagca gaggaggagg tatttgtgtg tattttaaaa 1860 gttatttaaa ttgtcaaatt attgctcgct ctgaactttt ggttggtgtt ggtaatgtaa 1920 atcgtacaga gtatatgctc attgaagtga atttgagttc gaaaaaactg ctcgttggta 1980 tattttactg tccacctcgg atagattgtt cagaagtcat tttcaatcta ttatgtgatc 2040 attccttgca gtatgacgaa attttacttg taggagattt caatactgat tttttgagag 2100 acaattcaag atctaatagg ctgcgtagta cttttgacta ttttgggctt tcctgtgtaa 2160 actctgaacc tactcacttt tttgccggtg gatgtagcat gatcgatttg ttgttaacgg 2220 tcaatcaaga cttcataaca aagtttcacc aaattgccgc gcctggtttc tctcgacatg 2280 atatgatctt ggcgtcgtta aatgtctctc gctctttggg agaacgtcat tcatggttca 2340 gagactataa aagtataaat tacactgccc tttgtgaagc agttgactgt attgattggt 2400 ccctgctata ctcaatcact aactcagatt tggctctaga gttctttaat tctcatttgc 2460 gtaatctttt tgaacgcttt gtacctctcc gatatagtca aattaaaaat aattcacaat 2520 ggttcaacaa caacatacgg atggcaatag tggaaagaaa tattgcgtat cgtgtttgga 2580 agttatccag aactgtagaa gaccatgctc aattcaaacg tcttcgtaat cgagtgacta 2640 gtttgattaa attggcaaaa tctgattatg tttcccagtc tgttggtcca acggattctc 2700 agaaaacgat ttggaataaa cttagaaact tgaaagttat agccgaccgt gatgatgacg 2760 attgttttaa tagctcgtgt gatgaaatca atgaacactt ttgtactaat ttcactaatg 2820 attcttctac aataccaacg tttcctagta atgaagaagg tttcacattt ttattgacaa 2880 acgaacttga aacgggaatt gcgattaact ctatcaaatc tgatgctatt ggtttggaca 2940 acataccctt gaaatttgtg aaactcattc tgcctttcgt tattcaacaa ataacctata 3000 ttttcaattt aattatacgt tcaggaaatt ttcctattgc gtggaaggaa accaaaatca 3060 taccaattaa aaagaagaaa cggaacaatg atctaaataa cttgagacca attagtattc 3120 tatgcgggct ttccaaagtt tttgagaaaa ttttaaagaa ccaaatacaa aattttcttc 3180 ataattacga tcttttaagc ccctttcaat ccggttttag agctgaacat agtacgacat 3240 ctgcgttact aaaggttcac gacgacattc ataagactgt ggacaaaaaa ggtttggcga 3300 tattactttt aattgatttt tctaaggctt tcgatcgtgt atctcattgc aaattattaa 3360 agaaattatc gtgtgaattt aacttttcca ctcaggctct gaagttggtc aaatcgtact 3420 tatgtaaccg gaagcaaagt gtttatgtaa atgatagtcg ttcacaacct tcagttattc 3480 tatctggagt tccacaaggc tcagtcttgg gtccgttact atttagtatg tttatcaatg 3540 atttgccaag aattttgaag ttctgcaaga ttcatatgtt tgcagatgat gtgcaactct 3600 atatttgctc caataatctt gatgtacatg atatggctaa tttggtaaat agcgaccttt 3660 ctaggctttt aaagtggtca gagaaaaatt tacttctcat aaatggttct aaaacaaaag 3720 ctatgttgat aaccagaact gcgcgaccag ctcttttacc cagcattcac ttgggttctg 3780 aacttattgg attttcggat cgattgaaaa atttaggagt tgtctttcaa aataaccttg 3840 aatgggatga tcacataaat gcccaatgtg gtaagattta tgctggtttg agaagattaa 3900 ggttcacagc aagtatgcta ccatgtaatg tgaaaatcat gttatttaaa tcgttattac 3960 ttccccattt tatgtatggt ttggagctta ctcttaatgc ttcagctaga tgtttagaca 4020 ggctgcgaat agcattaaat tgttgtgtac gctgggtttt caatctcaat agattttcgc 4080 atgtcagaca ccttcaatcg aggttactcg gatgttcatt ctatcagttt tttaaattac 4140 gttcgtgttt aatgttctac aaaatcatta atcatactta tcctcattat ttaaaagaaa 4200 agttgcagcc tttcagaagc gatagaaaca gaaactattt gattcctcaa cataatacat 4260 cgcattattc gggtaccttc ttcgtacgcg gcgtcgtata ctggaatcaa cttccgacta 4320 cgatcaaaaa tataaattcg ttcaatgtct ttcgtcagaa ctgtactgaa tggttgaatg 4380 gggggaatca gcatatgtaa gttaacaaca gaaacttagt taaattagtt aaaaaagatg 4440 aattgcttac ttccttaagt gcctagcgaa tgtagaaatt taaaagtatt agcttactct 4500 acatgttttt ggaaataaat aaataaataa ataaataaat aaata 4545 // ID SAT-1_CQ repbase; DNA; INV; 985 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE Satellite-type sequence from the southern house mosquito: DE consensus. XX KW SAT; Satellite; Simple Repeat; nonautonomous; SAT-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-985 RA Kojima K.K. and Jurka J.; RT "Tandemly repeated DNA from the southern house mosquito."; RL Repbase Reports 11(1), 630-630 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. Highly repetitive in tandem. XX SQ Sequence 985 BP; 246 A; 190 C; 235 G; 314 T; 0 other; attcgaagca cgatgaaaaa tgctttcgaa tgaagtataa tagttgtggg ttttttagcc 60 ggatgggcga accagcatgt gataatgcat tggtatcgtc aaaaatagca gtttagcgca 120 gatattaacg actaagctta ataattcgtc gagatatgta tctcagtgga ttgggttttg 180 aaagagcttt ccaaaaatgt gcaacatgcc gacatttcgg ttaattttgg cctcgtttta 240 gtcgatttga aggtggctga atgcaaacag ttttcttaaa gtggctgtaa ctttggcaaa 300 ttccaatgga tttttgtccc gtttcagccc gggaaggagg acattccaga ctacctagtt 360 atgcaaaaat gaagcaattt tgaaaaattt tcatcgtttt ttgtttttcg gttttaggtt 420 tgcaaatcgt ttttcttaaa tatcttttga caggaaaggg ctacggggaa gattttcaat 480 agcgacttat cggaagctaa gagggatcga atgcaaccgg aagtgtccaa atccgttcag 540 ccagttccga gaaatcgcag tgacattttt tggccaattt tcaagtgatt ttatatggca 600 ttcctcggag aggaagccaa aactcgggtt cccccaaaat aaaatagtct gccgtaaacc 660 aattgcgacc agttggtcgc gttcttgttg cactcttctc agcaagtcga gtcggcgtag 720 tgagctaggg cgcgccccag agttttattt ttgcctgtgg gttcgaatct cgttctctcc 780 cgctcgctct ctgagctgat ttgttttttt tttctccctg cgcgttgtcg agcaatcttc 840 cggccggaag gtgctcgact tgagttttcg agcaatcttc cggccgctcg acttttgagg 900 ctattttgga cctgatttga ggtcgggtga aatatttgca tgctggacgc gattcacaaa 960 gggaatttta tttcctaaga tgaca 985 // ID Copia-57_AA-LTR repbase; DNA; INV; 111 BP. XX AC supercont1.281; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-57_AA_; KW Copia-57_AA-I; Copia-57_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-111 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.281; Positions 881219 881329. XX SQ Sequence 111 BP; 41 A; 25 C; 15 G; 30 T; 0 other; tgttgagaaa caattaatca tagtatgcat agccctagcc tcgataaatg aatgtaaccc 60 tgacaatgta ataaaacttt cacttagtta tcaaccacca accactgtgc a 111 // ID Gypsy-201_AA-I repbase; DNA; INV; 4577 BP. XX AC supercont1.64; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-201_AA_; KW Gypsy-201_AA-LTR; Gypsy-201_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4577 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.64; Positions 1545624 1541048. XX CC 'GAGGA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 151..4437 FT /product="Gypsy-201_AA-I_1p" FT /translation="MASMELRLPNPLDCTNLADEWPKWKQTFLIYMIANKR FT DGDSEQSKIATFLWLIGQRGVEIFNTLFPNDGSFDGMFGVGGAQAIGGVGA FT VAGEAGAVADGVGVAAGGGRTLGDVIKAFDDYCLSRRNVAMEAFKFNMITQ FT KEKQSFGNFETELRTQLQYCEFECTNCHTSYADRLLRDRIIVGVQDKKLQL FT KLLDGKDDPLAKVVEVCKVFEAAAANKQLLDRKCLNAEIKSVVEQTQETGG FT CTKEFPEVAVVKRSCYNCGQAFNGQHFRHCPAVNIRCNKCGKIGHFQKFCK FT TGGTDFSKGRSSKPDDKKKSAMKMVKSIDWRDSGKSLTCVEQGDSAELTVE FT PNICHYRGSLNDHVIGRVRWGKTFLIQNYPISFKLDTGSDINCLPLNIIQK FT LKIELSKFKQYRVLDYSSNEIKIHGQVRLKCVDKDGGTNHVISFMVVDNNF FT EPLLGLESCVTLGLLKRTYSVSATSVLPSDANAFLSMFRTLFEGLGKCPGT FT CSIVLKEGSVPSLYYKKRIPLSLQDRLKTQLNEMVAQGIISPVEYPTDWVN FT NIQIVEKPNGSLRICLDPKPLNACIKREHFLIPKIEDIMSTLSNKKVFTVL FT DLRNGFWQMELDSRSSDLTTFMTPFGRYKWNRVPFGINSAPEMFQKKMVQI FT FGDIPGVEVYFDDLTVSGKDYEEHDRTLRLVLERAMKYNIRFNESKIQYRC FT SEIKLMGHIISNGTIRPDDKYIRAITEMQTPTSKLEVMRLLGLFKYLAKFI FT PNLSQRTTLMREVTRKDVVWQWTDAHNAELKGLLQVISTSPVLTIYDPAKP FT VVVQTDSSKDGIGSVLLQDGKPVAYASRCLTKSETKWAQIEKELLAIVFAC FT ERFHYFLYGRDFEVQSDHKPLATLINRDIDSVTVRLQRMFMFLLKYPGLSI FT VYTPGKDMLVADCLSRAPLGDPVEDDLNLSGLIHSVTKRVCMSEENYRICV FT EALRNDERYSRICKYVQEGWPSYHKLDDFSQRFHKVKNELHFEQDLLFKDH FT RLVIPTELQRTMCNWLHAPHMGIEKTLARARVHVYWPGMTQDIIETVKDCP FT TCEVLKRNNQKEPLVQDRKAEYPFQKVSVDIFEYGGNSYIALIDSYSGLLC FT AELLKDKSASQVVKALDKIFCRYGYPTEIRCDNVPFNSEECDKYADECNFR FT FVFSSPRYPQSNGLAEKGVAIAKNLLKRCLEVGELPMFQYRLLEYNNTPIA FT SMKMSPADIFFGRLLKTKLPVDAKLLHAGQINKPLVDDQIDKKRYNQKKYY FT DHHAKPLPVLNEGERVMFKKNGKEWHYGKVVRKVNERSYIVVDNFDNHFRR FT NRRFISKTNNNVVNTSDMLFEESLSKNNHNNSNVKSAITRETLQATDNNEQ FT TLVSAESQEFVTNSSYYDTASEGGNTDEENERTNENAGQDLQQSSETRTRS FT GRIVKPPKMYGEWTC" XX SQ Sequence 4577 BP; 1404 A; 807 C; 1077 G; 1289 T; 0 other; tggtgtcgga agtgggattt tcttcccagt cgtcggaatt cgaaagtatc gtttccggtt 60 aagtttgtcg tcgtggaaga ttgttaatat cgatgttatc gacggaaata taaattggag 120 aaaaaaggga attcaacgga gcaaatcaac atggcgtcga tggagctgcg tttgcctaac 180 ccgttagact gtactaattt ggctgatgag tggccgaaat ggaagcaaac ctttctgatc 240 tacatgatcg ctaacaaaag agacggtgat tcggagcaaa gtaaaatcgc cacattcctg 300 tggttgattg ggcaacgtgg agttgaaata ttcaatacgc tgttcccaaa cgatggaagc 360 tttgatggaa tgtttggcgt aggaggtgca caagctatcg gtggagtcgg tgctgttgcg 420 ggtgaagctg gtgctgttgc ggatggagtt ggtgttgctg ctggtggagg gcgaacattg 480 ggtgatgtca tcaaagcctt tgatgattac tgcttgtcta ggagaaatgt cgcaatggaa 540 gctttcaagt ttaacatgat cacccagaag gagaaacagt cgtttggaaa tttcgaaact 600 gaactacgaa cccagttgca gtattgtgaa ttcgagtgca cgaactgtca tacgtcttac 660 gcggatcggt tgttacgaga ccgaatcata gtgggcgtgc aggataagaa actgcaactt 720 aagttactcg acggaaaaga cgatcctctg gcaaaggttg tcgaagtatg caaggtattc 780 gaagctgctg ctgccaataa acagttgttg gatcgaaagt gtttgaatgc ggagataaag 840 tcagtcgttg agcaaacaca ggaaacagga ggatgtacaa aagaatttcc ggaggtagca 900 gtagtgaaac gtagttgcta taactgcggg caagcattca atggtcaaca ttttcgtcac 960 tgcccagcag tgaacatcag atgcaataag tgcggcaaga ttggacactt ccagaaattc 1020 tgcaaaacag gaggaactga tttttccaaa gggagatcat cgaagccgga tgacaagaag 1080 aagtccgcaa tgaagatggt aaaatcaatt gattggcgtg actcaggtaa atcattaact 1140 tgtgtagaac agggagacag cgctgaactt actgttgaac caaatatttg tcattacaga 1200 ggaagtttga acgatcatgt gattgggcgc gtgaggtggg ggaaaacgtt tctcattcag 1260 aattacccaa taagcttcaa gcttgacact ggatctgaca tcaactgttt gccgttaaac 1320 attattcaaa aattaaaaat tgaattgtcc aaatttaagc agtatcgtgt attggattac 1380 agctcaaacg aaatcaagat tcatggtcag gttcgtctaa aatgcgttga taaggatggt 1440 ggtacaaatc atgtgatctc gttcatggtc gtcgataaca attttgagcc gttgttaggg 1500 ttggagtctt gtgttactct tggcctttta aagcgaactt actcagtgag tgcaacgagt 1560 gttttgccgt ctgatgcgaa tgcatttctt tctatgttta gaactctttt cgaaggtctt 1620 ggcaagtgcc ccggtacatg ttctattgtt ttgaaggaag gttcggtccc gtcattatat 1680 tacaagaaaa ggattcctct gagtttacag gatcggttga aaacacagtt gaatgaaatg 1740 gttgctcaag gaattatttc tcccgttgaa tatcctacag actgggtcaa caatatccag 1800 atcgttgaaa agccaaatgg ctcattgaga atttgtttgg accctaagcc gctaaatgcg 1860 tgcatcaaaa gggaacattt cttgattcca aaaattgagg acattatgag tacgttgtca 1920 aataagaaag tgtttactgt acttgatctg cgaaacggtt tttggcaaat ggagctggac 1980 agtcggagct cagatttaac aacatttatg actcctttcg gtcgatataa atggaacaga 2040 gtaccattcg gcattaatag cgcaccagaa atgttccaga aaaaaatggt gcagatattt 2100 ggagacatac caggggtaga agtttatttt gatgatctta cggtttctgg aaaggattat 2160 gaggagcatg atagaaccct tcgactggtc ttagaaagag caatgaagta taatattcgt 2220 ttcaacgaga gtaaaattca atatcgatgc tcagagatta agttgatggg acacataatt 2280 tccaatggta ctattcgtcc agatgacaaa tacattcgtg ctattaccga aatgcaaact 2340 cctaccagca aactagaagt catgaggttg ttgggactgt tcaagtattt ggcaaagttt 2400 attccaaatc tttcgcaaag aaccactttg atgcgcgaag tcacaagaaa ggatgtggtt 2460 tggcagtgga cggacgccca caatgcggaa ctcaaaggat tgttgcaagt aatttcaacg 2520 tctcctgtac tgacaatcta cgatccggcg aagcctgtag tggttcaaac ggacagctcc 2580 aaagatggga ttggaagcgt cttgttgcaa gatggaaaac cagtcgctta cgcgtctcgt 2640 tgtctaacaa aaagtgaaac caagtgggcg cagattgaaa aagaactgct agcaattgtt 2700 tttgcttgtg aacgatttca ctactttctc tatggacgtg attttgaagt ccaatcggat 2760 cataaaccgc ttgccactct gattaaccgt gacatagaca gtgttacagt tcgtctgcag 2820 cgtatgttca tgtttttgct gaagtatcct ggtttgtcaa ttgtttatac tcctggtaaa 2880 gatatgttgg ttgctgattg cctctctaga gctcctctag gtgatcccgt cgaagatgat 2940 ttgaatttgt ccgggttgat tcattccgtc acaaaacgtg tatgtatgtc agaagagaat 3000 tacaggatat gtgttgaggc acttcggaat gacgagcggt atagtcggat ttgtaaatac 3060 gttcaagaag gttggccgtc atatcataaa cttgatgatt tttctcagcg tttccacaaa 3120 gtaaagaatg aacttcactt tgaacaagac ctacttttta aagatcatcg tctagtgatt 3180 ccgacagaac tccaacgtac aatgtgtaat tggttacatg ctcctcacat gggaatagag 3240 aaaacactcg cacgtgcgcg agttcatgtt tattggcctg gtatgacaca ggatataatt 3300 gaaacggtga aagattgccc aacgtgtgaa gttttgaagc gcaacaatca gaaagagccc 3360 ctagttcagg atagaaaagc tgaatatcct ttccagaagg tgtcggtgga tatttttgaa 3420 tatggtggta atagttatat tgccttgatc gattcttatt ccggtctgct atgcgctgaa 3480 ttgttgaagg ataagtctgc aagccaggtg gttaaggctc ttgacaagat tttctgtcgc 3540 tacggttatc ctactgaaat cagatgtgat aatgtcccgt tcaattctga agagtgcgat 3600 aaatatgcag atgaatgtaa tttccgtttt gtattttcga gtccgaggta cccccaaagt 3660 aatggtttgg ctgaaaaagg tgtggctatt gcgaaaaacc ttttgaaacg ttgtcttgaa 3720 gtaggagagt tgcccatgtt tcagtacaga ctattggagt acaacaatac tcctatcgct 3780 agtatgaaaa tgtctccagc agatattttt tttggtcgtt tgctgaaaac taaactgccc 3840 gtggatgcaa aacttttgca tgctggtcaa atcaataaac ctttggtgga tgatcaaatt 3900 gataaaaagc gttataacca aaagaagtat tatgaccatc atgcaaagcc tttgccagtt 3960 ctgaatgagg gtgaaagagt gatgtttaag aaaaacggaa aagaatggca ctatggcaag 4020 gttgttagaa aagtgaatga gaggtcgtac attgtggtag ataacttcga caatcatttc 4080 cgaagaaatc gtcgttttat atcgaaaact aacaataatg tcgtcaatac aagtgacatg 4140 ttgtttgaag aaagcctttc gaaaaacaat cataacaatt ctaatgttaa atcggctatt 4200 acacgcgaaa ccttgcaagc tacagataat aatgaacaaa cactagtatc agcggaatcg 4260 caagaatttg tcacgaactc atcatattat gatacggctt cagaaggagg taatacagat 4320 gaagaaaatg aacgaactaa tgaaaatgcc ggtcaagact tacaacaatc atcggaaact 4380 cgtacgcgaa gtggtcgtat tgtaaaacct ccaaaaatgt atggtgaatg gacgtgttaa 4440 ctaacttgtt ttttccaaaa aaaccgacta actaaattaa acccaaatcc ttatcccttt 4500 tatctgtttt aacctcgagt cagtcgcgag tgtacgactt cccaaccact aacaataagt 4560 aaaaaaaaaa agaggcg 4577 // ID DNA-2-2_NVi repbase; DNA; INV; 534 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 2-bp; KW DNA-2-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-534 RA Bao W. and Jurka J.; RT "DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 761-761 (2009). XX DR [1] (Consensus) XX SQ Sequence 534 BP; 168 A; 93 C; 104 G; 169 T; 0 other; ccctattgca aaatctcacg tgagatctta cataagatct cacctgagat ctcacgctga 60 gatctcatgc caaaatctca tctgtaattt tggtccggaa gctgctggat cctcttattt 120 aataaaaaat aaggttttta atattggaag tcgcctataa ggaggtgctc ctactcagaa 180 gagttcaaaa aactaggttt cgaagcattt aagatgtttt ttcgattttt aaaaatttct 240 catctgagat ctcaccttag atgtcacctg agatctcagg tgagatttta gatgagatat 300 ttttaaaaat cgaaaaaatg tctaaaatgc tttgaaacct agttttttga acttatttgg 360 ggggaggctc ctccttatag gcgactacca atatttaaaa aactcatttt tgaataaata 420 agaggatccg gcagcttccg gaccaaaata acagatgaga ttttggcatg agatctcagg 480 tgagatctca ggtgagatct catgtaagat ctcacgtgag attttgcaat aggg 534 // ID LINER1 repbase; DNA; INV; 5163 BP. XX AC . XX DT 06-NOV-2007 (Rel. 12.11, Created) DT 08-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE Non-LTR retrotransposon. XX KW I; Non-LTR Retrotransposon; Transposable Element; R1; LINER1. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5163 RA Jurka J.; RT "Non-LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1175-1175 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 169..1593 FT /product="LINER1_1p" FT /translation="MDSELITLLPQAGQATVPSPAVDSDRVKAPVVDAGPM FT EFGGRGGQTHLEAAEVYHLTTAACRDAGDMEMEVDADRDKRAACITQRNAH FT LDRLGSRIDELMDYVRDRHNVHKQIKDLARIIRASYKNLCMAEDNLQQCQR FT PGTQGRETQTTPSLRSRKAGTTKKAPTTPQDRNGGNKEEDNQPKKAAWQEV FT VTRKSKKKEKKASKEDKSVPGVTNQNRTSKDGPKRARPPRPDALIIKAAEG FT KSYADILSKMKAAPSLNTLGNSVNKIRRTVAGDLLIELKRTKEVKTSDFQE FT AVKAVLVEGATIKALQEEETIEVRDLDMLTSKEEVLEALQKEIGEENIIEV FT STIRSLRKTYGDTQIAVIRVPAQIAAKITKLQKIRIGWVNCRIRVANRKNE FT PLRCYKCLGFGHIGRKCTVTEDRSKLCFKCGKDGHKAKECKNQPNCVLCKG FT GTGGKSDHAAGSYACPVYRAAVEATDRQRK" FT CDS 1593..3350 FT /product="LINER1_2p" FT /translation="MKIVQLNINHCATAQDLLSQYVRETGIDIAIVCEQYR FT DLDKPSWDMDSTGKAAIWACGDAAFQEKMTRRNEGFVRAKVAGVHIYSCYA FT SPNTSIEQFRQLLDQIVQDAVGRKPVLIAGDFNAWAVEWGSQRTNERGRAL FT LEAFALLDLVLVNQGCSYTFQRGDAGSIIDLTFVSSCLIGSVRSWMVSDHY FT TNSDHQAIIIEIRISKQRSNGRATTKRVGWKTKDYDKETFLLALEEIQLSG FT SANDKVEQVMGNITRACDATMPRRAPNNRRPPVYWWDKEIDAARKECHRTR FT RREQRARKKYYKTGRGKEIVDALNKEMKDAKRILKKKIRENKRRCLKELQD FT EVELDPWGRPYQVVMKKIKGGYIPPPKSPELLDRIVTTLFPLQLEVTAIPE FT AEANDETIPTITTEELLAACKRIGNNKAPGLDGIPNIALKQAIQARPDVFV FT DLYNSCLEEGTFPKNWKKQRLVLLPKGDKPPGEAASYRPLCMLDTPGKILE FT RIIGVRMDQVIEKPGGLAEYQYGFRKKRSTLDAVSLVVDTARTAIEGKRWK FT GGSKKYCAIITLDVKNAFNSASWGKIHEALRKKEVPAYI" FT CDS 3362..4612 FT /product="LINER1_3p" FT /translation="MSDYLKDRTLLYDTESGTKTYQVTGGVPQGSVLGPLL FT WNIMYDGVLRLELPEGATVVGFADDIAVVVVAKQKEEVTEIANEAVSIIHD FT WLKQTGLELASHKTEAILISSRKKIETITLSVDGHEIDSQPTIKYLGITMD FT ARLTFKQHLERVSNKAAKVGAALSRLMPNVGGPTQGRRLLLASVTTSIMLY FT GAPIWADAMLVKSYARKLSTVYRRSALRVASAYRTVSEDAVCVISSMPPID FT LLATERKNIYEEXRRVDNTQKEVWKSAKEQTLAEWQRRWDSSGKGRWTHRL FT IPRIVGWTNRRHGEVNYYLTQFLSGHGCFRAYLHRFKIEDNPNCPVCLEAN FT EDAEHVFCNCSRYEMEREELERYLQTRVTPESMMTAMLASKDGWCAVNNYV FT RTIIKKVRNDEENRREGQGETAE" XX SQ Sequence 5163 BP; 1690 A; 1100 C; 1398 G; 972 T; 3 other; gcgcggggat gccgataccc cgtatgtggc gcgtcccctg ggtggcggat aggggaatgc 60 tcatcggcgc gtcgaggtct cctcaaccgg ccgaagggta gagcagaaat aaaatatgct 120 ccgcggacca aacaggctag gggaagaaac cccgaagata aatcaaacat ggatagtgag 180 ttaataacat tactacccca agcgggtcaa gccacagtgc ccagtcccgc cgtcgattcc 240 gatcgggtga aagccccggt ggtcgacgct ggccccatgg agtttggggg gaggggsggg 300 caaactcact tagaagccgc agaggtttat cacctgacca ctgctgcatg tagagacgca 360 ggtgatatgg aaatggaggt tgatgcagac agggataaga gagctgcatg cataacgcaa 420 aggaatgcgc acttggatcg ccttggcagc agaatcgatg aactgatgga ctatgtcagg 480 gacaggcaca atgtgcacaa gcaaattaaa gatcttgccc gaatcataag agcaagctat 540 aagaatctct gcatggcaga agataactta cagcaatgcc agcgcccggg aacacagggg 600 agggaaaccc agacaacccc aagtcttaga tctcggaagg cgggtacaac aaagaaagcc 660 ccaactactc cacaggatag aaatggagga aataaagaag aggataatca acctaaaaag 720 gcagcgtggc aggaagtcgt cacaagaaag tcaaagaaaa aggagaaaaa ggcaagcaaa 780 gaggataaaa gtgtccctgg cgttacaaac cagaatagga ccagcaagga tggaccgaag 840 cgggcaagac ctcccagacc ggatgcttta atcatcaagg ctgcagaagg aaagagctat 900 gcagacattt tgtcaaaaat gaaagcggct ccgtcactta acacactggg gaatagcgta 960 aataaaatac gcagaacagt tgcaggagat cttctgatag agcttaaacg cacaaaagaa 1020 gtcaaaacct cggacttcca agaagcagtt aaagcggtac tagtggaagg cgccacaata 1080 aaagcactcc aggaagaaga aaccattgaa gtaagagatc tggacatgct tacctcaaaa 1140 gaggaggtac tagaagcact acagaaggaa attggcgaag agaacatcat agaagtctct 1200 acgataagat ctctgcgaaa gacgtatggt gacacgcaga ttgcagtgat acgggtacca 1260 gctcagatag cggctaagat cactaagcta cagaagatca gaataggatg ggtcaactgt 1320 aggatccggg tggctaaccg caaaaacgag ccgcttaggt gttataaatg tctaggtttt 1380 ggccatatcg gtagaaagtg cacagtaact gaagacagaa gtaagctttg ctttaaatgt 1440 ggaaaagatg ggcacaaagc aaaggagtgt aagaaccaac ccaactgtgt actctgcaaa 1500 ggaggcacag gtgggaagag tgaccacgca gctggtagct atgcatgccc agtttaccga 1560 gctgcagtag aagccactga cagacaaaga aaatgaaaat agtgcagttg aatataaatc 1620 actgcgcgac tgcacaggac ctactgagcc agtatgtccg cgagacagga atcgacatag 1680 ccatcgtctg tgaacaatac agagacctgg acaaaccatc gtgggatatg gacagtactg 1740 gtaaagcggc gatatgggca tgcggagacg ccgctttcca ggaaaaaatg acaagacgta 1800 atgaaggatt cgtacgcgct aaggtcgcag gcgtacacat ttacagctgc tatgcttcac 1860 ccaacacgtc gatagaacaa ttcagacaac tgttggacca aatagtacag gatgctgtag 1920 gaagaaaacc ggtactgata gcaggcgatt tcaacgcctg ggcagtagag tggggcagcc 1980 aaaggacgaa cgaaagagga cgagcgctat tggaagcatt cgccctcctt gacttggtct 2040 tagttaacca gggatgttct tacacatttc aaagaggaga cgcagggtcc atcatagatc 2100 tcacgtttgt cagcagctgc ctgattggct cagttagatc gtggatggtg agcgatcatt 2160 acacaaacag tgaccaccaa gcgataataa tagagataag aatatcaaaa cagagatcca 2220 acgggagggc aacgacaaaa agagtcggtt ggaaaacgaa ggattatgat aaggagacat 2280 tcctgctggc tctagaagaa atacagctgt ctggatctgc gaatgataag gtcgagcagg 2340 taatgggaaa tattacccga gcctgcgacg caacgatgcc aaggagagct cccaacaata 2400 gacgaccgcc agtatactgg tgggataaag aaattgacgc agcccgcaaa gagtgtcatc 2460 gaaccagaag acgggaacag cgggctagga agaaatacta caagactggt cgtggtaaag 2520 aaatagtaga cgccctaaat aaagaaatga aggatgctaa acggatactc aagaaaaaaa 2580 tccgagaaaa taaacgacgg tgccttaaag agttacagga cgaggttgag ctggacccct 2640 gggggcgacc gtaccaagta gtaatgaaga agataaaggg aggttatatc ccccctccta 2700 aaagcccgga attgctggac cgtattgtga ccacactgtt tccccttcaa ctggaagtaa 2760 cagctattcc tgaagcggag gcaaatgacg aaaccattcc aacaatcacc acagaggaat 2820 tactggctgc atgcaaaaga attgggaaca acaaggctcc aggcctggat ggcataccca 2880 atattgcatt gaaacaagct atacaggcac gccccgatgt ctttgtggac ttgtacaatt 2940 catgtctcga agaagggacg tttcctaaga actggaagaa gcaacgcctg gtgctactac 3000 caaaaggaga taagccacct ggagaagctg cctcatacag accactctgc atgctggaca 3060 ccccgggcaa gattttagag cgcataatcg gcgttagaat ggaccaagtc attgaaaaac 3120 caggcggact agcggaatat caatacggct tcaggaaaaa acgctccact cttgacgccg 3180 taagcttagt agttgacacc gctagaactg caatagaagg aaaaagatgg aaaggaggat 3240 ccaagaagta ctgtgctatt attacactgg atgttaaaaa tgcgtttaac tcagccagct 3300 ggggcaagat acacgaggca ctacggaaaa aagaggtacc cgcatatata taagaaggat 3360 tatgtctgac tacctgaaag acagaaccct tttgtatgac acagagagcg gcacgaaaac 3420 ctatcaagta accggagggg tcccacaagg gtcagtgctt ggcccactat tgtggaacat 3480 catgtacgat ggcgtactta ggctagagct acccgaaggt gcaacagttg tagggtttgc 3540 agatgacatt gcagtagtgg tagtagcaaa gcaaaaggaa gaggtgacgg aaatcgctaa 3600 cgaggcagta agtataatcc acgattggct aaagcagact ggacttgagc ttgctagcca 3660 taaaacggag gctattctta tatccagtag aaagaaaata gagacaataa cgctgtcagt 3720 ggacggacac gaaatagact ctcagccgac cattaagtac ctgggaatca ccatggacgc 3780 cagactaacg tttaagcagc atcttgagag ggtgagcaat aaggcagcaa aagtcggtgc 3840 tgcactatcg cgactaatgc caaatgtagg tgggccaacg cagggtcgaa ggttactcct 3900 agccagtgta actacatcaa ttatgttata cggtgcccca atatgggcgg acgctatgtt 3960 ggtgaartcc tatgcacgaa agctgtcaac agtttatagg agaagtgcat tgcgggtcgc 4020 ttctgcctac cgaacagtat cagaagatgc agtgtgcgtt atttcaagta tgccgcctat 4080 tgaccttcta gcaacagaac gaaagaatat atacgaagaa arccggcgtg ttgacaatac 4140 tcaaaaggaa gtttggaaat ccgcaaagga acaaacccta gctgagtggc aaagacgatg 4200 ggactccagt ggaaaggggc gatggacgca ccgcctcata ccacgtattg ttggctggac 4260 caatagacga cacggggaag tgaattacta ccttacgcag tttttgagcg gacatgggtg 4320 ctttcgagcc tacctacacc gcttcaagat agaggataat ccaaattgcc cagtatgttt 4380 ggaagcaaac gaagatgcgg agcatgtctt ctgcaactgt tcgcgatacg aaatggaacg 4440 agaagaactc gagcgttacc ttcagactag ggtgacccct gagtcaatga tgacagctat 4500 gctagcgtca aaagatggct ggtgcgcagt aaacaactac gtaaggacca ttatcaagaa 4560 ggttagaaat gacgaagaga atagaaggga aggacaaggc gaaacagctg agtaaaactg 4620 ttgctagacg aaggccacta ggtaatagat acgaaacgct cctcggactg atgcagagga 4680 acgtaggtca cagagttgag ctctcactcc cctgcccgat gcagaggaac gtaggagcta 4740 gggttgagta cctctaagtg ctcctcagac cgatgctgag gaacgtaggt aactgagttg 4800 agcccagact ccctcacccg atgcagagga acgtaggtgt cggggttgag tctgtccaga 4860 ggatggccgg tcgatgctgc tcgaagtaga cgaccggacg atgcggcagg aagcagacgg 4920 ctggacgatg cagaaggaag aagacggcag gacgatgcag aatgaagaag acgagtttga 4980 tcctgaaccc ctaatcacct tggtggatgg gggatgtatg aaaatgtcaa acttcaaagg 5040 agaagccaaa aaaggctagt taacgggccc cacggaagta ttgttcaacg atagtacctg 5100 tggggatccg gtgtcagagg gccacttgtg cagagcttgc tctgcctacg ccacataaaa 5160 aaa 5163 // ID Gypsy-35_CQ-I repbase; DNA; INV; 6004 BP. XX AC AAWU01032713; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_CQ_; KW Gypsy-35_CQ-LTR; Gypsy-35_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6004 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 449-449 (2011). XX DR Genome; AAWU01032713; Positions 25314 19311. XX CC Positions [4925-5413] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 880..2100 FT /product="Gypsy-35_CQ-I_1p" FT /translation="MTENLLEKTYEGLGVILKKISKNPNRKYRSKTLLTKL FT DNAKSLYRIGLSEIEFLPEQKQAPLTKTLRFLQGEIITLINEKLEKHVEPL FT SFRTLVNVIITLNRSYKKVKMASAAEIIQIASTLVPEYDGQGDKLKNVTSA FT LAALTPLVTAATEPIAVQVVLTKLTKKARSTVGDNPANLAEIVTKLEAKCS FT SSVSSETIIHKLNKTKQNGSLKKFIAEIEELADHLEAALLLENYSVDHAGN FT IAHKTAIKTLREGLKNERTQLILEAGTFTRLSEATAKAMEVDDKSHNLNIF FT FQQKNNRGRKFNRGGQHQQNNFQNGNNFNGNGNGNGNGNGNGNGNGDRGRS FT SRGRGRGQHNQGHYNQNQNQGADRQVFVNRTENGQIPQQNNQVGGQRQQPN FT QPNTTIAQTHNQQ" FT CDS 3233..5758 FT /product="Gypsy-35_CQ-I_2p" FT /translation="MSGFHQIELEEESKKFTAFSSSNGHYEFNRLPFGLNI FT SPNSFQRMMTIALSGLPPECAFLYIDDIIVVGCSINHHISNLENVFSKLRH FT FNLKLNPAKCNFFGTDVTYLGHHVSDKGIQPDKSKYEVIQKYPKPENADDV FT RRFVAFANYYRRFLPDFAKTAHPLNRLLRKNQPFIWSDICENAFPEIKNKL FT LSPEILQFPDFEKQFILTTDASKISCGAILSQNHGDIDLPVCYASRTFTKG FT ESNKFTIEQELVAIHWAIDHFKPYLYGRKFVVRTDHRPLVYLFSMKEPSSK FT LTRMRLDLEEFDFTVHYVPGKSNVGADALSRVVIDIDELKQMYVHLVQTRA FT RAKLHQQNQTKPSSVVEPDQLKIYESCNINDVYNLPKLEIVGIDTNAQTGI FT KISIRDKRYRKELTPALIIPMPLSNTNDNLMEIIQNIDSQANKLGIDKLAL FT ALTAEFSSLCDIQKFIHAYKVLATPKPNPITNQKSKQINLTLIFYKKQRII FT NDPLEALRIIEYYHDSPIGGHVGSYRLWKKLKAYFEFKNMQKLIKTFVEKC FT VQCKMNKHSTKTNEIYTKTSTPHSVFETISIDTIGPMTKTQNGNRYALTMQ FT CDLYKYVIVVPIQDKQAQTLAKAFIENLILVFGCPKNIKSDLGTEYKNEVF FT GKISQFLQLNLKFSTAYHPQTIGNLERNHRCLNEYLRSFTNELHDDWDSWL FT TYYCFCYNTTPNTDMQYTPFEIVFGKQANFPTYTLNNIEPMYNHEEYYTQL FT KFKLQTIAQRVRDIIIKTKLKRIQNQKSSNPINLKVGDKILIKSETNRKLD FT NIYEGPYTIVSLNHPNITIFHTPSNKNKEIHKNRVIKYAI" XX SQ Sequence 6004 BP; 2315 A; 1126 C; 1012 G; 1551 T; 0 other; aggctacatg gcgaccgtga gcagcggctc ggacgcaacg cgccgcgatt tttaattcct 60 ttttttttct cgggactttg taaaaaaaag tgaagtgaac tctaagtgat acacaaaact 120 ttaccagtga aaatggaaag agaaatcacg cgaatcgaac tgcgagattg cttgtgtctc 180 aagctcggaa tgcttgaaca atttccggag ttaaaacacc aaaaagcagt tgaaaaaatt 240 cggaaaatga tagacacgag attctatcta aacaatttgg ctttacaacg agctctagga 300 atacttcgtt ggatcgcgaa ggaaaatcgc aaattgtacc aacaagtaca gtaaatatgg 360 aaaccattaa aataacagta aacagattga acactacatc agtgccaatc atcgcaagaa 420 cgaatttaaa atgtgtcaaa ggccggattc agtgcctcgt aaaatgtgaa cttttaacaa 480 tactgagaaa ctatgaacaa tcggtgttag tggaaaaaac aaaaaaataa taagataagc 540 gccgccgcga atgactaaga gccaaaagaa gatcttttac gaaaaagaac gccaacgatg 600 gaatcggcta agggcagccg accagcagca acaacagcag cagcagcagc agcagcgaaa 660 ccaggtgcag ccgcgccagg aaataaacaa ccgtaaaggg gagcgcgtta ttgtttaacg 720 gatgcatatc cttacgcggt gttttcgtca atcttcaaga aggggagcgc acataagtgg 780 ccgcataccc gaatcatcaa tcttcaagaa ggggagcaca caacagagat tgcatacccg 840 ctataatctt catttaaatg tataagataa gtaatctaga tgacagaaaa tttattagaa 900 aaaacatatg aaggtctggg ggttattcta aaaaaaattt caaaaaatcc gaatagaaaa 960 tataggtcta agaccttatt gacgaaatta gataatgcga aaagcttata tcgcatagga 1020 ctttcggaaa ttgaattttt gccggagcaa aaacaagctc cgttaacaaa aactttacga 1080 tttctccaag gagaaatcat aacattaata aatgaaaagc tcgaaaaaca cgtcgaacca 1140 ctttcattta ggacactagt taatgttata attacattaa acagatcata taaaaaagtt 1200 aaaatggcaa gtgctgccga aatcattcaa atagcttcta ccctcgtccc tgagtatgac 1260 gggcagggag ataagctaaa aaatgtgaca agtgcactag ctgcattgac accactcgtc 1320 acagcagcaa ctgaaccaat cgcagttcag gttgtgctga caaaattgac gaaaaaagcc 1380 agatcaacgg taggtgacaa ccctgctaac ttggctgaaa ttgtgacaaa actggaagct 1440 aaatgctcca gttcggtttc atccgaaaca ataatacaca aattaaataa aaccaaacaa 1500 aatggttctt tgaaaaaatt tatcgcagag atagaagaac tagctgatca tttggaagca 1560 gctcttctac tcgaaaatta ctctgttgac cacgcgggga acatagctca taaaacggct 1620 attaaaactc tgcgtgaagg tctaaaaaat gaacgcactc agttaatttt agaagcgggg 1680 acgttcacga gattatctga agctactgcc aaagcaatgg aagtagacga taaatctcat 1740 aatttaaaca tatttttcca gcagaaaaat aatcgcggaa gaaaatttaa tagaggaggt 1800 caacaccaac agaataattt tcaaaatgga aataatttca atggtaatgg taatggtaat 1860 ggtaatggta atggtaatgg caatggcaat ggtgacagag gacgttcttc tagaggaaga 1920 ggtcgaggtc aacataacca aggccattac aaccaaaatc aaaaccaggg ggctgatcgt 1980 caagttttcg tcaacaggac ggaaaacggt cagatccccc aacagaacaa ccaggttggg 2040 ggacaaagac aacaaccaaa ccaaccaaac acaacaattg cccagactca caaccagcaa 2100 taaagcggtt ttttacagcc agcaataatt taagtaattt tattaatgta aaattagata 2160 tatcagacgc caaatgtaca tttttagttg ataccggcgc tgaaatatcc ttattaaaaa 2220 ttacgaaact ttttggatcc actccaatgg actccgaaaa taaatgcacc atatcaggta 2280 tcaacgatga aggtattcat acgttgggaa gtacttatgg taacataatc ttagaaaaca 2340 taaagatatc tcaagaattc caattaatca atgaggaagt aacaatcccc actgatggaa 2400 ttcttggaag agactttctt acaaagtacc attgtaacat cgattacgat acatggttac 2460 tttccggaac tgtcaattcg gaaaaatttg aattcgaaat aattgacaac ttggaaggaa 2520 atacttatat tcctcctcgt tgtgaagtat ttcgaaaagt aaatttacca gataccaggt 2580 ctgtgtattt aataaaatca catcagactg accgtggagt cttcatcgca aactccattg 2640 tagatagtag ttttccatat gtaaaaattt tgaatactac taataaaaca gtaaagatca 2700 ataaaaactt tactgaaaat atgaccaatg tcgacaattt tgacgcttat aactttggtc 2760 acgaaatcaa aaataaacgt cataaaaaat tactgatgtc gctaaaactg gataaagttc 2820 ccaaaaaagc taaagacaaa ttaatagaat tatgtcgcga atataatgat attttctctt 2880 tagaaaatga tcatctcacc acgaataatt tctataaaca aaaaataaac ttggaagata 2940 gtcgaccagt ttatattaaa aattaccgaa ctccagaagc acacattccg gaaacaaaca 3000 aacaagtgca caaaatgcta aatgacggga ttataagaca ctcaacgtct cattataatt 3060 ccccgataat gctagtgccc aaaaaaccga caatgacgag aaaaaatgga ggttggtagt 3120 ggatttccga caattaaata aaaagattct cggtgacaaa tttcctttac caagaattga 3180 tgaaatttta gatcatctag gtagagcaaa atatttcaca actttggatt taatgtccgg 3240 atttcaccaa attgaacttg aagaggaatc aaaaaaattc acagcctttt ctagttctaa 3300 tggacactac gaattcaata gacttccatt cggattaaac atttcgccca acagttttca 3360 aagaatgatg acgattgctt tgagtggatt gccaccagaa tgtgcattcc tttatattga 3420 cgacattatc gtcgtcggat gctcaatcaa tcatcacatc tcaaatctgg aaaatgtttt 3480 ttccaaacta agacacttca atcttaaact aaatccagca aaatgtaact ttttcggaac 3540 tgacgtgact tatttaggtc atcatgtctc agataaaggc attcaaccgg acaaatcaaa 3600 atatgaagtg atccaaaaat atccaaaacc agaaaatgca gatgatgttc gaagatttgt 3660 agcatttgca aattattatc gacgcttctt gccagatttc gctaaaacag ctcacccatt 3720 aaatcgacta ttacgcaaaa atcaaccatt catctggtct gatatatgcg agaacgcttt 3780 tccagaaatt aagaataaac tgttgtcacc agaaattctt caatttcctg actttgaaaa 3840 gcagttcatt ttaacaacgg atgcttcaaa aatatcatgt ggagcaatat tatcccaaaa 3900 tcatggagat attgacttac cagtatgcta cgcaagcaga acattcacta aaggtgaatc 3960 aaataaattc actatagaac aggaactcgt tgccatacat tgggcaattg accattttaa 4020 gccatattta tatggtagga aattcgtcgt tagaacagat catagaccac tagtctatct 4080 gttctcaatg aaagagccat catcgaaact tactagaatg cgcttagatc tcgaagaatt 4140 cgatttcaca gtacactatg tacctggaaa atcaaacgtc ggagctgacg cactttcaag 4200 agtagttatc gacattgatg agcttaaaca gatgtatgtg catctcgtac aaacacgagc 4260 gcgagcaaaa ctgcatcaac aaaatcaaac aaagccaagt tcagtagtag agcctgatca 4320 actaaaaatc tacgaatctt gcaatataaa tgacgtctat aatctaccaa aacttgaaat 4380 agttggtatt gatacaaatg cgcaaaccgg tataaaaatt tccattagag acaagcgata 4440 cagaaaggaa ttaactccag cgttaattat ccctatgcct ctgtctaata ctaatgataa 4500 tctcatggag attattcaga acattgactc acaggccaat aaactaggaa tagacaagtt 4560 agctctagcg ctaaccgctg aattctcaag tttatgtgat attcaaaagt tcatacacgc 4620 gtacaaagtg ctagccacac ccaaaccaaa tcctattact aaccaaaagt caaaacaaat 4680 aaaccttacg ttaatctttt ataaaaaaca aagaattata aatgatccac ttgaagcact 4740 aaggattata gaatattatc atgactcacc gataggaggt catgttggtt cttatcgcct 4800 ttggaaaaaa ttgaaagcat actttgaatt taaaaacatg caaaaactaa tcaaaacttt 4860 tgtagaaaaa tgtgttcagt gtaaaatgaa caaacattct acaaaaacaa acgaaattta 4920 tacaaaaact tcaactcctc attcagtttt cgaaacaatt tccatagaca caattggacc 4980 tatgacaaaa acgcaaaatg gtaatcgata cgcattaact atgcaatgcg atctatacaa 5040 atacgttatt gttgttccaa tccaagataa acaagctcaa acacttgcaa aagcgtttat 5100 agaaaacctc attttagttt ttggatgtcc caaaaatatt aaatcagatt taggaacaga 5160 atacaaaaat gaagtattcg gtaaaatttc tcagttctta cagttgaatc tgaaattttc 5220 aacagcttac cacccacaaa caattggtaa tttggaacgt aatcatcgtt gcctcaacga 5280 atacctcagg tcattcacaa atgagttgca tgatgattgg gattcatggc ttacttacta 5340 ttgtttttgt tacaatacta caccaaatac tgatatgcag tatacacctt tcgaaatagt 5400 ttttggaaaa caagcaaatt ttcccacata cacactaaac aacatagaac ctatgtacaa 5460 tcacgaggaa tactatacac aattaaaatt caaattacaa acaatagctc aaagagtacg 5520 tgatatcata atcaaaacaa agctgaaaag aatacaaaac caaaaatcat caaacccaat 5580 caatcttaaa gtaggagaca aaattttaat taaatcagaa accaatagaa aattagacaa 5640 catttatgaa ggcccatata ctatagttag tctaaatcat ccaaatataa caatttttca 5700 tacaccctca aataaaaata aagaaatcca taagaatagg gttatcaaat acgcaatcta 5760 aaaataaaaa aaaaaaacct catccgagcc taacttcaac cgaagtaggt gaaagtttcc 5820 cggcaatagg ccatggaagt gaataattcg gaaataacaa aaaacaggaa gattttgttt 5880 atatacacaa gattgtaaaa ggcaactgga gaatggtaaa gaataatttt tacttcataa 5940 aaatcattct tcttttaggg ggaaggtgtg ataggttacc atccctgcac gggtttacca 6000 accc 6004 // ID Gypsy-76_AA-I repbase; DNA; INV; 4868 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-76_AA_; KW Gypsy-76_AA-LTR; Gypsy-76_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4868 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 2939232 2944099. XX CC 'CTCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 719..2593 FT /product="Gypsy-76_AA-I_2p" FT /translation="MEESRGFPPFRCDRIESAKLAKEWKRWKESLDCYFEA FT YGITDQRVKRAKLLHLGGSALQTVFRNLGDPEHVPLVALDPKYYDMAIDQL FT DEFFEPRHQSTSERRKLRQMKQQANERFADFMIRLKQQASECGFDRYGSEV FT GQILTNIYLTDAVVEGCMSNEVRKMILLKDLPFPEIEMLGVSQEGVELQVE FT EISEGQRPGKMYRVTEQRADRRKITSNAEKLQGRNCYNCGRLGHISISAGC FT PARGKQCHNCKSFGHFEKLCRKPKRQYERTSNAKQIRAIETTHMSDDTPVG FT QDLPGSPEKDKVYYAFYTGNELNVIPCVIGGVKVDMLVDSGADANLITAKA FT WSMMKEKHISIHSSEKGSCRMLRGYGSDKPLTILGSFTSDVATGSKSVRAE FT FLVVEEGQRCLLGDKTAKQLGVLRVGINVGQVANVPKPLGKIKGIKVNIQV FT NPEVTPVFQPMRRIPLPLEEAVGRKIDELLKRDVIEAKSGPTSWVSPLVVV FT GKANGEPRLCLDLRRVNEAVLREHHPMPVVEDYIARLGQGSIWSKLDVKEA FT FLQIELAEESRDLTTFITSRGLFRFKRLPFGLVTAPELFQKAMDEILAGCD FT GAYCFIDDITIEGKNAEEHDNRLEKVE" FT CDS 2890..4815 FT /product="Gypsy-76_AA-I_1p" FT /translation="MNKYIPNLATIDEPLRKLLQKDAKFEWTERQSDAFEA FT IKAALCRVENLGFFKMEDRTAVIADASPVGLGAILAQFDKDGRHRVISFAS FT KSLTETERRYCQTEKEALALVWSVERFQYYLLGKPFDLVTDCKVLELLFSK FT RSKPCARIERWVLRLQMFEYKVVYVSGKDNVADTLSRLAVRDAKPFDITEE FT VIIHEVSTYAAGSVALTWREIQDATNTDSDIQQVKNTLELGHLQDLTVEYR FT VCANELSVYENVLLRGDRIIVPLSLRKRVLATAHEGHPGITMMKNHLRTNV FT WWPKMDAEVERYVKQCRGCTLVAAPEPPEPMQRSQLPSSPWQTIALDFLGP FT LPEGQHILVVIDCYSRFMEVVEMESTTAKDVVRELSIMFSRYGMPSALKAD FT NAPQLSAECAEFREFCVANGIKLLNTIPYWPQSNGEVERQNRSILKRLRIS FT QELGQDWRVELRRYLLTYHSTKHPSTGKSPGELMFGRHIKSKLPTVINFDE FT DACVRERDAISKEKGKVYGDKKRQARGSELLEGDVVLAKRMRKASKVDSDF FT ANEEFIVKEKTGTDTIIQSKVSGKQYRRSSAHLKKIGDRETVDDSVVPGQE FT KIGDENNRDLPIEPSSEENHCLKRASGAKRISNQPVRYQDYVTY" XX SQ Sequence 4868 BP; 1507 A; 883 C; 1260 G; 1218 T; 0 other; gttggcgacg aagatagggg ataaaaagta agtaaaaaaa aaactcacgg gttgtgtgga 60 acgatttaat ttatccaatg aacaattatg tggcaatcag aatacggatt atgattgtaa 120 atcgacaaaa ccatgaattt aggtgtttgg gaagatgcga gaacaaaaga gtgttctccg 180 gagataagct ggatgggaag aaaaacattc agtggagtcg gcagaaatta ttgaatttct 240 ccagttgcaa acgcaagaat cggaaggatt cttcacgaat tgggttgtat cagttggata 300 cttttgtcgc aggaaaaaaa atggtatttt ttcaagatgg cgctgaaaaa aaaagagtaa 360 agtttggtcc gcacgctaac ttgtggaaat agcgccaaac ggagctggtg cagtgtagta 420 gtagtagtag tggaatcaat cgaaaagagg aggtgtgaag gggtgaaaga aaaatgaatg 480 aagtagtatt ggtagaacgg gatgaaattt gctgacgtca caatttacac tgttgctaaa 540 ctttcatgtg caaataaagt tgtttaccac tcatggagga acgtttacga atggttggat 600 ataatgattt cgtggacggt agtgcaatga aattgtttac atggatggtt aatgatattc 660 tgatgagata atggtgttcg gaaattatgt taacaagtgg ttatacgtct atttttagat 720 ggaagaatct cgaggtttcc ctcccttccg atgcgatcgg attgagtcag cgaagttggc 780 caaagaatgg aaacgctgga aggaatctct cgactgctat ttcgaggctt acgggataac 840 cgatcagaga gttaaacgag caaagctgct gcacctgggt ggatccgcac tacaaactgt 900 gttcagaaat cttggagacc ccgaacacgt accgttggtt gcgttggatc caaaatatta 960 tgacatggcc atcgatcaac tagatgagtt ttttgaaccg cgtcatcaaa gcacatctga 1020 acgccgaaag cttcgccaga tgaagcagca agctaacgag aggtttgctg attttatgat 1080 acgcttgaag cagcaagcat cggaatgcgg gtttgacaga tacgggtctg aagtaggcca 1140 gatcttaacc aatatctatc tcacggatgc cgttgtggaa ggatgcatgt cgaacgaagt 1200 gcgcaagatg atattactga aagatcttcc tttcccggaa atagaaatgc ttggggtatc 1260 gcaagaaggt gtggagttac aagttgaaga aatttcggaa ggacaacgac caggaaaaat 1320 gtacagagtg actgaacaaa gagcagaccg tcgcaaaatc accagtaatg ccgaaaaatt 1380 gcaagggaga aactgctaca attgcggacg gctgggccac atttctatat cggcgggttg 1440 tccagctcgc ggaaagcaat gccacaattg caaaagtttc gggcacttcg aaaaactctg 1500 tcggaagccc aagcgtcagt atgagagaac gtcgaatgcg aaacagatcc gtgcaattga 1560 aaccactcac atgtctgatg acaccccggt agggcaagat ttgccgggaa gcccggagaa 1620 ggataaagtg tattatgcgt tttacactgg aaatgagtta aatgtaattc catgcgtaat 1680 tggaggagta aaagtcgaca tgctggtgga ctcaggcgcc gatgccaacc ttatcaccgc 1740 caaagcttgg tctatgatga aagaaaagca catttcaata cattcttccg aaaaaggaag 1800 ctgtagaatg ctgcgtggat atggtagtga caaaccactt accattctgg gctcgtttac 1860 atctgatgtt gctaccggtt caaaatctgt tcgcgcagaa tttctagtgg ttgaagaagg 1920 tcagcgttgt ctattaggag acaaaacggc caagcagttg ggtgtgctgc gagtaggcat 1980 aaacgttggg caggtggcaa acgtgccgaa accactgggc aaaatcaaag gcatcaaagt 2040 taacattcag gtgaacccag aggtaacacc tgttttccaa ccgatgcgca gaattccttt 2100 accgctggaa gaagcggttg gcagaaaaat tgatgaactt ctgaagcgtg acgtaattga 2160 agctaaatcg ggtccaacca gctgggtgtc cccattggtc gttgttggga aagccaatgg 2220 ggagcctcga ttgtgtctgg atttgcgtcg ggtcaacgag gctgtgctta gagagcacca 2280 cccaatgccg gttgtcgagg actacatcgc acgtctggga caaggaagca tttggagcaa 2340 actcgatgtg aaagaagcgt tcctgcaaat agagttggca gaggagtcaa gagatttaac 2400 cactttcatc acgagtcgcg ggttgttccg ttttaagcgc cttccatttg ggcttgttac 2460 agcgcccgaa ttgttccaga aggcgatgga cgagattctg gcgggctgcg acggggcata 2520 ctgtttcatc gacgacataa ccatcgaagg aaagaacgca gaggagcatg ataatcgcct 2580 tgagaaggta gaatgaataa tttgattttc ttcttctgtt ttatcacgtg aatttttaaa 2640 gaatgaataa aaaaacatgg taaagtactt atataatgtt tattttaggt tttgtctcga 2700 ttgaaagagc gatgtgttga gcttaactgg gaaaaatgta aactgagagt tactgagctt 2760 gagattcttg gccataggat tggcacaaat ggaataagcc catctgaatc gaaaattgca 2820 gccatacgca cgttccgcca acctcagaac gaggctgaag tgcgaagctt tttagggttg 2880 gcaaattaca tgaacaaata cattccaaac cttgctacga ttgatgaacc cctaaggaag 2940 ctattacaga aagatgcgaa attcgagtgg acagagcgtc agtctgatgc attcgaagca 3000 attaaggcag cattgtgcag agtcgagaat cttgggtttt tcaaaatgga agatcgcacc 3060 gctgtgatag cggatgcaag tcctgtgggg ctaggagcaa tactagctca gtttgataag 3120 gatggacgac atagagtaat aagtttcgct tctaaatcat tgactgaaac agagcgacgt 3180 tactgtcaaa cagagaaaga agctctagct cttgtatgga gtgtggagcg gttccagtat 3240 taccttttag gaaaaccttt tgacctggta accgactgca aagttctaga gcttctattc 3300 tccaagcgat ctaaaccgtg cgctcgcata gaacgatggg ttcttcggtt gcaaatgttt 3360 gaatacaaag tagtgtatgt atcggggaaa gataacgttg cagatacatt atcgcgactc 3420 gctgtgcgag atgcaaaacc gtttgatatc actgaagagg ttattataca tgaagtatct 3480 acatacgctg ctggaagcgt ggcactcaca tggcgggaaa tacaggatgc tactaatacc 3540 gatagtgaca tacagcaagt gaaaaatacg ctagagttag gtcatcttca ggatttaact 3600 gttgaatatc gtgtttgcgc gaatgagttg agtgtatacg aaaacgtcct cctacgtggt 3660 gatcggatca tcgtaccact gtcgcttcgc aaaagagtat tagccactgc acacgaaggt 3720 catccaggca ttactatgat gaagaatcac cttagaacaa atgtttggtg gccgaaaatg 3780 gacgctgaag tagagagata tgttaaacaa tgtagagggt gtacattggt tgctgcacct 3840 gaaccaccag agccaatgca acgtagtcag cttccatcat ctccatggca aacgatagca 3900 cttgatttcc tgggacctct tcctgagggg cagcatatac tagttgtaat agattgttac 3960 agcaggttta tggaggttgt agagatggaa agtacaactg ccaaagatgt tgtacgggaa 4020 ctctcaatca tgttcagccg atatggaatg cctagcgcac tgaaagctga taatgcacca 4080 cagttgagtg ctgaatgtgc cgagtttcgt gagttttgcg tagctaatgg tataaagctt 4140 ctgaacacaa ttccctattg gccgcaatca aacggagagg ttgagcgcca aaaccgctct 4200 attcttaaac gactgcgaat ttcccaagaa ctgggtcaag attggagggt agaattgagg 4260 cgttatttgc tgacttatca ctcgactaag catccaagca caggtaaatc cccaggtgag 4320 ttgatgttcg gccgtcacat caaaagcaaa ttgccaactg tgatcaattt tgatgaggat 4380 gcctgtgtac gcgagcgtga tgctatttct aaagagaagg gtaaagtgta tggcgataag 4440 aagcgtcagg ctagaggaag tgagttactg gaaggggatg tggtgcttgc aaaacgaatg 4500 cgtaaagcta gcaaggtaga tagtgatttt gctaatgaag aattcatcgt caaagaaaag 4560 actggaactg atacaatcat tcaatcgaaa gtaagcggga aacagtatag gcgtagttca 4620 gctcatttga agaaaattgg agatcgcgaa acagttgacg acagcgtggt tcctgggcag 4680 gagaaaattg gcgatgaaaa taacagagat ttgccaattg aaccatcgtc ggaagaaaac 4740 cattgtttaa aacgagcatc tggcgctaaa aggatttcta atcagccagt gcggtatcaa 4800 gactatgtaa catactgaat ggatttaaga tataaagtga aactgcagtt tgaaataaaa 4860 atggggat 4868 // ID LIN4b_SM repbase; DNA; INV; 5524 BP. XX AC . XX DT 10-AUG-2009 (Rel. 14.08, Created) DT 10-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Non-LTR retrotransposon; consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN4b_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5524 RA Jurka J.; RT "Non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1906-1906 (2009). XX DR [1] (Consensus) XX CC The 5' and 3' termini are approximate. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 55..1302 FT /product="LIN4b_SM_1p" FT /translation="MILNLNKKINCKNTELNPSFDIDDIQFTPTPPHIDPG FT FDINDLKFTETPTEDTNPXPIFHQPDLNNSITENDFDTDSLGEIEYPKPST FT SRLRTKSVDLDAHDSTPSDFYEPLPNTLESSQSLFSPPQTKSIVMQYPDMF FT SSKEAPNLNGHFELYFAREPNPDRYDLANPNIAMTSQPPIIPSAVXSPXSS FT DPIKHILLSNEEILTYNARSPFLLRNRDPIFRLGSEGYLKCVVSKCKPGKK FT IGPLMLDKLHSHMHTYHNRTIKPTDTFDCLICQKNKNKLTIYNLKSIEEHF FT TDLHPDKQIPGTREQYNDKRKILTFDLNEEGTLVCAYLDRKTPCSTHLAYD FT TDLLEIKRHITTKHKKKTFDPEKSVRCYCGTTIDIDDIKTHFSIHNSTPTI FT XEIENDPPNHIPVTKSDIGSD*" FT CDS 1330..4620 FT /product="LIN4b_SM_2p" FT /translation="MLTYATLPTTIFVDSFIGKAFVSKNFKTFFSSFPFHK FT YPNWNDLLWPLNLEKNLWVLFYMNKTKRIATFLDPTCENSTEKHHSMTESI FT ANVLTNLQNIIDKIPIKITELSYPQCPLIKDSGFYICHFASCLIKNSNITL FT PNIGEIKEKLNKTIIKIINNKKDVSDITKYKPFINDLITQIRDNCITPDDS FT LRELYRIHEATNPERYKTKHAVPNFLNNKDRNKLVFELRQGFDLRQKVTIG FT RILNPLSSIRTTPTQSDLINNFSKEPKHMTNILHKETPILSPEYRSHTKIE FT DEEMKKAYSQLDTSSSPGSDKITYSDWRKMDPDFEYLTELFNQIIKTGRSP FT TTWKTFRTKLIIKPGKEHSPHEVSSWRPLAILDTAYRFFASIINNRLLSWI FT ATNNLLSRNQKAVGTPDGCAEHNTVISLAKEWAVRNGSDINIVWLDLADAF FT GSIPHNLIWHTLSRLKLNKQTINLIKEMYTDCYSIYECEDKKTKEIKVNNG FT VKQGCPMSMTLFSLSIDFILRNILLDHPLVINNYNISIMAYADDIVLISKN FT RTEMRSMLKDITKYTEMATLKFRPNKCGYFQLKRNHTDPSLKLYDENIPII FT GENNLYKYLGVDFGQKGKHSIDDTLDLALNDTEKLFNSDLHPAQKIQAYKT FT HIHSRLVFIFRNCHVNHMILDSNRNKIVQHREKQLGFDQKIKRKLKDALQD FT IHQNLNNNFLYANIKMGGLGIVPSIDEYLIQSVAHLIKILNSRDTEMKKFI FT IKELIAITQNRFPNQIADIDLSLRWLNAEIKGEKYVPKTXFSKFCNSIRRI FT NEKFNIIIKITTVKERFELEITGKNFKTTIDIDSLKETSNIIHDLTSEWYA FT DQWFIMTCQGHIAKTIGGNRQSAYLIKHSALNDQQFYFLIKARNNMLSLNY FT NTHRVKESANTSCRLCNKEPETQAHIFNHCTQTCNARRNKHNNVMEKASEY FT LISKGFXVDVEKPPPGIETRLRPDLIIKSKRNNLIHILDIKVPYDQLINFD FT NAKEENYKKYKDLALEIGKANNCHTTVTALVVGTLGSWDGENDRTLTKIGL FT NRTEIKKLAKICMTTAVISSYRIYMDHVTNRISNIGTQN*" XX SQ Sequence 5524 BP; 2069 A; 1203 C; 895 G; 1339 T; 18 other; aagttagtaa aacaaataag cgaccagaac ctgataatca tagagaccca acaaatgata 60 cttaacctta ataagaaaat taactgtaag aataccgaac tgaaccctag ctttgatatt 120 gacgacatcc aatttacacc cactcctccc cacattgacc ccggctttga tatcaacgac 180 cttaaattca cagaaacgcc tactgaagac acgaaccccc nacctatatt ccaccaacct 240 gacctaaata actcaatcac agaaaatgac tttgataccg actctttggg ggagatagaa 300 taccctaaac cntctacatc aagactgcgc actaaatcag ttgacctgga tgcccatgac 360 agcacaccct ctgacttcta cgaacctctt ccaaacaccc ttgaatcttc acagtctctc 420 ttctctccac ctcaaaccaa gtcnatagtt atgcaatacc cagacatgtt ttcntcnaaa 480 gaagccccaa accttaacgg acacttcgaa ctntactttg ccagagaacc aaaccctgat 540 agatacgact tagctaaccc taatatagca atgacctcac aaccacctat tataccttct 600 gcagttgnta gcccancatc atctgacccc atcaaacata tccttctatc taatgaagaa 660 atattaacgt ataacgcaag atctcctttc ctccttagaa atagagaccc catattcaga 720 cttggcagtg aaggctacct aaaatgcgtt gtaagcaaat gcaaacccgg caaaaaaata 780 ggaccactga tgctagacaa gctgcatagc cacatgcaca cttatcacaa taggaccata 840 aagcccacag ataccttcga ctgtctgatt tgtcaaaaaa ataaaaacaa gcttaccata 900 tacaacctta aaagtatcga ggaacacttt acagacttac accctgacaa acaaatccct 960 ggcactcgtg agcaatacaa cgacaaacgg aaaatattaa ccttcgacct taacgaagag 1020 ggcacgcttg tctgtgcata cttagataga aagaccccat gctctacaca cttagcttac 1080 gacaccgacc ttctcgagat taaacgacat atcactacca aacataagaa aaaaacattc 1140 gaccccgaaa aaagtgtaag atgctactgt ggcacgacta tagatataga tgatatcaag 1200 acccacttct ccatccataa ctccactcca actatcgnag aaatagaaaa cgaccctcca 1260 aatcatatcc cagttactaa atccgatata ggctcagatt gactcacaca gcaccacatc 1320 tccatcttta tgcttactta tgcgacactg cccaccacca tctttgttga cagctttata 1380 ggtaaggcat ttgtaagcaa aaactttaaa accttcttct cttcctttcc gtttcacaaa 1440 taccctaact ggaatgacct tctatggccc ctgaatctcg agaaaaacct atgggtactc 1500 ttttatatga ataagaccaa acgcatagcc acctttcttg accccacctg cgaaaatagc 1560 actgaaaaac accatagcat gactgagtcc atagccaatg tattgaccaa cttgcaaaac 1620 ataatagata agatcccaat taagataact gaactctcct acccgcaatg cccgctaata 1680 aaagactcng gcttttatat ctgtcacttc gcctcctgcc taattaaaaa ctcaaacatt 1740 acgctaccca atataggaga aattaaagaa aaattaaata aaactataat taaaataatt 1800 aataataaaa aagatgtctc tgacatcact aaatataaac catttattaa tgacttaata 1860 acgcaaataa gagataactg tataacgcca gacgacagcc taagagagct ttataggatc 1920 cacgaagcga ctaatcctga gagatataag accaagcacg cagttccgaa cttcttaaat 1980 aacaaagaca gaaacaagct agtgttcgag ctacggcaag gctttgatct caggcaaaaa 2040 gtcactatag gtaggatact taacccatta agcagtataa gaaccacacc tacccaatct 2100 gacctaatta ataacttttc caaagaaccc aaacacatga ccaacatatt gcacaaagag 2160 acaccaatcc tatccccgga atacagatcg catacgaaga ttgaagacga agaaatgaag 2220 aaggcctaca gtcaactgga caccagctcc tctccaggca gcgacaagat tacctactct 2280 gactggagga aaatggatcc tgatttcgaa tacctgacgg agctattcaa tcagatcatc 2340 aagactggac gaagcccgac gacatggaag accttcagaa caaaactgat cataaaaccg 2400 gggaaggaac attcaccgca tgaagtatcc tcttggagac ctcttgccat cctcgacaca 2460 gcttatagat tttttgcctc tatcataaac aacaggctat tgtcctggat agcgacgaac 2520 aatctattaa gcaggaacca gaaggcagta ggtactccag atgggtgcgc tgaacataac 2580 acagttatct ctctggcaaa ggaatgggca gtcaggaatg ggtcggatat aaacatcgtg 2640 tggctagacc tcgccgatgc ttttggaagc atcccccata atctgatatg gcatactttg 2700 tcgagattga aactgaacaa acagacaata aatcttatta aggaaatgta cacagattgt 2760 tactcaatat atgaatgtga agacaaaaaa actaaggaaa taaaagtaaa caatggtgtt 2820 aagcaggggt gcccaatgtc catgacccta tttagcttat ccattgactt tatacttagg 2880 aatattcttc tagaccaccc acttgtaatt aataactata atataagtat aatggcatat 2940 gccgacgaca tagttttaat ttcaaaaaat agaacagaaa tgagatcaat gctgaaagac 3000 ataaccaaat atacagaaat ggccacccta aaattcagac ccaataagtg cggatatttt 3060 caacttaagc gcaaccatac tgacccctct cttaagttat atgatgaaaa tataccaata 3120 ataggagaaa ataacctcta caaatactta ggggtagact ttggccaaaa aggcaagcat 3180 agcatagatg atactcttga cctagcactt aacgataccg agaaactttt taactctgac 3240 ttacaccctg cacaaaaaat acaggcgtat aagactcata tccactctcg cctagtattc 3300 atctttagga actgtcacgt aaatcacatg atattagata gcaaccgcaa taagatagta 3360 cagcataggg agaaacagct gggttttgat caaaaaataa aacgtaaact gaaagacgcc 3420 ttacaagata tccaccaaaa tcttaataat aacttcctat acgccaacat taaaatggga 3480 ggnttaggaa tagtgcccag cattgacgaa tacctcattc aaagcgtagc ccatctcata 3540 aagattttaa attccagaga tactgaaatg aaaaaattta taattaagga actcatagcn 3600 attacccaaa atagattccc aaaccagata gcagatatag acttatccct aagatggctg 3660 aacgcggaaa ttaaggggga aaaatatgtt cctaagacaa tnttttcaaa attttgtaac 3720 tccatacgta ggataaacga aaaatttaac attattataa aaatcactac agttaaggag 3780 cgttttgagc tagagataac ggggaaaaac tttaaaacca ccattgacat agactccctt 3840 aaagagacat caaatataat tcatgactta actagcgaat ggtatgctga tcaatggttc 3900 ataatgactt gtcaaggaca tatagctaag actatagggg gcaataggca atcagcatac 3960 cttatcaagc atagtgcact aaatgaccaa cagttctact ttttgatcaa agcccgaaat 4020 aatatgctaa gcttaaacta taacacccac agagttaagg aaagcgcgaa cacttcttgt 4080 aggctttgca ataaggaacc tgagacccaa gctcatatct ttaatcactg tactcagaca 4140 tgtaatgcta ggagaaacaa acacaataac gtgatggaga aagctagtga gtatcttatt 4200 tctaaggggt tccntgtaga cgtcgaaaaa ccaccccccg gcatcgaaac gcgactacga 4260 cctgacctca ttataaaatc aaaaagaaac aacctaatcc acattttaga catcaaagtc 4320 ccatatgacc aacttattaa ctttgataac gccaaagagg aaaactataa aaaatataaa 4380 gaccttgccc ttgaaatagg taaagctaat aactgccata caacagtaac agcgttagta 4440 gtaggaacac tagggtcatg ggacggtgag aatgatagaa cactcaccaa aataggactc 4500 aatcgaacag agattaaaaa attggcaaaa atatgtatga ccactgcagt tatatcaagc 4560 tacagaatat acatggacca tgtgaccaat aggataagca atataggtac acaaaattga 4620 tgcctcaagc actaagtgct tagacgcaca aacacctatt tagcgaaata tcatccatgt 4680 tcagtaataa taataataat aataataata ataataataa taataataat aataataata 4740 ataataataa taataataat aataataata ataataataa taataataat aataataata 4800 ataataataa taataataat aataataata ataataaaga aaggatgata cttcgctaat 4860 aggtaaataa aactttntta atgtacctac atgtacctag tggcatatta tgcttactag 4920 tgcatcccta cagatgaaag aagataaaaa atgttacttg gggtatggca tacttccaag 4980 tacattagta aaataataat aacagtaaca caagataagg ttgggttatt aatgacactt 5040 gatgtcatta ataaccttgt gatacgaatg atattattaa aagtgaggnt agaagcattg 5100 gatgctcgct agcctacact ctcctaccca tnaatagcca ataatgggta cgagaccacg 5160 cagttcccaa tagagcactg gcacttagat gccactgcct attattagag gacaatgtgt 5220 tgatatctag ccaatatcaa cgcatcagca ggcataacca atcagaggta tgtctgtgaa 5280 gcatcgcana atgatgctca actaaggctc tgaactggca tgagctggag gaggggattg 5340 gcgggagtgc tgtcaattgg aacggagtgt tgtgatggtg acaactgatg atcctcaagg 5400 ggaacacagt aatatctttg aacagtagaa aataaaaacc agctgaaact ctcacgtttc 5460 tgcttatccg aatggattgt acctactctt cggagaggct ggtaaaacca agcttggctc 5520 acaa 5524 // ID EnSpm-9_HM repbase; DNA; INV; 6579 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6579 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 380-380 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 216..2204 FT /product="EnSpm-9_HM_1p" FT /translation="MADNKRKHYMKSYMRDYRSFKKLFNEEEEGKEDDKEE FT GKEDDEEEGKEDDEEEEGKEDGEEVGKEDDDDEECKEDDEEETNENDDEEE FT EGKKDDDEDKEDDEVVGYSEKIDNAESEVPQSVKTRNFLHKWAVENNIKSK FT PLNELLTHLNSFMKDIPKSGTSLKNTLRSVDVENVAGGDFVYISLKIALNN FT YVKCFRKPAEFDLIINVDGTQIYNSKKKQMWPILCTINQRGPYAIALWYGT FT GKPTVLGEYLKKFVAELKFYFINGYEGLKVIIKAFVCDAPARAFLKCIKSH FT SGYDSCERCEEHGEYHKGIRLLGTKSLIRDDKKFKEKFYSEHQIGKSPLSK FT LEIPMVTGFVLDYMHLVCLGVTKKLMKNYINNIGSSVNSKISSELLEIRKC FT TPTDFQGWPRSLDEFSDFKATEFXFFLLYAGIVVMKGKISHDEYKLLLALS FT VAMHILLSETMVKDNSMVCYCKNLLEWFTTESIVFYGPLFASYNVHSIIHL FT ADDVINHKVSLNKISAFPFENYLGKIKRMIHSGTYTVAQVVKRLDEHSKVV FT HLLPAKRSVIFNTNFRNRVYYISKIGFVIVEKVVXNKYFCKLFKFSKLTNF FT FESPLPSSVIHIYHLTTLSYCKKIELERNALQHKAIMMPYDKGGYVLIPLL FT HCDLNDNFTNLQF*" XX SQ Sequence 6579 BP; 2432 A; 812 C; 1008 G; 2296 T; 31 other; ccctatgcgc gcatcactag tgatttagta ctgctcaacg gtgattaagt ggtgatcact 60 agtgatcgtc attcaatcac cactctatca ctactaacac ataacttatt tatacgcaaa 120 aatcaatatg attttatgta agcggtgatt caatgcgtat taagtagtga cggtttagta 180 atatatcaaa acacgaaaat agttctcaat aaaaaatggc agataacaag cgtaagcact 240 atatgaaaag ctacatgaga gattatagaa gttttaaaaa gttgttcaat gaagaagagg 300 agggtaaaga agatgacaaa gaagaaggta aagaagatga tgaagaagaa gggaaagaag 360 atgacgaaga agaagaaggg aaagaagacg gagaagaagt aggtaaagaa gacgacgacg 420 acgaagaatg taaagaagac gacgaagaag aaactaatga aaacgacgac gaagaggaag 480 aaggtaaaaa agacgacgac gaagataaag aagatgatga agtagtagga tatagtgaaa 540 aaattgataa tgcagaaagt gaggttcctc aatctgtcaa gactcgaaat tttcttcata 600 aatgggctgt tgaaaataat ataaaatcta aaccgttaaa cgagcttttg actcatttga 660 actctttcat gaaagacatt cctaaaagtg ggacatcttt aaaaaatact ttaagaagtg 720 tggatgttga aaacgttgct ggaggtgatt ttgtttatat ttctttaaaa attgcactca 780 acaattatgt aaaatgcttt agaaagccag cagagtttga tcttattata aatgttgatg 840 gcactcagat atacaatagt aaaaaaaaac aaatgtggcc tatattgtgt actataaacc 900 aacgaggtcc atatgcaatt gctttatggt atggtacagg aaaacctaca gttctcggag 960 aatacctcaa aaaatttgtt gcagagttaa aattctattt tataaatggg tatgaaggcc 1020 taaaagtgat aattaaggca tttgtatgtg acgcaccagc cagagcattt ttaaaatgta 1080 taaaaagtca tagtggatat gacagttgtg aaagatgtga agaacatggt gagtatcata 1140 aaggtataag attgttaggt accaagtctt taatacgaga tgataaaaag tttaaagaaa 1200 aattttattc agagcaccag attggtaaat caccattatc caaacttgaa attccgatgg 1260 tgactggatt tgtacttgac tacatgcatt tagtttgtct aggggttaca aaaaaactta 1320 tgaagaatta tattaacaat ataggttcct ctgtaaacag caaaatttct tcagagttgc 1380 tcgaaataag aaaatgcaca ccaacagact ttcaaggttg gcccagaagt ttggatgaat 1440 tttctgattt taaagcaact gaatttygct tttttttatt atatgcagga atcgtagtta 1500 tgaaaggcaa aatatcgcat gatgaatata aacttttatt ggctttatca gttgctatgc 1560 acattctttt gtctgagaca atggttaaag ataactcaat ggtttgttac tgtaagaatt 1620 tgttagaatg gtttacaaca gaatcaattg ttttttatgg accattgttt gcatcttata 1680 acgtgcacag tattattcat ttagctgatg atgttataaa tcacaaagta tcattaaata 1740 aaatatctgc atttcctttt gaaaattacc ttggaaaaat taaaagaatg attcatagtg 1800 gtacatatac agttgctcaa gttgttaaaa ggcttgatga acatagtaaa gttgtacatc 1860 tattaccagc aaaaagatca gtgatcttta atacaaattt tagaaataga gtttattata 1920 tcagcaaaat aggttttgta attgttgaga aagttgtara aaacaaatat ttttgtaaat 1980 tgtttaaatt ttcgaaacta acaaattttt ttgaaagtcc tttaccaagt tcagtcatcc 2040 atatttacca tttaacaact ttgtcatatt gcaagaaaat tgaattggaa cggaatgcat 2100 tgcaacataa agccattatg atgccttatg ataaaggagg ctatgtactt attcctcttc 2160 ttcattgtga ccttaatgac aattttacaa atctccaatt ttgattatta cttttacaca 2220 tttggtctta attatatttg aagtaaattt tttttttgtt caacatcttt gtttccaaca 2280 aggctgcaag caaccactgt tagagtatta ttagcagtat tataaaacaa ccactattag 2340 cattggaagt tactggatca gagaaaagat gaggtttata gagcaaaata acgacttaca 2400 gacaacttag aaaattgcaa attatataaa tcagtaaagc aagatgaagg taccaaattc 2460 caaagaacwg atgtttgaga aaaaaaacta ggcgaawaag agtttttgga gcacttagga 2520 ayagtcacag agaaaggatg agacctaatt gartgactag taacacgaga atgaatttta 2580 gtagatggca caagagacgc tagctcttta gagcagtgcc gttatagtat ttgtagaaaa 2640 gagaaagaaa agcaacatta cgacaatgcg ataaacaatg tttgcaatgc atttttgcac 2700 cttgtctata aaaaaagggc atcattggaa gatccgccca gatatggcaa cagtattcca 2760 tacaagattg atttgagatt tatagagata aagaatagaa tctggagtaa gaaagtgtcg 2820 agcatgataa agagacgcaa cctgagcaga tgttaatttt gcaatggatt tgatatatgt 2880 ttctaaatta attttataat gattttacac ttaaattttt atacaaagta gcataacaaa 2940 ataatgaatt ttttatttaa tgaaattgtt gttgatttgt atatatatat atatatatat 3000 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3060 atatatatat tataaaaaaa ttgtttattt taaaaaacaa aaataataag aaaaagctcc 3120 cttgctttaa tgagtataaa aatctatttg ttaaattaac cctgattata aataaaatat 3180 atatatatat agttaatctt taattatttc tttttttatt ttcttatgat gaaatgttta 3240 attataatat tcactataaa ttttttatca atacctgttt aatactgttc caactatgta 3300 tatatattaa tagttaataa caataaaaat attggtttat tttaattrtt aattactttt 3360 tattattgtt cgagctgtta tatttattac ttatttttca gttttatttt tatttaaaaa 3420 aattgataat ttgttaaagt aaaacggctt gaacctccat attccattga atatcaatta 3480 cttaaattgg attgaagaaa aaaatgatag cttttgggtg tgctggcctc cagttcctaa 3540 tgtgaagcca ttaattttaa acaggcaact accaaaaaaa gactggttca ggtttatttt 3600 gttgaagaaa tatggaagtg gaagtaagtt taagtcatgt taaactttta tgaaacaaat 3660 ccttgtttta tgtttttctw tyyatrtttt atttttttag tatttgattc atttttcatg 3720 ttttattttt ttctcattta ttttaggcag tctttttgaa ttgcaaacga aagtgagcta 3780 atagtatgtm ccctcagcca ttgatacaaa cgaagataca gatgatgatc taatctttcc 3840 aagccaaaaa tgtcgaacag ytttaccata ttttatgagt gattctgatg atttgggttg 3900 tttaagttac tttactttat tttgtttggt attatactaa aagatttttt tgacgtatca 3960 gcaaaatttg taaaacaggt atattgttta ttatagataa tgatgttgct tccaatcaac 4020 caaaaaggcc taaattggtt tgcagcagtg atgaagagca agagaccgta atcgacaagt 4080 ttccgttact tccaccatca ctacaaaaat gttctttact ttatctttat ttaaaaattt 4140 atacttataa actttttttt tgttgtgtag tcaatactgt tatatacatt tttcttcttt 4200 gagacaaatt ttttagactt tggtatagca aataatactc tttaaaacac tgaacttaaa 4260 tttagtaaga tcaatgagta ttatttgtta tgctaacaaa gatttttgca ggactttaca 4320 atatacatct aaagtatatc agatatgttt taattagttt atttttgcag atattttaaa 4380 gtgtagtttt ctatcattaa atttaagttt atttctttta agtgccttca aatctaaatt 4440 tgatawataa aaagaagcaa agaatygatt trgagattga tccatwattk caaactaaag 4500 taccatcgtc tcaatcttat ctcctgcagc ataacacaag caatagttca gttttaaaaa 4560 tattatttac ttgatagata tctataarta gactttataa actttttgta acttaaaatt 4620 tytttwtttt yagttttcca gaaaaatgtg atgaaagttt tacaaaaata taaaaaaaga 4680 tttatatgat attaagcaga cccaaaaaga tmttatagaa aagttttcag ttctgcagca 4740 tacaaaatag ataatgttaa tacgtttgtc ttaattacta ttgacaatgt tgaggatttg 4800 gggaagttag aaagtaaatt agaagacaaa aatatatatt tgcagctggt atgttcacta 4860 gtatatttta tcaawtttgt tattatttat aacacataat ttataacact gtattagtca 4920 gttaaaaaac tagttcaagc taaagatatt tttttaaatt tcaaatacat ataatgttaa 4980 ttagtaatat catattgtat tgcttttagt ttgttacagg taggtaaatg ttcactgttt 5040 tttttgcagt agtatttaac caaatagtga aaagtaaaca gcratagatg caggaattca 5100 cagtagaggt gtggaatgag tttaaactta ttctaaaawt caaaatcttt aataaaaatt 5160 catgaaaaaa tattgttgta aatgtctttt acatctcaag ayayctktkt tatttccata 5220 gcaatttgat atagtatttt caagattttt gcttgctatt ctgatgcttt aaatattctt 5280 gcagagattt tttacttgtt acttaatata tcatgttatt atatataata aattattatt 5340 acaggtggca gttttagctc gtgcttgtga catccaccct aaaaaaagta tctatgaggc 5400 aatgaawaaa cttttaacta aagatgttgc ttcgaaatat aatataacgg gcactgccca 5460 gaagtgtaat ttttgcacgg aattcaaaaa tatttattca gccattatag gtatatatac 5520 aaaggaacaa gtttacacat attttaataa atgtgactaa gtcatgtttc aaatatacaa 5580 aggtatttca atttcatggt ttaaagttct gtatcttttt tatttagagg cagttagtag 5640 tcagcatgct gacaccaaag atattcaact ccacagtttt attggagagt acttgcgaca 5700 ggctggtgtc ttgaatatcc ggaacgagaa gaaaaatatt gaaaatcaaa ataaaacttt 5760 ataaaaaaaa atttttctta tactgataaa atattttttc aactgctgtt ttgtaactta 5820 aataaacaaa atcttctaac ttttgttgtt tacattattt cagtttgaaa aactgattta 5880 tgtcttttat ttttgttaat ttgtttatga agtatcatcg tactctaata aaatattgtt 5940 aaaatatttc tctaaatttt atcaaagttt ttttttttta gatgcagtca atagcatgaa 6000 aatgtaaaag attatgaaaa ctaaataatc aatatgtttg tattatttta gaaatagaat 6060 atcagaagtc taactgtagt tttattaaaa acgaattaat gattattttt aaatatatta 6120 attatcgcaa tccctgcaaa cactaaaatt tggcgttcaa ataattattt atcaatatta 6180 cttaaatata tcataaatga tatataataa gtactttatc actattaaca ataagtaatc 6240 gcaaataaat aacgagtgca tcggcgttaa atcaacagtg atcatcttta tgtacttact 6300 attaataaac caccactcaa tcacagtaat acaaagttta caaaatcacc attgaataag 6360 tacttaatca ccgcttacga ttagtattga atcactacta aatcaatagt gtatcgctgt 6420 taaatcagcg gtgatagtca ctacttaatc actagtaaca aatcaccact caatcactag 6480 tgataaaaag tttacaaaat catcgtttag taagtactta atcaccactt acgataagta 6540 ttgaatcact actaaatcac tagtgatgcg cgcataggg 6579 // ID T2 repbase; DNA; INV; 335 BP. XX AC X15618; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE S.mansoni retroposon-like repetitive element t-2. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; T2; Repetitive sequence; retroposon. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-335 RA Spotila D.L., Hirai H., Rekosh M.D. and LoVerde T.P.; RT "A retroposon-like short repetitive DNA element in the genome of RT the human blood fluke, Schistosoma mansoni."; RL Chromosoma 97(6), 421-428 (1989). XX DR GenBank; X15618; Positions 1 335. XX SQ Sequence 335 BP; 108 A; 82 C; 71 G; 74 T; 0 other; aatttcaact tcacaccatt gcacaagcaa gtggctatca aaactcagtg gctgagtgga 60 caacgcgatg gcgtttgaag cgaaagctac tgggttcgag tcccagagtg aacatcaaca 120 ctgagatgca ggtacatcca actgaccagt cggaaattgg acgaaacgcg cgtcctggat 180 tccactgcta gccaccatcc atctttgctt accatgcttg tgaatttagg ctatatcgag 240 gcaatacgca cagtatgcac atatgacaat tacagactga ccggttgcag tcctaaacac 300 atcaatagga agatccaaac aaacaatact aagta 335 // ID PENELOPE_SM repbase; DNA; INV; 2370 BP. XX AC BK000685; XX DT 26-APR-2005 (Rel. 10.04, Created) DT 26-APR-2005 (Rel. 10.04, Last updated, Version 1) XX DE Schistosoma mansoni penelope-like transposable element Cercyon DE reverse transcriptase gene, complete cds. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Cercyon; PENELOPE_SM. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-2370 RA Arkhipova I.R., Pyatkov K.I., Meselson M. and Evgen'ev M.B.; RT "Retroelements containing introns in diverse invertebrate taxa."; RL Nat Genet 33(2), 123-124 (2003). XX DR Genbank; BK000685; Positions 1 2370. XX FH Key Location/Qualifiers FT CDS 1..2368 FT /product="RT" FT /translation="MVHLMITDAHYRIRKYQHVIEDQKRKLQQTLTPEHFE FT TLSTAIETLCKRHKEKRRNILKKEIFKISKPTIEENTNNSVVNLSKQPLTN FT TEEQLLRKRLNYNTSDAPKLAFYAALESLLKTSGISEETQQEIRQSIIPLT FT QRMKGNNQLTSQEQTALKKLRTGKDIVIVPADKGRTTVIMDKEEYIKKAEE FT ILEDKSTYKPMDINPVKKLDNRISKTLNKLVKAGEITKQDQWRVKSNGPVL FT PRFYGRPKIHKPNIPLRPIVALPGTPTYNLSREISRILKHFVDSSNHSFKS FT ATQFLGQIRNIQINDDELMISFDVTALFTSIEPNLAKETLSLLLTNDENLA FT KYTKLQSQSLLELMDLCLTTYFQFNNKIYKQTKGTPMGSPISRLFAEAVMQ FT RLEASILLMIKPKNWIRHVDDTFLIVKKEELEHTYKLINNVFNDIKFTMKQ FT ESNDKLPFLDILITRTNTKKPETQVYRKPTHTDQILNYNSNSPRTHKVNCV FT HTLFKRARTHCYTIPAQLYVVLYLKTILQKNGYLINFIKKYQPHPSSEIKS FT TTEINKRITIPYIKGISETTKRLLKAFRINVAHKPTKSLHSILCKPKDEIT FT KEDKPNIIYKINCANCNKHYIGQSGRPLHLRLQEHQLAVKHHDMSSLISMH FT VDNYRHTFDWKNVEILDRGNSKNTREFVEAWHSGQSAINKHIEINPIYQPI FT KKIMQEHRSKNQTNRRYNQNKEINNDRFKVNKNQSTSRCRGQAVNHMWRNS FT NTSLLPEVCADDLVQKDDESSTTKPSSSENKPTSKSSTX" XX SQ Sequence 2370 BP; 988 A; 561 C; 368 G; 453 T; 0 other; atggttcatc taatgataac ggacgcacat tacaggatcc gcaaatacca acatgtcata 60 gaggaccaaa agaggaagct acaacagaca ctcacaccag aacacttcga aaccttgtca 120 acagccatcg aaacattatg caaacgacac aaagaaaaaa gaagaaatat ccttaaaaag 180 gaaatattca aaatctcaaa accaactata gaagaaaaca caaacaacag cgtggtcaac 240 ctatcaaagc aaccactaac aaacacagaa gaacagctac tccgaaaaag attaaattac 300 aatacaagcg atgccccaaa actagcattc tacgcagcac tagagtcatt actcaaaacg 360 tccgggataa gtgaggaaac gcaacaggaa ataagacaat ctataatacc actaactcaa 420 cggatgaagg gcaacaacca actgacgtca caagaacaaa cagctttgaa gaaactcaga 480 acaggaaaag atattgtcat cgttccagca gacaaaggac gcactacggt gattatggac 540 aaggaagaat acatcaagaa agccgaggaa atactcgaag acaaaagtac atacaaaccg 600 atggacataa atccagtcaa aaaactagac aatcgcattt caaaaacttt aaacaaactc 660 gtcaaagcag gagaaataac gaagcaagat caatggagag tgaaatctaa tggaccagta 720 ctccctcggt tctacggcag accaaagata cacaaaccaa acatcccact acgtccaatt 780 gtagcactac caggaacacc aacgtacaat ttatcaagag aaatctcaag aatactgaaa 840 cattttgtag attcgtcgaa ccactccttc aaatcagcca cacaattcct tggtcagatt 900 aggaacattc aaatcaacga cgacgaacta atgatatcct tcgatgtaac agcactattc 960 acttcaattg aaccaaacct agcaaaagaa actctgtcct tgcttctaac caacgacgaa 1020 aacctcgcga aatacacgaa acttcaaagt caaagcctac ttgaactcat ggatctatgt 1080 ctaacaacat atttccaatt caataacaaa atctacaaac aaacaaaagg gacaccaatg 1140 ggatcaccaa tatcacgact tttcgcagaa gcagtgatgc agagactaga agcctcgatc 1200 ttactaatga tcaagccaaa aaattggatt cgccatgtag acgacacatt cctcatagtc 1260 aaaaaggagg agctagaaca cacctacaag ctgatcaaca acgtcttcaa tgacatcaag 1320 ttcacaatga aacaggagtc aaatgacaag ctaccattct tggatatatt aatcaccaga 1380 accaacacaa agaaaccgga gactcaagtg tacaggaagc cgacccacac tgatcaaatc 1440 ctcaactaca atagcaacag cccaagaact cacaaagtca actgtgttca tactttgttc 1500 aagagagcga ggacacactg ctacacaatt ccagcacagt tatatgtagt tttatacttg 1560 aaaaccatcc ttcaaaagaa cggctactta atcaatttca tcaaaaaata ccaaccgcat 1620 ccatcatcag aaataaagtc aactacagaa atcaacaaaa ggatcaccat accatacata 1680 aaaggtatat cggaaacgac aaagagactg ctaaaagcct tcagaataaa tgttgctcac 1740 aaaccaacaa aatccctcca ttcaatctta tgcaaaccaa aagatgaaat aacaaaagaa 1800 gacaaaccaa acatcatcta caaaataaat tgtgccaact gcaacaaaca ctacatcgga 1860 caaagcggac gccctcttca tcttcgcctg caagaacatc aattagcagt caaacaccat 1920 gacatgtctt cacttatatc catgcacgtg gacaactaca gacacacatt cgactggaaa 1980 aatgtggaga ttttggatag aggcaattcc aaaaacacca gagaatttgt agaagcttgg 2040 cactcaggtc aatcagcaat aaacaaacac attgaaatca atccaattta tcaaccaatc 2100 aagaaaatta tgcaggaaca taggagcaaa aatcaaacca ataggagata caaccaaaac 2160 aaagaaataa acaatgacag atttaaggtc aataagaacc aatcgacatc gaggtgcaga 2220 ggacaagctg tgaatcacat gtggagaaat tcaaacactt cacttctacc cgaagtttgt 2280 gctgatgatc tcgttcagaa ggacgatgaa agctccacga ccaaaccatc cagctcagag 2340 aacaaaccaa catcaaaatc atccacctga 2370 // ID LIN2_SM repbase; DNA; INV; 8114 BP. XX AC . XX DT 17-OCT-2007 (Rel. 12.1, Created) DT 16-AUG-2009 (Rel. 14.09, Last updated, Version 3) XX DE Non-LTR retrotransposon (consensus). XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN2_SM. XX NM LIN2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-8114 RA Jurka J.; RT "Non-LTR retrotransposon from Schmidtea Mediterranea."; RL Repbase Reports 7(10), 1093-1093 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(7..1143,1110..2219,2223..6773) FT /product="LIN2_SM_1p" FT /translation="MKIEIGEFAALFQRFFEQLNAKSKQEEEFRNESVPGS FT RVPSEDMPVFQINERFHALERSNELLNGKMDSIVNTLNYLISGLNGNFDNN FT NKLNCICKTNHVKEPLSIVKTNLVSELPSALLVQNKVPCLKLSEEDLIRKI FT PTKIVPSKKVSKKTIEHTSRVNFSLFVRVLDSNERQQFFKLFPNLXALLKX FT KESEKKSEVLNIELTKISKKDNPLNIAGFKNFFFRNNNICVTIDFYNSSKV FT KSKITKNLKIKTSLINSKIVKNYNTNKFIPASLSIKKFLFVLSDKLAETTN FT ISNVTVREEQINKIITLLKIKNSLPDYLVKSQIKYKDGDFLMLDLLNSCEI FT EGELGQKITELSRSQIKELFNSFPEIQELFFIEKKANSGTFFHRKKSIDEN FT PEKLINLINKFKIEKTFPQQISFVGGKFVKIDKLNDSQFPQVESESDIMFN FT LGLFSLFGLSEYDNILNSLLCQPKSPEKLVDIIFHLNYLNLKGNFDTSTNG FT TYENYVQNYIVNCLLMHSILDLETCAFLNLDNIELWAPKLIELIPSSSHYS FT SSCKEKLLKLKQWKLSGEIDRIHNIICPETIIEINNFLGVIDQSYTTKSDQ FT LKFLTSLGEKINNIEIDNDVDDLVKAKRMLYFIIRDKKRFLNKDNFNYRIP FT FNCRKICTLKMSDFITKLIQSNELFEEREAVVDNNIDVSPNENLNILIVEP FT VNPTPDTVIVGQMDTFPTDIDETEPVTIDKPLKKITSKQAVKPKAQKSKRE FT VPASVSISVPISEETDADSTIINYFFPERDPKIWFTDDDIDYYLESHIGNP FT NFAPVKCFIVEILSSNCKDEIFPMPDNLFKAQIILCPLNINESHWILFVYC FT KLTKISYFVDPILRKRHNLINKSSALKITMALNNLFDLKAAVTFHPHDNLV FT YQDNNYDCGPYICAYAMLISQEWQTFPNNFIETIRKNVHRFQTSKFQNPPK FT VSGGGITTLPIFKRFQNVTRPPKVALPYKASLQNENSCVPSLIKEWFSNKN FT YITVLSPELTLNLIAHNNQFIRENVTFRQFTRVKYVIAVIPNNPHWCLYIF FT SPVFEHSYILNFQGRVLNERLVSIGESMTDYLNKFFISTAKVNFIVNHNQT FT HFSFRSEDPHFFASAFIEFLERFFFGNILSFTKINETLKSFKLQITNEICK FT NLNLINFSINDEIITKYFKNLNLPSNFIFLNSAMCTAIMDDCTRYLNDFLN FT YDNIIKAQVIFAIVAPPKFREILFIIDYNTDEYYLLNPTTLLGNDNYLSTC FT KILRNKINEIRESPNRVLNLGECPHDIRGSGLVSKILFCSYMSNYANDLPL FT VGLNLRQIGNVVSSFLPIVSEIKKVKSLENNIINQNHKESENLKERKNKID FT QLLIKVFKMNVDEIVDVILDQISIKTVPTHKPYLGNKERVESQINKSNLIK FT NFNSDMKTTVNTIINDVDVDVRPKYENIITNFEHKIPLDRTWDKVFKFCNR FT SDKKLKLKYLSNSEVLFELKKADNTSPGVDGIKYRDLVMLDPEGKLLTFLF FT NKIISKKVIPDSWKSFKTLLIPKPDKGDKYDNVSSWRPIALLSVIYKLFAS FT CLANRLTFWINKNNLLHIGQKGGSRHDGCVEHNSILSSALEHSKYSKNSPL FT AIAWLDIKDAFGSVPHAYMWSLLRFVGVDEEFVTILERMYSDTSTFYSCGP FT ILTPNIAIRQGVKQGCPISMILFALAINPVLEAVSRSKCEPFMIGESAVKI FT LAYADDIALVSKSAYDLQRVTNIAVAIASEIGFEYRPEKCGYIELPLIETE FT SQILINDIKIKKLASSEFYQYLGVPVGEETDQSPYDILKKVVSDTRKIADS FT DLLGWQKLKAYKIFLHSRLVFPFRTREIKTSALAKSNANNNLTVNITSQLR FT NCFRKMMSLPNHSEVCYFYNSTENGGANCVDLLDEYHTQTITHFFRLFTSS FT CEFARKVNLDSLNFVAGPRLGIINPTLQERFDWINGKEPKPRHSGKKTRFQ FT RARIAIAFFKKEHDINIMFIVTKGQPSLFITSEKRGTFILTPDLRKTTSKV FT LHMALCDSYLSKWEKSCTSNFIASAIKLSPKINKAIYRGDISEFAWHFIHR FT ARTNTLSINAKPHNKGNKRLCRLCHTEDETMSHIIQSCKIHLTLGSERHND FT CLDLISSHLSKNSNLIVVVDHVCSLVPESKERVDLMITDMVRKKIFLVDMK FT CPCDTINNFALVDLLNLQKYDSLKMQIQDAKPDFKVELDTCIIGSLGSFPP FT KTPELLLKIGVDRTHLKGLLKDCALSNISHSARRWH" XX SQ Sequence 8114 BP; 2746 A; 1294 C; 1428 G; 2639 T; 7 other; aacgttatga aaatcgaaat cggagaattt gctgctcttt ttcagcgttt tttcgaacaa 60 cttaatgcaa aatcgaagca ggaagaggaa tttcggaatg aatcggtgcc tgggtctcgt 120 gtgccttctg aagatatgcc tgttttccag attaatgaaa gatttcatgc attggaaaga 180 agtaatgagc ttctgaacgg taagatggat tctattgtta atactttaaa ttatcttata 240 tctggtttga atggtaattt tgacaataat aataaattaa attgtatttg taaaactaat 300 catgttaagg aacctctttc aattgttaag actaatcttg ttagtgaact tccttcagct 360 cttttagtcc aaaataaggt tccttgtctt aagttatcag aagaagattt gattaggaaa 420 ataccaacta aaattgttcc ttctaagaaa gtttctaaaa agacaataga gcatacttct 480 agagtaaact tttcgctatt tgttagagtt ttagattcca atgagaggca acagtttttt 540 aagctatttc ccaatttatt wgctttattg aaargtaaag aatcagaaaa gaaatctgag 600 gtattaaata tcgagttgac caaaatttct aaaaaagata atcccctaaa tatagctgga 660 tttaaaaatt tcttttttag aaataataat atatgtgtta caattgattt ttataatagt 720 agtaaagtaa aatctaaaat tactaaaaac ttaaaaatca aaacatcttt aataaactca 780 aagattgtta aaaattacaa tacgaataaa tttatccctg cttctttatc gataaagaag 840 tttttatttg ttctttcaga taaactggct gagacaacta atattagtaa tgtcacagtc 900 agagaggaac aaataaataa aataattact ttattaaaaa ttaaaaatag cctacccgac 960 tatttggtaa aatcccaaat taaatataag gatggggatt ttttaatgtt ggatttgtta 1020 aatagttgtg agatcgaggg tgagttgggc caaaaaatta ctgaactgtc ccgttcacaa 1080 attaaagaac tatttaactc attccctgaa attcaggaac tttttttcat cgaaaaaaaa 1140 gcatagatga aaaccctgaa aaattaatta atttaattaa taaatttaaa attgaaaaaa 1200 catttcctca acagatctca tttgtggggg gtaaatttgt taagattgat aaattaaatg 1260 attctcagtt tccacaagtg gaatctgaat ccgatatcat gtttaacctt ggtttgttta 1320 gtctctttgg tttatctgaa tatgataata ttttaaattc ccttttatgt caaccaaagt 1380 ctccagagaa actggttgat ataatttttc atttaaatta cctaaattta aaaggcaatt 1440 ttgatacttc aacaaacgga acttatgaaa attatgttca aaactatatt gttaattgtt 1500 tgttaatgca ctcaattcta gatcttgaga cttgtgcttt cttaaatctt gataatattg 1560 aattgtgggc tccaaaattg attgaattga ttccttcttc ttcccattat agttcaagtt 1620 gtaaagaaaa acttttaaag ctcaaacaat ggaaattgtc tggtgaaatt gacagaatcc 1680 acaatatcat ttgtcctgaa actataattg aaatcaataa ctttcttggg gtgattgacc 1740 aatcttatac taccaaatcc gaccaactga aatttcttac tagtttaggt gaaaaaatta 1800 ataacataga aatcgacaat gatgtagatg atttggttaa agcaaagaga atgctttatt 1860 ttattattag ggacaaaaaa cgatttctaa ataaagataa ttttaactat cgcataccct 1920 tcaattgtag aaaaatttgt acactaaaaa tgagtgattt cataactaag ctcattcagt 1980 caaatgaact ttttgaagag cgagaagcag ttgtagacaa taatatagac gtatctccta 2040 atgaaaactt aaatatctta atagtggaac ctgtcaatcc cactcctgac acagttatag 2100 ttggtcagat ggatactttc ccaacagaca ttgacgaaac tgagccagtt actatagata 2160 aacctctgaa aaaaataaca tcaaaacaag ctgtaaaacc caaggcacaa aaatctaaat 2220 gaagggaagt gcctgcatca gtttcaatat ctgttccaat ctctgaagag actgatgcag 2280 atagcactat aattaattac ttttttcctg aacgcgaccc caaaatctgg tttaccgatg 2340 atgatattga ctattatctt gagagtcaca tcggtaatcc aaactttgca ccggtcaaat 2400 gttttatcgt cgaaatttta agttcgaatt gcaaagacga aatatttcca atgccagata 2460 atctttttaa agcgcaaata atattatgcc ctcttaacat caatgaatct cattggatcc 2520 tttttgttta ctgtaaattg actaaaattt cctattttgt tgacccaatt ctccgtaaac 2580 ggcataacct tattaataag tcaagtgcac ttaaaatcac gatggcttta aataatcttt 2640 ttgatctcaa agctgctgta acctttcacc ctcatgataa ccttgtttac caggataata 2700 attacgattg tggaccttac atctgcgcgt atgctatgct gataagccaa gaatggcaaa 2760 catttcccaa caactttatt gaaacaatac gtaaaaatgt ccatcgattc caaactagta 2820 aatttcagaa tccaccaaaa gtctctgggg gtggtataac gacattgcct atttttaaac 2880 gcttccaaaa tgtaacaaga ccaccaaaag tagcacttcc ttataaagct tctctccaaa 2940 atgagaattc ctgtgtacct tctttaataa aagaatggtt tagtaataaa aattatataa 3000 ccgtattgtc tcctgaattg acattgaatc ttattgcaca taataatcaa tttataagag 3060 aaaacgtcac atttaggcaa tttacgagag taaagtatgt aattgctgtc attccaaaca 3120 acccacactg gtgtctgtat atattttccc ctgtattcga gcacagctac atcttaaact 3180 ttcaaggcag agtcttaaat gagagacttg tcagtattgg tgaaagcatg actgattatc 3240 tgaataaatt cttcatttct actgcgaaag tcaattttat tgttaatcat aaccagactc 3300 acttctcgtt tagatcggaa gatccacatt tttttgcttc tgcgtttatt gaatttctcg 3360 agagattctt ctttggtaat atcctttcat ttactaaaat caatgaaact ttaaaatcat 3420 ttaaactaca aataacaaat gagatctgta aaaatttaaa tttaattaat ttttcaatta 3480 atgatgaaat tatcacaaaa tattttaaaa atttaaattt acccagcaat tttatattcc 3540 taaattcagc aatgtgcact gctattatgg atgattgcac tcgctattta aatgattttt 3600 taaactatga taatattatc aaagcacaag ttatattcgc cattgtagca cctccaaaat 3660 ttcgtgaaat tctttttatc atagactaca acactgatga atattacctt ttaaacccaa 3720 caactctttt aggaaatgat aattatttgt ctacttgtaa gattctaaga aataaaatta 3780 atgaaattag agaatcacct aatcgtgtat taaatcttgg tgaatgccca catgatattc 3840 gtggttcagg tttagtaagc aaaattctgt tttgttctta catgtcaaat tatgccaatg 3900 atttgccttt agttggtttg aatttaagac aaattggtaa tgtggtaagc tccttccttc 3960 caatagttag tgaaataaaa aaagttaaat ccttagaaaa taatataatt aatcaaaatc 4020 ataaagaaag tgaaaattta aaagaacgta aaaataaaat tgatcaactt ttaataaaag 4080 tatttaaaat gaatgttgat gaaatcgtcg atgtaatttt ggatcaaatt tctattaaaa 4140 ctgtgccaac acacaaacct tacttgggca ataaagaaag agtagaatct caaattaata 4200 aaagtaattt aattaaaaat ttcaacagtg acatgaaaac cactgtaaac acgatcatta 4260 atgatgtaga tgtggatgtt agacctaaat atgaaaacat tataacgaat tttgagcaca 4320 aaattccttt agaccgaact tgggataagg tgtttaaatt ttgtaatcgt tctgataaaa 4380 aacttaaact taaatattta tcgaattctg aggtattatt tgagctcaag aaagctgaca 4440 acacaagtcc tggtgttgat ggtataaaat atcgagacct ggtgatgctg gatcccgagg 4500 gtaaactact aactttttta tttaataaaa ttatttctaa aaaggttata ccagatagct 4560 ggaaatcctt taaaacgttg ctaattccaa agcccgataa aggtgataaa tatgataatg 4620 tgtcgtcttg gcgtccaatt gcactacttt ccgttatcta caaactgttt gcctcttgtt 4680 tagctaatcg tttaacattt tggataaata agaacaatct gttgcatatc ggtcagaaag 4740 gtggctctag acacgatgga tgtgtggaac acaattcaat tctttcgtca gctttagagc 4800 actctaaata tagcaagaac tctccacttg ctattgcttg gcttgacatt aaggatgctt 4860 ttgggagtgt tcctcatgcc tacatgtggt ctttacttcg atttgttggg gttgatgagg 4920 aatttgtgac cattcttgag aggatgtatt ccgatacaag tactttctat agttgtggcc 4980 ctatcttgac tccaaatatt gcaatcagac aaggtgttaa gcaaggatgt cctatctcta 5040 tgatattgtt tgctttagca attaatccag ttcttgaagc ggtatcccga tctaaatgtg 5100 aaccatttat gattggtgaa tctgcagtta aaatccttgc gtacgctgat gacattgctc 5160 ttgtatcaaa gtctgcttac gacttacaga gagtaacaaa cattgctgtt gcaattgcat 5220 ctgaaattgg ctttgaatac cgtcctgaaa aatgcggtta tatcgaactt cctcttatcg 5280 aaacagaaag tcaaattctt ataaatgaca ttaaaattaa aaaattagct tcaagtgaat 5340 tctaccaata tcttggagtg cctgtgggtg aagaaaccga tcagagtcca tatgacattc 5400 ttaagaaagt tgtttctgat actaggaaaa ttgctgactc tgacttactt ggatggcaaa 5460 aactgaaagc gtacaaaatc tttttacatt ctcgtctagt atttcctttt agaacacgag 5520 agattaaaac aagtgcacta gcaaaatcca atgccaacaa taatctgact gtaaacatca 5580 ctagccaatt aagaaattgt tttaggaaaa tgatgtcttt gcctaaccac tctgaagtat 5640 gttattttta taattcaact gaaaacggag gggcaaactg tgttgaccta cttgacgaat 5700 accacacaca gacaataaca cactttttcc gactttttac ttctagttgt gaatttgcga 5760 gaaaagttaa ccttgattct ttgaactttg tagctggtcc gcgacttggt attatcaatc 5820 cgactttgca agaaaggttt gattggatca atgggaaaga gccaaaacct cggcattctg 5880 gtaagaaaac tcgttttcag cgagctagaa ttgcgattgc tttttttaag aaagaacatg 5940 atattaatat tatgtttatt gtcaccaaag gtcaaccttc actcttcatt acatcagaaa 6000 aacgtgggac atttattctt actcctgatt tgcgtaaaac tacttcaaaa gtcttgcaca 6060 tggctctctg tgactcgtat ctttcgaaat gggaaaaaag ctgcacttca aatttcattg 6120 cctcagccat taaactttca ccaaaaatta ataaagctat ttatcgaggt gacatttctg 6180 agtttgcttg gcactttatc catcgtgctc gtacaaatac tttgtccatc aatgcaaaac 6240 cacataataa aggtaataaa agactgtgtc gattatgtca cactgaagac gaaacaatgt 6300 cgcatattat tcaatcttgt aaaatccatt tgactcttgg ttctgagagg cacaatgatt 6360 gtcttgatct gatttcgtct cacttgtcga aaaattcaaa tttgattgtt gttgttgacc 6420 atgtatgttc acttgttccg gaatcgaaag agcgagtaga tttaatgatc actgacatgg 6480 tgcggaaaaa aatattttta gttgacatga aatgtccatg tgatacaatt aataattttg 6540 cactagtaga tcttcttaat ttacaaaaat acgattcgtt gaaaatgcaa atacaagatg 6600 ccaaacctga cttcaaagtg gagcttgata cttgtatcat cggctcttta ggttcctttc 6660 ctccaaagac tcctgaatta cttttaaaga ttggagttga tcgaactcac ctgaaaggtt 6720 tattaaaaga ttgtgcgcta agcaatattt cccattctgc tagacggtgg cactaactac 6780 cacaagacgg gtattttggt aaatcttgaa aaatttcgat ttaattaatg tttgagttaa 6840 aattgccatt tgacttgtgt atgtgattta ttgtgttaat taatgaattc caccctcttc 6900 cagggaaagg tttaactttg cataaaatta aactctctgt ctacacttgg tagcaaatta 6960 atctagaatg aagaagcctg atggtatttt tcccaaatct gctgaaaact caattctgaa 7020 ggatggaaat aaaagcagat aaagattcaa tattcgataa agaaatggac ctcaacaatc 7080 aatgtcaatt atcaataatg aatctgggca gatgtcagaa ggcgataatc aaatgctgaa 7140 ttccaatgac cgaatctact ttgatgtctt agatttgtga tcagcttcca ctacaaaggt 7200 gctgcatcga aagccaatgg agattcaatg ccggatgaaa atgccgatga aagagaatca 7260 gagccgagga caatttagag aaaattcaat tgcaactgcg tcaagagttc ctcaacaata 7320 agatgaaata tcgaaactca attggatctt ttgcttttgg agagtctcag aacggcaagc 7380 tccatgtaga gctgtgtccg atatgaaaaa ttcaaaattg ttatattaat aaatttttac 7440 aatagtacta gtgagtgatt ttgttaagtc tacgacttag tcgtagacgt agttgtaggc 7500 gatgtttggc gatgattgga gatgattggc gatgatgggy gctgatgggc gatgatgggc 7560 gctgatgggt gcggatgggt gttgatgggt gcggatcggt gcggatcggt gcggatsggt 7620 gcgcatgggt atggatgkgt gtgratgggt atggataagt ayggatgagt gtggtactag 7680 aatttaggta ggtgtggaag ttagtaggtg tatggaaggt tctagaaggt aggtgtgtgg 7740 aaggttctag aaggcgggtg tgtggaattt ggtaggtgtg ttggtccaca actacgtgac 7800 ataataacag ttaggtaatg tgttggtcca cgtgacataa taaggttagg taataagtgt 7860 gtctccgggt agtgactgac cagtgagtac gtgacataac aatggttagg taataagtgt 7920 gtctccgggt agtgactgac cagtggtcaa gtgacataat catgaggtca ttgaccaaag 7980 gtcaattact aggtagcaac tgagtggaca gaggtcaatc tccaggtagc aacagtgtgg 8040 acagaggtca aatgttatgt aataattacg taagccaagg ggggttgtat ctgacgggct 8100 cacatcctac tact 8114 // ID Gypsy-27_OD-I repbase; DNA; INV; 8452 BP. XX AC CABV01000704; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_OD_; KW Gypsy-27_OD-LTR; Gypsy-27_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-8452 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000704; Positions 15001 6550. XX CC Positions [3746-4213] - Integrase core CC 'AATCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 87..1115 FT /product="Gypsy-27_OD-I_2p" FT /translation="MSKRKLQDIKIQYAYDKEEMKSIFDESVEAHSVTTGS FT EKHQKLEKQFERTERRILMSYLSPDFERRYEDELRNIVGSEDFFKLMGEII FT GEVSQADLVREAQEKLRDISRNSEEEETYQRFTGRITQLAEIASKKVTVLQ FT EFFINEAFDRNLTPEIKRYLLDQGMTGKSTTETAKYLDKMKKHKRKPSVNV FT INTKDLILQEQIDALSAQFASFPEMIRESLTTSISSIVQQQIRSLQPETGD FT INKVSAKHEPHTKRDPQQHDRTVQRRPQHNDFRRDRQYDQGHQENRYPDHF FT QFAPDGKPFRCTKCGILGHESRFCRGTTLTCRYCGEAGHIKFACPKKMAKN FT " FT CDS 992..5371 FT /product="Gypsy-27_OD-I_1p" FT /translation="MYKMRNSRPRIKILPWYNADLSLLWRSWAYQVCVPEE FT DGKKLGTGSFYVSKKSHALIAKFSATIGPKLLINGFIFDKPITFNLDTGAQ FT VSCIPQKFIPVSLLSSMTPAPFELQSYNGTSINVYGSICASFNLGSIKINN FT AVLLVVENNCSPIIGTPEMKENNLVLDIARSTLSQNANSENLLVSRDKSSA FT GIKLISKRSVYTATARIENDICIPPRTSMSVNTFLDTQPTSTVCIFPEACA FT KNKKLQIYDQVVQLQCLDDKAAIQIANLSNEAIRIKKNTAICRVEEVSIQP FT PDNPVKGKLEQILKELKIGEAPTQIVQKLRRMIADFTDVFAIEGEPLGTTD FT AMSYNIDTGSAAPVASQRYRTPYYLRNELKRIIESNVNSGLLKPCSSPWAA FT PVLLVKKSNGKWRLVCDYRKLNSVTVSNQYPLPDIDGLIDEMSNSTVFSTA FT DLFTGFHQIPCDEETQQKVAITTDFGQFTWTAMPMGGKNAPAVFQQMMDKI FT FKDIPSNELAIYLDDLCLHSTTYEQNLEIIQKVLLVLRRNNLKIRAAKTEF FT LKNKIKFCGALLENGFRSPNPDKTRAVAELRHPSNAKEAQSVFGLLNYHRN FT FIPHFASKSAPITLAYRKGFRWTNEAAVALEAIKKEIVEKAEKLRIPTPDS FT GHYAIETDASDTGLGAVLLFKAEKEKQYQPAAYLSHKFDEAQKNYNTYEKE FT LLAGKKAMEKWSHYLLGRKFEWITDNSCVNWAHKIKPRKLKIAKWLAEIAD FT FDYITRLKPSSQMVISDCLSRQFADCNIVSINMMSGKEMADIQQNEQEIRD FT IERFCKIDRWPHDRSRTLQGYYKVREKIRKGPDGEIGVQLETFKPFPPQCL FT TTEILKEYHDKSGHPGISQTIDGISRKYFIPNLREMVTDFIKSCDRCQRVK FT PMTNPLNAPLGHVSPPSKPFERFAIDLIGPLPITNRNFKHICVSTDLFSKK FT TYAQPLRNKNPSEILRVTMAEWLRNPCLPKSVLMDNGKEFTSLKNSCEEKG FT IRVFCSPAYHPQTNGECENRNRTIKSRLRLLSNFTNWDEHLPYVIHQMNSA FT KHSVTKMTPYEVETGFEGENPNDSYRPEQKKMAVDFKTIKNRIEENHVKRR FT SENEEIRTFNVGELVLLKNMDPRSKHEKFVGPFAIRYIRDQGLGFSLENLQ FT TGNMVSRHISHIKPYIQRETESSEPQAPAQKPVIPIQKKRKLRSHTISLVG FT RTTQPLSVVQPNQQRQKSISEETITVDIPLIPEQLNSSDDDQDAAEIDQTI FT FEDALDQTLCLEQEDPTSSLEPSDLDRTLTDDPNLTNNRSTSNELWLSDNS FT STSSQGIIAPVIQRIVDLPNKALDKFMSDMKIDNKLSKWLDMGGSKKQKKM FT DQITTWIEQNHPNWDKDEEGFYLVKQDAVKLESKCYISQLTLSDLKVFATH FT LRLDVDFTGKSKKMLVSEITRKALEKVPAFKCTKSGNLIIDPTLLQ" FT CDS 6555..8426 FT /product="Gypsy-27_OD-I_3p" FT /translation="MKIFLIFTLPFCKPDDEFVFDKSNGIVFRKEEPIYTF FT DSEIPARINIMVPSPRQVFRTSLQADCADTSQGLPPMGRNFFNIDGSIPEL FT EENLQHLSETCSEAFNIFDSMILSLVKNELVFTEDYKPELHEKQKTKPKLM FT RRETSARRRRRREVVIATIATVAAVGMVANLGYTATVDVKSKQRDELLEKK FT IDEDRQKLASLTTVVELLDESIDEVAKMMRKSKKPIITFSGIELSADDKMK FT ERMVDTDPNSLNDFFADYSQRIGRETVRAVMLLTTQRMPLFSKFILSIRAN FT CLAIQEIEDTYLAKEFCLAFALHTTRFDTSLRFAGLGLTKFDNDQELFRIK FT EVILAMELKIPRLRLKADRYSVANLGYYKPDNTRWKLNVPQQLIVMPSKEV FT LSMSPSLCLKFSPSYACDVSSLEPSTCGESLLTSNTTRLCETREVDSQKCG FT YLETHDRGFISMAKKSVVNFFHHHPSKELQQIDTFEKVKYPGAVDCGPTVV FT RVNAGFGEVNFTSRINYISPIKIQVTNIEEHRYQELENRTHLAIENNHNLK FT MSNEKLNIKISAIEGKIADFVHGITGWISGITSILTLLLLVFAIQRFKCCR FT RKPVDRIMLANFQPPTASTSRTDSEL" XX SQ Sequence 8452 BP; 2905 A; 1945 C; 1624 G; 1978 T; 0 other; tggtgactac tttgagacat ccgctagaac tcctccgaaa ttttcaagaa gaatacagag 60 aatctgtccc taaaatcacc ctgagaatgt cgaaaagaaa actacaagat atcaaaatcc 120 agtatgctta cgacaaagaa gaaatgaaaa gtatattcga cgaatcagtc gaagcccaca 180 gtgttacaac tggttcagag aaacatcaga aacttgaaaa acagtttgaa aggacagaaa 240 gaagaatctt gatgtcatat ctgtcgcctg attttgaaag aagatacgaa gatgagctcc 300 gaaacattgt tggatcagaa gacttcttca aacttatggg ggaaatcatt ggagaagtct 360 cacaagccga tctcgttagg gaagcccagg aaaaattgcg ggatatctca agaaattcag 420 aggaagaaga gacatatcaa cgcttcacag gaagaataac gcagcttgct gagatcgcaa 480 gtaaaaaagt tactgtcctt caagagtttt ttataaacga agccttcgac agaaatctca 540 cgccagaaat caagcgatat ctcctggatc aaggtatgac aggaaaatca acaacagaaa 600 cagcgaagta cctcgacaaa atgaagaaac acaaaagaaa accaagcgtt aatgtaatta 660 acacaaaaga tctcatttta caagagcaaa tcgacgcact ttccgcacaa ttcgcctcat 720 tcccggaaat gataagagaa tcactgacta cttcaatttc atcaattgtc caacaacaaa 780 tccgaagtct tcagccagaa actggcgata ttaataaagt ctctgcaaaa catgagcctc 840 atacgaaaag agacccgcag cagcatgaca gaacagtaca acgacgtcct caacataacg 900 attttcgccg agaccgccag tacgatcagg gccatcagga aaatcgttac cctgatcact 960 ttcagttcgc gccagacgga aaacccttta gatgtacaaa atgcggaatt ctcggccacg 1020 aatcaagatt ctgccgtggt acaacgctga cttgtcgcta ctgtggagaa gctgggcata 1080 tcaagtttgc gtgcccgaag aagatggcaa aaaactagga actgggtctt tttatgtgtc 1140 aaaaaagtcc catgctctta ttgccaaatt ttctgctaca atcggaccga aacttttgat 1200 aaacggcttc atatttgaca agccaatcac ctttaacctt gataccggcg cccaagtctc 1260 gtgtattcca caaaaattta tccccgtttc tcttctctca tcaatgactc cagcgccatt 1320 cgagctgcag agctacaacg gcacgtccat caatgtgtat ggaagtattt gtgcctcttt 1380 caaccttgga tcaatcaaaa taaacaatgc cgttcttctt gtcgtcgaga acaattgctc 1440 gccaataatc ggcactcctg aaatgaaaga aaacaatctc gtcttggata tagcacgatc 1500 tacactttct caaaatgcta attctgaaaa cctccttgtc tctcgtgaca aatcatcagc 1560 tggaataaaa ctcataagca agcgatccgt atacacagcc acagcacgaa tcgaaaatga 1620 catctgcatc ccgccaagaa cctctatgtc agtgaacact tttcttgata ctcaaccaac 1680 atcaactgta tgcatctttc cagaagcttg tgcgaaaaat aaaaaacttc aaatctacga 1740 ccaagttgtt caacttcaat gtcttgacga taaagcagca atacaaatag caaacttatc 1800 aaacgaggct ataagaatca aaaagaacac cgcaatttgc agagttgagg aagtcagtat 1860 tcaaccacct gataatccag taaagggcaa gctagaacaa atcttgaagg agttgaaaat 1920 tggcgaagct ccaactcaga ttgttcagaa actgagacgc atgattgcgg attttacaga 1980 tgtcttcgcg atagaaggtg agcctctcgg gacaactgac gccatgagct acaacatcga 2040 cactggatcg gctgcacccg ttgcttctca gagatatcga accccgtatt atctaagaaa 2100 tgagctgaaa cgtataattg aatcgaatgt caactctggt cttctgaaac cctgctcaag 2160 tccatgggcg gcccctgttc tactcgttaa gaaatcaaat ggaaaatggc gacttgtttg 2220 cgactacaga aaactaaata gcgtcaccgt cagtaatcag tacccgctac ccgatattga 2280 cggcctaatc gatgaaatgt caaactcaac ggtattttcc accgcagact tgtttactgg 2340 ctttcaccag atcccgtgtg acgaagaaac tcagcaaaaa gtggcaataa ccactgattt 2400 tggacaattt acctggacag caatgcccat gggcggtaaa aacgcgccag cagtatttca 2460 gcaaatgatg gacaaaatat tcaaagacat ccctagcaac gagctagcta tatatttgga 2520 tgatttatgt cttcactcaa caacatacga gcaaaacctg gaaattatac agaaagtact 2580 tctagttctc agaagaaaca atttgaaaat aagagctgca aaaactgaat ttctaaagaa 2640 caaaataaaa ttctgtggag ctcttcttga aaacggcttc cgaagcccaa atcctgataa 2700 aacaagagct gttgccgaac ttagacaccc aagcaatgct aaagaagctc agagtgtttt 2760 tggacttctt aattaccatc gcaattttat ccctcatttt gcatcaaaat ctgcgcctat 2820 aacactggca tacagaaagg gatttcgatg gacgaatgag gcagctgtcg ctcttgaagc 2880 aattaagaag gaaatagtcg aaaaggcgga aaagttgaga attccgactc cagattcagg 2940 acactacgcc atagaaactg atgcaagcga cactggactt ggtgctgtct tactgtttaa 3000 ggcagagaaa gaaaaacagt accaaccggc tgcgtatcta tcacacaaat tcgatgaagc 3060 tcagaaaaac tacaacacat atgagaaaga acttcttgct ggcaaaaaag ccatggagaa 3120 atggtcacat taccttcttg gaagaaagtt cgagtggata actgacaact cgtgtgtcaa 3180 ctgggctcac aagataaaac caagaaaact taaaatagct aaatggctcg ccgaaatagc 3240 ggactttgac tatataacgc gccttaaacc atcgagccag atggtgattt cagactgtct 3300 atcccgtcaa tttgccgatt gcaatatcgt tagtataaac atgatgtcag gcaaagaaat 3360 ggcagatatt cagcaaaacg agcaagagat acgtgacatt gaacgatttt gcaaaataga 3420 ccgctggccg cacgacagat ccaggactct tcaaggatat tataaagttc gggagaagat 3480 cagaaaaggc cctgatggtg aaataggagt tcagctggag acttttaagc cctttcctcc 3540 acagtgctta actactgaaa tactcaagga gtatcacgac aagtcaggtc atcccggaat 3600 atcacaaaca attgacggaa taagcagaaa gtactttatc ccgaacttgc gagagatggt 3660 cacggatttt atcaaatctt gcgatagatg ccaacgagtt aaaccaatga caaatccact 3720 caatgcccca ctgggacatg tttcaccacc gtcaaaacca ttcgaaagat ttgctatcga 3780 ccttatcggc ccgctaccaa taacaaatcg gaatttcaaa cacatttgcg tcagcactga 3840 tctttttagt aagaaaactt atgcgcaacc actaagaaat aagaaccctt cggagatact 3900 tcgcgtaacc atggcagaat ggctaagaaa tccctgcttg ccgaagtcgg tcctgatgga 3960 taacggtaaa gaattcactt cgctgaaaaa ttcatgtgaa gaaaagggta ttcgcgtatt 4020 ttgctcacct gcgtatcatc ctcaaacgaa cggggagtgc gagaacagaa accgaacgat 4080 taaatcaaga ctaagacttc tttcaaattt tacgaactgg gatgagcatt tgccttacgt 4140 gattcatcaa atgaactctg cgaaacacag cgtgacaaaa atgactccct acgaagtcga 4200 aactggcttt gaaggggaga accccaacga ctcttaccgc cctgaacaga agaaaatggc 4260 cgttgacttt aaaactataa aaaatagaat tgaagaaaat cacgtcaaac ggagatctga 4320 aaatgaagaa atcagaacat tcaacgtcgg agaattggtt cttcttaaaa atatggaccc 4380 aagatcaaag cacgaaaaat ttgttggacc ctttgctata cggtacatcc gtgatcaggg 4440 cctaggattc agtctggaaa acttacaaac aggaaacatg gttagccgtc acatcagtca 4500 tataaaacca tacatccaac gtgaaacaga gtcgtcagaa ccacaggcac cggctcagaa 4560 acccgtaatc ccgattcaga agaaaagaaa acttcgctca cacactattt ccctcgttgg 4620 cagaacaacc cagcctctct cggttgtcca accaaaccaa caaagacaaa aaagcatttc 4680 tgaagaaacg atcacggtcg atatccctct gatacccgaa caactcaata gttctgatga 4740 tgatcaagat gctgccgaaa tcgaccaaac tattttcgaa gatgcactgg atcagactct 4800 gtgcctcgag caagaggatc ctacttcgag cttagagcca tctgatctag atagaaccct 4860 gactgatgat ccaaatctca cgaacaaccg aagcacatca aatgaacttt ggctttctga 4920 taactcaagc acgtctagcc aaggcataat cgcaccagtt atccagagaa tagttgatct 4980 gccgaacaaa gccctggata aatttatgag cgacatgaaa attgataata aattgagcaa 5040 gtggcttgat atgggcggct cgaagaagca gaaaaagatg gatcagataa caacctggat 5100 tgaacaaaat caccctaact gggacaaaga cgaagaaggt ttttaccttg ttaaacaaga 5160 tgctgtcaaa ctggaatcaa aatgctacat ttcgcaacta actttgtctg atcttaaagt 5220 gttcgctaca catttgcgtt tggacgtcga tttcaccgga aaatctaaga aaatgttagt 5280 ctccgaaatc acgagaaaag cgttggagaa agtcccggca ttcaaatgta cgaagtctgg 5340 gaaccttata atcgacccga cacttcttca gtaattttcg aaaaatcaaa atccatttgg 5400 gtacccaatc cgaattttgg acctgataat ttccgaaaat cgccacaagt gatccgagat 5460 tacctcaaat caatctctaa aatctatttt cgaaagccaa ttatctacgc tttcattcga 5520 gcagtccaga accaacatac gacaaaaact tttctcgatg gacaagtctg aagtagggga 5580 taattttctt atcccctcga aaaatcacca ttttctcaaa attgacacaa atcatctccg 5640 aaacgtcaga atcaatctcc aaacgacttt ttagaaagca cgcagcgtca gctctttgtt 5700 ttaaacagct aagcaactga ataacacatc tgaagaaaaa tcgaaaactg ggtaccaaaa 5760 atgttaggaa tcccaacatt ttcccgaaaa tcgccacaag tcatctgaga atacctcaaa 5820 tcaatctctg aaagctgctt atgaaagcca attatctacg ctttcattta agcagtccag 5880 aaccaatatc cgacaaaaaa gtttcttgat ggacaagtct gacctagggg ataattttct 5940 tatcccctcg aaaatcgcca aattggtcgg attcccacca ccatctcaaa aatttacgca 6000 aatcatctcc gaaacgtcag aatcaatctt cgaacgaatt tttagaaagc acgcagcggc 6060 agctttcgat aaaacaggaa ctcaataaca catctgaatg agaaaaatcg aaaaatttgg 6120 tggcaaatct tggaccttag acaagtatcg gttcattatt gcgacagtca agtgaagagt 6180 ttcgttttga agtttgaatc gcttatgaac ctgcacgacc agtcagcaag gatggtgcgc 6240 aacttgtagc tcacttcttt atgtatgtcg agaaatcttg actttacgac gtatgcgtgc 6300 tcgagatatt ttgtgtaatc ataaatcata acaaaaagaa aaagatgaaa ccaacaaacg 6360 tcaaacctcc cgtggaaaac caaatagaaa tttcatgccc tgattacaca ttgcaatcac 6420 ccaggattat ctaggccatg tgattcgagt caataacccc aggtgacgcc acagctgact 6480 gcacggcgga acagcgctga cgctacccat ataaaatcga attcaccaag gagtgctcag 6540 aacaacaaga gaaaatgaaa atatttctta tttttacact ccccttttgc aaaccagatg 6600 acgaattcgt attcgacaag tcgaatggta ttgtattcag aaaagaagaa ccaatctaca 6660 ccttcgacag cgaaatccca gcaagaatta acattatggt gccttctccc agacaagtgt 6720 tccgaacatc gctacaagca gattgtgcgg atacaagcca aggactaccg ccaatgggaa 6780 gaaatttctt caacatagat ggttcaattc cagaattaga agaaaacctt caacatctct 6840 cagaaacatg ctcagaagcc ttcaacatat ttgactcgat gatcttgagc cttgttaaaa 6900 acgagcttgt cttcacggaa gactacaaac ctgaacttca cgaaaaacag aagacaaaac 6960 ccaaactaat gagacgggag acatcggcaa gacgacgcag aagaagagaa gtcgtcatcg 7020 cgacgatcgc aacagttgcc gccgtcggta tggtcgcgaa tcttggatac acagccactg 7080 tggacgttaa gtcaaaacaa agagacgagc tcctcgagaa gaaaatcgat gaagatcgac 7140 aaaagctcgc ctctttgaca acagtcgtcg agctgcttga cgaaagcata gatgaagtag 7200 ccaaaatgat gagaaaatca aaaaaaccga tcatcacctt ctccggaatc gaactctccg 7260 ctgatgacaa aatgaaggaa agaatggtcg acacggaccc aaactcgctg aatgatttct 7320 tcgcagacta ctcccagaga attggccgcg aaactgtcag agccgtcatg ctactaacca 7380 ctcagagaat gccgctgttc tcaaaattca ttttgagcat cagagcaaac tgtcttgcta 7440 tccaagaaat cgaagacacc tatctcgcga aagaattttg tcttgcattc gccctccaca 7500 caaccagatt tgatacgtca ttgagatttg ctggactcgg ccttacgaaa tttgacaacg 7560 accaagaact cttccggata aaggaagtca tcctggcgat ggaactcaaa attcctcgcc 7620 ttcggctgaa agctgaccga tactccgttg caaatcttgg gtattacaag cccgataaca 7680 cccgatggaa acttaacgta ccacagcaat tgatcgttat gccgtcaaaa gaagtcctct 7740 ccatgtcccc gtcactttgc cttaaattct ccccgtcata tgcatgcgac gtatcatcac 7800 tcgagccgtc aacatgtgga gaatcattgc tcacctcaaa tacgacaaga ctttgtgaaa 7860 caagagaagt ggatagccaa aaatgcggct atcttgagac ccacgacaga ggcttcattt 7920 caatggcaaa gaaaagcgtc gtgaatttct tccatcatca cccatcaaaa gaactgcagc 7980 agatagacac atttgaaaaa gtaaaatatc ctggtgctgt cgactgtggt ccgacagtcg 8040 ttcgagttaa tgccggattc ggagaagtta acttcacttc gcgcattaac tatataagtc 8100 cgatcaaaat ccaggtcacg aatatcgaag aacaccgcta tcaagaactc gaaaatagaa 8160 cacaccttgc aatagaaaac aatcacaacc tgaagatgtc aaacgagaaa ctcaatatca 8220 aaatttctgc aatcgaaggc aaaattgccg actttgtcca tggaataacc ggctggattt 8280 ccggaataac cagcattctc acccttctcc tgctcgtttt tgcgattcaa agattcaaat 8340 gctgtcgcag aaaaccagtc gacagaataa tgcttgcaaa ctttcagcca ccgaccgcat 8400 caacgagtag aactgactcc gaactgtaaa aacccgccga ctgacgatgg tc 8452 // ID BEL-4-I_HM repbase; DNA; INV; 5760 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5760 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 436-436 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 339..5735 FT /product="BEL-4-I_HM_1p" FT /translation="MAAILGRKIAKRKALRSIFQKNVVEISQALEDPDTSQ FT SRFIALKNSLSDSYTNLNNIDEEIINILDPNEVEEDVLASVEITDPFHNIL FT SKINLRLESLKLGFDNNSEQSSKNKINCRLPKIELPIFKGNFLEWQTFWDQ FT FNVAIHNNDDLNDIDRFNYLKKYLGGQALTTICGLTLSSENYREAVSLLTE FT RYGNPQILISAHMDLLLKLVKVKNSDNVDGLRKLYNDIEASVRNLKSLKVE FT TNTYGSLLIPILKEKIPDDLVLLISRKFGGEVWTIDLLLKYFREELNAIEC FT CSTNSRNVENRKNNSQLFTTAGLYSQNINQNSKNNYKKVFPKSFDSDYQNK FT RNYNYPCIYCSENHNPWECKNVTNFSTRKEILKKQGRCFNCLRTNHILKNC FT QANYICSKCKKKHHISLCEKENTFVGHATHNISQQTILLQTARANISAVSE FT NKSLKARILFDSGSQRSYISENIQKRLNLKTIRKEKIIIKTFGKDNEQTLQ FT LLNVVQFKVKHKYTDHSVYMEALCVPVICSPLQGQNVSLACNQFKQLSKYS FT LADYDDGSLDFHVDILVGIDFYHSFITGRIVRSPIGGLVASKSILGWVLSG FT PISLEKSTKNTTKTVFTSHSMHCEVESEIDELRKDLNKFWSVETIESPESC FT VMYQFERDLTHNGERYITKLPFKTDHDFLPDNYNICKKRLANLKVRLNDKQ FT LVTEYNQIFIDYEKNNIIERVSESDIFKEPGCVHYLPHRPVIRRDKDTTKI FT RAVFDASCSTTGPSLNDCLYSGPNLLSKIFDILLKFRFNAIGIIADIKQAF FT LNIEISPEHRDYLRFLWYERISDKEAKLIVYRFLRVVFGLTSSPFLLNATL FT KHHLNKYIKHDEKFIERLIDDLYVDDITASGCETISEGKMLYKKSKCIFLD FT AGFELRKWITNDHELQNYFNKKETNQSENSKCNLKVDELKYFESEIIKNEN FT NNNFIKILGIEWDILKDEFVFEFKEFVKNARMLKITKRNILKXAASFFDPL FT GFLTPITSRVKTIFQLICRDKSGWDEIVTEAIELALTEFLKDLELLSFVRI FT PRFVFERVNESSKRVQLHGFCDSSKLIYCAVIYLVVETSLGVNRKFLVSKS FT RVSPLKELSIPRLELLGCVLLSRLLEEVLRVLKGRVHFEDVSCWCDSEVAL FT YWIKGKERTWKPWVENRVNAIRKVVDREKWNHISGELNPADFPTRISNFVD FT FGHWLKGPEFLLNMNKEINEFKVDKKFQVDIMSECKRGENVVICNVVLDSS FT PSLSSIIDACRFSSFEKLIISTGYVFRFINNLLKTIKKLPLNKQVTLTTDE FT YKFALNEWIRDEQRXLQNDKSFDKLKNSLKLFDGEDKLIRLRGRFENANLN FT FAEKHPLILRGKESWFTILLIRNCHEQVLHHGIESTLNKVRSQFWIIKGRS FT TIKSIIRQCVTCKKFQGRTLLPPSSPDLPDFRVNNLYSFQATGLDYAGPLF FT VKTVNKDAACKVYVLLLTCASSRAIHLELTPDMKIPAFIRAFKRFAARRGF FT PDLIISDNFKTFISKEVKNFMMKNDVQQKFILPASPWWGGFYERLVRSVKI FT TMKKTIGKSLLSYEELETILCEIESVINSRPLFYMSEDDTEEALTPYHLMY FT GRNIVNKMNGISLLPNDISKRAIFIKTTIFNFWRRFATSYLNELKQHHLYV FT KSKTSYNEQLVLNDVVLIRDDIKIPRSQWRMGKVLELIRGADGKVRGARLQ FT VLSKEGIRTTCFRPIQKLIPFEITNKSNEMNKDKNSRSSENELGVIATDET FT CRSICKRPIRQAASAGENLRRLKELYY*" XX SQ Sequence 5760 BP; 2156 A; 697 C; 1057 G; 1846 T; 4 other; tggcgcccga tgcaaaatac gaaatctgaa cgaagaatat tgccagaaga aaaatattta 60 tccaattgaa ctcaaaaaga aaaagaaaaa gaatcgattt gacttacaaa agttttgtac 120 atttgattcg gtgtaagatt tgaattgtgt aaacaagcgt gttggtaaga aataacatat 180 tgcttttgtt gattgatagt gaatttgttg ttgtttacat atttttattg ctcattgtgt 240 ttattgtgta taagagcgag tgaaaaattg tatagagact gtaaagtagt attagattat 300 aagttataca tttgtatagt agtagtagaa tataagttat ggctgcaata ttaggtagga 360 agatagctaa aaggaaagca ctaagaagta tttttcaaaa aaatgttgta gaaataagtc 420 aggcgttaga agaccctgac acctcacagt ctaggtttat agctttaaaa aatagtttga 480 gtgactctta tacaaattta aataatattg atgaagaaat aattaatata cttgatccaa 540 atgaggttga ggaggatgtt ttagcgagtg ttgaaataac tgatccattt cacaatatac 600 tttctaaaat aaatctaagg ttagaatctt taaaattagg gtttgataat aattctgaac 660 agagttctaa aaataaaatt aattgtagat tgccaaaaat tgaacttcca atttttaagg 720 gtaatttttt agaatggcaa acattctggg atcaatttaa tgttgccata cacaacaatg 780 atgacctaaa tgatattgat cgtttcaatt atttaaaaaa atatttagga ggccaagcat 840 tgactaccat ttgtggttta accttatcgt cagaaaatta tagagaagca gtttctttat 900 tgactgaacg gtatggtaac ccacaaattt taatctctgc acacatggat ttattattaa 960 aacttgtgaa agtaaaaaat agtgataatg ttgatggctt aagaaaactt tataatgata 1020 tagaggcaag tgttcgaaat ttgaaatcat taaaagttga aactaacaca tatggaagtc 1080 ttctgattcc tattttgaaa gaaaaaatac cagatgattt agttctgtta atttcgagga 1140 aatttggtgg ggaggtttgg actatagatt tattattaaa atattttcga gaagaattaa 1200 atgcaataga gtgttgttca actaatagtc gaaatgttga aaacagaaaa aataatagtc 1260 agttattcac aactgcagga ttatattcac agaacattaa tcagaactct aagaataatt 1320 acaaaaaagt ttttccaaaa tcatttgata gtgattacca aaacaaaaga aattataatt 1380 atccttgcat atactgttca gaaaatcaca acccctggga atgtaaaaac gtgacaaatt 1440 tttcaaccag aaaagaaatt ttaaaaaagc aaggtagatg ttttaattgt cttagaacga 1500 atcacatatt aaagaattgt caagcaaatt atatttgtag taaatgcaaa aaaaagcatc 1560 atatttcctt gtgtgaaaaa gaaaacactt ttgttgggca tgcaacacat aacatttccc 1620 aacaaactat cttgttgcaa acagctaggg caaacataag tgcagtttca gaaaataagt 1680 ctttaaaggc aagaatctta tttgatagtg gaagccaacg atcatatata agtgaaaata 1740 ttcaaaaacg attaaattta aaaactatta ggaaagagaa aattataata aaaacatttg 1800 ggaaagataa tgaacaaaca ttgcaattat taaatgttgt tcaatttaag gtcaaacata 1860 aatatactga tcacagtgtt tatatggaag ctttatgtgt accagttatt tgtagccctc 1920 tgcaagggca aaatgtgtca ttagcttgta atcaatttaa acaattatcc aaatattcat 1980 tagctgatta tgatgatggg tctcttgatt ttcatgttga tattttagta ggtatagact 2040 tttatcacag ttttattact ggaagaatcg taagaagtcc aataggtggt ctagtagcta 2100 gtaaatcaat cttagggtgg gtattaagtg gtcctatttc tctggaaaaa agtacaaaaa 2160 ataccacaaa aacagttttt actagtcatt ctatgcactg tgaggttgag agtgaaatag 2220 atgagcttag gaaagatcta aataaatttt ggtctgttga aaccatagaa tctcctgaga 2280 gttgtgtgat gtatcagttt gaaagagatc tcacacataa cggggaacga tatataacaa 2340 agttaccatt caaaactgat catgattttt tacctgataa ctataatatt tgtaaaaaac 2400 gtctggctaa tttaaaggta agattaaatg ataaacaact ggttacagaa tataatcaaa 2460 tttttataga ctatgaaaag aataatataa tagaaagagt tagtgagtct gatatattta 2520 aagaaccagg gtgtgtgcat tatttgcctc ataggcctgt cattcgaaga gataaggata 2580 ctacaaaaat tagagctgtt tttgatgcgt cctgttcgac aacaggaccg tctttaaatg 2640 attgcttgta ttcaggtccg aatttattrt caaaaatatt tgatatttta ctaaaattta 2700 ggtttaatgc tatagggata attgctgata taaaacaagc attcttaaac attgaaatct 2760 ctcctgaaca cagagattac ttaaggttct tgtggtacga gagaatatct gataaagagg 2820 ctaaactaat tgtatataga tttcttagag ttgtttttgg tttaactagc agtccttttc 2880 ttttaaatgc cactttaaaa caccatctta ataaatacat taaacatgat gaaaaattta 2940 ttgaaagatt aatagatgat ctttatgttg acgacattac agcctctggc tgtgaaacaa 3000 tttcagaagg gaaaatgctt tataaaaaat ctaaatgtat atttttagat gcaggattcg 3060 aattaagaaa atggattaca aatgatcacg aattacaaaa ttattttaat aaaaaggaaa 3120 caaatcaaag tgaaaactca aaatgtaatc taaaagttga tgaattaaaa tattttgaaa 3180 gtgaaattat taaaaatgaa aacaataata attttataaa aattttagga attgagtggg 3240 atatattaaa agatgaattt gtttttgagt ttaaagaatt tgttaaaaat gcaagaatgc 3300 taaaaattac taaaagaaat attcttaaga trgcagcatc tttctttgat cctctaggtt 3360 ttttaacacc aataacttca agagtaaaaa caatttttca attaatttgt agagataaaa 3420 gtggatggga tgaaatagta acagaagcaa tagaattagc tttgacagaa tttttaaagg 3480 atcttgaatt attaagtttt gttagaatcc caagatttgt gtttgagaga gtgaatgaga 3540 gtagtaagcg tgttcaattg catggatttt gtgacagttc taaattaatt tattgtgccg 3600 ttatatatct agttgtagaa actagtttag gagtgaatag gaagttttta gtttcaaagt 3660 cacgtgtttc acccttaaaa gaattaagta tacctagatt agagttatta ggttgtgtgt 3720 tactgagccg acttttagag gaagtgttaa gagttttgaa gggaagagtg cactttgagg 3780 atgtaagttg ctggtgtgat tcagaagtgg cattgtattg gattaagggt aaggaaagaa 3840 cgtggaaacc atgggtggag aatagagtga atgccattag aaaagtggtt gatagagaaa 3900 aatggaatca tattagtggg gagttgaacc cagctgattt tcctactcga ataagtaatt 3960 ttgttgattt tggacattgg ttgaaaggac cagaattttt gttaaatatg aataaagaga 4020 taaatgaatt taaagttgat aaaaagtttc aggttgatat tatgtctgag tgtaagagag 4080 gagaaaatgt agttatttgt aatgttgttt tggactcatc accaagtttg agtagtatca 4140 ttgatgcatg tagatttagt tcatttgaaa agttaattat tagtactgga tatgtatttc 4200 gatttatcaa caatttgtta aaaacaatta aaaagctacc attaaacaaa caagttactt 4260 tgacgacaga tgagtacaaa tttgcactca atgaatggat tagagatgag caaagarttt 4320 tacaaaatga caaaagtttt gataaactga aaaactcttt aaagttattt gatggtgaag 4380 acaaactaat acgattacga ggaagatttg aaaatgctaa tttaaacttt gcagaaaaac 4440 atcctttaat attacgtgga aaagaaagtt ggtttacaat attactgatt cgtaactgtc 4500 atgaacaagt tttgcatcat ggaatcgaat cgacgttgaa taaagtacga tctcaattct 4560 ggataataaa aggacgatcg acgataaaaa gtattattag acaatgtgta acatgcaaga 4620 agttccaagg aagaacattg ttaccaccta gcagcccaga tttacctgat tttcgagtta 4680 ataatttata ttcctttcaa gcaactggtt tagattatgc aggaccttta tttgtaaaaa 4740 cagttaataa ggatgctgcc tgtaaggtct atgttttact attgacttgc gcgtctagtc 4800 gagcaatcca tcttgaactg acacctgaca tgaaaattcc ggccttcata agagctttta 4860 agcgttttgc agcacgaaga ggttttcctg atcttattat ctccgataat tttaagacct 4920 ttatatctaa ggaagttaaa aattttatga tgaaaaatga tgtgcaacag aagtttattc 4980 taccagcttc cccctggtgg ggcggtttct acgaacgttt agttagatct gttaaaatta 5040 caatgaagaa aacaattggt aagtctcttt tatcttacga agaactagaa acgatwttat 5100 gtgaaatcga atctgtgata aacagtcgtc cattatttta catgagcgag gatgacactg 5160 aagaagcatt aacaccatac catttaatgt atggacgaaa tattgttaat aaaatgaatg 5220 gaattagtct actacctaat gatatttcga aaagagctat atttataaaa acaacgatat 5280 ttaacttttg gagaaggttt gcaacttctt atcttaatga acttaaacaa caccatttat 5340 acgtaaaaag taaaacttca tacaacgaac agctagtatt aaacgatgta gttcttatac 5400 gagatgatat caagatccct agaagtcaat ggcgtatggg aaaagttttg gaacttatta 5460 gaggagctga tggaaaagta cgaggtgcgc gtttacaagt tttatccaaa gaaggaatta 5520 gaacaacatg tttccgacct attcaaaaat taataccttt tgaaattacc aacaaatcaa 5580 acgaaatgaa taaagataaa aatagtagat catctgaaaa tgaactaggt gtgattgcta 5640 ctgatgaaac gtgcaggtca atttgtaagc ggccgatcag acaggcagcg agcgcaggcg 5700 aaaatttgcg gcgtttaaaa gagttatatt attagttgac caagtcaaca taggggtgta 5760 // ID DNA3-7_AP repbase; DNA; INV; 401 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-7_AP. XX NM DNA3-7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-401 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1948-1948 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 401 BP; 137 A; 75 C; 80 G; 109 T; 0 other; atagcgtttt cgatttgaca aaatagtgtg cactgaaatt ccactggcac agagcaagtg 60 ttgcatgaaa attgcactag tttcttataa atcgaataca tttatgtgta atattatgga 120 aaacaaaata tgttgacatg gaaagaaata caattcttaa tagggccgcc gtattttgtt 180 ccccgttatc gagaaaaacg gttcatattt ataaaatgca ctcattgcac tgacaaaatc 240 ggcggtgcag aaaatgcgca ctgagtgcgc acgagtgttg tcagtgcaat taaatcgaat 300 acaaaacacc gcaccagttc actgaatatt ggtcaaatcg aatacgtttt tgcactcagt 360 gcgcaccgag tgcaaaaagt gcgtcaaatc gaaaacgcta t 401 // ID Gypsy-7_IS-LTR repbase; DNA; INV; 150 BP. XX AC ABJB010066853; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_IS_; KW Gypsy-7_IS-I; Gypsy-7_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-150 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010066853; Positions 24810 24661. XX SQ Sequence 150 BP; 54 A; 31 C; 42 G; 23 T; 0 other; tgtagtgacc gcttacgaag agaaaaaaga ggagcggtca cgggactcga aaaccgagat 60 cgcgaggagc gaggccacat tcagaacgac gcaagtcaag ctgtagatat tgtaaataaa 120 atacgtgaac tagagggacg cacctctaca 150 // ID BEL-69_CQ-LTR repbase; DNA; INV; 445 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-69_CQ_; KW BEL-69_CQ-I; BEL-69_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-445 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX SQ Sequence 445 BP; 124 A; 86 C; 83 G; 147 T; 5 other; tgttaagagc aatcggtgca gtaggtatcc ctcgtcgaca cgggagccaa ctgttgagta 60 cattakttcg atttttgtcw smtttcacga tagatggtca aggaacgtca agttcaaatt 120 cgctccgtca gccttttctg acatcagtct attagcaatc tcttcaacgg tagacgtgta 180 caattaaggt ggacttatta tttataattt acttgaattt attataatcg attgaaattt 240 cagcacatct attgggttgc ttacaaataa gagttcgttc tgtagtggtt agaattataa 300 acggacacta aatgtaacta ttctctatct ttcttwgccg aggacggcct ctaataaaaa 360 tctatatttt agcttcgagc tgaactgaac tgcaactagc gtgttctgct tccaggatta 420 atccgagttc taatcaccac gtaca 445 // ID L1_Ele9 repbase; DNA; INV; 4621 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele9. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4621 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4621 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 10 CC sequences with >97% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 162..1331 FT /product="L1_Ele9_1p" FT /translation="MNNRKIRENTFRVEFAKLPKKPTSEEVHKMVGLQLGI FT TRSQLVCIQLSHTDECAFVKVTEQAIAQRVVDQYDNKLDYDVKGVKYKLRL FT RMADGCVDVRLHDLSENTTNEQIRRHMEVYGDVLSVQELLWSDKHHFPGLS FT SGIRLVRMVLREPIKSYITIDGESTLVTYPRQRQTCRHCGEYLHTGISCVQ FT NKKLLAQKAGLNDRLKKASYAGAVRGNPAEAASTHQGNDTALISAQATNTS FT TQAPSVQEASGESVGLRTDVEIETDNTNNAAGESADDPSNRSNQAPDIGNI FT AQASTVPDEHGTDTASEYGSVEGSGPSHQSVPAQQPEPEAGPSSPGRVMQA FT LLSITSNENEAQKGKHKQEDGSTSEDSSAFTVIGKQRGRGRSKKLKQ" FT CDS 1362..4547 FT /product="L1_Ele9_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDSSSYIIGTVNINNITNKTKLEALHHFVRSQDVDIL FT LLQEVENSSISIQGFNIIFNVDHTRRGTAIALKSHIKYTHVERSLDTRLIA FT VRIHDKVTIINIYAPSGTQNRASRESFFNTTLAYYLRRATEQVIIGGDFNA FT IIHPKDATGESNFSLSLKNTVQQAQIHDVWDRLRRRVTQHTYITHNSASRI FT DRIYVSTGLLEHLRNVDTHVCSFTNHHALTVRLCLPNLGRATGRGYWSIRP FT HVLTVENMDEFQLKWDYWTRRRHNYNSWMHWWLTFAKHKIRSFFRWKTKLK FT YDEFHDRYQYLYSQLREAYDRYFGDRSVLPTINLIKGKMLRLQSEFSKMFV FT HANESFVSGEPISAFQVGESRRKRTIINRLDLSDTEYIEDSTEIENHVYEH FT FREMYSQREETLNTEFHMVKCIPDDSRANAKCMDPISTEEIYFAIKGSASK FT KSAGPDGLPKEFYAKAFDIIHVHLNLILNEALRGEFTSDFTNGVIVLIKKK FT GTNNTLKSYRPISLLNVDFKLLSRILKTRVERVMTQHSVISSSQKCSNHKR FT NIYQATLSLKDRIAHLRENKRCGKLVSFDLESAFDRVNRTFLFDTMSSLGF FT NANLIELLSKIGTLSNSRILVNGHLSREFPVQRSVRQGDPIAMHLFAIYLQ FT PLITKLEHCLNDPEDLLVVYADDVTIITTDINKIERARLLFEQFGNSSGAC FT LNISKTISLDIGLISDNNRLIISWLSTTDRVKILGIVFTNSVRLMTNFNWD FT PLISKFAQQLWMHKMRLLSLHQKIVLLNSYVSSKIWYVASIVNINKSHIAK FT LTTLMNTFLWSGCRNRIPINQLALDIEEGGLKLQLPGFKCKALLISRHLKE FT IESMPFYSTYLENVGNPPNVRIIPSNCPCLQSIVQEAAYLPEQLESHPSPK FT KVMYAFLKKVTIPKIMQKHPEVCWKHVWCSIASKTLSSDERSLYYILINQK FT VMHRTLLVSMQLADTATCLHCSSADEDLEHLFSKCQRVRDSWRILTQMIEA FT VVGRAYSFSELLFSNIHTSREKKTKTMKLFINYVKFINNVNNTVDASAIKF FT HLENL" XX SQ Sequence 4621 BP; 1499 A; 979 C; 964 G; 1179 T; 0 other; cagtaaggct tcgactaccg aagcgttaac acgtcgtttt gatagctcgt caatcggtga 60 tattaagtga agtgtcgttg gtgaaccacg tgcaagttgg cccggaatcc ggtgccggaa 120 aatcgttaca cagcgtgagc atctcggacg acggttcgag gatgaacaac cgcaaaatcc 180 gcgaaaacac gttccgcgtg gaattcgcga aactaccaaa aaaaccaacc tcggaagagg 240 tacacaaaat ggtcgggctt cagctcggca tcacacggtc tcagctggtg tgtatacagt 300 tgagtcacac agacgaatgt gcctttgtga aggtcacaga acaagctatc gctcagcggg 360 tagtggacca atacgacaac aagctcgatt acgacgtcaa gggtgtcaaa tataagctac 420 ggctccgtat ggcggacgga tgcgtggatg tgaggctcca cgatctttca gagaacacta 480 caaacgaaca gatccggaga catatggagg tttacggtga tgttctgtcc gtacaggaac 540 tgttatggag tgacaaacac cactttccag gactctcatc aggtatccgc ctagtgagga 600 tggttctcag agagccaata aaatcgtata tcacgataga cggagaatct acgctggtca 660 cttaccccag acaaagacaa acgtgtcgcc attgtggaga atacctccac acaggaatat 720 cctgtgtcca aaataaaaaa cttttggcac agaaggccgg tttgaacgat cggctcaaga 780 aagcttcgta tgctggagct gtgagaggca acccggctga agcagcatca acacatcagg 840 gaaacgatac agcattgatc tcagcacagg ctaccaatac atcgacgcag gctccttctg 900 tacaggaagc ttcaggagag tccgttggat tgagaacaga cgtagaaata gaaacggaca 960 acaccaacaa cgctgcgggc gagagcgcgg atgacccaag caatcgctca aatcaagctc 1020 cagacatcgg caacatcgct caggccagca cggtacctga tgagcatggt actgatacag 1080 cgtccgaata tggatctgta gaaggatcgg gtccctctca tcaatccgtt ccagcacagc 1140 agccagaacc agaggcaggt ccttcgtcac ctggtcgggt tatgcaagcg ctattaagta 1200 tcacatcgaa cgaaaatgag gcccagaaag gtaagcacaa acaggaagac ggtagtacca 1260 gtgaagattc tagcgcattc actgtcattg gtaagcagcg tggtcgaggt cgctcaaaga 1320 aacttaaaca gtagaaaaca tgacactcgc gaatttctaa aatggattcc tcaagctaca 1380 tcataggtac cgttaatatt aacaacatta ctaacaaaac caaattagaa gctcttcatc 1440 attttgtgag atcacaagat gtggacatcc tacttctcca agaggtggaa aacagtagta 1500 tttcaattca aggatttaac attattttta atgtggatca tactcgcaga ggcactgcga 1560 tcgcgctaaa aagtcacata aaatacactc acgtagaacg aagcttagat acaaggctga 1620 tcgccgttag gattcatgat aaagtaacca ttattaacat ttacgctccc tctggcacac 1680 agaacagggc atctcgagaa tcttttttta acacaacgtt agcttattat ctgcgtcgtg 1740 ctacagagca agtgattatt ggcggggatt ttaatgcgat aatacatcca aaagacgcta 1800 ctggcgaaag taacttcagc ctctctctca aaaatacagt acaacaagct cagattcatg 1860 atgtatggga tcgcttgcgc cgaagggtca ctcagcacac atacatcacc cataattcag 1920 cttccagaat cgaccgtata tacgtgtcaa ctggattgtt ggaacatctg agaaacgtag 1980 atacacatgt gtgttcattt acgaaccacc atgcgttgac agttagacta tgcttaccaa 2040 atcttggtag agcgactggc agaggatact ggtcaatcag accacatgtt ttaactgttg 2100 aaaacatgga tgagtttcaa ttaaaatggg actattggac acgtcgaagg cacaattata 2160 attcgtggat gcactggtgg ttaacctttg caaaacataa gataagatct ttttttcgat 2220 ggaaaaccaa acttaagtat gatgaatttc acgatcgata ccagtatttg tattcgcagc 2280 ttcgtgaagc atacgaccgc tattttggcg accgctcggt acttccaaca ataaacctta 2340 tcaaaggtaa gatgctacgt ttgcaaagtg agttcagcaa aatgttcgta catgcaaatg 2400 aaagctttgt atctggtgag ccaatctcag cattccaggt gggagaaagt agacgtaaaa 2460 gaaccattat caatcgactc gatctatccg acactgaata catagaagat tctacagaga 2520 tagaaaatca tgtctacgaa cacttccgag agatgtactc tcaaagagag gaaactttaa 2580 acaccgaatt tcacatggtc aaatgtattc cagacgatag tagggcgaat gcaaagtgca 2640 tggaccctat ttctacggag gaaatttact ttgccataaa aggaagcgct tctaaaaaga 2700 gcgctggtcc agatggactg ccgaaggagt tttacgctaa ggcctttgat attatccacg 2760 ttcatcttaa cctcatacta aacgaagcac taaggggtga attcacctct gatttcacta 2820 acggtgtgat agtccttatt aaaaagaaag gcaccaataa cacactcaaa tcctatcggc 2880 ctatttctct tttaaacgta gacttcaaac tattatctcg aatcctgaaa actcgggtag 2940 aaagagtgat gacgcaacat tcggttatca gtagctctca aaaatgctcc aatcacaaac 3000 gaaacatata ccaagccaca ttgagcttga aagatcgcat agcacacctc cgagagaata 3060 aacgttgcgg taagctagtt tcattcgact tagagagtgc attcgatcgc gtcaaccgaa 3120 cgtttctatt cgataccatg agctctcttg gttttaatgc gaatcttata gaattgcttt 3180 ctaagatagg tactctttca aactcccgta tattagtaaa cggtcatctc tctcgtgagt 3240 tccctgtgca gaggtcagtt cgtcagggcg acccgattgc tatgcattta tttgccattt 3300 acctacaacc actgataaca aagctagaac attgtctcaa cgacccagag gatctgttgg 3360 ttgtatacgc agatgatgta accataataa ccactgatat aaacaaaatc gagagagcca 3420 gactgctgtt cgagcaattt ggtaattcat ctggagcttg ccttaacatc tccaaaacga 3480 tttcgctgga tatcggttta ataagcgata acaatcggtt gataatatcg tggttatcga 3540 caacggacag agtcaagatc cttggtattg tgttcacgaa ttcggtacgt ctgatgacca 3600 atttcaactg ggatccactg atctccaagt tcgctcagca actttggatg cacaaaatgc 3660 gtctactatc gttgcatcag aaaattgtgt tactaaacag ctatgttagt tccaaaattt 3720 ggtatgtcgc atccattgtg aacattaaca aatcgcatat agcaaaactg accacattaa 3780 tgaatacatt cttatggtca ggctgcagaa atcgaattcc catcaatcaa ttagctctgg 3840 atattgagga aggaggtttg aagctacagc ttccaggatt taaatgcaaa gcattgctaa 3900 taagtagaca cttgaaagag atagagtcga tgcctttcta ttctacatac ctagaaaacg 3960 taggtaaccc tcctaacgta agaattattc cttctaactg tccttgtcta cagtcaatag 4020 tgcaggaagc agcttatttg ccagagcaat tagaaagtca tccctcacca aaaaaggtta 4080 tgtacgcatt tttgaaaaaa gtaactattc cgaaaatcat gcagaaacat ccagaagttt 4140 gttggaaaca tgtttggtgt agtatagcat ctaaaacact cagctctgat gagaggagcc 4200 tgtattacat tctaattaac caaaaagtca tgcatcgtac tcttctggta tcaatgcaac 4260 tggcagatac agctacatgc ctacattgta gttctgcaga tgaagactta gaacatcttt 4320 tttccaaatg tcagagagta agagattcgt ggagaatact aacacagatg attgaggccg 4380 tagtaggtag agcttatagt ttctctgaat tgttattctc aaacatacat acttcaagag 4440 aaaaaaagac aaaaacgatg aaattgttca taaactatgt caagtttatt aataatgtta 4500 ataatactgt agatgctagt gcaatcaaat ttcacctaga aaacctgtag attatagttt 4560 ttaatatgaa tgtaattata ccaatgacaa ataaacgatt tttataaaaa aaaaaaaaaa 4620 a 4621 // ID BEL1-I_SM repbase; DNA; INV; 5685 BP. XX AC . XX DT 08-FEB-2008 (Rel. 13.02, Created) DT 26-JAN-2010 (Rel. 15.02, Last updated, Version 3) XX DE A consensus sequence of BEL-type family (internal portion). XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1-I_SM. XX NM BEL1-I_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5685 RA Jurka J.; RT "BEL-type family of LTR-retrotransposons from Schmidtea RT mediterranea."; RL Repbase Reports 8(2), 39-39 (2008). XX DR [1] (Consensus) XX CC The protein-coding portion includes ubiquitin around positions CC 574-992. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 73..5523 FT /product="BEL1-I_SM_1p" FT /translation="MMEFTIGSRAWFPSGGHAQIRKTEAQRERGPRISEGQ FT QPSRNQLMNPRAPRQELLLFFIKHIWGSGGFRLPKFLLMTNIYLNLISILK FT FKMGRRPASCYRYYKNNSYPKSRFCRGVPDPKIRIIDLGIKKAGCDELPAC FT VCILSDEIEQLSSEALEAARNDEEEFCDGMSKISSTKNKLTRLRNRVKLSL FT EQSDGSKVYEQYLGEQEEVINRLTQLKLLGNKIGKVAEVKKIEKDINEIME FT RDETLNNQIEVMTNKKLGIESSKTKSEEYTPTLDKLKRLTIPVFNGQIREF FT PAWKAAFEVRVNKALGSKEEKLQHLREYLVGEPLKLIDTLGYSAGAYDAAL FT ERLESNYGGEERLYRNLLQELERFRPIREGQIRDFENLLSLIGTLVVRLED FT MGRDNELGAGFIFEKLLQKLSHSVLDEWLTNYDNSKNGSDKAVKILQGWLG FT SKLQRTKQAMEIKGTMEFRRTNYEERSVYNKEYRGQPVMSHVTVNQNTKNK FT SNKIEKFKYRGSCTKNQGISNRQEELKNQKCPMCEEKHYLNQCLVFRKMNV FT EERTRKVRELKLCFNCLSKGHFVKECLKEKLCRFENCEGKHNYWLHRILEP FT KTSEESVMMGVEPYQLSANKRWISLRTIPIILEKGNRQLVVNALLDDASTA FT SFISEQVATELNLSGEKVRIPIKVVGGRTQEVVAHRTEMTLKSLDGSVEKC FT FKAMALKSVTGEMRVMDWNEAAKSWPYLAELKFPSYLKRKTVDVLIGADYL FT ELQYSIKEIRGGEGEPIARLTPLGWTCIGNTGYGNSFSHQSNFVQTYFVSG FT NSMNFELQKFWELEDLEMPPSSHSMKDREILQHSIEAMQFENDRYEVELPW FT KFEKPHLKSSRRMAEKRLMNLFGRFERDKELEREYTKLIDDHERKGYIEEI FT DEKQMANEGWILPHFPVIKRDRETTKIRIVFDAAAKVEGRCLNDNIETGPK FT LYNDLFDVLIRFRKNKVAIVCDIAEMFLQIGIIKKDRKYLRFLWKKDNQLK FT FYEFNRLVFGLNAAPYLAQLVAQENARKFKEKYPRAVETIAESTYMDDSLD FT SVGTEKEASLLKAQLEEIWNKAGMRTQKWMSNSTKVLENVPIENRAIKLDI FT EEEGSMTMKTLGLRWVANLDVFEFSVKAFESETLTKRLLLSWIARIFDPLG FT FLAAFVVRGKMIMQSIWITGADWDECLPLNIDQEARLWVNEAREIVSMRVP FT RNLLESESDDWTIHVFSDASEKAYGSVAYLRRQSSGEIQCKFIAAKSKVAP FT VKTVNIPRLELLGACLGLKLAKKIVKPIGCNINEVCFWTDSHNVLYWIKQR FT SKTFKQFVANRVGTIQEETTAGQWRYVPTKMNPADMISRGATIKELQDREE FT WLGGPKFLSSSEDEWPKTELKGIKVDNREEKKVVMMTLGPEELGHKHVDCF FT APETYSDWTRLRRDYAWMLRFINNCKMKSKESRNSGDLKSNEIEEAERRII FT KIDQERYLATEVKKLIEGKRVDEKSKLIELNPVLDDDGIVRSDSRIINAEF FT LENKTRYPIILSDESWIARLIIKELHVKNKHYAGVNHTLGELNREYWVVCA FT RKLIKKCMKGCFECRRRKAKGMEQIMAPLPSFRFQEPLQAFARIAVDFAGP FT FETIQGRGKTRQKRYMCLFTCLQTRAVHLEVAYALDTDAFIRAFIRFMGRR FT GKPLMILSDNGTNFVGATNEVQEIKVDSYRFNKFMSGNDIRWIFNPPLAPH FT FGGVFEALIKSAKKALQVIVGSADLSDEELNSVFIKVESLLNARPLTTQCS FT DSKEDLVLTPNHFLLLKQPQNEDEILNPKTGASYRRRWRRNVGRER" XX SQ Sequence 5685 BP; 1956 A; 834 C; 1450 G; 1445 T; 0 other; gtaacatcaa cccatctgaa tcaattcaat agatgtctct gcatcctcgt gggccttgtt 60 gcaaataaaa agatgatgga attcaccatt ggtagcagag cgtggttccc ttcggggggc 120 cacgcacaga ttcggaaaac ggaagcccag cgagaacgcg gaccaaggat atcggaagga 180 cagcagccaa gccgaaatca gctgatgaat cccagagcac ccaggcaaga attattgttg 240 ttttttatta aacatatatg gggatctggg gggttcagac taccaaaatt ccttttgatg 300 acaaatattt acctgaattt aatttcaata ttaaaattta aaatggggcg tagaccagct 360 agctgttata gatattataa aaataattca tatcctaaaa gtcgattttg tcgtggtgtt 420 ccagacccta aaattagaat tatcgactta ggtataaaaa aggctggttg tgatgaactt 480 ccagcatgcg tttgcatatt atcagatgaa attgaacaat tgtcaagtga agctttggaa 540 gctgcacgta atgatgaaga ggaattttgt gacggtatga gtaaaatatc gtcgacaaag 600 aataaattaa cccgattaag gaatcgggtc aagctcagtc tagagcaatc ggacggatct 660 aaagtatacg aacaatacct gggggaacaa gaagaagtga ttaacaggtt gactcaatta 720 aagctactag gaaacaagat cgggaaagta gcagaagtaa aaaagataga aaaggatatc 780 aacgagatca tggaaagaga tgaaactttg aataaccaga tcgaagtgat gactaataaa 840 aagcttggaa tagagtcgag taagacgaaa tcggaggaat atacaccaac attagataag 900 ttaaaaagat taacaatccc agttttcaat ggacaaataa gagaattccc agcttggaag 960 gcagcctttg aagtgcgagt caacaaggca ctcggatcga aagaggagaa gctacaacat 1020 ctcagggaat atctggttgg agaaccccta aagttaattg atactttggg ttattctgct 1080 ggggcgtatg acgcggcttt ggaacgtcta gagtccaatt acggaggaga agaaagatta 1140 tataggaatt tgttacaaga attggagaga tttaggccga taagggaagg tcaaataaga 1200 gattttgaga atctcttatc tctaattggt actctcgtcg tccgtttgga agatatggga 1260 agagataatg agttgggcgc gggatttatt tttgaaaagt tgttgcaaaa gttgtcacac 1320 tctgtgttgg atgagtggtt gacaaattat gacaattcaa aaaatggttc tgacaaagca 1380 gttaaaatct tgcagggctg gttaggaagc aagttacagc ggacgaaaca ggcgatggaa 1440 ataaagggaa cgatggaatt ccgcagaacg aattatgaag agaggtccgt atataacaag 1500 gagtatcgag gccaacctgt aatgtcccac gtcaccgtga accaaaacac aaagaacaaa 1560 tctaataaaa ttgaaaaatt caaatatcga ggaagctgca ccaagaatca aggcataagc 1620 aatagacaag aggaattgaa aaatcagaaa tgccctatgt gtgaggagaa acattatttg 1680 aatcagtgtc ttgtgtttcg aaaaatgaat gttgaggaaa gaacaagaaa agtcagagag 1740 cttaaactct gtttcaactg tctatcgaag ggccatttcg tgaaggaatg tttgaaagaa 1800 aaattgtgca gattcgaaaa ttgtgaagga aaacacaact attggctaca tcgaatttta 1860 gagccaaaaa cctcagagga gagtgttatg atgggggtcg agccatatca gttgtcggct 1920 aataaaaggt ggatcagtct tcgaaccatt ccaataatac tggagaaagg taaccggcaa 1980 ttggttgtta atgctctgct cgatgatgca agtacagcta gttttataag tgagcaagtg 2040 gctactgaat taaaccttag tggagaaaag gtcagaatcc ctataaaagt ggttggaggg 2100 aggactcagg aagttgttgc acatcgaaca gagatgacat tgaagagttt ggatgggtcg 2160 gttgagaaat gttttaaggc aatggcatta aagtctgtga caggtgaaat gagagtcatg 2220 gactggaatg aagctgcaaa atcatggcct tatttagcgg aattgaaatt tccttcatat 2280 ttgaaaagaa agacggttga tgtcttaatt ggagcggatt atttagagct tcagtattcg 2340 attaaagaaa taagaggcgg ggaaggtgag cctattgcaa ggcttacacc gttaggatgg 2400 acctgtatcg gcaatacggg ttatggaaat tcgttttcac accaatcgaa tttcgttcaa 2460 acgtatttcg tgtctggaaa cagcatgaat tttgaattgc agaaattctg ggaattggaa 2520 gatctcgaaa tgccaccatc gtctcattca atgaaagatc gagagattct acaacattca 2580 atagaggcaa tgcaattcga gaatgataga tatgaggtag aattgccatg gaaatttgag 2640 aaacctcatt taaaatcgag taggcgaatg gcagaaaaga ggttaatgaa tttatttggt 2700 agatttgaaa gagataagga attggaaaga gaatatacca aattaattga cgatcatgag 2760 agaaaaggtt atatcgaaga aatcgatgaa aaacagatgg caaatgaagg atggatattg 2820 cctcacttcc ctgttataaa aagggatagg gaaaccacta aaataaggat tgtctttgat 2880 gcggcagcga aggttgaagg gagatgcttg aatgataata tcgagactgg gcccaagttg 2940 tataatgatt tgtttgatgt cctaatccgc ttcagaaaga acaaagtcgc cattgtgtgt 3000 gatattgctg agatgttcct acagatagga ataattaaaa aggacagaaa atatttacgg 3060 tttttatgga agaaggacaa tcagttgaag ttttatgaat tcaatcggtt ggtgtttgga 3120 ctgaacgcgg ctccttatct agctcagctt gtcgcacaag aaaatgcaag aaagttcaaa 3180 gagaaatatc caagagcggt ggagactatc gctgaatcga cttacatgga tgacagtttg 3240 gattcagtcg gaacagagaa ggaagcttcg ctgctgaaag cacaattaga agaaatttgg 3300 aacaaggcgg gaatgaggac gcagaaatgg atgtctaatt caacgaaagt tttagagaat 3360 gttccaatag agaacagagc tattaaatta gatatcgaag aagaaggaag tatgacaatg 3420 aaaactctgg gcctaagatg ggtcgctaat ttagacgtct tcgaatttag cgtaaaggcg 3480 tttgaatccg agacgttgac gaaaaggttg ttattaagct ggatagcaag gatttttgat 3540 ccattaggat tccttgccgc atttgtagta agggggaaaa tgataatgca gtcaatttgg 3600 atcacaggag ctgattggga tgagtgtcta ccgctgaata tcgatcaaga agctagattg 3660 tgggtcaatg aagcaagaga gattgtgtca atgagggttc caaggaattt gcttgagagt 3720 gagtcagatg attggacgat ccatgtgttt tcagatgctt ctgaaaaggc ttatggctcg 3780 gtggcttatt tgagaaggca atccagtgga gaaatccaat gtaaattcat agcggcaaag 3840 tcaaaagtag cccccgtaaa aacggtgaat ataccgagat tggagttgtt gggagcttgt 3900 cttggtttga aattggcgaa gaaaatagta aagcccatcg gatgtaatat caatgaggtg 3960 tgtttttgga ctgatagtca caatgttttg tattggataa agcaacgatc taagacgttc 4020 aagcagtttg ttgcaaacag ggtcggcacc attcaagaag aaaccactgc gggtcaatgg 4080 aggtatgtgc caacaaagat gaatcccgct gatatgatat cgagaggggc gacaataaaa 4140 gagttacaag atagagaaga gtggctggga ggaccaaagt ttttgagttc aagcgaagac 4200 gaatggccaa agactgaact gaagggtatt aaagtagaca atagggaaga gaagaaggtc 4260 gtgatgatga cgttggggcc agaagaatta ggtcataaac atgtagattg ttttgctccc 4320 gaaacttatt ctgattggac tcgattgaga agagactatg catggatgct ccggtttatc 4380 aataattgta aaatgaaatc aaaagaaagt agaaattctg gagacttaaa atcaaatgaa 4440 atcgaagagg ctgaaagacg aatcattaaa attgatcaag agagatattt ggcaacggaa 4500 gtaaaaaagt taatcgaagg aaagagagtc gacgagaaaa gtaagttgat tgaattgaat 4560 ccggttctgg atgatgatgg gatcgttaga agtgacagca ggataataaa tgcggaattt 4620 ttggaaaata aaacaagata ccccattatt ctgagcgatg agagttggat agcgaggctt 4680 ataataaagg aactacacgt aaagaacaag cattatgcag gtgtcaatca cactctggga 4740 gaattgaacc gggaatactg ggtggtttgt gcgagaaagt tgatcaagaa atgcatgaaa 4800 gggtgtttcg aatgcagaag gcgaaaggca aagggaatgg agcagattat ggccccattg 4860 ccatcattca ggtttcagga gcctttacag gcatttgcaa gaatagcggt agattttgcc 4920 ggtccgtttg aaaccattca gggaaggggg aaaacgagac aaaagagata tatgtgttta 4980 tttacgtgtc tccaaactag agcggttcat ttggaagtag cgtatgcatt agacacagat 5040 gcattcattc gtgcgttcat ccgttttatg ggaaggagag ggaagccgtt gatgatcctg 5100 tcggataacg gtacgaattt tgttggcgct acaaacgaag ttcaagaaat taaagtggat 5160 agctacaggt tcaataaatt catgtcggga aatgatatta gatggatttt taatcctccg 5220 cttgcaccac attttggagg agtttttgag gccttgataa agtcagcaaa gaaagcttta 5280 caagttattg tcgggagcgc tgatctctct gatgaggaat tgaatagtgt ttttatcaag 5340 gtagaaagcc tattgaacgc tcggccgtta acaactcaat gctcagatag caaagaagat 5400 ctggtcttga caccaaatca tttcctgttg ctcaagcagc cacagaacga agatgaaata 5460 ttgaacccaa aaactggtgc gtcttataga aggagatgga gaaggaacgt tggaagagag 5520 agatagtccc attatggagc actaggcgta agtggtggaa tgaaaagaat aatatcgcag 5580 ttggagatat agtatgctgt tggaaaagtc cacttatgat ggaaaatggc cacttgcacg 5640 gatatgtgag gttgttcctg gtcaagatgg tagagtcaga gttgt 5685 // ID Gypsy-150_AA-I repbase; DNA; INV; 5473 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-150_AA_; KW Gypsy-150_AA-LTR; Gypsy-150_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5473 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1025-1025 (2011). XX DR [2] (Consensus) XX CC Positions [4044-4550] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 949..2661 FT /product="Gypsy-150_AA-I_1p" FT /translation="MPLVNHIEPYVPGTIPFSQYLEQMEWVFIDNKMADAN FT EQKASFLAACGKEVYSELKLLFPGKDLKTVSFKEITDSLKKRYDKTESDMI FT QRFKFYQRVQGPTESAEDFVLAVKLQAELCEFGEFKDIAIRDKLVCGIRDP FT DVQQRLFDEEALTLAKAEKIIINRELAGANRKLVSRESGRVSVLNRLGKRQ FT EPTVSRARSRGRSRGPSNRYRSRSRSVSFDRRNRSSSRSRKFRCNFCGRDG FT HTRRFCYDLKSKEKKSVKFVDSPAAKPKMTEYQKSLRRRLNRSVSDDEDME FT CLLISSIGKINEPCFRSVLIEGIRVEMEIDCGAAVSVVSKETYEGEFGDIP FT LQKCTRKLAVINGSRLVVEGQIPVQVELNGRRKSVKLVVLRCGSSFTPLLG FT RDWLDIFFPEWRNAFGSGTNLNQLSLPTSEERVVNDIQSKFANVFDKSLFT FT PIKGFEADLVLKKGAAPVFKRAYDVPLRLRDKVVEHLESLEKDGVITPIDA FT SEWASPVIVVVKKDGGIRMVIDCKVSINKVIIPNTYPLPLAQDMFASLAGA FT KVFCSLDLTGAYTQLLLSKSQEKS" FT CDS 3270..4676 FT /product="Gypsy-150_AA-I_2p" FT /translation="MKSVEKRGPLVLPRFSLTKAQKSYPILHLEALAVVST FT IKKFHKFLFGQKFTVYTDHKPLIGIFGKEGKNSLFVTRLQRYVMELNIYDF FT DIVYRPSSKMGNADFCSRFPLPVEVPVSLQREFIKSLNVYDEFPLDHVSIA FT KESEQDEFIQRVVSYMRHGWPKKLERCLLDVYAQHQDLELVDGCLLYQDRV FT FIPASLQKQLMKLLHRNHSGITKIKQMARRTVYWYGMNSDIESFVKSCSVC FT CQMNAVAKPVSTHTSWIPTTKPFTRIHADFFYFDKKVFLVVVDSLSKWIEV FT EYMKFGTDARKVISKFMSVFARFGLPDVVVTDGGPPFSSKEFIVFFEKHGV FT KVMKSPPYNPSSNGQAERMVRVVKEGLKKFLLDPDMIGLSTEDMVSYFLFG FT YRNTCSEDGHSFPSERLFSFKPKTPLDLIHPKNSFKHHSTKPKEVDKRVVY FT SPSRKKHDWIRQVFSAASWRYCLL" XX SQ Sequence 5473 BP; 1427 A; 963 C; 1456 G; 1625 T; 2 other; cgagatatta cagtggcgac gaggtggaga tttgtttttc gcgtcccggg aagtgcaaat 60 cagtgcgagt tttccggttt cccagtgatc ttttggcgaw tttcaccggg ggaaacagtt 120 tgtttgcgag cgagccattc atcccgttgg gctgtgtttt tgccgacgga gggaaagcaa 180 cagtttgcgc tgccatactg gcagtaccta cactgcagga gttggtttgg tttcactttg 240 cacaggtttg ctgcaaaaac cgtaagtttt tctcgaaaag tggaaataaa cagagaaaaa 300 gtgttttaat ttgcatgccg tttgtgtgtt aaaagtgagt tttatgagcc attttcttct 360 gcctagcaag caaagaattt gtgtttcatt ttgttttccc gtactgtgac ccggctttgt 420 gtgtgcataa taaaaattaa atcaatcaca gccgaagtgt ttgtcacgta cgaggagttt 480 gtttcgccaa cattgtagtg tgttgcgcca aagtttgttt gcaacaaagt ttgtatcgcg 540 agaaattggt ttgtagcgag agattgttgc acttgtttat gctcatgttt gatagcaacc 600 gaagtttatg ctagtgttga gcaacaaaat tgtttggttt gtttgtgtgt ttcacgtttc 660 gttgataaag tttgtcgttg catgttgcat acgatcgttt cgaaacagtt gaaaagtgtt 720 tagtttgaca cagtagcagt ttatgttttg cactgtggga caaattgttt caaaattgga 780 caaaaaaagt tttgcaaaaa agtttttaaa tttttgcgtt tgtttagtca actgtttgtt 840 ttgatcaagt ttattcgatt gttacactga aagaaatttg tagttgaatt tcgtgtaata 900 gtttgtcgaa tcgttttcag tttttcgtcg tttttcatcg ctgtcagtat gccgttggtg 960 aatcatatcg agccgtatgt acccggcacg attccgttta gccagtactt ggagcagatg 1020 gaatgggtgt tcatcgacaa caagatggcg gatgcgaatg aacagaaagc atcgtttctg 1080 gcggcatgcg gaaaggaggt gtacagcgag ttgaagcttc tgtttcctgg aaaggatttg 1140 aagacggtgt cgttcaagga aatcaccgat tcgttgaaga agcgatacga caagactgaa 1200 agcgacatga tccaacgttt caagttttat caacgagttc aaggccctac tgaaagtgcg 1260 gaggatttcg tcctagctgt gaagctgcaa gcggagttgt gtgagtttgg cgagtttaag 1320 gacatcgcaa tccgcgacaa gcttgtgtgt ggcattcgcg atccggatgt gcaacagcgt 1380 ttgtttgatg aggaagcttt gacacttgcg aaagcggaga agatcatcat caacagggag 1440 ctagctggag cgaaccgaaa gctagtttct cgtgaatccg gtcgagtgag tgtgttgaac 1500 aggcttggta agaggcaaga gccaacagtt tcacgcgcga gatcccgtgg taggagtcgt 1560 ggaccaagta atcggtacag gagccgtagc cgatcagttt cgtttgaccg tagaaatcgt 1620 tcttcatcac gatcacggaa gtttcgctgc aatttctgtg gcagagatgg tcacacgcgt 1680 cgtttctgct acgacttgaa gagcaaagag aagaaatccg tgaagtttgt cgattctccg 1740 gcagcaaagc cgaagatgac ggagtatcag aaaagtttgc gtcgtcggtt gaacagatct 1800 gtctcggatg acgaggatat ggagtgtttg ttgatctctt ccatcggaaa aatcaacgaa 1860 ccgtgttttc gatccgtttt gatcgagggt atccgtgtcg aaatggagat agactgtgga 1920 gctgcagttt ctgtcgtcag caaggagacg tacgagggag aattcggcga tataccgttg 1980 cagaagtgta cgaggaagtt agcggtgatc aacggaagtc gtttggtggt tgaaggacag 2040 attccagtgc aggtggagtt gaacggaaga cggaaatccg ttaaactggt tgttttgcgc 2100 tgcggaagtt cgtttacacc gttgcttgga cgtgattggc tggatatctt tttccctgaa 2160 tggaggaatg cgtttggcag tggtacaaac ctgaaccagc tgtcgttgcc gacttcagag 2220 gagcgtgtcg tgaatgatat tcaaagtaag tttgccaatg tttttgataa gtctttgttt 2280 actccgataa aaggttttga ggcagatcta gtgttgaaga aaggtgcagc accggttttt 2340 aagcgtgcgt atgacgttcc attgcgttta agggacaaag ttgttgaaca tttggaaagt 2400 ttggaaaaag atggcgtcat aacaccgata gatgcgagtg agtgggcatc gccggtgatc 2460 gttgtagtga agaaagacgg cggaatcagg atggtgatcg actgcaaggt ttcgatcaac 2520 aaggtgatca tcccgaatac ctatccgttg ccgttagcac aagacatgtt tgcttcacta 2580 gctggtgcta aagtgttttg ttccctagat ctgaccggag cgtatacaca gttgttgttg 2640 tcaaaaagtc aagaaaaatc atgacgatta atacgatcaa aggtttgttt tcgtataatc 2700 gtttgccaca aggtgcttct tcaagtgctg ctatttttca acgtgtaatg gatcaagttt 2760 tgaaaggcat tgatggtgtt tattgctact tagacgatgt tttaatagcg ggtgaggatt 2820 atgaggactg tttgagaaag ctttatttgg ttttggagcg actttccaag gccaatatta 2880 aagttagttt gcaaaaatgc aagtggtttg tgacaagttt gccatttctg ggtcatgtgt 2940 tgacgataaa ggtttgttgc cgtgtccaga gaaggtcgag acgattcgta gagccaaggt 3000 tccgaacaac gtcaccgagc ttaaggcatt tttgggtttg ataaattact acggtaagtt 3060 tatccccaat ttatcctcac gtttaagctg tttgtatcgt ttactcaaaa aagacgttaa 3120 gtttgtctgg accgatgagt gtcgtcgtgt gtttgatcaa agtaagcaat cgttgttgtc 3180 ttcaaaactt ttagtgtttt tcgatccaga aaagcctgtc gttgttgtga cagatgctag 3240 cggttatggg ttaggtggcg tgatwgctca tgaaatcggt ggagaagaga ggcccattag 3300 ttttacctcg cttcagttta acgaaagcac aaaaatccta cccaatttta cacttggaag 3360 cgctggccgt tgtaagcact atcaaaaagt ttcataagtt tttgtttgga cagaagttta 3420 ctgtgtatac cgaccacaag ccgttgatag gcatatttgg caaagaaggc aaaaattcgt 3480 tgtttgtgac gcgtctgcag agatacgtaa tggaactaaa catctacgat ttcgacatcg 3540 tttacagacc gtcatcaaaa atgggcaatg cagatttttg ctcaaggttt cctttaccag 3600 ttgaagtgcc agtttcgttg caacgggagt ttatcaaaag tttgaatgtt tacgatgagt 3660 ttcctttgga ccacgtttca attgcaaaag agtcagaaca agacgagttt atccaaagag 3720 ttgtttcata catgaggcat ggttggccga aaaagttgga acgttgtttg ttggacgttt 3780 atgcacaaca tcaagatctc gaactggtcg atggttgttt attgtaccag gatcgtgtgt 3840 ttattcctgc tagtttgcaa aagcagttga tgaagttgct gcatcggaat cactcaggaa 3900 tcaccaaaat caagcagatg gcaagaagga cagtttactg gtatggcatg aatagtgaca 3960 ttgaaagttt cgtcaaatca tgtagtgtgt gttgccagat gaatgcggtt gcaaagccag 4020 tctcaacaca taccagttgg attcctacaa caaaaccgtt tacccgtatt catgcagact 4080 ttttctactt cgacaagaag gttttcctgg tcgttgtgga cagcctttca aaatggattg 4140 aagttgagta tatgaagttt ggaacagatg ccaggaaagt catatcaaag tttatgagcg 4200 ttttcgctag gtttgggtta cctgacgtag ttgttactga tggcggacca ccgtttagct 4260 cgaaagagtt catcgttttc ttcgagaaac atggagtgaa ggtgatgaag agtccaccgt 4320 acaacccatc gagcaacggt caagcggagc gcatggtcag agtagtgaag gaaggtttga 4380 agaagtttct gttggatccg gatatgattg gtttgagtac tgaagacatg gtttcgtact 4440 tcttgtttgg ttaccggaat acttgttcgg aagacggaca ctcgtttccg tccgagagat 4500 tgtttagttt taagcctaag acgccgttgg atttaatcca tccaaagaat agttttaaac 4560 atcattcgac gaagcccaag gaagtggata agcgtgttgt atatagtcct agtcgtaaga 4620 aacatgactg gatcagacaa gttttctcag ctgcgtcctg gagatactgt ttattataaa 4680 aatttcagac ctacggatat tagacgatgg ttagaagcca aatttctcag acgtatctca 4740 tcaaatgttt ttcaggtttc cgttggaggg agagtctact ccgctcaccg aaaccagctg 4800 aagctaaggt cggcaccgcc caaagcgccg aggtttgttc ttggggagga gcgagaggaa 4860 ccaacgcaga aaacgactag agaggacgat gacgatgacg ataggtttag cgacgattcg 4920 acgaagagcg acttttacgg ttttccttcg gattccttca tctacagtga cgggggcggt 4980 cgtaacgggg tcgcaagtcg tggtcctgag gcggaggttg gtgttccgcg ccgtcgaagt 5040 gcgtcacacg tggaggaaga tctacgaaga tcgtcgagtt tgccggttga ggaggtcgag 5100 ccgcaggctg gtccgtcgca tcgagctgtg tgccagtcgt caccatcgaa gaatcgacag 5160 gagagtgtca ggcgttctaa gcgacctaag cgtctgaaga aggatgtcga ttacgagtac 5220 tattgaactg ttgtgtttgt ttatatgcca tggaggcatt caaaagtgaa ttagtttatc 5280 ggcagtcatg ttttgccgtg ttctgtagtt ttggtagttt ttaccgatca aattgtttag 5340 acggcagtga agtttcgccg tagttttgtt tgaagttttc ggcagtcgag ttttgccgac 5400 agtcagtaga gcggcagtca ggttttgctg cgtattcatt gtaaatatgt ttaaaaacta 5460 aaagggggag aac 5473 // ID DNA3-10_AP repbase; DNA; INV; 341 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-10_AP. XX NM DNA3-10_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-341 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1951-1951 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 341 BP; 104 A; 66 C; 71 G; 100 T; 0 other; aaggtgggtt tccactagat acgtgtacgt gtacgtgcat aaaaatactt tactgtgatt 60 ggttggtata cacggtaagc ttttggtata ccagggtgtt atacgttctt accacaggct 120 tgaccatgat accatgaagt tggatccagt tcaacttttc acgtgtacac ggctctatat 180 tgataacaat attgataaca taagaatata cgaggtaact aatatgttac caatccataa 240 ccgatctaac cgatgggaaa cgtttgtgag accgttcata gcgaacgtat ccggtcaata 300 tatgcacgta aacgtacacg tatgtagtgg aaacccacct t 341 // ID hAT-38_HM repbase; DNA; INV; 3215 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-38_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3215 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2026-2026 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 810..2993 FT /product="hAT-38_HM_1p" FT /translation="MSKKTKLNEEILEKTINSADIEPIQSISNKTSASCSS FT ASSTISLESCTVRNSQLPDCWTEKQVSLFTEQYKWLVVKDGKLGCKDCSKI FT KTLGIENKKHIHVSNEWISCSIIPNGTKKDTRQASLRKKIKEHSTSCSHLK FT IQETIIQSNKQVLPNVVKILNSEAINITVKIFNTVYSLVKRNRPLSDIEAA FT LELQEKNGVNIGNCLHSRRTATRIVEHIAMQIKKTVFKNIINQNSKICIII FT DEASTLAKKSTLVIYLQCEVQSAIEPLMIFVDLKELTSTTAENIFMTLIDC FT LTYYGFDKNYLQRNLIGFCSDGASTMLGRKSGVATKLTEMFPNIIVWHCLN FT HRLQLALDDSISDIKQVNHLKIFLDKIYAIYHQSNKNQAELETISSDLNLE FT IIKIGRVFGPHWASCSLRAATAVWRAYPALYKHFIQSKSNSGMAKRLANTH FT FIEDLALMIDILEEFSILSNALQSRKITISKADQLIKRSIRAIELHKNDNF FT GIYEKKVQNLISSNEFKCIPFDYNNKFVSLPREQLLDSVAKNMKARLLSAS FT HINMDEDKNENNDTDNYEKFVQLFEILDPSTWSLDKIMSPWLEGEDMVRKL FT CEILKFKVDVNDFRDFVDANIQLKNVQLPNSILRAKKIASTIAISSSEAER FT GFSLMNLIASKLRTSLTVEHISNLMTINLLGKPIANFDAIPFVTSWMNQNH FT RLATETRVRRGHNNDVSANEQGIWTLH*" XX SQ Sequence 3215 BP; 1234 A; 468 C; 503 G; 1010 T; 0 other; tgggctggag ccggctcggt ccggctcgcg cgagccggtt gttaaaattt ttgaaactcg 60 gcgagccgct tgttaaatta accctaaaac caaaaataag ccattttaat agtatgttta 120 tgcaaaatat gcacatcgcg ctgatgtgcc acacactcgc aaacatatca tgtgcgcaac 180 acaatacaca catcatgtgc gtacgtaaaa tacgaagtgt aataaacact gttccaaatc 240 gtaaaatatt acaacaaatg aaatgaagac atgacactgt taactgcaat ttgggaatgt 300 tttctctgtc tacttgtatt atgcagaata actgtaattt agcatttttt tctttatgtc 360 ttcttataga attagaatgt agaacattta tatgtaaaca aacaaaaaca caatttaaaa 420 gttggataga gcacgaaaaa atcaatttta atttcggaat tcgctggtaa aattttcgga 480 attcgaaact atattatgaa aacgaattcc ggaattctag aatttcgaat ttcgagatgc 540 aatccctaag attatattta ctataaaaac ttactttgtt tactgaattg acaatacaaa 600 atacaattaa ttagtctaat tgctaaggta agtaatataa gtaaaattaa atttttagtt 660 tacataaata aacaattgta aattattatt tgtaaataat ataaattcat tcatttaaaa 720 attgtgtgaa tataatggca gtatttaaat tttagatttt ttttattaaa aaatgaagag 780 agtgcaagga aatttattgt cctttgtaaa tgtcaaaaaa aacaaaactc aatgaagaaa 840 tactagagaa aacaataaat tctgcagata ttgaacctat acaatcaatt tcaaataaaa 900 catcagcttc atgttcatca gcatcatcta caatatcatt ggaatcatgc acagtacgca 960 atagtcaact accagattgt tggacagaaa aacaagtgag tttgttcaca gaacaatata 1020 aatggttagt ggtaaaagat ggcaaattag gttgtaaaga ctgttcaaaa ataaaaacct 1080 taggaataga aaacaaaaaa catattcacg tatcaaatga gtggatttca tgttcaataa 1140 ttcctaatgg aactaaaaaa gacacaaggc aagcttctct acgcaaaaaa attaaagaac 1200 attccacttc atgctcacat ttaaaaattc aagaaacaat tattcaatct aacaaacaag 1260 ttttaccaaa tgtagttaaa atattaaatt ctgaagcaat taacattact gtaaaaattt 1320 tcaacacagt ttatagtctt gttaaacgaa acagaccatt gtctgatata gaagcagctt 1380 tggaattaca agaaaaaaat ggtgttaata taggaaattg tttacattcc agacgcactg 1440 ccactagaat tgtcgaacat attgcaatgc agattaaaaa aactgttttt aaaaacatta 1500 tcaatcaaaa ttcaaaaatt tgtatcataa ttgacgaagc ttctacactt gctaaaaaaa 1560 gtactttggt aatatattta cagtgtgaag tacagtctgc aattgaacct ctcatgattt 1620 tcgtagattt aaaagaattg acgtcaacaa ctgccgaaaa tatttttatg accttaatag 1680 actgtcttac ttactatgga tttgataaaa attatttaca aagaaatctc ataggattct 1740 gttcagatgg tgcatcgaca atgttaggaa gaaaatctgg agtagccaca aaattaactg 1800 aaatgtttcc aaatataatc gtgtggcatt gtttgaatca ccgattgcaa ttggcactag 1860 atgattccat aagcgacatt aaacaagtaa atcatttaaa aatattttta gacaaaatat 1920 atgcaatata tcatcagtca aataaaaacc aagccgaatt ggaaactatt tctagtgatt 1980 tgaacctaga aataataaaa attggtcgcg tttttggacc tcattgggca tcctgtagct 2040 tacgagcagc tactgctgtg tggagagcat acccagcgtt gtataaacat tttatccagt 2100 ccaaatcaaa ttcaggaatg gccaaaagat tagctaatac acattttatt gaagatttag 2160 ctttaatgat agatatctta gaagaatttt caatactatc caatgcgcta caatcaagaa 2220 aaattaccat atctaaagct gatcaattaa ttaagcgttc aataagagct atagaactac 2280 acaagaatga taattttggt atctatgaaa aaaaagttca aaatttaatt tcatcaaatg 2340 agttcaaatg cataccattt gattacaata ataaatttgt gtctcttccc agagagcaat 2400 tattagatag tgtggcaaag aatatgaaag caagactttt atcggctagt catataaata 2460 tggatgagga taagaatgag aacaatgaca ctgataacta tgaaaaattt gtgcagctct 2520 tcgagatttt agatccttct acctggtcat tagataaaat tatgtcacca tggctagaag 2580 gtgaagacat ggttagaaaa ttatgtgaaa ttttaaagtt taaagtagat gttaatgatt 2640 ttcgtgactt tgtagatgca aacatacagc tgaaaaatgt tcagttacct aattcaattt 2700 taagggctaa aaaaatagca agcacaattg ctattagcag ttcagaagct gagagaggtt 2760 ttagccttat gaatttaatt gcatcaaaat taagaaccag ccttactgtg gaacatatat 2820 ccaatctaat gacaataaat ttattgggga aaccaatagc aaatttcgat gccattcctt 2880 ttgttacatc ttggatgaac cagaatcaca ggttagcaac tgaaacaagg gtaagaagag 2940 ggcacaataa tgatgtttct gcaaatgaac aaggcatatg gactctacat taattaatta 3000 attaattagg catatacata gcgtagtatg ttaaatttgg aatttatcag ttttattgtg 3060 caataaatat tttgtttagt tattttgtgc aataaatgtt ttgttttgtt atatatattt 3120 tatactgtac aaaaattaat gaagacaatg tacgcattta ccgttttttt tttttgaaat 3180 aatagcgagc cggttgttca aaatttggca gccca 3215 // ID DIRS-1_DPu repbase; DNA; INV; 5276 BP. XX AC scaffold_58; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE DIRS retrotransposon from Daphnia. XX KW DIRS; LTR Retrotransposon; Transposable Element; nonautonomous; KW DIRS-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5276 RA Jurka J.; RT "DIRS retrotransposons from Daphnia."; RL Direct Submission to RU (08-JUN-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_58; Positions 703677 708952. XX FH Key Location/Qualifiers FT CDS 134..3898 FT /product="DIRS-1_DPu_1p" FT /translation="MSRISRNRPPPSEMFECRAFDPRAEGFEPDADRERIR FT ERNFRIESQRQHPSFPERDRSPIRSPRYLPQWPNRDRAYFEHDGFASVFNV FT DIHNNRSYDRFGHFDERDGRPGGRERQCDDFNDRRQPWHEMRDNRNEPREY FT RCGEDGFYLFREDGLCYDAEGFCYSFGEDGFYYDVEEGRRYDSGGFACDEY FT GVRFDFADGPQEDNGGNLDARPLDPVSPRVVRARKTPPRRRVSIVEPDAEP FT EDRRRPWRPQPSTSPTAEDDVVDLGEETTPTPLESLSLPGSSVAQPLSVSK FT VISDEISGWMTSGISTEASKAISKEFPIEFIDADFAIKPPKLDGWIGRRAQ FT SRPDKGLLKTINATEDSLTKAQLTIMDIAPPLIDLYSRLSSLPGSDAAKRS FT VQAALQQWGRAYFHITRKRRSAVIALAEPASEYLLRDPGAFESGKEARSFL FT LTEKFLQAMLTDASQDNTLAQAARVAAAAAAAIAPKAQNPRRYRFPPEQAA FT APFQQTTRYDGVQGSRGGRGGRGRRGSRGRGQRHNAWLQGGRRYVYLNPDP FT LASSQNDPKPPSHSLSNQARTFNPTFHEGGTAPTGVASRLKSFASHWASIT FT NDIWVLKTVSEGLRLEFTAEPVQNFFPPEISMSDEMTAICDQEVDDLLFKQ FT AITEITDGSDGFVCSFFCIKKKQAGKFRPIVNLKPLNHFIKYQHFKMENLE FT SVRFLIREGDWMVKVDLQDAYFTVAILHSHQKYLRFRWKKRIFEFKCMPFG FT LSSAPRAFTKILKVVMAFLRKKGIRIIIYLDDILILNGSREGLLADLEQVM FT ELLQALGFIINQDKSVLSPAQIIEYSGLVVSSIDLSFALPAKKAEAVKRMC FT ESALAEGMVSLRALASIQGNFSWAIPAIPFAQSHYRSLQRFYILNAQRAYS FT DLNTKVRLSPCAQVDLKWWVANIEKCSGKTFFPRDPDMEIFSDASLSGWGA FT VCSGVTTRGPWTLNDKEKHINELELLGALYAIQAFGADSSNIAIRIYLDNS FT TAVSYVNKCGGTRSSVLTATAKTLSAWCEIRNISIEAVYLAGELNVVADRE FT SRAEADASDWQLDPSVFSEMREIWEMDTDLFAAPWNAQLPSFVSWKPQPGA FT TAINAFSISWDRVRGYAFPPFSLIFRCLEKLRREKATIVLICPIWPGQPWF FT PVMLEHACDIPRLLVPAASLVTSAQGVSHPLLQSGALKLAAWRLSGDLTVC FT KGFRSQLSSFSWKGAALTPTRHTNQRGTPGVIGVCCGIKIPFLTM" FT CDS 3631..4698 FT /product="DIRS-1_DPu_2p" FT /note="tyrosine recombinase." FT /translation="MRYPPPTRTSGKPRYFGTRRVSPASPVRSAKAGRLET FT LRRSYSLQGFSEPAIELLLEGCRANTNAAYESAWNTWCDWCLLRDKNPLSN FT NVKFISDYLAHLQTTGKSYSLINIHRSMLSVTLKPADGCPIGQHPLIVKLL FT KGCYNRNPPKPKYNVTWDPSLVLNFMASSGDNSALPIPMLVGKLATLFALA FT TLLRVSELASIPFSSIKFTENSVQFALSKPRKAQRNGPLQSFTLPACPDSD FT ACPVASLRSYVERTGTNRPSKDEGMLFISTIAPFGPVTSNTVGRWIKNFLK FT TAGIDTSIFSAHSTRSAAASLAVARGLSIDAVLQAGHWASQTTFGRFYNRG FT VDATFAASVLNDA" XX SQ Sequence 5276 BP; 1341 A; 1363 C; 1306 G; 1266 T; 0 other; accctaaggg tgtagactca tttattttag cttttcaaag attttgaagg ccaccttgcc 60 ttcttccccc gtacaacgcc ccctttgtct cagactcatt tttgctttcg cgcaagcaac 120 aacgctctag acaatgtcaa ggatcagtag gaaccgacca ccgccgagtg aaatgtttga 180 atgtcgggct ttcgatccac gtgctgaagg cttcgagccg gacgctgatc gagagagaat 240 ccgtgagcga aattttcgca tcgagtcgca gcgtcaacat cctagtttcc ccgagagaga 300 taggtcgcca attcgttccc cgcgttatct gccgcagtgg ccaaacagag atcgggcata 360 ttttgaacac gatggcttcg cttcggtgtt taatgtggac atccacaaca accgctctta 420 cgatcggttc ggccattttg acgaacgtga cggccgcccc ggcggtcggg aaaggcagtg 480 cgacgacttc aacgatcgca ggcagccctg gcacgagatg cgcgacaacc ggaacgaacc 540 acgtgagtac cgctgtggcg aagacggttt ttatctgttc agggaagatg ggctgtgcta 600 tgatgccgaa ggcttttgtt atagcttcgg tgaggacggt ttttattacg atgtcgaaga 660 aggccgtcga tatgattcag gtggtttcgc gtgcgatgaa tacggtgtgc gtttcgactt 720 tgccgatggg ccccaagagg acaatggtgg caatttagac gcccgcccgc tcgatcctgt 780 gtcgcctcgt gtcgtgaggg ccaggaaaac tccacctcgt aggcgcgtga gtatcgtgga 840 accagacgcc gaaccggaag acagacgacg cccatggcgc cctcaaccat caactagccc 900 gacagcagaa gacgacgtag tagatttagg tgaagagacc acgccgacac cgctagaaag 960 cctaagtttg ccaggtagtt cggtggcgca gccattatcc gtctccaaag taatctccga 1020 tgagatttca ggctggatga cgagcgggat ttcgacagag gcctccaagg caatttcgaa 1080 ggaattcccg atcgagttca tagacgcgga ctttgcgatc aagccgccca aactcgacgg 1140 ttggatcggc cggcgcgcgc agtcgagacc tgacaaaggt cttcttaaaa caattaacgc 1200 aacggaagat tcccttacta aagcgcagct gacaataatg gacatcgcac cgccattaat 1260 cgatctctac tcccggttga gttccttgcc gggcagcgac gcagccaaaa gatcggttca 1320 agccgcgctt cagcagtggg gccgtgctta ctttcacatc acgcggaaac gtcgcagtgc 1380 cgtaattgcg ctcgctgaac cggcttcaga atatttattg cgtgaccctg gtgcctttga 1440 gagtggaaag gaggcgcgct ccttcctctt aacagagaaa tttcttcagg caatgctgac 1500 ggacgctagt caggacaaca cgctagctca ggcagctaga gtagcggcag cagccgcagc 1560 cgctatcgca cccaaggccc agaatccaag gcggtaccgt ttccctccgg agcaggcggc 1620 cgctcccttc cagcaaacca cgcgctatga tggcgttcaa ggcagccgag gcggaagagg 1680 tggccgcgga agaagaggca gtagaggcag gggtcaacgc cacaacgcgt ggctacaagg 1740 aggccgaagg tatgtttatt taaatcctga tcctttggcc tctagtcaaa atgaccctaa 1800 acccccctct cattcactca gtaaccaagc gcgcacgttt aatcccactt ttcacgaagg 1860 aggtacggct ccgacgggcg tcgcctcgag attgaaaagt tttgcttccc attgggcgtc 1920 gattacgaac gacatttggg tccttaaaac agttagcgag ggcctgcgct tagaattcac 1980 tgccgagcca gttcaaaatt tcttccctcc cgagatttct atgtcagatg agatgacggc 2040 catctgcgat caagaagtgg acgatcttct tttcaaacaa gcgataacag agatcacgga 2100 cggctccgac ggctttgttt gctcgttttt ttgtattaag aaaaaacagg cggggaaatt 2160 tagaccgata gttaatttaa aacctcttaa ccatttcatt aaatatcagc actttaaaat 2220 ggaaaatctc gaatcggtcc gttttttaat cagggaaggc gactggatgg tcaaagtgga 2280 cctgcaggat gcctacttca ctgtagcgat cctacattcc catcaaaaat atttacgctt 2340 ccgctggaag aagcgcattt ttgaattcaa atgtatgcca tttggtcttt cttcggcgcc 2400 aagggccttt actaaaatcc tgaaagtagt tatggctttt ctccgcaaaa aaggcatccg 2460 gataatcatt tatctggacg atattttgat cttaaacggc tcaagagaag ggctcttagc 2520 ggatcttgag caggtgatgg agctccttca ggcactcgga ttcataatca accaggataa 2580 atccgttctt tcccccgctc aaatcataga atattcggga ctcgtcgtca gctcgataga 2640 tttatcattt gcgctacccg cgaaaaaggc agaagcagtg aaaagaatgt gcgagtcggc 2700 cctagccgag ggaatggtat cactgcgggc tttggcctca atccagggaa atttttcctg 2760 ggctattccg gcgatcccct tcgcgcaatc gcactatcgt agtctgcagc gattttacat 2820 attgaatgcg caacgggcct attcagattt aaacactaag gttcggctat ccccctgcgc 2880 ccaagtcgat cttaagtggt gggtagcaaa catcgagaaa tgtagcggaa aaacgttttt 2940 cccgcgggat ccggacatgg aaattttttc agacgcgtca ttatcaggat ggggggccgt 3000 gtgtagcggt gtcactaccc ggggtccctg gacactaaac gacaaggaaa aacacatcaa 3060 cgagctcgag ttactaggag ctttatacgc aatccaagca ttcggggccg attccagcaa 3120 catagcgatc cggatttacc tagacaattc gaccgcagtt agctacgtaa acaaatgcgg 3180 cggcaccaga tcaagcgttc tcacagccac ggctaaaaca ctatcggcct ggtgtgaaat 3240 ccgcaacatt tcaatcgagg ccgtttatct agcaggcgag ttaaacgtgg tcgccgaccg 3300 ggaatcaagg gctgaagcag acgccagcga ttggcagctg gacccgtccg tcttttccga 3360 aatgcgggaa atttgggaaa tggacacgga tttatttgcc gcaccgtgga acgctcaatt 3420 accaagtttc gtatcgtgga agccacaacc cggggcaacg gccatcaatg cattctcaat 3480 cagctgggat cgagttagag gatacgcttt cccccccttc tcgttaatat ttagatgtct 3540 tgaaaaattg agacgcgaaa aagcgacaat cgttttaatt tgccccattt ggccaggcca 3600 gccgtggttt ccggtgatgc tggaacacgc atgcgatatc ccccgcctac tcgtaccagc 3660 ggcaagcctc gttacttcgg cacaaggcgt gtctcacccg cttctccagt caggagcgct 3720 aaagctggcc gcttggagac tctccggaga tcttacagtc tgcaagggtt ttcggagcca 3780 gctatcgagc ttctcctgga agggtgccgc gctaacacca acgcggcata cgaatcagcg 3840 tggaacacct ggtgtgattg gtgtttgttg cgggataaaa atcccctttc taacaatgta 3900 aaattcattt ccgattatct ggcgcatttg caaacgaccg gcaaatcata tagtttaatt 3960 aatatccacc gttccatgct ctccgtcaca ctaaaacccg ccgacgggtg cccaatcggg 4020 cagcacccac ttatcgtcaa gttattaaaa ggatgctaca accgtaatcc ccccaagccc 4080 aagtataacg taacatggga ccctagtctt gttctcaact tcatggcctc ctcaggcgac 4140 aatagcgcac ttccgatacc catgctagta gggaaattag ctaccttgtt tgcgttggcg 4200 actctgctaa gagtgtcaga actagcctca attccatttt catcaattaa atttacagaa 4260 aattcagttc aattcgccct atcgaaaccg cggaaagcgc agcggaacgg tcccctgcag 4320 tcgtttacac tgcccgcctg cccagattcg gacgcgtgtc cagtagcgtc cttacggtca 4380 tatgttgaac gcacaggcac caacaggcca tcaaaggacg aagggatgct ctttataagc 4440 acgatcgcgc ccttcggccc agtaacgagc aatacagtcg gtcgctggat caaaaatttt 4500 ctaaaaacgg cagggatcga tacttcaata ttcagcgcgc attcgactcg aagtgcggca 4560 gcatccttgg ccgtcgcacg gggactttcc atagacgcgg tattacaggc cggccattgg 4620 gccagccaaa caacgtttgg ccgcttctat aatcgcggag tcgacgcaac atttgcggca 4680 tcggttttga acgatgcata agctttgaaa tcacccttag ggtagagcgg aatgaattcc 4740 gctatacaat tggaaattac cagagtgatc gctcagcgat cacgaagggt aatttgaatt 4800 gtatgaggaa gaatgagaga gaccctaagg gtcttctaac ccatcccaca atcctcccgt 4860 ctcattcaat cctcacatct tttttaatgt tgcttaattt ctcaccacgg ccaggatcca 4920 accggacaga ctgtcaaaga ggggtcgaga ggccggccaa gcagagtgaa gaagaatatc 4980 ttcaacttaa atcaattctc gttttgttgt ttccctattc catgaaggca ccatttgcca 5040 ttctttgctt gtttcacaat gcggcttttt ctggccctgc cggccagttt tttatgccaa 5100 tttgtttttg ttttgtatgc cattccttgc aatgttgtgt tatgttataa ccctgtcatg 5160 cgctttcccc aagaaaaatt agtctgagac aaagggggcg ttgtacgggg gaagaaggca 5220 aggtggcctt caaaatcttt gaaaagctaa aataaatgag tctacaccct tagggt 5276 // ID HAT1_Cis repbase; DNA; INV; 1134 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; DNA; HAT1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-1134 RA Smit A.F.; RT "HAT1_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Ci000017, 2% div, 60 bp TIRs, 8bp duplications with a bias for CC NYYYRRRN, but not as clear as Charlie family; still bias CC suggests membership in hAT family. XX SQ Sequence 1134 BP; 388 A; 195 C; 198 G; 353 T; 0 other; tagggatgca caatatagaa tttcgaatct ttgagattcg aatctctagg gattcgaatc 60 tttacaagca atttaataag aatagttagg attcgaaatt aaaaacgtaa aatattggct 120 gcgatccaac gcgcaacaac acgcgtgatg ccgattatgc agacgtcagc gtatcgtttc 180 agcaagcgtg cgacagtgga taacaatgat aagaaacttt tggtgaatcg tcattgaagt 240 aaattcgtaa tctcaaaatt gtaaaatggc ttcttcattc gtatgtaata cccgtaatgc 300 gtttctccat accgaaagtg catctatcgt gcgaaagcgt ccgaacattt tctaaagcgg 360 tagctcggaa atacaacttg catcaaagca caacaacatt atcttatcag ttgtcccata 420 cacgtgcatc gtggacagac gcgcttaaaa cgaacgcacg cacaaagaaa gtaggtcacg 480 ttcgccaggc tgtaatcacg tggtttatcg cacctttaaa tatgacgcaa ttattccaaa 540 ggcggaagag ttatttacag actaaacacg ttagtatttg caatatatat ttcacttaaa 600 caactatgtt ttcaataaac acaatgaatg atatcctatg tgttcaattt aatgatatcc 660 tatgtgttat ttagttaggt gattttgcag tgattaattt attatacttt taccgaggta 720 taactgtgtt aaatagtaaa ttaatgttta attgtgcaag ggcattaata gctagtcttg 780 ataccaaact ggtgaaataa aaattgtctg gtgtttggga gcatttttct cagccattca 840 tttcaacaac aacctactag actattatcc tccctcatgg cagatagtgt ttagagatgt 900 tacatatgat taaaaaaata tgtaatggta aaatattgtt ggactacgta aattgctctc 960 atcacagttt aaaagcatat ggctaattca ctataaatga gaaaatatat gcacagcata 1020 gcaaatttaa atacatggta aaacattcta gaataggatt cgaaattcta gattcgaaat 1080 atattattct agaatattct agaataccta actatcctat attgtgcatc ccta 1134 // ID Gypsy7-I_Dya repbase; DNA; INV; 4163 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7_Dya; KW Gypsy7-LTR_Dya; Gypsy7-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1064-1064 (2009). XX DR Genome; chrU; Positions 1435651 1431489. XX CC Positions [3236-3697] - Integrase core CC 'CAAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2252..4162 FT /product="Gypsy7-I_Dya_1p" FT /translation="MWENFIPDLADNKEPLRLLLRKDSIFQWGNTEEKAFQ FT KLKCHLAHVPNLAYFNPKNRTRLIADASPVALGSVLLQFDGKGDPLIISFA FT SKALSEVEKRYSQTEKESLALVWAVEKFYYYLAGLHFELVTDHKPLEAIFK FT PTSKPPARIERWLLRLQSYRFKVIYKTEKENISDILSRLCQSSSSPTTNRM FT EYSILRVVESAIPKSMTISEIAESSLRDEEVVDAIHSLENNSWEREKSTKL FT HPFRFELSTIGPLLLRGNRIVIPTNLRKRVLELANEGHPGESAMKRRLRSK FT VWWPLIDRDAEEFVKTCRDCLMVSQSIRPAPMNRNPFPTGPWIWVASDLLG FT PLPNNEFILVFIDYFSRYMEFKFLRSISSSSLIGVMKEIFCRLGYPKYLRT FT DNGRQYISDEFSEYCKTCGIEQVRTPPYWPQANGEVENMNKSLVKRLKIAY FT SNSKNYKEEIQKFVVMHNVTPHGTTGIAPNQLMFNRIIRDKLPGIEELNEQ FT FTDSAERDRDIIQKEKGKEAADKKRGAKNIDIKIGDKVLVRNVVFPNKLTP FT TFDPTEFEVLERHKNIVKIAGGGKTLMRNVAHVKKIPGDVYVPPAPESGTL FT QTIPAPFQEALEDQPSEVPAQHSPPKVKPLKLRLINKGG" XX SQ Sequence 4163 BP; 1433 A; 786 C; 906 G; 1038 T; 0 other; ttggcgacga ggtaaattat aactattata aacgtcgtgt gcttaagaaa taacaggcaa 60 ataaatgaaa atagcaaaca gtgtaaacag caagcgcata tgactcctgt aaaaatactc 120 ttgtttgtgg ttaaaatcca actatataca catacatata tggaaaacag aaattatata 180 tatatgtgtt cacaaaagtg tacagtcgga ggaagaatac attgaaatct aaagtccaat 240 ataaaataag tgcatgcaaa agaaaaagga aaagaaacga aggatcaaaa taacggggga 300 atagaagcta attcaagaga gagtgagatc tctcgacaag ctgcggttac gaaaggctga 360 gtcaagagaa agagatctct tgacaagctt taatgacaaa aggctgcgta aagagagttc 420 actttacaag ctgtattgat gaaaaaaaaa gaggccgcgt aagagagaat tctcttttac 480 aagctgttag aaaagaggct gtgcagaaga gtgttattac tgcaagctgg aaggctgcgt 540 aaagttatct ttacaagctg aactaataaa aagccgggca agagagaatt tctccttgca 600 agctgttgaa aataaaaagc tgtgcgcgtg aagagaattc tcttggcaag cggtgatata 660 ttcagaaact ataaatggct gagtaactta tattatatat gtacatgtac atataagtta 720 tacaagccaa agctagaaaa ataatgttgg tcttagaagg gaaaaatgta atatatatat 780 atatatatat atatatatat atatatatct gatgtcgacg caattaggaa gagttctact 840 cgaaagacaa aaccacaagg ggaatgttca agatgcggac gttttggtca cgctagttat 900 gactcatcat gcccagcaag gcaagcaaaa tgctacaaat gttcaaaaat tggtcatttt 960 tcccgaaagt gtagaaccac cctgaagaga cggaattctc agtgtaattt tgaaaatgtt 1020 aagaaacgta agacgaatgt tcgttgtgtg gagggcgaga acacagagct caaattggaa 1080 aaggagaaca caaactgttt caaaatagca agtgatgagg aagaagagtt tatatactac 1140 cgtgtggaag gccagaaaat tcctctagtt attgactcag gatcaaggtt caatctcatc 1200 agcagctcgg attggcaact gctacaaaag aaaggcgcta taatattcaa tgacagatca 1260 cattctgaaa aacagtttcg aggctatgct tcttaacaat tgcttgaggt aattcgtatt 1320 tttgaggcac ccatttcggc tgaaagagat attgaagtga ttgcgacatt ttacgttatc 1380 aaaaatggtc gacaatcatt gttggggcga cccacagcta tacaactcaa tgtccttcgc 1440 ctaggtttga atgtcaaccg cgatgaagaa cctacaccgt tcccaaagtg gaaaggcgtc 1500 aaagtaaaac tatgcataga tcccgagatt aggccggtgc aacagccggt aagaagaatt 1560 ccggtagctt tagaggagaa agttcatgct aagctggagg aagcccttgg acatgacata 1620 atcgaaccag ttttaggtcc aagttcttgg atctctccca tcgtactggc atttaaagag 1680 aatggagaca ttcggttatg cgtggatatg cgattggcaa ataaggcaat tcagcgagag 1740 aactaccctt tacctatttt tgaatcattt atgacaaagt taaaagacgc aaagttcttc 1800 tcccgccttg atcttaagga tgcctatcac caataagaat tggaagagtc tagtagggag 1860 attacaacgt ttataacgtc aaaggagctc ttccgatata agagattgat gtttggggtg 1920 aattcggctc ctgagatttt tcagcgccgg ctggaaacac tgttagctgc attcccgaat 1980 gccatgaatt acatcgatga cgtgatcact tttggtgcca acgaactcga acatgacgaa 2040 acagtaaaag cagtttgcaa agtctttaac gacaataatg ttttattaaa caaacaaaag 2100 tgcatttgga aaacaaacag gctaaaattc ttggagcaca ttttgtccga ctcgggaatc 2160 gaagttgacc ccgagaagat agaggttatt aagtctttta gagtccccaa gaataaggaa 2220 gaaacccgca gttttctcgg cctcattaca tatgtgggaa aattttattc ctgatttggc 2280 cgataacaag gagcccctac gcctattatt acgaaaagac agcatatttc aatggggcaa 2340 caccgaggaa aaggcttttc aaaagttgaa gtgtcattta gcacatgttc ctaatcttgc 2400 ttattttaat ccaaagaacc gcacccgttt aatagcggac gcaagtcctg tagctcttgg 2460 atcggtcctt ctgcagtttg atggtaaagg cgatccgttg attatatcat ttgccagcaa 2520 agccctgtct gaggttgaaa agcgctactc tcaaacggag aaagagagtt tagcacttgt 2580 ctgggcagta gagaaattct actattacct agctggtcta cacttcgagc tggtaacaga 2640 tcataagccc ttagaggcta tcttcaaacc gacatcgaag cccccggctc gtattgagag 2700 atggttacta cgccttcaat cataccgttt taaagtaatc tataaaaccg aaaaagaaaa 2760 tatatctgat atattgtctc gactttgcca atcatcatca agtccaacta ccaacagaat 2820 ggagtacagc atcctgcggg tagtggaaag cgctattccc aagtcaatga ccatctccga 2880 gatcgctgaa tcgtcattgc gagacgaaga agttgtcgat gcaatacaca gtctagagaa 2940 taactcttgg gaacgagaaa aatcaacaaa attacatcca tttcgatttg aactgtcgac 3000 cataggacct ctcctgctaa gaggaaaccg cattgtgatc ccaacaaacc ttcgcaaacg 3060 ggtacttgaa ttggcaaatg aaggacaccc aggagaatca gctatgaaac gccgcctaag 3120 atccaaggta tggtggccgc tgatcgacag agatgcagaa gaatttgtaa aaacatgcag 3180 agactgcctt atggtatccc agtcaataag accagccccg atgaacagaa atccatttcc 3240 aacgggtccg tggatatggg tggcgtcaga tctcttggga cccctgccta acaatgaatt 3300 catcttagtt tttatagact atttttctcg ctatatggag tttaagtttt tgcggtccat 3360 ctcatcgagc agtttgattg gagtcatgaa agaaatattt tgtagattgg gctatccaaa 3420 atacttacgt accgacaatg gacggcaata tattagcgat gaattctcag aatattgtaa 3480 gacatgcggc atagaacaag taagaacccc accatactgg cctcaggcta atggggaggt 3540 agaaaacatg aataaatcct tagtcaaacg actaaaaatt gcctactcaa attcaaaaaa 3600 ctataaggag gagatacaaa agtttgttgt aatgcataat gtgactcccc atggaaccac 3660 agggatagcc ccaaaccaat taatgtttaa taggataata cgcgacaaat taccagggat 3720 agaagaatta aatgaacagt tcacagattc tgcggaaaga gatagagata tcatccaaaa 3780 agaaaaaggg aaggaagcag cagataaaaa gagaggagct aagaatatcg atattaagat 3840 tggcgataaa gtgttggtac gaaacgtagt ttttccaaat aaactaacgc cgacttttga 3900 cccaactgag ttcgaggtat tagagagaca taaaaacatc gtgaagatag caggtggcgg 3960 caaaacatta atgaggaacg ttgcgcacgt aaagaagata ccaggcgacg tatatgtccc 4020 accagcgccc gagtcgggta ctctccaaac tattccagca ccgtttcagg aggccctaga 4080 agaccagccg tcggaagtac ctgctcaaca ctctcctccg aaggtgaaac ccctgaaact 4140 ccgtctcata aataaaggag gga 4163 // ID Copia-34_DPu-LTR repbase; DNA; INV; 290 BP. XX AC ACJG01007327; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-34_DPu_; KW Copia-34_DPu-I; Copia-34_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-290 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01007327; Positions 5929 5640. XX SQ Sequence 290 BP; 71 A; 68 C; 45 G; 106 T; 0 other; tgttgtgttt tatcaatcca agcgccagtt gctgaaacta atccactgtt gccaaaaccc 60 actccctctc ccgtcaaaac tcttggtcgt ctgctagact actcttgcct ctttctcttc 120 tcttttggaa tcaacgtgtc aggtactctc tagaaggtta agacaaatta tatgttttct 180 atgttctgtt ctctacttgc cgtgatattg atcctaagtt aatatacgtt ctatgcattg 240 atttgtgttc gtgtcactaa tatggtatat ctgaatcaca aatcacaaca 290 // ID Gypsy-5_IS-I repbase; DNA; INV; 4776 BP. XX AC ABJB010064655; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_IS_; KW Gypsy-5_IS-LTR; Gypsy-5_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4776 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010064655; Positions 2324 7099. XX CC Positions [3833-4312] - Integrase core CC 'ATAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1013..2587 FT /product="Gypsy-5_IS-I_2p" FT /translation="MLSCSCYIGKRYSRSLGAEPDITPQRPLISLTDQLTS FT VPRHPVMVENVGVVDALFDTGAAVNAIDMSFLSSDIKVFPDYSVLITAHGE FT SIPTGGKVKLKITVFGGTSVEDFVVIPNCVCPLILGYNWCAKTGLFSWKSS FT TACQETLASDQSACHEYALLRRFWSYIMTSWASYAVLLLLRLFWSNFVLLW FT ASSASYGASVLETLLHKFFQSSSVQIGETWDIHIAATTVLQPETAMWIKAR FT SNPPHLNGGIVCEASRISKPGYEVAVPRTLTTSIGGHFSILMMNLGHNSVT FT LQQGHYVGKSFPLTLTELLTAAIDQPGSSELPSKECAQIPWDLFQFGPTLA FT NHQIESLKELITTYWACVALTPNELGCTHLAEHNINRGDHAPIHQSLRRLA FT PVEHAVIRTQVDDMLNRGIVTPSNSPWSSPVVLVRKKDGTVRFCVDHRKVN FT SITKQDVYPLPRLDDSLDRLQGASFFTTLDLLSGYWQVPLNPDDAEKSASI FT TPGGLYQFTRLPFGLCNAPATFQRLTIEF" FT CDS 2608..3771 FT /product="Gypsy-5_IS-I_1p" FT /translation="MALVYLDDIIVYGTTFEEHNRRLELVLDALQRANLRI FT EPQKCFFGFEEVLYLGHAITKNGIKPDPAKLEAVSKYPRPSRFSEVKSFLG FT FASYFRRFIPQFASRATPLTLLLRKNNHWRWTINEERAFNDIKESLLQAPI FT LAHFNMEFPSFLHTDASGTGLGAVLLQEHPDKSQRVIAYASRRLTDAEQRY FT HSSELECLALIWAIQHFRPYLYGHRFTAITDNSALLWLQSKRDLTPKLARW FT ALLLQEYQLNICHRGGKYHTDADFLSRNPIAPAEELNKEEVQFCTPVFNLA FT ESQTSDPEVQEIMSRLRSPSNTAERRLRRAYRVCDGLLCHRGTQGWLSFVP FT TQLRTAVLHLCHDNLTAGHLGRDKTTKKILERYWWPRCRIHSVVH" FT CDS 3791..4735 FT /product="Gypsy-5_IS-I_3p" FT /translation="MASSLYHCGVQPVPLPKAPFEILGIDHLGPFPPTLSG FT NKYVLVAIDSLSKWVELRAVPSTAVDHIVLFLRQQVILRHGVPRVIVSDRA FT SSFVSRRFAREMETHGIHHSQAAPYHPQTSGLVERANRTISGILASYVNEQ FT HNNWDELVSYAAFAMNTAEQSTTEFTPFQIVYGRTPVLPCESGDNAVWKLR FT TEHSTSSSRRRFQAVRKKARDSMLRAQVSTQERTAPSFPPPPVRCGDWVLV FT RRPLRKKGLAEKLLPRYRGPCEVLLQLGPVTFQVRDCTGSPHHTNATFTAH FT RSHLKHYVPRSSRHTHHAPQLGP" XX SQ Sequence 4776 BP; 1215 A; 1338 C; 1081 G; 1142 T; 0 other; atttggtggc agcggtggga tgccgaccga aacccgacaa ggtcctttgc cggtacaagg 60 gacgccttcc ggtgagccac cctcgcaggc ccagcagcca gaggaaacca cttctcaatc 120 ggtggccgca ccaagcgaca ttccgacgac agcggtgaca ttggagacgc tgcaaaggct 180 gttggaactg cagactcaaa caatcatggc cgctacgcag caatcttcgg accttctaac 240 cgcaagtaat gccgacatct gccgtcgagt agctgctctg gagacgcttc agcaaacagc 300 gcaaccaact gtcttcgcag ttgagcagac tcctttcacg agccgcccga ctacatcttc 360 tggtcaaacc cgtctcaagt taaagccact taatggtgaa acacccttgg atctatacct 420 acagcaggtt gacatcattt ccgatgcaaa ccactggact gaagacgaga aggcacgcgc 480 tgtcatcgcg caactagatg gccctgccct cagcgttctt catgcgctcc aaggaccggc 540 tcacttacgc agcattgatc acgagcttgc gcgatcgttt cggtgatgaa catctgcagc 600 aagcattcta cgcagaatta cgttcgcggc gccaagaagc aagcgagagt tttccagaac 660 tcgctgcaag tatagagcgt ttgacctaca aatccttccc gggcgcaacg tctgatacat 720 tgaatcgtgt cggcagagcc gcgtttgtgg atggcatagg caaccctgag gtccagaagt 780 ttgcaagact agcaagacca gaagcaatac gagccgcact cggacatgcc atggagatga 840 gcgctgctca atgggctacc agccgcatgc aaccagcagt acctcctgga gttacagtgc 900 aaccattgac aattccacag gacacgcgct ctcagtctgc gtctgctcgt tcacggaaca 960 ataccgctgg gtcgtgctat cgatgtggcc gtcgtggtca ttttgcggct gaatgctcag 1020 ctgctcctgc tacatcggga aacgctactc ccggtccttg ggggcggaac cggatataac 1080 tcctcaaagg cccctcatat cacttacaga ccagttaacg tcggttcccc gccatcctgt 1140 aatggttgaa aacgtggggg ttgtggacgc actttttgac actggagcag ctgtgaatgc 1200 tattgatatg tcctttcttt cctcggacat taaagtgttc cctgactact cagttctaat 1260 aactgctcat ggggaaagca taccaaccgg aggaaaagtg aaactaaaaa ttacagtatt 1320 tggtggcaca tcagttgaag actttgttgt aattccaaac tgtgtgtgtc ctttaattct 1380 tgggtacaac tggtgtgcca aaaccggctt attttcttgg aaatcttcta cggcctgtca 1440 agaaactctg gcatcagatc aaagcgcgtg tcatgagtat gcgctacttc gacgcttttg 1500 gtcgtatatt atgacttcct gggcatctta tgcggttctt ttgctgcttc gactcttctg 1560 gagtaacttc gtacttctct gggcatcgtc tgcatcttat ggagcaagtg tattggaaac 1620 tctgctacat aagttttttc aatcatcctc agttcaaata ggagaaacgt gggatattca 1680 tattgcggct acaacagtgc tgcaaccaga gacagccatg tggatcaaag cgcggtcaaa 1740 tccaccccat cttaacggtg gcattgtctg tgaggcttcc cggatttcaa aacctggcta 1800 cgaagtggca gtacctcgaa cgctgactac aagtataggc ggccatttca gcatcctgat 1860 gatgaatctc gggcataact ccgtcactct tcagcaagga cactatgtcg gcaagtcgtt 1920 tcctctcact ctaacagaac ttctaacagc ggctatcgat caaccaggat cgtcagaact 1980 accctcaaaa gaatgcgcac aaataccatg ggatctgttc cagtttggtc caaccctcgc 2040 caatcatcag attgaatccc tcaaagaact aatcacaacc tactgggcat gtgtggcact 2100 aactccgaac gaactcggat gcacacatct agcggaacac aacattaata ggggcgatca 2160 tgctcccatt caccagtcct tacgtcgcct tgctccagta gagcatgccg tcatccgaac 2220 acaggtcgac gacatgctta atagaggaat cgtcacacca tccaacagcc catggtcgtc 2280 accggtcgtc ttggtacgca agaaagatgg gactgtacgt ttttgtgttg atcacaggaa 2340 ggtcaacagc atcacgaaac aggacgtata tcccctacct cgccttgatg attcactaga 2400 cagactacaa ggcgcaagtt tcttcacaac acttgatctc ttgtcaggat attggcaagt 2460 acctctgaac cccgacgatg ccgaaaaatc ggcgtcaata actcctggcg ggctttacca 2520 atttacgcga ttaccattcg gcctttgcaa cgcgccagct accttccaac gccttacgat 2580 agagttctag gccacctcaa gtggactatg gctctggtat atcttgacga cattattgta 2640 tatgggacga cgtttgaaga acacaaccga agactcgagc ttgttcttga cgcgctacaa 2700 cgagcgaact tacgaatcga acctcaaaaa tgcttttttg gttttgaaga agttctctat 2760 ctcggtcatg ctatcaccaa gaatggcatc aaacccgacc ctgcaaagct ggaagcggtg 2820 tcaaagtatc caagaccaag ccgcttttca gaggtcaaga gctttctcgg ctttgcatcc 2880 tattttcgcc ggtttatacc acaattcgcc agtcgtgcaa cacccctgac actcctactt 2940 cgtaaaaaca accattggcg gtggacaata aacgaggaac gtgccttcaa cgacattaag 3000 gaaagccttc ttcaggcacc aatcttagcc cattttaaca tggaattccc atcttttctt 3060 cacaccgacg ccagcggaac tggtcttgga gctgtattac tccaagaaca ccccgacaaa 3120 tctcaacgag taatcgctta cgccagtcgc cgccttaccg atgccgaaca acgctaccat 3180 tcatcggagc tcgaatgcct agctctcatc tgggctattc aacactttcg gccctacttg 3240 tatggtcatc gtttcacagc cattacagac aactctgcgt tactttggct acagtcgaaa 3300 cgagacctta cccccaagct tgctcgctgg gctttactac tccaagagta tcagctgaat 3360 atctgccatc gaggtggaaa atatcacacg gacgccgatt ttctctcacg caatcctata 3420 gccccagctg aagaactgaa caaagaagag gttcagtttt gcacccctgt gtttaacctt 3480 gctgaatctc aaacttccga cccagaagtt caagaaatta tgtcgcgact gagatctcct 3540 tctaatacag cggaacgacg tctacgccgc gcctatcgtg tctgcgatgg gctcctctgc 3600 caccgcggga cgcaaggttg gttgtccttc gtgcccacgc agttacgtac cgccgtgctc 3660 cacctgtgtc acgacaacct aaccgctgga catttaggaa gagacaaaac tacaaagaag 3720 atcttggagc gctactggtg gccgcgttgc cgaatacata gcgtcgtgca ttaaatgtca 3780 gtcacggaaa atggcgtcca gcctgtacca ctgtggcgtc cagcctgtac cactgcctaa 3840 agcacccttt gaaattctcg gcatcgacca tctcggcccg ttccccccta ctctaagtgg 3900 taacaagtat gttcttgtcg ccatcgattc cctaagtaaa tgggtagagt tacgtgcagt 3960 tccaagcacc gcggtcgacc acatcgtgct attcttacga caacaagtta ttcttcggca 4020 tggtgtacct cgcgtcatcg tgagtgatcg cgcctcatct ttcgtttccc gtcggtttgc 4080 gagagaaatg gaaactcatg gtatccatca ctctcaagcg gcgccatacc atccacaaac 4140 aagtggtctt gttgaacggg ccaaccgcac catcagtggc atactcgcga gctacgtgaa 4200 cgaacaacat aacaactggg acgaactggt gtcatacgca gcctttgcaa tgaacactgc 4260 tgaacaatca acaaccgagt ttaccccttt ccaaatagtg tacggacgga caccagttct 4320 gccctgcgaa tccggtgaca atgccgtttg gaagctaaga acagagcact ctacaagtag 4380 ctctcgacgc cgcttccaag ccgtgcgaaa gaaagctcgc gattccatgt tgcgtgctca 4440 ggtaagcact caagaacgca ctgctccatc attcccccca cctcctgtac gttgtgggga 4500 ctgggttttg gtacgccgac cccttcgaaa gaaaggcctg gctgaaaagt tactcccacg 4560 ctacagaggc ccctgtgagg ttcttcttca gctgggtcct gtgacatttc aagttcgtga 4620 ttgtactggc agtccccatc ataccaatgc aacattcacg gcacatcgta gccaccttaa 4680 gcactacgtc ccacgttcgt cgagacacac acaccatgca cctcaactcg gtccatgaac 4740 tcgatcatta tcgattcata ggggggaggg ggcgag 4776 // ID Ginger2-N2_AP repbase; DNA; INV; 1692 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.12, Last updated, Version 2) XX DE Nonautonomous Ginger2 DNA transposon from Acyrthosiphon pisum. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Nonautonomous; KW integrase; Ginger; Ginger2; Ginger2-N2_AP. XX NM Ginger2-N2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1692 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TIR is 88-bp long. XX SQ Sequence 1692 BP; 640 A; 230 C; 291 G; 530 T; 1 other; tgttacggta aatgatgtta tattggaaat taacgttatt cgccggttat ttacgttatt 60 tacctacgaa tgtgggaatt atcgttattt tctagtgaat aacgttaata accgtatgat 120 ttgggaaatg ttgattttat tttaaaataa tgtccaacga gcaatgataa ggtgatcgct 180 aaactaattg tggtactaat ttggtaaacc aataaaactt tatcggccac agataaggtg 240 gacactgggc ctatacaata tataattatt actttagtat gtgtacaact gtcgtcgtat 300 ttaccactta tttaaaatgg ctatcgatat tcactgttcg cacaactttc aacgaaaaaa 360 tcaaaaatat tcttacgagc aaacgagatg ataataactc atttttgtct actaaagact 420 acaatgatgt tattgaacaa gttaaaaaat caaaaatgtg tttaaaaact gtcggtgaag 480 ctaaaacaat gaaggactat cgtgtagtac gtaagtacga tattttaaaa ataaatggaa 540 aagaacgcct tattagacca gttgacgaaa aaaatgttgt attgtattac gttaaaattg 600 gtagaacgcc atatgaagcg atgtttggct gcacggctag aattggttta atgtcgtcta 660 atcttccaaa cgatgaaata aaagaggtca ttacggaaga agatttagaa aaaataacca 720 atgaaccaat caccgaagaa gacgaaatcg gaaatgaaat catagaaatc gcagaaaaaa 780 attcggaact tggtacttaa atatttttta attatttaac aataaatttc cgaattctaa 840 atataacaca aataaaacta aaatataaat atttaaaaat aaattatttt taatattttt 900 tagatgatcg tcaagaaaat atttgtattg cacgtaaaaa ttcaaaaaaa aatttagaaa 960 aacaaggaga gaaaatgatg aaactttcta aagaaaagtt tcctcagctt gaaataggca 1020 caactgtacg tgtaactata ccggacgttg atagagctcg tggttctccg cgaaatatac 1080 tggcagttgt cacccaagtt gaacatgact tatacaaact ctgtaagtta attcttaatg 1140 aaaaaatata ttattaaaat taaaaatgtt ttaattattt tataggtacg gaacatggtt 1200 ttcttaaata taattacaca cgccaagaaa tagctgtttg cgaagaaaat ctattagaca 1260 tggacaatgt gataaaatta gcaggtgata gacagttgac actgagggaa gcagctagta 1320 atagttcggt tgcgggccct caagggtacc agcgttgcca ttgcaagact ggttgtaata 1380 ataaacgttg tgcctgccgt ggggctgaaa aattatgtaa tagtaaatgt catgggagta 1440 caaattgtaa aaacaaataa aaaataataa tgcatttttt ctatttttaa aaatacctag 1500 gtacatcatt tttaaaattg tagtwggtat ttgagattca cgtgcatatc tggtctataa 1560 caacaattac ctgtgtattt ggtcattaac gtgattatcc ggtttataac gataattacc 1620 tgtgtatttg gtaaataacg ttaataaccg gcgaataacg ttaatttcca tttaacataa 1680 tttaccgtaa ca 1692 // ID BEL-47_AA-LTR repbase; DNA; INV; 186 BP. XX AC supercont1.248; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-47_AA_; KW BEL-47_AA-I; BEL-47_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.248; Positions 1214175 1213990. XX SQ Sequence 186 BP; 63 A; 39 C; 30 G; 54 T; 0 other; tgttgggaac caccctaccc aatctctcct catagttggt gctgacaggc atcgctacca 60 aaattttttg cgtttcaata tttcctctag ttcttaagaa aaaattgcga gaagaagaat 120 tacaattgta atagaaactg gccatatgcc tgcagaatca ttccataact agaataaaaa 180 tgtaca 186 // ID Gypsy-24_DWil-LTR repbase; DNA; INV; 168 BP. XX AC scaffold_181148; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_DWil_; KW Gypsy-24_DWil-I; Gypsy-24_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-168 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181148; Positions 4237137 4237304. XX SQ Sequence 168 BP; 49 A; 39 C; 30 G; 50 T; 0 other; tgttgtgtat gcagccctat cgtacgtaag ccatctggca actctgctgg cttgacgcca 60 tgaacagttc agccggtcgt tcgcagcaag cacacgcgtt ataaacttgc gatatatata 120 tatgtaattt caatatcacc taatcaataa acattctaag atatttca 168 // ID AMPLICON_AA repbase; DNA; INV; 2680 BP. XX AC AF065437; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 01-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE Aedes albopictus type I dihydrofolate reductase amplicon repeat DE region. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW type I dihydrofolate reductase; amplicon repeat region; KW AMPLICON_AA. XX NM AMPLICON_AA. XX OS Aedes albopictus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Wang H.Z. and Fallon M.A.; RT "Similarities to a LINE element shared by Anopheline and Culicine RT mosquitos map to the distal end of dihydrofolate reductase RT amplicons in Aedes albopictus mosquito cells."; RL Insect Biochem. Mol. Biol 28(8), 613-623 (1998). XX DR Genbank; AF065437; Positions 1 2680. XX SQ Sequence 2680 BP; 791 A; 515 C; 669 G; 704 T; 1 other; gaattcgaat atcggtgtcg gataaggggg agctctaggg gtccatcaga ctaccagagt 60 acagtactgt cggtgaggaa aaaacggtag aaaatggttt tctccgaaga gagttttttt 120 tttaatctat acgagatgct cgaggaggac ctgcaagcac tcggagtttt cgagcgacgc 180 gtgctaagta cggtcttcgg cggtgtgcag gaaaacggtg tgtggcggag aaggatgaac 240 cacgagctca ctgcacttta cggcgaaccc agcatccaga aggtggccaa agccggaagg 300 atacggtggg cagggcatgt cgaaagaatg ccggacaaca accctgcaaa gctggtgttt 360 gcaactgatc cggttggcac aagaaggcgt ggagcgcaga gagcacgatg ggtggaccag 420 gtggagcgtg acttggcgag catcgggcgc gaccgacgat ggagagcggc agtcacgaac 480 cgtgttatta tggagaaata ttgttgattc agttttatct tgaatttgat gtaatactaa 540 ataaatgaat gaatgaatga cgagataccc aaattcggaa gtcaggcact ctatacacat 600 ctgtgagctt ccgcgtgtga aagagtgctt gttcagtgag aatttcaata gtggatttat 660 tgatttcgaa aagaatcatt atttccaaat gactcgtgcg atagcggggg gtttcaaaga 720 ttgtgtatgt ttgaatgccg ctgctaaagt agttcgggtt cggaaacggt gctgaaattc 780 gtttcgagag tgtaggaatg tgctgtctat ttcgcgtttt cacgtagaac ggtcaatggc 840 gctacgcctt ccagtagttt ttcattgtta agttcctatg actgcagttt tgtttcgtgt 900 tttttcacgt atgtggctat taaactacca ccatgccatc agtgattcgc gagggactag 960 agaatcaccc cgtgtggctc aacactcgta attccgtggg agctaaagtg acgagtgttc 1020 aaatgatgac tgactgcaag cgcaacggtg gcgggacgaa aaggaagcct gactgcacaa 1080 tgaagacgtt gccttggcga tggcagcgct accttggatt ggaaggattc cggtgagaat 1140 gttccgatta cttttattac aacaaaatga actaaatacg cgccagtttc aaagattttt 1200 tttgaatcag gaagtgtgtt tgcattacca ttatccaaat gataacggta atgtacacat 1260 gcggggctta ttgtaagtga aagtgacttc tgaatggacg tgtctctcca gccattctga 1320 aatgggtctc tgaatctgca atagagctat caccttctaa ctcccggact aaagcttttt 1380 gcaaaagcat caaaatgata ttgcagaaaa ctttcttcca taataccact gccggacagt 1440 gggggttagt tggatcacac acacacacac aggcactcta tacacatctg tacgcgaatc 1500 gaatgcataa ttccaaaagt taggtatctc gcaaagattt taagaaaaaa aataattttc 1560 ggtggagggc agtcctaaac gacgacgccg agagcactgt caagtatgtg gtacgaagtc 1620 aacagaacta gcgatacgac taggaatgta gagcaatttt tgaagtgaag gactaatgga 1680 atgtgaggaa atagaatgtg ctgtgccgtt ctcgaaataa cacgtaagtt ctaccagaag 1740 ctcaacgcat tccgcaaaag cttggtgcca caagccgaaa tctgccagaa taagaacagg 1800 agcctcttga tgtaaaaaat gaggtgattg atatgtggaa gcagcactcc gacgagcacc 1860 tgaacggcga anagaatgta gaatgtaggt gtcaaaaagc agacaaaaat gctacacaat 1920 gggatgagct gggcctagca gtcggggtat gatcttcacg taatccgacc aatttgtctg 1980 ttttgtggat ggcatggata taattgctag aacatttaga acggtggctg aactgtacac 2040 ccgcttgaaa tgtgaagcaa aagtcgcagt gtggtggcga atgccgtttt caataaacat 2100 ggaagttcta cccggagatc aacgcgcccc gctatggctt catgccgcga gctaagctgt 2160 acagggataa ttgacggatg tacgcgagac gattgaaaga tggaagcagt acttcgacga 2220 gcacctgcca ccgtatatta ttgattgcaa aagaacgtgg gtgattgcgt tacaatggac 2280 agatgtttgt cattcgctag gtgttgcaga aatgccgcat ttataatgtg tccatgcgtc 2340 acctgttcat cgtgcatttg tttttttttt ttgtataagt tttaaaactc tcacccctac 2400 ttgatcacta atttgaatcc taaaaatatg tttctataaa tgtgaagaaa caggttgata 2460 cccgattctt ggtttgcacg ggtcgttttc tttaagctct caactcatat ttaactaatt 2520 gacaattttt gtacaaaaaa tcaagtcaat aggttcaaca ttgactaagt tgtagcagaa 2580 aaatgacacc tgaaaaaatc acgatctcca aataaaaaat cttattagta tccagtctct 2640 tataccaact gactgaaagg acggaacgac actggaatca 2680 // ID Copia-8_DPu-LTR repbase; DNA; INV; 358 BP. XX AC scaffold_728; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_DPu_; KW Copia-8_DPu-LTR; Copia-8_DPu-I. XX NM Copia-8_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-358 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 680-680 (2010). XX DR Genome; scaffold_728; Positions 4569 4926. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 358 BP; 109 A; 68 C; 70 G; 111 T; 0 other; tgttttgatt tattataaac accaacatga tacgcacgat cgattggttg cggaaaccaa 60 gggggttgca gaaccaactc tgtgtgacgc agcaagacga atgaagaacg tcatcaacgc 120 gagaagttcg tctgtattta accaagcacg acaaaagtct atctgttcct tcttcttgtt 180 cgttgttgct tcgagtaaca atacatgatc tatcggtcca gatcttacaa acctaattgt 240 gtgcgactta tctttattca actttgagtg taagtactaa gacaattatc attgctatta 300 ttactgtgtg tatcaaattg agagattgag taacttactg gaatacgata tgccaaca 358 // ID Crack-12_AAe repbase; DNA; INV; 6168 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-12_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6168 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1228-1228 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 365..1504 FT /product="Crack-12_AAe_1p" FT /translation="MSELENEAVCYVCEKVEPNLARLISCAYCGRFAHFRC FT KKLFGNVITKVKKNPYLCSVECSEMNSHTNQKTVSFDAVVAELHTLNQGMS FT ESKKEASQLRYVVEQSRLQLAALVKTSGKIEESQQFLSNQFDTLQSDFRQF FT KLDMDTIKTENVKIRSEVGEWQRKHHDLSSKVDSLELELDKANRQLLAKNA FT VVLGLPSMECENTRGLIFQVCRAIGFSLEDTAVTAARRLSGNSANKEGAPI FT LVTFREEKLKQDFFDHKRKHGVLEASIVSEAFKGSTNRVSVRDEMTAFGRE FT LLRYTKEVQLSLGFKYVWPGRNGKVLIRRQDGGKVEQIGTKQDLKLLTSTS FT SKRPLDASIGGVHSMDSSPLASPRTSPQIQQLNKRTR" FT CDS 1551..4433 FT /product="Crack-12_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MENLYVDNVEFLNKNYGRMNNYCSLRILQFNIRGMNN FT PNKFDSLKEFIDLYTGPIDVLVIGETWLKADRIQLFRLEGYSATFSCRDDM FT AGGGLAVYVRNGINAEVLTNINECGFHHIHVCLKLRSSAFDVHAVYRPPCF FT QQSLFNEKLEGICSCIKNDSSCVIVGDVNIPVNQTESRVVREYLDLLNCYN FT FAVTNTIPTRPASANILDHVVCSENTMGCVVNETMPWEISDHCAILSTFRL FT DHPIQLIQLQKTIVNHTRLNEAFLMAMQNPVGDSAREKMNYAVESYRTYKQ FT RFSKVISINARVKGQCPWMSFDLWKLMRIKNNVLRRSRRRPDDVRLRELLA FT HISRKVQLEKHRSKSFYYTRLFENSNQKSVWRNLDQVLGRKQGINDCIKLT FT VDGDLTSHGSTVADKFNEFFSAIGPQLASSISSNRDINKFGTLQTVRQSIF FT LRPASIQEVIVKINKLNINKSPGHDEIPASFIKTHHAIFASLVADTFNESI FT SSGIFPENLKIARVVPIHKSGNKSEVNNYRPISILSVLSKLLEQLLADRVS FT SFLDEQKVIYDHQFGFRAGSSTWTAASELVDDIYRAMDNRCIAGVLFLDLK FT KAFDTIDHDVLLRKLEYYGIRGTANALFRSYLTARSQYVSVNGATSSKREI FT TVGVPQGSNLGPLLFLLYINDLAKLPLHGKPRLFADDTSVTYTATDPACLI FT NLMKEDILKLQMFFAENLLSLNLGKTNYMIFHSRRKKIDRHAELVVGSTVV FT EKVEVFKYLGLIFDSTLSWSSHVDKLRSDISSYCGLFWRVSKFLPLKQVIT FT MYQAFVHSKLQYLVSIWAAASKTILKPLQSTQNRCLKIVYGKPRLYSTINL FT YTNAALSISPILALKELQTVVMMKNILYNPRIHHNFTLQRAHHPHGTRNRN FT NLFVTRRNTEAGKKSFTYNGFLRYNALPAALKSEMNLMKFKAAAKRYVKSN FT VNLYLI" XX SQ Sequence 6168 BP; 1761 A; 1383 C; 1306 G; 1717 T; 1 other; ctggmaacac tgtatcagcc aaaactaggt ggttaagctc cgttctaatt atctgtttat 60 tggttgtaaa attacatcta aaaagttaat accagtgata ctttggaggg caataaattt 120 cgctggggtc cgtgacttga tcgcaccgga agcataagta taacattatt tatacgcatt 180 ttatgagttc acaatcgaag aaggctatcg acatctttta acccacatcc aaatcattcg 240 tgtagtggtg ttataagtct ctaaatgact ggtgctaacg gtcgttcgat tggtcctacc 300 tgagctgact tgtggtcttg attgagaata ttacctctac gttgtttata actacagtaa 360 tacgatgtca gagttggaga atgaagctgt ttgttatgtt tgtgagaaag tggagccaaa 420 cctggctcgt ctcataagtt gtgcatactg tggaaggttt gcgcattttc gatgcaaaaa 480 gctatttggg aatgtgatta ccaaagtgaa gaaaaatcct tacttatgct ctgttgaatg 540 ctcggagatg aactcgcata ccaatcagaa aactgtaagt tttgatgctg ttgttgctga 600 gttacatacg ctgaatcaag gaatgagtga gtcgaagaag gaggcatctc agttgcgata 660 cgttgtggag caatctcggt tacagctagc agccttggta aagaccagcg gaaaaattga 720 agaatcccag cagtttttat cgaatcaatt cgacaccctt cagtctgact ttcgtcaatt 780 caagttggat atggacacga ttaagacaga gaacgtcaaa atacgcagcg aagtcggcga 840 atggcaaaga aagcaccatg atctctcttc caaggtagac tctttagagc tggaactaga 900 caaagctaat cgacaactgt tggccaagaa tgctgtcgta ttaggtctcc cttcaatgga 960 atgtgagaat accagaggac ttatcttcca agtatgtcgt gctattggat tctcactcga 1020 ggacacggca gttacagcag ctaggaggct ctcgggaaat tccgcgaaca aagaaggtgc 1080 tcccatactg gtaacatttc gtgaagaaaa gctgaagcag gattttttcg accataaacg 1140 gaaacacgga gtactggaag catcgattgt ttcagaagcg ttcaaagggt cgactaatcg 1200 agtatcggtt cgggatgaaa tgactgcttt cggacgtgag ctactgcgtt acaccaaaga 1260 ggttcaattg tcgctgggct tcaagtacgt ttggcctgga agaaatggca aggtgctcat 1320 tagacgtcag gatgggggca aagtagagca aattggaacc aagcaagatc taaaactgct 1380 tacaagcaca tcttcaaaac gcccgctcga tgcttcaatt ggcggagtgc actcaatgga 1440 ttcttcgcca ttggcttcac ctcggacgtc gccgcagatt cagcaattaa ataaacgcac 1500 tcgttaaaag tgaatttcta ttatttgtat cctgatttat aattaatgaa atggagaatt 1560 tatatgttga taacgttgaa tttttaaata aaaactatgg tcgaatgaat aattattgtt 1620 ccttgcgaat attgcaattt aatataaggg ggatgaacaa tcccaacaag ttcgattctt 1680 tgaaggaatt tattgatttg tacacggggc ctattgatgt gttggtaatt ggagaaactt 1740 ggttgaaggc ggatcggata caactgttta ggctagaggg gtacagcgca acgttttcct 1800 gccgtgatga catggccggc ggtggattgg ccgtatatgt tcggaacggg atcaacgccg 1860 aagtattgac caacatcaat gaatgcggtt tccatcatat tcatgtttgt ctaaagctgc 1920 ggagttctgc gtttgatgtg catgctgtat acagaccacc ctgttttcaa caatctctgt 1980 tcaacgaaaa gctggaagga atttgctcgt gtataaagaa tgatagttct tgcgttattg 2040 taggggacgt aaacattcca gtcaatcaaa ccgaatccag agtcgtacgc gaatacttgg 2100 acttactgaa ttgttataac ttcgcagtaa caaatacaat tccaacgaga cctgctagcg 2160 ctaacatttt ggatcacgta gtttgctcgg aaaatacaat gggctgtgta gtcaacgaaa 2220 caatgccgtg ggaaatcagc gatcattgtg ctattctttc tacattccga ctcgaccatc 2280 caatacaact tattcagctt cagaaaacca ttgtgaatca cacaagactc aatgaggcgt 2340 tcttaatggc catgcaaaat cctgttggtg attcagctcg tgaaaaaatg aattatgctg 2400 tggaatcata ccgtacgtat aagcaacgtt tttctaaggt catatcaatc aatgctcggg 2460 ttaaaggtca atgtccatgg atgagctttg atctctggaa gttgatgcga ataaaaaaca 2520 atgtcctaag gagaagccgg agaagaccag atgatgtacg attaagggaa ctgctcgcac 2580 atatctcacg taaagttcaa ctggagaagc atcgttctaa aagcttctat tacactagat 2640 tattcgagaa ctcgaaccaa aaaagcgtat ggcgaaacct cgatcaagtg ttgggccgta 2700 aacaaggaat taatgactgc ataaaactga ctgttgacgg cgatttgacg agtcacggat 2760 ctacggttgc agacaagttt aacgaattct tcagtgcgat cggaccccag cttgcctcct 2820 ctatttccag taaccgcgat atcaacaagt ttggaacatt acaaacagtc aggcagtcta 2880 tttttcttcg gcccgcctca atacaagaag taatcgtcaa gattaacaaa ttaaatatca 2940 ataaaagtcc cggccatgac gagatcccag cctccttcat aaagacgcac cacgccatat 3000 ttgcttcttt agtagcagac acgttcaacg aatctatttc ctcaggaatt ttccccgaaa 3060 atctgaagat tgcacgtgtt gttccaattc acaaatcagg caataaatcg gaagtaaata 3120 actatcgacc gatatcgatc ctatcggttt taagtaaact tctcgagcaa ctcttagccg 3180 acagagtttc cagttttttg gacgaacaaa aggtaatcta cgatcaccaa ttcggtttta 3240 gagctggttc aagtacctgg acggctgcga gtgagcttgt tgacgatatc tatcgggcaa 3300 tggataatcg ttgtattgcc ggagtcttgt tcctggattt gaaaaaagcc ttcgatacca 3360 ttgaccatga cgtactattg agaaaattag agtactatgg aattcgtggt actgctaacg 3420 cattgttcag gagctatcta accgcaagat cacaatacgt atcagtcaat ggcgcaacta 3480 gttctaagag agaaattaca gttggagttc ctcaggggag taatttagga cccttactgt 3540 ttctgcttta tataaatgat ttggcaaagc taccattgca cgggaaaccg agacttttcg 3600 cggacgatac gtctgttaca tataccgcaa ccgatcctgc ctgcctcata aatctgatga 3660 aggaagatat cctgaagctg caaatgttct ttgcggaaaa tctactgtcg ttaaacctcg 3720 gaaagacgaa ctatatgata ttccactcgc ggcggaaaaa aattgaccgc catgcagaac 3780 tagtagtcgg atctactgtt gttgagaaag tcgaggtgtt taaatacctt gggttgatct 3840 tcgactctac cttgagctgg agctcccacg tcgataaact gcgaagcgat atcagttcgt 3900 attgtggtct cttctggaga gtttccaagt ttcttccatt gaaacaagtc attactatgt 3960 atcaagcgtt tgttcattct aaacttcaat accttgtttc tatatgggct gctgcatcaa 4020 aaactattct gaaaccactg cagagtacac aaaatcgatg cctcaaaatt gtatacggca 4080 aaccccgact atattcaaca ataaatctgt acacaaatgc agctctttca atcagtccta 4140 tattagcatt aaaagagctg caaaccgtgg ttatgatgaa aaacatcctc tacaacccac 4200 ggatccacca caacttcact cttcagcgag ctcatcaccc gcatggaacg aggaatcgaa 4260 ataacttatt cgttactcgt cgtaatacgg aagctgggaa gaaatccttc acgtacaatg 4320 gattcctgcg atataacgct ttacccgcgg cgttgaaatc ggaaatgaat ctaatgaaat 4380 ttaaggctgc tgcaaaacga tacgttaaaa gcaatgtaaa cctgtacttg atttagcttg 4440 actaaattag ctccaacttc atcaatacgt ttccgtatcc acttctaaga catgcacttc 4500 gcccagaagt tactccagtt ttgcagtaat gtctccagcc tttggatagc actcaacagc 4560 aataccattg cgctgttagc tcttagtcta cagcgacata gtttttcgtt gtcctttatc 4620 ctggtgtcct tggtcagcag tagcaccagt ttccctgtca ggtaacatag actggtatat 4680 gtctggccga agccagtgat actacgccca taacggagtt cgttacgctt ccagaaccac 4740 gcctgcaatt cgcccagaag ttactccagt tttgcagtaa tttctccagt aattggtgtc 4800 cttggtcagc agtagcacca gtttccctgc cagcaatgtc tccagcactc agcagcaaca 4860 acatttcgcc gttagctctt ggactacagc gacaccgttt ttcgacgaat tttatcctgg 4920 tgtccttggt cagcagtagc accagtttcc ctgtcagcct acagcaacac cgtttttcgt 4980 cgtagtttat cctggtgtcc ttggtcagca gtagcaccag tttcccttcc aggtaacaga 5040 aactggtata tgtctggccg aagccagtga tactacgccc ataacggagt cctttaacgc 5100 ttccagaacc tcgccacgtc ttcgtacaga acgctcagca gcaacacgtt tcccgttagc 5160 attggctcag cgcagcagta acacagtttc cctgtcagcc tacagcaaca ccgtttttcg 5220 tcgtagttta tccaggtgtc cttggtcagc agtagcacca gtttccctgc caggtagcag 5280 aaccggttat gtctgcgaag ccagtgatac tcgcccgtaa cggagtcctt tcgcttccag 5340 aacctcgcca cgtcttcgtc ggcactcagc agcaacaaca tttcgccgtt agctcttgga 5400 ctacagcgac agcagtagca ccagtttccc tgtcagccta cagcaacacc gtttttcgtc 5460 gtagtttatc ctggtgtcct tggtcagcag tagcaccagt ttcccttcca ggtaacagaa 5520 actggtatat gtctggccga agccagtgat actacgccca taacggagtc ctttaacgct 5580 tccagaacct cgccacgtct tcgtacagaa cactcagcag caacaacatt tcgccgttag 5640 ctcttggact acagcggcgg cccagtttcc cgcgcctaca gcaacaccgt ttttcgtcgt 5700 agtttatcct ggtgtccttg gtcagcagta gcaccagttt cccttccagt aacaccaatt 5760 tccctgtcag caatgtctcc ggcactcagc agcaacaaca tttcgccgtt agctcttgga 5820 ctacaacgac accgtttttc gtcgtttttc atccaggtat ccttggtcag cagtagcacc 5880 agtttccctg tcagcgagac atcatttcaa cggcacatag tcaacagctc atcctaccta 5940 attcaattta tataaagttt ataatctatc taattagtaa gcttgtatcc tgtagttata 6000 gtaggcgctt ccttaaaaga gaacatattc tcactggaag tgcacatgat ttgtatatat 6060 atatatcgaa aagaagagta ggttttatgc cttttggaga agaggataca cttgatcttc 6120 actcccaagg gcttttccct gctccaaata aataaataaa taaataaa 6168 // ID hAT-45_SM repbase; DNA; INV; 2946 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-45_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2946 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1848-1848 (2009). XX DR [1] (Consensus) XX CC The ORF is broken by stop codons. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 2946 BP; 1047 A; 421 C; 462 G; 1016 T; 0 other; cagtgtttct caacagccgg tccgcgaaag ttttagggtt ctcacatgaa attcggaaaa 60 tttcaagtga tgataacgaa actttttaat aattattata cttttgctaa caaaaaattt 120 taactaaata attaaatgac ttttttaatt tcattcaaat taatattatg tgttacttca 180 taacatcagt tcattatcaa ctacatattt taatcttcac agtacatcca gtaactaaaa 240 tttaatttac gtgtgactag aaaacactct gaggccgagc gcaaacgtcg aaatgcaacg 300 agcgatcggt tgagtcaaat gcaaactaca cgagtttgca aagaagggga aggggaaagt 360 ttcggtgcgc agttgttact acttttccgt tatattttat tattgtagtg tgcgatttga 420 cggcgtgtgg atggaagaaa aagtatattt gctgaaatta ttattgttta tatatgatat 480 atgattatta aacaattcat tattgtttat gcatataaaa cgttataact taattctttt 540 aaattaaata ggtattgaga attgaattta aataatttat gatcaaaatt attaattatt 600 aatttatttt tagtattttt aaaaatgagt aataagagaa taccaagaac aattgaaatt 660 tttttcaaaa aacatgctca agtacttcga atacaagtga aattattaat aattaccaaa 720 atgattcacc acaggcaaaa cgaaaatgta tttttgttcg tcaatacaat aatgattatt 780 taaaatttgg ttttattctt tggcaaaaat cggaagaaaa attacctaag ccacaatgtg 840 ttatttgcaa ggacgtttta tcaaatgaat gtatgaaacc agcaaaatta ctacgacact 900 taaaaaccaa acatccaacc ttacaaaata agcccctgga ttattttgaa cgccaaagcc 960 ttgactttaa acatcaaaca tcttctttaa aaaatgcttt ttagtcgaca agtctctagt 1020 aaagtcttca tatttagtgg cattaagaat cgcaaaatgc aagaagccat actctatcgc 1080 cgaaaattta atcaaacctt gtttgataga cgtatgctct gaattactgg gccccacagc 1140 cggagataaa atgaaaagtt tgcctttttc caatgataca attcaacgaa gaatcgttga 1200 attagcaact gatgttgaag atcaactcat tcaaaaagta aacaaatcat tgtttgctat 1260 tcagatggac gaatctactg atatttctaa caaagcaata cttttatgtt acgttcgtta 1320 tattgatcac gaaataaatg agattaaaga agaaatactt tgttgtcttg aacttcctag 1380 tcacacaact agcttagaaa tatttaaagc tctcaacaca tacttcgaaa ataaatcaat 1440 aaattggaaa aattgtattg gtttatgtac tgatggtgcg gcaagcatga ctgggaaaca 1500 ttcgggagtc gcagcaaaaa ttttaaaggt aggtgcagtt ggaatgatat ttactcattg 1560 cttcatacac cgacagcatt tagttggaaa aaaatgtctc cagacctgaa ctgcgtactt 1620 tctcaatctg tacgaatcat aaattatatc aagagtaacg cattaaattc aagattattt 1680 tctattttat gtaatgaaat gggatcgggg catgaaaatt tattacttca tgcagaagta 1740 cgatggctgt cgaggggaag aatattacaa agactctttg aattacgaaa tgaagttgaa 1800 tcctttttaa atacaaaaaa ttcagaatta gcctcttttt tccgtaacga cgaatggatt 1860 gcaaaattgg catatttgtc ggacatattt tcattattca acgaattaaa tttaagtctt 1920 caaggaaata gcattactat ttttaattta tgggataaaa tcgatgcatt caaaaaaaat 1980 agaattatgg cgcaagcgta ttgaagaaaa aaactacaga gtgttccact ctctcgaaga 2040 attattaaat acagtgaatg tgaataccga gtctttatca aatattattt ctgaacatct 2100 agcagcgatt tcttacgcgt tcgaagacta cttttcacag aaagatgatg cgcgaaaggg 2160 aaatatgtgg attataaacc catttgtcga acatggaaat aatgacttga ctgattcgga 2220 agaggaaaaa ttagtagagt taagttgcga ctcttcatta aaattacaat ttaactcgaa 2280 agataaaatt caattttgga ttcaagttaa aaatgagtat gcagatttac atacaaaagc 2340 aatgagagtg ctgctaccat tttcaacaac atatttgtgt gaatctgcat tctccgcaat 2400 gacgcttata aaaaccaagc aaagaaataa tttggaaatt tgcccagctt tacgcctagc 2460 tataacaaat atagctccca gaatcgaaag tttatgcaat gtaaaaaatc agcagccatc 2520 ccattgatga tttttttatt tgtggaattt attcatttaa atgtgatttt tctctatttt 2580 tttattctct ataatttttt taaatgaaaa ttatttatat attgtctatt ttctgtgatt 2640 tattagaaat gcagcagttt tgtataattt taagattact tttgtgttgt acttctttat 2700 tataatatta ttatatgttt tttgcaatat ttattgttat tcttttatgt acattttttt 2760 gtgtaaatat attgtttatt ttttgtaaac tttgtgtgaa tttttacaac tgaaatatat 2820 gtatttaata gtacattaaa tattttgtat tcctgcaaaa tataaccaac aattagtatt 2880 agtggtccgc ggtaaaaatt tagattgttt actggtccgc ggcacgtttt tggttgagaa 2940 acactg 2946 // ID Kolobok-1_NVi repbase; DNA; INV; 3222 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3222 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 768-768 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 709..2463 FT /product="Kolobok-1_NVi_1p" FT /translation="MAKRRDVRGRFASTHPDKKYSRGRNKKRPRGFFGSKN FT NEDDQVEKRPRLSASAKKISTSTVPDAASTKNRLIDVSILYSELQDVLCCG FT VCHGNVHFSESSVQGLAFKIDVVCEQCGQISAINSCKKVGPRSNACEVNRR FT SIIALRALGHGHAGLSTFCGIMDLQKPVAQSAYDTIRNQLSEVCQKVAEDS FT MKAAVQEEMEATGSNQLDVSGDGSWRKKGFSSLQGLASIIGNKTAKVLDVA FT IKNSFCKACDNWSGKEDTIPYQEWYEEHEPNCNANHQGSAGKMEVDGIVQM FT FQRSKEEYGVMYKNYIGDGDSKVYKAVTEAQPYGSRLTVAKKECIGHVQKR FT MGTRLRNLKKSLGRQTLSDGKGIGGRGRLTAKVIDRLSSYYGKAIRYHPDS FT VEDMSNAIWATFYHQLSTDAKPQHHLCPKGASSWCKWQKAKVHKTLKNFKH FT KSSIPSAIMEKIKPIYEDLTEPNLLERCVGGFTQNSNESFNNSVWKLVPKT FT SFSGLSVLNIGVYTAVTCFNDGRTGFLKIMEQFEVMPGKAACEWVESSNDS FT RLFHAERKKLASTKEARVTRRKAREELQDDAYGAGEN*" XX SQ Sequence 3222 BP; 1076 A; 504 C; 697 G; 945 T; 0 other; gggggcacaa cacctttgaa atcatgaaaa aatcgatttt tttttattcc gtattttgat 60 aggaaatttc ccgtagaata tgttcctatt gtcgttattg cgctaaaaaa attttttttc 120 aaagttgtac agcaaacgct aaaagtactt tttatctgtt ttttcataaa tttacgaaat 180 tcagttaatt cattttcaaa taaataacaa ttattatttt aagttccagt gagtaaaaaa 240 ttgaaaaaat tgacaaaatt tatgttatat atatgttaaa aaattaattt ttatgtttaa 300 accgatatat aggcgatatt attcattgat aaaaaagaac gaaaaatcta ctgtttgaat 360 ttttttgaga ctgtcttgtg actgtcatca ttcctttcgt tccgattcca catgtctaca 420 cctacgctgc acgggtatag tgttgtatat actgaacagc tgatacatgt acacaaatac 480 agctattatc agctatcagt atatgtcagt tttttttgcc cgtcgttaat taattaacaa 540 ttgtcagtca gtgtaaatgt ggattatttt ttgcgcgttc gtgagtgaat ctgcgagtgt 600 gataggtgcg tttatatagg ttaagttttt tcaaaaagtg caagtgtcag tgaaagtgtt 660 atgtgataag tttaaattaa ataaattaaa tagtgtaagt gtagcataat ggctaaaaga 720 agagatgtca gaggaagatt tgccagcact catccagaca aaaaatattc tcgaggaagg 780 aataagaaga ggccaagagg attttttggc tctaaaaaca atgaagatga tcaagtggag 840 aagagaccaa gactttcagc atctgcgaag aaaatcagca catcaacagt tccagatgct 900 gcttctacaa aaaataggct tattgatgtc agcattttgt acagtgaact tcaagatgtt 960 ttgtgttgtg gtgtctgtca cggcaatgta cattttagtg aatcaagtgt acaagggctt 1020 gctttcaaaa ttgatgttgt ctgtgaacag tgtggtcaga taagtgcgat aaactcttgc 1080 aagaaagtgg gtccaagaag caatgcatgt gaagtgaaca gaagaagcat tatcgctctg 1140 cgtgcacttg gccatgggca cgctggattg tctacctttt gtggcatcat ggatctgcaa 1200 aagccagtag cacaatctgc ttacgataca attaggaatc aattaagtga agtgtgtcaa 1260 aaagtagctg aggactctat gaaggctgca gttcaagagg agatggaagc tacaggaagc 1320 aatcaattag atgtttctgg cgatggctcc tggaggaaga agggcttttc atctttacaa 1380 ggcctcgcca gcatcattgg aaataaaaca gcaaaggttt tggatgtcgc catcaagaac 1440 agcttctgca aagcatgtga taactggtca ggcaaagaag acacgattcc atatcaagaa 1500 tggtacgaag agcatgaacc aaattgtaat gctaaccatc aaggcagtgc aggaaaaatg 1560 gaagtggatg gaatcgtcca gatgtttcaa cgttcaaaag aagagtacgg tgtaatgtac 1620 aaaaattata ttggtgatgg tgactcgaaa gtgtacaaag cagtgacaga agcgcaaccc 1680 tatggcagca ggttgaccgt tgcaaaaaaa gagtgtatcg gtcatgtgca gaagaggatg 1740 ggtacgcgtc tcaggaattt gaagaaatca ctaggacgac agacactctc ggatggaaaa 1800 ggaattggag gaagaggacg actgactgcc aaagtgatcg acagactgtc ttcatactac 1860 ggcaaagcaa tcaggtatca ccctgattcc gtggaagaca tgtccaatgc aatctgggca 1920 accttctatc accagctgtc aacagacgca aagcctcagc accacttgtg tccgaaagga 1980 gccagcagct ggtgtaaatg gcagaaagcc aaagtgcaca agactttgaa gaattttaag 2040 cacaaaagca gcattccatc tgctattatg gagaagatta agcctattta tgaagattta 2100 acggaaccaa accttctgga aagatgtgtt ggaggtttca cacagaattc taatgaaagt 2160 ttcaacaatt ctgtgtggaa gctggttcca aaaacatcat ttagtggctt gtcagtgctc 2220 aatattggag tgtacacagc tgttacgtgc ttcaatgatg gtcgtacggg ctttctgaag 2280 attatggagc agtttgaagt catgccagga aaagctgctt gtgaatgggt agaaagcagc 2340 aatgacagca gattattcca cgcggagagg aaaaaactgg caagcaccaa ggaggcacgc 2400 gttaccagga gaaaagctcg agaggaacta caggatgatg catatggagc aggtgaaaat 2460 taaattttgt aagtacttac atctaatatt attaatattt tcattttata ctttcaagta 2520 tttcttttta aaagttattc ttatcacttt tcagcaaatc ttggacagta agtgggttac 2580 cacagctgcc tcataaagtt gaggttatga tggacaggag acacaaaatg ccttgtgact 2640 acggtgaaaa aaagataaga gctttgaaga aattttatta ataaaaacat gcttttaaat 2700 gaaatgtact aatttaaact ttttgtgttc tagatacatc tacaccccaa caggagtttt 2760 tatgtgtcca gatgcgacag tctggtacac aatgttatat tttgatggat atcaatgatt 2820 ggagcaataa aaaaatgata gagataagta ttacggtgat tatgtataag tattgtaagt 2880 caaaacttta atgctgattt tctcctaatt gtgtttttct caatttcagg tgctgcaccg 2940 taagattcca aaggaacgga tggaccaaat gagctcaaat ttggtgatat tgtttaaaag 3000 tttgagctca tttgggccat tcgttccttt ggaatcttac ggtcagtgaa ttatggacac 3060 cctactatac gtaggtatta tagaggtata tcgaaggtct aaaacaataa aaataaaggt 3120 aattatatgt aagaatagca aaaaaatttg gtttatttat ataagaaatt ataaactttt 3180 aatggaaaat tttgattgta atttcaaagg tgttgtgccc cc 3222 // ID Gypsy-67_AA-LTR repbase; DNA; INV; 293 BP. XX AC supercont1.238; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-67_AA_; KW Gypsy-67_AA-I; Gypsy-67_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-293 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.238; Positions 142869 143161. XX SQ Sequence 293 BP; 86 A; 62 C; 53 G; 92 T; 0 other; tgttgtgaac cctattagca ttgcagaatc catcaactgt atactgtggt aaattagcaa 60 tagtagtagt gtatagaaat agtcatgacc ttgaacgtag attattctct cacactttcc 120 ccacgctcaa tagattgtaa aacaatgcct tttgttctgc accctataaa agaccatgcg 180 tccccatgat cggtctcttt tgccgatcga ccgtggagta agtatcacga tagccgtaat 240 aataaattag tgattagtga aagtcctgtg tttcatttct gcacgacatt aca 293 // ID Copia-1_DPer-I repbase; DNA; INV; 4739 BP. XX AC super_0; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DPer_; KW Copia-1_DPer-LTR; Copia-1_DPer-I. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4739 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_0; Positions 11771780 11767042. XX CC Positions [1837-2046] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 831..1946 FT /product="Copia-1_DPer-I_2p" FT /translation="MKKDCYQFIRLNQNKENYKHNKQAQLAASGHGFAFMM FT RKMKKSDQINNELGFILDSGATDHLINDDSLYADCIELEQPIKIAVAKEGE FT FLYATKRGIVRLHNGHNITLEDVLYCKEAAGNLMSVKSLQQAGMSIKFDCG FT GVSISKNGVSVVKHTGMLDIPVVKFQAYSVDARNKNNYRLWHERLGHISNA FT KLLEINNNNLFNDSNLLVNLKISDEVCESCLNGKQARLPFKQSKDKSYIKR FT PLFVIHSDVCGPIKPVTLDDKNYFVIFVDQFTHYCVTYLLKYKSDVFNVFK FT DFVAKSEAHFNFKIVNLYIDNGREYLSNEMREFCVNKGISYHLTVPYTSVE FT WCFGANDKNHYRESSCYGKWCKIRQKFLG" FT CDS 1876..4386 FT /product="Copia-1_DPer-I_1p" FT /translation="MIRTITEKARAMVNGAKLDKSFWDEAVLTATYLINRI FT PSKALGDCKMTPYELWHNRKPNLKFLRVFGSTVYVHNKIKKGQFDEKSFKA FT ILIGYEQNGLKLWDVIKGKVIVARDVVVDETNMLYSREVKFETDFPKESKE FT RSQQEFLNESKESDQLNFPNDSTECGILNESKECDQFYLPNDSTECGILNE FT SKECDQFYLPNDSTECGILNESKECDQFNFPNESTECEILNESKECEKQNF FT PNESRKRKTDFLNDSKELYIPNESKDCEKADQPDNCREENKDTNIISRKSE FT RLKSKPKVSYKEHDKSFNKFILNAHIMFDKVPNSFDEINFRNDKSDWEEAI FT NTELNAHEINNTWTIIKKPENKNIVDSRWVFSIKYNELGVPIKYKARLVAR FT GFTQKYQVDYEETFAPVARIASFRFVLSLAVQFNLKVHQMDVKTAFLNGTL FT SEEIYMRPPQGVLCGAGHVCKLNKAIYGLKQAARCWFQVFEQALKGFDFVN FT SPVDRCIYILDRGDIKKNIYVLLYVDDVVIATRVMDRMNNFKNYLSERFRM FT TDLNEIKHFIGIRVEMFEDKIYLSQAEYVKRILNKFNMDSCNSVSTPLLNK FT LNYELLNSGEDCDAPCRNLIGCLMYIMLCTRPDLATAVNILSRYSSKNNAE FT LWQCLKRVIRYLKGTIDMKLIFKKNTSFENTLVGYVDSDWGSNEIDRKSTT FT GYLFKMFDSNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKSLLNSI FT NLKLDRPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFSREQIETNEISL FT EYINTDNQLADIFTKPLPAGRFAELRNKLGLLQEE" XX SQ Sequence 4739 BP; 1711 A; 657 C; 952 G; 1419 T; 0 other; ataggttatg ggcccagtcc acgccaattt aaataaatta aattaataac aaatagttgt 60 gaaattagtt ccgcgtgcgt tcttgtgtag ttttttttta tagtgtgtgt gtgtataata 120 tatacataca tagaaaatgg aaagaaataa gcaacatatt aagccttttt gtggcgagaa 180 atattctata tggaaattta gaatcagagc tttgtttgct gaattagagg tgctccaagt 240 agtcgataac ataatgccta acgaagtaga tgataattgg aaaaaggcag agcgatgtgc 300 aaagagcatt ttaattcaat atttaggcga cgcatttcta aacttcgcgg caggcgacat 360 ttcggcacgt gatattctga agaagccttg cgtcgcaatt agctgcacgt aaacgtttaa 420 taacactcaa actctcaagt gaaatgacac tttcgagtca ttttgttact tttgatgaaa 480 cagtgagtga gttgttaggg gcgggcgcaa aaatggatga gatggacaaa gtggcacatt 540 tgctgcagac tctaccttca acctataatg gtgtcatcac ggcaatcgaa actttgtcag 600 atgaaactct atcattagcg tttgtaaaaa ataaattgtt cgactacgag accaaattaa 660 gaaatgaaag tactgataca agtagtaaag tcatgaaggc agttgtatga aataataaca 720 attttaataa tcgtaaaacg tttggatcta gaaacacacg gcaaaagata ttatttaagg 780 gaaaagctag aagtaatatt aaatgccatc actgtggcaa agaagggcac atgaaaaaag 840 actgttacca gtttataagg ttaaatcaaa acaaagaaaa ttataaacat aataaacaag 900 ctcagcttgc agcatcgggc cacgggttcg catttatgat gcggaaaatg aaaaagagtg 960 atcagataaa taatgaactc ggttttatac tcgattctgg tgcaacggat catttgatca 1020 atgatgactc gctgtacgcc gattgcatcg aactggagca gccaattaaa atcgctgtgg 1080 caaaagaagg cgagttttta tatgccacta agcgtggaat tgtgcggctg cacaatggac 1140 acaatatcac actggaggat gtgctctact gcaaggaggc agcaggaaac ctcatgtctg 1200 tcaagagtct gcaacaggca gggatgtcaa ttaagtttga ctgtggtggt gtctccatat 1260 caaaaaatgg agtaagtgtt gttaaacata caggtatgtt ggatatacct gtagtcaaat 1320 tccaagcata cagtgtagat gctaggaaca aaaataatta tcgattatgg catgagaggt 1380 tggggcatat aagcaatgca aaattgcttg aaataaataa taataattta tttaatgaca 1440 gtaatcttct tgttaattta aaaatatcag atgaagtttg tgagtcatgt ttaaatggga 1500 aacaggctag gttgccattt aagcaatcta aagataaaag ttatataaaa cgacccttgt 1560 ttgtaataca ttcagatgta tgtggcccca ttaaaccagt tacactagac gataaaaatt 1620 attttgtgat cttcgtagat caatttacac attattgtgt aacctatttg ttaaaatata 1680 aatcagatgt atttaatgta tttaaagatt ttgtagcaaa gagcgaggct cattttaatt 1740 tcaagattgt caatttgtat attgacaacg gtagagaata cttgtcgaat gaaatgcgtg 1800 agttctgcgt taataaagga ataagttatc atttgacagt gccatacacc tcagttgaat 1860 ggtgtttcgg agcgaatgat aagaaccatt acagagaaag ctcgtgctat ggtaaatggt 1920 gcaaaattag acaaaagttt ttgggatgaa gctgtgctaa ctgcaacata tctaatcaac 1980 agaattccaa gcaaagcact tggtgattgt aaaatgaccc catatgagct gtggcacaat 2040 agaaagccca atttaaaatt tttaagagtg ttcggttcaa ccgtgtatgt gcacaacaaa 2100 attaaaaaag gtcaatttga tgaaaaatca ttcaaagcta tcttgatagg gtatgaacaa 2160 aatggactca aattgtggga tgtcattaaa ggaaaagtta ttgttgcaag agatgttgtc 2220 gtagacgaga ctaatatgct ttattctaga gaagtaaaat ttgaaacaga tttcccgaaa 2280 gagagtaagg aaagaagtca acaagaattc ctgaatgaga gtaaggaatc tgatcagctt 2340 aatttcccga atgatagtac ggaatgtgga atcctgaatg agagtaagga atgtgatcag 2400 ttttatctcc cgaatgatag tacggaatgt ggaatcctga atgagagtaa ggaatgtgat 2460 cagttttatc tcccgaatga tagtacggaa tgtggaatcc tgaatgagag taaggaatgt 2520 gatcagttta atttcccgaa tgaaagtacg gaatgtgaaa tcctgaatga gagtaaggaa 2580 tgcgaaaagc agaattttcc gaatgaaagt aggaaaagaa aaactgattt cctgaatgat 2640 agtaaggaat tatatatccc gaatgagagt aaggattgtg aaaaagctga tcagccagat 2700 aattgtcgtg aagaaaataa ggatactaat ataattagta gaaaaagtga gaggttaaag 2760 tctaagccga aggtttctta taaagaacat gataaaagct ttaacaaatt tatattaaat 2820 gcccatataa tgtttgataa agttccaaat tcctttgatg aaataaattt taggaatgat 2880 aaatctgatt gggaggaggc aataaataca gagttaaatg cccatgaaat taataacacc 2940 tggacaatta taaagaagcc agaaaacaaa aatattgtag acagtagatg ggtattttcc 3000 attaaatata atgaactagg agtcccaata aaatacaaag ctagactggt tgcaagagga 3060 tttactcaaa agtaccaagt agactatgaa gaaacatttg ctcccgttgc tagaattgcc 3120 agttttaggt tcgtactatc attggcagtc caatttaatt tgaaagtcca tcagatggat 3180 gttaaaacag ctttcctcaa tggtacatta agtgaggaga tatatatgcg gcctcctcag 3240 ggcgtattat gcggtgctgg gcatgtatgc aaactgaaca aagccattta tgggctcaaa 3300 caagcagcta gatgctggtt tcaagtattt gagcaagcat tgaaaggatt cgattttgta 3360 aactctcccg tggatcgttg catatatatt cttgatagag gtgacataaa gaaaaacata 3420 tatgtattat tatatgttga tgatgtagta atagcgacaa gggttatgga tagaatgaat 3480 aatttcaaaa attatttaag cgaaagattt agaatgactg acttaaatga gataaaacat 3540 tttatcggta tcagagtaga aatgtttgaa gacaaaatat atttgagtca agctgaatat 3600 gtaaaaagaa ttttaaataa atttaacatg gatagttgca attcagtcag tactccatta 3660 ctaaataaac taaattatga gttacttaac tcaggtgaag actgcgatgc tccatgtcga 3720 aaccttattg gctgtttaat gtacataatg ctgtgtacac gaccagattt agctaccgct 3780 gtaaatatct tgagtagata tagtagcaag aacaatgctg agttatggca atgtttaaaa 3840 agggttatta gatatttaaa gggaactatt gatatgaagc ttatatttaa aaagaataca 3900 tcatttgaaa atacattagt tggctatgta gactccgatt ggggtagtaa tgagattgat 3960 agaaaaagta caacgggtta tttgtttaaa atgttcgatt ccaatttaat ttgttggaat 4020 acaaaaagac agaattcagt ggcagcttca tcaactgaag ctgaatatat ggccctattt 4080 gaagctgtaa gagaagcttt atggctaaag tctcttctaa atagtattaa tcttaaactc 4140 gacagaccca ttaaaattta tgaagacaac caaggctgta ttagcattgc aaataatccc 4200 tcatgtcata aaagagcaaa gcatattgat ataaaatatc atttttcaag agaacagata 4260 gaaactaatg agatttctct tgagtatatt aatacagaca accaacttgc tgacatattc 4320 acgaagccac tacctgctgg aagatttgcg gaactgcgga acaaattagg actactacaa 4380 gaagaatgat tccaatttaa tgaaaacatg aacatgaagc tttattaacg aatttattac 4440 attaattcat atttaatttt tttttttttt tttttttttt atttgaacca aaagaaccaa 4500 tgaattgtta aagtaattta attttatgtt aatttaattt taaagtgatc tgatcgggtt 4560 tttctgggtt ttccccgtat ccttagagca aatgctagat cacataacac ttcccagaat 4620 gcacaccaac cacattagaa taacaattga atgttacctt tttgatttaa tgatgaaaac 4680 tttttttata ttttttattt ttatgatttt gataattaat gttatttttg aggggggcg 4739 // ID Gypsy-26_IS-LTR repbase; DNA; INV; 265 BP. XX AC ABJB011015085; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_IS_; KW Gypsy-26_IS-I; Gypsy-26_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB011015085; Positions 3778 4042. XX SQ Sequence 265 BP; 66 A; 66 C; 80 G; 53 T; 0 other; tgccgtgtat tgtgtgcgtg ccgtgtattg tgtgcgtgtt cgccacgcac agcgaacacg 60 acctttggtc caggcaacgg cgcacgggct ggtgcagccg gagacgccgg cgagggagag 120 cgagccccgg tgaagaacgt gacgcctgcg cgaaagaaca gactgggcgc ggtcgcagac 180 tgaagaagac gttgtcaacc gaatgtaatc tattagttaa ataaatatat gtcctgcatt 240 accctaaaat tactggtaca ccaca 265 // ID hATm-55_HM repbase; DNA; INV; 3420 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-55_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3420 RA Bao W. and Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 389-389 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1053..3104 FT /product="hATm-55_HM_1p" FT /translation="MPMTTLQHAIARLEKIYNEWRNLQKASKRGGATQIAK FT ETAFVASFEDTFDISHVDAERLIQNPEDKLFLKAQLGNGRHSSSMAQIDRK FT LAVKLKRKHERQEAIKARTEREELRHAVSNGEAVVETFESDSEKSCNENVD FT SDHDSAVTPQTKRPRPKNIVSPQVAAALDRTKISDRNAVYALASFSVAVGQ FT NPSELALNRWSIRRNRMKHRDETILKLKESFGENIRDVPLVVHWDGKLLPD FT ISGKHEKIDRLPILVSGQGVERLLNVPKLPNGTGEAMANAVVTAIYDWNLA FT ENIKAMSFDTTASNSGIRSGAAVLIEQKMNKELLHLACRHHIFEVVLGDVF FT NSLAGPSTGPDILMFKRFQTAWPLIVRDRYEDGISDPETALVFSSNREMMK FT EAIDFIQNCLLTENQPRDDYRELLELSLIYLGHIPPRGVHFMAPGAMHRAR FT WMAKAIYSIKIFLFRSQFRMTKREINLSRFLNIFVCFVYLNSWFTASLSIQ FT APLNDLMCLKRLTQFRTINLNVANVAIKALSRHMWYLSEELIAFSFFDDRV FT DAATKYEMVNAMKTKTGNGIPAKRVQLRENNISSSELADFINSNTMRFFVI FT LGISTSWLELDSSLWNEDQVFIAAKHFVAGLSVVNDRAERGVALIQDFNQI FT LTKDEGQKQALLQVVSEHRKQFPDAKKSTVVKQD*" XX SQ Sequence 3420 BP; 1146 A; 599 C; 611 G; 1064 T; 0 other; tagggtggta gtcaaaaatt aaatgttcag attttgtctg ggacaacctc tagtcctgtt 60 cctattgatc tcatgaactc atgtgcaaaa tttcataaaa atattccttc attaaagggt 120 gccacatgcc aactgataaa atcacatgta cgcgatagaa aagatagccg tgtaaattaa 180 tgtgttgttg aatattattg aaatctttaa atacattttt ttatctgtaa atccattttg 240 aacacaacca gaagtctttt caatgaggtg tactctttat gttttttttg agaatctgac 300 cagttgttaa atttatcaac taattaatgt tctgtatatg aaaaatattt ataaatttcc 360 aaatacagat aaaaaaaaac agaaatttta cttcatagca tttttagaaa tttgttacat 420 tcaaatagtt aaatttttat agtgcttaat tattaattac ttagctgcag cccacttact 480 tattttttct atcccagaat tccttgtaat ttgccaccaa accgccattt tgtatgttta 540 tcactatcag actaccaaat taattatttt aactaaaaat ttttattttt aacaattttg 600 cagtggaata tctcaattgt gaaaagtcag tatacattta agttgttatt ttgtaatata 660 aattatgtaa actactttga attttatttt atataaattc atatctctaa actcaaaaaa 720 gatactgaat aatggatatc actgaatata tactaaatga ataaaatttt ataacttttg 780 caattcacat catgcttatt gcataacttt atttgtattt ttctagatta ataaatatca 840 gaaaatgtca caatcaaaac ctgcaactag atctgcaacc gaaatttacc ttctcggttt 900 cacaaatgcc tccattaatg gtactagact accgtcacga aagcaagcat tacagttttt 960 cttccatcag ttgaaaataa acaaaaaaaa ccgtgcgaga aagttcggac attactatta 1020 aaacagttgc tgatttttgg aacaaagctt acatgccaat gacaacattg caacatgcaa 1080 ttgcgaggct agagaaaata tacaacgagt ggagaaatct acaaaaggct tcaaaaagag 1140 gaggagcaac tcagatagcg aaagaaacag cctttgttgc atcatttgaa gatacgttcg 1200 acatttctca tgttgatgcc gagaggctta tacaaaatcc agaagacaaa ctttttctta 1260 aagctcagct tggaaatgga agacactcaa gttcaatggc acaaatagac agaaaattgg 1320 cagtcaagtt gaagcgtaaa catgagagac aagaggctat aaaagccaga acagaacgag 1380 aagaattgag acatgctgtc agcaacggtg aagctgtagt tgaaacattt gaatcagatt 1440 cagaaaaaag ttgcaatgag aacgtagatt ctgaccatga ctcagcagtg acaccgcaga 1500 caaaacgacc tcgtccaaaa aatattgtca gtccacaagt tgcagcagct ttagatcgaa 1560 cgaagatcag cgaccgaaat gcggtttatg ctttggcttc attctcagtt gctgttggtc 1620 aaaacccaag tgaactagct ctgaatcgct ggtcaattcg acgcaataga atgaaacata 1680 gagacgaaac aattttgaaa ttgaaggaaa gttttggtga aaatattagg gatgttccac 1740 tcgtagtgca ttgggacggc aagttactcc ctgatatatc aggaaaacat gaaaaaatag 1800 atcgcttgcc tattcttgtt tcgggtcaag gtgtagaacg tctattgaat gtgcctaagc 1860 tcccaaatgg cactggagag gcaatggcaa atgcagttgt aacagcaatc tacgactgga 1920 atcttgcaga gaatataaaa gccatgtctt ttgacactac agccagtaat tctgggattc 1980 gctctggtgc agcagttctt attgaacaaa agatgaataa agagctgctc catcttgcat 2040 gtcggcacca tatttttgaa gttgttcttg gagatgtgtt taactcatta gctggaccat 2100 ctactggacc agacattttg atgtttaaaa ggtttcaaac agcatggcca ttaatcgttc 2160 gtgacagata tgaagacggt atatcagatc ctgaaacagc tttagttttt agcagcaaca 2220 gagaaatgat gaaagaagct attgatttca ttcagaactg cttgctgaca gagaaccagc 2280 ctcgtgatga ttatagggaa cttttagaac tgtctttgat ctacctgggt catattcctc 2340 ctcgcggggt tcattttatg gctcctgggg ctatgcatcg ggccagatgg atggccaagg 2400 ctatttattc aatcaaaatt tttctcttca gatcccagtt tcgcatgacg aagcgtgaaa 2460 taaatttatc acgtttcttg aatatttttg tctgctttgt ctatctcaac tcatggttta 2520 ctgcttcgct gtccattcaa gctccactga atgatctcat gtgccttaaa cgactcacac 2580 aattcaggac aataaattta aatgttgcga atgtagcgat aaaggcattg tctcgtcata 2640 tgtggtacct cagtgaggaa ctgattgcct ttagtttttt tgatgatcgt gttgatgcag 2700 ctacaaaata cgagatggta aatgcaatga aaacaaaaac tggaaatgga attcctgcaa 2760 aacgtgttca actacgtgaa aacaacattt catcatcaga gcttgctgat ttcattaact 2820 cgaacaccat gaggtttttt gttatactgg gcatctcaac ttcttggctc gaattggatt 2880 catctttgtg gaacgaggac caagttttta ttgcagctaa gcattttgta gccggtttga 2940 gcgttgtgaa tgacagagca gagagaggag ttgctcttat ccaagacttt aaccaaattc 3000 taacaaaaga tgaaggacaa aagcaagcac ttctccaggt tgtgtctgag catcgaaaac 3060 agtttccaga tgccaagaaa tccactgttg ttaaacagga ctaaatagtc atttttccat 3120 tcaaaacaaa atgaaaaact aaaatgatat tgaaattata gttaattttt catggttttc 3180 aatgttcaat actttgttat ttaaaaactt ttgaattaaa atagtttata taaaaaaaat 3240 tttttaaagt gcttacagcc attagcgtca accttagtga aactttcaac cccgtgtggc 3300 accccttaat gatgaaatat ccttctcaaa ttttgcacat aagtttgtta tgctaaaagg 3360 aacaaaaata gaggttgtcc catcaattta ctttgaaaaa gttttttgag caccacccta 3420 // ID L1-1_HM repbase; DNA; INV; 5502 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5502 RA Bao W. and Jurka J.; RT "L1-like retrotransposon from Hydra magnipapillata."; RL Repbase Reports 8(12), 2070-2070 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 103..1128 FT /product="L1-1_HM_1p" FT /translation="MSKSYAGVLASPSQNGNPKPKLGIFNSKPVEENSYVL FT VCRYGDLNISINEFLAALAQKDLLTNVWGVTRHYAHKYFEFAVHNSTLIPL FT FLKSEIIFKNCVLTFQQKFVRKIHVLVQNIPIGILRSGCKDAITKLGNHLA FT LGRGQFDSFYCQSVKVHGEEIFTGNVIIIIKNWTNPYANPLPRFENVDGLT FT LRYKHNGQPESRLKTTEKPTDADTSQGKNSEVVESVKETDEEPSTQSSPKQ FT TEPEELEVLKSKKAVQKTQNEVVAQIVCDTKESTKPKRKSQNDLREGETPS FT KQVTPNQTQVMNDVEMEGTCSSEIAEEQNNDLIFDYEKYDLIKSAGEKS*" FT CDS 1509..5021 FT /product="L1-1_HM_2p" FT /translation="MTVNVLTCNINGLNDQEKRENFFGFLKKSSFDLILLQ FT ETHSGPSTVQQWGDEWTGESIWNSGPSHNSCGVAVLSKSNISLSELKRDTN FT GRILNVIIKFDEHEMQVMNIYAPNVPRERRHFFGDLVDFSQDTGLPVLLAG FT DFNMVESLPLDTEGTNNAKYHTYGVDELGVFQGKYNLVDVFRLKFPNKREF FT TWRNQKAKCRLDRIYCPEKIAKKTTYTKILTNPFSDHEFVTTTVNFSKTRR FT GPGYWKLNCSLLEIEVYRQKIELSIKDWQTKKRSYNSILKWWDDCKLFVRV FT ELQHISIQESAKNKKQIKRIENEIQRERQNANPNLDSIKEKRDALFLLHFP FT GAFIRTKQKLVEEGEKPSKFLYSLEKTQQRNKSITKVRKNDGTLTSEPGEI FT LNSLVSYYKNIYTKTNLCKDSQNYVLSHITKHLSDDENQLLNTRLTGAELK FT EALFKFENGKSPGYDGLPAEFYKTFWHILEKDFEELAYEILFVEKNTSHSM FT KRSIISLIPKQGDLTECKNWRPISLIGADYKIITKALALRLAKVMGKIIEP FT NQTCGIPGRTIFSNLHLVRDLIDYAEIKNLPSFILSIDQEKAFDKVDRAFL FT LQILQKFNLGENFISFINTLYSEISASVLNCGFLSATFPTERGVRQGDPLS FT LLLYVFVAEALALVVRADSRIEGFPLPGTSKPLKLQQYADDSNFFARDVKS FT VRFFFEAVKLFEKASGSVINASKTKGLALGGFDPKMYPELDNIEWTNNTGF FT KILGVTFYTRLSNTTNFNWLVTLNKIEKKLKFLGLRTLSLRGKTMLINTLA FT LSKVWFLANVLPVPEWVIERLHSAIFENLWQKTNFNPVKRETLFLPVKSGG FT LGILSPKEQSLALRLKTLFQLKECGEDNTHDNYYFSKYWLASSLIKFTNQN FT PSWNFLKQNNFPKHWDNKMSQYYKTTIEVLGKNGSLFNIPLKSTKNFYLEV FT AKNKKTVVPAELFWNGSTKTTMPWTQIWKKNFTSHACGPTQNILFRFLHNS FT LPSAALLAKSTRQQIVQNNKCKTCGKIEDNLHIFAYCPPSVEIWEYFKHVY FT NSLTSQTVFSPINSIFSIGTANLSEKNPTSQLLLTLTQTIMSEIWNSRCHH FT MFSNMKTDPMAVIRVIIRKIRIIINLKYNYHVRRNTTDIFCQLFCIKNIIC FT QVENGRLKLNI*" XX SQ Sequence 5502 BP; 1950 A; 1070 C; 941 G; 1536 T; 5 other; aaaaaaaaaa acggaaaaaa acacatactg aaacttttta gtccctgtgt aataaggtag 60 gcatcaatat taagcgatta ttgaagcgaa gaaaagttga atatgtccaa gagttatgct 120 ggtgttcttg cttcgccgag tcagaatggt aatccaaagc cgaaacttgg aatctttaat 180 tctaaaccgg ttgaagagaa yagctatgtt ttagtttgtc gttatggtga tttgaatatt 240 agcataaatg aatttctagc tgccttagct caaaaagatc tattaacaaa tgtctgggga 300 gtcacgcgtc attatgcaca taaatatttc gagtttgctg tccacaacag cacattgatt 360 cctttatttt taaaatctga aattattttt aaaaactgcg ttttaacctt tcaacaaaag 420 tttgtaagaa aaatacatgt tctggtacaa aacatcccga taggaattct tcgaagtgga 480 tgcaaagatg ccattacyaa acttggcaat catctagccc taggacgtgg acaatttgat 540 tccttttact gtcaaagcgt aaaagtccac ggtgaagaaa tttttacagg gaatgtcatc 600 attataatca aaaactggac aaacccatat gccaacccac ttccacgttt tgaaaatgtc 660 gacggtttaa cgttacgata caaacacaac ggacaaccgg aaagtcgact aaaaacaact 720 gaaaaaccta ctgatgctga tacatcccaa ggaaaaaatt cagaggtagt ggaaagtgtg 780 aaggaaaccg acgaagaacc atccactcaa tcttcaccta aacaaactga gcctgaagaa 840 ttagaggttt tgaaatccaa gaaagcagtt caaaaaactc aaaatgaagt agttgctcaa 900 atcgtctgcg atacaaaaga atcgacgaaa cccaaaagaa agagccaaaa cgatttgcga 960 gaaggcgaaa caccatcaaa acaagtcact ccaaatcaaa cacaggtaat gaacgacgtt 1020 gaaatggaag gaacgtgttc ttctgaaatc gctgaagaac aaaacaacga tttaattttc 1080 gactacgaaa aatatgatct cataaaaagt gctggagaaa aatcgtaagg tttcatatta 1140 tttccttttt actttttttt ctcctttcct ttctaaaaac catttttttc ttctaaaact 1200 ttgttttttt tctaaaactt tgattttttg ctttttcgat tctttcacgt tctattctat 1260 taaaatgcat gttctatttt taaaaccgcg cgtaagctgt cgtaatctgt attgttgtag 1320 ttcgcgcttt tgtgtaaaat gttatggtcc agtgcgtcca tctgttcgta atacgtcagt 1380 taaaccgaaa actcgtgctt taagtttagg ctttatatgt gcgtgttttt tatatcaggt 1440 tgtaaaatta tcatatttat gtttaagtaa cacgagtttt caaattagga aaactgattc 1500 tctccatcat gactgttaat gttttaacgt gcaacataaa cggtctaaac gaccaagaaa 1560 aacgcgaaaa tttttttggg tttctaaaaa aatcctcctt cgatctcatc ctcctacaag 1620 aaacacactc tggaccttcc actgttcaac agtggggaga cgaatggact ggcgagtcca 1680 tttggaactc tggtccaagt cataacagtt gcggggtggc ggttctctcg aaatccaaca 1740 tttctttaag cgaactaaaa agggacacta acggaagaat attaaatgtc ataataaaat 1800 ttgacgaaca cgaaatgcag gtgatgaata tttatgctcc taatgtgccg agggaaaggc 1860 gccatttttt tggcgacctt gttgatttct cgcaggacac aggactaccc gtcctgttgg 1920 ccggagactt caacatggtc gaaagcctgc ctcttgacac agaaggcacc aacaacgcta 1980 aatatcacac ctacggcgtg gacgaactcg gggtattcca aggtaagtat aacctggttg 2040 atgtttttcg tttaaaattc ccgaacaaaa gagaattcac atggcgaaat cagaaggcaa 2100 aatgtagact cgaccgaatc tactgccctg aaaaaatcgc aaaaaaaaca acatatacaa 2160 aaattctcac caaccccttc tccgaccacg agttcgtaac aacaaccgtc aattttagta 2220 aaaccaggag aggaccagga tactggaaat taaactgttc tcttttagaa atagaagtgt 2280 ataggcaaaa aattgaactc tcaattaaag attggcaaac taaaaaacgg tcctacaatt 2340 caattttaaa atggtgggac gactgcaaac tttttgttag ggtagaatta caacatatat 2400 ctattcagga aagtgcgaaa aacaaaaaac agatcaaaag gattgaaaac gaaatccarc 2460 gggagcgaca aaacgccaac ccgaatttag atagcatcaa agaaaaacgc gatgctcttt 2520 tcttgctcca tttcccagga gcgtttattc gcacgaagca aaaactagtt gaagaaggag 2580 aaaaaccttc aaaatttttg tacagtttgg aaaaaactca acagaggaac aagtcaatta 2640 ccaaagttcg caagaacgac ggaacattga cgagtgaacc tggtgagatt ttaaactccc 2700 ttgtttctta ttacaaaaac atctacacca aaacgaattt atgtaaagac agccaaaact 2760 atgtcttgtc acacatcact aaacatttga gtgatgatga aaaccaatta ttaaacaccc 2820 gcctaaccgg cgcggagtta aaagaagccc tttttaaatt tgaaaacgga aaatccccgg 2880 gctatgacgg acttccagct gaattttata aaactttttg gcacatttta gaaaaagatt 2940 ttgaagaact tgcttacgaa atacttttcg tagaaaaaaa caccagccac tctatgaaaa 3000 gatcaataat ttcactaatt cccaagcaag gagatctcac cgaatgtaaa aattggaggc 3060 ccatatcctt aatcggcgct gattacaaaa taatcacaaa agcgctggcc ctaagacttg 3120 caaaagtaat ggggaaaatt atagaaccaa atcaaacttg cggaattcca ggtagaacaa 3180 ttttctcaaa tttacaccta gtccgtgatc taatagatta cgccgaaatt aaaaatctac 3240 ctagtttcat tttgtcaata gatcaagaaa aggcgtttga caaagtcgat cgcgcgttcc 3300 tgctgcagat tctccaaaaa ttcaacctcg gagaaaattt tatatctttt attaacaccc 3360 tgtattcaga aatttcggca tctgttctga attgcggatt cttgtcagcc acttttccaa 3420 ccgaaagggg agtcagacaa ggagaccccc tctcactttt gttgtatgtc tttgtcgccg 3480 aagctcttgc tttggtggtc agagcggaca gcagaattga gggattccct ttaccaggaa 3540 catcgaaacc gcttaaacta cagcagtacg cagacgattc gaactttttt gccagggacg 3600 taaaatcagt ccgtttcttt tttgaagcag taaaactttt tgaaaaagca agtggttcag 3660 tgatcaatgc ctcaaaaacc aagggtcttg ctcttggggg ctttgatccc aaaatgtacc 3720 cggaattgga caacatagag tggactaata acaccggttt caaaatttta ggtgttacgt 3780 tttacaccag actgagtaat actaccaayt tcaactggct agtaactcta aacaaaatag 3840 agaaaaaact caagtttctg ggactacgta ctttgtcact taggggaaag actatgttga 3900 taaatacgtt agcattgtca aaagtgtggt ttctggcaaa cgttttgccc gttcccgaat 3960 gggtaataga acggttgcac agtgccattt tcgaaaatct gtggcaaaag accaatttca 4020 acccggtcaa aagagagacg cttttcttac ccgtcaaaag cgggggtctc ggaattttat 4080 caccgaagga gcaaagcctt gcgctccgcc tcaaaacact ctttcaactg aaggaatgcg 4140 gagaggataa cactcacgat aattactact tttcaaagta ctggttagca agttcactta 4200 ttaaattcac aaaccaaaac ccaagctgga actttttaaa acagaacaat tttccaaaac 4260 actgggacaa caaaatgtct cagtactaca aaacaactat tgaagtttta ggcaaaaacg 4320 ggtccctttt taacatcccc ctaaaatcaa caaaaaattt ttacttagaa gtggcaaaaa 4380 acaaaaagac agttgtacca gcagaacttt tctggaacgg ttcaacaaaa acaacaatgc 4440 cgtggaccca aatttggaaa aagaatttta cctcccacgc atgcggaccg actcaaaaca 4500 tcctctttcg atttttgcac aacagcctac cttctgcggc cttgctagcc aaaagcacaa 4560 ggcaacagat tgtccaaaac aacaaatgta aaacttgtgg taaaatcgag gacaaccttc 4620 acatctttgc ctactgtcct ccttcygttg aaatttggga atattttaaa catgtatata 4680 actccttgac gtcccaaacg gttttttcgc cgataaattc gattttctca atcggaaccg 4740 cgaatttatc ggaaaaaaat cctacctcgc aactgctgtt gacactcact caaaccatca 4800 tgagtgagat atggaacagc agatgccacc acatgttcag taacatgaaa actgacccaa 4860 tggcagtgat cagggtaata atcaggaaaa tcaggattat tattaattta aaatataatt 4920 atcatgtccg tagaaacact acggacattt tctgtcaact tttttgtatt aaaaatataa 4980 tatgtcaagt agagaacgga agactgaaac taaacatatg agctaaactg acaccctagc 5040 tcaaataaaa gaactatatg tttagtgagg ttttccattt ctctacgaat tattattttt 5100 acaacaaaat attattattt atgattaaaa aatcactgcc agcctttatt tatgagttat 5160 aatatccctt tttagatgtt ttctctgtag cgtacgtaat gtggtatttg tgtgaaagtc 5220 ttgtactttt ctgctgttat taagttataa tttcctttct ttaaataata acttctcgaa 5280 ttctttttaa aatatcttcc tctttaaatt ttccaaacct tcgatttttt ttccgcacaa 5340 tggctttcta ttataaaaac aaaaaaatac aaaaaaaaaa tataaaatca aaaataaaaa 5400 aaaaaatata aaatacaaaa acacaaaaaa atataaaatt taaaaaatca aaaaataatt 5460 aaaaaattaa aaaaaaaaaa aaaaaaaaat atatatatat at 5502 // ID hAT-72_HM repbase; DNA; INV; 5509 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-72_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5509 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 412-412 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 824..2692 FT /product="hAT-72_HM_1p" FT /translation="MVNDITKWVKECDKCQRMEKIRTVAPELKPIKVNGLW FT DFLGIDLIGPLPITKLGNKYILTITDLWSKYIEAFPIPEKSAFYVSKCLTT FT LFYRFGPPKKILSDQGREFVNSLNEQLFSLFQIKHLITSAYHPQTNGQDER FT TNQTIKKSLSKLSNDTQDNWDELLEAVLFGLRTCVQKSTKFTPFFLMFGRE FT ANLFSTLSLNNSDSGTNDNNIVESNAXDEQIQEKLGSYNKVVTEVNNNICH FT AQIKMKKYYASKQLKGWKSFTFKXGDQVLVRNYRKIGRKGSRMEHDWLGPG FT IISELKQTGATLTINGKVWKKAVSLSNMKPYIADRQVLSAELLLKEHDYCL FT KMTNQSYKKEDLKKRSSKKRHISELKCYDKDKKYKSGKMEITTSSLSDDKS FT ILKHKRLIYPAISLDNNFYSTLSAQSIKQCIDILNNPFGWLDDTLIDIAQS FT FLSHQFPKVGGFQSSCIFNSNNFGGFVSGKFVQIFNVRNSHWVLISNVTSD FT NGSSSVQYYDSLFTGFNKSTVPLLVHRVARSMLINEGIAFINLEVMRCQNQ FT DNGNDCGLHAIANATALCHGIDPSVILWEQNSMRSHFLKCVENRKLEMFPY FT ILLNDTKTCSISFQCDDLCPCALK*" XX SQ Sequence 5509 BP; 1900 A; 726 C; 806 G; 2032 T; 45 other; ccggtgttgc gttaaataag tagcccgtta aataagtaga caaacagttc tgcgcatgcg 60 ttttcacatg cgcagatgtg ctgttaagtg tccgttaggt wagtaacatt tatcgttaaa 120 taagtaacct cgttccgtta ggtacagtta aagttwattt ggatttcatt tttagccata 180 tttaactatt taattcattt aaagtttatt ataaaatgtc agctcacagt tattctgttg 240 aagatattaa gctttatctt tataaaggag aaataaattg ctctaaatca cagcgtaaat 300 catttttaaa atatgcaaaa aagttcaaac tatgcggtat gtaatcgttg tataataaat 360 tgttgaataa gtaataatta tttttaataa acaataatta tttttattga ataatcacta 420 acttattata tacttcagat ggtaagttgt actatgtaac agcgtcaaaa aacctgcaag 480 ttctatttaa tgacaatgaa aaaatgttag cctttagaga tgtccatgat tctaatcatg 540 gggctcatgt tggccttaac aatacaagag caaaattaaa aggttctttc tactggctag 600 gtaatattat tgtcttttta taatagtact tttttaattt ctataaggtt tttaagctct 660 tttaagtgtt ataccgttct tgaacttaat tgctaggcaa attaaattaa aacatgaatt 720 gatatcaatt catgatatca aaagatagag tacagctcta tttttttwaa tttggttatg 780 ttttggttca tgtcaaaytt ttgtttttgt tgaattgtta ggaatggtta atgacattac 840 caaatgggtt aaagaatgtg acaagtgtca aagaatggaa aaaataagaa ctgttgctcc 900 cgaattaaaa cctatcaagg ttaatggact gtgggatttt ttaggcatag acctaatcgg 960 acctcttcca attacaaagc ttggaaacaa atatatttta acaataactg atctttggag 1020 caaatatatt gaagcttttc ctattcctga aaagtcggca ttttatgttt ctaagtgcct 1080 tactactttg ttctataggt tyggtcctcc aaaaaaaatt ctatcagatc aaggtaggga 1140 gtttgtaaat agtttaaatg aacaattatt ctctttattt caaatcaaac acttgataac 1200 ctctgcctac caccctcaaa ccaatggaca agatgaaaga actaatcaga caattaagaa 1260 atctctctct aaattatcta atgataccca ggacaattgg gatgaattgt tagaagctgt 1320 tttatttggt ctgcgcacat gtgtgcaaaa gtcaactaaa tttacaccct tttttttaat 1380 gtttggcaga gaagcaaact tgttctctac tttatcactg aataatagtg attctggtac 1440 taatgacaat aacattgttg aaagtaatgc atyggatgag caaattcaag aaaaattggg 1500 ttcatataat aaagttgtta cagaagtaaa taacaatatt tgtcatgccc aaataaaaat 1560 gaaaaagtat tatgcctcaa aacaacttaa aggatggaaa tcatttacat ttaaarcagg 1620 cgatcaagtt ttagttagaa attacagaaa aataggtcgt aaaggaagtc ggatggaaca 1680 tgattggttg ggacctggaa tyatttcaga gttaaaacaa actggtgcta ctttaaccat 1740 caatggtaaa gtttggaaaa aagctgtttc tttatccaac atgaagcctt atatagctga 1800 taggcaagta ttgtcggctg aattgctttt aaaagaacat gattattgtt tgaaaatgac 1860 aaatcaatca tataaaaaag aagatttaaa aaaaaggagt tctaaaaaga ggcacatatc 1920 agaattgaaa tgttatgaca aagataaaaa atataaatct ggtaaaatgg aaatcactac 1980 atcatctcta tctgacgata aaagcatttt aaaacataaa agattgattt atcctgcaat 2040 ttctctagat aacaattttt attcaacatt atcagcacag tcaatcaaac aatgtataga 2100 tattttaaat aatccmtttg gttggctaga tgatacttta attgatattg cacaaagttt 2160 tctttcgcat cagtttccta aagtaggtgg gtttcaaagt tcttgtattt ttaattcaaa 2220 taattttggt ggttttgttt ctggaaagtt tgttcagata tttaatgtga ggaattcaca 2280 ttgggtatta ataagtaatg taacctctga taatggttca agttctgttc aatattatga 2340 ttcccttttt actggtttta ataaaagtac tgttccatta ctagtacaca gagttgcacg 2400 atcaatgttg ataaatgagg gtattgcatt cataaattta gaagtcatga gatgccagaa 2460 tcaagacaat ggaaatgatt gcgggcttca tgctattgct aatgctacag ctttatgtca 2520 tggcattgat ccaagtgtaa ttctttggga acaaaacagt atgagatctc attttttgaa 2580 gtgtgttgag aatagaaaac tagagatgtt tccttacatt ttattaaatg atacaaaaac 2640 ctgctcaata tcatttcaat gtgatgatct ctgtccttgc gcattaaaat aatgattctg 2700 ttttgtgtct ttacatcaag aaaatattta aattataagc ttgtacaata tatgtgaatg 2760 tttatttaca atatgtaata cgttgacaac atgtgatgaa acatcaagtt ttcagttatt 2820 ttttatgttt aaaaaactta tatcagaatc cctgtaaata taataatccc tataaaatat 2880 aataatccct ataaaaataa tataataaaa ttattggttt taatttttat tgtggatcgc 2940 tattttctga agtttgtgtt ttgtgaggac atttactgta tcaaaatatt gtaaataaat 3000 ctaaaaaatc aattgccttt ttttgaaaaa aaaaaggata gtattttaag tctagtatta 3060 tttgagtcag tactttagct tgttattaat ttagttttag attagacttt acttttgtta 3120 gactagagtt tgctcaattt tatattagag ttatatatag agtttttaaa ctgaatatat 3180 taaggaagct ttttaaaaat ttgtaaattg ttttttcaat tttaagtcac tttaaaattt 3240 attaaagtat cgctatttta aatatttaat caaaaacttt tagtttcatt tagatttgta 3300 atattcattt gtgtaatatt aagacacttg aggttgttga atccatttca ggttacaaga 3360 gtatttttaa ctaacttagt attacctttt tcttattatt tattttgaat wcaaagattt 3420 ttaaatgaaa tttttcaata aaatatttgc atatttaatt tactaattta aaaaaactta 3480 ataaatgcat actacattat tattttctag taaataattg tccttttttt tagaaatgac 3540 aagctaagct ccttggttac aagcggtaaa agataacctt ttaggtaaac tcaatctctg 3600 atgtttgttt gcaaagacaa gtaaattcca atctagcaga ggtaaacgtt aagggacaga 3660 tttaaaaaca aagtaaatgt aaattaattt ttagctaata aatttttata attttattat 3720 gttatcgtaa ttaattgtaa atctttttcc cgcataaatt tgtcttggat gttatcccat 3780 gtgttttttt cattggaatt aagagttatg gaagtttatt aaaggggtta taaaattatt 3840 ggattttgct aatttttgca atttttcatg aacttaccat tccagattaa ataatgcgat 3900 gaaagtaact gtttcattga gcattttcta tcagctgcaa aaatcttatt gattttttta 3960 aaaatattyc tgacgtaggc tgctttattt gtattgctgt tgctacaaaa caactcttta 4020 aaatatttac yttttcaata atttagtatg atgccttgga atcttaytat tgatttttta 4080 acaagaaact tttctaaagt aaaatkacaa aatttagatt aaaaaaactc attttaactt 4140 tcaattaata ttcctttwaa acataataac attacactta tctgaaayat tttcacacct 4200 tgtagtatta ttatgcgata tttgtttaat gtcaaactat cctattaaga tcttttgaaa 4260 tttgaataag katagggraa cccagagcaa gatgactaac gttttgttat tcttaatttt 4320 tgcaaagtat ttttcaaaak aaaataaaaa tgaaaaatyc tcagawcarg tcaggatatt 4380 gttaccatam caaatgaaaa cttagtaact acaaarcaga rttgttagtt cttggccttg 4440 gtcttgtctt aaggcyaaaa awaaagrctt tggtcttggt ttggccttga aacttttgtc 4500 cgtaaaaatt ttggccttga ccttaattgt ttcggcctag gccttcaaaa aaaaaataca 4560 taaaattaaa gaattaaagg ttttcattcc ttctttttaa gcatttttgt ttttagcttt 4620 catatatatc tacaagtatt tttcctattg acttcgtaga attttgcgca taaccttggt 4680 ttatttttat aacatgtggt ttattgtcac agcaattgtt accttcagtt ttgttaataa 4740 ttggccttaa cgtwagscaa aaatttaagt ctttggcctt gaaaatgttt gcgtaggcct 4800 tagccttaaa aatcttattc ttggccttga ccttaaaact cttggcctta tccttgactt 4860 tgtaactttt ggccatggcc atgctacttt tggccttgtt aacaacattg ctaccgagca 4920 ctttttctaa agtattgcaa aataaacacg cattgcaaaa agcatyccaa agaagratac 4980 gtcattagaa taacatctat cagcaattat cctaaccgta acatcatttt ttttttgatg 5040 gaaaaaaaaa acttatgcat ataaaacgac gttaccaatg tttaaacaaa aaaaaaaatg 5100 ttattttaaa tacaarctac aaaatgaaaa atatgcggtt tcrtttcyat ccatttayaa 5160 atgttttttt ttgtgatwga aaaattaatc gtgcctyaat tttttaaatt tatttttayt 5220 aamtaaagtt aatgatatag ataaagcgta actaaagtta atgmtataga taaascgtaa 5280 attggtttta aaataacatt gtaamactta ccttgacaac gtttttgttt caktttgaca 5340 taaagtttgg ctaaacgaaa tatgccttga ttgctcaatt tgtctacttt tttaacgagc 5400 tacttattta acgcatatag gcgttaaata agtatccgta tactaacgca tgcgcaaaya 5460 ttacattttg tctacttttt taacgggcta cttatttaac gcaacaccg 5509 // ID Gypsy-79_CQ-LTR repbase; DNA; INV; 185 BP. XX AC AAWU01022223; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-79_CQ_; KW Gypsy-79_CQ-I; Gypsy-79_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 538-538 (2011). XX DR GenBank; AAWU01022223; Positions 1588 1772. XX SQ Sequence 185 BP; 50 A; 46 C; 38 G; 51 T; 0 other; tgtaggacga tcgaggcacc cctcggttcc gaacccgagg tctagagaga aaattttgtg 60 ctgctctcgg cagttagcac tcgaccgcgt accgacgtag acgtggaata aaccgcaatt 120 aaatcggaca aataaagcta cgtttttgta ttttccatta attctaaact cttccctttt 180 ctaca 185 // ID Gypsy-101_AA-LTR repbase; DNA; INV; 1754 BP. XX AC supercont1.55; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-101_AA_; KW Gypsy-101_AA-I; Gypsy-101_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1754 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.55; Positions 880408 878655. XX SQ Sequence 1754 BP; 488 A; 414 C; 351 G; 501 T; 0 other; tgtaaccgtc ggttccaaaa ttttatagtt tgtaggctcc tcctcaaaca tccctgtaca 60 ttcaacccgc ctgccagtgt gttcccaatt cccactccaa atcatagatt taaaaattga 120 aatgacagcc tcactggcgg gaccttttgc tgtcattgca gagctaaagg cgttattata 180 gtaacacacg ttacgttgtc tcgaaacttg cgtcatttat tggcagcggt ttgagttacc 240 taaatgtagt cggcctaaca tgaaccctgc gcggaaatgt tgccaaccta aatctgagtg 300 gcctcacttc gggtgagttg ccaaacttcg gatgttttct gctgattaat gtccaagcaa 360 agaattccag ggaaaattga cttcattctg ttcgtgacta tcacagataa tagtagtgcg 420 cgccgtgtat tcttaaaaac gtaaaaaaaa tgtaatgtgt aaaattgtgg tgtataaaat 480 taatttgtgg tttttctttt agatcttaaa agctagttaa aacttaaaaa ctagttaaaa 540 cttaaagcta atagtacggc aagtgttaag aagtacgata aatagttcaa agtacggtta 600 gttaaaaaaa aaaatgtatt atgtgattgt aattataatg tgtctttgtt acagcttaaa 660 caatagaacc atcaaacaaa aggttcaatt aattcctata gttgaatagg atttcaaaag 720 aaaaggtaat tcttttgtta ctattgtgat taactattag tatggtgaag gaagagagag 780 acagtttcaa ggtgtgctaa tttcccggaa taggtccact tctgggcctt ttctttttga 840 ccttcgggtt ccggaggact tctttttgtg catccacaga atttcccaag gccttttgga 900 ccggtctaaa ccggacatta ttacaatacc ctgaacccaa gccacctgga ggattctgga 960 attgcggacg gaagatttcc tttgaagacc attccccacc atctccattt tgggaatcgg 1020 cagtccttta agccggattg tggattgagt cggccatagc agaggtttcc accgcattgt 1080 ctgcgtctgc gttctccatc aaccagcttc gccgttagta ttgctgcgtc cgtgtaagcc 1140 accaacccca ccattctgcg cagaggttcc caacgcattc tctgcggcag cgttcagcct 1200 tcgtcgaaaa cccaccccag tcaagccatc agttaccgcc gtgaaagcca cccctccgat 1260 gcatgcatgc caccgcctac caccgcccac caccaagcta gccaccgtac agcaccaacc 1320 accctcacca aaatcgcaag tacaccttac caatgatgtc tctctcttta gggcccaaag 1380 aggcaaaata aatgtaggca tgtaagttag aaactttggt ggttattttt ctatttgaaa 1440 accccacaat tggtttatat acaccctttt gatattttgc tcgctttccc gtgcccaaaa 1500 aggggcattc acgcctcgat ctgggtagtc cgctggagaa atacgatttt caccttgaat 1560 tctactgact ttggtccaga aaatagttgg tttccttccc ctcgtacgga actgatagct 1620 ggcggttaga atccagctcc tgctaaaggt gagtgaccgg tttagtcagc gtaacggtgc 1680 tattaagaat aagcgaccct gtgctcagca tttcccaacc cacaaatccc ctactgagca 1740 gcctaggagc taca 1754 // ID Gypsy-23_DYa-LTR repbase; DNA; INV; 263 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_DYa_; KW Gypsy-23_DYa-I; Gypsy-23_DYa-LTR. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-263 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 3035549 3035811. XX SQ Sequence 263 BP; 85 A; 40 C; 65 G; 73 T; 0 other; tgtcggagaa ggaaatgtta ttttgaaatg agcgccttat agatggttaa cttgcttacg 60 atactaatcg ataggacggt tttggttggt atttggttgt gagatcttga catacaaaaa 120 gatcgagtgt ggccctggtt tagcgtggtg gttcttctct aagaaaaaag ttaagaagaa 180 acacacaagc agcggaagaa caattggaaa agctatcaag ggactcgcta cacaatcata 240 gaacaacttt tagcgtttcg aca 263 // ID BEL-634_AA-LTR repbase; DNA; INV; 517 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-634_AA_; KW Pao_Bel_Ele201; BEL-634_AA-I; BEL-634_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-517 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 517 BP; 184 A; 96 C; 86 G; 151 T; 0 other; tgttggcgaa accacagggc agttctcgca ctcgaatgag tgagtcccga ctacaccctt 60 ttcgcttgaa cgatccgttg acagctacgt caactgtcat cgggagtttg acgttttgat 120 tcaccctttt gctcaatcat tcttgaacaa ctatcacgag tagtaattaa aaccctacaa 180 ctaaacttaa acttgaaata gtacggattt gttaaaatta tcaattggat tattgttata 240 ttgaattcga aaggtaattg ctagtgatac ctgctaaaat atattattct atatgtaaaa 300 ctattatcac agatcgaaat aaaagggaag cacagtcagg acaaaattat aacctaaaac 360 gtaaaaagaa caccaaaatg taagtaatga actaaaacta tgtgaattga acttaatcta 420 acatgcaata aataaaactt ttagctttga agctgtgtca aaggacctgc taccaaggag 480 tgttcaatta tagtccgaac tgcgattttt cccaaca 517 // ID BEL-204_AA-I repbase; DNA; INV; 6873 BP. XX AC AAGE02024599; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-204_AA_; KW BEL-204_AA-LTR; BEL-204_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6873 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024599; Positions 23041 16169. XX CC Positions [5850-6428] - Integrase core CC 'AACAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1524..6782 FT /product="BEL-204_AA-I_1p" FT /translation="MSKPITRNADSLKSLTTRLKGLRLSLSNICQFVRNFK FT DDTSATQINVRLEKLDSLWVQISETIWEIEAHEEFEEQESLQKDQLEMENR FT YYDAKSFLAEKAQVFQSNSIQNQSIRSGDATLHGVMDHVRLPQIKLQCFDG FT NIDDWLSFRDLYTSLIHEKPDLPEVEKFHYLKGCLAGEAKALIDPLKITRD FT NYQVAWKTLLKRYNNNKLLKKKQVQAIIKLPSLSKESITELHKLVDGFDRV FT VQTLDQIIQPAEYKDLLLVELLSSRLDPITRRGWEEHSSTKQQDTFKDLLE FT FLQRRIQILEALPAKVESRVDQVLQPKRKPFSAKVSNNAVQSNNAKCPSCT FT EIHGLHTCPDFLKMSISSRESLLRAQSLCRNCLKRGHLARKCSSKFSCRYC FT KARHHTLLCFKAGETNGSQESSGKANLRDNSQQNDTSANSKAANASAVEST FT SSNAAQCVSSQILLATAVVVIEDDQGSRYPARSLLDSESECNFISENLYQM FT MKVSVQKANISIMGIGQSGTRASRKIKAAVRSRTSSYSRTMEFLVLPKVTA FT CLPTSTVEANGWDIPVDIELADPEFFRSRKVDLVLGIQAFFSFFPSGREIQ FT LGMGLPVLTESVFGWIVTGEVTSASQGTKYTCNMAVFDDLEDLMERFWSCE FT EVGAVSKYSPEEARCEEQYERTVKRDSDGRYTVSLPKNNEVLAGLGESKDI FT ALKRLRAVERRLSRDERLQKQYSDFMTEYLELGHMRMVDESEEKAVKRCFL FT PHHPVIKESSTTTKVRVVFDASCKTSTGVSLNDALYAGPVVQQDLRSIVLR FT SRIRQIMVVADVEKMFRQIWTSLEDTPLQCILWSTPDGKVVPFELLTVTYG FT TKSAPYLATRTLKQLAFDERERFPLAAPTIEEDVYMDDVISGADDVESAIE FT LRRQVDAMMDSGGFKLRKWASNRPEVLEGIPCENLALPDSNGIDWDQNAEV FT KALGLTWLPNADQFRFKFEVPPITENQILTKRKVLCYIARLFDPLGLLGAT FT ITSAKVFMQRLWCLKNEAGCILQWDDPLPEMVGEEWRTFHKQIPMLNEIRV FT PRCVVSPNSVSVEYHCFSDASIVAYGATIYVRSQRDDGSVSVHLLTSKSKV FT APLKVQSLPRLELCGALLVSQLWEKVAESLKAEEKVHFWTDSTCVLHWINS FT PPGTWTTFVANRVAKIQGLTESDQWRHVPGNTNPADLISRGVSPTSIVGND FT MWWHGPTWLLEIPEKWPKPVETHPEEELERRRNVVACTASKESGFISDLVA FT KFSSFIRLIRMTAYWLRFLNNLRCSAIEKQTGYLSTHELQLAEQMIVRKVQ FT AESFVDEIGKLTAGGSVARNSPLRWFNPRIDENGILRVGGRLAHSEESHQT FT KHPIVLPARHALTELVLRHFHQIYLHAGPQLLLGTVRQRYWPLGGRNLAKK FT IIHQCQRCFRAKPSIMQQQMGELPAARVTVSRPFSKSGVDYFGPVYLRAGR FT GRKPTKAYVAIFVCMATKAVHMELVSDLSTERFLQALRRFFSRRGRSTDIY FT SDNGTNFVGARNQIQELFELLKNKTHRETVSRECANEGIYWHFNPPSAPHF FT GGLWEAAVRSAKFHLLRVLGGNSVSHEDLFTLLTQVEGCLNSRPLTSISDD FT PSDLEPLTPAHFLIGGSLLALPDPDLSDVQLNRLSRFKVVQRQLQDFWKRW FT RLEYLSQLQARSKHWQPAVDVQVGRLVVIVDSNQPPTQWKMGRIEELHPGD FT DGITRVVTVRTATGSLKRPVVKLCLLPTLTEED" XX SQ Sequence 6873 BP; 1857 A; 1571 C; 1712 G; 1733 T; 0 other; taagctggtc cttcgaaccg gatttccact accgccagac ctcagatcgg tcgaatctcg 60 acgcttcgac cacttcgcgc caagaacgtc catcaaaagg ccattctttg gacaaaggac 120 ttccgccata ctacagtgcc gtgaatagat ccatcgccat ctcaaggatc ttcaaaggga 180 gtagtcgcca tctcaagccg tcttcaaagg atccttcggc cattcaagat cgtgtaaaag 240 gatccatccg ccatccttca tctagtcaat tggaattgat cgaagggacg gttccgaaac 300 catttgccgt cattaattag aagcgacatc gactgaagcc ttcgagtggg aaattgaact 360 cggcctgcat tcttggagcg tctacaggat tgtagaataa ccagagcgtt gctctactac 420 cggtcaggta tgtaagaaca aatatagtaa atgtcttgcc gaagcatgtt caatattccc 480 cactaacgca aacccttcga tctcttgtgt gactgtgatt catcgagccc caactggacg 540 gcaatctggt gtattgagct gctacgctgc gattattcgg tgtggaaagt gtggaccatc 600 aacggatacg tcggtgaaga attcggttcg cgaagaactg atttgcgact tccatccaac 660 ggcgaaagcg gtgacgatta cgtacggcga ggacgacgat ttgactgtgt tcaacgacaa 720 cgaacgatcg attgcgacca ctagacagct agcacgccgt cgcagtagct tcgaagcctg 780 ggcatcacgg aacggccatc gcggtcaggt aaatacactt taaagtatat gcccggccga 840 agctggatga ttgattctac acacctgaat aaatttaatt cctttcgatg taaattggac 900 cgttgcgagt caactcattt atggacgcaa tccctgaggt ttacgttggt tttcggactt 960 gaactactgc attcagcgcc atctttagtt ctccaggtga gactaagtat atgtcctgcc 1020 gaggcaggcc attttgtgaa actacattgg ttcttctttc tccgctgcac tgatttggat 1080 ctgaattgag acgatttccc atttatcgcc acgaaattga tgaattgtga cgaattgagg 1140 catttttgaa ctcggctacc attgcacgct gttgtttcat aggtgaggcg aacagtatat 1200 gtctagccga agtttcaatt ttgtatacta cattgagttt tccactcctt cgcgcttcac 1260 tggcttgaat cggaatctga ggcgaataat catacattga ggtgaatcaa attggagatt 1320 gctgctttgg cttcggtggt gccactgttt cttctcatag gtgagctgaa cagtatatgt 1380 ctagccgaag caaccaaatt tgaatactac atctccgttt tctccgattt tggattttgc 1440 tacgattcaa cggaactgcg agtggaacat cctgcccaac tgccaacttt agttcgagtt 1500 tccgttacga gaggctttca aacatgtcga aacccatcac gaggaatgct gattccttga 1560 agtcgttgac tacaaggctc aagggactca gattatcttt aagcaatatt tgccagtttg 1620 ttagaaactt taaggatgat acttcagcca ctcaaatcaa tgttcgtctt gagaagttag 1680 attcgctatg ggtacaaatt agtgaaacta tttgggaaat cgaagcccac gaggagttcg 1740 aggagcagga gagtcttcag aaggatcaat tggagatgga aaatcgctac tacgatgcga 1800 aatcgtttct agcggagaag gctcaagtgt tccagagtaa ttctatccag aaccaatcaa 1860 ttcgctccgg agatgccact ctacatggcg ttatggacca cgttcgatta cctcaaatca 1920 aacttcaatg ttttgatggt aatatcgatg actggctgag tttcagggat ctctacactt 1980 ccttgatcca tgagaagcca gatttaccgg aagttgagaa gttccactat ctcaaggggt 2040 gcttagcagg cgaggctaaa gcactaatcg atccgctaaa gatcaccagg gacaactatc 2100 aagttgcttg gaagacgttg ctaaaacgtt acaacaacaa caaactccta aagaagaagc 2160 aagtgcaagc aattatcaaa ctgccttcgc ttagcaagga atcaatcacg gaactccaca 2220 aattggttga tggattcgac cgtgtggttc agacactcga ccagattatc cagccagccg 2280 aatacaagga tttgttgctg gtggaactcc ttagttctcg tctcgatccc attacacgac 2340 gagggtggga agaacattcg tctacgaaac agcaggacac cttcaaggac ttgctggaat 2400 ttctacagcg gagaatccaa attctcgagg ctctaccagc gaaagtagaa tcaagagtcg 2460 atcaagtcct tcaaccgaaa cggaagccat tttcagcgaa agtcagcaat aacgccgtac 2520 agtcgaacaa cgctaagtgt ccctcgtgta cggagattca tggacttcac acgtgtccag 2580 actttctgaa gatgtcgatt tctagtcgag aatccctttt gcgggctcag tcactctgcc 2640 gcaattgtct caagcgtggt catctagctc gaaaatgctc gtcaaagttt tcttgtcggt 2700 attgtaaggc tcgacaccat acgttgctgt gtttcaaggc aggagaaaca aacggaagcc 2760 aagaatcttc agggaaggcg aatctgagag acaattctca gcagaacgac acaagtgcaa 2820 attcgaaggc agcgaatgcc tcagcggttg aatctacttc ttccaacgcg gcacagtgtg 2880 tttcatcgca gattcttctg gctacagcag tcgtcgtgat cgaggacgac cagggctctc 2940 gctatccagc acgttcacta ctggattcag agtcagagtg caacttcatc tccgagaatc 3000 tgtaccagat gatgaaggtt tcagttcaaa aggccaacat ttcgattatg ggaattggac 3060 agtctggcac aagagcatct cgtaaaatca aggcggccgt cagatctcga acttcgtcat 3120 actctcgaac gatggaattt ctggtactac cgaaggttac agcttgcctt ccaacctcta 3180 cagttgaggc aaatggatgg gacataccgg tagacatcga attagcggat ccggagtttt 3240 tccgatcaag gaaggtcgac ttggtgttgg gaatccaagc tttcttcagt ttcttcccgt 3300 ctggaaggga aattcagttg ggcatgggcc tgcctgtact caccgagtcc gtttttgggt 3360 ggatagtgac aggcgaagtc acttctgcaa gtcaaggtac caaatacact tgcaacatgg 3420 cagttttcga tgatctggag gacctaatgg aacggttttg gtcctgcgaa gaagtgggag 3480 ccgtcagcaa atactctcca gaagaagcgc gctgcgagga gcagtacgag cgcacagtca 3540 agcgagactc ggatggacga tacacggttt ctcttcccaa gaacaacgaa gttttggcag 3600 gccttggtga atcaaaggac atcgctctga agcggctgcg agcggtggaa cgaaggttgt 3660 caagagacga acgtcttcaa aaacagtata gcgacttcat gaccgaatat ctggagctcg 3720 gtcacatgcg aatggtggat gaaagcgagg aaaaggcagt taaacggtgt tttctgcccc 3780 accatccagt tatcaaggag tcaagtacca cgacgaaggt gagagtggtc ttcgatgcct 3840 cctgtaaaac ttctacgggt gtgtcgttga atgatgcact gtatgctgga cccgtcgttc 3900 aacaagatct gaggtcgatc gttcttcgta gtaggattcg tcaaataatg gtcgtagctg 3960 atgtcgagaa aatgtttcgg caaatttgga cgagtttgga ggacacgccg ctacaatgca 4020 tcctgtggag tactcctgat gggaaggtgg taccgtttga gctccttact gtgacgtacg 4080 gcaccaaatc agcaccatat ctcgccactc gaaccttgaa acagctggca ttcgatgaac 4140 gagaaaggtt tccactggca gcaccgacaa tcgaagagga cgtgtacatg gacgacgtta 4200 tttctggagc agacgacgtc gaatctgcga ttgaattgag gcgacaagtc gacgcaatga 4260 tggatagcgg tggattcaaa ttgcgaaagt gggcttccaa ccgtcctgaa gtacttgaag 4320 gtattccttg cgaaaatctg gcactaccag attcaaatgg cattgattgg gaccagaatg 4380 cagaggtgaa ggcattgggc cttacgtggc ttcccaacgc agatcaattc cgtttcaaat 4440 tcgaggtccc tcctataacg gaaaatcaga ttctcacgaa gcgaaaggtg ttgtgctaca 4500 tagcaaggct tttcgacccg ttgggtttgc tgggagcgac gatcacatca gcaaaggtct 4560 tcatgcaacg gctctggtgt ctgaagaatg aagcaggttg cattctccaa tgggacgacc 4620 cactacccga aatggtgggt gaggaatggc gaacattcca taagcaaatt ccgatgttga 4680 atgaaattcg ggttcctcga tgcgtagtta gtccaaattc agtatctgta gaatatcact 4740 gcttctcgga tgcctcgata gttgcctacg gtgcgacaat ctacgttcga agtcagaggg 4800 acgacggctc agtttcagta cacctgctca catcgaaatc aaaggtggca ccactcaagg 4860 tgcaatctct accacggttg gaactatgcg gtgcgcttct ggtatcgcaa ctctgggaaa 4920 aggtggcaga atcgctgaaa gcagaagaga aggttcactt ttggacggat tcgacctgtg 4980 tgcttcactg gatcaattca ccaccaggaa catggacaac tttcgtcgca aacagagttg 5040 cgaaaattca agggttgacg gaatctgatc aatggagaca cgttccagga aatactaacc 5100 ctgctgacct gatttcgcgt ggggtttcgc caaccagcat cgtcggaaat gatatgtggt 5160 ggcatggtcc tacctggctg ttggaaatcc ctgaaaaatg gcctaaaccc gtggaaaccc 5220 acccagaaga ggaactggaa aggcgtcgca atgtggtggc gtgcactgcg tcgaaggagt 5280 ctgggttcat ttcggattta gtagcaaaat tttcgtcgtt tattagattg attcggatga 5340 cagcatactg gttgcgcttc ctgaacaact tacgatgttc agcaattgaa aaacagacag 5400 gatatctttc cacgcatgag ctacaattag cggaacaaat gattgtccgc aaggttcagg 5460 cggaatcgtt cgtggacgag atcggtaagc taacggctgg tggctcggtg gcacggaatt 5520 cccctctacg atggttcaat cctcgaatcg acgaaaatgg tatcctacga gtcggtggtc 5580 gattagctca ttccgaggaa tcgcaccaaa ccaaacatcc gatcgtccta ccggcaaggc 5640 acgcactcac cgagttggtt ctccgtcatt ttcatcaaat ctaccttcac gctgggccac 5700 aattgctcct agggacggtt cggcaacgtt actggccact cggtggcagg aatctagcga 5760 agaagataat acatcagtgt cagcgatgct tcagggcaaa accatcaatc atgcagcaac 5820 aaatgggaga attaccagca gctagagtca cagtttcaag accgttttcg aagtccggcg 5880 ttgattactt cggacctgtg tatctgcggg ccggtcgagg acgcaaacca actaaagcgt 5940 acgtcgcaat ctttgtgtgt atggctacga aagccgtcca tatggaacta gtgtcggacc 6000 tttcaacaga gcgtttcctg caggcattgc gaaggttctt ttcacgccgg ggtcgttcca 6060 cagacatcta ctccgataat gggacgaact tcgtgggagc gagaaatcaa atccaggagt 6120 tgttcgagtt gctgaaaaac aaaacacatc gtgagacggt ttcgagagaa tgtgccaacg 6180 aaggcatata ttggcatttc aaccctccga gtgctccaca tttcggaggt ttatgggaag 6240 cagcggttcg gtcagcgaaa tttcatctgt taagagttct tggaggcaat tcagtatcgc 6300 acgaagacct ttttactttg ttgacgcagg tggaaggatg cttgaattcg agaccattaa 6360 catccatatc agacgatcca tcggatttag agccattgac tccagcccat ttcttgattg 6420 gtggatcgct tcttgcctta cccgatccag atcttagcga tgttcaactt aatcgcttat 6480 cacggtttaa agtcgtacag cggcagcttc aggatttctg gaaacgatgg cgtctcgaat 6540 atctctctca acttcaggct cgatcaaaac attggcagcc agcagtagat gtgcaagttg 6600 gaaggctggt tgttattgtc gattctaatc aaccacccac acagtggaag atgggccgaa 6660 tagaggaatt acatccaggc gatgatggaa tcactcgagt agttacagtg cggacagcga 6720 ctggaagttt gaaacgacca gtagtcaagt tgtgtttatt accaacgtta acggaagagg 6780 attagggaag actctgaaga ttccaaccag tgcgtcgaga ggagattttc tttattttca 6840 gaagttttcc tgcaacttca gggtgggtga gga 6873 // ID HERO-1_BF repbase; DNA; INV; 3279 BP. XX AC . XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE Amphioxus HERO-1_BF autonomous non-LTR Retrotransposon - DE consensus. XX KW Hero; Non-LTR Retrotransposon; Transposable Element; HERO-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3279 RA Kapitonov V.V. and Jurka J.; RT "Young families of HERO non-LTR retrotransposons from the RT amphioxus and sea urchin genomes."; RL Repbase Reports 9(5), 1136-1136 (2009). XX DR [1] (Consensus) XX CC The consensus sequence was built from several sequences less than CC 5% divergent from each other. XX FH Key Location/Qualifiers FT CDS 2..3070 FT /product="HERO-1_BFp" FT /note="Contains the reverse transcriptase and FT restriction enzyme-like nuclease domains." FT /translation="PVSQQEVVAGATPSPQEPAPGRKEKIKWPRANNKEEW FT EKLDDDLHQILEQGLKGDVERKLNQFGNLVYSYCSERFGTYKKEGKEAREK FT QTSRREREIAILVRRRRQLRKRWRKAKQEEKEGLEPLWEEIKSKLKSLRRA FT ERLRRKRKRKAKERNQFFKNPYKFARNLLEEKKGGSLQISREDLELHLKQT FT YSDPQRDEELGSPGYVPRPTEPEIPFDVSLPRWSELETAVKKARAGSAPGP FT NGVSYQIYKKCPKVVRLLWKLFRVAWKKQCIPSAWRRAGGVLIPKEKVSTD FT IKQFRNISLLNVEGKLFFSILTKRRITYLLANNYIDTEVQKAGVPGFPGCI FT EHASTIWQQIQMARRQKEDLHVVWLDLTNAYGAVPHSVIHYALNFFWVPET FT IRTMIQNYFQDFRVSVTTPQFTTGWQQLEKGIAMGCTISPLLFVLGFEIFL FT IGARQVAGGIKLPSGQRLPPLRAIMDDVTSILRSAPCTRRVLQRLEELTDW FT ARMRFKPSKSRSLSLRKGKSSNRVFSINNQDIPTIQQEPVKSLGRLYTSDI FT ADTKRGQELVKQAVEGLRAIDKCELPGKLKVWCLQSVLIPRLKWPMKMYDI FT PLSTADQVEGKANSFVRKWLGVPRCLSRTALAGRNKLTLPITSTSITEEYK FT LEKVRTALELKWSQDNAVRAAYRGQKTGRKWNPDGVIDQAVSRLKHRDIVG FT AVQQGRCGLGWGERTLRWDKATQRGKKQLVVDEVKRMMEEERKVKVVGQHQ FT QGAWINWESTVDRKLTWKNMWDKPDHRLSFLIRAAYDILPCPRNLSRWYSK FT EESCELCGANKANLKHILSACTTSLKQGRYTWRHNQALRILAAALEEGRKR FT VSSNRVQNQSFIPFVKEGQKAAPANRSRGEGLMSKGGWEMTVDLDTRLTFP FT RTICETTLRPDIVLWSPTQRTVVIVELTVPWEENVQAAFERKKLKYQDLVQ FT QCVENSWRALLYPVEVGCRGFVGTSITRLCRELSLSHKQLVKALSEEVERC FT SFWLWVKRKDQKWGTKE" XX SQ Sequence 3279 BP; 1042 A; 677 C; 923 G; 637 T; 0 other; gccagtctcg cagcaggaag tagtagcagg ggcaactcct agtccacaag aaccagcccc 60 agggaggaaa gagaagatta agtggccaag ggcgaacaac aaggaggaat gggagaagtt 120 agatgatgat ttacatcaga tcctcgagca ggggctgaaa ggcgacgtag agaggaaact 180 taaccagttc ggcaacttag tgtacagtta ctgcagcgaa aggttcggca cgtacaagaa 240 ggagggaaaa gaggcccgcg aaaaacagac gagtcgcaga gagagagaga ttgcaatatt 300 ggtccgcaga cgtcgccaat tgaggaagag atggcgtaaa gcaaagcaag aagagaagga 360 aggtctagag ccgttatggg aagagattaa atcgaagttg aaaagcctta ggagagccga 420 gagactcaga aggaagcgta aaaggaaagc aaaagagaga aaccaattct tcaaaaaccc 480 atacaagttt gctagaaact tgcttgagga aaagaaggga ggctctcttc agatttctag 540 ggaagatttg gagttgcatc tcaaacaaac gtacagcgat ccccagagag atgaagagtt 600 ggggtcgcca ggttacgtac ccagaccaac tgaacccgaa ataccctttg atgtatccct 660 ccccaggtgg agcgagttgg agacggctgt caagaaagcg agggctggtt cagcaccagg 720 gccaaacggt gtgtcctatc agatatacaa gaagtgtcca aaggttgtga gactgctctg 780 gaagcttttt agagttgcat ggaagaaaca gtgtatccct agtgcttgga gaagagcagg 840 aggagtactg atcccaaagg agaaagtttc tacagacatc aagcagttcc gaaatatatc 900 acttttgaat gtagagggaa agctgttctt ctccatctta acaaagcgga ggataaccta 960 cctgctcgcc aacaactaca ttgacactga agtccagaag gcaggggtgc caggctttcc 1020 tggatgcatc gaacatgcaa gcacgatttg gcaacagatt cagatggcaa gacgccaaaa 1080 ggaggacttg cacgtagtgt ggttagatct caccaatgca tatggggcag taccgcacag 1140 cgtaatccac tatgcactga acttcttctg ggtgccagag accattagaa ccatgattca 1200 gaactacttc caggacttca gagttagtgt aaccacacca cagttcacta caggatggca 1260 acagttggaa aagggaatag caatgggttg caccatctca cctctccttt ttgttctggg 1320 ttttgaaata ttcctcatag gtgcaaggca agtggcgggt ggaataaaac tgccttcagg 1380 acaaagactt ccgccactca gggctattat ggatgatgta accagcattc tcaggagtgc 1440 cccgtgtact agacgggtgc tgcagaggtt agaagagtta acagattggg cgcggatgag 1500 atttaaaccc agcaaatcca gaagtctgtc actaaggaaa gggaaaagca gcaacagagt 1560 tttctcaata aataaccaag acattcccac aatccaacag gaacctgtta aaagcctggg 1620 acggctttac acaagcgaca tagctgacac caagcgagga caagagctgg tgaaacaagc 1680 agtggaaggc ctaagggcga tagacaagtg tgagctaccg ggcaaactga aagtgtggtg 1740 tctacagagt gtcctgatac caagactcaa atggcccatg aagatgtatg atatacccct 1800 ctcaacagcc gaccaagtcg aaggaaaagc aaactcgttc gtaaggaagt ggttgggggt 1860 tccacgatgt ctgtctagga ccgcattagc cgggaggaac aaactgaccc ttccgattac 1920 tagtacctcc attacagagg aatacaagtt ggagaaagtg cggacggctt tagagctgaa 1980 gtggtcgcag gacaacgcag ttagagcagc ctacagaggg cagaaaacag ggagaaagtg 2040 gaatccagac ggagtgatcg accaagctgt cagcaggctc aagcatagag atatagtagg 2100 agctgtacag caggggaggt gtggtcttgg atggggcgag agaaccctga gatgggacaa 2160 ggccacccag cgaggaaaga aacaactggt agtggatgag gtgaagagaa tgatggaaga 2220 agagaggaaa gtaaaggttg ttggtcaaca ccagcaggga gcctggatta actgggagag 2280 cacagtggac agaaagttaa cgtggaaaaa catgtgggac aaaccagacc acagattaag 2340 ttttcttata agggcagcct acgatattct cccatgtcca agaaacctaa gcagatggta 2400 ctcaaaggaa gaatcttgtg aattgtgtgg ggcgaataag gccaacctaa agcatatcct 2460 ttctgcctgc actacctcgt taaaacaggg aagatacacc tggcgtcaca accaagctct 2520 gaggatcctt gctgcagcct tagaggaggg aaggaaaagg gtgagcagca atagagttca 2580 gaaccagtcc ttcatccctt tcgtcaagga agggcagaaa gcggctccag ccaacagaag 2640 tagaggtgaa ggtctgatga gcaaaggagg gtgggagatg acagtcgacc tggacacccg 2700 tttaaccttc ccaagaacca tctgtgaaac gactctgcgc ccagacatag tgctttggtc 2760 accaacacag agaacagtcg tcatagtcga gttaactgtc ccttgggagg aaaatgtcca 2820 ggctgcgttt gagaggaaaa agctcaaata tcaagaccta gtacaacagt gtgtagagaa 2880 cagttggagg gcgttgttat acccagtaga ggtaggatgc aggggattcg taggaacatc 2940 catcacacgc ctctgtagag aactctcact cagtcacaaa cagttagtga aggcgttatc 3000 cgaagaagtg gagagatgca gcttttggct gtgggtcaag aggaaagacc agaagtgggg 3060 aaccaaggag tagtagtcag ttcaagagtt gcagagaagc agagtgaaga cgttcgccct 3120 gctgtccccc cacctagagg ccgtctcaga ttaggggacg aaacgaccga tgagaggtgg 3180 ttctgactga tgatttctgc aactcaaggc gctggaaagt gcgacaagta gcaagcctac 3240 taggcaggtg agtttcagac tcatgctgta tacgtctaa 3279 // ID R1A_DS repbase; DNA; INV; 2232 BP. XX AC AF015489; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Dugesiella sp. retrotransposon R1 reverse transcriptase gene, DE partial cds. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; R1A_DS. XX OS Dugesiella sp. OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Araneae; OC Mygalomorphae; Theraphosidae; Dugesiella. XX RN [1] RP 1-2232 RA Burke D.W., Malik S.H. and Eickbush H.T.; RT "R1 and R2 Provide An Estimate for the Age and Stability of RT Retrotransposons."; RL Unpublished. XX RN [2] RP 1-2232 RA Burke D.W. and Eickbush H.T.; RT "R1A_DS."; RL Direct Submission to Genbank (22-JUL-1997)Biology, Univ. of RL Rochester, Hutchison Hall, Rochester, NY 14627, USA. XX DR GenBank; AF015489; Positions 1 2232. XX SQ Sequence 2232 BP; 538 A; 548 C; 595 G; 551 T; 0 other; tacgcgtttg cgcaggttga gatcgatgct gtctgtaagt ctcttccgcg gaagaaggct 60 cccggagagg accgtttcac ctacgaaata gttcgaggac tgcatgaagc tttgccaagg 120 ctgttgggtg cactgttcaa cagatgtctt tctttgaagg tattcccgaa tgagtggaaa 180 aagggtgaac tagctttgct agcgaagatt gggaaagacg taaaccgtcc aaacgcgtac 240 aggcctattt gccttctccc ggctctggga aagatctatg aaaagttgat cgtcaataga 300 ctctggcatt ttgtgacgat caacaaagct ctatcagata accagttcgg cttcgtccca 360 tatcgctcca ctactatggc tattgataaa cttttgggga cattccgcca ctttgaggat 420 gaacgctgcg tcacgctttt ggtctccctg gatatctcaa acgcttttaa ctcggtctgg 480 tatccgagtg ttttgagata cctacggcaa aacaggttcc ctggtaacct catagacata 540 atcgtagatt acttccggga ccgcacgatc agcgttcgtt ttgggagcac tcgtgttacc 600 cgcaatgttg ttagaggctg cccgcaggga tcggttctgg gacctcttct ctggaacatg 660 gtcatagacc ctgtcctccg agagaggctt cccggaggtg ccatactcca agcctatgca 720 gacgatctgg tccttgcggt tcccggtaaa agcagagtag acctactagg caaagcggtt 780 gactgcctta gtacggtccg tgagtgggca ggacgaagca aactggcgct ggacacggct 840 aaatccaaag ccatggtctt gaacagcaag taccgttcgc ttgatcttag ccaccactta 900 gtagggggca ttaaggtgga aaaagagttg aagtacttgg gtctgatcat tgactcggac 960 ttaaacttcc agtcgcactt taaggcggtc agtgcacgag tcaacagatt cgtgcacaac 1020 ttaagatcta tgaccgttcg agcaaatggc atttgcccag atgccataag aacctattac 1080 catcaggttc tgcgacagtg gatgagctac ggggcagcca tatggttccc cgggcacatt 1140 aaaaagttct atgatcgttt caatgcgatc caacgatcgg ctctgattct tgcgactaga 1200 gcgtaccctc gtgccctact acggcattac aggcgatcag tggcatcttg ccaggatctc 1260 caaatgtgga ggtctgtgta caacacgcaa ctcctgcact tgcagaaggt gactatggtg 1320 gatggaagaa gctaccatcc cacgatgttt cagcagaggg cgaataggtt cagcatccat 1380 ccagggaaat ggatagctat tgaatggtcg aacttcggtt ctccgagcac ggtctcggac 1440 tcagataaac tggctcgcca ccagatctat actgacgggt cgaaaagtga atcagggact 1500 ggtgctgcgt ttgtggtgtt tcggaacggc gagttgtgga tgagcagaag ctacaagatg 1560 actgcgagta acaccagcag tcaggccgag attttggcta tttggaaggc gttgcaatgg 1620 ctgcttgcgg atggtgtcgg cattaagagc tgtgcagtca ttactgacag ccaatcgtcg 1680 ttacaggccc tagccaatcc ttcttgtgat tggctattag taatgcgggc aaaggcagcc 1740 tatcggcagc tgctgcgaaa tggagttgca gtccgtttct tttggaccaa aggacacgcg 1800 acttgcgagg gcaataaaat cgccgacagt gcggcaagag aagcctcggc ttcgggactc 1860 tcgatcgagg tcccccttcc gcactcatac ttcaaatccg tgacagccaa gatggcctac 1920 cgcctttggg agcagcggtg gcgcgcctcc aatggctcca gtgcgacgca tttgttcata 1980 gaggtaccta cctctaagag ttggagtaac tgctcggaaa tgactcaagt tctgactaag 2040 catgcgcagt accccgagta cttgcacaga cgtggagtta ttaattctcc aaggtgtgta 2100 tgcggcgaga tgggctcttt ggaccactac ctcgccgcat gcccggaaac cctttgtttc 2160 cgacagaagc tcacgtcgcg ttgcagtaca tacgcggacc taccgcgatt gtcaggattt 2220 cctctaattt ag 2232 // ID CR1-55_HM repbase; DNA; INV; 3858 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-55_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3858 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1883-1883 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(5..745,749..3706) FT /product="CR1-55_HM_1p" FT /translation="RWQPKFCIKYYNFNFLPKLKIFIQLTLIKMVRKEDFE FT ALVSRVLKLEAELLTKDKEITQLKILVNDLKNQPSNVLNSSACAWSDVVKG FT KKSNKAAQLDLVNVVVKESQDRDLRQKNLIVFGFNTSKNKNDDVAREEDIL FT GIQNIFNQINAKATIQRVFKLSIKSDKPPPVIVVLTNKEERNAVLKTAKTL FT RNLPETNNIFINADLTVAERLKMKALREERAKLNETNKDLQYYYGIRNERV FT VKLIKRQFISSQQDQLLFSKNLLFKNSNKFIEKNNKRQLINLKNQELFNHP FT KNNYCGKWALTNIRSLGNKIADFQHFVHLHSLDICCVIETWLNKNLPDSLL FT CPRFYTVLRCDRSSKGGGIAIFFRSFIQFTKVDIPPEFHELEIVCADLNLN FT GQSFRVIGYYRSGGFGKVAIDYMQSSVKCFSILCSTKKNVVILGDFNLPDI FT DWHYYHGPDSVIHNSFLHFINRYSLTQHVDQPTRENSILDLILSSSSSFIS FT NLCIEPPIGTSDHNVIIFTPNLSCLYNKTEVPVQQFFCWKNADYQVINKYL FT KSVDWSIIFQTCFNIEACWHTFSTILNNIITQHXPKVRVCEEHKLSHKIHY FT PYYVKRLINNKAMAWKRSKVTGKSNDKAAYKSIAAQCSDSIRKYHAAKELK FT MIRKDNLGSFFNFVNGKLKTCSKLNDIRRSDNTLCHDKEEISCIFNNFFAS FT VFTVDNGLSPKFTDRVNVHSTCLSSINFSPSTVEKALRALKPTTSTGLDGI FT PNVFLKNCANSLSIPLSHLFDTSFKDGAIPLDWKVASVIPIHKKGPTVDPG FT NYRPISLTSTCCRVMERIINNKLINYLLLNNLITKEQHGFIRKRSTCTNIL FT ESLHDWTLNLQRRIPTDIIYFDFKKAFDSVSHPKLLMKLPAYGIVGNLLNW FT LTNFLHMRSQVVNLNNVTSHAISVTSGVPQGSVLGPTLFLLFINDVSHINK FT QLDVKLKLFADDIKLYSVYDVRGLQSDLHTAVNXLYEWSCTWQLQIATEKS FT FVCTIANQSQNIAQRVYTINNHELAHVNCIRDLGVTIDRNLKFNQHINLIV FT HKAMSRAFLILKSFYSRDRLLMVKAYCTYVRPMLEYCSSVWSPYTICLINK FT VEKVQRFFTKRIAGLWSVSYDKRLAILKLNTLEYRRVFKDLVLCYKILNGK FT LDTDLSNILILNSYLNNRGHNFKLYKHHCSLNSTKCFFSNRIVNVWNSLSE FT DVVSASSVLMFKKRLAVLLCSS*" XX SQ Sequence 3858 BP; 1349 A; 650 C; 614 G; 1242 T; 3 other; ttaaagatgg cagcctaagt tttgtataaa atattataac tttaactttt tgcccaaatt 60 gaaaatattt attcaattaa cgttaataaa aatggtaaga aaagaagact ttgaagctct 120 agtaagtaga gtattaaaat tggaagcaga acttttaaca aaggataaag aaattactca 180 actcaaaata ttagtgaatg atctgaagaa ccaaccaagc aacgtactaa attcgagtgc 240 ttgtgcatgg tccgatgtag taaaaggtaa aaagagtaac aaagcagccc aactggattt 300 agtcaatgtw gttgttaaag aatctcaaga tagagaccta aggcaaaaaa acttaatagt 360 ctttggattc aatacctcta aaaacaaaaa tgatgatgta gccagagaag aagatatatt 420 agggatacaa aatatattta atcagattaa tgctaaagca actatccaac gtgttttcaa 480 actaagtata aaatcagaca agccaccacc agtaattgtt gttttaacca acaaagaaga 540 aagaaatgct gtactaaaaa cagccaaaac cctacgaaat ttaccagaaa ctaataacat 600 atttattaat gcagacttga cagtggctga aagacttaaa atgaaagctt taagagaaga 660 acgtgctaaa ttgaatgaaa ctaataaaga tttacaatac tattacggaa ttagaaatga 720 aagagttgta aaactcataa aaagatagca attcatatca agtcaacaag atcagctgct 780 attttcaaaa aatctactct ttaaaaactc aaacaagttt attgaaaaga acaacaagag 840 gcaactaata aatttaaaaa accaagagct atttaaccat ccaaaaaaca attattgtgg 900 gaaatgggca cttactaaca ttcgcagtct aggtaataaa attgctgatt tccaacactt 960 tgtccattta cacagtcttg atatttgttg tgttatcgaa acttggttaa ataaaaatct 1020 acctgacagt ctattatgtc ctaggtttta cactgttctt cgctgtgaca gaagtagtaa 1080 aggtggtgga atagcaatat tttttcgcag ttttatacaa tttacaaaag ttgacatacc 1140 accagagttt catgagttag aaattgtatg tgctgattta aaccttaatg gtcagtcgtt 1200 cagagttatt ggctattatc gcagtggggg tttcggtaaa gtagctattg actatatgca 1260 gagtagtgta aaatgtttct ctattttgtg ttcgaccaaa aaaaatgtgg tcatactagg 1320 agattttaac ttacctgata tagattggca ctactatcat ggtccagata gtgttataca 1380 taactctttt ttacatttta taaacagata tagtctcact cagcatgttg atcaaccaac 1440 tcgagaaaat agcatcttag atcttatttt gagttcttcc tcttcattta tcagtaatct 1500 ctgtattgaa ccacctattg gtactagtga tcataatgta atcatattca ctcctaattt 1560 gagctgctta tataataaaa ctgaagttcc agtgcaacag ttcttctgtt ggaaaaatgc 1620 agattaccaa gttatcaata aatatctgaa atcggtcgat tggagtatta tctttcaaac 1680 atgttttaat attgaagctt gttggcatac tttttctact atactaaata acattattac 1740 acaacatrta cctaaagtgc gagtctgtga agagcataaa ttatcacata aaatacacta 1800 tccttattat gttaaaaggt tgataaacaa caaagcaatg gcatggaaaa gatcaaaagt 1860 aacaggtaaa tctaatgaca aagcagccta taaatcaatt gccgcacaat gttctgattc 1920 cattagaaag tatcatgcag caaaagaact taagatgata cgcaaagaca accttggcag 1980 tttctttaat tttgtcaatg gaaaactcaa aacctgtagt aaactcaatg acattcgtcg 2040 atccgataat accctttgtc atgataaaga agaaatatcc tgtattttta ataacttttt 2100 tgcaagtgtt tttacagtcg ataatggttt atcaccaaag tttacggatc gtgttaacgt 2160 acattcaact tgtctgtcat ctattaactt ctctccttct acagtcgaaa aagcacttag 2220 agcacttaaa cccaccacat ctactggtct agatggcatt cccaatgttt tcctaaagaa 2280 ttgtgccaat agtttatcga ttcccttaag ccatttattt gatacctcat ttaaagatgg 2340 tgctataccg ctagactgga aagtagcaag cgttattcca attcacaaaa aaggaccaac 2400 agttgaccca ggtaattata gacctatctc cttgacatct acttgttgtc gtgtaatgga 2460 aagaatcatt aataacaaac tgattaatta tctcttactc aataacctaa tcacaaaaga 2520 gcagcatggt tttattcgta aacgtagtac ctgtactaat attctagaaa gtcttcatga 2580 ctggacttta aatcttcagc gccgtattcc aacagatatt atttattttg attttaagaa 2640 agctttcgat tcagtttcac atcctaagtt actaatgaaa ttacctgctt acggaatagt 2700 tggcaatctt ctgaattggc taactaattt tctgcacatg agatcacagg tggtaaattt 2760 aaacaatgtt acttcccatg ctatatctgt aacaagcggt gttccacagg ggagtgttct 2820 tggacctact ctattcttat tgtttataaa tgatgtgagc cacattaata aacagcttga 2880 tgtcaaactt aaactttttg ctgatgacat taaattatat tcagtctatg atgtacgtgg 2940 ccttcagtca gaccttcaca cagctgttaa tygtttgtac gaatggagtt gtacttggca 3000 actccaaata gcaactgaaa aatcctttgt ctgtactatt gctaaccaga gtcaaaacat 3060 agcccagcgt gtttacacca taaataacca tgaacttgcc catgttaact gtataagaga 3120 tcttggagtt accattgata gaaatctcaa attcaatcaa catatcaatc ttatcgtaca 3180 taaagctatg tcaagagctt ttcttatttt aaaatctttt tattcacgag atagattgct 3240 gatggttaaa gcctactgta cttatgtacg acctatgctt gaatattgct cttcagtatg 3300 gtcaccttac actatatgcc tgattaataa agttgagaaa gtgcagcgtt tcttcactaa 3360 aagaattgct ggtctgtggt cagtttccta tgataagcgt ctagccatac taaagcttaa 3420 cacgttagaa tatcgacgag tgtttaaaga tttagtttta tgctacaaaa ttcttaacgg 3480 taaactggac actgacctat caaacattct tattctcaat tcttacttga ataatcgcgg 3540 tcataacttt aaactgtaca aacaccattg tagtttaaac agcacaaagt gtttcttttc 3600 taatcgaatt gttaatgtat ggaatagttt atcggaggat gtggtgtctg catcgtctgt 3660 cttgatgttt aagaaacgtc ttgctgtact gctttgttca agttaataca attttgttgt 3720 aacttgtcca gcacttgata caatttaaca tttcagtaaa aaaaaaaaaa tgtttttgtt 3780 aatgtatact cagtgttagt ggccttcggg tccttctgtt gttttcaaat aaataaataa 3840 ataaataaat aaatatat 3858 // ID Gypsy-30_DPu-I repbase; DNA; INV; 4250 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_DP_; KW Gypsy-30_DPu-LTR; Gypsy-30_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4250 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [2680-3174] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1633..2619,2623..3834) FT /product="Gypsy-30_DPu-I_1p" FT /translation="MVTDGIIAPVVEPTSWVSRMMVVGKPDGDVRICLDPS FT ELNKAIQRQHFSVPTVEQLFAKIGKAKYFCSLDAASGFYQIPLTEEASYLC FT TMATPKGRFRYLRLPFGLKSAPEVYLQVMSDLFGDLPGVIIYFDDFLVTGE FT TKAELELNLRQVLLRCREKNLKLQLKKCRFFLQQLPWLGHVIGHGTLRPDP FT EKVDAIVNMPDPTDKQSLQRLLGMVTYLDKFCRDLATLTRPLRDILQKDAA FT WCWDEQQRKAMTTLKTVLSSLPVLRLFDVSKPVLVSVDASPVGLGAVLVQD FT GQPVAFSSTTLTATQRRYCQIEKELLAIQFGLLRFRQVYGQRVIVESTRLP FT LFPVRIPDHPFQLVSADLFEFSQVSYLLLVDSYSKWPCVVAMKSTTSAALI FT GEVTRFFCDFGRPEVLESDNGTQFSSAEFREFCRAMGVQQVSSSPEFAQSN FT GLVERHIQTVKRTLLKMFADGKSLWESLAAIRSTPVSPTLPAPSVLLQGRN FT LRGNLPFLPAELQPKYVSSSVVRRELSTRQSQAAFVQAASPNTRASSLRIG FT QPVRALIGRRWLHGKVHSVCPEPQSYVVRLDDGRYFRRTRWAINVDRGTQS FT QPRSRDSTAMFSYRVPTIVQSAQSATGCETSGPASGVAGSPVIEAPTPVRV FT GVPSSAEPATPSGRQSSNIGRVRTAASHGARQVRLFDAPLSPAVAPPPMSA FT GPSGDSGAAPAREPFGTTRSGVRFGIAMPSTRK" XX SQ Sequence 4250 BP; 887 A; 1062 C; 1123 G; 1177 T; 1 other; tggtgtcaga agtggtgcgc gtacgcctgt agtcactcgt tttcacacgt ggtctgtcgt 60 ttatagccag ttgtatcaga gccatccgtt attggttcgt gtttatttcg tgcctgatca 120 ttctttgatg aattacgatg gcagtctcgt tgaagtttcc agacccattt aacttttcag 180 catctaattt ggctttggag tgggaagaat ggcggactca atttgagtgg ttcatcacag 240 ccactcgcaa agccaagaaa gatgaagacg ttctggttgg tgttttattg tccctattgg 300 ggagagaggg tctaaagatc tatggaacgt ttgtctttgc tacgccgggt gatgacaaaa 360 agatcaaccc tgtacttgac cacttctctg attactttgc gcccctcaaa agtgaagttt 420 ttgaccgttt ccgctttcat aagcgtcacc aaaagcccgg tgaacctttc gatgcgtggt 480 tggtggagtt gcgtagcatg gtgaagtcgt gcaactatgg cactgaagcg gtggtcaact 540 ctatcctacg ggaccagatt gtgttgggag tagcgagcga ccctgttcgg gagaagctgt 600 tatatgaaac taatttgtct ctcgccggtg cctgtkgaat cgttcgtgca tgtgaggcgt 660 cagcaattca gctcaaccaa attttgtccc cacctgaggc tgttcatgct gttaaagaca 720 agcccagtca ggaacgccac caacatcgac agcacaaacc cagtgggcag cagcactctg 780 gttttggcgg atgtcccaat tgtggccgta agcatgcgaa aggtaattgt tcggcgacca 840 acatcacgtg ataccaatgt ggcaagaccg gtcattatgc ccggtggtgc tctcaagctg 900 agaatctgcc ctatcgcccg agttccactg cttctcgtca gatggcgcaa ggtgccaccc 960 aatcggagac ggctgttcgt cagaggccgg ctcaacgagg aacgtatatg cagcagcggt 1020 tgcatgctgt ggaattggaa ggagccttgc cacaggaaga tgtccgttac ctggatgagg 1080 aatatgtcac acatgaactt aaaagttccg aagacggaga agagtggcat gaagtctctc 1140 ggtagatggc agcagcccaa ttcgattcaa gcttgactct ggcgccactt gcaacgtact 1200 tccgctggag gattttcgcc gtgttcaaag agcagtggtg ttgagtgctg gcccccgcgt 1260 gaggaactat ggcgcgaaag gtggttacct caaagtcatg ggcgtgtttg tcggctctgt 1320 gaccagtcgt ggtgctcaac atacggtgaa atttgtggta gtggatgaac ctggccagcc 1380 gccaatattg ggccttccaa catgtatagc aatgaaactt atcaagcgtg tccatgctga 1440 gacagtcgat caaccagtgg agctgccagt cgtcgcgaag gagtttttgg atgtgtttca 1500 aggaatcgga aagcttccgg ttcaatacga catcaagcta gctaccggtg acaattttgt 1560 ggacccggtt gtgtgtgcgg caggtcggtt gctgtttctg cttgaggaca aagtatatgc 1620 caactagcac agatggtgac tgacgggatc atcgctccgg tggtagaacc aacatcgtgg 1680 gtgagtcgaa tgatggtcgt aggtaaacct gatggggatg tgcggatctg cctagatccg 1740 tcggagttaa acaaggcgat ccagcggcag catttttcgg tgcctacggt ggagcagctc 1800 tttgccaaga ttggcaaagc caagtatttt tgcagtttgg atgcagcgtc tgggttctat 1860 cagataccgc tgaccgagga ggcctcctac ctgtgtacca tggctactcc aaaaggacga 1920 ttccgatatc ttcgactgcc gtttggccta aaatccgcac cggaagtata cctgcaggtg 1980 atgtcggatc tcttcggcga tcttcccggt gtgatcatct actttgatga ttttttagtc 2040 acgggtgaaa ctaaagcgga actggaactg aacctgagac aggtattgct gcggtgccgt 2100 gaaaaaaatt taaagcttca actcaagaaa tgtcgtttct tcttgcagca gttgccgtgg 2160 ctgggccacg tcattggtca tgggactctt cgacccgacc ccgaaaaagt ggatgccata 2220 gtcaacatgc ctgacccaac agacaagcag agtctccagc gtcttctggg aatggtgact 2280 tatttagata aattctgtag agatttagca actctaacac gtccccttcg tgacatcctt 2340 cagaaggacg ctgcttggtg ctgggatgag cagcagagga aggccatgac aactctgaaa 2400 accgtcctgt cgtctttacc ggtgctacga ttgtttgacg tttccaagcc agtgttagtg 2460 tcagtggatg cctctccagt aggactcgga gctgtgttgg tccaggatgg gcaaccggtg 2520 gcattctcct ccaccacctt gaccgcaacg caacgccgat actgccaaat cgaaaaggag 2580 ctgctggcaa ttcaatttgg gctactccgg ttccgtcaat aagtttatgg ccagcgtgtc 2640 atagtggagt cgacgcgctt gcccctgttt ccggtccgca taccggatca tccgttccag 2700 cttgtgtctg ctgatttatt tgaattttct caggtgtctt atttgctgct ggtggattcc 2760 tacagcaaat ggccctgtgt agtagcaatg aaatcgacta cctcggcggc gctcatcggt 2820 gaagtgactc gcttcttttg tgattttggc cgaccggaag tgctggaatc ggataatggt 2880 acgcagttca gcagcgccga gttcagagag ttttgccgtg ctatgggtgt gcaacaagtt 2940 tcgtccagcc ctgaatttgc ccaatccaat ggcttagttg aacgccacat tcaaactgtg 3000 aaacgtacgc tgttgaagat gtttgccgat ggaaagtctc tttgggagtc tttggcggcc 3060 attcgctcca ctccggtgtc acccactctc ccagccccgt cagttttgtt gcagggccgt 3120 aatctacgcg ggaacctgcc gttccttccg gccgagttac aacctaagta tgtctcgtcg 3180 tcggtcgtca ggagggagct atcaacgcgc caatctcagg cagcgtttgt tcaggcagca 3240 tcgcctaaca cgcgtgcctc gtcgcttcgt atcggtcagc ccgtccgagc cctcataggc 3300 cgtcgttggc tccatggaaa agttcactct gtctgtcctg agccacaatc ttacgtggtt 3360 cgtttggatg atggccgtta ttttcggcgt acgcggtggg ccatcaacgt cgaccgtggg 3420 acacagtcac agccccgttc ccgggattct actgccatgt tctcgtatcg agtcccaact 3480 attgtccagt cagcccagtc ggccactgga tgtgagacgt ctggtccggc atctggtgtg 3540 gccggtagcc cggtaatcga agctcctact cctgtccggg taggggtgcc gtcaagtgcc 3600 gagccagcca ccccgtccgg ccggcagtcg tcaaacatcg gtcgcgtacg cactgcagct 3660 agtcatggcg cccggcaggt tcgtctattt gatgcgccgt tatcccctgc cgtggctcca 3720 cctcctatgt ccgctgggcc ttccggtgac tccggtgctg cgccggcgag ggagccgttt 3780 ggaacaactc ggtccggtgt ccgcttcggc atagccatgc cctctactag gaagtgagac 3840 tgtcgtcgta atttttcgga tccgggtggc cgtctcctag atgtctgatg tctttgttct 3900 tttcttgtcc gttttgtttc caagtttgtt tccactgata ttgagattgg tcttcattag 3960 gtcgtctggt gcaaacccgt ctgtcacttt tgtttgattt caacctctct catttcttag 4020 tcataacgct catcattcca tcttcgctat cacattcata tcatatttca caatttcata 4080 ttttgcttgt ccttcatctc atcaagtcat atttcatcgt cctgtttact gtggctttca 4140 gttcattttg tttcgctttc ggtctttgta atttcgtttt ccttttctcc tgatccttct 4200 ttcatttgtt tctcgcgttc gctcactctt agtttgtgag agaagggaga 4250 // ID TABOR_DA-I repbase; DNA; INV; 6388 BP. XX AC . XX DT 13-OCT-2005 (Rel. 10.1, Created) DT 05-MAR-2011 (Rel. 10.1, Last updated, Version 2) XX DE TABOR_DA, an endogenous retroviral element from Drosophila DE ananassae. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; integrase; TABOR_DA-I. XX NM TABOR_DA-I. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-6388 RA Gentles A., Kapitonov V.V. and Jurka J.; RT "TABOR_DA, an endogenous retrovirus from Drosophila ananassae."; RL Repbase Reports 5(10), 339-339 (2005). XX DR [1] (Consensus) XX CC 451 bp LTRs deposited as TABOR_DA-LTR. This internal sequence has CC two overlapping ORFs. The first is weakly similar to spumavirus CC gag protein. The second putatively encodes reverse transcriptase CC and integrase functions and contains a single domain related to CC retroviral aspartyl protease. 4 bp target site duplications. XX FH Key Location/Qualifiers FT CDS 1242..2576 FT /product="TABOR_DA-I_1p" FT /translation="MEWNKLYTELSNIKTNFDRSYKSLTQNRPIQTNTVRK FT HVEILVECFNEARMLIHKHRERLNQDHWSQVSKLLIRLRSNLIFVKQKYSL FT EISIPTVLNTPIAVDTAPDSESTEANESESEEIPKVNLKIEDEDLNNLTIP FT AVYTDLDNSADSDTSSVIENKIINFVTMAQSNIDFINTASKLIPVFDGKAE FT NLTSFIDALQIIESIKGEHEELAVSIIKTKLKGVARNLIGNETTITEVISK FT LRGNVKGETVEVLSAKLMNLQQRNKTANQYTQEIEQMTKALEGAFITDGLP FT LELARSYSTQHAVKAMTKNCTIDKVKLIMQAGTFSTMNDAVSKFVNSCTEA FT TGQSNTVLYFKNYPQRGSYRGRGNYRGNSRGNYNNYNNNYQNNNNNYNNQN FT NRGGRGQYRGNYRGGGNSYQGPNGNNQSNVRIAQNTDQNTSGNSQSPLNTQ FT " FT CDS 2534..6199 FT /product="TABOR_DA-I_2p" FT /translation="PKHFGKLPKPFKYSIVKEVRVHTINLSQNIFVSFFNV FT ATGKELIFLIDTGADISILKETSDTFQNIQDDHIIDIKGISQEITKSKGLV FT SLEIQTTKYIIPHDFHILNSHFPIPCDGIIGIDFIKKFNCQLDFKPSEDWL FT IIRPNNLKFPINVPITHSSGNNSILLPARSQVIRKIELNLKEDSVFIPNQE FT IQHGIYIANTIATSNNAFIRLLNTTNTNQVVSTDNLTYEPLSNYDVVNTNS FT DNRKKSVLSQLSKNFPPQFKSQLSSLCTQYSDIFGLETESITTNNFYKQKL FT RLKDDEPVYIKNYRSPHSQVNEIQKQVGKLISDGIVEPSVSEYNSPLLLVP FT KKSSPGSNVKKWRLVIDYRQINKKLLSDKFPLPRIDDILDQLGRAKYFSCL FT DLMSGFHQIELEKGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQRMMTIA FT FSGLEPSQAFLYMDDLIVIGCSEKHMLKNLTDVFEKCRRYNLKLHPEKCSF FT FMHEVTFLGHKCTDKGILPDDKKYDVIMNYPVPHDADSARRFVAFCNYYRR FT FIKNFADYSRHITRLCKKNVPFKWTDECENAFQYLKTKLMEPTLLQYPDFT FT KEFCIITDASKQACGAVLTQNHNGLQLPIAYASRAFTKGESNKCTTEQELA FT AIHWAILHFRPYIYGKHFLIKTDHRPLIYLFSMINPSSKLTRMRLELEEYD FT FTVEYLKGKDNYVADALSRITIKDLQNITRNILKVTTRYQSRQKFRAENQK FT QELPMQSLNKASKPNVYDVINNDEVRKVVTLQIKNSLCLFKQGRKITARYD FT VSDLYTNGVIDLDQFFQRLEMQAGIHKISQLKVAPWENIFEYTSVDNFKLM FT GNKILKTLRVALLNPVTKITNLKEKEAILSTFHDDPIQGGHTGISKTLAKV FT KRHYYWKGMSKDITEYVRKCQKCQKAKITKHIKTPLTITDTPINAFDRVIV FT DTIGPLPKSENGNEYAVTLICDLTKYLVAIPVPNKNANTVAKAIFESFILK FT YGPMKTFITDMGTEYKNSIIDDLCKYLKIENITSTAHHHQTVGTIERSHRT FT FNEYIRSYISVDKTDWDVWLQYFVFCFNTTPSMAHTYCPYELVFGKTSNLP FT KHFNSIDSIEALYNIDDYAKESKYRLEVAYKRARVMLETNKNRNKILYDHK FT VRDIDISVGDQVLLKNETGHKLDPKYTGPYIVTEIGDRDNIIITNNKHRKQ FT TVHKDRLKIFIS" XX SQ Sequence 6388 BP; 2499 A; 1150 C; 1057 G; 1682 T; 0 other; tggcgaccgt gacatgagtc ctggaggact taggaagata ttcggaacca aggctaaaag 60 gaatcaaagg attgaaaatg caaaaggaaa aatacatcat acgagaaaaa cgtctgagtg 120 agtagagtgt gcacaatggt acgtggaggt ctaaagaaaa aaatcaacaa aagaatagcc 180 cgcataaagt ggctactatt tgctcaatca actccgcccc catggacgac ggcgcaaaag 240 ctatacaacg agcttgaatt cttaagagaa accctccaaa acaaagtacc acagagcaca 300 actcgctcgg aaaatcaaat taaagttata aaccaaaaac taatatcgac acaaaatatt 360 tccaagcctc tgccaaaacg accacaggtc tgcccgaccg actgccccgg acctctcaac 420 tctgcagcat gcaattgcac atgcgtgaag ccgaagaata aagcgaagac caacccaaaa 480 caatagcaag cgtgggaggc acgaccgcca cctgcgccct cgagggagcg tcagcgcagc 540 caaaccggcg acgaccagct gacgtggaag gaaggaccac caaacctggg aaaaaatttt 600 tttttttcat gcactgcgct gcatattttt tttgattgca ctgcgatgca tatttttttt 660 atattcataa tataatattt tgtaaaattg agcggaacct tttctgctcg attagacgtg 720 tagcctgtaa agctacgcaa aaaactaggt aggaagctgt taacttaaat gcgcagtaat 780 gcgataagta aatttccaac agaagatctt tataatttta atgaaacaca aataaaagta 840 tatgcaaaac ccaccaaaat gacgttggac ataaaaacag tggattccac tgccaatgtc 900 atcaacaagg tagaatctac caacgttgat ctatattggg tacatgtact tttagtaatc 960 attgtcgtta ttatgtgcgc cgctgttttt tataagattt acaagctgca taataattgt 1020 ctgaaaaaaa ggtatatgag ccggggaaat gacctggata aaatataagc agcaaataaa 1080 taaaaataag gaaaaaatat atatagcaca aacaacagat tacacataaa tgttaacacc 1140 taaaaatcta agaaaatgaa ataaaaaaaa aaataaaata aaataataaa tataaaatat 1200 gtcaatatat tcctctcaca aagtacggcc aaactaaagg catggaatgg aacaaattat 1260 acacagaatt atcaaacatt aaaactaatt ttgacagatc atataagtca ctaacccaga 1320 atagacctat ccaaacgaac actgttagga agcatgttga aatattagta gaatgcttta 1380 atgaagcacg aatgctaata cataaacaca gggaaaggtt aaatcaagat cattggtcgc 1440 aggtatcgaa acttttgatc cgactacgat ctaatttaat atttgttaaa caaaagtaca 1500 gtttagaaat atcgatacca accgttctaa acaccccaat agcagttgac actgcacctg 1560 actcagaatc aacagaagcg aatgaatcag agtcagaaga gattcccaaa gtaaacctca 1620 aaatagagga tgaagatttg aataatctta ctattccagc ggtatacact gatctggaca 1680 attcagcaga ttcagatacc tctagtgtaa tagaaaacaa aataattaac tttgtaacaa 1740 tggcacaatc aaatattgat tttattaata cagcatccaa gcttatacca gtttttgatg 1800 gtaaggctga aaatttaacg agtttcattg atgctttaca aataatagaa tcgataaaag 1860 gcgagcacga agaattagct gtctccataa taaaaacaaa gctaaaaggt gtcgccagaa 1920 atctaatcgg taatgagaca acaattactg aagtgatttc taaactaaga ggcaacgtca 1980 agggcgaaac tgtagaagta ttatcagcga agctgatgaa cctacagcag cgcaataaaa 2040 ctgctaacca atatacccaa gaaattgagc agatgacgaa agcattagaa ggtgcattta 2100 taacggatgg tttaccttta gagctagcca gaagttattc cacgcagcat gcggtcaaag 2160 caatgactaa aaattgcaca atcgataagg tgaaacttat catgcaagct ggcaccttta 2220 gtactatgaa cgatgctgtt tccaaatttg tcaatagttg caccgaagca actggacagt 2280 caaatactgt cctatatttc aagaattatc cacagcgcgg ctcatatcgc ggacgtggaa 2340 attatagagg taattctcgt ggtaattaca ataactacaa caacaattac caaaataaca 2400 acaataacta caacaaccaa aacaacaggg gaggacgagg ccaatataga ggaaactata 2460 gaggcggtgg aaactcgtac caaggcccca atggcaataa tcaaagcaat gtcagaattg 2520 cacaaaatac tgaccaaaac acttcgggaa actcccaaag ccctttaaat actcaatagt 2580 aaaagaagtc agagttcaca ccattaatct cagtcaaaat atattcgttt catttttcaa 2640 tgtagctact ggaaaagagc ttatcttttt aatagataca ggtgcagaca tttccatttt 2700 aaaagaaact tcggatacct tccaaaacat ccaagatgat cacatcatag acataaaagg 2760 cataagtcaa gaaataacca aatccaaagg tttagtttct ttggaaatac agacaactaa 2820 atacataatt ccacacgatt ttcatattct aaattcacat ttcccaattc catgcgatgg 2880 aataataggc atcgacttta taaaaaaatt caactgtcaa ttagacttca agccatctga 2940 agattggctt ataattagac ccaataactt aaaatttcca attaatgtac caataactca 3000 tagttctggc aataattcaa tacttctgcc agcaagatcc caagttattc ggaaaattga 3060 gcttaactta aaagaagaca gtgtatttat tccaaatcag gaaattcagc atggcattta 3120 tattgcaaat accatagcaa catctaataa tgccttcata agactactaa acacaacaaa 3180 taccaaccaa gtagtcagta ctgataacct aacatatgaa ccactttcga actacgatgt 3240 agtcaataca aattcagaca ataggaaaaa atccgtacta tcacaacttt caaaaaactt 3300 ccctccacag tttaagtcac agctttctag cttatgcacc cagtatagcg atatattcgg 3360 attggaaacc gaatcaataa ccaccaataa tttttataaa cagaaattga gattaaaaga 3420 tgatgaaccc gtatatataa aaaactatcg aagcccacat agtcaggtga acgaaattca 3480 aaagcaagta ggtaaattaa tctctgacgg gattgtcgaa ccgtcagtat ctgaatacaa 3540 tagcccattg ttacttgttc ctaagaaatc atcccccggg tcaaatgtta agaaatggcg 3600 attagtaatt gactaccgcc agataaataa aaaattacta tctgataaat tcccattacc 3660 tagaattgat gacattttag atcaactagg tcgagcaaaa tacttttcat gtcttgactt 3720 aatgtcagga ttccatcaaa ttgaactcga aaaaggctca agagatataa cgtccttttc 3780 aacgagcaac ggctcatatc gtttcacgcg attacctttt ggattaaaaa tagcccctaa 3840 ttcgtttcaa agaatgatga ctattgcgtt ctctggtcta gaaccttctc aagcattcct 3900 ttatatggat gacttaatag taattggctg ttccgaaaag catatgctta agaacttaac 3960 agatgttttt gagaaatgta gaagatacaa cctaaagtta catccagaaa aatgttcatt 4020 ttttatgcat gaagtgactt ttctaggtca taaatgcact gataaaggaa tacttccaga 4080 tgacaagaaa tacgatgtca ttatgaacta cccagttccg cacgacgcgg acagtgctag 4140 acgtttcgta gcattttgta actattatag acgttttata aaaaatttcg ccgactattc 4200 acggcacata acaagattat gtaaaaagaa tgttcctttt aaatggacgg atgagtgtga 4260 aaatgcattt caatacttaa aaacaaaact tatggaacct accttgctac aatatccaga 4320 tttcactaaa gaattctgta taataacaga tgcaagtaag caagcatgtg gagcggtttt 4380 aactcaaaac cataatggac tccaacttcc cattgcatac gcatcaagag catttacaaa 4440 aggtgaaagt aacaaatgta ctactgagca ggaattggct gcaattcatt gggcgatatt 4500 acatttcaga ccatatatat atggaaagca tttcttaata aaaacagacc acagaccttt 4560 aatatacctc ttctcaatga tcaatcccag ttctaaactg acacgtatga ggctcgaact 4620 agaagaatac gattttacag tcgaatattt aaagggaaaa gataattatg tagctgatgc 4680 tttatcaagg ataaccatca aagatctaca aaatataact agaaatatat tgaaagtcac 4740 taccagatat caaagtagac aaaaattccg cgcggaaaat caaaaacaag aattgcctat 4800 gcaatcttta aataaagctt ctaagcccaa cgtatacgac gtcataaaca atgatgaagt 4860 acgtaaagta gtgaccttgc agataaaaaa ttcgttatgt ttatttaaac aaggcagaaa 4920 aattactgca agatatgatg ttagcgattt atatactaat ggagttattg acttagatca 4980 atttttccaa aggcttgaaa tgcaagccgg tatacataaa atcagccaac tcaaagtggc 5040 accgtgggaa aatatctttg aatatacttc agtagataat tttaaattaa tgggcaataa 5100 aatcttgaaa acattaagag tagcgctact caacccggtg accaaaataa caaatttaaa 5160 agaaaaagaa gcaattttgt ctacatttca tgatgatcca attcaaggag gtcatactgg 5220 catatcaaaa actttggcca aagtaaaaag acactattac tggaaaggca tgtctaaaga 5280 tataactgag tacgtaagaa aatgtcaaaa atgccagaaa gccaaaataa ctaaacacat 5340 aaagactcct ttaacaatta cagacacacc aataaacgct ttcgacagag tgatagtgga 5400 caccattggt ccattgccaa aatcagaaaa tggcaatgag tatgcagtca ctttaatatg 5460 tgatttaact aagtatttag ttgcaatacc tgttccaaat aaaaatgcta atactgtcgc 5520 taaagcaata tttgaatctt ttattctgaa gtacggtcca atgaagacgt tcattacgga 5580 catgggaaca gaatataaga attcaattat agacgattta tgcaaatatt taaagattga 5640 aaatattaca tccacagcac accaccacca gacagttgga acaatagaaa gaagtcatag 5700 aactttcaat gagtacatac gatcttacat atcggttgat aaaactgatt gggacgtatg 5760 gcttcaatat tttgtctttt gtttcaacac gaccccatct atggcacaca cttattgtcc 5820 atatgagcta gtattcggta aaacaagtaa tttaccgaaa cattttaata gcatagatag 5880 tatagaagca ctatataata tagatgacta tgctaaggag agtaagtata gattagaagt 5940 agcctataaa agagctagag ttatgttaga aacaaataag aatagaaata aaattctata 6000 tgatcataaa gtaagggata tagatatatc agtaggtgac caagttttat taaaaaacga 6060 gacaggtcat aagttagatc ctaaatatac aggtccgtat atagtaacgg aaataggaga 6120 tagagacaac attataataa caaataataa acataggaaa caaacagttc ataaggatag 6180 gttaaaaatc tttatttcat aaattgtact tagtatggcc atacaaagac acacctatat 6240 aaataaataa ataaacatat atttttttta taatttcatt ttcttcaatt ctaataaaat 6300 aaaaaaaaaa aagggaatgt attaattaga aatttttatt ataaaaataa cttcattata 6360 ttacgttatt tttcaaaagg agggagat 6388 // ID DNA8-84_AP repbase; DNA; INV; 563 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-84_AP. XX NM DNA8-84_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-563 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2020-2020 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 563 BP; 183 A; 58 C; 82 G; 238 T; 2 other; ccagggctcg taagttcatg catttgcata taaatcttgc taagttgtat ttaatgctag 60 atgacataga aatccatttc aaggtaccac ggttaaatat gtgaagtctt attttgcatg 120 tttttgcata tttagaggtt ttttgcctat ttggcatatt tatgcgtatt tctattttat 180 cgggcatatt atncatgatt ttgatattcc tatattgata ttattatatt aatatgcatt 240 tttttatgga aataatctat gaaaaaatca aaatttgacg aaaaaaaatg cgataattta 300 taatctattt tttttaattg tgacataaaa aaataaaata taattataaa aaaatgaatg 360 gataattctt gttatttcaa aaattgttta tgattttttt aagtaagtat tttttaagtg 420 catagtttta gatttttagt gcatataaat gcatattttt ntaacttttt aaggcatatt 480 tcaatgcata ttttgccaat ttttgagtgc ataaatgcat gcttatttag gcatttttta 540 gggcatgaac ttacgagccc tgg 563 // ID Gypsy-88_AA-LTR repbase; DNA; INV; 209 BP. XX AC supercont1.20; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-88_AA_; KW Gypsy-88_AA-I; Gypsy-88_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.20; Positions 2603872 2604080. XX SQ Sequence 209 BP; 68 A; 35 C; 36 G; 70 T; 0 other; tgtagtatcg gtttacaaat gttttataaa ccttgattat taacaatctt gaaatcagac 60 atacctttat aaatgcaacc ttgttatacc atacctgatg ttgtagagta tcatacctga 120 acggaaataa atgcagttga attctgacta tcgttgagag tacacgagcg tattatttac 180 tgtagatccg agattgacgc gactttaca 209 // ID Proto2-7_CS1 repbase; DNA; INV; 4258 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-7_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-7_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4258 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1562-1562 (2009). XX DR [1] (Consensus) XX CC Proto2-7_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1_SK) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in Proto2 CC elements from all species mentioned above. ORF2 codes for a CC protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 279..1322 FT /product="Proto2-7_CS1_1p" FT /note="ORF1." FT /translation="MRPTTTNEPDSIYVSKDDKIVIDGLLCYMVHKLHVLN FT QDALEKVIVSGYSNDAIRASKEKLFKVCPGQRYVGHTGEAAATKHVRDMIK FT LLQERGTAIPTFVSDSGSLSAMPPVTFDSMDVVSLLRRIQSTQEEVSAMKA FT LMASQSETMGCVTKLTQDLSGRVGNMEAIYVRSEDAPMQDNATEVMQSNEE FT LKTAEDATAMKQVALDPLPATYTQVLKRRPRKKKVVPALVDDGQQVAPAQK FT QRQAKKSVVGTAACELRTVKTKLIRVFVTGMVHETKPAEMEKMLKEKMNNN FT TVKCELIKKGERSSSFCVSAECLSADGIYDPTLWPEGSYVRRYYSDRKTSQ FT ANSRS" FT CDS 1382..4141 FT /product="Proto2-7_CS1_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MPIEKMLAEECDVLCLQETWLTKQDLGGLSDLHPGYV FT GVGEATTDLNSGLLRGRVGGCVAIMWRSCHGHLISEVRLGVDWAIGIEYRS FT ADHHFYIINIYAPYECRDNEPLYLERMAYLSAFIDNLDSTCIYLIGDFNAN FT VLNANSSFAKHLNDFCSDNGYIFSSKMLLPRDTYTYISEAWHTESWLDHVL FT STADAHNSIENMSIVYTLSSCDHIPLCLTLKVAALPVLTSLPNSLVNSRID FT WTSMSDDAIRQYCRDTGMGLTQVYIPHDALLCNNAMCNEESHQKDLQDLYA FT AIVSCIKSASSSLLTRGTRSCRPGWNMHVNELHGLARDAYVEWKQSGSSRQ FT GHLFELMKATRARFKYALRFIKRNESAMRADSLAMKLTCNNDREFWKEVKM FT MNNSRTPLPNNIDGYTGSTEVCELWRQHYSSLFNCIHDNGVYLSPDVMFDN FT DMIIRPNEVQRAVAKLVDNKSCGLDGITAEHLKYADVSIVPLLAMCFTGFF FT VHGFLPEDMLSVVIVPVIKDKAGKINSRDNYRPIALASIMSKLIERILLER FT AEDTFMTVDNQFGFKPKHGTDMCIYALKEVLHTYNALNSTVFLSFLDASKA FT FDRVSHGKLFRKLEQRGVSSYIIRLLVFWYSQQTMVIRWGDTMSTSFHVSN FT GVRQGGILSPFLFNVYIDGLSINLNECPTGCYAGSLKINHLMYADDLVVLS FT PYSAGLQQLLNICTEYGAEFDIKYNAVKSNVMIVQCKGNTQTVFPDFYLCN FT EIMNVCHDTKYLGHIICDDLSDDADINRQRRKLYVQGNTLARKFFMCTPDV FT KVQLFRTYCSSMYTSYLWCHYKVASIRKLYIAYNDALRILLRVPRYLSATQ FT MFVEVNVPGPLPALRNIMYRFICRLNKSSNSIISALVNTTLSAVSLNSKIW FT RFWNKCLSFSSNMILY" XX SQ Sequence 4258 BP; 1189 A; 852 C; 948 G; 1269 T; 0 other; tgtctccatg gcaacgcaca ggaaggaagc aaattttgaa tcttttctta ctttttatac 60 cttaaagtta ttttacgtca gctacgtcat gtatggatat gaataattgt ggccattgga 120 ctactcatac tacgtagttt cttgtgatat atttttgtga gtattcgagt gaaaatgaag 180 atgtcagagg gagaagaagt ttgtgttacc ggtaacagtt ctttttgaat attcttgctg 240 agtgcatcca caatcgttca attgtacaag agctgatcat gagaccaaca acaactaatg 300 aacctgactc catctacgtc tccaaggatg acaagatcgt tattgatggt ctcttgtgtt 360 acatggtaca taaactacat gtgcttaatc aggacgccct agaaaaagtt attgtgtcag 420 gctactctaa cgatgctatt cgtgcatcga aagagaagct gttcaaggta tgccctggac 480 aacgttatgt tggacacact ggtgaagctg cggcgaccaa gcatgtcaga gacatgatta 540 agcttctgca ggagagaggt acggccatcc ccacctttgt gagtgattct ggttccctct 600 cagctatgcc acctgtcaca ttcgacagca tggatgtagt ctcactcctt cgacgcatcc 660 aaagtacaca agaggaagtt tccgcgatga aggcactgat ggcatcgcaa agtgagacaa 720 tgggctgcgt caccaagctc actcaagacc tctctggacg ggtgggaaat atggaagcga 780 tttatgtccg ctcagaagac gcccctatgc aggataatgc aacagaagta atgcagtcaa 840 atgaagaact gaagactgca gaagatgcaa cagcaatgaa gcaggtggcc cttgatccct 900 tgcccgcaac ctacacccag gtgctgaaac gcaggcctag gaagaagaag gttgttccgg 960 cattagttga tgatggtcag caggttgccc ctgctcaaaa gcaacgacaa gcaaaaaaaa 1020 gtgtcgttgg cactgcagcc tgtgaactca ggactgtgaa gaccaaactc atcagagtct 1080 tcgtcacagg tatggttcac gaaaccaagc ccgcagagat ggagaaaatg ctaaaagaga 1140 aaatgaacaa caacactgtc aaatgcgaac tcattaagaa gggggagcgt tccagcagct 1200 tctgcgtctc tgccgaatgt ctgtccgcag atgggatcta cgatccgaca ctatggcccg 1260 agggcagcta cgtgcggcgt tactactctg accgcaagac atcgcaagca aattcaaggt 1320 cctgagcttc aacacccgcg gcctcaccct cgggaagagt gctgcagcag caagccgccg 1380 catgcctatt gagaaaatgc tagctgaaga gtgtgacgta ctgtgcctcc aagagacttg 1440 gttgaccaag caggatcttg gtggcttgag tgacttacac ccgggatatg taggcgttgg 1500 tgaagcaaca acagacctta acagtggact actgcgcgga cgcgtgggtg gttgtgttgc 1560 cattatgtgg cgatcctgcc acggtcacct tataagtgaa gtgcgtcttg gtgtggactg 1620 ggccattggt attgagtacc gctcagccga tcaccatttt tacatcatta atatatatgc 1680 tccttatgaa tgtcgtgaca atgagccgct ctatctggag agaatggcat atctgagtgc 1740 ctttattgat aacttagatt cgacatgtat atatcttatt ggtgatttta atgctaatgt 1800 cttgaatgct aactcaagct ttgccaaaca tttaaatgat ttttgttccg ataatggtta 1860 tatattcagc agcaaaatgt tgcttcctcg agatacatat acatatatca gcgaagcatg 1920 gcacactgaa tcatggttgg accatgtcct ttcaactgca gatgctcata acagcataga 1980 gaatatgtca attgtatata cactctcttc atgtgaccat ataccgttat gtctcaccct 2040 gaaggttgca gctttacctg tattgacctc attacctaac tctcttgtca actcacgtat 2100 cgattggaca tccatgtctg atgatgcaat taggcagtat tgtagggaca ctggaatggg 2160 actaacccaa gtatatatac ctcatgatgc gctgttatgt aacaatgcga tgtgtaatga 2220 agaatctcac cagaaagacc tacaagattt atatgcagct attgtatcct gtattaagtc 2280 tgctagcagt tccctgctta ctagagggac tcggtcttgt aggccaggtt ggaatatgca 2340 tgttaacgag cttcatggcc tagccagaga tgcatatgtg gagtggaagc aatctgggag 2400 ctccaggcag ggccacctct ttgagctgat gaaggcaact agagcccgct ttaaatatgc 2460 tctgcgtttc atcaagagga atgagtcagc tatgcgcgct gattcgttgg ctatgaaatt 2520 aacgtgtaat aatgacagag aattttggaa agaagttaag atgatgaata acagtagaac 2580 ccctctgccc aataacatag atggatatac aggttctact gaggtgtgtg agttatggag 2640 gcaacattat tcctctctct tcaattgtat ccatgataat ggggtgtatc tctcccctga 2700 tgttatgttt gataatgata tgataatcag gccgaatgag gtacaacgtg cagtcgctaa 2760 acttgtggat aacaagtcgt gtggcttaga tggtattact gcagagcatc tgaagtatgc 2820 tgatgtgtcc attgttccat tactagcaat gtgttttaca ggtttttttg ttcacggttt 2880 tcttccagaa gacatgctat ctgtagttat tgtacccgta atcaaagaca aggcaggtaa 2940 aataaatagt cgagacaatt atcgtcctat tgctttagcc agtattatgt ccaaattaat 3000 tgagcgtatc ttacttgaac gtgcagagga cacattcatg actgttgata atcagtttgg 3060 ttttaaacca aaacatggca ctgatatgtg tatctatgcc ttaaaagaag tactgcacac 3120 atataatgct ttgaacagta ctgtgtttct tagcttccta gatgcatcca aagcatttga 3180 ccgggtcagc catggcaagc tgtttaggaa gttagagcag agaggagtct cctcatatat 3240 cattcgctta cttgtttttt ggtactcaca gcagactatg gttatacgct ggggtgatac 3300 catgtctact tcgtttcatg tcagcaatgg agtaaggcag ggaggtatac tctccccctt 3360 cttatttaac gtttacatag atggtttatc tattaatttg aatgaatgtc caacggggtg 3420 ttatgctggg tctcttaaga tcaaccacct tatgtatgct gatgatctag tcgtcttaag 3480 tccatattct gctggcctcc agcaactttt gaatatttgt actgaatatg gtgctgaatt 3540 tgacattaaa tataatgctg taaaaagcaa tgtcatgata gtgcaatgta agggcaatac 3600 tcaaactgtt ttccctgact tctatttgtg caatgaaata atgaatgttt gtcatgatac 3660 caaatatttg ggtcatatta tttgtgatga tttgtctgat gacgctgata taaaccgtca 3720 acgtcggaag ctatacgttc agggaaatac tttggctaga aagtttttta tgtgtacacc 3780 tgatgtgaaa gtccaactat tcagaacata ttgctcatca atgtatactt cgtatctctg 3840 gtgtcattac aaagtggctt ctatacgcaa attgtatata gcttacaatg atgcattgcg 3900 cattttatta cgtgtgccta gatatcttag tgctacacaa atgtttgttg aagttaatgt 3960 tcctggtcct ttgcctgccc tacgaaatat aatgtaccgc tttatatgtc ggctgaataa 4020 gtcatctaac agcattatat ctgcactagt taatacgaca ttgagtgcgg tatctttaaa 4080 ttccaaaatt tggcgttttt ggaataagtg tctgtcattt agcagcaata tgatccttta 4140 ttgacgtgtt ctttgttttt ctatgtacac ctaatgcttg tttctttcac attttccttt 4200 ttacctgttt gcatggacca ttgagtccga ataaaagttt tgaattgaat tgaattga 4258 // ID R4_HC repbase; DNA; INV; 1449 BP. XX AC U29590; XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Haemonchus contortus non-LTR retrotransposon specific to the DE large subunit rRNA genes of nematodes. XX KW R4; Non-LTR Retrotransposon; Transposable Element; HCR4; KW Non-LTR retrotransposon R4; R4_HC. XX OS Haemonchus contortus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Strongylida; Trichostrongyloidea; Haemonchidae; Haemonchinae; OC Haemonchus. XX RN [1] RP 1-1449 RA Burke D.W., Mueller F. and Eickbush H.T.; RT "R4, a non-LTR retrotransposon specific to the large subunit rRNA RT genes of nematodes."; RL Unpublished (1995). XX RN [2] RP 1-1449 RA Burke D.W.; RT "R4_HC."; RL Direct Submission to Genbank (20-JUN-1995)William D. Burke, RL Biology, University of Rochester, Hutchison Hall, Rochester, NY RL 14627, USA. XX DR GenBank; U29590; Positions 1 1449. XX SQ Sequence 1449 BP; 402 A; 313 C; 407 G; 327 T; 0 other; tttacatgga cgacgtaagc tcgggaagta cgctgtaaac agcaatctac ctgaggagac 60 tggaccggat gggttccatc caggcccaaa tctccgagca gccctgcaaa gggaccgtaa 120 gtgccgggta gatgtcgtac ctattgtctt aggagcttgt ggagaggtat cgtcgaacct 180 aaggcaatat atatgttccc ttgagcttcg agacgatgct actgtcctga tagagcgtat 240 cgagcgtagt accatccttg gcacaaacag gttggtgaaa tgtcatcttt ccaacagtat 300 ggagtgaagc gagggccatg gagtcgcagc gaagtatgta aggtgcactg gtgctgatgt 360 gctggtcagt tcgcccggac cagcgcagtg catacactga aaagctgcta tggactcgac 420 acgtttacag tttaacgggg aggtaacagg ctagtctctg cgttaaaaat ccaaaaaaac 480 tgaaaaatac aaaaatccta aaaacaataa aatacataac cctaaaaaca ataaaataca 540 aaaacacaaa aaaagacgcc ttgtgcgtgt gtatatagat gtacagcttg tatttgttgg 600 agcgaaagca cttgtatata tctatataat atttgtattg ctgtacatag ctgtaaatat 660 agtagttcgt agtagtagcc ctagtcgcca tacgggttat gtatacgggt ttttgtatgt 720 gtatttgttt gaaattccgg cgtgttctac gctcgggatg ctgatgatgc atactgtgta 780 tgcgaaaaat atacgcagag tgttacccta accccggccc ggtgtcagac gccagacgcc 840 cggcgctaag tcctagcttg cagtgttcag ataagaacag gaacaactag agtaaggtag 900 aaagagggga ccattcggtc ctggggcggt atgctatagc actgcaaagc taagggatga 960 agtgccacct aggaaggcga acgtggtgag tactaggaca gtggcggagt gcgcagcaac 1020 gaccaatgtt ccatgacccc agatggcgag tctgtagaga gaacaaaggc gaccgggtaa 1080 aagccacgta cggccgaaaa gcggggaaag gcagtgcgag acctcgaagc attcacttcc 1140 gagagcttcg aggaactgac cgagtacctg cttcaggagt tgagatggct ctcctggagt 1200 cggtctcccg gggggagctg accattcgcg tttgcctttg cggctcgcga ccgggggggg 1260 tttatacctt cccaacaggc acccacctga gaggtacccc tatgaatcgt ggaaggttcg 1320 cctgaagcga ggagggggcc gtcgcgtacc ggttctgcga tgactcggcg tccagaacgt 1380 tctctatggt ttgcaacgat tagatataaa caagccatag cagacggtac aagtggagag 1440 ggttagagg 1449 // ID BEL-61_CQ-LTR repbase; DNA; INV; 424 BP. XX AC AAWU01017211; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-61_CQ_; KW BEL-61_CQ-I; BEL-61_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-424 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 276-276 (2011). XX DR GenBank; AAWU01017211; Positions 9239 9662. XX SQ Sequence 424 BP; 103 A; 138 C; 97 G; 86 T; 0 other; tgttccgtac cggaagctgt ttagcgtgta cccgagcaac cctgaagggc gcgccaacca 60 aaacaaactg acaggtggaa aacggcgcgt gcaacgacga tcgctacttt cacccccgcc 120 gccattcttg tgccgacctc cagccgagga ggaagctgcc gacgcgtttc ggacaatttc 180 atcgacgcga acccacgcga atcccagaac cggcttccga ggaagcaccc gcgaccgcac 240 ttacttttgt accgagaaac cgtggagaaa tatatttaag tagtctagta aatccgccca 300 atcccaatcc gatccgaccc tccgaagtgg tcctccgttt gttcgcccat gtgaccgtgc 360 tcgaaagttc caccgccgca aaatacagtc cacatttctt cctctgttcg gctttgaccg 420 aaca 424 // ID Copia-1_DPer-LTR repbase; DNA; INV; 245 BP. XX AC super_0; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DPer_; KW Copia-1_DPer-I; Copia-1_DPer-LTR. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_0; Positions 11767041 11766797. XX SQ Sequence 245 BP; 81 A; 42 C; 38 G; 84 T; 0 other; tgttggaata cgcctattcc cacattcaaa taataacatt aaataatatt atcgatatgt 60 taggtattat tattataagt tagtctggtc acacttgaag tgtcgaccct ctctcttctt 120 ttgtgatctc aagtgtgtgg accggtgttg cagaaaaagt aatcaagaaa tacataaata 180 atctttaaag aaaaacataa cttggtactg tcttccgttt actaattgtc atagctagtc 240 aacca 245 // ID BEL-147_AA-LTR repbase; DNA; INV; 531 BP. XX AC AAGE02019780; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-147_AA_; KW BEL-147_AA-I; BEL-147_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-531 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019780; Positions 1602 1072. XX SQ Sequence 531 BP; 184 A; 100 C; 97 G; 150 T; 0 other; tgtgccgaca agacccctcg tttcgacaac cttaacaatc acagcgctac gttgatcgat 60 gacagcgtag atgatgagtg atatcgggag agaaatgaca ggcgattagt ggatagaaga 120 agaaaaaacc cttcgtcatc ttgtgcctgt gactaggaga ttatcatcca agtgaattat 180 agtgaactag aatctttatt aatagtaaaa cactatttcg agaaaattag cgtgaattat 240 atctaatgat agactgtaag taacactata tacctttgct gcaaatcgaa aaattatgtt 300 taatgaacga tatgtctaaa tcaggattgt tcggttgata ctgtatctac cacataattc 360 aaacacgacc cattcttgaa attgatagcg gtaaccaaat ttgtaagtaa attcacctta 420 tttagaaaac ctaacatgca gaaacctaat aaaatgaatt cattgtagct tgaagctaaa 480 cccacctaat cctgtgtttg ctgtcgagag ttggtgcacc ctaaccccac a 531 // ID DNA8-2_CQ repbase; DNA; INV; 2184 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2184 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 79-79 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. 8bp TSDs. XX SQ Sequence 2184 BP; 745 A; 324 C; 392 G; 723 T; 0 other; cagggctgtg gagtcggagt cggagtcgga gccggagtcg gtggagtcgg gtctttttgg 60 ggacctggag tcggagtcgg agtcggagtc gtcaaaactc gaacagctgg agtcggagtc 120 ggagccggag tcggctaaat tttaagagct ggagtcggag tcggagtcgg agccgaagat 180 ttctgataac ccggagtcgg agtcggagcc ggagttatct taaattcaac aaaaaatatg 240 ttttttttat tgttcagtat ttcctatata acagcttaat ttacaaaaca tcttatattt 300 ttaattgatt tatcagatgc cattatgttt ttgaagcttt ttattgtgat tgaaaagctc 360 taattgtgtt attacactat aatcactaaa tgataacctc ttacccaagt atgtagtatc 420 ataattcaaa aaataataaa aaaaatattg gttatgtagt cagacatatc taattgattc 480 ttgatgcttc cttttgccta tatttatgtg ccctttcaat tcaaatttga ccaaatttca 540 tccagttatt tctgaaaaaa gtacacacac agtcatcttt taccccgata gcgaatgtct 600 tggaggtttt aagttaaatc attttttagg aaaaaaatac gaggtacata aaaagattat 660 aaatagtaat cactgttttt gcagatcgaa aaagagtcac tgacagttta acttttgtaa 720 caaataatca acgataataa caatctctta cttggaactc aacatgcgca ttttgtttat 780 aactgagtac aagaaaatca aagtgttgca tttaaaataa ttgaaactaa caaaaaatgt 840 caaaagaagt tgcaacaaat ttgaaataat cattgattgt tgatgttgat gttgatttta 900 ggtaaactat tttgatatcg ttgcaaatta tcgttgaaca aatctcaaga attcagtaca 960 aaaatgaact aataaaaaaa tgtatagaac aaaatatagg attttaattt atttttcaaa 1020 tattataata aaaaaaacag aatcaaaata ttatcaactg gtacgaattt tctcatactg 1080 tatgtcccac gccaatatga cggaaggtga taatcattgc aaatcaatgt accttaaaaa 1140 aatgaaatat ttgtgaccta gaaaatattt tacttgtgga tatttgtttt ttattatatt 1200 ttaagttttt tcatatattt tgatggaggc gcaagatcat ccataaagct tcgttttagg 1260 tgaagattca cgaagtatcg tgcgcctcct ttttaaagta tataaaattt taaaattaaa 1320 ataaattaaa aatttccagg gtaaaatatt ttccatgtca cagatatttg aaaaggacat 1380 cttaagcgtc ctttccacga agaaattgtt tagatacatt gatttgcaat aataatctcc 1440 ttcagccatg gcgtgatgtc catgtacgtc ttaaaaaaaa acaaaaaaaa gtttcgttta 1500 aaattgtcat ttaaaaaaca aggtgcctgt cactgtgctt cttacaaatg tcagcctgaa 1560 agtgagcaaa ttgaatttta atgttcaata gatcattaag gccaggtttc ttttagttat 1620 tcattcaaaa ataacaattt aatatttaaa tttattagct ttgaaaaaaa tattttcaaa 1680 tttgaggtta agcaaataaa tccaaattta cttacaaaac tgttcttctc aaaatgcctc 1740 taaaactgaa gatatttttt gatttctgtt tccataacgt ataaaacttg taacctagaa 1800 actttttttc cgtttattta ctttacttta cttttactta acttattttt cttgagaaaa 1860 acatgacgct tagagagtat ttaaataatc ctatttatca atgttttaaa aataattaac 1920 taataatagt taactaactt ttgaaacatt taaaaatatg tggagtcgga gtcggagtcg 1980 gagccaacca tttgttgaaa gctggagtcg gagtcggagt cggagtcggc tagagttggt 2040 aggccggagt cggagtcgga gtcggagcca accatttgtt gaaagccgga gccggagtcg 2100 gagtcggagt cggctaatct gagaaagccg gagtcggagt cggagtcgga gtcgtttgaa 2160 atatgacccg actccgcagc cctg 2184 // ID ERE1_ED repbase; DNA; INV; 2075 BP. XX AC . XX DT 25-JUN-2008 (Rel. 13.1, Created) DT 25-JUN-2008 (Rel. 13.1, Last updated, Version 1) XX DE Repetitive element from Entamoeba dispar - consensus sequence. XX KW Nonautonomous; Entamoeba; Transposable element; Ed_ERE1; ERE1_ED. XX OS Entamoeba dispar OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-2075 RA Lorenzi H.A. and Caler E.; RT "GenBank accession number EU099439."; RL Direct Submission to Genbank (09-AUG-2007). XX RN [2] RP 1-2075 RA Lorenzi H., Thiagarajan M., Haas B., Wortman J., Hall N. RA and Caler E.; RT "Genome wide survey and discovery of repetitive elements in three RT Entamoeba species."; RL Repbase Reports 8(10), 1684-1684 (2008). XX DR [2] (Consensus) XX CC Found flanked by two inverted Ed_SINE1 transposable elements. XX FH Key Location/Qualifiers FT CDS 253..1350 FT /product="ERE1_ED_1p" FT /translation="MEELINKNTQFFKEEIETLPNIQQSIDDTTKHYQYYL FT GSIKNEIFEIQDNETYSEAIKRGEGIFDRIDETERRRKLVIEKSQETINKI FT EIIKNKMNKIIEECNDDEEILSKTRNEIINKIISIEEMIMKDKIFRTPEEI FT DKITKKFNKKKEEWYTYYSDYFERKKIRRKKKQEEERRKKEEEKKKEEEGL FT RIMKGMNTIEEMKQLEEWTKRKVSNILFDSDIDNWNINTSVFNQRIINKEH FT IIIIIEDTNGNKFGGYVNVKIDKVGNWINDPKSFLFSLKSKGRIQGMMKFD FT IKEPQYAFYLYNQSSDCLFGFGCGCDIRVYKENNKTKSYCKQHSYEYKGIS FT NALCGKQYPEYFTPQRIIVIEMK" XX SQ Sequence 2075 BP; 990 A; 141 C; 231 G; 713 T; 0 other; ttctttttat ttgtgttttc atttctattt cattttcatt tctatcatta ttatgtgata 60 ttatcattaa ttatgatgag atgttttgtt atttattatt tttattatat tctattttct 120 tttttttatt attgttattt aaaacaataa aaatgaaatg acgtttatta aaataataaa 180 aaaaatgagt gttataaacg tgtatttaat tatttaatca tctgatttta atctgatatt 240 cattaaatta aaatggaaga gttaataaac aagaacactc aattctttaa agaagaaata 300 gaaacacttc caaatattca acaatcaatt gatgatacta ctaaacatta tcaatattat 360 ttaggaagta ttaaaaatga aatatttgaa atacaagata atgaaacata ttcagaagct 420 attaaaagag gagaaggaat atttgataga attgatgaaa cagaaagaag aagaaaacta 480 gtaatagaaa aaagtcaaga aacaattaat aaaatagaaa taattaagaa taaaatgaac 540 aaaataatag aagaatgtaa tgatgatgaa gaaatactaa gtaaaacaag gaatgaaata 600 ataaataaaa tcatttcaat tgaagaaatg attatgaaag ataaaatatt cagaacacca 660 gaagaaattg acaaaatcac aaaaaaattt aataaaaaga aagaagaatg gtacacatat 720 tatagtgatt attttgagag gaagaagata agaagaaaaa aaaaacaaga agaagaaaga 780 agaaaaaaag aagaagaaaa gaaaaaagaa gaagaaggat taagaataat gaaaggaatg 840 aatacaattg aagaaatgaa acaactagaa gaatggacaa aaagaaaagt atccaatata 900 ttatttgatt cagatattga taattggaat attaatacat cagtatttaa tcaaagaata 960 ataaataaag aacatataat aattattatt gaagatacaa atggaaataa atttggagga 1020 tatgttaatg tgaaaattga taaagttggt aattggataa atgatccaaa atcattttta 1080 ttttcattaa aatcaaaagg aagaatacaa ggaatgatga aatttgatat taaagaacca 1140 caatatgcat tttatttata taatcaatca agtgattgtt tgtttggatt tggatgtgga 1200 tgtgatattc gtgtttataa agaaaataat aaaacaaaat catattgtaa acaacattca 1260 tatgaatata aaggaatatc aaatgcactt tgtggaaaac aatatccaga atatttcaca 1320 ccacaacgaa ttattgtaat tgaaatgaaa tgatttttat tcattattat ttatttgaaa 1380 tataaaaact aaaaaagaat ataaaataaa atcaataaat gaatatgaac atttcatcta 1440 tcaatgaatg aaatggtgaa tataaaaatg ttcatatttc attcttgttt tatttcattt 1500 gaaataatca tattcaatgt gttaatttaa taaataataa aaagaaataa gaataaaaga 1560 taaataaaaa tataataaaa tatttatttt attctttttt atttattttt tattttattt 1620 atattatttt atatttattt attttattct tcttctttta tctttatatc tattatatat 1680 ttcttatttt attatttatg ttatattctt atatcatttt attcttattt tatatttatt 1740 cattattatt catttcatat taacaatgat aaatatcaat acaacatata ataaataaat 1800 gatataagat aaacaacaat aaataataat aataaaatat aaacaaacaa tataaacaaa 1860 tataaaatga taaaaagatc atctaatata aaataataat aacaaagaat aaaaataaga 1920 aatgaatgat gtttagaaaa gaaaaaattg agaaaaagaa gaaatgaatg atgatattca 1980 aagataaaaa tcaatataaa atataaaata aaatataata aataataaga ataagaatga 2040 ttatatctta ttttattatt atattcttct ttatg 2075 // ID DNA8-112_AP repbase; DNA; INV; 1310 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-112_AP. XX NM DNA8-112_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1310 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2050-2050 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1310 BP; 494 A; 160 C; 154 G; 501 T; 1 other; cagtgttgtg ctttagataa atttatctag atatnttatc tagataaaaa ataaaattga 60 catctaattt ttatctagat aaaaattata gccatctaga taaatttatc tagataattc 120 acaaaatcca tagctaataa ttagtaataa ctattttaca atattatcat tcaatataaa 180 atagataact taataagaaa tctcatttgt attacttgca aaaattggtt ctaaaatagt 240 tttacatagc atttattatt aattatgaat aatgcacaga tataaactac tattatagta 300 tattataact gttgaataat gacgatttat tatgtcaata tgcataggag atgcgaatta 360 tttatataat tttataatat ggcccatacg gggaagtctc acgcccgtat acgatttatg 420 attttacgaa cgccgaaaga gcctttacaa aaattagaac tagagatacc cctccataca 480 tagagtatac tctatgccac cataggtcct acgggtaaac agggtttaac atcatgttgg 540 tttacgggct cgttccctat gttcagactt cagatcatag gtaaacgtaa accgaatatt 600 atagaatatt atattattat attatataaa aatatattta aatatatata ttattcatat 660 attctgtgtt aaaactatgg aatatagcat ttgtgtaata taccagtaaa aagggaaatg 720 gttactttgt actcatatag ttatatgata ttcaaaattt ctttaaattt actaaataat 780 ttaataaata tttattaggc ttaaacttta aatgtataac tatttaataa gaattacaat 840 aacaccacag tttctacagt acaaattaag taaatttgat atgtcaatta ttagagcctg 900 ggtcttaagt tttcaccaag tgtacgcaat tttatgtttg atgggagtga aaataaagac 960 ttcattaaat acagatgcgt atcacaaaat aacttttttt ttgcattatt cagaatcttt 1020 ttttatgttt gggataatat gatgttacca acttatcttt agcatgcttg gaatattgtc 1080 taattatgta aatattgtaa taaaaataca aatttatttt aattttcaat tattagtata 1140 aatttaaatt ttgaggtaat accttatata aatatctaga taaaaattat tttatctatc 1200 tatatctaga taaattcaat ttctacagta tctatctata tctagataat tttaaattca 1260 gatatcttat ctatatctag ataatttttt tttatctaag cacaacactg 1310 // ID SR2 repbase; DNA; INV; 3913 BP. XX AC AF025672; XX DT 04-JUN-2009 (Rel. 14.06, Created) DT 04-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE Schistosoma mansoni SR2 non-LTR retrotransposon. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; SR2. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-3913 RA Drew A.C., Minchella D.J., King L.T., Rollinson D. and. RA and Brindley P.J.; RT "SR2 elements, non-long terminal repeat retrotransposons of the RT RTE-1 lineage from the human blood fluke Schistosoma mansoni."; RL Mol. Biol. Evol 16(9), 1256-1269 (1999). XX DR EMBL/GenBank/DDBJ; AF025672; Positions 1 3913. XX FH Key Location/Qualifiers FT CDS 693..3743 FT /product="SR2_1p" FT /translation="MLPPIMNTEAVFLKPRTPFKLASFNVRTLMQIRQQMG FT LAMSLESLNVDVCCLSETRIQDSSEVLQIRSPSVASKSLFHVRLSGDPVAS FT SSGLAGVGVALSARAEAALIDWIPINSRLCAVRLESSIKVRRNRREKRCLF FT VISAYAPTDCSPDAIKDEFYHQLTVLLQKARSTDIVVLAGDLNAQVGRLGT FT EESRLGGRWGLVGRRTDNGDRLLQLCTDHNLFLASTNFRHSHRRCATWRPP FT SASQAWTQIDHIAISYRWRGCVQDCRSFWSTYLESDHALVCANLTLLFSGR FT RIDHHQRIDVSKLAATSVASKYRAELASRLATIPPKSIDEHWLHLHDAMKM FT ASKAACGFAKRPAYKHWVSSGSLQLMEARRSTPGDREFDHKRRLLRNEIGQ FT SLRKDREAWWSERANEMEAAAASGNCRKLFQLIRATGSKKSGVSETIYEDD FT GMPITNISRRLGRWAEFFEGQFNWPAAPATSTSLSCPPWPVTTDPPNEAEV FT RKELQLLKRYKSPGPDDLPPALFKDGGDFLAKELTVLFXKVWEQESVPTSW FT NESIVVPIFKKGSRCSCNNYRGISLLSIASKLLASXILRRLFKTRERLTRE FT EQAGFRSGRGCIDHIFTLRQMLEHRHTYRRPTIVVFLDIRAAFDSLDRTVL FT WDCLLKKGVPEKFINILKALYTNTSGRVRAYNHLSPLFHSSSGVRQGCPIS FT PFLFNFAIDDILETALMDVSNGGVDMLPGERLLDLEYADDIVLLCDNAQGM FT QSALNQLAISVRRYGMCFAPSKCKVLLQDWQDSHPVLTLDGEQIEVVEKFV FT YLGSYISAGGGVSDEIDARIMKARAAYANLGHLWRLRDVSLAVKGRIYNAS FT VRAVLLYACETWPLRVEDVRRLSVFDHRCLRRIADIQWQHHVSNAEVRHRV FT FGRRDDNSIGVTILKHRLRWLGHVLRMSSQRLPRRALFADSGTGWKKRRGG FT QCMTWCRGMKESCKGLASVGPSRLPGWGPRDGATQWLETLSDMAQNRSQWR FT SCCNLLLLST*" XX SQ Sequence 3913 BP; 935 A; 930 C; 968 G; 1057 T; 23 other; cccaaatgcc ctggtacggc cgagagtggg gagagtccgc tctccctctc gaaatgctct 60 cacatggcca cgcgtatata gcctctgcca gggaagtcct actcactgcc ttctcgtggc 120 rktrgtgttg tttacgaaat tgagaggacg aaaagcgaat gtccggcgct ttaaccgggt 180 tggtggacac ggagggtcca cctaggggag ttggaaaacc ctgattccaa accaatggtg 240 cacatgggct ccagtatcct gaaggaacga atggcgtatg aaccagtcat tggtcaccgg 300 ctaccatggg actgcatctc ctcacgatgc tccactgcct tgtggatcag acctttaggt 360 caaaggctcg gggtgtggcc ccctaagaaa accacctgct tcggtttggg cacccgggca 420 gtatcacagc cctcacacaa ataaatgaaa tgatgaattc tattgacgat amtattctcg 480 ctcatactag aagcaagaca atttacacta ttattattat tattaccatt attattatta 540 ttattattat tactattatt tactacgtcg ctttttcact ccctttattc ccaaattgtg 600 tatccttcct ttccaactta tgcagbagct cgcccatggt ggaaaatctg tcctatctaa 660 acgatctcca gtcactgtct ggttgaaagt gaatgcttcc accgattatg aatactgaag 720 cagtctttct gaaaccacgt acgccgttca agctggcttc cttcaacgtt cgcacactaa 780 tgcagatcag acaacagatg gggctggcta tgtctttaga aagtcttaat gttgatgttt 840 gttgtctatc cgagacccgt attcaagact ctagtgaagt actacaaatt cgctctccat 900 ctgtcgcytc gaaaagcttg tttcacgtgc gcttatccgg ggaccctgtg gcatcttcgt 960 ctggtcttgc tggcgttggt gtcgcactaa gcgctagggc tgaggcagca ctaatcgatt 1020 ggatccccat taacagtcgg ttatgtgctg ttagattaga aagttccatc aaagtgagaa 1080 gaaaycggcg tgagaaacgg tgtcttttcg tcatctccgc ctatgccccg acagattgca 1140 gcccggatgc aatcaaggat gagttttacc accagttaac agttcttctc cagaaagcgc 1200 gttcgacaga tattgtagta ttagccggag acttgaatgc tcaggtcggg cgtctaggca 1260 cagaagagag tcgtttaggt ggccgatggg gacttgttgg tcgcaggaca gataacgggg 1320 accgtttgct gcaactgtgc acagaccaca acctgtttct ggcyagcact aacttccggc 1380 acagtcatcg ccggtgtgcc acctggcgtc ctccctctgc atcycaagcc tggactcaga 1440 ttgatcacat cgcgatcagc taccgctggc gtggttgtgt acaagactgc cgctcctttt 1500 ggagtaccta tctggagtct gatcatgccc tggtctgcgc caatctcacc ttacttttca 1560 gtggccgaag gattgaccac caccaacgga tcgatgttag taaactggct gcaacwtctg 1620 ttgcaagtaa gtatcgagcg gagctagcct ctaggctagc taccatccca ccgaaaagta 1680 tagatgagca ttggttgcat cttcacgacg ccatgaaaat ggcgagtaaa gccgcttgcg 1740 gcttcgcgaa acgtcccgct tacaagcact gggtttcttc tggctcctta caactgatgg 1800 aagcccgtcg gtctactccg ggtgaccgtg agtttgacca caaacgaagg ctgttacgta 1860 atgaaatcgg gcaaagcttg cgtaaggacc gagaagcctg gtggtcggag cgtgctaatg 1920 agatggaagc agcagctgca tctggtaact gccggaagct cttccaactc atccgagcca 1980 ctggcagcaa gaagtctggt gtgagtgaaa caatctacga ggatgatggg atgccaatca 2040 ctaacatctc tcgacgtctt ggacgatggg cggaattctt cgaagggcag ttcaactggc 2100 ctgctgctcc ggcaacatca accagyttgt cctgcccccc atggccggtg acgactgatc 2160 caccaaacga ggcggaagtc cgcaaggaac tccaactctt gaarcgytac aaatcaccgg 2220 gcccagatga cttacctccr gctcttttta aagatggtgg tgactttctg gctaaggaac 2280 tgactgtgtt gtttgsaaag gtttgggagc aagaaagtgt tccaacatca tggaatgagt 2340 cratagtcgt ccctatcttt aaaaagggtt cacgttgttc ctgtaacaac tatcggggga 2400 taagtctact ttcgattgcg tccaaactat tggcttctrt cattcttcgt aggttgttta 2460 aaacccgaga acgattgact cgcgaggagc aggcwggttt tcgttctggt cgaggatgca 2520 ttgatcacat cttcaccctc cgccaaatgt tagaacaccg tcatacttat cgcaggccaa 2580 caatcgtagt gtttcttgat atcagggctg ccttcgattc gttggacagg actgttctct 2640 gggattgtct attgaagaag ggtgtgcctg agaagttcat taacatctta aaggccctgt 2700 atacgaacac ctcaggcaga gtgagggcat acaaccacct ttctcccttg ttccattcga 2760 gcagtggggt taggcagggt tgcccgatct caccattcct cttcaacttt gccatcgacg 2820 acatcctgga aacagctctg atggatgtaa gtaatggcgg tgtggatatg ttgcctggag 2880 aacgacttct cgaccttgag tatgcggatg atattgtctt actgtgcgat aatgcccaag 2940 gcatgcaatc cgcacttaat cagttggcaa tcagtgtccg caggtacggc atgtgctttg 3000 caccctccaa gtgcaaagta ctcctacaag actggcagga ttctcatcct gtactcaccc 3060 tggatggtga gcagattgaa gtagttgaga agttcgtgta tctaggtagc tatataagtg 3120 ctggtggtgg cgtgagtgat gagattgatg cacgtataat gaaagccaga gcggcttatg 3180 ccaatctggg ccatctttgg cgccttcgtg atgttagtct ggctgtaaaa ggtcggatct 3240 acaacgcgtc ggtgagagca gttttgctct atgcttgtga aacctggcct ctccgagttg 3300 aggatgttag acgwctctct gtgttcgatc atcgttgtct ccgaaggatt gctgacattc 3360 agtggcaaca ccatgttagt aatgcagagg ttcggcatcg tgtgttcggg cgcagagacg 3420 ataattcaat tggtgtcacc atcttgaaac accgacttcg gtggcttgga catgttttac 3480 gaatgtcgtc ccagagactt ccacgtcgtg cattatttgc cgactctggg actggttgga 3540 aaaagcggag aggaggtcag tgtatgacat ggtgtcgtgg catgaaagaa agctgcaaag 3600 ggctagcttc tgttggtcct tcacgactcc ctggctgggg tccgagagat ggtgcaacac 3660 agtggctaga racgttatca gatatggctc agaatagaag ccagtggcga tcctgctgya 3720 accttctttt actttctaca taaagagtgg ttctaacttt cttaactgaa agagtcttct 3780 ggttgtacat ttyagtccgc cttatctttt catccyttct cttcctactt tcattatttt 3840 gtgtggcgca tatgtatctg gtgccccttt gtaccaatat atatgtgttt aaataaataa 3900 ataaataaat aaa 3913 // ID Crack-13_BF repbase; DNA; INV; 2307 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-13_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-13_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2307 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2307 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 818-818 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..2143 FT /product="Crack-13_BF_2p" FT /translation="MILNEASLYIANLPWDLVELFEDVDDALDVSMLLFNT FT VADFHAPVRRFTVRANSAPWLDAELREAMAMRDEAKTAADLYGLPSDREIY FT KKLRNYVVKLNRKKKEHYFKSKMSENSNDPKYVWKLTNSLLGRSKCQYPTC FT VEVNGQNIVKPRQIANYFNDYFMTKTETHRKNLESQNAPVTVPIDQQIMSD FT KSCCFQFHPVSDDEILRLLLDIPMSSTCGIDQLENKLVRLSAVYIASPVRH FT ICNQSLMSSCFPSQWKRAKVCPLRKNTRKPLTGNNSRPISLLPTLSKVLEK FT VVHNQIWQYAALNDIVTPFQHAYRPNHSTTTALLRMTDDWLSAMDKGEMVG FT VVYIDFSAAFDLVDHELLLAKLQSYGFSSGPLAWMRSYLHARTQVVYINGH FT FSEPSVLECGVPQGSCLGPLIFTLFTNDLPLSVSNSTTEMYADDTSMYKAA FT KSIAEIAAVLQPDLNKIVEWVKNNKLVMNVLKTKCMLFGSPRKLALLPSTS FT LNLVIGTRSIEQVKETVLLGLHMDQSLTWDLHINHVVAKQSRNIALVRRHA FT WAMSYEVCKVVLCALVLSVAQYCLPVLANMSESNARKLQVVQNRACRLVLK FT CPLETHVKQMHSDLGWPMVRERFAISTLTMFFRILVSKEPHNIYSSIVPVH FT STHTYSTRLSNSFSFKLPKPKTNSKKRTFLYRAISLWNDLPPYIKSLPNIS FT RFKKAMYQVFSSQ*" XX SQ Sequence 2307 BP; 688 A; 483 C; 459 G; 677 T; 0 other; aatgatacta aatgaagcat cactatacat tgcaaacctt ccatgggatc ttgttgaact 60 gtttgaggat gtggacgacg ctttagatgt atccatgctt ctcttcaaca cagtggctga 120 cttccatgct cctgttcgac gcttcacagt ccgcgctaac tccgcacctt ggctggatgc 180 agagttgagg gaggccatgg ccatgcggga tgaagctaaa acagcagctg atctttacgg 240 cctaccttct gacagggaga tctacaagaa actcagaaac tatgtagtta agttaaaccg 300 gaagaaaaag gaacactact tcaaatctaa gatgtctgag aattcaaatg atccaaaata 360 tgtatggaag ttaactaact cattattagg taggtccaag tgtcaatatc caacatgtgt 420 ggaagtgaat ggccagaaca tagttaaacc tcgtcaaatt gccaactact ttaatgatta 480 tttcatgact aaaaccgaga ctcacaggaa aaacctggag tctcaaaacg cacctgtcac 540 tgtccctatt gatcaacaaa ttatgtcgga taagtcatgt tgttttcagt tccacccagt 600 cagtgatgat gagatactta ggttactgtt agacataccg atgtcatcca catgtggtat 660 tgatcaactc gaaaacaaat tagttagatt atcagctgtg tatattgcat cacctgtaag 720 gcatatttgt aatcagtcac ttatgagcag ctgttttcct agccaatgga agagagctaa 780 ggtgtgcccc ttacgtaaga acaccagaaa accactgacc ggtaacaaca gcaggcctat 840 tagcctactg cctactttga gtaaggtatt ggaaaaagtt gtacacaatc aaatctggca 900 gtatgcagca ctgaatgata ttgtaacacc tttccaacat gcctatagac caaatcactc 960 cacgactact gctttattac gaatgacgga tgattggctc tccgctatgg ataaaggcga 1020 aatggtaggg gttgtttata tcgacttcag tgccgcattc gatttagttg accatgagct 1080 cttgttggca aaattacaat cctatggttt cagttcaggc cctttagctt ggatgagaag 1140 ttatcttcat gccagaacgc aggtagtgta tattaacggg cacttttcgg aaccatctgt 1200 acttgaatgt ggcgttcccc aaggaagttg tttaggccct cttattttca cacttttcac 1260 taatgattta cctttgtcag tttcaaatag cacaactgaa atgtatgcag atgatacaag 1320 tatgtacaaa gccgctaagt ccattgctga gattgctgct gtcttgcagc ctgacctcaa 1380 taagattgta gaatgggtaa agaacaacaa acttgtgatg aatgtcctta agactaagtg 1440 catgcttttt ggatctcctc gaaaactggc tcttctacct tcaacctcac tgaatttggt 1500 tattggaact agaagtattg aacaggtgaa agagacggtt ttgttggggc tgcacatgga 1560 tcagtcactt acatgggatt tgcatatcaa tcatgttgtt gccaaacaat caaggaatat 1620 cgcccttgtg agaaggcacg catgggccat gtcttatgaa gtctgcaagg ttgttttatg 1680 cgcacttgta ttatcagtag cacagtactg tctcccagtt ctagcaaaca tgtctgaatc 1740 aaatgctcga aagctccagg ttgtacagaa cagagcctgt agattggtgc ttaaatgtcc 1800 actagaaaca catgttaagc aaatgcattc agacttgggt tggcccatgg tacgagaaag 1860 attcgccatc agcacattga ctatgttttt cagaatactt gtcagtaagg agccccataa 1920 catttacagc tccatcgtcc ctgtccattc tacccacaca tattcaacta gattatccaa 1980 ctccttcagt tttaaactcc ctaagccaaa aacaaactca aaaaagcgaa cttttctcta 2040 ccgtgccatc agtctttgga atgacctacc accatacatc aaatccctgc ccaacatctc 2100 acgctttaaa aaggcaatgt atcaagtctt ctcaagccaa tagttttttt ttaatgattg 2160 tgtatctatt attatctgtt tgtatatata catgtatgtt tgtgtatata tttgcgtttg 2220 tttatgtatg tttcattgtt tcatatgaac cctggtagac tagccctaaa tagggctaaa 2280 gggtatcgga ataaaggaat aaaggaa 2307 // ID Gypsy-94_CQ-I repbase; DNA; INV; 7121 BP. XX AC AAWU01007055; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-94_CQ_; KW Gypsy-94_CQ-LTR; Gypsy-94_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7121 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 567-567 (2011). XX DR GenBank; AAWU01007055; Positions 63224 70344. XX CC Positions [5124-5600] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 431..2440 FT /product="Gypsy-94_CQ-I_2p" FT /translation="MEHYIRQYFSMNLAHLLQDELDNELIVRNIDASSDSR FT TVQERKCRAELKKERDLNLADIEYETGWDDILTELDVCNIKVHEISKILLD FT RKEKSAPDQKYKTRLLHLLFRLLRAKAHATDESVINTIGDIAGHAARLLGN FT YFSLLSPHEEIRKAELEFASNSMREQLENLNKTPQEEDATPETSIRGGNLT FT PEDSNNIKEKATNTTDLPNPQNSQQIAETEKLKNENTDLKNLLNDLFTHLQ FT KIDKKIENLEAEKNNPPVNKNSTQFHNSGDLNLSDSQKPKFSYQGFLDWLV FT KDQNLIEPNLKNSDQQDKPDPPKPTKNNNPEQEKNSFGKRFPIHKWKIRYD FT GSDQGRQLNEFLKEVEFNARSEGFTKQELYNSAYHLFQGKARSWYMEINSN FT DELETWENLVEELKQEFLPPDLDFYYERQAHMRKQRLNERFQDYFLDMTRL FT FRNLTCPIDETKKFNMIFRNLRTEYKSAMLAAKIKTIPAMKSFGKYFDSIN FT WQWYAKTDKDNNRGQRSGERQVNEIQSERRGPMNTNGNNYNNNRQWNKGEN FT VKREFKIPNRPNSNFRNDRNETNTYKPQQQSSEKSTEKKPLQNSSKPDDVK FT PGPSKNLNTLEKILKAYVPIRRGTCFNCHNFGHNFNQCKQPAQVFCEVCGF FT PGFSTPDCPYCPTKNVMEAAQ" FT CDS 2440..3549 FT /product="Gypsy-94_CQ-I_1p" FT /translation="MRQSSLVKLPESPPSNLDELETTHNILQHLGYAHVST FT ESRGAAEVNTIFLHPRGDNRPFIRINLLGVELVALLDSGANRNILGKGSHK FT LVASLNLSCSPSDMTLITAEGHPVEVMGEIEIPVTFNGCTRIVSFVVAPTL FT KRKCYLGMSFWDQFGIFPSLRESFIETIDDSEEPTDDEVLLTPEEQTELDE FT IKKLFLVPRTGDIGCTSLLTHTIEISDEYKDKPPIRQNPYPWSPMVQSKIG FT IALDNMIRDDIVEPSRSEWSQPVVPVSKRDSDAVRLCLDARKLNERTKRDA FT YPLPHQNRILSRLGSCKYLSTIDLSQAFLQIPLSEESRKFTAFSIPGRGLF FT QFKRLPFGLLIRGAPWYKVKLELLWIT" FT CDS 3513..5972 FT /product="Gypsy-94_CQ-I_3p" FT /translation="MVQSKIGIALDNMIRDDIVEPSRSEWSQPVVPVSKRD FT SDAVRLCLDARKLNERTKRDAYPLPHQNRILSRLGSCKYLSTIDLSQAFLQ FT IPLSEESRKFTAFSIPGRGLFQFKRLPFGLANSPASLSKLMDKVLGFGELE FT PNIFVYLDDLVVASKSFEEHKQHLTELARRLGEADLRINIEKSKVCVPELP FT YLGYILSKDGLRPNPDRVHAIVAYEVPKSVRALRRFLGMANYYRRFISHFS FT ELTAPLTDLLKNKPKKVEWSAEANRAFRAIKERLISAPVMANPDFGLPFTV FT QTDASDHAIAGVLTQIQNGEEKVIAYHSEKLKGAELNYHATEKEGLAALRC FT IEKFRCYIEGTEFTLVTDSSALTFIMRSKWKTSSRLSRWSIDLQQYHMLIK FT HRKGKENIVPDALSRSLEVAEIADGSDWYSKLFASVKNDPENYLDFKIEEN FT KLYKLVSSQSDVLDYSFDWKQCVPDSMREEILTKEHDDAFHVGYEKTVEKI FT KKRFYWPRMAADIKRHVLNCDTCKRIKHSTTSVVPEMGNQRVTTRPFQILT FT LDYIQSLPRSTKGNAHLLVLMDVFSKYCLLVPVRKISSGSVCEILEQLWFR FT RLSVPQYLISDNATTFLSKEFQNLLKKYGIQHWASARHRSQANPTERLNRT FT INAMIRSYVRDNQKLWDTRISEVEFVLNNTVHATTKFTPYRVIFGHEIVTK FT GSEHRLQTMGELTDEERIEKMGNISTSIYDKVVHHLRKAHEVTKQRYDLRH FT KRYAPTFDVGQRVYKRTFRQSSAGDHFNAKLGDVYEPCVVLAKKGTCAYEV FT GDLNGKSLGVFSAGDLKA" XX SQ Sequence 7121 BP; 2226 A; 1347 C; 1521 G; 2027 T; 0 other; tattggcgac caataaaaag ttaataacct cgttattatc ctttttggag gtaagcgagg 60 gggggaaatt tgatggtact tattatcgga atcggttctc ggatcggtgt cggtgtttct 120 gagaaatcca atctcttttt gctgctcaat taaaagtgta caactattag accggagtga 180 atttgtctag cggtgctatt atagatcact ttaatggtga aacaattgat gattacacgt 240 ggattgaagc tattgttctg aatactaaac ataaccttta cttagtcgat ttttcgctag 300 atttaactat taaaaaacgt ctgtaaatta ttggccttta aatggattga atttcgtgca 360 aattttcata atactttagt aaaattttct tttcattata tattttatat ttgtttgtgt 420 aaattccaaa atggagcatt atattcgtca atactttagt atgaacttgg cacatttgct 480 acaagatgaa cttgacaatg agttaattgt tcgtaatatc gatgctagtt cggattcgcg 540 aacggtacag gaacgaaaat gcagggcaga gcttaagaag gagagagatt tgaacctcgc 600 agatattgag tacgagacag gttgggatga tattttaaca gaactagatg tttgcaacat 660 caaagttcac gaaatttcaa aaattttgct agacagaaaa gaaaaatctg cacctgatca 720 aaaatataaa actcgattgt tacatctttt gtttcgttta ttacgagcga aagctcatgc 780 gacagatgag tcagtaatta acaccattgg tgacattgct ggtcatgctg caagattgtt 840 gggaaattat ttttcattgt tgtcaccgca cgaggaaata cggaaagctg aacttgaatt 900 cgctagcaat agtatgcggg aacaactaga gaatttaaat aaaactcctc aggaggaaga 960 tgccactcct gaaacaagta ttcgcggagg taacctaact ccagaagata gtaacaacat 1020 aaaggaaaaa gccacaaata cgacagactt accaaaccca cagaactcac aacaaatagc 1080 tgaaactgaa aagttgaaaa atgagaatac tgacttaaag aatttattga atgatttgtt 1140 cacacatctt cagaagatag acaagaaaat tgagaattta gaggcagaga aaaacaaccc 1200 accagtaaac aaaaatagta cccaatttca caattcaggg gacttaaact tgtctgattc 1260 acaaaaaccc aaattctctt accaaggatt tttggattgg ttagtgaaag accaaaattt 1320 gattgagcca aatttaaaga attctgatca acaagataaa ccagacccac ccaaacccac 1380 caaaaacaac aatccagaac aggaaaaaaa ttcgtttgga aaacgtttcc ccattcacaa 1440 atggaagatt cggtatgatg gatctgatca aggtcgtcag ctgaatgaat ttcttaaaga 1500 agtggaattt aatgctagat ctgagggatt caccaaacaa gaactttata attcagccta 1560 tcatttgttc caaggcaaag caagatcatg gtacatggaa ataaactcca acgatgaact 1620 tgaaacgtgg gaaaacttag tcgaggaact gaagcaggaa tttcttcctc cagatcttga 1680 tttttactac gaacgtcaag ctcacatgcg caagcagagg ctaaacgaac gttttcaaga 1740 ttattttttg gacatgaccc gacttttcag aaacctcact tgtcctattg atgagactaa 1800 aaagttcaac atgatttttc gcaatcttcg tacagaatac aaaagtgcaa tgttggctgc 1860 aaaaattaaa actatcccag caatgaaatc gttcggcaaa tattttgatt ccatcaactg 1920 gcagtggtat gcgaagactg ataaggataa caatagaggt caaagatctg gggaaagaca 1980 agtaaatgag atccaaagtg agagaagagg cccgatgaac actaacggca acaattacaa 2040 caacaatcgt cagtggaaca agggagagaa cgttaagaga gaatttaaaa tcccaaaccg 2100 accaaattcg aattttcgaa atgaccgaaa tgaaaccaac acttacaaac ctcaacagca 2160 atcgtctgaa aaatctactg aaaagaaacc attgcaaaat tcttcgaaac ctgatgatgt 2220 gaagccaggt ccaagcaaaa atttgaatac tttagagaag attttgaagg cgtatgtccc 2280 gataaggagg ggaacatgtt ttaattgcca caattttggg cataatttca atcaatgtaa 2340 acagccagcg caagttttct gcgaagtttg tggatttccg ggattttcta cacccgattg 2400 cccatactgt cccacaaaaa acgtgatgga ggctgctcaa tgaggcagag tagtcttgtg 2460 aagttacctg aaagtcctcc gagtaattta gatgaactcg aaaccaccca caacatcctt 2520 cagcatttgg gatacgccca cgtgtcaaca gaatctcgtg gggctgcgga agtcaacact 2580 atttttctgc atccacgcgg tgataaccga ccgttcatac gtatcaatct gctaggagtt 2640 gaattagtcg cacttttaga tagtggcgcg aaccgaaaca ttttgggtaa gggatcacac 2700 aaactagtcg cgagtttaaa cttgagctgt tcaccatcgg acatgacgtt gataactgca 2760 gaaggacatc cagtcgaggt gatgggagag atagaaattc cggttacttt taacggatgc 2820 acgagaatcg tttcttttgt tgtcgcgccg accttgaagc gtaagtgcta tctcggaatg 2880 tctttttggg atcaatttgg aatctttccg tcgctccgag agtcgtttat cgaaaccatc 2940 gacgattcgg aggagccgac agatgatgaa gtactgttga ctcccgaaga gcagaccgag 3000 ttagacgaaa ttaaaaagtt atttttagtg cccagaactg gagacattgg atgtacttct 3060 ctgctgacgc acacaattga aattagtgac gaatacaaag ataaaccacc aatacgacaa 3120 aatccttatc cgtggagccc catggtacaa agtaaaattg gaattgcttt ggataacatg 3180 attagagacg atattgttga gccttctcgt tcagaatggt cgcaacctgt tgtaccagtt 3240 tctaaaagag acagcgatgc tgtacgcctc tgtctcgacg cgagaaagtt gaatgagcgc 3300 actaagcgtg atgcttatcc acttccgcat caaaatcgca ttctgagcag attaggttca 3360 tgcaaatatt tgtccacaat tgatcttagc caagcatttc tacagattcc acttagtgaa 3420 gaatctcgta aattcactgc tttttctatc cctggacgtg gtctgtttca atttaaacga 3480 ctaccgttcg ggctccttat ccgtggagcc ccatggtaca aagtaaaatt ggaattgctt 3540 tggataacat gattagagac gatattgttg agccttctcg ttcagaatgg tcgcaacctg 3600 ttgtaccagt ttctaaaaga gacagcgatg ctgtacgcct ctgtctcgac gcgagaaagt 3660 tgaatgagcg cactaagcgt gatgcttatc cacttccgca tcaaaatcgc attctgagca 3720 gattaggttc atgcaaatat ttgtccacaa ttgatcttag ccaagcattt ctacagattc 3780 cacttagtga agaatctcgt aaattcactg ctttttctat ccctggacgt ggtctgtttc 3840 aatttaaacg actaccgttc gggcttgcaa acagtccagc tagtctcagt aaactcatgg 3900 acaaagtgct aggatttgga gaattagagc ccaatatatt cgtatacctg gacgacctgg 3960 ttgttgcaag caagtctttt gaagaacata aacagcattt aacggaatta gcgcgcagat 4020 tgggcgaggc tgatttgcgt attaacattg aaaaatctaa agtttgtgtc ccggagttgc 4080 cttatttagg ttacattttg tcaaaagacg gtcttaggcc caatcccgat cgggttcatg 4140 cgatcgtggc ttatgaggtg ccaaagtctg ttcgagcgct gcgtagattt ctcggtatgg 4200 ctaactatta tcgcagattt atcagtcatt tcagcgagct aacagcgcca ctcacagatc 4260 tgctcaaaaa caaacctaag aaggttgagt ggtctgctga ggcgaatcgg gcattccggg 4320 caatcaaaga aaggttaata agtgcaccgg tgatggcaaa cccggatttt ggtcttccgt 4380 ttacggtgca aaccgacgca agcgaccatg caatcgcggg ggtgctcacc cagattcaga 4440 acggggaaga aaaagttatt gcgtatcatt cagagaaact caaaggcgcg gaactgaatt 4500 atcacgcaac ggaaaaggag ggtttggcgg ctttgagatg tattgaaaaa tttcgctgtt 4560 atatcgaggg aacagaattc accttagtga ccgactcatc ggcactcacc tttattatga 4620 ggtcaaaatg gaaaacatca tcaagattat cacgctggag cattgattta cagcagtatc 4680 atatgcttat caaacaccgt aagggtaaag aaaatattgt gccagacgct ctctcacgat 4740 cgttggaggt cgcggagatc gcggacggat ctgactggta ctcaaaacta tttgcttcag 4800 tgaaaaatga tcctgaaaac taccttgatt tcaaaattga agaaaataaa ctttataaac 4860 ttgtgtcttc tcagtccgac gtgttggatt actctttcga ctggaagcaa tgcgtcccag 4920 attcaatgcg ggaagaaatt ttgaccaagg aacatgacga tgcgtttcat gtagggtatg 4980 agaaaacggt ggagaaaatt aaaaaacgat tttattggcc ccgaatggca gcagatatca 5040 agcgacacgt gttgaactgt gatacttgta aacgtatcaa gcattcaact acgtccgtgg 5100 tcccggaaat gggtaaccaa cgggtcacga ccaggccatt ccaaatactc actctagact 5160 atatccaatc cctaccgagg agtaccaagg gcaatgctca tcttttggtc ctaatggacg 5220 ttttttcgaa atattgtctt ttggtccctg ttcggaagat cagttcgggt agtgtgtgtg 5280 agattttgga acagctttgg tttaggcgtc tctcagtgcc acaatactta atttctgata 5340 acgctacgac ttttttgtct aaagaatttc aaaatttgtt gaagaaatat ggaatacaac 5400 attgggcaag tgcgagacat cgtagtcaag caaaccctac ggaacgacta aaccgtacga 5460 ttaacgccat gattagaagc tacgtacggg acaaccaaaa attatgggat actagaattt 5520 ccgaggtcga gttcgtattg aataatacag ttcatgcgac gaccaagttc acgccgtata 5580 gggtaatctt cggccacgaa atagttacaa aagggtcaga acacagattg caaacaatgg 5640 gtgaattaac ggatgaggaa agaattgaaa agatgggaaa tatcagtact tcgatctacg 5700 acaaagtagt tcatcacctt cggaaagcgc acgaagtgac gaagcagcgt tacgacttga 5760 gacacaaacg ttacgctccg acttttgatg ttggccagcg agtgtacaag cgaacgtttc 5820 gtcagtcttc cgccggtgat cacttcaacg cgaagctagg tgatgtctac gaaccgtgcg 5880 tagtcctagc gaagaaaggt acctgtgcgt acgaagttgg tgatttgaac gggaaaagtt 5940 tgggagtgtt ctccgctggt gatttaaaag cttgaaccca gcacggatat gctgtgaaat 6000 atgttggttg atgatttcag acgtggagag atgggagcga ccgccgtcaa ttatgtttgg 6060 aggttgtaat cctgatcctg tcagctctgt ccatgccatg ttccacgaag aaagtcccgg 6120 gcaaaccaaa atgggaattt aaaaggtacc ttaaacaaat gaaatttggt tttaaaatgt 6180 tatttattta ttgttttgag attttgagtg aaatttaaat tgtttgtttg tgcattgttc 6240 tgtgctacat tttatcgctt cttatggatc gtaaatcttc tttttgatct tatgagcgag 6300 ttgttttcgg caaaggtcag tgtacccaac aaatgagatg gtgagccgtg agaagtcgga 6360 tgaactttcc tataacacaa tagaatttat ctcacttttg agacaatgca atatgaccga 6420 agtcttgagg acgctaattg cgttggaaat tgcctaggtt taacaaattt agttcttttt 6480 ggcagttatg aacaccacac actaggattt tcaattaatt tgcacaaaat ttgattccct 6540 tttctttaag aaagtttcta cgaagatttg aactgtgtaa atgatcttgg ccaatcattt 6600 agcaccaaaa ttatagtttg tcttcgtgtt tcgcgaaagt aacttttctt aaagtggact 6660 gaatcgaccc acatgaaagc gaacacaact tgtaaacttc acacaaaaac caaaatcgtt 6720 atttggaatt ttcacgattt gctcagttta tcacgagaac agcgcgcgat tttgcaattt 6780 tcccttggtt tttccaagat ggatccaatt tttatcatat gctccggagc gattaaaata 6840 tatttgtagt ttgtattcat ttatttaatt gtagtttatg tcgtatttaa tttgttatgt 6900 ttagttttga acgcattctt tcatgatgtt gttcatggtt gttaactgtt tttgatgtat 6960 ttataatagt aacgctatcg ttcatttttg ttctctggtt gtacacagtt tgatcagtat 7020 tttgttgaat ggtacaggtg cgacattctg ctaatagtca gaaggtttcc ttcaaaaaaa 7080 atattgtgat ccaatatttt tttccccgag cggtgcggta g 7121 // ID hAT-1_BF repbase; DNA; INV; 2363 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-1_BF autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2363 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2363 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 919-919 (2008). XX DR [2] (Consensus) XX CC This transposon is likely incomplete (the C-terminal portion of CC the hAT transposase is deleted). XX SQ Sequence 2363 BP; 694 A; 576 C; 613 G; 480 T; 0 other; cagggcttta gccagcgtcc gtttttccgt caattgacgg aaatttgctg cctgtgacgg 60 aaaaaaaatt aaaccattcc gtcaaccttg acggaaaaaa aatcacggag aaagatatga 120 aatctgctga atttaaataa agtacggcta taccacgccc aaacacccga aaaactgccg 180 cgaaaaccac gccaaaaccc ccgcgggccg atttttccca gcgcacggca agcgctggcg 240 ttccttattt gcattaaaag aatacactgg gcgggaaaat tgacgcgcca tgaccaccgg 300 cgccagaagg tctcagttga agtctgttat atgcgttttt gtttactttt aatgttaaac 360 aacatgtgct aactgtaaca taacaataaa atgaatttac ataaagaacg gggccgactg 420 acccgactcc gacatactgc tacgtgacat aaacaacggc gcgggaacgg gcccaacttg 480 cgctaactgt tacatgacaa tagaataaga taaacatcga gcgtcttacg agcgcaacat 540 atgggctaac tgttacatga caatagaata aaataaacaa cgggcgtctt acggaaacaa 600 catgcgctaa gtgttacatg acaaaagact aaactggtcc gacgggcgct tgcactagct 660 ggcttgcact gtcagtcact cagttgaatt tcaacagccg tgatcgtcac atctttcgtt 720 tcctctcccc gtaccagaag aatgcaaaca tgaagagaac cttgagtgac ttttttaagg 780 caagtccgtc gaaggagaag caggtcagct cggtcaaggt aggcgactct aatgtctctg 840 ctgtcaatgt accggtatcc gtgtcaactc cgccgaagaa aagccaggac aagtcggaaa 900 agagaagcga acctaccgtc caaccatcca cgtcatcgcc atgccccccg aagaagaaac 960 agtgcaatag gaagttcctg cagagctggc taagcattga aaggtttcgg gactggctag 1020 agtacaggga agatgaggac aagatgtact gcaaattgtg catcaagcac cggaagaagt 1080 ctaagttcac aaggggaagt acaaccttcc agaaagatac actggaagat cacgcggatt 1140 ccaatacgca caaggcagca gaacgcgctg aaaaaatgtc cggcagcatt gtcaacgcgg 1200 cgaagaaggt tgtgagcagg gcagagaccg gtgttgtggc cgcaatgaaa aacgtgtact 1260 tcttggcgaa agaagaacta gcgaccagga agcacaagcc gctgctggag ctgcagaaag 1320 acgtggggtg cgaggccgtg tcctacttgc agacgggggg caacgccacg tacgattccg 1380 ctcaggcggc caatgagttt caggacgcca tagcagccgt cctcggagaa aatctggacg 1440 ccgagattgc cgacagtgcg gctgtttcag tcatgatcga cgaaagcaca gacattagcg 1500 tttccaaaaa cttgattgtg tttttgagcc tgtgctgtaa cggagagatc aagacacggt 1560 ttctcaagct tcatgaggta gatggaccag ccaacgctgc aaacatctac caggcgctac 1620 taagaacttt ggaagaacga ggtattccaa tcgccaaagt ttgtggactt ggaactgatg 1680 gggctaacgt tatggtgggg agaagaaacg gcgtcgctac tctccttaaa caggagaacc 1740 cacaggccta caccacccac tgtgcggcac atcggcacag cctcgccgta agccaggccg 1800 cagcgaactt cccgtacctt cgccgggtgc aaaatattgt aggtacgcta catacgtact 1860 ttgcaaggtc cgggaaacga agccaggagc tgaagaacgt gcagcaggct cttgaggagc 1920 ccgttctaaa gaatctgatg ttggagctcc ataaagtgcg gtggttaagc tttggaaact 1980 gcattgagaa cattaaccgc tcactacgct cgctgttgga gctgttcagg gatgaggcag 2040 aagaagaccc acaagcggca ggcctgtaca gcgagttgaa gaactttaag ttcatgtaca 2100 tgttagagga cgtgtttgcc atcttggaca tgatcagcac agtgttccag acgcgagcgc 2160 tgaacttctc ccagatagat cccaccatca ctggcgccat tgaaagcctg gagagcattg 2220 cggccgggca tcatggacca cgcttgacag agtttctgga agcagacttg cccaacctgt 2280 tccccggagt cgaagtcagt aacaacgatg agaggacaag ggaccaggcg caaaacctga 2340 cagtcaactt tgtcgcggcg gtg 2363 // ID hATm-34_HM repbase; DNA; INV; 2525 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-34_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2525 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1928-1928 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 421..2232 FT /product="hATm-34_HM_1p" FT /translation="MAYKYCQSSVEKNKETYKKNLVKFQDSHKSRQFPMLE FT TSFQEYSKMPLKSDIVSRFKYLSSTDVKAKKQRIDLISEEVISLWKEQLNF FT PCLTEQSVKRKINDVIEKYESLRKKGNADLLQKEIFDITNQNGIWLSSEDK FT RFYEIQLQSKGEIGYATSKLAGKKTIHPSIVQKLSSQPSASKSFSSFTESS FT KIERDNSSSEFEETDEQFEIDNEITDEEYNNLKKRRKKYSSCKIATKLVVC FT AKVSTKKASKIGKILSSEGIHVPAPSQSGIYKALFNEAKQFKETLIQDLKN FT ENWSLHFDGKHIDQWEYQVVVLKNERSEIKLGALKLIDGKSITIINAIADI FT IDEFKLWNSIKMIVTDTTNVNTGKKNGVVVKLQDIFQKKGLQKPQFVPCQH FT HILDRILRLEMEEKLGENPNSPNINYFFTNELLEHYETLKTNFYNGTEEIK FT DTGGWRDDMKFLFHLTRTYRYYDQNKIFPNVRFQKIPNISNARWSSRAILA FT LLAYILMPKTRNKLESVCTFISYKWADFWFDDQTYKENTYSDLHEVLLNKK FT KPLATFTSHWNPEPSAINIPRSNQCAERAIKVMEELYRLCRNKDNLPLRFI FT LSNKQ*" XX SQ Sequence 2525 BP; 972 A; 348 C; 403 G; 802 T; 0 other; gggatatgac ttttttaact ttttatacaa tttggacaac cccatattat ttttggatga 60 gtttttgcta taaaacaatg aagaaatgtt tattttcctt caagtgtaaa aaaatgtcac 120 gcccctcggc tgaaatgttt tttaatacaa tatagatagg aatataatat actatacata 180 ttacatatgg gccagaaaac ataagttaaa gaagtcataa ccctctatat atattttatt 240 atattatatt attatattat atataataaa atatatatag acggttaata tatattatat 300 atataatttt aaatacatac cagacgtatg cattaccact ttataatttc tagtttttca 360 ctgccgtgcc taattgttta ttaattagtt tcagattttt ttgtgctgct tttaattaaa 420 atggcataca aatactgcca atcaagtgtt gaaaaaaata aagagactta taaaaaaaat 480 cttgtgaaat ttcaagattc gcataaaagt agacagtttc caatgttaga aacatctttt 540 caagaatatt ccaaaatgcc attaaaaagc gatattgtta gcaggtttaa gtatttgtct 600 agtactgatg tgaaagccaa aaagcaaagg atagatctta tttcagaaga agttataagt 660 ttgtggaaag aacaattaaa tttcccatgt ctgacagagc agtcagtgaa gcgaaagatt 720 aatgatgtta tagaaaaata tgaaagtctc agaaaaaaag ggaatgccga ccttctgcaa 780 aaagagattt ttgatataac aaatcaaaat ggcatttggt tatcatctga agataaaaga 840 ttttatgaaa ttcagttaca aagcaaagga gaaattggat atgctacttc aaagctggca 900 ggaaagaaga ctattcatcc atcaattgtg caaaaacttt ccagccaacc ttcagcatct 960 aaaagttttt cttcatttac tgaaagtagt aaaatagaaa gagacaactc tagttccgaa 1020 tttgaagaaa ctgatgaaca atttgagata gataatgaaa ttacagatga agaatataat 1080 aatttgaaga aaagacgtaa aaaatatagt agttgtaaaa tagcaacaaa gttagtggtt 1140 tgtgctaagg tatcaactaa gaaagcttct aaaataggta aaatactgtc ctcagaaggc 1200 atacacgttc cagcgccatc acagtctgga atatacaagg ctctttttaa tgaagcaaaa 1260 caatttaaag aaactttaat tcaagattta aagaatgaga attggagtct acacttcgat 1320 ggaaagcata tagaccaatg ggagtatcag gttgtcgttc ttaaaaatga aagaagcgaa 1380 ataaagttgg gcgcactgaa gttaatagat ggaaaatcta tcactattat taatgccata 1440 gctgatatta tagatgaatt taaactatgg aattcaatta aaatgattgt gacagatact 1500 acaaatgtca ataccggaaa gaaaaatggt gttgttgtaa aacttcagga catttttcaa 1560 aaaaaaggat tacaaaaacc tcaatttgta ccatgtcaac atcatatctt ggatagaata 1620 cttagactag aaatggaaga aaaattggga gaaaatccga actcaccaaa cattaattac 1680 ttttttacta acgagttgtt agaacattat gaaacactga agacaaattt ttataatggt 1740 acagaagaaa taaaagacac aggaggatgg cgagatgaca tgaagtttct gtttcatctg 1800 acgagaacct accgatacta tgaccaaaat aaaatatttc ctaatgtaag atttcaaaaa 1860 atccctaata tcagtaatgc tcggtggagt tcaagagcta ttttagcgtt gttagcctac 1920 attcttatgc caaaaacaag aaacaaactt gaatctgtat gtacctttat ttcttacaag 1980 tgggcagact tttggtttga cgaccaaacc tataaagaaa acacttattc agaccttcat 2040 gaagttttat taaacaaaaa gaaacctctt gctacattta caagtcattg gaatccagaa 2100 ccatcagcaa tcaacattcc cagaagtaat caatgtgctg aaagagcaat taaagtaatg 2160 gaggagcttt atagattgtg cagaaataaa gacaatcttc cacttcgttt cattttgtct 2220 aataaacaat aactgtatgt atttaaattg tataaaatgt tttttcctgc attgaatata 2280 acactatgcc atgtattaat tgttagaata aattatgttt ttttatagaa tcatatgaat 2340 gaacatgttt gcaaatagaa atgattttgg tcattaattt ttttaaaacc tttttaaatc 2400 ttaatgtgct ttggggcgtg acattttttt tgaacatcgt gaaagtatta atattttcgc 2460 attttatgcg taaaaactca tccatttttg ggccagggaa atttttttta aaaaagtcat 2520 atccc 2525 // ID BEL-600_AA-LTR repbase; DNA; INV; 443 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-600_AA_; KW Pao_Bel_Ele215; BEL-600_AA-I; BEL-600_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-443 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 443 BP; 163 A; 65 C; 74 G; 141 T; 0 other; tgttagcacc actgtgcaaa cacatcaaca cccctgtgga aaacattcgc gactagatga 60 catatgtcaa gttaatgtca gaaatagaag atacctgaaa gtaaaaagtc aattgaattg 120 ttaaagttaa atctatatag tttttccaac tgctttttct agttttgtaa aacaattaac 180 tttgaggaga atcggtaaga atgcttatta aaagtcaatg aaatttgatt gtatttgatt 240 atattatgta gcaaaatacc agtggcaaaa cctacgggac gatgataagc acgtttaaga 300 agcgggcaag aaattactaa aattgtaagt ctattatcga aataactatt aaatgatgaa 360 ttcaaatata tgcttttttt tagtttgagc tgtacttaac gagctgctac agaaattacg 420 ttttttcatc aaatctccga aca 443 // ID LTR1_SM repbase; DNA; INV; 947 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-947 RA Jurka J.; RT "Long terminal repeats from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1861-1861 (2009). XX DR [1] (Consensus) XX CC >85% identical to consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 947 BP; 413 A; 123 C; 196 G; 208 T; 7 other; tgttgaggat acctaaataa caaaacaaaa ataattatta aatgcgtaaa atgattgggg 60 gaaatgatga taaatgaaag gagtgttgtt ccgaaactca gtaaaataga agcggaaaag 120 agatatcgaa aaatgagaag ggaacgaana aatgaaaaaa tcgaaaaatg agaaggaaat 180 aaaaaaatca ataaaaacag acaattaaat angngaaggt cattcntaat taacaagtac 240 aaattcaaaa gaagaaaaac aaaaacaaag ttaaaattat cgtgacttat ctcganagcc 300 cgttcgagat aaggaaaaaa tgacactggg aagccgagaa atggcgtcgt cgaaanattg 360 tatatctcga cagccgtcag agattacgga gtgcccaatg gaacangttg aaaataaatc 420 tattatcagt aagcgccatt gggttatgga gtataaaaag gaaacgcaac tggaatcaga 480 gagagaatta gaagtcgaga aaacggagac acgaagcaga aactaaaacg aaacgaaaag 540 aaagtagttt agaagaaaac gaaaataata cagaaaattg tgaagagaag aaagcagaaa 600 ttgtgaggaa agaaagaaag tgaaatcaaa gaaatcaaag ttattgaata aattcgagtc 660 aaagcaaaag caactaaatc aaacttatcg atctcagaaa cccgcaatag tagtcaggtg 720 attgtgaaat ccaattggta ttgaaaacgt gctattcctc ccgaagggtt gtaataagca 780 cccagttgtc attgtaacag ataattcttt gtgtcattgt aaagtggtga taaagtgtgt 840 taatcaataa agtttatatt gtctggagct cgttattgtt gtttaaaagg cacgcttgga 900 gtcagattaa caacagataa tatttaactt caccgacaac cccaaca 947 // ID Ginger1-3_HM repbase; DNA; INV; 2649 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.01, Created) DT 02-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2649 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 142-bp long. One intron is pos 346-547. XX FH Key Location/Qualifiers FT CDS join(181..345,548..2251) FT /product="Ginger1-3_HM_1p" FT /translation="MYKNINFNDVRKYLNDNAFPEDVNERGKKANFKRQCK FT NFVIVDNKLMYCKKGIGEVIVVIDQEEQLNIVKEVHEGLGTSEKAVALASH FT LGINAVRKQISSRFFWHSIVEDITKYVAQCERCQKTSNRNLKVSPALKPVK FT VEQQVMKQVGVDLIKLPESNGFNYVIVLIDYFSKWTEAEPLIDKTAVSVAA FT FLYRVICRHGCFQIQINDQGREFVNSVSIALHDMTGVQQRVTSAYHPQANG FT LVERQNQTIKKAIVKVLNENVKSWASVLDGILFALRVKVHDSMGYSPFYLM FT YNRQPYLPIDVKYTNVLNGDNYNPVYDKEVLQKALQMKRGLEEKAMTNIVC FT AQKKQKIDYDRKHTTPSNFFVGQKVLLRNLKRENRKGGKMTFSWLGPYEIL FT DILSNSTCLLRNIKGNNTIKKKYSLHHVKPFVEEKSNSKLHSLTSECNDQD FT KSQRNSKLMKHPTKEVLQKIAEEQGLKIFIMPDLFDGEENDVPSKIHVCKG FT DGNCFFRAISFLLTGSECQHTRVRNVVVKHMVSKPCSSLLSGYLGDDVENY FT ITKSGMATDSIWATDVEILGTANLIGIDIAVWSFTGQKLAWLKYPASMKLD FT QLTAHSILLENKNNHYNVVLSLKAL" XX SQ Sequence 2649 BP; 966 A; 374 C; 449 G; 860 T; 0 other; tgtagcgttt ttaaattact gcgttaaaat tttactgcgc ggaagttttc aaccgttcaa 60 acttcatttg cggtaaaata attatactga ggcagtaata ttttaacgtt aaaattttac 120 ttcactcgtt ataattttac tgctttttag tttacaattt atatagctta ttttgtcaag 180 atgtacaaaa atattaactt taatgatgtc aggaaatatt taaatgacaa tgcttttcct 240 gaagatgtga atgaaagagg aaaaaaagca aattttaaac ggcagtgtaa aaactttgtt 300 attgttgaca ataagttgat gtattgcaaa aaaggtattg gcgaggtaag gaaaacttgt 360 ttttataaaa gaagttttta tttaatttaa aaggttttac ttaaataaaa taaagtaatt 420 aaacaatatt ctttattgtg cattattttg aattaacaca acatttttca aaaattcgaa 480 aatttatagc tttaaaacaa ctcaaaataa tgtaaaaatt tcatacaaat aatttttttt 540 attgtaggtg attgtggtaa tagatcagga agaacaatta aacattgtta aagaagtcca 600 tgagggtctc ggtacttctg agaaagctgt tgcattagct agccacttag gaattaatgc 660 tgtgagaaag caaatttctt caagattttt ttggcattca attgtggagg atatcaccaa 720 atatgtagct caatgtgaac gatgccaaaa aacctctaac agaaacctaa aagtgtctcc 780 tgctttaaaa cctgttaaag tagaacaaca agttatgaaa caagttggtg tagatttaat 840 taaattacct gaatcaaatg gattcaatta tgtaattgtc cttatagatt atttttctaa 900 gtggactgaa gctgaacctc ttatcgataa aacagcagta tcagtagcag cctttcttta 960 tagagttatt tgcaggcatg gttgttttca aatacaaatt aatgaccaag gtcgtgaatt 1020 tgtaaattca gtatccattg cacttcatga tatgacaggc gttcagcaac gtgtaacctc 1080 tgcttaccat ccacaagcca atggccttgt ggagagacag aaccagacaa ttaagaaagc 1140 aattgttaag gtcttaaatg aaaatgttaa aagttgggcc tcagttttag atggaatcct 1200 ttttgcttta cgtgttaaag tgcacgattc aatggggtat tcaccatttt accttatgta 1260 taaccgacaa ccctatcttc caattgatgt aaagtacaca aatgtactca atggggataa 1320 ctacaatcct gtatatgata aagaagtttt acaaaaagct ttacaaatga aaagaggact 1380 tgaagaaaaa gccatgacca atattgtttg tgcacaaaag aaacaaaaaa ttgactatga 1440 tagaaagcat acaacaccat caaatttttt tgtgggtcaa aaagtactat tgagaaatct 1500 aaaaagagaa aatagaaaag gtggaaaaat gactttctcg tggcttggtc catatgagat 1560 acttgatatt ttgtcaaact caacatgttt attaagaaac attaaaggga acaacacaat 1620 aaagaagaag tactctctac atcatgttaa accatttgtt gaagagaaaa gtaattcaaa 1680 attgcatagt ttaacttctg agtgcaatga tcaggataaa agtcaacgta attccaaatt 1740 aatgaaacat cccaccaaag aagttctcca aaaaatagct gaagaacaag gacttaaaat 1800 atttataatg ccagatcttt ttgatggaga agaaaatgat gtaccttcaa aaatacatgt 1860 ttgcaaaggt gatggcaatt gtttttttcg agcaatttcg tttttgttga caggttctga 1920 gtgtcaacat acacgtgttc gcaatgttgt tgttaaacat atggtatcaa agccttgttc 1980 cagtcttttg tccggatatt taggcgatga tgtggaaaac tacattacca aaagtggtat 2040 ggccacagat agtatatggg ctacagatgt tgagatcctt ggtactgcaa atcttattgg 2100 aattgacatt gctgtatggt ccttcactgg acaaaaacta gcatggctca aatatccagc 2160 atccatgaaa ctcgaccaac taacagctca tagtattctt ttagaaaata aaaacaatca 2220 ttataacgtt gttttgtcac taaaagcttt gtaaatgaaa tgatgtttaa actgaactaa 2280 aatgatgttt gtatatcagt tgttttacaa ggttgtttgt tataaaaaaa taaataattt 2340 gctcaaggtt gcatattctt tacaacttaa aaaaacgtaa ataaacttgt ttgtaataaa 2400 attctacaga agtgatttta tggatcgcta ggctatttaa tttattctaa gattagattt 2460 tgtcgcacaa taaaggaaaa tcgctttact tgatgacagt aaaattataa cgagcgcagt 2520 aaaattataa cgttaaaata ttactgtctc agtaatattt taacgttatt attttaccgc 2580 aaaggaagtt tgaatggttg aagacttccg cgcagtaaaa ttttaacgca gtaatttaaa 2640 aacgctaca 2649 // ID Jockey-12_AAe repbase; DNA; INV; 4387 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-12_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4387 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1378-1378 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 218..1495 FT /product="Jockey-12_AAe_1p" FT /translation="MDDCDHDQANKRRRFSNNNDSLPEYDSISSQQLQPQP FT MVETENRFDLLADMDPDEVRSTNVNINSQVASTINTKKCRLPPISIINIGT FT KQTRELLNLANIPQTEYHMKAVKSGTQLTVSNEDYFNAAIKALSDGNKEFF FT THTPTSKQPVRIILSGLPLYDLDELKEELHLHGVRPLEMKVFFSKVFGSEE FT SVLYLLCFEKGSVKLTELQKITGLFSTTVKWRFFSKRATDAVQCYRCQKFG FT HGMRHCNVSPLCVKCGEKHTTTSCRLPTKAALKDLDTAERQTLRNGIRCAN FT CSGNHTANFRGCPARKNYIRELETKRMRSKKNVTGVHSQSQPENRGSASIH FT ASEPSHNCQPNNVGGLTFSQVLQGRAQRPNTQQHVAEESADLFSVTEFMCL FT AKDLFQRLKGCRSKTEQFLALSELILTYVYNV" FT CDS 1491..4160 FT /product="Jockey-12_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSNTATLRLVNWNGRSIHSKQLEFFEFAERHNIDVAV FT VTETWLRPSISFLHPNFHCIRLDRPSNDEVDRGGGVLIATRKELQYSQLNV FT STKVIEATGISISTSSGPVHIIAVYFPGSRRCADWSLFRRDIRTLSNRTEP FT FFLVGDLNARHRRWNCSKGNRAGNLLWQEAARLGLYVSYPDSPTFIPFGRG FT KPSTLDLVLSNNQLDMTKPVTQNELSSDHLPVTFEIKPSSEPEQLVHTVRC FT YARANWITFQRIVNSKLNITDPLITDIRDEVGVDAAIDYFKNSVLEAERIA FT VPTIVPRPYEVACIPNETKLLIQLRNRRRRQWMRTRDPLLKEIVSSLNNNI FT KEQCAKARYNKFAETLQTIDRRDHSVWRISKALRNKCKYSPPLRSGNTLFA FT TPTEKAKLLAESFAQSHNNTVDGDSETVETVRRSVDQINQLLPAADYSWLV FT KPKEVSKVIRNLKTKKAPGHDGIRNCLLKHLPRKGSVFLAKIFSACLKLCY FT FPPSWKQAIVVAIPKPNKDSTLPSNYRPISLLPTVSKLFERIILARIEKHL FT ETTSIIPHDQFGFRKGHSTSHQIVRLVNEVRQNFQQGKSSGLVLLDVEKAY FT DSVWHDAILHKMFLGNFPVYLLKVVREFLKNRTFQVMVNGCTSDRMGVPFG FT VPQGSVLSPTLYNIFSADLAKIDGVNYYCFADDTGFLTSDRDAQVIVENLQ FT RAQETIQQYHKKWKVKVNALKSQAIFFTRRRSPRHMPQNQVICGGIVIPWS FT DTVGYLGVTLDKKLNFGTHVTSCLRKCDMLIKTLYPLINRRSRLDPNIKLL FT LYKTVFRPTVTYGFPAWSNCARSHRKKIQVKQNRLLKMMLNLHPSHPTDEV FT HQLAKIELVDDWFLRLMPKFRLSCFASANPLIQELAI" XX SQ Sequence 4387 BP; 1285 A; 1046 C; 908 G; 1147 T; 1 other; tttcagtttc gatctgtcaa ccggagagtg tggacgtaag agctacaatt aatttcggtt 60 caatcggtta tttttgcgtg attaaatctc gaaagttatc aaagtggcta gcggcagtcg 120 tctgcggccc aaagatgttc ccagcagtgg atcaaacata attattcgcc gtaattacgt 180 taaatcatca aacccttcgt cgtttctaac ctcatcgatg gatgactgtg atcatgacca 240 agcaaataaa aggcgacgat tctcaaacaa caacgactct ctgcctgaat atgattcgat 300 ctccagtcag caattgcagc ctcagcctat ggtagaaact gagaacaggt tcgatctcct 360 ggccgacatg gatccggatg aagtacgttc gaccaatgtg aatataaatt cccaagttgc 420 ctcaaccatc aacacgaaaa agtgccgcct gccccctatt tccattatca atattggcac 480 aaagcaaacc cgcgagctgt tgaatttggc caatattccc cagaccgaat atcacatgaa 540 ggctgtgaaa tcggggactc agcttaccgt ctctaacgaa gactatttca acgctgcaat 600 caaagcgctt agtgatggga acaaggaatt tttcacacac accccaacaa gtaaacaacc 660 agtacgtatc attctttctg ggctgccgtt gtacgatctg gatgaattga aggaggaatt 720 gcacttgcat ggtgttcgcc cactagaaat gaaagtgttt ttcagcaaag tgttcggctc 780 tgaggagagt gtgctgtatc ttctgtgctt cgaaaaaggt tcggtaaaac tcactgagct 840 ccagaaaatc actggcttgt ttagcaccac tgtgaaatgg agatttttct caaagcgagc 900 aactgatgcg gtacaatgct atcgctgcca aaaatttggc catggcatgc gtcattgtaa 960 tgtttctcca ctgtgcgtaa aatgcggtga gaaacatacc acaaccagct gtagattacc 1020 gacgaaggct gccctgaaag acctcgatac cgctgaaaga caaactctgc gtaacggaat 1080 tcggtgcgca aactgtagtg gcaatcacac tgctaatttt cgtggctgtc cagctcgcaa 1140 aaactacatt agggaacttg aaacgaaacg aatgcgctcc aagaaaaacg tcactggtgt 1200 tcattcgcaa tcgcaaccag aaaatcgtgg atcggccagt atccacgcca gcgaaccgag 1260 tcacaactgt caaccaaata atgtcggcgg tttgactttt tcccaagttc ttcaaggccg 1320 agctcaacgc ccaaatactc agcaacacgt tgctgaagaa agcgccgatc tgttcagtgt 1380 caccgagttc atgtgtttgg cgaaggattt atttcagcga ctgaagggtt gccgaagcaa 1440 aacggaacaa tttctagctc tttccgagct aattttgaca tacgtttaca atgtctaata 1500 cagccacgtt acgwttggtt aattggaatg ggagatccat tcacagtaag cagttggagt 1560 ttttcgaatt cgccgaacgg cataacatag acgtagcagt cgtcactgag acttggctgc 1620 gtcccagtat atctttcctc catccgaatt tccattgcat ccgtctcgat cgcccttcga 1680 acgatgaggt tgacagaggt ggaggggtcc taatagcaac ccgaaaggaa ttacagtaca 1740 gccaattaaa tgtttctaca aaggttatcg aagccactgg aatctccatc tcaacatcaa 1800 gcggtcctgt acacataatt gctgtttact tccccggcag ccgccggtgt gctgattggt 1860 cactttttcg acgtgatatc cgtactcttt ccaatcgtac cgagcctttc tttctggttg 1920 gagatttgaa cgctagacac cgccgctgga attgttcaaa aggcaatagg gcaggcaact 1980 tgttatggca ggaagcggct cgtctcggtc tctatgtcag ctatccagac tctccgacat 2040 tcattccttt tggacgtggt aagccatcta cactagactt agtactatcc aacaatcagc 2100 tcgacatgac aaagccagta acccagaacg agttgtcgtc cgatcatctt cctgtaacat 2160 tcgaaatcaa gccttctagt gagcccgagc aactcgtgca tactgtacga tgctatgcac 2220 gagccaactg gattacgttc caaagaatag taaactccaa actcaatata accgatccac 2280 tgataacgga cattcgtgac gaagtagggg tcgatgctgc catcgattat ttcaaaaaca 2340 gcgtgttaga agcagaacgt attgcagttc cgacaatcgt cccaaggcct tacgaagtcg 2400 cctgtatccc gaatgaaacc aaactcctaa ttcaactgcg aaacagacgt cgacgccagt 2460 ggatgaggac cagagaccct ttgctgaagg aaatagtttc atctcttaac aacaatatta 2520 aagagcaatg tgcaaaagct aggtacaaca aatttgctga gactttgcag acaatcgacc 2580 gccgagacca ttccgtttgg cgtatttcca aagcgttaag gaacaaatgc aaatacagcc 2640 caccactgcg aagtggaaac acgctttttg ctacaccaac agagaaggct aaattacttg 2700 cagagagctt tgcccaatcg cacaataaca ccgtggatgg cgacagtgaa accgtcgaaa 2760 ccgtcagaag atctgtcgat caaatcaacc aactattgcc agcagcagac tattcttggt 2820 tagtgaaacc taaagaagtc agcaaagtaa tccgtaatct aaaaacaaaa aaagctccag 2880 gccacgatgg gataaggaac tgtcttctga aacaccttcc aaggaaaggt tctgtgtttc 2940 tagcaaaaat tttctcagct tgtctcaaac tatgctattt tccacccagt tggaaacaag 3000 ctatagttgt agccattcct aaaccaaaca aagactccac attgccatca aactacaggc 3060 caattagctt attaccaacg gtcagtaaac tctttgagcg cataatacta gcgcgcattg 3120 aaaaacacct ggaaacaact agtatcattc cacatgatca gtttggtttc cgcaaaggcc 3180 attccacaag tcaccagatt gtccgtttgg taaatgaagt gaggcaaaat ttccagcaag 3240 gaaaatcttc ggggttggtt ctcctcgacg tcgaaaaagc gtacgactcg gtgtggcacg 3300 acgctattct ccataagatg tttctgggta actttcctgt ctatttattg aaagtcgtac 3360 gagaattcct gaaaaatcga acgttccaag tcatggtgaa cggctgtaca tcagatcgta 3420 tgggagtgcc ttttggtgtt ccccaaggct cggtgttgag cccaacgctg tacaacattt 3480 tttctgctga cctcgccaaa atcgatggag ttaattacta ttgcttcgct gatgataccg 3540 gatttttaac ttccgatcga gacgcacaag ttatcgtgga aaatcttcaa cgagctcaag 3600 agactatcca gcaataccac aaaaaatgga aggtgaaggt gaatgcgcta aaatcccaag 3660 caatattctt cactcgccga agaagtccga gacatatgcc ccaaaatcaa gtgatctgcg 3720 ggggcatagt tatcccgtgg tcagataccg tagggtatct gggagtaacc ttagacaaaa 3780 agcttaattt tggaactcac gttacctctt gtcttcgcaa atgtgatatg ctcatcaaaa 3840 cattgtaccc tcttataaac cgacgatctc gcttggaccc caacatcaag ctgctactgt 3900 acaaaaccgt gtttcggccc acggttacgt acggttttcc tgcgtggtcc aactgtgccc 3960 gaagtcaccg gaagaaaatt caggtgaaac aaaatcgcct actgaagatg atgctgaatc 4020 tacacccgtc tcatcccacc gatgaagttc atcaactcgc caaaattgag ctggtagacg 4080 attggttcct aagactgatg ccaaaatttc gccttagttg cttcgcctct gccaatcccc 4140 taatacaaga attggccatc tagaactgtg atactatatt attaagcttt ttctctctct 4200 gtctatttcc ttcatgtaat ttctaaggtt ttttttctgt atatttttcc ttattcgtca 4260 atttttgttg cttggcctaa ttgttaatgt ttaaacatgt tcaatatcac tgctgtaagg 4320 taatccacat ctcagtaaca gcccttacca tcaaattgta taagttaagg ataaatatga 4380 aataaat 4387 // ID Sola2-1_DPu repbase; DNA; INV; 4114 BP. XX AC ACJG01001377; XX DT 17-FEB-2011 (Rel. 16.02, Created) DT 17-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE Sola2-type DNA transposon from Daphnia. XX KW Sola; DNA transposon; Transposable Element; Sola2-1_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Direct Submission to Repbase Update (09-FEB-2011). XX RN [2] RP 1-4114 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX DR EMBL/GenBank/DDBJ; ACJG01001377; Positions 18789 22902. XX FH Key Location/Qualifiers FT CDS join(1446..1778,1782..3050) FT /product="Sola2-1_DPu_1p" FT /translation="MDGIKQKCQDLKMKPESSIEIISLLTLAPVSWSRQQK FT STFFGVSDHLVRRSAILKRGKGLLAKPDKKQGRPITEDEKEIVRDFYLCDE FT NSRQLPGMKDYISVRTRKGEKNQVQKRLLLMNINELYVKYKEYCEHTLYLK FT SCGRSTFFMLRPCNVIEVDSSGSHNVCVCEKHQNVHLMVDTICGSNKEKLL FT LMDKVVCDILNQECMLKRCPRCPGTEPLKTFLTIIVANIPVIKFNQWETTD FT RSMLVQKELRNNDFVEELAKKIDTLTSHHIISKEQGKYCKELKARLLENEC FT ILQGDFSQNYSMIAQDSTQSSYFNPASQATIHTYLAILNVAGKISNHSICV FT ILDYMWHNTVAVFSFLKIVINYIKEINPHVNKIIHFTDGAASQYKNYKNFL FT NLAYHKNDFGISAKWHFFATSHCKGPCDGIGGTLKRLARRASLQNRIGGVV FT IQTPESLYDWCSRNVHNIKSFFVSQIDVQRNEDMLDERFKFGKRIPGTQSF FT HSFIPIDGNRVEARFLSISNESKIFTVNSQLAQTANYV" XX SQ Sequence 4114 BP; 1366 A; 695 C; 750 G; 1303 T; 0 other; agggtcaatc cagccgaaat catccaatcg ttggcgcgct catattttat aatttagcga 60 aatttggaca gtaggtttat actgtcgaaa ataacttatg ttccaaattt cgtccatttt 120 ggacgattag tttttaaaat atggcaaaat gaaaattagt ttttttcatt ttgccgtagt 180 ccatttttcc cgtgcgtttt ttgttttttc aatttgtaga cccattgttt gcaaaactat 240 gttttttaaa tacatcgttt taatcagatt ttcacgggct atcttttggt taaattattt 300 ttttttaatt ttagtttaag gttactattt tcggggcaaa gtttttatac cctcccctct 360 tcaaaaatca atatctcccc cggctgcaaa aatattcaca agccgttttc acaatgatat 420 agaggaaata tttctttata ttatatataa aaatgagaga ggtgttatct ctcgttggtt 480 tgatttaatt agtttagttt taaggttttt gacgaaaaat ggcgttattt cagtcccttt 540 cgaccttgag gggcggagct tcatctcttt cctttcgccg ctctcaaccc ctttattccc 600 ccaggcttac gaggggtgct ttcctctccg cgcgactttc aagctctcag tttcatttat 660 agatcctagt gttacgcatt cattaagaaa aatggcagtg aatagttcaa acaagtgctc 720 ggtttgcatt tttattcagg acgagtgctt tctttcaact tttgtgccag ttaattgctc 780 tatatcaaat ctattgatag cttaagcgaa gatttaagaa aaaccatttg ttacagaact 840 aacatgacga caataccctc taaatgtgga catcatttag tggcgttctc atcgcatttt 900 tctaaattca agcaatcaac aaagtgctca aatcctttaa gtattcattg cgtcgacggt 960 gttaaatctc gaaatattcc caaggctggg aaaaaatcaa tttctctgga atcatgtgag 1020 gctataaagt gtcatcgtcc tgatcttcac gtttatcctg gacaaaaact ttgcaccaca 1080 tgtattaaag agttggtgaa attaaaaacc aaggaaatta tgaagaaatc taacacgtca 1140 gagtccgagt tagaggacat ggaagaggca tttgctaacg aatttagtga taacaacgat 1200 gttttttcac cgatgagacc catttctcga tcacttaaaa ttacatctac gctagaagta 1260 acacccgtca acctcgatcc caaaagatcc aaaaaaaggc aatcagatgt gtttattcag 1320 aaggcagaaa agttaaaaat gcgtttctcc aacttcacga acctcttata ggtgaagaac 1380 tgttacttag ttcgaaatcc aacattggta attgcgattt tacatcggct ggttatgaac 1440 aactcatgga tggcataaag caaaaatgtc aagacctgaa aatgaaaccg gagtcttcta 1500 ttgagatcat ttctctttta acccttgcac cagtttcgtg gagccggcaa caaaaatcta 1560 ccttctttgg tgtatcagat catctagtca gaagatcagc aatattgaaa aggggaaaag 1620 gactgctggc gaaaccggat aaaaagcaag gtcgaccaat aacagaagat gaaaaagaaa 1680 ttgtgcgtga tttttacttg tgtgacgaaa attcaagaca actacctgga atgaaagact 1740 acatctctgt tcgtacaaga aaaggtgaaa aaaaccaata agtgcaaaaa cgattacttc 1800 taatgaacat caatgaactt tacgtgaagt ataaagaata ctgtgagcat actttatatt 1860 taaaatcctg tggacgttca acatttttca tgcttagacc ctgtaatgtg attgaggttg 1920 attctagcgg atcgcacaat gtttgtgttt gtgagaaaca tcagaatgtt catttaatgg 1980 tcgacacaat ttgcggcagc aacaaagaaa aactgttatt gatggataaa gttgtttgtg 2040 atattttaaa tcaagaatgt atgttgaagc gttgccctag atgtcccggt acggaaccac 2100 ttaaaacctt tttaacaatt attgttgcaa acattcctgt tataaagttt aatcagtggg 2160 agaccacaga ccgttctatg cttgtacaaa aagagttgcg gaataatgat tttgttgaag 2220 aacttgctaa aaaaattgac acccttacta gccatcatat catttcaaag gagcaaggga 2280 aatactgtaa agagttgaag gctagattgt tggaaaacga gtgtatctta cagggtgatt 2340 tttcacaaaa ttactcgatg atagcgcaag attcaacaca atcttcttac tttaatcctg 2400 cttctcaggc aaccatccac acttatcttg caattttaaa tgttgctggt aaaatcagta 2460 atcatagtat ttgcgtaatt ttggactaca tgtggcataa tactgttgct gtattttcat 2520 ttttaaaaat tgtgattaac tacatcaaag aaattaatcc acatgtcaac aagatcattc 2580 atttcacaga tggtgccgct agccagtata aaaattacaa aaatttcctt aacttggctt 2640 atcataaaaa tgactttggt atttctgcga agtggcattt tttcgccacc agtcattgca 2700 aaggcccgtg tgatggcatt gggggtactc tgaaaagatt ggctagacgt gcaagtttgc 2760 agaataggat aggtggcgta gtaattcaaa cacctgaatc gttgtacgat tggtgttctc 2820 gcaacgtcca taacattaaa tcattttttg tttctcaaat tgatgttcaa agaaatgagg 2880 atatgcttga tgagcgattt aagtttggta aacggattcc aggcactcag tcatttcatt 2940 cattcattcc cattgacgga aatagagtag aagcaaggtt tctatcaatt agtaatgaat 3000 cgaagatatt tactgtaaat tcccaactcg ctcaaaccgc aaattatgtt taatataatg 3060 attgtgttat tggttccaaa attgcatttg tatatgattt tgatggattg tggtatattg 3120 gtgaaattgt tgaaaaaaat gatattgatt gtgatttgaa aactgagttt ctcaaaccag 3180 atggttatga ggctcaaaaa agaggtttta tgtcatctaa caaagatgtt gcttttgtta 3240 atatttctca tgtaatcaaa attgtaaaaa ctttgaaaag taaatcaaaa tttgggagaa 3300 actttcaaat aaatctttct gaaagaaagg aaattgaaaa aatatacagc gcaatgatgt 3360 cataatacac tgtcaaactt caaaattcac acatcgatta atatcgtata cattttacat 3420 tttatgggtt tatttcttta gtttttgaag ctttaggtat ccaaaaactg ttttgaaagt 3480 cgcgcggaga ggaaagctcc cctcgttagc ctggggaaat agaggggttg agagcggcga 3540 aaggaaagag atgaagctcc gcccctcaag gtcgaaaggg actgaaataa cgccattttt 3600 cgtcaaaaac cttaaaacta aactaattaa atcaaaccaa cgagagataa cacctctctc 3660 atttttatat ataatataaa ggaatatttc ctctatatca ttgtgaaaac ggcttgtgaa 3720 tatttttgca gccgggggag atattgattt ttgaagaggg gagggtataa aaactttgcc 3780 cttaaaatag taaccttaaa ctaaaataaa aaaaaataat ttaaccaaaa gatagcccgt 3840 gaaaatctga ttaaaacgat gtatttaaaa aacatagttt tgcaaacaat gggtctacaa 3900 attgaaaaaa caaaaaacgc acgggaaaaa tggactacgg caaaatgaaa aaaaactaat 3960 tttcattttg ccgtatttta aaaactaatc gtccaaaatg gacgaaattt ggaacataag 4020 ttattttcga cagtataaac ctactgttca aatttcgcta aattataaaa tatgagcgcg 4080 ccaacgattg gttgatttcg gctggattga cact 4114 // ID MAG repbase; DNA; INV; 4564 BP. XX AC X17219; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Bombyx mori retrotransposon MAG DNA. XX KW Gypsy; LTR Retrotransposon; Transposable Element; MAG; KW Retrotransposon MAG. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4564 RA Garel A.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (20-DEC-1989). Garel A., RL Centre de Genetique Moleculaire et Cellulaire UMR106, Universite RL Lyon I, 43 Boulevard du 11 Novembre 1918, 69622 Villeurbanne RL Cedex, France. XX RN [2] RP 1-4564 RA Michaille J.J., Mathavan S., Gaillard J. and Garel A.; RT "The complete sequence of mag, a new retrotransposon in Bombyx RT mori."; RL Nucleic Acids Res 18(3), (1990). XX DR GenBank; X17219; Positions 1 4564. XX SQ Sequence 4564 BP; 1429 A; 867 C; 1067 G; 1201 T; 0 other; tgttatgttt gtaagtaagt attcacttca gtgtgtcaat aaacggtgaa cgcaacgtac 60 gtgtctatta tttactagtg gcgaacgcgg aaaaaaaaaa tcaatttgtg gcttgtttgt 120 ttcttgtttg gactaattaa aattatatga acataatgtc ggtgggacag ttgaaagaat 180 tcgcggtaaa cagtggaaac tggtcctcat atgttgagag attggaaatg tacttttttc 240 ctgaataaag tgactgacga tttaaattac ccacatttaa tttccattat gggtgaggaa 300 agcgtacgat ttattatcga cattagccag cccttcgaag ccgtcacaat taacatatgc 360 taatgctgta gcgatgttgg cagctcacct gcaaccgaag ccctcaatct tggcggaaag 420 atacaagttt cgacaacgtc ggcagctgaa tgagtcaata gctgactacc tcacagaatt 480 aaaaaaatta tcaaaaaatt gtgaattcgg ctcgtcgctg gatgaaaatc tgcgtgatca 540 aatggtttgt ggattaaaga gcgaaatcat acggcaaagg ttatttgccg aggagaaatt 600 agaatataag cgtgcagtta cgttagcttt gtcattagaa gctgcggaga gggatgcgat 660 tgcagtggaa cgtacgccaa ttgaggaggt taacaaaatt aacttcaatg aatgttcaag 720 atgtggagac aggagacacc aagcaaagga ttgcatatac aaagactacg tgtgcagttc 780 atgtcatgaa accggccatc tacgaagaat gtgccccaaa aacgggctta agaaccaggc 840 ggaggcagcc gggggttcag cacgcaccgg gcgccggggg aaccggggtg caggcggcaa 900 caggcgatcg tggcgcgggc agcgagtgcg ggacgaggag gccgagagcg ggctcctttg 960 cacttattat acgatcaaaa caacgatgac acatacaaca acttcaataa tgaacaggac 1020 gggtgtggag aagaaaatga gcctatgtac caaatgacat tgagtaatta taaaccggtg 1080 tgtattaaag ttaaagtcca aaattgcttg ctcaagatgg aggtcgatac tggatcggcg 1140 ttatcttgta ttagcaagaa tgtttatgat aaatattttt ctaaggagaa gttacaggca 1200 tgcttattaa atttaaaatt ttacgatggc tcaatcattc ggcctttagg gtttataaac 1260 actatagtga agtaccaagg tgtttcaaaa atgttagact tgtatgtcat agacaaagga 1320 acgactaatc tattgggccg acaatggtta gctgaattga atattaatat aaaaatatca 1380 aaacctacga gttttaaaat acaacacagt aactttgtaa ccgaacacgc acgagactat 1440 aataaattaa ttaatgaaat tgtttctaga cataaatcct tgttcgacgg cacattgggt 1500 aaatatacgg ggggcacagc agagttaatc gtgcggccgg atgcggtgcc tatctactgc 1560 cgcgcgcgac cggtgccgta tgcgctgcgc gagcgcgtcg atgccgagct cgacgcgatg 1620 ctggccgccg gcgtcatcaa accggtggac cactccgact gggccacgcc acttgtcgta 1680 gtacgcaaag cggacggcgg tctgagaata tgtgcggact ataaggtaac ccttaacaaa 1740 gtattagcaa tcgacagatt tccggttcca aaaatggagg acttattcag taatctcagc 1800 ggaaataaat tttttactaa gcttgattta tctcaagctt ataatcaaat agtcttatca 1860 gaacgttcta gcgagtacac ggttatcaat acacatagag gattatttaa atattcccgc 1920 ctcgtctacg ggttagcttc gagcccaggc atttttcaaa aactaatggt aaatatgttt 1980 aaaaatgtcc caaatgtagt agttttctat gatgacatat tgattagaaa tcaggaccta 2040 gacagtcatt taaagtctat aaaagaagta ttagatatat tagaaaggta tggcttaaaa 2100 attaaaagga gtaagtgcga gttcatggta acagaagtga ggtatttagg gttcataata 2160 gatcaaaatg gggttcgtgt agacccagag aaagtcaaat caatagcaac aatgccacac 2220 ccaaataacg tgacagaatt aaaatctttc atcggtatgg taaacttcta ttcgaagttc 2280 atacaagatt tgagtgcaca tttatcacct ttatatgccc ttttaaaaaa aggtaagcac 2340 tggatgtggg gaaatgaaca aaatgctgct ttcctgaatg ttaaaaagtt tttgtgtagt 2400 acaaaagcac tcgcacattt tgatatgtct ttggagtcgg tgttgactgt ggatgcgagc 2460 gcgcgtggtc tgggtgccgt gttggcgcag cgcgggccag gatgtcagga gcgcgtcgtt 2520 gcatacgctt cacgcgcact caccactcat gaattacact acagtcagat tcataaggaa 2580 gcattggcga ttgttttcgc ggtggaaaaa ttccatcaat acttatatgg gagaaagttc 2640 atactacgga ccgaccacaa acctctggtg agcattttcg ggcctaacat aggtatcccg 2700 agcgcggcag ctagccgctt gcagcgctgg gctattaaac tatcagcata tgactttgaa 2760 atcgagtacg ttaggacaga taaaaatgta gccgatgcac tttcacggtt aatcgagtct 2820 caaaaaaatg acgtagcttc cgaggaaaca gacttacccg agcaaaccta cttacacttc 2880 tcaacagaag ctctgttaat agattataat gttcttaaaa aacaaactag tagtgatcca 2940 atcttaagtc gcgtattaag ttatttaagg gacggatggc ccttagacat agaaattaat 3000 gaattaaaac catattataa taggaaaaac gaactttata ttgaattagg atgtataatg 3060 tggggtcatc gtgtggttat accctcttca tgcagaaaca aaataataac ggagcttcat 3120 gatccgcata tgggcatagt gaaaacaaaa tcgctagcac gtagttacgt ttggtggccg 3180 ggaatcgacg aagcgctcga gacggaatgt cgcgcttgca ccgtgtgtgc tgccgtcgca 3240 gacgcacctt ctacacacgc gccccgctcg tggccctggc cctcacgccc ctggtccagg 3300 ttacacttag acttcttagg ccccataggt ggagtcactt acctagttgt agtggattca 3360 tgctccaaat ggatagaggc aattaaaatg cagaggacaa cggcacaagc ggttatatcg 3420 gtactgcggg acttgtggtc aaaattcggc ctgccaaagc aaacggtcag tgacaatgga 3480 ccgcctttct cgagctctga ttttcagaag tttctgattc ataatggcat aaaacatatt 3540 tattcagctc cctatcatcc cgcctctaat ggggccgctg aaaatgcagt taaaatttgt 3600 aaacgtgcga ttaaaaaggc attaaaacag aatttaaacg tagatacagc tctgtgtaga 3660 tttttattag cctacagaaa tacagaacat gcaactaccg gcgatagtcc agccaatata 3720 ttacagggtc gaagtctccg tatgagatta gataatttaa aaccagaaag gcaatctcga 3780 gttatcgcgc aacaagagcg gagcgagcaa aacgcagggg gtgtgcagcg acaactcgag 3840 ccgggcacta aggtgtggta tcgggattat cgaggtctgg ataaatgggt acctgggacg 3900 atcctcaagc aattaggtag tcgtgattac tgtgtacgat ctagttttgg aaccgagaat 3960 cataggcacg tcgatcagct aaaattaagg gttacaaaaa atatagataa agttgatacc 4020 tttgcgtcga ttcgtaaaaa tataagtccc gaactgatcg cgcaaaattt agacttaaga 4080 acacaatcac gtaagttccg gctatcattt cccatgacaa gtggcgagga accagtggcg 4140 gtggtcaacg attcggggaa gagtagttca ccaccaaaaa atagcacacc ggcacaggat 4200 gctgtatcgc ctattcgtgg cagccctatg agacccgctg tgccgataga caagggagca 4260 tgtaatactg gtggtaagag tatcaatgaa aatgtgacac ccaaacgtga acgtcctgta 4320 agaatacgca agccaccagt tagatatgga ttcgaggaaa tagattaagt atatagttat 4380 aacctctctt tgtttttttt tactttagtt tgttatgtac gcttagttaa gatatatata 4440 tatatatatt aattgtgctt ataatgtgta gtaaataagg gtggaggtgt tatgtttgta 4500 agtaagtatt cacttcagtg tgtcaataaa cggtgaacgc aacgtacgtg tctattattt 4560 acta 4564 // ID Copia-1_DVir-LTR repbase; DNA; INV; 195 BP. XX AC scaffold_5826; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DVir_; KW Copia-1_DVir-I; Copia-1_DVir-LTR. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (07-MAR-2011). XX DR Genome; scaffold_5826; Positions 423 229. XX SQ Sequence 195 BP; 57 A; 36 C; 31 G; 71 T; 0 other; tgttaaaata gtgacaataa gctgttaagc ttctttataa tctggcaaca ttgtatcagc 60 atagtactca ttctgtaatt gagtacttgt tcagattttt ggactcatgc ttggaagttt 120 ccattaaaca tttaaccgaa ccactacgtg tgttttcttt atgtgcatct taggcacact 180 agaaatttcc caaca 195 // ID P-13_HM repbase; DNA; INV; 3505 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3505 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 359-359 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 100..2574 FT /product="P-13_HM_1p" FT /translation="MVISCSAYGCVNRFNKGVAISFHKFPLNNSELCQKWV FT VATKRDSFVPTKYSYICSVHFHKEDYNYENANKPRLKANVVPSIFNFPAKM FT HSKKPIKRKYVSRNVTETFETINEKKISLSLPCSSKNHILTSVDFTIENPN FT NKVDSPSKVKLKRKIKTLKQKLRRKELKINTMTEIIKKLENDKLISADTAQ FT FLDNSFSGLVCDLFKSELVNKCKDPRGHRYSEEVKKFALTLHFYSPRAYTF FT VSSLFALPCISSLSNWSSSIDCSPGFFKDVFSYLQQKGLEDVTYKDCALIF FT DSMHIKSGLVYNPSNGNYEGFSDYGNNISAFDPNSIATEALVFMLVGLRGH FT WKCPIGYVLCEKISTTNLVCLITNAVNLCVQHGIDVHSVTCDGVFSNFSAM FT KSLGCSFDKAENMKTFFNIESYNKNIYFIPDPCHMLKLARNALCDLGVFID FT SQKRFVKWEHITILQNLQEDIGLKFANKLGNSHINFHRHKMNVKIAAQTLS FT NSVADAIEFLMLSGHPSFHDAQGTLDFIRTINQLFDLLNSRSPFGKGLKKP FT LFLNDKIHWFSVIEKTIKYLASLTDKDGTKILKHRRKTFAIGFIVAATSIQ FT NMSINLLSRSECSFKYVLTYKVSQDHLELLFACIRGKNGFNNNPNIIQLKS FT SFKKILLRNFLIGSKHANCLTFEKNSAGSIFSLKWSKTRSPMIETELKDED FT KNIIANLSNSLEKISTLIYKQAILGYIAGYIVKTLKKKITCCICAEALNQP FT DNLVRDHDYTVFYKSSSLSFINLKNLGGLNIPSVSVIKIIEKCEKIFRVFV FT IGINSTLATYSQKNLKILNGLQN" XX SQ Sequence 3505 BP; 1270 A; 480 C; 498 G; 1257 T; 0 other; gaggcaaatt tgtttattgg aagccatctt gaatactggc aaaacacgca aaaaaaaatt 60 tataatattt tttaaaaatt tacttttatt actaaagaaa tggttatatc gtgctctgct 120 tatggttgtg tgaaccgatt taataaaggt gttgcgatat cgtttcataa attcccttta 180 aataatagcg agctatgcca aaaatgggtt gtggcaacaa aacgtgattc atttgttcct 240 acaaaatata gttatatatg tagtgttcac tttcataaag aagattacaa ttatgaaaat 300 gcaaataaac ctcgtttaaa agcaaatgtt gtaccatcta tctttaattt cccggcgaaa 360 atgcattcta aaaagcccat caaaagaaaa tatgtttcca ggaatgtaac tgagacattc 420 gaaactatta atgaaaaaaa gatatctctt tcgttgccat gttcctctaa gaatcatatt 480 ttaactagtg tagattttac tattgaaaat ccaaataata aagtcgactc tccatctaaa 540 gtcaagctta aaagaaaaat taagactctt aaacaaaagc ttagaaggaa agaattgaaa 600 ataaatacaa tgacagaaat tattaaaaaa cttgagaatg acaagcttat atcagctgat 660 acagctcaat ttcttgataa tagcttttct ggtcttgttt gtgatttatt taaatctgaa 720 ctagtcaata aatgcaaaga tcctagaggt caccgatatt cagaagaagt aaagaaattt 780 gccttgacac ttcattttta ttcacctcgt gcctatactt ttgtcagctc attgtttgct 840 ttgccttgta taagttcttt atcgaattgg tcctcatcta ttgattgttc acctggattt 900 tttaaagatg ttttttctta cttacaacaa aagggattag aagatgttac atataaagat 960 tgtgcattga tctttgattc tatgcatata aaatcaggtc ttgtttataa tccaagtaat 1020 ggtaactatg aaggtttttc agattatgga aacaatatat ctgcttttga tccaaatagt 1080 atagctactg aggctcttgt ctttatgctt gttggacttc gagggcattg gaaatgtcct 1140 atcggttatg ttttatgtga aaagatttca acaacaaact tggtttgtct tataactaac 1200 gcagttaatt tatgtgttca acatggtata gatgttcaca gcgtaacttg tgatggtgtt 1260 ttttctaatt ttagtgccat gaaaagtctt ggctgttcgt ttgataaagc tgaaaatatg 1320 aaaacttttt ttaatattga gtcttataat aaaaatattt attttattcc tgatccttgc 1380 catatgctaa aacttgcacg aaatgcgcta tgtgatcttg gtgtatttat tgatagtcaa 1440 aaaagatttg taaagtggga gcacatcaca attctgcaaa atttgcaaga agacattgga 1500 ctaaagtttg ctaataagtt aggaaatagc catattaatt ttcataggca taaaatgaat 1560 gtaaaaatag ctgcccaaac actaagtaat tctgtagcag atgctattga atttttaatg 1620 ctatctggtc atccctcttt tcatgatgct caaggaacat tagattttat aagaacaatt 1680 aatcaacttt ttgatttact taattcaaga agtccattcg gcaaaggttt gaaaaaaccc 1740 ttatttctta atgacaaaat ccattggttt agtgttattg aaaaaacaat taaatatctg 1800 gccagcctaa ctgataaaga tgggactaaa attcttaaac atagaagaaa aacgtttgct 1860 attggtttca ttgtcgctgc aactagtatc caaaatatgt ctataaattt actatcaaga 1920 tccgaatgtt cattcaaata tgtgttgacc tataaagtgt ctcaagatca cttagaactg 1980 ttgtttgctt gtattagggg aaaaaatgga tttaataata atcctaatat tatacaatta 2040 aagtcaagct tcaaaaaaat tcttttaaga aactttttga ttggttcaaa gcatgcaaat 2100 tgtctcactt ttgaaaaaaa ttcagctggt tcaatatttt cattgaaatg gagtaaaaca 2160 agatctccaa tgatagaaac agaattaaaa gatgaagata aaaatataat tgcaaactta 2220 tcaaatagcc tagaaaagat ttcaacattg atatacaaac aagcaatact tggttatata 2280 gctggttaca ttgttaaaac tcttaaaaag aaaataactt gttgcatatg tgctgaagca 2340 cttaaccaac cagataattt agttcgtgat catgactaca cagtctttta caaatcatca 2400 tctttatctt ttataaattt aaaaaatctt ggtggattaa atataccatc tgttagtgtt 2460 attaaaatta ttgaaaaatg tgagaaaata tttagagttt tcgttatagg cataaattca 2520 acgttagcca cttattcaca aaaaaatctt aaaatcctta atggtttaca aaattaatca 2580 agagcttgca tgtgaaaaga tgtttcctga actaaacgag catgatttag atcatgagat 2640 tttaactgaa gatatgcact cctctcaact attaaaaaag ataattgata aatatctctc 2700 cattcgttta tttcgttatg gaaaacagta tactaacgat attctacata aacataaaat 2760 tggtttacgt cagcagttta ataaaataat tttatttaaa ggcatataaa tatttttttt 2820 agttaaaaag gaactcagat tttttagttt tagagataat gcaaattgtt ttgtttttaa 2880 atacattcta tatttataaa aaaaacactc tattatatat aaatattata tatgtgtata 2940 ttatatatgt atatatttat atatgtgtat gtgtgctttt ataattatgt gtgtatgtgt 3000 gtgtttatgc atacacatgt ataatattta tataaaataa agtttattca tacattatat 3060 atatatatat atatatatat aaatactatt attaacagtt aatatattgt ttactggtaa 3120 tagtgtattt aattaatacc tcctcaatct aatccatgtc cagttttttc aaagtatgtc 3180 agaaatagca gcatcttctt aaggtagtta acttattttt aaaactataa ctttaaagta 3240 acatcaacta ctttaagttt acaattataa aaataacgtc cagataaagt aattaaattc 3300 aatttcagat aaagtaattc aaaacaacat tacttcttct tcaacattaa ctatactatt 3360 aagattattt tgtttcattt ttaataaata aatctaaaca ggtcaggtag tatatagttt 3420 cttgtaaaac caagtcaaag cttcatcgct tatttacgat gttttgccag tattcaatat 3480 ggcttccgcc gaaaaaattt gcctc 3505 // ID BEL-592_AA-I repbase; DNA; INV; 6704 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-592_AA_; KW BEL-592_AA-LTR; Pao_Bel_Ele30; BEL-592_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6704 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5574-6155] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 4290..6548 FT /product="BEL-592_AA-I_3p" FT /translation="MQELWLLSCDWDEPVPEPVKSKWENYHQELSTISEHR FT VDRYAFLPDSKIQLHTFADASQNAYGACTYARCEDNQGRIRIQLLASKSRV FT APLKRITIARLELCAAVVAAHLHDRIKKATDIEISASYFWSDSSVTLQWLR FT SPPNTWPTFVANRVSEVQQYTHGCQWKHVSGIENPADLVSRGMSVEEFISS FT KLWHHGPNWLARPAQDWPVSIPPGVPAEELEMKSTVAAVQVSSSVHPIFLR FT WSSYNRLLHVVGYIIRFVSNTRSKTRTNPSSRASPVPECLTVNEIANAKTI FT LTRLAQEDGYSPELKEIRKVKKVSKQSPIRNMSPFIDPEGVLRVGGRLNFA FT LLPYQAKHPALLPTTHPFTRLIVEHFHRKLIHGGGRQLLTAIREEFWPPRG FT RKLVQSVVRNCFRCVRLNPVPAQQQIGQLPAQRVIPSRPFSVTGIDYAGPL FT YLRPIHKRASPAKAYLCLFVCFSTKAVHLELVGDLSTQAFLCSLRRFISRR FT GRPAHIHSDNGKNFEGAKNELAELFARFQDQSQHNEIATFCSDEGITWHLT FT PPKAPHFGGLWEAAVKVAKKHLFRQLGSSRLSFEDMCTVLTQIEAIMNSRP FT LLPMTEDPNDLAALTPAHFLIGSSLTALPDPDIHSIPTSRLQHYQQLQQHV FT QRFWIHWRQEYLQELLKDTRGWKRNEQIVPGRLVILVDEMQAAIRWPLARI FT ESVLPGKDQLARVVLLRTSRGIITRPITKICLLPCSSVFQNAEDVPSPSSN FT SK" FT CDS join(2027..2890,2894..4039) FT /product="BEL-592_AA-I_1p" FT /translation="MKCQRFNNLSPNERQQLVSSKRLCHNCLRGNHFARNC FT PSSFTCRRCNRRHHTLLHPEQPEGPQRSSSQGSASRSAPPATSASSNASSV FT VESSTQSMVAASESMPSVEASVTVQHPRENVFLLTVMVKIVDEYGIEHLAR FT ALLDSASQPNLITDRMAKILRLRRHKVNVTVQGAGKLSNSVRESVFAQVQS FT RRGDFSCGVNFLVMDKLTANLPAQSVSTAGWKIPKDLYLADPSFNESQPID FT MLLGAKHFYTFFPSAARLQLDTNLPLLVDSVFGWIVVGSAGATSPVQSSAV FT CEAVTVSMVSLEDSLERFWKTEELTTTDNYSVEERRCETLYSSTVSRNSEG FT RYVVRYPRKTGFDEIIGESQKNALRRFGFLERRLERHPQLKDEYHQFMREY FT LSLGHMRLVDADDEERSRTYYLPHHAVIKEASTTTKVRVVFDGSAKTSTGF FT SLNEALCVGPVVQDELIDIILRFRTFPVALVGDIAKMYRQVLIHPDDTPLQ FT RILWRFTQLSPVQTYELLTVTYGLAPSSFLATRTLKQLADDEGEPFAKGAP FT ALKKGFYVDDFIGGAQSVKEAILLRDELSELLKKGGFELRKWTSNRLEVLQ FT GLNDEQIGTKSSLEFKPHETIKALGINWEPEADVLRFDSQIRDDNAPPTKR FT SILSDIAKLFDPLGLISPVHT" XX SQ Sequence 6704 BP; 1654 A; 1760 C; 1555 G; 1733 T; 2 other; taaaatttgg tgccgtgacc aggatacgtc ggaccgcgtg gtgaggtgtc tactgggcgc 60 tgccatcttg gactccggct cgttggattg ttctggatct ggctgctgct gcttattgta 120 ttactacaag gctcgaataa tcgagcggaa ggtaaagacc tctccacttc gattttgctg 180 tacccacggg taccctttag atcccttttc tctcttattg catgcgtgaa tctacttggc 240 attcgctgcc gttcgtctgg agtgtgatac tgctatcggc ctctgagcgc cattgtgccg 300 ccgatttcca agaacaattg gactattgta tcgcagtggc gtccccagcc acagttcctc 360 gcctgctgca acgcattcat tgattttggc ggtttttctg ctactgtggc gctcccagcc 420 ccgacacgac acgtggtgtg cttgctaccg cgtgccatcc agttgctgtg cctgttgatt 480 gtccatcgtg tggcttttcc aacaacaagc gtgattgctt tcgacttgcc aatccagcac 540 agggctttcg gttacccagt acggacggta ctacgactag ttcctgttgg cggtattaaa 600 aatacaaggc acagtttgcc ttggaacagg tgagcacctt tccaagtaat ttattgcatt 660 tgcgtgggtc tactcttggt ccattggtcc acaggtatcc ggctttcgga ctgcattgtc 720 atggattctc agcagcgaga ccagctgcta acaagaaggg atactctgct tgctgcgttg 780 ggtcgggcag aggcttttgc tgccaacttc gatgctcaac gagatcaggc gcaacttcca 840 ctacggctgg agtacatcaa cggactgtgg aacagcttgg agacggtcca agggcaactc 900 gaggatggag aaaccactga tcaaggtagg gcagggcatg cgacatttag agctgaactg 960 gagccgcgtc ttttttcaat aaaggctagc ctgctatcaa aattgcctcc acctcccgtt 1020 attagcagtg cagtccctca cccccctcak gtgtcctcca ctctctctgg gatcaaatta 1080 ccaacgattt cgcttcccga gttcaacggc gactatatgc aatggctggc atttcatgat 1140 actttcctgg cgcttattca ctccaatccg gacctgccgg atatccaaaa atttcactat 1200 ttacgtgcag ctgtcaaggg tgaagctgcg cagttaatag agtccattgg aatcagttcc 1260 gctaattacg gcttggcttg gcaaacgttg gagaaccgat attcgaacga ttaccttctc 1320 cggaggcgcc acctgcaggc actcttcgac atcccctgca tgaagaagga gtccgctgcg 1380 tcactacacg ggttggtgga cgagtttgaa cggcatacga aaattctgca tcagctcgga 1440 gaaccgactg actcttggag ctccatcctg gagcatttgc tttgcacgcg tctgcacaac 1500 gatacggtca aggcttggga ggatcacgcg tcgactgtgg cgaatcctac acacttaaat 1560 tatttcaccg acctcagtaa aatttttcgc cgagaaccca acagctgaga gctcggtaaa 1620 ttttaacgcc gaatctcggt acttatttta ccgagatttc ggcaatccga attttttgcc 1680 gagatctcgg cgaacgttac cgagattcgg taaacgttta ccaagatctc ggtaaaagtt 1740 ttaccgagaa tcgtcaaatt attaccaaga tatcagctgt tgggttctcg gcgaaaccgt 1800 ttaagtgtgt aactacaact gtttgattga ttttctgcag cgacgtactc gggtattgga 1860 atcgatttcc gtgaaccacc atgcaccgga tttgccgccc tcatctggtg gtgcagccca 1920 cccgccgaag aagagttacc agcattccca gtttcggctg tcatcatgtg caacaaccac 1980 caatcaaggc gaaaagtgca tcgtttgtgg tcagtctcat tctgtaatga agtgccagag 2040 gttcaataat ctttccccaa atgaacgcca gcagcttgtg agctcaaaac gattgtgtca 2100 caactgcttg agaggaaatc atttcgccag aaattgtccg tctagcttca cttgtcgtag 2160 atgcaatcga cgccaccata ctcttctcca cccagaacag cccgagggtc ctcaaaggtc 2220 tagcagtcaa ggttccgctt ctagatcagc accacctgca acgtctgctt cgtcaaatgc 2280 ttcttctgtc gtggagtcgt ctacgcagtc catggtcgct gcctctgaat ccatgcctag 2340 cgtggaagct agtgtaacgg tgcagcatcc tcgtgaaaat gtctttctcc ttacagttat 2400 ggtaaaaatt gtcgacgagt acggtataga acatctagct cgtgctcttc tcgacagcgc 2460 ttcccaacca aacctcatta cggaccggat ggctaagatt ctacgactac gtcgacacaa 2520 agtgaacgta acggtgcagg gagccggcaa gctttcgaat tcggtgcgcg aatcagtatt 2580 cgctcaggta caatcaagac gaggagactt ttcgtgtggt gtaaattttc tggtgatgga 2640 caaattaact gctaatctcc ctgcgcaaag cgtttccacc gcaggatgga aaataccaaa 2700 agatttgtac ctagcagatc catccttcaa cgaaagtcaa ccaatagaca tgctactagg 2760 ggcaaaacat ttctacacgt tcttccccag cgcggcacgc cttcagctcg acacaaacct 2820 tcccctctta gtcgacagtg tttttggttg gattgtcgtt ggatcggctg gcgcaacatc 2880 tcctgttcaa tmttcgtctg ctgtatgtga agcggtcact gtatcaatgg tttcgctcga 2940 ggacagtttg gaacgatttt ggaaaacaga ggagctgaca acaacggaca attattctgt 3000 cgaagaacgt cgttgtgaaa ccctgtactc ttcaacagtt tcccgcaatt ctgaaggccg 3060 atatgttgtt cgttatcccc gaaagactgg cttcgatgaa ataattggtg agtcccagaa 3120 aaacgctctt cgaagatttg gatttttgga gcgacgccta gaacgacacc cgcaattgaa 3180 agatgaatac caccagttta tgagagagta tctctccctc ggccacatgc ggttggtcga 3240 tgcggacgac gaagaacgct ctcgaacata ctacctgccc caccacgccg taatcaagga 3300 ggcaagtacg acaaccaagg tccgtgttgt atttgatggt tctgcaaaaa cctccaccgg 3360 cttctcccta aatgaagctc tatgcgtggg tcccgtggtg caggacgagc ttattgacat 3420 cattctgcga tttcgtacct ttcccgttgc tctcgtggga gacatagcca agatgtaccg 3480 gcaagtactg attcatcccg acgacactcc tcttcagcgt atcctgtggc gtttcacgca 3540 gctatctccg gtgcagacgt acgaacttct caccgtcaca tatgggctag cgccttcctc 3600 ctttcttgcg actcgtacgc ttaaacaact tgctgatgat gagggtgaac cattcgccaa 3660 aggagctcca gcattgaaga aaggctttta tgtggacgat tttatcggtg gagctcaatc 3720 cgtcaaagaa gcgatcctct tacgtgacga actcagcgaa ctgttgaaga aaggtggatt 3780 tgaactacgc aaatggacct ccaatcggct cgaagtattg caaggcctca acgatgagca 3840 aatcggaacg aaatcctctc tcgagttcaa accccatgaa acgataaagg cgttaggaat 3900 caactgggaa cctgaagctg atgttctacg atttgactcc caaatccggg acgataatgc 3960 accaccaacg aaacgatcga ttctgtccga catagcgaag ctgttcgatc ctcttgggct 4020 catatctccc gtacacactt aaattatttc accgacctca gtaaaacttt tcgccgagaa 4080 cccaacagct gagagttcgg taattttcaa tgccgaatct cggtacttat tttaccgaaa 4140 tctcggcaat ccaatttttt gccgagacgt aactttaccg agatctcggt aaatttttta 4200 ccgagaatcg tcgaattatt accgagatat cagctgttgg gttctcggcg aatccgttta 4260 agtgtgtagt tgttcgagcg aaaatcctca tgcaggagct gtggcttctg tcctgcgatt 4320 gggatgagcc tgtaccggaa cctgtaaaat cgaaatggga aaactatcat caagagcttt 4380 caacgatctc cgagcaccgt gtcgaccgtt acgcctttct acccgattct aaaatccagc 4440 tacacacctt tgctgatgcg tctcagaatg cgtacggtgc gtgtacatat gcccgctgtg 4500 aagataatca aggtaggatt cgaatccagc ttctagcgtc gaaatcccgt gttgctccgt 4560 taaaacggat cactatcgcc agattggaac tctgtgcagc agtggtggct gctcacctac 4620 acgaccgaat aaaaaaggca actgacatcg agatctccgc atcctacttc tggtcggatt 4680 cctcagtcac ccttcaatgg cttcgctcgc ctccaaacac ctggccaact ttcgtagcca 4740 accgcgtttc ggaagtccaa cagtacacgc atggctgcca gtggaagcac gtctccggaa 4800 tcgaaaatcc cgccgacctg gtctcacgcg gcatgtcagt ggaggaattc atctcgagta 4860 aactatggca tcatggacca aactggctag cacgacctgc acaagattgg ccggtttcaa 4920 tccctcctgg agttccagcc gaagagctgg aaatgaaatc taccgttgct gctgtccaag 4980 tatcttcatc agttcatccg atcttccttc gttggtcctc atacaaccgt ttgctccacg 5040 tcgtcgggta catcatacga tttgtcagca acactcgttc gaaaacccgc acaaatccat 5100 catctagagc tagccctgtt ccagagtgtc taacggttaa cgaaattgcc aatgccaaaa 5160 cgattctcac acgcctagcc caggaggatg gatactcacc tgaattaaaa gaaataagaa 5220 aggtgaaaaa ggtatcaaaa caatcgccta ttcgcaacat gagcccattc attgatccgg 5280 agggggtgtt gagagttgga ggtcgcctta actttgccct actgccctac caagcgaaac 5340 accctgccct gttgccaaca acgcatccat ttacgcgtct aattgttgag catttccacc 5400 gtaagttgat ccacggcggc gggcgtcaac tgttgactgc cattcgggaa gaattctggc 5460 ctccacgggg tcgaaaattg gtccaaagtg tcgttaggaa ctgttttcga tgcgttcgtc 5520 tcaatcctgt tcctgcccaa cagcaaatcg gtcagcttcc agcccagcga gtcattccaa 5580 gtcgcccctt cagtgttact ggcattgact atgctggccc gctttatctc cgcccgattc 5640 ataaacgtgc ctcacctgcc aaagcatacc tgtgcctgtt cgtatgcttt tcgaccaaag 5700 ccgtgcatct cgagttagta ggcgatttat caactcaagc gttcctatgc tcgctacgtc 5760 gcttcatttc acggcgaggt cgacccgcgc atatccactc agataacggg aagaactttg 5820 agggagcgaa aaatgagttg gctgagcttt ttgctagatt tcaggatcag tcccaacata 5880 atgaaatcgc caccttttgc tccgatgagg gaatcacctg gcatttgacc ccacccaaag 5940 ctcctcactt tggcggccta tgggaagcgg ccgttaaggt agctaaaaaa catctcttcc 6000 gtcaattggg atcttcacgg ttgtccttcg aggatatgtg cactgttttg acgcaaatcg 6060 aagcaatcat gaacagccga ccgttgcttc ctatgaccga ggacccaaat gacttggcgg 6120 cgctgacccc ggcacacttc ctaatcggat cttctctaac tgccttgccc gaccccgaca 6180 tccacagcat acccaccagc agactccagc attatcagca gttacaacag catgtgcaac 6240 gattttggat acactggcgg caggaatatt tgcaggagct actgaaagac acccgcgggt 6300 ggaaacgcaa tgagcaaatt gttccagggc ggctggtcat ccttgtcgat gaaatgcaag 6360 ctgcgattcg gtggccacta gcccgcatcg aatcagttct accgggaaag gaccaactag 6420 ctcgagttgt tttgcttcgt acaagccggg gcatcatcac gcgtccaatt acaaaaattt 6480 gcctgctgcc gtgctcctcg gtgttccaga atgcagaaga tgttccaagt ccatccagca 6540 acagcaagta gatgtgttta tcgtcgagtt tctgtttaat ttaagctgag aacaacgaaa 6600 gtgttagaat taagatgcaa aatatccatg taatgatagt tgtagttacc atgttgtttg 6660 tttacctttt ctatgttgaa cactgtagtt caaggcggcg ggta 6704 // ID CR1-64_AAe repbase; DNA; INV; 4551 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-64_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4551 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1152-1152 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 23 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 726..1598 FT /product="CR1-64_AAe_1p" FT /translation="MMEICNSCAVMMNAPEVVCSGFCKATFHYKCAKVSES FT FYREICGNPAVFWFCRGCCDLMKSARFKNAMTSTNAATLELKDAYQKVVED FT LKTEIKESLIAELKQEIQGGFNKLSPAVLSPVPRHFQFNYRATPKRMRDDD FT TSGPTEHPPKIFCGTGQSSSNVLSESTSSPDDKFWLYLTRISPEVTENDVL FT NLAKECLQTSEVVAKSLVPRGKPLSMLSFVSFKVGVNKGHKSKAMDPATWP FT QGIQFREFVDQESNARHFWKPPQRIDPGASSSNQLQSSQQTSTQPCITLS" FT CDS 1478..4492 FT /product="CR1-64_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="PRVERPTFLEAPAAHRPGSIIIESTPIQSANLNAALH FT HSELNRRSPRSTAATRNYLTVYYQNVRGIRTKTQELFLSLSSSDYDIIVLT FT ETWLRPDIANAEFAANYNIFRCDRNSATSNLQRGGGVLIAVKAALNCRSVV FT LENCNNLEQTVVLVKLQQSSIYVCGIYLRPNSLPVLYASHSAAIQQLCERI FT SSSDTIVVVGDYNLPQLVWQMDDDVGSLLPSNASTEQEITLVERMVASGLH FT QINSLLNSNNRLLDLAFVNEVNDVELIEPPSSLLRVDRHHRPFVLRIDVSD FT IRRQLAGNPNQADEFDFRRCNFAELEAAISSIDWNSLFHGMDTDQTVCIFY FT ETLYEILDEHVPRRRRRRHFYKQPWWTSELQHLRNVVRKSRRRYFRSRTVQ FT NRDNLRIIESRYKECQTTSFRNYVTNMETTAKQDPSGFWTFIRNRKRANRF FT PAEMTFNDTVANSPEGVANLFAEFFESVYCTNSPTFSPDSVSNCPAFDLNV FT ARFDVTQQDVSSALRKLDITKGPGTDNLPPLFLKECADSLQIPLTIIFNRS FT LHDSTFPMLWKTASITPIFKAGSTRAVENYRGISILCCLGKVFEELIHSVL FT YTACQPLISEFQHGFVKKRSTTTNLMTFTNFLSTEIENKHQVDAIYFDFSK FT AFDKVPHDLAIAKLRHLGFPNWIALWLQSYLTQRKAFVSINGTRSRVISIT FT SGVPQGSVLGPLIFILFINDLCFRLKSGKLLYADDLKIYRVIMSHLDCCAL FT QNDIDELVNWCQQNGMELNIKKCKSIAFSRRHSRIDFEYMVGSERIECVDS FT IRDLGVTVDSKLRFNEHVSLTTAKAFAALGFVRRTTNYFKDVYALKSLYCS FT LVRSILEYAVCVWSPHHTTQIVRLERVQRSFIRYALRQLPWSDPVNLPEYP FT ARCRLIDIETLASRRNNLQRLFVFDLLKGNLDCSSLLDNVPFYAPNRHLRE FT RDLLLIRRHRTSYGFNNPLSKCLRLFNSVSALFDFNVSKYVFKNRIKDLD" XX SQ Sequence 4551 BP; 1205 A; 1145 C; 966 G; 1235 T; 0 other; tctggcaacc ctgctgcgct acttagctgt gtttttgttg tgtcatccgt ttcatcaata 60 tttcatcgaa aaacgccacc cacaattcat ctacgcaacg tcagatcgct tgaacccggc 120 caatcttttc ccgcacgatt aatctgctac tcccagccag gtaggatcag tgaattctgt 180 gcaattccga gctggctcga gtgtggtttt gagaaaatcg ctctccgcca ttgccccatc 240 aataaaaatc tcccacctcg ccactctcca cctgcgctgc tcttgcaagc cccgtgcatt 300 acaccattcg ccaaaacata aacaatcatc cattgctcct gccagcagcg catcaactgc 360 agttaattat cgctccaaac tcacccacct gtatcagctg agccaataaa cttcagttgc 420 tcctgctgtt cttctcgctc tgcagtccga aattcccacc aaccccgcat agaggaccga 480 acccaaattg catcaatttc cgcatcgctg caacttcatc gtttttgctt tgctgacgat 540 tgccaaggta aacaatacaa acttcaccga cagctttgtg agtaggaaga cagagaaacg 600 acatttcggc atctgtcatc gccagtgttg ccagttccgc cgacttttga tctgcttatt 660 gctgttacgc ttgggtgacg cttctgtgct tctatttcgc cgatttaaat cgacagtttg 720 aagagatgat ggaaatttgc aacagctgcg cagtaatgat gaatgctccg gaagttgtat 780 gtagtggatt ttgcaaagca actttccatt ataaatgcgc taaagtgtcg gagtctttct 840 acagggagat ttgcggtaac ccagctgtgt tttggttctg cagaggttgc tgcgatctga 900 tgaagagcgc ccgtttcaaa aacgctatga cttcgacaaa cgctgccact ttggaactca 960 aggatgctta ccagaaggtg gtcgaagact tgaaaacgga aataaaagaa agtcttatcg 1020 ccgagctgaa gcaggaaatc caaggcggct tcaacaaatt atcccctgca gtgctttctc 1080 cagttcctcg gcatttccaa ttcaactatc gtgcaacacc taagcgaatg cgcgatgacg 1140 atacatccgg gccgacggaa catccaccga agatcttctg tggcactggc caatcatcga 1200 gcaatgtgtt aagcgagtca acctcaagcc cggacgataa attttggttg tatctgacga 1260 gaatttcacc agaagtcact gagaacgacg tgctgaatct tgcgaaagaa tgcctacaaa 1320 ctagcgaagt cgtggccaaa tcactagttc cacgtggaaa gccattatct atgctttcat 1380 ttgtgtcctt caaggttgga gttaataaag gccacaaatc caaagcgatg gatcccgcta 1440 cctggcccca agggatacag ttccgcgaat ttgttgacca agagtcgaac gcccgacatt 1500 tttggaagcc cccgcagcgc atagacccgg gagcatcatc atcgaatcaa ctccaatcca 1560 gtcagcaaac ctcaacgcag ccctgcatca ctctgagtta aaccgccgtt ccccgagatc 1620 tactgccgct actcggaact acctgacagt ttactaccaa aacgtcaggg gcatccgaac 1680 caaaacgcaa gaactatttt tgagtttatc gtcctccgac tatgatatta tagttcttac 1740 agaaacgtgg ctgcgtcctg acattgcaaa tgcggaattt gctgcaaact acaatatatt 1800 caggtgtgat cggaattctg ccactagcaa ccttcagaga ggcggtgggg tcctgattgc 1860 cgtgaaagcc gcattgaact gcagatcagt agtgttagaa aactgcaata acctcgagca 1920 aactgttgta ctggtgaagc ttcagcagtc gtcgatttac gtttgtggga tctatctccg 1980 ccccaactcg ctaccggtgc tctacgcctc gcactctgca gccattcaac aactctgtga 2040 acgcatttcg agctctgaca caatcgttgt tgtaggggac tacaacctcc ctcaattagt 2100 ctggcagatg gatgatgacg tcggcagttt gcttccttct aacgcctcca ccgaacaaga 2160 gatcaccttg gttgaaagaa tggttgcctc cggattacat cagatcaata gccttttgaa 2220 ttcaaataac cgcttgctgg accttgcatt cgtgaatgaa gtaaacgacg ttgagttgat 2280 cgagccaccg tcttctctcc tccgggtgga ccgccaccac aggccgtttg ttcttcgtat 2340 tgacgtgagc gatattcgta ggcagttggc aggtaatcca aatcaagccg atgaatttga 2400 ttttcgacga tgcaactttg ctgagctaga agctgctatc tcttccattg attggaattc 2460 tttgtttcac ggcatggaca ctgatcaaac ggtatgtatt ttctatgaaa ctttgtacga 2520 aattctggac gagcatgttc cacgtagacg acgccgacga catttctaca aacagccttg 2580 gtggacatcc gagttgcaac accttcgcaa cgttgtgcgc aaatctcgta gacgctattt 2640 tcgatcgcgg accgttcaga atcgagacaa tcttcgtatc atcgaatccc gttacaaaga 2700 atgtcaaaca acctcgttcc gaaactacgt taccaacatg gaaacgactg caaagcagga 2760 tccctccggt ttctggacgt ttattcggaa tcgcaagcgt gctaatagat tccctgccga 2820 aatgacattc aatgacaccg tcgccaactc tcccgaaggc gttgctaatt tgtttgcaga 2880 atttttcgaa agcgtgtact gcacaaactc accaacgttc tcgcctgaca gtgttagcaa 2940 ttgtccagca ttcgatttga atgtcgcacg tttcgatgtc acacagcaag atgtttcgtc 3000 cgccttgcga aaactcgaca tcacaaaggg accagggaca gataatcttc ctccgttgtt 3060 tctcaaggag tgcgctgatt ctcttcaaat tccgctgacg ataattttca atcgatcgct 3120 ccatgacagc acttttccga tgctctggaa aacagcctcg atcactccga tcttcaaagc 3180 aggttctact cgtgcagttg aaaactatcg aggaatctcg attttatgct gcctggggaa 3240 agtctttgaa gaattaatcc atagcgtttt gtacactgca tgtcaacctc tgatatcaga 3300 atttcaacac ggattcgtca aaaagcgttc aactacgaca aatctcatga cgttcacaaa 3360 ctttctctcg accgaaatcg aaaataagca ccaggtagac gccatctact tcgacttttc 3420 caaagcattt gataaagtcc cgcatgatct cgctattgca aagcttagac acctaggttt 3480 tccaaattgg atagctctat ggttgcagtc ctatctaacg cagcgtaagg cgttcgtcag 3540 tatcaacggc acgcgttctc gcgttatctc aatcacgtcg ggcgtgccgc agggtagcgt 3600 gttaggaccg ctgatattca ttctcttcat caacgacctc tgcttccgac taaaatcagg 3660 aaaattgttg tacgcggacg atttgaaaat ttacagggta atcatgtctc atttggattg 3720 ctgtgcgcta cagaacgata tagatgagct tgtgaattgg tgccagcaga acggtatgga 3780 gctgaatata aaaaaatgta aatctattgc attctctcgt cgacactcac gaattgattt 3840 tgagtatatg gtcggatcgg aacggatcga atgcgtggat tcaatacgcg accttggagt 3900 tactgtcgac agcaaactac gtttcaatga gcatgtctcc ctcactaccg caaaggcttt 3960 tgccgctcta ggattcgttc gtcgcaccac caattatttt aaggatgtat atgcactcaa 4020 gtcgctctac tgctctttag ttcgtagtat tttagaatac gcagtttgtg tgtggtcacc 4080 gcatcacacg acacaaatcg tccggttgga gagagtccag cgaagtttta ttcgatatgc 4140 tcttcgtcaa ctgccatggt ctgatcccgt aaacctgcca gaatacccag cccgctgtag 4200 actgatagat atcgagacac ttgcttccag acgcaacaat ttacagcgac ttttcgtatt 4260 cgacctttta aaggggaact tagattgttc atcgcttctc gacaatgttc cattctacgc 4320 gcccaatcgc catttgcgag aacgggactt gctactcatc aggcggcata gaacgtctta 4380 tggattcaac aacccgctgt ccaagtgcct tcgtctgttt aatagtgtta gtgctttgtt 4440 cgattttaat gtgtctaagt atgtttttaa gaataggatt aaggatttag attaagaaac 4500 agtctgtggg atttcaataa ttcgagacag tgacaaataa ataaataaat a 4551 // ID IS4EU-1_AA repbase; DNA; INV; 4318 BP. XX AC . XX DT 29-APR-2007 (Rel. 12.04, Created) DT 01-MAY-2007 (Rel. 12.04, Last updated, Version 1) XX DE A family of autonomous IS4EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; IS4EU; KW Interspersed repeat; IS4EU-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4318 RA Kapitonov V.V. and Jurka J.; RT "IS4EU, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(4), 143-143 (2007). XX DR [1] (Consensus) XX CC DNA transposons from the IS4EU superfamily are characterized by CC the TA target site duplications. These transposons are wide CC spread in metazoans, including fish, frogs, lancelet, sea CC urchins, sea squirts, insects and cnidarians. Autonomous IS4EU CC transposons encode two proteins: the IS4EU-TR transposase, which CC is similar to the IS4-like bacterial transposases, and the CC ISEU-EX DNA exonuclease (lambda-like exonuclease). Based on the CC conservation of both proteins in highly divergent transposons, is CC is clear that they are necessary for transpositions. CC IS4EU-1_AA is a consensus sequence of a young family of CC autonomous IS4EU transposons that were active in the mosquito CC genome in a last few million years. The IS4EU-1_AA transposon is CC characterized by 15-bp imperfect terminal inverted repeats, TA CC target site duplications, and it encodes two proteins: (i) the CC 430-aa transposase, IS4EU-1_AA1p, composed of the THAP CC DNA-binding N-terminal domain and catalytic "DDE" domain, which CC is conserved in all IS4EU transposases, and (ii) the 519-aa CC IS4EU-1_AA2p exonuclease. Questions and comments send to Vladimir CC Kapitonov. XX FH Key Location/Qualifiers FT CDS join(360..705,976..1919) FT /product="IS4EU-1_AA1p" FT /note="IS4EU-TR transposase." FT /translation="MKISCVFRTQDMKMSATKLRDFSTTCQEESQTFVGTT FT EEQKRSEPGAGVSVNRTVCCVPGCNQRYSPGVSFHAFPLKSDEERYRIWTK FT QLRLKYEPTKTARVCSNHFSIRNFIPPFDAGIQVDTTDIDKRQIKRSCAIS FT FGEDEDDMAAWTGLRSKDMLQSIVNAVMILKEFKKRKHSSVTIDEEILIVL FT VKLKTNLSFRCISVLFRIHAQTVSVCFYRTVPLLSAALKPLIYWPTSEQIS FT KNIPHYFRPDFEDVVVVLDCTEMPIMKPKCLHCRINTYSHYKSRETAKYLI FT GVTPGGTISYVSCGYGGKSSDKQIVVEEKVLDLLTPGQAVMTDKGFMIDEE FT CKRRNVKLVRPPFLNGPQKQLSKFEATQNVSIAAARVHVERAIQRVRIFSF FT FNTKIDSRFLSVLDDLLLISCAIVNISSPIIANTRF" FT CDS join(4195..4193,3904..3579,3334..3057,3000..2051) FT /product="IS4EU-1_AA2p" FT /note="IS4EU-EX lambda-like exonuclease." FT /translation="MESNHEIDTCEVPVQLSEVFHFAAGGSRLVQEGERVF FT NAAHLLIVGAEQVHQDGVSIFATCLQSSSPSSEPHVIKIRTKQAFPQWTFT FT CTCKAGKGKCKHQMAVLYHLLRCATVLILTVTDLKQKWGKIAAKTATDLYK FT ATRLMDLCASMKSVENLSDSTTIEHSLNESTIEESNERSKSSKYRQYTATG FT EERSQILHHFVSCFPNSALGYEICGRPSIQAETMNHESEIENCVLQMEIST FT SKLSLSSSNSIISFQDFSPESIVKRYVEKLFEPEFDLKYKYFIGPLEANLL FT TFYEIKICVSAKESTDICMRTISQNSEECKEERSMRITASNAYSLFTYYVT FT EEKKSKDWKRKIVKLMFPENLKTPAIDHGRRCEHKAIESYEIMTGVSVNRC FT GFVIPPHIPWIGCSPDGIIVEERKINKVKFPFAGKTSTISEVLKDLPYLSK FT SNQLRKRHIYYGQVQINMFVLKCEKADFIIYSEFEDRCYVISVDLDEEYVY FT GLLDTLKEVYFNIYLKILYTLKN" XX SQ Sequence 4318 BP; 1324 A; 825 C; 804 G; 1365 T; 0 other; ttcgattcta cgaaggtacg gtgtcgccac ctacgtcagt ttcgtctgtg cgtgaacaaa 60 aggtaaacaa acattgttga tttcaacgca tttttcgttt gcaaagttct acgtaaattt 120 tactgaacct gttggttttt acaattggaa tccattcatc aattagtaaa gtcatcaaac 180 gtgctatata acccaggtta gtttagtgaa tgatgtgtta aaatccatct ctaagtgagc 240 tcctccaaac accgtagaaa agtgtagcgc ataaatttaa cgatttttga atgaaaatca 300 ctgttttgtg ctttaattat ttgcgcactc agtcgcaata accctagaaa tatcaaaaaa 360 tgaaaatttc ttgtgttttt cgtactcagg atatgaaaat gagtgccaca aagttgcgag 420 acttttcaac aacctgccaa gaagaaagtc aaacattcgt tggtacaact gaagagcaga 480 agcgaagcga gcctggtgca ggagtatcag ttaaccgaac agtgtgctgt gttccggggt 540 gcaatcagcg gtacagtcct ggagtttcgt ttcatgcatt tcctctaaag agtgacgaag 600 aacggtaccg gatttggaca aagcaactgc ggttgaaata cgaaccgacg aagactgctc 660 gagtgtgcag caaccacttt agcatccgga attttatacc accatgtaag tgcagaaatg 720 tttagcctta tcaataatat gttcacgaaa attttaattt ttagcaaaaa aaaacgacaa 780 accgagcgtc taatactacg agcatcagca gttcctgatt caaatctacc cggttgtgct 840 gccaaagaag cactgaaaaa aaatcgtgct gaaatcgatc gcttcttaaa gaacgttgta 900 tttcgaatag acccagcagc aaactgggtc aggaagaagt atttcaacct gcagagacat 960 tcccaccctc actagttgat gctggtatac aagtggatac tactgacatt gataaaagac 1020 aaatcaagcg gtcttgtgcg atctcttttg gtgaggatga agatgacatg gcagcgtgga 1080 caggtttgag atcgaaagat atgttgcagt cgatagtaaa tgctgtaatg atcctgaaag 1140 agtttaaaaa gagaaagcat tcttcagtga ctatcgacga agaaatattg attgtattgg 1200 tcaagctaaa aaccaattta tcttttcgat gcatatctgt actattccgg attcatgcac 1260 agaccgtgtc tgtctgtttc tatcgtacag ttccattact gtctgcagca ttaaaacctt 1320 tgatatactg gcctactagt gagcaaatat ctaagaatat tccacactac tttagacctg 1380 attttgaaga tgtcgtcgta gttctggatt gcactgaaat gccgatcatg aaacctaaat 1440 gcttgcattg tcgtataaat acatactccc actataaatc tcgagaaaca gctaaatatt 1500 tgattggtgt gactcccggt ggaactatct cctatgttag ttgtgggtat ggagggaaat 1560 cttcggataa gcagattgtt gtcgaagaaa aagtgctaga ccttttaact ccaggacaag 1620 ccgttatgac tgataaagga tttatgatcg atgaggaatg taagcgacga aacgtgaagc 1680 tagtccggcc accgttcctc aatggacccc agaaacaact gagcaagttt gaagcaactc 1740 aaaatgtgtc tattgctgca gctcgagtac acgttgaaag agccatacaa cgtgttagaa 1800 ttttttcgtt tttcaataca aaaattgatt ctagattttt gtccgttctc gacgacctgt 1860 tactcatctc gtgtgctatt gttaatattt ccagccccat aattgccaat acaaggtttt 1920 gaagaaatag ttaaggctca ttttaatttg actttgtaca taaataaacc aaaatcacag 1980 ttgtatgttt ttattattct taacattaat ccgatgtctt acaacgcatg attctttaaa 2040 aaataattca atttttcaaa gtatacaata tttttaaata aatgttgaaa taaacttcct 2100 tcagtgtgtc caagagtcca tatacgtatt cttcatcgag gtcgacagaa atgacgtaac 2160 atctgtcttc gaactccgag taaattatga aatctgcttt ctcacactta agaacgaaca 2220 tattaatttg aacttgtcca taataaatgt ggcgttttcg cagttgatta gatttactca 2280 aatatggtaa atccttcaga acctccgaaa tcgttgatgt ctttcccgcg aatgggaatt 2340 taactttgtt tattttgcgt tcctctacta ttataccatc tggagagcat ccaatccatg 2400 gaatatgtgg aggaatgaca aaaccacatc gattcacaga tacaccagtc atgatttcgt 2460 agctttcaat agctttatgt tcacagcgac gtccgtgatc aatagctgga gtttttaaat 2520 tttcggggaa cattaatttt acgatttttc gcttccagtc tttcgatttc ttttcctcgg 2580 tgacataata tgtgaaaagc gaatatgcgt tcgacgctgt aattcgcatg ctgcgttcct 2640 ctttacattc ttcagaattc tgggaaatcg ttcgcatgca tatatcagtg ctttcttttg 2700 ctgatacgca gatcttgatt tcataaaagg tcaataggtt tgcctccagt ggaccaatga 2760 aatacttgta cttcaggtca aattctggtt cgaataattt ttccacatat cgcttcacga 2820 tagactcggg agaaaagtct tggaaagaaa ttattgaatt cgacgagctc agcgacaatt 2880 tgctggttga aatttccatt tgaagtacgc agttctcaat ttcactttcg tggttcatcg 2940 tttcagcttg aatagatggt ctgccacaga tttcataacc aagtgcactg ttcggaaagc 3000 ctgaaatttc agccaactat tagattacaa ttcaaattct aaataaagtt acttacatga 3060 tacgaagtga tgcaaaatct gtgatcgctc ctctcctgtt gcagtatatt gtcgatattt 3120 tgaagatttg cttcgctcgt tagattcttc aatggtagat tcgtttaaac tgtgttcgat 3180 tgttgtacta tccgacaggt tctccaccga tttcatgctg gcacatagat ccatcaaccg 3240 cgttgcttta tacaaatctg tagcagtttt agcagcaatt ttgccccact tttgctttag 3300 atcggtcacc gtcaaaatca gaactgtagc acatctgaaa aaagaggaac acagattgtt 3360 ctcactttat aatgtttttt agaaattatg ttcaaataag gaaaacatcc attttggtgg 3420 ggaacttact ttgagtattt cagtgcttta ttttaacttg tttttggaca atataatttg 3480 gtttagaatt aatgttttta tacatgttgg ctatagcact gaaacaccca aacatggtca 3540 ccaattataa tggcttttct ttctaataga ctacttacct tagcaagtgg tacaagacgg 3600 ccatctggtg cttacatttt cctttaccag ctttgcaagt gcatgtgaat gtccactgcg 3660 gaaatgcttg cttagtacga atttttatca catgcggttc ggagcttggg gaagacgact 3720 gtaagcacgt tgcaaatatg ctcaccccat cttgatgcac ttgttcagct cctactatca 3780 acagatgtgc agcattaaac actctttcgc cttcttgtac caaacggctt ccaccggcag 3840 caaaatgaaa aacttccgat agttgaacgg gtacttcaca ggtgtcgatt tcatgatttg 3900 actcctaaaa ccgacggacg aaatcacaag aatttttgat gtttcagttg gtctatccat 3960 gaaaaattta tgtagcatat agataaagag tagcagaaca cttccataaa atcataatat 4020 tactttatga caattttcat tcataattta atgtaagaaa aacgtaaaca tttgggctac 4080 tcgaaaatta tgatttcaaa tcgaacggaa aactgaatgc tacatattcg aaaaagttta 4140 aaacacggtt ttgttgtgta aatacgaatt ctctctttca attgaaactc accattttca 4200 ctaattaaac gcaaatgttt gaaaaagtca ggtttttaaa ttcgtgaaaa aataacttat 4260 ttgtttactc tttttgttca cgcacaagaa ctgtcaaaat caacttcgaa gtcgcgaa 4318 // ID CR1-96_AAe repbase; DNA; INV; 3463 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-96_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3463 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1184-1184 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 3..3407 FT /product="CR1-96_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="XXNSTTCSSFKPPNPALGRNIVGSLEATEPLTIVVPS FT LPSLISRXGPMCAEGQVVFQPPILGKYAFNSISTAPDPLIGFSAAANIDHS FT STIMPLPGRTDVCSMEAPYPPATVEPFPPSIFSRPGPVCGVGDGVFQATTV FT GKYIQNTNNAPAHSNVVSSALPNVVSITHNRGGNCNVSASSDSRSLCVYYQ FT NVRGLRTKTAELNLRLSSCEYDIVVLTETWLRSDISNSELSSDYTIFRCDR FT NATTSNLSRGGGVLVAVKSNLQCTEVSAPDCTELEQVAVCVKFPNRALTIF FT GIYLRPNSDPRLYSIHSAAVQSVVDSSASNDVVLLFGDYNLPHLQWNFDED FT VNGYLPSNASSEQEINFTESILSCGLVQINSIENRNNRILDLVFTNFSDIS FT ELSQPVLPLLPVDDHHPPLVLQIDVCCSVTCHDEPEPDIFDLDFRRCDLSL FT LNSILSSVDWNHQLLGLTVDESVGLFYDKIYEILRSTIPRHRRVNHGVDRK FT PWWSPXLRNLRNRLRKTRKRYFANKTVENRNALCQVESSYNDLLDTSYRLY FT LENLQLFAKTNPSKFWKHMKAQKMSNGIPNTIIYGDTKAATSVEAANLFAS FT YFQSVYSTTSPQVYPGCFHNVQTYNIHIPPFQFSNREVLNALLNLDATKGA FT GVDGLPPHLLKSCATSLVTPLTILYNRSLSEKTFPELWKTATMIPIHKSGS FT IRLVDNYRGISILCCLGKLLESMVHKVMLSAANSVISEYQHGFLPHRSTTS FT NLLCYTNMLFREVEQRKQIDSIYVDFSKAFDTVPHRYAVEKLRHMGFPDWL FT TDWILSYLTDRKAYVRINTSRSTAFSIPSGVPQGSVLGPLIFVLFINDLCR FT RFSSEKILFADDLKIFRVINSILDCVALQDDIHSLLQWCNENGMTLNINKC FT KVISFCRSLTSIDYAYAISGTTLERVESIRDLGVIIDSKLRFNEHISAITA FT KAFSVLGFIRRNASQFTDVYALKALFCALVRSILEYAVPVWAPYHTSQVIR FT IERVQKSFIRFALRRLPWNDPVNLPDYSSRCELVDLELLSARRLKLQRLLV FT HDILTGKIDCTDLLAEVQLNVPCRRLRYAPLIAIPMHRTNYGYNNPLTSCL FT RNFNSVCEMFDFNLSKNCFKTRIKNIG" XX SQ Sequence 3463 BP; 903 A; 863 C; 706 G; 986 T; 5 other; ccmactkmaa ctcaacaacc tgttcttcct tcaaaccacc gaatccggct ctgggacgca 60 acattgttgg ctctttggaa gccaccgaac ctctcaccat agtcgtgcct tccttgccat 120 cgctcatcag tcgtcmcggc cctatgtgtg ctgaaggcca ggtggtcttc caaccaccta 180 ttctaggcaa gtacgctttc aattcaatca gtacagctcc tgatccactc attggtttca 240 gtgccgctgc aaacatcgac cactcttcga ccattatgcc gttaccggga cgcacagacg 300 tttgctcaat ggaagcccct tatcctcccg ccacagtcga gcccttcccg ccgtcgatct 360 tcagtcgtcc tggtcctgtg tgcggggtcg gagatggggt cttccaagcc actacagtcg 420 ggaagtatat tcaaaatacg aacaatgcac ctgctcattc caacgttgtt tccagtgctt 480 tgcccaacgt agtgtcaatc acgcataacc gaggaggtaa ttgcaacgtc tctgcatcat 540 ctgatagccg atctctctgc gtttactacc aaaacgtaag gggcttacga actaaaacag 600 ccgaactgaa tctgcgatta tcaagctgtg agtacgacat tgttgttctt accgaaacgt 660 ggctccgatc agatataagt aactcggaac tgtcttccga ttacacaata tttcgctgtg 720 atcgcaatgc gactactagt aacctgtcga gaggaggcgg cgtgctcgtt gcagttaaga 780 gtaatttgca gtgtacggaa gtgtctgcac ccgattgtac tgagctcgag caggtggcag 840 tatgtgtaaa gtttcctaac cgtgcgctga ctattttcgg catttacctc cgtccgaatt 900 ccgatccaag gctgtattcc atccactctg cggccgttca gagcgtggtc gactcttctg 960 caagtaacga tgttgtgctg ctttttggtg actacaatct tcctcatctg cagtggaact 1020 ttgatgaaga tgtgaacggt tacctgcctt caaatgcatc cagcgagcaa gagattaact 1080 tcaccgaatc aatcttgtcc tgcgggcttg tacagattaa ttctattgag aaccggaaca 1140 atcgaattct tgatttggta ttcacaaact tctcagacat atctgaactg tcgcagccgg 1200 tcttaccgct cctgccagtt gatgaccatc atccaccgct cgttctacaa atcgatgttt 1260 gctgctctgt cacctgccat gatgaaccag aaccggatat tttcgatctt gacttccgac 1320 gttgcgatct ctccttgctc aattccatac tgtcctcggt tgattggaat caccaactac 1380 tagggctaac cgtcgacgaa agtgtcggct tgttttacga caaaatctac gaaatacttc 1440 gctcaaccat tccacgtcac cggcgcgtta atcatggtgt agatagaaag ccttggtgga 1500 gtccccakct acgaaacttg cgcaacaggc tgagaaagac acgaaagcgg tatttcgcta 1560 ataaaaccgt ggaaaatagg aatgcacttt gtcaagtgga atcctcttat aatgatcttc 1620 ttgacacgag ctatcgatta taccttgaaa atctgcagct attcgccaaa acaaaccctt 1680 cgaagttctg gaagcatatg aaggcccaaa agatgagcaa tggaatcccg aacaccataa 1740 tctacggaga cactaaggcc gccacatccg tagaagcagc gaacctcttt gcgagctact 1800 tccaaagtgt gtacagtaca acatctcctc aagtttaccc tggttgcttc cacaacgttc 1860 aaacgtacaa catacatatc ccgcctttcc agttttccaa tcgagaagtg ttgaatgctt 1920 tactgaattt ggacgccacg aaaggtgctg gagtggatgg gttgcctcct catctgttga 1980 aaagctgtgc tacttcttta gtaacaccgt tgaccatact ttacaatcgg tctcttagcg 2040 agaaaacgtt tccggaatta tggaaaactg ctacaatgat tcccattcac aaatcaggca 2100 gcattcgtct tgtggacaat tatcgaggaa tctcaattct ctgttgcctc ggaaaactct 2160 tggagtccat ggtacacaaa gtaatgctgt ctgctgctaa ctcggtcatt tctgagtacc 2220 aacatggatt tctacctcat cgttcaacaa cttccaacct gctatgctac acaaacatgc 2280 tgtttcgtga ggttgagcag cgaaaacaaa ttgactcaat ctatgttgat ttctcaaaag 2340 cattcgacac tgtaccccat cgttatgccg ttgaaaaatt gcgtcacatg gggtttccag 2400 attggttaac agactggatt ctgtcctacc tcactgacag aaaagcttac gtcaggatca 2460 acacttctcg atccactgcc ttcagcattc cttcaggggt accacaaggc agtgtgctgg 2520 gaccactaat ttttgtgctg ttcatcaatg acttgtgccg acgattttcg tctgaaaaaa 2580 tattgtttgc tgatgatctt aaaatttttc gagtgattaa ctccatactc gactgtgtcg 2640 ctctacaaga cgacattcat tcgcttctac aatggtgtaa tgaaaatggt atgacactca 2700 acatcaacaa gtgtaaagtt atatcgtttt gccgcagcct tacctcaatt gactatgctt 2760 acgctatatc cggaactact ctcgagcgtg ttgaatctat tcgggactta ggagtaatta 2820 ttgactctaa attgcgtttc aatgagcata tttccgctat tacagctaag gcgttttccg 2880 tgcttggatt catccgtaga aatgcatcgc agttcactga cgtatatgca ctgaaagcct 2940 tgttctgcgc cctagtgcgc agtattttag aatacgctgt acctgtgtgg gcaccatatc 3000 acacgtctca agtgattcgg atagaaagag tgcaaaaatc attcatccgc ttcgctcttc 3060 ggcgcctccc ttggaacgac ccggttaact taccggatta ttcttcccgc tgtgagcttg 3120 tggaccttga gctgctctca gctagaaggc taaaacttca aagactcctt gtacacgata 3180 ttctgacggg taaaatcgac tgtactgatc ttcttgcaga agtacagctc aatgtacctt 3240 gtcgtagact gcgttatgca ccactcattg ccatccctat gcacagaacg aactacggat 3300 ataataaccc tctcacctct tgtttaagga attttaactc tgtttgtgaa atgtttgatt 3360 ttaatttgtc gaagaactgt tttaaaacta ggattaagaa tataggttag tcatttagtc 3420 tgtacgatat tattatcgaa gatgattata caataaacaa taa 3463 // ID L1_Ele19 repbase; DNA; INV; 4560 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele19. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4560 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4560 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 7 CC sequences with >96% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 158..1285 FT /product="L1_Ele19_1p" FT /translation="MNSSRRRENTFRIEFSNLPKKPSFENIHRFIAEALGL FT SKTQVVRVQINHYLWCAFVKCVDLATAQSIVQQHDNRHEIDISGKKYKLRI FT LMEDGAVEVRLHDLSEDVSDESIMDYMSQYGQVLAIEELAWSDKYAFDEIP FT SGIRLVKMILKNPIKSYITVEGETTYVTYFGQKHTCRHCHEYIHSGIPCTQ FT NKKLLVQKASVNERLNTNTRANPKTGSYANAVKTNAGPTFVLPSTSTSPPA FT PSPPTPSPPTTSPAPPLSLLQPAQADVADGAPAIDMECSSVQAASGSGPSL FT GSEMTTSPEQSFPSASSEVVIDDGPFKMPHVSNTSTDMNLDDEISDSSLAS FT AGSTSRWTRSKKYKFDNIQPKKASHDRRSTKQP" FT CDS 1298..4501 FT /product="L1_Ele19_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDFHSYNIGTININNITNENKLNALRHFVRQQDLDII FT FLQEIENDSISIPGYNSVCNVDHTKRGTAILLKDYIKYSHVERSLDSRLIS FT VRINENITLCNIYALSGSQHRASREAFFNNTVAFYLRNQAEYTIMSGDFNA FT VTHPKDSTGVSNYSMALKNTTQQLRLLDIWEILHGNRVEHTYICHNSASRI FT DRIYISSNLRPHVRTTTTHACSFTDHLALTARLCLPTMGREHGRGFWSLRT FT NVLTDETLEEFERKWSYWTRQRRHYSSWMLWWTAFVKPRLQSFFRKQSKDM FT SREFYTQHQHLYTQLREAYNNYHGNPAALVEINKIKSKMLMLQRKNTQMFQ FT KINETCLAGEPLSVFQLGERIRKRTVINEIEVSNGNIIEDPLAIQEHIVNY FT FKDLYSETTRLEARETFTCERQIPANSNKNNTSMDPITTNEIFAALKGSAA FT RKSPGSDGIPREFYLWAFDIIHRELNLLLNEALGGNILPEFVNGIIVLVRK FT KNTGNTIKSFRPISLLNYDYKLLARILKSRLDGIMTDHNIISNVQKCSNGK FT RTIFGATLALKDKIAQLRKHRRSGKLLSFDLDHAFDRVDRRFLFGTMDSLG FT FNAGLIHLLSSIAELSSSRLLINGHLSAPFTIQRSVRQGDPLSMHLFVIYL FT HPLLHRIQEVCNGQFDLVVAYADDVTMISTCVEKIERVKQLFLNFGRCSGS FT LLNLEKTSALDVGYITARNRVDVQWTQTTDSVKILGVIFTNSIRRMIILNW FT NTLIAKMAQQIWLHKMRFLSLHQKVLLMNTFISSKMWYISSILPIYKLHIA FT KITSLISSFLWQGQVVRVSMHQLALPKEKGGLNLHLPAFKCRSLLMNRHLC FT EMECMPFYRTFIESTQNPPNLAALPTDCPCLKQICQEFAYISQDVLAISTS FT STLHRRLLSQIETPKVMRESPNLNWKRIWLNVNSRQLNSLERSCYYLLVNK FT KILHAKLLHRMNLLPSPVCEHCGVADEDLQHKYXTCTRTANAWTIXQQIIN FT VFVPNRALSFEDMMKPTLLCLEQRTKIKIMKMFICYVNYINSEDNLIQLDS FT LKFHLEIICNE" XX SQ Sequence 4560 BP; 1440 A; 1029 C; 896 G; 1192 T; 3 other; agttagcttc tgtcactcaa gttgaacaga cacgtttcga aagaaagccg gctatttaaa 60 atcgttttta tacggattgc tcgaagtgag cttggcggtt aatccgcttc ggtcgtttcg 120 cgttacgttt cgccccgcgg gtgatatttg tgtaaccatg aattcatcac gccgacgcga 180 aaacacgttt agaatagaat tttccaatct accgaagaag ccaagcttcg agaacattca 240 ccgcttcatc gctgaagctc tcggtctatc caaaactcaa gttgtgagag tgcaaataaa 300 ccactatcta tggtgcgcat ttgtaaaatg tgtcgatctt gccacggcgc aatcaattgt 360 ccaacagcac gacaatcgtc atgaaattga catctctggc aaaaagtaca aactcaggat 420 cctaatggag gatggcgccg tagaggttcg gctacatgac ctttcggaag atgtttctga 480 cgaatctatc atggactaca tgtcacaata cggacaagta ttagcaattg aagagcttgc 540 gtggtctgac aagtatgcgt tcgatgaaat tccttcagga attcgattgg tgaaaatgat 600 tttgaagaac ccgatcaaat catatatcac cgtggaaggc gagactacat acgtcacgta 660 ttttggccag aaacatacat gtaggcattg ccacgaatac attcactctg gcattccctg 720 cactcagaat aagaaacttc ttgtgcaaaa agcgagtgtc aacgaacgac tcaatactaa 780 tactcgggct aatccgaaga ccggttccta tgccaatgca gttaaaacga acgcaggacc 840 taccttcgtt ctcccttcta catcaacgtc accaccagca ccatctccgc caacaccgtc 900 accgccaaca acatcacctg cgccgccgct atcgctactt caacctgctc aagcagacgt 960 agcggatgga gctcccgcta ttgatatgga atgcagctcg gtacaggcag cttcgggatc 1020 aggtccgtca ctaggatcag aaatgactac gagtccggaa caaagttttc catcggcgtc 1080 ttctgaagtt gttatcgatg atggcccctt caaaatgccg catgtttcaa acacttcaac 1140 cgatatgaat ttggacgacg aaatcagcga tagctcttta gcctctgcgg gtagcacaag 1200 cagatggacg cgttcgaaaa aatacaagtt tgataatatc caaccgaaaa aagcaagcca 1260 cgaccgacga tctacgaaac aaccgtaaat atcactaatg gatttccata gctataatat 1320 tggaaccatc aatattaaca acataacaaa cgaaaacaaa ctcaacgcac tccgacattt 1380 cgtgagacaa caagatctcg acatcatatt tctgcaagaa attgagaacg attctatatc 1440 catccctgga tataactctg tttgtaatgt cgatcatacc aagagaggga cggctattct 1500 gctgaaagac tatataaaat actcacacgt tgaaagaagt ttagattcta ggctaatttc 1560 tgtacgtatt aacgaaaata tcactctgtg caatatatac gcgttatctg gaagtcaaca 1620 ccgagctagc agagaagcat ttttcaacaa cacagttgca ttttatctac ggaatcaagc 1680 ggaatacact atcatgagcg gagacttcaa tgcagtaact cacccgaaag actccacagg 1740 tgttagtaac tacagtatgg ctctcaaaaa tactacacaa cagctacgtt tactcgatat 1800 ctgggagata ttacatggta accgagtcga acacacctat atttgccaca attcggcatc 1860 ccgtattgat aggatctaca tcagctccaa cttacgtcct catgtgagaa caacaaccac 1920 acatgcgtgt tcgtttacag atcacctggc gctgacggct agactctgtt tacctacaat 1980 gggtagagag cacggaagag gtttttggag cttacgtacc aatgttttga ctgatgaaac 2040 tctggaagaa ttcgagagga aatggagcta ttggacacga cagcgaagac attatagctc 2100 atggatgctc tggtggactg cttttgtaaa gccaagactg caatctttct ttcgtaagca 2160 atctaaggat atgtcaagag aattctacac ccagcatcaa catctgtata ctcagcttcg 2220 tgaagcctac aacaactacc atggcaatcc tgccgcgcta gtggagatta acaagatcaa 2280 atccaaaatg ttaatgttac aacgaaagaa tacgcagatg ttccaaaaaa tcaatgaaac 2340 ttgccttgcc ggtgagcctc tatcagtttt ccagttggga gaaagaattc gaaaacgcac 2400 tgtaattaac gaaattgagg taagcaacgg taacatcatc gaagacccat tagctatcca 2460 agaacacatt gtcaactact tcaaagacct gtactcagag acaaccagac ttgaagcccg 2520 ggagaccttc acttgcgaga gacaaatacc tgcaaacagc aataaaaaca acacttcgat 2580 ggaccctatc actaccaacg aaatattcgc ggcactcaaa ggcagtgcgg ctcgtaaatc 2640 tccaggctca gatggaattc ccagagaatt ctatttatgg gctttcgata taatacacag 2700 agagctcaat ttgctcttaa acgaagcact aggcggtaat atcttgccgg aattcgtgaa 2760 cggtataata gtgttagtac gtaaaaagaa tacgggcaat accattaaat ctttccgtcc 2820 aatttcgtta ctaaattacg actacaaatt gctggcacga atacttaagt ccagacttga 2880 tggtattatg actgaccaca acatcattag caatgtgcag aagtgttcta atgggaaacg 2940 aacgatcttc ggagcgactc ttgctctaaa agataagata gctcaactac gtaaacatcg 3000 gcgttctggg aaactcttgt cgttcgatct ggaccatgca ttcgatcgag tcgatcgtcg 3060 ttttctcttc ggtacaatgg attctcttgg cttcaacgct gggctgatac atctactatc 3120 atctatcgcg gagctttcat catcacgact actgataaac ggtcatctct ctgcaccgtt 3180 cacgatacag cggtctgtac gtcagggaga ccctttgtcg atgcacctgt tcgtcatata 3240 tctccatcca cttctgcatc gaattcaaga agtctgtaat ggtcagttcg atttggttgt 3300 tgcgtatgcc gacgacgtaa cgatgatatc gacgtgtgtg gagaaaatcg agagagtaaa 3360 gcaactattt ctgaacttcg ggagatgctc aggttcccta ttgaacttag agaaaacatc 3420 ggctctagat gtgggttaca taacagctcg caatagagta gatgtacagt ggacgcagac 3480 tactgacagt gtcaaaatcc taggagtcat ctttactaac tcaattcgac ggatgattat 3540 tcttaattgg aacacactga tagcgaagat ggcccagcaa atttggctcc ataaaatgcg 3600 atttctttca ctgcatcaaa aagtattgtt gatgaatacc ttcatctcgt caaagatgtg 3660 gtatatctct tcaatattgc cgatctacaa actccacatc gcaaaaatta cgtcgctcat 3720 ctctagcttt ttgtggcaag gacaagttgt aagagtatct atgcatcagt tagcgcttcc 3780 taaagagaaa gggggcctga accttcacct cccggccttc aaatgcagat cgcttctaat 3840 gaacagacat ctttgtgaaa tggaatgtat gccgttctat agaaccttta tagaatcaac 3900 acaaaacccc ccaaacctgg cagctctccc aacagactgc ccatgcctca agcaaatttg 3960 ccaagaattt gcttacattt ctcaggatgt gctagcgatt tcaacatctt caacattaca 4020 tcgtcgtttg ctgagtcaaa tagaaactcc caaggtcatg cgtgaatctc caaatttgaa 4080 ctggaagcgg atttggctca acgtaaactc gcgtcaactg aattcgctag aaagaagctg 4140 ttattatttg ctggtaaata agaagatctt acacgctaaa ctcttacatc gaatgaactt 4200 gctaccgtca ccagtctgcg aacactgtgg agtagctgac gaagatttgc agcacaaata 4260 cgmaacctgc acgaggacgg caaatgcatg gacgatattk caacaaatta tcaacgtatt 4320 tgttcccaac agagctctta gttttgaaga tatgatgaaa ccaaccctwc tatgtttaga 4380 acaaagaact aaaatcaaaa taatgaaaat gtttatttgc tatgttaatt acataaactc 4440 tgaagataat ttaattcaat tggattcgtt aaaatttcat ttagaaataa tttgcaatga 4500 ataactttgt tactatgtaa agctttaaaa cttgactaat aaaatatata aaaaaaaaaa 4560 // ID Gypsy-7-I_HM repbase; DNA; INV; 4177 BP. XX AC . XX DT 25-DEC-2008 (Rel. 13.12, Created) DT 25-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-7-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4177 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1980-1980 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(75..2588,2555..3145,3102..4166) FT /product="Gypsy-7-I_HM_1p" FT /translation="MEHFLPKPNEMDFNESNIAVTWKKWKQTMQLYLKAAM FT TNKSEEEKYATFLFVIGEKGREIFNTWTWNKTENDDGEIDDITVKELFERF FT EEYCLPKRNLVLERRKFFLRKQDVNESFDFFVTDLKNLAQTCEFENIQDGL FT ILYKIIDGIKSNKVQNNLLRKGADLTLTKAIDICRAEEVANAEMKQITYEK FT DIDVVKKFIKPRQANASIGQIPKWRKFSNNENESKGKDRVEPNKQNWKQIN FT NGSNKKQQQRQCKKCNRKHEPRNCPAFGQTCRKCNKINHWATCCNSKNIHE FT NCAVEYHVESVSSGEMNGKNEALIIFKINNKLVRGKIDTGAEVDVMPKRVY FT DQLTSRDNLTYTNVKLRGYGGNHIPVLGTSNMMCYYKSRKKLVKFYVVETT FT SKLVLSLQTSQDLKAIKLLDEVKTVDTNEAKSQQKEISTLNKVKRIEGKQG FT KELKEEILKMYPKLFNGLGVIGPEHHIVLKNDSNPVIHPPRKIPVTLREKI FT KTELDEMEKTGVITRVDEPTEWVNSLVVVEKPNGQLRLCLDPRDLNKAIKR FT EHYQLPTFEEISTRLYGANNFTKLDANKGYWQIPLDKDSSMLTTMNTPFGR FT FRFTRLPYGIHSAQEVFHKRISQCFDDLHFVETDIDDILIWGTKDVSHDEN FT LIKCLDRAEKMGMTLNIEKCRFRTQELIYLGHKLSINGVQPDESKIKAINE FT MPKPEDKKGVQRFLGMVNYVGKFLPNLSETTEPFRNLIKKGTHMQWGEAQD FT RSFRKIKKLLCSEQCLAFYDIQKPVTIQVDACKTGVGAVLLQNEKPIAFAS FT RSMTPAQLNYAIIEKELLAVLFGCERFHQYIYMEKKAISPIYIYGKKVTVN FT SDHKPLENIMKKSLANTPPRLQRMLLRLQKYDIDLIYKAGKEMILADTLSR FT AHTNETEEEIPEDEMIAQVHMVYANCSSEENLKEIKTNTLKDITLNVVANY FT ITNGWPQDKRKVEDSVKPYWSFKEELTLIEGIIFKGQRILIPTSMRKATIQ FT KLHLAHMGIEKTKMRARETVFWPGINKTNEQEKLYFGQALTKQIEDLVKSC FT ETCLEYTKKQTKELMISSEIPNHPFQIVGTDLFHWNNQDFIIVVDYYSRYW FT EIERLHNIQSQTVIKKLKMIFSRLGIPQIVRSDNGTQFTSSEFQKFSKEWE FT FKHITSSPQYQQSNGMAERHVQIAKNLLNKAKRSNQDLYIALLEVKNTPVD FT GLASPAQLISGRRLRSTLPTSLSLLQVNPIDVNKFKKNREKVQNLQKKYYD FT KNAKTLENVSDGDCVRIHDGNKWQPGKIVESKQQRSFNVKTLNGNIVRRNR FT RHLLKTKEEVDWDVNLSDNESDMNESKDIFIGKRKLSNNTNIDTNTNQNES FT LDHDIVTRSGRISRKPKHYIDYEMHF*" XX SQ Sequence 4177 BP; 1755 A; 653 C; 757 G; 1012 T; 0 other; tggtaccaga agtgaactga agacatttta agaagagatt ttggaaaagt acttatttta 60 aaaagaaaaa gaaaatggaa cactttttgc caaagccaaa cgaaatggac ttcaatgaat 120 ccaatatagc tgtaacatgg aaaaaatgga aacaaactat gcagttgtat ctaaaagcag 180 caatgacaaa taaatctgag gaagaaaaat acgcgacatt cttatttgtg atcggtgaaa 240 aaggaagaga aatattcaat acgtggactt ggaacaaaac agaaaatgac gacggtgaaa 300 tcgatgacat cacggtaaaa gaactttttg aaagattcga agaatactgt ttaccgaaaa 360 gaaatctggt tttggaacga agaaagtttt tcttacgaaa acaagacgtc aatgaatcat 420 ttgatttctt tgtaactgat ttaaaaaatc tggctcaaac atgcgagttt gaaaacattc 480 aagatggtct gatattatac aaaataattg acggaatcaa atcaaacaaa gtccaaaata 540 atctacttag aaaaggagcg gatttaactt taactaaagc gattgacata tgcagagctg 600 aagaagttgc aaatgcagaa atgaaacaaa ttacatacga aaaagatatt gacgtggtaa 660 agaaattcat aaaaccaaga caagcaaatg cgtcaatcgg acaaatacca aaatggagaa 720 agttctcgaa taacgaaaat gagtcaaaag gcaaagaccg ggttgaacca aacaaacaaa 780 actggaaaca aatcaataat ggctcaaata aaaaacaaca acaaagacag tgcaagaaat 840 gcaacagaaa acatgaacca agaaactgtc ctgcattcgg tcaaacgtgc cgtaaatgta 900 ataaaataaa tcattgggca acttgctgca actcgaaaaa catacacgaa aattgcgctg 960 tagaatacca tgtggaatct gtatcatctg gtgaaatgaa tggtaaaaat gaagctttga 1020 taatctttaa aattaataat aaattggtac gtggaaaaat tgacactgga gctgaagttg 1080 atgttatgcc aaaaagagtc tacgaccaac taacatcacg tgataatctc acttatacaa 1140 atgttaaact acgtggttat ggaggaaatc atattccagt gcttggaaca tcaaatatga 1200 tgtgttacta caaaagtcgt aaaaagttgg ttaaattcta tgtagttgaa acaacaagca 1260 aactcgttct tagcttacaa acttcacaag acttaaaagc tatcaaacta ctagatgaag 1320 taaaaactgt agacacgaac gaagctaagt cgcagcaaaa agaaatatct actctaaaca 1380 aagtaaaacg gattgaagga aaacaaggaa aagaattaaa agaagaaata ctaaaaatgt 1440 atccgaaatt attcaacgga ttaggcgtga ttggacctga acatcacatc gtgttgaaga 1500 atgactcaaa tcctgttatt catccaccaa gaaaaattcc tgttaccttg agagaaaaaa 1560 taaaaacaga attggatgaa atggaaaaaa caggagtaat aacaagagta gatgaaccaa 1620 cagaatgggt aaattcgcta gtggttgtgg aaaaaccaaa tggacaacta agactctgtc 1680 tggatccaag agacttaaat aaagctataa aaagagaaca ttatcaatta cctacttttg 1740 aagaaatatc aactagactt tatggtgcaa ataatttcac aaaactagac gcaaataaag 1800 gttattggca aattccactg gataaagata gctctatgtt aaccacgatg aatacaccat 1860 ttggaagatt tagattcaca cgtttacctt atggtataca ttcggctcag gaagtatttc 1920 ataaaagaat tagtcaatgt tttgatgatc ttcattttgt ggaaactgat attgatgata 1980 ttctaatatg gggcacaaaa gatgtaagtc acgatgaaaa tctaataaaa tgcctagatc 2040 gagctgaaaa aatgggaatg accctaaata ttgaaaaatg ccgatttcga acacaagagt 2100 tgatatactt aggacacaaa ttatcaataa acggagtaca accagatgaa tcaaaaatta 2160 aagccatcaa tgaaatgcca aaacctgaag ataaaaaagg agttcaaaga tttctaggta 2220 tggtaaacta tgttggtaaa ttcctaccta acttatcaga aacaacggaa ccatttcgaa 2280 atctaattaa aaaaggaaca cacatgcaat ggggcgaagc acaagatcga tccttcagaa 2340 aaattaaaaa gctgctatgt tcggaacaat gtcttgcatt ctatgatatt caaaaacctg 2400 taacgataca agtagacgca tgcaaaactg gtgttggagc agtacttctt caaaatgaaa 2460 aaccaattgc atttgcatca aggtcaatga caccagctca attaaattat gcaatcattg 2520 aaaaagaact tctcgcagta ctttttgggt gtgagcgatt tcaccaatat atatatatgg 2580 aaaaaaagta actgttaaca gtgatcataa accgttggaa aatattatga agaagtcgtt 2640 agcaaacact cctccgcgtc ttcagagaat gttattaaga ttgcaaaaat atgatattga 2700 tttaatttat aaagctggta aagaaatgat tctggcagat acgttatcta gagctcatac 2760 caatgaaact gaagaagaaa ttccggaaga tgaaatgatc gctcaagtgc atatggttta 2820 cgcaaactgt tcatcagaag aaaacctcaa agaaataaaa acaaacacgc tcaaagatat 2880 tacgttaaat gtagtagcaa actacataac aaatggatgg ccgcaggata aaagaaaggt 2940 tgaagactca gtaaaaccct attggtcgtt taaagaggaa ctgacactca tagaaggaat 3000 aattttcaaa ggacaaagaa ttttaattcc aacatcaatg cgaaaagcaa caattcaaaa 3060 actacatcta gcacacatgg ggattgaaaa aactaaaatg agagcaagag aaactgtatt 3120 ttggccaggc attaacaaaa caaattgaag atttagtaaa gtcgtgtgaa acatgtctgg 3180 aatacacaaa aaaacaaact aaggagttaa tgatatcaag cgaaatacca aatcacccgt 3240 ttcaaattgt gggaacggac ttatttcact ggaataatca ggattttata atagtggtcg 3300 actattatag tcgttattgg gaaatcgaaa ggttacataa cattcaatct caaacggtga 3360 taaagaaact aaaaatgatc ttctccagac ttggtatacc gcagatagta cggagcgaca 3420 acggcacaca attcacatct tcagaattcc aaaaattcag taaagagtgg gagtttaaac 3480 atataactag tagccctcag tatcaacaat caaatggaat ggctgagaga catgtgcaaa 3540 tagcaaaaaa cttgctaaac aaagcaaaaa ggtcaaatca agatctgtac atagctctac 3600 tagaagtaaa aaacactcct gttgacggtc ttgcatcacc tgctcagctc ataagtggaa 3660 gaagattgag atcaacactt cctacttctc tgtcactact tcaagtaaat ccaatcgatg 3720 taaataaatt taagaagaat cgtgaaaaag ttcaaaactt acagaagaaa tattatgaca 3780 aaaacgcaaa aacattggaa aacgtaagcg atggagactg tgttcgtata catgatggaa 3840 ataaatggca accaggaaaa attgttgaga gcaaacagca acgaagtttc aatgtaaaaa 3900 ccctaaatgg aaatattgtt agaagaaaca gaagacatct actaaaaaca aaagaagaag 3960 tagattggga tgtcaattta tccgacaatg aaagcgatat gaatgaatca aaagacatat 4020 tcattggtaa aagaaaactg tcgaataaca caaacattga tacaaataca aatcaaaatg 4080 aatcattaga ccatgacatt gtcacgcgat caggaagaat ttcgaggaag ccaaaacatt 4140 acattgatta cgaaatgcat ttttaaaaga aggaagg 4177 // ID BEL-639_AA-LTR repbase; DNA; INV; 661 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-639_AA_; KW Pao_Bel_Ele103; BEL-639_AA-I; BEL-639_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-661 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 661 BP; 234 A; 121 C; 119 G; 187 T; 0 other; tgtcatcgaa acccctcggt gtcgatagca cctcatcaac agctaccttg cggaagactg 60 tcgcggtgac agcggttgac agatcgaacc gatacaatga actgtcaggt taatgacaag 120 gcaaaattac aatatagcgt gtgcaccagt aacatataat actccagatt aattgtcttg 180 aaattcttat aaattatatt actatgaatt tatctctcaa gtgaattaag gtagtatact 240 tgtgtgaact agaactatac ttgtgtcgac ttataactaa acatttattt aacttagaac 300 tacataacgt aaaatagcct aatagtacgg agaggacagt cgatcataag tacgtttgat 360 tgatagcagg tgagaaattg atattctatt tgaagttata atttctcata ttcaccaacc 420 gtaattacat aggtctcttg ataaaacctt aatcagcagg ctaggtccga cgaagccgag 480 gaaaaaaaaa gactaaatgc actaaacgta agaagttata caccttaaaa cttaacctaa 540 aacatgccct aaaatctaat ttacatgtaa acaggaattt aaaacgggcc tgaacttcgg 600 acgaataaag gtcgttaagg atttggaacg catacatcgt ctttttcttg ttacagcggc 660 a 661 // ID Gypsy15-SM_I repbase; DNA; INV; 3349 BP. XX AC . XX DT 02-MAR-2008 (Rel. 13.03, Created) DT 02-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE LTR retrotransposon from freshwater planarian: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy15-SM; KW Gypsy15-SM_LTR; internal portion; Gypsy15-SM_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3349 RA Jurka J.; RT "LTR retrotransposon from freshwater planarian."; RL Repbase Reports 8(3), 245-245 (2008). XX DR [1] (Consensus) XX CC Positions [2628-3002] - Integrase core CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 678..3347 FT /product="Gypsy15-SM_I_1p" FT /translation="MLKERLLSINGLKMLCLIDSGCSTTIIRGESNTSPNA FT QVFLADGSILPVSQSKAEIELEGKYFFIDCFISRKMIAGYDVILGMNAIMA FT LGGVTISEDVRFGCELKCLTAANSNEVHSLTIQRKNFEAKFNKEWMVSWNW FT NTEAPVLMSDRYSKLDPDKPKAGQALKVIEQWIKNGWLVEYDENRFGEPVG FT AVTLFPVMQKNKSKIRPVFDFRALNHYLDVYSGEAVVCNETIREWRKKGNK FT IAILDLSDAYMQIRVDESMWKYQSCIINGRRYALTRLGFGINIAPGIMTEI FT VNHVLGLNDRIKQGASAYIDDIFIDEEVVPAEEVKAWLSKFNLMTKEIERP FT SSTKALRILGLSVFIEDNVLKWNRANAIRYEFEENIMNRRELYSLCGQLTG FT ILPVAGWLRPACSYLKRLSEGEWNQPIDAQLLDKVKWLMKEVEKNDPAHGT FT WKAPMENGNILWCDASSIAIGAALEIDGNIMEDACWLRKEKDVRHINLCEL FT EAIVKGINLCIKWNLKFVKIMSDNKSVVSWVHNALTSERKITTKASSEMLV FT KRRLSIIKELITEYEIKAEIEYVPTNKNKSDVLTRIPSNWITKHSCNMALL FT DFGEDPIKNATEIHKLSHFGGDRLKFLIEESGINLSQNELKEIVEKCDKCQ FT SIDPHPVKISKGQLEVKDSWKRLAIDVTKLHQKKFLTIIDCGPSRFAIWRE FT IRNESADEICKQLEQVFYERGPPDELLLDNYLSFRSETFTSLMKKWRIRTE FT FRSANRPAGNGIVERNHRTIKAAVTRSECSVQEAIYWYNNTPVNGSFRPPI FT NGIYSYDVRLRFNETKEMVRQIDNGFQINDKVWVKPAQNQPCTSPWIRGRI FT MKLLSPHKVIVDYGTVAMPRHIADLRHRNEDSLESRG" XX SQ Sequence 3349 BP; 1207 A; 570 C; 727 G; 845 T; 0 other; agctatccgg gcatgataaa agacattcta agcgcaatag aatgttttga tggagaaaat 60 gtaaaaattg atgaatggtt cgataagttc gaactgatat gcaagctcca taaattcaat 120 ggaaaaaagg aaattttgcc ggctgtcatg aaggggaaag cgtttatcac tctgaaggaa 180 gcagtgaaga atgaagatat tagctatgaa gaaattaaag aatcattact acgagctttc 240 aatgaccccc cagaagcatg cttctccaag ttattaaata agagattcca gatcggggat 300 aatatcaata ttcatttatc agaaatgaag aatctggctt ttaactctgg agatctactg 360 gtaaatatta taataaactc cttgccaaca gaaattgcag ctgaagttag acgacaagga 420 tcaacatcaa tcgacgcaat cggtgacatc gtttcacgat tgaatttagt acaaagacat 480 cagtctgttt ttgcagtgaa aaccaagaat gtgcaatgct tcatatgcaa cgaaagccat 540 tttgctcgta actgtatgaa caagaagaaa attcaatgct cccgatgtga aatgcttgga 600 catcaatccc gatattgtaa atcgggaaaa gacaaaggga tcgagactat tccggagtct 660 ctctcgaccc caaaaagatg ctaaaagaaa gactcctttc gataaatgga ttgaaaatgc 720 tatgcctcat agattccggc tgctcgacaa caattatacg cggagaatcc aacacgtcac 780 ccaacgctca agtgttccta gcagatgggt cgatactgcc agtaagtcag agcaaagcag 840 aaatcgaatt agaaggaaag tactttttta ttgactgttt catttctaga aaaatgatag 900 cgggatacga cgtcatatta ggcatgaacg ctattatggc actaggaggt gtgacgatca 960 gtgaagatgt tagattcgga tgcgaattga aatgtttgac ggctgcaaat agtaatgaag 1020 ttcatagttt gacaattcaa aggaaaaact tcgaagcaaa gttcaacaaa gaatggatgg 1080 tatcatggaa ttggaacaca gaagccccag tattaatgtc agatcgatac tcaaagttag 1140 atccagataa acccaaggca ggccaagcac taaaagtcat cgagcagtgg attaaaaatg 1200 gctggttagt tgaatatgac gaaaaccgat tcggtgagcc tgttggagcg gtaactttat 1260 ttcctgtaat gcaaaagaat aagagcaaaa tacgtccagt atttgacttt agagctttaa 1320 atcattatct agacgtttat agcggcgaag cagtagtctg taatgaaacc atacgagagt 1380 ggagaaagaa aggaaacaaa atagcaattt tagatctatc tgatgcgtac atgcaaatca 1440 gggtcgatga gagtatgtgg aagtaccaat catgcattat aaatggaagg cgatacgcat 1500 tgacaagatt gggttttgga atcaatatcg ccccaggtat catgacagag atcgttaatc 1560 acgtgttggg tttaaatgac agaataaaac aaggagccag tgcgtatata gatgatatct 1620 ttatcgacga agaagtggtt ccagctgaag aagtaaaagc ctggttaagt aaatttaact 1680 taatgacgaa agaaattgaa cgtccatcat caacaaaagc tctaagaata cttggtctct 1740 cagtatttat tgaagacaac gtattaaaat ggaaccgagc caatgccatt cgttatgaat 1800 tcgaagagaa tataatgaat agacgagaac tgtattcatt atgcggtcaa ttaaccggaa 1860 ttcttcccgt agcaggctgg ttgcggccag cgtgctcata tctcaaacga ttaagtgaag 1920 gagaatggaa ccaaccaatt gatgcacaat tattagataa agtaaaatgg ttaatgaagg 1980 aagtcgaaaa gaacgatccg gctcatggca catggaaagc accaatggaa aacggaaaca 2040 ttttatggtg tgatgccagc tctattgcca ttggagcagc tttggaaatc gatggaaata 2100 ttatggaaga cgcatgctgg ttgagaaaag aaaaggacgt taggcacatt aatctgtgcg 2160 aacttgaggc gatcgtcaaa ggcattaatt tatgcatcaa atggaattta aagtttgtaa 2220 aaataatgtc ggataacaaa tctgtggtaa gctgggtgca taatgcctta acatccgaga 2280 gaaagattac aacgaaagct tcttcagaaa tgttagtaaa aagaagactc agcatcatca 2340 aagaactaat tactgaatat gaaattaaag ccgaaataga atacgttcca acaaataaaa 2400 acaaatctga tgttttaact cgaattccct ctaattggat tacgaaacac tcatgtaaca 2460 tggcgttact agattttggt gaagatccca tcaagaacgc aactgaaatt cacaaattaa 2520 gtcatttcgg aggagaccgg ttgaagttct taatcgagga gtcaggaatt aatctatctc 2580 aaaatgaatt aaaggaaatt gtcgagaagt gtgacaaatg tcaatcaata gaccctcacc 2640 cggttaagat cagtaaaggg caattagagg taaaagattc ttggaaacga ctagcaattg 2700 atgtgacaaa attacatcaa aaaaagttct taacaatcat agattgcggg ccctctcgat 2760 ttgcaatttg gcgagaaata cgaaacgaaa gcgcagacga aatttgcaaa cagttggaac 2820 aggtttttta tgaaagagga cctcctgatg aacttctgtt ggataattac ttatcgttta 2880 ggtcagagac tttcacttcg ttgatgaaaa agtggagaat tagaactgag ttcagatccg 2940 ctaatagacc agcaggcaac ggcatcgtcg aaaggaatca tcgaacaata aaagcagcag 3000 tcacacgatc ggaatgctca gtgcaagaag caatttactg gtataataat acaccagtga 3060 atggctcttt ccgaccccca ataaatggca tttattccta tgacgtaaga cttcgattta 3120 atgaaacgaa agaaatggtt cgacagatag acaatggctt tcaaattaac gacaaggtgt 3180 gggtaaaacc cgctcaaaat caaccgtgta catctccatg gataagagga agaatcatga 3240 agctgctgtc tccacataaa gtaattgtcg attacggaac tgttgcaatg ccacgtcata 3300 tagccgactt aaggcacagg aatgaagatt cgttagaatc tcggggaga 3349 // ID I_Ele36 repbase; DNA; INV; 6805 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele36. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6805 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6805 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 4 CC sequences with >99% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 544..1794 FT /product="I_Ele36_1p" FT /translation="MCELKTSYRIRLLLAKELKLVIGLKEARAVEAIREGR FT GARCLLRTNSLSTYQKLQKIVRLPDGTDVEVFPHATLNTIQGIAYHPDTVD FT VDEESALDFLSSQGVKHVRRIKKRFGKDFRNTPLTILTFHGTVLPEYVYFG FT LMRVQIRKYYPNPMVCFNCGNFMHTKKNCTQDKICLNCSQAHDMVEGVACP FT NAPFCKNCRGSHSCVSKECPVYAEEEKIIKLRIDQGITFGEARKQIKEQKT FT QTTYASCVQHRMSADELGKDEIIQALRAELQTARSEFNQLKELYKKSIHQL FT QTHQQHCVLLQEVQQKQKQTASELKKQSSSSQPAQITSESATQATHIPRKD FT QPSRSPPKDKRNKNAQSNKLNDQHTNSTWSTRIRSRSNKRSSQASPSEADP FT SLRGAKTFVAQACDNDSGPHMQS" FT CDS 1800..6725 FT /product="I_Ele36_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDQDSLTNNTVYNDENSATNMAVNMETQQDPPPNDIF FT DRNSTMNNYYDNYNNVLNEELESSAITDPISVNRSRRPRHTMLTPLRNVSR FT DNRPDYLREVVNVACQNMQASTSSNYQANPCCRLPSTFPDASRSIPEIDLH FT NRSKSQNPTEYFDVRIXNPVLFHCDAPLHDNVDDEEMQHPGISYQMPQVNE FT RSCRPRYTMLTPLRNVSRDNRPDYLREVVNVACLNMQASTSRNYQSNPCCR FT LPSTFPDASSSIPEIDLHNRRINPALSHDDASSHDNVDNEEMQPRSINFNN FT SENFSCIGPFSSDDTTNSDQRCIINNRSYGPRHTMLTPLRNVSRDISPDYL FT REVVNVAGLNMQASTSFNHRSNPCCRVSSNFPDTPSSTDIGLHLQNTVEMD FT FQQAQNNDFVPVLDTVSPSEPPTNNRRDLSSSSHANSSSTNNQTRCPRIAV FT QWNINGLYNNLTDLQILTQCNPPLILALQELHRHTPSALNNLLRGEYNWLS FT KCGSTRYQNVALAIHRSVAYDTINLNTSLTAVAATVYVPFRLTVVSIYLQH FT SGIENLESSLIHLIDQLEGPLLIMGDMNSHHYAWGSKKTDRRGEAILRVAE FT DKNLIVMNDGTPTFIRGDQQSAIDVTMASTSILEKLRWKVHCDSMGSDHFP FT IEIYSSDAQIHTKRRPRWKFDEANWDGFEEAVLSCMEPAKQYSIEDLTDII FT LLAAKSNIPRTGQFPGKRSVHWWNAETKAAVKVRRKALRAFKRLPVDHPNK FT QQAMENLRIQRSNCRRIVKEAKIKSWEEFLNGINSTHTTTEIWRRVNAISG FT KRRTNGLCIRNDGEVTRDPVKVANTLAKYFEELSSRAAYSDSFASKNPATA FT LSNVTIPEDPEESEYNRLFCIEELYYALEASNGKSCGLDEVGYPMIKRLPY FT RTKIALLDAINKVWTTGTIPEPWSRSLVVPIPKNSAASTDAKNYRPISLTS FT CVCKIMERMVNRRLTTYLTENQKLDNRQXAFQRGRGTGTYHAVLGQVLDDA FT LKANLHVDFAALDLSKAYNRVWTPGVLQQLIKWGITGNMAKFIKGFLSNRT FT FQVIVGNDKSEVMKEETGVPQGSVLAVSLFLVAMNSVFDRLPKGIFVFVYA FT DDVLLVAVGAHPRALRRKLLAGVRSVAKWADNVGFRMAPEKCAITHCCNHP FT HHPWRLPLTVGNMDIPYKKVLRVLGVTIDRKLTFSVHFASIKKESENRMRL FT IRAISSRHQTSNRRTILQISNALITSRLLYGIEITCRASEVLINTLAPTYN FT RFVRFASGLLPSSPTLASXNEAGVLPFEFIVAKSVADRAINFIEKTYGGNE FT EVFLLTEAENLLQKYANETLPPIAGLHRVGDRRWDTTAPRIDWEMKQHVRA FT KDPPWKVSSYFNALLQRKYKNHKHIYTDGSRANGEVGIGVYSRSSKVAARL FT PNCCSIFSAEAAAILLALRNVREEPVVIFTDSASVLSALDSGKSSHPWVQE FT IEERTTAATTLCWVPGHCGVHGNEEADHLASTGRTMNQRIMPSPGVDIKRK FT LKEHLILAWTRKWHTERNLFLRKIRDSTEKWNDRLNQKEQRIISRLRVGHT FT RLTHPHYFSNEQQPRCEICSVRLTVEHILAQCPQYQPGRDALELKYSIRDI FT LSNDREEEEKLLLFLKQIGLYDKI" XX SQ Sequence 6805 BP; 2180 A; 1560 C; 1448 G; 1612 T; 5 other; ctcagttttt aatcgggtta ccggttgtta cataaaacgc taatagagct cttatacaat 60 ttcgtttttt gacgccgtta aagaagcttt aaaataccca aaaggacata ggcaattcga 120 agtgtgctaa tattcgacgc gtgagtgaaa tttagcaaaa tattgccgta tattttacaa 180 aatacgtggt aaaatataat caacacctcg tgatagtgtg cctttgttga gaacgaagta 240 aagcaaacat ccataccgtg cattgtgttg ttgcttaccc gatcaaaact tttgttttga 300 cgaccgactg tgtgcttcgg tctaacagtg attaatacca aaaagcggaa tagtgcgttt 360 gaattaagct cggccggatg gccgaaatca cctctccccc atctggggga aatgtggatg 420 acccattggt gcaagctcac ggaagagtgc caaattggtt ccgtagtagt gacgatatgg 480 gccagaattt ggtacttgtg atgcgcgtta agccgaaacc aaacgaccta gcagctacaa 540 gtgatgtgcg aactgaaaac aagctaccga attcgtttat tattggcaaa ggagttgaaa 600 ctggtaatcg gactgaaaga agcgagagct gtggaggcga tacgtgaagg acggggtgca 660 cgttgtctgc tgagaacaaa ctctttatcg acataccaga agctccaaaa aatcgttaga 720 ttacctgatg gaactgatgt cgaggttttt ccacatgcaa cgctaaacac aattcaaggc 780 attgcctatc accctgacac tgtggacgtt gatgaagaat ctgcgctcga ctttctcagc 840 tcccaaggcg tcaaacatgt caggagaatc aagaaacgtt ttggaaaaga ttttcgcaac 900 accccgctca ctattctcac cttccatgga acagttctac cagaatatgt atattttggc 960 ttaatgagag ttcaaattcg gaaatattat ccgaatccaa tggtctgttt taactgtgga 1020 aactttatgc acacgaagaa aaactgtaca caagacaaga tttgcttgaa ctgttcacaa 1080 gctcatgaca tggtggaagg cgttgcatgt cctaatgcgc cgttctgcaa aaactgccgt 1140 ggaagtcact cttgtgtatc gaaagaatgc ccagtgtacg cagaggagga aaaaataatc 1200 aaacttcgaa ttgaccaagg aattacattc ggcgaggctc gcaagcaaat taaggagcaa 1260 aaaacgcaga caacatacgc aagctgtgtg caacaccgaa tgtcggctga tgaattagga 1320 aaggatgaaa taatacaagc gctcagagca gaacttcaaa ccgccagatc tgaattcaat 1380 caattgaagg aattgtacaa aaaatcaatc catcaattac aaacccacca gcaacattgt 1440 gttctactcc aagaagtaca gcagaaacaa aaacaaaccg cgtctgaact caaaaaacag 1500 tcttcatcca gtcaaccagc ccaaatcaca agtgaatcag ctacccaggc aacccacatt 1560 cccaggaaag atcaaccgtc gaggtcacca ccgaaggaca aaaggaacaa aaatgctcaa 1620 agcaataaac taaacgatca acatacgaat tcaacatggt caaccagaat ccggagccgc 1680 agcaacaaac gatcgagtca agcgtcacct tcagaagctg accctagcct cagaggtgcg 1740 aaaacgtttg tcgctcaggc atgtgacaac gacagtggac cccatatgca atcctaatta 1800 tggaccaaga ttcattaacg aacaacacag tctataacga tgaaaactct gctacgaaca 1860 tggccgtgaa catggaaact caacaagatc ccccccctaa cgatattttt gaccgcaact 1920 caacaatgaa taactactat gacaactata acaacgtgct caacgaagaa ttggaaagct 1980 cagcaattac agatccaata tccgtcaaca ggtcgcgtag accaaggcat accatgttga 2040 cccctctgcg gaacgtctca agggacaatc gccctgatta tctccgtgag gtggtcaatg 2100 tggcttgcca aaacatgcag gcaagtacta gctccaatta tcaagcaaac ccatgctgtc 2160 gactcccttc aaccttccca gatgcatcgc gtagtatccc cgagatcgat ttgcacaata 2220 gatcgaaatc acagaatcca accgaatatt ttgacgtccg gattkttaat ccggtcttat 2280 tccactgcga tgctcctttg cacgacaacg tggacgacga agagatgcaa catcctggca 2340 tcagctacca gatgccgcaa gtcaacgaaa ggtcgtgtag accaaggtac accatgttga 2400 cccctctgcg gaacgtctca agggacaatc gccctgatta tctccgtgag gtggtcaatg 2460 tggcttgcct aaacatgcag gcaagtacta gccgtaatta tcaatcgaac ccttgctgtc 2520 ggcttccctc aacgttccca gatgcatcga gtagtatccc cgagattgat ctgcacaata 2580 ggagaattaa ccccgccctt tctcacgacg atgcttcttc acatgacaac gtagacaatg 2640 aagagatgca accacgtagt atcaacttta acaacagcga aaacttctca tgcattggcc 2700 ccttttcatc cgatgacacc accaattcag accaaaggtg cattatcaac aacaggtcgt 2760 atggaccaag gcataccatg ttgacccctc tgcggaacgt ctcaagggac attagccctg 2820 attatctccg cgaggtggtt aatgtggctg gcctaaacat gcaggcaagt actagcttca 2880 atcatcgatc gaatccttgt tgtcgggtat cgtcaaactt cccagataca ccgagttcaa 2940 cagatatcgg acttcacctt cagaacacgg tcgaaatgga ctttcaacag gcacaaaaca 3000 acgactttgt tccagtgctc gacacggtgt cccccagtga accgccaacc aacaaccgta 3060 gagacctttc ttcatcaagc cacgcaaact catcatccac taataatcaa actagatgtc 3120 ctcgaattgc tgtacagtgg aatatcaatg gtctttacaa caatttaaca gacctgcaaa 3180 tcctcactca atgtaatcca ccgctgatac tagcattgca agaacttcac cgccacacwc 3240 cgtccgcatt aaataatctc ttaagagggg aatataactg gttaagtaaa tgcggatcaa 3300 ccagatacca aaatgtcgct ttggccattc atcgttcagt agcctacgat acaatcaatc 3360 tcaatacatc attgacagca gtagcggcca cggtgtacgt cccttttcgc ctcaccgtgg 3420 tgtcaattta ccttcaacat tcaggcatag aaaacctgga aagctcattg atacatctta 3480 tcgatcaatt agaaggaccc ctcctcatta tgggtgacat gaatagtcat cattacgcat 3540 ggggatcgaa aaaaacagat aggcgaggag aagcaatact acgcgtggcc gaagacaaaa 3600 accttatagt aatgaacgac ggaacaccga cattcatcag aggtgatcag caatcagcta 3660 tcgacgtaac aatggcctcc acctctatcc tcgaaaaact aagatggaaa gtacattgtg 3720 attcgatggg gagcgaccac tttcccattg aaatctacag tagtgatgcg cagatacaca 3780 cgaaacgacg gccgagatgg aaattcgatg aagctaactg ggatggcttt gaggaagctg 3840 tgctatcttg catggaacca gctaagcagt acagcataga agacctgacg gacattatac 3900 tactagcagc aaaatcaaat ataccacgca ccgggcaatt tcctgggaag agatcagttc 3960 actggtggaa tgctgaaacc aaggccgcag tgaaagtacg acgaaaggcg ttgagagcgt 4020 ttaaacgcct cccagtcgat cacccgaata agcaacaagc catggagaat ttgaggattc 4080 agcgaagtaa ttgccgaagg atcgtaaaag aggcaaaaat aaaatcatgg gaagagttcc 4140 tcaacggtat taattccacc cacacgacaa ccgaaatctg gcgcagggta aacgccatca 4200 gtgggaaacg gcgaacaaac ggactatgta ttcggaacga tggggaagtt actcgtgatc 4260 cagttaaggt ggcaaatacc ttggctaagt acttcgaaga attatcatcg agagctgcct 4320 acagtgactc ttttgcttca aaaaatccag caactgcact ttcaaatgtg accatcccag 4380 aggatccaga agagtctgaa tataatcgac tattctgtat agaagagctg tactacgcat 4440 tagaagcaag taatggtaaa tcatgtggac tagacgaggt aggctatccc atgattaaaa 4500 gattaccata taggacgaaa attgctctgt tagacgccat caataaggtt tggaccacag 4560 gcacaattcc tgaaccatgg agcagaagct tagtagttcc aattcccaag aacagtgcag 4620 catccacaga tgccaaaaac tacagaccaa tctctctcac aagttgtgtc tgtaaaatta 4680 tggaaagaat ggtgaatcgg cgactcacaa cttacttaac cgaaaatcag aaactcgata 4740 accgtcaawa tgccttccag cgcggacgtg gaaccggtac gtaccatgca gttcttggcc 4800 aagtattaga tgatgcactc aaagcaaatc ttcatgtaga cttcgcagct ttagatttgt 4860 ctaaggctta taatagagtg tggactccag gggtattaca acaactcatt aaatggggta 4920 tcactggcaa catggcaaaa ttcatcaaag gtttcctttc aaacagaact ttccaggtga 4980 ttgttggaaa tgataaatcg gaggtcatga aagaagagac tggtgtacct cagggatcgg 5040 tgttggcagt atcccttttc ttggtagcta tgaatagtgt ttttgatcgg ctaccgaaag 5100 gaatatttgt gtttgtatat gcagacgatg tactcttggt agctgttggc gcacacccaa 5160 gagccttgag aagaaaacta ctggcaggag tacgctctgt tgcaaaatgg gcagataacg 5220 ttggctttcg gatggcaccg gaaaaatgcg ctattaccca ttgttgtaac catccacatc 5280 acccgtggag attgccactg accgttggga acatggatat accgtataaa aaagtcttac 5340 gtgttcttgg tgtaacgata gacaggaaac ttactttctc ggtacatttt gcaagcatca 5400 aaaaggagtc cgaaaaccga atgcgtctaa ttcgagcaat tagcagtcga catcaaactt 5460 caaatagaag aacgatcttg caaatttcca atgccctcat aacaagtcgt ttactctatg 5520 gaattgaaat aacatgccgt gcttccgaag ttctcatcaa tactctagca cccacttaca 5580 accgctttgt caggttcgcc tctggattac tcccaagttc tccaacattg gcttcatkta 5640 acgaagctgg tgtactcccg tttgaattca ttgttgcaaa atcggtggca gatagagcaa 5700 ttaacttcat tgaaaaaact tacggaggca acgaggaggt cttcctcctg acagaggcag 5760 agaaccttct tcagaaatac gccaacgaaa cactcccacc gatagcaggg ctccatcgag 5820 tgggagacag acgttgggat acaactgctc caagaataga ttgggaaatg aaacagcatg 5880 ttagagctaa agacccacct tggaaagtct cttcgtattt caacgcgctg ttacaaagga 5940 aatataagaa ccacaaacat atctatacgg atggttctcg tgctaatggt gaagtaggta 6000 taggagtgta cagtcgttca tcaaaggtag cagccagatt gcctaactgc tgctctatat 6060 tctctgctga agcagctgct atacttttag ckcttagaaa cgtacgagag gaaccggtag 6120 taatctttac tgattccgct agtgttctgt cagccctaga tagtgggaaa tctagtcacc 6180 cctgggtaca agaaatcgaa gagagaacaa ccgcagctac aactctatgt tgggtaccag 6240 ggcactgtgg ggtgcatggt aacgaagaag cagatcatct tgcgtcaaca ggaagaacga 6300 tgaaccaaag aatcatgcct tccccgggag ttgacattaa aagaaaactg aaggagcacc 6360 tgattttggc gtggacaaga aaatggcata cggaacgcaa cctgtttcta cgtaaaataa 6420 gggattcgac cgaaaaatgg aatgacagat tgaatcaaaa ggaacaacga attatttcgc 6480 gtctacgagt tggtcacacc cgtttgaccc atcctcacta cttctccaat gaacagcaac 6540 cacgatgcga aatatgctca gtacgtctta cggtggaaca tatactggct caatgtcctc 6600 aataccaacc aggcagagat gctttagagc tgaagtactc aattcgggat atcttaagca 6660 acgatcgaga agaagaagag aaattgttac tgtttctaaa acaaattgga ttgtatgaca 6720 aaatttgatt ttctcttttt tttttgcaga ggtgaaccga ctttgaagtt gaaaacctct 6780 ttaataaaga taataataat aataa 6805 // ID Ci000025 repbase; DNA; INV; 255 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE DNA transposon from Ciona savignyi. XX KW DNA transposon; Transposable Element; Ci000025. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-255 RA Smit A.F.; RT "Ci000025 - DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC 9 bp target site duplications like members of the MuDR CC transposon class. XX SQ Sequence 255 BP; 72 A; 68 C; 50 G; 65 T; 0 other; gggtgcctct aaatactaca ggaccgcttg gagcgcttgg atcgcttgga tcacttagat 60 tgctatggaa agtaattcta accctgtccc aaacactaat ctcaacccca gaactaactt 120 taaccccaac cctaacatta atatcatggc ttttgtaatt acgaagtata gtaagcgatc 180 caagcgctcc aagcgattta agcgctccat gcgctccaag cggtccaagc gctcctgtag 240 tatttagagg taccg 255 // ID Gypsy5-LTR_Dya repbase; DNA; INV; 1854 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dya; KW Gypsy5-I_Dya; Gypsy5-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-1854 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1055-1055 (2009). XX DR Genome; chrU; Positions 1910961 1912814. XX SQ Sequence 1854 BP; 612 A; 344 C; 341 G; 557 T; 0 other; tgtagtgacc aactcaaact ttacactgca acaactcttc aagacaccct cgccctaaat 60 tatagcagct tttgataacc gaaaaagtga gagaattgtc ttagaccaat cagaacagaa 120 tatgatcgta cgattgtaca gctagggaat ggaaagccta tagcagtggc caccgacatg 180 cagcttccca gagcttaatt cattatcaga taaaacctct tggcgaacgc tcataatttt 240 tgtcgctgcc caataaaaca tcatatacac gtggagatat aatttaatta tatatatcca 300 atacaataaa aacaagagtg ggatattcaa ataaatttcc gtgtgtgatt tgtgtgaaag 360 tggaaaaggt aaaaagtgcg atatagtgag ttcccatttc taacccagtt tttataggga 420 ttcgacccca aggcgtaagg aaggccaagg cctatataag ggcataaatt tgctctacag 480 ccgacttcct ccccccgtct cgtcgacccc cgcatcagct cattatttaa acccagttcg 540 cttcaagaag aggccaaagg tcgacataac cttcatccca aaagttacgc aaccggccac 600 ctgccgaatg gtaactggcc aatatattgg aaggctagag ctcatccaga ttattcagct 660 gtgcagaagt ttagacgggc ggatcggatt taattatatt tggacatcat caacgtacgc 720 agttaggatc accatatgac cagcggaggt ctaacttagt ggttccgtct tatcgcagcg 780 cagcgtagct gccagccgtc ttatcgctgc agcgctgcag ttgcatttcg gtgttttgtt 840 ttgattcttg attttctttt gtattgcaat taataatgaa ttaaatactc gcagattgac 900 ggaacttttg gcatttaaat aggaatttaa attagcacct ttaaaatttg acctacagct 960 cgacaatgtc aacgtgaact tgggcccaac tttacgcaca tcatggagtg aatatttaag 1020 gtcgtgagta ctaaaaaaca acaatgataa aaaccaattt tatgaaatac ctatattaaa 1080 attctacaaa atgccactcg aacgtctaat taaaataaaa tctgcatttg gctaaagatt 1140 atataatata tttactaatt ttataaccat ttcttatatc tatttactta tactttactt 1200 tatatactta tttttactca ctaaatataa acgaaaatga agagttagag aatgactagg 1260 ataattatac gtaggttaaa atttggtagt ttgtaaaaga gatttaatgg aacagcttac 1320 tgaaatgaat ttatataaat ggaattatga ttgaaaatat attggaaaag aatgaattgt 1380 gaataaatag atttaatggt ttgtcgagta cttatgttga acactttctt aaatatgtgg 1440 tagagttgat ttgaacttgg ccaagggaca atgtaaatat tttaagtagc attgtcatgg 1500 tagggagggt aaagacttag ggaattatta acatccaaat tcccaatact tatcagaatc 1560 tgtagcttat cttcacacgt gtcggcgtgt atccgtaata tcaatataaa gccatttctt 1620 ttattaatca attgtccagt tttgaatatt gttttgaata agacgcactt attaaatgta 1680 gtagcgtttc gtgctatcct aagctaaaag aaccccgacc tacggcgagc tagacccgag 1740 catagcaaga gtgcgcatcc ctagcgaagg aagtacaagg tcctctttat tagtagttgt 1800 cacccactat taaaagacca gctagttaaa atctgtcgaa ctacgtacac taca 1854 // ID piggyBac-N6_BF repbase; DNA; INV; 1789 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-N6_NV DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSDs; piggyBac-N6_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1789 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1789 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-1789 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-N6_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX SQ Sequence 1789 BP; 507 A; 333 C; 429 G; 518 T; 2 other; ccctagtcct gcggcgcgct acaattgtgt tgttgcacat gcatgttcta tttgcggcgc 60 aacactattg tagtttgggg ggtatgtcca ggatacgtac gtaacactcg cgcgcggtas 120 tggcgggaac scaactacgt atcgggtgtt atgggtcaag ggtcaggggt caacgaactg 180 ttgcgcaacc ggaagtacgt catcccagaa gcccttgcga ggttaattaa tattcatgag 240 cgtcatgtcg tctgctatcc tggcggaagt gttttgtttt gctgttttta gctgttttcg 300 gccgaaaatg tcggtatcta acgtaaaaat gccgcgattt atagtattgg agattcacac 360 cgaactcttt atggaaggat tctgatccca ttttaccgga ttcgggcaga gattccaagg 420 aaagcagcga aactatgtga cgacgacatg tgaggaagtt aggcgtgcta ttgaggcatg 480 gggcccagaa aagcgaatcg tgggacggat ggaacagact ctgaccgtgg ctggcatggc 540 tgtggtcggg cgaaaggccg tgtacgggac cggggaggcc aggaaccggg tcgggtatgc 600 agtcggggcc gaggggaggg gggcagttgt gcgaggccgg acgctgaccg cgccggggcc 660 cctgaacgag tgtgagtgat gacgtaggcc cctcaggaag tggcaatata ttttcttacc 720 tcgtttgttt cgttgccttg tccttggaac tcctcaggaa ctttgtactt ttgccacaag 780 ttatcttact ggtatggtag agcttagtat gagagattgt ggacaaaatc ttccttttat 840 tgctttttat tacttaatga tcaataaaat aggtggaaac aacttaacat agactctcaa 900 gcaaaatagt gactgtaagc tttaatttga tgtacagcaa gcttatattc cacttgccag 960 acagtgtcag aacaaagttt attcaatact gcaagaatag tgactatact cacttcagca 1020 ccaaggacaa tattcaatca ttaacttacc tgtagcaata cctgttgatt ttctcacctc 1080 ctttgtatcg ttgtcatgtc cccaggactc tttaggaact ttgtactgat gtcaaaagtt 1140 tttttactgg tgttctagag ataattgtga aagattctgg actaatctac attttattgc 1200 tttttatgac tttattcata aatagaattg gtgcacataa cttaacttag actctcatga 1260 aaagtagaga ctgcaagctt taattggtac agcaaactta tattccactt gccagacagt 1320 gtcagagtga agtttattca acactaaagg aatagtgatt agacagaaaa tgcaaatcag 1380 caccaaggac aatatccaat cattaactca cctgtagcaa tacctgcatg attttttttt 1440 ctcacatggg caaggaaatg ttgtcagaag tctttagata gattgtagtc aagaaattca 1500 cgcttaaatt gtaggatatg aaaggagaat acaaaagtta aatgaaaaag tgtgttttat 1560 tgttttattg ccactgtaaa cccttatgac tgataaaatt gtagatatat gcaacatttc 1620 ttgaaagtac acattgttag ctttacaatg atatatacct tcatggggtt attgtaaagc 1680 aaagtaacta aaaataagga aaatcaaact agtacccttt gtttcaaagg ggtctgggaa 1740 ccccagcagt gggggcgagt tttggtatag aaagccagca ggacaaggg 1789 // ID BEL-89_AA-LTR repbase; DNA; INV; 364 BP. XX AC supercont1.336; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-89_AA_; KW BEL-89_AA-I; BEL-89_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-364 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.336; Positions 1049652 1050015. XX SQ Sequence 364 BP; 119 A; 92 C; 84 G; 69 T; 0 other; tgttccaaac gaaaagcgtt cagtgtagca gcgccgactc gcatcgccca atgctgaacg 60 caacgatgtg cgtttggaaa ggaaggagat aaacgcgcga aaacgacgac agaacggaag 120 aaggaaagcg agggaaaagc gaatgcgaac gccattatta gttcacccgt ctaccgaaca 180 ggcaacaccg cgacgcgttc gaacaagttt ttcaataaag ttaagtttaa tgtgattaat 240 tccgactccg atttatttcc gaccgcgatc cgaaaggaaa cccgaaattc cgcccgatta 300 gaaccgtact ctggaagtcc cgaaaataca gtccaccgaa acacgatttt ttcccgaccg 360 aaca 364 // ID Gypsy3_MH-LTR repbase; DNA; INV; 178 BP. XX AC ABLG01001411; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_MH; KW Gypsy3_MH-I; Gypsy3_MH-LTR. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-178 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1523-1523 (2009). XX DR Genome; ABLG01001411; Positions 7533 7356. XX SQ Sequence 178 BP; 44 A; 31 C; 41 G; 62 T; 0 other; tgttgggaag ggctttgtat aaaagccctt agtaaagttt agtataaaag cgattagtgt 60 taaccagcca ggagtcagtt agtttagtgc aacctacctt catcatcgcc tttgtatttc 120 ttgtgtggat aaaccttttg ttggagtgtc tcgtctacgt tcaagtgtgg tctcaaca 178 // ID BEL-47_CQ-LTR repbase; DNA; INV; 490 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-47_CQ_; KW BEL-47_CQ-I; BEL-47_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-490 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 248-248 (2011). XX DR [2] (Consensus) XX SQ Sequence 490 BP; 129 A; 149 C; 123 G; 89 T; 0 other; tgttccgtcc ggaatgagtt ttgcgtgcac acgagcgttc ctgaagggag cgcaaaccaa 60 aacaaaccac acacgcaaag tggagggaaa acggcgcgtg caacgtcgag gaaaagtacc 120 gctagttttg ctccacacca cgacgtcagc agtcagtagc attcttgctc cacctcccgt 180 ccgagaagga ggcgtacgac acgcgaggac aagtttcgcc caacctcgac gcgaaccccg 240 aaccggcttc cgaggaagta aacccgcgac cgcgcaacct ttgtagatcg acccagtaga 300 gaaatacaga taagttaagt agtgtttaga agagttgccc tgctttcgcc taagtttgat 360 ccgaccgcaa atccgatttc cgcctccgaa aggtggttcc tccgttgtcc accaagggtg 420 accgcacccg agagtccgcc gtcgccaaaa atacagtcca cacgctccgt agttcggtcc 480 cgaccgaaca 490 // ID hAT-67_HM repbase; DNA; INV; 3223 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-67_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3223 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2055-2055 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 484..3039 FT /product="hAT-67_HM_1p" FT /translation="MSHKISNSRKKLSGYEYRKRKSVKDQKFEKIKKFMSI FT EKYVAYKKPDEGREDFNPDASTISTTMSNKIGEPSTILKNSCITSLQIEVD FT NEPVILKESNDLSEDFLQETPNLLDCGNWPIVRSSNFVDHLIKLGPFQITK FT EKYPEDRNGRHFSSIYYNRKLANGETFCRRWLVYSDSKNSVFCFCCLLFDN FT NSKSNLVSDGFKKWRHLTETLGIHENSVSHMKCYQQWVETEIRLKTGKTID FT NEEIKIIEKDSLRWQNVLLRLMNITLYLAENNMAFRGSSDKLFTPQNGKFL FT GLIQMLAKFDPVMQKHLALAIKGDASDHYCGKNIQNELIDLMSQKVNDEII FT NRVLKAVYYSIIADCTPDISRKEQLSLTIRIVDLSLDIRVEIKEYFLGFFS FT VSDSTGLGLTEVLIELLTKHGLEISNCRGQGYDNGSNMKGKINGVQKRILN FT LNPLALYVPCGNHSLNLVISDSARSSVKSIAFFGILQRLFTLFSASVSRWK FT ILIDHVKILHLKKLCDTRWEAKISSVKAVRYQVGDVHDALIALSEIEGCNP FT ETAHEAITLGEQLKDFSFLVSLIVWYDVLFQVNIVSKTLQEKDMDITQCAK FT LLKSCCSFLENYRKCGFKDAIIKAKDLAIELQVEPVFKPLKRIIRVKRHFD FT EPAQDGIYLFSPEKKLEIDFFNPLLDTSLISMRERFIQLENYSEVWGFLYN FT VDSLQKREEILKSCLALQSKLTVNLKSDINGILLCDELMSIKPFLSEMVKD FT KITPIIVLNFIKQHKMQDLYPNTWIAMRILLTIPVTVASGERSFSKMKLIK FT TYLRSTMSQDRLSSLGTLSIEKNIAENLDFSTLIKDFGDKKARKINLKN*" XX SQ Sequence 3223 BP; 1125 A; 451 C; 569 G; 1078 T; 0 other; cagcgccggc ttttaggagg ggggcgaggt cgtgcgttgc acgagggcgg caagtaaata 60 ggggcggaga attttattag tcacaagttt taattttcca gtagtatatt aaaaatttaa 120 agtaaattaa aagatttaac gacctttgca aaaaaaaaac acgattttaa atttgaagac 180 aaaaaattac tacatttact taattaacgc acgattttct attgtaattt gtagtatcat 240 ttgtgcgtta tttcccctag gagtcatttc gaaaagttgc aaaaaatttt ttataaagaa 300 gttcaaaagt ttttattttt tgttattaaa tgttaagttg actcgattta aacaattttc 360 agtaagttta gttacttttt ttttatttct ttaataattt tatgaatgtg ttttgagaac 420 tttgttaaat tttttttttt ttaaattgta gaaataattt tatcttaaaa aatattctgt 480 ctcatgtcac ataaaatttc aaattcgaga aaaaagttgt ctggttatga gtacagaaaa 540 agaaagtccg ttaaagatca aaaatttgaa aaaattaaaa aattcatgag cattgaaaaa 600 tatgttgctt ataaaaaacc tgacgaaggt cgtgaagatt ttaaccctga tgcttcaact 660 atctctacga ctatgtccaa taaaattgga gaaccttcaa caatactcaa aaatagttgt 720 ataacttcat tgcaaattga agtagataat gaaccagtta ttttaaaaga atcgaatgat 780 ttatcggaag attttttaca agaaacacct aatttattgg attgtggaaa ttggccaatt 840 gttcgatcta gcaactttgt ggaccattta atcaaacttg gtccttttca aattacaaag 900 gaaaaatacc ctgaagatag aaatggtcga catttttcta gcatttatta caatagaaag 960 cttgcgaatg gagaaacctt ttgccgtaga tggttggttt attcagattc taaaaactcc 1020 gtattttgtt tttgctgtct actttttgat aataattcaa agtcaaactt agtctctgat 1080 ggtttcaaga aatggagaca tttaactgaa acattgggaa ttcatgaaaa cagcgtctct 1140 catatgaaat gttatcaaca gtgggtagaa actgaaataa gattaaaaac tggaaaaact 1200 atagataatg aagaaattaa aattatagaa aaagacagtt tgcgatggca aaatgttctt 1260 cttcggctga tgaatatcac tctttaccta gctgaaaata atatggcttt tcgagggagt 1320 tcggataaac tgtttacgcc tcaaaacggc aaattcttag gactaataca gatgcttgca 1380 aaatttgatc ctgtgatgca aaagcattta gcacttgcaa taaaaggtga cgcttcagac 1440 cattattgcg ggaaaaatat tcaaaatgag ttaatagatt taatgtctca aaaggttaat 1500 gatgaaataa taaaccgagt cttgaaagcc gtttactatt caatcattgc tgattgtaca 1560 cctgatattt ctagaaaaga acaactttct cttactattc gaattgttga cttgtcatta 1620 gacattagag tggaaattaa agagtacttt ttaggattct tttctgtttc tgattctaca 1680 ggcttaggac ttactgaggt tttaatcgag ttgctaacta agcatggact tgaaatctct 1740 aactgcagag gtcaagggta tgataatgga tcaaatatga aaggaaagat taatggtgta 1800 caaaaaagga ttttaaatct taatccttta gcattatatg ttccttgcgg caaccatagc 1860 ttaaatttgg taataagtga ctctgcacgg tcttcagtaa aatctatagc ttttttcggt 1920 attcttcaga gattgtttac tctattctca gcttcagtga gccgatggaa aattttaatc 1980 gatcatgtta aaattttgca tttaaagaaa ctctgtgaca cgcggtggga agcaaaaata 2040 agtagtgtta aagccgttcg ttatcaagtt ggtgacgtac atgatgcttt gatagcattg 2100 tcagaaattg aaggatgtaa tcctgaaact gcacatgaag cgataactct aggagagcaa 2160 ttaaaagatt ttagctttct tgtctcttta attgtctggt atgacgttct ttttcaagtc 2220 aatattgtta gtaaaactct tcaagaaaaa gacatggaca tcactcaatg tgcaaagtta 2280 ttgaaaagct gttgttcatt tttagaaaac tatagaaaat gtggttttaa agatgcaatt 2340 ataaaagcaa aggatttggc tatagaatta caagtggagc ctgtttttaa accacttaaa 2400 agaataatac gagtgaaacg ccattttgac gaaccagccc aagatgggat ttatttattt 2460 tctcctgaaa aaaaattaga aatcgatttc ttcaatcctc ttttagacac atctttaatt 2520 tcaatgagag aaagatttat acagttggag aattattccg aagtttgggg ttttttgtat 2580 aatgtcgata gtttgcaaaa aagagaagaa attctaaaat catgtttagc tcttcaaagt 2640 aaacttaccg taaatctaaa atcggacatt aatggtattc tcctttgcga tgaactgatg 2700 agcattaaac cttttctttc tgaaatggtc aaggataaaa ttactcctat tattgtttta 2760 aactttataa aacagcacaa aatgcaagat ttgtatccaa acacatggat tgcaatgcga 2820 attttgctaa caatacctgt cactgtagcg agtggagaaa gaagcttttc caaaatgaaa 2880 ctaataaaaa cctatctgcg gtcaacaatg tcgcaggata gactttcaag tttaggaacc 2940 ctatcgattg aaaagaatat tgctgaaaat cttgattttt caaccctaat caaagatttt 3000 ggtgataaga aggctcgtaa gattaattta aaaaattaaa tgtattttgt gtaaaacata 3060 aacaaaaaat gtttctttgt atgtagcgtt ttattgcaaa gtattatttt tgaattcgat 3120 ttgttattat tttccatgta atttaccagc acgtttataa aattttcagg ggcggcagga 3180 caaatttcgc acgagggcgg caaaaaggct aaggccggca ctg 3223 // ID DNA8-83_AP repbase; DNA; INV; 883 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-83_AP. XX NM DNA8-83_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-883 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2019-2019 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 883 BP; 280 A; 118 C; 157 G; 328 T; 0 other; cagtggcgta gccagggggg gggttgggtg tttaaccccc cccattgaca ttttccctcc 60 taataaatat attgtttata gggttcggga ctgcttgcaa gttataaatt tgtcttatga 120 aatggcgaaa ttaaatacaa agtttgaatt tgcatatttg tgtatttttt aacttaaatg 180 tacatacaat tttcgggaaa aattatattt ggataacatt gaaaaatatc ttaaaaaata 240 ttatgtttgg ttaggtacaa taggcgataa ctttgagatt tggggaggcc gataataaaa 300 ttgaataagc ttaaaaaaat taagtaaatg tttgggaggg gcggagggct gttaaagttg 360 ttggtggagc ttggctacag ttggccctat gttcgcatat gtaagatttc ataagtctct 420 ttgtgacaaa actgtgttta aactataatt gacatctaga ttgtagattg tacgaagaaa 480 tgaacatcag tagatatagg tacgtcgtgt tcgtttatca cggtgttcgc atatcacgtt 540 attcgttatt acttatttat taacctagta ttgtatgcca gtatttttaa aaattaagtt 600 gttatttatt tatttatcag gtttatatac ctacttattt tttagttttt tattaccaaa 660 attgcaatat gttttatgat ttttcaaaaa taatataaaa atgttgctac atagtacata 720 tatttaagca tattttgaga atttcgactg catatttcga taatttttag tgcaacaagt 780 caagtcctga accctaaatt tatttatata cgaaaactgt aaaaacttgg ttttaatcta 840 tacttaaaac accccccatt tcataatcct ggctacgcca ctg 883 // ID Gypsy-23-LTR_NVi repbase; DNA; INV; 1595 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-23-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1595 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 784-784 (2009). XX DR [1] (Consensus) XX SQ Sequence 1595 BP; 375 A; 339 C; 607 G; 264 T; 10 other; tgcgacgagg cgactcgtcg tgcgccgtgt ccttcggcgg gctctgtgtc gtgtgcgcga 60 tttgagctaa cgtgtgtgtt ttcttgtttt tctcgtgcga gtgttggtgc gaggaatcta 120 ggaagccggt ggacaaggag cgacgacaag gcttcaggga agcagcggat tttgagaaga 180 agaagagtgt gagagagttc gcggaacact tgtaactccg gctgcgtccg gagaattcgc 240 ggaaatctgt agcgagagtg tgcgggtacg tgtgccgagc aacgtagggg agagcgtaag 300 cggggaatcg agccgtttga ggaaggggcc gtatcgacgg ccactagccc gattccaggc 360 tagtcggggc gcgagttcag cgcggtcgtg gggactggcc gagtcgcgcg cgctgattgg 420 ccgggccgga gcggagagag cgcgcatcgc cgaagcggtc ggcgcggcgc ggcacttctt 480 gtttgactgt tgcccaagac gcacgtattg agcgacagtc gcacgagctt ttcgttggta 540 ttgcgaattc gagggtgtcg agatcgtgag tgtcgggata gcgcgttcgt gtgtgcaagg 600 agtcaaaaga gagccagtgt gagagagttg ggggagagtc aagcgaggcg aggcgagaag 660 aagacgttgc ttccaatcca gaccagccaa ggtgacgagg atcgtgagga gacgaccagc 720 gacgcagggc cagtccagga cagggaggtg cggaggaaga agacgaggcg aggaagaaga 780 cgagggcgag gaagaagacg ggagatcagc attcagacgg agccggaaga cagccggagg 840 acagcgtctt gtgagcgacc tgtaagtcrc agcttactca ggctgaggcc tgacctttgt 900 actgcgtgcc gcgacggtga ttcaagcgtg agtgagagag agagagagag agagagagaa 960 agagatctgc gttgagagag agagaamagc attgtcgaga aggcctaaat cgtgagcgtc 1020 acgacagcgc ggccggattg ccgtccagac aaagagccgc gtgggtgcgg ttcgaggaat 1080 cgcgcgcgag cattgggtcg agagaaggga gagtcgcagc gcggtcggag gggcgccgcg 1140 gaagcagcgg gggcctgcct tatcggcggg cagacgaaca ggccgtgagg gcgaagtgtg 1200 cgagggagag aaagagagag agagagagag agagtgtgtg agtgagcgct agarcgtgcg 1260 cgacgcgagc gcaagcgagc ggcgacgacg gtgaggtgta cggaggcaac aagcctcgtg 1320 agagtgcgag agcgcgcatg agtgtgtgtg tgmgagcgtg caagacgagt cggccgcgaa 1380 gccgacggcc gtagagtgtc agccgaatcc gagttaataa agaggtcgta aagagaattt 1440 agagagttgt gatttctskm cccttacaca ccgctgtaac ccgaacagcg gaatyctcgc 1500 ggaactycyt ccgagatcta cagaagaaat aatagtgtcg aggcttcggg acggggttga 1560 ggcgcgagtt ctcaccccag attggccacg tcaca 1595 // ID Kiri-2_AAe repbase; DNA; INV; 4389 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 06-JAN-2011 (Rel. 16.02, Last updated, Version 3) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; L2_Ele2; KW Kiri-2_AAe. XX NM Kiritsubo-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4389 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4389 RA Kojima K.K. and Jurka J.; RT "A distinct group of non-LTR retrotransposons from the yellow RT fever mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as L2_Ele2. CC [2] Consensus update and re-classification. This consensus is CC generated from 10 sequences with >96% identity, and ~99% CC identical to the original sequence in [1]. This family does not CC belong to the L2 clade and is renamed as Kiri. It could CC constitute a new clade with other Kiri elements. XX FH Key Location/Qualifiers FT CDS 267..1061 FT /product="Kiri-2_AAe_1p" FT /translation="MPDNDHTSRLRSNSLVKSSAKSTDADMTVAELANLMQ FT SQLATYQRTVKEDFKKLGDSLSSISTEMTKLREDIASDIEKLREENNKTFN FT SLSTTIDEAKRDTSLALDRSARMNDLLVSGVPFVVGEDLSNYFRTWCNSFG FT YAERDHPLVDIRRLSKGTPTAGTVYMILIQFAITVQRNEFYSRYLNSRSLS FT LSGIGFSVDKRIYINENLGPIVRNLRSKALQLKKAGKLRAVFTRSGVLFVR FT RIGEDKEIAVSTENDLCLLAKTLS" FT CDS 1311..4238 FT /product="Kiri-2_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MICMILFVSKNVIFNEVLFYHSPLLVKSFFYLMANNP FT NFYTSTNWCIPGTVIKSALIRNKLSVCCVNSQSICARKLCKIEELRAIVSV FT SKVDIVCVSETWLNENVNSSILSIDGYNIVRHDRVGRLGGGVLIYIKCDIP FT FNLVAKSDNQSGVAITEFLSIEVILQNEKLLLTAFYNPPEIDCTEPLEHLL FT ALYGNTYSYVYLLGDFNTNLSVSSPKSNRMRSILSTYSIQCLGSEPTFFHK FT NGSSQLDLILTNDVSKVLRFSQIEAPVLSNHDLLFASLDFEREVHPIKVEY FT RDYNAIRIDSLVNAFYSADWRSFYRSNDSNQQLEFFNALLTDLHNLYVPIR FT TTNSNKKFSPWFNNEIGKAIVDRDLLYKKWKNTKNANDLQAFKRARNSVNS FT VIRLAKQNYFSSRLDPNLPAKELWKNVKMIGINPPSMQTTNSFTANEINHM FT FCQNFTDNFSLDQSNVTNYVDREPFYFDRIDETDVVYAMNSIKSNALGLDM FT LPMKFLKAICPILIKPITHLFNSIIITSAFPDAWKKSKIIPIKKKSNLNSM FT DNLRPISILCALSKVFEKIIKSNISCYISRFNLLSQFQSGYRSKHSTKTAM FT LKIIDDIGVVVDKGRPVVLILLDFSKAFDTVSHAILCRKLKTKFSFSDHAA FT NLIESYLCSRSQAVFNNGVLSSFLPVTSGVPQGSILGPILFSLFINDLPCV FT VKNCQIHLFADDAQLYYKCFENTSTKISQDINDDLQRIDDWSRENKLTLNT FT RKTKALFISNCAFLNSLKPFLKLNNEVIEFVEHATSLGIRIESNLEWDSYI FT LSQCGKIYACLRTLSKCASFLPSRTKLKLFKSLIFPHFLACDFLLDSVSLL FT TFNKLKVALNTCIRYIFSLNRYSSVSHLHHLLIGCRLENFSKLRCCLFLHK FT LCSTKQPDYLYSKLQLARSTRYKKYILPRYHTTKYGNSFFVRGVVHWNSLP FT INLSNDKSYLAFKRGCEEHFK" XX SQ Sequence 4389 BP; 1367 A; 781 C; 787 G; 1454 T; 0 other; gtagaatgaa gtgcacgtga aactagccat cctggtgcaa acaacaaaac aagttgtatt 60 tccattaata aattcaccaa ttgctgatta aacttaccgt attagtgcat cgagtccact 120 gtgaatgcag ttttgaatcc gtaacttact aaacgctgat aaaaactcgt ttatttgcac 180 tctgtgggtt ttcgccattc gacattagag gcaaacaaag gatcactctt cgaacagtta 240 tcttgtgtga cgctgttata tccaccatgc cagacaacga tcatacatca cggctgcgga 300 gcaactcact cgtgaagtct agcgcgaaga gtactgatgc tgacatgact gttgcagaac 360 tcgccaacct gatgcagtct caactggcta cctaccaacg aacagtcaag gaggacttca 420 aaaagctcgg tgactctctt tcgagtattt ctaccgaaat gaccaagttg cgagaggata 480 tagcatcgga tatcgaaaag ctacgtgagg agaataataa aacgttcaat agtctatcta 540 cgacgatcga cgaagcgaaa agagatacat ctctggcgtt ggacagatct gctcgaatga 600 atgatctttt ggtgagcgga gtgccgtttg tagttggtga ggacctatcg aactactttc 660 gtacatggtg taactcattt ggttatgcag agagagacca tccacttgtc gacatccgac 720 gactctccaa aggaactcca acagctggaa ctgtttacat gatcctaata cagtttgcca 780 tcactgtgca acgtaacgaa ttctattctc gatatctcaa ctctcgttcg ttgtcgctct 840 ctgggatagg attctctgtt gacaagcgaa tttacatcaa tgaaaatttg gggcccatcg 900 tcagaaattt gcgatcgaag gcgctacaat tgaagaaggc tggaaaactg cgagcggttt 960 ttacacggag tggcgtgcta tttgtacgga gaatcggaga ggataaggag atcgccgtgt 1020 ctacagagaa tgatctatgt ttattagcca aaaccctttc ctaataagct tttctttatc 1080 cttccgctcc ttcccataca tcccgtgatt ccttatccta aaagttttgt tcgagatgat 1140 gctgctgctg ttgctgtgat gttccctgct cttgatgatg cgattgctgc tgttgttagt 1200 ctacatttgg tggacactga ataaatgctc tctaatgttt gccgttctgg ttatgtagtt 1260 ttaactatta gttacttgtt tttttttgtt tgcgccatgt atttgtttga atgatttgta 1320 tgatattatt tgtttcgaaa aatgtaattt ttaatgaagt gcttttttat cattctccct 1380 tattagttaa gtcctttttt tacctaatgg ctaataatcc aaatttctac acctcgacaa 1440 actggtgcat tccaggtact gtgataaaat ctgctttaat tagaaataag ctttctgtgt 1500 gctgtgtaaa tagtcaaagt atttgtgctc gtaagctttg taagattgag gaattgcgtg 1560 ccatagttag tgtcagtaaa gttgatattg tatgtgtaag tgaaacatgg ttgaatgaaa 1620 acgttaatag ttctatttta tcaattgacg gatataacat agttagacat gatagagttg 1680 gtcgtctagg aggtggagtg ttaatttata ttaagtgtga tatacctttt aaccttgtag 1740 ccaaatccga taatcaaagt ggtgttgcta ttacagaatt tttatccatt gaagtaattc 1800 tacaaaatga aaaactatta cttactgctt tttacaaccc acctgaaatt gattgcactg 1860 aaccacttga acacttgcta gcattatatg gtaatacata tagttacgtt tacctcctag 1920 gtgatttcaa cactaatctt tctgtaagtt cacctaaatc taataggatg cgaagtattt 1980 tatctaccta ttcaatacaa tgcctaggaa gtgagcccac gttctttcat aaaaatggat 2040 catctcaatt ggatttaata cttactaatg atgtgtctaa agttttaaga tttagtcaaa 2100 tagaagcacc tgtgctatca aaccatgatc ttctttttgc ttctcttgat tttgaacgag 2160 aagtgcatcc aattaaagtt gagtatcgcg attacaatgc tattcgaatt gattcactag 2220 tgaatgcatt ttattcagct gattggagaa gtttctatcg ctcaaatgat tccaatcaac 2280 aattagaatt ttttaatgct ctattaacgg atctacataa cctgtatgtc cctatacgaa 2340 ctactaactc caacaaaaaa ttcagtccgt ggttcaataa tgaaattggc aaagctattg 2400 tagatcgaga tttgttgtac aaaaaatgga agaatactaa gaatgccaat gatttacaag 2460 cgttcaaaag agctcggaac agtgtcaaca gtgttatacg tttagctaaa caaaattatt 2520 ttagttccag gcttgatcca aatcttccag caaaagagct atggaagaat gtaaaaatga 2580 tagggataaa tccgcctagt atgcaaacaa caaattcttt tactgctaat gaaatcaatc 2640 atatgttttg ccaaaatttc acagacaatt tctcattgga tcaatcaaat gtaacaaatt 2700 atgttgatcg tgaaccattt tacttcgata gaatagatga aactgatgtc gtgtatgcta 2760 tgaactcaat aaaatctaat gctcttggct tagatatgtt accaatgaag tttttgaagg 2820 ctatttgtcc gatattaatt aaaccgatca ctcacttgtt caattctatc attatcacga 2880 gtgcttttcc tgatgcatgg aaaaagtcaa aaattatacc aatcaagaaa aaatctaatt 2940 tgaattcaat ggataattta agacctataa gtatcctatg tgcactatca aaagtgtttg 3000 aaaaaataat caaatcaaat atttcttgtt atataagtag atttaatttg ttgagtcaat 3060 ttcaatcggg atatcgttcg aaacatagca caaaaactgc aatgctcaaa ataattgatg 3120 acattggtgt tgttgtggat aagggacgac ctgtcgtact tattcttcta gatttttcaa 3180 aggcattcga caccgtatca cacgccatat tgtgtcgtaa acttaaaaca aaattctctt 3240 tttcagatca tgcagcaaat ctaattgaat cttatctatg ttcgaggtcg caggctgtct 3300 ttaataatgg ggttctatca agctttctac ccgttacctc tggagtgccg cagggttcaa 3360 ttttaggtcc aattttattt tctttattta taaatgacct accttgtgtt gtgaaaaatt 3420 gccagattca tctttttgcg gatgatgctc aattatacta taaatgtttt gaaaatacat 3480 cgacaaaaat ttcgcaagat atcaatgatg atttgcaacg gattgatgat tggtctcgcg 3540 aaaacaagct aacattgaac actcggaaaa ctaaggctct cttcatttca aattgtgcct 3600 tcctgaactc gttgaagcca tttttgaaac tcaataatga agttatcgag ttcgttgaac 3660 atgctactag tttaggcatc cgaattgaat caaatttaga atgggatagt tatatcttgt 3720 cacaatgtgg taaaatttat gcctgtttaa gaacgttatc gaaatgtgca tcattcttgc 3780 caagcaggac aaaattaaaa ttgttcaaaa gcctaatttt tccccatttt ttagcttgtg 3840 attttctgct agattctgtc tcattgctta ccttcaataa attaaaagta gcattgaata 3900 cgtgtattcg ctatattttc agtttaaata gatattccag cgtttctcat ttacaccact 3960 tactcatagg atgtcgctta gaaaatttca gtaaactccg ttgttgtcta tttttacaca 4020 agctgtgtag cacaaaacaa ccagattatt tatacagcaa gctacaactt gcgagaagca 4080 ctcgttataa aaaatatatt ttaccacgat atcatacaac taagtacggg aattcatttt 4140 ttgtgcgagg tgtagtacat tggaattcat tgcctataaa cctttcgaat gataaatcat 4200 acttagcttt taagagagga tgtgaagagc acttcaaata attttaattg taaatgtatt 4260 agtagaaatt atttagttat tctcaataac tcttgcaaac ggcgctttct tctacagtaa 4320 actactttgt gtagcataaa aaagacatga gtcttacgtt acatatatta gagcgataaa 4380 taaataaat 4389 // ID Gypsy-111_AA-LTR repbase; DNA; INV; 209 BP. XX AC supercont1.213; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-111_AA_; KW Gypsy-111_AA-I; Gypsy-111_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.213; Positions 720625 720417. XX SQ Sequence 209 BP; 56 A; 51 C; 43 G; 59 T; 0 other; tgtagcagac gtaatttttg ctcgggctac caacagagct ccgacgtttt tgtagattaa 60 ctacggccga ctctgctagt tgcacaatca tttgcaccgc accatcattg taattcgtag 120 tgtcgaagag taaagacaat aaagcaatcg cagtaatttt gactaccgtt ctggtgtttt 180 ctggccattc ccgcaagcac gcctaaaca 209 // ID Copia7-NVi_I repbase; DNA; INV; 4228 BP. XX AC AAZX01002465; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia7-NVi; KW Copia7-NVi_I; Copia7-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4228 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1112-1112 (2007). XX DR Genome; AAZX01002465; Positions 22806 18579. XX CC Positions [1630-2160] - Integrase core CC 'GTTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1108..3870 FT /product="Copia7-NVi_I_1p" FT /translation="MCSSRLNMTNIKHSRIKSVTAANQTKVSVQGEGNVAL FT QYHGPTGERRVLLENVLLVPDLTVNLLSVSIITRRGGKVIFAGPTCQIYNS FT KGIPILQGQRTTNNVYKITFQPRSAQTDTLRSDFALSVSIPTDIQLWHRRM FT GHLNEAYLKQLREAAIGVNFKNDQLYRCQTCIAGKLIQKPFKINNKRASAI FT LELVHSDVCQMEELSIGRAKYFITFTDDYSRKTFVYFLKQKDEVPDITMKF FT IKYVEKQTDRKLKRFRTDNGGEYVNAKLKTALAELGVKHETSIAYQPQQNG FT RAERVNRVLLEKARCMLSEANLPYKFWAEAISTACYLSNRSPKRCLGGVIP FT EELWTGSRANLSHFRVFGCEARAYISSHQRKKLDPTSRPAIFIGYSEDQKG FT YRLWSAEDKKVFVTSNVEFLEKELESSIKPAYFPVEFSDTMESNVDKVLAQ FT EPINKENFRRNEESVTNENLRRNVEPVNKESFRSNEEPVNKGNFRRNEEPV FT NKENFRRNEKPVNKENFRTTEEPANQEDKIRNKTPKKREIVKEAPANDDPM FT ITRKRKIQSNQDEVIKKSAKLTDRSTDLQPPLSNQPRPQTRSVTKLDQPRD FT LEDSEDELITPSCKSQRRKRKPKHLDDYVVYSIVSEFGEPRTVEEALSGPE FT SHHWRQAMDEEYKSIIKNNTWVLCNLPHGESVIGSKWIFKKKPAADGTTKC FT KARLVAQGFSQLKGIHYDETYAPVVRYTSLRLLFVYAVKRGLNVYHLDVET FT AFLHGDMEETVYLEQPQGYVQKNQKHKVCKLNKSIYGLKQGSRNWNRKLDS FT TLKELKLIQSYQVTCIYSYFNKKKCIIVALFVDDLMVFTDSIDFLDILKTG FT LQKICTLKDLDPMKKCLGIRVNVDNSKGVIELDQTEYIESLLAQYGMSDCR FT STGTPLDLKGDYSSSQS" XX SQ Sequence 4228 BP; 1439 A; 845 C; 952 G; 992 T; 0 other; ggttatgggc ccaggcgcaa cgatgaagat ctgaaaagct ctagtcgcaa tataagtgcg 60 cattggcagg acagtgatac gcaaattttg gagcaccgac atcttgagag agagccgtaa 120 catggcgagc cgtacgagca gcggtgtaat tttatctatc gataaattgg tcggcaggac 180 aaactaccga tcctgggctg tagctatgag agcatattta gaagtagaag atttgtggaa 240 cacgattgag gctccacaag gtggtcaact atcaacagat gcgaagaaac tgcagaaggc 300 gcgaggcaga atcatactga cagttgaacc ggatatctac gcataccttg aagacgccaa 360 atcaccgaag gatgcttggg aggcactcgc caaaaccttc gatgatccgg gagcgaacag 420 caaggtcaat ctatttatgc agctttcaac gaccaggtat gaaaactgta agtctatgga 480 ggactatgta gcgcgcataa tagggacttc gcaaaagctc aatgctatcg gttcgaaaat 540 accgacagat ttgataggag ctttgatgtt ggccggatta ccggagtcat acaaaccgtt 600 aaggatggct cttactaatt cgggtcaaaa tttaaccgca gattttgtga aaacaaaact 660 tttagaagaa gctcaatgct ccaataacgg ccacgaaatt caaaacgaag cattccatgc 720 gcaagtgcac tcgacgcact tcgcgccgca gagtagtgcc aacagagggc gcaacgggta 780 tcataaacgc ttaggacgtc ggcgtaattc gaacgtgaga tgctataact gtaataagtt 840 cggtcatata gcaagtaagt gtactgctcc gataaaaaaa aacaaaatgc atgcacagcc 900 gcagcagacg aggagagcga ggacgaggac gtcgtgtgtg cgctcttagc ctcgatacga 960 tcaggcgata ctcacgcggc gaacgtcggc accgctcttg tcgactcgaa gaatacaatc 1020 tcggctggca ccgcactcct tgcagtgaat cagtcaaaac acacacaggc agacgagtgg 1080 attttggact ccggagcttc gaagcatatg tgctccagcc gactcaatat gacaaacata 1140 aagcattcca gaataaagag tgtaacagct gctaatcaaa ctaaagtgtc tgttcaagga 1200 gaaggaaatg tcgctttgca gtaccacggt ccaactggag aacgtcgggt gttgttggag 1260 aatgtgctgc tggtacccga cctcacagtt aatttgctgt cagtaagtat catcactaga 1320 agaggtggaa aggtgatttt tgctggcccc acatgccaga tatacaacag caaaggtatc 1380 ccaatactgc agggacagcg aaccacgaat aacgtctaca agataacctt tcagccgaga 1440 tcagctcaga ctgatacact caggtccgac tttgcactca gcgtttccat tcctactgat 1500 attcagctat ggcataggag aatgggtcac ctaaacgagg cctatttgaa gcagctgcga 1560 gaggctgcta ttggtgtcaa tttcaagaat gatcaactat acagatgtca aacctgcata 1620 gctgggaagc tgatacagaa gccattcaag atcaacaaca agcgtgcctc agccattctt 1680 gaactggttc atagtgatgt atgtcagatg gaagaattat caattggaag agctaagtat 1740 tttataacct tcacagatga ttacagccga aaaaccttcg tctatttcct gaaacagaag 1800 gatgaagtac cggacatcac tatgaaattc atcaagtatg ttgaaaaaca aactgaccgc 1860 aagttgaaaa gattcaggac agataatgga ggtgagtacg tcaatgccaa gctgaagact 1920 gcattagctg aacttggtgt gaagcatgaa acctctattg cctaccaacc acagcagaac 1980 ggtagagcag aaagggtaaa ccgtgttctt ctcgagaagg caagatgtat gctgagtgag 2040 gccaatctac cttacaagtt ttgggctgaa gctatttcaa ctgcctgtta tctatccaac 2100 agaagtccaa aaaggtgttt gggtggagtt attccagaag agctctggac cggatccaga 2160 gccaacttgt cacacttcag agtttttggt tgtgaagcac gagcatacat ttctagtcac 2220 cagagaaaga aacttgatcc cacttcaaga ccagcaatct tcataggtta cagtgaagat 2280 cagaaaggct atcggctatg gagtgcagag gacaaaaaag tgtttgtgac tagcaatgtg 2340 gaattcctcg agaaggagtt ggaatcgtca atcaaaccag catattttcc tgtagaattc 2400 tcagatacta tggagtcaaa cgtagataaa gtacttgctc aagaacctat aaataaagaa 2460 aattttagaa gaaacgaaga atctgtaact aatgaaaatc taagaagaaa cgtagaacct 2520 gtaaataaag aaagttttag aagtaatgaa gaacctgtaa ataaaggaaa ttttagaaga 2580 aatgaagaac ctgtaaataa agaaaatttt agaagaaacg aaaaacctgt aaataaagaa 2640 aattttagaa ctactgaaga acctgcaaat caagaagata agattagaaa taagacacct 2700 aagaaaaggg aaattgtaaa agaagcacct gcaaacgatg atccaatgat aacaagaaaa 2760 agaaaaatcc aatctaacca agatgaagtg ataaagaaat cagccaaact gacagataga 2820 tcaaccgacc tgcaaccacc attgagtaac caacctcgtc cacaaaccag atcagtcact 2880 aaactcgacc aaccaaggga tctggaagat tcagaagatg aactgatcac acccagttgc 2940 aaaagtcaac gtagaaaacg taaacctaaa cacctagacg actacgtggt ttattctata 3000 gtcagtgagt ttggtgaacc gagaacagtc gaggaggctt tatctggacc agagtctcat 3060 cactggagac aagccatgga cgaggagtac aaatctatca tcaaaaacaa cacatgggta 3120 ctgtgcaacc ttccacatgg tgagtcagtg attgggagta aatggatttt caagaagaag 3180 ccagctgcag atggaacaac taaatgtaaa gcaaggttag ttgctcaagg tttttcgcaa 3240 ttaaagggta ttcattacga tgagacatat gccccggtag tcaggtacac atcactcagg 3300 cttttatttg tttacgctgt gaagagagga ctcaacgtat accaccttga tgttgagaca 3360 gcatttctcc atggtgacat ggaagagact gtgtacttag aacaacctca aggttacgtt 3420 cagaaaaacc agaaacataa agtctgcaag ctgaacaaat ccatttatgg actaaagcaa 3480 ggcagtcgca attggaatcg caagctggac tccacgttga aggaattaaa actgatccaa 3540 tcataccaag tcacttgtat atatagttat tttaataaga aaaaatgtat tattgtagct 3600 ctctttgtgg atgacttgat ggtcttcacg gatagcatcg actttctgga catcctaaaa 3660 actggcctac agaagatatg cacactgaaa gaccttgacc ccatgaaaaa atgcttgggt 3720 attcgagtca acgtggacaa cagcaagggc gttattgaac ttgaccagac cgaatacatc 3780 gagtcattac ttgcccagta tggaatgtca gactgcagat ctactggtac cccgctggac 3840 cttaaaggtg actacagcag ctctcaatca tgaggctcag cattcaatcc tcatcaggaa 3900 ttcgaagttc agtgtcagta ctaagcacat caataggcga ctccactttg ttagcgaaag 3960 cattgatagt ggtgagatca gtgtaagctt cattccatct acgcacatga ttattgacgc 4020 cctaaccaag gctatcagtc aaaggaagtt gagggacttc gtcgagaaca tcggtctcaa 4080 gaatgaagag agaagacagt attaataaat aaaaaataat atattttgat actttgtatt 4140 tagagatttt aattgtcgac gaattttgta atgtaaattt tatctaagaa aatatcaatg 4200 ttctgttaaa taattaattg agggggca 4228 // ID LYDIA_LTR repbase; DNA; INV; 300 BP. XX AC AF177773; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 29-MAR-2007 (Rel. 12.04, Last updated, Version 2) XX DE LYDIA_LTR is a long terminal repeat from LYDIA, a gypsy-like DE endogenous retrovirus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; LYDIA; LYDIA_I; LYDIA_LTR. XX NM LYDIA_LTR. XX OS Lymantria dispar OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Noctuoidea; Lymantriidae; Lymantria. XX RN [1] RP 1-300 RA Pfeifer A.T., Ring M. and Grigliatti A.T.; RT "Identification and analysis of Lydia, a LTR retrotransposon from RT Lymantria dispar."; RL Unpublished. XX RN [2] RP 1-300 RA Pfeifer A.T., Ring M. and Grigliatti A.T.; RT "LYDIA_LTR."; RL Direct Submission to Genbank (16-AUG-1999)Zoology, University of RL British Columbia,. XX DR GenBank; AF177773; Positions 1 300. XX SQ Sequence 300 BP; 83 A; 57 C; 52 G; 108 T; 0 other; tgttatatat taatagtcca ggcagaccgc ggattcagca gcagttaata tgcattcatt 60 attgtactac atcagcaatt acgcaagcaa tgcttcgccg ccgccctgtc gccggaatca 120 gagctgacgc tgcagttgtc accgattgca tttacgtagt ttcataataa aatcgttcat 180 tccatattca tacttcgaat aggttagatt aattagatct atgtaatttt tatatttaat 240 taaataagtt tatagtaatt ttggtttttt ttcgatcctt tggctgccgc taacgtaatt 300 // ID Copia15-NVi_I repbase; DNA; INV; 4184 BP. XX AC AAZX01023481; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia15-NV; KW Copia15-NVi_LTR; internal portion; Copia15-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4184 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1157-1157 (2007). XX DR Genome; AAZX01023481; Positions 4855 672. XX CC Positions [1569-2078] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 93..4145 FT /product="Copia15-NV_I_1p" FT /translation="MSQEHSNYHFTKFDGTNYQVWKFQMRAVLTANSVYDI FT VLGTKKRVRSGETGYDQEKEEKWLKDNAKAMCILSSAMEINQLESCITCET FT ANEMWMKMTLIHEHKSATNRSTLLQKFHTCKMETSEPAIAFVTKIQNMART FT LEDLGEKISEAGVIAKILGSLPSKYSSLITAWDSVDSDKQNIDNLQERLIK FT EERRLSETEENETGAFTVSNVRHVNKASRETGNNQNTRREGYRNLSQVICH FT NCKKPGHIMRFCRRRRNTKGKQADTPKDKSDHVSAFVTTIDDNFSREKAAT FT HQEIERLMKSDSSEIWIGDSGASRHMTYRRDWFSSFTSEHGATIVLGNNKV FT CDVRGSGTIIIEKFLNGKWHRGRIENVLYIPELKKNLFSIGVCTALGYEVS FT FYDNVVKICLDGSVVAQGEKQNNEIYRMFFRSVAQCEAHLSASSLQQWHER FT AGHVNRRTLQEMIKNGVIEGVKISDVEDFVCEACQMGKAHILPFKKEVEHR FT QWKMGECIYSDVCGPMLTNSIKGARYFVLFKDDASSYRLVYFIKHKSDVFD FT KFKEVERLIYNKFNCRIKVLHTDNGREYCNGQMKKFLAQKGIRLETTAPYT FT PQQNGKVERENRTVVESARTMMLHRQVPKYLWAEAVNTAVYVLNMTATSKR FT PETTPYECWIGRKPDYSHLKVFGVPAYAHVPKMLRDKLDAKAKKSIFVGYQ FT DESRNYRLYDYKTRKVFVSRNVTFVENENCCANEYQQVQATLPLPSCDGEE FT MVENPVDNAYQRISSDSTEGADESTADVTQEGPTTMKKTVNKQTTRKKNDA FT VDPSIFSSCRQLRVRENIKKPERYEGNVTEYEPATYQEAIESPDATKWTQA FT IQEELLAHERNGTWTIVPRPQGRKVIDSRWIFKVQDPKAGKENRYKARLCA FT RGFRQEYGIDFNETFSPVIRYDALRVMLAIATHRDLEIVQFDVTTAFLHGK FT LEEEVYMKIPEGLQINSGKEEYVCRLNKSLYGLKQASRCWNETFKNFLFSL FT DFKCSDIDSSIFIAVKRNVKVFLLLFVDDGLVLTEDKDVLSEILDSLRNKF FT QITVSDVDNFVGLKIVRDRVNKTMFLHQAEYASKVLEKFNMTDANSVSTPM FT EKNAISSEVQQHSQQIEKLPYRELIGSLMFLTVVSRPDIAYAVNVLSRFLN FT NFNRTHWELAKRVLKYLKGSLHMGILYTSNESEPLLVGYCDSDFAGDQLTR FT KSTSGFVFKMCGGPMTWNVGRQKDVALSTTEAEYVAASLASREAIWLRRLL FT SGVGCPCASATVLNVDNQSAIKLVKNPVMHKRTKHIDIRCHFIREKYERGE FT IDVKYVSTECQLADVFTKALTRNCHERMCKGIGLVHLSA" XX SQ Sequence 4184 BP; 1412 A; 743 C; 986 G; 1043 T; 0 other; ttaaaaggtt atgggcccag gctcgcgacg agaatagacg gagaaatacg tggcaagtaa 60 agttttgtga agtaatttat caagaaatca agatgtctca agaacactcc aattatcatt 120 ttaccaagtt cgatggtaca aactatcagg tttggaaatt ccaaatgcgc gcagtgctca 180 cagcaaatag cgtatacgac attgtgctcg ggacgaagaa aagagtgcgc agcggcgaaa 240 ctggctacga tcaagaaaag gaagagaaat ggttaaaaga caacgccaaa gccatgtgca 300 tactctcgtc tgcgatggaa ataaatcagt tagagtcctg tattacatgc gaaactgcca 360 acgagatgtg gatgaaaatg actctcatac acgagcacaa gtcggcaaca aatagatcga 420 cgctattgca aaaatttcat acctgcaaaa tggaaactag tgaaccggct atagcattcg 480 ttacaaaaat tcaaaatatg gcaaggacgc tagaagactt gggtgagaaa atttcagaag 540 ccggcgtaat cgcaaaaatt ctcggaagct tgccgtcgaa atacagcagt cttattacag 600 cttgggacag tgttgatagc gacaagcaaa atattgataa ccttcaagag aggctaataa 660 aggaagagag acggctaagc gagacggaag agaacgaaac aggcgcattt acggtttcga 720 atgtgcgaca tgtaaacaag gcaagtagag agactggcaa taatcaaaac acaagacgcg 780 agggctatcg aaatttatcg caggtaatat gccacaattg taagaagcca ggccacatca 840 tgagattttg tcgtcgtaga agaaacacga aaggtaaaca agccgatact ccaaaagaca 900 aatccgatca tgtaagtgct ttcgttacca ctatcgatga taatttctcg cgtgaaaaag 960 ccgctacaca ccaagaaatt gaacggctta tgaaatcgga ctcgtcggag atctggattg 1020 gcgatagtgg cgcatcgcga catatgactt acaggcgtga ttggttttct tctttcacct 1080 cagaacatgg tgccacaatt gtacttggaa ataacaaggt gtgcgacgta agaggctccg 1140 gtacaatcat tattgagaaa tttttaaacg gaaaatggca tcgcggtcga attgaaaatg 1200 tattatacat tccggaatta aagaaaaatt tattttccat tggtgtatgt acggcccttg 1260 gttacgaggt atccttttat gacaacgttg tgaaaatatg tttggacggc tccgtggtcg 1320 cgcaaggaga aaaacagaat aatgaaattt atcgaatgtt ttttagatct gttgctcaat 1380 gtgaagctca tctctcggcc agcagccttc aacaatggca tgaacgagca ggccatgtga 1440 atagacgaac tcttcaagag atgataaaaa atggcgtcat tgaaggagtc aagataagcg 1500 acgtagaaga ttttgtatgt gaagcctgcc agatgggtaa agcccatatt ctacccttca 1560 agaaggaagt tgagcatcga cagtggaaaa tgggagagtg catctacagc gatgtatgcg 1620 ggccgatgtt gacaaactcc ataaagggtg ctaggtattt tgtgctattt aaagatgatg 1680 cgtccagtta caggcttgtg tattttatta agcataagtc agacgttttt gataagttta 1740 aagaagtaga acgtcttatt tataataaat tcaactgtcg tatcaaggtg cttcatacag 1800 acaatggacg agaatattgt aacggacaaa tgaagaaatt tctagcgcaa aaaggcatca 1860 ggctggaaac gactgcgcca tacacacccc agcaaaatgg aaaagttgag agagaaaata 1920 gaaccgtggt cgagagtgct cgaaccatga tgctgcatcg ccaggtaccg aagtatttat 1980 gggctgaggc tgtaaataca gcagtatacg ttctcaacat gacagccacg agtaaaaggc 2040 cagagacaac gccctatgag tgctggatag gcaggaagcc tgattattcg catctcaaag 2100 tttttggtgt gccggcttat gcacatgtgc caaagatgct tcgagataag ctggatgcga 2160 aggcaaagaa aagcattttc gttggttatc aggacgaatc acggaattat agattgtacg 2220 attataaaac aagaaaggta tttgtttccc ggaatgttac atttgttgaa aatgaaaatt 2280 gttgtgccaa tgaatatcaa caggttcaag ccactttgcc attgccttca tgcgacgggg 2340 aagaaatggt tgagaatcct gttgacaatg catatcaaag gatttccagc gattcaacag 2400 aaggtgctga tgaatctact gcggacgtaa cccaagaagg accaacaact atgaagaaaa 2460 ctgtcaacaa acagactacg agaaagaaaa acgacgcagt ggatccttca attttctcct 2520 catgtcgcca actgagagta agagaaaata tcaagaaacc cgaacggtat gaaggaaatg 2580 tcacagagta tgaaccagca acgtaccagg aagcgataga atctccagat gctacaaaat 2640 ggactcaggc tattcaagaa gaattgctcg cccacgaaag aaatggtacg tggactattg 2700 tacctcgacc gcaaggacga aaagtcatag attcgaggtg gatatttaaa gtacaagatc 2760 cgaaagctgg aaaagaaaat cggtacaagg cacgtctatg tgcgagaggt ttcagacaag 2820 agtatggcat cgattttaac gaaaccttct ctccagtaat ccgttacgat gcactccgag 2880 ttatgcttgc gatagccact catcgggatc ttgaaattgt acagtttgat gtgacaacgg 2940 catttctgca tggcaagtta gaagaagagg tgtacatgaa gattcctgag ggactacaaa 3000 tcaacagcgg taaagaggaa tatgtgtgcc ggctcaataa gtccctatac ggcctgaaac 3060 aagcgtccag atgctggaac gagactttta agaatttttt gtttagttta gattttaaat 3120 gtagtgatat agatagctcg atctttatag cggtaaagag aaatgtaaaa gtgtttttgt 3180 tgttatttgt agacgacggt ttagttttaa cagaggacaa agatgtatta agtgaaattt 3240 tagactctct gcgaaataaa ttccagatta ctgtaagtga tgtcgataac tttgtcggtc 3300 ttaaaattgt gcgtgatagg gtaaataaga caatgtttct gcaccaagcc gaatatgcaa 3360 gtaaagtctt ggaaaagttc aacatgacag atgcaaattc tgtaagcaca cctatggaga 3420 aaaatgctat ttcgtcagag gtgcagcaac attctcagca aatcgaaaaa ttgccgtaca 3480 gagagctaat aggctctcta atgtttttga cagttgtatc gcgtccagac atagcctatg 3540 ctgttaatgt attgagccgt ttcttgaata attttaatag aacccattgg gaattagcaa 3600 aaagagtttt aaaatatttg aaaggctctt tacatatggg aatcctatat acatctaacg 3660 aaagcgaacc actacttgtt ggctattgtg attccgattt tgcgggcgat cagctgacgc 3720 gaaaatcgac gtctggtttt gtttttaaaa tgtgtggggg tccaatgacc tggaatgtag 3780 gtcgtcaaaa agacgtagca cttagcacaa ctgaggctga atatgtagcc gcgagtttag 3840 catcgagaga ggcaatttgg ttgcgaagat tattgagtgg cgtcggatgt ccttgtgcaa 3900 gtgcgacagt tttgaatgta gacaaccaaa gtgcaatcaa gttagtcaaa aatcccgtaa 3960 tgcacaaacg cacaaagcac attgacatcc gatgtcactt tataagagaa aaatatgaga 4020 gaggagaaat tgatgtaaaa tatgtatcaa cagagtgtca actcgccgat gtattcacaa 4080 aagcactaac gagaaattgt catgagagaa tgtgcaaggg gatagggcta gtacacctta 4140 gcgcataaaa tcattttggt aaaatacgta aaaaggggaa gtat 4184 // ID EnSpm-6_HM repbase; DNA; INV; 6797 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6797 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1907-1907 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1264..3363 FT /product="EnSpm-6_HM_1p" FT /translation="FSISHKPRLFSNFLCNLIMLILNFIKRDRHFRWRRHR FT STVHALSELVSLSQHKVKNDREIPSFSSNGHNSSNSQCFSLNCXFSSSGNN FT DSSGKLYSISSSXSINNSVLASSDENAISDIRSCESITESISENSKEENDF FT FPDSQFDSQFCKSLKLWSLKHGCTRQCXNDLLALXSQKGHSLPKDSRTXLG FT TKNTHDIVVVDDEAYVYFGIENNIKRILSLHEYQSQDICLNINIDGLPIWN FT SSSYQLWPILLQFGXFKPFMVGLFGGKCKPKMSKFLSEFVQELKELLSRSV FT LNIKGKQYILSVFCFVCDAPARALLKGIVNHTGFYSCERCTKRGIQKHKRT FT VFDDLEKNVTLRNNDMFYNHSYSFPDNSGKRHQHSDSCLLPLPINFIQDFV FT LDYMHLVCLGVVRRILYFFKGTYSGIFDGRLSSAQQNVISEQLLKFNGRLP FT SDFVRQPRSLSELNWWKATELRSFLLYTGLVALKSVLSKQSYKHFLSLSVA FT IRMLCEKNTIKRNSNIESARQLLNYFVNNSKKHYGSSFCVYNVHGLLHIAD FT DVEYFQCSLDEISAFQFENYLYEIKRLIRGKHNPVMQIVLRLSELEDNPTL FT KEESIFKIQVGKKDSCYLVRNGIVNVTAICQDGGFQCEFYPKSALEYFFTH FT FVDSRDLDIYYIKENACSIKCFIQRTDIIRKCVCLPYRNSYVIIPLMHDV* FT " XX SQ Sequence 6797 BP; 2254 A; 956 C; 1105 G; 2441 T; 41 other; cccagtgggc atacgtcgtc cacacgacgt ctagaggacg ttacaatccc gtccgtccgt 60 cccctnttca agttgtacgt cctacggacg tcttacggag gtctattatt atacggtttt 120 aantgcatga aaacagacgt ccattagacg tccgtaggac gtagagggaa tgaaattttg 180 ttatagaagt ttgaaatata ccgtccgtta caaaaatcaa gaaccaaaag atcttttgat 240 tttccatcaa ccaatgggat tttaattnaa cgaggttaaa gttactattc aacgaagttt 300 aaactattat tcaacgagat tacaactgtt caacgaggtt aatgtagttt tttaaactca 360 ttcaataatt ttttataaag tttaatgtta ttcttcaaca acattatntt ataatcattn 420 aawwykttat tattttttat aacatgaatc gacgttcaga aaaagtgaaa aagataaggt 480 aataatttaa tttcatggta tcactaaatt agttttataa tataaaactt atttagtaaa 540 ttatataaat gatttttttt ttaaaaaatt aaaaaaggaa caaggaacaa gttttttcta 600 atcattaggg aaatgctaat tatgtctaat gtcacattaa tgggcaccaa cagatttttt 660 tcttctgcag aactcagcaa gtttaagctt ttaagcttaa tttgagaagc ttctcttagg 720 gcatcaacag tctgttgatg ccctgaaact ttcaagcatt ttttaaactt cacacaattt 780 aattccacat actgaaggtt ttattaggtt taatttaatt ttaagcttat tggaaatttt 840 tttaaaaggg attcaaaatt aaaaaactaa aaatacaaaa acttactata ggagtcccta 900 tagaaagttt ttatatttta agcttcagta aatttaaatt aactttaagc ttattttacc 960 raaagaattt atttataaat taaattgtgg aattaaattg tgtgaagttt aaaaaatgtt 1020 tgaaagtttc agggcatcga cagtttgttr atgtcctaag aaaagcttct tactaattaa 1080 tattaaaaaa tattaaatat ccatattttg taagaaataa tttaatataa taaataatta 1140 aattaatggc tgaatttggt cytccttctt tttattttat tttttaatgg acacangatt 1200 ttgcccttta accacayagt ttattttyaa tttaaatttt ttacctattt aatwataatt 1260 taatttagta tctcccacaa gccaagactt ttttcaaatt ttttatgtaa tttaattatg 1320 cttattttaa attttatcaa aagggaccga cattttcgat ggcgtcgtca tcgtagtact 1380 gtgcatgctt tatctgaatt ggtttctctc agccaacaca aagttaaaaa tgatagagaa 1440 attccaagtt ttagctctaa tggacataat agttcaaata gtcaatgttt ttcattaaac 1500 tgcarctttt caagttcagg caacaatgat agtagcggca agctttacag catttcttcc 1560 agtgaktcaa ttaataattc tgttttggca agcagtgacg agaatgctat ttctgatata 1620 agatcttgtg aatcaattac agaaagtatt tctgaaaact cgaaagarga gaatgatttt 1680 tttccmgatt cccaatttga ttctcarttt tgcaaatcat taaaactttg gtcattaaaa 1740 catgggtgya cccggcaatg trtgaatgat ttgttagcgc ttcwtagtca aaaaggtcat 1800 tctttaccca aggattctcg tacaktatta ggtacaaaga atacacatga tatagtggtt 1860 gttgatgacg aggcatatgt atattttggt atagaaaata acattaaaag aatattgagt 1920 ttacatgaat accaatcgca ggatatatgt ttaaatataa atatagatgg tctacctata 1980 tggaattcta gctcktatca actatggcca atcttacttc agtttggaaa ktttaaacca 2040 tttatggtag gcttatttgg tggcaaatgt aaaccaaaaa tgtcaaaatt tctttcagaa 2100 tttgttcaag agttgaagga acttttgtct agatcagtat taaacatcaa gggtaaacaa 2160 tatattttat cagtattttg ctttgtatgt gatgccccag caagagcact attaaaaggc 2220 attgttaacc atactgggtt ttactcttgt gaaagatgca ctaaaagagg aatccaaaaa 2280 cacaaaagga ctgtgtttga tgatttagaa aaaaatgtta ctttacgcaa taatgatatg 2340 ttttataacc attcttactc ctttccagat aacagtggga agcgacatca acactcagat 2400 tcatgtctgc ttccattgcc aattaatttt atacaagatt ttgtattaga ctatatgcat 2460 ttagtttgtc ttggagtggt acggcgtata ttatattttt ttaaaggtac atatagtggt 2520 atatttgatg gtcggttgtc ttctgcacag caaaatgtaa tttctgagca gttattaaaa 2580 ttcaatggaa gattgccaag tgactttgtc cgccaaccaa ggtcattatc agaattgaat 2640 tggtggaaag ctaccgagct acgtagtttt cttctttaca ctggtttggt tgcactraaa 2700 agtgtgttaa gtaaacaaag ttacaagcat tttttgagtt tgtcagttgc tattcgaatg 2760 ctatgcgaga agaacactat taaacgaaat agtaacattg agtcagctag acagttactt 2820 aattactttg twaacaattc caaaaagcat tatggctctt ctttttgtgt atataatgta 2880 catggactgc tacacattgc tgatgatgtg gaatattttc agtgtagttt agatgaaata 2940 tctgcttttc agtttgaaaa ctacttgtat gagataaaac gattaattcg gggaaaacat 3000 aatccagtta tgcaaattgt tttgagacta agtgagcttg aagataatcc aactttaaag 3060 gaagaaagta ttttcaaaat acaagttggt aaaaaagatt catgttattt ggttcgaaat 3120 ggtattgtta atgtcactgc tatatgtcaa gatggaggct ttcaatgtga attttatcca 3180 aaatctgcac ttgaatattt ttttacccat tttgttgact caagagacct tgatatatat 3240 tatattaaag aaaatgcctg ctccataaaa tgttttattc aaagaactga tataattcga 3300 aaatgtgttt gcttaccata tcgaaatagt tatgttataa tacctttaat gcatgatgta 3360 taaacaagtt tgagtagttt taataaataa atgtcctttt agtctttttt tttattcaat 3420 gttattaggg gtgggtgagt caatgttaat attaattgag gcggagaaaa aaggtatrgt 3480 ttggctataa gtaagtaaag gtaagaaact tycaagtttt ttactcttgg gctccttatt 3540 tttttttatt tttttctttt aattttatat attacaaaaa aaacattttg gggatagaat 3600 gtaagtacca ttaccacttt tattaattta tttataaatt atcacataga catggtttgg 3660 atgaaagacc agaaatctac aacgcaaaca cactgcttaa tgaagcagag tagctggtca 3720 ttttttatgg tctctgactc gttttggctc ccatggctct ggctttttcc ataaacctct 3780 ggctccttaa gttcaggctc cccgccctta gttgttatga atatttgttt gctttcaata 3840 tattcataca aaatgggagt acatttaaag agtataactt tttaaaaggg atatattcca 3900 gaattttgga taatcgcaat aagtaatgag tctktgtggt ctaaattgat tgactttgca 3960 atgaataata ttgcttatct tttctttaga tattgggctg catgtgttgt ccaagaaaat 4020 ggttcagagt ggaaaaccat tgtgccttgc ccatgggttg atactttaaa tcatcaaatc 4080 ttctggccac taaacgaatc taagtcttta gatttctatt ttaaacaaca tgaatcacct 4140 gagcttactt gggtgacata cccaatagtg gagtttttgc ttcaccaagg cacccgcgag 4200 gaagcacagg agttggttga ttttgcaact acagctgagg atgtttcatc ttctgatgat 4260 ggcaacggta cttgatgttt tcttttttca gttcatttca ataattcttt aggttaattg 4320 caactcacat attttttacg actgatataa aatattacgc agctttttat tttcttactt 4380 aaggttcaaa gaatatgaga aaaaatcaat agttattcat taatttattt tgtataatta 4440 gatttagagt tgtatatata tttagcattg cccaagaaac accctcggac tttaagtcca 4500 gttttagaag ttaagaaatc accatcaccw tacaataaga aagaatcttc tactaatgta 4560 aaattagata gaatacactt cagagaagag aatcttcctg tctttgtatc atcttctttt 4620 ttaaattcat ctatgcagaa aagtaagttt actaatttaa agcaagacaa agataaaaaa 4680 gcagcaaacc cagatactta tcttcacatt ttgagcaaaa gcagaaccac gtcacctgct 4740 ttaattgtat caccatcttc ctctagtgga actagaagag aagaactttt agaaacctct 4800 cctctttcat gctctgctag caaaagtcga tcatctcgta aaacatcctt aaatactttt 4860 gctgatggtg gagctggatt caaggagatg acaaataaac gtaagttctt cattcaacac 4920 tttgattttt tgtccatatt tttattttaa tttaatttct tttttagtta taaattcatt 4980 ataaacatga gcataattgt ttttttagag tttcagtatg ttatgttaat gaaaatggag 5040 caattagagg ataaccaaat ccagattaca aattcattag catctctaat tgatggtgtt 5100 aatggtgatg caagtgttgg agttgaagac atgcaaccta tattatcaat tgaagagttt 5160 gatatagagg aagttaaact tcaaaataaa acatatcgag ataaaaaggt atttatttat 5220 ataagcatta ttatcataag caacaaatac attttttaaa ttaacttctt aattagttta 5280 tgcctttata attttctatt ctacgagttc tattagtttg ctgtttagtg gttaggttga 5340 ggatttggat gcatagataa aatttttgtt ggttttttta aacattctat aatggttttt 5400 aagaaggcca tatattacta tatctaccat ctatatatca ttagttatcg aacggtttaa 5460 aatttatgag ctggaattag gatgaaaatt cgtttaaaac agattgaaaa gtctaaaagg 5520 agtaaacttt gtaaattaaa ttagaaatta aaaaaataat aaaaacgttt tgatggaggc 5580 gataatacct ttcatagttg taagtagtta ttagattgta accttgcgtt gtatataaga 5640 tataattgcc ccatcaaaac tttttttgta ctgttttttt aaacattttt ttgcatacat 5700 ctaaatttct tgactttttt ttgtgagtta attagaatat ttygaaaaga atgttttccc 5760 taatcaaaat aaaatttata acttttctta tatagaaaat agctataaag gcacttggag 5820 gtaaggattt gcgagcaact gtaaggcttg cattagatgc tttgatggtt cgtgatgtcc 5880 aaaacctatt tagcaaggat ggaatatccg gaaaaaaggg catttgcaaa gacaatgcat 5940 tatcgttgct tgctcggtat gtattttcaa taattgttat tgctcgaatt ataaaccggt 6000 aatttaaata tttaataata actaatcatt taatgaataa atgactgaaa aaaacatctt 6060 cggtttcctc atccattgct gctgatcgcc attaatacta attggtatta ttggcctcca 6120 taaaattagc tgaaataact cataattgtt tgtcaattat cattagttat tgatatgttt 6180 tcacgcaaca ttttttcatt tttaggagcg tttctggatc ctacaaaaaa tttgacgaag 6240 gatgtaattg attttaatgt tggagacttg ctcaaaagag cgaagattcg aaataaagct 6300 gacagtttta acaaagagaa taaagagcct gataattaat gttttatgtt gttttgtttt 6360 aagataactt atttttgcga ataaatgtta ttataatatt atgttgttgt tgtttttttt 6420 gcactgagag aattttgatt agttattagc tagyattatt gatggaaatt cggtcataaa 6480 cagcagattg caaacaactt aatttgaaaa ataaaatcta aaataaaaaa aaatctaaaa 6540 taaaagaaac tttcatarga cyattaaata ctgtagcggg tattggtttt targacgtcc 6600 gtaagacgta ggttatacgt ccayacgaca aattttatga cgtcctatgg acgtctttat 6660 ttcgtccctt tgggacggac gaaaagcgtt taaaatagta cgtcctttgg acgtcgccgr 6720 ttcgtccagt taggacgtct cytggtccaa aacggacgtc tactggacgt ctcatggacg 6780 tcccaatgcc cactggg 6797 // ID Gypsy-230_AA-LTR repbase; DNA; INV; 263 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-230_AA_; KW Gypsy-230_AA-I; Gypsy-230_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-263 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1064-1064 (2011). XX DR [2] (Consensus) XX SQ Sequence 263 BP; 69 A; 54 C; 61 G; 79 T; 0 other; tgggaataac atcataagca tccctacgtg ttcgccctgt tacgcattgg cttccgatga 60 tcaacggatc aagcagggga acgcttacga agtatggagt attcataagt gagagctatc 120 ggtcggtcac gttttctgtt gaatataagt ttcaaacatt atcgccgtgc ttagaaggtt 180 tcgatatccg aagttgtcct ccaaaacgcg tatctttcca gcttatgtgg gaaaagtgtg 240 gctgttaact agtatttcct aca 263 // ID Gypsy-144_AA-I repbase; DNA; INV; 6528 BP. XX AC AAGE02024568; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-144_AA_; KW Gypsy-144_AA-LTR; Gypsy-144_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6528 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024568; Positions 35202 28675. XX CC 'AAAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 901..2556 FT /product="Gypsy-144_AA-I_1p" FT /translation="MLNETACGNDLGVTEGEASGGVMPTLEVKRKIMEKSV FT QRSKEILDRLSDYEKGKEENLGDLVKHFKDFVIATTKQEEERRKREIEVEE FT KRMKQAAEGIERKKRLEKLLIDLNEKIRIEPTVVPKQESNPNPLRKEREIS FT DKSKVDFKTKQGLLKSESDDQFETPSGSSDERPQLKSKFKKSNRDGRLSPN FT RMLRNRPKKLSISSEFSDESYESSWSWYSSPDSSADSGKSRRIQERDRREI FT RQRNKREHQRPLKRIPMSEWRIKYDGKDGGRKLSEFLKEVKMRRRAESISD FT RELFRGAIHLFSGRAKDWFIEGMENRDFRNWSELKSDLKREFLPPDLDFQL FT EIQATNRRQARGEKFVDYLHDMLKIFQSMTRPLHEKRKFEIIWRNMRFDYK FT NAMTGAGIRSISKLKRYGRIVDENNWSIFQKSLDNVSKPCSSQLNEISTES FT KSKASKPKDKSKQTQQGANREHQVEEKAQPKTKELPPPMEGTSQGTLQELA FT SHYKRPPIGTCYNCRQRGHHYPECEEPKRKFCRMCGFSGVLTKTCPACPKN FT SESSA" FT CDS 2817..5735 FT /product="Gypsy-144_AA-I_2p" FT /translation="MFQSSTLVKTASGTPIQIKGYVNLPITFNGETHIVAA FT LIAPEVRHRLILGYEDFWRVFNLEPSVQNQRVSYEANSHDSDIEVDEMDNN FT LLTQDQFERLEEIKGMFKVAVEGERLDITHVISHKIELKEEYENSPPVRIN FT PYPTSPELQAKINRELDKMLAQQVIERSKSDWSLSTVPVLKPTGEVRLCLD FT ARRLNDRTKRDAYPLPHQDRILSRLGPCRYLTTIDLSKAFLQIPLDPHSRK FT YTAFSVLGRGLFQFTRLPFGLVNSPATLARLMDEVLGFGELEPNVFVYLDD FT IVVVSNTFEAHIESLREVARRLRNANLSINIDKSKFCVEKLPYLGYIISRE FT GLSPNPERIEAIINYERPKSIRALRRFLGMANYYRRFISNFSELTAPLTNL FT LRNKPKSIRWSDVAEHAFNGIKERLISAPVLSNPNFNMPFVIQTDASDSAI FT AAILTQEHEDGERVVAYFSQKLSPAQQNYAASEKEGLAVLSAIDKFRPYIE FT GTHFVVVTDASALTHIMNGKWRSSSRLCRWSIDLQGFDMEIRHRRGRDNVI FT PDALSRAVELVEVDLDAQTDTWYFPTYNSVRDSPDDHLDYRIENDKLYKYV FT PNKTDTLDYKYEWKQCIPDSSREKLTREEHERNLHIGYEKLLERMRQKFYW FT PRMATTVRRIVERCLVCRECKPSTTSQHPEMGKQRLTSKPFQIMAVDFIQS FT LPRSRAGNCHLFVILDIYSKWTMLVPVRKISTELVIKTLEELWFRRYSVPE FT ILISDNASTFLSKKFKEFLDRYRVVHWTNARHHSQANPVERLNRSIETCIR FT TYVRSDQRAWDTKISDVEYTINNTIHSSTGFTPYRILFGHEIVASGDDHRL FT DVVTSEPSESERIGRKVLVDRTIQELVVKNLKKAHEKSSKTYNLRFRKPAP FT CYSVGQKVYRRNFALSSAGDAFNAKLGSAYIPCTIVAKRGTNSYELAGEAG FT KHLGVFSASDLKPGIPEN" XX SQ Sequence 6528 BP; 2104 A; 1236 C; 1507 G; 1681 T; 0 other; attggcgccc aacttgaaaa gacaacgctg atcagatcgc aaacaactaa taaaaacgga 60 gtcagttagg gatcaaggtc agggttcact caaattgctt tgtatggtgc tatagacggt 120 gaattattaa cgtgattgat acgataccta ctctctcgcc acaatttgat gatacttgta 180 ctgcttacta gtgcgcgtac cctctcaaaa gcgaagattt attgttgatg gtcaaccaaa 240 tatcgggaat attgctttac atcatattgc gattcaattg gtttaatggg tcttgattct 300 tccatgtcat agaaacacgt gaataagtag taacagattc aatatccaat tagattctgg 360 tggaactacg ttagccttag agtatggatt tacaaagctt gtataggaat ttggaagtat 420 ctcacctctc agttgacgag gttgagcatg agttgttgat acgaaacatt ctgtttgact 480 tggatgagca tgagagtaag aagagaagaa aactaaaaga tcgaatgcgc gaagaacgtg 540 agttacaatc cgttgtgctc tcagtgactt ggagagacac ggttgaggag atgacaatca 600 tcaattcgaa actcctgata atcgagggat tgctagccag ccctaagtgt gatatacggc 660 atagagaaaa attgagaact cgccatatcc actacagagt aagaattttt ttcacttttc 720 cgtgctcctc aggctcgtaa gcatcaggcc gaggtaacac aactaggtag aaaagtttca 780 gaaatagttg ataaatactt tcccgagtgt gcctctgttg ctggtgatgg ccacaataca 840 ccagagaagc tggaacaaga tttgagtctt gcaatcgaag atgtgcgtag cgagttagaa 900 atgttgaacg aaaccgcttg tggaaatgat cttggagtga cagaaggaga ggcaagcggt 960 ggagtgatgc ccactttaga ggttaaacga aaaattatgg aaaaatcagt gcagaggtca 1020 aaagaaattt tggataggct gtctgattat gagaagggaa aagaagaaaa tttgggagat 1080 ttggttaaac attttaagga ttttgtgatt gctacaacaa agcaagagga ggagaggagg 1140 aaaagggaga tagaggtaga ggaaaagcgg atgaaacagg ctgctgaagg catagagagg 1200 aagaagagat tggagaaact tctgattgat ttaaacgaga aaattagaat tgagcctaca 1260 gtagttccca aacaggaaag caatccaaat ccattgcgaa aggaaagaga aatatcggac 1320 aagagtaagg ttgatttcaa aacaaaacag ggattgctga aatccgaatc ggatgatcag 1380 tttgaaactc cgtctggaag ttccgacgaa cggccccagc ttaaatccaa atttaaaaag 1440 agcaacagag atggtcgttt atcaccaaat cgcatgctca gaaaccgacc aaagaaactg 1500 agtatttcct cagagttctc agacgagtcc tacgagagct cttggtcttg gtactcaagc 1560 cctgacagct cagcggattc aggcaaatct cggagaattc aggaacgaga tagaagagaa 1620 attaggcaga gaaataagag ggagcatcaa agaccgttga aacgaatccc gatgtccgaa 1680 tggagaataa aatatgatgg gaaggacggt ggtagaaagt tgtcagaatt tttaaaagaa 1740 gtaaagatga gacgtagggc tgagagcatt tcggatcgcg aactttttcg aggtgcgatt 1800 catctttttt ctggaagagc caaggattgg tttattgagg gcatggaaaa ccgagatttc 1860 agaaactgga gtgaactgaa gtccgattta aaaagagaat tccttccgcc agatctcgac 1920 tttcagctcg agatacaggc aacaaatcgt cgacaagctc gtggagagaa atttgttgac 1980 tatcttcacg acatgttgaa aatttttcag tcaatgactc gaccattaca cgaaaaacga 2040 aaatttgaaa tcatttggag gaatatgcga tttgattaca agaacgccat gactggtgct 2100 ggaatcagat caatttcaaa attgaaaaga tatgggagga ttgtcgacga aaacaactgg 2160 agcatttttc aaaaatcgct cgataatgta agcaagcctt gttctagcca gttgaatgaa 2220 atttcgactg aaagtaaatc aaaagcttct aaaccgaagg ataaatcaaa acagacacaa 2280 caaggagcta atcgcgagca tcaggttgaa gaaaaagctc agccaaaaac aaaagaattg 2340 cctccgccga tggagggtac ttcacaaggg acattgcaag aactggctag ccactataaa 2400 cgtccaccaa ttggtacgtg ttacaattgc cgacaacggg ggcatcacta ccccgagtgt 2460 gaagaaccca agagaaaatt ttgtcgcatg tgtggcttct cgggcgttct cacaaaaact 2520 tgtccggctt gtccaaaaaa ctcagagagc tcagcttgag ggaggaagct gaagtcaact 2580 ctccacaaac tcccactgtc aaaaatttaa aagagtatgt gttcgagttc ctggatgatg 2640 atgattacga atctgattcc attgacgaac tattaatcca cctagaagga gatgcccgcc 2700 cttttgtgga agttgaaatt ctcaacaaaa aattggttgg attgttggat agtggtgctc 2760 aaaggacggt tctgggcaag ggatccgaag atcttattaa aaatttgaaa ctgaaaatgt 2820 ttcaatcatc gaccttagtt aaaacagcct ctggaactcc aattcaaatc aaaggttacg 2880 tgaacttacc aattacgttt aatggagaga ctcatattgt cgctgctttg attgcaccag 2940 aagtacgcca tagactgata ctgggatatg aggacttttg gcgcgtgttc aatctcgagc 3000 catcagtaca aaaccagaga gtgagttacg aagcgaactc tcatgatagc gatattgagg 3060 tggacgaaat ggacaacaat ttgttgaccc aggatcagtt tgaaaggttg gaggaaatca 3120 agggaatgtt caaggttgct gtggaaggag aaaggctcga tatcactcat gtaatatctc 3180 acaaaataga gttgaaagag gaatacgaaa attctcctcc tgtaagaatc aacccttacc 3240 caacgtcgcc agaacttcag gctaaaatta atagagagtt agacaaaatg ttggcacaac 3300 aggtaatcga acgtagtaag agtgactggt cgctaagtac agtgccggtt ttgaagccaa 3360 cgggtgaagt gcgcctgtgc ttagatgcac gacgattaaa tgatcggact aagcgtgatg 3420 cttaccctct accccaccag gaccgtatac ttagcagatt gggaccatgt cggtatttga 3480 caacgatcga cttatctaag gctttcctgc aaataccact ggatccccat tcacgcaagt 3540 atacggcctt ctcggtattg ggaagaggac tttttcaatt taccagactt ccctttggac 3600 ttgtcaacag ccccgcaaca ctcgctcgac tgatggatga agtgttgggt ttcggcgaat 3660 tagagcccaa tgtattcgtc tatctcgacg acattgtcgt cgtcagcaat acatttgagg 3720 cccacattga gagtcttcgg gaggttgcga ggcggcttcg taatgcgaac ctgtctatta 3780 acatagacaa atcgaagttc tgtgtagaaa aactccctta tctgggttat attatctcac 3840 gagagggact gagtcccaat cccgagagaa tagaggccat tataaattac gagcggccta 3900 aatcgattcg ggccttgcgg cgtttcttag gcatggccaa ttattataga agatttattt 3960 caaattttag tgagttgacg gcacctctca ccaatcttct acggaataaa cccaaatcga 4020 taaggtggtc agacgtagcc gagcatgcat ttaatggcat caaggaacgt ttaatttcag 4080 cccctgtttt atcgaatccc aatttcaata tgccatttgt cattcaaaca gatgcctcag 4140 atagtgcaat tgcagctatc ctcactcaag aacatgagga tggggaacgg gtggtagcat 4200 acttctctca gaagttgtca cccgctcagc aaaattatgc agcgtctgag aaggaaggtc 4260 tggctgttct gtcagcaata gacaaattta gaccatacat tgagggaact cattttgtcg 4320 tcgtcacaga tgcgtcagca ttaactcaca taatgaatgg caaatggcga tcgtcgtcga 4380 gattatgtcg ttggagtatt gatctgcaag gattcgacat ggagattcgt cataggcgtg 4440 ggagagacaa tgtgatccca gatgcactct ctcgggcggt cgagctggtc gaagtggatc 4500 tggatgctca aacggacaca tggtactttc caacttacaa ctcggtacga gattctccag 4560 atgatcatct agattatcgc atagagaacg ataagttata taagtacgtt cccaacaaaa 4620 ctgatactct ggactacaaa tacgagtgga aacagtgcat ccccgattct agtcgagaga 4680 aattaacgcg cgaagagcat gagcgtaatc tgcacatcgg ctatgaaaaa ctgttagaaa 4740 ggatgaggca aaaattttat tggccaagaa tggctaccac tgtacgtagg atcgtggagc 4800 gttgtttggt ttgcagagag tgcaaacctt cgactacttc ccaacatcca gaaatgggta 4860 aacaaaggct tacctccaaa ccattccaaa ttatggctgt ggacttcata cagtcgttgc 4920 cacgaagtcg agcgggaaat tgtcaccttt tcgttatact cgacatttac tctaaatgga 4980 caatgctagt gcccgtgaga aaaatttcca ctgaacttgt catcaaaaca ttggaagagc 5040 tatggtttcg ccgctattca gtaccagaaa tactaattag cgataatgcg agcacctttt 5100 tgagcaaaaa attcaaggaa tttctcgatc gctaccgagt tgtccactgg acaaatgcca 5160 ggcatcacag ccaggccaac ccggtagaaa gattgaatcg aagcatcgaa acatgcattc 5220 gtacgtatgt gcgctcagat cagagagcat gggatacgaa gatctccgac gttgaatata 5280 caattaataa caccatccat tcatctacgg ggttcactcc gtacagaatc ctcttcggtc 5340 atgagatagt agctagcggc gacgatcatc gtttggatgt agtaactagt gaaccctcag 5400 aatcagaacg aataggtcga aaagtcctgg tagacagaac aatccaagaa ttggtggtga 5460 aaaatttaaa aaaagcccac gagaaaagtt caaaaaccta taatttaagg tttcggaagc 5520 ccgcaccctg ctacagtgta gggcagaaag tttatagaag aaattttgca ctgtcttccg 5580 cgggggatgc gttcaatgca aaattgggat cggcatatat tccgtgtact attgtggcta 5640 aaagaggcac aaattcttat gagcttgcgg gcgaggcggg aaaacatctg ggtgttttct 5700 cagcttcgga tttgaaaccg ggaattccgg aaaactaaaa gaactttgca aatataggta 5760 tgcaggaagc ttaattaagt actcagatta atttgtacag ttgagagttt atttgtctaa 5820 gtcaaagtca gagtcaaggt taaagtccaa gtctgtgtga atccaatgaa gtcagatgtg 5880 tctgtgtttg tactggtcca tagttactca tcgtcataca caagagtgtc aaatttcaaa 5940 ttgtaatagc acattcgaac aaaattggta caagatttca caactgttgc atgaaatgtt 6000 cctaatgatg cgtagcaggt ctttttttga aacgtttctc ctgcgttaca attagaaaaa 6060 tgtcctcaag tacacaaaaa aaaaaaaaaa tcagtcaatc ggttttgctc ggcgtttgtc 6120 gttacgagta cagaatttca cttttactgt atccatcacc atattttctt cacaccagtt 6180 acctctcacg gcgaaacaca tttgaagaat aacttcatcg gatagaattc acacgaaaaa 6240 tgtcgaagag aaagatcaat tctcacactc aacaccgaga gtgttaagat gtctgggaga 6300 gatgagatct gttaaaatca atattggttt agcgtcgata aataatcttt catggatcta 6360 atggaagaaa ttatttggtt gatcaacgca taaatgttcc ctattgcgaa agatttcgta 6420 agcatgtaaa gttgcgtctc aaacctgggt ggtactagat gtatcaaaat actggtttaa 6480 taaattaata acataattat taatttattt agagggagta tggggtag 6528 // ID PPSAT1 repbase; DNA; INV; 156 BP. XX AC K02940; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.pollicaris satellite, consensus sequence. XX KW SAT; Satellite; Simple Repeat; PPSAT1; Repetitive sequence. XX OS Pagurus pollicaris OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Anomura; OC Paguroidea; Paguridae; Pagurus. XX RN [1] RP 1-156 RA Fowler F.R. and Skinner M.D.; RT "Cryptic satellites rich in inverted repeats comprise 30222034f RT the genome of a hermit crab."; RL J. Biol. Chem 260, 1296-1303 (1985). XX DR GenBank; K02940; Positions 1 156. XX SQ Sequence 156 BP; 43 A; 31 C; 35 G; 47 T; 0 other; cggaaaacag cctaaatacg tagttttcgg gtatttcagc cacttttggt cagaaagtga 60 taaaacgccc ccaaatcatg gtttttgtca tttggagtgt gctaacacca aaaaagtcga 120 tttatgacgg gtttcggggc gattttgcac tttcac 156 // ID Penelope-4_HM repbase; DNA; INV; 2461 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2461 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2094-2094 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 90..2138 FT /product="Penelope-4_HM_1p" FT /translation="MAKTETLIKRMRWKALFFINKQTDEKNSEDYVNYGIN FT SSKCPPQIKEMCSFENNLINLVKTIKFRKINNVFQKKLANDIKSIRTSKKT FT LTLADKTSNMYKLSKDQYNHLLNNAITSTYKKADPKLKDQINIQGKQILIN FT HDVYNKIGINGTENCFITLKDHKENFLDKPTVRLINPAKNEIGRISKSILS FT NINSELRKKQFLNQWQNTQNVIHWFQTIKEKQFCKFLVFDINDFYPSINEN FT LLKNALSFAEQHLIINSEQKSFIYHARKSLLFSNGTAWIKKKGGLFDVTMG FT AFDGAEICELVGIFILHQISQFYDKNNFGIYRDDGLAIFRNKSGQEMEKIK FT KHFVQIFKNNNLLISIKSNLKIVNYLDLTLNLKDGSFKPYRKADNILNYIH FT TSSNHPPSIIKSVPKTIELRLSAASSSESIFKESIPPYQEALKKSGHTCEL FT AYHPSIKSSNNNHKRKIIWYNPPFSTNVVTKIGKCFLNLIDHHFPVNHRLR FT KIFNRNTIKISYSCMPNIKSIINSHNKNTLHEEAKVNEKTCNCIDKSKCPM FT NNHCLSNNIVYQATVCSNNPEYKEKVYFGLSETSFKVRYANHLKSFNSIRY FT KNDTELSKEIWKLKEKNFTPFVRWKIVKHCKPYNPTSRVCNLCLNEKFQIL FT MYKGKNLLNTRSEIISKCRHKNKFLISSYDTAD*" XX SQ Sequence 2461 BP; 981 A; 411 C; 329 G; 740 T; 0 other; tcgaattcgt ccaagatgga gaaattaaac tttcaatact caattaaaaa tatctcaata 60 ccatcgagaa agaatttatt ttacaattaa tggcaaagac cgaaacgctt ataaaaagaa 120 tgaggtggaa agcgcttttt ttcattaaca agcagacgga tgaaaagaat tctgaagatt 180 acgtaaacta cggtattaac tcttctaaat gtcctccgca aattaaagaa atgtgttctt 240 tcgaaaataa tttaataaat cttgttaaaa ccatcaaatt tcgaaaaatc aacaacgttt 300 ttcaaaaaaa attagcaaac gatataaaat ctatacgaac ctcaaaaaaa actttaactc 360 tagccgacaa aacatcaaac atgtacaaat tatctaaaga tcagtacaac catctcctta 420 ataatgcaat tacttctacg tataaaaagg ctgaccctaa actcaaagat caaataaaca 480 tacaaggtaa acaaatttta attaaccatg atgtatacaa taaaattgga attaacggta 540 ctgaaaattg tttcataact ctaaaagacc acaaagaaaa cttcctagat aaaccaactg 600 ttagacttat taatcctgca aaaaatgaga ttggaagaat ttcaaaatct attttatcca 660 acattaattc cgagctgaga aagaaacaat ttttgaacca atggcaaaac actcaaaatg 720 ttatacactg gtttcaaacc ataaaagaaa aacaattctg taagttttta gtctttgata 780 ttaatgattt ttatccatcg attaatgaaa acctgctgaa aaatgcgtta tcctttgcag 840 agcaacatct aataataaat agtgagcaaa agagttttat ataccatgca agaaaatcgt 900 tattatttag caatggaact gcttggatta agaaaaaggg aggattattt gatgtcacta 960 tgggggcctt tgatggcgcg gagatttgcg aattagttgg tatttttatt ttacatcaaa 1020 tctcgcagtt ttatgataaa aataatttcg gtatatatcg agatgacgga ttagctatat 1080 ttagaaacaa aagtggtcaa gaaatggaaa agataaaaaa acattttgta caaatcttta 1140 aaaataacaa tcttttaata tccattaaaa gtaatttgaa aatagttaat tatttggatt 1200 taacgctaaa ccttaaagat ggttcattta aaccctaccg caaagcagat aatatactta 1260 attacataca tacaagttct aaccacccac cgagtataat aaaaagtgtt cctaaaacca 1320 ttgagttaag attatcagca gcatcgtcaa gtgaatcgat ttttaaagaa agtattcctc 1380 cttatcaaga agccctgaaa aaatctggac acacttgcga acttgcatac catccaagta 1440 taaaatcttc aaacaacaac cataaacgta aaataatttg gtataatcct cctttcagta 1500 caaatgttgt aacaaaaata ggtaaatgtt ttttaaacct catcgaccac cactttccag 1560 ttaaccacag actccgtaaa atatttaata gaaacacaat caaaataagt tattcctgca 1620 tgccaaacat aaaatccatt atcaactctc acaataaaaa cacgttacat gaggaagcaa 1680 aagttaatga aaagacatgc aactgcattg ataaatccaa atgtccgatg aacaaccatt 1740 gtctttctaa caacattgtt tatcaagcca ccgtttgttc aaacaatccg gagtacaaag 1800 aaaaagtgta ttttggactc agcgagacat cttttaaagt tcgatacgca aaccacctca 1860 aatcttttaa ttcaattaga tataaaaatg atacagagtt atctaaagaa atatggaaat 1920 taaaagaaaa aaactttaca ccttttgtca gatggaaaat tgttaaacat tgcaaaccat 1980 acaacccaac atcaagagtg tgcaacctct gtttgaacga gaagtttcaa attctaatgt 2040 ataaaggtaa aaacctactt aacacaagga gtgaaataat ctcgaagtgt aggcacaaaa 2100 ataaatttct tatttcctcc tatgataccg cagattgatc ctgattggtt tacgtcactt 2160 ttacgccacg tgatactctt aaaacttaaa gttttgtttc catggagtca ttagaactca 2220 cgccaccgta attttaattg taattttaac gtctatttta aattttgtaa tatatggctg 2280 atgattgccg ttaggcatga aacttaaagt tccataataa aagttttttc ttttttcgtt 2340 tttaattata aaccgctctg ttatgaaata tatatataat caaaataaat ttcttaaata 2400 ttgagcactt ttcaacgaac aagagttgtg tattatttta atacaacact acataaaacg 2460 a 2461 // ID L1-5_HM repbase; DNA; INV; 4701 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4701 RA Bao W. and Jurka J.; RT "L1-like retrotransposon from Hydra magnipapillata."; RL Repbase Reports 9(2), 429-429 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(3..653,622..1137) FT /product="L1-5_HM_1p" FT /translation="LNIDGNHQRSYAQVTKPTISKQHLKQLNKTLLSEIQH FT QVNFEDVIIALEESGLDKEIEGVQLIRNDSIIEITLKNEKTKNKLLEMGLD FT IKGFHHRFNNSTPTRHASQRITVSVLGLPLEYKGFIVGALEKMGYGKHIYS FT RSVMKETPKNKTLYYSGILTAVMDKIKTPIPKTIEIEGHKVRIIYTGQENS FT LAMKNKTENNTCNNNIGEPNRSRENNTESLTGAEKIIPKISLNKTIESIKT FT MEKEKEININITEAKENIXPKNKNLNYKTEKPNNHKKFAQNEDAFEEENVN FT LKSNEYEDMDEVEASFPETXQNQDMEEGESCVDTQEKGLKRKTTPQDIKIQ FT TYKRSNKDFGIIYDGKCPETNQNNENNLDTQYQVNNNNILNHIEI*" FT CDS join(1315..2331,2292..3908,3817..4701) FT /product="L1-5_HM_2p" FT /translation="MAIKMLTLNVNGLNDDIKRHNIMMYLKNTCCDIACIQ FT ETHCSNEKEAQWKEEWSKKTQGFSIWNNGNNKERGVAILTKDRIQIDQITF FT NKSGRLLAANITIENCTLRIINVYGPNNPMEKEYFFEKLEKHIRPCDDLII FT LGDFNMVEDPILDREGGDQLNKNNMLGKKQLKRLCDTLDITDIYRKLNPQK FT KSFTYEKPDGSYRSRIDRIYAPTSQANFSTCKIIDHKLSDHKALYCKYKTK FT NLHSNKRGPGYWHLNVSLLSDETFEKNLIDRFEYSKHTKAQFENPNQWWDL FT TKLEIKNCAIERSTDIKHQRILYIKELEKKISIEKKIFFIIRKQSKTKKNI FT LYNKETIKNLKKELKLQKNDFGIFIRTKQTLVETGENPSKFLFNLEKSRSK FT KKTIETIRDGNITHHETDNIMRTIRKYYQKLYTKTDLNIVKQQDLLSNIKK FT KLNNEENQNLNKKLSSEELYEAAMSLQKGKTPGIDGIPVEFYQKYWHIYGN FT ELTNLANNNLFNENGQLSWSQRTALISLIPKDGDKTLLQNWRPISLLCSDL FT KIISKAMALRLAKVLDKILTPTQTCSVPGRNIFSNIFLARDLIHYTNSKNL FT DAILISIDQEKAFDKIDRDYLLKVLQKYNLGKAFIQAIKTTMENTQSIILN FT NGFMSAPFKVSRGVRQGDPFSLMLYCIAVETLALEINKNTGIDGVPIPGSM FT KPLKVMQYADDTTIFVRNPSSIDVLFEILTNFEKATGSNINKEKTKALALG FT GFNYLAYEKYVQQGYYQNHKISWTNSTGLKILGVTFFTDLGYTASFNWQIA FT LCKLENKLKTLKYRNISLKCKATLINTLALAKVWYVANVFPINKIIEKKNQ FT KKNIGLPLAKRLCRTCQTGNFKSPDLKKKIKKRISAYLWQSDYAEPVKREI FT LNLPTDKGGLGILEPTNQCISLRTKHLFNLKEVEKPHEATFLLQYLIAIHL FT TKIATEKYAPWKFLTKNRFPKTINNLPFYLNDIINVLKKNENLLLLKHKTT FT KNIYLYLKNNNNNTWINRLQINWETKLQRTLPWNDIWKRSFNSYAQGPSQN FT TIWRLLQDSLPTTEKVKKWKKNRGRGQISCKSCNKTEDTLHPFIHCRKARS FT VWLAYKKLYERLIPDQPFNIVHTVFSLNIENLNKNDETAKIITTITHIITS FT ELWRARNKYVKENK" XX SQ Sequence 4701 BP; 2083 A; 833 C; 680 G; 1101 T; 4 other; aattaaatat agatgggaac caccaaaggt cttatgctca ggttacaaaa ccaacaatat 60 caaagcaaca ccttaaacaa ttgaataaaa ccttacttag cgaaatacaa catcaagtaa 120 actttgaaga tgtaattatt gctttagaag aaagtggttt agacaaagag atagaaggag 180 ttcaattaat tagaaatgac tcaattattg aaataacttt aaaaaatgaa aaaacaaaaa 240 acaaattatt agaaatgggt ctggatatta aaggctttca ccatcgcttt aataactcaa 300 ccccaactag acacgcgagc cagcggatca ctgtatctgt gctgggtttg ccgctcgaat 360 ataaaggatt cattgttgga gccctggaaa agatgggtta tgggaaacac atctactcca 420 gatccgtaat gaaggaaacc ccaaaaaaca aaactcttta ttactcgggt atcctaactg 480 cggttatgga taaaatcaaa acacccatac ccaaaactat tgagattgaa ggacacaagg 540 tgagaatcat atatacagga caagaaaata gcctwgccat gaagaacaaa accgaaaaca 600 acacatgtaa caataatata ggagagccta acaggagcag agaaaataat acctaaaata 660 tccctaaaca aaaccatcga gagtattaaa acaatggaaa aagaaaagga aataaatatt 720 aacataacgg aggctaaaga aaatattyat cctaaaaata aaaatttaaa ttacaaaact 780 gaaaaaccta acaaccataa gaaatttgct cagaacgagg atgcctttga agaagaaaat 840 gttaatctta aaagtaatga atatgaagat atggatgaag tagaagcatc ttttccagaa 900 acaayccaaa atcaagatat ggaggaagga gaatcttgcg tagacaccca agaaaaggga 960 cttaaaagaa aaacaacacc gcaagacata aaaatccaaa cctataaaag atccaataaa 1020 gacttcggga ttatatatga tgggaaatgt cccgaaacga accaaaataa tgaaaataat 1080 ctcgataccc agtatcaagt aaataacaat aatatactta accatatcga aatataaaac 1140 taaaactaaa caaaaataaa aaatcataaa actaaataaa aataaaaaat cacaaaacta 1200 aataaaaata aaaaatcata aaactaaata aaaataaaaa ttccaatcaa aaccaaacaa 1260 aaataaaaat cacaataaaa caaatactaa aagaaaattt ttgtttttgt aaaaatggca 1320 attaaaatgt taaccctaaa cgtcaacggc ctaaatgatg acattaaaag gcacaacata 1380 atgatgtacc taaaaaatac ctgttgcgac attgcttgca tacaggaaac acactgcagt 1440 aatgaaaaag aagcacaatg gaaagaagaa tggagtaaaa aaacacaagg cttctccatt 1500 tggaacaatg gaaataacaa agagagagga gttgcgatcc ttaccaaaga cagaattcaa 1560 atagatcaaa taacttttaa taaaagtgga agattactag cagctaatat tactatagaa 1620 aactgtaccc taagaataat taatgtctac ggccctaaca atccaatgga aaaagaatac 1680 ttctttgaaa aacttgagaa acatattagg ccatgcgacg acctaataat actaggggat 1740 tttaatatgg tggaagaccc aatacttgat agagaaggag gcgaccaact aaataaaaat 1800 aacatgctag gaaaaaaaca actaaaaaga ttatgcgaca ctctagatat aacagacata 1860 taccgaaaat taaatccaca aaaaaaaagc ttcacctatg aaaaacctga tggatcgtac 1920 cgtagtagaa tagaccgaat atatgcacca accagtcaag caaactttag tacatgcaaa 1980 attatagatc ataaactaag tgatcataaa gccctttatt gcaaatataa gaccaaaaac 2040 cttcattcaa acaaaagagg gccaggctac tggcatctaa atgtatcctt attaagtgat 2100 gaaacttttg aaaaaaactt aattgacaga tttgaatata gcaaacacac aaaagctcaa 2160 tttgaaaacc ccaaccaatg gtgggaccta actaaactcg aaattaaaaa ctgtgccatt 2220 gaacgatcaa cagatatcaa acaccaaaga atattatata taaaagagct ggaaaagaaa 2280 atatcaattg aaaaaaaaat attctttata ataaggaaac aatcaaaaac ttaaaaaagg 2340 aactaaaatt acaaaaaaat gactttggaa ttttcatcag gacaaaacaa actttagtag 2400 aaacaggaga aaatccatca aaattcctgt ttaatctaga aaaatcaaga agcaagaaaa 2460 aaactataga aactataaga gatggtaaca taactcacca tgaaacagac aacattatga 2520 gaacaatccg aaaatactat cagaagctct acaccaaaac cgatctaaat atagtaaaac 2580 aacaagatct actaagcaac ataaaaaaaa aattaaataa tgaagaaaat caaaacttaa 2640 ataaaaaact aagtagtgaa gaactatacg aagcagctat gtccctacaa aaaggaaaaa 2700 ctccaggcat agacggcata cctgttgaat tttaccaaaa atactggcat atatatggca 2760 atgagctaac caacctcgca aataataatc tctttaatga gaacggtcaa ctatcatgga 2820 gtcaaagaac cgcactaatc agcttaatac ctaaagatgg agataagacc cttttacaaa 2880 actggagacc aatctccctt ctgtgttccg acctcaaaat aatatctaaa gccatggcat 2940 tgagacttgc caaagtccta gataaaatct tgacaccaac ccaaacatgc tccgtacctg 3000 gcagaaatat attctccaat atttttcttg ccagagacct aatacactat accaactcta 3060 aaaatctaga tgcgattcta atatcaatcg accaagaaaa agcctttgat aagatagata 3120 gagactactt acttaaagtg ctacaaaagt acaatttagg aaaagctttt atccaagcaa 3180 taaaaacaac tatggaaaac acacaatcca tcatcctcaa taatggtttc atgagtgccc 3240 cttttaaagt cagcagagga gtaagacaag gtgacccttt ttcacttatg ctctattgca 3300 tcgctgtcga aacgctggcc ttagaaataa acaaaaatac aggtattgat ggcgtaccaa 3360 tacctggctc tatgaagcct cttaaggtta tgcagtacgc agatgataca acgatctttg 3420 taagaaaccc ttcctcaatt gacgtacttt ttgaaatact cactaacttt gaaaaagcta 3480 caggatcaaa tataaataaa gaaaaaacta aagctttagc tttaggtggc tttaactatt 3540 tagcatatga aaaatacgtc caacaaggat actaccaaaa ccacaaaata tcgtggacaa 3600 atagcacagg tctaaaaatc ttaggtgtaa ccttttttac agatttagga tacacagcca 3660 gctttaactg gcaaattgcc ctgtgtaaat tagaaaataa attaaaaaca ctaaaatata 3720 gaaatatctc acttaaatgt aaagcaacct taataaacac tctagctttg gcaaaagtgt 3780 ggtacgtagc aaacgttttt ccaataaata agataattga aaaaaaaaat caaaaaaaga 3840 atatcggcct acctttggca aagcgattat gccgaacctg tcaaacggga aattttaaat 3900 ctcccgactg acaaaggtgg gcttggtatt ctggaaccaa caaatcaatg tatttcttta 3960 agaacaaaac acctctttaa tcttaaagaa gtagaaaaac ctcatgaagc cacttttctg 4020 ctacaatatt taatagcaat ccacttaaca aaaattgcca ccgagaaata tgctccctgg 4080 aaatttctga cgaaaaatag atttccaaaa actattaata atctgccctt ctatcttaat 4140 gacattatta acgttttaaa aaaaaatgaa aatctactac tactaaaaca taaaacaact 4200 aaaaacatat acctctatct gaaaaacaat aataataata catggattaa tagattacaa 4260 attaattggg aaacaaaact acaaagaact ttaccttgga atgatatatg gaaaagaagt 4320 tttaattctt atgctcaagg ccctagccaa aacacaatct ggcgactgct gcaagacagc 4380 cttcctacaa cagaaaaagt taaaaaatgg aaaaagaatc gtggacgtgg tcaaatatcc 4440 tgcaaatcat gcaacaaaac agaagacact ttacacccat ttatacattg cagaaaagca 4500 agaagtgtct ggcttgctta taagaaatta tatgaaagac tcataccaga tcagcctttt 4560 aatattgtcc acacagtatt ctcgctaaat attgaaaatc ttaacaaaaa cgatgaaaca 4620 gcaaaaataa taaccacgat aacmcacata ataacctcgg agctatggag agctagaaat 4680 aaatacgtta aagaaaacaa a 4701 // ID Cre-1_HM repbase; DNA; INV; 3945 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.1, Created) DT 14-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Cre-1_HM non-LTR retrotransposon - consensus. XX KW CRE; Non-LTR Retrotransposon; Transposable Element; Cre-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3945 RA Kapitonov V.V. and Jurka J.; RT "First examples of CRE non-LTR retrotransposons in animals."; RL Repbase Reports 9(10), 2158-2158 (2009). XX DR [1] (Consensus) XX CC Cre-1_HM is a family of non-LTR retrotransposons that belong to CC the CRE clade. The hydra genome contains >10 copies of Cre-1_HM CC that are ~97% identical to the consensus sequence. The hydra CC genome contains several families of Cre non-LTR retrotransposons. XX FH Key Location/Qualifiers FT CDS 833..3913 FT /product="Cre-1_HM_1p" FT /note="RT and RLE domains." FT /translation="MNMVSICKRCDRSFTTLKGLNIHKGQCKIFVSNTNKQ FT INNVVNNELTTPNKNKVEINTILNCDEISVEHYSTNTPYLPKINICESIID FT PNDYLWGHMPFSFLLNHVNTIYDEIVFYHKNLFKVPSGKGGKMFIEELTFW FT LKQFNNRTKLNGIAMKCFMIVPSLMLQKPSIRSKAKEHAECLVRRITLWRN FT GNFSELMREIRYIQSKINTSKKKRTFEDISRIFAKLMMEGKVAAALKVLDR FT ESSGILQCSESVLKELKSKHPDETPVQDNCLLYGPLQNTPECLFDSIDEIS FT IFNSALQTKGSAGPSGMDADLYRRVLCSKCFGPSCKTLREEIATFTKNIAT FT KSYQPDIVQPYIACRLIPLDKNPGIRPIGIGEVLRRIVGKTISHHCQKEIK FT EAAGPLQTCAGHGAGAEAAIHAMQKIFHQEDTDGVLLIDARNAFNCLNRSV FT ALHNIQITCPILAMYLVNTYRKPAKLFIYGGETIFSKEGTTQGDPLAMPWY FT SLSTVTIINTLKLVIPDVKQVWLADDATAAGKLQSLKKWYKCLEDVGGLYG FT YYVNQSKCWLIVKSDNQAEEAKLIFGNSINITTQGKRHLGAALGSEAYKKV FT YCEDLVSKWSKELNNLCEIATTQPQAAYSAFIKGYRSKFTYFLRTIEAFEN FT FVTPVEKILSEKLLPVLFGTDCSIIKENRDLLALNPSEGGLGICNLITEAK FT EQHTASKKITNLHIKSILDQSDVMKEKDDFGKTFSEIKTKTNMDKSKKKKE FT EVKKIHAGLPENLKLLVEQACDKGASSWLNTLPIKEQHLDLNKEEFKDALR FT LRYNVPLANLPSYCACGEKFDELHAMSCKKGGFVCNRHDNIRDLLTVCLNK FT VCTDVQAEPHLIPLTNEKFNFKTANTNDEARLDIKAKGFWRKGETAFFDVR FT VTHVNSKSSKKQPTKHIFRRHEDAKKREYLERVLEVEHGTFTPLIFGTNGG FT FGDECKRFTALLAQKLSLKMGERYGAVINWLRTRLSMEITRASLLCLRGSR FT TPFRHYNTDDVGLENVQCGLI" XX SQ Sequence 3945 BP; 1420 A; 668 C; 697 G; 1160 T; 0 other; tttctaatgt tacgtgatat gatatggtta gttcatggtt agtttatgtt tatgcttagt 60 ttatggaaaa tcgtttattt atggcacaat attgtttgct gtttttaaat ttatgtaacg 120 tgtgcatttg atgtatattc ttgaactttt taatctgaat ttttacttgg tttaatacgt 180 ttattatatt cttcgattga gcaatttatc ctatcaaagc aatttatcct tcgattcgag 240 caatttatcc ttcgattcga gcaatttatc cttcgattga gcaatttatc ctatcaaaat 300 tagcatatat actgcaattt tcaaataatc tacgaaataa gttcacttac tgaaaatcat 360 taagtaaaag aagaaaggaa gaaaaaataa aaataaaaag tagtaaatcc tttcataaca 420 ataatcattc tattattaaa tttaaaggaa tattttggtt ttgtactaaa tcatgcgttc 480 atatttcacc gaagaagggg gctgctatat ttttgtttga agttgtttat cttaaaactt 540 taaacttgtg ttcaaccaac cgtaaacatt agttcgctgt tcgctcaaat tatctacaat 600 ataaaattta tcaatctttt ttcgttacgg taaacaataa acaataaaat aactatagtt 660 attttattgt ttaccgcata ttgtttaact atagttaaac aaagtatttg tttatggaac 720 attaccagta tctcttgtta aggtaaacaa caaaacatag acggcatctc tttttaaggt 780 aattaagtat acggctaata ataaaaatat acagctaata ataaaatctt caatgaacat 840 ggtttctata tgcaaaagat gtgatcgtag ctttactacc cttaagggac taaatattca 900 taaaggtcaa tgtaagatct ttgtttccaa tacaaataaa caaataaaca atgtagttaa 960 caatgaatta acaacaccga ataaaaacaa ggtggaaatt aatacgatat taaactgcga 1020 tgagatatct gtagaacact attcaaccaa cacaccttac ttacccaaaa taaatatttg 1080 tgaatctatt atagatccca acgactatct atggggtcat atgccgttta gcttccttct 1140 caaccatgtc aacacaatat acgatgaaat agtattttac cataaaaacc tttttaaagt 1200 gccatcagga aaaggtggta aaatgtttat agaagaactg accttttggc taaaacagtt 1260 taataatcga accaaattga atggaatagc catgaaatgt ttcatgatag tcccttccct 1320 aatgttacag aagccctcaa tacggtccaa agccaaagaa catgcagaat gtttagtaag 1380 acgaattaca ttatggagaa acgggaactt tagtgaattg atgcgggaaa ttagatatat 1440 tcagagcaaa attaacacct caaaaaagaa aaggacattt gaggatatct caaggatatt 1500 cgcaaaacta atgatggaag gtaaagttgc tgccgcactg aaggttttag atagagagtc 1560 atctggcatc ttgcaatgct cggaaagtgt attgaaagaa ttgaaaagta aacacccaga 1620 cgaaactcct gtacaagata attgtttact atacggcccg ttacaaaaca ctccagaatg 1680 tttattcgat tcaattgatg agataagtat atttaactca gctttacaga ctaaaggatc 1740 tgcaggtcct tctggaatgg atgcagatct ttaccgtcga gtcctatgct caaaatgttt 1800 tggaccctct tgtaagactc tacgagaaga aatagcaaca tttacaaaaa atattgcaac 1860 aaaatcctac caaccggata tagttcaacc ctacattgca tgtcgactaa ttcccttaga 1920 caaaaatccc gggattcgcc ccataggaat tggggaagtg ttacgtagga ttgtaggtaa 1980 aaccattagc caccattgtc aaaaagaaat caaagaggca gctggaccac tacaaacttg 2040 cgcaggacac ggtgcaggag cagaagctgc aatacatgct atgcaaaaga tatttcatca 2100 ggaagataca gatggtgttt tgttaatcga tgctaggaac gcgtttaact gcctaaaccg 2160 ttctgttgca ctacataata tacagataac ttgcccaatc ttagctatgt atttagtcaa 2220 cacttaccgt aaaccggcaa aattattcat ctacggtgga gaaactattt tttcgaaaga 2280 aggcacaacg cagggcgatc ccctcgccat gccatggtac tcacttagca ctgtgacaat 2340 cataaataca ttgaaactag taattcctga tgtaaaacaa gtatggttag ccgatgatgc 2400 taccgctgca ggaaaattac agtctttaaa aaagtggtat aaatgcctag aggatgtcgg 2460 tggtttgtat ggttattatg taaatcagtc aaaatgctgg ctaatagtaa aatctgataa 2520 ccaagctgaa gaagctaaac ttatatttgg caactccata aatataacta ctcagggaaa 2580 aaggcactta ggagctgcac ttggttcgga agcatacaaa aaagtgtatt gcgaggattt 2640 agtaagtaaa tggtctaaag aacttaacaa tctctgcgaa atcgccacca cgcaaccaca 2700 agctgcttat tcagctttta ttaaagggta cagatctaaa ttcacttact tcttacgcac 2760 aattgaagct tttgaaaatt tcgtaacacc agtggaaaaa attttatcag aaaaattatt 2820 acctgtattg tttggaactg attgttctat aatcaaagaa aatagggatt tattggcgct 2880 aaatccatcg gaaggaggac ttggaatttg taacttaata actgaggcca aggaacagca 2940 tactgcctct aagaaaataa ctaacttgca cataaaatca atactcgatc agtcagatgt 3000 tatgaaagaa aaagatgatt tcgggaaaac attttcagaa ataaaaacaa aaacaaatat 3060 ggataaatct aaaaaaaaaa aagaagaggt taaaaaaata catgcaggac ttccagaaaa 3120 ccttaaactt ctggttgaac aggcctgtga caaaggtgcc agcagctggt taaacacctt 3180 accaattaaa gaacaacatc tagatctgaa taaggaagag tttaaggacg cacttagatt 3240 gagatataat gtgccacttg ccaatttacc atcctactgt gcttgtggag aaaaatttga 3300 cgagctacac gcaatgtcat gcaaaaaagg tggctttgtt tgtaacagac atgataacat 3360 cagagattta ttaactgttt gcctaaataa agtttgtact gatgttcaag cggagccgca 3420 tttaattcca ttgacaaatg aaaaatttaa tttcaaaact gccaatacca acgacgaagc 3480 tagattggat ataaaagcaa aagggttttg gagaaaagga gaaactgcat tttttgatgt 3540 tagagtaacg cacgtaaact ccaaatcctc caaaaaacaa ccaacaaaac acatattccg 3600 taggcatgaa gatgcaaaaa aacgtgagta tttagaacga gttctagagg ttgaacacgg 3660 gacatttacc ccattaattt ttggtacgaa tggtgggttt ggagacgaat gcaaacgctt 3720 cacggcacta ctcgcacaaa aactgtcctt aaaaatgggt gagcggtacg gagctgttat 3780 aaattggcta aggacacgtc tttccatgga gattactaga gcctccctac tctgcttaag 3840 agggtcacga accccattta ggcattataa cactgacgat gttggcctgg aaaatgtgca 3900 atgtggactt atttaacttg tatttttaaa ttgttttatt agttt 3945 // ID hAT-N18_AP repbase; DNA; INV; 815 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; hAT-N18_AP. XX NM hAT-N18_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-815 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2118-2118 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 815 BP; 305 A; 120 C; 113 G; 277 T; 0 other; actagggccc ggatttatat gcaataaaaa tctataaaat atgcggattc atatcataaa 60 tatgcaaaaa atatgcaaaa tgattttttt tacaaattaa caatatatta tgtataaaaa 120 ttgtttatta ataaaaaaat gcaaaataac atttttaatg acattgtgta agtataaaag 180 tgttataaat taattaaagt attactattt atattaaatt attattattg gacgagtacc 240 agcatgaagt aataaatatg gatttcaagt tttcgaattg aaaactccta cgattatctc 300 gaagcatact tttgtacttg gaaaaacttc tttccacgtc acaagataca attggtgcat 360 atttaaaact cgataaatct tcaactgaca gttcctcgtc ttcatcacat tcagttacct 420 cgccgttgat tataccatcg atttttttga ttgttcgacg tgtttgggta ataataaatt 480 taccactaga aaatccaata tttgtcgtca ttcgtcggcc gtcgataacg atgatgtcga 540 ttgacgatca cttatcgatg tataatgtgt tatttacaat cgacagtagg tatctttaca 600 gacgtaaatt ctatgaattg tagaatatgc aacatttatg caactatcaa taaatatgca 660 ataaatatgc attttcattc aaatatgcaa aaatatgcaa aaaaaaaaat tgaaccatag 720 aatctacgca ttacgaaacg tcgacattta taatcaaaat tattctaggt gaattgtaga 780 agaacatgca tttgcatata aatccgggcc ctagt 815 // ID Proto1-1_NG repbase; DNA; INV; 5413 BP. XX AC . XX DT 21-MAY-2009 (Rel. 14.06, Created) DT 21-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Proto1-1_NG is a non-LTR retrotranspsoson from the Naegleria DE gruberi amoeboflagellate genome - a consensus sequence. XX KW Proto1; Non-LTR Retrotransposon; Transposable Element; Naegleria1; KW Proto1-1_NG. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-5413 RA Kapitonov V.V. and Jurka J.; RT "Proto1 non-LTR retrotransposons from the Naegleria gruberi RT amoeboflagellate genome."; RL Repbase Reports 9(6), 1144-1144 (2009). XX DR [1] (Consensus) XX CC Proto1-1_NG is a very young familiy of non-LTR retrotransposons CC that belongs to the Proto1 clade of non-LTR retrotransposons. CC This clade includes also the Proto1-2_NG, Proto1-3_NG, CC Proto1-4_NG and Proto1-5_NG families from the the Naegleria CC gruberi amoeboflagellate genome. The Proto1 elements code for two CC ORFs. The ORF2-encoded proteins are composed of the apurinic CC endonuclease, reverse transcriptase and ribonuclease H domains. CC It is likely that the Proto1 clade is a sister clade of the L1 CC clade. Proto1 retrotransposons are characterized by 15-18 bp long CC target site duplications and by a weak target site preference: CC 5'-CATTTTTTTNNNNNNNN-retrotransposon-ATTTTTTTNNNNNNNN-3'. XX FH Key Location/Qualifiers FT CDS 15..1652 FT /product="Proto1-1_NG_1p" FT /note="Proto1-specific protein of unknown FT function." FT /translation="MPLHDRPPDPINCLQLTNRILVLEQAVDSYRIENIRL FT QKYMETMQKIYDKIMFKSEFVQDKDVKVKVDNINQEDTSTTNISTDSFKKP FT TSEIQNIKSTPQNRNQQIHSITSNIKPKYECSKNPIEQNTKLQLTPYLDMV FT KYHPIKAQPSIKQTIAKQIKMKRSISFYGKKPVKQNKKFDLTHSLNIICYE FT RINNKFMEQIMDIFGEFATIKIRPQTSNVINLHCNSNDNYILCKDMLDSVH FT NDLILKFFPSPAVFEHSLYALTHSTHIYSQHIDMIKSLIKHSVNNDNTILI FT SQMYTNQSHGKYIIRVSYENEDVCEVIRKKLNLTNYKSVEQLRNETPTNVS FT ECLFPPTFTNKEIIDGVTRVYERIKKDFNTNQCMITESKNKSNTTTFKLCK FT VVHSNIDEMNLFIDKKIVLNTKQNITDGTSYCFVPSIKQLDEIIERKQKQY FT PFLTNSKLPEVTNPMINPIQSNDNTSSTTNSVVTPTINTTTNLFETRLSNI FT ETELRDFKKALADTMELIDKISHYINNSTNNSIMQDDEESKSSSSCTLSK" FT CDS 1652..5377 FT /product="Proto1-1_NG_2p" FT /note="contains the APE endonuclease, reverse FT transcriptase and ribonuclease H domains." FT /translation="MSNYSNLRLACYNVNGLYKHASKKSVTPEVKSMCLHH FT KIHMISLTETHLNTPDLEYKVKNSGTGYILFNSLMNTTKGKKGTAVMHFLN FT NPHKRNITNINLIPGVLQWTRFESKGIPINIFTPYLSGRVDDKENDVEAMN FT SITNAVLTCRDEHVIINGDLNINTINPSNGRDKEWCWLFNLLQLTEYKTNN FT KTYIRRQQNRITTSRLDHIFVSKDIKVIKEELLPPLTSNDHIPFYIDIEFK FT SHLNWRSYIGKKKREEMIKEINTSNANTFQELNAEIQNNIAKYGTKVDRLG FT GSIISSIFKGKIRKLENKIMGKLEDPNVGLDEISELQTILKETHIQSCKEI FT KEQLTDKYNNSHASHMYQFDRKLHSKEIIWKELPFGEQEILDYFSKKFTTS FT NQTPTFMPSPSTITSGPRDIGRISYRDVKETIKRMKSNTPGQDEISLDIIK FT GLKLEKLALIVEEFNKCLTQGDIPQEWKHGWVKLIPKREVKTLGDIRPITI FT LPIFYRILFNLIAHKLCEWASSQINMRQQAFINNRNTMNHGIVLSSLALKT FT RRRTFILVNLDIEGAYDAVEMPVIRMALNHCKFPTPLTDFIINAYSNHSLQ FT LEIDKHLSNNFTKTRGIPQGCPLAPLIYDCITQLIIDKAIDQWQLPIEPSE FT LNANEIGLCCFADDMNIVCDRFTNYNGRLDNIDDWLSQLLFKLNAKKSIAT FT LLPTKSKIIPKIKQVPIPIQKNLRVLGHYPWEDKLVKEDIESKIAKFNESL FT KYFPILKLQPHNLRQVLHAKVISLFTHLSKLNVIPPSLVSRIDICIRNVLR FT TKLKLDRTTKTAFFHLPLEEGGLGVPSIEEFTDRLNLRTLSLIANSNNKLV FT QKAFKFGISHTNDNSNNIFIQWMSILQKYNLDFKKNHTKFRLKKVNETNRC FT FIIHTDGSKLAGKTGMGINIYESGGRLSPCKSLSLRINNEYSNNVAEIGAI FT ITAMKLLPNGTTARIHTDSKVAIDVLSRKYKGNFLPFKTEFFNIQKEKNIS FT VYIIKVKGHEDKENIIVDRLAKDGTERDKIFDIKSLLTNQYLLMKNDIVIF FT DCKRMTVNRQLKEMHEAIDSNPNFIPNQSWSSINVAYLKSRLSPKTKYQIW FT RNASNSHIKYTQKKDELNPTCSDCNCPSNLEHYIHECPRLDNQRRYCQNRL FT TEILDRPECSLSPIQEIPFQYSYVLYFHHSGITFFENEYIDKIPNHNVLKK FT WPEIQAAISKFIGRAFLIYQIDSKPIYKRSEK" XX SQ Sequence 5413 BP; 2153 A; 989 C; 749 G; 1522 T; 0 other; cgacttttaa aaaaatgcct ttacatgacc gccctcctga tccaataaat tgtctgcagc 60 tgactaacag gattcttgtg ttagaacaag cagtcgattc atatcggatt gaaaatattc 120 gtcttcaaaa atacatggaa actatgcaaa agatttacga caaaataatg tttaaatccg 180 agtttgtaca agacaaagat gtaaaggtta aagttgataa tattaatcaa gaagatactt 240 ctaccactaa catttctact gattcattta aaaaacctac ttcagaaatc caaaatatta 300 agtcaactcc ccaaaataga aatcaacaaa ttcattccat tactagtaat attaaaccaa 360 aatatgaatg tagtaagaac cctattgagc aaaataccaa actacaactt actccatatt 420 tggatatggt gaagtatcac cctataaaag ctcaaccaag tatcaaacaa actatagcaa 480 aacaaatcaa aatgaaacgc tcaatttcat tctatggtaa aaaaccagtc aagcaaaata 540 agaaattcga tttgacacat tcacttaata ttatctgcta tgaaagaatt aataacaaat 600 tcatggaaca aattatggac atctttgggg aatttgccac cattaaaata agaccccaaa 660 ctagtaatgt cattaatctt cattgcaata gtaatgacaa ttatattctt tgcaaagata 720 tgcttgatag tgtacacaat gatttaatat tgaaattctt cccatcacct gctgtattcg 780 agcatagctt gtacgctcta actcactcta cacacatata ttcacaacac attgatatga 840 ttaaatcact aatcaaacac tcagttaata acgataatac tatacttatc tcccaaatgt 900 atacaaacca atcacatggt aaatatataa ttagagtttc atacgaaaac gaagatgtgt 960 gcgaagttat tagaaagaag cttaatttaa ctaattacaa atctgttgaa caattgcgta 1020 atgaaactcc tactaatgtt tcagaatgtt tatttccacc tacctttaca aacaaggaaa 1080 tcatcgatgg tgttactcgt gtatacgaac gcattaaaaa agatttcaat actaaccaat 1140 gcatgataac tgaatccaaa aacaaatcca ataccactac ctttaaatta tgtaaagtcg 1200 tccattcaaa tattgacgaa atgaatctat ttattgacaa gaaaattgtt ctcaatacca 1260 aacaaaacat tactgatgga accagttact gtttcgtacc atctattaaa caattagatg 1320 aaataattga aagaaaacaa aaacaatatc catttctaac taactctaaa ttaccggaag 1380 taactaatcc tatgatcaat cctatacaat ctaacgataa taccagttca accactaatt 1440 ctgtagttac tcctactatt aatacaacaa ccaatctatt tgaaactaga ctatctaata 1500 ttgaaactga actgcgtgat tttaagaaag cattagctga tacaatggaa cttattgaca 1560 aaatctccca ttatataaat aatagtacta ataattctat tatgcaagat gatgaagaat 1620 caaaaagctc atcatcttgt actctttcta aatgagtaat tatagtaatt tacgactagc 1680 gtgttacaat gtaaacggcc tgtataaaca tgctagtaag aaatcggtta caccagaagt 1740 gaaatcaatg tgtcttcatc ataaaataca catgatttcg ttaaccgaaa ctcatttaaa 1800 tactcctgat ctagaatata aagttaaaaa ttcaggaaca ggctatatat tatttaactc 1860 cttaatgaat acaacgaaag gaaaaaaagg gacggctgtt atgcacttcc ttaacaaccc 1920 tcacaagaga aatattacta atatcaatct aattccaggt gttcttcaat ggacacgatt 1980 cgaaagcaaa ggtataccaa ttaatatttt cactccatat cttagtggaa gagttgatga 2040 taaagaaaat gatgttgaag ccatgaacag tataacaaat gcagtactga catgtagaga 2100 tgaacatgtc ataattaatg gagatcttaa tatcaatacc attaatcctt caaacggtag 2160 agataaagaa tggtgctggc tatttaatct actccaacta acagaatata agactaataa 2220 taagacttac attagaagac aacagaatag aatcaccact tcaagattag accacatttt 2280 cgtgtctaaa gatattaagg ttatcaaaga agaactcctt ccaccgctca caagtaatga 2340 tcacatccct ttttacattg atattgaatt caaatcacat ttaaattgga gatcatatat 2400 tggtaagaag aaaagagaag aaatgatcaa agagataaat acgtcaaatg ccaatacatt 2460 tcaagaatta aatgcagaaa tacaaaacaa cattgcaaaa tatggaacca aagttgacag 2520 acttggtggt agcattattt cctccatatt taaaggtaaa attcgtaaat tagaaaataa 2580 gattatgggt aaattagaag acccaaatgt tggacttgat gaaatatcag aactacaaac 2640 aattcttaaa gagactcaca tccaatcatg taaagaaatc aaagaacaat taactgacaa 2700 atataacaat agtcacgcct cccacatgta tcaatttgat agaaaactgc attctaaaga 2760 aattatctgg aaagaattgc catttggaga acaagaaatt ttagattatt tttccaaaaa 2820 gtttacaacc tccaatcaaa caccaacctt tatgccaagt ccatccacta tcacttcagg 2880 tcctagagat ataggtcgga tttcatatag agatgtaaaa gaaaccatta aaagaatgaa 2940 atcgaataca ccaggacaag acgaaatttc attagatatc atcaagggct taaaattaga 3000 gaaattagct ttaatagttg aagagtttaa caaatgttta acacaaggag atatccccca 3060 agaatggaaa cacggttggg ttaaactgat tccaaagcga gaagttaaaa cattaggaga 3120 tattagaccc atcacaatac taccaatatt ttatagaatc ctttttaatt taatcgccca 3180 caaattatgt gaatgggcta gctctcagat taatatgaga caacaggcct tcataaataa 3240 tagaaataca atgaaccatg gaatagtact ttcctccctg gccttgaaaa ctagacgaag 3300 aacatttata ttagtcaacc ttgacattga aggtgcctac gacgcagtgg agatgccggt 3360 tattaggatg gcccttaatc attgtaaatt cccaacacca ctaactgatt ttataattaa 3420 tgcatacagc aatcacagtt tacaattaga aattgacaaa catctatcaa ataacttcac 3480 aaaaacacga ggaattcctc aaggttgccc attagcacct cttatttacg attgtattac 3540 acaactcata atagacaaag caatagacca atggcaacta ccaatagaac catctgaatt 3600 gaatgcaaac gaaattggtc tttgttgttt cgctgacgat atgaatatag tatgcgacag 3660 attcacaaat tataatggaa gactagataa catcgacgac tggttatctc aattgctttt 3720 caaattaaat gcaaaaaagt caattgccac tctccttccc acaaaaagca aaatcatacc 3780 aaaaatcaaa caagtgccaa ttcccattca gaaaaatctt agagttctcg gacactaccc 3840 ttgggaagat aaattggtca aagaagatat tgaaagcaag atagctaaat tcaatgaatc 3900 actgaaatat ttcccaatac tcaaacttca accccataat ttaagacaag tccttcatgc 3960 taaagtgatt agtttattta cacatctatc caaacttaat gtcattcctc catcactggt 4020 atcaagaata gatatttgca ttagaaatgt actcagaaca aagttaaaac tagacagaac 4080 caccaaaaca gcttttttcc acctaccatt agaagaaggt ggactcggtg ttccctcaat 4140 tgaagaattt acagacagac ttaatttaag aactctctcc ttaatagcaa attccaataa 4200 caagttagtt cagaaagctt ttaaatttgg aattagtcat accaatgata atagtaataa 4260 cattttcatt caatggatgt caatacttca gaaatataat ttagatttca aaaagaatca 4320 taccaaattc agattaaaga aagtgaatga aaccaataga tgctttatta tccatacaga 4380 cggatccaag cttgcaggta aaacaggtat gggtattaat atatatgaat ccggtggaag 4440 gttatcacct tgtaaaagcc tttcactccg tatcaataat gaatactcta acaatgttgc 4500 agaaatagga gcaattatca ctgcaatgaa actacttcca aatggaacaa ccgctagaat 4560 acacacagat agcaaagttg caatagatgt cctatccaga aaatacaaag gaaactttct 4620 ccctttcaaa actgaatttt tcaatatcca aaaagaaaag aatatttcag tttatataat 4680 caaagttaaa ggacatgaag ataaggaaaa tataatagta gatcgccttg cgaaagatgg 4740 aactgaaaga gataagatct ttgatatcaa atccttacta accaaccaat atttacttat 4800 gaaaaacgat attgtgatct ttgattgcaa aagaatgact gtaaatcgcc aacttaagga 4860 gatgcacgaa gcaattgata gtaatcccaa ctttattcca aaccaaagct ggtcatcaat 4920 taatgtagcc tacctcaagt cacgactgtc acctaaaacc aaatatcaaa tttggcgaaa 4980 cgcttcaaat tctcatatca aatatacaca gaagaaagat gaattgaatc caacctgctc 5040 agattgtaat tgtccctcaa atttagaaca ttacatacat gaatgcccca gactagacaa 5100 tcaaagaaga tattgtcaaa atagattaac agaaatactt gatagacctg agtgttcact 5160 ctcccctatc caagaaattc ctttccaata ctcatatgtg ctatatttcc atcactcagg 5220 cataaccttt tttgagaatg aatatataga taagatcccc aaccataacg tcttaaagaa 5280 atggccagag atccaagctg ctatttccaa atttattgga agagccttcc ttatttatca 5340 aattgattca aaacctattt acaaacgaag tgagaaatag aatctgaata gattcgccaa 5400 aatgcattaa aag 5413 // ID hATN-1_SM repbase; DNA; INV; 194 BP. XX AC . XX DT 08-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hATN-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-194 RA Jurka J., Bao W. and Tempel S.; RT "Non-autonomous hAT-type DNA transposon from Schmidtea RT mediterranea."; RL Repbase Reports 8(2), 158-158 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 194 BP; 48 A; 43 C; 44 G; 59 T; 0 other; caggggtggc caaactgcgg ctcgcgagct gcatgtggtc caccttgacc atttatgttt 60 caatttttaa ctcctcaaaa atgtctatag tgtatctaaa tctaatatga ggatatataa 120 ttttatatgc ggcccgcgga gttatgatgc atttccaaag tggcccgcaa gatcatttgg 180 gttggccacc cctg 194 // ID Tx1-16_BF repbase; DNA; INV; 5243 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-16_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-16_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5243 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5243 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 853-853 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 165..941 FT /product="Tx1-16_BF_1p" FT /note="ORF1p is incomplete." FT /translation="MSTRATRGSTRKRDEPAGHMEGKHGDDLDDVTGASEF FT EAFVREQFKKLLEGQKELQEEIRALEDNVEKNVQLLEARMEQMEKKMEASE FT TTQRESAKQIKSITQRLKATEQAVEEFQTKCNKLERFSRRNNVRIIGRKVQ FT KGENCVSTVEKILEDKFGLDHIKIERAHPDGPRNSDSDGVPQHILFKLNSY FT ADKVSIMKAARRKLQDEPYYFTDDLTVTDLQEKRKWMEKVRHAYKEGKKYR FT FYNGYWRDGQGKVVSFD*" FT CDS 2137..5241 FT /product="Tx1-16_BF_2p" FT /note="endonuclease and RT." FT /translation="MFLKETRKFIEEFFLFNKGSAEPRVVWDTFKCAFRGH FT CIKYTSFKKKQIFEKENRLLDEIRRLQNDIDSLGSPPLCLLEELEQKERVL FT ETLYEEKRQKIMLHSRATWMKLGEKPTKYFFSLLNRNKTKKNVTKIVLQDD FT KFVTNPSDILCEEVKFYSSLYSFEDVKKPLKEEKEYNMFFVEGNIPTLTKQ FT QSEICEGRIKVKELWDSISTFKKGKSPRVDGITVEVYKEFFYLLKQPMLDC FT FNKVYDLGQLSETQKVGIISLLLKQDATGTDKNPVYLKNWRPLTLLPCDTR FT ILSKCIALRIKKVISSIIHMDQTGFLSGRSISDNIRKVLEIIEFCEKDNIE FT GLIFITDFEKAFDKIRIDFVLEALKKFGFGKSLLQWVTIMYTMITSRISNQ FT GYLSEPFRLLRGLRQGCPLSPYLFVLSVELLSIRVRANENIEGLVCRKEPC FT KLSQFADDTNFFLKPVLTTLETLCLELKNFSNISGLVPNFDKCIILRMGPL FT KDTGFQLPCTFPVKWSDGPADLLGIHIPQDMNTLISDNFSRKLQKVDRILQ FT PWKGKTLTLYGKVTLINTLVVSQFTYLFMSLPSPDAEFFQTYEKKVFKFLW FT SGGPERVKRNIVYNMYEYGGLKLINLEAFNAALKAAWVPRMYLNKSWFCRK FT MLNHCIHFNYDIFPFLQLTEAHFHNVVSKHSYLSTFFEEVMLAWLKAQKKP FT LDNMSEYLRQFLWFNSNILIDRLPFFWEAFSQRGILFVNDLLNTDGYFMTY FT RSFIGKYGRICDELKFIQLLSSIPFKWKQSIQGIPLCETAIMPVTKYSKWI FT KGLKINKELYRFYLVERKCVDFHCKAQEYWTELFNDQIQWDVVYSLTYKTT FT IDSYLRSFQFKLLHNFLALKNKLYKWKLSENTLCSYCKTEEETPVHIFCQC FT SFLVKFWGDVENWIRKHMNMVFRFTSRIILFGSLGYPIILNTLLLIAKVYI FT FKCRHLKIPTIDGFLAYVHYYYKVEYIIAQNLQKIEKHLLKWDILVMKMQI FT VLPGLGCFLYIHFVLLCLFSKMNKDYVKK" XX SQ Sequence 5243 BP; 1706 A; 838 C; 1082 G; 1617 T; 0 other; taacttgaga gatgacgtca tggcagcacg ggagtagtcg gtcactgagg tagctcccca 60 aagttgataa tttttgacat atttagagcc tgttcggctt gaattttgag aagcctcttg 120 tgtgtcaacc cttcctgggt gcagacgcgt accaaactgt ccaaatgagt actcgtgcaa 180 cgcgggggtc gacaagaaag agggacgaac ccgccggcca catggagggg aaacatggcg 240 acgacctgga cgatgttacc ggcgcgtcgg agttcgaggc ctttgtgagg gaacagttta 300 agaaactcct agaaggccag aaggagcttc aggaggagat aagggctcta gaagacaacg 360 tggagaaaaa tgttcaactc ctggaagccc gtatggagca aatggaaaag aaaatggagg 420 cctctgagac aactcagcgc gagtctgcca aacagattaa atcaatcaca cagaggctga 480 aagctacaga gcaagcagtg gaggaattcc aaacaaaatg taacaagttg gagcggttct 540 cacgtcgaaa taacgttcgc attatcggtc gcaaggtaca gaaaggagag aactgcgtct 600 ctacagtaga gaagatttta gaggacaagt tcggacttga tcatatcaag attgaacgtg 660 cacacccaga tggtccgaga aacagcgaca gtgacggagt tccccaacat attctcttca 720 agctgaacag ttacgcggat aaggtgtcca taatgaaagc tgcacgcaga aagctacagg 780 acgagcctta ctacttcact gacgatctca cagtcacaga tttgcaagag aagcgcaagt 840 ggatggagaa agtgagacat gcctataaag aagggaaaaa gtaccgcttc tataatggat 900 actggcgtga cggacagggt aaagtcgtaa gttttgacta gagtttgaat tcattgaaac 960 aaggtgctcc ttgtttaaaa tcatactttg tttttcacaa tgtcctgtca gcaactgtaa 1020 cttattgctc taaaattgat gattcactta gggggctagt tcatagtatc tgcacagtga 1080 ccaatgaact accgtgaagc ctcgtcaatt ttttcgttac ttgtatgtgt atattttgtt 1140 tactcatttc atgtttttca gtgtactacc agtttatttg caacttccca cggaaaatat 1200 gtagggtatt ctgtgttacg taaaatgtgc agctgtgaat gtggcacaga cagatataat 1260 tgtgcagctg tggggtattt agtgtaccct gttttgatga tggtatgcga aatttgcatt 1320 tatataggta gacgtaattt atgcaaatga aatgatggtg cccggacagg aattttccct 1380 cgctagtttt aactgcagag ggttaggaaa caatgcgaag cgaatggaaa catttacgtt 1440 tttgaaagac aagccccact ctgtcatttg cttacaggaa acccactcaa cttttgagaa 1500 agagaatcag tggacgacag attggggcag tcagattttg tttaaccatg gaacaaacaa 1560 tagctgtggg gtacttatta tgtttcagaa aaatttctca taccagattc atgaaattag 1620 aaaagactat ggcagatata ttttgattga tttagagtgt ggtaattttc gatgttgttt 1680 atgtaacata tatgctcaaa ataaggatga tgaaggtttt tttgaggaaa tacaaaagaa 1740 tatctcggag atggaggctt gtaatgaaaa cattatacta ctggtgactt taatacagta 1800 cttgatacta aaaaagatag agcaggacaa cattttgtaa actatcatcc caaggctact 1860 caggcaattc gtgaattgtc tgccactttt gatcttgttg atgtatggcg tttaaataat 1920 ccagaaaccc gtagatacac atggaggaga caaaaacagg ctagtagatt ggattacttt 1980 ttagtgtcct tttccctctt gccaaaggtc actaaagtag ctatagctga atccttcaaa 2040 tctgatcata ggttaatttt tcttaatgtt gtacctggag atctcaaaca cggacctgga 2100 tattggaaat ttgactccac tcttttgaat gataacatgt ttcttaaaga aacaagaaag 2160 tttattgaag aattttttct ctttaataaa ggttcagcag aacccagagt tgtatgggat 2220 acatttaagt gtgcctttcg aggacactgc ataaaatata catccttcaa gaagaaacaa 2280 atatttgaga aagaaaaccg attactagat gaaataagaa gattacaaaa tgatattgat 2340 agtttgggat ctcccccact ttgtttgctg gaggaattag aacaaaaaga gagggtactg 2400 gagaccctct acgaagaaaa aagacaaaaa atcatgttac attctcgcgc aacatggatg 2460 aagctaggtg aaaaacccac taaatatttc tttagcctgc taaaccgaaa caaaactaaa 2520 aagaatgtca ctaagattgt gttgcaggat gacaaatttg taactaatcc ttctgatata 2580 ttgtgcgagg aagtaaaatt ctactcttcg ctttattcct ttgaagatgt taagaaaccc 2640 ttgaaggaag agaaagagta taacatgttt tttgtagagg gtaacatacc tactttaact 2700 aaacaacagt ctgaaatatg tgaaggtagg ataaaagtaa aagaactctg ggattcaatc 2760 tcaactttta agaaagggaa atcacccagg gttgatggga ttactgtgga agtttataaa 2820 gaattttttt atctattaaa gcagcctatg ttggattgct ttaataaggt gtatgattta 2880 ggacaactgt cagaaacgca aaaagttggc attatttcgc tactcttgaa acaagatgca 2940 acaggaactg acaaaaaccc agtatattta aaaaattggc gccctcttac tttgttgcct 3000 tgcgacacaa gaatcttatc aaaatgtatc gcgcttagaa ttaagaaagt aatatctagt 3060 attatacata tggatcagac tgggtttttg tctggtagat ctataagtga caacattcga 3120 aaagtattgg aaattattga attttgtgaa aaggataata ttgaaggact tatctttata 3180 actgattttg aaaaggcgtt tgacaaaatt agaatagatt ttgtgttaga agctctgaaa 3240 aagtttgggt ttggtaaatc tttattacag tgggtaacca ttatgtatac aatgattact 3300 agtaggataa gtaaccaagg ttatttatca gagcccttta ggcttctccg ggggctaaga 3360 caaggttgtc cattatcgcc ctatttgttt gttttatcag tagaattgtt gtccattaga 3420 gtgcgagcaa atgaaaatat tgaaggattg gtgtgccgaa aagagccatg caaattgtct 3480 caatttgcgg atgatacaaa ttttttcctc aaacctgttc ttacaacatt ggaaacatta 3540 tgtttagagt taaaaaactt ctcaaatatt tctgggcttg tgccaaattt tgacaaatgt 3600 ataatcttaa gaatgggtcc actgaaggac actggttttc agttaccatg tacttttcct 3660 gttaagtggt cagatggtcc agctgacctt ttgggaattc acattcccca agatatgaac 3720 actttgattt ctgataattt cagtagaaaa ctacaaaaag tagataggat actccagcct 3780 tggaaaggaa agactcttac cctgtatggc aaagtgacac tgataaatac tttggttgta 3840 tctcagttta catacttatt catgtcactg ccctctccag atgctgagtt ttttcagact 3900 tatgagaaaa aagtatttaa gtttctttgg tcaggagggc cagaacgtgt caaaaggaat 3960 atagtttata atatgtatga atatggtgga ttaaaactga tcaatttaga agcctttaat 4020 gctgcactta aggcagcatg ggtacctaga atgtatttga acaaaagctg gttttgtagg 4080 aagatgttaa atcattgtat acactttaac tatgatattt ttccattctt acaacttacc 4140 gaagcacatt ttcacaatgt tgtttctaaa cattcatatt taagtacttt ctttgaagag 4200 gtgatgttag cctggctgaa ggcccagaaa aaacctcttg ataatatgtc tgaatacttg 4260 agacaatttc tctggttcaa ttccaatata cttattgata gactaccttt tttctgggaa 4320 gcctttagcc aaagaggaat tttgtttgtt aatgatttat tgaacactga tggatacttt 4380 atgacataca gatcatttat tggtaaatat ggaaggattt gtgatgagct taaattcata 4440 caattgttat ccagtatccc ctttaaatgg aaacaaagta ttcaaggtat tcctttatgt 4500 gaaacagcta ttatgcctgt tacaaaatat tccaagtgga taaaaggact taagataaac 4560 aaggaattgt accgttttta cttagttgaa aggaaatgtg ttgacttcca ttgtaaagcc 4620 caggaatatt ggacagaatt gtttaatgac cagatacaat gggatgtagt gtatagtttg 4680 acctataaaa caactattga ctcctattta agatcctttc aatttaagtt gttgcataat 4740 tttcttgctc ttaagaataa gctttataaa tggaaattat cagaaaatac actttgttct 4800 tattgtaaga ctgaggagga aactcccgtc cacatctttt gtcaatgttc ttttcttgta 4860 aaattttggg gcgatgtaga aaactggatc agaaagcata tgaacatggt atttagattt 4920 acaagtagga ttattttatt tggttcttta ggctatccta tcattctgaa cacattgtta 4980 ttaattgcta aggtgtatat ctttaaatgt agacatttaa aaattcctac aatagatggc 5040 tttttagcct atgtacatta ttattacaaa gttgagtaca tcattgctca aaacttacag 5100 aagattgaga aacacttgtt aaagtgggac atattggtta tgaaaatgca aattgtcctt 5160 cctggcttag gttgcttttt gtatatccat tttgtattgt tgtgtctttt ttcaaaaatg 5220 aataaagatt atgttaaaaa aaa 5243 // ID BEL-599_AA-I repbase; DNA; INV; 5438 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-599_AA_; KW BEL-599_AA-LTR; Pao_Bel_Ele12; BEL-599_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5438 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4492-5067] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 39..3584 FT /product="BEL-599_AA-I_2p" FT /translation="MEAIRKRRNVLFERMKWELSNASAVEKRKPPVGEVQE FT RMHKLSELCEQFDKVQSEIEDQTEAIGEISSIFNHRLEFEELYYQVKGRYM FT KLLEEHNPHGSSEGTIVEPADDLREAIRLLLDSQRILMTSQAAASNNVSQL FT AGQIGSVSLSHPVADAQASGGQVPLEVRLPTIALPVFKGDRKHWTSFKDLF FT ESCIHSKNLKNSVKLQYLLSHLDGEAKKLVSSFTITDANYVEVWERLNEFY FT DKKKYTVAALVKEFVDQPPVNAPTLIGLRKLVSTSDEVVRQLKALGEQYES FT RDPWLIHLLLEKLDRETRSLWAQKLVDEENPSFQDFITFLERRCDALETCA FT SFSKKGADTATKKEQDRRVDQSTKPGKQIQSFLASTQQPCPQCSEAHTIYQ FT CASFKRMAANDRRDFVQRAKLCFNCLKVTHSAKNCTSAVVCKQCKKKHHSL FT LCTSGDSSASDSGQPREGPAAANQQHSGEAEHADSVASYVADLKTSCPANF FT LSLLPTAVVKVLGKNNVFHEVRAMVDSACMNSLISKPAFDRLGLERRNANI FT LVSGITDGKPSKTTGAVTLQISSRFDDRIVIVVEALILNHLVPDQPSQHFD FT IDTGALDEASLADPTYNKCGKIDLLLGIEAFFSILEPGKLFDGRGIPIAQN FT SVFGYLVGGHFNTSHVTEGRAMLGLTATMNLDRTLRKFWEVEEVPKSKQLT FT PDELRAVEHFHSTLSRNETGRYTVRLPFDSSKPELGESATAAIRRFKAMER FT KFSIDPELQQHYQKFMTEYEALGHMEKIPPCEVAVAPEQSFYLPHHGVWKV FT DSETTKLRVVFDASSKSASGVSLNDRLLVGPNVNESLFNVFTRWRTYKIAF FT CADIEKMYRQVLVAKEDADFQRIVWRDHPEQPLEHYRLLTVTYGTASAAYQ FT AVATLQQVAADNKETHPMAAERIPKNFYVDDLLSGADSIEEAKFLRNDIVQ FT VLNDGGFVLRKWSSNDPQILEVDAVQDDPIHLKLPQEDDSVKALGIRWNPQ FT EDSFSFKLTFDIDSTNTKRQLLSDSSKFFDPFGWIAPIVIRMKILFQHLWL FT YDLSWDDPLPAFILEEWIALKETLHHIERIRIRRWIPHSGGKLQLHGFADA FT SEAAYAAVVYVRTVDQEGRIETNLVVAKTRVAPITPNIAATSGADGGRATR FT RVDGQGTGVVFTPRSKNICVERF" FT CDS 3496..5409 FT /product="BEL-599_AA-I_1p" FT /translation="MAAELLVELMVKVLESFSHLEVKIFAWSDSEIVLHWL FT SSVPRKWKTFVANRTSKILQHLPRNHWRHVSSRDNPADCASRGITPLELLM FT HPLWWRGPGWLSETEDKWPAQPGRLVEEDELLEQRTSVTCLFANATPPSRG FT GHETLTFLLNRFSDLTRIRRVLCWINRLCHNGLARQRGAARLEGPLTPKEI FT NDACLQLARAAQHDCFKKEIDCLVKNDPLPGNSKLKSLFPFIDADGTLRVG FT GRLHNSSQPYDVRHPVIIPKEHQYTKLLLTETHLRNLHAGPTLMIATLNQR FT YWIVGCQTVVRSSVASCTRCCRLKGKTATQLMGSLPSVRTTPARPFVHCGV FT DYAGPILLRSSNLRTAKTIKGYVAVFVCLATKAVHLEAVSDLSTNAFLAAL FT KRFCGRRGLCSEIWSDHGTNFVGADRAIREHLQSPEFNQAVSRYLSDLKIK FT WKFITPSAPHMGGIWEAAVKSFKKHLRAVLGNTTLTYEELSTVLTQIEACL FT NSRPLCQLSTSVDSYEALTPGHFIINQPLNLLPEPNINHIQEGRLNRWQRV FT QRHVDDIWARWRNEYVATLQPRNKWQSVQQNLSSGQLVLIKNENTSPAAWE FT LARIVATHPDQYGVVRTVTVRRGQNEYQRAVHKLVPLPMD" XX SQ Sequence 5438 BP; 1376 A; 1427 C; 1417 G; 1218 T; 0 other; ttttggacct tcgtcgccgg atatttcgcg aaaccgctat ggaagcgatt cggaaacgcc 60 gcaatgtgct tttcgagcgc atgaagtggg agctgagtaa cgcgagtgcg gtggaaaaac 120 ggaagcctcc tgtcggcgaa gtgcaggagc gcatgcacaa actttccgag ctttgtgaac 180 agtttgataa ggtgcaaagt gaaatcgaag atcaaactga agcgattggt gaaatcagct 240 cgattttcaa ccaccggttg gaattcgagg agttatacta ccaagtgaaa gggcgctaca 300 tgaagctgct cgaggaacat aacccccacg gaagcagtga aggaacaatc gtcgaaccag 360 cggatgatct acgagaggca atcaggttgc ttctggattc gcagcgaatc ctaatgacgt 420 cccaggctgc agcttcgaac aacgtgtcgc agttggctgg ccaaatcggt tccgtgagtc 480 tttcccatcc ggttgctgac gctcaagcgt ccggaggcca agttccgctc gaagtacgct 540 tgccaacgat tgccctcccg gtcttcaaag gcgaccggaa acactggacc tccttcaagg 600 atcttttcga aagctgcatt cattcgaaaa acctcaaaaa ctccgtgaag ctgcagtacc 660 tgctttccca cctggatgga gaagcgaaaa agttggttag ttcctttact atcaccgacg 720 ccaactatgt ggaagtatgg gagaggctga acgagtttta cgacaagaag aagtacaccg 780 tcgcggcact tgtcaaggaa tttgtggacc agccgccggt taatgcgcca acgctcattg 840 gacttcgaaa gctcgtttca acgtcggatg aagtagtgcg gcagctcaag gctctgggag 900 agcaatacga atcccgtgac ccatggctga ttcatttgct gttggagaaa ctcgaccgag 960 aaacccgttc gttgtgggca caaaagctag tcgacgagga gaatcctagc ttccaagatt 1020 tcatcacgtt ccttgagcga cgatgcgatg ccttggaaac ttgcgcgtcc ttttccaaga 1080 agggggccga caccgcaacg aagaaggagc aggatcgaag agtggaccaa agcacgaaac 1140 ccggtaagca gattcaaagc ttccttgcaa gcactcagca gccttgcccg cagtgctctg 1200 aggctcacac catataccag tgtgccagct tcaaaaggat ggcagcgaat gaccgtcgag 1260 attttgtgca acgagccaag ctctgcttca attgccttaa agtgacccat agtgcgaaga 1320 actgcacgtc tgcagtggtg tgcaaacaat gcaagaaaaa acatcactca ctgctttgca 1380 caagtggtga cagctctgcc tccgattccg gccagccccg cgaaggacca gcagcagcaa 1440 atcagcaaca ttcaggtgaa gcggaacatg cggattctgt ggcatcatat gtcgcggatc 1500 tcaaaaccag ctgtccagcg aacttcctca gtttgctgcc tacggctgtc gttaaagtgt 1560 tggggaagaa caacgtcttc cacgaagtac gggcaatggt ggactctgcc tgtatgaatt 1620 cgctcatcag caaaccagcc ttcgatcgtc ttggcctgga aaggcgcaac gccaacattc 1680 tcgtgagtgg catcaccgac gggaagccca gcaagacaac aggagcggtc acactacaga 1740 tctcatcgcg attcgacgat cgtatcgtca tcgtggtgga agccttgatt ctgaatcatc 1800 tggttccgga tcaaccgagt caacattttg acatcgacac tggtgccttg gacgaggcat 1860 cactcgcaga cccaacctac aacaagtgtg gtaagatcga ccttttattg ggaatagaag 1920 ccttcttttc cattctagaa ccgggcaagc tgttcgacgg tcgaggcata cccattgcgc 1980 aaaattccgt cttcggctac ctggttggtg gtcatttcaa cacgtcgcat gtcaccgaag 2040 gtcgtgcgat gctcggacta acagcaacca tgaatctgga tcgaacgctc cggaaattct 2100 gggaggtgga agaagtgcct aaatccaagc agcttacccc ggacgagctg cgcgctgttg 2160 agcactttca ctccaccctc tctcggaatg aaactggacg atatacagtt cgcctaccct 2220 tcgatagttc caagccggaa ctgggcgaat cagctaccgc tgccatccgg cggttcaagg 2280 ccatggagag gaaattcagc atcgatccag aattacaaca gcactaccag aagttcatga 2340 ctgagtacga ggctctcggt cacatggaga agataccgcc atgcgaggtg gcggtcgcac 2400 cggaacaatc cttctacctt ccgcatcatg gtgtgtggaa agtggacagc gaaacgacga 2460 aacttcgcgt cgttttcgac gcctcgagca agagtgcttc cggggtgtca ctgaatgatc 2520 gcctgctagt gggaccgaac gtgaacgaat cgctcttcaa cgttttcacc cgctggcgca 2580 cctacaagat tgccttctgt gccgacatcg aaaagatgta tcggcaagtt cttgtcgcca 2640 aggaggatgc cgatttccag cgaatagtgt ggcgcgacca tcctgagcaa cctctggaac 2700 attatcgttt gctgaccgtc acctacggta cggcgagcgc ggcgtaccaa gctgtggcga 2760 ccctacaaca ggtagcagct gataacaagg aaactcatcc gatggcggca gagcggattc 2820 cgaaaaactt ctacgtggac gatctgctct ctggggcgga ttccatcgag gaagccaagt 2880 ttctccgcaa cgacatcgtt caggtactca acgatggggg cttcgttctg cgcaagtgga 2940 gctccaacga tccgcagata ttggaagttg atgcggtaca ggatgatccc atccacttga 3000 agttacccca agaagatgat tccgtcaaag ctttggggat cagatggaac ccgcaagaag 3060 actcgttctc cttcaaactc acgttcgaca tcgatagcac aaataccaaa cgtcagctgc 3120 tatccgattc gtcgaagttc ttcgatccat tcgggtggat cgcaccgatt gtcatccgaa 3180 tgaaaatact gttccaacat ctatggctct acgacttgag ttgggacgat ccacttcctg 3240 ctttcatcct ggaagagtgg atagctttga aggagacact gcaccatatc gagagaatcc 3300 gaatcagaag atggattccg cactctggag gtaagctaca actccacgga ttcgccgacg 3360 cttctgaagc tgcatacgct gctgtcgtct acgtgcgtac cgtggaccaa gaaggaagaa 3420 tcgaaacgaa ccttgtggtg gccaagacta gggtggctcc cattactcca aatatcgctg 3480 ccacgtctgg agctgatggc ggccgagcta ctcgtcgagt tgatggtcaa ggtactggag 3540 tcgttttcac acctagaagt aaaaatattt gcgtggagcg attctgaaat cgttctgcac 3600 tggctttctt cggtgccaag gaagtggaag actttcgtgg ccaatcgaac atcgaagatt 3660 ttgcaacatc tcccgaggaa ccactggcga cacgtgtcat caagagataa tccagctgac 3720 tgcgcatcac gtggcatcac acctttggaa ctcttgatgc atcccctgtg gtggaggggc 3780 cccggctggc tctcagagac ggaagacaaa tggcctgccc aaccaggtcg tttagttgag 3840 gaggacgaat tgcttgaaca gcgaacatcg gtgacctgct tgtttgctaa tgccacgccg 3900 ccttccaggg gtggtcacga aacactgacg tttctgctca atcgtttctc cgatttgact 3960 cgcattcggc gcgtcttgtg ctggatcaat cgcctatgtc acaatggttt ggctcgtcaa 4020 cgtggtgcgg cccgactgga aggaccgcta acaccgaagg aaatcaacga cgcctgtttg 4080 cagttggcaa gagctgctca acacgactgc ttcaagaagg aaatcgattg tctggtcaag 4140 aacgatccgc ttccaggcaa tagcaagctg aaaagtctgt ttcctttcat cgatgctgat 4200 ggcactctgc gggttggtgg tcgccttcac aattccagcc aaccgtacga cgttcgccat 4260 ccggttatta tcccgaagga gcaccagtac accaagttgc ttctgacgga aactcatctt 4320 cgaaatttgc acgcggggcc aactttgatg attgcaacgc tcaaccagag atactggatc 4380 gttggttgcc aaacggtggt gcgtagctct gtggcgagct gcacgcgctg ctgtcggttg 4440 aaagggaaaa ccgccaccca actgatggga agtctacctt ctgtccgcac aactccagcc 4500 cgtcctttcg tgcactgtgg tgtcgactat gcaggcccaa tcctgctgcg ctcatctaac 4560 cttcgaacgg caaagactat taagggttac gtcgctgtct tcgtttgcct tgcaacaaag 4620 gcagtacatt tggaagcggt ctccgatctt tcgacaaatg catttttggc cgccttgaaa 4680 cgcttctgtg gtcgtcgtgg tctttgtagc gaaatctggt ctgaccacgg aacaaatttc 4740 gtgggtgctg atcgtgccat ccgtgaacat ctgcaatcac cggaattcaa ccaagcagtg 4800 agtcgatacc tctcggacct gaaaataaaa tggaagttca tcactccatc cgctccccac 4860 atggggggaa tctgggaggc cgctgtgaaa agttttaaga agcatctccg agccgtgctt 4920 ggaaacacaa cacttacata cgaggagctt tcaaccgtct tgacgcaaat tgaggcgtgt 4980 ttgaattcgc gtccgttatg ccagctatca acatcggtcg actcatacga ggccctaacg 5040 ccagggcact tcatcatcaa ccaaccgcta aatttgcttc ccgagccgaa catcaaccac 5100 atccaggagg gtcgtctgaa cagatggcaa cgtgtacaac gacacgtgga cgatatttgg 5160 gcacgatgga ggaacgaata tgttgccaca cttcaacctc gtaacaagtg gcaatcagtg 5220 cagcagaacc tgagttcggg gcagttggtc ctgatcaaaa atgagaacac atcgccagcg 5280 gcatgggaac tcgccagaat tgtcgctacc catccggatc agtacggtgt tgtacgcaca 5340 gttactgtac gacgtggaca gaacgagtat cagcgagcgg tgcacaagct ggttccgctt 5400 cctatggatt gaggcatccg cctcaagggc cgggtgga 5438 // ID Kiri-33_AAe repbase; DNA; INV; 4488 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-33_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4488 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 728-728 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 276..1025 FT /product="Kiri-33_AAe_1p" FT /translation="MSVNKHQQNNDSDSKTMNADVLWKGIHRMISSATAKL FT EAMIESYVSNLEACIVCVERRIESIKDEYNNNLDNLTKEINSIRTDHQLKL FT QHLSRMERALDLVFTGVPYHPDEDLNQIYRRIACVIGAPDEHSIVHLKRLY FT KHPVQSGSSPPILCRFAFRGIRDVFFNKYLHGRSLALHHIGFEGKGRIYVN FT ENLTPLSRRILNAAIRHRNEGRLHKVSTKNGLVAVITHKECDVIVVDTLDR FT WLLLESTLS" FT CDS 1505..4333 FT /product="Kiri-33_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLNNVNSDDASTSNLIPRAVLRSALDSDLFNICHINV FT QSLTARRFSKFNELKMNLFDCNLDTICMTETWLDDSIENRMIAVDGYKIYR FT NDRTRHGGGICVYVRKNLVCRVLEASSIVGNDARITEFMCFEVMSGKDRIM FT LAVYYNPPDVDCSNTLMQHFDQFTVKYKSTFFIGDFNTDLLKNNSRQRRFR FT DAISGMSYVCVNNEPTYFHNTGCSMLDLFLTDSLDKVCKHDQISLPGISHH FT DMIFVSLKIEAPNVNVNASYRDYAHFDANALQNAFNAINWNEYFSQDDPDL FT LLQFLNEKLLMMHDSYIPLRMIRPKKNQWFTSDIERAIISRNIAYKNWLRN FT KTSVNSLQYKQIRNRVTNMIRNAKSNYEKRVLNINMPSKQLWSNIKRLGVS FT KDNSSGIDCNASPDDINRYFSSNFSSDSYPNHSFPANSNGFVFREVNDFEI FT VNAIFDIKSNAIGLDNIPIQFIKIVLPLAIIHFKHLFNRIILTGKYPTVWK FT QVKVIPIKKKSKSTDITNLRPISLLCSLSKVFEKILKIQISEYIDHMNFLD FT PHQSGFRKKHSTTTALLKVHDDIAQAIDKKGIAILLLIDFAKAFDRVSHHK FT LLNKLSSKFLFSNTAVSLIKSYLCDRNQAVFHNMIYSSFVDIKSGVPQGSI FT LGPLLFSLFINDLPSVLEYCAVHLFADDVQIYLCCHKNMSLDEVARRINSD FT LQKLFEWSNRNLLPINSTKTKALLINKSRDALRTPDLYLGGENIEFVDQAS FT NLGMIFTSNLCWDAQINQQCRKIYYVLKQLNLVTIHLDAQIKTKLFKALLL FT PHFIYCDFIYSNASMAAMNKMRLALNACVRYVHCLPRYSRVSHLHESLLGC FT SFRRFYEYRLCLNFHKIIKSRTPNYLFSKITRMRQPRTMNFSIPQHYSVYY FT GQSFFVRSIVHWNALPISIKLCSSVIGFRRDLIRFSNMN" XX SQ Sequence 4488 BP; 1395 A; 853 C; 785 G; 1454 T; 1 other; cagttcctga agggatgttg gcaatcgagt gatagattga gttggtgcag ttttcaagtg 60 gttattcccc attgccgtgt gccagataat tgtatcaaat tagctattta aatcgctaat 120 agcagtttac gttgtggtat agtgaaaaat aactgttttg aaaccaaatg cttcggttat 180 tcgtcaatat ttaattctat cggattgctc gccaaaaaca catcccttca gaagtttcat 240 tatacccaac cttgctataa tacatcatta ccacaatgag cgtaaataaa catcaacaaa 300 acaacgacag tgatagcaaa acaatgaatg cagatgtttt gtggaaaggc atccatcgga 360 tgatttcttc tgccactgcg aaactggaag caatgatcga gtcatatgtg tccaatctcg 420 aagcttgtat tgtttgtgtc gaacgtcgca tcgagtccat caaggatgaa tacaacaata 480 atttggacaa ccttacgaaa gaaatcaatt ccattcgtac tgatcatcag ctcaaacttc 540 aacaccttag tcgaatggag agagcgttgg atttagtttt cacgggtgtc ccatatcatc 600 ctgatgaaga tctaaatcaa atatatcgtc gtattgcctg cgttattggt gcccccgacg 660 agcattcgat agtacatcta aaacgactat acaagcaccc tgtacaatct ggatcatcac 720 caccaattct ctgcagattt gccttccgag gaataagaga cgtcttcttc aacaagtatc 780 ttcatggcag atcgctggcg ctacatcaca taggcttcga aggaaaaggt cgcatctacg 840 tgaatgagaa cttgacacca ctctccaggc gaatattgaa tgctgcaatc cgacaccgta 900 atgaagggcg tttacataaa gtgtctacga agaatgggct ggtcgctgtt atcacccaca 960 aagagtgtga cgtcattgta gtggatactc tcgatcgatg gctgcttcta gaatcaacct 1020 tatcctaacc gatttctctt tcctgcctat tttattccat gtttcacatc cattattttt 1080 tccattgctt ccatccgctc ctaaaagtca aaacattatc acctttccca gcttatctca 1140 cttttgtcca tgtatccaat ccttcaawat ccttgttgtt tccttcctga aagtattgag 1200 ggttggggat gtggatatgt ttccactcgt tgttgctgtt gctgacgctg ttggacgctg 1260 ttgctgttgc tgctataaat ttgctgttga ttaccacatt aatatgctcg atgctgctaa 1320 tggattattg agtgccacgt tttgtttttc ttcagttgct tagagccact tgactgcacc 1380 acagagtctg tgaaaattat gttagttaga aattttagtt tatggttcct ggtaattagt 1440 atctgttcat gttgtgtggg tctcgataat gatcttagct taatccaact tcttgttcct 1500 ataaatgctg aataacgtta acagtgatga tgcttccaca agtaatttga tacccagggc 1560 ggttttgagg tcggctttag attcagattt gttcaacatt tgtcatatca atgttcagag 1620 tttaacagcc cgtagatttt caaagttcaa tgaactaaaa atgaatttgt ttgattgtaa 1680 tttggatacg atctgtatga cagaaacatg gttggatgat tccattgaaa atagaatgat 1740 agctgtagat ggttacaaaa tttataggaa cgaccgtacc cgacacggtg ggggaatctg 1800 tgtgtatgtt cgtaaaaact tagtatgccg tgtattagaa gcatcttcca tagtaggtaa 1860 tgatgcacga attacggagt ttatgtgttt tgaagtcatg agtggcaaag atcggattat 1920 gttagcagta tattataatc ccccagatgt agattgctca aatactttaa tgcaacactt 1980 tgatcagttc actgtgaaat ataaatcaac ctttttcata ggtgatttta atacagatct 2040 tctcaaaaat aatagcagac aaagacgatt cagagatgct atttcaggca tgtcttatgt 2100 atgcgtcaat aatgaaccaa catactttca taacacagga tgctcgatgc tagacctctt 2160 tttaactgat tcactggata aagtatgtaa gcatgaccag atttcgttgc ctggaatttc 2220 ccatcatgac atgatatttg tatcgttaaa aatagaagca cctaatgtaa atgtcaatgc 2280 atcctaccga gattacgctc atttcgatgc taatgcttta caaaatgcat tcaatgctat 2340 aaactggaac gaatactttt cacaagacga tcccgatctt cttctgcaat ttctgaatga 2400 aaaattactc atgatgcatg attcttacat acctctcaga atgatcagac ccaaaaaaaa 2460 tcagtggttc acatctgaca tagagcgagc gattatttca agaaatattg cttataaaaa 2520 ttggcttagg aataaaacaa gtgtcaatag tttacaatat aagcagattc gtaatagagt 2580 taccaacatg attagaaatg ccaagtcaaa ctatgagaaa cgtgtattga atattaatat 2640 gccaagcaaa caattgtgga gtaacattaa aaggcttggt gtttccaaag ataatagttc 2700 cggtattgac tgtaatgctt ctccagacga cattaaccgt tacttctctt caaatttctc 2760 ttctgatagt tatcctaatc attcatttcc agctaattca aatggtttcg ttttccgtga 2820 agttaacgat tttgaaatcg tgaatgctat attcgatatc aagtctaatg ctataggact 2880 agataacatc cccattcagt ttattaaaat agttttacca ttagcaatta ttcattttaa 2940 acaccttttc aacaggatta tattgaccgg caaatatccc actgtatgga agcaagtcaa 3000 ggttattccg attaagaaga aatccaaaag tactgatatt acgaacctta ggccaattag 3060 tttactgtgt tcactttcga aagtcttcga aaaaatcttg aagattcaga tcagtgaata 3120 catagatcat atgaacttcc tcgatccaca tcagtctggt tttcgtaaaa aacacagtac 3180 cactactgca cttttgaaag tacacgatga catcgcccaa gctattgaca aaaaaggaat 3240 cgctatactt ctactaatag attttgctaa ggcctttgac cgtgtatccc atcataaact 3300 gctcaacaaa ttatcgtcaa agtttctttt ctcgaacact gctgtctctc tcattaaatc 3360 atatctttgt gatcgtaatc aagctgtctt tcacaacatg atatattcat ctttcgtaga 3420 tattaaatct ggcgtaccac aagggtcaat tcttggacca ttattatttt cattatttat 3480 taatgattta ccgtctgttt tagaatattg cgctgtgcac ctgttcgcag atgatgtgca 3540 aatttatctc tgctgtcata aaaatatgag tctagacgag gtagcaagaa gaataaattc 3600 tgatcttcaa aaactttttg aatggtcaaa cagaaatctc ttaccaatta attcaaccaa 3660 aaccaaagct cttttaatca ataaatctag agatgcactg agaacaccag atttatattt 3720 gggtggcgaa aatatagagt ttgttgatca agcttctaat cttggtatga ttttcacatc 3780 taatctgtgt tgggatgctc agataaacca gcaatgtaga aaaatatatt atgttttgaa 3840 gcaacttaac ttagttacga ttcatttgga cgctcaaata aaaaccaaac ttttcaaagc 3900 actcttacta ccgcatttta tctactgtga ttttatatac agtaatgcgt ccatggctgc 3960 catgaacaaa atgcgtcttg cccttaatgc ttgtgtgcga tatgtccatt gccttcccag 4020 atattccaga gtttcacatt tgcacgaatc attacttgga tgctctttcc gacgtttcta 4080 tgaatacaga ttatgtttaa attttcataa aataattaaa tctagaaccc cgaattatct 4140 attttcaaaa ataactcgta tgcgacagcc tcgtacaatg aattttagca tccctcaaca 4200 ttattctgtt tattacggcc aatctttctt tgtgcgaagc atcgtacact ggaatgcgtt 4260 acccatcagt attaaattat gttcttctgt tattggattt aggcgggatc ttatcagatt 4320 cagcaacatg aattaggtag tacggaacct tcaggaacta ggaaaataga attgtataag 4380 ataagtgaat taaatcatga actccaattt gaagtcagtt tgtggcaatt taaaagacaa 4440 tgtcttacgt cgcaataatt attacaaata aataaataaa taaataaa 4488 // ID GilD repbase; DNA; INV; 2980 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Giardia intestinalis non-LTR retrotransposon, consensus sequence. XX KW Non-LTR Retrotransposon; Transposable Element; GilD; endonuclease; KW reverse transcriptase. XX OS Giardia intestinalis OC Eukaryota; Diplomonadida; Hexamitidae; Giardiinae; Giardia. XX RN [1] RA Burke D.W., Malik S.H., Rich M.S. and Eickbush H.T.; RT "Ancient Lineages of Non-LTR Retrotransposons in the Primitive RT Eukaryote, Giardia lamblia."; RL Mol. Biol. Evol 19(5), 619-630 (2002). XX RN [2] RA Arkhipova R.I. and Morrison G.H.; RT "Three retrotransposon families in the genome of Giardia lamblia: RT two telomeric, one dead."; RL Proc. Natl. Acad. Sci. U.S.A 98(25), 14497-14502 (2001). XX RN [3] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [3] (Consensus) XX SQ Sequence 2980 BP; 717 A; 960 C; 876 G; 427 T; 0 other; cacagctgga cctgaggcga aggagagccc gcgaggaccc gatccacatt atgaacgcaa 60 gatcgtacct ccgggacccc agagagcaag ccgtgagtct ctactcgggg gaccactccg 120 aagacgtttg ccctctggca ggacacacac gctggctctg tggcctaccg tgcgagagcg 180 ccaagcaact aaccgagcac ctgagagaag caaacgatga cagacacaga gtgctgggcg 240 acgacaagct gatcattccg gcccacatac cgaaccagac agagcctgcc atgctcatcc 300 accagggtgc gcgcatgatc gcgagcatcg agcaggccgg gactggggtc gcgctccccg 360 ctatcagggg cctgaacgcc ctggtgggaa gccgccagag gactcctctg ccaagcaaca 420 ccgcggagca gtcccagagt aaggcagacc cagaccccga gaaggcgaga cgccggatag 480 caaacagggt gaacaacgcg ctgagcttcg gagcagttgc taaggctctc cgcgcccttg 540 atgacacacc catggtagac aagccagaga aggccaaggc cgccctcagc gcgctacacc 600 cgtgcgtcct ccccgaaggc tacgttggcc ttgactaccc tccatcgatg agctcgacgg 660 ggctggagcc ggcaaccgag gaggaaatga ggaaggccct gtttgggact ctcaacaaga 720 aggcctcagg cgtgtctggc ctcgggccgg tccagctcaa ggcgatgaag cagagcgaca 780 gcttcgtgaa atacctcacg caggcgtaca acgagctgac cacccacccc gaaggagtac 840 ccgatgtgac ggccatgttc gagttccggg cgatcctcat cccgaaagag tccgacggat 900 acccccgatc gcgataggcg gatggtcacg aacatcttcc accgcgtcct gctgaagcgg 960 ctgatcaggc atgccaacta cctctcatgc gaacagatcg ccttcaagca gaacgcatac 1020 gcggtaggcg tgaggagggc ccacgagctc atcagcaggc cggggatgca cgccgtctcc 1080 ctcgacataa agaacgccct caactcgctc ccgagggcag aggtcgtcag agccctgaac 1140 gaggcgggcg tcccgaaggt cctggtcgac tacatcgggg acttcctcga cctgaggcac 1200 agccgggacg tgaagtgcga ggtctgcggc gtcccccagg gcgacccgct cagcacgttc 1260 ctcttctgca tggccatcga gcgcctcctg agacgcctca aggagagggg catcaccttc 1320 cttgcctatg cggacgacat cgtggcgttc cacgacgaag ggtacccagc aggtgtgata 1380 acacagctgg ccacagcaga ggctgccgcc atgggcctga cgatcaggag cgacaaatgc 1440 aagtccacca tggcgggcga ggaggtgacc ttccccaacc acccggtgtc ccccacccct 1500 gccagtctcg cgccgaaggc catcgcaggt gccgagactg ccttgaagaa gatcgagaat 1560 gcccccataa ccacccacca gaagctcatc ctgctgtcac tctgcgtggt cccgatggtg 1620 agctacgccc ccctggtcga gataacatcg gacaaggcgg actacgagga gctcgacaga 1680 cgtcgcacaa agcttcagca agctaaccag cagaagctgc gatcgcctgg tcgacttcct 1740 ggcgtaccaa aggagaaggg cggcctgggc ctgctcatgc cggggctcta ccacgacgag 1800 ctgcggaggg tgtgggcctc cctggacgtc aacaagggga ccagagaccc ggtggagctg 1860 ccggtggaag gccaggcact gcaagggaag gggcctactc caaccgcctc ctcagccaca 1920 caccagcttc cagacagagc ggcgacgtac tgtctgcaga tgtgcggcgt gctgaccgag 1980 gaggaaaggg ccctgaagga gaagggcggc tcgatcatgg cgcagaaaga ggcacctgcc 2040 cgggctgcca cacggcgagc cctcgaccgc gacatgaact gcccgaaagc ggcgtgaccc 2100 tcgaccacga ggcacgacat gatcgtcgcg tgcctgtgcc tccgcgtctc ggggaagctc 2160 aaccggagct ggaaatcaag acccacagcc acctcagcga ccagaaccag aagcccgaca 2220 tctggctcgt cgacaagtgc aaagcgatag atgtgggtgt catccagctc cgccagatgg 2280 atgcctacta caacgagaag gtcaagaagt acagggacgg gacgctcccg atcatctacg 2340 ggacagacgg atccctccac cccaagtcca aggggcacct cgaggagctc cacgtggacg 2400 tcgagaagct ctctgtccgg gccgccttcg ccatcgccca catggctgca gaggagaacg 2460 agcatccgac gaagctgatc agacggaccg ggccgaaagc ctggtcaccc agagcgaccg 2520 tctcctgtca tcctacttct gacagctgaa tgaatccacc cctgggacct gtcgacaacg 2580 accgagaagg ggcacaagac tggacacaca accgagttga atcccaacca gaagagaaag 2640 ccccgctagc tgctgctgat atacatggac agcattgaga atgacaacaa agaccagagc 2700 ttcactcacc gtgaagggag aacaaggaga ggcgccctag tagaccagcg agcctgcagg 2760 acggcctgat ctccaaaagg acctatgtgc acacctagac cgagcacacg cccgtagaac 2820 agctgaggaa cgcagatagc tcctccctac taccactgtg gtctttatat tgcatacacg 2880 accaaccctc gtctccggac atcgccaaaa attgagcggg aatttactcg cgagatttct 2940 ggacagcgcc cgcaggcgag ggccccagca gcacaccact 2980 // ID Mariner-10_HM repbase; DNA; INV; 3631 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3631 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 227-227 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(721..981,1002..1256,1434..2954) FT /product="Mariner-10_HM_1p" FT /translation="MKRKFQLRQEKLQSIRGKSKPEASIGRKFTDAKNLQL FT EKAVQFCIDHSVRGYKALKTGLFPLVKDRETINRRLDRIIKNGKYKFFTEI FT SKELWFFLGEERRYCSIFTLEEEEAVVKYVKNRNRSLQGINKTELTKLLLD FT ILRIRDYMNKRCKGGRKLIKLSPNAKSALVNGKTIFYCFRLSRSFWTRFHA FT KHSSLTIKRQGQVSINRALNCTREMACNHLNDLAIELQAAGIMKNAEKVDE FT GVWKGDIDTTRIFNDDETPQFVNYGVDGTSTGLVYAGKGEQCLKMHKENRE FT CVSIHPFVSFSGDVCICQVIFSSVGITSQMTPQVAVEKISHLLVTTSENGI FT SDNDTFKAACQEFDDYLTEKAIERPVVLLSDGHSSRFSYDGLIYLLSKQIW FT LFISPPDTTGVTQLLDQVNKNLHHEYRIAKNYLFNPMQSINREGFMTILGN FT IWEKWTSKSVLVSSAKRVGITPFELNVNFMQQDKFERAAACIHKEEPASTS FT FKKPESTISSPVGIRKGSASYWKDKFDQSQILIREMAEQSLQLENIPDLLT FT IKKIKPNLEKISTRVTQVYGSMRAKDVAAKVKDIKEEKRQKITAKEEAKKK FT KEEAKEKFIKCKVECFCKQRKCQAFGLKQCPNCLDVLRSICSKVSCRVDGV FT KPVMLLPANQHFSTSKRLIYDHYNECNDNQ" XX SQ Sequence 3631 BP; 1269 A; 538 C; 626 G; 1198 T; 0 other; caggtatacg taccgcaaaa acaccaaaaa aagcacatct caataacttt tttattattt 60 gcttttgaga aaaactgttt tgaatgtatt tagagttgct taaaatcagt tcttcccagc 120 tagtttgatt aatttttaag aaatagtttt gttattaatg ttaactaaac ttttgttgtt 180 ttttgtatgt gtaccccaaa aaccgcattc aatttatcga ttgtatctct gcagctagtg 240 aacctatgaa gctgaaattt ggcgtgtcct aatcttattt gatctgcttt aaataaaacc 300 agttactgct aacttaatct tggagaaaat acccgatgca ccacaaaaac cacaagatag 360 gtaccccaaa aacaacattg atgttttaaa aataacattt tataggctat acatgtgttc 420 tcgcttttta ctatattgat tctatgtaac ctcttgattc taaagatgtc attgcagtgt 480 tttaaacaaa gtttgcatcg tgttttttac gtattaaaaa tcaatatcat ttttagaaca 540 taatagttgc aagaatttat taaaacttac tttgtgtttg aacctagagc aaaatctaaa 600 tttttacttg aaagttgttt tttagtattt agatatgttc tgtaagcaac taatgtactc 660 taaaatgttt aacaataatt tttttaatca aatagagtta gaaaaaaatt aacatagaaa 720 atgaagcgaa aatttcagct cagacaggaa aaattacaat ctattcgtgg aaaaagtaag 780 cctgaggcat caattgggag gaagttcaca gatgcaaaga atttacaatt agaaaaggca 840 gtacaatttt gtatcgacca cagcgtgcga ggctataagg ccttgaagac tggtttgttt 900 ccactagtaa aagatagaga aacgattaat agaagattag atcgaataat aaaaaatggt 960 aagtataaat tttttacaga ataatgattt aatttagtta gatatctaaa gagctttggt 1020 ttttcctagg cgaggaacgt cgctattgta gtatttttac acttgaagag gaggaggctg 1080 tagtgaagta tgtcaagaac aggaatagaa gtctgcaagg tatcaacaag acagagttga 1140 ccaaactcct tcttgatatt ttaaggataa gggactacat gaataaaaga tgcaaaggag 1200 gtcggaagtt gattaagttg tctcctaatg caaaatcagc tctggttaat gggaagtaag 1260 tctaatgatt ttttgttcat gtttgatcta aagaaaaaat gaaatttaca taatcaaaat 1320 tactcaatag taatatttgt agccttgctt tttttttata tatatagcag tgaatgttga 1380 tcatttgtaa gtgcaagtca aatgtttagg gttatttcta tagaatattt tagaccattt 1440 tttattgttt cagacttagt cgttcatttt ggacacgatt ccatgcaaag cattcaagcc 1500 taacaattaa acgccagggt caggtatcaa taaacagagc acttaactgt acccgagaaa 1560 tggcttgtaa tcatttgaat gatcttgcaa ttgaacttca ggcagccggt attatgaaaa 1620 atgctgaaaa agtagacgag ggtgtatgga agggtgacat tgacacaaca agaatattta 1680 atgatgatga gacaccgcaa tttgtgaact atggtgttga tggtacttca acaggcttgg 1740 tatatgcagg gaaaggtgaa caatgcttaa aaatgcataa agaaaacagg gaatgtgttt 1800 caattcaccc ttttgtgtct ttttcaggtg atgtttgtat atgtcaagtc attttttcct 1860 cagttggaat aacaagccag atgacaccac aagtagctgt agaaaaaatc agtcacctcc 1920 ttgttacaac atctgaaaat gggatttcag ataatgacac ttttaaggcg gcttgccaag 1980 agtttgatga ttaccttact gagaaggcaa tagaacgacc tgttgtttta ttatctgatg 2040 gacatagttc acgattcagt tatgacggtt taatatacct cctttcaaaa caaatttggt 2100 tatttatttc tcctcccgac actactggtg taactcagtt gcttgaccag gttaataaaa 2160 atcttcatca tgagtacaga attgccaaga attatctttt caacccaatg caatcaatta 2220 atcgagaggg atttatgacc atcttaggca atatatggga gaaatggaca tcgaaatcgg 2280 ttcttgtttc tagtgcaaaa agagttggaa taacaccttt tgaattaaat gttaacttta 2340 tgcaacaaga taaatttgag cgagctgctg catgcattca taaagaagaa ccagcgtcaa 2400 cttcttttaa aaaacctgaa tcaaccattt cgtcaccagt tggtataagg aaaggttcgg 2460 caagttactg gaaagataag tttgaccagt ctcaaatctt gattagagaa atggctgagc 2520 aaagcctcca attagaaaat attcctgatc tgctaactat taagaaaatt aaaccaaacc 2580 ttgaaaaaat atctactcgt gttactcagg tctatggttc gatgcgtgca aaagatgttg 2640 cagcaaaagt gaaagatata aaagaagaga aacgacaaaa gataacagca aaggaggaag 2700 ccaaaaaaaa gaaagaagaa gctaaagaaa agtttattaa atgtaaagtt gaatgttttt 2760 gcaaacaaag gaaatgtcaa gcttttggcc taaagcaatg tcctaattgt cttgatgttc 2820 tccgttcaat ttgcagcaaa gtgagttgca gagttgatgg tgttaaaccg gttatgcttt 2880 taccagcaaa ccagcacttc tccacatcta aaagactgat atacgaccat tataacgaat 2940 gcaacgacaa tcaataatct ttttttgaat taaagtaatc agttaaagta ttttatttat 3000 ctatgttatc ttaacatttt atttatctat gttatcttaa cattgttgca gtaacctatt 3060 gttataataa tgttataaag tttgcagcca ttttagatgt gttattactg ctgcgttcaa 3120 atattgtttt taagataaat gtatattatt ttctttaaaa aaaatctaat acaatggtaa 3180 tttattgttg ttgtttttta tttcaaaaca tggtctgtaa agtgaattct ataaaatgta 3240 ccccaaaaac aacaataata cttcctttct ttatgagggt gttataaaga tttataacta 3300 cttgagaaaa aaaaactttt tcagtatcta gttataatcc ataggtaata gaaaaactat 3360 attttgctaa cttattctct gatacggctt ctaggctcca caaaacaaaa aaaactacac 3420 gtttttataa ttgaaatttt tactctagtg cagataccaa catatttaac aaataaaaaa 3480 accactaaaa ctgctttaat ttatatgctg acatattagc tatcataata tatcaagttt 3540 tttgaatcct gaattttttt ttacatgcta ggtcacttaa gccaacattg tggtttttgc 3600 ggtacggttg tttttgcggt acgtatacct g 3631 // ID Gypsy10-I_Dpse repbase; DNA; INV; 5950 BP. XX AC Unknown_singleton_21; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10_Dpse; KW Gypsy10-LTR_Dpse; Gypsy10-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5950 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1076-1076 (2009). XX DR Genome; Unknown_singleton_21; Positions 10430 4481. XX CC Positions [2430-2900] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 54..3137 FT /product="Gypsy10-I_Dpse_1p" FT /translation="MKSDSVKVEGSLYSLGSKNPEHADEELSVNSLGFKYP FT EPAVVELSDIQSFNSLGLKYPEHAVVELSDIKSSNSLGLKNPEHAVVELSD FT NEQFKSLGCKNPERAVVEISDIQNYANSELKKMKLYNTITYKEDNLENLRE FT QMVSIIEEDHANINLQLPFRTDIKGEINTEHDRPVYGKQYPYAYSVNDFVN FT NEISKMLDEGIIRPSKSPYNSPVIVVPKKGVNEDGTPKHRLVIDFKKLNEN FT TIPDRYPMQDPSVILANLGKAKYFSAIDLESGFYQILMRESDIEKTSFSIN FT NGKYEFLRMPMGLTNAPRIFQRAMDDILREQVGKTCHVYMDDIIIFSKTIE FT QHYKDLIHIIQILQNANMKISLEKSKFFQLETKFLGYIVSQNIIKTDPDKI FT STIQNYPIPKSIRELRSFLGLTGYYRKFVRNYATIAKPLTKYLSGVNGKVS FT RRASKKTLIQLDEPAIGAFNNLKDSLIAHIELVQPDYNKKFTLTTDASDIA FT SGAVLSQDGNPITFISKTLSKTEQLYATNKKELLTIVWALKNLRNYLYGVS FT GIEIHTDHKPLSFAMSQKNPNVEMKRWFYFIESFTPKIIYKPGSTNVVADA FT LSRIKINHMTDSDVDQTESESDINTQHSAESSFENVIQETSKPLNQFKQQL FT LLAQGRFTVHELVRVYENTRHIIEFDTVDNLLIILKEFIQPHLTTGIHCTL FT EDLYLIQNKLKENFINKFLFTRKFLQDVVNSEDKAIIIEETHCRAHRGLDE FT NYKQITKLYYWPNLHKKLKEYIKNCEICNKNKYQRHPVKIPIGEAPIPAKE FT GDQLHLDIYYAQNLTFITCIDSYSKYLVVKEIQSKLNIENKVMEILQHFPL FT ATSLMTDNEPSFTSAQFRSFIQRSNLTIYYADPRHSTSNGQVERVHSTLTE FT IARCIKDELNLTDYSEIIIRAALKYNLTIHSTTNQMPYDILYNKIEHKHNP FT ILLKEAQTKMLKLHNKDRREKDYKIGEVVYEKKFGERNKLQSRYKKQVVKE FT NRPNKIIINNRNRIIHKDNIKY" XX SQ Sequence 5950 BP; 2358 A; 1030 C; 950 G; 1610 T; 2 other; tccttccgtt gaaggaatat ataataaaaa aatattttga agaagaaaaa gaaatgaaaa 60 gtgattcagt aaaggtagaa ggaagtttat atagcttggg atcaaaaaat cctgagcacg 120 ccgacgaaga attatccgtc aatagtttgg gattcaaata tcctgaacct gccgtcgttg 180 aattatccga cattcaaagt tttaatagtt tgggattaaa atatcctgag cacgccgtcg 240 ttgaattatc cgacattaaa agttcgaata gtttgggatt aaagaatcct gaacacgccg 300 tcgttgaatt atccgacaat gaacaattta aaagtttggg atgcaaaaat cctgaacgcg 360 ccgtcgtaga aatatccgac attcaaaatt atgcaaattc tgaattgaaa aaaatgaaat 420 tgtataacac aataacctac aaagaggaca atttagaaaa tctcagagag caaatggtat 480 ctatcatcga agaagaccat gccaatataa atttacagtt accgttcaga acggatataa 540 aaggagagat taacactgaa cacgatagac cggtatatgg caagcagtat ccatacgctt 600 attcggttaa cgatttcgtt aataacgaaa taagtaaaat gttagatgaa ggtataatca 660 gacctagtaa aagcccatat aattcaccgg taatagtagt accaaagaag ggagtaaatg 720 aggacggaac tcctaagcat agattagtaa tagactttaa gaaacttaat gaaaacacca 780 taccggatag ataccctatg caagatcctt cagtaatcct tgccaattta ggtaaagcga 840 aatatttctc ggccattgat ttagagtcag gtttctatca gattttaatg agggaatcag 900 atattgaaaa aacatctttt tccataaata acggcaaata tgaatttcta agaatgccta 960 tgggattgac gaacgctccc agaattttcc aaagggcaat ggacgatata cttagagaac 1020 aagttgggaa aacttgtcac gtttatatgg acgatataat aatattttca aaaactatag 1080 aacaacatta taaagatcta attcatataa ttcagatatt gcaaaacgct aacatgaaaa 1140 tttcactaga aaagtctaaa ttctttcaat tggaaaccaa atttctggga tacattgttt 1200 cccaaaatat cattaaaaca gatccagaca aaatctccac aatacaaaac tatccaattc 1260 ctaaaagtat cagagagcta cgtagcttct taggactaac agggtattac agaaaatttg 1320 tccgaaatta tgccacaatt gccaaacctt taacaaaata tctgagcgga gtaaatggga 1380 aagtttcacg aagggcatcc aagaaaacat taatacagct ggatgaacca gcaattggag 1440 catttaataa tttaaaggac agtttaatag cacatattga attagtacaa ccagattaca 1500 ataaaaaatt cacattaacc acagatgcat cggacatagc atctggtgca gttttgtcgc 1560 aagatgggaa tcccataaca tttatttcta aaactttaag taaaaccgag caattatatg 1620 ctactaataa aaaagaatta ttaactattg tatgggcatt aaaaaatttg cgaaattatt 1680 tatacggagt gagtggaatt gaaatacata cagatcacaa acctttgtcc tttgcaatgt 1740 cgcaaaaaaa tccaaatgtt gaaatgaaac gttggtttta cttcatagaa agtttcacac 1800 caaaaatcat ttataagcca ggctcaacta acgtagtagc ggacgcgtta tctcggatta 1860 aaataaatca tatgacagat agtgatgtcg accagacaga atcggaatca gacataaata 1920 ctcagcattc agcagagagt agttttgaaa acgtcattca agagactagc aagcctctaa 1980 accaatttaa acagcagcta ctattagcgc aagggagatt cacagttcat gagttagtaa 2040 gagtttatga aaatacccga catattattg aatttgatac ggttgacaat ctgctaatca 2100 ttttaaaaga atttattcag cctcacttaa ccacgggaat acactgcact ttagaagatt 2160 tataccttat ccaaaataag ctaaaggaaa acttcataaa taaatttctc ttcacgagaa 2220 agttcctcca agatgtagta aattcagaag ataaagctat aattatagaa gaaacacatt 2280 gcagagccca tagaggacta gacgaaaatt acaaacaaat aactaagctg tattattggc 2340 caaacctgca caaaaaatta aaagaatata taaaaaattg tgagatctgt aacaaaaaca 2400 aatatcagag acaccctgta aaaattccaa ttggggaagc tcccattcca gcaaaagaag 2460 gagatcaact acacttagac atatactatg cccaaaacct aacattcata acttgtatag 2520 attcttattc caaatacttg gtggtcaagg aaattcaaag taaattaaac attgaaaata 2580 aggtaatgga aattttacaa cactttccgt tagcaacatc attaatgact gacaacgaac 2640 caagctttac ttcagcacaa tttagatcat tcatacaaag aagtaactta actatttatt 2700 atgctgatcc cagacacagc acctcaaatg gtcaggtaga aagggtacat tccacactaa 2760 cagaaatagc gcggtgcatc aaggatgaac taaatctaac agattattca gaaataataa 2820 taagggcggc actgaaatat aacctgacga tacattcaac gaccaatcaa atgccatatg 2880 acatattgta taacaaaata gaacacaaac ataatccaat attacttaag gaagctcaga 2940 ccaaaatgct taagctacat aacaaagata gaagagaaaa agattataaa ataggagaag 3000 tagtttatga aaagaaattc ggggaaagaa ataaacttca atcccgatac aaaaagcagg 3060 tagttaagga aaatcgaccg aataaaatta taatcaacaa tagaaacaga attatacata 3120 aagataacat caaatattaa atattttttc cagaaaatgt atgcgaatat actaatactg 3180 acttttgtta tattgacgac ttcggaagta attgactata cgcatagtga ctatctattg 3240 ctcaaagacg acagagacgt ttacacttat gaaacatacg ctgaactttt ccatattact 3300 aatttgagtt tttatagaga gataattgat aaagaatcca aatatgtcaa caaatcaatt 3360 aattcagacg atgagtggga aatatcaatg gacttaagtc tattaaaatt aatgatttca 3420 gaattagttc caaaacggga ataaatgaat taggtaccat ttggaaatgg atagcaggca 3480 gccctgatca tgacgacttt ttgaaaatac aaaataaagt aaatgaatta actgaaaata 3540 ataataaaca atttgtaata aattccaaat ttttcaaaga aattgaattg ctatcaaatt 3600 cattaaaaaa tgtcatcttc aatgaagaaa cgatattgag aaagcatcgt ttaaaattaa 3660 taacttttga cctactaaat ttagttgata ccataaccct tttaaaagta aacattttca 3720 atacaaaaat tttaaataaa aacgaaatag aggacattta taagcatgaa aaacataatg 3780 ttgaactgtc tgatctgttg gacattgcaa cgattaaaat aattcgaaat gcagatttaa 3840 taatcattta cataaaatat cctcaaatta aagatatttg caagttttac catgcaagag 3900 ccatctcaca aaatgatgga atgttagtaa taagtgaaaa agtttgtgaa tgtaaggaga 3960 aatactattt aatgaataat tttaaaagtg aaacttttaa caattacgca caaataaatt 4020 taaagagtac ctgtttcacc aatttactta atggctttaa tgccaactgc actaaaaaaa 4080 gggaaaaaaa caaagaaata gacatcattc aggatggtgc catattagtt tcaggcaaaa 4140 atatcgtaga caatactact ttgttaggat cgtatctcct tatattcaac aacacaattg 4200 taataaataa tataacccac ataaatgata agggtaaaat tctaagatat atatcgcatc 4260 atagatatag cgattttgaa ttagttgatt atatacaatc gaatgataag caattttctt 4320 ttgataatgt taatatttta aatcccctta taacattagg taatacggcc ttaccattaa 4380 aaacattatt tttaatcatt ttgatattgt taatgttatt ttacttcggc tataaattaa 4440 ccaaatttgt acttattaaa tcaataagaa taccgtcaga agaaatagtg gaaagcaaca 4500 ttcaaatact gtcagaagaa attagagctc tatcagaact tcaaaatgta tcgggacgaa 4560 acatttaaga atggggggag ttaacgtacc caattccatt gagctcttat acgagttgta 4620 aacaatcttt ggctttccaa aacgaaaaca atttcaatct tgggcctccg cctaattgat 4680 ttctaagatg tacaatctca agcgcccccg ccgcaatccc aatctttgac actcgcctaa 4740 atggaaaatc aatgtttgcc cacacccggc ttctcataaa caatctcact cccggctttc 4800 tgcagatcct gcactttagg ccttttacag ttctctcttt ggatgacgaa atccaatact 4860 cgggatggcg aaatcaattg aaatgtttat agaaaaacga ttcccgaaag ggatcaacga 4920 tgcaaagatc agaaagttca aatcagtctt gatcgagcag cgacatcgca aagtctagct 4980 aaatattaat atcgcgcaat cccttgcccg gaactccttt gtgaccctct acttatatct 5040 atatcagtga agttaagttc gaagtanaat aaaaaccttt ttaaaaacac aatccgctgc 5100 cgcagttgtt taatttatat ttgacacaac gaanacgaga aaggggaaag gggcgacagc 5160 gattgcagtt aaaatgagtt atgtatatgt gtagatgtgt gtgatgtaca atctcaagcg 5220 cccccgccgc aatcccaatc tttgacactc gcctaaatgg aaaaccaatg tttgcccaca 5280 cccggcttct cataaacaat ctcactcccg gctttctgca gatcctgcac tttaggcctt 5340 ttacagttct ctctttggat gacgaaatct aatactcggg atggcgaaat caattgaaat 5400 gtttatagaa aaacgattcc cgaaagggat caacgatgca aagatcagaa agttcaaatc 5460 agtcttgatc cagcagcgac tccgcaaagt ctagctaaat attaatatcg cgcaatccct 5520 tgcccggaac tcctttgtga ccctctactt aaatctttat cagtgaagtt aagttcgaag 5580 taaaataaaa actttttaaa aacacaatcc gctaccgcag ttgtttaatt ggcgcccaac 5640 gtggggcgac gttgtataaa aaacaccgaa tgataaaagt gttgcgtcgt gccgaaaccc 5700 gaaggacaat taattaatgg aataaatcct tcaaaggaaa caaaccggag cttagggtgt 5760 gcaaaaataa atacaatata caatacaaat acaaatacaa atacaaatac aacgcaaaat 5820 aatacaaata aatacaaaag ctaatcaagg caaatcgtaa atacaaaaaa aaccaaagaa 5880 acccaaagtt tttaaaaaac aaaaccaaca aaaccgacaa ataccgacaa aaaccaaaat 5940 acaaagaaag 5950 // ID Gypsy11-I_Dya repbase; DNA; INV; 6126 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy11_Dya; KW Gypsy11-LTR_Dya; Gypsy11-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-6126 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1081-1081 (2009). XX DR Genome; chr3R; Positions 740758 734633. XX CC Positions [4043-4318] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 2195..3787 FT /product="Gypsy11-I_Dya_1p" FT /translation="MILKRKQVFSHSNEALPFNTSIIATIRTETNEPVYSK FT LYPYLMGAADFVNTEIKQLLKDGIIRPSRSPFNSPTWVVDKKGTDADGNKK FT KRLVIDFRKLNERTIADRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYL FT AEKDREKTAFSINGGKYEFCRLPFGLKNAGSIFQRAIDDVLREKIGKICYV FT YVDDVIIFSEIEADHVIHIDTVLKCLCQANMVAIEFDDTQRRAFERLRDIL FT SSEDVILTYPDFKIPFDLTTDASANGIGAVLSQKGRPITMISRTLKDCEVN FT YATNERELLAIVWAIGKLQNYLYGTREIRIFTAHQPLTFAVSERNTNAKIK FT RWKAFIEEHIAKVFYKPGKESFVADALSRQKVNALESEPHSDAATVHSELS FT LTYTIETTDKPLNCYRNQIILEEAAQPSRRHFIVFGNKNRHIINFDKKDSL FT LLSIKEVVNVSVVNAIYCNLPTLACIQHALVTEFPATKFWHCKNFVNDITD FT ANEQKEIIKCEHNRAHRAAQENVKQVLQDYCFPNM" FT CDS 5024..6127 FT /product="Gypsy11-I_Dya_2p" FT /translation="MLKMSESRLIESNNRQVVINTQTQEKINEQTEAVNKI FT ILAKKNNMVDTPHLYEVLLARNRILITEIQNVMLTVTLAKANIINPAILNH FT NDLKSVLIDHPTEIPVISLIEASNIKVLQSDSVIHVIITYPIIKATCKKVT FT IFPVSHQHTILQLTDNIVAECNNDVLAVTECVPTTYASYCKLATHDTCARG FT LHAGSTALCKTQPSNLSLITLVDDGVIITNECIAKVSTDDGPEIDTNGSHL FT IMFERVAFINGTKYANRREAIRKVPGMAASPLLNIIGHDPRLSIPLLQRIN FT NENLEVIQGLKEEVESAGTTDIWFAVGVGISVFICCSILLTLVLRRRRDTI FT EIQQVIAKFQMTEDGHIPEGGVVNT" XX SQ Sequence 6126 BP; 2011 A; 1381 C; 1263 G; 1461 T; 10 other; taattggcac ccaactgaat aagaattcac gacccccaaa aatttaagtg ggctgatcat 60 atttaataca gcttcaagta cacagggaaa aaataaacaa aaatccctac aaattattta 120 ccaaaaattt taatattaaa aatttttttg ataaagtgtg actgtaaatg tgactcacaa 180 ataaaggcaa atttatatca gccgtctcta aatgcacatt ttcagacacc aagcacccgg 240 ttattaatat aaatatacca tatccaaatt aatttctgtc gaattagaat attcggcctc 300 tatacaactt gtgtttagtg ttaaccagtt aagggcttga tgagttttcc aggcccatat 360 agggaaaata aacgcgaatt agaaagtgat agtgaagaag agcttgtccg agagacccgc 420 agattgcaac tagcacaggc agagaaaaac acatacacca tgaacgcgtc ccaagttgaa 480 gcgttagttc aggccgcgtt gtataatcaa gaacagaggc ttaaagccga atttacagag 540 caattaaatg ccgttaatca acagctaaaa aattagcgtg tagaagcgcc ggaggtaaaa 600 acctaccaga gggtagctgt aaaaccggga gcaacaccgt gtgatacacc tttacacatt 660 gtgaagtcaa ttccagattt atgggtatac aggatgaata cctggatgga ggctatcggc 720 ccccgatgct tttnnnnnnn nnncgggagc acaccgtgtg atacaccttt acacattgtg 780 aagtcaattc cagattttac tggtatacag gatgaatacg tggcatggag gcaatcggcc 840 accgatgctt ttgaattgtt caaagattat ccagacagta gtggacactt ccaagctgtt 900 actataatta gaaacaaaat taaagggcca gcgcgcgcgc tgctagtttc atataacaca 960 gtgcttaatt ttaacgccat tttggctaga ctggactgct cctatgcaga caagacatcc 1020 ttgcgacttt tacgacaggg tcttgagtcg gtccgtcagg gtgaccaaac cctaatgcag 1080 tactacgatg aggtcgaagg aaaattgacc ctcgtcacta acaaaattgt catgacatat 1140 gacgatgaaa gagccacctt gcttaataca gaggtaaggg ctgatgccct actcgtattt 1200 atatcgggtc taaaaaaatc ccttagggct gttgtatttc cagcacagcc caaagacctg 1260 ccagcagcac tggcattagc aagagaagca gaggccagta tcgaacgaag tatgttcgcg 1320 gccacatacg ccaaggcgtt ggagcagaga actcaaaata ctgacttcca caagagccaa 1380 caccgcgcgc ctgaaaaata aggcaaaaat tttggtgccg accgcggccc agaaaaaaac 1440 cctcattttt ttaaaaagcc aagcaagaat caaaataccg attcatggcg caagcaagag 1500 aactgggtat ccgatctacc caaacagatt caatccccag agccaatgga catagatcta 1560 tcgtcttcaa agattaggca ttccactaac tttagaaagg gaagcgataa acatgcccct 1620 aaaggacaaa actcttctga gagaatgtct ggtcaagggc gccagagggt tttcaagatt 1680 gaacaagacg ctaacgcggg ggaagatgag gattatgaaa acgcggctgc agccgctttt 1740 aatgcaagag cgaaagtgga ctcattggag tccaatgacc tcattaattt tttagaggaa 1800 ggtcccgctt gcctttcatt gaaagaaggc tgtcggggag agttttaaga atactcatag 1860 atattggagc ggcaaaaagt tatgtaaggc ccctaaagga gctgaaaaac acaacacttg 1920 tcgactcccc attcactgag agttccattc atggctccag taaagtgaaa aaaaactgcc 1980 taatgcacat atttgcccta acggccccat tctttttggt tgaataggtt tcgacttgct 2040 aacgcaggcc ggagcggcat tagacctacg caacggcgca ataaggtacg gaaacgtatc 2100 tgaaaagcta cagttccaca actgtgaaac tgtgaacttc acaaacatcg atgacattga 2160 agtcccaaat tcagtgaaaa ctgaatttaa aaaaatgatc cttaagagga aacaagtgtt 2220 ttcgcactct aacgaggcgc ttccatttaa cacctcgatc attgctacaa ttcgtacgga 2280 gactaacgaa cccgtatact ccaagctata tccttatctc atgggagcag cggattttgt 2340 taacaccgaa atcaaacagc tgttaaagga tggcattatt cggccttcta ggtcgccatt 2400 caacagccca acatgggttg ttgacaaaaa agggaccgat gccgacggca acaaaaaaaa 2460 acggctggtg atcgatttca ggaaactgaa tgaacgaacc attgccgacc ggtaccctat 2520 gccaagtatt cctatgattc tagcgaactt aggaaaggcc aaatttttta ctaccctaga 2580 cttaaaatcg gggtaccacc agatttatct ggctgaaaag gaccgcgaaa aaactgcctt 2640 ctccataaat ggaggaaagt atgaattttg ccgtctaccc tttggactaa agaatgcagg 2700 aagcatcttc caaagggcga ttgacgacgt gctccgcgaa aaaatcggga aaatctgtta 2760 cgtctatgta gacgatgtga taatcttctc cgaaattgag gcagaccacg tcatacacat 2820 tgacacagtt cttaaatgcc tgtgtcaagc caacatggtc gctattgaat ttgatgatac 2880 ccaacgcaga gcgtttgagc gccttcgaga cattctgtca tctgaagatg ttatacttac 2940 atacccagac tttaaaatcc ctttcgattt gactaccgat gcctcagcaa acggtatcgg 3000 tgcggtacta tcccaaaagg gcagaccaat aacaatgata tctcgcacat taaaagactg 3060 cgaggttaat tacgcaacaa acgaaaggga gctactagca attgtatggg ctattgggaa 3120 attgcaaaat tacctatatg gtactagaga gatccgaatt tttacggccc atcagcctct 3180 cacattcgcc gtgtccgagc ggaatacaaa cgctaaaata aaacgatgga aagcattcat 3240 cgaagaacac atcgccaaag tgttctataa acctggaaag gaaagttttg tggccgacgc 3300 tttgtcccgt cagaaagtta atgctttgga aagtgaaccc cactctgacg cagccacagt 3360 acacagtgaa ctgtcactga cctacacgat cgaaactacc gataaaccct tgaactgcta 3420 caggaatcaa ataatcctgg aagaagcagc ccagccctca cggcgacatt ttattgtttt 3480 cggtaacaaa aaccgacaca ttatcaactt tgataaaaaa gactcactgc ttctatcgat 3540 taaagaggtt gtaaacgtaa gtgttgttaa cgctatttac tgcaacctcc ccacactagc 3600 atgcatccag catgccttgg ttacggaatt cccagcaacc aagttttggc attgcaaaaa 3660 cttcgtaaat gatattactg atgctaacga gcagaaggaa attattaaat gcgagcacaa 3720 ccgcgcccac agggctgccc aggaaaatgt taagcaagtc cttcaggact actgttttcc 3780 aaatatgtaa aaacttgcca atgaggtggt tataaattgt agaacatgca ccaaggctaa 3840 gtacaacagg caccccaagg agcaagaatt gggtgtcacc ccaattccct catatgcggg 3900 tgaaagctac atatggacat ctattccacg gttaaaaaaa tttttttaac ctgtgtggat 3960 aagttctcaa agttcgccat agtacagcca ttagcctcta gggccattat cgacgtacgc 4020 agccccatac tacaattggt aaattttttc ccaaatacca aaacaatata ctgcgataac 4080 gaggcttctg aaaaaccaat atgacattga catcgttaat gcaccccctt tacatagcag 4140 ctccaacggg caagtggagc gattccacag caccctaacg gaaatagcta ggtgcattaa 4200 aattgacaaa aaagttgacg agacggtcga gctcatcatg agagccacgg tagagtataa 4260 caaaacgtta cactcagtca caaagttgca gcctaaagac gccttgcatt ctgccaacga 4320 cgctcagaga ctggcaataa aaaccagtat cataagagcg caacaggcta acctagatag 4380 gtataaccct actaggcaaa accgaatttt tgaggtaggt gaaaaggttt acctcaaaaa 4440 caacaaaagg ctagggaata agctaaccgc actctacaca gaagagcgtg tagaagcaga 4500 catgggaacg tctgtcctca ttagagggag ggtggtccat aaggacaacc ttagataaaa 4560 atccctctcg aatttacatc ttccacaatt tttatacctt tattcatgca ccttgtattt 4620 aagcgtactt atccttttat tagatgcaca ttcgattcaa tcttccgctt tcttacagga 4680 tcgccggaat catattaatt acgttaacta tagttaacgc taggattacc aattactctc 4740 atgctaatta catccccata acagatggtg ttgcactggt gtgggagaag agaaactggc 4800 ttaggcattc tactaacctg tccgaattcg aaatgatgat agaagagact agtaggttgt 4860 ccgagctatt accagagtcg catatgcaaa attactagat accgaccatc tcagaagttt 4920 actggccgta ttgaaagtac atcataggat tgccagaagc ctaaactttc taggcatagc 4980 actgaaagta atagcgggca ccccagatgc cgccgatttt gaaatgttaa aaatgtccga 5040 atctcgatta atagaatcaa acaatagaca ggtggtcatt aatacacaga cccaggaaaa 5100 aattaacgaa caaacagaag cagttaataa aatcattcta gcgaaaaaaa ataatatggt 5160 tgacactcct cacttgtacg aggtactctt ggcaagaaac agaatattaa ttaccgaaat 5220 ccaaaatgta atgcttaccg tcacactggc taaagccaat atcataaacc cggccattct 5280 taaccataac gatttgaagt ctgtattaat tgatcaccct acagaaatcc ccgtaataag 5340 cctaatagag gcatctaaca ttaaagtcct tcagtccgac agtgttatcc atgtgataat 5400 tacatacccc atcattaaag ccacttgtaa aaaggtcacc attttcccgg tatcccacca 5460 acacaccatt ttgcaactaa cggacaacat agtagcggaa tgcaacaacg acgtcctggc 5520 agtcaccgaa tgcgtaccaa caacttacgc atcgtactgc aagctggcta cccatgatac 5580 ctgcgctcga ggtctgcatg cggggagcac tgcattatgc aaaacgcagc ccagcaatct 5640 tagtctaatc acattggtag atgacggcgt aataataacc aacgagtgca tagctaaagt 5700 cagcaccgac gatggccccg aaatagatac caacggctca catctaatta tgttcgagcg 5760 agtggctttc atcaacggca ccaaatatgc caaccgacgt gaagccataa ggaaagtacc 5820 gggcatggct gcatccccat tgctaaacat catcggccac gatcccaggc taagcatacc 5880 gcttctccaa aggatcaaca acgaaaactt ggaggttatc caaggcctta aggaggaggt 5940 ggaatcggct ggaactacag acatttggtt tgcggtcggc gtgggaatca gcgtcttcat 6000 ttgctgctct atcctgctca ctctggtgct gaggaggaga cgagatacga ttgaaataca 6060 acaagtcatt gccaagtttc agatgaccga ggacgggcat attcctgagg ggggagtagt 6120 taacac 6126 // ID Gypsy-37_DWil-LTR repbase; DNA; INV; 2321 BP. XX AC scaffold_181148; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_DWil_; KW Gypsy-37_DWil-I; Gypsy-37_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2321 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181148; Positions 175163 172843. XX SQ Sequence 2321 BP; 786 A; 404 C; 424 G; 707 T; 0 other; tgtaaaggcg cttgttgcta gagattggca acatttaaac aaaatgcaat gttgcaatgt 60 tagcggggat gattgtatgc catgatcgta tgttgccaag cacatatctt cgggtcagcg 120 catgccgcct ctcttgagca gcctctgtgg gtgcagagag acggccaaag agaagaaaac 180 tttatctatg caaatgaagc aataaaaatt ctcctgttgc taagggaatg ctttctaatg 240 ggagcaaaag agaaaggcag tggggtctat aggaccgttt tatagaatcc aggatggcgg 300 cgaggtggat acaccaagcg tctgttttaa taggtttcta agttcttcgg gcaatctgta 360 ggagtcattg caaattggta attcggcaaa gtcagacggt taatggtcaa gtgatactta 420 aagcaattaa ctaagtcgct gaaaaaaaaa aatttttaag ttaaaatccc gcataattaa 480 tccaaaagaa ttttgtacat atatttatgg ttggtgaagt tcatttcctt tcaaagtgct 540 taaaagtgtt aaatatatac atttgggcaa gccctcaaga gggcggaacc acatttggtt 600 tgccaaggcg atacccgtca cataaggtga gaaagtagaa aaatataact aaaaaaaaat 660 aggtgtctgt ccttctgttt gtgttttgcg catagaattt tagcaaacgt ttaatttttt 720 tttatcagca taaccaaaaa gaaaaaaaaa aaaagataaa cacttgcact ggcggcgtac 780 catttttcgc atacccgaaa gcgctaatcg gataatcacc ccaagagacg cataccaact 840 tacatcaata tacaagaaca taactatcag ctgatcgagc gaatgaccta agtatccata 900 tctacctata tttcatctac agaagaaaaa caaagaacaa agcccaaata caaaaagata 960 ccaaaaagaa aagaactttg aagggcaact aagcgctcgc ccgaatgcat acataggatg 1020 tcgattggtt agcagtttta gttaagctag gtttaggtat agaaaaacgt aaaaattaaa 1080 tttttttttg ttgttagttt ttttttttgc aatctcaagt caagtacttt aatgattaga 1140 tgtaagaaaa aataattaaa ttttgaacta tgtaaagata aaggcaagca gcaaagtcag 1200 cagaaaagac atacaaagaa aaaaaaagag aaaaaaataa aacaactaaa atcttttata 1260 tacatatgta catggatagg taatagaatt acagggtatg catattcggt gctagaatat 1320 ttttttatat ttttctttcc ttaaaacagc agtacggaga aagcgctttc gacgttggat 1380 aaagaagctg taagtaatgt acacccaata tttatttatg taaaaacaaa aaaaaaatgt 1440 tcttcacccc ttgcacctgc ccctttttcc cttaaaataa ctaaagtaat attccccaca 1500 taaaataaaa gaaagaagag aaaaaaaaaa aacaaaaaaa aagaaatatt cccgttatca 1560 tgcgtgtgca aatcgtgccc ttgtatgttc ttttggctta gctacttctg ctaaattgtc 1620 tgtgcgccta atctacaact gttgctagcg ctaggtcttg cttattattt gatgagatac 1680 ttttgctgaa atagcggcac ttcttgccga cgttttgttt tgattataaa ttttgatgat 1740 ttttttttgt tctgtctact tgtaagagcg atcatttttt tgtaacttta agtatgttta 1800 atactctaaa tttttttttc ctcttagata agcaataaaa atttataatg tctgattaaa 1860 agttatcttg aattcatcca tctacacctg tctccccagt aaagaacgaa gggatcatct 1920 gaagttaaag cttcaagcgg ccgatgatca taggaagggt gggcaattgt caatcgcaaa 1980 accaccagtg attgcaagta tcgaaatgcc ccttgcatta tcagcgtatg aaattctatc 2040 tttcgtttct atctttcttt tcaatctttc ttttgtatct tccttttcta tctttgtttt 2100 ttttggtatt ctcttcctct tgtcgcatgc acgctcactt ttaataatat aataatgaga 2160 aaggcaggca ggtcgatgaa aatataaata aataattgtg gtaactggat ttcggcagaa 2220 ctacacccct tttgtcgacg agctagtgcc ggaaatttac agcgtgccag ccgatagcca 2280 gctactgcca gacccgccca aactatacgt tacccgttac a 2321 // ID hAT-17_HM repbase; DNA; INV; 3550 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3550 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2006-2006 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(899..1456,1474..3258) FT /product="hAT-17_HM_1p" FT /translation="MSKVERDAVVLQGPPKNPLSFPRDSNKIKVPESVFHE FT ITRNGEKINRDWLVWSATSKSFLCFPCSLFGSKQACGAGNNSHLLRWNGGI FT NGNWRKLAEKVKGHQNNPHHRDNYIKWKTALESLGNQCGIDSSLEKLIRNE FT AARWREILKCILDVILFLASRNLSFRGIFVNINYYIYVYVIIYFYYNFICV FT IKIDILGSSKMIGDDDNGNFLATLELLAKHNKTLQLHLEEVSRCQQEGKQI FT IAHYLGWSSQNEFIKECGRIVYGAIIKEAHMAIYYSILVDGTPDVSHTEQI FT AFVLRFVYYGIDKKWVVKERFLGVESLDKKKGVDIAKLIIDVLNQNDIDLK FT NCRGQGYDNGANMSGVYKGVQAIILQRNPQAFYMPCSTHNLNLAGVHSLES FT SVEMKNYFGRIQLLYNLFSGSPIRWKILTETTGLSLHQTSQTRWSARIEAV FT KPLVKRPREILLSLQKLRDLDLTADLLIDVKSLEKWIQSFEFIIMTTFWFK FT ALQAINYVSVSFQSENITLDDEMKLMKILIEDLTRLRSSWPELINEAHLVA FT SGLASYGFQSKLVQKRTRKRKTFYEEARNEVHFLENDEKQFEVNVFNTALD FT TLIQQIKDRFQAAEKTTNSFSFLWLSESNSRSSEEKEIEQPSLEEKCKTLA FT QMYVNDVDEDKLILEVRHLDTLKRANLFGPKEYLTSMKLLNGIYQKGLESI FT FESTCILLRIFNTIPVSIAEGERSFSKLGLVKTTLRSTMSQDRLTNLLVIS FT IEHDLAKSLCYDEVIENFALNKARRVKFL*" XX SQ Sequence 3550 BP; 1191 A; 592 C; 664 G; 1103 T; 0 other; cagggccgcc gcgtgggggg gggtcactgg gtacttttta cctgggcgca ggatcataag 60 gggcgctcaa atgataattt tttttttata aaatagatac acagggcttc aatgaaaaac 120 tcttcaaact gctaatttta aaatataagt ttcaaatgta ttgttaactt aacaatacag 180 tattagtcga actcatatgc gggagtctga gtctagtcaa aaataaggtt aaagtacatt 240 gtataattga gattcttaca tttaatatag ttttactaag tatcataaaa atagttttca 300 atcaaattca aaactttctt aaaaacaaaa catgttcagt ttcaagaaaa aagattctgg 360 ctgtcaaaac agaaaaaatg cggcaaatag gacagcaaat gagcaaaaaa ataaacgtac 420 gctcgaagat tgtggtatcc aagttaaaaa aaatgacggg gataggccta cagtttccag 480 attgatatct ataaattcct ctcaatcacg tcaagctttt ggaatagtaa gttcatttat 540 ctaaacataa ccttcttttt tatatatctt ttctattata ctattattat ataaatcaag 600 atatttttga atattttact ttattttcta taggatagct ttgaaaacga ttcttgtaaa 660 attttgttga actcagaaga accacttcct ccttcgagtg gcacgttggc aatatcggca 720 acaagagttg aaaatgaaaa caatattttg gtaaaatttg cttcgcaaag attctttaat 780 tatttttgtg aatttgtgaa aattatatac ttattaaaat ctttatttaa attattagca 840 taaagataag ttggatgagg caattactac aaacgatcct gctttgtggg ctgaaaatat 900 gtccaaagta gaacgcgatg cagttgttct tcaggggcct cctaaaaatc ctttatcttt 960 tccgagagac tcaaacaaga ttaaggttcc tgaatcagtt ttccacgaaa taactcgtaa 1020 tggggagaaa ataaacagag attggcttgt ttggagtgca acttcaaaat catttctttg 1080 ctttccgtgc tcacttttcg gaagcaaaca agcctgcggg gctggtaata attcacatct 1140 tctacgttgg aatggtggca tcaatggaaa ctggagaaag ctagctgaaa aagtaaaagg 1200 ccaccaaaac aatcctcacc atcgagacaa ttacattaag tggaaaacag cgctggaaag 1260 cttgggaaat caatgtggga ttgattcaag tttagaaaaa ttgataagaa atgaggcagc 1320 caggtggcgt gaaattttaa agtgcatttt agatgtaatt ctcttcttag catcacgcaa 1380 ccttagcttc agaggtatat ttgtgaacat taattattat atttatgttt acgttataat 1440 ttatttttac tataattaaa cagaaagtca tgatttatat gtgtaattaa aatcgatatt 1500 ttaggctcat caaaaatgat cggtgatgac gataacggca actttcttgc aactctagag 1560 ctacttgcaa agcacaacaa gactcttcag ttacatctag aagaggtttc tcgctgccaa 1620 caagaaggta aacaaattat tgcgcattat ttgggttgga gttctcaaaa tgaatttata 1680 aaagagtgcg gaagaatcgt ctacggtgcc atcattaaag aggcgcatat ggcaatttac 1740 tattctattc ttgtcgacgg aacgcctgac gtctctcata cagagcaaat tgcatttgtt 1800 ctccgctttg tctactatgg tattgataaa aaatgggtag ttaaggagcg ctttcttggg 1860 gtcgaaagtc tcgataaaaa gaaaggtgtg gatattgcta agctaattat agatgtctta 1920 aatcaaaatg acatcgatct gaagaactgt cgaggtcagg gatatgacaa cggtgctaat 1980 atgtctggtg tgtacaaagg agtacaagcg atcattcttc agagaaatcc tcaagctttc 2040 tacatgccat gcagcactca taaccttaat ttagctggtg ttcattcact tgaatcttct 2100 gtagaaatga agaactattt cggtcgtatt cagttactct acaatctttt tagtggaagc 2160 cctatacgat ggaaaatctt gaccgagacg actggtttgt ctctacatca aacttcacaa 2220 acccgatgga gtgcacgcat tgaggctgtg aagccgttgg ttaagcgacc cagagaaatt 2280 cttttatctt tgcagaaact tcgtgatctt gatttaactg ctgatctatt gattgacgta 2340 aaatctttag agaaatggat tcagtcattt gagtttatta tcatgacaac cttctggttc 2400 aaagcgctgc aagcaataaa ctatgtcagc gtgtcatttc aatcagaaaa cattacttta 2460 gatgatgaga tgaaacttat gaagattctt atcgaggatc ttactagact tagatcttct 2520 tggcccgaat taattaatga agcacattta gttgcctccg gtctagctag ttatgggttt 2580 cagtcaaaac tagttcaaaa aaggacaaga aagaggaaaa ctttttacga agaagcaagg 2640 aacgaagttc atttcttaga aaacgatgaa aagcagtttg aggtgaatgt attcaacacc 2700 gctctcgata ctttaattca acaaataaaa gacagatttc aagctgcaga aaagacgact 2760 aacagttttt cttttctatg gctctcagaa tctaatagta ggtcgagtga ggaaaaagaa 2820 attgaacaac ctagcctgga ggaaaaatgt aaaactttgg cgcaaatgta tgtgaacgat 2880 gtagatgaag acaaacttat tttagaggtc cgtcatctgg atactctaaa gcgtgccaat 2940 cttttcggtc caaaagaata tctcacttca atgaaactat taaatggaat ttatcaaaaa 3000 ggtctagagt ctatcttcga atcaacatgc attttattac gtatttttaa cacaattccc 3060 gtttccatcg ccgaaggaga gcggtctttc agcaagcttg gtctagtcaa gacaacacta 3120 agatctacaa tgagtcaaga ccggcttacc aatctcttag ttatatctat tgagcacgat 3180 ctagcaaaaa gtctgtgcta cgacgaggtc attgaaaatt ttgctttaaa taaagcacgg 3240 agagtcaaat tcttataaag catttaaaat atctattaca tttaaaattt acagttgctc 3300 gcaatataat aaagttaatt tgtcaacttg tgcttctttc tttacctttt ttcaaactta 3360 tttaaaaaat atatagaagc cttctcgact gtaattacct tctcggcctc ttaaagtgag 3420 ttttaaaaag taaaacttaa aaaaaaactt agcatttgga cagtatctaa attttttttg 3480 attaggtcaa tatgacaggg cgctagataa attttgaccc caggcgtcat aaagggacgc 3540 ggcgggcctg 3550 // ID Gypsy-250_AA-I repbase; DNA; INV; 4523 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-250_AA_; KW Gypsy-250_AA-LTR; Gypsy-250_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4523 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1101-1101 (2011). XX DR [1] (Consensus) XX CC Positions [3348-3815] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 126..4157 FT /product="Gypsy-250_AA-I_1p" FT /translation="MANEAGESSGAVKFEAGGVVPVVRAGMFGQIEQYVVG FT ENFEEYINRLEMFFLVNDTQDNRKVPVLVTVAGPSLYTIATRLCAPEDPRT FT KSYADLVKLLKEHLGPTTNVVAERYKFRKCEQLSSQSITDFIISLKATAQT FT CEFAAFLSDALRDQFVAGIRDQSLRKKLLTEPGLTFDKACSLARSWEAALS FT QNKEMSTQSPQKIGTFRKEVHQKLKVKRSNPGKEESKTSRPCQQPSKPCFR FT CNRLHDPNTCPARSWTCYSCGKEGHVSTCCRSKNKQTKKPGGSNRVAEMSE FT AVEVLRLNMLGGPVEAAASTLSSSFEAKSAEEKCQSSVSLAILEQASSGPT FT PGSLNKLDSPEFVSLECDGRRIEFEADCGACRSVISADTYRKFFPNRKIKG FT TSLEFVSVSGQRIKPQGTLEVSVTAPSGIHGRLELVVIRTDREVNPLLGRD FT GLDLVFPDWRRIFSGKSVGKFEHSFESELKSRFPRIVSESANNVITGFTAE FT IVLKPDTSPVFHKAYSVPFLLREKVEQGLDKLVEDGILVPVRSSLWANPIV FT VVPKKDGSVRICLDGKAALNRYTTVEHYPLPRVDDILAKLANWKLFCKIDL FT SGAYLQVQLSESSQVLCTINTHRGLFRYTRMPFGITSAPAIFQSIMDQVLD FT GSPGIAYLDDIIVGGATIDECRRNLFMVLQKLNDHNVRINVLKSSFFESEI FT DYVGYAITSDGIRPSEAKVKAILDAPPPQNVNQLQAYLGLLNYYHRFLPNL FT SIHLRPLYDLLQKNRRFLWSSECQVAFEKTKRSLVENDLLEPYDPLKPITL FT AVDASPYGVGAVLSHVVGNVEKPISFASSTLTPAQTNYAQVHKEALAVIFG FT VTKFHKYIYGAKFKLVTDNTGIKEIFNPSKGISAIAAARLQRWALILASYS FT YAIEHRPGKFMHHADALSRLPLPSLAEVDHTDLGLNSVSSSGQEVVNIDVV FT RTHQKSDPILTKIFAFVSHGWPSNVDPSYKPYFLIRNHLGIDREVLYFDDR FT VVVPDSLKLAVVEQLHSNHDGVVRMKMLGRMYVWWKNFDKDINDYVQKCLV FT CQKRQSVPKEIVESKWPKCERPFQRLHMDLFYFEGHTLLIIVDSYSKYIDV FT RLLKGSNSLQLIEQVESFFASFGIAEEIVTDNGPPFNSELFVRFLEVNSVT FT VSKSPPYHPQSNGLAERGVRTVKDVLKKYLLDDKRQSLSIGRKINRFLINY FT RNTPSTVTNRTPSSMVFAYTPRTPTNMVNPRKVELESVPSKPVVRNNDDRI FT IQSPPEIRNTYRAGDKVLYRNHFKDLVRWIPAIVLKKLSPLTYLISVEGNV FT RMVHANQIRVSDLSDVFHPTVPIAQPVQQRIV" XX SQ Sequence 4523 BP; 1265 A; 1014 C; 1082 G; 1159 T; 3 other; agttggcgac gaggataaaa agtgattagt tgcgcgttgt gtgtaggtag tgaaaaaaaa 60 gtcgaagtga agaaccggtc ggtaacacgt gttcgtgaac ataacctcac attcgattag 120 tcgtcatggc gaatgaggca ggtgaatcaa gtggtgcagt gaaattcgaa gctggcggcg 180 tagttcctgt cgttcgtgct ggcatgtttg gccaaatcga gcaatatgtc gtgggtgaaa 240 atttcgagga gtacatcaac cggctggaga tgttcttctt ggtgaacgac acgcaggaca 300 atcgaaaagt ccccgtgtta gtcacagtcg ctggtcccag tctgtacacg atcgcaaccc 360 gattgtgtgc tccggaagat cctcgtacaa agtcgtacgc cgatctagtc aagctcctaa 420 aggaacacct tggccccacc actaacgtcg tcgcagagcg gtacaagttt aggaagtgtg 480 agcagctttc gtcccaaagc atcaccgatt tcatcataag cttgaaagcg accgcgcaaa 540 catgtgagtt tgctgcattc ctgtcggatg cactgcgaga ccagtttgtc gctggtattc 600 gtgaccagag tctccggaaa aagttactta ctgaaccggg cctaaccttc gacaaagcat 660 gttcgctagc tcgaagttgg gaagcggctc tgagtcaaaa caaggagatg tcaacccaat 720 cgccgcagaa aatcggaacg tttcggaaag aagtccacca gaagctaaaa gtcaagcgaa 780 gcaacccagg aaaggaggag agcaaaacaa gtcgtccgtg tcagcaacca tcgaaaccat 840 gctttcggtg caatcgtctc catgatccaa acacgtgccc agcgcggagc tggacgtgct 900 attcgtgtgg aaaggaaggt cacgtgtcaa catgctgtcg atcgaagaat aagcaaacca 960 agaagcccgg aggaagtaat cgagtcgctg agatgtccga ggccgtcgag gttctacgcc 1020 tgaacatgct cggtggtcca gtggaagcag cagcgtccac gttgtcgtcg tcgttcgaag 1080 ccaaatcagc agaggaaaaa tgtcagtcta gcgtcagtct ggcaatcctg gagcaggcaa 1140 gcagtggtcc aactcccgga tcgctgaaca agttggattc gccggaattc gtgtcgctgg 1200 aatgcgatgg tcgtcgaatt gaattcgaag cggactgtgg cgcgtgccgc agcgtaattt 1260 ccgcagatac gtatcgcaaa ttcttcccaa accgtaagat taaaggtacc tcattggaat 1320 ttgtatccgt ttcaggtcag cgaatcaaac cgcaaggtac gctagaggtc agcgttaccg 1380 ctccgtccgg aatccatggt cgtctggagc tggtagtaat ccggaccgac agagaagtca 1440 acccgttgct tggtcgtgac ggtctcgatc tggtattccc ggactggaga agaatatttt 1500 ctggcaagtc tgtgggtaag ttcgaacatt ccttcgaatc agagctcaag tcgcgttttc 1560 cgagaatagt tagcgagtcc gcgaataacg tcattaccgg tttcaccgcg gagatagttt 1620 tgaagccaga tactagtccc gttttccata aagcgtattc agtcccattc ctacttcgtg 1680 aaaaagttga gcagggctta gacaagttag tcgaagatgg tattttagtc cccgtccgat 1740 cgtcgttgtg ggcgaatccg atcgtagtag tccccaagaa agatggttcc gttaggattt 1800 gtctcgatgg aaaggccgcg ttgaatcgtt acaccaccgt cgaacactat cctttgccac 1860 gtgtcgacga tattttagca aaacttgcca attggaagct attttgcaaa attgacttgt 1920 cgggagcgta tctccaagtg caattgtctg aatcgtctca agtgttatgt acgattaaca 1980 cccaccgtgg tctttttcgt tacacgcgta tgccgtttgg tattacatct gcccccgcga 2040 tttttcaatc cattatggat caagtcctgg acggttcccc tggaattgcg tacctggatg 2100 acatcattgt tggtggcgca acaattgatg agtgtcgtcg taacttgttt atggtgcttc 2160 aaaagttgaa cgaccataat gtgcgaatca acgttcttaa atcgagcttt ttcgaaagtg 2220 agattgatta cgttggatat gctatcacgt ctgatggtat tcgtcctagt gaggcgaagg 2280 taaaagctat tttggacgct cctcccccac aaaatgtgaa tcagctccaa gcataccttg 2340 gactgctaaa ctattatcac cgttttcttc cgaatctttc gattcatctt cgtccgttgt 2400 acgatttgct acagaaaaat cgaagatttc tgtggtcatc cgaatgccaa gttgcattcg 2460 agaaaacgaa acggtcgctg gtagaaaatg atctacttga accctacgat cccctaaaac 2520 cgataacgtt agcagtcgat gctagcccat atggcgtggg tgcagttttg tcccacgttg 2580 tcggcaatgt cgaaaaaccc atttcgttcg cttcatctac actgacccct gcacagacca 2640 actatgctca agtgcacaaa gaagcactcg ctgtgatctt tggagtcacc aaatttcata 2700 aatacattta tggtgccaaa ttcaaattgg taactgataa caccggcata aaagagattt 2760 ttaacccgtc gaaaggcata tctgctattg ccgctgctag attgcaacgg tgggctctaa 2820 ttctcgccag ttacagttac gcaattgaac atcgtcctgg caaatttatg caccatgcag 2880 atgcactgtc gcgtctacca ttaccgagtc tagcagaagt agatcataca gatctagggc 2940 ttaatagtgt gtcttcgagt ggtcaagaag tagttaatat tgacgtcgtt cgtactcatc 3000 agaaatctga tccaattttg accaaaattt tcgcattcgt gtcacatggt tggcctagca 3060 atgtcgatcc ttcgtacaaa ccgtattttc taataagaaa tcacctaggc atcgataggg 3120 aagtgttata cttcgatgat agggttgttg tcccggatag tctgaagctg gctgtagtag 3180 aacagttaca ttcaaatcac gacggagttg tgagaatgaa aatgttaggt cgaatgtatg 3240 tctggtggaa aaacttcgat aaggacatta atgattatgt tcagaagtgt ctcgtgtgtc 3300 aaaaacgaca atctgtccct aaggaaatcg ttgagtcgaa atggccaaaa tgtgaacgcc 3360 cattccaaag gcttcacatg gatctatttt actttgaggg tcacacgcta ctgataatag 3420 tcgatagcta ctcaaagtac atagatgttc gcctactaaa aggttcaaac agtctgcagt 3480 tgatagaaca ggtcgaatcg ttttttgcta gtttcggaat agccgaagaa attgtcaccg 3540 ataatggtcc cccctttaac tccgaactgt tcgtgagatt cttggaagtg aatagtgtca 3600 cagtctccaa atctccacct taccaccccc aatccaacgg gttggccgag agaggtgtaa 3660 ggacagtcaa agatgtgctt aagaagtact tacttgacga taagcgacag tcgctttcga 3720 ttggtaggaa gattaatcgt ttcctaatca actaccgtaa caccccttct acggttacaa 3780 atcgtacgcc gtcgtccatg gtgtttgcgt atacgccgcg tacgccgacg aacatggtta 3840 accctcgtaa agtcgagtta gagtccgtgc catcaaaacc tgtagtacga aataatgatg 3900 atcgaatcat tcaatcccca ccagaaataa gaaatactta ccgtgctggt gataaggtat 3960 tgtaccgcaa tcacttcaaa gatttagtcc gatggatacc cgcgatagtt ttgaaaaaac 4020 ttagtccact cacgtactta ataagcgtag aaggaaacgt tcgaatggta cacgcaaatc 4080 aaatccgtgt atcagacttg tcagatgtat tccatcctac cgtgccgatc gctcagccag 4140 tgcaacaacg aatcgtggma aacgagcaaa gcaacaacaa agtcagtgcg ccatctgtgk 4200 gttcgcgcga gaacctatcg caaccggcaa gcacaagctc atcgcagtcg acgaagcaag 4260 caggcaagaa gaaaaagcac aacaaacgtc gacgaagcga gtcaaaatct cccaaggtgc 4320 ggcgctcaga tcgcttgaaa ggacaaccga ggttgaagta tccgaaatga gtagaagata 4380 aagtagagta agcttttcat atgtgatatg tttgatcgag tgattaattc acctagamgt 4440 ttgtgtagct tttcttaagt gataagtatt ataacgtttt taaaaaatga tgtaatgtaa 4500 atgtgattta aagcccggag agc 4523 // ID Gypsy-2-LTR_HM repbase; DNA; INV; 156 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-156 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1971-1971 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 156 BP; 61 A; 18 C; 27 G; 50 T; 0 other; tgtgaagccg tgaaccttag tcatatgcgg acaaatagga acaatgtttt gatatatata 60 aacatacgtg gctataaaaa atgtatatta aagatggctt acttataaga atatatatat 120 ttaagtgtag tacagtgttt tattaacgac acaaca 156 // ID BEL-18_DPu-LTR repbase; DNA; INV; 601 BP. XX AC ACJG01001526; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-18_DPu_; KW BEL-18_DPu-I; BEL-18_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-601 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (09-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01001526; Positions 13629 13029. XX SQ Sequence 601 BP; 146 A; 114 C; 121 G; 220 T; 0 other; tgttccagag aaaaagaagg cttaatttaa atgtatattt tagttcatgt tcccatttca 60 gtttaaatct caagtatcga ttaatttaag taagccgcca ggtgggccac ttgtcgaggc 120 agaagagaac agctgatgtc tcaattaagt ctcttggccg aaacagaaag tcgccattgt 180 tcattttagg gcgtgtatag ccattgataa ctcgtttctt actgtgcttt attgatggga 240 attctcatct tgtgactccc tatttcttat ctgattctct gaggtaaatt tgtgtatata 300 tttttatgtg ttccattagc taacattatt ctcgtgtaca tagattaggc cggattaggc 360 cctagatttg atttcgtgtc gtttgtgtct acgattgcct tattgaccgg ctcctaattg 420 tggatgctag gtgagagcta ggttttattt caatcgaagt ttgtgtatgc taattctgtc 480 ttaatttagg caatttgtgc tctgtccatt ttctatattg acgaataaac ccgtgcatcg 540 attagaagca attgataggc ctacgcctcc ctcttcgacc gtctcattct ctcgctgaac 600 a 601 // ID MuDr-1_HM repbase; DNA; INV; 6181 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 30-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE MuDr-type DNA transposons from Hydra magnipapillata. XX KW MuDR; DNA transposon; Transposable Element; Mutor; MuDr-1_HM. XX NM MuDr-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6181 RA Bao W. and Jurka J.; RT "MuDr-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2075-2075 (2008). XX DR [1] (Consensus) XX CC The C-terminal of this transposase contain Ulp1 domain CC (pfam02902) and PHD-finger domain (pfam00628). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1666..4119 FT /product="MuDr-1_HM_1p" FT /translation="MQQVFDMFPEVLMIDCTYCTNKLRMPLFTLLVEDGNG FT TGQAVGYAFIAQETENTLQDVFSEIELILNLNQVKVVILDKDLKEIGAVSK FT VMPTAEIQLCKFHVLQAFYRFLNKQGLASIEKDNLKKIFKSLVYAKSKKLF FT DINLGHLATVAPEVVMAYYNKNWGLPHQVKYWAYYETIKYVNLGNTTNNRS FT ESHNQKIKNILSRKMSLAEAVKGILLLHTCKFEQMAHIEFNQSFKEPYRLG FT TDEDVVTDDIVKVLTPYAAGIVIAELKKSRTDLLVLNKVCSCPLQVTMKLP FT CRHTMAVKIVNGDAIFKAEDAGERWHLSYQKKSWLSRKTRSLHNQANNVFH FT FSTNKTEKKALKVPFSKDQKFKTVMNLCKPISDYLSTLGMNEFEEKIDSLN FT LIYKLWLAGKKVIISDMEAQALPEISNLPKVSALPEISDLPEMSDFLETSE FT LPEMSDLPEMSSSPNFSESLFSALPSSLNLNEQLQNQGSVLSPNQSSLNDS FT VNFKDLPTTKSECCTGQAINIFNNLANVCLPKVPPVVGNVKIKKARLKRKR FT DLYNDPYHQSCQRILGKTAHKEQLLKKQLSVIDLQLFSIKDSFEMLSDLHI FT NKAQKLLSMQFPDVKGLQDPILGSKLHFRVQSGKFVQILHNGAMHWLTISN FT FLSTHLNSVDVFDSLYYDLSVSGKMQVGSIMMVKEPFLKLQFKDFQKQSGG FT LDCGLFAIAAATDLCYNYDPSIKCYKQELMRDHLLKCFSDNYLTPFPIETN FT ERKKKKQPKPINLPLYCICRLPDNKEEKMVKCDNCKEWFHVSCLSIPKNIF FT DNSDNSLHWHCNSCS*" XX SQ Sequence 6181 BP; 2151 A; 923 C; 981 G; 2120 T; 6 other; cgaagtgtct attcgcacga ccgtcgtgca aataacaaaa ataaacttat tcgcacgacc 60 gtcgtgcaaa taacgaaaat aagcttattc gcacgaccgt cgtgcaaata acgaaataaa 120 gcttattcgc acgaccacgg ttatatataa actataaata waatacgcat gtgctattca 180 aaaaggtaga attcaatcag gaagcttgtt tggaataaac tgatttattt aatttatttt 240 tgctataaat atgattgcat aataaaccta aattatattt ttagtttcag cttatcaaat 300 tagtcatgaa tattgaaatt aaacaaggag atgagtttga taactttgct gaatttgaaa 360 ataaattggc ttcttggtcg gcatttacaa acattttgtg gactaaatgt aacagcaaaa 420 ctgttacatt ggctaataag ggtttgagtc agaattcaaa acattttgat gagaagttta 480 aatttagaaa tgtaacatat ttatgcaagc atggaggagt taagagaaca aaagggaaag 540 gaattcgccc aaatcaaagg ttttaaatgt tttgtttatc taatttcttt ttatctaaaa 600 gtttttctaa aagaaaagaa atctaatatt ttatttgtgt atgctaaatt tcattttaat 660 tattttttgc ttgtttttgg ctggcttgtg gaagctggca ttcgagcttg aataagatct 720 catatcaact acaacatgca cttctctgtg ggtagtttac attataatgt tcactaccca 780 ctgatagaac tcaataattt tcagaaagtg aagattttca ctactttttt ttttagctct 840 ttcaaaattg gatgtagtgc caaactttat attagtatta ataaagataa aacaaaacta 900 gtagtacagc aacttcaggc taaccataac catgaagtca gcaggttagt acaactatat 960 tttattatta gtttattttg taaaaaaatt gtactttttt ttactttttt gagttaattt 1020 gtattttttt attgtaattt ttatttttaa atttttattt tatttgtatt tatatttatg 1080 ccttcataga actgctttta tgcattaccc tgaacaacgc cggttaagtg aaatagttaa 1140 agatgatatt ggcacaatgt ttaaacttgg tgttaaagtc tgttttttaa aagaatattt 1200 aaaaaataaa gaaagaaaaa tagtaacatc gaaagattta ttcaacataa ataaaaaaat 1260 tgaactagaa cgcaattgtg aaaaatcaaa tgaggcttta ttgatcgatg aactmatgtc 1320 cttctgtaag ccttctgctc taacttttct ttttcaattt ttgtatttta caggatttat 1380 gatttagagt atatatatag aggcagcaaa cgcctgatag ttttaaccta aatgtattca 1440 gagatttatt cagaggttta ttgtactttg agttaaatgg gttataattc aactatagtt 1500 taaataatac actttattaa tcaatttatt ttatttatat attatctaaa agttttagtt 1560 aaagtttgat ttttaatttc aggtgaaaaa gatcctgatg cagttatatc tgttcaagtt 1620 gattcagacg gtgttctgca gtttctttta ctaataagtg gagatatgca gcaggtgttt 1680 gatatgtttc ctgaagtgct aatgattgac tgcacatatt gcacaaacaa attaagaatg 1740 cctttattca ctctgttagt tgaggatgga aatggaacag gtcaggctgt tggttatgct 1800 ttcattgcac aagaaactga aaacactctg caagatgtgt tttcagaaat tgaacttata 1860 ctgaatctta accaagtaaa ggttgtcatt cttgacaagg acctgaaaga aataggtgca 1920 gtttccaagg ttatgcctac agcagaaatc caactgtgta agtttcatgt tttgcaagca 1980 ttttatagat ttttaaataa acaaggttta gctagtatag aaaaagataa tctaaaaaaa 2040 atatttaaat cgcttgtgta tgcaaaatca aagaaattgt ttgatataaa tttgggtcat 2100 cttgctacag ttgcaccaga agttgtaatg gcatactaca acaaaaattg gggcttgcca 2160 caccaagtta agtattgggc atactatgaa actatcaagt atgttaatct tggtaataca 2220 acaaataaca ggtccgagtc ccataaccaa aaaattaaaa acatattgtc taggaaaatg 2280 tccttagcag aggcagtaaa aggtatcctt ttgttacata cttgtaaatt tgaacaaatg 2340 gctcatattg agtttaatca atcttttaaa gagccatata gattaggtac tgatgaagat 2400 gtagttacag atgatattgt aaaagttcta actccatatg ctgctggaat tgtaattgca 2460 gaattaaaaa agtctcgcac agatttgtta gtgttaaata aagtatgttc gtgtcctttg 2520 caggttacta tgaaacttcc ttgtcggcat acaatggcag tgaaaatagt aaatggtgat 2580 gctattttta aagcagaaga tgcaggagag cgatggcatt taagctatca aaaaaaaagt 2640 tggctttctc gtaagaccag aagtttgcat aatcaggcta acaatgtttt tcatttttca 2700 actaataaaa ccgaaaaaaa ggcactaaaa gtgccatttt ctaaagatca gaaatttaaa 2760 actgtcatga atctatgtaa accaatttcg gattatttat ccactctcgg aatgaacgag 2820 tttgaagaaa aaattgattc cttaaattta atttataaat tgtggttagc tgggaaaaaa 2880 gttataattt ctgatatgga agcgcaagca ttgcctgaaa tctctaattt gcccaaagtg 2940 tctgctttgc ctgaaatctc agatttgcct gaaatgtcag attttctaga aacgtcagag 3000 ttgcccgaaa tgtctgattt gccagagatg tccagttctc caaacttttc tgaatctttg 3060 ttctctgctt taccatctag tctaaattta aatgaacaac ttcaaaatca aggaagcgtt 3120 ttatctccaa atcaatcttc tctaaatgac agcgtaaact tcaaagatct tcctacaact 3180 aaatctgaat gctgcaccgg ccaagcaata aatattttta ataatttagc aaatgtatgc 3240 ttacctaaag tccctccagt tgtaggaaat gtcaagatta aaaaagcaag gctcaaaagg 3300 aagcgggatt tgtataatga tccttaccat caatcgtgtc agcgaatttt aggcaaaact 3360 gctcataagg aacagttatt aaaaaagcaa ctctctgtca tagatcttca actattctcc 3420 attaaggact cattcgagat gttatctgat ttgcacataa ataaagcaca aaagctttta 3480 tccatgcagt ttccagatgt aaaaggatta caagacccta ttcttggatc aaaacttcat 3540 tttagagtgc aatctggaaa gtttgttcaa attttacata atggtgcaat gcattggctt 3600 accatttcaa attttttaag tacccatctt aactcagttg atgtttttga ttccttatat 3660 tatgacttaa gtgttagtgg aaagatgcaa gtaggatcaa tcatgatggt taaagaacct 3720 tttttgaaat tacagttcaa agatttccaa aaacaatctg gtgggcttga ttgtggtctt 3780 tttgcaattg cagcggcaac agatttatgt tacaattatg atccaagcat aaaatgctat 3840 aagcaagaac ttatgcgaga tcatttgtta aaatgcttca gtgacaatta cttaacgcca 3900 ttccctatag agactaatga acgaaaaaag aaaaaacaac ctaaaccaat taatttacca 3960 ctatattgca tatgtcgtct tcctgataat aaagaagaaa aaatggttaa atgcgacaat 4020 tgtaaggagt ggtttcacgt ttcatgcctt tccattccaa agaatatatt tgataattct 4080 gacaactctt tacattggca ttgtaattca tgcagctaac taaggtattt tatttaagga 4140 gcgggtattt attaattttt gaaaattttg ccacctatcc taagcgtatt aagaccccct 4200 acccccgttt attgatatac ttgttaacaa taaaaacgtt acatctcaaa actttaaaaa 4260 aatttaaaaa aatatatatt aataaccgat aaaaaataaa aacttttccc atcaaccctt 4320 ttattacgcc tgtccacacc ccgttaatta gatctgggat aaaattccca cttcctttta 4380 tttccctcca aaactccctg tatgaaataa gaataaatta ttttttttta ctttacctat 4440 cgatacctta tatgttttac aataccaatc gattattgtt cctttacgtt tgtttaccct 4500 aaggtgctta gggtttaagc atcttatggt aaataaacgt aaaggaacaa taatcgatag 4560 atattgtaaa atatataagg catcaatagg taaagtaaat aaaaaattaa ttttttattt 4620 ccttttctta catccatttt aaaagaattt gaaatgggtg tgcaaactaa atttttatgt 4680 tttagtatgt cgagatatca aagctaatca gtataaacgg tattatgtaa gtttaaccaa 4740 gtttctcttc tgtttcaagt tggtgaagat agtcatatcc gaggcgatga tgaaaatgat 4800 atttaaaaaa agttttgcta accagaatat aattattaca catttttctt gtgttttctc 4860 tgcagctgcg cgtattatac atttcttgaa atctgaatga gtttataaag agatatttta 4920 ttttagaagt ttaaaaacta ctctagccaa aagttgattt gtattaggaa tgccaaaatt 4980 ggatgtctga gtacctttca accattacta ctttatttga ataactcatt ctctaaatgt 5040 tataggttta taaaatatag aaccagcaag cggtacctat ygccttgacc catgtaaaaa 5100 aaaaagtatt acaaacttag agttaaaaaa caattagttg cacaatatac acagtaatta 5160 gttgcacagt atacacagta attgcgatta tataaaaata gatcaaatat aagtgtatag 5220 aagttaaaat ttaatattaa ctaagtcaaa atgtacaaca gaaaatataa tctattttga 5280 actttctatt tggagtttac ggcgtaatat atactatata tatcacgcct taacatatta 5340 gtgactatac ttgtctagac ttatcattgc agagaatgta taaatacaat taaaagaatt 5400 cagtatcgaa ccgcaaaatt gggttaagac actaatcacc taatatacga caaacgttta 5460 acaagtctta gttcaaacta cccttcactt tcacccttaa cacggttacc cttcaaaytc 5520 gccgtgtgcg tagtgacctt aaacagcgta gtgaacttat acaattagta caaaataatt 5580 aatagaattg attgaatcaa ttttgtaacg tcataaatcg gaccagttta ttgaggatct 5640 aagtaaagtc ttagaacaag ccgaataatt tgaataagaa gccgaaattt ttttgaaaca 5700 tgataaaaga cataactttt ttacaaatgc tgtagtaaaa aaactggaat tctctgttaa 5760 ctgccacatt agactaccac catcgctaaa aaattttaat ggtggactta gatattggct 5820 ggcgtagcta taattgcaaa ttttgtaaga aaatagggat ccttgcctgc ttttttccgt 5880 cagatgtgca ttttaaataa wtaggatgtg cattttaaac aaattttata ccgtgaattg 5940 ttgttgttgt tcttttaatt tcaattaatt tgtaaatgta tacatgtaat atacatatac 6000 atgcatataa tagtcgtgct aaaagccgtt tttattgttt taaatggttg tgtgaataag 6060 cttattttcg ttatttgcac gacggtcgtg craataagct tattttcgtt atttgcacga 6120 cggtcgtgcg aataagctta ttttcgttat ttgcacgacg gtcgtgcgaa tagactcggc 6180 c 6181 // ID Vingi-1N1_BM repbase; DNA; INV; 221 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of non-autonomous Vingi non-LTR retrotransposons - DE consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Vingi-1_BM; Vingi-1N1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-221 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX SQ Sequence 221 BP; 71 A; 59 C; 42 G; 49 T; 0 other; gggggtcgaa gttgcataac gtccgtagga gattacccac cgctacccta cacgtgaccc 60 ctgggtggtg ctaatagtgc aacaacctgg caaccgaact gtcctcctga ggatctcccg 120 gttccaacta cactgcgtca attataaaac cctaatcaca aactgtaaat ttcaattata 180 accacgcagt gtaaacgaat gccatacgat aaataaataa a 221 // ID Gypsy-256_AA-LTR repbase; DNA; INV; 130 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-256_AA_; KW Gypsy-256_AA-I; Gypsy-256_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-130 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1114-1114 (2011). XX DR [1] (Consensus) XX SQ Sequence 130 BP; 45 A; 14 C; 32 G; 39 T; 0 other; tgtggtgatg tgtgaatcta aatatcgaga tagtactgta tctgggatgc aagaaagtga 60 gaggataata aaaacagtaa tggcgagcaa cacgtttgtg tttatatttt actggactgt 120 aaatatcaca 130 // ID Gypsy-42_AA-LTR repbase; DNA; INV; 223 BP. XX AC supercont1.331; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_AA_; KW Gypsy-42_AA-I; Gypsy-42_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-223 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.331; Positions 569839 569617. XX SQ Sequence 223 BP; 75 A; 39 C; 46 G; 63 T; 0 other; tgtaatatcg ctttcgttta gtcgagactt tcgaccacag agtggaatcc ccaaaacaat 60 gtattatgga actgagtgga atgagagtag cttaccagag atgggtgagt tctagtttgt 120 gtataaaatg agcatgagaa atacatgagc ttcagaagtt ttgtaactat caactcgaca 180 agtgttcacc tacataattc tcccgaaaat accagataat aca 223 // ID BEL-7_DWil-LTR repbase; DNA; INV; 128 BP. XX AC scaffold_180739; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_DWil_; KW BEL-7_DWil-I; BEL-7_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-128 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180739; Positions 12743 12870. XX SQ Sequence 128 BP; 44 A; 29 C; 24 G; 31 T; 0 other; tgaaattact tgcagaatta aaaagcttcc cgcttaccgc ttaccgctta cctaaccaga 60 ggaattaacg gactaaattt atttcaaagg cactggcgga gctagattaa ctagtcgaac 120 gagcaaca 128 // ID hAT-21_SM repbase; DNA; INV; 3524 BP. XX AC . XX DT 13-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-21_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3524 RA Jurka J., Bao W. and Tempel S.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 70-70 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 762..3287 FT /product="hAT-21_SM_1p" FT /translation="MSQRYKSGCEKRKERQKKLEENARGRRTLENLGFITL FT SKSQQDEKQNPSESINAKESDGIPTLSLSTYVADPSSSRTEETTFPLSEES FT ANCEMEEASTSESMDVNLLNIDIGLLKLNLMRKDVEEAIQRGPESHPATFP FT VDKQGNQFPVNLLKLRLKNNEIVARDWLAWSKSKNALFCFPCRLFYNFRED FT QRSVLATELGWQPDRGYKKLYDRIPEHEKSTHHKSCYIQWRTVESNISKNR FT TIDCLLVQQIRNEIQKWTDLLKRFLDVTLFLAERGLAFRGSSHLIGDANNG FT NFLGILELVSRYDPLLEAHLKMVKQSQIEKQRLQVHYLSADIQNEFISCCA FT DYLRTCILRERETVKYYSVIVDATPDSAHIEQTTFILRYVSVNNHSDEYEI FT KERFLAFVNCNKKTGEDIANLILETLEKYNIPISECRAQGYDNGSNMSGSY FT KGAEARILQINPLAIFSPCACHSLNLCGVHAAESCPEVVTFFGTVQKLYNI FT FSSSPQRWEILKNNTGRSLHSMSNTRWSARIECIKPIAENIPAIKSAIDSV FT LELNLTWEIRSDLNGIKKYMESFEFLVMAKIWYKVLQAINYRNIVLQARDA FT TIDVEVANLSSLLDELQTIRDSWETLFNECKLFGSSLEINTDFETKRSKRK FT KWGNELEMTPSDKFRTNVFNVFLDCIIGNITRRFNAAKEIDSLFNVLWLFH FT NLNDEEIHTKSEKLQKVYENDIDDNLGEEILHLKSIYDANIQNKSLSPIEL FT LNIIKKMKLDVLFPNIIIAIRLFCTIPVTVAEAERSFSVLKRIKDVLRSTM FT SQHRLNDLGMLSIESEMAKKIDFQDVINLFARRKVRKATF" XX SQ Sequence 3524 BP; 1213 A; 561 C; 672 G; 1078 T; 0 other; caggcccgac aagagggggg ggttgacggg gaaaataccc ggggcccctg ggttctaggg 60 gccccggaga aaaaatgtag cccaacgcta aggtaagtga atttaaatat ttttaatatt 120 gtgggtttgt tgtgcgcaat taaccaatcc tgtgatgttt taaacagtaa ttggtttatt 180 cttaattcaa tccgtttcaa tctttacatc gacaatctta aactcttaat tcaatctgtt 240 tcaatcttta cataaatctt aaaatcttaa atcgtgtgga tcgcgtgact taaaaagcga 300 ctgtcgcacg tcgcagttgc caagggcatc gtcgcacttt gggagggata cgtcccgcta 360 cctctttcct gcatgccggt gcatgcggac gcgaacttcg gtcagcttac gccgcacaac 420 tggccacgca gtggtgtcca taacgtcgat cgcgatcgac cggtcgctcg cgaagccttt 480 aaactttaaa cctttaaatg aaataaatat gtgggaagaa gaataagatt tttttttgtc 540 ttcacattta gtcgatcttg gcaagacatt tttataatcg atcttcgaaa agtcgatcgg 600 tcaaaaatta gtggacacca ctgatttata tgattgaaac gtgaaacgtg tcgaccttat 660 tagcaaagcg ataatttgct taataacatg cagcatcatt aaaaaaaatt aattaaattt 720 taaaaaaatt aataaatgat attaatttat ggacagggat aatgagtcag cgttataaat 780 ccggatgcga aaagcgtaaa gaacgacaaa agaagcttga agaaaatgct agaggaagaa 840 gaacattgga aaatttgggt ttcataactt tatcgaaatc ccaacaagat gaaaaacaaa 900 acccttccga atcaataaat gcaaaggaat ctgatggaat tcctacactt tccttgtcta 960 cgtatgttgc ggatccttca agttcaagaa ccgaagaaac aacatttccc ttgtctgaag 1020 aatcggccaa ctgtgaaatg gaagaagcat ccaccagtga atctatggat gtgaatttac 1080 taaacatcga cattggattg ttgaaattaa atttaatgag aaaggatgta gaagaggcta 1140 ttcaacgagg tccagaatca catccagcaa catttcctgt tgataaacag ggcaatcaat 1200 ttccagtaaa tttgttaaaa ttgcgtttaa aaaataatga aattgttgct cgagattggc 1260 ttgcttggag caaatcgaaa aatgctctgt tttgttttcc ttgtcgttta ttttataatt 1320 ttcgagaaga tcaaagatca gttttggcta cggaattggg ttggcagcct gatagaggat 1380 ataaaaaact atatgataga atacctgagc atgaaaaatc aacccaccat aaatcttgtt 1440 atatccagtg gcgaactgta gaaagtaata ttagtaaaaa tagaacaatt gattgccttt 1500 tagtacaaca aatacgaaat gagatacaaa aatggacaga tttattaaaa cgattcttag 1560 atgtaacttt atttttggct gaacgaggtt tggctttcag gggcagttca catttgattg 1620 gagatgcaaa caatggaaat tttctcggaa tattagaact tgtcagtcgt tatgatccac 1680 ttttggaagc acatttaaaa atggttaaac aatcacaaat cgagaaacag agattgcaag 1740 tacattattt atcggcagac attcaaaatg aattcatatc atgctgtgcc gattatttga 1800 gaacttgcat tttgagggaa agagaaactg taaaatatta ttctgtaatt gtagatgcaa 1860 ctcctgattc cgcccatata gaacaaacta catttatttt acgatacgtc tctgtaaata 1920 atcattcaga tgaatacgaa attaaggaaa gatttcttgc atttgtaaac tgcaataaga 1980 agactggaga agatattgca aatttaattc tcgaaacgct tgaaaaatat aatattccta 2040 taagcgaatg tcgtgcacaa gggtacgata atggatccaa tatgagtggc tcatataaag 2100 gtgctgaagc tcgtattttg caaatcaatc cactggcaat attttcccct tgtgcatgtc 2160 atagtttaaa tttgtgtggt gttcatgcag cagaatcttg tcctgaggtt gtaacgtttt 2220 ttggtacggt acaaaaatta tacaatattt ttagcagtag ccctcaaaga tgggaaatat 2280 taaaaaataa tactggacgt tcattgcact ctatgtcaaa tacacgttgg tctgctagaa 2340 ttgaatgtat taaaccaatt gctgaaaata ttcctgcaat taaaagtgcc atagattctg 2400 ttctagaatt aaacttaaca tgggaaattc gttcagatct gaacggaata aaaaaatata 2460 tggaatcgtt tgaattctta gtaatggcta aaatttggta caaagtcctt caagctataa 2520 attatagaaa tattgtttta caagctcgtg atgcgacaat tgatgtagaa gtagcgaatt 2580 tgtcaagtct tcttgatgaa ttacaaacaa taagagatag ttgggaaact ctcttcaatg 2640 aatgtaaatt gtttggttca tctttagaga taaatacaga ttttgaaaca aaacgtagta 2700 aaaggaaaaa atgggggaac gaattagaaa tgacaccaag tgacaaattc agaacaaatg 2760 ttttcaatgt tttcttagat tgtattattg gaaatattac aagaagattt aatgcggcta 2820 aagaaattga ttcgttattc aatgtattat ggttatttca taaccttaat gatgaagaaa 2880 tacacacgaa gtccgaaaaa ttacaaaaag tatatgaaaa tgatattgac gataatcttg 2940 gtgaagaaat actccatttg aaatcaattt atgatgccaa tatacaaaac aaatctctct 3000 cgcccattga gttactaaac atcattaaaa aaatgaaact ggatgtattg tttcccaaca 3060 taataatagc aataagatta ttttgtacga tccctgtaac cgttgctgaa gcagaacgat 3120 cctttagtgt tttaaaaagg ataaaagatg ttttgagatc cacaatgtca caacaccgat 3180 tgaatgattt aggaatgttg tcgattgaat ctgaaatggc taagaaaatt gattttcaag 3240 acgtaattaa tttgtttgct aggcgcaagg ttcggaaagc aacattttaa aaaataattt 3300 gttttttgta cttttttttg taattttagt tgtaaacgtt ttcatttttg taagagaatt 3360 attgatgaat acattgttcc aataacaatt aggatatttt gcggtttccc gcttcattta 3420 taaatggccc aaagtccgag gggaaatttt tttttcacgc tcttggcagg ggcccccaca 3480 agaagtatat ccccggggcc ccagaagtcc tctccgcggc cctg 3524 // ID Gypsy-19_SI-LTR repbase; DNA; INV; 225 BP. XX AC AEAQ01023421; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_SI_; KW Gypsy-19_SI-I; Gypsy-19_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023421; Positions 408 184. XX SQ Sequence 225 BP; 76 A; 47 C; 61 G; 41 T; 0 other; tgtggagtta agaagtaccg cgtcgcagga cgaaagggga aaagagggcg caggcgagcg 60 aaggcgaagc gagcgcctgg caacaaggag agagacagag agagagagat cgaacacgaa 120 cacgaggaac gggactctaa ctaaatcagt ctaattcttt ccatgtacgc tatattgaaa 180 tatatcttta tccactattc aactcagtgt gacttcaatc ccaca 225 // ID hAT-71_HM repbase; DNA; INV; 3268 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-71_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3268 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 411-411 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 671..2815 FT /product="hAT-71_HM_1p" FT /translation="MEKRKLSGWEFKQKRLRKEEQLKSNQTSLEKFLLKTH FT HDKMAKESPLIHNIADSGNESKSTESTNSVETIQTIERLNETAATSEYFDD FT ISDWQENLNIDFIQDAVIKGPNSFKEGSNNYPRNVDNLSFSNHYFYRVMDN FT GEKNHRTWLVYSQKANKVFCFCCKLFCLNPIMLAQNGGNDWRHISRILKRH FT EKSLDHFNCYSKWVDLKNRLQKGTLIDKILDKNLIKEVEHWDNVMRRLISL FT TIMMAKQNMPLRGSCDKVYHDNNGNFLKILEFLAEFDPVMEKHLFRAKKKQ FT PNSVHYLAKDIQNEIIDLIGKSIKFKICSNVQKSVYFSIIVDCTPDISHQE FT QMSMLVRYVDVEDKNEIHINESFIGFHLIEGSTGLDLSNEIIDSLVKNNLS FT VENIRGQSYDNGANMKGVRNGVQAFILKKNNRAVYIPCCSHTLNLAVVDSV FT RCSPKIAMFFNIIDQVYNFFSASTKRWLILQSKCPSLTVKPLSDTRWESHI FT NAVKTLRFEMPKIYEALQDAAKTANETIAKITAESIALKITSFDFLCSIVI FT WHKVLFEINLTSKQLQAASTNISEALKAMERTINFFKENYKQEKFDSVVCI FT AQELASFLKVEESFAXESASRVRKRKRFFDEPEQNFEKQLCPKEKYKQDVF FT NKIFECATIAIEERFNDFNAGIAPFSFLFDIKKLSTVKEDDLKSHCIALXK FT KYLKMEKKIFLARN*" XX SQ Sequence 3268 BP; 1233 A; 427 C; 526 G; 1080 T; 2 other; caggcccggc ggaaggcggg tacaacgggt gattttcacc caggcgacta aggcgccaaa 60 tttatgttaa atattttttt ttttttttga gagtatgaaa atatataaat ttataggaat 120 aattatataa aaatttgtat acttatttat ataaaaatta gtctataata ttttatataa 180 aaaccataat tactcaaaaa aaaaattatt atttttcttt aatttattta ttgtacaatt 240 ttaaaaatgt ttcagccaat taaaaaaaaa tcatgtttga atttttccgc taaccaatca 300 ttgacattgt ctagtaaagt tgtttttact tgcggcgcaa aaacgaacca attcaaaaaa 360 agaaaagaat cattggatcg aaattaattt ctaaagaaaa gcaaacattt ctttaaaaaa 420 tcgtttgtgt tttaaacaag aaattttata aaagcgtttt aaattaaaaa agcgtttctt 480 tataaattaa tttagaaatt aaagctttaa aagtagcttt tttttttttt gcgttaaata 540 ttttttcgtt ctaaagtgtt ttagcgctct tatttgagag tttcaataat ttttttatta 600 tgatataaag ttatctaact attgctataa aattatatat aatatttatt ggtttctttc 660 aaagagaata atggagaaaa gaaaactttc aggttgggaa tttaaacaaa aacgcctacg 720 aaaagaggaa caattgaaat caaatcaaac atctttagaa aaatttttat taaaaactca 780 tcatgataaa atggccaaag aaagtccatt aattcataat atagctgatt ctggaaatga 840 aagcaaatca actgaatcta ctaattcagt tgaaactatt caaacaattg aaagattgaa 900 tgaaactgcg gctacttcag aatactttga cgatattagc gattggcagg aaaatttaaa 960 cattgatttc atacaagatg ctgttataaa aggtcctaac agtttcaaag aaggttctaa 1020 taactatcca agaaatgttg ataatttgag tttttcaaat cattattttt accgcgtcat 1080 ggataatggt gaaaaaaatc atagaacgtg gttagtttat tcacaaaaag caaacaaagt 1140 tttttgtttt tgttgtaaac ttttttgttt aaatcctatt atgttggctc aaaatggtgg 1200 taatgattgg cgccatattt cacgaatctt aaaaagacac gaaaaaagtc ttgatcattt 1260 taactgctac agcaagtggg tggatctgaa aaacagatta caaaaaggca ctttaattga 1320 taaaattttg gataaaaact taattaaaga agtagaacat tgggacaacg taatgagaag 1380 attaatttct ctaactatta tgatggcaaa gcaaaatatg cctctcagag gctcttgtga 1440 caaagtctat catgataata atggaaactt tttaaaaatt ttagaatttt tggctgaatt 1500 tgatccagtt atggaaaaac atttgtttag agcaaaaaaa aaacaaccaa attctgtgca 1560 ctatttggca aaagatattc agaatgaaat cattgacttg attggcaaat ccattaaatt 1620 taaaatttgc tcaaatgtcc aaaagtccgt gtatttttca attattgttg attgtacacc 1680 agatatcagt caccaggagc aaatgtcaat gctggtacga tatgttgatg tagaggataa 1740 gaacgaaatt cacattaatg aaagttttat tggatttcat ttaattgaag gttcgacagg 1800 actagattta tcaaatgaga ttattgactc tctggtaaaa aataacttgt cggtggaaaa 1860 cataagaggt cagagctatg acaatggtgc aaatatgaaa ggcgtaagaa atggtgtaca 1920 ggcttttatt ctaaaaaaaa acaaccgagc ggtttatatt ccatgctgca gtcacacatt 1980 aaatctggca gttgttgatt ctgtgagatg ttcacctaaa attgctatgt ttttcaatat 2040 aattgatcaa gtttacaatt ttttttctgc ttcaacaaaa agatggttaa ttttgcaatc 2100 aaagtgtcca tctttaacag ttaaaccgtt aagtgataca agatgggaaa gccacatcaa 2160 tgctgtaaaa acacttcgat ttgaaatgcc aaaaatttat gaagcactgc aagatgcggc 2220 taagaccgct aatgaaacaa ttgcaaaaat cacagcagag tctattgctt taaaaataac 2280 aagcttcgat tttttgtgtt cgatcgtaat ttggcataaa gttttattcg aaattaattt 2340 gactagcaaa caacttcaag cagcttcaac gaatatttca gaggctttaa aagcaatgga 2400 aagaacgatt aattttttta aggaaaatta caaacaggaa aaatttgact ctgttgtgtg 2460 tattgcacaa gaacttgctt cttttttaaa ggtagaagaa agttttgcak cagaatctgc 2520 ttcaagagta agaaaaagaa agagattctt tgatgaacct gaacaaaact ttgaaaaaca 2580 gttatgtcca aaagaaaagt ataaacaaga tgtttttaat aaaatatttg aatgtgctac 2640 tattgcaata gaagagagat ttaatgattt taatgctggc attgcacctt ttagcttttt 2700 atttgatatt aaaaagctct caacagtcaa ggaagatgat ttaaaatctc attgcatcgc 2760 gttgraaaaa aaatatctga aaatggaaaa aaagatattt ctggcgagga actaagtttt 2820 gaattaatgt ctctttcttc tcattttact gaaaatttgg atcctgaaaa tgctataaaa 2880 tatttgtacg caaatggttt gaatgaaatc tatgcaaatg tatctattgc attgagaatt 2940 cttcttacat taccaataac agtagctagc gccgagagat ctttttcgaa attaaaaatt 3000 atcaaaaatt atttaagatc gacaatgggc caggacaggt tgaacaacct tgcaatgatt 3060 tccattgaaa ataaaattgc cgctgattta aatattgaag atatccggtt gaaattttcg 3120 gaaaagaaag ctcgaaaagt aatattttcg aaataaatta ttttatcaaa agagttgtat 3180 taaattaagt cgaaaatata aactttgatt tataaaggcg ccaaaatatt tttttcaccc 3240 accttcggaa tgtctagcgc cgggcctg 3268 // ID mTA_Ele42 repbase; DNA; INV; 2251 BP. XX AC . XX DT 11-OCT-2010 (Rel. 15.1, Created) DT 11-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; Nonautonomous; mTA_Ele42. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2251 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2251 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~97% identical to consensus. This consensus CC is ~95% identical to the original sequence in [1]. TIRs are ~530 CC bp long. XX SQ Sequence 2251 BP; 776 A; 413 C; 361 G; 700 T; 1 other; cacggtgtgc cagaaaccgt tattaacaga aaaaaatgtc catgaattcg gcacctacca 60 ttcgaaagat atagtattca agcatcttct ccgtgaattt gaaaatattc taacgaggga 120 atcgaaagtt atagtgattt gaaggttttg agctattttg catgggatat tcacatgctt 180 ataaatattc ctcaacggtg catcattatt aacttgtaaa caagccaagt aaccatcagc 240 tttgcattaa gcattcaaaa catgtgatat ccaagcatct tccctgtaaa tttgaaatca 300 ttacgtcgag aaaatcagaa gttacagtga tttgtatatt taccattatt tctatgaagt 360 ctaaattaga gcttttctat ggctttgtta tattagtgca cagaatttac aaataaaaat 420 gtccatacga ctgacactcg gcattcaaaa gatgtattat tcaagcatct tcgcagctag 480 tttgagaata ctttatctag agacaacaca actaaagtac tttgaagttt tggacctata 540 atagagtaaa ttttgagtat taaattacca cgttcaatgt ttttaccact actaattgca 600 acaaattctt atcaatatgt tgatcaatgt gatttaatag cgtgattgac cgcctactgt 660 tgcttctccg ttattacaag agcaactata catgcacaag gaaccaacgg atgctactta 720 ggattagtcg cagcactctc agtgtgtaag taatggagtt ctcattcctt gtactaagca 780 gtaccgcgtc ctgctgcgtc cgaatacatg tcaattgggg ttatggaata aatatttacg 840 tgatactatc tttgaggaag ccgttggaat cctctgcact tccacgakaa gtcaatggga 900 gtttggaaaa ttggaagtgg ttatattaca gattcgtctt ggtaaacaaa acgcttaatt 960 acaatagtcc tagggatcat cacatcattt tcactggtta taccctgtta tacctccctc 1020 aaaacataat gaagtcatca acataatgaa tataataaaa taaaaagtaa cccttttata 1080 tgcacaaaac atgactgaag aatcaatatg acgatacctt tgatgacgaa ccattcaaag 1140 atttcgcatg atgcaaatca tgcagcatca atccaaaaag taatctatca gtatctaaag 1200 gaaaagtcga cgcttacagt atccaataat tctaggaacc aaacaaaata taaggcttgt 1260 tgcatttaca tattttgcaa aacatgaaac aaggttaatt tctacaataa actcctactg 1320 actattaatt gagcaaacac atatataaac gtatcccctt cgctgtcatg aataaaatgt 1380 gaaccatcat atttagctta tttaaaacac tagcataaaa cgatctcact agttgataat 1440 ccaacctcga ctgagcaaca tggtgccact ttcaaattgt tcaggttcac caatatgctt 1500 atttgggcca taacgaaaca ctgccccgca aaacgcgcgg aaatgaaaca aaattcacaa 1560 aaacgaaccc atttgcttgt gatttaatga aacactgcat ttcttcaatg taagatacaa 1620 gttgaatgca aatctgttaa gttaatttgt gtcgaaatta ttaaatgcgt gttgcgaaca 1680 ctatttcaga cgattgaaaa aaaaatctta atgactccaa aacttcaaag tactctagtt 1740 atgttgtctt ttgataaaat aatctcaaac taactgcgaa gataattgac tactatatct 1800 tttgaatgcc gagggtcagt cctatagaca tttttatttg taaattacgt acactttcat 1860 aacaaaggca tataaatcag ccctaattta aacttcatag aaataatcat aaaaatacaa 1920 atcactgtaa cttcagattt tctcgacgta atgatttcaa atttgcagag aaggtgcttg 1980 gatatcacat gttttgaatg cttaatgcaa agctggtgat aacttggctg gtttacaagt 2040 taataatgat gcaccgttgg gaaatattca taagcatgtg aatttccctt ccaaaatagc 2100 tcaaaacttt caaatcacga taactttcga ttccctcgtt agaatatttt caaattcacg 2160 gagaagatgc ttgaatgcta tatctttcga atggtgggtg ccaaattcat ggacattttt 2220 ttctgttaat aacggtttct ggcacaccgt g 2251 // ID Kiri-15_AAe repbase; DNA; INV; 4539 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-15_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4539 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 710-710 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 380..1039 FT /product="Kiri-15_AAe_1p" FT /translation="MQFMFQDMVNKTIGIIESCNRSLQDEITALRRNIQQL FT QEECRAEIVTLSNDMKKIENEMCSIAERTAAMERNNDLLLFGIPFHPSECL FT XGFLQSVCSXLGYGSEDTPLVFTKRLARAPVKTGTTPPILLQFAFKHDRCD FT FFRRYLSSRRLSLHHLGFSSQSRIYLSENLTRQXRVIKGAALKLKRDGKLH FT SVFTKGGIVFTKITADAAAVPILSLEQLPS" FT CDS 1542..4379 FT /product="Kiri-15_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPDSQTNALSNSCIPRAVMNSVLSSNFLNICHINMQS FT MCARQMSKFNEFTQYFVHSKIDIICVTESWLTSDINDDLIAVEGYNLLRLD FT RGYCRGGGICIYFKNDLRFKVLCRSELLPGVQYSNLTEYVFIDTQCEEGNF FT LLGVFYNPPNSDCSDLLFDKLSDLSVRYSNTVIIGDFNTDLNKNSRKTFKF FT KNTVESLGFTCANTFPTHFHPGGCSLIDLILSSNEDFVQTVQQVSAPAFSN FT HDIIFSSLNITRDATVSKFMYRDYKNVDMAGLTRAIDSIDWSMLESITDSD FT IAVDLFNRILCNLQDTFIPLRCSKPKRNPWFNSDIQKAMIERDVAYRLWVS FT DRSSHNHSQYKRLRNKVTQLIACAKSNLVSRRIDSSNSSKDLWKKLKQIGA FT TCTSSSKSNFENTGTEINDYFASNFTVDSSSISIPPQNIQGFQFTQIRDHD FT VILAIKSIKSNAIGVDNIPIQFIRLILPYVVRQISYIFNLIISTSKYPRAW FT KTSKVLPIRKKPSSDSLENLRPISILSSLSKAFEKILKLQIQEHLIYFDLL FT SEFQSGFRAGHSTTTALLKLHDDILSTVDKKGIAFLLMIDFSKAFDRVSHS FT KLLAKLSSKFMFSRAAVRLLQSYLSNRSQFVCVDGCYSDTVNILSGVPQGS FT ILGPLLFTMFINDLPSVLKYCHIHLFADDVQIYITSTALSSDELAAMLNED FT LSRVLEWSTSNLLPVNPAKTKAMLMSRNREVNQRPIILFGNSIIEYVDNVT FT TLGFVLQSNLEWEGHINAQCSKIYRGLRVLRITSSTLPISVKLKLFKSLLL FT PHFVYGDVLLLNASAIAMDRLRKALNSCVRYVFNLSRFSRVSHLQSHLLGC FT RFQDFHKIRSCNMLFKLLTKRSPKYLYTKLHSFQNSRNRNFLLPRFCTSHY FT RNTLFVQGIVFWNHLPSVLKNMTSVSGFRQECAALFNSQYQQ" XX SQ Sequence 4539 BP; 1277 A; 904 C; 827 G; 1517 T; 14 other; agtccggctc tggcaatgat cggatgtagt gagtgatcgc attktcgcgc agttttgtga 60 aatttactac gttttttgct gctgaaacaa ttggaatggc tggccaacct wactttcgat 120 gctgagtttt gctgccccga tggagttktg tcccgcwkat gctgattatc cgtatattgg 180 ctgtattttg ttgatgggca tctgaagcac cagctgcgat cttagagctt caaccgttag 240 cccakkttct catctgtcaa mctcaggctc tcaccgttag cccatccgtc tacctgcact 300 cactgtttac cagccatggg tggtagtggt actttggtat tagaacgaca tcaggatttc 360 gtcacaacat tcgactagaa tgcagttcat gttccaggat atggtcaata aaactattgg 420 cataattgag tcctgcaacc gatcgttgca agatgaaata accgcacttc gccgmaatat 480 tcaacaactt caggaggaat gcagagctga gattgttacg ctgtcgaatg acatgaagaa 540 aattgaaaac gagatgtgct ctatcgccga acgcactgct gccatggaac ggaacaatga 600 tttgttgctg tttggaattc cgttccatcc aagtgaatgt cttwctggat ttctacaaag 660 cgtttgttca gwgctgggtt acggcagtga agatactcca ctggtattca ccaagcgcct 720 cgctcgagct ccggttaaaa ccggtacmac tccccccatc ctgctccagt ttgctttcaa 780 acatgatcgc tgtgatttct ttcgacgata cctttcatct cgacgtcttt cactccatca 840 tctcggattt tcctcacaat ccaggatata tctgagtgaa aacctcacta ggcaagsccg 900 agttatcaag ggagctgccc tgaagctgaa gcgtgatggc aaactacact ccgtattcac 960 caaaggtgga atcgtcttca caaaaattac tgctgatgct gccgccgttc caatcttatc 1020 tttggaacag ttaccatcgt gaaccttgtc mtattaaaat aatctttcct tccgatcaaa 1080 cattccatgg ctccagtcct tatcattcct ttgaatattc cgctcctgaa agtaagcatt 1140 aattcaacct ttccaacaag cttctgcata ttcctcccta tgatttcctt gactccgttc 1200 cttcagtctt cctgtgtttt catccttcct aaaagttttt cacctttgga aatgctgcag 1260 aacccaccgt tactgttacc tgctgttgct gctgacgctg atgctgttgt tgatgctgtt 1320 attaatgctg ttgttgatgc tgatgttggc cttcctttca attgtttgtc attgataatt 1380 tttgatgatt tcacgaactg tttaagattg tttaaagatg aaattgttct caatgaattc 1440 atttgattta gattgtttgt tattcacttc tattgctgat tcgttgctgt tgtatggagt 1500 ggtatgttct tgttggggtt cgcttacatt gttcagtcta gatgccggat agtcaaacaa 1560 atgctctttc aaatagttgt attcctcgag ccgtcatgaa cagtgtgctc agttcaaatt 1620 ttttgaatat ttgtcacatt aatatgcaaa gtatgtgcgc tcgacaaatg tcgaaattca 1680 atgaattcac gcagtatttc gtccatagta aaattgatat tatctgcgtt acggaaagtt 1740 ggctcacaag tgatataaac gatgacctaa ttgctgtgga ggggtataat ctcttgagac 1800 tcgacagagg ctactgtcgt ggtggtggaa tatgtattta ttttaagaat gacttgaggt 1860 tcaaagttct ttgccgttct gaattattac ctggtgttca atattctaat ctgacagaat 1920 atgtattcat tgatactcaa tgtgaggaag gaaacttttt attgggtgtc ttctataatc 1980 cccccaattc agattgttct gaccttctgt tcgataagtt atctgacctt tcagtccgat 2040 attccaacac ggtaataata ggtgatttca acacggactt gaataagaat agtagaaaaa 2100 cgtttaaatt taagaacaca gttgaatcac tcggattcac ttgtgctaat acatttccaa 2160 ctcattttca tcctggaggt tgttccctaa ttgatttgat attatcatct aatgaggatt 2220 tcgttcaaac agtgcaacaa gtgtcagctc ctgcattttc aaaccatgat atcatttttt 2280 catcgttaaa tataacacga gatgcgaccg taagcaaatt tatgtatagg gattataaaa 2340 acgtcgatat ggcaggtttg actcgagcaa ttgattcaat tgattggtct atgttggaaa 2400 gtattactga ttcggatata gcagttgacc tttttaacag aatattgtgt aatttacaag 2460 ataccttcat tcctcttcgt tgctccaaac caaaacgtaa tccatggttt aacagtgata 2520 tccagaaggc gatgatcgaa cgtgacgtag cgtatcgttt gtgggtttcc gaccgatcat 2580 cgcataacca cagtcaatac aaacgccttc gtaacaaggt tacacagctt attgcttgcg 2640 ctaaatcaaa tctggtgtcc cgaaggatag attcctctaa ctcttcaaaa gatctatgga 2700 aaaagctaaa acaaattggc gcaacatgta catcttcaag taaaagtaat ttcgaaaaca 2760 caggtacaga aatcaacgat tattttgctt ctaactttac tgttgatagc agtagcatct 2820 ctattcctcc tcaaaatatt caaggatttc agtttacaca gatcagagat catgacgtca 2880 ttttggccat taaatctata aaatctaatg ctattggagt ggataacatt ccaattcagt 2940 tcattcgatt gattcttcct tatgtagttc ggcaaataag ttacatcttc aatctgatca 3000 tttccacatc caaatatcct cgtgcttgga agacttccaa agtcttacca atacgaaaaa 3060 aaccatcaag cgattctctc gaaaacttaa ggcccataag tattcttagt agtttatcca 3120 aagctttcga aaaaattcta aaattgcaaa tccaagaaca tttgatctat ttcgatctat 3180 tgtcagaatt tcaatctggg tttagagcag ggcacagtac aacgacagca ttgcttaaac 3240 ttcatgacga tattctttct acagtagata agaaaggcat tgcctttctt cttatgatag 3300 atttttcaaa ggcgtttgac agagtatctc attcgaaatt attagctaaa ttatcatcaa 3360 aattcatgtt ttctagagca gctgttcgtt tgttgcaatc atacctatcg aataggtcac 3420 aatttgtttg cgtagatgga tgttactcag atactgttaa tattttatcc ggagtaccac 3480 agggttcgat tttaggccct cttcttttca ctatgtttat taatgattta ccgagtgtac 3540 taaaatactg tcatattcat ttgttcgcgg atgacgtcca aatttacatt acatctactg 3600 ctctatcgtc ggatgaattg gcggctatgt taaatgagga tctctcacga gttttggaat 3660 ggtctacgag taatctctta cctgttaatc ctgctaaaac caaagccatg ttaatgtcaa 3720 gaaacagaga agtcaaccaa cgtcctatta tattatttgg aaattccata attgaatatg 3780 tcgacaatgt taccactctg ggatttgttc ttcaaagcaa tcttgaatgg gaagggcaca 3840 taaatgctca gtgttcaaaa atatatagag ggcttcgtgt acttagaatc acttctagta 3900 ccttaccaat ctcggtgaaa ttaaaactct ttaaatcatt gctgttgccg cacttcgttt 3960 atggagatgt tcttttattg aacgcctctg ccatagctat ggatcgttta cgtaaagctt 4020 tgaactcatg cgtacgatat gtttttaatc tttccagatt ctctagagtt tcccatttac 4080 aatcacatct cttgggttgt cgttttcaag actttcataa aattcgtagt tgtaatatgt 4140 tatttaaact tctaactaaa aggtctccta aatatttgta cactaaactt cactcgtttc 4200 aaaacagtcg taatcgcaat tttcttcttc cacggttctg cacgtctcat tataggaata 4260 cactatttgt ccagggtata gtattttgga accatcttcc ttccgttttg aaaaacatga 4320 ctagtgtaag tggatttcga caggagtgtg ccgcattgtt caacagccaa tatcagcaat 4380 aggtttagat tattttttta ttgtatatca ttagttcatt gtcagtttcg gatttgttat 4440 attagattcc atgaattcaa cgtgtgtagt atcaaaaaag gaagttcctt acactacatt 4500 attatgagaa ataaataaaa ataaaataaa taaaataaa 4539 // ID Kolobok-1_CB repbase; DNA; INV; 3212 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-1_CB. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3212 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 112-112 (2007). XX DR [1] (Consensus) XX CC Kolobok-1_CB is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the nematode genome in a CC last few million years. The Kolobok-1_CB transposon is CC characterized by 340-bp terminal inverted repeats, TTAA target CC site duplications, and it encodes the 418-aa Kolobok-1_CB1p CC transposase. See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS join(529..1134,1353..1533,1697..1917,2191..2436) FT /product="Kolobok-1_CB1p" FT /translation="MSEGKRRRCQFCWAMVPWKESTVITSSKTGREKWIEA FT LGEGFRNQLEKVGTRRAYICRSHFPSGEVHGRRSVYSLPRLAADGDVADTS FT NLDFSEGDNGFEIIEDSRVADNEESFVLLNTDGEEDGEDSEESYGEQETDS FT SFVPESEESWSDDSEVEEDETSGNLEYAFVDLNCLIDSLKYCRQCHSEAVE FT AAVLKPSGYAVSKIASMLHTVRAPYLRKYCYYDIVQHHVRPVVQNVHDGTQ FT KKVMDFVIKNCEDNQKGLDLAESSKSLEPKSLDLALSNLMSSLTAPDGSEL FT VHIDSITTDRDPAVGKLLNTKYPGIRQFYDGWHFVRNVKKIIWKQINGFVF FT NKFRKCEHDPMNHTRNDYIDVNNPKHEKALRLLWNMIVSDKRLKDLEKVSP FT HFNTSEVESFNSLSTIYHPKSYYL" XX SQ Sequence 3212 BP; 958 A; 598 C; 688 G; 968 T; 0 other; aggaggattc cgggcaaaat gactttgtgc ttaaatcgat gcaaaatatg ggcttatttc 60 cgaaaatcac aataaaaacg ctcaaaaaaa tttcctgtaa ccacaaaacg ccgccaaaca 120 ggccgtcaga ataaaagtta tcccttaggg actagttttt ttgaaaactc ggccaccggc 180 gccgcctcgt ccacgccctc cgcggccatc gattttattg acatttttca atgaattttg 240 gtcgtttttc atgttttttt cgaatattga tgatatttat tgagaaaaaa tcataaaaaa 300 tgaaccggaa aagataatta taccagattt cagcaataat ggctgatgtg tggtttgttc 360 ttgttatttt ttttctcata ttcgttgtct gcttcaaaat aaatcatttt tcaaagctgt 420 attcatttga agtgtacaaa tataataaaa ttatgatagc tggtgagaac cgaagagtct 480 gaatatgaga attagagaga attcatctga aaaattgtag taattaaaat gtcggaagga 540 aaaagacgtc gctgtcagtt ctgttgggct atggttccgt ggaaggagtc gactgttatc 600 acttcgagca aaacaggaag agaaaagtgg attgaggcgt tgggagaagg attcaggaat 660 caattggaga aagtcggaac tcgacgagcc tacatttgtc gttctcactt cccatccgga 720 gaagttcatg gaagaagatc tgtatattcg cttcctagac tcgctgctga tggtgatgtt 780 gctgatacat ctaatttaga tttctccgaa ggtgataacg gtttcgagat aattgaagat 840 tcaagagttg ctgataacga ggaatcgttt gtacttttga acactgatgg ggaggaagat 900 ggagaagatt cggaggagag ttatggtgaa caggaaactg acagtagttt tgtgccggaa 960 agtgaggaat cttggtctga tgacagtgag gtggaagagg acgagacaag tggcaattta 1020 gagtatgcat ttgttgattt gaattgtttg attgacagtc tcaaatattg tcgtcaatgt 1080 cactctgaag ctgtagaagc ggctgttttg aagccatccg gatatgcagt ctctgtgagt 1140 cgtgagcttt ctattagaaa acaatttttg ctagatctct ttcaactgtc tatcttgcaa 1200 tcatgattgg aaatggtctt catcaagaat tatcgaagga tcaaaggaat atgttgtaaa 1260 tcgagatctt gttagcgcag cggtttccac cggatcctcg tattcagtct gttatctaaa 1320 gctgagaaag gcttagaatt agtagatttc agaaaatagc ttcaatgctc cacactgtcc 1380 gtgcacctta tttacgtaaa tattgttact atgacattgt gcagcatcac gtccgacctg 1440 tagtccaaaa cgttcatgat ggtactcaga agaaagtgat ggactttgtg atcaaaaact 1500 gtgaggataa tcagaaggga ctggacttgg ctggtgacgc acaatttgat tcgcccggcc 1560 atatggcaga acattctcga tatgcgttgt tggacgtgtc tacgaatttt gtactggaga 1620 caaaactgtt gaagaagtca tcgagaacag gcaatttagt gtttcgggtt tctcttgttt 1680 atgtttgttt tatcagagtc gtcgaagagt ctggagccca aatctttgga tttggcactc 1740 tcgaatctaa tgagctctct cacagcccca gatggttctg aactggttca tattgattct 1800 atcaccaccg accgtgatcc ggcagtcgga aagttgctca acacaaagta ccccggaatc 1860 cggcagtttt acgatggttg gcactttgtg agaaatgtca agaaaatcat ctggaaagtg 1920 aggttttcgg gaattcctag gagaactaat ttcgcagtag tgcgaaaact ttgtacgttt 1980 ttagaagaaa aatcaagttc aaatgcagcc agtcaaaaat tggattcgcc cacccacaaa 2040 tcatcttcac cattcttttg ccacctcggg gggagatggt aaattgactg tagagaaatt 2100 catttcgttt ttctgtcatt gtcaaaacat gcacgaaaat tttacagtaa gaataagtag 2160 ggaaacacgc tttgagaact acaattacag caaattaacg gatttgtgtt taacaagttc 2220 cggaaatgtg agcacgaccc aatgaaccac accagaaacg actacattga cgtgaacaat 2280 ccaaagcatg agaaggcgtt gcgactgttg tggaatatga ttgttagtga caaacgactg 2340 aaagatctag aaaaagtatc acctcacttc aatacctctg aagtagaaag tttcaattcg 2400 ttgtccacta tttatcaccc aaaaagctat tacttgtgag tcctacagag ctagaaaatt 2460 tgaataataa tattctagcc acaaaaactt ccctctccgc gttcagttga ccgtacttca 2520 ctggaactct ctcaaaatag aatggtataa caatgagaga cactatactg ggctaaaagc 2580 attctacaac aaatcaaaca aggaagcagt tttcaaaaaa aggaagtcgt cgggatctca 2640 ttcttggcgt cgagttgttc tagaaggggt tagatcccac caaacccagg cctctactca 2700 taacgttccc ctatcggttt ctcccgaccc aaatcacatg actgatgatt attcttctcc 2760 taataactat tcttcgtctg acgatgacat gatctgaagc tcccagttct ctctttcttt 2820 ctctcttctc tttttcccag tgttttgtta ttttccgttt gtttctaacg ctattattgc 2880 tgaaatctgg tataattatc ttttccggtt cattttttat gattttttct caataaatat 2940 catcaatatt ggaaaaaaac atgaaaaacg accaaaattc attgaaaaat gtcaataaaa 3000 tcgatggccg cggagggcgt ggacgaggcg gcgccggtgg ccgagttttc aaaaaaacta 3060 gtccctaagg gataactttt attctggcgg cctgtttgac ggcgttttgt ggttacagga 3120 aatttttttg agcgttttta ttgtgatttt cggaaataag cccatatttt gcatcgattt 3180 aagcacaaag tcattttgcc cggaatcctc ct 3212 // ID Gypsy-46_CQ-LTR repbase; DNA; INV; 556 BP. XX AC AAWU01015436; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-46_CQ_; KW Gypsy-46_CQ-I; Gypsy-46_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-556 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 472-472 (2011). XX DR GenBank; AAWU01015436; Positions 1768 1213. XX SQ Sequence 556 BP; 184 A; 96 C; 127 G; 149 T; 0 other; tgttaccgtt acgacatcag caatccattt tgctaaccat cgcttatggt tgcaaatagg 60 agtagggttt attgctataa attaacgatt taacttgtta gtagggcttt aagaattagg 120 ggtgtttgtg ttgttttgtt taaaatgatt tgtcattcgt aaaaattaaa aggttaattc 180 acaaacatat tacaacagag cacagtctag ggaaaaagat caaataaagc acacgttact 240 cgcgaaccga agggaaccat atttgtaaaa acactataac agcacacacg ttgggggggt 300 atacatcgtg acccaaaaag ggagattgcc agttggatta agggcatgca tgggagaagg 360 aaaaagggaa taaaagaggt tttgtggcga gagcgagcca acaattgatc ttgggttcac 420 tgattaggcc ttttccacac caactactga tcgaaccaca tcgtcatccc gtcatcaagt 480 gatcaagtgc aaatctatac ctactgcgtt gtgagaagtg gaaaagttgc ttccacggga 540 gaaagcaatt gtatca 556 // ID Gypsy-78_CQ-I repbase; DNA; INV; 4629 BP. XX AC AAWU01003709; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-78_CQ_; KW Gypsy-78_CQ-LTR; Gypsy-78_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4629 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 535-535 (2011). XX DR Genome; AAWU01003709; Positions 41695 46323. XX CC Positions [3454-3954] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 84..3683 FT /product="Gypsy-78_CQ-I_2p" FT /translation="MDGEDHLINPGDGSGNGTNGQQAGPGLRPFLPPNPRQ FT PLIRPPQFQPQNHQQQAQVTTDVLMLQVLQQIAQQLQQQQAMYQQQNQTFL FT QQQGQMFRNGINVQVPPNPEQLLDSLASNIKEFRFEPDQNVTFAGWFARYK FT DLFEQDAVRLDDPAKTRLLLRKVGPTEYERYCNYILPQSPNLIDFDTTVAK FT LKSLFGTTESVISRRYRCLQVTKQATEDYKSYACRVNKLCVDFELGRLSEH FT QFKCLVFVCGLKAEKDAEIRTRLLSRIEENAEVTLEQISDECQRILNIKHD FT TAMIESASSSSSAAVQAVKRNQQYGKRSFRSNNEAAPRSTKPNGKTPGSPC FT WNCGSMHFSWDCPFAKHRCRDCGQVGHKDGYCSSARKSSKPGQRSSKPVET FT KSVVVRNVKKRRKFVQAQVNGRRVALQLDTASDISVISEQTWREIGKPAPK FT PATVQAATASGKPLKLEFQCATEININGVVREGRFYVVKQQLNLLGLDLID FT EFDFWSVPINQFCNQVSSFPSTLGAVKAAFPAVFNDAPGLCTKAKVKFSLK FT DGQTPVFRPKRPVAYAMCKTVSDELERLEHAGIISPVEYSEWAAPIVVVRK FT ASGAVRICGDYSTGLNDALHPHQYPLPIPQDIFAGLANCKVFSQIDLSDAY FT LQVEVDEDSRELLTINTHRGLYRYNRLSPGVKPAPGAFQQIVDTMMAGTNG FT AAAYLDDILVGGVDEADHWRNLQAVLKRIQDYGFTIKAEKCSFAQRQIKYL FT GHLIDEHGLRPDPAKLDVIRNLPAPKDVSGVRSFLGAINYYGKFVPSMRTL FT RYPLDELLKTTNKFVWTAKCQEAFERFKAILSSDLLLTHYDPAQEIIVSAD FT ASSIGIGATISHRFPDGSIKVVQHASRALTSTEQAYSQPDREGLAIIYAVT FT KFHKMLFGRRFLLQTDHAPLLRIFGSKKGIPVYTANRLQRYALQLLLYDFK FT IAHVSTEKFGDADVLSRLIDQHAKPDEDFVVASITLEEDVRSIVKESINAL FT PLSFRVVQQHTKSDPVLRKVLQYIQQGWPKSKTAIADRELKVFFDRRDSLC FT TVQGCVTFAERLAIPSAFRARCLQHLHRGHPGIERMKALARSYVYWPSIDA FT DIAGHVGTCRHCAAVAKSPPKAPPQPWPKSTFPWQRVHVDYAGPIEATTSC FT SVSTRIQMGRNHQNQIYHSFCDYRHCQKSVRSPWYARNSGQ" FT CDS 3664..4617 FT /product="Gypsy-78_CQ-I_1p" FT /translation="MPETLVSDNGTQFTSAEFGQFCLENGINHMTTAPFYP FT QSNGQAERFVDTFKRAVKKIREGRGTINEALDTFLLTYRSTPNPSAPDGKS FT PSEAMFGRKIRTSLDLLRPPAVPVVKANEEQNAKRSIQKGDCVYVKIYSAN FT AWRWAPGVVLERLGRVMFNIWAENKRMIRSHLNQLRFRTTDGQRGLGVQGA FT KPSTKLPLDILLKECILQRPSTASKNTASPASSPQSAPNTSSLGVPSEQST FT PIQSPQATAEATNWESPRSPSFRSATASPAAVPSTSKPVSSPEFLSASENE FT PAVPFKPRRSSRHRRVPIRFDPYQLF" XX SQ Sequence 4629 BP; 1167 A; 1323 C; 1222 G; 917 T; 0 other; cctgtcgcgc tcgtaaaatt ggcgacgagt aaggtcgtgg aaatcaccgt gagaaattgc 60 ctcccccgag gaagcggagg aggatggacg gcgaggacca cctgatcaac ccgggcgacg 120 gaagcggtaa cgggacaaac ggccagcaag ccggtccagg actacgcccc ttcctgccac 180 cgaacccgag acaacccttg atccgtcctc cgcaattcca acctcaaaat catcagcagc 240 aagctcaagt taccacggac gtgctgatgc tacaagtcct ccagcaaatc gctcagcagc 300 ttcaacaaca gcaagcgatg taccagcagc agaaccagac attcttgcaa cagcaaggcc 360 aaatgttccg gaacggcatc aacgttcaag ttccacccaa tccggagcag ctccttgact 420 cactagcatc aaacatcaag gagttccgct tcgaaccaga tcagaacgtc acgttcgctg 480 gttggttcgc tcggtacaag gacttatttg agcaggacgc cgtgcgcctg gacgacccgg 540 cgaagacgcg tctgctgcta cggaaggttg gtccaactga gtacgagagg tactgcaact 600 acattcttcc gcaaagcccc aacctgatcg acttcgacac cactgtggcc aagctcaaaa 660 gccttttcgg caccacagag tcggtgatca gccggaggta tcgatgtcta caagtcacca 720 aacaagcaac cgaggactac aagagctacg catgcagagt caataaactg tgtgtggact 780 tcgagcttgg acggctctcg gaacaccaat tcaagtgtct cgtgtttgtt tgtgggctga 840 aggcggaaaa ggatgcggag atcaggacaa ggttgctcag caggatcgag gagaacgcgg 900 aagtcaccct ggagcagatt tccgacgagt gtcagcggat cctgaacatc aaacacgaca 960 cggcgatgat agaatcagcg tcgtcatcgt cttcagcagc ggttcaagcg gtgaagcgga 1020 atcaacagta cgggaagcga tcgttccggt cgaacaacga agctgcgccg cgctcaacca 1080 agccgaacgg gaaaacccct gggtcgccat gctggaactg cggttctatg cacttctcct 1140 gggactgccc attcgcaaag catcgctgca gagactgtgg tcaagttggt cacaaggacg 1200 ggtactgttc aagtgcgagg aagtcatcga aacccggcca acgctcgtca aaacccgtag 1260 agacgaaaag tgtcgtggtg cggaacgtca agaagagaag aaagttcgtg caggcgcagg 1320 tgaacggccg acgcgttgct ctccagttgg acacggcgtc cgacatcagt gtcatctcag 1380 agcagacgtg gagagaaatc ggcaaacctg caccgaaacc tgcgaccgtt caagctgcaa 1440 cagcatccgg caagccgctc aaattggagt tccagtgcgc caccgaaatc aacatcaacg 1500 gtgtcgtgcg tgaaggacgg ttctatgtcg tgaagcagca actcaattta ctggggctcg 1560 atctcatcga cgagttcgac ttctggtcag tgccaatcaa ccagttctgc aatcaagtca 1620 gcagtttccc atcaacactt ggcgctgtca aagcagcttt tccagctgtt ttcaacgatg 1680 ccccaggact gtgcacgaag gcgaaggtca aattcagtct caaggacggc caaacacctg 1740 tgtttcggcc gaaacgacct gtcgcctacg ccatgtgcaa gacagtaagc gatgaactgg 1800 aacgcctcga acacgccgga atcatctcac cggtggaata ctcggagtgg gctgctccga 1860 tcgtggttgt ccggaaagca agcggagcgg ttcgaatctg cggcgactac tcgacgggcc 1920 tcaacgacgc cctgcatccg catcagtacc cgctcccgat accgcaggac atcttcgccg 1980 gactggcgaa ctgtaaggtg ttcagccaga tcgatttatc ggatgcgtac ctacaggttg 2040 aagttgacga agacagccgg gaactgctca cgatcaacac gcaccgaggt ctctatcgct 2100 ataatcggct ctcgccaggt gtcaaacccg cacccggagc ttttcaacag atagtagaca 2160 ccatgatggc aggtaccaac ggagctgcgg cctacttgga tgacatcctg gtgggtggcg 2220 tcgacgaagc cgaccactgg cgcaacctgc aagcagttct caagcgcatc caagattacg 2280 ggttcaccat caaggcggaa aaatgttcat ttgcccagcg ccagatcaag tacctgggtc 2340 atctcatcga cgaacacggt ctgcgccctg atccagccaa gctcgatgtg atccggaacc 2400 ttccagctcc aaaagacgtc tccggcgtga gatcgtttct gggggctatc aactactacg 2460 ggaagtttgt tccgagcatg cgaacgcttc gttacccgtt ggacgaacta ctcaagacaa 2520 ccaacaagtt cgtctggact gccaagtgcc aggaagcgtt cgagaggttc aaggcgattc 2580 tatcgtccga cctgttactg acgcactacg atccagctca agaaatcatt gtgtcggcgg 2640 atgcgtcgtc gatcggcatc ggggccacga tcagccacag gttcccggac ggatcgatca 2700 aggttgtgca gcatgcttca agagcgttga cctcgaccga gcaagcctac agccaaccag 2760 atcgcgaagg actagccatc atctacgctg tgaccaagtt ccacaaaatg ctttttggtc 2820 gcagattcct tctccaaacc gatcacgctc ctcttctccg aatttttggt tcaaagaagg 2880 gaataccagt gtacacggcg aaccgtttgc agcgttatgc gttacaactc cttctgtacg 2940 acttcaagat tgcccacgtc agcacggaga agtttggaga tgcggatgtc ctttcgcgcc 3000 tgatcgacca gcacgcaaaa cctgacgagg atttcgttgt agccagcatc accttggagg 3060 aagacgtcag gtcgattgtc aaggaatcaa taaacgcact tcctctcagt ttcagagtcg 3120 ttcaacaaca caccaagtct gacccggtac ttcggaaagt cctccagtac atccagcaag 3180 gttggcccaa gtcaaagacg gccattgctg accgggagct taaggtgttc ttcgaccgaa 3240 gggactcgct ctgcacagtc caagggtgcg tcactttcgc ggaacgactc gcgattccat 3300 cggcgttccg ggctcgctgc ctccagcatc tgcaccgcgg ccatcccggg attgagcgca 3360 tgaaggcgtt ggctcgtagt tacgtgtact ggccatcaat cgacgccgat atcgccggac 3420 acgttggaac gtgtcgtcac tgcgctgcag ttgccaaaag tccgcccaaa gcaccaccgc 3480 aaccatggcc aaagtcaacc ttcccgtggc agcgggtgca cgtggactac gcaggtccga 3540 tcgaagcgac tacttcctgc tctgtgtcga ctcgtattca aatgggtcga aatcaccaaa 3600 accaaatcta tcacagcttc tgcgactatc gccattgtca gaagtctgtt cgctcgcctt 3660 ggtatgccag aaactctggt cagtgataac ggcacccagt tcactagtgc cgagtttgga 3720 cagttctgct tggagaacgg aatcaaccac atgacgactg ccccgttcta cccgcagtcg 3780 aatggccagg cggagcggtt tgtcgacacc tttaaaagag cagtgaagaa aatcagagag 3840 gggagaggaa cgatcaacga agcactggac accttcttgt tgacgtacag atcaacgcca 3900 aacccgtcag cacctgacgg taaatcgcct tctgaagcca tgttcggtcg caagatccgg 3960 acgagcctgg atttgcttcg accgcctgct gtccctgtgg taaaggccaa cgaggagcag 4020 aacgccaaac gatcgatcca gaaaggtgat tgtgtgtacg tgaaaatcta ctcggcgaac 4080 gcctggaggt gggcgcctgg agtagtcctc gagcgcctcg gacgggtgat gttcaacatc 4140 tgggctgaga ataagcgaat gatccggtca catctcaatc agctgcgatt ccggacaaca 4200 gatggtcaac gtgggttggg agttcagggt gccaagccaa gtacaaaact gcctctggat 4260 attctgctga aggagtgcat cttgcagcgg ccaagtacag catcaaagaa caccgcctcg 4320 ccagcttctt ccccgcagtc ggcacccaac acatcatcgc tcggagttcc ctctgagcaa 4380 tcaacaccga tccagtcacc gcaagccacg gctgaagcga cgaactggga gtcaccacgg 4440 tcaccatcct ttcgttcagc gactgcatcc ccggcagcgg tgccatctac ctccaaacca 4500 gtctcttcgc cagagttcct gtcagcaagc gagaacgaac cagctgttcc gttcaaaccg 4560 cggcgctctt caaggcatcg gagagttccg ataaggtttg acccatacca gctgttttaa 4620 ggagggaga 4629 // ID DNA2-2_AP repbase; DNA; INV; 699 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-2_AP. XX NM DNA2-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-699 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1735-1735 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 2 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 699 BP; 275 A; 99 C; 70 G; 253 T; 2 other; tattcaaatt ctgattttta gaatttttaa gtacacctaa agaccatatn ttcaaattcc 60 tgatattttt tgtactacta ttaaagagtg tcctgtggct atacaaactt ctatttttca 120 aatgagaatt tcctttttta ctgcaaatta tttagcagat gatgttcttg taaatgtaga 180 tgtatctaaa tcaaaattct aataagtagt tgctgaatta ttaaaatgtt tatactaagg 240 ataataatcc ttaaaatatg gttttacaat aaaaaactta aaatacctaa tattagatgt 300 agtacctact tattcactat tatcagtata cctacttaat aatttttaca aattatttat 360 cttgatagat caatggtaat tagtactaac aactaactac taacattttc aaatcatcta 420 agcccccaaa tctacttaac taatntccat atacgtatta tccttagtac acaaagttat 480 actataactc aaaaactact agttcgaatt ttgatttaga tgcatcaaag ttctcaaaaa 540 aaaaatatct gctgaataat taaaagtaaa aaaagaggat tcctatttga aaaacagaag 600 tttgtattgt aacaacatca tccttaaatg gtataaaaaa ttcaagtatt cgaaaatctt 660 caagtgtatt taaaaattcc aaaaatcaga atttgaata 699 // ID Gypsy-15_IS-LTR repbase; DNA; INV; 132 BP. XX AC ABJB010373042; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_IS_; KW Gypsy-15_IS-I; Gypsy-15_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-132 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010373042; Positions 9292 9161. XX SQ Sequence 132 BP; 30 A; 38 C; 30 G; 34 T; 0 other; tgtggtatga aatgacctgc cacctactga gcagcacgcc tacctgctgg tggtagacga 60 cgacattaaa gattcttata cgtggcactc tggcgtgcag tagcctctcg ttctttctcc 120 gccaacctaa ca 132 // ID MOGWAI2_EI repbase; DNA; INV; 3023 BP. XX AC MOGWAI2_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Mogwai-Ei2, a new member of the Tc1/mariner DNA transposon DE superfamily from the single-celled eukaryotic reptilian parasite DE Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; MOGWAI2_EI; KW Mogwai-Ei2. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-3023 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; MOGWAI2_EI; Positions 1 3023. XX CC Mogwai-Ei2 (MOGWAI2_EI) is a member of a new clade of Tc1/mariner CC elements found in E. invadens. The TIRs of Mogwai-Ei2 are 34-bp CC long CC and are flanked by TA TSD. The element contains several CC degenerated CC ORFs encoding protein fragments related to those encoded by other CC Mogwai elements found in E. invadens. Phylogenetic analyses of CC the central domain of the putative transposase of Mogwai elements CC suggest CC that these elements belong to a clade of Tc1/mariner elements CC distinct from previously established eukaryotic clades of the CC superfamily (e.g. mariner, Tc1, pogo?). See Mogwai-Ei1 CC (MOGWAI1_EI) for further description of Mogwai elements. XX SQ Sequence 3023 BP; 1164 A; 394 C; 490 G; 975 T; 0 other; tattcctttt ccagttaaag aattaacgat acccccatat attaatataa ataaagcatt 60 taaaaaaatt aacaattttg ggcaaataaa attataaaaa aaaataattt gaagaactct 120 taaaactcat tattataatc acaatgaaag aatttggtta tttttttcat ttgaagaaag 180 aagggcctat cgaagaaata acagaaaaaa ggtatttgag atgtatggta tgactgcatc 240 gccattcaat atagggcacg ttaaaaaaga agaaattatt gaagaattaa aagaacaaga 300 atttgaatat ttcccaatat tcaaaatacc atcagaagac atagagtttg aatattgtgg 360 aaagtattta ataagtaaat tacttgatga tcatttcaac acctacaatg atacaatcaa 420 aaaaatgtcc aaaacaaata ctttaacaat agaaccttta gaaaacttgt ttccaacaca 480 gttagtgtct tctacttcaa tcaaattaaa caaaaatgaa aatgaaaaaa ggaaattatt 540 atatgttaaa atactcattc attatttggg gtgtcgaaat gctcataata cggccaatgt 600 ttttaatgtt tgtccccaaa cagtgattaa tatttatagg agagccatga ataatgaaga 660 tttattgggg caacagagag gtccaaaggt tggacaaaac aacaaagttg attatcatac 720 attgactctt ataatcgacg aattgagaaa gaacaaccag ataacattga aaaatttgac 780 aaaatatgtg aatgaacaca gactggagtg gatgaaagaa gttgcgaaaa ttgaaatgga 840 acttggtgca gatgaaaatg ttattaacaa aatgtttaag caatcaaaat tatcaaaatc 900 tactatatct atacttgtgc aaaaacttgg ttggtcatta aaaaaacgtc aagagaaaca 960 tattcgataa aaattcagaa atccgtattc aacaaagaat tgtgtatgct gaacacatag 1020 actttatcga agtgtttaaa tatcgtgttg catatattga cgaagtgggc ttcaatttga 1080 atcaaagaag gagtaatggt tattcgctta ttggaaagtg ttgtagtgta actttaaacc 1140 acatcagaaa acccaacata acagtgtttt gtatgatggt tcctgaaaag aaacttgttt 1200 ttcgaattac aaaaaaaagt tcaaacaaaa attttattga tacagttgaa cacgtcttta 1260 ttcctcaatt tcaaaagtgg tgtggagact gttttttaca tttgatttta gataacgcat 1320 caatccataa aaaacaatga ctgaacttag tcgaaaatat ggtatttatg ttacatttct 1380 tgtaccttac agtccccaat taaatgccat agaaaaatgt ttttctatag caaaatctta 1440 tttgaatcaa gtattgatgg atgattctgt ttttaacgaa gcgataaaaa catcaagaaa 1500 ttatcaacat tttttaatta attgccatat accaagatca attaataatg attttttttt 1560 cgtgtttaat cacagcgtct cttcttaaga taactgaaga acacacttct aaacttcatc 1620 taaaaagagc agagtggatt tcaagggcaa gagaaggagt ttattttgac aatgatgaca 1680 catgcaataa aaaagtgatt gaagtcgaag ttccggtgtt tgaagatcaa tccgcaatgc 1740 atattgaaag tggaacctat tatttggcaa ctgttacata tttgaaccgg gttaatgcag 1800 tccacgtact ttccgattat attatacaag ttgacgacac gaattgtaca ccatctgcac 1860 aaatttctat ttgataaagc gtattttgtg aaaatacgct ttatcaaata gaaatttctc 1920 cggtgttgga agataccaac aacatacatg aattgatgac cgaaaataac gaatgttcaa 1980 aaaacttggt gttgacttgt agtgacaaac aagattttca ctttcatttc gaaattgacg 2040 aaaatacata cagtcaaaat aaaataatag attgtgatta tgaagcaaac aaaactgtcg 2100 atacattaga attatctgac atacatatta ttaatgaaca atcaacaaca tatataagtc 2160 aaatggaaat tgatgatccc ggcaacaaaa tcaactggaa aaatgttcga caacaacact 2220 attctttaaa ttctatgttt gaatatttag tgttttttaa ctcaagaagt gacatcaaag 2280 aattaatcga aataattaaa acgcaacgat tgaatacatt aatacttaat gcattttgtt 2340 tttggacaaa attgagaaac ctttatgagt ggggtaatcc tgtttttttg tttgattggg 2400 ataacttctg ggatggagtg ataaacgaag aaagaataag aggacagatg gatgaaatta 2460 tcgacgaaga tgttattctt attcctgtgt attatgcaga acattttatg ttaattggtg 2520 tatttcttaa acattaacct ttttggccac tttcaatagc tatgaagggt gtggaatcac 2580 gtttttaaac caagttgtat tgttcattaa aaatgaattt attaaacgag aagtatataa 2640 atgggttgat gaaattgaga gagtgatagt accagaacaa caagatggaa tttcatgtgg 2700 ttcatttgtt tcttatttca tggaaaccat ttgttctcgg tatttgaaaa ccgtcgaaga 2760 attaaataac acttttacat ttgaaaaggc agtgaatttt agaaatgaaa tacttttgat 2820 aatgagggaa gactttgcct acaaggtaat gtgtttgctt aattgatatt ttcttttttg 2880 ttttatgatt aagtattgtt attgtttaat attatagtat ttaaattgtt tttttaaatt 2940 cccaaaacta cttcaatcaa caaaaaaagc taaatatata aataaaaggg ggtatcgttg 3000 attctttaac ttgaaaagga ata 3023 // ID Gypsy-129_AA-I repbase; DNA; INV; 4443 BP. XX AC AAGE02025245; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-129_AA_; KW Gypsy-129_AA-LTR; Gypsy-129_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4443 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025245; Positions 70617 66175. XX CC Positions [3364-3825] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 94..4392 FT /product="Gypsy-129_AA-I_1p" FT /translation="MENPEFMQALSTMIAQALRASIGNAIEQVNVDRPVPA FT GDAAAPSPKVPPFMMPEYRSTEGTSVEDYFKRFKWALQLSLIPEAQYANYA FT RVHMGADLNNALKFLVAPQDPADIAFEDMLTTLVNHFDCKKNKFVESVKFR FT QIIQQKGESVAQFALRLKQGAAYCDYGDFLDRMLIEQLLHGLEARDICDEI FT IAKNPDTFDAAYKVAHTMEATRNTAREINTGSPSTHPEATHKLGYEKPKTK FT KLQQPQHRSPPVHRKQQENRPSNKQQAKAHPNSAGGSSFCNGCGEQHLRSN FT CRFRNAKCNICDKKGHIAKVCRSGKSQHSSADQVQSQETPATEVDLVQSLS FT QVHTVPCIEKKMIDVEIDGRRLQMELDTGAPCGIIGESKLRRIKPKFSLLP FT TDRQFSSYSGHRIHCLGRLPVHVSLGSTTRRLNLYVVSGESDSLFGRDWIA FT HFADQINLNQIFSQGTLINSITNTELTQDQEAHLSKLLDNFEGTFSDVPGK FT LKGPPAKVHLKPGASPVFAKARDVPFALRDRYAAEIDKKLASGFYEKVEFS FT EWASPTHIVIKNNGGIRITGNYKPTVNPRMVIDEHPIPKIETLFNKMKGAT FT LFCHLDVTDAYTHLPIDDEFRHVLTLNTTTHGLIRPTRAVYGAANIPAIWQ FT RRMESVLQGLDQVVSFYDDVIVFAKDFDELLLALTSTMDRMKENGLRLNRS FT KCIFATPSLECLGHRIDRHGLHKSDKHVEAIRDAPRPTTPEELQLFLGKAT FT YYNAFIPNLSSRARSLRDMLLADPFEWTPEADEAYRDLKQALVSPQVLIQY FT DPTLPLVLATDASKTGLGAVLSHRLSNKTERPIAYASCTMSTTEQRYPQID FT KEALAIVWAVKKFFHYLYARKFTLVTDHKPLTQILHPEKSLPTLCISRMAN FT YADYLAHFNFDVVYKPTSQNTNADYCSRIPSRSTQSDVNSLFLHEGRNTED FT EFELFTLHQIQQLPVRAEQIARETRKDTHLGKIVQDLELGRNLPQSGYKAP FT EAKYTLAASCLLFEHRVVIPPTLRQAILNDLHVGHIGVVKMKGLARSFVYW FT PGIDADIEAAVRSCTECARHATAPPKFSSHHWEYPSSPWERIHVDYAGPVA FT GAMLLIIVDAYSKWVDLKITHSTTAEATCQLLDGLFAAYGAPVTLVSDNGP FT QFAAGEFKSFLQKSGVKFHKLSAPYHPATNGQAERYVQTVKKALKAMGTTA FT SNLQANINEFLQQYRKAPHTETGESPAKLFLGRNIRTRLDLVRPQDARTKI FT TEKQRATFDSSFRSFLPGQHVYCLSGNPRMPKWIPGVVAARFGDLHYDIVY FT EGKHLKRHVDQIRRFHNNDNNNNECNEKCIQPSSEAKSNETSGMAPHRMHF FT YGDIVTSQETQSDQQSTSSSESSSEGFETPDGRSTPSSSPDRSSEGSPPAP FT PPGLRRSVRPHRPPLRYSP" XX SQ Sequence 4443 BP; 1235 A; 1253 C; 1053 G; 902 T; 0 other; attttggtgt cagaagtggg attggtcgaa aacttctcgc gatacgaaag tgaggttaga 60 gtcgccccga aaacactctt gccatcgaga acaatggaga atcccgaatt tatgcaagcg 120 ctgtccacga tgattgcgca agccctcaga gcatcgatcg gtaacgccat cgaacaggtg 180 aacgttgatc gaccagttcc ggccggagac gccgctgctc catcgccgaa ggtaccaccg 240 ttcatgatgc ccgagtaccg ctccacagag ggaacgtccg tcgaggatta tttcaagcga 300 ttcaaatggg ccctgcagct gagtctcatc ccggaagctc agtacgctaa ctatgctcgc 360 gtccacatgg gagcagacct gaataacgcc ctgaagttcc tcgttgcccc ccaggatcct 420 gccgacattg ctttcgagga tatgctaacc accctggtga accacttcga ttgcaagaag 480 aataagttcg tcgagagcgt caagttcagg caaatcatcc agcaaaaggg cgaatccgtc 540 gctcagtttg ctcttcgatt gaagcaaggc gctgcctact gtgattacgg agactttctg 600 gaccggatgc tcatcgaaca gcttctacac ggtctggagg cacgagatat ctgcgacgaa 660 atcatcgcga agaatccgga caccttcgac gctgcctaca aggtcgcgca caccatggaa 720 gcaacccgga acacggcgag agaaatcaac acgggatctc cgtcaacgca tccggaagca 780 acccacaaac tcggctacga gaagccgaaa acaaagaagc tacagcaacc tcaacatcga 840 tcgcctcccg tccaccggaa gcagcaagaa aatcgaccga gtaacaaaca gcaagctaaa 900 gctcacccga acagtgctgg tggatcgagc ttctgcaacg gctgtggcga acagcaccta 960 cgaagtaatt gccggttccg taacgccaag tgcaacatct gcgacaagaa gggccacata 1020 gccaaagttt gccgatcggg gaagtcgcaa cactcttccg ctgatcaggt gcaatcgcaa 1080 gaaacgcccg caaccgaggt cgatctggta cagtccctaa gccaagttca cacagttccg 1140 tgcattgaga agaagatgat tgacgtggaa atcgatggtc gtcgcctgca gatggagttg 1200 gacaccggag caccgtgtgg aatcattgga gagtcgaagc tacgccggat caagccgaaa 1260 ttttcactgc tgccaacgga cagacaattc tcgagttact ctggtcatcg catccactgc 1320 ctaggtcgtc tccctgtgca tgtatctctc ggatctacaa cgcgcagact gaacctatac 1380 gtcgtgtctg gagaatccga ttctctcttc gggcgcgatt ggattgctca ctttgcagac 1440 cagatcaacc taaaccaaat attttcgcaa ggcacgctca tcaactcgat aacgaacacc 1500 gagctgactc aggatcaaga agcacacctc tcaaaacttc tggacaactt cgaaggaacc 1560 ttcagcgacg ttccaggaaa actcaaagga ccaccagcga aagttcatct caagcccggc 1620 gcatcaccgg tatttgcgaa ggcgcgcgac gtaccttttg ctctgcgtga ccggtatgca 1680 gccgaaatcg ataagaagct ggcctcagga ttctacgaga aagtggaatt ctccgaatgg 1740 gcatcgccaa cccacatcgt cataaaaaac aacggaggta ttcggatcac cggtaactat 1800 aaacccaccg tcaacccaag aatggtcatt gacgagcatc cgataccgaa gatcgaaacg 1860 ctctttaaca aaatgaaagg agcaactttg ttttgtcacc tagacgtcac cgatgcctat 1920 acgcacctcc caatcgacga tgaattccgc catgtgctca cgctgaacac cacgacacac 1980 ggactcattc gtcctacgag agccgtatat ggagctgcta acatcccggc tatttggcag 2040 cgacgtatgg aatccgttct tcaagggttg gatcaagttg tcagcttcta cgacgacgtc 2100 atcgttttcg caaaggattt cgacgagctt ctgcttgcac tcacatcaac gatggacagg 2160 atgaaggaaa acggtctacg gctcaacaga tcgaaatgca tcttcgctac accatcacta 2220 gaatgcctgg gtcatcggat cgatcgtcac ggtctacaca agtcggacaa acacgtcgaa 2280 gcaatccgtg acgcaccacg tccgactacc ccggaagaat tgcagctgtt cctaggtaag 2340 gcaacgtact ataatgcgtt cattcccaat ctctcttcaa gggcaagaag cttgcgagac 2400 atgctgcttg ccgacccgtt tgaatggaca cccgaagccg acgaagccta ccgagacttg 2460 aaacaggctc tggtctctcc gcaagtacta atccagtacg acccgacact accactggtg 2520 ctggcaacgg acgcaagcaa aactggtctc ggcgctgttt tatctcaccg tctgagcaac 2580 aaaacggagc gaccgatagc ctacgctagc tgcaccatgt ccaccaccga gcaacggtac 2640 ccgcaaatcg acaaggaggc tctcgccatt gtgtgggcag tgaagaagtt cttccactac 2700 ctgtacgcac gcaagttcac gctggttacg gaccacaagc cgctcacgca aatcctgcat 2760 cccgagaagt ccctccctac gctctgcatt agtcgaatgg caaactacgc tgattacctg 2820 gcccacttca acttcgacgt ggtctacaag ccaaccagtc agaacacgaa tgccgactac 2880 tgttcgagaa ttccaagccg atcgacacaa tctgatgtca acagcctgtt tcttcacgag 2940 ggaagaaata ccgaagacga atttgaacta ttcactctgc accaaatcca acaattgccg 3000 gtgcgagctg aacaaattgc tcgtgagaca cggaaagaca ctcacctcgg gaaaatcgtt 3060 caagatctgg agctaggacg aaaccttccg cagtcgggct acaaggcacc ggaagcaaaa 3120 tacactctgg cagccagttg cctactcttc gagcaccgtg tcgtcatccc accaactctt 3180 cggcaagcga tcctgaacga ccttcatgta ggacacatcg gagtcgtcaa gatgaaaggg 3240 ctggcacgat cttttgtgta ctggcctgga atagatgcag acatcgaagc cgcagtgaga 3300 tcctgtacgg aatgcgcacg acatgcaact gcaccaccga agttcagcag ccatcactgg 3360 gagtacccga gcagtccttg ggagcgtata cacgtcgatt acgctggccc tgtggcaggt 3420 gcgatgctgc taataattgt cgatgcttac agcaagtggg tcgacttaaa aatcacccac 3480 tcaaccacgg cggaggcaac ttgccagctg cttgatggat tattcgctgc ctatggagca 3540 ccagtaacac ttgtgtcgga caacggtcca cagtttgctg ctggtgaatt caagtcgttt 3600 ctccagaaaa gcggggtaaa attccacaaa ctttccgccc cctaccatcc ggctacgaac 3660 ggtcaggccg aacgctatgt ccagaccgtg aagaaggcgc taaaggcgat gggaacgacg 3720 gccagcaatc tgcaagcaaa cataaatgaa ttccttcaac agtaccgcaa agctccccat 3780 accgaaaccg gagaatcacc agcaaagttg ttcctgggtc gtaacatccg aactcgactc 3840 gatctggtca gaccgcaaga tgcccgcacg aagataacgg agaaacaacg ggcaaccttc 3900 gattcgtcct tccgctcctt cttaccggga caacacgtct actgcctgtc aggaaatccg 3960 cgaatgccca aatggattcc aggagttgtt gctgctcgtt tcggagattt gcactacgac 4020 atcgtttacg aaggaaaaca tttgaaacgt catgttgacc aaattcgacg ttttcacaac 4080 aacgacaaca acaacaacga gtgcaacgaa aagtgcatcc agccgtcaag cgaagcaaag 4140 tcgaatgaaa catcaggtat ggcacctcat cgtatgcact tttatggaga catcgtgacg 4200 tcacaagaaa cgcaatccga ccaacaatca acttcgtcat cggaatcgtc ttctgaaggg 4260 ttcgagacac cagatggacg atcaacgccg agtagcagcc cggaccggtc aagtgaaggt 4320 tcaccaccgg ctcccccacc agggttgcgt cgttctgtga gaccacatcg cccccctctt 4380 cgatactcac cataattata gtagttataa gagtttcttt cttaaaggag ggaggaaata 4440 tta 4443 // ID hATm-2_SM repbase; DNA; INV; 2919 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATm-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2919 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1853-1853 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 508..1557 FT /product="hATm-2_SM_1p" FT /translation="MATCSTRSASSIWLVGSTNEHFQTSKLPSRGDVLKVL FT FHYHSVKNMNLKDSIGKSASLLLPIWEMARIPTKAPNHVIEHIRKLHSDWQ FT GLKKNISRNSATNLSNQAKFQESLDDLFDIAHQDAMSTIRIEEDRLFLLAQ FT REKGRRGKMGGIDRALALKEERAMKRKIAAEKYALKINSTTAATEMLPPET FT LPYDDECSSSQSSDAEPEAGPSTPKAPKPHTARGTVDVVTPQVSAALDRTN FT TSDRKAAYIFAAMASTGQLKQDAEELIISSSAIRRARMKHRKLFSSEVKTT FT FDPAVPLIVHWDGKIMDDLTGPERGKVDRLPILVSGQDVVKLLSVPKLHDG FT TIGHSKY" FT CDS 1608..2510 FT /product="hATm-2_SM_2p" FT /translation="LLQTLEQRVAFVSGWKQRWIGSYSIWLVVTIRFRYPG FT AIHRARWMARAIYSIKMWLFRKQYEPLQPGTGCRKSRGPSYSQKIWHHLQE FT VSLFVTRVYVKYWFESPVAHCAPRHDLALLCALSEYPNTEIAKAATTAFQR FT HLWYLSETLIAFAFFDDAVTTEEKRLMVVALQEVEGSDEPLKRIQPFQHPT FT TKKLHNFVTKSTTNFFTILGISQNFLQVDPSDWECQTEYQQSQQLVQSVRV FT VNDLAERGVALIQEFNSSLTRDEEQKQYLLQVVEDHRNKFSAPTKSSAIDA FT KLQTMHRSD" XX SQ Sequence 2919 BP; 875 A; 621 C; 608 G; 814 T; 1 other; tagggtgttc ccatccacaa tggaaaaatt ttaaattaag tatttagctt ccaccaaccc 60 agcatttgat gatggctcat atagagacac tgtagccaaa atattataaa gatataataa 120 tatttaggac ttgctcaatg catttgaatg ttcaaaaatg tgaattttgc tgtttttatc 180 aatcatcggc caaaatattt atttttgcta cttacttata acaaagattc atgtctgagt 240 tctttggtca tgtgacctgc acttgcagac attgttgatt gctgacattt ttgcagtttt 300 ttaattttaa tcaagttttt atttgcggaa caccgaatac tacagtggag tgctgaaacg 360 aggcaagttt ttttctattt aacactgcta ttcattttac tagtttctac agttgcaaat 420 tatttgagta aaattgtgca taatatatag tattatattg tacaatgtac aatacaaatt 480 attcacatta ttttcagatc cagtataatg gccacatgca gtacgcgttc agcaagttcc 540 atatggttag ttggatcgac gaatgaacat ttccaaactt ctaaacttcc atcacgtgga 600 gatgtcttga aggttctgtt tcactatcac agtgtcaaga acatgaatct gaaggatagt 660 attggcaagt ctgcatcatt actactgccg atatgggaga tggcccgaat cccgactaag 720 gccccaaatc acgtgatcga gcacattcgc aaactgcact ctgactggca gggtctaaag 780 aagaatatca gtcgcaattc agcaacgaat ttgtccaatc aagcgaaatt ccaggaaagc 840 ctggacgatc tctttgacat cgcccaccag gatgccatgt ccaccataag gattgaagag 900 gacagattgt ttctgcttgc tcagcgcgag aaaggtcgca gaggaaaaat gggcgggata 960 gacagagcac ttgcattgaa agaagaacga gcaatgaaac gaaaaatagc agctgaaaag 1020 tatgctttga agataaattc gacaacagca gcaacagaaa tgctgccacc agaaacttta 1080 ccttatgacg atgaatgtag ttcgtcacag tcatcagatg ccgaacctga agctggtcca 1140 tccactccta aagcacctaa gccacataca gcgcgaggca ctgtggatgt cgtcactcca 1200 caagtctctg ctgctcttga cagaacgaac accagtgata gaaaggctgc ctatattttt 1260 gctgcgatgg catctacagg acagctgaag caagacgcgg aagaactcat tattagttcc 1320 agtgctatca gacgagctcg catgaaacat cgcaagcttt tcagctcgga agtcaagaca 1380 acctttgatc ctgcagtgcc cctaattgta cattgggatg gcaagattat ggatgacctt 1440 actggtccag aacgaggtaa agttgatcga cttcccatat tagtttccgg acaggatgtt 1500 gtaaaactgt tgtctgtccc gaaactccac gatggcacaa tcggtcactc aaagtattga 1560 cgactggggc ttgcgagaca gaatcaaggg tatgtgcttt gatataactg cttcaaacac 1620 tggaacaaag ggtggcgttt gtatccggct ggaagcagag atggataggc agctactcaa 1680 tttggcttgt cgtcaccata cggtttcggt accccggtgc cattcacagg gcacgttgga 1740 tggcaagggc catttattct ataaaaatgt ggctctttcg caaacagtat gagccactac 1800 agccaggtac aggctgtcgg aaatctcgtg gtccctcata tagtcagaaa atctggcatc 1860 acctgcagga ggtcagtttg tttgtcacga gagtgtatgt gaaatactgg tttgaaagtc 1920 cagttgcgca ctgtgcacca cgacatgatc ttgcactgct ttgcgcatta tcagagtacc 1980 cgaacactga aatcgccaag gctgccacta cagcatttca gcgtcacctt tggtacctct 2040 ctgaaaccct tattgcattt gccttcttcg atgatgctgt cactacagag gagaagcgct 2100 taatggtagt cgcacttcaa gaagtcgaag gctcggatga gcctctgaaa cgaattcagc 2160 cattccagca cccaaccaca aagaaacttc acaattttgt gacgaaaagc acaaccaatt 2220 tctttacgat tcttggcata tcccagaact tcctgcaggt tgatcccagt gactgggaat 2280 gccaaacaga gtatcaacaa agccagcagc ttgtccaatc tgtcagagtt gtcaacgacc 2340 tggccgaacg tggggttgcg ctgatccaag agttcaactc aagtttgaca cgtgatgaag 2400 aacagaagca atatctgctt caagtcgtcg aggaccacag aaataaattt tcagcaccaa 2460 ctaagtcatc agcaattgat gccaagctgc agacaatgca tcgaagtgat taacgtgaac 2520 actaactgag taactggatt gtgaattgac aatgtactgc cataaaaact gattttctac 2580 aggcttaggc ttggctattg gttactgcct aggactatgt tctatgatga acaattgatt 2640 gaaccttaca gtacactaat attgttttag ctgattgctg ataataaaca tactgaacct 2700 tgcattttac acagaccttg gtgtattctg ccgtttttat aattaactct cgatcgtata 2760 gtcgcggcac aaattaaaca taactttgat tgagttgagc aagtcctaaa tattattatt 2820 ttcagctaaa anttttacta cagcttcttt ataactagca taatgctata gaggttggag 2880 aacattgttc tcaaactttc attttttagg aacacccta 2919 // ID Gypsy-593_AA-I repbase; DNA; INV; 6069 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-593_AA_; KW Gypsy-593_AA-LTR; Ty3_gypsy_Ele155; Gypsy-593_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6069 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4890-5396] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 950..2236 FT /product="Gypsy-593_AA-I_1p" FT /translation="MDKFFSVEDAIFKLRQSLKKYNTFKVETRDKKVHTLI FT ALKEDLKAIVKSSSIDYVTRQHLFARYKRLRAVIYDCLNLLKSPLEESILG FT NLFTLELEQESLDNNNCKDSTEGSETEFEEEFENMATKLDLSLGLKVIQPF FT DGTVAKLTSYIEGVELFQDYSEGVPEDKIIKFLKTTLVGSAHGAIDGSVTV FT ADALQALKNKFAIRVTPRAVQNEMSALKQSQKSISDFGSEMEKLSAKLAAA FT HVSMGTFAHEAAAVNIVEPIAVQSFIDGLKDPSTKFFLKARNPTTLNKAIS FT DALECQATPSTDNMNENMMALRCTSNQRPYYYARGRRGYGGNRGYQNTRGR FT SYGRGRGNYSNYNNPRNDNMNSQPQSHNYGRGHRGNRPPRANHNGNDHRAH FT TANIAEQAPEQPRQRQPVNQEQNQAEEANLIDFFR" FT CDS 2565..5741 FT /product="Gypsy-593_AA-I_2p" FT /translation="MKMEFTIPSGEIHFTIPPRYEYITYIETEFAETCVVL FT NQKVQPNVFVANSIANPVNGKIPVRLVNISNKPVVIKDLKPRIALAENYNV FT IELKKDDQKYDKIRASKLLEELKLNHLSGSDEKAIKQICLKYSDIFCLKGD FT KLGTTNVYCPSLTVKPNIQPAFSKPYRLPFSQKEEVCKQVENMLKDGIIEE FT TKSEWNSPLLLVPKKSDNDSKKWRLVIDYRKVNTNLQDDKFPLPNIEEVID FT SLAGSQYFTHLDLSQGYYQCELKPEDRPITAFATPSGQYQMTRLPMGLKTS FT PSSFSRLMTVAMSGLNLTKCLIYLDDIIVFGKTFDEHNKNLISVFQRLREV FT NLKLNPAKCNFLKQELIYLGHFISKEGVLPDPQKIETIKNWKSPSSSDEVK FT RFVAFANYYRKHIRNFASLCLPLNYLTRKGIEFNWSEECETAFQQLKERFI FT KPPVLDYPDFSEKNTFKLHTDASGYGIGAVLSNKNDKPIAYASKGLNSAEK FT NYSTIEKELLAIVWAIQHFRVYLYGRKFELYTDHRPLVYLFTLTDPSSRLT FT KFRLALEEFNFDVFYKKGCENAVADALSRISTTELRDMQEKLSNEAFVTTR FT MQSKKENTNSTKEVINGHDASSGKIVQLKVKTNNFSDNVTWIPEREEIIIK FT PTDTLVSLRRTMEELGSICKKYGVGELYIQIDDDCAHKFYENIMQNDLAKR FT VPSILKISNKVKIINDDTTKKLILNDYHILPTAGHAGIKRTINTIRQKYYW FT KGINQDVAKFIKSCEKCQKYKSINVAKPPMIVTTTAESAFQKIYLDLVGPL FT LPSDGYEYILTTQCELTKFITATPIKNKTTEAVAEAFMKSIILKYGVPDRI FT ASDRGAEFMSELFTKVAQLLNIEKLNSTAYYHQSIGALENTHKSLGNFLRV FT QCDNKLFSWANWVPYYEFAYNNTVHSATKYTPFFLVFGRPSKMPSNLVDSR FT PEPLYNFEDYCKRIKATLQISHSEVRDRLIQEKTKRIVGINTKSKETTYRK FT GDSVLIKNETGKKLDPRYDGPYHVVEDMGTNVKISNNNKEDIVHKSRIKLF FT IE" XX SQ Sequence 6069 BP; 2211 A; 1038 C; 1235 G; 1585 T; 0 other; tggcgatcct ttgccagtga ctacaccaaa cgaatcgatc agtgcaagtt aattatgtga 60 ttaaccccgc aactacacca acgaatcaat aagtgcaagt taattaattg atatagtgat 120 cggaaaagtg cagcaaaatg ggttggtttt cggcggacga aatcgtcgcc ccagccgcag 180 tgactacaac agaaagtcat cacacagcac agaccgtagc gctatgtctt atggctggag 240 tagccgtggg atatctactc gtaaaaagtc tgatgaagtg tcaccggcag caaacggagc 300 gtgttgcgga gcgcaccgca cggttggcaa atttgcctgc ctaatgattc acattcacat 360 tcggcgcgaa gaaaattagt gccgaaaagt gaacgaaaag actatgaact aagtcgttaa 420 attggaaaag tgcacagaga agcaaatgct agtgaccacg gtatacaaat taacaagtga 480 tcaacgaatt ttaagaacaa tcatacatga tgggagatcg aagaacacta aatgaagaaa 540 agctgagaca gtgctaccaa tttttcgagc gagaatcgat gcattgggaa cgcctcgcta 600 ggaaaagatt gttaatcacc atagaggaat ttgaatccaa gaaggccatc gttgaagcag 660 ttcatcgcgt ggcagcagta gcagccatcg aatgtggacc agtggacgta caaaaccttc 720 agtggaacta cggtgaatta tccatggcga tggaggcata tcgacaaatg gatttataag 780 ggaagtagct aatgaatgaa ccacgtatag gataagatac tgatagaaaa ttacaatttt 840 aggtaaaaag tacgaatggg tacatgtatt catggacgaa catggacgaa caaatggcac 900 agtggtgagt gaaaagtaga tgtaaagcaa aaaaaaaaaa aaaaatggaa tggataaatt 960 tttcagcgtc gaagatgcta ttttcaaact gcgtcaaagc ttgaaaaaat ataatacatt 1020 taaagtagag acacgagaca aaaaggtgca tacattgata gctcttaaag aagatttaaa 1080 agcaatagta aagagcagct caattgacta tgtaactagg caacatttgt ttgccaggta 1140 taaaaggcta agagcagtta tttacgattg tctaaatctt ttaaagtcac ctttggaaga 1200 atcgatactg ggaaatttat tcactctaga gttagaacag gaatcattag ataacaataa 1260 ttgtaaagat tctacagaag gctcagaaac agagtttgaa gaagagtttg aaaacatggc 1320 cacgaaactt gatctcagtt taggcctgaa ggtaattcag ccgttcgatg gaacagtagc 1380 taagctaact agttacatcg aaggagttga gctcttccag gactactctg aaggggtgcc 1440 agaagacaaa ataataaaat ttctaaagac tacgctagta ggttctgcgc acggtgcgat 1500 cgatggatct gtaaccgttg cggatgccct acaggctttg aaaaacaaat tcgcgatcag 1560 agtaacacca agagcggttc agaacgaaat gagcgctcta aaacaaagcc aaaagtcaat 1620 atctgacttt ggttctgaaa tggagaaatt gtcggccaaa ctggcagcag ctcatgtttc 1680 tatgggcaca tttgcgcatg aagccgccgc tgtaaatatt gtagaaccta ttgcggttca 1740 atcatttatt gatggattga aagatccgtc aactaaattt tttctaaagg cccgaaaccc 1800 aacaactttg aacaaggcaa tctctgatgc cttagagtgc caggcaacgc ccagcacaga 1860 caatatgaat gaaaacatga tggcattacg gtgtacatca aaccaacgcc cgtactacta 1920 tgcgagaggt cgcagaggtt acggtggaaa ccgtgggtat caaaatactc gtggtagaag 1980 ctatggccga ggtagaggca attactcaaa ttacaacaac cctaggaatg acaatatgaa 2040 ttcacaaccc cagtcacaca attatggcag aggccataga ggaaatagac cacctcgagc 2100 taaccacaac gggaacgatc acagggcgca tacagcaaat atagctgagc aagcacccga 2160 acaaccacgt caacgtcaac ctgtgaacca ggagcaaaac caagcagagg aagcgaactt 2220 aatagatttt tttcgctaat ttcagtgtta aaaaagcact aagagccaaa tttgaattat 2280 tcggaaggag tatagaattg atagtagaca gcggagcatc ctgttgtcta ctggacaaag 2340 agtatattcc gaaaaattgt aaaataaata agtcatcaac cattgaagtc agaggagtaa 2400 atggaataac acatacattg ggatatgtgg atacatcttt aggccatgag ttggcagagt 2460 atcctgttag atttcatata ctggagcaat taccagcaaa cgttattggt ttgattggta 2520 caaacttttt gaaaaaattt ggagcaaaaa ttgactttgc aaaaatgaag atggaattca 2580 caattccgag tggtgaaata catttcacaa taccaccaag atatgaatat attacataca 2640 tagaaacgga attcgccgaa acatgtgtgg tactgaacca gaaagttcag ccaaatgtgt 2700 ttgtggccaa ttctatagca aatcccgtga atggtaaaat accggtgcga ttggtgaata 2760 tttcaaataa accagttgtt attaaagatt tgaaaccaag gatagcatta gctgaaaact 2820 acaatgttat agaattgaaa aaggatgatc aaaaatacga caaaattcga gcgagcaaat 2880 tattggaaga attaaaatta aatcacttat ctggtagtga tgaaaaagct attaaacaaa 2940 tttgcttgaa atattccgat atattttgtt taaaaggcga taagcttggt accaccaatg 3000 tttattgccc gtcattaacg gtcaagccaa acatccaacc agcatttagt aaaccttaca 3060 ggcttccttt ctctcagaaa gaagaagtct gcaaacaggt ggaaaatatg cttaaagatg 3120 gtatcattga agaaaccaaa tctgagtgga acagcccctt gttgctagtt cccaaaaaat 3180 cagacaatga ttcaaagaaa tggagattag ttattgacta taggaaagtc aatacaaatc 3240 ttcaagacga taaatttcca ttgcccaata tagaggaagt aatcgattcc ctcgcaggtt 3300 ctcaatattt tactcatttg gatttgtcac aaggttatta ccagtgtgaa ttaaaaccgg 3360 aggacagacc cattacagcg tttgcgacgc catccggtca gtatcaaatg accaggttac 3420 cgatgggatt gaaaacaagc ccatcatctt tttctcgatt aatgacagta gcaatgtcag 3480 gattaaattt gactaaatgt ttgatttacc tcgatgatat tattgtgttt gggaaaacat 3540 ttgacgaaca caataagaat ttgatttctg tttttcagcg actacgagaa gtcaatttga 3600 aactcaaccc cgctaaatgt aatttcttga agcaggagtt gatatattta ggacatttta 3660 tttcaaagga aggtgtacta cctgatccac agaaaattga aactattaaa aattggaaga 3720 gtccttcatc ctctgatgaa gttaagagat ttgtagcttt tgcgaattac tacaggaagc 3780 atatacgaaa ttttgcgagt ttgtgtttac cattaaatta tttaactcga aaaggaatag 3840 aattcaattg gagtgaagaa tgtgaaactg catttcaaca attgaaagag cggttcatca 3900 aaccccccgt gttagattat ccagatttta gcgagaaaaa cacatttaaa ctacacactg 3960 acgcatctgg ttacggaatc ggtgcagtac taagcaacaa aaatgataaa ccaatagctt 4020 atgccagcaa aggtttaaat agcgctgaga aaaattattc aacaatcgaa aaagaacttt 4080 tggccatagt ctgggcaatt cagcattttc gtgtatattt atatggacga aaatttgagt 4140 tgtataccga ccatagacct ttagtttacc ttttcacatt aacggatcct tcgagtaggc 4200 tgacaaaatt ccgtcttgcc ctcgaggaat tcaactttga cgtcttctac aaaaaaggtt 4260 gtgaaaacgc tgtggcggac gcgctatcac gtatttcaac aaccgaattg agagacatgc 4320 aagaaaaact cagcaatgag gcattcgtaa cgacccgaat gcagtcgaag aaggaaaata 4380 ctaattctac taaagaggtg atcaatggcc atgacgcttc aagtggaaaa atagttcaac 4440 tgaaagtaaa aacaaataat ttttctgata acgttacatg gattcctgaa agagaagaaa 4500 ttataattaa accaacagat accttggtct cgttacgacg aacaatggag gaacttggtt 4560 caatatgtaa aaaatatggt gtcggagaac tatacattca aatagacgat gattgtgcgc 4620 acaaatttta tgaaaatatt atgcaaaatg atttagcaaa aagggtaccc tcaattttaa 4680 aaatatcaaa taaagtaaaa atcatcaatg acgacactac gaaaaagtta atcctgaacg 4740 attatcacat attgcctact gcaggtcatg caggtattaa acgaacaatt aacaccatta 4800 ggcaaaaata ttattggaaa ggcattaacc aagatgtagc aaaatttata aaatcttgtg 4860 agaaatgcca aaaatataaa tcaattaatg ttgcgaaacc accgatgata gtaaccacta 4920 cagctgaaag tgcatttcaa aaaatttatt tagatttagt gggtccactt ttaccctctg 4980 atggatatga atatatattg acaacgcagt gcgaactgac aaagttcatt actgcaacac 5040 caattaagaa caagacaacg gaagctgtag ctgaagcttt tatgaaaagt atcatactga 5100 agtatggtgt gccagataga attgcttcag acagaggagc agaattcatg tctgaattgt 5160 ttactaaagt cgcacaactt ttgaacattg aaaagctgaa cagtacagca tattatcacc 5220 aatcaattgg agcattagaa aatactcata aaagccttgg gaattttcta agggtccaat 5280 gtgacaacaa gctcttttca tgggcaaact gggtgccgta ttatgaattt gcatataata 5340 atacggtaca ttctgcaacc aaatacactc cgttcttcct agtttttggg agaccttcaa 5400 aaatgccttc taacttggtg gacagccgcc ctgagcctct gtataatttt gaagattact 5460 gtaaaagaat aaaagcaaca ttacagatca gtcatagcga agttagagac agacttattc 5520 aggaaaagac aaaaaggata gtaggcataa atacaaaatc aaaggaaaca acttaccgga 5580 aaggagattc tgttctcatt aaaaatgaga cagggaaaaa actagacccg agatatgatg 5640 gtccgtacca cgtagtggag gacatgggta caaatgtaaa aatcagtaat aacaataaag 5700 aagacatagt acataagagt aggattaagc tttttattga ataggaatac gcatttttaa 5760 acattttatt tgataggatt aaacatttta aacaaatttt tggataggat tgaacattta 5820 ttcattttat ataaaaatat taattatttt aaacattttt ttttatggaa gaggattgac 5880 aattttatag gacgtgtggt ggatgcgtaa ccttgagaag gtcaagtcca agatacgagg 5940 tatcgaggat tagtgtgtat catcacgtta ctgtaggatt gtacctttag agtgaaggaa 6000 aacatattgt atcagataga tttaaatttt taactgtaga taaaaattta aataaaatat 6060 gatagggcg 6069 // ID Hoana7 repbase; DNA; INV; 2519 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana7 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoana7. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-2519 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 642..2174 FT /product="Hoana7_1p" FT /translation="MQCGKVLKTSGNTSNLMQHLRTAHPIREQAENIAVSP FT CMNKFLESSKQYSNASARKIAIDKAVMRMIALDMQPFSIVTDRGFVNLMKC FT VDPLYKLPSKTHLRNVILNNEYEASKEKLKKSLEQVDHVGITTDCWTSKAN FT EGYLTVTVHFVLPSFQLHSAVLSTEKLVTPESHSALTIADSLNLILTEWNV FT LKKVTSIVTDNDASMKKACDLLQIKNFPCFAHSINLLIQDTLKLPMLTPVL FT QKCKRIVNYFKSSTIAYEKFKKAQGETAYSLIQEVPTRWNSALYMMQRILT FT TNTHISAVLLGIPKAPQPLGADEVLMLEDFIKILEPFEHATKSTSHEKEVT FT ISIVIPLICELNKKMADMEKIIITTEGKATFQYLKKRMSERLSHYETRTVA FT RLATILDPRFKKFGFLSQTNADEAAKALEIELYNKMSKFNHDAPTPPTPQP FT ENKQFTFLSQKVGEKIKSSRADSIIALRQYLEVSNSPEDCNPLTFYKVFLI FT EIPKFYIKFNDLFYR" XX SQ Sequence 2519 BP; 886 A; 461 C; 453 G; 719 T; 0 other; tagagctgga aaaaacctcg atgtaacatc gatatatcga tgttgccgaa aatttagcaa 60 catcgatatt ttagttttca acatcgatta tcgtcacttg cagccgaata aaagaggaag 120 catcaacgcg tggtaaggta agtgaaaatt aaaatgaata ttattaacaa aaactaatcc 180 attgttattt ttcattttaa tagagtaatt ggtggtgttg gcagtcatcg tgggagaatt 240 atggacaaat ttttaaaaag taagtttgat gaaactgttg taaaatgaat atttgtatgt 300 atgcatgtat gcatgtatgg gtagaaatgt gtttaattct actctaattc tacaatgcat 360 tcatctatac atctttattt gcgttagacc catatatatg tatatattta tacatatatg 420 tttaatgctt tacacatatt tatacataca tacaactatt agaacatttg ttataagttc 480 ataaattaat attattttgc attacaggaa cttcgaatgt gttgggcgac gtggaaaacg 540 aagaaaacaa agaaatctca atttcgaata acaagcgcaa ggaaaaaatt tcaatagttt 600 ggaattattt taagaaggga acaaatcaaa ccgcgtcttg tatgcaatgt ggaaaggtcc 660 tcaaaacatc tggaaatacc agcaatttga tgcagcattt gagaactgca catcctataa 720 gagagcaagc agaaaacatt gcagtttcac catgcatgaa caaatttctt gaaagttcta 780 agcaatactc aaatgcctct gctcgaaaaa tcgctataga caaagctgta atgaggatga 840 tagcattgga tatgcaacct tttagcattg ttaccgatcg cggatttgtt aatttaatga 900 agtgcgtgga tcctttgtac aagttgccaa gtaagacaca tttgcgtaac gttatcttaa 960 acaatgagta tgaggcctca aaggaaaaac tcaaaaaatc attagagcag gtagatcacg 1020 ttgggattac aacggactgc tggacatcta aagccaatga gggttattta acggttacag 1080 tacactttgt cttaccatct tttcaacttc actcagcagt gttgtcaaca gagaaacttg 1140 taactcctga aagccactct gctcttacta ttgcggattc attaaacctt atattgacag 1200 agtggaatgt cttaaaaaaa gttacatcaa tagtaacaga caatgacgct agcatgaaaa 1260 aggcatgtga cttgttgcaa ataaaaaact ttccttgttt tgcacacagc ataaatcttt 1320 tgatccaaga tacgcttaaa cttccaatgc tgactccagt gttacaaaaa tgcaagagga 1380 ttgtgaacta ttttaaaagc agcacaatcg cctatgaaaa attcaagaag gcacaaggtg 1440 aaaccgctta tagtctaatt caggaggtgc ccacgcgttg gaactctgct ttgtacatga 1500 tgcagcgcat attgactacg aacacacata tatcagcagt gctattaggc attccaaagg 1560 ccccacaacc attgggagca gatgaagttc taatgctgga agacttcatt aagatattgg 1620 aaccctttga gcatgcaacg aagtcgactt cgcatgagaa ggaggtgacc atttcaattg 1680 taattccttt gatttgtgag cttaataaaa aaatggcgga catggagaaa ataataataa 1740 caactgaagg aaaagcaacg tttcagtacc tcaaaaaacg tatgtcagaa cgcctgagcc 1800 actacgaaac acggacagta gctagacttg cgacaatttt ggatccacgc tttaaaaagt 1860 tcggctttct ttcgcaaaca aatgcggacg aagcagctaa agccctcgaa attgaactat 1920 acaataaaat gtccaaattc aaccacgatg cgccaacacc accaactcca cagccggaga 1980 acaaacaatt taccttcttg agccaaaaag ttggagaaaa aatcaagtct tccagagcag 2040 actctataat agctctgcgg caatacctcg aggtttccaa ctctccagag gattgcaacc 2100 cattaacatt ttataaggta tttttaattg aaatcccgaa attttatata aaattcaatg 2160 atttatttta tagatgagcc caaatgaaat ggaacctttc aaagctactg caaagcagta 2220 cctttgcgtg ccagcaacat ctactgcatc cgaaaggacc ttcagtaaag ccggccaact 2280 gataagtgac cgcagaagct gtctaaagcc aaaaattgtg gataagcttt tgtttttgaa 2340 caaaaatgct gatttgatct cctatatgaa ttctgaatga ttgatctcct gtaaataaat 2400 ttagaaatta accaaaaaaa aattgttttt aaacttcgat aatcgattat aatcgatttt 2460 ttccccttaa aacatcgaat acatcgatgt caccgaacat cgatgttttt ccagctcta 2519 // ID P126 repbase; DNA; INV; 1840 BP. XX AC M96642; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.tetraurelia P126 repetitive element. XX KW Satellite; Simple Repeat; P126; P126 repeat element; KW Repetitive sequence. XX OS Paramecium tetraurelia OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium. XX RN [1] RP 1-1840 RA Forney D.J. and Rodkey K.; RT "A repetitive DNA sequence in Paramecium macronuclei is related RT to the beta subunit of G proteins."; RL Nucleic Acids Res 20(20), 5397-5402 (1992). XX DR GenBank; M96642; Positions 781 2620. XX SQ Sequence 1840 BP; 568 A; 267 C; 349 G; 656 T; 0 other; tttgtttctc tcctgatggt actttattag cttctggtag ttgtgataat tctatccgtt 60 tatgggatgt ttagacagga aaataaaaag tgaaaataga tggtcaccgc gattatgtaa 120 attcagtatg tttctctcct aatggtacta cattagcatc cggtagtgat gattaaacta 180 ttcgtttatg ggatgttaag acaggaaaat aaaaagccat ttttattggt cattcagatt 240 ttgtgtattc agtcaatttc tctcctgaca gtactatatt agcatctggt agtgtagata 300 aatctatccg tttatgggat gttaagacag gataataaaa agccaaatta gatggccatt 360 tagattatgt taattcagtc aatttctctt gtgatggtac tacattagca tctggtagtt 420 gggataattc tatcaggtta tgggatgtta agacaggaaa ataaaaagcc atttttattg 480 gtcattcagg ttgtgtgtat tcagtcaatt tctctcctga aagtactata ttagcatctg 540 gtagtgaaga taaatctatc cgtttatggg atgttaagac aggataataa aaagccaaat 600 taattggtca ctcaggttat gttaattcag tcaatttctc tcctgatggt tctacattag 660 catctagtag ttcagataac tctatccgtt tatgggatgt taagtcagga taataaaaag 720 ccaaattcga tggtcattta tcaagtgttt tgtcagtcaa tttctctcct gatcatacta 780 cattagcatc tggtagtgta gataaatgta tccgtttatg ggatgttaag acaggatatt 840 aaaaagccaa agtagatggt catttatcaa ctgttgtgtc agtcaatttc tctcctgatg 900 gtactacatt agcatctggt agttcagata attctatccg tttatgggat accaagacgg 960 gataataaaa agtcaaattg gatggtcatt caggttacgt taattcagtc aatttctcac 1020 ttgatggtac tatattagca tctggtagtt ttgataattc tattcgttta tgggatgtta 1080 agacaggata ataaaaagcc aaattagatg gtcattcaga aactgttact tcagtcaatt 1140 tctctcctga cagtactata ttagcatctg gtagtcatga taactctatc tgtatatggg 1200 atgttaagac aggataataa aaagccaaat tggatggtca ttcttaaaca gtttattcag 1260 tcaatttctc tcctgatggt actttattag cttctggtag ttgggataag ttaatccttt 1320 tatgggatgt taagacagga taataaaaag tcaaattaga tggtcattct taaacagttt 1380 attcagtcaa tttctctcct aatggtactt tattagcttc tggtagtggg gataatttaa 1440 ccattttatg ggatgttaag actggataat aaaaagctaa attagatggt cattcatcaa 1500 cagtttattc agtcaatttc tcacctgatg gtacgacatt agcatctggt agtgaagatt 1560 agtctatctg tttatgggat gttaagacag gataataaaa agccaaatta attggtcatt 1620 caaatggaat tctttctgtc aatttctctc gagacaatac tacattagca tctggtagtt 1680 tcgataattc tattcgttta tgggatgtta agacaagata gttaaaaacc aaattagatg 1740 atcatcctgc cacagttaat tcagtcaatt tctctcctga tggtactaaa ttagtatctg 1800 gtagtaatga taactctgtc catttatggg atgttaaaac 1840 // ID Jockey-N6_CQ repbase; DNA; INV; 1458 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1458 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 591-591 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. This family encodes a protein similar to Jockey ORF1p CC but does not encode ORF2p. Thus it is a non-autonomous non-LTR CC retrotransposon derived from Jockey, like HeT-A. XX FH Key Location/Qualifiers FT CDS 80..1351 FT /product="Jockey-N6_CQ_1p" FT /translation="MEVEESSEEEDFXPVRGNRKRKKTSEVTGXVIEXEKV FT EKTLLNSNKFSALAGDKNNNNASAAGGGDAPAVPEPRKSAGKQKQPPLVVT FT NLGFNRFLDLMALCKVKPEYKLTRSGIKVTCFSVEEFNTVRELLLEYKVQF FT YSHDRRSERSHRVVLRGLPEVDVEIIKDRLKEDHNLDVLDVKAIKRKEKST FT AGETPYIVFFPKGHTNLKKLSAIKKMGPVIVRWEAYRNEQPNVTQCRNCLR FT LGHGARNCYLKSRCNNCGRFHKTEHCKVEVTQPKKCANCSGSHEGLDRSCP FT KRAQFIQSRQQASKPKPPAWXKDKQSPVVPSFSAADFPPLPVPVPIPDKLK FT KTEPQSAADAGSSRGNTGPGAGGQPKAIGEEVLYSTEELWKMFXGELWKMF FT YEFTARLRSCKTRSDQERVVGTMSRKFGAD" XX SQ Sequence 1458 BP; 405 A; 354 C; 412 G; 278 T; 9 other; cactcagtcg ccagccagcg acaagttaag acgtgttttg ctcgtgttgt catctcgcgt 60 gcttggaagt ttttgacgga tggaggtgga ggaatcctcc gaggaggaag attttmgkcc 120 cgtccgcggg aatmggaagc ggaagaagac cagcgaggtc actggamtcg tcatcgaags 180 agaaaaagtt gagaagaccc tgctcaacag caacaaattc agcgcgctag cgggcgacaa 240 aaacaacaac aatgccagtg cagcaggagg aggagacgcc ccggcggttc cagaacccag 300 aaaatctgcc ggtaaacaaa agcaaccacc tctggtggta acgaacttgg gctttaacag 360 atttctggat ttaatggccc tgtgcaaagt gaaaccggag tacaagctga cccggagcgg 420 aataaaagtg acgtgttttt ccgtggagga atttaacacc gttcgagagc tgctgcttga 480 gtacaaagtg caattctact cgcacgatcg acgaagcgaa cgatcccacc gggtmgttct 540 gcgaggactc ccagaagtgg atgtggagat catcaaggat cgactcaagg aggaccataa 600 ccttgatgtt ttggatgtga aggccatcaa gcggaaggaa aagtctaccg cgggtgaaac 660 cccttatatt gtcttctttc ccaagggaca cacgaacctc aagaagctga gtgcaattaa 720 gaaaatggga ccagtcatcg tccgatggga ggcctaccgg aacgagcaac ccaacgtgac 780 ccagtgcagg aattgcctcc gcctgggaca cggagccagg aactgttatc tcaaaagcag 840 atgcaacaac tgtgggcgat tccacaaaac ggaacactgt aaagtcgaag taactcaacc 900 caaaaagtgt gctaactgct ctggatcgca cgagggtctg gaccgtagtt gccccaaacg 960 tgcgcagttc atccagtcga ggcagcaggc gtccaagccg aagccgccag cwtggaakaa 1020 ggacaaacag agtccggtgg taccgtcgtt ctccgcggcg gatttccctc cgctgccggt 1080 tccggtcccg atcccggata agctgaagaa gacagaacca caatccgcag ctgatgcagg 1140 aagcagtcga ggaaacactg gccctggcgc cggtggccaa cccaaggcga taggagagga 1200 ggtgctttac agcacggagg agctgtggaa gatgttcmtc ggggagctgt ggaagatgtt 1260 ctacgagttc accgcgcggc tgaggagttg caagacccgc tcggaccagg aaagggtggt 1320 cggcaccatg tcccgcaagt tcggtgcgga ttaactcctg tgtgtattta tttattttta 1380 tgcatttgat ccccggtcct aaccaggtct cggtaccgaa gaggacctaa taaaaataaa 1440 ttagcgaata aaaaaaaa 1458 // ID LOA-4_CQ repbase; DNA; INV; 3908 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3908 RA Kojima K.K. and Jurka J.; RT "LOA non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 151-151 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >88% CC identity. XX FH Key Location/Qualifiers FT CDS 149..3772 FT /product="LOA-4_CQ_1p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MKRFIQVNLHHAKGASAVLCRRFMKNNIDVGLIQEPW FT INNGKIKGISAXMGKLLYDDSSPNPRAAMLVKNNTKFIPITEFISRDIVAI FT QMEVPTVRGSTWVCIASAYFPGDTDEAPPADVTMFISYCKKMNKQFIVGCD FT ANAHHTVWGSTDINSRGENLLNXITTNNIDICNNGDCPTFVTATRQEVLDL FT TFCSXVISERVINWXVSDEISLSDHKQIEFEYNAGEMIVGKYRNPKKTNWD FT AYKIXLNDYDFNSEDEIRTVXQLEKASSELIIAITDSYXASCPVQQRSSSR FT EVSWWNEKLAVLRKGSRRLFNRAKKTGDWXSYKXALTKYNKEIKRSKRSDW FT RRSCEKIEETPIVARLQKALSKDHSNGLGNLKKADGSFTKNASETLDLMMK FT SHFPGSTEVTSGIDQAEGERLAKDWSVLAHNQACEIFTRSKVDWAVSSFKP FT FKSAGTDGIFPALLQQGKDIINAKLTDIFRASIALGYIPKAWREVRVIFIP FT KAGKRDKTSPKSFRPISLSSILLKTMEKILDDFVKRTFLTKFPLSNGQFAY FT QSGKSTITALNCLVSKIEKSLKAKEVALAAFLDIEGAFDNASHSSMRSAME FT ARGFGSCIIKWIEAMLIEREISADLGGXRLVIRPRRGCPQGGVLSPLLWSL FT VVDDLLKKLASLGFELIGYADDICIIVRGKIGSEICNRMQSALNFISKWCK FT NEGLGINPSKTVIIPFTRRRSLVLPDLAIRGITIEFSNEVKYLGITLDRKL FT NWNAHLNNVLNKGINALWVCKKTFGKKWGLRPKMIYWIYTCIVRPRITYAS FT LVWWPKVKEKTAQNKLAKLQRLATISITGAMHSTSSAALNSLLNLLPLHKF FT VELEAFKSALLMNRSTRIQDKDLTGHLEVLKNFTLNPALNSIVDWMPTKAN FT YDIPFTVDAKSRYEWDNGGPNLRPGSIVFYTDGSRMNNKTGAGVFGPGIKT FT SVPMGQYPTVFQAEVYAIIECLRICLKRNYRFANICIFSDSLAALNALKGY FT TCQSRLVWEGINLLKQLSQNNAVNLYWVPGHCGILGNEIADSLARKGSETD FT FIGPEPFFGVPDCSLKMELKRWELSTINSIWNSTDTSRQAKRFIFPGGKLS FT RELLKLDKKNLRVITGLLTGHCPCKYHLFNMRKIPNAICRFCNIENETSEH FT LLCNCGALIQSRISFFGKGVLQPSNINRFRPSRVIDFIKQVDPNWDNTQ" XX SQ Sequence 3908 BP; 1261 A; 747 C; 833 G; 1038 T; 29 other; cgatttatct ggattaagga ctgctcgacg gaccccaggg cgttgawcgc acagggttct 60 gagggcggtt cttmcagact tgaaataaca ataaatsaaa taataagggg agaccacctt 120 cccgaaataa aggtgggaaa actcagcaat gaaamgattt attcaagtta acctccatca 180 tgcgaagggw gcgtcagccg ttctttgcag aaggttcatg aaaaataaca tcgacgttgg 240 tctcattcaa gagccctgga taaataatgg taaaattaaa ggaatttctg ctcwaatggg 300 kaaattgtta tatgatgaca gttcacccaa tccaagagca gctatgttgg tmaaaaataa 360 tacaaagttc atwccaatca ccgaatttat ttccagagac attgttgcga ttcagatgga 420 ggtccctacg gtcagaggwa gtacctgggt gtgtatagcw tctgcgtact tccccggtga 480 taccgatgaa gctcctccag ctgacgtaac aatgtttatt tcttactgta aaaaaatgaa 540 taaacaattt attgttgggt gcgacgccaa tgctcatcac acagtatggg gtagcactga 600 cattaacagt agaggcgaaa acttgcttaa ctwcataaca actaataata ttgatatttg 660 taacaacggc gattgtccca cgtttgttac agcgacaaga caagaggtcc tkgacctcac 720 gttttgcagt gawgttattt ctgaaagggt tataaattgg maggtwtccg atgaaatatc 780 tctctcagat cacaaacaaa ttgagttcga atacaacgcw ggagaaatga tagttggaaa 840 atataggaat ccaaagaaaa ccaattggga cgcmtataaa attcwwctaa atgactacga 900 ctttaattcc gaggatgaaa tccgtactgt tgmacagttg gaaaaagcct cgtctgaact 960 aattattgct ataaccgatt cttacasggc tagttgccca gtccaacaaa ggtcctctag 1020 tcgtgaggta tcgtggtgga atgagaaact agctgtgcta mggaaaggwt ctagaagatt 1080 gttcaacaga gcaaaaaaaa ctggagactg ggwctcatat aaaagwgccc tgactaaata 1140 caataaagaa ataaaacgat caaaacgcag cgactggaga cgttcttgtg aaaaaatcga 1200 ggaaactcca atagtcgcga ggttacaaaa agctctttcc aaagatcatt caaatggtct 1260 aggaaatctt aaaaaggcgg acggcagttt tacgaaaaat gcttctgaaa cactggactt 1320 aatgatgaaa tctcactttc ctggctcgac cgaggtaaca agtggtatag atcaggccga 1380 gggcgaaaga ctggcaaaag attggtctgt actcgcccat aatcaagcct gtgaaatatt 1440 tactcgttcc aaggttgact gggcggtgag ctctttcaaa ccatttaagt ccgcggggac 1500 ggatggaatt tttcctgctc ttctccagca gggtaaagat attatcaatg ctaaattaac 1560 wgacatattt cgagccagta tcgctcttgg ctatatccca aaagcttgga gagaggtacg 1620 ggtcatattt attccaaagg cggggaaaag ggacaaaaca agtcccaaat ccttccgtcc 1680 gattagtctc tcttcaattt tattgaaaac tatggaaaaa atacttgatg attttgtaaa 1740 aagaacgttc ttgacaaaat ttcctctaag taatggccaa tttgcatacc aaagtggtaa 1800 atcaaccata actgctttaa actgcttagt ttctaaaatt gagaagtccc tcaaagccaa 1860 ggaagtagct ctggctgctt tcctagacat tgaaggagct tttgataatg catctcacag 1920 ctctatgcgc tctgccatgg aagcgagggg ctttggttca tgyattatca aatggattga 1980 agcaatgctt attgaaaggg agatttctgc cgaccttgga ggagawcgac tggtaatcag 2040 accacggagg ggatgtccac aaggtggagt tctatcacct ctgctctggt cattagtagt 2100 cgacgatctc ctgaaaaagt tggcgagtct cggctttgaa ctgatcggct atgcggatga 2160 catctgcata atagttcggg gaaaaatagg aagcgaaatt tgcaaccgaa tgcaatcggc 2220 attgaatttt atctcaaaat ggtgcaaaaa cgagggtctt ggtattaatc cctctaaaac 2280 agttattata ccgttcacaa gacgaaggtc actagtacta cctgatcttg ctattcgtgg 2340 gataaccatt gagttttcta acgaagtgaa atatcttggt attactttgg ataggaaact 2400 taactggaat gctcacttga ataatgtatt aaacaaagga attaatgctc tatgggtgtg 2460 taaaaaaacc tttggaaaaa agtggggcct tcgccctaaa atgatttatt ggatatatac 2520 atgcatagtt agaccaagaa ttacgtatgc ctcacttgtg tggtggccga aagttaagga 2580 aaaaactgca caaaataaat tagctaaact gcaaagatta gccaccattt caataacggg 2640 agctatgcac agcacttcat ctgctgcact aaactccttg ttaaatttac ttcctcttca 2700 taaattcgta gaattagaag catttaaaag tgctttatta atgaatcgat ccacaagaat 2760 tcaggataaa gatttgactg ggcacctaga ggtccttaaa aactttactc tgaatccggc 2820 gttaaactca atagtagact ggatgccgac caaggctaac tatgacattc cgttcacagt 2880 ggacgctaaa agtcgctacg agtgggataa tggtgggcct aatcttcgtc caggctctat 2940 tgtgttctac acggatggtt ccagaatgaa taacaaaacg ggagctggag tttttggccc 3000 aggaattaag acatctgttc caatgggaca atatccaaca gttttccaag ccgaagttta 3060 tgcaataatt gaatgtttgc gcatttgcct caaaagaaat tatagattcg ctaatatttg 3120 tatcttttcc gacagcttgg cagcactgaa tgctctgaaa gggtatacct gccagtcaag 3180 actagtatgg gaaggaatca acttactaaa gcaattgtcc caaaataacg cggtcaatct 3240 ttactgggtt cctgggcatt gtggaattct aggaaacgaa atagcggaca gcctggcaag 3300 gaaaggatct gagactgact tcattggccc agaacctttc tttggtgttc cagactgttc 3360 tctgaaaatg gagcttaaac gttgggaact atcaacaatc aattcaatct ggaattctac 3420 agatacttcc agacaggcaa aaagatttat ctttccaggt gggaaactat ctcgagaact 3480 actaaaacta gacaagaaaa atttaagggt gataactggt ttactaactg gtcactgccc 3540 atgcaaatac cacttattta atatgaggaa aattccaaac gcaatttgta gattttgtaa 3600 tattgagaat gaaacatcag agcacctgct gtgcaattgc ggtgccctga ttcaaagtag 3660 aatttcattt tttgggaaag gggtcttaca gccctccaat atcaacaggt tccgtcctag 3720 tagggtgata gactttataa aacaagtcga ccctaactgg gacaacacgc agtaaacagg 3780 agctgcctct catcgttcca tggtgttggg ttaacctgat ttgctacaaa aaaaaagagt 3840 cacaccacaa tattcctcta attggtcgca gtggtcataa ggctcaacaa aaaaaaaaaa 3900 aaaaaaaa 3908 // ID BEL-13_AA-LTR repbase; DNA; INV; 543 BP. XX AC AAGE02023429; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-13_AA_; KW BEL-13_AA-I; BEL-13_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-543 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023429; Positions 1443 901. XX SQ Sequence 543 BP; 133 A; 131 C; 106 G; 173 T; 0 other; tgattaggtt tacattactt ttctcaattt agcatttagc cgtcccgcta caaatgagac 60 gcactgttac ccactcaatc gaattatcca tctcttttct gttatttgcc cagtttgagt 120 gcgatacatg catacgtata gataacgacc gccctgttca tgttcttttc aatttgctaa 180 atacagtatc agaaaagtga aagtgcgtac aagtcgttct tactaacagt cccgaaaaag 240 ttgacttttg cactgattta ttgttcggtt gctcggttta aatctgcccg ctggttgata 300 gaagtgtggt caattagaat ctgtgccggt tagagaggat ttcgacctta aatcaatagt 360 gctacactga gttacgtgtt cggaagcctc ccacgttctt tcaccagttg ccgattccac 420 gttgtgctgc tattttcccg ctgaagttgt tgcccgtcca taccgccgtc gattcatctg 480 taactaggaa tttacagaga acccgtcccc aaagcttgcc ggtcagtagc ccatcaacca 540 tca 543 // ID Sola2-N4_AAe repbase; DNA; INV; 3338 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola2-N4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3338 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1306-1306 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. TIRs are ~1010 bp long. XX SQ Sequence 3338 BP; 1166 A; 568 C; 544 G; 1058 T; 2 other; gagcggttcc acgccaaatc ggtaaacgat gaaactcgac taccttggat tcaaatggaa 60 ttttgcacat agtttcgata taatagaata agttttttcc actggtggag agaccattta 120 aaatcaagag taacttctga taagggcgta agtattttta gcatcaccat ttaaaaaaaa 180 tgtacctcgg aaaccgtttg ttatagagaa aaagtgtctg aggagggatg gtagacaatc 240 aaataggctt tgtaaaaaaa atatacactg aaaaaaaaat acttttattt tcatgaaaaa 300 atcgaaatta aaccttaaaa cacaaattgc aaaaaatagg tatcttcgat ttttttcaan 360 ttttttctgt aaaatttcaa tgtaaaacca gatgttttaa gcacagtttc aacatggtgc 420 aatctaaaat aaaaaagttt ttctaagagc aacttttggt gcatttctga aaattcaatt 480 tctggccgta gaaaagacgt tttgatgctg taatatgatt cttcacccaa aaacaattca 540 aaatgctaat aacaatgatc ttcctaaaac tagccgtttt caagatattt aggattcaat 600 tatattaata tcataaaact taagaaaata cgcccgtttc ataagttact ccagatgctc 660 caccaacaat gttacgttta tctctttcca cattaatatc ccatgtttta agggcagttt 720 caacatggtg caatttaaaa taaaaaagtt tttctaaggg caacttttgg tgtatttctg 780 aaaattcaat ttctggttat agaaaaggcg ttttgatgct gcaatatgat tctttatcca 840 aaaacaattc aaaattctaa tatcaatgat cttcctataa gtaaccgctt caagatattt 900 gggattgaat tatattaata tcataaaact taagaaaata cgcccgtttc ttgcattatt 960 ccagatgctc catcaacaat gttacgttaa tctcttccca cattattggg tatttcgttc 1020 accactggac tgcgcagtca gttttcataa gtgcgagaat gtaaataaaa gtgcgatcct 1080 gcatcaaatc ttgttggtgt cttcagcgca cttattctac gatttataat gaatgagtgc 1140 gctgaagaca tcaacaagat tggatgcaga atcgcacttt tattttcctt ccctcactta 1200 taaaaactaa ctgcggagtc cagtggtgaa tgaaatgcgc aataatatcc catcaacttt 1260 atctttgaca cttacacaaa taaacgaatt gttgagaagt aacaataaca atatgtaaca 1320 atattattag ccgccaccaa tccctccaaa atgatccgtc aaacaaacca agacaaatac 1380 ggaggcaatg aaaacaatag ttgatcaagc gctcggctcg ctcggggcac atggtaaagg 1440 gccatgtgat gccattggtg gcacattgaa aaggatggct actagagcta gtctggcaag 1500 acagcatgag catccaatca ctaatgcaaa agctttgtat gagtgggcta gtaagcggag 1560 agaagaatac ctgacaaaat tgtcattttg ttttgtgtca caggaagatt acgaacaagg 1620 atcagaagaa ctaagtaaca tacatcaaca agcaaagcta attccaggaa cacagaaatt 1680 ccattgcttt gtacccattt ccgaaaacaa aatcgcagca aaactgtatt caagtgatga 1740 ggcagtaagc tattataatg tttatagaaa agtaaaaaca taatcaaata agttgacatt 1800 taanatctgt tctataccaa tatatattaa aacaaataaa aatattaaaa ttatgtacaa 1860 taatcgtgag aataaataaa taaggttaag aagtttaaag gcgtttttcc ttcttcccct 1920 ccttaacatc attcatatta tcatcttcca tcgtaatacc acagacaaac agaagtaaca 1980 cctagaacat ttttcacaaa attctctcga ccacaaatac taccaccatc tagtgagctt 2040 attgaacgaa ataacttttt gtgccacatg tccaacaggt ggcggtagtg tgaaatgtca 2100 aacacgaagg aaagcgacgc gcgcgcctct ggttgtgaac ttgacaactg ttgtatttta 2160 aactgatcgt taaattcgtg ggcgatggaa atttatccag tgttacgtct gtttgtctgt 2220 ggtaatactt atattaattc tcaacaatat tccaacaatc atcctgtatc tttcgaacag 2280 ttcatccaac cgtctaaatg tcaaagaaaa agtagatggg atattaatgt gggaagagat 2340 taacgtaaca ttgttgatgg agcatctgga ataatgtaag aaacgggcgt attttcttaa 2400 gttttatgat attaatataa ttcaatccca aatatcttga agcggtaact tataggaaga 2460 tcattgatat tagaattttg aattgttttt ggataaaaaa tcatattgca gcatcaaaac 2520 gccttttcta taaccagaaa ttgaattttc agaaatacac caaaagttgc ccttagaaaa 2580 acttttttat tttaaattgc accatgttga aactgccctt aaaacatggg atattaatgt 2640 ggaaagagat aaacgtaaca ttgttggtgg agcatctgga gtaacttatg aaacgggcgt 2700 attttcttaa gttttatgat attaatataa ttgaatccta aatatctaga aaacggctag 2760 ttttaggaag atcattgtta ttagcatttt gaattgtttt tggatgaaga atcatattac 2820 agcatcaaaa cgtcttttct acggccagaa attgaatttt cagaaatgca ccaaaagttg 2880 ctcttagaaa aactttttta ttttagattg caccatgttg aaactgtgct taaaacatct 2940 ggttttacat tgaaatttta cagaaaaaat ttgaaaaaaa tcgaagatac ctattttttg 3000 caatttgtgt tttaaggttt aatttcgatt ttttcatgaa aataaaagta ttttttttca 3060 gtgtatattt tttttacaaa gcctatttga ttgtctacca tccctcctca gacactattt 3120 ctctataaca aacggtttcc gaggtacaat tttttttaaa tggtggtgcc aaaaatacat 3180 acgcccttat cagaagttac tctcgattct aaacgatccc caattatatg agataatttt 3240 cgtctcatat acttaataca tctgcgcagt ttcatccaaa tcggaatggt acatgtcaga 3300 tttcagcatt ttatggacga tttcgcgtgg aaccgctc 3338 // ID Gypsy-7_DVir-LTR repbase; DNA; INV; 390 BP. XX AC scaffold_10188; XX DT 10-MAR-2011 (Rel. 16.03, Created) DT 10-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_DVir_; KW Gypsy-7_DVir-I; Gypsy-7_DVir-LTR. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-390 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (09-MAR-2011). XX DR Genome; scaffold_10188; Positions 1884 1495. XX SQ Sequence 390 BP; 128 A; 90 C; 76 G; 96 T; 0 other; tgtagcatgc gtacatatgt atatacatac atatgaataa ggctgcaaca acttttggtt 60 atgccaacag aatgcaaaca taagcagcga ctttgtcgca gcgcgcctaa cgataagtgc 120 cctctgcgca tattcaagtg cagacgaaat atgatcagca cttgataaca acgaccgccc 180 aaatagctaa gtcagcgttc tcacgcttga ccgtcgttgt gtatacgcat gactctctct 240 gactctcttg ctgtatataa taagccaaaa aaccatgtat gtaaaacggc tgctcactgc 300 ttgacggtgg acagctcgaa ctcgaaaaga aaacatataa taaagaacca tcaaggaaga 360 cggacgtctt ttaatcttgg aaccattaca 390 // ID BEL-2_DPu-I repbase; DNA; INV; 5494 BP. XX AC scaffold_119; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_DPu_; KW BEL-2_DPu-LTR; BEL-2_DPu-I. XX NM BEL-2_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5494 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 651-651 (2010). XX DR Genome; scaffold_119; Positions 165482 159989. XX CC Positions [4350-4910] - Integrase core CC 'CATTT' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 35..976 FT /product="BEL-2_DPu-I_2p" FT /translation="MALQTLTARRGGNRGAVTKLLTKLQGIVDDTTLDRDL FT KIYELDKKLQDLHSKIKLIQTLDDEIQNVTDAADVAAEIGSADVFNSNAFD FT GRDRAEFELLKLRQQVIDANAAVAAAAAATVAAAAAAANLNRPPTVPTLPV FT QTSSSNLPKFDLPEFDGSVLHWRAFWDVFEFEVHNKATYTGATKLNFLNSR FT LNGSAKAALSGLTPNNANYPKAIDILKDRFGQDKKIVAAHMRALYNMAKPG FT TDRVSLRIFSDQLESHIRGLDALGKKADSFGDLLVCILLDKFSPDLRRNLT FT RQHGTTEWTLEELQASNESSKF" FT CDS 1324..3837 FT /product="BEL-2_DPu-I_1p" FT /translation="MSVSISPSSTLAAAIHHSESSGKFTQPYVFLKTAIVE FT TRFKGFKERANILIDEGSQRTFITSRLAKLLHLKPMWRESLLLSGFSMNPT FT GIEYYDVTQFFIVDLSGSLIKIKAIIIDHLVSPLDDPHRNLAASLPHLNGL FT QLAHPPSEATSFDVDILIGANFYWSLVRDKVIRGAGPTATESKIGYLLSGP FT LSTSSVENESSDSVLTLHVSTMENFDLSKFWTVEGLGIQPELESDTTTNIY FT QSQCIEFRDNQYAAKLPWKSEHPELAPNFQVCHRRTIGTIKRVSKSPSQLA FT IYQKIISDQLDRGFIEKVPPGEIYKRDCHYIPHFAIEKESATTPLRIVYDC FT SCKTSAGVSLNDCLETGPPLQNDMLEILLRFRVHRVGISADIEKAFHKVIL FT HESDRDFFRFLWTSNISNPDADLEVYRFKVIPFGASCSPFILLSVIKHHLH FT SLPSLTAVDMDLNIYVDNLISGCDSTKEAVDYYAVSNSILNKAGLNLQSWS FT SNDQAINVRAVEDGVADPSTSSKILGLNWDRSTDSLSLPAFRLSTFSHATT FT TKRDILRGISTVFDPMGFFAPLTIAAKILNQELWKEKFDWDDPLPQKFRER FT FSKIAADIDNYHPVVPRCYLECHNDLQNLHLFVDASPHAFGAVAYFCNLDR FT VAFVLSKSRVAPLSIGKKPPTLPQLELMAALIGSTLANTIVKALKPLGISL FT TVTMWSDSQIVLYWLAQTARNKCQYVANRVDTIQKLSKDLHASWHYCPTKS FT NPADLLTRGISLRQFQQSSSLWIYGPTWLSSPEDWPSWETNKLNAVVLHLA FT ESQLAPLISRPTIPPIQLIYSKSWMFHATIGFLYFV" FT CDS 3687..5471 FT /product="BEL-2_DPu-I_3p" FT /translation="MGDQQAKRSRTSFGGVSTGSTHQSSNYPPDSIDLFEV FT MDVSRYNWIPLLRVTSRILRYKSYFLPNRHPAQAFEFLTTAELRNAELIWI FT RAFQQRFLQDEYRYLRSRSGARPALVSQHDLFINDDDIICCKGRLQHARIK FT SSAKHPFLLPKSCDLTTLIILYHHKRMFHCGVDATCSSLRQRYWIPSIKQR FT VRSILRSCVTCRRVSGKPYRAPDHAPLPAFRVSDAPPFTVVGVDYTGAYTI FT RGVSPKNDQKAYICLFTCASTRAIHLELVEDLSAQSFLLAFRRFVSHHSLP FT SMVISDNATNFECGADFITKIFRSPEVANYLTSQQVDWQFIPKRAPWYGGF FT WERLIGVTKIALSKMLGRTKPTFDELRTLVAEAEFVLNDRPIEKMSADSQT FT EEALTPSHLMYGRRFTSLPFDHTAAEEILNDPTFGEKPSILTKSAARLEKS FT LCAFRKHFTTNYLTALREYHKATRGHHEEIVKIGDVVLVHDESPRHHWKLA FT VIESLIKSNDGHVRAAEIRTASGKTNRPISKLYPLEVSESTETIDIQQSID FT SSPEQIQQPIDSSPEQIQPIITSRPVRRAAAKANQLIKQMTMDEPEED" XX SQ Sequence 5494 BP; 1457 A; 1448 C; 1135 G; 1454 T; 0 other; tggtgccgta accaggatct ggttttagtt cttaatggct ctccaaactc tcactgcaag 60 gcgaggtggt aatcgtggtg ccgtcacgaa acttcttaca aagcttcaag gcatagtgga 120 cgatactact cttgatcgag accttaagat ctacgaactg gacaaaaagc tacaagatct 180 acattccaag atcaaactca tccaaacatt ggacgatgag attcaaaacg tgaccgatgc 240 tgctgatgta gccgcagaaa tcgggagcgc tgacgtcttc aacagtaacg cgttcgacgg 300 tcgtgatcga gctgagtttg agcttctcaa attacgtcag caggtaatag acgcaaatgc 360 ggctgttgct gctgccgctg cggccaccgt tgctgctgca gctgctgctg cgaatctaaa 420 tcgtcctcca accgttccaa ctcttccggt ccaaacgagc tcttcaaatc tccctaagtt 480 cgatttgcca gagtttgacg gcagcgttct tcattggcgt gctttctggg acgtgtttga 540 attcgaggtt cataacaaag ccacctacac tggagccacg aaattaaact ttttaaactc 600 gcgtctaaat ggctcagcca aagccgctct gtccggcctt actccaaaca acgctaacta 660 tcccaaagct atcgacatct taaaagatcg cttcggtcaa gacaagaaaa ttgtagctgc 720 ccacatgcgc gcgctgtaca atatggcgaa acctggaacc gatcgtgtaa gtctacgcat 780 cttttcggac cagctagagt cgcacatcag aggtttggac gcccttggaa agaaggctga 840 ctcgtttggt gatcttcttg tgtgtatcct cttggataaa ttctcaccgg acctacgtcg 900 aaatttaact cgacagcatg gaacgacaga gtggactcta gaagagttgc aggcttcaaa 960 cgagagctcg aaattttaga cgacagccca gctgaacagc attcaagcaa gattccgcca 1020 gcaggtgtaa agaagaccaa cgttttcttt gccggagcgt ctcaacctat caagaaaaag 1080 caacgctgcg cctattgcac aggtgatcac tacgcatctc aatgtaccag tgtcgggaaa 1140 gcagcagagc gacttaaggt tgcaaaagtg aaaaagctgt gtttaaactg tttggacgcg 1200 tcccacacca gtttgaaaga ttgtccgtca aagtatcgtt gcaaatactg ctccaaagct 1260 catcattcca gtttacatcc agaaaactcc gaggaagtgt ccgagactcc atccgtcgct 1320 tccatgtcag tgtccatctc tccatcttca actttggcag cagccattca tcattcggag 1380 tcgtcaggaa agtttacaca gccgtacgtt tttctcaaaa cggcgattgt tgaaacacgc 1440 ttcaaaggtt ttaaggaacg cgccaacatt ctcatcgatg agggatcgca acgcaccttc 1500 attacttccc gtttggctaa attattgcac ttgaaaccaa tgtggcgtga gagtttactc 1560 ttatccggtt tctccatgaa tccaacaggc atcgaatact atgacgtaac tcagtttttt 1620 attgttgacc tcagcggttc cttaatcaaa atcaaagcca tcatcattga tcatcttgtg 1680 agcccactgg atgatccgca tcgtaatttg gcagcttctc ttcctcatct caacggactt 1740 caattggccc atccaccttc tgaagccaca tcattcgacg ttgacatcct gattggtgcc 1800 aatttctact ggtcacttgt tcgcgacaaa gttattcgag gcgcaggtcc aacggccacc 1860 gagtccaaaa tcggttatct tctctccggt ccattatcta cgtcatccgt agaaaacgaa 1920 tcatccgatt ctgtattaac gcttcacgtt tcgaccatgg aaaatttcga cctgtctaaa 1980 ttctggacgg ttgaagggct tgggattcaa cccgagttgg aaagcgacac aacgactaac 2040 atctaccaga gtcaatgtat cgaatttcgt gacaatcagt atgcggcaaa acttccttgg 2100 aaatcagaac atcccgaact cgctccaaac tttcaagttt gccatcgtcg aaccatcgga 2160 acaatcaagc gggtatctaa gagcccaagt caactggcca tctaccaaaa gatcatcagt 2220 gaccagttgg atcgaggttt tatcgaaaaa gttccgccgg gagaaattta caaacgtgat 2280 tgtcactata tccctcactt cgccattgaa aaagagtcag cgaccacacc gcttcggatc 2340 gtgtacgact gttcttgtaa gacatcagca ggcgttagtc taaatgattg cctcgaaaca 2400 ggaccacctc ttcagaacga tatgttggag attctactac gctttcgtgt acatcgagtc 2460 ggcatttccg cggatatcga aaaagctttt cataaagtca ttctgcacga gtcggaccgt 2520 gatttcttcc gcttcctatg gaccagtaat atcagcaatc ccgatgctga tctagaggtc 2580 taccgcttca aagttattcc atttggagcc agctgttctc catttattct cctatcagtc 2640 atcaaacatc acttacattc tttgccgtcc ctcacagcag ttgacatgga tctcaacatc 2700 tatgtcgaca atcttatttc aggatgtgac tcaacgaagg aagcagtgga ctactatgca 2760 gtgtcaaact ctattctcaa caaggccgga ctgaatcttc aatcttggag ttcaaacgat 2820 caagcaatca atgtccgtgc ggttgaagat ggcgtagccg atccttccac cagctccaag 2880 atactgggcc taaactggga tcgttctacc gactcacttt cgcttcccgc gttccgtctt 2940 tccactttca gccacgctac aacaacaaaa cgcgatattc ttcgtggcat ttctacagtt 3000 tttgatccaa tgggattctt cgctccgctc acaatcgcag caaaaattct aaaccaagag 3060 ctctggaaag aaaagtttga ctgggacgat cctctccctc aaaaattccg tgagcgtttt 3120 agtaaaatcg cagctgacat cgacaattat catccagtcg tcccccgctg ttatcttgaa 3180 tgtcacaacg accttcagaa tttgcatctc ttcgtggacg ccagtccaca cgcgtttgga 3240 gccgttgctt acttttgcaa cttggatcgt gtcgccttcg ttctctccaa atcacgagtc 3300 gctccattgt caatcggtaa gaagcctcct actttacctc aactggaact tatggcagct 3360 ctgatcgggt caacactcgc aaacacgatc gtgaaagctt taaaaccact tggaatttct 3420 ctcaccgtca cgatgtggtc agatagccaa attgttcttt attggctggc gcagacagct 3480 cgtaacaagt gccagtacgt cgcaaaccgt gttgacacta ttcaaaagtt atcaaaggat 3540 cttcatgctt cttggcatta ttgccccaca aaatccaatc cggctgattt actcacaaga 3600 ggtatttcac ttcgtcagtt ccagcaatca tccagtttgt ggatttatgg tccaacttgg 3660 ctctcttcac ccgaagactg gccgtcatgg gagaccaaca agctaaacgc agtcgtactt 3720 catttggcgg agtctcaact ggctccactc atcagtcgtc caactatccc cccgattcaa 3780 ttgatttatt cgaagtcatg gatgtttcac gctacaattg gattcctcta cttcgtgtga 3840 cttctcggat tcttcgctac aagagttact ttttacccaa tcgtcaccct gctcaagctt 3900 tcgaatttct cacaactgcc gaacttcgaa acgcagagct aatttggatc agagcctttc 3960 agcaacgctt cctacaagat gagtacaggt atctccgttc ccgttcaggt gctcgaccag 4020 cgcttgtatc tcaacatgat ctttttatca acgacgacga catcatttgc tgtaaaggcc 4080 gtcttcagca cgctagaata aaatctagtg caaagcatcc gtttcttctt ccaaaatcct 4140 gcgacttaac tactttgata attctttatc atcacaaacg aatgttccac tgtggcgtcg 4200 acgccacctg ttccagtctc cgtcaacggt attggattcc atctatcaag caacgtgtac 4260 gatccatttt acgcagctgc gtaacttgtc gtcgagtgag tggaaaaccc tacagagccc 4320 cagatcacgc tccgctacct gcattccgag tcagcgatgc cccccctttc accgtcgttg 4380 gtgtcgatta caccggtgct tacacaattc ggggtgtttc tccaaaaaat gatcaaaagg 4440 cttatatttg cttattcaca tgcgccagca cccgtgcaat tcatcttgaa ttggttgagg 4500 atctttcagc tcaaagtttc ctactcgcct ttcgtcgatt cgtgtcccat cattctttgc 4560 cgtcaatggt catctctgac aacgcgacaa attttgaatg tggcgctgat ttcatcacga 4620 agatttttag gagtccggaa gtagccaatt acctcaccag tcaacaagtg gattggcaat 4680 tcatcccgaa acgcgcacct tggtacggag gtttttggga gcgcttgatt ggagtcacaa 4740 agatcgcttt gtcaaaaatg ctcggccgca ctaaaccaac ctttgacgag ctgcggactt 4800 tagttgcaga ggctgagttc gttctgaacg atcgtccaat cgagaagatg tccgctgatt 4860 cccagactga agaagcgctt acaccatcac atttgatgta tggaagacgt ttcactagcc 4920 taccattcga ccatactgca gccgaagaaa ttctcaacga cccaacgttt ggtgagaagc 4980 cgtcgatcct gactaaatct gcagctcgat tggaaaagag cctgtgtgcc tttcgaaaac 5040 atttcacaac aaactactta actgccctac gcgaatatca caaagctacg cgcggtcatc 5100 acgaagaaat cgtcaagatt ggcgatgtcg ttctcgtaca cgatgagtct cctcgtcacc 5160 attggaagtt ggcggtaatt gaaagcctca tcaaaagcaa tgatggccac gtcagagcag 5220 ctgaaatccg gacagcgtcc ggaaaaacca accgaccaat ctcgaagctg tatcctttgg 5280 aggtttcgga gtcaaccgaa actattgata tccagcaatc tatcgattcc tcgccggaac 5340 agatccagca acctatcgat tcttcgccgg aacagatcca gcccatcatc acgtctcgtc 5400 ccgttcgtcg agctgcagca aaggcgaatc aactcatcaa gcagatgacc atggacgaac 5460 cagaagaaga ttgatttccg ccggccaggg agaa 5494 // ID Gypsy-20_SI-LTR repbase; DNA; INV; 610 BP. XX AC AEAQ01023379; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_SI_; KW Gypsy-20_SI-I; Gypsy-20_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-610 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023379; Positions 629 20. XX SQ Sequence 610 BP; 197 A; 126 C; 148 G; 139 T; 0 other; tgctagaaca agaatatgcg caattattta atcaagagcg tgattattta agaaatatag 60 ctaagcaaaa tattttaaaa attcagaaag aaaatcagcg ttcgtataat aaacatcgca 120 agaaagcatc tatctataaa gaaggtgact tagttgccat ccgcagaact cagtttgcgc 180 caggtctcaa aattaggaca ccgtacttag gtccatataa gatttctcaa gtcaagggaa 240 atgatagata cgaagttgta aaaatcggta ccggagaagg gcccatgatt actacttccg 300 cagccgatta catgaaaatt tataagtttc cttcaggggc cgaaggaaaa gcagggatgg 360 ccgaatgtgg gaatggcgat cctaacgtac gcgcgcctgg ggattaccca gcggcgacac 420 tagcagaaga aagagcgagc gccgagcggt ggcgtgcgag tcgcgtgcgt agaagtagtg 480 gagatataac cgccgtagcg ggcagtgagt gagcacgtgt gtaaaacagc gagcgaacca 540 ctaatgtcat ttcctgttaa tatattgtac taactcgcac gcactgtgtt ttccctcacc 600 cacccctaca 610 // ID Gypsy-18_OD-I repbase; DNA; INV; 8432 BP. XX AC CABV01002725; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_OD_; KW Gypsy-18_OD-LTR; Gypsy-18_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-8432 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002725; Positions 16832 8401. XX CC Positions [5263-5739] - Integrase core CC 'ATCA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(842..2449,2453..4456) FT /product="Gypsy-18_OD-I_3p" FT /translation="MQLTPISELTVVQLKSICKRLKLSKAGNKADLQATLR FT AHARQMNYRRTNDLKEYVVNTDNLGDEDEEVEPVVSTAAPAALPASLTLPS FT VTSPAPIPTSSFAFPAVVTTASSSEKPAAQGPAMSLLQDIQRERHSRAESI FT ASLCGGTSVTASTTRQSGLYSTAFPQRSWIPDSTRSENPWSLGTPNRTVPT FT SAPFNSPKTSPPVTPPTSYGFNSAGVNDARSRDSLRKESADIRMVSSRSPE FT EIESLLDSLENLASIYDWSPAQMMTVLRTKLSPSIQRECCITPSMTFETAS FT TQLRKRLGYSHQKLTTKIEQLQRLRTESASQAFSRLINLLSASGSVYPSLP FT EEARQQLLRTSLRKVLRPREFEMFFISWISNSQTTDLAAIAEIISLLEGNW FT GNEVGVVGEAGADEYDLNVARSTNCHLCGEAGHYMRQCDLLNGIRQMGTLA FT SRMDQMEAMALRQDECLNRLLTSIESIAKTSQAACNATVQPQHQVPVVQQE FT GHRRGNRGPKKPFCRTCPLTSDGHPPRHYWQECPTSQSSHLGRQPDPKSTA FT EKSPLFYHNINSPSLLPMITLDILNIPVTCLVDSGANCCIVDSNIAANFPK FT ARREAHSAAITVANASKMVVKEKIWLPLCFNNVVKYIDFLIVDGLNHQAVL FT GMSSWSSLTIDRREESITIDGACFDMVPSGRSLVLNSMVAATIAPQSLTPL FT RVKVPAGEDEANDILLVEPSKKGFCAENGVDVMQAAYGVKPDQVILVYNTS FT QVALTIPCGAEVATAKLMEKRDDGRYKFNELVSIDNEEFRNEWLDYRRARE FT IRLKGEDGEKYRNVTIGNELDDKQKTKVQKLLKQFRSVFSTGPTDLGMVKE FT CAFRIRLTDETQRVYEKPIPVRPGLRDKAEKVIEDWCKAGVIEREASDFNL FT PLFFVKKGKDGVRPVLDARRLNMLTIPTRWPIPSLKELMSKLSRNISKAES FT VFVTSVDVASAFNQVRTAPEDSHKCAFSFLGRQYVSRRMCFGLANSPSVFA FT HVIDRLCGHIDGLAVLMDDIVLVHGSFEEHLRAVSEFLAILEEHGILLKIA FT KSHFFVKNFDYLGFKVTEEGIEPLADKLDPVRSFPRPRSRKELRRFLGLTN FT FYQHYVKDGQKTLSPLHKLAAPSKSPYQWKSQHEEAFSRYKEALLQVVSFK FT HRDESMPLVISTDACSTGIGACLYQRDTTGTSSSAWILLQSSFRG" FT CDS join(4843..6096,6100..7830) FT /product="Gypsy-18_OD-I_1p" FT /translation="MQTRRAKRLAESAPDQEEPIMVSGDVPYRPKDIIQLQ FT EIDPALRKMDKAGKLVKGANGLWVLRSNSDIIALPEQLHIEVASFVHAKMA FT HPGALALEKILKRKFFIARIALICREVCSACEICWKIKPRAQLKGLSPPKP FT EMTVRPWQKLYIDLSDFGSLDLSGNRYFLGVMDDFSRFLVGRCLPDKSAET FT VAYALADILLSNGVQGAMIRADNGLEFRNTTFDSVMEKFNIRVAHISPYHA FT QSNRIERKFRSLGLKARLANINRDDWSRQIPFVVFMVNNAPTTALDGLSPS FT EVLTGRPMELPLFHLRPDPELNEDPFLWSGYLSGWLRNVGQFLGQKYRERS FT AHQMEPVRGITLEVGQKVCYWAPMRLKQCKKLHVAYKNKAEVVKILGQDVY FT EIRDMVTGNLLRRNVRFLRPLPSSLSFSPFCTYSCDLPCSKLNLYAFRLHQ FT NPEMFVVNLTFIGIALSQSLVLDGGILLERSGSLFLSRQKSHHNVIITRPV FT PRNPFLELDTSNCEVEACQEFFKSQRWNKKECSDDLGDTRLESLLSAIDDR FT FARELLDLKEEAAYARAHESRLNKRFAGAPALVLAAANLGLKIKNTADIIK FT LKSVSLAQGRNIDKLTKNFATLESNLAVSLNSTYERVSKIEHDLCRLNNQM FT VENRIQDWSTATVSQYINEVDYEVAALQQGQIPSRVEWNRLFSAACWGSCV FT ALEPAECEAYCEGLLRELPLELKPVLLGLNITDSGLDVFIRLAFPVIESSP FT TKLFQANPFGIIVKTDDSNQLVAPDIAPYATELAGEYYEVDRFSCLSSRKN FT MICRHSAIMSASCLREVKNCNVKKTTTEATCSFAFDNNGVVVYAAEEAIFR FT NRTVLADIGSAHERFNGFRYFPESATDSEVFCSGNLLIKIPKTPLVSSSNT FT KMDLRAAANTNFTFAVFNVNPDSLADDVEERLDELEDIVEDNFTQTTQFSI FT LTSVALLLVLFTTLGLLTWQFCLRVRGLARTVNSALKPLI" XX SQ Sequence 8432 BP; 2225 A; 2303 C; 1914 G; 1990 T; 0 other; ctggtgactg attcgataaa gaatacgcca aagaaataaa agtttttttt aaacaaaaaa 60 aaaaacgaaa gaaagagatt tcagaagatt tcggaagaaa agcaaaataa agagaatctc 120 ctgctgaaaa ttctggaaga atctccaaaa cgaagcagct caaatcttct gtaagttatt 180 ttgcaaagag gggggaagaa aaagcaagga aatcgacctt ggatctcaga agtattttct 240 ggactcgaaa gtgctgaggg accaggccgc tgaacctgtc agcacagtct ttcttgaagc 300 cgccgaggtc gcccgcttag ccgctcgctt cctgctctcc cgcatcgccg cctcataaag 360 tctcccaccg agactcatta caccttatct tcattaatcg gcccccacag cgatattcga 420 cccagaggat tacccgcaac gattgtataa caattattaa tcgcctgccg agtacctaca 480 tcattagtct cctgctttga ctaattacag caaatcctca ttacttgtca cttgccgcga 540 tctttggaaa tccgaatcac ccgcaacgct tgtataacga taattaatca ccagccgagt 600 atctacatcg ttagtctcct gcttggatta attacacaaa tccttattac tcgtcacttg 660 ccgcgatctt tgacagtccg aatcacccgc atattgatta acatcaataa atcgccagcc 720 gagaacgact taagggaccc ctccaggaat tgcaaagttg gttcagctca gaacaatcac 780 acgctttctg cgatcccgac aacagcgtca gtctctacga acccttcatc gagcaaggac 840 aatgcagctt actccaatct ccgaattaac ggtcgtccag ctcaagtcga tctgcaagcg 900 gttaaagctg tcgaaagccg gaaataaagc tgacctgcaa gcgactcttc gtgctcacgc 960 ccgacagatg aactacaggc ggacaaacga cctcaaagag tacgtcgtga acacggacaa 1020 cctcggcgac gaggacgagg aagtggaacc agtagtatcg accgccgcgc cagccgcgct 1080 acctgcgtct ttgacactcc cgtctgtcac ttcgccagcg ccgatcccga catcatcgtt 1140 cgccttccct gcagtggtaa caacggcgtc gtcatccgag aaaccggctg cgcaagggcc 1200 cgccatgtca ctcttacaag atatccagcg tgagcggcat tcgcgagctg agagcattgc 1260 ttcgctttgc ggcggaacaa gtgtcactgc ttcaacgacc aggcaaagtg gactctactc 1320 gactgctttt cctcaacgct catggattcc cgactctact cgatctgaaa atccttggtc 1380 tcttggaacc ccgaatcgaa cagtgccgac ctcagcgcct ttcaactctc caaagacgtc 1440 gccaccggta acaccgccaa cgagctacgg gtttaactca gctggagtca atgacgcacg 1500 aagccgggat agcttgagaa aggagtcggc ggatatcaga atggtaagct cgcgttcacc 1560 ggaggaaatc gagtccttgt tggactctct ggaaaatctc gcctcaatct acgactggtc 1620 gccagctcag atgatgacgg tgctgcgaac aaaactgagt ccatcaatcc agcgtgaatg 1680 ttgcattact ccaagtatga ccttcgaaac tgcaagcaca cagctccgaa agcgccttgg 1740 ctacagccat cagaagctca caacgaagat tgaacagctt caacgattac ggaccgaatc 1800 ggcaagccaa gcattctcaa ggcttattaa cctcctcagt gcatcaggat ctgtctaccc 1860 gtctcttccg gaagaagcaa ggcagcagct tttaagaact tctctccgaa aagtactccg 1920 tcctcgtgaa tttgaaatgt tcttcataag ctggatatcc aactcgcaga caactgacct 1980 agctgcaatt gcagaaatca taagcctctt agagggcaat tggggcaacg aagttggagt 2040 cgtcggcgaa gctggtgcag atgagtacga tctcaacgtt gcgcgcagta ccaactgtca 2100 cctctgtggc gaagccggtc actacatgag acagtgcgat ctccttaatg gaattcgaca 2160 gatgggaact ttggcaagca gaatggacca gatggaagcg atggctctcc gacaggatga 2220 atgcctgaac cgactgctaa cctcaatcga gtcaatcgcg aaaaccagcc aggcagcttg 2280 taatgcgaca gtgcagcccc aacaccaggt tccggttgta cagcaagaag gccatagacg 2340 tggcaatcga ggcccaaaga aaccattttg ccgaacttgc cccttaacct ccgacggtca 2400 ccctccgcga cactactggc aggagtgccc gacatcgcaa tcttcgcact gactaggccg 2460 ccaacccgat ccaaaatcaa ccgctgaaaa atctcctctg ttctaccaca acatcaactc 2520 tccttcactg ctcccgatga taactttgga catccttaac ataccagtta cttgcttagt 2580 tgactctggt gccaattgtt gcatcgtgga cagcaatatt gcagcaaatt tcccaaaggc 2640 gcgtcgtgaa gcacactcgg cagctataac cgtggcaaat gcatcaaaaa tggttgtaaa 2700 agagaaaatt tggcttccac tgtgcttcaa caacgtcgtc aagtacatcg acttcttaat 2760 cgtggacggt ctcaatcatc aggcagtgct tggaatgtcg tcatggtcgt ccttgacgat 2820 cgatcgcagg gaggaatcca tcacaatcga cggcgcttgc ttcgacatgg ttccgtctgg 2880 acgttcattg gtcctgaaca gcatggtcgc tgccacgatc gcgcctcagt cccttacgcc 2940 gttacgagtc aaagtacccg ctggcgaaga cgaagccaac gacattctcc ttgtcgagcc 3000 ctcgaagaag ggattctgtg ctgaaaacgg agtggacgtc atgcaagctg cgtacggtgt 3060 gaagccggac caagtcattc tggtctacaa tacaagccaa gtcgcactga cgatcccttg 3120 tggcgccgaa gtggccacag caaaactcat ggaaaaacga gacgacggga gatacaagtt 3180 caacgagctc gtcagcatag acaacgagga gttccgcaat gaatggctcg actacagaag 3240 ggctcgtgaa atccgtctta agggtgaaga cggcgagaag tatcgaaacg tcacaattgg 3300 taatgagctt gacgacaaac aaaagacgaa ggtccagaag ctcctcaagc agtttcgcag 3360 cgtcttttca accggtccaa cagacttggg aatggtcaag gagtgcgcct tccgcatccg 3420 gcttacagat gaaacacaac gagtctacga gaagccgatt cctgtacgtc ctgggctgcg 3480 agacaaagcc gagaaagtga ttgaagattg gtgcaaggcg ggagtaatcg aaagagaagc 3540 aagtgacttc aatctccccc ttttttttgt caagaagggc aaggacggag tgcggccagt 3600 tttggacgca agaagactca atatgcttac gatcccgacg cgctggccga tcccgtcgct 3660 gaaggagctc atgagcaagc tgtctcgcaa catttcgaaa gcagaatccg tcttcgtcac 3720 ctcagtcgat gtcgcatctg cattcaacca ggtccgcact gctccagaag actcgcacaa 3780 gtgcgcattc tcatttctag gacgacagta tgtcagccgt agaatgtgtt ttggtcttgc 3840 gaattcacca tccgttttcg cgcacgttat tgatcgactc tgcggccaca tcgacgggct 3900 tgccgttctt atggatgaca tcgtacttgt ccatggcagc tttgaagaac acttgcgggc 3960 agtcagcgag ttcttggcga tattggagga gcatggtatt ttgttgaaaa ttgcaaaatc 4020 tcatttcttc gtcaaaaact tcgattacct cggcttcaaa gtaaccgaag agggaatcga 4080 gcctttggct gacaagcttg atcccgtccg aagttttccg cgccctcgaa gtcggaagga 4140 gctccgaaga tttcttggtc taacgaactt ttaccagcat tatgttaaag acggtcaaaa 4200 aactctgtcg cccctgcata agctcgcagc tccatcaaaa agtccctacc aatggaaatc 4260 ccagcacgag gaagccttct caaggtacaa agaggctttg ctgcaggtgg tctccttcaa 4320 gcatcgcgat gaaagcatgc cgctggtgat ctcaaccgac gcatgttcga ctggaatcgg 4380 cgcgtgtctc tatcaacgtg acacaacagg gacgtcttca tccgcttgga ttcttctcca 4440 gagctctttc agagggtgag cagaagctgc cttctcgata tttggagctt ctcggactag 4500 tttgcggttt agagcacttc gaatttgaaa tctatatgca gccagtgtac gccctgactg 4560 accacaagag ccttcaactg gtcctgcacg aaaagaaaat gcgacagaac gttcctgttt 4620 cgagtgacaa acctgttctc aagactctcc cgattcaacg tcagtggaat cgaatacgtt 4680 tcctgcgaca aaggcgtgat cgtctcctcc gacgccctca gccgaacgtt tgagatttct 4740 aacgacgacg ataaggacga ggacgaagac atcgtcgaca gaggctttct caactcaatc 4800 gagcttcgcc cacgcgctcc tgagcttcta caagtactcc agatgcaaac aagaagagcc 4860 aaacgacttg ctgagtctgc acctgaccaa gaagagccga ttatggtttc tggcgatgtt 4920 ccctaccgcc caaaagacat aatccagctc caagaaattg atcctgcact tcgaaaaatg 4980 gacaaagctg gaaaactggt caaaggtgca aatggtttgt gggtcctaag aagcaattca 5040 gacataatcg ctctccctga acagcttcat atcgaggtag cctccttcgt acacgcaaaa 5100 atggctcatc ctggcgcact ggccctcgag aaaattctca aaagaaagtt cttcatcgcc 5160 aggattgcac tgatttgcag agaagtttgc tcggcttgcg agatctgctg gaaaatcaag 5220 ccaagagcgc aacttaaggg actctcccca ccaaagcccg aaatgactgt cagaccttgg 5280 cagaagctct acatcgattt atcagatttt ggctcactgg atttgtctgg taacagatac 5340 ttccttggcg ttatggacga cttttcacga ttccttgttg gtcgttgtct ccccgacaaa 5400 tccgctgaaa ccgtcgctta cgcactcgcc gacatcctgc tcagcaatgg cgtccagggt 5460 gcgatgatac gagccgacaa cggcctggaa ttccgtaaca ccacttttga ctcggtaatg 5520 gagaagttca acattcgcgt ggcacacatc tctccctacc acgcgcagtc aaaccggatt 5580 gaaagaaagt tcagaagtct tggtctgaaa gcccgactcg cgaatatcaa cagagacgac 5640 tggagccgac aaattccctt cgttgtcttc atggtgaaca atgcgccgac aacagctcta 5700 gatggactct cgccttcaga agtactaact ggtcgtccga tggagcttcc gctgtttcac 5760 ctgcgacccg atccggaact gaacgaagat ccctttctct ggtctggata cctcagcggc 5820 tggctacgca acgttggaca atttcttggt cagaaatacc gcgagcgaag cgcgcaccag 5880 atggaacccg tccgtggaat caccttggag gttggacaga aagtctgcta ctgggcccca 5940 atgcgtctga agcagtgcaa gaagcttcac gtcgcctaca agaataaggc tgaagtcgtc 6000 aaaatccttg gtcaagatgt ttacgaaatc cgcgacatgg ttaccggaaa tctacttcga 6060 cgaaatgtcc gctttttgag acctttgccc agttcctaat tatccttctc tccattttgt 6120 acatacagtt gcgatttgcc gtgctcgaaa cttaatctct acgcattcag acttcatcaa 6180 aatcctgaaa tgttcgtcgt caaccttact ttcattggca tcgccctcag ccagagcctt 6240 gtgctcgacg gcggcatcct cctcgagaga tccggctcgc tgttcctgtc tcgccagaag 6300 agtcaccaca atgtgatcat cactcgaccc gtccctcgca atcctttcct ggagcttgac 6360 acaagcaact gcgaagtcga agcctgtcaa gaattcttca agagtcagcg ctggaacaaa 6420 aaggaatgca gcgacgatct tggcgatacc cgcctggaat ctctcctctc cgccattgac 6480 gaccgattcg cccgggaact acttgacttg aaagaggaag ccgcttacgc ccgcgcacat 6540 gaatcacgct tgaacaaaag attcgctgga gctcctgcac tcgtactcgc tgccgccaac 6600 cttggtctca agatcaagaa cacggctgac atcatcaaac tcaaatctgt cagccttgcc 6660 caaggccgca atatcgacaa gctcaccaaa aacttcgcaa ccctcgaatc caacctggcc 6720 gtcagcctca actccacgta cgagcgtgtc tcgaagattg aacatgacct ttgccgcctc 6780 aataaccaga tggtcgagaa cagaatccaa gactggtcca ccgctactgt cagccagtac 6840 atcaatgagg tcgactacga ggttgccgct cttcagcagg gccagattcc gagtcgtgtc 6900 gaatggaatc ggcttttctc cgctgcttgt tggggatcct gtgttgcact tgaaccagcc 6960 gaatgcgaag cttactgcga gggactgctt cgagaacttc cattggaact caaaccagtg 7020 ctgctcggct tgaacatcac ggactccgga ctcgacgtct tcatcaggct cgcatttcca 7080 gtcatcgagt catccccaac gaagctgttc caggccaacc cctttggcat catcgtcaag 7140 actgacgatt ccaaccagct tgttgccccc gacatcgctc cttacgcaac agaacttgct 7200 ggagaatact acgaagttga tcgtttttca tgcttgagct cccggaaaaa catgatttgc 7260 agacactcag caattatgag cgcctcctgt ctccgcgaag tcaaaaactg caatgtcaag 7320 aagacaacga ctgaagcaac ttgctcattt gccttcgaca acaacggtgt ggtcgtctac 7380 gctgccgaag aagccatctt ccgcaaccgc acagtgctcg ccgacattgg ttccgcacac 7440 gagcgcttca atggatttcg atatttccca gaatcagcta cggatagcga agttttctgc 7500 tctggaaatc tcctcatcaa aatcccgaag acgccactcg tcagctccag taacacaaag 7560 atggatctgc gtgctgccgc caacaccaac ttcaccttcg ccgtcttcaa cgtcaacccc 7620 gacagcctcg ccgacgacgt cgaggaacga ttagacgaac tcgaggacat cgttgaagac 7680 aacttcactc agacaaccca gttttcaatt ttaacttctg tagccttact tctagtactt 7740 ttcacaaccc tcggactcct cacatggcag ttctgcttga gagttcgtgg tctggcgcga 7800 acggttaact cggctctaaa gccactcatt taagctagca ccgttaagtt cagcgccacg 7860 tccactcaag actttccaac accactctat tttttttcta acctacctct gggggtcccg 7920 gttaactagt gttagacttt tttttttggt tcgcaccttt tcaagaggag ggaaggaaga 7980 cttgggagac ggaaagtctg gcttcgtata cgtaacggtt gcggaacaga ctttcctcga 8040 caccacttaa catcctttat ttttgattgt gatgccagtt ggacggcgat ccgacccata 8100 ggggcagctc ttggcgaaga gtggggagaa attgccctgt gcgaacatta ggattagaaa 8160 ttagccgggc ctaacgtttt tgtagttcaa cttttatgtt ggatttaaag tcgattatta 8220 agtagcaaaa gcccgatctc tcattctcct gtcatgtgag ccatttctat agtcactttt 8280 ataatagcac ttctatttcg ttaagaaccc tttattactc gtgtgtactt tttcctattt 8340 cttcttttct gataaactcc ttacaatagc cttagctcgt gccatcagtc atgtgtttta 8400 catattatct actccagaaa tctaggggtg gg 8432 // ID Copia-51_AA-LTR repbase; DNA; INV; 201 BP. XX AC supercont1.1; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-51_AA_; KW Copia-51_AA-I; Copia-51_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-201 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1; Positions 2819032 2819232. XX SQ Sequence 201 BP; 59 A; 41 C; 31 G; 70 T; 0 other; tgttgagggt aataattaac acacattcac ctactctttg ttatacagtg taagctatac 60 ctgaaatgat aaatactttg taccttgaac tgcgtcatga gaaatgtcat taaacatatt 120 cattctaagt ttgtattcct atccacaaca cgcgttttta ttctctctcc gaaattgctc 180 tgctaagagg ttatgggccc a 201 // ID Mariner-36_HM repbase; DNA; INV; 3454 BP. XX AC . XX DT 14-JAN-2009 (Rel. 14.02, Created) DT 14-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-36_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3454 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 394-394 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1004..2788 FT /product="Mariner-36_HM_1p" FT /translation="MRNYKRKTSNGAFTSQQLSDAASSVLEGKKSVSAAAK FT EFDIKRMTLARYILKLNSGGYPSMGYSKPRLIFSQEQESALKNYLLQMASI FT FYGYTPKDVRSLAYECALQYKIKIPDSWTVNKMAGKEWMTSFLKRNPQLSI FT RKPEATSLGRATSFNAANVKVFFDKLGEVMDIYKFSASQIWNVDETSVSTV FT VKPSKIVAAKGKRNVGAMTSGERGTNVTMVTAVSASGNTVPPMFVFPRKNY FT KDYFVNNGPPDCIGVGNGSGWVTDIEFKNFMQHLIRHVKPSNEYKILLILD FT NHSSHLHFETLNLAKENGIVMLSFPPHCSHKLQPLDVSVFGPFKKYLSVAQ FT DAWLRNNPGKAITIYDIPKIVSDSLPLAITCTNITKGFQKTGVYPYNANIF FT ADDDFLPSFVTDRIEPANLVSELSHVAHSEVCLTSSSGQSVKSSINKETML FT LIPETPKAFSPETVRPYPKAAPRKIHLTRRRKRKAAILTDTPERIILKQQQ FT TKKPKKVEKQKKLSTCQFLSNINKECGNKETQLHSILKKVKKQTNKKTEKK FT EILKKPKKSSASQSFADHGNKETLLLSILKKVKKQSNKKTKMTKKLN*" XX SQ Sequence 3454 BP; 1280 A; 489 C; 522 G; 1162 T; 1 other; gggtagagcg kggctagttg ttaagcatag gggcaagttg ggcaactaaa atatctcaaa 60 aactactcat cccatggcaa aacatctaat ggataaaaga taggtattta gaatggtaac 120 aaaatgcgca tcaaatagtt gttggaccgc ggttcagtaa catgcacaga cactttttcg 180 attttggacc aaaaaagtaa aaaaaaaata tttttgctat ttttctctta gatgcttaaa 240 aaaacattta tctggaatca cttaagtgtt tatattttac ccatttattt ggctattgat 300 tatacttatc ctttttttcc ccaagcctta tagttctttt tataacctaa ctgatgtgtc 360 gccttgtcta gtaggggtaa gttgagcaga tttgacaacg attaataaaa cggctttatc 420 agacgcgaag ataaaatttc gtttggtgaa cattttaagg tccattcaca cttttaaatt 480 aatcattata aaatactcat aacaattata aggtaatcat cttaaacctt ttttttttaa 540 atatgtattt ataatttaaa attatttgat tttttaatat ttttaaaata tttttacttc 600 aatattacgt ggtcaataaa atctttaagt tgaaaagttc ttgaataatt tttttaaatt 660 aatcttgttt actatgttta ttatgataat attatgatag tgtgcatatt taaaaaacat 720 ataacttata atttatatta tataaacata taatttaaag tatatataat caaaacatat 780 ataaaatata atatttcatt tatattttga ttatttatac tatatataaa taatttatat 840 atatatatat ttaatattac aatatttaaa tagaaaagtt taaatattgg tatatacata 900 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 960 tatatatata tatatatata tatatatata tatatattat aggatgcgta attataaaag 1020 aaaaacttca aatggtgcct tcacaagcca acagcttagt gatgcagcaa gttctgtttt 1080 ggagggaaaa aaaagtgtta gtgcagctgc taaagaattc gacatcaaga gaatgacatt 1140 agcaagatat attttaaaac taaactctgg tggttaccca tccatgggtt attcaaaacc 1200 tagattaatt ttttcccaag agcaagaaag cgccttaaaa aattatttat tgcaaatggc 1260 atctatattt tacggctata ctcctaaaga tgttcgatct cttgcttatg agtgtgctct 1320 tcaatacaaa atcaaaatac cagattcttg gacagtaaat aagatggctg gtaaagagtg 1380 gatgactagt tttttaaaaa gaaatcctca attatctata aggaaacctg aagcaactag 1440 ccttggtagg gcaacatcat ttaacgctgc aaatgtaaaa gtattttttg ataagctagg 1500 tgaagttatg gatatataca aattcagtgc ttcacaaata tggaacgttg acgaaactag 1560 tgtatctaca gttgtaaaac ctagtaaaat tgtggctgca aagggcaaga gaaatgttgg 1620 tgctatgaca tctggggaaa gaggcactaa tgtcacaatg gtaaccgctg tatctgcctc 1680 aggaaataca gtacctccaa tgtttgtgtt ccctcgcaaa aattacaaag attattttgt 1740 taataatggt ccaccggact gtattggggt aggtaatgga agtggctggg ttacagatat 1800 tgaattcaaa aattttatgc agcatttgat taggcatgtg aagccttcaa atgaatataa 1860 aattctgtta attttggaca atcactcttc tcacttgcat tttgaaacac taaacttagc 1920 aaaggagaat ggaatagtca tgttgtcttt cccaccacac tgcagccata agctgcagcc 1980 actagatgta tcagtgtttg gaccttttaa aaagtaccta tcagttgcac aagatgcatg 2040 gctgagaaac aatcctggaa aagccataac aatttatgat ataccaaaaa ttgtatctga 2100 ttcattgcca ttagctataa cgtgtactaa cattacaaag ggttttcaaa aaactggtgt 2160 atacccgtac aatgcaaaca tttttgctga tgatgatttt ttaccatcat ttgtcacaga 2220 tcgtatagaa ccagcaaatc ttgtttctga gctatcacac gtagctcatt ctgaagtatg 2280 tttaacatca tcttcaggtc aatctgtcaa aagtagtatt aataaagaga caatgttact 2340 tattcctgaa actcctaaag cattttctcc agaaactgtg cgaccttacc caaaagcagc 2400 accaagaaag atacatttga caagacgaag aaaaaggaaa gctgctattc taacagatac 2460 tccagaaaga attatattaa aacagcagca aactaagaaa cctaaaaaag ttgagaaaca 2520 aaaaaaatta tctacctgtc aattcctttc aaatattaat aaagagtgtg gtaataaaga 2580 gactcagtta catagtatat taaaaaaagt taagaaacaa acaaacaaaa aaactgagaa 2640 gaaagaaatt ttaaagaaac caaaaaaaag ttctgccagt caatcctttg cagatcatgg 2700 taataaggag actctgctgc ttagtatatt aaaaaaagtt aaaaaacaat caaacaaaaa 2760 aactaagatg acaaaaaaat taaattaaga caagtgcaga acacgattat gcttgtcttg 2820 tatgttggga aacttattct gaaagtctac cagaagaaaa ttggatccag tgtcaaggtt 2880 gcaatcaatg gtctcattca aaatgtgatt cacattctgg tcaaaattat atttgtatta 2940 actgcaaaat ggaaattgaa gactaacata aagtgtttag ttattatttt atcataatat 3000 taacaagttt tgttttaaaa tgttttaaaa aagttgtgtt tagttttaag attaaaaagt 3060 tttgtttttg tttttatttt agtatcaaaa ttattagttt tgatattcat atcttaaata 3120 atttgttaaa attgaaagta tctatttaat ttataaggaa gttttgtttt ggttttataa 3180 cagtgttaat agttgtcact gaaatgcatt tattttttta ttgctcaact taccccacgc 3240 aattgctcaa cttaccccgc tggggtaggt tgagcagcct aataaacttt agacaatatt 3300 ttctaaaaca aatcgaaaaa agtttttttg tctcaaccat tttttagcag atcaaattat 3360 ctttctaata gtgaaaaaat tatattaatc ggatgtaccg ttcaagagat ctgttagatt 3420 tagtaaaaat tgcccaacta gccccgccct accc 3454 // ID RTE_Ele5 repbase; DNA; INV; 3277 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An RTE clade non-LTR retrotransposon family from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-4_AAe; KW RTE-4B_AAe; RTE_Ele5. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3277 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3277 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >97% identity, and ~99% identical to the original CC sequence in [1]. The consensus is ~77% identical to RTE-4_AAe. XX FH Key Location/Qualifiers FT CDS 188..3238 FT /product="RTE_Ele5_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="HDVGSATLVRWPIEDSSPTKKGKSRANGLILRQQTRQ FT RIKDNDWKVGSWNVRTLNEPARVGLLARELRNVGVNVAAIQEIRWPRTGER FT EFRAVDPIANTSFKYHIYYSGGDRAERGVGFIVIGKQMKRIIRWKPVSDRI FT CVLRMRGKFFNYSLISIYAPTNDKPDDMKDEFYESLDKAYGECPKQDVKIV FT IGDANAQIGKEDFFRPVIGTESLHSVTNDNGLRLVTFAAARGMAISSTYFA FT RKNIRKHTWRHPSGDACSQIDHVLVDGRHFSDVIDVRTFRGPNIDSDHYLV FT VAKIRARLSSVTSSATHRTLRFNIQRLSTDGVAAQYRQQLDEKLGRVNVAG FT DVDSLWDPIHEAVTTTAREVIGTGQRRRRNDWFDEECQRVTDMKNVARSRM FT LVAGTRQNRERYRAARAEEKRIHRRKKRQHEESVVADAQESMNRNDMRRFY FT ATVNGARHNTVPVPAMCNDREGNLLTDKTAVAARWKEHFQQLLNGENGNVA FT RNRMNIDDDDQAVDPPTIGEVKKAISELKNCKAAGKDEIPVELLKHGSEQL FT YQSIHRVLLKVWEDEELPTGWLDGLIRPIYKKGHRLECANYRGITLLNSAY FT KILSRILFNRLRPLEESFVGEYQAGFREGRSTTDQMFSLRMILDKFREYNL FT QTHHLFIDFKAAYDSVKRNELWQIMSENGFPAKLIRLIRATLDGSKSSVRI FT ADEVSTSFVTLDGLKQGDALSNLLFNIALEGAIRRSGVQRNGTIITRSHML FT LGFADDIDLIGIDRRSVEEAFVPLKRETARIGLTINSTKTKYMVAGRDRVR FT HGGVSAEVVFDGDVFEVVEEFVYLGTLVTCDNDVSREVKRRVAAANRAFYG FT LRNQLRSRSLQTETKFALYKTLILPVALYGHEAWTLKESDRKALGVFERKV FT LRTILGGKLENGVWRRRMNHELYQVYKDANIINRVKYGRLQWAGHLVRMSE FT ERIAKIIFSREPGRGRRLRGRPRIRWLYAVEEDLATLNVRGNWRSFAQDRR FT RWSSTIRPAMA" XX SQ Sequence 3277 BP; 927 A; 722 C; 945 G; 683 T; 0 other; accgctagcg catgacggct caccatagta gcatgtccga aagaacttga aactattgag 60 aaccagagct acaggtccaa agccctgata aaggtggatg gttgtgatgg ctatgaccgt 120 ggtcatcatg taaagaacaa gcggacgatg ctgtatcgtg tcagcatcga ggaggcagcc 180 cctctagcac gatgtaggta gcgcaaccct ggttaggtgg cctatcgaag actcttcacc 240 aaccaagaaa ggcaaaagta gagcaaacgg attgatttta cggcaacaga cccggcaacg 300 aataaaggac aacgattgga aagttggatc ttggaacgtg agaactttga atgaacccgc 360 gcgtgttggg ctcctggctc gtgaactgcg gaatgtcggc gtgaacgtgg cagctatcca 420 ggaaatacgc tggccgagaa ccggagaacg cgaattccga gcggtggacc ccatcgccaa 480 cacttcattc aagtaccaca tctactacag cggtggcgac agagcagaac gtggagttgg 540 cttcatagtg attgggaagc agatgaagcg aattattcgg tggaaaccgg taagcgaccg 600 aatctgtgtg ttgagaatga ggggcaagtt cttcaactac agcctgatca gcatatatgc 660 gccaacgaac gataagcccg atgacatgaa ggatgagttc tatgagagcc tggataaggc 720 ctacggagag tgcccaaaac aagacgtcaa gatagtcatt ggcgacgcaa atgcgcagat 780 cgggaaagaa gatttcttcc gacccgttat tggaacggaa agccttcatt ccgttaccaa 840 tgataatggc ctgcggctag taaccttcgc tgctgctaga gggatggcaa tcagcagtac 900 ctacttcgca cgaaagaata tccgcaaaca cacctggcga cacccgagtg gcgatgcctg 960 ctcccaaata gaccacgtgc tggttgatgg gcgacatttc tcagatgtca ttgatgtcag 1020 gaccttccga ggccctaata tcgactcgga tcattatctc gttgtagcta aaattcgggc 1080 gcggttatcc agcgtcacga gttccgcaac acacagaacg ctacgcttca atatacaacg 1140 cttgtcgact gatggggtag ctgcgcagta ccgtcagcaa ctagacgaga agttgggaag 1200 agtcaacgtt gctggagacg tcgacagcct gtgggaccct atccacgaag ctgtgacaac 1260 aacggcgcgg gaagtgatcg gcactggtca acgacgaaga cggaacgact ggttcgatga 1320 agagtgccag agggtgacag acatgaagaa tgttgccaga agccgtatgc ttgtggccgg 1380 tacccgacag aacagagagc ggtacagggc agcgagagcc gaagaaaagc gaatccaccg 1440 cagaaagaaa aggcagcacg aagaaagtgt ggtagctgac gcacaagaaa gcatgaaccg 1500 aaatgatatg cggagatttt atgcaacggt caatggtgcg cggcacaata ccgtaccagt 1560 gcccgccatg tgcaatgacc gagaagggaa tttgctgacc gataaaacgg cggtggctgc 1620 caggtggaag gagcacttcc agcaattgtt gaacggtgag aatggaaatg tagcgaggaa 1680 caggatgaac atagatgatg acgatcaagc tgtggaccca ccgaccatag gagaggttaa 1740 aaaggctatc agcgagctga agaactgtaa ggctgctggg aaggacgaga tcccggtcga 1800 gcttctcaag cacggaagcg agcagcttta ccagtcgatc caccgagtac ttctgaaggt 1860 atgggaggac gaagaattgc ccaccggctg gttggatggc ctcatccgcc ctatctacaa 1920 gaaagggcac agactggagt gtgccaatta cagaggaatt accctgctga attcggcgta 1980 caaaattctg tcgcgcatcc tgtttaacag actgagaccg ctcgaggagt ccttcgtcgg 2040 cgaataccaa gctggttttc gtgagggccg ttcgacaacg gaccagatgt ttagcttgcg 2100 aatgatccta gacaaattcc gggagtataa cttgcagact caccatctgt ttattgattt 2160 caaggcggcg tacgactcag tgaaaagaaa tgagctttgg cagataatgt ctgaaaatgg 2220 ttttccggcg aaactaatta ggctgatacg tgctacgctg gatggttcga aatcaagtgt 2280 tcggatcgca gacgaggtgt caacctcgtt cgtgacgtta gacggattga agcagggaga 2340 cgcactttcg aatttactgt tcaacattgc acttgagggt gctattagga gatctggcgt 2400 gcaaagaaat ggcactatta tcacaaggtc gcacatgctc cttggctttg cggacgatat 2460 agacctgatt ggaatcgatc gcaggtcagt ggaagaggcc ttcgtgcctc tgaagaggga 2520 gacagcgagg ataggcctga ccatcaattc taccaagacg aagtacatgg ttgcaggtag 2580 agatagagtc aggcatggtg gtgtaagtgc tgaggtagtg tttgatgggg atgtgtttga 2640 agttgttgaa gaatttgttt accttggaac acttgtgaca tgtgacaacg acgtttcccg 2700 cgaagtgaaa agacgtgttg cggctgcgaa tagggccttt tacggactac gtaaccagct 2760 taggtcccgc agcttgcaaa ccgaaactaa attcgccctg tataaaacat tgattcttcc 2820 ggtggctctc tacgggcacg aagcgtggac gttgaaagag tcagaccgga aagctctcgg 2880 tgttttcgag cgtaaagtgc tgcggacaat actcggtggg aaactcgaaa atggtgtgtg 2940 gcgcagacgc atgaatcacg agttgtatca agtgtacaaa gatgcgaata ttatcaatcg 3000 tgtaaaatac ggcagacttc agtgggctgg tcacttagtg cgaatgtcgg aagaaagaat 3060 tgcgaaaata atattcagca gggaaccagg tagaggccgg cggcttcggg gaagaccacg 3120 aatacgctgg ctgtacgcag tggaagagga cctggcgacc ctaaacgttc ggggcaactg 3180 gagaagtttc gcccaagacc gacgaagatg gagctctaca atacgcccgg caatggcgtg 3240 atgctacgct gtagccatca aggtatcaag gtaggta 3277 // ID Helitron1_Dmoj repbase; DNA; INV; 2125 BP. XX AC . XX DT 28-AUG-2008 (Rel. 13.09, Created) DT 28-AUG-2008 (Rel. 13.09, Last updated, Version 1) XX DE A new family of helitrons in Drosophila mojavensis. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW non-autonomous; Drosophila mojavensis; Helitron1_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2125 RA Styles P.; RT "Helitron1_Dmoj: a new family of helitrons in Drosophila RT mojavensis."; RL Repbase Reports 8(9), 907-907 (2008). XX DR [1] (Consensus) XX CC Helitron1_Dmoj inserts preferentially into 5'-TT-3' target sites CC in T-rich regions and does not produce TSDs. The consensus was CC obtained by alignment of 67 elements. Elements are on average CC 97.5% identical to the consensus. Helitron1_Dmoj possesses a CC series of 8bp direct repeats at the 5' end, and does not encode CC any proteins. A Helitron1_Dmoj insertion is found within a CC recently-transposed Transib1_Dmoj element, therefore CC Helitron1_Dmoj is currently active in Drosophila mojavensis. XX SQ Sequence 2125 BP; 700 A; 392 C; 425 G; 608 T; 0 other; ttagactcat tcacgaaagt gaaatgagta tattggtttt gtgcaatgtc cgtcttgctg 60 tccgtccgtc tgtccgtctg tccgtctgtc cgtatgaacg caatctattt cggcgctatt 120 gaagctagag acttgaaatt tggaacacat actcgtgtat atctccgatt tataaagtgt 180 taaaattttc ccgatccgcc cactaggggc gtggcaaatc ggaaaatgca ttttccgata 240 taccagaaat catgtttgtt gaagagctag aaagctgaaa tttggtatgt aggtccgcgc 300 cgaccaccca agatcacaag ctatggcgga tatatctccg cccattaggg gcgtggcaaa 360 tcggacaaaa ctgtatctcc taaagttata aacttagaat gctgaaattt tgcacgtaag 420 ctaatgtttt aattctatga agatgggtga aaaaattata tatctacgcc cactaggggc 480 gtggcaaatc ggacctgaag atatagtgat ttttcgaatt ctgctatatt ttaggcacta 540 ttgatgctag agatctgaaa tttggtatat atacgcgggt tgatcaccca agtccaacga 600 gctatggtag atatatgtcc gcctataggg ggcgtggcaa accgcaaaag tttatatctc 660 ctacagttta agagctagaa gcttcaaatt tggatagtta agttactggt ggcctcctct 720 atcgatgccc caaaaataat ttttaagcct ctttaggggc gtggcatatc gaaaaatacc 780 ctgtaatttt cacatttttt gagcgtaaga attcaaattt tgcatttagg taaccggttg 840 ctcgttctaa gaactggcta aaattgagga gtcgacggcc actaggggcg tggcaatcgg 900 catttacagt gcccgaattg cgacatttat atgcgtcagc catctattta tgtatatgta 960 cacataagta ggcttaatag gcattgatag cacggaagga aacattttta taaaattaat 1020 aaaatataaa aaaagctaaa ataacaaaaa tttgcaattt acaaaaaagt gttttcagcg 1080 ggatttgaac tcgcacccaa cccaaacagc atgtttgtca aaatagctaa aataaagatc 1140 gtcggcgctt cagcctacag caccaccgcg tcgtttcgat caatagtagt taaaacgaaa 1200 tcacattatt gttatgacgc cacatagcaa agaaaagcca gtgtagcggc tggaaattca 1260 aaaacaatgc aaagacaaat caatcgcaac gggattctta gaatatgcta gagcacgcat 1320 ttatgtgtac atacatacac acatacaaat gagcgcgtgt ataaggctgg aatgcttgat 1380 gctatgcgtg caggcttaca tatataatta tatatacata tgtatgtatt tgcgtatatg 1440 tatttgaggg ctaatttaag aagaaacaaa tttaaactta taaaattatt cgactaagaa 1500 aatgaaaagt attaagtgcc attcagtcaa gctcgcacct tcccaataaa atatctattt 1560 tataaaaaac gaaatatgta tgaaaaaaaa aaaaaaaata ataataacat agtagaggtt 1620 cgaacctccc ctactctaac attacttgcg tcgcccttag cacccgcgcc actgctttgc 1680 tcgagcaatc agttgctaaa atagaatagc ttaataagag agcgtcatca gacaaaacta 1740 cagtaacggc tagaaagtcc agaaatcaat cgaaattcta agaatgtgag aaaaggcgcg 1800 cttctgtctg tacgcataga atagacgatg agtgtgtgca caagcatagg aaaatcgcgc 1860 tgatctgctg ttagcttgaa gtgaagccgg tatccaatgt atggatataa atacatatat 1920 aataaaaatc aaaattgtat aaattatata tgtatgtaat ttgaatgcaa tagctatcga 1980 tgattttcca attttaatgg caaaatgccg tgttttttgc tgcgcactag acttgaaaat 2040 tgcactatat actctggttg acagcgtttg ggctgtgaat gagtcttctg caatcggtat 2100 cgcccgactg caacgttctt acttg 2125 // ID BEL-108_AA-I repbase; DNA; INV; 5898 BP. XX AC AAGE02018277; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-108_AA_; KW BEL-108_AA-LTR; BEL-108_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5898 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018277; Positions 71062 76959. XX CC Positions [5054-5515] - Integrase core CC 'CCACC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS join(377..2383,2387..5050) FT /product="BEL-108_AA-I_1p" FT /translation="MLPWLRSETKTPTTVAHQPTNIESENTPRTPKTVVNK FT FTNILPEPSDIVPSPTPTPIAEQKEAKKKKQEMKRMEEQLKGLVNQRVAVH FT GKLARVRAALWDSDDQPNPNTKNIHFLQLHQKTVESCYHECNAIQNQIYVL FT PLSEDRLAEHTESYVDFEALYNNVSIQLGMLMQAASIAEMSHAHAQGPPNS FT AMPPVLPAQPYLPPLQVPLPTFDGSAEKWYSFKAMFTTIMDRYQQEEPALK FT LFHLQKSLVGKAAGVIDQDLVNNNDYDAVWRVLAQRFEDRRVIVDKHVENL FT FNLPKVGQENAENLRRMIDTCTKNVEALKHLNLPLNGLGEQMVINLIAGKM FT DKKLRVAWEGRQKKNVLSTYDAMLEFLQEQCRIYEKLDTTMKPLTESAKYK FT TVAKGHTLVSSETKNEKIEPKCPMCKANHELWKCDAFKNKSVGEKYELLKK FT YGSCFNCLGRGHRTAACSSKQSCRECGKRHHTTLHVEESSRKASSSGITTK FT SSEPSSTLVATPDVNQTTNTTAGSSTTLCASAENSEKQTLLSTAVVLVDGL FT SSAAYPCRVLLDSASQMNFITERFANLLSMRTVPADVTVSGLNGNKTRISR FT SLRTTIRSRHTDFATELDLLVTPRITGDLPVKSFDVSEWPIPIDKLLADPT FT FNQRGRVDMLIGGILLESSSWPTPAWAKPSGSKEYEAGVDRWRSNRLRCGS FT SRAHFCQTAEDEPLIELLKSFYKVEACDGIRLSHTADDEMCLDHFRRTHQR FT TEEGRYVVQHPFNERKQELGDSRQMALKRFLNLERKLDNQPELKEQYSQFI FT RDYEQLGHMRVVQEQPNEEPGSVYYLPHNCVLRPSSTTTKLRVVFDGSARS FT STGVSINDVLMTGPTIQNDLIAILLRFRGFKYVFTLDIPKMFRQVGVHQQD FT TKYQRIFWRYDKNDPLTVRELQTVTYGLGPSPFQATMALKQAAADHKEEFP FT EAAKVIEKGTYMDDILTGADTLGGACQLQREVTGLLAKGCFGAHKWCANHA FT DIVQAVPEELRGNSFMITDDNSNAVVKTLGVTWNPVEDWFSVSVPDYVDSE FT GITRRKLLSQLAKIFDPLGFFGPVITTAKLILREVAELEIDWDDPVPSDVG FT SKWRSFRNEMIVLKEVRMPRWISWNGLLKLELHGFADASDLAYGACLYARA FT VFPDGSFQMRLISSKSHILPKKKGRKKLITTPRAELLAALLLAKLVTKLLD FT ATELKFESTVVWSDSKIVLGWIRKPPHLLHTFVSNRVSEIQTLTPNCSWKY FT IPTDENPADLISRGEQPRKLAESKMWWTGPPSFNCAVVEVVDEPMIPDEEL FT PEMRSGAGVVLAITASIGRMPIFDRVSNFMKIVRYSAYFVRFAKYIMSRKR FT VVKKGPLSADEIREATLLVVRLIQRETFQQEISALMDGEDVKHRLNGLRAF FT LDPKDNILRVGGRIKRAFIPYNSRHQMLLPAKHPTTEALVRYLHLENLHIG FT QKGLLAVVRQRYWPLNVKSTIRKVIRDCIPCFKANPLKTTQLMGDLSSYRI FT QPAPTFSNTGVDYAGPFLIKSSTTARKPQLTKAYVSLFVCL" XX SQ Sequence 5898 BP; 1623 A; 1404 C; 1542 G; 1329 T; 0 other; aagtggtcct tctagaaccg gatacgggaa aatcagtaac gatcgacgtg gttctcgtga 60 aagttccgcg ataccgaaag aaaattcgag catcggttcg tgctgctgcc ggacgggtgc 120 agctttgctc gtggcggtcg ctacaagtag aagaaaagtg aaggaatact cttcggagga 180 ttcgtaatct ggtccgttcc aagacggata cgacgtcacg gcgttccgaa gcggaaaaac 240 gaaaaaataa gtgctaaaag tgaaagttcc atcggctagg ctcaagcggt gtgctttttt 300 tgcaaatggc ggatggcatc gaacaattat cggtgcgtga gtgcgattga aaaattgtgc 360 gcgatcgaga cggaagatgt taccgtggct tagatcggaa acgaaaacgc caacgacggt 420 cgcgcatcag ccaacgaaca ttgaatcgga aaatacaccg agaactccga agaccgtagt 480 gaataaattc acgaacattt tgcctgagcc atcggacatt gtgccgtcgc ctacgcctac 540 gccgattgct gaacaaaaag aagcaaagaa gaagaaacaa gaaatgaaaa ggatggagga 600 gcagctcaag ggattggtca atcagagagt ggcggttcac gggaagctgg cccgtgttag 660 agcagctttg tgggacagtg acgatcagcc gaatccgaac actaaaaaca ttcatttcct 720 ccaactccat cagaaaacgg tggagagctg ttatcacgag tgcaatgcca tccagaacca 780 gatctacgtg ctgccgctct ctgaagaccg tctagctgag cacaccgaga gctacgttga 840 tttcgaggcg ttgtacaaca acgtttccat ccaattgggc atgctgatgc aggcggcgtc 900 gatagccgag atgtcacatg ctcatgctca aggaccaccc aattctgcca tgccacctgt 960 tttgccagca cagccctacc ttccaccgct ccaagtgcct ctaccgacct tcgatgggtc 1020 agcagagaag tggtattcat tcaaggcgat gtttaccacg ataatggatc gataccagca 1080 ggaggaaccg gcgctaaaac tgtttcacct gcagaaaagc ctcgtaggga aagcagctgg 1140 cgtaatcgac caagatttgg tcaataataa tgactacgac gccgtatgga gggtgctcgc 1200 ccagcggttt gaggacagac gagtcatagt cgacaaacac gtcgaaaatc tgttcaactt 1260 accaaaggtt ggccaggaga acgcagagaa tttgcggaga atgattgaca cctgtaccaa 1320 gaatgtggag gccttaaagc acctgaatct gccattgaat ggactcggag agcagatggt 1380 aatcaatctc attgccggaa aaatggataa aaagcttcga gtggcttggg aaggtcggca 1440 aaagaagaat gtgctttcaa catatgatgc gatgctggag ttcctgcaag agcagtgtcg 1500 tatctacgag aagctcgaca caaccatgaa gccgttgacg gagagtgcta aatacaaaac 1560 ggtggcaaaa ggccatacgt tagtgtcaag tgaaactaaa aatgagaaaa tcgaacctaa 1620 gtgtccaatg tgcaaggcaa accatgaact ctggaagtgt gacgccttta aaaacaaaag 1680 tgtcggtgag aagtacgaat tgctaaaaaa gtatggatcc tgtttcaact gcttgggaag 1740 aggccaccga accgctgcgt gttcttcgaa gcagtcctgt cgcgagtgtg gcaagcgaca 1800 ccacactacg ctccacgtcg aggagtcatc cagaaaggcc tctagcagtg gcatcactac 1860 gaagtcatca gaaccttcat cgactctagt tgcgacacca gatgtgaacc agacgaccaa 1920 tactacagca gggtcatcga caaccttgtg tgcaagtgct gagaattccg agaagcaaac 1980 tttgctttcg actgccgttg tcttggtcga cggcttgagt agcgctgcat atccatgccg 2040 agtgcttcta gactccgctt cccagatgaa tttcatcacg gagcgattcg cgaaccttct 2100 ctcaatgaga acggtacctg ctgacgtcac agttagcggt ctgaacggca ataaaactcg 2160 gattagccgt tcgttacgta cgacaatcag atctcgtcat acagatttcg caaccgagct 2220 agatctgctg gtgacgcctc gaataactgg tgatctaccg gttaaatcat ttgacgtttc 2280 ggagtggccc atcccaatcg acaagttgtt ggccgacccg acgttcaacc aacgaggtcg 2340 tgtcgatatg cttatcggcg gaattctttt ggaatcttct agttgatggc caacaccagc 2400 ttgggccaaa ccttccggct ctaaagaata cgaagctggg gtggatcgct ggaggagtaa 2460 tcgcttgcga tgcggcagta gtcgcgcgca cttctgccag accgctgagg acgaaccact 2520 catcgaactg cttaagagct tctacaaggt ggaagcatgc gacgggattc gcctttccca 2580 tacggcggat gacgagatgt gcctggatca tttccggcga acacaccaac gaacagaaga 2640 aggaagatac gttgtgcaac accccttcaa cgaacgcaag caagaactgg gagactcacg 2700 tcagatggcg ttgaagcgct ttctgaatct ggaaaggaag ctggacaatc aaccggaatt 2760 aaaggagcaa tattcgcagt tcatccgaga ctacgaacag ctcggacaca tgcgagtagt 2820 acaggaacag ccaaacgagg aacctggatc ggtctactat ttacctcaca actgcgttct 2880 gcgccccagc agtacgacga ccaagctgag agtcgtcttt gatggatccg ccagaagttc 2940 cacaggcgta tcgattaacg atgtccttat gactggacct actatccaga acgatctaat 3000 tgcgatcctg ctccgtttcc gagggtttaa gtacgtcttt acgctggata ttccgaagat 3060 gtttcgccag gttggggttc atcaacagga caccaagtac caacgcatct tctggaggta 3120 cgataaaaac gatccgctga cagtccgaga gttgcagacg gttacatacg gcctgggacc 3180 ttcgccgttc caagcaacca tggcactcaa gcaagctgct gctgatcaca aggaagagtt 3240 cccggaagcc gcaaaagtca tcgaaaaggg aacatatatg gacgacatcc tgacgggagc 3300 tgatactctt ggtggagcat gccagctcca acgagaggta acaggtttgc tagcaaaggg 3360 ctgctttggc gctcataaat ggtgtgccaa ccatgctgac atcgtgcaag ctgttcccga 3420 agaacttcga ggaaactctt tcatgataac cgatgacaat tcgaatgcag tcgtgaagac 3480 tttgggcgtc acttggaacc cggttgagga ttggttttct gtgtcggttc cggactatgt 3540 cgactccgag ggaataactc gcagaaagct tctgagtcaa ctagccaaaa tatttgaccc 3600 tctgggattc tttggaccag tgattacaac cgcgaagctg attttgcgtg aagttgctga 3660 gttggagatt gattgggacg acccagttcc atccgatgtt ggctctaagt ggcggagttt 3720 ccgaaacgaa atgatagtcc taaaggaagt gcggatgcca cgatggattt catggaacgg 3780 tctgctcaag cttgaactgc atggatttgc tgacgcctcc gacttggcgt acggggcatg 3840 tctttatgct cgggccgtct ttcctgatgg ttcattccag atgcgactga tttccagcaa 3900 gagtcacatt ctaccgaaga agaaagggag aaagaaactc atcaccactc ctcgcgctga 3960 actgctagcg gctttattgc tcgccaagct ggtaacgaag ctgctggatg ctacagaact 4020 aaaatttgaa tcgacagttg tttggagtga ctcgaaaatc gtactggggt ggattcgaaa 4080 acctcctcac ctgctacata cttttgtttc gaatcgggta agtgaaatcc aaacactaac 4140 tccgaattgc agctggaaat atattcctac ggatgagaac ccggcggatc taatctcccg 4200 gggtgaacag ccgaggaaac ttgccgaatc caagatgtgg tggactggtc caccttcatt 4260 caattgcgct gtcgtcgagg tagtggatga acctatgatt ccggatgaag aactgccgga 4320 aatgcgatct ggtgctggcg tagtactagc tataactgct tcaattggac gaatgccgat 4380 ttttgacagg gtgagcaatt ttatgaagat tgtgcgctac tcggcctact tcgttcgctt 4440 tgccaaatac atcatgtcta gaaaaagggt ggtgaagaaa ggtccgctat ctgccgacga 4500 aattcgtgaa gcgacgctcc tggtagtacg gctgattcaa cgcgaaacgt tccaacagga 4560 aattagcgct ctgatggacg gcgaggacgt taagcatcgg ctcaacggat tacgagcgtt 4620 cttggatccg aaagataata ttctaagagt tggagggcgc atcaagcgag ccttcatacc 4680 gtataatagt cgtcatcaga tgctgctgcc agcgaagcac ccaactactg aggcactcgt 4740 ccggtatttg catttggaaa acctgcacat cgggcaaaaa ggattactcg cagtcgtacg 4800 ccaacgatac tggccattga atgtgaaaag tacaattcgt aaggtaattc gagactgcat 4860 accgtgcttc aaagccaacc ccttaaagac gacacagttg atgggcgatc tgtcgtcgta 4920 tcgcattcag ccagctccta ccttctcgaa caccggtgtg gactatgctg gtcctttttt 4980 gattaaatct tcgactacag ctcgcaaacc ccaacttacg aaggcctacg ttagcttgtt 5040 cgtgtgtctg tagacccgtg ccatacatct ggaactagtt tccgatctga cgacggacgc 5100 gtttctcgcg gctttgaggc gtttcgtcag caggcgtggg tgtccgaaat cgatgcactc 5160 tgacaatgca acaaactttg ttggggctaa atcagagttg catgaactat ggctgatgtt 5220 ccagaaccaa cgtgctaccg caagaatcac aacctactgc accaacaagg gaatcgactg 5280 gaccttcata ccgccgcgaa gtccacactt tggcggtata tgggaggccg gtgtgaaaca 5340 ggtaaagcat cacctcaagc ggattgtcgg cgaccggaag ctatcctacg aagaactgta 5400 taccacgttg acgcaaattg aagctgtgct aaactctagg cccttggcgc caagctcgga 5460 tgagcctagt gactacacag ctattacgcc tgctcacttc ctgatcgctc gagagatgca 5520 agccgtaccc gaaccgagct acttgcatat gaaagagaac agactatcgc gatggcagtt 5580 ggtgcagacc atgctgcagc gcttctggaa aaggtggaca gcggaatatc tgccagaact 5640 tcagaataga tccaagtggt tgaaaacgaa ggaaatcaaa gagggttcac tggtgttgct 5700 tgtggaccag aacatgccgc cgctacaatg gccactagga agaatcctga aaacgcaccc 5760 tggaagtgac ggcgttgtgc gcgtcgtaac ggtgagaaca gcaaatggag cagagttcaa 5820 gcgtgctgtg acggaggtct gtttgctgcc gtttgatcag gaagagtgtt gaaatccaga 5880 tttcaacgcg ggggatga 5898 // ID Transib3_DP repbase; DNA; INV; 2783 BP. XX AC . XX DT 21-MAR-2005 (Rel. 10.03, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Transib3_DP is a family of autonomous DNA transposons - a DE consensus sequence. XX KW Transib; DNA transposon; Transposable Element; transposase; KW Transib3_DP. XX NM Transib3_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-2783 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC Transib3_DP is a family of autonomous DNA transposons that CC belongs to the Transib superfamily. CC Transib3_DP elements are characterized by the CGNCG target site CC duplications and 15-bp terminal inverted repeats (1 mismatch). CC The consensus sequence encodes the 680-aa Transib3_DPp CC transposase (pos. 432-2474). XX FH Key Location/Qualifiers FT CDS 432..2471 FT /product="Transib3_DPp" FT /translation="MAMKNTKLFEIFRNSSTSNEFVETVLNELNLVSDSPE FT QHTSIRKKIMCIHFSIKNKWLTSHRKVESFNSKYASWLIGNSLLYDGPSTS FT TGRAIGRPTKTLDKCSDYTRKRKLVDATCHIDTDHLSEALSIKYTNNKENK FT KAHITKAVSKASPVRLDRIKKSILKPPKLQPKSYTDEEALSLFSDLGMSKE FT KYSILRYSLQQHQVYVLPEYKHITDAKLMARPEENLVTVESAVVKLQDLVQ FT HTTKRLIKSLPHHQIESLPQNLTLISKWGCDGSSGQSEYKQAISTDDKTIT FT DSNMLMASLVPLWLKSDTSEYWRNPRPSSAHYCRPILFKYAKESSELIKST FT VSDIKNQISNIRPSVHKVNEEKSISVNFEFFLTTDGKVLNELTNTKSTLKC FT AFCKKKTQKEFKNLDDSIIEEENLEYDISPLHARIRCMEFILKLAYTIPRE FT GESFDNSTPANEIQKTRKKMIQEKILDKLSLKVDCPKQGFGNTNDGNSSLI FT EKNEITAEITGVDIDIIKRLGVILNILNCKEKINTRKFGEYASKTAELLLE FT KYPEKKLTPTLHKILAHGQILIEHQSLPIGEMSEEAQETRNKDYKKYRNMN FT TCKTSRIKQNQDLFNMLAASSDPYISSIRYVRSQIKSASSYSSEMISLLDI FT PPTKQEVEDYFTETKKFNQLSKYTKNCFFLF" XX SQ Sequence 2783 BP; 1033 A; 481 C; 494 G; 773 T; 2 other; cacagtggcg atttcgacga aaagaggtgg caaaagtcgg agattttttt tagcgctttg 60 gatacaatga atacacgcat ctaatagaga attttccaga gctttttggn atctataccg 120 attaaactgt cacaacctcg cacgaaaaac ggtcaaaaac ggctatcaca gctcttaaat 180 cccatacaaa atcatatttt gaccgttttt cgtgcgaggt tgtggcaggt aaatcggtat 240 agaagccaaa cggctttgta acctgctttt tattatgtca aagggtggta cctacatatg 300 tgtatatatt gaattcattt ttagcgcagc tctatagtct tattttgact tttaaaggat 360 tcagttgcgt ataagcaaac atcagtgaat ataattttta tatataattt aagaattgaa 420 ttttcgttaa aatggctatg aaaaatacca agttgttcga gatatttaga aattcaagca 480 cctcaaacga atttgtggaa actgttctta atgaattaaa tttagtatcg gatagcccag 540 agcagcatac aagtataaga aagaaaatta tgtgtataca ttttagtatt aaaaataagt 600 ggttgacctc acaccgtaag gtagaatcgt ttaatagcaa atacgcaagc tggctaattg 660 gaaacagctt gctatatgat ggaccctcca cgtccacggg acgagcaata ggaagaccaa 720 caaaaaccct ggataagtgt tcagattata ctagaaagcg aaaacttgta gatgcaacat 780 gtcatataga tacggatcat ttatcggagg cactgtcaat taaatatacg aacaacaagg 840 aaaacaaaaa ggcgcatata acaaaagctg tatctaaggc aagcccggtg agacttgaca 900 ggataaagaa aagcatcctc aagccaccaa aattacaacc gaagtcgtat accgatgaag 960 aggccttgtc gctgttttca gacttaggaa tgtctaaaga gaagtattca atacttcggt 1020 attccttaca gcagcatcaa gtctatgttc ttcctgaata caaacatatt acagatgcca 1080 aattgatggc aaggccggaa gaaaacttgg tgacggttga atcagcagta gtaaaacttc 1140 aagacctggt gcaacatact acgaaacggc tcattaaaag cttacctcac catcaaattg 1200 agagcctgcc acaaaatctc acactgatca gcaaatgggg ctgtgatgga tcttctgggc 1260 aaagtgaata taagcaagca atatctacag acgataaaac tattacagat tctaacatgc 1320 ttatggcttc tcttgtacca ctgtggctta agtcggatac atcggaatat tggagaaatc 1380 ctcgaccttc gtctgctcat tattgccgtc ctatattatt caaatacgcc aaggaatctt 1440 cggaactaat aaaatccact gtaagtgata ttaagaatca gatttcaaat ataagaccgt 1500 ctgtacataa agtcaacgaa gaaaaatcca ttagtgtcaa ttttgaattt ttccttacta 1560 ctgatggaaa agttcttaac gaacttacta atacaaaatc gaccttgaaa tgcgcatttt 1620 gcaaaaaaaa aacacagaag gaatttaaaa atttggatga ttccataatt gaagaggaaa 1680 atttagaata tgacatttcc ccattacatg caagaattag atgtatggag tttattttaa 1740 agcttgcata taccattccg cgagaggggg aatcttttga caacagcacg ccagccaacg 1800 aaatacaaaa gactaggaaa aaaatgattc aagaaaaaat cctagacaaa ctatcactga 1860 aggtggactg ccccaaacaa ggatttggaa acacaaatga tggaaactct tcactcattg 1920 agaaaaacga aattactgca gaaataactg gggtggatat tgacattatc aaaagattgg 1980 gtgtcatttt aaatatttta aattgcaaag aaaaaataaa tacccgcaag tttggtgaat 2040 atgcctcgaa gactgcagag ctgttactgg aaaaatatcc tgaaaagaag ctaactccca 2100 cgcttcataa aattctggca catggtcaaa ttttaattga acatcagagc ctaccaattg 2160 gagagatgtc agaggaagcc caggaaacga gaaacaaaga ttacaaaaaa tacaggaata 2220 tgaatacgtg caagacatca cgaattaagc aaaaccaaga ccttttcaac atgttggcag 2280 cctcttcaga tccatatatt tcatccatta gatatgtgag atcacaaatt aaatcggcaa 2340 gcagttattc ttcagaaatg ataagtttat tagatattcc tcccactaaa caggaagtgg 2400 aggattattt tacagaaaca aaaaaattca accaactttc naaatataca aaaaattgtt 2460 ttttcttatt ttaaattttt ttaaattagg ttaagttgaa aattaaatca aaaataatat 2520 tttatagtaa aaaacagcat taaaagtact ttatcgataa aaaaaaaaaa aatagagcat 2580 tttacatagg aaaatatgca ccggcaacaa agttcaactt taacccatta ttgtgaagct 2640 tttagacact gtagcacaat ttacttaagc agattaaatt ttgaaagctt tttctttcaa 2700 atgcagtttg tttcatgaag atatctttat ttgttcagaa gttattaatt tagcaagctg 2760 aaacggcgga attccccact gtg 2783 // ID Gypsy20-I_Dya repbase; DNA; INV; 6215 BP. XX AC . XX DT 26-NOV-2010 (Rel. 15.11, Created) DT 26-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy20_Dya; KW Gypsy20-LTR_Dya; Gypsy20-I_Dya. XX NM L1A_Mim; LTR6_MD; LTR86_MD. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-6215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1117-1117 (2009). XX RN [2] RP 1-6215 RA Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (26-NOV-2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 488..1588 FT /product="Gypsy20-I_Dya_1p" FT /translation="MELPAENMTQLLRQIRQVPKFSGDPSNLSSFIRRIEY FT LLNLYPSNDARQKAVIFGAIELQVVGDAEKVTQFGMHNEWTTLRDALITEF FT KTQTPLEDLLGRLYRTPFKGNLRLFCEQLEDKSSVIINKLALENDRNNMVV FT YTQAMATTIKNTIQRSLPDRLFMTLARYDISTVQKLRQTAQQEGLYEDPIS FT KTFDSQPKLNSNSPNTYKHTNINQIKNTHPNQRFIRPFIPLNPTQTQNTNV FT NNQTPFRQIPPTRQQQLNQYRNLYNGFRTDLLQDRQNDTRNYYQPPQQPRV FT PNKRFRESSEQSRMQTNENFHQHELDYDNETEHYINNDEQFHEDQLQNTQT FT ELFTDQQYQQAENFPIPAWDPTNT" FT CDS 4899..6182 FT /product="Gypsy20-I_Dya_3p" FT /translation="MHHCLTINITEVEETFNSIIKQCQEFKNVTQIKYLTE FT KMEREINGIRISKRNKRGLANFVGSTLRYLFGTLDEDDRQHIEQQISTLSQ FT DTVQVSTLNHVIDSLNNGIEIINNQSNSLKKEQKLNLLIFNIEHFTEYIED FT IEMGSQLTRLGIFNPKLLKHENIGNINYKNLINIKTSAWYNVPTNEIFLMS FT HIPMNSIERPTFIIAPYPDNNGQIIKENIEGKFYSHNNYVLNTLTKNIVKD FT HCIANIIKHETPTCTFQKYRKQSYIQYIKPNILITWNMTREKLYHNCNGQE FT IYIENNKIIKISNCTAELKQITISNSIQQYTNVIFTEHNVTKIEPMSHIEI FT KEMILLNNKSNTIYRNILIIFISVLVLIYIFYFYMKCKTAPHKIIISYLKP FT SKTTNKNIEKETETKTEIAEIANPVPHLYPEIIA" FT CDS 1597..4773 FT /product="Gypsy20-I_Dya_2p" FT /translation="MFNNHRYRCIVDTGSSINLMSSNFFNLQVKDKQITVR FT SMTGYSKLNKYVDLPANGLFQQTQKFYIHSFSPYYDILLGRKILMENNGLI FT DYSRKETIINERTFIHKELDPCDENSSENFNALLQENNFTSEINYAMDNNI FT DSEHTYRLEHLNSEEKYRLTNLLKEFNDIQYSKNENLTFTSIIRHDIKTSH FT ENPVYKRPYSYPFSYEAEVNKQIEEMLKQGIIRESDSPYCSPLWIVPKKMD FT ASGKPKFRLVVDYRNLNEITINDKFPIPNMDEILGKLGKCQYFTTIDLAKG FT FHQIEMDPKSIPKTAFSTKHGHYEYTRMPFGLKNAPATFQRCMNYVLKDLI FT NKHCLVYLDDVIIYSTSLNEHLDSLRKVFEKLKEANLKLQLDKCEFLKKQT FT NFLGHIISPEGIRPNKEKVKAIEKFPLPKTPKEIKSFLGLCGYYRKFIPDF FT AKIAKPLTIFLKKGAKIDTRNEEYNKSFEKLKTLIMNDPILICPDFTKTFQ FT VTTDASNFAIGAVLSQDNKPVSFASRTLNDHEQNYSTIEKELLAIVWATKY FT FRPYLYGVPFEILSDHKPLVWLNNIKEPNIKLQRWRIRLSEYNFKIKYLEG FT KLNHVADALSRVKIEENMVGEEENISTAATIHSSSESNETYIEITEKPINY FT FARQIEFIKDSVDQVETTKYFSKIKLTIKYTDMTLTKAKDILIKYFLNNSS FT VIYLENDKDFLVFQKAYRETINPHSNVKILKSIIMLKDIKSYPEFKEFIIT FT QHSKLLHPGIEKMIRLFKETHYFPDYNKLIQNIINDCEVCNLAKTEHRPTK FT LVFEITPETNNPREIYVIDFYAIDNEQYLSCIDVYSKFASLIKTNSRDWLE FT AKRALTRIFNDMGKPQKIKADKDSAFISTSLKTWLNNEDIQIDITTSKTGI FT ADIERLHKTINEKIRIINTENNRENKETRMETILYIYNHKTKHNTTGQIPA FT NILLYADTPTYDTQMIKEMKIKNLNKKRQDFEIDTKFRQAPLTRAKSKNPF FT RKTGRIEQLDEKHYNENNRGQNVIHYKSKFKKKKKVNDSKYLQQTDTSQSS FT DS" XX SQ Sequence 6215 BP; 2546 A; 1228 C; 890 G; 1551 T; 0 other; aaaaagttta ttggcgcagt cggtaggata ctgaaaaaag tcctagtaaa ctcgttatcc 60 tttgacaatc tagtgaaaag aggaaacttg tacgaccaag taaccttttc tgattgtacc 120 cgcgaattgg ccgcaacaaa gtgatttcgg ttcgtttaaa gtgactaaaa tcggtatccg 180 cacaagaacc tgtgcgatcg gagttaattt aaagacctaa ttttgtggac cgcttcgtga 240 agtgagacct ccactaaggt gtaatacaat taaaaggcgc acagtgcagt tcactaagac 300 ataagtctcg acatactgta gcagtcagtg aacaaaaaca gtagcccact aaaatcaaaa 360 gtgcgaccgc tactattgta taaaacaata aaaagttaaa acagtgcagt tcacttaagg 420 cataagtctc gacatactgc agcagtctgt gaacaagaac agaaactaac ttataaataa 480 aactaaaatg gaattaccag cggaaaatat gactcaatta cttcggcaaa tacgccaagt 540 accgaaattc tctggagacc catcgaatct cagctcgttc atcagacgta tcgaatacct 600 gttgaactta tacccttcaa acgacgctcg acagaaggcg gttatatttg gagccatcga 660 acttcaagtt gtcggagatg cggaaaaggt tacccaattt ggaatgcaca acgaatggac 720 aactttaaga gacgccctaa tcacggaatt caagactcag acgcccctgg aggatctact 780 aggaagactg tatcggacac cttttaaggg taacttacgt ctattctgtg aacaattaga 840 agataagtcc agtgtaatta taaataagtt agcattagag aatgaccgaa ataatatggt 900 ggtttataca caagcaatgg caaccacaat aaaaaataca attcaacgct cactccccga 960 taggttgttc atgaccttag cgaggtatga catttcaacc gttcaaaaac taaggcaaac 1020 cgcccaacaa gaaggtctat acgaagaccc aatttcaaaa acattcgata gtcagccaaa 1080 acttaactca aattcaccta acacctataa acatacaaac attaaccaaa tcaaaaacac 1140 tcaccccaac caacgattca ttcgcccctt tattccatta aatcccacac agacccaaaa 1200 cactaacgta aacaatcaaa ccccttttag acaaattcct ccgaccagac aacagcaact 1260 taatcagtac agaaatttat acaacggatt cagaacagac ttacttcagg accgtcaaaa 1320 cgacacccga aactactatc agccccccca acaaccaaga gtgccaaata agcgttttag 1380 agaaagcagc gaacaatcgc gtatgcagac aaatgaaaat tttcaccagc acgaattaga 1440 ttacgataac gaaactgaac actacattaa taacgacgaa cagttccacg aagatcaatt 1500 acagaatact cagacagaat tgtttacaga tcagcaatat caacaagctg aaaattttcc 1560 aataccagcc tgggatccca ccaatacata gagatcatgt ttaataacca cagatataga 1620 tgcatagttg atactggctc ttcgataaac ttgatgagct caaacttttt caatcttcaa 1680 gttaaagata agcaaataac tgtccgaagt atgacaggct attctaaatt aaataaatat 1740 gtagaccttc cagccaacgg tctttttcaa caaacgcaaa aattctatat tcattcattt 1800 tcaccttatt atgatattct gttaggaaga aaaatactaa tggagaataa cggtttaata 1860 gactactctc ggaaagaaac aataattaac gaaagaacat tcattcacaa agaattagat 1920 ccatgtgatg agaactcttc ggaaaacttt aatgcactct tacaagaaaa caactttaca 1980 tcagaaataa actatgccat ggacaacaac atagactcag aacacacata cagactagaa 2040 catttaaata gtgaggaaaa atacagactt acaaatttat taaaagagtt caatgacatt 2100 caatattcca aaaatgaaaa ccttacattc acaagtataa tacgtcacga tataaaaacc 2160 tctcatgaaa acccagttta caaaagacca tactcgtacc cgttttccta cgaagcagaa 2220 gtaaataagc aaattgagga aatgttaaaa caaggcatta tacgcgaaag cgactcccct 2280 tattgcagtc cattatggat tgtccccaaa aagatggatg cctctggcaa accgaaattc 2340 aggttggtcg tggactatcg aaatctaaat gaaataacaa taaatgataa atttccgatt 2400 cctaacatgg acgaaatact aggcaaatta ggaaaatgcc agtattttac tacaattgac 2460 ttagcaaaag gctttcatca aatcgaaatg gaccctaagt ctataccaaa aacggcgttt 2520 tcgaccaaac acggtcacta tgagtacact cgcatgccat ttggcttaaa aaatgctcct 2580 gcaaccttcc aacggtgcat gaattacgtc ctaaaagacc tcataaataa gcattgcctt 2640 gtttatttag atgatgtgat catctattca acatctctga acgaacacct agactcactg 2700 aggaaggttt ttgagaaact taaagaagcc aatcttaaac tccaattgga caaatgcgaa 2760 tttttaaaaa agcaaacaaa tttcttaggt catattataa gtccagaagg catacgacct 2820 aataaagaga aagttaaggc aatcgaaaaa tttcctctcc caaagacacc aaaagaaatc 2880 aaatctttct taggcttatg cggatactac cgtaagttca ttccagactt tgcaaagatt 2940 gcaaaacctc tgacaatatt tctaaaaaaa ggagctaaaa tagacacaag aaacgaagaa 3000 tacaataaat catttgaaaa attgaaaaca ctcataatga acgacccaat tttgatttgc 3060 cctgatttca caaaaacatt tcaagtcact actgacgcta gcaatttcgc cataggagcc 3120 gtgttatcac aagacaataa gccagtaagc tttgcaagta gaacattaaa tgatcatgaa 3180 caaaattata gcactattga aaaagaacta ctagcaatcg tttgggcaac caaatatttc 3240 cgcccatatt tatatggagt cccctttgag attttgagcg atcacaaacc ccttgtttgg 3300 ctaaataata ttaaagagcc taacataaaa ctacaacgtt ggcgaattag acttagcgaa 3360 tataatttca aaatcaaata tttagaaggt aaacttaatc atgtagcaga cgctttatct 3420 cgcgtcaaaa ttgaagaaaa catggtaggc gaagaagaaa atatttcaac agcagcaaca 3480 atacacagtt catcagaaag caacgaaacc tacattgaaa ttacagaaaa acccataaac 3540 tattttgcaa gacaaattga atttataaaa gatagcgtag accaagtaga gacaacgaaa 3600 tacttttcca aaataaaact gaccatcaaa tacactgaca tgaccctgac aaaagcaaaa 3660 gatatcctta ttaaatactt cttaaataat agcagtgtta tttacctgga aaacgataaa 3720 gactttttag tttttcaaaa agcatatcga gaaaccatta acccacatag taacgtgaaa 3780 attctgaaaa gtatcataat gttaaaagac ataaaatctt atcctgaatt taaagaattc 3840 attattactc aacattctaa attgttacac cctggtattg aaaaaatgat acgattattt 3900 aaagaaactc actattttcc cgattataat aaactaattc aaaacattat caatgattgt 3960 gaagtctgta accttgcaaa gacagaacat agaccaacaa aactagtttt cgaaataact 4020 ccagagacaa ataacccacg cgaaatttat gttattgact tttatgctat tgacaacgaa 4080 cagtatttat cttgtataga cgtttattca aaatttgctt cacttattaa aacaaacagt 4140 agagattggt tagaggcaaa acgagccctt actagaattt ttaacgacat gggaaaacca 4200 cagaaaatta aagcagacaa ggactccgcg tttataagca cttctttaaa aacttggcta 4260 aacaacgaag acatacaaat agatattacc acaagtaaaa caggaatagc agacatagaa 4320 cgccttcata aaaccataaa tgaaaaaatc agaattatca atactgaaaa caatagagaa 4380 aataaagaaa cacgaatgga aacaatttta tacatataca accacaagac taaacataac 4440 acaaccggac agataccggc taacatcctt ctttacgcag acacacctac ttatgatacc 4500 cagatgatta aagaaatgaa aatcaaaaat cttaacaaaa aacgacagga tttcgaaatc 4560 gacacaaaat ttagacaagc acccttaaca cgcgcaaaat caaagaatcc ctttagaaaa 4620 acaggcagaa tagaacaatt agacgaaaaa cattataacg aaaataatag aggtcaaaac 4680 gttattcatt ataaaagcaa atttaaaaag aaaaagaaag ttaatgacag caaatacttg 4740 cagcaaaccg acacctccca atcatcggac tcataatcac cataataact tgtattgcga 4800 cacaaactac agcaggaaca atcgaaatca acccaataga aaacaatcaa ggatttatct 4860 tgtttgaatc tggaacaata caaatcccca tcacctttat gcatcactgt ctcaccataa 4920 atattacaga agttgaagaa acatttaata gtataattaa acaatgtcaa gaatttaaaa 4980 atgtcacaca aattaaatat ttaaccgaga aaatggaaag agaaataaat ggcatacgca 5040 tctcaaaacg aaacaaacga ggactggcta actttgtagg ttccacatta agatatcttt 5100 ttggcacact agacgaagac gacagacaac atattgaaca acaaatttcg actctatcac 5160 aagacacagt acaggtcagc actctaaacc acgttattga cagtctcaat aatggcatag 5220 aaataattaa taaccagtcc aattctttga aaaaggaaca aaaattgaat ctattaatat 5280 ttaatattga acattttacc gaatatatcg aagacattga aatgggttca caactaacac 5340 gactaggaat atttaatcct aaactactaa aacatgaaaa tattggcaac attaattaca 5400 aaaatctgat aaacataaaa acttccgctt ggtataacgt acctactaat gaaatattcc 5460 taatgtccca cattcctatg aattcaattg aacgaccaac tttcatcatt gcaccttacc 5520 ccgacaataa tggccaaatt attaaggaaa atattgaagg caaattctac tcacataaca 5580 actacgttct aaatacacta acaaagaata ttgtcaaaga ccattgcata gctaacataa 5640 taaaacacga aacaccaact tgtacttttc aaaaatatcg taaacaatcc tatattcaat 5700 acattaaacc aaatatactt ataacctgga atatgaccag agaaaaactt tatcacaatt 5760 gtaacggtca agagatttat attgaaaata ataaaattat aaaaatatcc aactgcacag 5820 cagaattaaa acaaattaca atatctaata gtattcaaca atacacaaat gtaatattta 5880 ctgaacacaa tgtaacaaaa atcgaaccaa tgtcacacat agaaattaag gaaatgattt 5940 tgttaaacaa taagagtaac acaatttaca gaaacatact aataatcttc atatctgttt 6000 tagttttaat ttacatattt tacttttata tgaaatgtaa aacagcgcca cacaaaatta 6060 ttatctctta cctaaaacca agtaaaacca ctaacaaaaa tatcgaaaaa gaaacagaga 6120 caaaaacaga aatagctgaa attgcaaatc cagtaccaca cttatatcca gaaataatcg 6180 cttgaggaca agctaatatc taaaaggtgg gggag 6215 // ID Gypsy-117_AA-LTR repbase; DNA; INV; 205 BP. XX AC supercont1.60; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-117_AA_; KW Gypsy-117_AA-I; Gypsy-117_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.60; Positions 2742616 2742820. XX SQ Sequence 205 BP; 64 A; 39 C; 37 G; 65 T; 0 other; tgtttctctc actaaacaca gatgtcacca tagagaataa gctattgtaa ggaatatcta 60 ttgtctactg tgtagacttc ctgtacatat tgcctataaa tgatgtatga accctagaat 120 agctctctca gtcctagacg atccaccgga ctgtaaatat agattagaag tggaagttct 180 ggtttttcta tagtaagtcc gaaca 205 // ID DNA8-53_AP repbase; DNA; INV; 391 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-53_AP. XX NM DNA8-53_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-391 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1985-1985 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 391 BP; 145 A; 62 C; 46 G; 138 T; 0 other; catagataat ataagaatgg ataggacata agaacttatc acatgaaaat ttagtgatac 60 tattttaagg ttatatgggg cgaatattga tgtgataatt gataaataat aatgaatatt 120 aaataacata attttatttt gttaaaatat tattttccaa tttccatttt ccaggtacag 180 gtggcaaaac aatcaaaata caaactgaca aatatctaaa tataactgac tatcatattt 240 attcattatt atttaataaa acagttttgc acataacctc aaaaagcctc atatcgtctt 300 tctctcttta ctttctctct tccccaaaaa agcacacggc ctcatatgga actggtatag 360 gcatgtccta tccattctta tattatctat g 391 // ID BEL-1N_AA-LTR repbase; DNA; INV; 670 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; nonautonomous; KW BEL-1N_AA-I; BEL-1N_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-670 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 850-850 (2011). XX DR [1] (Consensus) XX CC 96% identical to consensus. XX SQ Sequence 670 BP; 170 A; 217 C; 133 G; 150 T; 0 other; tgttcgcgcc aacgcgaagt ttctgatgag tttgtctgtg agtccttccc acccctgtcc 60 catgtaacct cgagccgccg cgagtatttc ggcctctcat cccgacaact aacaaccacc 120 ccaaataacc cacacacaca caccacgtcg ttagttttgc ttgcttccct ttcccatcct 180 tccgagcgac atgatcgagc atgtacgcaa tccagctatg tacagcgtga agcagcaccc 240 aaagagcttc cactaccacc accacaacga caccagcgag cagaccgatt gcagcatcca 300 gtagtaatac atagcaggca gtggaagttt ccaccgccgc ccccacaccg ctgcgcatgc 360 aacgtttttg gctacctgcc acctatataa ggagcttgca tgcgcaagga gtttattcag 420 ttttctttca accaccattc tgtaaacatc aagaggaata atatagaagt ttaagttaaa 480 gttaccaaac cgcctttccc tccaccagtg gaagccaatc agcctttttg aaggctagag 540 ttgcaagagc cccctccagt cgagctatcg gtcctgtgcc ctgttgctcc cggcaacagg 600 tggacggaag atcaacagat ctttccgttc ccgctcaccc gagccaaagt cccagttgca 660 cgccgcaaca 670 // ID TTAA18C_AP repbase; DNA; INV; 451 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA18C_AP. XX NM TTAA18C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-451 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2084-2084 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 451 BP; 145 A; 78 C; 93 G; 133 T; 2 other; cccgtcgttg gtgacgctaa tgttttcaca tcattagtga cgtgttagcg tttcgctcac 60 atttaattat ttttggaagg gtgaaattgg gaaaagtgga ctgactaatc tataagtaga 120 cattacaacg aatccaaatc agttttaaaa actattatcg cattactagg tcgatcagta 180 tgatggaaga aatattggta tgttttggct aaaataactg gccgccgtcg gaccgccgat 240 ccccggcgag caaatactgg taaccgtcga catttgatgg taataagacg gaaagtttat 300 atatagatta gxtattttcc acgtatttat ttccatxagt aacttgtatg agcatactaa 360 aaatatcgat ataagcaaaa taatttgaag ccaatggttc ccaaaataac agttatgtga 420 gcgaaatgct catagcgtca ccaacgacgg g 451 // ID Shinagawa-9_AAe repbase; DNA; INV; 1949 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-9_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1949 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 846-846 (2011). XX DR [2] (Consensus) XX CC ~91% identical to consensus. 8 bp TSDs. TIRs are ~100 bp long CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. XX SQ Sequence 1949 BP; 552 A; 385 C; 344 G; 668 T; 0 other; ggttctatac catttcccgg aaagacgttt cccggaaatc atttcccgga taccccattt 60 cccggaaaga catttcccgg aatgacccat ttcccggaat aacccatttc ccggaaagct 120 actcaatggt aaataaaata agtaatttcc ggttctatac cattttttaa agcaaagtaa 180 tatcagaacc atgaaaagaa tgtggcaatc tcaagaatac tttgtaaaga gccactagat 240 tgtcatagat tagtttttcg acgttcatac cagccacgaa atgcatcgaa aagaatgatc 300 aaagtcaccc cgcacggcgg taatatttca ttgatatttc aatgcttaat tgcatcctcg 360 tttttcatgt tgaataaaac tatttaaaaa tttaaaaatt tagcaacagg caacgtgatt 420 ttagctttat tattgaatgt tttgaaaaaa tatatattaa tgattttatc caacatattc 480 ttattttttt tttaatatat cgaggtttta taatggatcg aaatttttaa cagctaggat 540 ttttttttca aacaatctga ttccaagctt agttaagtct atttatatat catgtagacc 600 aattttctag atttcatttc ccttcgaaag gtttccagct gtagacttac ttcaagacta 660 aataatctca agacatttta agcctctata ccatttttgt gcagcctgtg agaaagtttt 720 tgcccggtcc attttcgtgt tattctggta taattattat tgttttgccc cactgtctac 780 attttcattg gataattgcc cttcttccat acataggctg ttctttcgag ttatgctgtt 840 tgaacaatta ataatgttca aactgatgaa taaacccatc aacatacatt ccagttatag 900 ctactgccat tttcatagag aatcaccctt cttttggaag tttgccgttc ttccgaatca 960 taccgacaga aacatttttg gcgagaaggc tgctaatatt gaacataaga aattttccct 1020 tctttcaaac taaggttatt ctattgagta attggattat atcagggctg ggaatggtaa 1080 tagtagtaag cttgaatgtc gcgaaactca cgtagctctt cttactacag tagtttgaaa 1140 acgtgaatct catcaagtag cctattttgg gtgcatgaat tttctaaatc attagtgtca 1200 tcgaaggctg taaatgcggg aaatatgctc cggaaaatat aacattttcc tacagaagtt 1260 ttgggttgcc tttagcaacg ggtgttttac tacattctag gcattttgcc tacagcaccg 1320 ataggctttg aatttcactg gcttcgctga ctgcgatttg ctctgcttgg ctactgtagc 1380 gtgaattcca tgactttttg caatcctgga ttacacacaa attatgtcac gcacaatttt 1440 gacatattgg accaactgtc ttcttgtcaa ttttataagg cttgtgacgc ttccttcaac 1500 tccttccgca cctttggaat gagacactag cttgaacttt atgatgtgat agtgtttcaa 1560 ttggtatcat cttcttctat tttttctttc tggcatgacg tttcaactgc ttctcggctc 1620 tatgttcata ttagcactcc cacagttatt aactgagaat attcttcgcc aattgacaaa 1680 gaaactctat gccctcaaaa gtagaagaat tttccaatcc taaaaaccct cgaccggcgg 1740 gattggaacc tgctatactt agctacggtc ttgctgaaca gctgctcgtt aatcgctacg 1800 aattttcggg ttccgcgaaa gtttattttt agtgcattcc gggaagtggc attccgggaa 1860 atgggtttcc gggaaatggg tcattccggg aaatgatttc cgggaaacgg cattccggga 1920 aatggttttc cgggaaatgt catagaatc 1949 // ID BEL-207_AA-LTR repbase; DNA; INV; 558 BP. XX AC AAGE02025367; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-207_AA_; KW BEL-207_AA-I; BEL-207_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-558 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025367; Positions 100280 100837. XX SQ Sequence 558 BP; 161 A; 138 C; 120 G; 139 T; 0 other; tgtagaagat ccatatgaaa tacttaatcc cgatcctatc aaccaaactc aatttgtgtc 60 accctaatgc cttatcaatg tatttcttat atactcacta cccagttagt tcgagcgagc 120 gagatgtaga gatacgagag aaagtagggt cactcacaat gaccactaga aaggtacatg 180 taattagccg agcgatcagg gtcatcgatc gctctgatca atatttagtt cagttagttc 240 aatccaaccg ctcaacccga taggttgata atatacggcc gtgcgcgtaa aaccctaaac 300 acgaatcgtt ctaacgagtc cgaaacgtta agtgcttgtc cgcgactttg attacaaggt 360 gggaaagcag atcccggttc ggtgtccgta ttgcgccaag tgatagtgct aaacttgtgc 420 tctggtgaaa gagcaaagct agcgaagaag ttcccccctc ccccaatcgt agccacaccg 480 ctggccttcc taattgccgt tgcgataacg gcgaagaaaa cccttcgatt cagcgtatac 540 agccgattgg ctcgtaca 558 // ID SINE1a2_Cis repbase; DNA; INV; 303 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE SINE Non-LTR Retrotransposon from Ciona savignyi. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; L1-99ext_Cis; KW SINE1a2_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-303 RA Smit A.F.; RT "SINE1a2_Cis - SINE Non-LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC Ci000003. XX SQ Sequence 303 BP; 67 A; 76 C; 79 G; 81 T; 0 other; ttgtccggcc ggttagtctg gcggttaagt gcgtcgcctt tcaatcgaaa ggttgcaggt 60 gcgaaactgg tcgctagcta gtcggttgtg tccttgggca aggcacttta cggacattgc 120 ctgaacccag cggattaatg ggttctacca aattgaagga acgtgtctat catatacaac 180 acactgcaat agctccggta acccgacggt gggcgcgagg tgatctgacg attgcccgtg 240 tgttaacccc cttggttttc ccattcacgg ggataaacat gaatatccta tcctatccta 300 tcc 303 // ID hAT-22_HM repbase; DNA; INV; 3412 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-22_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3412 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2011-2011 (2008). XX DR [1] (Consensus) XX CC Average identity to consensus >97%. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(607..2163,2167..3189) FT /product="hAT-22_HM_1p" FT /translation="MDHYQHKSGAQKRKEKIAREKSAKKGQSTIESFGFIN FT CNRIDGVSEDQMCKMKSQLSNVEVEPVPIEEDELLNRKVMSADVKENVIDE FT NENILPKTSTARLEELQDKADDVQLVPSNNYDIGLIKELFCTAEQIEKFVH FT HGPIAHPTKFPKDALPKSFPRTLFHIAMPNGEKIPRDWLTWSETKNALYCF FT PCRLFSKMPETHRSALSLPTGYSKPKDNKWKKLYEKIPHHQSSSGHKMCYV FT EWRRLEENLRDNTTINFLLNDNIKTEVERWRQILRRVLDVILFLGERGLAF FT RGDSHLIGDPKNGNFLGIMELIGHYDSILHDHLEKVKASQQSHKRMQAHYL FT SNDIQNEFIQCCADKVTDVILDEREKCKYFSMIVDATPDTAHIEQNVFILR FT YVLQDKLTGKYEVKERFLEFVDCNKKTGEDIANIITSTLQKHKIPLMCCRG FT QGYDNGSNMKGSVKGAQARILQHYPLATYSPCACHSLNLCGAQAAECCPQV FT ITFFGIVQKLYNIFSSSPQRWEIKKNIGSSLHSMSQTRWSARVDCIKPFAT FT HLPGIIKSVADIRNLNLSIENRADLNGIISYMESFECILMSAIWFKVLTAI FT NYTNLVLQARNATLDVEVTNIKRLIDELKTLRDKWDSIMVESKLVANTLNI FT SQELPEPRRKIRKRFADETPDEISLSNSENNFKQNVFYILLDCIIGNISRR FT YEAAYEIDETFNFLWKYNHYEEEEIRRRSKDFVNKYKDDVSMELIDELLHL FT KFIHDVNIKNDLDPFLLLNKINDLKLICLFPNICIGLRIFCTLPVTVAQAE FT RSFSRLALIKNDLRSTMTQGRVSGLGILAMECDLAKKINFDSVIADFASRK FT ARKVLI*" XX SQ Sequence 3412 BP; 1163 A; 562 C; 637 G; 1050 T; 0 other; caggcccgac gagagggggg gccagagggg gtattttccc cgggcccgta ggggcccggg 60 cccatcacca tcggtctctt agtctatcag gtcccagtct ccaggtaatc agccaatccc 120 catcaagcaa cattgtgtct gcatcacttc atcagttaga taaataaata aatttactat 180 gtaatattga taggataaat ctattcttta ttaataattt aatttattta atgaaaatac 240 gcttttgaga gtaaaaacgg agactaatag cctctaaaac ttaaaaaata aatatagggc 300 accagcccgc aggtttttcg taagttcctt gctcatgcca tgctttataa tatatataat 360 aggcctatat ataataggcc atatatataa tttaatatat aaagttatat atatatatta 420 tatacatata tctattttat ttatacagta tatactgtac tgtacattaa actgaacttc 480 tattttaact tttaacaatt attaattgtc atttatttta atttgtcaat ttacttatta 540 acttaactaa taaatttttg ttctattgtt acaggacctc gcataggcag cctactacat 600 gcagctatgg atcattacca gcataaatct ggtgctcaaa aaagaaaaga aaagattgct 660 cgagaaaaat cagccaagaa gggtcaaagt acgattgagt cttttggatt tataaattgt 720 aatcgtattg atggtgtttc tgaggatcag atgtgcaaga tgaagtcaca actctcaaat 780 gttgaagtag agccagtgcc aattgaggaa gatgaattgt taaataggaa agttatgagt 840 gctgatgtga aagagaatgt aatagacgaa aatgaaaata ttcttccaaa gacatcaact 900 gcaaggctag aagaattaca agacaaggca gatgatgtcc aactagttcc atcaaataac 960 tatgatattg gtttaattaa ggaattattt tgcactgctg aacaaataga aaaatttgtt 1020 catcatggac ccattgctca cccaacaaaa tttccaaaag atgcattgcc taaatcattt 1080 ccgcgcacgt tatttcacat tgccatgcca aatggcgaaa aaattccacg agattggcta 1140 acgtggagtg agacgaagaa tgcactttac tgttttccct gccgactttt ctctaaaatg 1200 ccagaaaccc atcgatcggc attatctctg ccaacaggat attccaagcc caaggacaat 1260 aaatggaaga aactttatga aaaaattcct catcatcaat ccagcagtgg gcataagatg 1320 tgttatgtag aatggagacg attagaagaa aatctcagag ataatacaac tatcaatttt 1380 ctactaaatg ataatataaa gacagaagtg gaaagatgga ggcaaatact tcgtagagta 1440 ttagatgtta tcttgttttt gggtgaaaga ggtcttgcat ttagaggcga tagtcatcta 1500 attggtgacc ccaaaaatgg gaattttttg ggaattatgg aactcattgg tcattacgat 1560 tccattctac atgatcatct cgaaaaagta aaagcttctc aacagagcca taagcgaatg 1620 caagcacatt atttgtcaaa tgatatccag aatgagttta ttcagtgctg tgcagataaa 1680 gtcacagatg tgatattaga tgagagggag aaatgcaaat atttttcaat gattgtggat 1740 gcaacgcctg atacagctca tattgagcaa aatgttttta ttttaagata cgtcttacag 1800 gacaaattga caggaaaata tgaggtaaaa gagagatttc tggaatttgt agattgcaat 1860 aaaaaaactg gtgaagatat tgcaaatata attacttcta ctcttcagaa acataaaata 1920 ccactgatgt gctgtcgtgg acaaggatac gacaatggga gtaatatgaa aggttcagtc 1980 aagggagcac aagctcgaat tcttcaacat tatccacttg ctacttactc tccctgtgct 2040 tgtcatagtc tgaatttatg cggtgcacaa gctgctgaat gttgtcctca agtgataact 2100 ttctttggga tagtgcaaaa actgtataac atatttagta gcagtcctca aagatgggaa 2160 atttaaaaaa aaaatattgg ttcgtctctt catagcatgt cccaaacgcg ctggtcagct 2220 agagtggact gcattaagcc attcgcaacc caccttcctg gtataatcaa atcagtagct 2280 gatataagga acctaaattt gtctatagaa aatcgtgctg atctaaatgg gattatttca 2340 tatatggaat cgtttgaatg cattctgatg tccgctatat ggttcaaagt acttactgct 2400 atcaattaca ccaatcttgt tttgcaagca cggaatgcta cattggatgt agaagtgacc 2460 aacatcaaac gactaattga tgaattaaaa actcttcgag ataaatggga ttctattatg 2520 gtggagagta aattagttgc aaacacctta aacatttctc aagaacttcc agaacctaga 2580 cgaaaaatta ggaagcgatt tgctgatgag acacctgatg aaatttctct atcaaattct 2640 gaaaataatt ttaaacaaaa cgttttttat atattattgg actgcattat agggaatatc 2700 agtagacgat atgaagctgc atatgaaatt gacgaaactt ttaactttct ctggaagtat 2760 aatcattacg aagaggaaga aattcgccga aggagtaagg attttgtgaa taaatacaag 2820 gatgatgttt ccatggagct gattgatgag cttttgcatt tgaaatttat ccatgatgtt 2880 aacatcaaaa atgatttgga cccctttctt cttctcaata aaatcaacga cctgaaactt 2940 atatgtcttt tccccaacat ttgtattggc ctgagaattt tttgtacatt accagttacg 3000 gtagcacaag ctgaacggtc ttttagcaga ctggcgctta taaaaaatga tctaagatca 3060 acaatgacgc aaggtagagt ttctggcttg ggaattttag cgatggagtg tgatttagca 3120 aaaaagatca attttgacag tgtaattgct gattttgcca gtaggaaagc gaggaaagtt 3180 ttaatctaac ttacaggttt tatatatttt aattttgttg tagttaaatt atatataatt 3240 tttcactgta ttgtatttaa aatatacttg taaataaaat actgtatata gtcaaattaa 3300 aaaattattt gtatcaaaag tagtattatt catatgtctg aagccctaag ccctgagggg 3360 gcccgtcagt ttgatattac cccaggcccg gtaccagctc tccacggccc tg 3412 // ID BEL-175_AA-LTR repbase; DNA; INV; 501 BP. XX AC supercont1.27; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-175_AA_; KW BEL-175_AA-I; BEL-175_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-501 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.27; Positions 1378046 1378546. XX SQ Sequence 501 BP; 186 A; 82 C; 95 G; 138 T; 0 other; tgttgcggtg taatgtacca aacccattcg cgcgaggtag aatgaaaatc tggcagcaca 60 gttgctgctt gactgccgat agggcacatg ggtatagctg ccgatagggc aatgaaagga 120 aaagagcaaa acaaaacaga attgtgtagt catcgtaatt gagtggattt ttatttcggt 180 tgaatttact tgttagcaaa ttgaatttta ttaataagcc tatttaaata ctggtaaatt 240 gtaagtacac aaatgaatta cactaaaatt aaacctaaac attgaaactc aacaattaca 300 cagctttgga acacacgatt gaagttgtta cgacaagacg gatcagttac agcaaaggac 360 tagtaaatgt aggtaaaatc atttgtctaa attaacctaa aataacaaat taaaacatta 420 aattacagct tatagctaac tcccatccaa ataacgagtt tgcttattag aggtccgaag 480 atatccgtac gtgtgacaac a 501 // ID Crack-1_BF repbase; DNA; INV; 4857 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-1_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4857 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4857 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 806-806 (2009). XX DR [2] (Consensus) XX CC ORF1-encoded protein is similar to the L1/Tx1 ORF1 protein. XX FH Key Location/Qualifiers FT CDS 465..1238 FT /product="Crack-1_BF_1p" FT /translation="MPPRKQNLSEDDMSTIKDLLEQQQAKYEDLLQRQQSI FT FQSFIESVMTSTNNRFDNLVREVQELKSSLQFSQHEVDTLKTKLNEANSEI FT TNLKATIDDQAKNPKSLQMPDMEKKMDYLENQSRRNNVVIDGIEDDVKETW FT ADTEVKVRNILTKKLQLDSKTIEIERAHRNGPFRNDSPRPRQVVVKLLRFK FT DKQMILNRARSNLKNTNIYINEDFSDQVRKRRAELVPKMREARQRGEFAII FT SYDRLVIRKKTNQEMS*" FT CDS 1408..4524 FT /product="Crack-1_BF_2p" FT /translation="MSNTSTQTDDFNSTPYNFEDGNLIFNPFMLDSDKESN FT FTDSWDPDVNFYGNFLDTNTNTPCVNYSENQFNSMCLKSGHNTNAFSALHL FT NIRSLPRNFDNFTHYLSTLNHDFSIIGLSETWLTQSTLQLYDFPNYTSVHR FT CRSNRVGGGVSLFIRENLEFVQRDDLTIHSHEGNVESIFIELPVSSSPTRK FT NMILGCVYRPPDTDIKDFTSHLATTLDTISKEGKTCFLLGDFNIDLLKNER FT HTLTTDFLNMLFSCCMYPTITKPTRITDKSATLIDNILTNSLSESYTSGTL FT VTDISDHLPIFYIMKCKSDVSKTNSTKRRLRKFTSSNITKFKNQLADVTWD FT KLYNYSNTNDAYSHFIQVITSLFEECFPIVTHSERNVEKNKPWFTPALHKS FT SIKKNKLYKKYVRNPNPTSFQTYKTYRNKFNTLIRLSKRNHYHKKFKETTD FT NIRTTWKLINELLNKKKKSNVLPSKLCDDTYEISNPQDIVDAFNNFFVNIG FT PSLAKKIDKTDCSPLSFINNVYPSYSFFETPTASEVQNIICNLKNSAPGND FT ELSTSLIKQVSSSILEPLTYILKLSLETGMVPNDLKIAKVIPLFKSGNTCL FT LSNYRPISVLPVFSKVLEKLVYKRILQHLEDNNILHEHQYGFRKHHSTYMT FT LLKLVDKITSAIEKNEFTIGVFLDLSKAFDTVNHKILLEKLNCYGFQGYVF FT TWLKDYLSNRKQYVSNNTYSSDTRSISCGVPQGSILGPLLFLIYINDLAYV FT SDELYALLFADDTNLFSSHSDLDILVSKMNRELDKVNKWFQINKLSLNVKK FT TNFIIFAGKKKYNKENVKIAINNTPINQVTHTCFLGVTIDDKLTWKKHIDL FT ICKKIYKNIGIVRKIATLLSSKTYMILYYSLLYPYLAYCNIVWASTYSTSL FT KPLLLLQKRFVRIVSNVSYTFPSASLFRNLKVLTIYDINKLQSCLFVHKTI FT NSSCIPKQFQNLFHHNSDHHHYNTRQSNHLKQIQVKTQQRKFSMLFKCPQI FT WNSLNETLHLCHSPPIFKNLLKKSLLDNQTFN*" XX SQ Sequence 4857 BP; 1639 A; 1012 C; 749 G; 1457 T; 0 other; accagagttg tggtatcgcc caatggaaga ggtggctaaa tttcagtgct cttcaattac 60 ctacgatctt taaccagaaa ctgtacgcta aagttatttt tcgagttcat ctgtgagtgt 120 acgtgtgctg gagcctgtgg actgtctgct gagctgatat gaggtggttc cgacgtctga 180 tctgctgaag acgagagccg ttcctgtttg actagccctg gccagccgcc atattacctg 240 aagcgtcaca accgcagcgg gctcctttga ggagcccccg ccggtaagtt ttttttacaa 300 cagcagattc cactacacca tcacttctat actatatcat tatcttcctg cgatctttct 360 gccgtttgct gctgtatttt attacaccct tgcagagttg cagagggctg gagttgagca 420 tcgccatatc tctcaggtgt ggtcatttgc atatcaatct cacgatgccg ccgcggaaac 480 aaaacttgtc agaggatgac atgtccacaa taaaggattt gttggagcaa cagcaggcaa 540 agtatgaaga cttactacaa agacaacaat caatctttca atcattcata gaatctgtca 600 tgacctctac taataaccga tttgacaacc tagttagaga agtacaggag ctgaagtctt 660 ctttgcagtt ctcgcagcac gaagttgaca ctctgaaaac caaactgaac gaagctaatt 720 cagagataac caacctgaaa gccactattg atgaccaagc aaagaatccc aagtctcttc 780 aaatgccaga catggagaaa aagatggact atctggagaa ccagagcaga agaaacaacg 840 tagtcatcga tggcatagag gacgacgtca aggaaacttg ggccgacaca gaagtcaaag 900 tacgcaacat cctgacaaaa aagctacaac tagactcaaa gacaatcgaa atagaaagag 960 ctcatcggaa tggtcccttc agaaatgact ccccccgccc aaggcaagtt gtggtcaagc 1020 tactgcgttt caaggacaaa caaatgatcc ttaacagagc ccgatccaac ctcaagaata 1080 ccaacattta catcaatgaa gacttttcag atcaagttcg aaaacgacgc gcagagctgg 1140 tgccaaagat gagggaagcc agacaacgtg gagagttcgc tatcatcagt tatgaccgac 1200 tcgtcatcag gaagaaaact aatcaggaaa tgtcgtaaag cttatttcca tctgtattac 1260 atttattgtg tgtagtagac ctataatatt tcatgtatga tacttctaga tgcgaccatt 1320 caaatatagt tttccaccca tgtatatttt actatgtaat gatgtataac tatgtaatgt 1380 tgtatattaa ctatgcttat atagaccatg tccaacacat ccactcaaac agatgatttc 1440 aatagtacac catataattt tgaagatggt aacctgatct ttaacccctt catgctcgac 1500 tcagataagg aatcaaactt cactgacagc tgggatccag atgtaaactt ctacggaaac 1560 tttttagata caaatacgaa taccccatgc gttaactact ctgaaaatca gtttaactcc 1620 atgtgcctta agtcaggcca taacactaac gctttctcag cgttgcacct aaacattaga 1680 agtctcccgc gaaatttcga caattttacc cactatctat ctactctaaa tcatgacttc 1740 tctattatag gtttatctga aacatggcta actcaaagta ctcttcaact gtatgatttc 1800 ccaaactata cctcagtaca tcgttgtaga tcgaacagag ttggtggtgg agtatctctc 1860 tttatacgcg aaaatcttga gtttgtacaa agggacgatt taactataca ttcgcatgag 1920 ggaaatgtgg agtccatttt tattgaactt ccagtgtctt cctctcctac tcgtaagaat 1980 atgatattag gatgtgtata ccgaccgcct gacacggata ttaaggactt tactagtcat 2040 cttgcgacaa cactggatac aataagtaag gaaggtaaaa catgttttct tcttggcgat 2100 ttcaatatag acttactaaa aaatgaaaga catactctaa cgacagattt tttaaatatg 2160 ttattctcgt gttgtatgta tcccacaatc accaaaccaa caagaataac tgacaaatct 2220 gcgaccctaa tcgataatat attaacaaac tccttatctg aatcctatac ttctggcact 2280 cttgtaacag atatttcgga ccatttacct atattttaca taatgaaatg taaatccgat 2340 gtgagtaaaa ctaacagtac aaaacgacgt ttgcgcaaat ttactagttc caatatcact 2400 aaattcaaaa accaattagc ggatgtaaca tgggacaaat tatacaatta ctctaacaca 2460 aatgatgcct acagtcattt tattcaagtc attacttctc tgtttgaaga atgtttcccg 2520 attgttactc actccgaacg taatgtagaa aagaacaaac cctggttcac tcctgcgctt 2580 cataaatcgt ccatcaagaa aaacaagtta tacaaaaaat atgtaaggaa tccaaatcca 2640 acttctttcc agacatataa aacttacaga aataaattca acacattaat tcgcttatct 2700 aaaaggaatc actatcacaa aaaattcaag gagacaacag ataacataag aaccacatgg 2760 aaattgataa acgagcttct taataaaaaa aagaaatcga atgtccttcc tagcaaactc 2820 tgtgacgaca catatgaaat atcaaaccca caagacattg tagatgcctt caataacttt 2880 tttgttaata ttggaccctc attagctaaa aagatcgaca agactgactg ttctcctcta 2940 tctttcatta acaatgtcta cccatcttat agcttctttg aaacacccac tgcatccgaa 3000 gtccagaata ttatttgtaa tcttaagaac tctgctcctg gtaatgatga attaagcacc 3060 tccttaataa aacaagtcag cagctcaata ctagagcccc tcacgtatat attaaaattg 3120 tccttggaaa ctggtatggt cccaaatgat ttaaagatag ccaaagtcat accgcttttc 3180 aaatcgggaa acacctgttt actgtcaaat taccgcccca tatcagtttt gccggtattt 3240 tcaaaagtcc tagagaaatt agtatacaag agaattctac agcacttaga agataacaac 3300 attctccatg aacaccaata tggcttccgt aaacaccatt caacatatat gacacttttg 3360 aaattagtag ataaaattac ctcggccata gaaaagaatg aattcaccat tggtgtgttt 3420 ctggacttgt cgaaagcatt cgacaccgtg aaccataaaa ttctattaga aaaacttaac 3480 tgctatggct ttcaaggata tgttttcaca tggctaaaag actatttatc taacagaaaa 3540 cagtatgttt ccaataatac ctactcctca gatacaagat caatttcctg tggcgtccct 3600 caaggttcta ttcttggtcc tctattattt ctcatatata tcaacgattt ggcttatgta 3660 tctgatgaac tatatgcact actatttgca gatgatacta acttgttttc atcacattct 3720 gaccttgata tattagtcag taaaatgaat agagaattag acaaagtaaa caaatggttc 3780 caaattaaca aattatcctt aaatgtcaag aaaacaaatt tcattatttt tgcagggaag 3840 aaaaaataca ataaggaaaa tgttaaaata gctattaata acacaccaat taatcaagtt 3900 acgcacacgt gtttcttagg tgtcaccatt gatgacaaac ttacctggaa aaaacatatt 3960 gaccttattt gcaaaaaaat ctataaaaac ataggtattg tcagaaaaat cgcaacgcta 4020 ctatcttcta aaacatatat gatattatac tatagtttac tatatcccta tcttgcatat 4080 tgtaacatcg tatgggcgag tacgtactcc acatcactta aacctctact tcttctgcag 4140 aaacgcttcg tccgaatagt ctcaaatgtg tcgtatacat tcccttctgc atctttgttt 4200 cgtaatttaa aagtattaac tatatatgat ataaacaaac tgcaaagttg cctttttgtt 4260 cacaaaacaa ttaatagttc ttgtatacca aagcaattcc agaacctttt ccatcacaat 4320 tcagaccacc atcattacaa tactagacag tctaaccatt taaaacagat acaagtaaaa 4380 acacaacaaa gaaagttttc catgttgttc aagtgtccgc aaatctggaa tagtttaaat 4440 gaaaccttac atctttgtca ttcaccaccc atcttcaaaa acctgttaaa aaagtcatta 4500 ttagataatc aaacatttaa ttaatctccc attcattgtc agttgttaca tttctttcaa 4560 tgttatattc ttattacatc atttcattca tttcgccgtt atgatattgc attagttatc 4620 aattattgaa ctattatgat ttatttccat tttgttaatt gatataattc ttttggttac 4680 attattgtct aatgtataat ttttacacgt ctattcaatc ttgttacttt ggaggaagac 4740 tttgtacaag ccacgaggct tttttcttcc cccctgcatg tactttcatc ctttaatttg 4800 ttttatgtta acttttttaa tgattgtgca aaataaataa aatcaaatca aatcaaa 4857 // ID Gypsy-16_DWil-I repbase; DNA; INV; 4867 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_DWil_; KW Gypsy-16_DWil-LTR; Gypsy-16_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4867 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 8553748 8558614. XX CC Positions [2252-2680] - Reverse transcriptase CC Positions [3821-4297] - Integrase core CC 'AATA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 29..4678 FT /product="Gypsy-16_DWil-I_1p" FT /translation="MSNAVQVSDLTLDQLKQWLATLNMPTNGSRNELVLRL FT YSVPVETRGHAPPGWNEHDNSEHNLDDIQLMRPNPDQLNEASIGDAIPRSV FT QQEAVNSNFNNNNDVALVQQLLQQLQLLIPRIEHGHDLGRSFQNLENNGNA FT TGNVIDNNVANENIDDSSIPIHGNATVEATNQYSGNSNGESMGIAFSLAKE FT VLMDFNGESCSKKWVAQLKNIGEVYNIGNTHLRMLFICKMKGKAHSWLHSE FT VTRIREPVAVLCEQLMAAFGEKMSKSQMRRHFEQRNWKFGEKFAVYLDDKL FT MLANNINIDEEELLDKVIEGIPDKGLRTQARIQCFANPKHMLAAFAEIHLE FT DIRRATKDENETSRANKLQEMRCRRCNIRGHLVKDCNRPDRVPGSCFICGS FT MEHWAAKCPDRKGSRETNGLNRTNQSSNNFVREFIIHFVDSPSNSFIAACL FT IDSGSPVSLIKKKHLPNHSYLYSVEQSYQGLNGSPLITLGKLLCYVVKNSI FT KIYFYLLVIPDESMSRDVIFGRDFLTSCNLKLDLDSFEKSALVTSQSHDNV FT RKVNFKEIGMLTSEHMENEIVNISVTGNTENKNINANDKINTENVNADKEI FT IEYNVVNKRTCIDKEMTLEFEKNIIKGTNEEKYRIEEIKLKEPRSESETNM FT EDNFERRMLEIFYIDNGNPEPEYVIGENIDYKTAKSFDKLVRDRYVNAQRP FT DKPNIRCEMKLNLENCKPFSYAPRRLSYSEKEKLQKLLDEYIREEIIRPSD FT SEYASPVVLVKKKTGDIRLCIDFRKLNKMIIKDNYPLPLIDDLLDRLVNKR FT VFSKLDLKNGYFHVFVDKESVKYTSFITPLGQYEFLRMPMGIKNASHVFQR FT FINRIFEDMVRKGQVLVYMDDIMIASANTTEHLQTLEEVFDRLVRNKLRLR FT MDKCEFFMSSIKYLGYKITENKICADDKGLDAVKNFPVPGNVHAVKSFLGL FT CSYFRRFIKDFSIIAKPMYDLMKKDKEFQFGKEELECFLNLKQILLEAPVL FT AIYDPRDDTELHCDASALGFGAVLLQKKADKKLHPIFYFSKRTSEVESKYH FT SFELETLAIVYSLRRFRIYVQGKKFKILTDCNSLTLTLNKIELNPRIARWA FT LELQEYDYELAHRSGKQMQHVDALSRCTNIMVIEPNSFEHNLVICQAKDLE FT LKKIKEMLEKSEHKLFEMRNGVLYRKSNDKKILFYVPKEMENHVLYKYHDE FT LGHVGRDKMLDAIGKSYWFPSLKDKATRHIENCLRCIAFSPKCGKGDGFLN FT SIPKGDKPFELIHIDHYGPIEAGRSKKHIFAVVDGFTKFVRLYTTKTTKTK FT EAIDALKAYFRSYSRPKCIISDRGSCFTSSEFESFLVEQKVQHVKIATGSP FT QANGQVERINRTLGPMLAKLTDPDKGNYWDTVVENVEYALNNTIHRSIKQY FT PSIMLFGIEQRGVVTDELKEEIKELRNTENQINLEMIRKNAALNQQKAQAY FT NERYYNSKRTGQKEYKIGDYVMVKNFDSTTGISRKLIPKNKGPYVVDKVLK FT NNRFLLKDVEGFQLSRNPYQGVWCGQNIKPWLRT" XX SQ Sequence 4867 BP; 1810 A; 824 C; 1006 G; 1227 T; 0 other; atatcagaag tggtcctaac ccaaaactat gagtaacgca gttcaagtat ccgatttaac 60 cctcgaccaa ctaaaacaat ggctagcaac tttaaacatg ccaacaaacg gctcaagaaa 120 tgaattagtg cttcgtctgt atagtgtgcc agtcgaaaca cgtggtcatg ccccacctgg 180 ttggaacgaa catgataatt cagagcataa tttagacgac atccaattga tgcggcctaa 240 cccagaccaa ttaaatgaag cgtcaattgg cgacgcaatt ccgagaagcg tacaacaaga 300 agcggtgaac tccaatttca acaacaacaa cgatgtcgcg ttggttcaac aattgctaca 360 acaacttcaa ctgctgattc ctagaatcga gcatggtcac gatttgggca gatctttcca 420 gaatttggaa aacaatggca acgccactgg caatgttatc gataacaatg tagctaatga 480 gaatatcgat gacagcagca ttcccatcca tggcaacgcc acagtcgaag ccaccaacca 540 atactctggc aactcaaacg gcgaatcgat gggtatagct ttttcactgg cgaaggaagt 600 attgatggac ttcaacggcg agtcgtgctc caaaaagtgg gtggctcagc taaaaaatat 660 cggtgaagtg tataacatag gcaacactca cttacgaatg ctattcattt gcaagatgaa 720 aggcaaggcg cattcatggc ttcattcgga ggttacacgt attcgcgagc cagtagcagt 780 actatgcgaa cagttaatgg cagcttttgg agagaagatg tccaagtcac aaatgcgtcg 840 tcatttcgaa cagcgtaatt ggaagtttgg cgaaaagttt gcagtttatc tggacgacaa 900 actaatgtta gccaacaata ttaacatcga tgaagaggag ctccttgata aagttatcga 960 gggtatacct gacaagggat tacgcactca agcacggatc caatgtttcg ccaaccctaa 1020 gcatatgtta gctgctttcg ctgagataca tctagaggac attcgacgcg caactaaaga 1080 tgaaaacgaa actagcagag caaacaagct ccaagagatg cgctgtcgta gatgtaacat 1140 tagaggacac ctggtgaagg actgcaacag gccggaccgt gtaccaggct cttgtttcat 1200 ctgtggatcc atggagcatt gggcggccaa atgtccagat aggaagggca gccgtgaaac 1260 taatggactc aaccgtacga accagagcag caacaatttc gtaagagaat tcattataca 1320 ttttgtagat tcgccaagta attcctttat tgcagcgtgc ctcatagatt cggggagtcc 1380 agtttcctta atcaagaaga aacatttacc taatcattca tatctttatt ctgtcgagca 1440 atcttatcaa gggctaaatg gaagtcctct cattacattg ggaaaattac tatgttacgt 1500 tgtgaaaaat tcaataaaga tttactttta tttactcgtt atcccagacg aatctatgag 1560 tcgcgatgta atttttggaa gggatttttt gacatcatgt aatcttaaat tagatttaga 1620 ttcttttgaa aaaagtgcgc tagtaacttc tcaaagccac gataacgtaa gaaaagttaa 1680 ttttaaagaa attggaatgt tgacgtcaga acatatggaa aatgaaattg ttaatataag 1740 tgtaacgggt aatacagaaa acaaaaatat taatgcaaat gataaaatta atacagaaaa 1800 tgttaatgca gataaggaaa ttattgaata caatgttgtt aacaaaagga catgcataga 1860 taaagaaatg actttagagt ttgaaaagaa tataattaaa ggtacaaatg aggaaaagta 1920 taggattgaa gaaattaaat taaaggaacc ccgtagtgag tctgaaacaa atatggaaga 1980 taactttgag aggagaatgt tagaaatttt ttacatcgac aatggaaacc cagaaccaga 2040 atacgtgatc ggtgaaaaca tagattataa aacggctaag agttttgata agctagtcag 2100 ggataggtat gtgaatgcgc aaagaccgga taagccgaat atcagatgtg aaatgaaatt 2160 aaacctagaa aattgcaaac catttagtta cgcaccaaga cggttatcat attctgaaaa 2220 ggaaaaacta cagaaattgt tagatgaata tataagagaa gaaattatca gaccaagcga 2280 ttcggagtat gcgtctccag tagtgttagt gaaaaagaaa acaggggaca tcagattatg 2340 catagatttc cgaaaactga acaagatgat tatcaaggac aactatccgt taccattgat 2400 cgacgatcta ttagacagac tagtcaacaa gcgtgttttt tccaaattag atcttaaaaa 2460 tgggtatttt catgtattcg tcgataaaga gtcagtaaaa tacacgtctt tcatcacgcc 2520 gcttggtcaa tacgagttcc taagaatgcc aatggggatc aagaatgcat cacatgtttt 2580 ccaaaggttt ataaaccgaa tttttgaaga catggtcaga aaagggcagg tcctagtata 2640 catggatgat ataatgatag ccagtgcaaa tacaaccgag cacctacaga ctctagaaga 2700 agtttttgat aggttagtaa gaaacaaatt aagactacga atggacaaat gcgagttttt 2760 catgtcaagt ataaaatacc tgggatataa aatcacagaa aacaaaatat gtgctgacga 2820 taaaggtttg gatgcagtaa aaaactttcc agtgccaggc aatgtacacg cagtgaaaag 2880 tttcttaggt ttgtgctcct actttagaag atttataaaa gacttttcca tcatcgctaa 2940 acctatgtac gacttgatga aaaaagacaa agaatttcaa ttcggaaaag aggaattaga 3000 atgctttctc aacttgaagc aaatactatt ggaagcacca gtattggcga tatatgaccc 3060 cagagacgat acagagcttc attgtgatgc aagcgcatta ggttttgggg cggtcttatt 3120 acaaaagaaa gcggataaaa aactacaccc aatattttat ttttcgaaga gaacaagcga 3180 ggttgagtcg aagtatcaca gttttgaact cgaaacgtta gcgattgtgt actccctgcg 3240 cagatttagg atttatgtcc aaggaaagaa atttaaaatt ttaactgact gcaattctct 3300 gactcttaca ctaaataaga tagaattaaa ccctaggatc gcgagatggg cccttgaact 3360 ccaagagtac gactacgagt tagcacatag atcagggaag caaatgcagc acgtagacgc 3420 gttaagcaga tgcaccaata tcatggtaat agaaccgaat agcttcgagc acaacctagt 3480 tatttgccaa gcaaaagatt tagaactgaa aaaaataaag gaaatgctag aaaaatcaga 3540 gcacaaattg tttgaaatga gaaacggagt attataccga aagtccaatg acaagaaaat 3600 attgttctac gttccaaaag aaatggaaaa tcacgtttta tacaaatatc acgatgaatt 3660 aggccatgta ggtagagaca aaatgttgga cgcaataggc aaaagttatt ggttcccaag 3720 tcttaaggat aaagcaacga gacatataga aaattgtctc agatgcatag cattctcacc 3780 taaatgcggc aaaggcgatg gatttctgaa cagtataccg aaaggcgata aacctttcga 3840 gttaatccat atcgatcact atggaccaat agaagctgga aggtctaaaa aacacatatt 3900 tgcagtagta gacggtttta cgaaattcgt taggctatat acaacaaaaa ccacaaaaac 3960 aaaagaagca atagatgctt taaaagcgta ttttagaagt tatagcaggc cgaaatgcat 4020 aatatctgac agagggagtt gttttacttc gtctgaattc gaatcctttt tggttgaaca 4080 aaaggtgcag cacgttaaaa tagcaacggg ttcccctcag gccaatgggc aagtagaaag 4140 aataaacagg actctagggc ctatgctggc aaagttgaca gatccagata aggggaatta 4200 ctgggatacg gttgtcgaaa acgtagagta tgctttaaat aatacaattc acagaagtat 4260 aaaacagtac cccagtataa tgttatttgg catagaacaa agaggagtag ttacagacga 4320 actgaaagaa gagattaaag agctaagaaa tacagaaaat caaataaacc ttgaaatgat 4380 aagaaagaat gcagcgttaa accagcaaaa agcacaagca tataatgaaa ggtattacaa 4440 tagtaaaaga acagggcaaa aagaatataa aataggggac tatgtcatgg ttaaaaattt 4500 cgacagtaca acgggaatat ctcgaaaatt aataccaaaa aacaaaggac catatgtagt 4560 agataaagta ctgaaaaaca acagattttt actgaaagat gtagagggat ttcaactttc 4620 ccgtaacccg tatcagggag tatggtgtgg ccaaaatatt aagccctggt taagaacttg 4680 aaatagttga ctttaatgta atcataaact tttgttttaa taattgatat aagatgtata 4740 taaaagaaaa atgtatgtaa tagtttgtaa aatataaaca aaaaaaaaat gtgtatagct 4800 cgtaagatat gaaaaaaatg tataccaccc aagatcagga gatcttattt tgtcaggacg 4860 gccgaat 4867 // ID R2Amel repbase; DNA; INV; 5002 BP. XX AC . XX DT 19-FEB-2010 (Rel. 15.02, Created) DT 19-FEB-2010 (Rel. 15.02, Last updated, Version 2) XX DE R2Amel - R2 non-LTR retrotransposon from the honeybee Apis DE mellifera. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2Amel. XX OS Apis mellifera OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Apis. XX RN [1] RP 1-5002 RA Kojima K.K. and Fujiwara H.; RT "Long-Term Inheritance of 28S rDNA-Specific Retrotransposon R2."; RL Molecular Biology and Evolution 22(11), 2157-2165 (2005). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1096..4494 FT /product="R2Amel_1p" FT /translation="MSSNEEGASDTGAPGPGVPVADVSAADGRATYDDHGM FT STDYEKQTIELPLNGQIQCLWCHIEGRNQRFLQESQYLKHKDTQHPKGEII FT WRCAACQKEFEKLHGCRCHLPKCKGRKEAKGVAKFKCDSCEESFLTQRGLS FT MHELHRHPAIRNLKRTQGTSRGNTRPINRASVWSKEETDLLIKLNERYKHL FT KQPNVALKEYFPDKTLKQISDKRRLLPVQEPEDVATTDETGPPPSDSSEES FT IYESATEDEGGGDMQQTAPNDSWKEPFIQSIRTNHLEEEDSLRKVEEAIER FT MAMNEGVTEQEVGTLLEQFVDSLTQSPTTERKGSRRKSQKTTKRKTTHNNR FT KKFLYAKHQELYKKSPRRLLELALSGESSSGREVVNLPEADSVGPLYKSLW FT GQIGPEKTHRNQPMCNNIDMSEIWTPIALESLVEKFKKIKSDTAAGADQIK FT KFHLRKKGALHVFAKLCNLLMLHRIYPAQWKTNRTTLIPKPGKSAEEVENW FT RPITIGSLLGRIYSAMIDRKLRSKIKQHIRQKGFTQEDGCKNNIAILSSAL FT TKMKEDSGGIITIIDISKAFDTVPHGEISQSLMNKGVPSPICEYIQKMYIG FT CKTIIYCRDKKTLPVDILRGVKQGDPLSPLLFNLIIDPIIGTLDETTEGIK FT LENENISVLAFADDLVLLAKDKETADKQNRLINEYLDDLKMKVSAEKCTTF FT EIKRQNKTWFLGDPQLTLGQQRIPYADPEAAIKYLGTNFNPWRGLCKTSIK FT EIIDAARTVKQLKLKPHQKINLIRTYLLPRYIHKLVANPPPLGTLDLIDKE FT LKTIIKEILHLHPSTTDGLIYTDKSHGGLGIQRVANIVKLAKLKHSILMTR FT SEDNAVKIALNGQEGMVKRYATSIGLQWPCGIEEIEETRKKLKRADTNKWK FT TLISQGQGIKEFFGDKTGNAWLYNPEMLRPSRYLDALKLRTNTYGTKAALH FT RAKRDIDINCRRCGVQVETLGHILGLCTHTKNKRIKRHDEICDLIAKNVSK FT EYVIFREPEVEVNGDRRKPDMVIKDHDKVYVVDVTVRYENNDSLNKAYKEK FT ENKYKETAEIMRRDLKAKESRVLPVVIGSRGAVPRATIENLKVLGLQTKHA FT LTASLIALRSSIEMANEFLDYDHTT" XX SQ Sequence 5002 BP; 1677 A; 1166 C; 1178 G; 981 T; 0 other; tggtaatcaa atgcctcgct attttagtag cggtagcgct ccgcccgcgc aggaaccatt 60 gacgccgccg tagtgtgggt gattttatat ccaaccaatc acgtcaacta cgatcatttg 120 taatcaccga cggtacttgg taggggtacc acatgggcat tcttgctcat tccacaacgc 180 cgcctccatc atggcaacaa tttaaaatat atataaattc ttaaggtttg accgtattca 240 tatatatata tatatattta atattacaac cataaatctt atatcgagcc ttctatttgg 300 tctcaaaagc aatacgttgt cagatcttgt agaacatcag gagtgagcgg tgcgctgtgg 360 tatccgtgct ttgtgccgcg gcgacaaacc aatacgctgc tgcctgtcgc aaagcaatac 420 gctgctgatt ctcggatgcg ggtgtcgacg gtcacgcaaa gcgatacgct ggtggggttt 480 caaaacaata cggcgctggt gctaaaaagc attatgccgc taacggctgg attgtcgatc 540 gccgctgcgg gggctagtgg cgcacccaga gaggtgcgac gcgcaagcat tggttctgtg 600 cgaagcggag ttcttgagag taatggttgc tgggggcaca aagcgcaaca tatagcctct 660 tatgcctcaa gtcgtagttc gtacctccac gtggtcccgc tggaatgcct atcgactcct 720 ccccggagga tcatagagtt cgaaaccggc tacggcgagg caagggcggt gaggtgcaca 780 ccgatgggga gcagcgaccc cacctaccct tagctaagag agcaggcgat ccgccaactg 840 tcagcacgaa ataaactaat catatgtata cgagggagaa tttacaacgg gtaccttgtg 900 cccgaaccgc ctgtaggtat cacctacagg tgttaaaatg aatctgatag ctggcggatc 960 gtcgaccctc tttgatggct ctgcgccaac gactggaaag aataggaacg gaagtctaat 1020 ggaaggaaag tgtcgggagc actataaatt cccaaagaag aaaagaaaag aaaaaaaata 1080 aaaaacccaa attaaatgtc gagtaacgaa gaaggagcct cggacacggg ggcccccggc 1140 cctggggtcc ccgtggcgga cgtgtccgct gcggatggaa gggccactta tgacgaccat 1200 ggcatgtcca cggactacga gaaacaaacc atcgaactgc ccctgaacgg gcagatacaa 1260 tgcttatggt gccacattga aggaaggaac caaagattcc tccaggaaag ccaatactta 1320 aaacacaaag acacgcaaca ccctaaaggc gaaataatat ggaggtgcgc tgcatgccag 1380 aaagaattcg agaaactcca tggctgcagg tgtcacttac ccaagtgtaa gggacgcaaa 1440 gaagccaagg gtgttgccaa atttaaatgc gactcctgcg aggaatcgtt cctaacgcaa 1500 agaggattgt ctatgcacga gctacataga catccagcga ttaggaactt gaaaagaaca 1560 caaggcacca gtcggggaaa taccagacca atcaacagag cctcggtctg gtccaaagag 1620 gaaacggacc tcttgataaa acttaacgag cgctacaaac acttaaaaca gccgaacgta 1680 gcgctcaaag aatatttccc cgacaagaca ctaaaacaaa tcagcgacaa aagaaggctc 1740 ttgcccgttc aggaacccga agacgtggcc acaactgatg aaacgggacc tcctccttcc 1800 gactcatcgg aagagagcat atacgaatcg gccacggagg acgaaggagg aggagatatg 1860 caacaaacgg ctccaaacga tagctggaag gagccgttta tacaaagtat aagaacaaac 1920 cacctcgaag aggaagactc ccttcgaaag gtggaagaag ccatcgaaag aatggctatg 1980 aatgaagggg taactgaaca agaggtgggc acccttcttg aacaatttgt cgactcccta 2040 actcaatccc caacaacgga aagaaagggg agccgacgta agagtcaaaa gactacaaaa 2100 agaaagacca cccataacaa tagaaaaaag ttcttatatg ccaaacacca ggagctctat 2160 aaaaagagcc cacgaaggct tctggagttg gcgttatcgg gtgagtctag cagtggcaga 2220 gaagtggtta atctccctga ggccgactca gtgggtccac tatataaaag tctatggggc 2280 caaataggcc cggaaaaaac tcacagaaac caacctatgt gcaacaatat cgatatgagc 2340 gaaatttgga ctccaatcgc cctggagagc cttgtcgaaa aattcaaaaa gataaagtcc 2400 gacaccgcag ccggcgcgga ccagataaag aaattccacc tgagaaagaa aggggcacta 2460 cacgtattcg ccaaactgtg taacctcctc atgctgcacc gaatataccc agcacagtgg 2520 aaaaccaacc gaaccacgct tattcccaaa ccggggaaga gcgcggaaga ggttgagaac 2580 tggagaccaa tcaccatcgg gtctctgctg ggaagaatat attcggctat gatcgaccgt 2640 aaattacggt cgaaaataaa gcagcacata agacagaagg ggtttacaca ggaggatggc 2700 tgtaaaaata atatagccat tctcagtagt gccttaacca aaatgaaaga ggactcaggt 2760 ggaatcataa ccataataga catttccaaa gccttcgaca cggttcccca cggcgaaata 2820 agccaaagtc tgatgaacaa aggagtccca tcgcccatat gcgaatacat tcaaaaaatg 2880 tacataggtt gtaaaactat tatatattgc agagacaaga aaacactgcc agtggacata 2940 ctgagaggag tcaaacaggg agacccgcta tcgccactgc ttttcaactt gataatagat 3000 cccataatag ggacactgga cgagacgacg gagggcatta aattagaaaa cgagaacatt 3060 tcagttctcg ccttcgccga cgaccttgtc cttttggcga aagacaaaga aacagccgat 3120 aagcaaaatc ggctcatcaa tgaatatctg gacgacctga aaatgaaagt atccgccgaa 3180 aaatgcacaa ccttcgaaat caaacggcag aacaaaacgt ggttcctagg agacccacag 3240 ttgacgttgg gtcagcaacg tatcccgtat gccgacccag aagcagcaat caaataccta 3300 ggaaccaact tcaatccatg gagagggttg tgcaaaacct cgataaaaga aatcatcgat 3360 gcggctagaa ctgtcaaaca gctgaaactt aagccgcatc aaaaaatcaa ccttataaga 3420 acctacctct tgccaagata catacataaa ttggtggcaa atcctccccc tctggggact 3480 ctagacctaa tcgataaaga gctcaaaact ataataaagg aaatattgca cctccatccg 3540 tccaccacgg acggactaat atacactgat aagagccatg gcggtctagg gatccagcgg 3600 gtggcgaaca tagtcaagct ggccaaacta aaacatagta tactaatgac aaggtcagag 3660 gataatgccg tcaagatagc acttaacggg caagagggaa tggtgaaaag atacgccacg 3720 tccataggcc tacaatggcc atgtggaata gaagaaatcg aggaaacgcg taaaaaactc 3780 aagagggcgg atacaaacaa atggaaaact ttaatttcgc aaggacaagg cataaaagaa 3840 tttttcgggg ataaaaccgg gaatgcctgg ttatacaacc ccgaaatgct gagaccgtct 3900 cgatacctgg acgcactaaa actgagaaca aatacatatg gcacaaaagc agcactccac 3960 agagcgaaaa gagacataga cataaactgt cggagatgcg gcgtccaggt ggaaacccta 4020 ggacatatat tgggactatg cacccataca aagaacaaaa gaataaaaag acacgacgaa 4080 atctgcgatc ttatcgcaaa gaatgtctct aaagaatacg tgatatttag ggaaccagaa 4140 gtagaggtaa acggtgacag acgtaaacca gatatggtca taaaagacca tgacaaggta 4200 tacgtcgtgg acgtcaccgt aagatacgaa aacaatgatt ccctaaacaa ggcctacaaa 4260 gaaaaagaaa acaaatacaa agagacagcg gaaattatga gaagagactt gaaagcaaaa 4320 gagagcagag ttctgccagt ggttatcggg agcagagggg cagtgccccg agccactata 4380 gaaaatctaa aagtcctagg gcttcaaaca aaacatgccc tgacggcttc gctcatagcc 4440 ctccgatcgt cgatcgaaat ggcaaatgaa ttcctggact acgatcacac tacgtgatcg 4500 ttaaaagtaa aaatctattt atttattttt attcctatat tataacacat tatttattta 4560 tttacttatt gttttaaaga tgacgaagcc gcaaggccaa tccaaattta acaaaagaac 4620 gagactactg gtcgacatta aaaagacgaa gcagctgcca gctgataaac aacagagccc 4680 gtctcggcct ttacaccgag cggtgcaagt cctgacgtac tattgtacgt ctagggcgcg 4740 gggcagattc taccgtgtag aatctggggc gacgcctccg cgaggcactc cctggacaac 4800 gtacgctaaa gcgtacggct aagtgcgcct cccgaaaggg tccccgttcc taatttttcc 4860 gagcccgcgg gcagatctcg tggcagtgac gctagaaagt taagtccgcg gacatataaa 4920 attacagcct taaataatga accccacgaa ggaggtatcc tcgaaattcc gccacgatcc 4980 ttctgatcgt aggcgcaaaa ca 5002 // ID BEL-3_CQ-I repbase; DNA; INV; 6224 BP. XX AC AAWU01000122; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_CQ_; KW BEL-3_CQ-LTR; BEL-3_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6224 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 159-159 (2011). XX DR GenBank; AAWU01000122; Positions 91472 85249. XX CC Positions [5024-5563] - Integrase core CC 'ATACC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 693..4331 FT /product="BEL-3_CQ-I_2p" FT /translation="MASKLFKFSGNTNGHCQLCNDADEKDKMIGCDDCQHW FT FHKSCANVPKDPPKKQEWFCPRCVTIAEKLRGSVNPIIEMMEAQKEAFRYM FT SNKSTNNQVATLKEMMDAFKASLDVKNEQLENQSAMVVDNPSGEAAILQLN FT KIVTRQALMDLPDFDGSYKVWPRFQSIFETTTKEGGFSDVENLNRLEKHLK FT GQALFHVESLLTDHTNVPLIMKSLKEQFGNVRFLYEGYLKDALSLKNPSVE FT NPQSMIDFVTSIENLVTNIKLLKQPDYLKDQRLIQDLVAKLPFNLRQQWLK FT SLLEKEKAVNLLEANKLPDLESFSIWLKDHKSLATMMMSYKTGEYSQPGNR FT ERFNRREKLNAHLDIKSDKRYKCFSCNETTHRNAECPVFAKMDVAKRIESV FT KQNRLCFACCNFSNHNSRNCQNPRKCNIAGCKINHHPMLHETKRTPTVEKK FT SENTANQVNHHDRNDSSVYYQIMPVVLSNGDKRVRVLALLDLGSSTSLLLD FT DVRKELAITGTKIPLALSWTDGNVQRDENSQEITVDIEGSHGNTYELRKVR FT TIGSMSLPKQSLNKDEMIKRYPYLKDINLKGYGKESPKLLLGLPHSFVIQG FT LESRAGNFNQPIANRTRLGWVLHGKVSGKFPNKNFTVAFHNEIEEDEVSIN FT KLMNDFFSLEGFGVKQSIKPLVSKQIERAEKIMKETLQYENGQYEIGLLWK FT HDDVQFPDSYSAALTRLKGQELKMRRDPTLRTWYNNKIKEYEEKGYIRKLS FT QIETITKSSRTYYIPHFSVINPNKQPPKPRLVFDAAAKTRGESLNSNLLSG FT PDTNANLLGVLMRFREGRYAISGDIQEMFHQVKIRKEDQDAQRFLFRESDS FT SEIDTYVMQVMTFGATCSPACAQFVKNQNALKLHENYPREVDAILNNHYVD FT DYLDSFDLLEEAKEVVNNVIMIHGKAHYNIRNFVSNSNSLLNSIEESKRME FT INAIKLGDDEQLYEKVLGIFWNTETDTINFKVKELQEMENPTKRNILSTIM FT RVYDPLGLISNITIQGKIFMQELWKLGLEWDESIPEDLNESWNHWLTRLLA FT AKNIKFPRCYSTMTNIKRRELHVFADASDKAYAAVVYLRTVSQNSDDPKSE FT QIEINLVAAKARVAPNKMLSIPRLELLAAVLGTRIAENIMNELRLKIDRTI FT YWSDSKTVLAWIQSDARSYNVFVGLRISEILDSTTVSQWRWVNKTIQQMKQ FT RRL" FT CDS 4283..5983 FT /product="BEL-3_CQ-I_1p" FT /translation="MEMGKQNNPADEATKVVTKESIWMHGPNFLKYQEENW FT PVKGKRFQTNEELKPVHIIRECKIPNYSYIQIEWCSDWSRLKRAIGGIMKM FT IVSKAKNMAYENTLLKEDFCEAEIVLIKKAQWDKFPSEMKSLTFELPIEKN FT SSLRGLTPFLDEKGVMRSRGRLENADVLNYAAKFPIILPKDHHVSKLILKY FT YHEKFYHLKLDASIASVRQKYWIIDIRATMKKVKNACQLCKNRSAQPQPPI FT MAALPSYRTIPFVKPFTYAGVDYFGPVNVSIGRRVEKRWGVVFTCLITRAV FT YLDLSRNLSTDAFFICLKNVQSRRGRIMQLYSDNGTNFIGANNEIKRIRQR FT LASEGIEWFFNPPSAPHFGGVWERMVKEVKSILASVQETMPEDVLRGLLTE FT IEYIINSRPLTHIPLESEDDEVLTPYHFLINCSGDMEPMLNDVSKGEALRE FT QWKRSSQLANNYWNRWIKEYLPSLTKRPKWNQEMKPIEVGDIAITPDEENK FT GKWIKCLVTDTRPGKDGIVRSVRIKTASGKYLTRPVVKLAVLDVKRKQRNA FT SDDDGNKKIEMKDDKQNENL" XX SQ Sequence 6224 BP; 2275 A; 1066 C; 1291 G; 1592 T; 0 other; ttttatggtg gctccagaga ggaaaagtac aaaactggtg atggttcaaa agtaaggacc 60 aagtgtttgt ggaatttgcc gagttttcac aagatagtga ctcatgtgta aaaagtgaga 120 agtagttcac tttccttcgg ccgcgaccaa cgtctaggtt gactcgtgtt catcaaaatc 180 cgatcaagtg ccacgtgccg atcggtcgcc atcaacatac ggttggaaaa gtgcgttcgt 240 caaaatccgg acaaatctgt tcggtcacac ccgaattgcg gataagtgtg cctggtccgg 300 acatcgaata agtccggtta gatatcaata atcagcaact gtcgtgttca ctggaagagc 360 acgttacaac gaagaaatcc ggtcaattag agattgcccg gtcgcattcg acatcaggtc 420 gagcctacat gcgccaagga gtttcatcag tgaatcatca atcgaagcag atcaatctga 480 aggaaaatct atcgtcgtgt ggacaacgag acatcgaagc atcacatcga gcatagcaga 540 accagcaacg caacgagcat cgcaacgagc agcgcagcat cacgatcaca acgagctaca 600 gcagaaaagt agtaagtcta caacattagc ccgtaagaag tgatctgtta ggaataattc 660 gcaaactaca aaaaagagtg taaagaagca atatggcttc aaaattgttc aagtttagtg 720 gcaatacgaa tggtcactgt cagctttgta atgatgctga tgaaaaggat aagatgatag 780 gttgtgacga ttgccaacat tggtttcaca aaagttgtgc aaatgttcca aaagatccac 840 ccaagaaaca agaatggttt tgtccaagat gtgtgaccat cgcggaaaaa ctcagaggaa 900 gtgtaaatcc aattatcgaa atgatggaag cacaaaagga agcattccgt tacatgtcca 960 ataagagtac taacaatcag gtagcaacct taaaagaaat gatggatgca tttaaagcat 1020 cattggatgt taaaaatgag caactcgaaa atcaatctgc aatggtagtt gataatccaa 1080 gtggggaggc tgcaatcctc caacttaaca aaattgtgac cagacaagcc ctcatggact 1140 tgcctgattt cgacgggtct tataaggtat ggcccaggtt tcagagtatc tttgaaacaa 1200 ctacaaaaga aggaggcttc agtgatgtgg aaaatctgaa ccgcctagaa aaacatttaa 1260 agggacaagc cctatttcac gtagagtctc tgttgacaga tcatacaaat gttccactaa 1320 ttatgaaatc cttaaaggaa cagttcggta acgttcgatt cctttatgaa ggatatttaa 1380 aagatgcact cagtcttaaa aacccaagtg tggagaaccc acaatcaatg attgatttcg 1440 taacttcgat cgagaactta gttacaaata tcaaactact gaaacaacct gattacctga 1500 aggatcaaag gttgatacaa gatctggtag caaaactacc attcaatttg cgtcagcaat 1560 ggctgaaatc gttgcttgaa aaagagaaag cagtgaattt gcttgaagca aataaattgc 1620 ctgatttaga atcgttttct atatggctaa aggaccataa gtcattggca accatgatga 1680 tgtcgtataa gacgggagaa tactctcaac cagggaaccg cgaaaggttc aatagaagag 1740 aaaaattgaa tgctcatttg gatataaaat cagacaaaag atacaagtgt ttctcatgca 1800 atgaaacaac tcatagaaat gctgaatgtc cagttttcgc aaaaatggat gtcgctaaaa 1860 gaatagaaag tgtgaagcaa aatagacttt gtttcgcttg ttgtaacttc tcgaaccata 1920 attcgagaaa ctgccagaac cctaggaaat gcaatattgc tgggtgtaag ataaaccacc 1980 acccaatgct tcacgaaacc aaaagaactc ctaccgtgga gaaaaagtct gaaaatacag 2040 caaatcaagt taaccaccat gataggaatg atagtagtgt gtattatcaa atcatgccgg 2100 tggtattatc aaatggagat aaaagagtta gggttttagc tcttttagat ctcggttcat 2160 ctactagttt gttgctagat gatgtaagaa aagaattggc cataacgggc actaagatac 2220 cattagctct atcgtggact gatggaaatg ttcaaagaga tgaaaacagt caagaaataa 2280 cagtggacat tgaaggaagt catggaaata cttatgaatt aagaaaagtg cgtactatcg 2340 gaagtatgtc acttccaaaa cagtccttaa ataaggatga aatgatcaaa cgttatccgt 2400 atttaaaaga cattaacttg aaaggttatg gaaaagagtc tccaaaactt ttattaggat 2460 tgccacactc ctttgttatt caaggattag aatcaagagc gggcaacttc aatcaaccca 2520 tcgcaaatag aaccagatta ggctgggtct tgcatggaaa agtatcgggt aaatttccca 2580 ataagaattt taccgtcgca tttcataacg aaattgaaga agatgaagtg tcaatcaaca 2640 aactaatgaa cgacttcttc tcattggaag gattcggagt aaaacaatcc attaaaccgc 2700 tggtttcgaa acaaattgaa agagcagaaa aaattatgaa ggaaacattg caatatgaaa 2760 atggccaata tgaaattggt ctactatgga agcatgacga cgttcaattt ccagatagtt 2820 attcagcagc attgacaaga ttgaaaggtc aagagctgaa gatgagaaga gatccgacct 2880 taagaacgtg gtataacaat aaaatcaaag aatacgaaga aaaaggatat atcagaaagt 2940 tatcgcaaat cgaaacaata actaaatcat caagaactta ttatatacca cacttttccg 3000 ttattaatcc aaacaaacaa ccgcctaaac ctaggctggt atttgatgca gcagcaaaga 3060 caagaggaga atcattaaat tcaaatctat tatccggtcc tgataccaac gcaaacttac 3120 ttggagtttt gatgagattt agagaaggcc gttatgccat ctctggagac attcaggaaa 3180 tgtttcacca agtaaagatt agaaaagaag atcaagatgc ccagagattt ttattcaggg 3240 agtcggatag ttcagaaata gatacttatg ttatgcaagt aatgacattt ggtgcaactt 3300 gctctcctgc ttgcgctcaa tttgtaaaga atcaaaatgc cttaaaatta catgagaatt 3360 atccgagaga ggttgatgca attttgaaca accattatgt tgatgactac ttagacagct 3420 ttgacttact ggaagaagca aaagaagtag tcaataatgt gataatgatc cacggcaagg 3480 ctcattataa tatcagaaac ttcgtatcga attccaattc attgttaaac agtattgaag 3540 aatcgaaaag aatggaaatt aatgctataa agcttggaga tgatgagcaa ctatatgaga 3600 aggttttagg cattttttgg aatacggaaa ccgatacgat caattttaaa gtaaaagaac 3660 tgcaagaaat ggaaaaccca acgaagagga atattttatc cacgattatg agagtatatg 3720 acccattggg attaatttca aacattacaa ttcaaggtaa aatatttatg caagagttgt 3780 ggaagttggg gttagaatgg gatgaatcaa ttccagaaga tttgaatgag tcctggaacc 3840 attggttaac tagactttta gctgctaaaa acattaaatt cccaagatgt tattcaacca 3900 tgactaacat taaacgacga gagctccatg tatttgcgga tgcctcagat aaagcatacg 3960 ctgcagttgt atatttacgg actgtcagcc aaaattcgga tgatcctaaa tcggaacaaa 4020 ttgaaataaa tctagtagca gcaaaagcac gtgttgcccc caacaagatg ctgtccattc 4080 cacgtcttga attattagca gcagtattag gaactcgaat cgcggaaaat ataatgaatg 4140 aactaagatt gaaaatcgac agaaccatat attggtctga ttcgaagaca gtgctagctt 4200 ggatacagtc tgatgctaga tcatataatg tattcgtagg attacgaatt agtgaaatac 4260 tcgattctac aaccgtatct caatggagat gggtaaacaa aacaatccag cagatgaagc 4320 aacgaaggtt gtaacaaaag aatccatttg gatgcatggt ccaaatttcc taaagtatca 4380 agaagaaaac tggcctgtaa aaggaaaacg ttttcaaaca aatgaagaat tgaagccagt 4440 tcatatcata agggaatgca aaattccaaa ctatagttac attcaaatag aatggtgttc 4500 cgattggagt cgtttgaaac gagccatcgg tggtataatg aaaatgatcg tttcgaaagc 4560 taagaatatg gcctacgaaa atacattgct taaagaagat ttctgcgaag ctgaaatcgt 4620 tctgattaaa aaggctcagt gggataagtt cccgagcgaa atgaagtcat taacatttga 4680 attgccaatc gagaaaaata gttctcttag aggtctcaca ccgttccttg atgaaaaggg 4740 agtgatgagg tcgagaggaa gattggaaaa tgctgacgtg ttaaactacg cagctaagtt 4800 tccaatcata ttaccgaaag atcaccatgt tagtaaactc atactgaaat attaccacga 4860 gaaattctac catttgaaac tggatgcgtc aatagcatca gttagacaaa agtactggat 4920 aattgacatt agagcaacaa tgaagaaagt aaagaacgca tgtcaattat gcaaaaatag 4980 atcggcgcaa ccccagccac ccataatggc tgcgcttccg tcatacagaa caattccatt 5040 tgtcaaaccg tttacttatg ctggagtaga ttactttgga ccagtaaatg tctctatagg 5100 gagacgtgtt gaaaaacgtt ggggagtagt atttacttgc ctaataacga gagcggttta 5160 tttggatctg tctcgtaatc tgagtacaga tgcctttttt atctgtttga aaaatgttca 5220 atcaagacga ggaagaatta tgcaattgta cagtgacaat ggaacaaatt tcattggagc 5280 aaataatgaa atcaaaagaa tccgtcagag attagcaagc gaaggaattg aatggttctt 5340 caatccacct tcagcgcctc attttggagg tgtttgggaa agaatggtaa aagaagtcaa 5400 aagtatttta gcatcggtgc aagaaactat gccagaagat gttctacgag gcttactcac 5460 tgaaattgag tatatcataa atagcagacc gttgactcat attccactgg agtcagaaga 5520 cgacgaagtt ttaacaccat atcactttct gataaattgt tccggtgata tggaaccaat 5580 gttaaatgat gtttcaaaag gtgaagctct acgagagcaa tggaaaaggt ccagccagtt 5640 agcgaataac tactggaacc gatggataaa agagtattta cccagtctta caaagagacc 5700 aaaatggaat caagaaatga aaccaattga agttggagat attgctataa ctcctgatga 5760 agagaacaaa ggaaagtgga tcaaatgttt ggtaaccgac acaagacccg gaaaagacgg 5820 gatagtaaga tcggttagaa tcaaaacagc atcaggaaag tatttgacaa gaccagtcgt 5880 gaaattagcc gttctcgacg tcaagcgtaa acaaaggaat gcatcagatg atgatggcaa 5940 taagaaaatc gaaatgaaag atgataagca aaacgaaaat ttataaatga tccccaagca 6000 atacggggta aggaataata agaaggtatt cagcatagat atagtaaaag caaataaatc 6060 atattttttt tttctatatt aatttcatct ttcaagtaat acataaatgt aaggaaaaca 6120 tattcaacaa tacccataag gataaggaac aatttactca tgtaacaact ccaactaaat 6180 tcatatttgt ggtaaaccta cggatgtaga tttacggggt ccgg 6224 // ID BEL-626_AA-LTR repbase; DNA; INV; 381 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-626_AA_; KW Pao_Bel_Ele210; BEL-626_AA-I; BEL-626_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-381 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 381 BP; 142 A; 59 C; 68 G; 112 T; 0 other; tgttgcgcgc aagcccctcg tacgagaatg aacattgtgg agtttgacag aaaattgtga 60 gtggcagacg tttgacgttc tgatcattaa cagggaaata taaaaacagt tatccagatc 120 gcttaaaatt atcggaaggt tgaaaaggat ttgtaagata aaattattta ctatacctta 180 aatattctaa aacaattgaa cttacaggct aatctgaact taaacctaaa ttggacttaa 240 aaactataat ttgttcacga ttgaattaaa taggagacta aattgtaagt aacaatatat 300 ccatgcaaac cggtgaaata atggattcaa ataaaatttc agcttgagct ttaccctact 360 gcaaaagcga gtttcgcttc a 381 // ID Baggins-3_NVi repbase; DNA; INV; 4961 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 23-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE Non-ltr retrotransposon: consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW Baggins-3_NVi. XX NM Baggins-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4961 RA Bao W. and Jurka J.; RT "LINE retrotransposons from the parasitic wasp Nasonia RT vitripennis."; RL Repbase Reports 9(7), 1414-1414 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(623..985,910..1230,1220..1540,1555..2406, FT 2256..3458,3340..4863) FT /product="Baggins-3_NVi_1p" FT /translation="MNSESITNRKEEKMAPQEXGDPAKGMLPQRPLIPPSQ FT TQGGVVDAPSDIAAEGVVTPPALEPAKKQRKRDCGALRKRKAREHARAAAS FT STEPTTSDTLVTGQKRRQGTNKPRPLFPVTTRKHSGYGPEEATRDEQTPPS FT VPRNNKKVKGPSEGANPNVGGSKQMRAQAGGGVLTDAQRANPLTIVLVLED FT YPDSPITPELFQEFRGAVRKELDLQPQNSTPPSVRRGRGGGVNPRSQQSRG FT EGVDGQQACHSGLTLGRQAQNWGGGAELLQQRIKVVARFPGPAERPATILE FT KLEGYNPSLKTKAWRVVESKVGEEREAGAGDRQLLVLTIPRGXRAPSSLPI FT SQINLHHCKDASDILLRNMTVAHTCIALIQEPWVVKGAIRGLGNLVCFRNP FT KEQRPRACIVAKGINALPLLNLTTGDLMAVQTELEDVERIVIASAYLPFDN FT TDNFPKAVQMPGVSNCSTTVGDTPTFRNAIREEVIDLTLSSTKIAHLISEW FT RVSDEPSLSDHSLITYLVGNPSEEQPKWVRNPRKTNWEGYKKDVEEAMREI FT PLDIQDSTSRRDIYRKSAKEANRRTAGETSARKSKREKRQPGCTRSWRGIP FT RLSWARLSSRMNITPLQRRRRSXNSWRNFCEKIQKGKETARMYKILARNPE FT AVLGTLKLPNEHYATTKEETLNLLVQTHFPSFVGAGTGDLSGGGSGQGRRP FT TYAELSENRLLAGRIVTPERVMWAIRSFEPFKSPGPDSIFPALLQKAEDVI FT IGPLTRMARTSITLGLVPTGWQGTKVVFIPKPGRNGYTSPKDFRPISLTSF FT VLKTVERMVDRYIREKVLGDKGLHKEQHAYRAGRSTETALFRAVSTINKQL FT EAKGYAIGALLDIEGAFNHTSREVIKRAMNRLVIPSTIAGWIDHMLGNRNL FT EASKGSTTLKGTVGSGCPQGGVLSPLIWCLVVDELLTELNNTGCTAIGYAD FT DILIIARGPFLDLLMGVMQGTLTLVNKWCNKAGLSVNXXXDAGLHQKIQME FT KGMQLDPWGAETASTSGATRPGYLSIXXKMLVCTRRYKWKRECNLTLGGQK FT LPLKSQVKYLGVILDSKLSWKKHLEYVIGKVTANLWQIRRVIGRNWGLSPE FT ALRWMYEAVLKPRLTYGSAVWWTRTALSTSNILLEQLRSQIVRSISGAIRS FT TPTAALGMLLDVEPLRVTLAAAAAQALHRITNSLGEGNMQELPQLPKGLDE FT KGTLRMNPDAMNKAFLFDKKYRVSIPSRGEWDDERINIPNDGDLWYTDGSK FT RKDAAGAGVYQRQSGKGTILPLGRHATVLQAELTAIMWAAQAALAESTRPR FT IYICSDSKSALQALNNFTTKSRLVWDCFQTLNRLALKARVTLLWVPGHYGV FT KGNMIADDLAKLGAKSSLIGPEPAVGLCQSLIKGEIRQWAENETNQIWSQI FT ATCGKTKAFVTTGNAKNWSRVILGLKRAKTKLLVEILTGHGSLNAHRHAMG FT LSRDLFCRFCCTEEDTPEHLLCNCPAIAEKRWRHLGAPLLETEEVQAKHAP FT GIFSFWETIGA*" XX SQ Sequence 4961 BP; 1336 A; 1268 C; 1440 G; 911 T; 6 other; ggtattgttc actattatct acggtcggct ccctccaccg gggtcgccac cttgtcgagt 60 gcccgctttt tagggcctgg tgtgggggct cgtccggcga gtcgtcgcgg ttgtctcgac 120 tccggtgtcg ggagcgcgac gccccgaact cgggtcggtg gggtgacggg ggaccgagac 180 caatcggcca accttctccc cggcgcgtaa ctgaggtgcg tcctggcgtg gcggcgaaat 240 tgggctgatg cgggattccg cctctcgggc tcaagagtag cctccccggc aggatccccg 300 taaaaagggc actgctgtcg ggcctctccc ctgaccgggg tcggacctcg gggttccccg 360 agggcgacct gtggactggc cggaggctaa gggcagcccc ccattgcgtt cgggaggagg 420 cgcaatgggt attttatcca acccctaatc ccaaggtgtg agccatcgtg ctgaggggac 480 tagttgaccg ctggggtact agtcggtcac aaacgatgcc tcggagaaac cgggcgaaat 540 ctccgagtat ctagccttac cgtcgcatgc ggggctctgt gacggcggac cgtttatttc 600 cctagctact cgtgggatca aaatgaattc tgaatcaatt acaaatagaa aggaggagaa 660 gatggctcct caggaggycg gcgaccctgc aaagggaatg ctgccccaga ggccgttgat 720 accaccttcg cagacccagg gaggggtggt ggacgcacct tcggacattg cggccgaagg 780 ggttgtgacc ccacctgccc tggaacctgc taaaaagcag agaaagagag actgcggcgc 840 tttaaggaag aggaaggccc gcgagcatgc tcgcgcggct gcttcatcca ctgagcccac 900 gacctctgac actctggtta cgggccagaa gaggcgacaa gggacgaaca aaccccgccc 960 tctgttcccc gtaacaacaa gaaagtaaaa ggaccgtccg agggagcaaa ccccaacgtc 1020 ggtgggtcta agcaaatgag ggcgcaagct ggagggggcg ttctcacgga cgcccaaaga 1080 gcgaacccgc tcactatcgt gttggtgctg gaggactacc ccgatagtcc gatcaccccg 1140 gagttgttcc aagagttccg gggtgcggtc aggaaggagt tggacctcca gccacaaaat 1200 tcgacgcctc cttctgtaag gagggggcgt taatcctcgt agccagcaat ccagaggcga 1260 aggagtggat ggccaacaag cttgccactc tggacttacc ctggggcgcc aggctcaaaa 1320 ttgggggggg ggggcagagc tactccaaca aaggatcaag gtcgtcgcca ggtttcctgg 1380 tccggcagag aggccggcga cgattctgga gaagttggag ggctataacc cctccctaaa 1440 gacgaaagcg tggagggtcg tcgaatccaa ggtcggggag gagagggagg ccggcgcggg 1500 agacagacag ctcctggtgc tgaccatccc ccgagggaak tgatggaaac ataaagggct 1560 ccatcatccc tccctatctc tcagattaac ttgcatcact gtaaagacgc ctcagatatc 1620 ctacttagga atatgacagt ggcgcacaca tgcatagcac tcatccagga accttgggta 1680 gtgaagggtg ccataagggg gctaggtaat ttagtctgtt tcaggaaccc gaaggagcaa 1740 agaccaaggg cctgcatagt agccaagggc atcaacgcgc tgccactgct gaaccttacc 1800 acaggggacc tcatggcagt tcaaacagag ttggaggacg tagaacggat cgtcatagct 1860 tccgcctacc taccgttcga taatacagac aactttccaa aggcggttca aatgccaggg 1920 gtgtcaaact gctcgactac cgttggtgac acgcctacgt tcaggaacgc aatcagggag 1980 gaggtgatag acctaactct gtcgtccacc aagatagcgc atctgatcag cgaatggcga 2040 gtgagtgacg aaccatcgtt atcggaccac tctctgatca catacttggt cggtaaccct 2100 agcgaggaac aacccaaatg ggtaaggaac cccaggaaaa cgaactggga aggatataag 2160 aaggacgtgg aagaagctat gcgcgaaata ccactcgaca tccaggatag cacatcacgg 2220 agagatatat accggaaaag cgccaaagaa gctaatcgtm gaacagctgg agaaacttct 2280 gcgagaaaat ccaaaaggga aaagagacag ccaggatgta caagatcctg gcgaggaatc 2340 ccgaggctgt cctgggcacg cttaagctcc cgaatgaaca ttacgccact acaaaggagg 2400 agacgctaaa tctcctcgta caaacgcatt ttcccagctt cgtgggggcg gggacgggcg 2460 acctgtcggg cgggggatct ggtcagggcc gcagaccgac gtacgccgag ttgagcgaaa 2520 atcgactgct ggccggacgg atcgtcacgc ctgagagggt gatgtgggct ataagatctt 2580 tcgagccatt taagtcaccg ggacctgata gtatttttcc ggccctgctt caaaaagcgg 2640 aggacgttat cattggccca ctgacacgaa tggcgagaac aagcatcacg ctcggcttgg 2700 tccccacggg gtggcaggga accaaggtag tattcatccc gaaaccaggg agaaatggat 2760 acacatcccc gaaagacttt cggccaatca gcctgacctc ctttgtactt aagaccgttg 2820 aacgaatggt cgatagatac attcgagaga aggttttggg cgacaagggg ctacacaaag 2880 agcaacacgc ctacagagca gggagatcca cggagacagc actctttagg gcagtatcca 2940 cgatcaataa acagctggag gccaaaggat atgccatagg cgcactattg gatatcgaag 3000 gggcattcaa ccacacatcg cgagaggtga tcaaaagggc catgaacagg ctcgtgatcc 3060 cctctacaat agcgggatgg attgaccaca tgttgggaaa tcgcaattta gaagcgagca 3120 agggcagcac gacacttaaa ggtacggtgg ggtctggatg ccctcaaggc ggcgtccttt 3180 caccactaat atggtgcctg gtggtggatg agctgctgac cgaattgaac aacacaggct 3240 gcacggcaat aggatacgca gacgacatcc taatcattgc acgcggccct ttccttgacc 3300 tcctgatggg agtcatgcaa ggtaccctaa cgttagtaaa caagtggtgc aacaaggccg 3360 ggttatctgt caatcyagsc waagatgctg gtctgcacca gaagatacaa atggaaaagg 3420 gaatgcaact tgacccttgg ggggcagaaa ctgcctctta agagtcaagt caaataccta 3480 ggagtcatcc tagacagtaa gctgtcctgg aagaaacacc tggaatatgt gataggcaag 3540 gtcacggcca acctgtggca aattagaagg gtcataggga gaaactgggg cctgtcacca 3600 gaagccttga gatggatgta tgaggctgta ctcaaaccca gactgacgta cggttcggca 3660 gtgtggtgga cgagaacggc cctctccacc tcaaacatcc tcctcgaaca actaaggagc 3720 caaattgtga ggagcatctc aggtgctatc agatcaacac caacagcggc actgggaatg 3780 ctgcttgatg tagaaccact aagggtgaca ctggctgcag cggctgcgca agccctgcac 3840 aggatcacca actcgcttgg ggaaggcaat atgcaggagc taccacagct cccaaaagga 3900 ttggacgaaa agggaactct gcggatgaac ccggacgcca tgaacaaagc cttcctgttt 3960 gacaagaaat atagagtaag catcccatcc agaggagaat gggatgacga aagaataaac 4020 atcccgaatg atggagacct gtggtataca gatggatcca aaaggaagga tgctgcagga 4080 gcgggggtgt accaacgaca gagcggcaaa ggaacaatct tgccgctagg tcgacacgcc 4140 accgtactac aagcggagct gacagcgatc atgtgggcgg cgcaagcggc actagctgag 4200 agcacgcgcc cgcggatcta tatctgctcc gacagcaaaa gtgctctcca agcactaaac 4260 aactttacca ccaaatctag gctggtatgg gactgttttc agaccctaaa cagactagca 4320 ctcaaggcca gggtcacact actatgggtc ccggggcact acggtgtcaa gggaaacatg 4380 atagcagatg acctggcaaa gttaggcgca aagagcagcc taatagggcc agagccggct 4440 gtcggactct gccaaagtct gatcaagggg gagatcaggc aatgggcgga gaatgagaca 4500 aaccaaatct ggagtcaaat cgccacgtgt ggcaaaacca aagcctttgt gacgactggc 4560 aatgcgaaaa actggtccag agtaatcctt ggacttaaaa gagcaaaaac caagctgctt 4620 gtggagatcc ttacaggaca cggcagcctg aatgctcata gacatgcaat gggactaagt 4680 agggacctat tctgtagatt ctgctgcacg gaggaggaca cccccgagca ccttctgtgt 4740 aactgccccg ctatagcgga aaagaggtgg cgccacctag gtgcccctct cttagaaacg 4800 gaagaggttc aggccaagca cgcccccgga attttttcat tctgggagac gattggcgct 4860 tgaaatagcg actgaactaa tcttgaggga cgccaaggga ttacccatgc gtccagtggt 4920 ctgaataagc ccgactcgta tacaataata ataataataa t 4961 // ID Kiri-8_AAe repbase; DNA; INV; 4584 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4584 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 703-703 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >96% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 311..1108 FT /product="Kiri-8_AAe_1p" FT /translation="MVKGKTNLSVPLGRNNDGNNVKRSRDDLDKSEGDEIE FT NLNDLFVRMQNMFQTTNAKIDVCKTDLQTEISTLREDVHQFKEECTSNINK FT LSDSLTEMRTSVHQNKERISAVEKSNDLILSGVPYVANEDVGQIVQRIAIA FT LGFSEQNTPLMFSKRLAKIPIANGATPPIVIQFAFKLARDDFYRRYFTVRN FT LSLIHIGFNVDKRIFLNENLTDQTRRIKGKAINLRRTGKLYAVFTKDGCVF FT IKPTPEDAAILIRSLDELDAYNPSA" FT CDS 1594..4437 FT /product="Kiri-8_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPGENFSNASTNVCIPRVVMNCALDSDYLNICHANIQ FT SICARELSKFEEFKLCFHNSKVDVICLSETWLTDNIPDSLIEIEGYNIVRN FT DRIHSRGGGVCIYYKSFLKCKLLSVSEAMVGNGHMSRTEFMFLEIMHNYDK FT FLLCVCYNPPGTDCSELLFDKLGDLSLQYRNVLLLGDFNTNWNKTDSKTER FT FKSCLDNYGLKCLNSMNTHFHHNGSSLLDLVLTNNPEFVLNLNQVSAPAFS FT KHDVLFCSINISKSNEETTRTYRDYSSINVPHLINAFNGINWNLLYSMSDS FT DLALNFFNLQLLRLFNEFVPVRTTKPKKNPWFNQEIANAIVERNIAYRLWI FT ANRSTFNHNQFKRLRNRTTTLINNAKANYVSDNLSASSTSKDLWKRIKNLN FT ISKTSERNDKFDNSIDEINDYFSSNFTFDEDLPSVPPENVYGFRFSEISEN FT DIIISINSIKSNAVGLDEIPLRFIKLLFPSICPIIHYIFNLIVSSSKFPQA FT WKHSKIIPIKKKAKTLNLENLRPISILCGLSKVFERILKYQIQEHIENFDL FT MHTFQSGFRRGHSTTSAFLKVHNDILSVIDRKGVAFLFLLDFSKAFDRVSH FT TKLLKKLSYDFLFSRPAVNLMQSYLSNRSQSVFAGGAFSKTCSILSGVPQG FT SILGPLLFSCFINDLPNVLKHCKIHMFADDVQLYLALEEVDLSLMYQLIND FT DLSRIIRWADENLLQINASKTKVMFISRSTHNYSLPSFRLGNDLLEYVNKA FT SNLGFIIQNNLEWDSHVNNMCSKIYSGLRLLRISSNMLPTSVKLQLFKSLL FT LPHFMYGDVLLINASARAIDRLRIALNCCVRYVYNLSRFSHVSRFHPKLIG FT CPFYDFFKMRSCLTLFKIIRFATPSYLYENLNPFQSYRTRNFIIPRYNTAH FT FANTLFVRGIVFWNQLPNETKNLNTFKSFQQECVKWFNRGNQQN" XX SQ Sequence 4584 BP; 1414 A; 853 C; 806 G; 1511 T; 0 other; gtttctgaag ggatgagcga gagtagtgtg tagtgaacct gtggctgtga ttctacggaa 60 gatttctgct gtttttccaa ctctataaaa gctttctgtg taactggcct catcttctag 120 tattgcggat gcctggtgta taaaatatta gtcgccaaac tactagttct cccttccaag 180 acattttttg tgaccgcatc tccgtcactt tgatacggtt gtttgtgaag ttgatacaac 240 gcctcaaaca ttcaagaact cactacgcta ttgtttcaca tctgttgctg tttgagtggt 300 acccaacaca atggtaaaag gcaaaaccaa cttatctgtg ccgctgggaa gaaataatga 360 tggtaacaat gtcaagcgct ccagggacga tctcgataaa tcagaaggag atgaaatcga 420 gaatttgaat gatctcttcg ttcggatgca aaatatgttc cagaccacca acgccaaaat 480 agatgtgtgt aagactgatc tgcaaacgga gatatcgacg ctgagagaag atgttcacca 540 gttcaaagag gaatgtacgt ctaatatcaa caaactatcc gattcgttga ccgaaatgcg 600 cacctcagtt catcaaaaca aggaacgaat cagtgctgtc gagaaatcga acgatttgat 660 attatctggt gttccatacg ttgctaacga agatgtgggc caaatcgttc agaggatcgc 720 gattgctcta ggtttctcgg aacagaacac accactgatg ttctcaaagc gtttagctaa 780 aatccctatt gcaaacggtg ccactccacc aatagtaata cagtttgcgt tcaaactagc 840 tagggacgac ttttaccgac gatacttcac tgtacgtaat ctgtcgttga ttcacatcgg 900 gttcaacgta gataaacgca tcttcctgaa tgaaaatttg acggaccaaa cacgtcgtat 960 caaaggaaaa gcaatcaact tgagacgcac aggcaaattg tacgctgttt tcaccaagga 1020 cggttgtgta ttcatcaagc ctactccgga ggatgccgct attctgatcc gttcgttgga 1080 tgagttagat gcttataacc cttccgctta gtttttctta tcctccccat tcatccatgt 1140 attctatcct ttattccatg actccatcca tgcctgaaag tcaaatttga aacaaccttt 1200 ccttttaagc ttttgtataa tcctgcctag ctagtccgtg gctcccatcc ttgagttttc 1260 cttccaatct tctttcctaa aagtcttgac ctgctgttgg aaatggggga cctgctcggg 1320 atgaatgctg ctgctgttgt tgaagatact gttgctgctg ttgctgttag atatcgttat 1380 tgattgctgc ttggatgctt ttctatcgcc attaagaaat taatttgatg ctgatcttcg 1440 gtagattcac tatgaatgtg ttgaatatga tgattgaact attgaattat tgtaaaatga 1500 atgattactc attgctcttt ttgattgttg attcctcagg gttgggttac acgacgtgtt 1560 tatgctggtt tgatcgattg ttactttact acgatgccgg gtgaaaattt cagtaatgcc 1620 tctacaaatg tgtgtattcc acgagttgtg atgaattgcg cgctggattc agactacctc 1680 aatatttgtc atgcaaacat tcaaagtatt tgtgcacgtg aattaagtaa gtttgaagaa 1740 ttcaaattgt gttttcataa tagtaaagtt gatgtaatct gcttaagtga aacctggctc 1800 acggataaca ttcctgattc actcattgaa atagaagggt ataacattgt tagaaatgat 1860 cgaattcaca gtcgtggtgg gggtgtttgt atatattata aaagtttttt aaaatgtaaa 1920 cttctttctg tttcagaagc tatggttgga aatggtcaca tgagccgcac tgaatttatg 1980 tttttagaga ttatgcataa ttatgataaa tttttgctat gcgtatgcta caatcctcca 2040 ggaacagatt gttcggaact cctattcgat aaattgggtg atctttctct gcagtaccgc 2100 aatgttttat tattaggtga ttttaatact aactggaaca aaacggatag taaaacggaa 2160 cgatttaaaa gttgcttaga caattacggc ttaaaatgct taaattcaat gaacacccat 2220 ttccatcata atggtagttc tttattagat ctcgttttga ccaataatcc agagtttgta 2280 ttgaacttaa atcaagtttc ggcacctgct ttttcgaaac acgacgtact tttttgctca 2340 atcaatataa gtaaatctaa tgaagaaact actcgtacat atcgtgatta tagtagcatt 2400 aacgtgcctc acctaataaa tgcgtttaat ggcattaatt ggaaccttct ttatagtatg 2460 agtgatagtg atttagcctt aaacttcttc aacctacaac tactaagatt gtttaatgaa 2520 tttgtacctg ttcgtacaac aaagcctaag aaaaatccgt ggttcaatca agaaattgca 2580 aatgcaatag tcgaacgcaa tatcgcttat cgtctttgga ttgctaatag atctactttc 2640 aatcataacc aatttaaacg tcttcgtaac agaactacta ctttaattaa taatgctaag 2700 gcaaattatg tgtcagataa cctttcagct tctagtacaa gcaaagattt gtggaagaga 2760 attaaaaatt taaatatttc caagacttct gaacgtaatg ataaattcga taacagcatt 2820 gacgaaatca acgattattt cagctcaaac ttcaccttcg atgaggattt accatctgtg 2880 ccacctgaaa atgtatatgg tttcagattt tccgaaattt ctgagaatga tattataatt 2940 tctatcaatt ctataaaatc taacgcagta ggcttggacg aaattcccct acgttttatt 3000 aaacttttgt ttccttccat ttgtccaatc atacattata ttttcaactt aattgtatct 3060 tcgtcaaaat ttcctcaagc atggaaacat tcaaaaataa ttcctataaa aaagaaggca 3120 aaaacgttaa acttagaaaa tctccgtcca attagcattt tatgtggttt atcaaaagtc 3180 tttgagcgga ttttgaaata tcaaattcaa gagcatatcg aaaactttga cctcatgcat 3240 actttccagt cagggtttcg acgtggtcac agtacaacat ccgcttttct aaaagtacac 3300 aatgacattc tttcagttat cgatagaaaa ggtgtagcat ttcttttcct tttagacttt 3360 tctaaagcat tcgatagggt ttcacatacc aagcttttaa aaaaactatc ctatgatttt 3420 ttattctcta gaccagcagt aaatcttatg caatcatatt tatcaaatcg aagtcaatca 3480 gtttttgctg gtggagcctt ttccaagacc tgttctattt tgtctggagt accgcaaggc 3540 tcaatccttg ggcctctttt attttcttgc ttcatcaatg atttacctaa cgttctaaaa 3600 cattgtaaga tccacatgtt cgccgatgat gtgcaactct atctggctct agaagaagtt 3660 gatttgtcac ttatgtatca gctaattaat gatgacttgt ctcgcatcat acgttgggcg 3720 gatgaaaatc ttttgcaaat caacgcttct aaaacaaaag tgatgtttat ttcaagaagc 3780 actcataact actccttgcc atcgtttcgt ctaggcaatg atcttttgga gtacgttaat 3840 aaggcatcaa atctaggttt cataattcaa aataatcttg aatgggacag tcacgtcaat 3900 aatatgtgca gcaaaattta tagcggattg agattgttga gaatttcatc aaatatgtta 3960 ccaacttcag taaaattaca acttttcaaa tcattgctgc ttccgcattt catgtacggt 4020 gatgtattac ttataaatgc ttctgccaga gctatagaca gactcagaat agcattaaat 4080 tgttgtgtac gttatgttta taatttgtca agattttctc atgtgtcacg ttttcatccc 4140 aagttgattg gatgtccatt ttatgatttt ttcaaaatgc gatcatgtct aactttgttc 4200 aagattattc gatttgcaac accttcatac ttgtacgaaa acttaaaccc tttccaaagc 4260 tatcgcacta gaaactttat aataccgcga tataatacag cacactttgc gaatacctta 4320 tttgtaagag gcatagtatt ctggaatcaa ttaccaaatg aaaccaaaaa cttaaatact 4380 ttcaaaagct tccagcaaga atgcgtcaaa tggttcaaca gggggaatca acaaaattaa 4440 taactacgga gtatcagaaa cataatatta agttaaatgt attccttcaa ataatagcta 4500 attgtagcgc tttaaaagga attttcctta tgctacattg atatgaacaa gtaaataaat 4560 aaataaataa ataaataaat aaat 4584 // ID P-6_HM repbase; DNA; INV; 2999 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2999 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 352-352 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(176..769,733..1086,1031..2644) FT /product="P-6_HM_1p" FT /translation="MPLKCCVPNCSSNYKSSACNVSVYKFPREVCEKTKWF FT MSIPRLNWTVTKYTVVCKLHWPDDAEFKLCRGKLRPIHPPSVFPNISKSCL FT SSPPPKTRKTLSSSSKRGIRKDELEEYKIIDNLNFEEIEKHLSTYNDLISY FT KQNNSIFIQSKIFNCGVPLFLLIINQSLLFEAFHFGSKCTIKSLANNRITS FT LCTIQSHLQNHFPLYNTKSSLNEALRYLKVEDKDHKTQVMFEHLNVMGDHT FT VGCAVYSPDIITRAFEYFATSRSTYHRLCYDYQLPSIRTLTRITSKINSQD FT DLTFLSSVLSNLDEKKKKLCFIDRVLCYRTLMKKKRNCVLLIDEVYVKAAL FT LYQSGSIFGKAVNNPEKLATTVLSFMVKSLYGGPEFLARVLPVSNLTSDFQ FT QEQCKLIVDTIESSMSNGKVIAIITDGNKVNQSFFSKFKTVDGKPWLCQNG FT SYILYDYVHLLKCIRNNWLTEKLGQLQFTQNEITYIANWRDLVNLYTLEKN FT QLIKLSRLTETSINPKPIERQKVSTCLQVFSEETVAALKTHPKIDQESVKG FT TIIFLELIIEFWKIVNVKCPGANARFRDESRNVIRSSDDINLKKLLAIATM FT AEYMKPTSGKRIRQLTRDTSNALSHTCRGLIDLSCHLLSSGNEYVILGWFS FT SDPIEKCFGKLRQGSGGTYFITAKSVIEKVRIQHAKLVLQLNIEVEGNDGH FT DCLLCLKDLDDMEIDIIYNLPDLEDSINIEIFYAIVYIAGYVQKCSTEEIG FT EDTTTYYEKYGTYLNTLNRGGLTIPSDTLSQWSLFCYIFFSLLDKKVCRTF FT LIRQFAFIAELYSFSIAKKQCQTLANIFLKNFALLHTPKSNKEAKLKEIKL FT M" XX SQ Sequence 2999 BP; 1063 A; 428 C; 480 G; 1028 T; 0 other; catggcctac tttatagaag gcctcgacac cgtgcgggaa tttttttgac tgaaggccga 60 tttaaaactg ttacaaacgc attttttggt tggaaaagga gtaaaatatg tagttacgtt 120 ttttcagaaa aatttacagc tattcttcaa gtaaatttaa tttaaattat ttagaatgcc 180 tttaaaatgt tgtgttccaa actgttcctc aaattataag tcttcagctt gtaatgtatc 240 tgtgtataaa ttccctcgtg aagtatgtga aaaaactaaa tggtttatga gcatccctcg 300 tttaaattgg accgtaacaa aatacaccgt tgtttgtaaa ttgcattggc cagatgatgc 360 agagtttaaa ttatgtcgtg gtaaattgag accaattcat cctccatcag tgtttccaaa 420 tatatctaaa agttgtctat cctctccacc acctaaaacg agaaagactt tatcttcttc 480 ttcaaaaaga ggtatcagaa aagatgaatt agaagaatat aaaataattg ataatttaaa 540 ttttgaagaa atcgaaaaac acttatcaac ttataatgat ttgatttcat acaagcaaaa 600 taattcaatt ttcatccaat ctaaaatatt taattgtggt gttccactat ttttgctaat 660 aattaatcaa agtttgttgt ttgaagcatt tcattttggt tcaaaatgta ctattaaatc 720 tctggcaaat aacagaatca cttccctttg tacaatacaa agtcatcttt aaatgaagcc 780 cttagatatc tgaaagttga agataaggat cataaaactc aagtaatgtt tgagcattta 840 aatgtaatgg gagatcatac tgttggctgt gctgtgtatt ctccagatat aataaccaga 900 gcttttgaat attttgcaac aagtcgttcc acttaccatc gattatgtta tgattatcag 960 ttgccaagta ttcgaacttt gacacgcata acttcaaaaa taaactctca agatgattta 1020 acttttttaa gttctgtgtt atcgaacctt gatgaaaaaa aaaagaaatt gtgttttatt 1080 gatagatgaa gtttatgtaa aagcagcatt attatatcaa agtggaagta tttttggaaa 1140 agctgttaat aatccagaga aattagctac tactgtactt tcattcatgg taaaaagcct 1200 ttatggtgga cctgaatttc tagcaagagt tttaccagtt agtaatctga catcagattt 1260 ccaacaagaa cagtgcaaat taattgtaga caccatagaa agttctatga gtaatggtaa 1320 agttattgca attattacag atggaaataa agttaatcaa agcttttttt ctaaattcaa 1380 aaccgtagac ggaaaacctt ggttatgtca aaatggctct tatattttat atgactatgt 1440 ccatctttta aagtgcataa gaaacaattg gctgacagaa aaactaggtc agttacagtt 1500 tactcaaaat gagataactt acattgcaaa ttggagagat ttagttaact tatatacact 1560 tgaaaaaaac caattaataa agttatccag attgactgaa acatcaataa atcctaaacc 1620 aatagaacgg caaaaagtaa gcacttgttt acaagtattt agtgaagaga cagttgcagc 1680 tctaaaaact catcctaaaa tagaccaaga aagtgtcaag ggtactatta tttttttgga 1740 gttgattata gaattttgga aaatagtgaa tgttaaatgc ccaggtgcca atgctagatt 1800 tagggatgag tcacgtaatg taatcagatc ttcagatgac attaatttaa aaaagttact 1860 tgctattgca acaatggcag aatatatgaa accaacatct ggtaaacgaa ttcgccagct 1920 tacgagagat acaagcaatg cactttcaca cacttgtcgt ggactaattg atttgtcttg 1980 tcatctttta tcaagtggaa atgagtatgt tattttaggt tggttttcta gtgatccaat 2040 tgagaaatgt ttcggaaaac tacggcaagg ttctggtggt acttacttta taactgctaa 2100 atcagttatc gaaaaagtac gtattcaaca tgctaagctg gtgctacagc tgaatatcga 2160 agtagaagga aatgatggcc atgactgtct tttatgtctt aaagatttag atgatatgga 2220 aattgatata atttacaatt taccagatct agaagatagt ataaatatag aaatttttta 2280 tgctattgtc tatattgctg gttatgttca aaagtgttct actgaagaaa ttggtgaaga 2340 tacaactaca tattatgaaa aatatggaac ttatttgaat actttaaata ggggaggatt 2400 aactattcca tctgatactc tatctcagtg gtctttattt tgttacatat ttttttcttt 2460 attagataaa aaagtttgca gaacatttct gatacgacag tttgctttta ttgcagagtt 2520 gtatagtttt tctattgcaa agaaacaatg ccaaacacta gcaaacattt ttttaaagaa 2580 ctttgcatta cttcacactc ccaaaagtaa taaagaagca aaactaaaag aaatcaagct 2640 tatgtaaacg tttttttttg ggaagattta gtacaaaaaa aattagattt gttattggtg 2700 tatttttctg attatttatt atttcttgta tattcaaaaa tttgatttaa aatgtttcta 2760 ttcaacttta caaactattt aagcttataa aatagttgtc aaattaacgt caagaaactc 2820 attttataaa aatgcatcca aaattttttc aaatagaata attagttatt aaataattag 2880 tgttagcaaa gctgtacagc atttttctct tttaataaac cgccatttta tttttaaatc 2940 ggccttcagt caaaaaaatt cccgcacggt gtcgaggcct tctaataaag taggccatg 2999 // ID ITmD37E_Ele5 repbase; DNA; INV; 1296 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37E DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37E_Ele5. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1296 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1296 RA Kojima K.K. and Jurka J.; RT "ITmD37E-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~93% identical to consensus. TIRs are 22 bp CC long. TA TSDs. This consensus is ~92% identical to the original CC sequence in [1]. This family encodes a DD37E-type transposase and CC is similar to Tx_mos from Toxorhynchites amboinensis. XX FH Key Location/Qualifiers FT CDS 152..1168 FT /product="ITmD37E_Ele5_1p" FT /note="transposase." FT /translation="MHSKQEELRQQILGAVIDNPGQSHRWIGKRLGIHHST FT VSRVVKMFQETKTIERRAGSGRKPKTEYPEKARKIRQYFKNNPNLSTRDVA FT KKVNSSKWFVQQTMKRAGLRVFKVRKAPNRTDKQNTVAKLRARKLYREWLV FT KPGCLVMDDETYIKADTKQIPGLEFFVGTSRMEVPEKFRKRKMDKFAKKYL FT LWQAICECGRYSKPFVTTGTINGQIYREECLQKRLLPFLRSHRGPTLFWPD FT LASCHYAKPVMDWYEAKGVNVVPKEANPPNTPELRPIEKFWAIMKKKLKKS FT GKTFKTDEALKKNWEKLCKKDGQSLAQDLMSSVKRNVRAFSLGEEIQ" XX SQ Sequence 1296 BP; 380 A; 272 C; 345 G; 299 T; 0 other; aagattggac tcggaaaaaa tgcgacactt tgttttgggt gtaacttttt atcgcgtggg 60 taaaaattaa tgaaattttg ggtaatacta gtttagagtg ttatgtttat gtgtgcaaaa 120 ttttatcgcg atctgtcaag tagttttcaa gatgcattcg aaacaagaag agctacgcca 180 gcaaatcttg ggcgcggtga tcgataatcc tggtcagtcg cacagatgga tcggcaaaag 240 gcttggaatc caccattcca ccgtgtcccg cgttgtgaag atgttccagg agacgaagac 300 catcgagcgg cgcgctggaa gcggccgaaa accgaaaacg gaatacccag agaaggcgcg 360 gaagataagg cagtacttta agaataaccc gaacctttcc acccgagacg tggccaaaaa 420 ggtcaattcg tcgaagtggt tcgtccagca gaccatgaag cgggctgggc tacgtgtttt 480 taaggtcagg aaggcgccaa accggactga caagcaaaac acggtggcca aattgcgtgc 540 gcggaagctg taccgtgagt ggctggtcaa gccaggctgt ctcgtcatgg acgacgagac 600 atacatcaag gcggacacca aacaaatccc cggcctcgag ttttttgtcg gtacgtcccg 660 catggaggtt ccggaaaagt tccggaagag aaagatggac aaattcgcga agaagtacct 720 gttgtggcag gcgatctgcg agtgcggacg gtacagcaag ccgttcgtta caaccggcac 780 cataaacggg cagatttatc gagaggaatg cttgcagaag cgcttgctgc ccttcctccg 840 ctctcaccgt ggtcccacgc tgttctggcc ggatctcgct tcgtgccatt acgccaagcc 900 tgtaatggat tggtacgaag cgaagggcgt taatgtggtc ccgaaggagg cgaacccgcc 960 aaatacacca gaacttcgtc cgatcgaaaa gttttgggcg ataatgaaga agaagttgaa 1020 gaaaagcgga aaaacattca aaaccgatga agctttgaag aaaaactggg agaaattgtg 1080 taagaaagat gggcaaagcc tcgcccaaga tctcatgagc agtgttaaga gaaatgtacg 1140 agcattcagc ttgggggagg aaattcaata aataaattat gccaaaacat agctaatata 1200 tagttattta atccctgaaa atttgagaag tatccgattt aaaatgaatt tttggtgatc 1260 atttttgtgt gtctcatttt ttccgagtcc agtctt 1296 // ID BEL-57_CQ-I repbase; DNA; INV; 3037 BP. XX AC AAWU01004587; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-57_CQ_; KW BEL-57_CQ-LTR; BEL-57_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3037 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 267-267 (2011). XX DR GenBank; AAWU01004587; Positions 81883 84919. XX CC 'TTCTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 541..2553 FT /product="BEL-57_CQ-I_1p" FT /translation="MAPGLRALFHQERFYLNTLTNAKQFVEAYKEAQHSGQ FT LAGWKQRIDGLDDKFHANRLAIELSLDDDTDKATKKTDDAAAQATEEADKD FT ASEESNRLIRKKFELDYVLVYSFLVNEIQKQTIVPQANVDPVSPGRSRVKL FT PDVTLPMFDGSILEWITFRDSFKSLIHSNVELSPFDKFMYLIGSMTPKARE FT RIDNIDVSAANYPIAWQALEELFENKKLIFKAYMDALFAIEPMRKECYECL FT AQLVDGFERNLNMIKKLDISTDGWGAVLSHMVCCRLDHSTLKQWESHYRST FT EVPEYNDLMAFLRGQLSVLQSLPSSKSESHKHEHRSLIPKTTYAVTSPATS FT LCPFCSQPSHSPFKCDYFQGLSVSQRLDLVRTRNLCLNCLSPEHMVRKCPS FT GTCRVEGCHRMHHTMLHLSSNERSSTSTSPANSRSNSPHQSYPQPYSSSPP FT PPESQHPLATSLVSTNVVGARKIPATVLLQTAIVKVLNSRSQHQSARALLD FT PASQLNLISEDLVQKLKLRRYPCHQPIGGVGNSTVISSHAVHIALGSYTNT FT NFRAVQTFHILEKITQDLPSRIIDTSSWKLPTNVNLADPRLAEPRSVDLLI FT GMELYFDLLLDGFIKLGSEKPILQNTVFGWVASGKIRSGRPERAPKLAHVS FT IISKIHNSNHQGGSLPLTCDTT" XX SQ Sequence 3037 BP; 753 A; 792 C; 749 G; 743 T; 0 other; tttggtcctt cgattgccgg atgtgcgcga tttatcacgg acaatcgcgg acatttctcg 60 ctgaacttgt cgctggcgtc aaaagtggac ttttcggatt gtgtggctct cgagaaccac 120 gccgcgtgtg tagtgggtga aaagtgaaaa aataatattg cgctgactgt actgtcgcgg 180 caaaccatgc gctgaagaaa cttgtgccct atgctcgatg aacagtgata agtgctacaa 240 aaattgctgc tgattgcctg ccgaatttct tcgatctacc gcatcaagca ccgcgaggtc 300 accgcgtggc acgacgtggc ttgggctgta ttgtgcgagg aggataaagg gcctggacta 360 gatcaaaggt catcgttgca cacgtgagta cgacatgtgt gttgtggtgg tttaactgga 420 ttggtctttg cgtgcgatcg tcaatactgc aaatctcggg ctggttggta gtgtgtgcag 480 cgtggttgga atttccgcat ttttttgggc ttgattgagc actgctagtc gctttgggac 540 atggctcctg gattacgtgc gttgttccac caagagcggt tttatctgaa cacacttacc 600 aacgccaagc agtttgtgga ggcctacaaa gaggctcagc atagtggaca actagcggga 660 tggaagcagc gaatcgacgg tcttgatgac aaattccatg ctaaccgtct tgccatcgaa 720 ctatcgctgg acgatgatac ggacaaggcg acgaagaaga ccgatgatgc tgctgcgcaa 780 gctacagagg aagccgacaa ggatgcttct gaagaaagca atcgacttat acgcaagaaa 840 ttcgaactgg actatgttct tgtgtatagc tttctggtca acgagatcca gaaacaaacc 900 atagtccccc aagcgaacgt cgaccctgtc tcacctgggc gctcgcgagt caagcttcct 960 gacgtcacac ttcccatgtt tgatggttcg atcttggaat ggatcacatt tcgtgattca 1020 ttcaaatcac tgattcactc taacgtcgag ctgtcgcctt tcgacaaatt catgtatttg 1080 ataggatcga tgacgccgaa ggcgagggag aggattgaca acatcgatgt tagtgctgct 1140 aactatccta tcgcctggca ggcactggaa gagttatttg aaaacaagaa gcttattttt 1200 aaggcgtaca tggatgctct ctttgcgatc gagccgatgc gcaaggaatg ttacgagtgc 1260 ctggcgcagc tagtcgacgg ctttgagcgt aacctgaaca tgataaagaa gttggacatc 1320 tcaacggacg gttggggtgc ggttctgagt cacatggtat gctgtcgact tgatcattct 1380 acattgaagc agtgggagtc acactatcgt tccactgaag ttcctgaata caatgacctg 1440 atggcgttcc tacgtggtca actttccgtc ctgcagtcat tgccatcgtc gaaatcggag 1500 tctcacaaac acgaacaccg atcgctgatc ccaaagacca cctacgctgt cactagtcca 1560 gcgaccagct tgtgtccgtt ttgttcgcag ccatcccact cgcctttcaa atgtgactac 1620 tttcaaggct tgtcagtttc gcaacgcctt gacttggtga ggacgaggaa tctctgccta 1680 aattgcttat ctccagaaca catggtgcgt aaatgtccct ccggtacatg cagggtcgaa 1740 ggatgtcacc gcatgcatca cactatgctc cacctatctt caaacgaacg atcgtccact 1800 tcaaccagcc cagcaaattc tcgctcgaat tcgccccacc agtcctaccc acaaccttac 1860 tcgtcatcac cgccaccgcc tgagtcccag catccactag ccactagcct cgtgtccacc 1920 aacgttgttg gagctcggaa gattcctgcc actgtgctgc tgcagacggc tatcgtcaag 1980 gtgctaaatt ctcgatccca acatcagtcg gctcgagccc tcctagatcc agcatcgcaa 2040 ttaaacctaa tttctgaaga tctagtccag aaattaaagc tgcgccgcta cccgtgccac 2100 cagccgattg gtggagtcgg taactcaacc gtcatttcgt ctcacgctgt tcacattgca 2160 ctgggatctt acacgaacac gaacttcagg gctgtacaaa ccttccacat tctggagaag 2220 attacccaag atcttccatc tagaattatc gacacctctt cgtggaaact accaactaac 2280 gtcaacttag cagacccaag gcttgctgaa cccagatcag tagatctact tatcggcatg 2340 gaactctact ttgacctact gcttgatgga ttcatcaaac tgggatctga gaagccgatc 2400 cttcagaaca ccgtttttgg ttgggttgca tcagggaaaa tccgttctgg ccgaccggaa 2460 agggctccga agcttgccca tgtgtctatc atcagcaaaa tacacaactc gaatcatcaa 2520 ggtggatcgc tgccgctgac atgcgacacc acgtagctat gcgtcgtggc gagtccagca 2580 gttacttcga cccaatcgga cgaggaggcg ctgcccgata atcgtcgtca ttcgtaatta 2640 agctcaaagg actgagcctt ccatcgcaac gctacggcat tgcgcaccgg gaggcctacg 2700 ggcctccgac ccctgtaagt tgttttaata ttgaatttgc aaaaagctca tgaatttttt 2760 gtcccccggc atgtttttgg ctcatcgatg gcgactccaa cccacaacca tgagcgcgcc 2820 caacgcggac gatcaccgag catcaaccga gaacgcacag aggatcgccc gtgacgtcac 2880 caggagcagc aatttgtaag gagaggagag cggtcgatag tcgaagcaga gctgctgtgg 2940 acaacatcaa caaagttgac cttggccttc tgcgacgagg cagaagggtg cgatcgcgtg 3000 tattggttga aagctgaccc tttcaacggg gccggca 3037 // ID Gypsy-73_CQ-I repbase; DNA; INV; 6996 BP. XX AC AAWU01041308; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-73_CQ_; KW Gypsy-73_CQ-LTR; Gypsy-73_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6996 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 525-525 (2011). XX DR Genome; AAWU01041308; Positions 26141 19146. XX CC Positions [5877-6350] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 386..1738 FT /product="Gypsy-73_CQ-I_1p" FT /translation="MTDRILSLRPRKTLVRTVENPIKRSNSLSTKTNISKQ FT LNTSYSGLEKLNLEECQTSSTPIRLSKYCAKSVKKSFEKFENFLKEVNFDK FT SLFDQFLYPELEDTPEAILNKKRETVIQEFLSASAYFDSALKMAQFDFEKA FT LQCIPEFTGNFDDVENFINIADIFHARIGARNADQLNQFLAVIRLKLKEDA FT KERAIDIMADTWQEVQNNLNTLYQEKNTWPEILNSCNKLVQTRQESLEEYR FT RRADKLNRQIKQKGGNDAAEEILKENFIGGLQSTTLMQFAATIEECSYEDL FT ANTILERGKTLERIYEAHKEKMKNSSSNATEGGDGGGNSLRNKNWSNKKNN FT TWRNPVWGHPSNLTQPPPLGPIPQPYTMMPNQMIGWQPNQTQNFGNPNRTL FT VPYGHVQNHENMTMDRWGRGPPQSNQMSQMNNTMNFNPDFNSTFNSNQNST FT QHVPKN" FT CDS 3600..6701 FT /product="Gypsy-73_CQ-I_2p" FT /translation="MEKIDLGHLNSSHRQSITDIISEFKNTFFIEGDRLTR FT TDAAVHDIETETCIPINKRQYRFPEATKKHIKEQVEEMLEQGIIKPSTSPW FT NAPVLCIPKKPDANGNQKYRMVVDYRALNTITRSFVYPIPLINEILDNIGE FT NSLFTCLDLKSGFYQVSINPNDTEKTAFSTPQGHFEFVGMAMGLKNAPSTF FT QKLMHTVVYELGAVKAFVYLDDIIVFGNTVEDHNDTLKKVLDALSRHNLKI FT EPTKCQMLKTQIEYLGHIIDKNGIRPTDANIKAITNLKRPTNIRQVRSFLG FT TINFYGKFISNIAARRKPLNDLLKKGVKFSWTPQCERAFEDLKQCLISEPL FT LIRPNYKDTFVITTDASDFAIGAVLSNAQTMDQPIAFASRALKGAELRYHA FT IEKELLAIVWATQYFRHYIFNQKFIVYTDHRPLVAIDSLKETSTTLSKLRL FT KLLGLDREIRYKQGRENAVADFLSRIDHDKTEQINPETVGATTRAQLRNQN FT RNNNTSANKGYDSSDDVMFRELSNIDISDGNDDLPEEEYFSKLYEEFRLYA FT EENKSYATEIKNCKTLDNESNCSHTFVIANKKSGFSEISQVYNLPHGARNI FT IQEGNFAFPNDNLYGVMLDGKHNSIIDSKKFFQTFALFYIHTFSECANKRI FT HLISFRKIRQPEILQIIEFAAWKTKTKICFYNNRSEIVNAKADQIQTILQE FT FHDAPLGGHIGARRMIKRISQAYFWKNMNRDILNYVRQCDSCQRHKIHKSN FT KIPMKITTTASEPFEKLYMDIVMFPESHWGNNCALTVQDDLTRFLTIIPIH FT NQEASTVARALVEGVICKFGTPCEIVTDQGTNFMSKMLKEVCKLLHIKKIN FT TSAYHPQANLVERSNRELKQYMRQFVGKDTGSWDQKLPYFLFEYNTAPNES FT TGFSPYELLYGRVARMPASIYKSTTDANYETYLDEIKHLFSSIHKMAKENL FT AVSKVRNKIRYDVNVEDWQPELGDPVMVHDNPMGMGRKLQQLWRGPYLVIR FT LDSEQTSTILNGNKEEKVHNNRLKKYND" XX SQ Sequence 6996 BP; 2463 A; 1334 C; 1347 G; 1852 T; 0 other; tgtggtgaca gtcggtaact taaattaatt tagtaattta atttagagta caatgtgcaa 60 cgtgtcgttt ggagcagtgt tattgtgctg gtaagtacct ttttttttta ctattccccg 120 gtttgtgttg ggaactactt aattaagtgt gtcgttattt tgttatttcg ttcgttcgtt 180 attatcatca ccggtacagc cagtagtgag ataatcaaca caatcctgca cacacagaca 240 tttcctcaga acttgactct agattggatt ttattgaaaa aaattaaaaa cacaaaattt 300 tagcagattt taaatgtttg caaaataaac ttaaacaaat taactttgat cagtttgtgt 360 catagtgcgc ttctccaggg tccgaatgac ggatagaata ttaagtcttc gtccaagaaa 420 aacattagtc cgaaccgtag aaaatcctat aaagaggtcg aactctctct cgacgaagac 480 aaacattagt aaacaactaa atacgtccta ctcagggcta gaaaaattaa atttggaaga 540 gtgtcagact tcgtcaacac ccatcaggct cagcaagtac tgcgcaaaaa gtgtcaaaaa 600 aagttttgaa aagttcgaaa attttcttaa agaagtcaat ttcgataaaa gcctttttga 660 ccaattcctg tatccagaat tggaagatac acccgaggca attttgaata aaaaacgcga 720 aacggtaatt caagaatttt tatcggccag tgcttatttt gacagcgcac ttaagatggc 780 tcaatttgat tttgaaaaag ccctccaatg tataccggaa ttcaccggta atttcgatga 840 cgttgaaaat ttcatcaaca tcgctgatat tttccatgct agaattgggg cgagaaacgc 900 cgaccagcta aaccagtttt tagctgtgat tcggttaaag ttaaaagaag acgccaagga 960 gagggcgata gatattatgg ctgatacttg gcaagaggtt caaaataact tgaacactct 1020 gtaccaagaa aaaaatacat ggccagagat tcttaactct tgcaataaat tagtacagac 1080 tagacaagaa tccctggaag aataccgccg cagagctgat aaactcaata ggcaaattaa 1140 acaaaaaggg ggaaacgatg cagccgaaga aattttaaaa gaaaatttca tcggtgggct 1200 gcaaagtacc accctcatgc agttcgctgc aactatagaa gaatgttcat atgaagacct 1260 ggcgaacaca attctagaaa gaggcaaaac gctagaacgt atctacgaag cacacaaaga 1320 gaagatgaaa aattcttctt caaatgcgac tgaaggcggt gatggtggcg gaaacagcct 1380 cagaaataaa aattggagca ataaaaaaaa taacacgtgg agaaacccag tatggggtca 1440 cccaagtaac cttacacaac ctccaccgtt gggtccaatt ccccaaccat ataccatgat 1500 gccgaatcaa atgattggat ggcaacccaa tcaaactcag aactttggta atccaaacag 1560 aacactagtt ccgtacggtc atgtacaaaa tcatgaaaac atgacaatgg acaggtgggg 1620 tagaggtcca ccacaatcaa atcaaatgag tcaaatgaat aatacaatga acttcaaccc 1680 agattttaat agtacattca atagtaatca aaacagtacg caacacgtac caaaaaacta 1740 atacaagggg acgtacaagt cggatacggc tcagtcccct ttaaaataga tcaaagcgct 1800 ggtaattata ttggcttcaa agctaatcaa gaaattatag ttccagtaga caaaaccgtt 1860 ggaaaacata tggccatcca actagctact ataaccaacc cacaaacgcc ttcaagttac 1920 atgttagata cggggtcacc gaataactac attcataaag atcaccttga agttataggt 1980 ttaagtatgc agaatataaa tcattatgac aggcacaccg ttcaagggat cggcaaacag 2040 ttaacacaga cttgtggttc gatatgggta aaattgataa ttggagaaag tacattccct 2100 atcaaattcc acgttctgga taatatgata gttccagcaa tcattgggct ggaattttta 2160 aaaaaccaca cctcaacaat tggaagttgc tttgattata ttacatttca acgctttcca 2220 gtaaagaaat caccgaataa cacgaaacaa aatacgaaaa acgactacca taaggaaaca 2280 ataaacgaac aaaaatccca acgtattgga aaaaccgtaa taatacaagc agaagcagca 2340 attgcaacat gtggtgcaat tgacaccaca aaatccatgg aacaggctat actggaaagc 2400 gctatcgatc ttgatgctca ccaagattgc gatttcctca cgcagaaaaa tgttttgtag 2460 aatcagcctg tacgaggttt gattcaacaa aatattttgt tgaatataat caacaaattt 2520 tttgcattaa atcaactctc atgttttgtt gtttcaaatc gggttttttt gttgatcaaa 2580 atttgacaga agctcgttca aatcagattt ttgttgaagc aacgtcaatt ttttgttgaa 2640 acaaccctga ttattttttc aaaataaggt tagtctttca attgaattaa caaatgttga 2700 ataggaataa aaatagtcat tatttattga tatttgcatt ttttttatcg gatacaaaac 2760 aattcgatgt ggtcttccaa gtgttgcact ttgttcggtc catgtgccgt ttcgctgctt 2820 ttgaaccgca cctggtggca ccgccgcctc ctgatgaacc gacgagtttg taaagttgcc 2880 tcccgcgttg aatgagccat gttgtcattg gcgttccggt aaaccgtggt tacgttgttt 2940 tccgcagaag tctgcaagga tggtagagga tcaaaaaggg tcatttgaag taaaatgcta 3000 gagcggctta ccaagcatgt tccgtcgagt ggttggggag tttcgtgaat ctgcaaaaca 3060 tctttaactt ctagagatgg caacttgcgc atatcaattg ttgctgatga ggtggcctag 3120 ctggtggagg tgaattctgc ccgaatagcg ctctgacttc tcgcattgta taaaacttcg 3180 ctcgccgtaa gcgacgggat cgggatcaaa cacacgaatg aaattctcgc acggaacaaa 3240 acttgtccac tcaaatcaaa ttcaagaaaa acttaacatt ttgacatata gcgggcacgt 3300 gtaaacagca tttaatttaa ttaaaatcaa aaaacatagt taagttgatt caacgccgat 3360 tattgtagaa tcaacgcaat atttttgttg atatttccaa aaagatttgg caaaacacac 3420 agagtagatt caactcaaaa ttttgagttg tttcagttgg ttttgagatc tgtttcaaca 3480 aaaaatcgtc aagtgaaaac aacaaaatat tttgtcgatt caaataggcc tgatattttt 3540 gcgtgctaga atacaagaac cataaagaga taaaaggtcc tcaacgagta gctactctga 3600 tggaaaagat agatctaggc catttgaatt catctcatag acaatctatt actgacataa 3660 tctctgaatt taaaaatacc ttttttatag aaggggatcg ccttacgcgt actgacgcag 3720 cggtacatga tatcgaaaca gaaacctgca ttccgattaa caagcgccaa taccgatttc 3780 cggaagcaac caagaagcat ataaaagagc aggtcgaaga aatgctagag caaggaatta 3840 taaaaccaag cacgagcccc tggaacgcac cagtcctttg catcccaaaa aaaccagatg 3900 caaatggcaa ccagaagtac cggatggtcg tagactaccg agcacttaac acaattacta 3960 gaagtttcgt gtatccaatt ccattaataa acgaaattct ggataacatc ggtgaaaata 4020 gtctcttcac atgcctagat ttaaaatcgg gtttttacca agtttctata aaccccaatg 4080 ataccgaaaa gacagcgttt tcaacgcctc aagggcactt cgagtttgtt ggtatggcaa 4140 tggggctcaa aaatgcccca agtacctttc aaaaactgat gcacaccgta gtttacgaat 4200 taggggctgt taaagcgttc gtatatctag acgacataat tgtttttggg aatacagtgg 4260 aagaccacaa tgatactctc aaaaaggttc tagacgctct aagcaggcat aatcttaaaa 4320 tcgaaccgac aaaatgccag atgttgaaaa cacagattga atatttaggg cacattattg 4380 acaaaaatgg tatccgccct actgatgcaa atattaaagc cataactaat cttaagcgac 4440 caactaatat tcgacaggtt agatcatttt taggcactat aaatttttac ggtaagttta 4500 tttctaacat cgccgctaga aggaaaccat taaatgacct tcttaagaaa ggcgttaaat 4560 tttcatggac tccccaatgt gaaagagcat tcgaggattt aaaacagtgc ctgatatcgg 4620 agcctctact gatcaggccg aattataagg acacttttgt aatcactaca gatgcaagtg 4680 attttgccat tggggcagta ctatccaatg cacaaaccat ggatcaaccg atcgcttttg 4740 caagtagagc attgaaaggt gccgagcttc gataccatgc tatagagaaa gaattactag 4800 ctattgtatg ggcaacccag tatttccgac actatatatt taaccaaaaa tttatagtgt 4860 atacagacca caggccactg gtcgctatcg atagcctgaa agaaacgtct accacactct 4920 caaaacttcg ccttaaattg ttagggctag accgtgaaat tcgctataaa cagggaagag 4980 aaaatgctgt tgcagatttt ctatctcgaa ttgatcatga taaaacagaa caaatcaatc 5040 cagaaactgt tggcgcaaca actagagcgc aactcagaaa ccaaaatagg aataataata 5100 cttctgcaaa taaaggctac gattcatccg atgatgtgat gtttagagag ctgtcaaaca 5160 ttgatattag tgacggaaat gacgatttac ccgaagagga atacttttcg aaattatacg 5220 aagaatttag attatatgcc gaagaaaaca agtcatacgc tacggaaatt aagaattgca 5280 aaaccttaga caatgaatca aattgcagtc atactttcgt gatagcaaac aaaaagtcag 5340 ggttttccga gatttcccag gtttacaatc ttccacatgg cgcaagaaac ataattcaag 5400 agggaaattt tgcatttcct aatgataatc tttatggggt gatgctagat ggtaaacaca 5460 attctataat cgacagcaaa aagtttttcc aaacatttgc tttgttctac attcacacct 5520 tcagcgagtg tgcaaataaa cggattcatc taatatcgtt tagaaaaatt cggcagccag 5580 aaatcctgca aattatagag ttcgccgctt ggaagacaaa aacaaagata tgtttctaca 5640 ataaccgctc agagatcgtc aacgcaaagg cggatcaaat tcaaactatc ctccaagaat 5700 ttcacgatgc acctttgggc ggccacattg gggcccgcag aatgataaaa aggataagtc 5760 aagcatattt ctggaagaat atgaataggg atattttgaa ctatgtgcgg cagtgtgact 5820 cctgccaaag acacaaaatt cacaaatcaa acaagatacc gatgaaaata acaacaacgg 5880 cttcagagcc ttttgaaaaa ctatacatgg atatagtaat gttcccagaa tcacactggg 5940 gcaataactg cgctttgact gtgcaagatg acttgacacg attcttaaca atcattccaa 6000 ttcataacca agaagcctct acagttgcta gagcgttagt agaaggtgtg atttgtaaat 6060 ttggtacgcc atgtgagata gttacagatc aaggtacaaa ttttatgagt aaaatgctga 6120 aagaagtctg taagttactt cacataaaga aaatcaacac aagtgcctat cacccacaag 6180 caaacttggt cgaaagatcg aatcgagaat tgaagcaata tatgcgtcaa ttcgtcggaa 6240 aagacaccgg gtcatgggat cagaaattgc catattttct ttttgagtac aacacggcac 6300 cgaacgaatc tacgggattt tcaccgtatg agttactata cggaagagtg gcaagaatgc 6360 cagcttcaat ctacaaaagc acaacggatg caaattacga aacgtatctg gacgaaatta 6420 agcacctttt cagcagtata cataaaatgg ctaaggaaaa ccttgctgtt agtaaagtaa 6480 ggaataagat aagatacgat gtaaatgttg aagattggca accagagctg ggagatccgg 6540 ttatggttca cgataacccc atgggtatgg ggagaaaact ccaacagcta tggagggggc 6600 cctatttggt aattagatta gatagcgagc aaacttcgac cattcttaat ggaaataaag 6660 aagaaaaggt tcataacaac cgacttaaaa aatataatga ttaatttccg aagatggcct 6720 ttaaaacatt caaacccttg ggtagaagta agccgaaaaa accgctgagg tacaaaaaaa 6780 agggcttttt taatttttat ggaacttaga aaataacaaa catcattaat aatagagcta 6840 atgcctcgtg gcaagttttt ttttctcttt ctttccatac aaatttcagg catcccttgg 6900 attagcaacc agaggaaaaa cgcacaccat tcaaggaggg agagggcgac ttaacgcatt 6960 gatgacacta aaacgctgga attctatctt attaag 6996 // ID hAT-2_BF repbase; DNA; INV; 3869 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-2_BF autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3869 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3869 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 921-921 (2008). XX DR [2] (Consensus) XX CC The N-terminal portion of the transposase (pos. 11-52) is the BED CC zinc finger motif. XX FH Key Location/Qualifiers FT CDS join(416..644,1296..3139) FT /product="hAT-2_BFp" FT /note="transposase." FT /translation="MPRGFISSKPSKVWDYMTKLPQDTARCNLCKKVAACS FT GGTSNMARHLRRNHKIETTPHAASATLAAMFGEPNNNDRGSAQPLPPCATQ FT SPPAASHTATRPTTASSSAGTAEKPALPSLDSLLEDSSAPSTSSSATVASG FT PSSNVTPFSLAKLNKLPAEKQHKITGRLCHYIVKSLRPFSVVDDPYFRAVV FT NELNKNYQMPNRGEVSERFIPKMYESALLNLKSELAHVDFAALTGDGWTSR FT VADHYMTLTIHYLHNWELKAKVLQTLKTEVSQTGENIAVEIKSCLEDFGLM FT GKIEVMTTDNAPAMINATSAAGIALSMGCFAHTLNLSTQKAMDVPTVVTML FT SVIRPIVTYFRNSSIGKIVLKDKQALLAKPGHNLLQDCKTRWNSSYLMVER FT FVEQYPALVAATLDDRLKKKDTFRKLQRCTEKDLDRMEQFLEVLKLPYKVT FT VAMSSEKRATSGQVLPMIQKLQTNLAEKKEDEKFTKDLKAAIRNDLAKRYQ FT HEGRREFLQEATALDPRFKDASVVTEDVWERITNKIEVIQAQVQVKKEAED FT EGDMPQPQQIKQKSRDGLSDTDTDTPLAKKAKLTAMEEIFKDEDEVVVTHV FT EPPLPVRLRIQQEILKYKVMPKLKSTDDAVAFWRGKTDELPLLSTTARKYL FT VVPGTSVPSERIFSTAGDIVSAERATLDPENVNMLLFLNKNT" XX SQ Sequence 3869 BP; 1167 A; 814 C; 898 G; 990 T; 0 other; tagtggtgcg taccggtccg ccgaaccgga cttggacttg ctataacttt cggttcaagt 60 ccggttaaaa ccggaacgta aaaaaccggt tcttggactt gccaaaaact agcatttatg 120 tatgttctat tttttgttgt aaatcatcta aaaccgtcca tagatcgaga aatttgcgcc 180 gaaaaagggc gaaaaacatc gcgtatatcg acggccatta gtgactcgcg aaatctcgcc 240 aatcatcgca agttctcgca actgctcgcg agatctgcga gacgtaacat gacgtaaggg 300 gaaactgatg atgtgattgg cttagacaca gcggcgtggt ttgtttcgcc gccataatgg 360 aaatttccga gcagagcttt gtttgatttt ctgcgtcacg ggtcaggtca acacaatgcc 420 gcggggcttt atatccagta agcctagtaa ggtgtgggat tatatgacga aactgccgca 480 agatactgct agatgcaacc tgtgcaagaa agtcgccgcg tgttctggcg gtacatcgaa 540 catggccagg cacttgagga gaaaccacaa gatagaaaca acgccgcatg cggcatcggc 600 cacattggcg gcgatgtttg gtgagccaaa caacaacgat cgaggtaagc atttttgggg 660 catattttca gttcagggaa tactagatag tagtagatac accggtaaaa attggcattt 720 tcagcataag ttgtagcata ttgcgattgt gttgcatagg gtagatatta gctgtacgag 780 ccatttttac aggtttgcct tttagtctgc atattaatta gcaaagcctg ttcaattttt 840 aatattttta ctttttatag aacacagtaa tatatatata tattgtgtag tgtcacagtg 900 tgtaccaatg gtaaagtata attattctaa cgacctggtc ccaaaaccaa gcagatgtgc 960 tgcatcatgt gcataagaac actgcatttt ttattcataa atatatgtga taatttttgt 1020 aatcactatt tttattgctg gcttggcaaa aggttatggg gaaaacattc tgcatcattt 1080 gcaaaatgtg cctttttagt ccagtgttat gtggagagca aagcaatgta cctagtttct 1140 aggactagta aaagttaaag ccaaagatta attatttctt gtattagtta cagagtaagt 1200 ctactctatc acagagtaac tactattgct accattcatg aatatgcttc ggttcagtta 1260 cttcaatggg ttgatcaaat actttttact tttaggctca gcccagcccc tgcctccgtg 1320 tgccacacag tccccacctg ctgcatcgca cactgccact aggcccacca ctgctagtag 1380 tagtgctggc actgctgaga aacctgcgtt accgtcgctc gactcactcc tggaagattc 1440 atcagcacct tcaacatctt ctagtgccac tgttgccagc ggacccagca gcaatgtaac 1500 accttttagc ctagccaaac tcaacaagtt gcctgcagaa aaacagcaca agattacggg 1560 caggctttgc cactacattg ttaagtccct tcggccattt tcagttgtag acgacccata 1620 ctttcgggcg gttgtaaatg agctgaacaa gaactaccag atgccaaatc gtggagaagt 1680 ctcggaacga ttcatcccca aaatgtatga gtctgcactg ctgaacttga agtctgagtt 1740 ggcacacgtt gattttgcag ctttaactgg agatggctgg acgtcccgag tggcggacca 1800 ctacatgact ttgaccatcc actatcttca caactgggag ctgaaagcta aggtgctaca 1860 gacactaaag acggaggtct cccaaaccgg agagaacata gcagtggaga taaaatcttg 1920 ccttgaagac ttcggtctga tgggcaagat cgaagtcatg accacggaca atgccccagc 1980 tatgattaat gccacaagcg ctgcgggcat tgcactgtct atgggatgtt ttgcgcacac 2040 actaaacctg tccacacaaa aggcaatgga cgtgccaact gttgtcacca tgctgtcagt 2100 gataaggcca attgtgacat atttccgtaa ctccagtatc ggtaagatcg tccttaagga 2160 caagcaggca ttgcttgcca agccagggca caacctcttg caagattgca agactagatg 2220 gaatagctcc tacctgatgg tcgaacgatt cgtagagcag tatccagcgc tagttgcagc 2280 aacccttgat gatcgactca agaagaagga tacctttagg aagctgcaac gttgcacaga 2340 gaaggactta gacagaatgg aacaattcct ggaggtgtta aagcttccgt acaaggtcac 2400 tgtggcgatg tcatctgaga aacgtgcaac atcgggtcaa gtactcccga tgattcaaaa 2460 gctacaaaca aacctggcag aaaagaagga agatgagaag ttcactaagg acttaaaagc 2520 cgccatcaga aatgatttgg ccaagagata ccagcacgaa ggaagaagag aattcctgca 2580 ggaagccaca gccctcgatc cgaggttcaa ggatgcctca gtggtcactg aagatgtgtg 2640 ggaacggata acaaacaaaa tcgaagttat ccaggcccaa gtacaggtga agaaagaagc 2700 tgaggatgaa ggagacatgc cacagccaca acagatcaaa cagaaatcac gggatggtct 2760 aagtgacact gacactgaca caccacttgc taagaaggca aagctgacgg ccatggaaga 2820 aatattcaaa gatgaagatg aagttgtggt gacccacgtt gagccacccc tgccagtccg 2880 actgcgtatt caacaggaga tcctgaagta taaagtcatg ccaaagctga aatcaacaga 2940 tgatgcagtt gctttctgga gggggaaaac agatgagctg ccattgctca gtacaaccgc 3000 aaggaagtat ctggttgttc ccggtactag tgtgccaagc gagcgcatat tcagtacagc 3060 cggagatatc gtgtctgctg aacgagcaac actggaccca gaaaatgtta acatgttact 3120 gttcttaaac aagaacacat agagactgat agagtcatag actgtttaat gtttattgtg 3180 ttcaagaatg tatggaaagt tgggttgtag aacttgtagt attgtagaac tagaaggaca 3240 ttactagtac aaaattgtat gcacttatag attagaaacg tataaagtca tgccaaagct 3300 gaaatcaaca gatgatgcag ttgctttctg gagggggaaa gcagatgagc tgccattgct 3360 cagtacaatc gcaaggaagt atctggttgt tcctggtact agtgtgccaa gcgagcgcat 3420 tttcagtaca gccggagtct ggagatatcg tgtctgccga acgagcaaca ctggacccag 3480 aaaatgttaa catgttactg ttcttaaaca agaacacata gagtcatagt catagactgt 3540 ttaatgttta ttgtgttcaa gaatgtatgg aaagttgggt tgtagaactt gtagtattgt 3600 agaaggacat tactagtaca aaattgtatg cactattgca gcactaatag attagaaaca 3660 gaacagtcaa taaaccaaac ctatttgatt tgattttgag gcttctttca cattgtttat 3720 tgataattag tattacaata agtcgttcgc ttcagatact catacaagcc gtggattcaa 3780 gtccggactt ataagtccgg acctgaacct gagtctatgg actcgagtcc gaacctgaac 3840 ctgaaagtga gctcaggtac gcaccacta 3869 // ID BEL-12_CQ-I repbase; DNA; INV; 8206 BP. XX AC AAWU01032924; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_CQ_; KW BEL-12_CQ-LTR; BEL-12_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8206 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 177-177 (2011). XX DR Genome; AAWU01032924; Positions 44308 36103. XX CC Positions [7080-7661] - Integrase core CC 'CTGCC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 798..1901 FT /product="BEL-12_CQ-I_1p" FT /translation="MDATLKATLQARRVSLLAALGRAESFQAEYDHERDQM FT EVQLRLAHVEEASAKLEDIQQQLEDDETDEAGKAVHEGIRANFETRLFKIK FT GFLLSKQRLIPQPRNPSPPQASSTLSGIKLPTITLPEFDGDYQKWLAFHDL FT FVALIHNNPDLPDVQKFHYLRGVVVGKAAGYIDTYDINAANYQIAWDTLVE FT RYNNEYLLKKRHLQALFELPPMEHETASSLHSLVDEFQRHIKILDQLKEPT FT ASWSTVLEHLLCTRLHVDTINLWEDHASTLSNPDFQSLIEFLQRRMRVLES FT ISVNNNHSSSNAATSSHQSPRTKKRSSQRLSSYASTASSGSTADCAACGLA FT HPLYKCHKFGQLSVSERQRIVSSKQ" FT CDS 2010..3098 FT /product="BEL-12_CQ-I_2p" FT /translation="MLITRQEPTAVQRHPHKRAVIWKCPLRSSQPSNCYNC FT LGAHQVRNCSSKRSCRHCQQRHHSLLHAGHADNSPRTNSGAAAPTQASGHL FT EVSASVVSAKQEVVTPTEVVPQVEVSVPLQQPKENVFLLTVLVKIVDSLGM FT EHVARALLDSASQPNLITDRMAKILRLKRAPASVVIQGAGQMSTPVRHSVS FT TEVRSRKNDFSCGINFLVMDKLTAELPSQNISTSGWNIPVELFLADPTFNE FT RQPIDMVLGAKHFYSFFPGAARIHLDGSLPILVDSVFGWIVTGSASQAPST FT PTETAHSYSVLAMMSLEESMERFWKTEELTINDKLSVEERDCESLFQSTTT FT RTPEGRYVVLLIIGVHRVAK" FT CDS 5529..8087 FT /product="BEL-12_CQ-I_3p" FT /translation="MRKWTSNRLEVLRGLTQDQIGTQSSLQFGPNEMVKAL FT GIGWEPEADVLRFDSQINQSKGPPTKQFILSTIARLYDPLGHIAPVVVTAK FT IIMQECWQLKCDWNDPVPDNIRIKWEKFFKELPEIRSYRIDRYALQPDSTY FT QLHTFCDASKDAYGACCYIRCEDRHGNVRVTFLASKSRVAPLRGLNVPRLE FT ANAAVLGVHLHFRVKQALKIKISDSYFWTDSAVVLQWIQANPNTWKTYVAN FT RVAEIQFYTRGCHWNHVPTKDNPADLVSRGMTVPNFLNCETWKNATSWICS FT NPRDWPRTNPPSVPADIMEARVVAAVTEKPPQVHPWFLRWHSYRRLVHSIG FT FVLRFIDNTRQKARTTRTSDTPVPRALTVEQFAKAKTVLTRLAQQDSFSAE FT LKELENGKPVSKQSNIYKMSPFIDAERVLRVGGRLKLSQLPFQAKHPALLP FT GSHPFARILAEFYHQKLFHGGGRLLLTAMREEFWPIGGRILARSVVRNCFR FT CVRLNPELVQQQIGQLPAQRIIPSRPFSVVGVDYAGPLYLKPIHKRAAPAK FT AYICLFVCFATKAVHIELVGDLSTPAFLAALRRFIGRRGVPSHIHSDNGKN FT FEGAKNELARLYTMLTHDAEQEKIHEFCTAEGITWHLTPPKAPHFGGLWES FT AVKVAKKHLFRQLGPSRLSFEDMCTVLTQIESHMNSRPLLPLSEDPNDLAA FT LTPAHFLIGTSMHALPDPDLRHIPVNRLDHYQHLQLHAQQFWAHWRREYLQ FT ELLRDTKGYQRNDEIQPGRLVIVVDELQGPLRWPLARIEAVHPGPDGIVRV FT VSLRTASGGIITRPAVKICLLPVPQSPEAPHTTPQPGQLPTDAVFPEEEKH FT " XX SQ Sequence 8206 BP; 1785 A; 2506 C; 2068 G; 1847 T; 0 other; tatttggtgc cgtgaccagg atttctggcc aacctttccg tggagtgcac cgccattttg 60 gacgccgagg atctgacgcg ggccgacgcc attgctgttg cggcttgctg cttgattata 120 caaggcctaa tttaattagg cagcaggtaa ttagctgtcc taattccctg cgtctccccc 180 gcgggtgccc tttagatctt cttccttctt ttcacgctgt tctgcaactt ccggactgac 240 ggaatcggcg actgctgcct ctaccgttgg agcacatcgc acaggccttg ctagtgcctc 300 tcaaaccttc gacgttcgct gccgaccggc tcgtggtgtt ccccaggcca tcctgcgcca 360 ttttggaagc cacgcggtcg cgcccgcact cctggctcgc tgatcgacga ggatcatcga 420 ccatcgtcat cggcggccat cttgtaacgc tgtatacgac cacgccacac ctgctggctc 480 ttcgaaaccg gttgtacgga cgattccctg cctccccggg atcctcaacc ggctgcaacc 540 gcgtcacgcc cgatctgcga ccgcttcact gctgctgaac ggtgccacga gccgggtttt 600 ggtgatcgcg cacagcacgg acaagctgct tgttcccatg gctgttttct tgaactttca 660 aggcattttc ttgccttgga taggtgagta cctttccaag tcgctactcg tctgcgtccg 720 ggtctaccgt ggtctttttt cgcagatctt gctcgccact tttgcgtcag ttgttgctgt 780 cgggcggcat ttccaccatg gacgcgacgc tgaaggcgac gctgcaggct cgcagagtgt 840 cgttgctggc ggcgctgggt cgggcagaga gctttcaagc ggagtacgac cacgaacggg 900 accagatgga ggtgcagctg aggctggccc acgtcgagga agcgagcgcc aagctcgagg 960 acattcagca gcagttggag gacgacgaaa ccgatgaagc gggtaaggct gtacacgagg 1020 gaatacgggc taattttgaa acgcgcctgt tcaaaataaa aggcttcctt ctgtccaaac 1080 aacgtcttat tcctcaacca cgcaatccct cccccccgca agcatcctcc actctctctg 1140 gcattaagct tccgacgatt acgcttcctg aatttgacgg ggattatcag aaatggttgg 1200 cttttcacga tctgtttgtg gcgctcatcc acaacaatcc cgaccttccc gacgtccaga 1260 agtttcatta tttgagggga gttgtcgttg ggaaggcagc cggctacatc gacacgtacg 1320 acatcaatgc ggccaactac caaatcgctt gggacactct tgtggagcgc tacaacaacg 1380 agtacctgct gaaaaaacgt catctacaag ccctcttcga actgccaccg atggaacacg 1440 agaccgcatc cagcctccac tcgctggtcg acgagttcca gaggcacatc aagattctgg 1500 accagctcaa agaaccaacc gcttcttgga gcaccgttct cgagcacttg ctgtgcacac 1560 gattgcacgt cgacaccatc aatctctggg aggatcacgc ttcgacgctc tccaatccgg 1620 acttccagag cctgatcgag ttcttgcagc gccgaatgag ggttctggaa tctatttcgg 1680 taaacaacaa tcattcttcg tccaatgcgg ctactagctc gcatcagtcg ccccgcacga 1740 agaaacggag ctcgcagcgc ctttcgtcgt acgcctccac agctagttca ggatctaccg 1800 cggactgtgc agcgtgtgga ctcgcacatc ccctctacaa gtgccacaag ttcggtcagc 1860 tttctgtctc ggagcgtcag cgaattgtga gcagcaaaca gtgaccgctt gctacaactg 1920 cttgggggca caccaagtga ggaattgttc gtccaagcgc tcctgcaggc actgtcaaca 1980 gcggcatcat tcgctccttc acgcgggaca tgctgataac tcgccaagaa ccaacagcgg 2040 tgcagcggca cccacacaag cgagcggtca tttggaagtg tccgcttcgg tcgtctcagc 2100 caagcaactg ctacaactgc ttgggggcac accaagtgag gaattgttcg tccaagcgct 2160 cctgcaggca ctgtcaacag cggcatcatt cgctccttca cgcgggacat gctgataact 2220 cgccaagaac caacagcggt gcagcggcac ccacacaagc gagcggtcat ttggaagtgt 2280 ccgcttcggt cgtctcagcc aagcaagagg tcgttacgcc caccgaggtc gttccccagg 2340 tcgaagtcag cgttccactt cagcagccca aggagaacgt gttcctgctt actgtattgg 2400 tcaagatcgt cgattcgctg ggaatggaac acgtggcccg cgcactactg gacagcgcct 2460 cccagcccaa tctcatcacc gatcgcatgg cgaaaattct gcgcctcaag cgagcccctg 2520 cgagcgtggt gatccaaggc gcaggccaga tgtcgacacc tgttcggcat tccgtctcca 2580 ccgaagtgcg gtcaagaaag aacgatttct cctgcggtat caactttctg gtcatggaca 2640 aactgacagc ggaactacca agccagaaca tctcgacatc gggttggaac attccagtgg 2700 aactcttcct cgctgatccc actttcaacg aacggcaacc gatcgacatg gttctcggag 2760 caaagcattt ttattctttc ttccctggcg cagctcgtat ccacctcgat ggttcacttc 2820 cgattcttgt agacagcgtc ttcggctgga ttgtcaccgg ttccgccagc caagccccgt 2880 ccacaccaac cgaaaccgca cactcgtact cagtactcgc gatgatgtca ctagaggaaa 2940 gcatggagcg attttggaaa acggaggaac tcacaatcaa cgacaagctt tccgtcgagg 3000 agcgagactg cgagtccttg ttccagtcga cgacaacgcg tacgccggag ggtcgctacg 3060 tggtactgtt gataattggg gtgcaccggg tagcgaaata gcccagtgtc ccgactagcc 3120 aaccacaaat aacaaataca gcgaattctg aatgtagtgc caaatttatt cgcgcgatta 3180 catgcttcga ttcatcgttc tagatctgcc tcttggtctc taaagcctaa aacctacgcc 3240 aacctctttc tattcatctt cttattacta gtgcactact cacacacgcg atgggcccat 3300 cacttcttct ttgcacagtc ctcgctcagc tcgacgcgct gcctgctcat cgtgcgcgcg 3360 tgacgtaaca cgccttacga caaccctggc gggtgggctg ttgaccgacg agccgttgcg 3420 atgcgcgtgc ctcctctcca atgcgccggc gtctgatctc cgacgctttc ccgccgcgtc 3480 cccacgcgtg cctcgcgaac cgaaagctcc tctccggact gccggccttc cgcacgtcca 3540 accccgagcg ccgaaactcc aatgactcaa gtgtcccaaa aacgccctgc agaaagacac 3600 acatcagcac gctacctctt gttgtggttt gtttttgctc caacttaccc cgatctgatt 3660 tgggggcgtt gttttgtttt ctttcgcgct cggcctgctc ggccgcctcg aagtcgctcg 3720 tctgggaaat ggacatgtgt gagattcgtc accatgcgat cttctttggt ttcttaccgg 3780 cagaatcagg acttgtttgc gtcctcggtt tgtgctcgtc ttgccggtgc tttcttgaga 3840 ctggctcgct gttgaagaat gtggcgttag agcagtatta cggcgtgctg tgagttcact 3900 tactcgacga caggctgatc aggctgaccg ccctccgtgc ctgattgatc gtacgtcccg 3960 ttctgtgcgt cctcaagctg tcgggcccac ttgtcgtcct cccgttcgtg ccttggggcg 4020 ccgagccgtt cctaaatgcg aaatctcgtt agtacggtgt tcgagttcaa cttatggtct 4080 cttactcgat tggcgatcag ttgcggtgga tgcgttcgct cgtggttcgt cctactttgc 4140 cgacttcgtc gctagttggt gttttcctcg aactcgtgct ggatgatcgg cttgataccc 4200 gcctttggtc tcttcctgtc gtgtccgatc aagccgctct agggaaagac gaacgtgtta 4260 ttactgctcg cttgtcggta agcccggatt gtttaccgct gacaacaccc tacacctccg 4320 gtcctagtgc cagcagtcgg ctccggtgct tgcggatttg ttctgcggcg gcgtcgactt 4380 gatttcgggc caagctgttt cgcaaccgtt ccgggatgtg tgcgcggtcg gtggttctcg 4440 atctcagtgg acgccgccct catgcacacg cgtcgctcgc cgaactttgt tgtgctcaca 4500 gcacggacgt gcacctttgc ttgggactcg caggcaacta actctcgctg tcgatcgccc 4560 tgcgatcgag ccgcgtcttc tctccccctc tttcttctta tttctttccg gaaatagcat 4620 cgaagtaggc ggtgatgatc gagcgcaaga tattttgctg ctggcatcga gctttcaagc 4680 gggactgtgg atctcctttt tcgctagcgc tgtgcttctt ttgcacgctt attcttattc 4740 tacccatttg gaacacaaaa tttcaaaata aagctactta tatcagaatt cgcaacgctc 4800 atgtaacagt acgttacccc cgcaaaccag gtttcgaaaa cttgctgggt gactccaaga 4860 ccgcagcacg acgaccaagg ttccgactgc ttgagcgacg gctcgagcga gacgcacagc 4920 tcaaggagga ctaccacaag ttcatgcggg agtatgtcga gctgggccat atgcagctgg 4980 ctgaacctga tgagaaggac aagacgccat cttgctacct accccaccac cccgtcttta 5040 aggactccag cacgacgacc aaggttcggg ttgtcttcga cggttcagca gcgacttcta 5100 cgggattgtc gctcaaccaa gcactctgcg tcggacctgt agttcaagag gaccttctct 5160 cgctcctgct ccgtttccgg aactaccctg tagccctcgt agcggacgca gccaaaatgt 5220 accggcaggt tctcgtccac ccagaggaca gaaggctgca gcgcatcttc tgacggttct 5280 cgccagactc acccatccaa acgtacgaat tacagaccgt cacgtacggc ctagcacctt 5340 ccgcgtacct agccacccgt tcccttcagc aactcgctca ggacgagggt cacgagtacc 5400 cgttgggcgg cccagcgctc gaaagcaact tctacgtaga tgacttcatt ggtggcgctg 5460 acacggtcga ggacgccata cgcttgcgga aggagcttac ggaactgctg gccaagggag 5520 gcttcgagat gcggaagtgg acatcaaacc gtctcgaggt actgcgagga ctgacccaag 5580 accaaatcgg cacgcagtcg agtctgcagt ttggtccgaa cgagatggta aaggcactgg 5640 ggattggatg ggagcccgag gccgacgttc tgcgattcga ctcgcaaatc aaccagagca 5700 agggacctcc cacgaagcag ttcatcctct caacaatcgc tcgactctac gatcctctag 5760 gtcacatcgc acccgtagtt gtcaccgcca agatcatcat gcaggaatgc tggcagctga 5820 agtgtgactg gaacgacccg gtgccggaca acattcggat caagtgggag aaattcttca 5880 aggaacttcc agaaatccgg tcctaccgga tcgaccgcta cgcactccag cctgactcca 5940 cttaccaact gcacacattc tgcgacgcgt ctaaggacgc ctacggagct tgttgctaca 6000 tccgctgtga agaccgacac ggcaatgtac gagtcacgtt cttagcatca aaatctcggg 6060 tcgccccgct cagaggactg aacgtccctc gcttggaagc aaacgctgct gtgctcggag 6120 tccaccttca cttccgggtc aagcaggcac tcaagatcaa gatctcggac tcgtacttct 6180 ggacggactc agctgtcgta ctgcagtgga tccaagcaaa cccgaacacc tggaagacgt 6240 acgtagcaaa ccgtgtcgct gagatccagt tctacacgcg cggctgccat tggaatcacg 6300 tgccgaccaa ggacaatccg gcggacctgg tgtcccgcgg gatgacggtg ccgaacttct 6360 tgaactgcga gacctggaag aatgcaacaa gttggatatg ctcaaaccca cgagactggc 6420 cgaggaccaa cccaccaagc gttccggcgg acatcatgga agctcgcgta gtcgccgccg 6480 tgacagagaa gccgccgcaa gttcacccct ggttcctgcg ctggcactcg tacagacggc 6540 tggtccactc gatcggattc gtcctccgat tcatcgacaa caccaggcag aaagcgagaa 6600 cgaccagaac gagcgacaca cctgtgccgc gtgccctcac tgtggagcag tttgccaaag 6660 ccaaaacggt gctaactcga ctagcacagc aggactcctt cagcgccgaa ctgaaggaac 6720 tcgaaaacgg gaagccagtg tcaaaacagt ccaacatcta caaaatgagc ccattcatcg 6780 atgcagagag agtgttgaga gtggggggtc gactaaagct ttcccaactg cccttccaag 6840 ccaaacaccc agccttactc cctggttccc atcctttcgc tcgcatcctc gccgaattct 6900 accaccaaaa acttttccac ggcggcgggc gcctactact gaccgccatg cgcgaggaat 6960 tctggcccat cggtggtcgc attcttgcca gaagcgtggt gcgcaattgc ttccgttgcg 7020 ttagactgaa tcccgaactg gttcagcagc agatcggcca acttccggct caacggatca 7080 tccctagtcg cccatttagc gtcgtcggcg tggactacgc aggtccactg tacctgaagc 7140 cgatacacaa acgtgctgca ccggccaagg cctacatttg cctctttgtg tgcttcgcga 7200 cgaaggccgt tcacattgaa cttgttggag atctctccac ccccgcattc cttgccgcgc 7260 tgcgcaggtt catcgggcga cgcggagtgc catctcacat tcactccgac aatggcaaga 7320 actttgaagg cgccaagaac gaacttgccc ggctgtacac aatgctcacc cacgacgccg 7380 aacaggagaa aatccacgaa ttctgcacag ccgaagggat cacctggcat ctcactcccc 7440 caaaagcgcc ccacttcgga ggattatggg aatctgccgt aaaggtcgcc aagaagcacc 7500 tgttccggca gcttggtccg tcaagattgt cgttcgagga catgtgcacc gtgctcacac 7560 aaatcgaatc ccacatgaac tcccgaccac ttcttccact gagtgaagac ccgaacgacc 7620 tcgccgctct aacgccggcc catttcctaa tcggaacatc gatgcacgcc ctgcccgacc 7680 cagatctgcg ccacattcca gtcaaccgcc tagaccacta tcagcatctc cagctccacg 7740 cccaacaatt ttgggcccac tggcgacgcg aatacctcca ggagctgctg agggacacca 7800 agggttacca gcggaacgac gaaatccaac ccggtagact cgtgatcgtc gtcgacgaac 7860 tgcaaggacc actccgctgg ccacttgccc ggatcgaagc cgttcatcct ggaccagacg 7920 gcatcgtacg agttgtgtcg ctgcgaacgg ccagcggtgg aatcatcacc agaccagctg 7980 ttaaaatctg cttgcttccc gtgccgcagt ctcctgaggc tccacacacc acaccgcaac 8040 cgggtcagct gcctacagat gcagtgtttc ctgaagaaga aaaacattaa acttagttaa 8100 tttatttagc acacgaagtg catcggctca acttccccta tagttagaat cttattttgc 8160 tttatttaca tttttttttg aattagtcat tcaaggcggc gggcga 8206 // ID BEL-57_AA-LTR repbase; DNA; INV; 337 BP. XX AC supercont1.17; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-57_AA_; KW BEL-57_AA-I; BEL-57_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-337 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.17; Positions 3348124 3348460. XX SQ Sequence 337 BP; 85 A; 74 C; 73 G; 105 T; 0 other; tgttcgcgct gcatactgac agctcgaaat ttttcagtgt acccgattgc tgcgatcgat 60 ttggcaacgc ttcaagtcga gctttaggag agaggagtta gcgcgcccaa tgcgcgcggc 120 agagcttttt tcataagttt atcggcgtcc gtcgcgagcg gttacaattt tcgcttttcc 180 tttttctata atattaattc attaaaaacc atgtgaataa gtgcaatata tcaataccaa 240 atttaaggaa gtgatttttc aatgtgtgac tgaattcctc cgaaattccc gtgaagtccc 300 cgtcgtggca attgcgttac agtccgctaa tcgtaca 337 // ID R1-5_BM repbase; DNA; INV; 5102 BP. XX AC . XX DT 30-APR-2010 (Rel. 15.07, Created) DT 30-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-5_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5102 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1051-1051 (2010). XX DR [1] (Consensus) XX CC ~90% identical to consensus. XX FH Key Location/Qualifiers FT CDS 666..1730 FT /product="R1-5_BM_1p" FT /translation="MDEAWTTVVRRGRRKAQAAPDPRPVPAAAAPQAMGRA FT AAGGKRKGRKARKPRAPRSAAVVLELLPAAKEKGLTYGEVMARARGSVDVD FT AIGVEGGXRVRHTANGARLLECPGADXGAAADRLAARLREILPDPEVVRID FT RPVKMAEVKVTGLDECATKEEVAAAIASQGNCALAQVKVGELRSSYSGAFT FT AWARCPVQAATLLATPPQGRPADSPGRLRVGWVIAHVQLQEARPWRCLRCF FT GTGHGLARCPSAVDRSGLCFRCGQPGHKAASCTAAAPHCVLCDAAKRRADH FT RAGGPACRSAPSSTKRRRGGKKKKKKAEEEPAAREAAGRSVPAAAVEPQGG FT TEDEGAMDVTPQ" FT CDS 1733..5101 FT /product="R1-5_BM_2p" FT /translation="MDHVHRFLQANVNHSARAQDLLVHTMAEWFIDVAIVA FT EPYFVPPDREDSWAGDVDGSVAIVMRQSAALPPLGMVARGSGYVVVRVDET FT VVIGVYFSPNRSLAEFERFLGGLEALVHRFESRPVILAGDLNAKCTAWGSP FT RTDSRGEFLSEWAFATGLCLLNRGSVATCVRWNGESHVDVSFASPSAARRV FT RGWRVLEGAETLSDHRFVRFELSVSTSLNAPAEDARGEEELPRSAPRSFPR FT WALKRLNKVLAVEAATVAAWAPMPARLVDVESEADWFRGTMRRVCDAAMPR FT VGGRAPRGGAYWWTPEIALLREECVRARRRSARHRRRRLRDADFAEVAARL FT HADCRQKQEALRRAIGEAKSQSMKTLLETLDQDPWGRPYQTVRKKLRPWAL FT PVTERLQPQQLREIVSGLFPRMERDFEPPSMGAPPRGDVSGDAPAEVVPPS FT ISEEEIRAAVSRMRRKDAAPGPDGVHGRVWDLAFGALGDRLVRLFEACLES FT GRFPKQWKTGRLVLLRKEGRPADSPAGYRPIVLLDEAGKLLERVVAARIVQ FT HLTGVGPDLSAEQFGFREGRSTIDAVMRVRALSDEAVGRGGVALAVSLDIA FT DAFNTLSLAEFERFLGGLEALVHRFESRPVILAGDLNAKCTAWGSPRTDSR FT GEFLSEWAFATGLCLLNRGSVATCVRWNGESHVDVSFASPSAARRVRGWRV FT LEGAETLSDHRFVRFELSVSTSLNAPAEDARGEEELPRSAPRSFPRWALKR FT LNKVLAVEAATVAAWAPMPARLVDVESEAEWFRGTMRRVCDAAMPRVGGRA FT PRGGAYWWTPEIALLREECVRARRRSARHRRRRLRDADFAEVAARLHADCR FT QKQEALRRAIGEAKSQSMETLPGLSVVCYADDTLVVARGRDLRESARLSCA FT GVAFVVGRIRRLGLEVALDKSQALLFHGARRAPPQGAHLVIGGVRVEIEAT FT GLRYLGLVLDGRWSFRAHFERLGPRLMAAAGSLSRLLPNVGGPDAVVRRLY FT TGVVRSMALYGAPVWCHALTRDNVAALRRPQRAIAVRAVRGYRTVSFEAAC FT VLAGTPPWDLEAEALAADYAWRCDLRSRGEPRPGAAEVRARKLQSRRAVLE FT AWSRRLADPAYGRRT" XX SQ Sequence 5102 BP; 727 A; 1654 C; 1915 G; 789 T; 17 other; cgtacgtgcg ggccctgaag tgcgtggccg ccgccgtgga ganacatctg gaggctttcc 60 tncagcgcac aagcacggag gaagtagccg gcctgcggtc gcaactggag cggctgcagg 120 ctcctagaaa acgcggagct ncgggcggag agcgtgcggg cgcgggagga gntggccgat 180 atgcgtgcgg ccctcgacgg ngtcgtggga cgccagggca gctcggcnnc tccgccaccc 240 ccgcccccac gcaggagaca gcggaaaagg acaaggagat cgaggagctg aagaggcgac 300 tcgcgatcct ggaggcccgc gcctccaagg ttgagcgcgc gaggccaccg ctggcgcacg 360 agcgccccgc gcctgcgcca tcgtcgcgcg gaaccgcggc tgcaccggcg cccgcgaagg 420 ccgcagcgac caaagtgggt gcggcacccg cgacgcgcgc gtcggccaac ttggccgcac 480 cggtcaagtc cgctggcngg cgagcgcncg ctcagcccgc gaggccgccg gccgccgctc 540 ctgccccgcc ccagcctgcg aaggcggggc ctggcaggag ccgcgcnaag cagaagaggg 600 gcgtgaacgc cgccgagccg gcgcagaccc cccagccccg tccgctacct cccgccccgt 660 ccaacatgga cgaggcctgg acgacggtgg tgaggcgggg tcgaaggaag gcgcaagccg 720 cgcctgaccc ccggcccgtc cctgcggccg ccgctcccca ggcnatgggt cgagcggcgg 780 ctgggggaaa gaggaaaggg cggaaggcaa ggaaaccgcg cgccccgcgg tcggcggcag 840 tcgtattgga gctgctgccg gctgccaagg aaaagggcct cacctacggg gaggtgatgg 900 cccgggcgcg cggtagcgtc gacgtggacg ccataggcgt ggagggtggc ntccgagtcc 960 ggcacacggc caacggggct cggctgctgg agtgtcccgg tgccgacncc ggcgcggccg 1020 cggacagact ggcggcccgg ctccgcgaaa tcctgccgga cccggaagtg gtgcgtatcg 1080 acaggcccgt caagatggcg gaagtcaaag tgacgggcct ggatgaatgc gccactaagg 1140 aggaagtggc cgccgccatt gcgtcgcagg gcaactgcgc cctcgcgcag gtgaaggtgg 1200 gagagctgcg gagctcttac tctggagcct tcaccgcgtg ggcgcgttgc cccgtgcagg 1260 cggccaccct cctggccacg cctccacaag gtcggcccgc cgactcgccg gggaggctgc 1320 gcgtgggctg ggtgatagcc cacgtgcagc tgcaggaggc acgcccctgg cgatgcctcc 1380 ggtgcttcgg caccgggcac ggcctcgcca ggtgcccgtc ggctgtggac cgcagcggac 1440 tttgcttccg ctgcggccag cccggacaca aggcggcctc ctgcaccgcc gccgcgccgc 1500 actgcgtgtt gtgcgacgcg gcgaagcggc gggccgatca tcgggccggg ggcccggcct 1560 gtcgttccgc cccctcctcc acaaagagga ggcgtggcgg gaaaaaaaag aagaaaaaag 1620 cagaggaaga gccggctgcg cgtgaagcag ccggccggtc agttcccgcc gcggccgtcg 1680 agccgcaagg cgggacggaa gacgagggag cgatggacgt gacacctcaa taatggatca 1740 cgtccatcgc ttcctccaag cgaacgtgaa ccactccgcc agggcgcagg atctcctcgt 1800 ccacaccatg gcggagtggt tcatcgacgt cgcgatcgtc gcggagccat acttcgtacc 1860 tcccgaccgg gaggattcct gggccggaga tgtcgatggc tccgtggcga tcgtgatgcg 1920 acagtcggcg gcgttacccc cccttggaat ggtggccagg ggatccgggt acgtcgtagt 1980 tagggtcgac gaaaccgtcg tgatcggggt gtacttctct ccgaacagga gtctcgccga 2040 gttcgagcgg ttcctgggcg ggctcgaggc gttggttcat cgcttcgagt ctcgcccggt 2100 gatactggcg ggggacctca acgctaaatg tacggcgtgg ggntcccccc gcacggactc 2160 gcgcggcgag ttcctgtcgg agtgggcttt cgcgaccggc ctctgtcttc tgaacagagg 2220 ttcggtcgcg acctgcgtgc ggtggaacgg ggaatcacac gtggacgtgt cgttcgcgtc 2280 tccgtccgcc gcgcgccgcg tccgtggctg gcgtgttctc gagggggcgg agacgctctc 2340 ggaccatcgg ttcgtccgat tcgagctctc cgtttccacc tcgttgaacg cgccggccga 2400 agacgcccgc ggcgaggagg agctcccgcg tagtgctccc cgatcattcc cgcgatgggc 2460 actgaagcgc ctgaataagg tgctcgcggt ggaggcggcc acagtggccg cgtgggcgcc 2520 gatgcccgcg cgtctagtgg acgtggagtc ggaggccgac tggttccggg gtacgatgcg 2580 tcgcgtctgc gatgctgcga tgccccgggt cggcggtcgc gctccgcgtg gcggtgcgta 2640 ttggtggacg cccgagatcg cgctcctccg agaggagtgc gtgcgggcgc gccgccgaag 2700 cgcccgccac cgccgtcgcc gccttcgcga tgcggacttc gcggaggtgg cggcccgcct 2760 gcacgccgac tgccgtcaga agcaggaggc gctgcggcgg gccatcggcg aggccaagtc 2820 tcagagcatg aagacgctcc tggagacgct cgatcaggat ccctgggggc gcccgtacca 2880 gacggtccgc aagaaattgc ggccgtgggc gctcccggtg acggagcgtc tccagcctca 2940 gcagctgcgg gagattgtct ccgggctgtt cccgcggatg gagagggact tcgagccccc 3000 ctccatgggc gcgccgccac gcggtgacgt gagcggtgat gcccctgctg aggtggtccc 3060 tccgagcatc tcggaggagg agattcgtgc ggccgtgtcg aggatgcgga ggaaggacgc 3120 ggcccccggc cctgacggtg ttcacggccg ggtctgggat ttggccttcg gtgcccttgg 3180 ggaccggctc gtgcggctct ttgaagcctg cctcgagtcg ggacggtttc cgaagcagtg 3240 gaagacgggc agacttgttc tgctgaggaa ggagggacgc cccgcggact cacctgcggg 3300 gtatcgtccc atcgtgttgc tggacgaggc gggaaaactg ctcgagcggg tggtggccgc 3360 ccgcatcgtc cagcacctga cgggggtggg tcccgatctg tcggcggagc agttcggatt 3420 cagggagggc cgctcgacca tcgacgcggt gatgcgcgtg cgcgccctct ctgatgaggc 3480 tgtcggccgg ggcggggttg cactggcggt gtccctggac atcgccgacg cgttcaacac 3540 cctgagtctc gccgagttcg agcggttcct gggcgggctc gaggcgttgg tccatcgctt 3600 cgagtctcgc ccggtgatac tggcggggga cctcaacgct aaatgtacgg cgtggggttc 3660 cccccgcacg gactcgcgcg gcgagttcct gtcggagtgg gctttcgcga ccggcctctg 3720 tcttctgaac agaggttcgg tcgcgacctg cgtgcggtgg aacggggaat cacacgtgga 3780 cgtgtcgttc gcgtctccgt ccgccgcgcg ccgcgtccgt ggctggcgtg tcctcgaggg 3840 ggcggagacg ctctcggacc atcggttcgt ccgattcgag ctctccgttt ccacctcgtt 3900 gaacgcgccg gccgaagacg cccgcggcga ggaggagctc ccgcgtagcg ctccccgatc 3960 attcccgcga tgggcactga agcgcctgaa caaggtgctc gcggtggagg cggccacagt 4020 ggccgcgtgg gcgccgatgc ccgcgcgtct agtggacgtg gagtcggagg ccgaatggtt 4080 ccggggcacg atgcgtcgcg tctgcgatgc tgcgatgccc cgggtcggcg gtcgcgctcc 4140 gcgtggcggt gcgtattggt ggacgcccga gatcgcgctc ctccgagagg agtgcgtgcg 4200 ggcgcgccgc cgaagcgccc gccaccgccg tcgccgcctt cgcgatgcgg acttcgcgga 4260 ggtggcggcc cgcctgcatg ccgactgccg tcagaagcag gaggcgctgc ggcgggccat 4320 cggcgaggcc aagtctcaga gcatggagac tctcccgggt ctgagcgtag tttgctacgc 4380 ggacgacact ttggtcgtgg cccgggggag ggacttgcgc gagtctgccc gtctttcctg 4440 cgcgggggtg gccttcgtcg tcggcaggat ccgaaggctg ggtctggagg tggcgctcga 4500 taaatcccag gccctgttgt ttcacggggc ccggagagcg ccgcctcagg gggcccacct 4560 cgtgatcgga ggcgttcgcg tcgagatcga ggcgaccggg ttgcggtacc tcggtctcgt 4620 actggacggt cgttggagct tccgcgctca cttcgaaaga ttaggtcccc gactgatggc 4680 ggccgccggc tcgctgagtc ggctgttgcc gaacgtcggg gggcccgacg cggtggtgcg 4740 ccgcctctac acgggggtgg tgcggtcgat ggcactgtac ggggcncccg tgtggtgcca 4800 cgccctgacc cgcgacaacg tcgcggcgtt gcgacgnccg cagcgngcga ttgcggtcag 4860 ggcggttcgc gggtaccgca ccgtctcgtt cgaggcggcg tgcgtgctcg ccgggacgcc 4920 cccctgggac ctggaggcgg aggcgctcgc tgcggattac gcgtggcgat gcgatctccg 4980 ctccaggggg gagccgcgtc ccggcgcggc ggaagttcga gcgcggaagc ttcaatctcg 5040 gcgtgccgtg ctcgaggcgt ggtctcgccg cctggcggac cccgcctacg ggcgacggac 5100 cg 5102 // ID Ginger1-1_AP repbase; DNA; INV; 5197 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.12, Last updated, Version 2) XX DE Ginger1 DNA transposon from Acyrthosiphon pisum. XX KW Ginger1; DNA transposon; Transposable Element; integrase; Ginger; KW Ginger1-1_AP. XX NM Ginger1-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-5197 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC The 5'-end and 3'-end are not identified. Contains 4 introns. XX FH Key Location/Qualifiers FT CDS join(37..528,768..1106,1273..1507,2523..2738, FT 2820..2980,2984..3133,4552..4833) FT /product="Ginger1-1_AP_1p" FT /translation="FKLTIPSLFNQFLFSLVPEIINKTKYEHVNLTAVLNY FT IQGKGYPSYIITKNEKANIRRQCKFFTIQCDSLVHKKKLAKVIFDAEERKR FT IIEMVHDGSDQSLQSSALSSHRGRDATQRVLQKRFYWPNMTVDVKNYVREC FT VICQKVNPSSLKFVPELKSVHVPKKVFKQIGVDLITLPEVNNLRYVVVVVC FT YFSKWCEAKALMDKTAESVAKFLYDDICRHGCPEIQISDQGREFVNKLSDE FT LFRLTGTQQRVTSPYHPQANGLVERLNRTLKTSLLKVFKWPDILQGILFAY FT RTTVHCSTKYSPFFMLYQREPIIPIDVEVTNNTQNELFEDRELSYDFDETT FT FSKTVQTMLDLRRVVESKAIQNINEAQLRQRESYAKRHLKADTKFKPGDKV FT LLKNLRRNDRKGGWSLMPWIGPFTIESISDKNTYISQDPTDDEVIVSDENV FT ITVKKSAFRPVSKLWMANKCLELNVPITQKHQCVAKSYKRLSKPKQTIDVI FT GDGNCWFRCISVWLTNSEDYHELIRSCVYKVFMFIFFTEDSRVNSFVGDIE FT KYLEINPVNISGTWVTEAEIFANSLLLKTSIYIYSDYHCLWQLFGKDGSYK FT DGTNPNENCLYISHVNRNHYQVIIEIE" XX SQ Sequence 5197 BP; 1903 A; 655 C; 732 G; 1857 T; 50 other; aattacacaa tattaatata tttttattaa ataacattca aattaacaat tccatcatta 60 tttaatcaat ttttgttttc attagttcct gaaataatta ataagacaaa atatgaacac 120 gtcaatttga ctgccgtatt aaactatatc caaggaaaag gatatccttc gtatattata 180 acaaagaatg aaaaagcaaa tatcagacgc caatgcaaat tctttacgat tcaatgtgat 240 tctcttgtac ataaaaaaaa gttggcaaaa gttatttttg atgcagagga gagaaaaaga 300 ataatagaaa tggtgcacga cggatctgac caatcattgc agtcatcggc attatccagc 360 catcgtggga gggatgccac acaacgggta ttgcaaaaaa gattttactg gccaaatatg 420 acggttgatg taaagaatta tgtcagagaa tgtgttatat gccaaaaagt caacccatcg 480 tcattaaaat ttgtgcctga acttaaatca gtccatgtac ccaaaaaggt agatatcttt 540 ttttcagtaa tttatataca atttattata atatgtacaa tgtactatgt aggtatatga 600 tattataata gttttaggct tagtatcgat tttggaaata taatttttgt tatcttcacg 660 atattgtcaa ttatatttaa aaaaaatcta taatgttgca taaattgtta ttttcattta 720 aaatatcgtt aaaaataaat aagtactcac atagctgttt gtaataggtc tttaagcaaa 780 ttggtgttga tttgattacc ttaccagaag taaacaattt gcgctatgta gtagttgtag 840 tttgttactt cagcaaatgg tgtgaagcaa aggcgttgat ggacaagacc gctgaatctg 900 tagctaaatt cttgtatgat gacatttgta gacacggatg tccagaaatt caaatcagtg 960 atcaaggccg tgaatttgta aataaacttt ctgatgagct attcagatta actggtacac 1020 agcaacgtgt gacatcacca tatcacccgc aagctaatgg tcttgtggaa aggctcaata 1080 gaactttaaa aactagtttg ttaaaggtaa tatttgactt tacacctacc tatattactt 1140 tttttttagg tttttatgac tgtaaattgt tttattatat tgtaagtttg tttttattat 1200 atacctacgc atcattttat tattcagtta caaatgccta ataatatttt taggtacaac 1260 agaatgatgt aggtatttaa atggcctgat atactacaag gcatcctgtt tgcatacaga 1320 acaacagtac actgttcaac caaatactct ccgtttttca tgttatatca acgagagccg 1380 attattccaa ttgatgttga agtcaccaat aacactcaaa acgaattatt cgaagatcgt 1440 gaattaagtt atgattttga tgaaactacg ttttcaaaaa ctgttcaaac gatgttggac 1500 ttaagaagta aataaaatag tcctatatca tattatatga gtataatcta taaaagtatt 1560 tagtaaatta taacactcat cattgttgta actatttaat ttattattta gtgcttattc 1620 atttttacat agtttaaagt taaactgttc aatatttttg caaaacattt taatttaatt 1680 atttgaaatt ttaccttttc aatatttgta acataatatg ttattaattt cataataatt 1740 cattcctgaa tttaatagtc catatgagtt ttctaacgag atggtatgtt gtctttttct 1800 ttctgtcaaa aggtctgaaa ttaaaaataa aaatagatca accattcaca ttggtagtgg 1860 tttctgatta caaattgtac ctaggtagtt taaacatgtt gaaatttcca agaattttca 1920 taatccaaaa aattatgttt tttttttatt tatgtcaaaa atatttctgg acctgataaa 1980 aaaatttaca aaatttaatt caaaaatctt ccccttaaat tatttttgct gcaatattaa 2040 acataaaaaa taaattcaca attttttatt aacatctgta gtcatgtatt attgaacaaa 2100 ttagataaac ttacaaaaaa atatatcccc tttcaacatt tcgaaaaatt attaggacat 2160 ttttatgtgc tttttatgaa ctttaaaatc aattggtttt tgcttaatta actaatagtt 2220 gaactttgct attagagtga ccaattgctt atataggatt tatttgaata aatatatatc 2280 aaagtagcta acgttattaa taaaacacga tttttttcat cattttatac taattttatt 2340 taattttatt taaaatattt tttacttgtt taatattatg tatatacatt gcactatgtt 2400 acaatcaatt gcaaaaatgt tatgtattat aataaagtaa cgtttactta ttgctactta 2460 aagcaggtta atatgataat aacagtagat agataaaaaa aaattatttc ttgttttttc 2520 aggagttgta gaaagtaagg caatacaaaa cataaatgag gctcaactta ggcaacgaga 2580 gtcatatgca aaacggcatt tgaaagcaga tacaaaattc aagccgggtg ataaggttct 2640 attgaaaaat ttaagaagaa atgataggaa aggtggatgg tctttaatgc catggatagg 2700 accttttaca attgagtcaa tctccgacaa aaacacatgt gtgttaagaa gaggagataa 2760 actaattaaa actcatcagc atttaaaaaa cataaaaaag ttttttgaac ggacagaaga 2820 catttctcag gatccaactg acgacgaagt aatcgtatct gatgaaaatg tgataactgt 2880 aaaaaaatca gctttcagac cggttagcaa attatggatg gctaataaat gcttagaatt 2940 gaatgtgcca ataacacaaa aacatcaatg tgtagcaaag tagagttaca aacgtctgtc 3000 caaaccaaaa caaacaatcg acgtcatcgg agacggaaat tgttggtttc ggtgcatctc 3060 tgtttggctg acaaattccg aagactatca cgaattaatt agatcctgtg tatataaggt 3120 atttatgttt attgttatct attaaattta aattagtaat tatatatttg atacttaata 3180 aaaaaaaagc tggtaagtgg atgtcgctct actgtaccta ggtacctaca ttagggggtg 3240 ggatggacct cagtcataga gtacctaggt cactataatg tgtttgttaa atttgaatgc 3300 aacgataggt atcatctatg gtatcattgt ataccaaaaa cgattctgaa cggagatgat 3360 ttgtcagtct aggatatatt atatttaaat attgttatta ttttatttta acgagtgaat 3420 tttgatttcg tgaaaattta aacacttgta aaaatttttt aattgaataa tcatcaattt 3480 tttttatcat tattctaagc tcaacttatg gaaaaaaaat taaaataatc gaatatttca 3540 aatgcctata gatagaaata tcttgaaaat aaaattatta atgaaaatgc caatctaaaa 3600 aattggtaga agtttcaaat tcctacgact tatgcttttt gaataataac aaatatcgaa 3660 aatcattttt cgatacatcg ttattttacg caattttgta aaaatttaaa tttcagacgt 3720 tcataaattt tttttctaaa atcacgatcg agtttttttt aaagttcctt nnnnnnnnnn 3780 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn taccgatttt tgattttttt 3840 ttaactacac taagaacaac tcataaggaa ccttgtatta aattttcaaa attttttgga 3900 gagccaaatt ttttttattc atatttcaaa aaaaaactta aaaaattcga aaaattaaat 3960 tgtctataaa tagctcaaaa aaagtcaaaa tctttgaacc ctgtatagat aacgctaata 4020 taaacatttg gtgaaaattt cgagtgttta cagtagttag ttagtgagtt acagtaaaat 4080 aaaattttta tttttgttga aaattggttt tccgtaaaaa ttcccatttt tttgttactt 4140 ttttagggtt tttcccgacg cttttaaaag gtattaggaa tttcaaaaaa tgacctcaat 4200 gcaccaacta gattcacttt cctttcagaa acagtgctat ttttgaaaat tgaagcatta 4260 ttactattcc aaatcgtgat gacagacaca aaaaaaaaaa aaacacacat cattgtaaaa 4320 tcaatacatt cctcactccg ttcagaatct aaaacaagga tctttattaa tattattttg 4380 ttttaaatag aattatgatt tatcaacatt ttcatgaaat ataatgatat taaatgtctc 4440 acagtaaaat attttttatt tgaaaattaa tgaattaaac taatctataa tatactaaaa 4500 aaataaatat tattattatt ttaatattaa ttattttaca ttttttaaca gttttttaca 4560 gaagacagtc gagttaacag ttttgttggt gatatcgaaa aatatttaga aataaaccct 4620 gtaaatatta gtggcacatg ggtcacagaa gcagagatat ttgctaattc attgttgttg 4680 aaaacatcga tttatatata ttccgactac cattgtttat ggcaactttt tgggaaagat 4740 ggttcgtata aagatgggac aaatccaaat gaaaattgcc tgtatatatc acacgtgaac 4800 agaaatcact accaagtcat tattgaaata gagtgaagaa aacatatatt atatttaaat 4860 tacgcgacga tatattttta attataatta agtacaatct tattgtttat aaattaatat 4920 tatttaagtt caaagttatt ttatttatgt atttgttaag aattcaagag atatatgtaa 4980 cataatattc gtgacgattt taaaatgtga taataagaat aaatatacaa taaaatttaa 5040 aaaagcggat aagtggatgt cgctctgctg tacagttgtt tacaagtggg ttactgtaat 5100 ggattaaatt tgaatttaat gatataatat tattgtatat ttgtatacga aaaacgattc 5160 tgatcggaga cgaattttca gtctgggata ttttata 5197 // ID Gypsy-98_CQ-I repbase; DNA; INV; 9027 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-98_CQ_; KW Gypsy-98_CQ-LTR; Gypsy-98_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-9027 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 575-575 (2011). XX DR [2] (Consensus) XX CC Positions [5336-5815] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(3023..4009,4013..5911) FT /product="Gypsy-98_CQ-I_1p" FT /translation="MFKDPFLSFSRIKITESCPHITVQIFDRNYEALLDSG FT ASVSVTNVANIAEKHGLTLKESPVRIVTADKTSHDCLGYVNLPISFRGVTK FT VVPTVVVPQVAKELILGFNFWEQFGIQPMVQGENGFEQIEIKQSEQRIESI FT ELSVLPIESFPVITTNEPDESLDIPALELPEPSRITPEAVETEHELSPDER FT EDLVEAIRVFPCTTEGNLGRTTLIEHEIVLKEGAIPKRQPLYKCSPAVQAE FT MEAEIARYKMMDAIEECTSEWASGLVPVRKANGKLRVCLDSRRVNTWTKKD FT SYPMRNMVEIFNRLGKARYYSVVDLKDAYFQIPLKENRDYTAFRTSQGLYR FT FKVLPFGLTNAPFTMSRLMDRVIGFDLEPNVFCYLDDIVVASETFEDHTRL FT LRTVGERLRKAGLTISLDKSRFCRKRVTYLGYLLTDEGVSIDNSRISPILD FT YARPRNVKDIRRLLGLAGFYQRFIREYSRIVAPISDLLKKSKKKFEWTESA FT EAAFNELKAALVSAPILGNPDFSRTFTIESDASDNAVGAALVQEFEDGPRV FT IGYFSKKLSSTQRKYASVEKECLGVLLAINHFRHFIEGSRFKVVTDARSLL FT WLFTIGVESGNAKLLRWALKIQSHDIELEYRKGKNNVLADCLSRSVETLLA FT MSTDQDYQEMIAKIQQDSENHSEYRVVDGQIYKFVKKCGKVEDPRFQWKRY FT PPQAERDDIIRGIHEKAHLGPDKTLEKCRERFFWPGMSSEVKRYCQSCVRC FT QTSKANNQNTTPPMKEQKKVAEYPWQFLTMDFVGPLPASGRNRNTCLLVVT FT DLFSKFVLVQPFRQATAESLVQFVENAIFLLFGVPEVILSDNGTQFVSSLF FT QNLLKKYEVTHWRTPNYHPQVNDSERVNRVITTAIRASIRKDHREWANNVQ FT QIADAIRNSVHEATRYTPYFILFGRNKVSEGFEYRHVRDTSAEGEMNIQNE FT ERERLYK" XX SQ Sequence 9027 BP; 2602 A; 1829 C; 2341 G; 2242 T; 13 other; aattggcgcc caactacagg ttcagcggga aattgctgtt ttggttttgt ttgcaatttc 60 agatattagt tgagatcttg ttggaagaat attcattgtt ttgttgaaaa tatcaattaa 120 gttggggtta aaattcacgt ttttgtacat tatcgagggt attagtggaa gacttactac 180 gttcgctgga gaccggagct gcaaagcgtt gatggccagt agcaattcct ttctgtaaag 240 attggtcttg gaaacagaca cgtaggttct gccgatgacc gtttatgacc ccgaactgtg 300 ggtaagtcaa tccttaatac agttcctcgg tgtaccgtcg atttgattaa ctagtagaat 360 tctcgaactc gctggatact aagccagagg ccgtagatac tgatagtacg tgttggaagc 420 ggtggagaca acgatagttg ttgatgatac tacccgagtt tttgtattct gcctattgtt 480 caaatcgatt gatccaaatt tgttgattta attggtagtt attagtttgt tggcaaatat 540 aatttgtaga tttaattggt agttcttagt ttgttggcaa atataatttg tagatttaat 600 tggtagttct tagtttgttg gaaagtatat taatttggtc aaatacattt gttaaaatga 660 cggggaaatt tcccaaccct gatcacctca cggaggagga aattgattac gaattgagac 720 tccgaggaag actcgaggac cttgatttgg ttagagaggc gaaatgtcga gtacttagaa 780 atttgttcta caatgatgca agaaatgatt atgagtatga tccaccttat acaatggatc 840 aagattttga tcgcatcacg gttaaggtta gggacttgag ggagaaattg gatgagggtt 900 ttgatgagaa atgtttgtct cgcctaaggg cgtattttca tcgcgtaatg cgactcgtta 960 caaccaacga tgacgaaaaa gttagaaaac gkgaactcgc agaagaaatg ttgcaaattt 1020 ttgtcacttt caacgtaaac ccagataaac aacctgatga gatttcagac ggaactccag 1080 agaaagagga aacggagaag gaaaacggta acgaaaagct ggagaaggga cagaagacgg 1140 ttcccaagaa ggacgtaccg aaaacaccgg ggaggacaga caactcagca gcgttacata 1200 agaggatcga ggatttggaa caacaactat tgagggcatg tttggcatgg tcwggggtgg 1260 atgctagtca gcctgggcct ttccaaatca atcagagacc cagacatcgt gctgatgagg 1320 agagccaacc tgaatcgata ccgtgggagg aggaggagga attacgcaga aaaagaaggc 1380 agaatcgtta ttcactgtgg gatagccgga gacagagagt tgacggagat cgaggccata 1440 ggcaaggcag aggccaaggg ttccaagaag agagagatcg acaatcaccg ttsgacaata 1500 accgaccggg ggaattagat cacaggtcta gcaattttga taataatcgt tacagagaag 1560 ggacccgcag gtcaaaggag atcccggatc agcagcactc aagtcaccga tcgcgagttg 1620 atgaccgcaa cctgagggag gatttgagac aggggtccca ggagtctagg cgtcctccag 1680 aacgaagaga tgacctggga cgaaacaatc gctcttctcg ggaagacaat cgtagktcaa 1740 gggagaaccc ggaacaacgc cacggaagtc gccagtcgtt tctagatgat cgctatcgta 1800 gtgatgatcc agggcaaggt ccccaggaag ataggtatga gtcaaacggt tacgagtcat 1860 gggacgaatc gacgcgaaga ggcagtcaac tgttcggaaa tcgtggagac cacaatgagg 1920 atagcaggtc taggtttagc caccatagtc agttggttga cagagaccac ggacgacatc 1980 cacgtggggg tgcaccagag agacgaagac atgtcgttcg gaggagaagg catgccggga 2040 tmggggacgc agagatccaa aacgccgata gacgcatgga gaaatggcac ctcacgttta 2100 atggggacac cagtcagcgg tctttggaag attttcttca taaagttcat cgattggccg 2160 aaatggacag aatcgcagaa gacgtgctac tgcaacggat tcatacgata ctgagaggtg 2220 aggcttacga ttggtatctt tgctatgcgg aggatttcca gcactgggac gacttcgaaa 2280 tgagtttccg ttacatgtat ggtaatccac atagagatca gagcaaccgt aaaaagattt 2340 acgggatgaa acagagatct gatgagcctt tcctaaccta caaaatggag attgagaggt 2400 taaacaaact cttaagggtg ccattggacc aggagagact tttcgaagtg atgtgggaca 2460 actgtaagga tgcatacaaa gacaggttgg tgtgcagaga gatagagaac ttggttcagt 2520 tggaatacta cgcggtccgg attgatgcga acgacccaga actggaagcg caaagagagg 2580 gaacgtccag accaaaatac ggatctaact tcacttcgaa acaccagcga agagggcagt 2640 ttaaaaacac ccacgagaat gaacgtaact atagggaagg acacgccagg gtagttcast 2700 acgtcgaggt ggagggtgaa actaaggtat tagactcgtc cgaatcggaa ccggaagaag 2760 tgaacatgtt ggagcggaaa atggatcagc agccacaagc agaaaagtcg cagacaagtc 2820 catcctgctg gaattgccat aaaaagggtc atctttggag gcgttgccag gagcggaaaa 2880 ccatgttttg ttacgtgtgc ggtacaccgg agaaaacggc gattacttgc ccaaaccata 2940 agaatggttc gggaaactag agcaggagtg tgaggatggg aacatctcca ttccagaggc 3000 tcaaggagtt cccaattcaa ccatgtttaa agatccgttc ctaagcttca gtcgcatcaa 3060 gatcaccgaa tcgtgtccgc acattacagt acagatattc gatagaaact acgaagcgct 3120 tctggactca ggcgccagcg ttagtgtaac gaacgtcgca aacattgctg aaaagcacgg 3180 tttgackcta aaggaaagcc ctgtgcgaat tgtgacggcg gataagactt cgcatgactg 3240 cttgggctac gtgaatctac cgatttcttt tcggggcgtt acgaaagtgg ttccaactgt 3300 ggtagttcct caagttgcca aggaattaat tctaggtttc aatttttggg aacagttcgg 3360 catccagcca atggtacaag gtgaaaacgg gttcgaacag attgaaataa aacaatcaga 3420 acaaagaatt gaaagtattg aactttcagt tctaccaatc gaatcattcc cagtcattac 3480 aacaaatgaa cccgacgaaa gtttggacat tccggctcta gaactacccg aaccttcgcg 3540 catcacaccg gaagcggtgg agacggaaca cgagctttct cccgacgagc gcgaggatct 3600 ggttgaggca atccgagttt tcccatgtac taccgaagga aacttaggac ggactacgct 3660 gatcgagcac gaaatcgtgc taaaagaggg agcaatacca aaacgccaac cgttgtacaa 3720 atgttctccg gcagtccaag cagagatgga ggcggagata gcacgataca aaatgatgga 3780 cgccatcgaa gagtgtacga gcgaatgggc gagtgggttg gttccggtca gaaaggcgaa 3840 cgggaaatta agagtttgct tagactcacg tagggttaac acgtggacta aaaaggattc 3900 ttatccgatg cggaacatgg tggagatttt taacaggttg ggaaaggcca ggtactactc 3960 ggtggttgac cttaaagacg cctatttcca aattccgtta aaagagaact scagagacta 4020 cacggcgttc agaacctctc aaggtctgta caggtttaaa gttctaccgt ttggkctaac 4080 caacgccccc ttcacgatga gtcggttgat ggaccgtgtt attgggttcg atctggaacc 4140 aaacgtgttc tgctacttgg acgacatagt cgtagcttca gaaacgttcg aggaccacac 4200 cagattgcta cgcacggtag gagaacgcct aaggaaggcc ggactgacca tttcgctgga 4260 caaaagccgg ttctgtagaa aacgggttac ctatttagga tacttactga ccgacgaggg 4320 agtttccatc gacaattcga ggatttcgcc gattttagac tacgcgcggc cgcgaaacgt 4380 aaaagacatt cgccgattgc tgggactcgc gggtttttat caaagattca tccgcgagta 4440 cagtaggata gtagcaccta tctcggatct gctgaagaaa tcgaagaaga agtttgaatg 4500 gaccgaatca gcagaagcag cgttcaacga gcttaaggca gctttagttt ccgcgcccat 4560 cctaggaaat cccgatttct cacgcacctt cacgatcgag tcggacgcgt ccgacaacgc 4620 agtgggagcc gcgcttgtcc aagagtttga ggacggtccg agggtgatag ggtattttag 4680 taagaaactg agcagtaccc aacgaaagta tgctagcgtc gaaaaggagt gccttggagt 4740 gctactggca atcaatcatt tccggcattt cattgagggt tccaggttca aggtcgtaac 4800 agatgcccgt agcctgttat ggttgtttac gataggtgtg gagtccggaa atgcgaaatt 4860 gcttcggtgg gcactcaaga ttcagtccca cgacatagaa ttggaataca gaaagggtaa 4920 aaacaacgtt ttggcggact gtttgtcgag gtcagttgaa actttgctgg cgatgtctac 4980 agatcaagac tatcaagaaa tgattgcaaa aattcagcaa gactcggaga accactcgga 5040 gtatcgcgtc gtggatggac aaatttataa atttgtaaaa aaatgcggga aggtggaaga 5100 ccctcgtttc caatggaaac gttacccacc gcaagcagaa agagacgaca taattcgagg 5160 aattcacgag aaagctcatc taggtccgga taagacgctg gagaagtgca gagaacggtt 5220 cttctggcca gggatgagta gcgaggttaa aaggtattgc cagagctgtg tcagatgcca 5280 gacgagcaag gctaataacc agaataccac tccgcctatg aaagagcaga agaaggtcgc 5340 cgaatacccg tggcagtttc taaccatgga cttcgtgggt cccttgccgg cgtcgggacg 5400 aaaccgaaat acctgtctat tagtggtcac ggatcttttt agcaaattcg ttctggtgca 5460 gcctttccgc caagccacgg cggagtcact cgtacaattc gtggaaaatg ctatcttcct 5520 actgtttggc gtgccggaag tgatattgag cgataacgga actcaattcg tctcatctct 5580 gttccagaac cttcttaaaa aatacgaggt cactcattgg cgaacaccga actaccaccc 5640 acaagtgaat gattcggagc gagtgaatcg cgtgatcacg acggcgattc gggcgtcgat 5700 tcggaaagac caccgtgaat gggcgaataa cgttcaacag atcgcggatg caatacggaa 5760 ctcggttcat gaagcgaccc gttacactcc ttacttcatt ctgtttggaa ggaacaaggt 5820 ctcagagggg tttgaatatc gtcacgtaag ggatacgtca gcggaagggg agatgaatat 5880 ccagaacgag gaaagggaga gactctacaa ggawgttcgg gaaaacttga aactggccta 5940 cgaaaagcac gccaactact acaacttacg atcgaatgca aactgtgcca cctataaggt 6000 tggagaaaaa gtcttgaaga agaacacaga gcagtctgat aaaggggcgg gattttgtgc 6060 taaactggcc ccaaaatatg tgccagcagt ggttaagaga ttagtgggta cccattgtta 6120 tgacttggag gacttgaatg gcaaacgctt aggcattttc aattgcaaat tcctcaaaaa 6180 atttagtcaa tagcgtgact caaacaaact tttggctgtt ctttgaatgg tgaatttata 6240 aagctatgta tcatgaactt tttcttgaaa ctaagactat gaagttgtac tctggtcaat 6300 gaaatacttt ccagcctcgc gtacagtgcc gcccccacaa taacaaatca tttctactgg 6360 gtggatgaat gggccttacg tttgagtgtt gctgaagtca gtctagttaa ctcttcaatg 6420 aaagagagga aactctgaat gaagttgatg tgggagatca gggactgcgt tttgtaaata 6480 aatctactaa agttcacaaa ggtagtcctc ttaatgtata tacctatcac tagcgcgttt 6540 aatttcatct agctagttag tagtttaaaa tttatgttga aatattagca gttcgtttgt 6600 ttatacaatt ttcgtgcaaa atttgttatt tatttttttg tagttagagt cgtggttgtt 6660 gtttatagtt tgggatagat tttaggaaat gttacaaagt gggttgatcg ttgagtacat 6720 ttgccaattg aagtcatcct tccttcaggg ggttgagcag ctcccagtcc ggtcgcctaa 6780 aatagaataa attaaaacct tttcataaac aaaccatcaa ttggaatcca acttcccttc 6840 aagttcacca ccatccatcc agttagtaaa ttcaattaca attwtcggcc attttctctc 6900 caacaatagt acattaacac cacatttagt ctcagttaac ttcatttttg cgaaatacac 6960 caacttttga cgaactaagt aatttttccc acgtttagtt gttttgccgt aaacatgacg 7020 gtaggtttga cgtttgctgt tgcttgatgc tcgagagagt gaatgaattc tgagagagtt 7080 catggggtta aggggcgagt tcattccacc ttatttcatt tagccatttc aaatttggcg 7140 ttagaaattt ggtggacatg gcatggggat atttgatgtt attctcgagt agacagactt 7200 aatctggcgt tggagaatac atcagggttt taagggtttt gcgtcttgtt cattcagaat 7260 tagcatacca aggtatacca tggtggattc tgattgaaac agattggggt gaattatgtg 7320 ggtggacgat tttccaactt ctgtttggag tagtccgaaa cataaggaac ttaaggtttg 7380 ttgttgtagt gattgagtaa tcgatcggaa attttgggct tgtgttgaaa tgtaaatatt 7440 tcagtcacaa tgccagaggg aattttcttg gcgatattat tgaggtttca gaggagcaag 7500 taaacagcaa gcgatcaaaa tttgagagga aagtatgagc ctgtagaaaa gtgggcagct 7560 gtaaataatg ttgttgaatt aaattgtttg atgttaacat aattttgttt tggcatgttt 7620 ttgggccgtt gtaaatagtt ggttaaaatt tttgtatata cattgcatta tgttgtccaa 7680 gttcgtttag tgtataaaag ttttttgttg ggtatacccc tacgaaaatt tggttgtttt 7740 gcaaccaaat tttcgtaaaa ttagtccaga gttatgtagc gttctacaat tattatttat 7800 aaatacacta gtacaaatga aacgaacatg gctgattttg aattcgagcg aaatgaatcc 7860 aaacttgcgt gcgagcttgc atgcaaagca gcaagctcaa gatggattcc gtcggattct 7920 ctctgctcgt gaccgtctgg agagaaatcg sggacttcgt gacctagcct ctcccatgga 7980 aactgcaact catgcagctt cccttgttcg atgcaaggtc aagatcaggg gtagggcctt 8040 aagtggttat gccatgagct gagcgagaag gcagatataa ttgggcggta gagcaagtag 8100 tcacatcact agccggtccc ggggtcaagg gtaggaagac acgagcgaca gacttgcact 8160 cggcttgccg attacaagtc cccgtaaccg aacgtacaaa cgctcacctt aggtctgagg 8220 gcacacggaa ggactcggtc atttcaaccg tgacagtgaa agtgaccgga taggagttct 8280 tggtttttgc ttcgcgtctt tcagtgttca agtacggtgt gggtttgctt ttgcgactgg 8340 gtgattaaat tctttttccc gagtaggtcc aagataggaa caagtagtgc cagtgtgcga 8400 acgtgtgccg tgtgcgttaa gaaggagaac cccaggtaaa ttggaaccct ttgaatcccc 8460 tcaatccccg gatttccaac tcacggccat ggaacagtcg ttcggcggcc ctcttactcc 8520 ccgatagaac tggtgcgtcg gcacccgcac tgctggtgtt ccagcatccc cgtccgtgcc 8580 cgcaggatcc actttccatc gkcccggttg aaaaaccccc ccggtaggac gccgtccgga 8640 agagggctgc gcacgttacc atcgcgccct gaacacgccc atccaccacg cgcgtgacca 8700 acgtacaccc atttgcgcgt cggccagcta accccgccga gagcgccgtg agcaggacgc 8760 ccacctggcg tcgaccttca tcggagagcg cgtgaactaa gttcatcgcg ccacgcggat 8820 agatatcgag ttgccgccgg aagccggcag gcacgacgct gaggcgtctc gagagaagca 8880 gcacaccgag tcgagagaag agcagcattg agcgtcagga gcatcgccga gcgtcgcaga 8940 gcatcccggg agttcaccga tcgtcgcacc acgtgcggtc cccgttcctg tgagagaggt 9000 taggttgcaa accgcctaac caagcag 9027 // ID Gypsy-241_AA-LTR repbase; DNA; INV; 262 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-241_AA_; KW Gypsy-241_AA-I; Gypsy-241_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-262 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1084-1084 (2011). XX DR [1] (Consensus) XX SQ Sequence 262 BP; 72 A; 44 C; 65 G; 81 T; 0 other; tgtaggaact tgattatata atcataagaa taggtagatg ttttgttcca gtgtaattac 60 gctcccgacc gttgctaggg gcaacgtgaa gaaaatagca tagtaggctg ttaaatgtgt 120 aagagggatt cgagaaaaaa ccgttgtgta ataaagacgt gtttgagtgc agtgaaggtg 180 gtgattcaat ttatttgacc aattcgtcgg ccgattctgt ttctccgtat ttctgctgtt 240 ccgctccgga tacgacccta ca 262 // ID FEILAI_B repbase; DNA; INV; 284 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A tRNA-derived SINE family from Aedes aegypti. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW FEILAI_B. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-284 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 284 BP; 80 A; 63 C; 75 G; 66 T; 0 other; ggggccttcc ttagccgagt ggttagagtc cgcggctaca aagcaaagcc atgctgaagg 60 tgtctgggtt cgattcccgg tcggtccagg atcttttcgt aatggaaatt tccttgactt 120 ccctgggcat agagtatcat cgtacctgcc acacgatata cgaatgcgaa aatggcaact 180 ttggcaaaga aagctctcag ttaataactg tggaagtgct cataagaaca ctaagctgag 240 aagcaggctc tgtcccagtg aggacgtaat gccaagaaga agaa 284 // ID BEL-162_AA-I repbase; DNA; INV; 6076 BP. XX AC AAGE02022918; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-162_AA_; KW BEL-162_AA-LTR; BEL-162_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6076 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022918; Positions 89423 83348. XX CC Positions [5102-5695] - Integrase core CC 'CAAGC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1890..4415 FT /product="BEL-162_AA-I_2p" FT /translation="MDQIVSAASDVIFVNYADGPGGKQEKPKAKEKTYINT FT HAVEEKREENVNSGDRKDKSKPCAACRKDGHKIRDCYAFRKMTLGDRWKLV FT REQYLCRRCLGSHGKFPCKASNCGENGCEERHHKLLHPGNPHTATDPPKTT FT STVTVHRQFNQSVLFRILPVSLHGNGKTIETYAFLDDGSSRTLVETKVAEE FT LGVVGDVHALCLQWTSGIERVEDESQLIKLQISGVPNGTRHDLKEVHTVKS FT LNLPVQSLDFGELCKQFPHLQGLPVQSYTNAVPTILIGLDNSFVMATRKSR FT EGPKGSPIAAKTIIGWTVYGSSAAETGQPCSHQLFHISVRSPDQELHDLVK FT VFFALESVGSDEDRRGKQILETTTVRTVFGRFQTGLLWKYDNVKFPDSRPM FT AEKRFRCLERRLASNPELYDNVKNQINQYQVKGYSHKASPDELASFNPKRT FT WYLPLNVVTNPRKPNKVRLVWDAAAKVQGQSFNSALLAGPDLLAPLPAVLS FT SFRQFEVAISADICEMFHQIKIRPEDRSAQLFLWRDDPSKPFDVFVMDVAT FT FGSTCSPSAARYVKNRNAEEFAEQFPRAAEAITKHHYVDDFLCSVHTEDEA FT VELAEQVRLVHSKAGFTIRNWLSNSSNVLIRVGEQQTESPKKLLIDRSAAY FT ERVLGMIWKPEHDVFVFQGVFREELQVLIQDDSIPTKREVLRVVMSVFDPI FT GLVAVFVVHGKVLVQHIWRLGLGWDERIPATLLEQWKRWIKLLQQLDHVEI FT PRCYFPGYSPEDYRAVELHIFVDASEEAFAAVAYFRIVEGPRVRCSLVSAK FT TKVAPLKLLSIPRLSKTVAENHTIPISRKVFWSDSCTFLS" FT CDS 4853..6076 FT /product="BEL-162_AA-I_1p" FT /translation="MDGRIASVQQVSFDFKFPVILPKGHEGTKLLVDCYHR FT LYTHCNAETAVNEMCQRFHVSEMRIALKQAGKLCQWCKVYKAVPAIPRMAP FT LQEARTTSHVRLFTYVGLDYFGPMLVKQGRNEVKRWVALFTCLTIRAVHLE FT VVYNLTTESCKMAVRRFIAQRGAPKEIFSDRGTNFVGASRDLKEEIQRITT FT NLANTFTNTDTQRHLNPPYTPHMGGSWERMVRSVKSALASLSTGGKPDDET FT LRTLLIEAESIVNSSPLTYLPIDSEEQEALTPNCFLMLSTSGVNKPAREPI FT EERAIARCNWDLGKVLLDKFWVRWIKEYLPTITKRTKRFKDRKPIEVGDLV FT VVVEDRIRNGWLRGRVLRVFPGRDGRVRNVEVLTASAGVLVQSVAKLAVLE FT VGGTAKIDFERVGG" XX SQ Sequence 6076 BP; 1555 A; 1579 C; 1596 G; 1346 T; 0 other; aaactgttta gatttctatg gtgaaatcgc ccggagatgt cgagctctac aaacctcccc 60 gacgctgggg caccgtcaac gacgcagcag catccagatg gtgtaaacct acagcaggaa 120 ggcgttctta acctcaactc cggctctcat ccggaaggtt ccgacaaggg tgttaggatg 180 gacacaccca ttgccccgtc tgtcggtcga tcaaaacgaa gccgttcatc gaagggcttt 240 acgcgcttcg ctagaactcc agcagctcca ggagcaaaag cgggtggaaa tggaattcat 300 cgccaaaaag tttgctattc tgcaggccca actggaggat gatgatggaa gcagtcgcag 360 cagcaagact cgtcgatctt cccatggcag ttcgcagaga actcacgact gggttcaaag 420 ccaggtaatc ccgactacgt gtagtaccac cgaatcccaa acgattcccg tgatcactca 480 atcccggact ggcaccattc ccaaggagat tccagcagtg agcaatgtca tggatgaatc 540 gattgcgctt gccgtgcagt cgttgtccat caatccggtg gaccgagtgg aatcgaacgc 600 aaaccaacgg acagcacctc tttcgctcca acacgtagtc ggctagccca tctgcaggcc 660 cgaaacaagg agaaaactgc tgtagtgaag agcgcgactg atggcttagg ggctgcaggt 720 gaattcacaa cgttttccac ggatcaccag gcgcttccaa ctgcacccaa cttttccatc 780 tcaccggaaa actcgccgtc ggcactgata gagcagctga gtgacctaca aaagcgtgcg 840 gacagtctcc aacagcagtt gatggtcggg acaatccgtc agagccatga tttcgatcga 900 gttccagaac aaactcggca gttccagccg actaagtcat caacagtggt tcgcatcgcc 960 gccaagtcat cagatgagta acgaccagtg cgtaacgccg cctacgattt cccagtcagc 1020 gatggtaata ccaaggttac cagtgcggta agtcaaccgt gctctagttc taccgttaca 1080 tttgccccac aattaaatat tgtgagctcg gtccattcgt cacgtggcga cccgagctga 1140 tgaaataaat aaaactcccc ctccgaatag cggaccatgc ttgcttcccc ctattctgca 1200 attcgttccg gttcatatga gtacaccaca agcccattcc actcccccct ccgtgcacac 1260 agtttcctct cgattgtctc caatcatgtc gtcgatgatt cctacatctg cacccgtcgt 1320 cgcagctgtg tccgctgcta atgatgttac tcggcggcag cttttcggtg gacccagccc 1380 acagcaactc gccaccagac aagtcgtgcc cagggagctt cctactttct ccggtgatcc 1440 tgtcacgtgg ccgttgttcc tgagctgctt ccaaaatacc accgagatgt gcggttattc 1500 cgatggggaa aatctaatgc ggctgcagcg gtgcttgaag gggaatgcgt tggaggcagt 1560 tcagagctat ctgatgcaac catcgtccgt accattgatc atcgacacga tgataacgtt 1620 gtatggtcgg ccagagcaaa tcatcaactc gcttctaggt acggtcaacg caaactccca 1680 agccggaaca tctggaaagc ctagttggag ttggtttggc gtgcaaaaat ctttgcagcc 1740 atctccaagc tgcgggtcta cacgatcatc tttcgaatcc gttgcttctt caggagctag 1800 taagcaaact tccagctact ctcaagctca actggtctct tttcaagcgt cagcatatca 1860 acgtcgacca taccacgttt ggctgctaca tggaccaaat cgtttcagca gctagtgacg 1920 tgatctttgt caactatgcg gacggacctg gtggcaagca ggagaaacca aaggcgaagg 1980 agaaaactta cataaacacg cacgctgtag aggagaaaag ggaggagaac gttaacagcg 2040 gcgatcgtaa ggacaagtct aagccctgtg ccgcttgccg gaaggatggt cacaaaataa 2100 gagattgcta cgccttccgc aagatgacac taggagatcg ctggaaactc gttcgagagc 2160 agtacctgtg tcgtcgctgc ctgggttccc acggaaagtt cccttgtaaa gcaagcaact 2220 gtggagagaa cggttgtgag gagcgccatc acaagcttct tcatcctgga aatccacaca 2280 cagcgaccga tcctccgaag actacaagca cggttacagt tcatcgccaa ttcaaccaat 2340 ctgttctatt taggatactt ccagtgtcgt tacatgggaa cggcaaaacc atcgagacgt 2400 acgcgtttct ggacgacggc tcctctcgga ctctggtgga gacgaaggtg gcagaagaac 2460 ttggagtggt tggtgacgtc catgcactct gccttcagtg gacgagtggt attgaacggg 2520 tggaagacga atcgcagctg ataaagttgc aaatttcagg tgtcccaaac ggcacacgac 2580 atgacttgaa ggaagttcac accgtgaaga gcctgaacct tccagttcaa tcgttggatt 2640 tcggcgagct ctgcaaacag tttccacatc ttcaaggcct tccggtgcaa agctacacca 2700 atgctgtccc cacgatccta atcgggttgg acaactcgtt cgtgatggca acccggaaaa 2760 gcagagaagg acccaagggt agccctatcg ctgccaaaac tataattgga tggactgtct 2820 atggaagttc cgcagcggaa actgggcagc catgcagtca tcagctattc cacatctcgg 2880 ttcgttctcc tgatcaagag ctgcacgacc tcgtgaaggt atttttcgcg ctagagagtg 2940 tgggaagtga tgaagaccgc cgtgggaagc agattctcga aactaccact gtgcgtacag 3000 tgtttggcag gtttcagaca ggcttgctgt ggaaatacga caacgtgaaa ttcccggaca 3060 gcagacccat ggcggaaaag cgattccgct gcctagaacg tcgtctagct tcgaatcccg 3120 aactatacga caacgtaaag aaccagatca atcagtacca ggtcaaagga tactcgcata 3180 aagcgagtcc tgatgagcta gccagtttca acccaaagcg gacttggtat ctcccgttga 3240 acgtcgtcac gaaccccaga aagccgaaca aagtacgact ggtgtgggat gcagctgcca 3300 aggttcaagg ccagtcgttc aattccgccc ttcttgctgg gccagatctt ctcgctccgc 3360 tgccagcagt attaagttcg tttaggcagt tcgaggtggc catcagtgca gatatatgcg 3420 agatgtttca ccagattaag attagaccag aagaccggtc ggcacagttg ttcctttgga 3480 gggatgatcc gtccaaacct ttcgacgtct tcgtgatgga cgtagccacg ttcggctcca 3540 cctgttcgcc atctgcggcc cgatatgtga aaaatcggaa tgccgaggag tttgccgaac 3600 agttcccacg agcagcagaa gcgatcacaa agcatcatta tgtggacgat tttctatgca 3660 gtgtccacac ggaggatgaa gcagtggaac tggccgagca agtgaggctc gtgcactcga 3720 aggctggatt caccattcgc aattggcttt cgaattcatc caatgttctc atacgggtcg 3780 gggaacagca aacagaatcg cccaagaaac ttctgatcga tcggtcggcc gcatacgaac 3840 gtgtcctggg catgatttgg aagccagaac atgatgtgtt tgtcttccaa ggtgtattcc 3900 gggaagaact tcaagtgctt attcaggatg actcgattcc gacgaagagg gaagttttgc 3960 gtgtcgtaat gagcgtcttc gaccccatcg gactagtagc tgttttcgtc gtccacggaa 4020 aggtgctcgt tcagcacatc tggcgcttgg ggcttggatg ggacgagcgt ataccagcca 4080 cactgcttga gcaatggaag cgatggatca agttgttgca gcagctggat catgtcgaaa 4140 tcccgcgatg ctactttcct ggctactctc ctgaagacta ccgtgccgta gagctgcaca 4200 ttttcgtgga cgcatccgaa gaagcatttg ctgcagtggc gtacttccgc atcgtcgaag 4260 gaccccgagt tcgatgttcc ctggtgtcag caaagacaaa ggtggcgcca ttgaagctgc 4320 tgtccatacc aaggctatcg aaaaccgttg ctgagaatca cacgattcca attagccgca 4380 aagtcttctg gagtgattcg tgcaccttcc tctcttgatt gcggtcggat cccagaaaat 4440 atcgacagtt cgtcgcattc cggctaacag aaatccacga tctaacacag gtcaacgagt 4500 ggcagtgggt accatcccgc tcgaacgtgg cggatgaagc cacaaagtgg ggcaaaggtc 4560 ccaacatcgc ctccgacagc cgctggttcc aagctccgac gttcctgtac caacatccga 4620 acttttggcc tcagcaagca acaaaaggag aagagaatca gaaccgccat ttttcattgc 4680 gacgtagtcg aaagcaacca tcgtcaagct gatccagcaa gatgctttcg gcgatgaact 4740 cgttatcctc aagattaacc aagaccttcc agttgtggaa caaaagagga tcgagaagac 4800 cagcaagata atcaagcttt cttccttcct ggacgagcaa ggtgttatcc ggatggatgg 4860 aaggattgca agcgtgcagc aggtctcatt cgacttcaag tttcccgtca ttcttcccaa 4920 aggacacgaa gggacgaaac ttctcgtaga ttgttaccat cggttgtaca cgcattgcaa 4980 tgctgaaacg gcagtcaacg agatgtgtca aaggttccac gtttcggaga tgaggattgc 5040 cttgaagcaa gccggaaagc tttgtcagtg gtgcaaggtc tataaggcag ttccggcgat 5100 cccgagaatg gctccactgc aagaagcaag gactacatca cacgttcggc tgttcaccta 5160 cgttggcttg gactacttcg ggccgatgct ggttaagcag gggcgcaacg aagttaagcg 5220 ttgggtcgcg ctgttcacct gcttgaccat acgcgcagtg catctcgagg tggtgtacaa 5280 cctgaccacg gagtcctgca agatggcggt taggcggttc attgcacaga ggggggcgcc 5340 gaaagaaata ttcagcgatc ggggcaccaa ctttgtgggg gctagccggg acttgaaaga 5400 ggagatacag cggatcacta ccaatctcgc aaatacgttc accaacaccg atacccagag 5460 gcatttgaat ccaccctaca cgccgcatat gggagggtct tgggaacgca tggtccggtc 5520 agtgaaaagc gcgctggcct cgttgtctac tggcgggaag ccggacgatg aaactcttcg 5580 cacactcctg atcgaagcag aatccatcgt gaattcgagc ccacttacgt acctgccgat 5640 agattccgag gagcaggaag cactcacccc gaactgtttc ttgatgctga gtaccagcgg 5700 agtcaacaag ccggctagag agccgatcga agagcgagca atagctcgct gtaactggga 5760 tttgggcaaa gttctactgg acaagttctg ggtgcgctgg atcaaggagt atctcccaac 5820 tatcaccaag cgcacgaaga ggttcaagga tcgcaagccc atagaagttg gagacttagt 5880 ggttgtagtg gaggaccgga tccggaatgg atggctaaga ggacgagtgc ttcgtgtgtt 5940 tccgggacgt gatggtcgcg tacggaatgt ggaagtcctg acagccagtg caggagtgct 6000 agttcaatca gttgcgaagc tcgccgttct agaagttggt ggtacagcta aaatagactt 6060 cgagcgggtc ggggga 6076 // ID Gypsy-24_OD-I repbase; DNA; INV; 5202 BP. XX AC CABV01002164; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_OD_; KW Gypsy-24_OD-LTR; Gypsy-24_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5202 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002164; Positions 9302 14503. XX CC Positions [3939-4412] - Integrase core CC 'GTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 76..1053 FT /product="Gypsy-24_OD-I_1p" FT /translation="MEQFIADFELLQKKVEETENADMQKILEKQLKAMKYE FT DYIGAKSGGGIGSAQSRKSCEASLDKRMLQAKDYDEVSLFCAHVDKVHASF FT PEIAETDFLRMVVIKFGQDAYARYKNSGKTFKDWTELRDWILLEFNSGLTT FT LQLIGRALDTQFEKNRGWKCFSITIENRLLSAEAAIVKSARDKKRGEKEMK FT EDEIEKYQPTTREIIAFFGAAIVSDRIRTEDNDLFNRMASQWKDCTSPNEV FT GNAAQYYYAQNVAAENFHAKSKKNDSPSKNEDRENGNNYSAEHSNGRGRGN FT GRGRGGTRGRGREYGRHEENGNYQPTALGTERME" FT CDS 2616..4904 FT /product="Gypsy-24_OD-I_3p" FT /translation="MGRLLTRDGVKLDPRNYETIQNMGPPTTRKSLLSAIG FT NFTWINQWLSANYGEKVAENCCSQLLKELHNCTRGDKRKFKMTPEALEAFE FT KAKTRISSDKVFGYADFNLPFILICDASTVAMGALLVQIQDGRQKIIAAAS FT KSFSATEQKWAANEREAYAIVFMCERFDYYLRGPRAFTVLTDHRPLTALDV FT KSFGSPKLARWQLRLQRYRMVIQYIKGTENVWADLFSRPYDVSPRKLVKDT FT TVMGDYYTVADDNSMEIYVPSWTQKSKLPKELVLHRAHVTSCYSIGYCLTT FT GFDSKMPISEFRTIEEYQSEDPSLSVIRDNLRQGVPREKWTQPDGSYDWSK FT LIRFADSMYIDKTTNLLLIALGGNQPKVVLPNALKNRYISDAHQKGHFGID FT RTSEMLHWAWWTHKTDDIRDYVSTCEYCAKRKGSYIQPATPQLKHVLRGQR FT PFDLIYCDYVHMPQGKTGKRYILTIQCGFSRFLYAVPQARNRSIDAARGLY FT NFMLQYGFPSTISSDKGRHFIGEVLRDFTKLLNIRQNLHCAFRPQSSGTLE FT RIHRVMKNSLWGVVNDQGCDWEEALPSVISWINRAHNKSIKSSPWKVIFGH FT DYNDSGLALPTSTSAQTATQYAREIRSILSSAHKMVQIAQQEADSVLDRKP FT PYFYPLAISPGDEVYVKRDFSAEAKRQKQPYIGPYTVLKCNDCILIIDMNG FT RNETVHRGHVVKKIERSPHQDDDILDLLSDSPPVIQPNAAPLRRSTRISKP FT PVRFQAGICSPN" XX SQ Sequence 5202 BP; 1634 A; 1299 C; 1130 G; 1139 T; 0 other; gtggtaccag aaaataaacg aactcatact ttttggaaga gaagcgaaga gcaccaaaaa 60 gtacgataat aaataatgga acaattcatc gcagacttcg aacttttaca aaagaaagtc 120 gaagaaacgg aaaatgcgga tatgcagaag attctggaga aacagctaaa agcgatgaaa 180 tacgaggatt atatcggagc gaaatcaggt ggcggaatcg gcagcgccca atcacggaaa 240 agctgcgaag ccagcttaga taaacgcatg ctacaagcta aggactacga cgaggttagt 300 ttattctgcg cacacgttga caaggtccac gcttcatttc ccgaaatcgc cgagacggat 360 tttttgagaa tggttgtaat caaattcggt caggacgctt acgcgagata taaaaactcg 420 ggaaagactt tcaaagactg gacagagctg cgcgactgga ttcttttgga atttaattcc 480 ggtctcacga cgctccagct gattggccgc gcgctcgaca ctcaattcga gaaaaatcgc 540 ggatggaaat gtttttcaat cacaatcgaa aacagactgc tctcagcaga ggccgcgata 600 gtgaaaagtg cacgagacaa aaagcgcggc gagaaagaaa tgaaagaaga cgaaatagag 660 aaataccagc cgacaacgcg cgaaataata gcgtttttcg gcgccgcgat tgtgtccgac 720 agaattcgca cagaagacaa tgatcttttc aatcgaatgg ccagccaatg gaaagactgt 780 acaagcccga atgaagttgg gaatgctgct caatattatt atgcgcagaa cgtcgctgca 840 gagaattttc acgcgaaatc aaagaaaaat gattcgccct cgaaaaacga agaccgagaa 900 aatggcaaca attacagcgc ggagcacagc aacggacgcg gaagaggcaa cggacgcggc 960 agaggtggca cgcgcggcag agggcgcgaa tacggccgcc atgaagaaaa tggtaattac 1020 caaccaacag cgctcgggac agaacggatg gaataatcag caacgcggct accaaccaaa 1080 tcaccaaggc tggaaccagc aaggccagag tcagaataat aacggctgga atcaacaagg 1140 ccaaggtcag aattcgcaat atcgccgcaa ttaccacgca gaaaacactc aaggtcagtt 1200 tgatcagagt tcgcaatacc ggcaaaatta ccacgcagaa aacgcacaag ggcagtttga 1260 tcagacgatg cccaaccaag cgccagaaca aaccactacg tcgctttttc atcagtgatc 1320 gagaaccggc acgcactttc gatcacttat aataagcgcg caactacagc tcactgcatc 1380 aaccttattc tttctctccg cgatcaacaa atctccaaaa cggcgctcgt ggacactgga 1440 tctacagtaa atttattacc agttcaaata attcctgatc atcttcgcca tttaattacg 1500 ccgaagaaat caacaataaa tggagtcggc gagctacaaa caaaaggtga aatttttatc 1560 gacatattta cgcgcacgaa ttatggactc gcgcggcacg tcaaatttgc agtggtcgaa 1620 tctccttttc cgatcattct tggaacgcca tttttcaatc atcaggattt cgtcgaaaaa 1680 agttacacga taacacacga tttcctacaa attactacaa aacgagacaa tattcgccat 1740 acgattccgc acacgctcga caatgttcaa gcgtacaccg cgcgctcaca acatgatttt 1800 agctcgaatg attcaaaagc tgactggcta ctaaaacaga aggagataat tatcccttcg 1860 gcccgtgaag atggaaatgc ccaccacggg ttagccgagc tcgtatacgc aaatagagac 1920 gtcttctgct cctctaaaga ggatattgga acatatcctg aagaagtctc gattaacaca 1980 gaacctggac gatcgaagta tgtccgccag cacccaattc ccgcgcagta cgaagaagga 2040 gtcgctttag aaattacgcg gatgatcgac aaaggtataa tcgaagagtg ccgtgacaac 2100 aaaggttggc attcaccgat ccacgtggtg cctaagaaaa acggctcact aaggattgta 2160 tgcaatttca aacctactgt taataagttt acaacagtag aaaacgacgc gtgggaaatt 2220 ccgaaaatcg acttgataat ggcaaatatc ggaagaaaga caaaatactt ttccagctta 2280 gacgtcgctt caggatattg gcatcttcca attcgcgctg aggaccgaca caaaaccgca 2340 tttagctggc gcgatcgtca attcatgttc cgcagacttc cttttggact cgatttcgcc 2400 ggatttagtt tttgcaaggc tatttgcaaa aggccctcga tacaattcaa aatcgccgca 2460 caaaatcacg aattatatcg acgatcttct gattcacagc gaaacgctag aagatcacct 2520 tgaaacacta aaccaacttt tttgcgcact aagaaaattc ggtataaaaa ttgggctccg 2580 caaaatgtaa gttcgcacaa aaggaagttc aatttatggg aagacttcta acccgcgacg 2640 gagtcaagct cgacccgcgt aactacgaaa caatacaaaa tatggggccc ccaactacgc 2700 gaaaatcact tctatctgcg atcggtaatt ttacctggat aaatcagtgg ctaagcgcaa 2760 attatggcga aaaagtcgcg gaaaattgtt gctcacagct gcttaaggag cttcacaact 2820 gtacccgggg cgataaacgt aagttcaaaa tgacgccaga ggccctggaa gcctttgaaa 2880 aggcgaaaac gcgtatatct tccgataaag ttttcgggta cgcagatttt aaccttccct 2940 tcatattaat ctgcgatgca tcgaccgtcg cgatgggcgc ccttttagtc caaatccaag 3000 acgggcgaca aaagataatc gctgcagcgt caaaatcatt ctccgcgaca gaacaaaaat 3060 gggcggccaa tgaaagggaa gcgtacgcaa ttgtttttat gtgtgaacga ttcgattatt 3120 atcttcgtgg tccacgcgca tttacagtac tcaccgatca ccgcccgctc accgcgctcg 3180 atgtcaaatc ttttggatca ccaaaactcg cgcgctggca attacgacta caaaggtaca 3240 gaatggttat acagtacata aaagggacgg aaaatgtctg ggcggacctt ttcagccgac 3300 cttatgatgt ttcgccgcga aaactcgtca aagacacaac agtcatgggc gactactata 3360 cagtcgccga cgacaactcg atggaaatat atgttccgtc atggacccaa aaatcgaagc 3420 tgccgaaaga actcgtcctt caccgcgcac atgtgacatc atgctacagt atcggatact 3480 gtctcaccac tggattcgac tcgaagatgc caatcagcga attccgaacg atcgaagaat 3540 accaatccga ggacccctct ctctcggtaa ttcgcgataa cttacgccaa ggagtaccgc 3600 gagaaaaatg gactcagccg gacggctcat acgactggtc aaaactcatc aggtttgccg 3660 acagtatgta tatagataaa acgacgaact tgctactcat cgcgctgggg ggcaatcagc 3720 ctaaagttgt acttccgaac gcgctcaaaa ataggtacat tagcgacgca catcaaaaag 3780 ggcacttcgg aatcgacaga acttccgaaa tgctccactg ggcttggtgg actcacaaaa 3840 ctgacgatat tcgcgactac gtttccacat gtgaatactg cgcgaaacgt aaaggttcgt 3900 acattcagcc agcaacacct cagctcaaac acgtgctccg tggtcagcgc ccctttgacc 3960 tcatatactg cgactacgta cacatgcctc agggaaagac gggtaaacgg tatattctca 4020 caatacaatg tggattctcg cgatttctat acgcagtgcc gcaagcgcga aatagaagta 4080 ttgacgccgc acgcggactc tacaacttta tgctgcaata cggatttcca agcacgatat 4140 cgtcggacaa gggacgacat tttatcggcg aagtattacg cgactttaca aagctactca 4200 atatccgcca aaatctccac tgcgcgttca ggccgcaatc tagcggtaca ttggagcgga 4260 ttcatcgcgt catgaaaaac agcctctggg gagtcgtcaa cgatcagggt tgcgactggg 4320 aggaagcgct accttcagtc atctcctgga taaatcgcgc gcacaacaag tccatcaaaa 4380 gttcgccttg gaaggtaata tttggacacg attataatga cagtggactt gcgcttccaa 4440 caagtacaag cgctcaaact gctacccaat atgcccgcga aatacgaagt atcttatctt 4500 ccgcgcacaa aatggtccaa atcgcgcaac aggaagctga ctctgtccta gatcgtaaac 4560 cgccttattt ctacccatta gcaatttcgc ctggcgacga agtctacgtc aaacgtgact 4620 tcagcgccga agccaaacga cagaaacagc cttatatcgg accatacaca gtactcaaat 4680 gcaacgactg tatcctgatc attgacatga acggccgtaa cgaaactgta caccgcggtc 4740 acgtggtgaa gaaaatcgag cgctcgccgc atcaagacga cgatattctc gacctacttt 4800 cagactcacc gcctgttatt caaccaaacg cggctccact tcggcgaagc acaaggatct 4860 ccaaaccgcc cgtcagattc caagcaggaa tttgctcacc gaattaaaag tcgaggtagt 4920 ctatttcctt acattattct catcgcgtta tttatgcaat cacatctctt ctacgtcatc 4980 aatttcatgc cttcactaat ttatcgcaac tgggatgctc aactctactc acgcttctga 5040 aatatgctct tcaaattaat cgccgaggta gctaatgaca tttcatattt tcaatatcta 5100 ttgtgctcac ctcgtcaatc gccaattacg acgtagacga cttcaacgac tttgcaaccc 5160 aaccacgcga caccgaccgc ctcaagtcaa cggccggggg gc 5202 // ID BEL-15_CQ-I repbase; DNA; INV; 6497 BP. XX AC AAWU01030696; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_CQ_; KW BEL-15_CQ-LTR; BEL-15_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6497 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 183-183 (2011). XX DR Genome; AAWU01030696; Positions 23506 30002. XX CC Positions [5544-6104] - Integrase core CC 'CTTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..3072 FT /product="BEL-15_CQ-I_1p" FT /translation="MSKPPITRQTRSQTRAAQELARHNLQTEEVVDEGADL FT QFEASVVDLQGGEEVDCGSCIRPNNAETFMVQCSRCFLYYHFSCAGVTVAT FT VNLQPFICKTCIPLGPPEKIPSARSGTTSSLRSSQLSLELRRVEEEIQLEE FT EISQQLERKKKLIAKRYDLMSQRDDVKSRSSGRSNRTTPNRVLNWIETQGK FT AGKSDPADSVNLLSGNPVGRLNAGLTSTPVNKSKSVDDLAQLAKLPAGGIS FT GPDPNKPPGQLDLDSIPEVPNKPQNNPFSKGAVGKLFKPTTLYENWLDQIN FT RKPPANKPDDVGRQLADKALAEKLARLEDQRKQDQNLQRERERKLKHQLQR FT AEQERLELIREKDRVLAENQNHQKRFDEFQEDLINQFNEMLNQQQNNINSR FT PESSDNWYDRELQQLQRSRELNFKRNEPTPTHSVHSSLPSISEGIGSLPSP FT YGGRTESSHPTIPPQEDPEVSSCSTIPPRPVPSGSHTRPSKSCLDPPPLTS FT HQIAARQVVSKELPDFFGDPAEWPLFISRYNHSTKACGYTDSENLLRLDRA FT IKGYAREMVSSLLLDPSTVPELLSSLKLLYGRPEQIVHNLIAKVRATPAPK FT ADKLESLVTFGLVVRNLCGHLRAIGMEKHLSNPTLLSELVEKLPANVKFNW FT ALYQHQLKEVDLNSFGSYMSQIATATSGVTVLSTAPRAGRDERQKSKEKGF FT VNAHAEQEETSDGEGRDQPKERLNVVNKIETHKACPACGVEGHSAKDCIQF FT KDLSLDDRWKVVKAKKLCRRCLTPHSRWPCNAEACGVNGCQKRHHRLLHYD FT HPPVEPQRTEPTNATVTIHRQPNPTTLFRILPVTLYGARGQVDTFAFLDDG FT SSVTLVERSIANKLGIAGEDSSLCIHWTGGIKKNISNTQLVDLEISGVNNT FT NRLKAKAVYTVEGLGLPDQSMNFEAMAAKYDYLRSLPVQNLMSAVPGVLIG FT LNNLHLMAPLKLREGREGEPIATKTRLGWAVYGAIPGREAPFLHRQMHTRI FT VMNYITTL" FT CDS join(3459..5159,5163..6497) FT /product="BEL-15_CQ-I_2p" FT /translation="MFTTIICDVIYVAAELLPPYPIDLYLLVVPDLLSPLL FT SVLFKFRERQVAIGGDIEAMFHQVRIREADRSAQLFYWRDSPEKPLETMVT FT DVAIFGASCSPAHSQYVKNLNATQHEIDYPRAANAIRTKHYVDDYLDSVDT FT SEEAAEIALEVAEVHSRAGFHIRNWVSNNAAVLQRIGAANPTTVKRFVMDK FT DCGFERLLGMVWLPEEDMFTFRLSFREDHDLLLSGEEIPTKQQLLSVVMSI FT YDPTGFLAAFIIHGKIIIQDVWRSGVDWKNKIPDELVQRWKQWIALLPKIE FT DLKIPRCYFPGYHPDSLRSLELHVFVDASELAYSAVAHFRLVDRGSVRCAF FT VASKTKVAPLDPLSIPRLEANAGVLGVRLRKSIVTGHSLPITRTRFWTDSK FT TVLQWIRSKDLRRYRPYVAFRMNEILSMSAVAEWGYCPSRLNVADLATKWG FT KQGPPLDINSPFYQSQEFVYDNPSEWPEPCEDMVELAPEELRPSFVFAHFV FT LKPTVKWERFSKWERLLRTMAYVQRFIARKLNREKKPWQVDLTREELQKAE FT QSLWRLAQSEEYPDEVATLNNQRVPADQRESLERSSNIVKLCPMLDDAGVL FT RVSGRIQAAECVGYETKHPVILPRDHPVTTLLLEYYHRRFQHSNNETVVNE FT IRQNFSIPKLRVLVRLVSNNCKWCEVYKSTPVVPQMAPLPRARLSPFLRPF FT TFTGVDYFGPYLIKTGRSITKRWVALFTCLTIRAVHLEIVANLSTESFKKA FT VRRFIARRGAPQEIYSDRGTNFQGASSELTKEISLSINKELASTFTDAHTQ FT WRFNPPAAPHMGGSWERMVRSVKAALGAIPLERKLDEESMVTMLTEAEHMV FT NSRPLTFIPLEGADQESLTPNHFLLMSSSGVQQPVKDPMCEGAALRNSWNQ FT IQHTLDEFWRRWVREYLPTLSRRSKWFEETRPLREGDLVVVVDEGTRNRWQ FT RGRVLRTYPGKDGRVRRVDVQTAKGILPSRAVARLALLDVGNVGDAEGTLP FT ATRGGE" XX SQ Sequence 6497 BP; 1750 A; 1590 C; 1685 G; 1472 T; 0 other; gaaaactcaa agatttacct gttcaacaat atgagcaagc caccgattac tcggcagact 60 cgttcccaga cgagagctgc ccaagagctt gcgagacata acctacaaac cgaggaagta 120 gttgatgagg gtgcagatct acaatttgaa gcctctgtgg tagaccttca aggaggagaa 180 gaagttgact gcggtagttg cattcgaccc aataatgccg agacgtttat ggtccaatgt 240 tcgcgatgct tcctctacta ccacttctcg tgtgcgggag tgaccgtcgc caccgttaat 300 ctccagccgt tcatttgtaa gacctgtatt cctctcgggc cgccagagaa aatcccttca 360 gctcgctctg gaacaacttc gagcttacgt agctcacagc tttcgttgga actacgaaga 420 gtggaggagg agatacaact ggaagaggag atttcccagc agcttgaacg caagaagaag 480 ttgattgcta aaagatacga tcttatgagc caaagagatg atgtcaagtc gcgcagcagc 540 ggtcgtagta accgaacgac gccaaataga gttctcaact ggatcgagac acagggaaaa 600 gccggtaaat ccgaccctgc cgattccgtg aacttactga gcggtaatcc ggtcggccgg 660 ctcaacgcgg gactcacatc tacaccggtg aataagagca aatctgtaga tgacctggca 720 caacttgcaa agcttccagc tggaggaatc agcggtcctg atccaaacaa acctccgggc 780 caactggacc tcgattcgat tcccgaagtt ccgaacaaac ctcagaacaa tcctttctcg 840 aaaggagccg ttgggaaatt gttcaaaccg actacgctgt acgaaaactg gctggatcaa 900 ataaatcgga agccaccagc taacaagccg gatgatgtcg gccgtcaact ggctgataaa 960 gcattagctg agaagcttgc acgattggaa gatcaacgta aacaggacca gaacctacaa 1020 cgcgaacgag aacgaaagct gaaacatcaa ctgcaacgtg cagagcagga aagactggag 1080 ctgattagag agaaggatcg tgtcctggcc gaaaaccaaa accaccagaa gcgatttgac 1140 gaatttcaag aggacttgat aaaccagttc aacgagatgc taaatcagca gcagaacaac 1200 atcaatagcc gacctgaatc ctcggacaac tggtatgacc gggagctgca gcagttgcag 1260 cgatcacgag agttgaactt caaacgaaac gaacccacac caacacacag cgtacattct 1320 agtttgcctt ccatttcgga gggaattgga agcctgccga gcccgtacgg aggacgcacg 1380 gaatcatctc atcctacgat accgccgcag gaagatccag aagtttcaag ctgctccact 1440 attccaccac ggcccgttcc gtccggttct catacccgac catcgaaatc gtgtctggat 1500 ccacccccgc tcacatccca tcaaatagct gcgcggcaag tggtgtccaa ggaactgcca 1560 gacttctttg gtgatcctgc tgaatggcca ctgttcatca gtagatacaa ccactcgaca 1620 aaagcgtgtg gctatactga ttcggaaaat ctgctacgtc tagatcgagc tatcaaaggg 1680 tacgcgaggg agatggttag cagcctactg ctcgacccat ccacggtgcc ggagttactg 1740 tcctcgctga aacttctcta cgggagaccc gagcagatcg tgcacaacct gatcgccaaa 1800 gtgcgagcca caccagcacc aaaggctgac aaattggagt cgttggtgac ctttggattg 1860 gtagtgcgta acctgtgcgg acacctaagg gcaatcggaa tggagaagca cctttcgaat 1920 ccgacattgt tatccgagct ggtggagaag cttcccgcca acgtgaagtt caactgggct 1980 ttgtatcaac accagctgaa agaggtagac ctgaattcct ttgggtctta catgtctcag 2040 attgcgacgg cgacgagtgg agttaccgta ctgtctacgg cccctagagc tggtcgggac 2100 gagcgtcaaa aatctaagga aaaaggattt gtcaacgcgc acgctgaaca agaagagaca 2160 agcgacggtg aaggtagaga ccagccaaag gagcggttga acgttgtcaa caagatcgaa 2220 acccataagg cgtgtccggc ttgcggcgtc gaaggtcact ctgcgaagga ctgtatacaa 2280 ttcaaggatc tgagtttgga tgatcgctgg aaggtggtta aagcgaaaaa gctttgtcga 2340 cgttgtttga ctccgcactc ccgttggccg tgtaacgctg aagcctgcgg ggtgaacggt 2400 tgtcaaaaga gacaccatcg acttctgcat tacgatcacc caccggtaga acctcaacgt 2460 accgagccga cgaacgccac ggtcacgatt caccgccagc caaatccgac gaccctgttc 2520 cgaatcttgc cagtcacgct gtacggagcg cgggggcagg ttgacacgtt tgccttcttg 2580 gatgacggat cttcggtgac ccttgtggag agatcgattg cgaacaagct gggaattgca 2640 ggagaagatt cctctctctg tatccactgg accggtggca tcaagaaaaa catctcgaac 2700 acacaactgg tagatctgga gatttccggg gtcaacaaca caaatcgact taaagcgaaa 2760 gcagtttaca ccgtcgaagg attgggattg ccagaccagt cgatgaattt cgaggcgatg 2820 gcggccaagt acgattactt gcgaagtctt cctgtacaaa acctgatgtc cgcggttcct 2880 ggtgtactaa tcgggttgaa caacttacac ctgatggccc ccctgaagct acgtgaaggc 2940 agagaaggcg aaccaatcgc cacgaaaacc cgtcttggct gggcagttta cggagctatc 3000 ccggggcgtg aggcgccctt cctacaccgg cagatgcaca cccgcatagt catgaactat 3060 ataactactt tatgacaaat agttactcaa ctatttatca aatggtctct actattcatt 3120 aaattgtaac ccaactatat atgtacgata tatcaaacca tttattccat tcccaaaata 3180 gttttggtcg attttactat atgagaaatt agttgttagg cagaaaacta tatgaggttt 3240 tcatacatat gtaaatattt ttataaactt ggattttacg tcaaatagtc attgcccaaa 3300 aaattgacta ttccctatat agtttctgaa aaactacttt accgacacta ctttgcggac 3360 actttaaaaa agaccgcaaa aatgaaccct acaattgttg tgtaaactac ttcggatata 3420 atcaaatttg atttttaagt tctatcgata gttggtgaat gtttaccacc attatatgtg 3480 atgttattta cgttgctgcc gaactgctgc cgccttatcc gattgatcta tatctgttgg 3540 tggttcccga cttgctttca cctctgctgt ccgtgctctt caagttccgg gaaaggcaag 3600 tagcgattgg cggtgacatt gaggccatgt tccaccaggt tcggatacgc gaagcagatc 3660 gaagtgccca gctgttctat tggcgtgatt cccctgaaaa gccgttggaa acaatggtaa 3720 ccgacgttgc cattttcggt gccagctgtt cacccgcgca ttcccagtac gttaaaaatc 3780 taaatgccac tcagcacgaa attgattacc ccagggcagc aaatgcaatc cggacaaagc 3840 attatgtcga cgactacctg gatagcgtcg acacatcaga ggaggcggca gagattgcgc 3900 tggaggttgc cgaagttcac tctagagccg ggttccacat ccgaaactgg gtctccaaca 3960 acgctgcggt actgcagaga atcggagcgg ccaatccaac aacggtgaag cgtttcgtga 4020 tggacaagga ttgtggtttt gagcgcctgc tgggtatggt gtggttgccg gaggaggaca 4080 tgtttacgtt tcgcctgagt ttccgagaag accacgattt gttgctgtcc ggcgaagaaa 4140 tccccaccaa gcagcagcta ctgagcgttg ttatgagcat ttacgatcca accgggttcc 4200 tagcagcgtt cataattcac ggaaagatta taatccagga cgtttggagg tcaggcgtgg 4260 attggaagaa caaaattcca gatgagctag tccaacgttg gaaacaatgg attgcgcttc 4320 ttcccaagat cgaggacttg aaaatccctc gttgctactt ccctggttac cacccggaca 4380 gcctgaggtc actagaactc catgttttcg tcgacgcgag tgagttggcc tattctgctg 4440 tggcacactt ccgacttgtc gacagagggt ccgtgcggtg tgctttcgtg gcaagcaaaa 4500 ccaaggtggc gccattggac ccgctatcca taccgcgtct cgaagcaaac gcgggtgtgc 4560 tgggtgttcg tttgcgtaaa tccatagtga ccggtcattc tctacctatc acacgaactc 4620 gtttctggac tgactcaaag accgtccttc aatggattag gtccaaggat cttcggcgct 4680 accgtccgta cgtggccttc cggatgaacg aaatactgtc catgtcagcg gtcgcggagt 4740 gggggtattg tccgtcccgt ctgaacgttg ctgatctcgc taccaagtgg ggaaaacaag 4800 gtccacctct ggatatcaac agtccttttt accagagcca ggaatttgtg tacgataatc 4860 catcggaatg gccggaaccg tgcgaagata tggtggagct ggcaccagaa gaattgagac 4920 cgtcattcgt gtttgctcat ttcgttctga agccaacggt gaaatgggag aggttttcaa 4980 agtgggagcg actattgcgt accatggcgt acgtccagcg atttatcgct cgcaagctga 5040 accgcgagaa gaagccgtgg caagtcgacc tgacccgaga agagctacag aaagctgagc 5100 agagtttgtg gcgtcttgca cagtctgagg agtatcctga cgaggtggcc actttgaatt 5160 agaatcaacg agtcccagct gatcagcgag aatccctgga aaggtcaagc aacattgtta 5220 aactctgtcc catgctggac gatgctggag tacttcgcgt aagtgggcgc atacaagctg 5280 cagaatgtgt cggctacgaa accaaacacc cagtaattct tccacgagat cacccggtga 5340 cgactttact gctggaatac taccatcggc ggttccagca ctcaaataat gaaactgtgg 5400 tgaatgaaat tcggcagaat ttcagcattc ctaagctgcg ggtgctcgtg aggctcgtgt 5460 ccaataactg caagtggtgc gaagtataca agtctactcc agttgtacca cagatggcac 5520 cactcccgcg agctaggttg tccccattcc tgcgtccgtt tacattcacc ggagtggact 5580 actttgggcc ctacctgatc aagacggggc gcagtattac aaaaagatgg gttgcgcttt 5640 tcacctgcct gacgatcaga gctgtccatc ttgagatcgt tgcaaacctt tcgacggagt 5700 cgttcaaaaa ggcagttcgg aggttcatcg cgcgtagagg ggcacctcaa gaaatctatt 5760 cggaccgagg aaccaacttc caaggagcta gcagtgagtt aacgaaggaa ataagtttga 5820 gcatcaacaa ggagctggcc agtacgttca ccgatgctca tactcagtgg cgtttcaatc 5880 cccccgcggc accgcacatg ggaggctcgt gggagaggat ggtgcgttct gtaaaagctg 5940 cgctaggagc gataccgttg gagcgcaaat tggacgagga atcgatggtg accatgttaa 6000 cagaggcgga gcacatggtg aattcccgcc cacttacgtt tatccctcta gaaggcgccg 6060 atcaggaatc gctcacgccg aatcattttt tactcatgag ttcaagcggg gttcaacagc 6120 cggtgaagga tccgatgtgc gaaggcgcgg ctttgaggaa cagctggaac cagatccagc 6180 acaccttaga tgagttttgg cgacggtggg taagggaata tttgccaact ttatcaagac 6240 gatcgaaatg gttcgaggaa acgcggccgc tcagagaagg agatcttgtg gtcgtcgtag 6300 atgagggaac gcgaaatcgc tggcaaagag gacgagtttt gcggacgtat ccgggcaagg 6360 acgggcgtgt acgacgggta gatgtgcaaa cagctaaagg gattctacca agtcgcgcag 6420 tcgccagatt ggcattgctg gatgtaggaa atgttggtga cgccgaagga acacttccgg 6480 cgacacgagg gggagaa 6497 // ID BEL-644_AA-LTR repbase; DNA; INV; 443 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-644_AA_; KW ao_Bel_Ele11; BEL-644_AA-I; BEL-644_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-443 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 443 BP; 148 A; 92 C; 75 G; 128 T; 0 other; tgtgctggct ggtagcactg cgtgagcagg tccaccacag cgccagaagc ttatgcgttg 60 acaaccggcg agtacggaaa ctatccacgt caagataacg tttcctgcca tcacatatca 120 ctttgtttca ttgagctgcg agtgaagacg cgcctttggt attataattc ttattcctaa 180 atttattccc aattaaaatg aatttgttgc tacttataga ctagaattgc tacttaaact 240 aaatttgtta ctacttcatc taaatttgtt tctacttaaa actattggaa gaccgaattt 300 gtaagtatta aatgagaaaa cctattgagc aaatctaaca aataaaacat atttcagctt 360 gaagctagac acaccataac cctagttgtg cttgctaaaa agaatcggtt agaacccaaa 420 aagaaccaac cccagtagaa aca 443 // ID Copia-20_DPu-LTR repbase; DNA; INV; 307 BP. XX AC scaffold_175; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_DPu_; KW Copia-20_DPu-LTR; Copia-20_DPu-I. XX NM Copia-20_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-307 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 704-704 (2010). XX DR Genome; scaffold_175; Positions 143409 143715. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 307 BP; 93 A; 72 C; 45 G; 97 T; 0 other; tgttgaagtt agcgaacaac ccaagaaatc caaaaggagt tgctaaaacc catactctgg 60 aagttggcct ttagcaacac ccttctccca ctccctcctg ttaactcccc caagttaatg 120 acgctagaga tttgacaaga gcccctcagc cccagttcct ctttatctgt tctcctgaag 180 actgtggtgc tttactaagt ataatctata catatcatta tattctgtgt catatccatt 240 gtgctttatg ttaaaataaa agtcagttag caaagtatat ttatcttaat tgattataat 300 cacaaca 307 // ID Nematis_Cr repbase; DNA; INV; 2446 BP. XX AC . XX DT 20-DEC-2006 (Rel. 11.12, Created) DT 20-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Nematis_Cr is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Nematis_Cr. XX OS Caenorhabditis remanei OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2446 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Nematis_Cr is a Penelope-like element (PLE) from the sequenced CC genome of the nematode Caenorhabditis remanei. It belongs to the CC Nematis group of PLEs. Its ORF contains regions homologous to CC reverse transcriptases and to GIY-YIG endonucleases. The element CC is very low-copy and probably inactive. Consensus sequence was CC assembled from trace archives. XX FH Key Location/Qualifiers FT CDS 210..2309 FT /product="Nematis_Cr_1p" FT /translation="MLLSKGPKFAPSGKFNQKVLSRISMGFVSLAATLRTR FT AASRSQNGITWDTLPPIPFPPAHFFLHPKSDKTDQQVAAAFNIFMKKINEQ FT KCLHIADNMSKKMWSALKELGQNKDINITVSDKRGEFVITTNAFYRESTIL FT HLTDTSVYTKITKTEYNEEVKKFYGGIESVLKSWNKKTADRLTDCHPSKNT FT LYILYKTHKFEERGEKATPSNTKVRPIISGVGGPTDRPSWVVCTIISQFLQ FT FVGCHLQNTNEILKSLNDIRGKKIKTEIFYESFDVESLYTNIDNEAAYEVV FT ITKLKQHYAQIKWYGVSFRDIKSLLKTCLNFNAFVFNEQHYVQKRGLAMGS FT RLAPVLAILYMDKLESPSKSLPTLVFKRYIDDFIVVAESKEVLDSVFKLLN FT AQTPNIRLTRETPSNGWLPFLNMEITVEKGIFATRWYRKKANKNLLLPIDS FT HHPTKQKRNIYDVTKSTAMSMSSITHRDYSMELAESLLRKNGYNSRPKCPR FT FTFSEKKQVSNSNGKLFTDLPILPIPFVSDWTTNTVRNTLAQVGIKAMIIE FT LKSPNLRDRLMKSRRFDNKCQRRQCRVCPFIGNGGCGKKGVIYRIDCDCGD FT FYIGETGRPLAERFNEHSRAAEKPGTPSYKTTIWSKHSFEKHQGSPLSLKL FT SILETERNTTRRRILEGIYIKTVNPTLNTKEELSDMVADLGFGITLTRKL* FT " XX SQ Sequence 2446 BP; 839 A; 512 C; 511 G; 584 T; 0 other; gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga 60 gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga gttcaaacaa 120 agaagagtca cagaacaaga gagcagaagg ttgcactatt ggagaatatt acaatactag 180 gagacgtcaa tatctcggaa gctgcaaaaa tgctactaag caaaggccca aaattcgctc 240 cctctggtaa gttcaaccaa aaagttctca gcagaatcag catgggattt gtctcattgg 300 ccgccactct tcgaaccaga gcagcatctc gctctcagaa tggcatcacc tgggatactc 360 tacctcctat tccgttccct cctgcccact tctttcttca tcctaaatca gataaaactg 420 atcagcaagt agcagcagcg ttcaatatct tcatgaaaaa gataaatgaa caaaagtgcc 480 tacacattgc tgataacatg tcaaagaaga tgtggagcgc acttaaagaa ttgggacaga 540 ataaggatat aaatataact gtgtcggaca aaagaggtga gttcgtaatt acgactaatg 600 ctttttatag agaatcgaca attttgcact taacggacac ctcagtgtat acgaaaataa 660 cgaaaactga gtataacgag gaagttaaaa aattctatgg cgggatcgaa tcagtactga 720 aatcatggaa caaaaaaaca gctgatcgac taacggactg ccacccttcc aaaaacacgc 780 tctacattct gtacaaaacc cataagttcg aagaaagagg agaaaaggct acaccctcaa 840 acacgaaggt aagacctata atttcaggag taggaggccc cacagaccgt ccttcgtggg 900 ttgtgtgtac gatcatctcc caatttctac aatttgttgg atgccacctt cagaacacca 960 atgaaatcct gaaatcactg aatgacatcc gaggaaaaaa gatcaaaaca gaaattttct 1020 acgaatcgtt cgatgtagaa tcgctctaca ctaacataga caacgaagca gcgtacgagg 1080 tggtcattac caagttaaaa caacactatg ctcagataaa atggtacgga gtatcgttca 1140 gagacatcaa aagcttactt aaaacgtgcc tcaatttcaa tgcgtttgtg tttaacgaac 1200 agcactacgt tcaaaaaaga ggtttggcca tggggagccg tctggccccg gtcctagcga 1260 ttctatacat ggataagttg gagtcaccaa gtaaaagcct gcctactcta gtattcaaac 1320 gatacattga cgattttatc gttgtcgccg aatcaaaaga agtgttagat tcagtgttta 1380 aactgctaaa tgcccaaaca cccaacataa ggcttacaag agaaactccg agtaatggat 1440 ggcttccatt cctgaatatg gaaattactg tggagaaggg aatttttgct acaaggtggt 1500 ataggaaaaa agccaacaaa aacttgctac tcccgataga ctcacatcat ccaacaaagc 1560 agaaaagaaa catctatgac gtcacaaaat ctaccgccat gagcatgtca tcgattactc 1620 atcgtgatta ttccatggaa ttggcggaga gtttactgcg gaaaaatggt tataacagcc 1680 gtcctaaatg ccctcgcttt acattctcgg aaaaaaagca ggtaagcaat tccaacggta 1740 aactttttac agatcttcct atccttccta ttccttttgt atcggattgg acaacaaaca 1800 ctgtaaggaa cactttggct caagttggta tcaaagctat gattatcgaa ctcaaatccc 1860 ctaaccttag agacagactt atgaagagtc gtagattcga caataaatgc caaaggagac 1920 aatgtcgtgt ctgccctttc atcggaaacg gaggatgtgg aaaaaagggc gttatttaca 1980 gaattgattg tgactgtgga gatttctata ttggtgaaac tggccgccca cttgcagaaa 2040 gatttaacga acattcgaga gcagcggaaa aaccaggaac gccatcgtac aaaacaacca 2100 tatggtcaaa gcattctttc gaaaaacacc aaggatcgcc tctctcgctt aaactatcaa 2160 ttctggagac cgaacgaaat acgacacgca gaagaatcct ggagggtatt tacataaaaa 2220 ctgttaatcc cactctgaac acaaaggaag aattatcaga tatggttgct gatctgggtt 2280 ttggaatcac actaacgcgg aagttgtaaa cacattaaca cattcattat cttctcattc 2340 ccatcgctac ttttcttaac gctaacggtc tttatcctct cactcccacc tgtgtttgtt 2400 tgtcattccc acttgtgttc tcttggctca ccagtggtgg tttacc 2446 // ID SAT-2_AAe repbase; DNA; INV; 468 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Satellite-type sequence: consensus. XX KW SAT; Satellite; Simple Repeat; SAT-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-468 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1452-1452 (2011). XX DR [1] (Consensus) XX CC 117-bp unit. XX SQ Sequence 468 BP; 168 A; 120 C; 84 G; 96 T; 0 other; caaaatttga atttagcgca taacagtaat tccagattca tcccggtgcc ccaggagaat 60 cgaaagtgac cacctaagcc ccgaggtgta caaaatgcat tcctgcacaa aaaactacaa 120 aatttgaatt tagcgcataa cagtaattcc agattcatcc cggtgcccca ggagaatcga 180 aagtgaccac ctaagccccg aggtgtacaa aatgcattcc tgcacaaaaa actacaaaat 240 ttgaatttag cgcataacag taattccaga ttcatcccgg tgccccagga gaatcgaaag 300 tgaccaccta agccccgagg tgtacaaaat gcattcctgc acaaaaaact acaaaatttg 360 aatttagcgc ataacagtaa ttccagattc atcccggtgc cccaggagaa tcgaaagtga 420 ccacctaagc cccgaggtgt acaaaatgca ttcctgcaca aaaaacta 468 // ID CR1-62_AAe repbase; DNA; INV; 5435 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-62_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5435 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1149-1149 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 23 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 394..1461 FT /product="CR1-62_AAe_1p" FT /translation="MESCQICSTSLDSRRAINCSGSCGRIFHFACVGMSKS FT HYSTWCAKIGMFWFCNSCRLNFEPAVYDREKTIMKALRELLIRTDSMDTRL FT GNYVENLRKINNNLSGSHLSSKAGNTSTTSIPTKSFAQRIDELTLDDTYDD FT PIHRSRSGDDTSFFEVLDEVNGSLALLPDKFVVGSNKRVHIVAKPSSSSNR FT NVHRSNVSSPAASVNHRYVSTKQTSATDKNTFSPEFAGVKEFRNPSGTRVE FT SRPTSIPLKVANSVQTPGDVESFYVTPFAPDQLADEVKQYVIEISNTDPSL FT VNVTKLVPRGKNVEELSFVSFKVTISKSASSVVGDQWYWPDGITVRAFEPN FT PKNGAVTRLPTIS" FT CDS 1416..5036 FT /product="CR1-62_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="TESKKWRCYQASDHFVTQRGFADFLNPSSVDLGRMTF FT PVSSVHEGRVTFPVLSGNHLNSICPAEVLIDTAPTLKGSTHFQFHDSSTAT FT HPPTIENESLSNFGSHHSGTVPVLSGKIDQSICLAEAGKDTAPSLKSVSRL FT QSSNTANNRPAAIGPPLASLLVDPLAGDVHRVNESTVTTVAGCNTYHLASF FT PTVSGKSESGICHAEAGNDTAPNVSESSPLQHAPSWRVQPTAVNALQCSSS FT NNLLIYYQNVRGLRTKIDDFYLATAESKFDVIVLTETWLDERIYSAQLFGS FT QYTVFRNDRNQENSTKSRGGGVLIAINRRLCCSLDSSPISSSLEQIWVKIK FT GQYRSLSLGVLYLPPDRKSDLECIHNHVNSIGNVLGQLALKDLALIFGDYN FT QSNLVWIKQENKPPTIDILRSSISASCSALLDGFSLHGLVQINTVLNRNSR FT LLDLLWVNDIALSDCSVHESIDPLIDLDSDHPALETCVNMSSPITFESTDD FT VEGLDFRNANITALKQILAQTDWHGLDTACNVDEAVDHFTQIVNQAIIGNV FT PLRRPHPKPIWGNARLRYLKRSRSAALRRYCQYRNPSTKRIFIEASIDYRI FT YNRFLYRRYTRQTQENLRLNPSQFWSFVKSKKNEEGLPVEIFLGDRQARTS FT ADKSNLLADHFKQVFSSSSASSVLVHKAIEHTPSDILSLDVIAITPELIGT FT AISKLKSSYAAGPDGIPSCLLKKCSTELIEPLAKIFKLSFEQGVFPKRWKA FT SYMFPVHKKGDKRNIENYRGITSLCACSKIFELIVNDALFSACKSYISTDQ FT HGFFPKRSICTNLVPFTSMCLRTMETGAQVDVVYTDLKAAFDRVDHGILLA FT KLNKLGVSGAMIRWFNSYLTDRLLCVKIGTAESYYFTNPSGVPRGSNLGPL FT LFTIFINDVGLILPPECRSFYADDVKLYIIVRCFRDCLQLQSLIHSFETWC FT SDNCLTLSVHKCNVITFHRSKSPILHDYKMNGQSLQRVNNIRDLGISLDAR FT LTFNQHYSDMIAKANRQLGFIFKIADEFHDPLCLKALYCALVRSILEFGSV FT IWCPYHATWIARIEAIQRRFVRYALRYLPWNDPSNLPPYEERCQLLGIETL FT EHRRTTAQAVFVAKLFTGEIDSPEIIGQIGIYVPERNLRSRNFLHLGSRSS FT NYGMHDPIRFMSTRFNEFYSIFDFHDTSTTFCRRIQRELIDRQRNIPI" XX SQ Sequence 5435 BP; 1494 A; 1275 C; 1082 G; 1582 T; 2 other; cctggcaaca ctgctcgttt gtttatgtgt tccgtattgg caccagaagt atttcgtttt 60 agtttgtgac aatctacggt tttattgcca agttccgttt catctaattg cttttgtgtt 120 ttacttcctg acgtcgtcga agttttatcg gaaaacggac tcgataacat gtttttgtga 180 agtgaatatt tctgctttgg acgattagca tcacaatata tcccttccca cgcaawtttt 240 ttccatccgg agcgacatct gttgggcact agcgatatcg taatcttgca tccaacctaa 300 cattcaactc tgcccgatca cacaatcgca tcgtcgcata caggcataca atattcgtca 360 tcaagaaact caatcatctt cgatttctgt gagatggaat cttgccaaat ttgttcaacg 420 tccttggact ccagaagggc tattaactgc agtggctctt gtggcaggat atttcacttc 480 gcctgcgtag ggatgtctaa atcgcattac tctacttggt gtgcgaaaat tggaatgttc 540 tggttttgta attcatgccg tttgaacttt gagccagccg tatatgatcg tgagaaaacg 600 atcatgaaag cgttacgtga attgctcata cgcactgact ccatggatac gcgcctcgga 660 aattatgtgg aaaatcttcg aaaaatcaac aacaatctct ctggatcaca tctgtcgtca 720 aaagcgggca acacctcgac cacatcgatt ccgactaaat ctttcgccca acgcatcgat 780 gagttgactt tggacgatac ttatgatgat ccaatacatc gttcgaggtc aggcgacgat 840 acttcatttt ttgaagtgct tgatgaagta aatggttctc ttgcgttgct ccccgataag 900 tttgtcgttg gatcaaacaa gagagtccac atagtggcca agccatctag ctcaagtaat 960 agaaatgtgc accgttcaaa tgtgtcatca ccggctgctt ccgtaaacca tcgctatgtt 1020 tccaccaaac aaacatctgc taccgataag aacacgttct ctcccgagtt tgcaggagta 1080 aaggagttca gaaatccttc tggcactcgc gttgagtctc gtcctacctc cattccgctg 1140 aaagttgcaa atagcgtgca aactcctggt gacgttgagt ccttttatgt gactcctttt 1200 gcgcctgatc aacttgccga tgaagttaaa cagtatgtta tcgagatctc aaataccgac 1260 ccgtccctgg tgaacgtaac caaattagtt ccccgtggga aaaatgtcga ggagctctcg 1320 tttgtatctt ttaaggttac gattagtaaa tctgcttcga gcgtagtcgg cgatcagtgg 1380 tattggccag atggtatcac agtgcgagct tttgaaccga atccaaaaaa tggcgctgtt 1440 accaggcttc cgaccatttc gtaactcaac gwggattcgc cgattttcta aatccttctt 1500 cggttgattt gggacgcatg acgttccctg tttcatcagt ccatgaggga cgtgtgacgt 1560 tccctgtttt atcaggtaat catttgaaca gtatatgtcc cgccgaagtg ttgattgata 1620 ctgcacccac attgaaagga agcacgcatt ttcagttcca cgatagttct acagccactc 1680 atccgcccac tattgagaat gaatcgctga gcaatttcgg ctcacaccac tcgggtacag 1740 tccctgtttt atcaggtaaa atagatcaga gtatatgtct tgccgaagcc ggaaaggata 1800 ctgcaccctc tctgaaaagc gtttcccgat tacagtcatc caatactgcc aacaacagac 1860 cagcagctat cggccctccg ttagcatcat tactcgtcga tcccctcgcc ggagatgttc 1920 accgcgtcaa tgaatcgacc gtaactacgg ttgcgggttg caacacatac cacttggcat 1980 cattccctac agtgtcgggt aagtctgaga gtggtatatg tcacgccgaa gcaggaaatg 2040 atactgcacc aaatgtgagt gaatcttctc cgttacagca tgcaccttca tggcgagtac 2100 agcctacagc cgtcaatgca ctccaatgtt cttcgagcaa caacttgctc atatattatc 2160 aaaatgttcg cggcttacgg actaagatcg atgattttta tttggctacc gccgagtcta 2220 aatttgatgt catagttctc actgaaacgt ggttggacga gcgaatatat tcggcacagt 2280 tatttggcag tcaatacacc gtattccgca acgatcgcaa tcaagaaaac agtaccaaat 2340 cacgtggcgg cggtgtactg atagcaatca ataggcgctt gtgttgcagc cttgattctt 2400 ccccaatcag ttcttccctc gagcagattt gggtcaagat caaagggcaa tatagatcat 2460 tgagccttgg tgttctatat ttgcctccgg atcggaaatc tgatttggaa tgtattcaca 2520 atcacgtaaa ctcgattggt aatgtactcg gtcaactagc cttaaaagac ctggcgctga 2580 tttttggcga ttataatcaa tcaaacctgg tgtggatcaa gcaagaaaat aaacccccaa 2640 caatcgacat tcttcgatct agcatatctg catcatgctc agctcttctg gacgggttta 2700 gcctacacgg cctagttcaa ataaatactg tattgaacag aaactctcgt ttacttgatc 2760 tcctttgggt taacgacatc gcactctctg attgctccgt gcatgaatct attgaccctt 2820 tgatagatct tgattctgat catcctgctc tggaaacatg cgttaacatg tcgtcaccta 2880 ttactttcga aagcaccgat gacgtagaag gacttgattt tcgcaatgcg aatataactg 2940 cgcttaaaca aatacttgct caaactgatt ggcacggtct tgataccgca tgtaatgtag 3000 acgaagccgt tgaccatttt actcaaatag ttaatcaagc gattatcgga aacgtgccac 3060 ttcgcaggcc gcaccccaaa cccatttggg gaaatgctcg acttcgttat ttgaaacggt 3120 caagatctgc tgctctaagg aggtactgcc aatatcgtaa cccatcaacc aaacgcattt 3180 tcatagaagc cagtattgac taccgtatat acaaccggtt tttgtatcgg cgatatacga 3240 gacaaactca ggaaaatctc cgcctaaatc catcacaatt ctggtccttt gtaaaatcca 3300 aaaagaatga agaaggctta cctgttgaaa tattccttgg agaccgccaa gctcgtacat 3360 cggcagacaa aagcaacctt ctggcagacc acttcaaaca agtctttagt agctcctcag 3420 cttcttcagt tctggtacat aaggcaatag aacatactcc aagcgatatt ctcagcctcg 3480 atgtaattgc gattactccg gaattaattg gaacggccat atcgaagtta aaatcgtctt 3540 atgcagctgg cccagatgga attccctcct gcctacttaa aaaatgttct actgagctca 3600 tcgaaccact agcaaaaatt tttaagcttt cttttgaaca aggtgtgttt cctaagcgat 3660 ggaaagcatc ctacatgttt ccagttcaca aaaaaggaga caaacgaaac atcgaaaact 3720 atcgagggat aacgtctctg tgtgcctgct cgaaaatatt cgaacttatt gtcaacgatg 3780 ccttattctc tgcttgcaag agttatatat ctactgatca acacgggttc tttccaaaga 3840 gatcaatatg taccaacctt gtaccattta cttcgatgtg cttgaggact atggaaactg 3900 gtgcgcaagt cgacgtcgtt tacacggatc tcaaagctgc gttcgaccgt gtagaccacg 3960 gtattctact ggcaaagctc aataagcttg gtgtgtctgg agcaatgata cgctggttca 4020 attcttatct tactgaccga cttctttgtg taaaaattgg tacagctgaa tcatattact 4080 tcaccaaccc ttcaggcgtt ccacggggca gcaacttagg accgttgctg tttacgattt 4140 tcatcaatga tgttggactg atcctcccac cagaatgccg ttcattttac gcagatgatg 4200 ttaaactgta tatcattgtt cgttgcttca gagactgctt acaactacag agcctaattc 4260 acagcttcga aacgtggtgt tcagacaatt gtctaactct gagtgttcac aaatgcaatg 4320 taatcacatt tcatcgcagt aagagcccta ttttgcacga ttataagatg aacggccaga 4380 gtctccaacg cgtcaacaat attcgtgatc ttggaatctc actggatgcc cgcttaacgt 4440 ttaaccaaca ctactcggat atgattgcta aagcaaacag acaacttggt tttatattca 4500 agattgccga cgaattccac gatcctctgt gcctcaaagc gttatactgt gcgcttgtac 4560 gttcgatttt agaattcgga tcagtcattt ggtgtccata tcacgcaacc tggatagcga 4620 ggattgaggc catccaaaga agatttgttc gctatgccct tcggtatcta ccatggaacg 4680 atccatcgaa ccttccgccc tacgaagaac gttgccaatt attgggtata gaaaccctcg 4740 aacataggcg tacgaccgca caggcggttt ttgtagcaaa actgtttacc ggagaaattg 4800 acagccccga aataataggc caaattggca tttatgttcc cgaacggaat cttcgctcaa 4860 gaaacttttt gcacctcggt agccgatcat caaactatgg catgcatgac ccgatacgtt 4920 tcatgtccac gcggttcaac gaattttact cgatcttcga ctttcacgac acctcaacaa 4980 cattttgtcg gcgaatacaa cgtgaattga tcgatcgaca gcgtaatatt ccgatataat 5040 catatgaaca tctactaatt tgaacaggtg agtctctgtc caagtcctgt actcagtagt 5100 tttgtgctat acccttgttt ttatttcatg gattccaacc gacccgctac aaacatcgag 5160 tatcaactgg atcaatgaac cccgacttcc cgaaaatgtt tttaagttca tctatttaag 5220 agataagttt taaattgttt tcgtttgttt tctgctgttt actatttata agcttaagtt 5280 ctctgttatt tttcgaaaag atatggggtt tttacgcttt cttgtttccg gcataatgtg 5340 gctgactcaa ggaagctttt ccccatccaa cctttttcat taagactaaa aaggtcagat 5400 gaaataaata aataaataca aatacaaata caaat 5435 // ID Polinton-7_NVi repbase; DNA; INV; 8494 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-7_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8494 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 797-797 (2009). XX DR [1] (Consensus) XX CC The consensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS 1135..2814 FT /product="Polinton-7_NVi_1p" FT /translation="MASIDFFERLVQXLRDTLQNVDQMSNNDIHYWLDMCN FT STIKELNEKIXKSNLTRGEKQKCQTCLTHLSCMQVEFEQHLHRGGSLQAEE FT TEARVEWRDVQSAFQNRIRTGVVVNIKHVDILSFLNDAQKLFKVKTKXCLX FT EYNAVKINSDFIAEFSMQKNGEETTDIKYFTSESATVYLSTDLEEWFKDNI FT KEPILSQLEEFQEKDSGWTLKSIISLCINMKKYSPIKGSSYIPLPKFIEKK FT KACVNVXNNDNQCFKYAILSALYPSKSNANRVNQYRLHENELNFGDIEFPV FT KLKNIPKFEKLNNISVNVYMLRKYGPKFEVSPCHITSEKKEKHVNLLIIQD FT FYIDEHEENNRADDGILPKYHYVWIKSMSRLLSSQLSKTHIKSYHCERCLQ FT IFYSEERLEIHENDCKNLNNCRINLPDSKNNILKFEDYSKSEKVPFVIYAD FT FECLLKPTENENAFQLHEAYSIGYYIKCSFDDSLSGYRSYRRKNEEEETAA FT AWFVQELKSIGEKIDVLYKNPKPMRLTDLEEISFRRSTTCHICRKPIXGEE FT LAVRDHCHLTGR*" FT CDS 3014..5347 FT /product="Polinton-7_NVi_2p" FT /translation="MKGRVTLIPHNKEKYISFTKYIDDCDISFRFIDSWRF FT LPSSLEKLASYLENVPIAVKEFRSDGFTDEKIDLLRRKGVFPYDFVNGLDK FT LTTTKLPEKNDFYNKLTDSHIAEEDYKHAVQVWDKFNISTLGEYSDLYLKT FT DVLLLADVFESFRETSLKAYDLCPAHFYTTPGLTFSAALKMTKVELELLTD FT IDMLMFIESGIRGGISQCCNRYAKANNPYMGSSYDKDQKTKTLLYFDINNL FT YGWAMVQPLPVGKFKWIEYETNPSFFNTPPDSDIGYFAEVDLEYPEEIHDD FT HRDLPFCAEHLAPPGSKQKKLLTTLNDKKRYVIHYRALKQVLDNGLRLKKV FT HRILSFEQRAWLKPYVEFNTEKRKQAKNEFEKLFYKLLINAVYGKCIERER FT KRVDVRLVNKFTGRYGAEARIALPNFHSCAIFDENLVAIQLKRTSITIKKP FT IYVGLSILDLSKTLVYDFHYSYMKKRVGEKCKLLYTDTDSLIYEVEDVDMY FT QVMKEDIHKFDTSDYDENNQFGIPRVNKKVPGLMKDECCGKIMTEFIGLRS FT KMYSILIDGAETVKKAKGIKSSVVKKSITFEDYRKCLQDLVIIKREQCNIR FT SKLHIVHTEKQEKIALSPHDDKRFLLPHSTDTLPWGHYRIMEEQMAQAAMQ FT VEGDAEEGIVVEERGNNDEGRKREGESERGHEATNENASRVGMTLYGDEGE FT QIRIEYDVLEEFMHEWDPSLDRIGSEQEERRDILLEAMEGAEIVVGNGLYE FT RTQHDEEHIREIEANNQPCAKRPRLT*" FT CDS join(8035..7235,7320..6799,6827..5901) FT /product="Polinton-7_NVi_3p" FT /translation="MKSESPLQQPSNHILDKNSSKKMKKRIRIDRSSLLND FT QNNNKTSLLDYTVYDYDDDDDDDAFDTITNSTFRSASGKNESAEDFLSLLD FT KSRRTIDNIYGVRKVDGVYMIGDSEIEFDDKYVKVRNESYPKTNGLMELLF FT KKYPDDLLLSSADRENYRKILEATNAHRKKFSKDESIRMSRSNKYKNILAP FT MFRSTPRKKNNSSGGGLIPKYKIAKKNSSIDLVYWDDPNELIERLRLLIAE FT RSAGNNNHTNEIQSIIEELREAGHIYLPNDLLEITITPMKSSQLLKNYVKL FT DIYIDTHNFLICISSTMSLDVFGRKLEGSQQVSRGPPGIGFNFTSDGDFDL FT EXKRLCNLGEPRXPNDAITLHSLKVILQTEXNYIASKLXGIGEVIEEYKQQ FT VEKHQLEVNAKLKYLYXTTQRNYSGVDYIIEELNNFIIKEDGTSKSSSKKM FT ERQRVAEELHKPARRRYKRRKYDIRGIDETWQADLVEMIPYSKENKGFHYL FT LTVIDIFSKYAWAVPVKSKSGNDVTAAMKSILKEGRVPKNLQTDQGKEFYN FT SIFQKLMKKHNIHLYSSHSNLHASICERFNRTLKNAMWLEFSKQGSYKWLD FT ILPNLLKAYNSRKHRTTGIEPENVTQANEREVKRRFPNEKSPVKKPKFKVG FT DKVRISRIKNVFEKGYTPNWSTEIFTITRVVKTDPVTYHLKDYYDKPVSGG FT FYETEIHKTMYPEIYLVEKVLKKSGKRVYVKWLGFSDEHNSWINKNEIV*" XX SQ Sequence 8494 BP; 2760 A; 1382 C; 1478 G; 2801 T; 73 other; atttccarta taarcatgta catttattat tttaaatatc aatttcatct agaccaaatg 60 takcwtctaa ataatttagc agccaagctc gtattgaaat aacattgtga acagcacaca 120 taggttttct tcttatgcag agatcatgat traagcagaa gtactggttw tcacttatta 180 tttcttcaag aggtggacat ccaaagtctt caacatttga aattgatctg aaaaaaataa 240 gaaaanaatt aaararaagt ttcaaatagt ttyaaaataa ttaacttaca trtttggcaa 300 wawatttttt atccattggg ctttctgctc tccttttayr aatatttttt ctactccttg 360 cgtrcagatt tgtaaaattt katgaagacg gtgatatgga atatctccag arttccaygc 420 tattccatga tgactctttt cwakccattg agtcatacat ttttcttcyt cagtyagttc 480 ttcccatgcc attggaggtt tgaaragata wgcrgtcggt aaagcagttt tcctcaaatt 540 gaggaaactr atttctttga aaataaaagt atcttctgcg attctttgaa atccttgaat 600 gtcgcaaata taaacatttt cttcttgaca ttgcaatata gtatttttaa acaacaaatc 660 gtagtcacgt gttctatcca tcttgcarct ttgtagacag ttcagtaaga agtgatctcg 720 cnagattcta rcatcctggt tttataatam gaaactccgc ctytcrctct tcyaggattg 780 gatgaaaatg caagccctca tttgttgggt gaaaagacga tttttggttg gttsaaaagg 840 aagaaaattc atcccttctc tagaaaattt tcctcctctt agtctcataa tcttctagga 900 ttggatgatg gaaaaaatgt gtagccctta ttggaagaaa attcatctct tcccgagaaa 960 atctttctcc tctctcctca gcagcgagat cttccacctc tccttatgat gattttagct 1020 ttaaaagaga gaaatttctc ttrgacgagc acagttcatc tgtcgcagtt ascaaaaaaa 1080 atttagtctt atttttgtgc aagattacta gtgataytct agtgaaaagt aaaaatggct 1140 tcgattgatt tttttgaacg tcttgttcaa ktacttcgtg acactttaca aaatgtagat 1200 caaatgagca ataatgatat tcattattgg ctagacatgt gtaattctac tataaaggaa 1260 ttaaacgaaa aaattgmtaa atccaatctt acgagaggag aaaaacagaa atgtcaaact 1320 tgtttgacac atctttcatg tatgcaagtt gaatttgaac aacatcttca tcgaggaggt 1380 agtctacaag ctgaagaaac wgaagctaga gttgagtggc gwgatgtgca rtcagcttty 1440 caaaatcgaa ttagractgg tgtggttgtk aatataaaac atgtggacat tttaagtttt 1500 ttraatgatg cacaaaagct cttcaaagtt aagacaaaar catgyttara agaatataat 1560 gctgtgaaaa taaattctga ctttattgca gaattttcta tgcagaaaaa tggagaagaa 1620 acgacagaya ttaaatattt tacaagtgaa agtgctacag tatacttgtc aacagattta 1680 gargaatggt ttaaagataa tatcaaagag cctattcttt cacagttaga agaatttcag 1740 gaaaaggatt ctggctggac tctgaagagt attataagtt tgtgtataaa tatgaaaaaa 1800 tattctccaa tcaagggaag ttcttacatt cctcttccaa agtttataga aaagaaaaaa 1860 gcatgtgtta aygttcrtaa taatgacaat cagtgtttta agtatgctat wctttcagca 1920 ttatatccta gtaaaagtaa tgccaataga gttaatcaat atcgattgca tgagaacgaa 1980 ttaaactttg gtgatataga atttccagta aaacttaaaa atattccgaa atttgaaaaa 2040 ctaaataata tttcagtaaa tgtatacatg cttagaaagt atggaccaaa gttcgaagta 2100 tcaccttgtc atattacgtc agaaaagaag gaaaaacatg taaatctact catcattcaa 2160 gatttttaca tagatgaaca tgaagaaaat aatcgtgctg atgatggaat cttgcctaaa 2220 tatcattacg tttggatcaa aagtatgtct cgacttttga gttctcaact gtctaaaact 2280 catataaaat cttaccattg tgaaaggtgt ttacaaatat tttatagtga agaaagattg 2340 gaaattcatg aaaatgattg taaaaattta aataattgtc gaattaattt gcctgactct 2400 aagaataata ttctcaaatt tgaagattat agtaagagtg aaaaagttcc ttttgttatt 2460 tatgcagatt ttgaatgttt attgaagcct actgagaatg agaatgcttt ccaactgcat 2520 gaagcataca gcataggata ctacataaaa tgcagctttg atgatagctt atctggctat 2580 agatcataca gaagaaaaaa tgaagaagaa gagacagctg cagcatggtt tgttcaagaa 2640 ttgaaatcaa ttggtgaaaa aattgatgta ctctacaaga atccaaaacc aatgagactc 2700 acagatttag aggaaatatc ttttcggaga tcaacaactt gtcatatatg tcgaaaacca 2760 atcmatggtg aagaacttgc tgtacgagac cattgtcatc tcacagggcg gtaagttgtc 2820 ttctcaatta tatttgaaat aaagtaaacg aaattttgta attctaagat agattttctt 2880 cttcagtttc agaggtgcag ctcayaattc atgtaacttg aattataagg attcaagatt 2940 tattcctgtg gtttttcata acctcaatta cgacactcac tttattttaa aagagattgc 3000 tacttctacg ttgatgaaag gacgagtcac tctaatacct cataataagg aaaaatatat 3060 ttcatttaca aartatatag atgactgyga catcagtttt agattcatag attcttggcg 3120 ttttttacct tcatcattgg aaaagcttgc atcatatctg gaaaatgtgc cgatagcagt 3180 aaaggaattt cgatcagatg gatttacaga tgaaaagata gatcttcttc gacgaaaagg 3240 tgtatttcca tatgattttg taaatggatt agataagttg acgactacaa aattgccaga 3300 aaagaatgat ttttataata agctaacaga ttctcatatt gctgaagaag actataagca 3360 tgctgtccaa gtttgggata agttcaatat tagtacgttg ggagaatatt cagatcttta 3420 tctgaaaacc gatgttttac tattggccga cgttttcgag agttttcgag aaacatctct 3480 taaagcctat gatttatgtc ctgctcactt ttatacgact cctggactaa cwttttcagc 3540 agctctaaaa atgacaaagg tagaattgga gcttcttaca gatattgaca tgctaatgtt 3600 tattgaatcg ggcattcgag gtggaataag tcagtgttgt aatcggtatg ccaaggcaaa 3660 caatccatat atgggttctt catacgataa ggatcagaaa acaaagactc tcttatattt 3720 tgatattaac aatctttatg gatgggcaat ggtacagcct ttaccagttg gaaaatttaa 3780 atggattgaa tatgagacaa atccaagttt tttcaataca ccaccagatt ctgatatagg 3840 ctattttgct gaagttgact tggagtatcc tgaagaaata catgatgatc atagggattt 3900 acctttttgc gcagaacatc ttgcaccacc tggctcaaag cagaagaaac ttctcacgac 3960 tttgaatgac aagaaaaggt atgtcattca ttatcgagca cttaagcagg tgttagataa 4020 tggacttcga ttaaaaaagg tgcataggat tttgagtttt gagcagaggg cgtggctcaa 4080 accatatgtt gaatttaaca cagaaaagag aaagcaggca aagaatgagt ttgaaaaact 4140 cttctacaaa ttrttaatta atgcagtcta tggaaagtgt attgaacgag agcggaaacg 4200 tgtagatgtg cgattagtaa ataaatttac aggcagatac ggagcagaag cacgcatagc 4260 tctaccaaat tttcacagtt gtgcaatctt tgatgaaaat ttggttgcca ttcagctgaa 4320 gaggacaagt attacaatca aaaaaccaat atacgttggt ctctctattc tagatttgtc 4380 aaaaacatta gtatatgatt ttcattattc atatatgaaa aagcgagttg gagaaaaatg 4440 caaactcctt tatacggaca cagatagttt aatatatgaa gttgaggatg ttgatatgta 4500 tcaagtaatg aaagaagata tacataaatt tgatacttct gattatgacg aaaataatca 4560 gtttggaatt ccacgtgtta ataaaaaagt tcctggattg atgaaagacg aatgctgtgg 4620 caagattatg acagaattca ttggtttaag aagcaaaatg tayagtattt tgattgatgg 4680 agctgagaca gtgaaaaaag ccaaaggtat aaaatccagt gttgtaaaga aatcaatcac 4740 gttcgaagat tatcgaaaat gtcttcagga tcttgttatt ataaaacgtg agcagtgcaa 4800 tattcgttca aaattacaca ttgtgcatac tgaaaaacag gaaaaaattg ctctaagtcc 4860 acacgatgat aaaagattct tattgcctca tagtacagac actttgcctt ggggacatta 4920 cagaattatg gaagaacaga tggcgcaagc tgcaatgcaa gtagaggggg atgcggagga 4980 ggggattgta gtggaggaga ggggcaacaa tgacgaagga cgaaaaagag agggagaaag 5040 tgagaggggg catgaagcaa ccaatgaaaa tgcttcacgg gtggggatga cactttatgg 5100 tgatgaaggt gaacaaatca ggattgaata tgatgtttta gaggagttta tgcacgagtg 5160 ggatccatca ttggatagga taggatctga acaagaagag agaagggata tattactaga 5220 ggctatggaa ggggcagaaa ttgtggtggg aaatgggcta tatgaacgga cgcagcacga 5280 cgaagagcac attcgagaaa tcgaagcgaa caaccaacct tgtgcaaaga gaccaagact 5340 cacctaactc taaaataagt aagttctatt tcttagttct ccctcactac ttgtctttct 5400 cctcaattct cacctatgag acatttttca ctaacataaa ttcttatttt tacttttcag 5460 aatggatcta tctgcactca acaagattgc tgagcgcgag ttcttgccta agaagaaagt 5520 cacggacttg gaaaaagatc atgagtacat ggtgactgca ctcaaggagg tsaagacgag 5580 attcggcacg aagatcgtgg cygaaatcga tgacagcttt caaatatttc tgccagagaa 5640 gatctcatca gcaatcttga aagatcaaga acttttcaat aatctctcta acacagcaaa 5700 taaacttagt ttatttataa catatcaggg tggaacttct tttaagttca gtacttgcta 5760 aaaataaata aaataaaata caaagattga ttatttttaa aaactattta tttgttcgaa 5820 acttgtacat aatatacatg aataaataca ttattaaaca taatttacaa ttgttttctt 5880 atttatatac cttagataat tcatacaatc tcatttttat ttatccaact attgtgctcg 5940 tcactaaatc ctaaccattt aacatatacg cgtttaccac ttttctttaa aactttttct 6000 actagataaa tttcaggata catagttttg tgaatttccg tttcataaaa gccgccagag 6060 acgggtttat cataataatc tttgaggtgg taagttactg gatctgtttt tacaactcgc 6120 gttattgtga aaatttccgt agaccagttt ggygtgtaac ctttttcaaa aacatttttt 6180 atcctactta ttcgtacttt atctcctact ttgaatttcg gttttttcac tggcgacttt 6240 tcatttggaa aacgtctctt gacttctcgc tcatttgcct gtgtaacatt ttcaggctcg 6300 attcctgttg tccgatgttt tctcgaattg tatgctttga gcaagttagg tagaatatcc 6360 aaccatttat agcttccttg tttgctgaat tctaaccaca ttgcattttt taacgttcta 6420 ttgaatcgtt cgcatatgct tgcatgaaga ttactatgtg aagagtataa gtgtatattg 6480 tgcttcttca taagtttttg aaaaattgaa ttgtaaaatt cttttccttg gtcagtttgt 6540 agattcttcg gtactcgtcc ttctttcagt attgacttca tagctgcagt aacatcatta 6600 ccactctttg acttgactgg cacggcccat gcatactttg aaaatatatc tatgacagtg 6660 agtaagtaat gaaaaccttt gttctcttta gaataaggta tcatttctac aagatcagct 6720 tgccaagtct catctattcc acgtatgtca tattttcttc tcttgtatcg ccgacgtgct 6780 ggcttgtgca gctcttcagc tactctttga cgttccatct tctttgatta taaaattatt 6840 caattcttca ataatataat caacaccaga gtaattacgt tgagttgtat yataaagata 6900 tttcaatttt gcattcactt ccaattgatg tttttcaact tgctgcttat attcttctat 6960 aacttcacca attccagyaa gtttagaagc tatataattt ayttcagttt gtaggataac 7020 tttcagtgaa tgcaaagtga ttgcatcgtt tggatkacgt ggctctccta gattgcaaag 7080 tcgcttmtct tctaaatcaa aatcaccgtc tgatgtaaaa ttaaaaccaa tgcctggagg 7140 accgcgactc acttgctgcg agccctcaag ttttcttccr aacacatcga gactcatggt 7200 actactgatg caaataagaa agttgtgcgt gtcaatatat atgtccagct tcacgtaatt 7260 cttcaataat tgactggatt tcattggtgt gattgttatt tccagcagat cgttcggcaa 7320 tcaataatcg aagtcgttcg ataagctcgt ttggatcatc ccagtagacg agatcaatgg 7380 aagaattttt cttagctatt ttatatttag gtataagtcc tcctccgctg ctgttgttct 7440 ttttacgtgg agtgctgcgg aacattggtg ctaagatatt tttatattta ttacttctcg 7500 acattcgtat agattcatct ttgctgaatt ttttccgatg agcatttgtt gcttccagaa 7560 tcttgcgata attttcacga tctgctgaac tcagaagaag atcatccgga tattttttaa 7620 atagcaattc catcagtcca ttcgttttag gataactttc atttctgact ttgacatatt 7680 tatcatcaaa ctcaatttct gaatcaccaa tcatatacac tccatcaacc tttcgcactc 7740 cgtatatatt atcaattgtt ctacggcttt tatctagcaa tgataaaaag tcttcagctg 7800 attcattttt tccactcgca gatctaaatg tagaattggt tattgtgtca aaagcatcat 7860 catcatcatc atcatcataa tcataaacag tataatccaa taaacttgtt ttattattat 7920 tttgatcgtt gagcaaactt gatcgatcga ttcttattcg ttttttcatt ttcttactgc 7980 tgtttttatc caaaatgtga ttggatggtt gctgcagtgg gctttcactt ttcatctcaa 8040 caagtttttc aagtggtgtg atgactggtt ttagagtatc accgatgact ttttcaaaat 8100 catctttctc ctgttttaat aaagtgtatt ttttttaata gcctcacgag ctcgcaagag 8160 ctcactaaga acattcttct ctttcttgaa gtctgacatg ctgacgtggc gatgtctaca 8220 actgaactgt aagtttgaga ctcataattc cttttttatt gcaaaacagt caaatccttt 8280 tctgtatcta ccctcattca tcgrtctatc ttttcaataa cratgaatcc atatctatca 8340 ttcttccaac attcagaaca aagattccga aattcattga aagttaaatc ggtattcaca 8400 tgatcttcat agatatgttt aaatttacat catcttgtcg ataaattact aacaaattga 8460 cattatccct aatgagatgt tttcctatat gagc 8494 // ID hAT-2-h_SM repbase; DNA; INV; 2141 BP. XX AC . XX DT 21-APR-2009 (Rel. 15.05, Created) DT 21-APR-2009 (Rel. 15.05, Last updated, Version 1) XX DE Horizontally transmitted DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-2_SM*; hAT-2-h_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2141 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX CC hAT-2_SM is a family of autonomous DNA elements in the genome of CC Schmidtea mediterranea found also in Tarsius syrichta, Microcebus CC murinus, Myotis lucifugus, Monodelphis domestica, Otolemur CC garnetii, Anolis carolinensis, Xenopus tropicalis and Echinops CC telfairi. The consensus sequences is based upon fragments CC detected on NCBI. XX SQ Sequence 2141 BP; 692 A; 331 C; 337 G; 609 T; 172 other; aatwttcgca attgctttat gttatgratc accttgtatt ttagtaatad caatttggaa 60 tycctaggaa acaatgcaat caagaaagar vaaaattgac tcgbartbwa ssatmttyta 120 agaacaatgg acttatkatt actttttcat gcadtacaag gaaagaghtd tgtgtttgat 180 atdccwgaat atagtgtctg tgttcaaaka atahaatctg cgtygtcact atcaawgtca 240 acataawgat aaatacgatt gtttggtcgg agaagdgara aaaaatmaaa tattaaagct 300 aaaaaataat ttgamaactc arcaaaatac ttttgtgaac yagaatcagt taaatatttc 360 atcactgcga gcawgttttt aagttgctaa gctaatakbg tgyacctaca aaccattcgt 420 ggagrgtraa tttgttaaag aatggccttc bttccgttgc caaagagatg ttwccagaga 480 acsycrattt atttagtaaa gtgagtcttt caggacctac agttacaaga aggattgaag 540 aaatgggara caatttgcat cagcatttgc aaaactytgg agaaaaattt tcctrbtttt 600 ccttggcact cgacgaaagt aatgatgttc gtgattctgc ayaattycta atttttattc 660 dtgggacgaa tgactatttc gaaatcacag aagagcttgc tgcactgcaa agcatcaaar 720 gaacaactac argagakgat atctatgaaa tggtttgcva aactgtgart rvtttggary 780 tgaactrrac taaactagcc agtrtgacaa ctgatggttc tcctagcatg gtagggtcta 840 ataaaggagt aattgckcac attaaccaag agatggghaa ahataaycat actcatcyaa 900 tadccayaca ctgcctcaty cacyavcaag cgctgtgtak taaatcactg aagtggaact 960 ctgttatgaa aactgtggta tcttgtgtta atttyatcag aactaatgca ttaaaccaca 1020 gacagtttca ggaatttttg tctgagctaa atgttgccaa taaagatgtt ctgtacchca 1080 cagaagtcvg ttggytragt caagggagag ttttbaaacg yttctatgag ttacttccac 1140 agattaaaga ttttctgcyt tctaaaaaca aagaadtacc agagctcaat gatgcagaat 1200 ggaaatggca ccttgccttt ctgacrgatg taacagagct actcaacagt ttcaatgtgc 1260 aacttcaars aaagggmaag ctcatctgtg acatdcaatc atatdtgaaa gcatttgcag 1320 ttaaattang catcctcatc aaabacgtga aggaggaara tttcwgccat aactcaaatc 1380 ttcccataac tcaaaacctg ttagcggaaa aaccattrgt ttcatttcca aacaaaahat 1440 gtgtggattc actkgaaaag ttgcaaaaag agwtycaact tagatttaaa gagcttmgtc 1500 tccatgaaca kgacatacag cttttccgta acccattttc tattgacatt raaaatgtak 1560 ayacaattta ccaaatggaa ctggctgaac tgcdgaattk tgactctctg waagacgcgt 1620 ycaktttaag cagccttcct aatttctatg catctctccc tccgagahat atcccaatct 1680 caggaaccat gsactcaaaa tggtwacaat ctttagcagc actwawwwst gcgaacagac 1740 ttttthcagg atgaaacatt tgaaatctct aacbagatht atayttatkg mtwsryahtt 1800 gcattacttt dtahgactag cagtgacaaa tatgdaaccg gatagtaacc aathttatta 1860 gccaaaaata ggggttcaat aaaacagaat tccyattgaa attgwttkkw awgttkatar 1920 awkyratttt agccatytyg ttatttyywg yrwwytcaaa tttcattatt gtatcaaatt 1980 gtttcaaatt gttttcwttc gtatgtatta aagtaaatct aaaaawcagt agattttttc 2040 aaaayaaywt ttttttaaht wgctwtwatg tagcttgtta rywwataatg ctsttcaatg 2100 gtttgaawah ahgttdgacv ahtbggaaat aaaattcagt t 2141 // ID hAT-N13_AP repbase; DNA; INV; 626 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N13_AP. XX NM hAT-N13_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-626 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2113-2113 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 626 BP; 177 A; 70 C; 96 G; 283 T; 0 other; cagggttggg attttatagc atttgcatat ttcttatgag atgcttaaaa aatagcagag 60 taagctaaaa atgtgtttcg tgatacagag aactcaaatt aaaaatttta ttttgctttt 120 ttttgcatat tttgcgattt tttagttttt agtgcttttt ttggactttt tgagattttt 180 ttgcattttt acctgttttt gaatcaaaat cggtcatatt ttatgaaatt atattttagt 240 aaacgtgttt ttagattttt gtcaatcgaa ttattacgat aatttcgtat taggtatttg 300 gttctaactt ctaatttttg ttatctgctg attttgttgg gttttttttg tcgatttcta 360 ctgcgattat taatcgccga ttatcaacga acgagcgttg aatgcgttgt atagtcaatt 420 catttggatt ctgatcagag tgactaaatt atttcaaatc ttaagtaact acctataata 480 atttgtaagt gattaaaaaa ttattaatta ataaatttcc tttttttttg catattttgt 540 tgtttttatc gcatatttag cgatatttaa ggtaatatta tagcgcatat ttatgcaatt 600 ttaagtgcta taaaatccca accctg 626 // ID CR1-45_BF repbase; DNA; INV; 3363 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-45_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-45_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3363 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3363 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1616-1616 (2009). XX DR [2] (Consensus) XX SQ Sequence 3363 BP; 953 A; 697 C; 577 G; 1136 T; 0 other; cagaatcctg gcttggagat tccattgagt cagatgatat ctcgttagat ggcttccaac 60 ctcccatccg tcgtgacaga aatcgtcacg gagggggagt tctcatttat atttcaaata 120 ctattccttg taaaagacga gatgatcttg aagttatctc gtccgaatgt gtttggttgg 180 aactccagat tggcatcttt aaaattttgt ttgctactta ctatcgtcca cctggtcaga 240 atatttccat cactaatgac ttcatggatc aattcattga cgctgtcagt acggcgcatt 300 ctgagcgccc agatgccatt gttgtgactg gtgattttaa ttccaaacac aatgagtggt 360 ggcacctgga cccagttaat tatgccggtg ctaaattacg ccaagctact cagattgtaa 420 atcttcaaca gatcatcacc gaaccgacct gtgacctctc acagtctccc tcgttgattg 480 acttaatttt taccgattca cctgagttct ttacaaaaac gtctgtttta tcccccctgt 540 ccaattgtca ccactctcct attattggtt acatgagttt tcttgtctct gttccaaaac 600 cgtattctcg aactttttgg gactttagta atatcgataa tgattctctt actttcttcc 660 tgtcagatcc caagtggttt gatatatacg gatgcggaac agttgatgaa gcaactagta 720 agcttgtaca acttattcta gaggcaaaat atagctgcat tcctcataag actataacta 780 tcaaacctaa tgacaaaccc tggatgtcat ccaacatcag gaaattaatg cacttgcgag 840 ataagtttca taaaaaagct aaaaaatcta atttttctgc tgattggatt gcatatcgaa 900 acattcgtaa caaattgacc aacaaaatct ccaaggccaa gtttgactat gaatctcgtt 960 tggctcaatc tctttcaaac ccatcaacag cccccaagaa atggtggcat attgttaaaa 1020 agttttataa acgtaattct ttcagcacca tacctccgct attgaatggc ttagtatcca 1080 ttgttgactc gtatgaaaaa gcttgcattt tcaatgactt ctttgcttct cagtcaacaa 1140 tagacacatc caatgccgcc cttcctgatc tgacctttct caccagtgag aaatcaacat 1200 acatttcaac tacgtctgat gaggttgaaa tgcttatcgg ttgcttagag tcctcaaagg 1260 ctcatggcta tgatgacatc gataatcgtt ttcttaagtt gatagcaccc tttatttctg 1320 ataaattagc gtatgttttt aatttgtctt tgtcccatgg tatcttccct aatgtatgga 1380 aaaaagcaaa tgttattcca atctttaaaa aaggtgactc tcgtgacaaa acaaattacc 1440 gtcctgtttc gctgctgcca tctttgtcaa aaatcctgga gaaaattgta tataaacatg 1500 tttataatca cctaatttcc aatgacttat tgtacacttt ccaatctgga tttatgaaaa 1560 atgactctac aacttgtcag ctcgtttaca tatgtaactt aattctagaa gcacttgaca 1620 aaggaaaaga agtacgtgca gtttttttag atttctctag agctttcgac aaggtatggc 1680 acaccggtct tatttttaaa ctccaacaat acggtattga aggtccttta ttgaactggt 1740 tttctagtta tttatcggaa agactgcaaa gagtagttct tgatggtcaa gcttctcctt 1800 ggagagagat cggtgccggt gtcccacaag gttccgtgtt aggcccactt cttttcatca 1860 tatatattaa tgacattgta aacgaattag attcccttcc ttttttattc gcagatgaca 1920 gttctttgtt ggagattgtt gaaaatccat acacgtctgc acacaggctt aattccgatt 1980 tatctaaaat tttggcatgg tcaaacacat ggttgatgga tttaaatcct tccaagacag 2040 aagaaatgtg ttttaccact aaaaagagtc ctcaatacca ccctccactt tttcttggaa 2100 ctaccgaaat tgttagtgtc tcatctcata aacacattgg tactatctta acttctgata 2160 tgtcttggaa caaccacttg aacaatatat gtagtaatgt ttcgaagagg gttaatgttt 2220 ttaaatgtct taagttcaaa cttccccgcc atgttttaca aactatatac aagtctttta 2280 tccgtcctag tctggaatat gcagacgtag tttggcacgg ctgttcaggg gaaaaatcct 2340 cccttttaga gagaattcaa taccaatgtt ccatagtagt gtcaggagcc attaaagggt 2400 cttcttattc caatgtttgt gaggaattgg ggtgggagtc gctctccagc agacgacata 2460 accaccgttt attgttgttc tataagattg tcaacggcct aacacggaac tatctccttg 2520 agttgcttcc ccgcgaaata actcaaacaa cttcttataa cttaagaaac aaatttaatt 2580 ttagactttc aaggtattca actaatagac ttattaaatc ctttgtcccg tattgcgtaa 2640 gtcactggaa cgaactagac tccagcgttc gctgtctgag atatccaatg tttcgtagtt 2700 accttaccaa actagttcgg cccaatagtc ctttacactt ctcgtccggg ccccgataca 2760 cctgcgccct tctcgctcga tttcgcattg gtacctacgg acttaatcat tctcttttca 2820 aacgaggttt ggttccaagc ccatcatgtg tctgtgggtg ccaatccgaa actatttatc 2880 actatttatt tgattgccct ttgtttgata aacatagaac ggttctttta aacaatatat 2940 ctaatttaac tgaccacttg tttgattttg acaccatttc cgacaatctg aaactttcta 3000 tagttcttag aggttcgcct acattcaatg ctgaaatcaa ttccaagctt atgttattca 3060 ctcaaaagtt cattgatcta actcagcgtt tttagtccta ttgtcatatt tcatgctatg 3120 tagaatttac tacatttata catcattgaa ttgtgtactt tgtacatggg gtgggtaggg 3180 gtgttttgtg cacacacgat gttagtcgat tagtttgttc aattattgtt tctaatgttg 3240 ttatatctgt attcatgtat tcaaatgtat gttaagtggc actgttaaaa taagttttta 3300 acttgagtgc agtgtcactt tgtcctgtct gcttttatgt tgaataaaaa aaaaaaaaaa 3360 aaa 3363 // ID Gypsy-5_RP-LTR repbase; DNA; INV; 481 BP. XX AC ACPB02004727; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_RP_; KW Gypsy-5_RP-I; Gypsy-5_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-481 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02004727; Positions 1371 891. XX SQ Sequence 481 BP; 131 A; 87 C; 80 G; 183 T; 0 other; tgtaaccgcg taggttggca gatctgtcag ggacagctag cgccagtagc ggaacccaac 60 gaaataataa taaccttctc tccagtctct agagcatgtg cacaaaaagc ttattttctt 120 agtatttctg tttttttctt gttaatcagc tttagttttg ttattttatt attagtttgc 180 cttagtttat cttttgttat gcaatagtgc atggtttctc ttttagtttt tcaagcatgt 240 ctatctagca ttactcttct gcttgtatat atcaattgct ttactattat ctatgcgttt 300 gtaagtattt ctatgtatat atatattaat gtaaatatcc ctgtacacta attgtaaata 360 gactctttat ttaacgtgag gtaggtaact gcttcgacct cccctactca cctgagctcc 420 ccaaggagaa gtattacaga tcaaaacata atttcgaagc ggaaaaaagt aagtttggtc 480 a 481 // ID Copia-2_ACA-LTR repbase; DNA; INV; 158 BP. XX AC AEYA01000638; XX DT 23-MAR-2011 (Rel. 16.03, Created) DT 23-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Acanthamoeba castellanii genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_ACA_; KW Copia-2_ACA-I; Copia-2_ACA-LTR. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-158 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Acanthamoeba castellanii genome."; RL Direct Submission to RU (23-MAR-2011). XX DR Genome; AEYA01000638; Positions 123284 123441. XX SQ Sequence 158 BP; 45 A; 33 C; 24 G; 56 T; 0 other; tgttgtgtag tgtcaactac acttgcacac aactctggta ttccgttata tctttattag 60 gtagttacca tatatgtgac atgtacacat tagataataa atcaaagagg ctcttctgag 120 ctctcttatc tttccatcaa ctgtgtgata cctcaaca 158 // ID hAT-N7_AP repbase; DNA; INV; 647 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N7_AP. XX NM hAT-N7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-647 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2107-2107 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 647 BP; 219 A; 88 C; 97 G; 243 T; 0 other; gggctcggaa gttgtaggaa ttgcatattt ttctgtatga tcacaattta tagcagacct 60 agctagaaat ttgaagtata caatcgcgag ttgaaaaatt gaaaattgaa aatgcatatt 120 ttgcatattt taccaatttt agctaaaaaa gcgcatattt cattgatttt atacgcatat 180 tatgcatatt ttatattttt ctaaaaaaat tgattttttt ctttcttaac ctgtatattt 240 caactaatat tgaatataaa actcaaaatt atcaacgaag gtatttattg tcatcgccat 300 agtcaattac ggagttaata acagtgatat tgtgacacgc atttagtcta tcatgctagt 360 atgccgaaat acttatttgt agggctcgga agttgtagga attgcatatt gttctgtatg 420 atctcaattt atagcagacc tagctaatta gatagaaata gaaatttgaa gtaggtatac 480 aattgcgagt taaaaaaatt gaaaattgaa aatgcatatt tcgccaattt tagctaaaaa 540 agtgcatatt tcattgattt tatacgcata tttaacattt ttaatgcata ttttgcaatt 600 ttttcgtgca tattttacat tttttagctc ctacaacttc cgagccc 647 // ID Gypsy-1_BT-I repbase; DNA; INV; 4520 BP. XX AC AELG01002157; XX DT 15-JAN-2011 (Rel. 16.02, Created) DT 15-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the buff-tailed bumblebee: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_BT_; KW Gypsy-1_BT-LTR; Gypsy-1_BT-I. XX OS Bombus terrestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Bombus; Bombus. XX RN [1] RP 1-4520 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the buff-tailed bumblebee."; RL Direct Submission to RU (15-JAN-2011). XX DR Genome; AELG01002157; Positions 1050 5569. XX CC Positions [3141-3416] - Integrase core CC 'TATATA' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 34..2754 FT /product="Gypsy-1_BT-I_1p" FT /translation="MQQHMTITHDVVQTLVRKPSVESENLSLPLFNPEIAG FT ADPFAWCTTASVLMDNEPQSRELYFAISRALKGTAAQWLTRVPVRGLTWEK FT FTEDFLSHYGGKETATSALMRMINEPPQKDEPTATFGNRLRSFLSARWEGL FT SIAEAINAAVLLRQTSYDQRVERIALAHDIRARDQFHQEMRALPHARKRSV FT SLSNDPSAKPEAELNKTSNSRTRCYHCGAVGHKATECRKRMRIEKQKKTQR FT PEESRPATSSKVFCFKCRAEGHIVPDCPLRQERKGGHIKERRVDCRVVEAP FT TGKLSHQGESFPFCFDSGTECSLIKESAASKFSGKRTTDIVVMRGIGNTCV FT KSTSQILSTVCINGFTLEIAFHVLPDSYLKYDIMIGREILSQVFDVHITRI FT GLDICKRKIVNACNKAAENEIDLNEVDTEVLGDDKGRLISILEKFKNSFIT FT GFPRTRVNTGQLEIRLIDPNVTVQRSPYRLSEEERRTVRDRISELIRAKII FT RPSSSPFASPMLLVKKKDGSDRLCVDFRALNKNTVADRYPLPLITDEIARL FT QKARYFISLDMASRFHQIPIHPNSTEYTAFVTPDGQYEYVTMPFGLKNALS FT VFQRAIFNALSDLEYSYVVVYLDEVLIIADSIDQALERLNTVLDALVKAGF FT SFNFAKCSFLKTSVLYLGYVIHNGEVRPNPGKIHALSSLPAPTTVTQLRQF FT IGLASYFRKFVPKYSQMMKPLYALTSGNRNMTWTDRHEKIRQQVVSILTDA FT PVLMIFDPNYPIELHTDASSEGYGAILMHKVEDKNRVIEYYSKRTTPAESR FT YHSYELETLAVVNAVKHFRHYLHGREFLVVTDCNSLKALSNKLHLNDRAQR FT WWVYLQTFTFDIMYREGKRMAHVDFFSRNFVDLDHRKIDKIAEKEINLAEI FT SED" XX SQ Sequence 4520 BP; 1368 A; 1002 C; 1061 G; 1089 T; 0 other; tcagaagtgg gattcgagtc tttgttaaca acgatgcaac agcatatgac catcacgcat 60 gacgtagttc agacgctagt gcgaaaaccg agtgtggaat ccgaaaattt gtcgctaccg 120 ctttttaacc ccgaaatagc gggcgccgac ccatttgcgt ggtgcacgac agctagtgtg 180 cttatggaca acgagcccca aagccgcgag ttatatttcg ccataagtcg tgctctgaag 240 ggtaccgcag cacagtggct tacacgagtc ccggttcgtg gccttacttg ggaaaaattt 300 acggaagatt ttctctcgca ttatggtggc aaggagacgg caacctcagc gctgatgagg 360 atgatcaatg aaccacccca gaaggacgaa cccacggcaa ctttcggcaa tcgtcttcgc 420 tccttcctgt cggcgagatg ggaaggtcta tccatcgccg aagcaattaa cgctgccgtc 480 cttcttcgac agacttcata tgaccagcgt gttgaacgga tagcactcgc gcatgacatc 540 cgggcacggg atcaatttca tcaggaaatg agagctcttc ctcacgcgag gaagaggtcg 600 gtatccttgt caaatgaccc atcggcgaaa cctgaagctg aactgaacaa gacatcaaac 660 tcccggacaa ggtgttacca ctgtggagcc gtcgggcata aggcgacgga gtgccgcaag 720 aggatgcgga tcgaaaagca gaagaagaca caacgcccgg aagagagccg accagccaca 780 tcgtccaagg tgttttgctt caagtgccgt gcggagggcc atatcgtacc tgattgtccg 840 ttacggcagg agagaaaagg cggtcacatc aaggaacgtc gagtcgactg ccgcgtggtg 900 gaggccccaa ccggtaaatt aagtcaccag ggtgagtcgt tcccattttg cttcgattct 960 ggaaccgaat gctcattaat taaggaatcc gcagcctcga aattttccgg caaaagaacg 1020 accgatatag tagtaatgcg agggatagga aatacatgtg ttaagagtac gtctcagatt 1080 ttgtccactg tatgtatcaa cggttttacg ttggagatag cttttcacgt cctccctgat 1140 agttatttaa aatacgatat catgattggt cgcgaaattc taagccaagt cttcgacgta 1200 catatcacgc gcattggcct cgatatttgc aaacggaaga tagtcaatgc ctgtaataag 1260 gccgccgaaa atgagatcga tcttaatgaa gtagacaccg aggtacttgg cgacgataaa 1320 ggtcgattga tttctattct cgaaaagttc aaaaattcat ttattacggg cttcccacgt 1380 acccgcgtaa acacaggcca gttagaaata cggctaatcg accctaacgt caccgttcaa 1440 agaagccctt atagacttag cgaggaagag cggagaaccg tgcgagatag aatcagcgaa 1500 ctaattagag caaaaatcat aaggcctagt agttcaccat tcgcgagccc tatgttgctc 1560 gtgaagaaaa aggacggctc cgatagactg tgcgtagatt ttcgagcgct aaataaaaat 1620 acggtcgcgg atcggtatcc tctacccctt atcacggatg aaatcgcgag attgcagaag 1680 gcgagatact ttattagcct ggatatggcc agcaggttcc atcaaattcc catacatccc 1740 aattcgacag agtatacagc gttcgttaca cccgacggac aatatgagta tgtaacgatg 1800 ccgttcggat tgaaaaacgc actgtccgtt ttccagaggg ccatttttaa cgccttaagc 1860 gacctcgagt attcatacgt cgttgtttat ttagatgaag tcctaattat tgccgattcg 1920 atagatcaag ctttagaaag attgaacacc gtgttagatg cccttgtgaa agccggattt 1980 tccttcaatt tcgcgaaatg ttcttttctg aagacatcgg tactctattt ggggtatgta 2040 atccataacg gagaagttcg tcccaacccg ggtaaaatac acgccttaag ttccttacct 2100 gcgccaacaa ccgtcacaca gctcaggcag ttcatcgggt tagcttcgta cttccgtaag 2160 ttcgtcccta aatactcaca gatgatgaaa cccctgtatg cgctcacctc aggtaacaga 2220 aatatgacgt ggacagacag gcacgagaaa ataagacaac aggtagtttc tatcctgacc 2280 gacgcgccgg tgctaatgat attcgatccc aattacccga tagaactaca cactgacgct 2340 agctcggagg gttacggggc gattttgatg cataaagtcg aagataaaaa tagagtaata 2400 gagtactaca gtaaaagaac tacccccgcg gaatctagat atcactccta cgaactagag 2460 acgttggcag tcgtaaacgc cgtcaagcat ttccgtcatt acttacatgg acgggaattt 2520 cttgtcgtta cggattgcaa ctcgttgaaa gcgttgagta ataaattaca tttaaatgac 2580 agggcccaaa ggtggtgggt ttacttacag actttcacct ttgacattat gtatcgagaa 2640 ggtaagagga tggcccacgt agatttcttc tcgcgaaact tcgtagactt agatcaccgt 2700 aaaattgata aaatcgcgga gaaagaaatc aatctggccg aaatatcgga agactaacta 2760 ctagccgaac aacgtcgcga cccacaaatt atagaaacca ccaaacgaca acgaaccgca 2820 aactacgaaa cgacgagttc gcggaagata tcgcgaatac ctatgagttg cgttcaggta 2880 ccctttaccg caaagtgcag cggaggggca gaaccctctg cttgcccgtt gtcccgagag 2940 gctttagatg gtccgttatt aaccatgtgc atcagtcaat tatgcacttg ggttgggata 3000 agacgctcgg gaaactgtac gagtattact ggttcgaaga aatggcgaag tacgttcgca 3060 aattcgtaga gaactgtcat gcttgtcgag tttcgaaggc aagttcggat aagatacaag 3120 ccgaactaca tcctatatcc aagaccagca taccgtggca tacagtgcac gtggacataa 3180 cgggcaagct aagcgacaaa aatgactcca aagaatacgt cattgttttg gtagatgcct 3240 ttacgaaatt cgtatacctg cgtcataccc gtaagataga ttcccttaac accatcgaag 3300 cacttaagcc cgctatattg ttaatcggca gtccctgccg gataatagcg gatcaaggaa 3360 gatgttttgc aggtaaagaa ttccaagagt tctgtgaaag caaacagatt gttcactgaa 3420 agcaaaaaaa agttcacttg atagcgactc ggtgctagta gagccaatgg acaggtagag 3480 cgtgttatgg acacattaaa aaatatgttc acaatagtag agacgaccgg gcggccatgg 3540 caagacgcga tcggggaaat acagttggct ttgaattgca ctaccaaccg cgtgacaaat 3600 tcgagctcat tagaactact aataggtaga acagcaagac cttatgacct gttgctaccc 3660 agtaacatcg aagaaaaaga aatcgatatc tccaatgtaa gacgacaggc gataaaagaa 3720 atagaaacga atgcggtaca cgataaaaat agattcgaca aagctaaagc taaagtgatt 3780 aggtttaatc tcggcgattt tgtattacgc aaaaatgagg aaagaaacca aaccaaatta 3840 gatccaaaat ttagaggtcc atttgtaata gcagaagttt tggaaggaga caaggtatat 3900 cttaaagaca ttagacggta agcgatcgta caagtacagt caggacagat cgaggaaaat 3960 gccagatggt cgcattcctg ctgagttgga tgtctgtagt gatgacaata atagtgaccg 4020 cgacgatatg agtactccga tctcggaaga ttagcagcac tatgccaata acacccggca 4080 gagtgagttc gcttgtgtct cggaacggta atgtgatata gctcagtgag actctgtgag 4140 gcatttcact tgcgtctcag agcggtaatg tgatttagct cagtgagacg gtggaacgaa 4200 gctctatgtg ctcatacgat ctagaaaacc gagattcatt tgggtacgaa atcgagaatg 4260 gtccatcatg ccgttacgat ccgactgata acccacagga tgcaattatc accttgaata 4320 ctaatcataa cgctaccaat tttcattccc ttttatttct ttgctgtcaa acctccgaac 4380 agaattttgt tgttacactg gacccttagc ttattcgaac atttagtcaa attgtaaata 4440 atgttaatcg tatcgagcat attgttagaa aactcaatat tttagtcaca cccgaggacg 4500 tgtgatagtc agaatggccg 4520 // ID BEL-91_AA-LTR repbase; DNA; INV; 425 BP. XX AC supercont1.289; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-91_AA_; KW BEL-91_AA-I; BEL-91_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-425 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.289; Positions 1201941 1201517. XX SQ Sequence 425 BP; 131 A; 104 C; 62 G; 128 T; 0 other; tgttggatat gatttgaacc ttaacattta atactacaat ttaagaacac aatttataaa 60 ccactacaca tccttaccta gtctatagta ccctcattac tattgtttaa tacatacaca 120 atacacgaag aacaaccatt taaattgcac acacatgcat cgacactaag gagacaatta 180 acgatgagca ataaaatcag tttgaattcg actattgcac agtacacacg ctgtattaca 240 tctctcgcgt cggtgaaaga agattcccca cgattcgctt ctcgtcagtg tttttttctc 300 tccagttcag tgcgctccag tcatctatcg tcccttccac tcccattcga acattttaac 360 gctgcctaat ttttgaagat cgtcaaacgt gacaatttag tgcgcggtga atagtcgcgc 420 catca 425 // ID Copia-17_DPu-LTR repbase; DNA; INV; 249 BP. XX AC scaffold_44; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_DPu_; KW Copia-17_DPu-LTR; Copia-17_DPu-I. XX NM Copia-17_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-249 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 698-698 (2010). XX DR Genome; scaffold_44; Positions 991857 991609. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 249 BP; 66 A; 58 C; 44 G; 81 T; 0 other; tgttgagcgg caaaacccac acccgtcagt tgagatgctg tttagcatcc accccaatcc 60 cctcctcgtt tctgtcaagc tcgccactag tttcttctct ctctgtttct cctgtcacta 120 aggtactgtg tataatcgtg tgtaataatg gtaaattgta ttgtcttacg tgtgagtata 180 agacagaacc tggtgagtgg aataaagttc aatcaacaat atgtaaactc tttttattca 240 aactcaaca 249 // ID L1_Ele3C_AAe repbase; DNA; INV; 4450 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele3C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4450 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1401-1401 (2011). XX DR [2] (Consensus) XX CC >94% identical to consensus. The consensus is ~78% identical to CC L1_Ele3 and ~84% identical to L1_Ele3B_AAe. XX FH Key Location/Qualifiers FT CDS 119..1198 FT /product="L1_Ele3C_AAe_1p" FT /translation="MVAMRRENTFRIDYSCFPVKPSFEKVHGFCRSVLGLK FT KEDVERLQCHKGEQCAFVKVSDLTLAQKIVDEHDARHEVELNGKKHKLRIT FT MEDGSVEVKIHDLPENVPEEKIVEFLCAFGEVIXIRELTWGEGYEFAGIPL FT GIWSARMLVQKNIDSWVTIDGQQAXIVYKGQLQSCKYCKEQAHTGISCVQN FT KKLLVQKSYAXVTKQTGSTRPPPKKTAGTKPXSAKXIGPKPPVLPTATSDA FT FPELPKSSSQPEQPASTSKSDARFDLTTSPRPQTQRXHGSSSSLRAPHVEE FT SPKANIVLVDCFKKPTNAMRSQSKSGNGNETDDSSTSTNSRRSARGRPPGK FT KPRREDGDDEQDEDYHP" FT CDS 1202..4378 FT /product="L1_Ele3C_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MALSSYNISSINIGTITNPTKLNALRNFISSQSLDIV FT CLQEVENDQLSLPGFVVYTNVDHARRGTAVAVKEHIQVSHVEKSLDSRLLA FT LRVQDTTICNIYAPSGSAQRAAREEFFNGTLAYYLRHRTAHVILAGDFNCV FT LRACDSTSPNASPALKTAVQQLQLHDVWEKLRPRDXGFTYVCRNAQSRLDR FT VYVSNGLRENLRTAHTHVCSFTDHKALTVRXCLPQLGHEPGRGFWSLRPHL FT LTDENVAEFQYKWQYWTRQRRNYASWIEWWXTYAKPKIKSFFRWKSREFFD FT EFHNQHQRLYAELQLAYDGYYQHPEMLPTINRLKGEMLALQRNFSHMFMRI FT NETYVAGEPLSIFQLGERRRKKTIITELXRGENXIINEPQAIXANLXDFFS FT QLYSEEAPNSNXNGFAXERAIPPNDPVNEACMEEITTAEILSAIKTSAPRR FT SPGXDGIPREFYLRTFDVIHRELNLLLNEALTGNIPPAFVEGIIVLVRKKG FT GDNTARSYRPISLLNSDYKLLSRILKNRLEGVMRAHGVLSDGQKCSNSERN FT IFQATLALKDRIASLRHHRRAGKLISFDLENAFDRVRHSFLFETMRSLGFN FT QELIALLSRIASRSASRLLVNGHLSRPLEIQRSVRQGDPLSMHLFVLYLHP FT LVRKLEEACGNDLLVAYADDISVIVTSXAQIEALNEIFTRFGIAAGAKLNL FT RKTVAINVGVCEGNEINTHWLQTVNTVKILGVVFANSIRLMTTLNWNTLVG FT KFSQQMWLQSLRTLTXHQKVIMLNTFGTSKLWYLSSVLPPLGVHTAKITAT FT MGKFLWRGIVARVPMMQLARDKEHGGLNLHLPALKCKSLAINRHMQEIDSL FT PYYRSLLHHXNPRPAIPIDHPDIKLILSXLSQIPHQIQQHPSADQIHKFFV FT QQTEVPKVERNSPATDWPRAWRNINXKQLSSXERTKLYLLVNEKLEHRKLL FT FAMQRVADENCEHCGNHIETLRHKLCDCTRVGPAWTVLQRRLAGXMNGWRR FT LTFDELARPTLIGINKAKRVXILKLLVKYICHVNESNDRIDVDALNFLLDL FT NX" XX SQ Sequence 4450 BP; 1181 A; 1185 C; 1082 G; 949 T; 53 other; ttcagttatc gctcagcttc tgagacgcac aggcgtgttt acgattgcgc tcctcattag 60 tgaagcactt ttttcagatc gttcaatcga tatttctgtg ttgtgccgcg tcgcgggtat 120 ggtcgcgatg aggcgggaaa acacttttcg aatagactat tcgtgcttcc cagtaaagcc 180 gtctttcgaa aaagtgcacg gmttctgccg ttcagtgctt gggctgaaga aagaagacgt 240 cgaaagacta cagtgtcaca agggcgaaca atgtgcattc gtcaaggtca gcgacctgac 300 gcttgcacaa aaaatwgtgg acgaacatga tgcccgtcat gaagtggagt taaacgggaa 360 gaagcacaag cttcgtatca ccatggaaga tggtagtgtg gaagtgaaga ttcacgatct 420 gcccgaaaat gtkcccgaag agaagatcgt cgagttcttg tgcgcgttcg gcgaagtaat 480 ctscatccga gagctaacgt ggggagaggg ctacgagttt gctggtatac cactcggcat 540 atggtcggcc cgtatgttgg tacaaaaaaa tattgactcg tgggtcacca tcgacgggca 600 gcaggcatwt attgtctaca aagggcagct gcagtcgtgc aaatactgca aagagcaggc 660 acacactggc atttcttgtg tccaaaacaa gaaactgctg gtgcaaaaga gctatgccam 720 cgtgacgaaa caaackggat caacacgacc accgccgaaa aaaacagctg gcacgaaacc 780 ascaagcgca aaacmgatcg ggccgaaacc tcccgttctc ccaacagcaa cgtcggatgc 840 ctttcccgag cttccgaagt cttcgagcca gcccgaacag cctgcctcga cttctaaatc 900 cgatgctagg ttcgatttga caacgtcccc ccgcccgcaa acccaacgcg kgcatggatc 960 gtcgtcgtcg ctccgagcac ctcatgtcga agagtcacca aaagccaaca ttgttttggt 1020 tgactgcttc aaaaagccga cgaatgcgat gcgatcgcag agcaagagcg gcaatggcaa 1080 cgaaaccgac gattcttcca cttccacgaa tagcagacga agcgcgcgag gccgaccacc 1140 cggcaaaaag ccccgtcggg aagatggtga cgacgagcag gatgaagact atcacccata 1200 aatggctctc tcgtcctata acatctcgtc catcaacatc ggcacgatca ccaaccccac 1260 gaaactmaac gcgctacgta acttcatcag cagccagagc ctcgatattg tgtgtctgca 1320 agaggtggaa aacgaccagc tctccttgcc tggcttcgtc gtatacacca atgtagacca 1380 cgcgaggaga ggtacggccg ttgcggtgaa ggaacacatc caagtctctc acgtcgagaa 1440 gagtttggat agccgactac ttgcgctgcg agtgcaagac acgaccatct gtaacatcta 1500 cgctccctcc ggttctgcgc aacgtgctgc tcgggaggag ttctttaatg gaactctcgc 1560 ctattacctc cgtcatcgta ccgcacacgt cattctsgct ggcgatttca actgcgtatt 1620 gcgcgcatgc gactcaacca gccctaacgc aagccctgct ctcaagacag ccgtgcaaca 1680 gctgcagctg catgatgtgt gggaaaaact kcgcccacgt gacmctggtt tcacctacgt 1740 ctgccgaaac gcgcaatcgc gkctcgaccg cgtktacgtc agcaatggwc tacgagaaaa 1800 tttgcgaact gcgcacactc acgtgtgttc gtttacggac cacaaagcgc taaccgtcmg 1860 amtatgcctc ccccagctcg gscatgagcc tgggcgcggt ttctggtctt tgcggcctca 1920 tcttctgact gacgaaaacg ttgcggagtt ccagtataag tggcaatatt ggacccggca 1980 gcgacgaaat tatgcgtcat ggattgagtg gtggmtcacg tacgcwaagc cgaaaatwaa 2040 aagttttttc aggtggaagt ctcgagaatt tttcgatgaa ttccacaatc agcaccaacg 2100 gctttatgct gaactwcagc tagcgtacga tgggtactac cagcaccctg aaatgctgcc 2160 cacgataaac cggctaaaag gggaaatgtt ggcgctgcag aggaattttt cccacatgtt 2220 catgcgcatt aatgaaacgt acgtggcggg wgaaccwctg tcgatcttcc agttggggga 2280 aaggcgaagg aaaaagacca tcatcaccga gctccamcgg ggtgaaaatg amatcatcaa 2340 cgagccacaa gcgatcsaag cgaatttgtw tgacttcttc tcccagctct actcggagga 2400 agcaccaaac agcaacgaka acggttttgc cwgcgagcgc gcaatcccac cgaacgaccc 2460 ggtgaacgaa gcatgtatgg aggagataac aacagcagaa attttgtctg ctatcaaaac 2520 aagcgccccg aggagatccc cgggcwgcga tggcatccca cgagagtttt atctccgaac 2580 gttcgacgtc atccatcgag aattgaacct cttgctgaac gaagcgctca ccggcaatat 2640 tccccccgca ttcgtggagg gaatcatcgt gttggtgagg aagaaaggag gcgacaacac 2700 agctcggtcc taccgaccca tatcgctgct caacagtgac tataagttgc tatcccgcat 2760 actcaaaaac aggctagagg gtgtgatgag agcgcacggc gttttgagcg acggacagaa 2820 atgttcgaac tcagagcgca acatcttcca agcaacactc gccctcaaag atcgtatcgc 2880 cagtctccgt catcaccgac gcgccggtaa gctcatwagc tttgatttgg agaatgcgtt 2940 cgatcgggtc cggcactcct ttctctttga gaccatgcga tctctcgggt tcaaccaaga 3000 actcatcgct cttctttctc gcatcgccag ccgstctgct tctcggctac tcgtcaatgg 3060 gcatctctcc cgcccgctcg aaatccagcg ctcggtccga cagggggacc ctctctccat 3120 gcacctcttc gtgctgtacc ttcacccact ggtgcgcaaa ctggaggaag cgtgtggcaa 3180 cgatcttctc gtwgcgtatg cggacgacat cagcgtcatc gtgacgtcag mggcgcaaat 3240 cgaggcgctc aatgaaattt tcacccgctt cgggattgcc gccggtgcga aattaaattt 3300 gcgsaagacg gttgcgatca acgttggggt ttgcgaaggc aacgaaatca atacccattg 3360 gctgcaaact gtcaacacag tcaaaatttt gggtgttgtc ttcgcaaatt ccatacggct 3420 aatgacgacc ctcaactgga acacgctggt gggaaagttc tcgcagcaaa tgtggttgca 3480 gtcwctkcgc accttaacak tgcaccagaa ggtaatcatg ctcaacacst ttggcacgtc 3540 gaagttatgg tacctttcgt cagtgctgcc ccctttgggt gtacacacgg cgaaaatcac 3600 cgccacaatg ggcaaattcc tatggagagg aatcgtcgct cgcgtgccaa tgatgcagtt 3660 ggcccgcgac aaagagcacg gaggcttgaa tcttcatttg ccagcattga agtgcaagtc 3720 tttggcaatc aacaggcaca tgcaagagat cgattccctt ccctactatm gatcccttct 3780 tcaccacgwg aatccccgcc cagcaattcc catagatcat cctgatatca aattaatcct 3840 gtcamattta tcccaaattc cccaccaaat ccaacaacac ccctccgccg atcaaatcca 3900 caagttcttc gtgcaacaaa cagaggtgcc caaggtggag cgtaacagtc cwgcgaccga 3960 ctggccacgt gcgtggcgaa acatmaacat saagcagctc tcgtcgswgg agcgaacgaa 4020 actgtacctg ctcgtgaacg agaaacttga gcaccgaaag ctactgtttg cgatgcagcg 4080 agtggcggac gaaaactgcg aacactgtgg gaaccacata gaaacgctcc ggcacaaact 4140 gtgtgactgc actcgcgtcg gcccggcctg gacggtcctt cagcgaagat tagcagggmt 4200 catgaatggt tggagacgac tcacattcga cgaacttgcg aggcctacac tgataggaat 4260 aaacaaagcg aaacgtgtcg akatcttgaa gcttttagtg aaatacatct gccatgttaa 4320 tgagagtaac gataggattg atgttgatgc tttgaacttc ttgttagatt tgaacwatta 4380 aattgtatat agctgtaaac gaacttgaca aataaaacct aacttttaaa aaccaaaaaa 4440 aaaaaaaaaa 4450 // ID BEL-86_AA-LTR repbase; DNA; INV; 190 BP. XX AC supercont1.279; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-86_AA_; KW BEL-86_AA-I; BEL-86_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.279; Positions 644503 644314. XX SQ Sequence 190 BP; 55 A; 47 C; 47 G; 41 T; 0 other; tgccgacaac acccctcgtt gcagcgacgc cgctggtggg gttacaaaca cttcacgaca 60 ggccatgcac gtaaccgtca agtatgatag tggaatggaa gacatgtcga agcaatagga 120 gatttgtata catgcaagtc atcatggtac tactttcctc cgtatgagac gtggctacca 180 aatcgaagca 190 // ID P-22_HM repbase; DNA; INV; 3165 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 01-APR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-22_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3165 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 368-368 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 255..2711 FT /product="P-22_HM_1p" FT /translation="MPLKCCIPLCKSNYHSTESKISIYKFPKNPEERKKWR FT DVIPRANFSISDYTAVCRLHWPDDAPFETKYGKQRPLNPPSVFKNIPPSCL FT PTPSNKVRKTVTSRGFRNVFPDEFNVFLKDDELSFAEIQSIFINSNDVIVF FT QFNKNSVHIQSKMYKLGVPLFLLEIFDNYSFTAYHNGTTCNISSLTANNLK FT LIKTRSALFEAVRFLKHKESSHQSNILFEHLNVMRKVNVGEVLYTPDIICR FT AYEYFAMRRSLYDRLCVDYKLPSIRTLTRLTSKISSMEDLNFINSIFMNLD FT PLKRTSIILIDEVYIKASLLYQRGALFGHSVNYPERLATTILSFMIKCLFG FT GPEFICRAQPVSNLSAEFQFAQCQQILDAINIIENSKTLAIITDGNRVNQR FT FFRTFKTFESKPWLTTSGTYLLYDYVHLLKCIRNNWLTEKTGQLCFLHNGE FT LTLAKWSDLETLYKTESISLLKLSKLTAKSVYPKPIERQSVQFCLSVFCEE FT TVAAFRTHPDIENKEFEGTAVFIEKIISFWSVVNVKAPGAGIRFRNELRGE FT IHSIDDQQLQLLRDTADLAKFMKPTGKRFKQLTHDTSTAIEHTCYGFIDLV FT ETLLSTGSKYVLLGWFSTDPLEKAFSKLRQGSGGTYFINAKSVLEKIHIQH FT TKLILQLDIPVNGVDGHICDICCRDVSTDEKELLDNLHDLESLVNKSTLLA FT IVYIAGYVQKNEINIYDDTTSYYNEYGDYLNSLNRGGLEIPTDTLVQWSIF FT CFLFFQGATGPLCRTFCVTQFQFIAAKYRFKITKRQCRIYSNILLKNYALI FT STPNSSRESKAKILKLS" XX SQ Sequence 3165 BP; 1080 A; 483 C; 517 G; 1085 T; 0 other; catggccttc tataataaca ggccttgaca tcgcgctata aagttgccag actagaggcc 60 gatttataat actgtgttta catttcctct tttaacttta caaaaaagat ttagtttgcg 120 cttttagagc agatttttca aattcgattg gttaggtttg gttagagcgc aactttaaga 180 cgtaaaagtt aacttcataa cactggcttg agtattttta aactcatata aagttatttg 240 agtagtagta gaagatgcct ttaaaatgct gcatacctct ctgcaagtca aactatcact 300 cgactgaatc aaaaatatcg atatataaat tcccaaaaaa tccagaggaa agaaaaaaat 360 ggagagatgt catcccaaga gcaaatttct ctatttctga ctatacagca gtatgtcgac 420 tgcattggcc tgatgatgca ccttttgaaa ccaagtatgg aaaacaacga ccattaaacc 480 caccatctgt ttttaaaaat attccgccta gttgtttacc aacaccatcg aataaagtca 540 gaaaaactgt aacttcaaga ggatttcgaa atgtctttcc agacgagttt aatgtttttc 600 taaaagacga tgaactttca ttcgccgaaa ttcaatctat tttcattaat agcaatgatg 660 ttattgtttt tcaatttaac aaaaatagtg tgcatataca gtcaaaaatg tacaaacttg 720 gagttccatt gtttcttttg gaaattttcg ataattattc attcacagct taccataatg 780 gcacaacttg caatatctca tctttaactg caaacaattt aaaattaata aaaacaagat 840 ctgctttatt tgaagcagtt agatttttaa aacataaaga gagttctcat caatctaata 900 ttctatttga gcaccttaat gtgatgagaa aagttaatgt tggtgaagta ttatatactc 960 cagatattat atgtagagcg tatgagtatt ttgctatgcg acgtagttta tatgatcgtt 1020 tgtgtgttga ctacaagttg cccagtatac gcactttaac acgactgacc tcaaaaatta 1080 gttctatgga ggacctaaac ttcataaata gtatctttat gaatttagat ccattaaaaa 1140 gaacttccat tattttaatt gatgaagttt atataaaagc ttcattactt taccaaagag 1200 gcgcgttgtt tggtcattca gtaaactacc cagagaggct agcaacaaca attttatcat 1260 ttatgattaa atgtcttttt ggaggaccag aatttatatg tagagctcaa ccagtttcaa 1320 atctttccgc tgaatttcaa ttcgcacaat gtcaacaaat tcttgatgca ataaatatta 1380 ttgaaaacag taagacactt gcaatcatca ctgatggtaa ccgggtaaat caacgatttt 1440 tcagaacatt taaaacattt gaaagtaaac catggttaac aacatcaggt acatatttat 1500 tgtatgacta tgtccatctc ctaaaatgta tacgaaataa ttggttgaca gagaagactg 1560 gtcaactttg ttttttgcac aatggggaac tgaccttggc caagtggagt gatttggaaa 1620 ccttgtataa aactgaatct attagccttt taaaactttc taaactaaca gctaaatcag 1680 tttatccaaa accaatagag agacagtctg tacaattttg tttgtctgtg ttttgtgaag 1740 aaacagtagc tgcttttaga acacatccag acattgaaaa taaagaattt gaaggtactg 1800 ctgtatttat tgaaaaaatt atttcttttt ggagtgttgt taatgtcaaa gcacctggtg 1860 ctggaattcg atttagaaat gaattgcgcg gagaaataca ctcaattgat gatcaacagt 1920 tacaactgct gcgagatact gctgatttgg caaaatttat gaaacctaca ggtaagcgat 1980 ttaagcagct tacacatgat accagtactg caatagaaca tacatgctat ggtttcattg 2040 atcttgtgga aacgttattg agtactggat ccaaatatgt tttattaggt tggttttcaa 2100 cagatcctct tgaaaaagct ttttctaagc ttcgtcaggg gtctgggggt acatacttca 2160 taaatgctaa atctgtactt gaaaaaatac atattcaaca tactaaactg atattacaac 2220 tagacattcc tgttaatggt gttgatggtc atatttgtga catatgttgc agagatgttt 2280 ctactgatga aaaagaattg ctggataatt tgcatgatct tgaaagcttg gttaataaat 2340 cgacattatt agctatcgtt tacatagctg gctatgtgca aaaaaatgaa ataaatatct 2400 atgatgatac aaccagttat tataacgaat atggagatta tctgaatagc ttaaacagag 2460 gtggacttga aattcctact gatactctcg tccaatggtc aatattttgt tttctatttt 2520 ttcaaggtgc cactggacct ttatgcagaa cattttgtgt aactcagttt caattcatag 2580 cagccaagta tagatttaaa ataacaaaga gacaatgcag aatatattct aatattttgt 2640 tgaagaatta tgcactaatt tctactccta atagtagcag agaatcgaaa gcaaaaatac 2700 taaagctgtc atagtttttt tgaatttcag gttttaattc tccttttgtt ttatgtctct 2760 ggatgtattt aaatttaaaa atagttattt gatacataat aatacctatt tgatacataa 2820 taagagatac ataaaactaa aaagtttttt aatgaattga ctttgttttt gtttttattt 2880 gacatttttt gatgttactt tatttcaagt tatgttctgt aagcaaaaaa aaaaaaattc 2940 ttttaacatt tttaaatatg ttaaagtatg aacatttttc aaacgatttt agttttctta 3000 ttataatttt gtctttatta aagattcgaa taagaaagta ttgacaaagt attaaaagcg 3060 attagtaaga ctttatataa atatacaagc aatcaaagtt taatcggcct tcagtttttg 3120 cggcgtgtcg cgcgatgtca aggcctgtta ttatagaagg ccatg 3165 // ID Gypsy-183_AA-I repbase; DNA; INV; 7013 BP. XX AC supercont1.145; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-183_AA_; KW Gypsy-183_AA-LTR; Gypsy-183_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7013 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.145; Positions 1292314 1299326. XX CC 'ACAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 551..2518 FT /product="Gypsy-183_AA-I_2p" FT /translation="MMSVEHLMIYQSMNVMHLLDDEVEHELVIRREKFSSG FT DSRDVKRRKLRRIMKLQRDKNEFTFIPLKEEQLEVEFHEIDEKLAKIREAL FT ENKLVKKVDMPPLKTRLVHLLFRLNRLKDGMNFDTQGLEKVAVQLLNNYFS FT FLSTDPDAVLETQERFQEDLSRLSVQNEKEGDEGEDEKTESSSTDDRVVTR FT NKNRRNRSTSTPRRTRRTKMKSQDAIMKTVMSRIDRYLEQKLSSLNLSLNT FT ELSGNQERRALDRVEFTESTSEGVQPQASTEKSGKRDVGKNGKNGQARKTP FT ADSLAEKRREEKSKAKFSAECSEDDSESSAGSSDEEDSNSSDAARRSRRKG FT ASRRPRPVADWKLKYDGKDDGKLLNKFIAEVEFMAEAENISKKALFNEAIH FT LFAGEVRTWYMEGKKNKDFRNWKELVTELKLEYQSPDLDFHYEQQATQRRQ FT RRSEKFTEYYNAMKEIFGYMSVPPAEERKFDIVFRNLSSDYKNALVVKNIR FT SLKALKVWGRKLDSANWYLYRSRENESANKSAQVHEVSRKPFQKPMTFSGK FT DWKPKSFVNNTNERKSQAPQYPKKLIDNQKPTNPRREPSPQQSSSSGTLER FT QAAMYRIPDSDVCFNCKEKNHHYKSCLRKREKFCTKCGMHDVTAENCLFCQ FT KNGRKSA" FT CDS 2641..5658 FT /product="Gypsy-183_AA-I_1p" FT /translation="MGMPMVGLLDSGAQVSILGIGGEKLLQKLKMKKSQTR FT TKLFAAGGEVLEVRGAVHLPITFNGRTRLVTTLIAPSLKRRFILGMNFWRL FT FGIEPTIRDSPEVLVEEVEVEHEEEEIPLTVEQREQLEAIKKEFKAFEEGQ FT TLGTTPLITHKIEFDEEFRTATPVRLNPYPWSPEVQKQVNLELDEWIEAGV FT VERSTSDWALLVVPVLKKGETSEGETSLKVRMCLDARKLNERTRRDAYPLP FT HQDRILGRLGSAKFLSTIDLSKAFWQIPLHPESRKYTAFRVFGRGLFQFTR FT LPFGLVNSPATLSRLMEQVLGYGELEPKVFVYLDDIVIANDTFEDHINCLK FT EVARRLKEANLSINLEKSKFCVPELPYLGFILSRNGVRPNPDKVEAIVNFE FT RPTSIRSLRRFRGMINYYRRFIEGYSEITAPLTDLLKGKPKVVKWNEQAEA FT AFISLKERLISAPILTNPNFTCPFTVQTDASDSAIAGVLTQEIEGEEHVIS FT YFSRKLTTTQRAWKAAEKEGLAAMEAIEKFRPYIEGTRFTLITDSSALSFI FT MNTKWKSSSKLSRWSMQLQQYDMVVRHRKGKDNVVPDALSRSMEALEIVEH FT DDWYANLLRKIAECPEDYADFKVEDSKLYKFVSSPTDALDYRFEWKICVPE FT NLRKSILDEEHDAAMHPGYEKNIQRLKCKYYWPGMAVDCKKHIRSCITCKE FT CKPSTVATAPAMGKQKLTNKPFQILALDFIQSLPRSKQGKCHLLVMMDLFS FT KWILLAPLRKIEAREVCRIVEDSWMRRFGTPEVIITDNATTFVGKEFQALL FT NRRGVQHWPNARHHSQANPVERVNRTINACLRTYMREDQRNWESKIFEVEE FT MINTTVHASTGFSPYRILYGHEKVEKGEHHRLEREDKEVAVEERDQFQAQI FT NEKVFKIVEDNLKKSHDKNLKNYNLRHKKFAPTYHIGQRVFKRNFKLSSAV FT DKFNAKYGPVYTPCVVVARRGTSAYEVADEEGKNLGVFSAADLRPDDSVAH FT " XX SQ Sequence 7013 BP; 2220 A; 1245 C; 1741 G; 1807 T; 0 other; attggctccc agccaaaatc gcgaaaggca aacggaggat cgagtagaat aaaaaattgt 60 ttgaattttt taacacgagc gggaggagag tacacattca ctttgtcact gattgtgtag 120 gatccgtgta ttatcctttt gccaacacag taccgtagac tgattttgag aagttgaaag 180 gtagtggctt gttgaatcgg tttcggaagt gctaaaaaga ttgagaattg caagcacatt 240 aacagttgtt aaaattgaat tagatcagag actgattatc aattgagtcg atagacggct 300 aaaagtaaaa aaaaggtcct gaatgagcac taacacttat acactaatac ataaatacac 360 caaaatcaac accatttatt gaattcgtat ttccttattt tgattttttt tttagtttta 420 ttgttctcta gcttaagttg attgaatttt tatttgtata tattgtgcta cttttattga 480 attcctttga actttgctga actgaattat ctactattaa attttccctt actgaattcc 540 taattttggc atgatgagtg tcgaacacct gatgatctat caatcgatga acgttatgca 600 cttactagac gatgaagtcg aacatgaact tgtcattcgg cgagagaagt tttctagtgg 660 tgattcgagg gacgtcaaaa ggcgtaaatt gcgaagaata atgaagctgc aaagagataa 720 aaacgaattt actttcattc ctttgaaaga agagcaacta gaggtagaat ttcacgaaat 780 agatgagaag ttagcaaaga ttcgcgaggc gttagaaaac aaattagtca agaaagtcga 840 catgccacca ttgaaaacgc ggttagttca tttactgttt cggttgaatc ggttgaaaga 900 cggaatgaat tttgatacgc aagggctcga aaaagtggct gttcaattgc tgaacaatta 960 cttctcgttt ctttcgactg acccagatgc agtgctagaa acgcaggaga gatttcagga 1020 agatttgagt aggcttagtg ttcaaaatga gaaagagggt gacgagggtg aagatgaaaa 1080 aaccgagtcg tcttcgacag atgacagggt cgtaacgcga aataagaata ggagaaatag 1140 atcaacgagc acaccgagga gaactaggag gacaaaaatg aaaagccaag atgcgatcat 1200 gaaaacggtt atgagtcgga ttgatagata tctcgagcag aagcttagta gtttgaacct 1260 gagtttgaat acggaattga gcgggaatca ggagagaaga gcattagata gggtcgagtt 1320 caccgaatcg acaagtgagg gggttcaacc gcaggctagt accgaaaagt cggggaaaag 1380 agatgtcgga aagaatggaa agaatggaca ggctcgtaaa acgccagctg atagtcttgc 1440 agaaaagagg agagaggaga agagcaaagc gaaatttagc gcggaatgca gtgaagatga 1500 ttcggagagt tcggcaggtt cgagtgatga ggaggattcc aacagttcgg atgcagcacg 1560 tcgtagtcgc agaaaaggag cgtcacgcag accacgaccg gtagcggatt ggaagctgaa 1620 gtatgatggg aaggacgatg gaaaactgct gaataagttc attgccgagg ttgagtttat 1680 ggccgaagct gagaacatta gcaagaaggc gttgttcaac gaagcaatcc acttgttcgc 1740 aggagaagta cggacgtggt acatggaggg gaagaaaaat aaggacttca gaaactggaa 1800 ggaattggtc accgagctga agctggaata ccaatcgcca gacctggatt tccactatga 1860 acagcaggct actcaaagac gacagaggag atcagaaaag ttcacggagt actacaatgc 1920 gatgaaggag atttttggat acatgtcagt accacccgca gaggaaagaa agtttgatat 1980 cgtattcaga aatttaagtt ccgattataa aaatgccttg gtcgtgaaaa acattcgaag 2040 tttgaaagcg ttaaaggtat ggggaaggaa gttggactcc gcaaattggt atctgtaccg 2100 aagtagagaa aacgagtctg ctaacaaatc cgcgcaggta catgaagttt ctagaaaacc 2160 gtttcagaaa cctatgacct tcagtgggaa agactggaaa cctaaatcct tcgtaaacaa 2220 taccaatgaa cgtaagagtc aggcaccgca atatccaaag aaactaattg ataatcaaaa 2280 acccacgaat cccagaagag aaccttcacc gcaacagagt agttctagcg gcacgttgga 2340 aagacaagct gcaatgtatc gaatcccaga tagcgatgtg tgtttcaact gtaaggagaa 2400 aaatcaccac tacaaatctt gtctgcgaaa gcgggaaaag ttttgtacga aatgtggcat 2460 gcatgatgtt accgctgaaa attgtctatt ttgtcaaaaa aacggacgca aatcggcgta 2520 ggaggtcgtc gagtgcgtaa ccaaaagcct ctaactgtct tcgtaccgtc caacccagaa 2580 gtggaggagt taactgttcg tgtggaagga gacaaccgtc cgttcgtcaa atttgaggtg 2640 atggggatgc ctatggtagg ccttctagac agtggcgccc aggtttcaat cctaggtatt 2700 ggaggggaaa aattgcttca gaagctaaag atgaagaaat cccagactag gacgaaatta 2760 tttgctgctg gaggcgaagt tttagaagtt cgaggagcag tgcatctccc aatcacattt 2820 aatggcagga cgaggctagt aaccacacta attgccccct cgctgaaaag aagatttatt 2880 ttggggatga atttttggcg cctttttggt atagaaccca ccattagaga ttcgccggaa 2940 gtgctagtag aagaagttga ggttgagcac gaggaggagg agataccgct aaccgtggaa 3000 caacgagagc agttagaagc cattaaaaag gagttcaaag cgtttgagga gggtcagaca 3060 cttggcacca ctccgcttat tactcacaaa atcgagtttg acgaagaatt tcgaacagcc 3120 acaccagtgc gtttgaaccc atatccgtgg tccccagagg tccagaagca ggttaacctg 3180 gaactggatg aatggatcga agcgggagtc gtcgaacgat caacaagcga ttgggcttta 3240 ttggttgtgc ctgtgttgaa gaaaggggag acgtctgaag gggaaaccag tctgaaagtt 3300 cgcatgtgtt tagacgccag aaagctcaac gagaggacgc gtagagatgc ctacccgtta 3360 cctcatcagg atcggatttt ggggcgattg gggtctgcaa agtttttatc tacgatagat 3420 ctgtccaaag cgttttggca gattccttta caccctgagt cccgtaaata tacagcgttt 3480 cgggtatttg gaagaggtct gttccaattc accaggcttc catttggctt ggttaatagt 3540 ccagccacat tatcaaggct gatggaacaa gtgcttggct acggtgaatt ggaaccaaaa 3600 gtttttgtgt atttggatga tattgtcatc gcaaacgaca catttgagga ccacatcaac 3660 tgtttgaagg aagttgctcg aaggttaaag gaagcaaacc tgtccataaa tttagaaaaa 3720 tcgaaattct gtgttcctga attgccgtac ttagggttta ttttgtcccg taatggcgta 3780 agaccaaatc ctgacaaagt tgaggcgatt gtgaactttg aacgaccaac ctcaattcgg 3840 tctttacgac ggtttcgggg catgataaac tactataggc ggtttataga aggatacagc 3900 gaaataacgg ctccattaac tgaccttctg aaaggaaaac cgaaggtggt taaatggaat 3960 gagcaagcgg aagcggcctt tatcagtttg aaagagcgat taatctccgc gcccattctg 4020 acaaacccaa acttcacgtg tccgtttacg gtgcagactg atgctagcga tagcgccatc 4080 gcaggagtcc tgacccaaga aattgaagga gaagaacatg tgatctccta tttctcgagg 4140 aagctgacca ccacacagag ggcatggaaa gccgctgaaa aagagggttt ggctgccatg 4200 gaggcgatag agaaatttag accgtatatc gaggggacta gatttacgtt aatcacggat 4260 tcctcagctt tgtcgttcat aatgaacaca aaatggaagt cttcttctaa gcttagccgg 4320 tggagcatgc aattacaaca atatgacatg gttgtgcgcc accggaaagg gaaggataac 4380 gtggttcctg atgcgctttc caggtcgatg gaggcgttgg agattgttga gcatgacgac 4440 tggtatgcaa acctacttcg taaaattgcc gaatgtccag aggattatgc agattttaag 4500 gtagaggact ccaaactgta caaatttgtg tcctctccaa ccgatgcact cgattacaga 4560 ttcgagtgga aaatttgtgt ccctgagaat ctgagaaaat cgattctgga tgaagaacac 4620 gacgctgcta tgcatcccgg atatgaaaag aacattcaaa gattgaaatg taaatactac 4680 tggcctggaa tggcagtaga ttgcaagaag catattagga gctgtattac ctgtaaggaa 4740 tgcaagcctt caacagttgc aactgcacca gcaatgggta agcaaaagct gaccaataag 4800 cctttccaaa tcttagcact agactttatt caaagtttgc cgagaagcaa gcagggtaaa 4860 tgccatttat tggtcatgat ggacctgttt tcaaaatgga tcttgcttgc tccgcttagg 4920 aaaattgaag cgagagaagt gtgtaggatt gtggaagata gttggatgag gaggtttgga 4980 acaccggaag taattatcac ggataatgcg acgacttttg tagggaaaga gttccaagca 5040 ttgcttaacc gtaggggagt ccagcactgg ccgaacgcaa ggcatcacag ccaggctaat 5100 ccggtcgaaa gggtgaacag gacgataaat gcgtgcttga ggacatacat gcgtgaagat 5160 cagcgtaact gggagtcgaa gatctttgaa gtggaggaga tgataaatac aactgttcat 5220 gcatctacag gattttcgcc ttaccgaatt ctgtatggcc atgaaaaagt tgagaaaggg 5280 gaacaccatc gactcgaaag ggaagataag gaagttgcag tagaagaaag ggatcaattt 5340 caagcacaaa tcaatgaaaa agtgttcaaa atagtcgaag ataatctaaa gaaaagtcat 5400 gataaaaact tgaaaaacta taatctacgt cataagaagt ttgcaccgac ttaccacata 5460 ggacagcgag tgtttaaacg aaatttcaag ctatcgtcgg cggtggataa gtttaacgcc 5520 aaatacgggc cagtttatac accgtgtgtt gttgtcgcca ggcgagggac cagtgcctac 5580 gaggttgccg acgaggaagg caaaaaccta ggggttttct cagccgctga tttacgtcca 5640 gatgactccg tggctcattg attgggttga acgttgccta aaagagacac ggagtgttaa 5700 aatctgtatg ataaaagtag atcatcaata taacttacat tgaagcaggt gatacaatat 5760 tgcattcgct gaacaccatt ggacgtcgta aagcaggtca tcatactccc tcagtcgaat 5820 gagccatcta atatggtagt tatatcgtgg gaactcagga ctctttatag cttagtttgt 5880 ccattggtga gtagttatcg tccgtctaat agagcagaga gagagtgttc tcttgctcag 5940 ggttaagagg atgagtgcaa tgagtgtaca acctgcgaat taaagattaa tgacgtcatc 6000 atttaggaag ctgcgcagcg agatcaggta gaaaagtgaa gaagattaaa attaactctt 6060 ttgttggtga agccatgtac aaaaggggtg aaggaacttg ggaattttta gatagtagag 6120 tagataatag ataataatag ctttaagata gagatagatt ttacgtagaa tttgaagaat 6180 aattgtgagg aagaatgtat agaaagttat cgttgaactg tccaaatact tacccgtgat 6240 atttttgaaa ttttaatttg ttaattccat agatgagatg ctcgttttca ggttcgttcc 6300 gcaggaaaaa ccctagatag gatgctttca acgcgcgccg aaagcttgac caacagaaca 6360 gggtctacta aagtagatct agcctgttgt gaccaggaag gccacctggt cgaatcaacc 6420 tgaaaataaa cagtttcgtc cagcttaacg catagtagat acatatgaaa tacttatcta 6480 atgcaatccg tagaatcacc attcgatacc atcttgtccg tttgttctgt agcacatttt 6540 aatacgtact gtccattgtt catttttttt taatacacag tttgtaattg tccggtcaac 6600 tacgtgtata atcgcgaacg cgtgtgagag gagactgatt ggtcatctcg atccctactg 6660 tgtgtaggat gattcgttga tagacactga gagaaggaag tagtagaacg aattcacttt 6720 aggtgttcat tgacaagccc actcacaaca aaattatctc acgtgctcaa ttatagcaag 6780 tttcttagtg aaattcctgc ttttagtttg ccgaggattt taatgtgttt atgatgtcta 6840 atgatggtaa ttgaatttta actgcttcgc gtagcgatta gtatatgttt gtggaatttt 6900 tagcaactcc cttttaagct ttgaataaga tgttttcacc tctgtgtggg attaaataat 6960 taccctccct gatctttagt tagggtaatt atttaacctc ggtgggggga tag 7013 // ID hATx-2_SM repbase; DNA; INV; 2681 BP. XX AC . XX DT 21-OCT-2007 (Rel. 12.1, Created) DT 21-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2681 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1040-1040 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 451..2412 FT /product="hATx-2_SM_1p" FT /translation="MEQTHYENCWQHFKNKPGDKDIAICNLCKKCISCKAS FT QTTGLHRHLTGHHSLKRKAIATATTGSSKSHSSSKSSTLDKFVTVTTNANN FT DIETLEEIVAKLAAKDGFSILAITKSEFIRKSIKKQGFSLPQQPSDVMNLV FT YKYYSFVKDKLVFKIKSLKEQNYKFSLSIDEWTSFKNRRYLNVHLYSGKLD FT SVNLGLIPITGSCPAEKIIEMVNSKLESFGISFSKDIVATTSDGASVMVKY FT GKLSPAHLQMCYNHAIHLSIISVIYASKTENNETSDNESSNCDPNEDDLFN FT SNFQEQHSISPSDLIINSDPDININNTVQNIRKVVKLFKKSPVRNNVLQNY FT VKAAEQKELSLLLDCRTRWNTLESMVERFLRIVDPIKHALEDLNMDHLLII FT DDITNARNILDVLKPARIAIEALSRNDTTLLTAEGILKFLFQALEKNHSVL FT ALKLLEELKIRIGKRRDKSLVSLLKYLQDPSSLENVEDEFFSMTSKNEITK FT SAKDLMVRLFPVSPDKSPDNRNLIVEIPTENSEIMEIEDCVLESNAMNQQD FT MNNANEVEDILKKQLENFIRESNKSSQISDGSFKTLSREITIFSINGQLTP FT NLALLQAALKTIKATSTQNERNFSTAGNFVSKKRTSLSDKSIDSLCFLKYY FT FMNNQ" XX SQ Sequence 2681 BP; 995 A; 376 C; 429 G; 881 T; 0 other; tagaggttgt gatcgagaat tctcaacaac ttacaatttc tcgattctcg agattttttt 60 tctatttctc gataaaaaaa agattattta aattaaatta atatcctttt ttgtctaagc 120 ttgacatgca aaagaataac aaaaagcaaa aaaaaaatct tcataattat ttattcctaa 180 attttgtttt atgtcgatgt aggcttgttg tgttgtaaaa aaatttaaaa tttatggata 240 ggtataggtt gtcattcatt ttttattatt cattcctagt attgtagatt taaaaggtga 300 agtgcaaata ttatatattt agattatttt tttataagta agattaaagc tgtcaatcga 360 taatatcgta ctaaaatttt tcttctttaa acgtaagtat aattacattc agaaaaaatt 420 taattttaat ataaattttt aagccattaa atggagcaga cacattacga aaattgttgg 480 caacatttta aaaataaacc tggggataag gatatagcta tttgcaatct atgtaaaaaa 540 tgtatatcat gcaaagcatc gcaaacaact ggacttcatc ggcatttaac tggtcatcat 600 agcttgaaga ggaaagcaat agcaacagca actacaggca gttcaaaaag ccatagttct 660 tcaaaatcat caacgcttga caaatttgtg actgtcacca cgaatgcaaa taatgatatt 720 gaaacattgg aggaaattgt ggctaaatta gcagccaaag atggtttcag tattttggct 780 ataactaaaa gcgagttcat tcgtaaaagt ataaaaaagc agggattttc tttgccacag 840 cagccatcag atgtcatgaa tttggtgtat aagtattatt cctttgtgaa agataaatta 900 gtttttaaaa ttaaatcgct aaaagaacag aattacaaat ttagtttgtc tattgatgaa 960 tggaccagtt tcaaaaacag gcgatacctt aatgtccatt tatactccgg aaaattagat 1020 tcagttaatt taggattaat cccaataaca ggaagctgtc ctgctgagaa aattatagaa 1080 atggtaaatt caaaattaga atcgtttggt ataagttttt ccaaagatat tgtggcaact 1140 acatcagacg gtgcgagcgt aatggtaaaa tatggtaaat tatcacctgc ccatttacaa 1200 atgtgctata atcacgctat ccatttgtcg attatatcgg ttatatacgc atctaaaaca 1260 gaaaataatg aaaccagcga taacgaaagt tctaattgtg atccaaacga agatgattta 1320 tttaatagta atttccagga acagcatagc atttctccaa gcgatttgat aattaattca 1380 gatcccgata taaatattaa taatactgtg cagaacattc gaaaagttgt taagcttttc 1440 aagaaatcac cggtaagaaa taacgtcttg caaaattacg ttaaagccgc agagcaaaaa 1500 gagctatcgc tattattgga ttgtaggacc aggtggaaca cactagagtc aatggtagaa 1560 cgatttcttc gtatagttga tcctataaaa catgctttag aagatcttaa tatggatcat 1620 ctattgatca ttgatgatat tactaacgcc agaaatatat tagatgtttt gaaacctgcc 1680 cgaatagcaa tagaagcgct aagtcgtaat gatactactt tattgacagc ggaaggaatc 1740 ctaaagtttc tttttcaagc actagaaaaa aatcattctg ttttagcatt aaagttgctt 1800 gaagaactaa aaattcgtat aggaaagaga agggataaat cattagtgtc tcttcttaaa 1860 tatttgcagg acccatcctc tttggaaaat gtggaagatg aatttttcag catgaccagt 1920 aaaaatgaaa tcacaaaatc ggctaaagac ttgatggtaa ggctgtttcc tgtatcacct 1980 gataaatcac ctgataatag aaatcttatt gttgagatac ctacagaaaa ttcagaaata 2040 atggagattg aagattgcgt attagaatcg aatgctatga accagcagga tatgaacaat 2100 gctaatgaag ttgaagatat tcttaaaaag cagctggaaa attttattcg cgaaagcaac 2160 aaatcttctc agatatctga tggaagcttc aaaacattat cgagagaaat tactattttt 2220 tcaatcaatg gccaacttac gccaaactta gcgttgttgc aagcagcatt gaaaactata 2280 aaagcaactt ctacgcaaaa cgaaaggaat ttttcaactg cgggaaattt cgtttcgaag 2340 aaaagaacaa gcctatcaga taaatcaatt gatagcctct gttttttgaa atattatttt 2400 atgaataatc aataaattat gttaatttta aaaatttaat ttataatata tacatatatt 2460 agaattttga ataaaattta atgaatttct ttttgtaaaa tttatacttt tttattagta 2520 aataatatag tgaaaatact tgcattaaat attacattta atgcaagtat tttttatcta 2580 tttttgttaa ttatttgcat ttctcgagaa atctcgagaa atgcaaatta attttctcag 2640 tttctcgaga agtttatttt gtcgagaaat cacaacctct a 2681 // ID ISL2EU-6_HM repbase; DNA; INV; 2976 BP. XX AC . XX DT 16-SEP-2009 (Rel. 14.09, Created) DT 16-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE A family of autonomous ISL2EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2976 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1917-1917 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(86..1360,1364..2029) FT /product="ISL2EU-6_HM_1p" FT /translation="MSYNKYEDFLNLSSKKLINYLSIRGLKITGKKVELVA FT RAFAAMELNLDIICSAEDQYKRLQIDYEHLLQKHGIPDPNGVELEKRFDNL FT SKWPVVTMGNIFAFILKKKEFNTDYIGKYKDQKAFSYFDSGFVGPIFIYEH FT YLKPKEKVIFLYCTVTASQALRESKTLWIVIKQGDEKGSDILTGWCSCMAG FT SYETCNHVIACLYKIEYANTKGWCNPACTETACQWNLGTRKDVEPKRISDL FT IISKKLGTKSSLSNDCEENRMKNLNNFDPRIISHRTISSNHISNFIRDMQL FT VNEEAVLFKSIESLSTTNEEHYNYAFVKTIALNVLKAYSQFSETELIQIFI FT EKLNMSLKTIENVEKATRGQSEVLLWFEIRSGRLTASKHHEVFTKVNTILK FT AQGSIKPKTTPLVSKIIYYDDVSSASMKWGVHEDIAFKSFYAQEISKHQDF FT RAEKCGIFLSKFKCYLAASPDGIVFCKCHGKSILEIKCPYSLRDKKISESV FT NDCEFLTVNNGKITLMENHKYYTQVNSQMAITEIKECFFVVWTTQETFVQK FT VKFNQDHWNKVSTNLDIFYKTFVCPALLNFKPLIFCCKCDNVLLEKCEIDT FT NKESYLKSIQCNLCEGWLHFKCEGLTDTENENNTEWICSYCLVSLPGLE" XX SQ Sequence 2976 BP; 1100 A; 411 C; 441 G; 1024 T; 0 other; gacgatttcg gagttagcgt agccatcttg gaaaaatatt cttgggggtg caagttgaac 60 tttgaacctt ccgtttttat gcaaaatgtc atataacaag tatgaagatt ttctcaattt 120 aagttcaaaa aaactcataa attatttgtc tattcgtggt ttgaaaataa ctggaaaaaa 180 agttgaactt gttgctagag cttttgctgc tatggagctt aatcttgata ttatatgttc 240 tgctgaagat caatataaaa ggcttcaaat tgattacgaa catttattac aaaagcatgg 300 aattccagat cccaatggag ttgaactaga aaagcgattc gataatcttt ccaaatggcc 360 ggttgtgaca atgggaaata tttttgcttt tattttaaag aaaaaagagt ttaatacaga 420 ttatattgga aagtataaag atcagaaagc cttttcgtac tttgatagtg gttttgttgg 480 acccattttt atttatgaac attatttaaa accaaaggaa aaagttatat ttttatactg 540 tacagtaaca gcctcacaag ctctccgtga atctaaaact ttatggatag ttataaaaca 600 aggtgatgaa aaaggaagtg atattttaac tggatggtgt tcatgtatgg caggatcata 660 cgaaacttgt aatcacgtaa ttgcatgttt atataaaatt gaatatgcaa atacaaaagg 720 ttggtgcaat cctgcatgta ctgaaacagc gtgtcaatgg aaccttggaa cccgcaaaga 780 tgtggagcct aaaagaattt cagatttgat tatttctaaa aaacttggaa ctaaatcatc 840 tttaagtaat gactgcgaag aaaatagaat gaaaaattta aataactttg atcctcgaat 900 tatatcgcac aggacaatat cttctaacca tatatcaaat ttcataaggg atatgcaatt 960 agtaaacgaa gaagccgtct tatttaaatc catagaatca cttagtacaa ctaatgaaga 1020 acattataat tatgcctttg tgaaaaccat tgcacttaat gttttaaaag catattctca 1080 gttttctgaa acagaattaa ttcagatatt tattgaaaaa ctaaatatgt ctttaaagac 1140 tattgaaaat gttgaaaaag ctacaagagg acagtcggaa gttttattgt ggtttgaaat 1200 aaggagtgga aggttaacag cttctaaaca tcatgaagtt tttactaaag taaatacaat 1260 tttaaaagca caggggtcaa taaaacctaa aactacacct cttgtttcta aaataattta 1320 ctatgatgat gtatctagtg cttctatgaa atggggagtc taacatgaag atattgcatt 1380 taaaagtttt tatgctcaag aaatatcaaa acatcaggat tttagagcag aaaaatgtgg 1440 aatttttctt tcaaaattta aatgttattt agctgcatca cctgatggca ttgtattttg 1500 taaatgccat ggtaaaagta ttttagaaat caagtgtccg tatagtcttc gtgataaaaa 1560 aatatcagaa tcagtaaatg attgtgaatt cttaacagtt aacaatggca aaattacttt 1620 aatggaaaat cacaaatact acacacaagt taattcacaa atggcaataa cagaaataaa 1680 agaatgtttt tttgttgtct ggactaccca ggaaactttt gttcaaaaag ttaagttcaa 1740 ccaagatcat tggaataaag tttccacaaa tcttgatata ttttacaaaa cttttgtttg 1800 tccagctctt ttaaacttta aacctttaat attttgttgt aagtgtgata atgttttgtt 1860 agaaaagtgt gaaattgaca caaacaaaga atcgtattta aaaagtatcc aatgtaatct 1920 ttgtgaaggt tggctccatt ttaaatgtga aggtttaacg gatactgaaa atgagaacaa 1980 tacagaatgg atatgctcat actgccttgt ttccttacca ggcttggaat aacttactgt 2040 gattatcgga tacaatttca tcaaaaattt atgaaactac tttggtttta aacctgaaat 2100 aaattatttt ttgctggatc atatcttacc tttataatat tatttatgta ataacttgca 2160 tagtttgagc aaatgcttaa aaatttgtat taaattttgt ttatatcaca aataaaatag 2220 ttaaattttt tttgtgacat tttttataaa attggtaccc actccctatt ttaaaaataa 2280 aaaaatctta caaggatgtt tgaaactgtt gccatacttt tgaaaaataa aatatattct 2340 ctaacaataa tttctcaaaa acacatgttt acatagtttt ccatgttatt aaaattgaaa 2400 taaaaaaaac cttgtccctt tttatccttt ttataatgac cccctacccc actatttgca 2460 tgacgtgctt tatggatggc ccctgattaa tataataatt atgatgttaa attataaatt 2520 tatggtaaaa attaatattg ataagtataa acatcatgtt tatacttaac aagtataaac 2580 atatttatac ttatcaatag taattttgca aaaaatataa tataaaaatt tttaaaaaca 2640 ttttatatta aactcttttt tctttaaaat aaaaacaaaa atatttccca ttgttacaac 2700 tggctgattt gaagcctttt atattgatct tcagcagaac atataatatc aagattaagc 2760 tccttagcag caaaagctct tgcaacaagt tcaaattttt ttccagttat tttcaaacca 2820 cgaatagaca aataatctat gagttttttt gaacttaaat tgagaaaatc ttcatacttg 2880 ttatatgaca ttttgcataa aaacggaagg ttcaaagttc aacttgcacc cccaagaata 2940 tttttccaag atggctacgc taactccgaa atcgtc 2976 // ID Kiri-8_CQ repbase; DNA; INV; 4061 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiritsubo non-LTR retrotransposon family from Culex DE quinquefasciatus - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4061 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 127-127 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >97% CC identity. Both termini are truncated. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 3..926 FT /product="Kiri-8_CQ_1p" FT /translation="FLLPEAKYGDEVLQLDTXXSSNLRHIAGLLSFPQVTH FT WNHSXFDSDHEWNEENRRQKKSSVSSKDGTNFKRSRDDLDETDDRIQSFDE FT LLTRMKQMFDDTNTRIDTCKSDLQAEFTTLREDVQSFKDRCSVEINNLSDA FT LALTQNNVNLNHERHLKNXKSNDLLLSGVPYHQPEDLLGFVKSASAALGYG FT ERELPLIYTKRLXRPPIAPGSTPPILLQFAFRAARDDFYFRYLSSHNLSLT FT HLGFNVNKRVYLNENLTDQARSIKGAALKLKKSGKLHSVYTKDGFIFVKPA FT EGGEAKLVNSLEQLAE" FT CDS 1552..4059 FT /product="Kiri-8_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MLDNSSATPSQLNCFIPRVVMNCALDSSKLNICHLNA FT QSICARQLSKFNEFKICFSNSKVSIICVTESWLTDNIPDELISLEGYNLIR FT NDRKCARGGGILVYYKSNLTCKILTXSKNDTLNTTEXLLIEIHCNREKFLL FT GVFYNPPRVNCSEIVSSLFSEFSTSYENVVIVGDFNKSGTLSDNFLNAIDH FT LGICCVNSFPTHFHSGGCSLLDLFXTNNTNFVHNFNQVSAGGFSKHDMIFA FT SLNLSSFEDEIYYFRDYNNVNLPALIEAVNSINWSQLFSMXDPDIALDXFN FT QHIRNLFNRFIHLRKYVPRKNSWFNNNIMKAMIERDLAYNIWKTDKNFTNY FT SQFKRLRNKVTHLINEAKSNYVTSRVDSSVSSKDLWNKLKQLNISNKSKPD FT VKFPNSNAEINNYFTSNFTQDLSHPPILPRNNEGFRFSEVNVNDIINAINS FT VKSNAVGMDDIPIRFIKLILPEIIHKITFIFNMIILSSKFPRAWKMTKVIP FT IRKKSRSLDLNNLRPISILSALSKVFEKVLKIQIQTYLDNFNILNSNQSGF FT RKNHNTTSATVKVLDDILSVIDKKGKAFSVFIDFAKAFDRVSHVKLIQKLS FT LQFGFSSGATELLQNYLCNRKQVVFANGLFSDSASIISGVPQGSILGPLLF FT SLFINDLCSVLNHCRIHMFADDVQIYIASNRLSIPQMAQLLNEDLANIYDW FT SCSNLLPINSSKTKVMLFSRNSSEHQQPLIMLGQNVLEYVLKYPSLGFILK FT NDLEWDGHVNAQCRKIYIGLRTLKLSASMLPVNVKLKLFKSLLLPHFMYGD FT VLLLNASSASLNRLRVALNACVRFVYNLNRYSRV" XX SQ Sequence 4061 BP; 1185 A; 829 C; 717 G; 1306 T; 24 other; agttcctgct gccagaagcc aaatacggcg acgaagtttt gcagctggac acgagwsttt 60 cctcaaacct tcgccatatt gccggattac tctccttccc ccaagtcact cactggaacc 120 actcakcatt tgacagcgat catgagtgga acgaagaaaa cagaagacaa aaaaagtcat 180 ctgtttccag caaggatggt accaacttta aacgctcgag agacgatttg gacgagaccg 240 atgaccgaat ccagtccttc gacgaacttt tgaccaggat gaaacagatg tttgatgaca 300 ccaacacsag aatcgacacc tgcaaaagtg acctgcaggc cgagttcact acacttcgcg 360 aggatgtgca atcattcaag gaccggtgct ctgtggagat caacaatttg tctgatgccc 420 tcgcacttac tcaaaacaac gttaatctca atcatgaacg ccacctcaaa aatkccaaat 480 cgaatgatct gctgctctct ggcgttccgt atcatcaacc cgaagacctw cttggttttg 540 tgaagagtgc gtcggcggca cttggctatg gcgaacgtga acttccactt atctacacga 600 agcgattggs tcggcccccg attgcacctg gctcaactcc accgattctg ctgcaattcg 660 ccttccgtgc tgctagagat gacttctact tccgctattt atcgtcgcac aatctctcct 720 tgacgcacct tggcttcaat gtgaacaaga gggtttatct gaacgagaac ctgacggatc 780 aagctcggtc gatcaagggt gcggcactca aactcaagaa gtcaggaaag cttcacagcg 840 tctacaccaa ggatggattc atcttcgtga aaccggcgga ggggggcgaa gcgaagcttg 900 tgaattcgtt ggagcagttg gcggagtaac ctttcctagt aaacacccgt tcctaccacg 960 acaacttcct tgactccttt ccttcttccc cgtgttatct tcccctcctg aaagttcggc 1020 tatgaatata gcascaacgt tmaacctatc cagatttgct tttctttttt ccttccaagc 1080 tagtccttgt ttccgatccc tcgttatcct tgactccatc cttcctgaaa gtcttgttcc 1140 ggctcgtaaa tcaacttcca gctcctcgat ttgatcgagt tgagctcgtt gctgktgatg 1200 attcgttgct gctgctgccc gttagaatcg ttgggatcac tcctcggctg gaccgagtgg 1260 agtccgttgc tgtcgctggg tcgttggatc tgctggatgg accgaatckg ttggatcatg 1320 ctgctgcgcc gtgtcgttgg tgattaaaca cttatatcac actttgttga aacctttttg 1380 awtatttcca ttcgaatcac aaagagcatg tgtagggatc taaggttcac tttatttgga 1440 ttktttttat cwtgaactca aacttgtttt tcttagtctt agttattttt aktttgtcct 1500 cattaaccat ggtgatgcat ttttcaattc cgacttttta tcttttaact aatgctggat 1560 aatagtagtg ccaccccttc acaattaaac tgttttattc cacgtgtcgt gatgaattgt 1620 gctctcgact ccagtaaatt gaatatttgt catctcaatg ctcaaagtat ttgtgctcga 1680 cagttgagca agttcaatga atttaaaatt tgctttagca acagtaaagt aagcataata 1740 tgcgtaactg agtcgtggct aaccgacaac atacccgatg aactaatttc tcttgaaggt 1800 tataatctaa taagaaatga tcgaaaatgc gctcgtggtg gaggaattct tgtgtactat 1860 aaatccaacc taacctgtaa aatwttgact ttwtctaaaa atgatacatt gaacacaact 1920 gaatwtttac tgattgaaat tcattgcaat agagaaaagt ttttgctggg tgttttttac 1980 aatccacctc gtgttaactg ttctgaaatt gtgtcctctt tattttcaga attttcwact 2040 agttatgaaa atgtcgttat tgtaggcgat tttaacaaat caggaacact ttcmgataat 2100 ttcttgaatg ccatagatca tcttggaatt tgttgcgtka attctttccc gacacacttc 2160 cacagtggag gttgttctct tttagatctt ttttwaacaa ataatacaaa ttttgttcac 2220 aattttaatc aagtctcagc tggtggattt tcaaaacatg atatgatttt tgcatcgtta 2280 aatctaagtt cttttgaaga tgaaatatat tacttcaggg attacaataa tgtcaattta 2340 cctgccttaa ttgaagcagt taattccatt aactggtcac aattattttc tatgmgtgat 2400 ccggacattg cacttgactw tttcaatcaa catattcgta acttatttaa ccgttttatt 2460 caccttcgca aatatgtacc tagaaagaat tcctggttta acaacaatat tatgaaagca 2520 atgatcgaaa gagatctcgc atataatata tggaaaactg ataagaattt tactaattac 2580 agtcagttta agagattgag aaataaagtc acacatctca tcaatgaggc taaatctaat 2640 tatgttacat ctcgagttga ttcttcagtg tctagtaaag acctttggaa caagttaaaa 2700 caactaaata tttctaataa atcaaaacct gatgtgaaat tccccaactc taacgcagaa 2760 atcaacaact attttacctc aaattttact caagatttgt cacaccctcc aatattacca 2820 aggaacaatg aaggctttcg tttttcagag gtaaatgtta atgatataat taatgcaatt 2880 aattcagtga agtcaaatgc agtaggtatg gatgatattc ctatccgttt tattaaactt 2940 attttgcctg aaatcattca taaaataact ttcattttca atatgatcat attatcatca 3000 aaatttcccc gagcttggaa aatgactaaa gttataccaa tcagaaaaaa gtctagaagt 3060 cttgatttga acaatttacg cccgataagt attctttctg cactttcaaa agtctttgag 3120 aaagtattga aaatccaaat ccaaacctat ttggacaatt ttaatattct taacagtaat 3180 caatcaggat ttcgaaaaaa tcataatacg acttcggcta cagttaaggt actagacgac 3240 attcttagtg taattgacaa aaagggaaaa gctttttcgg tctttatcga ttttgcaaaa 3300 gcatttgatc gcgtgtcaca tgttaaatta attcaaaagt tatcactgca attcggattt 3360 tcatctggtg ccacagaatt gttacaaaat tacttatgta acagaaaaca agttgttttc 3420 gcaaatggcc ttttttctga ttcagccagt attatttcag gcgtacctca aggttctatt 3480 ttagggccgc ttttattctc cttattcatt aatgacctgt gctccgttct aaatcattgt 3540 agaattcaca tgtttgctga tgacgtccag atttacatag cttctaatag gctgtctatt 3600 ccacaaatgg cgcaattatt aaatgaagat ttagctaaca tttatgactg gtcatgttct 3660 aatcttctac caattaactc atccaaaact aaagttatgc tcttttccag aaacagctca 3720 gaacatcagc agcctttaat aatgctcggt caaaacgttc ttgagtatgt gcttaaatat 3780 ccaagtcttg gcttcattct aaaaaatgat ttagaatggg atggtcatgt gaatgcgcaa 3840 tgtagaaaaa tctatatcgg acttagaact ttaaagctct cagcaagtat gttaccggtt 3900 aatgtaaaac tcaaactctt taaatcccta ttgttgccac atttcatgta tggtgatgtt 3960 ttgctattaa atgcttcatc tgcttctttg aacaggttaa gggtagcttt aaacgcgtgt 4020 gtacgatttg tatacaattt aaatagatat tcgagagtga c 4061 // ID piggyBac-1_BF repbase; DNA; INV; 3758 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-1_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3758 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3758 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-3758 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-1_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX FH Key Location/Qualifiers FT CDS 747..2669 FT /product="piggyBac-1_BF_1p" FT /note="TPase and the C5HC2 zinc-finger domains." FT /translation="MATRERLARYTAAEALRQILNAQSDDENSSVDEDESD FT LDVEDHLSDASDHSDTESVQEEDHPREEIRQEADRGLGRARGRGRGRARGL FT GARGRGRARGARGRGRGRGRAPINNPAVNQDAGNENPVVDRPYQGKNGTVW FT DQNPPPLGRRRNQDIIRNPPGITNAARCVDIQSAFSLFVTPAMIDLIVQQT FT NREARRKHREWNDNQENERQEEWCRVDGTEIRAVIGLCIIAGLYQSNHEPQ FT ASLWSTADGKPVFPATMTRERFRNILKYMRFDDRETRAERRATDKLAAFRD FT IWEMFIAQLPKYYIPGTDLCVDEQLVAFRGRCAFRQYIPSKPAKYGLKIWW FT NCDATTCYPLKGEVYLGRQPGEQREIGQGARVVKELTYLWRRTGRNVTADN FT FFTSIPLAEDLLEDGLTYVGTIRSNKPHIPEVMKAANHKEVNSSMFGFQDQ FT LTLTSYVPSKGKAVLVLSSMHHDANIIGEQQKPEIIMHYNATKSGVDNLDH FT LATMHTCRRKINRWPMVLFFNLLDVAGIASLIVFLGNFPDWNLSACSRRRR FT LFLRELGYNLVMPHVQRRAQSPYLMPATRESMARIGVRSVRRAEEPAAAAA FT QEGRKRCELCPRSSDRKVRRACAGCGRTVCAEHSYKQYVCGDC" XX SQ Sequence 3758 BP; 1152 A; 781 C; 825 G; 1000 T; 0 other; ccctcaaaca cccgagtggg gtccgtttgg accccaggcc tatattttga tgtgccattt 60 gaaaattttg catcggaaca aaaatccctt ccgtgacttt gttcataaac ctcttctcca 120 acttctccta atttttttcg gagatcgaaa tgattttgta aaagttataa agggaagtct 180 gaggtatgtc cgactggggt ccaaacggac cccagaaata tgtggatttt tcaacataca 240 gaagtgtttg gctgtctttg atatgtcccc cagcaagcat aattgcttca atatcatgca 300 gtttgccaag aaaactcatc cctaggaatc aggttatgag gtcatccgag gtcatgacct 360 ttttatgacc tctgagggca aaggtcaacc acatgtccac ctctgcaggc cttgcctctt 420 aggtaagtta tctctttcaa tccatgataa ttctctgcct ttgtaaacca tcctttgaat 480 ataacattag tacaatgaag ctagatatgt gctgataaca cgtacttgaa aatttcatgt 540 tttattcaag tttggagtat atttttttca tcctatatat gcagtgtcga agcatactgt 600 ggggaggggg gcactgcctt agctccggta tatttgtgct gggctttaca aatgtttaca 660 ataaccgaac cgagacaagt ctattttaca catccccaca tattgtcact tctaagaatg 720 gcatcttaac atttgacagg tacgccatgg ccaccagaga gagactagcc agatatacgg 780 ctgcagaggc cttgcgtcaa atactcaacg cccaaagtga cgatgagaac agttcagttg 840 atgaggatga gtccgattta gacgtcgaag accacctgtc agatgcgtca gaccattctg 900 acacagaatc tgttcaggaa gaagaccacc ccagggaaga gatccgtcaa gaggcggata 960 gaggtctggg aagggctcgc ggaagaggtc gtggacgagc cagaggtctt ggagctagag 1020 gacgtggacg agctcgtgga gccagaggac gtggaagagg gagaggccga gcaccaataa 1080 acaacccggc agtaaatcag gatgcaggga atgagaatcc agtagttgac aggccctacc 1140 aaggaaaaaa cggaacagta tgggaccaaa acccccctcc tctcggcagg cgccgaaatc 1200 aggacataat aagaaatccg ccaggtataa ctaatgctgc taggtgcgtg gacatccagt 1260 cagcattttc cctctttgtg acacctgcta tgattgatct tattgtacag caaaccaaca 1320 gagaggcgag acgaaaacac agagagtgga atgacaatca ggagaatgaa agacaagaag 1380 agtggtgccg agttgatgga acagagataa gggcagtgat tggattatgc atcatagcag 1440 gtttatacca gagtaaccat gagcctcagg cttctctctg gtctacagct gatggcaaac 1500 cagtcttccc tgcaactatg accagggaaa gattcagaaa cattttgaag tacatgcgct 1560 ttgatgacag ggagacccgt gctgaacggc gagctactga caaacttgcg gcgtttaggg 1620 acatttggga gatgttcata gcacagttac caaagtacta catccctggc acagatctct 1680 gtgttgatga gcaacttgta gctttcaggg gaagatgcgc attcaggcag tacataccat 1740 ccaaaccagc taaatatgga ctgaaaatat ggtggaactg cgatgcgaca acatgttacc 1800 ccttgaaagg agaagtgtac ctagggagac aaccagggga acagcgagag ataggccaag 1860 gggcaagagt ggtaaaagag ctgacatatc tttggcgcag aacagggaga aacgtcacag 1920 cagacaactt cttcacttct atcccactag ctgaagatct cttggaggat ggacttacat 1980 atgttgggac aataaggtct aacaaaccac atattccaga agtgatgaag gctgcaaacc 2040 acaaagaagt gaatagctcc atgtttggct tccaggacca actgactctc acatcatatg 2100 taccttccaa aggcaaagct gtccttgttc tgtcaagcat gcaccatgac gcaaacatca 2160 taggcgagca acaaaagcca gagattatca tgcactacaa tgccacaaag agtggtgtag 2220 acaatctgga tcatctagcg acaatgcaca catgcaggcg caagataaac agatggccaa 2280 tggtgctgtt tttcaatctc ttggatgtgg caggaattgc ttctctgatt gtcttccttg 2340 gaaacttccc agactggaac ttgtctgcat gcagccggag gcgccgcctc ttcttgaggg 2400 aacttgggta caacttagta atgcctcatg tccaaagaag ggcacagagc ccatatctca 2460 tgcctgcaac tagagaatca atggctcgca ttggtgtaag atctgtccga cgagcagagg 2520 agccagcagc agctgctgca caagaagggc gcaagagatg cgagttatgc ccacgcagct 2580 ctgatcggaa agtcagacgt gcttgtgcag gatgcggaag aacagtttgc gcagaacaca 2640 gctacaagca atacgtatgt ggtgattgct tgtagctttg ttctacaaac atcgtcttct 2700 tgcactttgc acttagcaca ggtagaagca attgtagtgt tgcctgatag agaagaagca 2760 gatattcacg tggtcaggtt acccatcgca atgtccaatg aattctgtac aatgacattc 2820 ttatatttat atgtcttatt ttatctgcat acttgctttt tcatgctaac ctccattttt 2880 atcagtgtaa atacagtaat gccacttttg tttctaaaga agtagctaca gtcactttgt 2940 atattattag gattacgatc atcggtacca gtctgtatgt gctcagctaa ttataactgt 3000 gatagcatag cataatcagc tcttatatac tttacaatca acttatattc taagattcct 3060 tattcaataa ctagtttttg atacttgttt cataataggt aacatataag tatgtttaga 3120 catgtcaggg acgaaatttg tgggaaaaaa cgcaaaattg ttagtaaaaa aatgtggtcc 3180 tcctccacat atttgcataa attatgataa ttagctgaaa ttattcacac ctaaacaagc 3240 caaaattttt catacacaac aaggacacta ttgtttttga agaagtagac acaaatcagt 3300 ttgtatacta ggatttagat catgggtacc agtctgtatg tgctcagcca ttcctaaatg 3360 tgttagcata atcacctcat atatctatac ttatattcct tacttttaac ttatattcaa 3420 acattccttg ttcaataact aacatttgat acctatttca taatagcaac atatgaatat 3480 ttttgtagac atgtaaagga cgaaatatat gaaataaagg taaaatttgt tgttaaaaat 3540 ggggtcctcc ttcacttatt tgcataaatt atgataatga gccaaatata ttcatgtttc 3600 aacatgccaa aatatttctc tcactatgac gaagcaatgt atatgcaaag attacccaaa 3660 caatgcaaat tagatgcatt tattaacgaa tcaaaatggg gtccatttgg accccactcg 3720 gtcattttag tcgcaaaaaa aggtcgggtg cttgaggg 3758 // ID CR1-20_HM repbase; DNA; INV; 4249 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-20_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4249 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1848-1848 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1082..1696,1686..2177,2128..3168,3141..3830) FT /product="CR1-20_HM_1p" FT /translation="MSENSLYNLVSYKMISQPRVNSKGGGVAVYILDKFEF FT KKNNTFSKTNKHIETLCIEVLNTSGKSFLISSLYRPPCGKQVNFLNILKXF FT MSQMSKQNKHVFFAGDTNMDALSYEKFANIKNFFNLLFEFNIIPTIFKPTR FT ITNQSSSVIDNILTNCSLNTKFESGLFIADFSDHFPVFHIAREVLSNIDNK FT KCFVTYRNLSIKKLKKNLNNFKNKLQVEQWDKVYSSTDPNIAFDHFSSTFQ FT NIFDNVCPKITTEIKNKEYKNPWMTITLIKASKRKQKLYVKFLKSKNKEDE FT KIYKNYKKFFQHSMKEEKIRYFSNQLDKHKFDIKRTWAVINELTGREKKKH FT TCTCLPQKLISDKKNYNKYNKNMSRIFPIKKIITSTTKICQEFNKYFVNVG FT PSLASNIVQPNKTFDNFLGEKPTTSINNEKITKLEFNKAIIELKRNKSCGY FT DGISSNVAIYIMDSISKPLFYLISSSFENSIFPNKLKLAKINPILKKGDCN FT NVSNYRPISLLPVLSKVFERIIYNRVNNYFTLNNLLYKNQFGFQSKCSTEH FT AIIKLADKIFKSFDSDELVLGVFVDLSKAFDTVDHSILLSKLKYYGIIGST FT FKWIQSYLSNRKQFVLLEKSGALDITCGVPQGSILGPLLFLIYINDMHKAS FT QKISTIMYADDTNLFFNHPXAKTLFETMNIELEKFNLWFIANKLSLNCQKT FT SFTLFHKKKNNQLISLKKKQSINLPLKLPKLLINKNEIKREKCTKFLGVLF FT DENLSWRNHIGLLETKMSSVIGVLYKSRPFLDSKSRNMLYFSLVHSQLLYA FT NIVWASTHKTKLLKLQSLQNHACKVINYLNRLVNPAPVMKSMMVLDIEKLN FT TLQILIFMYKYKNNLLPSVFSDIFLISFSEKYNLRSNTRYNYHLGKIKSHQ FT SNFLITNRGPSIWNHFQNKNIKDLKSISSFKRETKMHLLNS*" XX SQ Sequence 4249 BP; 1647 A; 594 C; 591 G; 1395 T; 22 other; aacatacaaa atggataacg ataaagtaat tacaatggga atgcttcgag aactcatgtc 60 aatacaagga gatacaattt tcaaatgctt taaggaaagt tcggtcagtg tcctggaaaa 120 agttgagaac ttagccaaag rcattgaaga aataaaacgt gggcttactt ttgttggtga 180 tcaaattgag acaaagtaag caaggttgat agtaaaatgg cagaaattaa agaaactatt 240 gtgtattaaa tcattccacg gaaaattaag cattgaaaat aaaakaaaaa ctgtcgatct 300 tgaaaccgac gtaacaattt aagaattgtt ggaatcaaag aagaawataa cgaatctttt 360 aatgatgttg taraaaaagt aaacctacta attaaagaag atctggaaat tgcgggtcya 420 gttattatag aaagagcrca tcgtgtcggt aaatttagtg gaaagcctag aactattgtc 480 tgcaaaatat taaactggca ggacaaggaa aacatcttat ccaacgctcg taaactcaag 540 gggaaaaata tatatattaa cgaagactat agtgatgaaa caatgcgaat tcgtaaagaa 600 ctttttgtac aagcyaaaat acatagtggt aacggaaaat acgcgaaagt catttacagt 660 cgattagtgg taagagaaat gataaacgag aacgaaggtg ttgaaagyaa gtaatttatt 720 tatttacgtt ttaatagttt tattagatat taaattttaw ggtttttatt taaggataat 780 attaacaatg aatatrtnta aaacaattca taayaatgaa gcaaatttaa caaaattaat 840 atctccaatt tttgaaattt tgaaatataa caaaaatgat tayaattttt tgactgaaat 900 agatattaaw tgcttaaata atgaatacta taaaccggaa gaatttaaga gaaaaaaagc 960 ttaacaaaga tttaaacatt ttaagtataa atatacggtc aatgaacatt tttattattc 1020 tttaaattat ctttttgacg taatatgtat atcagaaact tgggaagatg caaacttgcc 1080 catgtcagaa aattccctat acaatttagt ttcatataaa atgataagtc aaccacgtgt 1140 aaacagcaag ggaggtggcg ttgcagtgta tattcttgat aaatttgaat ttaaaaaaaa 1200 caacactttt tcaaaaacaa ataaacatat tgaaactttg tgcattgaag tgttaaatac 1260 ttctggtaaa tcgtttttaa tctcctcatt gtatcgaccg ccatgtggta aacaagttaa 1320 cttccttaat atcttaaaaa mttttatgtc tcaaatgagt aaacaaaaca agcacgtttt 1380 ttttgccgga gatacaaata tggaygcatt aagttatgaa aagtttgcaa acataaaaaa 1440 ctttttcaat ttactatttg aatttaatat aatacccacc atatttaaac ccacgcgcat 1500 tactaaccaa tcatcttctg tgattgacaa tattttaaca aactgttccc taaatactaa 1560 atttgaatca ggactcttta ttgcagactt ttctgatcat tttcccgttt tccatattgc 1620 acgcgaagta ctatctaata tcgacaataa aaaatgcttt gtgacttaca gaaatctttc 1680 tataaaaaaa cttaaataat tttaaaaaca aactgcaagt agaacaatgg gataaagttt 1740 actcatcaac agatcctaac attgcgtttg accatttctc aagcacattt caaaatatct 1800 ttgataatgt atgcccaaaa ataactacag aaataaaaaa caaagaatat aagaatccat 1860 ggatgacaat aactttaatt aaagcatcaa aaagaaaaca aaaactttac gtaaaatttt 1920 tgaaatcaaa aaataaagaa gatgaaaaaa tatacaaaaa ttataaaaag ttttttcaac 1980 atagtatgaa agaagaaaaa ataaggtatt tttctaatca gcttgacaag cataaattcg 2040 acataaarag gacttgggct gttataaacg aactaacagg aagagagaaa aagaagcaca 2100 cttgcacttg cttaccacaa aaactaattt ccgataaaaa aaattataac aagtacaaca 2160 aaaatatgtc aagaatttaa caaatatttt gttaatgtag gcccctctct tgcttctaac 2220 attgtgcaac caaataaaac atttgataat tttttgggag aaaaacccac cactagtata 2280 aataatgaaa agataactaa acttgaattt aataaagcaa ttattgaact taaacgcaac 2340 aagtcatgtg gatatgatgg tatttccagt aatgtagcta tttatatwat ggacagcatt 2400 agtaaaccat tattttattt aatatcatct tcatttgaaa atagtatatt ccctaacaaa 2460 cttaaattag ccaaaattaa tccaattctc aaaaaaggtg attgcaataa tgtttccaac 2520 taccgaccta tttctcttct tccagtactt tcaaaagtat ttgagagaat aatttataac 2580 agagttaaca attactttac attaaacaat ttattataca aaaatcagtt tggatttcaa 2640 agtaagtgtt ctactgaaca tgctattatt aaacttgctg acaaaatatt taagtcgttt 2700 gatagtgatg agctggtatt aggagttttt gttgatcttt ccaaggcctt tgatacggtt 2760 gatcacagta ttttactttc caagctaaaa tattacggta taataggctc aacatttaag 2820 tggattcaaa gctatttgtc taacagaaag caatttgtat tactagaaaa gtcaggagcc 2880 ctcgatatta cgtgtggagt tcctcagggt tcaatcctag gtccattatt gttyttaatt 2940 tatattaacg atatgcataa agcatcccaa aaaatatcta caattatgta tgctgacgat 3000 acaaatttat tttttaatca tcctratgca aagactttgt ttgaaactat gaatattgaa 3060 cttgaaaaat tcaacctctg gttcatagca aataaattat ccctaaattg tcaaaaaact 3120 agttttaccc tttttcataa aaaaaaaaac aatcaattaa tctcccttta aagcttccta 3180 aactgcttat taataaaaat gaaatcaaac gtgaaaaatg tacaaaattt ttgggagtgc 3240 tttttgatga aaatttgtca tggagaaacc atattggtct ccttgaaaca aaaatgtcct 3300 ctgtaatagg ygttttgtat aagtcgagac cttttctcga tagtaaatcg cgtaatatgc 3360 tttactttag tcttgtacat agtcaacttt tgtacgcaaa tattgtatgg gccagcaccc 3420 ataaaacaaa actgctaaaa ttgcaaagcc tgcaaaacca tgcatgcaaa gtaataaact 3480 atttaaatag gttagtaaac ccggccccag tgatgaaaag catgatggtt ctagatattg 3540 agaagcttaa cacattacag atattaattt ttatgtataa gtataagaac aacctcctac 3600 caagcgtttt ttcagatatt ttcttgatat cgttttctga aaaatataat ttgcgttcaa 3660 ataccaggta caactatcac ttagggaaaa ttaaatcaca tcagtcaaat tttttaatta 3720 caaaccgtgg cccatcaata tggaatcatt tccaaaataa aaatattaaa gacctaaaaa 3780 gcatttcatc gtttaaacgg gaaactaaaa tgcatcttct taattcctga catatatttt 3840 atagtttaca gtatttgttt tcagtatttt tttatttttt tatttcttct tcttcttatt 3900 tttcttgttt ttaatatttt ttgtttttga atttgttaca ctaaaaaaat atgagctttg 3960 tttttattga atttttattt tatttttaaa aattaaattg tttttataat gagtgactat 4020 aggggctcta tgaaaaggtt gtgatgttaa acacatcatc cttaccttct ttgagtccct 4080 gactgattag aacagaatat acataatata aaataaaaaa ttatgaaaaa acgtttttta 4140 tcatgtttat ctttacgacg tgtattttat gtatttttta catgattata tgaaaattgt 4200 tattgtaact gtttaatcag caaaataaaa gtwtatataa taataataa 4249 // ID BEL-51_CQ-I repbase; DNA; INV; 3166 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-51_CQ_; KW BEL-51_CQ-LTR; BEL-51_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3166 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 255-255 (2011). XX DR [2] (Consensus) XX CC 'TAACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 622..3165 FT /product="BEL-51_CQ-I_1p" FT /translation="MDSTTKKKLTDAFRQRYRTERKVIFVQELLLGLYEPT FT LAQLTALYEFLVKSYREQSRHHLQVIGLIPDECLAAQEAEFEKYDALFYEV FT ATALAELQIEAEQTVPSATTPIAANRQRLQAEAWLKQMLKAWESNQVDIRD FT SGIETQAPESQEVVLRDCALSKPKPLQEQAEVNRKHFVEPPKQDPVGHRVS FT AGILEDAVAITTKHDEIVNAEPLLPATELIRDQHNEASHSCVPWDSSPEVT FT SRLVKVPSVVKHEALVLRSVHLDGKSSAVFDTSSKPAKPTNPQGHHRANQD FT GGFIPGKDPLVPSEVKAPDKLEQIHTTPNSQPQTFDAHPSGRQTVRIISRT FT MPHSCVPISRHDPFTPTLTGADQSKPEANRCSRIVHSVTAFSTTTVDFYTT FT LKHWNPAKALSQRHACLAMFVTSAPNKRFVVNAPDRTEDLAHKRSAGRQTL FT RHENVLQEPRPIFIQQPLKDRSKSDFNRSQALYEFCKPRKRLLRILDLKVR FT NVRKRRCNSVGPPPSQLCSSPDPEDKPPDDLTTRGAPVEDQTIVCILTHPW FT DTGTKSPFIVTAESDQVQPLQQEVVQYSDDEANQRRVHSDSGPQTKVRIRK FT HTQNQNHRVQPRQMIVPNSVSPGNSAFHVNEQRRPEDLGVTGELQVKCSRS FT SHSWSYAERGSSNVFVVHQSRKPSHKPTSNLSNLTPYKFSNVDTSRRVPNI FT RITNRLSTTWLESSASPPETRPLLIALEPSQPVEMKLVIAILRIQGSCHEP FT PSQDELFQVRIQPRRFQRTVDRWRRLWHPTVPPDELSRDPKSAQPSKRKYN FT FRGIHAILPAIEDVGERREPADQSAAIHRKRCVGLTKELRDLNGGS" XX SQ Sequence 3166 BP; 855 A; 984 C; 773 G; 554 T; 0 other; tggtctcatc gaaccggatt tgaaccccgc ccaagcctcc aaaagtgaaa tcactggatt 60 caaccggtac tagtccagcc gttccggaag cgaccggaat acgtcaatca agcactcctg 120 tacaagttcc ccaccccgct gccccatcgc agctcgacgt ttccgagcga ctgtccagcc 180 gtggaagtca tcggaagtcg gattcccgga agtgaactcg cgcggatgca caaagccggc 240 gagttgttcc tgcccaagtt cccctgctgc cccatcgcag cttgaccggt tccgagcaac 300 tgtcccaccg cggaagttgt ctgaattcgg attcccggaa gtgtccaccc taggaacttc 360 ctccaagcag ttcacccgac gaactgctct cgcgcggatg tacaaagccg ttgcgtcatc 420 ccagtccaag cattcccgga acaacccatt ccgagccgaa accgcgtaca aactagcttg 480 tgcgtgtgtg tatgtgtgtc tccgccaaaa agtgtcatcg tccagccgag gaggcgaact 540 ttccccgacc aaacagcagc cactgtccga ccgataacag gagtctcgac gttgacttcc 600 ttagaccaaa aggaaaaagc catggactcc acaacaaaga agaagctgac ggatgcgttc 660 cggcagcgct accgaacgga acggaaagta attttcgttc aagaactgct gctgggtctg 720 tacgaaccga cgctggcaca gttgacggcg ctatacgagt ttttggtgaa atcgtaccga 780 gagcagagcc gacatcacct gcaagttatc ggacttattc cggacgaatg tctcgccgcg 840 caagaggccg aattcgagaa gtacgacgcg ctgttctacg aagttgctac agcgctagca 900 gagttgcaga ttgaagccga acaaaccgta ccaagtgcaa ccacgccgat cgccgccaac 960 agacagcgcc tgcaagcaga agcatggctg aagcagatgc ttaaagcctg ggagtccaac 1020 caagtggata tccgtgattc aggcatcgaa acccaagcac cggagagtca ggaggtggtc 1080 ttgcgagact gcgctctttc aaagcccaaa cccctccagg aacaagcaga agtgaaccgg 1140 aagcatttcg tggaaccacc caaacaagat ccagtaggcc accgtgtgtc tgctggaatc 1200 ctcgaagacg ccgtcgccat caccacgaag catgacgaaa tcgtaaatgc cgaaccactg 1260 ttgccagcaa cggaacttat ccgagaccaa cacaacgaag caagtcacag ctgtgtcccg 1320 tgggacagca gccctgaagt gacgagcaga ctggtgaaag tgccatctgt cgtcaaacac 1380 gaggcacttg ttctgcgtag tgttcacctg gacggaaaat cctcggcggt gttcgacaca 1440 agttcgaaac ctgcgaagcc aaccaatcca caaggacatc accgcgccaa ccaggatggc 1500 ggtttcattc cgggaaaaga cccgctagtg ccgtcagagg tgaaggcccc ggacaaactc 1560 gagcagatcc ataccacgcc aaacagtcaa cctcaaacct ttgatgccca tccaagcggc 1620 cgtcaaactg ttcgtatcat cagcagaacg atgccccatt cctgcgtgcc aatatcacgc 1680 cacgacccat tcacacctac gctgaccggg gccgaccaga gcaagcctga agccaacaga 1740 tgttctagga ttgtccattc cgtaacagcg ttctccacaa ccacggtgga tttctacacc 1800 acactcaagc attggaaccc ggccaaagcg ttgtcccaaa gacatgcttg cctggccatg 1860 ttcgtgacca gtgcccccaa taagcggttt gtcgttaacg caccagatcg aaccgaggac 1920 ttggcacaca agcgaagtgc tggaagacag acactacgac acgaaaatgt gctccaagaa 1980 ccacgcccca tcttcattca gcagccgttg aaagaccgca gcaaatcgga cttcaaccga 2040 agtcaagctc tttacgagtt ctgcaagcca aggaagagac tactgcggat tctggacctc 2100 aaggtgcgaa acgttcgtaa acgacgatgc aactccgtcg gacctccacc atctcaactt 2160 tgcagttcgc ctgacccaga agacaagcct cccgacgacc taacaacaag aggcgcgcct 2220 gtagaagacc aaacgatcgt atgtatcctg actcatcctt gggatacggg aacaaaatcg 2280 cccttcattg tcactgcaga atcagatcag gtgcagccac tgcaacagga agtggtgcag 2340 tactccgacg acgaggccaa ccaacgccgt gtccacagcg acagcggccc ccagactaaa 2400 gtccgtatcc gcaagcacac tcaaaaccag aatcaccgcg tccaaccgcg gcagatgata 2460 gtcccgaaca gcgtatctcc gggcaacagc gcattccacg tcaacgagca gcggcgtccc 2520 gaggacttgg gtgtcacagg agaacttcag gtgaagtgca gccgatcatc ccactcctgg 2580 tcctacgccg agcgtggatc gtccaacgtc ttcgtagtcc accagtcgag aaagccaagc 2640 cacaagccaa caagtaatct gtccaaccta accccgtaca agttctccaa cgtagacaca 2700 agccgtcgtg tgcccaacat ccggatcacg aaccgcctga gcaccacttg gttggagtcg 2760 tcagcatcac cacctgaaac cagaccgctc ctgatcgcac tagagccaag ccaaccagtc 2820 gagatgaagc tcgtgatcgc aattctccgc atccaaggta gctgccacga accgcccagc 2880 caagatgaac tgttccaagt gcgtattcaa cccaggcgtt tccagcgaac cgtcgatcgt 2940 tggcgcaggc tctggcatcc gacagtccca ccggacgaac tgtcacgtga tcccaagtca 3000 gctcaaccca gcaagcgcaa atacaacttc cgagggattc acgccatcct accggccatc 3060 gaagacgtcg gcgagcgacg agaaccggcg gaccaatccg cagccatcca ccggaagcgt 3120 tgtgtgggcc taaccaaaga gttgagagac ctcaacgggg ggagta 3166 // ID SMARN2 repbase; DNA; INV; 1122 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of putative non-autonomous Mariner-type family DE of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW SMARN2. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1122 RA Jurka J.; RT "SMARN2: Non-autonomous Mariner-type element from freshwater RT planarian (Schmidtea mediterranea)."; RL Repbase Reports 7(9), 999-999 (2007). XX DR [1] (Consensus) XX SQ Sequence 1122 BP; 482 A; 136 C; 126 G; 378 T; 0 other; tacacctgat tcagttcatt aaaaactcct ataataaaat gaaaaaatca ataaagttat 60 aatataggaa catacatact tacccgtgtc tgcatgtaat ttgttcaact gaataaaaac 120 ttatattata ggaaggaagg aattagaaga aaaatttttt ttaacgtaaa aacataatac 180 atagtaaata atgcatgcaa actttcaaaa tgtagaaatg ttcagttcaa taaaaactct 240 atataaaact atttttagta actttttaga caagtttttt tttaaattaa agccgaaaat 300 ttttgaaaaa aataccaata actctacttt gagttgtaga gaaaattaag aaataatttc 360 gtaagatttt tcatgaaatt caatcaattt aatattttta caacttttca gatatttcaa 420 aaatatccat cgaaactcta ggtttttgag tttttacctc atttgcaatt taccaggagc 480 agtggaacca gaaccttcaa atgaggagtt taaaaaatac ggaaaacata agttttccga 540 taacttaagt taaaaaaaat ttttttataa agtataaaaa atccaaatca ttaataccat 600 tgaaaatgag gaagattttt ataaatttgg gaaattaaca gaaatttcca aagcctcata 660 aaatcaataa ttatcaaaaa tttggatttc tgtcaaaagg taaaaaagat ggcagccaaa 720 acaaatttcc aaggaacaga ggaatagttt attcgaatat agaaactttt taataataaa 780 ttttatttta ataaatttta gttttcaatt aatcggactt acatttattc gaaaataatt 840 ccattaattt acataaattg gcataaattc ctaaaaaaaa cctaaaaaaa ttaattaaaa 900 ctaaatttat aataaatgat gatgaaaagt attatttaag agcttgaata atgatctcat 960 aaatatacga gatcacaatc caagccctaa aatatacttt attaaaaaat ttacctgtca 1020 gccattttaa aaatggctaa ttcataaact taaccataat tgtaattggt tgttcattaa 1080 aacttctatt atagaagttt ttaatgaact gaatcaggtg ta 1122 // ID Gypsy-267_AA-I repbase; DNA; INV; 5496 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-267_AA_; KW Gypsy-267_AA-LTR; Gypsy-267_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5496 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [2616-3116] - Reverse transcriptase CC Positions [4250-4720] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1030..2403 FT /product="Gypsy-267_AA-I_1p" FT /translation="MALNGLTTMIEPYRKGSSFCDWVERLAYCFQANAVSD FT EMKKAHLITLGGTVLYSELKLLYPNGALDTAVYADIVTKLKARLDKIEPDV FT IQRAQFNSRIRQAGETLEDFVLALKLQAEFCGFGDFKNMAIRDRIVAGVND FT KALQQRLLNEEKLTLETAEKIITTWQIAETNAKALGSRENSWDQIASLKMA FT GANAVGTSMRKLARTYELARQGENSRGSVKSRLGYRQWPVRQGVYQRQPTD FT WGKSRSLNTGEWRNPEHRRPSRFANFTCDYCGIKGHIKKRCFKLKNWKRDA FT VNMVEDQQAPAVDEDLHDMINRMTAKDSDTEDDDMDVGESENEEFECMNIT FT SINKINNPCLLELTIEGKLLSMEIDCGSAVTVINKRQYFSLFDKPLQKCNK FT NLAVVNGSKLDVLGEAKVFVSFQGKEAILKLLVLDCCNNFIPLLGRPWLDV FT LFPNWRIFFLIKL" FT CDS 2415..3389 FT /product="Gypsy-267_AA-I_3p" FT /translation="MVIDNNEALISDLKQQFSKVFVKDFSTPIVGFQADLV FT LKTEQPIFKKAYDVPYRLKDKVDDYLDKLEREKVITPIQTSQWASPVIVVI FT KKNNQIRLVIDCKVSINKMIIPNTYPLPTAQDLFAKLAGCKLFCALDLEGA FT YTQLELTDRSKKFMVINTTRGLYTYNRLPQGASSSAAIFQQIMDQILHGID FT GVYVYLDDVLIAGKNFEDCLSKLRIVLAKLSKANIKVNLDKCRFFVKELDY FT LGHVISEEGLKPCPDKISTIKNANVPKNVSELKSFLGLINYYQKFIPHLST FT KLFHLYNLLKKDVKFVWDSKCSKAFEDSKNALL" FT CDS 3536..5269 FT /product="Gypsy-267_AA-I_2p" FT /translation="MSKIYPILHLEALALVCTIKKFHKYLYGKKFLVFTDH FT KPLVGIFGKEGRNSIFVTRIQRYVLELSIYDFEIRYRPSSKMGNADFCSRF FT PLPQSIPNELRTDFVRSINFSKNVPLDHEKVANATKNDEFMCKVISFMKHG FT WPDRLNKHFANVFANQHDLEIVDECLLYQDRVVIPAVLQPHVLNLLHGNHA FT GIVKMKRLARQSVYWFGINSQIEEFVKKCDVCNSMATVSKQKIDSEWIPTT FT RPFSRVHVDFFHFSSHTFLLIVDSYSKWVEVEWMKTGTECGKVLKKLVAFF FT ARFGLPDVLVSDGGPPFNSFAFALFLENQGIKVLKSPPYHPASNGQAERFV FT RTVKDVLKKFLLEPEHCELELEDQINLFLINYRNNSLTADGHFPSERVFAY FT PPKTLIDLLNPKKHYRKFLLKPAKTSNEDNLLEVVSRTKRKNPLDDLMEGD FT VIWYQNHKPDHPAKWLKATYLKQISQNLFQVLIGNVVATVHREQIRTSGGS FT DREMRPNVVLTKQRAIDVTDGPGNRVSERLSDDVNKNRSDKYYQEEKFSGK FT RKKPADWVDEPPRRSKRNRKEKRDDAFVYIK" XX SQ Sequence 5496 BP; 1716 A; 948 C; 1216 G; 1615 T; 1 other; agttgacgac gaggcaaccg gacatcaata caccgtagcc gtaagtaaat gctgtcagtt 60 gtgaggcagc tcagttgaat tgtagacggc ttggtgaacg gtgcgaaacg cgtgcaatcg 120 gcaaaacctc cttttaaccg gcgattggtg agcgtccgga cggaaagttc acggaagtta 180 tttttttgtt ctcaaaagaa aattcgatcg cggttctttt cgggtattac aaagtaagta 240 atttacaaat ctaccttcag agaagaaata agcagtaatc gtcgagtttt tgacctaatt 300 ttctgggttg gtgttggcgg tgaaatgctg tttaagatca tattgtaatc gaccattttg 360 ttttgagcgt cagttgattg ttctatagat aataaatgat cttattgttg gaagatcttt 420 tgtagtggtg aattctatgc tacgacgtgc agttgaagat atttttgatg agttttatta 480 agagtgcctc cttttacata ttcatctagt tgagctggtt tggaaataaa agcatcagtg 540 agcatccatc aagctacgga gggcacacag ccccacgcaa attcagtagt gaaaatacga 600 catcaggatc ggtctttgtg attgtagcgc caaggagggt tcaacgtagt tgagtttacc 660 agctttggtt gacccagcat cttttcgaac gagataacca gaaggatcgt tcaaccagta 720 tcaacatcaa ggccattgac cttagttggt tcatcgaccg gttggaacgc aagtgagtga 780 gcagtaaatt ggtgagtcaa tttgaaatct ttttattacg agatcaacgg attcttactg 840 ggattttgtt ttcatgatct tcattattct tggccaccga aactattatt gaaaaacgac 900 aaacgcctat aatttggcat attgttaccg tcttttacca ttgtgtaatt tactccattt 960 tggggtcctt gttggtgcct tttgttagta ttcttttatt tgaattttag attgttattc 1020 tttctaacaa tggcgttaaa tggattgact actatgattg agccatacag aaagggctct 1080 tccttttgtg attgggtaga gcggcttgcc tattgctttc aagccaacgc agtttccgac 1140 gaaatgaaaa aagctcattt gataacatta gggggaacag ttttgtattc cgaacttaaa 1200 ttattatatc ctaatggggc tttagatacc gctgtgtatg cggacattgt aaccaaattg 1260 aaagccaggc ttgacaaaat tgagcccgat gtgatacagc gtgcccaatt taattcccgg 1320 atcaggcaag ctggtgaaac tttggaagat tttgttttag ccctgaagct tcaggccgag 1380 ttttgcggct tcggcgattt caaaaatatg gcaataagag atcgcatcgt agctggcgtt 1440 aatgacaaag ccctccaaca aaggcttcta aacgaagaaa agctgacgtt ggagacggct 1500 gagaaaatta taacgacgtg gcaaattgcc gaaactaacg ctaaagcttt gggaagtcgg 1560 gaaaacagct gggatcaaat agcctctttg aaaatggcag gtgccaacgc agttggaact 1620 tctatgagga aacttgctag aacttatgag ctggcaaggc aaggtgaaaa ttccagaggt 1680 tctgtaaaga gtcgtttagg atacagacaa tggccagtgc gtcaaggggt ttaccagaga 1740 cagccaactg attggggtaa atcgaggtca ttgaacacag gcgaatggcg gaatccggaa 1800 cacagaaggc catcgcgctt tgccaacttc acatgtgact actgtggaat taagggtcat 1860 attaaaaaac gatgcttcaa attgaagaac tggaagagag atgctgtgaa catggtggag 1920 gatcagcaag ctccggcggt ggatgaggat ctgcatgaca tgatcaacag gatgaccgcc 1980 aaggactcgg ataccgaaga cgacgacatg gacgtaggtg agagcgagaa tgaagaattt 2040 gaatgcatga atattacctc tattaacaaa atcaataacc cttgtttgtt ggagctgact 2100 attgaaggca aattacttag tatggagatc gattgtgggt cggcggtgac tgtgataaat 2160 aagcgacaat acttttcatt gtttgacaaa cctttgcaga agtgcaataa aaatttggct 2220 gttgttaatg ggtccaaact agacgtgctt ggtgaagcga aagtttttgt cagttttcaa 2280 ggaaaggaag caattttgaa acttttagtg ctagattgtt gcaataattt tattccatta 2340 ttgggtcgac cttggctgga cgttttgttc cctaactggc ggattttttt tctaatcaag 2400 ttgtgatcaa caacatggtt attgacaata atgaggcgtt gattagtgac ttgaaacaac 2460 agttttctaa agtttttgtt aaggattttt cgacacctat tgtaggcttt caagctgatc 2520 tagtgttgaa aacggaacaa ccaattttta aaaaagccta cgatgtacca tacagattaa 2580 aagacaaggt tgacgattac ttggataaac tagaaaggga gaaggttata acaccaatac 2640 aaactagtca atgggcttct ccggttatcg tggtaattaa aaagaacaac caaataagac 2700 ttgtaataga ttgcaaagtt tcaataaata agatgataat accgaatact taccctttac 2760 caactgcaca agatttgttt gcaaaattag caggatgtaa attgttttgc gctcttgacc 2820 ttgaaggtgc atacactcaa ttggaattaa cagatcgttc caagaaattt atggtcatta 2880 atacaacaag aggtttatac acttacaacc gactacctca aggagcttca tcaagcgctg 2940 ccatatttca gcaaataatg gatcaaatat tacatggtat tgacggggtt tatgtttatc 3000 tggatgacgt gctgattgcg ggaaaaaact ttgaagactg tctttctaaa ttgcggatag 3060 ttttagcgaa gttgtcaaag gcaaatatta aagtaaattt agacaaatgc agattttttg 3120 ttaaagagtt ggactatctt ggtcatgtta ttagcgaaga aggtttaaaa ccatgtcccg 3180 acaagatttc tacaataaaa aacgcaaacg tgccaaaaaa tgtaagtgaa ttgaaatctt 3240 ttttgggttt gattaattat taccaaaagt tcattcctca cctctctaca aagctgttcc 3300 atttgtacaa tctattgaag aaggatgtaa aatttgtttg ggacagcaaa tgcagtaaag 3360 cgtttgagga ttctaagaat gcgcttttgg maactgatct acttgaattt tatgacccaa 3420 acaaaccaat tgttatagtc tctgacgcct cagggtatgg actggggggt gtcattgccc 3480 acgtaataga cggagcagaa aagccaatta gttttacctc gttttctctt gataaatgtc 3540 aaaaatctac cccattttgc atctggaagc tcttgcattg gtctgtacca tcaaaaaatt 3600 tcacaaatac ctctatggga aaaagttttt agtttttaca gaccataaac cactggtagg 3660 aatatttggc aaagagggac gaaattcaat atttgttaca agaatacaaa gatatgtttt 3720 ggagctgtcc atatacgact ttgagatccg ctataggcct tcgtccaaaa tgggaaacgc 3780 agatttttgc tcccgttttc ctttgcctca gtcaatacca aatgaattac gtacggattt 3840 cgtaaggagc ataaacttca gtaaaaacgt accgctggac catgaaaaag ttgcaaatgc 3900 aaccaaaaat gatgaattta tgtgtaaagt tatttctttc atgaaacatg gttggcctga 3960 tagattaaac aaacatttcg ctaacgtttt tgcgaatcaa catgatttgg agatagtaga 4020 tgaatgttta ttgtatcagg accgcgtggt catcccagcg gtattacaac cgcatgttct 4080 gaatctctta catggtaacc acgcaggaat tgtaaaaatg aaaaggttag caaggcagtc 4140 cgtttattgg tttggcataa actcgcagat agaggaattt gtgaagaaat gtgatgtgtg 4200 taacagtatg gcaaccgttt cgaaacagaa aattgattcg gaatggattc ctacaacaag 4260 gcctttcagc agagttcatg tagacttttt ccatttttca agccacacgt ttctattgat 4320 tgttgacagt tactcaaagt gggtggaagt agaatggatg aaaactggca ctgagtgtgg 4380 aaaagtcctc aaaaaattag ttgcattttt tgccaggttt ggtcttccgg atgtgttagt 4440 ttcagacgga ggtcctccat tcaattcttt tgctttcgca cttttcttgg aaaaccaagg 4500 gattaaggtg ctaaaaagcc ctccatacca tccagcaagc aacggtcagg cggagcggtt 4560 tgtgagaacg gtgaaggacg ttttgaaaaa gttcttatta gaacctgagc attgcgaact 4620 tgaattggag gaccagatta acttattttt aatcaattat aggaataatt cattgacggc 4680 tgatgggcat ttcccatccg aaagagtgtt tgcttatcca cctaaaacat tgatcgactt 4740 gctgaatcca aaaaaacatt ataggaagtt tttactaaaa ccagctaaaa cctctaatga 4800 ggataatcta ctggaggtcg tttccaggac gaaacgaaaa aatcctctag atgatctgat 4860 ggagggtgac gtgatctggt atcagaatca taaacccgat catccagcaa aatggttgaa 4920 agcaacttac ctaaagcaaa tctctcaaaa cttattccag gttctaattg gaaacgttgt 4980 ggccacggta cacagagaac agattaggac gtcaggaggt tcagacagag agatgcgtcc 5040 aaacgtggtg ttaacaaagc agagggcgat cgatgtaacc gatgggccag gcaatcgcgt 5100 ttccgagaga ttaagtgatg atgtgaataa aaaccgtagt gataaatatt accaagaaga 5160 aaagttttct ggtaaaagga aaaaacctgc cgactgggtg gatgaaccac ctcggcgttc 5220 taaaagaaac cggaaagaaa agcgagatga tgcatttgta tatattaagt aaatgtagct 5280 ttcattaagt tctcaaggaa tatattccta tcatgtctaa aatctttctg aaattaagtt 5340 ccaatcgata gttagatttt gtcagatcaa ttcgaattag tgtatcaaac atccaaaagg 5400 tggaggagtt gttgtgttct gattttcaac aactagtttt gatcagtgta gctttgccat 5460 aaattacgta ttaccactac aatccgaatt tattaa 5496 // ID Troyka-2-LTR_BF repbase; DNA; INV; 160 BP. XX AC . XX DT 29-APR-2008 (Rel. 13.04, Created) DT 29-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE LTR of the amphioxus Troyka-2_BF autonomous LTR retrotransposon - DE consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Troyka group; KW Troyka-2-I_BF; Troyka-2-LTR_BF; Troyka-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-160 RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., RA Salamov A., Terry A., Shapiro H., Lindquist E. et al.; RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire RT and genomic organization."; RL Science 317(5834), 86-94 (2007). XX RN [2] RP 1-160 RA Kapitonov V.V. and Jurka J.; RT "Troyka - a distinctive group of gypsy-like LTR retrotransposons RT inducing 3-bp target-site duplications."; RL Repbase Reports 8(4), 515-515 (2008). XX DR [2] (Consensus) XX SQ Sequence 160 BP; 31 A; 38 C; 37 G; 54 T; 0 other; cgccatagtt atgttatgtt aatgttatgt tatgacgtca gcggtgtccc ggatgttctc 60 cgccatattg ttgaagagta tccagaagtt attaaaccgt gctgcgttca tcccttgcac 120 tcgcgtcgtc ctttgcgatc ctggtagttc actactgacg 160 // ID KABUKI_BM repbase; DNA; INV; 1165 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Bombyx mori gypsy-Ty3-like retrotransposon Kabuki gene for DE protease-like protein, partial cds. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KABUKI_BM; KW protease-like protein; putative ty3-like retrotransposon. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Abe H., Ohbayashi F., Shimada T., Sugasaki T., Kawai S., Mita K. RA and Oshiki T.; RT "Molecular structure of a novel gypsy-Ty3-like retrotransposon RT (Kabuki) and nested retrotransposable elements on the W RT chromosome of the silkworm Bombyx mori."; RL Mol. Gen. Genet 263(6), 916-924 (2000). XX RN [2] RA Gentles A. and Jurka J.; RT "Kabuki retrotransposon, partial sequence."; RL Direct Submission to Repbase Update (OCT-2004). XX DR [2] (Consensus) XX SQ Sequence 1165 BP; 374 A; 243 C; 255 G; 293 T; 0 other; caactcaact agcattgtcg cagacttcag cggacattat ggcgatctcg ctatcgacac 60 gtataccaga attttggaca gagcaaccaa gagcatggta cattaggatc gagtctatac 120 tcgccccaca gaaacttggg gatgaagtta aatttgacat tgtcgtatcg aaattaccaa 180 aagaagtcat tattcaacta acggattttc tagctaagcc accggacaca ggaaaatttc 240 tggcgttaaa aacgaaatta ttatcaacat ttgaggagtc caaaagtcga caaattgaaa 300 gactcatagg agagatggaa ctaggagatc aaaaaccttc tcagctattg cgccgcatga 360 gagagttagc cagagataaa gtaccggacg acaccctgcg agtgttatgg caaggtcact 420 tacctacaac attgagggca atccttacag tcactgaaac aaaagattta gacaacctcg 480 ccatgattgc ggataatgtt gcggaggcca cgcgagttaa ccacatctct gaggtcgtta 540 atcagcaaac ctcgaatatg gaacgaccat caacaagcga cacgtcttta attttagcgg 600 aaatagcaaa attaagcgtc agaatgacta acatggagca gtcacgatcc agacaaaact 660 actttgatcg taaccgtgat agaagtggct cacgatcttc atcaagacgc cgtaccccac 720 agaaccccaa ctggttatgt tactaccatt acaagttccg tcagaatgct cagaaatgcg 780 tccaaccatg tgcatggaaa ccaagttcgg aaaactaggc gtggtacggt cgcgggcgga 840 agctagtacc attttaggat gccgccgttt gtgtgtgatg gactacaata gtggactaaa 900 ttttttggtg gacacaggtg ctaatatttc agttttaccg gtgcctaaaa taagtaaatt 960 tatcaaaaat cctaactata agttatatgc agccaatgga actgaaattg caacatttgg 1020 tgaaaaaacg ttagtgctag atttgaagct acgtcgcgct ttccgatgga acttcgtgct 1080 tgcagatgtt aaacaggcta tattaggggc ggacttttta acttctaaca atcttttggt 1140 tgatatgggt gaacgtaagc ttatc 1165 // ID Gypsy-86_CQ-LTR repbase; DNA; INV; 1625 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-86_CQ_; KW Gypsy-86_CQ-I; Gypsy-86_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1625 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 552-552 (2011). XX DR [2] (Consensus) XX SQ Sequence 1625 BP; 417 A; 356 C; 395 G; 451 T; 6 other; tgacacagtc cggatcgttt aaaaagttta attacgaccg tgatatttcg cgtcctctga 60 tcgtcgtcaa ttgcgatcag gtgaaatcgt ggcagtgatt ttttttaaaa agttaaccgc 120 attgagtgcg tcttgaaaag gtagttttgg tgatattttt ttttttgtaa tttgtacttg 180 atttggtgaa attaaatttt gcagtcgagg tgtgaagaat tttggaggca gatctgtgat 240 cgagcctagt tagcaggagt aattttgcag gtgtgaggcc gccgcaatca gaacgagcga 300 gaaaggtaag aattagtttg gacgctttaa atttggagtt cgctmatgtt tgtgcacgtt 360 gatttaggaa cccatcaacg tgggtggctt atctggagct ggaaagctca aacgtggtcg 420 tctgggagcg caatcctgga tcggaagcmg ttttttggaa ccggaagtga agtgtttgct 480 aatcttgaac tttgagaatc ttttggtgtg tcggaatttg gggacagccc gttgcttttc 540 gctgttggca cagctctatt gccaccccca gcggcaaccg gawaaacgtc acaccggtaa 600 cgccttagtc ggtctaggac gccatctcgt agcactttag cgagagcacg ttgggatcga 660 acgctagagg tcagtgtggg atcggataac gctacacgtc cgtaacccct ctgccacgtg 720 tttgctttcg gggtgggtta gcttcggctc aatactccca cgacttccgg aaacgcacgc 780 ctgagttggt agcagtctgt gatcaatgac gcccctcagc atacactgac ccaccaacac 840 atgtgcattt gaccgattta agccccgaca actagcacca tctatcggca ggacgccaaa 900 gccgcaacaa caacacgcgc cgtacgcctg agccggtaac gcctcgcctc gcctcacacg 960 cttcaagaat cggcccgcct tcgccgatcg gtccagccga catcggaccg gaattgtcga 1020 cacatcgtgg gagcaccgca caaatcgcaa gtaacgaacc ccgcagggac gtttaggaac 1080 cggataggga aatagggacg tttagcggtc agaaggwgta gggaaaaagg gttaggatca 1140 cacctggact gagttgaact gtccacgcaa aagatggagg ttttttagct tkgcttttcc 1200 tcccaataaa aaagtataat caaaactaca aaagtttctg gcctcagttt ttttctttta 1260 atctgagaga taaaatataa tctttattct tttcttttaa tacaaataat acaaatttag 1320 tcggtcctta gtgaacttgg gtcgaagtca gttsgtaaac gaagtcacat atcgcaatag 1380 tttcagctaa atcaataaat acacacaaat atcggctcaa atcttggttt tctttcacaa 1440 ttttcaatag tacaaaatct tagttttcgg tctttagttt tgggttagtt taatgaacat 1500 tttggttggt gccactccgt tttagcctgc ccgaagagag agtttctagc cggacctcca 1560 actgcgacaa cagccttccg tttggaagtg gcgcttaagc tgttgaacgt ttcgtgtgca 1620 acgca 1625 // ID Gypsy-38_OD-I repbase; DNA; INV; 8989 BP. XX AC CABV01001283; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_OD_; KW Gypsy-38_OD-LTR; Gypsy-38_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-8989 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001283; Positions 10342 1354. XX CC Positions [2692-3243] - Reverse transcriptase CC Positions [4464-4937] - Integrase core CC 'ATAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 262..1656 FT /product="Gypsy-38_OD-I_1p" FT /translation="MTTYPALYKKTVEHLDSPELFLIYKSFKGFFPKGCTK FT QVLLLLIEDSSPGFLADKKKFYFFKQKDSKSVHFLKSKIDDPAEEHTLREL FT SPQHRNEVDQLLKDSKNDKEEEKDGDDEDSNTILISDATPSEDHKPDHDHK FT SEDMINFLDKHNISIPSTIRESPIDRLEHIVNKMEVPRRSSDFKFTQKLKF FT DPDLGVESFLIAVENYSAANNIQDQQKWIALAKAGLMNSEQGLSAQSTLEP FT EDEYSWDRFKKRIINTIGYDPSYYRSIFRNYRRKSSERVGLSFANLVQSYR FT RAFNKNGPLSQDDRTHILHQFICGQEGQLKTSLELEEGRLTYSNIIERASQ FT IERALGYPNANSSYEINAITSTPKQVSRNQPSEIECLVTLMKEMMASQASQ FT MAQHTAILSKMAEPRRSGGGNKSNSSYREDVKIAAGYCIQKLKNGSCSKGE FT CRYKHDEAPESIRKHFSTSA" FT CDS 2380..3987 FT /product="Gypsy-38_OD-I_2p" FT /translation="MTDLAAEKEEIDYFQLQREKRAEKIGFVPPIKSFGNL FT DEHEQNEVSDLIQSYRLAFTMADDDLGRLGYYRFTIPMIDESDTSYIPPRP FT VPIGLRSKVESEIDTWKQLGIIEPTQSGFNTPLIILKKGDGSIRISLDARN FT LNGKIRQDRFPLPSMTEIFSKLGERLSGSKSCYISQFDLSRGYWQVQVSEN FT DCHKLSFSHNRRHYQANRLLYGISTGPAAFSRIMNEIFGDNESFLLYLDDL FT LIIDNDYDRHMENLEFLFSSCIKHGLLLSAKKSSLMDSKIEFLGHQIDHEG FT IRPLPKHLVTINQFPVPTDRTSLKRFLGTVNYNLKYVDGLAVKLAPLHKLL FT TKEYEKFTWPAEAAEAFETVKREISNASKLFHHDLNLPLYLANDASGYGIG FT SCLYQLNKDKMEPLGYFSRALKGPDLRRPTRQRELIALAEGIRHFVYHLLN FT RRFTVLSDHKSLTYLYREHLHKRLDLRLANIMFFLADFDFEIVHKPGNDPV FT MFTPHYLSRLQHLLKSSKMRSNRKKSQTEFSHFITSRNQN" FT CDS 5353..7101 FT /product="Gypsy-38_OD-I_3p" FT /translation="MRTITFLAFLAKVLAQTYFENAGLAINELDPIELFNP FT EREVHFIRSFSTKPTREILDRIERCIAPDEISRMEKDNRIAESVNKMAFEI FT ADGLESHWQSAMTSLQMIAGRNERSAWDSAKSAAKYLYNSPFLRSVIQTAM FT PSLGNYLSAMTDKDHHEKIELLEHKLINNTHLIHQSNNILLSRLVELETKH FT CELHNRIFLENEMTRMIATVESEILSVSLGQIPHQANFVKGVLESCSKIGN FT TPRFCHRVLDANLISLTFNGLTFEHENSTLISYATLKIPVEKTSMSKVGRF FT EVVNLGAWLADQFFKLDIEGTFIRGRDNEIFELDREKCNKNFCPFEAIRVQ FT TPKCLSSLIANSTEGCEIIFQSPPAYCEIRNFGPNFLIAIPEGDIIYSNPI FT TTEAIHNQTILIKDKVTLSCHGPSGTKTVALHDEIFVRHHSELNLITLELS FT ETPRKFSRNLKVLSDISDLKDSEDSILIGKSDMKISLVILLGAAISVIVSV FT FLPHVAGKCRSLIVALISRFHNKKGKLSKKGPSETVEKQAKLDRRRSVHFE FT TRPQTRPQSFISRPQSMASRPESFVSINLDGFAEIN" XX SQ Sequence 8989 BP; 2783 A; 2198 C; 1738 G; 2270 T; 0 other; taatttggtg actgaaaata tttacgaact ccacaactct acaaacgatt cttccccata 60 cccgtacccc gagaaattgc gaaaacatat cagtctaact gatataagta gtaaaagtat 120 ctttgtaaag aaaggctcac tagctcagaa tttacccgtt gggttctaag attctgaagg 180 cttcggcttc gagctacttt ccaagtacgc taactagtga atagtattta cattccccca 240 aaactccaaa cacactccaa gatgacgaca tacccagctc tctacaagaa aacagtcgag 300 catctggact ctcccgaact ttttctgatc tacaaaagct tcaagggttt cttcccaaaa 360 ggatgcacca agcaagtgct cctgctccta attgaagatt ccagccctgg atttctggcc 420 gataaaaaga aattctactt tttcaaacag aaagactcga aaagcgtgca tttcctaaaa 480 agcaaaatcg acgacccagc cgaagaacac acactgagag aacttagccc tcagcaccgc 540 aacgaagttg atcagctgtt aaaagattcg aaaaacgata aagaagaaga aaaagatgga 600 gatgatgaag actccaatac gatcttaatc agcgatgcca caccatctga ggatcataaa 660 ccagatcatg accataaatc tgaggatatg ataaatttcc tcgataaaca caatataagc 720 attccatcta cgattagaga gtcgccaatc gaccgcctgg aacacattgt aaacaagatg 780 gaagtcccgc ggagatcaag cgacttcaag ttcacgcaga agctcaaatt tgaccccgat 840 cttggagttg agagctttct tatcgccgta gaaaattact ctgccgcaaa taacatccag 900 gaccagcaaa aatggattgc gctggcgaaa gcgggcctga tgaactccga acaaggcctc 960 agcgcacaaa gcacgctaga acctgaggat gagtattcct gggaccgatt taaaaagcgc 1020 attattaata ctattggtta tgatcccagc tattatcgct caatcttccg caattataga 1080 cgaaagtcaa gcgagcgcgt tggtctgagc ttcgcaaatt tagttcaatc atatcgacga 1140 gccttcaaca aaaatggccc tctcagtcag gatgaccgaa cacacatcct gcaccaattc 1200 atttgtggtc aagaaggaca gttaaaaaca agcctggagc tcgaggaagg tcgcctgacc 1260 tactctaaca taattgagcg agccagccag attgaacgag cactcggata ccctaatgcc 1320 aactcatcct acgaaatcaa cgcgatcaca tctactccaa aacaagtgag ccgcaaccaa 1380 cccagtgaga tagaatgcct cgttaccctg atgaaagaaa tgatggcatc acaagcatct 1440 caaatggccc aacacaccgc gatcttatca aagatggcag agccaagacg cagcggcggt 1500 ggcaacaaat caaactcgtc ttaccgtgaa gacgtaaaaa ttgcagctgg atactgtatc 1560 caaaagctga aaaatggatc ctgctcaaag ggtgagtgcc gatacaagca tgacgaagcc 1620 ccagaatcta tccgaaaaca cttttcgaca tccgcatgaa gttttgcatc ggctaatgta 1680 gtcgagccaa cttacgaatg tcattttgca aaatccgccg ccccttccca tcttaaatac 1740 attgccgcaa aaatctgcgg catcacaatt cctgctcttg tcgactccgg atgtacaaaa 1800 tcatgcatcc gttcttctct cccactactc gacaaatgta caaaaactct ttcacaatgt 1860 gcccttctca ccgccgatgt caaggatgat caaagatgtc tgcatcacag atggcatcac 1920 taaagatcca attttcccgt gactattcag ccgaaatcga ctgcgttctg gtcgacaatc 1980 tacattcctg catgatcttg ggtttggatt ttctcaaaag catcacaata accgaaaact 2040 ccgcccacat cattataaat ggacacaccc ttcccaccat ttctaagctc gattattctt 2100 ttgcagcata tcctgtcgaa aatacaacca tccctcctta ctcgtacaag cgtgttgaac 2160 ttcgcaatac ccggcaattt caagcaaaaa caatagcagt tcaaaatctg aagcattcaa 2220 aacgcaaaat caacgtcgaa acgtctatcc agcgcacttc agatcaaccc ttcgttattg 2280 taagaaataa ttcgcccgaa tagccttcaa gtctcaagaa acgcgcctat ctgtacgatc 2340 caagaaattg aggttgaggt tgtaaatgga gtaacaagaa tgactgatct cgccgctgag 2400 aaagaagaaa ttgactactt ccaattacaa cgtgaaaaac gagccgagaa aattgggttt 2460 gtgcctccaa tcaagtcatt cggtaatctt gatgaacatg agcagaacga ggtcagtgac 2520 ttgattcaaa gttaccgcct tgcctttacg atggccgacg acgatctagg acgcctgggt 2580 tattaccgat ttacgatccc gatgattgat gaatctgaca cgtcctatat tccaccgaga 2640 ccagtaccga ttggattgcg atcaaaagtt gaaagcgaaa tcgacacatg gaaacaactg 2700 ggaataattg agcctacaca aagtggtttt aataccccct tgattatctt aaaaaagggg 2760 gatggctcta ttcgtatcag cttggacgct cgaaatctta atggcaaaat tcgacaagac 2820 cgatttcctc ttccttcaat gacagaaata ttttcaaaac tgggtgagag actgtctgga 2880 tcaaaatctt gctacatctc ccagttcgat ctgtcaagag gatattggca agtccaagta 2940 agcgaaaatg actgccataa actctcattc tcccacaacc gacggcacta tcaagccaac 3000 cggctgcttt atgggatttc aactggacct gctgcgttca gccgtatcat gaatgagatt 3060 tttggcgata acgagtcatt ccttctttac ttagatgatc tgttgattat tgacaatgat 3120 tacgatcgcc acatggaaaa cttggaattc ctgttttcaa gctgcattaa gcatggactt 3180 ttgctaagcg cgaaaaagtc gtctttgatg gattcaaaaa tcgagttcct tggacatcag 3240 atcgaccatg aaggaatccg gcccctcccc aaacatctag tgacgatcaa ccaatttcct 3300 gttccaactg acagaacttc actgaagcgt ttcctaggaa ctgtcaacta taatctcaaa 3360 tacgttgatg gactagccgt gaaattggcg ccgcttcaca agctactgac caaagaatac 3420 gaaaaattca cctggccagc tgaagctgct gaagcattcg aaactgtaaa acgcgaaatc 3480 tcaaacgcgt cgaaactttt ccatcacgac ctcaatttgc cactgtatct tgcgaatgac 3540 gcaagcggct atgggattgg gagttgcctc taccaattaa acaaagacaa aatggaacct 3600 cttggatatt tcagcagagc ccttaaagga cctgatctcc gacgcccgac tcgtcaaaga 3660 gagttaatcg ccttggcaga aggaatcaga cactttgtat atcatctgct caatcgaaga 3720 ttcacagtcc tgtcagacca taaatcgctg acctatctat accgcgagca tctccacaag 3780 cggctagatc ttaggcttgc gaacattatg tttttcctgg ccgactttga ctttgaaatt 3840 gtccataagc ccggcaacga ccccgtgatg tttacgcctc attatctgag ccgccttcaa 3900 catcttttaa agtcatcgaa aatgaggtca aatcggaaga aatcccagac cgagttttcg 3960 cactttatca cttcccgcaa tcaaaactga atgataagga cgtccatcga gatctttatt 4020 tgagaacact ggtggactcg aaaaaccagc ctcccgttac gaaagatgtt atcattcatt 4080 tcggtgaagt ggaactgtca tcatcaagcc tgaaagccgc ccaagaaaaa gacgattggt 4140 gccgcaatat ccttttcatg ctgaaaaaca acagcaaaaa caaaacgact tcaaagtttg 4200 gcctgattga cgacgtcctt tactcaaccg aaaaggaagt aaaccgtcct gtgatcgttg 4260 aaccacatgc atctgagttt attacgtact gccactctag ctatggtcac ccaggaacct 4320 acgcgaccat gaaattagtg tcaaagtacg tctacataaa caaactcaaa gaagcagccg 4380 cccatatctg ctccaactgc attgattgtt tacgatctaa gccacgggcc gccttgcgtc 4440 ccacaatgat tccagttcga cattatcctg acgccccctg gacttattcc gccgtggacc 4500 tttatgatct tggcgtggca gataacaatc gcaaaagata tctgctcacg atgattgatc 4560 acctgaccgg atttctggac ggtataccga tagcaagcaa gtctgacagg cttgtcgcga 4620 atgcgatgaa cgaacttctc ctccgacatg gccttactgg acgtgtatta cttgataatg 4680 gtagagaatt tggacctcta ttttctggcc ttctcagacg gttcaatatc caaatgatcc 4740 acacatcaag ttacaactca agaagtaacg gaaagctcga aagatgtcat agatcaataa 4800 cggaaaaatt gaagctgctc aacgcgaaac ggtcagaatg gtccagccac tggccttatg 4860 tacaatttct catcaataat ctacctaaat caaacttgga tggtcttagc gcatgtgaag 4920 ccctgtacgg aagaagcatg ttcgcgccct ttgcagaaat tcgccatgcc caaccagaac 4980 cctgctccga aggctttgta aaagccctca accgctatat ttcagatctt catccgtccc 5040 ttatggcgca ccattacgcg aaatatgaaa aagatctcaa gaaagataac gggaaagtaa 5100 tacagttgaa aaagggaaca aaagtattac tctacaaacc cgacatcaaa gaaggaaaat 5160 tgtcccgcgt ctggagcggc ccctacgtgg tcgagcgtaa ctatagtcag aactcgtacg 5220 tcataaaaga cccagaacga ggacaaactt tcacgcgcca cttacgcttc atgcgcgttc 5280 tccatgacca gccacccacg acgccagagg atataatccc tgacgacgaa gaagaaccag 5340 tcattccaga aaatgagaac aatcacattt ttagcatttt tggcgaaagt tttggcccag 5400 acttactttg aaaatgcagg cctagccatc aacgaactgg accctataga gctatttaat 5460 cccgaaagag aagttcactt catcagatcc ttctccacaa aacccacacg agaaatactg 5520 gaccgaatcg aacgatgcat cgccccagat gaaatctccc gcatggaaaa agataataga 5580 atcgccgaat cagtaaataa aatggcattt gaaatcgccg atggacttga atcacactgg 5640 cagtccgcca tgaccagcct acaaatgatc gctggccgaa acgaacgatc tgcctgggac 5700 agtgcaaaaa gcgctgcaaa atatctgtac aattctccat ttctccgctc agtgattcaa 5760 accgccatgc cttccctcgg taactattta tccgccatga ctgataaaga tcatcacgag 5820 aaaatagaac tcttagaaca taaactaatc aataacactc atctaattca ccaatctaac 5880 aatatccttc tttcccgttt agtagaatta gaaactaaac attgcgagct ccacaacagg 5940 atttttctcg aaaacgagat gactcgaatg atagccacag tcgagtccga aattttgtcc 6000 gtctcactag gacagattcc ccatcaggca aattttgtga aaggagtcct tgagagctgt 6060 tctaagatag gaaatacccc tagattctgt caccgcgtat tagatgcaaa tcttattagt 6120 ttaaccttta acggacttac tttcgaacat gaaaattcta ctcttatcag ttacgccaca 6180 ctcaaaattc ctgtagagaa aacatcaatg tcgaaagtag gccgatttga ggttgttaat 6240 ctaggcgcat ggctggcgga ccagttcttc aaactagaca ttgaaggtac ttttattagg 6300 ggcagagata atgagatctt tgagcttgat cgggagaaat gtaataaaaa tttctgcccc 6360 ttcgaggcaa ttcgagttca gaccccaaaa tgcctcagct cccttatcgc aaattccacg 6420 gagggatgcg aaataatttt ccagagccct ccagcctact gtgaaatcag gaattttggc 6480 cccaattttc tgatcgctat ccccgaagga gacataatct atagcaatcc tataacgaca 6540 gaggcaattc ataaccaaac cattttgata aaggacaagg tgaccctatc ttgccatggc 6600 ccttcaggca caaaaactgt cgctcttcat gatgaaattt tcgtccgcca tcattcggaa 6660 ttaaatctca ttacactcga gctttcagaa actcccagaa aattttcgag aaatttaaaa 6720 gtcttgagcg acatttcgga tctaaaagac tcagaagact ctattctcat cggaaaatcg 6780 gacatgaaaa tttcattagt aatcctatta ggagctgcta tctccgttat agtcagtgtc 6840 ttcttgccac atgtggctgg aaaatgcaga tcacttatcg tagccttgat atcgcgtttc 6900 cataataaga aaggaaaact cagtaaaaag ggaccaagtg agactgtcga gaagcaagca 6960 aaacttgata ggaggcgatc agttcatttc gaaactaggc cacaaacaag accacaatct 7020 ttcatttcta ggccacaatc tatggcatca aggccggaat cttttgtctc tatcaatcta 7080 gatggttttg ccgaaatcaa ttaaagccta tcaccattct tccatcctgt aaatttctag 7140 ctccccctct actaacccaa gcctaatcca cgtttaagat acctaaggtg cgcaattttt 7200 gtggcgcgat cttatcagtt gataactcaa cagagaaatc cagagcccgg aggcggaaca 7260 cctttatatc cctaagcggg acaggtgaac cgagataagc atcgcataaa gccaatcgcg 7320 acaatcataa ccagcgaagc gacctgtaaa caggccctcc agcccgaaga tatcggaaca 7380 ccctaaagcc ccgagcgggg caggtgaacc ctgattatga tcgcgataga caattcgcac 7440 attaggtccg gatttccaac ccctaatcta aagaatgatt aatttttcag aatataattt 7500 aaaaccctgg cattatctac aggacgggga taatttgtcg gtcgggttcc cattggccga 7560 gattctccgg cgtcgatgga gagaaaaaaa tcagatgagc aagcgttcgg ctgccgtgtc 7620 gctggctatg ctactagaca gcgtgtaggc ggcggtgtgg tgcgcgatcc cgaccgacaa 7680 attatcccct ttctgtagct ttattccagt caaaaggaag attattcaga tcaaaaaatg 7740 tcagctcaaa aatgacccac cgggcggcgc tctttataaa aaggcggcaa tggctcttag 7800 cagcccgatg aagcccccat gtaagacggt aacgcgatac aacaaccctg aagcgggttc 7860 tgcgttagat cattatgact cataatttat gagctttagt gacattgtaa tttgattttt 7920 atttcttgcc aatttgaaag aaagaaaact atcaaaattt aaggaactaa tttattctgg 7980 aaaattaagc acaaaaaaaa aatatacttg cacaaactta cttctgcttt ttgacggccc 8040 gattaaaccg ttgtcgctgt cgttttcgga aaagcctaac cgactctgcc ctggcatgag 8100 aaactgcttt ctcagagact cagtttgact tcggagatca gcagctgctt tatcatccgc 8160 gttcttgtag aaggaagcct ctgcacaggt gtgcgaaata ggcgttgcca gtcgcagctg 8220 cgtgtaattc ttgagccacc tgctctggtc acggccatga gtgaccagct gtcggaggaa 8280 ctggaaagcc ggttgccgaa attcatgctg ctcgaagtag gcttgacggc gtttcactaa 8340 aaaaaaacga tttaaaaata tgctccataa aatttcataa atttcaaaaa aaaaattttt 8400 taaaaaaatt ttttttttct caaaatttgc ccaaaatttt ttttttatta cctggattcg 8460 acttgagcca tcgttggaga agtctcgcca gcgtttcttt ttcctggagg aggtcttcga 8520 cgtttttaca catcttgatc ttgctcattt ttatatctct caaaaacata gcaatatcca 8580 tagtgacctg caagtcagat ggtggacact tcagttcttg agtctgaaaa tttcgaattt 8640 ttatgatttt caaacttcgc gcccacttat atatgctgtg tactgacgag caccctctat 8700 tgacaatttt atgatcacct gttacatgcg cccactgtta attactgcca catgtacatt 8760 ccagcactat tcccaatcaa catcacctgc gcacagctgc cccgcgcccg cccaggtgcg 8820 ttatccgcgc tcccatcgac cgcgcacacc aattgccgac gccatctgcg accttgattg 8880 ctaatcagca aaaactttat tacttacctc tgcaactgat tgcttttatc tgcctgaaat 8940 tttttttaaa atccctatgt aaacttattt tcaaagaagg gagggatta 8989 // ID Crack-7_HM repbase; DNA; INV; 3388 BP. XX AC . XX DT 17-SEP-2009 (Rel. 14.1, Created) DT 17-SEP-2009 (Rel. 14.1, Last updated, Version 2) XX DE Crack-type non-LTR retrotransposon: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3388 RA Jurka J.; RT "Crack-type non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(10), 2141-2141 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 108..1484 FT /product="Crack-7_HM_1p" FT /translation="MAIKFPCGICHKSVKKNHRALQCDYCDTWIHIKCNRI FT DNKTYNSLKTESGSWYCLICLENTLPFSSLIDSEFQLTINDINGLSYNSDD FT EIQTNLKNPIKNFKNFNNNKIKCKYYDTXEFNRVVNKDCNVFFHVNISSLA FT YYIDDLKCLLSLINHEPSIIAITESRLIHGQLPKTNINLNGYSVEHCDSSS FT NRGGTLMYIKMDLKYKIRADLQIQKTKELESTFIEILRPGKKNIVIGCIYR FT HPCMCIAEFNNFYLQTLIEKVSIENKDIIILGDFNINLLNYDSSSDVSNFL FT DSMCSYSLFPFITQPTRITPQSKSLIDNIFLNFYKPNLVSGNLTYSLADHL FT IQFIAIPSKKFEKLHPKSYRRCFKNFDNKLFLDQLKVTDWDPVIKGANENI FT SNSTTKFLEIINKLLDYHAPFKLSTIKIQKTLSKPWITSGIIKSIKIKNQT FT XKVYRILSAFQIL*" FT CDS 2080..3363 FT /product="Crack-7_HM_2p" FT /translation="MYIRLSGFLHQSNILYRLQYGFRKNHSTNHALIDITE FT TIREALDQKKFAGGVFVDLQKAFDTVDHEILLSKLSHYGVRGTALQWFKSY FT LSNRSQFVEIANTSSSTKSTSTGVPQGSILGPLLFLVYINDLNTCIRYSRT FT YHFADDTNLLIVEKSLKKLNKYLNYDLSCLVQWLRANKISLNTKKTEIVLF FT KSRGKKIEKKLNFRISGQKLYIVDKIKYLGIILDENLXFNHQLKSISIKLS FT RANGMLAKVRHYVDYETLINIWYAIFGSHLNYGCQIWGQNYSVNKNNLINL FT QNKSIRLIHFQQNSFDLDILYSISKITKLDQMIYLLNSAFAWDVIHKNIPM FT GFNNYFKFTGNIHNHNLKSVSCNNLSIPSKQTYKYGIKSIKNHSIHSWNSL FT PSEIKKSTNQVNRNRFIKQAKEFIMKMQDDTTK*" XX SQ Sequence 3388 BP; 1297 A; 617 C; 437 G; 1033 T; 4 other; ataaaaatat acactttaaa taactccaat ggtctaccct cttcttcaat ctctaaaata 60 actacctcat aaatgcctgt tatcaagttc tgatttcatg ttcctatatg gctataaaat 120 ttccttgtgg tatttgtcac aaatctgtta aaaagaacca tagggctcta cagtgtgact 180 attgtgacac atggatacat attaagtgta acagaattga taacaaaacc tacaactcct 240 taaagacaga aagtgggtct tggtattgct taatctgttt ggaaaacact ctncctttct 300 catctctaat agatagcgaa tttcagctaa caataaatga tataaatgga ttgtcataca 360 attcagatga tgaaatacaa acaaatctga aaaatcccat caagaatttt aaaaatttta 420 ataacaataa aataaaatgc aaatactatg atacttncga gtttaaccgt gttgtaaata 480 aagattgtaa tgtcttcttt cacgttaata tatcttcatt agcatactac attgacgatc 540 taaaatgtct gctctccctt ataaatcatg aacccagcat aatagcaata actgaatctc 600 gcctaatcca tggtcaattg cctaaaacta atatcaatct aaatggctat tctgttgagc 660 actgtgattc tagctcaaat agagggggaa cacttatgta cataaaaatg gatctgaaat 720 ataaaataag agctgacctt caaatccaaa agacaaaaga gctagagtca acttttattg 780 aaatattacg tccaggaaaa aagaacatag ttataggatg catctatcgc cacccatgta 840 tgtgcattgc agagtttaat aatttctatc tgcaaacact cattgaaaaa gtatccattg 900 aaaataaaga cataattatt cttggggact ttaacattaa tcttctaaac tatgattctt 960 ctagtgatgt ttctaatttt cttgattcca tgtgttctta ctctctcttt ccttttatta 1020 cccaacctac tagaataaca ccacaatcaa aatcacttat agacaacata tttttaaact 1080 tttacaaacc taatcttgta tctggcaatc taacatattc tctagctgac caccttatac 1140 agtttattgc gataccttcc aaaaaatttg aaaaactaca tccaaaatca tatcgtcgat 1200 gttttaagaa ttttgacaac aaattattcc ttgatcaact taaagttaca gattgggatc 1260 ctgttattaa aggagcaaat gaaaatatca gtaattcaac aacaaaattt ttggaaatta 1320 tcaacaagct tcttgattat catgcaccat ttaaactgtc tacaataaaa atacaaaaaa 1380 ctctatctaa accatggatc acctctggga tcattaagtc tattaaaatc aaaaatcaga 1440 cacntaaagt ttatagaata ctttcagcgt ttcaaatctt atagaaatca aattaacaat 1500 ttaattcgat actctaaaaa attgtactac tcaaaatatt ttaatgaaaa tgtaagcaac 1560 ctgaaaaaca cctggaaagg aattaagagc ataattaact taaaatcaaa gaataaccat 1620 gttatagata gtttaacaat aaataaccaa aacatcacag acaagaaaat aattgcaaat 1680 actctaaatg agtacttttc tacaatagca gaaaaccttg catcaaaaat aatacctccg 1740 aaaaatgact ttagctatta tcttaaaaac tgcaatccta cctcatttta tataacccca 1800 gttactcctc aagaaattaa agactatata tctgccacga attcaaaaaa aggtatcgga 1860 ccaaacagca ttccctcaaa aatacttata cttgcatctc aggaactcag ttaccctcta 1920 agcgtcataa taaatgaatc attcaaatct ggaatctacc ctgacccatt taaagttgct 1980 aaggtaatac caatctttaa aaatggctcc atacaggact gctgtaatta tagacctata 2040 tcactactat caaacatcag caaactactt gaaaagttca tgtatattag attatctgga 2100 tttctacatc aatcaaatat cttatatagg ctgcaatatg ggtttcgcaa aaatcatagt 2160 acaaaccatg ccctaattga tattactgaa acaataagag aggctcttga tcagaaaaag 2220 tttgccggag gagtgtttgt ggatctgcaa aaggcattcg acacagttga ccacgaaatc 2280 ctgctgtcaa agttatctca ttatggtgtc cggggaacag ctcttcaatg gtttaaatca 2340 tatttatcta atcgctctca atttgttgaa attgctaata catcctcctc tactaaatca 2400 acatcaacag gtgttcctca gggctcgatt cttggtcctc ttctcttcct tgtatatata 2460 aatgacctca acacatgtat cagatattcc agaacatatc attttgctga tgataccaac 2520 ctattaatag ttgaaaaatc attgaaaaaa ttaaataaat atctaaacta tgacctatca 2580 tgtttagttc agtggctacg tgcaaacaaa atctccctaa atacaaaaaa gaccgaaatt 2640 gttctcttta aatctagagg taaaaaaatt gaaaaaaagt tgaatttcag aataagtgga 2700 caaaaactat atattgttga taaaatcaag taccttggta taatactaga tgaaaaccta 2760 ntgtttaatc atcaattaaa gtcgatctcc atcaaactga gtcgtgcaaa tgggatgcta 2820 gcaaaggtga gacactatgt agactatgaa accttaataa atatctggta tgccatcttt 2880 gggtcacacc ttaactatgg atgccagata tggggacaga attattctgt aaacaaaaat 2940 aaccttataa atctacaaaa taagtctata agattaatcc actttcaaca aaatagcttt 3000 gacttagata tactatacag tatatcaaaa ataacgaaac tagatcaaat gatctacttg 3060 ttgaacagtg catttgcatg ggatgttata cacaaaaata taccaatggg cttcaacaat 3120 tatttcaagt ttactggtaa tattcacaac cacaatctta aatcagtatc ttgcaacaac 3180 ttatcaatcc catcaaagca gacatataag tatggtatca aatcaataaa aaatcattca 3240 attcattcct ggaactccct acctagtgaa attaaaaaat ctaccaatca agtaaataga 3300 aataggttta ttaaacaagc aaaagagttt atcatgaaaa tgcaagatga cactacaaaa 3360 taagtatatg agtatgtgtg cgtgtata 3388 // ID BEL-66_CQ-LTR repbase; DNA; INV; 632 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-66_CQ_; KW BEL-66_CQ-I; BEL-66_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-632 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 286-286 (2011). XX DR [2] (Consensus) XX SQ Sequence 632 BP; 211 A; 123 C; 158 G; 140 T; 0 other; tgtttaaagt gtgacgacaa ttggccaatc caccctacgc gcaagcagca gcaggtgaag 60 cacgccgcgg gtgaaaacaa aacgcagtct cgagcgaact gtcagccgtc gtcgtcgtcg 120 ttgtgtcgaa ggaagtgtcg agagaaagag agggagaata aaaatcgatg tattagtgtt 180 agccaagtaa acaagtacat gtctttgtat gtattccccc ctgtaaatag tccctactgc 240 agtagtctta atatagttcg tttctgactc tcgagtgaag aacacgtttt ttccactttc 300 atccgaaacg tctcgaaaaa gttcccggcc gcgaacaaat cgtatcggtt gtatcgagtt 360 gatttagttt cgaagaattt ggtgctaatt agcaaagtgt cgaagtgaaa cgtatcgaaa 420 gcgaatcgag agactgtcga aactctggcg agcactgaag agtgtcgtgg aggtttcgcg 480 aaagttagtg tcgaaacaaa aagaactgaa agtgtcgaaa caaaaagaac taaaagtgtc 540 gaaacgaaag tgacgaaaaa tcgaaagaaa aagtgcaaaa cgagaagtgc gaacgagatc 600 cagcttcgag caaaagtgaa ccacatcgac ca 632 // ID L1-10_CQ repbase; DNA; INV; 4330 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-10_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4330 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 140-140 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 121..1092 FT /product="L1-10_CQ_1p" FT /translation="MARRINTFRIDYSMIPKKPSFEEVHEFIATVIGLKRE FT EVERIQCSRTHGCAFVKTTDLAIAQRVVEENDNKHEIEVEDKIYKLRIVME FT DGAVEVKLHDLPEDIQEQAIVDFLSAYGEVYGLRELMWGEKFAYDGFPSGI FT WVAKMKVKQNIPSYVVIDGELTALSYYGQKQTCKHCQDYAHNGISCVENKK FT LLVQKTYADAAKQVPSAKTSSTRESKQKKTKKPMQKSVFTKLTDSKKTPPG FT SSALALTPAVKPTAQLATTSKTDLTNMLPPKTTLVLAKKTVSEGYDTDTST FT TSTNSKRSLRSGSGAKKVRPNDEDNDMGDDAL" FT CDS 1082..3811 FT /product="L1-10_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MMRCKMAQHSYNIATININTITNTTKINALRSFARTM FT ELDIIFLQEVENEQLVLPGYNLVCNVDHARRGTAIALKDHIVFSHIEKSLD FT GRLITVRVHNTTLCCIYAPSGTALRAERERFFNNTLAYYLRHQTEHVIVSG FT DFNCVVRQCDATGNNYSPSLHTTIQQLHLQDVWVKLHPTSPGPTYITHNSQ FT SRLDRIYVSPGLNEGLRTIAAHACSFTNHKAVTARLCLPILGRAPGRGFWS FT LRPHLLSREHLEELQLSWQFWTRQRRNFPSWMMWWLSFAKPKLKKFFCWKS FT KIIYDDYQRKHQRLYAELRAAYDGYYQNPRMLSRINRIKAQMLTMQREFTH FT MFVHINATHVAGEPLSTFQLGERRRKRTTITRVLNDRGEHIDGSREIEEHM FT VNHFRELYAVAEEAVDAEDRFECNRVIPAGDEASEACMSEITTAEIYSAIK FT ASASKKSPGPDGLPKEFYLRAFDIIHRELNLVLNEALSSNFPPEFVDGIIV FT LVKKKNTDESARSYRPISLLNFDYKTLSRILKGRLENVMRAHNILSDGQKC FT SNSNRNIFQATLSLKDRIVQLKQHKKRAKLISFDLASAFDLVSRSFLFTTM FT RSLGFNTNLVDLLKRIGDLSSSRLLVNGNLSAAFPIQRSVRQGDPLSMHLF FT VLYLHPLLSRLQQVAGDDLVVAYADDVSVIATSTHKIEMMRETFDRFRTVS FT GAILNTRKTISIDVGFINGNPLHVPWLQTVDTVKILGVIFANCVRQMVQIN FT WDTLVTKFGQQVWMHSMRSLTLHQKVTLLNTFITSKIWYLSSTIPPNAAHT FT AKITSTMGSFXWRRLPARIPIQQLARHIXKXGXKLQLPAMKCKALLINRHL FT NEIDCMPFYKSIVFPENPTQPRPAVPDLXRCSRTSHGYLLNFKIVPLLTSS FT TNIIWS" XX SQ Sequence 4330 BP; 1258 A; 1145 C; 958 G; 961 T; 8 other; gacagttacc gctcatctgt tgagctgagc agacgtgctt gttgaaaagg gctgatcaag 60 tatttttcct cagtttgtcc ttcaataagc aaccgctaat cgcggacacg ccggtccggg 120 atggcccgtc ggattaatac ttttcggatc gactattcga tgataccgaa aaaacctagt 180 ttcgaagaag tgcacgaatt catcgcgacg gtcatcggtc tgaaacgcga ggaggtagaa 240 cgcattcagt gcagccgtac acatggttgt gcttttgtga aaactaccga tcttgcgatc 300 gcacaacgag tagtggaaga aaatgacaac aaacacgaaa tcgaagtgga agacaaaatc 360 tacaagctac gtatcgtgat ggaagacgga gcagtggaag taaaactcca cgaccttcct 420 gaagacatcc aggagcaagc gatcgtggat ttcctctcgg catacggtga agtttacgga 480 cttcgtgagc tgatgtgggg tgaaaaattc gcgtacgatg gttttccatc cggaatctgg 540 gtggccaaga tgaaggtcaa acagaacatt ccatcgtatg tggtaatcga cggagaactt 600 actgctttgt catactatgg ccagaagcag acatgtaaac attgccagga ttatgctcac 660 aatggcataa gctgcgtcga aaacaagaag cttcttgttc aaaaaaccta cgctgacgcg 720 gcgaaacaag taccttcagc caagacttcc tcaacgcgcg agtcaaagca gaagaaaacg 780 aagaaaccaa tgcagaagtc tgttttcacc aagctcaccg acagtaagaa aacccctcca 840 gggtcatctg ccttggcctt gacccccgct gtcaagccaa ccgcgcagct ggccaccacc 900 agcaagacag atctcaccaa tatgttgcca ccaaagacca cactagtact cgcgaagaag 960 accgtatctg agggatacga cacggacaca tcgacgacct ccaccaacag caaacgctcg 1020 ctacgttccg gctccggggc gaagaaagtg cgaccaaacg acgaggacaa cgatatggga 1080 gatgatgcgt tgtaaaatgg ctcaacacag ctataacatc gctactatca atataaacac 1140 catcactaac actaccaaaa tcaacgcact gcgatctttt gcaagaacga tggaactcga 1200 catcatcttc ttgcaagaag tagaaaacga gcaacttgtg ctgcctggct acaaccttgt 1260 ctgcaatgtg gatcacgcga gaagaggaac agcgatcgca ctcaaggacc acatcgtttt 1320 ctcacacatc gaaaaaagcc ttgacggacg gctaataacc gttcgtgtcc acaataccac 1380 attgtgctgc atctacgctc cgtccggaac cgctctgcga gccgaacggg agcgtttttt 1440 caacaacact cttgcgtact atctgcgaca tcaaaccgag cacgtgatag tgtcaggaga 1500 ttttaactgt gtggtaagac aatgtgatgc aactggtaac aactacagtc cctctctcca 1560 caccaccata caacaactac acctgcaaga tgtttgggtc aagcttcacc cgacctcccc 1620 tggccctacc tatattaccc acaactctca gtctagactt gaccgtatct acgtcagccc 1680 tggactgaat gaaggcctaa ggactatcgc cgctcacgca tgctccttta cgaaccacaa 1740 ggcggtgacc gccagactct gccttcccat tctcggtaga gcacctggcc gtgggttttg 1800 gtctcttcgt ccccatcttc tctctagaga acaccttgaa gagctgcagc tcagctggca 1860 gttttggacc cgtcaacggc gaaatttccc gagctggatg atgtggtggc tctcgtttgc 1920 taagccgaaa cttaaaaagt ttttctgctg gaagtcaaaa atcatttatg atgactatca 1980 gcgtaaacac caacgactct atgctgagct ccgggcagca tacgatggct attaccaaaa 2040 cccccgtatg ctgtccagga tcaatcgcat caaagcacaa atgctgacta tgcaacgaga 2100 gttcacgcac atgtttgtac acatcaacgc aacacacgtc gccggtgagc cactgtcaac 2160 cttccagctg ggagaaagaa gaagaaaaag aacaacaata acccgtgtac taaacgaccg 2220 tggagaacac atcgacggta gcagagaaat cgaagagcac atggtcaacc acttcaggga 2280 gctttacgcg gtagcagaag aggcggtgga tgcagaggat cgcttcgaat gtaaccgtgt 2340 gattcccgct ggagacgaag cgagcgaggc ctgtatgagc gagataacaa cagccgaaat 2400 atattcggcg atcaaagcaa gtgcgtcaaa aaaaagtcct ggtcccgatg gtctgccgaa 2460 ggaattctat cttcgggcct tcgatatcat ccacagagaa ctcaacctgg tcctcaacga 2520 agctctatct tcgaactttc ccccagagtt cgtcgatggt ataattgtgt tggtgaagaa 2580 gaaaaacacc gacgagtccg ctcgctcata ccggccaatc tcgctgctaa atttcgacta 2640 taaaactctc tcccgcatcc tcaaggggag gctggagaac gtgatgaggg cccacaacat 2700 tctcagcgac ggccaaaaat gctccaactc caaccgcaac atttttcagg caactctttc 2760 tctgaaagac cggatcgtac agctgaaaca acacaaaaag agagctaaac ttatcagctt 2820 tgatcttgcc tcagcatttg atctcgtatc gaggagcttt ctgttcacca cgatgcggtc 2880 tctcgggttc aacacgaact tggtggacct gctgaagcga atcggcgatc tatcgtcatc 2940 tcgcttgctc gtcaacggaa atctctcggc agccttcccc atccagcgct ccgtccgaca 3000 aggggacccc ttgtccatgc acctgtttgt actgtatctt cacccactgc tgagccgact 3060 gcaacaggtc gctggggacg atctcgtagt cgcgtacgcc gatgatgtca gtgttatcgc 3120 cacctccaca cacaaaatag agatgatgag agaaactttc gatcgctttc gcactgtgtc 3180 cggtgcgata ctcaacacca gaaaaaccat ctctatagac gttggcttta tcaatgggaa 3240 ccccctacac gttccatggt tgcaaactgt agacacggtt aaaattctgg gagtgatttt 3300 tgctaattgc gtccgacaaa tggtccaaat caactgggac actcttgtga ccaaattcgg 3360 acagcaggtg tggatgcatt ctatgcgcag tttgacctta caccaaaagg taacactact 3420 aaacaccttc atcaccagta aaatctggta tctctcctca accataccac cgaatgctgc 3480 acacacagca aaaatcacat ctacaatggg tagctttckc tggagacggt taccagcccg 3540 catcccaatt caacagctcg cacgacacat csaaaaagsg ggattmaaac tacaactgcc 3600 ggcgatgaaa tgtaaggcgc tgctcatcaa tcgtcatctc aacgagattg actgcatgcc 3660 gttctacaaa tcgatcgtgt tcccagaaaa cccaacgcaa ccacgtcccg ctgtgcctga 3720 cttaaawcgc tgctccagaa cctcccacgg ttacctgctc aacttcaaga tcgtccctct 3780 tctgacctca tccaccaaca ttatctggag ttgaccgatc gaccaagggt tgagcggaac 3840 accccgacag ctgactggca gagaatctga gaaacatcgc ctcgaaacgg ctctcttcct 3900 ctcagcgaag cgagctttac cgtttcgtaa atgagaagac cgagcaccga cgtctactac 3960 atgtgatgca gcgagtagac ggaggatact gcacacattg caattcgcca gtggagactc 4020 ttcaacacaa gtttagmgaa tgtaatcgtg ttcgtgcagc gtggcagctt ctacagcaga 4080 aagttgccgc ggttctcaat ggatggcgac gattaacctt cgaagaactt ttgcgaccag 4140 cgctgagaaa catcggacca gtgattagaa acaagattct taaattattt gtgacstaca 4200 ttagtttcat caataatgca gaaggtagaa tcgatattag tggacttkaa tttgccatga 4260 tcactgaaac gtgaatcaat aatgtaaata tttcataaat tcaacaaaat aaaaccaaga 4320 ctaaaaaaaa 4330 // ID Crypton-2_TCa repbase; DNA; INV; 1648 BP. XX AC . XX DT 17-FEB-2009 (Rel. 14.03, Created) DT 21-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Putative Crypton-type transposon. XX KW Crypton; DNA transposon; Transposable Element; Nonautonomous; KW Crypton-2_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1648 RA Jurka J.; RT "First Cryptons from insects."; RL Repbase Reports 9(3), 673-673 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 594..1457 FT /product="Crypton-2_TCa_1p" FT /translation="DIGPKSLKHSSEKTFTNFXCKLRTIIILCKRLLLILF FT LKKINRLLIQVALIFGIAGALRREELYKMKCEDVNDTGNVLIITVPDTKTH FT IERRFTVIGETPKHNLNLIDIYKKYKNQRPTNVKTDHFFLQYRQGKCKTQV FT VGINTFSKIPSQIATYLNLPDPSSYTGHAFRRSSASLLVDSGGDLMQLKKH FT GGWKSSTVAEGYVDESMQTKMDSATKILAGTSSTPTASDLSTSVYNPTVNS FT TTQKQVGNKTTNVDEVNSLSKSIVCSSLQINNAQSCTFHISINKDI*" XX SQ Sequence 1648 BP; 564 A; 288 C; 294 G; 499 T; 3 other; ttttttctac gtccgttata aaacttagaa tttttttatt catatctgtt cataaagtat 60 atcattcttg aatgaatcaa aagcgtgaaa aaaagtgccg taacgtgtca ttttttaacg 120 cgtaaatatt aatatagtag ggtgctttgg acgcggcgct gtcagtagac gctgtcttag 180 agctgtcaca tagaaaacat tttcaaagtt atttgcttgg tttttgtaag aattgtgttg 240 ttgaacaaaa aaaaaggtgt tgctagttgt gatggagtgt gaaagcgaag aaagcttcag 300 ttgcaccccg aggacgtcat tgattcggca aatgccgcta caatgaatat tcttccggaa 360 aaataaaaaa atcaatattt gaaagaatat aawgatttta tggagtggcg caggaatcca 420 aagtttcaca gagagagttt tgctcgctta ttttgattca aaatcaaaaa cgtggaragc 480 ttcaactttg tggtctgcct attccaaatt gaaggccact ttgattataa atcatgacgt 540 tgacatcagc aaatacccaa aacttattgg gtatttaaaa aataagtcaa taggatatcg 600 gcccaaaaag tctaaaacat tcctccgaga agacatttac aaatttttka tgcaagctcc 660 ggaccataat tatcttatgc aaaaggttat tattaatatt atttttaaag aaaattaatc 720 gcttactcat tcaggttgct ttaatatttg gaattgccgg cgctctgcgg agggaagaat 780 tatataaaat gaagtgtgaa gatgtgaacg acactggtaa cgttctaatt atcacggtac 840 ccgacactaa aacacatatt gaacgaagat tcacagttat tggagaaaca cccaaacata 900 atttaaactt gatagacatt tataaaaaat acaaaaatca acgaccgacg aatgtcaaaa 960 ccgatcattt ctttttgcaa tatagacagg gaaagtgcaa aacacaagtc gtaggaataa 1020 acaccttttc aaaaattcct tcacaaatcg caacttattt aaatttaccg gatccatcaa 1080 gctacactgg acatgcattt cgtcgctctt cggcgtctct tttggtagat tctggtggcg 1140 atttaatgca gttgaagaaa cacggtggat ggaaatccag cacagttgcc gaaggttacg 1200 tggacgaatc aatgcaaact aaaatggatt cggcaacaaa aattttggca ggaactagtt 1260 ccacaccgac tgcaagcgac ttgtccacaa gtgtctacaa tcctacagtt aattctacta 1320 cgcaaaaaca agtaggaaat aagactacaa atgtagacga agtaaattct ttgagcaaat 1380 caatcgtttg ctcttctcta caaatcaata acgctcaaag ttgcactttc catatttcaa 1440 ttaataaaga catttgaatt aatactatta cttttttata ttttcatatt atgacggacg 1500 tagaaaaaat gttgtatgca attcgtgagt aaagtatctt tttttaactc gtaggaattg 1560 ccgcactcgc ctacggctcg tgcggtcaac tgttcctact cgttaaaaaa agtgttactt 1620 tactcacttg ttgcataaat aactatta 1648 // ID DIRS-1_NGr repbase; DNA; INV; 4237 BP. XX AC ACER01000053; XX DT 07-MAY-2011 (Rel. 16.05, Created) DT 07-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DIRS-type DNA transposon. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-1_NGr. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RA Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B., RA Carpenter M.L., Field M.C., Kuo A., Paredez A. et al.; RT "The genome of Naegleria gruberi illuminates early eukaryotic RT versatility."; RL Cell 140(5), 631-642 (2010). XX RN [2] RP 1-4237 RA Jurka J.; RT "DIRS retrotransposon from Naegleria gruberi."; RL Repbase Reports 11(5), 1465-1465 (2011). XX DR EMBL/GenBank/DDBJ; ACER01000053; Positions 119372 115136. XX CC Only a single copy was found. Flanked by perfect LTRs. XX FH Key Location/Qualifiers FT CDS 157..3972 FT /product="DIRS-1_NGr_2p (Cre-like recombinase)" FT /translation="MSQSTIQETITDILKKHDKSAVGKLTKALIQKIIGTL FT EIEVSGQKKVLAKAIVDFIGADKFTVCTSGKSKDVEKLKDEIKHAFKEARD FT DDDDDDDEEDDGESVVEVEDGSNKRKREEVSEEDVKRFKAYEQIIDAKIAQ FT MKLEKAETERVLQQLAAESEVRKYTDTIYTPLPTTNSSLLTEYENLRKDSA FT FTMKIANLFPEIDDTKKMVANLETRAFQLQVQAVNPELASKLAFIESNPLW FT RENMDKLKMAEKLLKFDHKEKGKTSFVKSYANAKIYDKKDKAEEKVNMFGS FT ANGMKKVKKCFICNDPNHLIKDCPKRNKREGSEYVDDLFFPRGRRKKRGYS FT FHFEQMGYLEEERSFNGNSGYVERGCISSNKRRICSILWKGKFRRMVFGRE FT RMVKKRSEYFERDWSFGTLSQSKSDSQVSAGPKERIIPTDYRFKTAQSVLF FT YPQIQVQHVEEVFRTRTNEQDRSNLGFYVRYEIRISPDEGKKEGQTILCHS FT SRGRGFAIHSPSIWMEWISILFYTNDECIYQFFEIRFFNTIQLLRRCWFIT FT FWFLFICIKEQYIRYNGMCRIRCSYRKEEESVNSIKSLSTTWDQDRSGQQE FT INYDRSYIRENSNSYFHSPCQVSPRSESNIQRTVSDDWYYNCLCRLFPTCK FT GDVESDLFFSGGLGEYWRLGFRISHQRPRCNFQSPMDFIQLGPLEWKGVPI FT SKLSGILLDRCIEIGMGSFPQRKKFETTRFLEGRTMPRTYQCIRSESDLSC FT PKRMVTSFERKEFNCLHGFKGGSILYNEGKSKVSHYESIDSTNMAHYFRTR FT HPFAFVRMVTFGGESSSGQSEQKYSRERPNRLGDYQRSFSISREAAGCINR FT MGQIRLSLQLQDKTIYSEKEDQQKRESRFILSPLDRASGKLVLRTFQSNRE FT SITTLSRVQSSYGIGSSSMEKEVVVEDNRKLSNSFYESRWKSFQSLPRREP FT RTMQRILDHKSNISRLQSHDLSKIADTLMSCSIEISTTDNYESYWNRFINF FT LIKQKEVPPATSEMVVRFLTFMFIKTPSGNNCRSAMYAIKHFCKRIGWEDI FT TVNIRVQMVIKGMCNLATVATKRERDPWRIEFICKWVIVGRQFVKEKSYVL FT YTCLMIFGIRSMFRGSELGGLLIEDLKIVKFPVVGFSITVRKVKNDKKGRK FT SCIEATGSELCPMKWLQKLLDSRDKGKFLFTGGFSLNTVEISYILTQVATW FT IEADPGKYSSHSLRIGGATEAAIMGIPDATIKAMGGWNSDAIDRYFRANFR FT GDRNGSHLMGF" FT CDS join(1139..2140,2144..3082) FT /product="DIRS-1_NGr_1p (RT+RNAse H)" FT /translation="MICFFREVEERREAIHSISNKWDTWRKKGASMEIVDM FT LKEGVSVQTNEEFVPYYGKVNLEEWSLEEREWLKREVSILKEIGALEPCHN FT PKVIAKYRLVPKKESYRLIIDLRPLNRFCSIPKFKYNTLKKFLEHVPMNKT FT DQIWGSMFDMKSGYHQMKVKRKDRPYFAIQVEEEVLQFTVLPFGWNGSPYF FT FTQMMNVFINFLRSDSLTPSNYLDDVGLLHFGSYLSALRNNTFVTMVCAEL FT GVVIEKKKSQLTPLKVFQLLGIKIDLVSRKLIMTEATLEKIRTRISIALAK FT YHQDQKVTSRELFRMTGIIIAYVDCFQPARVMLNPIYSLVVDVNIGGWDSE FT FRIRDRDVIFNLQWILYNLDHWNGKVFQYPSLVEYFWTDASRLGWGVFHKE FT RNLKLQGFWKGEQCQEHINVLEVRAIYLALKEWSHLLKGKNLIVYTDSKVA FT LYCIMKGRAKSPIMNQLIQLIWRIIFEHDIRLLSLEWLPSEENQVADNLSR FT NIQEKDPTDWEITKEAFQLVEKQLDVSIGWDRFASHYNFKTKPYTVRKKTS FT RREKADSFSHLWTEPVVNWCCVPFNLIGRVLQHCLESKAVMVLVHPVWKKR FT WWWKIIESYQIRFMNLDGNHFRASQGGNLEPCKGFWTIRATLVDCRVTI" XX SQ Sequence 4237 BP; 1504 A; 617 C; 939 G; 1177 T; 0 other; tggtagtaag gtccacacct tacgtaagtt tatattgtaa aaacaataga caaccaggta 60 acgtagtcaa cctaaatttt acaatgaaaa tttaggttaa atagagagtt ttactgaagg 120 aaagtaaaac tcaagttgaa aaaattgcaa cacattatga gtcaatcaac aattcaagaa 180 accatcactg acatacttaa gaagcatgat aaaagtgctg ttggaaagtt aacaaaggcc 240 ttgatacaaa agattatagg taccttggaa attgaagttt ctggccaaaa gaaggtattg 300 gcaaaggcaa ttgttgactt tattggtgca gacaagttta cggtttgcac aagtggcaag 360 agtaaagatg tagagaagtt aaaggatgag attaaacatg cattcaagga agcaagagat 420 gatgatgacg atgatgatga cgaagaggac gatggtgagt cagttgtaga agttgaagat 480 ggctctaaca agagaaagag agaagaggta tcagaagaag atgtgaagag atttaaggct 540 tatgaacaaa taatcgatgc taagattgca cagatgaagc tggaaaaagc tgaaacagaa 600 cgtgttttac aacaattggc agctgaatct gaagttagaa aatacacaga tacaatctac 660 acaccattgc caacaactaa ttctagtctt cttacagaat atgagaattt gagaaaggat 720 tctgctttta caatgaagat agctaatctt ttcccagaaa tagatgatac gaagaaaatg 780 gttgcgaact tagagacaag agctttccaa ttacaagttc aagccgtcaa tccagaatta 840 gcatctaaat tggcgtttat agaaagcaat ccattatggc gtgagaacat ggataagctt 900 aagatggcag aaaaactact caagttcgac cataaggaga agggtaagac aagttttgtt 960 aaatcctatg caaatgccaa gatttatgat aagaaagata aggcagagga aaaggtgaat 1020 atgtttggta gtgctaatgg tatgaagaaa gtgaagaaat gctttatttg taatgaccca 1080 aaccatctga tcaaagattg ccctaagaga aataaaagag agggttcaga gtatgtagat 1140 gatttgtttt ttccgagagg tagaagaaag aagagaggct attcattcca tttcgaacaa 1200 atgggatacc tggaggaaga aaggagcttc aatggaaata gtggatatgt tgaaagaggg 1260 tgtatcagtt caaacaaacg aagaatttgt tccatactat ggaaaggtaa atttagaaga 1320 atggtctttg gaagagagag aatggttaaa aagagaagtg agtattttga aagagattgg 1380 agctttggaa ccttgtcaca atccaaaagt gatagccaag tatcggctgg tcccaaagaa 1440 agaatcatac cgactgatta tcgatttaag accgctcaat cggttttgtt ctatccccaa 1500 attcaagtac aacacgttga agaagttttt agaacacgta ccaatgaaca agacagatca 1560 aatttggggt tctatgttcg atatgaaatc cggatatcac cagatgaagg taaaaaggaa 1620 ggacagacca tactttgcca ttcaagtaga ggaagaggtt ttgcaattca cagtccttcc 1680 atttggatgg aatggatctc catacttttt tacacaaatg atgaatgtat ttatcaattt 1740 tttgagatca gattctttaa caccatccaa ttacttagac gatgttggtt tattacattt 1800 tggttcctat ttatctgcat taaggaacaa tacattcgtt acaatggtat gtgcagaatt 1860 aggtgtagtt atcgaaaaga agaagagtca gttaactcca ttaaaagtct ttcaactact 1920 tgggatcaag atagatctgg tcagcaggaa attaattatg acagaagcta cattagagaa 1980 aattcgaact cgtatttcca tagcccttgc caagtatcac caagatcaga aagtaacatc 2040 cagagaactg tttcggatga ctggtattat aattgcttat gtagactgtt tccaacctgc 2100 aagggtgatg ttgaatccga tttattcttt agtggtggac taggtgaata ttggaggctg 2160 ggattccgaa tttcgcatca gagaccgaga tgtaattttc aatctccaat ggattttata 2220 caacttggac cattggaatg gaaaggtgtt ccaatatcca agcttagtgg aatacttttg 2280 gacagatgca tcgagattgg gatggggagt tttccacaaa gaaagaaatt tgaaactaca 2340 aggtttttgg aagggagaac aatgccaaga acatatcaat gtattagaag tgagagcgat 2400 ctatcttgcc ctaaaagaat ggtcacatct tttgaaagga aagaatttaa ttgtttacac 2460 ggattcaaag gtggctctat attgtataat gaagggaaga gcaaagtctc ccattatgaa 2520 tcaattgatt caactaatat ggcgcattat tttcgaacac gacatccgtt tgctttcgtt 2580 agaatggtta ccttcggagg agaatcaagt agcggacaat ctgagcagaa atattcaaga 2640 gaaagaccca acagattggg agattaccaa agaagctttt caattagtag agaagcagct 2700 ggatgtatca ataggatggg acagattcgc ctctcattac aacttcaaga caaaaccata 2760 tacagtgaga aagaagacca gcagaagaga gaaagcagat tcattctctc acctttggac 2820 agagccagtg gtaaattggt gctgcgtacc tttcaatcta atagggagag tattacaaca 2880 ttgtctagag tccaaagcag ttatggtatt ggttcatcca gtatggaaaa agaggtggtg 2940 gtggaagata atagaaagtt atcaaattcg ttttatgaat ctagatggaa atcatttcag 3000 agcctcccaa ggagggaacc tagaaccatg caaaggattt tggaccataa gagcaacatt 3060 agtagactgc agagtcacga tttgagtaag attgcagata cattaatgtc atgttcaatt 3120 gaaatatcca caacagacaa ctatgaaagt tattggaata gatttatcaa ctttcttatt 3180 aagcaaaaag aggttccacc agcaactagt gaaatggtag tcagatttct cacctttatg 3240 ttcatcaaaa caccatctgg taataactgt cgaagtgcaa tgtatgctat taagcatttt 3300 tgtaaaagga ttggttggga ggatattaca gtaaacatta gagtacagat ggtgataaag 3360 ggtatgtgca atctggctac agtggcgaca aagagagaaa gagatccatg gagaatagaa 3420 tttatttgta aatgggtaat tgttggtaga cagtttgtta aagaaaaatc atatgttcta 3480 tacacgtgcc taatgatttt tggtataagg tcaatgttta ggggttcaga attgggtggt 3540 ttattaatag aggatctaaa gattgtgaag ttcccggtgg tggggtttag tataacagtg 3600 agaaaagtga agaatgataa gaaaggaaga aaatcatgta tcgaagcgac aggttctgaa 3660 ttgtgtccga tgaaatggct tcaaaaatta cttgattcaa gagacaaagg aaagtttttg 3720 tttaccggtg gtttttcttt aaatacagtt gagatttctt acatcttaac tcaggttgca 3780 acatggatcg aagctgatcc aggaaagtac tcttctcatt cgttgagaat tggtggagcc 3840 acggaggctg ccattatggg tatccctgat gctaccatca aggccatggg tggttggaat 3900 tcagatgcta tagacagata ttttcgagct aactttagag gtgatagaaa cggttcacac 3960 ttgatgggct tctaaactta tgctagtaag gtctcaatgt tttcgagctt gttgtctcat 4020 tgtttttact ataaatggta gtaaggtcca caccttacgt aagtttatat tgtaaaaaca 4080 atagacaacc aggtaacgta gtcaacctaa attttacaat gaaaatttag gttaaataga 4140 gagttttact gaaggaaagt aaaactcaag ttgaaaaaat tgcaacacat tatgagtcaa 4200 tcaacaattc aagaaaccat cactgacata cttaaga 4237 // ID L2B-3_CQ repbase; DNA; INV; 4148 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4148 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 144-144 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 2..895 FT /product="L2B-3_CQ_1p" FT /translation="AAGKRKAEDRAEDVEETGIGFVTPKTSFAEALRKRVK FT VTEEEVKKKQARKPDPVVVIKPKEGVQVEDARAEVQKKISAKNLNVQRVTS FT SKNGSVIVELKDEASVGVLKASVDAQLGGRFEARLRESMKPSIKIIGMSDE FT FSEEELRASLVEQNDVFANLKHFKLRKTFQIEKWRFNNHAAFVELDAETFF FT KVLEQGKVNCGWNRCRVFDGLQVTRCYKCNGYGHKGADCKAETLVCPICSE FT DHEWKDCNAETEKCANCEKLRVQRKLNIDVNHSAWSSECPVFIKEQEKRNK FT MVDFTI" FT CDS 899..3718 FT /product="L2B-3_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="QPKQCGEILYLNIAGIKSHLDELKLMMQNTKPKIVFL FT TETHLTEKHDLDEFEIQNYRCSCCLSRSAHTGGVMVYVDDRLEYEIVSNTV FT AGDNWFLAIDVKSTTLNGIYGGIYHSPSSSDAGFIDSFESWLRTVFQDCRT FT NLLVGDFNIRWNEPGYARELKNVTEALGMKQIVSEPTRVGPSSSTIIDLVF FT TNLEGVEARVTEELKITDHETICIHVGETTINLPEQKKSYVSWKRYTKERL FT QAALRAKLRRADAGVTVGEAAQTFGSTLATAVAELTDLCSNDRKVSNAWYG FT EQLRNLKTERDDAYRTFKHTKRLDDWNLYKVLRNRYVRGLRSSKNLAVEEE FT IRSCHGNPKKLWRSLKSLIRPGGEPPMEIVFEGARSERETAERLNQFFVKS FT VETIHRNIPPAQVEPAVVPAEVRVNEDGSEDLLTELQPITLTKLKETVYSL FT KDCAGVDSVTKRVMTDSFDVVGEQLLAIVNRSLESGEFPQPWKRTLVIPIP FT KVPKSTRPEDHRPINILPLYEKVLETIVKEQLLAFVDRTKVLLEEQSGFRK FT HHSCESALNLLLLKWKQAIEEGKIVLSVFVDLKRAFETIDRTKLKAVLRRY FT GVRGSALKWFSSYLDSRTQVTRYNGSTSAATAVDLGVPQGSVLGPLLFILY FT INDLKQALRRVQVNLFADDTVLFLTGTSFEECFNVMNTELASFTEWLRWKK FT LQLNVSKTKCMIVTTRPSDGVSRAVRIDGEEIERVETIKYLGVMLDEKLNF FT NDHIDYTIRKAARKFGVLCRINRYLTTEAKIDVYKSLIAPHFDYCASILFL FT ATRQQLKRMQVLQSKVMRLILXCDRLTPRQNMLDCLQWMSVRQRIEYNTLI FT FVFRVTKGMAPKYLTGTVVYGRDIHQHDTRGADNLRLQMCRKACTQNSLFY FT KGYSMYNQLPEAAKNTRNINEFKNLCKNFVRQRPME" XX SQ Sequence 4148 BP; 1301 A; 855 C; 1107 G; 882 T; 3 other; agcagcaggc aagcggaaag cggaagacag ggcagaggat gttgaggaaa ccggaatagg 60 atttgtgacg ccgaaaacaa gctttgctga agcgcttcga aagcgggtga aagtcacgga 120 ggaggaggtc aaaaagaaac aggcaagaaa accagatcca gtcgttgtca tcaagccaaa 180 agaaggggta caggtggaag acgcgcgagc agaggtgcag aagaaaatca gcgccaagaa 240 cttgaatgtt caacgcgtca ccagcagcaa aaatggttca gttatcgtcg agctgaagga 300 tgaagcgtca gtcggagtgc taaaagcaag cgtggatgct cagctaggag gtcgatttga 360 ggctcgctta cgggaaagca tgaagccgtc gatcaaaatc atcggaatga gtgacgagtt 420 tagtgaagag gagctgagag cgtcgttggt ggagcagaat gacgtctttg cgaacctgaa 480 acacttcaag ttgcgcaaaa ctttccagat cgagaagtgg cgctttaaca accatgctgc 540 attcgttgag cttgacgcgg agacattctt caaagttttg gagcagggaa aagttaactg 600 tggttggaat cggtgtcgtg ttttcgacgg gcttcaagtg acaaggtgct acaaatgcaa 660 cggatacggc cataagggag cggactgcaa ggccgagact ttagtgtgcc caatctgcag 720 tgaagaccac gaatggaagg actgtaacgc tgaaacagag aagtgtgcca actgcgagaa 780 actacgcgtg caacgcaaac tcaacattga tgtcaaccac tcggcttgga gcagcgaatg 840 tccagtattc atcaaagagc aggaaaaacg gaacaaaatg gtggatttca caatctagca 900 accaaaacaa tgcggagaaa tactgtacct gaacattgct ggaataaaat cacatcttga 960 cgagctgaaa ctgatgatgc agaacacaaa acctaaaata gtatttttga cggaaacaca 1020 cttgacagaa aagcacgacc tagacgaatt tgagatccag aattaccgwt gcagttgctg 1080 tctgtccaga tcagcacaca cgggtggggt gatggtgtat gtagacgacc gcttggagta 1140 tgagattgtc tcgaatacgg tggcaggaga taactggttc ttggcgatcg acgtgaaaag 1200 tacaacgctc aatggaatct atggaggaat ctaccattct cccagcagca gtgacgctgg 1260 tttcatagac agctttgaga gttggctgag gacggtcttc caggactgca ggacaaatct 1320 actcgttggc gacttcaaca tccgatggaa cgaaccggga tacgcacgtg agctgaagaa 1380 cgttaccgaa gccttaggga tgaagcagat agtatcagaa ccgacgcgtg tgggtcccag 1440 tagcagtaca attattgacc ttgtctttac aaacctggag ggggtggagg cgcgcgtcac 1500 tgaagagctt aagataactg atcacgagac catctgtatt cacgtcggag aaaccacgat 1560 taacctaccg gagcagaaaa agagttacgt tagctggaaa cggtatacaa aggaacgact 1620 gcaagctgcg ctacgagcaa aactgaggag agctgacgct ggagtgaccg ttggagaagc 1680 tgcacaaact tttggctcaa cactggcgac ggcggttgcg gagctgaccg atctatgcag 1740 caatgaccgg aaagtctcaa acgcgtggta tggagagcag ctgagaaacc tgaaaaccga 1800 gcgggatgat gcctacagga ccttcaaaca tacaaaaagg ctggatgatt ggaaccttta 1860 caaagtgctt cgaaaccgct acgtccgagg attgagatca tcaaagaacc tggcagtaga 1920 agaggaaatt cgaagttgtc acggaaatcc caagaagctt tggagaagcc tgaaatcgct 1980 gattcggcct ggtggagaac ctccaatgga aattgttttc gagggagcgc gaagcgaaag 2040 ggaaacggcg gaacgcttga atcagttctt cgtgaaaagt gtagagacaa tccatagaaa 2100 catcccacca gcccaggtgg aaccagcagt agtaccagca gaagtaaggg taaacgaaga 2160 tggcagcgag gacttattaa cggagcttca accaatcacc ttgacgaaac tgaaagaaac 2220 agtctactcg ttaaaagact gcgctggtgt tgatagcgtg acwaaacggg tcatgacgga 2280 ttcatttgac gttgttggag agcagctgtt ggcgattgtc aacagatccc tggaaagcgg 2340 agaattccca caaccttgga agcggacttt ggtgataccg atacctaagg taccgaaatc 2400 gacgcgcccg gaagatcaca gaccaatcaa catcttgcca ctctatgaaa aggttttgga 2460 aactatagta aaggagcagc tcctggcgtt tgtggatcgg acgaaagttt tactcgaaga 2520 gcagtctggt ttccgaaagc atcactcctg tgagtcagcg ctcaacctgt tactgctgaa 2580 atggaaacaa gcaattgaag aagggaaaat tgtcctatcg gtgttcgtag acctgaagcg 2640 agctttcgag accattgacc gaaccaaact gaaagcagta ctgcgtcgct acggtgttcg 2700 cggttcagcg ctcaaatggt ttagcagcta cttggacagc agaacgcagg tgactcgtta 2760 caacggctcc acgtcagcag ccacggcagt agatcttgga gttcctcaag gcagtgtgct 2820 tggaccgctt ctctttattt tatatatcaa cgacctgaag caagcactga gacgagttca 2880 agtaaacctt ttcgctgatg atactgtact gtttctgacc ggaactagct ttgaagagtg 2940 ctttaacgtg atgaatactg agcttgccag ttttaccgaa tggctgagat ggaagaaact 3000 acagctgaat gtcagcaaaa caaaatgcat gatagtgacc acacgaccaa gcgacggcgt 3060 aagcagagcg gttcggatag atggcgagga aatcgagagg gtcgagacga tcaaatacct 3120 cggagtgatg ctggacgaaa aattgaactt taatgaccac attgactaca caatacggaa 3180 agcagcccga aaatttggag tactctgcag aattaaccgt tacctgacga ctgaagccaa 3240 aatagacgtg tacaaatccc tgattgctcc tcacttcgac tactgtgcct ctatcctgtt 3300 tttagcaact cgtcaacaac tgaaaagaat gcaggtcctg cagagcaaag tgatgcgact 3360 aattttgmaa tgtgatcgtt taacaccaag gcaaaacatg ctggattgtt tacaatggat 3420 gtctgttcgt caaaggatcg agtacaatac gttaattttt gtgtttcgtg taactaaagg 3480 gatggcgccc aaatacttga caggtacggt ggtatacggg agggatatcc atcagcacga 3540 caccagagga gctgacaatc tcagattgca gatgtgcagg aaggcgtgta cgcagaattc 3600 actgttctac aaaggataca gcatgtacaa ccagctaccg gaagcagcaa agaacacaag 3660 gaacatcaac gagttcaaaa acttgtgtaa gaatttcgtg cgtcaacgac cgatggagta 3720 aaagtgaacc acgactgtac tgtgaggaag agcatgttat gacggccggc catcttcatt 3780 atcggtacat gtcgcatggg atcactttgg gcctcatatg atgtaagctt gatcaaaagt 3840 aacgcgaatc tgggcgcggt tttaacccta tgtgctcata tgtgtgtggg atagcaaatg 3900 atctcaatac cctaaatgga agctgcaatc tgagtgaaag acaaaaggac tctacacagg 3960 attttgtaag agcgccttga gatgaagtaa gagaggaacg gatgggcata cacggaagta 4020 gagttagatc aaaggacact ctcaataata gacgatagaa ttatcgaaag atatctgctc 4080 gtaatccttc catgctacaa aaactgtgta tgggtaagag gtgggccatc caaggaaaaa 4140 aaaaaaaa 4148 // ID L1-N6_CQ repbase; DNA; INV; 1083 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A HAL1-like non-LTR retrotransposon family from Culex DE quinquefasciatus - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; nonautonomous; KW L1-N6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1083 RA Kojima K.K. and Jurka J.; RT "HAL1-like non-autonomous non-LTR retrotransposons from the RT southern house mosquito."; RL Repbase Reports 11(1), 105-105 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >96% CC identity. CC This family encodes a protein similar to ORF1ps of L1 in CC mosquitoes. Thus it is likely a HAL1-type element. XX FH Key Location/Qualifiers FT CDS 98..1006 FT /product="L1-N6_CQ_1p" FT /translation="MSKYRRNAVVIDYNVLPVRPDIESVSKFISNDLQLDL FT KRVKNLQLNNSRNHVIIELGSPEEASDLAERHNLKHKVGCRGKLFRIPIFL FT EDNSIQVRVHDLPPHMPNEIIATHLEAYGEVNSIRREVWREYFPGIPNGVR FT VVRMVLEKPIPSFIKVEEEVGYILYNQQVKTCRHCAKPLHHGKKCDENLAL FT TPTTTTTTTTPAAQLLAPTLPALSPPVSTPSATAIEKEIAPEKTLPEIAPP FT TTTTFLPSTSVKNHQTTPSDQTMDISPEDDDDNETAKTVKPKRRSRRKKPW FT DPHWNHLYDDW" XX SQ Sequence 1083 BP; 361 A; 287 C; 215 G; 220 T; 0 other; agacgacagc ttgctcttgc agcggtcgga cgtaaacact agcctgtgct gatctccttt 60 gttctctttc tcccctgccg gggccgtgcc gatcgaaatg tcgaaatatc gccggaacgc 120 ggtcgtaatc gactataacg tcttgccggt aagaccggac atcgaatccg tttcgaagtt 180 catctccaac gaccttcaac tggatttgaa acgagttaaa aacctacaac tcaacaattc 240 ccgaaaccac gtcatcatcg agctgggatc gccggaggaa gcttcggacc tagccgaacg 300 ccataacctc aaacacaagg taggatgcag aggcaaattg ttccggatac ctattttcct 360 ggaggacaat tccatccaag tccgcgtcca cgatttgccg ccgcacatgc cgaacgaaat 420 aatcgccacc catctcgagg cctacgggga agtcaactcc attcggagag aagtctggcg 480 agaatatttt ccaggcattc cgaacggagt tcgagttgtc cggatggttc tggagaagcc 540 aattccttcc tttataaagg tagaggaaga agttggctac attttgtaca accagcaagt 600 taaaacttgc cgccattgtg cgaaaccact tcaccacggg aagaagtgtg acgaaaattt 660 agcgttaacc ccaacaacaa caacaacaac aacaacacca gcagcacaac tactagcacc 720 aacactacca gcattatcac caccagtatc aacaccatca gcaacagcga tagaaaaaga 780 aatagcacca gaaaaaacac tgcctgaaat agcaccacca acaacaacaa catttttgcc 840 atcgacctcg gtcaaaaatc accaaacaac accatcagat cagacaatgg atatttctcc 900 agaggacgac gacgacaacg aaacagcaaa aacggtgaaa cccaagagaa gatcgcgaag 960 aaaaaagccc tgggatcctc actggaatca tctatatgac gactggtaga agatcgcgcg 1020 attaattcat ctatttgatt ttttcacgac taataaaact atattttacg aaaaaaaaaa 1080 aaa 1083 // ID TCRP4 repbase; DNA; INV; 62 BP. XX AC M63895; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.cruzi ribosomal intergenic spacer repeat. XX KW Ribosomal intergenic spacer; TCRP4; spacer repetitive sequence. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-62 RA Novak M.E., Mello P.M., Gomes B.H., Galindo I., Guevara P., RA Ramirez L.J. and Franco da Silveira J.; RT "Repetitive sequences in the ribosomal intergenic spacer of RT Trypanosoma cruzi."; RL Unpublished (1991). XX DR GenBank; M63895; Positions 261 322. XX SQ Sequence 62 BP; 25 A; 7 C; 16 G; 14 T; 0 other; gacaagatga aagtatgagg gaaagatggc aacgcataac atcaataata tgcttgtgtt 60 ga 62 // ID Crack-23_BF repbase; DNA; INV; 2568 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-23_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-23_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2568 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2568 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 828-828 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..2371 FT /product="Crack-23_BF_2p" FT /translation="VLLGDLNCDMFVSSSSKKVDNLCKTFQAEQLVHEPTR FT VTATSSTCIDVIISTCPEKISECGVRATGLSDHCFTYAVRKAKQPKGTPRM FT ANVRSYRNFNELSFQEELSSASWAEVKNCSDVNDALSCFLSILSAICDKHA FT PWVTIRVRGHAPPWVTQEYLAMTRDRDFYFHRAKKTKLPFIWDTAKKLRNK FT CNNMAQYLKKQYYSNEIKAKSNDSKGLWSTLKTLPGKTRHPVNIPNKTEKD FT TANDFNMYFTSIGSKLAAAFTSVYNVTVGPPKSVFQFNEISTDFTHRQLLN FT IPLGKSTGLDGVSSRLIRHAAPVIAGPLTHIYNLSLSTGVVPSDWKMAKVT FT PLHKDGDKNNCNNYRPISVIPSFMKIFEKAIHAQVYSYLKEHNLLNECQSG FT FRPKHSTSTTLLHVTDNILNNMDKGLVTGAVFLDLRKAFDTVCHEILLQKL FT RFSGIQGMALTWFHSYLSNRTQTTVINGVNSNLLELSVGVPQGSVLGPLLF FT ILYINDLPEQISHGHIVLYADDTALFFAAKSVSDVNRALNADLQNISEWLE FT TNRLTLNISKCKAMLFGSSKRLHLGNEQLTTYLSGTHIEVVPCFKYLGVWF FT DSCLTWQFHIEKLTNAVSARIGVLRRLVPVLPQDTLAMLFNCLILPKVDYC FT DVVWGNCGKGLSDKLQKLQNRAVRIILGLSYRSHVNNEHLSALGWKDLASR FT RKLHLLQTVYKSIYNLLPVYLNIFSRTSESHSYSTRHSLNQSLRIPKVKLE FT SGKRTFAYRGAESWNALPLTVKTAPSTQTFKTLCKNVV*" XX SQ Sequence 2568 BP; 794 A; 552 C; 530 G; 692 T; 0 other; tgtgttactt ggtgatctta attgtgacat gtttgtaagt agttccagta agaaagtaga 60 caacctttgc aaaaccttcc aagccgaaca gctggtacat gaaccaaccc gtgtgacagc 120 gacgtcctct acatgtattg atgtgatcat atcaacctgt cctgagaaga tttctgaatg 180 cggtgtgcgc gctaccggac tcagtgacca ttgttttact tatgcagtta gaaaggcaaa 240 gcagccgaaa ggaacaccga ggatggcaaa tgtaagatcg tacaggaact tcaacgaact 300 gtcttttcag gaggagttgt ccagtgcttc ttgggcagag gtgaagaatt gctctgatgt 360 aaatgacgcc ctaagttgtt ttctgtcaat attaagtgca atctgcgata aacatgctcc 420 gtgggtcaca atacgtgtca gaggccacgc accaccctgg gttacccagg agtacctggc 480 aatgacacga gacagagact tttatttcca tcgagcaaag aaaacaaaac tacctttcat 540 atgggataca gcaaaaaagc tacgtaacaa atgtaacaac atggctcaat acctgaaaaa 600 acaatactat agcaatgaaa taaaggcaaa aagcaatgat agtaaaggtc tctggtccac 660 gctgaaaact ctgccaggga aaaccagaca cccagttaac attcctaaca aaaccgaaaa 720 ggacactgca aatgatttta atatgtactt tacatctatt ggttcaaaac tagctgcagc 780 ctttaccagt gtatacaatg tgactgtagg ccctccaaag tctgtgtttc agttcaatga 840 gatatccact gactttaccc accgtcaatt gctgaatata ccacttggta aaagtacagg 900 gttagatgga gtgagtagtc gtcttatacg ccatgctgca cctgttattg cgggtccact 960 tacccacatc tataaccttt cattatccac aggagtagtt ccatcagact ggaaaatggc 1020 taaagtgaca cctttacaca aagatgggga taaaaacaac tgcaacaatt acagacctat 1080 ttctgtgata ccatctttca tgaagatttt tgaaaaagcc attcatgccc aagtttacag 1140 ttaccttaag gaacacaacc ttctcaatga gtgtcagtcg ggattcagac cgaaacactc 1200 tacctcaaca accctgttac atgttaccga caacatattg aataacatgg ataagggact 1260 agtgactggt gctgtcttcc tcgatctgag gaaggcattt gacaccgttt gtcatgagat 1320 cctgctgcag aagttgaggt ttagtggtat acagggcatg gccttgacat ggttccactc 1380 ttacctgtca aaccgcacac agacgacagt tataaatggg gtgaatagca atcttctgga 1440 gttatctgtg ggagttccgc aagggtcggt cctaggaccg ctactcttca tactatacat 1500 caatgacttg ccagaacaga tcagccatgg acacatcgtg ctctatgccg atgataccgc 1560 actctttttt gctgctaagt cagtgtcaga cgtaaacagg gcactgaatg cagatctgca 1620 gaatattagc gaatggctcg aaacaaacag actcacactt aacatcagca agtgcaaagc 1680 aatgctcttt ggttcttcta aaaggctgca cctagggaat gaacaactaa ctacgtatct 1740 atctggaact cacattgaag tagtaccttg tttcaagtac ctgggggtct ggtttgactc 1800 atgtctgaca tggcaatttc acattgagaa actgaccaac gctgtttctg caagaattgg 1860 tgttcttcga cgacttgtcc ctgtgttacc tcaagataca ctggctatgt tgtttaactg 1920 tctaattcta ccaaaagttg actattgtga tgtggtgtgg gggaactgtg ggaaaggtct 1980 gtctgacaag ttacaaaagc tacaaaaccg tgcagtgaga atcatacttg gtttgtcgta 2040 taggtcacat gtcaacaacg aacacctgtc tgctcttggt tggaaagacc ttgcctcacg 2100 tcgtaaattg caccttctac aaactgtcta taagtcaata tataacctgt tacctgtgta 2160 tcttaacata tttagcagaa catcagaatc ccatagctat tccaccagac atagtttaaa 2220 ccagtcgttg agaattccga aggttaagct agaatctggg aaacgaacat ttgcatatag 2280 aggtgcagaa agctggaatg cactcccact aactgtgaaa actgctccat ctacacaaac 2340 attcaaaaca ctgtgcaaga acgttgtgtg acctctgacc tccaccccct taatgacttg 2400 gaacggaaaa ggatgtggtt ttgtttttgt aattatgtaa atattgtatt gtatgtattg 2460 ttatttgttt atgtatccag ggcctccatg aaaaacagtg ccagccactg atcggagcat 2520 accctggtta aagaaaacag aaataaataa ataaataaat aaataaat 2568 // ID BXSAT1 repbase; DNA; INV; 160 BP. XX AC L09652; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Bursaphelenchus xylophilus satellite repeat. XX KW SAT; Satellite; Simple Repeat; BXSAT1; KW Satellite repetitive element. XX NM BXSAT1. XX OS Bursaphelenchus xylophilus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; OC Aphelenchoidoidea; Aphelenchoididae; Bursaphelenchus. XX RN [1] RP 1-160 RA Tarès S., Lemontey J.M., De Guiran G. and Abad P.; RT "Cloning and characterization of a highly conserved satellite DNA RT sequence specific for the phytoparasitic nematode Bursaphelenchus RT xylophilus."; RL Gene 129(2), 269-273 (1993). XX DR GenBank; L09652; Positions 1 160. XX SQ Sequence 160 BP; 43 A; 33 C; 28 G; 56 T; 0 other; ccggtgtcta gtataatatc agagtttttc gcccagtgtc ggcgtaattt gattccaaaa 60 aagctgaaac ttgccatgct aaaatctcag gcgattagct ttcgaatggt gtatgtcttg 120 tcaattcact ccgtcatcac taattcactt ttttttaaaa 160 // ID I-2_CQ repbase; DNA; INV; 4177 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE I-type non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4177 RA Jurka J.; RT "I non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 107-107 (2011). XX DR [2] (Consensus) XX CC >98% identity to consensus (only 4 copies found). XX FH Key Location/Qualifiers FT CDS 201..3956 FT /product="I-2_CQ_1p" FT /translation="MAIEPSTESSQDALXPNDDNQNPNLTNQQAVSSTSRP FT ALTDGAADDTXTTTPTTTAIAVQWNLRGMAARTNELQQLLEQHRPVVVALQ FT EIKTKKKKDKNKLDRRRYDWXFCFXPSDGYSNGVALGIXKDVPHRYIDVQG FT PLQVVAARVEWPVLATFVSIYICREDGKTEIEDKLAQLVAQLPAPVVLLGD FT FNAHSPLWGGNHIDQRGKAIECLLGKHNFIILNDGSHTRVDPSNGRSSAID FT LSIVSDTIAPELVWSVDVDTRESDHYPIFIHSVDHTRAKRSRRPRWKYDSA FT DWTKFANEVQKPNPESAQEIEEAIIAAAEVSIPRTSTRVGRKAVHWWTNEV FT EEAVKDRRKKLRRLRKLGDDHPNKIQALNEFKLARNTARQIVEKAKADSWA FT AFVSGIAPSSGTTEIWRRVNVFRNGPKVNIHQLVINGKLSDEPGEVAETLA FT DYFAEVSAARQSTHRRAVPDQPTPTFEGGETSKYNETFSMEELDWAIRKSK FT GLSAGVDNIGYPMIRNLPHLTKTRLLELYNRIWEDGNIPDRWKEGIVVPIP FT KAGKDRKTVGNQRPITLVSCLGKIMERMVNRRLITTLEELGVFGDNQHGFR FT RGKGVDTYLAELESEIQNWIDKRQHGELALLDLAKAYDTAERAPILQNLHR FT WGIRGRMGCFVANFLKNRTFRVAIGSFLSTLRTMESGVPQGTILAVTLFLV FT RMTEVRRYIPKGVSLKLYADDILLVVHGRSAVYVRKKIQEAISSVETWTQL FT VGFELASTKSNVIHICRRNRHEERNPVTTDAGPIELVKYARLLGTGFDGRF FT NFKQHISDTKKMLESRNRVLRVIGGHRISGARKTLLDVHRAIVQSRLFYAW FT GIVSSATPAAIRPLGPAHVAGIRNASGAFRTSPVKSIYAEAGQLPFDYAAT FT EATVTKAARLEAGGTISPQHPLGVRARREFQRITQEEMPEIARKLRIGDRP FT WHAEKPKVDWEMTAYVRAGETSQKVAAAFGIVREKYGEHRQIFTDGSKDED FT FTGCGFVDGTEQTAIRLPAQCSIFSAESFAILKALERIPEGDGETTVIYTD FT SASVAEAVENGVSHHPWVQAIEREMRRTGAILCWVPGHCGIIGNERADETA FT RSAKRLDRINLPVPAQDFLSWAKNLIRLAWEREWHGERDLFLRRVKPNTLP FT GSDRDNQEEQRALTRLRIGHTRLTHEGLFRGERVTCDTCGVPMTVEHILCT FT CRKFDGLRDDGAQSVYGALCNDPEAEKTLLRYLKNTKLLTEI" XX SQ Sequence 4177 BP; 1150 A; 1152 C; 1154 G; 712 T; 9 other; cgttctgaat atagcgacgg aagcgagtcg acactggtcg aaagcgagga aggcctcgaa 60 gcttccgaaa ctaacccctc canacctccc aaccaaatta ccaccccaaa aacaaaccct 120 caaaaccctc ccaaccaaaa atcctcggga acggggaagt ccaagaattc gaagaacaaa 180 accaaaaaga gtaaacagtg atggccatcg agccatccac tgagagctct caagacgctc 240 taanccccaa cgacgacaat caaaacccga acctnaccaa tcaacaagcg gtgtcgtcca 300 cctctcgccc tgccctcacg gatggggcag cggatgacac cncaaccaca actccaacaa 360 cgactgcgat cgcagtccaa tggaacttgc gcggcatggc cgccaggacc aacgagctac 420 aacaactctt ggagcaacac cgcccggtgg tggtggcact ccaagaaatc aaaacgaaaa 480 agaagaagga taagaacaag ctggaccggc gacgatacga ctggnaattc tgtttcaanc 540 ccagcgatgg gtattcgaac ggagtcgcnc tcggtatcga naaggacgta ccgcatcgct 600 acatcgacgt tcaaggaccg cttcaagtcg tggctgctcg cgtcgagtgg ccagtattag 660 ccacctttgt ctccatctac atctgtaggg aggatggcaa gaccgagatt gaagacaagc 720 tagcacagtt agtcgcacaa ctgccggcac cggtagtact gttgggagac ttcaacgccc 780 acagtccact ctggggtggg aatcacatcg atcagcgtgg aaaggctatc gaatgtttac 840 taggtaaaca caacttcatt attctcaacg acgggagtca cacgagggta gacccgagta 900 acggaagatc atcagcgatc gacctgagca tcgtgtccga cacgatcgcg cccgagctgg 960 tttggtctgt ggacgttgac accagggaga gcgaccacta cccgatcttc atccacagtg 1020 tcgaccacac gcgggccaaa agatcgagga gaccaaggtg gaaatatgac agtgctgact 1080 ggaccaaatt cgcgaatgag gttcagaaac caaacccgga gagcgcacaa gagattgaag 1140 aagcgatcat agctgccgcc gaggttagta tcccacgaac gtctaccagg gtgggtcgca 1200 aagcggttca ctggtggacg aatgaagtcg aagaggcggt caaggatcgc cgcaagaaac 1260 ttcgtcgctt gcggaagctg ggcgacgacc acccaaacaa aatccaggcg ctgaacgagt 1320 tcaaactagc acgcaacaca gcgagacaga tcgtggagaa agctaaggct gacagctggg 1380 ctgccttcgt ttcgggcatc gctcctagta gcggaacaac cgagatctgg cggcgggtta 1440 acgtgttcag aaacgggccg aaagtcaaca ttcaccagct ggtgataaat ggaaaattgt 1500 cagacgagcc cggggaagtc gctgagacgc tagctgacta cttcgctgag gtgtcggcgg 1560 caagacaatc gacccaccgt cgggcggtac cggaccaacc aacgccgacg ttcgaagggg 1620 gagaaacctc caaatacaac gaaacattca gcatggaaga actggattgg gctatccgga 1680 agtcgaaagg gctatcagcc ggagtagaca acatcggata tccaatgatc cggaaccttc 1740 cgcacctaac caagacccga cttctcgagc tctacaaccg gatatgggag gacgggaaca 1800 tacccgaccg ttggaaagaa ggaatagtcg tcccgatccc gaaagctgga aaagacagga 1860 agacagtcgg aaaccagaga ccgataacac tcgtcagctg cctcgggaaa atcatggaaa 1920 gaatggtcaa ccgacgtctt atcaccacgc tcgaagagct gggagtcttt ggggacaacc 1980 aacacggctt ccggagaggc aaaggtgttg acacctatct cgccgaactg gaatctgaaa 2040 tccaaaactg gatcgacaaa cgccaacatg gggaactcgc tctactggac ctcgcaaaag 2100 catacgacac cgcggaacgg gcaccgattc tccagaacct tcatcggtgg ggaatccgtg 2160 ggcggatggg ctgctttgtt gcgaactttt tgaaaaaccg gaccttccga gtagctatcg 2220 ggagcttcct gtcgacactt cggaccatgg agagcggggt gccgcaggga acgattctgg 2280 cggttacact tttcctggtc cggatgacag aggttagaag gtacatcccc aaaggagtgt 2340 ccctcaaact ctacgccgat gatatcctcc ttgtcgtcca tggtagaagc gcagtctatg 2400 ttaggaaaaa gatacaggag gcgatctcca gtgtcgaaac ctggacccag ttggttgggt 2460 tcgaactggc ctcgacnaaa tccaacgtga tccacatttg ccgacggaat cgacacgagg 2520 aacgaaaccc tgtgaccacc gacgctggtc cgattgagct ggtcaagtac gcgcgactgc 2580 ttggaacggg ctttgacggg aggttcaact ttaaacagca catctcggat accaaaaaga 2640 tgttagagag ccggaaccgc gtactccgag taattggagg ccaccggatc agcggtgcga 2700 gaaaaactct actcgacgta caccgtgcaa ttgtgcagtc gcggctattc tacgcttggg 2760 gaatagtcag ttcggcgaca ccagcggcca ttcgaccact tggacccgca cacgtcgctg 2820 gaatccgaaa cgcctccgga gcgttccgaa caagtccggt caaatcaatt tacgccgaag 2880 ccggacaact tcctttcgat tacgccgcaa ccgaggcgac agtaaccaaa gccgcaaggc 2940 tggaagcagg agggaccatc agcccacagc acccactggg ggtgagagct agacgggaat 3000 tccagaggat cacccaagaa gaaatgcccg aaatcgcgcg aaaacttcgg atcggagacc 3060 gcccctggca tgcggaaaag ccaaaggttg actgggagat gacggcctac gtccgagccg 3120 gagagacctc ccagaaggtc gcggcggcgt tcggcattgt cagagagaaa tacggggaac 3180 atcgtcaaat attcacggac ggatccaaag acgaggactt caccgggtgc ggattcgttg 3240 atggcacgga gcaaacagcc atccgactgc ccgcacagtg cagcattttc tccgcagaat 3300 cgttcgcaat cctgaaagct ctagagcgaa ttcctgaagg agatggggaa actacggtga 3360 tttacactga ctctgctagc gtggccgagg cggtggagaa cggcgtatcc caccacccct 3420 gggtacaggc cattgaaagg gaaatgagac gcacaggagc catattgtgc tgggttcccg 3480 gccactgtgg cattatcgga aacgaacgcg ctgacgaaac ggcgagatcg gccaaacggc 3540 ttgaccgcat taatctacca gtcccggccc aggactttct gtcatgggcc aaaaacctca 3600 tccgcctggc atgggagagg gaatggcatg gcgaacggga tctcttcctt cgccgggtca 3660 aaccaaacac cctgccagga tccgaccgag acaaccaaga agaacaacgc gcgctaaccc 3720 gcctccgtat cggacacacc cgccttaccc atgagggtct cttcaggggc gaacgggtca 3780 cgtgcgatac atgtggagtt cctatgaccg ttgagcacat cttgtgcacc tgcaggaagt 3840 tcgacgggct tcgtgatgat ggagcccaaa gcgtttatgg agcactgtgt aatgacccgg 3900 aggcggagaa gacgctcctc cgatatctta agaacaccaa gctgctaacg gaaatctagt 3960 accactgtcg cgcccggaag tcccggggcc cccctgacca caagtctttg gaggcgtgga 4020 acagggacaa cccctcggct cttccggggc gcctgggaaa gatcaagaag aacccacggg 4080 aagtccggga gactcggggc ccccctagcc acaagtcttt ggaggagtgg aatagggaac 4140 acccccgggc tcccggttta ccggtggaag aggacgc 4177 // ID Gypsy-224_AA-LTR repbase; DNA; INV; 145 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-224_AA_; KW Gypsy-224_AA-I; Gypsy-224_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-145 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1052-1052 (2011). XX DR [2] (Consensus) XX SQ Sequence 145 BP; 49 A; 30 C; 25 G; 41 T; 0 other; tgagagtgag tgcctgcaat caactctcat tacactgttg gacctaataa atattcagta 60 tgaatttgac tagtacaaag taaagaagtg ttttcccaaa gcttaatccg aaaggttccc 120 aaattaccaa tcctcggtcg taaca 145 // ID Gypsy-599_AA-LTR repbase; DNA; INV; 226 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-599_AA_; KW Ty3_gypsy_Ele67; Gypsy-599_AA-I; Gypsy-599_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-226 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 226 BP; 55 A; 54 C; 41 G; 75 T; 1 other; tgttgggcac ccctatcgcc gactctgmca ttcttctaca ctgtcgccta tcggcagtgc 60 caactgctac gtcaccgaag aagtctgtca tcagttcgta catatacgag tagcagtaca 120 cctgtgtgca gagcatgtac attttaattc gtttaattgt taattattga ataaagttat 180 tgtttgttac gagtcataat tctccgcctt atttcccatc gtaaca 226 // ID BEL-1_DWil-I repbase; DNA; INV; 8506 BP. XX AC scaffold_180576; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_DWil_; KW BEL-1_DWil-LTR; BEL-1_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-8506 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180576; Positions 25854 34359. XX CC Positions [6450-7022] - Integrase core CC 'AAGAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2390..4333 FT /product="BEL-1_DWil-I_1p" FT /translation="MSTEPSKTAPKSNGLSSPSSTPEKVRPPITPATVKRT FT DISRKFSSVRSRSESPISRPSSSSLPQGRRFSVKDSSEHKGRSKLSTQTQI FT PHVVVTHCSDAPVTRSVSQQRREHSRMSLSDFILACDALTLFEARESEALT FT SEVPLFSIKMHLEQLKSLWSTVEMEHSRCHKALTVQTDEESLANISRIKAK FT HEYAYAAYVRCGTLFGERIEQLSAQMTSLMRQPTAPSRPAAWENLTARYEN FT SRLLVMTQARQLLQIPAVAQESGKSLRELQTAFQGGLTALERSGVSTENWD FT VLLIAICTSKLPKATILLWEQSVPNKTVVSNWSDLNQFLTDRYRVLDATEE FT HKSGPSVQTIPFGPRKSSATMKVQAFPAPSAAQVPPSSATEGPSTSNQLTN FT AQSYFASTPGRGLLGTAIVTLHHRGVDHHIRALIDSGADSTFLSERVFSMV FT QPPFYPVDAAVTGMGQSDGGCADKVCLLMLSPPCDRLNKIEVSALVVSKLS FT GAIPTAPFLNTVPTQCLDRPLADPYYNKPAPIDMIIGVDLFPHMGFCVELS FT CLTLRCLDQFSRHRRELGQKQNLMYSSPSFGKWKTFPQNRKRRMTYSASGT FT SRRPPKEERTVDMLCPYPSKVRIALTWATQDRSRWLNSCGMKHAFSRIPS" FT CDS join(4620..7391,7395..8426) FT /product="BEL-1_DWil-I_2p" FT /translation="MYRQIRVDPRHTPFQRILYRNRCGNIQDYELQTVTFG FT VNCAPFLAIRVLLQLAQDVQAMYPQASEILRNYMYVDDVLAGAHTEADARL FT AIRELQATLQSAGFPLRKWTSNKKEVLQAIPREHLLREDFLELEDASTAKT FT LGIRWQAAEDVFFFTVADPPLKESITKRQVLSHIAKLFDPAGWLAPFVIRA FT KIFMQQIWLTELGWDDVLPPLLLQQWHEFLQDYPGLSRLRIPRWLNTSDKL FT SWELHGFCDASQKAYGAALYVRVRCGDQVDVHLLTAKTRVAPVKTVSLPRL FT ELCGAVLLADLWASIVSELPIPPATSQFWTDSTIVLAWLNKPPCSWSTFVA FT NRVSSISKSTSGQSWSHVRSEDNPADLASRGVSAAELSASRLWWHGPDWLQ FT RAPEYWPTPHNELPDTQLEQRVQCHTTATTLLEDVSERFSDYGRALRVVAY FT ILRFATKRISTPSTVHLTNDELLSAERALIRTSQRREYLAEIRALGEGRPL FT PSSSTLLNLNPFLDQHGLLRSCGRLRAAESLRYDERHPILLPYSTHFTRLL FT VKFAHRITLHGGNQLMVRYLRTKFWIPRIKNMVKSHLKGCKVCIVHRRRLQ FT TQMMGDLPRERITYSRPFTHTGIDFAGPFEIKNYTGRACLITKGYVCVFVR FT FSTKAIHLEATFDLTAERFLAAFSRFVARRQCPQRIYSDNGKTFVGASKAL FT EKDFLNATKLDIMKAFPQHTLSWQFIPPSAPHMGGLWEAGVKSFKTLFYKS FT SSTVKYTFEELSTLLCRIGACLNSRPISPMSEDPEDLLALSPGHFLIGGPL FT LSISEPEILVEPMSLINRWQRLKAIQQAFCRRWKSEYLKELHKRNKWKSPT FT RSIQTGDLVIVSEGHLPSNEWRLGRIERTLPGADGLVRVADVTTARGSIRR FT PVAKLILLQDSRSPEPKTLSLPIPLCSLRMSSSPCMLFVPAHRLIHSPLRN FT FFRYSSFSSPLHSDMAPTRLPHHRSKRAAPQGTNVYRCRVCRGVHALRQCQ FT RFLNLRGEKRLRAVLINKYCPNCLAHEHSSDACRTRHGCRRCGQRHHTLLH FT MADQPPRQRASARRRSQSPPSGRAAPPLQPCSRSSSDRSASPEGAPEISLS FT SLLHARTTTVLPTAVLRLGNGSKSFETRVLLDACAAESRINRSFARSLGLA FT VTKVGVDQACTAVLESRSDETFRRNVIFRVEEDLLIRTPIREVAAPVREKF FT ATLILADPYFYRPASVSVVLGADLYPEVIPPGCVPGHSGTPAAQSTVFGWV FT VSGSCGI" XX SQ Sequence 8506 BP; 2017 A; 2443 C; 1854 G; 2192 T; 0 other; ttggtccttc gagccggatt tccttccgct cctccagctt gcatcggcgg aattcctgat 60 aagtacaaaa tttcattatt cgtttcattt gcccctacat actaataaaa gtgtgaggtt 120 ttcccgcttc tccatagcga cctacacact tttacatatt gtatgtacat acctgacctg 180 ttccaagggt ccatggtagt tcctaatcct tagcgtgttt gtccaacgtt ctcgttttcc 240 attacatttc catttcttta tttatatttt tccccgcact tgtccaaatc cacctctttt 300 acatatgcat tcacatgtac atgtaatttc cacagctgtt tgccaagtcc gcgtaaaggg 360 ttaaaaaaca aaaaaaatat taaagtgtta taaaaaatag aaaaaaaata ataatccaag 420 tgttcaccgc gttttcttat gtgttgaaat aattattttt aatgccaaac taattgccgc 480 acgcacgtat ctacgtacat acacacatcg tcatctgttg atgtacgcgc atatgtacat 540 atccatgtac atatgtataa tcccttcaca catttaccac aacacaaaag gccaaaaagt 600 attgtttttt tttttaatag cacttaacgt atgcaccgcg atttcttgcg cgttcgaatt 660 aggtgcgaaa ttaattcccc acgcgcgtct ttggaaaccg gtgcacttgc ttacatatgt 720 acatacatac ttacatagtc atacattgca catatgttat gtatgtacat atccacaaca 780 cataatccat ttcggtgtcg cgcattgatt taacaccacc ccacctccaa ttaaatgaca 840 attacttacg ttatttttgt tgtgaacaac gcgtgtgcac acatatctat atactcacct 900 gtgtacttat gcatgtacgt acatacatat gtagatcaag catttgtcca catgcattga 960 tcgtttgttt tgtttgcctt cgcgcacaac actcgcggtg tgtaaaagcg ttcttttgtt 1020 attggttttc tcttttttgc acaaccaaaa aaaaaaaatt attaacataa ttaaacaaca 1080 acatcaaaat ataaacaacg gccgacagct gttttcggag tagtggtgac acgtgagtcg 1140 agtcgtgata tcgataggcg gcttaagcgc gccaagttca aatcacatta tttccgattc 1200 ccttttttcg cctaattata tatccgccca tattcataac taaatacata tatatttttc 1260 cagcactatt gtgcccacat ttatatatcc gtccatattc ataaattata tatatataat 1320 tttctccggc cagtattttg cgcacatata tatatccgcc catatttata aataaataca 1380 tatatatttt ttccagcact attgggccca tatttatata tccattcata ttcaaaaatt 1440 ataaatatat tttttccggc cagtattttg cgcacatata tatatccgcc catatttata 1500 aataaataca tatatatatt ttttccagca ctatcgggcc catatttata tatccattca 1560 cattcaaaaa ttataaatat attttttccg gccagccttt tgcgctcata tatatatccg 1620 cccatattta taaataaata catatatatt ttttccagca ctatcgggcc catatttata 1680 tatccattca tactcaaaaa ttataaatat attttttccg gccagtattt tgcgctcata 1740 tatatatccg cccatattta taaataaata catatatata ttttttccag cactatcggg 1800 cccatattta tatatccatt catattcaaa aattataaat atatttcttc cggccagcct 1860 tttgcgctca tatatatatc cgcccatatt tataaataaa tacatatata tattttttcc 1920 agcactatcg ggcccatatt tatacatccg tccatattta taaattatat atataatatt 1980 tttttccggc cagtattgtg cgcacatata tatatccaga tccagattta tacaaaaacg 2040 tattttcttt tatcctatac atctttattt atatatttcc tccgtgtttt gttttatacg 2100 tactcccagc gttacttacg aatccataaa aacttgtttt gtaatcattt agctttgtct 2160 acgtgcaagt tttgatccgc actcgtgcgt tcgtatattg ttccgcatct atccccatta 2220 aaagggctcc gtttccgagc cgcacgcgag ataattctcc cgcaatcctg cggtgcacag 2280 tgagaaagct cttgcgtcct catacagtcc acttctcatt tcctcaaatt ttcctcattt 2340 tctcaaaacc cataaactcg gtgttcgacg gtaaggtcgt cccctgagta tgtccacgga 2400 gccgtccaag actgcgccga agtctaacgg tctttcgtct ccctcttcaa ctccggagaa 2460 ggttcggcct ccgataactc cggcgaccgt gaagcgtacg gatatttcgc gcaagttctc 2520 gtctgttcgc agccgcagcg aatccccaat atcccgtccg tcttcgtcca gcctccctca 2580 gggtcgcaga ttctcggtta aagactcgag tgaacacaaa ggtcgttcca agctttcaac 2640 ccagacccag atcccgcacg ttgttgtgac gcactgttct gacgctcccg tcacccgatc 2700 cgtatcacaa cagcgcagag aacactccag aatgtccctc tccgatttca ttctggcgtg 2760 cgacgcattg actctgttcg aagctcggga gagtgaagct ctaacttctg aagtcccctt 2820 gttttccatt aaaatgcact tagagcagct caaatcgctg tggtcaactg tcgagatgga 2880 acactcccgt tgtcacaagg ccttgacagt ccaaaccgac gaggaaagct tggcaaacat 2940 aagccgcatc aaggcgaagc acgaatacgc ttatgcggcg tatgtgcgat gcgggacgtt 3000 gtttggagaa cgcatcgaac agctctcggc acagatgacg agtcttatgc gtcagcccac 3060 cgctccgtcg aggccggccg cgtgggagaa cctcaccgca cggtacgaaa atagccgtct 3120 gcttgtcatg actcaagcca gacaactcct tcagattccc gcggtggcac aagagtcggg 3180 caaatccttg cgagagctgc agactgcgtt tcaagggggt cttaccgcgc ttgagcgttc 3240 gggtgttagt acggagaatt gggatgtctt acttatcgcc atctgcacta gcaaattgcc 3300 gaaggccaca atcctcctgt gggagcaatc agtgccaaat aagactgtcg tctcaaattg 3360 gtccgatttg aaccaatttc ttaccgatcg gtatcgtgtc ctggatgcca ccgaagaaca 3420 taagtcgggc ccaagtgtgc aaaccatccc ctttggcccc cgcaaatcct ctgcaacgat 3480 gaaggtccaa gcgttccccg cgccgtccgc tgcgcaagtc ccaccatcca gcgccacaga 3540 gggtccgtcc acgagcaacc agctcacaaa tgctcaaagt tattttgcat ctaccccagg 3600 cagaggcctt ttaggcactg cgattgtcac tctgcaccat cgtggtgtgg atcaccatat 3660 tcgggcgttg atcgactctg gggccgattc aacttttctc tccgagcgtg tcttcagcat 3720 ggtccagccc cccttctacc cagtagacgc tgcggttact ggcatgggcc agagtgatgg 3780 aggctgcgcc gacaaggtgt gtctgctaat gctgtctccc ccgtgtgaca gactgaacaa 3840 gatcgaggtc tccgccctgg tcgtatccaa attgtcaggg gccattccaa cagctccatt 3900 ccttaatacg gttccgacgc agtgcctcga tcgccccctg gctgacccgt actacaacaa 3960 gcccgcgcct attgacatga ttatcggtgt tgacctgttt ccccacatgg ggttctgtgt 4020 ggagctatca tgcctgaccc tccggtgtct cgatcagttt tcgcggcaca gacgcgagtt 4080 aggccagaag caaaacttga tgtactcctc accaagtttt gggaagtgga agaccttccc 4140 gcaaaaccgg aaaaggagga tgacgtattc tgcgagcgga acttccagga gaccacccaa 4200 agaggaaagg acggtcgata tgttgtgtcc ctacccttca aaggtccgaa tagcgttgac 4260 ttgggccact caagaccgat cgcgctggct caattcctgc ggaatgaagc acgccttctc 4320 aaggatcccg tcctaaaaac gcagtacgat gcgattattc aagagtacga ggaattgggt 4380 cacatgattc gggtgacgcc gcctcaagtg gaaaactcct atctccccca tcacgcggtg 4440 tttaagcccg atagcaccac cactaaggta cgcgtggtat ttaacgcgtc gagccctcca 4500 cacaaggcgt cagcctcaac gatgtcttgc acaccggccc cactcttcag gccgacctca 4560 cgcttcaggt actgaaatgg cggttctttc aattcgtttt caacgccgac ataacgcaga 4620 tgtatcgaca aatccgagtc gaccccaggc ataccccctt tcagcgaatc ctctaccgaa 4680 accgctgcgg caatattcag gattatgagt tacaaactgt cacgtttggg gttaactgcg 4740 cgccattctt ggcgatccga gtccttttac aactggcaca agacgtgcaa gcgatgtatc 4800 ctcaggcgag cgaaatcctt cggaactaca tgtatgttga tgatgtcctt gccggcgctc 4860 acactgaagc cgacgcccgt ctcgccatac gagagcttca agcgaccctc caatctgcgg 4920 gcttcccgct tcggaagtgg acttccaata agaaggaagt acttcaagcc attccgagag 4980 agcacctgct gcgagaggac tttctcgagc tggaagacgc gagcactgca aaaaccttgg 5040 gcatccgctg gcaagctgcc gaagacgtct tcttcttcac cgtagcggac ccccccttga 5100 aggagtcgat caccaagagg caggttctgt cccatatcgc taaactcttt gaccccgctg 5160 gttggctagc cccatttgtc atccgagcga agatattcat gcaacaaatc tggcttaccg 5220 aactgggctg ggatgatgtt ctgccgcctc tcctgctgca gcaatggcac gaattcctgc 5280 aggattaccc aggcctcagt cgactccgca ttccccgttg gctaaacaca agcgataagc 5340 tctcgtggga gcttcatggc ttctgtgacg cgtcacagaa agcgtatgga gcagccctat 5400 atgtgcgggt tcggtgcggc gatcaagtag acgtccattt attaacggct aagacccgtg 5460 tagctccagt caaaacagtc tcgctgcctc ggctggagtt gtgcggagcg gttttgctag 5520 ccgatctctg ggcgtcgatc gtttctgaac tcccaatccc acctgcaacc tcccagtttt 5580 ggaccgactc taccatcgtg ttggcttggc tcaataaacc cccgtgcagt tggtccacct 5640 tcgtggccaa tcgggtatcc agcatctcca aaagcaccag tggccaaagc tggtcacatg 5700 tgcgctccga ggacaacccc gctgatctgg cgagccgtgg ggtctccgcg gccgagttat 5760 cggccagcag actctggtgg catgggccgg actggctcca gcgagccccc gagtattggc 5820 caactcccca taacgaactc ccagacaccc agctggagca acgagtccag tgccacacca 5880 ccgcgaccac cctactcgag gacgtcagcg aacgtttctc ggattatggt agggcattac 5940 gagtcgtcgc gtacatccta cgcttcgcca ccaaaaggat ttcgacgcct tccacggttc 6000 acctcaccaa tgacgaactt ctctccgccg agcgcgcctt gattcggacc tctcagcgtc 6060 gggaatacct cgcggagata cgggccctgg gggaagggcg acccctgcca tcatccagca 6120 cgctactcaa tctgaatcca tttctcgatc agcatgggtt actccgctcc tgcggacgac 6180 tccgggccgc cgagtccctc cgatatgatg aacgtcaccc tattctcctc ccctatagta 6240 ctcatttcac tcgcctgctt gtcaaattcg cccaccgcat cacgttgcat ggcggcaatc 6300 aactgatggt gcggtatctg cgcaccaaat tctggatccc acggatcaag aacatggtga 6360 agtcccacct caaggggtgc aaagtctgca tcgttcatcg tcgacgacta cagacccaga 6420 tgatgggtga tctccctcga gaacggatca cgtactccag gcccttcacc cacacgggca 6480 ttgatttcgc agggccgttc gagatcaaaa attacaccgg gcgtgcgtgt ctcataacta 6540 agggttatgt ctgcgtcttt gtgcgtttca gcacgaaggc aatccatctc gaggctacct 6600 tcgacctcac cgctgaacga tttctcgcag ccttctctcg gtttgtcgca cggagacaat 6660 gcccacagcg catttactcc gacaatggca aaaccttcgt aggggcctca aaggcgctcg 6720 aaaaagattt cctgaacgcc accaagcttg atataatgaa ggcatttccc cagcatactt 6780 tatcctggca gttcattccg ccgagcgccc cccatatggg aggactatgg gaagcagggg 6840 tcaaaagttt taagaccctg ttttacaagt cctcgtccac tgtcaaatac accttcgagg 6900 aactttccac tctcctgtgt aggatcgggg cgtgcctaaa ctccaggcca atctctccca 6960 tgagcgagga tccagaggac ctgctggccc taagcccggg acatttcctg attggaggac 7020 cgctcttatc catttcggaa ccggaaattc tggtcgagcc tatgtcactg atcaaccgat 7080 ggcagcgact gaaggcaatc cagcaggctt tctgccgccg ctggaagagt gagtacctca 7140 aggagcttca caaacggaac aagtggaagt caccgactcg cagcatccag accggcgacc 7200 tcgtgatcgt gtccgaaggt catctcccct ccaacgagtg gcgcttaggg cgaattgagc 7260 gcactctgcc cggggctgac ggtctcgtcc gagtggctga tgtcaccacc gcgcgtgggt 7320 ctatccgaag acccgtagcg aagcttattc tgttgcaaga cagcaggagt cctgagccta 7380 aaaccttatc gtgactccca atccctctat gttccctccg aatgtcctcg tctccgtgca 7440 tgttgttcgt tcccgcgcat agactaatcc actcccccct acgtaatttc tttcgttaca 7500 gttcgttcag ctcccccctt catagcgaca tggcccccac acgtctgccg caccaccgca 7560 gcaaacgagc tgctccgcag ggcacgaacg tgtaccgatg ccgggtctgt cggggggttc 7620 atgcactccg gcagtgtcag cgcttcctga acctccgagg ggaaaagcgg ctccgggcgg 7680 tgctgatcaa caagtattgt cccaattgct tggcacacga gcattcatcc gacgcgtgcc 7740 ggactcgcca tggttgccga cgatgcgggc aaaggcacca taccctgctc cacatggccg 7800 accaaccgcc gagacaacgt gcctccgccc ggcgtcgcag ccaatcaccc ccctccgggc 7860 gtgctgcacc cccgctacaa ccctgttccc gatcatcgtc cgaccgctcc gcttccccag 7920 aaggcgcacc cgaaatctct ctatcctccc tgctccacgc caggacgacg actgtcctgc 7980 caaccgcagt tctccggttg ggcaatggct cgaaatcctt cgagacccgc gtccttctcg 8040 atgcgtgtgc ggctgagagc cgcatcaatc gctcctttgc ccggtccctg gggttagctg 8100 tgaccaaggt cggggtcgac caagcctgta cggccgtcct cgagtcgagg tccgacgaga 8160 cgttccgacg gaacgttata ttccgggtcg aagaggactt gctcatccgc actcccatcc 8220 gggaagtggc ggcgccggtt cgggagaaat tcgccaccct gatcctggcc gacccctatt 8280 tctatcggcc ggcgtcggtg tccgtggtac tcggggcaga cctctatcct gaggtcatcc 8340 caccaggctg tgtgcccggt catagcggta ctcccgcagc ccagagcacc gtgttcggat 8400 gggtcgtctc cggctcctgt gggatctagg cgctccagaa cgttgccgtc ccccgtcctt 8460 tatatgtcca gggtggcaat tgcaacatgt tgcaaggggg gcggta 8506 // ID Copia-2_SI-LTR repbase; DNA; INV; 289 BP. XX AC AEAQ01001041; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_SI_; KW Copia-2_SI-I; Copia-2_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01001041; Positions 852 1140. XX SQ Sequence 289 BP; 63 A; 69 C; 54 G; 103 T; 0 other; tgttagattt aatatattat gtatttctca ttttcttatt gtattcttaa tccgcgtctt 60 gtattgtgtt atgcgacgta tcctaatgcg tttgtttttt ttttgtaacc tctgcgaccg 120 cgaaccagcg cgaggcgctg tcgtcgagcg acgttttcgc cggttgagaa accatccagt 180 tcagtccgcc tgattccggc actgagtaaa ccacgttttt aataaagcta tacatatcac 240 gaatatctcc tgtttcaact cgactggctt cccgttccta accgtaaca 289 // ID Gypsy-245_AA-LTR repbase; DNA; INV; 1173 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-245_AA_; KW Gypsy-245_AA-I; Gypsy-245_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1173 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1092-1092 (2011). XX DR [1] (Consensus) XX SQ Sequence 1173 BP; 340 A; 239 C; 357 G; 233 T; 4 other; tgtaaccagt gtcagtgtga tgtctagtgt tgatgagatt tgtcgcgttc agtttcgtaa 60 atgtattgta agaaaaaaag aaaacgtttg tagataggtt tgagtaagat gtcccgaaag 120 atcccacgaa cgcagccaac acagctccac cctatagaag gacgagagta gtgccgctag 180 tcaccaccag ggctgcgcta aaccacacga atgggcggcc atgggggtgg cgaaatgggt 240 ggccatgatg aggcggatca tcgatgacgt agatggacat gggtcaacgg ccaaaggttc 300 ggggtcaacg ttcaccagca agacacttsa taagcctggt gatcgttgaa gacagcagaa 360 gtgctttcaa aaagttataa agctataaga gaaagtgttg tgagaagttt aagtgagtag 420 ataatttgaa gaggagaaag gaagtgaagt gaaactggac atccaggtaa attaataacc 480 ccgaacctag ccgaccgtac cgcgtctgtt tccaccattt gaaatccgtc tgccacagga 540 cctcgcggtt tttattcgaa atcgccattg ccgattgagc cgttcckacc tgcgttgaac 600 ctactcgcta atccagtagt gtatccctga agacctgtac cccagaacga caaccgagtg 660 cgcttggaaa cccaggcttt ccagtgaatc cacggatcac tgaagcctcg agtccgtcac 720 cacactcgag gggcgcacgg gacaggcagg aggtccacga gaggctgcag ccgaagtagg 780 agagaggtga ggaascattc gtccgcccct ggccaggaka gaaggagaag tctaagcggt 840 cggtgaagga gcaacgggag gagagcggag aaggtgctgc agtgcgcggg gccccgaaga 900 agtgaaggaa ttgtaaggta ggagtagata agaaagtggg aggcgtcgtc cggtgttgag 960 gacggacgca agtaaagaga aggaaagatt aggaggagga acaggaagta ggacgtcaag 1020 gaagtggaaa tacaagtgta cacgagacca cgaaatagag ggagttttac tggagtctat 1080 ctataagtgt tttctttcgc ggagtgtcac gagcgtagcg ccgaccctga gttagcttgg 1140 agacgggggt ctcatctagg ctgtggagta aca 1173 // ID Gypsy-213_AA-LTR repbase; DNA; INV; 607 BP. XX AC AAGE02029464; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-213_AA_; KW Gypsy-213_AA-I; Gypsy-213_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-607 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029464; Positions 132208 131602. XX SQ Sequence 607 BP; 160 A; 137 C; 151 G; 159 T; 0 other; tgtaggatat tatactgatt gaaaataata tacatctaaa tttacaaata tgagctgaaa 60 aatgcgtatc gaaaatattc atatcacaaa cttcccatta gcaaagctta atgttgtgaa 120 tgacggcttt tccttatatt tcaatacccc actctcacat tcagcaacag aatcgttccg 180 tatccatgtg gggcgtgcct aacgggtcgc attattcatc ggttcattgc atcatttagg 240 tgtataaaag cagcacgcta cggagggagc gctcattcta ttggcgaaat tcggcgagtg 300 cagatcggtc ggaaaggaac cgcagcagca aacgaaaggt cttgcagtgg aagagcgtgt 360 ggtctcgtaa ccatcgtaac agcggcatcg aaacgagcgt gtggtctcgt aaccatcact 420 gtgcgtcagt cttcgtgccg agtgggtggt catcgatcca gcgaagcgag tcgtgcgtta 480 ggctcgtgat cttgccgacg ggtttttcgg agcggaacat cgatccgagc ggtaggcctc 540 gttactagct cactcatcgg attcctcgtt tgaaccgttc gacgtcgagt cacgatgaat 600 tacgaca 607 // ID R2-2_DWi repbase; DNA; INV; 3554 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE willistoni. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2-2_DWi. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3554 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. D. willistoni contains two subfamilies of R2. There may be CC only two copies. XX FH Key Location/Qualifiers FT CDS 158..3319 FT /product="R2-2_DWi_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="FERRSNSWGYQNLEPSNVGQDMNTVPRINNTTTTPAT FT SRPGDQPREAIAVVNLAGEIPCAVCGRLFNTRRGFGVHMSHQHKDELDTQR FT QREDVKLRWSEEEAWMMARKEVELEASGNLRFPNKKLAEVFTHRSSEAIKC FT FRKRGEYKAKLEQIRGQSTPTPEALDSITSQPRPSLLERNHQVSSSEAQPI FT NPSEEQSNWEIMRILQGYRPVECSPRWRAQVLQTIVDRAQAVGKETTLQCL FT SNYLLEVFPLPNEPHTIGRSNLRRPRTRRQLRQQEYAQVQRRWDKNTGRCI FT KSLLDGTDESVMPNQEIMEPYWKQVMTNPSTCSCENTRFRMEHSLETVWSA FT ITPRDLRENKLKLSSAPGPDGITPRTARSVPLGIMLRIMNLILWCGKIPFS FT TRLARTIFIPKTVTANRPQDFRPITVPSVLVRQLNAVLASRLASKVNWDPR FT QRGFLPTDGCADNATLVDLILREHHKRWKSCYLATVDVSKAFDLVSHQAII FT KTLQAYGAPTNFVSFIEEQYKGGGTSLNGAGWSSEVFIPARGVKQGDPLSP FT LLFNLIIDRLLRSYPREIGAKVGNTMTSAAAFADDLVLFAETPMGQTLLDT FT TLGFLASVGLSLNADKCFTVSIKGQAKQKCTVVERRSFCVGERECPSLKRT FT EEWKYLGIRFTADGRAQYSPADDLGPKLLRLTRAPLKPQQKLFAHRTVLIP FT QLYHQLTLGSVMIGVLGKCDRLVRQFVRRWLDLPLDVPVAYFHAPHTCGGL FT GIPSIRWIAPMLRLKRLSNIKWPHLEQSEVASSFIDDELQRARDRLKAENV FT QRCSRPEIDSYFANRLYMSVDGCGLREAGHYGPQHGWVSQPTRLLTGKEYL FT HGVKLRINALPSKSRTTRGRHELERRCRAGCDAPETTNHILQKCYRTHGRR FT VARHNSVVNAVKRGLERKGCVAHVEPSLQCDSGLNKPDLVGIRQNHIYVID FT VQVVTDGHSLDQAHQRKVERYDRADIRSQMRRFFGVTGEIEFHSVTLNWRG FT IWSGQSVKRLIAKDLLIAEDTKLISVRAVNGGVTSFKYFMYCAGYTRS" XX SQ Sequence 3554 BP; 995 A; 795 C; 922 G; 841 T; 1 other; gaagctggga agctgggtcg gatgagcgca gaaggggtgt tcttcggagc actgtaattc 60 ataagtcgta agtctgatca agtcgactcg gaacctcttc gtggtgtttc ctgggtgctg 120 ttgagttcct agtctctagg ttctctccag tagctaattc gagcggcgaa gcaactcttg 180 gggttaccag aaccttgagc caagtaacgt tggtcaagat atgaatactg tgcctcggat 240 aaacaacaca acaacaactc cagcgacttc ccgtcctgga gaccaaccga gagaggctat 300 agcagtggta aatctcgcgg gagagattcc ctgtgcagta tgtgggcgcc tcttcaatac 360 tagaaggggg tttggggttc atatgtcaca tcaacacaaa gacgaactag atacgcaacg 420 tcagcgtgaa gatgtaaaac tccgatggag tgaggaagaa gcgtggatga tggcgagaaa 480 ggaggtggag ctcgaagcaa gtggtaattt gagatttcct aataagaagc tagcggaagt 540 atttactcac cgtagctccg aagcaattaa atgttttcgg aagaggggtg aatataaggc 600 aaaactggag cagatcagag ggcaatctac tcccacccca gaagcgttgg actcaattac 660 ctcacagcct cgccctagtt tactcgagcg aaaccaccaa gtatcatcgt cggaagcgca 720 accaatcaat ccatcagaag aacagtcgaa ctgggaaatc atgcggatac tacagggcta 780 tcgccccgta gaatgtagtc cccggtggag agcccaggtc ttgcaaacta tcgtagatag 840 ggcgcaggcc gtagggaagg aaaccactct ccaatgctta tccaactatc tcctggaagt 900 atttccatta ccaaacgaac cacacaccat cggtcggagc aatttgcgaa gacctcgaac 960 taggagacag ttaagacaac aagagtacgc acaggttcag cgtcgttggg ataagaatac 1020 tgggagatgc attaaatcct tgcttgatgg aacagatgag tcggttatgc caaaccaaga 1080 gataatggaa ccctattgga aacaagtaat gacgaatccc agcacatgct cttgcgaaaa 1140 cacaagattc cgtatggaac attcgcttga gacggtttgg tcagcgataa cgccacgcga 1200 cctgagggaa aataagttaa agttgtcaag tgctccgggt cctgacggta tcactccaag 1260 aacagccagg agtgtaccct taggcattat gctacgcata atgaacctga ttctctggtg 1320 cggcaaaata ccattctcta cccgactggc cagaactatc ttcattccga agactgtgac 1380 ggcaaatcga ccgcaagact ttcgtccaat aacagtcccc tcggttttgg tcaggcaatt 1440 aaacgctgtt ctggcttctc gattggcttc taaagtcaac tgggatccaa ggcagcgcgg 1500 tttcctacct accgatgggt gtgctgataa tgcgacgttg gttgatctca ttttgcggga 1560 gcaccataaa cggtggaagt catgttacct tgcgacggtg gatgtcagca aggcttttga 1620 cttagtatca caccaggcca ttatcaagac tttacaggcc tatggtgctc caacaaactt 1680 tgttagcttt atagaagaac agtataaggg cggcggaacc tccctcaatg gggcaggatg 1740 gagttcagag gtgtttatac ccgcgcgggg cgttaagcaa ggtgaccctc tgtctccact 1800 attatttaat cttatcattg atagattact taggtcctac cccagagaga ttggtgccaa 1860 agtcggaaat accatgacaa gcgcggcagc gttcgcggat gatctggtgc tatttgcgga 1920 aactccgatg gggcaaacat tgttggatac cacgctaggc ttcctagcct ccgtgggact 1980 ctcccttaat gctgataagt gtttcactgt cagtataaag gggcaagcca agcagaagtg 2040 tactgtcgta gaacgacgga gcttttgtgt aggggagcgc gagtgtcctt cattgaagcg 2100 tactgaagag tggaagtatt taggtatccg gttcactgcg gatgggcggg ctcaatatag 2160 tccagcagac gacctcggtc cgaagctgtt aagattaaca agagcccctc tgaaaccaca 2220 acagaagtta tttgcacata ggactgtcct tatcccacaa ctctatcacc aactaacact 2280 tgggagtgtg atgataggcg tcctaggaaa atgtgacaga ttggtacggc aattcgtaag 2340 gagatggtta gatctcccac tggatgtacc agttgcgtac tttcacgccc cccacacttg 2400 tgggggtctc gggattccgt caattagatg gatagcaccg atgcttcgtc tgaagcgatt 2460 gagcaatatt aaatggcccc acctcgaaca atccgaggta gctagctctt tcattgacga 2520 cgaattgcaa agggctcgag atagattaaa ggcggaaaat gtgcagcggt gttcgcgtcc 2580 agagattgac tcgtatttcg caaataggtt gtacatgtct gttgatggtt gcggtctccg 2640 tgaagcaggt cattatggcc cgcaacatgg atgggtgagt cagcccacgc gcttgctaac 2700 aggaaaggaa tatttgcacg gtgtcaaatt gcggataaat gccctaccct cgaagtctcg 2760 tacgacgagg ggaaggcacg aattggagag acggtgtcgt gcaggatgtg atgctcccga 2820 gacaacaaac cacatcttgc aaaaatgcta tcgtacgcat gggaggcggg tagctagaca 2880 caacagcgta gtaaatgccg tcaagcgggg acttgaacgg aaaggctgcg ttgcccatgt 2940 cgaaccaagt ctgcaatgcg actcgggctt aaataaaccg gacctggtgg gaatccgaca 3000 gaatcacatt tatgtgatag acgttcaggt tgtgacagac ggacattcct tagaccaagc 3060 gcaccagcgc aaggtcgaaa ggtacgacag agctgacata agatcacaaa tgcggcgatt 3120 tttcggagtg acaggtgaaa tcgagtttca ttccgttaca ctcaactgga gaggaatctg 3180 gagtggtcag tcggtaaaac gattgattgc aaaagatctc ctcatcgctg aagataccaa 3240 actcatcagc gtcagagcag taaacggcgg agtgacgtct tttaaatatt tcatgtattg 3300 tgctgggtat actcgaagct agatgtacta acctctagtt tctctatact tttgcctgct 3360 accttggcat tacatctaaa aaggtacaaa catcgcattg gcaaaaagag gtggttttag 3420 tacataggcg ctgtgggact tcattgtccc gatgatgcag cgaatcgtgc atacgagatt 3480 gtccagtagt tggttgctcg tatctttaga agatttcctt cctcggcgat caaaanaaaa 3540 aaaaaaaaaa aaaa 3554 // ID SCAR_MI repbase; DNA; INV; 533 BP. XX AC AF387097; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Meloidogyne incognita sequence characterized amplified repeat DE reverse sequence. XX KW SCAR_MI; Dispersed repeat; KW sequence characterized amplified repeat. XX OS Meloidogyne incognita OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne; OC Meloidogyne incognita group. XX RN [1] RA Sui D.D., Lewis A.S., Fortnum A.B., Dong K. and Kluepfel A.D.; RT "Simplex PCR Identification and Multiplex PCR Diagnosis of RT Root-knot Nematode Species without Restriction Enzyme RT Digestion."; RL Unpublished. XX DR Genbank; AF387097; Positions 1 533. XX SQ Sequence 533 BP; 169 A; 98 C; 76 G; 176 T; 14 other; tnnattgntg acacgccagc tatttaggtg aattcactag tgattggggg ttaggaggat 60 tgagacagat atctctgcat tggtgcaatg accgattcca cttacacatt gtccagatta 120 ggatagaacc tcatttaatc ggttatcgag atatcctttt cggtggtcta tcaagtccag 180 tctctaataa tatatacaaa aacaaaataa aaaaatgttt taataaaacc cagaacgaat 240 taaaggtatt tccccagtcc gtgtataata cctcaattca tcatctgaat tgacaacaat 300 taaccttctc ctaactaaag aatctccacc aagtaaaatt agttgttgtt gaagagcttt 360 ttaaattttt ataattaaaa aaaaattttt ttttctcctt accactagac attttaaatg 420 ggtnggctct ttgctttgac ttccaaccaa cccatggact gtntntcgac nggaaanatt 480 tttccctttn ttnccttttt cntaaaacnt ttaaaaantt tacggcntca aaa 533 // ID Gypsy-6_BM-LTR repbase; DNA; INV; 226 BP. XX AC nscaf3093; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_BM_; KW Gypsy-6_BM-I; Gypsy-6_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 988-988 (2010). XX DR Genome; nscaf3093; Positions 987535 987760. XX SQ Sequence 226 BP; 70 A; 43 C; 35 G; 78 T; 0 other; tgttatatcc ctacatagca tacaataatg cttaccaaac atcaacatca gcatttattc 60 ccgatgatga tttctagcta gctttttaaa gcagttcggg tcagactttc atcggagtgg 120 gacgtatacc gagttttcct cactttccct atgaaaacta taaatttagt ctagggagta 180 ttaaaagtgt ttttttaaaa agaagttcct agtttcaatt ctaaca 226 // ID BEL-611_AA-LTR repbase; DNA; INV; 483 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-611_AA_; KW Pao_Bel_Ele16; BEL-611_AA-I; BEL-611_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-483 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 483 BP; 164 A; 92 C; 100 G; 127 T; 0 other; tgaaccctat cacgaaattt ggaacgagga gtcatctggt tgtgaagtgc atcaacaata 60 ggctgtcaaa tcgtcggtca tcgatgaaaa aacgcaaatc aaggttttag tatggcggcg 120 taccttcttc cgctgtacct tgtgtataaa agggaaacaa tgacaagggc aacttctctt 180 actgtggtgc gacagccaac tctacacgcc gacgtcataa gcaaatcaaa acaaacgaaa 240 aagtgaaatt tattgaaatt atagaaaata gtgaacttac attttgaata aagctatagt 300 aatttaagtg gaaaattaag aacaataaag tggagttaaa ttataataaa cttaccgttg 360 agttgtgtcc aagtgctacc tgaaaggaaa aggaagtgag taatacttac cgctgctgtc 420 ctggaaatcc atccgcttca ttgtactggc caatcatccc gctgctgttc gctgtgctaa 480 cca 483 // ID EVSAT1 repbase; DNA; INV; 106 BP. XX AC M31306; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE E.vuilleti satellite DNA. XX KW SAT; Satellite; Simple Repeat; EVSAT1; KW Satellite repetitive element. XX OS Eupelmus vuilleti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Eupelmidae; Eupelminae; Eupelmus. XX RN [1] RP 1-106 RA Bigot B.Y., Hamelin H.M. and Periquet G.; RT "Heterochromatin condensation and evolution of unique RT satellite-DNA families in two parasitic wasp species: Diadromus RT pulchellus and Eupelmus vuilleti (Hymenoptera)."; RL Mol. Biol. Evol 7(4), 351-364 (1990). XX DR GenBank; M31306; Positions 1 106. XX SQ Sequence 106 BP; 33 A; 28 C; 24 G; 21 T; 0 other; accgattgaa atttcataaa atgcccgaaa atccctgtag acgatgtcca ggtgtcagga 60 tgacccccag gtcgacgtaa caccatctac gaggaatgca ccaggt 106 // ID BEL-74_CQ-I repbase; DNA; INV; 6169 BP. XX AC AAWU01004636; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-74_CQ_; KW BEL-74_CQ-LTR; BEL-74_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 293-293 (2011). XX DR GenBank; AAWU01004636; Positions 11079 4911. XX CC Positions [5216-5800] - Integrase core CC 'CTATT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1373..6169 FT /product="BEL-74_CQ-I_1p" FT /translation="MFQTVMVRYPNESPAIKLYHLKNSLVGRAAGKIDQDV FT INNNDYESAWQLLEDTYEDERLIIDTHIDALLDLPKLTRENGDEMRKLVET FT CTKHVDALKNRELPVEGLSEMILVNLISKRLDRESRKLWETSLVRGDLPSF FT DDLIDFLKDRSRVLQRLPSNASTSQSQAAPRAPGQQQVVAKPRQTKLFVQT FT NKESCPCCSSSHPVFKCTEFRKLPVGERFEKVKKAGMCYNCLKPQHRADAC FT ASTQHCKTCGKPHHSLLHSERPENPKKHTEPDQPKNPVDAEKQEETPAAPE FT QRTVSCCTQTQAAQKQIFLSTAKIVVFGSGNATTTCRALLDCCSESNIITE FT RLARKLNMQMLEMKPPITICGLNGMKTTANKIVQTKVSSRVGDFTAVLDFI FT VTPSITELPTGKVETHTWPLPAGIELADPTFNVPDDVDVIIGAELFYDVLK FT KGRVKIGGDFPTLAETAFGWVVSGPARTKLPAGQRRVCQLATTFEDVNRTL FT SKFWELESGYFSSKMTVTERTVEKHFEETHSRNAEGRYVVRLPFNELKSQL FT GDSYENAKRRFGRLMVGLTKNPSKREAYTQFMKEYVALGHMKEVNHSPEEG FT YFLPHHAVYKEASSTTKVRVVFDASAATTTGVSLNDTQLVGPTVQSDLVTI FT MLRFCTHQIVLTADVPKMYRQVQVHPDDRKYQRILWLNDANEMATFELATV FT TYGCSSAPYLATKALTQLASDEADKFPKAVRVVKEDSYIDDFLTGGKTAAD FT VVEIYEQLKDMLSRGGFGVHKFCSNSAEVLKHIPAELQEKLVDFETSEIND FT TIKTLGLIWNPNYDYFTFNVPHLVLKGSRPTKKIVLSEIGKLFDPLGFLGP FT VVTKAKLMMQKLWELKLGWDTELPDEQMQRWLVFREQLVLVRNIKKRRCVI FT PADAKKLELLGYCDASKAAYGAVLYVKSELKHGTINIQPVCSKSRVAPLKP FT TTIPRLEACGMVVLAELTHKTLGALNVKFDRVRLFSDSMICLNWLKKSPAQ FT LNEFVCNRVATIVELTQNYEWGFVRSENNPADSLSRGLDPEELIDDELWWD FT CDPNQRQPSDTSEAEVPELADEDLPELRSAKVVLAVTVEKPCTNFNRLSNI FT NRLYRAWAYVWRFIEITRSKKRNPPSNPLTASELSRATTTIVKIIQQEAFT FT EEIKQLRTGQAKRNKLSSLAPFLDGDGLIRVGGRLKYSNIPYAGKHQILLP FT EKHPFVTLLVRHLHELNLHVGQNALVSMIRQQYWPIKVKNTVKRVTFDCVI FT CFKHRPPQVHQPMGELPSYRVTPAPVFASTGVDFAGPFTLRESGRKPKFYK FT AYVAVFVCMAVKAVHLELCTDLRTETFLAALQRFTSRRGLPSNLHSDNATT FT FVGAKNELAELRKLFENQVHKDKLADFCSSKGITWHFIPPRSPHFGGIWEA FT GVKAMKYLLKRVVGETRLTYEEMTTFLAQAEATMNSRPMCPLSDDPNDLQA FT LTPSHFLIGRPAHALPEPSYEQEKIGRLSRWQHVQYMREHFWKRWSADYLH FT TLQQRQKWKDGVLDIKVGALVLLREENIPPQQWKKGRITATHPGEDGVVRV FT VTVKTASSEYKRAVTRICLFPDVDSNDPTGGV" XX SQ Sequence 6169 BP; 1614 A; 1635 C; 1773 G; 1147 T; 0 other; ttttggtcct atcggaaccg gataaggaca tttcgcgatt cggaagtgac tcggttacgc 60 ggatcggtcg cgagtgcaag aaaagttctc ggatttgcat cacgtgcaaa agcgtgccgc 120 cattttgtac cgtcgcgtcg gtggtgtgca caaaagagac gtgtggaaga agtgcgaggc 180 ttctagaagc acgaaaagta aatctgcacc gtgtgcggaa attgtcccgg atcggcgcta 240 cgccaccgga cgcataaaaa gtgaagtgtt taagacgcac caagtgcgga agatcgccac 300 gcggtcatct accggattgc cgctacgcca ccggtctgga agaaaatagt gaaaagtgac 360 gcaccaagtg cgtaggtcgc caagcggtcg ttttctccca gattggcgcc gcgcctttgg 420 taggaaaaag tgaagaattg aagtgtcgca ccaagtgcgg tttgatcgcc acgcggtcga 480 gtgtcccgga ctggcgcttc gccaccggaa gtgaaaagag aaaaagctcc ctgcttgcga 540 gttcgtcgag cgaaagaaga gaggaaaagc tccgtgcttg tgagttcgtc gagcgaaaga 600 agagaggaaa agctccgtgc ttgcgagttc gtcgagcgaa agaaaaaaaa agtgtgaggt 660 tatacgtaac ctaaagtgag agtgcaaatg aagaaaaagc taaatgagtg aacaacaaaa 720 gaaagtaaaa gaagaaatca gtgctactcc aagagtaacg agaagtgtaa caaggaaaaa 780 tttgctagcg gcgttaacgg gatcgggcgc cgatcgtgag gacgctgtca cgacagttga 840 aaaagctgta cagtttccag agccgttgct acctgccgga agcgccgtgc tgagtattgc 900 cgaacaaaag gagttggatc ggcggaaaaa ggaacgggaa aaaatggagc gcaacttgga 960 gaacctgagc gatcgtgtgg acacgattaa gcagaagttg ttccgcatga aggaggcgct 1020 cgggacgatg gacaatgtgc actacctcaa cctccagttg caaacactgc agcgctgcaa 1080 cgaggaattc gacacgctgc acagcgaaat cgtcgctctc gtcaagaagg aagacaaagt 1140 taagtggaac caggagtacc tgaacttcga agccctacac ggtgagttgt acgtgaacgt 1200 gcagacaaag atcgccaatc tccagaaagc tgattcgaac cgggctttga cattgaacgc 1260 cagcgcgtca gagttcctcc cgagaacgca agtagtacaa aacgtccccc acctgaacgt 1320 gccgttgcct tccttcgacg gagccccaga gaattgatac gcattcaaat gcatgttcca 1380 aacagtgatg gttcggtacc ccaacgaatc gccagcgatt aaactgtacc acctgaaaaa 1440 ctctctggta gggcgcgctg cgggcaagat tgatcaagac gtgatcaata ataacgacta 1500 cgagtcggca tggcaactgt tggaagacac atacgaagac gagcgcctaa tcatcgatac 1560 ccacatcgat gcgttgctgg atcttccaaa gctgacacga gagaacggtg acgagatgcg 1620 aaagctggtg gagacttgca cgaagcatgt ggacgccctc aagaatcgtg aactgccggt 1680 ggagggcctg tcggagatga ttctcgtcaa cctgatcagc aagcgattgg atcgagagag 1740 ccgaaagctg tgggagacgt cgctagttcg tggggatctt ccgtctttcg atgacctaat 1800 cgacttcctc aaggatcgga gccgagttct ccaaaggttg ccgagcaacg ccagcacgag 1860 tcagtcacaa gcggcaccga gagcccctgg tcagcagcag gtagtggcga agcccaggca 1920 gacgaagctg ttcgtgcaaa ccaacaagga gtcttgtccg tgctgttcga gctcgcaccc 1980 cgtcttcaag tgtaccgagt tcaggaagct gcctgttggt gagcggtttg agaaggtgaa 2040 gaaggcaggt atgtgctaca actgtcttaa acctcaacat cgtgccgacg cctgcgcatc 2100 aactcagcac tgtaagacct gtggcaagcc gcaccacagc ctcctccaca gcgagcgccc 2160 cgaaaatccc aagaagcata ccgaaccaga ccagccgaag aacccagtag acgcagagaa 2220 gcaagaagaa acaccagctg ctcccgagca gcgcacagtg agttgctgca cgcaaacgca 2280 agcggcccag aagcagatct tcctgtctac tgcgaaaatc gtggtgtttg gttccggcaa 2340 tgcgactacg acctgtagag cgttgcttga ctgctgttcc gagtccaaca tcatcacgga 2400 gaggttggcc agaaagctga acatgcagat gctggagatg aaacccccga tcacgatctg 2460 cggactgaac gggatgaaga cgacggcgaa caagatcgtc cagacgaagg tgtcgtcgcg 2520 agtcggcgat ttcactgccg tcctcgactt catcgtaacg ccctcgatca ccgaattacc 2580 aacaggtaag gtagaaaccc atacttggcc cctccctgct ggcattgagc tagctgaccc 2640 cacattcaat gtccctgacg acgtcgacgt gataatcggc gctgagctgt tctacgacgt 2700 cctgaagaag gggcgtgtga agatcggcgg tgacttcccg acactagccg aaacagcatt 2760 cgggtgggtc gtcagcggtc ctgcgcggac caagctacca gctgggcaga gaagagtctg 2820 ccaactggcc accacgttcg aggacgtcaa ccgtactctc tccaagtttt gggagttgga 2880 gtcagggtac ttctccagca agatgaccgt gacggaacgt acggtggaga agcatttcga 2940 ggagacgcac agccgcaacg ccgaaggacg gtacgttgtc cgactaccgt tcaacgaact 3000 caagagtcaa ctgggcgact cgtacgagaa cgccaagcgt cgatttggga ggctgatggt 3060 tggccttacc aagaacccat ccaagcgaga agcctacacg cagttcatga aggagtacgt 3120 ggcgttgggc cacatgaagg aagtgaacca cagcccggag gagggctact tcctaccgca 3180 ccacgcggtc tacaaggagg ccagctcaac gacaaaggtt cgagtggtct tcgacgcctc 3240 ggcggcgacg acaacgggcg tgtccctcaa cgacacccag ctggtgggcc cgacggtgca 3300 gagcgacctg gtaacgatca tgctgcgctt ctgtacgcat cagatagtgc tcacagcgga 3360 cgtacctaag atgtaccgac aggtacaagt gcaccctgac gatcgaaagt accagcgaat 3420 tctgtggctc aacgacgcaa acgagatggc tacttttgaa ctagcaacag tcacctacgg 3480 ctgttcgagc gcgccgtacc tcgccacgaa ggcgttgacg cagctggcct cggacgaggc 3540 ggacaagttc ccgaaagcgg ttcgagtagt caaggaggat agctacatcg acgactttct 3600 caccggcgga aaaacggcgg ctgacgtcgt cgagatctac gagcagctga aggacatgct 3660 gagtcgaggt ggattcggtg tgcacaaatt ctgctccaac agcgccgaag tcctgaagca 3720 catcccggcc gagctccagg agaagctggt agacttcgaa acttccgaga tcaacgacac 3780 gatcaagacg cttgggctga tctggaatcc gaactacgac tacttcacct tcaacgttcc 3840 acacctggtg ctcaagggaa gtcggcccac aaagaagatc gtgctctcgg aaattggaaa 3900 gctctttgac ccacttggtt ttctgggacc ggtggtaacg aaggcgaaac tgatgatgca 3960 aaagctttgg gagctaaaac ttggttggga caccgagttg ccggacgaac agatgcagcg 4020 gtggttggtg tttcgcgaac aactcgtgct tgtgcggaat atcaagaaga ggagatgcgt 4080 catcccggct gacgccaaga agttggagct gcttggatac tgcgacgcct cgaaggcggc 4140 gtacggagcg gtcttgtacg tcaagagcga gctcaaacac ggcacgatca acattcaacc 4200 ggtgtgcagc aagtcacgcg tcgccccact gaaacccacg acgatccctc gactcgaagc 4260 atgcggaatg gtggtgctgg cagagctgac acacaagacg ctgggagccc tgaacgtgaa 4320 gttcgacaga gtgcgccttt tctccgactc catgatctgt ctgaactggc tgaaaaagtc 4380 cccggcgcaa ctgaacgagt tcgtgtgcaa ccgagtggcc acaatcgtcg aactgaccca 4440 gaactacgag tggggattcg tccgatcgga gaacaacccc gctgattcgt tgtcacgagg 4500 cctggacccc gaggaactga tcgacgacga gctttggtgg gactgcgacc caaaccagcg 4560 acagcccagt gacaccagcg aagccgaggt tcctgaactg gctgatgaag atctcccgga 4620 gctgcgcagt gcgaaggtcg tccttgctgt aaccgtggag aagccgtgca ccaactttaa 4680 tcgcctcagc aacatcaacc gactgtaccg cgcgtgggct tacgtgtggc ggtttattga 4740 gatcacccga tcgaagaaaa ggaatcctcc cagcaatccg ttgacggcga gtgagttgtc 4800 cagggcaacc acaaccatcg tcaagatcat ccagcaggag gcattcacgg aggagatcaa 4860 gcaactgcgt acgggacaag cgaagcggaa caagctgtcg agtcttgcgc ccttcctaga 4920 tggcgatggc ttgattcgag ttggcgggcg attgaagtac tcgaacattc cctacgccgg 4980 caagcaccaa atcctgttac ccgaaaagca cccgttcgtc actctactgg ttcgccacct 5040 gcacgaactg aacctgcacg taggacaaaa cgcactagta tcgatgatcc gccagcagta 5100 ctggccgatc aaagtgaaga acacggtcaa gcgagtcacc ttcgactgtg tcatctgctt 5160 caagcacaga cctccacaag tacaccaacc gatgggtgag ctgccaagct accgtgtgac 5220 acccgcgcca gtgtttgctt cgaccggtgt ggattttgct ggacccttca cactgcgaga 5280 gagtggcaga aagccgaagt tctacaaagc ctacgtagca gtgtttgtgt gtatggcggt 5340 caaggcggtg catttggagc tgtgcacgga tctgcgtacg gagacgttcc tggcagcact 5400 gcagaggttc accagtcgac gaggcctccc atcgaacctg cactccgaca atgccacgac 5460 attcgtcggt gccaaaaacg agttggccga actgcggaaa ctgttcgaga accaggtgca 5520 caaggacaag ctggccgact tctgcagttc gaaaggcatc acctggcact tcattccacc 5580 ccgtagccct cactttggcg ggatatggga ggcaggagtc aaagcgatga agtacctgct 5640 caagcgcgtc gtgggtgaaa cacgactcac ctacgaagag atgacgacat tcctggccca 5700 ggctgaagca acaatgaact ctcgtccgat gtgccccctt tcggacgacc caaacgattt 5760 gcaggctttg acgccctccc acttcctgat cgggcgaccg gctcacgccc taccagagcc 5820 ctcgtacgag caggagaaga tcggcaggtt gtccaggtgg cagcacgtgc agtacatgag 5880 agagcacttt tggaagaggt ggtctgcgga ctatctccac acgttgcagc agcggcagaa 5940 atggaaggac ggcgtgctgg acatcaaggt tggagccttg gtgttactgc gagaagagaa 6000 cattccaccc cagcagtgga agaagggccg catcacagcc acacatcccg gcgaggacgg 6060 tgtggtgcgt gtggtcaccg tgaagacggc gagcagcgag tacaagcggg cagtgaccag 6120 gatctgcctg tttccggatg ttgactctaa cgacccaacg gggggagta 6169 // ID Gypsy-14_AA-I repbase; DNA; INV; 5132 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_AA_; KW Gypsy-14_AA-LTR; Gypsy-14_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5132 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 997-997 (2011). XX DR [2] (Consensus) XX CC Positions [4094-4555] - Integrase core CC 'AATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 926..2611 FT /product="Gypsy-14_AA-I_1p" FT /translation="MLGYFTPYVAIFNFKFVFSLEGLTNQQVVYLFVLDSS FT AMPPFKIGVDPRNDWMKWRKAFDRFLRANKIVDDEEKFDLLLVIGGIELQE FT YYDKITKFEVQYFLPSGESVTLQYDSAVTSLEKYFAPQLNKRFERHLFRAM FT KQDDNEPFNEFVFRLQDQAKRSDFIDVDDMVIDQVVEGCKSTELRKKLLTD FT DLTLNDVMTLGKTIEEVQKQSKQYERPNLSTYEKPLVQRIVEKRPDKYPNV FT ESGRRCYNCNRPGHLAKETERCAARNATCHSCGSKGHFKVCCRKRKHGDSQ FT QRSVPAPKRSMVQAIHDEKTPTKGVFTVNDGIHLNEVLKLEVGGVIMNMLV FT DSGSPANIMSSKSYAWLKQQGANMLNERRPRNNETNLTPFASDQKIFFSDA FT FETEIKIPGDESGVWTHVLVAPDGQTNILSKSTAFALGVLRIGYKINQISN FT STELAEEFPKVPNVLLKIQVDEKVSPVVQVARRLPMSMEADVEEAIRELLE FT KKIIERAEGSLTWVSPLVPVRKADGKIRLCVDMRAANQAVKRENYPMPNID FT AAMMSITKVGTDFCE" FT CDS 2774..5038 FT /product="Gypsy-14_AA-I_2p" FT /translation="MFGIKSAPELFQREMENLFRGMKGVIVYMDDVLVHGA FT TAEDHDATLARVLQILRDRNMRINEQKSVFSVTEVEFLGYCVADKGIRPTD FT DRVKAILGLQAPTSVSELRSLLGLINFVGRFVPNLSALTFHMRQLLIKDSP FT FEWNKVHEEELGKVKSILGNVQSLGYFDPADDTQLVTDASPYGLGAILIQI FT KNGIPRTISCISRSLAVHEKKYCQTEKECLAIIWAMEKLFVYLYGIKFTLI FT TDCKPLEYLFNRVQSKPSARIERWILRLQSYDFVVRYEPGKNNIADPLSRM FT SQGLEESIGDIDTVAWLAEEIRPSALSIEEIETETKRDDKLQAVKDAIFSN FT NWDAAPVEFRTTTIKDDLTTYGELILRGDRIVIPKQLQDKVVQLAHSGHQG FT CTAMKAQLRAKVWFPSMDKAVESFVRGCKPCKMTGLPDSPNPILRRVPSEP FT WQDVAIDFKEGLSGGLSLLVVVCYTTRFIQVEPMKPATSQRVIAALLRMFS FT VLGIPRSITADNGPQFRAIEFSRFCVCYGIHLNLSTPYWPEQNGAVERQMR FT NIGKRLKISEIQGTDWQTDLYEYITMYHATPQETTGVSPGKMMLGREIRTR FT IPSIRTPYSLQWEEARDRDMGKKEYHKQRADAQRHAKEHTLVRGDTVLMRN FT LDPGALDSTFRGEEFDVINVNNNAVQVRSTETDRVYLRNSSHLKKLEKDRE FT MKCDQSQAKDTEDESCTSVETAETTSNHAEEENIANERFSQRTRKRPSKFR FT DYVV" XX SQ Sequence 5132 BP; 1605 A; 920 C; 1261 G; 1346 T; 0 other; atggctacga gaaaaatcgg taagtcgaaa actcggtttt ggaaaaacgg ggaatttctg 60 cgaaatattg gaggaatatt tttatttgta aaaggaggat gatattttca acaatgggag 120 ggaggaacag tgaaaacgtt tatgggaaaa caaatggagt gatgaaatgg agaataaaca 180 taagaatcac atgtcgccat tttgaagggc agagggctag atccggatgt agtgcaagca 240 ttgaaaaagt taaaacgaaa gattgcggtt atgttcagag aaatggtgtg taatggatta 300 aagagaagtg acttcatgag cacatttcgg aactaaatcg aggcaaatac cgaaggctga 360 atacaaacga cgggaaacac agtgtgtgta atagtttgta gcttagtggt ggagctgttt 420 catgcaagaa gtgctacgtg caaagttatg cttcgtgcaa aataatgcta catgaaatcg 480 ttagagtcac ttcgtgctgc gtgcaagatt atgcttcgtg caagattatg tttcgtgtaa 540 ttttcatgac aaaaggtgct acgtgcaaag gaatgcttcg tgcaaagtta tgcttcgtgc 600 aaagtaatgg tacatgcaat tgccagaatg atttcgtgct gcgtgcaaga ttatgcttcg 660 tgcaagatta tgcttcatgt aattttcatg gcaagaagtg ctacgtgcaa agttacccta 720 gatgcaaagg tatgcttcgt gcaaagttat gcttcgtgca aagtaatgtt acgttaaatt 780 gtccgatttg ttcgtgctgc gtgcaagatt atgctttgta catgagatgc tttgtgcaag 840 tttatgcttc gagtaattgt ggaacactac atactgtacc caaaattatg aatgctacgt 900 gcaaagttac gtttagtgcc gttgaatgct gggctacttt actccttacg ttgctatatt 960 taattttaaa ttcgttttta gtttggaagg cttaactaac caacaagttg tttatctatt 1020 tgttttagat tcatctgcga tgcctccgtt caaaatcgga gtagatccaa ggaacgattg 1080 gatgaaatgg cgaaaagcct ttgatcgttt tttacgagct aataaaattg ttgatgatga 1140 agagaagttt gatttactac tggttatcgg tggtatagaa ctgcaggaat attacgacaa 1200 gattaccaaa tttgaagtgc aatatttcct tccatccggt gaatcagtta ctcttcagta 1260 tgactcggca gtgacttcac tggagaagta ttttgctccg caactgaata aaagatttga 1320 acgacatttg tttcgggcaa tgaagcagga tgacaatgag ccattcaatg aatttgtgtt 1380 tagattgcaa gatcaggcaa aacgtagcga ttttatcgac gtggatgata tggttattga 1440 tcaagttgtc gaaggttgca agtcaaccga attacgcaag aaactcttga ccgatgattt 1500 gacactgaat gatgttatga cactcgggaa aacaattgag gaagttcaga agcaatcaaa 1560 acaatacgaa aggcctaatt tgtcaacata tgagaaacca ttggtgcaac gtatagtgga 1620 aaagcgcccg gataagtatc cgaatgtaga atctggaagg cgatgctaca attgcaatcg 1680 tcctggtcac ttggccaagg aaaccgagag atgcgcggca aggaatgcaa cttgccatag 1740 ttgtgggagt aaagggcact tcaaagtgtg ttgccggaaa aggaaacatg gagactcaca 1800 acaacgatcg gtaccggcac cgaaacgttc gatggtacaa gctattcacg acgagaaaac 1860 acccacaaaa ggcgtattca cagtgaatga tggaatccac ttgaacgagg tgttgaagtt 1920 ggaagtaggt ggagttatta tgaacatgtt ggtagactcg ggttcgcctg ccaatatcat 1980 gagcagcaag agttacgctt ggttgaaaca gcagggagcg aacatgctca acgaacgtcg 2040 tcctaggaat aatgagacaa acttgacacc tttcgcttcg gaccaaaaga tattcttcag 2100 tgatgcgttc gaaacagaaa tcaaaattcc tggtgatgaa tcgggagttt ggacgcacgt 2160 attggtggct cccgatgggc agacgaacat attaagcaaa agcacggcgt ttgcattagg 2220 cgttttgagg attggttata aaattaacca aatatccaat tccaccgagt tagcagagga 2280 gtttcctaaa gttccaaatg ttctgctgaa aattcaagtc gatgaaaaag tatcaccagt 2340 tgttcaagtg gccaggcgac tacctatgtc tatggaagcg gacgtcgaag aagcaattcg 2400 tgaattactg gagaagaaga taatagaacg tgctgaagga tcgttgacct gggtctcacc 2460 tttggtacct gttcgaaagg cggatggaaa aatacggcta tgtgtcgaca tgcgagctgc 2520 aaaccaggcg gttaaaaggg aaaattaccc aatgcccaac atcgatgcag cgatgatgtc 2580 aatcacaaag gtaggtacag atttttgtga gtaaacaata aagttttgat tggtttgttc 2640 ataaataaaa cagattgcaa gactgtcaaa gattgactta gaggcagcat actaccactt 2700 cgagttggat gaaagcagcc gtgtaataac tacgtttgta gctcgtagtg gggtttatcg 2760 attccgcaga ttaatgttcg gcatcaaatc cgcgccggaa ttgttccaaa gagagatgga 2820 gaacttattc cgaggtatga aaggggtaat tgtatacatg gacgatgtgt tagtccatgg 2880 agctacagca gaggatcacg atgctacgct agcacgtgta ctgcaaattc tgagggatag 2940 aaatatgcga atcaacgagc aaaagtcggt gttctcggta acagaggttg aatttctggg 3000 atattgcgtg gcagataagg gaattcgccc caccgacgat agagtcaaag cgattcttgg 3060 cttacaagct ccaacatcag ttagtgagct aagatcatta ctgggattga taaatttcgt 3120 tggacgattt gttccgaatc tctccgcatt gacgtttcac atgagacagc tccttattaa 3180 agatagtcct tttgaatgga acaaagttca tgaagaggaa ctcgggaagg tcaaatccat 3240 cttgggaaat gttcagtcac ttggttattt tgatccggca gatgataccc aacttgtaac 3300 agacgctagt ccgtacggac tgggtgccat tttgattcaa attaagaatg gaatacccag 3360 aacaatatct tgcatctcca ggagtttggc agtgcacgag aaaaagtatt gccaaaccga 3420 gaaagagtgt ttggcaataa tttgggcaat ggaaaaactc tttgtatatc tgtatggtat 3480 caaatttacg ttgatcaccg attgcaaacc tctcgagtat ctgtttaaca gagtacaatc 3540 aaagccatcc gctcgcatag aacgatggat tcttagacta caaagctatg actttgtagt 3600 ccgatatgaa ccgggtaaga acaacatcgc ggatcccttg tcgagaatgt cgcaaggtct 3660 cgaggaatca atcggggata ttgatactgt cgcgtggctt gcagaggaaa taaggccctc 3720 ggcactttca attgaggaga tcgaaacaga aaccaaacgt gatgataaac tccaagccgt 3780 taaggatgct attttctcaa ataactggga tgcggctcca gtggagttta ggactacaac 3840 gataaaagac gatcttacaa catatggtga gcttattcta cgtggagacc gtatcgttat 3900 tccgaaacaa ttacaggata aagtcgttca acttgctcat tccggccacc aaggttgtac 3960 cgctatgaaa gcacaattgc gagcaaaagt gtggtttccg tcaatggata aagcagtgga 4020 gagttttgtg cgtggttgca aaccttgtaa aatgacaggg cttccggaca gtccaaatcc 4080 catattgcga cgtgttccat cggaaccatg gcaagatgtt gcgattgatt tcaaggaagg 4140 tttgtcaggc ggcctctcac tactagttgt agtatgctat acgactaggt tcattcaggt 4200 ggagcccatg aaaccagcca cgtcccaacg ggtaatagca gcattactgc gaatgttcag 4260 tgttttgggt attccacgtt caataacagc agacaacggc ccacaattta gggcaatcga 4320 attttctcgt ttttgtgttt gttatggcat acacctaaac ttatcaacgc catattggcc 4380 cgaacagaat ggtgcagttg agcgtcagat gcgtaacatc gggaaaagat tgaaaattag 4440 tgaaatccag ggcacagact ggcaaactga tctgtatgaa tatattacga tgtatcatgc 4500 aacgccccaa gagactactg gcgtatcacc aggaaaaatg atgctgggtc gcgaaattag 4560 aacacgcatt ccatcaattc gtacgccata cagtttgcaa tgggaggagg ctcgggacag 4620 agacatgggg aagaaggagt atcacaaaca aagagctgat gcacaacgtc atgcaaaaga 4680 acatacgttg gtgagaggag atactgttct aatgcgtaac ttggatccgg gagcactgga 4740 ttccacgttc agaggagaag aattcgatgt tataaacgtc aataacaacg cagttcaagt 4800 gcgatcgact gagacggata gagtctactt gcggaacagc tctcatttga agaaactgga 4860 gaaagatcga gaaatgaaat gtgatcaatc gcaagcgaag gatactgagg atgagagttg 4920 tacatcagtt gagactgcgg aaacgacatc aaaccatgct gaggaggaaa acatcgccaa 4980 tgagcgattc tctcaacgta cgcgtaaacg accaagtaag tttcgtgact atgttgttta 5040 agatggagac gtattcatat gattactgat tgttggcttg aaataaataa aaatgtacca 5100 aacagcagtg ttgtacacat tttctggaga ga 5132 // ID hATm-28_HM repbase; DNA; INV; 4254 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-28_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4254 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1922-1922 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1694..2998 FT /product="hATm-28_HM_1p" FT /translation="MATLSDVSPNVLQKVTSAILTEAGVXLTDIHCSKSTA FT FRKMKLANENISTLAKEDVKNAIEASPYPCVIHFDGKTLYEINAGKKFKID FT RLAVLVNIEGETHLLGVPPLTSSSGEDQYTSIMNLLKEYKLESKVGGLCFD FT TTASNTGIKKGSVIKISNNLNKYLLKLACRHHITELRMVHFWKHTTNELXT FT GPENLLFQKFKSLFEHPNFSYAPSDLCKFNWNEVNGSISEKAAEDSMNFCK FT AYIERKGIAGEDKREDRRELAELVVTYLSPSVYKIRKTGAIHHARFLAKAI FT YYLKMQLLYSQLDFVRENNDLKMEIKLIAEFVACFYAKWYLESSNVSKAPN FT LDMKAIHQMHQFRNICLNPKAVDAVLDSLYKHTWYLDSTMIPLALLDEDLP FT LNEKQKIAAAILSFKMPKAEHFKNENKEKKRHQKRDRKKC*" XX SQ Sequence 4254 BP; 1615 A; 587 C; 696 G; 1350 T; 6 other; ggggtgttca gaaaaaaaaa ttttgaattt gaaaaacaaa accgtctaga tttgaagcaa 60 atatgtttaa acttactcac aaaaaatttc agcaccattg aaccactttt tccttgccct 120 caaagtacat gtttggaaaa aaaggactca aaaatgaccg aaaactgctt actttaatgt 180 gtcttgaggg caaggctagc atgaatattt ttaatatatt tttattttag ggttaagaaa 240 aatatctgta tttttgatta aaaaagtatg gtttttttct ttcttcacta aatacagtat 300 aatttgcatt ttaaatcggc taaaaaacgt ttatttcgyg gtgcggcgtt gttttaatat 360 ccttttcaat aattaaattc aaacattgtt attcaggttt tagtatagtt tgcaaatgta 420 tatattcaaa acgagaatag aattactcat tttatgtatg tttaatacct ttaccaaaca 480 aaagaataag ttctttactt ataactacaa atttaataat tttataagta gcgatatgga 540 tgcacaaatt aagatataat gtgttgtaaa tatcatgtaa aattaaacta tataaaataa 600 aatatagcat ttataatatt tataaatata aaattcttgg ttatttaggt taataatgat 660 gataaataca aaacgagcga taaatatgta taaaagaaac gaagagtaca agtttgaaaa 720 gaaatttaaa ttaagatctt ataaaaaaaa gaaattgttt attattaaag cagctaaacc 780 agaaatttca ggtaatttct agtttgtaag aagcttatgt atattatata tttaagagaa 840 agagagagag agagagtagt ctaaattatt aaaaaaaaaa taataaacaa aaaaaaaaca 900 tgcatttaga aattcaactt ccaacaggca tggatattct acaacatcta gcatacctgc 960 aatctaaact gcctaaaacc accaaaaaaa atartttaat tttttgctcg cttcgtaaaa 1020 attttgaaag atgtgaagag ttagattgta actgtgtttt agctaaggtt aagaaaccct 1080 gggtaaaagc agggtttgaa gttgttagtg acactcaatt attcaaaaat cttattaaac 1140 ttgacaagca gtattctaat ttgaaaaaaa aatgaaaaaa gaggctctcc taaagattta 1200 acaaatcaga aaaactttga gaattactta aaaaaagttt tttggatagg caaaccagat 1260 ttgaagaaca ttataagaag cgacaaaaaa agatctgaca aggataagct cgaagacttg 1320 atgtttttgg aagatcagga aggagaaaga aagtttattc ttgggacaga ggataagaaa 1380 tttagtagaa aggtaaaact gtaaaatacg attaaaaata tttagtgata tttttcttat 1440 aaaacatttt gttataatta atattatttt atgcttgtag actgcagaga gtaacagaaa 1500 gaaaaataga acaaaggatg tgcgatcgac gaaatctaat tatgaaattg aactcaattc 1560 tgatttgagt tctgattcat ccgatgattc tgattttgag cctagtaact attatcaaaa 1620 agaaccatct caatctaaca gtacagtaac tgcagaaatt ccaatagatg tatatgccgg 1680 gaacgttgcc cttatggcta cgttaagtga tgtttcgcca aatgttttgc aaaaagtgac 1740 aagtgccatt ttaactgaag ctggtgttga wcttacagat atacattgca gtaaatcaac 1800 agcctttaga aagatgaaat tagcaaatga aaatatatca actttagcaa aagaggatgt 1860 aaaaaatgca atagaagctt ctccttatcc atgtgttata cattttgatg gaaaaactct 1920 ttatgaaatt aatgcaggaa aaaagttcaa gattgacagg ctggcagttc ttgtgaacat 1980 tgagggtgag acgcacctac taggtgttcc tcctttaact tcctcttctg gtgaggatca 2040 atatacaagt attatgaacc tactaaagga atataaactt gaatcaaaag taggaggtct 2100 ttgttttgat acaactgcaa gcaataccgg gatcaaaaaa ggttcagtca taaagatatc 2160 caacaattta aataaatatc ttttaaaatt agcatgtagg catcacataa cagaattaag 2220 aatggtgcat ttttggaaac atacaacaaa cgagttgast acaggaccag aaaacttatt 2280 gtttcaaaaa tttaagagcc tatttgaaca tcctaatttt agttatgctc caagtgatct 2340 gtgcaagttt aactggaatg aggtkaatgg tagtataagt gaaaaggctg ctgaagattc 2400 tatgaacttt tgcaaagcat atatagaaag aaaggggata gcgggagaag ataaacgtga 2460 agacagaaga gagctggctg aattagttgt gacttatcta tcaccttctg tttataaaat 2520 caggaaaact ggagctatcc atcatgcccg ttttcttgct aaagcaattt actatctaaa 2580 aatgcagctt ctgtattctc aacttgactt tgttcgagag aacaacgatc tcaaaatgga 2640 aatcaagctt atcgcagagt ttgtagcctg tttttatgcc aagtggtatt tagaatcgag 2700 caatgtgtca aaggccccca acttagatat gaaagctatc catcagatgc atcaattcag 2760 aaatatttgt ttgaatccaa aagcagttga tgctgtgctt gattcgcttt acaagcacac 2820 ttggtacttg gattcaacta tgattccact agctctattg gatgaggatt tgcctttgaa 2880 tgaaaaacaa aaaattgcag ctgcaattct ttcatttaaa atgcctaagg ctgaacactt 2940 taaaaatgaa aacaaagaaa aaaaaagaca tcagaaaaga gatagaaaaa aatgctagta 3000 ctgagaaaga tccaccaagc ttagcccctt tggtagatga gttttcttac cttatgttta 3060 gtttttgcgg tttaacagaa gaaagaataa aagactggct ttcacttcca ccgcaatact 3120 ggcatacaca gtcttcgttt aaaatttttg aaaattatgc caatagcctt attgttatca 3180 atgaccattc ggaaagagcc gtcggcatga tgcagcaata tgttcatcgt tacaacaacg 3240 aagaggaaaa acagaacaga ttgatatcag ttgacagagt tcgctctgct ttcaggattt 3300 ctgggaaaag ctccaccagt cttaccaaaa aaaggttatc tgatagtctt tcctccttag 3360 cgaagaaaca aaacacaaac aaacaaaaga aggattaatt agataaataa aagaacttct 3420 acaaaatctt tttaaatatt taaaaataat ttttttttga tatgataagt aatattgaat 3480 cgtgatatgc aaaaaccaca tatgttacaa aatcaacatt tttggtaaat ttgaaaaaac 3540 ctttaattgt taatgatttg tttttgttgg ccacttgaaa accgcgtatt tactaataat 3600 gtaaggtact ttgacacatg tttgattttt tcagaatatc gttgatactg cacataggaa 3660 catttttcgc aataaaatct caaaattttg caaaaaagta aaaaaatgcg aagcgtggtt 3720 tttgcagatm acgattaaat tgttgtaata acttaattta ttttattctg attaaaatat 3780 tataaataat aaacatacat aaaatgagta gttctattct tgttttgaat ataaacattt 3840 gcaagctata ctataaacct gaataacaat gtttgaattt aattattgaa aaggttatta 3900 aaacaacgcc gcgccgcgaa ataagcgttt tttagctgat ttaaaatgca aattatactg 3960 tattttgtga agaaagaaaa aaaccatact tttttaatca aaaatactga tatttttctt 4020 aaccctaaaa taaaaattta ttaaaaatat tcatgctagc cttgccctca agacacttta 4080 aagtaagcag ttttcggtca tttttgagtc ctttttttcc aaacatgtgc tttgagggca 4140 aggaaaaagt ggttcaatgg tgctgaaatt ttttgtaagt aagtttaaac atatttgctt 4200 caaatctaga cggttttgtt tttcaaattc taaaattttt tttttgaaca cccc 4254 // ID BEL-206_AA-LTR repbase; DNA; INV; 437 BP. XX AC AAGE02024196; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-206_AA_; KW BEL-206_AA-I; BEL-206_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-437 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024196; Positions 128382 127946. XX SQ Sequence 437 BP; 124 A; 83 C; 103 G; 127 T; 0 other; tggtcgatgc acgaagcgat gcctgctcca tctaaacgtg gttttcgtcg ttcatcgata 60 agggtatgcg cattagacga gatcgacata gtgtttaacg gtgcggtcaa ggataagcca 120 tattgagcaa catggaggcg atgtaatttc cgtctactgt gactataaaa ggcctcacag 180 acgtcacgtt gttctctttt tttgacatcg aacgccagca gaagaaacgt caatttataa 240 tgttattgag tgaaatgaaa ttgaactgag taaatataag ttaagaagtt gaattaagaa 300 agaagtgtag ttacttacca atcgtagtgg ttttaattag tgctgtcgcc gccggtgtta 360 ctctgtgatc cgcaacctgc aaagaaagga gtacttacct gaagaatttc tccgctgccc 420 tcgctgtgtt attgcca 437 // ID BEL-1_RP-I repbase; DNA; INV; 6975 BP. XX AC ACPB02040023; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_RP_; KW BEL-1_RP-LTR; BEL-1_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-6975 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02040023; Positions 7048 14022. XX CC Positions [6010-6597] - Integrase core CC 'ACGAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 633..2198 FT /product="BEL-1_RP-I_2p" FT /translation="MEESLEKVTPPVSPVPSVSPSLEETPRASTQPEASES FT LNAIRCRHLVARLDQWARSVQLLDLDSPDRVARCQQFSALFQSLSQTVDRE FT AWEWLRLLAPEDSGPLALALSASHAQAELAHLSIVLPALATPLMMENVPTS FT SPPSPVPDDSVSELARRQLQELVIPRIPVFGGEVLSYREFKRTFQSEVHKL FT ALSPAKRLQALRSCLRGPPLSIIQNATLTEAGYLYAWETLDGRYEGRLRIA FT SAMLEAFLQFPAAGSSMEDIIAWLSGRQILLDHCDIPEPLDFLLCHVSLSV FT LSPGLQEEFRKKCLGSFDELPRSGQVLDFLKQQEDRFSPKLVPLQKMSHSS FT AQSGPSQASAPFTPSSGPAPPVDEPQVVEVPSPNQFRPYRPKSVSFVPRTP FT RRPSPTPSPAPVRRSTPSTSCAPASSVSSIRCYFCGGPHRLSRCDRYLQLS FT GKDRWSFVAEERLCVNCLSKYHQVSRCPCQVGCGKCHGEHHTTLHPPPQSS FT SVCQEARSASHLPSLMSLRTCPASRI" FT CDS 2542..6939 FT /product="BEL-1_RP-I_1p" FT /translation="MSVSPTAMVYLLGGDSQWYRARALLDSGSEINLMSAT FT FAKECKLSYSPMSGAIRGIGGNIATQVVGFVSTNLKQCPPFSQLKVPFRVV FT TKTVPDAPSSYFPVGDWVHLANKYLADPTFNEPSPVDVVFSASVFDALDLS FT QMIRSQNGSPNARLTTLGWVMSGPLKRLPVKRVRFHPSVQNHMSNSKLYRT FT YYKILFKHKYGHLIPSKVPNNESNPEDKTPTSTPPGEYESEPVAHRNNPEL FT IMEPEQSVTPSRSSSSAQILSTLEEEVTPQPCARPTLDIPLPPPKPQRLNR FT TRRPLQILTSPDSESFPPTTISAEASSSENSTLPVKSSLCNPVSVVSIQPT FT VLRFSSVKSSQPSQHPPDQVLSRLMEKMFALEEVSEVSPLTPEELKAEECF FT SKSHSRLPDGRFSVALPLKTSPCCLGESRKVALKRLLQTESRHAKSPHLRP FT PYVEFMRDYLESKHMELAPPLKSGQVTYYIPHHAVHRPDDPPEKIRVVFNA FT SQKSSSGVSLNELLLPGPRLQLDIWKVVTRFRVDRWVFTADIRQMFRQILH FT PPEDRDLLRILWRFDTSQPVQEYRLCTVTYGTTSAPYLACRVLQELALTSQ FT IPFPKQQRSLRDRTSVDDVSGGGPTFEAALESRDQLIRLLSQAQIELRKWA FT ANHPRLLEKIPLDHLLPAMSSVCLENDEESQLKILGLCWNPSQDYFHFLIR FT SVPNIQTKRQLASQIAKVFDPLGWLIPITVFARAIFRVVCQASFSWDDPLP FT SSIIQEWSQFAVTLPELSNIRIPRFLSLSTPKFYLTGFADASERAYAAVVY FT LVSLSEETVISLIMSKARMAPMKPVTLPRLELCAAHLLALTLHKVSLLFPQ FT IPSPNIFAFSDSTIALAWITATPPPHWKVFVGNRVAAILEKVPASQWFHVP FT SSQNPADCASRGLKPQELVNHPLWWEGPPWLKLNPESWPLRQVNADLSDPM FT VGAEIRSQALNVSIAVSSPEDLESKFSNLHTLINVVAWCLRFAHNARNSAH FT HISGPLKVKERGEALRALIVRAQRHSFLEEIEDAQSSRPKLRLVRILGLFI FT HTDGVLRVGGRLRQANIPESNKHPALLPQQHPLTALVIQDAHVSNLHVGPT FT ATHAFLRNRYWITNGKNVVRRQLSSCNKCFSIKPSPFNPIMGQLPEGRASA FT SPPFTTTGIDYAGPFNVKIASLRSAKVLQCYLAIFICFSTKAIHLELVTDL FT TTQAFLAALNRFIARRGMPSHIWSDNGTNFVGATRRLKEVSALLNDSQCQE FT SILRESGRRGVEWHFIPPRAPHFGGLWEAAVKSTKRLLKVTLGLQEPTMEC FT LLTIICQVEAILNSRPLTTLSSSPDDLQVLTPGHFLIMRPMTALPTSDSLN FT QRTALKSKWAMAQFTVSQFWKRWHREYLLDQQAMIKWHSPSFPAAEGQIVT FT IMDDNLPPLQWSLGRITKLHYGRDDVARVAEVKTASGVISRPVRKLCPLPN FT Q" XX SQ Sequence 6975 BP; 1647 A; 1930 C; 1439 G; 1959 T; 0 other; tggtgccgaa acccgggaac actcaagata cctattaaat aaataagtaa ataaataaat 60 aattacataa ttttttgtgt tctgtttaga cttatagtaa ttttcatttg tttatatata 120 tatatgaata tacttgtagc ttagaaatac acagtcatat tttaatttcg ctttaaaatt 180 attttttgga acatcgttca tattttatat tgttcatatt ttttcattgt gatctagcct 240 gaagtcaagt tattcttgga atttacaaat gtattgtttt agtcactcat aggttacgtc 300 ttgcctttgc cttgtggctc cgtatttttt gtgttaatta taatgtcaaa ttaaatattt 360 cgtttgcact ataaaccttg tttattctcg tattctgtat ttactattca acaattattt 420 atatatataa tttttttttg tccgtgttgt tctgtcaagt tcttgttcaa cttgttttat 480 aattatacta atttgttttt tgttaaattt atactatttt aaatttatac tgttttacaa 540 atttatacta tcttttgtat tgtgtattta tattttgtat tttgtatttt ttttgtcgac 600 aagagttatt agttgtgtcc aattttagga tcatggagga gtccctggag aaagtcaccc 660 cccctgtttc gccagtcccg tcggtttccc cctcgttgga ggaaactccc cgagcttcca 720 cccaacctga ggccagcgag agtctcaatg ctatcagatg tcggcacctc gttgccaggt 780 tggaccagtg ggctcggagt gttcagttgt tggacctgga ttccccagac agagttgcaa 840 ggtgccaaca gttctccgct ctctttcaat ccttgtccca gactgtggac agggaagcct 900 gggagtggct tcgactgttg gcgccagagg atagtggtcc cttagccctg gcactgtctg 960 ccagtcatgc tcaggcagag ctcgcccatt tgtctatcgt cctccctgcc ttggctacac 1020 ccctcatgat ggaaaatgtg ccgacttcct caccgcccag tcccgttccc gatgattctg 1080 tgtcagagtt ggcgcggaga cagcttcagg aactggtcat ccctagaatt cccgtgtttg 1140 gaggggaagt attatcatat agagagttca aacggacctt ccagtcggag gtccataaat 1200 tggctctctc acctgccaag cgacttcagg cgttgcggtc ttgcctgagg ggtccaccct 1260 tgtcgatcat tcagaatgcc accctgacgg aggccgggta cctctacgct tgggagacat 1320 tagacggccg gtatgagggg aggctaagaa ttgctagtgc catgctggaa gccttccttc 1380 aattcccggc cgctggctcc agcatggaag atatcatcgc ttggctgtcg gggcgacaga 1440 ttctcctcga ccactgcgac atccctgaac ccctggactt cttgttgtgt catgtctccc 1500 tgtctgtcct gtcccctggc cttcaggagg agttccggaa gaaatgcctt ggttcctttg 1560 acgaattgcc ccgcagtggt caagtattgg actttctaaa gcagcaggaa gacaggtttt 1620 cccctaaact ggtcccctta cagaaaatgt cgcattcatc tgctcagtct ggtcctagcc 1680 aggcttccgc cccctttact ccttcatccg gtcctgcccc tccggtcgac gaaccacagg 1740 tggtggaggt cccttcgcca aaccagttta gaccgtatcg ccccaagtca gtcagttttg 1800 tcccccgcac accgaggagg ccttcaccta ccccttcccc ggctccagtt cggcgctcca 1860 cgccttctac gtcctgcgcc cccgcctcct cagtctcttc aatccgatgc tacttttgtg 1920 ggggaccaca tcggttgtcc cgctgcgaca gatatttgca attgtctggc aaggataggt 1980 ggagttttgt cgcggaggaa cgcctgtgtg ttaattgcct gtctaaatat catcaagtat 2040 cccgctgtcc ctgccaggtg gggtgtggca aatgtcatgg cgagcaccac accaccttgc 2100 acccccctcc tcagtcttcc agcgtatgcc aagaagcacg aagtgcctcc cacctgccgt 2160 ccttaatgtc cttgaggaca tgtcccgcct ccagaatcta agtcctcttc caaacaagcc 2220 tcttcaccct ctcaatccag atcttcacct ccttttccca aacgcccact ccttcccccc 2280 agttcctttg agcctctact gtcagtcagg gactctgatt ccagacctac gtcatcatcc 2340 tcgggcagca tatcccctgt ccttcccata acctattttg gccgacatcc agcaccaacc 2400 ccccagtcag atttataggt tcttggcagg cacgataccc ctcccttatc cagtgtcgtc 2460 ttctgtcgtc gtagacattg tcgtttatgt catgctacga aagcctcccg aaaatcaata 2520 ttcaagtctt tcccatccga catgtcagtt tcccccactg ccatggtgta tctccttgga 2580 ggagatagcc agtggtatcg tgctcgtgcc ctgctggatt caggcagtga gattaatcta 2640 atgtcagcaa ccttcgccaa agaatgtaaa ctgagttatt ctcccatgtc aggtgctatc 2700 cgaggaatag gaggaaatat tgccacccag gtcgtgggtt ttgtgtcaac taatttaaaa 2760 caatgcccgc cattttccca acttaaagtt ccctttcgcg tagtcactaa gacagttccg 2820 gatgctccat cctcgtattt tcctgtcggt gattgggtac acctagcaaa taaatattta 2880 gccgacccga ctttcaatga accaagccca gtagacgtgg tattttcagc ttcggtcttt 2940 gatgccctcg atctcagtca gatgattcgc tcacaaaatg gctcccctaa cgcacgattg 3000 acaacccttg gctgggtcat gtctggtccc ctgaaaaggc tacctgttaa aagggttcgt 3060 tttcaccctt ctgtccaaaa ccacatgtct aactccaaac tttatcggac ctattataaa 3120 atattattta aacataagta tggtcattta atcccctcta aggtacctaa taatgagtct 3180 aatcctgaag acaagactcc aaccagcacc cctcctggtg agtatgagtc tgagccagtt 3240 gctcatcgta ataaccctga acttatcatg gaaccagaac agtcagttac tcctagccgg 3300 tcttcgtcct ctgctcaaat tttgtcaacc ctcgaagagg aagtcactcc tcagccttgt 3360 gctcgcccaa ccttagacat tccgctccct cctccaaaac cacaacgttt aaatagaacg 3420 cggcgtccgc tacaaattct taccagcccg gattctgagt cttttcctcc taccacaatt 3480 tcagcggaag cttcttcttc tgaaaattct accttgcctg ttaagtcttc cttatgcaat 3540 cctgtgtcgg ttgtctctat ccaacctact gtactccgct tcagctcagt aaaatcctcc 3600 caaccaagtc agcatccccc cgatcaggtc ctgtcccgcc tcatggaaaa aatgtttgcc 3660 ctggaggaag tatctgaagt ctctcccctt acgccagagg agttgaaggc cgaagagtgc 3720 ttctctaagt ctcactctcg actccctgat ggtcgctttt cagtagcact gccgctcaaa 3780 accagtcctt gttgtctcgg cgagtctaga aaggttgctc taaagcgcct tcttcagact 3840 gagtctcgcc atgcgaagtc tccacatctt cgccctccct atgtcgaatt catgcgagac 3900 tacctagagt cgaaacacat ggagctagct cctcccctaa agtcgggtca agttacctat 3960 tacatacccc atcacgcggt acatcgccca gatgaccccc cggagaagat cagagttgtc 4020 ttcaatgcct ctcagaagtc ctcatcaggc gtttccctca atgagctctt actccctggc 4080 cctagattac agcttgacat ctggaaggtt gttaccagat tcagagttga caggtgggta 4140 tttacagccg atattcgtca aatgtttcgt caaatattgc acccccctga agacagagac 4200 ctattacgta tcctttggcg ctttgatacc agtcaaccgg ttcaggagta cagactgtgt 4260 actgtcacgt atggaaccac gtcagcgccc tacttagcat gtcgcgtgtt acaagagtta 4320 gccctcactt cgcagattcc tttccccaag cagcagagat ccttaaggga tcgtacttct 4380 gtcgatgacg tctccggagg cggccctact tttgaagcgg ccctggaatc gagagaccaa 4440 ctcataaggt tactaagtca agctcaaatc gagctcagaa aatgggcagc caatcaccct 4500 cgactgttag aaaaaattcc tctcgaccac cttctcccag caatgtcttc cgtatgtcta 4560 gagaatgacg aggagtcaca attaaaaatc cttggcctct gttggaaccc ctcacaagac 4620 tatttccatt ttttgattcg ttcagtcccc aatattcaaa ctaaaagaca attagccagt 4680 cagatcgcaa aagtttttga tccattggga tggttaatcc caataacagt ttttgctcgt 4740 gcaatatttc gagtcgtttg tcaagcctcc tttagctggg atgatccttt accttcatcc 4800 attatccaag aatggtctca gtttgctgtc acccttccgg aactctcaaa tattcgtatc 4860 cctagattct tgtccttgtc tactcctaaa ttttatttaa ccggttttgc tgatgcatcg 4920 gaacgggcat atgcagccgt agtgtacctt gtgtcacttt ctgaggaaac agtcatcagc 4980 ctgatcatgt ctaaagccag aatggctcca atgaaacctg ttacgctccc acgcctcgag 5040 ctttgtgccg ctcatctcct ggctctcaca ctccataaag tcagtcttct cttccctcag 5100 ataccctctc caaacatttt cgccttttcc gattctacca tagctcttgc ctggataacc 5160 gctaccccac ctccacattg gaaggtgttt gtaggcaata gagtggctgc cattttagaa 5220 aaggtacccg cttcccaatg gtttcatgtc ccttcatcgc agaaccctgc ggactgtgcc 5280 tcacgtgggc taaagccaca ggagttggta aaccacccct tgtggtggga aggccctcct 5340 tggctaaaat tgaaccctga gtcatggccc ctaagacaag taaatgcaga tttgtcggac 5400 ccaatggtgg gggcagaaat tcgctcccag gctcttaatg tatcgatcgc agtctcctcc 5460 cctgaggatt tggaatccaa gttctccaat ttgcatacgc ttatcaacgt ggttgcctgg 5520 tgtctaagat tcgcccacaa tgccagaaat tcagcccacc acatctcagg cccattaaag 5580 gtcaaagaac gcggtgaagc cctgcgtgcc ctgattgttc gagctcagcg acattcattt 5640 ttagaagaaa ttgaagatgc ccagtcctcc agacctaaat tacgcctggt cagaatttta 5700 ggcttattca tacataccga cggagtattg cgagttggag gaagattaag acaagccaat 5760 atccctgagt ccaataaaca cccggccctg ttgcctcagc aacatcccct tacggccctg 5820 gtaattcaag acgctcacgt atccaatctc catgtaggcc ctaccgccac tcatgccttt 5880 ctccgaaaca gatattggat cactaatgga aaaaatgtag ttcgacgtca attgtcatca 5940 tgtaataaat gtttcagcat taaaccatcc cctttcaacc ccattatggg tcaactgcca 6000 gagggaagag ccagcgcctc accacccttc acgactactg gtattgacta cgcaggcccc 6060 ttcaacgtaa aaattgcgtc acttcggtcc gccaaggtcc tccaatgcta cctcgctatc 6120 ttcatctgct tttcgacaaa ggccattcac ttagaattgg tcactgacct cacgactcag 6180 gcctttctag ccgccttaaa ccgattcata gccaggcgag gcatgcctag tcacatatgg 6240 tctgacaatg gtactaattt tgtcggcgct accaggagac ttaaggaagt atctgccttg 6300 ttgaatgatt cccaatgtca agagtcgatc ctacgagagt ccggaagaag aggagtcgag 6360 tggcacttca tcccaccaag agcaccccac ttcggtggtc tatgggaggc agcggtcaag 6420 tccacgaaac gtttgctgaa agttaccttg ggcctgcaag agcccacaat ggagtgtctt 6480 ctcaccatta tctgccaggt cgaggctatt ttaaattcac gacctctgac aaccctttcg 6540 tcgtcgccag acgatctaca agtcttaacg ccaggacatt tcctaattat gcgtccaatg 6600 actgctcttc cgactagtga ctccctgaac caacgcaccg ccctcaagtc taagtgggca 6660 atggcccaat tcactgttag ccagttttgg aaacgatggc atcgggagta cctccttgat 6720 cagcaggcaa tgatcaaatg gcattcgccc tctttcccag cggcagaagg acaaatagtc 6780 acgattatgg acgacaacct ccctccactg caatggtcct taggccggat caccaaattg 6840 cattatggcc gagatgacgt ggccagggtg gccgaggtga agacggcgtc cggagttatc 6900 agccgccctg tcaggaagct gtgccctctc ccaaatcagt aagtggaact ctggcgagtc 6960 cactgcgggg gagga 6975 // ID Gypsy-266_AA-I repbase; DNA; INV; 5071 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-266_AA_; KW Gypsy-266_AA-LTR; Gypsy-266_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5071 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [2106-2483] - Reverse transcriptase CC Positions [3741-4211] - Integrase core CC 'CAAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 996..4889 FT /product="Gypsy-266_AA-I_1p" FT /translation="MSTTSSASFIKPFIPGSVSFHKYLEDLNREFMHRGVY FT RKEKQKASFLASCAAQVFSELKQIFKERSLKDVSFGEITDALRKRFDGTEA FT EAKESLIHGVSTNRSLPSSFKQSFSHNDKNSLPSHSRKLFCSFCHREGHIR FT RFCFDRKNDEVKFVDSSAVQSQSLDVSNSLSCESEFSCEETDCLTFSVNRI FT NEPCYRTVYVDGIRLKMEVDCGAAVSVIDSSMYEEEFDHIPIEPCDKKLAV FT INGSRLKVYGKIQVLVELDGRSEVVNLIVLRSENRFTPLLGRDWLDLFIPS FT WRSVFATRSINQLAVSHDQILVDVKRKFANVFDKSLSTPIKGFEADLVLKD FT HTPMFRRAYDVPLRLKDKVVEHLESLEKDGVITPVDASEWASPVIVVVKKS FT GDIRMVIDCKVSINKVIIPNTYPLPLAQDMFASLAGAKVFCSLDLAGAYTQ FT LKLSKKSRKIMVINTIKGLFTYNRLPQGASSSAAIFQRVMDQVLKGIDGVY FT CYLDDVLIAGRDEKDCLRKLYLVLERLSNANIKVNLQKCKWFVTSLPFLGH FT VLTDKGLLPCPEKVETIRRASVPNNVSELKAFLGLINYYGKFIPNLSSRLS FT CLYRLLRKDVKFVWTAECTSVFEDCKLSLLKSNLLEFFDPCKPVVVVTDAS FT SYGLGGVIAHDIDGEERPISFTSFSLTNAQKSYPILHLEALAVVSTIKKFH FT KFLFGKKFTVYTDHKPLIGIFGKEGKNSLFVTRLQRYVLELNIYDFDIVYR FT PSSKMGNADFCSRFPLPDEVPVALQREFIKSLNVSSELPLDYVLVAEETKQ FT DEFIQQIVYYLKHGWPKKLERSLLDVYAHHQDLELVDGCLLYQDRVFIPVS FT LQKQILKLLHKNHSGITKIKQLARRTVYWYGMNSDIESFVKSCSVCCQMNA FT VAKQVPHSSWIPTNKPFSRIHADFFYFDEKVFLVIVDSFTKWLEVEYMKYG FT TDARKVNSTFMSVFARFGLPDVIVTDGGPPFNSKEFTDFMERHGVKVMKSP FT PYNPSSNGQAERMVRVVKESLKKFLLDPELRSTTTEDLVSYFLFGYRNSCL FT EEDNKFPSERLFSFKPKTLLDLIHPRNSFKNHMTKPTSVENPKQDTKDYRK FT PDWTDKLCPGDTVYYKNFRTTDIRRWLEAKFQKRVSSHTFQVSVGGRVYLA FT HRNQLKQAGNDKSRRYVTFSRGKEYGRSRKRTREDDEESDSDDAEDFYGFP FT ADSFVFRDNCENPLNDSRDGSGEVVAGSMSPEELDCGVPDRGVSQADEVDP FT LEGPSSRCSSSSDRQKASLRRSRRIKRFRRDNEYVYY" XX SQ Sequence 5071 BP; 1408 A; 802 C; 1223 G; 1638 T; 0 other; gtggcgaacg aggtggtgta gttttttttc cgtaattgga aaataattgc gtttcgcgcg 60 tagttctagt ggtcgaaaaa caaccgggag gaaattaccg tcggtattcc ggagtgattt 120 ttgtgcggtg gtttgtaccg gcagtttgct gtttttggct caagtttgcc atacagtgcg 180 ccgtacagtg tttggttttg tttcccacgg attgtagcgc ttgtttgctt gccgtcggag 240 gaacagctga ctgtgtgttt tgttttgtat cgctgtttaa cgctcaatcg caggtttggt 300 tgttgcgttc acgtaagtac tgcatgtaaa gttgaaatac aaaagagaaa attctcaaag 360 tttaaaactg catgcttact gtggaagttt gcgatccatt ttctttcgcc tagcaagcga 420 gggaaaatag tttcattttg tttgtaatgc tttattctgt ttggcattca aagaaagttt 480 ttgtcatcag actagtgtgt gtttctgttg atcaatagtt tattgctaca acgttgtttg 540 cgacattaag tttaacgata catgatcgtt gtcgcgcggt gaagtgtttc atcgcatatg 600 tttgttgtat gcaaagttgt ttgtgtgtca caacacagtg gtcttaccgc attagttttt 660 gcgtggaata aagtttttca acaaaataaa caggatcagt tttatgctca aagcgtacaa 720 aaagtttgtt tcgatctgtt tgttgaagtt tgttttcagt ttatgattta atctgtaata 780 aaaaaaatca atcgttttca gtttatgatt tacactgtaa aaaaatcaat cgtttgtttt 840 cggtttatga tttaatctgt aataaaaaaa atcaatcgtt ttcagtttat gatttacact 900 gtattaaaaa aaaatcaatc gtttgttttc agtttatgat ttaagctgta ataaaaaaaa 960 tcaatcgttt ggagtttgtt ttagttgaac aaatcatgtc aactacgtcg tctgcaagtt 1020 ttatcaagcc ttttattcct ggatcagttt catttcacaa gtatttggag gatttgaatc 1080 gtgagtttat gcatcgagga gtttaccgta aggaaaagca aaaagcttcg tttcttgctt 1140 catgcgcagc tcaagttttt agtgagttga agcagatttt caaagaacgg agtttgaaag 1200 acgtttcgtt tggagaaatt acggatgcat tacggaagcg gtttgatgga acagaagcag 1260 aggcaaagga aagtttaatc catggagttt caacaaatcg cagtttacca agcagtttca 1320 agcaatcatt ttcgcataat gacaagaata gtttaccttc acattcacgc aagttgtttt 1380 gtagtttttg tcacagagaa ggacatattc gacgattttg tttcgatcga aagaatgatg 1440 aggttaagtt tgtcgattct tcagcagttc aatcacaaag tttagatgtt tccaacagtt 1500 tatcgtgcga gtcagagttt tcgtgcgagg aaacagattg tttaacattt tcagtcaaca 1560 gaattaatga gccgtgttat cggacagttt acgttgacgg tattcgtttg aaaatggaag 1620 ttgattgtgg tgctgccgtt tccgtcattg attcatcaat gtacgaggaa gaattcgatc 1680 atataccgat agaaccatgt gacaagaagc ttgccgtgat aaatggaagt cgtttgaagg 1740 tttatggtaa aattcaagta ttggttgagt tggacggacg atcagaagtc gttaatctga 1800 tagttttgcg aagtgagaac aggtttacac ctttgcttgg acgcgattgg ttggatttgt 1860 ttattccaag ttggcgtagt gtgtttgcaa ctcgtagtat caatcaactg gcagtttctc 1920 atgaccaaat tctagtggat gttaagcgca agtttgccaa tgttttcgat aaatcattat 1980 caaccccaat caaaggtttt gaagcggatc tagttttgaa agatcataca ccgatgttta 2040 gacgtgccta tgacgttccg ttgaggttga aggataaggt cgtggaacac cttgagagtt 2100 tggaaaagga tggcgtcata accccggtcg atgcgagtga gtgggcatct ccggtgatcg 2160 tcgtcgtaaa gaagagtggt gatattagga tggttatcga ctgtaaagtc tcgattaaca 2220 aggtcataat ccctaatacg tatcctctac ctttggcaca agacatgttt gcatctttgg 2280 ctggcgcgaa ggtgttttgt tcactagatt tggcaggagc gtatacacag cttaagttgt 2340 ctaaaaagtc gagaaaaata atggtgatta atacgattaa aggtttgttt acgtataatc 2400 gtttaccaca aggtgcttct tcaagtgcag cgatttttca acgtgtcatg gatcaggttt 2460 tgaaaggtat tgacggagtt tactgttacc ttgacgatgt tttgatagca gggagagacg 2520 agaaagactg tttacgaaag ctttatttgg ttttagagcg actttctaat gccaatatta 2580 aagtcaactt gcaaaaatgc aaatggtttg tgactagttt gccatttctg ggtcatgtgt 2640 tgacggataa aggtttgttg ccgtgcccag agaaggtgga aacaattcgt agagccagtg 2700 ttccgaataa tgtttccgag ttgaaggcat tcttgggttt aatcaattac tacggtaaat 2760 ttattcccaa tctgtcgtct cgccttagtt gtttgtatcg tttgctcagg aaagacgtta 2820 agtttgtctg gactgctgag tgcactagtg tgtttgaaga ctgtaagctc tcgttgttga 2880 agtcaaacct gttggagttt tttgatccat gcaagcctgt cgttgttgtg acagatgcta 2940 gtagttacgg gttaggtgga gtcattgcac atgacataga tggagaagag aggcccatta 3000 gttttacctc cttcagttta acaaatgcac agaagtccta cccgatactg cacttggaag 3060 cgttggccgt tgtgagtacc attaaaaagt ttcataagtt tctgtttggt aaaaagttta 3120 ctgtgtatac tgaccataag ccgttgattg gaatctttgg taaggaagga aagaatagtt 3180 tgtttgtgac gcgtctgcag agatatgtat tggaactgaa catctacgat tttgacattg 3240 tttacagacc gtcatcaaaa atgggaaatg cagatttctg ttcgaggttt cctttaccag 3300 acgaagtacc agttgcacta cagagagagt ttatcaaaag tttgaacgtt tcgagtgagt 3360 taccattgga ttacgtttta gttgcagaag aaaccaaaca agacgagttt atacagcaga 3420 ttgtttacta cctgaaacat ggttggccta aaaagttgga gcgaagtttg ttggatgttt 3480 atgctcatca tcaggatttg gagttggtcg atggatgttt attgtaccaa gatcgtgtgt 3540 ttatacctgt tagtttacaa aaacagatcc tgaagctgtt gcataagaac cattcaggca 3600 ttaccaaaat caaacagtta gccagaagaa cagtttactg gtatggcatg aacagcgaca 3660 ttgaaagttt tgtcaagtcg tgtagtgtat gttgccagat gaatgcagta gcaaagcaag 3720 ttcctcattc ctcttggata ccaacgaata aaccgttttc gcgtattcac gctgatttct 3780 tctacttcga cgagaaggtt ttcctggtca ttgttgacag ttttaccaaa tggttggaag 3840 ttgaatacat gaagtatgga acagatgcca ggaaagtcaa ctcgacgttc atgagtgttt 3900 tcgctaggtt tgggttacct gatgtaatcg ttacagatgg cggaccaccg ttcaattcga 3960 aggagtttac ggattttatg gaaagacatg gtgtgaaggt gatgaagagt ccaccgtata 4020 atccgtcaag caacggtcaa gcggagcgca tggtgcgagt tgtcaaggag agtttgaaga 4080 agtttttatt ggatccagag ttgagaagta caacaaccga ggacttagtt tcgtattttc 4140 tgtttggtta tcgtaattcg tgtttggaag aagataacaa gtttccttcg gagaggttgt 4200 ttagttttaa acctaagacg ctgttggact tgattcatcc tagaaatagt tttaagaatc 4260 atatgacgaa gcctacaagt gtggaaaatc ctaagcaaga taccaaagac taccgtaagc 4320 ctgactggac agataagttg tgcccaggag acactgttta ctataaaaat tttagaacaa 4380 ctgacatcag acgatggctt gaagccaaat tccagaagcg tgtttcctca catacatttc 4440 aagtttccgt aggaggtcgt gtgtatttgg cacatcgtaa tcagttgaag caggctggaa 4500 acgacaagag tcggcggtat gtaacttttt cgaggggtaa ggagtatgga cgcagcagaa 4560 aacgtactag agaggacgac gaagagagcg attctgacga tgctgaagat ttttacggtt 4620 tcccagcaga ttcttttgtg tttagggata attgtgaaaa tccgttgaat gacagtcgag 4680 atggttctgg agaagttgtt gcaggttcga tgtcaccaga agagttggat tgtggagttc 4740 ctgacagagg agtttcgcaa gctgacgagg ttgatccatt ggaaggaccg tcatcgagat 4800 gttcgtcgag ctcagatcgt caaaaagcaa gtttacgccg ttcgaggcga atcaagcgtt 4860 tcagaagaga caacgaatat gtttactact gatttcgaat cagagtttgt ttgtttgatt 4920 gtttgtttgt tttcggtagt atgttttacc gactgtttac agcagtactg ttttgctgtg 4980 ttcgagaagt atccacgttg ttcccgaaag tattgaaagt gtttagtgtg aatttcagtg 5040 ttgtgtatag ttcatctaaa agggggagaa c 5071 // ID Proto2-3_CS1 repbase; DNA; INV; 4591 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-3_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-3_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4591 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1558-1558 (2009). XX DR [1] (Consensus) XX CC Proto2-3_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1_SK) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in Proto2 CC elements from all species mentioned above. ORF2 codes for a CC protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 212..1324 FT /product="Proto2-3_CS1_1p" FT /note="ORF1." FT /translation="MNDLKRSSVSSATNDSPMPPIVNNVLGFAMASVQMHA FT PNEVVKRILCQFSGETITDAKDILWEHAITHNYECIIGKKTNRRDSNLRSE FT MEAHVDDIVQALLRISKRDVKPVIAVDACEIPSLPSLTGEDGDMRMRVSYL FT EDTCRELGRTVQQLSDVIQHRLHRPEPTYCSQPSSVEPTYSQSLTLPKTIS FT TDSTHDPLPDGESYTRNMSARHNEHAPSTCVPQQKSVIVPSAAPIDCDGFT FT VPSHVLNKRRKQARRKRKIITGSNTGDQTLKGATQDANRDLFVYHVDKSAT FT TEDLEHLMTSQKCEVRKVSVTSNDEARFKSFKLTIPASCLKRVFSEEFPWP FT LNVRVRRFISRRPQDHVRRSQQQDQQHI" FT CDS 1423..4416 FT /product="Proto2-3_CS1_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MNLKFCTYNSHGHGAGRFSYMKTLLSKYEFLLIQEHW FT LSTKCLHLFELEIDKCCSHSISGMPDESIMIGRPYGGCSIVWREGLQCKIT FT PVTSNCRRLCAVNVQSNDYQFILICVYMPIDGASIDDYNDVLNEIGNLMSS FT VDHDNVIIGGDFNTDLVRYNSLNTISLVDFCSSQYLVNCLNIDGVAIDHTF FT KSKVTGSTSCIDHFLLTENLLSSFVSAGVSHDIDNFSDHDPLYLCTSLSAN FT VTPNEQVNNVHTRIAWHSASSIQIARYKTTLNEMLADVNLPSCTADCIDVF FT CTAHQQSISKFHDDIIECCTSSALSSIGRPTAVQGSGARPGWSRYVKPVRV FT EALKWHRAWKERGRPDHGPLYDMRKMTRARYHNAVRFIKKQEQILRDTSLA FT NSMSDLDYNQFWREVKRIKDSRKPVASTVDDFNTRAEICKVFTDKYSALYN FT SVTFDSSDMIRYLDNINRDLLHDSTASYLYISTADVKKAVSGLKTGKSDGQ FT NQQSDHLINGTPLLFRKLAQLFTAMINHGFAPDSFLLGTLSPIVKNGRKSI FT NSSSNYRGITLSSILGKILDRIIMNRFSGSLYSNDFQFGFKQSHSTVQCTL FT VIDEVSKYYTSNGGEIYIMLLDASQAFDRVNYVKLFRALSAKGMCPLICRF FT LAYSYTQQKVRVKWSDHTSEPFEVTNGVKQGGVLSPTLFNVYVDFVFKKLK FT EKRLGCHLNGHYSGAISYADDLTLLAPTKHALQVMLQEVNTVARDLDIKFN FT GAKSQLVAFNKQGHIPITISCMGAIVHSVDNAIHLGNSVGINSDCIRVKKC FT IREMVMHCNSLCTLFPCVDIEVKYRLFKTYCMPLYGCQLWHLNSSNIEKIH FT LCWRKCLRRLFNLPRRTHCDLIHIICQDVPFINQILNRFAKFLQGIVKSVN FT IFTRSCLSSVLSGSLSVFSFNVSCLCSKLGIPKFRIPEYNFTARTRQLFFV FT DDAMQSLGGVIRDLIRLRDCNGLINNGWTTCDFTMFLEELCCN" XX SQ Sequence 4591 BP; 1265 A; 920 C; 914 G; 1492 T; 0 other; caacgtttat ggtgaagcaa caactccacc ataatacgtt ctttcaagtg gatatacttc 60 gcattgcgcg tggattttta tattggcgga tgcgtaaaac tgattctcat ttgtgtgata 120 tcccctccgc ccactgtggg tgtgggccag agggagtgct ttttcccttc atcgttcata 180 acagtgatct gctcaactca gaggcgctga catgaatgac ctaaaaagat caagcgtttc 240 aagcgcaaca aacgactccc ccatgcctcc tattgtaaat aatgtgcttg ggttcgctat 300 ggcctcggtg caaatgcatg caccgaatga agttgttaaa aggatcctct gtcagttttc 360 tggcgaaact atcacggatg ccaaggatat tctgtgggaa cacgctatta ctcataacta 420 tgaatgtatt attggtaaga agaccaatcg ccgcgacagc aatctccgca gtgaaatgga 480 agcacacgtt gatgatattg tgcaggctct tctacgcatt agtaagagag atgttaaacc 540 agttatagca gtcgatgctt gtgagattcc atctctgccc tcccttacgg gagaagatgg 600 ggacatgagg atgagagtgt cctatttgga ggacacttgt agagaacttg ggcgaacggt 660 ccagcagcta tccgatgtga ttcaacatcg tttacatcgc cctgaaccta cgtattgttc 720 acaaccatct tctgttgaac cgacgtattc acagtcgttg acgctgccga aaacgatttc 780 tactgattct actcatgatc cactgcccga tggcgaaagc tacacacgaa atatgtctgc 840 acgccacaac gagcatgctc cctctacttg tgtaccacag cagaagtcag tcatcgtacc 900 cagtgcagca cctattgatt gtgatggctt cacggtacca agtcatgtct taaataagcg 960 ccgaaagcaa gcaaggcgta aaagaaagat catcaccgga tcgaatacag gtgatcaaac 1020 attaaaaggt gcaactcaag atgctaaccg cgatctcttc gtctaccatg ttgacaagag 1080 cgctactaca gaagacctgg aacatctaat gacgtcacag aaatgtgaag tgcgcaaagt 1140 cagtgttacc tcgaatgatg aagcacgatt taaatctttc aaactgacta tccccgcttc 1200 ctgcttaaaa cgagtattta gtgaggaatt tccatggccg ctgaacgtca gagttcgtcg 1260 ctttatctcc cgtcgtccac aggatcacgt ccgtcgatca cagcagcaag atcaacaaca 1320 tatataacaa gtattgtatt gtactttata gttctatgcg cactctagtg actacctata 1380 aaaccattac tacctttttt ctttctctct ctctctttct ctatgaattt aaaattttgt 1440 acgtacaact cgcatggcca cggagctggt cgcttttctt acatgaaaac attgttgtca 1500 aaatatgagt ttctgttaat tcaagagcat tggcttagca cgaaatgtct gcacctcttc 1560 gaacttgaaa ttgacaagtg ttgctctcat tcgatttctg gaatgcctga cgaaagtatt 1620 atgatcggtc gaccgtacgg tggctgttca attgtctgga gagagggtct ccaatgcaaa 1680 ataacacctg tcacctctaa ttgtaggcga ttatgtgcag ttaatgttca gagtaatgat 1740 tatcagttca ttcttatttg tgtttacatg ccaattgatg gggcttccat tgacgattat 1800 aatgatgttc taaatgaaat cggcaatttg atgtcttctg tggatcatga taatgtaatt 1860 attggaggtg atttcaatac ggatcttgtt cgatataact ctttgaatac aatttctctt 1920 gtcgatttct gttcctcgca atatctcgtg aactgtttga atattgacgg tgtagcaatt 1980 gatcacacat ttaagagtaa ggtaactggg tcaacttcat gtattgatca tttcctgttg 2040 acagaaaact tattatcatc atttgtctct gccggtgtgt cacatgacat tgataatttc 2100 tcagatcacg atcctttata tttgtgtact tctctcagtg ctaatgttac tcctaatgaa 2160 caagtcaata atgtacacac tcgaattgcg tggcattcag ccagtagtat tcagattgca 2220 aggtataaaa ccacgctgaa tgaaatgttg gctgacgtta acttacccag ttgtaccgct 2280 gattgtattg atgtattttg tactgctcac caacaaagta tttctaagtt tcatgatgat 2340 attattgaat gttgtactag ttctgccttg tcatcaatcg gtaggcccac tgctgttcag 2400 ggatctgggg caaggcctgg ttggtcacgc tatgtaaaac ccgttcgagt tgaagcgctt 2460 aaatggcatc gcgcatggaa agagagaggc agacctgacc atggtccatt gtatgatatg 2520 cggaaaatga ctagagcacg ctatcacaat gctgtacgtt ttataaagaa gcaagagcaa 2580 atactgcgcg atacttcact tgctaattct atgagcgatc ttgactacaa tcaattctgg 2640 cgtgaggtga aacggatcaa agactctcga aaaccagtgg cttcaacagt cgatgacttt 2700 aatacacgtg ctgagatatg caaagtcttt acggacaaat atagtgcgct ttataatagc 2760 gtcacttttg attcgtcaga catgatcaga taccttgata atataaatcg tgatttactt 2820 catgattcaa cggcctcgta tttgtatatt tccaccgctg atgttaaaaa ggctgtgagt 2880 ggtttaaaaa caggcaaatc tgatggtcaa aaccagcaat ctgaccacct gattaatggc 2940 acacccctct tattcagaaa attagcacaa ctctttactg cgatgatcaa ccatggcttt 3000 gctcctgatt ctttcttgtt gggaacttta tcacctattg tgaagaacgg tagaaagtct 3060 attaatagca gttctaatta ccgtggtatt acccttagta gtattttagg taaaattctc 3120 gatagaatta tcatgaacag attcagtggt tcgctgtaca gtaatgactt tcaatttggt 3180 tttaagcaat ctcattcgac agtgcaatgt actctcgtta ttgatgaagt gtcaaaatat 3240 tatacttcta atggaggtga aatatatatt atgttattgg acgcttccca agctttcgat 3300 agggttaact acgtcaagct attccgtgcc ctgagtgcca agggtatgtg cccgttgatc 3360 tgtagatttc ttgcttattc ttatactcag cagaaagtac gcgttaaatg gtccgaccat 3420 acatcagaac catttgaagt cactaacgga gtcaagcagg ggggagtgct ttcaccgacc 3480 ctttttaatg tttatgtcga tttcgtcttt aaaaaattga aagaaaaacg actcggttgt 3540 catttaaacg gtcattacag cggtgctata agttatgctg atgacctcac tttgcttgct 3600 cccacaaaac atgctcttca agttatgttg caagaggtca acaccgtggc aagggattta 3660 gatattaagt ttaatggtgc gaaaagtcaa cttgtcgctt ttaataaaca agggcacata 3720 cctattacca tttcatgcat gggggcgatt gtccactcgg tagataatgc catccacctt 3780 ggtaattcag ttggaatcaa ttctgattgt attagagtta aaaaatgtat tcgtgaaatg 3840 gtcatgcatt gtaattcttt atgtacatta tttccttgtg ttgatattga ggtcaaatat 3900 cgtcttttta aaacatattg tatgccttta tacggttgcc agttgtggca tttaaatagc 3960 tctaatatcg agaagataca tttatgttgg cgaaagtgtt tacgccgcct ttttaatctt 4020 ccaagacgga ctcattgtga tctcattcat attatctgtc aagatgttcc tttcattaat 4080 caaattttga accgatttgc taagttttta cagggcattg ttaaaagcgt taatatattt 4140 actcgttcct gtctttcctc tgttttatca ggaagcctat ctgtttttag tttcaatgtt 4200 tcttgtttat gttcaaagct cgggatacca aagtttcgga tccccgagta taattttacc 4260 gcaagaacgc ggcaactatt ttttgtagat gacgccatgc aaagtcttgg tggcgtcatc 4320 agagatttaa tccgtttgcg agattgcaac ggattaatta ataacggttg gacaacctgt 4380 gattttacta tgtttctcga agaactttgt tgtaattaac atattcttct ttttatttaa 4440 ttctacttgt tgaatttatt attcttttgt tctttctttc tgttgttttt ttctttttct 4500 ttttttttct tttttttctt ttcttttccc ctaaatgaaa tgcttggatc acgcccgtga 4560 atgtatttcg attaataaac attatatata t 4591 // ID SKIPPER_LTR repbase; DNA; INV; 390 BP. XX AC AF049230; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 12.07, Last updated, Version 1) XX DE Dictyostelium discoideum LTR-retrotransposon Skipper, Gag (gag) DE gene, complete cds; and Pro (pro) and Pol (pol) genes, partial DE cds. XX KW LTR Retrotransposon; Transposable Element; SKIPPER; SKIPPER_I; KW SKIPPER_LTR. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-390 RA Leng P., Klatte H.D., Schumann G., Boeke D.J. and Steck L.T.; RT "Skipper, an LTR retrotransposon of Dictyostelium."; RL Nucleic Acids Res 26(8), 2008-2015 (1998). XX RN [2] RP 1-390 RA Schumann G.; RT "SKIPPER."; RL Direct Submission to Genbank (17-FEB-1998)Molecular Biology and RL Genetics, Johns Hopkins University, 725 N. Wolfe Street, RL Baltimore, MD 21205, USA. XX DR GenBank; AF049230; Positions 1 390. XX SQ Sequence 390 BP; 146 A; 43 C; 39 G; 162 T; 0 other; tgttagagac tcaaaactaa attaatttaa agttagtatg agctcaacaa cgtaacaaat 60 tcgcgcatct ccatgtaaaa atatttcatt taatttttat ttttattttt taatttttta 120 ttttttcagt caaaaaataa aaaattaaat taaaaaaaat aaaattaaat taaaatttaa 180 aaaaataata aaaattctga tttttcggtt tagttttcat aacgccctta cttttaatta 240 tctcttttat gtatgtttac ttttatgata ttagtctttc actctttcta tccttcatat 300 taaaaaacat tactttaaag attggactgt ttgatttaag attgaacaaa tgtgacaatg 360 gattgaagat tagtgtgagt gacttttaca 390 // ID CR1_Ele15 repbase; DNA; INV; 5735 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele15. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5735 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5735 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 383..1297 FT /product="CR1_Ele15_1p" FT /translation="MEACHACSTLMESGRVINCNGTCGRLYHFTCVGMSKS FT QFTAWTAXIGLYXFCESCRLNFEPAVYDRDKTIMKALRELLIRTNSMDTRL FT ANYGENLRKINSTIFGPKQQSKSSTDSLHQSNFLQQIDEMTLDDTTDDPIN FT RSRSCENTSFFEVLDEVNGSIALLPDKFVVGANKRVQIIANPSSGSGSSEN FT SPRITVSTPAATNKQTFAQNDRSSRRHTDRITNPSVLPGDRMSVTNSDRSQ FT NGVSRTRPEAPSLKVANITSDSHELESFYVTPFAPDQNEEEVKRYVREISN FT VHTSLCERGQTGT" FT CDS 1582..5520 FT /product="CR1_Ele15_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSPEARDDATHILNQVKKFQSNENSTNITNSTDTQRV FT QPFADFASADLSRAYHSAKLPVSPGELYSGICLAEAGKNAAPNLVRVSNLQ FT PLHVSNYRRIDDQPFIEFSSADNSSACHLAALPVSPGKFDSGICPAEARND FT AAPNXSREINLQLIGNLATSINGTEDLYFQPCLQVSEEFTTRFIRSGDHSD FT QHFGEFSSSDDPGAYHFAALPVSPGKFFSGICPAEAGEDTTQNLNRVLNLQ FT STEHPIAYINPYGEHHIHPFAKSADVSTAYHFAALPVSSGELYSGICPAEA FT GKDAAQNMMRYPNFQQHHETATCIDQTDRAVCESSSNDHNIILPTINARNG FT FNDKKCSLLVYYQNVGGINTRVCDYRLACSDSSYDAYAFTETWLNSDTISS FT QIFSDTYSVFRLDRNPQNSLKNSGGGVLLAVKASFKPRQLSVPNSEAIEQL FT WVAISFQTNTLFICVLYIPPDRSHDMNVIDQHVSAFNWITDKMKIKDSLLI FT LGDFNIPGIRWKFSTSNYLYPDVTSSSIRSSDATLIDSLNMARVSQLNYVV FT NHNNRILDLCFGSIDGDVRFSLIEAPSCLVKPTTHHPSLLIEVIGATPCVF FT VEPVESLYNDFKHGDYSGMNTFFSNINWLDVINQDLELSVATFSNIVLYAI FT DQFVPKRIHRPPTNPPWSNYRLKQLKRLKRSTLRKYSKFKTLSCKLEYHAA FT NVRYKRLNEKLYLSHQRKIQQKLRHNPKAFWNHVNEQRKESGLPTAMFKDS FT VEESSTEGICNLFLSQFSSVFTSEYLDEQQVFLAANNVPVYPPVGDHPFVN FT SEFVCKNCSALKASTSCGPDGIPAIVLKNCSSILSVPLSQLFNLSLQAGSF FT PECWKKSFVFPVFKKGNKRDVKNYRGISALCAVSKLFEKVVHNFLLHNCLH FT FISEHQHGFMPKRSTNTNLVAYTSFIAKELQKGHQVDSIYTDFSAAFDKIN FT HQIIVAKFSRLGFSGSFIDWLKSYLTGRKMAIKIGDCTSPYFLVTSGVPQG FT SHLGPLIFLVYLNDVHSLLKSFKLSFADDFKLYCIVDDVNNACFLQSQLDA FT FTVWCKTNRMELNADKCSVISFSRKRSTFNFDYKIGSTVLRRESVVKDLGV FT LLDSKLSFKEHIAFVASKASKTLGFIFRVAKQFKNIQCLKSLFCSLVRSIL FT EYSSVVWAPYYQNSISRIEAIQRRFVRFALRHLPWNNPTNLPIYEDRCKLI FT GLELLSVRREISKSLFVSDLLQLKIDCPHLLSLLNFNVPHRPLRSSTFIFL FT HGARTNYGYNEPFSAMCRSFNRCSHEFDFHIPRTTLRTKFSQIISESHQNS FT R" XX SQ Sequence 5735 BP; 1577 A; 1291 C; 1146 G; 1718 T; 3 other; tctggcaaca ctgttacctt tccgctcttg ctatttttcg gtgctcgatt ttgtatgttt 60 ttgtcgtgca atttatgtcg aaaattatca gtgactggtg attccggcgt acagtagtag 120 tctttttaag tgtatagtga ttgttctccc attacaacgc cgtgcaagtg tttatttcgt 180 gatctgagtt gttattgttt actacaacac actcccactc gctcgatttt ttttttgaaa 240 gcgacatctg gcggcaatat ttggaagcta ctcggctgta ttggatacaa gtggtagctt 300 agtttcatac aagttgcaaa tccctgactt ggaatagtct accaaaaggc gtttgcggag 360 tcgtatcgaa acgtttgcga cgatggaagc ctgtcatgct tgttcaaccc ttatggaatc 420 gggtagagta ataaattgca atggaacatg tggccggctt taccatttca cttgtgtagg 480 gatgagtaaa tcccaattta ccgcgtggac ggcaaamatt ggactatatt kgttctgcga 540 atcctgccgt ctaaatttcg agccggctgt gtacgatcgg gataaaacta ttatgaaggc 600 tctacgcgag ttgcttattc gcacaaattc catggacacg cgcctagcca actacggaga 660 aaatctgcgt aaaattaata gtacgatatt tggaccaaag caacaatcta aatcgtcaac 720 ggactccctt catcagtcaa actttttgca acaaattgat gagatgacat tagacgatac 780 gactgatgac ccgataaatc gttcgaggtc ttgcgaaaat acttcatttt ttgaggtgct 840 ggatgaagtg aatggctcta ttgcactcct accagataag tttgtcgtcg gtgcgaataa 900 acgagtgcaa attatcgcaa acccgtccag tggctcagga agtagtgaaa attcacctcg 960 tatcactgtt tccactcctg ccgctaccaa caagcaaacg tttgctcaaa atgaccgatc 1020 ttctcgtcga catacggata gaatcaccaa tccttccgta cttcccggtg ataggatgtc 1080 ggtgacgaat tctgatagat cgcagaatgg agtgtctagg actcgacctg aagctccatc 1140 gttgaaagtc gccaacatca catccgattc ccacgaactg gagtccttct acgtcactcc 1200 tttcgcacct gatcaaaacg aggaagaagt aaaacgttat gtcagagaaa tttccaatgt 1260 ccacacatct ttatgtgaac gtggtcaaac tggtacctag gggtaggaac gccgaagatc 1320 tttctttcgt gtcgttcaaa gtgactgtca gtaagactgt ctcgaatgtg gtcggtgatc 1380 cctggtattg gccggaagga attactgtac gtacttttga gcccaatcca aaaaacggtt 1440 ctgcaacacg tcttccagta attcaataaa tagagaaact acgaatcttc cacaactcac 1500 gttatccagg aattcctccg accatcgatc ataccacgca gacaagctcc ctgttttatc 1560 gagtaagttt tacagtggta tatgtccccc gaagcaaggg acgatgctac acacatcctg 1620 aatcaagtta aaaaatttca gtccaatgaa aattccacaa acatcacaaa ctccactgac 1680 acccaacgcg tccaaccatt cgcggatttt gcctcggctg atctttcaag agcgtaccac 1740 tctgccaaac ttcctgtttc gccaggtgag ttatacagtg gtatatgtct cgccgaagcg 1800 ggcaagaatg ctgcaccaaa tttagttcgc gtttcaaatt tgcagccact tcatgtctcc 1860 aactaccgcc gtattgatga tcaaccgttt atcgaattct cgtccgctga caactcaagt 1920 gcgtgccact tagcggcact tcctgtttca ccaggtaagt ttgacagtgg tatatgtccc 1980 gccgaagcga ggaatgatgc tgcaccaaat ktaagtcgtg aaattaattt gcagctcatt 2040 gggaatcttg ccacttccat caatggaacc gaggatctct actttcaacc atgtctccaa 2100 gtttctgaag agtttaccac ccgtttcatc cgatctgggg atcattctga tcaacatttt 2160 ggcgagtttt cgtcatctga tgaccctgga gcataccact ttgccgcact tcctgtgtca 2220 ccaggtaagt ttttcagtgg tatatgtccc gccgaagcag gggaagatac tactcaaaac 2280 ctgaatcgag tattaaactt gcagtctacc gaacatccta tcgcgtatat caacccatat 2340 ggtgagcacc atatccatcc gtttgccaaa tcggctgatg tctcaacagc gtaccacttt 2400 gccgcacttc ctgtgtcatc aggtgagtta tacagtggta tatgtcccgc cgaagcaggg 2460 aaggatgctg cacaaaatat gatgagatat ccaaatttcc agcaacacca cgaaactgca 2520 acctgcattg atcaaacgga cagggcggtc tgtgaatcat catccaacga tcacaacatc 2580 atattaccaa ccatcaatgc tcggaacggt tttaacgata agaagtgtag tctgcttgtc 2640 tattaccaaa acgtcggtgg aatcaacact cgcgtttgtg attatcgcct cgcgtgttcg 2700 gactctagct acgacgcgta cgctttcact gaaacatggt tgaatagtga tacgatatcc 2760 agtcagattt tcagcgatac ttacagcgtt tttcgtttgg atcgcaatcc tcaaaatagt 2820 ttgaaaaact caggtggagg agtattatta gcagtaaaag ccagctttaa acctaggcag 2880 ttatcagttc cgaattcaga agctatcgaa caattatggg ttgctatcag ctttcaaact 2940 aacacgttgt tcatttgtgt cctttacatt cctccagacc gctctcacga tatgaatgta 3000 attgatcaac atgttagcgc gtttaattgg atcactgata agatgaaaat taaggacagt 3060 ttactgatac ttggggactt caacataccc ggcattcgtt ggaaattcag cacatcgaat 3120 tatctctatc cggatgttac tagctcttca attagatcat cagatgctac gctcattgat 3180 agtttaaaca tggcacgggt ttctcagttg aattacgttg taaaccacaa caaccgaata 3240 ctggatttat gctttgggag cattgatgga gacgtgcgtt tctcattgat cgaggcccct 3300 tcttgtcttg ttaaaccaac aacgcatcac ccgtccctgc tgatcgaagt aataggtgca 3360 acgccctgtg ttttcgtaga accagtcgaa tcgctttaca acgattttaa acatggtgat 3420 tattctggta tgaacacctt tttctccaat attaactggc tggatgttat taatcaggat 3480 ctcgaattat cagtcgcaac attctccaac atagttcttt atgctataga ccaatttgtt 3540 cctaagcgta tccatagacc tcctacaaat ccaccttgga gtaattaccg tttgaagcag 3600 ttgaaaaggt tgaaacggtc taccttacgc aagtattcga agtttaaaac cctgagctgt 3660 aagctagagt atcatgccgc caatgttcgc tacaaacgac taaatgagaa actgtatctt 3720 tcgcatcaac ggaaaattca acagaaactg cgtcataatc ccaaggcatt ttggaaccac 3780 gtcaacgaac agcggaagga atctggtctc ccaacggcca tgttcaaaga cagcgttgag 3840 gaatcgtcta cagaaggcat ctgcaatctc tttcttagtc agttttcgag cgtctttaca 3900 tcggagtatc tggacgaaca acaggttttc ttagcagcga acaatgttcc cgtttatcct 3960 ccagttggtg atcacccatt cgttaattca gaatttgtct gcaagaattg ctctgctttg 4020 aaggcgtcaa ctagctgtgg tccggatggt attcctgcaa tagttttgaa gaattgctca 4080 agcattctat ccgtgcccct atcacaactg ttcaacctat ctctacaagc aggatcattc 4140 ccggaatgtt ggaaaaaatc gttcgtcttt ccggtgttca aaaaaggcaa caaacgtgat 4200 gtcaaaaatt atcgcggtat atctgctctt tgcgctgtgt ctaagttgtt cgaaaaagtg 4260 gttcacaact ttttgctgca caattgcctt catttcatat cggaacacca acatggtttc 4320 atgccgaagc gttcaacgaa taccaactta gtagcgtata catcattcat tgcaaaagag 4380 cttcaaaaag gtcatcaagt agattccatc tatacggatt tctcagcagc ttttgataaa 4440 atcaatcatc agataattgt tgccaagttt agtcgtctcg gtttctccgg atcttttatt 4500 gattggctga aatcctactt gactggtagg aaaatggcca tcaaaattgg agattgcact 4560 tcgccatact ttcttgttac ctctggagta ccacaaggca gtcatttagg gccattgatt 4620 tttttagtgt acctcaatga tgtgcattcg ctgcttaagt cgttcaagct ttcatttgcc 4680 gatgatttta agctgtattg tatcgtcgac gatgtcaaca acgcttgttt ccttcaatcc 4740 cagcttgacg cgtttacagt ttggtgtaaa acgaaccgca tggagctaaa tgcagacaaa 4800 tgctcagtaa tatcattttc ccgcaaacgc tctacgttta attttgacta caagattgga 4860 agtactgtcc ttaggagaga atcggtggtt aaagatctgg gtgtgttatt ggactctaag 4920 ttgtccttca aagaacacat tgctttcgta gcctccaaag catctaaaac actcggtttc 4980 atatttcgtg ttgcaaagca gtttaaaaac attcaatgtt taaagtcttt gttttgttcg 5040 ctagtgaggt caatattaga atactcgtca gtagtctggg cgccctatta tcaaaacagt 5100 atttctcgca tagaggcaat tcagcgcaga tttgtacgct ttgccttgcg ccatctgcca 5160 tggaacaacc caactaactt acccatttac gaagaccgtt gtaaactgat tggtttggag 5220 ttgttaagcg tccgccgtga aatttcaaaa tctttgtttg tatcagatct tctgcaattg 5280 aaaattgatt gtccacattt gctgtcgctt ctcaatttca acgtacctca tcgtcctctt 5340 cgttctagta ctttcatatt tctccatggt gctcgtacta attacggata taacgaaccg 5400 ttttccgcta tgtgtcgttc cttcaatcgt tgctcacatg aattcgattt tcatatcccc 5460 cgtactacgc tccgaacgaa attctctcag attatttcag agtctcatca aaattctcgt 5520 tgaactcttt ttaaggctag caaattaaat cgtttttatg ttactattaa gaaaataagt 5580 gtatagtatt aagttaaatt gtatcatttg gattgtatgt ttttctgttg gtactaaaag 5640 atgaggaggt tttgcgccca tttgagatag agctgaggtg ctcaactcaa acgggctttt 5700 ccctgctcca aataaagaat aaagaataaa gaata 5735 // ID P-37_HMa repbase; DNA; INV; 4526 BP. XX AC . XX DT 21-JUN-2010 (Rel. 15.06, Created) DT 21-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE P-type DNA transposon - a consensus. XX KW P; DNA transposon; Transposable Element; P-37_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4526 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 794-794 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(750..1037,1084..1287,1350..2921,3308..3949) FT /product="P-37_HMa_1p" FT /translation="MPRKCSVGNCRGNYDSANEHVKVYKFPSDKELCERWL FT AALPNKIQSPTDNMGVCEKHWPLGCSMHYPKRSKYQVSRIHIYKQKFQINV FT VVILVSFIIPSDPPSLFPGCFPSMLRQTGKTFRQTNFEKISIESRNSFCDE FT LDSFNKIDKIKNFEVKQFLTDVKQRIILLNISRFPDYFVVTNALDLTVFNM FT TFSRPPETNISVTINTHTQQLHAYIKHTQIHCNDILGFSHKLTRWSQLEAI FT ISRMKSSKPELSEELKHLLQEIECTYGEKSDEINFLIEQLQLQMMPVRGKR FT YSTLVVRRALELYLSSRCCYRILCDTLSLPHPRTLKLKLGRIGEVGTQKEC FT EECIKTVFETLTISEKRCCLLFDEMYVKPSIRFRGCHLIGYSEDNPAEIAK FT TILCFMIKPMFGKPAFVCRMAPVYKLSGIFVQNLIVEITQTVAKLGGEILC FT LISDNLPTNRSIYQFFALSLNEPWLGNICNQSIIMLHDPVHLFKSIRNNWL FT TEKTGTIHLTLNSQLFKGYWKDLVSLYEKEKNNVIKLTNLSYKAIHPGVID FT RQNVTHMCAVVNEKTVAALKICDFKETSEFLERILLIWNYLNIKQEGMDIR FT LNDPNRASYKNVNDKRLQEILTFAEAVAKMPGGKGWKRNKSFTSETKISLS FT NMLYGIVDLIRKLLNEDHQYVLPGIFQTDRLEGEFGIYRFRFFYARQLCGG FT SYHVSVEEVKNSLRLQRLKLFSKLDAMGVPISQCSIIQSECCQLDLNEDDI FT SYLDTSVSSIDAISEEENAALYYISGYVQHKFLPSISVSGTFVTNASEFTE FT LVSRGRLCHPTEDLFQYVRYAYSLFILLPNAEMRFQCVRYISKLFIYFYTA FT LPFDLQALPINTVVSTILNCFFKGLTKLNESSDKVSLNIEDRKVKKFCK" XX SQ Sequence 4526 BP; 1575 A; 633 C; 699 G; 1619 T; 0 other; cagggactac ttaaagtccg ggacgcacgg cattgaatcg atgtgaaatc gacgagaggg 60 ctgactttag cgccttcaat ataccaacat gtaatgttaa tagcttaaat aaaagagtgg 120 tatttaaaga ttattgataa gctgcaacaa atgttgattc atatagatta tataatgtat 180 taactttaat taatttattt ttttaagcct ttttatagta cttttaaagt agtaaaagta 240 tttttattaa gatgtttata tgatgattgt attgcaataa taaatactac ttaaatattt 300 tatgactact taaatattta tgactattac ttacttatta tattatgaat attacttact 360 taattactta attattatat ttatgagtat tacttgctta attatttatg actacttaaa 420 tattttatga taaaaaatga aaaacagtaa catactatta cagtacgaag attttatttg 480 attttgagta taattacgtt ttgatgcaaa tcaagtcttc aaactgttac taacagtttt 540 aggggtactt gagttacttt acactaaagt aacttgagtt actttagact aaagtaactt 600 ttttagtact aaataaagtt gagtacaaaa aaacataaga atgttaagtt attattgtac 660 taaactaaac ctatttttaa ggttataata tttaaaaaat attaatttta ttctctattt 720 aaaaataaat ttataaaacc attacagaca tgcctagaaa atgctcagta ggtaattgtc 780 gtggaaatta tgacagtgca aatgagcatg ttaaagttta caaatttcca agtgacaaag 840 aattatgtga aagatggtta gcagctttac caaataaaat tcaatcacca acagacaata 900 tgggtgtttg tgaaaagcac tggcctttgg ggtgtagtat gcactatcca aaaagatcta 960 aatatcaggt gagtcgaatt catatatata aacaaaaatt tcaaatcaat gttgtagtta 1020 tacttgtaag ctttatttag ttcaaattaa atagataatc taaaatcaat ggaaactttt 1080 tagattccat cagatcctcc ttcactattt cctgggtgtt ttccttcaat gttacgacaa 1140 acaggcaaaa cattccgtca aacaaatttt gaaaaaattt caattgaatc aagaaactca 1200 ttctgtgatg aactcgattc tttcaacaag attgataaaa taaaaaattt tgaagtcaaa 1260 cagtttctta cagatgtaaa gcagcggtaa gaaaagttct aattaaatat aatttttgta 1320 ggattttttt gtgttagtat tactaataaa taattctttt aaatatttca aggtttcctg 1380 attattttgt tgttacaaat gcattggatc tgactgtttt taatatgaca ttttcgcgtc 1440 caccagaaac taatatttct gttacaatca atactcacac tcaacagtta catgcttata 1500 tcaaacacac acaaattcat tgcaatgata ttttggggtt ttctcacaaa ctcactcggt 1560 ggtcgcagtt agaagcaatt atttcaagaa tgaagtcatc taaacctgaa ttaagcgagg 1620 aactaaaaca tttattgcaa gagattgaat gtacatatgg agaaaaatct gatgaaataa 1680 attttttgat tgaacaacta caactgcaga tgatgcctgt gagaggcaag cgttactcta 1740 cgctagttgt acgacgagct ttagaactat acctttcttc aagatgctgt tatcgaattc 1800 tttgtgatac actttcactc ccacatccaa gaactcttaa actaaaatta ggcagaatcg 1860 gtgaagtagg aacacaaaag gaatgtgaag aatgtattaa aaccgtgttt gaaacattaa 1920 ctatttcgga aaaacgttgc tgcctgctat ttgatgaaat gtatgtgaaa ccaagtatac 1980 gttttcgtgg atgtcattta attggttata gtgaagacaa tcctgctgaa attgccaaga 2040 caattctttg ttttatgatt aagccaatgt ttggaaaacc agcttttgtg tgtcgtatgg 2100 ccccagttta taagctatct ggcatttttg tacaaaattt aattgttgaa atcacacaga 2160 cagttgctaa acttggtgga gaaattttat gccttatttc agacaatctt cccacaaatc 2220 gtagcatata tcaatttttt gctttaagtt taaatgagcc atggcttggt aatatttgca 2280 accaaagtat aataatgctt catgatccag tgcacttgtt taaatcaatt cgaaataact 2340 ggcttacaga aaaaactgga acaattcatc tgacactaaa tagtcaatta tttaaagggt 2400 attggaaaga tttggtaagt ttatatgaaa aagaaaaaaa taatgttatc aagttgacaa 2460 atctttcata caaagctatt catccaggtg taattgatcg acaaaatgtc acccatatgt 2520 gtgctgttgt taatgagaag actgtagcag cccttaaaat ttgtgatttt aaagagacaa 2580 gcgagttcct tgaaagaata ttattgatat ggaattacct taatattaaa caagaaggta 2640 tggacatacg tttaaatgat ccaaatagag cttcatacaa aaatgtaaat gataaacgac 2700 ttcaagagat tttgacattt gctgaagctg ttgcaaaaat gccaggagga aagggctgga 2760 agcggaacaa atcattcact agtgaaacga aaatttcttt gtcgaatatg ctttacggaa 2820 ttgtagattt aatccgaaaa ttgctgaatg aagatcatca atatgttctt cccggaatat 2880 ttcagacaga tcggttagaa ggagaatttg gaatatacag gtagttaaat aattccttac 2940 ctatttatga tcagattgat attgttgtta ttacatttaa gcagggtcag actggcactg 3000 aagggcccgg gggcaaaaat aaattaaatc cttaatcaca tattttttgg ttataagtag 3060 aaataacttt tatttagaaa taactggtat tttgtaacat caaaaatgat atttcataga 3120 aataactatt tttatgtatt ttttatgtat tttatgtatt ttttatgtat tttattaaat 3180 ttttatgtat tttttgtggc aggggctctg gggtaaatcc ctcccaaccc ccctcaccac 3240 tctgacccta catttaagaa cactaatttt cgtatgtatc tttaaataaa attcaaatta 3300 tatatagttt agattttttt atgccaggca actttgtgga ggcagttatc atgtttcggt 3360 tgaagaagtt aaaaatagtc ttcgtttaca acgcctgaag ttgtttagta aattggatgc 3420 tatgggtgta cctatttctc aatgctcaat tattcaatca gagtgttgtc aattagatct 3480 gaatgaggat gacattagtt atcttgatac atctgtctct tcaattgatg caatttccga 3540 agaagaaaat gctgccctat attacatttc gggttatgtc cagcataaat ttttaccaag 3600 tatatccgta tctggtacct ttgttaccaa tgcatctgag tttactgagt tagtatctcg 3660 tggtcgacta tgtcacccta ctgaggatct atttcaatat gtcagatatg cgtattctct 3720 ttttatttta ttacctaatg cagagatgcg gttccagtgc gttagataca tttctaaact 3780 ttttatatat ttctatacag ctttaccttt cgatttacaa gcattaccaa tcaacacagt 3840 tgtttccacc attttgaatt gttttttcaa gggtctaacg aaattgaatg aatcgtcaga 3900 caaagtttct ttaaatattg aagaccgcaa agttaaaaag ttttgcaaat aaatcaaaca 3960 acacgcaata tgtggttaaa atggttattt tttaaaattg ttaacattgt tttaacacag 4020 aagttagatc attttagata atctactttt tttgttgttg aattttgaaa attcttatat 4080 ttataattat atttattatt taaaattctt atatttatta atgtttgaag attaataaat 4140 tatgaattaa ttttaatttt taatgaaatg atttaaatgg tttcaagtaa ttaaatttta 4200 taagatccct ttatttactg cataagatcc tttgattaac tacataagat caatttattt 4260 actaattaaa gattaaagtt actaatccag gccttataaa aataagattc cttttttact 4320 ataaattgtc ataaatttaa tgagttatga gattaagatg ttatttaatg agaaacaaaa 4380 aataactctg aataactgtt aatattagta tcaaaatcat ttaagtgtta aacttcaaaa 4440 atttttcgaa catattcaac tttcagccct ctcgtcaaaa tcgcatgaaa tcgaagttgc 4500 gcgtcccggt attttagtag tccctg 4526 // ID BEL-214_AA-LTR repbase; DNA; INV; 547 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-214_AA_; KW BEL-214_AA-I; BEL-214_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-547 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 886-886 (2011). XX DR [2] (Consensus) XX SQ Sequence 547 BP; 175 A; 111 C; 90 G; 171 T; 0 other; tgtgccgaca acacccctcg gtgcagcgac gccgttacca atgacgatcg ctttccaaca 60 ggtcgttctt gaacgatctg tcagcgatag gggagaccgt agaaacgtca aggatagaga 120 aacacgaaag tcatctatct aaccatattt ccattgtgta ttactcagag caaaactgtt 180 attgaactta cgagaaatct ttacgctgca gtctattaaa tttaacagaa acgtaattgt 240 atttcaaggc aatctttatt ttttccaatt gaatttatta ttgtaagtaa tctatgaact 300 tcttcaaatt aattagatat gttgcttaac gaatgctctc taaattccca gcccttataa 360 cattatagta gtgcttacga tacgattcat tcatactcca tagtttgtac actaaaattg 420 tgagtacttt agtagtaagc attgagaaac cattcaaata aattaattaa ttttctagct 480 tgaagctaac aacgacaaag agtgcgtttg ctgtccagat ttggtgatac ttacccccac 540 cccaaca 547 // ID BEL-95_AA-I repbase; DNA; INV; 6961 BP. XX AC supercont1.291; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-95_AA_; KW BEL-95_AA-LTR; BEL-95_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6961 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.291; Positions 27500 20540. XX CC 'CAGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1549..4401 FT /product="BEL-95_AA-I_1p" FT /translation="MAPKKSPLKILTVKLQEIQIELDDIWRFAQNYPGDVT FT ATDVYMRLSRVDELWTKYSETLIELKCHEDYDSDDETFSSQRVVYSDRYYR FT SKSSLTDKAKALQLTPDLDRSTNRANESVLNGVLEHVRLPQINLQKFDGNI FT DEWISFRDLFTSLIHWRADLPEVEKLHYLKGCLQGEPKALIDPLKITQANY FT QIAWDTLTKRYNNSKQLKKRQVQSLFSLPSLSKESIVDLHLLVDGFERIVQ FT TLDQIVQPGDYKDLLLVNILTNRLDPVTRRGWEEFSSTKEQDTLKDLTEFL FT QRRIRILESLPPKVAESKGAQQSQPHFQKQKQSSVKVSYNTTQSSSTKRCV FT ACKEDHFLYQCATFQRMSIADRDAVLKTHALCRNCFKTGHISRECQSKFNC FT RNCKGRHHTLVCFKSEKDSSARVAAAAGNNHPSHQRKSSEIGETSGTSSSQ FT VVNMVATDISVSSAAQQYSSQVLLATAVVVVEDDEGIRFPARALLDSGSES FT NFITEWLSQRLKTQKQKVDIAVRGIGQAGTKVRHKIEAVVRSRVSPFSQSM FT SFLVLPKVTVNLPTATINTDGWSIPPGIKLADPAFFESRRVDMVLGIESFF FT DFFETGKRMTLGNQLPTLNESVFGWVVCGGLSNTSQDLRITCNVSATEKLE FT SLVTRFWSSEEVGSLRVLSPEEKRCEDLFQNTVQRNPDGRYTVTLPKEESA FT VTRLGESKEIATRRLQGSERRLARNANLHKSYHDFMKEYESMGHMERIDVT FT DVDAKRRCFLPHHPVFKEDSTTTKVRVVFDASCATSSGVSLNDMLLTGPVI FT QEDLRSIILRSRTKQIMLVSDVEKMFRQIWVHPDDRHLQCILWRSNPQEDV FT SVYELKTVTYGTKPAPYLATRTLKQLAIDEATTFPLGAQAASEDTYMDDVI FT TGANSVDEACELRQQLSEMTNRGGFWLRKWASNCSKVLEGLTEDTGN" FT CDS 5696..6871 FT /product="BEL-95_AA-I_2p" FT /translation="MVLPARHELTKMILRYYHEKLLHAGPQLLLANVRLHF FT WPLGGRSVARNIVHRCMKCFRSKPSSIEQFMGELPAPRVTLSRPISQTGVD FT YCGPFYVRPAPRRPVVKMYVALFICLCTKAVHLEIVSDLTTDRFIQALRRF FT TARRGRPKDMYSDNGTNFVGARNVLRELYNLVDSKNHQESVTNCCLEERIQ FT WHFNPPSAPHFGGLWEAAVRSAKSHLLKAIGETPLSPEDLNTLLVQVEACL FT NSRPLTPMSDDPNDLEPLTPAHFLINSSLQSLPDTELEDVPMNRLDRFQLT FT QKIAQDFWKRWRREYLCQLQGRSKRWKPSVAIEIGKLVIIRDENQPPMRWK FT LGRISEVHPGTDGIVRVVTLKTASGNFKRPVEKLCLLPFPESTSTHQYQ" XX SQ Sequence 6961 BP; 1837 A; 1667 C; 1751 G; 1706 T; 0 other; tttatggtcc ttcgagccgg atgtggtcag gaccctggaa ggacaggaac tagcttctgg 60 attgctagta gccgaaaccc gcgtgacatc gcgccatcat tgctggacgg gctcaaggag 120 acaatcgcca ttccccaggc gaacaattgt caacccgcca tcacactcgc ggacaatcag 180 attggacctt ttagaagcta cacaattgga ttgcgcgctt catcccactg gcttggcttc 240 ggttttggac atccgtcgat tgaactgttt ctgctggtga gtggaaccca tttatatgtc 300 ctgccgaagc ttggacgaaa tattcgttac taacaagtgg tctttcaaat tcctgcatgt 360 gaaactgcct gtgagtgatg gtgatatatt gtcactgaac ctgaggagtg cgttttttgg 420 atctaccaag catccacaac ggtggttgat gctggatgcg aaatattccg acctggaatt 480 cttcgtgaac cctatgtaca tgccttaaac tagaaatcat cccggaattg cctgcatcga 540 tacttcatcg ccctttggat gtgtctgctt cccgaacagc tgaattagtg acgtggtttt 600 tctctcagga gcgacaccta ttaccggcaa ggctacctgg tgattgccac ggatcggatc 660 ggtaattggt actggtcctg cccacctgtt ctaccgttta atggatttcg aggatcctgc 720 ttgcaaaccc cgacgtaacg attgaatgcc cgttattacg atcgatttgg actaccacct 780 gtacgaatca tcgctggcta cgtgtgcata cgtgtgagga cgaatgatag aagaggtcgc 840 gccatcagtt acctgtctag gtgagtcaca aagaacgtat atgtcaccga agcgggacga 900 ctgaaaatac acgtttttta attaatttga aatcctcttc gacaattcat tgcgacgatt 960 ccctgagtgc acattttgtg ccgttttaca attttggaac ccgatctggg gggaactgcg 1020 atagacttgg acatttgcgc tgctgtttgg tttcccaggt aagccattac agtatatgtc 1080 cagccgaagc aaacattttg gatattacac cctcgtttct cctcttcctg ctaccacgca 1140 ctatcattga ttgacgttga gccttgctgc taattacctg caagctggaa ctttggatta 1200 cgattttaca atttcgaatt gggtattgct gatagtttgt gaaggtaaac actatagtat 1260 atggcctgcc gtggctaggt ctgttggata ctacataagt gccctctccc cttcttccat 1320 atacactggt ggatcgtgag ccttgctgta ttacgattct cgtgacgtca tcgatcattc 1380 gttagagact acgtgtttct gtttaacttt tcgttcgatt tggtgctgca tttcactgtg 1440 ccaggtaaat atacacagta tatgccagcc gaagcgagga atcacgtgga tcactacacc 1500 ttctacccca ccttactttc gtttcctgcc tgagtgacgt ctaccaacat ggcaccgaag 1560 aagtcaccgc tgaagatcct caccgtcaag ctacaggaga tccaaatcga gttggacgat 1620 atttggagat tcgcccaaaa ttaccccgga gatgtcacag ctaccgacgt ctacatgcgc 1680 ttgtctaggg tggacgagct ctggacgaag tatagcgaga cgttaatcga acttaaatgc 1740 cacgaagact acgattctga cgacgaaacc ttcagtagcc aacgtgtggt ctacagtgac 1800 cgctactacc gatcgaagtc atccctcacg gacaaggcga aggcattgca gcttacacca 1860 gacctggatc gttcaacaaa ccgtgctaat gagtctgtgc tgaatggtgt cctcgaacat 1920 gtgcggctac cacagatcaa tttgcagaaa tttgacggca acatcgacga gtggattagt 1980 tttcgggatt tgttcacctc tctgatccac tggagagccg atctaccaga ggtggagaag 2040 ttacattatt taaaagggtg tcttcagggg gaaccaaagg ctctaatcga tccgttgaag 2100 attacgcagg ctaattatca aatcgcgtgg gatactctta ctaagcgtta taataacagc 2160 aagcagctta aaaagaggca ggttcagtcg ttgtttagtt taccttcgct ttccaaggag 2220 tcgatcgtcg atttgcacct tctagtcgac gggttcgaga gaattgtaca aacactcgat 2280 caaatagtcc aaccagggga ctacaaggac ctgctactag tgaatattct taccaatcgt 2340 ttggacccgg tcacccgcag aggatgggaa gagttttcgt ccacaaagga acaggatacc 2400 ttgaaggatc taacggaatt cctgcaacga aggattcgta ttctcgagtc tcttccaccg 2460 aaagttgctg agtcaaaggg agctcaacag tcgcagccac acttccagaa gcagaagcaa 2520 tcatcggtga aggtcagcta caacactacg caatcatcat caacgaagcg atgcgtagct 2580 tgtaaggaag atcactttct ctaccaatgt gcaacattcc aacggatgtc tatagctgac 2640 agggatgcag tgctgaaaac ccatgcgcta tgcagaaact gcttcaaaac gggtcacatc 2700 tccagggaat gccagtccaa gttcaactgc cggaactgca agggtcgtca ccacactctg 2760 gtgtgtttca agtcggagaa ggatagttcg gcaagggtcg cagcggctgc tgggaacaac 2820 catccttcac atcaaaggaa gtcctccgaa atcggcgaaa cttcgggaac tagttcatct 2880 caagtggtca acatggtggc cacggacatc tcggtttcca gtgcggctca acagtattcg 2940 tcgcaagtct tattagcaac tgcagtcgtc gtcgtggaag acgatgaagg aatacgattc 3000 cctgcacggg ctctcttaga ctcgggttca gagagcaact tcatcacgga gtggctaagt 3060 caacgattga agactcaaaa acagaaggtg gacatagcgg tgcgtggaat cggtcaagca 3120 gggaccaagg tcaggcacaa aatcgaagcg gtggtgcgct cgcgagtttc tccattctcg 3180 caatcgatga gtttcctcgt tctccccaag gtaaccgtga atcttccaac tgctacaatc 3240 aacacggatg gttggtcgat acctccaggg atcaaactgg ctgatcccgc tttcttcgag 3300 tccagacgag tagacatggt gcttggaata gagtcgttct tcgacttttt cgaaactggg 3360 aaaaggatga cgttaggcaa ccaactgcca acgttgaacg aatcggtttt tggatgggtg 3420 gtgtgtggag gcttatctaa cacaagtcag gatttgcgca tcacttgcaa cgtttcagca 3480 acggagaaac tggagtctct ggttacacgg ttctggtcca gtgaagaggt tggatcgctt 3540 cgagtactct caccggagga aaaacggtgt gaggaccttt tccaaaatac agtccaaagg 3600 aatccggatg gtagatatac tgttacactc cctaaagaag aaagcgctgt cacacggttg 3660 ggcgagtcca aggaaatcgc aacccggcgt cttcaaggaa gcgaacgaag attggctcga 3720 aacgcaaacc tccacaagag ttaccacgat ttcatgaagg aatacgagtc aatgggacat 3780 atggaaagga tcgatgtgac ggatgtagat gcgaaacggc ggtgttttct acctcatcat 3840 cctgttttca aagaggacag cactaccacg aaggtcaggg tggtgtttga cgcttcctgc 3900 gctacgtcat ccggcgtgtc acttaacgat atgttactga ctgggccagt aatacaggag 3960 gatttaaggt cgatcatttt gcggagtcgt accaagcaga tcatgctcgt ctcagacgtc 4020 gaaaagatgt tccggcaaat atgggtgcat ccggacgaca gacacctaca gtgcatcttg 4080 tggcgctcca atccccagga agatgtcagc gtatatgagc tcaagaccgt tacgtacgga 4140 accaagccgg caccatatct agccacaaga accctcaaac agttggcgat agacgaagca 4200 acaacttttc ccctaggagc acaggcggca agtgaagata cctacatgga tgatgtcatc 4260 acgggtgcga attctgtgga cgaagcttgc gaactacggc agcaactaag cgaaatgaca 4320 aacagaggtg gattctggct cagaaagtgg gcctcaaact gctccaaggt tttggaaggg 4380 cttactgagg acactggcaa ttagagcggc agacggaatc aatttggatc ctgacccgtc 4440 tgtgaagacc ctcggactaa catggatgcc gaagagtgat cagttgaagt tcaagttcga 4500 cgttccagcg gtgctaccaa accagcaatt caccaaaagg gaaatgctat ccatcatagc 4560 cactctcttc gatcctttag ggctgctcgg agcagcagta gtcactggga aaatcgtgat 4620 gcaacttcta tggaaatatc gcgatatcga cgatcgtccg ctgggttggg acgacccgat 4680 accttcgacg gtgggtgagg attggagaaa gtaccttcaa caacttcctc tgctaaacga 4740 gatcaggatt gatcgctgcg tcatcatacc caacgcagtt tccatggaaa tccactgttt 4800 ttcggatgcc tcggagaagg cgtatggctg ttgtatttac gtgaagagtg tagatgcagc 4860 aaggggaatc aaggttcggc ttctttcgtc gtaatcgcga gtggcaccct tgaattgcca 4920 aacaattcct cgattggagc tatgcggagc tgttctatcg gcggaactct tcgaaaaggt 4980 tcaagaatcc ctcaaaattc ccttgcagag gtacttctgg acagactcta cttgcgttct 5040 tcgttggatc caagcttcac caacgaactg gaccacttac gtggccaatc gagtctccaa 5100 gatccacaca ctaaccaatc cggaaaggtg gagacatgtt cctggcgtgg agaatccggc 5160 cgatcttatt tcaaggggcg tcttgccgga ggacatcatc cacaatgatt tctggtggaa 5220 tggaccacga tggctcgcgg aaaatccgga agggtggtca aaactaccgg aatcatggaa 5280 tgaagaggac gttgagaagg tcgataaaga aaggcgacgt acagcggtcg cttgcaccag 5340 ttctccagag gcggagttca acagctggta tctaggacga acttcatcgt ataccaaact 5400 cgtacgaatc acggcgtact gcttacggta cctgaaacaa ctacgtactc ctcgagatca 5460 gcgaaatcca tcgaagttcc tgacgtctgt ggagttaaga gaagcggaga tcaccctggt 5520 tcggaatgtt caacaggagt tcttcgttga ggagtggaag gtgctagctg aagggaaacc 5580 tattccaagg aagtcgccat tgcggtggta taatccattt atatccaagg acctagttat 5640 tcgagtagga ggaagactgc gacattcgca agaagctgag gacaccaaac acccgatggt 5700 tttgccagct cgtcatgagc ttaccaagat gatccttcga tattaccacg aaaaactgct 5760 gcacgctggg ccgcagttac tgctagctaa cgttcgctta catttctggc cgttgggggg 5820 aagaagtgta gcgcggaata tagtacaccg gtgcatgaaa tgctttcggt ctaaaccatc 5880 gtccattgag cagtttatgg gagagctgcc agcgccacga gtcaccttgt cgcgacctat 5940 ttcgcaaacc ggagtggatt attgtggacc gttttatgtt cgaccagcac cccggcgacc 6000 agtcgtcaag atgtacgttg cgctattcat ctgcctgtgt acaaaggcgg tacacctcga 6060 gatcgtgtcc gatctgacta ccgatcgatt catccaagcc ctacgtagat tcactgctag 6120 gagaggaaga ccgaaggaca tgtactcgga caatggtacc aacttcgttg gagcgcgaaa 6180 cgtgttacgg gaactctaca atttggtgga ttcaaagaat caccaagaaa gcgttacaaa 6240 ttgctgccta gaagagagga ttcaatggca cttcaatccg cctagcgccc cgcactttgg 6300 aggcctatgg gaggccgcgg ttcggtctgc gaaatcccac cttctgaagg ctataggaga 6360 aacaccactc tcgccggagg atttgaacac gctgctcgta caggtggagg cctgtttaaa 6420 ttcccgaccg ttgacgccca tgtcggacga tccaaatgat ttggaaccgt taacgccggc 6480 gcatttcttg attaactcaa gccttcagtc cctaccggat acagaattgg aagacgtgcc 6540 aatgaatcgc ctcgaccgtt ttcaactgac ccagaagatt gcccaagatt tttggaagag 6600 atggcggcgt gaatacttgt gccaacttca agggagaagc aaacgctgga agccatcggt 6660 cgcaatcgag attggcaagc tggtcatcat tagggacgag aatcaacccc ccatgcggtg 6720 gaaactaggt cgtataagcg aggtgcatcc aggcaccgat ggtatcgtga gggtagttac 6780 gttaaaaaca gcttcaggca acttcaaacg tcctgtggag aaactttgtc ttttgccatt 6840 tccggagtca acgtctactc atcaatatca atgagcgaaa ccctaccctt tccttccctg 6900 tcgaagagga gatttcattt tttcttttca gaaattgaag caattcctgg gtgggtgagg 6960 a 6961 // ID BEL-1_CQ-I repbase; DNA; INV; 6864 BP. XX AC AAWU01033868; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_CQ_; KW BEL-1_CQ-LTR; BEL-1_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6864 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 155-155 (2011). XX DR Genome; AAWU01033868; Positions 30630 37493. XX CC Positions [5812-6390] - Integrase core CC 'AAACT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 1403..4741 FT /product="BEL-1_CQ-I_2p" FT /translation="MAPRKSLLGPTLKALTVKYQELQAALNDIFIFADKFK FT AETTASQVSVRLESLNKLWDVLCTTLVEIKSHKDYSPEDDPTYKKDRETLT FT EGYYRVKSLLLDKLKERQEVPCEPTHNDTIVGGTDHARLPQIKLPMFKGNI FT DEWLSFRDLFLSLIHWKSDLPDVEKFHYLKNCLDDKPKRIISGLSTTSVNY FT KLAWDRLQRYYNNHKVLKQRLVQSLFTLPTLAEESATDLHSLVDEFERVLN FT TLDQVVSPVDYKDLLLVNLLVTRLDPVTRRAWEEVSTAKDLVRDSLGDFAD FT DSREESANDTTKDTLKDLCDFLNRRIRVLESLPPKTVNSRAVQHQQQQYVP FT KSKAKKVSYSTVQHAGYSCVACKDNHLLYQCSAFQRLAVSERDALLKTHSL FT CRNCFRTGHQAKNCTSKYSCRNCKGRHHTLVCFQQQPERDNAVTALAVAGS FT NNPSNSKETQGPTTSHVANMAATNPSVSGAATQAASQVLLATAVVIVEDDD FT GNRYPARALLDSGSESNFISERLAQLMKVTREKVDISVLGIGQSGTRVKHK FT IHATVRSRLSNFERDMNFLVLSKVTVNLPTTTIATNGWSIPDGIELADPAF FT FQSKKVDIVLGIECFFEFFETGRRIPLGDNLPALNESVFGWVVSGGLSVPC FT NSTQVKCNVSTSEKMETLMARFWSAEDVGLDDAFSPAERRCEENFQHTVQR FT LSDGRYSVSMPKVEGGISRLGESKEIALRRLHATERRLARDANLRKQYTDF FT MDEYLELGHMSKVEATTPTQVNRCFLPHHPVVKEASITTKCRVVFDASCKT FT SSGVALNDVLLAGPVIQEDLRSIILRSRIKPVMLVSDVEKMFRQIMVHPEE FT RPLQSILWRFSPDEEVGIYELKTVTYGTKPAPFLATRTLKQLATDDGPRFP FT LAARAVNQDTYMDDVITGAEDVQSAIELRKQLDEMMLRGGFKLRKWASNRQ FT EVLAGIAEENLAISMEEINLDTDPGVKTLGLTWMPKTDTLKFQFNIPDLGK FT VPELTKRFILSTIAILFDPLGLLGPVIVTAKIFMQSLWTLLNPNGERLDWD FT EAVPEMVGEVWRRFHSQLALLNELRITRASSALKCSRWNCTSFLTHLRRPL FT EAVRI" FT CDS 4684..6783 FT /product="BEL-1_CQ-I_1p" FT /translation="MELHFFSDASEKAFGSGAYLKSEDPKGRIFVNLLTSK FT TKVAPLKSQAMPRLELRGGVLSAELYVQILQSLKIKIPTFFWVDAMCVLYW FT LQSPPSTWVTFVANRVAKIQALTEGCVWGHVPGVENPADFLSRGVMPADLI FT DLVPFWKPGWVKKTNEQRQPQPKIVTTEEAEEERRNTAASVAAATTAEFNV FT WFISHFSSYPDLVRRTAYWLRLKDLLRMPKEKRFKSDLLTVAELKEAEFAL FT IRLVQKQAFADEWNKLTKGKPVANSSPLRWFNPYISDEQLIRVGGRLRHSL FT ESEDTKHPIVLPARHQLARLLFRHFHEKLLHAGPQLVLGVVRLRFWPLGGR FT SVIREVIHHCKECFRSKPVAVQQFMGELPSARVTTSRPFARTGVDYFGPLY FT VRPAPRRPAVKVYVALFVCLCTKAVHLELVTDLSTERFIQALRRFVARRGI FT CDDIYSDNGTNFVGARNKMAELLLLLKDRKHRDIVSKDCSNQGIKWHFNPP FT SAPHFGGLWEAAVRSTKKLLLKTIGETPVTPEDLNTLLVQVEGCLNSRPLT FT PMSDDPNDLEPLTPAHFLVQSSLQTVPDADLSAIPMNRLDKFQTVQRMLQD FT FWTRWRREYLCQLQGRVKRWKPPIQIEAGKLVVIKDENSPPMRWKMGRIFE FT VHPGEDGVVRVVTLKTADGYLKRPVEKLCILPLPEYSDAMESTDADLQTQS FT " XX SQ Sequence 6864 BP; 1745 A; 1781 C; 1749 G; 1589 T; 0 other; ttctggtcct tcgaaccgga tcggacggat cctggttcgc gggaggaagg acggttggac 60 gctacattgc cgtttgtgga cgaggttcgt cgccattgga ctgcggcgtg gcacggttcg 120 ccatcgctgc tgggaacaat caaagcgcgc caacggtttg gccattcatt cctgggacaa 180 taggacattg taccctcaac ccggacaacg aacttcgcgc ttggacatct acgcgccaac 240 gtgttgcgtt tggtaagtgg aaccactacg tatatgtccg ccgaagcggt cagacattcc 300 cggtactgac acacctccca tccacatcta cgctgtacat tccgctggag cgaaccgctg 360 cgtgcgtccc attggatcga gcgagccatc tgctacgcgc tgcgcggcga gagtgcgttt 420 ccggtcacgg tcagccattt tctggaacat ccggtcagcc atttcctgga ccatccgatt 480 gctgcgttgc tgccggagcc atccgactgc tgcgttgaca cactgttcca gaaaggacac 540 cttttgccca ggtaaaatat tctaagcaca gtgtatgtca gccgaagcga gtcgctacag 600 tgaaactaac accatttttc gaccacacaa ctctttacta cgcccgaacc ggctggacac 660 cggtgattgt acgtgctgtg accacgtgga ggtcgctgac tatctgcacg actccaagcc 720 gacacgcggc gactccgagg tcgctggaga tctacacgac gatctgcacg actccaagag 780 gacatctaac ggtggccaca accgtttgct accgttcaga ccacccgaga aggacgacat 840 caacgtgtct gacacgtgcc aagacggatc gtcacgtgga ctgaagaggc tgcgcaactg 900 ttcgtttcta ccggtgagtg ctaaagaata gtatattgcc gaagctaaga cgacagaaat 960 acacaatttt taaatcgagt ttgtatcctc ttcgcttaca ccacgcatcc ctgaggtcat 1020 cccatccgct gattgattta cgatctgatc tggttcacgt tgttttttgg tcactgctac 1080 tgtttctcca ggtgagatac actagcaaag tatatggctt gccgaagcca aggcaccgtt 1140 gatactacat gtcgttttcc acctcttctt cgcacaccca cgcattcaat ttgagctgcc 1200 cacttgaccg ctaaggacat tttggtttga cgtctcgagt tacctggtgc ttccggttcc 1260 gattgaactc ccaactgctt tgggtacgat cagttactgc taaaagtaag taatttgcta 1320 cagcagtata tgttcttgcc gaagccagga cgaaacggaa tactacagca ccacactttt 1380 tcgctttaca ttgtctgacg tcatggcacc ccggaagtcc ttgttaggac cgacgctgaa 1440 ggcgttgacc gttaaatacc aagagttaca agcagcgctc aatgacatct tcatatttgc 1500 tgataaattc aaggcggaaa caaccgcttc ccaggtgagt gtgcgcttgg aaagtctaaa 1560 caagctgtgg gatgttttgt gcactacctt ggtggaaatc aaatctcaca aagactacag 1620 ccctgaggat gaccctacct acaaaaagga ccgcgaaacg ttgactgaag gctattaccg 1680 agtaaaatcc cttctactgg acaagctgaa ggaacgacag gaagttccat gtgaaccgac 1740 ccacaacgac acgattgttg gggggactga tcacgcccgt ttgcctcaga tcaagcttcc 1800 tatgttcaag gggaacatcg acgagtggct gagttttcgt gacctgttcc tctcactcat 1860 ccactggaag tctgacctac cggatgtgga aaagttccac tacctaaaaa attgtcttga 1920 cgacaaaccc aagaggatca ttagtggact ttcaaccacg tcggtcaact acaagctggc 1980 atgggatcgt ttgcagaggt actacaacaa ccacaaggta cttaagcagc gactggtgca 2040 atcgttgttt acccttccaa ccctcgccga ggaatctgcc accgatctgc attccctcgt 2100 tgacgagttc gagcgtgttc tgaacaccct cgatcaggtg gtctcacccg ttgactacaa 2160 ggatcttctg ctggttaatt tactcgttac gcgtctggat cctgttacgc gtcgcgcatg 2220 ggaggaagta tctacggcga aagatttggt tagagattcg ctaggtgatt ttgcggatga 2280 ttcaagggag gaatctgcta acgacacgac gaaagacacg ttgaaggact tgtgcgattt 2340 cctaaaccga cgcattcgag ttctggagag tttgccgccg aaaactgtga acagtcgggc 2400 tgttcaacac caacaacaac agtacgttcc caaatccaag gcaaagaagg ttagctacag 2460 tacggtgcaa catgctggct actcgtgcgt ggcttgtaag gacaaccact tactctacca 2520 gtgttctgca ttccaacggt tggcggtgtc ggagagggac gcattactca agacgcattc 2580 cctctgtcgc aactgcttca gaactggtca ccaagccaaa aactgcacgt ccaagtacag 2640 ttgcaggaac tgtaagggtc ggcaccatac tttggtatgc tttcaacaac aaccggaaag 2700 ggacaacgcg gtgacggcac tcgcggttgc agggagcaac aacccttcca actccaagga 2760 aacccaaggt ccgacaacat cccacgtggc aaacatggca gccacgaacc catccgtatc 2820 aggcgcagca acccaagctg catcccaagt gctgttggct acagcagtgg tcattgtcga 2880 ggacgacgat ggcaatcggt accctgctcg cgcactactc gactcggggt cggagagtaa 2940 cttcatatca gagcgactgg cgcagctgat gaaggtcacg cgagaaaagg tggacatttc 3000 cgtgctagga atcggtcaat ctggcacaag ggttaagcac aagattcacg caacggtgcg 3060 gtctcggtta tccaacttcg agcgagacat gaacttcttg gtgctctcca aggttacggt 3120 caatctgcca acaacaacaa tcgcaacaaa tggatggtcg attccggacg gaattgaatt 3180 ggcagatcca gcattcttcc agtccaagaa ggtggacatc gtccttggca ttgaatgctt 3240 tttcgagttc tttgaaaccg gtcgaagaat tcccctggga gacaatctac cagcgctgaa 3300 cgagtcggtg ttcggctggg tggtatccgg tggtctctcg gttccatgca actccacgca 3360 ggtgaagtgc aacgtgtcaa cctcagagaa gatggaaact ctgatggcac ggttctggtc 3420 ggctgaagac gtcggtttgg acgacgcatt ctctcccgct gaaagacgct gcgaagagaa 3480 cttccaacac acggttcaac ggttatccga cggtcggtat tccgtctcca tgcccaaggt 3540 cgaaggtgga atctccaggt tgggcgaatc caaggaaatt gcgttgcgac ggttgcacgc 3600 aaccgagcgc agactggcaa gggacgccaa tttgcggaaa cagtatacag atttcatgga 3660 cgaatatctg gaactgggtc acatgtccaa ggtcgaagca acaactccaa cccaggtcaa 3720 ccggtgtttt ctaccgcacc atccggtggt caaggaggcc agtatcacca ccaaatgcag 3780 agttgttttc gacgcttctt gcaaaacatc ttctggagtc gcgctgaacg atgtgttgct 3840 ggcagggccg gtaatccaag aagatttgcg ctctatcatt ctgcggagtc ggatcaagcc 3900 ggttatgctt gtgtctgacg tagagaagat gttccgtcag atcatggtcc acccagagga 3960 acggccattg caatccattc tgtggcgttt ttctccggac gaggaagttg gcatctacga 4020 gctgaaaacc gtcacttacg gcacaaagcc tgctccattc ctcgcaacca gaacgctaaa 4080 acaactagcg acggatgatg gacctcgatt tccgcttgct gcacgggcag tcaaccaaga 4140 cacctacatg gatgacgtca tcaccggagc tgaggacgtc cagtcagcaa tcgaacttcg 4200 gaagcagcta gacgagatga tgctacgagg cggtttcaag cttaggaaat gggcatcaaa 4260 tcgacaggaa gtgctagctg ggattgctga ggaaaaccta gcgattagca tggaggaaat 4320 caaccttgac acggaccctg gagtgaaaac tcttggtctc acatggatgc cgaaaacgga 4380 cacgctaaag ttccagttca acattcccga tttgggcaag gttcctgagc tcaccaaacg 4440 cttcattttg tcaacaattg ccattctttt tgaccctctt ggactgctag gaccggtcat 4500 cgtaacggcc aagattttca tgcaatctct gtggacgttg ctgaacccga acggagaacg 4560 tttggattgg gatgaggcgg tacctgaaat ggtgggtgag gtctggagga gatttcattc 4620 acagctcgct ttgctcaatg aacttcgcat cactcgtgcg tcatctgccc tgaagtgttc 4680 aagatggaat tgcacttctt ttctgacgca tctgagaagg cctttggaag cggtgcgtat 4740 ttgaaaagcg aagatccaaa gggcaggata ttcgtgaact tactcacatc caaaaccaag 4800 gtggcaccgt tgaagagcca agcgatgcca agattggaac ttcgcggtgg cgttttgtct 4860 gctgaactct acgtgcaaat actgcaatcg ctgaagatca aaattccaac ctttttctgg 4920 gtcgacgcca tgtgcgtgtt gtactggcta caatcaccac catcaacatg ggtcacattc 4980 gtcgccaaca gggtggccaa gattcaagct ctcacggaag gctgtgtctg gggacacgtg 5040 cctggagttg aaaaccccgc agatttcctg tcacgtggag tcatgccagc agatctcatc 5100 gacttggtac cattctggaa gccaggctgg gtgaaaaaga cgaatgagca acgccaacca 5160 caaccgaaga tcgtgacgac ggaagaagca gaagaagagc gaagaaacac tgcggcatcg 5220 gtagctgcag caactacggc tgagttcaac gtttggttca tctctcactt ctcttcgtat 5280 cccgatctgg tcaggcgtac ggcgtattgg ctacgcttga aggatctact tcgcatgcca 5340 aaggaaaagc ggttcaaatc ggacttgttg acggttgcgg agctgaaaga agcagagttc 5400 gcattaataa ggctcgttca gaagcaagct ttcgctgatg aatggaacaa actaaccaag 5460 gggaaaccag ttgcgaatag ttcaccactg cggtggttca acccgtacat ttccgatgaa 5520 cagctcattc gagtgggagg acgcttacgt cattcattgg aatccgagga cacgaaacat 5580 cccatcgtac ttccggctag gcatcaactc gcacgacttc ttttccgaca tttccacgag 5640 aaattgcttc acgctggtcc acagctggtg ttgggagtcg ttcgattacg cttttggcca 5700 cttgggggca gaagtgtgat cagagaggtg atccatcact gcaaggagtg tttccggtcg 5760 aaacccgtgg ctgttcagca gttcatggga gaattgcctt ctgctcgggt taccacctcg 5820 cgaccgttcg cgcgtacagg tgtggactac tttggaccac tttatgtgcg cccagcccca 5880 agacgacctg cagttaaggt ttacgtcgca ctttttgtat gcctctgcac caaggctgtc 5940 catttagagt tggtaacaga cttatccaca gagcgtttca tccaggcact tcgacgattc 6000 gtagccaggc ggggaatctg tgacgacatc tattccgata acggaacgaa cttcgttggt 6060 gctcgtaata aaatggctga actcctgttg ctgctcaaag accgtaaaca tcgcgacatc 6120 gtctccaagg actgttcaaa ccaaggaatc aaatggcact ttaatccacc cagcgcacca 6180 cacttcggag gcctctggga ggccgcggtt cgatctacaa agaagctatt attaaaaaca 6240 atcggcgaaa cacctgttac cccagaggat ctcaacacgt tgctggtcca ggtggaaggc 6300 tgcctcaact ctaggccgct aactccgatg tccgatgacc ctaatgacct ggaaccatta 6360 acaccggcac attttctcgt gcagtcatcg ctgcaaacag ttcccgatgc tgatttatca 6420 gcaatcccga tgaacaggct cgacaaattc cagacggtgc agcgcatgct gcaagatttc 6480 tggactcgat ggcggaggga gtatttgtgc cagctacagg gacgggtcaa gcgctggaag 6540 ccaccaatac agatcgaggc cggtaagctt gtggtcatca aagacgagaa ctcaccacca 6600 atgcgatgga agatgggaag aattttcgag gtacacccag gcgaagacgg agtggttagg 6660 gtagtaacgc tgaaaaccgc cgacggctat ctcaaacgac cggtggagaa actgtgcata 6720 ctgccacttc cggagtacag cgacgccatg gaatcaacgg atgcagatct tcaaacccaa 6780 tcctaacttc ctatccctat tcccacatcc ccgaagaggt tttttgtatc ttttcagaaa 6840 ctggtcattt ctgggtgggt gaga 6864 // ID Copia-18_AA-LTR repbase; DNA; INV; 148 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_AA_; KW Copia-18_AA-I; Copia-18_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-148 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 946-946 (2011). XX DR [2] (Consensus) XX SQ Sequence 148 BP; 42 A; 37 C; 19 G; 50 T; 0 other; tgacatcgaa gcaacctgct agtttggacc aatcgtttat ctgaacacag taaccttggc 60 ttctttgaca tcttttactt tttatcattc tcaaaccaac ctctgaataa accacacgtt 120 ttaattaaac gagctctgcg ttcattca 148 // ID Gypsy-22_SI-LTR repbase; DNA; INV; 133 BP. XX AC AEAQ01030800; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_SI_; KW Gypsy-22_SI-I; Gypsy-22_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030800; Positions 664 532. XX SQ Sequence 133 BP; 40 A; 20 C; 45 G; 28 T; 0 other; tgtaagaaat ggagatgtaa taagatgtca tgcgtggcca gatggcacta gccggcggac 60 gcgctatagg aatttgccta ttttgtacgt taggaaggag aaggagaagg ggaatcgcga 120 ggattccaga gca 133 // ID DNA-8-1_HM repbase; DNA; INV; 2817 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE non-autonomous DNA transposon from Hydra magnipapillata- DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-8-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2817 RA Bao W. and Jurka J.; RT "nonautonomous DNA transposon from Hydra magnipapillata."; RL Repbase Reports 8(12), 2076-2076 (2008). XX DR [1] (Consensus) XX CC TSD is 8-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 2817 BP; 1123 A; 316 C; 353 G; 1024 T; 1 other; cactgatctc aaaatataac tggatagcgg attctattgt tttttccgaa aaaagttgaa 60 gccaaaattg gcctcccaag atggcggcga tttaagttcc tagcggtgaa accaatatag 120 attttatatg gctttatcaa actacgatag aaaataatta aataaggcta ataaatttgt 180 caaaaataaa agaaaacaaa tgaattttat aaatattaga gaaaaaaaaa agaaattatt 240 ttttgaataa ttcaatttat tattaaatgg catttacaat tttagttaaa ttttgtgtat 300 aatttatgta ccttttagtt cttagttacc caatgctgta tgctagaagg ttaataacat 360 aacagcatgg taagcagcaa tcatctctac ctacaatttt atattctcta ttttttgtag 420 caagatacca ttttgtacaa atgctgcatg aactattaac tgagtgctgt cagtatacag 480 gggtatacaa actcaggggt tccttctaga attatattat aattctatta ttgcataaat 540 tatttgaata aaacattcta aagtgtcact tatgctggta aaaaaaaaac aggtctgaat 600 tatagttatt tatatcaact tttaaaaaaa ataaaaagta aataatcatt tcttataatt 660 taaaggataa tcactcaata gaattgagga tgtatgtaac aaattttaca tacatcctca 720 aacaaacatt tgtttgaaaa aaaaacaaat ttattaaatt tttaatatag aaatattcaa 780 aacacctctt tgaaatatct cattaactaa agttatatat actgtttacc tgctgttaag 840 tattatggtt atatgttata gtttatataa aggctgactt ttttatatta ttaaactgta 900 tgttgcagtt aaataattat ataaatatgt tgttgagttt atttgcataa aagattgatt 960 acatcacata ttaaattcac aaaacctatt tgccaaagga tacaagtcag cattgtgtgc 1020 atgcagcaat gctctttgtt ttccatttct tgacatggta tataattata tcattagtct 1080 taaagagatt aactgttacg caaatgaatt tgtctagtct aaaaactggt ttcattggat 1140 atttcgtagc aattaaaata tatatctcga ctgagtagaa aaagaaggtt cgccataaaa 1200 ttatttaaca tataagctca gtcaaaacca tctaacaagt tctttattgt gctgtgagat 1260 cttatttagc tgtgagatct gcaagaatgt ttaacaacaa tatcaattgc caacagttta 1320 cagccacata taaacaactc ttgatgtgaa ttagggttca attttcaaat ggaaactgca 1380 gcatcagaga taaaactcat atttcaaaaa ttttagacaa ttcttttgct cactccagta 1440 ctaaaatatc tgactaaaat tgaattaaca aaaaaataat gatttgttaa agagaaaacc 1500 aactaatagt gaccacgaat atgcagatat tccaaattgc agcattctat ctaaatacaa 1560 aaaagctttg atctcatata ttgccagata tgtagctaaa atggcatcaa aaaaaaaatc 1620 tgcattatat atatatatat atatatatat atatatatat atatatatat atatatatat 1680 atatatatat atatatatat atatatatat atatatatat attttagtaa agtacaacca 1740 aataaaaaat aaagtacata taattatata taaaaaaaat tctagaagga acccctgagt 1800 tgtgtgtact cacagagctc agctaatagt tcatgcagca tttgtaaatt tgtgatatct 1860 atttgaaaaa ctaatttata atagtttatt ttaaaaagat attcagctgc ctaccatgca 1920 gttgtgttat taaccttcta gcacacagca ttgggtagtt aagaactaaa aggtacataa 1980 atttgtacaa aaaacaataa caaacaatta atttattgta aaactaaaat taaaagaaag 2040 taaataaatg aaatataaat aaaaaatgat taaaattatt tcagtttttc acatttaaaa 2100 taattattag aaagttgttt ggaaattgtt aaattggata actactaggt tcatatttgt 2160 tttttatttc tactgatagg ttataactag ttatgaaata ttagtggaac ctagtagtta 2220 actactaggt tataggtata attctttact aatgttaaat aaactaattt ttaattaatt 2280 taactaaaaa attttatctt atttaattaa attaaaaaaa aaaaaaaaaa gaaagatatt 2340 acacaaactt attgtttgag taaattatta agagtataat atttattgtt tgagtataat 2400 atttattagt aatatatatt gtgatatata tattagtaat atatattgta atattatawa 2460 agtataatat ttattagtaa tatatattgt gatatatgta ttagtaatat atattgtgat 2520 attataagag tataatattt attagtaata ttgtttgagt agattattaa gagtattgtt 2580 aaacaaaata ttatatatat tttagaaatt tacgtaaata tgtattattg tgtttctttc 2640 taaaacaatt gctatgtaaa accgaaaatt gcatttttgt ttgaggcttg tacatttatt 2700 tttatatata taaaaaaaaa aattgtaatc aaccaaatac gccgccatct tgggaggcca 2760 aaactggctt caaaaaattt gccgtatacg ctatccagtt atattttgag atcagtg 2817 // ID BEL-93_CQ-LTR repbase; DNA; INV; 806 BP. XX AC AAWU01007335; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-93_CQ_; KW BEL-93_CQ-I; BEL-93_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-806 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 314-314 (2011). XX DR GenBank; AAWU01007335; Positions 19344 20149. XX SQ Sequence 806 BP; 287 A; 157 C; 203 G; 159 T; 0 other; tgttccgaca acactgccac caggtcgaca taagtcacgg cggcacagac acggtcgcag 60 caacgccgca aatcgtgtca acgcaacgcg ttgacaactg tcacgagaga gagataggga 120 gaaagagtga gagagagcga ggagaaagag cgcaaaaagc gatgtaaaca aacaagcaca 180 agtagtagga aaaaagtgaa aaacgtttac gagcagctag acgcgcgtga ataggggtta 240 aaaggtgcgc ttttgtgcag tttcgcgagt ttggtgcatt ttgtgcgtgc aacgagaagg 300 ttgacgatca cgtacaggta gaagctacga tcacgaacag gtaggcgagt aaaactgcat 360 gcagacgaaa accagagact aaaatatgtc ctaaccctaa tagtacggtc gcaagcggtt 420 gattagacgt tccgacggga cgtagaaggc cagttgacct cgaacagcac aagaacgaca 480 cagtacagga cagaattgga ttccggcgga gtcagcaggg aatttcaatt gaacttgtaa 540 gttacactga acaaaaagtg cacgatagat ttaaaacatt aaaatgatga atttcgatag 600 gattgtggat aggacggtta caaggagcag gaaaggattt ctggaaagga tcaaactaaa 660 aaatgtaagg aaacacacac acacacacac acatgtacat ttacaaacat actaaaaact 720 gttcttaaat acattcaaca gtttgatctg ctgcgaagcg gctgattcac aaaatttcgt 780 tggattattt ctacaccgtc cgaaca 806 // ID BEL-26_CQ-I repbase; DNA; INV; 4188 BP. XX AC AAWU01010404; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-26_CQ_; KW BEL-26_CQ-LTR; BEL-26_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 205-205 (2011). XX DR GenBank; AAWU01010404; Positions 27568 23381. XX CC 'CTCTG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 809..3880 FT /product="BEL-26_CQ-I_1p" FT /translation="MDHKFIVNPNGRCRSCTNLDDQHMLMCSHCDRFFHLQ FT CVGLGRMPNKNENWHCIKCDAVAKTLQSFDQKVKQLEKSIEDVSTKPSTEN FT KDILTCQFLQGIVNKLEGLSVQPRTADSFLFRQTLIDLPKFNGSFKDWPKF FT KQTFFETTKLGNFSDIENLNRLEKCLKGEALLRVNSLLSDSSNVSKIMNVL FT EERFGSIEQIYNGLMNDIIKVRNPSYDNPKSMVDFISAIGDLVINMESLHH FT EEYLNDPRLVRDLMKKLPSGLQNQWLDNINEEKTLSSPTNPYKAPTIKDFY FT TFLKPKEKVAIAKLVDSDSSNSRTKSERILYAQDTCKTSCIKCNLNSHKLI FT QCNEFKNLTPEEKQTFVMEERLCFSCLGANHLAKQCRFAKDCNIEGCKSKH FT NRLLHVKRTENTDSEIEDSGNEHSVNCHVKKYNNVFYQIVPVTLINGSNKK FT ETFAFFDSGSSVSLINAKTAVELQAKGAKRSMTLAWTNGETQEDLNSMSVQ FT INIQALNGRKFNLKDLRTVNDLKLPTQSLDTNRLKKQFPHLQGSKLASYNN FT AMPTLLLGLPHAYLFKSIKELSGKFNEPIGRMTKLGWVLFGRSNFGKIKKN FT HLFTIQEVKKNLVSKEDAHENYSQRAKEQFGQISVREFYQDKKDEESEALQ FT NQAIIVGAISPINCVQKLTNGENIKKVHILNDQHKENLDDLKTETKCVLYT FT CDIPIKLNFKMTCLKFDIKLIKQYLPIVMNFPIKEINIYGQKHDDNNRRCT FT MWNENDSIFSINKETKLQPKVNSDRCELKNVIFPANDTDINFNEKSRFKNI FT DIVIPENNVLVQNSLNINSEIIGTNFFKKKEFNKASNTTTQTTMINSASSS FT VLNSTKYTKRNYLAYVGSLYDPHGLILHIKKHANIVLKDIWYSSAHWDDCV FT PAEIMYKWQVWIAQLSSFKNLSNLEPNENNIIKRQINVYFGNNHKAFIRVQ FT SFCDKNIIKTTTRLVKSNIETEVNISTPRSEIHDYMFKTKLSNTILKSHTF FT GTNQVTFRSDSKRITLEK" XX SQ Sequence 4188 BP; 1663 A; 670 C; 759 G; 1096 T; 0 other; tggtggctct gcagagagga ttaggagaag tgcttgtcgg aatcaagcac tgaggactcc 60 gagcttccgg tcagtcgaaa tctgatcgtg atcgtaagta aaagcgtgtc cgacgctata 120 gagcggacaa gtgatcaagt gtgagcagtg tcaggctcag ttagtgcaga cgaattatcg 180 tcaagagtgt aaaagcgtct ccgacgctct agagcggtca agtgaccaag tgtgagcagt 240 gtcaggctca gatagtgtag acgagttatc gtcaatagtg aataatcctg gacaatagtc 300 cagaagaaca ccgtgagcga tcctgggcaa gaacccagaa gaagaagaaa acaagaagaa 360 ccttttaata ggtgattcaa cattaaacgt cgtcgattaa actctacttc aagtaaaggg 420 aagactggat gaaaatccag agatccgctt ggttgcagaa aatcaccaaa aaatagtaaa 480 aaaaaaaaaa aaaacccttt taatagatag gtgatttcaa gaagtgaagt gagtgattaa 540 aggaagacta gaaacacata tatataaaaa atatatatat ataataaagg aataaaaact 600 gtaagattgc aaagcagatt tagtgcaaag tgagtgaaaa aaatatataa aaaaaaaaaa 660 tataagaagt aacacagtga cgtcgaacat cgaagctaca tcaacgaagc ctaaggaggc 720 tcgagaagta ccacaagcaa cgtcgaacca gggaacatcg agcatcaagg atcgacgaag 780 catcatcgaa cataaggtat ttagaaatat ggatcacaaa ttcatagtga atccaaatgg 840 taggtgcagg agttgcacca atcttgatga tcagcacatg ctaatgtgca gtcattgtga 900 tcggttcttc cacctgcaat gtgtaggact tggccgtatg ccaaataaga atgaaaactg 960 gcactgcatc aagtgtgatg cagttgccaa gactttacaa agttttgacc aaaaggtcaa 1020 acaattagaa aaaagtattg aggatgtttc aactaaaccc tcaacagaaa acaaagatat 1080 attgacttgc cagtttttac aaggcatagt caataaatta gaaggactgt cagttcaacc 1140 aagaactgca gattcatttt tatttaggca aactttaata gacttgccaa aattcaacgg 1200 ctcttttaaa gattggccaa aatttaaaca aactttcttt gaaaccacca agcttggtaa 1260 tttctctgac atagagaact taaaccggct tgaaaaatgt ttaaaaggag aagctctact 1320 tagagtcaac tcccttttat cggattctag taacgtcagc aaaattatga acgttctaga 1380 agagcgattt gggagtattg aacaaattta caacggatta atgaacgata ttataaaagt 1440 tcgtaatccg agttatgata acccaaaatc catggttgat tttatatcag ccatcggaga 1500 tttggttatt aatatggaga gtcttcatca tgaagaatat ttaaacgatc ctcgactagt 1560 tcgggatcta atgaaaaaac ttccatctgg tcttcagaac caatggttag ataatataaa 1620 cgaagaaaaa acgctatcat cacctaccaa tccttataaa gctccaacga taaaagattt 1680 ttacactttt ctaaaaccaa aagaaaaggt agccatagca aaattagtcg actctgatag 1740 ttcaaattca agaactaagt cagagcgaat tctctatgca caggatacat gcaaaactag 1800 ctgtatcaag tgcaacttaa atagtcataa attgattcaa tgtaatgagt tcaaaaattt 1860 gactccagaa gaaaagcaaa catttgtaat ggaagaaagg ttatgcttct cttgtttagg 1920 tgcaaatcat ttggcaaaac aatgtcgatt tgcaaaagat tgtaacattg aaggttgcaa 1980 aagcaaacac aatcgtctac ttcatgtgaa gcggacggaa aatactgatt cagaaatcga 2040 agactcggga aatgagcatt cagtgaattg tcacgtcaag aaatataaca atgtattcta 2100 tcaaatcgtt cctgtgacgt taattaatgg aagcaataaa aaagaaacat ttgcattctt 2160 tgattctggc tcatccgtca gtcttataaa cgcaaaaaca gcggttgagc tccaagccaa 2220 aggggctaag agatctatga cgttagcatg gacaaatggt gaaactcaag aagatttaaa 2280 tagtatgtct gttcagataa atattcaagc attaaacgga agaaagttta atttgaaaga 2340 tttacgtaca gttaatgatc tgaaattgcc aacgcaatct ctggacacaa atcgccttaa 2400 aaaacaattt ccccatctac agggtagtaa gctggcatcg tataataacg ccatgcctac 2460 attactgcta ggtttgcctc atgcatacct atttaaaagc attaaggaat tatcaggaaa 2520 attcaatgaa ccaattggca gaatgaccaa attagggtgg gtcctgtttg gaagaagtaa 2580 ttttggaaaa atcaaaaaga atcatttgtt tacgattcaa gaagtaaaaa agaatttggt 2640 atctaaagaa gatgcgcatg aaaattattc tcaacgtgca aaagaacaat ttggtcaaat 2700 ctctgtaaga gaattttatc aagataaaaa ggatgaagaa tcagaagctt tgcaaaatca 2760 agcaataatt gtaggagcaa tcagcccaat taattgcgta caaaaactga caaacggaga 2820 aaatataaaa aaggttcata tcttgaatga tcagcataag gaaaatttag atgatttgaa 2880 aacagaaacg aaatgtgtct tatatacatg tgatatacct attaaattaa actttaaaat 2940 gacatgtttg aaattcgaca taaaattgat taaacaatat ttgcccattg taatgaattt 3000 tccaataaag gaaataaata tttatggtca aaaacatgac gataacaaca gacgttgtac 3060 gatgtggaac gaaaatgatt ccatattttc aattaataaa gaaacaaaat tacaaccgaa 3120 agtaaattcg gatcggtgtg aattgaaaaa tgtaattttt cctgcaaatg atacagatat 3180 aaatttcaat gaaaaatctc ggtttaaaaa tattgacata gtaattccag aaaataatgt 3240 attagttcag aatagcttga atataaattc tgaaattata ggtacaaact tttttaagaa 3300 aaaagaattt aataaagcat cgaacacaac cacacaaaca acaatgataa actcagcgag 3360 ttcaagtgtt ttaaattcaa cgaaatacac taaaagaaat tatttggcct atgtgggtag 3420 tttgtatgat ccacacgggc ttatcttaca cattaaaaaa catgcaaata ttgttttaaa 3480 agacatttgg tacagcagtg ctcattggga cgactgtgtg cctgcagaaa ttatgtacaa 3540 atggcaagta tggatagccc aactatcatc attcaagaat ttaagcaatc ttgaacctaa 3600 tgaaaataat ataattaaaa gacaaattaa tgtgtatttt ggaaataacc acaaagcttt 3660 cataagagtt cagtcctttt gtgataaaaa cattatcaag accacaacca gattggtaaa 3720 atcaaatata gaaacagaag taaacatatc aacaccgcga tcagagatac acgattatat 3780 gttcaaaaca aaactttcaa atactatttt aaaatcacat acattcggaa ctaatcaagt 3840 cacgtttcgg tccgattcaa aacgaattac tctggaaaaa taaacaacaa tgagatattt 3900 aaaactaaaa ctaaaaagca gatagattta aatcattttc aaacagcaaa taatcaagaa 3960 aaaatcattc aaagaacgca atggcgttat gtgaacaaca acagagaaga acgtaacaac 4020 acgaactggt ttaaaggtcc aggttttcct atatagaatt ttataaaaac ccgataatac 4080 tcagagttat atttttataa aagtgcaata aacaaaacaa aaagaaaatc agataaagtt 4140 cgcggcttcg aaaatttgta aaccaaggat gctttacggg ccctggaa 4188 // ID hAT-1_TCa repbase; DNA; INV; 2996 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 21-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-2996 RA Jurka J.; RT "Mariner/Tc elements from insects."; RL Repbase Reports 9(3), 678-678 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS join(582..1367,1270..1734,1685..2728) FT /product="hAT-1_TCa_1p" FT /translation="MNPQYDSVVLLLNIPFSRRKEIEKRQILELGRPTPRL FT CMVKLAGSHSRSFQNSWYASHXWLCGSIFKESLFCWPCLLIGNVKNVWSSG FT NGFKDLKNFSRGVKLHEASKEHIKNCIGLKRIEKNTLSVAEALQEHSNFNK FT ILFNENVRKNRLLIAYLIDVTLLLGKQELAFRGHSETESSVNQGNFREIFQ FT VLIKRNTELTEHVSKFSNIFXGQSKTIQNELISCISDHLREHIFNEITSTN FT FFCNCGRYNRCYRKIAVLSYNGNIYLMKLQAQIFFVIADDTTDVTEKSQCS FT LTIRFVNNATIQERFFGFFDVSADRSAESLYLVLMNALNRFDIKNKLVAQS FT YDGAAVMAGELNGLQAKVKSVGPQALFTHCCAHRLNLVLQQGTXCISKCRI FT FFATLVGIPSFFLSIAEKNICFKVFRXFFYQSPKRTFVLNNIIGKSIPRAG FT ETRWSTRSKIISLVSAEWNNFICVFEEIMNNSTSGAESIRMAKGFLKNFSD FT FEFAFLTFVFNDIFXITDVLFQILQKKVLNISYCQDRIKDTENRINTLRTE FT VFKKIFNIAKNFTGLPSNKRNSSSNEIELFNEYKVLFYEILDYIXMQLNTR FT FQDFEKLSFFSLVDSSQFSEFDKKFPSSFLTKLNSIYPNIFDLSRLENELA FT VIYRDSQFSEAEPIKILELLDSDYKDVFSEVRKLLMIILTIPATSVSAERS FT FSCLKRIKTYLRNSIGQDRLNDLAFXSIEKELLNDMSKNPNFYEEIIDKFA FT DLKEXRIDLIYKK*" XX SQ Sequence 2996 BP; 951 A; 474 C; 541 G; 1011 T; 19 other; cagtggcggc tcgtcagatg aggcaactga ggccgggcct cagttaattg aaacgatttt 60 tttacataat tttgtttttt ataattatta tcaatcaatc attatttaaa tccaaattgc 120 ttcctagtgg agagaatttt cgttaaaaaa tgttccagtt ccgaacatgc ttccacttta 180 acgtttcaca cccagcattc ttgtccaacg ccagcgctaa ggtatcatcc tgtttcatct 240 aaagagatgt gaggcagaat ttttcgagcc tcagtttaag agttttgcgc atgcgcagta 300 aggtctgtta aagcgaaaga gaagaaacat gcatcaatgt ttcttctttt atttgaggct 360 gagatttcgg agcctctcgg cactctcggc gtctcggcga acatcgtcgt cgggaatcag 420 acttaaggaa tgcttaagga taaaatttcc tatttgctta tctctgtctg gatgttaaat 480 cggcaccaga tcacgtccac tgcagcagtc agtcttgtat tgtcagatag cattggtgta 540 gatgagtagg tgtttaggaa tcaggaaaag gtaagatcaa gatgaatcct caatatgatt 600 ctgtagtact gttgctaaat ataccatttt ctcgacgtaa agagattgaa aaacgtcaaa 660 ttttggagtt aggaagacca actccacggt tgtgtatggt caaattagct ggttcacatt 720 cgcgatcttt tcaaaattcg tggtatgcta gtcatanttg gttatgcggt agcattttta 780 aagaaagttt attttgttgg ccgtgcctgt tgattggcaa tgtaaaaaat gtgtggagtt 840 cgggcaatgg tttcaaggac ttgaaaaatt tttcacgcgg ggtaaagtta cacgaagcat 900 ccaaggaaca tataaaaaac tgcattggtt taaaaagaat tgaaaaaaac acacttagcg 960 tcgcagaagc attacaagaa cactctaatt tcaacaaaat tctattcaat gaaaatgtta 1020 gaaaaaacag actcttaata gcatatttaa tagatgtaac attgcttctt gggaagcarg 1080 aactggcttt tcgtgggcac agtgaaaccg agtcgtcagt taatcaaggt aattttcggg 1140 aaattttcca agtcttgatt aaacgtaata cggagctaac tgaacatgtt tctaaatttt 1200 ccaatatttt taycggtcag tcgaaaacaa ttcaaaatga attaatttct tgtataagtg 1260 atcatttaag ggaacatata tttaatgaaa ttacaagcac aaattttttt tgtaattgcg 1320 gacgatacaa ccgatgttac agaaaaatcg cagtgctctc ttacaattag atttgttaat 1380 aatgccacaa ttcaagaacg tttttttgga ttttttgatg tgagcgctga tcgatctgcg 1440 gaatctcttt atcttgtttt gatgaatgct ttaaacagat tcgatattaa aaataaactt 1500 gtggctcaaa gttatgacgg tgctgccgtt atggctggag aacttaatgg tctgcaagcc 1560 aaagttaaat cggttggtcc acaagctctg ttcactcatt gttgtgccca tcgattaaat 1620 ttagttttgc aacaaggtac aaawtgtatt tctaaatgta gaattttttt tgctaccctt 1680 gtaggtattc cgtcwttttt tctatcaatc gccgaaaaga acatttgttt taaataacat 1740 aattggtaaa agtataccga gggctggcga aactcgctgg tctacacgtt caaaratcat 1800 ttccttagta tccgcagaat ggaacaattt catttgcgtt tttgaagaaa ttatgaacaa 1860 ttcaacatca ggagcggaat ccatacgaat ggcaaaagga tttttaaaaa atttttctga 1920 ttttgaattt gcatttttaa cattcgtttt taatgatatt tttgmwatca ctgatgtgtt 1980 atttcaaatt ttacaaaaaa aagttttaaa catttcttac tgtcaagatc gcattaaaga 2040 tacggaaaac cgcataaata cactaagaac agaagttttt aaaaaaatat ttaatatagc 2100 aaaaaayttt actggtttac cttctaataa acgtaacagt tcytccaacg aaattgaatt 2160 attcaatgaa tataaagttt tattttatga aattttagat tatattawaa tgcaattgaa 2220 tacgagattt caagaytttg aaaaactttc wtttttttct ttagtagatt catcacaatt 2280 ttcagaattt gacaaaaaat ttccttcttc atttctaaca aarttaaatt ctatttaycc 2340 aaatattttc gatctaagtc gtttagaaaa tgagttagcg gtcatttacc gtgattctca 2400 attttctgaa gcagagccca taaaaatttt ggaactcctt gactctgatt ataaagacgt 2460 cttttccgaa gtgcgaaaac ttttaatgat aattttgaca attcccgcaa ctagcgtttc 2520 agcggaaaga agtttytcct gtytaaaacg tattaaaaca tacctgcgta actccatagg 2580 acaagatcga cttaacgatt tggcattttt ktcaatagaa aaagaattac ttaatgatat 2640 gtcaaaaaat cccaactttt atgaggaaat catcgataaa tttgccgatc taaaagagmg 2700 tagaattgat ttaatttata agaaataaat ctttaattat atagttagga ttttgttttg 2760 ttttttgttt gtttttgttt ttgattttat aagaattttt attttatttt aatagcaaag 2820 aaaaaagtgc tttgagcggt ttacgggtaa agtagggctg tcgtggcgat gactccctta 2880 aaaagtcaac gtgttgggca gattacctca tgggtgacat ggaaatatct gcaattaatt 2940 tgttgcttct ggtttctctt ggcctcacct gactcgagag tcacgagccg ccactg 2996 // ID Gypsy-36_AA-I repbase; DNA; INV; 4203 BP. XX AC AAGE02023632; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_AA_; KW Gypsy-36_AA-LTR; Gypsy-36_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4203 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023632; Positions 26765 22563. XX CC Positions [3181-3639] - Integrase core CC 'GTCTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 16..4161 FT /product="Gypsy-36_AA-I_1p" FT /translation="MAQNANPGVPVNNAELPANVIPAAGARALPPNFHIDP FT YDRRKIRWCRWVERLETAFAIYDVNDAVLRRNLLLHLMGPETYEIACDKVA FT PQNVREMDYQQIVNTLEAHFNPQPLEISENFRFKCRRQGDKNAASPDETVD FT EYLVALRRIAVTCNFGNYLETALRNQLVFGLKRNDIRGRLLERRQLTLQDA FT IDVAVSMELSLKGGAEIEGAMAKQEVNALQKPQGKAKNKNVGKSLQSSSRN FT ADDLHCFRCGDKSHLAKKCKHQNTICSYCKLKGHLERVCMKKAAGKRSEFG FT PSGLKSAPARAHHVEEQCEEESIDNIAVDEICMVASRPGDAKMWATVIVNG FT VPVRLELDTGCPVSIVNAECYGKHFKNIPLRNCSLKLVSYCNANIDVLGYF FT DAEVDYQGTKKVLPLYVVRSTKHPLLGREWLKALAVDWNKVLQSSCVNSIA FT SADRSAVLQSLFAKYRNVFGDSIGRIASVQANLRLKPNAKPVFLKARKVPF FT NMKKVVEDELDKLVAQGVLSKVDQSERATPIVPVKKSENRVRICGDYKQTV FT NPLLMVDRHPLPTVDELFSSLYGGDKFSKIDLVQAYLQLEVAPEDREILTL FT STHRGLYRPNRLMYGIASAPAIWQRQIEIILQDIPGVSVFLDDIKITGPND FT EIHLRRLDEVLRRLSYYGIRVNQGKCVFFADKIEYCGYEIDREGIHKMKKK FT VEAIQEMRRPKTKDEVRSFVGMINYYGRFFENLSTLLYPLNNLLKNNVVFK FT WTSDCEQSFQDVKKRMQSENCLVHYSPELPLLLATDASPYGVGAVLSHLYP FT DGSERPIQFASQTLNSIQQRYMQVDKEAYAIVFGVKKFFQYLYGRRFVLLT FT DNQAITKIFGEHRGLPVMSALRMQHYATFLQSFDYEIRFRRSADHANADVM FT SRIPLKQTVSGNVIEESDIVEMNQIETLPLTADELSQATADDKSVNSLIQG FT LKHGQMVDARDRFGIDQSEFSLQRGCLLRGIRVYVPPMLRRRVLEELHTTH FT FGITRTKSLARGYCWWVGMDRQIEEMIANCTDCQSVRPEPSKATPHCWEPA FT TMPFQRVHADFAGPFMDTYFFILVDAYTKWTEIRICKAITAEATEAMCREI FT FSTFGIPSVFVSDHGTQFTADSFQRFLKRNGVVHKMGAPYHPATNGQAERY FT VQTFKQKLKALKCTKSKLQTEIANILMVYRKTIHPSTGQSPSMLMFGRQIR FT SRLDLLLPKNYPKESPNHVVRTFNDGDRVRVRDFLTPNKWQFGRIVFKAGK FT LRYQVRLDDGRLWERHVDHIVGVGECLMPRVASGAIPEPSILLPTVPVFTR FT TTSKPLLIATDTASAAVEDTSITNRPNITAETSSTSITEQPGRLDVARKSE FT GQPLRRSTRIIRAPQKLNL" XX SQ Sequence 4203 BP; 1185 A; 925 C; 1075 G; 1018 T; 0 other; gaattggcga cgaggatggc gcagaatgcg aatccgggtg ttccggtaaa caacgctgag 60 ctgccagcaa atgtgatccc tgcggctggg gctcgagctt tgccacccaa cttccacatt 120 gacccctacg acagacggaa aattcgatgg tgtaggtggg tcgaaaggtt ggaaactgcg 180 tttgcgattt atgacgtcaa cgatgccgtg ttgcgtcgta atcttctact acaccttatg 240 ggaccggaga cctacgagat tgcgtgtgac aaggtagctc cgcaaaacgt acgtgagatg 300 gactatcaac aaatagtgaa tacattggaa gctcacttta acccgcagcc cttagagata 360 agtgagaact tccgcttcaa atgtcgacgc caaggggata aaaacgccgc ttctccagac 420 gagaccgtgg atgagtatct agttgctctc aggcgaatcg cagtaacatg caacttcggc 480 aactatttgg aaacagcctt gcgtaaccaa ttagtattcg gccttaagag gaacgacata 540 cgcggccgct tgttggaacg aagacagctg acactacagg atgctattga cgtcgccgtc 600 agcatggagc tctcgttgaa gggcggtgcc gagatagaag gtgcgatggc caaacaggag 660 gtaaacgctt tgcagaagcc acaaggtaag gctaagaata agaacgtggg aaaatccttg 720 cagtcatcca gcaggaatgc agatgatttg cactgcttcc gttgcgggga taaatcacat 780 ctagcgaaaa agtgtaagca tcaaaacact atttgctcgt actgcaagtt aaaaggccat 840 ttggagagag tatgcatgaa gaaggctgca gggaagagat cggagtttgg accatccggt 900 ctgaaatcag cgccagcgag ggcacaccac gtcgaggagc aatgcgaaga agaaagtatt 960 gacaacattg ctgtggacga aatttgcatg gttgccagtc gacctggtga tgcgaaaatg 1020 tgggcgacag ttattgtgaa cggtgttcca gtgcgtctgg aattagacac gggttgtccc 1080 gtaagcattg tgaatgcaga atgttatggg aagcacttca agaatattcc gctgcgaaat 1140 tgttctttga agttggtcag ttactgtaac gccaacattg atgtactggg atacttcgat 1200 gcagaggtgg attatcaggg taccaagaag gttttgcctt tgtacgtcgt gcgttccact 1260 aaacatccgt tgctgggtcg agagtggctg aaagcattgg ccgttgattg gaacaaagtt 1320 ctgcaatcaa gttgtgtcaa ttcaattgca agtgctgatc gttccgcggt attgcaaagt 1380 ttatttgcta agtaccgcaa tgtgtttggc gattcaattg gaaggattgc ctcggtacaa 1440 gctaatttgc ggctgaagcc caatgcaaaa cctgtcttct tgaaggcccg gaaggtaccg 1500 tttaatatga agaaggttgt ggaggacgaa ttagataagt tggtagctca aggggtgctt 1560 tcgaaagtcg atcaaagtga aagggcaacc ccaatcgtcc cagtaaagaa atcggagaat 1620 cgtgtgcgta tctgcggcga ctacaaacaa accgttaatc cgctgctgat ggttgatcgg 1680 catccgttac caaccgtgga tgaactgttt tcgtcactct acgggggtga caaattctcc 1740 aaaatcgact tagtacaggc gtacttacag ctagaggttg cacctgaaga ccgtgaaata 1800 ctaactctca gtacccatcg agggctatac cgtccaaatc gtctgatgta cggcattgca 1860 tctgcccccg ctatatggca gcgtcagata gaaatcattc tacaagacat tcctggagtt 1920 tcggtatttc tggacgacat caagattact ggtccaaacg atgagatcca cctgcgtcga 1980 ttggatgagg ttctacgtcg attgagctac tacggaattc gtgtgaatca aggaaagtgc 2040 gtcttctttg cggataagat cgaatattgc gggtacgaga tcgatcgaga aggcatacat 2100 aaaatgaaga aaaaggtgga agcaattcag gagatgcgta gacctaaaac gaaggacgag 2160 gtccgatcgt ttgtcggcat gatcaattac tacggaaggt tctttgagaa cctaagcacc 2220 ctgctctatc ccttaaataa cttgttaaag aacaacgtag tgtttaagtg gacaagtgat 2280 tgcgagcaat cttttcaaga tgtgaagaag aggatgcagt ctgagaactg cttggtgcat 2340 tactcacctg aattaccttt gctactcgcc acggatgcgt ctccttacgg tgtcggcgca 2400 gtcttaagcc acttgtaccc agatgggtca gagagaccaa tccaatttgc ttctcagact 2460 cttaacagta tccagcaaag atacatgcaa gttgacaagg aagcttatgc aatagtgttc 2520 ggcgtcaaaa agttttttca atacctgtac ggccggagat tcgtgttact cacggataat 2580 caagccataa cgaagatctt cggtgagcac cgaggccttc ccgtaatgtc tgctctacga 2640 atgcagcatt acgctacgtt tctgcaatcg ttcgactatg agatacgttt tcggagatct 2700 gcagatcatg cgaacgcgga cgtcatgtcg aggattccac tgaagcaaac agtttcggga 2760 aacgtcatcg aggagtccga tatcgtcgag atgaaccaaa tagagacact acctctgaca 2820 gcggatgaac tatcgcaagc aactgcagat gacaagtctg tgaacagttt gatccaggga 2880 ctgaagcatg gacaaatggt tgatgccaga gaccgttttg gaatagacca gagtgaattc 2940 tctcttcaac gtggatgttt gctgcgaggt attcgggtct atgttccacc tatgttacgg 3000 cgaagggttc ttgaggaact ccataccact cacttcggca ttactaggac aaaatctcta 3060 gcaagaggtt actgctggtg ggttggtatg gacaggcaaa ttgaagaaat gattgcaaat 3120 tgtaccgact gccagtctgt tcgaccagaa cctagtaaag caacaccgca ctgttgggaa 3180 cctgctacga tgccgttcca gagagtgcat gctgatttcg ctggaccgtt tatggacaca 3240 tatttcttca tcctggtgga cgcctacacc aaatggacag agatcaggat ttgcaaggct 3300 ataacggccg aagctacaga ggcgatgtgc agggaaattt tcagtacgtt tggaattcct 3360 tctgtgttcg taagcgatca tggcactcag tttacagctg actcctttca acggttcctg 3420 aagcgaaatg gagtagtcca taagatgggg gccccgtacc accccgctac aaatggacaa 3480 gcggagcggt acgtacagac tttcaagcaa aagctaaaag ctttgaagtg tacgaaatcg 3540 aaactgcaaa ccgagatagc gaacatctta atggtatacc gcaagactat tcatccctca 3600 actggacaat cgccttcgat gctaatgttt ggaagacaaa tacgttcccg tctggaccta 3660 ctgcttccca agaactaccc gaaggaatca ccaaatcatg ttgtacgaac gttcaacgat 3720 ggagaccgag tgcgcgtacg agatttctta acacctaaca aatggcagtt tgggagaatc 3780 gtgttcaagg caggcaagct acgatatcaa gtacgactag acgacggaag gctatgggag 3840 cgacatgtag accacatcgt tggagttgga gaatgtttga tgcctcgtgt agcttcggga 3900 gctatccctg aaccgagtat tcttctacca acagttcctg tcttcactcg cacaacttca 3960 aaaccgttgc tgattgccac cgatactgct tcagcggcgg tggaagatac tagcatcacc 4020 aatcgaccaa acataacagc ggagacatct tcaacgagca tcactgaaca gccaggtcgg 4080 cttgatgttg ccaggaaatc agaagggcag cccttaaggc gttcaacccg aattatacga 4140 gccccccaaa aacttaactt gtaattttga attgtattac atttgcttta cgagggggag 4200 agc 4203 // ID DNA-4_CQ repbase; DNA; INV; 3352 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3352 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 45-45 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >92% CC identity. ~1500 bp TIRs. XX SQ Sequence 3352 BP; 1186 A; 463 C; 470 G; 1233 T; 0 other; cagagacgca tcgagctcta caggctcaat aaaaactcct tacctgctct gtatggtaga 60 acagagctaa gctctgtatg cagcgaacag agccagctct gtatggggaa aaataatatt 120 gatttgcata tatctcaaaa acgataagtt ttagaaagtt gacatgttct agaaagttgc 180 tggtaccaaa aggttctata ttattgtctc agacgtcatt acaatttggt tgattttagt 240 tgtgcaaaac aagaaaaaac tatttatttt ctaagataca aggtatagaa tatttttgtt 300 ttcgacaaag ttgctcaact accaaaaatg aacaactctc tcgaagacac caaagctcta 360 tcttcaatag ataccgaaat aaaaaataaa aactattttt ataataaata ctgcttgcga 420 caaacacaaa gcgaccgcgt cgtaggcaca ggtaatatga ggcatgtttg gtagagatcg 480 tctgaagtga aaactagtat agcagagtgt tcatagagag tttttatgtt aaatgctctc 540 gtctggatgg ttgaaagaaa tttcagataa aattatacaa atttataaaa ttatattctt 600 tttattgtac tggtaagtaa ttaagattag aaacaacaaa attattgcac agctattttt 660 atgtttgtct tgacaaaaca aatgtttaac aaaattacaa gttttgtaca acaaattaag 720 ataaaaaaca atttcaaata cgcaagttaa aaaaataaaa actaaatctt ctaacaatta 780 ttttaatttt ataaaaagaa aatcagttaa cataagttta accttataaa gatttcagga 840 aacttttcta ttttatttaa ttcaacagaa aaaaaactat atttttcaag caagataacc 900 ataagaaagt tttaagctct aaataatctc tatgatattt ttttacattt tgtcaccttt 960 tttaaataaa ataagaaaat gaaattaaat agcaaaaacc attttttctt tatttctcaa 1020 gccaggaaat cagtgaagag tttcaagctc taaatgctct ctatggaaat ttgccatttt 1080 attacctgta ttctaaatat ttttttattt ttcaaacaaa ggaaagcagt gaagagtttt 1140 tagctctaaa tgctctctat ggaaatttgc cattttatta cctgtattct aaatattttt 1200 ttatttttca aacaaaggaa agcagtgaag agtttttagc tctaaatgct ctctatggaa 1260 atttgccatt ttattacctg tattctaaat atttttttat ttttcaaaca aaggaaagca 1320 gtgaagagtt tttagctcta aatgctctct atggaaattt gccattttat tacctgtatt 1380 ctaaatattt ttttattttt caaacaaagg aaagcagtga agagttttta gctctaaatg 1440 ctctctatgg aaatttgcca ttttattacc tgtattctaa atattttttt tatttttcaa 1500 acaaaggaaa gcagtgaaga gtttttagct ctaaatgctc tctatggaaa tttgccattt 1560 tattacctgt attctaaata tttttttatt tttcaaacaa aggaaagcag tgaagagttt 1620 ttagctctaa atgctctcta tggaaatttg ccattttatt acctgtattc taaatatttt 1680 ttttttttca aacaaaggaa agcagtgaag agtttttagc tctaaatgct ctctatggaa 1740 attttgctat tttttctcac tacaagattt taacatcaat tgagttgaat aatgtaatta 1800 aatttaagtt ggacaaaatc gataatatat tttattgtcg attatgtttt tgctaaattt 1860 ccatagagag catttagagc taaaaactct tcactgcttt ccttttttga aaaataaaaa 1920 atatttagaa tacaggtaat aaaatggcaa atttccatag agagcattta gagctaaaaa 1980 ctcttcactg ctttccttta tttgaaaaat aaaattttca ggaatggaat catttaggaa 2040 tgtttacttt atgaaaaata agaaatattt agaatacagg taataaaatg gcaaatttcc 2100 atagagagca tttagagcta aaactcttca ctgctttcct ttgtttgaaa aataaaaaaa 2160 tatttagaat acaggtaata aaatggcaaa tttccataga gagcatttag agctaaaaac 2220 tcttcactgc tttcctttgt ttgaaaaata acaaatattt agaatacagg taataaaatg 2280 gcaaatttcc atagagagca tttagagctt gaaactcttc actgatttcc tggcttgaga 2340 aataaagaaa aaatggtttt tgctatttaa tttcattttc ttattttatt taaaaaaggt 2400 gacaaaatgt aaaaaaatat catagagatt atttagagct taaaactttc ttatggttat 2460 cttgcttgaa aaatatagtt ttttttctgt tgaattaaat aaaatagaaa agtttcctga 2520 aatctttata aggttaaact tatgttaact gattttcttt ttataaaatt aaaataattg 2580 ttagaagatt tagtttttat ttttttaact tgcgtatttg aaattgtttt ttatcttaat 2640 ttgttgtaca aaacttgtaa ttttgttaaa catttgtttt gtcaagacaa acataaaaat 2700 agctgtgcaa taattttgtt gtttctaatc ttaattactt accagtacaa taaaaagaat 2760 ataattttat aaatttgtat aattttatct gaaatttctt tcaaccatcc agacgagagc 2820 atttaacata aaaactctct atgaacactc tgctatacta gttttcactt cagacgatct 2880 ctaccaaaca tgcctcatat tacctgtgcc tacgacgcgg tcgctttgtg tttgtcgcaa 2940 gcagtattta ttataaaaat agtttttatt ttttatttcg gtatctattg aagatagagc 3000 tttggtgtct tcgagagagt tgttcatttt tggtagttga gcaactttgt cgaaaacaaa 3060 aatattctat accttgtatc ttagaaaata aatagttttt cttgttttgc acaactaaaa 3120 tcaatcaaat tgtaatgacg tctgagacaa taatatagaa cctttggtac cagcaacttt 3180 ctagaacatg tcaactttct aaaacttatc gtttttgaga tatatgcaaa tcaatattat 3240 ttttccccat acagagctgg ctctgttcgc tgcatacaga gcttagctct gttctaccat 3300 acagagcagg taaggagttt ttattgagcc tgtagagctc gatgcgtctc tg 3352 // ID CR1-71_AAe repbase; DNA; INV; 3987 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-71_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3987 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1159-1159 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 163..918 FT /product="CR1-71_AAe_1p" FT /translation="MHVPSVSISSTVSAPPRPQRYIPFSVSTSTSTISSLP FT AIRDHTQVVHNVPNHQPNTQVSQMNIATQNSVGNVNYSRNTPIQPTTVPSL FT SVAPASTPFKWYYLTRFQPYETQQNIVSFIVSKTNCDPSLISCHKLVRSNR FT DENIPLTFVSFKISVPAEIEAFITAPHFWPAGVSIKPFAVRNESSSQHFLS FT TRQSRIALNMPSPRQRATLQTNYRQIISPLASPLPLSHRVVHQPFLYXQQA FT GRSTPQLTTMV" FT CDS 1014..3800 FT /product="CR1-71_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MYAFTETWLNCDTISSQIFGHSYNVFRLDRNSRNSIK FT KSGGGVLLAVRSNLKSRQLSISNSDAVEQVWVAVSFQTHTLYICVLYIPPD FT RTSDSNMIDQHISSLTSIVNQMEIRDSLMILGDFNLPGIEWKQSSSSYLYP FT DVTHSTISSLVSKLIDSYSLARVSQLNSVRNNNGRMLDLCYGSIEGDVCFS FT LMEAPSPLINQSIHHPPLLIELRGASPCVFMEPVETLYYDFKRGDYCSMNA FT FLSNIEWQDFINQDLEFSVGTFSNIVLYAIDQFIPKHSRNPQPNPPWSNNR FT LKRLKSLKRSALRKYSKHKTPANKRVYNSANSRYKRLNQKLFSAHQRKVQQ FT NLRHNPKHFWNYVNEQRKESGLPNVMSKGNVECSSLNGICDLFLSQFSNVF FT TQETLNDEQIRNAANNVPVHPPVGNHPIVDPEEIGKICFSMKSSTSSGPDG FT IPAYILKKCSISLSTPLSRLFNLSLQVGSFPVQWKRSYVYPVFKKGNKREV FT CNYRGIAALCAVSKLFEKVVYNFLFHHCHHYISEYQHGFMPKRSTNTNLVV FT YTTFIAKALQKGKQVDSIYTDFSAAFDKINHQIAIAKFERLGFSGYFLSWL FT HSYLNGREMAIKLGDAVSSYFHASSGVPQGSHLGPLIFLVYLNDVNLLLKS FT FKLSFADDFKLYWIVNNVDDALFLQSQLGIFTDWCETNRMHLNASKCSVMS FT FSRKRILIDYDYKIRSTSLKRESVIKDLGVLLDPKLTFKEHIDFIASKASK FT TLGFIFRVAKNFRNIQCLKSLYCSLVRSILEYSSVVWAPYYQIDIFRIEAI FT QRKFLRFALRHLPWNDPINLPSYEDRCKLIGLELLSVRREVSKSLFVSDLL FT QSRIDCAQLLSLLNINVPYRQLRSNSFLFLHGARTNYGHHEPFKSMCRSFN FT RCFNEFDFHLSRPTLRKKFHLVLAS" XX SQ Sequence 3987 BP; 1158 A; 953 C; 682 G; 1189 T; 5 other; gcctccaatc aagcctccaa tcaagcctct aaacaagcca tacctcacgc accaaaccaa 60 gccacagctc aagccaccta ttctatccaa gaatgcccaa acgccacaac gctcgctacg 120 tatcactccc aacttccatt gcatctaaat agctcatcat taatgcacgt tccaagtgtc 180 agcatatctt caaccgtttc agctccacca cgacctcaac gatatattcc attttcggta 240 agcacctcta ccagtactat ttcatcgctt cccgctattc gggatcatac gcaggtagtt 300 cataatgtac caaatcatca accaaacaca caagtgagtc aaatgaatat agccacacaa 360 aactccgtag gcaatgtaaa ctacagtaga aacacaccga ttcagccaac caccgttcct 420 agcctttctg tcgctccagc atccacacca ttcaaatggt actacctaac cagattccag 480 ccctacgaaa cacaacaaaa tattgtttca tttatcgtga gcaaaaccaa ctgcgatccc 540 tcactgataa gctgccataa actcgtacgt agtaatcggg atgaaaacat acctctaaca 600 tttgtctctt ttaaaataag tgttcctgct gaaatcgagg cttttattac cgctccacat 660 ttttggccag ccggtgtttc gatcaagcct tttgccgttc gaaatgagtc aagtagtcag 720 cattttttat ctacgcgcca aagtcgaata gccctgaata tgccatcacc mcgccaacgt 780 gctacccttc aaaccaacta tcgacaaata atttctccac ttgcatcccc cctcccgctg 840 tcccaccgag ttgtgcacca accattcctg tatcamcaac aagctggcag atcaactcct 900 caactcacga cgatggttta aatcataaat tattggttta ctaccaaaat ttaggaggaa 960 tcaacaccmg catctctgag taccgtcttg cctgctcaga ctccagttat gatatgtatg 1020 ctttcaccga aacatggcta aattgcgata caatttccag tcaaattttt ggtcattcat 1080 ataatgtgtt ccgcctcgat cgtaactcta gaaacagcat taagaagtcc ggaggaggag 1140 ttttgttagc ggtacgttcc aacctaaaat cacgacaatt atccatttca aactctgatg 1200 ccgtcgaaca agtttgggta gcagttagtt ttcagacaca tacgttgtac atttgtgttc 1260 tttacatacc tcctgatcga actagtgact cgaatatgat tgatcagcac attagctccc 1320 tcacttcaat agtaaatcaa atggaaataa gggatagttt gatgatcctc ggcgatttta 1380 acctacctgg catcgaatgg aaacaaagct catccagtta tctttaccct gatgtaactc 1440 attccacaat aagctctttg gtgtcgaagc ttatcgacag ttatagctta gcccgtgttt 1500 ctcagctcaa ctccgtgcgc aataataatg gtcgaatgct tgacctttgt tatggtagta 1560 tcgaagggga tgtgtgcttt tcattaatgg aagctccttc tcctctaata aaccaatcaa 1620 ttcatcatcc tcctctattg attgaattgc gtggtgcttc accctgtgtg tttatggaac 1680 ctgtggaaac gttgtattac gacttcaaac gtggtgatta ttgcagtatg aatgcttttc 1740 tgtccaatat cgaatggcag gatttcatca accaggactt ggaattttca gttggtactt 1800 tctccaacat agtactgtac gcaattgatc agtttattcc taagcactcg agaaatcctc 1860 agcccaatcc gccatggagc aacaatcgtc tgaaacgctt gaaaagttta aaacgatctg 1920 ctttgcgtaa gtactcaaag cacaaaacgc ccgcgaataa acgtgtttat aactctgcca 1980 attctcgata caaacgcctg aatcaaaagc tgttttccgc gcatcaacgg aaggttcaac 2040 aaaaccttcg tcacaatcca aaacactttt ggaattacgt taacgaacaa cgtaaggaat 2100 ctggtctacc caacgtaatg tccaagggaa atgttgaatg ttcgtccttg aatggtattt 2160 gcgatctttt cctgagtcag ttttcaaacg ttttcactca agaaactctg aatgatgaac 2220 aaatccgcaa tgctgccaac aatgtaccag ttcatccacc agtcggcaac catccaatag 2280 ttgatcctga agaaattggc aagatttgct tctccatgaa atcatcgaca agctccggac 2340 ctgatggcat tcctgcatat attttgaaga aatgctccat aagtttatct acgcctcttt 2400 ctcgtctctt caatctatct ctgcaagttg gatcatttcc agtccaatgg aaacgaagct 2460 atgtataccc agtattcaaa aaagggaaca aacgtgaagt ctgcaattat cgtggaatag 2520 ctgctctttg tgccgtttcg aagttgttcg aaaaagttgt ttacaacttt ttgttccatc 2580 actgtcatca ttacatttcg gagtaccaac atggtttcat gccgaaacgc tcaactaata 2640 caaatttggt ggtatataca actttcatcg ctaaagctct acagaaaggg aagcaagtag 2700 attctatcta caccgatttc tcggccgcgt ttgataaaat caatcatcag atagctattg 2760 caaaattcga acgtctcggt ttctccggat attttcttag ttggcttcac tcctacttga 2820 acggccgaga aatggccatt aaattaggtg atgctgtttc atcctacttc cacgcttcat 2880 ccggggtacc acaaggaagt catctaggac cgttgatatt tctcgtgtat ctcaatgacg 2940 ttaatcttct acttaaatct ttcaaacttt cttttgccga cgacttcaaa ctgtattgga 3000 ttgtaaataa tgtagatgat gcattatttt tgcaatctca gctcgggata tttaccgatt 3060 ggtgtgaaac taatcgcatg cacctaaacg cttcgaaatg ctctgtaatg tcgttctccc 3120 gaaaacgcat tttgatagat tacgactata aaattcgatc cacatctctc aaaagagagt 3180 ctgtcattaa ggaccttggt gtgttactag acccaaaact gacgtttaag gagcatatcg 3240 acttcattgc ctctaaagcc tctaaaactc ttggttttat tttccgcgtc gctaaaaact 3300 tcagaaatat tcaatgctta aaatcattgt actgttcttt ggtaaggtct attcttgaat 3360 attcgtcagt tgtttgggca ccgtactatc aaattgacat atttcgtatt gaagcaattc 3420 aacggaaatt tctgcgcttt gctctgcgac atctcccatg gaacgatcca attaacttgc 3480 ccagttatga agatcgttgc aagttaatcg gcttagagtt gttgagtgtt cgccgtgaag 3540 tttcaaaaag tctctttgtt tccgaccttc tccaatcgag aatagattgc gctcaactat 3600 tatctctcct caatataaat gttccatacc gtcaacttcg atcgaactcc ttcctatttt 3660 tgcacggtgc tcgcacaaac tacgggcatc acgaaccatt taaaagtatg tgccgatcgt 3720 tcaaccgttg tttcaatgaa tttgattttc atctttcccg tccaactctt cgtaaaaaat 3780 tccatctggt tctggcaagc taggtttata ttagtcatag ttattaagta gatttaagta 3840 atctttgtat cagttggaaa mawatgtaat agtctgttga tacgaaaaga agaggaggtt 3900 ttgcgcccat ttgagaaaga gctcaatatt gctcagctca aatgggcttt tccctgctcc 3960 aaataaacaa ataaacaaat aaacaaa 3987 // ID Mariner-9_BM repbase; DNA; INV; 1645 BP. XX AC . XX DT 28-APR-2010 (Rel. 15.07, Created) DT 28-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-9_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1645 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 944-944 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. XX FH Key Location/Qualifiers FT CDS 296..1387 FT /product="Mariner-9_BM_1p" FT /translation="MPGSRRHMDVETATRAVFMLQEGESQRSVARRLGVSR FT RAIRNAWERFLEXGSVTRRSGSGRARATTVQEDRYIRLTARRERTIFARVL FT QNRLRQSTGTRIGDQTIRNQLHEDGQRSRSHVIRLKLTRAHRAARLRFARE FT HAEWTTDNWRNVLFTDESKVKFFSDDRRVRVWRREGERFSEPCIRERDRFG FT GPNVMVWAGISLLGKTELVILEEGPITASRYVDRIIRPHVIPFSQRVGENF FT VFMQDNARPHTARTTQRAYFVFMQDNARPHTARTTQRAFSEANITVLPWPA FT MSPDLNPIEHLWDQLKRSLKSNFSNVTNRQELINALKLCWGRIPEENITHL FT IESVPDRVRECIRNRGGPTRY" XX SQ Sequence 1645 BP; 511 A; 341 C; 357 G; 435 T; 1 other; ctcaaaacaa aatagggccc actctaagtt aatctcgaat ttcgtctaaa aaaaattgtt 60 tggtagtaaa tagtgaagtt aaacatttcg aagtattgtt gacatgttta tctaacgtta 120 tcaatgtttc aagtcaggaa ttcaatggtg tgtagacaaa aaaagtgaag tactccaaaa 180 aacgtttatt actcaaattc tcagtcattt gtaccctcag tcacgttctg aatgttacag 240 tcgcacgtct actttttttc attttgactt gatttcggtg ttactcgcga tcattatgcc 300 tggctctcgg cgtcatatgg atgtggaaac agcaacaaga gcagtgttta tgcttcagga 360 gggagaatct cagagatcag tggcaagacg tcttggtgtg tcgcgacgag ctattcgaaa 420 cgcgtgggaa cgctttctag agncaggtag tgtcacgagg agatctggat caggaagagc 480 gcgagctaca accgtccagg aagaccggta tatcagattg actgcacgtc gagagcgtac 540 gattttcgcc cgcgtcttac aaaatcgact gcggcaatcg actggcacac gaattggtga 600 tcaaaccatt cgaaatcagc tccatgaaga cggacaacgt tccagatctc atgttattcg 660 tctaaaattg acaagagcgc atcgtgcagc gcgtttgagg tttgctcgcg agcacgcgga 720 gtggacaacg gacaactgga gaaatgtact ctttacggac gaaagtaaag tgaagttttt 780 cagcgatgat cgtcgagtgc gcgtgtggag aagagaagga gaacgttttt cagaaccttg 840 catacgtgaa agggatcgtt ttggaggtcc aaatgttatg gtgtgggctg gaataagctt 900 gctcggtaaa actgagctgg taattctgga agaaggcccc ataactgcct cccggtatgt 960 cgatcgaata attcggcccc atgtcatccc attttcacaa agagtgggtg aaaacttcgt 1020 ctttatgcaa gacaatgctc gcccacatac agcgcgaaca acccagagag cttacttcgt 1080 ctttatgcaa gacaatgctc gcccacatac agcgcgaaca acccagagag ctttctctga 1140 ggccaatatt acggttttac cgtggccggc catgagtccc gatcttaacc ccattgaaca 1200 tttatgggac caactgaaaa gaagcctcaa atcgaacttc agtaatgtga ctaatcgaca 1260 agagctcata aacgccctaa aattgtgttg ggggcgaata cccgaggaaa acatcaccca 1320 cttgattgaa agtgtcccag atagggtcag agagtgcata cggaacagag gaggacctac 1380 tcgctattaa gcaccatcaa actcattgat ttagtcaata ataaaaatca aaatcaaact 1440 ttacttaaat caataatttc tgtcaacttt cctaaaaact gtaaattttg actaaattcc 1500 ataaaattgg caaaacttac tcttgttaat gtttacacac tttcaaaaac cttaatttat 1560 tcattcctaa tcataaaaac tgaaaaaaga aaaagaatct gtaaaattca tttaaaaaac 1620 atggtgggcc ctattttgtt ttgag 1645 // ID Copia-15_SI-LTR repbase; DNA; INV; 286 BP. XX AC AEAQ01019900; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_SI_; KW Copia-15_SI-I; Copia-15_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-286 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01019900; Positions 430 145. XX SQ Sequence 286 BP; 76 A; 52 C; 40 G; 118 T; 0 other; tgttgaatta attcatttac attcaaatta ttattattca cttaattagt ttgttagact 60 tattatattg gtaacacaca tattggtgta agcgtgttgt cagcactgaa atgccgtagt 120 tctttttacg ccataaagtt tacagacctc tctctctctt gtgtgttgtt actctcttgt 180 gttacgcttg ttatcgctta caagagtctt ttcagtaata tacttgcatt ataataaagt 240 tatactcatc aagtctgtct tcagcatatt ttctaatccc agcaca 286 // ID L2-1d_3UTR_Cis repbase; DNA; INV; 1303 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE CR1 Non-LTR Retrotransposon from Ciona savignyi. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW L2-1d_3UTR_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-1303 RA Smit A.F.; RT "L2-1d_3UTR_Cis - CR1 Non-LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000002 3' UTR of L2; 85-90% similar to that of L2-1c. Copies CC <2% diverged. XX SQ Sequence 1303 BP; 339 A; 244 C; 162 G; 557 T; 1 other; ttcttaggtt aatatacata atcctatcct tatctaaaac ctattcttat aatacccaaa 60 cccatcctaa atttcccttt cactatcaca tcaccagtat catttatctg tttctctacc 120 actcctttct aactccttta tgttttgatt tgtcttattt ttgtggtcct tgttttatca 180 cctttatcat ttaatataac acctacaata tgttattttt tttttttttt tttttaatac 240 accactttat ctttgaccac gcgattactc accaatctgt ttcaccattt tatatgacac 300 agcaggttgt aactgttaca tggtgagccg atggcaatta ttttaaagtg gatttttgtt 360 gatgtttaaa tttagccggc cctaacagtt accttgtatt tagaatgctc cacatccaac 420 caccactaan gtagcgcact agctcacctg cacccgcatg tatcacttac aagtaaaact 480 ctcttttttt tctctctttc tatatttttg ttttatttaa gaaccctact gtcccgaaat 540 cctctttttt cagagcacca ggagatcaaa tgaatgcagg aggttgattt ttctgttggc 600 gggcgtcact gaaaatcatc ctctaaaaaa ttattctaat ccggaacttg ctgcatagtt 660 tattcatttt tcatttttat ttattgttag ttttgttagt taggcagttg ttgcctactt 720 attgttagtt ctatgcatta ctgttatgca tatctaatgt aaagtgcaat attgttttgc 780 acaaaaaaat gttttttttt ttaattaatt ttaattattt ataaagtgca atattgtttt 840 gcaaaaaaaa aaaaaaaagt tttttctaat taattgtaat taattattta tttatatatt 900 tttattttta cttaatatac ttttaaatgc cttggtaagt gttgcacttg aaacttttta 960 gtggaatttc atttttgggg tttttctttt tccactgtgg cccctacttt ccgggccacc 1020 ccttctccta aagatgactc ccccattgca cgggtctctt taccgcacac ttttcttcga 1080 ggcttagtgt tatcttattt attcatttta tttatttttt tttatttggg tttagcggcc 1140 acgtctaatt ttaccggatt ttaataattt attgcttgtt ttttagacgt gttctaacga 1200 ctgtcgtttt gccgtttttt cccgttttcc aatttttttc aaaaatcctg atttatgttt 1260 tcatcgtgga aacaaaaaat aaactaaact aaactaaact aaa 1303 // ID CRE2 repbase; DNA; INV; 9595 BP. XX AC U19151; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.fasciculata retrotransposable element (CRE2). XX KW CRE; Non-LTR Retrotransposon; Transposable Element; CRE2; KW integrase; retrotransposable element; reverse transcriptase. XX OS Crithidia fasciculata OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Crithidia. XX RN [1] RP 1-9595 RA Teng S., Wang S. and Gabriel A.; RT "CRE2, a new site-specific non-LTR retrotransposon that RT interrupts the mini-exon genes of Crithidia fasciculata."; RL Unpublished. XX RN [2] RP 1-9595 RA Gabriel A.; RT "CRE2."; RL Direct Submission to Genbank (27-DEC-1994)Abram Gabriel, RL Molecular Biology and Biochemistry, Rutgers University, CABM 306, RL 679 Hoes Lane, Piscataway, NJ 08855, USA. XX DR GenBank; U19151; Positions 417 10011. XX SQ Sequence 9595 BP; 2578 A; 2732 C; 2699 G; 1586 T; 0 other; attctctttc tgctgccttt cttttccgcc aactgtccga cctataagca tccggtaaat 60 aacacgggtc taacgtcgca cagtgcacac cgaacgtggt tctccgcacg cctctgcaca 120 ccgagccctc cgacacaaca ggagagccgg ttgcacgagt gaagagcgcc gccgcgaggt 180 gtagtaaaag aaataaataa ataaataaat aagcacacgc agtggccaca gtgaaccggc 240 cactcgctaa ttattttcac aacacacacg cagcagctgc aaaggcgtgt ccttttcccc 300 cccttttccc ttttcccttt tttttttttt tttttttttt tttttttttt tttttttttt 360 taagacgaat ttttccgatt tttggcttta caaaattttt caacttgttg ttttgagcga 420 tttgcacgaa aaagcgccat ataactaaaa tttttatttt tggaaaaaat cgaaattttg 480 ttattattta acgcggcttg tgcagatttt aatcttggct ctcgggactt ttttgcggag 540 ccgtcggcac tcgagaaatt tggttctgaa atttttagtt ttgttttttt gaccgctgaa 600 tacgtccttt tgaggcagtt ctgggcctct ctaagttttg gttccccttt tattttagcc 660 aaggaccgtt ttttggggtc cgttaaaggg ggagttggga ggtttgaggt cgatctgagt 720 cacaaaccga aaagatacag ccaagtgtga gtagtaagtt tttgaaccgc tgggttggtt 780 tttgaagttt ttggtagccg gagggcccgg gccttagggc aaaaaaaaaa aaaaaaaaaa 840 aaaatggcta aaatgatgaa atacctaaag gaaaaaatgt gtctctccgg ggcggggagc 900 aacagtcagg gtaaagaaaa aacgtccctg agggtgaggt gcgggcttcc cccggtgtac 960 ccgcgagtcg ccgacgggag aacggaggaa ggcgagtttg tttttggtgg gaagaaacaa 1020 ccggcagcaa ccacaccacc actgccgcca ctaacaacac agccgcaagg ttttcaggtg 1080 acggcagaac tcgtagagca ggtgttgcgc atcctgcaga ccggtggcat ccccgtcccg 1140 actccaccgc acacgggcac caccacacca gcgacgcatc agagcgaggc cccgccacaa 1200 accggactag ccacgacccg cgacacccgc gccactgatc ggaagcaaac cccgcagcgt 1260 ggcccacaag agccgaagcg agaggttccg acaccgcggc tagtagctcc acaaccgtct 1320 gtgccaccag gcttccgtct ggtaccagca agggcctcga cgccaccacg ctggcaacgg 1380 gaggcgactg cacaggccgc aaccacaaca ggcgcggcaa accgggtttc aacaccgcag 1440 cagcgcacac ccaccacaag caggagtgca acaccacagc ccaacatcgc atcgccgagt 1500 ggcgcaccac cgcgctacaa cgcaacaaga ctctcgccgg tgacaccgct cctggtgcca 1560 gtgccgccgc gcgacggcag gacgtacgca caagccacgc gtagcaccac gccaatactg 1620 tacaacgcac aacacgggac gacgacgtcg gccacccggc aggcgactcc gtcactatgg 1680 gcggcacgcg cctccactgc aacgccgcac cgggcaccat caccaacagt acggctccca 1740 acgcggacgg taccgccagc aattcttacg gaggagcaac ggaaggcaga cccagagttc 1800 catcaagtga ggacgcagaa ccagcgtcat cgtagaatgc tggaagcgct tgtggcaagc 1860 atgaaggatt ccgaggccac agcggcacac tacgagacgt cgtacatgta cgcaaaggtg 1920 ctggcggaac acgcggagat gggacgcgaa gggctacgac gccatttcag ccttctcgcc 1980 gacaactacc caaaagcaca cccagttgac agaccatact ggtattgtca gggtcaaacg 2040 ccggaaggac gaccgtgcac gtacgcgcac gaacacgagg cagtgatgcg cagccacatc 2100 gcgaaacgtc acgagggaca gagggtggaa atgcgtcagc acgtcctctc aagctacagc 2160 ggtggagcgc acaacggcag atccacatgt gcggccacca cggcggcggc aatgtggagc 2220 acccgaccgg acgcacggcg tgacaccgcg acgtgggccg ccttcaagaa atccacgccc 2280 gcggcgacaa aggccatgct gcaggacttg cggctagctg tgcccaccac cgcgcgacga 2340 gcgctagagg cagtcgttgc aagagaggcc gacaccacac aagacgacca ggaaacacat 2400 caaaagctca ttctggtatc cgggagggtg aacagagcgg tcggagcagc agacgttaca 2460 acgcagctag caaaaatggt aaccgaaaag cacgacgaga gcgcatacga ggatgtcacg 2520 gaggtggcct tcaatatcct cgcgccggcg ggggctacgg gaatcgaggg gatgccgcac 2580 gtcccggaga agatcgagct tccgtccgcg agcacaaaac aaacaatcga gcttaacctt 2640 cacgcggtgg caagcgcaac ggagccaacc cacctaaccg ccgcggtgcg agaggtgcaa 2700 aaaagacgcg gcatagagat aacaacatgg caccgctacg atggtgcggt gcacaccaca 2760 tcgcccaccg caccgccatc aaactcggtt gtgatgcttc tttatacacg ggcacaaacc 2820 acacaggatg agcaaaagca gctaacgcta aaaggagcag acggcgacaa ggacgacgac 2880 acgtacagct tcctcagtga cgacagcgac accgaggaca gggttccaga agatatcgac 2940 gatcgcacga gcacaagctt cacacaagag ttgtcgccac aaaccacggc ggaggatgga 3000 gagcacgtct ttggcggaca cgtggaccca gccgtgctcg gagactggag cgaagaagat 3060 gacagcctcg agcaggagac tgcggatcgc gcctgtcgcg gaagcgccgc tgcgaaaact 3120 gagcgacacc gaagcatgac aaagttcctc gagagactcg cgcagcacga aagtatgacg 3180 cactcaccga catcacccgg agacgtgata agagccgtca ttgaagaaga tgaggcatcg 3240 agtgcactat tcgcatgcgg aattgagacc gcacgcacat gcaggagctg cgggtaccaa 3300 gtcgatgcgg aggtgctgga tgacatagct gtggtgacgg tgacgaacac aggcctgcga 3360 caagacaact tatccacaca gcagctggac gacgccatca accagcacac catttacacc 3420 ccagaagagt gtgcgcgatg cggccgggaa accgcttctg caacacatcg cctcgtccaa 3480 tgggcggtgg cggtggtcct tccagaaaaa gaagacacaa cgcagaccac ggaggagcga 3540 catcttccgc gaacggtaat ggcaaccaat ggcaacggat cgcagtgcac attgcacctc 3600 gttgcgatgg ccaatattca tgcccaccgt acagctgagg tgtgggagat ggtggcccga 3660 cactccgacg gaaacgagca atggagacga cgcgggcaca cggacgaaga agaccagcaa 3720 acggagacgg agagacgggg caacgtactt ttatacactc gagatgctcc caccgtcttg 3780 gccgtgttca ccacagcggc gcgtagcaca agcgctgtgg aagaatcgac ggcactcacc 3840 aatgtgacaa caccgcaacc agcagcaacg ccggagcacc gacaggaaac acgtgcaacg 3900 caagacgcgc aacagacaca aagagagaca ccgacaccgc aacgctcact agaaggaatc 3960 ctggaggagg tgaccggggt gcgaccacga tctgcggtgt ccaacatcga ggcggttcgg 4020 cgaatgctgt gcagcattgc gagcggcaag atgagagcac gaggcgggat ctcaatcaac 4080 caagccttcg ccaccctggc caacgtgaca caagagtgga gaacggcctt tgccgataac 4140 gggtgggaaa ggcgcggctg tgcagcatgc ggggcccgcg agggccaaaa actattcgac 4200 tcccagctgt cgatcgtgga tgtggcgatt cgaatgaaag aagagatatc gacaaccgac 4260 ctgacagcag cgctcggcat acgcaaaaca caaacgagtg atgcgtgccc tgtgtgcgat 4320 gcgcacatgc aatccaccat tcaacacgtt ccaggaaaag cagtcatgtt ttcaattgaa 4380 gaggcgacgg cgacaggaaa ccaaatcacg acaagagtag cgccaccaac ttttacgcac 4440 aaagcgccga acggagcaga agtcaccaga tccctactgg cagttattga cacaaccacg 4500 gggccggaag cccaagtgca ccgcgttaca gcaaactggg ggacagcggc gcaacggtgg 4560 tgcaaggtcc tgccggaagg aggggaacaa gagaccaagc ccaccgtccc accaaaaggg 4620 aacttacttt gctacctttc tcacaagcca cgggtgacac aaacagcaca acgcaccgaa 4680 gacgacgatc accgcgacac aaacaatgag gagatttcct cttcatcaca tcaagctccg 4740 aaaaaacggg agcgaccggc gagcagcacg accacgccac cgagcatgaa accatcgaat 4800 tgcagcgagg tgaaaattga caccacaccg ggaaggcagc ggcaaccaca caacagggag 4860 gaggagcaga cacgcgaaac ggcacgtgcg ccaacttatc ccacggataa cgccgccaac 4920 gagaacgcga gcgcgcggaa acgcgaggaa cagcggggcc cactgggagg cgccacttcg 4980 acgagccctg ttgtaatcaa cagcagcgcc gacttttcga tggtggaaga caccgccgca 5040 acaccaatcc agcggacgat gtcgttcacc ctcgaccccc tatcgccgga ggacgaagac 5100 gcagtagcac cggacccctt tctggcatca attgtccagg tagtcggcga ggaagatgag 5160 ggcacagcgg aagcaccatc gatgaacaac ggagaaaaca gtgagccggc tgaaacccca 5220 acacgaagcc tttttgaagc agaagacgta ccgctgccgg cgtcaatcat tgatgtcgac 5280 gacggtgccg aagaggacgc accacacgcc cagaacaacg ggcaacaagt ccgaaacgca 5340 gccacgtctc cacaacggga ccgcgtaacg ccaccaccgg acgcgcggca ctgcccgcta 5400 tgcccgcaca cccctaaaaa tcaaagcacg ccaaagccgc agcgcttcca acagctttgt 5460 agccacatga acattaaaca tcacgtggcg gaagtggcag aggcggtgct tgaggcggcc 5520 ggtttggaac gatgcagccg atgccggcag tttcttcgcc gcacccagag agcccgcgag 5580 tcgcacaggt gtgagccaca agccggcgag ccaacccggg atgaagaggg cggggagacg 5640 caacatgggg ctccccgggc gacggccacc gtgcgggatt tccgacatga cagccccacc 5700 gcggatgaca cggagtggct cgaaaaaacg tcgagcacgg tacgccacct ccacaaaaaa 5760 gagtggggac actggctggt gaccgtaggt acggttctca ccggatacac cgcatcggtg 5820 gaggaagagc gatggaagcg acagctggcc atgacgacgg tcgtgcgcga caacctgggc 5880 cgccaacgtc gcggcgacgg ggaaacgtcg caagcccaac cgcaacgctc ggaggcaaac 5940 acggcgccgc agggaagggg tgaagaaaca acggccacgc aaccccaccg cgccattacc 6000 gaggaaaact ggggccagga ccgggcggca gaggaggagg gacagctccc acaacagcac 6060 aaggggatga ccgccgaagc ggagcgcgaa gagcagaacc ggaaacggat tctaaaccag 6120 ctatcgctgg gcgctctggg aaaggcggcg cgatccctct tcaacaccca gcttccgcgc 6180 gccaccatcg aagaggcgcg accgcaactg gaagcgctgc acccccaagc ctcggtaagc 6240 gatctccccc tacctagtga gacgccgttc ctccacgctg tcaaaccaga gcgagtccga 6300 gcggtcatca caaagcagct gggacgcgcg gcggcacccg gcttggacgg gtggacccga 6360 gagctcctcg ttccaatcac cgaagacaag ggcctcctga ccgagctgac ggcactagta 6420 caggacatgc tggtgggaaa cgtgcacccg tccttcgcga cacggatccg cgcctgcatt 6480 ctacaccctt ttcggaagga agccggctcg gcgaaggtcc gtcccatcac gccagagtcc 6540 gcgcttatga agctggctgc gcacatcgcg cttgacagtg tggagaagtc gttccgcagc 6600 accttcaaag ggtggcagta cggcgtgtgg ggcgactcca ccgaggcggt gaagcgcatt 6660 cgcgaagcat acgcggaggc ctcttcggac acgctggtgg cactggacgc gacaaacgcg 6720 tacaaccgca tgtctagacg ccacatcctg gaggctgcgt acgcacaacc ggagctccgc 6780 tttgccttcg gcgtggtgaa cctctccctc ggtgccgctg gtgagctcgc gctgtatgaa 6840 aacggtgcga agatccacgc cctcaaatcg acagaggggg tgcggcaggg aatggtgctc 6900 agcccgctgc tgtttgccaa cgccatgagt ggcatcatcc gaccactgat ggagatgcat 6960 ccgagggtga aagtggtcgc ctacctggac gacgtgacgc tgatcggtcc gcacgctgcc 7020 gtacaggact tcctcgcaga agcggggcca caactctcac gcgtcggatt cgacatcaac 7080 ccggcgaagt cccaccacct ggcaaagctg gaggttccag aggcgttaag tgtaagcgga 7140 agaacgattc cgattgcaca gggcgtggtg cgcatcctcg gtgcgggttt ccgaggggac 7200 acggcctcag tagaggagtg ggtgtgggag aagacgaaga cgcatgacca ctacttcgag 7260 aagctgcaat cggaatggct gccgcgattg gcgcgactcc agctgctgcg tggctctacc 7320 gtgccgcgcc tcaaccacct tttgcgcacc cacaagccgg aggaactcaa tcgaagtacg 7380 acttggtttg acgagcgcgt gatggaaacg gcgctgaaca ttgccaacat cggtgaccga 7440 agaaacgcgg acgacggaga cccagggcac acgccgcccg accccctacg agagacaagg 7500 ctcctggttc gacttccgat ggcgatgggg ggtctggggc tgcggtcgca aaagccaatt 7560 gcggggtttg ccgcggagtg tgtggggacg aagaacgcgc agcgcgagcg cacggcggtg 7620 gtcgacgtcg agattgagaa gagccttcat tctctccttt cggagcctgc tcttcttctc 7680 ctggacgcga acacggcgac gggtgcgacg cgtgcgttgg tggacccggc ggtgaaaatg 7740 tcggaccgtg cagcggaggt gatgctgaag cagcgccttt ttcaacgcgt gctcccggtg 7800 gggtggcagt gcgtgtgtgg agaagacgca acgaacgagc acatcaacag atgcaggcag 7860 gcagaaggcg gtgcaaggat tgcgcgacac aacaaggtgc gtgacacagt catcggctgg 7920 gcaaccgaaa tcggttatgt ggcgagagca gagccgccga cacagaacca gctggcaaca 7980 acgattgaaa acagccatcg acgacggctg gacgtggaga tcacaacgcc gaacggaacc 8040 ttcactaccg acgtcacggt gacgtacccc ggtcgaacgg gtctagggac acacgccgta 8100 caggaggcgc acaccaacaa gttggcaaag cacggcgacg acgcgcgtgc acaggacaag 8160 atatttgccc cactagcgct ggagtccacc ggtggaattc acaaggacgg cttccagtgg 8220 atgcggcggc tggccggcgt aaaaccgcac ccgtacacac ccaatactgc gctcagcatg 8280 ttgctgacaa aggtggctca agcactgcac gagggtaatg cctacatgtt tcatgtggca 8340 gatcaagcgc acaaacaccg ccagctcgga cgttcggtcg gtgaggtgca agagtagagt 8400 acattgaagg gcggtgaact gtcagagcta ttttttcctt tacatttttc tttgtttcac 8460 aacaaggctt tacaataggc gacgcggaag atgaaggaca atcaccgaca tcttttatgc 8520 atcatacgcg acggcaccgc agtaacaaaa tggaaatttc aaatggaatt ttaaatgaca 8580 taacaccaag agaatgcgcc ttgtctgcaa aaccaccggt ggggcgtcat gatgacatcc 8640 agatgcggat gtgcggaacc aatacaccag gaccgaccac acaatcggtc aacacacaga 8700 aaaaaaaaaa aaaaaatcca gtatgctaaa gagagtggat cacagcgcct acgtataaga 8760 aaatgataaa attaatgaaa ataatagatc acctttaccc catcaaggat atcaaagcgt 8820 gcagccctct gttaaagatg tgaaaagcac aaagaaaaaa gaaaagaaat gaaacgagac 8880 tgaacatact gatatattac actaatctgg cgcagaaaaa aaaaaaaacc agcccgggcc 8940 tttacacatt ccaccttccg ttgacgacgg ctcttaacac gatgcaaaat tcgacatctt 9000 gagcctcgaa tgccaaccga caacatgaga gcaccacctt ttaatcaaaa gaactcggaa 9060 aaagaacaga aaagcaaata agatgacgaa agagaatcca gcacgccttt ctattacgga 9120 aagcgttcaa tattgtaaac ccagcgagaa tcaacccgag taggtcaacc tcaaagaggc 9180 agatccaggg tatcaacttt taatttctta cccaattgca gccgcttttt tccttgcttc 9240 aactttttct tgcttttagt agcgccagaa ataatcatca aggtcagata gcagtcctct 9300 cttttttaac tttagttttc caacgactgc atttttcatt tttctttttc caactctatc 9360 ctttccgagt aaaacgtcga aaactacagt ggacgcctag tgcttcacgt catgtgaaga 9420 ataaaaaata tgctaaggtc acgggcaaag gacatcacgt atccacactt tgtcttcgga 9480 caaggcgatg gacgggaact ggcgttttgg aggtggttca cgcaactggt tcgtgccact 9540 tatttcctgc gttctgacgg tgatgacaga ccggtgcgat ggtcggggca cccgc 9595 // ID Crack-5_BF repbase; DNA; INV; 2129 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-5_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2129 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2129 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 810-810 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 3..1649 FT /product="Crack-5_BF_2p" FT /translation="IITENVEICNECNKFFVDIGTYLANNITSTKQDPISY FT IKAPTNCSLSLFEPTTNKEVEDIVLNLKDSAAGYDEIKSKALKSALPYILT FT PLTHILNMSLQSGDVPNGTKIAKVIPIFKGGDSTSFSNYRPISVLPSISKL FT FERLVYNRLLKYLNTNNILYKHQYGFRKGISTPLALIQLIDKISTAIDKNE FT YTIGIFLDLSKAFDTVNHTILLRKLAAYGVTGTALNWFQNYLNHRKQYVTL FT NGIKSTYKEIQCGVPQGSILGPLLFLIYVNDLAEASSILYFILFADDTNIF FT LSHKNFDSLIQLANQELRKVATWFEANKLSINTKKTNFLIFCSKNKSYDIN FT KAKIFLQNVPIKQEKQTKFLGVIIDEKLNWKAHVSAVSKTVAKTIGVIGKI FT KHNIPLKTLSTIYNSLIYPYLTYCNIVWGCACHTTLHPLTTLQKKFTRMAT FT ASDFLSHTAPLFLKLKILNVFDINKLQISIFTFNCLSNNLPDMFKTFLNPN FT SHVHNYHTRQRNLLHIPLFRTSLAQSTVKYRCATIWNSLPTNIKKHFFPPY FT I*" XX SQ Sequence 2129 BP; 718 A; 434 C; 277 G; 699 T; 1 other; aaatcatcac tgaaaatgtt gaaatatgta atgagtgtaa taagttcttt gttgatattg 60 gtacttatct agcaaataac atcacaagta caaaacaaga tcctataagc tacatcaagg 120 cccctacaaa ctgttcttta tcactttttg agccaaccac gaacaaagaa gtagaggaca 180 ttgttttaaa cttaaaagac tctgccgctg gttatgacga aattaaatct aaagccctaa 240 aatcggccct tccttacatc cttactcctc taactcatat tcttaacatg tcgttacaaa 300 gtggagatgt tccaaacgga acaaagattg caaaagttat acctatattc aaggggggag 360 actcaacttc atttagcaat tatcgaccga tctcagtctt gcccagcatc tcaaaacttt 420 tcgaaaggct agtttacaat cgattactaa aatatctwaa cactaataat attctatata 480 aacatcaata cggcttccgt aaaggtatat ccactccact tgctcttatt caactaatag 540 acaaaatatc cacagccatt gacaaaaacg aatatacaat aggaatattt ttagatttat 600 ccaaagcctt cgataccgta aatcatacca tactcctccg aaaattagct gcatacggtg 660 ttactggcac tgcactgaac tggttccaaa attatctaaa tcataggaaa caatacgtaa 720 ctttaaatgg aattaaatct acatacaagg aaattcaatg tggagtaccg caagggtcga 780 ttttaggtcc tctattattc ttgatttacg tcaatgatct ggcggaagcc tcctcaatac 840 tttattttat actgtttgcc gatgacacca acatattcct ttcccacaag aattttgact 900 ctctgataca actggctaac caagaattgc gcaaagtagc tacatggttt gaagccaata 960 aattatcaat aaatactaaa aaaacaaatt ttctcatttt ctgtagcaag aataaaagct 1020 atgatattaa taaggccaaa attttcctac agaatgttcc tataaagcag gagaaacaaa 1080 ctaaattttt gggagtaatt atagatgaaa aattaaattg gaaagcccat gtctccgccg 1140 taagtaaaac agttgctaaa acaattggtg ttattgggaa aatcaaacac aatattcctc 1200 taaagaccct ttcaactatt tacaacagcc ttatataccc gtacttgact tattgcaata 1260 tagtttgggg atgtgcgtgc catacaacac ttcacccact tacaactcta caaaaaaaat 1320 tcacacgaat ggccaccgcc tctgattttc tttcacatac agccccttta tttctaaaat 1380 tgaaaatatt gaatgtgttt gatataaata agctgcaaat tagtatattc acgtttaact 1440 gtctttctaa caatctcccc gatatgttta aaactttcct aaaccctaat tcacatgtac 1500 acaattatca taccaggcaa agaaatctac tccatatccc tctctttcga acatcattag 1560 cgcagtccac cgtaaaatat agatgtgcca caatctggaa cagtcttcct acaaatatca 1620 aaaaacactt cttccctcct tacatttaaa agtgccctga aaacatatct cataaaccac 1680 cctagcccct tatagaaata tttatcctat cttaattcat attgctatta tcatttgatt 1740 tcattgatca cacattttct ttttttgtat tactataaca tatgactgtg ttagttactt 1800 ctccttgtta tttattgatt cactgatcat acatcttttg ttgtattatt ataacatatg 1860 attgtgttag ttacttatgt ttgttattta ttgatttcac tgatcgtaca tcttttgcat 1920 tattataaca tgtgtgctag ttctttcttc ttctttgtta attatgattt cattgatcat 1980 acattttgta taagcataga ggaggagggc acagcataaa ccgttaaggt tttttccctc 2040 ctcctgcacc tgttttatat tttattctct tctgtatcta ttatcatttt gtatgtgcaa 2100 actgaataaa caaacaaaca aacaaacaa 2129 // ID Gypsy-1-I_LG repbase; DNA; INV; 7868 BP. XX AC . XX DT 18-MAR-2009 (Rel. 14.03, Created) DT 18-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; 4-bp TSD; Gypsy-1-I_LG. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-7868 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Lottia gigantea."; RL Repbase Reports 9(3), 727-727 (2009). XX DR [1] (Consensus) XX CC Terminal nucleotides of the LTR are 5'-TG, TA-3'; the TSD is 4-bp CC long. XX FH Key Location/Qualifiers FT CDS 142..1002 FT /product="Gypsy-1-I_LG_1p" FT /translation="MSTEEETTFFHTLSDQLNRLTNSISTNGVVTIIDSFE FT GDPKAFREWTRSIEKYAVLASIPNDRMKLIAYQTSKGVVSNFIRRYLDTHP FT AADWPTLKDELSARFAEVTDAQHAFALLRKVKQNHNENVQFFGERLLVLAS FT EAYSGTSSLEIERQLVGFFVDGLKENYLKLKILRENPKTLQLAVTLAMGEQ FT NLRTRFALRTGSNFPRNEDSSSRDEPMEVDHFRPRKCRICYRHGHEAKDCR FT RKPNNRQVNSADTFRASRPPIKCWRCGKLGHKQVDCKVRLNSGNH*" FT CDS 960..4922 FT /product="Gypsy-1-I_LG_2p" FT /translation="TSRLQSSTKFGKPLASPRKPANEERKKVTSTNKPIFN FT KNCKSKRNSKNKKILEVNLAGNPSTCLVKFEKQYYRALIDSGAEVSLMHRR FT VYQNLKYKPKLKKENVILRSVSGQSLTIDGSADFEFKIGNKEMKHKLFIVS FT SMNRNLILGRDFLAEKNVRIYFDLKCLRIYGTYVPLQHDLHISSLVRLTEN FT VVLHPQSSCMSVARAKCLNPNSPMLQIQGTDSGFMSKQPGIMVANALVATV FT NVERNNGRMPITIVNNTNKTMTLKRGCLIGQIDPVNHNDVSELVPPLDVNQ FT VSNEQTNDCFKDIKMPNNHRSRLMNIFTKYRDIFASSDTELGQTDTVKMKI FT DVGDHPPIKLRPYRTPLNKTKIVDKAIDDMLGAGIIAHSSSPYAFPIVVVD FT KKDGGRRFCVDYRALNKITKINSYPLPLIDDLLAQLNASSVYSLIDLRAGY FT WQVQMHADDKEKTAFTCHRGLFHFNVMPFGLVNAPGVFMQLVGRVLEGLSD FT FAVAYLDDILIFSKTPEDHFKHIKLVFDRLRQHNLKLKASKCSFMQEETKY FT LGFTITKSGISADPDKVRALKAMTPPTTVTETRSFIGGMSYYRRLIPNFSK FT IAEPLIHLTKKFARFNWTTECQLAFDFLKNSLTTIPFLGYPNSNLPYVLYT FT DASDTCIGACLAQVIDGVETPIYFLSHKLSSTQVKYSTVLKECYAIHYSLQ FT KLDYYLHNAEFIIRTDHQPLKYLLESPFQNKQIQLWALGIQGYNCKIEYIP FT GKSNSIADLLSRVPVATFDNLTDDDEEIEPRLNDNAFEINVINSNKVDTGN FT FADCDMNDNESLSDPEDDPLKLDLVSEQIKDDELQNIKTELEKGNLKPAVN FT NKYIVVDDILYYISDPDGDLNLRLYVPKHLTSKVINQYHDENGHSGVDRCY FT ALIRTKYYWKGLYKQLYKYVENCVTCQARSTTKNKPDLVETGIPPYPMAKI FT SLDLSGPYPTSLSGNKYIASFVCMYSGWIEAFPIPDKSAQNVAHLLIDEMF FT PRFGCPLAIVTDNGTENVNKVMKETMQTLNIKHICTSYYRPQSNSVVERSH FT RTLHDILSKRLKSDISTWDIHLNQSLAALRFNISASTKQSPFTLLYCRDPV FT LPLDNILKPRRKFLGEGDFHQIALENQHRSFLQVYKNVKKAKKRQAKYANK FT NTKNIEFKVGDPVYYKNFLRKNKLEDRWTPYYRIVEKTSSVSYRIKSQLTG FT KTTKAHAEQLRLAKIDEWDIPTDEQRLRQAKYVEPLSDSSSEGSASEYSEN FT DELPLAQLIKKYRRVRENSSSESDIPLMELAQRIRAQDNDVIDDSNVESKS FT SSGDVSDE*" FT CDS 5204..7342 FT /product="Gypsy-1-I_LG_3p" FT /note="putative env gene." FT /translation="MVHYFSERMRCIWLLLLPVYGITAATKVSQNVLFRPV FT HQVSTSRSKWRLTLVHDITPFMKFTSDLATDIGRTLLTVDEIMTNYSAGNN FT SLEYENTFSKLKIEVITLRDEIEDLLDGVQDYRALSMKHARPRRALIPFVG FT KALSWLFGTVSTDDLNVIRSNIDKLANNQQAIQHVVAESLSILNVSRLHIS FT ENRQQVNLLSDSLRNLDIKIKNVVRQFKKKLVSLEGFLHLYLRMKLVLDDL FT KNMYLRAEMYASEIKLQLNMLSLGHLSPSVLTPGSLRRLLIEIQSHLPPSV FT RLPANPKKNLWHYYRILNCVTIIENNKIIVVLNIPLLDSDRVFEIYKIHNL FT PLPMTNKLILSKSKATFKMVASYRLEAEVIAVNTDRTKYAILDSDEFTKCS FT DPLAGFCQITSPIYPINLSQLCIVALFTENREKIRQLCKVFIKPNCILPVA FT TYLANGMWMVVTHRPIRFSIVCHQTSFVHQSVVIRPPIQSITLPQSCKAQN FT DFLELPAHFSNESSYMDSTSFPNVTAIFEKTIQDLWQPYITKFENYSLTKL FT PPKLEHITDIPMKELTYEIENLQKLEHEATEKWPLWKIAAISIAVITGILI FT LFVFIKYCLVGRMGHRFLSVLRTRTAHGRGNKRSSDVELRTIPSAPNDEPT FT SRPDPTVPLLPNNMYPPIPRPRDDTPPPSDESSLPVPKGRMTLDYLLKLAQ FT ARNAAKAQESTM*" XX SQ Sequence 7868 BP; 2715 A; 1426 C; 1473 G; 2254 T; 0 other; tacctatctt ttctggtgcg agtgaccagg attctgaacg ggcaagaaga tcaaggaact 60 aggaaaaaga acagaagagg aaacgcactc ctatcggtaa gttaattttg tttcactgtc 120 cattgagcat aatccgagaa catgtcaaca gaagaagaaa ccacattctt ccacacttta 180 tctgaccaat taaatagact tacaaattcg atatcgacta atggggtggt aacaataata 240 gactcatttg agggggatcc caaagccttt agggaatgga cgagatccat tgaaaagtat 300 gcggtgttag catccattcc aaacgatagg atgaaattga tcgcttatca aacgagtaaa 360 ggggtggtgt ctaattttat taggcgctat ctcgataccc atcccgcagc agattggcct 420 acgctaaagg atgaactctc ggcacgcttt gcagaagtaa ctgatgcaca acatgctttc 480 gcgttactcc gaaaagttaa acaaaatcat aatgaaaacg tacaattttt tggtgagcga 540 cttttggtat tagcatcgga agcatattca ggaacatcgt cgttagaaat agaaaggcaa 600 ctagtagggt tctttgttga cggattgaaa gaaaattatt taaaattgaa aattctacgg 660 gaaaatccca aaactttgca attagcggtg accttagcta tgggtgagca aaatttgaga 720 acacggttcg cattaagaac gggaagcaat ttcccgcgca acgaagatag ttctagccga 780 gacgaaccca tggaggttga ccatttcaga cccaggaaat gcagaatttg ctatagacat 840 ggacatgagg caaaggattg ccgacgaaaa cctaacaata gacaggttaa cagtgcggac 900 acatttaggg catcccgtcc tccaattaaa tgttggaggt gcggtaaatt aggccataaa 960 caagtcgatt gcaaagttcg actaaattcg ggaaaccatt agcctctcct cgtaaaccgg 1020 ctaacgagga gagaaagaaa gtaacttcaa caaataaacc tattttcaac aaaaattgta 1080 agtcaaaacg aaattctaaa aataagaaaa ttttagaagt aaacttggca ggtaatccga 1140 gtacatgttt agtaaaattt gaaaaacaat attatagggc attaattgat tccggtgcag 1200 aagtgtcttt aatgcaccgg cgcgtttacc aaaatttaaa atataaaccc aaactcaaaa 1260 aagaaaatgt aattctacgt tccgtaagcg gacaatcttt aactattgat ggaagtgccg 1320 atttcgaatt taaaatagga aacaaagaaa tgaagcataa actctttatt gtatcttcta 1380 tgaatagaaa tttgatactc ggaagagact ttctagccga gaaaaatgtg aggatatatt 1440 tcgacctcaa atgcctaagg atctatggaa cgtatgttcc cctacaacat gaccttcata 1500 tttcatccct tgtacggttg actgaaaatg ttgttttaca tcctcaaagt agttgtatga 1560 gtgtagcaag ggcaaaatgt ttaaacccga attcgcccat gctacaaata caaggaaccg 1620 attctggttt catgagcaaa caaccaggaa taatggttgc aaatgcacta gttgcaacag 1680 taaatgttga acgaaacaat ggacgtatgc ccattacgat agtaaataat actaacaaaa 1740 ctatgacctt aaagagaggt tgcttaatag gtcaaataga ccctgtcaat cataacgacg 1800 taagtgaatt agtcccacct ttagatgtaa accaggttag taatgaacaa actaatgact 1860 gttttaaaga tataaaaatg ccaaacaacc atcggtcacg cctaatgaat atatttacta 1920 aatatagaga tatattcgcc tcgagcgata ccgaacttgg acagacagat actgtcaaaa 1980 tgaagattga tgttggtgac catccaccca taaaattaag gccatatagg acaccgttaa 2040 ataaaacaaa aatcgttgac aaagcgattg atgatatgct aggtgcaggg ataatcgcac 2100 attcgtcatc cccgtatgca tttcctattg tagtcgtaga caagaaagat ggagggaggc 2160 gcttttgtgt tgattacaga gcattgaata aaatcactaa aatcaactcc taccctctcc 2220 cattaatcga tgatctatta gcacaattaa atgcatcttc agtgtacagc ctcattgatt 2280 tacgagcagg ttactggcaa gtgcagatgc atgcagatga caaagaaaaa actgcattta 2340 catgtcatag aggtttattt cattttaatg tcatgccttt tggtttagtt aatgcacccg 2400 gagtattcat gcaattagtt ggacgagtat tagaaggcct cagtgatttt gcggtcgctt 2460 atcttgacga tattttaata ttttcaaaaa caccagaaga tcactttaaa catataaagt 2520 tagtttttga tagattaaga caacataacc taaagcttaa ggcaagtaaa tgttcattca 2580 tgcaagaaga gacgaaatat ttaggtttta ctattactaa gagtggtatt tcggctgacc 2640 cagataaagt gagagcgtta aaagctatga ctcctcccac cacagttacc gagactcgtt 2700 cttttattgg agggatgagt tattatagaa gacttatacc aaatttcagt aagattgcag 2760 aaccattaat acatttaact aagaaatttg ctagatttaa ttggacaacg gaatgtcaac 2820 ttgcgtttga cttcttaaaa aatagtctta ctactatacc gtttttaggt tatccgaact 2880 ctaatcttcc gtacgtttta tacactgatg caagtgacac ttgcataggt gcatgtttag 2940 cacaagttat agatggagtc gaaactccaa tttacttttt gagtcataaa ttgagttcaa 3000 cacaggtaaa atatagtacc gtgttaaaag aatgttacgc tatacattat agcttgcaaa 3060 aacttgacta ttatttgcat aatgcggaat ttattattag aaccgatcat caaccattaa 3120 agtacttgtt agaatcgccg tttcaaaata aacaaattca gctttgggcc cttggtatac 3180 aaggttataa ttgtaaaatt gaatacattc ctggcaaaag caactcgata gcagatctac 3240 tatcacgagt gccagtagca acttttgaca atttaacaga tgacgacgag gagatagaac 3300 ctcgattaaa cgacaatgca ttcgaaataa atgtgataaa ttctaataag gttgacacag 3360 gtaatttcgc agactgtgat atgaatgata atgaaagttt aagtgacccc gaagatgacc 3420 cactaaaatt agacttagtc agtgaacaaa taaaagatga tgaattacaa aatattaaga 3480 ctgaattaga aaaaggaaac cttaaacctg cggtcaataa taaatatata gttgttgatg 3540 acatattata ctatatatct gacccagatg gtgatcttaa tttaagactc tatgtcccca 3600 agcatttgac gagtaaagtc attaatcagt atcatgatga gaatggacat agtggtgttg 3660 acagatgtta tgctctaatt cgtactaaat actattggaa gggtttatat aaacaactat 3720 ataagtatgt agaaaactgt gtcacttgtc aagcaaggtc aacaactaaa aataaaccag 3780 acttagttga aacaggtata ccaccatatc caatggctaa gatctcattg gatctatcgg 3840 gaccttaccc aacaagttta tcaggaaata agtacatcgc cagttttgta tgtatgtact 3900 ctggttggat tgaagcattt ccaattcctg ataaatcggc acaaaacgtc gcacatttac 3960 tgattgacga aatgtttcca aggttcggtt gtccattagc tatagtgaca gacaatggaa 4020 cagaaaatgt aaataaagtc atgaaagaaa cgatgcaaac tttaaatata aaacatattt 4080 gtacttctta ttataggcct caaagtaata gtgtagttga acgaagtcat cgaacgttac 4140 acgatatctt gtctaaacgg ttgaaatctg acatttccac ttgggacatc caccttaacc 4200 agagtcttgc agctttgaga ttcaatataa gcgctagcac aaaacaatct ccatttacat 4260 tactttattg tagagatcca gtgttaccgt tagacaatat tctcaaacct agacgtaagt 4320 ttttaggtga aggtgatttc catcagatcg cactggaaaa tcaacatcgt tcattcctac 4380 aagtttataa aaatgtcaaa aaggctaaga aaagacaggc taaatatgca aataaaaata 4440 ctaagaacat cgaatttaaa gttggtgacc cggtctatta taagaatttc ttaaggaaga 4500 ataaacttga agaccgatgg acaccttact atagaattgt cgaaaaaaca tcttcagtat 4560 cgtatagaat taagagtcaa ctcacaggca aaacgaccaa agctcatgca gagcaattac 4620 ggttggctaa aatagacgaa tgggacattc caactgatga acaaagactt agacaggcta 4680 agtatgtaga acccctctct gactcgagtt ctgagggctc tgcttcggag tattccgaaa 4740 acgatgagtt acccttagct cagttaatca aaaagtacag acgtgtacgt gaaaactcct 4800 ccagtgagag cgatatacct ctcatggaat tagctcaaag gataagagca caagataacg 4860 atgttatcga cgacagtaat gtagaaagta aatcgtcgtc aggtgatgta tcggatgagt 4920 aatttatggt ccaatcaggt gtgtgtgttg tgagtgatga gaaaggcgcc ttctctgatc 4980 ttatatcgat atactgcctt gggccataca aattactatt gcggtcaaat aacctaatta 5040 tcattatgga tacaataacg agtatatcgt gtatcgaaaa tgatgggaac aaaacaagat 5100 ttacctaatt aaggcgtaga tatctaacgc ttatcatgat ttgtgtccac atgtaactaa 5160 aattaattat aattttgttg tagttgttgt tgttttaaag taaatggtcc attatttttc 5220 agaaagaatg cgttgtattt ggttattatt attacctgtt tatggtatca ctgcggcaac 5280 caaggtttct cagaatgtac tctttcgacc agtacatcag gtatccacaa gtcggtccaa 5340 atggcgcttg accctggtcc atgatataac tccgtttatg aagttcacca gtgaccttgc 5400 taccgacata ggacgcactc tactcactgt agatgaaatt atgacaaatt attctgcagg 5460 gaacaatagc ttagaatacg aaaatacgtt tagcaaacta aaaatagagg taataactct 5520 tcgagatgaa atagaagatc tactggatgg tgtgcaagac tatagagcac tgtccatgaa 5580 acatgctagg ccaagaagag cacttattcc atttgtagga aaagcattaa gttggttgtt 5640 cggtactgtc agtaccgatg atttaaatgt cattagaagt aatatcgata agctagctaa 5700 taaccaacaa gctattcaac atgtagttgc tgaaagtttg tctatattaa atgtatcacg 5760 cttacatatt tcagaaaata ggcaacaagt taatttatta tcagatagtt tacgaaactt 5820 ggatatcaag ataaaaaacg tagttcggca atttaaaaag aaactagtaa gtttagaggg 5880 atttctccat ctttatttaa gaatgaaatt agttctcgat gacctcaaaa acatgtatct 5940 tagagcagag atgtacgcat cggaaatcaa acttcaactt aacatgcttt ccttaggaca 6000 tttgtcccca agtgtattga ctcccggaag tctccgcaga ctattaatcg aaattcaatc 6060 ccacctacct ccgagtgtac gacttccggc aaatcctaag aaaaaccttt ggcattacta 6120 ccgaatttta aactgtgtaa cgataatcga aaataataag atcattgttg tgttgaatat 6180 acccttatta gattccgatc gcgtatttga aatttataaa attcataatt taccattacc 6240 tatgacgaac aaattaattt tatcgaagtc taaagcgacc tttaaaatgg tagcctcata 6300 tcggctcgag gctgaggtca tcgcagttaa tactgataga acaaaatatg caattctaga 6360 ttctgacgaa ttcactaaat gttcagaccc attggctgga ttttgccaaa ttacaagtcc 6420 tatctaccct attaatttgt cacaactctg tatagttgcc ttgtttacgg aaaatcgaga 6480 gaagatacgc caattatgta aagtctttat taagccgaac tgtattttac cggtggcaac 6540 atatttagca aacggtatgt ggatggttgt cacgcaccga ccaatccgat tttcaattgt 6600 atgtcatcag actagtttcg tacaccagtc tgtcgtgata cggccaccga tacaatcgat 6660 caccttacca caatcatgta aggctcaaaa tgattttcta gaactacccg cacacttttc 6720 gaacgaaagt tcatacatgg attccacgtc cttcccaaac gtgactgcta tttttgagaa 6780 aacaattcaa gacctttggc aaccttatat caccaaattc gagaattact cacttacaaa 6840 attaccaccc aaattagaac atattactga tatcccaatg aaggaactaa cgtacgaaat 6900 tgagaattta cagaaattag agcatgaagc tacagaaaaa tggccacttt ggaaaatcgc 6960 tgctattagc atagcagtaa taacgggtat tttgatactc tttgtcttta taaaatattg 7020 tttagtagga agaatggggc ataggtttct gtcggttctt cgaactcgga ctgctcatgg 7080 gcgggggaac aaaaggtctt ccgatgtgga gttgcgtact ataccttccg ctccaaacga 7140 tgaaccaaca tctcgacctg atcctacagt tcctctactc cccaacaaca tgtacccgcc 7200 gattcctaga ccgagggatg atactccacc cccatctgat gaatccagtc tccctgtacc 7260 taaaggaaga atgaccctgg actacttact aaagttggca caagcccgaa acgctgccaa 7320 agcccaggaa tctacaatgt aattgacata attttaaaaa caaatgtgta gatctcttta 7380 tagttgaaga ctacaagttt agctaaatgg tgaaacgtat cctgcttctt caaaaacgaa 7440 acaacataac ctatggcgaa ataaacgatg taacctgaga aacctactaa ccaaatggta 7500 gaatcgacaa ccgcaaaccc aagcacctta agaaacttta atggtggata taacttttat 7560 ggtggatata tgatgtaact tttaatggtg gatataactt taatggtgga tatataatat 7620 aactttttat ggtggatata actttttatg gtggatgaca tatagcttct agtagtgcat 7680 ataaacaata tagacttaca atcatgcctt gtgttatttc tgattatatt gaagaattgt 7740 tttttgtaat gtgtattttt caacttgtat tattaataat tatgatgtta agaatcgaca 7800 ggtgtgagga tagatttaac cgttttcttt tggtacccca atttcatgac cctctgcaaa 7860 gggaggaa 7868 // ID hAT-44_SM repbase; DNA; INV; 1880 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-44_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1880 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1847-1847 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 895..1551 FT /product="hAT-44_SM_1p" FT /translation="VYLILNTTQQKMVKIRIYLXAWISAPQASDAPYSDLM FT LLKSLLQYSSIHSAISKSTSRKFSNHLWYLSHEIVSLALFDRRVGSVNKRL FT MVSAMLNEENEDQVQSKRISVALDSFSDKXXEDFVTAKXMTLIRMMDLPDG FT FLMVDPDLWEDRDDYKKAAETVESLKVVNDHAERGVALIQEYSGQITRDEA FT QLQFLLQVVEDHRRMYPDSRKQALSGQL*" XX SQ Sequence 1880 BP; 589 A; 341 C; 381 G; 562 T; 7 other; gggtggagct tatttttgaa cttttgaaat ataatcttct taccctctca ttatgttcct 60 atatatgaaa taaaaattca ggcaaaatat gagcacaatt gaacaatatt taggggtagc 120 tcaatgatgg tgaagttttt ctgaatctct aaagctgcgg ctgcaatgaa gcattgttct 180 tagttttatt acataataca actgtttgca ttcattgctt gatttacgaa ttcaaagttc 240 atttctgtac ttttgctgat tgacaagtga cgctaacata gtgagcctta ttgattctta 300 tctattggtg agttacatat taaatgtttt aagtgtaaaa ttataagtgt tacaaaataa 360 tcatttgttt atcaatttat ctatctatct gtttctgtat atgtatttcc tatttattta 420 cactattgat ttacatttgc ctgccacatc atctcgaact gaaggaccat aagacagtcg 480 tcggctgcaa ccatcaaaga aatttccaag ttttggaaga aagctagaat cccgaagaga 540 gatcatcaaa actgccaaac caagcttgag caggtgtttg aagaatggcg tcttctgaag 600 aagaataaag cacgaacgtc atcaactcaa ctcgcgaaag agacggcttt gtttccagac 660 tcgaagatct ttttgatgtc gctcacgctg atgctcgacc aagtcgtccg tattacaaga 720 caaggatttt ttgttatccc atcgagaaaa aggacgaagg gggtcgatgg ctggagtaga 780 tgaaaagctt gctgctaaag aaaaaaggag catctgaaag agaggagcaa atggttgcca 840 gacaacagcg aatggacgag ataaaacagt cagcagtttc tacagcagaa ctgagtttat 900 ctgattctga atacaactca gcagaagatg gtgaaaatac gtatctacct gaangcttgg 960 atctccgcac ctcaagcatc agatgctcct tacagcgacc tgatgctgct gaagtcactg 1020 cttcaatact catcaatcca ttcagcaatc tcaaagtcga catcacgcaa gttttccaat 1080 cacttgtggt acttgtcaca tgaaatagtt agcttagctc tgttcgatcg tcgagtcggt 1140 tcggtaaaca agagattgat ggtgagtgcc atgctgaatg aagaaaatga agatcaggtt 1200 caatccaaac gaataagtgt tgctcttgat tcattcagcg acaagaantn ggaagatttt 1260 gttacagcaa agncaatgac actgatccga atgatggacc tgccagatgg atttctcatg 1320 gttgatccgg acttatggga ggacagagat gactacaaga aggctgcaga gacagttgag 1380 tctctaaagg ttgtcaatga ccacgctgaa cgtggagtgg cactcattca ggagtacagt 1440 ggacaaataa cccgtgatga agcgcagctt cagttcctgt tgcaagttgt cgaggaccat 1500 cgcagaatgt accctgatag caggaaacag gctctgtccg ggcagctatg aaatcaataa 1560 atttagaatt tgctggaaca gctggatatt gtcaaactga tactgtgaac tctgtcagaa 1620 actgaacaaa caattcacta atactgtgtt tttgtgttga gaaataaact ctgtgttgac 1680 tgttcgttaa aataaatatt cataattttg ctgaaatttc aaaacttttg taatctttga 1740 ggcccttgag ctacccctaa acanaagaat ttattgctca aactttttat acantgttta 1800 tatattataa ggaacatatt angggggtaa gactttcaaa aatgacattt ttattttttt 1860 taatttttta agctccaccc 1880 // ID R1_DEr repbase; DNA; INV; 5381 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE erecta. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DEr. XX OS Drosophila erecta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5381 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 402..1754 FT /product="R1_DEr_1p" FT /translation="LGKRRSHFHWVSPMESDSTASAMSASSASKKSTRRRR FT KSVLASKSSVPTQAKLVAVAANGVPEPVGVLDEEFSSMDAPVAAVAPAAAP FT AVAPAVAPTVAPAVARAVATATAATAAARAGQAAMMAELSATQRMVRSSFR FT SLGGVNTDELTLAISRYDELVMALMLRCGELETRLAMPPPPTPSMTTAANP FT PQMLQAAPIDAPRTAKVRETWSAVVKCDDPALSGKAIAEKVRNMVAPSLGV FT RVHEVRELRRGDGAIIRTPSVGELQKVVSSKRFAEVGLNVTRNAAEKPKVV FT VYDVDTAISPEEFMTELRENNFDDTMTASQFKKSVHLLTKAWSVTDGATVN FT VTLEVDDGAMAKLDVGRVYIKWFSFRCRSQVRTYACHRCVGFDHKISECRQ FT KDNVCRQCGQQGHNAARCQNPVDCRNCRHRGRPSGHHMLSSACPIYGAVLA FT RVQARH" FT CDS 1751..4894 FT /product="R1_DEr_2p" FT /note="reverse transcriptase." FT /translation="TLMFSFIQANCGRGRAATIELGVRLRSSSSMFALVQE FT PYLSSDGMDVLPQGMRIYTDRRGKAAILVDQQDAICMPVETLTTEYGVCLV FT VKGSFGSIFLCSAYCQFDAPLEPYLRYMDAILLQASRTPAILGLDANAVSP FT MWFAKLSRHAEGQANYSRGELLSEWMLEARAAALNQPTEVYTFDNYRARSD FT IEVTIVNEAASMWATYEWRVDEWELSDQNIITVVAETTSTSAVESIAPVPS FT WNLSNARWRLFEEEMVSRTAELPGSFSESPLDQQVSTLRSIVHDVCDIALG FT RKSPRSPSRRARWWTADLSAARREVRRLRRRLQDARHRDDDVAAEPLVVEL FT RRTSANYKRLIGRAKMDDWKRFVGDHADDPWGRVYKFCRGRRKCTEIGCLR FT VNGELITDWGDCARVLLRNFFPVAESEAPTAIVEEDPPALETLEIDVCVGR FT LKSRRSPGLDGINGTICKAVWRAVPQHLAALYSRCIRSGYFPSEWKCPRVV FT ALLKGPEKDKCEPSSYRGICLLPVFGKVLEAIMVNRVREVLPEGCRWQFGF FT RQGRCVEDAWRHVKSSVDASPAQYVLGTFVDFKGAFDNVEWSAVLRRLADL FT GCREISLWQSFFAGRRAVIRSSSGAVNVPVTRGCPQGSISGPFIWDMLMDV FT LLQRLEPYCLLSAYADDLLLLIEGNSRAVLEEKGAQLMSIVETWGAEIGVD FT VSTSKTVIMQLKGALRRAPTVRFAGANLPYVRSYRYLGITVSEGMKFLTHI FT ASLRQRMTGVVGALARVLRVDWGFSPRARRTIYAGLMAPCVLFGASVWYDT FT AAQVAARRRLSSCHRLILLGCLSVCRTVSTAALQVLAGAPPLDLAAKKLAV FT KYKLKRGYPLEENDWLYGEDIARLSREQKEIRLEECLLLSWQSRWDDDSEP FT GWVTHRFIPDVTLVYRDPNFGFSMMASFLLTGHGTFNAFLHGRALSDTTAC FT ACGDPYEDWVHVLCACPLYADLGGETSMDLECSALARTGHLKESWRIGREP FT NGWLCSLSWCSAEEGAHSLTHRRVVSGREYCHSLPQLVVGGD" XX SQ Sequence 5381 BP; 1191 A; 1337 C; 1673 G; 1180 T; 0 other; cggacgtgtt ttcctcgagc tcccattgct acggtcattg ttgagtaaat ccgcggattt 60 atacgttgaa aaacggagta aagtgtgtgt gaaggtgtcg cgaacaaaag ttgaaaattt 120 gtgagcgaaa gaacatcgcg tgttgtggaa ataaacgatt ttcaattgga aattttccac 180 gagggccacg tgttgagtga gagcgctctc tgcgagagag gcgttagccc ttttcgtgtg 240 ggattctctc agcgccagct gatcggttgt gaactttgct aaggtgtgtg cattggataa 300 ttttagtgtt gccataccga gcaattggtg gcgccactgt gaataggtgg agccagtgtg 360 ccatagcaac cagcctctgt cagcaagcta aagttgcttg acttggtaaa aggaggagcc 420 acttccattg ggtaagtccg atggaaagcg atagcactgc gagcgccatg agcgccagca 480 gtgcgtcgaa gaagtctacg cgtcgcaggc gtaaaagcgt actggcttca aagagctcgg 540 tgccaacgca ggcgaagctg gttgccgtgg cggcgaatgg agttcctgaa cccgtaggcg 600 tgttagacga ggagttttcg tcgatggatg cccccgtcgc tgctgttgcc cccgctgccg 660 cccccgccgt tgcccccgct gtcgccccca ctgtcgcccc cgctgtcgcc cgcgctgttg 720 ccaccgccac agctgccacc gctgccgctc gtgctgggca agcagccatg atggcagagc 780 tgtcggccac tcagcgtatg gtgcgcagca gttttcgcag cctaggaggg gtgaacacgg 840 acgagctcac cttggctatc agccgttacg acgagctggt tatggcgttg atgctcaggt 900 gtggagagct ggagacgcgg ctggctatgc caccgccacc gaccccgtcg atgactacgg 960 cggccaaccc tcctcagatg cttcaggcag cacccattga tgccccgcgg actgcaaagg 1020 ttcgcgagac atggtcagcg gtcgtgaagt gcgacgaccc agcgctatcg gggaaggcga 1080 tagctgagaa ggtgcgaaat atggttgcac cctccctcgg agtcagggtt cacgaggtgc 1140 gtgaactccg gagaggcgat ggagcaatta tccgcacacc ctcagtcgga gagctgcaga 1200 aggtggtgtc atcgaagaga ttcgccgaag tgggactaaa tgttacgagg aatgctgctg 1260 agaagccgaa ggtagtcgtt tacgacgtgg acacggccat cagcccggag gaattcatga 1320 cggaactgcg ggaaaacaac ttcgatgaca cgatgacggc atcgcaattt aagaagtcgg 1380 tgcacctgtt gactaaggcg tggtcggtga ctgacggcgc cactgtcaat gtgacgctgg 1440 aagtcgatga tggagcgatg gcgaagcttg atgtgggtcg ggtgtacatc aagtggttct 1500 cgttccgatg ccgatcgcag gtccgcacct atgcctgtca cagatgcgtg ggcttcgacc 1560 acaagatcag cgagtgccgt cagaaggaca acgtctgccg ccagtgcggg cagcagggcc 1620 acaacgcggc aaggtgccag aacccggtgg actgccgaaa ttgccggcat agagggcgac 1680 cctcgggaca tcatatgctc tcgagcgctt gcccaatata tggcgcggtg ctagcgaggg 1740 tgcaagctag acattaatgt ttagcttcat ccaagcgaac tgtggccgag ggcgagctgc 1800 gaccatagag cttggtgtcc gactcaggag ttcaagctca atgttcgcac tggtgcagga 1860 gccgtacctc agcagcgacg ggatggatgt gctgcctcaa ggaatgagga tttacactga 1920 ccggcgaggg aaggcagcca tcctagtaga tcagcaggat gccatctgta tgccagtgga 1980 gaccctcacc acagagtatg gcgtatgtct ggtagtaaaa gggagttttg gctcaatctt 2040 cctttgttca gcatactgcc aattcgatgc acctttggaa ccgtaccttc ggtacatgga 2100 tgcgatcctg ctacaggcta gcagaacccc cgcaatcctg ggcctcgacg cgaatgcagt 2160 gtcccccatg tggtttgcca aactctctcg gcatgccgag gggcaagcta actacagtcg 2220 gggtgagctg ctgtctgagt ggatgctgga ggcaagagcc gccgccctaa accagccaac 2280 agaggtgtac acgttcgata actacagagc tcgtagtgat atcgaagtga caatcgtcaa 2340 cgaggcagca tctatgtggg ccacatatga gtggagagtg gacgagtggg agttgagtga 2400 ccagaacatc attactgttg tggccgaaac aacctccacg agcgcagttg agagcatagc 2460 tcctgtgccg tcctggaacc tctccaatgc acgttggcga ttgttcgagg aggagatggt 2520 gagtagaaca gccgaacttc cgggaagctt ctctgagtcg ccgttggacc agcaggtttc 2580 caccctgcgc agtatagtgc atgatgtgtg cgatattgcg ttgggtagaa aatcgcctag 2640 atcgcccagc aggagagcac gttggtggac tgccgacctg agcgctgcga ggcgcgaagt 2700 tcggagactt cgtcgccgac tccaggatgc acggcatcgt gatgacgatg ttgctgcaga 2760 gcctttagta gtcgagctga gacggacctc agccaactac aaaaggctca ttgggagggc 2820 gaaaatggac gattggaaac gcttcgtggg agatcatgcc gatgatccat gggggcgcgt 2880 ctacaaattt tgccgaggtc gaagaaagtg cacggagatt gggtgcctcc gcgtgaatgg 2940 cgagctgatc actgattggg gtgattgtgc acgagtgctc cttcgcaact ttttcccagt 3000 tgcggagtcc gaagcaccga ctgccatcgt ggaggaagac ccaccggccc ttgaaacact 3060 cgagattgat gtctgtgttg gccgactgaa gagcaggcgt tcgcctggct tggacggcat 3120 caatggcact atttgtaagg cagtctggcg cgccgtacct cagcacctgg cagcgttgta 3180 ctcccgttgc atccgatcgg gatactttcc cagcgagtgg aagtgcccac gagtagtggc 3240 gctactcaag ggacccgaga aggacaagtg tgagccctcc tcatataggg gaatttgctt 3300 actgccagtt tttggcaagg tgcttgaggc aatcatggtg aatcgtgtaa gagaagttct 3360 tccggaaggc tgcagatggc agttcggttt tcgccaagga cgttgtgtgg aggatgcttg 3420 gagacacgtg aagagcagtg ttgacgccag tccagcgcaa tacgtgctcg gcacattcgt 3480 ggacttcaaa ggagcatttg acaacgtcga atggagtgct gtactgcgcc gactagccga 3540 cttgggatgc cgggaaataa gcttgtggca gagctttttt gccggccgaa gagcagtaat 3600 ccgaagcagt tccggtgcag tgaatgtgcc ggtaactaga ggttgcccgc aggggtcaat 3660 cagcggccca tttatttggg acatgctgat ggatgtactg cttcagcgcc tcgagccgta 3720 ttgcctgcta agtgcatacg cggatgacct gcttcttctc atcgagggaa attcccgagc 3780 cgtgctagag gaaaaaggag cgcaactgat gtccatcgtt gaaacgtggg gagcggaaat 3840 aggcgttgac gtttcgacca gcaagacggt aataatgcag ctcaaaggtg ccttgagacg 3900 tgcgcccacg gtaagattcg ctggagcgaa cctgccgtat gtgcgtagct atcggtacct 3960 tggcatcacg gtcagtgaag gaatgaaatt cctcacgcat atagcttcgc ttcgccagcg 4020 gatgaccgga gtcgttggag cattggcgcg tgtgctacga gtcgactggg gcttcagtcc 4080 tcgagccagg cggaccatat atgcgggact catggcacct tgtgtgctgt ttggtgcctc 4140 ggtatggtat gacaccgccg cacaggtagc tgccaggagg cgactgtcct cctgccatag 4200 actgatcctg cttggatgtc tatcggtatg ccgaacagtg tccaccgcgg cactgcaggt 4260 tcttgctgga gcccccccgc ttgacttggc cgctaagaaa ttagcggtca aatacaagct 4320 gaagcgtgga tacccgctgg aggagaacga ctggctctat ggcgaggaca ttgcgcgtct 4380 aagccgtgag caaaaagaga ttcgcctaga ggaatgcttg ctattaagtt ggcaaagcag 4440 atgggatgac gacagcgaac caggatgggt gacgcacagg ttcatcccgg acgtcaccct 4500 tgtctatcgg gatccaaatt ttggtttctc gatgatggcg tctttcttgc tcactgggca 4560 cgggacgttc aacgcatttc tgcatgggag agccctcagc gatactactg cttgcgcatg 4620 tggtgatcca tacgaggatt gggtgcacgt gttgtgcgcc tgccccctat atgcagattt 4680 agggggggag acctcgatgg acttggagtg cagcgcgttg gcgaggactg gacatttgaa 4740 agaatcctgg aggatcggga gagaacccaa cggctggctg tgttcgctga gttggtgttc 4800 cgcagaagaa ggggcacata gcctgacgca tcgccgtgtg gttagcgggc gagaatactg 4860 ccacagcttg ccgcagcttg tcgtaggagg cgactaatgc ggcaataggt cactccgccc 4920 gtgcttgtcg gagccaaaga gtgaggccga ccgagcctct aattttcggt accacgggtt 4980 gagcagttcc aagactgctc attgaggttg gcccccttgt gggagtatcg tggtggctgt 5040 ggttggtacc catatcgcgg gtagagcctt catgctcgac gtttgagtaa cggcgcgggt 5100 tgcgcaacac tcgggtgctg tgacccatag aacagtagag attttaggta gatctcgctc 5160 ctcagcaagg gggagtgctt gcccggcaag caagcactcg aattgctacc ggggtggtcg 5220 ctatgtacat agctatagct tccagaccgg gacgttttgt tcagcgtatt ggacacatgc 5280 atcatatgct cacttgtggg tgtatagggt gccgtggttg taatcccttc aatgtggaac 5340 acgccacgta aaacaagttc ggagggatcc gattcataca c 5381 // ID Dneoca1cons repbase; DNA; INV; 492 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dneoca1cons. XX OS Drosophila neocardini OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; cardini group; OC cardini subgroup. XX RN [1] RP 1-492 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with show less than eight percent divergence. CC D neoca1cons. XX SQ Sequence 492 BP; 137 A; 113 C; 120 G; 122 T; 0 other; tttgggtgcc tcatgagctg actcaaaaaa atttcttgga ctgaaataac gcctgcgatg 60 cactgctcaa acgtaataaa gtcgacccat ttttgaagtg gattgtgact ggagatgaaa 120 agtggatcat gtgcgacaac tccaagcgaa aacgatcgtg gtcgaagtgc ggcgagctag 180 cccaagccat ccctaagtcc ggattgacgg ctagaaaaat attgctgtgt ttggtgggac 240 tggaagtgaa ttattcacta tgaactgctt aactattgcc agaccctaaa tttagccatc 300 tattgcgagt agatcgaccg tttgaagcag gcgatcgacc agaagcggcc agaaatggtt 360 aatgggaata gtgttttgtt ccaccaggac aatgctcgac cacacacatc cttgatttct 420 cgccagaagc tacgtgagct cggatgtgga tatcctatcg catccaccgt actgcccaga 480 ccttgctcca ag 492 // ID Copia-15_CQ-LTR repbase; DNA; INV; 130 BP. XX AC AAWU01015815; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_CQ_; KW Copia-15_CQ-I; Copia-15_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-130 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 346-346 (2011). XX DR GenBank; AAWU01015815; Positions 53625 53754. XX SQ Sequence 130 BP; 33 A; 27 C; 27 G; 43 T; 0 other; tgttgagttg aagactgcgc gcaatcaagt aggcactgtg tgccggcagc ttggttacaa 60 ttattagttg aataaactta ctaccgattg gacgtatttc taatccactg cgtttctcat 120 tactcttaca 130 // ID Gypsy-191_AA-I repbase; DNA; INV; 3908 BP. XX AC supercont1.102; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-191_AA_; KW Gypsy-191_AA-LTR; Gypsy-191_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3908 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.102; Positions 1414743 1418650. XX CC 'CTGAC' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 201..1940 FT /product="Gypsy-191_AA-I_2p" FT /translation="MTPEDINAYMQSQKAMFDVMLRQMQQMNAAPAAAARA FT PAFVPLPPPLSVEGDMEQNFDFFIANWEHYGKNVEIDKWPAEDNERKVSLL FT MSVIGQAALKKYSNFELSAEEKADLTLAVAAIKAKVVIERNVNIDRLEFFE FT AQQHSAESIEDFVLRLKHLARLAKLCNLSAELVTFKIITANRWPHLRQKML FT TKTNITEAVTFCRNQEIGEARAEQSKGEVHKLKSSSSKLPRYKFCSDRHQF FT TKGTCPAWGKRCKKCGRKNHFERFCKEENEKKKKKKSVKVVKNCESELSSC FT SSAEESECEDNSSEEEVEIRKIYDHSGSGGRVLAELDLKLNGKWKSVQCEV FT DTGADTNIIGLKFLAEMAEGNQHEITPSKYRLQSFGGNPIQVLGETRISCK FT RNGKRYKLVFQVVKVDHIPLLSANASKTLGFVKFCNAVTFSPAEPSWSDLL FT TVYRVSALEIINKHKSVFEGYGRFPGVVSLEVNDSVPPRIQPPRRVPVALR FT AALKEELENLERDGIITKEPRHTEWVSNIVLFKRGHDKSAPLRICLDPIQL FT NQALKRPNLQFVTVDEILPALGKAKAQTDTVCG" XX SQ Sequence 3908 BP; 1161 A; 869 C; 970 G; 908 T; 0 other; tggtgtcaga agtgtcgtag ctattttccg gcgtcattga taaatcgcgg aaaagtttgt 60 cgaggaaacg tcggtttcga agcggtatgg catcatattc gcggaattag tgtccgtttc 120 cgagcggttt ttttatatcg cgaaatcaga aagtgctcgt tttttccggc ttcgatccca 180 agattttccc gttaccaaaa atgactccag aggacattaa cgcctacatg caaagccaga 240 aggcaatgtt cgatgtgatg ctgcgacaaa tgcagcaaat gaacgctgcc ccggcggccg 300 cagccagagc gccagcgttt gttccccttc cgccaccgtt atcagtggaa ggtgacatgg 360 aacaaaattt cgattttttt atcgccaact gggagcatta cggcaaaaat gttgaaatag 420 ataagtggcc ggccgaggat aacgagagaa aagtaagttt actcatgtcg gtaattggcc 480 aagccgcttt gaagaaatac agcaatttcg agctgtccgc tgaggaaaag gcagatttaa 540 cgttggcagt cgcagcaatc aaagcaaaag tcgtcatcga aagaaacgta aatatcgacc 600 ggcttgagtt tttcgaggca cagcagcact cggcggagtc catcgaggat ttcgtgctcc 660 gactaaagca cctggcgcga ctagcaaagt tgtgtaatct ttcagcagaa ttggtaacgt 720 tcaaaattat taccgcaaat cgttggccgc atcttcgtca gaaaatgttg accaagacta 780 atatcacgga agcggttaca ttctgtcgga atcaagagat cggagaagcc cgagctgaac 840 aatcgaaagg agaagtgcac aaattgaagt cgtcgtcgtc aaaacttccc cgctataaat 900 tctgtagtga ccgtcatcag ttcacgaaag gcacatgtcc agcgtggggc aagcggtgca 960 agaaatgcgg aagaaagaat cactttgagc gtttctgcaa ggaagagaat gagaagaaga 1020 aaaagaagaa atcagtgaaa gtggtgaaga actgtgaatc agagctgtcc agctgttcgt 1080 cagcagaaga aagtgaatgt gaggacaaca gtagtgagga agaagtagaa atccggaaga 1140 tctatgacca ctccggatca ggaggaagag tgttagcaga attggatctg aaactcaacg 1200 ggaaatggaa gtctgtgcag tgtgaagtag acacaggagc ggataccaac atcatcggtc 1260 tcaagtttct agcagaaatg gccgaaggaa accaacacga gatcactcct tctaaatatc 1320 gtttgcagag cttcgggggg aaccctatcc aagttctggg cgagacgcgg atcagctgta 1380 aacgcaacgg aaaacgatac aagttggtgt tccaggtagt caaagtggac catatccctc 1440 tgctgtcggc aaacgcaagc aagacacttg gatttgtaaa gttctgtaat gcagtcacat 1500 tttcaccggc tgaaccatct tggtccgatt tgctcaccgt gtatcgggta agtgctttgg 1560 aaataatcaa caaacataaa tccgtctttg aaggctatgg tcggtttccg ggcgtagtgt 1620 ctctagaggt taacgattcc gtacccccac gcatacaacc accgcggcgg gtccccgttg 1680 cattacgtgc agcactgaag gaggagttag aaaacctaga gcgagatgga ataataacaa 1740 aagaacccag gcacaccgag tgggtgagta acatagtcct attcaagcga ggacatgaca 1800 aatctgctcc attgagaatc tgcttggatc caatccaact aaatcaagca ctgaagcgac 1860 ccaacttaca gtttgtgact gtcgacgaaa tcttaccggc gctcggtaaa gctaaagcac 1920 agacagacac agtctgtggc taaagtgttt acaacagttg atgccagaaa aggattctgg 1980 catgtggttc ttagtgagga aagcagtcgt ctcactacat tctggacacc gtacggaaga 2040 ttccgttgga ccagacttcc attcgggatt agtcccgcac cggagatctt ccaaatgaag 2100 ctgcaggagg tattacaggg attgaatgga gtggagtgta ttgcggacga tatactagtg 2160 ttcggctgcg gagacacctt gcaagaagct ctcgtcgatc acaaccagtg tttggataac 2220 cttcttgttc ggctggagga aaatgaagtg aagttgaatc ggagcaaact gaagttgtgc 2280 caaagtgcgg tgaaattcta tggacatgtg atgaccgacc agggactaaa acctgacaag 2340 tcaaaggtgg aaacattcaa aaactatcca gtgccaaaaa accgcaagga acttcatagg 2400 tttatcggta tggtgaatta tttgtgccga ttcctgccga atttgagtga aaatttcagt 2460 gtgctccgaa agttgatttc cgagaaagaa ccgtggattt ggtcatcgga ccaacaagaa 2520 gaattggacc gagtgaaaag tctagtgtca gatgcgaaaa cattgcggta ttacgatccc 2580 aacgaaccgt tggtagtgga gtgtgacgca agttgtttcg gattgggcgt cgcggtgttt 2640 caacgtaacg gtgtgattgg ttacgcgtcg cgaacactaa ccgctactga aagaaactac 2700 gcgcaaattg aaaaagaact aatcgcgatt ctctttgcgt gtgtgaagtt tgatcagatc 2760 attgtcggta atccgatagt caccgtaaaa acggatcaaa agccgctagt aaatgttttc 2820 cgaaagcccc tgttgtcagc tcctcgtaaa ttgcaacaca ggcaaggaca atgtggtcgc 2880 ggacgcaatt tcacgtgccc ggctggtgga atgtccagaa gaagattgtt tccgtaaagc 2940 gactatatgc gaagtgttcc ggaaggtcga agtggttcag ctttcgtcgt ttctgagtat 3000 ctcggaagac atcatccagc agattgtcca ggaaacgacc aaggattcaa cgctccaaac 3060 tatcgtgtcc tacgttcgtt ccggctggcc tcgatcagta tcgaaagtac aggaaggtgt 3120 aaaagtcttt ttaagtaccg gaatgaactg tcagttcagg atggaatgtt gttccgtcac 3180 gaccagattg ttattccgta ttctctccgc aagaagataa cggacaaggc acacgtaggt 3240 cacaacggag tggagtcaac gctaaaacta gctcgagcaa acgtattttg gccgggcatg 3300 agcgcgcaaa ttcgagaaac cgttaaggaa tgctctgttt gcgccaaata tgcttcatca 3360 caatcaccag caccgatgca gacacacccg atccctgttc atccctttcc gggctatcag 3420 ctcaacccgg atacctgtaa ggaatggaca ccagcagtag ttatccagcg ccaaagcgac 3480 agatcctgcg tcattgacgt agagggcgct aactatcgac gtgaccgtag tcacctgaag 3540 ctacgtaatg taccacaaac gccattgaaa tctccctcaa ttactgaacc acccgagcca 3600 ccaacatcaa aaaagctacc tgttgcattt caactaccat tgcaacgcca gcgatggcca 3660 acaccacaca gcatcaacgg acaccgatgg tcaacactcc tgtcagcgag gtcccgaata 3720 tgttagagca aacaacgaga aacactaccc cgattcctcc agaatcggat tcttcagctg 3780 tcgccagtgg ttctcaactg atcacacctc ggcgtccaag aagagatatc aaagtgacag 3840 ctaaatttaa agattatctt ctaaattttg attaaactgt cttatcttta attttgcgaa 3900 agagaagg 3908 // ID Gypsy-28-I_NVi repbase; DNA; INV; 4636 BP. XX AC . XX DT 11-MAY-2009 (Rel. 14.05, Created) DT 11-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-28-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4636 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 992-992 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 127..4602 FT /product="Gypsy-28-I_NVi_1p" FT /translation="MSDVEEPSTALIAAVTKALADALRIAIPQVTTPVTNQ FT PESTSQVQPKAPPFKYQEYRSSDGTTVNDYFKRFEWALQLSKISEEEYGNF FT ARVYMGAELNNALKILVAPDTPESRTYQQIRTVLIAHFDVKKNKYAESVKF FT RQIVQLQNESIANFSLRLKQGATFCEYDTFLDRMLIEQLLHGLTDRDMRDE FT IIAKKPTTFKDAYEIAHTLESTRQTADEVTSTSNKIHALSYSTTHFKHKRN FT ASHTSRSASRKQDHGEQQGQYACYGCGERHKRSECPFRTSECHKCKKRGHI FT AKVCRASSSLSTSQIQSSEEPAQQIDTLKCFNAVEEIHVIDKSCGKKILQV FT LIEGHKLNMELDSGAPYGFIGSDTLRNLKPNFQLQPTSKKFMSYSQHRLNC FT IGTCSVNVSFGSTSRQLPVYVIQGSYDSLFGREWIAQFSHEIDWTELFSPI FT KVNALSTLPPSLTRDQQAQLDQLLTRYAEVFSETAGKLTGPPVKVHFKPGT FT SPVFARAQDIAYALRDKYSKAVDAKLASGFYKKVDFSEWASPTHVVIKKNG FT DIRITGNYKPTLNPRIIIDECPIPKPSDIFNKVKGAKIYAHLDITDAYTHL FT PADDEYSHALTLNTPTHGLVRPTRAVYGAANVPAVWQRRLKEILQGLKNVE FT NFFDDIIVWAESFEELLIILETCLIRLLENGVRLNRRKCVFATNSVEFLGH FT KLDTQGIHKSDSHIKAIRDAPKPSTPQELELFIGKATYYNSFIPDLATKAR FT PLRDMLLTSSFQWSPTADKAYEELKNILISPQVLMPYDPSLPLILATDASK FT VGLGAVLSHKLSNGIERPIAYASRTLTATEQRYPQIDKEALAIVWACQKFF FT NYLYARHFTLFTDHKPLTQIFHPEKSLPILCISRMANYADYLAHFNYDIKF FT KPTKANANADYCSRAPLPSTVDAIEEITELDSFDTFIINQINQFPVRAEQI FT AKETRKDSNLGKIIQLLEAGQDLSRHGYKAPESSYGLSSNCLIFEHRVVIP FT SSLRQAILNDIHSAHLGIVKMKGLARSFVYWPGIDADIELIAKSCAECAKH FT AHAPPKFNTHHWEYPKGPWERIHIDYAGPVAGKMLLVITDAYSKWLEVKVT FT SSSTSTATIDILDQLFATYGVPSIVVSDNGRQFVSDEFETFLKISGVKYHK FT LTAPYHPSTNGQAEKCVGTTKRALLKMDTTQGSLQRNLNEFLRQYRKAPHS FT TTGQPPALLFLKRNIRTRLDLVRPEPINEKISEKHQADFINTYREFKPLEH FT VYFLSGNPKLDKWIPGQINSRLGDLHYEISYMDKKYKRHVDQIRAFTKKSE FT EGEETRKKPERVTFFEDKIGANLTTLSTTSTRRRTRFYKNNVTPRSMSAQQ FT EVNTEVITPSYKAPSTSQTVSFKESPTIQATSTQESLISKSLPTSSVEMST FT TTTEAQSSTSHASLPPTVNDSFQNLSTSATLQEEFPFQTQDQRTERLPQEQ FT SFFIPNTPPIRRSSRIRKSRTIYSPP*" XX SQ Sequence 4636 BP; 1467 A; 997 C; 815 G; 1357 T; 0 other; ttttggtgtc agaattggtt tcgaaaatac gccaattact tgattttcta tattctacta 60 caataacacg cagagagaaa tagtgaaaag tgcatatttt cattacaaat caacgtcaac 120 ataaccatgt cggacgtaga ggaaccttct actgcattga ttgctgctgt gacgaaggcg 180 ttggcggatg ctctgaggat tgctattcca caagttacca ctcctgttac aaaccagcct 240 gaatctactt ctcaagttca acccaaagct cctccgttca agtatcaaga atatcgttcg 300 tccgacggaa cgacagtcaa cgattatttc aagagattcg agtgggcatt acaactgagt 360 aaaatttcgg aagaagaata cggtaatttc gctcgagtat atatgggcgc agaacttaat 420 aatgctctga aaattcttgt tgctcctgat acaccagaaa gtcgtacgta tcaacaaatt 480 cgcactgttt taattgcaca tttcgacgtc aagaaaaata aatatgcaga aagtgttaag 540 ttccgacaaa ttgtacagct acagaatgaa tccattgcta atttctcact acgattaaag 600 caaggagcta cgttctgcga atacgacaca tttctcgaca ggatgctgat tgagcagctc 660 ctacatggtc tcaccgatcg tgatatgcgc gacgaaataa tcgccaagaa acctactaca 720 ttcaaagacg cttacgaaat agcacataca ttggagtcaa ctcgtcaaac agcggatgag 780 gttacctcta cttctaataa gatacatgca ctatcttatt caacgacgca tttcaagcat 840 aaacggaatg cttcacatac atcgcgcagt gcttctcgga agcaagatca tggtgaacaa 900 caaggtcagt acgcttgtta tggatgcgga gaacgtcaca aacgcagtga gtgtccattt 960 cgtacttctg agtgtcataa gtgcaaaaaa cgtgggcaca tagcgaaagt ttgcagagct 1020 tcatcttcgc tttctacttc tcaaatacaa agttctgaag agcctgcaca acaaatagat 1080 acattgaaat gtttcaacgc agtcgaagaa atccacgtta ttgataagtc ttgtggtaag 1140 aaaatacttc aagtgctcat tgaaggccac aaattaaata tggaattaga ttctggtgca 1200 ccatatggtt tcattggtag cgatacactt cgtaacctca aacctaattt tcaattgcag 1260 ccaacttcta agaaattcat gagttactcg caacatcgcc tcaactgcat cggtacttgc 1320 tctgttaatg tctcatttgg ttctacatct cgtcaacttc cggtctacgt tattcaaggc 1380 tcttacgatt ccttatttgg tcgtgaatgg atcgcacaat tcagtcacga aattgattgg 1440 accgaactct tttctccaat caaggttaat gccctttcta ctcttccacc ttcattaact 1500 cgtgatcaac aagcacagct ggatcaactt ttgacaaggt acgctgaagt tttcagtgaa 1560 acagctggaa aacttactgg acctccagtt aaggtacatt ttaaacctgg aacttctcct 1620 gtttttgctc gtgctcaaga tatagcatat gcgttacgcg acaaatattc taaagcagtt 1680 gacgctaaac ttgcgtctgg attttacaaa aaagttgatt tttccgagtg ggcttcgcct 1740 acacacgtcg ttattaaaaa aaatggtgat attcgtataa ctggcaatta taaacctacg 1800 ttgaatccac gtataataat agatgaatgt cctattccaa aacctagtga cattttcaac 1860 aaagtaaaag gagccaaaat ttacgcacat cttgatatta cagatgctta tactcatctt 1920 cctgctgacg acgagtacag ccatgcttta accctcaata ctcctacaca tgggttggtt 1980 cgtcctacac gagctgttta cggcgcagct aacgttcctg cagtttggca acgtcgatta 2040 aaagaaatct tacaaggtct aaaaaacgtt gaaaattttt tcgacgacat cattgtatgg 2100 gctgaaagtt tcgaagaatt gttgattatt ctggaaacgt gtttgatccg tcttcttgaa 2160 aatggtgttc gtctcaatcg acgtaaatgt gttttcgcta caaattctgt agaatttttg 2220 ggtcataaac tagatacaca aggcattcat aaatctgatt ctcacatcaa ggcaattcgt 2280 gatgcaccta aaccttctac acctcaagaa ttagaattgt ttattggtaa agctacgtac 2340 tacaattcat tcattcctga cttagctaca aaagctcgac cacttcggga catgcttctt 2400 acatcatctt ttcaatggtc gccaactgct gataaagcat acgaagagct taaaaatatt 2460 ctcatttctc ctcaagtttt gatgccatac gacccatcct tacctctcat actcgccact 2520 gatgcaagca aagtaggact cggagctgtt ttatcacata aacttagcaa cggaatcgaa 2580 aggccaattg cttatgcgag ccgtacatta actgctactg aacaacgcta tcctcaaatc 2640 gacaaggaag ctttagctat tgtatgggct tgtcaaaaat ttttcaatta tttatacgct 2700 cgtcatttta cgctgtttac agatcacaag ccattgacac agatttttca tcctgaaaaa 2760 tcacttccta ttctttgtat cagtcgaatg gctaattatg ctgattactt agctcatttc 2820 aactacgata ttaagttcaa gccaactaaa gctaatgcaa atgcagatta ttgttcacgt 2880 gctccacttc cttcaacggt tgacgctatt gaagaaatta cagaacttga ttcttttgac 2940 acatttatta tcaaccagat caatcagttt cctgtacgag cagagcagat tgcgaaggaa 3000 actcgtaaag attctaacct aggtaaaatt attcaactac tcgaagctgg tcaagatcta 3060 tctcgtcatg gatacaaagc gcctgagtct tcttatggtc tatcttcaaa ttgcttgatt 3120 tttgaacatc gagttgtgat tccatcgtct cttcgtcaag caattttaaa cgacattcat 3180 tcggcgcatt tgggcattgt caaaatgaaa ggtttggcac gatcatttgt ttactggcca 3240 ggaattgatg cagatattga acttattgct aaatcttgtg ctgagtgtgc caagcatgct 3300 catgctcctc ctaaattcaa cacacatcat tgggaatatc ctaaaggccc atgggaacgt 3360 atccatatag actacgctgg accagtagct ggtaagatgc ttctagttat cactgatgct 3420 tatagtaaat ggcttgaagt caaggtcaca agttccagta ctagcactgc tacaatcgat 3480 attctcgatc agttattcgc tacttatggt gtacctagca ttgttgtttc agataatggc 3540 cgtcaatttg tttcagatga gtttgaaaca tttctcaaga taagtggagt gaaatatcac 3600 aaacttactg ctccttatca cccttcgaca aatggtcaag cagaaaaatg cgtgggtaca 3660 acaaaaagag ccttacttaa aatggatacg acacaaggtt ctttgcaacg gaacttgaat 3720 gagttcttga gacaatatag aaaagctcct cattcaacta ctggacaacc acctgcttta 3780 ttatttttaa aacgaaacat tcgcacacga ttagatcttg ttcgacccga acctatcaac 3840 gagaagattt ctgagaaaca tcaagctgat tttatcaata cgtaccgaga atttaaacca 3900 cttgaacatg tttattttct ttccggtaat ccaaaattag ataaatggat tccaggacaa 3960 attaattctc gtctaggaga tctgcattat gaaatcagtt acatggataa aaagtataaa 4020 cgacacgtcg atcaaattcg tgcttttaca aagaaatcag aagaagggga agaaacaaga 4080 aagaaaccag aaagagtaac attttttgaa gataaaattg gtgctaattt gacaacgctt 4140 tccaccacaa gcacacgacg tcgtactcga ttttacaaaa ataacgtaac tccaagatct 4200 atgagtgcac aacaagaagt aaatactgaa gtaattacac catcatataa ggctccttcg 4260 acttctcaga ctgtgtcgtt taaagaatca ccgactatac aagctacgtc aactcaagaa 4320 tctttaattt caaaatcttt accaacttca agtgttgaga tgtctacgac tacaacggaa 4380 gctcaatcct caacatctca cgcatctctt ccacctactg taaatgattc attccaaaat 4440 ctatctacct ctgcaactct tcaagaagaa tttccgtttc aaactcaaga tcaacgtact 4500 gaacgtttac ctcaagaaca gtcatttttc attccaaaca caccgccgat tcgacgatca 4560 tctcggatac gcaaatcacg aactatttac tctccaccat gataattcaa gaaatcttta 4620 taaaggaggg aggaaa 4636 // ID Copia-3_AA-I repbase; DNA; INV; 4069 BP. XX AC supercont1.3; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_AA_; KW Copia-3_AA-LTR; Copia-3_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4069 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.3; Positions 2026476 2030544. XX CC 'GTATT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 721..4059 FT /product="Copia-3_AA-I_1p" FT /translation="MKKKNPIKCHYCGNPGHKQKECRKFARDSEKQPKQQE FT KGTSKKEKVSFALAAGGTTDTEAIKLWVVDSGATWHTVADRKFFRELRESE FT VKFVTLADGKQAEVHGEGCCVVECVDEHGNRREMDISFALYAPTLSMNLLS FT VPALVQKQATVVFDSAGCRILRGEATVGVARLKQGIYQIKQPENVVMVAGS FT HHNKDCQHVWHRHRDPKAIDHLLKAQLVDGITLRRCSIDEKCECCLQGKTP FT RTPFPSESQSKSNAPMDLVHTDVCGPVEVPSVSGYRYFMTVIDDHSRFCVL FT YLLKEKSEVATKIEEYVAYVKTLFGRKPKAIRSDQGGEYSGKRLRAFYKRE FT GISAQYTAGYRPQQNGIAERKNRSLVEMVRCMLFDAHLEQRYWAEALTTAV FT HLQNVLPKKPLDVTPYELWTGVKPKVDHFKIFGCSAWIHIPKVKRKKLDAT FT ARKLTFVGYSSEHKAYRFLDKATRKVYVSRDAKCIEEDVKIDPTDAATEGK FT NVDEVFFDTGRLQNHPVVPVAQPEVEPEANTGAQHQIEIEARPEADEAEGY FT DTAIEDTTQTDTDFEGFSDTESPHFGEPRRSARPTQRFRDVTGVTKHVSDE FT PQSYSDAVEGPEASVWQAAMQEEIEAFQETGTWELVPLRPGRKPVGCKWTY FT KKKEDESGQVVWCKARLVAQGFSQKFGQDYDEVFAPVVRQTTLRTILVLAS FT QEHKLVKHLDIKSAYLHADLEEEVYMKQPPGFDQKDGMVCRLKRCIYGLKQ FT SARAWNRKIDGVLKAMGFKPADSDMCLYIRNRNGKLSYILMYVDDLLVVCN FT SEAEFDEIRRLLQDKFKLSCLGELKQFLGIRVEKAGDHYTLDQASYIDKLI FT RRFEHDEAKPSKVPIDPAYLKQKEETELPTNASYRSLVGSLLYVAVNTRPD FT ICVAVSLLGRKVTNPTNRDWTEAKRVLRYHKATKNHRLHLGSGSGETECFV FT DADWAGDEGDRKSHSGFLLKFGGGLVDWGTRKQSCVALSSTEAEFVALADG FT CQQLMWYRKLLNDLKQSSPKPIIIWEDNQSCIKIVESERLERRSKHIDTKY FT AYTKDLQQQRVIELGYMPTERMEADLMTKPLERIKLERHRNAIGVKEPSEM FT SICQR" XX SQ Sequence 4069 BP; 1099 A; 1044 C; 1152 G; 774 T; 0 other; ggttatgggc cgctcggcga aaaagtggtg acgccatttg taaaaataaa gatttttcga 60 tagttcgcgg taaaaccgga agaaaatcga agtagtcgcg agtttttcgg tcggttttcg 120 cgagtgtcat ggaaggaaat cgattttcct tgcaaaagct gggtaacagc aattacccga 180 cgtggcgctt caaggtcgag ctactgcttg ttcgagagga gctctgacgc tacgtggatc 240 cgggagtgaa gcccgatgag gaagccgatg cgacgtggaa tgcgggcgac gccaaggcga 300 gggccaccat cggcctgtta attgaagaca atcagcacgg actgattcgc gcggcacgga 360 cagccaaaga cacgtggacg gcgctgcaaa atcaccatca gaagacgagt ttgacgtcga 420 aagtgtcgct actcaaacaa atttgcgaca agcgctttgc cgacggtgat gacatggcgg 480 agcatttatt cgagatggag gaattgtttt ctcgtttggc aaacgccgga caggaactcg 540 gggagaacct cactgtcgcg atgattttga gaagcctcgt tcgacaccct cacaaccgcc 600 ctcaaaagcc ggtcagatga tgacctaaca ctggagctgg tcaagaggaa gctcctcgac 660 gaggcagcaa agcgacaggg tgctggatca agctcggcgc tgcgggtcgg ccatggcagc 720 atgaagaaga agaatccgat caagtgccac tattgcggaa atccgggaca caaacaaaag 780 gaatgccgta agtttgcccg tgattcagaa aagcaaccga aacagcagga aaaaggcacc 840 agcaagaagg aaaaagtttc gttcgcgctg gctgcaggag gaacaacgga caccgaagcc 900 atcaagctgt gggtcgtgga ttcaggtgcc acttggcaca ccgtggcgga tcgaaagttc 960 ttccgggaat tgcgtgaaag tgaggttaag ttcgtcacac tggcggacgg aaaacaagcg 1020 gaggtgcacg gcgaaggttg ctgcgtcgtc gaatgtgtcg acgagcatgg gaatcgtcgt 1080 gaaatggaca tcagctttgc gctctatgct cccaccctct cgatgaacct gctttccgtt 1140 ccagctctcg tccaaaagca agctacagtt gttttcgact cggcaggatg ccgaattctc 1200 cgtggagaag ccaccgtggg cgtggcacgt ttgaagcaag gtatctatca aatcaagcaa 1260 ccggaaaacg tggtgatggt ggcaggcagc caccacaaca aggattgcca gcacgtttgg 1320 catcgccacc gggacccgaa agcgatcgac catttgctga aagcccaact ggtggatgga 1380 attacactcc gccggtgcag catcgacgag aaatgtgaat gctgtttgca aggtaagaca 1440 ccacgaactc ctttccctag tgagtctcaa tccaagtcaa atgcaccaat ggacctggtg 1500 catacggacg tttgcggtcc ggtcgaggtg ccgtcggtta gcggttatcg gtactttatg 1560 accgttatcg atgaccacag ccgattttgc gtcctctacc tactaaaaga gaaatccgag 1620 gtcgcaacga aaatcgagga gtacgtcgcc tacgtcaaaa cacttttcgg tcggaaaccg 1680 aaagcaatcc gctctgatca gggcggcgag tacagcggaa agcggctgcg agcgttctac 1740 aagcgggaag gaatatcggc gcaatacacg gctggttata gaccccagca aaacggcatc 1800 gcagagcgga aaaaccgttc gctcgtggaa atggtgcggt gcatgctctt cgatgcacac 1860 ctggagcaac gctattgggc agaagcactg accacggcag ttcacctcca gaacgttctg 1920 ccaaagaagc ctctcgacgt cactccatac gagctttgga ccggcgtgaa accgaaggtc 1980 gaccacttca agattttcgg ctgcagtgct tggatccata tcccgaaagt aaaaaggaag 2040 aagctggacg ccaccgcacg aaagttgacg ttcgtgggtt actcctcgga gcataaagct 2100 taccgcttct tggacaaagc cacacgaaaa gtctacgtga gccgggacgc caaatgcatc 2160 gaggaagacg tgaagataga cccgacggat gcggccactg aaggtaagaa tgtcgacgaa 2220 gttttctttg atacaggtcg tctacaaaac catcccgtcg taccggttgc tcaaccggaa 2280 gtcgaaccgg aagccaacac cggtgctcaa caccagatcg agatcgaagc acggcccgaa 2340 gcagacgaag ctgaaggcta cgatacagcg attgaagata cgactcaaac agacaccgat 2400 tttgagggat tcagcgacac ggagtcgcca cacttcggcg aaccgagaag gtctgcgcgt 2460 ccaactcaac ggttccgtga cgttaccggc gttacgaaac acgtcagcga tgagccgcaa 2520 agttactccg acgcggtcga aggtcctgaa gcgtcggttt ggcaggcagc gatgcaggag 2580 gagatagaag ccttccagga aaccggtact tgggagctcg tccctctacg ccccggacgc 2640 aaaccggtag gatgcaaatg gacctacaag aagaaggaag acgagtccgg gcaggtagta 2700 tggtgcaagg cgagactagt cgcccagggt ttttctcaaa aattcggcca agactacgac 2760 gaggttttcg cccccgtcgt gcgtcaaacg accctgcgga ccatcctcgt actcgccagc 2820 caggaacaca aattagtcaa gcacttggac ataaaaagcg cgtacctaca cgccgatttg 2880 gaggaggagg tgtacatgaa gcaacctccc ggcttcgatc aaaaggatgg aatggtatgc 2940 cgcctcaaac gatgtattta tgggctgaaa cagtccgctc gagcttggaa caggaaaatc 3000 gacggagtat tgaaggccat gggattcaag cctgcggaca gtgacatgtg cttgtacatc 3060 cgaaatcgca acgggaagct gagctacatc ctgatgtacg tcgacgactt gctcgtcgtt 3120 tgcaactccg aggccgagtt cgacgagatt cgtcggctgc tgcaggacaa attcaagctt 3180 tcctgcctcg gtgagctcaa gcagttcttg ggaatacgcg tcgagaaggc tggtgatcac 3240 tacacacttg accaggccag ctacatcgac aagttgatac ggcgattcga acacgacgaa 3300 gccaagccat ccaaggtacc aatcgacccc gcgtacctca agcaaaagga ggagacggaa 3360 ctacctacta acgcctcgta ccgtagtttg gtagggagtt tgctctacgt cgcggtgaac 3420 acacggccgg acatctgcgt ggccgtctca ctgcttggaa gaaaagtcac caacccgact 3480 aaccgagatt ggacggaagc taagagggtc ctccgatacc acaaggccac gaagaaccat 3540 cgtttgcacc tgggctccgg atccggcgaa acggaatgct tcgtggacgc cgactgggcc 3600 ggagatgaag gcgaccggaa gtcacattcc gggttccttc tgaagttcgg tggcggtctc 3660 gtcgactggg gaacacgaaa gcagtcgtgt gtcgccctgt cctcaacaga ggccgaattt 3720 gtcgcgttgg cggatggatg ccagcagctc atgtggtacc ggaagcttct caacgacctc 3780 aagcaatcat cgcccaagcc gatcatcatc tgggaagaca atcaatcgtg cataaagatc 3840 gtcgagtcag agcgtcttga acggcgttct aagcacattg acacgaagta cgcctacacc 3900 aaggatctcc agcagcagag agtcattgaa cttggctaca tgccaacaga gcgcatggag 3960 gcggacctca tgacgaagcc gttggaacga atcaagctag aaagacacag aaatgccatc 4020 ggcgtcaaag agccatccga gatgtcgatt tgccaacgtt gaggagaag 4069 // ID Tx_mos repbase; DNA; INV; 1303 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Mosquito transposon DD37E - a consensus. XX KW DNA transposon; Transposable Element; Tx_mos; KW putative transposase. XX OS Toxorhynchites amboinensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Toxorhynchitinae; Toxorhynchites. XX RN [1] RA Shao H. and Tu Z.; RT "Evolutionary divergence of unique DD37E transposable elements RT and evidence for horizontal transfer between distinct mosquito RT species."; RL Unpublished. XX RN [2] RA Gentles A. and Jurka J.; RT "Mosquito transposon DD37E - a consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 1303 BP; 412 A; 258 C; 302 G; 330 T; 1 other; tgcttcattt agaattggaa caaattcttt tgctcatatt gtggcgctcc tggtggacgg 60 atttggaagc tcttggcgcc cacgtgtcgg gaattttgtc agcttcacgt atgtattttt 120 gacattccgc aaatcgactg tactttgtaa acaatcaaca tggaagccga aagaagggaa 180 aaaattgtgc acagttattt ggaaaatcca ttgtggtctg catctaggct agctaaacag 240 ctgaaattgc ccagaaatac cgtatggcgc gttatcaaac ggtataagga aacattgacg 300 acgattcgga agcctcaagc caatcgtcgg agtggaactg tcgaccggaa actgcgtggt 360 aagattttga agacgattaa gaggaatccc aatctgtcgg accgtgattt ggccagaaaa 420 ttcggtgctg cycatagtac cgtgaggaga actcgactcc gggaaggaat caagtcgtat 480 cgagctagca aacagccaaa tcggaccata aaacagaata gtgtggccaa aatccgtgct 540 cggaagctat acgaccaggt gctgaccaag ttcgacgggt gtcttctgat ggacgatgaa 600 acctatgtca aggctgactt cgggcaaatc ccaggtcaaa aattttactt ggcaacggct 660 cggggggatg ttccagccaa atttaaattt gtttttgccg ataaatttgc acgaaaattt 720 atgatttggc agggcatttg cagctgtggc aaaaaaacga aagttttcgt tacaaacaag 780 acaatgacgt cggaactata ccaaaaagag tgtctccaaa aacgaatttt gccgttcatt 840 cgatcccacg accatcccgt aatgttttgg ccagatttgg caagctgtca ttacagcaaa 900 gtcgttcaag aatggtatgc agagaaaggg gtccagtttg ttccgaaaaa ccttaaccca 960 cccaactgcc cccagttccg ccctattgag aaatactggg caatcatgaa gaggagactc 1020 aaggcaaagg gaaaggttgt caaagacatc aatcagatga agacctggtg gaataagata 1080 gctaaaacga tggacgaaga aggtgtgcgc cgcctaatga gccgtgttac aggaaaaatt 1140 cgagaatttc ttcgaaaccg tgacgaataa ttttatccgt attttttctt aaaagtatga 1200 agaaaacgct acatttgtat aaaaaacaga tcttgaattc aataataaat aactgaaata 1260 caggcaattg tttttgttcc aattctatat gaagcaagct tta 1303 // ID Chapaev3-4_HM repbase; DNA; INV; 2333 BP. XX AC . XX DT 26-DEC-2008 (Rel. 13.12, Created) DT 26-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE an autonomous Chapaev DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2333 RA Bao W. and Jurka J.; RT "Chapaev3 DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1829-1829 (2008). XX DR [1] (Consensus) XX CC TSD is 3-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 454..2106 FT /product="Chapaev3-4_HM_1p" FT /translation="MPRTCKNQADNFCYICGEMTLKRCRRRLTPHVKKLYE FT LYFGCKVGDQDKNWAPHACCVRCASSLSSWANRTGGGPTFGVPMVWREPQN FT HCSDCYFCSVNISGRNSKGKKAIVYPNLPSAIRPVPHSAELTVPEPPSEIP FT NTESSDSDEDTDDDDSYELPEDTERKPHFITGSDLKDLVRDLYLTKQQSEL FT LASRLQEWHLLAGDARVSSFRKRSRELQQYFSMDKELCFCNDISGLFDDLG FT IQYDSSMWNLFIDASRYSIKAVLLHTGNVLPSIPIAHSVTLKESYVNISLI FT LDRIKYRDHRWSICADLKVVAILNGLQAGFTKYMCFLCKWDSRMRSEHYMR FT KCWPQRETFEVGSHNVIHEPLVDKEKIVLPNLHIKLGIFKQFVKALDHDKP FT AFQYLQTKFPKISDAKIKEGILVGPQIRELMLDETFSVTMDQYELAAWDAF FT KRVCQGFLGKHIAPNYADLVDRLIASYQQLGCNMSLKLHFLHSHLSFFSVN FT GDVYDENGERFHQSIATMESRYKGKWSPAMLADYCWNLHRDEPDAAFKRQA FT KYN*" XX SQ Sequence 2333 BP; 733 A; 416 C; 469 G; 715 T; 0 other; cactgtacaa caacgaaaaa aaaacggaaa tgccattgac actaccttat gtattttgat 60 ttggagatta cgaaaataat gtccgttttg ttgtgtgacc caccaatttt gacggaaatt 120 cacattttca aattttcgat tttcactatg aaagttgtca atgtgatttt tattatattt 180 aatagcagtt gtttatgatt tatttattag cattttttta tcatatacct atttgcattt 240 gtttaacctt gtattagact aaatgtaagt ctgttttatt ttagataaac tacagttagc 300 caagttatat atagatataa tacagtgtat gatataatac agttgcattt gtttaacctt 360 gtattagact aaatgtaagt ctgttttatt ttagataaac tacagttagc caagtaatat 420 atagatataa tacagtgtat attgaaacaa gaaatgccca gaacttgtaa aaatcaagcc 480 gacaactttt gctacatttg tggcgaaatg actttaaaga gatgcagacg aagattgaca 540 ccacatgtga aaaaactgta tgaactatac tttggttgca aagtcggcga ccaggataag 600 aactgggcgc ctcatgcatg ctgtgtgagg tgcgctagtt cactatcgtc gtgggcgaac 660 aggactggtg gaggaccaac atttggggtg ccaatggtgt ggcgtgaacc ccaaaatcat 720 tgttcagatt gctatttctg ttctgttaac atatcaggtc gcaattctaa gggtaagaaa 780 gctatagtgt acccaaatct accttcagct attcgtcctg ttcctcattc ggctgaactc 840 actgttcctg aacctccatc ggaaatccca aacacggaaa gttctgattc agacgaggat 900 actgatgacg atgatagtta cgagttgcca gaagatacag agcgtaaacc ccacttcata 960 acaggatctg acctgaaaga tctcgttcga gatttgtatt taacaaagca acagagtgaa 1020 ttgctggctt cacgtctcca ggagtggcat ctgttggctg gtgatgcaag agtgtcgtca 1080 tttagaaagc gatcacggga actgcagcaa tacttctcta tggataaaga actttgcttc 1140 tgtaatgaca tcagtggatt gtttgatgat ctaggaatac aatacgattc atctatgtgg 1200 aatttgttta ttgacgcatc acgttatagt attaaggcag tattactgca tactggaaat 1260 gtgctgcctt ccattcccat agcacattcg gtaaccctga aggaaagcta tgtgaatata 1320 tcacttattc tggacagaat taaatacaga gatcacagat ggtccatatg tgcagatttg 1380 aaagttgttg ctatactaaa tggattgcag gctggtttta ctaagtacat gtgtttcctc 1440 tgcaaatggg atagcaggat gagatcagaa cattatatga ggaaatgttg gccacaacga 1500 gaaacatttg aggttggttc ccataacgtc attcatgaac ctcttgtgga taaagagaag 1560 atcgtcttgc ccaatcttca catcaagctt ggaatcttca aacagtttgt aaaggcgtta 1620 gatcacgaca aaccagcatt tcagtatttg cagaccaaat ttcctaaaat cagtgacgcc 1680 aaaattaaag agggcatttt ggtggggccc caaatacgag aactgatgct ggatgaaacg 1740 ttttcggtaa cgatggatca atatgaactt gcagcatggg atgcattcaa acgggtgtgc 1800 cagggatttc ttggaaaaca tattgcacca aattacgctg atttggtaga cagattaatt 1860 gcatcctatc aacaactggg ttgtaacatg tcgctgaagc tccactttct tcattcccat 1920 ctttccttct tctctgtcaa tggagatgta tatgatgaaa atggcgagcg ctttcatcag 1980 agtattgcca caatggaaag cagatacaag ggaaagtgga gccccgccat gcttgcagat 2040 tactgctgga atttgcaccg cgatgaaccc gatgcagcat tcaaacgcca ggctaaatac 2100 aattaaatac aagtaaacaa tttgtgttac aagtgataat aaacatgttc atacacttat 2160 tcatatgtat agaaaaatgc atcttacgtt acatctttaa atcgttattt caaaaaattg 2220 tgacgtgcta cagcataact gattacatat ttggaatcag cacgtctgaa ttaataaaga 2280 taacctaatt tggttcttgt ttttttcaaa aagttaaaaa ttgttgtaca gtg 2333 // ID Sola3-4_HM repbase; DNA; INV; 5270 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Hydra magnipapillata. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5270 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1263..4205 FT /product="Sola3-4_HM_1p" FT /translation="MCISSLHIRNRMETKKKICMHPNHKVQTGKKALLGRP FT AQISMTLQISSHYNSYIPIGSIICNRHRKEYYSETRNKETKIITEHESTST FT ATDIDYIPDEFLQNQEKLNTSNEIREKITACLGTSPIKFQFKRKLELINES FT TKQQMRNKYKRMEETFRKKFAESIAPGQSEEFLKVLEGDENAETEIPDEIE FT KLVEMYKNSDATSQLVILAFIDHNKYTKQFLCNTFKCTKYKIELARKWKAS FT QKGFVLPQKNLFTRSRLSIPKCEHFLDFIFLSGLLQDVSYGISKIKYDSGD FT VETVASAILTAKNSHTIALYRESCKEIAYEPLSNSSLMRILNGIKPSHRKC FT LAGLNDTAAAGMNAFATLEQFANIFNKKSTVEKLERSKRYLKTKYKLHCSY FT LENEIATHSTRFALSDPENIKMQTTSILNDSVCYECNELFEVLDEVNQIGN FT ENDVGDDVLYDIKVSIGSITEYMKHLLRDAQQKKAKNDAYKQLDDKTAFWL FT KDYSQKILPVKFREGQKDYFGKKGISMHIDVFFTKKNDILKKHVYLTILYH FT CDQGVSSVISIAESVLNQFRLDEPEIVNIFAKSDNANCYHANYSVESMYIL FT CKKKNFKLLRYDFNEPCCGKDQCDRESASAKTILRSYVDAGNDIVTAEDVA FT KAVKYGHGVKNGMVSVAKIENVTLTGPKIPNISNYHSYEFQENYMTLWRYY FT EIGDGVKVLYNNLSVEPSIELLVPFTSTEGSFQRKTQSGKKRKKNCGLNTL FT HFCQETGCIESFETVIELEEHMLTEQHKXSEQSSSLDKVKKVYTEKLKATL FT QVHSNSMTASVKITPIPINNIYSQKFHQLGWALLTRKTFRFSIKLKKLLYE FT YFINGENTGNKMSPEEVNTVIRKNLTADEFVSSQQIRSLFSRWSKLLRIGK FT LTPLTNEEDNNNELENEEAKKYQEEIHKLATELSVIWLKDDWVAVVYKNQW FT YPGVVEEIKILYILFNQKLIXYKTK*" XX SQ Sequence 5270 BP; 2026 A; 763 C; 809 G; 1668 T; 4 other; gagcgacttt ctgtaagaat caggcagagc tcaatagtga ccctcttcaa aattgctaat 60 tttttaatat gttgttgcta ttaatattac gaagctcaat tccaaaattt tttcaaaaaa 120 tattaattgg ttctctagat attagatgtc aaagtttgac taaatttttc gaaatattga 180 ccaaatggtg ttgaaggcat gtttaaacta aaaaaaaaca acaaagtaaa tactttgcaa 240 tgagtatttc actttattat tgtttattgg ctagttcaga ccaaaaataa ataaaattgg 300 taaaggaaat atttttcttt gtctaaatat catgcttcaa aatacgcaaa aaatgatttt 360 tttgtcattt ttgatctttg ttttcattca aagccgcgat taacccacag cttttttttg 420 ttattataaa tctttatcta gttaagtacc taaaagtata aaaacatcta tggggaagca 480 aaggcttagc cgcaaaattg aactcaaaaa cgatattcta atctttttac gcattgattg 540 cgacatgata atggctaaag cccggggttc actgtaaaat tcatttaaaa aaaacttgta 600 ttgaccttta atctacctta tttataaaaa atacttatgg ggtattgcaa tagttcagaa 660 aaaataacat ataaatagta tagatatttt cttggtgctt ttatatttat ttataaacgc 720 ttgaattcaa tatgtttctt tgttactcta tcatattttt tgagtataat ttttactccg 780 aatatgtttc ccgagtatga ttttgagtat gactatttta tattataata aatctcattt 840 gcgtatttat agttagttwa gattttttac attttgatta aatgtcttca aaaaatattt 900 cttacaacac gtgttgttgc tttgaacgga ataattgtgg accttacatg cgacctactt 960 caaatgtaga tgaaatgttt gaaacaagcc aactcactaa aaatatacaa ccacatctta 1020 ccaaccttaa ggtaaaataa tattatacta tttttgtaaa aactcgtgtt taaatagatt 1080 tgtaacttgt aaatcctaat aattagtttt tattaatttg tctttaaatc aaaataaaga 1140 aggctcgtaa aataataaaa tttttttaag gtcttaaaga gctatgagct agcaggtgat 1200 tggaatgaaa aatctctaat tgaaaatcgt ttaaacagag attttaacaa tacagagaaa 1260 atatgtgcat atcatcgcta cacattagga ataggatgga aacaaaaaaa aaaatttgta 1320 tgcaccctaa tcacaaagta caaacaggga agaaagcatt attagggcga cccgcacaaa 1380 tatctatgac acttcaaata tcatcccatt ataattcata cattccaatt ggttcaatta 1440 tatgcaatcg tcatcgaaaa gaatactatt ccgagacaag gaataaagaa acaaaaataa 1500 taactgaaca cgaatctaca tcaacagcta ctgatattga ttatatacca gatgagtttt 1560 tacaaaatca agaaaaactt aatacttcaa atgaaatcag agaaaaaatc acagcntgtc 1620 ttgggaccag tccaattaaa tttcagttta aaagaaaact tgaacttatt aatgaatcca 1680 caaaacaaca aatgcgaaat aaatataaac gtatggaaga aaccttcaga aaaaaatttg 1740 cagaatctat tgcgcctggt caatcggaag agtttttaaa agtgcttgaa ggtgatgaaa 1800 atgccgaaac agaaattcca gatgaaattg aaaaacttgt agaaatgtat aaaaacagtg 1860 atgcaacaag tcaacttgta attttggcat ttattgatca taataaatac acaaagcaat 1920 ttttatgcaa cacatttaag tgcacaaaat ataaaataga acttgccaga aaatggaaag 1980 cctcacaaaa aggttttgta ctaccccaaa agaatttatt tacaagaagt agactttcaa 2040 ttccaaaatg tgaacatttt ttagatttca tctttttaag tggactgtta caagacgttt 2100 cttatggtat ttccaaaata aagtatgaca gtggtgatgt tgaaactgta gctagtgcca 2160 ttttaacagc aaaaaatagt cacaccattg ctttatatag ggaaagttgc aaagaaattg 2220 cgtacgagcc tctgtccaac tcaagtttaa tgcggatact aaatggaatt aaaccatctc 2280 atagaaaatg cctggctggt ttaaatgaca cggctgctgc tgggatgaat gcatttgcaa 2340 cacttgaaca attcgcaaat atatttaata aaaaatcaac tgttgagaaa cttgaaagat 2400 caaaaaggta tttaaaaact aagtataaac ttcattgcag ctatttggaa aatgaaattg 2460 caacacattc aactcgattt gccctttctg atcctgaaaa cattaaaatg cagactacaa 2520 gcattttaaa tgattctgta tgctatgaat gcaatgagct gtttgaagtt ttagatgaag 2580 tcaaccaaat aggaaatgaa aatgatgttg gtgatgatgt attatatgac attaaagtat 2640 caattggcag catcacagag tatatgaaac acctcttaag agatgcacaa cagaaaaagg 2700 caaagaatga tgcatataaa caactagatg acaaaacagc attttggttg aaagactatt 2760 cacaaaagat attacccgtg aagtttagag agggtcagaa ggactacttt gggaagaaag 2820 gcataagtat gcatatagat gtttttttta ccaaaaaaaa tgacattctc aaaaagcatg 2880 tttacctaac aattttatat cactgcgatc aaggtgtttc ctctgtaatc tctattgctg 2940 aatcagttct aaatcagttt cgcctagatg aaccagaaat agtaaatatt tttgcaaaat 3000 cggataatgc taactgttat catgctaact attcagttga atctatgtac atactctgca 3060 aaaaaaagaa ctttaaactt ctacggtacg actttaatga gccttgctgt ggaaaagacc 3120 aatgcgatag ggaaagtgca tccgccaaaa caatactaag aagttacgta gatgctggca 3180 atgacatagt aacagcagaa gatgttgcaa aagcagtgaa atatggtcat ggtgtcaaaa 3240 atggtatggt tagtgtagca aaaatcgaga atgttacttt aactggtcct aaaattccaa 3300 atataagcaa ctaccactcc tatgagttcc aagaaaacta catgactttg tggcgatact 3360 atgaaattgg agatggagtt aaagtgcttt acaacaacct ttcagttgaa ccatcaattg 3420 aattgcttgt tccattcaca agtactgaag gaagctttca gcgaaaaacc cagagtggaa 3480 aaaaaagaaa aaaaaattgt ggtttaaata cactacattt ttgccaggag actggttgta 3540 ttgaatcttt tgaaactgtt attgagttag aagagcatat gctgacagaa caacacaaaa 3600 ratctgaaca atcttcttct ttagataaag taaaaaaagt ttatactgaa aagttgaaag 3660 caactttgca agttcattct aattctatga ccgcatcggt taagatcacg cctatcccaa 3720 taaacaatat atattctcaa aagttccatc aattaggttg ggcattgcta acaagaaaga 3780 cttttcgttt ttccattaag ctgaaaaaac ttctttatga atacttcata aatggagaaa 3840 atacaggaaa caagatgtca ccagaagaag taaacacagt gataagaaaa aatcttacag 3900 ctgatgagtt tgtttcaagc caacaaataa gatctttgtt ctctagatgg agtaagctat 3960 taagaattgg aaagttgact cctctgacta atgaagaaga caataacaat gaacttgaaa 4020 atgaagaagc caagaaatat caagaagaaa tacataaact agcaaccgaa ctttctgtta 4080 tatggttgaa agatgattgg gtggcagttg tttacaaaaa ccaatggtat ccaggagttg 4140 ttgaagagat aaaaattttg tatatattat ttaaccaaaa acttatacya tataaaacta 4200 agtaaaaagc taacattaca tcaacaacag tattaaaact tgttgcaata ttagttaaac 4260 taaataatta tttattttag aaatgataag ttcatttatc cataaattta aaatattttt 4320 aaatattttc agataactga tgttggtact ttaattgact gcatgaaaag tgtttctttt 4380 ggaaaaaaat tgttatcaat ggcctaatca gaaagatcgc atggtatatt cagatgaaga 4440 cattttatgt aaaataagtc ctccaactac aattaatcac cacagtgatt acatcttagc 4500 aacaaatgat tttgaagaag ctaataaact aatgaacaaa taaactaaaa aaaaaagttg 4560 agttgtataa aaagaaattt attgtaaaac aacgtgtttt ttttaaatct cttaaatata 4620 taccccacaa ctattcttct tctatctgat aaataattga aatatgttta cgttacatct 4680 tatacaattt ttgtaaaccc agggctttaa ccattatcat gtcacaatca gtgcgtaaaa 4740 agattaaaat atcgtttttg agttcaattt tgcggctaag cctttgcttt cccataaatg 4800 tttttatact tttaggtact taactagata gagatttata aaaacaaaaa aaaattatgg 4860 ggtaatcgcg gctttgaaag aaaacaaaga tcaaaaatga caaaaaaata tttttttgcg 4920 tattttgaag catgatattt agacaaagaa aaatatttcc tttactaatt ttattcattt 4980 ttggtctaaa ctagccaata aacaataatc aagtggaata ctcgttgtaa agtatttact 5040 ttgctttttt tttgagttca aaaatgcctt cgacaccatt tggtcaatat ttcgaaaaat 5100 ttagtcaaac ttggacatct aatatctaga gaacaaatta atattttttg aaaatttttt 5160 ggaattgagc tctagtatga atagcaataa catattaaaa aattagcgat tttgaaagag 5220 gtcactctaa aaatttagtg tttttgactg attcttacag aaagtcgctc 5270 // ID Gypsy-1-LTR_DP repbase; DNA; INV; 385 BP. XX AC . XX DT 17-MAR-2009 (Rel. 14.03, Created) DT 17-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1-LTR_DP. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-385 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Daphnia pulex."; RL Repbase Reports 9(3), 656-656 (2009). XX DR [1] (Consensus) XX SQ Sequence 385 BP; 60 A; 86 C; 97 G; 138 T; 4 other; tgtcacgtcc cgtcctgttc gccttacttt ttgkgcgcta gatggcctta tgtcgcttgg 60 cgtcctagtt aaaccagatt cgtctctcta ttctcttcag agttcgcctg ctgttgttcc 120 ggtccggtcg caagctagtt ygctctattc aattgtgtgt gttattcttt tctytgtgtg 180 tgtgtttcgc cttgctgtag tgtgtgtgta ttcatgttag tttagaccca ctaggggcac 240 catcggagta gttaagtmgt ttagtgtagt taaccgatgg tgtgattcac tgtcccgtac 300 tgtgccagtg tcgacggcca gcgtctcagt gatttgcatt gtagtgtcgc attcaacgag 360 tgtactgaac tgattgcccg tgaca 385 // ID BEL-212_AA-I repbase; DNA; INV; 6370 BP. XX AC AAGE02025927; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-212_AA_; KW BEL-212_AA-LTR; BEL-212_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6370 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025927; Positions 28137 34506. XX CC Positions [5422-5982] - Integrase core CC 'ACGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 194..3685 FT /product="BEL-212_AA-I_2p" FT /translation="MVQCDVCSKWHHFQCVGVTQQIEHFPWSCTKCESAKG FT IQEYSSSAALPQGNGGPSRQSSKSHLNRLSVPVQTQHPHQLQQQSQPTPPQ FT QQSDITASQLSLSLAANGSNQQPKGANQGPFVTSLQWVVSPSRDGSKAASV FT SSSHSSQAIARLKLQMLEEMRDVERREADRQCAIAAEEAKREKAFLEQKYQ FT LLEQAMSENGSSKNGSMSRTQNWITATNAHHDGLIASQAEEAIDPNMHVGK FT MQLWNSTFARIGSIRSSLPEPSPPEPFPAPESMLSCPINPPTLTAGMQQLA FT LGSTHQTPFSEQPRTFPTYELPRRVSISQLPHSGGQGVTAPTAFKNIEMDP FT SRRVSQNVSVPDYRRFVPPSNRPAVHHQSNPYSSTMRDYRDYNQEEETDAY FT PISRKQLAARQAISKDLPTFSGNPEEWPLFLSTFNSTTALCGFTNEENIVR FT LQRSLKGRAYEAVKSRLLHPSNVSGVISTLKMLFGQPEVIVHSMMSKINSL FT PTLREDKLELIVDFAVSVQNFCATVDACGLEEYFYNMTFLHQLVNKLPPSI FT KLNWAQYRISLPTVNLQSFSNWLYQLAEAASAVTIPNTTFESKPVRSDSRM FT GKKSNSFVNSHSEEAPSTAIDEANSCLVCKGSCRSIPKCERFLEFSRDSRW FT ATVRDLGLCRRCLHQHSGGCQTKPCGRSGCELRHHELLHNEQKRAPLSSNT FT SASDASKRPPHQSLESTPPATNSNEHGCHTHQVTSSHVLFRYLPVVLHGKN FT RSIGTFAFLDDGSELTLLDQDLADELQLKRETTPLCLLWTGGAKRREDDSK FT SVQLEIAAKHNGSKKYAMHGVRTVAELLLPPQTLNFEELFEQYPHLRGLPI FT TSYQDIRPKILIGMKDQYLSLVQKSREGAIHEPIAVKTRLGWTICGGGNLN FT NAANQVHSVFHICACDSSSVENLHRMMKEYFAVDSLEVVQPKKHLSAEEER FT AQTLLETRTALKGNRYETGLLWRYDNVRLPDSFPMALRRLQCLRKRMDKDP FT LLADQLNMKISDFVEKGYARKLTKAELSQRERTWYLPIFPVTNVNKPGKIR FT MVWDAAAEVHGMSLNTSLLKGPDQLCELFTILVQFREGRVALTGDVREMFL FT QVLMHPDDQQCQRFLWYDEDGTLSVYVLQVMTFGACCSPSSAQYVKNLNAE FT RFKGE" FT CDS 4801..6336 FT /product="BEL-212_AA-I_1p" FT /translation="MSHFIATDPIIRVNDFSSWGRLVRVVGFVHRFSNNWK FT LKHLKKAISTGPLTTQEISSAEVHLLRQTQLEGYPDEVALLQRSQHEPKLT FT NTPLPKTSPLYQKSPWLDRRGVLRMRGRIAHCDYASEDAKHPVILPRDHPT FT TKLIIAHFHQKFHHQNHETVINEIRQKYSIPKLRSAFANVRNNCQRCKNHR FT VVPKVPIMADLPTARLEAFASPFSHVGVDFFGPYEVVIGRRVEKRWGMLVT FT CLTTRAIHIEVAHSLSTDSCVMALRNLISRRGKPRAIYSDRGTNFIGANRE FT LNEAKFAIDQIEIMKEFVDADTTWQFLPPSSPHMGGSWERLIGSVKKNLMA FT ILPCRKLSDEVLRNLLTEVENTVNSRPLTHVPVDDDSAPALTPNHFLLGSS FT NGVKPLCTIDDRGETLRQCWRLSQVQANRFWKRWITDYLPEITRRTKWFAD FT TKPIEINDVVVILDPKLPRNCWPKGRVICIRPGRDSKLRSGTVRTATGIYE FT RPVAKLAVLDVRRVEE" XX SQ Sequence 6370 BP; 1795 A; 1633 C; 1508 G; 1434 T; 0 other; aattattatt ttcgtttata tacggaaacg cggattacat tcagtcatgt cgagtcgaac 60 cctacggtct ggcaaagtgg ttggtggcac taatagtgca aatggtggtg tgcaagtggt 120 ggtcacggca gtaccggagg agtcacatcc cggttgaact tgccaagtat gccgtggact 180 ggattcagtc gagatggtgc aatgtgacgt gtgttccaag tggcatcact tccaatgcgt 240 aggagtcacg cagcaaatag aacattttcc atggagctgt accaaatgcg aatctgcgaa 300 aggtatccag gaatacagtt cttctgcagc tcttcctcaa ggaaacggtg gtccttcaag 360 acagtcatct aaatcacact tgaatcggct ctcagtacca gtacagactc agcatccaca 420 ccaactccag cagcagtcac agccaacgcc tccgcagcag cagtctgata taacagctag 480 tcaactatct ctttcgctgg cagctaatgg gtcgaaccaa caacccaaag gagcaaatca 540 aggaccgttc gtcacctcgc ttcaatgggt cgtgtcgcct tctcgagacg gatctaaagc 600 tgcatcagtg tcatcgagtc actcttcgca agctatcgct aggctcaaac ttcaaatgct 660 agaagagatg agggacgtcg agcggcgtga agcagatcgg cagtgtgcaa tcgcagcgga 720 ggaagcaaaa agagagaagg cattcttgga gcagaagtat cagctgttgg agcaagccat 780 gagcgaaaat ggctcgtcaa agaacggttc catgtccagg acgcaaaact ggatcacagc 840 aacaaatgcg caccacgatg ggctgatagc atcccaagcc gaggaagcaa tcgatccaaa 900 tatgcacgtt ggcaagatgc agctttggaa ttcaacattt gcgcggattg gtagcattcg 960 gtcgtcactc ccagaaccaa gtcctcctga accattcccc gcccctgagt ccatgctgag 1020 ctgccctata aatccaccga cactgactgc tggaatgcag cagttagctc tcggctcaac 1080 acatcaaact ccgttttcgg aacagcctcg gacgttccca acatacgagc ttccacgacg 1140 agtttcgata tcgcagttac cgcattctgg tggccaagga gtaactgcac cgactgcttt 1200 caaaaacatc gagatggacc catcccgtcg ggtctcacaa aacgtgtcgg taccagacta 1260 tcgacggttc gtaccacctt cgaatcgacc agcggttcat caccaaagca atccctattc 1320 atcaaccatg cgagactatc gagactacaa ccaagaggaa gaaaccgatg catacccgat 1380 ttctcggaaa cagttagctg ctcgacaagc catttcgaag gaccttccaa cgttttctgg 1440 aaacccagaa gagtggcctc tgttcttgtc aaccttcaac agtaccaccg ctctatgtgg 1500 attcaccaac gaagaaaaca tcgtccgttt gcagcgaagc ttgaaaggtc gagcgtatga 1560 agcagtaaaa agccgactgc tacacccttc gaatgtcagc ggggtcatat cgacgctcaa 1620 gatgcttttt gggcaaccgg aagttatcgt ccactcgatg atgtcgaaga tcaactcact 1680 cccaacattg agagaggaca aattagaatt gatcgtggat tttgctgtaa gcgttcaaaa 1740 cttctgcgcg acagtggacg cgtgtggatt agaagaatac ttctataaca tgaccttcct 1800 tcaccagctc gtcaacaagc ttccaccatc gatcaagttg aattgggccc agtatcggat 1860 atcactaccg acggtcaatt tacagtcttt cagcaactgg ctctaccaac tagcagaagc 1920 tgcaagtgcc gtaactattc caaacaccac tttcgagtcc aaacccgtac gaagcgactc 1980 acgtatgggc aagaagagca actcgttcgt taactctcac tccgaagaag ccccttccac 2040 cgctatagat gaagctaata gctgtcttgt atgcaaaggt agctgtaggt ctattcccaa 2100 atgcgaacgt ttcctggagt tttctcgtga ttcccgctgg gcaactgttc gagatctagg 2160 cctctgccgc cggtgtctac atcaacacag tggcggatgt cagacgaaac cttgtggaag 2220 gagtggatgc gagctaaggc atcacgaatt gttgcacaac gagcagaaga gagcaccact 2280 gtcgtcgaat acttccgcaa gcgacgcaag caaacgacca ccacatcaaa gtctagaatc 2340 aacaccacca gcgacaaatt caaacgagca cggatgtcat acccaccagg ttacctctag 2400 tcatgtgctg tttcgctacc ttcccgtagt acttcatgga aagaatcgct cgattggaac 2460 atttgcattt ctggacgacg ggtcggaact taccctgctt gaccaagatc ttgcagatga 2520 gctgcaattg aagagagaga caacgcctct atgcctgctc tggactggag gagcgaagcg 2580 tcgagaggat gactcaaaga gtgttcagct tgagatcgcc gcgaaacaca acggatccaa 2640 gaagtacgcc atgcatggag tacgtacagt ggcagaactt cttctaccac cgcaaacgct 2700 caacttcgaa gagctgtttg aacagtatcc acacctcaga ggcctgccaa tcacttccta 2760 tcaagatatt cgaccgaaaa tcctcatcgg aatgaaggac cagtacctca gccttgttca 2820 aaaaagccgc gagggagcta ttcacgagcc gatcgccgta aaaactcggc taggatggac 2880 aatttgcggg ggaggcaatc ttaacaatgc tgccaaccag gttcattcag ttttccacat 2940 atgtgcctgc gactcgtctt cggtcgagaa cttacataga atgatgaagg agtatttcgc 3000 agtggatagc ctagaagttg ttcaaccgaa aaaacatctc tcggcagaag aagagcgagc 3060 gcagactcta ctcgagactc gcactgcgtt aaaaggaaac cgatatgaaa caggcttatt 3120 gtggcgatac gacaatgttc gactgccgga cagttttccg atggccctcc gacgactgca 3180 atgcctcaga aagcggatgg ataaagatcc tctgttagcc gatcaactca acatgaaaat 3240 cagcgatttc gtcgaaaagg gctatgccag aaagctcaca aaagcagagc tttcccaaag 3300 ggaacggacg tggtaccttc ctatatttcc cgtaacgaac gtcaataaac ctggaaagat 3360 tcgaatggtg tgggacgcgg cagcagaagt ccacggtatg tcactgaata catcgctgct 3420 caagggaccc gatcaactct gcgaactgtt cacgatactc gtccaattcc gggaaggacg 3480 tgttgctcta actggagacg tgcgtgaaat gttcctgcaa gtgcttatgc atccagatga 3540 ccagcagtgc cagcggtttc tctggtatga cgaagatgga actctctccg tctacgttct 3600 ccaagtgatg acgttcggag cgtgctgttc ccctagtagt gctcaatatg tgaagaactt 3660 gaacgccgaa cgtttcaagg gtgagtaacc agcggccgtt gaggtcatcc agaagcgtca 3720 ctacgtcgat gatatgctgg tgagtgtgac gacagaagag gaagcgatta aactcgcaca 3780 gcaagtaaag aagatacatg cagaaggagg ctttgaaatc cgtaattggc tcagcaactc 3840 caagcgtgtc gtcgatgccc tagaggagaa ccacacagaa gaaagaagcc tagatttgtc 3900 atcggagcta gccacggaga aggtactcgg tatgtggtgg tgtaccgact cggacatctt 3960 tacctacaga gtcgggtggg atcgttacgg tcgagcgtta ttagaaggta aacaccgtcc 4020 aacaaaacga cagatgctgc gggtactcat gtcagtattc gatccgcttg ggctaatagc 4080 ccaggttctt atgtacctga aaattttgct acaagatttc tggcgctctg gcatcgggtg 4140 ggatgatcaa gttgatgata ccatcttcga acattggcaa acatggttgc aagtgctccc 4200 acaaatcgag cagatcagca ttccacggtg ctacttcagt actcgaagct tcgataccga 4260 aagggtgcaa ctccatacct ttgtcgatgc cagtgaaaac ggttttgcag caacttgcta 4320 cctccgattc gaccatgacg atgacgtaga gtgcaatata gttgctgcca aaacccgagt 4380 tgcgccgttg aagttccttt ctatacctcg actggagctc caagctgcct tgatcggtgc 4440 ccgattggct cgaacactaa ccgaagcact cacgattcag atctcgcgtc aagttttctg 4500 gtccgactca caagatgtcc tgtgctggat gtgcaagatg gcgtaccgat aaattcggat 4560 tttgctgccg ttcaaacaaa atacgtttgc gcacttccac ttgattggac gccatttcag 4620 tgaatgttac gataatgtaa acatgttgta ccacgaaata aagtggagag ttcagctgtc 4680 atataagtga aacgtttcag caatccgtga aataattcaa aacctacact gaaaaaaaaa 4740 agtgtccgaa atttatcgtg gacaggcttt accaccgacc tcgaactccg tcccagtgca 4800 atgtcgcatt tcatcgcaac tgatcccata atacgggtca acgatttctc cagttggggt 4860 cgattagtta gagtcgttgg atttgtacac cgtttctcta acaactggaa actaaagcac 4920 ctgaagaaag cgatctcgac cggtcctcta actactcaag aaataagttc cgctgaagta 4980 catctacttc gtcagacgca actggagggt tatcccgatg aagttgctct tcttcaacgg 5040 tcacaacacg agccaaaatt aaccaacaca cctttgccga aaacaagccc gctgtaccaa 5100 aaatcacctt ggctagatcg gagaggagtt ttacggatga gaggccgtat cgcccactgc 5160 gattatgcaa gcgaagatgc caaacaccca gttatccttc cccgtgacca tcccaccact 5220 aagctgatca ttgcacattt ccaccaaaaa ttccaccacc agaaccatga gaccgtaatc 5280 aacgaaatcc gacaaaaata cagcatcccg aagcttcgtt cagcctttgc caatgtccga 5340 aacaactgcc aacgctgcaa aaaccatcgg gttgtcccaa aagtaccgat catggcagat 5400 ttaccgacag cccggctcga agccttcgca agtccatttt cccacgtcgg cgtcgatttc 5460 tttggaccat acgaagtcgt catcggccgc cgagtagaaa agcgttgggg tatgctagtc 5520 acatgtctca cgactcgggc aattcacata gaagtcgctc attctctcag cacggattcg 5580 tgcgtcatgg cgttgcggaa ccttatctca cggagaggaa aaccgagagc aatctacagt 5640 gaccgaggga cgaattttat tggagcgaat cgagagttga atgaagcgaa gttcgctatc 5700 gatcaaatag agatcatgaa ggagttcgtt gacgcagata ccacatggca gtttcttccc 5760 ccctcttcac cacacatggg tggcagttgg gaacggctaa tcggcagtgt aaaaaagaat 5820 ttaatggcaa tcctaccatg cagaaaactg tcagacgaag ttctgcgcaa tctcttaacc 5880 gaagtagaga atactgtcaa ctcgcgacca ttgacccacg ttcctgtgga cgacgactca 5940 gctccagctc taacacccaa ccatttcttg cttgggtcgt ctaatggagt taaaccactc 6000 tgcaccatag atgaccgtgg ggagacgctg cgccaatgct ggcgcttatc ccaagtacag 6060 gctaaccgat tctggaagcg gtggatcacc gattatctac cggaaataac ccgccgcacc 6120 aaatggttcg ccgataccaa accaatagag atcaacgacg tggtggtgat tttagaccca 6180 aaactacctc ggaactgttg gccaaagggc agagtcattt gtatccgccc cggacgagat 6240 agtaaactac ggtctggtac agtaagaacg gcaaccggaa tctacgaacg accggtagcc 6300 aaactggctg tactagatgt acggcgcgtt gaagagtaag ctggccaacc ggtcagcata 6360 cctgggggac 6370 // ID EnSpm-1_BF repbase; DNA; INV; 7504 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-1_BF autonomous DNA transposon - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-7504 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-7504 RA Kapitonov V. and Jurka J.; RT "EnSpm-1_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 788-788 (2008). XX DR [2] (Consensus) XX CC This autonomous transposon shares common termini with the CC non-autonomous EnSpm-N1_BF. It contains 28-bp TIRs (6 CC mismatches), is characterized by 3-bp TSDs and codes for the CC 737-aa En/Spm transposase. XX FH Key Location/Qualifiers FT CDS 3167..5377 FT /product="EnSpm-1_BF_1p" FT /note="transposase." FT /translation="MEEAFYTKANGRKVSRKCEFVPFYSHPHRRSELRRKC FT GTVLMRKVKSVKGDEYLYPKSTYCYKSVKESLQDLVSRPGFNDKCEHWRDR FT NVPDTEMMDVYDGQVWREFQYVNESPFLAEPNNLALMLNCDWYQPYKHSEY FT SVGVLYLVVLNLPREERFKEENMIVVGLIPGPKEPKIDINSFLEPLVEELQ FT DLWKGVLLSDNSILEAQLYRAALICLSSDIPAARKCGGFVGHGALRGCSKC FT LRTFTRSRFRTKADYSGFDRTLWPPRDSSEHVRFAKMARRAKTKADRKRVE FT GQYGARWSELFRLSYWNAVKYVVVDPMHNLLLGTSKHVFKVWMELGVLNKK FT AFDVIQSRIDSMKVPHDVGRIPRKIASGFSGFTADQWKNWTCIYSLFCLKG FT LIDSRQYDMWYEFVQACRLLCSRAISYDNLRKADHHLQRFLAMFEEFCGPQ FT YCTPNMHLHLHLKECIEDYGPVYAFWCFSFERYYGMLGKYQHNNRNIEVQI FT MRKFQQSKQLSMMSWPGGYGDAFKSVLTDGKLSSTSVGHALPLMADVVVLD FT NSSVVEPLSPFRKVVISEAVRSMLLTMYRAMYPGVAISRLDLFAMSCTRVR FT CNGVTLSVDSCRSERSACVHARWCCNNPGSDQPTIDSHAEERPAVLREVIL FT VHILLGNGRRCSHAVARVTWYEHHPERWFYGRGSPVQVWGLDFEPESPASF FT IPVERIAGRCVATTAAVQFDRMKERVSVIVPLCAVAD" XX SQ Sequence 7504 BP; 2055 A; 1525 C; 1691 G; 2233 T; 0 other; cactacatga aaagccttgt ttccaggtcg gaaattgccc tacacatccc cggaaatatt 60 gcatttcctt taaaaaacag gcgtttcagt cctgcccgcg ttctgtgtcc agtcctttac 120 atgccctaga aacacgcgga aacgaaccaa tattgtgtac aggacgtgta aagggcatga 180 aataccctta cagatgccct tgaaatgaat gtactcggca ggaaacacac cgtgtgtcgc 240 tccgtgtagc gctgaaaata tgcataattt agcgtaatta tcacctaatt atccaattat 300 ccccacgggt cctgccgtac gtagtgcgcg tgcgcggtac agcgctgact aaccgccccc 360 cacccctcgc ccccctcccc ccactgcatt ccatatccac ccccatacgc tcttccctgc 420 ctctggaatc ccagcttcta ctttataccc ttctgtatct tactaatgag gcatatatat 480 atatatatat atatatatat atatatatat atatatatat atatagaata agcagcacat 540 ttcattcatg aagagacgtg gtttacttct tacatttaac attttattcg taccgtatag 600 aatcgccatt tacaaatcca cgaaaattac agtaccactg aaacggaaaa tcaatactgg 660 ctgtaactat gtgctaaaga aaaaaacaga ttaccttaac ataacatatt tgcccttact 720 taacccaagg atgttgtaac gttcttgctt aaactcaata aagaggatag ctctaaagtt 780 aacatagaaa tcgtcaattg aatgtcgtag cgtacactcg ggaatggaat taccctggaa 840 taaactcaaa gtgtaaatag tcttccgtac aagcaaaacg taaatggcgc ctacatgttt 900 aactatcgtc ctaacagtaa cacatagcaa gaaaggacac tagaatacga tggatataca 960 aggcaaacga ttaacataca gactaacttt gcactgtaca agtaatgttt tctttactac 1020 tgtacaaatg ttgaaagata aatttcgaaa taagtagaag ttgttaagta attaaaagga 1080 tacaaagggt aagctaaaaa aacaagtcat gctggccacg ttaaactaca atggggtcaa 1140 cgaattgtca tccatgtccg aaagaaaatc ttcatcaaca aagctagcta cattgtatcc 1200 ttcaagagcc cgtgaaggag agtcttcgct ctggaaggct aagcttcttc ctgtcggtcg 1260 gcggggcggc ggggtgcggt tggaggttga ggaccggttg ttgttgttgg cggagatggt 1320 tgtcagacag ggccggactg ctgggcgggg agcgcttccg ggggtgaagt gcacagcggc 1380 ccggtcgggg tcgtagactc gagggaggtc cctgtcactc tgtgtgcctt ccaccctcag 1440 tctcggtttg cggttggcgg ccaggtactt ggggtcagcc tccagcctgg tctgcagtgc 1500 gctgcaaagg tcggaaagct gttggcttct gaagacaggg ggcttcacca gccacacggc 1560 catcccgttc gaactgctat cctcctcgtc cgacatgagt tcctgggtta ctcctgccca 1620 gatcccctcc tcttcctcag tctggacgac ttcgcttctc acgtccagca actagtgtaa 1680 aacatagaaa agtggaaata cgtcagcgca tccaaggaat ttagaaagca tatgtgacgt 1740 aaaaagaaat cattattagt ttaaactatc attacagcta aacgactaaa tcggactata 1800 caatattata ttttaataga ttcaatatgt tttgctgaaa gataaaaatt gacagggacg 1860 atttcgtcta gatcatgaag taaaggcggg tttatgaggt caacttgttc tcgtttgtgc 1920 atgatttggc attatttcgc atgcaagcac aaatttgaaa aactaatcaa acaaaacatg 1980 ttcctctctt ctaaaaacaa tgtcaaggtg tcaatggatt ggtcctttgt aggcagacaa 2040 acacgtttgt gtggtagact caccctcttt ctccgggacc gaaggcgctt ctctttcttt 2100 tctttttcct tcttttcttt gttttctggc aggctgagtg tgtagttccg acgaatggtc 2160 tcgtagtaac caacacacgc ggctggaaaa aagggtataa aatgtttagc ttgtataaac 2220 ccaactgtat taaatgtaat tctaagttac cataattgtc gattttcaaa catcacatat 2280 accaatagtt tgggaatcat cgtggaaaaa acataaaaat gccacacaga agttacatag 2340 agagtataac ctatgtgtag aacatttaaa tttgcaaaag ggacagcttg aacttgaagc 2400 acagttactc acctcggatc atgtctgacg gatgctcgcc gatggcagca atttctttca 2460 caagaaaggt cgtcactgct aaattgtggg gtgagttgag tctgaaaaaa gcgattaaaa 2520 ctgttacatt ttctgaagtt ttttttgtat cgatgatttt ttctgcgtta cataaacaga 2580 tacacaatca tcaattatta ttttcctttg tccaaagttg tctgtcgtac gtgcatgttg 2640 accatctagt ttggacaaga catatactag tatgcatata cactgtatga tataattgtt 2700 ttaacttacc cttccttccc ctcatacttc ttgtcatctt catttgagtt gtgcaaacgt 2760 ctcaccatct cctgaaaaga aaaataattg caattgtgtt agcatatggt caatggtatg 2820 tgtgtattgt gaacatcatt tgcattgaaa gtagaccttg ttacatgcgt ctgttttgtt 2880 tgccgggttg ggcaaatgtg tgaactatcc ctactaaaca aacaaagaga aaacagacgc 2940 ttaaatcttc gcaaaacaat acagtttgcg gtgtttcagc aagccttttt aatgtgaacg 3000 aaaaaaaatt ctgtaaagag gggttataaa cacacacttc acacacactc aagacattgt 3060 agatacggct aaggatagga ctgtacgtga gacgtaaacc atacgacttg accgaaatga 3120 tttcgttaag tttgtggtat gccccaagtg taagactagg tatactatgg aagaggcatt 3180 ctacaccaag gcgaatggtc gcaaggtgtc taggaagtgc gagtttgtcc cattctatag 3240 ccatcctcat aggcggtctg agttgcgtcg taaatgtggt actgttttga tgagaaaagt 3300 aaagtctgta aagggtgacg aatatctgta tccaaagagc acgtattgtt ataagagtgt 3360 aaaagaatcg cttcaagacc ttgtaagtag gcctggtttt aacgataagt gcgaacattg 3420 gagagataga aatgtccccg acactgaaat gatggatgtg tatgatgggc aggtgtggag 3480 agagtttcag tatgtaaatg aatcaccctt cctggcagaa cccaataact tagccctgat 3540 gcttaactgt gattggtacc agccctacaa gcatagtgag tattcggtag gggtcttgta 3600 tcttgtggtg ttgaacctgc cccgtgagga gagattcaaa gaggagaaca tgattgttgt 3660 tggcttgatc cctggtccaa aggagcctaa gatagacata aattccttct tggagcctct 3720 agtggaagaa ttacaggatc tgtggaaagg tgttctcttg tcagataact ccattctcga 3780 ggcccaactg tatcgagctg cccttatctg tctgtcctct gacatcccgg ctgcccgtaa 3840 gtgtggaggt ttcgttggtc atggggcttt gagagggtgt agtaaatgtc tgaggacctt 3900 tacgaggtca cggtttcgca cgaaggctga ttattcagga tttgacagga cattatggcc 3960 ccctcgcgat tcttctgaac acgtccggtt tgctaagatg gccagaagag ctaagacaaa 4020 ggcagatagg aaaagggttg agggtcagta tggagcaagg tggtcggaac ttttccgtct 4080 ttcttattgg aacgccgtaa aatacgtggt cgtggacccg atgcacaatc ttctgcttgg 4140 tacttccaaa catgtcttca aggtctggat ggaattgggt gtgctgaaca agaaagcttt 4200 cgatgtcatt caaagtagga tcgacagtat gaaagttcct catgacgtag gcagaattcc 4260 taggaaaata gcgtcgggat ttagcggctt cactgctgat cagtggaaga actggacttg 4320 tatctattct ttgttctgtt tgaaaggact gatagatagc cgtcagtatg atatgtggta 4380 tgagtttgta caggcttgtc gtttgttatg ttctcgggca atctcttacg ataacttaag 4440 gaaggcggac catcatctgc aaaggttttt agctatgttt gaggagttct gtgggccaca 4500 atactgcaca ccaaacatgc atctgcactt gcatttgaag gagtgtattg aagactacgg 4560 gcctgtgtac gctttttggt gtttttcatt cgaaaggtat tatggaatgt tgggcaagta 4620 tcagcataac aacaggaata tcgaggttca aatcatgagg aagtttcagc agtccaagca 4680 gcttagtatg atgtcctggc ccggcgggta tggagacgcc tttaagagcg ttcttacgga 4740 cgggaagcta tctagtacgt cagttggaca tgcccttccc ttgatggcag atgttgtggt 4800 gctagataac agttctgttg tggagccatt gtctcctttt cgtaaggtag ttatcagtga 4860 ggctgtccgc tctatgttgt taactatgta ccgagcgatg tacccagggg tagctatttc 4920 acggctagat ctgtttgcca tgtcctgtac cagagttaga tgtaatggcg tgaccttgtc 4980 tgtagactcg tgtaggtctg aaaggtctgc atgtgtacat gcaagatggt gttgcaataa 5040 ccctggatcg gaccaaccga ctattgactc acatgcagaa gaacgccctg ctgtcttgag 5100 ggaagttatc ctggtccaca tcctgttagg gaacgggcga aggtgttcgc acgcagtggc 5160 cagggtaact tggtatgaac accaccccga aaggtggttc tatggtcgcg gaagtccggt 5220 ccaggtgtgg ggtttagatt ttgagcctga gtcccccgct tccttcatac cagtggagcg 5280 tattgcgggt agatgtgttg ctaccacagc ggccgtacag tttgatcgca tgaaagagag 5340 agtttctgta atagtgccgc tgtgtgctgt agcggactga tccatcaacc ttttcgtgac 5400 atggcttcat tacgtcttta acttgcgttc atgatgaatg gaaagttatt ttgtttaaac 5460 tttcagtctt atggtcttga ttttcccatt ctaaaagctt accgactgaa tatctgtatt 5520 ctaaatgatt gtataggata gctatgtggt catagcatta attgtcttgt cggctattta 5580 gatgcgttgt ttacacttcc ccctttatat gcagattaga tagcgtgtgt ctgttaagtg 5640 agttagaaaa aaattctctt ggtttgaaat ttcaatgatg tctaagttct ttggtttctt 5700 atatctatgt atgtatagat atattagctt ccttcttatg tccttgtgtt caccaataat 5760 tccaggctcc cattatttca tgtccctatt tatcctttca cgatagacgc attgacgttt 5820 cattttgggt ttaacagaat ggatgtattg aatcatgctt gacctgtttt tgtagtatgt 5880 atgtatagat atattagctt ccctcttatg tccttgtgtt caccaataat tccaggctcc 5940 cattatttca tgtccctatt tatcctttta cgatagacgc attgacgttt cattttgggt 6000 ttaaccaagg acttggttta acagaacgga tgtattgaat catgcttgac ctgtttttgt 6060 agttaaactc acattccgta tatagtttca tcgtgtgttt gtgatcatgc acttattcca 6120 tgttcatctt actgctgaca attttccttt tatccgaggt gcggtcatct gcacaccggt 6180 gattagaagc ccgtcagctc tactatggac acaacgacat ggtttctgcg atgtcgacaa 6240 cattcaattg caagagaatt gatacctaat ggcggtcaac tccgacacct cacatccatc 6300 tacagaaaat cgatcagtta tacgaacatt tcttgacgtt attcattata cagcaaacta 6360 gtacgtgtaa tcttagagag gaatcacgac ttctacgcat gtaaaccgta tctgcagtta 6420 taacacatgg tagtactgta taagttgtac tttttaatta cttgttttgt aatcaatttc 6480 attaaagtct gtagtatgtc catttgtatg atttgtcttg ttttgttgag ccataaatgt 6540 gtatttgtaa ttgtaccata aaaaaagact cataagacac cgggtggaaa atttcgcact 6600 gttagtatta caacttcggt tatttgtcac aattctggta gtaatttagt gcagaagggc 6660 catatcttgt cgtagactcg ttctaccggt caaatagtca tcctatcctt agtatttgca 6720 cggccccgtg gaggtccaga ccggtagaga atagcacggg tctgatgggg gggggggggg 6780 ggtgattcat gatttgccgg gaaagggagt ttcggctgag gaagggtctt tccaaacttc 6840 cttttccgtc ataagccctc ccccataaga taaaaagcga accctgtaag taagatagaa 6900 gccaaccccg cttgtcacaa gtctgcgacg gaggccagta tttcccggct tgggctcgcg 6960 acgacagagg tctattgttc gttcaaacaa agggtattta gtagtatgtg tcaggtatgc 7020 aaatgtccca gcactcgaca ttcaagtcca tacaatgatg ttatgcttca ttctttctca 7080 caccaaggag tttttttcgt gataaaacgt cattgctcac acgaaggagg ctgaaatcga 7140 tttttgtaca aaaaagtttt tatgcaaaat gtaaaccgta ttgcttattt atttcccaga 7200 catacgttgg gttgcggggg tagtagtgac cggaaaacag taaaaaaaac ggtaactacg 7260 ttccccgcga gccaagttct gcttggagat taggcgcttt caggcttttg tctgcctatt 7320 tcccccacac gtctggtgta agggtcctag aaattccggt acaagcgcga gtccaggtat 7380 gtaagaaaat gtcaaggaca ttacacgcca aaccaggaaa gtcatgtaca cagccaggac 7440 acagaaaatt tcacaggtgt gtaagttcct tgaaacaccc ggaaataaga gctttcgtgt 7500 agtg 7504 // ID CR1-80_AAe repbase; DNA; INV; 4750 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-80_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4750 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1168-1168 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 288..1085 FT /product="CR1-80_AAe_1p" FT /translation="MAFCERMIHLRCSATKLNKPFVKIVHDSPNLFWMCDE FT CAKLMKIARFKSVVSSFGDAFKAIADRQEMVHAEIRKELAIQGQQIAQLSK FT RMAPTSPFLRESASSSRQPPSKRRRDDELVFNPAVTKPLLGGTKEMSNAGI FT VTVPEPVQLFWVYLSRIHPSVKPEAIEKLVKDCLQSEGTIKAVPLVKRGID FT STRLSFISYKIGIDPKLRDAALSPDTWPKGLLFREFEDNSSKNLWLPSLNT FT PTIMVSPEVGASQFSTPTTGMDLSS" FT CDS 989..4552 FT /product="CR1-80_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MVTILEYADHHGISRSRSLAVLHSHHRNGSFKLTSTR FT LQSSNVDQAAIVSSIPERTACRLMGALDPHDTVEQAVSCHRSRSGPVVVLG FT DRVSQPPLSGKYASIYSNSSPDQFPCSSTLSVHAETVDAENSIWKSSSECL FT PMLALDSTPPGHPERTLESPMEALNPSDAVLPSAGSVHQSRSGPVVRCGER FT VFQASNHGEYFPSVRPQSSPDIYMDSRNISDPVPKRFEDIVVYYQNVGGIN FT SCVEDYRLAVSGSSFDIIVLIETWLDSRTLSSQIFGPDYEVFRCDRSPDNS FT KKSTGGGVLVAVRRELKARAVENEAWKCLEQVWLCVDLRDRKLFLCALYVA FT PDLTRDNAVFETHSQSVFAVIDTASPVDEVLVLGDFNLPTVSWKLSHNGFF FT YPDPERSVIHGCVAGLLDSYSAAMISQINHVVNENGRSLDLCFVSTQDEAP FT FISVAPCPLVKDVPHHPPLIISVERTLLVNCVDRPATVSYDFQNADHHSIA FT EVLSSIDWDNVLDRNEVDTAALTFSHVLAYVIDRHVPKRVHQVSRLPWQTA FT ELRKMKSIKRAALRKYTKFRTPSLRCHYVRLNHEYQRLSRQCFLRYQREIE FT RKFKLKPKSFWKYVNEQRKTYGLPSSMELNGRTGSSTQEICQLFAEKFSMV FT FSSETLSSHEVSLAARNVPLNDQALGAIEIDDGMISEAAKQLKASNNPGPD FT GVPSVLLKKHIAGLLSPLLFLFRSSLSSGIFPSNWKIAHMFPVHKKGSKRN FT VDNYRGITSLCAVSKLFELVIMKPLLSHCKQHLCPDQHGFTAGRSTATNLL FT CLTSFITDSMVERSQTDVIYTDLSAAFDKLNHTIAIAKLERLGVCGSLSSW FT FKTYLTGRLLKVAIGDCRSGSFYATSGIPQGSHLGPLIFLIYFNDVHSAIR FT SPRLSYADDLKMFLKIHSIADCGFLQQQLDHFADWCFLNRMVVNPKKCSII FT TFTRKRQPIIFDYCLLGTTIDRVNQVKDLGVILDSKLTFKPHVSYIVDKAS FT RTLGFIFRIAKNFTDVYCLKSLYCSLVRSTLEYCSAVWSPAYDNGSERIES FT VQHRFLRFALRKLPWRNPFQLPSYESRCQLIDLELLRTRRDVCRALTVADT FT LQGRIDCAFILEKNDLHAQSRSLRNNVMLRLPLRRTNYGMHGAIIGIQRVF FT NRIALLFDFHLTRESLRRRFSEFFAGRSG" XX SQ Sequence 4750 BP; 1237 A; 1161 C; 1039 G; 1311 T; 2 other; gttgtttctc gtgtctcccg ttgaataaac cgatcacgct ccgstgtata gtggacatta 60 acgaatagca agcacaatat tgtgaaaaca tctagaaaat ccgatcatac tgagcgcggt 120 gaactattgt ttttgtttgt ttttcttcac tgttccctct gtgcactttc cgggctaccg 180 tacaaaaatt aagggcatcc ccatcgcggt tagagcaatc cccatcgtcc gacgcttgtg 240 atcactgcgc gaagccggta aaaagtgatg acgaacacat cacttgcatg gctttttgcg 300 aacggatgat tcaccttaga tgctcggcta caaagctaaa taaaccattt gtgaaaatcg 360 ttcacgacag cccaaatttg ttttggatgt gtgacgagtg tgcgaagctt atgaaaatcg 420 cacgattcaa atcggtcgtt tcatcgttcg gcgatgcctt taaggctatt gccgatagac 480 aggagatggt acatgccgag attaggaagg aactggcgat acaaggacaa caaatcgctc 540 aattgtctaa acgaatggca ccaacttccc cttttctgcg agaatctgct tcatcttctc 600 gacaaccacc gtcgaaaagg cgccgtgacg atgagttagt tttcaatccg gctgtcacca 660 aaccacttct tgggggcact aaggaaatgt caaacgccgg catagttacc gtccctgaac 720 ctgttcagct gttttgggtg tatctttccc gtattcatcc gagtgtcaaa cccgaagcca 780 tcgaaaaatt agtgaaagac tgtttgcaga gcgagggcac cattaaggct gttccacttg 840 taaaacgcgg gattgattca actcgtttga gcttcatatc ctacaagatt ggaattgatc 900 caaagctccg cgatgccgct ttgagtccag atacatggcc aaaaggatta ttgtttcgcg 960 agttcgaaga taatagctca aaaaacttat ggttaccatc cttgaatacg ccgaccatca 1020 tggtatctcc cgaagtcgga gcctcgcagt tctccactcc caccaccgga atggatcttt 1080 caagttaact tctaccagat tacaaagcag caacgtagat caagcagcca tcgtttccag 1140 tataccagaa cgcaccgcct gtcgccttat gggagccctc gatcctcacg acacagtcga 1200 gcaagctgtt tcctgccatc ggagtcgttc tggtcctgtt gtcgtgcttg gtgacagggt 1260 ctcccaacct cctttatcag gcaagtacgc ttctatatat agcaattctt cgcctgatca 1320 atttccgtgt tccagtactc tatcagttca cgccgagacc gtcgacgccg agaattcgat 1380 ttggaaatct tcatccgaat gtctaccaat gctggccctg gactcaaccc ctcctggaca 1440 cccggaacgc actctagaaa gcccaatgga agcccttaat ccctctgacg cagtcctgcc 1500 ttctgctggc tccgttcatc aaagtcgttc cggtcctgta gtcagatgtg gagagagggt 1560 cttccaagca tccaatcacg gcgagtattt tccttctgta cgccctcagt cgtcgcctga 1620 tatttacatg gattccagga acatcagcga tcccgtacca aaacgattcg aggacatcgt 1680 agtttactac caaaatgttg gtggaattaa ctcatgtgtg gaggactacc gcttggctgt 1740 ttcgggttcc agttttgata tcatcgttct aatcgagacg tggcttgatt ctcgaacttt 1800 gtctagccaa atttttggac cagattacga agttttccgt tgtgatcgga gtccagataa 1860 cagcaagaaa tctactggtg gtggcgtact cgttgccgtt cgacgagaac tgaaagcgag 1920 ggccgtcgaa aacgaagcct ggaaatgcct agaacaagtt tggttatgtg ttgatctccg 1980 cgatcgaaaa ctgtttctat gcgcgcttta tgttgcaccc gatctgaccc gagacaatgc 2040 tgtcttcgaa actcactctc aatcagtttt tgctgttatc gatactgctt cacctgtcga 2100 tgaggtcctc gttttaggtg acttcaacct accaaccgtc tcgtggaaac tatcacacaa 2160 tggtttcttc tatccggacc ccgaacgctc cgtaatccat ggttgtgtag ccggtctctt 2220 ggacagctat agtgccgcta tgatctccca gataaatcac gtagtcaatg agaacggacg 2280 cagcttggat ctttgcttcg tcagtaccca ggatgaagca ccctttattt cggtggcccc 2340 atgcccacta gttaaggatg tccctcatca ccctccattg attatttccg tcgaaagaac 2400 cctcctcgtc aattgcgtcg atcgtccagc aactgtatct tatgatttcc agaacgccga 2460 ccaccatagt attgcggaag tgttgagtag cattgactgg gacaatgttt tggatcgtaa 2520 cgaagttgat actgctgctt taaccttttc acatgtcttg gcatacgtca ttgacaggca 2580 cgtcccaaaa agagtgcatc aagtttcgcg acttccttgg cagactgcag aactgcgaaa 2640 aatgaagtca attaaaagag cagctctccg caagtatacc aaatttcgaa caccctctct 2700 aagatgccat tacgtgaggc tgaaccacga atatcaacgc ttaagtcgtc aatgtttctt 2760 gcgataccaa cgtgaaatcg aacgtaaatt caaattgaag ccaaaatctt tctggaaata 2820 cgttaacgag cagcggaaga cgtacgggct gccatcttca atggaattga atgggagaac 2880 gggttcatct actcaggaaa tttgccagct attcgcagag aagttctcta tggtttttag 2940 cagcgagact ttaagcagcc atgaagtttc acttgcagcc agaaatgttc ctttgaacga 3000 ccaagcatta ggtgcaatcg aaatagacga tggaatgatc tcagaagcag caaagcaact 3060 gaaagcgtcc aacaatcctg gacctgatgg agtaccgtcc gtactcctaa agaaacacat 3120 cgccggcttg ctgagccccc ttcttttttt atttcgttcg tcactttcga gcggcatatt 3180 tccttctaac tggaagattg cacacatgtt tccggtgcac aaaaaaggaa gcaagcgcaa 3240 cgttgacaat tacagaggga ttacgtcgct atgtgccgtg tcgaagcttt tcgaactcgt 3300 tatcatgaaa ccacttctgt cgcactgcaa gcaacatctt tgtcctgatc aacacgggtt 3360 tactgccggt cgttccaccg ccaccaattt gctctgtttg acatcgttta taaccgacag 3420 catggtagaa cgatcacaaa cggatgttat ttacaccgat ctgtccgccg catttgacaa 3480 gctgaaccat actattgcga tcgctaagct tgaaaggctc ggcgtctgtg gcagtctctc 3540 tagttggttc aaaacttatt taaccggtcg cctgttgaag gtggcgatag gagactgcag 3600 atctggcagc ttttatgcta cgtctggcat accccaaggt agtcacctgg ggccattgat 3660 cttcttaatc tattttaatg acgttcattc ggccatcaga agtcctcggc tatcatatgc 3720 tgatgatctc aaaatgtttc ttaagatcca ctccattgcc gactgtgggt ttttgcaaca 3780 acagcttgat cactttgccg attggtgttt cctgaaccgt atggtagtta acccaaagaa 3840 gtgttccatt ataacgttta caaggaaaag acagccgata atatttgact actgcctgct 3900 gggaactaca atcgatcgtg tgaaccaagt aaaggatctc ggggtgattc tggattcaaa 3960 gctgacattc aagccgcatg tttcgtacat tgtcgacaag gcatccagaa cattgggatt 4020 catcttcagg attgcaaaga acttcacaga cgtctactgc ttaaagtccc tatattgctc 4080 gctcgtgcgc tccactttgg aatactgttc cgcggtctgg agccccgctt acgacaacgg 4140 ctccgaaagg attgagtccg tacagcaccg ctttcttcgg tttgcacttc ggaaactgcc 4200 atggagaaac ccgttccaac tgccaagcta cgagagtcga tgccagctga ttgatctaga 4260 gctacttcgc accagaaggg acgtctgcag agctcttacg gttgcagaca cgctacaagg 4320 acggatagat tgtgccttca tcctggaaaa aaatgattta cacgcccagt cacgatcact 4380 ccggaacaat gttatgctac gcttgccact acgaagaact aactacggaa tgcatggggc 4440 gatcatcggt atccagcgcg tcttcaacag gattgcccta ctgtttgact tccacctcac 4500 ccgtgaatcg cttcgtcgaa gattttcgga gttttttgct ggtagaagcg gataacgaca 4560 acgattatat gttaaatgtt tttttttaat tgcctgacaa mgtttatgat tatgtttgag 4620 actgtgacca agattatgtt cgtgatttag tttttttttt aatttgtatt gctgtcctta 4680 gtttaaacaa tatcattggg gctacgaagc ctgttgataa atttaataaa taaataaata 4740 aataaataaa 4750 // ID BEL-41_CQ-I repbase; DNA; INV; 1907 BP. XX AC AAWU01003534; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-41_CQ_; KW BEL-41_CQ-LTR; BEL-41_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1907 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 235-235 (2011). XX DR Genome; AAWU01003534; Positions 70049 71955. XX CC 'GGCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 26..1906 FT /product="BEL-41_CQ-I_1p" FT /translation="MPIANSRELPCRRCGTPDVYTMWSYCRCRSGLPDANA FT TRNASLPSGSTKISHQYRQECREVYAARLQQLRQDQLAAQEALATKQREWL FT EIQARIKRDYQIAKAELQQKVEECENSFRLREEQIMSSGRFEVRGTITQQS FT AENKKEDEQHRVLFCESSNSTVEATKTTVQSGQDNPIGSTSGVGAGVDSNK FT PSAGKSTSVRTFAGVSRNDADWMSSLGETTCRGWEAAKTAVSSLPFTLTPT FT VQTCVNDNNTHKSISSSCIFVNEPSLSRNKRSASKTNVVESCLQDVAAQLE FT VEFGGLVSSKKLIQDQLMVNRESGQQKLSYQQEQRSIPERAFQDRDTQRLP FT ERRNQQVPPLQDTGSPLVFFSMTQRGRLNPKGKSHDEPMFICTSITIGRWN FT EYSGAENFEKERKLVGQPGQNERQMSSSMQQKGMCGVSATWHEQLDKLFVL FT KIVLPNSETWDKSMFIHGTKWFSGVTAGRMMFPLQLQEGDRNNPGYTKAIN FT QQYSRKSQSVVEAVVVPEVQHEIVRLWFNKNVDRNQVGVPMTAVSHSTAKI FT GDGELGRSGSYRSRTLLGQFPRQKKIQWFQIDPGGKSSSDAGSSSTRAMQL FT VGQLSVTSAASTRGGNSGPERWTTLTVGE" XX SQ Sequence 1907 BP; 552 A; 422 C; 543 G; 390 T; 0 other; ctaaaagaaa atagttccga tcaagatgcc gatcgcgaac agccgggaac ttccctgccg 60 gagatgcggt acaccggatg tgtacacgat gtggagctac tgtcgatgcc ggtcggggtt 120 gcccgacgca aacgctacca ggaacgccag tttgccaagc ggttctacca aaattagcca 180 ccagtatcgc caagaatgca gggaagttta cgcggcacga cttcaacaat tacgccagga 240 tcaacttgct gcccaggagg cgttggccac gaagcagcga gaatggcttg agatacaggc 300 gcggattaaa cgggactacc aaatcgccaa agcggagctg cagcagaagg tagaggagtg 360 tgaaaactcc tttcgattac gggaagagca gattatgagt tcgggacgtt ttgaggttag 420 aggcacgatt acacagcaga gtgcggagaa taagaaggag gatgagcagc atcgggtact 480 attttgtgag tcttccaact cgacggtgga ggcaacgaag acaacagtac aatcgggtca 540 ggacaaccca atcggttcga catctggagt aggtgccggg gtggatagca acaagccatc 600 agccggcaaa tccacttcag taaggacgtt tgctggtgtg tcccgaaatg acgcggattg 660 gatgtcatcg ttgggagaaa ctacatgtcg aggctgggaa gcagcaaaaa ctgccgtcag 720 ctcactaccg ttcacactca cacctactgt acaaacgtgt gtaaacgaca acaacactca 780 caaatctatc agtagttctt gtatttttgt aaacgagccc agtttgtcga gaaacaaaag 840 atctgcgagc aaaacaaacg tcgttgagag ttgtttacaa gatgtggcag cacagctgga 900 ggtggaattc ggcgggttgg tgtcgtcgaa gaaactcatt caggatcagc tgatggtgaa 960 tcgggaatct ggacagcaga aactcagtta tcaacaggag cagcgatcga ttccagaacg 1020 agcgtttcag gaccgggaca cgcagcgttt gccggaacga cgaaaccaac aagtaccacc 1080 tctgcaagat actggttcgc cgcttgtctt tttttcgatg acgcagcgag ggcgactaaa 1140 cccgaagggc aagtctcacg atgagccaat gttcatctgc acgtctatca ccatcggtcg 1200 atggaatgaa tattccggtg cagagaattt tgagaaagag agaaagctgg ttggacaacc 1260 agggcagaat gaacgccaaa tgagttcatc aatgcagcag aaggggatgt gcggagtctc 1320 ggcgacatgg catgaacaac ttgacaaact gttcgtactg aaaatcgtgt tgccgaattc 1380 cgaaacatgg gataaaagta tgtttatcca tggtacaaag tggttcagcg gtgtgacagc 1440 tggcagaatg atgttcccac tgcagctgca agaaggtgat cgcaacaacc caggatacac 1500 gaaagccatc aatcaacagt actcacggaa gtcgcagtcg gtcgtggaag cggttgtggt 1560 gccagaggta cagcatgaga tagtgcgttt gtggttcaac aagaatgttg acaggaatca 1620 agtgggagta ccgatgactg ctgtttcaca ctcgactgca aagatcggag atggtgagct 1680 gggacggtct ggatcgtacc gtagcagaac tttgctgggt cagtttccta gacagaaaaa 1740 gatccagtgg ttccaaatcg atcccggtgg gaagtcatcg agtgatgcag gttcttcgag 1800 taccagagca atgcaactcg taggacaact atcagtaaca tcggctgcta gcacgagagg 1860 aggtaacagt ggaccggaac gctggaccac tcttacggtc ggggaaa 1907 // ID Zator-2_HM repbase; DNA; INV; 3481 BP. XX AC . XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Hydra magnipapillata. XX KW Zator; DNA transposon; Transposable Element; Zator-2_HM. XX NM Zator-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3481 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 656..3154 FT /product="Zator-2_HM_1p" FT /translation="MSANVFQVIHNIKMDKGQLYASLYDAYVKAYSQKRKQ FT DCQKEVNDIWSKVKSEDNLSAKVDSLLREYDALFKKKKGTLMSFWANQSLR FT HDDAKTIPRKSSAPDVLAKLPDTVVGGNKCKASSCECSMITSSTSGSENVG FT KYKTKVQDELKVKIDVIKSDLVGLHKRKDSGIITNDQEAELIEKRRKVDML FT EKSLLKKKKEQARQKKTREKKRETLAKICAENPGVCDALKAKDKTGRPSVE FT INQPLFLKAIVDIALHGSAAHERRQSDMYRSIKTLDELTKQLNKDGYQVSR FT STVYTRLMPRRSSTLEGKRHISTVPVKLIRSQNDSHSKHIDGKFCTATLRH FT LEELSSILGPKEVCFISQDDKARVPIGLTAANKQAPLLMHVEYRINLPDHD FT WVVAARHKLIPSVYAGICIKKDGLGKPDAVTYSGPTYIAIRSGKHSSSSAL FT AHGLDFERLLQLDEFDAITKDGRDKIIKPIVVFTVDGGPDENPRYQKVIDV FT AIHHFVKNNLDALFIATNAPGRSAFNRVERRMAPLSRELSGVILPHERYGS FT HLDSQGRTIDDYLERRNFKYAGQTLAEIWSQVIIDQFPTVAEFIDPETSEL FT DAEELIIKDIKWFSEHVRTSQYFTQIVKCQDVKCCFKIRSSYFNVVPSRFL FT CPPIPVIQTSNGLQADKRGDSESHKFPSLFVSHSIKIDDIMARSAGSFQVH FT PYDLFCPSVQTCLLNRICKVCHLYFASQAILKKHVELHKNCDKGNTVLEKT FT QTKRIRPIRVAARRQRELMVVIAREKNEDIEWLDEDEVDVSGIVIPDEDLE FT KTSEMPVLTIDEHFKSPWMDNNKWVDKVKV*" XX SQ Sequence 3481 BP; 1241 A; 506 C; 683 G; 1051 T; 0 other; ggggtcatcc ataaagtacg tcacgctact aggggggggg gggaggcata agtttgtgac 60 aatttgtgac aaggggggag ggtttgacga atgtgacatc acaaaaatta tttatcatga 120 ttattttcta cttatttact cattcaatta aaggagtatt tccgattgcc taaaatttta 180 cttacacttc ttatgttttg cgtatcgccg tattttaaat agttttgaat agtttaaaaa 240 gttcgcaaaa actgccttca cgtttctgaa gacattaaag cgcagcaatg caaaatcatt 300 aaattaagaa cgtataattt aagtttggaa ttttaaatag agagtttaaa taagtaaata 360 aaacattttt aaaatattat cttttatatt atttattttt ttgaatctta tgacattata 420 taaacattag agttttagtt acatttacat ttaagcaaga atatatgact acttaaatat 480 ataatatata ataactaagt taaatagact tgctcatgct ttaaatagcc aataggaaac 540 aaagattatt atcaaataat agtgtaaata ggatagtagc tagttagcgc tcagtataac 600 taagttagta gtaattgata ttatggcata aatacaatat ttttaatttc tctaaatgag 660 tgcaaatgtg ttccaggtaa ttcataatat aaagatggac aaaggacagt tatacgcgtc 720 actttatgat gcctatgtta aggcatattc acagaagagg aaacaagatt gtcaaaagga 780 agttaatgat atttggagta aggtgaaatc tgaagataat ctttctgcaa aagtagactc 840 cctgttaaga gaatatgatg ctctgttcaa aaagaaaaaa ggtacactga tgagtttctg 900 ggcaaatcaa tcattacgac atgatgatgc gaaaactata cctcgtaaat catcggctcc 960 ggatgttcta gcaaaattac cagatactgt tgtagggggt aataaatgta aagcttcttc 1020 ttgtgaatgt tctatgataa cttcatcaac gtctggttct gaaaatgtag gaaaatataa 1080 aacaaaagtt caagatgaac taaaagttaa aattgatgtt atcaagagtg atttagttgg 1140 actacataaa agaaaagaca gtggtattat tacgaatgat caagaagcag agcttattga 1200 gaaaagaaga aaagtagata tgttggagaa aagtttgtta aaaaaaaaga aagaacaagc 1260 aagacagaaa aaaactagag aaaaaaaaag agaaactcta gctaaaattt gtgcagagaa 1320 tcctggagtt tgtgatgctt tgaaagcaaa agataaaaca ggtcgacctt ctgttgaaat 1380 taatcaacca cttttcctga aagccattgt tgatattgct ttacatggat cagctgcaca 1440 tgaaagacga caaagtgata tgtatcgcag tattaaaaca ctagatgaat tgactaagca 1500 actaaataaa gatggttatc aagtgagtag aagcaccgtt tacactcgac taatgccacg 1560 acgaagttca actttggaag gaaaaagaca catatcaaca gtaccagtca aattaattcg 1620 atctcagaat gattcacact ccaagcacat cgatggaaaa ttctgtacag caaccttgag 1680 acatctggaa gagttgtcgt ctatactggg accaaaagaa gtttgtttta ttagccaaga 1740 tgacaaagcg cgagtcccca ttgggttaac agctgcaaac aaacaagctc cattgcttat 1800 gcatgttgaa tatcgcatca atttgccaga ccacgattgg gttgttgcag caagacataa 1860 gcttattcca tcagtatatg ctggaatttg catcaagaaa gatggacttg gtaaaccaga 1920 cgctgttact tattcaggtc caacttatat tgcaatccga tcaggcaaac attcgtcttc 1980 ttctgctctt gctcatggtt tagactttga gagattgtta caattagacg aatttgatgc 2040 aataacaaaa gacggcagag ataagataat taagcctatt gttgtcttca cggtggatgg 2100 aggaccagat gaaaatccac gttatcaaaa agtcatagat gttgctatcc accattttgt 2160 taaaaataat ctagatgcct tatttattgc aaccaatgcg ccaggtagaa gtgctttcaa 2220 tagagtggaa cgtcgaatgg ctccattgag cagagaactt tctggtgtta tattgcctca 2280 tgaacgctat ggtagccatc tcgattcaca gggcagaaca atagatgatt atttggagag 2340 aagaaatttc aagtatgctg gacaaacatt ggctgaaata tggtcacaag ttattattga 2400 ccagtttcca acagttgccg agtttattga tccagagaca tctgagctag atgcagaaga 2460 attgataatt aaagatataa aatggttttc tgaacacgtc cgcacaagtc aatactttac 2520 tcaaattgtt aaatgtcaag atgtaaaatg ttgctttaaa ataagaagct catatttcaa 2580 tgttgttcct tcgcgatttt tatgtcctcc aattccagtt attcaaacaa gtaatggtct 2640 tcaagctgac aaaagaggag acagtgagtc tcataagttt ccatcacttt ttgtttcaca 2700 tagtattaaa atagatgata taatggccag atcggcagga agtttccaag ttcatcctta 2760 tgatctcttt tgtccatcag tacagacttg tctattgaat cgaatatgca aagtttgcca 2820 tctctatttt gcttctcaag ctatcttgaa aaaacatgtt gaattgcaca aaaattgcga 2880 taaaggaaat actgttcttg agaaaactca aacaaagcgt atcagaccaa taagagtagc 2940 cgctagaaga caaagagagc tgatggtagt tattgctcga gaaaaaaatg aagatataga 3000 gtggctggat gaagatgaag ttgatgtttc gggaattgtc attccagatg aagatttaga 3060 aaaaacatca gaaatgcctg tcctaactat agacgaacac ttcaaatctc cgtggatgga 3120 taataataaa tgggtagata aagtaaaagt ttaatttgta atttgaaaaa cataaggttc 3180 aaataaaaaa actattttta ttttatttta tgatgacaat ctttcccaat aaattaagaa 3240 ggtatttcaa attagttaaa ttgagtcttt tttattctgt taaatagcta ttactatgtt 3300 tttagaatgt tgaaaattcc gtaatggttt gttttttttt ttagagggag ggggggggga 3360 gaaacatgtg acatactttt ttaggggggg gggtgcagaa atatgtgaca gtttgtgaca 3420 ggaggggggg agggggttga aaattgaaaa aaatagcgtg acatacttta tggacgaccc 3480 c 3481 // ID DNAX-7_AP repbase; DNA; INV; 311 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-7_AP. XX NM DNAX-7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-311 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2061-2061 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD TATA or TA. Note imperfect TSDs. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 311 BP; 51 A; 110 C; 102 G; 48 T; 0 other; ctcgggccga cgagagaacg ctattgggcg gcgaccaaaa ttcgtccgat ctcgcgcatt 60 cccaccacta aaccttcccg aactgctggc cgccgtcaac tctggccgcc gccgccgtcg 120 ccgccaccgt cgcggccgcc gccaccgtcg ccgccaccgt cgcggccgcc gccgccgccg 180 ccggtcgcgc gtagcgagtg ggggaattta cggcggcgac tagatggtag agtgggagaa 240 atttgggtac gaaaccgcgg agctcggtca tttggatgcg cgaacgagta gcgttctctc 300 gccggcgcga g 311 // ID Copia12-NVi_I repbase; DNA; INV; 8827 BP. XX AC AAZX01023343; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia12-NV; KW Copia12-NVi_LTR; internal portion; Copia12-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8827 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1162-1162 (2007). XX DR Genome; AAZX01023343; Positions 23037 14211. XX CC Positions [4884-5417] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 260..3250 FT /product="Copia12-NV_I_1p" FT /translation="MLKATKIGTVKTYFKNYYYQKYVDIKNVYYVKGIKQN FT LMSLAKITENYTVLATKDIAKIFNKSRELIAVANKKENLFYMKSFVSKRES FT NEMYVNSLKLTDKEKWHRALGHVNFQYLNKLVTNKLVEGLPEKIENVEMKC FT ANCIESKMANVPFENSRSKTTEILELIHTDLNGPHNTTGYGGEKYFLTFVD FT DYSKCTRIFCIKSKAETASCLKEFVNLVENKFNKKVKKLRCDNGKEYLNKE FT IYDFVKSKGIELLPCPPYVHELNRVAERYNRSAMDMGRCLMREAKIHRRYW FT PEIIKTVAYLKNRTIANTAENKTPFEIFFGKKPNVDNLKIYGSRVFVRVPE FT TLRKSKWDDKAQLGVLVGYVENGYRVLVNGRIIHARHVQVVEEKTQLICLE FT KLDDEKDRDLENSESENKGNEILENESKIDKYDVNSNRKNPLISTQTFGES FT RDNELNSDIAENENSNLKVQRKSNRKKSPVNRYGNPVTHFIYVNYIDANIP FT NTFEEALNSNEYKQWKIAMDSEINSLNKNNTWQIVERPKDKRVIDVKWVYK FT RKNNNVYKARLVVRGFQQREHIENVYSPVGKKQTLKILLSYSCKNNLFIEQ FT MDVETAFLNGNVTTEVYINEPKGYETGDNKVCKLQKALYGLRESPRAWYNC FT FNEYIEKLNFMRSNYDYCLYVNNTSKDPIYILVFVDDLLICCKDKDKINKV FT KASLMQRFAMKDMGKISQYIGIDIDYSNDRSKMTLSQTKYIESLAIKYNLE FT NAKLYNTPMESNLKLEQASEIDENIKYRNLIGELLYISTGTRPDISYSVNY FT LSRYQNCYNQTHYKYAMRILKYLYKTKDLKLTYCDNVNTEILDCMVDSDCA FT GDNVDRKSTTGFVIRLFGNLIFWKTHKQSTVTKCSTFAEYTAMSEAVTEVL FT FVRNLLCESFDVNFENPINIYEDNSGAIAIPKYGNFTKNSKHIEVQYHYIN FT ENYENGIIDIVKIDSNSNLADMLTKSLDKTKFIKNRQALNLA" FT CDS 4161..7085 FT /product="Copia12-NV_I_2p" FT /translation="MFSWLKHRVKQENELDATDIELTKKIRNKIRKIFLKE FT IKKIVVLKNQVYENIRVKEEELKMTDSGCTDHIITNDKFFEKCVDLKNPVD FT VKLPDGKMLKATKIGTVKTYFKNYYNQKYVDIKNVYYVKGIKQNLMSLAKI FT TENYTVLATKDTAKLFNKSRELIAVANKKENLFYMKSFVSKRESNEMCVNS FT VKLTDKEKWHRALGHVNFQYLNKLVTNKLVEGLPEKIENVEMKWANCIESK FT MANVPFENSRSKTTEILELIHTDLNGPHNTTGYGGEKYFLTFVDDYSKCTR FT IFCIKSKAETASCLKEFVNLVENKFNKKVKKLLCDNGKEYLNKEIYDFVKS FT KGIELLPCPPYVHELNGVAERYNRSAMDMGRCLMREAKIHRRYWPEIIKTV FT AYLKNRTIANTAENKTPFEIFFGKKPNVDNLKIYGSRVFVRVPETLRKSKW FT DDKAQLGVLVGYVENGYRVLVNGRIIHARHVQVVEEKTQLICLEKLDDEKD FT RDLENSESENKGNEILENESKIDKYDVNSNRKNPLISTQTFGESRDNELNS FT DIAENENSNLKVQRKSNRKKSPVNRYGNPVTHFIYVNYIDANIPNTFEEAL FT NSNEYKQWKIAMDSEINSLNKNNTWQIVERPKDKRVIDVKWVYKKKNNNVY FT KARLVVRGFQQREYIENVYSPVGKMQTLKILLSYSCKNNLFIEQMDVETAF FT LNGNVTTEVYINEPKGYETGDNKVCKLQKVLYGLRESPRAWYNCFNEYIEK FT LNFMRSNYDYCLYVNNTSKDPIYILVFVDDLLICCKDKDKINKVKASLMQR FT FAMKDMGKISQYIGIDIDYSNNRSKMTLSQTKYIESLAIKYNLENAKLYNT FT PMESNLKLEQASEIDENIKYRNLIGELLYISTGTRPDISYSVNYLSRYQNC FT YNQTHYKYAMRILKYLYKTKDLKLTYCDNVNTEILDCMVDSDYAGDNVDRK FT STTGFVIRLFGNLIFWKTHK" XX SQ Sequence 8827 BP; 3613 A; 1112 C; 1587 G; 2515 T; 0 other; acagggcaac tattcatctg aatcatggat aacgcaggta tgccacgact cgaacaattc 60 ttatacctac cctcgggaaa agtataagaa ttataaaaca agtgaagtaa atcatgtaac 120 aggttatagc gaaaattgta gcgaaatcga ttggttatta gatagtggtt gcacagatca 180 tattataaca aatgataaat tctttgaaaa atgtgtagat ttgaagaatc ctgttgatgt 240 aaaattacca gatggtaaaa tgttaaaagc aaccaaaata ggtactgtta agacttattt 300 taaaaattat tattatcaaa aatatgttga tataaaaaat gtctattacg ttaaaggtat 360 taaacaaaat ttgatgagtt tagcaaaaat aactgaaaat tatacagtat tagcaactaa 420 agatattgca aaaatattta ataagtctag agagttgata gctgtagcaa acaaaaagga 480 aaatttgttt tatatgaaaa gttttgtatc aaaaagagaa agtaatgaaa tgtatgtaaa 540 ttcactaaaa ctaactgaca aagaaaaatg gcatagggca ttaggccatg taaactttca 600 atatttaaat aaactagtaa ctaataaatt agttgaagga ttgccagaaa aaattgaaaa 660 tgtcgaaatg aaatgcgcaa attgtataga aagcaaaatg gccaatgtac cttttgaaaa 720 tagcagatca aaaacaactg aaattctaga attgattcat actgatttga atggtccaca 780 taacacaact ggttatggtg gagaaaaata tttcctaact tttgttgatg actatagtaa 840 atgcacaaga attttttgta ttaaaagtaa agctgaaaca gctagttgtt tgaaagaatt 900 tgttaattta gtagaaaata aattcaataa aaaggttaaa aaattacgat gtgataatgg 960 caaagaatat ttaaataagg aaatttatga ctttgtaaaa tcaaaaggaa tagagttatt 1020 gccatgccca ccctacgtcc atgaattaaa tcgagtagcg gagagatata atagatctgc 1080 aatggatatg ggtagatgtt taatgcgaga agctaaaatt cataggcgtt actggcccga 1140 aatcataaaa acggtcgcgt atttaaaaaa ccgtacaatt gcaaatacag cggaaaataa 1200 aactcctttc gagatattct ttggtaaaaa acctaatgta gataacttaa aaatttatgg 1260 tagtcgtgtc tttgtaagag taccagaaac tttacgaaaa agcaaatggg acgataaagc 1320 acaattaggc gtattagtag gttatgtaga aaatggttat agagtcttag taaatggtag 1380 aataattcat gctagacatg ttcaagtagt tgaagaaaaa actcaattaa tttgtttaga 1440 aaaacttgat gatgaaaaag atagagattt ggaaaatagc gaatctgaaa ataaaggaaa 1500 tgaaatacta gaaaatgaat ccaaaattga taaatatgat gtaaattcaa atagaaaaaa 1560 tcctcttatc tctactcaga ccttcgggga aagtagagat aacgaattaa actctgatat 1620 tgctgaaaat gaaaattcaa acttaaaagt gcaaagaaaa tcaaatagaa agaaaagtcc 1680 tgtaaataga tatggaaatc ctgtaactca ttttatctat gtaaattaca ttgatgcaaa 1740 tataccgaat acatttgagg aggcgttaaa ctcaaatgaa tataaacaat ggaaaattgc 1800 aatggactca gagataaata gtctaaacaa aaataatact tggcaaattg ttgaaaggcc 1860 aaaagacaaa agggtaattg atgtaaaatg ggtctataaa aggaaaaata acaatgttta 1920 taaagcgaga ttagttgtaa gaggatttca acaaagagaa cacattgaaa atgtttattc 1980 accagtaggc aaaaagcaaa cattgaaaat tttactgtct tacagctgta aaaataattt 2040 attcatagaa caaatggatg tagagacagc tttcttaaat ggtaatgtaa caaccgaagt 2100 atatataaat gaaccgaaag gttatgagac tggtgacaat aaagtttgca agttgcaaaa 2160 ggcattatac ggattgcgag aaagtcctag agcctggtat aattgtttca acgaatatat 2220 tgaaaaattg aatttcatga gaagtaatta cgattactgt ctatacgtaa ataatacaag 2280 taaagatcca atatatatac tagtctttgt agatgatctc ttaatttgtt gcaaagacaa 2340 agataaaata aacaaagtta aagctagtct aatgcaaagg tttgcaatga aagatatggg 2400 caaaattagc caatacattg gtatagatat agattacagt aatgatagaa gtaaaatgac 2460 attaagtcag actaagtata ttgaatcttt agctataaaa tataatttag aaaatgccaa 2520 actgtataat actccaatgg agtccaacct aaaactagaa caagctagtg aaattgatga 2580 aaatataaaa tacagaaatt taattggcga actattatat ataagtacag gcacaaggcc 2640 agatatatct tatagtgtaa attacctaag tcgttaccaa aactgttata accaaacaca 2700 ttataaatat gcgatgcgaa ttcttaagta tttgtacaaa acaaaagatc tcaaactaac 2760 ttactgcgac aatgttaata ccgaaatatt agactgtatg gtagattctg attgtgcagg 2820 tgataatgta gatagaaaat ctacaacagg ctttgtaatt agattgtttg gaaatttaat 2880 tttttggaaa acgcataaac aaagtactgt aacaaaatgt tctacttttg cagaatacac 2940 tgctatgtca gaagcagtga ctgaagtttt gtttgttagg aatctattat gtgaatcttt 3000 tgatgtaaat ttcgaaaacc cgattaatat atacgaggat aactcaggtg caattgcgat 3060 tccaaaatat ggtaacttca caaaaaattc aaagcacatc gaagtgcagt atcattatat 3120 aaatgaaaat tatgaaaatg gaataattga cattgtaaaa atagattcaa attcgaactt 3180 agctgatatg ttaacaaaaa gtctcgataa gacaaaattt ataaaaaata gacaagcatt 3240 gaacttggct tgaacttggc ttgaacttgg tcaaatacat tgaacttggc ctgaacttgg 3300 caacatacca tggaaatgta ttatatatct catcaagatt ataaacttaa aaaggcgtgt 3360 tgtaatatag taagtattat aatcttgatt ataagtacca aaacggtaca cgtcgcgcca 3420 tctattggtc gacgacgttt tcgtgcagtc agacgagctc acgagagagt aagtgtaccg 3480 cgcatgcctc gcgactatac tctaagcgcc tattacgtac tatactatac tccgtgtgta 3540 gagccagact atgctgttct gtactctaca ctctcttgcc tgtaagactc cgatgtgatc 3600 ggacgtgccg actactgggc tctcaagcct aactactttt tcaatatata ttattattat 3660 taacctcact gtttattatt tattccacca ctccaacatg gtagcagagc cctggaattt 3720 tcttttcaat tttgcattaa ccttcggaga aatgcaaaat tgaggggggt aaaaagtgag 3780 gtaagaagaa agccacgtgg aaaagaagtt aacggttatt gttaacaaag tgaatcgcac 3840 agtgaaaaat tgagcagtga aattaaagga gaaagtttct cagaaagatt atatgcttgt 3900 ggagcattga aatttaaatt ttgtgtcgaa acggccgtcg aaaaaaagaa aaatttcaag 3960 tgaagattaa ccagtcaaga aaaggttcaa gtcaaatata tatgctcagt gacaaagatt 4020 aagttgtcag gtattaaaca ataagaagaa ttttgaaaat taataaagat tcaagtgcga 4080 tgtcaataga ttaaagtcgg caagtgaaaa acgtggtaac gtgctagtac agagcacgtg 4140 gcaattttga ggttaagcac atgttctcgt ggcttaaaca tagagtaaaa caagaaaatg 4200 aattagatgc tacagatatt gaattaacaa agaaaataag aaacaagatt agaaaaatat 4260 ttttaaaaga aatcaagaaa attgttgtat tgaaaaatca agtgtacgag aatatacgtg 4320 tcaaagaaga agaattaaaa atgacagata gtggttgcac agatcatatt ataacaaatg 4380 ataaattttt tgaaaaatgt gtagatttga agaatcctgt tgatgtaaaa ttaccagatg 4440 gtaaaatgtt aaaagcaacc aaaataggta ctgttaagac ttattttaaa aattattata 4500 atcaaaaata tgttgatata aaaaatgtct attacgttaa aggtattaaa caaaatttga 4560 tgagtttagc aaaaataact gaaaattata cagtattagc gactaaagat actgcaaaac 4620 tatttaataa atctagagag ttgatagctg tagcaaacaa aaaggaaaat ttgttttata 4680 tgaaaagttt tgtatcaaaa agagaaagta atgaaatgtg tgtaaattca gtaaaattaa 4740 ctgataaaga aaaatggcat agggcacttg gccatgtaaa ctttcaatat ttaaataaac 4800 tagtaactaa taaattagtt gaaggattgc cagaaaaaat tgaaaatgtc gaaatgaaat 4860 gggcaaattg tatagaaagc aaaatggcga atgtaccttt tgaaaatagc agatcaaaaa 4920 caactgaaat tctagaattg attcatactg atttgaatgg tccacataac acaactggtt 4980 atggtggaga aaaatatttc ctaacttttg ttgatgacta tagtaaatgc acaagaattt 5040 tttgtattaa aagtaaagct gaaacagcta gttgtttgaa agaatttgtt aatttagtag 5100 aaaataaatt caataaaaag gttaaaaaat tactatgtga taatggcaaa gaatatttaa 5160 ataaggaaat ttatgacttt gtaaaatcaa aaggaataga gttattgcca tgcccaccct 5220 acgtccatga attaaatgga gtagcggaga gatataatag atctgcaatg gatatgggta 5280 gatgtttaat gcgagaagct aaaattcata ggcgttactg gcccgaaatc ataaaaacgg 5340 tcgcgtattt aaaaaaccgt acaattgcaa atacagcgga aaataaaact cctttcgaga 5400 tattctttgg taaaaaacct aatgtagata acttaaaaat ttatggtagt cgtgtctttg 5460 taagagtacc agaaacttta cgaaaaagca aatgggacga taaagcacaa ttaggcgtat 5520 tagtaggtta tgtagaaaat ggttatagag tcttagtaaa tggtagaata attcatgcta 5580 gacatgttca agtagttgaa gaaaaaactc aattaatttg tttagaaaaa cttgatgatg 5640 aaaaagatag agatttggaa aatagcgaat ctgaaaataa aggaaatgaa atactagaaa 5700 atgaatccaa aattgataaa tatgatgtaa attcaaatag aaaaaatcct cttatctcta 5760 ctcagacctt cggggaaagt agagataacg aattaaactc tgatattgct gaaaatgaaa 5820 attcaaactt aaaagtgcaa agaaaatcaa atagaaagaa aagtcctgta aatagatatg 5880 gaaatcctgt aactcatttt atctatgtaa attacattga tgcaaatata ccgaatacat 5940 ttgaggaggc gttaaactca aatgaatata aacaatggaa aattgcaatg gactcagaga 6000 taaatagtct aaacaaaaat aatacttggc aaattgttga aaggccaaaa gacaaaaggg 6060 taattgatgt aaaatgggtc tataaaaaga aaaataacaa tgtttataaa gcgagattag 6120 ttgtaagagg atttcaacaa agagaataca ttgaaaatgt ttattcacca gtaggcaaaa 6180 tgcaaacatt gaaaatttta ttgtcttaca gctgtaaaaa taatttattt atagaacaaa 6240 tggatgtaga gacagctttc ttaaatggta atgtaacaac cgaagtatat ataaatgaac 6300 cgaaaggata tgagactggt gacaataaag tttgcaagtt gcaaaaggta ttatacggat 6360 tgcgagaaag tcctagagcc tggtataatt gtttcaacga atatattgaa aaattgaatt 6420 tcatgagaag taattacgat tactgtctat acgtaaataa tacaagtaaa gatccaatat 6480 atatactagt ctttgtagat gatctcttaa tttgttgcaa agacaaagat aaaataaaca 6540 aagttaaagc tagtctaatg caaaggtttg caatgaaaga tatgggcaaa attagccaat 6600 acattggtat agatatagat tacagtaata atagaagtaa aatgacatta agtcagacta 6660 agtatattga atctttagct ataaaatata atttagaaaa tgccaaactg tataatactc 6720 caatggagtc caacctaaaa ctagaacaag ctagtgaaat tgatgaaaat ataaaataca 6780 gaaatttaat tggcgaacta ttatatataa gtacaggcac aaggccagat atatcttata 6840 gtgtaaatta cctaagtcgt taccaaaact gttataacca aacacattat aaatatgcga 6900 tgcgaattct taagtatttg tacaaaacaa aagatctcaa actaacttac tgcgacaatg 6960 ttaataccga aatattagac tgtatggtag attctgatta tgcaggtgat aatgtagata 7020 gaaaatctac aacaggcttt gtaattagat tgtttggaaa tttaattttt tggaaaacgc 7080 ataaataaag tactgtaaca aaatgttcta cttttgcaga atacactgct atgtcagaag 7140 cagtgactga agttttgttt gttaggaatc tattatgtga atcttttgat gtaaatttcg 7200 aaaacccgat taatatatac gaggataact caggtgcaat tgcgattgca aaatatggta 7260 acttcacaaa aaattcaaag cacatcgaag tgcagtatca ttatataaat gaaaattatg 7320 aaaatggaat aattgacatt gtaaaaatag attcaaattc gaacttagct gatatgttaa 7380 caaaaagtct cgataagaca aaatttataa aaaatagaca agcattgaac ttggcttgaa 7440 cttggcttga acttggtcaa atacattgaa cttggcaaca taccatggaa atgtattata 7500 tatctcatca agattataaa cttaaagagg cgtgttgtaa tatagtaagt attataatct 7560 tgattataag taccaaaacg gtacacgtcg cgccatctat tggtcgacga cgttttcgtg 7620 cagtcagacg agctcacgag agagtaagtg taccgcgcat gcctcgcgac tatactctaa 7680 gcgcctatta cgtactatac tatactccgt gtgtagagcc agactatgct gttctgtact 7740 ctacactccc ttgcctgtaa gactccgatg tgatcggacg tgccgactac tgggctctca 7800 agcctaacta cttttcaata tatattatta ttattaacct cacgtgttta ttatttatgc 7860 acactccaca tgtagcagag cgtgaaactt acttttcaat ttgcattaac cttcggggaa 7920 atgcaaaaat gaggggggta aaagtgaggt aagaagaaag ccacgtgaaa aagaagttaa 7980 cggttattgt taacaaaagt gaatcgcaca gtgaaaaatt gagcagtgaa attaaaggag 8040 aaagtttctc agaaagatta tatgcttgtg gagcattgaa atttaaattt tgtgtcgaaa 8100 cggccgtcga aaaaaaaaag aaaaatttca agtgaagatt aaccagtcaa gaaaaggttc 8160 aagtcaaata tatatgctca gtgacaaaga ttaagttgtc aggtattaaa caataagaag 8220 aattttgaaa attaatatag attcaagtgc gatgtcaata gattaaagtc ggcaagtgaa 8280 aaacgtggta acgtgctagt acagagcacg tggcaatttt gaggttaagc acatgttctc 8340 gtggcttaaa catagagtaa aacaagaaaa tgaattagat gctacagata ttgaattaac 8400 aaagaaaata agaaataaga ttagaaaaat atttttaaaa gaaatcaaga aaattgttgt 8460 attgaaaaat caagtgtacg agaatatacg tgtcaaagaa gaagaattaa aaatgacaag 8520 acaagctaaa gttgaagata ttatgatacc cgtgtttgat ggtgcaaatt actccagttg 8580 gaaaataaga ttgatgattt tattagaata taaagagtgt aacgaacctg caactagtaa 8640 aataactgct gcatacaaag ataaagaaga tgaatggcag aaaaaagatt taaaagcaag 8700 aacgataata ataagcactg tatccgataa acaattagaa tacattggtg aatgtaaatc 8760 agcatttgac atggtggaaa aattcgataa aatgtactcg acgcaatcca cgtcactaca 8820 aataatt 8827 // ID Sat7_Cis repbase; DNA; INV; 218 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat7_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-218 RA Smit A.F.; RT "Sat7_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000007; originated from Cis_SINE1. XX SQ Sequence 218 BP; 44 A; 46 C; 64 G; 64 T; 0 other; ctcgtgggtg ggaagccagg ctttatccac gggtatcgta agacacctca tgcccacagt 60 cgagttgctg gagctattac agtgtgtgga tggttggttc atttatatcc tcgtgggtgg 120 gaagccaggc tttatccacg ggtatcgtaa gacacctcat gcccacagtc gagttgctgg 180 agctattaca gtgtgtggat ggttggttca tttatatc 218 // ID BEL-15_DPu-LTR repbase; DNA; INV; 262 BP. XX AC scaffold_147; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_DP_; KW BEL-15_DPu-I; BEL-15_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-262 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_147; Positions 95130 95391. XX SQ Sequence 262 BP; 56 A; 79 C; 32 G; 95 T; 0 other; tgtcggaaat attatttccc tcccaaattt atattgcaac gttgtaaccc cccctcattt 60 tttgtcaccc tctctcaccc cctaattgct agaacccccc ttttttgttc tagaagccag 120 ttcgcgtcct gctctcccat cgactggtcg ttagctccgt attttctctc tcgtctaaac 180 tctcagtttc tttcccgttc aattctccat tgtatttgtc tgtttaatac aaagacaaaa 240 caaagtcgtt aatttcccga ca 262 // ID Kiri-36_AAe repbase; DNA; INV; 4656 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-36_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4656 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 731-731 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 288..1106 FT /product="Kiri-36_AAe_1p" FT /translation="MNQNKSAKTNPPITRSVSTSSVCSEGNNNKRPRDDDD FT SNCDMLDKIQQMFSASNAKIEAKIEASNSKLESRISAVENLLFSLKAECTT FT SINNLSMAVTEVRAEIENTAQRLDRFEKASDLIISGIPYAVNENLVESFYK FT LAAILGYNEVDRPIVDLRRLMRSPIPVGATPPTVCQFALKNARDAFYGRYL FT RMRNLSLRHLGFDSDQRVFLNENLTKHARDIRTAAIKLKKQGVLKKVFTRD FT GIVFIQYANGSNAEPVVDCKQLSIPQYSTLSK" FT CDS 1678..4497 FT /product="Kiri-36_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDNVPDENASLNTLIPRIVMNAALKNDHLNVAHINVQ FT SLIARNFTKFHELRMSFSDSKVDVICFTETWLNSTISDAMVHIEGYKLIRN FT DRNRHGGGIRVYLRKSFGHRVIEKSQFLDNNHGEIEYLVVEIKNRTEKVLL FT GVVYNPPNNDCTDTISNLLGNLTVRYESTFFIGDFNIDLLNPNRSRKFKEM FT LDSLSFVCINAEPTFFHRTGCSLLDLILTDSPEIVSKQDQLSMPGVSNHDM FT IFCSIRLTQKNAEPVVTYRDYVNFDSTVLQAAFYNIDWDTFLSMTDPNDLL FT DFMNYHLLLLHDNYIPLRCKKNSSTPWFSNEISSAIVNRDLAYRIWKRSKT FT DNNRNNYKRLRNRVNTMISQAKINSDRLKFNANLPSKQLWNSIKKLGVTKD FT SAFSNDDFSANDINDYFTSNFSNDASFESITTNSQGFSLLQFSEHDIVNSI FT FSIKSNAIGLDNIPIQFLKLLLPLALPIYKHLFDSIIITSIFPRAWKSAKI FT IPIKKKANCSTIGNLRPISILSAFSKVFEKLVKSQISKHVVDNNLLHPFQS FT GFRSNFSTNTALIKVHDDIAHAVDKRGIAILLLIDFAKAFDRVSHSKLVRK FT LTNLFGFSQSTANLIKSYLTDRYQAVLYNNELSSFSPIRSGVPQGSVLGPL FT LFSLFINDLPNTLRFCSIHLFADDVQIYFSMTGNFSMAEIARKINHDLNCI FT LQWSINNLLAVNPSKTKGMIISKLRNPPVQPELYFDGQRIQFYDKIDNLGV FT IFTSNLSWDAFINSQCGKIYGSLKRLNTVSRHFEISTKIKLFKSLILPHFM FT YGEFVYSNASAASLNKLRVALNSCIRYVYNLSRYSSVSQFQSSLIGCKFAN FT FHAYRSCIQMYRLIKSKEPPYLFDKLQFLRNTRTNSLIIPQHSTSYYGNSM FT LLRGISLWNRLSPNLKLSNSLNSFKEGLLEELERMQ" XX SQ Sequence 4656 BP; 1464 A; 866 C; 837 G; 1487 T; 2 other; agtttctgaa gggatgaacg cttcagcgtg actgtgaaag tggtccagtt tttagttgat 60 tatatcctgc tcccgttccg ccttaaacca cccagtgcct atgatacggt gaacgaaaca 120 gtgatttgaa gttttatcat tgagttacct gaatatcgtt cgcgatcgtc tgctaaaacc 180 tatattttcg tcgcgatcaa cgatagcgtt ccaagctagm ccccttcaga atattatctt 240 tttgctttcg ttgttcaacg tcaaaattca acaaaagcta ttgcgttatg aaccaaaaca 300 aatcagcaaa aacgaatcca ccgatcaccc gatcggtttc tacatcgtct gtgtgcagcg 360 aagggaacaa taacaaacga ccaagggatg acgacgattc taactgcgat atgctcgaca 420 aaatccagca aatgttctct gcttctaatg ctaaaataga agctaaaatt gaggccagca 480 attccaaact cgagagtagg atttccgccg tagagaacct gctgttttct ttgaaagctg 540 aatgtacgac gagcatcaac aatctttcga tggctgttac ggaggttcgt gcggagattg 600 aaaatactgc ccaaaggttg gaccgtttcg agaaggccag tgatctcatt atctctggta 660 ttccctatgc tgtcaatgaa aatctggtgg agtcgttcta caagcttgcc gcgatcctgg 720 gttacaacga agtggatcga cccatagtgg acctcagacg gttgatgcga tctccgattc 780 cagtcggggc cactccacca actgtttgcc agtttgcatt gaaaaatgca agggatgcgt 840 tctacggacg ttatcttaga atgagaaatt tgtcgctacg tcatctgggc ttcgatagtg 900 atcagagggt gttcttgaat gaaaatctca ctaaacatgc cagggatatc agaactgctg 960 caattaagct gaagaaacaa ggagttctca aaaaagtttt cacacgtgat gggattgtat 1020 tcattcaata tgctaacgga tccaatgcag aaccagtcgt cgattgcaaa caactgtcaa 1080 ttccgcagta ctcaaccctt tccaagtagt tattcgcttc ctcccctcag ttatccgtgc 1140 ctcctttcat gaatagtcca tgattccaac cactcctgaa agttagttct ctttgaaaac 1200 aaccttatcc aaatagtttt tctatcctta atcttcttcc atgactcttt cctttggtta 1260 ttcctttgac tccttccact cctaaaagtc aatgctgtca tcagtaatct gttgctgttg 1320 ctgtggggaa cgtggctgct gttgtcgaga ggacgttgct gttgttgttg aaaaagcgct 1380 gttactgtcg ttgagaggat gttgctgctg ctgctgttag gaaggcgtaa aagaatgttg 1440 ttgtttctgt ttctctactt gcacactgtt ggtggtatta aaagtaatag tttgaattga 1500 aagggaaaat aataagcaat gaattattac atatagctaa acactgaatc gcaaacctca 1560 atgtaatgtt aatgttagta aatagtaaat attctttttt gttgttcaat attgcatgtg 1620 ggatgggttc ttttagcgga ttttcttcaa agcggagttg caccactact tttgataatg 1680 gataatgttc cagacgaaaa tgcttctctg aacacattga taccaaggat tgttatgaat 1740 gctgcattga aaaatgatca cctaaatgtc gctcatatca atgtacaaag cctaatagcc 1800 aggaatttta cgaaatttca tgaacttcga atgagttttt ccgacagtaa ggttgacgta 1860 atatgtttca cagaaacatg gcttaatagc actatcagtg atgccatggt tcatatcgaa 1920 gggtataaat taattcgtaa tgatcggaat agacacggtg gaggtattcg cgtgtatttg 1980 aggaaaagtt tcggtcatcg tgtaatagaa aaatcgcagt ttttggataa caatcacgga 2040 gaaattgaat atttagttgt ggaaatcaaa aatagaacgg aaaaagtcct acttggagtg 2100 gtttacaacc ctcccaataa tgattgtact gacactataa gtaatttact aggaaattta 2160 actgtgcgat atgaaagtac tttttttatt ggagatttca atatagattt actaaatccc 2220 aaccgttcac gaaagttcaa agaaatgcta gacagcttgt catttgtttg tattaatgct 2280 gaacctactt ttttccatag aactggttgt tctttacttg atctcattct aaccgactct 2340 ccagaaattg tctcaaaaca agatcagctc tcaatgccag gggtttccaa ccacgatatg 2400 atattttgtt cgatacgatt aacccaaaaa aatgctgaac ctgttgtcac ttatcgtgac 2460 tatgttaatt tcgattcaac tgtattgcaa gccgcttttt acaacattga ttgggatact 2520 ttcttaagta tgactgatcc taatgatttg cttgatttca tgaattatca tttacttttg 2580 ctccatgata attatatacc tttaaggtgc aagaaaaaca gttcaacccc atggtttagt 2640 aatgaaattt cttctgcaat tgttaaccgt gacttagcgt acaggatctg gaaacggtcc 2700 aaaacagata acaacagaaa taattacaaa cgactaagaa atagagtaaa tacaatgata 2760 tctcaagcta aaatcaatag tgacagactc aaattcaatg ccaatttgcc cagtaaacag 2820 ctttggaaca gtattaagaa attaggagtt actaaagatt ctgctttttc aaatgatgat 2880 ttttcagcta atgatattaa cgactatttc acctcaaact tttcgaatga tgcttcattt 2940 gaatcaataa caaccaattc tcaagggttt agcttacttc aattttcgga acacgacatt 3000 gttaacagta ttttttcaat caaatcaaat gcaataggac ttgataatat accaattcaa 3060 ttcttaaaac ttctattacc cttagctttg ccaatctaca agcatctttt cgactcaata 3120 attataactt caattttccc aagagcttgg aaaagtgcta agattattcc tattaaaaag 3180 aaagcaaatt gttctacaat tggtaacctt cgtccgatta gtattctaag tgcattttct 3240 aaggttttcg aaaagttagt aaaatctcaa atttctaaac atgttgttga taataatttg 3300 ctgcacccat ttcagtcagg atttcgaagc aattttagta caaataccgc attgatcaag 3360 gtccacgatg atatagccca tgcagtggat aaaagaggca tagctatact tttgctgatt 3420 gatttcgcca aagcgttcga ccgagtgtcg catagtaaac tagttagaaa attaacaaat 3480 ctgttcggat tttctcagtc aactgctaat cttattaaat cttatttaac ggatagatat 3540 caggctgtat tatataacaa tgagttgtca tcatttagtc caattagatc tggagtccct 3600 caagggtccg ttctaggacc tttattattt tccttgttta tcaacgatct tccaaatacc 3660 ttgcgttttt gttcgattca cttattcgcc gatgatgtcc aaatctattt tagtatgact 3720 gggaatttca gtatggctga gatagccagg aagattaacc atgacttgaa ttgcatcctc 3780 caatggtcga taaataattt gttggctgta aatccatcaa aaacaaaagg tatgattata 3840 tcaaaactca gaaaccctcc cgtacaacca gagctttatt ttgatggtca aagaatacag 3900 ttttacgaca agatagataa ccttggggtt atctttacat ccaacttgtc atgggatgct 3960 ttcatcaatt cccaatgtgg aaaaatatac ggttcactga agaggttgaa tactgtgagc 4020 aggcattttg aaatttcaac aaaaatcaaa cttttcaagt cgttgatact tccccatttt 4080 atgtacggcg aattcgttta tagcaatgca tctgctgctt cactcaataa attaagagtt 4140 gctttaaatt cgtgtattag gtatgtgtat aatttatcta gatattccag tgtttcacaa 4200 tttcaaagtt ctttaatcgg atgcaaattt gccaactttc acgcatatag atcttgtata 4260 caaatgtata gactaatcaa gtctaaggag ccgccatatt tatttgataa gttgcaattc 4320 cttaggaata ccagaacaaa tagcttaatt attcctcagc attcaacatc atactatggt 4380 aattcaatgt tgttgagagg catttcactt tggaatagat tgtcaccaaa tcttaaatta 4440 tccaatagtt taaatagctt taaagagggt ttgcttgaag aacttgaacg catgcaatag 4500 atgaattaat catatgataa acaaattcag aaactttgac ttagagtaaa tgaattgttt 4560 gtaaagctca catccaccac agtgtaacat taaaaaagga cttatcctta cgttacataa 4620 atcsaaataa ataaataaat aaataaataa ataaat 4656 // ID L2B-2_AAe repbase; DNA; INV; 4631 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4631 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1406-1406 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >96% CC identity. CC Closely related to CR1-1_AG and L2B-1_CP. XX FH Key Location/Qualifiers FT CDS 296..1705 FT /product="L2B-2_AAe_1p" FT /translation="MSVQCHVCAKGINTANDRVFCFGGCGQVLHTKCADLT FT NAEATALRGNVSIKYMCHDCRKKQVCLNTMMSKWEEIVTAINEIKIRLQKI FT ESNCERMNVAEMISKSEQNIKTIIEQSISAQMSRCRSLGDQCLVVEKDLQN FT IDVFVVGEGSAVSDENSSSSYAAVAKSNKXTSVSDSVLRSGRIRNKSSVTT FT PTTFGNGRKLNNRXIDANEVIMVNDNSGKSSKKTLECTVRIKPTVQQSNQQ FT TKKEVREKINPSQMGIKSVRNGVNGAIVVECGTKNEAEGLVNKLRDELGEN FT YATKIEQPKRPRIKILGVEDEYESDDLKDILNSQNDICNIEYLRILKTIKH FT RRGVYTEFTLICETDPGTFEQIMRKGKLYIDLSSCRVTESIDILRCYKCCG FT YAHKSQECKNQLHCARCAEGHDIKECSSEQLKCINCVVSNKERKTKLDVNH FT SARSFNCPIYLNKIKFSRRFIDYEK" FT CDS 1709..4510 FT /product="L2B-2_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSKSRADIIFLNISGITAHFDELKLLVDNKKPKLVML FT TETHLTVDINITEYSIRNYKICCCFSCSRHTGGVMMYIHNSIKYQEISNVM FT VGQNWLVAINVAKGLRTGVFGLLYHSPNGSDKEFLEYLEHDWFENVLNDSQ FT MNLIAGDFNINWRNPNDSGSLRNVTQCFNLSQKVHEVTRRTRLSETIIDLV FT FCNEETIRVSVEDESKISDHETLRIDLFDNSEPVDDFLTIKCWKKYTKDAL FT LTILREKYQNESPRLLDEKADILGTILKESVNKLVVIKTIKCRSRKKWYSL FT ELKNLQVLRDEAYKKASTSWDEADWQHYKVLRNEYSHSIRNAKAEYTQRKI FT TQSKHNSKQLWKTLKALWKNRENPASCVSFNGNSSDNDQEISEKFNSYFID FT SVQVINNSIEDVPDICSDRNPQLLERWSVFRRISMETLKNAIYRIGSSSGM FT DNVNLQVLKDSFEVTGEYLLDIINESLETGHCPNVWKQSMVVPIPKVSGTT FT KAEEFRPIDMLPIYEKVLEIIVKDQLLEYLSDQNILIDEQSGFRQKHSCET FT ALNLLLYKWKRMIEEKKTIVVLFLDLKRAFETISRPILLKTLRDYGIAGKV FT LDWFESYLTNRTQRCQYNGTTSTPKAVPLGVPQGSVLGPLLFIMYINDMKK FT GIKYCDINLFADDTVLFIGEKDPSTAIWKIREDIETLAKWLKFKKLKLNVQ FT KTKFMIITNRKQINIEEFKLEIDGAEIERVEVFKYLGIYIDQKLTFKVHIE FT SLVKKVARKYGMLVRLKSQLTFWSKIFLYKTLVAPHIDYCSSVLYLASDIH FT LKRLQKLQNKFMRFILCCNKYTPVRTMLETLQWQSIKQRIIFNVLVLIYKL FT TNNLLPDYLCNIVVRGRNLHQYRTRHIDDLRVVPFTMTTNQKSIYYSGIRT FT FNQLPLDVRNVRTVNEFKRKCSVWIKISID" XX SQ Sequence 4631 BP; 1749 A; 640 C; 942 G; 1298 T; 2 other; agtgagtgag tgctcgttca agcagactac tttattgtga ctgaagttaa tattaaaaaa 60 aaacccgtgt aaaattagct aatgagtaat taatcgtaca tcattagacg cgttaatgaa 120 ttctctgttg aatggtgctt agtgcgcagc gttaaaatcg agataatcga agaaacaccg 180 ggaattatat tttattactg cggtacaatt tcgattcgct ttgtattgaa agtgaaaaaa 240 gaacaaagcg gccgtaggcc gtatacagta caagaaaggt agcaacgcga acagaatgtc 300 ggtgcagtgc catgtttgtg ctaagggaat aaacacagcg aacgacagag tattttgctt 360 tggcggttgc ggccaagtgc tgcatacgaa atgtgcggat ttaacaaacg ctgaagctac 420 agctctacgt ggaaatgtgt ccataaaata tatgtgtcac gattgccgaa agaaacaagt 480 ttgtttaaat acaatgatga gcaaatggga agagattgtg acggcaatca atgaaattaa 540 aattcgtttg caaaaaattg aatctaattg tgaaagaatg aatgtagcag agatgatatc 600 gaaaagtgaa caaaacataa aaacgatcat tgagcagtct attagtgcac aaatgagtcg 660 atgcagaagt ctaggagatc aatgtctcgt ggtcgaaaaa gatttacaaa atatagatgt 720 ttttgttgtg ggcgaaggct cagcagttag tgatgaaaat tcatcttcca gctatgctgc 780 agttgcgaaa agcaataaaa awacgagtgt gagtgattct gtgttgcgtt cgggtcgcat 840 tagaaataaa agttctgtga caacccctac aactttcggc aatggtcgaa agttgaacaa 900 ccgtsggatc gatgccaatg aagtgattat ggtaaacgat aatagtggga aaagttccaa 960 aaaaacatta gaatgcacag tgcggattaa gcctaccgtg caacagagca accaacagac 1020 aaagaaagag gtaagggaaa aaatcaatcc atcccaaatg ggtatcaaga gtgtgcgtaa 1080 tggggtgaat ggtgcaattg tggttgaatg tggaacaaag aatgaagccg aagggcttgt 1140 caataaatta agagatgaat taggcgaaaa ttatgccact aagattgagc aaccaaaaag 1200 acctagaatc aaaattctgg gtgtcgaaga tgagtatgag tccgatgatt tgaaagacat 1260 tttgaatagt cagaatgata tttgcaatat tgaatacttg cgtattttga aaacaatcaa 1320 gcatcgtaga ggggtttata ctgaatttac ccttatttgt gaaactgacc caggtacatt 1380 tgagcaaatt atgcgtaaag gaaaacttta tattgatcta agcagctgta gagttacaga 1440 aagcattgat atactacgat gctacaaatg ttgtggttat gctcataaat cacaagaatg 1500 caaaaatcag cttcattgtg caagatgtgc agagggccac gatattaaag agtgttcatc 1560 ggaacaacta aaatgcataa attgcgtagt ttcaaataaa gaacggaaaa caaaactaga 1620 cgtaaatcat agcgcaagaa gcttcaactg tccaatttat ttgaataaaa ttaagttttc 1680 aagacgattc attgattatg agaaatagca atcaaaaagt agggcggaca ttatattttt 1740 gaatatttct ggaattacag cccattttga tgaacttaaa ctacttgtgg ataataaaaa 1800 accgaaactg gtgatgctaa ctgaaacaca tttgacagtt gacatcaata ttactgaata 1860 tagcataaga aattacaaga tttgttgttg cttttcatgt tcaagacata ctggaggagt 1920 aatgatgtac attcacaatt ctataaagta tcaagagata agtaatgtaa tggtaggaca 1980 aaattggttg gtagcaataa atgtagccaa agggctaaga actggagtgt ttggattgct 2040 ttatcattca ccaaatggta gtgataaaga atttctagaa tatttggagc atgattggtt 2100 cgaaaatgta ctgaatgaca gtcaaatgaa tttgattgct ggagacttca atatcaattg 2160 gcggaatcca aatgacagtg ggagtttacg taatgttaca caatgtttca atttaagtca 2220 aaaagttcat gaagtaacaa gacgtaccag attatcggaa acaataatag atttggtatt 2280 ctgtaatgag gaaactattc gagtgtcagt tgaagatgaa agtaaaatat ctgatcatga 2340 aacgttgaga attgatttat ttgataactc cgagcctgtt gatgatttcc tgactattaa 2400 atgctggaaa aaatatacga aggatgcact actgacaatt ttgagagaaa aatatcaaaa 2460 tgagagccct agattgttag acgaaaaagc agatatttta ggaactattt tgaaagaaag 2520 cgtaaataaa cttgttgtga taaaaaccat aaaatgtcga tctcgtaaaa agtggtatag 2580 tttagaatta aaaaatctgc aagttttgag agatgaggca tataaaaagg cgagcacaag 2640 ttgggacgaa gcagactggc aacattacaa agtattacga aatgagtact cacactctat 2700 tagaaatgca aaggcagaat acacgcagag aaaaataacc caaagcaaac ataatagtaa 2760 acaactatgg aaaacattaa aagcattgtg gaaaaatcgt gaaaaccctg ctagttgtgt 2820 cagtttcaac ggaaacagca gtgataatga tcaagaaatt agcgagaaat ttaactcata 2880 tttcatagat agtgttcaag tgataaataa cagtatagaa gacgttcctg atatatgtag 2940 tgatagaaat ccccaattgt tagagaggtg gagcgtattt cgccggattt caatggaaac 3000 tttaaaaaat gcgatatata gaataggtag ttcatcagga atggataacg taaatcttca 3060 ggtgctaaaa gactcattcg aagtaacagg tgaatattta ctagatataa tcaatgaatc 3120 tctcgaaacc ggacattgtc cgaatgtatg gaaacagtca atggttgtac cgattccaaa 3180 agtatctggg acaacgaaag ctgaagagtt tagaccaatc gatatgctgc caatttatga 3240 aaaagtacta gaaatcattg tgaaagatca attactggaa tatttaagtg atcaaaatat 3300 attaatcgat gaacaatctg gatttagaca gaagcattct tgcgaaacag cacttaactt 3360 acttttatat aaatggaaac gaatgatcga agagaaaaag acaattgttg ttctgttttt 3420 ggatttaaaa agagcatttg aaacaatatc acgccctatc ttgttgaaaa cattgcgtga 3480 ttatggaatt gcgggaaaag tcctggattg gtttgagtca tatttgacta atagaacaca 3540 aaggtgtcaa tacaatggta caacttctac acccaaagct gtaccgttag gagtacctca 3600 aggaagtgtt ttagggccat tattatttat aatgtacata aatgacatga aaaaaggaat 3660 caaatattgt gatatcaatc tgtttgcaga cgacactgtt ttattcattg gagaaaaaga 3720 tccttcgaca gcaatatgga aaataagaga ggatattgaa acgttggcaa aatggctaaa 3780 attcaaaaag ttaaaattaa acgtacaaaa gaccaaattt atgattataa cgaatagaaa 3840 acaaattaat atcgaagaat ttaaactcga aattgacggc gctgaaatcg aaagagtaga 3900 agtatttaag tatttaggga tttatattga ccagaaattg acattcaaag ttcatataga 3960 aagtttagtt aaaaaagtag ctagaaaata tggtatgttg gtgcgtctga aaagtcagtt 4020 aacattctgg agtaaaatat ttctatataa aacattagtt gcgccacaca tagactattg 4080 ctcatcggtt ttatacctcg ctagtgacat acacctgaaa agactacaaa aattgcaaaa 4140 caaatttatg agatttattt tatgttgtaa taagtatacc cctgttcgaa ctatgttaga 4200 gacactgcaa tggcagtcta ttaagcaacg tattattttc aatgtgcttg tactgatata 4260 taaactgacg aacaatcttt tgccagatta cttgtgtaat attgttgtga gagggagaaa 4320 tttacaccag tacaggacaa gacatattga cgatttgcgt gttgtgccgt tcacaatgac 4380 cacaaatcaa aaatcaatat attacagtgg gataagaaca ttcaatcagt tgccattgga 4440 tgtaagaaat gtaagaactg ttaatgaatt taaacgaaaa tgttctgtat ggattaaaat 4500 aagcatagat taagatgata acttatgaat ttttgtatat atatatattt tttttttgta 4560 acaatttgtg aattcttgta aaccaaacta tctgaagata aataaatatg aactactact 4620 actaaaaaaa a 4631 // ID Homo3 repbase; DNA; INV; 2359 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo3 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo3. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2359 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 392..1540 FT /product="Homo3_1p" FT /translation="MKDHLRSIHKIDQNVEVPAVPNTSGVAQISIAESFQN FT MVEYSTDGNKTKRINDAIIYMISKDVQPFSIVENKGFIHLMNTVAPRYKIP FT SRYLITKWVDEKFLEMKEMWKRLLNGKTLTLTMDVWSDQMSMRSYLGITAH FT FQLELEMTSLTIGVLELSERHTSVYLTEMLESCCKDWHIEKELVTAVVTDN FT GANVVKAVDLGFGRKKHIPCFAHTLNLIARSAMQRDSISACIEKVKGIVTF FT FKQSCVASDSLRKVTDKVLIQDVATRWNSTYYMIERYLELKNFVNDIVISV FT RNDVEILNGNELQLLNNIMPMLRPLEEATKIIGSDTYCTASMVIPMVNILK FT TKLLNVDDITPEANDIKKFLLFEIDKRMGSIEKVCLKCHH" XX SQ Sequence 2359 BP; 824 A; 381 C; 457 G; 697 T; 0 other; tagtgatgtg aaaaaacatc gatgcatcga caatcgatgt tttttggcga tgttttccga 60 tgtttttatt gataccgatg ttaagtgatg catcgatgtt tgacagaaca tcgggcgttg 120 aatttcacaa cattgcagaa catatgggta tactgccact tggaaaaatt cgcgaatttg 180 ttttattttt ttaccccgtg tttgcaatgc caaagtaagt aaattgttga aaatatatat 240 aaatatagtt atttgcaatt acaaatgtat ttattctaga ataacaacaa agacaagctt 300 ttgctggagg cactttacta aaattgatgt aaatagcgca aaatgcaaca tttgcaacaa 360 agttttgaag actgcgggta acacgaccaa catgaaggac cacctgcggt ccattcataa 420 aatagatcaa aatgttgaag tacctgcagt acctaatact agcggtgtgg cacaaatttc 480 tattgctgaa agctttcaaa atatggtgga gtatagcacc gacggaaata aaacaaaaag 540 aatcaacgat gcgattattt atatgatatc aaaagatgtg caaccatttt cgatagtcga 600 aaacaaaggc ttcatacatt taatgaatac ggttgcaccg cgttacaaga taccgtcgcg 660 atatttaatt acaaaatggg tggatgagaa gtttttggaa atgaaggaaa tgtggaagag 720 attgctgaat gggaaaacgc taacattaac aatggatgta tggagcgacc agatgtcaat 780 gcggagttac ttgggcatta ctgctcattt tcagcttgag cttgaaatga cttccttgac 840 tatcggagtt ttagagctaa gcgagcgtca cacatcagta tacctcactg aaatgctgga 900 gagttgctgc aaagattggc acattgaaaa ggaattagtg acagccgtcg tcaccgacaa 960 tggggcaaat gtcgtcaaag ccgttgacct aggttttggg cgaaaaaagc acataccatg 1020 ttttgcacat accctaaatc taattgcaag atctgctatg caaagagatt caataagtgc 1080 atgcattgaa aaggtaaagg gaattgtaac tttttttaag cagagttgcg ttgccagcga 1140 tagcttaaga aaagtaacgg acaaggtatt gatccaagat gtggctacta gatggaatag 1200 cacatactac atgattgagc gttacttgga gcttaaaaat ttcgtaaacg atattgtcat 1260 aagtgttcgc aacgatgttg agatactaaa tggaaatgaa ttacagttgc ttaataatat 1320 catgccaatg ctgcgccccc ttgaggaagc cacaaaaata ataggctcag atacatattg 1380 cacagccagt atggttatcc caatggtgaa tatcttaaaa acaaaattgc ttaacgttga 1440 tgacatcacg cccgaggcaa atgacattaa gaaatttctg cttttcgaaa tcgataagcg 1500 tatgggttca atagagaagg tatgcttgaa atgtcatcat taatttattc taactctatt 1560 actaaaatta agtctatgat aaaaccgatg gtcaaagact acaatagtaa tgatatcgtg 1620 acaagcacaa ctaaatccga ttctttttgg gaccaccatc accagctggc acattctcat 1680 gagcctgatg gggatattga tgtggagatg atagcgtacc tgcgaatgcc attagcgtca 1740 tttgaaagca accctctaca agtatgggaa ggtatgaaaa acacatatcc caatttacat 1800 aaaatggcgc tgcaattttt aacggtagtt tgatcgtcgg ttccttctga gcgggttttt 1860 ttcagcggcg tcatatcttt taagccaaag gagaaatagg ttagatccaa accgactaag 1920 ccgcatttta tttttgcaaa gtatcgacaa aaaatacttt tttgaaaaaa agtaattaca 1980 ataattaaaa tttcgttgaa agcataaaaa aggactgcta aggaaacatt tattaatatt 2040 atttattgtt atattattat tttattcatt ttgattctca atgtaattaa aaagactgct 2100 aaggaagcat ttgttatatt attttttaat ctgttaatta tattgttaag taaaccaatc 2160 agtagagaaa taaaaaaaaa actaataaag catttattta aaaggctttt aagaattgaa 2220 ttaacgaaaa ataacggaat tggtagaaaa catcgataca tcgatgttta taaccgatgt 2280 tttacacaac atcggtttat cattttgcaa acatcgatga acatcgacat cgaattttcg 2340 acccaaattt acatcaata 2359 // ID BEL-136_AA-LTR repbase; DNA; INV; 385 BP. XX AC supercont1.101; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-136_AA_; KW BEL-136_AA-I; BEL-136_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-385 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.101; Positions 1669143 1668759. XX SQ Sequence 385 BP; 109 A; 79 C; 77 G; 120 T; 0 other; tgttagatat aagtttgatt ataattttgt acaccactgt aataacttag attggttagt 60 tgagtgacca tacataggtt tcacacaacc cgtatcgcta ctcgagagag gtagatacga 120 agattcgctc tccccgcttc gatagattag aagcaagtta gttataaatc gaccgtcgag 180 ataatcggtt acgatctgtg gatcgtgcgt cccttttgat acgtagaaat acagtccata 240 caatcaactc tagtttctaa ttaattaccg ttgtaattaa taaattctac atagtgaccg 300 tatcaccagt attcgtcgcg tgtgcaatcc gaaactctat ctttcgcggg tgttttcgct 360 gaagtgcgaa ccgggagttc ccaca 385 // ID Gypsy-6_DPu-LTR repbase; DNA; INV; 328 BP. XX AC scaffold_54; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_DPu_; KW Gypsy-6_DPu-LTR; Gypsy-6_DPu-I. XX NM Gypsy-6_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-328 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 728-728 (2010). XX DR Genome; scaffold_54; Positions 500956 500629. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 328 BP; 63 A; 92 C; 73 G; 100 T; 0 other; tgtagcgggg atatcacaga taccccgctg gcattaagac aagccccttt tcccgctgat 60 agatgtcacg actgtgaccc tcccccaccc tcttcttatt gtgtcacccc cctctcaagc 120 actctctctg gttcgagctg tcaagctgta cacgtctccc cccgtctgct aagattgtgt 180 catttgggtg ccgtgttagg ctccgtcgtg ttgaaaaggt atgaagtaca tttgattccc 240 ccgtgattat tttgcattcg tgtcttattc atgtgagaga cgatcgcgtc ttattagagc 300 agacttttgt gggcgttatc ccgcaaca 328 // ID Gypsy-44_CQ-I repbase; DNA; INV; 5730 BP. XX AC AAWU01034529; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-44_CQ_; KW Gypsy-44_CQ-LTR; Gypsy-44_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5730 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 467-467 (2011). XX DR Genome; AAWU01034529; Positions 10222 4493. XX CC Positions [4756-5244] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 755..1972 FT /product="Gypsy-44_CQ-I_2p" FT /translation="MEKITISLGKLKIIEENFKKAPNRSYTRQFLENKLKE FT VKILRNEIIDQLVLLEETLDSNTHARIYSEYTNLVGKLLNFIENKIPLARN FT CSQLTLKNLAKAAIICKRLSMADPFDIKTATALVQPYDGCFDKLDSFIDTA FT NLLKELTKEAHMAMAIKFLKTRLSGKARQCLPEKIATIDELLQHVKEQCAE FT FVTPHSILAKLKNTRQNAGVEKFCTDVENLTSQLRNVYISQKIPGDVAKSM FT ASKAGVDTLISGIKNPETKLILKAATFNDIKEAVQKINENSQTNSETNEIQ FT VLHYRRTQSNQHGNYRGRSNFQRGNFNRRNFHDNQNRYDNNRNHSYDNRNI FT YNNNRNRFNDNRNQNNNFGRTRDTQSRNVNRNVFISENVNHPSTAPQVDVN FT QGHGEQITHPRT" FT CDS 2029..5589 FT /product="Gypsy-44_CQ-I_1p" FT /translation="MCEKPCTLIVDTGADVSLIKENILKPITNIYTAQKCI FT LNGITEGKSESIGLAHTNVEINGVEIPIDFQVVESNFPFRTNGILGRDFLT FT KYGCNICLKSWLLTFSVNNEIFEIPIEDKFKDELVIPPRSQVVKKMTIPGI FT KKDSVILADQIKPGLFIGNSIVNSKEQYINILNVKPTAETIPQNYKPKIIP FT LEKFIFKNTQETEIKNLIRNEKLLKELNVSNIADEEIKEKLRQLCSNYNDI FT FAMSDEPLSVNNFYKQTIRLDSESPVYTKNYRIPETHKIEVNKQVQKMLDD FT NIIRPSISPFNSPILLVPKKSSNDEKKWRLVVDFRQLNKKIIGDKFPLPRI FT DEMLDSLGRAKYFSTLDLTSGFHQIELDENSKQFTAFSNDFGHFEFNRLPF FT GLNVSANSFQRMMMIALSGLPPECAFLYIDDILVIGCSVKHHLSNLEIVFK FT KLRHYNLKLNPMKCNFFKHDVTYLGHHISENGIQPDPSKFDVIKNYPEPKD FT ADEIRRFVAFCNYYRRFIPKFAEITHPLNKQLRKNSVFDWNNECKTAFETL FT KKKLMSEPILKFPNFKKEFVLITDASKVACGAILAQCYDNVDLPIAFASKA FT FTKGESNKSTIEQELTAIHWAVTYFRPYLLGRKFIIKTDHRPLVYLFSMKN FT PSSKLTRMRLDLEEFEFTVIYVQGRLNVGADALSRIQIDSETLKQMTILRV FT TTRAMVSKNEPNTRADVDDSVNDELDHLKVIESLNNTEVYDMPKLCIRPKN FT NSNAFHYKIASKNYKKDLTLESSIQINNYTDLKNMFKNLEKSAKHLNIGKI FT ALALPSTIFEIISVHNFKIHGNQVLDKLKIILYQQQQIIENKELVNQIIAD FT NHDSAFGGHVGINRLYRKLKGLYKWANMKNTIKNYIKNCITCKQNKHTTKT FT NENFTLTPTPLKAFDSIAMDTIGPFPKSNSDNRYALTIQCDLSKYIIVKPI FT KDKQASTIAKAFIESCILVYGTPSIIRTDQGTEYKNEIFNKISEMLQYTHT FT FSTPYHPQTIGNLERNHRCLNEYVRQFINESHTDWDDWLPYYSFCYNTTPH FT SDLPYSPFELVFGKIASIPTNIIKTKQVEPIYNFDNYYYELKHKLQYTAMK FT TRELVEKIKNNRKNKQQQVANPIQIKINDLVLIENQNRSKLDKVYKGPFKV FT VAIEHPNITILNENNELYTLHKNRIIKF" XX SQ Sequence 5730 BP; 2283 A; 1011 C; 933 G; 1503 T; 0 other; cgatatggcg accgtgaaca gttgacgtga aactactaaa agtacaaagt gataagtgtg 60 actgcaaatc tttaaatcct aagatccaga aagatgggtt ggtttgttgg agataccatc 120 gtcacgaatg aagtcgcgaa agacgaaaca agaattattg ccatcatttt agtgattcta 180 gtgattgtac taatcataac cggacttttt aaagcatacc acgctttcat taagagccat 240 ataacaaagt cagcagcaag agaagttgct ctgaacaaca taacgtgcaa atgaactaaa 300 gagtttaaaa atacaatatt gactcagagc cacagtggaa aagggaagaa aaagtgattg 360 aaatcagcta ataacatgac ccagcaatct tcaatgagag cattaagttg agtgaaacct 420 aacaatgcag cagaatacta aatcagaagt ccaacacccc aatctgggtg tacagcgaag 480 tacagcagcc caaaccgagt ggagcacagc gaagcaattt tgatgtcatc caaatccagc 540 acagaagatt ttcaacaaca agccgagtgg agtacagcga aggtcagcat cccaattcaa 600 gtgaagcaca aagaactacg ctaactcaaa gaaatccagc aactcacctg acaagtccag 660 cagcaatgtc atccaaatcc agagcagatg attttcaaca agtaagcaaa ctactcaatt 720 attaattatc cagaggcaaa ttaagtagag tcacatggaa aaaattacaa ttagcttagg 780 aaaattaaaa attatagaag aaaatttcaa aaaagcacca aacaggagct atactagaca 840 gttcctagag aataaactta aagaggtgaa gattcttagg aacgagataa ttgatcagct 900 agttttactt gaagaaacct tagattcgaa tacacatgca aggatatatt cagaatatac 960 caaccttgtt ggaaaattac taaatttcat tgagaataaa atccccttag caagaaattg 1020 ctcacaattg actttaaaaa atctagccaa ggcagccatt atttgcaaaa gattaagtat 1080 ggctgaccct ttcgacatta aaactgcgac ggcactcgta caaccgtacg atggctgttt 1140 cgataaactt gattcattta ttgatacggc taatttatta aaagaactca ctaaggaagc 1200 tcatatggct atggccataa aatttttaaa aacaagatta tcgggtaagg caagacaatg 1260 cctaccagaa aaaattgcaa caattgatga actactacag catgtaaaag agcagtgcgc 1320 ggagtttgtt actccgcata gcattctggc taaacttaaa aacaccagac aaaatgctgg 1380 tgtcgaaaaa ttctgtaccg atgttgagaa tttaacctct caacttcgaa acgtatatat 1440 ctcacaaaaa ataccggggg atgtagcaaa atctatggcc tcaaaagcag gcgtagatac 1500 actgatttct ggaataaaaa atccagaaac aaaattaatt ctgaaagcag ctacatttaa 1560 cgacataaaa gaagctgtcc agaagataaa tgagaattcg caaacgaatt cggaaacgaa 1620 tgaaatacaa gtattgcatt atagacgcac acaaagtaat caacatggaa attaccgagg 1680 tagaagtaat ttccaacgtg gaaatttcaa tcgtagaaat ttccacgata atcaaaaccg 1740 ttacgacaat aatcgtaacc attcctacga taaccgaaac atttataata acaaccggaa 1800 ccgttttaac gacaaccgga accaaaataa taatttcggg agaacgcgag atacgcaatc 1860 ccgaaacgta aaccgaaacg tttttatttc ggaaaacgta aaccatccgt cgactgcccc 1920 ccaagtcgac gtgaaccagg gccacgggga gcagattaca cacccaagaa cgtaaatatt 1980 ttcaatttta atgtacacta ttcaattttt gttaccatca agttggatat gtgtgaaaaa 2040 ccctgcacac tgatagtaga tacaggtgca gatgtatcat tgataaaaga aaatatttta 2100 aaacctataa ccaacatata cactgcacaa aagtgtattc taaatggaat aactgagggt 2160 aaatctgaaa gtattggcct cgcgcatacg aacgtagaaa ttaatggagt tgaaattcca 2220 attgatttcc aagtcgtaga aagtaatttt ccttttcgca caaatggaat tttgggtaga 2280 gattttttaa cgaaatatgg atgtaacatt tgtctaaaat cttggctttt gacattctcg 2340 gtaaataatg aaatttttga aattcctatt gaagataagt ttaaagatga attagtcata 2400 ccacctagaa gtcaggtagt aaagaaaatg actatacctg gtatcaaaaa ggacagcgtt 2460 attttagctg atcaaataaa accaggatta ttcatcggaa attcaattgt aaatagtaag 2520 gaacaatata taaatatttt aaatgtaaaa ccaactgcgg aaacaattcc acagaactat 2580 aaacctaaga ttattccact agaaaaattt atttttaaaa atacacaaga gactgaaata 2640 aaaaatttaa taagaaatga aaaattatta aaagaactga atgtgtctaa tatagcagat 2700 gaagagataa aagaaaaact tagacagctt tgctcaaatt ataatgatat tttcgctatg 2760 agtgacgaac cgctttcagt taacaatttt tataaacaaa cgataaggct agattcagaa 2820 agccctgtat atacaaaaaa ttataggata ccagaaactc ataaaatcga agtaaacaaa 2880 caagtacaga agatgcttga tgataacatt ataagaccat cgatttctcc tttcaactcg 2940 ccaatactac ttgtaccaaa aaaatcaagt aatgacgaaa agaaatggag gcttgtagta 3000 gattttcgtc aattaaacaa aaaaataatc ggagataaat ttccacttcc tcgcattgac 3060 gaaatgctag acagcctagg tagagcgaaa tatttttcaa ctctagatct tacttccggt 3120 tttcatcaaa ttgaactaga tgaaaactcg aagcaattta cagcattctc gaatgatttt 3180 ggtcatttcg aattcaaccg attacctttc gggttgaatg tatcggcgaa cagtttccaa 3240 agaatgatga tgatagctct cagtggttta ccacctgagt gtgcttttct ttacatcgat 3300 gatatccttg ttattggatg ttcagtcaaa catcacctca gcaatctgga aatagtattc 3360 aagaaactaa gacactacaa tttaaaactc aaccctatga agtgcaattt tttcaaacac 3420 gatgtaactt atttaggaca tcacatttct gaaaacggta ttcaaccgga tccgtcaaaa 3480 ttcgacgtca taaaaaatta ccctgaacct aaagatgcag acgaaatcag aagatttgtc 3540 gcattttgta attattacag gcggtttata ccaaaatttg ccgaaataac tcaccccttg 3600 aataagcaat tacggaaaaa ctcagtgttt gattggaata atgaatgtaa aacagcattt 3660 gaaacactaa agaaaaaact tatgtcggaa ccaattttaa aatttccaaa ttttaaaaag 3720 gaatttgttt taattacaga tgcctctaaa gtggcatgtg gagcaattct tgctcaatgt 3780 tacgacaatg ttgatttgcc aatcgcgttt gctagcaaag cgttcacgaa aggcgagtct 3840 aacaagtcaa caattgagca agagctaaca gctattcatt gggccgttac atattttcga 3900 ccttacctac tgggaagaaa attcataata aaaacagatc acagaccact agtctacctg 3960 ttttctatga aaaacccaag ttctaaacta actcgaatga ggttagattt agaggagttc 4020 gaatttacag taatatacgt acaaggcaga ttgaatgtag gagcagatgc cttgtcgaga 4080 attcaaatcg actccgagac gctcaaacaa atgactattc tcagggttac aactagagca 4140 atggtcagta aaaatgagcc aaacacaagg gctgacgtag atgattcagt caacgatgag 4200 cttgatcacc tcaaagtaat tgaatcactg aacaataccg aagtttacga tatgccaaaa 4260 ctatgcatta ggcctaaaaa taattcaaac gcgtttcatt acaaaattgc tagcaaaaat 4320 tacaaaaagg acttgacact agagtcaagc atccaaataa acaattatac agatttaaag 4380 aacatgttca aaaatctaga aaaatctgca aaacatttga atattggaaa aatagctcta 4440 gcgctaccta gtacaatatt tgaaataatt agtgtgcaca attttaaaat acatggaaat 4500 caagtacttg ataaactgaa gataatatta taccaacaac aacaaattat agagaataaa 4560 gaactggtaa atcaaattat tgctgataat catgattctg catttggtgg acatgtaggc 4620 attaatagat tatatcgcaa acttaaggga ttatacaaat gggcaaatat gaaaaatact 4680 ataaaaaact atattaaaaa ttgtattaca tgtaagcaaa ataaacacac tactaaaacg 4740 aacgagaact ttacattaac accaactccg ttgaaagcat ttgattctat cgctatggac 4800 acaataggtc cattcccgaa atcaaactca gataaccgtt atgctttgac aattcaatgc 4860 gatttgtcca agtatataat agttaaacca ataaaggata agcaagcatc aacaatagcc 4920 aaagcattta ttgaaagttg catattagtt tatggtacac cttcgattat ccgtaccgat 4980 caaggaaccg aatataaaaa tgaaatattc aacaagatat cagaaatgct tcaatataca 5040 catacatttt ctacacctta ccaccctcaa actattggca atttagaaag gaaccatcgt 5100 tgcttgaacg aatatgttag gcaattcatt aacgaatctc acactgactg ggacgattgg 5160 ttaccttact attcattttg ttacaatacc acaccacact ctgacttacc atatagccct 5220 tttgaattag tttttggtaa aatagcatca atacctacaa atattataaa aacaaaacaa 5280 gttgaaccaa tatacaattt tgataattac tattacgagc taaaacacaa attacagtat 5340 acagcaatga aaacaagaga attagtagaa aaaatcaaaa ataatagaaa aaataaacaa 5400 cagcaagtag caaatccaat tcaaatcaaa attaacgatt tagtattaat tgaaaaccaa 5460 aacagaagca aattagataa ggtttacaaa ggtcctttta aagttgtagc aatagaacac 5520 ccaaatatta caatacttaa tgaaaataat gagctataca ctttacacaa aaacagaatt 5580 ataaagtttt agcatcacta gcattcacca ataaaaataa aacaaacata aataaaaatt 5640 gtaaaacact actgtaaaaa caaaaaagag aaatctaacg aaagcattta acattgtgaa 5700 tagctacgct attctcttaa aaggggaagg 5730 // ID BEL-5-LTR_HM repbase; DNA; INV; 893 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-893 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 439-439 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 893 BP; 340 A; 137 C; 143 G; 270 T; 3 other; tgtaaaatct racacggrtg agttttcgcg cttaagcgct cctccgtgcc gcatggcgta 60 atatttgtaa ttataatttt ttgtatagtt gtaaaagaga gttcgtgctg gatagacgcg 120 ttttatagtt gttttattga gttgttattt cgtcttatcc ctttcttaaa aacaaatata 180 gaagagtttt taaattggcg cccgatgtaa aaaagagtat tttaacgaaa tacaaagatt 240 ttgcaaaaag aagcaaagaa aattgtaaag atatacattt gattgacatt acataaacat 300 atggcagcct taaataggaa attatcgcga aggacaggtt tacaaagcga cataaaagaa 360 ataaatgaag aagttgaaaa taacttgtta aataatagtg acgataattt taatgcactt 420 agaggtttaa gaaattcatt gagcaacacg ttactagagc taagagaagt agaagaagaa 480 ataattaatt taaaggaacc ggatgacatt gctgaattcg tcattcagtc taagagattc 540 atacgtcatt caaatcaact tcttgcacaa atcgattaca aattaaactc tcaaaaactt 600 tcacctaatt ctcaatctgg ttattcacaa agcactacat ccaattgtaa attacctaaa 660 ctagaaatac aatcttttga cggaagccct cttaagtggt actcattttg ggaccaattc 720 aatgcaacaa ttaatcaaaa cgattccctg agtgaaatag ataagttttc atatttaaaa 780 aaatatttac atcatggtcc actagcagcm atttcaggat taactctatc aaatgaaaat 840 tataaagaag caattgatat tttaaaacaa agatatgcag acccccaagt aca 893 // ID Gypsy-27_AA-I repbase; DNA; INV; 4013 BP. XX AC supercont1.320; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_AA_; KW Gypsy-27_AA-LTR; Gypsy-27_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4013 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.320; Positions 430595 426583. XX CC Positions [2973-3437] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 300..4001 FT /product="Gypsy-27_AA-I_1p" FT /translation="MFDVCLWVEIRTRLGGTNIEDNNCANLDALIEECQRL FT INLKNDTAMIENPVAPAVQAIKGNVQHRKPSKKPFSEQRTHYHQQQQQSRK FT SGPASPCWNCGAMHYARDCSFVNHKCSSCQQLGHKEGYCDSAKRSAKTRKS FT HNKQFSTKTVVSVNNVQGKRCFVWVKLNDIPVRLQLDTASDITIVSKQTWE FT RIGKPAATPTVQSAKTASGEDLRLDFEFMCIVSLNDSVSTGHIYVSSNSLN FT LLGIDFIDKLSLWSLPMDKFCDLIGGDGPIDEGSLQKAYPKLFSSTLGLCT FT KTKVSLALKEACKPVFRPKRPVSYAMLPIVDKELDRLESLNIIAPVDFSEW FT AAPIVVVRKSNGSIRICGDYSTGLNDRLLPHQYPLPLPQDIFSKLTGFTIF FT SQIDLSDAFLQMEVDENSRKLLTVNTHRGLYQYNRLPPGVKAAPGAFQQLM FT DVMLTGLPFTSGYLDDVIIGGRTEAELQRNLHAVLQRIQEFGFTIRPEKCS FT FGQERLGYLGLLLDKDGLRPDPNEIKAIQDMPPPQDLTGVRSFLGAVNYYG FT KFVPNMRALRYPLDELLKVSSSSFKWTKECQKAFEDFKRMLSSDLLLAHYD FT PHQEIVVSADASSIGIGATISHKYPDGKMKVIQHASRALSSAEQKYSQPDR FT EGLAIVFAVTKFHKYIFGRHFRLQTDHAPLLRIFGSKKGIPVYTANRLQRW FT ALTLLSCDFSMEYVATDKFGNADVLSRLIDQHIKPDEDFVVACTSLEEDMQ FT SVANSAISQLPLSFRMVERATNSDPILRKVYRFVKHGWPKKRSDVTDQELL FT RYYDRQEALSIVEGSIMFSDRLVIPTVYRKRCLNQLHKGHPGAQRMKAVAR FT SFVYWPCLDEEIINYVRDCRSCASAARSLPKATPEPWPKPTVPWQRVHIDY FT AGPLKGEFYLVVVDAYSKWPEIFPTKYITAKATINLLRSLFANKGMPEVLV FT SDNGTQFTSAEFKEFCIENGVEHLTTAPFHPQSNGQAERFVDTFKRAVRKI FT QEGKYSIKTALDIFLLTYRTTPNPNTPDGRSPAESMYNRRLRTSLELLRAP FT RSQAPSEQLDTNDRHRSFIPNEAIYAKVYANNKWTWAPGTVVERVGRVMYN FT VWVNERKLIRSHINQLRSRGTTTHRNSKEHILPLDILLSECNLVKPATFSD FT PVQTTPSPESRLDLSTRVSSSESSSSSSEASSSSSATTSSAGFQSAVEASP FT AIKVPRRSSRNRRPPQRFNTYQRF" XX SQ Sequence 4013 BP; 1118 A; 993 C; 952 G; 950 T; 0 other; ttattccaac gaatcgatga cagtgcgaag gtacggctgt tactccgtcg gttaggaact 60 caggagcatt accgttatgt gagttacatt ctcccaaatg caccgaaaga tttttcattt 120 tcggagacgg tggagaagct gcatggcctt ttcggcacca aagagtcagc agtgaagaaa 180 cggtattcca cactcactat cgcaaaatca cccaccgaag attacgtatg ctatgcctgt 240 cgggtgaaca aaatgtgtgt ggagtttgag ttatcaaaaa tgtctgaggc ccagttcaaa 300 tgtttgatgt ttgtttgtgg gttgaaatca gaacgcgact cggaggtacg aacatcgagg 360 ataacaactg tgccaatctg gatgcgctca tcgaagaatg ccaacggttg atcaacctaa 420 agaacgatac agctatgatc gaaaatccgg tagcaccagc agttcaagcc atcaagggaa 480 acgtgcaaca ccgcaagcct tccaagaaac cattcagcga acaacgtacc cactatcacc 540 aacagcagca gcaatcacgg aagagtgggc cggcttcgcc ttgctggaat tgtggtgcga 600 tgcactacgc aagagactgc agcttcgtca accacaagtg ttcgtcctgt caacaactgg 660 gacataaaga aggatattgc gacagtgcta agagatcagc gaagactagg aaatcccaca 720 ataagcagtt ctccaccaaa accgtcgttt ccgtcaacaa cgttcaaggt aaacgttgtt 780 tcgtctgggt gaagttgaac gacattccgg tgcgactcca gttggacaca gcatcagaca 840 tcaccatagt gtcgaagcaa acatgggagc gaatcggtaa gccagcagct actccgactg 900 tgcagagcgc taagactgca tcaggggaag acctccgcct tgactttgag ttcatgtgca 960 tagtttcgct caacgattcg gtgagtaccg gtcacatcta cgtttcgagc aactcgttga 1020 atcttttagg gatcgatttc atcgacaaac tcagcctttg gtcattaccg atggataagt 1080 tttgtgatct catcggcggt gacggaccga tcgatgaagg ttcattgcag aaggcctatc 1140 ccaaactctt cagctcgact ctgggacttt gcacgaaaac gaaagttagt ttggctttga 1200 aggaggcgtg caaaccagtt tttcgaccaa agcggccggt gtcctatgcg atgctaccga 1260 ttgtagacaa ggagcttgat cggttggaaa gcctaaacat tatcgccccc gtggactttt 1320 cggagtgggc ggcgccaatt gttgtcgtca gaaagtcgaa tggaagcatc cgaatatgtg 1380 gcgattattc aacggggtta aatgatcgtc tattaccaca ccagtacccc ttgcccttgc 1440 ctcaggacat attttccaag ttgactggct tcacaatatt tagccaaatc gacttatcgg 1500 acgcctttct gcaaatggaa gtcgacgaaa acagccggaa attgctcacc gtcaataccc 1560 accggggctt gtaccaatac aacaggctcc ctcccggagt gaaagcagcc ccgggagctt 1620 ttcagcaatt gatggacgtt atgctgactg gactaccgtt tacttcgggc tatctggacg 1680 acgtcattat tggtggcagg actgaagcgg aacttcagcg aaatctgcac gcagttctcc 1740 agcgcatcca agagtttgga tttacaattc gtccagaaaa gtgttccttt ggacaagagc 1800 gactcggata tttgggcctt cttctcgata aagacggcct tcgccccgat ccgaacgaaa 1860 tcaaggcaat ccaagatatg cctccaccac aggacctcac tggtgtccga tcattcttag 1920 gcgccgttaa ttattacggg aaatttgtcc cgaatatgcg cgccctgaga tatccgctgg 1980 atgaacttct caaggtttct tcttcttctt tcaagtggac aaaagaatgt cagaaggctt 2040 ttgaagactt caaaaggatg ttatcgtctg accttctgtt ggcacactac gatccgcatc 2100 aagaaattgt ggtgtcagct gatgcgtcgt ccattggcat tggagctaca ataagccaca 2160 agtaccccga tggcaaaatg aaggttatcc aacatgcttc acgagcatta tcatccgccg 2220 agcagaagta ctctcaacca gatcgtgagg gattagcgat cgtgtttgca gtgaccaaat 2280 ttcacaagta catcttcgga agacattttc ggcttcaaac ggatcatgct ccactcttac 2340 ggatcttcgg atctaaaaag ggtataccag tgtatacagc aaaccgcctt caacgctggg 2400 cccttacttt gctttcctgc gacttttcta tggaatatgt cgcaacagat aagttcggaa 2460 atgcagatgt gctgtcacgt ctaattgacc agcatatcaa acccgatgag gactttgtgg 2520 tagcctgtac gagccttgaa gaagacatgc aatccgtagc aaacagcgca atcagtcagc 2580 taccccttag ctttcgaatg gtggagcgag ccacaaattc tgacccaatt ctcaggaagg 2640 tgtaccgatt cgtcaaacac ggatggccca agaaacggtc agacgtcaca gatcaggaat 2700 tattgcggta ctacgatcgt caagaagcac tatccatcgt cgaaggtagt attatgttca 2760 gcgatagatt agtcattccg actgtctacc gaaagcgatg cttgaaccag ctgcacaaag 2820 ggcacccagg agcccaacga atgaaggctg tcgcacgcag ttttgtttac tggccttgtc 2880 tcgacgaaga aatcatcaat tacgttcgtg actgtcggag ctgcgcctct gcagctagat 2940 ctctgccaaa agcaacacca gagccgtggc ccaaaccgac ggttccttgg cagagagtgc 3000 acatcgatta cgctggccca ctgaagggcg agttttactt ggtcgtcgtc gatgcataca 3060 gtaagtggcc ggagatcttt ccgaccaagt acatcactgc gaaagctact atcaacctgc 3120 tgcgaagttt atttgccaac aaggggatgc cagaagtcct tgtgtcagat aatggcacac 3180 agtttacgag cgccgagttc aaggagttct gcatcgagaa cggagtggaa cacttaacta 3240 cagcaccttt ccacccacaa tccaatggac aggccgaaag gtttgtagac acttttaagc 3300 gggcagtcag aaagatccag gaagggaaat attccattaa aacagcgttg gatatattcc 3360 tactcaccta cagaaccact cctaacccaa atacacctga tggacgatca cctgcagagt 3420 cgatgtacaa tcgtcggctt agaacatcgc tggaacttct acgcgcgcca cgcagtcagg 3480 caccgtcaga acagctggac acaaatgacc gacacaggtc gtttattccg aatgaagcca 3540 tttatgcaaa agtctacgcc aataacaagt ggacttgggc tccaggaacg gttgtggaaa 3600 gggtcggccg agtaatgtat aacgtttggg tcaatgagag aaagttgatc cgttcgcata 3660 taaaccagct acgaagccga ggtacaacta cccatcgaaa tagcaaggag catattttgc 3720 cgctggatat tttactgagt gaatgcaact tagttaaacc tgcaacgttt tccgatccag 3780 ttcagacaac accctcgccg gagtcccggc tggacctatc aaccagggtg tcgtcctcag 3840 aatcatcatc gtcatcgagt gaagcaagtt cgtcgtcatc agcaacgaca tcatcagctg 3900 gtttccaatc agcagtcgaa gcttcgccag cgattaaagt ccctcgtcga tcttctcgta 3960 acagaagacc gccacaaaga tttaacacct accagcgatt ttaaagaggg aga 4013 // ID BEL-233_AA-I repbase; DNA; INV; 6233 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-233_AA_; KW BEL-233_AA-LTR; BEL-233_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6233 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 921-921 (2011). XX DR [1] (Consensus) XX CC Positions [5288-5836] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 540..1904 FT /product="BEL-233_AA-I_3p" FT /translation="MKTMRLSYKKCMDGLDQEMQNMRRSYEEAQGKKPERP FT HPSGLTNEKQRDLPKSSVVNEAMFRRNNDPGSVNHLKALNDECSSDEEDDD FT DEEDYYEDEGNTECITGGSNTAPYGLGQQCSGPTKSQLAARSGVSKKLPVF FT TGKPEDWPLFYGSYVASNQACGFSDVENLVRLQESLKGPALESVRGQLILP FT KCVPKIIEKLRQLYGRPELILQSHLERIRKLDPPKPEKLASFVPFGNAIEQ FT LCEHLEAADLQQHMVNPILIQDLVDKLPANDKRGWVRYKKSKKQVTLRTFT FT DFVGAGAAPRAPPRGGPPGPAPGGRRAAPARPGPGRAGPPGPGGGGARPGR FT GGPAPRRGAGRAGAGAGAGPPRPPARAPRRGPPPRGPRGRRPRRGAPRPRR FT GAAAAGPGGRRRRAPPARRPAQELSRKHVRQTSAWNKIRDLGEVIAILTVV FT AEAEPWKMVY" FT CDS 3857..6232 FT /product="BEL-233_AA-I_2p" FT /translation="MGFFDPLGLLTPFTVHGKTLVQDLWRTGCTWDDEVND FT DCWMKWKRWIGLLPEVEAIRIPRCYFGDAPWSSVQSIELHIFTDASEYAYG FT CVGYLRTVIEGDVRCCLVMSRSKVAPLKRQSIPRLELMAAVLGARMLHTIL FT NTHSIKFQRHILWTDSQTVRSWILSDQSKFKQFVAFRIGEILELTRSTDWR FT WVPSKLNAADVLTKWGRGPPLQSDGSWFAGPTFLLDPEENWPTQALSIEET FT NEEVRAVVLHHEAIMVEKVINSERFSRWTRLLRSTATVLRFIDNCKKKKDG FT LPLFTSKATLNQMPMIKGERLTICRPLQQEELQRAENILWKLVQNQSFPDE FT VAILLQNRGINDGQRPMSIEKSSTLYKLTPFLDENSVIRMGGRMEASESLP FT FDMKYPIILSRFHDVTRKVILHFHERYGHANQETVVNELRQKFRIPNLRTT FT VGQIVRSCCKCKVMHSRPRVPMMGPLPVQRISPQLRPFNAVGVDYLGPVEV FT TVGRRIEKRWIALFTCLSVRAVHLEVAHSLSSQSCLMAIRRFICRRGCPEN FT FFSDNGTNFKGASKELLKRVEVDSAEAVSSSTTKWNFNPPGAPHMGGVWER FT MVRSVKEAMLALNDGRKLTDEILMTTLGEAEDMINTRPLTYKSLEPSEVEA FT ITPNHFLRGTVRDLDLNRNGPTELSEALRNMYKRSQYLADKMWERWYKEYL FT PTINQRTKWFDDPKALQIGDLVFVMDGKSRKNWTRGIIEEVFAGQDGRIRQ FT ANVRTAGGVFRRAAANLAVLEIQDGKSGFSGELVPELRAGE" XX SQ Sequence 6233 BP; 1750 A; 1428 C; 1703 G; 1336 T; 16 other; aactcaaaaa agtatgagtc agtcggatct aacctcaaat ctctccgata cggaagagct 60 ggatttcacg attacgcctt gttcggcctg taacaagaca tcgacaacga atgaaccaat 120 ggtcggttgt gacgcgtgta accgatggtt ccactatcga tgcgtaggag tgaccgaagc 180 cgttaagaag gagaaacgat ggttttgtcc tgagatcgta tgccaagaaa cggcgaagaa 240 agctaaaaaa actagcagca ggaaaaactc caatcgttcc aaaccagatc aagtgcaggc 300 ggccgtgcaa ggaactgccc aggaacaaaa attgaaggca atggaagagg aatacgcgag 360 acagatggaa gatctagaaa acgagaaaat cctccgggaa aagcaaatgg aactccaaca 420 agtaagccct gaaggagaaa aggctgcaga ttgaaaggga gttacgagaa aaggaattgg 480 cccaggaaaa gcttctactc gaccgagccc ttgaggaaaa aacggagcat ttcgagaaga 540 tgaagacgat gcgtctgtcg tataagaagt gtatggacgg tttggatcaa gaaatgcaga 600 acatgcgacg aagttacgag gaagctcaag ggaagaagcc cgaaaggccc cacccatccg 660 gccttaccaa cgaaaagcaa cgcgatttac ccaaatcaag cgtagtgaat gaagcgatgt 720 tcagaaggaa taacgatccg ggaagcgtaa atcacttgaa agcgctgaat gacgagtgta 780 gcagcgatga ggaagatgac gatgacgaag aagattacta tgaagacgaa ggtaatacag 840 aatgcatcac aggcggtagc aatacggctc cctacgggct ggggcagcag tgtagcggcc 900 ctactaagag tcaactagcc gcgcgaagtg gtgtttccaa gaagctgccg gtattcacag 960 ggaaaccgga agactggcca ctcttctatg gctcgtacgt agcgtcgaat caggcttgtg 1020 gtttcagtga cgtggagaac ctcgtccgac tccaagagag cttgaagggg cctgctctcg 1080 agagcgtaag agggcagttg attcttccta aatgcgtgcc gaaaatcata gagaaactgc 1140 gtcagctgta tggccgtccc gaattgattc ttcaaagtca tctggagcgt atacgaaaac 1200 tagatcctcc aaagccagaa aaattagcgt cgttcgttcc ctttggaaat gctattgagc 1260 agctctgtga acacctggag gctgcagatt tgcagcagca tatggtaaac ccaattctta 1320 ttcaagacct ggtggataag ctaccagcca acgataaacg tggttgggtt cggtataaga 1380 aatcgaaaaa gcaggtgacg ttgcggacat tcaccgattt cgtcggggcc ggggcggccc 1440 cccgggcgcc gccgcggggg ggcccgcccg ggccggcgcc ggggggccgg cgcgccgcgc 1500 cggcgcgccc ggggcccggc cgcgcggggc ccccggggcc gggggggggc ggggcgcgcc 1560 ccgggcgggg ggggccggcc ccccgccgcg gcgcgggccg cgcgggcgcg ggcgccgggg 1620 ccggcccccc gcgcccgccg gcgcgcgcgc cccggcgggg gccgcccccg cgcgggcccc 1680 gggggcgccg cccccggcgg ggggcccccc gcccgcggcg gggcgccgcc gccgcggggc 1740 cgggggggcg gcgccgccgg gcgccgccgg cgcggcgccc cgcccaagaa ttgtcgagga 1800 agcatgtgag gcaaacgtct gcttggaaca agatacgaga tttgggagaa gtaatcgcaa 1860 tattaacagt ggtggcagaa gcagagccat ggaaaatggt gtattgatga accatgaagc 1920 tacgaatagc actgcaggga ttccgtccga gagatctaga ctgaaggcat gtaaggcatg 1980 tcagcgcacc gaccaccgtt tgcgattttg cgaggatttt cgaaaaatga actacgctga 2040 tcgaatgcac atcgtgacaa gatggaaatt gtgcaacatt tgtcttaacg accacggaaa 2100 tgcgacatgt aaatttaaga tccattgtga cgtcggtggc tgccaacaac gacacaatcc 2160 gcttcttcac ccagcgggag gcgctttcgg tacgagtgtg catattcgaa ccaccgtatc 2220 agtgctcttc aggattatcc cwgttcggtt gtattgcggc aatagaacgg tcaccacatt 2280 agcctttctc gatgagggcg cttcggttac gatgatcgaa aacgagctgg tggacagttt 2340 aggcttggta ggagttccgg agaggctaac gattacctgg actgcggata tttcacgagt 2400 tgagaaaagt gcgaagcgag tcagcgtgtg gacatcggcc atcgacgacc acgagcgact 2460 tttattggat tatgtgtaca ccgtggagag tctacgcctt ccaatgcagt cgctggacgc 2520 taggacacta tcagagcagt ataagcactt acggaatttg cctataacct cataccgaga 2580 tgctcgacct ggtatgctma tcggactcaa caacctccac acgttcgcgc ccattgaagc 2640 aaaatgcggc gcaccgggtg aaccaattgc ggtacggtgc aagctakggt ggaccgtata 2700 tggaccaagg aaagagaccg catcgaattc gaacgcagtt gtaggatttc acgatggtgt 2760 cagtaatgaa gatctgcatg acctaataaa aacgcactac gcattagaag agtcagtggt 2820 tactgtaaaa cgggagtcgc gtgaagatga aagggctcta gatattctgc gacgaacaac 2880 taaacggatt ggaaatcgat wcgaaactgg cctgctgtgg aaagatgats aagtgacctt 2940 tcccaacagc tatccaatgg cagttagaag gctgatgcat ctggaacgaa agctggcaaa 3000 ggacccggaa ctatacgaaa atgttcgcag gcagattgcg gaataccaac aaaaaggcta 3060 tgcccactta gcatcggagg aggaaatatc cggaacagat ccgcgtaaag tttggtattt 3120 accactcaac gttgttttaa atcctcgcaa accgggaaaa atkcgcttag tttgggatgc 3180 tgccgcaagc gtagatggcg tatctttaaa ctcgcaattg atamcaggtc cwgacatgtt 3240 gactccgttg atmtctgttg tcagtagatt ccgtgaacat agaatcgcgt tcggagcwga 3300 cattcgcgaa atgtatcacc asttgcgaat taccgaagcc gacaaacaag cacagcgttt 3360 cttgtttcgg atgaataagg aggacccgat taacgtttac gtcatggacg tcgctamatt 3420 tgggtcgaca tgctccccat gctcggccca atacgtcaaa aaccamaacg caatggagta 3480 cgccactaca tatccagatg ccgcggcagc mattatcgac ggacattmtk tagacgacta 3540 tttcgacagt gtggaaacta tcgaggaggc aattcaacga gcaaaagaag ttagctttat 3600 ccacgctcaa ggaggatttg aacttcgaaa ttggatttca aacgcgccag aagttcttca 3660 tggtttggga gagataaagg ctgctcagcc agttcacttc ggtcgagcca aggaaagcga 3720 cagtgaaaga gttcttggaa tcatatggag tccagaccga gatagctttt cgttttcggc 3780 ggagcaccgg gaacatcttc aagtatattt aagccttcag aaaaaaccta caaaaaggat 3840 aattttgagc tgcgtcatgg gattttttga tccgcttgga ctccttacgc cgtttacagt 3900 ccacgggaaa acgcttgttc aagatctttg gcgaactggc tgcacctggg acgatgaagt 3960 gaacgacgac tgctggatga aatggaaacg ctggatcgga ctacttcccg aagtagaggc 4020 gataagaatt ccacgctgtt atttcggaga cgcgccatgg tcatccgttc aatcgattga 4080 actgcatatt tttaccgacg ccagtgaata cgcctatggc tgcgtcggct atctgcgaac 4140 ggtcatcgaa ggagatgttc gatgttgcct tgtaatgtcc cgctcaaaag tggcgccact 4200 caaacggcaa tcgattccgc gtttggaatt aatggcggcc gttttgggtg cacgaatgct 4260 gcatactatc ctgaacactc actcaatcaa atttcaacgc cacattctct ggacagattc 4320 ccaaactgtg cgcagttgga tcctctccga tcaaagcaaa tttaagcaat ttgtggcttt 4380 ccgcataggc gaaattttag aattgactag atctacagac tggcgttggg ttccgtccaa 4440 attgaatgca gcggatgtct tgacaaagtg gggaagaggc ccaccattgc aaagtgatgg 4500 ttcctggttt gccggtccta cttttttgct cgatcctgaa gaaaattggc ctacacaagc 4560 actatcaatt gaggaaacaa atgaggaagt acgtgccgtt gtactacatc atgaagcgat 4620 tatggtagag aaagtaatca acagtgaaag gttttcgagg tggacgaggc ttctaagaag 4680 tacggcaaca gtacttcggt ttatcgataa ctgtaaaaag aagaaggatg gtctgccatt 4740 attcacatcg aaagctaccc tgaatcaaat gcccatgatt aaaggagaga gactgacgat 4800 ttgtcgacct cttcaacagg aagagctaca acgagctgaa aacattctat ggaagttggt 4860 gcagaaccaa agttttccag atgaagtggc gatacttttg caaaaccgag gaataaacga 4920 cggccaacga ccgatgtcca tagaaaaatc cagtactctg tacaaattga cgccgtttct 4980 ggacgaaaat agcgttatcc gaatgggcgg tcggatggaa gcttcagaat ctcttccatt 5040 tgatatgaag tatccgatca tcctatctcg tttccatgac gtcacaagaa aggtaatact 5100 acatttccat gagagatatg gacacgcaaa tcaagaaact gtcgtaaacg aacttcggca 5160 gaaattccgt attccaaatt tacgtactac agttggccaa atcgtacgaa gttgctgcaa 5220 gtgcaaggtt atgcattcac gtcctcgcgt tccaatgatg ggaccattac cagttcaacg 5280 aatttcaccg caattacgtc ccttcaatgc cgtaggagtg gattatctag gtcctgtcga 5340 ggtaaccgtt ggccgtagaa tcgaaaaaag gtggatagct cttttcactt gtttgagcgt 5400 cagagctgtt cacctagaag tagcccattc actatcatcg caatcatgtc taatggcaat 5460 ccgaagattt atatgtaggc gtggatgtcc tgaaaatttc ttttcggata atggaaccaa 5520 tttcaaaggg gccagtaagg agttgctgaa acgtgtcgaa gttgattccg ctgaggcagt 5580 aagtagttcc acgacgaagt ggaactttaa tcctccagga gctccgcata tgggcggggt 5640 ctgggaaaga atggtgaggt cagtcaagga agcaatgttg gcactgaacg acggaagaaa 5700 actcaccgat gaaattctga tgacgacatt aggtgaagcc gaggatatga tcaatacgcg 5760 gcccttaact tacaaatcac ttgaaccttc tgaagttgaa gccattactc ccaatcactt 5820 tctgcggggg acagtacgcg acctagacct taaccggaat ggacccacgg agttgtctga 5880 agcattgaga aacatgtaca aacggtcgca atacttagca gataagatgt gggagcgatg 5940 gtacaaggag tatctgccca ccattaacca acgcaccaaa tggtttgacg atccgaaggc 6000 tcttcaaatc ggggaccttg tattcgtgat ggacggaaag agtaggaaga attggacaag 6060 aggcattatc gaagaagtgt ttgcaggaca ggatggacga ataaggcaag cgaacgttcg 6120 aactgctgga ggcgttttcc gacgagcagc agcaaactta gcggtgctgg aaatacagga 6180 tggtaaatcc gggttctcag gcgagctggt accggagtta cgggctgggg aat 6233 // ID Sola2-5_AAe repbase; DNA; INV; 4409 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola2-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4409 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1303-1303 (2011). XX DR [2] (Consensus) XX CC >97% identical to consensus. TIRs are ~480 bp long. XX FH Key Location/Qualifiers FT CDS join(1318..1758,1662..2801,2734..3525) FT /product="Sola2-5_AAe_1p" FT /note="transposase." FT /translation="MDQQDLSRIKISRCCAGINNPKCRRSQLRNLSPTMIE FT HIRAMGYSHQLDETMHICESCRLVIRLHRSPPKKKRRSGGGEVNQNPQPPT FT SGQASNEMLGKINLLCSTLNDNDCLHRTHINILIRHYCIFQKQSRPVHLCH FT PSHQRMNYNPYQYINSTLLYISEAVPSCSSLPSVPSADELLGPYNVEKFNS FT IIETLNLSPISSRNMRSLDYRIKKSVEITKAVRHDILGMDAMDVDKVEAFE FT EIINQLKEKFNSPGIDRNDQLKILSVLPKSWSINKIVEEFEAPIYMVRQVK FT QLVNEQGILCTTRARSGHGISETDKQVVIDFFDSDDISRPMPGANDFVSEY FT RNGSKEQVQKRLLMMSLKEAYMIFEERCQNVEIGFTLFTMLRPKHCRLLDS FT TGIHNVCVCTYHENVKLMLNSLGIVSQAEICEKLLCDVLVRTTDCFFRECE FT KCNSKENITGELISLLEESDKEDVVFQQWLTTDRCNLETIIKPVEEFVTYF FT VEKVDKLITHDFICKEQSSFLRNKKKLLKTRLMILFAKNSLLFYETKKNSL FT KQGEILVISDFSENYSFIIQNAAQGYHWNNSQATIHPFEIYHKKDNKLENV FT SFIIISEVLTHDTVAVQLFISKLINFVKQKINFSKVIFMSDGAAAHYKNKK FT KIASLCNFKKIYGLEAEWHFFATSHGKGPCDAIGGTLKRMAKRASLAKDYG FT NTIATPRELFDWAVKQTDTCITKLNFCYISNEQYVKMSEELMELFDKVKTV FT PGTQKYHCFMPISDTQIAAKRYTNSEDEPKIFNLFSKAQK" XX SQ Sequence 4409 BP; 1574 A; 710 C; 756 G; 1368 T; 1 other; gacggatttc acgccaactc gacaaaggga ttggcagcac cccttcagaa ttcaatgaaa 60 ctttctgggt gtgaagacta tgtgaaacta agatacttta catactttgt ttttttcaaa 120 atcgatctag actaacattt ggaaagggtc aaagtttttt ttttactttt tttataaacc 180 cgtataactc gaaaacggta agacctacaa aaaagtgttg tatgggggac tgtcgtgaaa 240 tttcctgacg ttttagagaa aaatattgaa aaaataaaaa cacattttct acactgaaaa 300 aaaaatattc aaaactttaa agtcgattta aaaaaaacgg ccatttcaga ttttgatcat 360 ccttaagcaa aaagtttcgt aattaaattt actaaaagtc gtccatacat tgcaaattgg 420 gcatatttta gggaaaaaag tttttctaac atcgaatttt ttaagtccca aagttttttt 480 tttacttttt ttataaaccc gtataactcg aaaacggtaa gacctacaaa aaagtgttgt 540 atggggggct gtcgtgaaat ttcctgacgt tttagaaaaa aatgttgaaa aaataaaaac 600 acattttcta cactgaaaaa aaatatgatt caaaacttaa gagtcgattt aaaaaaaatg 660 gccatttcag attttgatca tccttaagca aaaagtttcg taattaaatt tactaaaagt 720 cgtccataca ttgcaaattg ggcatatcta agggaaaaaa gtttttctaa caacgaattt 780 ttcaagtcca aaactgattt tttttttcaa agattttgtt ctaaaccgtc aaatatgatc 840 gtgttttcaa gtgataaatg agtttgaatc agaaataaat tgtatttttt attgagaata 900 gttaaaaaac ggaattatgc agtgattatg ttttttaaat gaaggcaatc tatggatttc 960 gcctctgcct atgagaaaat caactccgtc taacaatccg acggattttt atggttcgaa 1020 agattgcaaa cattgaaaag gaacaacctg atccataccc cattaaattt tcttctagaa 1080 aatactacta aacgtacctg cgttacaagc cagatgaata agaaataatg tttgtattgt 1140 tatctaccta aaatgtccgt taacgacctg agttactcaa caatcaccgc gtgagtgtat 1200 agaaagatag gtatttaact tataattgtc gttctaccta tgcatataac tatttaagct 1260 tgatgtcaat aaaactcttt tcattttatc taagtcattt gtgaaaagaa gttaaagatg 1320 gatcaacaag atttatcgcg cattaaaatt tcgagatgtt gcgccggtat aaacaatcct 1380 aagtgcaggc gtagccagtt gcgaaacttg tcaccaacaa tgatcgaaca catacgtgct 1440 atgggatata gtcatcagct tgatgaaact atgcatattt gtgagtcatg tcgcttagtg 1500 atcaggctcc acagaagtcc gcctaagaaa aaacgacgtt cgggtggagg cgaagtgaat 1560 caaaatccac aaccaccaac aagcggtcag gcctccaatg aaatgctagg taagataaat 1620 ttactttgca gcacattaaa cgacaatgat tgtttacata gaacccatat caatatatta 1680 attcgacatt attgtatatt tcagaagcag tcccgtcctg ttcatctttg ccatccgtcc 1740 catcagcgga tgaattacta ggcccctaca acgtggaaaa attcaacagt attattgaaa 1800 ctttgaattt atccccaatc tcgtcgagaa acatgaggag cttggactat cgaattaaaa 1860 agtcagtgga gatcaccaaa gctgtacgtc atgacatact aggcatggat gcaatggacg 1920 tcgacaaggt ggaggcattc gaggaaatca tcaatcagtt gaaagaaaaa ttcaactcac 1980 ctggtattga cagaaatgac cagttgaaaa tattatcggt tcttcctaag tcgtggtcga 2040 tcaacaaaat tgtcgaggag ttcgaggcac caatttacat ggtaagacaa gtgaaacaac 2100 ttgtgaatga acagggcatt ctatgtacca caagggctcg aagtgggcat ggaattagtg 2160 aaacagataa acaggttgtt attgatttct ttgatagtga tgatattagt aggcctatgc 2220 caggagccaa tgacttcgtt tctgagtacc gaaatggtag taaagaacaa gttcaaaagc 2280 gtctattgat gatgtctctt aaagaggcgt acatgatttt tgaagaaaga tgccaaaatg 2340 tagaaattgg ttttactttg tttactatgt tgaggcctaa acattgtagg cttctcgata 2400 gcacaggaat tcacaatgta tgtgtgtgca cataccacga aaatgttaaa ttaatgctta 2460 acagtttagg aattgtctct caggcagaaa tttgtgaaaa actgttatgc gatgttttag 2520 taagaacgac tgattgtttt tttcgagaat gtgaaaagtg taattccaaa gaaaatatta 2580 caggagaact aatctcgctt ttagaggaaa gcgacaaaga agatgtagtt tttcagcaat 2640 ggctcaccac cgatcgctgt aatttagaaa ctataataaa gcctgttgaa gaatttgtta 2700 cgtatttcgt ggagaaagtt gataagctaa taactcatga ttttatttgc aaagaacagt 2760 cttcttttct acgaaacaaa aaaaaactcc ttaaaacaag gtgaaatact ggtaattagc 2820 gacttttctg aaaattatag ttttatcata caaaacgctg ctcaaggata tcactggaat 2880 aattcccaag caacgataca cccattcgag atttaccata aaaaagataa taaattggaa 2940 aacgttagtt ttattataat atcagaagtg ttgacccatg atacagtagc agttcagcta 3000 tttatttcaa aattgataaa ttttgttaaa caaaaaataa atttctcaaa agttattttt 3060 atgtcagatg gagctgcagc tcattataaa aataaaaaaa aaattgccag cttatgtaat 3120 ttcaagaaaa tatatggatt ggaagcagaa tggcactttt ttgccacatc ccacggcaaa 3180 ggcccctgtg acgctattgg tggcactctc aagcgaatgg caaaaagagc aagtcttgca 3240 aaagactatg gaaacacaat cgcaactccg cgagaactat tcgactgggc agtgaaacaa 3300 actgatacat gtatcaccaa attaaatttc tgttatatat ctaatgaaca gtatgttaaa 3360 atgtcagagg aattgatgga attgtttgat aaggttaaaa ctgtccctgg tacccaaaaa 3420 tatcattgtt ttatgcctat tagtgataca caaattgcag ccaaacggta taccaattca 3480 gaggatgaac caaaaatatt caatttattc agtaaagccc aaaaataatg taaatattca 3540 tgtgcagata ctacgtttta ggatatatag caataataaa aataatgatg catgataaaa 3600 aaaaaccacg acttttttta taagcataat attgaccctt tccagacacc tgttaagatt 3660 ggctttgacc aaataagaaa taaaatatta ttgtgatatc tttccaacca ttaaaaaccc 3720 atcggattgt tagacggagt tcattttctc ataagcggtt tggtgcgaga tcaattgtat 3780 ttaattaaaa aaaaactctg tataattccg ttttatttct tttaactact cccaataaaa 3840 caaatataat ctattttgtg attaaaactc atttatcact tcaaaatatg agcatactag 3900 actatttaaa gcgaaataaa aaaaaatcaa aaaaaaaaan tttggactta aaaaattcgt 3960 tgttagaaaa acttttttcc ctaaaatatg cccaatttgc aatgtatgga cgacttttag 4020 taaatttaat tacgaaactt tttgcttaag gatgatcaaa atctgaaatg gccgtttttt 4080 ttaaatcgac tttaaagttt tgaatcattg ttttttcggt gtagaaaatg tttttttatt 4140 ttttcaatat ttttttctaa aacttcagga aatttcacga cagtccccca tacaacactt 4200 ttttgtaggt cttaccgttt tcgagttata cgggtttata aaaaaagtaa aaaaaaactt 4260 tgaccctttt caaatgttag tctagatcga ttttgaaaat acaaaatatg caaagtatct 4320 tagtttcaca tacttttgac atccagaaag tttcattgaa ttctgaaggg gtgctgccaa 4380 tcgcgtgtcg agttggcgtg aaatccgtc 4409 // ID Gypsy-9_AC-I repbase; DNA; INV; 4716 BP. XX AC AASC02062576; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_AC_; KW Gypsy-9_AC-LTR; Gypsy-9_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4716 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02062576; Positions 2022 6737. XX CC Positions [3162-3650] - Integrase core CC 'AAACG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 328..2988 FT /product="Gypsy-9_AC-I_1p" FT /translation="MTCIGDEALDALDGMRLSDEQREDLDQIIKSFDEYCI FT GEVNETFESYKFHKRNQKQHETIDSYVTELRQLAKRCNFVEEDRMIRDRLV FT IGIQNDNIREKLLEQKKLSLGEALDICRAQEKAKTQRQAMSISTESQVNKL FT GKPTKYRSSQNHPKSKPHIPAHKCGRCGCQHERGRCLALEAICHSCQKKGH FT FSSVCRTPKNTSKKAYRNRVNILEDEEEGYLGAVFGPGQQNPWQANVTVNG FT RSIDFKLDTGADETVIPKSCVPANTKICRSQKRLYGPSGEQIRIVGEFRAE FT LKLDAGNRAKQKIYVLENLPQPLLGKPAIEELDLLKRINVLKHESDFKKEY FT PELFTGLGKLKKEYKIEMRENIVPFSVATPRRLALPLHKKVEQELKRLEED FT EIIKKLDSPTEWCAPIVVVPKKEGKVRICVDLTKLNEGVKREYYPLPTTEQ FT LLAQVSGATVFTKLDCNSGFHQVPLSEESQELTTFITPFGRFAYRRMPFGI FT SSGPEIFHREMSHLLSGIPGVICNIDDILISGTSKQEHDARVREVLRKLED FT AGVTLNEKCQFSADKMKFLGHIISQNGIEIDPDKIEAITNLPQPENVSDIR FT RLLGMVNHVGKFIDNLSSMTAPLRELLKKENCPWIWNEQHQLAFQKIKRAL FT TETPVLAHYDPNFETKVSADASKSGIGGVLFQKCGEHWKPVLYASRSMSDA FT EQRYAQIEKEALGITWACERFSDFLIGLKTFQIETDHKPLITLLQRKGLDD FT LSPRIQRFRMRLMRFSYTVFHTAGKYLHAADMLSRAPLRETSEKDSLREDA FT VNSFVRAVIESFPASKTKLEEIKMEQDNDEVCQKVNSYCKSGCWPDYCKKD FT LRLKPFWTVRHELTVAHGLLLYNARMVIPTSMQK" FT CDS 3201..4190 FT /product="Gypsy-9_AC-I_2p" FT /translation="MDLFEWKGHDYLLIVDYFSRWIEIVLLRKTTSSTVVE FT HSKSIFAKYGIPEFVISDNGPQFASCEFANFAETYGFEHKTSSPKHPQGNG FT EAERAVQTVKNLLRKADDPYIAILNYRATPLKQGQSPAELLQGRQLRTKLP FT SLPSQLKPQGRKLKQFRSADKHIMKNQQKNYFDRRHGGKRLQNFERNETVW FT ITEPKLEKASILKPYQSGGGDVHERSYLVDTPRGMKRRNRFHLRRRSLGPC FT DEGKSTPVATSLISTPRDRLSQTDERHPPDNAVIGTKSGGNGNNKSSSLPV FT TAPRCASPPMQHTHSEQTQTHYTTRSGREIKPPQKLTL" XX SQ Sequence 4716 BP; 1591 A; 955 C; 1045 G; 1125 T; 0 other; tggtgtcaga agtgtgttat taccgttgat attgatcttt atagagtcta gatctagatt 60 tagagccata gtgtcagaga accatagatc tatgtctaga tcttgataga accttctaga 120 ctagagtcta gattttagat ctagattcga gtatatactg acaatgacag aacaaaaacg 180 attcacggca aatttgccac ccccaacaaa attggacttc tccaatacaa cagaaattca 240 ccaaaactgg ttgaaattca agaggcagtg gaataattat caaattgctt caagactaag 300 agaagagcca atggagttgc agtcttcatg acctgcatag gagacgaggc gctcgacgca 360 ctcgatggaa tgaggctctc agatgagcag agagaagatc tagatcagat aataaaatct 420 ttcgatgagt attgcattgg cgaagtcaat gagacatttg aatcgtacaa attccacaag 480 aggaaccaga aacagcacga aactattgat tcttatgtca cagagttgcg ccaactggca 540 aaacgctgca atttcgttga agaagatcga atgattagag accggcttgt cattggaata 600 caaaatgaca acatcagaga aaagcttcta gaacagaaga agctctcact gggtgaagct 660 ttggacattt gcagagcaca ggaaaaggct aaaactcagc gtcaggcaat gtctatatcc 720 actgaatcac aagtaaataa acttggaaaa cctaccaagt acagaagtag ccagaatcac 780 ccaaaatcga aaccccacat accggcacat aaatgtggac gatgtggatg tcagcacgag 840 aggggaagat gtctggcact agaggcaatt tgccattcgt gccagaagaa agggcacttt 900 tcaagcgtat gtaggactcc caaaaatact tccaagaaag catacaggaa cagagtgaac 960 attttggaag atgaagaaga ggggtacttg ggagcagtgt ttggaccagg acagcaaaac 1020 ccatggcaag caaatgtcac agtaaatgga aggagtatag acttcaagtt agatacgggt 1080 gctgacgaga cggttatacc taaaagttgt gtcccggcaa acaccaagat ctgcaggtcc 1140 caaaagagac tgtatggacc atcaggtgaa caaatcagga ttgttggaga attccgagca 1200 gaattgaaat tggatgcagg aaatcgtgcc aaacagaaga tctatgtgtt ggaaaatctt 1260 ccacaaccgc tacttgggaa gcctgctata gaagaactgg acctgttgaa aagaatcaat 1320 gtcttgaaac atgaatctga tttcaagaaa gaatatccag aattatttac aggccttgga 1380 aagctaaaga aagagtacaa aatcgaaatg agagaaaata tagttccatt ttcagtggca 1440 accccaagga gattggcctt acctcttcac aagaaagtgg aacaggagtt aaagagactc 1500 gaagaagatg aaatcatcaa gaaacttgat tcaccaaccg aatggtgtgc tcccatcgtg 1560 gtggtcccga agaaagaggg caaggttagg atctgtgtgg accttactaa attgaacgag 1620 ggagtgaaga gagaatacta ccctttgcct acaacagaac agcttttggc tcaagtgtca 1680 ggcgctacag ttttcacaaa gcttgactgc aacagtggat ttcatcaggt ccccctttct 1740 gaagagtctc aagaactcac gacattcatc accccctttg gacggttcgc atatagaaga 1800 atgcctttcg ggatctcatc cggtccagaa atttttcaca gagagatgtc acaccttctt 1860 tctggaatac caggagtaat atgcaatatc gatgacattc tcattagtgg aaccagcaaa 1920 caagagcatg atgccagagt tagagaggtg ttgcgaaagc tggaagatgc cggtgtcact 1980 ctcaacgaga aatgtcagtt ctcagcagat aagatgaaat ttttgggaca tatcatatca 2040 cagaatggaa ttgaaattga tccagataag atcgaagcca ttacaaatct gcctcaacca 2100 gagaacgtct cagacattag gcgtctatta ggcatggtaa atcatgttgg caaatttatt 2160 gacaacttgt caagcatgac agcgccactg agggagctcc taaagaaaga gaactgtccc 2220 tggatttgga atgaacagca tcaactggca tttcagaaaa tcaaaagagc gctgactgag 2280 actccagttc ttgcccacta cgacccaaac tttgagacaa aagtttctgc tgatgcaagc 2340 aagtctggca taggaggggt actgttccag aaatgtggcg agcattggaa accagtgtta 2400 tatgcgtcaa ggtctatgag cgatgcagaa caaaggtatg cccaaataga aaaagaagca 2460 ctagggatca catgggcttg tgagagattc tcagatttcc tgattggatt gaagacattt 2520 caaatagaaa cagatcacaa accactgata acactccttc agaggaaagg actggatgac 2580 ctctctccta gaatccagag gttccgaatg agactaatgc gattctccta cactgttttt 2640 cacacagcag gaaagtattt gcatgcagct gatatgttgt caagagctcc tctgagagaa 2700 acaagtgaaa aagatagctt acgggaagat gcagtgaact catttgtcag agcagtcata 2760 gaaagcttcc cagcttcgaa aacgaaactt gaagagatca aaatggaaca ggacaatgat 2820 gaggtttgcc agaaagttaa cagctactgc aaatctggat gctggcctga ttactgcaag 2880 aaagacctaa ggctgaagcc attctggact gtgagacatg aacttaccgt ggcccacgga 2940 ctactccttt acaatgcaag gatggtgatt ccgacatcga tgcaaaagta aatcctacga 3000 agaatccatg agggtcacca aggagttgtc aagcctcgtg catttgccaa gtcttcagtg 3060 tggtggccag gaatttcaca ccagatcgaa gaggaagtac ggaaatgctc tatctgcgaa 3120 aaacatagac ggctatcacc tgagcccctg aaaccaactg tcacgccaga ttacctttgg 3180 caaagggttg gtatggactt atggaccttt ttgagtggaa gggccatgat tatttgttga 3240 tcgtcgatta tttctcccga tggatagaga tagtcctact cagaaagaca acgtcgtcaa 3300 ctgtggttga acactcaaag tctatatttg ccaagtatgg aataccagag tttgtcataa 3360 gtgacaacgg acctcagttt gcttcatgtg agtttgcaaa ctttgcagaa acatacgggt 3420 ttgaacataa aacgagcagt ccaaaacatc cacaagggaa cggcgaggct gaaagagctg 3480 tacagacagt gaaaaatctt cttcgaaaag cagatgaccc ctacattgcc attctcaact 3540 atagagcaac gccactcaaa caaggacaaa gtccagcaga acttttgcaa ggaagacagt 3600 tgagaaccaa attacctagt ctgccatctc aactgaaacc tcagggccgg aagctgaagc 3660 agtttcgttc tgctgataaa catattatga agaaccagca gaagaactac ttcgatagaa 3720 ggcatggcgg taaaagactt cagaatttcg aacgaaacga aactgtctgg attacagaac 3780 ctaagcttga aaaagcatca atcctcaaac cctatcaatc aggaggagga gatgtacatg 3840 agaggtctta tctcgttgac actccaagag ggatgaaaag acgaaatcga ttccatctac 3900 gcagacgaag ccttggtccg tgtgatgaag ggaagtcaac acctgtggct acatcgctga 3960 tctcaactcc aagagacagg ctaagtcaaa cagatgaaag acaccctcca gacaatgctg 4020 ttatcgggac caagtcaggt ggtaatggca acaacaaatc atcgtcgttg cctgtgactg 4080 cacccaggtg tgcatcacca ccaatgcaac acacccatag tgagcaaacg caaacacatt 4140 acacaacacg aagtggccga gagattaaac ctccacaaaa attaacacta tagggtcgat 4200 gtcctacaga catacatgct tttgcgtgta acagaaattg gccttttgtt aagaagtgta 4260 gctatacagc actctggtca caacaattgc aacatacatt aaagattata aattttgcat 4320 acacataaaa tgcatcagaa tgcattcaca ggtgcattag acatgcatcc tatttcaaag 4380 aatgtttgag ttttgaaaca tacacattta ggtggaattt catactcgtc tactgaatga 4440 tactttaatt acaaaagccc tacacaagga catttgatca gtggacattg aatttgtgag 4500 gcactgataa actgtatcac aagcagacca agaagtctca agttcaagtt cttgctttta 4560 tataactaca gaaatacgat gtctgttttt ctgtttgatc taattctttg ttttattcct 4620 ttaatttctt aaagtataaa gactctgcgg agtctatcct ctctaaagaa tttaattgta 4680 atgagtgtct tctgttccaa actcaagaaa gagaga 4716 // ID hATm-36_HM repbase; DNA; INV; 2537 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-36_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2537 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1930-1930 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 491..2284 FT /product="hATm-36_HM_1p" FT /translation="MSFKSKKSKSAVEAAGEIWTQNLKSSTKRRPNIPQVK FT QDMSDKLCKMPSNRDIVGRFLGLFNELKGQHDRLTKISEELMNLWQKFNFP FT VLSKQQVSANVYKLITSFEKHRKRPNAVFEENLAHLFDITKPDGNWLCSED FT KQLYKVQIESDGRVGYTTMKVAPMSTIHPSKRAKSTSEPSTNQAVISVPLS FT CDSEDSEEGSSSGDSEAVPGESRRKRATRQSTKSAAKLVSSHSLSTRKTSR FT VLQSLAEDGVCVPTPSQSGVWRRTIRDAEQVKNRLMELISEEKFSLHFDGK FT KIDKTEYQVVCLKNDKRTLQLGILVCDSGSSASIYKPLQDLLDDYNAWNSI FT QMIVSDTTAVNTGRKNGVVAKLQKIFRDKGFDEPQYIGCQHHILDLVLRHV FT LDFFFPIQSQSPNINYKFIDEIMEHYDDLKNAYKGTAVVAEYENLGWRDDF FT KFLFQLCEAYKFYKNEEKWPRIKWHKLPSLHSARWNSRGIFALIAFFILPN FT RREQLRITCNFVATTWSRAWFSNQHYAEAAYEDLLSAISQLKCSKAVKCFS FT THWKKEPSVLDVPRTNIVAERAVKLMGDIRSSCRTNKYLNCKFINTNTQI* FT " XX SQ Sequence 2537 BP; 815 A; 464 C; 501 G; 757 T; 0 other; ggtatatgac aaaaaaactt ttttcaaaat tttaattttc ctgatgttgc atttgatgaa 60 cttttaccaa tagatcactg ttttgaggtt tatttcattg gatcacaaaa aaacctcccg 120 cacctagcac ctcaccactt ttaacttttt cgatttttca cttgattttg ttcccagttc 180 ttacttcctt atgcttgact atgcttgaat ataactacaa gttggagagc gtcaaacagt 240 ttaaaaagtt aattaacata gaatgaaaaa gttcgatagt tttaaattgt ttttatgtta 300 tgtaactcaa ttttgaataa caaattttaa cgactgtttt tcaaattaat gcgcgtttgt 360 tgttgtttgt tgttgagtat tttttaatgt tcgcgcgatt aaaatattga gtagaagtat 420 ttaaaaagtt ttttcattat ttgcagtatt aatttgcaag atctgctgtt tagtgcttct 480 ttttgaaaat atgtccttta aaagtaaaaa gtcaaaatca gcagtggaag cagctggtga 540 aatatggaca caaaatttga aatcaagcac gaagcgccgt cctaatatac cacaggtaaa 600 acaggatatg agcgacaaat tgtgtaaaat gccttccaac agagatatcg ttggaagatt 660 cttgggactc tttaatgaat tgaaaggaca acatgatcga ctaacaaaaa tatccgagga 720 attgatgaat ctttggcaga agtttaattt tcctgttctt tccaaacaac aagtgtccgc 780 taatgtttac aaacttatca catcatttga gaaacacaga aagcgaccaa atgcagtatt 840 tgaagaaaat cttgctcatc tgttcgatat aacaaaacca gatggaaatt ggctttgtag 900 tgaggacaaa caactgtaca aagtacaaat tgaatcagat ggcagggtgg gatatacaac 960 tatgaaagtg gctcctatgt caacaatcca tccatcaaaa agggccaaaa gcacatcaga 1020 accctcgaca aaccaagcgg ttatctccgt accactttcc tgcgatagtg aggattctga 1080 ggaaggaagc agcagtggtg actcggaggc agttccagga gaatctcgca gaaagcgtgc 1140 aacacgccag tcaactaagt ctgctgccaa actagtgtct agtcactctt tgtcaaccag 1200 aaaaacgtca cgggtgcttc aaagtctggc tgaggatgga gtttgtgtcc caacaccatc 1260 acagtctgga gtctggcgtc gaacgatcag agatgcagaa caagttaaga accgcctcat 1320 ggaactgatc tctgaagaga aattttcttt gcattttgat ggtaagaaaa ttgacaagac 1380 tgagtaccaa gttgtgtgct tgaaaaatga caaaagaaca ttacaacttg gcattttggt 1440 ttgtgacagt ggatcatctg caagcatata taaaccacta caggacctac ttgatgatta 1500 caatgcatgg aacagtattc agatgatcgt ctctgacaca acagctgtaa atactggtcg 1560 caagaatgga gttgtagcca agcttcagaa aatatttcga gacaagggat tcgatgaacc 1620 tcaatacatc ggctgccaac accacattct tgatcttgtt ctacgacatg tgttggattt 1680 cttttttcct atacagtcac agtcgcctaa cattaactac aaatttattg atgagattat 1740 ggagcattac gatgatttaa aaaatgccta caagggcaca gcagtggtag cagagtatga 1800 gaaccttggc tggagagatg actttaaatt tctttttcag ctttgtgaag catacaagtt 1860 ttacaaaaat gaggaaaaat ggcctcgtat caaatggcac aagctgccat cactgcacag 1920 tgccaggtgg aactcaagag gcatttttgc gcttattgca tttttcattc ttccaaaccg 1980 gcgagaacag ctgaggataa cttgcaattt cgttgcaact acatggtcgc gggcctggtt 2040 ttcaaaccag cactacgcag aggcggctta tgaagacctg ctgtcagcaa tatcccaact 2100 taagtgttcc aaagctgtga aatgcttctc tacccactgg aagaaggaac catcagtgct 2160 tgacgtgcct cgaaccaaca tagtagccga gagagctgta aagttgatgg gggacattcg 2220 ttcttcttgc agaacaaaca aatatcttaa ctgtaaattt attaatacta atacacaaat 2280 ctgaaagatt attaacatca ctacaaatgc tgtaggacag tgtattcatg tatttctcat 2340 gtgtattgta tataaagttg acgtattata attaatagcc tttgggctgc tatgcgttgt 2400 atggttaaat atgaaactgt cttaggtgcg ggaagttttt tcaaggtatg gcgaaatcaa 2460 tgtcatttct gtgatttata ggcaaaagtt catcatttag aaccccaggc taattgattt 2520 aaaatttatc atatacc 2537 // ID CR1-46_HM repbase; DNA; INV; 4556 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-46_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4556 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1874-1874 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 45..764 FT /product="CR1-46_HM_1p" FT /translation="MAPREISELFLFLNSNKNLVENYVSNVEEICSHPKDV FT IINEFSKLQEEDIFSLRSFLFTEILESFELEQFSSLNISLDADDPLKKNLR FT KRYKVSTCLDDIYLLSISLAEKQLHKEFTKLINNSKIVSEKITTNVTDINI FT LSNLKELNDRLKTLQSDNVKIREDLRILQGKYDELQESNSEIIKNQQQLSL FT NYXTNSSFXQKPIENSIVTNSINFRSLTQNHTQRKTNPSFTVEDKLKKN*" FT CDS join(823..1047,926..1435,1348..4464) FT /product="CR1-46_HM_2p" FT /translation="YHNINTHISTTVSKKIEHKSIDRDCLEERKNYYLKPG FT VSTSKKEPIFGNKIAKNRNSIAGSRIVREFIYICWRSLNRAYLHQRKNQSL FT GIKLLKTETLLQEAVLFENSYIFVGGVCNSISEALLFNYIKKEIGVEPIAV FT TLNRENKYNRSFKVTIKSHEKHIVLKPEVWDHNIIVKQFRKSRENLGTEMM FT LQNNNNIPSISQSVTKSQSEDEHQSDVVNQSDVEHQLYAARAMSLNQKSQW FT MNAQRRKSIGCGTSTIRSSCHVSKSEITMDERAINSNTCSFSLCAYNCQSL FT KSNSCYISTLINSFDIIFLSEHWLSNVERFILYDLSAPTHQLFFHSAEKKE FT YGRPFGGNAFLVRKNLLLSLHVIHEDEHILAIKGSYGNSVFIFIGVYLTSC FT RNTSESLEQYLSQLSVIASIINTYEDEGECIIAGDFQSFPENIYDTEVRNN FT QTRNNFSXHLSNFLETNELSFLDIISGTGPTYTYKHKTLNNSSYIDHIAVL FT KESSICFTNCKIIPPSHLNMSDHLPIATHINLKNNSLTPVINNILKKYNTI FT PKYAWKDDNFIEIYNRNLSQFFKDYTFIGEAIEKQTQTTCEKINNAATLAM FT EECFQKKEFCIYSKSWWTPEVKLCKDILSFHFKEWKKCNYNRSQESISFNK FT YKLARKNFRKTVKAAHNLKINKITKNIEYLRKTNPQKFWNNIRKIKSSPNT FT RIFTINKKQDRKEIVEEFRQNFNTILNTPTITNNNSEHRNIPQLIVEPNKI FT LLSTSDIESCIRKLKSNRSRDSCGITAEHLKYSSNNNLVTWLSKFYNXIFN FT NGAVPEYLSTSIIIPIVKSYKKSLNNSDNYRGISITPILTKLFEYIIMQVC FT PEISDSHSHQFGFKQNSSTLHAEFLLSETLKHYNNNNSPVYICSLDANKAF FT DSCNWDLLFEKLYYNKNLSLSVVRALQSLYRSGTANISYLGHKSNNFRLSQ FT GVRQGSILSPHLYNIYTESILSQIVSDCKVGTTVMGLYTGIIAYADDIILQ FT SPTLSGLQELINKVQTQCQNLYIKLNTEKTEFIISGKSLVTKNIIQINGFF FT LKLNNKLKHLGFLWENQGHGQNIATIQSANVNERIIQFQSISRALIKSGIR FT FCQPVTIVHLFKSLAIPNLSYGLEICDYNNTFLNKLDAVGRSALKFLFSIS FT KYSKNYLHSYFKIDDVSTILVQNKLNLFIRLLSNPTCFSLILTQISNVNKE FT NCFINEIMKICNRLELSFMQCLIKGEKIIIKRRNDDVEDNIAEILNQSFEF FT WNIREQRMIFKNLMEESIPRRI*" XX SQ Sequence 4556 BP; 1726 A; 792 C; 640 G; 1381 T; 17 other; gcactttaaa aaaaaaaaaa aagttttgaa aatgttacgc taaaatggcg ccgagagaaa 60 ttagtgagtt atttttgttt ctgaatagta ataagaatct tgtagaaaac tatgttagta 120 atgtggaaga aatttgctcc catcctaaag atgtaatcat taatgaattt tcaaaactcc 180 aagaggaaga tatatttagt cttagaagtt ttctgtttac ggagatccta gaaagctttg 240 aactagaaca attttcttcc ttaaacatta gtcttgatgc tgatgaccct ttgaagaaaa 300 atcttagaaa aagatacaag gtctctacat gtttggatga catttattta ttatctatct 360 ctcttgcgga aaaacagctt cacaaagagt ttactaaact aatcaataat tcaaaaattg 420 ttagtgaaaa aataactaca aatgtaacag acataaatat cttgtcgaat ttaaaagaac 480 ttaatgaccg cttaaagaca ctgcaatcag ataatgtaaa aattcgggaa gatcttcgca 540 tactgcaagg gaaatatgat gaactccaag aatcaaattc tgaaattata aaaaatcagc 600 aacagctgtc tctaaactac kccacgaact cgtcgtttmt ycaaaaacct atagaaaatt 660 ctattgttac aaattcaata aactttcgtt ccttaactca aaatcatacg caacgtaaaa 720 caaatccatc cttcaccgta gaggataaat taaaaaaaaa ttaaaaacag gtgaaactta 780 tgcacaaaaa gttaaaatac cacccaaaat ggcaaaccat aataccataa tatcaacacc 840 cacattagta caacagtctc caaaaaaatt gaacataaaa gtattgatcg agattgtttg 900 gaagaaagaa aaaattatta tttgaaaccg ggcgtatcta catcaaagaa agaaccaatc 960 tttgggaata aaattgctaa aaacagaaac tctattgcag gaagccgtat tgttcgagaa 1020 ttcatatata tttgttggag gagtttgtaa ttccatatca gaagctcttt tgtttaacta 1080 cataaagaaa gaaataggcg tcgagccgat tgctgtcaca ctcaaccggg aaaataaata 1140 taaccgctct ttcaaagtaa caataaaaag tcatgaaaaa cacatagtcy taaaaccgga 1200 agtttgggac cacaatatta ttgttaaaca gtttcgaaaa tctcgtgaaa acctaggaac 1260 tgaaatgatg cttcaaaata acaacaatat accgtcaata tcgcagtcag taactaaaag 1320 tcaatctgaa gatgaacatc aatctgacgt cgtaaatcaa tcggatgtgg aacatcaact 1380 atacgcagct cgtgccatgt ctctaaatca gaaatcacaa tggatgaacg cgcaataaat 1440 tcaaatactt gtagttttag tctctgtgcr tataactgtc agagtcttaa atcaaactca 1500 tgytatattt ctaccttaat taactctttt gatataattt ttctatccga gcactggctt 1560 tcaaatgtcg agcgatttat tttatatgat ctatcagctc cgacacatca attatttttt 1620 cactctgctg aaaagaaaga atacggaaga ccttttgggg gaaatgcttt tttagttcgt 1680 aaaaacttac tgctttctct tcacgtaata cacgaagatg aacacattct tgcaataaaa 1740 ggttcatatg gcaacagtgt ttttattttt attggagttt atcttacttc atgcagaaat 1800 acttctgaat cattagaaca atatctgtct caacttagtg taattgcctc catcattaac 1860 acatacgagg atgaaggaga gtgtataatt gccggcgatt ttcagtcatt tcctgaaaat 1920 atatatgaca cggaagttcg aaataaccaa acacgtaaca acttttctar gcatttatct 1980 aattttcttg aaacaaayga gctttctttt ttggatataa ttagtggtac tggtcctacc 2040 tatacctata agcataaaac gctaaataat tcatcttaca tcgatcacat agcagtctta 2100 aaagaatcat caatatgctt tacaaactgt aaaattattc ctccttcaca tttaaacatg 2160 agtgaccatt tacccatcgc tacacatatt aatcttaaaa ataatagttt aactccagtt 2220 ataaataata tacttaaaaa atacaataca atcccaaagt atgcatggaa ggatgacaac 2280 tttattgaaa tttataatcg gaacctctct cagttcttca aagattacac ttttattgga 2340 gaagcaatag aaaaacaaac tcaaacaaca tgcgaaaaaa taaataatgc tgccacttta 2400 gcaatggaag aatgttttca aaaaaaagaa ttttgcattt actctaaatc atggtggaca 2460 ccagaagtta aattatgcaa agacattctc tctttccact ttaaagaatg gaaaaaatgt 2520 aactataaca gatcgcagga atccatttca tttaataaat acaagctagc tcgtaaaaat 2580 tttcgtaaaa ctgtaaaagc agctcataac ctaaaaatca ataaaattac caaaaayatt 2640 gaatatcttc gaaaaactaa tcctcaaaag ttttggaaca atatccgaaa aattaaatct 2700 agtcctaaca ctcgtatttt tacgataaat aagaaacaag atagaaagga aattgtagaa 2760 gaatttagac aaaactttaa taccattctc aacactccca caattactaa caacaactcc 2820 gaacaccgya atattcccca acttatagtt gaacctaata aaatactctt atcaacttcc 2880 gacattgaat cttgtattcg taagctwaag tctaatagat cacgtgactc ttgtggaatt 2940 actgcagaac atcttaaata ctcttctaac aacaatctcg ttacatggct ttcaaagttt 3000 tataacarta tctttaataa tggagctgtg ccggagtact tatcaacttc cataattatt 3060 ccaatagtaa aatcctataa aaaatctctt aataattcag ataactatcg cggtattagt 3120 attacaccta tcttgacaaa actctttgaa tatattatta tgcaagtatg tccggaaata 3180 tcagacagcc actctcatca atttggtttc aagcaaaata gctccaccct gcatgcagag 3240 ttcctattga gtgagacatt aaaacattat aataataata actcgccagt atatatctgt 3300 agtttagatg caaataaggc tttcgatagc tgtaactggg atttgytgtt tgagaaactt 3360 tattacaaca aaaacttatc gctatcagta gttcgcgcac tacaatcatt ataccgctcg 3420 ggtacagcta atatatcata ccttggacat aaatcaaata atttccgctt gtctcaaggt 3480 gtgcgacaag gatccatttt atcgccacat ttatacaaca tttacactga aagtatatta 3540 agtcaaattg tttccgattg taaagttgga acaacagtaa tgggtttata cactggtatt 3600 attgcttatg cagatgatat aatcctgcaa agtcctactc tttctgggct acaagagttg 3660 attaataaag ttcaaacgca atgtcaaaac ctctatatca aattaaacac agaaaaaacc 3720 gagttcatta tctccggtaa aagtttagtt acaaaaaaca taatacaaat taacggtttt 3780 ttcctgaaac tcaataataa actgaaacac ctcggattct tatgggaaaa tcaaggccat 3840 ggacaaaata tagctacaat ccaatcagca aatgtaaacg aaagaattat tcaatttcaa 3900 tcaatctcca gagctcttat aaaatctgga attcgttttt gtcaaccagt aactatagtg 3960 catttgttca aatctctggc tataccaaat ctatcatacg gattagaaat ttgtgattat 4020 aacaatactt ttcttaataa attagacgct gttggaagat cagcattaaa atttctcttt 4080 agtatttcaa aatacagcaa aaactatctg cattcctact ttaaaattga tgacgtatct 4140 accatcctcg ttcaaaacaa attgaatctg tttatcagat tactgagcaa tccracttgt 4200 ttttccttga ttttaactca aatatctaat gttaataagg aaaactgctt tatyaacgaa 4260 ataatgaaaa tttgcaaccg cctggarcta agcttcatgc agtgtttgat taagggggaa 4320 aaaatcatta tcaaacgtcg taacgatgac gtagaagata atatagcaga aattttaaac 4380 cagtcctttg aattctggaa catcagagag cagagaatga tattcaaaaa tttaatggaa 4440 gaaagcattc caagaagaat ytaaattaga attttaactt agagtataaa ctattagttt 4500 tatgacctct gtatttttta cctgtaattt tgggtgatga aaagaataaa taaata 4556 // ID TTAA2B_AP repbase; DNA; INV; 426 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA2B_AP. XX NM TTAA2B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-426 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1779-1779 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 426 BP; 133 A; 79 C; 83 G; 125 T; 6 other; gaggacgcta cacccgcgtg tgttgtctcc gtcttacaca cgcacgacat agcaaattgt 60 cgttcactag tctcaatagt gngctgttag ttttgatact agagtgaatc gacctattat 120 caaactttta ggtaagaaca ttatctgggt cttagcgtng gcgatttttc gatattttna 180 ttttcgagcg agatangggt atgtgaaata tcaaaaatta aaaatgctca tatctcgctt 240 gaaaattaaa atatcgaaaa aatcgccaac gctaaaacac agataatgtt cttacctaaa 300 agtttgataa taggtcgntt cactctagta tcaaaactaa cagcacacta ttgagactag 360 tgaacganaa tttgctatgt cgtgcttgtg taagacggag acaacacata cgggtgtagc 420 gtcctc 426 // ID Gypsy-8_CQ-LTR repbase; DNA; INV; 117 BP. XX AC AAWU01000706; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_CQ_; KW Gypsy-8_CQ-I; Gypsy-8_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-117 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 396-396 (2011). XX DR GenBank; AAWU01000706; Positions 23953 23837. XX SQ Sequence 117 BP; 37 A; 23 C; 23 G; 34 T; 0 other; tgctgagtat gccatgaatt gctaaaattt gggtataaaa ggcggctgtc ggcgattaat 60 aaatcattct agtttggacc accaacctga acagttgtac ttattccaaa cctaaca 117 // ID Gypsy10-I_AP repbase; DNA; INV; 4772 BP. XX AC Contig4460; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10AP; KW Gypsy10-I_AP; Gypsy10-LTR_AP. XX NM Gypsy10-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4772 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 455-455 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [2095-2523] - Reverse transcriptase CC Positions [3702-4181] - Integrase core CC 'AGAT' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 316..3468 FT /product="Gypsy10-I_AP_1p" FT /translation="MTEGNKYSDSDTTTQHRRYLADIEMVVPEFDPVHGTI FT TIEKWIDKVEEYAILYEWDDVAIQHFALTKLTGVARLWRDSLPREDRTWLQ FT WVPLLKANFPSTNEDVLRIKLDAQNFTRKTGQNMIEYFYQKLSKCNRAEMS FT DNEIIQWIVRGIGNDRYRDYLGPLSNYKKPADLLPHLMSANDYIERDKEKN FT TQKLNENSAVKQTRTTGTSDTKKTSLICFRCRVTGHTSKECTKKPTVTCFR FT CSKPGHKSYECRSKPNVNDKTSSSVPQVPSSTNESGKQRSVFQLSNGTTSQ FT NKYFKTAIINGERVRAYIDMGAACVAMRKSEADRLKINYDDTTREEFVGYG FT FGRVSSLGRFQTTLIIDKVSAYVTINVVPDDVQEISLLVGHPFTEQPHVMI FT VSAPNELRVEEVASVEAADQVDKTSVWAKETLVIPKNHVGHITVVTALPDQ FT DLCVEGGMRATRQMVPRCLISTDGEGHAVVPMINLSNGDVTVKKGDTVTRG FT VLFNEVVSKSEKQNREVNQEPVSTEEVVSDLAADQVDEVLTLLNEYKDLVA FT RNLRQVGCTHLTEMKLVLKDNRPVVYRPYRISYKEREQVRGIIDELKEADI FT VEDSESPYASPILLVKKKTGDVRMCVDYRELNKKTVPDKYPLPRIDDQLDR FT LHGNTYFTSLDLFSGYYQVPINDVNTRAQTSFVTPDGHFQFKRMPFGPTNC FT PAIFSRMIAVALGKLLYTVALAYLDDIIIPSKSVEDGLEKLRLVLQSLRDA FT GLTIKLEKCRFFMQKLEYLGFEISKNGIEPGRRKIMAVENFPVPNNVRAVR FT GFIGLASYFRRFVKGFAVIARPLTDLLCKNTRFEWTDERDKAFKMLKKALT FT SRPILSMYEPNAYTEVHTDASQHGVAAVLLQRQAEDGKMHPISYFSRKTTK FT DEAKYHSYELEALAIVSALERFRVYLIGICFVIKTDCNSLKLLADKRDLNP FT RIGRWFMKLSEYQYKIEHLKGDRNLIADALSRSPVEPEQSVEVASLNVFGI FT RITTDWVAALQKDDEEVAGMIAKVEANDPTTKDKYVIENTIDYTE" FT CDS 3606..4550 FT /product="Gypsy10-I_AP_2p" FT /translation="MREFVTKYINRCVNCMFYKIPKKGELYWHPLDKGSEP FT FQTIHLDHVGPFVMTERDNKYILTIVDGFSKYVVLRAVKDVTAIETVYYMR FT EFICTYGRPERIITDRGTAFTAAIFEKFCHELNINHVKISSKSPRSNGQAE FT IINGIAVKCLAMITENPDNTDWDLKSMEVQWGINNSKHRVTGYTPSDVIYR FT FKIASRIENPLVSEVSKINKDKCITEKEIDPTEVLRKNREKELRKITQRKT FT RPEKFHKGDLVLIKWEAPASGQSRKLEPKYKGPYQIARELRYDRYVVTDVE FT GEQLGQRPFSGVIGFDRLKRVSK" XX SQ Sequence 4772 BP; 1621 A; 791 C; 1137 G; 1223 T; 0 other; gttcaggtgt ggggttaata cggaaattgt gcggtaaatc ggtcgtgtgt gtgcgctaga 60 ctcaattttt tttttgcata catatgcata aaaagcgtta aggttacgcg aacgtgaata 120 ttattcgagt aatatttatt ttgtaacagt cgtaatagtt tgtactgcgt gtacgaaggc 180 caggctaaat taggctgggg ctgtccgacg acgtgttgga tatctgttcg ttatcggtga 240 gtggctcagc ggagcgagtt gttcgggcta tatatacttg attacctgtg gagttgaact 300 tatcgcgtgc gcgtgatgac tgagggtaat aaatatagtg atagcgacac gacaacacaa 360 caccggcgat atttggcaga tatagaaatg gtagtgccgg aatttgatcc tgtacacggc 420 acaattacaa tcgaaaaatg gatcgataaa gtggaagaat atgctatctt atacgagtgg 480 gacgatgttg cgatacaaca tttcgcatta acaaaactca cgggagtagc ccgactgtgg 540 agagacagtt taccgagaga agatcgaacg tggttacaat gggtaccact gttaaaagca 600 aattttccgt cgacaaatga agacgtctta agaataaaat tagatgctca aaattttact 660 agaaaaacag gtcaaaatat gatagaatat ttttaccaaa aattgtcaaa gtgtaatcga 720 gcggaaatga gtgataatga aataatacaa tggatagttc gcggaatcgg aaacgatcga 780 taccgagatt atctaggtcc actatctaat tacaagaaac cggcagattt attgccacat 840 ttaatgtcag caaatgatta cattgaacgc gataaagaaa aaaatacgca aaaattgaac 900 gaaaattcgg ctgtaaaaca aacgcgtacg acaggtacga gtgacacaaa aaaaacgtcg 960 ttaatttgtt ttcggtgtcg tgttaccggt cacacgtcga aggagtgtac aaaaaagccg 1020 acggtgactt gttttcgatg ttccaaacca gggcataaat cgtatgaatg tcgcagtaaa 1080 ccgaacgtaa atgacaagac gtcaagttct gtaccacaag ttccttcgtc gactaatgaa 1140 agcgggaaac agcggtcagt gttccaattg agtaacggaa caacgagtca aaataaatat 1200 tttaaaactg ccataataaa tggcgaacga gtacgagcat atattgacat gggcgcggca 1260 tgtgtcgcga tgcggaaatc ggaagctgat agattaaaaa ttaattacga cgacactacg 1320 cgagaagaat ttgtcggtta tggttttggc cgggtaagct cgttaggccg ttttcagact 1380 acattaataa tcgacaaagt gagtgcatat gtcacaataa atgtagtgcc ggatgatgtg 1440 caggaaatct cactcttagt tggtcatcct tttacggagc aaccgcacgt catgatagta 1500 agtgcgccga acgaactacg cgtggaagaa gtcgcatccg tagaggcggc ggaccaagtt 1560 gataagacgt cagtatgggc taaagagaca ctagtgattc ccaaaaatca tgtcggacat 1620 atcactgttg tcacggcgct tccagatcaa gatttatgtg tcgaaggagg gatgcgcgct 1680 accagacaga tggtgccgcg ttgtttgatt tcaactgacg gagaggggca cgcagtggtg 1740 ccaatgatca acttatctaa cggagatgtt acggttaaaa aaggcgatac agtaacgaga 1800 ggggtgctat tcaatgaagt ggtttcgaaa agtgaaaaac aaaatcgtga agtaaaccaa 1860 gagcctgtat cgaccgaaga agtcgtaagt gatctcgcgg ccgatcaagt cgacgaagta 1920 ctgacattat taaatgagta taaagacttg gtagcccgaa atctgcgaca agtagggtgt 1980 actcatctaa ccgaaatgaa gttagtactt aaagacaacc gaccagtagt gtataggccg 2040 tatcggattt cctataaaga acgagaacaa gtgcgcggta tcatcgacga actgaaagaa 2100 gcagacatag ttgaggacag tgaatcgccg tacgccagtc ccatattact ggtgaagaag 2160 aagactggtg acgtccgcat gtgcgtggat tatcgtgaat tgaataagaa gacggtgccg 2220 gacaaatatc cgttaccaag gattgacgac caactggata ggctgcacgg taacacatat 2280 tttactagtt tagatctttt tagtggttat tatcaagtac caattaatga tgtaaatacg 2340 cgtgcacaga cttcttttgt aacgcccgac ggtcatttcc agtttaaacg tatgcctttt 2400 ggtccaacaa attgtcctgc aattttttcg cgaatgatag ctgtcgcctt aggcaaatta 2460 ttgtatacgg tcgccctagc ttatttagac gatattatca ttccgagtaa aagtgtggag 2520 gacggtcttg aaaaattgag gttggtatta cagtctttga gagatgctgg tttaacgata 2580 aaattagaaa aatgtagatt ttttatgcaa aaactcgaat atttgggttt cgaaataagt 2640 aaaaatggta tagaaccggg gcgacgaaaa ataatggcag tagaaaattt tcccgttcct 2700 aataacgtac gagcggtacg cgggtttata gggctagcga gttattttcg tagatttgtg 2760 aaaggatttg ccgtgattgc tcgaccgctg acagacctgt tgtgtaaaaa tacgcgtttc 2820 gagtggacgg acgagcgaga taaagcgttt aaaatgttaa agaaagcact tacgagtcga 2880 cctattctat cgatgtatga accgaatgca tatacggaag tgcacaccga cgcgagtcaa 2940 catggcgttg cagccgtatt attgcaacga caagcggaag acggtaaaat gcatcctata 3000 agttatttta gtcgaaaaac aacgaaggac gaagcgaagt accattcgta cgaattagaa 3060 gcgctggcga tagtgagtgc gttagaacga tttcgcgtgt acttaatagg tatctgtttc 3120 gtaattaaga ctgattgtaa cagtttaaaa cttttagcag ataaacgaga tctaaatcct 3180 aggataggta ggtggtttat gaagttatcc gagtatcagt ataagataga acatttgaaa 3240 ggtgaccgaa atttaattgc tgacgctcta agtaggagcc ctgtagagcc agaacaatcg 3300 gtggaggtgg cgagtttaaa cgttttcgga ataagaataa caaccgactg ggtggccgcg 3360 ttgcagaaag atgacgaaga ggtggcagga atgattgcga aggtggaagc gaatgaccct 3420 accacgaagg ataaatacgt gatagaaaat actatagatt atacagaata acagaaggga 3480 ggtggaggtt atatttaccg gtcgacctca ggtatgatgt tgttagtact actcatcggg 3540 aattagtaca cctcggcata gacaagacac tatctaagtt aaaagaacat ttttattttc 3600 ccaaaatgcg agaattcgta acgaagtaca taaatagatg cgtgaactgt atgttttaca 3660 aaataccaaa aaaaggagaa ttgtactggc atccattaga taaaggcagt gaaccgtttc 3720 aaacaataca tctcgaccat gtaggacctt tcgttatgac tgagagagat aataaatata 3780 tccttacgat agtagacggg tttagtaagt atgtcgtact acgcgccgta aaagacgtaa 3840 cggctatcga aactgtatat tatatgcgcg agtttatatg cacatatggc aggcccgaac 3900 gaataattac agataggggt accgcgttta cggcggcgat attcgaaaag ttctgtcatg 3960 agttaaatat aaatcacgtt aagatatcgt ctaagagtcc gcgtagtaat ggccaagccg 4020 aaataattaa cgggatagct gtaaagtgtt tagcaatgat aaccgaaaat cccgataaca 4080 ccgattggga cctaaagtcg atggaagtgc agtggggaat taacaatagt aaacacagag 4140 tgacaggtta tacgccgtcc gacgtgatat atcggtttaa aattgccagt cgtatcgaaa 4200 acccgttagt gagtgaagtg tccaaaataa ataaagataa atgtataaca gaaaaagaaa 4260 ttgacccgac cgaggttcta cggaaaaatc gagaaaagga attacgtaaa ataacacaac 4320 gaaagacaag acccgaaaag tttcataagg gagatttggt attaattaag tgggaggcac 4380 cggcctcagg acagagtaga aagttagagc ctaaatataa aggaccgtat caaatagctc 4440 gggaactgag atacgaccga tacgtcgtaa ctgatgtaga aggggaacag ttaggacaaa 4500 gaccatttag tggggtaata ggatttgacc gcctaaagcg agtaagtaaa taaaaaaaaa 4560 aaatgtgtaa tttaatgttt agtcgtattt ttttaaaaca aaaatgtttt tatgaaatat 4620 taattaaaaa aaaaaaaaaa aatgtaattt aatgtttagt cgtattttca aaacaaaaat 4680 gttattatta attattattt aaaaaaaaaa aaaaaactgt attattattc atatatttga 4740 cgggacgcca aaaatttgtc aggaaggccg ac 4772 // ID hATm-24_HM repbase; DNA; INV; 3377 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 09-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-24_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3377 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1918-1918 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 524..2776 FT /product="hATm-24_HM_1p" FT /translation="MAGVSGCKKHKTRLRTNCPIFGNPSYLPETVLPTNAD FT VMKYYLFLKDEMTYRSAEIPSVAEVSSCVSSKVEALWHKASIPIITHKRIV FT AKIKELHDKHRNMLKPLKRRQNDPNYQAKMQKFCTEANSQLFDIAVCKCLD FT INHCNCEKSFKVPVAERPFLEDQRSTRNMMIGSIDKVVTNRLKERDNRKRK FT LSNIYKTPTTATTSNSLLSSNIVEFSSSSDTETDNDTDVHFPSIHKVPTTG FT TAEKKQMRANIPNISKECDRHSISDRAAAAIASAVLQDVGIVNDCDSTMVI FT DRSKIRRERQKVRNELQSHLASNLTGLYFDGRKDKTKMQLKKGSKYYYKIV FT TEEHVTLTQEPHSIYVGHISPENGTSVKIAYGIVDFCDSNGISLDNLVAVG FT CDGTNVNTGALGGVIRLLEMKLDRPLHWFICMLHANELPFRHLLCKLDGTT FT SGPKAFSGPIGKLLPICETFPVKHFSTIEMNSFPDVNFKDLSTDQMYLYKM FT CHAVNNGECPINLANKTPGPVVHSRWLTTASRVLRLYVSMEDPTENLVTLA FT TFVVKVYAPTWFYIKTKSSCTQGALHLWRMINFSRYLKNDLRCIVDEVIQR FT NAYFSHYENILLAMLTDEREHIRKLAFRRILGAREKNSQSLRRFRVPKLNF FT EADDYTNLIDWSVVQKSEPPATKNISTHEIKSFVNGVSLTEPVLNIPEFPC FT HTQSTERHVKLVTEASATVCGFDRRDGMIRATLESRSKTKTFNTKSQFQL* FT " XX SQ Sequence 3377 BP; 1141 A; 603 C; 627 G; 1006 T; 0 other; gggtggagtg aaaaaacctt tttttggaaa ttcatttgag ctgggtatgg aaaagttgcc 60 tttgggtaca aaatgaactg tggtaaaagc attttgctct agattaatat ttaggtctgc 120 cccaaagcac ttgaaaattt gaatatttcg ttaatttcgg catttttggc ttaaaaaata 180 ataatttcct atcattgcaa attcaagttg gctgggtatg caaaagttgc catttagtac 240 aaggtaaact gtggtaaaag cattttgccc taggttaatt cttaggtctg ccccaaagca 300 ctcgaaaatt ggcgtttttc gtgcattttg tggttttcgg agaaaaagtt aataatttcc 360 tataattaca aatttcagta tttacattat tttgtttgaa ttattatgtt agaaattaat 420 tattttattt ttatattatt attttagatt agaaaattat tttagattag aaaattcaaa 480 caacctcgca tctgtgagtg acgtccgaac aaaaataaca agaatggctg gtgttagtgg 540 gtgtaagaag cacaagacac ggctgcgtac caattgccca atttttggaa atccttcata 600 tttaccagag accgttcttc ccacaaatgc tgatgtgatg aaatattact tgttccttaa 660 agacgaaatg acatatagat ctgcagagat cccttcagtt gctgaagttt caagttgtgt 720 ttcatcgaaa gtcgaagcat tgtggcacaa agcatcgatt ccaattataa cacacaagcg 780 gattgtagcg aagataaaag aacttcatga caaacatcga aatatgctca agcccttaaa 840 gaggcgacag aatgatccta actatcaagc aaagatgcaa aagttttgta cagaagctaa 900 ttcccagttg tttgacatcg cagtctgcaa atgcttggac ataaatcact gcaattgtga 960 gaaaagtttt aaggtaccag ttgcagaaag accatttttg gaagatcaaa gaagtacaag 1020 aaatatgatg attggaagta tagataaagt tgttacaaat cgattgaaag aacgcgataa 1080 tcgcaaaaga aaactttcca atatttacaa aactccaaca acagccacta catctaactc 1140 cctattgtct tcaaatattg ttgaattcag ttcgtcttct gatactgaaa ctgacaacga 1200 tactgatgtc cattttccct caatacataa ggttccaact actggaacag cagagaaaaa 1260 acagatgaga gcaaacattc caaatatctc aaaggaatgt gatcgccata gcatttcaga 1320 tcgtgctgct gcagccattg ctagtgccgt gttgcaagac gttggtatcg tgaatgactg 1380 tgacagtaca atggttattg atagaagtaa aattcgaaga gaaagacaga aggttagaaa 1440 tgaacttcaa tcccaccttg cctctaatct aacaggtttg tacttcgatg gtaggaagga 1500 caaaacaaaa atgcaactca agaaaggttc caagtattat tacaaaatag ttacagaaga 1560 acatgttaca ctaactcaag agccccattc aatatatgtc ggtcacataa gccctgaaaa 1620 tggaacatca gttaagatag catatggcat agtagatttc tgtgattcga atggaatttc 1680 actagacaat ttagtagctg tcggttgtga tggcacgaat gtcaacactg gtgcgttggg 1740 aggtgtcatc cgtttacttg agatgaaact cgataggcct ttgcattggt tcatttgcat 1800 gcttcatgca aatgagctcc cttttagaca tcttttgtgc aagttagatg gcaccacaag 1860 tggaccaaaa gctttctcag gtccaattgg taaattactt ccaatctgtg aaacttttcc 1920 tgtaaagcat ttttcaacta tagagatgaa tagttttcca gatgtaaatt ttaaagattt 1980 aagcacagat caaatgtatt tatataaaat gtgccacgca gtgaataatg gtgaatgccc 2040 aataaatctg gcaaataaga ctccaggtcc tgtagttcac tcgcgctggc tgacaacagc 2100 tagcagagtt ctgcggttat atgtttcgat ggaagatcca actgaaaatc tagtgactct 2160 tgcaacattt gttgttaaag tttacgctcc tacttggttt tatatcaaga cgaaatcatc 2220 ttgtactcaa ggggcactcc atctatggag aatgataaac ttttcaaggt accttaaaaa 2280 tgacttaaga tgtattgtgg atgaggtaat tcagagaaat gcttatttca gtcattacga 2340 aaacatccta cttgcaatgt tgactgatga gcgcgaacac ataaggaaac tggcattcag 2400 aagaatcctt ggtgcaaggg aaaaaaacag tcagtcacta agacgctttc gtgtgccgaa 2460 acttaatttc gaagcagatg actacactaa tctaatcgac tggagtgttg tccagaaaag 2520 cgagccccct gctacaaaga acatctcaac tcatgaaata aagtcatttg taaatggtgt 2580 ttcattgact gaaccagttt taaacattcc agagtttcca tgtcacactc agtcaactga 2640 aagacacgtt aagcttgtta ctgaagcatc agctactgtt tgcggttttg atcgcagaga 2700 tggaatgatt cgtgccacac tggaatcgag atcaaagaca aaaactttca acactaagtc 2760 tcagtttcaa ttataaagat ttaaacaact gtgaagttct tggatatatt atacatttgt 2820 aatgttattt aatatgccta taaacaatct gaaatatgta cgaaattggt tttgtatgga 2880 tttaaattaa agtactcaat tttcaagtgg cttggggcag acctaattat taacctaggg 2940 ccaaatgctt taaccacagt ttaccttgta ctaaaaggct acttttgcat acccagccaa 3000 catgaatttg caattatagg aaattattaa attttcgcca aaaaccgcaa aatggacgaa 3060 aaacccatat tttcgagagc tttggggcag acctaaaaat taacctaggg caaaatgctt 3120 ttaccacagt ttaccttgaa ctaaaaggca acttttgcat acccagccaa catgaatttg 3180 caatgatagg aaattattat tttttaaacc aaaaattgcg aaattaacaa aaaactcaaa 3240 ttttcaagtg ccttggggca gacctaaata ttaatctagg gcaaaatgct tttaccacag 3300 ttcattttgt acccaaaggc aacttttcca tacccagctc aaatgaattt ctaaaaaaag 3360 gttttttcac tccaccc 3377 // ID CR1-31_HM repbase; DNA; INV; 4126 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-31_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4126 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1859-1859 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 906..4037 FT /product="CR1-31_HM_1p" FT /translation="MQNYWQKIYLTNPFDLSSAAKKSNALTHPKQLTSMEE FT RGTHTSNGNQQLRPFNRSQHLTSSTFLNKASSSIKLSNLLCFYTNATSLNP FT IKLIELSLLAENNSFDIIFITETWFNDQSTPSLPNYNLYRSDRLNQIGGGV FT AIYTNNKIISNQLQESLLHKNFICYYSEQLWVELKLLNETLLLGCIYRPPS FT SGIXAFSEISSAIFHAETSVLTKKYNGMIIAGDFNFPEITWTSNSVFSSSK FT IGLSSSFVSLLQDCHLHQIVTVPTFKPANRPMTNTLDLIICDSPYRANEIN FT IGPPLGLSDQYHCSITWQYKLASTGKLSHFNSSKFIYNLGNYKKINKEFNK FT VDWSSIFKELNVEQCYQIWLNKYNECCLQFIPRMPNKPSPNKQPWMTKELL FT SLIRLKKSLWARNVSSKWKIASLVKEYRETRKFVKKLSCSSLKSFEFALAN FT DKRNPKRLFSYINSKRSVINQISSMRITSSSEIISSDKIKIANSLNNQFQA FT VFSKEPSNGNLPHFDRRTNTILNNITFDTQATLMELSSLDISKSLGVDNVS FT PXVLRXCSTSMSTPLTLIFQKSFDNGEIPXSWSKANVTPLFKKGSRIDPSN FT YRPISLTSIPCKIMEKLVKKAIMXHLKXNTLLSNSQHGFXEKKACITNLLE FT TMDFLTGNMARKLPVDVIFLDFAKAFDKVPHQRXLYKLKMYGIXGNLLNWI FT KAFLXNRSQRVILGDTQSNWLPVTSGVPQGSVLGPILFVIFINDLPDVISK FT ENICKIYADDTKILSIVNSLAAQLQLQSDIDQIVHWTKSWLMELNIKKCKV FT MHFGQKNXIPFEYSMEETGSSSIXTRTKLETTTSERDLGIQITNDLKVETQ FT SIIASCKANQMLGILKHTFQXRDASLWKQLYSTYVRPLLEFAIPAWNPHLV FT KDEQIIELIQHRATKIPXKLKXLDYKSRCLKLGLTSLIDRRKQGDLIQKYK FT IDKNIDIINWHFPXITIPPRANHRERXHREKYSNXLARFXFFNNRIVNAWN FT ILPDKVINSPTVNSFKASLDRLQCYYSSQFSLMXXAI*" XX SQ Sequence 4126 BP; 1431 A; 896 C; 581 G; 1161 T; 57 other; atatatatat actacagaaa aaaacayata ctgaaacttt ttagttcctg tgtaataagt 60 atttgtgtta ataaggcaaa aaawaagtta aaagttctcg accaagaggt aatccactgt 120 tattcagagt ggtagccctt ttatcaaata taaaatatca gattgagcaa tttgtaaatc 180 attggtgata atgagcacac cttatacaac cgaccaactt gcagctctag cagccaactg 240 gccaggctta ggtcaaggta atcattcagc tgcttctcaa aagcttgatg ctacgatatc 300 aaaactaata gccacagtca gcatacttct cacccgcgta tctgttcttg aaaacgaaaa 360 cattactcta aaagcatatc actctcaaca accaaagtta gcacctacga atttgaccac 420 aactctcaac tgggcaagcg cagtcaagca cggttcagaa gccaataaag caattatcac 480 agcagtcaac gaaaatgcca aaattgtaga aacgaaatct aaaagagcta tagttgccgg 540 tcttccaaaa tcctctgaca ataatcagag gctaatttca gttcaaagta tgctccaaaa 600 aaaaaaagct taactataat gttgctttta aagttgttgg gcatttctca gcaaaaactt 660 ctttaacaaa aaagtttaat actacaacca ataccaacca gtttaactac aatagcagct 720 gaaccaaatg aattgattat cattgagttt gactccaatg agcaccgcaa tgatctctta 780 tctaatgcaa aacgacttgc ctcaatagat gaattaaaat ctgtctatat tcgtccagac 840 cgtacacctc tcgaacaaga aacattcaac aacttaaaca aaaagcgtgc taccgaatac 900 caaaaatgca gaactattgg caaaagattt acttgacaaa ccctttcgat ttgtcatccg 960 cagcgaaaaa atcaaatgca ttgacacatc caaaacagtt aacatcaatg gaagagagag 1020 gaacccatac ctcaaatgga aatcagcagc tgaggccatt caatcgctct cagcatctta 1080 ctagttcaac atttttaaac aaggccagct cttccattaa attatctaat cttttgtgct 1140 tctacactaa tgcaacctcc ctcaatccaa taaaactcat cgaactctcc ttacttgcag 1200 aaaacaactc ctttgacatt atttttataa ctgagacatg gttcaacgac caatcaactc 1260 cctctcttcc aaactacaac ctctatcgtt cagaccgtct gaaccaaatt ggaggaggtg 1320 tagctatata cacaaacaat aaaatcatct caaaccaact acaggaatcc ctactacaca 1380 aaaattttat ttgttactat tcagaacaat tatgggtcga actaaagctt ctgaatgaaa 1440 ctctgctttt aggctgcatt tatagacctc catcttctgg tatcraagcc ttctcagaga 1500 tttcaagtgc tattttccat gccgaaacat cagtattgac caaaaaatac aatggtatga 1560 tcattgcygg tgatttcaat ttcccagaga ttacttggac ttccaattca gtcttctcct 1620 ctagtaaaat yggtctcagt tcttcatttg tctctcttct tcaagattgt catcttcacc 1680 aaattgtcac tgtccctact ttcaaacctg ctaatcgccc tatgacaaac actcttgatc 1740 tcataatatg tgactctcca tatcgtgcta acgaaatcaa cattggtcct ccactcggtt 1800 tgtctgatca atatcattgc tctatcacct ggcaatacaa actggcttcc accggaaaat 1860 tatcgcattt caacagctct aaatttattt acaacctygg taactacaaa aaaatcaaca 1920 aagaattcaa taaggttgac tggtcatcta tttttaaaga attaaacgta gaacaatgtt 1980 accaaatttg gctaaacaaa tacaatgaat gttgtcttca attcatccct cgcatgccaa 2040 acaaaccatc cccaaataaa caaccatgga tgacaaaaga acttttatcc ctgattcgtc 2100 ttaaaaaatc gctgtgggct cgtaacgtca gttcaaaatg gaaaatcgca tccctggtta 2160 aggaatatag agaaactaga aagtttgtta aaaaattaag ttgttcttct ctgaaatctt 2220 tcgaatttgc tcttgcaaat gataagcgca accctaaaag actattttca tacataaata 2280 gtaaacgctc tgtaataaat cagatctcat ctatgcgaat caccagttcc agtgaaatca 2340 ttagctctga caaaattaaa atagcyaact cacttaacaa tcaatttcaa gcagttttta 2400 gcaaagagcc atccaatggc aatcttcccc attttgatcg tagaactaac acaatcctca 2460 acaacataac ttttgacact caggccactc taatggaact ctcatcacta gatatatcca 2520 aatcacttgg tgtagacaat gtcagtccty acgttttacg ctyttgttct acytcaatgt 2580 ctactccact tactctyatc tttcaaaagt catttgacaa tggtgaaatt ccaarctcat 2640 ggtctaaagc aaatgtaacr cctttattta agaagggctc cagaatcgay ccatcaaact 2700 acagaccaat atcyctyacc tcaattccat gtaaaataat ggagaaacta gttaaaaaag 2760 caatcatgaw gcatctaaaa tyraacactc ttctatcaaa tagtcaacat ggctttktag 2820 agaaaaaagc atgcatyaca aatctccttg aaacaatgga cttcttgacy ggaaatatgg 2880 ccagaaaact accagttgat gtcatattcc tggattttgc caaagcattt gacaaagtgc 2940 cacatcaacg awtgctttat aaactaaaaa tgtayggaat craaggtaat cttctcaatt 3000 ggataaaagc ttttctamtc aacaggtctc aacgagttat ccttggtgac actcagtcga 3060 attggttacc tgtgacaagt ggwgttccgc aaggatcagt cttgggtcca attttatttg 3120 tcattttcat caatgacctg ccagatgtaa taagcaaaga aaacatttgt aagatctayg 3180 ctgatgacac taaaatyctg agcattgtca actcacttgc tgctcaactt caactccaat 3240 cagatattga ccaaattgtt cattggacca agtcttggct catggagtta aacataaaaa 3300 aatgcaargt catgcacttt ggtcaaaaaa attyaattcc yttcgagtac tcaatggagg 3360 aaactggctc tagttcaata rcaactcgaa ctaaattgga aacaaccacc tctgaacgtg 3420 acttgggaat tcaaattacy aatgatctta aagtagaaac ccaatccatc attgcmtcat 3480 gtaaagcaaa tcaaatgcta ggcatcytaa aacacacttt ccaatstcgt gatgcttctt 3540 tgtggaaaca actatactcc acctatgtcc gtcctttact tgaatttgca attccagctt 3600 ggaaccctca tttggtgaaa gatgaacaaa ttattgagct aatacagcac agagccacaa 3660 agataccgsa caaactgaaa aakttagact ayaagtccag atgtttaaaa cttggcctca 3720 catctctaat tgaccgccgt aaacaaggtg acctcattca aaaatataaa attgataaaa 3780 acattgacat aatcaactgg cattttccyc ytatcacaat tccaccaaga gcaaatcaca 3840 gagagagayt tcaycgtgaa aartayagca acmatttagc tcgyttymat ttttttaaca 3900 atcgcattgt caatgcatgg aacatcttac cagataaagt aatcaattct ccaactgtca 3960 acagctttaa agcgagcctc gataggctyc aatgttacta cagctcgcaa ttttcyttga 4020 tgraarttgc aatatgagct tcacaaaaac tawctrtaat ctgacaatct ayatttgtat 4080 tttgtcaatt atttttactt ttgttgtaac aaatatatac tactac 4126 // ID BEL2_Cis_LTR repbase; DNA; INV; 476 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-476 RA Smit A.F.; RT "BEL2_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000071; < 2% div. XX SQ Sequence 476 BP; 135 A; 84 C; 86 G; 170 T; 1 other; tgtaagattt atcgtcgcaa ttattgttta tattatatcg atgtcactag ggaataagat 60 cggtattgta ttcccttttt tcccgcaatt tcaccgaact ataaaacgct tgttttttac 120 cgcatgagca ttatcgtgag gtaatcggtc ttttggtagt atgttttgga attaacggtt 180 anaaagaaga tgtgtatgac gtgctttggc tcctgttcag ccacaatttg gtaaatattg 240 gttttatttc tcaaactagt aatttttgtg ataattaaat tgtacatata tttttcctta 300 gaaaaccgtc atttcaagac ctttcgtttt cgccatcatt caaattcgaa taaattcaca 360 agttttggac aatgacgcgt tatttcaaac tctgaataaa ggacgccgtc tataagcata 420 ctgcaacctc tcactaagac aggtgttagg agccagcacc ttgtcgggaa gttaca 476 // ID R4-N1_ED repbase; DNA; INV; 579 BP. XX AC . XX DT 30-JUL-2008 (Rel. 13.1, Created) DT 30-JUL-2008 (Rel. 13.1, Last updated, Version 1) XX DE Nonautonomous R4 non-LTR retrotransposon from Entamoeba dispar - DE a consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; EdSINE1; KW Ed_SINE3; R4-N1_ED. XX OS Entamoeba dispar OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-579 RA Lorenzi H. and Caler E.; RT "Genome wide survey and discovery of repetitive elements in three RT Entamoeba species."; RL Repbase Reports 8(10), 1686-1686 (2008). XX DR [1] (Consensus) XX SQ Sequence 579 BP; 246 A; 68 C; 88 G; 177 T; 0 other; tggcacatca gagacaccac atggaaccct aaaacattca catgcttcga atctcccagt 60 tattatctgg ttatgatatt gcctttggat aataaatgta ttagtcatta gtgcaaaaag 120 tgcagtaatt tattgttgtg taatattaca tagaatagta taagatttga tatttgttat 180 gttgaaatta taaatacata aggaggtatt aaccaaagaa aagtatctac taaggtgtgg 240 ttaagactaa aagaaaatta atcataataa tttagtggta atgaattgtt ttattctctc 300 tttcataaaa taaagaaaaa aggaaaataa aattgtttaa aattaaggca gaaaataaac 360 aaaggcttaa aaagaagaat tgagtagaag aagtttgaaa gaccttaaca ggaagaaatg 420 aaacaaagaa atgccttcct catattccaa gaaaaaccta aaatatatgt taaacaaaaa 480 gattactctt ttttaaacag ctcagtgatg gaattagtct cccctaaact aggaagaata 540 gatgaaaaat ctattaacac ttaattaaat aatttttat 579 // ID Gypsy-80_CQ-LTR repbase; DNA; INV; 157 BP. XX AC AAWU01003625; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-80_CQ_; KW Gypsy-80_CQ-I; Gypsy-80_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-157 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 540-540 (2011). XX DR Genome; AAWU01003625; Positions 42429 42273. XX SQ Sequence 157 BP; 38 A; 43 C; 33 G; 43 T; 0 other; tgttccatcc ccaccatcga gcgcccatcg cagtaggctg cgcctgtggg tggtggagaa 60 aattaagttc ggtttcaaca accacgcaaa ccagtacgaa atatattcgc tcttacgttt 120 tgtactacac gtgtttcatt cgtttccgga cacatca 157 // ID CR1-4_HM repbase; DNA; INV; 4352 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4352 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1832-1832 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 620..3694 FT /product="CR1-4_HM_1p" FT /translation="MEDIKFHDLSFKAFQTNKNILLENNSDPDVNFFNTDE FT IIKNICPSYYNSDISNVTANLDINAFSILHLNIRSFRKNFDKFKDFLFGLE FT INFKIICLTETWCRDKQIEKDSNYQLKNYTVIHQLRNSETIGGGVCIFIHN FT SLNFKPLTEFSVISNDCESLTIEILNKVTKNVIIHVVYRPPSGNNKFFEKH FT FKKLIKSKVLCDKRVFFVGDFNYNVLDYDTNKNVKNFMNIIFQNGFIPTIN FT KPTRITRNSATSIDQIIINEYTKNKIKTGIIISDISDHFPIFIIAHKSVEH FT TPHKKILITNRVYNDISMELFNNSLFSENWDDILQIQTAETAYNSFLRIFK FT DHYDKAFPVVTKIIKTKTFINPWITPGIIKSSKKKQRLYEVFLKKRTYVNE FT LRYKKYKSHFESIIKLSKKLYYSNKIIKFKGDTQKTWQVIKEVIGKKTIML FT NGLPKKLNYNHQDITKESLIAKTFNDVFVNAGLNLASKLKHSDFNKEKYLT FT PSKSIMENAELTEKELLDAVLSLKLNKSQGVDNISSNVIIKCIFYIKKILL FT HIFNLSFKQGIFPKKLKIARVIPVFKSGDNTNISNYRPISILPCFSKILER FT IMYNKLYSYLEKNNILYNKQFGFKTGHSTDHAILHLVQIIFQAFDKNKFTL FT GVFIDLSKAFDTVDHSILIKKLENYGIKNINIAWFKSYLSNRKQYISFGNK FT KTNNMTITCGVPQGSILGPLLFLIYINDLSKASNILDSILFADDTNLFYSH FT SDITALFKTVNIELLKLTEWLNMNKLSINLTKTKYTIFHRLHKKEKIPLEL FT PKLIIGDYIVKRECSMKFLGVILDENINWKDHINMIENKISKNIGLLYKAK FT PFLDQSCLKYIYYSFIHSYLNYANIAWCSTNKTKIKKLFSKQKHAIRLLSN FT AHRLSHSRPLFNNLKILNVYQINIYHLLIFMYKINNNSIPNIFKPLFNKIQ FT HKYVTKYSNNNFVQPKTIYDSTKFSITVRGPNAWNKIISNDIKTLKTLDQF FT KQKLKLLLLNNEQTDNYF*" XX SQ Sequence 4352 BP; 1784 A; 583 C; 524 G; 1461 T; 0 other; ctacgttaaa agattaaaaa aaatgaaaat gtgcaaaaag atatagaaga tagtctaaac 60 tttacacaag acatacaaga aaaaaaaatc tcgaattaga aaaaaaaatt atatcaaaat 120 cagcaagcag agatacggaa gataataaac taaggcagct tgaagataga caaagaagga 180 ataatcttag aatagaggga ttatcggaaa atgaaaaaga aaattgggat gaaaccgaac 240 taaaggtact aaatattttt gaagaaaaac ttaatttaaa aggaattgtt attgaacggg 300 cgcacaggtc aggaaaaatt gaagcaggta gaccaagatc aatagtaatg aaactgttaa 360 attacaaaga caaagtacaa gttttaaaaa gtgcgagtaa actaaaagga tcgggaatat 420 atatcaacga ggattattct ttggaaactg catctatcag gaaaaaacta ttcgagcaaa 480 tgaagataca taggaaaaat ggtatgtttt cggttgtaat ttacgataaa ttaatagtaa 540 aagaattcag gaaattacct tcaaaccttt aattatttta tactctatat attttgttag 600 tataaccttt cgtgttgtta tggaagatat aaaatttcat gatttaagct ttaaggcttt 660 ccaaacaaat aaaaatatac ttttagagaa taattcagat ccagatgtta atttttttaa 720 cactgatgaa atcattaaaa atatatgtcc ttcttattat aattctgata tttctaatgt 780 tactgcaaat ttggatatta atgcattctc tattttacac ctaaatatta ggagctttcg 840 taaaaatttt gataaattta aagatttttt atttggtctt gaaattaatt ttaaaattat 900 ttgtctcaca gagacctggt gtcgagataa acaaattgaa aaagattcaa attatcaact 960 aaaaaactat acagttattc atcaactcag aaactctgag acaataggtg gaggagtatg 1020 tatttttatc cataattctt taaattttaa accgctaacc gagtttagtg taattagtaa 1080 tgattgtgaa tctttaacga tcgaaatttt aaataaagtc acaaaaaacg tcattataca 1140 tgtcgtgtac agaccgccat ctggcaataa taaatttttt gaaaaacact ttaaaaaatt 1200 aattaaaagc aaagttttat gtgacaaacg tgtcttcttt gtgggagatt ttaattataa 1260 tgtgcttgat tacgacacaa ataaaaatgt taaaaatttt atgaacatca tatttcaaaa 1320 tggatttatt ccaactataa ataaaccaac acgaattact agaaatagcg caacttcaat 1380 cgatcaaata ataattaatg aatatacaaa aaacaaaata aaaactggaa taataatatc 1440 cgatatatcg gatcatttcc caatatttat aattgcacat aaaagtgtcg aacacacccc 1500 tcacaaaaaa atattaatta caaatcgagt atataatgac atttctatgg aactttttaa 1560 taattcttta ttttctgaaa actgggacga tattttacaa attcaaactg cagagactgc 1620 ttataacagt tttcttcgga tattcaaaga tcattatgat aaagctttcc cggtggtaac 1680 aaaaattatt aaaactaaaa catttataaa cccctggatt acaccaggaa taataaaatc 1740 gtctaaaaaa aaacaacgtt tgtatgaagt atttctgaaa aaaagaacct atgtaaatga 1800 actaaggtat aaaaaatata aaagtcactt tgaatcaatt ataaaacttt caaaaaagtt 1860 atactactca aataaaataa taaaattcaa aggtgatact caaaaaacgt ggcaagtcat 1920 taaggaagta attggaaaaa aaacaattat gttaaatggt cttccaaaaa agctcaatta 1980 caatcatcaa gatattacaa aagaatctct aattgctaaa acgtttaatg acgtatttgt 2040 caacgcaggt ctaaatttag catccaaact aaaacacagt gattttaaca aggagaaata 2100 cttgactcct agcaaatcaa taatggaaaa tgcggagtta accgaaaaag agttattaga 2160 tgctgttctt tcattaaaat taaacaaatc acaaggagtt gataacatta gcagtaatgt 2220 cataattaaa tgtatcttct atattaaaaa aatccttttg catatattta atctttcgtt 2280 taaacaagga atctttccaa aaaaacttaa aattgcaagg gtaataccag tttttaaatc 2340 tggagataat actaatattt ctaattatag acctatttcc attcttcctt gcttctcaaa 2400 aatactagaa cgcattatgt ataacaaatt atattcatat ttagaaaaaa ataatattct 2460 atataacaaa caatttggct tcaaaacagg acactctact gatcatgcaa ttcttcatct 2520 tgttcagatt atttttcaag cttttgataa aaacaagttc actttaggtg ttttcataga 2580 tctaagtaag gcgttcgaca cagttgatca tagtatttta attaaaaaat tagagaacta 2640 cggtatcaaa aatattaaca tagcttggtt taaaagctac ttatctaata gaaaacagta 2700 tatttcattc ggcaataaaa aaactaacaa tatgacaatt acttgcggcg ttcctcaagg 2760 atctatatta ggaccactct tatttttaat ttatattaat gatttaagca aagcctctaa 2820 tattctagat tctatcttat ttgctgatga cacaaattta ttttattctc atagtgatat 2880 aacagcatta tttaaaacgg tcaacattga actcctaaaa ctaaccgaat ggttaaatat 2940 gaacaaactt tcaattaatt taactaaaac taaatatact atttttcatc gcctccataa 3000 aaaagaaaaa attcctcttg aacttccaaa acttattatt ggagattaca tcgtaaaaag 3060 agaatgttcg atgaaatttt taggtgtaat tctagatgaa aacataaatt ggaaagacca 3120 cataaacatg attgaaaata aaatttctaa aaatattgga ttactctata aagcaaaacc 3180 attcttagat caatcctgct tgaaatacat atattactca tttatacatt cttatcttaa 3240 ttatgcaaac attgcttggt gtagcaccaa taaaacaaag ataaaaaagc tatttagcaa 3300 acaaaaacat gcaattagat tattatcaaa tgcacaccgc ctctcacatt caagaccatt 3360 gtttaacaat ctaaaaatac taaatgtgta ccaaataaat atatatcatc ttctgatttt 3420 tatgtataaa attaataaca attcaatacc aaatatcttc aaaccattat ttaataaaat 3480 tcaacacaaa tacgtaacta aatattctaa caacaacttt gttcaaccca aaactatata 3540 cgactccacc aagttttcaa taactgttag aggacctaat gcatggaata agataatcag 3600 taatgatatt aaaacactta aaactcttga tcagtttaaa caaaaactaa aactattact 3660 tttaaataat gaacaaacag ataattattt ttaaatctat ctataaaaat attatatttt 3720 aaaactatct actaatgcat actttatttg tctggaaggt ctaaattctt taattttata 3780 taggtattac agtctgttat attataaaaa aagtacttat aaagattttt agcattttta 3840 gatttcatta gtttaaataa atttattagt tttcttaaat aagagagaga gaggaaaaat 3900 aaaaaaagta atcttttttg agttttaatt ttttatatta atattttgta atttgttttt 3960 gaggaattaa tatattacac tttataagaa cagattgtaa attaaatttt acgtttactt 4020 ttttcatgtt ttttagtttt ttgatatttt tgtttttctg aatgaacttt taaatttttc 4080 tttcaaaaaa taaaatcgaa aaaatatctt tcaaaacttt gaactattaa actatttggc 4140 attttatata aacttctttg taatattttg cttttttctt aattttttaa ttttcatttt 4200 agttaccatg tatttatata aatgtctaga caaacaaggg acttggtgat aagacaaccc 4260 cgtcttcttc tcgtccttgc catttttaaa tattttaaat tacggaactt gtaaatattt 4320 gtacggctaa attataaaaa aaaaaaaaaa aa 4352 // ID Gypsy-22_DPu-I repbase; DNA; INV; 18820 BP. XX AC scaffold_601; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_DP_; KW Gypsy-22_DPu-LTR; Gypsy-22_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-18820 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_601; Positions 4723 23542. XX CC Positions [5801-6304] - Integrase core CC 'AATCTT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1062..3359 FT /product="Gypsy-22_DPu-I_5p" FT /translation="MIQRDHFWPRSTPYIICICNNVSLSLYSRPIDQTCNT FT SALHLHACLHLFSVTANRALQLATLPFSAPGISTRYDRMNPNFDHSKYFVS FT TLDVNGERPVISHNFKYLNEAELDYCKKKIAVKVVQLNFARDKIITSIQDA FT RATTGHARRCLQTKGDKLTREIRTLEIKFDELAIPNFPICRLSRKARLGRV FT TLPAGVTVYLTKFRKSAPRTPVEPDLLAEATGGISPTPRVTEEIKWRPTVE FT TEADTSIFDNVDEFIRLPVDDIIKRLQEERITGSPDLRNKAKFLNRLWRAD FT LSPEFDWNTYSGELNEATGFRGWESDDDTLSTFDLDLIYPDEDMALTQAAI FT DQLTAAAQDNTEALVFGMRNLAAQRKLENIPIFSGESNCPLVIEEWFKIAE FT RTARLAGWNEQQKILFFQEKLTKSAANFNDSLTPAQRVAYDAWKALMIDGL FT HDNTTKAIKKGELKDLKQEPGERVRDFQTRIDDMYRIAYGAGPASSNDANV FT VLVRDDMKKEILLSGLRREIATLVWNRVEADATYDNTVQTAIECEKVTEVK FT KIAQSKDITSAVSVISQENEKNAAKIKELEGLINKLSTKPVSPPTASTPVD FT LGDPAVIAAFNTYNNAQSNFPKTVRFPDNQRSRSVSPFTYNRSNRPEEGAR FT AQTPTRINNNQDTRITCYNCGKKGHISRECWSGRGQQQQNRQRDGQHFSQD FT GRQNSERSGRNWRPNGDQPNHYRGQSSERYDRKNNSYGRQFNQNREQNGSR FT YSQDYRNAPPNGRRQ" FT CDS 6851..8941 FT /product="Gypsy-22_DPu-I_4p" FT /translation="MDCKIALGLLSSVLFVTSVSSINITICDCNKPKTIGL FT LDAELPSYCQESIAETPVIKKYTFFIKEEPHAHWDGFVCRTWIKTKKIEGF FT FFGGYDTTFTTSSRPMAEKECWEMVQYHRCFENGMEGAQDSFAFTASPVGE FT GAWMQTKEYGIINCLLQRITLRKDCLNCPISSPYGILTNKSDVSFVRHHNS FT IIVWDASKANISDQCRLKELKNGTGLVTKVDASSFKLVDNTAQLEFFYHEN FT LENICKRPLRKLKNLDAAYLHIIAQNWSHVYNVETKICLDINMQNAPCDGQ FT KPHEFILIGNEIAINNKELGLVHANCIAPAALYALSTILATVLPKRKESIP FT TDKPKPTYFHVKSSQCRQKFNFRWNSETLEITGMVDGREGCLTAKVGLQPS FT LTDCNQEKSQKWIFGHPNIPIKRKMNDGQPLLLQHHQYMEHQAIVSENVLE FT KEIKKIYCGNLQVRRYTLMLLAEQNGLLAARANNLPMCQRLKMYGDYFLVQ FT QCKLDNISIGMEKTKCGPEPKLKNYTIGKDGFSLHPFEECFWANSFINLNG FT KLYMWKDSEWSAIQPTAHLSTLKLTSKFTELEDNEAQYLLNSHDILERPEY FT EQINAVNEIVNKIHESNTRSLSAILVNEKEESRFWSFSSWAAKIKASFITV FT VSVVILAICVIFLLWKYKSRIGGLQDILTHFAVGVQERRRRQNNVLTE" FT CDS join(3095..4966,4970..6829) FT /product="Gypsy-22_DPu-I_2p" FT /translation="MLVRSWSTTAKSTTRWTAFLARRKAKFRKVWPKLEAE FT WRPTKSLSGPEQRTIRSQKQFVRQTIQSEQGTERQSLQPRLSKCPTERQET FT VKATNNSNNVNTKYQLFHVPIKIDNTTTYALIDTGASVSAMSEYFFNRLSN FT EAKLSKISNNDNLRSICGESMDISGVYDLNISLDNNAECIRQKFFIVPKLT FT ETCILGIDFVTENAIALNGETRRVTYKINGQSFSFIADTSTLGNEYSNLIQ FT KLNASVATSAKIKENIPDIVEISQSKIVIEDINCEMYRNKIDQLLELNKDV FT IADKLCELGQAKGIKHKIDTTGKIIYMRPRRQARAHLSLIEKEVDEMLQYN FT IIRHSSSPYSSPIHLTDKKDGTKRFCIDFRNLNTETTKDKYPIPIVDETKD FT YLLGARYFSTLDLISGYWQIEIEEADKHKTAFTTSQGHYEFNRMPFGLTNA FT PATFQRLMNNLLQPVIHKCALVYLDDVIIYSRTIEDHMRHIQCVLNLLREG FT GLKIKLSKCLFLQKAVKYLGHIISEDGLRPDPKLTEAIRNYPTPQNKNHVK FT SFLGLSGYYRKFIWDYANKARALTILTRQKEEFRWGPEEENTFQFLKNCLL FT ENPILRFPDFDLEFIVQTDASGFAVGVLAQRKIVNGEKVEYAVAYASQQLD FT ETQEKWHTTDKEAYAIYHALKTFYHYLYGTKFTVETDHEALKGFPKITENM FT SRKVIRWALFANEFDFQTVFRLGVTNQNADGLSRITEQKEVVETSGQTHSI FT KALTTETFALEQGKDKFCLEMKRRYLRDQQRQKELMEEYLVENEPRQNARK FT RRNSEDDDSDSDVDEDEQIKELNNGLLGTADGRILVPQVLREKILLRFHDS FT PFAGHLGVKKTTARIQRRFKWPRMVKEIREYIKNCELCVKRKGGGDNKSPL FT NPIPPPDHVWQCMAMDIMGPLTPSGINNHKYILVMGEYLTRYVTVASMPDE FT TAESVAKAFIVNIITRHGVPETVLTDQGQNFMSKLMECLYKQCGIKAIRTS FT AYRPQCDGMVERVNRTLADTISCYVKDEPSRWTEFLDVAAFAYNTAVHLST FT GYSPFYLMYGREAREPSDLMPPARNRNLTDINMLFSQQWYDALRIAKERLI FT EAKEKQKFYYDRNTKRIEFKGGEKVLLKQLAITPGKFNNRWEGPYTVKEKK FT GNVSYRIISDDGKKLMVAHADRMKKFQGRTNPEATATATNEMRKEETEEKQ FT DTERARFVNQKNVVVNEPRYNLRTKINLPKRLQY" XX SQ Sequence 18820 BP; 5984 A; 3901 C; 3953 G; 4982 T; 0 other; aatttcccga tagggattct atcgccaaag acctgtttcc tgacccaacc catcccaacc 60 cacaccaaca gatggcgttg ttcctctcag tgagatagga aagttgttaa tttcattaat 120 gtcttatacg ttttttttct tagagttaaa aaattagaaa attatgatca tcttgtgcaa 180 gacaaggttt tacatggagc cgtcgcattt agatttttat attgcgacga aatcataatt 240 ataaactcga atttgtgttt tcgtgtttta ttaaactgcc gttagatggt tctgagttct 300 gacgcattgt gctataaggc cgtgctatat aaacacagcc tattatctag tcaactcggt 360 ctactgtttt ccatttgcac ttttgcagtc ttcatttgaa agtaatggca ttctcttcca 420 tcaaccagcg gcaatcaatg gtagacctca ggtacaggta attaattaat taactaattt 480 caataccaca ttagttaaag aaaaggggaa ctcttcttga agactgccta agtattctct 540 ttgcttgcat ttgttctgga ttcctggacc caatagccaa attgactatt gagtccggcc 600 gctaggggga cattgcactc ctctgactcc tccaagaaat ctgcacgaaa taccattcct 660 cgtaatttgc cactgacgca tagtacattt acaagaaatt ttgaaaattt tctatataag 720 tatgatgtat atatacatat tagaatgaac atttgtgact tgtactgtac aatgctacaa 780 tcaaaactag atttaaagct gtaatgcccc gcggagccga cttgccccgc ggtgcctact 840 agccccgccc cctccagcga tggccaccac ctccgcctac gtcacagccg tgacgtcaac 900 accggcaacc catgtcacct cccgtgactc acactggaat gccatccacc ggaaacgccg 960 acgctgcacg ctgactcggc ggtcagctca gaataccggc ccagctcaac cggcccgctg 1020 gtccagctca accaaggcac tggtccagct caaccacgaa gatgatccag cgggatcact 1080 tttggcctcg atcaactccc tatataatct gtatttgtaa caatgtctct ctctctctgt 1140 actcccgccc gatcgaccag acctgtaata catcggctct tcatcttcac gcttgtctac 1200 atttattctc tgtaactgcc aaccgggcgt tacaattggc gaccttgcct ttttcggcac 1260 ctggaatctc aacaaggtac gatcggatga atccgaattt tgatcactcc aagtattttg 1320 ttagtacgct agacgtaaac ggtgaacgac cagttattag ccataatttt aaatacttga 1380 acgaggctga attagattat tgtaaaaaga aaatagcagt aaaagtagtg cagttaaatt 1440 ttgcaagaga caaaataatt acatccatac aagacgctag ggcaacaaca ggtcacgcaa 1500 gacggtgctt gcaaacgaaa ggagataaat taactaggga aataagaaca ttggaaatta 1560 aattcgacga acttgcaatc ccgaatttcc ctatttgtag attatctcgt aaggctcgat 1620 taggccgagt taccttacca gcaggggtta cagtctattt aacgaaattt cgaaagtcag 1680 cacctcgcac accggtagag cccgatttac ttgcagaggc cacaggtggg atatcaccga 1740 ccccgagagt tactgaggaa attaagtgga gaccaacggt agagacagaa gcagacacta 1800 gcatttttga taacgtcgat gaatttattc gactaccggt agatgatata attaaacgtt 1860 tacaagaaga acggataacc ggctcaccag acttgagaaa taaagctaaa ttcctaaacc 1920 gattgtggag agcagattta tcaccagagt ttgattggaa cacttattcc ggggaattaa 1980 acgaagcgac aggctttaga gggtgggaaa gtgacgacga tacccttagt acttttgatc 2040 tcgatctaat atacccagac gaagatatgg ctctaactca ggcagcaatt gatcaactga 2100 cagcagcggc tcaagataat acagaggccc tagtattcgg catgcgtaac ttggcagctc 2160 aaaggaagtt agaaaacatc ccaatttttt caggcgaaag taattgccct ttagttatag 2220 aagaatggtt taaaattgcc gaacggactg ctagattggc gggatggaat gaacaacaaa 2280 agatactatt ttttcaggag aaattaacta aatcagctgc taattttaat gattcattaa 2340 caccagcaca gcgagtagct tatgatgcat ggaaagcatt aatgatagac ggattgcacg 2400 ataatacaac aaaagcgata aaaaagggag aattaaaaga tttaaaacag gaaccgggag 2460 aacgagtaag agattttcaa actagaatag atgatatgta taggatagct tatggagcgg 2520 gcccagcatc tagcaatgac gctaatgtgg ttttagtaag agacgatatg aagaaagaaa 2580 tattattgtc aggccttcgc agagaaatag ctacattagt gtggaacaga gtggaggccg 2640 atgcgacata cgataatact gtgcagacag cgatcgaatg cgaaaaagta acggaagtta 2700 aaaagatagc acagagtaaa gacataactt ctgcagtttc agtaatatcg caagaaaacg 2760 agaaaaatgc ggcgaaaata aaagagctgg aagggttgat aaacaaactg tcgacgaaac 2820 cggtctcacc accgacggct tcaacaccag tcgatttggg agacccagca gttattgcag 2880 ccttcaacac atacaacaac gcgcaatcaa atttcccgaa aaccgttcgt tttccggata 2940 atcaacgaag tcgtagcgta agtccattta cttataacag atccaatagg ccagaggaag 3000 gagcaagggc tcagacgcca acacgaatta acaataatca ggatactaga ataacttgtt 3060 ataattgcgg gaagaaaggg catattagta gagaatgttg gtccggtcgt ggtcaacaac 3120 agcaaaatcg acaacgcgat ggacagcatt tctcgcaaga cggaaggcaa aattccgaaa 3180 ggtctggccg aaattggagg ccgaatggcg accaaccaaa tcattatcgg ggccagagca 3240 gcgaacgata cgatcgcaaa aacaattcgt acggcagaca attcaatcag aacagggaac 3300 agaacggcag tcgttacagc caagactatc gaaatgcccc accgaacggc aggagacagt 3360 aaaagcgacg aataattcga ataatgtaaa taccaagtac caactttttc atgtaccaat 3420 taaaatagac aatactacta cgtatgcttt aatagatacg ggggcttccg ttagtgcaat 3480 gtcagaatat ttctttaaca ggttgagtaa tgaagcaaaa ttgagtaaaa taagtaataa 3540 tgataacctt cgtagtattt gtggcgaaag tatggatatc tcgggagtat acgacttaaa 3600 tattagctta gataataatg cggaatgtat cagacagaaa ttttttatag tacccaagct 3660 aactgaaaca tgtatattag gaatagattt tgtaactgaa aatgcgattg ctttaaatgg 3720 agaaacccgt cgagtgactt acaaaattaa tggccaaagt ttttcgttta tcgcagacac 3780 aagtacctta gggaatgaat attcaaattt aattcagaaa ctgaatgcct cagtggcaac 3840 aagtgcgaaa atcaaggaga atattccaga tatcgttgaa atatcgcaat ctaaaatagt 3900 aattgaagat ataaactgtg aaatgtatag aaataagata gaccagttgc tagaattgaa 3960 taaagatgta atcgccgata aattgtgtga attaggccaa gccaaaggca ttaaacataa 4020 aatagacacg acaggaaaaa ttatttacat gcgacctaga cggcaagcca gggctcattt 4080 atcactaata gaaaaagaag tggacgaaat gttacagtat aacataatta ggcatagctc 4140 aagtccgtat agctcaccca tccatctcac cgacaagaag gatggcacta aacgcttttg 4200 tattgacttt cgcaatttga atacggaaac gacaaaagac aaatacccta taccaatcgt 4260 cgacgaaaca aaagattatt tgttaggcgc tcgttatttc tcaacactcg acctgatcag 4320 cgggtattgg cagatagaaa ttgaggaagc cgacaagcat aaaacagcat ttacaacaag 4380 tcagggtcat tatgaattta atcgtatgcc attcggattg acgaatgcac cagcaacgtt 4440 tcagcgttta atgaataatt tattacaacc agttattcat aaatgtgcgc tagtttactt 4500 ggacgacgta attatttatt ctagaacaat tgaagatcac atgagacata ttcaatgcgt 4560 actaaacctg ttaagggagg gtggattgaa aataaagtta tcgaaatgtt tatttttgca 4620 aaaggcagtg aagtatttgg gacatattat atcagaagac ggcttgaggc cggatccgaa 4680 gttaacggaa gcgattagaa attatccgac gccgcagaat aaaaatcacg tgaaaagttt 4740 tttgggatta tcaggatatt acagaaagtt tatatgggat tacgcgaata aagcgagagc 4800 acttacaata ttgacgagac agaaagaaga atttagatgg ggccccgagg aagaaaatac 4860 gtttcaattt ttgaagaatt gtttgttaga aaacccgatt ttacgttttc cagatttcga 4920 cctcgagttc attgtccaaa cggacgcatc gggatttgca gtaggataag ttctagcaca 4980 acgaaaaata gtgaacggag aaaaagtgga atatgccgta gcatacgcgt cgcagcaatt 5040 agacgaaacg caggaaaagt ggcacacgac agacaaggaa gcatatgcga tatatcatgc 5100 gttaaaaaca ttttatcatt atttgtatgg tacgaaattt acggtagaaa cagatcacga 5160 agctctgaaa ggtttcccaa agataacaga aaatatgtcg agaaaagtaa tcagatgggc 5220 gttattcgcc aatgaatttg actttcaaac ggtttttagg ttaggtgtga ctaaccaaaa 5280 tgccgacggg ttaagcagaa taacagagca gaaagaagta gtcgaaacgt ctggccagac 5340 acacagcata aaagcattga ctacggaaac atttgcgcta gagcagggaa aggataaatt 5400 ttgtttagaa atgaaaagaa gatatttacg cgatcaacag agacaaaaag agctcatgga 5460 agaatattta gttgaaaacg aaccgagaca aaacgcgaga aagcgtcgaa atagcgaaga 5520 tgacgatagc gatagcgacg tcgatgaaga cgaacaaatt aaagaattga ataatggttt 5580 actcggcaca gccgacggaa gaatattggt cccacaagtg ttaagagaaa agatattatt 5640 acgatttcat gatagtccat tcgcgggaca tctgggagta aagaaaacga cagcaagaat 5700 acaaagaaga tttaaatggc cacgtatggt gaaagaaatt agagaatata taaaaaattg 5760 cgaattatgc gttaaacgta aaggaggtgg agataataag tctcctttaa acccaattcc 5820 cccaccagat catgtatggc aatgtatggc tatggatatt atgggcccac tgacaccgtc 5880 aggaataaat aatcataaat atattttagt aatgggcgaa tatttaacta gatatgtgac 5940 cgttgcatca atgccagacg aaacagcaga atctgtagca aaagctttta ttgtaaatat 6000 cataacgcgt catggcgttc cagaaacagt actgaccgat caaggacaaa attttatgtc 6060 taaattaatg gaatgcttgt ataaacaatg tggaattaaa gctattcgaa cgtcggccta 6120 ccgtccacaa tgcgacggga tggtcgagcg cgtcaatagg actcttgccg atacaatttc 6180 ttgttatgtt aaggacgagc ctagtcgatg gacagaattt ttagatgtag ccgcttttgc 6240 ttataatact gcagtccatt tgagcacagg ttatagccca ttttacttaa tgtatggaag 6300 agaggcgcga gagccaagtg atttaatgcc gcccgcacgg aaccggaatt taacagatat 6360 taatatgtta ttctctcagc aatggtatga tgccttaaga atcgcgaaag aaagattgat 6420 agaagcaaag gaaaagcaga aattttacta tgataggaat acaaaaagaa tagagtttaa 6480 ggggggagaa aaagtattat taaaacagct agccatcacc ccaggcaaat tcaataatcg 6540 ttgggaaggg ccgtatacag taaaggaaaa gaaaggaaac gtcagctaca gaataatatc 6600 ggacgatggg aaaaagctga tggtagcaca tgctgacaga atgaagaagt ttcaaggccg 6660 tacaaacccg gaggcgacag caacagctac gaacgaaatg agaaaggaag aaacggagga 6720 aaagcaagac acagaacgag cgcgatttgt aaatcagaag aatgtagtag taaatgaacc 6780 tagatataat ttaagaacaa aaattaacct cccaaaaaga ttgcaatatt aacaaacgta 6840 aatattatag atggattgta aaatcgcgct cggactactg tcatctgtct tattcgtcac 6900 ctcagtttcg agcataaata tcacaatttg tgattgcaac aagcctaaaa cgattggact 6960 attggacgcg gagcttccat catattgcca ggaatcgata gctgaaacac ccgttataaa 7020 gaagtataca tttttcatta aagaagagcc acatgctcat tgggatggtt ttgtttgtcg 7080 aacatggata aaaacgaaga aaattgaagg tttctttttc gggggatatg ataccacatt 7140 tactacttct tccaggccta tggctgaaaa agaatgttgg gaaatggtac agtaccatag 7200 atgttttgaa aacgggatgg aaggcgcgca agattctttt gcatttacag cttcaccagt 7260 aggcgaggga gcttggatgc agacaaaaga atacggtata attaactgcc tgctgcagag 7320 aataacgctg agaaaagatt gcttaaattg tcctattagt tccccatatg gcattcttac 7380 aaataaatca gatgtatctt ttgtgcggca tcacaattca attattgtgt gggacgcatc 7440 aaaagcaaac atatctgatc aatgccggtt gaaagaactt aaaaatggga cgggtttagt 7500 cacaaaagtc gacgcaagtt catttaaatt agtagataat acagcacagc tagaattctt 7560 ttatcacgaa aatttggaaa atatttgtaa acggccttta cgtaaactaa aaaatcttga 7620 cgccgcatac ctccatataa tcgcacaaaa ttggtctcat gtttacaatg tggaaaccaa 7680 aatttgtctc gatattaata tgcaaaatgc gccgtgtgat ggtcaaaaac cgcatgaatt 7740 cattttaata ggtaacgaaa ttgcgataaa taataaagaa cttggtttag tgcatgcaaa 7800 ttgtatagcg ccagcggcgt tatatgcgct atcgactatt ctcgctaccg ttctccccaa 7860 aagaaaagaa agcatcccga cggacaagcc aaagcccact tattttcatg taaaatcatc 7920 gcaatgtcga cagaaattta attttcgctg gaatagcgag acgttagaaa ttactggtat 7980 ggtcgacggc cgcgaaggtt gtcttacagc aaaagtaggc ctacagcctt cattaaccga 8040 ttgcaatcag gagaagagtc agaaatggat atttggtcac cccaatattc ctataaaacg 8100 gaagatgaac gacggtcaac cactcctact gcagcatcat caatacatgg aacatcaagc 8160 catagtcagt gaaaacgtat tagaaaaaga aataaaaaag atatattgcg gtaatctgca 8220 agtgcgcaga tatactttaa tgttgctcgc ggagcaaaac ggactattag cagcgcgagc 8280 aaataatctc cctatgtgtc aacgacttaa aatgtacggg gattactttt tagtgcaaca 8340 gtgcaaatta gacaatatct cgatcggaat ggaaaaaacg aagtgcggac cggagcctaa 8400 attaaaaaac tatactatag gtaaagatgg tttttcttta catccatttg aggaatgttt 8460 ttgggcgaat agtttcatca atttaaatgg aaagctgtac atgtggaaag acagtgaatg 8520 gagtgcgata cagccaacag cccatctctc gacgctcaaa cttacatcaa agtttacgga 8580 gctagaagat aatgaggctc aatatttgtt aaattcccac gatatacttg agaggccgga 8640 atacgaacaa attaatgctg tcaacgaaat cgtaaataaa atccacgaat ccaatacaag 8700 aagcttgtca gcgattttag ttaacgaaaa agaggaatca cgtttctgga gcttcagcag 8760 ttgggcagcg aaaattaaag catcatttat tacagtagtt tcagttgtca ttctcgccat 8820 ctgtgtgatt tttcttctat ggaagtataa atcaagaatt ggaggactgc aagacatctt 8880 gacacatttc gctgtgggag tgcaagaacg cagacggagg caaaataatg tgctgacgga 8940 gtagtcacct aagacaaata aaaatcggaa acacagcaat attcttcccc gcttcaacaa 9000 aatacatttc gaaatacata tcacatttct gcccattcag caggctatta aacaattaac 9060 gaaaagctgc tcgatagttc tcgacagact ttgtcagtgt gcttcaggac gtcatcgcgt 9120 gtagtgtaca accaaaagtt tgtgcaatga gtgactcaag aaaaatcaac gacgttcgtg 9180 ttaactgtga accgttgaat ggcgatagtg tgctccatcg tatcgccaaa aacaattgct 9240 ataaggaatt ggcaaacgtt tcgccgggac cgggagtaaa ttgccggaac aatcgaatgg 9300 agacgccgct gattgtcgcg gcgcagaaca gcgcaatccg cgccatcacg tctctaactg 9360 ctttgggcgc cgatgtcgac gctcaggatg caaacggtaa tacagcaatg cattacgcgg 9420 ttcttaactc gtcagatagg gcaattgact cgctcgcaca cgcgtacacg aacttcaata 9480 tcccgaatgc cgtgggaaga accgccatgc acctcctttc aattagcaat agcccaaatt 9540 tagcgagaca attatgtcgt aatattctaa gaatcgatat gactattttg gatcaagatg 9600 ggaacacccc atttatggct gccgtcagat ataacagcag tcgtatggtg cattatttcc 9660 tgcaactcgg gtatagacct gagggaaaag acaaagacgg aaatacgcct ttacatatcg 9720 cggtgagatc caaaaatctc gcaacgataa aattattggc ccgaaaaccg catataaatt 9780 ttgtgaatga cgatggccgc agcccactgg ctgccgcgat cgccattggt aatgaaatgg 9840 ctatggaaat attgtggcgc cataacgcag acccacacgt cgtagacaag gctggcaaca 9900 ccttgttgct tcttgcatgt gccgggcctt caatagtaat acttaaacat attttggagc 9960 attgctcgga agtgacaatg tgcaatgtag ccggcgagac cgcattgcat ttggcatctg 10020 cagctaaaac ttctagcgtt atttttgaaa tttgtaaaag gggagcgact gtcacagcac 10080 aagattgtaa agggaggacg cctttaatga tagcaatcat gtcggggaat cgacaggccg 10140 caacggtcca attagagtgg caatttgccg cagggacaga ttttctctcg agagtagatt 10200 tggcaggatt aaacgcttcg caatattgcg atttatttgc tgatctggaa actaaagagt 10260 gtatccggaa tttacaaata gcatttgtta gagcccagca ggttcccgtc caaactagaa 10320 caatttattt taacataccg cccgaaacgg ttgcattgga ttacgacaat aacgatcccc 10380 tggaatttaa ttccgatgaa tcaaccgatt cagactcaaa catgccagtc ctcaccctgt 10440 aatggaagat ttattttttt aagaaaaatg tggagaagag aacaaaaagt tgaaaaaaga 10500 gagagtgaaa agaggacaag actctctctt ctttgtaact tagttttttt ttattttacc 10560 gctgttctta ccttcttctg cctagtgtcg tttttctcgc tgccctcttt cctactccaa 10620 cgtttttttt ttgtttcccc tctctttcgt cacccatttt catatcttat tttctttttt 10680 ccataggcct acgcctaaat atgccaattt ttttttttat taacttccct ctctctctga 10740 tctttgatgc ccagaatgtt ttttttgggg ggggggaata agttaatgga aatgtagaaa 10800 atacgctcgt gaaatcgccg cctcgaacag tggtaaaatc gatccagtgc acactgggcg 10860 acggataatg taaaagaccg ttgaatgaca gagtgcaaca cggaaaagac aaagataagg 10920 gaatgctact gacacgaaac aaagaggatg tgatttgaca cagcattaaa aatccattta 10980 atagaataca cgcctgcaca cttattaagc gacgaatcga tgttcgtcat aaacgaacta 11040 gaaaggtata caaacggaaa aaaaaataaa aaaaaaatgc ttacagtaga caagctcttc 11100 tctctacgga caaaattttg ttgaaggcga attgatgttt gtcgaaacgg gtatattcca 11160 tgaaaaatag caaagtaaac acggcatgac aaggtttagg cctgcatttt acagtctttg 11220 cgcgtgattc ccatttacgt tgcaattgct tgattgcttt tcttatattt ttgcaagtaa 11280 ccggaagcaa gatagaaaca cgccccaccg accttggtat ggacgttcag acagacggat 11340 taaataacga ctaatttgtg tcgttataga tttttttttt tcttccatta cgaaaaagtc 11400 cgttcgtcac catttgttac caaatcaagt accccgaaca attttgtttt tgtgaatttt 11460 attttttttt cttctcttcc tcttttcttc ccattttttt ttataataag gagacgtaac 11520 gaaaaacaaa atgcaacttt tatagtgtat gaattaaaag gaaaaaccta tcactaacaa 11580 tccaagggag agagggtaat tacggatttc agtccatgag ttgacatgaa gtacgaaaaa 11640 gcaatggggt aagggacgct tcggggggga aagtagccca cagtaacaaa tgaaacgtcg 11700 ccattccacc ccaaacattt tattttcctt gggggatgta ttaagtatac cctgagaata 11760 cgtctagtac atgttatcct gttgtcaata gaccccctgc caacagcaaa ttattaagac 11820 ggattggaaa aggatattcg tccagtgaaa agtaaaacaa taagttaacg gtaaagtaag 11880 catgccggga gttaatcaac gcgttgccag aaaggaaagg aaaaactatg tgccttgtca 11940 cactatttgg aaagaagggg ggataaaata agatgtgaat agtaaaagga aaaggggatg 12000 agacaacctt cccctaccac tatttttttt tattacggga ttaatgtcag tattccgtgt 12060 tagatccagt tggtcatgga attgatgaaa tgaaggcgtg aggaaagaaa agaaaaaaaa 12120 aagaaaattc tcgatccaat ggttggctca cgcctctgtt atttttttgg gggggtatta 12180 tgggattggg agacaattca agatgaatag gagaaaaatg aaaatgacga aaaacgaaag 12240 tttttctttt tcttcgcgta ctgtttctga ccactatgat gctaggttgg cactgtaaat 12300 tacagcggac agaatagtat caaaagccgg ggacagaacg aatgaagtaa aaaaaaaaat 12360 ttggggggat tttcgcaaag gaaaagggag gataattgcc cccctaggcg ttcgcctcgt 12420 ttggctgtca gcgaacgtca gtgggataga tagactgtgt gtatcggtca aacggcccca 12480 ccaacaatga aaggcgctgg cagtttaggg acccttggaa agtgtgacca gcgggtgctc 12540 gcaccttgaa ttccttgggt acagggtcac tgcagcgact cattagagga aagaaaaata 12600 tcaaaaacag aagacaagcc gatatacaga gattcatggc catcgcgata acaaataaga 12660 aaaactcaat ctgggggtta cgactccttt tcaaaaaata cgaaacggga attcagcaac 12720 caccaattag cggaaataaa agcagtggtt gaacacacga cagatgcaag taaatgacaa 12780 tcgttgagtc cccaccacgg caatagcaaa aggaaacgac tcatttttcc tgaaacgttt 12840 ttttttcttc tcatctggtc ccagatgagg tgggtttaaa attaattttt tttttcgggc 12900 ccgagttgtt ggccattaca gaacgtttgt caagtcatcg acgacaggag aaaataaagt 12960 ggggaattac ggtacgaatt ctatcaacct ttattcgacg cgcaaataga tatttccgcg 13020 aaaagattgc aattaagcca aacttcgtcc tcttgactgg aggtttcact acaaaccaaa 13080 caaattggaa ttccagttaa ggctgcgatg aagcgtagca acgaaacaaa aaaaaaataa 13140 caaacaaact caagtagcaa tgagcttgat gacgggaaaa agggaaagga cagtgctcac 13200 acgcacgggg catttgtttt taataattaa aagggggggg ggggatctca gtctctaaag 13260 actagtacgt tactattttt tgattttaaa tgtttttttt ttttccaaca cctggcacaa 13320 ggctatatcc ccagcttacc ccaaagtatt ttagaatcca aaattcgcgg aagaaattcc 13380 tgaacttgct ttgtgcaaat cgtgtttgcc tcccaaaatg aaaaaaacaa ttttttttta 13440 caactgcaaa ccatgttttc tttgtctctc tcctcctttt catttttttt tactcgcctt 13500 tctatctctt tgccctccaa atagtttaag tttgtctatc tacatgtttt tttttctctc 13560 tctctctcat ctgtgctctt cttcacacct tcctccaccc ctcagtaagc taaggatttt 13620 tgacacctcc agtgcgaaaa ttaagatttg taagtgacaa acgttcccgt cggcgacttg 13680 agtgagaagt tcagtgccac agtgccccag cgctcacgct ttcccccgac ggcatcgttt 13740 ttttttcttg tgtgtgtgtt gatttatcaa ttctccaaca tcaactttgc aaacaaacac 13800 tcagaaatgg actcaaaagc ggccaaaacg ttactggcaa acggtaagca catccgtgct 13860 tactttatca ccaatttggc cgtcactttg gtccgtcaaa ccatcgccac ggccctggga 13920 tgcgtcctcc gcgcagtgtc cggtccggac acgtcgtttg tagacccggt gacgtcacgt 13980 gatttcccaa tcctatggga aacgcggatc aacattagct acagcaaccg atcggagacg 14040 gtcgtggcat atgtagttaa ggacgaggag ataggcttcc ccctccttgc aggaatggat 14100 acaattctgg cgttacaggg agccctgcta gttaaaggta acgagaaaat taatccactg 14160 ttcgggacca cagagcctag cgacctagat ccgagaccgg tattggcaac cggcctttgc 14220 tcttcattcg tatcccgcga atgggcagcc gaacaggaag cagttgcgac ccccgtcgac 14280 caggaaaatt tttacgaccc cttaacaaat actacgctgc aactatatgg gatggtagac 14340 gtacaggcca atgacgaccc tacgttcccg gcgttcgtcg ccgatatcgg gacgggcctg 14400 gtgctcagaa taaattattt cgaacgaaca gacgtccaat tgaaattccc agaaacaacc 14460 atatcctggc cacaaacgga aatagtccgc ccgtaccgcc ctcccgctcg ttcaatagcc 14520 ggcccgacgt gcccaaagtg cggcctaaac ggccacgtac aatccaaatg cacgggggac 14580 ccgtgaattt taaaaaggaa ggggaggaaa aacatcgtcg ccaccaggca ccaacccccc 14640 cttacagtgc catccctaga tttcctactt tcttttagtt tcttaattcc ccaatgtttg 14700 cttatttttt atgcttcccc ttattcaggt tgttcccatt tttccccgaa aatgtataat 14760 gtgccacgtt tggcaattcc ctttctattt tccaaacagc tctagccaat atttcccaca 14820 tatccctttt catcttgttt cctttcactc gttccctatt ttttttttct ctcttctttt 14880 ctctctcttc cctcctcctc gtctcatttc tcttcaaatg ttatgttttc tttaagcctt 14940 gtattgcttc caaattaagc cgactccggg agtcgccttc aaagaaggcg gctatgtaat 15000 gccccgcgga gccgacttgc cccgcggtgc ctactaggct agtaggctag ccccgccccc 15060 tccagcgatg gccaccacct ccgcctacgt cacagccgtg acgtcaacac cggcaaccca 15120 tgtcacctcc cgtgactcac actggaatgc catccaccgg aaacgccgac gctgcacgct 15180 gactcggcgg tcagctcaga ataccggccc agctcaaccg gcccgctggt ccagctcaac 15240 caaggcactg gtccagctca accacgaaga tgatccagcg ggatcacttt tggcctcgat 15300 caactcccta tataatctgt atttgtaaca atgtctctct ctctctgtac tcccgcccga 15360 tcgaccagac ctgtaataca tcggctcttc atcttcacgc ttgtctatat ttattctctg 15420 taactgccaa ccgggcgtta caaagcaaca gatttgccaa acgttgccaa atcaagtaaa 15480 tgaggaaaaa acggcatttg aaatttaaag aataacaatt taaacttatt aaaactggca 15540 agtgaataga aataatgaat aaatctttaa taatccaaag atgttaaccg ctaggtacac 15600 aaatttttca ataccatttc aaaccatttt tggcagctag aaaagtatct acacaagtgc 15660 acagtgcaca catcttttga agtatgaacc tataccacac aatcaccccc cttttaataa 15720 aagtttcaat tttttacatc cgtacgtttg aaataaatag gaaataagcg gaaataagtc 15780 aaatatgctg gaaaatggta ctttgcgatg tacccctagt ggctggactc aatagtcaat 15840 ttggcacaga ttgagggttt gagtttgagg gtgaaagtta tggctgtctg tgctttaaat 15900 ttttagagtt tttggcttgc ttgacagcct tgactacact tgttgcaatt ttttgtaaat 15960 aaacaataag gccaactcgg tagtgtattt ttccgtatgg tatgtgtccc cctagcggcc 16020 ggactcaata gtcaatttgg ctattgggtc caggaatccg ggacccaata ggtgcgacct 16080 ttgtttttaa ccactacact gttgttgaat actgtttgat tatcgatagt gctttgtgtt 16140 gcagaataaa attacagaaa tgaccaacaa cagtgaagta cgtacaacag acctcatctt 16200 ccaccgagga gaccatcata gaggtggtag ttcagccacc aactacagag gccaaggttg 16260 aatctgatgc tccccccgct gtgatcacca ctcagccagc acagccaatg cctgaacaca 16320 atgacagcat tgctgtggtc atgtgggatg ccaccgtttc aataaccaca ttggttaaag 16380 aaaaggggaa ctgttcttga agactgccta agtattctct ttgctcgcat tttgttttta 16440 accactacac tgttgttgaa tgctgtttga ttattgatag tgttttttgt tgcagaataa 16500 aattacagaa atgaccaaca acagtgaaga acaacagacc tcatcttcca ccaaggagaa 16560 aatcatagag gtggtaattc agccaccaac tacagaggcc aaggttgaat ctgatgctcc 16620 ccccactgtg atcaccactc agccagcaca gccagtgcct gaacacgatg acagcattgc 16680 tgtggtcatg tgggaagcct ccgttactgg tcgacgcctt atcgacgaga tggtcaacct 16740 gggatcattc tacgtgaaac aggaagggaa agcagaaggg cacgatttca atgagtaggt 16800 catggatcat tgtacgtggc acaggaagag aaagcagaag ggcacgattt caacgagtac 16860 gtcatgccgt attgcaagct accgctgaat gtaattcgca actacaatct tcgcatcgtt 16920 gacacgggtc gctaccgcat gctcaaggag atgcgcaaca acagaatgct caagtacaag 16980 agcgaggtgg tcgcctgcaa cgatttcgcc tcgaggctca tcaagctcca aaaagataat 17040 ttcaccgaca agaaaattgt cttagtcttc tttgaaccac gcaactccaa gatacttcag 17100 ctcttccagg tatgtcgcta tttagtttct atttttagac tcgtttctaa ttttcagttg 17160 gttttttggc tttggaacgt ttccgtcttc ttgatgatgt caacaagatt ttgcttggat 17220 gtgtcaatgg ctgggatctg ctccgtcaac aggtagcaac tgaaatagta actcgtgaat 17280 cggttagatt ttctaacgtt tcaatcgaaa acaggtgcat gatttgaagg atgaacatcg 17340 catggtcctt aaagctttgg ctgatcacca ccttggagtc gacacacctt tggagagcgc 17400 tgttcaacat gctcaatcaa gcacttcaca agatccttcg caaggtgcat ggcggccgca 17460 tcaacactgg ggtacttaag gagaacctca tcagttgcga tcatttgctc gaactagtca 17520 agaagatcaa agaagaattg attaaggagg ctcaatggag acctgtattc gctgacattt 17580 ttcgtcaagg tatcaagcct cgccggcgag ctggattcct ccgccatatg ctcgtcgagt 17640 cgggcatcac ctatgcaggc cttacggaaa cattcaaggt gagtttgttt tcttattttc 17700 aattcttaac aggataattg aaatttcttt tgcaggacaa gaaagatgac ggtgtcgtcg 17760 ggtcatcaaa gagaacatca aaggcaagaa tgaagccgac gtggaggaag tcatcaagct 17820 gctgaatcaa caccttgcgc agccagacaa gcccgtccga cgcggtagca gccgtcgagc 17880 accccgcact cgttcccttt gaaccgccac ggacttggag aagcagcagc ctgcgtccaa 17940 gatcgagccc gaggaaaaca acaacgaccc tgtcaaagtg attatggatg ttattcccag 18000 cactcctgcg gtcctgtccg aagtttaaaa agtccctctc tcctcgactc gcctgaaatt 18060 acttactgcc tccccacttc ccttttgttt ttcacttaac ttgtcttgtt tttgaaaaat 18120 tttgatcttt ttttaagttt acaagtcttt cacgaaacaa acttgcttgt cgcttttggt 18180 tatgcttttc ttcttaccag agataacacc tgcaaaattt ataaatatac cactgaggaa 18240 atttattgtc ttttattttt ttcttctcaa aaaaattctt attcttcatc tcttatgcaa 18300 ctatcgaaca tttctttaag ttgggattct ctaaggcggc ggcgactctt tccttatcgt 18360 tgatttgttt attgactgaa tgactggatg ggaataaagt aaaagaatat tttattaaag 18420 cttttttgaa catttaaata tatggtatta ggacttactc tcttcttctg tgctaccctc 18480 tgcaataact atgtccaatt tgatcctttc caccacattt ttctgaatct tgcctctcag 18540 gcagaggcca ataaggctgg caagagaaca atgaggaact gtaggattga aatctattct 18600 aaccaagtaa ttcctcaaat ggttgtactt gaacagaatt catctgtaat aaccaacagc 18660 ctcttttgac actagagtgc tccggagtgc aataaatttt accataaagg aaaaagggaa 18720 aatcggaaaa tcaaaatata gacttgagga aaactatttt ctaattgttg gaccactcct 18780 atcgggaaac tcgcgcgcgc agcaagggat cggcaatctt 18820 // ID TTAA4C_AP repbase; DNA; INV; 420 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA4C_AP. XX NM TTAA4C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-420 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2073-2073 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 420 BP; 126 A; 77 C; 83 G; 126 T; 8 other; gaggacgtca cgcccgcatg cgttgtctcc gtcttacaaa tgtacaacat agcaaaaact 60 gttttgcgcg ggacaacttt nctccctccg tatttatagt agaattacca aaattccaca 120 acgcataggg aagaacntta tctgtgccgc agcgtttcta ttttttggat aacataaata 180 caattaaagt tattagtttg agaacatttn gtttttttca tttanttata aaaaatgatt 240 ggnaagtgct atcgaaaaaa taaaaacgct gcggcacaga taaagttctt ctctacgcgt 300 tgtggaantt tggtaattct actataaata cggagngagn aaagttgtcc cgcgcaaaac 360 agtttttgct atgttgtaca tttgtaagac ggagacaacg catgcgggtg tgacgtcctc 420 // ID MarinerN-2_AP repbase; DNA; INV; 452 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW MarinerN-2_AP. XX NM MarinerN-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-452 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2068-2068 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 452 BP; 125 A; 100 C; 116 G; 111 T; 0 other; tactctattc agaataagaa ttcccactcg cttgacgaaa actcgtcttg ttcgcgcatg 60 tctgttgcca ccagtcggcg ctgatgtttg ctgtgccgcc gccgccgacg tgccgcaggt 120 gtatacgtag tgaacaataa tatcgcaccg ttttcacgaa gaattgtata cgacgtaggc 180 attgagtttg gtgcagaaca ataaggtaaa agtacaagaa acatggtaaa gaaaaaaaat 240 atacattata acataatagg taataattat tacgatcttg ttattatgat catcatgtat 300 gcacggcggc ggctgtcgat agcgacgggt gacggcagct ttcgacaact ggcgaccaat 360 cgcagcgcag ccgacactcg ccagggggtc tgacggctgc cggcggctca gttgcgtgcg 420 cgaaagtggg aattcttatt ccgaatagag ta 452 // ID Gypsy4-SM_I repbase; DNA; INV; 11254 BP. XX AC Contig1646; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4-SM_I; KW Interspersed repeat; LG_I; internal portion. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-11254 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-11254 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 751-751 (2007). XX DR Genome; Contig1646; Positions 71035 59782. XX CC Positions [6140-6664] - Integrase core CC 'CGTC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 643..1635 FT /product="Gypsy4-SM_I_1p" FT /translation="MMNKDNLQSEKAIMIMSRINKLIELRAAETSKIITSL FT LELSQVSLSVQRLKSVIELCTDYYEVNSKDRDEITNMMKVFGKAFRENKWL FT DFIFIDVVDESRNSVICIRCEIGDYIKDLKDRLKENNGRQIIKVKFKNRYL FT EDDEILIDCGFIPGNKNKLQVACMIREEVAEKSDSQEITRDLRDTLSDWYG FT VKITEKLKLIDKIELEENAKLEIVNIIRPYLYDKSRYGEKLKCLMRRFGRE FT LRETRMELKPFIIAIGYEDWGKTEYIEEHWCAKVQKLRKRAVNRIGNDGII FT ILMFKGEELDDDEYLYECGFKPQELDKNGYIDSNERQFI" FT CDS 3728..5542 FT /product="Gypsy4-SM_I_3p" FT /translation="MEERSNMKPTSKVMSMSHNVLNIIGKLKAELIISNQE FT LVHYVSVMRNSPVGCLIGIDVMEQLQGISLDLKTGRLMKGQTNEKEHKMRE FT VFLNQTLEFPARSEVIYYADVLDVSDEECIFEPNGMLIEKYEIPMCDEVVK FT VKDQTVPVRIVNQNTKPIQMKPKYKLGKLTKFEIEKTPKLSRNIKIYKEPQ FT VISNDLTKLQKWEFTELLKKYKMVFGKHEYDLGLTQVIEHEIPLLDETPIK FT LKPYRIPYIHQEESDRQIKAMKEFKIILDSSSPFSSSIVVVKKKDGDLRFC FT VDYRKLYQVTRKNKYPIPVPQEMFDQMGELQFFTLIDLNKGYNQFKVKEED FT RRKMAFSDGKELYEFNVLPFGLTGAPATFQRDMNFVLRDITHALIYIDDII FT IFSNTFKEHLKDGETVFRRLLKANLKIKPSKSEVVYVGHIISLNGLQPNPE FT NVRKIKEYPIPENVDQVRQFLGLASYYGKFIKNFSHVAQPLNVLLKKQTQF FT KLGKTQQEAFELLKESLIKKPILRCSNFRIPFILMTDASNFALGAVLGQQV FT EGEKDHVIAYASRTLKPHKKNYSTIEKETLALVYGIATFRQYLVGRKFVVF FT TDHNPLQ" XX SQ Sequence 11254 BP; 4568 A; 1419 C; 2030 G; 3237 T; 0 other; ttttggtgag cccgacgtga tctaaagtca gctgagaatt gtgtggaaaa taattaaatt 60 gtttaaaaag aaaaaatctg aaaaaaaaag tttcaaagta aaaataaact gataagagaa 120 taaattattg aattaaaaga aaactttcaa attattgaaa aaccttcgtg attcattacg 180 acagaattta caaagaaaat tattatagaa tacattccaa aattaaaaca aaaggttaca 240 ttaataattg taacatttta aaatacagac gagaaaaagt tgtacaaaaa agattttaca 300 atttgagaaa ccagaagcaa tcagaaataa tggcgagctt tttccaataa taatgatcaa 360 atcctggtgt ctactaggcc gaattcaatg tcattcatgt agaggttatg gacatatgtc 420 gaggtcatgt cccaggaata atgagagatt ccaaaagcca atcttaaaga aaattagatt 480 ccaaagtgac aaattacaag cagactacgt tgcttttgtg cagccagata attctaataa 540 tagaattgaa gacgaagtga aacaaatgag tcgacgtttg gatcgtttaa tatgcaagtt 600 tgaatctgat gaagagacgg ataaatacat gagttgaaaa gaatgatgaa taaagataat 660 ctgcaaagcg aaaaggcaat aatgattatg tcgaggatta ataaattaat agaattgaga 720 gcagcggaaa cgtctaaaat aataacatca ttactggaat tgagtcaggt gtcactgagt 780 gtacaaagat tgaaaagtgt aattgaactg tgtactgatt attatgaggt gaattcaaaa 840 gatagagatg aaataacaaa tatgatgaaa gtatttggaa aagcgttcag agaaaataaa 900 tggttagatt ttatttttat tgatgttgta gatgaaagta gaaatagtgt tatttgtatt 960 cgttgtgaaa taggagatta tattaaagat ttaaaagata gattaaaaga gaataatgga 1020 agacaaataa taaaggtgaa atttaagaat agatatttag aagacgatga aattttaata 1080 gattgtggtt ttatacctgg aaataaaaat aaattgcagg ttgcgtgtat gattagggaa 1140 gaagtagcgg aaaagagtga ttcacaagaa ataacaagag acttaagaga tactttaagt 1200 gattggtatg gagttaaaat tacggagaaa ttaaagttaa ttgataaaat agaactagag 1260 gaaaatgcta aactagaaat agtaaacata atacgaccat atttatatga taaatcgagg 1320 tatggagaaa aattgaaatg tttaatgcga agatttggta gagaattaag agaaactaga 1380 atggaattaa aaccatttat tatagcaata ggttatgaag attggggaaa aacagagtat 1440 atagaggaac attggtgtgc aaaggttcag aagttacgaa aaagagctgt aaaccgaata 1500 ggaaacgatg gaattattat attgatgttc aaaggtgaag agttagatga tgatgaatat 1560 ttatatgagt gtggatttaa gccacaggaa ttagacaaaa acggatacat agattccaat 1620 gaaaggcagt ttatttagtg aattgtgtag tatcaagaaa ggaacagaaa gtagactcaa 1680 tgaataattc ggagaattca aatacagata tagaggacga taacgagaaa aagcagatga 1740 aatttccagt gataagggag aaaatgaatc gaataataaa tttcaggtag aacattcgga 1800 aaaacaagaa atttttgaga cactaagagg tgaaataaat agtgaaataa ttgttaggaa 1860 acctagaact ttatacgaga ttaagcagag tttagtgaga attgaacagg aagttgatga 1920 aacaaaaata atattaatgg gcttgattaa agcagtatac ccgatgtttg aaataaaaat 1980 taatgtagag tatgagttag gaaacatggt aactgaaatg aaaaacagac ttccaaatgt 2040 cgtgaaaaga tttggagaaa acgataagaa attaattgaa agtttgacta aaaatggaat 2100 tgacaaagaa caagtttaac ttatatggtt ttagaaattg gagctcatgc gatagcagaa 2160 acggataatt tggaaaattt gatagatgta gcggcacgaa tttcaaatga atattttata 2220 gagtgggtta aaaagattgg aggttggaag aagttgaata aatatcttac agtaaatgaa 2280 agtgatgaaa aaaagttacc atgggcaata gttatggtct ttatttagtg tacaataact 2340 acacaagcta caggtatgac agcttacgat tgttcccaaa cagaaatggg gaaaatttat 2400 tcgattgtgg acatggcaga atatctggat gcatacccaa agaaattgaa agtagaaaat 2460 gatgtaaatt attatgtgta tcaggaatct gatatacggc gagccaaagt tagagaatat 2520 attgtgaagc gaactacttt tgtatttttc tgtggtaaat tgagccacac atcattgata 2580 aaaatgcacg ttattccacg actagtagag gtaatgccag aaaactgctt agctgcgttt 2640 caaacagagg agttaaaaat taatccggaa attaaattga aagcaaaagt aaatcacaga 2700 ataagagaga cagttgttgt aaagggagta atacaacaag atggaacgtg tgaatgagaa 2760 ccacataaag tgaatgggaa aattattaaa caatcagtgg tgattgaaga atatttgaca 2820 aaattgaggg aatttgaagg agtttttgat gatgaaacag gaaaacttca aatgcatccg 2880 ttttgtaata tgagacaaag taattgccaa accgtagaat ccattctagt atttgatgtt 2940 aaaactaata tatgtaattt agcgcatttg aaaattatga ctttcaaaga gttaagtgga 3000 gaacagttta gaaaaagccc cgataatttc aatataaatt ataaatcgaa tggcacaatc 3060 aagtcgactc cagtatggac gaggaaaacc acaccagtat taatgtcgac attaccaaca 3120 gatagtatgc attttataaa aaaggaggct accaataaat gtgggcaaat ggtttatgca 3180 acaaactaca gaagtgttta cgaatcaaaa atgatggtgc atgaggcaaa aactagaata 3240 cacataaatg atattaaact caatcactat tttaacaata aaattgatta tttgtaccat 3300 tttagtcaaa acattatgga gaacatatat cggaaaatga ttctaaatga ttgtaaattt 3360 aataaagaaa tacttagaaa caggatggca atggcaatta caaatctaag cttagtagca 3420 cctatgctaa tatcagaaaa gtggtattca ccaacgtttt tgccaatggc accaacatat 3480 ccaactgaaa aattgaaaga aatgaaatcg gtagaacata ttcaatttga attagataac 3540 attgaaccta ctagtgctga aattgaacgg aaaaacatgc tcggattacc atgcagttat 3600 gcaatgatga agttaggaaa taaaagcaaa acggcctcga ataaacattc tactcaatgg 3660 aagaaaagtg gaagctttat taaattcagg ttccagtaat actatagaat cagatgattt 3720 gttgacaatg gaagagagat caaatatgaa accaacaagt aaagtaatgt caatgtcgca 3780 caatgtttta aatataattg ggaaattaaa agctgaatta atcattagta atcaagaatt 3840 agtacattat gtgtcagtaa tgagaaatag tcctgtcgga tgtttaattg gtattgatgt 3900 gatggaacaa ctacaaggaa tttctcttga tctaaaaact gggagattaa tgaaaggaca 3960 aacaaacgaa aaagaacata agatgagaga ggtattttta aatcagacgc tagaatttcc 4020 agcaagaagt gaggtaatat attatgcgga tgtgttagat gtttcagacg aggagtgcat 4080 atttgaacca aatgggatgc taatagaaaa atatgaaata ccaatgtgtg atgaagtggt 4140 aaaagtcaaa gaccaaacag tgccagtgag aatagttaat caaaatacaa aacccatcca 4200 aatgaaacca aaatacaaac tgggaaaact aacgaagttt gaaatagaga aaacaccaaa 4260 attatcgaga aatattaaaa tctacaaaga gccacaagtc ataagtaatg atttaacaaa 4320 attgcaaaaa tgggaattca cggagttatt aaagaaatat aaaatggtgt ttggaaaaca 4380 cgagtacgat ttgggactca cccaagtaat tgaacatgaa ataccattac ttgatgaaac 4440 tccgattaaa ttgaaaccgt atagaattcc atatatacat caagaggaaa gtgatagaca 4500 gataaaagca atgaaagagt tcaaaataat actagatagt tcttcaccat tctcatcgtc 4560 aatagtagtt gttaaaaaga aagacggcga tttgcgattt tgtgtcgatt atcggaaatt 4620 atatcaagtt acgcgaaaga ataaatatcc tattccagtt ccacaagaaa tgtttgatca 4680 gatgggagaa ctgcaatttt tcacattaat cgatcttaat aaaggttaca atcaatttaa 4740 agtgaaagaa gaagatcgaa gaaaaatggc tttttcggat ggtaaggaat tgtatgaatt 4800 taatgtgtta ccgtttggat taactggagc acctgctacg ttccagcgag atatgaattt 4860 tgttttacgt gatatcacac atgcattaat ctatattgat gacataataa tattttcaaa 4920 tacttttaaa gaacatctaa aagatgggga aactgtattt agaagattat taaaggctaa 4980 tctaaaaata aaaccgtcta aatcagaagt ggtttatgta ggtcatatta tttctctaaa 5040 cggattgcaa ccaaacccag aaaatgtgag gaaaattaaa gaatacccaa tccctgaaaa 5100 tgtagatcaa gttcgacaat ttctaggttt agctagttat tatggaaaat ttattaaaaa 5160 cttttctcat gtagcacaac ccttaaatgt attattaaaa aagcagactc aatttaaatt 5220 gggaaaaact caacaagaag catttgaatt gcttaaagaa agtttgatta aaaaacctat 5280 attgagatgt tcaaatttta gaataccatt catcttaatg acagatgcaa gtaattttgc 5340 gttgggtgca gtattaggac aacaagttga aggtgagaaa gatcacgtaa ttgcatatgc 5400 cagcagaaca ttaaagccac ataaaaagaa ttattctaca attgagaaag aaacattagc 5460 attggtttac ggtatagcaa cgtttagaca gtatttagtt ggtagaaaat ttgttgtatt 5520 tactgatcat aatccattac aataattgat aaaacatcgg gattcggcca gcaaactgat 5580 ccgatggtca atagccctac aagaatatga ttttgaaatt aaatatcgaa caggcaaatc 5640 aaatggaaat gtggatacgc tatcgagaat ccctgtaaat aatgagagta agaataatga 5700 aacaaattat aataatacaa tatttatggc aatcaaaacc gctaatagtt tgcagtaact 5760 gcaaaaagat caggaaaacg atgatgaatt aaatcaattg cagaaacatg cagtcacgat 5820 caaaaaggag agtgactgga aacatgaagt tgacgaaaaa cataaatata tcattcatga 5880 tggtttgtta aaatgtgttg aaaatgaaaa cttattaatt gtggttccaa caaaacatcg 5940 aacaacatta atgttgatat attatgatgg aacactaggt ggacatttat ccgctcggaa 6000 aacgttatca agattgaaac aaaaatatta ttgggacact attgaatcag atgtcagaca 6060 atggtgtcat acttgtgaag tgtgcttaac caaacgaaat aagggaaaga aaataaaagt 6120 atctttgaaa cctatgccag tgcccgaagt gccgatggaa attactgcaa tggacattgt 6180 tggtctatta cctgaaacaa taaatggtaa caagtatatc attgtgttct gtgattatct 6240 gactaaatgg cctgaagcat atgcgatgca aattcgaaaa gcggaaacaa ttgcaaagat 6300 atttgtagaa aacattattt ttcgatatgg agcaccgaag aaattactta ctgatcaagg 6360 aacgaatttc caaagcaaac tgtttatttc aatcacggaa ttattcggga tactgaaact 6420 tagtacatcg ccttatcatc cacaaacaga tggtctagtt gaaagattta atggaacatt 6480 aatcaatatg atatcatcct atgtaaatag aaagcaaaaa gattgggatt tgtttatcaa 6540 gccatgctta tttgcgtatc gaaatgcagt gaatgagaga accagtgaaa catctttcta 6600 tctcatgttt ttacggcgga acaacatgcc ggttgattta aaatttcaag cacctatctc 6660 acagtaatta tgcaagaaaa acttcaatca gtatggactc aagctgggct taaaataaga 6720 catcgtcaag aagcgtataa agagtgctat gataaacgag cacatgttca caatattgaa 6780 gtaggagatt tagtgtatgt acataaacca gaaccagcaa agggtcgatc accaaaatta 6840 cagagaccat ttaaaggccc atatatatat atatatggat accactgaaa ctaatctaaa 6900 attacgtcca caaaataata aaaaggcagg aacaattatc gttcatgcta attgatgtaa 6960 ataggtgcca gctacagagg atactaagta tcaattgaga tctaaagaca caccgcgctc 7020 aaacaaaaaa tagcgaaaaa tcccgcacca ttgggatata tcaacaaaat tttgttcgtt 7080 acaacactta tgataatcaa tgtgttaggg tcgaatgcga ctccattgtt tccaaagtac 7140 gccattaaaa tgatcccatt aatggaaaac gttgatattt atgccacatg tacccagatg 7200 gtatggatct atcgttgtgc acctaccgaa caaatagaag taagtaattt actgaatatt 7260 ttgaaccccc agtcaagttg aattttaata aaaatactta tgtaaatcgt caactaacaa 7320 taaattatga atacccatgg ttaataacac aagtgcaacc ccaagcgaat agtactaaaa 7380 taataaatct gattcatatt gaaatatgga caaaaaatct aaataaccat gtttccatag 7440 aaccattgct gaatggaatc aatatatatt gggaaccccg aaataatttt tcagaattat 7500 acaatcgata cgttgaatgt tagaaagttt tttgtgcatt atgcaaatct aaaactgcat 7560 catacctatg atatcgcaat tcactactac gaaaaagtat tgcaagataa tggtacagag 7620 agtaatgacc ggtgggtcaa gatctatact agtcgaaaat caatattgtt gcaaaacgaa 7680 acaaatcgag ttcatttaca gctagttaac caaaaactaa acatttcacg gaaagccggt 7740 attaatgaca aagttgtgat ttattattgt gggaaaatcc atgccacgtt aacaagtaac 7800 tttaccttta aaatgtcaaa ttttcagcag atacaatcta aatcaatttc aaaaattgtg 7860 atgataaaaa tcaagtatta caaacgaata tcaaatttca atccatagga aatcattttt 7920 gtcaaaacaa gattcattcg agttgtatat aattaaacga aactcataaa gaacacgaat 7980 tattctcata tctctaatgt aaatattcga aaacaaaaag caatattatc ttcgttcaat 8040 gaccaaatgg ctaatatgca aattcgtgag aacaatacaa catcatcttt atctgataac 8100 aatgaattca tatgtgtact tttatcgatc aatacaatca gtgtgatatc atgttctatc 8160 atttttcttc tcttattgct agcatatctt cgaaaactaa agcaacgact tgaaagtcaa 8220 ccatggacca aaaccttgga ttgtctatga ggaaacaatc gtgtattgag atacatttat 8280 tttgcatgca attatgataa tgtagtagta cagtataagt attaactttt tatttgggat 8340 aatgtaaaga tttgcacctt tcgatcatag atgagtttga ttggctgttg agaacaaaga 8400 aggggaagca gaatttgcgc aggtaaaaag attttgggga aaaggaaatt tatcaagggg 8460 tatataaggg ataaaaaagt aaataaaatt attataaaat tcaaattaaa ataggtaatt 8520 aatttttatt aatattgtta ttgataccgc taccagttaa ggcagtatcc cacattgatt 8580 ctagcgtaaa cttttaaatt caaaaataat taactaactc ttagatccca aattatgtga 8640 gagtggatga caggaaaatg aagatacaaa tattcagatt gaatagaaaa tcaggaatta 8700 aattagctga ggaaacaaat gcaacgaaaa aggaagagtt catgcttcac attatcaagg 8760 gaaatgtaga aatggtcgac gacaataaga agatatggtg ggaaatacat caaggtttgt 8820 catgaaaaat gcagaaagaa gcgatgacca aaaggttgag agcaacaaat gaaaatgcac 8880 tgattgtgtt agtaaccaga attaaaaacg aagctcaaga agtacagcag atgataatac 8940 ctgtgaatga aaaattgtta gcaatcactg aacagatgct aaagaaaaga atgccatttt 9000 taagaaaaga aatggatcaa gagaagataa gaagcaaggt cttgttcatc gatgtggagt 9060 atgggacatc aaaaagtaga cagatggttc cgctgtctat tgcgataatg aattattatg 9120 gcaagacaat aaaggatgtg tatctaactc caagaggaaa attgtggcat tttaattcac 9180 aaattcatgg aataacggaa gtcgcaagta ggaatcaata tgaccagtat ggaattatga 9240 aggaagttca aaaattggtc gttggaacaa tattaattta ccaggagata caatacttaa 9300 aactttgaat agaccagttg catggcattc gagatttggt tacagcgcag gcattggaaa 9360 aggttggaat gtctaggaga ggagaatttt tcaaactgaa aactgttgtc caggagatgt 9420 ttggagatca aatgcaagtg ggcatacact aagccctcga agatgtaaag aacataagaa 9480 gagtatattt gaaaattgag aaactgtgga tcgatgatat cgaaatacca caaaagcaac 9540 cgtcaaggaa atattcgttg gaagaatatc ggaagcttta caacaccgcc tcaacatcaa 9600 ttacccaaat tacggaagaa attgacctca cagcagttga ggatatggaa atcgatcttg 9660 aaattcaaac tgttcaattg aaataggcaa cagaagagag cattgaaata attgaggaga 9720 gtctgggtct aataatcaat gaaccgattc atcgaaccta attaatattc gagcatgggt 9780 caattatcta aaaaacataa aaaaggaaac cttcattgga ttctttggag gttgacagtg 9840 gagaaactat cacaataaat cagcttacgt atcgaaagaa gaaaactgtt attcatcata 9900 aatgtggatc taaaacttga aatggaataa aaattaatgt tatttatatt atttatttat 9960 aaaatattaa tttaataaat ctcgttctct tgatttacgt cgtccaataa aaaaataaat 10020 aaataaaaaa aagcatgaac aatgacacaa aagtatgtaa gacaaagatt ctaaattcca 10080 gggaaaattg tatacttatt atcattataa gtatattggt gatattggca ttattattat 10140 tgaggctatt atttttaata ttatcgttgc tattatcagt tcttgagagt ctcaaatctc 10200 gaacttcgtg atttaagacg cacaagaaga accattcccc atgcctgcag cctggcgtaa 10260 caagcaggag gagtggaact ttttaatatt taacttatat aaatcatcca taaatagtat 10320 ataattttaa aagaagaaca tttgcatttt tcgaaaaatg taatattgta ataagtaaac 10380 gaatgcttta aacatttaaa gtttaaaaaa aatatatcac atccctcata agacacacat 10440 ctatcagtgc tgatgtcata gaacctcctg gagcatcaaa aaaagttatt ataataaatt 10500 ttaaattagt tttaaataat taaattggtt ttaaataaga taaaataaat taaataaaaa 10560 aaatatatat ttcaattttc aatttatttc aatatgtatt tcagctccaa aaataatatt 10620 gagatcattt atggagaatt tgtagattca ctcacaaata taataattta ttgcaatttg 10680 aactattaaa ttatacattg aaatattttt aaaatatttc tatctaaatt ttgcatttat 10740 attaaatcaa gtttaaaatt tcgtaactat tttttttaat gtagataaag taaactaata 10800 aatcttgtgt ttattggata tattttatat ctatatgtac gaaagacatt tttaacaaat 10860 tatttctttt acctaataaa tataaactgg ataatataaa tttgaaaata taaataaaaa 10920 aatataataa aataccaaaa tctttttaat catattatac gcgaaaagaa atgaaaatat 10980 taatataaaa aaacattaat ataatatgat aaggaagtgt caaagttttt aaattttgat 11040 ttttttgaag atattattca ctttattatt attattatta tgatgatgat gatgattact 11100 attattatta ttattgttat cgttattgtt aatattgttg ttgttagtgt tattattagg 11160 taatatggta ttgagattac tctaccagga caagtataaa aaatataatc aaatgaatag 11220 attagaaaag gaatctttat taaaggagaa gata 11254 // ID I-3_CQ repbase; DNA; INV; 6457 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE An I non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6457 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 109-109 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. It is phylogenetically close to Loner elements in CC Anopheles gambiae. XX FH Key Location/Qualifiers FT CDS 641..2359 FT /product="I-3_CQ_1p" FT /translation="MDPDEDQVEKMSESEDWDTDNEGEVDPLPGVPSNQTA FT LELYGKDTKVVVKGTGGRKAKKRKSRLSPKPYPKPDPTSPPTQKPDPALPL FT NPSPLVPTNPTPIPTNPTPPIPPNTNPIPPAPTPPIPPNPNPIPPAPTPPI FT PPNPNPPGDKERPVQARARQYVGGSQTEWVVFFRPKQKPLNFVRITKDLHN FT HYPGLVQCTKLNKSKLSVIVNSAEEANQIVKDIRFCVEYRVWIPAHKVEID FT GVVTDDSLSLVDLSRAVGRFKNPKLPAVEVLECRQLGNVTTEGGQKKFIPS FT ASFRVTFAGSALPDYIELYKLRLPVRLYVPRVMSCENCQQLGHTKTYCSNK FT SKCSKCAGPHKDVECQKQAEKCLLCGGEPHKTRQCPKYKEREDKMKRSLKE FT RSKKSFAEILSQANQSNRFAPLADDSGDEENEEEVLFRRDEESDSSGPNPK FT RNKVSKSTKAAGGKGKESDSLNFEEDFPFGPSGKPAPKPTPKPAPIPLKPL FT KPVPKLPKIPLKPVPQPNPITDAFKGVLPFSTIVEWLCSFVSEPTRLIIKR FT FEPLARHIGKQLAXTMPLLSFISFDG" FT CDS 2355..5924 FT /product="I-3_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MGNTPKTKNGISFLQWNCRSLTPKLNDFQQFAFERDC FT DVFALSETFLIGDETPAFRGYNIITESRAGPNRGGGVLIGVRKCHTFRKVT FT LPKLGGIEAVAVQARIRGLDICIVSVYVPPQVSLTRQKLWGMLELLQAPRL FT VLGDFNLHSTEWGSHKTDPQAYLIHELCDDFQMTLLNRDKQVTRIATPPTP FT TTSGTKASAIDLSLCSNSLAVLCDWTVLQDPRGSDHLPIAILVSHGPQQST FT TVEMPYDLTRNIDWKQYAEAVSIGVELRAGLAPLEEYAFLANTIYDSALQA FT QTKPVPGPYIRTRPPLPWWDKECTDLSAARSVAYVDFCKCGSPANFQRYRA FT LDRRYSNTLKRKKRMYWREFVEGLPADTSMSTLWRVARAMRRGSVPNESVE FT KSDKWILDFAKKVCPDSVPTQPSFDVVDGSDSNPPFSMVELSIALLTGNNT FT APGPDKIKFSLLKNLPDVAKQRLLXLFNTXVEQNVIPHEWRQVRVVAIQKP FT GKPASDHNSYRPISLLSCLRKLLEKMILDRLEPWMEREHLLSDTQYGFRRG FT KGTSDCLALLATGIDIARGKKQQMASVFLDIKGAFDSVSIDVLGVKLRRSG FT LNPTMCNFLINLLSEKHMHFVQGDLAITRISYMGLAQGSRLSPSMYNFYVS FT DIDDCLTGDCTLIQYADDAVVSIAAKKEVDLLEPLQDTLNNLAHWAVGKGI FT EFSPEKTELVVFTGKHEPPQLNLSLSGKTIEQSDHFMYMGTIFDQKGTWGK FT HINYLKQKCLQRTNFLRSVSGNRWGAHPSDLLRLYKTTILSVLEYGSFCFQ FT SAAKSRLLVLQRIQYRSLRIVLGCMHSTHNMTLEVLAGVLPLRTRYYELSY FT RFLIRCDYRNKLVIDNFETLLNLHVESRRMLLYYDFMSSWEWSPSTTVQRT FT PPLTSSTLIGFDTSMKADTRGIPNHLLRGVVPSIFASKYNHVPPNNRFFTD FT GSKLDGCTGFGVYHESYQLFFKLKDPSSVYVAELAAVYCALRIIETMPPDH FT YFIFTDSFSSVEAIRSLKPTRESAFFLTEIRKTLNNLAAQSFSITVVWVRA FT HCSIPGNEKADLLAKKGAESGDIFERPISLQECYGLPRQRALLDWQKQWDD FT DTKGRWMYSIRPKVQRKAWFKGMDLTRGFIRTMSRLMSNHYSSKAHLYRIN FT MSDTNLMRLWAGLSGHRPSRMGVSRSPGSQN" XX SQ Sequence 6457 BP; 1602 A; 1717 C; 1645 G; 1489 T; 4 other; cattctcctg tcaaactcgg tacgagtcgg tcgtgttgac gagttgtctc ttacgcgagt 60 tttttagcgc gatatttcgg cgttttcgac ggcgatttcg ccgttccggc ggcgcccttg 120 caggtggtgc gaggcaccac gttttggacg cagcggacac ggcgacggcg cagaagcggg 180 cggcggttcc ccaaggcagc ttcccgagtt cccgctaccg tgaggcggca gcaggaggag 240 ggcgcgcggc ggcagacgcg cgggcggaga cgagcggtgg tcgcaggagg cggccccgga 300 gagtcgactg cggtcggcgg accagttttc ggccaggcca ggtcggagac aaccccggtg 360 tttgtgtgtg gctggcagca ggcagcacaa caacaacaac aagcgcgcga aaacgagctt 420 cccgcgcatc ccgctgcggg cggcagccag tgcggccagt tcggcggtga gtgtgaaagg 480 cgagtttttt gtttgctttt agtttttttt ttttcttttc tcatagtttt tttttgttag 540 tcatattttt tttctcttct tccttgggtt ttttttttct ttttcgcgta aatcttgatt 600 agcttttgta ttagtttttc tttcgttttt cattccaacc atggatccgg atgaggacca 660 ggtggaaaaa atgtccgagt ccgaagactg ggacacggac aacgagggtg aagttgatcc 720 tctcccagga gtgccttcta accaaaccgc gttggaactc tacggaaagg acaccaaggt 780 cgttgtgaag ggcactggcg ggcgcaaagc gaagaaacgc aagtctcgac tgtctccgaa 840 accataccca aaacctgatc ccacttcccc tccaacacaa aaaccggatc cagctcttcc 900 actgaatcca agtcctctgg tcccaacaaa tccaactcct attccaacga acccaacccc 960 tccgatcccg ccaaatacga atccaattcc accggcacca acccctccga ttccgccaaa 1020 tccgaatcct attccaccgg caccaacccc tccgatccct ccaaatccaa atccaccagg 1080 tgacaaggaa agaccagttc aagctcgcgc tcgccagtac gtggggggct cgcagacgga 1140 gtgggtggtc ttcttccggc ccaaacaaaa accccttaac ttcgttcgga tcacaaaaga 1200 tctgcataac cattatcctg ggctggtaca atgtactaag ctgaacaaga gcaagctcag 1260 cgtgatcgtg aactccgcgg aggaagctaa ccagatcgtg aaagacattc ggttctgtgt 1320 cgagtaccgt gtttggattc cggcccacaa agtcgaaatc gacggcgtgg tgaccgatga 1380 cagtctgtcg cttgtcgacc tctcaagggc ggtgggacgc ttcaagaacc ctaaacttcc 1440 agctgtggag gtactcgagt gccggcaact gggcaacgtc accaccgagg gtggccagaa 1500 gaagttcatt ccttctgcct cgtttcgggt gacttttgca gggtcagctc tgccggacta 1560 cattgagctg tacaagcttc ggcttccagt tcgattgtac gttccgcgtg tgatgagctg 1620 cgaaaactgc cagcagttgg gacacaccaa gacctactgc agcaacaaga gcaagtgctc 1680 aaagtgcgca ggcccccaca aggacgtgga atgccaaaag caggctgaaa aatgcctgct 1740 ttgtggcggg gaaccgcaca aaactcggca gtgcccaaag tacaaggagc gcgaggacaa 1800 gatgaagcga tccttgaagg aacgctccaa gaagtccttc gcggaaatcc ttagtcaggc 1860 gaaccaatcc aatcgtttcg cccccttggc agatgattcg ggggacgagg aaaacgagga 1920 ggaagtcctc ttccggagag acgaggaaag tgactcttcc ggaccaaatc cgaagcgcaa 1980 caaggtttcc aagtccacaa aagccgctgg cggcaagggt aaggaaagtg actcgttgaa 2040 cttcgaggag gattttcctt tcggtccgtc gggtaagccc gctccgaagc cgacaccgaa 2100 gccggcaccg attcctctca agccgctgaa gcccgttccc aagctgccaa agatccccct 2160 aaaaccggtt cctcaaccga atccgatcac cgatgccttc aaaggagtcc ttcctttttc 2220 cacaatcgtg gaatggctgt gctctttcgt gagtgaaccg acgcgtttaa tcataaagcg 2280 gtttgagcct ctcgctagac acatcgggaa acagcttgct ascacgatgc ccctcttgtc 2340 gttcatttcc tttgatgggt aatacaccaa aaacgaaaaa tggcatctcc ttcctacaat 2400 ggaattgtag aagtttaacc cccaagttaa acgattttca acaattcgca ttcgaacggg 2460 actgtgatgt gtttgcgctg agcgaaacgt ttcttatcgg agatgagacg ccagctttcc 2520 gggggtacaa catcatcaca gagagcaggg caggaccaaa tagaggtgga ggcgtactga 2580 ttggggtcag gaaatgccac acttttagaa aggtcacgct cccgaagcta ggaggaatcg 2640 aagcagtcgc agtacaggcc aggatcagag gactggacat ttgcattgtg tccgtctatg 2700 ttcccccaca ggtgtcgtta actcgacaaa agttgtgggg tatgcttgag ctgttgcagg 2760 caccgcgact ggttctgggt gactttaatc ttcacagcac cgagtgggga agtcacaaaa 2820 cagaccctca agcatatctg attcatgaac tgtgcgacga cttccagatg accctcctaa 2880 acagggacaa gcaagttacg cggattgcaa caccaccaac gccaactacg tccggtacga 2940 aggcgagtgc catcgacttg tcgctctgtt ccaacagttt ggcagttctc tgtgactgga 3000 ccgtgcttca agatcctcgt ggcagtgatc atttgccgat cgctattctg gttagccatg 3060 gcccacagca atcgacaacg gtggagatgc cttacgatct gacgcgaaat atcgactgga 3120 aacagtacgc ggaggcagtc tccattggag ttgagttgcg agctgggttg gcaccgcttg 3180 aagaatatgc gtttctggct aatacgatct atgacagtgc gttacaagct cagacgaaac 3240 ccgtcccggg accatacatc cgaacgcgcc ctccacttcc ctggtgggac aaggagtgta 3300 cggatctctc ggcggccagg tctgtagctt atgtggactt ctgcaagtgt ggatccccag 3360 cgaacttcca aaggtataga gctcttgatc gcaggtacag taatactctg aagcggaaga 3420 agcgaatgta ctggcgggag ttcgttgaag gactaccagc agatacgtcc atgagcactc 3480 tgtggcgcgt tgcgagggcc atgcgaagag gttccgttcc gaacgagagt gtggaaaaat 3540 cggacaagtg gatattggat ttcgccaaga aggtgtgccc ggactcggtg ccgacacaac 3600 catcattcga cgttgttgat gggagcgatt ctaatcctcc cttctcgatg gtagagctct 3660 ccatagcact attgacgggc aacaacactg ctcctggtcc ggacaagatc aagttcagct 3720 tgctgaagaa tcttccggac gtcgccaagc agcgtctgtt gaawctgttc aacacgttsg 3780 tggagcagaa cgtaattcca cacgaatggc gacaggtccg ggtggttgcc attcaaaagc 3840 ccggtaagcc ggcgtccgac cacaactcgt atcgtccaat cagcttgctg tcgtgtctac 3900 gcaagttgct ggagaagatg atcctcgacc gcctcgaacc atggatggaa cgagagcact 3960 tgctgtcaga cacgcagtat ggcttccgga gaggcaaggg aacaagcgac tgtctagcct 4020 tgctggccac agggatcgac atagcccgtg gtaagaaaca gcaaatggct tctgtctttc 4080 tagacatcaa gggtgcattc gattcagtct ccatcgatgt gttgggagta aagctgcggc 4140 ggagcggtct caatccgacg atgtgcaact tcctcatcaa cctgttgtca gaaaaacaca 4200 tgcactttgt gcaaggtgat ctggcaatca cccggataag ttacatgggt ttggcacaag 4260 gctcacgcct tagcccatcc atgtataact tctacgtcag cgacatcgat gattgcttga 4320 ccggagactg cacccttata cagtacgcag atgacgcagt ggtttcaatc gctgccaaga 4380 aggaagtcga tctgctagaa cccttgcaag ataccctgaa caacttggcc cattgggcag 4440 ttggaaaggg tattgaattc tctccggaga agacggaact ggtcgttttt acggggaagc 4500 atgaaccgcc gcagctcaat ctttctctct cgggaaagac tatcgagcag tcggaccact 4560 tcatgtacat gggtaccatc ttcgatcaaa agggaacatg ggggaagcac attaactacc 4620 tgaagcaaaa gtgcctgcaa agaactaatt ttctgcgcag tgtctctggc aaccggtggg 4680 gtgctcatcc ttctgacctg cttcgactgt acaagacaac gatactctcg gtgctggaat 4740 atggcagttt ctgcttccag tcagctgcga aatcacgctt gttggttctc cagcggatac 4800 agtaccgaag tcttcgcatt gtcttgggat gcatgcactc aactcacaac atgaccctcg 4860 aggtcttggc gggagtgttg cctctgcgaa ctcgttacta cgaactgtct taccggttcc 4920 tgatccggtg tgattacagg aataaactgg taattgacaa ctttgaaacg ttgctgaacc 4980 ttcacgttga gtctcggcgc atgcttctat attatgactt tatgtcatcg tgggagtgga 5040 gcccgagcac aacggtacag cgcacgcctc cgttaaccag cagcaccctg attggcttcg 5100 acacgtccat gaaagccgat actcgcggta tcccaaacca tcttctgcgg ggagttgtac 5160 cgtcgatctt cgcatcaaag tacaaccatg ttcctccaaa caacagattc ttcaccgatg 5220 gctccaagct cgacggctgt acgggcttcg gtgtttatca tgaatcttat cagctgttct 5280 ttaagctgaa ggacccgagt tcggtttacg tcgcagagtt agccgcggtt tactgcgcac 5340 tgcgaatcat cgagaccatg ccacctgacc actacttcat cttcaccgat agctttagct 5400 ctgttgaggc tatccggtct ctgaagccga ccagggagtc tgcgtttttc ctcacggaaa 5460 tacgcaagac tttaaacaac ctggcggctc agtccttcag catcacggtg gtgtgggtcc 5520 gcgctcattg ctcgattccg ggtaatgaga aagcggactt gctcgccaag aagggtgctg 5580 agagtggaga catttttgaa aggccaatta gcctacaaga atgctacggt ttgccgaggc 5640 agcgtgcgct tctggattgg caaaaacaat gggacgacga caccaaggga cgttggatgt 5700 attccatacg acctaaggtg caaagaaaag cctggttcaa gggaatggac ttgacgcgtg 5760 gattcatcag aacgatgtcc cgccttatgt cgaaccatta ctcgtccaag gctcatctgt 5820 accgtatcaa catgagtgat acgaacctta tgcgactgtg ggcagggtta tcaggacatc 5880 gaccatctcg tatgggcgtg tccagatcac caggctcaca gaattaagtt gaaagatacc 5940 ctcagggccc gaggaagacc accagaaatc ccgatgcgag acgcgctatc tcaactagac 6000 cttgatgttc tctatcctat ctatcagttc ctctcagatt ccaaaatatc tatttagttt 6060 tcttccagtt agttttcctt cagttagttt tcctccagtt agtataactc gccttcacac 6120 aaagccgtcg tttctggatt gcaccggaag caaagcaaga wcgccccagc agctggacac 6180 cgagttgcgg ctgaggacaa aacgcaaagg ccagcagagc caaccaagac ccctacacaa 6240 tcccccttcc catcccacgc cttgtcttta acatcaatcc cttccctact aaccccgagt 6300 aggccgcggg taatcggctc ccctcccact aacatttaca cacaagatcc tctgtaatta 6360 ttaagttttt tgtaattaca aaagccgact cggtcctaac caggtcccag taccgaaaag 6420 gacctaataa aaataatttt atgaaaaaaa aaaaaaa 6457 // ID hATx-22_SM repbase; DNA; INV; 2960 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-22_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2960 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1857-1857 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 616..2595 FT /product="hATx-22_SM_1p" FT /translation="MNTFTEHRKGQDEASVWNYFLVEKKGQVAKCKKCGKE FT IKCTGGSTSGLHTHLKTIHGLNLKKKCEFTNDVESCVSKQPRLLITNYFKN FT NKNEQLSEVLARMTARDGLSFNIFIKSHDIREGLKARGFKNIPKSAVSIRN FT MVLEYSKIIKQSTVQEISKLKSEGKRFSITFDEWTSIRNRRYININVHSVG FT GTMWNIGLVRAYGSVPAEKCLSLVEKKLRDFNLDISKDIVGVTTDGASIMV FT KLGKTIEAEHQKCMAHCINLAIADVLYKLPTNLSEDVDFVEQINGDDDIIS FT VDDNDNDIDEENDNGIIIEEYASITKNQLNIRHVDISSLVSKIRNVVKTFK FT YSPSKNDKHLQKYVKEQLGKESALKLDCKTRWNSMLSMLQKFEDIKFCIQK FT ALIDISSDIHFSDKEFNLLSSIIAALTPVKLTVDALCRKDANLLTADASLT FT FMLDTLSQQNSAIALELLEALKSRILLRRTELSQILQYLQKGVQDVGEVFK FT RVSKFTIIKKIVSLTKRLTHENPEDTEDMEVVQETSLPISCKVSSETALDY FT NMVSIKTLKKQLQTAIEETLNTSPALACKKVQSLEQKIEDEITFFEKTGTK FT GIYLQNVYDFLMTIVPTSVESERAFSAAGYICTKIRSSLDDESIDCLCFLR FT AYFNKKNASYQ" XX SQ Sequence 2960 BP; 1095 A; 413 C; 495 G; 957 T; 0 other; ggcttccaat accgggatcc cgggatctcg aataccggaa tcccggacaa ttttctgacc 60 gggattgaaa accggtatca gcgtagaaat accggtattt cggtattata acagttgata 120 aaatgggtat tcatgcgctt tattttatat tggaaaataa tcacaaaaat ttacaataaa 180 attttcttgg aagaaaacaa aaaattttgt cggattaaaa tctacatcgg gtacatttat 240 tacacatact gagccgatta tgtcaacatc gtgtaatagc gctgctgggc atattaagct 300 gatttttatg gtttatcttt atgtccgggg aagagtcctg tttagacaaa taaatgatat 360 gttaaaataa gaaaatataa aatagtcgga ttaagatcct ataaatttac aaattagctt 420 taatatttgc aataaaggtt gtattttata ttttctcctt tttttctaat aattaaaaat 480 agcataacgt atttaaatac tgattaatct atatttatta ttcgataagt tactatatat 540 ttacattatg tatgttggat tgtaatattt taaatattaa tattgataaa aataaattgt 600 tatcacccct aaaaaatgaa tacatttact gagcatcgaa agggacagga cgaagcttcg 660 gtttggaatt attttttagt tgaaaaaaaa ggacaggtcg ctaaatgtaa aaagtgcgga 720 aaagaaataa agtgcacagg aggatccaca agtggccttc atactcactt aaaaacaata 780 catggactaa atcttaaaaa gaaatgtgaa tttactaatg atgtagaatc atgtgtttct 840 aaacagccca gattgttgat aacaaactat tttaagaata ataaaaacga acaactttct 900 gaagtgctgg ccagaatgac ggctcgagat ggtttgtctt ttaatatttt tataaaatcg 960 catgacataa gagaaggttt aaaggcaaga ggatttaaaa atataccgaa atcagcagtc 1020 agtattagaa atatggtact tgaatatagt aaaattatta aacagtctac agtgcaagaa 1080 atttctaaat tgaaatccga ggggaaacgt ttcagtatca catttgacga atggacgtct 1140 attcgtaatc ggcgctatat aaatatcaat gtacattctg ttggaggaac gatgtggaat 1200 atcggattag tgcgcgctta tggatccgtt ccagccgaaa aatgtttatc cctcgtagaa 1260 aaaaaattaa gggattttaa tttagatata tctaaagata tagttggagt aactactgat 1320 ggtgcatcaa ttatggtaaa actaggaaaa actattgagg ctgagcatca aaaatgtatg 1380 gcacattgca taaatttagc gattgccgat gttttgtata agttacccac gaacttgagt 1440 gaggatgttg attttgtgga gcaaattaat ggtgatgatg atattattag cgttgatgac 1500 aatgacaacg atatcgacga agaaaatgat aatggaatca ttatagagga atatgctagt 1560 attactaaaa atcaacttaa tattagacat gttgacatat catctctagt ctcaaaaatt 1620 cggaatgtcg taaaaacatt caaatattca ccttctaaaa acgataaaca tttacaaaag 1680 tatgtaaaag aacaactagg caaagaatct gctctaaaat tagattgtaa aacaagatgg 1740 aacagcatgt tatctatgtt acaaaaattt gaagatatta aattttgtat tcaaaaagct 1800 cttattgaca ttagttctga tattcatttt tctgacaaag aatttaattt attgtcttca 1860 attattgcgg ctcttactcc cgtaaaatta acagtagacg cactatgccg gaaagatgca 1920 aatcttttga cggctgacgc ttctttaaca tttatgctag acactctttc gcagcaaaat 1980 tccgcaattg cattagaatt attagaagca ttgaaatctc gaattttact acgacgaacg 2040 gagttatcac aaatacttca gtatctgcaa aagggagtac aagacgttgg agaagtattt 2100 aaacgcgtat ctaaatttac aatcataaaa aaaattgtga gcttaacaaa acgactcaca 2160 catgaaaatc ctgaagatac agaagatatg gaagttgtgc aagaaacatc ccttccaatt 2220 tcttgtaaag taagtagtga aacggcgtta gattataata tggtaagcat aaaaactttg 2280 aaaaaacaac tacagacagc aatagaagaa acattaaata cttcgcctgc gcttgcttgt 2340 aaaaaagttc agagtttaga acaaaaaatt gaagatgaaa ttacattttt tgaaaagacg 2400 ggaaccaaag gaatatattt acaaaatgtg tatgattttt taatgacaat agtgccgact 2460 agtgtagaat cagaaagagc attttcagcg gctggttata tttgcacgaa aattcgatcc 2520 tcgcttgatg atgaatctat tgattgttta tgttttttga gagcatattt taataaaaaa 2580 aatgcaagtt atcaataaaa tatttcataa ctgctttcaa ttaatgtttt ttttgcttct 2640 ttaattaaat aatgcctcgc gggcctatta catgcatatt taacataaat ttaatatata 2700 attaataaac tttcttcgtt ttttgtaccc gtttattcct accatctatg taaaaaaata 2760 aagaatatat actcatttta gaataatatt tattagattt catgaatatt tgttaataag 2820 tcgttataat gtctgattta aacagtaata tgacttcaaa tttttcattt ataataccgg 2880 ttttaatacc ggtatcccga tattacaaaa tttctaatat cgaataccgg tattcagaat 2940 ttactccggt attgtaagcc 2960 // ID Gypsy-5_PPc-LTR repbase; DNA; INV; 334 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_PPc_; KW Gypsy-5_PPc-I; Gypsy-5_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-334 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1003-1003 (2010). XX DR Genome; chrUn; Positions 154086622 154086289. XX SQ Sequence 334 BP; 97 A; 78 C; 48 G; 111 T; 0 other; tgttgtatag tcacgttatc ctatacacaa taagtaaata attaataagc acactgatta 60 ttaatgtgct cttccgtact accgtatctt acacttcatt caagtgcagt ttggctccca 120 ccacgattac cctgaattcc atatttcaag tactgtaatt tcatcatcat gtctgtcgta 180 gctaaacatt cattgtttct agttgagttc cctcttcccg gcacgttgtt aatcaacgtt 240 cgtctattct gtattggatt cccctgccat caagataaag cgtatcaaga gtattaatca 300 agactgttat tcacacaagg acaacgacac aaca 334 // ID L2-6_NVi repbase; DNA; INV; 5247 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-6_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5247 RA Bao W. and Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(4), 756-756 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(884..1963,1967..4906) FT /product="L2-6_NVi_1p" FT /translation="MPVCRKCGRNIVTRAVSCYQCGLDFHAGCLARFVKSK FT SCSGCCLKTHKALSTPPDITSLPSRCERDSIFDAGRDLGSTSLLDISDLSI FT ASLSSEFTRSADPASLTENRLDDSSSHSLPCSSTMSTPSNWNAMSVDDKLG FT ELFNVITSGNTALSSKIDALAAKVDNQESRVTLLENENAKLTKEVASLKER FT FDKSCKYAEIKVSGIPPTCQASLHDITRAIFTKLNVANYITDVHSVRAIVS FT RPQSNAAEASQLSSQRAPRPLVFCIRFKSFCVRDDVLRAKRLFGPIKFSDI FT YPGGSDTSIDMHDMLSPFLHNLRLAVKTRANALGYKYVWVRDDKILVRKAD FT GSEIIPIITANDLERMVQSEASIPKTVSAPARKSSKIRIAHLNVSSVKAHF FT NEIEILLSDNSIDVLIITETWLTPSMQSSLFSIPGYNFIRNDRGLVSSRIL FT NPATEDDVTYVLGGGIGFYIREDLRFTLLEASYISNINETEYMLISVDCGS FT KILLGGVYRRPKGHTLSKFFNSLQNHSHRFGNVVVAGDLNSDLLSDNYYGN FT HLRNLINENNFHFVPFGATHHTDSSDTVRIDVMLSDCGDKIMHYEKSAAPY FT INGHDYLLIDFTFDLPEKQELLIISRDFRRFSSAAFLADLLPDLSLPAFSD FT PLASVNDKLVGFQQVACRALDIHAPFTARKPNRNATPWFTPDLKRRCKARD FT RLYSAAKLSRKRAAIIRYKTARIEVKRAIRSAREEYLLHGCSVRTKCWGFL FT RRNGLVSAKSKSPLHFFDKDTLAKYYSSVTCAHLVCSSDSLRAILDGQYNS FT LGSAFEFRFRNLDCVEVAQFMRACLSKAKGRSCDELSLSYFEQVIPQIAPF FT FTEVFNLSISSGVYPSIWKKSVIVPLSKCSTPSSPGDTRPVANIPHFAKVF FT DKIITDQVIAYLEDNNLISPYQSGFRSNYSTQTALLNVTEDIRRGADMGLL FT TLLLLFDFRRAFDSIDHATLLAKLKSIGFSNDAILWFHSYLTGRSQAVVGL FT DGTLSDFLPNTSGVPQGSSPGPVIFLIYINEIVSVLRHCSKSCMLFADDLQ FT IYIQCAPGDIGRAVASLNEDASCILKWSQDNGLLLNVAKTKAIIVGSTQHH FT MRLDLQRIEPIVVDGVPIPFDDSVKNLGVILSKTLSWNAHVSRTYSNAHFA FT LYRLRFKGYCLPTSLKTQLVNALVLPYIDYACLVYMDLPDYLAIKLQRLCN FT SAVRFIFFLRKDAELGPYYAKLGWLSLNHRRNYYLGITLYKIFVHKRPGYL FT ADLFALPNEDLRRSVRNNPSAFVIPTARTEIFRNSFTVKGMHFWDSIPTTI FT RLSKSLGAFKQRLYVYLLSLQQCYVHY*" XX SQ Sequence 5247 BP; 1193 A; 1151 C; 1102 G; 1800 T; 1 other; cggtgcgagt ttctgtgtat tacaggcggt gtgggaccgt tatggctcct tttggaaata 60 aaattcaagc actgtcattt tgctgttgag ccatggttct gcactttgcc tgacgaaaag 120 ttttgaggtt ataccacttc gccattacta ctcagcccgc ttgctcgctg ccgaccttcg 180 ctcgttcatg actgtcatcg ttaccgtccg ttacaccagc aaggaccgac cgaagcctac 240 tgttctgcac cagcccggtg aattatgacc gtctcctgac tcttctttgg ctgtgtgcga 300 gtgaggttcg tacgtgcgtt caggcgtgat ttgtgcggtg agagagagct ctagtgcagg 360 catcctagcc actgaatctc ctttggcccc tatgcgtttg cttttgtcct agcgaattgg 420 cctttcttgt tggtcacttg ctcttttttt tctttttctt ttctttcttt ctcgttcgca 480 tgtggcgcaa gttcaatgcc cgattacgta acttgtttac ttcaccgagg cctgtgttcg 540 tgaaagcatc atcttttgtt tggcgcgaat ttgaaatgcg tttgtaaacc tactcaattt 600 cgtcgaactt ggcgagtgtt cttaaagttg atcatttttr tttggcgcga aatttcactg 660 cgttcttaat tgattgattt ctattaataa gttaaattga tcttttttat ttgcatttgt 720 tgcgtttttc actgcgtttt tcgcttttct tttttttttg tttatatttt tgcgctttgt 780 tatagttttt attttaatag ttgtaattat tgcgtttatt tataagtttg cgttttattt 840 ttaagtttcc gttttattta tattaatatt catatgcgta ttaatgccag tgtgtaggaa 900 gtgcggtcgt aacatagtta ctcgagctgt ctcgtgctat caatgcgggc ttgattttca 960 tgctggttgt ctagcacgtt ttgttaagag taagtcctgt agtggatgct gccttaaaac 1020 tcataaggca ctttccacgc cacccgacat cacttcgctt cctagtcgct gtgagcgcga 1080 ttctattttt gacgccgggc gcgaccttgg ctcgacatct cttctcgata ttagcgatct 1140 tagtattgct agcctcagtt ccgagtttac tcgtagtgct gaccctgcat cgttgactga 1200 aaatcggttg gacgactcta gttcgcattc tctcccttgt tcttccacaa tgtcgacgcc 1260 ttctaattgg aatgcgatgt ctgttgacga taagcttggc gaattattta acgtcatcac 1320 ttccggtaat accgctctta gtagcaaaat tgatgcgctt gctgctaagg ttgacaatca 1380 ggaaagcaga gttaccttgc tagagaacga gaacgcgaaa ctaacgaaag aggttgcgag 1440 cctaaaagag cgttttgata agagctgtaa atatgcggag attaaggttt cgggtatccc 1500 gcccacttgt caagcttctc ttcatgacat cactagggct atttttacta agctaaacgt 1560 cgctaattat attactgacg tgcatagtgt tcgtgcaatt gttagtcgcc cgcagtctaa 1620 tgctgctgag gcttctcagc tttcgtctca gcgcgcgcct cgcccacttg tgttctgtat 1680 tcgttttaaa agtttttgcg tgcgcgatga tgtcttgcga gcgaagcgat tatttggccc 1740 aattaagttt agtgatatct atccgggcgg tagcgataca tctatcgata tgcacgacat 1800 gttatccccg tttctccata atttacgctt agctgtgaaa acacgtgcaa atgccttagg 1860 ctataaatat gtttgggtac gtgacgataa gattttagtt agaaaagcgg atggttcaga 1920 aatcattccg atcatcactg ctaacgacct ggagcgcatg gtttgacaat cagaggcgag 1980 tatccctaag acggtctcag cacccgctcg taaatccagc aaaattcgca ttgctcacct 2040 caacgttagt tctgttaagg cccacttcaa tgaaatagaa attcttttat ctgacaatag 2100 tattgacgta ttgattatta ctgaaacctg gcttactcca tccatgcagt cctcactttt 2160 ttctattcct ggctacaatt ttatccggaa tgatcgaggt cttgtctctt cgagaattct 2220 taaccctgcc actgaggacg atgttacgta tgtccttggt ggtggtatcg gattttacat 2280 acgtgaggat ctgcgtttca ctttattgga agcatcttat attagcaata ttaatgagac 2340 cgaatatatg cttatcagtg ttgattgtgg ttcaaaaatt ttgctcggcg gtgtttaccg 2400 tagaccaaaa gggcatactc tttctaaatt cttcaactcg ttgcaaaatc attctcatcg 2460 gttcggcaat gttgttgttg cgggagatct taactccgat ttgctctctg ataattatta 2520 tggcaatcat cttcgtaatc tgattaacga aaataatttt cactttgtcc cgtttggtgc 2580 tactcaccat accgatagtt ccgacactgt ccggattgat gtcatgttat ctgattgtgg 2640 cgacaagatt atgcactacg aaaaatccgc tgctccatac atcaatggtc atgattacct 2700 cttgattgat ttcacgtttg atctcccaga aaagcaagag ttgctgataa tatcgcgtga 2760 ttttcgaagg ttctcgtcag ctgcttttct ggctgacctt ctgcccgatc tctccttgcc 2820 ggctttctct gatccgttgg cttctgttaa cgataagctg gttggttttc agcaagttgc 2880 gtgcagggct ctagacatcc atgcaccgtt taccgctaga aagcctaatc gtaacgctac 2940 cccgtggttt acaccggacc ttaaacgccg atgtaaagcc cgcgatcggc tgtatagtgc 3000 agcaaagctt tctagaaagc gggctgctat tattcgctac aaaacagctc gcattgaggt 3060 aaaacgtgcc attcgaagtg cacgcgagga gtatttacta catggctgct ccgtgagaac 3120 gaagtgctgg ggctttctac gtcgtaatgg cttagtttca gccaaaagta aatcacctct 3180 tcattttttc gataaagata cccttgcgaa gtattattct tctgtcactt gtgcccatct 3240 agtatgctcg tccgattctt tacgtgctat ccttgatggg cagtataata gtttaggttc 3300 cgcgtttgaa tttaggttta gaaatttgga ttgcgttgag gttgcgcagt ttatgcgagc 3360 ttgcctttcc aaggccaaag gtcggagttg cgatgagtta tccctatcat attttgagca 3420 agttatacct cagattgccc cgtttttcac ggaagttttt aacctgtcga tttcgagcgg 3480 tgtctatcca tctatctgga aaaagtcggt cattgtgccg cttagtaaat gctcgacacc 3540 cagctccccg ggcgatactc gtccagttgc gaacattccg cactttgcca aggttttcga 3600 taaaataatt actgaccaag ttatcgcata ccttgaagat aataatctta tttcgccgta 3660 ccaatctggg tttcgtagca actacagcac gcaaacagcg cttctgaatg tcacggaaga 3720 tatcaggcga ggtgcggata tgggtctgtt gaccctgctt cttttgtttg atttcaggcg 3780 tgctttcgac tcgattgacc atgctacttt gcttgctaag ttgaaatcga tcggtttttc 3840 taatgacgca atactatggt tccactcgta tctgactggg cgttcacagg cagttgttgg 3900 cctcgatggc accctttctg attttcttcc taatacgtca ggtgtacctc aaggatcctc 3960 tccaggacct gtgatctttc tcatctatat taatgagatt gtctcagtac ttcgacactg 4020 tagtaagtcg tgtatgctgt ttgccgatga tctccaaata tatatacagt gcgcgcctgg 4080 cgatattggc cgagctgttg catcccttaa tgaggatgcc agctgtatac ttaaatggtc 4140 tcaggataat ggtctcctct tgaatgttgc caaaacgaag gctataattg ttggatcaac 4200 ccagcaccac atgcgtcttg atctgcagcg aattgaaccc attgtcgtgg acggagtacc 4260 gattcctttt gacgattccg tgaaaaacct gggcgtaatc ctttctaaaa ctctctcttg 4320 gaatgctcat gtatctcgca cctactctaa tgcgcatttt gctctttatc gccttaggtt 4380 taagggttat tgtcttccta ctagcttaaa aacccaacta gtcaacgctc tagttctacc 4440 gtatattgac tacgcttgtc ttgtttatat ggaccttccg gattaccttg cgatcaaatt 4500 acaaaggctg tgtaattctg cggtacgctt tatctttttc cttaggaaag acgctgagct 4560 tggcccctac tatgccaaat tgggctggct atcacttaac catcggcgca attattattt 4620 aggtattacg ttgtataaga ttttcgttca taagcggcct ggttatcttg ccgatttatt 4680 tgcgctacct aatgaggact tacgacgctc ggtcagaaat aatccgtctg cgtttgtgat 4740 accgactgct aggaccgaga tttttcgcaa ttcatttacc gtgaaaggta tgcatttctg 4800 ggattctatc cctacaacta ttaggctaag caaatccttg ggcgcattta aacagcgcct 4860 atatgtctat ttattaagtc tacagcaatg ttatgtgcat tattaatatt tttagattag 4920 gttataagat gttactgcga gcttactact cagttttctt tgctgccttt aattggtgag 4980 ccgtttacta ctggttaaga gagatattat tattttgttt ttccattatt ttgtttacga 5040 tttgcaactc tttattttgt aatacattag cttaagtcta ttattgtatg ccaaatattg 5100 gccactgctg tgccatcgaa aaatttgtta taagttgcca tctttatttg tatttctatt 5160 cctgctatgc tgtattattt gctcctcgcc ctacggcctt ttggtcacgg cgttaataaa 5220 tctgtttatc aatcaatctc tctctct 5247 // ID BEL-209_AA-LTR repbase; DNA; INV; 398 BP. XX AC AAGE02017349; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-209_AA_; KW BEL-209_AA-I; BEL-209_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-398 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017349; Positions 34743 34346. XX SQ Sequence 398 BP; 137 A; 70 C; 78 G; 113 T; 0 other; tgttctccgc acagcccact gtcggtaact catgcgccac aatacgtaca acgggcgatc 60 cgggaggcat catcagtcaa cagcgatgac gagcaaaatt gaatgtgggg caattgtttt 120 gataggcaaa catatcgggt gaaatattat aagttttgga ttggagaagt tttgaattga 180 attatataaa ggctataata aaaactttat acgcaacgta aaatttgaga tagattagac 240 agaaatagta cggacactaa attgtaagta caaacgaacg ttatgactag acttatgcta 300 aacttaaata aacatttttt ttttagcttt gagcttatta ccacccaact ctgggtcatt 360 gatttgctga gaagaactcc gtaaaatcta tcccaaca 398 // ID APAIA_ME repbase; DNA; INV; 179 BP. XX AC X61120; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE M.edulis ApaI repetitive DNA sequence. XX KW Satellite; Simple Repeat; APAIA; APAIA_ME; KW ApaI repetitive DNA family; highly-repetitive sequence. XX OS Mytilus edulis OC Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia; Mytiloida; OC Mytiloidea; Mytilidae; Mytilinae; Mytilus. XX RN [1] RP 1-179 RA Cornudella L.; RT "APAIA_ME."; RL Direct Submission to Genbank (30-JUL-1991)L. Cornudella, Centro RL de investigacion Y, Desarrollo del Csic, Jordi Girona 18-26, RL 08034 Barcelona, SPAIN. XX RN [2] RP 1-179 RA Ruiz-Lara S., Prats E., Sainz J. and Cornudella L.; RT "Cloning and characterization of a highly conserved satellite DNA RT from the mollusc Mytilus edulis."; RL Gene 117(2), 237-242 (1992). XX DR GenBank; X61120; Positions 1 179. XX SQ Sequence 179 BP; 43 A; 54 C; 33 G; 49 T; 0 other; gggcccgtct ggccaccaca gaggttcaaa gttgcccatt tacgtatttt ccatatcaac 60 cacacattcg gtactcttcg gcatatccct acggcaaagt tccgaaccgg acctagctca 120 ttttaaaggt attgacccaa gctattcccc acttaaatat ttttctggga tccgggccc 179 // ID R9Av repbase; DNA; INV; 4181 BP. XX AC . XX DT 21-MAY-2010 (Rel. 15.05, Created) DT 21-MAY-2010 (Rel. 15.05, Last updated, Version 2) XX DE R9Av, an rDNA-specific non-LTR retrotransposon family from DE rotifer. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R9Av. XX OS Adineta vaga OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Adinetida; Adinetidae; OC Adineta. XX RN [1] RP 1-4181 RA Gladyshev E.A. and Arkhipova I.R.; RT "Rotifer rDNA-specific R9 retrotransposable elements generate an RT exceptionally long target site duplication upon insertion."; RL Gene 448(2), 145-150 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 570..3875 FT /product="R9Av_1p" FT /note="includes reverse transcriptase and FT restriction-like endonuclease domains." FT /translation="MNLPIREHAVSVHNINKFNYLCQLCSKSYDTINSVKA FT HYVACRRQKNASSTTAVPTNVINNNQLAINTNQVISRNPLQCVECLMKQVD FT FYAKDTKALVTHMRTKHAAAYEESKKVATRRVAWSPDEDQILAELEVKLKK FT IQKGQLLSRLVVEYNKCADKSKAPSRSKDAIRTRRQQHDYKLLLRSLQSQQ FT PPVGSEDSDSDISSSNNNPLTTTHNVTPTPDSSNVVLLIQKIRESVDSIVK FT ITNLKLNTNMLNAASAFINQNNNMDPLELSMRGIEEDVKAIRDKELQKPTR FT NVPSSTTSRKPTRNAKRLEKSKKYGYYQHLYYNNKKKLVAEILDGETSGAK FT PPPMNLVEDYYRNIWSRSTIDDSPVNNIKTVNSDSIFAPISRDEIKLALSN FT TKKDSAAGPDAVTIKEAKAIIDNLYVAYNIWLGVQGIPEQLKLNKTILIPK FT GNSDLSLLKNWRPITISSIILRVYNRLLAYRMNKIFKTNDKQVGFKPVNGC FT GINISWLHSLLKHARLNKNSIYACLVDVSKAFDSVSHQSIVRALTMNGAPS FT LLVKLIMDQYTNVNTVITCSGSISNKINISSGVKQGDPLSSLLFNLVIDEL FT FDVIKDQYGYTIDNIGTTNARCFADDLTLISSSRMGMNKLLELTTKFFKER FT GLNVNPSKCMSIGMSKGYKGKKSKIESEPLFSITDAQIPMLGYIDKTTRYL FT GVNFTSIGAIDAKRIKKDLQDTLDKLEHLKLKAQCKMDLLRTYMIPRFMFQ FT LIHTELYPKLLIKMDILIRKLAKRILHLPISTSSEFFYLPFKEGGLQLTSL FT KEAVGLAKIKLHKKIMSSNDPMLCYLIESQRSRIVEHFMKDLKLGDSLTLN FT EMNNIKECFMKEKRISFAQKIHGVGFEVFSSSPLTNQWINGEIKTMTTKTY FT INSIKLRTNTLETRVTTSRGLNIIKTCRRCHVADESLMHVLQCCSSTKGLR FT YSRHHKICAKVANKLVMNGYGVFREKSYPDPNNSGSYLRPDIIAVKNGHVI FT VLDVTVVYEVTGATFINAYQTKINKYNAIMVQIEQMFNCVNGELHGLVIGS FT RGSIHHSQLHIWHQMGFSSIELKYVAIGCMEDSLRIMSTFSKAIT" XX SQ Sequence 4181 BP; 1405 A; 807 C; 749 G; 1220 T; 0 other; gaaatagttt gcaatggtag gtgtatggcg cctctgtgtc tctctttcgc tggatatagt 60 ttgacgattt tgtaccaggt atctgtttct tgtgagttca gcaccagttt gaacaggctt 120 agcgatagac cttcgaactt gaaacactgt tgtgaagctg gctgggcccc tgcagatttt 180 ctcgattaga acgtgagtgt tacgtccaga atgacccacc agtggttagt tctacgttgc 240 cctggaaagg agaaaagttg agctaaaatc gcacggccta gttgtttatc aaataggcac 300 ggtgaggaac tcttctatgt accctgacta aagtactcac ttgtgcgctg ggtttgctcc 360 ccctcgcatt gacttatctg atcgcactac ccaccaaacg aaacataaac ttagctcgtg 420 gtatcagtcc acagcgtgtg cagtcggatt caggggagcg tgttagtgac aagcaggata 480 atattaacat agttaatgtt aaggcgttca acattcctta tccaattgga agagttgact 540 gtgaagtttg tcatgaagac attggacaaa tgaatttgcc gattcgagag catgccgtat 600 ctgtacacaa tataaacaaa tttaattatt tatgccagct atgttctaag tcttatgata 660 ctattaatag tgttaaagct cactatgttg catgcagaag acagaagaat gcctcatcca 720 caacagctgt tccaaccaat gtcatcaaca acaaccaact tgctataaat actaatcaag 780 taatatcaag aaatccactt cagtgcgttg agtgtctaat gaaacaagtt gatttctatg 840 ctaaagatac aaaggcacta gtcacgcaca tgcgtactaa acatgctgct gcctacgagg 900 aatcaaagaa agtcgcaaca agaagagttg cctggagccc tgatgaggat caaattcttg 960 ctgaactaga agtcaaattg aaaaagatac aaaaaggtca attacttagt cgtcttgtcg 1020 ttgaatataa taaatgtgct gataaatcga aagctccttc caggtccaag gatgctattc 1080 gtacaagacg ccaacaacat gattacaaac tattgcttcg ctcactccaa tctcaacaac 1140 cgccagttgg tagcgaagac agtgacagtg acatatcttc tagtaataac aatcctttaa 1200 caacaacaca taatgtcact ccaacgccag attcatccaa cgttgtacta ctaatacaaa 1260 agatccgtga atctgtagat tccattgtaa aaataacgaa cctcaaattg aacacgaata 1320 tgctgaacgc agcaagtgcg ttcattaatc aaaataacaa catggatcca cttgaactat 1380 ctatgcgtgg tatcgaagag gatgtgaagg caattcgaga caaagaactt cagaaaccaa 1440 ccaggaacgt tccttcttca acaacttcga gaaagccaac tcgaaatgcc aaaaggcttg 1500 agaaatcaaa aaaatatggc tattatcaac atctgtacta taataacaag aaaaaattag 1560 tagcggaaat cctcgatggc gaaacaagtg gtgctaagcc acctccaatg aacctggttg 1620 aagattatta tagaaatatt tggtcacgtt ctactattga tgattcgcct gttaacaata 1680 ttaaaaccgt taatagtgac tctatatttg ctccaatttc gcgtgatgaa atcaaattag 1740 cattatcaaa tacgaaaaag gattcagcag ctggacctga cgctgtaaca ataaaagaag 1800 caaaagctat tattgacaat ctttatgttg catataatat atggctaggt gttcaaggaa 1860 ttcctgaaca actgaaattg aataaaacta tcttaattcc aaaaggaaat tccgatctta 1920 gtctactgaa aaactggcga cctattacaa tctcgtctat tatcctaaga gtatacaaca 1980 gattattagc atacagaatg aacaagatct ttaaaactaa tgataaacaa gttggattca 2040 aacctgttaa tggttgtggt attaatatat cttggcttca ctctctcttg aagcatgcac 2100 gcttaaacaa aaattcaata tatgcttgtc ttgtcgatgt gtctaaagcc tttgattctg 2160 tgtcacatca atcaatagta agagctctca caatgaatgg tgcaccatcc ttgctagtga 2220 aattaataat ggatcaatat acgaatgtaa atactgtcat cacatgttct ggttctatat 2280 caaacaagat aaatatctcc agtggtgtca agcaaggtga cccactatct agcttgttgt 2340 tcaatctggt tatagatgaa ctgttcgatg taataaagga ccaatatggt tatacaattg 2400 ataacattgg caccaccaat gcacgatgct tcgccgatga tttaacacta atatcatcat 2460 ctagaatggg tatgaataaa ttgcttgagc tcaccacgaa attcttcaaa gaacgtggac 2520 taaatgtaaa cccatcaaag tgcatgtcta ttggcatgtc caaaggttat aaaggaaaga 2580 agagtaaaat cgaatctgaa ccactcttct ctatcaccga tgctcagata ccgatgttgg 2640 gctatattga taagacaact cgatatctcg gtgtaaattt cacatctatt ggtgccattg 2700 atgcaaaaag aatcaaaaaa gaccttcagg acacactcga taagcttgaa catcttaaac 2760 tcaaagctca gtgcaaaatg gatctcttac gaacttatat gataccaaga ttcatgtttc 2820 aattaattca tactgagtta tatccgaaat tgcttattaa aatggacatc ttaattagga 2880 aattagctaa acgaatccta catctgccca tatcaacgag tagtgaattc ttttacttac 2940 ccttcaaaga aggaggtctt caactaacct cacttaaaga agcagttggt ttagccaaaa 3000 taaaattaca caagaagata atgtccagta atgatccaat gttatgctac ttgattgaga 3060 gccagaggag ccgtattgtc gaacatttta tgaaagacct taaacttgga gattctttaa 3120 cattaaacga aatgaataac atcaaagagt gcttcatgaa agaaaaaaga atctcatttg 3180 ctcaaaaaat tcacggtgtc ggcttcgaag tattctcatc aagtcctttg acgaaccaat 3240 ggattaatgg cgaaattaag acaatgacaa ctaaaacata cattaactca attaaactta 3300 gaacaaatac tctagaaact cgggtaacaa catctcgggg actgaacatc ataaaaacat 3360 gtagaagatg ccacgtagct gacgaaagtc tcatgcatgt gctccaatgt tgctcttcta 3420 ccaaaggttt acgatactct cgtcatcaca aaatatgtgc caaagtagca aataaattgg 3480 taatgaatgg ttatggtgta tttcgtgaga agagttatcc agatccaaac aactcaggtt 3540 cataccttcg accggatata attgcagtaa aaaatggtca tgttattgtt cttgatgtaa 3600 cggttgtgta cgaagtaact ggtgctacgt ttattaatgc ctaccaaaca aaaataaata 3660 aatataatgc gattatggta caaatcgagc aaatgttcaa ttgtgttaat ggtgaattgc 3720 atggtctagt aattggatca cgtggttcaa ttcatcacag tcaactccac atctggcatc 3780 aaatgggatt ctcttccata gaacttaaat atgtggctat aggatgcatg gaggattcgc 3840 tcagaatcat gtccacattc tcaaaagcta tcacatgaac tagtctcctt cttctattag 3900 tcagtctaat taatttttct tacattctac atctagttcc attattaaat tggtatgatc 3960 agtgctatct ctgctacact caatgcttaa tcgtatgtta ttgacagtct gacacttgat 4020 tactcttacg acatatgcac tgtttgcttc agagaaacca ctgttcatat agtgaagttc 4080 ctcagttttc tgttgatata ttcttctttc attctcgctt ctccttttct actgtgttct 4140 ttttatcagt tttttgtgga aaaattgaga ataaataaag t 4181 // ID EnSpm-N5_BF repbase; DNA; INV; 5096 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-N5_BF non-autonomous DNA transposon - consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N5_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5096 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5096 RA Kapitonov V. and Jurka J.; RT "EnSpm-N5_BF - a family of non-autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 793-793 (2008). XX DR [2] (Consensus) XX SQ Sequence 5096 BP; 1327 A; 973 C; 1008 G; 1709 T; 79 other; cacagtgctc gagcgggtgt aaatgcgtta ggtttggagc gtaattgagt cagacttgag 60 tcgtatatct ctgttgaccg tttcacgact atcgtgccca ccaaaacacg actgtgaggc 120 cgaatccctt gagccccatc gataaaatac gactgtgtgt tgactaatat acgacgaggt 180 cggtaatcag acgtatacag tcgtattttg gtcgacgtcc agaggaaagg tcaatagagg 240 tcatcagatg cgttacagcc tggttatcca aataaacaaa ctgtaagtta gtcggatgag 300 ctaggcgttt actacgtttg tctgccttgt tcagttgttg atttttaacc cccttaggta 360 aaatgaggtc gcatgatttt tttcccaaac tattttcatt tcttttgcag gatattactt 420 gtaaaattat gctataggct aacgttcgtt ttctttcgtt ccgttttgta acagttgtta 480 cagttcgtta tgtttagctc cgtttcgtta cattttgttt tctttcgttc ttgtttatag 540 ttcgttatat ttcattccgt tgcgtaacgt tacagttcct ttgatccaac ccataacgtt 600 aagttttatc aaaatgaaag ctgctaaatg gtagctggtc ttctgacgtc ctgggcgtct 660 acgttttata attaggccgt tccatttcgt ttccatcaaa ttatcgttcc gcttcgttcc 720 aattatttca ttttagttcc gtttggttcc gtttcgctct attccattct gttcaagttt 780 tattaatttt gcgttctgtt tcgttcattt tcgtaccgtt ttgtttcatt tcattcatgt 840 cgttgaattt ggttaccttt cgttatattt cgttacgctt cgttacagtt cgttccgttt 900 tgttatagtt cgttgtattt cgttccgttt cgtttcgtta cggttcgttc cgttttgtaa 960 cagtttgttg cgtttcgtta catttcgtta cagttagttt tctttcgttt tgttttgtca 1020 tggttcgtta tatttcrttc cgtttcgtta cagttcgttt tctttcattc cgttttgtaa 1080 cagtttgtta cagttcgtta tgtttagctc cgtttcgtta cattttgttt tctttcgttc 1140 cgttttgcta tagttcgtta tatttcattc cgttgcgtta catttcgttc cgttttgtaa 1200 tagtttgttc cgtttcgtta catttcgttc cgtttcgtta tagttcgtta tatttcgttc 1260 cgtttcgttc cctttcgtta tagttcattt tctttcgttc cgttttgtta ttgttcgtta 1320 tatttcgttc cgtttcgtta tagttcattt tctttcgttc cgtcttgtta tagttcgtta 1380 tagttcgtta tatttcgttg cgtttcgtga catttcgttc cgttttgtaa cagtttgttc 1440 catttcgtta tatctcgttc tgttttgtta tagttcgtta tatttcgttc catctcgttt 1500 ccgtttcaaa cttttcattg ttcttgtcaa aattgtttta ctatatgttg tcgtgatgtc 1560 tgttgaggct agtcgaaata tttctggagt ccctgtgcaa atacgaacaa catgagttat 1620 gtgaaatgtc aaggttgtta gcaaatatac ctgtatattg catagtctat ctgtagttat 1680 actttatatt gtaattgtcg aacttgtgaa ttgaatcaat tgacgaactc gagaactggt 1740 aattatagta aggtagtaat gttttactag tggtccgcac catgatcgcc gtctatgttt 1800 agtaattcca atctttgtta aagtgttaaa atgctactta cctcgttcag atgaactgac 1860 agatgaagtg tgttgtgtta atcgacggtt tctctcatcg gcgacggtag gaatcgacgt 1920 gaactttgat acgacttaac taagtaagac gcaaacggga tccggtgttg aaaatcgttt 1980 tattctcgcc gatgtgaggg cgccgttatc ccttgtatgt gctattgtgc ttgtgtatat 2040 ttctcgtagg acgtataaca ttgtgcgtat gtataataaa gttgttttgt tgttgttaca 2100 acatttagat acacaacaag atgacggcat accatactca agcttgcagc ttattttggt 2160 gttgccatca acagactggt aactaggttg cgagagtaag tttgaactta aagtaccgag 2220 acgaatacga gcgagaagac gcacatacag attttttact gttaaaatgt ggaagactta 2280 ccaggaatga cccatctgcc aagttgtgat acatcctaac aactgaaggt tatgtttatt 2340 acaaagatca aaggaagtgg attctgatac ttggtattaa atctcttata ttttgtccag 2400 tttgaccatg aatgatgtta tagaaaattg aataataata tggcggcggt cagatagttt 2460 ttcccatcca ggtacggcaa ggagagaaga gtacgaaggc gccctgatac atgatgccat 2520 cagctgtagt taaagagcat tcatattgta cacgttcgat gagtggagag tctaaagaag 2580 tgcgaccgtg ccaaactacg ctgccatatt cgagtagggg acggatacaa gatttgtaca 2640 agtatatgat ttttacaagg agcttagatg taagtttact gaagattgaa acagttttat 2700 atgcattggc tagcattnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2760 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnttacc tagtcagagg gttggatgtg 2820 gaggaggact ggagagtttt gttttgttac gtactttttc tctcctcctt gattttatac 2880 tgtacgatta taatgaattc ggaaaagatc gatgagatta caaatagcta tatctttcat 2940 agaataaaca tgtgatatac tctagccttc aatgttctgt attcatgaca tcgtcgccat 3000 ggagccactg cacaccgcct ccaacacagt tgtccgcttt gctaacgtgt ttgcaaagtt 3060 cagaccttga agtccggatt tacgacgtgt tgtaagtctc ctacaaaatc catttgcgat 3120 ttctaacttg gcaagtaata ccgtaggtag catgtacata ctccgtgaca taattatgtt 3180 tttcatgctg cgttgcgttt tctttcttat tgagtgtgac atagtgttga ttacggtttc 3240 accagtattt gttaccgaac ctggcgtgtt tgtgtgtatt gtgtttagta ctagcttaat 3300 gtttgcttga ctgtgtgctg tcaatataac ggactataac aaaacgaaat ataacgaaac 3360 ggaataaact gttacaaaac gaaacggaaa gttaacgaaa cggaacgaaa tataacgaac 3420 tataacaaaa cgaaacgaac tgtaacgaaa tgcaacgaaa cgtaacagaa tgtaacgacg 3480 tgaataaaat ggaacaaaac ggtacgaaaa taaacgaaat agaacgaaaa attaatgaac 3540 agctggcaac acatcgtgca caaagcaacc agagaaaatc acacagcatt tgcttggcag 3600 tatccaatta ccgcaagccc taagcaaaca gaggtaacgc ttacctggtc cacaataccg 3660 aaaacacgtg aatgcagcaa aagggaacat cacccagcta caatatacac cgagggactt 3720 cgaatttgac tgtataatcc ccttgtcagc atgagtaaaa atgttattga tgatatagaa 3780 atcaataaag tcatttgaag ataagattcg gggagaaaaa aaataccgac gtccagctgg 3840 actcgatccg acttcatcac aattctaata gaaaccgccc aacctactct cttatgccac 3900 tgagccacgg aagactttct tttttatcgt ctttgactag agcctttata catgacggtg 3960 cgtgattgcg caacgaagac aaaattaatg atcgtgtaaa aacttgtcat tgttccatct 4020 ttatagtaaa cggagtattt acacatggtg tttaaaattg aacttagaaa agattatgca 4080 acgttataaa tatatgcttc actgttgccc tgcgttcttg aatagtacgt aaacgactgt 4140 tctgtcggcg tgcgggtgtt aggtcaatca gtagcgcgtc tgcattgcga tacatggctg 4200 ccagaaatgg cgccggttcg aatcctacct tgtcttacat gtatgttatg tgtgaatctg 4260 caataataat cttacacatt tactacgcac tttcagcatt cctcagagaa gctcatgtgt 4320 tatctgtaca aaattaacaa gtaacgttaa tccatgttgg aaattctaat atttcatttt 4380 gccgactaaa acggaccaaa aatagaactg acctaatttt acagcagctg tcgtaacagc 4440 ttgtcctaaa atattcaaca tttaaataca tatagtacgt taaggacgtt atagatattt 4500 ttttattcaa tgatgacaaa cacaaccttg tctacatcct tgaagtttct ctcggtgctt 4560 gattgcttat tttactttta ccctaaggtg tccatgcccc gacgcacggt gtttataacc 4620 agaatccatg aaacgcggag ctacatatac cagagcacgt ggaacctcta gatgccaatc 4680 attccggtag ccacggaaac agtcaccacg gtgacgtgcg tcgccatgcc gtcactcacc 4740 ggagttgcca tggcgccgga ctgtctccga gtcagccatt tttgtttcgg tagtcttggc 4800 aacgttactt gtcgtcaagg caactcaggt aggtttccag tcgtgattag aacacaaccc 4860 gattccaact ttctagtcgt tattcatgcg gttttgagtc gtaaggtgga acaggtgaga 4920 aaatggtcgt aattcagtag ggaatgtccc acctgacgac tagcttacgc cccgttaacg 4980 acccgcagga aaattggaca gacattcacg acccaaacac tgctgtgggc caaacacgac 5040 cgacgaccca ttcacgccta atgtttaact gatcaacgac ctgctcgagc atagtg 5096 // ID Kolobok-2_Aplcal repbase; DNA; INV; 3779 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-2_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3779 BP; 1142 A; 798 C; 804 G; 1035 T; 0 other; gggggctcgc gggtggattt cagcctgaca gatatcattt ttttggtaaa atatcgaaat 60 cgaggtttta ccattttttg aaagtcgaat cattcttctt cacaatggta tcggtcataa 120 tgttgaaaaa cggtcacaaa agtgttttca acaggcattt tagagacacc ccctctgcca 180 aaatgtcagc cgggaagcga aatgttgaag caaattcaca aaagagcaat gtttggacca 240 tagcaacggc cttacaattg tatacaaaca tagccgtgtt atataccgtt tgaaagagca 300 ggatttcaac cgacgaatag taatcgagtc aaccacatga atcgattaca gaatgaacca 360 attgcattca aatgagacat gcgctccaaa aaatgcttca ttctggttgg tcttcagctt 420 cagatgtctg agttgccacc gcgtagcgct tttcggaaaa cccgatccgc atttcgaaac 480 ctctaatcac catgctttgg gattttttta catgatattt tattcaaata acaaattcat 540 ttataaatat tgataaataa aacattttga taaaattaca ttcgttttac ttgagactgt 600 taaatgttcc gatgttgaca gtactacgat acacctcgcg cgtactcata cctggcatat 660 tatggtcaac gccataagcc attgagaaag acagtcgaag tgaaacaaag cgagttactg 720 cattgtttac gatctgttca cattctacaa gatgccgaga gtcgctaaaa ctttcgcttt 780 tggtcgaaag aagagacgtg gtcctaatgc taagccaaga gggaggcatg ccaccacgca 840 ggtgtgtgaa aatggagtgg aaacaaccga agatgtcgac atggatgaac ctacttcgcc 900 agttcagcct cttgctatga atgaattcgg tccatcatct tcagcccctc aaattgacag 960 gtaagaaaat atgatcgtta tttagatcta gagtctagat ctggatttat tcattctaga 1020 tctacaaact ggtttcttat atcactcaag ctccccaatc taatatatct agattagatc 1080 tagatctaca gtcttgatct aaattatact tctctgtaaa attgcactgt gtgaggtata 1140 gtatagattt atgggtgaag gatcacacag aggaagaaaa caattgtgtg ctcagctata 1200 tgcagatctg agttatgcaa atatttgtgg agagggtggt gtgagcacaa gttattattt 1260 tggtgattgt tctcaaagac agaaagtaaa aaaaaagtag agtgtagatc tggtactttt 1320 caagtagcct aatgacctaa tgccatgctt ttctacaatt ttgcatgtat cacacaatgc 1380 aaaattatat tgctttgact ttctaaggac tgtagttcta acgcaatgtc taattttatt 1440 ttctgcaaca gtgagccagc tcccgttgaa gacgacgagg aaacagaaac ccaaacaaat 1500 tcatcgcggt gtgagcagaa gatgtcagcc ttcgtagatc attcttcgtc ccctgaccct 1560 gaagcagagc gatactacct ggtacattcg tcgttttttc tcgctatgct agatattatt 1620 gtatgcccct tctgctttgg ctccgatcta tcccttgatg cgtcaagcga catcggtatg 1680 gcttgtcgtt ttactgtgga atgcaataca tgccagaaac cagtctggtc aaagtgcact 1740 tcgccgaaat caacacgtcg acacgacatc aacgtccgtg cggtcgtagc ctctaaggag 1800 tgcggtgtgc cgctgagcac catgaagacg atgttcagca tcatgaacat gcccaacgtc 1860 atgcaccaca agacgttcca tgaaatcggc ctagaagtga gagaggctgc catcggtgct 1920 gcagaggaag ttatgtcggc ttctgcccga gtcattaaag aacgtcatca atccaacgtg 1980 tacgcaactg aacgatcaag tccgtctggc gttcaagttg tcaacgtttc ctacgacggc 2040 acgtggcaca agcgaggtca ctcgtcacac tatggagtag gtgtggcgat tgatgtggac 2100 agtggttatg tactggacac atatacagtg tctaatttct gcacagggtg tactcgtgcg 2160 ccagatcccg actccccaca gtacttgtcc tggttcgaac ggcacaaacc tttgtgtaac 2220 aaaaattttg atggctcctc taacgcaatg gagactagag cagctgaggt gatctttagt 2280 cggtcaattg agtgccgtgg gctgatatat ggcactatgt tgtgtgatgg tgacagcaag 2340 gcactgacac gtgtcaacga ccttggtgtc tacgacattc cggtgcagaa ggaagactgc 2400 gtaaatcata tagccaaacg catctacaat cagttagaac aggttaaaaa agacaacaag 2460 cagaagctca acaggaaact gaccgctgcc aaaataaaaa aaatcacaaa cacttacgcc 2520 acgaatctga gacagaatgc cccagatatt cagaagatga agttaggtgt tatgggtggc 2580 ctgttccaca tgatgtccac ggacagcgcc ccgaaccatc gtttgtgccc agagggtgaa 2640 acatcatggt gcaagcacaa ccgtgccctc gccaagggtg aacagccccc tccccacaac 2700 ccaaccttat cacctgatat agggacttac gtattcccca ttttcaagag gctgactgac 2760 ccagaactcc tctctaggtg tgtcagaatg tctacgcaga acccaaatga atgttttaat 2820 tcgacaattt ggagacgttg ccccaaggtg cttttcgccc acctcccaac cattgaaaca 2880 gctgtggcgt taagtgtaat ttcgtttaac atgggaccta caggactata caaggtcttc 2940 gaaagactcg gcctgacctg gggagctgtg acaaattcgc atgccattgg agtagcagag 3000 acaagaattc gtcactccaa gaggaaagga aagaacattt cgaagtggaa acgaaagcac 3060 cttcaacaga accacctcaa cgagaaggac aagagagagg aaagggaggg tgtcacctac 3120 tcttcagggg ctttcaactg ttaaggtagg cccagaacac tagttagagg caaatattgg 3180 actactaacc acctaacaca gcatgtggca tgtttactaa caaccgtaaa ctttgacgcg 3240 atttttctca aaactacctt tttcagatcc ttttcaatag ttggaacact accattttat 3300 tatttgtaaa gatatgggca tcaaatttgg catgttaata gcttatccat ttccacagat 3360 agtgacatgg tcagaattgt gatatgtctc tttatttttt ttttggaaat ttttttcaac 3420 agtcactggg ggtattttct taccaaaatt gtaaggtgaa tttggatggc ttaaaaaaca 3480 actaaaaggg atatataagc cattctaagc ttgttactat ttagagctat gtctcctaaa 3540 cacattacaa gtgaaataac aggaaaaaag taaaaactat attgatggta gtgtaccaaa 3600 attggtatat attttttgtt tacaatccaa gatggccgcc gtttccctgg ttacgaatgg 3660 acctgataac actatgtaca gttggatcca aatattttca cataggttaa tagttgatat 3720 taattgcttt ttagaataac ggtggataaa aaaacttgtc aaataaatgt cgagccccc 3779 // ID Gypsy-88_CQ-LTR repbase; DNA; INV; 500 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-88_CQ_; KW Gypsy-88_CQ-I; Gypsy-88_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-500 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 556-556 (2011). XX DR [2] (Consensus) XX SQ Sequence 500 BP; 142 A; 139 C; 92 G; 125 T; 2 other; tgtaatagtc tgaacaccac tgcattccca acgtacccat tcgcacttaa tttccattac 60 catgcccgga cctcgcacaa caagaccagg ttttgtcttt caaaacacaa accctgcctg 120 gctgtagagc ccacacacag ttagaaccac ttggcgcaag cataattgct gatgcttgcg 180 tatagccttt ttcgaaccat attttggttc atcgacaatt ggccgggacg ctgactcctg 240 ctgagccagc gcaagggttc aacccagtac aagccaacwc cagaagccta tactgtagca 300 cgtaagtgtc accactcaaa ttacatgtga tgcaacaaat ttgactagca taaatacccg 360 agttcaaata aacaaaaats agtcttttcg gctggaacat caaggcgact tgtctacttt 420 atttcgagct ccgagctggt tactgctcct cgcagtctac cacaccttgg atagctctta 480 atgcaaagaa agtccctgca 500 // ID Mariner-1N_HM repbase; DNA; INV; 893 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 08-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner-1N_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-893 RA Jurka J.; RT "Putative nonautonomous Mariner elements from Hydra RT magnipapillata."; RL Repbase Reports 8(12), 2102-2102 (2008). XX DR [1] (Consensus) XX CC The youngest elements are ~99% identical to consensus. Present in CC >2000 copies. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 893 BP; 273 A; 185 C; 162 G; 273 T; 0 other; cccagtgggc acccgacgtc aatccgacgt cggatttacg tacgattttc atcagcgatg 60 tcaaaagacc tattccgacg ttgttccgat gtcagtaaaa tgacgtcggg cgatgtcttt 120 atctcaacat cccctgacgt cgtttcgatg tattttgacg ttgttccgac atcttataaa 180 agacgtccaa aaaacgtcgc agtactgaca tcgatccaac gtcgataaaa agacgtcggg 240 caacgtcata atattaacgt actccgatgt cttttgacgt cttccgacgt caatccaacg 300 tccttaaaac gacgtcggaa caacgtcgta agacatcgct gatgaaaatt taatccaaac 360 ccaacaatat gtgtgcaatt gcgctcttca ataaatatgt gaactaagcg tgttttttta 420 ttaatggaaa gaagtttgaa taatggttga taaaacatca tgaggttttg tatgatgcac 480 tagcatagtt ttttaactat taacattttc gagtatacat tttttataat atataactat 540 tcgttttaat atttatataa tatttaaaca aaattatttc ttatcaaagt aataattccc 600 aaatagctac aagtataaat tactaaataa acacattaat taatatgtcc ctgtggtgta 660 gtggtaagcg cgtaggattt agtatctaat agaccgaggt tcgattcctg cctaaggcaa 720 cgttttttag ttttcaaaaa acacgttgga tttacatcgg aatgccgacg tcgcgcgatg 780 tcagtttatc cgacgtcaaa taacatttcc atttccaacg taaacccaac gtcgatccga 840 cgtcgagatc cgacgtcgct cgacgttcgt tcgacgtcga aatgcccact ggg 893 // ID Copia-2_AA-LTR repbase; DNA; INV; 290 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_AA_; KW Copia-2_AA-I; Copia-2_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-290 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 940-940 (2011). XX DR [2] (Consensus) XX SQ Sequence 290 BP; 75 A; 78 C; 48 G; 89 T; 0 other; tgaagggtac aatgctcacc tcttagtaag tatgtacacc ttgttccttt tgacaaccct 60 atcacacata caaccgtacg cgcacacacc tcgagccttg tgtttgcgcg ccaacaggga 120 aatcattccg cattcgattg ttaccaagtg tagtactgta ctttttcact ctaatttttc 180 cgatcattaa aaccacgttc tttagttagt aacttcgcgt gtgatttctc caccaaatat 240 ccgaaactcc tttagggtat tccactgccg caagttggaa tctgccaaca 290 // ID Gypsy-81_AA-I repbase; DNA; INV; 4510 BP. XX AC supercont1.19; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-81_AA_; KW Gypsy-81_AA-LTR; Gypsy-81_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4510 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.19; Positions 3747596 3752105. XX CC Positions [3454-3993] - Integrase core CC 'ATTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 226..4482 FT /product="Gypsy-81_AA-I_1p" FT /translation="MDADQFSRFMDMQRQAMQALIGSMRDVQLKQPAPPVM FT GGSAASSVPLPPPLELEGDMEQNFNFFEENWKTYASAVGMDAWPVEQNKQK FT TSILLSVVGKDALKKYFNFELNNAQQNDPDLALAAIKLKVVRDRNKFVDWF FT DFFSLEQSPAESIDNYLCRVKSLAKLCKFGDLEEDMIKYKLATSIKWMKLR FT SKLITTQNLTEAHAVDLCRAEEITERHPVTVGHASAEVNMVKKSKMRCKFC FT GSKHDFSKGACPALGKKCNRCGGKNHFEKVCKAERKKKLKKKLRVKKVRED FT ASSDSESSESSDSENSESSASESLSIGKIVDKSGGGGHVTADLELKLAGQW FT QSVKCELDTGANTSLVGRNWLETMIGRDKFELKPSAYRLQGFGGSNIPVIG FT QVKLPCRRKGRKYNLVLQVVDVKHGPLLSANVCRILGFVKFCNSVSFIAPK FT TEQELLNVYRVKAQEIIKQHEDVFHGIGKFAGAVTLEIRPDVPPSIQPPRR FT VPIALREKLKVELRNLERDGLIVKENQHTDWVSNIVLVKRKEQKSESIRIC FT LDPIPLNVALKRPHMQFTTIDEILPELGKAKVFSTADVRKGFWHVLLDEKS FT SRLTTFWTPFGRYRWTRLPFGIAPAPDIFQMKLQEVIQGLQGVECVADDLL FT IYGSGDTLEEALENHTFCLEKLLVRLNECNVKLNISKLKLCETSVKFYGHM FT LTDKGIQPDEGKIATIKNFPRPTDRKQLQRFVGMVNYLSRFIPNLSADFTI FT LRRLISEKEDWVWSEKEDEEFDRVKRLVADTKTLQYYNMNEPIVIECDASS FT YGLGAAIFQSHGVIGYASRTLTATEKNYAQIEKELLAILFACVRFDQLIVG FT NPKTTVKTDHKPLVSVFRKPLLSAPRRLQHMLLNLQRYRLTIEFVTGKENV FT VADAISRAPFDENLADDKFNKRDIFRVFREIEEVNISSFLKVKDAQLNEII FT VETESDPSLQQIMQFIQNGWPSSVNQVPDCAKMFFKYRHELSTQEGIIYRY FT DRIVVPYSLRRKLTEKVHVSHNGIEATLKLARANLFWPGMSAQIKEVVTKC FT SICAKFQPSQAPAPMQSHPIPVYPFQLVSMDVFFADYQGKSCKFLVTVDHY FT SDFFEVDLLKDLTPKSTIAACKSNFSRYGKPQQVISDNGTNFINREWRQFA FT RDWDFCHSTSAPHHQQANGKAEAAVKIAKRLLKKAEEAESDFWYALLHWRN FT VPNKVGSSPAVRLFSRNTRCSIPMSTDGLKRKLVTGVPEAIEERRRKAKFQ FT YDRKTRNLPQLETGSAVYVQLNPEVTKKWTPARVSNKLNERSYVVDVHGAQ FT YRRDLVHLKPRNEPETPEVSTNPAIASNVEATVPERSSSDLQLPIPNDEVD FT TSVPTTITPELISTPKPSKTTRISRSSCDPIIEEGKNVRSKRNVKLPVRFK FT DYCLE" XX SQ Sequence 4510 BP; 1348 A; 920 C; 1136 G; 1106 T; 0 other; tggtgtcaga agttgtggac taaaagtgta ttccggcgtc atttgtaaat cgcggagggt 60 gttagtggaa aagttcagtt cgagcggtgg aaaaatcatt tgcagcaagg attgttagaa 120 gttgtgcggt gaaaagcaaa cttgttgggt cttgtcgtgc gccatatttt ttctgtcgtg 180 tcggatttga gtgatttccc ggaagtgtta acaaactcaa agaaaatgga tgcagatcag 240 ttttctcgtt ttatggacat gcaacggcag gcgatgcagg ccctgattgg ttcaatgcgg 300 gacgtgcagt taaagcagcc agcgccacct gtgatgggag gttcagccgc atcatccgtt 360 cctctcccgc ccccgttgga gttagagggg gatatggaac aaaactttaa ttttttcgaa 420 gagaattgga aaacctatgc tagtgcggtt ggcatggatg cgtggccggt agagcaaaac 480 aagcagaaaa cgagtatttt gctatcggtt gttggaaaag acgcgttgaa gaaatatttc 540 aacttcgagt tgaataatgc tcagcagaac gacccggacc tcgctctggc agcgattaag 600 ctcaaagtag tgcgggatcg gaacaagttc gtcgattggt ttgatttttt ttcgctggag 660 caaagccccg cggagagtat tgacaattat ttgtgccgtg tgaaatcgct tgctaaactg 720 tgtaaattcg gtgatttgga agaggatatg atcaaataca agcttgctac ctcgatcaag 780 tggatgaagc ttcgatctaa gttaatcaca acccaaaact tgacggaagc acacgccgtg 840 gatttgtgtc gtgccgaaga aatcaccgag aggcatccgg taacagtggg ccatgcaagc 900 gcagaagtga acatggtgaa gaaaagcaaa atgagatgta aattctgtgg cagcaagcat 960 gacttttcca aaggtgcttg tccggcactt ggaaaaaaat gcaatcgttg tggtgggaag 1020 aatcactttg aaaaagtgtg caaggcagag cgaaagaaga aattgaagaa gaagctccga 1080 gtgaagaaag tgcgtgagga tgccagctct gatagtgaat cttcggagag cagcgacagt 1140 gaaaattccg agtcgtccgc tagtgagagt ttgtccatcg gaaaaatcgt tgataagtct 1200 ggtgggggtg gccatgtaac cgccgatttg gagctgaagc tggctggtca gtggcagtca 1260 gtaaaatgcg aattggatac aggtgcaaac acaagcctcg ttggacgaaa ctggctggag 1320 accatgattg gacgtgacaa gtttgaactc aaaccatcag cataccgtct tcaaggtttc 1380 ggcggcagca acattccagt aattgggcaa gtaaaacttc catgtcggag aaaaggccgt 1440 aagtacaacc ttgttttaca agtagtggat gtcaaacatg gaccacttct ctcagcaaac 1500 gtttgccgaa ttcttggttt cgtaaaattc tgtaactcgg tcagtttcat tgcaccgaaa 1560 acggaacaag aactgctgaa cgtgtaccgt gtgaaagcgc aggagatcat taagcagcac 1620 gaggacgttt tccacggtat cggtaaattt gctggtgctg taactttgga aatccgacca 1680 gatgttccac cttcaattca gccacctcgg agagtgccga ttgcgctgag ggagaaactt 1740 aaagtagagt tgcgcaatct ggagcgtgat gggctgattg tgaaggagaa tcagcatacc 1800 gactgggtca gtaatatcgt gttggtgaaa cgtaaggaac aaaagtcgga atcgatccgt 1860 atctgtctcg atccaatccc gttgaatgta gcgctcaaac gtccgcatat gcagttcacc 1920 accatcgatg aaatcttgcc ggaattggga aaggcgaagg tgttttcaac agcggatgtt 1980 cgtaaaggtt tctggcatgt gctgttggac gagaaaagta gccgattgac aacgttttgg 2040 actccgtttg gccggtatcg gtggactcgt ctaccatttg ggatcgctcc agctccggac 2100 attttccaaa tgaaactgca agaagttatt caaggactgc agggtgtaga atgcgttgca 2160 gatgatttgc tgatttacgg ctcaggcgac acgttagagg aagctctgga gaaccacact 2220 ttctgcttgg agaagctgtt ggttcgcctg aacgaatgca acgtaaagtt gaatatcagc 2280 aagctcaagt tgtgtgaaac gtcggtaaag ttctacggac acatgctgac agacaaagga 2340 attcaaccag acgagggtaa gattgcaacc atcaagaatt ttccgcggcc gacagatcga 2400 aagcaacttc agaggtttgt cggcatggta aattacctca gtcgtttcat acccaatctc 2460 agcgctgact tcaccattct ccggcggcta atttcggaaa aagaagattg ggtatggtcg 2520 gaaaaggaag atgaagaatt tgatcgtgta aaaaggctag tagcagacac taagacgttg 2580 caatactaca atatgaacga accgatagtc atcgagtgtg acgctagttc ttatggtttg 2640 ggtgcagcta tcttccagag tcatggagta attggatatg cttcacgtac actgacagct 2700 accgagaaga actacgcaca aatcgaaaag gagttgctcg ctatcctttt cgcctgtgta 2760 cgttttgacc agttaattgt tggaaaccct aaaactactg ttaaaacgga tcacaaaccg 2820 ttggtcagcg tatttcgaaa gccattacta tcagcaccac ggcgtctgca gcatatgttg 2880 ctgaatctac agcgttaccg attgacgata gagttcgtga ccggaaaaga gaacgtagtg 2940 gctgacgcaa tatccagagc tccgttcgat gaaaatcttg ccgacgacaa gttcaacaaa 3000 cgggatattt tccgagtctt tcgggaaatc gaagaagtta acatctccag tttcctgaaa 3060 gtgaaggacg cgcaactaaa cgagatcatc gtagagacag agtcggaccc ttcgttgcag 3120 cagatcatgc aatttattca aaatggctgg ccgtcatcgg taaaccaagt cccagactgc 3180 gccaaaatgt tttttaagta cagacacgaa ctaagcacgc aagaaggaat catctatcgt 3240 tacgacagaa ttgtggtacc ttattcgcta cgtcggaaat taacagaaaa agttcatgtt 3300 agccacaatg gaatagaagc gaccttaaag ttggcacgcg ccaatctttt ctggcctggc 3360 atgagtgccc agattaagga agtcgtcaca aaatgcagca tctgtgctaa atttcagcca 3420 tctcaagctc cagccccgat gcaaagccac cctattccgg tgtatccatt tcagttggtg 3480 tcaatggacg tgttcttcgc agactaccaa ggaaagagct gcaaatttct tgttaccgtt 3540 gaccactatt cggatttttt cgaagttgac ctcctgaaag atttaacgcc gaagtcgacg 3600 attgctgcat gcaaatccaa cttttcgcga tacggaaaac ctcaacaagt tatttcggac 3660 aacggaacaa attttatcaa cagagaatgg agacaatttg caagagactg ggatttctgc 3720 cattcaacat cggctcccca tcatcagcaa gctaacggaa aagctgaagc agccgtgaaa 3780 atagcgaaaa gattattgaa gaaggcagag gaagctgaaa gcgatttttg gtacgcatta 3840 ttgcattgga gaaacgtgcc gaataaagtt ggttccagcc cggcagtacg gctgttttca 3900 agaaacactc gctgtagcat accgatgtct acagatggtc taaagcgaaa acttgtaact 3960 ggagtaccag aagcaattga agaaagaaga agaaaagcaa agttccagta cgatcgaaaa 4020 accagaaatc ttccgcagtt ggaaacagga tccgcagtct atgtacaact aaacccagaa 4080 gtcaccaaga agtggacacc tgcgagagtc agtaacaagt tgaatgagcg atcgtatgtt 4140 gtagacgtac atggtgcaca atatcgacgt gatctagtgc acttgaaacc acgtaatgaa 4200 cctgaaacgc ctgaagtgag taccaatcca gctattgctt caaacgtcga agcaactgtt 4260 ccagaaaggt cgagtagtga tttgcagttg ccgataccta atgacgaggt ggatacgagc 4320 gtacctacta ccattactcc ggaactgatc tcaacaccga agccgtcaaa gactacgaga 4380 atttccagat catcttgtga tccaataatc gaggagggta aaaatgttag atctaagaga 4440 aatgtcaaat taccagttcg tttcaaagat tactgtcttg aataatttct tttttataaa 4500 acagggagga 4510 // ID Afun1 repbase; DNA; INV; 1086 BP. XX AC AJ006557; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Anopheles funestus Afun1 gypsy-like retrotransposon encoding DE reverse transcriptase, partial. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Afun1; KW gypsy-like retrotransposon; reverse transcriptase. XX OS Anopheles funestus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Anophelinae; Anopheles; funestus group; OC funestus subgroup. XX RN [1] RA Cook M.J., Martin J., Lewin A., Sinden E.R. and Tristem M.; RT "Systematic screening of Anopheles mosquito genomes yields RT evidence for a major clade of Pao-like retrotransposons."; RL Insect Mol. Biol 9(1), 109-117 (2000). XX DR Genbank; AJ006557; Positions 1 1086. XX SQ Sequence 1086 BP; 386 A; 178 C; 268 G; 253 T; 1 other; gcgttgttga ggccaaaggg catcttaaga aggattgtcg tagtgcaagc aaagcactca 60 cacttaaaaa cacaccaatg aatgtagtgg acaataaaag tgtgtttgta aaaacggtca 120 gagtggaagg tgaaaccatg atggcgttag tagattcagg agcacgtgtt tcaacaattc 180 agcaaaaatg ggcagataaa ataggaaatt tgaagcctac tcataaaatt ttaagagggt 240 ttggtagaaa agaaatacag gtgaattcaa aagttgtcac ggaattacta ttagacaacg 300 ttaaactaga aatcgagctt caagtagtgc caaattgggt acaggatacc gcgataataa 360 tcggtgaaga agtgttggga aaagaaggat tagtantaac agtgcgtgcg ggatcagtaa 420 ccttccagat ggaaccaaac tgtcgctcag aggaaaaaga aagagaagat gagaatgata 480 gtgcgaaatc agtaatgcac atgttcacca ttaaagaagg gtcaagacag gtcgtggatc 540 gcaaaatggt gaattcagat ggaagttacg aacacacggt cagattgata aaggttgtaa 600 atgaataccg tgatgtgttt gcgttgaata ttaaagagtt agggtgtgca aaatcgacgg 660 caatttcact aaaattgacc caaatcgaac cagtgtatgt gaaaccacgg aagttggagt 720 atgcgcgaga aatcatgtta aatgaaatgg tcgaagaact gttagaggcg gaggtaataa 780 aggaaacaga ttcgccttat aatagtccga ttgtgttagt gccgaagaaa aatggacgat 840 ttaggatggc gatcgactat cggatgctta acgcaaagac agtaaaggat aaattcccta 900 tgcccgacct cgaacagtgc ttacagagat taagtggtgc gaaaatgtat ataacgctag 960 atctattcag tgggtattac caaataccag tacatcagga tagccaagat tacactgcgt 1020 tttcgacgcc cagcggacat tttaaattca cgcgtatgcc cttcgcctca ccaacgcaac 1080 cgaatc 1086 // ID Gypsy-39_AA-LTR repbase; DNA; INV; 269 BP. XX AC AAGE02027584; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_AA_; KW Gypsy-39_AA-I; Gypsy-39_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-269 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027584; Positions 58573 58841. XX SQ Sequence 269 BP; 75 A; 62 C; 59 G; 73 T; 0 other; tgtggtgtat tgaaagcaat gcttttacgc gctgcagcaa caagcaaggc ggcacaagcg 60 tggcgtcgaa aagcgtagcg cgtggcgttg ttaactattg cggcgatgca gcactgctac 120 cttggagatc tgttgccgcc tagcaaccag ccggttgttg attttaactg ctatataaac 180 cttccaattt ctgtactcta gtcagtttaa gattcagctt caataaagaa acaactaatc 240 ttcaaacaag tcttcatcaa gatcctaca 269 // ID Gypsy3-I_Dpse repbase; DNA; INV; 6233 BP. XX AC Unknown_group_550; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_Dpse; KW Gypsy3-LTR_Dpse; Gypsy3-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-6233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1039-1039 (2009). XX DR Genome; Unknown_group_550; Positions 30939 37171. XX CC Positions [830-1252] - Reverse transcriptase CC Positions [2336-2812] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 26..3184 FT /product="Gypsy3-I_Dpse_1p" FT /translation="MGTEVAESIRTRAIRAQRARDRFQRRKRRRLQVCAAV FT HSAIRQDDRPFARVKMGPKTVMGLLDTGASVSLLGKGGLELIGELELKLQD FT IKKFSVQTAGGTPHKILGHVSTVMTYGGQKQTMELFLCPSLKQPLYLGVDF FT WRKFGLAPHIVGLERHKELAEIDAEVLTRSRPTEPHELTHTQQTRLDQVKS FT NFRAFDTHGLGRTTLERHEIQLIEGAVPVKERFYPVSPAVQELLFAEVDEM FT LRLGVIETSESPWSNRVTLVRKPGKNRLCLDARKLNKLTVKDAYPLQSIES FT ILSRVEDTVYISSIDLKHAFWQIELEPESRTYTAFTVPGRPLYQFKSMPFG FT LCNAAQRLCRLMDRVIPQRLRSHVFVYLDDLLVISRTFEEHIHLLEEVAKC FT LKEANLTIGRKKSKFGYKYLKYLGYIVGDGALRTDPEKVQAITQIPMPRNV FT RQIRRFLGTAGWYRRFIRNFSALSAPLTDCLKGKGQFKLTEEAGKSFEELK FT KALTSAPVLHHPDFKKHFFIQCDASHVGIGAVLFQRDDEGAEHPIAFFSAK FT LRGAQLNYSVPEKECLAAVRAIEKFRAYVELMPFTVITDHASLQWLMRFKA FT LDGRLARWSLALQAYDFDIEHRKGKENIVADMLSRPFDVDAIDFLEFDTTA FT FEDEEYRRKIESVQSDEDEFPDVKVEEGLLFKRTQFGRPELEEFSWKLWIP FT EALTSTLIQQAHDSDAAMHGGIARTLGRLRQFYYWPRMTVQVKAYVADCDT FT CKETKHSTQVHRPLMGAETRTERPIQKLYLDFLGPYPRTRSGNVVILIVLD FT HMSRYVWLKALPKATSSAVIRFLHAEVFAQFGVPEIVHTDNGKQFVSAEFS FT QFLDRRSITHLKSGNYAPQANAAERVNQTILAAIRTYASEDHTRWDEKLTE FT IQCVARSAIHTGIGTSPYFVLFGQHMFTSGRDYKLARRLGALNDAQMSPLA FT RSDKMEIVRDEVRGNSHEAYNRAKNRYNRKAREIRYSPGQEVYKRNFVLSK FT FKDNINAKLSPKYVKCRVIRSMGNSLYELETLQGSRIGVFHAKDLKQ" XX SQ Sequence 6233 BP; 1801 A; 1335 C; 1763 G; 1334 T; 0 other; gtacgagcag gctaggaatc ggataatggg gaccgaggtc gcggagagta tccgaacgag 60 agcaatcaga gcacagcggg ctcgggacag gtttcagagg cgcaagagga ggcggctcca 120 ggtgtgtgca gcagtccact cggcaattcg tcaggatgac aggccattcg cacgagtaaa 180 gatgggcccg aagacggtga tgggactact ggatacaggg gcatcggtga gcctcctggg 240 taaaggtggc ctggaactca taggagaatt ggagctgaaa cttcaagaca taaagaaatt 300 ctcagtacaa acagcaggcg gcactcctca taaaatactg ggacatgtgt cgacggtaat 360 gacgtatgga ggccagaagc agaccatgga gttgttcctg tgtccctcgc tgaagcaacc 420 gctgtatcta ggggtcgatt tctggcggaa gtttggtctg gctccgcata tagtaggtct 480 agaacgacac aaggaattag ccgaaattga tgctgaggtg ctgactagga gtagaccaac 540 ggaacctcac gagttgacac acacgcagca gacgcgtttg gaccaggtga agagcaactt 600 cagggcattc gacacgcatg gtttgggaag aacgacgctc gaaagacacg aaattcaatt 660 aatcgaggga gccgtacccg taaaggagcg attctacccg gtatctcccg cggtgcaaga 720 gctgttgttt gctgaggtgg acgaaatgtt gcgtctcgga gtcatcgaaa ccagcgagag 780 cccgtggagc aaccgcgtga cgctggtacg gaagccgggg aagaaccgat tgtgtctgga 840 tgcgcggaaa ctaaacaaat taaccgtgaa ggatgcttac ccgctgcaga gcatcgagtc 900 aattctgagc cgcgtagagg atacggtgta catcagcagc atcgatctaa agcacgcatt 960 ttggcagata gagctggagc ctgaaagtcg gacgtacaca gcattcacgg tgccaggcag 1020 accactctat caatttaaat cgatgccctt cgggttgtgc aatgcagccc aacggctgtg 1080 tcggttgatg gatcgagtaa ttccacaacg actccgaagc catgtctttg tatacttaga 1140 cgatctacta gttatttccc ggacgtttga ggagcatata cacctgttag aggaagtggc 1200 caagtgtctc aaggaggcaa acctgacaat tggaagaaaa aaatcgaaat tcggctataa 1260 atatttaaag tatttagggt acatcgtagg cgacggcgca ctgaggacag atccggagaa 1320 agtacaggcg atcacacaaa tacccatgcc aaggaacgtg agacaaatcc ggagattttt 1380 ggggacagcg ggctggtatc ggcggtttat tcgaaatttc tcagcactct cggctcctct 1440 gacggactgt ctgaaaggaa aagggcaatt taagctgacg gaggaggcag gaaaatcgtt 1500 cgaggagctc aaaaaagcac ttaccagcgc ccccgtgcta catcacccag actttaagaa 1560 gcacttcttt atccagtgtg atgcgagtca cgtcgggatt ggagcagttc tcttccagag 1620 ggatgatgaa ggggcagagc acccgatcgc attcttttcg gcgaagctaa gaggggcaca 1680 gctcaattat agcgtaccgg agaaggagtg tctggcggcg gtaagagcca ttgagaaatt 1740 ccgtgcatac gtggaactga tgccattcac ggtgataaca gatcacgcga gtttgcagtg 1800 gctcatgcgc ttcaaagcac tggatgggag actagccagg tggtcgttgg ctctccaggc 1860 ttacgacttt gacattgagc accgtaaagg aaaggaaaac atcgtcgccg atatgctgtc 1920 acgtcccttt gacgttgacg ctatcgactt cctggagttc gacaccaccg ccttcgaaga 1980 tgaagaatac cgaaggaaaa tcgagtcggt ccagagcgat gaggatgagt tcccagacgt 2040 gaaggtggag gaaggactgc tatttaagcg cacacaattt ggccggccgg aattagagga 2100 gttttcgtgg aagctatgga tcccggaagc cctgacgagc acgctgatcc agcaggctca 2160 cgacagcgat gcggcaatgc atggaggaat cgcgaggaca ctgggacggc tgcgacagtt 2220 ctattattgg ccccggatga ccgtgcaggt gaaggcatat gtggctgatt gtgacacctg 2280 caaagaaaca aaacattcaa cacaggttca ccgtccactc atgggggccg aaaccagaac 2340 cgagaggccg attcaaaaac tgtacctgga tttcctgggt ccatatccga ggactcgaag 2400 tggcaatgta gtcatactaa tagtcctcga tcacatgtcc agatatgtct ggttgaaggc 2460 tttgcctaag gcgacgtcgt cggccgtgat ccgattcttg cacgcagagg tcttcgcgca 2520 gtttggcgtg cctgaaatag tacacaccga taatggaaag cagtttgtgt cagctgaatt 2580 ctcgcaattc ctggacagga gaagcataac gcacctgaag tcggggaact atgcaccaca 2640 ggcgaacgcg gccgaacgag tgaatcagac aatactggcg gcgataagga catacgctag 2700 tgaggaccac acgcgctggg acgagaagct gacggagatc cagtgcgtgg ctaggagcgc 2760 catacacacg ggcatcggga cgtcgccata ctttgtgttg tttgggcagc acatgtttac 2820 cagcggccga gattataaat tggcacgcag actgggcgcc ctaaatgatg cacaaatgtc 2880 acccttggcg aggagtgaca agatggaaat agtccgagat gaggtccgag gaaactccca 2940 tgaagcctac aacagagcga aaaacaggta caatcgaaaa gctcgagaaa ttaggtactc 3000 accggggcag gaggtatata agcggaattt cgtgttgagc aaatttaaag acaacatcaa 3060 cgcaaagctc agcccaaaat atgtcaagtg ccgagtgatc cgaagtatgg gcaacagtct 3120 gtatgagctg gagacgttgc aaggctcacg tattggggtg tttcacgcaa aagatctgaa 3180 gcagtagccc gggcatctca gtcgatggag ccagagactc gcaatcccag ggtgtgggcg 3240 atggcctacg tttggggccc cttaattgaa aaggcccgcc tgtttggcac tcccgtttcg 3300 gcggaggaat cctggaaaca gtagtgtggt gttatgggcg gtcttggaga gagccgataa 3360 gtatcagggc tggagaagaa agagacagcg gcatctatag tggatcctgg gaagcagtgg 3420 cagagttgcg tgggagaaaa ggaaacggtg cgaggaggag gaaaaagaaa aagaagagtt 3480 tccggtggag aatagtggag gccgcacgtg ccctgaaaga aaaaaaagga aacaaaattg 3540 gtgatcgtgg agacgacgat cccaccaaga catcagcggc aggcatcaac atcagcagaa 3600 gcggcggacc gcggcagcag cagcccggta ctggccaggc cggaaaactg gtgcgagaac 3660 cggcagcgga gcaccgacac caaggaaccg tcagcggagc accggcacca aggaaccgtc 3720 agcggagcac cggcaccaag gaaccgtcag cggagcaccg actggagaca cgtttctgtc 3780 cccacacacc aacccccccg gggaaagaat ccagacttcg aagcagaaat aaacccacac 3840 gaatcaatgt cacacacctt catccacaca catcacaccc aacccacgca tgtataccct 3900 cattcataac acatcacata acatcaccac atttcacaca tctacacccg tccgacccac 3960 aattaaacaa tatatcacga tcatataaaa agaacaccac gattaaacgt aaacatcagc 4020 aagcatcagg aagccacaac ttctagcaaa gggggctgcc aattcggccg aaatagggtt 4080 tctttttttt ttgttagtat tttttgtgtc agtatttgtt tccgttatat aaacctagat 4140 tatgaattaa tgttcaagga gataaataag aaaatataat ggatataaaa atgaaaattt 4200 gaacagagtc atgagtgaat agaaaacggt agttgggtgg gacgagagcg agaagaagag 4260 tgggaccctg aatatgagaa gggagcgctc atggaaggtg gcgctcatgg aagggtggga 4320 ccctgctagg gataacgaaa ccaactatcc ccgctgacga aagtgccaac gaagacaagt 4380 ctgtcctata aattagattt gttttttttg tatgtttttt gtttgttttt gttttgttca 4440 atccgtcccc tgacttatcg ggtcggaaac acccctgact tccccgtgag ctagctggga 4500 acgataggcc agggtggcaa agaggcggat cataactaat ggcgcccaat ggaaatagaa 4560 gggatcgatt tgaaacacga gtcttaaaga cccggtgtaa aatggaaccc tggtagaacg 4620 aaactcagcc ggtccgcgag ccgggaataa atcctagccg tcgtagccgc tgagaggagt 4680 agaagtgagg aaaatggaca tggtagttat gtatcggagg tagaagggaa ggaagtgtgg 4740 gtcccgagtt gtggtccagg tattaccagg tggtagtagc ctcgttgtcc gactcgtgga 4800 gcccgctgtc cggatctgaa aagaggaaga ggtgccgggg gatctgtcga gtcgtggtcc 4860 gggttatgtc cagaggcatt aacgtcgttg tccgattctg ggggatccac ggtgatctga 4920 gggaggaaac agcaaaggga gttggcactc aaaaaaagta agcgaaggta tatagcaagg 4980 ggaaaagaag ggggtccgct cacctgcaaa acaataaatg cagttgtctc tcgtcccacc 5040 aaacaaatta aaggtgaatt tttgtctcaa gcagtttggg aattatatgt ttttttttgt 5100 tagtgattta gcagactatt ggaaagccga agcaagaaaa cccaaaggga aatcatcttt 5160 ttttttgagg tctaacaatc ctcgctagca ttgtttggaa gcaattttag aagtaggcat 5220 taggcaagtc atcggaaaaa gagtttttgt ttgggagagg aagtatttag ttatttagtg 5280 ttgtatttaa ttaggcgatt aatggtagat gtacataggt gtttatttat ttgattgaat 5340 caatttagat atttagattt atcaatttat tgatttaaat taaccgattt acgtatttga 5400 atttatttat ttggttatct aaacattgaa ttaattattc gcttatttat ttattgcact 5460 atttatttgt tcaacatttg ttcaatttat ttgttttctc gttatgattg aaagttttaa 5520 agattattta gccaggaact aggtgtttag aggagtaggg tagtcggata ataaacaatc 5580 caatttattt atcgccaaga tagttaagta aatagagaat agacaaagga aaggagtacg 5640 ttacgaaaat tggttgatga gtgacccgga gatttttaaa gggtcgggaa agacaaggcg 5700 tagccctgag gcacagcgca ggtaccagcc gtacaacatg ccgacgactc ggtcccagga 5760 cggtaaagcc ggcagtaggc cggaagggcc gtcgcaagtg gcggaaaaca tacatgcggg 5820 gttgtaccag gaggacccag aggccggagc ggtaggaggt caacagcgaa cccgtaagga 5880 cctggtgtta ccgtccgagt taagctcagc cgtgaacgca gcgttagccc aagcacagga 5940 gacctaccga gcttcgctcg ggcagcaaat ggaagcgctg agatcgtcaa tgcaagcgga 6000 catgctggag ttcatgcgag agatcaacgc tgtgatgggt tcgttaaaag cggagcgaac 6060 ggcgagtgat gcgaatggaa ggaatcgggg atcgaatgca accagtctgg gaggaggtca 6120 accggaagca cccgcgaacc agcaagcaca tccaacgcac ttgtttgtgg acaaccgaac 6180 attggaggcc gatcagagga cccaagccga tcagagaccc gtggaggagt ggc 6233 // ID SAT-2_NVi repbase; DNA; INV; 151 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Nasonia vitripennis satellite repeat. XX KW SAT; Satellite; Simple Repeat; Nonautonomous; SAT-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-151 RA Bao W. and Jurka J.; RT "Satellite repeats from Nasonia vitripennis."; RL Repbase Reports 9(4), 800-800 (2009). XX DR [1] (Consensus) XX SQ Sequence 151 BP; 31 A; 15 C; 51 G; 54 T; 0 other; ttcccagggt taggctagtt agcctgcttg tggtttggta ggggtggttt ctaggggtta 60 agcgggttag gggtaggtat ttaggtgatt agttggctta tggtgtgtgg cttaattatt 120 ttgagaaaaa ataaagcgat ttaactcgct g 151 // ID Pifo_LTR repbase; DNA; INV; 444 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.03, Created) DT 05-JAN-2009 (Rel. 14.03, Last updated, Version 1) XX DE Pifo_LTR is a long terminal repeat from Pifo. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Pifo_LTR. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-444 RA Bartolome C., Bello X. and Maside X.; RT "Widespread evidence for horizontal transfer of transposable RT elements across Drosophila genomes."; RL Genome Biol 10(2), R22-R22 (2009). XX RN [2] RP 1-444 RA Bartolome C., Bello X. and Maside X.; RT "Pifo_LTR is a long terminal repeat from Pifo."; RL Direct Submission to Repbase Update (05-JAN-2009). XX DR [2] (Consensus) XX CC TSD is 4-bp long. XX SQ Sequence 444 BP; 153 A; 122 C; 77 G; 92 T; 0 other; agttaccaca gtcaccacac cccctaaacc cccacgccta caccactgaa cacatcgacg 60 cccatgggca ctccggtaca acgaaccggg aaacgaataa acaaatgggt caacaatgta 120 tctacaatgt atcgacatcc ggccaaaatg ctgacactaa catcagcaga ccagacggca 180 agaatgccga tgcagcataa gaacctagca taacccttat atataagaac tatgtactta 240 gatttgtagg cttcttgaga agaacgaaag aaatatagaa tcactcttaa gtttgaaccc 300 aaagcgtgaa gttgtgttac tgaatcccat acaagtgcct gaacaaaatc tcatgcaaag 360 tgccgacagc cgactataag cgaaccaccc aacatcttta ctgctgttat attcacctcc 420 catcattcgg cccgaatggt aact 444 // ID Kolobok-2_HM repbase; DNA; INV; 2618 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 2) XX DE Kolobok-type family. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2618 RA Jurka J.; RT "A distinct subgroup of Kolobok-type DNA transposons."; RL Repbase Reports 8(12), 1818-1818 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 253..2046 FT /product="Kolobok-2_HM_1p" FT /translation="MGKSCRVKQSTQSKKSRRFHGNRYTKNLTNNSTLSVA FT PVNNNVTCSENDLIFNNSSLDLSISSKKVEKIETPHQSNNKTVGYRIIDIE FT ILSGVISQLCCKNCLQSGLFLEECFPKKKGFSSMLNICCPNCNKKIEFYTS FT KTCSGKKNFDVNNRIIYSMRACGQGYSGLEKFTSLMNIPTPMTKNNYNKII FT KHVSDCAKVVAKETMSDAVKDICQSSSNIVDTAVSVDGSWQRRGFSSLNGV FT VTAISMDTGKILDCEPMSRSCKACSLKLKLKENNPNAFEAWKSSHVCKLNY FT RGSAPGMEVTGAQRIFSRSISQNKLRYVKYYGDGDCKSYSYVKDTYPGITV FT CKLECVGHVQKRVGSRLRILKKNVKGLGGKGKLTNAIIDRLQNYYGIAVRQ FT NANDLEGMKKAILATLFHVASSAENNWHAHCPDGINSWCRYKQDKANLTST FT YKPGAGLPLSVVSHLKPIYIDLSQEELLKKCLHGKTQNQNESFNGMIWQRI FT PKTKYVSLTQLELGVYDAVSNFNIGRKASILLFEKLNMIPGIYTLQGCSTM FT NKKRLIFAEYQNRITSKKRRKVIRGCKKKKDDAENDQEGCLYDPGSCSYK* FT " XX SQ Sequence 2618 BP; 930 A; 366 C; 433 G; 888 T; 1 other; attaaggtat aagacccctc aaaattccta aaaaattcaa aaaaaataat ttgtgtattt 60 cttacacgaa atttatcgct gaatccgaat ttgaaaacyt tttttaaaaa taaggcttag 120 ttcttgagat ataaaaggtt tccttcaact gggtaaattt gatctgctta gcaacacgca 180 tagcaacggg ttttttataa gaattgccag ctatttagtt atttttgtgt tatttaataa 240 actttttcaa taatgggtaa atcatgcaga gttaaacaaa gtactcaaag taaaaaatcc 300 agaagatttc atggaaatag gtataccaaa aatttgacaa acaattcaac tttatcagta 360 gctcctgtta ataacaatgt tacctgttct gaaaatgacc ttattttcaa taatagttca 420 ttagatttat ctatatcctc gaaaaaagtt gaaaaaattg aaacaccaca tcaaagtaat 480 aataaaactg ttggttatcg tattattgac attgaaattt tatctggtgt aataagtcaa 540 ttatgttgca aaaattgttt acaatctgga ttatttcttg aagaatgttt tcctaaaaaa 600 aaaggtttta gctcgatgtt gaatatatgc tgccctaatt gcaacaaaaa aattgaattt 660 tatacttcaa agacatgctc tgggaagaaa aattttgatg taaataatag aataatttat 720 tctatgcgtg catgtggtca aggttattca ggattagaaa aatttacttc tttaatgaac 780 attcctacac caatgactaa gaacaattac aacaaaataa ttaaacatgt ctctgattgt 840 gctaaagttg ttgcaaaaga aacaatgtct gacgctgtta aagatatttg tcaatcttct 900 tcaaatattg ttgatactgc agtatcagtt gatggtagct ggcaacgtcg tggattttcc 960 tcattaaatg gagtggttac tgctatatca atggacacag gtaaaattct tgattgtgaa 1020 ccaatgagtc gatcatgtaa agcatgtagt ttaaaactaa aacttaaaga aaataaccca 1080 aatgcctttg aagcttggaa atcatcacat gtatgtaaac taaattatcg tggctctgct 1140 cctggaatgg aagtaactgg cgctcaaaga atatttagcc gttctatttc acaaaataaa 1200 ctgaggtatg taaagtatta tggtgacggt gattgtaaaa gctattcgta tgttaaagat 1260 acttatcctg gtataacagt gtgtaagttg gagtgtgttg gacatgtaca gaaacgtgtt 1320 ggtagcagat tgcgaatttt gaaaaaaaat gtaaaaggac ttggaggcaa aggcaaatta 1380 acaaatgcta taattgatcg cctacaaaac tactatggaa ttgctgttag gcaaaatgcc 1440 aatgatttag aaggtatgaa aaaggccata cttgcaacat tgtttcatgt tgcatcatca 1500 gctgaaaata attggcatgc tcattgccca gatggcatca atagttggtg tcgttataaa 1560 caggataaag ctaatttaac gtctacctac aagccaggtg ctggtcttcc actctcagtt 1620 gtcagtcatc taaagcctat atatatagat ttatctcaag aagaactctt aaaaaaatgc 1680 cttcatggta aaacccaaaa ccagaatgaa agttttaatg gaatgatatg gcagcgcata 1740 cctaaaacaa agtatgtttc attaacgcag ttggagttag gagtttatga tgctgtttca 1800 aattttaata ttggaagaaa agcaagcata cttttgtttg aaaagttgaa tatgattcct 1860 ggtatttata ctctccaagg ctgttcaacc atgaacaaaa agcgcttaat ttttgcagaa 1920 tatcaaaatc ggattacatc taaaaagcga cgaaaagtta ttcgtggatg caaaaaaaaa 1980 aaggatgatg cagagaatga tcaagaaggt tgtctttatg accctggttc atgttcatat 2040 aaataaaagt ttttcaattg agtatacgag agaaaatatt atgaaaattt atgaatataa 2100 tattcaaacg agtttttctc aaaatgtaaa tttttgacct cccggccatc ctatctcatt 2160 aaccgtaagt taaatttgta tgaaattttc accaaatgtt tattgcatct ggttgtacaa 2220 ttggaactaa aagtttttta atgcaatcaa gagtttttga tttatagaag tttcaagtta 2280 caaaaaaatc atatttttag tacgccattt tgaaaatttc attataggtc aagtactaag 2340 tttttttaaa taattttagt tcagtttgta gatcaatatg ttattaacca tatgccaaaa 2400 attcataacc tacatgcttt cagttactga gatagcttgg ccggggtaac ttaaatattt 2460 ttaaatgtcg gccattttac ttggtagcca tgacaattaa taaatttatt ttgtattttt 2520 ttaaatcctt ttatctagtt gtttctcaag ataaaattgt ttttttgtta aaaaactttt 2580 tttaataatt ttttttttga ggggtcttat accttaat 2618 // ID BEL-114_AA-LTR repbase; DNA; INV; 646 BP. XX AC AAGE02020048; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-114_AA_; KW BEL-114_AA-I; BEL-114_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-646 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020048; Positions 114632 115277. XX SQ Sequence 646 BP; 185 A; 167 C; 125 G; 169 T; 0 other; tgttggcgcc aacgccaaat tagcgtctca ttccgttaac gaaaatgcca cattacaaat 60 tttgttattt ccattcatcc cgaccgctcg cgcaaggtac agttgtgtaa ttttcattca 120 aacctactga accacaccac cgctttgcaa agaaaatcaa accaccacgg cgcagcttga 180 aaaatatcaa taataagctc cgcctttcag ctggaagaag gtaatcgtat gttgccatct 240 ccactcttct tattatataa gacgcatagc agtaagagaa atcatcagtt ttcgtttgta 300 tctcaaagag tgaacaccaa taaataagtt aaggaaaagt tcctgttctt ttttgcatcc 360 cggccaaagc agtcctcctg agcaaaaatc caatacggcg atccatcctg agacgaagca 420 tacattccac ttgagaagac ttccgcctgc gaagctgtgt agcattcacc gatttggctt 480 gagtggttcc cccccgctgt cctttttagg acaattgagt tgctaagaac cacaagatag 540 ccaccatccg ctcttgcgag agcactttga gagaaattta gagctgcaca gtttcgaggt 600 cggacagttc cgattttccc actggagccg ggctagcgct ccaaca 646 // ID hATm-1_SM repbase; DNA; INV; 3325 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 06-JAN-2010 (Rel. 15.02, Last updated, Version 2) XX DE hATm-1_SM, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hAT superfamily; KW Autonomous DNA transposon; hATm group; hATm-2_SM; hATm-1_SM. XX NM hATm-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3325 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1045-1045 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM, hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-1_SM is a young family of hATm autonomous DNA transposons CC identified in the planarian genome. The consensus sequence was CC built based on multiple alignment of 10 copies that are only 1% CC divergent from the consensus. TIRs are 14-bp long. XX FH Key Location/Qualifiers FT CDS 682..2946 FT /product="hATm-1_SMp" FT /translation="METRGKSNVYLLGSAESDILGAKLPSHRQTFGYFMHL FT HKVENLTVRDSSRQTIEKVFGFWHKAGIPVRAAKHCIDKLEAFFAEWKGLQ FT KHKNRITDGHKMKEDEFTSRLDNLFDIAHSDALTLITIPEDRAFLLSQRQE FT GRPGSIGAVDKVEELKTRRRQDRKEAELKRKKRSQEEIEAYSSQAALKTAL FT SSDDDAVDEYDDDEYFDKSKTCSRSDEVCVGPKRAKINVLSPGLTSALDRT FT KISSRNATFVLSEVASSLGHDVNTLNINRSSIHRARASHRAARSSCLRSEF FT STTVPLTVHWDGKLMEDLSTHEHVDRLPVLISGVGVEQLLGVPKLSAGTGE FT AQASAVVQCLEEWGVAEQVAAVCFDTTASNTGHRSGACSLIEHKLNKDLLY FT LACRHHIMELIVGAAFETTSVGTSSGPEIQIFKRFRDYWQFIDRDGFQVAS FT SDPSVELLVAPYRSDILSFARTYLQAEHPRDDYREFLELAVIFLGDVPDRG FT IRFQVPGAMHRARWMAKVIYSIKMFLFRSQFKMTASEERGICDVATFSVVI FT HLKAWMTAPLAVEAPLNDFTLMGQLRSYPHAAISAATSKKLGLHMWYLSEE FT LIGLALFDSRLSHDSKRLMIAAMDEEAPDHPSKRPSIKSDAFLGKRGLEQF FT CTVNSKKLFQLLDLPVTLVNKDPSDWEQDESYTRALGIVKSLAVVNDRAER FT GVALIQDFNKKLTKNEDQLQFLLQIVNEHRRQFPDCTKRNLATCASGQQSD FT TVKL" XX SQ Sequence 3325 BP; 961 A; 654 C; 722 G; 988 T; 0 other; tagggtggcc cttaaaacaa aaagtttgaa atcatcttct ggcaccccac taatatgttc 60 cacttacaga aataaaatta tgtgcaaaaa atcatgcaga ttgggcaata tttagaggtg 120 gccctcaagc tttgaagttt gactaaatcc gtgaaattta aataaatcgt taagttttaa 180 acctgtttcg tgttttgtat aaatatgatg tattgtaact gtacaatata ttaacttaca 240 tactacatca tcattgcatc atcccaatga caaaataaat attctttggt acagtagaca 300 ctgtgtgcaa atttgcaaaa aagcttacaa cccttaaata ataaataatt tgactgctat 360 cattgaattg cttagtcaat acttatctgt gtctgccaag tctttaattg cagcactagt 420 gcgctagatt ggtctcactg gattgtagtg gattcaagtg gattgcttgc attcaaggta 480 aatgtaattt tttctattaa tccaatttgt ctattatttg aatgggttta attacttctg 540 ttaattcatt attctatcat tcgttactca ttactcaata ctcaatagtc attatttgaa 600 gtgattagta aacgcctagt ccataattat gtgttgtcac ttgtcagttt ttgatataat 660 gtttattttt atagattaaa aatggagacc agaggaaaat caaatgtcta cttgctcggt 720 tcagcggagt ctgacattct gggagcaaag ttgccatcgc atcgccaaac ttttggttat 780 ttcatgcatc ttcacaaagt tgagaatctg accgttcgtg actctagtcg acaaaccatc 840 gaaaaggtgt ttggattttg gcataaggct ggtattccag ttcgagcagc caaacattgc 900 atcgacaagt tggaagcatt ttttgctgaa tggaaaggtc tccaaaagca caagaatcgg 960 attaccgatg gacacaagat gaaggaggat gaattcactt cacgcctaga caatctcttt 1020 gacatcgcac attcggatgc actgactttg attactattc ctgaagatag agcgttcctc 1080 ctctctcaaa ggcaggaagg ccggccaggt tcaattggag ctgttgacaa ggtggaagaa 1140 ctgaagacac gccgtcgtca ggatcggaag gaggcagaac tgaagcgcaa gaagcgatca 1200 caagaggaaa ttgaagcata ttcgagccag gctgctctta aaactgcttt atcttcagat 1260 gatgatgcag ttgatgagta tgatgatgat gagtacttcg ataaaagtaa aacttgcagt 1320 cgatcggatg aagtctgtgt tggacccaaa agagcaaaaa tcaatgtgtt gtctcctggt 1380 ttaacttcag ctttggacag aactaagatt tcatcacgga atgctacctt cgttctcagt 1440 gaagtagctt ccagtcttgg ccatgatgtg aacactttaa acatcaatcg aagctcaata 1500 catcgcgcta gagcaagtca tcgggctgcc agatcgagct gtctgcgatc agagttttca 1560 acaactgtac ctttaacggt tcactgggac ggcaagctga tggaagactt atcgacacat 1620 gaacatgtag accgtcttcc agtccttatc tcaggtgttg gagtcgaaca gttacttggc 1680 gtcccaaagc tgtcagctgg gacaggtgaa gcacaagcat ctgcagtcgt tcaatgtttg 1740 gaagagtggg gtgttgcaga gcaggtcgct gcagtttgtt ttgacacgac agcatctaat 1800 actggtcatc gttcaggagc atgttctctg atcgagcaca aactaaacaa agatcttcta 1860 tacctggcct gtcgtcacca cattatggag ttaatagttg gggctgcatt tgagacaaca 1920 tcagttggca cttcatctgg gccggaaatt cagatattca agaggttcag agattactgg 1980 caattcatag atcgtgacgg ttttcaggtt gcttcttcag acccgtcggt tgagttgtta 2040 gttgcaccat atcgttctga catcctgagt tttgccagaa catatttgca agctgaacat 2100 ccaagagatg actacagaga atttctggag cttgcagtca tatttcttgg agatgttcct 2160 gatcgcggaa ttcgatttca ggtgcctggt gcaatgcatc gggcacgatg gatggcgaaa 2220 gttatctact caatcaaaat gtttctcttt cgcagtcagt ttaagatgac tgcttcagaa 2280 gagcgtggta tttgtgatgt cgccactttt tccgttgtca tccacctcaa agcctggatg 2340 actgcaccac tggctgttga agcacctctg aatgacttca cgcttatggg acaacttagg 2400 agctatccgc atgcggctat ttctgctgct accagcaaga agctcgggct acatatgtgg 2460 tacttgtccg aagagcttat tggtctggct ctgtttgact ccagactatc acatgactcc 2520 aagaggctaa tgattgctgc aatggacgaa gaagctccag atcatccatc gaagcgaccc 2580 agcataaagt ccgatgcatt cctgggaaaa cgaggtttgg aacagttctg cacagtgaac 2640 tccaagaaac tgttccagtt gcttgatcta cctgtaacat tggtcaacaa ggatccatct 2700 gattgggaac aagatgagtc atacacgcga gcgctgggca ttgtcaagag cttggcagtc 2760 gtcaatgaca gggctgagcg tggagttgct ctcattcagg attttaataa gaaattgacc 2820 aagaatgaag atcaactcca gttccttctc caaattgtga atgaacatag gcgccagttc 2880 cctgactgta ccaaacgaaa cctggcgact tgtgctagtg gtcaacaatc tgacactgtg 2940 aaattgtgaa tgttgtgaaa taataataca gcgaagaaca gttggcagag agacataatt 3000 cttaacattt tcagttggag ctatttctga atcagtgaca ttattgttta tttgttgtac 3060 agtaaactaa cctgttcagg ctgttgtgtg tagaataaat gtgatttaaa aatgtaagaa 3120 acctttaaca atttgcaata tcgactatcg agcttgaaat tgtggtacat acaaaacttc 3180 aatgcctgag ggccacctct aaatattctt caattgtaat gatgttttac acaattcatt 3240 atttcaatat ttggcacaga ttggatgggt gccagaattc aaattatgaa tttcattttt 3300 ttgccctatg ctaagggcca cccta 3325 // ID piggyBac-12_SM repbase; DNA; INV; 2445 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-12_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2445 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 531-531 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-12_SM is a young family of piggyBac transposons, CC characterized by 18-bp TIRs (3 mismatches) and TTAA target-site CC duplications. The consensus sequence was reconstructed based on CC multiple alignment of 8 copies, which are ~98% identical to the CC consensus sequence. XX FH Key Location/Qualifiers FT CDS 347..2134 FT /product="piggyBac-12_SMp" FT /note="piggyBac transposase." FT /translation="MLLLIFIIDLFINFREISLKMSRFLNDEEIRLLAEIQ FT NSDEEYKYSSDESEDTDHVSVNSYHSDAESEEELPIRQELPAAPGNDNTHP FT PAKDHSVWYCFPQTTGRTPAHNIIRGSTNKVILPPGTTIEDPVDAFHLIFP FT DNLLSVITEMTNKEAIRVHAKNQYKKQWKPVDNIEMGAFIGVLLTAGHLKS FT SDVSIDRLWSKKYGPPIFRAIMSKNRYHEICRFIRFDDKDSRHERRANDKL FT APIRDLWEQININLGKFYMPSCNLTVDEQLFGFDGKCPFKQYMPSKPEKYG FT IKTFWINDSQTGFPLGGIPYLGKEGSQRAINLGEKIVEKLCIPYFGSNRNI FT TIDNFFTSKQLALNMKSCGLTITGTLRKNKSFIPKNFHSNRSREEFSTIFG FT FTKSLTLASYVPRKGRAVILLSTMHHSDTIVQENKRKPEIIIDYNKTKGGT FT DLMDKMLKTYTCKRKTNRWPVAFFTNILDISALAAYLIYTNIYKNWNLKKN FT DRRNSFLEELSESLILPNLQRRQHQNLRNKDLKESISFINCSIGKKRENST FT APSTSSVKKSRKQCHLCETERRIFQCCIRCAQNTCNNHAKMICDKCLNQ" XX SQ Sequence 2445 BP; 894 A; 387 C; 416 G; 748 T; 0 other; cccttaacct cactctaccg gccgcagcgg ccgtgtgcac ttttacctag ctatatcttt 60 tcttagaatt aacaaccatt aaaatgggct taggtaatct tctagttcca tgaaaggatg 120 tgaaattttt cgaaaaaaaa tattaaaaaa atttttttta caaattgcac gtgtttataa 180 aaaaactccg ccatcccctc tctaacggcc gcagcggccg acgagaatta gttgtcgttt 240 gtccttcgct gagagtgtta ttgaaagttt ttaaaaaatt taactgtcgt tattttgtat 300 acaggtatgt attataataa ttaaatcaac aacttctatt atactgatgt tattattaat 360 tttcattatt gatttattca ttaattttag agaaatatcc ctaaaaatgt ctcgcttctt 420 gaatgatgaa gaaattcgtc tgctagcaga aatccaaaat tcggacgaag aatataaata 480 cagtagtgac gaatcagaag atactgatca tgtgagtgtc aattcttacc acagcgatgc 540 tgagagcgaa gaagagttgc caatacgtca ggaattacca gctgcacctg gtaatgacaa 600 cactcatcca ccagcaaaag atcatagtgt gtggtattgt tttcctcaaa ctactggaag 660 gactcctgca cataatatta ttagaggttc aaccaacaaa gtcattttac caccaggcac 720 tacaattgag gatccagtgg acgcctttca tttaattttc ccagataatt tattgtcagt 780 aataacagaa atgactaata aggaggctat cagagtgcac gcaaaaaatc aatataaaaa 840 acagtggaaa ccagttgata atatagagat gggtgcattt attggggtgt tgttgacagc 900 tggtcatcta aaatcgtcag atgtaagtat tgaccgatta tggagtaaaa aatacggacc 960 tccaattttt cgagcgatca tgtccaaaaa tcgatatcac gaaatttgtc ggtttattag 1020 atttgatgat aaagactcga gacatgaaag gcgtgccaac gataaacttg cacctatcag 1080 agacttgtgg gaacaaataa atattaattt gggaaaattt tacatgccat catgtaatct 1140 aactgtggat gaacaattgt ttggttttga tggtaagtgt ccattcaaac aatatatgcc 1200 atcaaaacca gaaaaatatg gaatcaaaac attttggatc aatgattcac aaacaggatt 1260 tccattagga gggattccat atttaggaaa agaaggaagc caacgtgcga tcaatctcgg 1320 cgaaaaaatt gtcgaaaaat tatgtatacc atatttcgga tcaaatagaa atataacaat 1380 cgacaatttt tttacttcaa agcaattggc tctaaatatg aaaagttgtg gcctaaccat 1440 aactggaacg ttgcgaaaaa ataaatcatt tattcccaaa aattttcatt caaatcgttc 1500 tcgggaagaa ttttcgacaa ttttcggttt tacaaagagc ctcactttgg cgtcatatgt 1560 accgcgaaaa ggaagagcag taattttact gtccacaatg caccatagtg atacgattgt 1620 acaagaaaat aagcgcaaac cagaaattat aattgactac aataaaacga aaggaggcac 1680 tgatcttatg gataaaatgc tgaaaacata cacatgcaaa aggaagacaa acagatggcc 1740 tgtagcattt tttacaaata ttttagatat cagtgcattg gcggcgtatc ttatatacac 1800 caatatatat aaaaattgga atttaaaaaa aaacgatagg agaaattcat ttttagaaga 1860 acttagtgag tcactaattt taccaaattt gcagcgaaga cagcatcaaa acttacgcaa 1920 caaagacttg aaagaatcga ttagttttat aaattgttca attggaaaga aacgcgaaaa 1980 ttctacggca ccttcaactt catcagtaaa gaaaagtcgt aaacaatgcc atttatgtga 2040 aacggaaaga agaatattcc agtgttgtat acgttgcgct caaaatacat gtaataatca 2100 tgctaaaatg atttgtgata aatgtttaaa tcaataaaaa aggattgaaa ataatataca 2160 tagttttatt ttgaattttt aatctcttaa tttttttatt tgaatggtaa aaagttttat 2220 tttgtaatca aaatgtatca tgtttatttt gttaatttat taattagccc acttttattt 2280 aataaaaata aataaaatat atgtaagaaa tctaaatttt aatttaaaat atcataattg 2340 atagttttac gcatttaact attctaaatt tcctataaat tgatgccggc ctcttaggcc 2400 gctaggcgta ggatgactga caaaacagta gtatgaggtt aacgg 2445 // ID I_Ele34 repbase; DNA; INV; 6727 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele34. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6727 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6727 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (26-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >96% identity, and ~98% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 384..1745 FT /product="I_Ele34_1p" FT /translation="MSGASPGPPGGTERTICRSNVPEWMLGPDDLGQTMVL FT VLRRKLKENDXXDQTQXKATLPDSIIVGASIELKIGVKEARSTNATREGRG FT TRYILRTNSRSILEKLTKITHLTDGTEIEIAPHPTLNTVQGVVYDLDTVNK FT DENLILDYLKPQGVQSVRRIKKRVNGTLKNTPLLVLSFNGTVVPDYVYFGL FT LRIXVRPYYPSPMICFNCGSYGHSRKFCQQPSICLRCSVQHDTPEGEQCTN FT PPSCLHCKAEHQATSRDCPKYQQEEKIVRLKVDRNVSITEARRIFAEENKQ FT ETFXGVVQEQIQQQLAAKDVLIVTLQKQVAALTKEVAVLRNVLKSSTQSQS FT TSQRDQQIATTRQRSSSKNPSNTTTPQNQPAQRDRLSRKDHCFITPQTDQL FT ENLRNNREQYNVQTRSRSSKRHMEISPTETVNNRGKRISAQSNTSSASTNA FT ETNNGPGSS" FT CDS 1729..5898 FT /product="I_Ele34_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MGQDHPKAIHQTTNKNDPATKHDKTLNKRLPPPTFGN FT LDNCKNSDNDSDQQKRTSAXSTTTXIGKVTTTTXVPKHSALTAAEYDRIHH FT LXSIPSTSKTMSVLQHDSPRARPPTSTLTRDVPDYSDEPLAAVGVGSPLXR FT ASIFSSXSSQEQTTASLFPSNSGRSCIKSSVEVFQCSDSPSARPPTSTLTR FT NVPDYSDELLAAVDVGRPYSRASILTPLFYREQSNPAFFHTNAEKHTTNIN FT FXQNTPLSSSAPTDRPSSITSPLRVTATPPHPDSPSARLPTSTLTRDVPEL FT SDEPLAAVGVGWPYTRASITTLFQYKNPSTSAFFLANAGSSTEVIPGADAI FT NQPASTHHQRNDKAVTSTSHSTTVWKDTGEVISESNTPDRRSVQPPTTNTN FT HNTTSRPISPSPSQASTDSEPFPSGGNAAPSFAVQWNISGLRSHLAELQLL FT IGKHQPIIIGLQETNADSNRLDQNCLGKNYKLFLSHCSPHGRQGAGMAIKI FT GTPFQLINLKTKIQAVAVKLYTPIPITVASIYLPPKDKNAVEALDELLKEL FT PKPTLILGDVNAHHTAWGSRTSNSASECKKRGETILEMIVQHDMVILNNGS FT HTRIDPSTGASQALDVSFCSATHAAKFKWNTLLEYSGSDHLPILIDTHSXR FT XETQHRPKWIFGKANWELYEQIITNSICSGAILTVDEFTNKIISAAESSIP FT KTTGNIGRKYVPWWNSEVEATIKARRKCLRALRRLEDNDPRKIAALKQFQE FT ARSICRKTINDAKLKSWESFVESINPDTPTSQVWNNINRLQGKRKNSTIAL FT NLPTGHTNDGEKVANALADEYQKKSSDEHYSEQFRRKHKADNCETFKNQRS FT NLHKXYNTDFTIXELIWALNRRGGSSTGADNVSYTFLQRLPFSAKVALLNL FT YNRIWDSGIFPSQWKVGTVIPIPKHDADRSKPEGXRPITLLSCLGKLFERM FT VNRRLITELETSGKLDXRQHAFRSGKGVDSHLATLESHIDLKPDEHVEIVS FT LDVSKAYDTTYRPAILRTLSRWRISGRMMNIVSSFLCDRYFRVAANGSLSS FT LRKAHNGVPQGSILSVTLFLVAMQPIFDVIPSGTEILLYADDVLLIVKGKN FT HGDIRRIMKKSVKAAVGWATDVGFAIAPTKSKLMHVCSSRHRKCGRAIKID FT NIPIPRVRSLKVLGVVIDSKLNFIKHFSSIKQSCQRRTQILRILGYRLKRS FT SRATLLKIGSALVHSKLYFGIGLTSGNIADMERILGPTYNDVIRQATGVFA FT TSPIISIMAEAGCLPFHFAIIQRLTQLAVRLLEKTEDVAGLPLIQRVRTLL FT MDLTGWCLPDICKTLRTADRAWHSPSPTIDLVLNVISRPVKTRSLHCLSSM FT ILSRLATVDMXNCIRMAPKITDISAPV" XX SQ Sequence 6727 BP; 2090 A; 1754 C; 1389 G; 1462 T; 32 other; cattcgtgct cgggatacag caacccatca tcacgcgcgg atagctcata ttaaattagc 60 tcttttctgc ataattacca cctttaaata tccgccgctt gtggccaawc gatagacgtt 120 acgtgaatct acgccggtat agtgaagttt ttgtgaattc taatgcaacc gaagtgattt 180 ttacaataaa ctgttggtgt acatatcacg cggttgcgtc gcagtgtatt gatttatcca 240 cacatacgcc gtcgctgtat agtgaaaaca aagcacaata taccgagtga taggtgataa 300 cgaagaagga tcaaatccag tttcgttgaa aaagttttgc cgataataaa agtggaatag 360 atagtactca acaatccgga ctaatgtccg gcgcatctcc tggcccccct gggggcacag 420 agagaacgat atgccgcagt aatgtgccag aatggatgct tggkccagat gacttgggac 480 aaacgatggt gcttgttttg cgtcgcaagc taaaggaaaa tgackctgkt gatcaaacgc 540 aagawaaagc aactttgcca gactcaatma ttgtgggagc ttccatagag ctgaaaatag 600 gagttaaaga agccagatca acgaatgcaa ctcgtgaagg tcgcggaaca cgctacatcc 660 ttcgaactaa ctctcgaagc atactcgaaa agctcacmaa aatcacccat ctaactgacg 720 gcactgagat cgaaatcgct ccccatccta cactcaacac tgttcaaggt gtagtctatg 780 acttggacac cgtcaacaaa gatgaaaatc tgatwctgga ctaccttaag ccacagggcg 840 twcaatcagt aagacgcatc aaaaagcgag tgaatggcac cctcaaaaat acaccgctgt 900 tggtcctttc gttcaacggc acagtagtac ccgattacgt ctactttggc ttacttcgga 960 tcscagtccg tccctactac ccctccccta tgatttgctt taactgcgga tcstatggac 1020 attccaggaa attctgccag cagcccagca tctgccttcg atgttcagta caacacgaca 1080 cgcctgaagg ggaacaatgc accaatcctc caagctgtct tcattgcaaa gctgaacacc 1140 aagcaacttc gcgcgattgt ccaaaatacc aacaggagga aaaaattgtc cgtcttaaag 1200 tggacagaaa cgtttctatc actgaagcaa gacgcatctt cgcagaagaa aataaacagg 1260 aaacattckc aggggttgtc caggaacaaa tccaacaaca actggcggcc aaagacgtcc 1320 tgatagtcac actgcagaaa caggtggcag cactgacgaa agaagtcgcc gtattgagga 1380 acgttttaaa atccagcact cagagtcaat caacatcgca acgtgatcaa caaatagcaa 1440 ccacccgtca aagatcgtcg tctaaaaacc catcgaacac tacgacgcca caaaaccaac 1500 cagcacagcg cgaccgttta tctagaaaag atcattgttt catcaccccg caaactgacc 1560 aactagaaaa cctcagaaac aacagggagc aatacaatgt acaaacgcgc agcagaagta 1620 gcaagcggca tatggaaata tcgcctaccg aaaccgtaaa caaccgggga aagcgaatat 1680 cagcacagtc gaatacaagc agcgcatcca ccaacgccga gacaaataat gggccaggat 1740 catcctaagg caattcacca gaccaccaac aagaacgacc ccgccacwaa acatgacaag 1800 actctaaata aacgacttcc acccccgacc ttcggcaatc tcgacaattg caaaaactcg 1860 gacaacgact cagatcaaca aaaaaggacc tccgcagwct caacaaccac aaakattgga 1920 aaagttacca ccacaactgm tgtaccgaaa cattctgccc taacagcggc cgaatacgat 1980 agaattcatc acctcawtag catcccatca accagcaaga caatgagtgt cttacaacat 2040 gattcgccac gcgcgaggcc ccccacgtcg accctgacca gagatgttcc ggattattcc 2100 gacgagcctc tggcggctgt tggcgtgggc agcccactcm accgggcaag tattttctca 2160 agttwttcat ctcaagaaca gacaactgcc agccttttcc cctctaattc aggacgttcc 2220 tgcataaaat catcggttga agtcttccaa tgctctgatt cgccaagcgc gaggcccccc 2280 acgtcaaccc tgaccaggaa tgttccggat tattccgacg agctcctggc ggcagttgac 2340 gtgggccgcc cgtattcccg ggcaagtatc cttacaccac tattctatcg agaacagtca 2400 aatcccgcct ttttccatac caacgcagaa aagcacacga ccaacatcaa ctttmctcag 2460 aacacaccct tgtcgtcttc agcaccaacc gacagaccta gcagcatcac atctccacta 2520 cgagtgacag caaccccacc acaccctgat tcgccaagcg cgaggcttcc cacgtcgacc 2580 ctgaccaggg atgttccgga gctctccgat gagcccctgg cggcagttgg cgtgggctgg 2640 ccctataccc gggcaagtat cacaacacta tttcaataca aaaatccgtc cacttccgca 2700 tttttcctcg ctaatgcagg atccagtaca gaagtcatcc ccggtgccga tgcaatcaac 2760 caacctgcta gcacacatca tcaaaggaac gacaaagcag taacttcaac atcgcactca 2820 acaaccgtct ggaaagatac tggcgaggtc atcagcgaaa gcaatacacc cgaccgacgc 2880 tccgtgcaac ccccaacgac gaataccaat cacaacacta ctagcagacc catctcaccc 2940 tcaccatctc aagcatcgac tgactccgaa ccgtttccct ctggtgggaa cgctgctcca 3000 tcctttgccg ttcagtggaa cattagtggc cttcgctcgc acttagccga actccagctc 3060 ttgattggca agcatcagcc tatcattatt ggcctgcaag aaacaaacgc tgatagcaat 3120 cgattagacc aaaattgtct tgggaagaat tacaaactct tcttaagcca ctgctccccg 3180 cacggtaggc aaggtgctgg aatggccatc aagattggaa cccccttcca gttaatcaat 3240 ctgaaaacca aaattcaagc ggttgctgtg aaactttaca caccgatacc aattacggtt 3300 gcctcaatat acctcccacc gaaggataaa aatgccgtcg aagcgctgga cgagcttttg 3360 aaagagctgc ccaaaccaac tctcatactt ggtgacgtaa acgctcacca tacggcatgg 3420 ggaagtagga cgtccaattc agcatctgaa tgcaagaaaa ggggtgaaac catactggag 3480 atgatagtgc agcacgatat ggtcattcta aacaacggtt ctcacacacg tattgaccca 3540 tccactggag catcgcaagc cctcgatgta tcmttttgct ctgcaacaca tgcagctaaa 3600 ttcaaatgga acactcttct agagtactct ggtagcgacc acctcccaat tctaattgat 3660 acccatagca awagagktga aacacaacac agacccaaat ggatatttgg taaagccaac 3720 tgggagctct atgagcaaat catcacaaat agtatttgtt cgggggctat tctgacggtg 3780 gacgaattca ccaataaaat tatatctgcc gcggaatcca gcatcccaaa aactacaggg 3840 aacatcggac ggaaatatgt accatggtgg aacagtgaag tggaggcaac aatcaaagct 3900 agacgaaaat gcctacgagc cctccgtcgt ctcgaagaca atgacccacg gaaaatcgcc 3960 gctttgaagc aatttcaaga ggcccgttcc atttgtcgca aaactatcaa cgatgcgaaa 4020 ctcaaaagct gggagagctt tgtagaaagc attaaccctg acacaccaac ttcccaggtg 4080 tggaataaca tcaacagact tcaaggcaaa agaaaaaaca gtactatagc gttgaacttg 4140 ccaactgggc acaccaacga cggwgagaaa gtagcaaacg cwctggctga cgaatatcag 4200 aaaaaatctt cggatgaaca ctactctgaa cagtttagaa gaaaacacaa agcagataac 4260 tgtgaaacct tcaaaaatca gcggtccaac ctccacaagc wctacaacac agacttcaca 4320 atagawgaac tgatttgggc acttaaccga cgaggtggca gctcgaccgg cgcagacaac 4380 gtgagctaca ccttcctgca acgccttccc ttttctgcaa aagttgccct tctcaacttg 4440 tacaacmgaa tatgggacag cggaatcttc ccatcgcagt ggaaagttgg tacagtaata 4500 ccaatcccaa aacatgatgc agatcgtagt aaacctgaag ggtwtcggcc tataaccctt 4560 ttaagttgcc ttggtaagct gttcgagaga atggtgaacc gtaggcttat caccgagctg 4620 gaaaccagtg gcaaactcga tgsacgccaa catgccttca gatctgggaa aggtgtcgac 4680 tctcatctgg ccacactaga atcacacatt gacctcaaac cggatgagca cgttgaaatc 4740 gtctctcttg atgtctcaaa agcctatgac accacatata gaccagcgat tctacgtaca 4800 ctatccagat ggcgtatttc aggaagaatg atgaacattg tttccagttt tctctgcgac 4860 aggtactttc gagttgctgc aaatggctcg ttgtcttcac tsaggaaagc acacaacggt 4920 gtgccccaag gctcgattct ttccgtgaca ctttttcttg tggctatgca gcctattttt 4980 gacgttatcc catccggtac agaaatcctt ctctacgcag acgatgtgct ccttattgtg 5040 aaggggaaaa atcacggaga tattcgtagg attatgaaaa aatctgtcaa agcggctgtc 5100 ggttgggcca ctgatgtggg atttgctata gctcctacaa agtccaagtt aatgcatgtc 5160 tgctcctcac gccaccgcaa atgtggtcga gcaatcaaaa tagacaatat tcctatccca 5220 agagtacgaa gcttgaaagt tctgggcgta gtaatcgact ccaaattaaa ttttataaaa 5280 cacttttcgt ctataaagca aagttgccaa agacgaactc aaatccttcg aatacttgga 5340 tatcgcctca aaagaagcag cagagcaaca ctcttgaaaa taggaagcgc actcgtccac 5400 tcaaagctgt acttcggcat agggctaaca agtggcaaca ttgcagacat ggaaagaatt 5460 ttaggtccaa catacaacga tgttattcgg caagcaacag gggtattcgc aacaagcccg 5520 atcatctcaa ttatggccga agctggctgt ctccccttcc actttgcgat aatccaacgc 5580 ttaactcaac tagcagtccg actacttgaa aaaactgaag atgtagccgg gcttccctta 5640 attcaaaggg tcagaacact gctgatggac ctaactggat ggtgccttcc cgatatctgt 5700 aaaacgttga gaaccgcaga cagagcatgg cactcgccat ccccaactat tgatctagtc 5760 ttaaacgtaa tatcaaggcc ggtgaaaaca agatcactgc actgcctaag ttcaatgatt 5820 ttatcacgac tcgctacagt agacatgama aattgtatac ggatggctcc aaagataacg 5880 gacatatcgg cgccggtgta gtcacgggta gtagcaatct gagcttccgc ctcccagatt 5940 catgcggaat attttctgcc gaagccttcg ctttaaaaat agctgtctct aatgtacaaa 6000 gggcaaaaac gattattttg actgattcgg cgagctgcat tgacgcgcta ggtggaggtc 6060 gatccaaaca cccatggatc caatccatcg aggagatggt tgtgggaaaa gacatcactt 6120 ttagctggat tcctggacac gccggcattg ctggcaatga gaaagcggac gagctagcga 6180 agcaaggaag gactctaacg cctgtggaca ttcccatacc agcccaagac gcgattcgta 6240 ctatgaaaag taaaatatgg gaagtttggg agttggaatg gcatcgaagt caagtgcatc 6300 tacgtgaaat caaaccaaca cccactaggt atccagaccg gaagtctcca tgcgaacaac 6360 gagttctgtc acgaatgcgg atagggcaca cacgcatcac ccacgctcat ctgatgagtt 6420 ccaacccccc accaacatgc aatttctgcg gagtgcatat aacagtacgt catcttctgg 6480 ttgactgcaa aggattggag ccaagtagaa aacgctgtgg tattaccgga tcgctcaatg 6540 aaatattagc ctacaacaaa gacagagagg aagcagtaat acgattcttg atagactgta 6600 atttgtttaa agaaatctaa taactcatct atagtagcaa tgtaactaac tactacaaat 6660 aattaatctg acacgaatgc cacgctggtg gtaaagtgtc acaaataaac taataataat 6720 aataata 6727 // ID Gypsy-16_CQ-LTR repbase; DNA; INV; 190 BP. XX AC AAWU01025728; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_CQ_; KW Gypsy-16_CQ-I; Gypsy-16_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 412-412 (2011). XX DR Genome; AAWU01025728; Positions 29978 30167. XX SQ Sequence 190 BP; 62 A; 45 C; 35 G; 48 T; 0 other; tgttgtatta tgcagatccc tgaatctggc aacacggtta ggaaacacac caacacactc 60 acacactgac acactcgtag agcacagacg ccgttcgggc agttgataaa agtacattgt 120 cgagttcaga ccgctgttaa taaacgctaa cttaatcgta ctttttaagt gttttaataa 180 caccagaaca 190 // ID Gypsy-260_AA-I repbase; DNA; INV; 8397 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-260_AA_; KW Gypsy-260_AA-LTR; Gypsy-260_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-8397 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1121-1121 (2011). XX DR [1] (Consensus) XX CC Positions [5263-5742] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1001..2884 FT /product="Gypsy-260_AA-I_2p" FT /translation="MEKLPYAEHLNQEEVDYELLIRGELGEENSNLDLASK FT QRHLRALFKSDVKNMKNYPSPYNITEEYDYVDGRVSDLMNALKRGVEQRYI FT SRLLHYYYRAKRCLSEQPEENKLKRNLIRRIEGCMKEFKIEPPVSPVDQQM FT NSRSNEADKVDLAQALPIAMGKTSFEQMVVNQDQPIGTPQDTSKYTGTTPK FT AQGFTPGDAHLNKFANIEKVGEDQNAMQQKIDHLERMLQSVTKLLLERESQ FT DVQQERGQRVRPSIGNSRSSGLFTRPHSDSSGDEDLKNVHKSRRHYAVARD FT QQRVFVNRQRFQDPYDSEEERSTGSSVGGRRSTGWRSYRHERRRDDGIRDH FT INRVEKWKLRFSGDSRSVTVENFLYKLKKIAEREGVSEHQLLRDIHLLLEG FT PASDWFFTFVDELEDWQTFERLIKYRFGNPNQDQGIRQKIHDRKQLRGESF FT IAFVTEIEKLNRMLSKPLSNTRKFEVIWDNMRQHYRSKISIVDVQDLQQLV FT KLNHRIDAADPTLHQLGEPRRTINAVQAEESDYNSDESAMVNSMKPRQIKQ FT NRPPPLQNQFQQTRDSNQPGADNTEPTTQRSCWNCQQQGHNWRECRRPKVI FT FCYGCGNLGRTVRCCERCSAMNRPSSSQGN" FT CDS 2953..6138 FT /product="Gypsy-260_AA-I_1p" FT /translation="MVADFDPLRRIHHIRVSVTKCPHIRVKVFETEIDALL FT DSGAGISVINSQNLAEQYGLKILPAAVRVSTADGTEYRCLGYLNIPFTYKG FT ITKVVPTIIVPQISRSLILGADFWESFGIQPMIDVGNGPEKIETLQASDGS FT HIVFNIEPTAHLPKVEEKEPDDTLDIPAFDVPPEPVPEDIETEHELSENDR FT RKLIETIKLFDFTAPGKLGRTDLIQHEIILKEDAKPRNQPVYKCSPFIQKE FT IDAEIERFKELDAIEECYSEWTNPLVPVRKSNGKIRVCLDSRRINAMTVKD FT AYPMQNMQDIFHRLGHAKFYSIIDLKDAYFQIPLKKECRNYTAFRTSKGLF FT RFKVCPFGLTNAPFTMCRLMNKVIGFDLQPFVFVYLDDIVIATETLEEHLR FT LLAIVAERLRRAKLTISLEKSRFCRKKVMYLGYLLTERGVSIDQSRIQPIL FT DYARPKSVKDIRRLMGLAGFYQRFIKDYSRITAAITDLLKKEKKKFTWTEA FT AEASFNELKSVLTSAPILANPDFTQPFVIESDASENAVGAALVQNQGGETR FT VVGYFSKKLSSTQRKYSAVEKECLGVLLAIDNFRHYVEGTRFKVITDARSL FT LWLFKIGAESGNSKLLRWALRIQSYDIELEYRKGKNNITADCLSRSIDTVM FT VVQPDPDYEELAANILADPIKHGEYRVIDGQIWKYVKGSSRQKDPRFLWKR FT FPIETERASIVQQEHEKAHFGFEKTLAAVKERFFWPRMAQQIRKVCRECLT FT CQTSKSGNRNVTPPMGSQKPVEYPWQFVTLDFIGPLPASGKKQSTCLLVAT FT DVFSKFVLVQPFREAKAEPLVDFVENMIFRLFGVPEIILTDNGTQFVSKAF FT KTLLQEYNVNHWLTPAYHPQVNNTERVNRVITTAIRATLKKEHKHWADDIQ FT AIANAIRTAVHGSTKYSPYFVVFGRNQVSDGREYSQMRDVTQASTMDETEI FT SASRKKLFEEVKANLSAAYRRHEKSYNLRSNAGCPRYVAGERVLKKTFDLS FT DKGKGFCKKLAPKFEPCVIRKVLGSHTYELEDNAGKRIGVFYADQLKKLNA FT SSHPN" XX SQ Sequence 8397 BP; 2627 A; 1668 C; 1840 G; 2254 T; 8 other; aatagccgta ccgtttggcc gcgcgtgagc gccggtagca ggctttaagt tttgaatttg 60 agtcgctcat cgaaatgaac accgcaccgc acaagaatac cgcacgatat tagatttata 120 gttagctagc gaaagaggga atgaactaga atgaatagta gatagatagg cctataagca 180 aatgaactcg atgtcgataa agaagaagtg taaatacaaa tgttgaaatg gaaagttgaa 240 ttagttgaat aaatgggagt ttgggttaca cgatagatca acagtttttg aacagcattt 300 cgtatgagag aaatctgaag cggtaaatgt agaagtattt tcacacttcg gtcggtagtc 360 tcttttgtag tattttctgg gaaggttggt cctgaatcca accaaggatt tttctctcat 420 cgaacgtggc ttgttcgaag tgtttactgg tgaagtctgc ccgtgatgat acccctcgcg 480 acaattgtct agttaccggc agagttcgct ttttcacaca caaattttct gattcgcacg 540 acctcacagt cgtgaggcat cttgtttccc taggtcagcc aactcgtttc ccgcccaata 600 actttgggaa aatctttaac cctcccctga agggtctcaa aaggtcgggc aacctccaaa 660 aacggttagt ggtctccaat cggagtggcg cataagccac tttaacttac atccgggaaa 720 ctcgtaaggc tgccgggtta catttggcgc ccaacagaat tggcaaaaca cattttcatt 780 tttttgattg tatataaata atttctggtg gaagtttacc tttaagttta gcattattaa 840 gcgttttact acatatttca ctactccaag taattttgtt tagtttgttc aaatattgaa 900 ctgattgaat ttgtattgaa catttggtac atcgtttctt accataaaag tattagaaga 960 aactcatata caaagcataa tagccagtta atccttaaaa atggaaaagc taccttacgc 1020 tgaacatttg aaccaagaag aggttgacta tgaacttctt attcgtggtg aactaggtga 1080 ggaaaatagt aatcttgatc tggcaagcaa gcaaaggcat ctaagggctc tatttaagtc 1140 cgatgtcaag aatatgaaaa attatccgtc gccatacaat attacggagg aatatgatta 1200 tgttgacggt cgtgtatcag atttgatgaa tgcactgaaa cgaggtgtag aacagagata 1260 catttctcgc ttacttcatt attactatcg tgcaaaacgt tgcctttcag aacaaccaga 1320 ggaaaacaaa ttaaaaagaa acttgattcg tcgaattgaa ggttgtatga aggagtttaa 1380 aatagagcca ccggtttcac cggttgacca gcaaatgaac agcaggtcga acgaggctga 1440 caaggtcgat ttagctcaag ctcttccaat cgccatgggt aaaacttcct ttgaacagat 1500 ggtagtaaat caggaccagc ctattggtac tccgcaggat acatccaaat atacgggaac 1560 cacgccaaaa gcccaagggt ttacgcctgg agacgcacat ttaaataaat ttgcaaatat 1620 tgaaaaggtt ggagaagatc agaacgcaat gcaacaaaag atagatcatc tagaacgaat 1680 gcttcagagc gtaacgaagc tcttgttgga gcgagagtcg caggacgtgc aacaggagag 1740 aggacagaga gttcgaccct cgataggtaa ttcacgcagt tctggtttgt tcacacggcc 1800 tcattccgat tcgagcggag acgaggatct gaaaaatgta cacaagagta gaagacatta 1860 tgctgtagct cgcgaccagc aacgagtatt tgtaaataga caaagatttc aggatcctta 1920 cgattctgag gaagaacgta gcactggatc gagtgtagga gggagaagaa gtactgggtg 1980 gagaagttat cgtcatgaac gtaggcgaga cgacggaatt agagatcata taaaccgagt 2040 agagaaatgg aagttgaggt tttcggggga ctcaaggtca gtcaccgttg agaattttct 2100 atacaaactg aagaaaatcg cagaacggga aggcgtttcc gagcaccaac tcttgaggga 2160 catacatctt cttctagaag gtccagcgtc cgattggttt tttaccttcg ttgacgagct 2220 ggaggactgg cagacgtttg agagactaat taagtatcgc tttggaaacc caaaccaaga 2280 tcagggaatt agacagaaaa tccatgatag gaaacagttg cgtggcgaat ccttcattgc 2340 attcgtaact gagatagaga agttgaaccg tatgctttcg aaacccttat cgaatacaag 2400 gaagtttgaa gtcatatggg ataatatgcg tcaacattat cggtcaaaga tatcgatagt 2460 cgacgttcag gaccttcaac aactggtaaa actaaaccat cgaattgacg cagcagatcc 2520 gactttacat cagttgggtg aaccaaggcg gacaataaat gcagttcaag cggaggagtc 2580 ggattacaac agtgacgaat ccgctatggt caatagtatg aagccacgtc agatcaagca 2640 gaaccgacct ccaccattac aaaatcaatt tcaacaaacc cgggattcaa atcaaccagg 2700 ggcagataac acggaaccaa caacacaacg atcttgttgg aattgccaac aacagggcca 2760 caactggaga gaatgtagaa gaccaaaagt gatcttttgc tacggctgtg ggaatttggg 2820 acgaacggtt cgttgctgtg agcgatgctc ggcgatgaat cgtccttctt ctagtcaggg 2880 aaactaaaca agggaggtgg tagcggggat cgaaccaccc ctcattataa gcagattccc 2940 aaaccgacga ccatggtagc agattttgac cctctaagga gaatacacca tataagggtc 3000 agtgtaacaa aatgtccaca cattcgagtg aaagttttcg aaactgagat agacgcgcta 3060 ctggactcag gggccggaat tagcgtaatt aattcacaaa atttggcaga acaatatggg 3120 ttgaaaattc ttccagctgc agttcgagtc agcacagctg atggaacaga atataggtgc 3180 ttgggctatc tcaatattcc ttttacatat aaggggataa ctaaagtagt accaactatt 3240 atagttcccc aaatatcaag atcgctcatt ctaggagctg acttctggga aagctttggc 3300 attcaaccca tgatcgatgt cgggaatggt ccagagaaga ttgaaacatt acaggcatct 3360 gacggatccc acattgtgtt caatatcgaa cctacagcac acctaccgaa ggttgaagaa 3420 aaggaaccag acgacactct agatattcca gctttcgacg tacctccgga accagttccg 3480 gaagatatag aaacagagca tgaattgtca gaaaacgaca gacgaaagct aattgaaacg 3540 attaaattgt ttgattttac cgcacccgga aagctagggc ggactgatct cattcaacat 3600 gaaatcattc ttaaagaaga cgcaaagcca aggaaccaac cggtttacaa atgttcccct 3660 ttcattcaaa aggaaatcga cgcggaaatt gaacgcttta aagaactgga tgcaattgag 3720 gaatgttaca gcgagtggac aaatcctctt gtaccagtgc ggaagtccaa tggaaagatt 3780 agagtttgtt tagattcgag gcgaataaat gcgatgacgg taaaagatgc ataccccatg 3840 caaaatatgc aagatatttt tcacaggcta ggtcatgcaa aattttattc aatcattgac 3900 ctaaaagatg cgtattttca aataccattg aaaaaagagt gtagaaacta cacagcgttc 3960 cgtacatcaa aaggtctttt tcgttttaaa gtgtgtccat ttggactcac aaacgcgcca 4020 ttcacgatgt gccgtttaat gaacaaggtt attgggtttg atttacaacc ttttgttttc 4080 gtgtatctcg atgatatcgt gatcgctaca gaaactctcg aagaacacct ccgtttactt 4140 gccatagttg ctgaacgact tagaagagcc aaactaacca tctctctaga gaaatcacga 4200 ttctgtagaa agaaggttat gtatctaggt tacctcttga cggagagagg ggtttccatc 4260 gatcaatcga gaattcagcc aattttagat tatgcccgac caaagtctgt taaggacatt 4320 agaagactaa tgggtcttgc tggattctac cagagattca ttaaagatta cagcagaatc 4380 acagcagcaa taactgatct gctcaaaaaa gagaagaaaa agtttacgtg gacggaagcg 4440 gcagaggcat cttttaatga attaaaatct gtactgactt cagcacccat cttggcgaat 4500 ccagatttca cacaaccatt cgtaattgaa tcggacgcct cggaaaacgc tgtgggtgcc 4560 gccttagttc agaatcaggg tggggagact agggtagtag ggtacttcag caaaaagctt 4620 agttccactc aaaggaaata ctcggccgtg gagaaggaat gcttaggcgt tctgttagcc 4680 atcgataact tccggcatta tgtggaaggg acaagattta aagtcataac agatgcaaga 4740 agcttgttgt ggttatttaa aataggagcg gaatcgggaa attcgaagct gctacgctgg 4800 gcgctccgaa ttcaatctta tgatattgaa ctcgaatatc gtaaaggtaa aaataatata 4860 acggcggact gtctttcaag gtccattgac acagtaatgg tagttcaacc cgacccagat 4920 tacgaagaac ttgcagcaaa tattttggca gaccccatca agcacggaga atatcgcgtg 4980 atagatggac agatttggaa atacgtcaag ggttcaagcc gccagaaaga cccaagattt 5040 ctatggaaac gttttcccat agaaacggaa agagcatcta tcgttcaaca agaacatgag 5100 aaggcccact tcgggttcga gaaaactctt gctgcggtta aagagcgatt cttttggcca 5160 agaatggccc aacaaatacg aaaggtttgc agggaatgcc taacatgtca gacgagcaaa 5220 tcagggaata gaaatgtaac tcctcctatg ggttcgcaga aaccagtgga atatccctgg 5280 cagtttgtca cactggactt catcggacca ctaccagcgt caggaaagaa gcaaagcaca 5340 tgcttattgg ttgcgacaga cgttttcagt aagtttgtac tggtccagcc ttttcgagag 5400 gccaaagcgg aaccattagt cgatttcgtt gagaatatga tattcagatt atttggagtc 5460 cctgagatca tactaacgga caatggtact caatttgtat cgaaagcgtt taaaacccta 5520 ctgcaggaat acaatgttaa ccattggtta acccccgcgt accatccgca agtaaataac 5580 acggaacgag tgaatagagt tataacaacg gcaatcagag cgacccttaa gaaagaacat 5640 aagcattggg ccgacgacat ccaagcaatt gcgaatgcta tccgtacggc ggtccatggc 5700 tccaccaaat atagtcctta tttcgtcgta ttcggacgaa atcaagtttc cgacggaaga 5760 gaatactctc aaatgagaga cgtaacacaa gcgagcacaa tggatgagac agaaatatct 5820 gcttcaagga aaaaactgtt tgaggaagta aaggcgaacc tatcggcagc atatcgaaga 5880 catgaaaaga gctacaacct tcgttcgaat gctggctgtc cacgttatgt tgctggggaa 5940 agagtcctca agaaaacttt tgacctttct gataagggca aaggtttctg taagaaacta 6000 gcccctaaat ttgaaccttg cgtgattagg aaagtactgg gttcccacac ctacgaacta 6060 gaagacaatg cagggaaacg gattggggtt ttctatgcag atcagctcaa aaagctaaat 6120 gcatcgtctc atccaaacta ataaaactaa tttaaagcta tgtgcctctt tatgagcaac 6180 aatgattacg atgagtacga aatccaaata aaatacctac ggggtaaata atttttcgat 6240 taaaagtcga attcgaccag ggaaaatgga agggtcagtt tgctagcaaa acmttgtctg 6300 cttgtgatga aacccgccaw caacaccwca cttcacaaag caaagaatga gccggacgat 6360 gacctgtcca aagtagtgga tcaccacgac tgcataaagc gagtaccggt aatgatttct 6420 taaaataaag tagtgaagta ctagaaggga agagcaagct gctaacataa cgacctgccc 6480 aagagtactt gagtagaatc ttcaaaatcg acaaaatttt ccaaaattga tcaagctatg 6540 tacaatctct attgagtgac aatgaggaga agtcccctaa aaacaccaac tacgggtaga 6600 cgcaattttg tttgattgaa taggacgtga agtgattcat tctatgacga atcccttgta 6660 aatatctctt tgtaaatagt tgaataggtt agttagttag taataagtcc tacacttcgt 6720 cctgaattat aactagatta gtacttagat taggttagta ttgaaaaatt gaaattactc 6780 acgattttcc atgaattttt cccattttgt tccctttttc gaatgaattc ttagtttcct 6840 acgttttttt ccttttcgtc tgtccttcga tttgtttagt acctaaaagt cgaaaggtgg 6900 cattaataat ttgcaaagta aagtgaattt tgaaaattag aacaataatt tacctttttt 6960 cctttagttg gtaggctcct cgaatagttg aactcgtttt cgaatttgaa gatccctttc 7020 ctggcagtgt cgataagttc tttgcagccc acacgatcca gtccaattgc agtccttctt 7080 taccgatcca tatagtttac tggtttattt ttagtcctat ccgtccatcc ttccatctcc 7140 atctttgttt acctttcsgt ccatcagagc agttttcctt cccgtgcaat tcgatggaat 7200 ttccggagcc ggtatggggt cgcgctcatt ccatttctgg gcaattcacg tccaattgca 7260 ttccaatccc cktcagatac tttccgtcac agcaattctt gttggaatcc tcagcaacac 7320 ctcccagacg gcgactgcac cgcccgtagg ttgcggcagc tgcgaacaaa acaaaaacac 7380 acaaaaagct ctcttcaaac tcaacaccat caatcgaacg acaaattatc acttccctct 7440 ggttctggca cagcactaaa acaccttttc catcaatttt cactgttttc cgaccgtttt 7500 cactactttt ttacaattcc gattccgcga cgtgcttagt tttccgcgac gaactgtcaa 7560 actttgtttt ggttctcgca tagggttcat gatgttggca agagtgagtg actatgaatg 7620 aacgaaatga gtgggtggat gaatgtttcg cgcatgcgag gggtgttcga atggaaaatt 7680 ttgtgttgtg tgtttaaaga acaatttcgt tgtcggtgtt cacactgacg gtcagaaatg 7740 ttcggatatt tacgttcttt gagtttcggc tacatggata gaagtttccg gattaatttc 7800 cgggtgatta tctgtgagtc gaagcagacc cccacattta agatggagac tatcagatcc 7860 gctattctcg ggaagatcct gaggttggat ttsgatattc ttcgtcagtt tattttttca 7920 ctacggtgat agtwtaccga gaactctcac ccgttgctta gttgatatac tgaggttggt 7980 gcagacaaga atttttatat gtaagaatgc gttgccaaca atctgaatga cagtttattt 8040 tttcacgttt awttgcctga tcaagatcaa aactccatag ccagtcattt ttgtacatag 8100 acgcccttct ttgttgcttt gtatataaat agtttataag gtctgattga tttttcattt 8160 actaattaat tcctgctgga gaccggagtt gaaaaaacgc tgatggtcag tagcagacgt 8220 gattggttga tcatgcgctg taaattgagg gagagtctgt taaggcgatg aaaccatatg 8280 accccaggat ttaaaagtga atacaaaatc aatcgaatga ccttcgaaaa tttgattgat 8340 cttcgaaatt ccgttttcga ttctcaatca aattttcgaa aatttagtaa ggaatga 8397 // ID Tc1-1_TCa repbase; DNA; INV; 1630 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 21-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1630 RA Jurka J.; RT "Mariner/Tc elements from insects."; RL Repbase Reports 9(3), 677-677 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 304..1317 FT /product="Tc1-1_TCa_1p" FT /translation="MPTGTPIGKELKERVIDAFLNNEKQADIARRFQLPRY FT SVSKIIIRYNERGHLNNNPKSGRPRKTTINMDRRIKRISENDPWKSASKII FT AEIPEVGVSTRTIQRRLVAAKLFSRRPAKKPLISERNRRARLQFAREHLNW FT TVQDWRKVLFSDETRYKLFNSDGMKRVRRPINTRFNPKYVTPTVKHGHGSV FT FLWGCFSWNGVGPLSFIEGNMDRFIYRDILQDVMLPYAEWEMPLRFIFQQD FT NDPKHTSALVSEWFQNNNIHVLQWPAQSPDLNPIENLWEEVERRIRIQRFP FT NKQSLIDKIKEVWSNLDENVIQKLISSMPRRCQKIILNNGYAINY*" XX SQ Sequence 1630 BP; 583 A; 256 C; 304 G; 487 T; 0 other; cactatcgga caaaagtaag gcaacatttt gaaaaatata gaaattaatg tctatctaat 60 gtaaataaat tatgatcggg caaactatgg tattattaaa agaatcaaag gaagctactg 120 tcattaatgt caagtaatta tttattaagg ttgaaaaaaa ccgctttgtt gacttgtagt 180 gtaaaaaagg attattttaa gacgacaaaa gtaaggcaac atttagattt ttttcaaaaa 240 atgttgttat aatgaaaaat ttgttaggcc aatattaaca gttcgtcttg tgcaacaaga 300 gccatgccaa cgggtacacc aattggtaaa gagttgaaag aaagagtgat agatgctttc 360 ttaaacaatg aaaaacaagc ggatattgca agaagatttc aactgcctag atacagcgtg 420 tctaaaataa ttattcgcta caacgaaaga ggacatttaa acaataatcc aaaatcggga 480 agaccacgaa aaaccactat aaacatggac agacgcatta aaagaatttc ggaaaacgat 540 ccatggaaat ctgcatctaa aattatagcg gaaataccgg aagtaggagt tagtacacga 600 acaattcaaa gaagactggt tgcagcaaag ctttttagta gacgtcctgc taaaaaacct 660 ttgatttccg aaagaaatag acgcgccaga cttcaatttg caagagaaca tttgaattgg 720 acagtgcaag attggaggaa agttctgttt agtgacgaaa ctcgatacaa attatttaat 780 tcggatggaa tgaaacgtgt gagacggcca atcaatacac gattcaaccc caagtatgtt 840 acacccactg ttaaacatgg tcatggtagc gtttttctgt gggggtgttt ttcgtggaat 900 ggtgtgggtc cgttgagctt tattgagggt aatatggatc ggttcatcta tagggacata 960 ctccaagatg taatgctgcc gtatgcagaa tgggaaatgc ctcttcgttt tatctttcag 1020 caagataacg accccaaaca cacttctgca ttggtcagtg agtggttcca aaataataat 1080 attcatgttt tgcaatggcc tgcccaatca ccagacctca acccaattga gaacttatgg 1140 gaagaggtag agcgacgaat ccggatacaa agatttccca acaagcaatc tttgatagat 1200 aaaatcaagg aagtgtggtc aaatctggat gaaaatgtta ttcaaaaatt aatttcttca 1260 atgcccagac ggtgccaaaa aattatactt aataacggct atgccataaa ttattgaatt 1320 tttcgttcta tttatttaaa ctttagtgtt gggtagtgtt atgtaaatgt tgaatcaaat 1380 aaaatttgag ttgaaaatat tttctgacaa tgttgcctta cttttgtcgc attaatatcg 1440 ggctttttta tatttcaagc tagtagtatg tcataataat tacccacagg ctatccattt 1500 tcaacacgaa taagaatatc tttaaacatt tattataaaa ttaaattgaa ggtaaatacc 1560 aactcttaaa atttaaaata ccaatatttg tatatttttc aacatgttgc cttacttttg 1620 tccgatagtg 1630 // ID TAS_LTR repbase; DNA; INV; 252 BP. XX AC Z29712; XX DT 13-MAR-1998 (Rel. 3.02, Created) DT 13-MAR-1998 (Rel. 3.02, Last updated, Version 1) XX DE Long terminal repeat from retrovirus-like element TAS. XX KW BEL; LTR Retrotransposon; Transposable Element; LTR; TAS; TAS_I; KW TAS_LTR; endogenous retrovirus; env; gag; pol. XX OS Ascaris lumbricoides OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-252 RA Felder H., Herzceg A., de Chastonay Y., Aeby P., Tobler H. RA and Muller F.; RT "Tas, a retrotransposon from the parasitic nematode Ascaris RT lumbricoides."; RL Gene 149(2), 219-225 (1994). XX RN [2] RP 1-252 RA Aeby P., Spicher A., de Chastonay Y., Muller F. and Tobler H.; RT "Structure and genomic organization of proretrovirus-like RT elements partially eliminated from the somatic genome of Ascaris RT lumbricoides."; RL EMBO J 5(12), 3353-3360 (1986). XX RN [3] RP 1-252 RA Heinz H.F.; RT "TAS_LTR."; RL Direct Submission to Genbank (04-JAN-1994)Felder H. F. Heinz, RL Institute of Zoology, University of Fribourg, Perolles, Fribourg, RL CH-1700, Switzerland. XX DR GenBank; Z29712; Positions 1 252. XX CC TAS internal sequence is reported in Repbase as TAS_I. XX SQ Sequence 252 BP; 66 A; 57 C; 44 G; 85 T; 0 other; tgtcgcgtca aatggcgaaa cattcaaaac aaaattccag cgaaatgatc aagaagattt 60 cgctgagatt ttgtgagaaa ggaaacataa aatcctcact tgtgagagga ttttatgtct 120 cctcctttgt tatcattctt tgtaaacgta tttaactctc gtcgtctctt agtctacgcc 180 tctttcaaaa ggtgcttagg ggacctccct cttcttcttt cccacatatt tttgccgcgt 240 ttttacgcga ca 252 // ID MHSAT1 repbase; DNA; INV; 169 BP. XX AC L07109; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Meloidogyne hapla satellite repeat. XX KW SAT; Satellite; Simple Repeat; MHSAT1; satellite repeat; KW Repetitive element. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-169 RA Piotte C., Castagnone-Sereno P., Bongiovanni M., Dalmasso A. RA and Abad P.; RT "Cloning and Characterization of two satellite DNAs in the low RT C-value genome of the nematode Meloidogyne spp."; RL Unpublished (1992). XX DR GenBank; L07109; Positions 1 169. XX SQ Sequence 169 BP; 58 A; 28 C; 26 G; 57 T; 0 other; cttggaatta gaaataattg tttatatgag ttccttgtaa agcaacctct agatgccacc 60 agatataaaa aattgttgca cagtcaaaac tggaaatggt cttcattatt gaatatctta 120 cagaatgctg tatctatcct gaaaaaatcc tgtgtcgaac ttaattttc 169 // ID Gypsy-5_CQ-I repbase; DNA; INV; 3758 BP. XX AC AAWU01034000; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_CQ_; KW Gypsy-5_CQ-LTR; Gypsy-5_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3758 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 389-389 (2011). XX DR Genome; AAWU01034000; Positions 49675 53432. XX CC Positions [2864-3376] - Integrase core CC 'AGAC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1496..3742 FT /product="Gypsy-5_CQ-I_2p" FT /translation="MIRAYHQIPVNEEDIEKTAVITPFGLYEFPRMQFGLC FT NAGQTFQRFMHKVLGDLDFVVVYMDDICIASATAAEHKQHVRKVFERLRDF FT GLVINMAKSKFAREQVEFLGYVVSHEGVLPLPDRVKAVSDFPEPATVKDLR FT RFLALLNSYKRFIPHATDVQLALRNLIPGNKKNDTRKLVWTEEARSAFREC FT KRSLSEATLLHYPDSRKPLALMVDASNTAAGAALQQLDGATWKPLGFFSEK FT FSRTQQEYSTFGRELLAAKRAVEYFRYMVEGRKFVVFTDHKPLTFAMASNS FT NSRLPHEQRYLKYISCFTTDIRHISGRKNVVADALSRVATISMPCPIDYEQ FT MAVDQAVDNELQQLLTSGTSLKLVLKQTSLATKPLYCDVSVDGRIRPFVPA FT QHRNSVLQFLHSIAHPGIRGTRRLLSDRFVWSSMNKDVRSFVKCCVDCQKS FT KIHRHTRAPFESFNLPKARFRHIHVDLVGPLPPSNGQRYLLTIVDRFTRWP FT EAVPLADMTAETVAWAICSSWIARFGVPEQITTDQGRQFESQLFRELAMLT FT GSKHTRTTAYHPQANGLVERFHRTLKAAIMAVDSAHWVDRLPIILLGLRSA FT LREDLGCSVADLVYGQPLRLPGEYFEPATTGTLQSDFVKQLQRAVGQLKPQ FT KVMHHAKPLVFVQDELSKCSHVFVRIDAVKRPLQRPYDGPFEVVERHEKFL FT DLLINGKRQRISLDRLKPAFLCDENVANHPLDDPSTKVTPSGHRIRFLV" XX SQ Sequence 3758 BP; 900 A; 990 C; 1009 G; 859 T; 0 other; ctggtgaccc cgacgactaa agttaaaact gaaggcaaaa tgccagaggc cgaaacaaac 60 acggatgcga acgaggaggc cgccggagcc gagaatgccg ccgtggccgc atcggtagct 120 gtgaggttac ccgatttctg gcgcaacgac cccgccatgt ggtttgcaca ggctgaggcc 180 caattcgccc tggccggcgt tgttcgggac cacacgaagt ataaccacat agtggccaaa 240 gtggaccagc agacaatttg ccacatagcg gacttggtca ctgaccctcc tctaaccaac 300 aagtatcccg caatcaaaag tcggctcatt tcccgtttcg tgatgtcacc tcaaggcaaa 360 ctcgaacaac ttctcggttc gtgtgacctc ggtgacatgc gtcccacaca cctgctggcc 420 aaaatgcagg agctgtcaat gggattgaac gtcaaccccg aattgctgaa gatgctgttt 480 ctccagagga tgccggcgaa cgtcaaagcg atacttgcca tcagcgacgg gagtttgacc 540 aagttagcgg agatggcgga caagatgctg gagtcggcgc tgaacgttgc cgctgttgct 600 gcgaatcccg gtccaagcag acaagccgta tcaacagctg cgcactcgga ggatatgtca 660 acagacgatt tgcgcgaaca agttgctttc ctgacggcgg aggttcggag gatgcggaca 720 agatcaactt cgaggggccg atctgtatcg aggtccgggc gttctgcgga ggagatctgt 780 tggtaccaca agaagtatgg tactcgggcc acaaggtgcc gagagccgtg tcagttttca 840 aaaaacatag tgatcgtcca cgtgtttcgg cggaggtggg cggcctatcg caaagtcgcc 900 gtcttctgat ctacgatcgt tcgagcaaaa caaaatattt gatcgacatc cagcttcacg 960 ccgcaaatgg gagcaacatc aaggtctacg gctcccgctt cgtaaacatc gaccttggcc 1020 ttcgaaggaa gttctgctgg aatttcctga ttgcgaatgt ggggatggca ataattggcg 1080 ctgacttcct cgcaaacttc ggtcttctcg tcgatttgaa gaaccatcgc ctcaccgacg 1140 gaaaaacagg actgcagtca tcggccggat tgacatctgc ggctgttttc ggcgtgacta 1200 cggttgggtt cgaccatccg ttccgagacc ttctgcagga attccgggtt atcaccgtcc 1260 cgaactcgat gcaaacagct gcaaagcacg acgtcaagca tttcatcgaa acaaaaggtc 1320 cccccgttgc attcaaggtt cgtcgcttgg caccggacaa ggtcgacgca gcaagagcgg 1380 agttcaaggt tatcgcgcgc tcaataaggt gactgtccca gacaggtatc ctgttcctca 1440 tatacacgac cttttgtatg cgttccaggg aaagtccatt tttacaacgt tggacatgat 1500 tcgcgcctac caccaaatcc ctgtgaacga ggaggatata gagaaaacgg ccgtaatcac 1560 accgttcggc ctttacgagt ttcccaggat gcagttcggc ctatgcaacg cggggcagac 1620 ctttcagcgg ttcatgcata aggtgctcgg cgacctggac ttcgtcgtcg tttacatgga 1680 cgatatctgt attgcatctg ccaccgcggc ggagcacaag cagcacgtgc ggaaggtttt 1740 cgagcgccta cgagatttcg gattggtcat taacatggcc aaaagcaagt tcgctcggga 1800 acaagttgag tttctcggct acgtcgtcag ccacgaaggt gttctgccgc ttccggatcg 1860 tgtgaaagct gtgagcgatt tcccggaacc agcgacggtg aaagatctgc ggcggttttt 1920 agcgctcttg aacagctaca agcgattcat cccacatgct accgatgtcc agcttgcgtt 1980 gcgaaacctc atccccggga acaagaaaaa cgatacccga aagctcgttt ggacggagga 2040 agcccgatca gccttccggg agtgtaagcg gtctctttcg gaagccacgt tgctgcacta 2100 tccggattcc agaaaaccgc tggcattaat ggtcgacgct tctaacacgg cggcgggggc 2160 agctctgcaa cagctggatg gcgcaacctg gaaacccttg ggtttcttct cagagaaatt 2220 ttctcgaact caacaggaat actcaacgtt tggccgggag ttgctagcag ctaagagagc 2280 ggtcgagtat tttcgataca tggtcgaagg caggaagttc gtggttttca ccgatcacaa 2340 gccgttgaca ttcgcgatgg cctcgaattc caacagtcgc cttccacacg agcagcgcta 2400 tctgaagtac atttcgtgct ttacgaccga cattcgacac atcagcggta ggaagaacgt 2460 tgtggctgat gcactgtcgc gggttgcgac aatttcgatg ccttgtccga ttgactacga 2520 gcagatggcg gtggaccagg ctgtcgacaa cgaactgcag cagctgttga cttcgggcac 2580 gtcgttgaag ctggtgctga aacaaacctc gcttgctacc aagccactct actgtgacgt 2640 ctccgtggac ggtagaattc gaccgtttgt ccctgcgcag catcgtaact ccgttctaca 2700 gttcctccat tcgatagcac atccgggaat tcgaggaact cgtcgtctgt tatctgaccg 2760 attcgtttgg agttctatga acaaggatgt gaggagcttt gtgaagtgct gtgttgattg 2820 tcaaaagtcc aaaattcacc ggcacacccg agcacccttt gaatcgttca atcttcccaa 2880 agcacgtttc cgacacatac acgtggactt agtcggaccc ttgcccccat cgaacggtca 2940 acggtacctg ctcaccatcg tcgacaggtt cacacgatgg cccgaagctg ttccacttgc 3000 cgacatgacc gccgaaaccg tcgcgtgggc tatttgctcg tcttggatag cacgatttgg 3060 ggtgccggag cagatcacaa cggaccaagg gagacaattc gagtcgcagt tgttccgtga 3120 actggcgatg ctaacgggtt ccaagcacac gcggacgacg gcatatcatc cccaggccaa 3180 cggattggtg gaacggttcc acaggacgct aaaggctgca atcatggcag ttgattctgc 3240 tcactgggtg gaccggctgc ccatcattct gctgggactg cgttctgcgc ttcgcgagga 3300 cctcggctgt tcggtcgccg acctagtcta cggccagcca ttgcgacttc ccggtgaata 3360 cttcgagcct gctacaaccg gaactctgca gtcggacttc gtgaagcagt tgcagcgcgc 3420 agttgggcaa ctgaagccgc agaaagtgat gcatcacgca aaaccattag tgtttgttca 3480 agatgagcta tctaaatgta gtcatgtttt tgttagaatt gacgcagtta aacgccccct 3540 tcagagacca tacgatggac cgtttgaagt tgtagaacga catgagaagt ttttagattt 3600 gttgattaac ggaaagcgac agcgcatttc tttggataga ttaaaaccag cttttttgtg 3660 tgacgaaaac gttgcaaatc atccactaga tgatcctagc accaaagtta ccccatcagg 3720 tcataggatt cgctttctgg tgtaactggg gggagtac 3758 // ID ORTE-4_AAe repbase; DNA; INV; 5276 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-LTR retrotransposon family encoding cysteine protease from DE Aedes aegypti. XX KW Non-LTR Retrotransposon; Transposable Element; ORTE; ORTE-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5276 RA Kojima K.K. and Jurka J.; RT "A lineage of non-LTR retrotransposons encoding an OTU cysteine RT protease from the yellow fever mosquito."; RL Repbase Reports 11(4), 1127-1127 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >95% CC identity. This family encodes OTU superfamily cysteine protease CC upstream of apurinic-like endonuclease. It is positioned at the CC sister lineage of the lineage including RTE and RTEX in CC RTclass1. XX FH Key Location/Qualifiers FT CDS 367..4785 FT /product="ORTE-4_AAe_1p" FT /note="OTU cysteine protease, endonuclease and FT reverse transcriptase." FT /translation="MDLNYSLLLFDDNSLARSLIDQIYNYPNNTRQFKDVN FT TFRIEIAKYVDQNRLVFKSIWGNKLETYINLXQTGEKCVEMEILQATAALV FT NSXINVFDNDGDKFIIKPIIPGNEINKELNIYANIGDESVXFSSMIHVQDS FT DITGLSGLVERYTVKHDVKMNRRVLIRNRTENDCIEEAIYHQLFWNDENES FT EIEKIMNDLYQIEIKDKSELGQSKLYWEQVVCTXKRTVILVTESVRNVEIK FT YTKNDAKKGNIILWLNKGKKTHYGSVLALDDNTNALENNTEQTKNVEKRKV FT NIIKTKDNNKXKRENXIYKKEFNSSKLLKIKRMLGDGNCLVWALLDQINKN FT KTNSLTLQDVKDTRNKIANYILHHRDEFEPFLVEGLEEGGFERFVEEIRTL FT GIHLGHEAMVAFGRMHNTSVEIYQEDEPVIRIENDDMGNNNVVRVWYCGDN FT NVKNHYDSVVEILDGKYVGMEGHKHTDNQEQTEKLVVGEETDGSDSRIRME FT KDKSIGDMERNLEKTSHCMQNQNVGKDENNFKFIRIATLNVRGCMKTEKRE FT EINDLLVEQNIDIAALQEVNVLGSIVETKNYKWFVKRDTNNKARGLAFLLR FT KGTGIELGEIRNIGSATISTAVKINKKTKLILVNVHGPNKKASNFFSTLSK FT EIDKEHIKRKLIILGDWNAQIGLEVLEEGDYKHVGRKLGFDQCNENGEEFK FT LFLQLHNLQNLSSKIGSNTEVTWKSGNRRSQIDHVVKALCSDVRIRFIQGF FT WTTISTDHLLMLMDVKLKDTTVKKRTNRDHKILDANALKYDVVRKKYHDEL FT MKFQHDEGLDLDQSYDKLTKIMKISASKALKSSKEPLTPKRKNALNRLNKA FT LSIYRKKPYMLTYKWKVDNRKEEYRRAVNQHEEKVIKSFYQNLNDFDVGVR FT IRKSYAFLKKYAKKKSQKKVQVTTKIWNEILKSSAGPEIALIEEQDFCPIT FT EPPTAEEMRRILQWSCNGKSPGADGIRMEHIKYADEETFQSLVQIWRRIWM FT ENKMPEEMTKSIQVPIPKVGNPKSVDDFRRISLCNAVYKPYAKWLRNRLKQ FT FTGEPELHQAAYTEGRSTDDHVFIAKRVMDEYWNAGKTLFVASVDIKKAFD FT NVDLNKLKDILMDLEVPTHVIDRVLLCVKEDRAMIRWQGQYSESVKRGKGI FT KQGCPLSPFLFNLVIQYILKRVAEKVSNLKLLEVGVFTLPIILAFADDILI FT LTEDQDELEKIIAALEDCLCEVGLEINSKKSQVLIRIPNATKPPPEKIMLN FT GKEYKTCKSLKYLGVTLTSDLNRPATTRQRSINAIKSAKNIVQFCKTFKPS FT WDIGKLIYKTVIAPAIQYGTKVSTLTKRSRLQLAKYEKLILKDIWNNCRVV FT KRKFNVRKEMSGKTINRRVRVARLNYYGHIIRRPNKHPLKSAFKFKFCFKK FT EGRPSFTWKDTLQQDFSRYRDMTNSKWKELATDREKLKKKAEEIYKDSNSE FT ISDGEDSE" XX SQ Sequence 5276 BP; 2031 A; 702 C; 1136 G; 1394 T; 13 other; gggtatggtg aggggtatgg tgtggtgaca aaactgtgca tagaaaaaaa aagtggaaaa 60 cttgaatgat aattcctgta agtatcgaag atggattgct gtatgctgtg tataaatata 120 tagagaattt gtctcaatgc acaaaagttg aamagtwkma tgaattcata tttcttttta 180 aagacactag ggtaagttgm ttatttatat tttattccta tactcaatwa cattgggtca 240 tcwatattca ccatcattat ccattgatga tttcttttta ctgttttctt gatatttagc 300 gtaagtaaat caatgtatat ataattatga tattcttatc tcattatttt actatcaatt 360 accacgatgg atttaaatta ctcacttcta ctttttgacg ataattcact agcacgctct 420 ttaattgatc aaatatacaa ttacccaaat aatactcggc agtttaaaga tgtaaataca 480 ttcagaatag aaatagcgaa atatgttgat caaaacagat tggttttcaa aagtatttgg 540 ggtaataaat tggaaacata cattaattta akacaaacag gagagaaatg tgtagagatg 600 gagatattgc aagcaactgc agctttagtg aatagtawca ttaatgtttt cgataatgat 660 ggagataaat ttattataaa gccaattata cctggaaacg agattaataa ggaattgaac 720 atttatgcaa atattggcga tgaaagtgta casttcagta gtatgatcca tgtgcaagat 780 tcagatataa cggggttatc aggtttggtc gaaagataca ctgtaaaaca tgatgtaaaa 840 atgaacaggc gagtattgat acgtaacagg acagaaaacg attgtatcga ggaggcaata 900 tatcatcaat tattttggaa tgatgaaaat gagtcggaaa tcgaaaagat tatgaatgat 960 ttgtatcaaa ttgaaatcaa agataaaagt gaactagggc aaagtaaatt atattgggag 1020 caagtagtat gtactttwaa aagaacggta atattagtaa cagaaagtgt gagaaatgtt 1080 gagataaaat acacgaaaaa tgatgcaaag aaaggcaaca taattctatg gttaaataag 1140 ggtaaaaaaa cacattatgg tagtgtattg gctttagatg ataatactaa tgctttggag 1200 aataacacgg aacaaaccaa aaatgtagag aaaagaaagg ttaatatcat caaaacaaag 1260 gacaataaca agwttaaaag agaaaatktc atatacaaga aagaatttaa ttcaagtaaa 1320 ttactcaaaa taaagagaat gttaggcgac gggaattgtt tggtatgggc cttacttgat 1380 caaatcaaca agaataaaac taattcatta acacttcaag acgtaaaaga cactagaaat 1440 aaaatcgcga attatatttt gcatcacaga gacgaatttg aaccattttt ggtagaaggg 1500 ctagaggaag gaggttttga acgttttgtg gaggaaatcc gaacactagg cattcactta 1560 ggccatgaag ccatggttgc ctttggcaga atgcataaca catcggtaga aatctatcag 1620 gaggatgaac cagttataag aatagaaaat gatgacatgg gtaataataa tgttgtgcga 1680 gtttggtatt gtggtgacaa taacgttaaa aatcattatg atagtgtggt agagattttg 1740 gacggaaaat atgtaggaat ggaaggacat aagcatactg ataatcaaga gcaaaccgaa 1800 aaattggttg taggggaaga aactgatgga agtgacagtc gtattcgaat ggaaaaggac 1860 aagagcatag gcgatatgga aaggaattta gaaaaaactt ctcattgtat gcaaaatcaa 1920 aatgtaggaa aagatgagaa caactttaaa ttcattagaa ttgccacact gaatgtgaga 1980 ggttgcatga agacagaaaa acgagaagaa ataaacgatt tgttggtgga gcaaaacatt 2040 gatatagcag cattgcaaga ggttaatgtt ttaggtagta tagttgagac aaagaactat 2100 aaatggtttg taaagaggga cacaaacaac aaagctagag ggctagcgtt tctgctcaga 2160 aaaggtacgg gtatagaact aggagaaatt aggaacattg gaagcgctac catatcgact 2220 gctgttaaaa tcaacaagaa aacaaagctt attctagtga atgttcatgg tccaaataaa 2280 aaagcatcca actttttctc gacgttgagt aaagaaatcg acaaagaaca tataaaaagg 2340 aaactaatta ttttaggaga ctggaatgct caaatcggac tggaggtgtt agaagaagga 2400 gattataaac atgtgggcag aaagctagga ttcgaccaat gcaatgaaaa tggcgaggaa 2460 tttaagctat ttttacaact acacaatctt caaaacttat cttcaaaaat tggaagtaac 2520 acagaggtta cttggaagag tggtaacagg agaagtcaaa ttgaccatgt tgtcaaagct 2580 ctttgtagcg atgtaagaat aagatttatt caaggatttt ggactacgat aagtactgat 2640 catcttctta tgttgatgga tgtaaaattg aaagatacaa cagttaagaa aagaacaaat 2700 cgagatcata aaatcttaga tgcgaatgct cttaaatatg acgtggtaag gaaaaagtac 2760 cacgatgagc ttatgaagtt ccagcatgat gaaggattag atttagacca gagttatgac 2820 aaactaacga aaataatgaa aatatcagcg tcaaaagcat taaaatcgtc caaggaaccg 2880 ttgacgccca aaaggaaaaa tgcattgaat agattaaaca aagcattgtc aatttacagg 2940 aaaaagccat acatgttgac ctataaatgg aaagtagata acagaaaaga ggagtatcgg 3000 agagctgtta atcaacacga agaaaaagtc ataaaaagct tttatcagaa cctaaatgat 3060 tttgatgtcg gagtgaggat aagaaagtca tatgcgtttc taaagaagta tgcaaagaaa 3120 aaatcccaaa aaaaggttca agtgactacg aaaatatgga atgaaatttt aaaaagtagt 3180 gctggaccag aaatcgcgct tattgaagag caagactttt gtccgataac ggaaccgcca 3240 acagcagagg agatgagacg tatactacag tggtcttgta atgggaagtc accaggtgct 3300 gacggaataa gaatggagca tataaaatat gcagatgaag aaacgtttca aagtcttgta 3360 cagatatgga gaagaatttg gatggaaaac aaaatgcctg aagagatgac aaagtcgata 3420 caggtaccta ttcccaaagt gggcaatccg aaatcagtgg atgatttcag gagaataagc 3480 ttatgcaatg ctgtgtataa gccttatgca aaatggttgc ggaataggct aaagcagttt 3540 actggagaac cggaattgca tcaggcggcg tatactgaag gacgttcaac agatgatcac 3600 gtgtttattg caaaacgagt catggatgag tattggaatg cggggaaaac attgtttgta 3660 gcatcagttg atattaagaa agccttcgat aatgtagatc taaacaaatt gaaggatatt 3720 ttaatggatc tggaagtacc gactcatgtg attgatagag ttcttctctg tgtcaaggaa 3780 gatagagcta tgattagatg gcaagggcag tattcggaat cagttaagcg gggaaaaggt 3840 ataaaacaag ggtgtccatt atcgcccttt ttgttcaatc tcgtcataca atacatattg 3900 aagagagtag ctgagaaagt ctcaaaccta aaacttttgg aggttggagt gttcacctta 3960 ccaattatac tcgcttttgc agacgatatt cttatattga cggaggatca ggatgagttg 4020 gagaaaatca ttgcagcgtt agaggactgt ttgtgtgaag taggtctaga aataaatagt 4080 aagaaaagtc aggttcttat aagaataccg aatgccacga aacctccgcc tgaaaaaatt 4140 atgttgaacg gaaaggaata taagacgtgc aaatcgctta agtacttggg agtcacttta 4200 actagtgatt taaatcgacc agcaactacc agacaacgat caataaacgc gataaaatcg 4260 gctaagaaca tagtacaatt ttgcaaaacg ttcaaaccgt catgggatat tgggaaacta 4320 atatacaaaa ctgttattgc tccagctata caatacggta caaaagtatc aacacttaca 4380 aaacgtagta gattacaatt agccaaatat gagaaactaa ttttgaaaga tatatggaat 4440 aactgtagag tagtaaagag aaaattcaat gtgaggaaag agatgagcgg aaaaactatc 4500 aatagaagag ttagagtggc taggttgaat tactatgggc acattatcag gaggccaaat 4560 aagcatccac tgaaatctgc ttttaaattt aagttctgtt ttaaaaagga gggaagacct 4620 agtttcacat ggaaagatac gttacagcaa gatttttcaa gatatcgaga catgacaaat 4680 agcaagtgga aagaactagc aacggataga gaaaaactta aaaagaaagc agaagaaatt 4740 tataaggaca gcaatagtga aatttcggat ggagaagata gcgagtagat atgtgatgac 4800 agggaagcat tgttagaaaa gaagacgata tacaagaata aatatagagt aacatatgta 4860 cacaaataat tgaatgtgat ttaacttact gccaagaacc aaacagtaag atttgttatg 4920 atatacttct gaccacggat gatgaatctt aatacgaagc caacgaacat gggtacaaag 4980 aggtaagata ctttaataac actatatatg atatattact atatattggg tattggggag 5040 gggtagcata agggagttaa ttgaactgtt gactggatga aagtgcgtgg ggtcgccagc 5100 cttttgtcat gagaaaacag ccttagtgtt gtcttccaaa ggtcttcaaa gatattaact 5160 agccctgcta caacaaacca tcaggtagat cattccatcc cggtatgatt ctgcagccat 5220 aggtaatcct tgcgctgatg gtggctaaat aatcatcatc atcatcatca tcatca 5276 // ID RTEX-3_BF repbase; DNA; INV; 6814 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-3_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-5_BF; KW RTEX-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6814 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6814 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1719-1719 (2009). XX DR [2] (Consensus) XX CC The complete RTEX-3_BF consensus sequence contains two ORFs. The CC RTEX-3_BF ORF1 protein contains the esterase domain (529-681 aa). XX FH Key Location/Qualifiers FT CDS 179..2671 FT /product="RTEX-3_BF_1p" FT /note="Esterase domain." FT /translation="MGDHAEDHAKTHAKTPKRKTPGTKFWNVNLNQLPADE FT SLAFDYEDQDKSRFCGVKLRSKQLDKWITAHKDAFNETFERTKETTVTWKS FT TPGITCMTVQTTTATRKSGKGLLSVHFYHKKHGLMVQGQCTKWWMNILYPE FT LLVNINAQVDMQTRSTIKDGVLRVRDAIDNFPTVTLESTTPKQVFNSSVHE FT TPTLPKCANPRNLQDLQAPVSEDMEEEIQIVFDTSAKHSGNIEVPVSSDAE FT DQGPSRIEKRAKQVSDTKNGEQSELTTKAVDMPNLARLTAVEKSQRISDAA FT FGDLQKQFVDMTHNLNALRDELLTKEREREESQTNMLKELSTKVEAAQKSL FT ESHIKKVVNKLQTNICHVQKDLVSQREELAKEKNATVQEMTIERNIWMERN FT RGLSIKVDALSASNAELQQAITIQDSELQYLFDRLKQIEENRTCRGCTKPT FT HTDMQNLNNVHDNDIPGDKQDNNAATNQTIVKQIANMLPSAHDNATIATQS FT REVPSPATSVKESATNTSTGSQHQRYEVRIFADSIFRDVDVKRAFNGKSTR FT LVRTSTVTAAIDCIKQINDSTTQQVILHIGSNDLDNSKLRNDSVSTTLNNT FT RRLLEVTKASFPKAQVAVSQVLQRGLNDNSTLNINIKNYNQEVLNLSRSGN FT FKYIKHRKLTQDRHLYLPDQIHLDPNSGTKMLVSDVKRTLYPASGTGAVRQ FT SPAWTDTQAARLPASVNQRTPMRWQNGPSFPRPVPVPVTRSRKPPYQPPTV FT KQAHRNTAVGTGQAAPGLSKGANIGVPGQLGPTTRSGDTASGTGQVVRPAW FT WKAGKNGHPSPQWRQLGHRLRKVWNILIS" FT CDS 2963..6697 FT /product="RTEX-2_BF_2p" FT /note="AP endonuclease and RT domains." FT /translation="MEKKSALVFSSWNIQGSVKKKFSDVEFCNYFRNSDIV FT CIQETWADNYVDFQFPGYSLFSSNRCRNKRAKRNSGGVATIFKSSFRKGIT FT KLESSSVDSIWCKLDKKFFGLEKDLYLCNIYIPPESSKSIQVDPFDTLSID FT ICKFSNLGDIIILGDFNARTGSVLETYFSIDSPDDPLTKDVLDTKKRNNRD FT SKVNNYGKRLIELCSSADLTILNGRFAGDLKGDFTCYHYNGSSVVDYCITQ FT RSMLTDIQYMNVNSISPFSDHCHISFSLSAKHSPLVLKEDSQCYNPKPTQF FT IWTDESKQLYLDTINNKDTHSKLRDFCNKTFHSSEEAVTSFTEILLDVSKS FT SLRIRRKIQNNKKTQIKKNKIWFDQNCWSLRQRVKRLSLELRKEPWKQETR FT TKYFTALKEYKKIIKHSKRNYKANLLRDLETLNEKNPKQFWSLFNTLDKEV FT NGTKYNGNIDPNITSKEWIEHFQGLNNLVKNNEFDSSFEKSVTDSLKHLDK FT NAQNSLDYPFTTPEIMTGLKSLKSGKACGIDSISNEMLKYGAEKICEPLVI FT LFNTILSNNKFPSNWTTSILTPIFKSGDKSNVDNYRGIAISSCLSKLFTFL FT LNTRLQKFVESNCLLADTQFGFRRKCRTSDNVFILKALIEKYIEKKRGKLY FT VCFVDMKKAFDSVWRDGLFYKLLNLGVGGKFFNVLQSMYLNVNYTVRLQNG FT LSSPFVSTCGVRQGCNLSPLLFNIFINDLPQCFADKCDAAILNSKSLNCLS FT WADDLALISLSKEGLQNCINNLESYCNKWKLRVNVSKTKVLVFTKGSINRL FT SNKFYIYGKEILVTDSYTYLGIPLTSSGKFKAARKYLKTKAMRALFKLKSL FT LFSEKNIPIHLGMSLFNKFVLPILLYGAELTCFDQTSKAIKIIVSKQIPES FT SPKTVFSSFLNKLHLEGDLPATVRKSVSSDLTHTYVIHFKKRSDKDRLLRL FT ASGMKLENDNFAIENIRLPSHISIPEFDTIDMSFQKFLLGVHAKSSNDGIR FT GELGTFPISINAEIQLIKYWHRLANLPEDSLLREAYDVVLSGEYDWTNHVT FT DILNYNGFGYVWTNPRLYHVNMLTDQLRLRLQDIYIQEWHSSIQNNSKLSI FT YSTLKERYGQKKYLSNVHNFELRRAITKIRICSHKLNIEAGRYSKIPRDQR FT FCPFCPNEIEDERHFVMDCSRYNDKRTELFTLLSACSKDFATLCSIDKFNY FT ILGGDNPHCVQVGKYIHDCLYRRTNSVNDMLDPLC" XX SQ Sequence 6814 BP; 2283 A; 1415 C; 1325 G; 1791 T; 0 other; aagatggcgg acgagtacgg tcacgcaaca cgaggtctcc ggttttgatt ctatattgaa 60 ggtgttaacg ctacaggaag tgagttatcg gaactttata ctaattctat atatacatcg 120 catattctgt atatgatttg gagattttga agactgttag gagaacttta attgtaaaat 180 gggtgaccac gccgaggacc acgccaaaac ccacgccaag acgccgaaac gcaagacccc 240 gggtacgaag ttctggaatg tcaacctgaa ccaacttccc gcggacgagt ctcttgcctt 300 cgattacgaa gaccaagata aatcccgttt ctgtggggtt aaactgcggt ccaaacagct 360 ggacaaatgg atcacagccc acaaggacgc tttcaacgaa acgtttgaaa ggaccaagga 420 gacaactgtc acctggaaaa gcacacctgg cattacatgt atgacggtcc agaccaccac 480 agccacaagg aagtcaggta aaggtcttct ctctgtacat ttctatcaca agaaacacgg 540 attgatggta caagggcaat gcaccaagtg gtggatgaac attttgtatc ctgaactctt 600 agtcaacata aacgcccaag tagatatgca gacaaggtca acaatcaaag atggcgtcct 660 acgcgtcagg gacgccatag acaactttcc tacggtaaca ctagaatcga ctacaccaaa 720 gcaagtcttt aatagcagtg tccatgaaac accaacttta cccaagtgcg ccaatcctcg 780 aaacctacaa gatctgcaag cacccgtaag tgaagatatg gaagaagaga tacagattgt 840 gtttgataca agcgcaaaac atagcgggaa catagaagtg cctgtaagta gcgatgcgga 900 agatcaaggg ccgagtagga ttgaaaaacg cgccaaacag gtctctgata ctaagaatgg 960 cgaacaaagt gagcttacaa caaaagccgt agatatgcca aacttagcgc ggttgactgc 1020 ggttgagaaa tcccagagaa tcagcgatgc tgcttttggc gatttacaga aacagtttgt 1080 cgacatgact cataatctta acgcgctacg ggatgagctg cttacaaaag aacgggagag 1140 ggaagagtct caaaccaaca tgcttaaaga actgtccact aaagttgaag cagcacagaa 1200 gtcactagag agccacatca agaaggtagt caacaaatta cagaccaaca tctgtcatgt 1260 ccaaaaagat ctagtgtcgc aaagagagga gctagcgaaa gaaaaaaacg cgacagttca 1320 agaaatgaca atagagcgga acatctggat ggagaggaat cgtggcttgt ccatcaaagt 1380 ggacgccttg tctgcatcga atgcagagtt gcagcaagcc atcacaattc aagactcaga 1440 acttcagtat ttgtttgata ggttaaaaca gattgaagaa aaccgtacct gccgaggctg 1500 tacaaaacca acacacacgg acatgcaaaa cttgaacaac gtgcatgaca atgacatccc 1560 tggcgacaaa caagataaca atgctgctac caaccaaacc attgtcaagc agattgcaaa 1620 catgcttccc tctgcccatg acaatgcaac aattgcaaca cagagccgtg aagtcccttc 1680 accagccacg agtgtgaaag agtcggcaac taacactagt acgggaagtc agcatcaaag 1740 gtacgaagtg agaatttttg cggactccat cttcagagat gtagatgtga aacgtgcttt 1800 caatggaaag tccacgagat tggtcagaac cagcacagtg acagcagcca ttgactgtat 1860 taaacaaatc aacgactcga caactcagca agttatactt cacatcggtt ccaacgatct 1920 tgataatagc aagctgagaa atgactcagt cagtacaacc ctgaacaaca cccgcagact 1980 actggaggtg accaaagcat cgttccctaa agcgcaggta gcagtctcac aggtgctcca 2040 gaggggatta aatgataact ccacactcaa tatcaacatc aagaactata accaagaagt 2100 tctgaatctg tctaggagtg gcaacttcaa gtacatcaaa caccgcaaac tgacacagga 2160 caggcatctc tacctccccg accagatcca cctggatcct aatagcggca ccaagatgct 2220 tgtgtcggac gtcaagcgaa ccttgtaccc ggcatccggg accggggccg tgagacagtc 2280 tcccgcctgg acagatacac aggccgcccg tcttcctgcc agtgtcaacc agcggacgcc 2340 catgaggtgg caaaacggac cttcttttcc gcgccctgtt ccagttccag tcacgaggtc 2400 taggaagcca ccttaccagc cgcccaccgt gaaacaagca cataggaaca cagcggttgg 2460 aacaggacaa gctgcacctg gcttgtcgaa gggagctaac atcggagttc ccggccagct 2520 tggaccgacg acgaggagcg gggacacggc cagcggtacc ggacaagtcg ttaggcccgc 2580 ctggtggaaa gctggcaaga acggacatcc cagcccacag tggagacagc taggacaccg 2640 actccgtaaa gtttggaaca ttcttatatc ttaatactca tatgattcta caccaccttc 2700 aaggtaaacc ttcatgaatt gttacattta accctattac tatacatacg gaccatgtac 2760 cgctcattta gagagactcc atcattgtgt acaaatacta ttcatgactg acatgtatca 2820 ttgatagctt gaatacttgt ttattgatat gatgatatag tttaagtctt atctgaatac 2880 ttgtttactg atataataat atagtataag tcttacttct tgtagctgta tttacacctt 2940 caatatcttt aatttcaagg acatggagaa aaaatctgcg ctagtttttt caagttggaa 3000 tatccaaggc agtgttaaga aaaagtttag cgatgtagag ttttgtaatt actttagaaa 3060 tagtgatatt gtctgtatac aggaaacctg ggcagacaat tatgtagatt ttcagttccc 3120 agggtatagt ttgtttagta gtaatagatg taggaataag agggcaaaac gtaattcggg 3180 tggggttgcg actattttca aaagttcgtt cagaaagggg attacaaaat tagagagtag 3240 ttcagtcgac agcatttggt gcaaacttga caaaaaattc tttggactcg aaaaagacct 3300 atacctttgc aacatttata tccctcccga atcctcaaag tccatacaag tcgacccatt 3360 tgatacatta tccattgata tctgcaaatt ctccaatctt ggtgacataa tcatcctagg 3420 ggatttcaat gctagaacag gatctgtttt ggaaacctat tttagtattg attccccgga 3480 tgaccccctg acaaaagatg ttttagatac aaagaaaagg aataatagag attctaaagt 3540 aaacaattac ggtaaacgcc ttatagaatt gtgctcgtcg gctgatttga ccatcttgaa 3600 tggcagattc gcgggggatc taaaaggtga ttttacatgt tatcattata atggttcaag 3660 tgttgtggac tactgcatca ctcaaaggtc gatgttaacc gatatacaat atatgaatgt 3720 taattctata tcgccatttt ccgatcactg tcacatctct ttctcattgt ctgcaaagca 3780 ctctcctcta gtcctgaagg aggatagcca atgttataat cctaaaccaa cacaattcat 3840 ttggactgac gaatcaaaac aattatattt agatactata aacaacaaag acacgcattc 3900 aaaactgaga gatttctgca acaaaacatt tcactcgtca gaggaggccg taacaagctt 3960 cactgaaatt ttattagatg tttccaaaag ctctcttaga ataagacgta agatacaaaa 4020 caataaaaag acacaaatca aaaagaacaa aatttggttt gatcaaaatt gttggtccct 4080 acgtcaaaga gtaaaaagat tatctttaga actgaggaaa gaaccatgga aacaagaaac 4140 cagaacgaag tatttcacag ccttgaaaga gtacaaaaaa attatcaaac attctaaacg 4200 gaattataaa gccaatctat tacgtgatct agaaacatta aatgagaaaa accccaaaca 4260 attttggtca ttgttcaata ctcttgacaa ggaagtaaac ggcacaaaat ataacggtaa 4320 catagatcct aatatcacaa gtaaagaatg gatagaacac tttcaaggat taaataatct 4380 ggtaaaaaac aatgaatttg acagttcttt tgaaaaaagt gtcactgatt ctcttaaaca 4440 tttagataaa aacgcccaaa attcattaga ttaccctttc acgacaccag aaattatgac 4500 tggtttgaaa tcactaaaat ctggaaaagc atgtggtatt gactctattt caaacgagat 4560 gttaaaatat ggggcagaaa agatatgcga accactggtc attctcttta atacaatcct 4620 ttccaacaac aaatttccat ctaactggac cacaagtatc cttacaccaa tttttaaatc 4680 tggcgacaaa tctaacgtag ataattatag aggaatcgcc atctcaagct gtctgtccaa 4740 attgttcaca ttccttttaa atactcgctt acagaaattc gttgaaagca attgtctgct 4800 agcggacact caattcggat tcagaagaaa gtgccgaacc tctgacaatg tatttatcct 4860 gaaagctctt attgagaaat atattgaaaa aaagagaggg aaattatatg tttgctttgt 4920 tgacatgaag aaagcgtttg acagtgtatg gcgggacggt ttattctaca aattgttaaa 4980 tcttggtgtt ggtgggaaat tcttcaatgt attgcaatcc atgtacctca atgttaacta 5040 tactgtcaga ctacaaaatg gtttgtcaag cccctttgtt tccacatgcg gagtaagaca 5100 aggctgcaat ctcagtccat tattattcaa tatttttatt aacgacctac ctcaatgctt 5160 tgctgataaa tgtgacgccg caattttgaa ttctaaatca ctcaactgtt tatcttgggc 5220 agatgactta gccttgatct cattgtcgaa agaggggctt cagaattgta tcaataacct 5280 ggaatcttat tgcaacaaat ggaaattaag ggttaatgta tctaaaacga aagtactggt 5340 ttttacaaaa ggttctatta acagattatc aaacaaattc tatatatatg gcaaagaaat 5400 actagtaacc gacagttata cttaccttgg tattcccctg acttcttcgg gaaagtttaa 5460 agcagccagg aaatatttaa agactaaagc catgagagct ttattcaaac taaagtcact 5520 tttattctca gaaaagaaca ttccaattca tttaggaatg agccttttca acaagtttgt 5580 actgcctatt ttactgtacg gagccgaact cacatgtttt gaccaaactt ctaaagcaat 5640 caaaattatt gtatcaaaac aaattccaga atcttcaccc aaaactgttt tctcttcctt 5700 tttaaacaaa ctccatttag agggagatct ccctgcaact gttagaaaga gtgtgtcttc 5760 ggacctcact catacttacg tcatccattt taagaaaaga tcagacaagg atagattatt 5820 gagactggca tcaggaatga aattagaaaa tgacaacttc gcaattgaaa atatacgact 5880 tccttcacat attagtatcc cagaatttga taccattgac atgagtttcc aaaaattttt 5940 actcggagta catgcaaaat catcaaatga cggcatccgt ggtgagttag gaacctttcc 6000 catttcgatc aatgctgaga tacaattaat caaatactgg caccgtctag caaacttacc 6060 cgaagattct ttactacgtg aagcatatga tgttgtactt tccggtgaat acgattggac 6120 aaatcacgta actgacattc tcaactataa tggttttggg tatgtatgga caaacccgag 6180 gttgtatcat gttaatatgc taaccgacca attacgactc cgtttacagg atatatacat 6240 tcaagaatgg cattcttcta tccagaataa ctcaaaactt tccatttatt ctacgctaaa 6300 ggaaagatac ggacaaaaga agtatctttc gaatgtacac aatttcgaac ttcgtagagc 6360 aattacaaag ataagaatct gtagtcacaa actaaatata gaagccggaa gatattctaa 6420 aattccccgt gaccagaggt tttgtccatt ctgcccaaat gagatcgaag acgaacgtca 6480 ctttgttatg gattgttctc gatataatga taaacgcaca gagctgttca cattactttc 6540 cgcttgctct aaagactttg ctactctctg ttcgatagat aaattcaact acattctggg 6600 aggcgataat cctcactgtg tacaagtagg caaatatatc cacgattgtt tataccgtag 6660 aaccaatagt gtaaatgaca tgctagaccc tttatgttga tgtatccata cctatagata 6720 tccatatatg tgtgtttagt tgtactgtaa gtagttgtta tacatgtaga ccttacgaaa 6780 gtcgctgtac ttttgtcgtg tcaataaaga ttgt 6814 // ID Ginger1-8_HM repbase; DNA; INV; 5501 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5501 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 58-bp long. Tpase gene contains two introns: 333-435, CC 597-817. XX FH Key Location/Qualifiers FT CDS join(212..332,435..596,818..2685) FT /product="Ginger1-8_HM_1p" FT /translation="MSAHSYSVEDIKLYLYKGEINCSKSQRKSFLKYAKKF FT KLCDGKLYYVTASKNLQVLFNDNEKMLAFRDVHDSNHGAHVGLNNTRAKLK FT GSFYWLGMVNDITKWVKECDKCQRMEKIRTVAPELKPIKVNGLWDFLGIDL FT IGPLPITKLGNKYILTITDLWSKYIEAFPIPEKSAFYVSKCLTTLFYRFGP FT PKKILSDQGREFVNSLNEQLFSLFQIKHLITSAYHPQTNGQDERTNQTIKK FT SLSKLSNDTQDNWDELLEAVLFGLRTCVQKSTKFTPFFLMFGREANLFSTL FT SLNNSDSGTNDNNIVESNALDEQIQEKLGSYNKVVTEVNNNICHAQTKMKK FT YYASKQLKGWKSFTFKTGDQVLVRNYRKIGRKGSRMEHDWLGPGIISELKQ FT TGATLTINGKVWKKAVSLSNMKPYIVDKQVLSAELLLKEHDYCLKMTNQSY FT KKEDLKKRSSKKRHISELKCYDKDKKYKSVKMEISTSSLSDDKSILKRKRL FT IYPAISLDNNFYSTLSAQSIKQCIDILNNPFGWLDDTLIDIAQSFLSHQFP FT KVGGFQSSCIFNSNNFGGFVSGKFVQIFNVRNSHWVLISNVTSDNGSSSVQ FT YYDSLFTGFNKSTVPLLVHRVARSMLINEGIAFINLEVMRCQNQDNGNDCG FT LHAIANATALCHGIDPSVILWEQNSMRSHFLKCVENRKLEMFPYILLNDTK FT TCSISFQCDDLCPCALK" XX SQ Sequence 5501 BP; 1904 A; 732 C; 812 G; 2052 T; 1 other; tgttgcgtta aataagtagc ccgttaaata agtagacaaa cagttttgcg catgcgtttt 60 cacatgcgca gatgtgctgt taagtgtccg ttaggtaagt aacatttatc gttaaataag 120 taacctcgtt ccgttaggta cagttaaagt ttatttggat ttcattttta gccatattta 180 actatttaat tcatttaaag tttattataa aatgtcagct cacagttatt ctgttgaaga 240 tattaagctt tatctttata aaggagaaat aaattgctct aaatcacagc gtaaatcatt 300 tttaaaatat gcaaaaaagt tcaaactatg cggtatgtaa tcgttgtata ataaattgtt 360 gaataagtaa taattatttt taataaacaa taattatttt tattgaataa tcactaactt 420 attatatact tcagatggta agttgtacta tgtaacagcg tcaaaaaacc tgcaagttct 480 ttttaatgac aatgaaaaaa tgttagcctt tagagatgtc catgattcta atcatggggc 540 tcatgttggc cttaacaata caagagcaaa attaaaaggt tctttctact ggctaggtaa 600 tattattgtc tttttataat agtacttttt taatttctat aaggttttta agttctttta 660 agtgttatac atttcttgaa cttaattgct aggcaaatta aattaaaaca tgaattgata 720 tcaattcatg atatcaaaag atagagtaca gctctatttt tttaaatttg gttatgtttt 780 ggttcatgtc aaacttttgt ttttgttgaa ttgttaggaa tggttaatga cattaccaaa 840 tgggttaaag aatgtgacaa gtgtcaaaga atggaaaaaa taagaactgt tgctcccgaa 900 ttaaaaccta tcaaggttaa tggactgtgg gattttttag gcatagacct aatcggacct 960 cttccaatta caaagcttgg aaacaaatat attttaacaa taactgatct ttggagcaaa 1020 tatattgaag cttttcctat tcctgaaaag tcggcatttt atgtttctaa gtgccttact 1080 actttgttct ataggtttgg tcctccaaaa aaaattctat cagatcaagg tagggagttt 1140 gtaaatagtt taaatgaaca attattctct ttatttcaaa tcaaacactt gataacctct 1200 gcctaccacc ctcaaaccaa tggacaagat gaaagaacta atcagacaat taagaaatct 1260 ctctctaaat tatctaatga tacccaggac aattgggatg aattgttaga agctgtttta 1320 tttggtctgc gcacatgtgt gcaaaagtca actaaattta cacccttttt tttaatgttt 1380 ggcagagaag caaacttgtt ctctacttta tcactgaata atagtgattc tggtactaat 1440 gacaataaca ttgttgaaag taatgcattg gatgagcaaa ttcaagaaaa attgggttca 1500 tataataaag ttgttacaga agtaaataac aatatttgtc atgcccaaac aaaaatgaaa 1560 aagtattatg cctcaaaaca acttaaagga tggaaatcat ttacatttaa aacaggcgat 1620 caagttttag ttagaaatta cagaaaaata ggtcgtaaag gaagtcggat ggaacatgat 1680 tggttgggac ctggaattat ttcagagtta aaacaaactg gtgctacttt aaccatcaat 1740 ggtaaggttt ggaaaaaagc tgtttcttta tccaacatga agccttatat agttgataag 1800 caagtattgt cggcagaatt gcttttaaaa gaacatgatt attgtttgaa aatgacaaat 1860 caatcatata aaaaagaaga tttaaaaaaa aggagttcta aaaagaggca catatcagaa 1920 ttgaaatgtt atgacaaaga taaaaaatat aaatctgtta aaatggaaat ctctacatca 1980 tctctatctg acgataaaag cattttaaaa cgtaaaagat tgatttatcc tgcaatttct 2040 ctagataaca atttttattc aacattatca gcacagtcaa tcaaacaatg tatagatatt 2100 ttaaataatc cctttggttg gctagatgat actttaattg atattgcaca aagttttctt 2160 tcgcatcagt ttcctaaagt aggtgggttt caaagttctt gtatttttaa ttcaaataat 2220 tttggtggtt ttgtttctgg aaagtttgtt cagatattta atgtgaggaa ttcacattgg 2280 gtattaataa gtaatgtaac ctctgataat ggttcaagtt ctgttcaata ttatgattcc 2340 ctttttactg gttttaataa aagtactgtt ccattactag tacacagagt tgcacgatca 2400 atgttgataa atgagggtat tgcattcata aatttagaag tcatgagatg ccagaatcaa 2460 gacaacggaa atgattgcgg gcttcatgct attgctaatg ctacagcttt atgtcatggc 2520 attgatccaa gtgtaattct ttgggaacaa aacagtatga gatctcattt tttgaagtgt 2580 gttgagaata gaaaactaga gatgtttcct tacattttat taaatgatac aaaaacctgc 2640 tcaatatcat ttcaatgtga tgatctctgt ccttgtgcat taaaataatg attctgtttt 2700 gtgtctttac atcaagaaaa tatttaaatt ataagcttgt acaatatatg tgaatgttta 2760 tttacaatat gtaatacgtt gacaacatgt gatgaaacat caagttttca gttatttttt 2820 atgtttaaaa aacttatatc agaatccctg taaatataat aatccctata aaatataata 2880 atccctataa aaataatata ataaaattat tggttttaat ttttattgtg gatcgctatt 2940 ttctgaagtt tgtgttttgt gaggacattt actgtatcaa aatattgtaa ataaatctaa 3000 aaaatcaatt gccttttttt gaaaaaaaaa aggatagtat tttaagtcta gtattatttg 3060 agtcagtact ttagcttgtt attaatttag ttttagatta gactttactt ttgttagact 3120 agagtttgct caattttata ttagagttat atatagagtt tttaaactga atatattaag 3180 gaagcttttt aaaaatttgt aaattgtttt ttcaatttta agtcacttta aaatttatta 3240 aagtatcgct attttaaata tttaatcaaa aacttttagt ttcatttaga tttgtaatat 3300 tcatttgtgt aatattaaga cacttgaggt tgttgaatcc atttcaggtt acaagagtat 3360 ttttaactaa cttagtatta cctttttctt attatttatt ttgaatacaa agatttttaa 3420 atgaaatttt tcaataaaat atttgcatat ttaatttact aatttaaaaa aacttaataa 3480 atgcatacta cattattatt ttctagtaaa taattgtcct tttttttaga aatgacaagc 3540 taagctcctt ggttacaagc ggtaaaagat aaccttttag gtaaactcaa tctctgatgt 3600 ttgtttgcaa agacaagtaa attccaatct agcagaggta aacgttaagg gacagattta 3660 aaaacaaagt aaatgtaaat taatttttag ctaataaatt tttataattt tattatgtta 3720 tcgtaattat ttgtaaatct ttttcccgca taaatttgtc ttggatgtta tcccatgtgt 3780 ttttttcatt ggaattaaga gttatggaag tttattaaag gggttataaa attattggat 3840 tttgctaatt tttgcaattt ttcatgaact taccattcca gattaaataa tgcgatgaaa 3900 gtaactgttt cattgagcat tttctatcag ctgcgaaaat cttattgatt tttttaaaaa 3960 tattcctgac gtaggctgct ttatttgtat tgctgttgct acaaaacaac tctttaaaat 4020 atttacyttt tcaataattt agtatgatgc cttggaatct tattattgat tttttaacaa 4080 gaaacttttc taaagtaaaa tgacaaaatt tagattaaaa aaactcattt taactttcaa 4140 ttaatattcc tttaaaacat aataacatta cacttatctg aaatattttc acaccttgta 4200 gtattattat gcgatatttg tttcatgtca aactatccta ttaagatctt ttgaaatttg 4260 aataaggata ggggaaccca gagcaagatg actaacgttt tattattctt aatttttgca 4320 aagtattttt caaaagaaaa taaaaatgaa aaatcctcag atcaggtcag gatattgtta 4380 ccataacaaa tgaaaactta gtaactacaa agcagagttg ttagttcttg gccttggtct 4440 tgtcttaagg ccaaaaaaaa aggctttggt cttggtttgg ccttgaaact cttgtccgta 4500 aaaattttgg ccttgacctt aattgtttcg gcctaggcct tcaaaaaaaa aatacataaa 4560 attaaagaat taaaggtttt cattctttct ttttatgtat ttttgttttt agctttcata 4620 tatatctaca agtatttttc ctattgactt cgtagaattt tgcgcataac cttggtttat 4680 ttttataaca tgtggtttat tgtcacagca attgttacct acagttttgt taataattgg 4740 ccttaacgtt agccaaaaat ttaagtcttt ggccttgaaa atgtttgcgt aggccttagc 4800 cttaaaattc ttattcttgg ccttgacctt aaaactcttg gccttatcct tgactttgta 4860 acttttggcc ttggccatgc tacttttggc cttgttaaca acattgctac cgagcacttt 4920 ttctaaagta ttgcaaaata aacacgcatt gcaaaaagca ttccaaagaa ggatacgtca 4980 ttagaataac atctatcagc aattatccta accgtaacat catttttttt ttgatggaaa 5040 aaaaaaactt atgcatataa aacgacgtta ccaatgttta aacaaaaaaa aaatgttatt 5100 ttaaatacaa gctacaaaat gaaaaatatg cggtttcatt tccatccatt tacaaatgtt 5160 tttttttgtg atagaaaaat taatcgtgcc tcaatttttt aaatttattt ttattaacta 5220 aagttaatga tatagataaa gcgtaactaa agttaatgat atagataaac cgtaaattgg 5280 ttttaaaata acattgtaac acttaccttg acaacgtttt tgtttcattt tgacataaag 5340 tttggctaaa cgaaatatgc cttgattgct caatttgtct acttttttaa cgagctactt 5400 atttaacgca tataggcgtt aaataagtat ccgtatacta acgcatgcgc aaacattaca 5460 ttttgtctac ttttttaacg ggctacttat ttaacgcaac a 5501 // ID Gypsy-251_AA-LTR repbase; DNA; INV; 155 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-251_AA_; KW Gypsy-251_AA-I; Gypsy-251_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-155 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1104-1104 (2011). XX DR [1] (Consensus) XX SQ Sequence 155 BP; 48 A; 33 C; 31 G; 43 T; 0 other; tgagcaaatg tgctcatctc ttcaaccgag agccaaataa agattagcac tcagtgttaa 60 gttgaaagcc ctatggtaaa gacgtatatg ctaactattg attccgaaac gccaagtgta 120 ctatttcgca gtcgtatcag ttctcggtcg taaca 155 // ID I-7D_AAe repbase; DNA; INV; 6584 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-7D_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6584 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1360-1360 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. The consensus is 80-82% identical to I-7_AAe, CC I-7B_AAe and I-7C_AAe. XX FH Key Location/Qualifiers FT CDS 922..2556 FT /product="I-7D_AAe_1p" FT /translation="MEPNDTGGGTPEYLVSSDDELVLGQMNKILKTQHSEA FT METIDDGTPSPPLSPSAVPPVHSGTALSGDVTVNLSVHDSPHSSQKQTVTN FT SGLLGSSSQSHAFSPSGSSSNPAPPRHKAYPPGSKGPFLVFFRPKANGKPL FT NKLQIERDLAKSFRGILEIDSPSRDKLRVTVSDRDQANKIAAFELFLMEYR FT VYIPSREVECYGVVTEHNLTRADIMSGAGGFKNRAVPLAPVLDANQMNKVL FT PDGSKQPSNSFRVTFSGSALPSVLVIGRLRLPVRLYVPTVMHCEKCRQLGH FT TAPYCSNKPRCSKCGEQHAEGSCNMEPKCTSCGQAPHELSACPKYIEREKH FT TINSLKQRSRRSYADMLKKIAPTVAPSAHTFQSNNIFASLPDNDQCSDSED FT GEEYTIIETGTKRKRAVTKRRLKQQASQNVPVDQNASSKPLTKSRNGGITK FT KGSPPGFKFSAGEFPSLPGTSKTPVVPVFCPEEQPIRQQEQNNASGKLTLS FT GIVDFIFEMFQFSPETRNLLNMALSLVKPFLKQIASKWPILDSFVSFDG" FT CDS 2552..6232 FT /product="I-7D_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANMVNEVGDMIEVLQWNCRSLMKNMEAFKFLVHSTR FT CDIFALCETWLTSDKNISFHDFNIIRLDRGDGYGGVLLGISKLHSFYRIDF FT PSMTGIEVVACHITARGKSLSIASVYIPPNARVSRRDLAAICEAMPAPWLV FT LGDFNSHGTAWGSPRDDNRATLIYDLCDYFNLTILNTGEATRIKPPDPPSM FT LDLSICSNSLSLDCMWKVIQDPHGSDHLPIKISVTNNSCRTHQIDVAYDLT FT KHIDWGKYAEAISVGEQSVEILPPLEEYQFLSALIINSALQAQRKPVPGSS FT VRRRPPTPWWDGECTGVYRERSDAFKEYRKRGTRDNYDRYCSLDRKFKSLV FT KAKKRGYWRNFVNGLSRETSMRTLWTVGRRMRNAVSVNEDRESSSRWIFDF FT AKKVCPDSVQVQTITRDYPDERNEMDSTFSMAEFSLALLSCNNSAPGMDRI FT KFNLLKNLPDVAKRRLLNLFNAFLESNIVPDDWRQVRVIAIQKPGKPASDY FT NSYRPIAMLSCLRKLLEKMILFRLDKWVESNGLLSDTQFGFRRGKGTSDCL FT ALLSTEIQLAYAQKEQMGSVFLDIKGAFDAVCVDVLSDKLHECGLSSILNN FT YLYNLLSEKHMFFSHGNSTTLRISYMGLPQGSCLSPLLYNFYVRDIDDCLM FT RNCSLRQLADDAVVSVTGPEADDLQGPLQDTLDNLSAWAVKLGIEFSPEKT FT ELVVFSRKRNPADLELQFMGKELSQGLSHMYLGVWFDSKCTWGKQIRYLYQ FT KCQQRINFMRTLTGTWWGAHPDDLIKLYRTTILSVLEYGSFCFQSAAKTHI FT LKLQRIQYRCLRIALGCMNSTHTMSLELLAGILPLSDRFVELSLRFLIRCE FT VLNPLVIENFEKLIERNPQTRFMTLYYAYMTLEVNPSLNLPNHGCFPDFSS FT SAVVFDLTMKQDIRGIPDLLRSEQIPKIFASKFGHASCDKRFYTDGSKTND FT STGFGVYNEFHSAAYKLQNPCSVYVAELTAIHYALERIASLPSDQYFIFTD FT SLSSIEAIRSMRPVKHSPYFLGEIRSTLSALSNESYTITLAWVPSHCSIPG FT NEKADSLAKVGAMEGDIYDRQIAFNEFFSMARQHALVSWQQKWDAGELGRW FT LHSILPRVSKKPWFKGLDLSRDFIKVMCRLMSNHYSLGSHFYRIGLTDSNH FT CSCGAGYQDINHVVWECPEYGFARSDFCASLRAQGKPEKEDIRDVLGKLDF FT VYMKLMYNFLKQANIIV" XX SQ Sequence 6584 BP; 1760 A; 1493 C; 1441 G; 1889 T; 1 other; tcattccttg ttgagtgctc gcccgagtca gtcgactatc gcgcttagtg atataccgat 60 caaccgaagt tcaccagtgt ttttcccaag tatatgggtt ttttcctggc tacgatcgga 120 aagcataccg ccgatagtgc acccgcacaa ggagcagcta ccattggccg agcccagatt 180 ggtagactga acatctttaa cttgaccatc gccaccagca tcaatcggtg gcgataatac 240 cacaagaggc atcatctaat tcgcaccacg tggcagaaca aaagcgatcg gcgtcatcgt 300 ttcatcgcat caaaacaagc tgcagaaata gacgatcgca ccacgccggt acatctgtcc 360 tcgcagctag agcagcagaa aggagattgt aggtattgtt cctctccagt ttcataccat 420 ccgcattact tggtcgctgt tagtccccgt tgttagagtg atattctcac agtggtctcg 480 aagtgctctt tcgctcaatt aattggcata tttttgatat ttttttcaaa acattttgtt 540 ggagtaatat tcacacaccg tgtgacacag tggtctcgga gtgttttatt gctcaattaa 600 ttttttttcc gcctcatttt ttttttgtga atatattttt tttatttgaa aatatatttt 660 ttgttaatat tttttcgtta tttttttttt cttttttttc gttaatattt tttttttgtt 720 aatatttttt tttttgttaa tatttttttt tcttttttta tttttttaca ctattaggaa 780 tttctcgcca aactagttgt agtacgtgag ttggttttta tttaattcct ttggcaatag 840 tgtgattaaa ttaagtgaag ttaattcgct cattagcgct gtgttgatct tcctttgcgg 900 tagagcaaaa attgctccgc aatggaacca aatgatactg gcgggggcac gccggaatat 960 ttagtctctt ctgacgatga gttagttttg ggtcagatga ataaaattct aaaaacccaa 1020 catagtgagg ccatggagac catcgatgat ggtactcctt ctcctccttt atctccttca 1080 gctgtgcctc cagtacactc aggtactgca ttgtcaggag atgttacggt taatctctct 1140 gttcatgact caccgcattc gtctcaaaag caaacggtta ctaattctgg tctgcttggc 1200 agttcaagcc aatcccatgc gttctccccc tctggttcct catcgaaccc ggcacccccc 1260 cgccataaag cttacccacc cggatctaag ggtccctttc tggttttctt tcggcccaaa 1320 gcgaatggaa aaccgttgaa caaactccaa attgaaaggg atctggcaaa atcgtttcga 1380 ggtatcctcg agatagattc acccagccgt gataagttgc gtgtcacagt cagtgatcgc 1440 gaccaggcga acaaaattgc tgcctttgag ctctttttga tggagtacag agtctacatt 1500 cccagtcgcg aggtcgaatg ttatggagtg gtcacggagc acaatttgac tcgcgcagac 1560 atcatgtcgg gagctggtgg gtttaaaaat cgtgccgttc cgctggctcc tgttcttgat 1620 gccaatcaaa tgaacaaagt gttgcctgat ggctcaaaac agccgtcaaa ttcgttccgt 1680 gtaaccttct ccggttcggc cttaccaagc gtcctcgtga ttgggcgtct tcggttacct 1740 gttcgtctct atgtaccgac ggttatgcac tgtgaaaagt gccggcagtt gggccacact 1800 gcaccttact gcagcaacaa accacgttgt agtaagtgtg gcgaacaaca tgccgaaggt 1860 tcctgcaata tggagccaaa atgtacctct tgcggccaag ctcctcacga actcagtgca 1920 tgcccgaagt acatagagcg ggagaaacat accataaact ctctgaaaca gcggtccagg 1980 cgttcttatg cagacatgct gaaaaagatc gctccgactg ttgctccatc ggcccatacc 2040 tttcaaagca acaatatctt cgcatccctg ccggataacg atcaatgctc tgactctgag 2100 gatggagaag agtatacaat aattgagaca gggacgaaaa ggaagcgtgc tgttacaaaa 2160 cggcgcttga agcaacaggc ttctcagaac gtccctgttg atcaaaacgc ttcttcaaag 2220 cctttgacaa aatcaagaaa tggcggaatc accaaaaagg ggtcacctcc aggcttcaag 2280 ttttcagctg gagaatttcc gtcacttccg ggaacttcta aaaccccagt tgtcccagtt 2340 ttttgcccag aagaacaacc aattcgtcag caggagcaaa ataatgcttc cggaaaactg 2400 actctttctg ggatagtgga tttcattttc gaaatgtttc aattctcacc cgaaacgagg 2460 aacctgctca acatggcact ttccttggtg aaaccttttc tgaagcaaat agcttcaaag 2520 tggccgattc ttgactcgtt cgtatctttc gatggctaat atggtcaatg aggtcggaga 2580 tatgatcgaa gtgctacagt ggaattgtag aagtcttatg aaaaatatgg aggcgttcaa 2640 atttttagta cacagcactc gctgcgacat atttgctctc tgtgaaacat ggctcacttc 2700 tgataaaaat atctctttcc acgattttaa tattattcgt ctggatcgag gggatggata 2760 tggaggggtg ctcttgggga ttagcaagct ccactccttt tacagaatcg acttcccctc 2820 gatgacaggc attgaagtag ttgcatgtca tatcacagcc cgaggtaaaa gcctcagcat 2880 agccagtgtg tacataccgc caaacgctag agtgtctcgc agagacctwg cggcaatatg 2940 cgaagctatg cctgcgccat ggttggtcct aggagatttt aattctcacg gtacagcctg 3000 ggggtcaccg agggacgaca atcgcgcaac cttaatatat gatctttgcg actacttcaa 3060 tctgacaatt ttgaacactg gggaagcaac acgaataaaa cctccagatc ccccaagtat 3120 gttagacctc tcaatctgtt cgaattcatt atcattggat tgcatgtgga aagtaattca 3180 ggatccccat ggtagtgatc acctgcctat caaaatttca gttaccaata attcgtgtcg 3240 aacacaccag atcgacgtag cgtatgacct cacgaagcat atcgactggg gaaaatatgc 3300 tgaagcgatc tccgtaggtg agcaatcggt cgagatactt cctccgctgg aagaatatca 3360 attcttatcc gcgttgatta taaacagtgc tcttcaagct cagcgtaaac cggttccagg 3420 atcttcagta cgacgtcggc cgcccacccc gtggtgggat ggcgaatgca ccggagtcta 3480 tcgcgaaaga tcagatgcgt ttaaagaata tcggaaacgt ggcacgcgcg ataactacga 3540 tcggtattgt tctcttgacc gcaagttcaa gagcctcgtg aaagcgaaga aacgcggtta 3600 ctggaggaat ttcgttaatg gtctatcacg ggaaacgtcg atgcgaactc tttggacggt 3660 cggaagaaga atgcgcaatg cggtatcggt aaatgaggat cgtgaaagct cttctcgatg 3720 gatattcgac ttcgcgaaga aagtttgccc ggattctgtc caagtgcaaa cgatcacacg 3780 agattatcca gacgaaagga atgaaatgga ctcaactttt tcgatggcag aattctcact 3840 tgctctcctt tcatgtaaca attctgcccc aggtatggac aggattaagt tcaacttgct 3900 caaaaacctc ccagacgtcg caaagaggcg cttgttgaac ttattcaatg cattcttgga 3960 gagcaacatt gttccggatg attggagaca agtgagagtt attgccatcc aaaaaccagg 4020 aaagcctgct tcggactaca actcgtaccg tcctattgcg atgttgtcgt gtcttcgaaa 4080 gttgttagag aagatgattc tctttcggct ggacaaatgg gttgaatcga atggccttct 4140 ctcagatact caatttggat tccgcagagg caaaggaacg agcgattgtc ttgcgttgct 4200 ttctactgaa attcaactgg cctatgctca aaaagaacaa atgggctcag tctttttgga 4260 tattaagggg gcttttgatg cagtttgcgt agatgtcctt tcagacaaac tccacgagtg 4320 tggactttcc tcgattttga acaactactt gtacaatttg ttgtctgaga agcatatgtt 4380 tttctcacat ggaaactcga caactttgcg aataagttac atgggtctcc cccagggctc 4440 gtgtttaagt cctcttcttt acaattttta cgtaagagat attgatgatt gtctcatgag 4500 aaattgctcg ttaagacagc ttgcggatga cgccgttgtt tctgtcacag gaccagaggc 4560 ggacgatttg caaggaccgt tgcaagatac tctggacaat ttgtccgcat gggctgtaaa 4620 gctgggtatc gaattctctc cggagaaaac tgagttagtt gtcttttcta ggaagcgtaa 4680 tcctgcggat ctagagcttc aattcatggg taaggaactc tctcaaggtt tatcacatat 4740 gtatttaggg gtctggtttg attctaaatg cacctgggga aaacaaataa gatatctgta 4800 tcagaagtgc caacaacgaa tcaacttcat gcgtacactc acaggaacat ggtggggagc 4860 ccacccggat gatctgataa agctatatcg caccacgatt ctgtcggttc ttgaatacgg 4920 tagtttttgc ttccaatccg ccgcgaaaac acacattctg aagctccagc gtattcagta 4980 tcgctgtctt cggatcgctt taggctgtat gaactcgact cataccatga gcttagagtt 5040 attagcagga atactgcctt tatcagatcg gttcgtggaa ttgtcgttaa ggttcctcat 5100 ccgctgtgaa gtgttaaacc cgttggtaat tgaaaatttt gagaagctaa tcgaacgaaa 5160 tcctcaaaca agatttatga cattgtacta cgcgtacatg actctggagg ttaacccttc 5220 tttgaatctt cccaatcacg gttgcttccc tgacttctca agttccgctg tagtttttga 5280 tctgaccatg aagcaagata tccgtggaat accagatctg cttcgttcgg agcaaatacc 5340 aaaaattttt gcaagcaagt tcggtcacgc cagctgcgat aaaaggtttt acacagacgg 5400 gtcaaaaaca aatgattcca ctggatttgg tgtatataac gagtttcata gcgccgcata 5460 taaacttcag aacccttgct ccgtgtatgt cgcagaatta acggctatac attacgcatt 5520 agagcgaatt gcctctcttc cctctgatca atatttcatt tttacggata gccttagctc 5580 catagaggct attcgttcaa tgaggccggt aaagcactct ccgtatttcc tcggtgaaat 5640 acgatctaca ttgagtgctc tatcgaatga atcatatact atcaccttgg cctgggtccc 5700 ttcgcattgc tcgattccgg gcaatgagaa agcggactcg cttgccaagg tgggcgctat 5760 ggaaggcgat atttatgatc gacaaatcgc cttcaatgaa tttttttcaa tggctcgtca 5820 acatgcattg gtcagctggc aacaaaagtg ggatgccgga gaattgggca ggtggttaca 5880 ttccattctc ccgcgggtat cgaagaagcc gtggttcaag gggttggact taagccgaga 5940 ctttattaag gtaatgtgtc gtcttatgtc caatcactac tctctaggct ctcacttcta 6000 tcgaataggg ctcacagata gcaatcactg cagctgcggt gcaggttacc aagatatcaa 6060 ccacgtagtt tgggaatgcc ccgaatacgg ttttgccaga tctgattttt gtgcatccct 6120 cagggcccaa gggaaaccag aaaaggaaga cattagggat gtgttgggta aacttgactt 6180 tgtgtacatg aagcttatgt acaatttttt gaaacaagct aacatcattg tttaactccc 6240 ctttctcgtc ggtccacttg tccaccttgt gtcccctcga aacccgtctt tttgtatcgt 6300 tacaggttgt cattgtccac tcaacgttcg accagcagca aagcaaacac gtcacaactt 6360 aaagctgctg aatgtcaacg gaacgatccc attgtcctat cccttccttg aaattattgt 6420 tacccctaac ctcgaccaaa ccgcgagttt ttcggttccc caaaactaac atagaaaatt 6480 aagaaaacac catgaatttg taaataaaat caaaagaact cggctccgta atgcctaatg 6540 gcgcatgagc ctaataaata aatgattaag taaaaaaaaa aaaa 6584 // ID Academ-4_BF repbase; DNA; INV; 8340 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 8340 BP; 2573 A; 1672 C; 1825 G; 2270 T; 0 other; tagtccatgg gccagagggg tttttggtac caccgaaagg gacaagtgct atcatggatt 60 ttttgtatta gcagtgtcca aagtttattt caagctatga taaccagttg tcatgcacag 120 ctttcttact gtaatcacgt gaccaagtga cgtcatccaa attgacctgg ccgtttggtc 180 atttattgaa aaggacggga gaaatatgta aaagatcaat aattttcatt gaaaattggt 240 tgataggtga aaaacacccc aaatattaca aaatagtaag ttttttgtca gtatttagta 300 tcttgaggaa gttatgagtc ttcaaaggct aattttgggg gtaatttttg agtaaaaatg 360 acattgttta ctttccaaaa ccagtcattt atttggcttg cctatggggc cttgtttgat 420 acgttttctt gatttgccat gtcttaatct gcattttggg ctttaaagca acaaaattcg 480 tcaattctat caggagttac gggtatttca atgattacac ccaacggaca tttttaataa 540 attgctgcga atatcgcaat gcatcatggg taattcaaga gactctctat aaaacaatcc 600 acgtggccga gtcaaaatgt tgttttttct caacatttgg ggtggtagta gggcattggt 660 ccatacatac aaaaatggtt aagtttgtgg taaagtggtg ttcaccggtt gccatggcaa 720 cggccatttt tcacattgcg attttcacct gttaatggtc aatcaatcgt tccttggcat 780 cacatgttgt taatggtatc tgacgtaacg taccgttaac tccgactttt ttctctacat 840 tttttacgaa ttgcgttcaa agcaaaatcg atcctaaatc caaagcttgc cgaaacaaga 900 cattttttta aatgtttatt tgccctttta ataaaaacaa gtacttttac aaaaagctgg 960 catttggcgt acatggatat tccgattggc gaggcttttc ggtgggtaat gagcagttgc 1020 tttcaatgta gcagtttcct ttcgatctca tgttcgattc ttcacaattt tttcgaattt 1080 tcgagtgcat tcgcgaacgc attctataac agtttgtctt acaaaaagtc agcttatcgc 1140 tagaaatgga tgtgcgtttt ttatgcgcct tgaaatggat acctctgccc tggagaattt 1200 gatctgaggc aagccaagat gccgtgctct cgttcacgca gtcgcagtgc atcgggagaa 1260 ccacgtgagt actttttact tcgtatggat tcagactaaa atacggctgc gaaactttgt 1320 cgaactttcc atgaaagtca tcgtctattt atccacttta aagaaaagat gatacacgta 1380 tgtgttgaaa tataggtaaa gctaggacga tcatgaaagt atacactcag gcagaaatac 1440 aatgttgatg tagtgtttca aatgtttatt tcgccattta gtttcagcat tatcaagtga 1500 acgattcttg tttcaactta atcaacttct gagctattca ctatagaaaa gattaaccaa 1560 accttaaagt gacggtgcct acttttttac gtcataaggg gcaaaaagaa aggctgtgga 1620 gatcaatggc gaagatcgga agaagataca acgcaaggaa gatgaataca ccaatcattg 1680 tggtgaatta gacacagtag aatcggaacc aaatggttgt gctatccatt gcacaaacga 1740 aaccgctgaa aatctagtcc atctaaaaga tgcagagtct tggaaaaccc ttttgcgtgc 1800 tgcagagata cgcaaacatc aagtactctt agacatcgct tcatctgtaa aggaaggaga 1860 aattccggac atcaagtatc atcggaaatg cagaagtgtg tttacaatga agaagttact 1920 tgacaaaatg caggaaacag aaagttgtgg cgacgagaca gctaagcggc ttcccagtag 1980 aggaaaatgt agcccgaata aaacgtacga acgtctttgt attttctgtg acaagacaag 2040 caaatatgtg aagggcacaa acagccgtga gacgttagta cagtgttgcg atatgcgggc 2100 agacaacacc atcaggcaga tagccacaga aaagaatgac agcaagattc ttgcaatcgt 2160 gacccgggaa ttagtcgcag cagaagcgta ttatcacagg agttgttaca agagttacac 2220 tagaccagaa gcaagctgca ctctgaacta tgagagaaag agtgaatctg cagatgatga 2280 atatgcccgt cttgagaccg acgcttacca gatgctttat cactacatca gatcggatgt 2340 gatagaaaag gagagagttg tccgcatgtc ggaaataacg gccttacttg ttgagtatct 2400 aacctctctg ggcatcaagg agtgtaagcc ttcctcgaaa aaacacatca gaagacatct 2460 ggaggcagag tttggtgcga tgctcaagtt tgaaactttg ctcgacgata cacccggagt 2520 gtttctcatc cccgctaact tgacacccat gaacatggcc aagaatttgc taactgtttt 2580 gatggccgag aaggatgacg gcacatcgaa gacattatcc aacatagagc aagctgcaat 2640 agatattcgc aatgctatct tgagcaaaga acgcactatg tcttggccgc cacgccctgc 2700 agagctcgat gacaatgcgc tggacattcc tcaagagctg attgcctttt tgagtacctt 2760 gcttactggc agcaaagagg tgtcagtgga tgaatgcaat gcacgagtaa agcgattgat 2820 gaagtcctat gctcaagatc tcatgtttgg agtaagcagg ggacagatta agccacctaa 2880 gcacgtcctt ctaccatatg cagtgaagac tttgacaaac aacgtcgagc ttgtgtccat 2940 gctcaatcgc tgtggtcatg gcatctcata ttctcagctc gaggaaataa acactgctct 3000 ctgtctgcag aaaatggaac aaaacagcga aaccccattg ccagacaaca tccagcctta 3060 tgttagtacc acattggcat gggacaatat tgacagactt gaagagaccc tttcaggcga 3120 aggtacctcg catcgtgtaa acggaatagc agtacaggca aggcattttg gaccacgttt 3180 ctacagcgag caatcaccag taatacccaa aagtaagaga agaagcgtcg agccgcaaga 3240 tgtcgtcagc ctaccaattt ataattcagg agagcgccaa ggccccaaga caagagggta 3300 tgtagatgtc acttgccagg atgcaataga gagtgcgaga agaagaaacc tactgtggat 3360 tttggtgcgt ctacatggag aaatcagcca gagagtaagc ggatggaccg gatacaacat 3420 tttagtccgt aatgaaactg acgtcatcaa agacagcata gggtacctcc ccacaataga 3480 tgcaccagct actaacatgt cgactgtcca tgaaatcctg atgaggtcac taaaaatcaa 3540 agacgcacta caccttaaga gcatagtgct ggttttcgac caggctctat atgctaaggt 3600 cacagagatc atgtggaaac accctcaaac attcaaggac attgtcccaa gaatgggtat 3660 gtttcacacg ctactcaccc tgttgtcaat catcgggaag cgatttgaag acgcgggtct 3720 gagggacatc tgcatagagt caggagtgat agcagaaggg tcagtcacgg gagtgcttga 3780 agggcgcaaa tacaacaggg caatccggtt ccacaaactg atgtacgaag cactccaacg 3840 acttgtttgg aagcactttt tgaagtggat agagaaatca cctgcgaaac agaaattggt 3900 gaaggatgtc ttcgcgagct tgaaacctct ctacaatgat gtctgccagg atgaacagga 3960 gaaggttttg gcaaaccaga agttcgccaa gtttgtcaaa ctctacgatg aacacctcga 4020 gtttcttcgc cacagtaaag ggaagttagc aagcttctgg atgtcgtaca ttgagattgt 4080 ggagatcatg cttaacttgg tgagggcttc aagagaagga gactgggagt tacatctatc 4140 tgcaatagca cagatgatac cttggtgctt tgcctacgac aaggttaact atgcgcggta 4200 cctgcctgcc tacctatttg acatgtctca ccttaacgaa acccatcctg aagccttcaa 4260 ctatctgaat tctggtgggt tctcagtgca gatcggtgac cacaatccat ttggccgtat 4320 accggtagat cagacatgcg aagaaacagt taacaaggac acacaaacgt caggaggtac 4380 taaagggttc agtctcaagc ctggggcgat ctgcaagtac tatcttgtcg ctgaataccg 4440 aagcctgttt ctgaagcagt tgagggatat gctagatgaa cataaggctc attctgagca 4500 tactgacctc cagagtagca gaattgcaag agacgaagct gatgtgatcg cacttgttct 4560 tatgctggaa cgtttctgga taaatccatt caacagcgaa catcaagact tggtatgtct 4620 gtcaaccggg aggtttgcta ccccaaagat agaaaaggac ctacttaatg cgaaagtcgt 4680 tggtgaagag gcatacaaat cgttccgcat gcagcgcctc gaaatcaaca aagacacaca 4740 acaggcccag ttccacgaca cactgaagaa agccaagctc catacattct ctgagctgaa 4800 caagaaggtc aagttcaaag ctaaaacgac caaggagatc attcttaaag ctgacatggc 4860 tctgttcggt cagatgatca tcatagcgga aagtagaaag ctccaactga gagatgtgct 4920 tcgccatcct ctgggtcctc ttccgtggtc gttggctact gcagatggat cattacgaaa 4980 aactacgaag tctacactag caaaggaagt acagaagaat gtaccggctg cggataatat 5040 tccccaccca tctgcatgca tcattgatgg catggctctt gtacaaaggc taaaagctga 5100 tcaaaagacg ttttcagaag tcgctgattc cctactcgac atgattttgc acgaaggatc 5160 acactcaagc agaatagatg ttgtgttcga tgtgtatcga caggagtcca tcaagtctgc 5220 agaacgagaa cgtagaggat ctgagtgtgg gagtgaattc agaaatattc agcccgaaca 5280 caaagtacta cagtggagaa aatttgtact gaacccgaag aataaaacgt cacttacaag 5340 atttgtcacc gaggaatgga agaaagacaa atacagaaga aagctacaca gcaaggtgct 5400 ttacatagcc tgtgaggaag agtgtcacaa gatttctgca aagagtgcca tccccgttcg 5460 agacttgaac tcaaatcagg aggaagctga caccagaatc atactccatg tcgcacatgc 5520 ggccaggtca ggctacaaca cagtgattgt gtcatcagaa gatacagacg tcttcctact 5580 ctgtttggct tttaagcagt ccattccagc gtcgatattt gtcaagtgtg ggacgcactc 5640 aagaataaaa tatgttagta tcacgaatgc ggcacaggtc tggggccagg acatttgcag 5700 cagtctactc ggaatgcacg cgtttaccgg atgcgatagt gtgagtgcgt ttgctggccg 5760 ggggaagctg ggggcacttc ggctcgtcaa ggaaaacagg gacttccagg agatgttcaa 5820 acttgttgga atggattggg agctctcaaa tgagcttttc aagaaactag aggagtttac 5880 ttgccatatg tactcttccc gaccaggcac aagcgacgtc aatgagctca gatataggtt 5940 gttctgtgca aagagaggca gcattgattc tgtccaactc ccaccgtgtg ctgattgttt 6000 gtataaccac gctaagaggg caaactacgt tgcagcaatc tggaagaaaa gtctggagag 6060 ccaccctgta attccaagcc cgatcggttt gggctggtgt aaggacggag atcagctggt 6120 aattgattgg atggatggcg agccggcccc aactgctgta ctagagctgt tgtcgtgctc 6180 atgttcaaaa gcatgcaaga ttactaagtg cagctgcttg aagaatggct tgaagtgtac 6240 agacatgtgc aaatgtatgg actgcgacaa cagaccggac gaggaggaca accaagatga 6300 cactgaggat agcgatggtg atcatgatga agacgaatat gctgcataat gaggtgagaa 6360 taacatacaa atgatatcac gtttggctgc ttgtgagctc ctagctttac gctctttttt 6420 gcatttgggt caacgtttaa gtcacgtttg taaataattt cttattatcc ctttgtgcag 6480 cgttagatgg agaactttca cctaggactc agctctccag acatctccat gcttctgcgc 6540 tctctacccg ccaacagcac ggcagggatc cagaactcta ctcactcaac aacaatccta 6600 tatgtatata tatccgcttt gaaactaaga ctcttatcaa atagtattgc taagctgtaa 6660 gattgttata tataccatag aataagcatt tgatatgtac cttacattga tttcaaattg 6720 aatgaatagc cctcttgccg attcgttgta attggccccg ttgttgtaaa tatattgcta 6780 atatgatgtc atcccataat tttgatatta tttataccac gggatagtgc cagcgtggaa 6840 ctgtgggcgt ggcggactga agatggttca atttccgcat gaaatgtttt ttttttcatt 6900 aggagctgcc ttttcaaaac gcctatctaa attatgatca aaatcaactg aaaattataa 6960 caatcctata tgtatatata tccgctttga aactaagatt ctaatcaaat agtattgcta 7020 agctgtaaga tataccatag aatcagcatt tgatatgtcc cttacattga ttccaaattg 7080 aatgaatatc cctcttgccg actcgttgtt gtaattggcc ccgttgaata tattgtaaat 7140 atgtcggaag gcatgtaagt cttgttgttt tgaattaatt agtttactga ctcaatacag 7200 gaagaagatg acctggatta tgctccatat tatttccatt tagctaagtc aggcatacaa 7260 aactatatat ttttagtata aaaatagctt aaaataccta aatttaagaa ttgcaagtaa 7320 cgtattccaa aagtaaaaat aaaataatgt gggcgcatat cctacacacc tttaaaacaa 7380 atttcagttc atttgggttt gatataaggg cacaggatcc aaaaatatat atttttagtc 7440 ataaatggcc tcaaatccca aaataactca tttttgaagt gatgtatgca aaacttgttg 7500 gttgtgtcct gtggacatat gtccaatgca cctctgtatc acatttcagg tcattaggtt 7560 gcaataccag ggtacagggg cccaaaatat atatttttcg tccaaaatgg ctaaaaaccc 7620 taaatgaaac aaattttaag gtctttatca aaaaaattag aaaaaaatgc tgggtgtatt 7680 tgctatctct acctccatgc caaatttcaa ctcatttggc ccaaagatga aaaagttgaa 7740 tccatttaaa gatttggcaa gtgaaacacg gcaaaaaaac aatatacatg tatttcaatc 7800 catactgtag tatggattga aatataagat agttctactt atttcttacc taactaaaat 7860 gttttcttaa agcccttaca ttagttatga atcgttcagt gtcattggtc atgccttgat 7920 aatggccaaa cttcaaaagc atcaaacaat ggcttgttta gcaactttcc tgtgagaaaa 7980 tgttattttt caccgttgcc atggcaacca ttgtagaaat ttgtctaagt ttatttcaaa 8040 atggttccaa atatattcaa caggtatttg tatgatctaa gtgcagagaa atactcatca 8100 aatgtggttg gggattatca aaaccacaat tactgacctg taataaacga tttttggtag 8160 ttctatagat ttgagtgacg tcacaaaatc acgtgataat tcctataaag ctgggcatct 8220 gacacggtta tcatgcaaaa taataaagta gaggaatcaa ccttttcaaa aaactaagtc 8280 accctttgtc cctttatttg gtaccaaatg gccatttttg gggacctggc ccatggacta 8340 // ID Gypsy-613_AA-I repbase; DNA; INV; 7897 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-613_AA_; KW Gypsy-613_AA-LTR; Ty3_gypsy_Ele177; Gypsy-613_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7897 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3150-3653] - Reverse transcriptase CC Positions [4719-5198] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 2754..5597 FT /product="Gypsy-613_AA-I_2p" FT /translation="MIDVGSGPETLETVTPRDKKVLCFTIEPSTELPTIEQ FT EENDETLDIPVYEGPTESVADPDNIETEHKLTEVQRVQLLEVIKRFELTSE FT GKLGRTNLIEHEIVLKEGVKPRNPPMYKCSPYVQEAINAEVERFKKLDAIE FT ECYSEWTNPLVPVPKKNGKVRVCLDSRRINKLTVKDSYPMRNMQDIFRRLG FT KAKYFSVIDLKDAYFQIPLKEDSRNYTAFRTAKGVFRFKVLPFGVMNAPFT FT MSRLMDKAIGFDLEPFVFVYLDDIVIASETFEEHLRLLKIVGERLNKAGLT FT ISIEKSRFCRKQVMYLGYLLNESGVAIDSARIQPILDYARPRNQEDIRRLM FT GLAGFYQRFIKDYSRITAPITDLLTKENKKVIWTKEAENAFNELKSVLTSA FT PILGNPDFTKTFTIESDASDRAVGAALVQEQDGETRVISYFSKKLNRTQRR FT YAAVEKECLGVLSAIHHFRHYIEGTKFRVVTDARSLLWLFNVGAETGNAKL FT LRWALRIQAYDFDLLYRKGKANITADCLSRSVEVDAITASQPDNDYEELME FT EIKTNPVKYKNYRILDGIVFRYVKIKDRLSDPRFEWKQLPMESQKLDVIRK FT EHDKAHFGYEKTLAAVKQRYFWPRMNGEIKKFCRECLKCQMSKSGNINVTP FT PMGSQKPVEYPWQFVTLDYVGPLPPSGKNRSTCLLVATDVFSKFVLVQPFR FT EAKANSLADFVENMIFRLFGVPEIILTDNGSQFVSKRFKDLLQAYHVNHWL FT TPAYHPQVNNTERVNRVITTAIRATLKRDHKHWADDIQSIANAIRTAVHDS FT TKHNPYFVVFGRDQVSDGREYGWIRDNYRPKDDDNEKNVVTDRRKKLFSEI FT KTNLAAAYKRHAKTYNLRSNANCPAYMVGEKVLKQTFDLSDKGKGFCKKLA FT PKYEPAVVRKVLGTNTYELEDDSGKRIGVYFANRLKKLHSPSSN" XX SQ Sequence 7897 BP; 2316 A; 1550 C; 1860 G; 2160 T; 11 other; ggctggcccg ttatatttgg cgcccaacaa tacaaggctt tttaagattt aagttgtagc 60 tgactgttgt aatataactt ttcgtattta cccatttttg ttgcatgtct tagcacatat 120 ttatattgaa agtgtgaaat tcattgtaag ggaactcaaa ttttgtgaat tttcgatcaa 180 tatatttgat ttcgttgttt ttgtatttag gtttataatt tttcgtttat ttcaataaac 240 atcaatcatt tgtctaattt tgttcatttg ttacattgtt tgcttcatat ttgtattatt 300 tattccttac tgttatttat tgctatttgt tatttattcc ttactcttaa atatttgcta 360 attaataagg taactcctaa gtcgattgtt cttaaataag tgtgtatgca atttagtgcg 420 gttgattttt cttgtctttg ttgtatttct ggttgattgt agcttctgaa attatacaaa 480 aatgaatcaa ttatttcccc gagccgaaga tctcactctc gaagagatca attatgagtt 540 cgttattcga catcaaccag aagaagtttt taagctcgaa ttgtcaggca aacaaagaca 600 tttgagacaa ctattcaagg aagatcagaa agaagggcgt aaccatcgtt cgccttactc 660 aatcagcgag gaagcgtcac acatcgaagg cagaatagaa aatttagcaa aggcattgga 720 gaaaagggta gagaccaagt tcgaatctcg ggtaattcat tactggtatc gggtwaaaag 780 ggctattgcg aggaatgatg aagagatgag aatgagaaga gacttgattt gtaaaattga 840 aaggattatg caggactttc agtttgggcc gtcgttatct ccttacaaaa cgcaaagcaa 900 tacagtcktt caagkcagcg aagagggagc kgctggcaaa actatgaacg cagacgcagg 960 agtaagtcaa aggtcagaat cttcgaatca matgaatgac ctttctcagt atgctaaagg 1020 tgctattcct aagggaatga gtcattcacw ccaggaaack gtagggaatc ctgagccaga 1080 gtttgtggtc tcgcgtmaag aatgggaaga catgaaggca atgatgatga gtctgatgaa 1140 tagaatggcc ggtgcgaatc cgaatgacag ggktggaagt gtaggtaacg atagccaaag 1200 gtcacgaagg tcggaaatag aaaggcagga accacctaga gtgattgaac caccgtacag 1260 ttttcagcaa cgtgcagtta gaccaatatt agagtcaagg gactcaagca cggacaatta 1320 cagtgctcgt atacatcggt taccaagaga gagctttggc ccaagcgacg aagaggacga 1380 ttaccgacat catcgagcaa gagaatttcg ggctcttcca agatacggtg taaggagaga 1440 tcgagagcag aataatcgga tagagaaatg gaagctgcgg ttcactggcg aacctaggtc 1500 catgtcaatc gagaactttc tttataaggc caagaagtta gcagaacgag agggtgtgcc 1560 cagagaaatt ctacttcgtg acatccactt actgcttgaa ggcgctgcgt cggattggtt 1620 tttcacgttc gtagatgatt tggacacatg ggacgttttc gaaaccaaca ttgtctaccg 1680 ttttggtaac ccaaacaagg accaaggaat cmgaacgaaa attcacgagc gcaaacagca 1740 aagaggagaa tctttcatag ctttcgttac ggaaatcgaa aagctaaaca aaatgctwtc 1800 tcggccgttg tcagcgcgga gaaagttcga gatagtctgg gacaacatgc gacagcatta 1860 tcggtccaaa atttctattg tggaagttaa cgatcttcaa caactgacga ggttgaatta 1920 caggattgac gcagctgatc cccagttaca gtaccagact ggcgatacct ctctacgacg 1980 acccatcaac cagattgagg cggaagacac cgactacgac agcgaccggt ccgcgccggt 2040 gaatgaaata agaggtcgat acaacagaga cgttcggcca ggacagaaca acaaccgaga 2100 aagacggaat caacagcaga gtgatccaaa ccaacaaacc agaagagggg atcaaacaac 2160 gccgccaatg gcctgctgga attgccagaa cgaaggacat ggctggaggc aatgtaataa 2220 gccgaaagta gtattttgct acggttgtgg gaatctaggt aggacaacgc gaacctgtga 2280 gcgttgctcc aggaattacg gagtcccacc ggctgaacag cagggaaacg aatagagggg 2340 tgtaagtcag ggaatgcaag catccgtgtg caaaaagaag ttcccaaaat acctatgaca 2400 aatctaacat acgatccttt attgcaattt atcacattaa gatacgggtt agtaagtgcc 2460 cgcatattaa agtaaaaatt ttcgattcag actgggaagc tttactagat tccggggcgg 2520 ggataagcgt tcttaattcg gtggaggtgg tagagaagta cggcttgaag atacagtcag 2580 cgtcaataag agtaagcact gcagatgggt ctaattacgg atgtcttggc tatgtaaata 2640 tacctttcac gttcaagaac atcacgaaag ttatccctac gattatagtc ccagaaattt 2700 cccggaatct gatacttgga gcagattttt gggatgcatt tgggataaaa ccgatgatcg 2760 atgtaggtag tggaccagaa actttagaaa ctgtcactcc aagggacaag aaggttctct 2820 gtttcaccat tgaaccatcg acggagctac caacgattga acaggaggag aatgacgaga 2880 ctttggacat tccagtttat gaaggcccaa cagaatccgt ggctgatcca gacaatatcg 2940 agacggagca caaactgacc gaagtgcaga gagtgcagtt gttagaagtc atcaagcggt 3000 ttgagctaac atcagaggga aagctgggaa gaacgaacct catagaacat gagatcgttc 3060 tcaaggaagg tgtaaaacct cgaaatcctc cgatgtacaa gtgctcacct tacgtccagg 3120 aggcaatcaa tgccgaagtg gaacggttca agaaacttga cgccattgaa gagtgttata 3180 gcgagtggac gaatccactc gtcccagtcc cgaaaaagaa tggcaaagta cgggtgtgtt 3240 tggattccag aaggatcaac aaacttacag tgaaggactc ttacccaatg cgcaatatgc 3300 aagacatatt ccggcgattg ggaaaggcca aatatttttc ggtaatcgat cttaaagacg 3360 cttactttca gatccctctg aaagaggaca gtcgaaatta caccgcattc cgtaccgcta 3420 aaggagtgtt tcgcttcaag gtgttgccgt tcggggtgat gaacgcacct tttacgatgt 3480 cgcgtctaat ggacaaggcg attggattcg atttagaacc attcgtcttc gtttacttag 3540 atgacattgt catcgcctcc gaaaccttcg aagaacatct tcgacttttg aagattgtgg 3600 gagagagatt aaataaggca gggttaacga tttccataga aaagtcgcgt ttttgtagga 3660 aacaggtcat gtaccttggt tatttgttga acgaaagcgg ggtagcaatc gactcggcac 3720 gcattcaacc gattttggat tatgctcggc cgaggaacca ggaggatata cgcaggctca 3780 tgggtttagc gggattctac cagcggttca taaaggacta cagtcgaatc acagccccaa 3840 ttacagattt gctgaccaag gaaaacaaga aggtgatttg gaccaaagag gcagaaaacg 3900 cgttcaatga gttgaagtcg gtgctgactt cggcaccaat tttgggaaac cctgatttta 3960 cgaagacctt caccatcgag tcagatgcgt cggatcgtgc ggtaggagcg gctctggtcc 4020 aagagcagga cggggagacc cgcgtgatca gctattttag caaaaagttg aatcggacgc 4080 aacgacgata cgcagcagtg gaaaaggaat gcttaggagt gctttccgcc attcaccact 4140 tccggcatta tatagaaggg accaaattcc gagtggtgac agacgcacgc agtctgctct 4200 ggctgtttaa cgtaggagcc gagaccggaa acgcgaagct attaaggtgg gcgctccgga 4260 tacaagcgta tgatttcgat ttgctgtatc ggaaaggaaa ggcgaatata accgccgact 4320 gcctatctcg atcagtggaa gttgatgcga ttacggcatc acaaccggac aacgactacg 4380 aagaactgat ggaagagatt aagactaacc cggttaagta caaaaactac cgaattttgg 4440 acggaattgt gtttcgatac gtgaaaatca aagatcgatt gagcgaccct agattcgaat 4500 ggaagcaact tcccatggaa tcgcagaagt tggatgtcat caggaaggag catgacaagg 4560 cccatttcgg gtatgaaaag acactggcag cggttaaaca gcgctacttc tggccaagga 4620 tgaacgggga gataaagaag ttctgtcggg aatgcctaaa atgccagatg agtaagtcgg 4680 ggaacatcaa tgtcacgcct ccgatgggtt cgcagaagcc agtagaatat ccgtggcagt 4740 ttgtgacctt ggactacgta ggtccactgc cgccgtcggg gaagaaccga agtacgtgtc 4800 tcttggtggc cactgatgta ttcagtaagt ttgtcctcgt gcaacccttc agagaagcaa 4860 aggctaactc gctcgctgat tttgtagaga atatgatatt tagactcttt ggggtaccgg 4920 agataatcct gaccgacaat ggttcccaat tcgtgtctaa gcgattcaag gatttgctgc 4980 aagcttacca cgtgaaccat tggttgacgc cggcctatca tcctcaggtc aataataccg 5040 aacgagtcaa cagggttatt acaaccgcca ttcgtgcgac gctcaagaga gaccacaaac 5100 actgggcgga cgatatccag tccatagcaa acgctattag aaccgcggtg catgattcga 5160 caaagcacaa tccctatttc gtagtgtttg gccgcgatca ggtttctgac ggtagggagt 5220 atggttggat ccgtgataat tacagaccga aagacgacga taacgagaag aatgttgtta 5280 cggataggcg aaagaagttg tttagcgaaa ttaaaacaaa tcttgcggcg gcgtacaaaa 5340 ggcacgccaa aacttacaac ttgcgctcga atgcgaactg tcctgcgtat atggtagggg 5400 agaaggtctt gaagcagaca tttgaccttt cagacaaggg gaaagggttc tgcaaaaagt 5460 tagcccctaa gtatgaacct gcggtggtta gaaaggtact gggtacgaac acgtacgaac 5520 ttgaagacga ttcaggcaag agaatcggtg tttacttcgc caatcgactc aaaaagttgc 5580 attctccatc gtctaactag aggatggaag gttttccagc tatgtatttc tttaagaagg 5640 accactactt gatgaacaaa aacacctacg gggtcaaaac atctacggac agatcgaagg 5700 actcgtgtta aggaagtttt tcaactgact tgcactgagt gtaccgactg atcctaatac 5760 gaagacaaat tttgaacaat ttatgtaaag cgatttccag ctatgtatct cttttagagg 5820 gaccactaat tttgatgaat gaaaacacct acggggtcaa aacatctacg gacagatcga 5880 cggactcgtg tgaagaaggt tcttgaactg actagcactg agggtaccga ctgagcctaa 5940 aacttcgaca aactttttaa caagctatgc aacttttgaa aaagcaacta cgaacaacta 6000 agattcaata aaataccgta aaagggcaga attccaccta acaaaccacg attgatgatt 6060 agagaagacg cgaataccca atcaagactt tctcactcgt tagtgagctg tacatatgta 6120 aatattagtt tgttagttag cctagttccc tatcacttaa atttagatta gtacttagct 6180 taaattagtt gaaattacca aattaactca cgatttcctt ttgtggattc gtagtattgt 6240 ctcttttttt cgttggtcga gcgatacttg tatataagtc tggtttcttg tctttaatca 6300 gccagcgttg taaatatcct aaaaataaga attgttgaat ttttgttatt tatttactta 6360 cgttatttat ttttcttagt ctttcgtagt tcctttgtcg atttgaagtc aaacaccgtt 6420 tcgttctcaa ttttttcacg tccttatccg tccttagtcc cattatatcc taggaagtaa 6480 gttctcacag gtggaattat accttcacct agcacctgtc ctcgtccata gtaatattag 6540 tcaagtccat ccttcaatta cgtccttttt ttgtcccgtc catatagttt gtccaaagat 6600 ccttctgtag tagtaagttt acatttccag tcccagtttc tcatgtaaac attacgtcat 6660 atcgtctctt ttttttcgtc atttgagttt gtccgttttc gtattccgtc cattcaacca 6720 caaaggtttc tattagaagt ccttcttctt cctctctcgc ttcgtttcat cctacgctcg 6780 atagccaatc catagttttt ttagtcacct gtaaataaat aaaacaaaca acttacaccc 6840 gatagattgg aacgactgaa tagggtactc tcctgaatgc cagacgtcct gagggcggaa 6900 aatcgtcctt ttggtgccgt tttccgcgat attcgacact tttccgcgaa acgtgtcctc 6960 cagatcgcga actgatacgt cttttgtttc ctctttctga gtttggcagt acggttcagt 7020 gttgacatgt atgtgtgata gaccggtcaa attgagagtg agtgcgcgcg tgtcggagtt 7080 ggattccggt gcgtatttga gtgtatgata tgagtgaatg gttggtattt ttgtgatttc 7140 gtgtgtttct tagagatctt ttggtagtcg gatattttct ggcgtagaaa gattctctca 7200 aggaagaatt tgagtttcga tgttacagtg agcacatttt gcggatgaga gactccgtag 7260 ctgcgaactg taaatagaaa cggacccctt tagtgaggaa gttggtgtct cgcaatgctc 7320 gctacgcgct gggtgagagg caaccagacc tacttgctag ttttcgttca tgtgaatgtt 7380 tttcgtaaac gttatttgtt gatcggcaat gtcctggtgt aaaataatgt ttcagtaatt 7440 tatgtgagct tgctcagagt tgtgcacgat tctggagttg aagccgtttt catgttgaat 7500 tttgagattt atgtgtatga atgaggtcat ctacgttatg ttctgtcaaa caaagtagct 7560 tccgtagtct caatgttgtt acctagactt tcgagcgaat tttcggccta cccctatttc 7620 tttttctatc agtaagcgta tttgaaagat aattattaga ttttcgctgg agaccggaga 7680 ctaacgtcgc tgatggtcag tagcagtcaa gctctataat ggaacttgat ctgtaaattg 7740 agaaggagtt gacttcggtt aacgatgaac ttatatgacc ccgaaaatcg aaaaatattg 7800 atcttatgcg tcttgcttgg cctacgaaaa tttgattggc tattttcatc tccattttca 7860 atgccaatca aattttcgta tttttagtgg ggaatgt 7897 // ID Gypsy3-LTR_Dpse repbase; DNA; INV; 527 BP. XX AC Unknown_group_550; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_Dpse; KW Gypsy3-I_Dpse; Gypsy3-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-527 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1042-1042 (2009). XX DR Genome; Unknown_group_550; Positions 30412 30938. XX SQ Sequence 527 BP; 148 A; 126 C; 162 G; 91 T; 0 other; tgcggacgcc gagaaactgc tggcgacgcg atggacaagg agcggtccag accttggggg 60 tggtaggcgc agggaagatc acaaagtgca tgaattggat gcgaacaccc acgacccgga 120 ggaacagacc gaggcattgg atgctgtata taggacgagg gaatccggag gcgatcgcgc 180 ccagctgagg tgctggaact gcgctcagcc ggggcataca tattttgagt gtgagtcggc 240 cgtgcgtaac ctgttctgct acaagtgcgg gctggccggg gtcatcctcc ctaaatgccc 300 taaatgtcga ccgggaaacg cgaaaaggag cacgacgcaa ttggaggagt cgagctccac 360 gtcgccgcga agcagccagt agataggtcc tccaacgtta aattagatac ccctaaggaa 420 agaagtagca ataagaatat ttacactacc atatataagg ttaaccaagc tgtattaccc 480 gaagctagat ggtccgagcg caagccgata aaggaaaggc agcgtca 527 // ID Kiri-10_AAe repbase; DNA; INV; 4476 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-10_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4476 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 705-705 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 279..1034 FT /product="Kiri-10_AAe_1p" FT /translation="MSTKRDLSTRRAKQDEAATSSEDMSVSQLAQWISSKL FT DTTKEELVKRINEGMVSVKTEIKTELDIMRSQLDKSISDLSKSVATNSDAI FT QSTTSALSRSQYTNDLIVSGVPYIKDEILTNYFETWCKQLGYTSVPLVDIR FT RLSKQAMDVGKSYKILLQFAITNQRSDFYHKYLKARSLALDQIGFKSNDRI FT FINENLTPLARAIKAKALLVKKEGKLHSVFSKNGEIFVKRSAGANAILISS FT ESHLVQVLQQL" FT CDS 1480..4302 FT /product="Kiri-10_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MAEQTDLNTSTNICVPGIVLRSAFVAEKLSVMCLNVQ FT SLCARNMIKFDELRQIMNVSNVDVACVCETWLNSRIDTSLVEIAGYNSIRS FT DRIGRTGGGLLIYIKKCFKFRTLDTSYVTINGHTIEFMFIEAHIQNRKILI FT GLFYNHPELDCSDVLFEKISEYGSNYDEILVSGDFNTNLLKKNTKSERLLE FT TLDVLGLMNVGIEPTFFYSQGCSQLDLVMTNNNNSILKFGQVSVPNISHHD FT LMFVSLDLTTVQSEQDVYYRDYKNVNPNRVIEQFNQLDWVSFYDADNPDTL FT ISLFNLNVKTLYECCIPLRKLKNPKGRSNPWFNAQIQKSMVDRDRAYKKWK FT ITKSYNDFQNFKRLRNVTNTLVTKAKRDFFNCQLNTDLPSKQLWNKLKELG FT FTTRSLNVENNFTADEINWSLHKHFSTSSSDTVQSESFVSNGFRFSNIQEY FT DVINAIFDINSNAVGLDDIPIKFIKFVLPLLLHPITYLYNCIITKSVFPRA FT WKLSKIIPIKKKNNCSSLDNLRPISILSALSKAFERILKKQICAYIHENHL FT LSKCQSGYRPGHSVKTAMIKICDDIGLVLDRGNSVIMILLDFSKAFDTISH FT SLLCKKLQHNFNFYKDAVDLISSYLKNRKQAVFSNDMLSSFLDVSSGVPQG FT SVLGPILFSLYINDLPPIIKHCQVHLFADDVQLYLNCNKSDLNMVANKINE FT DLECIRAWSERNLLKLNAKKTNALLISRYIQVEYPEIKIGNETIGFVDRAT FT SLGFTVGNNFKWDSYVLGQCGKIYGSLRSLYTKASLLSTNVKLKLFKTFIL FT PYFISCDFLFFNVSSNTQERLKVALNSCVRFVYNLKRQDHVTHLQHTLLGN FT SFFSFFKTRICLLLHKIIITKQPSYLHSKLQPFRSNRIRNFLIPAHLTTLY FT AGSFFVRGVIIWNSLPNVIKELNSMFKFKKQCKVHFK" XX SQ Sequence 4476 BP; 1426 A; 788 C; 832 G; 1430 T; 0 other; ttggcaacac tggtatatgg agtagtcata gaagctaagt gctgacgcaa agaaatgctc 60 aaaaagtgca ataataccgt aaattaaatc cccagtgtag tgcatattta cttcaaaact 120 atcatttaaa atgcagaagt gatatcttgt gccaatatca gttgacaagc ttaaaaactg 180 aaagtaaaaa aattgctgtg ttgattggtg tcatcaaaag tttggatcgg ttgacttagc 240 ggttttatcg ttatcgttta cggtgtaatt catcgataat gagtaccaaa cgagatctga 300 gcactcggcg agccaagcaa gatgaggctg caaccagtag tgaggatatg tccgtctctc 360 agcttgccca gtggatttcg tccaaactcg acactacgaa ggaggaactg gtgaagagga 420 tcaacgaagg aatggtgagc gtaaaaacag agatcaagac cgaactggat attatgcgct 480 ctcaattaga taaatccata agtgatctca gcaagtcggt ggctacaaac agtgatgcta 540 tccaatccac tacgtcagcc ctaagtcgtt ctcagtatac gaacgacttg atcgtgagtg 600 gtgtgccgta catcaaggat gagattctca cgaactattt tgaaacatgg tgcaaacaac 660 tcggatatac ctctgtacca ctcgttgata tacgtcgtct ctcaaaacaa gcgatggatg 720 ttgggaagag ctacaaaatt ctgctccagt tcgcaatcac caaccaacga agcgattttt 780 accacaagta tctgaaggca agatcactcg cgctggacca aattggattt aagagcaacg 840 acaggatttt catcaatgaa aatcttacac ctcttgcgag agcaatcaag gctaaggcac 900 tcttggtgaa aaaagaagga aagctgcatt ctgtcttctc gaaaaacggg gaaatattcg 960 tcaagagatc tgccggagca aatgctatcc tgatttcgtc ggagagtcat ctggtacaag 1020 tacttcaaca actataacct ttccttcgct acactcactc tccttccaat ccatcctata 1080 acaatcccat gattcctctc ctaaaagtca ttcgatagct tgcaccctct cctaatgaac 1140 gctctctgtc cttccaatcc attcccaaaa ttatcctatg tttcctctcc taaaagttat 1200 ctggacctgt gcggggatgc tgttgggatg ctgttgcgag gatgctgctg ttgctgttga 1260 ctgctgtgct gaagaatgct atccgaattt agggatgatt atcaaaatga ttctaaaaat 1320 gtatttgttc aagaagaatg ttattgctga aaaaatattt tgaatattgt gtgtactttt 1380 tgagttctta tttgcgtcac tgtagctatg tttgattaat gtttttgagt tttagttttg 1440 acaccttgca ttgctgctta aatatttcat tcatggataa tggctgaaca aaccgatttg 1500 aacacctcta cgaatatttg tgtacctggt atagtcttga gatctgcatt tgtggctgaa 1560 aagttatctg ttatgtgtct caatgttcag agcttgtgtg ctagaaatat gatcaaattt 1620 gatgaattgc gacagataat gaatgtgtct aatgtggatg ttgcatgtgt gtgtgagact 1680 tggttgaaca gtagaatcga tactagctta gttgaaattg ctggctataa ctcaataaga 1740 agtgatcgga ttggtagaac aggaggcggt ttgctgatct acattaaaaa gtgcttcaaa 1800 tttagaactc tagacacttc ttatgtgacg atcaatggcc atacaattga attcatgttc 1860 attgaagctc atattcaaaa taggaagatt ttaataggtc tgttctacaa tcatcctgaa 1920 ttggattgtt cggatgtttt gtttgaaaaa atatctgaat atggttcaaa ttacgatgaa 1980 atcttagtat caggagattt caacactaat cttttaaaaa agaatacaaa gtctgaacgt 2040 ttgttagaga cactggatgt tcttggtttg atgaacgttg gaattgagcc gacctttttt 2100 tattcacaag gctgttccca gttagactta gttatgacca ataataacaa cagtattctc 2160 aagttcggtc aagtcagcgt tccaaatatt tctcaccatg atctgatgtt tgtgtccctt 2220 gatttaacga cagttcaatc agaacaagat gtttactatc gtgactataa gaatgttaat 2280 ccgaatcgag tcattgaaca gtttaatcaa cttgactggg taagttttta tgacgctgac 2340 aatcctgata ctcttattag tttgttcaat cttaatgtaa aaactcttta tgaatgctgt 2400 atacccctac gaaaattaaa aaatccaaaa ggcagaagta atccatggtt taacgcacaa 2460 atccagaaat ctatggtaga tcgagatcgt gcatacaaaa aatggaaaat tacaaaaagt 2520 tataacgatt ttcaaaactt caagcgactg agaaatgtta cgaatacttt agtaacaaaa 2580 gcaaaacgtg atttctttaa ctgtcaattg aatacagact tgccttcgaa gcaactgtgg 2640 aataaactga aggagctagg atttacgaca cgctctttga atgtggagaa taattttacc 2700 gcagatgaaa tcaactggtc actccacaaa catttctcaa cgtcgtctag tgatactgtt 2760 caatctgaat cattcgtttc aaatggcttt cgatttagta atatacaaga gtatgatgtt 2820 atcaatgcta tcttcgatat taattcaaat gctgttggac tggatgacat accgattaaa 2880 tttatcaaat tcgttctacc gctgttgctg catccaatca cgtatttata taattgtatc 2940 atcaccaaat cagtttttcc gagggcatgg aagttgtcaa aaataattcc aattaagaaa 3000 aagaacaact gtagctcgct ggacaatctc cgccctatta gcattctaag cgcgttatcc 3060 aaggcatttg aacgcattct caaaaaacaa atatgcgctt atattcatga gaatcattta 3120 ctttcaaaat gtcagtctgg ttataggcct ggccatagtg taaaaactgc catgataaaa 3180 atttgtgatg acattggact tgttctagat aggggaaata gtgtaataat gatattgtta 3240 gatttttcaa aagcatttga cactatatcg cactctttac tctgtaaaaa gctccaacac 3300 aattttaatt tttataagga cgctgtcgat ttgatttcgt cttatttaaa aaatcgtaag 3360 caagctgttt tttcaaatga tatgttgtca tcttttctag atgtttcatc aggtgtgcca 3420 caaggttctg tcctaggtcc cattttattc agcttatata ttaatgatct tcccccgatt 3480 atcaaacact gtcaagttca tttgtttgcg gacgacgtcc agctttattt gaactgtaac 3540 aaatccgatc taaatatggt cgcaaacaaa atcaatgagg atttggaatg tatccgagcg 3600 tggtcggaga gaaatttgct taaacttaat gcgaagaaaa ctaatgcttt attgatttct 3660 cgctacattc aggtcgaata tccagaaatt aaaataggta acgaaactat tggatttgta 3720 gatcgtgcaa ccagcttagg ttttactgta gggaataact tcaaatggga tagttatgtg 3780 ttaggtcagt gtggtaaaat ttacggttca ttgcgatctc tttatactaa agctagttta 3840 ttaagcacaa atgtaaagct gaaactattt aaaacgttca ttttacccta ttttatttca 3900 tgtgactttc tctttttcaa tgtgtcctct aacactcaag aacgattgaa agttgcatta 3960 aattcatgtg ttcgttttgt ttataatctc aagcgccaag atcatgttac acacctgcaa 4020 catacattgt taggtaactc tttttttagt ttctttaaaa cacgtatatg tcttttattg 4080 cataagataa ttataacgaa acagccctct tacctacact ctaagttaca accgtttcgt 4140 agtaatagaa taagaaactt tttaattcct gcacatctca caactttgta cgcaggttct 4200 tttttcgtaa gaggtgtcat tatatggaat tctctgccta atgtcattaa ggagttaaat 4260 tcaatgttta aatttaagaa gcaatgcaag gttcacttca aatagttagt tgttagtaaa 4320 ataatcgtta tagaaaagaa aaatggtaga aagtagtttg aatatttctg tgacgcactt 4380 catgcaaaaa aaaaacgttg tagcgttcaa aagatgtaaa tcttatgcta cgtgtgtaat 4440 aaataataat aataataata ataataataa taataa 4476 // ID OARP1 repbase; DNA; INV; 218 BP. XX AC M23276; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Onchocerca armillata repetitive DNA sequence, clone pOA5. XX KW OARP1; Repetitive element. XX OS Onchocerca armillata OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Onchocerca. XX RN [1] RP 1-218 RA Murray A.K., Post J.R., Crampton M.J., McCall J.P. RA and Kouyate B.; RT "Cloning and characterization of a species-specific repetitive RT DNA sequence from Onchocerca armillata."; RL Mol. Biochem. Parasitol 30(3), 209-215 (1988). XX DR GenBank; M23276; Positions 1 218. XX SQ Sequence 218 BP; 85 A; 22 C; 26 G; 85 T; 0 other; acataaagta tatttttttt tagagctgaa atataattta cgaacatgtt tcgtattata 60 taattcaaag gaattttatt aaaattgaat aattatgctc attcactgct ataatatgtg 120 atctgcatgt gctattccaa aaaaacataa agtataattt ttttttagag ctgaaatata 180 atttcgaaca tgtttcgaat aatataattc aaaggaat 218 // ID Ginger2-1_HM repbase; DNA; INV; 6676 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger2 DNA transposon from Hydra magnipapillata. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Ginger; KW Ginger2; integrase; Ginger2-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6676 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 45-bp long. XX FH Key Location/Qualifiers FT CDS join(391..780,1214..1363,1595..1738,1932..2456, FT 3599..4108,4405..4458,4599..4664,4870..4926, FT 5341..5640) FT /product="Ginger2-1_HM_1p" FT /translation="MYWYHKVYECFMSNFCNPMIGKIVKMNPECFEQQKYQ FT FEKWLNLNNKSNSSVLSECEYTSIIEYLKDKNDGKTGYITSRNIQRRIKSN FT KFKLIDYPPLGLKDILCAPTKCNTEVDFICYIYCIYLFSIIKIYLLIQNNL FT RESSPFGNYSRVVSTKDVFSAINIAHCQNGLHLGALKTYKKIIEGYANIAR FT KAVEIFISFCPTCNLNKRQLKKAPLQPIISTGFLQRLQIDLIAMESKPDKE FT FRYIGHVVDHFSKFHILFPMRNKTALETANNLKRFVLSYFGLPAILHSDNG FT SEFVNDVIKSLLLTWPGKTQIVNGSPRHSQSQGLVEQGNFTIERLISSREY FT DSGQSNWSEWLVEIQCKFNVNYCFFFNEKLMLKVNQFYCHYNDRIFLCNIF FT CYHFDKQFTTCPIRKNLCHETASNLEKNAKFMVKKYNKQKKFKVAVFKEGD FT HVSVAIPKSLRNSTDMLRLPCVVLHKSFGENPTYKLVTSHGVLEKRFNATH FT LMPYSSIVKVKSDANVSLSEAVLNEITSKVVFCRCKKACQTKHCRCFENGI FT PCVSRCHKGKTKNCKNNNPHFSVQSENDDSLTESVDTQMAIISSINEQKLF FT EFNRNNINEDEDSGNSQDVVAFKRDEVSNVQISNAMELSITFEEKDIFLDH FT FKKMLQNGDITHYLASDDAIKDARDFSFAYGDITRQDKMIDIHIDVGDELV FT LLIISVYKHLKLNTVIALQQYAARKYNLNKMV" XX SQ Sequence 6676 BP; 2316 A; 856 C; 979 G; 2525 T; 0 other; tgttccaccc taagtccgtt ttcaaccgaa gccgggcttg gggtgatgtt ccaccccgag 60 tgcggttttt cgttgaaact cggggtaggt cagttttact tggggtgagt gactttttaa 120 tgcgagttga tatttataat gtcatttcaa gcaaattttt aaactttaaa atgcaaacac 180 tcaagagcga atgatttttt gaaaatattt tatgatatat tatataattt attatatttt 240 aacattagat aagaataaaa aaaaataaaa aattagtgtt tcgtattagt atataaaaag 300 ttgtatatta ttgtattgta taataatgta tattagtatt gtaatatatt acaaatagta 360 tttgtataat aatacaaacg tatttgaaat atgtattggt atcataaagt ctatgaatgt 420 ttcatgtcta atttttgtaa tcctatgata ggaaaaattg ttaaaatgaa tcctgaatgt 480 tttgaacagc aaaaatatca gtttgaaaaa tggttaaatc ttaacaacaa gtcaaacagc 540 tctgttttat cagaatgtga atatacttcc attatcgaat atttgaaaga taaaaatgat 600 ggaaaaactg gatatattac ttcacggaat attcaacgaa gaatcaagtc aaacaagttt 660 aaactaattg attacccacc cttaggttta aaagatattt tatgtgctcc aactaaatgt 720 aatacagagg tagattttat ctgctatatt tactgcatat atttgttttc tatcatcaaa 780 taattatact gcattatata ctgaaaaaca aatttatgat attatcttat tataagatat 840 tatcataaat ttgataagat ataagattta tattgtgacg caaagtttca ccctgtgtac 900 acaatttgtg tatacatgca ttttgtgtat gctatttaac ttttatatgt cggaatataa 960 taaaagtaaa cagactatac agcacatttt atgcttttcg ttgtattaca gggtgaaact 1020 ttgttagatt ttagcttcga tagatagaaa ctttaataga ttttttaaca aaatttgtct 1080 gtgcattaaa tagtaagttt aaaggtaaat gtgtttacaa ttatactttt attaatatta 1140 tatcatgctg tctagtatat ttaaagtgtt ttaaaattgg ttttaaagtt ttaataagta 1200 aatatctaat taaatttatt tattaattca gaataatttg cgagaatcat cgccatttgg 1260 aaactacagt cgtgtagtga gtactaaaga tgttttttct gctattaata ttgcacattg 1320 ccagaatggt ctgcatcttg gggctttaaa aacttataag aaggtactga ttacttactg 1380 atatttttat ttttgtttta tgttgtgctt tttctattgc gatagtcatg gtatatttac 1440 tttttatatg ctgtggaaaa tatgatattg tactaatttt atctaccttc agataaatac 1500 taactaccta aaatattata gttttctaca ttttaaagat tcagttttca atgtaggggc 1560 ttgctgtatt tataaattaa cttttctttt atagattatt gagggttatg caaacattgc 1620 aagaaaagct gttgaaattt ttatcagttt ctgcccaact tgcaatttaa ataaaaggca 1680 gttaaaaaaa gcaccattac aaccaataat ttctactggt tttttgcaac gtcttcaagt 1740 ataatcagtt ttttacttta aatttctaac catgtttttt taaatagtat actcatttgt 1800 tttttaacca tgttttttta actaatatac tcagtttgtt gttaaatata actttgttgt 1860 ttattaaatg ttattctgct atttataaag tttgactatt tatattatat ttatatacta 1920 aatatactta gattgatttg attgcaatgg aatctaagcc agataaggaa ttcagatata 1980 ttggtcatgt tgttgatcat ttcagtaagt ttcatatttt gtttcctatg agaaataaaa 2040 cagctttaga aacagcaaac aatcttaaaa ggtttgtgct gagttacttt ggtttaccag 2100 caattttgca tagtgacaat ggtagtgaat ttgtaaatga tgttataaaa tcacttttgt 2160 taacatggcc tgggaaaacc caaattgtaa atggttcacc aagacattct caaagtcaag 2220 gtttagttga acagggcaac tttaccatag agcgattaat atcttctcgt gaatatgatt 2280 ctggtcaaag taactggtca gaatggttgg ttgaaattca atgtaagttt aatgtaaatt 2340 attgtttttt ttttaatgaa aagcttatgc tcaaagttaa tcagttttat tgccactaca 2400 atgatagaat ttttttatgt aacatatttt gttatcattt tgacaaacaa tttactgtaa 2460 taatatagga aaaaatattt tatttttttc ccctaaaaat ctaactaaat ttgtatagat 2520 gttatgaata ctacatatgt aaggagcata aaaacgacac cttatgagat tgtgtttggc 2580 caaaaactaa atggacaatt ccctttagag ggtaaagaag taaatgaaga tgatgttgca 2640 aacataattg agaatcctaa tcaaggtaaa tttaaagatg agttaaatgt caatcctatt 2700 tgagtgagtt taatgtcaat cctattaaaa aatcattttt tgcaatatag caggatgaac 2760 ataaatttct tttttttttt tatctttttt tgtcatttgt gtcacaaaat tgttgatttt 2820 ataataaaaa acctggatta ctttcttcat ttttattcat taatcaaata agttatttga 2880 aattccattt tgtgtgtgtg tgcaagtttc ttttataaaa tattggatgg tttattcatt 2940 ttgactataa atatctgaaa tttttgttat tgactagatg ttgctgatgg taggacaact 3000 gacactccag aatgcagcag aacttacaaa agcattgcaa aagcaaaatg tattatctat 3060 tatattaaaa attgctgtgc attttaatta gaccagcaat tgtacatttt gattagaaaa 3120 atgcctcatg ttatttattt tccaggtgat ggaggtagta acactgcaga atgcagtgga 3180 acttcaagtg atattgaagg taacttactg tttactttgt tattacttgt ggtcttaatt 3240 ataattgttg tcaaacaaaa atgtttgaca accattaaaa ctttcagttt tcatgtcttt 3300 aattgaattt tttgttttca aatacttaga aatattttgt tttcagtata caaaatattt 3360 tttgatcttc tttatttctt tatttttgta tcaaggtggc ttaagtgata acactctcac 3420 tgacggagaa gactgtggta ttgttgactt ttcgtaccac catgagtctg aagaagatac 3480 taaagatagg atgaaggtat aaaaaaatat tttataatgt taatgttttt cagttaaaaa 3540 aaatctttcg aattaatttt tttattcaag tttttatatc aagttgtgtt tttagataac 3600 ttgtccaatt cgcaagaacc tttgccatga aacagcaagt aatcttgaaa aaaatgcaaa 3660 attcatggtt aaaaagtata ataaacaaaa aaaatttaaa gttgcagttt tcaaagaagg 3720 tgatcatgtc agtgtggcta taccaaaatc attgcgtaac tctacagata tgctgcgttt 3780 accatgtgtt gttcttcata agtcgtttgg ggaaaatcca acctataagt tggtgacatc 3840 acatggagtt cttgaaaaaa gatttaatgc cacccactta atgccataca gcagtattgt 3900 caaagttaaa agtgatgcca atgttagctt aagtgaggcc gtcttaaatg aaatcactag 3960 taaagtagtc ttttgtcgat gtaaaaaagc ttgccaaacc aagcactgtc gatgctttga 4020 aaatggaata ccatgtgtct cacgatgtca taaaggaaaa acaaaaaatt gcaaaaacaa 4080 caacccccat ttttctgtac agagtgaaag taagtcaaat gaaaccttag accaattatt 4140 ttctgaagct ttgaattatt ttatttaaca ccattaatag aaaattagat tttactattt 4200 tgtatgaata gtatataaat catattggtt tttgaggctt ccaggcgtta agtacatata 4260 tcaactgagc gttttggagg attttgattt gttatagttt tgtggcagtt tctcctttga 4320 aataaaatgt aattaagtaa ttcattttga acttcaaatt cattgaacat tttataaaca 4380 aatactgctt tttaaatttt agggaatgat gatagtttaa ctgaatcagt agatactcaa 4440 atggcaataa tttccagtaa gtttaacctt aagtatagtt agtttaactt tcagtaattt 4500 ctaataagaa tgttacaaac caacaacaac tttcaaaata cttgtttttt caagttcttt 4560 gcaataaata agaatatcta ttttgtatat ttgtagcaat caatgagcaa aaattatttg 4620 agtttaatcg caacaatata aatgaagacg aagattcagg taacattata tcacagtata 4680 caaattactt actatttata aatcataact gcagcagatt tttaaattta atttataata 4740 aaaagaaagt ttcaccatgt tgcctcgcat aaagttcagt ataaattaaa ctgtcttcta 4800 atgcattctt agatattttg tttctgttga gtgtggcaaa agtaattgca taattaaatt 4860 tcaggcaatt cccaagatgt agtggcattt aaaagagatg aagtctcaaa tgtgcaaatt 4920 tcaaatagtg agttgttttt tatgttgttt tttaaaatat ctataatatt tattataata 4980 acaaaaatat aatagcaaaa aagtttataa acatgctttt tgttttttat tggaggcatt 5040 ttttgcttgc cttgtgtgtg tatggcaatg tgtttgtatc ttcttttttt atttaatttt 5100 tacttttatg taaatgacgt ttgttttaaa tacattcact ctcgatatct agatatctcc 5160 aacttttgag tatctccaac tttttttgaa tcccgttgaa cccttaatca tattatactc 5220 tcgataactc gaacatttgc tatctcgaac taatttcttg gtcccttgga ggttcgagat 5280 atggagaatt gactgtatta taattaaaaa aatgtattct tttaatttgt tgcattttta 5340 gcgatggaac tgtccattac atttgaagaa aaagacatat ttctggacca ctttaaaaaa 5400 atgcttcaaa atggtgatat tactcattat ctagcttcag acgatgccat caaagacgcg 5460 cgagattttt catttgcata tggtgacatt acacgtcaag ataagatgat agatatccac 5520 attgatgtcg gcgatgagtt ggttttgtta ataattagtg tttataagca tttgaagtta 5580 aatacagtaa ttgcccttca acaatatgct gcaagaaaat acaatttaaa caaaatggtt 5640 taatgtgata aactttcaat taaacaattt aatattgctt taacttgatg attgctcaag 5700 tttataaaag ttttttaatc tgcattcttt tgaattttac taaacttgtt aaaatgttat 5760 ctgttttcac ataatattat gtttccaaat gaatacttta ataacttgtt ttatgaaaaa 5820 actctatatt gattgcaaaa aaaccttatt ttgtaacatc tataaatata attttccata 5880 aaaacttttt cccgcaacaa aatgtttgtt ttgctatagg tttagttaca ttcaataaaa 5940 cctgtaaata ttatccaatt aatttagtga ccgtttatcg ttttttataa tacaatagta 6000 atatgctata aatatcgttt tcataatgaa ttatgattcc agtttaaatt tgttctattt 6060 gccagtagga aaaactttcg cttttgcgct ctctttaccg agcttattat taatatattg 6120 agcttaaaat ttccctcgtt ggtaaaaaaa aagaggaaat tcaccgagag tgccgtaaag 6180 aatgcaggca atttacggaa gctccaataa aaattagtca atttaaaaaa aataataaaa 6240 tggtactaaa gttgaacatt ttttgaatat aactttatta tcatactaac tcaactctac 6300 ataaaaagta tggacatatg tttgcccgaa acattgatat aagtttgtct aaaaaatata 6360 tttcattttg tttaaaatct ttacagaaaa agccataatt ttaaagaaga acatcgaaaa 6420 taaatatttg tacttttatt attttttata acgcaaatct taataatcaa aaatagttta 6480 atttatttta tattacgcta tcaatccgct aaaaaaaaaa aaatccgcta aaaaaaaatc 6540 agcatgttgc aaaaataata acttttaaaa ctttcgaaat ccgtacttgg gatatgcgtg 6600 tcgaactcgg ggtacctacc ccgagttcgg caccccaagc caggctttgc gtaaaaacca 6660 aactcggggt ggaaca 6676 // ID Helitron-1N1_DVir repbase; DNA; INV; 519 BP. XX AC . XX DT 31-MAR-2007 (Rel. 12.03, Created) DT 31-MAR-2007 (Rel. 12.03, Last updated, Version 1) XX DE Non-autonomous family of Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Helitron-1_DVir; Helitron-1N1_DVir. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-519 RA Kapitonov V.V. and Jurka J.; RT "Helitrons in fruit flies."; RL Repbase Reports 7(3), 129-129 (2007). XX DR [1] (Consensus) XX CC This is a consensus sequence of a family of non-autonomous CC Helitron transposons transposed in the Drosophila virilis genome CC a few million years ago (numerous copies are less than 5% CC divergent from the consensus sequence). Helitron-1N1_DVir is a CC deletion derivative of the autonomous Helitron-1_DVir. These CC transposons are usually inserted in the TTT|TTT target sites CC without the target site duplications (the insertion site is CC marked by "|"). Different families of Helitrons constitute ~5% of CC the D. virilis genome. XX SQ Sequence 519 BP; 147 A; 113 C; 108 G; 151 T; 0 other; ttataccctg aacccattaa aaatgggtat aagggtatat tgtatttgtg caaaatccaa 60 atgtatgtaa caggcagaag gaagcatctc cgaccccata aagtatatat attcttgatc 120 agcatcaata gccgagtcga tctagccatg tccgtctgtc tgtccgtccg tccgtccgtc 180 cgtccgtccg tccgtccgtc cgtccgtccg tatgtatgaa cgcaaggatc tcagaaccta 240 taagagctag agacttgaaa ttttagatgt aggtgctcct agtgcctgcg cagatcgagt 300 ttgtttccga taatcgataa cttactccgt ttccaagcaa tcgataaaaa tcgatatcga 360 tatcctgttt tttgggcaat tttggtaaat aataagagct agagtcacca aacttgacat 420 atagcttcta aaatagaata tatatatgca tttgatgttg gaagaagagg gttcagggta 480 tcccctagtc gggagctccc gactagaacc tcttacttg 519 // ID L1-4_CQ repbase; DNA; INV; 4520 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4520 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 134-134 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 157..1248 FT /product="L1-4_CQ_1p" FT /translation="MNSQNNRENTFRVDFSNLPKKPSVKEVHQFLARDLGL FT PREKTKRVQINHVQNCVFVKTADLETAEQIVEDHDGKHVREVNKKKYTIRI FT TMEDGCTNVKLHDLTEDVSDQQIKRFMSQYGDVMSVTELLWSDDLDYHGIS FT SGIRSVKMVLKRPIKSYVTIQGQCAYVKYAGQQSTCRHCGEYVHIGIPCVQ FT NKKLLVQRVSVNERLKPGAKESFASVLKSGSTDTRQDDNISSSDDEERDSA FT AMSEDDDEYTVVRYGKGKHPKLPKPIKSKPRQETIKVNPIATPGGTPLDAV FT VEQETDKEESVFKTPLLLPGKHKRNDGNDTDTSTASTASNGKRGPGRPRKT FT KPTDVDTQPQASPSADEPMQP" FT CDS 1255..4446 FT /product="L1-4_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MDFLTYNIASININTISNHTKLNALNNFVRTLNLDIV FT FLQEVENDQLAIPGFNLVCNVDQERRGTAIALKQHIRFSHVERSLDGRLIC FT LRVNETTTLCNIYAPSGTQHRAYRENFFNGTLAHYLRQATEYVILGGDFNS FT VIHTKDATGDNNYSLALKNATQQLQLLDAWDTLKPNRVEFTYVTYNSASRI FT DRFYVSSNLRPQLRDADVHVCSFTNHKALTTRICLPNLGREPGRGFWSIRP FT HILSADNIEEFQEKWTYWTRHRRNFTSWIQWWVSYVKPRIRSFFRWKSREK FT YMEFHQQHQELYAQLRRAYDAYYQDRTVLSTINRIKSEMLLLQRRFSDTFY FT RSNETRVAGEPISLFQLGERQRKRTTIEALDVEENTLNSSAEIEQHVFGYF FT RQLYAEEQPEIADEFACDRAVSAACPSNASCMDDITTDEIFQAIKSSAKNK FT SPGSDGIPLEFYAKTFDIIHRHLNLILNEALRGNIPPEFVDGTIVLVKKKG FT AGSTIKSYRPISLLNTDYKLLSRILKCRLDRVMRDNQILTTSQKRSNGDKN FT IFQATLSLKDKVAQLKERRLFGKLVSFDLDHAFDRVDRGFLMRTMRALGIN FT PRLVELLDKIGRLSTSRVLVNGHLSPSFPIGRSVRQGDPLSMHLFVIYLHP FT LIKRLEQICVGSNDLVVAYADDVSIITTCAERIERVKEQFENFGRSAGAKL FT NVRKTISIDVGYINNNNRLAVPWLQTADKVKILGIVFVNSVRLMGKLNWVP FT LVGMLTRQLWLHRMRTLTLQQKVVLLNTFITSKLWYVASVLAITKADVARL FT TRLMGSFLWAGQCIRVPMQQLALPVELGGLNLHLPAFKCQALLVNRHLREI FT ENLPFYNSFVSTTRNPPNLRIVPTNCPCLKTVCSELPYLPSALQANPSANL FT LHAHYLNKIDKPKVVLENPTANWNRIWRNIAAKHLTSFERCHYYLLVNRKL FT SNQRLLHRMQRADSDMCPNCNNEPEDIPHKISTCPRVAAAWTVLQRRLRNI FT AQNRNISLTHLLQPTLLAIRRSVKVKVLKTFIQFVIFVSKDNNVIDINELE FT FHLDTEV" XX SQ Sequence 4520 BP; 1271 A; 1201 C; 1050 G; 995 T; 3 other; tcagttagcg tgcaaccacc gtggcatgca gacgcatttc atactgtgtg gaacttgaga 60 ctatcgaaat tcgcattttc tacacgcttt cgtttttttc acgaatccgc cactcggtgg 120 ccgtcgcgtt kcggtccgcg gttcttgtga ttcaccatga attcacaaaa taatcgcgaa 180 aacacgtttc gagtcgattt ctcgaacctt ccgaagaagc ccagcgtcaa ggaagtacac 240 cagtttctcg cgcgtgacct tggccttcct cgggagaaga caaagagagt gcaaatcaac 300 catgtgcaaa actgtgtgtt tgtgaaaact gccgaccttg aaacggctga acaaattgtc 360 gaagaccacg acggcaaaca tgtgcgcgaa gtgaataaga aaaagtacac gatccgcatc 420 acgatggaag atggatgcac caatgtgaaa ctacacgact tgacggagga tgtctccgac 480 cagcaaatca agcgtttcat gtcccagtac ggggatgtga tgtcggtcac ggagctgctg 540 tggagtgatg atctcgatta tcacggcata tcatcgggta ttcgctcggt gaaaatggtc 600 ctgaaacgcc cgattaagtc gtacgtgaca atccagggtc aatgtgccta cgtaaaatat 660 gcgggccaac aatccacgtg tcgccactgt ggcgagtacg tgcacattgg cattccatgt 720 gtacaaaaca aaaagctcct mgtgcaacgg gtgagcgtga acgagcgcct aaaaccaggt 780 gcgaaggaat ctttcgccag cgtgctgaaa tctggttcaa ccgacaccag acaggatgac 840 aacatatctt ccagcgatga cgaagaacgc gactcagcag cgatgtccga ggacgacgac 900 gaatacactg tggttcgtta cgggaaaggc aaacatccaa agttgcctaa accgataaag 960 tctaaacccc ggcaagaaac catcaaggtc aatcccattg ctactcccgg tgggacacct 1020 ctcgatgcgg ttgttgaaca ggagacagac aaggaagaat cggttttcaa aacgccacta 1080 ctactgccag gaaaacacaa acgaaacgat ggaaacgaca ctgatacttc tacagcttcc 1140 acggccagca atggcaagcg gggtccgggt cgtccacgga aaaccaaacc aaccgacgtt 1200 gatacacaac cacaagcatc gcccagcgct gacgagccca tgcaacccta aaaaatggat 1260 tttttgacct acaatatagc ttctataaac atcaacacaa tctcaaatca caccaaactc 1320 aacgcactca acaactttgt cagaaccctc aaccttgaca tcgtgtttct acaagaagtg 1380 gagaacgatc agctggctat tcccggattc aacctggtgt gcaacgtaga tcaagagagg 1440 agaggaacag cgatagcgct gaaacagcac attagattct cacacgtgga gagaagccta 1500 gacggtcggc taatctgtct tcgggtcaac gaaacgacta ctctctgtaa catttacgct 1560 ccgtcgggca ctcagcatcg agcataccga gagaatttct ttaatgggac ccttgctcac 1620 taccttcggc aagctacgga gtatgtcatc cttggcggtg atttcaactc cgtaatacac 1680 acaaaggacg ctacggggga taacaactac agcctagcgc tgaaaaatgc tacacaacag 1740 ctgcagctgt tggacgcctg ggacactttg aagccgaatc gggttgagtt cacctacgtg 1800 acgtacaact cggcctccag aattgaccgt ttctatgtga gctctaacct ccgaccccag 1860 ctcagagatg ccgatgttca tgtctgctcg ttcaccaacc acaaagccct aaccacccgg 1920 atctgcttgc cgaatctcgg aagggagcca gggagagggt tctggtcaat ccgtccccac 1980 atcttgtctg ctgacaacat cgaagaattc caggaaaagt ggacctactg gacgcgacac 2040 agacggaact tcacctcctg gatccagtgg tgggtctcct acgttaagcc aaggattcgt 2100 agcttcttcc ggtggaagtc tcgggagaaa tacatggagt ttcatcaaca acatcaagag 2160 ctctatgcgc agctccggag agcctacgac gcatactatc aagatcgaac cgtgctctcc 2220 acaatcaacc ggatcaagag cgaaatgctg cttctccaga gacgcttttc ggacaccttt 2280 taccgcagca atgaaacacg tgttgctgga gagccgatct ctctgttcca gctgggagag 2340 aggcaacgaa aacgtacgac gatcgaggcg ctggatgtgg aagaaaacac tctcaacagc 2400 tctgcagaaa ttgagcagca cgtgttcggc tattttcggc agctgtatgc cgaggagcag 2460 ccggagatcg cagacgaatt cgcatgtgac agggctgtat cagctgcttg cccatcaaat 2520 gcgtcctgta tggatgatat cacgaccgac gagatattcc aagctatcaa gtcaagcgca 2580 aagaacaagt cccctggaag tgacggtatt ccacttgagt tctatgcaaa aaccttcgat 2640 atcatccaca ggcacttgaa cctcatcctg aatgaggccc tgcgcggcaa catcccgccg 2700 gagtttgtcg atggtacaat agtactcgtg aaaaagaagg gtgccggttc aaccattaaa 2760 tcctatcgcc ctatctcgct ccttaacacg gattacaagc ttctctcacg aatactaaag 2820 tgcaggctgg atcgtgtgat gcgagacaac cagatcctca ccacatctca aaagcgctcc 2880 aacggcgaca aaaacatctt tcaagctaca ctctcactca aggacaaggt cgcgcagctg 2940 aaggagagaa gactgttcgg gaagctcgtc tcgttcgatc tcgatcatgc ctttgatcgg 3000 gtcgatagag gtttcctgat gagaactatg cgagcgcttg gtatcaaccc gcgcttggtt 3060 gaactgctgg acaaaatcgg ccggctgtcc acttcccgtg ttctggtgaa cggtcatctg 3120 tcaccgtcat tcccgatcgg acgctccgtc cgtcaagggg atcctctctc gatgcacctc 3180 tttgtsatct atctccaccc cctcatcaag cgactcgagc aaatctgcgt tggctccaac 3240 gacttggttg ttgcctatgc cgacgacgta tcgatcatca ccacgtgtgc cgagagaatc 3300 gagagagtga aggagcagtt cgagaatttc gggcgatcag ctggtgccaa gctcaatgtt 3360 cgaaagacga tctccatcga cgtaggctac atcaacaaca acaatcgact cgctgtgccc 3420 tggctgcaga cagctgacaa ggtcaagatt ctcggcattg tgttcgtgaa ttcagtgcga 3480 ctgatgggca aactaaactg ggtcccgctg gtaggcatgc tcactcggca gctctggcta 3540 caccgaatgc gtacactaac acttcaacaa aaagtcgtac tgctgaacac gttcatcact 3600 tctaaactat ggtacgtagc atcggtgtta gcaatcacga aggcagatgt tgccagactc 3660 acacggctaa tgggatcatt tctgtgggcc ggccagtgca ttagagtgcc gatgcagcag 3720 ctcgcgctgc cagttgagtt ggggggtttg aatctgcatc tgcctgcgtt caaatgtcaa 3780 gctctcctcg tcaacagaca tctacgcgag atcgagaact tacccttcta caactctttt 3840 gtgagtacta cacggaatcc cccaaactta cggatagttc caaccaactg tccgtgtctg 3900 aaaacagtct gcagcgagct cccttacctc cccagtgcac tccaagcgaa tccctcggcc 3960 aatcttctcc acgctcacta cctgaacaag atcgacaagc cgaaagtggt gctggaaaac 4020 ccaacagcaa actggaatcg aatctggcga aacattgcgg ccaaacacct gacatcgttc 4080 gagcgctgtc attactacct gctcgtcaac cggaagctgt ccaaccagcg gctgttgcac 4140 aggatgcaac gggccgactc tgacatgtgt cccaactgca acaacgaacc agaggacatc 4200 ccacacaaaa tctcaacatg cccgcgcgtt gcagcggcgt ggaccgtgct gcagagaaga 4260 ctaaggaaca tcgcccaaaa cagaaacatc tctctcaccc atctattgca accaacactt 4320 cttgcaatta gacgatcagt taaagttaaa gtgcttaaaa cgttcattca gtttgtcatc 4380 tttgttagca aggataataa tgtaattgac attaatgaat tagaatttca cttagatact 4440 gaagtttaag ctatttggtc gagtagtttt taagcaatca aaaccaataa aacaatattt 4500 tataaaaaaa aaaaaaaaaa 4520 // ID Gypsy-37_DPu-LTR repbase; DNA; INV; 368 BP. XX AC scaffold_112; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_DP_; KW Gypsy-37_DPu-I; Gypsy-37_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_112; Positions 285222 285589. XX SQ Sequence 368 BP; 101 A; 103 C; 64 G; 100 T; 0 other; tgtaatgacc tgtgtgtaag attcggccta ctgcttgtag tagagagcaa tctactacgc 60 agtatcccaa ggtcaatcta ctacgccgta cttcaaggtc actgtactac gcagtacctt 120 gatgcgtttc tctattgcac catcacacct gttgactatg caactacatc cgggattgcg 180 ccacactctg cagtgatggt gcaatcccca tcaccttgat aaccgcctat aaaagcccga 240 cactctgttc atccagcaga gtcgtacccg gcattgtaac cccacaacta cgttaataca 300 aatacttgca aagaacactg tctcctcctc gttatttagt tatcaagaac tattgtgaaa 360 tcatcaca 368 // ID hAT-20_HM repbase; DNA; INV; 3402 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-20_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3402 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2009-2009 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 900..3071 FT /product="hAT-20_HM_1p" FT /translation="MKRQNMLTTYFSKKIREETSVSNDSRSTITEVEAVSR FT TLQEEILEIPSSQNEDNNNLPSCWNAKQLKEFKIKYDGLFVSNQKLGCEYC FT LKMESKLISQEWKKCNVTTSGKDKIVQQASLRKKMSEHFVSKAHKLAVEHA FT KILKKQTFTACIDKMNEKFISSTTRIFNTVYSLAKRNXPFSDVESVIELQV FT KNGLDMGTGLHSRHSASKIVEHIAQETKKELFISIIKNNLKICTVIDEAST FT ISSKSTLIVFIKVESSTVASSEISPIIFVDIVELDAQDANTIFQSLLGVLL FT AVGFDTNYLKLNLIGFCSDGASVMLGCKSGVSVRIKEMFPNVIIWHCLNHR FT LQLVLDDSIKDIKQVNHFKVFMDKIYCIFHQSNKNQKELNDISQELSLDII FT KIGRVLGPRWAACSLRSALAVWRSYPALYTLFSSNQKMSGMAKRLQNKCFL FT VDLALMIDILQECSLLSNALQARNINVARGDQLIRRTINAFQMLEDRRGHY FT EKKVDEVINSEAFKNIIFVENRIYGNLPRDSLIDSIIKNMKLRLMDCSHLG FT SNSNKKNIDNSIIYELANLLEPSTWKIEDIVVPWIEAENNFDKFRKFFNFD FT IDKNDFRDYVENVINNGHQQISKIAENIQKAKNMVNTVAVSSAEAERGFSL FT MNIIITDIRARLTVQTVSNLMTLNLVGRPLENWNAVPYVKTWLRNHNSADD FT IRVKKSKPKEYTENQLAIWKYFK*" XX SQ Sequence 3402 BP; 1229 A; 485 C; 555 G; 1132 T; 1 other; cagggatgct ttcctaccgg ttcgtgcgaa ccgttagatg taaattctat ggttcgtgcg 60 aaccggctat taaaataagc gaaaataaaa tttttttact ttggcaaaaa aaaattattt 120 aaataaaatt tatttaccat ttggtgcaat tttatataaa caatttgaat attttatcta 180 gttttcattg cgattctaat tttttacttt atagaaatct ttcagttaat tagtcctccc 240 ctcccctttt tatttaatat gtaaattacc agtgttttca ttggttgatt ttaaagttaa 300 cgcgcgtttt ttccggaccg ttagtaacag tttagaaatg ctaacattaa catgcacgtg 360 tagcgtcaaa aatatgtatt cgtaacagtt gctgaggttt ttaacattgt ttaaatatcc 420 attgtatttt ttctagatgc attactactt taattaaaat ttggaattac ttaactaaat 480 tatttttatt tataatagct tgattatttg ttaatcaaac aataaattat aaatgtaaca 540 aactataaat gtaacttaaa agagtgttat aaaagtaaca acactgtagt gttattactt 600 tggtagtcat aaatttattg ttttgttttt cataacattt tcactaacaa attaaagttt 660 attcgcctta attccgtaaa tttcgtggtt ttccataaac tgaaactttg aagaccagat 720 aatttcacag atatcgcaaa tcgcgatgcc ttaattcaat ccttgagtgt caaattgtta 780 acatgtattt ctgataattt ttctttaata aaaatatagc tttgtgttaa agttatttgg 840 ttactggtct gatattcacc tttttatgaa tttttaaata cctattttat tttttgtaga 900 tgaagagaca aaatatgtta accacatatt tctcgaaaaa aataagagag gaaacatcag 960 taagtaatga tagcaggtcg actataacag aggtagaagc agtttcaaga actttacagg 1020 aagaaatact agagattcct agttcccaaa atgaggacaa taataattta ccaagctgtt 1080 ggaatgcaaa acagttgaag gagtttaaaa ttaaatatga tgggctattt gtcagtaatc 1140 agaaattggg atgtgagtat tgtttaaaaa tggaatcaaa actaatatct caggaatgga 1200 aaaaatgtaa cgttacgaca tcggggaaag acaaaatagt gcaacaagca tctttaagaa 1260 aaaaaatgag tgaacatttt gtatcaaaag cacacaaact agctgttgaa catgccaaga 1320 tacttaaaaa acaaactttt acagcatgca tcgataagat gaacgaaaaa tttattagtt 1380 ctactacaag aatcttcaac actgtttaca gtctagcgaa aagaaatmgt ccgttttcag 1440 acgtagaaag tgtaatagag cttcaagtaa aaaacgggtt ggacatggga accggacttc 1500 attctcgaca tagtgcaagt aagatagtag agcatatcgc gcaagaaact aaaaaagaat 1560 tgtttatatc aattattaag aataatttaa aaatttgtac cgttattgac gaagcttcca 1620 ccatttccag taaatcaacg cttatagttt tcatcaaagt agaatcttct acagtagcat 1680 cttctgaaat atccccaata atattcgtag atattgtgga acttgacgca caagatgcga 1740 atactatttt tcaaagtttg ttgggtgtat tacttgctgt tggatttgat acaaattatt 1800 taaaactgaa tttaatagga ttctgttctg acggtgctag cgtaatgctt ggttgtaaat 1860 ctggagtcag cgtacgaata aaagagatgt ttccaaacgt tattatttgg cactgtctca 1920 atcaccgcct acaacttgtt ttagatgatt ccataaaaga tatcaaacag gttaatcatt 1980 ttaaagtttt catggacaaa atatattgta tttttcatca atctaataaa aaccaaaaag 2040 aactgaacga tatttcccag gagcttagtt tggatataat taagattgga agagttttag 2100 gaccaagatg ggctgcgtgc agcctaagat ctgcacttgc tgtatggcgg agctaccctg 2160 ctttgtatac gttgttctct tctaaccaaa aaatgtctgg aatggcaaaa cgactacaaa 2220 acaaatgttt tctagtagat ttagctctaa tgattgacat cctacaggag tgctctctac 2280 tatctaatgc ccttcaagct agaaatataa acgttgccag gggagatcaa ctgattagaa 2340 gaacaattaa cgcttttcaa atgctcgaag atcgtcgcgg tcattacgaa aaaaaagtgg 2400 atgaagttat aaactctgaa gcattcaaaa acattatctt cgtcgaaaac agaatttatg 2460 gaaatcttcc cagagattcc cttattgaca gcatcataaa aaatatgaaa ttaagattga 2520 tggattgttc tcatctcgga tcaaatagta acaaaaaaaa tattgataat agcattattt 2580 atgaactagc caatttgcta gagccgagca catggaaaat tgaagatatt gtagttccgt 2640 ggatagaggc cgaaaataat tttgataaat ttcgaaaatt ttttaatttt gatatcgata 2700 agaatgattt ccgtgattat gtagaaaatg taataaacaa tggccatcaa caaattagta 2760 aaattgcaga aaacattcaa aaagcgaaaa atatggttaa tacggttgcc gttagcagtg 2820 ctgaagcaga acgaggattt tcattgatga acatcataat tactgatata agagccagat 2880 taacagtgca gacagtatca aatttgatga cattaaactt agttggtcgt ccattggaaa 2940 attggaacgc agttccctat gtaaaaactt ggttgaggaa ccataactcg gcagatgata 3000 taagggttaa aaaatctaaa ccgaaagaat atactgaaaa tcaattagct atttggaaat 3060 attttaaata atataattac tttataacaa taatatttac taatatatga aatttcacaa 3120 ttttttgtta tgttcagttt ctttttatgt aaattttgta tctcttgaaa tgaatgaata 3180 aaataaaagt gattttcacc tcgttgaatt atttttttat tctcttttat cctattacat 3240 tattgttgca aagaatatga acgatagtta aatattcttg acttagtctg ccactctgac 3300 aaccataata ttttgatgga aatattaata ttaaaaaaat aaaaaaattt gtaagaaaat 3360 atcactgcga accggttgat aaattttggg aaagcatccc tg 3402 // ID Mariner-13_HM repbase; DNA; INV; 3552 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3552 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 230-230 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1060..2649 FT /product="Mariner-13_HM_1p" FT /translation="MLASLPKKYGYSFKNGKPSNKWWRLLNNRHNKKVTLR FT QPESTASVRHQCMNLIKVANYFTALKEVIKDCQPEVIWNMDETGLQLDFKA FT HKIVAAKGTKYLHMRTSGNREMITLIVCVNAAGRALPPHVIPKGKTVKSLQ FT SFQVLDAPEGTNWSPSESGWTKQGIAKLWFELTFLKNIGPQRPQVLILDGH FT DSHNFVELIDMAIQNRIEIVELPAHTSNWLQPCDRTVFKPFKVSYNKAAQD FT MMSNHPGVIINKANFTGLLTKAWNKSMTKANIESGFKSCGIFPYNPSIIPA FT EAYLPNYVYSEDRLMTNLFDQIKTVDNTTTVKQIKQTTDSTKVLLPDIELE FT QHLITNPRFAVANDHDYLAEPITVVEHRSDDLVELNMKTIELKAASNKATS FT KFQDSFRIMSYSEIEQPIDQHEVDFTVSTLLASDFAKTRSFAPNFPFKNSS FT YPGDGDNDILPYSYPNFTKSKKAKNADKFFVLTSKEAYESQLKQKQKQAQN FT EKEKEMRKLKRLQNLKIKEATVLSKDNPRKKPLL" XX SQ Sequence 3552 BP; 1298 A; 500 C; 546 G; 1206 T; 2 other; agtgaccaaa ttccagccac attttaaatt tattatttct gtcttgtttt ctgctttatg 60 tttgtaaatg ctactttttt gagactggta tgaattcagc caataaaaag aaagcattgt 120 tatagttggc ggctgtttat aatagaagta acatcataaa aaatatttat ctcaaaaaag 180 tactacttta tatttaaaat atcatgtaag catccagttt gaagcttata aatattaaat 240 atttattact taacgaaatg ttaattattt taaactataa atatatttct taaaaagaca 300 attttagctt tataattatg ttgagtcatt actatagatt gtatttctta attaattgca 360 cctggtagaa agaccaaaat ccggccaggt ataaacctta ttccggccaa ggccgaaatt 420 aggttcacga gtgtctaaaa gtaactcata cttgtatgat atttatatta ttagatttta 480 ttcttttaat ataagtactt tatgttacat ttatgtccta aaagttatca atcttaaata 540 tttgcattta tttagttaaa tatttatata atatttttaa aatttgagga tgatgactta 600 aaaataaaca aaagtctaaa taattgaaag atattatatt gtttcatgta aattattact 660 tataaatgaa attatattaa aaaataatct ttaattgaaa attttaataa tttataatat 720 aaagttttca ttttagaatg gctgataata aaaataaaat ttataataag aaacaatgtc 780 atccaactcg aggtaaatat atgaaatata aagaggaagt gttacttgaa gctataaaca 840 gtgttattag ttcacctaaa atgtctgtta gagctgctgc taaatttttt aaagtacccc 900 gagcaacttt gcaragcagg ataaatggaa aagttgagat tggtgcaaaa gcaggaagaa 960 aaagtttaat gccgcttgat cttgaaaaaa agttactcga ttttgcagac aacagagcta 1020 ggatgggcat tggttttgga aaagaacaat tcttggtata tgctggcaag cttgccaaaa 1080 aaatatggat attcttttaa gaatggcaaa ccatctaata aatggtggcg actacttaac 1140 aatcgacata acaaaaaagt aactcttcga caaccagaat caacagcatc agttcgtcat 1200 caatgcatga atctgataaa ggtggcaaat tattttactg cacttaaaga agtcattaaa 1260 gactgtcagc ctgaagtaat atggaacatg gatgagacag gactgcagct agactttaag 1320 gcgcacaaaa tagttgctgc aaaaggtaca aaatatttac atatgagaac aagtggaaat 1380 cgagaaatga ttactttaat tgtttgtgtg aatgcagcag gaagagcctt accacctcat 1440 gtgattccaa aaggaaagac agtaaaatcc cttcaaagtt ttcaagttct agatgcccct 1500 gaaggaacca actggagtcc aagtgaatct ggatggacta aacaagggat tgcaaaacta 1560 tggtttgagc ttacattttt aaaaaacatt ggaccacaac gtccacaagt tttaatttta 1620 gatggccacg attcccataa tttcgtagag ttaatagata tggccataca aaaccgaatt 1680 gaaattgttg aattacctgc tcatacaagc aattggcttc aaccttgtga tcgaactgta 1740 tttaagccat tcaaagtttc atataataaa gcagcccagg atatgatgtc taatcaccct 1800 ggggttataa taaacaaggc caattttaca ggacttttaa ccaaagcctg gaataaaagt 1860 atgacaaaag ctaatattga atctggcttt aaatcatgtg gaatctttcc atataatcct 1920 tcaattattc cagctgaagc ttacttgcca aactatgtgt attctgaaga caggttaatg 1980 actaatctct ttgatcaaat taaaacagtt gataatacaa caacagttaa gcagattaag 2040 caaacaacag acagcacaaa agtactttta ccagatattg agttggaaca acatctaatt 2100 accaacccaa gatttgctgt agcaaatgat cacgactatt tagctgaacc aataaccgta 2160 gtagaacata ggagtgatga tttagtggaa ctaaatatga aaactataga actaaaagct 2220 gcaagtaata aagctactag taagtttcaa gacagcttta ggattatgtc ttactctgaa 2280 attgaacagc ctattgatca gcatgaagtt gattttacag tttcaactct tttagcttct 2340 gattttgcaa aaactagatc ttttgcccca aatttcccct ttaaaaattc atcttatcca 2400 ggagatggag ataatgacat tttgccatat tcttatccaa attttacaaa gagcaaaaaa 2460 gctaagaatg cagacaagtt ttttgtgctt acttctaagg aagcatatga gtcacagtta 2520 aagcagaagc aaaaacaagc tcaaaatgaa aaggaaaaag aaatgcgtaa attgaaaaga 2580 ctacaaaact taaaaataaa ggaagcaacg gtattgtcaa aagataatcc tagaaaaaaa 2640 ccgttgctat aaaaatgaac caataaaagt ataaaaaagc aatgtgtaat ttgatcatgt 2700 tgttatgaat ttttgttctt agtgtttaaa aaagtctttt atagagttca ctgaattctt 2760 tttttttttt aatgtgaata ttatattcta tgcaaatctt ttaaaatagt aattcaaaat 2820 attacattga cattaattat cctgatttac ataataattc ttatttgaaa tattaaacct 2880 agttttgtgt ttggttttta tttgcattaa aaaatatttc cttttatttt ttaccaacat 2940 gtatacgttt tttttttctt gcccatggac aaggtggtgc aactcagttc ctgaaaactt 3000 atttttttta gatttaggca tccttactaa gttttggaaa gtttatttgt tgatgaatgc 3060 ttagtattaa gttaagcaat cattttctaa ctgatgttta agatcacgtt cattttaact 3120 ttaaactaaa caaactttaa atccaatttt gcaaaaactg tttgacttat tgtttagctg 3180 tcaaccggct aaaattgtag gatcaaaaaa attattggtt caatatttag taattcaatt 3240 aattttaaaa tattttaaat gccaacccat aaaagtatta gtaaaaagct tgttatttat 3300 attggctgta attaggttca ttccgcatac gcctggctga aatttggtca tagtggctgt 3360 attttggtca aatttatata ctttttgtaa tcgttgtttt wtcaaacatt ttttatttat 3420 aataattgct gtaattatat cctaaatgct gaaagattta tctttctaga aatgtataat 3480 tttcatatag acctctttgt taatatgtct gtagaaacaa ttttgtaaag atgcggctgg 3540 aatttggtca ct 3552 // ID Copia-14_DPu-I repbase; DNA; INV; 4237 BP. XX AC scaffold_26; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_DPu_; KW Copia-14_DPu-LTR; Copia-14_DPu-I. XX NM Copia-14_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4237 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 691-691 (2010). XX DR Genome; scaffold_26; Positions 1060332 1064568. XX CC 'TATAC' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 784..1956 FT /product="Copia-14_DPu-I_1p" FT /translation="MTTHVRKISCIADKLRELGQPLQEMQLVTKALATLPE FT KFRIVRSVWANVALDERTMDNLLQRLRSEENVLKSYERPDDSSQAFAARGH FT TRGRGRGNRRGSRIYHGVSNGFVNQHSSRDDVRCGYCPHYGHKTQDCRKKK FT RDEAEQKANKDQALLSSSSSNPNSAFAFFVDSGATQHMSDQKILFEDFTQI FT QTGTWSIAGIGDTHLQVLGKGHITVTVQVNNQSSIRVIKDVLYVPGLGTNL FT FSVAAATNSGLEARFSKDALSFNRGDELVLTGKRSGNTLYLLDLKPHTASI FT TSTRRIDSAFRAGLRASLLVWHQRLGHMNHQTILKMVSEELISGLHLTNEK FT IPKTLCAACELFHRQPLKIGRTRATRIGELVHSDVEGPMPCPSIGQAR" FT CDS 1913..4213 FT /product="Copia-14_DPu-I_2p" FT /translation="MLKVRCHVQVSAKRANMPCLQMISLVGELYITPFEAW FT NGRKPDVSNIRIFGSRAFVRCPNVKKLEARCLGALVGISNTQKAFRIYISS FT PPRIIVSHDVKIDETVMYKMPKDNTALKWIETTKETPTPIADSMDDTTTNL FT TTETDPVINDAVDLPISTFDLAPASILATQVPDHEEEQEVIVMEDNENNSE FT VPIEVDDPAESKIGIRRSSRLPCYSEKYLAYRQSLGLQAVPFEKSEQPKIC FT PAEPSSYVEATTCPDADHWIPAIFDEYESLIKNSTWTVCELPPDRKPIKGK FT WVFTFKPGYKEVAPRFKARFVAKGYSQIYGLDYVDTFSPVEKPYSLRTILA FT IAAAKDLEMIQLDIKTAFLNGDLQEEIYMKQPEGFVIPGKENQVCRLLKSL FT YGLKKASRAWNQKFHAFIMKFGLTQSKADPCVSFQHQREGDIDEMLIILII FT YVDDGIILSNRKQTLTDLLDYLKMAFEIRSLPAHRFIGIDIIRDRPKRMVY FT IYQPEYVVKIAEKFNMSTCTPLTIPANPCCRLSPEMSPQNKEEEDEMKAVP FT YREAVGSLMHIMVMTRPDIAYAVGQVAQYAQKPGKQHWRAVKRILAYLIKT FT KNFGLHFGNTSTSLIGFCDADYAGDLQTRRSTSGFVFLHLGGHVSWASRRQ FT SCVALSTTEAEFVAAADATKEAVWFQQLLSELGIDAPSTTLYCDNQSAIAL FT VNNPTFHQRTKHIDVRLFYIRELQESKKFNIVYLNTEQQIADILTKPLAVP FT RFEKLRDALGVILVPV" XX SQ Sequence 4237 BP; 1313 A; 1004 C; 844 G; 1076 T; 0 other; ggttatgggc ccagattcac gtgttctgta acgacacgaa atggccaacc aaatgattga 60 aaatcacctg aaagatgtta atcatgtacc caagtttgac ggcacaaact ttcgtgaatg 120 gagttttgaa ctgagactta tactataaca gttgggtcta cttggactcg ttgaagcaag 180 agttggacac actctgcacg atgaggtaac ataattctca tatcactaca tgtttaaatc 240 acattcatat gtattccgtt atgacatgtg acacacatgt acaatctttc atgtacgatt 300 cacacacatg ctcacatgta agaatctaac tcatgtacga ttcacacaca tactcacatg 360 tatgaatcta acgcatgtcc tcatacacat gcccacatga atcaaatttc atttcttgta 420 tgttatttat acacacatgc ctaaacatct atatctctta cagataagag acgatccaga 480 taatcctgaa ctaatcacga atgctgctca aattgatgca tggattctga gagatgtcac 540 gtgcaggaat tatatttttg ccactctcac gaaaccaatg aaggaaggtc tctactcttg 600 tgatacggca gcagccatgt ggacgaaatt ggattcacag tatcgactta gagccgcaga 660 gaacttacat ctcctatggc aagaattcta tgacttcagt catcaacctg gtatgctcaa 720 cttcataatg catcattaaa gaacatagaa cttattctca ttctcccact tatagaggat 780 gacatgacaa ctcacgttcg aaaaatttca tgcattgctg acaaactaag agaacttgga 840 caacctctac aagaaatgca gctcgtgact aaagccctgg ctactcttcc tgaaaaattc 900 agaatcgtta gatctgtatg ggccaacgtt gcacttgatg aacgaactat ggacaatctt 960 ttgcaacgtc ttcggtcaga agagaatgta ctcaagtcct atgaaagacc ggatgactcc 1020 agtcaagcct ttgcagcacg aggtcataca cgaggaagag gtcgtggaaa tagacgaggt 1080 agcagaattt accatggagt cagcaacggt tttgtaaacc aacattcatc acgtgacgac 1140 gttcgctgcg gttactgccc acactatggc cacaaaactc aagactgccg aaagaaaaaa 1200 agagatgagg cagaacagaa agcaaacaaa gaccaagcac tactatcatc ttcatcttcc 1260 aacccaaaca gtgcatttgc tttcttcgtc gactcagggg ccacccagca tatgtcggat 1320 cagaaaattt tattcgaaga cttcacgcaa attcaaactg gaacctggtc aattgctggc 1380 attggagaca cgcaccttca agtactggga aaaggccaca tcacagtcac tgtccaagtg 1440 aacaatcaat catccatcag agtcatcaaa gatgtccttt acgtgcctgg ccttggtacc 1500 aaccttttct cggtggctgc tgctaccaac tctggattgg aagcccgttt ttccaaagat 1560 gctctctcct ttaatcgtgg tgatgagctc gtactgacag gaaaacgttc cggcaacact 1620 ctctatctac tagatctcaa acctcacaca gcttctatca cctcaacaag gcgcattgac 1680 tcagcattca gagccggtct acgagcttca ctactagtct ggcaccaacg attaggacac 1740 atgaaccacc aaaccatttt aaagatggtc tccgaagaac tcatatctgg cctccatttg 1800 acaaatgaaa aaatcccgaa gacactttgt gctgcatgtg aactgttcca tcgacaaccc 1860 ttgaagattg gaagaaccag agccactcgc attggtgaat tagtccattc agatgttgaa 1920 ggtccgatgc catgtccaag tatcggccaa gcgcgctaat atgccctgtt tacagatgat 1980 ttctctggtt ggagagttat atataacccc gtttgaagct tggaatggaa gaaaaccaga 2040 cgtttctaac atccgaatat ttggatcaag agcgtttgta cggtgcccaa atgtcaagaa 2100 gttggaagca agatgtctag gggcactcgt cgggataagc aacacacaaa aggccttcag 2160 gatctacata tcttccccac caagaatcat cgtcagccat gatgtaaaaa ttgacgagac 2220 cgtcatgtac aaaatgccga aggacaatac tgcactgaag tggatagaaa ctaccaaaga 2280 gacaccaaca ccgattgccg attctatgga tgataccacg actaatctca ctacagaaac 2340 cgatccagtc attaatgatg ctgtcgacct tcccatttct accttcgacc tggccccagc 2400 atcaattctt gccacacaag tcccggacca tgaagaggaa caggaagtca ttgtgatgga 2460 agacaacgaa aataattccg aagtccctat cgaagtagac gacccagctg aatcaaaaat 2520 tggcatccgc agatcatcac gtctaccctg ctactctgag aaatacctag cgtacagaca 2580 atcactagga cttcaagctg tacctttcga aaaatctgaa caacctaaga tttgtcctgc 2640 ggaaccatcc agctacgttg aagcaactac gtgtccggat gctgaccatt ggattccggc 2700 gatattcgac gaatatgaat ccctcatcaa aaattccaca tggaccgtgt gcgaacttcc 2760 acctgacaga aaaccgataa aaggaaaatg ggtcttcacg ttcaaacctg gctacaaaga 2820 agttgcgcca aggttcaaag cccgatttgt ggctaaaggt tactcccaaa tttacggtct 2880 tgactatgtt gatacttttt cgccagttga aaaaccctat tcacttcgaa caatactcgc 2940 tattgctgca gccaaagacc tggagatgat tcagttagac atcaagacgg cattcctcaa 3000 tggcgacctt caggaagaaa tttatatgaa acagccagaa gggtttgtca tccccgggaa 3060 ggagaaccag gtgtgcagat tattgaaaag cttgtatgga ttgaagaagg cgtctcgagc 3120 ctggaatcaa aaatttcatg cgtttattat gaaatttgga ttaacccaaa gcaaagctga 3180 cccgtgtgtc tctttccaac atcaacgcga aggggatata gatgagatgc tgattatact 3240 catcatttac gttgatgatg gaatcatctt gagcaaccgc aaacaaactc ttacagatct 3300 cctggattat ctaaaaatgg catttgaaat tcgctctcta cctgcccacc gattcattgg 3360 catcgacatc attcgagatc gtcccaagcg aatggtatac atttatcaac cagaatacgt 3420 cgtcaagata gctgaaaaat tcaacatgtc tacctgtaca ccactcacca tccctgccaa 3480 tccatgctgt aggctctcac ctgagatgtc accccaaaat aaggaagaag aagatgaaat 3540 gaaagctgtt ccctatcgag aagcggtcgg atcacttatg catatcatgg ttatgactag 3600 accagatatt gcatatgccg taggacaagt agcccaatac gctcagaaac caggaaaaca 3660 gcattggcga gcagtcaaac gaattctggc gtatctcatc aaaacaaaaa actttgggtt 3720 acattttgga aacactagta cttcacttat tggtttttgt gacgcagact atgccggcga 3780 cttgcagaca cgtcgctcca cttccggatt tgtgttcctt catcttggag gtcatgtctc 3840 ttgggccagc cggcgccaat catgtgtggc actctcgacc actgaagccg aattcgttgc 3900 tgctgctgat gcgacaaaag aagccgtttg gtttcaacaa ctactgtcag agctcggaat 3960 tgatgcgcct tcgactacct tatattgtga taaccaaagc gcaattgccc ttgtaaataa 4020 ccctactttt caccagagaa ccaagcatat cgatgttcgt cttttctata tcagagagtt 4080 gcaagagagt aaaaagttca acatagtcta cctaaacacg gaacaacaga ttgcagatat 4140 tctgacgaaa cctcttgcag ttccaagatt cgaaaagctt cgagatgctt tgggagttat 4200 tttggtaccc gtttagaaat taatgtttga gggaaag 4237 // ID Gypsy15-LTR_Dya repbase; DNA; INV; 164 BP. XX AC chr3L; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy15_Dya; KW Gypsy15-I_Dya; Gypsy15-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1099-1099 (2009). XX DR Genome; chr3L; Positions 9931857 9932020. XX SQ Sequence 164 BP; 75 A; 12 C; 29 G; 48 T; 0 other; tgtaaactgt atataattgt aaatactaaa taactaaata tcgatagaat taattgaata 60 gatttaagaa tgaacgatag tatcatcgat atatcgaatg tagagaaggg atcgagaagt 120 tgaaggagaa acatcgttaa taaagaatat ataaaattgt taca 164 // ID piggyBac-N2_BF repbase; DNA; INV; 947 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-N2_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSDs; piggyBac-N2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-947 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-947 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-947 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-N2_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX SQ Sequence 947 BP; 317 A; 176 C; 181 G; 273 T; 0 other; ccctattcag accgggcttt tttggtcatc cctggaccgg gggggggggc ttttgaggcc 60 ctccacttct aagtcctata actctaaaac gacgtgccgt agggccacca aattttcagg 120 acatgatttc gacataatat ctgacaagta tgtgacgttt gacgttacgt tgacgtaaga 180 tgacgtaatt atgacgtcat caatttgatt acattggcca aacccatgaa aaatgggtat 240 tctcagaaaa cttcaagtta tgctacaaaa atgtaacaac aagtgcagat aacagaataa 300 taatgtttat tacgacttgc gttgccaata gcaacatcaa tgacgtcaat atgacgtcat 360 aaaatgacgt catgcacatc taagatacga gttgggctcc gccatattat atccaccacc 420 ttgaattttt aaaaatactt tttttgcaac aaataatcac aaagggcaat ggaaatagac 480 taaattcatt catttagatg gtttttgatg aaaaaatacc aggtagttac aaattttaag 540 ggataattag cttgttattc ttgatctggc cattcggtcc gccatcttgg attttgggct 600 gatgacgtca tcaaattagc ataatttatg catatataat tatgaaagat atgctgaata 660 atataagatt ttatttatgt aacaaaatcg cagtaatata gcaaataaat gtgaaaaata 720 catagttaga accaaaaata gtgatttttg gcaaaactgc ctgtcaacaa acggttgcca 780 tggcaacaag aaaaaatgga aaatttgtca aacttcgtaa aattgttgcc aacaatattt 840 taggaaaact caccaaattt ggtggctcta gcgtaagccg ttctggcgtt ataggacatc 900 gaagttagcg cgggcctcaa aagccccccc ccccggtctg aataggg 947 // ID BMRP1 repbase; DNA; INV; 144 BP. XX AC X13869; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE B. mori silk fibroin repetitive sequence. XX KW BMRP1; Repetitive element. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-144 RA Mita K., Ichimura S., Zama M. and James C.T.; RT "Specific codon usage pattern and its implications on the RT secondary structure of silk fibroin mRNA."; RL J. Mol. Biol 203, 917-925 (1988). XX DR GenBank; X13869; Positions 37 180. XX SQ Sequence 144 BP; 10 A; 26 C; 64 G; 44 T; 0 other; ggtgccggtg ccggttcagg tgctggtgct ggttcaggag ctggtgctgg ttcaggtgct 60 ggtgctggtt caggtgctgg tgctggttca ggtgctggtg ctggttcagg agctggtgct 120 ggttcaggtg ctggtgctgg ttca 144 // ID R2_DAn repbase; DNA; INV; 3548 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE ananassae. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_DAn. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-3548 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 125..3292 FT /product="R2_DAn_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="FERRKDPWGYRPPGTLKQIGATENNEPRNLNRFVRGE FT STASSLESTQFGTSAEVNLAGRVPCTICEMTFSSKRGLGVHMSHRHKDDLD FT AQRLRVDKKARWSEEETLMMARKEVELAASGVRFLNKKLAEIFTHRSADAI FT SSYRKRSEYKAKLEQIRGQSVPTPEAEEINTTQRRPSNSEQNRRVPRSEGG FT PIAPTEQTNNEILRVLQGLAPVVCLPRWRAEVLQNIVDNAQVSGQETTLQS FT LSSYLMEIFPPRNEPHILTRPRTEPRNMRQRRRQQYARVQRNWDKHPGRCI FT KSLLEEDDESVMPNQEVMEPYWRRVMTQPSSSSIKRDMFNMEHSLERVWSA FT VNQRDLRATKVKLSSSPGPDGITPKTARSVPEGIMLRIMNLILWCGNLPYS FT IRLARTIFIPKKATANQPQDYRPISVPSVIVRQLNAILASRLSAAINWDTR FT QRGFLPTDGCADNTTIVDLVLREHHKRFKSCYIGTLDVSKAFDAVAHEAVY FT NTLASYGAPKGFINYLRKAYEGGGTMLAGNGWVSEAFIPARGVKQGDPLSP FT ILFNLVIDRLLRSLPSEIGAKVGNAMTNAAAFADDIVLFAETPMGLQKLLD FT TTVCFLSSVGLTLNTDKCFTVSIKGQAKQKCTVVERRSFLIGGRECPSLKR FT TDEWKYLGIKFTAEGRARYDPAEDLGPKLLRLTRAPLKPQQKLFALRTVLI FT PQLYHKLTLGSVTIGVLKKFDKLVRYTARKWLGLPVDVPVSFFHAPHKSGG FT LGLPSLRWTAPMLRLKRLSNIKWPHLERSEVASSFVEEEMRRARDRLQAGS FT EELLTRSQVDSYLANRLHMSVDGCGLREAERFAPQHGWVSQPTRLLTGKEY FT TDGIKLRINALPSRSRTTRGRHELERRCRAGCDAPETTNHILQQCYRTHGR FT RIARHNGVVNFLKRGLERRGCVVHVEPSLQGETGLNKPDLVAIRQNRIYVI FT DTQIVTDGHSLDQAHQRKVGKYDTPDIRTNLRRSFGAFDIEFHSATVNWRG FT IWSGQSVKRLIASDLLSSGDSNIISVRVISGGLWSWRQFMYLSGYTRDWT" XX SQ Sequence 3548 BP; 962 A; 826 C; 947 G; 813 T; 0 other; agaatatgga tttgattgtg cagagggggt gctataccgt aactcgtaag ccatgcaatc 60 agatcaagtc gactcaaaac ctcctcgtgg tattctctgg gtgccagtat ttactggtag 120 ctgatttgag cggcgaaagg atccttgggg ttaccggccc cctggaacct taaaacaaat 180 tggtgcaact gaaaataacg agcctcggaa cctaaatcgt tttgtaagag gagaatccac 240 ggcttccagc ctggagagca cacaatttgg aaccagtgca gaggttaacc ttgcagggag 300 ggtgccctgt acgatatgtg aaatgacgtt cagctccaag aggggtttgg gcgttcacat 360 gtcacatcgg cacaaagacg atcttgatgc acaacgtctt cgtgtcgata aaaaggcaag 420 gtggtcagag gaagaaacct tgatgatggc gagaaaggag gttgagcttg cagcaagtgg 480 tgtacgattt cttaataaga agctagcgga gattttcacc caccgcagtg ccgatgcgat 540 atcttcgtat cggaagagga gtgagtacaa ggcaaaacta gagcagataa gggggcaatc 600 cgttcccacc ccagaagcag aagaaatcaa caccacacag cgccgcccta gtaatagcga 660 gcaaaaccga cgagtaccaa gatcagaagg gggaccaatc gcaccaaccg aacagacgaa 720 caacgaaatc cttagggtac tacagggtct agcacctgta gtatgcttac cccggtggag 780 agccgaggtc ctgcaaaata tcgtagataa tgcgcaggtc tcgggacagg aaaccactct 840 ccaaagttta tccagttatc tcatggaaat ttttccgcca cggaatgaac cgcacattct 900 gacgaggccc cgaacggagc ctcgaaatat gagacaacgt agaaggcagc agtacgcgag 960 ggttcagcgt aactgggata aacatccggg gcgatgcata aagtccctac tggaggaaga 1020 tgatgagtcg gtgatgccaa accaggaggt catggagcca tattggagac gggtaatgac 1080 tcagcctagc tcaagctcga taaaacgcga catgtttaac atggagcatt cactcgagag 1140 ggtatggtcc gctgtgaacc agcgcgatct tagggccaca aaagtcaaat tatctagttc 1200 tccaggcccg gacgggatca ctccaaaaac tgccaggagt gtccccgaag gcattatgct 1260 tcgcataatg aacttgatcc tctggtgcgg gaatttgccg tactctatcc gtctggcccg 1320 aaccatcttc attccgaaga aggcgacggc aaatcaaccg caagactatc gtcctatttc 1380 agtcccctcg gttatagtta ggcaactaaa tgccattttg gcttcccggt tgagcgcagc 1440 catcaactgg gacacgcgtc agcgagggtt cctacctacc gatgggtgtg ctgataatac 1500 gacgattgtt gatttagttt tgagggaaca tcataagcga tttaaatcgt gctacatcgg 1560 gaccctcgat gttagtaagg cctttgatgc tgtagctcac gaagcggtct acaacacatt 1620 ggcttcatat ggtgccccga aaggcttcat caactactta cggaaggcgt acgagggcgg 1680 cggcacaatg ctcgctggga acgggtgggt ttcagaggcg ttcattcctg cccgaggagt 1740 gaagcagggt gaccctctgt ctcccatact attcaacttg gtcattgacc ggttgcttag 1800 gtccttaccc agtgagattg gtgccaaagt cggaaatgcc atgacaaacg cagcagcatt 1860 cgcagatgat atagtccttt ttgcggaaac tccgatgggg cttcagaaat tgttggacac 1920 caccgtttgt ttcctttcct cggtgggtct cacccttaat actgataaat gtttcacggt 1980 cagtattaag gggcaagcca aacaaaagtg taccgtcgtc gaacggcgaa gcttcttgat 2040 tggcgggcgc gagtgtcctt cattgaagcg tactgacgag tggaagtact tagggattaa 2100 attcactgcg gaggggcggg cccggtacga tccagcagag gacctcggtc caaagctgtt 2160 gagattgact cgggcccccc tgaaaccaca acagaagtta tttgcccttc ggaccgtcct 2220 tatcccacaa ctctatcaca agctgaccct tgggagtgtg acgataggcg ttctgaagaa 2280 atttgacaaa ttagttcgat ataccgcacg gaagtggttg gggcttccgg tggacgtacc 2340 agtttctttt ttccatgccc cccacaagag tgggggtctc gggttaccat ctctaagatg 2400 gacagcacca atgcttcgac taaagcgatt gagcaacata aaatggcctc acctcgagcg 2460 atccgaggta gccagctctt tcgtggagga agaaatgcgg agggcccggg ataggcttca 2520 ggctggaagt gaagaactgt taacccgttc gcaggtagat tcgtacttgg caaatagatt 2580 gcacatgtct gttgatggtt gcgggctccg tgaagcagag cgttttgctc cgcaacacgg 2640 gtgggttagt cagcccacgc gtttgctaac aggaaaggaa tatactgatg gaatcaaact 2700 gcggataaat gccctaccct caaggtctcg tactacgagg ggaaggcacg aattggagag 2760 acggtgccgt gcaggatgtg atgctcccga aacaacaaat cacatcttgc agcaatgcta 2820 tagaactcac gggaggagga tagctcggca caacggcgta gtaaattttc tcaagcgggg 2880 acttgagcga agaggctgcg tcgttcatgt tgaaccaagt ctgcagggcg aaaccggact 2940 gaataaacct gacctggtgg ctatccgaca aaatcgcatt tatgtgattg acactcagat 3000 tgtgactgac ggacattctc tcgaccaagc gcaccagcgt aaggtcggga agtacgatac 3060 accggacata cggacgaatt tgcggagatc tttcggtgcc tttgacattg agttccattc 3120 cgccactgtg aactggaggg gaatatggag tggtcaatca gtaaaacggt tgatcgcttc 3180 agacctcctc agctctggtg atagcaatat catcagtgtc cgggtaatca gtggtggtct 3240 ctggagctgg cggcagttca tgtatctgtc ggggtacact cgcgattgga cttagccaat 3300 gcacgggttc cagattaagc ttgctgccga agcataccat caaaatcggc ataaaattcg 3360 cttaataaag gaggtggttt tagtacgtag gcgtcccggg acttgtctcg ggatgaatcg 3420 tgcatgcgta taattgggat cgataacaaa taccaactaa gttattacta atatatcgaa 3480 atacataaat atcccgtcct tacgtatctt tgaagatttc catcctcagc gaacaaaaaa 3540 aaaaaaaa 3548 // ID piggyBac-5_SM repbase; DNA; INV; 2423 BP. XX AC . XX DT 29-MAY-2008 (Rel. 13.05, Created) DT 29-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-5_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2423 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 524-524 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-5_SM is a very young family of piggyBac transposons, CC characterized by 13-bp TIRs and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of 14 copies (they are ~99.6% identical to the CC consensus). This transposon may be currently active. XX FH Key Location/Qualifiers FT CDS 423..2237 FT /product="piggyBac-4_SMp" FT /note="piggyBac transposase." FT /translation="MDKRIFSKGKLSQKDLIDFWENFNGFESEDDDTIDDD FT DLEDPTYHSAVEEDSDDEETTPSATVPTELPCGSSADGIVSKPSKSKDILW FT QKKNLKPNEEQLHFCGNSDLPGGLLQLQTPYQFFSYFVSEDLIAKIVKQTN FT LYSVQKHADRPMAFTTSDIKQYIGIVFYMSFVHMPNIRCYWSPELGYPPVN FT STMSVNKFEKLRRYIHFNDNSTFVTREEPGFDRLHKIRPLIDHLKKKFNSI FT PLENHLSIDEQMCSTKVKHFMKQYMPMKPHKWGFKLYILSGISGFAYNFEI FT YSGQENSMVRPDGEPDLGASSNIVLRLSRIIPRKQNFRLYHDNFYSAIPLM FT VHLAKEGIFSLGTIRRNRVPNCKLPTEKIMKKDDRGKSYEFVANIDGVDIS FT NLVWKDNKYVTLVSSFAGTHPESVVRRYDRKMKKVIEVNCPYIIQEYNRHM FT GGVDLLDSHIGRYKIAIKSRKWYMRIFYHMMDLAMVNAWLLYKRVLKQNGC FT LESSILDQPSFRAEVAHVLCKINTTSAKRGRPNALEKEIQLKKTKGPTQYI FT PPQDVRNDQFGHWPIQIDIKMRCKYPNCKGFTRTKCEKCGVGLCLNKNNNC FT FKNFHTQ" XX SQ Sequence 2423 BP; 819 A; 406 C; 459 G; 739 T; 0 other; cccttaaatg catcgtgttg ccatttggca acataccaat attgaagctc atacattgca 60 cagaaataaa atattgatgc aatacatgtt acagtggaca ggctacatag ttctgcgcat 120 gcgcatgatg tgaccagcgt gaatatagca gctataggtt atttcttctt attcgtttac 180 aggcgtttaa acttattcgc gcgctcgtca tacgcgaaag ttattcagtt tttttaaata 240 gtttacaaaa catttttatt tggttttaaa tataattgac attaacaaag gtatgtattc 300 tttattgagc atttgtaatt ataaataaac tttgtattac tgcatataaa atctatattt 360 cgctttgttg ccgtttggct acagcatgca tttacttata aatttttgtc cctaaaattt 420 agatggataa gcgaatattt agcaagggaa agttatcaca gaaggatctg attgattttt 480 gggaaaactt caatgggttc gagagtgaag acgacgatac tattgacgac gatgatttgg 540 aagaccccac ttaccattcc gctgttgaag aagacagcga tgatgaagaa actacaccat 600 cagcaacagt tccaacagaa ttaccatgcg gctcctctgc tgatggcatt gtatcaaagc 660 cttcaaaaag taaggacatt ttatggcaga agaaaaactt gaaaccaaat gaagaacagt 720 tacacttttg cggaaattct gatttgccag gaggtttatt acagttgcaa acgccttatc 780 aattctttag ttactttgtt tccgaagatt taatagcaaa gattgttaaa caaaccaatt 840 tgtatagtgt ccaaaaacac gctgacagac caatggcttt tactacttct gatataaaac 900 agtatatcgg tattgtcttt tatatgtcgt ttgtacatat gccaaacatc aggtgctact 960 ggagtcccga actgggctat cctccagtaa atagtacaat gtctgtgaac aaatttgaaa 1020 agttgcgtcg gtatattcat ttcaatgata attcaacttt tgtgacacgt gaggagcctg 1080 gttttgatag attacacaaa attagaccgt tgatcgatca tttaaaaaaa aaattcaaca 1140 gcattccttt ggagaatcat ctgtctatag atgagcaaat gtgctcaaca aaagtgaaac 1200 attttatgaa gcagtatatg ccgatgaagc cgcacaaatg gggattcaag ctttatatat 1260 tatcaggaat ttctggattt gcttataact ttgaaatata ctcaggccaa gaaaattcaa 1320 tggttcgacc tgacggtgaa cctgaccttg gggcttcatc aaacattgtt ctgcgtttat 1380 cccgtattat accaagaaaa caaaattttc gattgtacca tgataatttt tattcagcta 1440 ttccactaat ggttcacttg gcaaaagaag gcattttttc attgggtaca atacgtcgga 1500 atagggttcc taattgcaag ctacctacgg aaaaaataat gaagaaggat gatcgcggta 1560 agagctatga atttgtggct aatatagatg gtgtcgatat atcaaattta gtttggaaag 1620 acaataaata tgtaacactt gtgtcgtctt ttgccggaac gcatcccgaa tcagtagtca 1680 ggaggtacga cagaaaaatg aagaaagtta tagaagtgaa ttgcccatat ataatccagg 1740 aatacaatcg tcatatggga ggcgtagatc tcctagatag tcatattggg aggtataaaa 1800 ttgctataaa aagccgaaaa tggtatatgc gcatatttta ccatatgatg gatctagcca 1860 tggtgaacgc ctggctactc tataaacgag ttctcaaaca aaatggatgt ttggaatcta 1920 gcattctaga tcaacctagt tttcgagctg aagtagcaca tgttttgtgc aaaataaaca 1980 caacatcggc taaaaggggt cggccgaatg ctcttgaaaa agaaattcaa ttgaagaaaa 2040 ccaaaggacc cacacaatac ataccgccac aagacgtacg aaatgaccag tttggtcact 2100 ggcctattca aattgatata aagatgaggt gcaaatatcc aaattgcaaa gggtttactc 2160 gtacaaaatg cgaaaagtgc ggtgtagggt tgtgtttgaa taaaaataac aactgcttta 2220 aaaacttcca cacacaataa atagcctttg cgtaatatgt ctaaatgcat attgtagcca 2280 tttggcaaca aacccttttt ggatgaaata taaacttttt ggagaattat aatttttttt 2340 attatttttc tgtagttgta gttactttat tacaaagcgt caaattaaaa aaaacgaaaa 2400 aaattaagtc atgcatttaa ggg 2423 // ID Gypsy-24_DYa-LTR repbase; DNA; INV; 214 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_DYa_; KW Gypsy-24_DYa-I; Gypsy-24_DYa-LTR. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 1705327 1705114. XX SQ Sequence 214 BP; 66 A; 47 C; 56 G; 45 T; 0 other; tgtggcgacc ggcgttgcca acaaccaagc atcgggagag cagcgacggg agagacacta 60 ccgatcgcct agtgtctgag tgagagaatg gattactcct gagcgtactg ccaagcagcg 120 tggatgtgcg aataccatag atctgaataa acgaactgaa tatatataac cataaatcat 180 aacgcgtgtt actcttagta agcgggtcat taca 214 // ID LIN6_SM repbase; DNA; INV; 5324 BP. XX AC . XX DT 16-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Non-LTR retrotransposon from Schmidtea mediterranea: consensus. XX KW Non-LTR Retrotransposon; Transposable Element; LIN6_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5324 RA Jurka J.; RT "Non-LTR retrotransposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 164-164 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(2555..3649,3592..4317,4321..5292) FT /product="LIN6_SM_1p" FT /translation="IGYLDRRDPHGGWIDDTRLTLQVPAHSAAKYRQALWD FT HGYSLATYGETSWPRRPDDRGDEVELPSVAIKREPGRNPAREARQRAEFNL FT LQLENLDDIILSESSDTSQPPSQLLYQPSSCPKTATIQQVDLDQSSFTDDT FT SLTAILSKPQSQSTSYPINNSPLTIQQHNVTISSSPEPTCCSTPHNHLNLN FT TFNTPSTFFASLKKQPFEQPITPITKNCKTDLYEPCSLNLTQLDESLPINP FT PRIKPSPLISDDEEPSSTKTPSQKPAKNLTYAQNSQISDLFRTQEPLSDNS FT SPESLITLNTSIISNLATPNEPHNFIIPAPVLNITSKIXTXNIPISTPLPS FT CLKQQNQSIRYNSMILMRLYMITTKPINPIQLHDINETLYDMNRRGKLICR FT PKHRNKTLCNSISIYNRHFIFEHAQKVHNITITDNTIIDCYQCKPKKGPNX FT HAMVKIKFIDLYKHIELNDHPIEIDNCSKNNQIIKINSSNKIICHNNNKGK FT NIRLCYHKFTYNTAINDIILHMTKKHKKKXFDKEAEIYCYCGAWISIHNIA FT LHINDKHPSTPNINSTDYIDKXDNNKNLFAGIINTDETIQVPIELPKTLLF FT TKNIDSESQWSQHSIKAFIFTHAMKKSSIFVDPISSNSFIQHNFKTFFSTF FT PFHKFSDWNEIIMPMCKPNGSWFFFFLDKKRKLATLIDPLSENSLLQHSDL FT ASDILNTILNLQNIVLETDIDLSSLEYPRCSINEDSPFFICHFLKCLNSSE FT PISIPDIPNMTTSIKSIIINYIRNGFLETDIRKYKNCIDSLIFQMNNTDDT FT VNCSLIEILKINNNLNPLKYKIKTKSDIIYKNKKREHELECSKRYKFKYKP FT KEQLSKILNCNDIITIKPCMNKFINIFANNTSPVSEFSTIQLYNSPENDTN FT TPTCPEDIKLILKKLKNTAPGLDKILL" XX SQ Sequence 5324 BP; 2049 A; 1081 C; 724 G; 1455 T; 15 other; tcaagatcca cacgaaatga cataaaaata acataataac aaaagatttg accgaaaatt 60 gacaatttac gtaaaagtaa agaattaatg aaaaaaatga ataaaaataa aataattcta 120 gttataattc aaataaaaag ctaatgaata aattgtcaaa taaaatcgag aaaataataa 180 aaattagaat tagattagac aaataatatc aaggtcaaat tttgttttac cgcatcaaat 240 ttaacacgca aggttagggt cacagaatag gtatatatac caattcattt tcacagaatt 300 ttaatgcatt cacttagtga gcaaagtcac ttttcttcag agaattcgag aatcactgtt 360 aatccaagga ccaactatcg ataataattt ccacaatacg aactcaggca tccttctatg 420 taagccctac ttgagcgtat gtgtgggaaa taattgtcgt acgtatatat gtatgctaaa 480 gagaaatatg atgactatgt aatgattcga aatatatgtt taaatgagag agttgaacct 540 gtcgaaattt ttctggccca cagccacaca ctatgcaaga cattaaaata cgacacaact 600 atcaagctgt gtaagtgcgg tttctccgcc tatagcttgt gcccgtgaga gttccgtcgt 660 gcgcaacgac gggctcgaac gtaaaatcat attcgtttat aggggcagtg ggaagatcgg 720 gaactagtct tttggctggc tctctttcac ccatctcccg aaaattcgac ctaaaatcta 780 taatacaaat cggctaaatt tatcacaccg agtagtccag tgggcactct taaaatctaa 840 tgtgaacaya tatttcttag gatcaaattt aaaggctcgg gaggtggaaa ctcggaatcg 900 atgtaactaa tttaaataaa ttcatgaata acattctgta ctatgatcaa aagatcgata 960 aacaatataa ttctcaaggt agcaataact gaccatagtt aattatgtac tatagaatta 1020 tgctagggaa aaactttaaa ataatagaca agaaccyaat taaattaata ctgacttaat 1080 aatatcctta aataaaaaty taataaacta acagtaacag aatgctaata gcaaagcagg 1140 tctaatgttc tcgctgaaat aaaatttaaa taacaayaaa ctaaattaag gtaatttgaa 1200 ttaaaacrct tataataaag tcatgataac cctttcgtag agaaaaataa tataaattaa 1260 tatgtgaaac ggacaactaa aatttaatca gaaaaaatct ctcgaactga gagcaaataa 1320 ataaccacaa atatccttat rcgtagaaaa acgatcmttg atataaaata aaattaagat 1380 ataatcataa agaatttcgt taaaccataa agacaaacgt ctgcgaaatc cataaatacc 1440 tttattacag tatataataa atcaaacgct gtcaaaggcg gcgtatcata gaacttcttg 1500 atcgactctt tattgagaat tgatagtgaa cttccgccca acatgtaaaa tccggtagaa 1560 aattgtgaat actgacccag agtagtagag gtttgagcat tgatttgtga ggaatgaatt 1620 accaactgga gagtagaaat gaaaagggat ccagtattca tgaatcaact gagaagagga 1680 ggacaataat gtacctagtt cggacgaact cagggataag aatccaaaga gaataacttt 1740 cattctgtcg gtataacact aagagaaaca caataaattc caaaataaat actctgaaat 1800 tgttgacatc agagaaaatt ctagaccaaa atcgattaaa ataattttta tgaaacgaga 1860 agtgttacgc tccattgata aagcgaaagg aaatttcata attccgcaca aaatatccct 1920 artaggaata tcaacccaac cagaaaatcg cacccaaaaa taaaatggaa taaatagggt 1980 gataacaaat aattccacca accgtaaacr caattaatct gatagaaatc cagtgaatgg 2040 tgatttacac cggcggttca agcggaacaa ataagcaatt aattacctat aataggaaaa 2100 atttctacaa tattaacaaa aaaaaataaa attggtaaaa tggattacac taaattctta 2160 aaaaataaat gaactatagt taatggggga tagactagtc ragcggcgct gagtcaacac 2220 catgaatgta tatatagagt tataaatcca gtgagttaaa taacaatgtc gacaatgtct 2280 aaacaacatc aaccgcactc agatggtagg cgtagaagga aggagcagtt cggagatagg 2340 ttccagaatc gccaaccgcc taatgaagca ggactagaaa ggaacttcat cgaacagaaa 2400 cgaggtaaat aactccagtt tacaatgata aatttttctc tcattgatta tgatttttca 2460 cgcctaatga ctgacgatga taataatttt agtgaactac gtaggatttt ctaactaacc 2520 cataaaatac catttgcact ataatattaa ttaaattggt tatttagatc gcagagatcc 2580 gcacggtgga tggattgacg acacgcggct tactctgcaa gtacctgctc actcagccgc 2640 aaaataccga caagccttat gggaccatgg ctactctctg gccacgtatg gcgaaaccag 2700 ttggccccga cgaccagacg atcgtggaga tgaagtcgag ctacccagtg tagctatcaa 2760 gagggaaccg ggtaggaacc cggcacgaga ggctcgccaa cgagccgagt tcaacttact 2820 gcagctagaa aacctcgatg atattatcct atccgaatct tctgatactt cacagccccc 2880 ttcgcaacta ctctaccaac cttcatcctg tcctaaaact gcaaccatac aacaagttga 2940 cctagatcaa tcttccttca ctgatgatac ttcacttaca gcaatactat ccaaacccca 3000 gtcacaaagc acttcttacc ctataaacaa ctcaccttta accattcaac aacacaacgt 3060 aaccatatct agctcacctg aacccacttg ctgctctaca ccacataacc acttaaatct 3120 taacaccttc aatacaccct cgactttctt tgcttccttg aaaaaacaac cctttgaaca 3180 acctataaca ccaataacca aaaattgcaa aactgacctt tatgaacctt gctctttaaa 3240 ccttacgcaa cttgatgagt cactacccat taatccacct agaattaaac catcgcccct 3300 aatctctgat gacgaagaac catcaagcac taaaactcct tcacaaaaac cggccaaaaa 3360 ccttacttat gctcaaaact cacaaatatc tgaccttttc cgtactcagg aacctctatc 3420 tgataactcg tctcctgaat cattaattac tcttaacacc tctattatct ctaacttagc 3480 aacaccaaat gaacctcata actttattat accagcacct gtacttaaca taacctccaa 3540 aatawctact amaaatatcc caatatctac accccttcct tcttgtctta aacaacaaaa 3600 ccaatcaatc cgatacaact ccatgatatt aatgagactt tatatgatat gaatcgcaga 3660 ggaaaactta tctgtaggcc aaagcaccgt aacaaaacct tatgcaactc catctctata 3720 tataacagac atttcatctt tgaacatgca caaaaagtac ataatataac tataactgat 3780 aataccatta ttgactgtta tcaatgtaaa ccaaaaaaag gaccaaacar acatgctatg 3840 gtcaaaatca aatttattga tctatataaa cacattgaat taaatgacca tccaatagaa 3900 atagacaact gtagcaaaaa taatcaaata attaaaatta actcatccaa taaaataata 3960 tgtcacaaca ataacaaagg taaaaatata cgtttatgct atcacaaatt tacatacaac 4020 actgcaataa atgacattat actacatatg actaaaaaac acaaaaaaaa ayactttgac 4080 aaagaggctg aaatctactg ttattgtgga gcctggatta gcattcacaa catcgcatta 4140 catatcaatg ataaacaccc ttcaacacca aatattaata gcacagatta catagacaaa 4200 aawgataata acaaaaacct ttttgcaggc ataataaaca ctgacgaaac aattcaagta 4260 cctatagaat taccaaaaac acttttattc acaaaaaaca ttgactctga atcacaatga 4320 tggtcccagc actcgattaa ggctttcatc tttactcatg ctatgaaaaa atcttccata 4380 ttcgttgacc ccatatcttc taactccttc atccaacaca acttcaaaac tttcttctcc 4440 accttcccct tccataaatt ctctgactgg aacgagatca ttatgcctat gtgcaagccc 4500 aatggatcct ggttcttttt ctttcttgac aaaaaacgta aattagctac tcttattgac 4560 cctctatctg agaacagcct tcttcaacac tctgaccttg ctagcgatat tttgaacaca 4620 atcctaaacc ttcaaaatat cgtccttgaa actgatatag acctaagctc ccttgaatat 4680 cctcgctgct ctataaatga ggactctccc ttttttatct gtcacttctt aaaatgcctt 4740 aattcgagtg aacccatttc tatccccgac atacctaata tgaccaccag tataaaatca 4800 ataattatta attatattag aaatggtttc ttagaaactg atatccgaaa gtataaaaac 4860 tgtatagaca gtcttatttt ccaaatgaac aatactgatg atacagttaa ttgttcactt 4920 attgagatcc ttaaaattaa caataactta aatcctctta aatacaaaat taaaactaaa 4980 agtgatataa tatataaaaa taaaaagcgt gaacatgagt tagaatgttc taagagatat 5040 aaatttaaat ataaacctaa ggaacaacta tcaaaaattt taaactgtaa tgatataata 5100 actattaaac cttgtatgaa caaatttatt aatatttttg caaacaatac ctctcctgtc 5160 tctgaattta gcacaataca actttataac tctcctgaaa atgacacaaa cactcctact 5220 tgccctgagg atatcaaatt aatcttaaaa aaacttaaaa acactgcccc tggtcttgat 5280 aaaatccttc tctgactggc gtacatctcc tgactttctc tttc 5324 // ID Copia-30_DPu-I repbase; DNA; INV; 4179 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-30_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4179 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 25..4104 FT /product="Copia-30_DPu-I_1p" FT /translation="MSAIAFSTRDTSHISKFNGSNFPFWKFQVSLVLKQHD FT LMDVVLGKEVQPQQADPATAANTTAINSWNKRDNSASCCIVATIEETFQRS FT LINCKSSKEMWDRLAAQYEQAASENKYFLQQRFYNYSFQQGHDVMTHITEI FT ETLANQLSDLGEIQTENQIITKVLCTLPPSFRSVVSAWENVEDSKKKLPLL FT TTRLLKEESLNKMHDNGESAEDKAFFSKRSNQVRPRDAQSSKKIDKRVCYY FT CDPSGRIGHTEERCWVKHGRPGRPSQNLLQSADAALSHHHTEWPVDYGFSS FT YITSFLSSTSRNLKQWFADSGASQHMSDQRWMFSTFHAVKRGSYPVKGIGS FT DNCPLQATGKGDIRIKSKVNGEWRNNIIYDVLFVPKLGANLFSVRAATKRG FT VKVVFDNDGVSITYNSSVVATGTSNGTNLYLLDFEPVSHINEDTSVVNPIS FT LLSKTNAASINTWHHRLGHLNTATIKKMESLKVADGLLISSSPQPSICEGC FT VYGKHHHQPFPTGGRTRGTRIGDLIHSDLVGPMSIPSPSGSRYMVIFKDDF FT SGYCSIFFLKKKSETAEHFKFFVLRLEKETNSFVNVLRTDNGGEYTGGDFQ FT KWIKSKGIRHETSIPKTPQQNGVSERQNRTIIESARSMITASNQSXELWAE FT ASNCAVYLRNRVIGKSLPEMTPYEAWFGRKPNLSHXRMFGCPAYMHIPADE FT RSKFSPKARKCVFVGYCETQKGFRLWDSVRRRIFVSRDVIFNEVVPXLXHX FT FSSDCAAKEXPCNPAVLPXQATAPPEDEETASDVIGVETTEQSTSNPTASK FT EXDAVDSSSPSTDDLGPRKRRPPVRWADESTTGIYAGIAVDPDELVEPETY FT EEAIISPQSVEWKLAMEEEMNSLIKNETWSLVELPLGRAPXRNRWVYKIKR FT SGQNNVKFKARLVAKGFSQRPGIDYGETYSPVVKHDSLRTILSLAAAQDLE FT LLQLDVKTAFLNGDIDEELYMDQPTGFKEDSQMVCLLKKGLYGLKQASRAW FT NEKFNHFLIQYGFTRSNADSCVYFQRDENNITIIAIWVDDGLLCSNHQSKL FT EDIVNYLSDKFEITSGPVDHFVGLEISRDRCKGLIHISQQAYVKKILARFK FT MGECKPRSVPADPCSHLEKELFTSETQDYPYREAIGSLMFAAICTRPDISY FT AVGQVAKYSSKPSHVHWEAVKRIFAYLKGTISLGISYFRGVKDGVLQAFSD FT SDFAGDADDRRSTTGNIFILNGGAVSWRSQKQKCVALSTAESEYIAASMAT FT KEVVWLRRLLMEIGCQQIKSTPLFCDNQSAIRLVYNPEFHQRTKHIDVKFH FT HIRDMQVQQEISIQYIPTENQLADALTKNLDHQKLSRFKHSIGMCPFV" XX SQ Sequence 4179 BP; 1305 A; 916 C; 890 G; 1050 T; 18 other; cctagttttt ttttttgaga aaacatgtca gccattgctt tttctacaag agatacaagc 60 catataagca agtttaatgg ctccaacttc cctttttgga agtttcaggt atcattagta 120 ttgaaacaac atgatttaat ggatgtagta ttggggaaag aagtacaacc ccaacaagca 180 gatccagcca ctgctgcgaa caccactgca ataaactcat ggaacaaaag ggacaattct 240 gcgagttgct gtattgtggc cacaattgaa gaaacattcc aacgatcact catcaactgc 300 aagtcatcca aggagatgtg ggatcgttta gcagcccaat acgaacaagc cgcctcagaa 360 aataagtatt tcttacagca acgtttctat aactattcct ttcaacaagg ccacgatgtg 420 atgacacata taacagaaat tgaaaccttg gccaatcagt tgagtgattt aggagaaatt 480 caaacagaaa atcagattat caccaaggtg ctatgtactc ttccaccaag cttcagatca 540 gtcgtgtcag catgggaaaa cgtcgaggat tcaaagaaga aactcccact actaacaaca 600 aggcttctaa aagaagaaag tttgaataag atgcacgaca atggagaatc agcagaagac 660 aaagcattct tttcaaaaag gtcaaaccaa gtcagaccca gagatgctca atcaagtaag 720 aaaatcgaca aaagagtatg ttattactgt gacccaagtg gaaggattgg tcacaccgaa 780 gaacgttgct gggtgaagca cggaagacct ggccgcccat cacaaaattt attgcaatct 840 gctgatgctg ctttaagtca tcaccatacg gaatggccag ttgattacgg cttctcttcc 900 tatataacct cctttctctc atctacttcc aggaacctga agcaatggtt tgctgactca 960 ggtgcctcac agcacatgtc tgaccagcga tggatgttct caacctttca tgcagtaaag 1020 aggggtagct atcctgtgaa gggaatcgga tcggacaact gcccacttca agcaacagga 1080 aaaggagaca tccgcattaa atctaaagtg aatggagaat ggagaaacaa catcatttat 1140 gatgttttat tcgtcccgaa acttggagca aatctttttt ctgttcgtgc agcaacaaaa 1200 agaggagtaa aggtggtgtt tgacaacgac ggtgtctcca ttacttataa ctcatccgtt 1260 gtggcgacag ggacaagtaa tggaaccaat ctttatctac ttgattttga accagtcagc 1320 cacatcaatg aagacacatc agtagtaaac cccatctctc ttctttcaaa aaccaatgct 1380 gcatcaatca acacctggca ccatcggtta ggacacctaa atacagctac aatcaagaag 1440 atggaatctc ttaaagttgc tgacggtctt ctgatatcat catcaccaca accctccatc 1500 tgcgaaggat gcgtgtacgg aaaacatcat catcaacctt ttccaacggg aggccgcacc 1560 agaggtaccc gcattgggga tttaattcat tctgatttag taggcccgat gtctatacct 1620 tctcccagtg gctcacggta catggtgatc ttcaaagacg atttttctgg atactgttca 1680 atattcttcc tcaagaaaaa atccgaaact gctgaacatt ttaaattttt cgtactccgt 1740 ctagaaaaag aaactaacag cttcgtgaac gtcctaagaa ccgacaacgg aggagaatat 1800 accggaggtg attttcaaaa atggatcaaa tccaaaggaa ttcggcacga aaccagtatt 1860 cccaagacac ctcaacaaaa cggcgtttcg gaacggcaaa accgaaccat catcgaatca 1920 gctagaagca tgatcacagc ttctaatcag tcamgtgagt tgtgggcaga agcctcaaat 1980 tgtgctgtat atttgagaaa cagagtaatc ggaaagtctt taccagagat gacgccatat 2040 gaggcmtggt ttggamgaaa acctaatctt tcccacktaa gaatgtttgg atgtccagcc 2100 tacatgcata ttccagcaga tgagcggtct aaattctccc ccaaagcccg gaaatgtgtg 2160 tttgtggggt actgtgagac ccaaaaagga tttcggctat gggactcggt gcgcagaaga 2220 atttttgtsa gcagagacgt aatattcaac gaagtngttc caganttanc mcatcanttc 2280 agttcagatt gtgcagcaaa agaggkgcct tgcaatcctg cagttctacc tmttcaagct 2340 acagctcctc ctgaagacga agaaacagct tcagatgtca tcggagtcga aacaactgaa 2400 caatcaacaa gtaaccctac agcttctaaa gaaamcgacg ccgttgattc ctcgagtcct 2460 tcaacagatg atcttggtcc aaggaaaaga agacctccag ttaggtgggc agatgaatca 2520 acmaccggca tctacgcagg aattgctgta gacccggatg agctggtaga acctgaaact 2580 tatgaggagg cnatcatctc accacagtca gtcgaatgga aattagccat ggaagaagaa 2640 atgaattcac tcatcaagaa cgaaacctgg tctttggtcg aacttccact nggacgtgct 2700 cctatnagga atcggtgggt gtacaaaatc aagagaagtg gacaaaacaa cgttaaattc 2760 aaggctcgac tagtcgcaaa aggattctct cagagacccg gcatcgacta tggagaaaca 2820 tattctcccg ttgtcaagca cgattctctt cgaacaattc tttcactcgc tgcagcacaa 2880 gatctagaac tactccagct cgacgtcaag acggcattcc tgaatggtga cattgatgaa 2940 gagttgtaca tggatcagcc taccggcttc aaggaagata gtcaaatggt ttgtctcctt 3000 aaaaagggac tatatggntt gaagcaagcc tcaagagcct ggaacgaaaa gtttaatcat 3060 tttttaatcc aatatggttt taccagaagc aatgcagact cttgcgttta ttttcaacgg 3120 gacgagaaca acatcaccat aatcgcaata tgggtcgacg atggacttct ttgcagcaac 3180 catcaatcaa aactggaaga cattgtcaat tatctctcag acaagttcga gattacgtct 3240 ggaccagtgg accactttgt cggtttggag atttctagag atagatgtaa aggcctcatt 3300 cacatctccc aacaggccta tgtgaagaaa attctcgccc ggttcaagat gggagagtgt 3360 aaacctcgaa gcgttccagc cgacccatgt tctcatttgg aaaaggaatt gtttacctcc 3420 gaaactcaag attatcctta cagggaggca attggttcct tgatgttcgc agccatctgt 3480 actcgaccgg acatttcata cgccgtcggc caggtggcca aatactccag taaaccgagc 3540 catgttcact gggaagctgt aaaacggatt tttgcatatt taaaaggaac aatttccctt 3600 ggaatctctt acttcagagg agttaaagat ggggttttgc aagccttttc tgattcggat 3660 tttgccggag atgcggacga cagaagatcc acaactggaa atatttttat tctcaatggc 3720 ggcgctgtgt catggagaag ccaaaaacag aaatgtgtcg ccttgtcgac agccgaatca 3780 gagtatattg ctgcaagcat ggcaacaaaa gaggttgtct ggttgagacg ccttttgatg 3840 gaaataggct gtcagcagat taagtcgacc cctctcttct gcgataatca atcagccata 3900 cgtcttgttt acaaccccga gtttcaccaa aggacgaagc acattgatgt taaatttcat 3960 catataagag acatgcaggt acaacaggag atcagcatac agtacatccc gacggagaat 4020 caattggcgg acgcgctgac gaaaaatctg gaccatcaaa aactttcaag attcaaacac 4080 agcattggaa tgtgtccatt tgtataaatc gtgtgaaatg tttaagtggg tgttcagtgt 4140 tcaccatttt tgttctcgtg tttcgtgatt gagtgggtg 4179 // ID Gypsy-8_DWil-LTR repbase; DNA; INV; 242 BP. XX AC scaffold_180700; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_DWil_; KW Gypsy-8_DWil-I; Gypsy-8_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-242 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180700; Positions 3505698 3505939. XX SQ Sequence 242 BP; 88 A; 41 C; 50 G; 63 T; 0 other; tggttttccc agctctagag caagccggta aagttgcgtc tcgaagtgtg agcagtgaaa 60 tcaaaaataa agagaaacat tgtgaagtaa attataattt tactactaag ctaataaact 120 aatatataca ataaagctga tgaaaataaa acccggtgtg tgcaaattaa atctgttcga 180 ctcggtgaag gttgaatctc cacagcttgc gaggctcgtc cgggaaaaag ctaaattcta 240 ca 242 // ID BEL-196_AA-I repbase; DNA; INV; 6372 BP. XX AC supercont1.77; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-196_AA_; KW BEL-196_AA-LTR; BEL-196_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.77; Positions 1305704 1299333. XX CC Positions [5245-5826] - Integrase core CC 'CAATC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 5032..6255 FT /product="BEL-196_AA-I_3p" FT /translation="MHPFPRLFVEYMHKKLQHAGGRTLLTVIREEFWPIDG FT RRLVRSVVRNCFRCQRLNPAPAQQQIGQLPASRVTPSRPFSVVGIDYAGPF FT YLKPIHKRAAPAKSYLCLFVCFSTKAVHLELVCELSTSAFLAALRRFISRR FT GRPIHIHSDNGKNFEGAKNEIVQLFAMLAGTHHIDEINSFCASEGITWHLT FT PPKAPHFGGLWESAVKVAKKHIYRVIGSSRYTYEDFSTLFAQIEAVMNSRP FT LLPMTDDPNDLAALTPAHFLIGTSLMALPDPDLRRIPTSQLDHYWQLQHLM FT QRFWTLWQQEYLQELQKDTKCYARNDDILPGRLVIILDEQQPTTRWPLGRI FT VKLHPGTDNITRVVTLRTAKGVIKRPVTKVCILPFAPADSPAPNEITCPAD FT QQSQCDDVSPNAAK" FT CDS join(892..3063,3067..4557) FT /product="BEL-196_AA-I_1p" FT /translation="MTTAPEMATQQQLLFRRDTILASLGRAEAFVADYVAE FT RDEAQLQLRISHIDNIWANLEAIQAQLEDCATSEEGRARHAGVRADYEPRL FT FSIKASLISKLPTPTSSNARNPCSTNVHALSGIKLPTISLPEFDGDYQQWL FT TFHDTFVALIHSNTEVLDIQKFHYLRAAVKGEAAQLIEAISISSANYNLAW FT ETLKGRYSNEYLMKKRHLQALFDIPRMKKESATTLHGLVDEFERHTKILRQ FT LGEPTDSWSSLLEHLLCTRLHDDTLKVWEDHASTVKDPNYSCLIEFLQRRI FT RVLESISVNHHVPSGSASSTPLSKKQSPIAHLSSCSSTTSTSKRCLACNQT FT HPLVRCFKFQRLPLSERLQIVNAKRLCLNCLKVDHSYRNCPSEMSCRHCNR FT RHHTLLHTSTPDSTRRSTNGNTSVASTSSPTPVAQSSPSASSETSQNSLVT FT AAELDPIVEASVPVQHNRENVFLLTVIVHIVDSFGQEHPARALLDSASQPN FT LITERLASILRLRRSNVNITVQGAGKLSKPVRQSVYTEVRSRNQHFSCGVN FT FLIMDKVTADLPSRNISTAGWNIPKDLVLADPSFNQSQPIDLVLGAKHFYS FT FFPSAARLQLSENLPVLVDSVFGWIVAGSSSPFCPESSSNKSRPSSTTISM FT VTLEESIERFWKNEELVMNNNYSVEERRCETLYQSTVSRNEEGRYIVRMPR FT QPDFHAMIGSSKLNALRRFELLEKFERDSKLKTDYHAFMQEYIDLGHMREV FT AESEKDPPVAYYLPHHPVFKDSSTTTKVRVVFDGSAKTSTGYSLNEALCVG FT PVVQDDLLDIMMRFRTYKVAVVGDIAKMYRQVLLHPEDRPLVRIFFRFSPQ FT SPIRTYELQTVTYGLAPSSFLATRTLHQLADDEGHKYPLGGPALRKNFYVD FT DFIGGAQNVQEAIQLRLQMSELLAKGGFELRKWTSNELEVLRGLKDEQIGT FT QSSLQFNPNETIKALGICWEPEADTLRFDSHISPCDEPPTKRSILSCISRL FT FDPLGLIAPVIVKSKLLMQELWLLSCGWDDAVPDSVRQKWDGIRSELPTIS FT TYRADRYAFLPNASVQLHAFSDASEVAYGACIYARCEDSDGRVRISLLASK FT SRVAPLKRVSLPRLELSAAVLGAHLYERIRKSIHIEIAASYFWSDSAITLQ FT WIRSPPNNWKTYVANRVSEIQHFTHGCAWNHVSGYQNPADLVSRGMAVIDF FT VKSKLWRSGPD" XX SQ Sequence 6372 BP; 1587 A; 1615 C; 1454 G; 1716 T; 0 other; taattttggt gccgtgacca ggattccgcc atcggatcat ctaaaaacga cctgggagcc 60 gccatttcga cactgagcat tgttcgtcac tgcgacagcc attgtacccg gaggatagtt 120 attttctcaa ggctcgatta tatcgggcag caggtaatta gctgtccgag tatgtttgct 180 ggccgcacag gtgcccttta gattttactt ttcttgcatg tacactcttc gttgctctgc 240 ggataccgcc aattcgactg gatttcgtca tttggccatt ctaccgggat cgtcaaacaa 300 tcactcgcct gtcgccatag tgccatccgg cgccattctt ggaagttggc ggcttggctt 360 ccccgtagct attattcctg cagaagcttc tggatgttgg cattgtttgc caagtgaata 420 gctggacatt gacctcctgt tgccgccttt cgaagtcggc cattgccttt gatgctcttc 480 tgttcctgtc agtgaatacc agtgctattg tgacatcatc aagcgtgacc gtacgaagtg 540 atcccacacg gtatttttct gcgcttctgg attttatacg acgttgtcag tggctgctaa 600 ggatttcttc gaagattttt ccatcccgtt tcgtcaattg tgattctcct atgggcctat 660 tacggcaaat tggactgctt tcgacggctt ctggtattag ctgctccgct cccgatacgc 720 attcggagga tttggtgctc gtctggtttt tgctgcattt attgattaat tcaaggcata 780 ttactgcctt ggaacaggtg agtaccgttc caagtagcta ctttgtccca ccctcgggtc 840 taccgtggtc ttattgttct cagattcact gactgctgca ttccataact gatgaccacc 900 gcaccggaaa tggctacgca acagcaattg ctcttcagaa gggacacgat actggcatcg 960 ctgggtcggg cagaggcttt tgttgctgac tacgtagctg agcgtgacga ggcgcaactt 1020 cagttgagga ttagccacat cgacaacatt tgggccaatt tagaagcgat acaagcccaa 1080 ctggaggact gtgctacgtc agaggaaggt agggcaaggc atgcaggtgt tcgtgcggat 1140 tatgaaccgc gccttttctc aattaaagct agcttgatat cgaaattacc gactcctact 1200 tcttcaaacg ctcgcaatcc ctgctccact aatgttcatg ctctctctgg aatcaaacta 1260 ccaactattt cgttgcctga atttgatggg gactaccaac agtggctaac tttccatgat 1320 acctttgtcg ctctgattca ctcgaatact gaggtgctgg acattcaaaa gtttcattat 1380 ctgagggcag ctgttaaagg cgaggcagca cagttaattg aagcgatttc aatcagctcg 1440 gcaaattata accttgcgtg ggaaactttg aaaggtcggt actccaacga gtatttgatg 1500 aaaaagcgac atctgcaggc gctgtttgac attccacgaa tgaagaagga atctgctacg 1560 acacttcacg gtttagtgga cgagttcgag agacacacaa agatactgcg ccagctaggg 1620 gagcctactg attcctggag ctctcttctc gagcacctgc tttgtactcg acttcatgac 1680 gacacgctca aggtatggga ggaccatgcg tcaacggtca aggatccaaa ttactcctgt 1740 ctcatcgagt ttctgcaacg gaggatcagg gtactggagt ctatttctgt taatcaccac 1800 gtgccatcgg gttccgcttc gtctacgcct ttgtcgaaga aacaatcacc gattgctcat 1860 ctttcatcgt gctcttccac tactagtact tcgaaacgat gtctagcctg caaccaaacc 1920 cacccattgg tcagatgttt taaatttcag cggcttccac tttctgaacg tctacagata 1980 gttaatgcga agcgcctgtg ccttaactgc cttaaggttg accattctta ccgcaactgt 2040 ccatcagaaa tgagctgcag gcattgcaat cggaggcacc acactcttct gcatacttcg 2100 acacctgatt ccacccgtcg atctacgaac ggaaatactt cagtagcctc tacgtcttct 2160 ccaacaccag tggctcaatc cagtccgtcg gcttcgtcag agacatccca gaattccctt 2220 gtgactgcag cggagcttga tcccattgtt gaagccagtg tcccggtgca acataatcgg 2280 gaaaacgtgt ttttgctgac cgtcattgta cacatcgttg attcgttcgg gcaggagcat 2340 ccagcgcgcg ctctactaga cagcgcctcg caaccgaatc taatcacaga acggctggca 2400 agtattcttc gcttgagacg cagcaatgtg aatattacag tccagggtgc tggtaaatta 2460 tccaagccgg tgcgccagtc agtctacaca gaagttagat ccaggaatca acacttttca 2520 tgcggtgtta acttcttgat aatggataaa gtcacggctg accttccatc aaggaatata 2580 tccacagctg ggtggaatat cccaaaggat ctcgttctgg ccgatccatc gttcaaccaa 2640 tcacagccaa ttgatcttgt tctcggagcc aagcattttt actccttttt ccctagcgcg 2700 gcgcgcctgc agctgtccga aaatcttcca gttcttgtgg atagcgtatt tggctggatc 2760 gtagctggtt cttcgtcacc tttttgtcct gaaagctcat cgaacaagtc tagacctagt 2820 tcgactacaa tttctatggt gacgcttgaa gaaagcattg aacgtttttg gaagaatgag 2880 gaactcgtca tgaacaacaa ctactccgta gaagaaaggc gatgcgaaac tctgtatcaa 2940 tcaaccgttt ctcgaaatga agaaggtcgc tatattgtgc gcatgccacg tcaaccagat 3000 ttccacgcaa tgatcggatc ctcaaaactg aatgcgttgc gacgattcga acttttggaa 3060 aagtgattcg aacgagattc caagctcaag acggactacc atgctttcat gcaagaatac 3120 atcgaccttg ggcatatgcg tgaggtggct gaaagcgaaa aggatccgcc cgtagcgtat 3180 tacttgcctc accatccagt attcaaagac tctagtacga caactaaggt gcgagtcgta 3240 ttcgatgggt cagcgaagac gtcgaccgga tactcgctca atgaggcact ttgtgtgggc 3300 ccggtggtac aggacgacct tctcgacatt atgatgcgtt ttcgtacata caaggtggca 3360 gttgtagggg atattgcaaa gatgtatcgt caggtattgc tgcacccgga agaccgacca 3420 ttagttagaa tctttttccg cttctcaccg caatccccga ttcgaaccta tgaactgcaa 3480 accgttactt acggcctagc accgtcttct tttcttgcga cacgcacact ccaccagctc 3540 gcagacgatg aaggtcacaa gtaccctctt ggtggtccag cattgcgcaa gaatttctat 3600 gtggacgact ttattggagg tgctcaaaat gttcaagaag caattcaact ccgtttacaa 3660 atgagtgaac tattggccaa aggagggttc gagctacgca aatggacctc aaacgaatta 3720 gaagtcttgc gtggcctgaa agatgaacaa attggaacgc agtcatcgct tcaatttaat 3780 ccgaatgaaa ctattaaagc attaggaata tgctgggaac cagaagccga cacgctgcga 3840 tttgactccc atatcagccc gtgtgacgaa ccaccaacaa agcgaagcat tctttcttgc 3900 atttctcgtc tcttcgatcc ccttgggctc attgcaccgg taattgtgaa atccaaactt 3960 ctcatgcaag agctttggct actttcttgt ggatgggacg acgcagtgcc agattccgtt 4020 cgccaaaaat gggatggtat acgcagcgag cttcctacca tttcaaccta tcgagccgat 4080 agatatgcgt ttttaccaaa cgctagtgtt caattgcatg ctttttccga tgcttcagaa 4140 gttgcgtatg gtgcgtgcat ctatgctcga tgcgaggatt ctgacggtcg agttcgtata 4200 agcctattgg cctcgaagtc tcgcgtagca cctctcaagc gtgtaagtct gccacgtctt 4260 gagctaagcg ctgcggtgct aggtgctcac ctgtacgaga gaattcgaaa atccatacac 4320 atcgagattg ctgcatcgta tttttggtca gactcagcca tcacgctgca atggatacga 4380 tcccctccta acaactggaa aacctacgta gcgaaccgcg tctccgagat acagcatttt 4440 actcatggtt gcgcatggaa tcacgtatcc ggctatcaaa accctgctga tttggtgtcc 4500 cgagggatgg cggtaataga tttcgtcaag agcaagcttt ggcgtagcgg gcccgattga 4560 ctagcctgcc cgaaggaaaa ctggccaagt tcaacccctc ctacagtacc cgaggaaaat 4620 atcgaagctc gtattgtggc tgcaacgatc acgacgccgc caatcaaccc attattctta 4680 cgctggtcct ccttttctcg tttatcgcat gtcgtcggat attgttttcg attcgctgat 4740 aattgccgct acaaaactcg atcgcaatcg ttaagtctgt ctagactaag tagaattgcc 4800 ctggatccta aggagttaca aagatcaaaa gccttcctga cccgtcttgc acaagaagac 4860 tgcttctccg atgaactacg tgagttaaaa caagagcgta ctgtagcgaa acgatccccc 4920 cttcgaaagt tgagcccatt tttagattct gagagagcgt tgagagtagg tggtcggtta 4980 aatcattcgc ttctccccta tcaagctaag caccctgccc tacttcctaa gatgcatcca 5040 tttccccgcc tctttgtgga atatatgcat aaaaagttgc aacacgccgg tggacgcacg 5100 ctcctgacag tgatcagaga agaattttgg ccaatagatg gtcggagatt ggtacgcagc 5160 gtagtacgga attgcttcag atgtcaacgt cttaatccag caccggctca gcaacaaatt 5220 ggccaacttc cagcgtcacg agttacccca agtcgtcctt tcagtgttgt aggcattgat 5280 tatgctggtc ccttctactt aaagccgatt cacaaacgag ccgcacctgc aaaatcgtat 5340 ctctgcttgt ttgtgtgctt ttcgacaaag gcagtacatt tagagctagt ttgcgagctt 5400 tccacttccg cctttttggc agcactccgt cgtttcattt cccgtcgagg tcgaccgata 5460 cacatccact cggacaacgg gaaaaatttt gaaggagcca agaacgaaat tgtccagttg 5520 tttgcaatgt tggcaggtac ccaccatatt gatgaaatca attccttttg cgcctcggaa 5580 gggattactt ggcaccttac acctcctaaa gctcctcact tcggtggcct ctgggagtcc 5640 gccgtgaagg tagcgaagaa gcacatctat cgagtgatag gctcgtctcg atatacgtac 5700 gaggattttt ccacactttt tgcacagatt gaagccgtca tgaactccag gccactactc 5760 ccgatgactg acgatcctaa cgatttagca gcactaactc cggcacattt cctgattggt 5820 acgtcgctaa tggccctgcc cgacccagat ttgcgccgga ttccgactag tcagctggat 5880 cattattggc aactacagca tctaatgcag cggttttgga ctctctggca gcaagaatat 5940 ctccaagagc tgcaaaagga caccaaatgc tacgcccgga acgatgatat cctacccggt 6000 agactcgtca ttattctgga tgaacagcag ccaaccacac gttggccact aggtcgcatc 6060 gttaaactgc atccaggaac ggacaatatc acccgtgttg tcactctgcg caccgcaaaa 6120 ggagttatca agcgtccggt aacaaaggtt tgcatccttc cgtttgcccc agcagattca 6180 ccggctccaa acgaaatcac ttgcccagca gatcagcaaa gtcagtgcga tgatgtgtcg 6240 ccaaatgcag ctaaatagat gtaaaatttc ctgtaaaatt agaattaaga tttatgttcc 6300 tctgcgagta gtcttgtaac atttgttgtt gttgttgttt tgaaagttga atgtcattca 6360 aggcggcggg ta 6372 // ID TRE3B repbase; DNA; INV; 5292 BP. XX AC AF134170; XX DT 13-AUG-1999 (Rel. 4.07, Created) DT 16-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE non-LTR retrotransposon TRE3-B, complete sequence. XX KW L1; Non-LTR Retrotransposon; Transposable Element; ORF1; KW reverse transcriptase; endonuclease; ORF2; TRE3B. XX NM TRE3B. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-5292 RA Szafranski K., Gloeckner G., Dingermann T., Noegel A.A., RA Eichinger L., Rosenthal A. and Winckler T.; RT "Non-LTR retrotransposons with unique integration preference RT downstream of Dictyostelium discoideum transfer RNA genes."; RL Unpublished. XX RN [2] RP 1-5292 RA Szafranski K.; RT "TRE3B."; RL Direct Submission to Genbank (10-MAR-1999)Abt. Genomanalyse, RL Institut fuer Molekulare Biotechnologie, Beutenbergstrasse 11, RL D-07745 Jena 07745, Germany. XX DR GenBank; AF134170; Positions 1 5292. XX FH Key Location/Qualifiers FT CDS 139..1347 FT /product="TRE3B_1p" FT /translation="MADDHSTELQLNHTDNNTTNTNNTSTTSTTKNKSAKI FT DTPKKQTPTNQLSLNKGLDNQLDIIAPPSITRPLSFLEKNNNTINIKCYFN FT QPHIREELTSKDDTRIQQIHKNIHKLFEDILSPSAFKIIGATAFVNCNVDK FT LTSIINLQNQLALNIDKLNYRFDISSNDYAIIKTFIKTYSPKHANDAITNN FT LTKFTSCTSIESFTVTYSYAVAFKSYIITYHLIDHTHNIVDQKIEQPSGIT FT RYIKAATRSVTKHKIQHAKNADMATTIKNAITNPNMNKNNQPPKNNINTPP FT INEIANSNRNNNSLQNSGSKAILNESLDINNQSQNNNEISHKSDSSNNNNG FT KNSLSMSNPTDPTIVPQKPLQSQPNKPSAPTPNTINTRRSSNALTTQPVNT FT GAAKPSKV*" FT CDS 1344..5171 FT /product="TRE3B_2p" FT /translation="MSSKINFITWNCRGFTENPNNQHNSYQKAVDTLQLLI FT SKLPPRPTISILTELNSSPLQLKHDFPDHIISVTKRGNGVIIINHNPNTLK FT LKHISTIDGRAILADVIFPNQLPIGILGIYAPASNNTIKKNSYATSLQTLL FT HNHSTNPIPNQSQHVSIIAGDFNCIHNDNSLLDDIIQSYTYQHHLIDQGEF FT KNTPTHTHGNRLDRIYTQYNRIDPENIYTQVHQIPSSLSDHNPISTTISSP FT FQINKHKKRWILSPGTANDHEAINIINHLITNHNNTSINQFTNWSKVKSKI FT IKTLKNYQYNKDKQTQRAKQILLEKAAKFKHLEEQCIAEYNIICNKNKLHQ FT NHTNILKKHINGEIPSKYLSAILNKRSKDAKIESIQYGNIITSDPDQIENA FT FVEFYTNLYSLQICCPITHQLMLNTWPIIKNEYWNGLDSPFIQDEVEAAIK FT TCNPNKSPGPDGVTNAFYINHLNQVKPILTTLFNDILENPHHITTEFTEGL FT IHTIYKKGNPLLISNRRPITLLNTDYKILSKVINARLLRILPFIINNNQTG FT FVPHRFIIDNIININELINYLKSKNLPGIITLFDFFKAFDSISHDSIKRTL FT IHIGIPIKLINLIHKLLSDSQAKISINGKTTRKFDIKRGVKQGDPISATLF FT VIVIEILARTINADNSIIGLPISPNPQIKIKFTQFADDSTTYNINYEQQQQ FT SIKHFDNFCASTSSSLNFDKSAIIEINPHKITDKHIINNIPQSKRIPITKK FT DQSERVLGYFFNHNGLHRKLPETMKTLIKSLVLWKTSGTTLKTKTTIINTY FT SLSPITYLSYLEEFTKDEEIQINKLISWFMNSPANSESPTLNTNSIENNTS FT TIHSRAKIPLMCYDRSLKPLKEGGWGMWNIQLRQVAQKIWIYNRFLQMHKS FT ANNSIYMISWMDQIINKSISSPYLIKIKKEWENYATQIGHLKDKVQILQPI FT LTKNQPSQTLNTNNISLPPTLKEIYSTILNNHQCKSKDYIGKKYSDLLLTS FT HQQSIQLLWKYTYDQLFVKIQKLKDPKGRDTMQRFHARCLPINHLHNKVCP FT ICNNEMNNDPYGHLFFNCQHTINFINHDKLKYFIYKNCNGNKNWSLTKNQT FT TKLYTLIYKPQTNNKAFKPDPTININFHFAENAQYKKYRFNWNYINTNLDL FT VRTHAYWNIISLVIHQIWIWLCKSLFDINITQSIDNWNNQINNTTLDYDIL FT KSKWHKLIRLEYSRTLSNFNQYSIKNNLTKTQKETQWSETIKKFKKEWSIN FT TNEPIPTITPPINY*" XX SQ Sequence 5292 BP; 2309 A; 1161 C; 489 G; 1333 T; 0 other; aaacaaagaa ataaacgaga atttatacat ataataaata aatcactgtt atatcttcac 60 cgccacattc attcctctta attttaccct ttttttgctc aataaagcaa gggctatcgt 120 cctatatact tttcaacaat ggctgatgat cactcaactg aattacagtt aaatcataca 180 gataacaaca caaccaatac aaataatacc tccacaacct caacaacaaa aaacaaaagt 240 gcaaaaatcg acacacctaa aaaacaaacc ccaacaaacc aactctcact aaacaaagga 300 cttgataatc aattagatat cattgcacct ccatcaatca caagaccact ctctttctta 360 gaaaaaaaca ataacaccat caacattaag tgttacttta atcaaccaca catccgtgaa 420 gaattaacct caaaagatga cacaagaatt caacaaattc acaaaaacat tcacaaactc 480 ttcgaagata tactgtcgcc atcagcattc aaaataattg gtgcaactgc cttcgtaaat 540 tgcaacgttg ataaattaac ttcaataatc aaccttcaaa accaacttgc actaaatatc 600 gacaaactca actatagatt cgatatatca tcaaatgatt atgcaatcat taaaacattc 660 attaaaacgt actcacccaa acatgcaaac gatgcaatta caaacaacct cactaaattc 720 accagttgta catcaattga atcatttact gtcacatact catatgctgt tgcgtttaaa 780 agctacatta taacatatca tctaattgat cacacacaca acattgttga tcaaaagatc 840 gaacaaccat ctggtatcac aagatatatc aaagctgcaa cacgtagtgt cactaaacac 900 aaaattcaac atgcaaaaaa tgcagatatg gcaacaacta taaaaaatgc catcacaaac 960 ccaaacatga acaaaaacaa tcaaccacca aaaaacaaca ttaacacccc accaatcaac 1020 gaaattgcaa attccaatcg caataataac tcacttcaaa atagtggttc aaaagctata 1080 ctaaatgaat cacttgatat taacaatcaa tcccaaaaca ataatgaaat ttcacacaaa 1140 agtgattctt caaataataa taacgggaaa aactcattat caatgagcaa tccaaccgat 1200 ccaacaattg taccacaaaa acctctacaa tctcaaccaa acaaaccaag tgcaccaact 1260 ccaaatacaa taaatacaag aagaagcagc aatgccctga ctacacaacc agtcaatact 1320 ggtgctgcca aacctagtaa ggtatgagct caaaaatcaa ctttataaca tggaattgta 1380 gaggattcac tgaaaaccca aacaatcaac acaattccta ccaaaaagct gttgacacac 1440 tacaactact tatatcaaaa cttccacctc gtccaacaat ctcaatactc actgaactca 1500 attcatcacc attacaactt aaacatgatt ttcccgatca catcatctct gtcaccaaac 1560 gaggaaatgg tgtaatcatc attaaccata acccaaacac actcaaactc aaacacatat 1620 caacaattga tggtagagct attttagcag acgttatatt tccaaatcaa ttaccaattg 1680 gtattcttgg aatttacgca ccagcatcaa ataataccat caaaaagaat tcatatgcaa 1740 cttcactaca aacacttctc cacaaccact caacaaaccc aataccaaac caatcccaac 1800 atgtatctat tatcgcagga gatttcaact gtatccacaa cgacaattca ctattagatg 1860 atattatcca atcctacaca taccaacacc atctaattga tcaaggtgaa tttaaaaaca 1920 ctccaactca tacacatggc aacagattag atagaatata tactcaatac aatagaatag 1980 accctgaaaa tatatataca caagtccacc aaattccatc ttcattatca gatcataatc 2040 caatatcaac aacaatttct tcaccttttc aaataaataa acataagaaa agatggatat 2100 tatcacctgg tactgcaaat gaccatgaag caatcaatat tatcaatcat ttaataacta 2160 atcacaataa cacatcaatc aatcaattta caaactggtc aaaagtaaaa tcaaaaatta 2220 ttaaaactct aaaaaactat caatataaca aagacaaaca aactcaacgt gcaaaacaaa 2280 tactattaga aaaagcagca aaattcaaac atcttgaaga acaatgcatc gctgaataca 2340 atattatatg taacaaaaac aaattacatc aaaaccacac aaacatatta aaaaaacata 2400 taaatggaga aatacctagc aagtatctat cagcaatact aaacaaaaga tcaaaagatg 2460 caaaaattga atcaatccaa tatggtaaca tcattacatc cgaccccgac caaattgaaa 2520 atgcgtttgt tgaattctac acaaatctat atagcttaca aatatgttgt ccaattactc 2580 accagctcat gctaaacaca tggcccatca taaaaaatga atattggaat ggtttagact 2640 caccatttat acaagatgaa gttgaagctg caattaaaac ctgtaacccc aacaaatcac 2700 ccggtccaga tggcgttaca aatgcattct acattaatca tttaaaccaa gttaaaccaa 2760 ttctcactac attattcaac gatatactag aaaatcctca ccacatcaca acagaattca 2820 cagaaggtct tatacacaca atatacaaga aaggcaaccc cttactcata tcaaatcgtc 2880 gtcccattac acttctaaac actgattaca agatcctctc aaaagtaatc aatgcacgtc 2940 ttctgagaat acttcctttc atcatcaaca acaaccaaac tggtttcgta ccccacagat 3000 tcatcataga caacatcatc aacatcaatg aattaatcaa ctacctcaaa tctaaaaacc 3060 tccctggtat catcacacta tttgacttct ttaaagcatt tgattctatc tctcacgata 3120 gcattaaaag aacattaatt catattggta taccaataaa attaataaat ttaatccaca 3180 agctattatc tgactcacaa gcaaaaattt caattaatgg taaaacgacc agaaaattcg 3240 atattaaaag aggtgtaaaa caaggtgacc caatctcagc cactttattt gtaattgtaa 3300 ttgaaatatt agcaagaaca ataaatgcag acaattcaat aattggatta ccaatttcac 3360 ccaatccaca aattaaaatc aaatttaccc aatttgcaga cgactctaca acatacaaca 3420 tcaactacga acaacaacaa caatcaatca aacatttcga taatttctgc gcttcaacat 3480 catcatcatt aaattttgac aaaagtgcaa taatagaaat caacccccac aaaatcactg 3540 ataaacacat aataaacaac attccacaat caaaaagaat accaataacc aaaaaagatc 3600 aatcagaaag agttcttggc tatttcttta atcataatgg tttacatagg aaattaccag 3660 aaacaatgaa aacactgatt aaatcattag tgctatggaa aactagtggc acaacattaa 3720 aaacaaaaac aaccatcata aacacctact cactatcacc aataacatat ttatcatacc 3780 ttgaagaatt tacaaaagat gaagaaattc aaattaataa attaatctca tggtttatga 3840 attctcccgc aaattctgaa tcaccaactc tcaataccaa ttccattgaa aacaacacat 3900 caaccatcca ttcacgtgca aaaatacctt tgatgtgtta tgatagatca ttaaaaccat 3960 taaaagaagg tggttggggt atgtggaata tacaattacg acaagttgct caaaagattt 4020 ggatatacaa cagattttta caaatgcata aatctgcaaa caattcaata tacatgataa 4080 gttggatgga tcaaatcatc aacaaatcaa tctcttctcc ataccttata aaaatcaaaa 4140 aagaatggga aaactatgca actcaaattg gacatctaaa agataaagtt caaatcctac 4200 aaccaatatt aacaaaaaac caaccctctc aaacattaaa cacaaataat atttcactac 4260 caccaacact caaagaaatc tactcgacaa tactaaacaa tcaccaatgc aaatcaaaag 4320 actatattgg caagaaatat tcagatctac ttcttacttc gcaccaacaa tcaatacaac 4380 tattatggaa atacacatac gaccaacttt ttgtcaaaat tcaaaaatta aaagatccaa 4440 aaggccgtga tacaatgcaa agattccatg ctagatgtct tccgattaat catctacaca 4500 acaaagtttg ccccatttgc aacaatgaaa tgaataatga tccctatggt catttgtttt 4560 tcaattgtca gcacacaatc aacttcataa accacgacaa attaaaatat tttatatata 4620 aaaattgcaa tggaaacaaa aactggtcac taacaaaaaa ccaaacaaca aaattataca 4680 cactcattta taaaccacaa acaaacaaca aagcatttaa accagatcca actatcaata 4740 ttaattttca ttttgcagaa aacgcacaat ataaaaaata caggttcaac tggaattata 4800 taaacacaaa tcttgatcta gtcagaacac atgcatattg gaacataatc tcattagtta 4860 ttcaccaaat atggatatgg ttatgcaaat cattatttga catcaatata actcaatcaa 4920 ttgataattg gaacaaccaa ataaacaaca caacactaga ttatgatata ttaaaatcaa 4980 aatggcacaa gctaataaga ttggaatatt caagaacatt atcaaacttt aaccaatact 5040 caattaaaaa caacctcaca aaaactcaaa aagaaaccca atggtccgaa accatcaaaa 5100 aatttaaaaa agaatggagc ataaacacaa atgaaccaat tccaacaatt acaccaccaa 5160 ttaattatta attacctaat ctaaaataac ataccatttt aaattcacaa aaactaaaaa 5220 acctaaaata aataaataaa aataacatac taattgtata ctacaatagt ttatttatat 5280 caaaaaaaaa aa 5292 // ID Copia-2_TCa-LTR repbase; DNA; INV; 253 BP. XX AC ChLG7; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_TCa_; KW Copia-2_TCa-I; Copia-2_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-253 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG7; Positions 6097140 6096888. XX SQ Sequence 253 BP; 62 A; 60 C; 54 G; 77 T; 0 other; tgttgggaat tacattgtat tcagacgcga atttgtgccc tgcgagtgct tgttgttagt 60 gcgcatgttc ggcttcagat gaagatgaag gatcctcgag agagagtaag agttgatgtg 120 cagtcgccga cgtatagact cgcaactctc ggttattgtt tccaattctt tcctgcccgt 180 tccattaata tattaaccgt attatcaacc gtgtttcaat actacctacc actgctcgac 240 caccaccaca aca 253 // ID PERERE-5 repbase; DNA; INV; 5057 BP. XX AC BN000796; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni Perere-5 non-LTR retrotransposon (EST). XX KW CR1; Non-LTR Retrotransposon; Transposable Element; SR1; KW PERERE-5. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-5057 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000796; Positions 1 5057. XX FH Key Location/Qualifiers FT CDS 377..1615 FT /product="PERERE-5_1p" FT /translation="MVGSVHKCGQPYCLFPVEEGMQCDECKKWFHKMCTRL FT SPAAYKRCSKPNSHWLCMFCCSSKTLLIQEAISLLALAKKVDIGRAGNMDT FT DNEDCISVVNAAMAGTRHPISQHEAKICTPIQPMKATPLSTEGPVALAATE FT DPARTTIVQNSSNLDAENSGKWITRKRKRDGKTSKESARMVGKSSLDSRPE FT HQATPLKSKCKEIINTVVDGNSLTSPNVVGSNSLDPHDTVIKKANSKKPVA FT NRQDLTSRSKAVNTTFKAVKQVPSVIIWNLAESKDTDPAKRHAHDLQLVGS FT LIRNILPKEVPGVHISKVIRLGKYVGGETQQRRILKLVLGNFEERDVLLNN FT ARKTRDSNIRIRPDWPLEDRTKWKNALTELKTRRLNGEANLTIKGFRVVRS FT WKPILPRPVWVERMSPKTP" FT CDS 1657..4386 FT /product="PERERE-5_2p" FT /translation="MSELSLMVDDLNPDIIVITETWLTVDIDCSPVISGYI FT CVRSDRVRSRKGGGIILYIRDHFRINLIISEAHVSGTCEVVCCTIRSRNGL FT VTIGGIYRSPCCLADDFILKHVCSWSKAECCLIVGDFNAPHINWIELTAPG FT GGFDSVLLSSIIQCALVQCIVKPTHIDLEHNPSLIDLVLTHHCEDIADIQH FT LPPLVNSDHVVLSFKFRIHGIEFASAPPRPNIWRANIPAIRDYAISIDWSV FT DADGSIDDARNRFKSTFESVTQPFIPWSARKLSYCPPWINKETRKLLKRRK FT HFWDLFLSTGQASFKCEYKNRNVCKKTIAKSRTAYERQLAYDSRNCPKRLF FT SYVKRRTKRSDGIPSLLLSENSALLTESDLAKAETFANYFSEVYSTCSVTT FT TCPVDNEIPAIGPLHVTEDIVLPLITNLKPCKIPGPDGLHPRLLSSLADII FT SRPLTMLFNMSLSQAQLPRDWKDAIVSPIFKDGQRRIVSNYRPVSLTSIVV FT KLLEKLIRAKLLSHIDSYNLLTPEQHGFRNKHSCLTNLLIAREDWIAALDN FT GQSVDVIFIDFSKAFDKVSHLGLMQKLSSFGITGAIQEWIGSFLCDRRQGV FT RVNGTLSEWKPVKSGVSRGTILGPLRFLLYINELPLLLRSSTLLFADDVKI FT WRTIRNETDRCYLQADLDELVRWAHDWGLEVNSRKSVFMHIGHDISYHYTI FT NGTSLPRVHEHKDLGVIISHDLKTTTHCNAVASKGYRALWSLRRAFKCFDE FT DMFRILYPTYVRPHLEYCIQAASPCLIKDTKSLERVQRVGTKLVKGLSKLP FT YDERLKRLNLFPLSYRRTRGDLILAFRIFNYDLGVNMSYLFAPSSTNNLRG FT HSKKVLKPRSNKLKVGSRFSHRVVNDWNALPEQVVSAPSVNIFKKKLDLHW FT KEICQD" XX SQ Sequence 5057 BP; 1466 A; 1068 C; 1072 G; 1451 T; 0 other; cgacgagctc aaacggtaat tcatccactg cactgaaaca attcaagtag aagaaacccg 60 gcctgatccc aagtctattt agagaaagcc agtcatttta caacactgtg gaaacaccgc 120 gcgcgctatt taaaaggacg gtggtattgt ttcacgttga gtttaattca ctgttttgat 180 cgtaaattac gtatcaaaac taccgtttac actttaattg aatttatttc agttaacgtg 240 acattgtatt tgtgtgtttt gattctgctt ttgtgtattt actgcatcaa agcattcttg 300 gtcgactgag gcttgtttat ttgataaggt tgtgatttat cttacatatc taccttgcct 360 atcattttat tctaagatgg tcggctctgt acacaagtgt ggacagccgt attgcttatt 420 ccccgtggaa gaagggatgc aatgtgacga atgtaagaag tggtttcata aaatgtgcac 480 ccgcttaagc cctgctgcct ataaaaggtg ctcgaaacca aattcccatt ggctctgtat 540 gttttgttgc tcaagcaaga cattacttat ccaggaagcc attagccttc tagcgttagc 600 taaaaaggtc gacataggtc gtgctggcaa catggatact gataatgaag attgtatcag 660 tgtcgtaaat gctgccatgg caggtaccag acacccaatt tcgcaacatg aagcaaaaat 720 ttgtactccg atacaaccaa tgaaagccac gccattaagt actgagggac cagtcgcttt 780 agcagcgact gaggacccgg ctagaaccac tatcgtccag aatagttcca atctggatgc 840 tgagaatagt ggcaaatgga ttacgcggaa acgcaaaaga gacggaaaaa catccaaaga 900 aagcgcgcgt atggtcggta agtcatcgtt ggactcacgt cctgagcatc aggcaactcc 960 gctcaagtcc aaatgtaagg aaataattaa tactgttgtg gatggaaata gtctaacttc 1020 accaaatgtt gttgggagta actcgcttga cccacatgac actgtcataa agaaagctaa 1080 tagcaagaag ccggtagcta accggcagga tctgacatca cggtcaaagg ccgtgaacac 1140 gacgtttaag gcagttaaac aggtgcctag tgtaatcatc tggaacttgg cagaatcgaa 1200 agacactgat cccgctaaga gacatgccca cgacctgcag ttagtgggat ctctcatcag 1260 aaatatactg ccaaaggagg tcccaggggt acatatctcg aaagtcatca gacttggtaa 1320 gtatgttgga ggcgaaaccc aacagagaag gatccttaaa ttagtactcg gaaattttga 1380 ggaaagggac gtattactaa ataacgcacg caaaactcgt gactcaaaca ttcgtattcg 1440 acctgactgg ccacttgagg atcggaccaa gtggaagaat gccctaacag agttaaaaac 1500 tcgaaggcta aacggtgaag cgaaccttac gattaagggt tttcgggtag tcaggtcttg 1560 gaagccaata cttccgagac ctgtgtgggt agaacgcatg tctccaaaaa caccctaaat 1620 gtgtgttata cgaatgctcg tagtcttcgg aataagatgt ctgaactaag tttgatggta 1680 gatgacttaa atccggatat aattgttatc accgaaactt ggctcactgt agatatagac 1740 tgttctccag tcatttcagg ctacatctgt gtcagaagtg atagagttag aagtcgcaag 1800 ggaggaggca taatattata tattagggac cacttccgca ttaacttgat catatcggag 1860 gcgcatgtaa gtggtacgtg tgaggtagta tgttgtacaa ttaggtctag aaacgggtta 1920 gtgacgatag gaggaatata tcgtagtccg tgttgtctcg ctgacgactt tatcctaaaa 1980 cacgtctgtt cttggagtaa agcagaatgt tgtctcatcg ttggagactt caatgcaccc 2040 catataaact ggatagagct cacagctcca ggtgggggtt tcgatagcgt ccttctgtca 2100 tcaattattc aatgtgcact tgtacaatgt atcgttaagc cgacgcatat agacttggaa 2160 cataatccgt cattgataga ccttgtcctc acacaccact gtgaagatat cgccgacatt 2220 caacaccttc ccccgctggt gaatagtgac catgtcgtac tttcctttaa gttccggata 2280 catggtatag aatttgcatc tgctccaccc cgtcctaata tatggagagc taatattccg 2340 gcaataaggg actatgctat ctcaattgac tggtcggtcg atgctgatgg atcgattgat 2400 gatgcacgga ataggtttaa gtctacgttc gaatccgtaa ctcaaccgtt tatcccatgg 2460 tcagcacgta agctatcata ttgccctcca tggattaata aggaaacccg gaagctgttg 2520 aagcgtagga aacacttctg ggacttattc ttgtcaacag gtcaggcatc atttaagtgc 2580 gagtataaaa accggaatgt atgtaaaaag accatagcta aatcccgcac ggcttacgaa 2640 cgacagttgg catacgatag ccgtaattgt ccgaagcgac tgttttcgta cgttaaaagg 2700 cggacgaaac gtagtgatgg tatcccctca ttactgttaa gcgaaaattc agctttattg 2760 actgaatctg atctagcgaa agcggagaca tttgctaact acttcagtga agtctactct 2820 acgtgtagtg ttacaactac ttgtcccgtt gacaatgaga taccagccat aggaccttta 2880 cacgtgacgg aggatatagt tcttcctttg ataactaacc taaaaccttg taagataccg 2940 ggccccgatg ggttacaccc acgtttgctg tcgtcgcttg cggacattat ctcaagaccg 3000 ctcacgatgc tcttcaacat gtccctttca caggctcaac tacctagaga ctggaaagac 3060 gccatagtta gtcctatatt taaggatggt cagagacgga tagtatctaa ctaccggcct 3120 gttagcctga ccagtatagt agttaagtta cttgaaaaat taattcgggc taagctactg 3180 agccacattg attcgtacaa tctgttaact cccgagcaac atgggtttcg taacaagcat 3240 tcatgcttga ctaacttact cattgcgaga gaggattgga tagctgcact agacaacggc 3300 cagtctgtcg acgtgatatt tatagatttt agcaaagcgt ttgataaggt ttcacactta 3360 ggtcttatgc aaaagctctc gagctttggt ataacaggtg ctatacagga atggatagga 3420 agcttcttat gtgaccgcag acagggagtg agggtcaatg gaacgctatc cgaatggaaa 3480 ccggtcaaaa gtggtgtatc tcgaggcacc atcttaggcc ccttacgttt ccttctgtat 3540 atcaatgaat tacctttatt acttaggtcg tcgacgttat tgtttgcgga cgatgttaaa 3600 atttggagaa caataagaaa tgagactgat cgttgttatc tgcaagcaga tttagacgaa 3660 ttagttaggt gggctcatga ttggggcctg gaagttaatt ccaggaagag cgtgttcatg 3720 cacatcgggc acgatattag ttaccactat accattaatg gtacatctct gccacgcgtt 3780 catgagcaca aagacttagg agtaattata agtcacgact taaaaactac tacgcactgc 3840 aatgcggttg cttccaaagg ctatcgtgcg ctatggtcgc ttcgcagagc atttaaatgc 3900 tttgacgagg atatgtttcg aattttatat cccacctatg taaggccaca cttagaatac 3960 tgtatccaag cagctagccc ttgtctgata aaggatacta aatcgcttga gcgtgtccag 4020 cgtgtaggga caaaattagt aaagggtctt tctaaactcc cttacgatga gcgtttaaaa 4080 cgtctaaacc ttttcccctt atcttatcgt aggacgcgag gtgaccttat attggcattt 4140 cgtatcttta attatgatct gggtgttaat atgtcttatc tttttgctcc ctccagcact 4200 aataatctcc ggggtcatag caagaaggtt ctgaaaccgc gatctaataa actaaaggtg 4260 gggtcccgtt tttctcatcg agtggtcaat gactggaacg ctttaccaga acaagtggta 4320 tcagcaccat cagtgaacat tttcaaaaag aagctggacc ttcactggaa ggaaatctgt 4380 caggattaac acaggttcat caacctacta tccttattac tgaagactga agccatttgt 4440 atcggaatac tgttttcagc acttcatttt ataattacac aattccatac ttccttctga 4500 actaccttca tttatacagt tccttgtcca ctcatacctt gcccacactt atttttcttt 4560 aacacattag tatgttctta cctaataaat acttcagtaa taggcaaata gactaaatgg 4620 ttatacctta actattcctc catgaatatt gtgtgctctg cttttcgtcc ttcttttttc 4680 ttacaccttt ttcagcctcc tgcagatact ccaccgaaaa cagatatggc acgggtgact 4740 tcgtcctaca ccgttattaa tgcttccgat attttggagg gatataagtt cgaaactttt 4800 aaaagccccc aacttcgcat gtatctattc tcgaccatca gttgtatgtt gcaacgctac 4860 ctccataaag tgcgccaagt gtgatattga catgttcctt ataaaatcca tttacattca 4920 tatcttatga aatatcaacc cagctgagta attttcaaca gtgaaaattg taacttcttt 4980 aacaatttga tcttgcctgt aaaccggcat gcctcatgta acaatctgtt atttgagaat 5040 aaattattat tattatt 5057 // ID Gypsy-87_AA-LTR repbase; DNA; INV; 210 BP. XX AC supercont1.127; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-87_AA_; KW Gypsy-87_AA-I; Gypsy-87_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-210 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.127; Positions 1722779 1722570. XX SQ Sequence 210 BP; 63 A; 48 C; 42 G; 57 T; 0 other; tgtagtatcg taaccttaat tgtaacgctc agagaaataa ttatataacg gttgccggct 60 ttggcgtcta gcgcattgag ctactactga tagcggtcgt cagttaacga ataaagatca 120 ttagtgtttg agagaccaaa cgatatcgac gtgttattca ttactcctga tattcccgaa 180 cgaacccccg aacgaactcc cgacactaca 210 // ID Crack-4_AAe repbase; DNA; INV; 3925 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3925 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1220-1220 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >94% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 56..874 FT /product="Crack-4_AAe_1p" FT /translation="MNSSEKREAIPSENDXVRNSLNDLRQLIRNDILEASD FT RTNSRIDDMSFQIKQSVSKLEAEMETIKNSQQFISDEFETMKASIIHQKEN FT IVALKKEVSQIRTDCDTTHQHVEELNYELNALKQISLEGHMLVSNVIKVAD FT ENLEELLARMCTLLDIVCSSESVLSVSRLSSSNQQGIQPILVRFTCVSMKE FT KLMKAARERPIFCDEIGLGVKQRIYFNHRLTPANQRLLGAARRFKKQFNYK FT FAWFTNGEIFLRKDESSRAIKITDIRDLNGLN" FT CDS 920..3775 FT /product="Crack-4_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDYCPNLLNELNIEYKLDVIDVSDMNLNILCLNTRSC FT RNKMDELTQMVLDLKKKTIQVIVFTETWLHDGEFCNIMDYTAYHSCRKDRR FT GGVSIFVLNSLKSQMILELNYDVNNFLIIELLDLKIKVMGVYNPGRNVHLF FT LDKFEELISSVDDLIVCGDFNLNLLDSSNELVENYRYRVESLGFIILNSLN FT PLHATRLSNTISTTIDHFITNYLDKRMHLITKDTECHLSDHKTLILSIEVQ FT VSKPEIQEIKYCVKYERFLNNSFAQIVNECDSFSTLTQKLSSIVEKHKEPL FT VIKKSYKIRKPYINNELLLEIERKNHLYKSFKEATPNSPLKSELHSKYTRL FT RNRIRNKTKMAKENYYKTKIESHKGDGRKTWELLKELVFQQSKSGPSHNIA FT LQESGVMLNDAKEISDCFNNYFINVGEMCAPQRTSYAFHSFMPQAVEYTFN FT FVRVEEDSIIKVVESLNSSAASGLDRISVKFIQKCRGYLIGKITELVNEAI FT HTCVFDDTLKIAKIIPVYKTGSKYDKTNYRPISVLPTLSKIVEKVLTQQLS FT NYIFQNNLVHTNQFGFVPKSSTESATLELINFVVKGLDDGQFVACIFIDLK FT KAFDCIPHEILLEKLKYYGLSVSAIQLMTSYFSNRQQICCVNGILSDAKVI FT CTGVPQGSVMGPLLFNLFINDLLQLPLKGLLQCYADDAVSKYRANNLNLLQ FT EMMQHDLELMHQWFSANKMSMNTEKTNFILFTTSFYTPSLTLSINNEPLRQ FT VQETNYLGLIIDNRLKWNNHINKVKKKILPYIFAMKKVRKCLGVQSCWQIY FT SSYILSQLTYLVCIWGSAASTHLNVLKVLQNRAIKSIRRLPTLFPTIALYT FT YKYLSLSDLYRFSLIFTIYKIKNGIIKSNIEIIPATNIHSHATRNRDRFYT FT NRPRTELASRHIFYAGIVNYNLLPNNIRNETIALNFKRKLKTYIVENM" XX SQ Sequence 3925 BP; 1428 A; 643 C; 677 G; 1173 T; 4 other; tgactctccc gacgaacttg aaaaaacaaa tactcaacgt cacaaacaac caagaatgaa 60 ctctagtgag aaacgtgaag ctataccaag tgagaacgat saagtgagaa attctttgaa 120 cgatctacga cagctgatac gtaacgatat cctcgaagca agtgatagaa caaactcaag 180 gatcgatgat atgtcgttcc aaattaagca gtcggtatcg aaacttgaag ctgaaatgga 240 aactatcaaa aattcgcagc agtttatctc tgatgaattt gagacaatga aagcttcaat 300 tatccatcag aaagaaaata tagtagcgtt aaagaaggag gtatcacaaa tcagaacaga 360 ctgtgataca actcatcagc atgttgaaga acttaattat gaattaaatg ctctgaaaca 420 aatcagcttg gaaggtcaca tgctggtttc caacgtgatc aaggtggcgg atgaaaatct 480 ggaagaacta ttagctagaa tgtgtacact gttggatatc gtatgcagct ctgaaagcgt 540 cttgagtgtg agtcgcctgt cttcatcaaa tcaacaagga attcagccaa tcttagttcg 600 ctttacttgc gtgtcgatga aggaaaagct tatgaaagcc gcacgggaac gcccgatctt 660 ctgtgacgaa attggcttag gggttaagca acgcatctat ttcaatcatc gcctcacgcc 720 tgcaaaccaa cgacttttag gagcagcgcg taggtttaaa aagcagttta attacaaatt 780 tgcttggttc acaaatggtg aaatattttt gagaaaggat gagtcctcaa gagcaataaa 840 aattactgat atccgcgatc taaatggatt gaattagtaa tattatacct aacagaatta 900 tatttgtatt gttaatatta tggattactg tcctaattta ctaaatgaat taaatattga 960 atataagctt gatgtaatcg atgtatctga tatgaatttg aatatactat gtttaaatac 1020 acgcagctgc agaaataaaa tggatgagtt aacacaaatg gtactagatt tgaaaaaaaa 1080 aactattcaa gttattgttt tcaccgaaac ctggctacat gatggcgaat tttgtaacat 1140 tatggactat acagcttacc attcgtgtag aaaagatcga agaggaggag tttctatctt 1200 tgttttgaat agtttgaaga gtcaaatgat tttagaactc aattatgatg taaataattt 1260 cctgatcatt gagttattag acttgaaaat aaaagtaatg ggcgtctata atccaggaag 1320 gaatgttcat ttgttccttg ataaatttga agagcttatc tcgagcgttg atgacttaat 1380 tgtctgcggt gactttaatc ttaacctttt ggatagctct aatgagctag ttgaaaacta 1440 tcgttataga gtggaaagtt taggatttat aatactgaat agcttgaatc cgttgcatgc 1500 tacccgtcta tcgaatacca tttccactac catcgatcat tttatcacga attatcttga 1560 taagagaatg catttgataa caaaagacac tgaatgtcac ctatcagacc ataaaactct 1620 aattttatct atcgaggttc aagtttccaa acctgaaata caggaaataa aatactgtgt 1680 gaagtatgag agattcctca ataactcatt tgctcaaatt gtaaatgaat gtgacagttt 1740 cagtaccctt acgcaaaaac tttcaagcat tgttgaaaaa cataaagaac ccttggtgat 1800 taaaaaatca tacaaaattc gaaaaccata cattaacaat gagttactac ttgagattga 1860 acgtaaaaat catttatata aaagctttaa agaggcaacg ccaaactcac cgttaaaaag 1920 tgaattgcac agtaaataca caagattgcg aaatcgaata agaaacaaaa ctaaaatggc 1980 aaaagaaaat tactacaaaa ctaaaatcga aagccataaa ggtgatggaa gaaaaacttg 2040 ggaattatta aaggagttag tttttcaaca aagcaaatca ggcccatctc ataatatagc 2100 tctacaggaa agtggagtaa tgctgaatga tgctaaggaa atatctgatt gtttcaataa 2160 ttattttata aacgttggcg aaatgtgtgc acctcagaga acttcatatg cattccattc 2220 ttttatgccc caagcagttg agtatacatt caatttcgtt agagttgaag aggatagtat 2280 cataaaagta gttgaatctc tcaattccag tgctgctagt ggactcgata gaatttccgt 2340 aaagtttata caaaagtgca gaggttatct aatcggcaaa ataacwgagc tagtaaatga 2400 agcgattcat acttgcgtat ttgatgacac cttaaaaata gcaaaaatta tccctgtata 2460 taaaacaggc agtaaatatg ataaaactaa ttaccgtcct atctcagttt tacctacgct 2520 gtcgaaaata gtggaaaagg ttctgactca acaactttcc aattacatat ttcaaaataa 2580 tctagtccac actaatcagt ttggttttgt accgaaatca agcactgaat cagcaactct 2640 agaacttata aactttgtag tgaaaggact agacgatggt caatttgtgg cgtgcatwtt 2700 tatagaccta aaaaaagcgt ttgattgcat tcctcatgaa atcttgctag aaaaattgaa 2760 atattatggm ttgagtgttt cagctattca acttatgaca tcatacttct cgaatcgtca 2820 acaaatatgc tgtgtgaatg gaatattaag tgatgccaaa gtcatttgta ctggagtgcc 2880 ccaaggctca gttatgggac ctttactatt taaccttttt attaatgatt tactgcagtt 2940 accattgaaa ggcttacttc aatgttacgc tgatgatgct gttagtaaat atagagcaaa 3000 taatttgaat ttactccaag aaatgatgca acacgatttg gagttaatgc accaatggtt 3060 ttctgccaac aaaatgtcta tgaacacaga aaagaccaat ttcatattat ttacaacatc 3120 cttttatact cccagtttaa ccctttctat aaataatgag ccactgcgac aagtacagga 3180 aacaaactat cttggattga ttattgataa tcgcctgaaa tggaataatc acattaacaa 3240 ggtaaagaaa aaaattttgc cctacatttt tgctatgaaa aaagttcgta aatgtcttgg 3300 agtacaaagt tgttggcaaa tatacagttc ttatatcctt tctcaattga catatttggt 3360 ttgcatatgg ggttccgcag caagcacaca tctgaatgta ttgaaagttc ttcaaaatcg 3420 agccataaaa tctataagaa ggttacccac actatttccc actattgcac tttataccta 3480 taaatattta tctttgagtg atctgtacag atttagtttg attttcacta tttataagat 3540 caaaaatgga attattaaaa gcaatattga aatcatacca gcaactaata tacattcaca 3600 tgctaccagg aatagggata ggttttatac caatagacct cgaactgaat tagcttcaag 3660 acacattttt tatgcaggaa tagtgaatta caacttatta cccaacaaca ttagaaatga 3720 aacaattgca ctgaatttca aaagaaaact aaaaacatat attgttgaaa acatgtaaat 3780 aaaattcatt gatgaataga attaagatat tagagtaagt taaattataa gatgctcaca 3840 tcgcctaatt atatttccca gctaagtagc tgcgcacagt ttgtacaaac ttttcagcta 3900 tgtattgaaa taaagcaaaa aaaaa 3925 // ID Gypsy-49_AA-LTR repbase; DNA; INV; 288 BP. XX AC supercont1.288; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-49_AA_; KW Gypsy-49_AA-I; Gypsy-49_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-288 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.288; Positions 217653 217366. XX SQ Sequence 288 BP; 67 A; 53 C; 76 G; 92 T; 0 other; tgtgaggtgt atactggact tatcctttca gtttcctttc acactactga tgacggttga 60 atgatgattc gtttgataga tttatgtgtt ggtaccactg ttcgagccac acaacggcga 120 atgttgaata cggcaacgtt ggcgcttcat caagcatcat tgtgtagtga acgctcgtgg 180 aggtaagtcg tgctcggggt acaagtgcat ctcgggcgag tgattcttca tcatgggaat 240 tacaaatggc tgccttgagg gtaagtcctg tattattggt aatttaca 288 // ID BEL-635_AA-I repbase; DNA; INV; 5982 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-635_AA_; KW BEL-635_AA-LTR; Pao_Bel_Ele49; BEL-635_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5982 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5019-5582] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 33..5981 FT /product="BEL-635_AA-I_1p" FT /translation="MEDLRDSIDGHNCKRCTRPNSAEDQMVACDKCHEWEH FT FGCANVDGNVRDQSYVCKECTGETSGQLHKGTAGRENRPKHSDSRSVMTTR FT SKTKLKTPVGSQAGSRVSERVAVLEAELKIVEEERLAKEEEMKEKEKMKKK FT EIEERKRQLEEKRKLLEEEAELRQLQLAEERKIQEEQQQIRQDSLTKRKAL FT LRQIAESNAGSKCASEVDVQDNVSSWLADLQLTGRPLECGEDNQGRHKEVV FT NPSPTGIPTGPVDAVNPQVTSRMGVKNPVVHHVYQTNPNGSTRFSPPNPVQ FT RDLNYQHALHDNPWIPSRYRAMATEEPIGRMVENNTPTVLGAEHIAARQVM FT GKELPTFCGNPEDWPIFICCFEQTTITCGYSDAENLIRLQRCLKGPALEAV FT RSRLLLPSSVPSVIETLRTLYGRPELLIRSLLRKVQQVAAPRHDRLESVME FT FGLAVQNLVDRLEAAHQESHLTNPILIQQLVEKLPGPLRLDWGIYKKHYPM FT PTLRTFGKFMSQLVSAASEVTFDLPVASGGFKNERYKPREKAHIQAHSIGS FT AIQTSSSSKFSTNSRKTGKICVVCDCEGHRVCDCLKFKAMNVDERWKTIQH FT KGLCRTCLNNHGKWPCKSWQGCGVQDCRNKHHTLLHCPVPSAPSHSVNVSS FT NVSMSSEGTYTFFRVLPVVLHYKEQSVMVFAFIDEGSELTFLDASVASQLG FT VNGENEPLTLKWTGNVTRREAKSQKVHLEISGKKGKTKHQLVGAHTVSSLS FT LPSQSMRYQELAKKFPHLRGLPIEDHARIQPQLLIGLDNLQLGVPLKIRQG FT RQGEPIAAKCRLGWGIYGSKPDAGSSQAVINFHMTEVDNPDMLLNEQLKNF FT FTVESMGTEGTQAKLESDEDKRAKRMLADTTRRTSSGFEVALLWKTDNPEL FT PDSYSMAARRLQSLEKRLKQDPSMKQKIREQIAEYERKGYAHRITNAELET FT ADPRRVWYLPLGVVVNPKKPEKIRLIWDAAAKASGISLNSQLMKGPDFLSS FT LTAVLIQFRHFPIAVSGDIREMFHQLKMRAEDQQSQRFLWREDPSERPEIY FT VMDVATFGATCSPASAQYVKNVNAAAFSDQYPRAVIAIQKYHYVDDYLDSF FT ETIQEAKQVVKEVRTIHAEGGFELRNFRSNSSELLRSIGEKTSNNIKDMTL FT ERSAISESILGMRWVPVEDTFTYTFSLRDDLRTILENNHNPTKREVLKILM FT TLFDPLGFIAFFLIHGKVLMQDIWASGCDWDEQINEALCIRWKQWTRFFPQ FT LDALRIPRCYFKSLVPGDKNQLQVHVFVDASEGAYACVAYFRLATGEDVSV FT SLIGAKSKVAPLKSLSIPRLELKAAVLGVRYLESISNQHSFKVHQRYMWSD FT SATVLAWINSEHRRYNKFVAVRIGEILTSTEQKNWRWVPSKLNIADQATKW FT NEGPKLQSDSQWFQGPSFLHNAEEEWPRQSKLPLTEEELRPNHSRVLHHKH FT HQPIIDVSRYSKWIRLHRVMAYVLRFLDNVQRKVQGIPLQVGTLQQEELER FT AECELWREAQSEVFANEIRVLSSSKGGPDDQHLTVNKSSPIYKTWPYVDDK FT GVLRKRSRAGAATFMSFEAKYPVILPHDHPITFLLIDWYHQFYRHANRETI FT ANEVRQRFEIARLRSLIYKVAKSCVWCRVMKAKPQPPIMAPLPEYRVTPYV FT KPFTSVGLDYFGPLLVRVGRSQVKRWVALFTCLAIRAIHLEVVHSLSTESC FT VMAVRRFVARRGSPAEFYTDNATCFQGASRELQEEIGNKLSTTFTSAHTKW FT RFIPPATPHMGGVWERLVRSVKVAAGTILEAPRKPDDETLETILCEVEAMI FT NCRPLTYIPLDSADQESLTPNHFLLGSSNGSKMMPMEPLKFIGNLRSSWKL FT AQSITDSFWTRWIKEYIPIITRRSKWFENVKDLEVGDLVMVIGGPSRSQWV FT RGRIVEVVKGKDGRVRQAMVKTSSGVHRRAAVNLAVLDVLDEAKPGNGSSD FT VPGFTEGG" XX SQ Sequence 5982 BP; 1778 A; 1274 C; 1496 G; 1434 T; 0 other; aacttaaaga tattttgtac gtgggttccg aaatggaaga tcttcgcgac tcgatagatg 60 gccacaattg caagcgatgt acccgaccga attcagcgga agaccaaatg gtagcttgcg 120 acaaatgcca cgaatgggag cattttggtt gcgcaaatgt tgatggaaac gtaagagatc 180 aatcgtacgt ttgtaaggag tgtacaggag aaacatcagg tcagctacat aaaggaacgg 240 caggacgcga aaatcgaccc aaacactcgg attcccgtag cgtaatgacc acgcggagta 300 aaacaaagct gaaaactccg gttggaagcc aggctggctc tcgtgtttct gaacgggttg 360 ctgttctgga ggcggagctg aaaatagtag aagaggaacg tctggcgaaa gaagaggaaa 420 tgaaggaaaa ggagaaaatg aagaagaagg agattgagga aaggaagcgt cagttagagg 480 aaaagagaaa gttgttggaa gaagaagccg agttacgaca gctgcagtta gcggaagaac 540 ggaagattca agaagagcag cagcagattc ggcaggattc tttgacgaaa aggaaagcac 600 tcttacggca aatcgcagaa agcaacgctg gcagcaaatg tgcatccgag gtcgacgttc 660 aggataatgt ctcatcgtgg cttgcggatc tgcaacttac tggaagaccc ttggaatgtg 720 gcgaggacaa tcaagggcga cataaggaag tcgtcaatcc aagccctact ggcattccta 780 ctggccccgt agatgcagtc aatcctcagg ttacctcgcg aatgggagtt aaaaatcctg 840 tcgttcacca tgtttatcaa acgaatccaa acggttctac tcgtttttcg cctccaaatc 900 cagtgcagag agatctcaac tatcaacacg cattgcacga caatccttgg attccatcta 960 ggtatcgtgc catggcgaca gaggaaccaa taggacgaat ggtcgaaaac aacacgccga 1020 cagtgcttgg agccgaacac atagcagctc gtcaagtgat gggtaaagag ttacctacgt 1080 tctgtggcaa tcccgaagat tggcccatat ttatctgctg cttcgaacaa acaacaatca 1140 cctgcgggta ttctgatgct gaaaatttaa tccgattaca aaggtgcctg aaaggacctg 1200 cactagaggc agtacgcagt cgcctattgc taccgtccag tgttccctcg gtaatcgaaa 1260 cgctacgcac tttatatggt cgtccagagt tactgattcg atctttgcta cgaaaggttc 1320 agcaagttgc ggctccaagg cacgaccgtc tagaatcggt aatggagttt ggattagcag 1380 tacaaaactt agtagaccgc ctggaggcag cgcatcagga aagccattta accaatccta 1440 ttctaataca acaattggtt gaaaaactac caggtcctct caggctagac tggggaatat 1500 acaagaaaca ttaccccatg cccacactgc ggacatttgg taaatttatg tcccaattag 1560 tcagcgcagc aagtgaagtt acgttcgact taccggtcgc tagcggtggt tttaaaaatg 1620 aacgatataa acctagagaa aaggcccata ttcaagcaca ttcaataggg tctgcaatac 1680 aaactagttc ttcctccaaa ttttcgacta acagtaggaa aactggtaag atttgtgtag 1740 tttgcgactg tgagggtcat agagtctgcg actgtctcaa gttcaaggcg atgaatgtgg 1800 acgagcgatg gaaaaccata caacacaaag gactttgcag aacatgcttg aataatcatg 1860 ggaaatggcc atgtaaatcc tggcaaggat gtggtgttca agattgtcgc aataagcacc 1920 atacattgct ccactgtcca gttccatctg cgccatcgca ctcagtcaat gtttcttcta 1980 atgtctccat gtcttctgaa ggtacgtata ctttcttccg cgtgctaccc gttgtactgc 2040 attataaaga gcaatcagtc atggtatttg ctttcattga tgaggggtcg gagctcacct 2100 ttctcgatgc gagcgttgca agtcaattgg gcgtaaatgg cgagaatgaa ccgttaacat 2160 tgaaatggac tggaaacgtg acccgaaggg aagcaaaatc tcaaaaggtg cacttggaga 2220 tctccgggaa aaaagggaag acaaagcatc agttggtcgg tgctcatacc gtaagtagtc 2280 tgtccctacc ttctcagtcc atgagatacc aagaattggc aaaaaaattc cctcatctgc 2340 gtggtcttcc gatagaagat catgcacgga tccaacccca acttttgatt ggactggata 2400 atctgcagtt gggtgttcca ttaaaaatcc gtcaaggacg gcaaggagaa ccgattgctg 2460 caaaatgccg actaggttgg ggtatttatg gcagcaaacc agatgctggc tcttcccaag 2520 cagtgataaa cttccacatg accgaagtgg acaaccccga catgctgttg aacgagcagt 2580 tgaaaaactt tttcacagtt gaaagtatgg gaacagaagg tacgcaagcg aaactggaat 2640 ctgacgagga taaaagggcc aaacgaatgc tggcagacac taccaggcgt acctcatcag 2700 gatttgaagt agcccttttg tggaagactg ataatcctga gctaccggat agctattcga 2760 tggcagcacg taggctgcaa tccctggaga agaggttaaa gcaagatcca agtatgaagc 2820 agaaaatacg agagcagata gcggaatacg aacgaaaggg ttacgctcat cggataacga 2880 atgctgagct agaaacagct gacccacggc gagtgtggta cttaccactt ggtgtggtag 2940 tgaatcccaa aaagccagaa aaaataaggc taatttggga cgcagccgca aaagcatcag 3000 gaatttctct taattctcaa ctgatgaagg gtcccgactt tctgtcgtca ctaacggctg 3060 tactcattca gtttagacat tttccgattg ccgtaagtgg tgacattagg gaaatgtttc 3120 accaactaaa aatgcgagct gaagaccagc aatcccagcg attcctgtgg agggaagatc 3180 cgtcggaacg tccggaaata tatgtgatgg atgtggctac ttttggggca acgtgctccc 3240 ctgcgtcggc acaatatgtg aaaaatgtca acgcggcggc attctccgat cagtatcccc 3300 gagctgtaat tgccatccag aaatatcatt atgtagatga ctatttggat agctttgaga 3360 ctattcaaga agccaagcag gtagtgaagg aggtgaggac aatacatgca gaaggaggtt 3420 ttgaattaag aaatttccga tcgaactcat ctgaactgct gcgcagcatt ggagaaaaga 3480 cttcaaataa tatcaaagat atgacgttgg agcgcagcgc gatatctgaa tcgatactcg 3540 ggatgagatg ggttcctgtc gaggatacat tcacgtatac attctctcta agagatgatc 3600 tgcgcacaat tttggagaat aaccacaatc caacaaaacg cgaagtactg aagatactta 3660 tgaccctttt cgatccttta ggcttcattg ccttttttct tatacatggg aaagtgctca 3720 tgcaagatat ctgggcctca ggatgcgatt gggacgagca aatcaatgaa gctttgtgca 3780 ttcgttggaa gcaatggacg cgattctttc cacaacttga tgcattgcgc attccacgct 3840 gctattttaa gtcgctggtt ccaggtgaca agaaccagct acaggtccac gtgtttgtgg 3900 atgcaagtga aggtgcctac gcttgcgttg cctatttccg cctggcaaca ggagaagacg 3960 taagcgtgtc actaatcgga gctaaatcca aagtggcgcc tctcaagtcg ttgtctattc 4020 ctagattgga actcaaggca gcagtccttg gggtgagata tttggaatca atcagtaatc 4080 agcattcctt caaagttcac cagcgataca tgtggagtga ctcagcaaca gtgttggcgt 4140 ggataaattc cgaacaccgt cgttataaca aattcgtagc cgttcggatt ggagaaattt 4200 taacgtctac ggagcagaaa aattggcgct gggtaccttc caagttgaat atagcggacc 4260 aagccaccaa gtggaatgag ggaccgaaac ttcaatcgga cagtcagtgg ttccaaggtc 4320 cgagtttcct acataatgca gaagaagaat ggccgaggca gtcaaaactt ccactaaccg 4380 aagaagaatt acgcccaaac catagccgcg tgttgcatca caagcatcat cagccgataa 4440 ttgacgtcag tcgctacagt aaatggatca ggttacaccg tgtaatggcg tatgttctgc 4500 gctttcttga caatgtgcaa cggaaagtac agggcattcc actgcaggta ggaacgctcc 4560 aacaagagga actagaacgt gcggagtgcg agttatggag agaagcacag tcggaagttt 4620 tcgctaatga aataagagtt ctatctagca gtaaaggtgg tcctgacgat caacatctca 4680 ccgtgaacaa atccagccct atctataaaa cttggcctta cgttgacgat aaaggcgtac 4740 tacggaaacg tagtcgagca ggagcagcaa ctttcatgag ttttgaagcg aagtatccgg 4800 tgattctacc ccatgaccat ccgattactt tcttgctcat cgactggtat catcagttct 4860 accgccacgc taacagagaa actattgcaa atgaagtaag acaacgtttc gagattgcaa 4920 gactacgttc cctgatttac aaggtagcaa aaagctgtgt ttggtgtcgt gtgatgaaag 4980 caaaacctca gccacccatt atggcaccac tcccggagta tcgagttacg ccttatgtta 5040 aaccgtttac atctgtcggg ttggactatt tcggcccatt gctggtgcga gttgggcgta 5100 gtcaagtaaa acggtgggtt gctctattta cctgcctggc aataagggct atacacctcg 5160 aggtcgtgca tagtctttca actgagtctt gcgttatggc agtacgacga tttgtggcaa 5220 gacgcggttc accagcagaa ttctatacag acaacgcgac ctgttttcag ggtgccagca 5280 gggaattgca agaagaaatt ggtaataaac tctctactac atttaccagt gcgcatacaa 5340 agtggagatt cattcctccc gcaacaccac acatgggcgg tgtttgggaa cgcctggtgc 5400 gttctgtaaa agtagcagct ggaacaattt tagaagcgcc acggaaaccg gatgacgaga 5460 ctttggagac cattctgtgc gaggttgaag cgatgatcaa ttgtcgtcct ctcacctaca 5520 ttcctttaga ctctgcagat caggaatctt taacgcccaa tcatttttta ttgggcagtt 5580 ctaatggatc aaaaatgatg ccgatggaac ctttaaagtt tattgggaat cttcgcagta 5640 gctggaaatt agcgcaatcc attacggaca gtttttggac gagatggatt aaggaataca 5700 ttccaatcat caccagacga tcaaagtggt ttgaaaatgt taaggacctt gaagtaggag 5760 atttggtgat ggttattgga ggtccgtcga gaagtcaatg ggtgcgcggg agaattgtag 5820 aagtcgtgaa aggcaaagat ggcagagttc gtcaagcaat ggtgaaaaca tcttcaggag 5880 ttcatcgacg agcggcggtg aatctagcag ttttggacgt cttagatgaa gcaaaacctg 5940 gtaacggatc ctcggatgtt ccaggattta cggaaggggg aa 5982 // ID MuDR10x_AP repbase; DNA; INV; 2129 BP. XX AC Contig4024; XX DT 25-JUN-2009 (Rel. 14.07, Created) DT 25-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR10x_AP. XX NM MuDR10x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2129 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1359-1359 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 821..1429 FT /product="MuDR10x_AP_1p" FT /translation="MLGTMDIKTHTGEPFILSNDSDKNIIIFSCKTNIDFL FT CKSETIYLDGTFDYCPKLFTQLFTLHGYFNNNYVPLVFALLPNKTKQTYKH FT FLNILVLVCQNVGLILKPTTVICDFEESIHIALREVWNNVNIFGCRFHLTQ FT SWYRKIQNLGLCKAYKNNKTPEGEWLNNIFGLIFLNPNEVGSCFVDDFMST FT IPHDIKFQMFAD*" XX SQ Sequence 2129 BP; 784 A; 316 C; 298 G; 731 T; 0 other; gtgtttgtaa tttgcaccaa aatagagggt atgtaatttg caccaccctc aggtagtaag 60 aaataggaat tgtaatttgc accaaaatta aaatttaaga tagtatcatt agtatcgata 120 gtattactcg atattctcgt tgacgcgacg tcaagactag atgaacacga atgacaataa 180 taatttttcg ggatttttca ccttatctgg taatattttg acgttttacg tccaattgtc 240 ggtcgtaatc gcgttatata tatatttata taatgtatat cagtttatta ttttgatcaa 300 atatcaatta agctacacgt ttagtctttc cgtttcgtcg taatacgtaa tttaacgtaa 360 catttttgta ttgcagtatt taattttttt cttgaaatat ggaagttaag ttaattaaaa 420 gtaatcgcgg ggaagatatt ttagaatacg ataaccaaac tttttatttt gcatacaaaa 480 caaaaactac aaatataatt cgttggcgtt gtacacagag aaaatgttca gcaaaattgt 540 ttactacagg tgaaaatttg atgattgtcc gtgatgagag tgtacatgta aatcacgatg 600 agaaaaactt aaacgttata aataaaatca attgtgcaaa ataaaagcac gagaatcacc 660 atatgaaaaa ccagccaaaa ttttaagaaa tgtagtgcaa aattcaacag attccaatgt 720 tatcactact aaggatattt cggatttaag acgtaatata tataatacga agaggaagat 780 attgccaaca tttccagtat cacagaatga agtacattca atgcttggaa caatggatat 840 caaaacacac acaggtgaac cgtttatatt atcaaacgat tctgataaaa atataattat 900 attttcttgc aaaacaaaca ttgatttttt atgcaaaagt gaaacaatct atcttgatgg 960 gacattcgat tattgtccga aattattcac tcaacttttt actttacatg gatattttaa 1020 taataattac gttcctcttg tatttgcgct gttaccaaat aaaacaaaac aaacgtataa 1080 acatttttta aatattttgg tacttgtatg tcaaaatgtt ggtctaatat taaaacctac 1140 aacagttata tgcgattttg aagaatcaat tcatatcgca ttaagagaag tttggaataa 1200 tgttaatatc tttggatgtc gttttcactt aactcagtct tggtacagga aaatacaaaa 1260 cttaggacta tgtaaagcgt ataaaaataa taaaaccccc gaaggtgaat ggttgaataa 1320 tatttttgga ttaatatttt taaatcctaa tgaagtgggc tcatgttttg ttgatgactt 1380 tatgtcaaca ataccacacg atataaaatt tcaaatgttt gctgactgac ttgttgactg 1440 acaattatgt agacaacaat tcattatttc caccgtatat ttgggcagtc gacacataca 1500 agtatacaac gaatgcatgt gaatcctttc acagtaaatt taatgcagag ttttatcatc 1560 cacacccaca agttttcaat ttcattaaag ttctaacaga ttttcaaaca gatacttata 1620 taaaactatc aagttaacat ataactccgc gaataacttc tcaaacatgt caacggacga 1680 aacaaattca agctgctttg gaccagtata ataataaatt gatttcaaga acttcattca 1740 ttgcattagt tgcatacaaa tacaaaaaac gtataagtat taaaacagcg aagtataaca 1800 gcaattaata ttatgtacct acataagttt atttttttcc aactataaat tattatttac 1860 agattattcc accttcaatt aactacgaaa cataaaatga tatcaaaatg caacataggt 1920 aaatttgatt ttttttttga atttaacaca taattatttt aactgacgaa ctttttttgt 1980 atacatttgt ttctttttta aataaacata aacgtatact atcgaaaaaa taatcgattt 2040 taattttggt gcaaattaca attcctattt cttactacct gagggtgatg caaattactt 2100 acccttattt tggtgcaaat tacaaacgc 2129 // ID Gypsy-191_AA-LTR repbase; DNA; INV; 136 BP. XX AC supercont1.102; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-191_AA_; KW Gypsy-191_AA-I; Gypsy-191_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-136 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.102; Positions 1414607 1414742. XX SQ Sequence 136 BP; 43 A; 33 C; 16 G; 44 T; 0 other; tgttgcggtg tttactataa tgtccctctt caatattcca actatacacc cctgtgcatt 60 ctcaaaacgt caaaagacat cgttgtaaca ttaagttaat aatgctacct tctgaaataa 120 acacgtctct tcaaca 136 // ID Copia-15_SI-I repbase; DNA; INV; 4089 BP. XX AC AEAQ01019900; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_SI_; KW Copia-15_SI-LTR; Copia-15_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4089 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01019900; Positions 4519 431. XX CC Positions [1504-2028] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1183..2901 FT /product="Copia-15_SI-I_1p" FT /translation="MGYTVTFHNDECCIVRACDGVQVALGEPCGNLYKLKT FT PNVIYAISNPDPEPMKNCVHKWHSILGHRDIQLVKTLPSSDLVNGIHFDEC FT SKECNNILNCEVCLKGKMSRMKFPSSNNRAKEKLDLIHSDVCGPMQTATPS FT GKRYVLTFIDDYSKFTAVYLIKEKLEVFGRFREFVEMCKKMFNRKPKLIRT FT DRGGEYMSVEFIKFLDQEGIQYQRTAPYSPQQNGAAERKNRTLLEMARCMI FT LEAKLENKFWGEAVVMANYIQNRLPARDIVRTPFENWYGTKPNLNSFKRFG FT SKCYVHVPNEKRRKLDSKATEAIFVGYDAISKAYRCYVPSSAKVVISRDAK FT FVYKDSDWKINQDQFKEENEVTITHLPEDEFYSADEDETIEENDPATSDAF FT RDLQQSPIAQRDINEHVPVNNENEIRIPELRRSTRSNLGVPPCRYTEKRIN FT VIKEEKVEPKTYNQAISSEQKNDWIQAMKNEIKSLKENETWELVDLPKDRR FT AIGCKWVYKIKSYGENEIFKARLVAQGFSQKYGIEYDEVFAPVVKHATYRI FT LFTVAARKQMKVIHLDAKSVSKRKIK" FT CDS 2870..4075 FT /product="Copia-15_SI-I_2p" FT /translation="MLRAFLNGKLSETIYMKQPPGFTEEGREHQVCLLRRS FT IYGLRQSARVWNQEIHQTLTAAGYIQSKNDPCLYIKYNKGKICYVLIYVDD FT LIVASEDDNMLEECEKILSAKFKIKNLGDVRNYLGLQIKRDTNGNFALNHK FT RYIMKIIKDFGMMEAKTSNVPISVSYGKGGNSELLIDNENYRKLVGCLLYI FT AVNTRPDISASVTILAQKVTKPTQEDWNELKRILRYLKGTADLSLMLGGIN FT CNTGLVGYADANWAEDVALQQRKSTTGYVFKYFDGVISWCSKRQTCVAKSS FT MEAEYIALSEASSEALWIRRLLRDFKQPEEDPTTIYEDNQSCLKLIKEEKL FT SNRSKHIDVHAYFVKDHVDKGTIKCVYCPTENMVADLLTKPLSANRIVKLR FT ETCSLMEI" XX SQ Sequence 4089 BP; 1451 A; 703 C; 913 G; 1022 T; 0 other; tgctattagg ttatgggccc agtccctcga aagtagtgct ttgaagactg aaagagttca 60 agtgaattga cttaacctat aaaacgacag actgcataaa aacgatgacg tcggaatcga 120 agtatattat aacgaagttg aataatggaa attattttaa ctggcgatac aagatgaaga 180 tgttattaat cgagaagggc gtttggacag taattacagc agatgcgcca aatccggtta 240 cacaagactg gacgcaaaag gatgaaaagg cacacactgc aattgcattg aacattgagg 300 acgaccagat tcaacacatc agaagttgcg atacagcgaa aagtgcatgg aagaacttgc 360 aagattttca cgaaaagaac acgccgaaca acaaagtgag catcctgcga cagctcatga 420 caacacgatt agacgaaggc ggtaatgtag aatctcacgt aactaaaatg acggaattat 480 ttcaacgctt catagcttat ggagatgatt taaagcaagg aattaatttt gtgtgttaca 540 attctcagtt cactacctga aagctatgat gtattagtta caactttaga gaatcgcaaa 600 gatgaattaa catcaagtgt tgtttgctcg gctgtgattg cggaatatag aaaacgtctc 660 gaacgaagcc gtgataataa tactgaggct gtattaagga tcagttcgtc aaccaaagga 720 tcatcaaatg caaaaggtga agcatttgac gtgagcaagt cgaaatgtct tttctgcaag 780 cgcaaaggcc actggaaaaa ggattgccgt aagttacttg cgcataaaga gaaaaagagt 840 caagaacaac aaagtaagat tcagcaacag gtaaattcgg cagaagctag cggcagtgct 900 caacatttat tcgcagcaat gacgtttaat agtgacggct ggttggttga tagtggagca 960 actaatcaca tagtgagtag acgtgaaatt tttaccgaca tgaaaaagca ttccgaaagc 1020 atctacgtag caaatggtta taaagtaact gcagtcggta aaggaacagt cgatgcaaaa 1080 tttatgaata aatctggtga cattgttgag gttacgatta aaaatgtatt gtatgtacca 1140 gatatataag gaaattttat ttctgtacga cgtttaaaca aaatgggcta tacggtgact 1200 ttccacaatg atgaatgctg tatcgttcgt gcatgcgacg gagtacaagt tgcattagga 1260 gaaccttgtg gaaatctata caagctgaaa acaccaaatg taatatatgc aataagtaat 1320 ccggatccgg aaccaatgaa gaactgcgtt cacaaatggc atagtattct tggccataga 1380 gacatccagc tagtgaagac attgccatca agcgatcttg tcaacggtat tcattttgac 1440 gaatgcagca aggaatgtaa caatatatta aactgcgagg tatgtcttaa aggtaagatg 1500 agcaggatga aatttcccag ttcgaataat cgagcaaaag aaaagttaga cttgattcat 1560 tctgatgtct gtgggcccat gcagacagcg acaccatctg gtaaaaggta cgttctcacc 1620 tttattgatg actattctaa atttactgct gtctacctta ttaaagaaaa attggaagtt 1680 tttggtagat ttagagaatt cgtggagatg tgcaagaaga tgtttaatcg taaaccaaaa 1740 ttaattcgca cagatagagg aggagagtac atgagtgtcg aattcatcaa gtttttggat 1800 caagaaggca ttcaatatca aagaacggca ccgtatagtc cacagcagaa tggcgcggcc 1860 gagcgtaaaa acaggactct cctcgagatg gcaagatgca tgatcttgga agctaagctg 1920 gaaaataaat tttggggcga agcagtcgta atggcaaatt acatacaaaa tcgtttgcca 1980 gccagggaca ttgtacgaac accgtttgaa aattggtatg gtactaaacc taatctaaat 2040 tctttcaaac gatttggatc gaaatgttac gtacacgtac ctaatgaaaa gaggaggaaa 2100 cttgattcga aagcaaccga ggcaattttt gttggctatg atgcgatctc aaaagcatat 2160 aggtgctacg ttccatcaag tgcaaaagta gtcataagtc gagatgccaa attcgtctac 2220 aaggacagcg attggaagat taatcaagat cagtttaagg aggagaatga agtcacaatt 2280 acgcatctac ctgaagatga attctacagt gccgatgaag acgaaacaat agaagaaaac 2340 gatccagcaa cgagtgatgc atttagagat cttcagcaat caccgatagc tcaacgagat 2400 attaatgaac acgtccctgt taacaatgaa aatgaaatca gaataccaga acttcgtcgg 2460 tctacaagat cgaatttagg ggtaccacca tgtcgttaca ctgaaaaaag aatcaacgtc 2520 atcaaggaag aaaaagtcga gccaaaaaca tacaatcaag caatcagttc cgaacaaaag 2580 aatgattgga ttcaagcaat gaagaatgaa attaagtcac ttaaggaaaa cgaaacgtgg 2640 gagcttgtag acctaccgaa ggatcgtcgt gcaattggat gcaaatgggt atataaaata 2700 aaatcctatg gagaaaatga aatattcaaa gcacgactgg ttgcacaagg tttctcccaa 2760 aaatatggaa tcgagtatga tgaagtgttt gctccagtgg taaaacatgc aacatacagg 2820 atcttattca ccgttgcagc cagaaagcaa atgaaagtta ttcatctcga tgctaagagc 2880 gtttctaaac ggaaaattaa gtgaaacaat ctacatgaaa caaccccccg gcttcacaga 2940 agaaggaaga gaacatcagg tttgtctctt gaggaggagt atatacggct tgagacaatc 3000 cgctagagtg tggaatcagg aaattcatca aacacttaca gctgcgggtt acatacaaag 3060 caaaaatgat ccatgcctgt atattaagta taataaaggt aagatctgtt acgtactaat 3120 ctacgtcgat gatttgattg ttgccagcga agatgataat atgttagaag aatgcgagaa 3180 gattttaagc gcaaaattca aaataaaaaa tcttggagat gtccgtaatt atcttggttt 3240 gcaaattaaa cgggacacca atggaaactt cgcactcaat cacaagcggt acatcatgaa 3300 gatcatcaaa gatttcggta tgatggaagc aaaaacttcg aatgttccaa tcagcgtgtc 3360 atacggcaaa ggtggaaatt ctgagttact tatagataac gaaaattatc gtaaattagt 3420 tggttgtttg ctttacatcg ctgtgaatac aagaccagac atatcagcga gtgtgacaat 3480 cttggcacaa aaggtgacaa aaccaacaca ggaagattgg aatgaactaa agcgtattct 3540 tcgatatctc aagggtacag cagacctttc gttgatgctt ggtggcatca attgtaatac 3600 tggcctcgtc ggttatgccg atgcaaattg ggcagaagac gtcgcgttgc aacaacgcaa 3660 gtcgactacc ggctacgttt ttaagtattt tgatggagtg atcagctggt gtagcaagag 3720 acaaacttgt gttgctaaat catccatgga agctgaatat attgcactat cagaggcaag 3780 cagtgaagcg ctatggattc gtcgattatt gagagacttt aagcagccgg aagaggaccc 3840 aacaaccatc tatgaggata atcagagctg cttgaaatta atcaaagaag agaagctgtc 3900 aaatcgttca aagcacattg atgtgcatgc atattttgtc aaggatcatg tagacaaggg 3960 aactatcaaa tgtgtttatt gcccaacaga aaacatggtt gctgatctat tgacgaaacc 4020 attgtcagcc aatcgaatcg taaaattacg agagacttgc agtctgatgg aaatttgaat 4080 gaggaggag 4089 // ID BEL-2_BMa-LTR repbase; DNA; INV; 541 BP. XX AC AAQA01000514; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Brugia malayi genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_BMa_; KW BEL-2_BMa-I; BEL-2_BMa-LTR. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-541 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Brugia malayi genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AAQA01000514; Positions 1766 2306. XX SQ Sequence 541 BP; 172 A; 129 C; 60 G; 180 T; 0 other; tgtcgagaac aagctcgaaa ctccactaga aataatcttt gtaccgaaca atcgaattta 60 gcaaagggtc aaatttaaat caatgtattt atatttaagc aaatcatgtt aatcatttat 120 tgtaagtttt attgtaattc aaattcttta atgactcata gatggtaatg tgttaacaaa 180 ttcccatgct aatctgtcat attatctaat gcacatcttt tccaatcctt tgtttgttat 240 tctaattccg tttacaaaaa ctaatctttc ccataagcct tccttcctag cttcccattg 300 tatcaatcgc tttatctctc caccctcaaa tcatgctata taagggcaat taccactcaa 360 ccgaaggatt tgttcccaaa agtctccccg ttgcttagct catcctcctc actaaattgt 420 caaactactg attgcttgtt tggtttaatt acccaagata aagcactcga ctcaattgtg 480 caagtgaacc aacccaattc aatccattca caacaccaat ccaatccaac cattcacaac 540 a 541 // ID BEL-70_CQ-I repbase; DNA; INV; 6816 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-70_CQ_; KW BEL-70_CQ-LTR; BEL-70_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6816 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX CC Positions [5049-5630] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(651..1571,1575..6014) FT /product="BEL-70_CQ-I_1p" FT /translation="MTPSLRSLSKQEHFLNDSLDMLLQLVENYNEQRDKDA FT LEGWRQHAATLYKEFKSIRLQTEVQMDQEKPETDPDNEANLRKIRFDFETK FT YLKVYSLIVVQLENLKKAEQATLNSVPPKSEETPDLTHSRIKLPEVKLPNF FT DGQLIHWINYRDTFVSMIDSNPSLQPIEKFAYLNSSLAPEAKRHIETIDVT FT AVNYSVAWDILVQRFDNKKLIVRAYIECLLSIEPMEKESYESLARVIDEFE FT RNLNMIGKLEVDTKGWSVLLAHMVCARLDDSVLRQWEQFHKSKEVPTYPEL FT IKFLRDHLSILQSLPSKSRSKESTKPDPGRLTKPRFSHAVTISSGSRSCPF FT CKKPPHSFFLCETFRKLPVQQRFDAVKRNSLCINCLSPDHLVKKCTSSSCR FT VCGQKHHTMLHQNSTLRFSNSNPPQKLSSLPAPDQPQTQNQITSNVVPTPQ FT TNMSQHQSEPTLPSTSHVSTTLHGNARTLPATVLLQTAVVKVSDSTGHFVW FT ARALLDPASQLSLMTEQLSQKLKLKRVKANQEIGGVGNTTVVSSYIVEARI FT HSHCTDFFADVSFNVLSGITRELPARALCTEDWNLPTDIVFADPSFYQPGP FT IDMILGIEVYYELIEEGLTRLGPGQPVLQKTVLGWVVSGKIGSHPAPTLSL FT TCLCHSLTLDEQLERFWELESCQSTSTLSVEETQCEAHFTATTTRDESGRF FT VVRLPRRPAVMSTLGSSKEIAIRRYLALERRLTANPSLKESYSKFIEEYRT FT LGHMAEVDEAECSSSSLPTYFLPHHCVLRPESTTTKLRVVFDASCATDTGV FT SLNDCLMVGPVVQQDLFSIVLRFRWYRYAMVSDVEKMYRQVLVHPADRPLQ FT MIVWRDSPHEPLRFYQLTTVTYGTSCAPYLATRCLAELAKVGETTYPEAAK FT VVATDFYMDDLLTGVNDVAAGQQLCSHLQNLLGSAGFCLRKWSSNCPSILE FT NLPKELRDERALFEFNSPTGPIKTLGIQWDIASDRILFTVPDWNKSLVITK FT RVMVSDAAKLFDPYGLIGPVIVIAKLNLQDVWSMQKNWDEPLAEVLQQIWL FT DYRASLEELRNLSVPRCVIPVIYVVSRQLHGFCDASEKAYGANIYLRTETS FT DGKAKTSLLASKSRVAPLGNPKKKKQQKTNLPRLELNGGLLLAHLLVKVIV FT ATNFDGEVFLWTDSEIVLHWLNSLPSRWKTFVANRVSEIQHLTAKAVWNHV FT PGSENPADIISRGMLPGLLIFSVLWWFGPFWLSLPRRYWPATFRSKEESFK FT KEDLEERPVSLAAQTVEPNPIFGKFSSYTHLIRTVALLRRFSYNSNPLNQN FT NRRIGFLRTAELKESMETLVRLVQRETFPDELAALADERSVEPHSKIKNLA FT PFLSDGIIRVGGRLGHAPVSEGRKHPILLPAKHPFTDLVVRYFHEKMLHAQ FT PQLLIASVRSQFWPLRIRNTARRIVHTCVSCFRCRPTNLEQLMGDLPEERV FT TPTLPFLSTGVDLCGPFKYRHPELRNKFTICYVAVFVCLTTKAVHVEMVGD FT LTTGAFLASLRRFVARRGKPRLIECDNAQNFKGALNELVVLREQFNSQQHQ FT HDVVRAAAEDGIEFKFIPPRSPNFGGLWEAAVKSLKTHLRTVLGNSVLTAE FT QLNTLLTQVESCLNSRPLTPLSNDPQDFEALTPGHFLIHRPLTAIPEPSLE FT EVPQNRLDRWQVVQEFLRRIWKRWTTDYLSGLQQRTKWTSKKDNIRIGTMV FT LVKDENRPPLHWHLGRVVHVYSGTDGNVRVVDIRTKNGRFRRAISKICILP FT IHQPSQPGESDLADDL" XX SQ Sequence 6816 BP; 1781 A; 1769 C; 1642 G; 1622 T; 2 other; tttttttggt ccttgctacc cggattaatc gcgtcgcgag tgaaacggac taatcgtgtc 60 gtaaaatcga cgatcaatta ccgtcgcggt acaaattgat cttgtgaaaa agtggtctga 120 acattttgtg ctaccaggaa gacataaaaa gtgatctccg cttgatcgga aaattgaaga 180 aagatccaga aagaattgcg gcgagctgca gatcatctgg accagatcgg cgtgggatcg 240 agaacgagag ttgctgatac gaaacgtggt cgaggttggc gtggttggct attgttcttc 300 gggaacaaaa gaggctaatt acgagggcac attgatcgtc gtcgatcgta agtacgcatg 360 catgaaagca gtgaagctgc tgattgatag attggtgaaa gattcgcatg aaaaattggc 420 taaaactacg aattgaacga cgctaattga ttgccctgaa cgacttgatc tgatctgcgc 480 gcttcggagg gatttcttga tccatgcaca cgtggtttgg tagttagagg ttagagattg 540 gttcgctgct catcgtaatc gaagttgtgg tagtgcattg acgaggtggg ttaacttact 600 gcagtaggca aaatttgggg acaaaactgg cattcgcaac actttccgaa atgactcctt 660 ctctgcggtc actttcgaag caagagcatt ttttaaatga ctcgctggac atgctactac 720 aactggttga gaattacaat gagcagaggg acaaagatgc gctggaagga tggagacagc 780 atgcggctac tctctacaaa gaatttaagt cgattcgctt acaaaccgaa gtgcagatgg 840 atcaagaaaa gcctgaaacc gaccccgaca atgaagcaaa cctgcgcaaa atccgttttg 900 actttgaaac caaatatctc aaggtgtaca gtctgatcgt tgttcaactc gaaaacttga 960 aaaaagcaga acaagccacc ctgaactctg tgccccccaa atctgaggaa acccctgact 1020 tgacgcactc tcggattaag ttaccagagg tgaaattgcc caactttgat ggacaactga 1080 tccattggat caattatcgg gacacctttg tcagcatgat cgattcaaac ccttctcttc 1140 aaccaatcga gaagttcgct tatctgaatt cgtcgctggc tccagaggca aagaggcaca 1200 tcgagaccat tgacgttact gctgttaatt attctgtcgc ttgggatatt ttggttcagc 1260 gtttcgacaa taaaaagctc attgttaggg cctacatcga gtgtctcctt agcatcgaac 1320 ccatggaaaa ggagtcctac gagtcgttag cacgtgtcat cgacgagttc gagaggaatc 1380 tcaacatgat cgggaagctg gaagttgaca ccaagggatg gagtgtgttg ctggcgcaca 1440 tggtgtgcgc acgactggat gactcggtcc tgaggcagtg ggagcagttt cacaagtcca 1500 aagaagttcc tacttatcct gagctgatta agtttttgcg cgatcatctg tctattcttc 1560 agtccctccc amcgtcaaag tcgcgttcga aagagtctac taaacctgac ccaggtcgtt 1620 tgacgaaacc aaggtttagt catgcagtca caattagtag cggttcgaga agctgtccct 1680 tttgcaagaa gccaccccac tcattcttcc tgtgcgaaac cttccgcaag ttgccagtcc 1740 agcaacggtt cgatgctgta aaacggaact ccctctgcat aaattgcctt tcgcccgacc 1800 acctcgtcaa aaaatgcact tcgagttcct gtcgtgtttg cggacaaaaa caccatacaa 1860 tgctgcacca aaattccaca cttcgtttct ctaactctaa cccacctcaa aaattgtcgt 1920 cgttgccagc acctgaccaa ccacaaactc aaaatcagat cacatcgaat gtcgttccta 1980 ccccacaaac caacatgtcg cagcatcaat cagaacccac tttgccaagt acctcccacg 2040 tatcaaccac tttgcacgga aacgcacgaa cccttcctgc taccgtgctg ctgcagacag 2100 ctgtcgtgaa ggtttcagac agcactggcc acttcgtgtg ggctagagct ctgttggatc 2160 cagcatctca actgagcctg atgacggagc agttgtcgca gaaactgaag ctgaagcggg 2220 tgaaggcgaa ccaggagatt ggcggcgttg ggaatacaac tgtcgtctca tcttacatcg 2280 tcgaagctag gattcactcc cattgtactg actttttcgc tgacgtctcg ttcaacgttc 2340 tgtccggaat cactcgagaa cttcctgcta gagcgttgtg tacagaggac tggaacctac 2400 caacagatat cgttttcgct gatccttctt tctatcaacc tggccctatc gacatgatat 2460 tgggaatcga agtctactat gagctgattg aagaaggact aactcgtctc ggaccaggac 2520 aacctgttct tcaaaaaacc gtcctggggt gggtcgtgtc cggaaaaatt ggatcccatc 2580 cggcacccac gctcagtctg acgtgcctct gtcatagcct cacgctcgac gaacaactcg 2640 aacgattttg ggaattggaa tcttgccagt ctaccagcac gctgtccgtt gaggaaactc 2700 aatgcgaagc gcattttacg gcgacaacca ctcgtgacga atccggacga ttcgtggtgc 2760 ggctaccaag gaggcctgcc gttatgtcca ctctgggaag ttccaaggag atagcaattc 2820 gtcggtatct cgcccttgaa cgtcgtctga cagcaaaccc ttcgctgaag gaatcctact 2880 cgaagttcat tgaggagtat cgtactcttg gacacatggc agaagtcgat gaagcagaat 2940 gttccagctc aagcttaccc acctattttt tgccccacca ttgcgtactc cgacctgaaa 3000 gcacaactac aaaattgcgc gttgtttttg acgcatcgtg cgcaacagac accggagtgt 3060 cgttgaatga ttgcttgatg gtcggtcctg tggtccaaca agatctgttc tcgattgtgt 3120 tacgctttcg gtggtatcga tacgccatgg tatctgacgt cgaaaaaatg tacaggcaag 3180 tcctggtaca tcctgctgat cgtccgctgc agatgatcgt gtggagagat tctccccatg 3240 aaccgcttcg cttctaccag ctgacgactg taacttacgg gacgtcctgc gccccatatt 3300 tggcaacaag gtgtttagcg gaattggcca aagttggtga aaccacgtac cccgaagccg 3360 cgaaggtagt tgccacagac ttctacatgg atgacctgtt gacgggtgtc aacgacgtcg 3420 ctgcaggaca gcagctttgt tcccacctgc aaaacctact tggctctgcg ggattctgcc 3480 tgcggaaatg gtcatcgaac tgtccgtcga ttctcgagaa tctacccaaa gagttgcggg 3540 atgaaagagc gctgttcgag ttcaactcac caactggtcc catcaaaacg ctcggaattc 3600 aatgggacat cgcttcagac cgtattctct tcaccgtacc ggactggaac aagtcgttgg 3660 tcatcacgaa acgggtgatg gtgtcggatg cagctaaact tttcgatcct tatggattga 3720 ttgggcccgt tatcgtgata gcgaagttaa acctgcagga cgtttggagt atgcagaaga 3780 actgggacga acctctcgcc gaggttctgc agcaaatttg gttggattat cgagccagtc 3840 tggaagaact acgaaacctg tcggttccac gttgtgtcat tccagtaatc tacgttgtct 3900 ctcgccaact gcatgggttc tgcgatgcgt ctgaaaaggc gtacggcgcg aacatttatc 3960 tgcgtaccga aacatcggat gggaaggcca aaactagctt acttgcgtct aaatctagag 4020 ttgctccact gggcaaccca aagaagaaaa agcaacagaa gaccaatctt ccccgactgg 4080 agctgaatgg cggactgctg cttgcacacc ttctcgtaaa ggtgatcgtc gctacgaact 4140 tcgacggaga agtgttcctg tggacggatt cggaaatcgt acttcactgg ttgaactccc 4200 ttccctcgcg ctggaaaacc ttcgtcgcaa accgtgtatc cgagatacaa catctgaccg 4260 ccaaagctgt gtggaaccac gtgccgggct ctgagaatcc ggcagacata atttcgagag 4320 ggatgctgcc tggtcttcta attttctccg ttctctggtg gttcggacct ttttggctct 4380 cgttacctcg tcgctactgg cctgccacat ttcgctcgaa ggaggaaagc ttcaaaaaag 4440 aagacctgga agaacgccct gtatcgctag ctgctcaaac tgtcgaacca aacccaatat 4500 ttggaaaatt ctcgtcttac acacacctaa ttcgaaccgt cgccttgctc aggcgtttct 4560 cgtacaactc gaacccactg aaccagaaca atcgtcgtat cggattctta cgaactgccg 4620 agcttaagga gtcgatggaa accttggttc gactagtcca gcgtgaaacc tttccggacg 4680 agttggcagc gttggcggat gaacgatcgg ttgaaccaca ctcgaaaatc aagaacttgg 4740 ccccgttctt gtcggatgga ataattcgtg tcggtggscg gctgggccat gctccagtct 4800 cggaaggaag gaagcatccg atcctgttac cagcaaaaca cccattcacc gatctcgtcg 4860 ttcgctactt ccacgagaag atgctccatg cccaacctca actcctgatc gccagcgtcc 4920 gttctcaatt ctggcccttg cgcatacgca acacagctcg gaggatcgta cacacttgcg 4980 tgagctgctt tagatgtcgt cccacaaacc tggagcagct aatgggcgac ttgccggaag 5040 aacgtgtaac accaactctg cctttcctga gcaccggagt cgatctctgt ggaccgttca 5100 aatatcgcca tccggagctg agaaacaagt tcacgatctg ctatgttgct gttttcgtct 5160 gtttgactac aaaagcggtg cacgttgaaa tggtcggaga tcttacaact ggagcattct 5220 tggcttcact tcgtagattc gtagcccgac gcggaaagcc gcgcctcatc gagtgcgaca 5280 acgctcagaa cttcaaaggt gcactcaacg agcttgtggt gcttcgtgag cagtttaact 5340 cccagcagca tcagcatgat gtcgtacgcg ccgcagcaga agacggcatt gagtttaagt 5400 tcatcccacc gcgctctccg aatttcggag gcttatggga agctgcggtt aaatccctga 5460 agacccacct gcgaaccgtt ttgggcaact ctgtcctgac cgctgaacaa ctaaacactc 5520 tgctcactca agtcgaaagt tgtctaaatt cccgtcctct cactcccctt tcgaacgacc 5580 cgcaagattt tgaggccctg acgccggggc attttctaat ccaccgtccc ctgacggcca 5640 tacctgaacc atccctcgaa gaagtgcctc aaaacagatt agatcgttgg caagtcgttc 5700 aggagtttct ccgcaggatt tggaagcgct ggaccacgga ttacctatcc ggtctgcagc 5760 agcgaactaa atggacgagc aaaaaggaca acatccggat cggaaccatg gttctggtaa 5820 aagacgaaaa ccgcccacct cttcactggc acctgggccg ggtcgtacac gtctacagcg 5880 gcactgatgg gaacgtacga gtggtggaca tccgaacgaa gaacggtcgg ttcaggcgcg 5940 ctatctccaa gatatgcata ctaccgattc accagccgtc gcagcctgga gaaagcgacc 6000 tcgctgatga cctctagaag ttgccaccca ctctaccatc gcagtttcaa actgcgcaac 6060 gaccgagagc ccaagggctc tcggttcatg taagtagtat taatcaatga attcgcaaaa 6120 aactcaagga atgtttatcc cccatcatgc ccgccggaca caccgagctt caacccacaa 6180 ccatggctcc aaccgacaag aagtaccaac aaccaactcg cttaatcgtc gatgcatcgg 6240 ccttccatcg cagctacgag ctgcgcatcg accgagggcc ctagggctct cggttccctg 6300 taagtagttt tgtttaatga accttcaaaa agcttaagga attttcgtcc cccatcatgg 6360 ctgcctgacg tcacgagctc caaccctcaa ccatgagctc caagcgcttc aaccgtcaac 6420 cacagcgctc gccgtctcat caatgggtcg ctgatcgatt gatagacgag cctggtacca 6480 ggaaagcagg agagcaggag agcagtcaac aatggtgcgg agcagctgac ttcgtatatc 6540 aacaaaccga ctggtcacgt gaccgatcgg ctgattgctg tggagcaacg aacaacgaga 6600 cagctgaccg acgattgttg ctgaacaaaa ggagaacgaa gaacagctga tcgacccgat 6660 cagctgactg atgtcaacag agagccgaac aaacacgttg ttgtaacgaa gaagaagatc 6720 gaactaaaca gagaaaataa cgtacacata tttgtagtaa taggtctaac agtagatttt 6780 aggttaaaag gtaggctttt aacggtggcc ggcata 6816 // ID BEL-186_AA-LTR repbase; DNA; INV; 494 BP. XX AC supercont1.94; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-186_AA_; KW BEL-186_AA-I; BEL-186_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-494 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.94; Positions 1898283 1897790. XX SQ Sequence 494 BP; 164 A; 78 C; 106 G; 146 T; 0 other; tgttgcgata gtattttgca tttagattta attcaaatac ttcagagaaa ctgctaactg 60 gaattgcgaa ctgtaaattg attcagtgta aaagttctca taagagagag cataagaaga 120 cgaaagttga aaggattccc taaagccgtc acagcgcttc atcgggattt atagtttaga 180 ttatatagtt tatataaaag aagtaagtaa ctgttgaaat gtgtttgcaa atgtattaat 240 ctagtgtcta aatttaccac agttaacact gagaagatcc ggcgaagccg atgccagtag 300 atgtttgctg agtggaagat agggcacaga tagtggttgc actgaaaaat tgtaatgcga 360 ggtaaatgag tcatgaagac attgatctaa tcaaactgaa tttttagctt tagcttatcc 420 accaatttgc ggagttaact tgattagctt aaagaagctc agttcccgac caggcgccct 480 accctttggc aaca 494 // ID CR1-40_BF repbase; DNA; INV; 2578 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-40_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-40_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2578 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2578 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1611-1611 (2009). XX DR [2] (Consensus) XX SQ Sequence 2578 BP; 909 A; 575 C; 554 G; 540 T; 0 other; agagacaact atttatatca atatgtcaca gaacccacaa gaattagaga aaaccagact 60 cccacactgg acgatttggt ttttacaaaa gaaccagaca tggtggagag aatctgttac 120 gaggacccac tgggttcaag cgaccacgtg aagttaactt ttagatgtaa cttttgtgta 180 aatcccacaa gacgtgtaac aaaatcttgg agctacgata aagcaaacta caacaagatg 240 aggacagata tcgacattga ctgggataaa gcactaagca acaagaagac acaggaagct 300 ttcagtattt tccagtctat tctcacacac gcggtgcaag acaatgtccc atcaaagctc 360 atagacaact caaaaaagaa cttcaaacct aaatggatga caaagaaagc cttccgtgct 420 tcacgaaaga aattccatgc atggacacgt tatttaaaca ctaaaagcag tcaggactgg 480 gctgcctata agtctgcccg taacgctgct acacacgagg ccagacgtgc aaggaaggac 540 tttgaaagac gaatcgccag agaagcaaaa accaacaata aagccttttg gagctatata 600 aacagccgca ggaaggtaaa gtcacaggtt ggcgacctga aaaacacaca tgggacgtac 660 acaagtaacg accaagaaaa agctgaaatt ctgaatgatc agtatttcaa tacgttcaca 720 agagaagacc ttcaaaacat tcctgaattc ttacaaaagc cattaacaac accagtatta 780 gaaacaatca caatcgagga agaagaagtg ctagaacaac taaacagtct acgcaccgac 840 aaatctcctg ggaacgacag cttacaccca aggatactca aagagctggc acatgagtta 900 acttaccccc tgaccagaat cttccaaagt agcatcgacg aatccactct tccagaggag 960 tggagagagg caaatatctc tccactattc aaaaaggggg acaagtctga ccctgctaac 1020 tatcgcccag tgtcattaac atcagtacca tgtaaaatgc tagagaagat tatatccaag 1080 aggattactg aacacatgag gacaaataac tttacctgtg accagcagca cggtttttct 1140 aaaggcaagt ctactgtcac taacctactc gaagccttgg atgtttggac agaagcgcta 1200 agtcacggtc tacccgttga cgttattttt ctggactacg caaaagcctt cgatacagtg 1260 ccacacatgc gtctattgaa gaagatcgag tccctaggaa ttaagggtgc attacttggc 1320 tggatcaaga atttcctcac atgcagaagg cagagagtct cagtaaatgg ggtcacatca 1380 agttggaaac cagtaaccag cggcgttcca caaggaagcg tactgggacc actcttattt 1440 acgatcttta tcagcgatat acaaatccag ctacacaact tcgtctccct attcgctgac 1500 gacacaaaga tctacgcagc atgctacaac tgtgtgagtg acaaccattc gaggtgctta 1560 caagcagatc tagacactct tcacatgtgg tcagtcaata tgcaaatgag gtttcatcca 1620 gacaaatgca aatcaatgca catggggaag ggaaatcaag gacataaata cagtctggca 1680 gacgcagaag gagtagtaca catcctgaaa gatacacggg aggaaaagga cctaggtgtt 1740 ctaatcgaca acaaattgag cttcagttca cacacacaga ctcaagtaac aaaggcaaac 1800 agagtacttg gagttataaa acacacgttt aaatatctcg acatggaaac attcctacta 1860 ctctacaaaa gcctggtgag accgcattta gagtacgcaa ctgtgatctg gagtcctaaa 1920 accaagagag actgtgattt agttgaaagg gtccagagaa gagccacacg tatagtggaa 1980 tctatctcac acctcccgta ctcagagagg ctcaaagcgc tacagcttcc tacccttctc 2040 ttccggcgac aacgtgcaga tatgattatg atgtacaaaa ttgcacatgg gatggtgagc 2100 ttgaggacgg atagtcattg caacatatgc gacagggcga tgtttattcc tagctacgcc 2160 acaaacacca gaggtcatcc ttacaagtac caaatacaag acgccagagg acctagaaca 2220 aactttttcc cttcaagatg tacacccaca tggaatggtc tctctgtgag aactgtaaca 2280 agcagtaccg tcaacacctt taaaggaaga ctgaaggagg aatggaagca ccacccaaac 2340 cagtttgagt acacattctc ctattaacca catgagtcat gaaggaagga aggaaggaag 2400 gaaggaagga agaaagaagg aagaaagaag gaaggaagga aggaaggaag taagaaggaa 2460 ggaatgaatg aaggaaggaa gcttaaccct gcgccatcta cataccgatt ctaccagcct 2520 cattacccga caagggtacc tatcggctgg atgcaaggtg aatatgcaag gtgaatat 2578 // ID Gypsy-6_IS-I repbase; DNA; INV; 4087 BP. XX AC ABJB010051582; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_IS_; KW Gypsy-6_IS-LTR; Gypsy-6_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4087 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010051582; Positions 21519 17433. XX CC Positions [3193-3660] - Integrase core CC 'GTTAC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 133..4074 FT /product="Gypsy-6_IS-I_1p" FT /translation="MNASSQASQPPPQPPPPYHLSAPEPFTFVPEHWTTWK FT RRWERFRTASGLQGKTQAEQVNSFVYLMGPKAEEIFASLPISADKTESHPD FT VVAAFENFFIVKKNVVYERAVFNSRSQREGEAAADFITALYALVETCDYGN FT LKDELLRDCIVVGVRDKRLSTKLQMESELTLERAVLLTKQTETVGQQQETL FT KHSTETAPTETAHVDRLQAPQQRAKKFHPKSPQKPSVDQPHATQSCFWCGN FT AVKHPKQQCPAAKARCNKCGKQGHFASVCKAKRLRQVAKLNESTSTEADFF FT FAGTVNTGPKKKPWKVNVVVGGHNVEFRLDTGADVTVIPTGLLAKLDKGIT FT LQKPDMLLLGPAKQRLDIVGVMQATVVYNGKSTVASLYVTKDLDEPLLGLD FT LIERFGILCRVSRAVAEPCLDPINEFPEEFRGLGEAPYTCKIKLKSPVEPV FT AVTSPRRIPVPLLKAVERQLRKMENEGVIPTVSEPTDWCSPIVVVPKKDGD FT IRICVDYTMLNKSVQREYHPIPSVEPILATLGQAKYFSRLDAYSGFYQVKL FT HPESTLLTTFITPFGRFKFNRLPFGISCAPEHFQRMTHQLLEGLDGVACHI FT DDILVWAKTREEHDSRLREVFGRLKEKEITLNREKCVFAQETVKFLGHVIN FT KDGVTPDKKNIATIVDMPPPTNVTELKRFLGMVNFIARFIPNLAIKTGPLR FT DLLHKDVPFSWGSTQRHAFEDVKNCLTSQPVLALYCPTKETIVSADASSFG FT LGAVLMQRQEHKSLRAVAYASKTLTEAERGYSQIEKEALGVTWACEKFKDY FT LIGLRFHIETDHKPLIPLFTRKPVDDLTPRLQRLRLRMMRYDYSMQHMPGR FT DLVVADALSRQPLQGQDSSRLAEEVADFEQALIRHVSVPDVCLQSLADAQD FT QDSVCRALKNYVQTTWPKTKKQVHQECLPYWQFKNKLTLHEGILMRGQRYL FT IPVPLRSSVLSSLHDGHEGIIKCTRRAQQSCWWPGLSKDIADTVEKCVSCM FT KQRQPRNQSLMPTPFPDRPWQRVAMDLFYANGKCYLVVTDYYSRYIEIALL FT ESQRPETVILKSKSIFARHGIPETVVTDNGPQFRSEFLCFARNWGFKHVTS FT SPKHAQSNGCAEAAVKIAKTKLTKSFDPYRALLAYRATPLENGFSPADLLF FT GRRLRTHVPISAELLRPSVPDHHQVEDFEHKARKRQAINYDRRHAVRDQPE FT FQPSQRVWITDLKRAGTVLNKAETPCYCWIGTDQGIIRRNAKFLVIDRRRC FT NFEEDLISLGSLPCSPELPDCPASQPASLQQSRSGPTSRSGRPLRPPCRYG FT YD" XX SQ Sequence 4087 BP; 1091 A; 1098 C; 1057 G; 841 T; 0 other; tggtgtcaga agctaagttc agcttcacgc gccgctagac aaaacgctcg gcaggcacac 60 gagcatcgaa cgttcggcgt tgccactcgc tgtcaactca acggatcacc gttaggccta 120 caggtcggaa caatgaacgc gtcttcacaa gcctctcaac cgccgccgca gccaccaccg 180 ccataccacc tttcagcacc ggagccgttc acgttcgtac ctgagcactg gacgacctgg 240 aagagacgat gggagcgttt ccgcaccgcc tcaggcctac aagggaagac ccaagccgag 300 caggtaaact cgttcgttta cctcatggga cccaaggcag aggagatttt cgcctccctc 360 cccatttctg cggacaagac ggaatcccat cccgatgttg tcgcggcatt cgagaacttt 420 ttcattgtga aaaagaacgt cgtgtacgag cgcgcagtgt tcaatagccg ctcacagcgt 480 gagggagaag cagcagcgga ttttatcacg gcactgtacg cgctggtgga gacgtgcgac 540 tacggcaacc tcaaggatga gcttctcagg gattgcatcg tcgtcggtgt gagagacaaa 600 agactgtcca ctaagctaca gatggaaagc gagctcactt tggagcgagc ggtgttgcta 660 acgaagcaaa cggaaactgt tggccagcaa caagaaaccc tcaagcattc taccgagaca 720 gcgcccactg agactgcaca tgtggacagg ctgcaggccc cacaacagag ggctaagaag 780 tttcacccaa aaagtccaca aaagccttcg gtggatcagc cacacgcaac ccagtcatgt 840 ttttggtgtg ggaatgcagt caagcacccg aagcagcaat gtcctgcagc caaagctcga 900 tgcaacaagt gcggaaagca aggccatttt gcatcagtct gtaaagccaa gcgactccgg 960 caagtcgcca agctgaacga aagcacgtcg acggaggcag acttcttttt cgcaggaacg 1020 gtcaacactg ggccgaagaa aaagccgtgg aaagtcaacg tcgtcgtagg cggtcacaac 1080 gtcgagttcc gcctagacac aggagcggac gttaccgtga ttccaacagg tttactggca 1140 aagctggaca aggggataac gctccagaaa cccgacatgc tccttcttgg accagcgaag 1200 cagcggctgg acatcgtggg ggtgatgcaa gcaacagtag tatacaatgg gaaaagtacc 1260 gtggcgagcc tctacgtcac aaaggacctg gacgagcctc ttctgggctt ggacctcatc 1320 gaacgcttcg gtatactgtg ccgagtgagt agagccgtag cagagccatg tctcgaccca 1380 atcaacgagt tcccagagga gttccgaggt ttgggagaag ccccctacac ttgcaaaatc 1440 aagctgaagt cgccggtaga gcctgtggca gtcacatcac cgcgacgcat cccagttcca 1500 ctcttgaaag ctgtggagcg tcaactccga aagatggaaa acgaaggagt gatccccact 1560 gtcagtgagc caacagattg gtgctcgcca atcgtggtag taccgaaaaa ggatggtgac 1620 atacgcatct gtgtggacta caccatgttg aacaagagtg tgcagagaga gtatcatccg 1680 attccaagtg tggagccaat tctggctact ttgggacaag ctaagtactt ctcccgtttg 1740 gatgcttact ctggctttta tcaagtgaaa ctgcaccccg agtcaacact gctcaccacc 1800 ttcatcactc catttggtcg gttcaagttt aaccgcctcc ctttcggcat ctcctgtgct 1860 cctgagcatt ttcaaagaat gactcatcaa ctcttggaag gacttgatgg tgtggcgtgt 1920 catatcgatg atattctggt gtgggccaag accagggaag agcacgacag tcgcctgcgt 1980 gaagtcttcg gcagactgaa ggaaaaagag atcacgctta accgtgaaaa gtgcgtgttc 2040 gcccaggaaa cagtcaagtt tctgggacat gtgatcaaca aagacggtgt caccccagac 2100 aaaaaaaata tagcgaccat cgtagacatg ccacctccaa caaatgtcac agaactcaag 2160 aggttcctag gaatggtgaa tttcatcgca cggttcatac cgaaccttgc gatcaagaca 2220 ggacccctga gagatctgct tcacaaggat gttccattca gctggggcag cacgcagcga 2280 catgcttttg aagatgtaaa gaattgcctt acatcacaac cagtgctcgc actctactgt 2340 ccaacaaagg agacaatcgt gtctgctgat gcctcaagtt ttggcctggg agcggtcctg 2400 atgcagcgcc aagagcacaa aagcctgcga gccgtcgcat atgcatcgaa aactctgaca 2460 gaggcagaga ggggatattc tcagatagag aaagaggccc ttggcgttac gtgggcatgt 2520 gagaagttca aggactatct catcggtcta aggttccaca ttgaaactga ccataagcca 2580 ctcatccccc tcttcacacg gaagccagtg gatgatctta ccccaaggct acaacggctg 2640 cgtctcagga tgatgcgcta tgactacagc atgcaacata tgcccggcag ggacttggtt 2700 gttgctgatg ccttgtctcg tcaaccactg cagggacagg acagctcgcg cctcgcagag 2760 gaagtcgccg actttgagca agccctcatt cggcatgtga gtgtcccaga tgtatgtctg 2820 cagagccttg cagatgcaca agaccaagac tctgtttgca gagctctgaa gaactatgtt 2880 caaacaactt ggccgaagac gaaaaaacaa gtgcaccagg agtgcttgcc ttactggcag 2940 ttcaagaaca aactaacgtt gcatgaggga atcctgatga gagggcaacg atacctgata 3000 cctgttccac tgagatcctc agtgctaagt tctctccatg atggtcacga aggcatcatc 3060 aagtgcacaa gacgagccca gcaatcctgt tggtggcctg gtctatcaaa agacattgct 3120 gacactgtgg agaaatgtgt cagttgcatg aagcagaggc aacctcggaa ccagtcactc 3180 atgccaactc ccttccctga ccgaccgtgg caaagagtag ccatggacct tttttatgct 3240 aatgggaagt gctaccttgt tgtgacggac tactactccc gctacatcga gatagcactg 3300 cttgaaagtc aaagaccaga gaccgtcatc ctcaagtcga agtcaatctt cgcaagacat 3360 gggatccccg agacagttgt gacggacaat ggaccacagt ttcgctctga atttctgtgc 3420 ttcgcaagaa actggggttt caagcatgtc acctcaagtc ccaaacatgc ccagagcaat 3480 ggctgtgccg aggccgcagt caagattgcg aagactaaac tgacaaagtc atttgatccc 3540 tacagagcac tcttggcgta ccgagcaaca cctttagaga acggcttcag ccctgccgat 3600 ttgttgtttg gccggagact gcggacacat gtgccaatat cagcggagct gttgaggcct 3660 tctgttccag atcatcatca agtcgaagac tttgagcaca aggctcgaaa acgccaagcg 3720 atcaactacg acaggcgaca tgctgtaagg gatcaaccag agttccagcc gtcccagagg 3780 gtgtggatca ccgatctgaa gcgtgcaggg acagtgctca acaaagcgga aacaccttgt 3840 tactgctgga ttggcactga ccagggcata attcgccgca atgccaagtt cctcgtcatc 3900 gatcgtcgac gatgcaactt tgaggaggac ctcatctctc tcggcagctt gccctgttcg 3960 cccgagctcc ctgactgccc agccagccag ccagcatcac tgcaacagtc aaggtccgga 4020 ccaacctctc gatctggaag gcccttgcga ccaccgtgcc gttatggcta tgactgaaag 4080 ggggaga 4087 // ID Copia-129_AA-LTR repbase; DNA; INV; 154 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-129_AA_; KW Ty1_copia_Ele197; Copia-129_AA-I; Copia-129_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-154 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 154 BP; 50 A; 31 C; 25 G; 48 T; 0 other; tgttgcgagt agatcaattg tagtaaccta cgacacttca atactagcaa cggttagacc 60 tagcaacgat tgtttagttg atgtaattag aaaataaatg ttcattacct cttcagacta 120 cagccaatac agacgtgttt ttctacaact ctca 154 // ID ZEBEDEE repbase; DNA; INV; 3256 BP. XX AC Z86116; XX DT 13-MAR-1998 (Rel. 3.02, Created) DT 13-MAR-1998 (Rel. 3.02, Last updated, Version 1) XX DE A.aegypti DNA, copia-like transposable element Zebedee. XX KW Copia; LTR Retrotransposon; Transposable Element; LTR; ZEBEDEE; KW copia-like retrotransposon; endonuclease; integrase; KW reverse transcriptase. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3256 RA Warren M.A., Hughes A.M. and Crampton M.J.; RT "Zebedee: a novel copia-Ty1 family of transposable elements in RT the genome of the medically important mosquito Aedes aegypti."; RL Mol. Gen. Genet 254(5), 505-513 (1997). XX RN [2] RP 1-3256 RA Crampton M.J.; RT "ZEBEDEE."; RL Direct Submission to Genbank (11-MAR-1997)Crampton J. M., RL Liverpool School of Tropical Medicine, Molecular Biology and RL Immunology, Pembroke Place, Liverpool, UK, L3 5QA. XX DR GenBank; Z86116; Positions 424 3679. XX CC Zebedee appears to be flanked by 22bp direct repeat sequences. CC A single open reading frame (ORF) of 972 amino acids has the CC coding potential for a polyprotein with homology corresponding CC to the conserved amino acid motifs of Long Terminal Repeat CC retrotransposon protease, integrase and reverse transcriptase. CC The Zebedee family appears to have a low copy number CC number in different strains of Ae.aegypti. Transcripts of CC the elements have been detected in culture mosquito cells. CC Despite the lack of a gag homologue or the LTR hallmarks of CC previously characterized Copia-Ty1 RTPs, phylogenetic analyses CC place Zebedee within this group, showing considerable homology CC to Copia from D.melanogaster [1]. XX SQ Sequence 3256 BP; 875 A; 760 C; 876 G; 745 T; 0 other; cttcgttgtc tcggtcatat cagtgaaagt ggaatggcct ctctggtgcg tgagaacctg 60 gcccttggtg tgcgtttcaa gccggagaag attaagttct gcgatgtgtg tgtcccggaa 120 aaacacagcc gggacccgtt cgatggtgca agagaccggg cgaccagatc gttggagcgt 180 atccattcgg acgtgtgtgg gccgattgag cctgcatcct gggaaggtca tcgctatctg 240 gtctcgttca ttgatgacta gggtaacggt cgtattttgg accccctaag gaagtgattt 300 tagttttttg tcccaattaa ttaattgcac agcgtaacca agtcaaagaa catgccaaaa 360 tgtagagaaa aacttctcct acgagataaa aatgccccta cgatcatgaa ggatccaaaa 420 aatgtcgaaa ataattgaat tgtgaagaac tgtttctcat tattttggac acaccaatat 480 attttggacc cctggaattt ttttatcgtt tggcgaaaag tccacgacac tggttgtata 540 atatgctcgc ccatagtcta atcaatcgat tgacattgga aaaccgaaga aattctacgc 600 ttttagtgca gaaatttgaa aaacgttctt gtttacattt aaaaaaattc tgacggcttc 660 aatttctgtg tgtaacgttt tgtaacggtt gcatggatga accgattaaa ttaaattaat 720 ttcagataaa ataaacacaa tttctatgta taaacggttg atgcaaatgc ggaatagtcc 780 tatctttcca aatggcatgc gaattgagtt gaaggaagga aattgaagcg atggcgactg 840 cggcttgtgg cacgaagatt tcaaaactga ccgtagacca gggacgtgag tacttctcaa 900 ccagccagaa gaactactac aaagcgaagg ggattcaggt cgagccaact atagcctact 960 ctccgcaaca aaatgaagtg gcggagagat tcaacaggac cctaatcgag aaggtacgta 1020 ccatgcaaat tgaatctcat gctccaaaat cgttctggtg tgaagcggtg ctgtcggccg 1080 tgttcctgat caatcggagc ccaacctcgg ctgttttaag aaacgttacg cctgctgaac 1140 tatggtacgg tcggaagcca agtctggaga aggcgcgcgt ttttggttgt caagtctatg 1200 cctggatacc gatcaacgga ggaagaagtc ggacacgaag agtcgacagc tgatcatggt 1260 tggttacgcg cccaatggct accggctgtg ggacaaggac gagaggagaa tcgtaattgc 1320 tcgtgacgtc aagttcaatg aaaactactt tccgtgggcc aacaatgcaa gagatcgcgc 1380 ggaagagaga ctggtggtac ccttggtcta cgagcaagag ggggaagaac atgccaagga 1440 agccattgat gaggacgcca tccaacctga tgatgcggcg gtcggtccgt tcgacccgtc 1500 cgaccacgaa ggcgaacttc taaccgagga tgataccgat gacgaagctg cgacggaact 1560 agccaaccct gcgctccctt cgcaatcaga tctggacatg tcctccggcc ctagaggcga 1620 gcagtccgag aggcgcagga aacgggagcg ccgttttccg ggtaagttcc ttgattatct 1680 tactggattt agagctagtg ctgttggtcc tccctcccct tcagacgttc ccgaaacata 1740 cgatgaaatc cgtacccgtg aggaatcgtg ccctctggaa ccaggcggtg caggaacgag 1800 ctccgttcaa tgaaggccaa acgatgtctg gaagagtggt gggcgtgccc ccgccggcgt 1860 gaagctactc aaaacgaaat ggtcttccga ctcaaggaga acgaggctgg acgatcggtt 1920 cggcataagg cgcgactggt ggtcaagggt ttccttcaac gtcctggaat cgatttcgag 1980 gacacgttcg caccagtggc taagctgtct accgtgcgag tggtgctagc cgttgcggtt 2040 catcatggtt tcaaggtgca ccagatagac gtaagcacag cgttcctcca cggtatgctg 2100 aaggaagatt tgttcatgga agaacctgaa gacgtgaagg cgaagcccgg cagtgtgtgc 2160 aagctgcaga ggtcgttgta cgggttgaag caagcaccac gttgctggaa cgaaaggttc 2220 aactcagcgc ttctgaagct gggattcagc agatcccacc gtgcctactg tctctacgtg 2280 tgcaacactg aaggggatga agttttctta gtgctgtacg tcgacgacct cctgatagcc 2340 ggacggaaca ttcgaacaat tcagaagctg aagcgccgtc tttccgatga gttctaaatg 2400 accgattgcg gcgacgaggt ttttctttgg ggatgcggct cgcttacaat ccggagcatg 2460 gtgacctgaa gttgacccaa gaaacagctg ccacgaagat cttggagaag ttcgctatga 2520 ctgaatgtta ccccgccaag aatccgatgg agaaaggatt gcagcctgat gtatcaaatg 2580 ctgtgtgcgc ccggatttgt gctacccggt cagctacttg ggaaggttcc agcaatcatc 2640 aacgaacaat cactggcagg cgctgaagcg agttgtcaga tacctctaag ggacggctac 2700 cctgggacta ctctacaaga gaaacgagaa cagcagaccg ctcgtcggat acgtggacgc 2760 cgactgggca tccgatgcgg aagatcggaa tcggtaacag gtttcctgtt caaagtctac 2820 ggcatcgacg gtatcctggg ctagtcggaa gcagccaaca gtctccatat cgtcaagcga 2880 ggccgaatac gtcgcgctga gtgcagccgt atcggaagct atctggttgt ccggaatttt 2940 agaagacctc aactgcaaga agctgacgga ccctgtatca atatacgaag acaaccgtgg 3000 ctgcataggg ctgcccaaaa atgctgaatc gaagcgcatc aagcacgtaa atatcaagca 3060 ccactttatc aaggaccatg tggctgccgg gaccatcaaa atcgaaccga tcagcacccc 3120 cgagcagcaa gcagacattt tcacgaaggc gttagatgtc acccggttcc aatccctcag 3180 gagcaagatc ggggtcaccg attgagagag ggtgttagaa atctagcact cgctgactgc 3240 tggatttaca tcacag 3256 // ID Hoyak1 repbase; DNA; INV; 2603 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoyak1 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hoyak1. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2603 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1287..2096 FT /product="Hoyak1_1p" FT /translation="MIKAFSDAENIYCVNHLLNNAVEKAIHAIPEMQSLVS FT NCSKLVKYFKKSGMNSTLGVSLKSFCPTRWNTVFYLLESVETNWIELTAVL FT KEKNQTSRVEEININHLGTIVRLLETFENVSKKLEASKRPTIHLILPNLNK FT LIKTCQFDCNDTNIIRDLKFQLNSQLSSTVNPKLKKYHKIALFLFPPTNKL FT IQFSPTEKEAVINDCKILMNHFLQDSCILNNFETELAEDEFAEYLDVPKVE FT TIQTKVEAEIAGYSNINVAYTPDFNVLAW" XX SQ Sequence 2603 BP; 859 A; 461 C; 450 G; 833 T; 0 other; tagacatctg catgattcca ctcataacat acgtcacata gaatacattc gaatggaatt 60 gaattgaatg agtgaaacgc acgaaatgaa acgacattgt atgagtgaga ggttctcatt 120 ccactcataa cttcctacct gcaacgaaat gaacgtaact gaatgaagcg taaaatgaaa 180 tgaaacattt ggaaatgcct agaacattcg aatgattcga gttgttgcgt gcgaattcgg 240 tccattcctt tgcttattct tgacagctgt acttcaaaga cgcaataatg gacgcattat 300 tttcaaaaaa agccgacaaa tctgtatgta tttgtctaac gttttgtttt aatttactat 360 tgttgtttta atgtatatat atttataaca attcgcttat ttgttttgat cgtgcttaga 420 ttaaaacctg tgtatttgtg acctgtccat aagtgtttat tgtgtaattt tttttcagac 480 ttaattcagt ttgattgcaa tatataggtt gtaaaaggca cagtggacat taaagagaag 540 ctgactagtg gcatgtacag cctaattgaa aagagaggcc gaagcgaagt ttggcaattt 600 ttctccaaaa ttaaaaacga tgatggagaa gaactcgtca accttgtagc ttgtaagacg 660 tgctataccg ttttaaaatt tacaggcagc acatcaaacc tggtcaagca caaatgttat 720 ttaaataatg caagcaaatt tcgtcaaggt gctccaatag aagttaatca ggaaacaaaa 780 gaggagggca tatctgttgt aacagaatgg gttgtcaaga attgtcgtcc ccttaaaatt 840 attgatgact cgggcatcaa aaagtttgca tcatttttaa ttaatgttgg agctacatat 900 ggtggaaatg tcgacgtcaa taagctgtta ccacatccga caaccttatc ccgaaatatt 960 tcaacaatat attcgtccca ctttggcccc ataaaatcag aaatacaaac atacaaagcc 1020 tttggatatg ccattacaag tgacatatgg acagatgact atttgaaggc atcttatttg 1080 tcatgtaccg tccattacat taaagaggga gttctggtcg atcgccttat ggccatgaag 1140 tcaatgaagg gcttgcccag cacaggtttg ttaactccat ctttgtgtgt ataagtataa 1200 ttcattattt ttcttatcaa ggtctcaata tccgaacaaa aattgaggct attttgaagg 1260 attttggatg tgacctcgtg accaatatga taaaagcctt ttcggacgct gagaatattt 1320 attgcgtcaa ccatttgtta aataacgctg ttgaaaaggc cattcatgct attcctgaaa 1380 tgcaaagcct tgtttcaaat tgcagcaagt tggtaaaata cttcaaaaaa agcggaatga 1440 attcaacact tggagtatcc ttgaagagct tttgtccgac tcggtggaat accgtgtttt 1500 acctacttga gtcagttgaa accaactgga ttgagttgac agctgtcttg aaggaaaaaa 1560 atcagacatc tagagtggaa gaaatcaata taaaccattt aggtactatt gttcgattgt 1620 tggaaacgtt tgaaaatgtg tccaaaaaac tggaagcatc taagcgccca acaatccatc 1680 ttattttacc aaatttaaat aagctgataa agacctgcca atttgattgc aacgatacaa 1740 atattattag agacttaaaa ttccaattaa atagccagtt atcatctaca gttaatccaa 1800 agttgaaaaa atatcacaaa atagccttgt tcctttttcc cccaacaaat aaactaattc 1860 aattttcacc tactgagaaa gaggcggtca ttaacgattg caaaatttta atgaaccact 1920 tccttcaaga cagctgcatc ttaaacaact ttgaaactga gttagctgaa gatgaatttg 1980 ccgaatattt agatgtccct aaagttgaga caattcaaac caaggtggaa gcagaaattg 2040 caggatattc taacataaat gttgcataca cccctgattt taatgtttta gcttggtgaa 2100 atttgaataa acaatttttt ccacttttgc ataaagtaag ttgcaaaata ttttgcattc 2160 cagcaagcag tgcggcttcg gaacgcgcat ttattagcgc aagaaattta attacagaaa 2220 agcgttgttt aatagccacc aatccagaaa atataaacaa aataatgttt ttgcattcga 2280 atttaagtta attatattgt tattttagta ataaatttaa tatataacca aattacatat 2340 atgcatacac aactataaaa acaaatttca tttctttttc attgtttaca gttgtcactc 2400 gaatcattcg aatgaagcgt ctcaatttac tgtgttctca ttccattcaa ttcatttcca 2460 tctgtttcgt tgttgtgcga ctgagtcgta cgtaaattca ctcactcact cataccagtt 2520 tgactttgct ttcattctca ttccattcat tttcggcatt gtgcatttga atgtgccttt 2580 ttcgccactc atgcagatgt cta 2603 // ID Gypsy-31_DWil-LTR repbase; DNA; INV; 248 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_DWil_; KW Gypsy-31_DWil-I; Gypsy-31_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-248 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 3531096 3530849. XX SQ Sequence 248 BP; 67 A; 55 C; 61 G; 65 T; 0 other; tgtggagacc cgtgccaagg gagctttcgg cggccggcga tcaacagaca gaaagccccg 60 cggcacaagc cacggaatga gcttttgacg tacgaacttt gggtatcgct ttctccatcg 120 atcttcactt tgattcttga cggcaagcgc agcatatatg cataagcagc acatgaataa 180 attaatagct aattggttaa tggtaaactt gttgttgttt ctaatcgctg gaataccggg 240 ttaataca 248 // ID Gypsy-95_AA-I repbase; DNA; INV; 5145 BP. XX AC supercont1.1; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-95_AA_; KW Gypsy-95_AA-LTR; Gypsy-95_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5145 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1; Positions 3664484 3659340. XX CC Positions [2339-2716] - Reverse transcriptase CC Positions [3974-4444] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 740..5083 FT /product="Gypsy-95_AA-I_1p" FT /translation="MAYVVAPNIEPYRKGQSFAGWMRRLTFHFRVNKIQDN FT DKKDQMFMLGGDYLFSMAEKLYPTEALLDEVTYAELVQKLKERLDRTDSVL FT LQRYHFGSKLQQPGETASDFVFTLKLQAEHCEFGDQKNRLILDRLLVGLTD FT NNLKHRLLTEDSAKLTLDQAEKIITTWEMAATHTRALANNDGVGLVGFVNN FT RYPVTGGRGAVLQRVRDLAQSYRGPVESRLGARPEIRPAEDCRQDSRVRFQ FT RGSQSQQHRHRDPSAGSSNRRLQRGDEQWPIDQRYCSYCKRGGHVRRKCYK FT MKNEMNSRVNHVETQEAESSTGSLSQLAERLDRLRSGNWDSDDNDSGELQC FT MHVTSINKISDPCLLNISIENHTVQMEVDSGSSVTVMGKILYTSTFDLPLV FT KSSKQLIVINGSRLKVSGEVGVRVKYNGKSANLTLLVLDCDYQFIPLLGRP FT WLDEFFPNWRNFFANLMPLNNISVNQSEALIAEIKLNFSDVFVKDFSKPIK FT HFEADLVLKTDIPIFKKAYDVPYRLREKVLKYLDKLEAEKVITPIQTSEWA FT SPVVVVMKKNNDIRLVIDCKVSINKLIIPNTYPLPVSQDIFASLAGCKVFC FT SLDLEGAYTQLSLSKRSRKFMVINTIKGLYTYNRLPQGASSSASIFQQVMD FT QVLQGIENVCVYLDDVLIAGKDLKDCKAKLYKVLDRLSKVNIKINWDKCKF FT FVTELEYLGHIISDKGLIPCSSKIATIREAKVPTNVTELKSFLGLINYYNK FT FIPNLSSKLYHLYNLLKNDVKFLWDSKCNEAFENSKKLLLETDFLEFYDPE FT KPIVVVTDASGYGLGGVIAHVVDEVEKPICFTSFSLNAAQKKYPILHLEVL FT ALVCTIKKFHKYLYGQKFTVFTDHKPLVGIFGKEGKHSIFVTRLQRYILEL FT SVYDFEIQYRPSAKMGNADFCSRFPLEVAVPSAYDQEFVKSINFGDNLPID FT FSAIAEETKKDASLQKVMSFMLNGWPKRIDKQFVDIFTNRNDLEVVEECLL FT FQERVVIPQVLQGDILKLLHGNHAGIVKMKRLARQTVYWFGINTNIERFVA FT ACDICASMAIVPKQNITSKWIPTLRPFSRVHIDFFHFEHRTFLLIVDSFSK FT WLEVEWMRKGTNTSEVLTKLIEFFARFGLPDVIVSDNGPPFNSHTFKSFLQ FT KQGIKVLNSPPYNPASNGQAERLVRTVKDVLKKFLLEPGMLELKLEDQINL FT FLFNYRNNNLTLEGSFPSEKVFAYSPKMLIDLVNPKKHYKQMLVPHNPNDE FT AKGSQNENNDQTDQLDALIPGDLVWYKHNIPHLREKWIKASFLKRFSKNLL FT QIMVGNEAITTAHPTQIRLVKEGSGSRQPRRSMRVVETGRSIPATAEPEGT FT VVRDDDIRQLPERNTESEGAAFGETDVRQLPDLSDRERRRNNSAEIIERKS FT RKRKFPSEPVELNGLLRRSKRTRKIISDNEFEYY" XX SQ Sequence 5145 BP; 1554 A; 905 C; 1169 G; 1517 T; 0 other; gaaaagtggc gacgaggtta actggtgcgg cggagcattt ttcggcgtac tcggacggct 60 actcggagca gcaacaccct ttcatctgcg attagtgcag cggcggtggt gctgatagga 120 agtaattacg gtgagaacaa cacggtgcat agcataaatc ccaaggtaat aagggaaaaa 180 gtgaggtgat aacgggggca aaaaggtgat ttcggtgaca atttgttcat tccggtgatg 240 ttgacacgcc atttgctttt ggtggcgtag gctgtgtgaa agcataaaag acaattacca 300 ttgttggttt tctttttatt tacttttttt ttgtgaaacg caaccatttt gttttgatta 360 aatattaaca gtgctttcct ttttctattt tggaagataa atccacagcg ggcggcaatc 420 cggttgtttt cgggtgctga aaggatcttc gctctagtgt acgagtggct gcagtttcct 480 tcccgctggg tggcggacgg tttgatttct ctccccgctg ggtgccggac gattcatttt 540 cttctcccgt cgggtggcga tcattaccca gcctgtaaat tgcggtgagc gtgacgtcga 600 tcggtgagta atcattttgt ttttctatct gattttgaac attagactgg ttcaatttgt 660 tttgcatatt ttttttgtga aaagtttaaa aagtttttaa aattagtgtt ttttattgtt 720 ttgcttagat ttttgcaaaa tggcgtatgt agtggctcca aatattgaac cctaccgaaa 780 aggccaatct tttgcgggat ggatgcgtcg gttaacgttc catttccggg taaataagat 840 acaggacaat gataagaaag atcagatgtt catgctgggc ggggattacc tgtttagcat 900 ggccgaaaaa ctttatccaa ccgaagcact tttggatgaa gtgacctacg ctgagctagt 960 tcaaaagctc aaggaacgtc ttgatagaac tgattctgtt ctactacagc gttatcattt 1020 tggctcaaag ctgcaacaac ctggtgaaac agcgagcgat tttgttttta ctttgaagct 1080 ccaagctgag cattgcgagt tcggcgacca gaagaaccga ctcattttgg accgtcttct 1140 ggttggcttg acggataaca accttaagca tcgccttctt actgaggaca gtgccaaatt 1200 gacccttgat caagctgaaa agattatcac aacctgggaa atggcggcga ctcatacaag 1260 agctttggcg aacaacgatg gtgtcggctt ggtaggtttt gtgaacaata ggtatcctgt 1320 gactggagga agaggagcag ttcttcagcg agtgagggac ttagcccaga gctaccgtgg 1380 cccggtggaa agccgacttg gtgcgcggcc agaaatccgg ccggccgagg attgtcgaca 1440 agattcccga gttcggttcc agcgaggttc ccaatcccag cagcataggc accgagatcc 1500 atctgctggc tcatctaatc gcagattaca gcgaggtgac gagcaatggc cgatagatca 1560 gcgctattgt agctactgca aacgcggggg ccatgtacgg aggaaatgct acaagatgaa 1620 gaacgagatg aacagcaggg tcaaccatgt cgaaacccag gaggcagaga gcagcaccgg 1680 aagcttgagc cagctggccg agcgactaga ccggttgaga tctggtaact gggactccga 1740 tgataatgat tcaggtgaat tacagtgtat gcatgttact tctattaata agataagcga 1800 tccttgttta ttgaatattt ctattgaaaa tcacactgta cagatggagg ttgacagtgg 1860 ctcttctgtt acagtgatgg gcaaaatttt atacacatcc acatttgatt tgcctctagt 1920 aaaaagttct aaacaactaa ttgtgataaa tggttccagg ttaaaagttt ctggtgaagt 1980 tggagttaga gttaaatata atggtaaatc agctaatttg acacttttag tgcttgactg 2040 tgactaccag ttcattcctt tgctgggtcg accgtggttg gatgaattct ttcccaactg 2100 gagaaatttt tttgccaatt tgatgccatt gaacaacatt tcagtaaatc aaagtgaagc 2160 attaatagct gaaattaaac taaatttttc tgacgttttt gtcaaggatt tttccaagcc 2220 aattaaacat tttgaagctg atttggtttt aaaaactgac attcctattt ttaaaaaggc 2280 ttacgacgtt ccttatcgat taagggagaa agttttaaaa tatctggaca aacttgaagc 2340 agaaaaagtg attactccta ttcaaaccag tgagtgggca tcacctgtag ttgttgtcat 2400 gaagaaaaat aatgacatac gactagtaat tgactgcaag gtttcaatta ataagctcat 2460 cattccaaat acttatcctt tacctgtatc tcaggatatt ttcgctagtt tggctggatg 2520 caaggttttt tgttctcttg atttggaggg tgcatacaca caactatcat tatcaaagcg 2580 atcaaggaaa tttatggtaa ttaacacaat caaaggactt tatacctaca atagattgcc 2640 acagggggct tcctctagtg catctatttt ccagcaagtt atggatcaag tattgcaagg 2700 tattgaaaac gtttgcgtct atttggatga tgtgcttatt gcaggaaaag acttaaagga 2760 ttgcaaagct aaactttata aagttttaga cagattgtct aaagtaaata tcaagattaa 2820 ctgggacaaa tgcaagtttt tcgtaactga attggagtat ttgggtcata ttataagtga 2880 taaaggtttg ataccatgct caagcaaaat tgcaacaata cgggaagcaa aagttccaac 2940 aaatgtgacc gaattgaagt cctttttagg acttatcaat tattataaca aatttatacc 3000 caatttgtct tcaaaattgt atcacttgta caacttattg aagaacgatg ttaagttttt 3060 atgggacagc aaatgcaatg aagcttttga aaattctaag aaattgttgc ttgaaactga 3120 ttttcttgaa ttctatgacc ctgaaaaacc aattgttgta gtcacagatg catctggcta 3180 tggtttgggc ggagtaattg ctcatgttgt tgacgaggta gagaaaccaa tttgtttcac 3240 gtctttttcg ttaaatgcag cacaaaagaa ataccccata ttacatttag aagtacttgc 3300 acttgtatgc acaattaaga aattccataa atatttgtat ggtcaaaaat ttacagtttt 3360 tactgaccac aaaccattag tggggatttt tggtaaagaa ggcaagcatt caatatttgt 3420 gacaagatta caacgctata ttttagaatt atcggtgtat gattttgaaa tccagtaccg 3480 accttcagca aaaatgggca atgcggattt ttgttcgcga ttcccattgg aggtggcagt 3540 tcccagtgca tatgatcaag agtttgtgaa aagcatcaat tttggggaca acttgcctat 3600 agatttctca gctattgctg aagaaacaaa aaaggatgct agtttgcaga aagttatgag 3660 ttttatgcta aatggttggc ctaaaaggat agacaagcaa tttgttgata tttttacgaa 3720 tcgaaatgat ttggaagtcg ttgaggaatg cttgttattc caagaaaggg ttgtaatacc 3780 acaagtattg caaggcgaca ttctaaagct tttgcatggc aaccatgctg gtatcgtcaa 3840 aatgaaaaga ttggcaaggc aaacggttta ctggtttgga atcaacacaa atattgaaag 3900 atttgttgct gcttgtgaca tctgtgcaag tatggcaatc gttccaaagc aaaacataac 3960 ttcaaaatgg atacctacct taagaccatt tagcagggtg cacatcgact ttttccattt 4020 tgagcatcgc actttcttgt tgattgtaga tagtttctct aaatggttgg aggtagaatg 4080 gatgaggaaa ggtaccaaca cttcagaggt tttgacgaaa cttattgaat tttttgctcg 4140 atttggtttg ccggacgtta tagtgtcgga caatggtccc cctttcaact ctcatacatt 4200 caaaagtttt cttcaaaaac aaggcataaa ggttttaaac agtcctccat acaatcctgc 4260 tagtaatggc caggcggagc gacttgttag gacggtgaaa gatgtgctca agaaatttct 4320 attagagcca ggaatgttag agttgaagtt ggaagatcag ataaacttat ttttatttaa 4380 ttatagaaac aataacttga cgttggaagg tagtttccct tctgaaaagg tgtttgctta 4440 ttcgccaaaa atgttgatcg atttggttaa cccaaagaaa cactataagc aaatgctggt 4500 accacataac cccaatgatg aagcaaaagg tagtcaaaat gaaaataacg atcaaactga 4560 tcagttagat gcactgatac cgggggattt agtgtggtac aagcacaaca ttccacattt 4620 acgcgaaaaa tggataaaag catccttttt aaaacggttc tctaaaaact tattacagat 4680 catggttgga aacgaggcga taactaccgc gcatccaact caaatacggc tcgttaagga 4740 agggagtggt tcgcgccagc cgagacgatc gatgcgagtg gtcgaaactg gtcggtcgat 4800 accggctacc gctgaacccg aaggaacggt cgtgcgagat gacgatatcc ggcaattacc 4860 ggaacgaaat acggagtctg aaggagcagc ctttggagaa accgatgtcc ggcaattacc 4920 ggatctgagt gaccgtgaaa gaagaagaaa taatagtgct gaaataatag agcgtaagag 4980 taggaaaaga aagtttccgt cggaaccggt agagcttaat ggcctactac ggcgttcgaa 5040 aagaactaga aaaattatat cagacaatga attcgaatat tactaaatgc tatatttttt 5100 aattattgaa ttattgttca atcttttgca agagggaaga actaa 5145 // ID Gypsy-33_DWil-LTR repbase; DNA; INV; 304 BP. XX AC scaffold_181130; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_DWil_; KW Gypsy-33_DWil-I; Gypsy-33_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-304 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181130; Positions 821215 820912. XX SQ Sequence 304 BP; 104 A; 46 C; 63 G; 91 T; 0 other; tgtagtgtca tatggctaca tatgtgctga gagatggcag ccctggatgc ggtgcgacaa 60 tgtggcataa tcagtcgata agaagaaaaa atgagagaag gaacgataga gaacagacgt 120 ataggccgaa ggtgtacaaa agtcgcttaa gttgtatttc aagttgttac agtcgtaaag 180 ttgctaaaag ttgtcatcta aagttgttaa gtctaagttc tttttatcct aatttcaacc 240 gactattttg ctatatatct aactaataaa caacaattat tgtatacacg caataattct 300 taca 304 // ID LIN7_SM repbase; DNA; INV; 5985 BP. XX AC . XX DT 17-FEB-2008 (Rel. 13.02, Created) DT 11-AUG-2009 (Rel. 14.09, Last updated, Version 3) XX DE Non-LTR retrotransposon from Schmidtea mediterranea: consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN7_SM. XX NM LIN7_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-5985 RA Jurka J.; RT "Non-LTR retrotransposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 165-165 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(2659..5052,5016..5810) FT /product="LIN7_SM_1p" FT /translation="MPIYGKNSSSWSFFFLDKRRRIAMIIDPTADDSHALH FT FDLANDILRTILNIENIFGKLKFGLTEIEYPICREFGLSSLYVCHFIQCLI FT LGSALTIPDVSAMRILMRPIINKYNCTKFQDNDARKYRILIDDLVYQLDLD FT TITCDETLCEIERINGRLNPEKYFKKEKPKSDIIHLRFKKSAELLCVKRLR FT FQANQRQEIAKIWEDDGIEHRPPINAFLKTFASSECPISDTKSITLPFYSD FT NESNLTTDNENMSHVMKQLDNTAPGMDLITVADWKIISPKHILITAICNNV FT LRNGVCPKKWKLFRTVLILKPGKKDESFKTSSWRPLAIMDTAYRIFATLLN FT NRLLTWIKKGNLISQNQKAIGIPDGCAEHSATLHLAIDHAKRRKKELHIVW FT LDITDAFGSLPHDLIWYTLEGMGMKKETIYLIKELYRDVSTFFDCQGMISE FT SVNITKGVKQGCPLSMTLFCLSIDYILKSVLNDCPFHLHDMNISILAYADD FT IVLVSDSFTKIEMALRRTVRLASHANLRFKPSKSGYLSINNNNHDIYKLYL FT YNEEVPIISNENKYKYLGIDFSHKQNQDIDGRLQSALALTNSLFKSYLHPA FT QKLNAYKTFIHSKLIFSFRNCIIGHRILDCDRNRITQGREKQLGYDQKIKA FT LLKTMVGDKFHALNNYFLYTHCKLGGLGVSSSIDEYLIQSVTGITRLFHSY FT NTDFREMMMKELAHARGEENFEAALKWINCEIDKPFKNASFFVKFQKSVYA FT LKRKFEIYIKLKFEDNSFKLEMRYKNRISYIGYRNLGTLFQKTYTTLPRNT FT LSKDLHDFVGLHYAEKWYEMRVQGHIGNAIGDSITAKYLIASDILNDAQYC FT FLVKARNNLLNLNYNAYRLKFSLDTMCRLCHLDAETQAHVFNHCLAKPNAR FT RIKHENVLISIVAFLEKIGFEIDVEKSPKYISVPTKLKPDMVIRSNRNKEI FT HVLDLKVPYDSVEGFEKAREDNYVKYKDLAIEIGKAFGEKATISAVVIGCL FT GTWDKKNNAPLSKIGLSKPEIVSLARIACSKAVIACYHIYREHISYTKSTM FT ALPFSI" XX SQ Sequence 5985 BP; 2237 A; 1049 C; 1035 G; 1653 T; 11 other; aatatcaatg ccaaaaaaat ttaattcaaa attgacattc aatagcaccg ggccgaaatc 60 tgaagctgat gacatggaac caaagaactg caattccaac ctgatctcag agatgaatag 120 attatttcaa acattcatta atcggattat ggaaaaaaat atagttcaaa ataagaacag 180 gacatccaga tcaaaatcaa tttcaaaacc taactgcaat aatgatcggg aatcaaaaat 240 aactccaaag tttaaaaata aaggatttaa aattgaaaat catctccgag atgataaaaa 300 tttcaatgat ttaaaaaaat gtttttacaa tcatgagaaa ttttataaaa atctcgaaaa 360 ctgcgattta aatgaactga tatttaaaat atttttacta aaagtaaaaa ttaataaatt 420 tgaatcgaaa atcacagaat ttaaaaataa ttttaaaaac gagaaaaaag ataaaaagaa 480 gataataaat aatgaagata aaacaacgga aactaaaatt ttaaaaagtt ttaaagaagt 540 tttaactaat gggataaata aagaattaca atccaataaa aagaatataa ataatgagaa 600 aaaagctgaa attaaaaaag atgaatatta taaacaaaat tcaaaaaaaa taattagatc 660 aaaaacaatc aacctcgata taatcacaaa actcactgat gatttcttta acaacctaat 720 atggaccttt cataaatatg atgaaaaaga agattcaatt caatataaca cagaaatcct 780 caaatacgtc caaataaaat ttattcttat ggacaaaaat atcagaatgc catccaatgc 840 tatcaaaaaa attgagaata gtatatcaaa ggaaatcttt aaattgacta tcgataatta 900 tttcaacctt ttaaataatg agatacattt gaattttaat aatcgattta aaaatttaaa 960 atatgaatct tttttaaaag attttttatt tataatcggt ttaaaattta aatcattttt 1020 aaatgtgttt aaatatctta aattatcaaa ccatatcaat atcaatgcca aaaaaatttt 1080 tcgcgcgcat caggcgcgcg aaactaatgt acgcagctcg tcgacctccg gtcgactcgc 1140 ggaaaataat gtcaaaaact ctgaagcaga aaatattgtc cataatcgat acaacttgct 1200 cgtggacttg gatgatgatg agatgggcag gacaaataaa tcatttcaat acaagaaaaa 1260 cccaaaattg acaaggtgta aaatgggatc catatccata cctggggaaa tggatagtac 1320 aatgtcatat tcgccaatcc cctcaaatat ccaattatct aagatcaata tatccacatc 1380 aattccaagt aacacgggca tcacagattg caacaaatct ccaattccta tacctcttat 1440 aaagaagaac ctagacgata tattaaataa attaaaagag gacaaaacga aggattcgaa 1500 gataattgac ctcgacacat cccgtatttt ctctgaaaca gagaatgaag aggtcgatag 1560 ggcatcaaat atataccttg aaagagactc cgacatgagt cttttctcac aaacgccctc 1620 gcaatctgag aacaacaacg ccttaaatta tcggtctgat aaatctatca ttcaaagagt 1680 tctaaaagta aaacaaaaaa gtctaactgc tggaatacct cctttctctc gtaagaagaa 1740 gaatttcttc agaaatatca aacctgaaga cataaacagt mcaattatat atagattgga 1800 tagtaaggga aaagtgggat gtaaatcaaa ctggaagaaa ccagattgcg gtgaatttcc 1860 cgtatatgat tatgaaggat tagttgagca ygctatgatg aatcatgcyt caacatttaa 1920 tgaagaaacc ctcattgact gtctggtgtg tcacccaaaa aaaggtaaaa atatgcacat 1980 gatgacctta atcagattcg ctgatatttt taaccacatc atgatcaatg agcatcaggt 2040 tgatattgcc agacatgacc aaaagaagat atatctacac cttacaaagg aaaacctgct 2100 gcactgyaca tatcgcacaa ataacaagaa aatattgtgc aaaaaagtat ttaatattaa 2160 ctcaacgatg aatgatgtgc ttgagcacat ggcaatacac actggatatt gcttcgagca 2220 acagaaaaaa ataatgtgtt actgcgggat gtggaaamct tttgatgaat tgatcaaaca 2280 tatcaagaat cattcacatg aaygagttta taaactcagt tgaaaccact gaacatgagg 2340 tctcaaactg tatcaacagc atctctctca atgtttgatg gagtattggc ctcttgcgaa 2400 acccaaaata ttcctgatgt agatttawct ctaaaaccaa gaatcctgcc tgaaaatttg 2460 gcatttggaa aaaacttgga tattgaactt agttgatggt cccagcactt aatcaaagca 2520 tttgtatata cttttgctat aagaccttcc acaatttata tagatcctat tacttgcagt 2580 gctttgattc agtgcaatta yaaaactttc ttcgaaactt ttccttttaa agattttgct 2640 gattggaatg agatttttat gccaatctac ggcaaaaact cttcctcgtg gtcttttttc 2700 tttttggaca agagaaggcg cattgccatg attattgatc cgactgcgga tgatagtcat 2760 gctctgcatt ttgatttggc aaatgacatc cttagaacga tacttaacat cgaaaatata 2820 tttggcaagt tgaaatttgg actcactgag attgaatacc cgatatgtcg tgagtttggc 2880 ctgtcttctt tatatgtatg ccatttcata caatgcctta tacttggctc tgcattaaca 2940 attcctgatg tatcagccat gaggatactg atgagaccaa taataaataa atataattgt 3000 acaaagtttc aagacaatga tgcaaggaag tatcggatac taattgatga tctcgtgtat 3060 cagttggatc ttgatactat tacttgcgat gaaaccttgt gcgaaattga acgaattaat 3120 ggtagattga accctgagaa gtatttcaaa aaggaaaaac ctaaatctga catcatacac 3180 ctacgcttca agaaatctgc tgaactacta tgtgttaaga gattgaggtt ccaagctaac 3240 caaaggcaag aaatagcgaa aatttgggaa gatgatggca tagaacacag accacctata 3300 aatgcatttt taaaaacttt tgcaagttct gaatgtccaa tctcagacac gaaatcaatt 3360 accttacctt tctattctga caatgagtct aatcttacca cagacaacga aaacatgtcc 3420 catgtaatga arcaactcga caacactgca ccagggatgg atttaattac agttgcagac 3480 tggaagatta tatctccaaa gcatatcctg ataaccgcaa tctgcaacaa cgtgctgaga 3540 aatggtgtat gccctaaaaa atggaaacta ttcagaacgg tgctaatcct aaaaccaggg 3600 aaaaaggatg aaagctttaa aacaagctct tggagacctc tagcgatcat ggacacagca 3660 tacagaattt tcgcgacgct attgaataac cgtctcctaa cttggataaa gaaaggtaat 3720 ctaataagcc aaaaccagaa ggcaattgga attcctgatg gttgtgctga acacagtgct 3780 actttacatc tagcaatcga tcatgctaag agaaggaaaa aagaactcca tatagtatgg 3840 ttggatatca ctgatgcatt tggctctttg ccccatgacc ttatctggta cacgcttgaa 3900 ggcatgggaa tgaaaaagga aactatttac ctaataaaag agctatatag agacgtcagt 3960 accttctttg attgtcaagg gatgatatcg gaatctgtga acataacaaa aggggtcaaa 4020 caaggatgcc ctctgtctat gacactcttt tgtctgtcta ttgactacat ccttaagtca 4080 gtgctgaatg actgtccatt tcatttacat gatatgaaca tcagtatcct ggcttatgct 4140 gatgatattg tccttgtctc tgactccttc acaaagattg agatggcctt gaggaggacc 4200 gtgagattgg catctcatgc gaatctcaga ttcaaacctt caaaatcagg atacttgtca 4260 ataaataata ataaccacga tatctacaag ttgtatctat ataatgaaga ggtaccgata 4320 atatccaatg agaacaagta caaatacctt gggattgact tctcacataa acagaatcag 4380 gatattgacg gaagacttca atcggcactt gcactgacta actccctgtt taagtcatat 4440 ttacatccgg cacagaagct aaatgcctac aaaaccttca tacattcaaa actcattttt 4500 tcatttcgca actgtataat tggacacaga attcttgatt gtgacagaaa tcggattacg 4560 cagggacgag agaaacagct tggttatgac caaaagatta aggcgcttct gaagacaatg 4620 gttggggata aatttcatgc tctgaataat tactttctat atacccattg taaattggga 4680 ggacttggag tatcctcatc catagatgaa tatttgatac agagcgttac tggaataacg 4740 agactttttc attcatacaa cactgacttc agagagatga tgatgaaaga actcgcrcat 4800 gctagaggag aggagaactt tgaggctgcg cttaaatgga tcaactgcga gattgacaag 4860 cctttcaaaa acgcctcttt ctttgtgaaa ttccagaaat cggtatatgc tctcaaacga 4920 aaattcgaaa tatacatcaa attgaaattt gaggataata gtttcaaatt ggagatgaga 4980 tataaaaatc gcatttcata cattggttat cgtaacctcg gaacactctt tcaaaagacc 5040 tacacgactt tgtaggtctt cactatgctg agaaatggta tgaaatgaga gtgcagggac 5100 atattggaaa tgccattgga gacagtatta cggctaaata cctaatagcc agcgacatcc 5160 ttaatgatgc gcaatactgt ttcttggtaa aagctaggaa taacctgctc aatttgaatt 5220 acaatgcgta ccgcctaaaa ttcagcctag acacaatgtg cagactgtgc cacctggatg 5280 ctgagaccca ggcacatgta tttaaccact gtctagcaaa accaaatgcc cgaagaatta 5340 aacatgagaa tgttctaata agcatagttg ccttcctaga gaaaattggt tttgagattg 5400 atgtagaaaa atctcccaaa tatatctcag taccaacaaa gctgaagcct gacatggtaa 5460 ttaggtccaa taggaataaa gaaatacacg tcctggacct aaaagtgccc tatgactcgg 5520 tggaaggctt tgaaaaagcg cgggaagaca actatgtgaa atataaagat ttggcaattg 5580 aaatcggtaa ggcatttggt gaaaaggcta caatatccgc tgtggtgatc ggatgcttag 5640 gtacatggga caagaaaaac aatgccccat tatcaaagat tggcctgtct aaacctgaga 5700 ttgtgtctct tgcaaggata gcatgctcca aagcggtaat tgcatgctat catatatacc 5760 gagagcacat atcatataca aagagtacaa tggcactacc attttccatt taaatcagaa 5820 tgtatacgag gcaatactgg taattgaatc ggtattgcag acttgtgtga gtatgataaa 5880 aacaccatag tgtgattgaa atgctgagcc tagctcgcat atttagccga aaggccgctt 5940 ttgagataaa aacaaattyt gaaaaaaaaa aaaaaaaaaa aaaaa 5985 // ID DNA-1_SM repbase; DNA; INV; 492 BP. XX AC . XX DT 06-NOV-2008 (Rel. 13.11, Created) DT 06-NOV-2008 (Rel. 13.11, Last updated, Version 2) XX DE Putative DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-492 RA Jurka J.; RT "Non-autonomous DNA transposons from flatworms."; RL Repbase Reports 8(11), 1796-1796 (2008). XX DR [1] (Consensus) XX CC 3 bp TSD. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 492 BP; 169 A; 114 C; 71 G; 138 T; 0 other; gggagcatcc attaaggacg tccccccaaa aataagcaat ttttgatccc cctccccctt 60 gtcccttttc acgagcaccc cccgggggac atccctccaa aaatagatat cctttatttt 120 cagaagaaat ttaaataatt tataagcata taggctaaat ttaatttcaa cattatatgt 180 agtagtacac aatattggtt gaaaaataga gaaaaaaaca tgcatacata attaaaatga 240 tcaaatggga aaaatagggg acatgaaaca tgaaacatgc cgcttactta cttaccatat 300 atagtatata aaaaagataa tagaaaaatt ttaaaaaatc attaatccat tatgtataat 360 atttaaacga ccaatgctgc tggggatgtc ctaagcactt gaaccctccc cctccctagt 420 gtccccattt gtcccttttg cgaagacccc ctccctcccc cttcttatgg gacgtcctta 480 atggatgctc cc 492 // ID Gypsy-7_AC-I repbase; DNA; INV; 4320 BP. XX AC AASC02007709; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_AC_; KW Gypsy-7_AC-LTR; Gypsy-7_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4320 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02007709; Positions 15460 19779. XX CC Positions [3333-3851] - Integrase core CC 'CCAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1123..2403 FT /product="Gypsy-7_AC-I_1p" FT /translation="MQVYVTASTKLPTGKRPCRLLGVAASDPRKTNTLTLS FT DRHTGEIFLVDTGAEISVFPASRQDRQHKIMSEPLAAANGTRIRTWGKRTF FT TIHLGTNHIYSHEFVLAEVTRPILGADFFTKHNLVIDLRQKRLLSLDKASI FT KLKDSRQILTEGGLSWSPHNEYSKIIAKEFPEILIPQFTSHINKHGIEHHI FT VTQGPPTSARARRLDTQKLEAAKEEFFNMEKMGVIQRSKSPWASPLHMVPK FT KDRSWRPCGDYRRLNGQTLDDRYPLPHIQDFNIKLSGCTFFSKVDLVRGYH FT QIPMSESSIPKTAIITPFGLWEFTRMPFGLKNAAQSFQRLMDGVLRDIPFT FT FVYLDDILVASHSEKEHKEHLRVLFEILKDNGLVINKAKCIFGVRRWISWD FT IASPRKVFFPYQTASTHSATVSHQKIVQVYNVS" FT CDS 2268..4196 FT /product="Gypsy-7_AC-I_2p" FT /translation="MYFWSTTLDFLGHRITTKGIFPIPDRINTLRNCEPPK FT DRTSLQRFLGMINYYHRFLPNIASHLAPLHAQASGKGQKIEWDDSCQAAFD FT KAKATLATATLLHHARPDAPTNVTVDASDKAIGAQLEQRHGRTWVPIAFFS FT RKLSDAEQKYSAFDRELLAAYSAIKHFKYFLEGRAFKLFTDHKPLTTAFQS FT QADRSPRQTRHMSYIAEFTSDIQHIKGKFNVVADALSRICSTQAVPDGFPL FT GTPSQLAKAQSDSEEMDTYLSKDTGLEFRYVESDGARVLCDISTGTARPVI FT PSSWTRKVFDHVHSMCHSGVKPTQRAITERFVWFDMKRQIRQWCKECHPCQ FT SSKIHHHTRAPLVNRAPPTSRFKSLHIDLVGPLPNCHGMSYLLTVVDCFTK FT WPDAIPLPDAQATTCASALLHHWISKFGVPEEITSDRGPQFTSTLWSEFNK FT LLGVEHHDTTAYHPQANGMVEKFHRQLKASLKSRVTGPNWFNELPVILLGI FT RSSWKVDPGCSPAQLVYGTTLRLPGEFFQPRNAQTIEPDFIFLQQLQKTMR FT SMEPTVSNYHGSQTSHVPSNLAATGYVYVRHDAQRKPLQRPYDGPFKILNT FT HDKYFTLDLKGRKEKVSVVRLKPAYISNANNQLGKVFEHPNASTH" XX SQ Sequence 4320 BP; 1302 A; 1056 C; 912 G; 1050 T; 0 other; ttggtgaccc cgacaaagga atatcgccac aagtcacgaa caactgtcgt cgacataata 60 ttctagacat taccccccag tgctttcttc actgaccacg tacttcgcgg acaatcccgg 120 taagatctgt ttctattagc ctacaatttt acttctagtt agttatctct agatctggat 180 ccctgcttac aatgtgcact gctggcattt attgacgtcg ctcctaagct tactgaaatg 240 caatatttgt ctggccatta ctagattcta aattctagct ctaagtctaa caaaagcttc 300 acagacattg tagaaattaa tcgccgtttt aagtcctata tctagcctag actagacatc 360 tggagtcaag ctaaccatgg cttcagacaa ggagataaat gccgcatcag tcaaaattcc 420 agaattctgg aacaagtccc ccgaagtctg gtttgccaaa gtagaggcac agtttggcac 480 taaaaatatc acccaagacc agaccaaata tgactacttg gttagtgcct tagatatgga 540 aacggctgag gaaattcagg ccattttatt acacccacct gcgagagaga aatacaattc 600 gctcaagtca atgttaatat caacatttgg gaaatcacag attcaaaaag acatggaact 660 tttgaatctc aatggccttg gagaccgtaa acccacagct ttgcttcgaa agattaatgc 720 cctaaatgac gacccccaaa cgcttaagcg cgcgttattt ctgacaaatt tgccctctga 780 aatgagaaca atcttggcag cccagaacat agcggatatt caggccctcg cgaaagctgc 840 ggatcatgtt tgggaggcta ggtttgccgc cggggctaga caaactgtcg attcagtgga 900 aaccgagttc gtggagagca atagcgaggc ggttacccca gtaacttacg ccgccaagtc 960 aagaagaccg actaaacaac ccaccccaga tagaaatttc cagaaaactt ttaaagaaaa 1020 gaacagcagt agacctggat ctaactcgca ttcacacaga caagctcaca tatgctatta 1080 tcattcacgt ttcggaactg atgcccgaag atgtgaagcc ggatgcaagt ttatgtcact 1140 gcttccacaa agctcccaac agggaaacgg ccttgtcggt tgctaggcgt ggcagcttcc 1200 gacccaagga aaacaaacac gctcaccttg tcggatcgtc acacaggtga aattttcttg 1260 gtagacacag gggcagaaat ttccgttttt cctgctagta gacaagacag acaacataaa 1320 ataatgtcag aacccctagc tgcagccaac ggcacgcgca ttaggacttg gggtaaaaga 1380 actttcacaa ttcacttggg aaccaatcac atttattccc acgagtttgt cttggctgag 1440 gtaaccagac ctatcttagg cgcagatttc tttaccaaac acaacttagt gatagatcta 1500 agacaaaaac gattgttatc attggacaaa gcttcgatca agcttaaaga cagtcgccag 1560 atcctcacag agggcggttt gtcctggtcg ccgcacaacg aatactcaaa gatcatcgcg 1620 aaagaattcc cagaaatact aatcccgcaa ttcacttctc atatcaacaa gcatggtatt 1680 gaacatcata tcgtcacgca aggtccaccc actagcgcac gggcaagacg tttagacacc 1740 cagaaattag aggcggcaaa agaagagttt ttcaacatgg agaagatggg ggtcattcaa 1800 aggtccaagt cgccatgggc gtcaccgcta cacatggtac ccaaaaaaga tagatcgtgg 1860 agaccatgtg gggattacag gcgactgaat ggtcaaacac ttgatgacag gtatccccta 1920 ccccacatac aggatttcaa catcaaactc tcagggtgca catttttctc caaagtagat 1980 ttggtcagag gttaccacca aattccgatg tcagaatcct ccatccccaa aacagcaatt 2040 atcacacctt ttggtttgtg ggagtttacc agaatgccgt ttggtttgaa gaatgcagcc 2100 caatcattcc aacgtctaat ggacggtgtg ctgagagata tacccttcac gttcgtttat 2160 ctggatgaca ttttagtggc aagtcattca gaaaaggagc ataaagaaca cctcagggta 2220 ttgtttgaaa tcctgaaaga caatgggcta gtgatcaaca aagctaaatg tatttttgga 2280 gtacgacgct ggatttcctg ggacatcgca tcaccacgaa aggtattttt cccataccag 2340 accgcatcaa cacactccgc aactgtgagc caccaaaaga tcgtacaagt ctacaacgtt 2400 tcttaggcat gatcaactat tatcacagat tcctgcccaa catcgcctca catttggctc 2460 cattacatgc ccaagcaagt ggtaaaggac aaaagattga atgggatgac agctgtcaag 2520 cagcttttga caaagcgaag gcaaccttag ccacggcaac ccttcttcat cacgctcgac 2580 ctgacgctcc aaccaatgtc acagtagacg cctctgacaa agctataggc gcacagttag 2640 aacaacgaca tggtcgcact tgggtaccca tagcgttttt ctctcggaaa ctctcagatg 2700 ctgaacagaa atatagtgct tttgacaggg agttactggc tgcatacagc gctatcaaac 2760 acttcaaata ctttttggaa ggaagagctt tcaaattgtt cactgatcat aagccactca 2820 ccacagcatt tcaatcccag gcagaccgat ccccacgcca aactcgtcac atgtcatata 2880 tagccgagtt tacctcggat atacaacaca tcaagggtaa atttaatgtg gtggcagacg 2940 ctctatccag gatttgcagc acacaggccg ttccagatgg tttcccctta ggaaccccta 3000 gccagctggc taaagctcag agtgattcgg aagaaatgga cacctatctc tccaaagaca 3060 ccggtctgga gtttcgatat gtggagtctg atggtgcgag agtactttgt gatatttcaa 3120 caggaacagc gaggccggtc ataccttcat cgtggacaag aaaagtgttc gatcacgttc 3180 atagtatgtg tcattctggg gttaagccaa cacagagagc cattactgaa cgttttgtgt 3240 ggttcgatat gaagcgacag atacgacagt ggtgtaaaga atgtcatccg tgccaatctt 3300 cgaagattca tcaccacact cgagcgccct tggtaaatag agcaccccct acctcgaggt 3360 tcaaaagttt gcatattgat ctagtcgggc cgcttccaaa ctgtcatggt atgtcctacc 3420 ttcttacagt agtagactgt ttcactaagt ggccggatgc gataccctta ccagacgccc 3480 aagctacaac gtgtgcatcc gcccttttac atcattggat atcaaagttc ggagtcccag 3540 aagaaataac atctgataga gggccacaat tcacttcaac cttatggtca gagttcaaca 3600 agctcttagg agtggaacat catgatacca cagcatacca cccacaagca aatggcatgg 3660 ttgaaaagtt tcatcgccag ctcaaggcat ctttgaagtc tcgggtcact ggacctaatt 3720 ggttcaacga gttaccagtc atattgctag ggataaggtc ctcgtggaag gtagatccag 3780 gttgttcgcc agcacagtta gtgtatggga ccacactcag actaccaggc gaatttttcc 3840 aaccacggaa tgcccagaca atagaacccg atttcatttt tcttcagcaa ttacaaaaga 3900 caatgcgcag tatggaacca acagtttcca actaccacgg gagtcagacg tcccatgtac 3960 ctagcaatct tgctgcaacg ggatatgttt atgttcgcca cgacgcgcag agaaaaccac 4020 ttcagagacc gtatgatggt ccattcaaga tcttgaatac ccatgacaaa tatttcactt 4080 tggatttgaa aggacgcaaa gaaaaagtat cagttgttcg tttaaagcca gcatacattt 4140 caaatgccaa caatcaactt ggaaaagttt ttgagcaccc taacgcatcc acccactagt 4200 ccacacggac gggacgtaca gtctggcctc cagagaggtt ggacctgtag ggataaacaa 4260 ttatttcgtg catagtgcat cgtactatcc gcactgtgca cagcactggg agggaggtaa 4320 // ID CR1-65_HM repbase; DNA; INV; 4362 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 24-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-65_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4362 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1892-1892 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 168..932 FT /product="CR1-65_HM_1p" FT /translation="MATLSEIKKMLKEMFMDFKKEMQNQTEALLKQQEKNV FT LDIISGNTKIINKRLDKIENILNENISKIKILEKDVADVKLSLNFQENLID FT QKVAACHNSFEKELTCLQLLKNQQRNVEDRTRRDNLRIDGLPENDKETWSQ FT TEEKVKIFLEKNLGLLGIDIERAHRTGIKKDNRTRSIVMKLKSYKDKMKIL FT KETNRLKGSNIYVNEDFSKMTVDIRKKLFAEVKERRLKGENLAVRYDKIIH FT LKNAVKYSINVPK*" FT CDS join(1001..2128,2195..4054) FT /product="CR1-65_HM_2p" FT /translation="MAETINYNFFKTFSGENVRHFSKNIDPDVNILDNYSL FT ETNYFKVHEVTSILNKSLHYISLLNINIRSMSKNFENLKLMLNNISIKFSI FT ICLTETWCQSEDMSNSNLHLTDYKSVHQPRKNGIGGGVCIFIHNSLTYKKV FT DNLSVNNSNCESLTIEIINQKEKNSFITVLYRPPNGSYNQFENDLRRILTQ FT TSKKHLYLLGDFNLNLINLNTDNHVKNFINTLSQFSAYPMINKPTRVTRKT FT SSIIDNIITNNYSNSTINSGIIKTDISDHFPVFLTTNTKCCINKKPITVYI FT RHVNKGSIAIFRQLLREINWELLNDCNDTCSAYDFFIRVFTRQYEKAFPKV FT QKTIKSKSVHSPWMTKGLLRSSKKKKKSFTKNIKSNHKKNFYSGLLQKSFG FT NAKKTWTIIKEITGKTKITSSNFPNRLKTDKGEIFNKKTIAERLNNFFINV FT GKNLAKEISPSSKTFQSFLKKSDFIMDNSELSIEELRSAFDMLQKNKSAGL FT DEINTNVLKSVFDIIELPLFIIFKLSLKNGDFPDQLKLAKIIPIYKNGDDS FT LESNYRPISILSCFSKVLERIMYNRIYNFIDKNNILYPKQFGFRRNHCTEH FT AVMDLASNILKGFDGNNYTLGVFVDLSKAFDTVDHEILLYKLKNYGIQNTN FT IKWLESYLQNRKQCVTYDLTYTQLEVISCGVPQGSILGPLLFLLYINDIHL FT SSKFLNFVLFADDTNIFFSGSNLKFVFSTVNTELINLNEWFKANKLSVNID FT KTKYILFTKPSKTDNIPLKLPDLFINNIKVKRVHSMKILGIIFDDHLNWKN FT HIQFVENNVSKALGILFKTKHLLNIMCLKSLYFSFIHSHISYCNIAWASNY FT SSSLKKINTKQKQASRLILNAGRYACAEPLLKKINVLNIYKLNIYQNLLFM FT FKIYNNMLPTYFQPHFTLINHKYSTRYSISNFLIPTTSLRKTDFSITCRGP FT RLWNYFLNRDMKNMTSIKLFKRAVKLYLLDLNNKNILLYF*" XX SQ Sequence 4362 BP; 1718 A; 629 C; 599 G; 1414 T; 2 other; tttttttgca actcagcttg cgtagtgaac agatgtgttt wtaacagctt tcawaaatct 60 ttctaaattt gcaaacacga aaaaaaaata tatataaact aacatttttt tttcgtattt 120 tgtaaaagtg tagatatttt tgtttttgaa ctcgattttt ctatatcatg gcaacattat 180 cagaaattaa aaaaatgtta aaagaaatgt tcatggactt caaaaaagaa atgcaaaatc 240 aaacggaagc attactcaaa caacaagaaa aaaacgtatt agatattata agcggaaata 300 cgaaaataat taacaagcga ctggataaaa ttgaaaacat tctaaatgaa aacataagta 360 aaattaaaat actcgagaaa gatgttgcag atgttaaatt aagtttaaat tttcaagaaa 420 accttattga tcaaaaagtt gcggcttgcc acaacagctt tgaaaaggaa ttaacttgtt 480 tacaactctt aaagaatcaa caaagaaacg ttgaagatag aacaagaaga gataacttaa 540 gaatcgatgg gttaccggaa aatgataaag aaacttggtc acaaacggaa gaaaaagtaa 600 aaatatttct tgaaaaaaat ctaggtttgt tgggaatcga cattgaacgt gcacatcgta 660 ctggaattaa aaaagataat cgtacaagat caatcgtgat gaaattaaaa agttataaag 720 ataaaatgaa aatattgaaa gaaacgaata gattaaaagg atcaaatata tatgtcaacg 780 aagatttctc taaaatgaca gttgatatta ggaaaaagtt gtttgcagaa gttaaagaac 840 gacgtctaaa gggagaaaac cttgcagtga ggtatgacaa aattattcat ttaaaaaacg 900 cggtaaaata ctcgatcaat gttccaaagt aaaagtaatg cagtttttat ttatttaact 960 ttttttaatt tttattttta tttttttctt gtctttataa atggcggaaa caattaatta 1020 taattttttt aaaacatttt ctggtgaaaa cgtacgtcac ttttcaaaga acattgatcc 1080 tgacgtaaac attcttgaca attattcatt agaaactaac tattttaaag ttcatgaggt 1140 tacaagtatt ttaaataaat cactacatta catatcatta ctaaatataa atattagaag 1200 catgagcaaa aattttgaaa atctaaaatt aatgctaaat aatataagta ttaaattcag 1260 tataatctgt cttacagaaa cctggtgtca gagtgaagat atgagtaact ctaatttgca 1320 tttaacagat tacaagtctg ttcaccaacc aagaaaaaac ggtattggcg gaggcgtatg 1380 catatttatt cacaactcgt tgacgtataa aaaagtagat aatctaagtg tgaataactc 1440 aaattgcgaa tcattaacta tagagataat aaaccaaaaa gaaaaaaatt cgtttataac 1500 agttttatat cgaccaccta atggcagtta caatcaattt gaaaatgact taaggcgaat 1560 attaactcaa acctctaaaa aacatctata cttactcgga gactttaatt taaacttgat 1620 aaatttaaac actgataacc acgtcaaaaa tttcataaat actctttctc aatttagtgc 1680 ttaccccatg attaacaaac cgacgcgtgt aactcgaaaa acatcatcta tcattgacaa 1740 tattatcact aataattata gtaactcgac aatcaacagt ggtataataa aaacggatat 1800 cagtgaccat ttccctgttt ttttaactac aaatactaaa tgctgtataa ataaaaaacc 1860 cataacagta tacataaggc atgtaaacaa aggctcgatt gcgattttta gacagctttt 1920 gcgtgaaatc aattgggagt tattgaatga ttgcaatgac acatgtagcg catacgattt 1980 ctttatccgc gtctttacaa ggcaatatga aaaggctttt cctaaagttc agaaaactat 2040 aaaatccaaa agtgtgcata gcccgtggat gactaaaggt cttttaagat catcaaaaaa 2100 aaaaaaaaaa agctttacga aaaatattta aaaaacaaaa actatgaaaa cgaaactaaa 2160 tataaaaaat acaaaaacta ttttgaaaaa ataaaaaagc aatcataaaa aaaacttcta 2220 ttctggacta cttcagaaat catttggaaa cgctaaaaaa acttggacta taataaaaga 2280 aataacaggg aaaacaaaaa ttacatctag caacttccct aatcgactga aaactgataa 2340 aggtgaaatt tttaataaaa aaacaattgc tgagagactg aataattttt ttataaatgt 2400 tgggaaaaat ttagccaaag aaatctctcc aagtagcaaa acattccagt cttttttgaa 2460 aaaatcagat tttattatgg acaattcgga actttccatt gaagaacttc gttcagcttt 2520 tgacatgctt cagaaaaata aaagtgcagg tcttgatgaa attaatacaa atgttttgaa 2580 aagtgtcttt gatattatcg aattgcctct tttcattatt tttaaacttt cattaaaaaa 2640 cggggacttt cctgatcaat taaaacttgc taaaataatt ccaatttata aaaacggtga 2700 cgattcttta gaatcaaact acagacccat ctcaattctt tcctgttttt caaaggtact 2760 tgaacgtatt atgtataata gaatatataa cttcatcgat aaaaacaata ttttatatcc 2820 taaacagttt ggatttcgga gaaatcattg cactgagcat gctgtgatgg atttggctag 2880 caatattttg aaaggatttg atggtaataa ttatacactt ggagtttttg ttgatttgtc 2940 gaaggcgttc gacactgtcg accacgaaat ccttctctat aaattaaaaa attatggaat 3000 tcaaaatact aatataaaat ggctcgagtc ttatttacaa aatcgtaaac aatgtgtgac 3060 gtatgattta acttatacgc agttagaagt tattagttgt ggtgtccctc agggctcaat 3120 acttggtcct ctgttgtttc tcttatatat taatgacatc cacttatcct ctaaattcct 3180 taattttgtt ctttttgctg atgatactaa tattttcttt tcggggtcta atttaaaatt 3240 tgttttttct actgtaaaca cagagctaat taatctcaat gaatggttta aagccaataa 3300 actgtccgta aacattgata aaactaaata cattctgttc acaaaacctt cgaaaactga 3360 taacataccc ctgaaacttc ctgatttatt tataaacaac attaaagtaa aaagagttca 3420 ttcaatgaaa atcctaggta taatatttga cgatcattta aactggaaaa atcatatcca 3480 atttgtggaa aacaacgttt caaaagctct tgggatcctt ttcaaaacga agcacctttt 3540 gaatataatg tgcttaaaaa gcttgtattt ttcatttatt catagtcaca ttagttattg 3600 caatatagcg tgggcaagta attattcatc ctctttaaaa aaaattaata ccaaacaaaa 3660 acaagcgagc cggttaattt tgaatgcagg tagatatgca tgcgccgagc ctttacttaa 3720 aaaaattaat gttttaaata tttataaact aaacatctat caaaatctgt tatttatgtt 3780 taaaatttat aacaacatgc ttccaacata ttttcagcct catttcacct taattaatca 3840 caaatactct acaagatatt ccattagcaa ttttctcata cctacaacat cccttagaaa 3900 aacagacttc tcaataacat gtcgaggacc acgtttgtgg aactatttcc tgaatagaga 3960 catgaaaaac atgacttcaa ttaaactatt taaaagagcc gttaaactat acttattaga 4020 tttaaataat aaaaacattc ttttgtattt ttaaaaaaac atttatatat tgatttgtta 4080 acacgtatat atgcttgtgg atcctatgac ttatatattt tgtgaactgt gatgtgttgt 4140 attatgtaca aaacttgtaa tatttataaa acttatacaa cgtatatttt aagaagttaa 4200 tacgtaaaac tttattgata ttttcacgaa cttaggagca gggtgataag acacattgtc 4260 ttctgcttgc tcctgtcaga atataaatat ttaattttta atttttacaa aaaacattgt 4320 aaaatgattg tttattgacg aaatatatat atatatatat at 4362 // ID BEL-43_AA-I repbase; DNA; INV; 6115 BP. XX AC AAGE02017492; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-43_AA_; KW BEL-43_AA-LTR; BEL-43_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6115 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017492; Positions 28301 34415. XX CC Positions [5137-5724] - Integrase core CC 'CCAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 196..1608 FT /product="BEL-43_AA-I_1p" FT /translation="MALTAHTPKGKKKGKNMADDANIGNENEDDSSRTLTI FT SEQLIQQKFKEVTRQKKEEFDVLLKKKETVARKLTRIQVALQRPEAADKHY FT LQLQVKMLESAYGEYSELQNHIYDLNTSEEVRCAEEMRFIEFEELYSVLYV FT QLTKQIDAVVKNETEPALLQPVHNQPHIPPLKAPLPTFDGNFENWFAFKNM FT FQNVMARYENEAPAIKLYHLRNSLIGAAAGVIDQDIINNNDYDAAWETLRE FT RFEDKQVIVDKHIDAMFNIPAMTKESAVSLRKLIDTVSKNVDALKNLELPV FT QGLGEMMLLNVLAKKLDLETRKAWGLNQKDNELADYQSTMEFIKERCKVYE FT KISRSSKATVEVVKQVRSAGKTDSKVHSLVATNEKCTHCKGDHEIWKCELF FT KKVNLSERYNSLRKSGSCFNCLERGHITGKCKSERSCKQCGKRHHTSLHPV FT DTYPGDSTTTKCRLEERSIEAVDRGR" FT CDS 4588..6084 FT /product="BEL-43_AA-I_2p" FT /translation="MFDNVSRFSIMQRAMAYVIRFTDYIRSGRQQLTKGLP FT TVDEMKRALLLIIRLIQKECFSDEIRALEEEKEFKYPLKCLNPFIDEKDDT FT LRVGGRIRHAQIPYGSKHQLLLPSRHPVTVAIIRYLHKANMHIGQRALLAV FT VRQQFWPLRAKNVIRDVIHKCTPCYRAHPKRATQLMGDLPEYRVQAAYPFC FT NVGIDFAGPFTMRAAVSTKKSLITKGYVCVFVCMATRAIHLEAASNLSTDA FT FLASLQRFVSRRGLPTKIFSDNATNFVGANNELTKLADLFQTEMHQKKLNG FT FCVQRNIEWSFIPPRSPHFGGIWEAGVKSAKFHLRPILADHKLSYEELTTV FT LAQIEATLNSRPLVPSSDDPNDMTAITPAHFLIGREFQAVLEPSYAHIKTG FT RLSRWQVLQDLKQKYWRTWSTDYLQELQRRQRDFKTTKFKVGALVLIVDDN FT LPPLQWSLARITELHPGKDGHVRVVTLQTKDSVIKRAVKYVCLLPLDEEES FT AEI" FT CDS 1961..3949 FT /product="BEL-43_AA-I_3p" FT /translation="MQPFEIAQLNIPEDLQLADPGFNVPGQVDILIGSGLF FT FKLIKHGQLQLADHLPAVQETSLGWIVSGLIPTSQMGVGGTLCTIVTQDDI FT GKLLERFWHFDSYDEAAPVLERSLEDVCVAHFLETHRRDENGRFVVRLPFN FT EAKEKLGDSETMARKRFFAVERRLDKDPDLKRQYVDFMSEYAELGHMVEVT FT PSQNENPTDAFYLPHHCVLKPTSSTTKLRVVFDGSAESPTGVSVNQTQMVG FT PTVQNDLVSIHLRFRTFPYAISADIPKMYRQVRMDDDFLRVFWRSNRDDPL FT RVYALKTVTYGLASSPFLATMALRQLADEEEHRYPLAASTVKKSFYIDDML FT AGANSLEEAVELLRQVTGLLHDGGFDIHKVCSNSKELLDMVPENKREKLGV FT IDDAAINCLMKTLGIVWNPVCDVFTFRIAEVLPATQLTKRVILSEISRIFD FT PLGFLGPVLTAAKLIMRELWLLDLHWGDALPQDLVELWMEFRDQLQSLNGL FT EIPRCVLNRDCIGIELHGFADASDLAFGACLYARSVFRDGKAEMRLICSKS FT RILPKKQGTKKEVTTPRAELQAALLLARLAVKLIGAMEIQFASVVLWSDSQ FT IVLCWIKKSPDSLKIYVGNRVKEIQMLTNEFVWRYIPSKLNPADLISRGVQ FT PNRLREQQVWWTGL" XX SQ Sequence 6115 BP; 1791 A; 1272 C; 1556 G; 1496 T; 0 other; ttttttggtc catgcgagcc gaatgtgtgg ctgaaatcgg aaaatcgaat cgatcgcatt 60 tcctagcgtg gaaattgcga aatcgcgaag tgaaaaccgt gtattgaaca gtttgagtag 120 tgaaaaggaa cgccattgaa tcggcgtata cgtttgatag tgtctgggaa caaagagtgc 180 gaacgcgaaa acaaaatggc gctaactgcg cacactccga aaggcaaaaa gaagggaaaa 240 aatatggcgg acgatgcgaa tatcgggaac gagaatgaag atgattcgtc gcgtacgctc 300 accattagcg agcaactcat tcagcagaaa tttaaggaag tgacccgaca aaagaaagaa 360 gaatttgatg tgttgctaaa gaagaaagaa acggttgcgc gaaaactgac ccgtattcaa 420 gttgctctgc aaaggcccga agcagcagac aagcattatc tgcaacttca ggtgaaaatg 480 ttggaaagtg catacggtga atacagtgag ttgcagaatc acatatacga cctaaacacc 540 agtgaagaag tgcgttgcgc ggaagaaatg cggtttattg agttcgaaga gctttacagt 600 gtgctttacg ttcagctgac gaagcaaatc gacgccgtag tgaagaacga gacggaaccg 660 gcattattgc aacccgttca taatcagccg catatccctc cgttgaaggc tccgttgcca 720 acatttgacg gcaatttcga aaattggttc gcgttcaaaa atatgttcca gaatgtgatg 780 gcacgttacg aaaatgaagc tccggccatc aagctgtacc atttacgcaa ttcgttgatt 840 ggtgcagcag ccggcgtgat agaccaggac atcataaaca ataacgacta tgatgcggcc 900 tgggagacgc tccgagagcg tttcgaagac aagcaagtga tcgtcgacaa gcatattgat 960 gcgatgttca acataccggc catgacgaag gaaagtgcag ttagtcttcg aaagctgatc 1020 gacacagttt cgaagaacgt cgatgctcta aagaacctgg aactgccggt gcaaggatta 1080 ggagaaatga tgctgttgaa cgtgctggca aagaagctcg accttgaaac tcgcaaggcg 1140 tggggattga accagaagga caatgaattg gcggattacc aatcgacgat ggaattcatc 1200 aaagaaaggt gtaaagtgta cgaaaaaatc agccgtagca gcaaggctac agtggaagtg 1260 gtgaagcaag tgcgatccgc tgggaaaaca gactcaaaag tgcatagcct tgttgctacc 1320 aatgaaaagt gtacgcactg caaaggagat catgaaatct ggaagtgtga actgtttaaa 1380 aaagtgaact tgagtgaaag gtacaattca ctacgcaaga gtggatcctg tttcaattgt 1440 ttggaacgag gacacattac aggaaagtgc aaatccgaac ggtcgtgcaa gcagtgtggc 1500 aaacgccacc atacgtcact gcatcccgtc gatacgtatc cgggagattc aacaacaacc 1560 aagtgtcgcc tcgaagagcg aagcatcgaa gccgtcgaca gaggtcgcta atactacggc 1620 agcaccatcg actaatggat ctgttctttg ttcgaacgtc gaagctgatc aggacacgct 1680 actggcaaca gccattgcac ttatccacgg tgctagaaaa cgaacggtgc agtgccgagc 1740 agtattggac tcagcttcgc acaagcattt cattacggaa gcattggtag ctaagctcgg 1800 attgaaacgg aagaaggcta actatacgat cgttggcatt ggaggtaacc agctagctat 1860 ccagcacaag gtgcatgcga ggatcaaatc gaatgttagt gaatacgagt cacagtgcct 1920 tgaattcctg gtcgtcaaga aaatcacggg tgatcttccg atgcagcctt tcgaaatagc 1980 ccagctgaac ataccagaag accttcagtt ggctgatcct ggattcaatg tgcccggtca 2040 ggtcgacatt ctgattggat cgggtttatt tttcaagctc atcaaacacg gacaactgca 2100 gttggcagat catttgcctg ctgtacaaga aacttcgctt ggttggattg taagtggttt 2160 gattccaact agccagatgg gcgttggcgg tacgctctgc acgatagtaa cccaagatga 2220 catcggtaag ttacttgaac ggttttggca ttttgattcc tatgacgaag cagctccggt 2280 gcttgagcgt tcacttgagg acgtttgcgt cgcgcatttt ctcgaaacgc accgacgtga 2340 cgaaaacgga agatttgtcg tgcggctccc tttcaacgag gcaaaggaga agctgggcga 2400 ttccgaaacc atggccagga aaaggttttt tgcagttgag cgaagactgg ataaggatcc 2460 ggatctgaaa cgacaatacg ttgacttcat gagcgagtat gcagaattgg ggcatatggt 2520 agaagttacg ccttcgcaaa atgaaaatcc aacagatgcg ttttatttgc cccaccattg 2580 tgtactaaag ccgaccagct caaccacaaa acttcgcgtg gtgttcgacg gttcggcgga 2640 atcgccgaca ggagtgtccg tcaatcaaac gcaaatggtt ggcccgacag tacaaaatga 2700 tttggtgtcc atacatttga ggtttcgaac gttcccgtat gcaatcagcg cagatattcc 2760 gaaaatgtat cgacaagttc ggatggacga tgattttctg cgggttttct ggcgaagcaa 2820 ccgtgatgat ccgttgaggg tatatgcgct gaaaacagtt acatacggtc ttgcatcatc 2880 gcccttcttg gcaacgatgg cactgcgaca gctagctgat gaagaagaac atagatatcc 2940 actggccgca tcaaccgtga agaaatcctt ttatatcgac gatatgcttg ccggtgcgaa 3000 ttctttggaa gaggctgtgg aacttttgcg acaggtgacc ggcttgctgc acgacggagg 3060 gttcgacata cacaaagtgt gttcgaattc aaaggagcta ttggacatgg ttcccgagaa 3120 caaacgagag aagcttggag tgattgatga cgccgctatc aattgtttga tgaaaacact 3180 tggaattgtg tggaatcctg tctgtgatgt attcacattc cgcattgcag aagttctccc 3240 agcgacacaa ttgaccaaac gggtgatatt gagtgaaatt tctagaattt tcgacccgct 3300 tggttttctt ggcccagtgc tcacagcggc caaattgatc atgagggagt tgtggcttct 3360 cgatctccat tggggcgatg ctttgccaca ggacttggtg gagctttgga tggagtttcg 3420 tgaccaattg cagtcattaa acggactgga gatcccacgt tgtgtgttga atcgggattg 3480 cattggtatt gaacttcatg gatttgccga tgcatccgat ctagcatttg gagcttgtct 3540 ctacgcccgg tcagtgttcc gagatggaaa ggctgaaatg agattgattt gtagcaaatc 3600 gagaatacta ccgaagaaac aaggaacgaa aaaggaagta actacaccgc gtgctgaact 3660 gcaagcagct cttctattgg ctcgtcttgc cgtgaaattg attggtgcca tggagattca 3720 atttgcgtcc gttgtcctat ggagcgattc tcagatagta ctctgctgga tcaagaagtc 3780 tccggattcg ttgaaaatct acgttggaaa tcgagtgaag gaaatccaga tgctgacgaa 3840 cgagtttgtt tggaggtaca ttccatcgaa attgaatcct gctgacctca tttcaagagg 3900 ggttcagccg aaccgactgc gagaacagca agtttggtgg accggactgt aggagatata 3960 cgtgcttatt tataagttaa gaaaatatat ttcatagaat tgtgcaataa tattgagtac 4020 aattacgaat agattgtttc ttcagttaat ttataataaa acgttattta tctgaacaaa 4080 aaataataaa acaaatttta catacatgta attccaagct aagacaattg gcaaaaatct 4140 tcaaatcatc gaataatgta ttcgacaaca ctgcatcgat agagagcgaa acagatcttg 4200 aagagagcga gcttccatac gtgcgctcac ctgacggctg atgcggaagc gaagcgatga 4260 atcacgaaca caacgacgga gaggggaaac gaacaagaga aaaagcattg aaaatgtaga 4320 tgaagggaac tgtgaacggc tgaatgagat cagttttgag atgtatgttg tgaagtgcgg 4380 atatgtgtaa tttgttgaaa ataagtaaaa aaaaaaaaaa aagtttgaaa caaatagtgt 4440 ttttattttg aagaattcct acacggaccg gacgccctga agcaagcgaa ctttcagttg 4500 gatgatccac ccctgattca ggaagcttct attcccgaac tgcgaggaat tactctgttc 4560 agcacgagcg tttgccaacg attgaaaatg tttgacaacg tgagccggtt ttccataatg 4620 cagagagcca tggcttacgt gattcgtttt acggactaca tcagaagcgg acggcagcaa 4680 cttacaaaag ggttgcctac tgttgatgaa atgaaacgtg ctctgttgct gattatacga 4740 ctgatccaga aggaatgctt cagtgacgaa attcgtgcgt tggaggagga gaaggaattc 4800 aagtatccat tgaaatgttt gaatccgttt atcgatgaga aggacgatac tctgcgtgta 4860 ggaggacgaa ttcgacatgc ccagattcca tatggcagca agcaccagct tctgctgcct 4920 tctagacacc cagtaaccgt tgcaatcatt cgatatctgc ataaggccaa catgcacatc 4980 ggtcaacgag ctttgcttgc tgttgtgagg caacagtttt ggcctcttcg agcgaaaaat 5040 gtaattcgtg acgtcatcca taagtgcacc ccatgctatc gtgcccatcc gaaaagagct 5100 acgcagttga tgggagactt gccggagtac agggtccaag ctgcgtatcc attttgcaac 5160 gtcggaatag atttcgccgg gccgttcacg atgagagcgg ctgtgtctac aaagaaaagt 5220 ttgattacta aggggtacgt gtgtgttttt gtgtgtatgg ccacgcgtgc tatacatttg 5280 gaggcagcgt caaatctctc gacggacgca ttcttggcct cacttcaacg gttcgtcagt 5340 cgaagggggt tacccacgaa gatcttttcg gataatgcca ccaattttgt tggtgcgaat 5400 aatgagttga cgaaacttgc cgatcttttc caaaccgaaa tgcaccagaa gaagctcaat 5460 ggattttgtg ttcaacgaaa catcgagtgg tcctttattc cccctcgcag tcctcacttt 5520 ggaggaatat gggaggccgg agtgaagtcg gccaaatttc acctgcgacc cattttagca 5580 gatcacaaac tctcgtacga agagctgacg acggtattgg cgcagataga agctactttg 5640 aattcgcgcc ctttggttcc gtcttccgac gatccaaatg acatgacagc tatcacgcca 5700 gcccattttt tgataggcag agaatttcaa gcggtactcg aaccatcata tgcgcacatc 5760 aaaaccggaa ggctgtctag atggcaagta ctacaggacc tgaagcaaaa atattggcga 5820 acctggtcaa cagattatct acaagaactt cagcgacgtc aacgggactt caaaacaact 5880 aaattcaagg taggcgcatt agttctcatt gttgatgaca atctgccccc acttcagtgg 5940 agccttgcgc gtattacgga gttgcatccg gggaaagatg gccacgtgcg tgtggttact 6000 ttgcagacca aggactctgt catcaaacgg gcggttaaat atgtgtgctt attaccgctt 6060 gacgaagagg agtcagctga aatttgaaat gcagccattt caacggccgg gaaga 6115 // ID I-53_AAe repbase; DNA; INV; 6679 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-53_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6679 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1324-1324 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 788..2125 FT /product="I-53_AAe_1p" FT /translation="MTSPTGGDPGGSLGPRLPLFMDPQNEFGRLTILQMTG FT IDGVNLPVDPYLIGKSIELVVGDSAIESASSEQTKRYTIRVRNPVHVEKLI FT KMTKLIDGTPISITPHPTLNISKCVISSYDTICYTEEELLENLGPQNVMKV FT QRITRMENGNRVNTPAIILTFNQTTYPGHVKVGLLRIATRPFYPNPLLCYN FT CFNYGHSKVRCSAAKRCHNCSAEFHDENCQESPVCCNCKGDHRSTSRTCPI FT YQKEXAVVKLKVDDNLTYHEARRRVAEGNLTYAQAAAQTRLDTSKIEALME FT ENKKKDEIINKLIADNKKKDDTILKLLEDVNQKNQQLESLVHKVDSLQSSL FT AALSSTIQSAPETSPQSEKCPQRQVKVITITDKAKGITKRRLQVEDNLLRR FT SKRLQNASPESRSPLAKAAKPATSDTDLDPIIEYDEEVVDNSDDETLSVHP FT N" FT CDS 2070..6542 FT /product="I-53_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MMKKWWITLMTRPYLYTPINHLTMSVISNKSCKFTRP FT INDPTTDNRNNPNTSIINNSGRFGSGESEVPLSLPQAGAQLSGGSVRDVIP FT QPVDGESLCAAYEVDGKVNERNPYSTSTTPSTNHKASTSPDINNLNASMGL FT DNPRPFPTPTLTGDVPQPCDEPPAAVGVGGLLSQASTSVNLAKSPHTYPLN FT ISAAPCLQGYNRKAAPPATKMPPPPSVLSDAEAYDRLYHRQQATSSSRQRT FT TIPTAILQDDNGTQPSTPPNLASTQRSASPCSSENRSLTNHCKKYRIALQW FT NMNGLFNNMSDLQLLTRDNPPQTLALQEVHCRDSKGLDRILSGQYQWYVKS FT GSTRFQNVGIAVLRALPHTQVQLNTSLTALAVKIDLPFQHTIVSLYLPQNG FT IGNLENALQNLIGQLEKPYIILGDLNSHHYAWGSTKTDKRGTAILNVAEEN FT DLIVLNDGSPTFLRGSCTSCIDVSLVSSEIISLLKWNIHADPMGSDHYPIE FT IQSNDSPPETTRRPRWRLEEANWDGFEQDIISSINPEYIYSPESLCELIYD FT AAKANIPRTSSKPGPKALHWWNDDTQAAVKARRKALRKFKRTPKTHPDWKA FT VHENYQRLRNQCRDVIRRAKCESWEAFLDGFDSSQSTTEMWRRVNALSGKR FT RAQGIGILQNGVVSRDPIFVSNALGEYFSKLSSHEAYSRSFITAQTKAHCT FT STSISVQDDVDDCDFNKPFTISELLYALDSARGKSAGPDDVSYPLFRHLPL FT PVKCSLLESINQMWSNGTFPKKWQHSLVVPIPKKKIGSTDPRDYRPIALTN FT CASKIMERMVNRRLTTVLQERNLLDQRQHAFLKGRGTGSYLAALGQTIDDA FT LTNDLHVDIAALDLSKAYNRVWRPGVLRQLINWGFIGNMGKFIQGFLQDRT FT FQVMIGNTRSRTFQEETGVPQGSVLAVTLFLVSMNSVFDNLPKGIFIYIYA FT DDIIIVVVGKKPKLIRRKLQAAVRAVAKWAENIGFSMAAEKCSISHCCTLR FT HHPWNNPVTVAGMPIPFKKELRILGVTVDRKCNFRSHFNLVKKETECRIRL FT IKAISGRHKTNNRKSLQTVGRSIVVSKLLYGLEIIIRAYREMIVAFSPVYN FT KVIRITSGLLPSSPSLATCVEAGVLPFPFVLTIAAASRTVSYLEKTYGDCT FT NIFILEKAKNLVREYAAMDFPPVAQVYRVGDRRWNKSGPKIDWTIKHAVRA FT GDSSTKISAELNSLLSNKYSMHVKIFTDGSRAKETVGVGVYSSTFQIARRV FT PNEFTVFSAEAAAIYLAIRNIEDPNAQTIILSDSASVLSSLENPRQKHPLI FT QLIEARIPANTTLCWIPGHCGIKGNEEADRLACQGRRSVMWNLQTPGHDIR FT SAITRGAHRTWYLQWLSNRDIFLRKIKTDIGQWKDRKDRKEQKIISRLRVG FT HTRITHSHYISANPKPTCTICSTPLTVEHILVNCPQYEATRVNLNLHTSIR FT NILQNDPVEEIKLINFLKDSNLYSSI" XX SQ Sequence 6679 BP; 2090 A; 1625 C; 1382 G; 1581 T; 1 other; cagtgtatat caatagtcta ggcagagaag acgtcgcgtt tttctactct ggctattttt 60 catagtaaag cgtcccaagt cgcgcaaaaa tcgtcatttc ttcgccgtgg atgtaattta 120 ttgtattgca aatatagtga ttgttggttt tatcaccaaa gtggtacagt ttggaccttt 180 ctggccgcta gaatcggttc cacttggact gcatcaagtg tacgatcggc gaccacaata 240 accgcgtggt gaagctacca gtagtagtgc tatttagaac catccaagca gacaatccca 300 aaaagaacat tgtacactgg tactagtgaa gatcgataga tcaacagtgt gtgttgttcc 360 cgttctctca gtgagtgacc agcttgcaca gtagcacgtg cttatgaaga ttagtgatgc 420 aaacatgtta aagcacgaaa cgatggactg tcgaatcaag cattggaatt cgcaaaactt 480 gcgccatggc gcttccttta gtgcgtccca agaggcgcaa cattatttct tggtactgtg 540 caccataggt gcaagaaacc aagcagacag tcccaaaaga acattgtcac tgtaccatga 600 atagtgtttg ttcctcctgg agcgctgccc tagtaacgaa aagagagatc aaaacaaata 660 agatccaact cgtgatcatc tatccattcg ccttggggta ggtagcaggt gactgctatc 720 atcgccgttt gtcggcctat tgcgaatgga agatcatgcc aatagtgagt aggtaaaaac 780 acccactatg acctccccca ctgggggcga tccgggaggg tcgttgggcc caagattgcc 840 tcttttcatg gacccacaga atgaatttgg caggttgaca atactgcaaa tgaccggaat 900 tgatggtgtg aatttgcctg tggacccgta tcttatcggg aaaagtatcg aattggtggt 960 tggggattct gctatcgaat ctgcatcaag tgagcaaaca aaacggtaca ctattcgggt 1020 tcgcaaccct gtgcatgttg aaaagcttat caaaatgacc aagctcatag atggtacgcc 1080 aatttctatc actccgcatc cgaccctcaa catcagtaaa tgtgtaattt cctcgtatga 1140 cactatctgc tacacagagg aagagctgct ggaaaatcta gggccacaaa acgtgatgaa 1200 ggtacaaagg attaccagaa tggaaaatgg caatcgtgtt aacaccccag caataatatt 1260 aacattcaac caaacaacct atccgggaca cgtaaaagtt gggctgctac gaatcgctac 1320 ccgccccttt taccccaacc cgcttttatg ctacaattgc ttcaattatg gtcattcgaa 1380 agtacgttgc tctgctgcta aacgctgtca caactgttcc gctgaattcc acgatgagaa 1440 ttgtcaagaa tctcccgtct gctgcaactg caaaggagac caccgctcca caagtcgaac 1500 gtgtccaatc taccagaaag aagkcgccgt agtcaagctc aaggtggacg acaacttaac 1560 gtaccatgaa gcaagaagaa gagttgcaga gggtaatttg acctatgcac aagccgctgc 1620 ccaaacccga ttggatacct ccaaaataga agctttgatg gaagaaaaca agaaaaaaga 1680 tgaaatcatc aacaagctta tcgcggataa caaaaagaag gacgatacca ttctcaagct 1740 cctcgaggac gttaatcaga agaaccaaca attggaaagc ttggtgcaca aagttgactc 1800 tctgcaatcc agcctagctg ctctgtccag caccattcag tcagcgcctg aaacatcacc 1860 tcaatcagag aagtgccctc aacgtcaagt aaaagtcatt accataactg acaaagcgaa 1920 aggaatcacg aagcgcagac tgcaggtaga agataacctt ttgcgtcgat caaagcgtct 1980 tcagaacgcc tcaccagaaa gcagaagtcc acttgcaaaa gctgcaaagc cagctacatc 2040 ggacaccgac ttggacccaa taatcgaata tgatgaagaa gtggtggata actctgatga 2100 cgagacccta tctgtacacc ccaattaacc atctaacaat gtcagtaatt tccaacaaat 2160 catgtaaatt tacacgaccc atcaatgacc ccacaaccga taacagaaac aaccccaata 2220 catcgatcat caacaactcc ggtaggtttg ggagcgggga aagtgaagta cccctctcac 2280 tcccacaagc gggagcgcag ctgagtggag gatctgtccg ggacgttata ccccaacccg 2340 tagacggaga atccctctgt gcggcctatg aagtcgatgg aaaagtaaat gaaagaaacc 2400 catacagcac ctctacgact ccatcaacga accacaaggc atctacaagc ccagatatca 2460 acaacctcaa cgcttccatg ggtttggata accctaggcc tttccctaca ccaaccctga 2520 ccggggatgt accacaaccc tgtgatgagc ccccggcggc agttggtgtg gggggcctgt 2580 tatctcaggc aagtacctcc gtaaatttag caaaatcgcc acacacatac ccgctcaaca 2640 tttcagctgc accttgcctg caaggataca atcggaaagc agccccgccc gctactaaaa 2700 tgccacctcc accatcggtt ttatccgacg ccgaagccta cgatcgattg taccatcgcc 2760 aacaagcaac ctcaagtagt agacaaagaa caacgatacc aacagccatt ctacaagatg 2820 acaatggaac tcaaccatct accccgccaa acttagccag tactcaacgt agcgcatctc 2880 catgctcttc ggagaatcgc tcccttacga atcattgtaa aaaataccgg attgctttac 2940 aatggaacat gaacggtctg ttcaacaata tgagcgacct tcaactgcta acgcgcgata 3000 atcctcctca gactcttgca ctccaagaag tccactgtcg tgattcaaag ggtctcgacc 3060 ggattctttc tggtcaatac caatggtatg tcaaatcggg ttcaacacga tttcaaaacg 3120 tcggaatcgc agttcttaga gcccttcccc atactcaagt acaattgaac acatccctga 3180 cagcccttgc tgttaagatc gaccttccat tccagcacac cattgtatcc ttatacctcc 3240 ctcagaatgg aatcggtaac ttagaaaatg ccctacaaaa tctgattgga caattagaaa 3300 agccgtacat aatactggga gacctaaaca gtcatcacta cgcgtggggt tcgacaaaaa 3360 ccgacaaaag aggtaccgcc atccttaatg tcgcggagga gaatgatctc attgtactca 3420 atgatggaag tccgaccttc ttgcgtggct catgcacatc ttgcatagat gtgtcattag 3480 tatccagcga aataatcagt cttcttaaat ggaatatcca cgctgatcca atgggcagcg 3540 accactatcc gattgaaatc caaagtaatg attctccacc ggaaacgact agaagaccca 3600 gatggcgtct tgaagaggcc aattgggacg gtttcgaaca agacattata tcttcaatca 3660 atccggagta catatattct cctgaaagcc tgtgtgaact catttacgac gctgctaaag 3720 ccaatatacc aagaacgagc agtaagccag gcccaaaagc cttacattgg tggaacgatg 3780 acacacaagc ggcggtgaaa gctcgtagga aggcgctcag aaaatttaaa cgtaccccaa 3840 aaactcaccc ggactggaaa gcagtccacg aaaattatca acggctaaga aatcagtgca 3900 gagacgtcat tcgtagagct aaatgcgagt cttgggaagc atttctcgac ggtttcgaca 3960 gcagccaatc taccaccgaa atgtggcgac gagttaacgc attaagcgga aaaagaagag 4020 cacaaggtat tggtatcctt caaaacggag tcgtctcacg agatccaata ttcgtttcca 4080 atgctttggg agaatacttc tcaaaacttt cttcacatga agcatacagt cgtagtttta 4140 taactgcaca aacaaaagca cactgtacat ccaccagtat atcagtacaa gatgatgtag 4200 acgattgtga cttcaacaaa cccttcacaa tctcagaact actttacgca ctggattctg 4260 ctagaggaaa atctgcaggc ccagacgacg tcagctatcc tctcttcaga catcttccac 4320 taccagttaa gtgttctctc ctggaaagta ttaaccagat gtggtccaac ggtactttcc 4380 caaaaaaatg gcagcacagt ttagtcgtcc cgatacccaa gaaaaaaatt ggatcgactg 4440 accctcgtga ttatcgcccc atagcactta caaactgtgc ctcgaaaatt atggagcgca 4500 tggtgaacag gcgcctcact actgttctcc aagaacgtaa ccttttagat caacggcaac 4560 atgctttctt aaagggtaga ggaaccggat cttatctcgc agcactgggc caaacaatcg 4620 atgatgctct gacaaacgat ttacacgttg acatagcggc cttagatcta tcgaaggcct 4680 ataacagagt ctggcgaccc ggagttctac ggcagcttat caactggggt tttattggta 4740 atatggggaa atttatccaa ggttttttgc aagatagaac attccaagtg atgataggga 4800 acacacgttc caggaccttc caagaagaga ccggagtacc gcaaggctct gtgttagcgg 4860 taaccttgtt tctggtatca atgaactccg tattcgacaa tctaccaaag ggcattttca 4920 tatacatcta tgctgatgat attattattg ttgtggttgg caagaaacca aagttaattc 4980 gacgaaagct acaggcggct gtaagagctg ttgctaaatg ggcggaaaat attggtttca 5040 gcatggcagc tgaaaaatgt tcgatctcac actgttgtac gttaagacat cacccatgga 5100 ataacccggt tacggtcgca ggaatgccaa tccctttcaa aaaagaactt cgaatcctcg 5160 gagtgacagt tgaccgcaaa tgtaacttcc gaagtcactt caatctcgtc aagaaggaaa 5220 ctgaatgcag aattcgatta attaaagcga ttagtggaag acataaaacc aataaccgta 5280 aatcattgca gactgtcggc cgcagtattg tggtcagtaa actactttac ggattagaaa 5340 tcataatacg tgcctacaga gaaatgatag tagcttttag tcctgtctac aacaaggtaa 5400 tacgaatcac ctcaggatta cttcccagtt ctcctagcct tgcaacatgt gtagaagccg 5460 gagtactccc gtttcctttc gtactaacga tcgcagcagc atccagaacc gtaagttatt 5520 tggagaaaac atatggtgac tgtaccaaca ttttcatcct agaaaaagcc aaaaaccttg 5580 tcagggagta tgcagccatg gatttccctc cggtagcaca agtctatcgg gtcggagatc 5640 gccgatggaa caaatcagga cccaagatcg attggacaat taagcacgca gttcgcgcag 5700 gggattcttc aaccaaaata tctgctgaac tcaatagcct gctcagcaac aagtatagca 5760 tgcatgtgaa aatattcacg gacggctctc gagcaaagga aaccgtagga gttggtgtct 5820 atagttcaac gtttcaaatc gcacggcgag ttcccaacga attcaccgta ttttctgcag 5880 aggctgctgc tatatacctc gctatcagaa acattgaaga ccccaatgca cagacgataa 5940 ttttatcgga ctcagccagc gtcttgtcgt ctttggagaa ccctagacag aagcatccac 6000 tgatacagtt aatcgaagca cggattcccg caaatactac gctctgttgg ataccagggc 6060 attgtggcat aaaaggcaac gaagaagcag atcggctagc atgccaaggg agacgttcag 6120 taatgtggaa ccttcaaaca cctgggcatg atatacgtag cgcgattaca cgtggtgctc 6180 atcgtacgtg gtatctacag tggctttcaa accgtgacat tttcctcagg aagatcaaaa 6240 ccgacattgg acagtggaaa gaccgcaagg atcgcaaaga acaaaaaata atatctcgcc 6300 tgagagtggg gcacaccaga ataacccatt cgcattacat ttctgcaaat ccaaaaccaa 6360 cctgtaccat ctgttcaaca cctttaaccg tggaacatat cttagtcaac tgcccccaat 6420 atgaagcaac aagagtcaat ttgaaccttc atacaagtat acgcaatatc ctccagaacg 6480 acccagttga agaaattaaa ctcataaact ttctgaaaga tagtaactta tattctagca 6540 tttaattaca atatctttat aataacctta aattgttata tactccttgt atttagtctg 6600 tttttttttt ctatttcaaa cgaggtgaac cagccttagg cactgaaaac ctctataata 6660 aagcaataaa aaaaaaaaa 6679 // ID BEL-237_AA-LTR repbase; DNA; INV; 188 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-237_AA_; KW BEL-237_AA-I; BEL-237_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-188 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 930-930 (2011). XX DR [1] (Consensus) XX SQ Sequence 188 BP; 62 A; 41 C; 28 G; 57 T; 0 other; tgatcgctct acctgtgcca aatcgctaag cccctcgcct aacgataaat aagagaacaa 60 aattgatgtt ttacttcagt gtattttttc tagtattaat aaagatgcag ttcagatcga 120 acgcaattta tttcgattcg acgcaaatca cccgaaaacc aattctgact gagctatttc 180 tataaaca 188 // ID piggyBac-20_SM repbase; DNA; INV; 1945 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-20_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-1945 RA Jurka J.; RT "Families of autonomous piggyBac elements from planaria."; RL Repbase Reports 9(8), 1830-1830 (2009). XX DR [1] (Consensus) XX CC The closest sequence in Repbase is human LOOPER. ORF corrupted. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 1945 BP; 720 A; 268 C; 353 G; 603 T; 1 other; cagcatgttt taccatctgg gtcattttga ccctgacatt tcttttctca aaattgtaat 60 tgctatcttg ttttatttag gtttttttat tgtttttctt tctgccattt ttaaattaca 120 atttcgtaat tagcatttga ttttatttca ttaaatttaa ttctaaaaaa tgaaacgaag 180 ttcaaacaaa ctttattcat tgccttatgt cttgaatggg atgaatcacc cgacgatgaa 240 tgtgaagagt caaataattc atccttgtca gattatgatg aagaagataa gtcagtttca 300 gatgcaaaac caagtattgt cctcgggaat ttgtttatga aatccagaga taataaagag 360 acatggtcaa ctgtgcctat aaaaaagaaa actggaaggc aacagcagca taatatcgta 420 aaaattccta caggattgac caattttacc caaagaaact gtgatagttt gacatcagtt 480 ttcaaacttt attttcgttc aaaattgata gatgaaatat gtgggtggac aaataaagaa 540 ggagaaaatg ttattccaga ctggaaagta attgataggc atgaatttca aaaatggatt 600 ggtataacaa tattgatagg agtttataag tctaaaaatg agtcagtttc taacctttgg 660 aatatggaaa atggaagacc gatgttcaat aaaataatga gtagaaacag atttttacaa 720 attttacgag ttcttaggtt tgatgatgct gttgctagaa gaaagaagcg atccaaagat 780 aaactacagc ccattagatc tgtttttgaa aactggaaca gtacactaaa agatgcatat 840 aatcttggaa gcaacgttac cattgatgaa cagctggtca cctttcgagg gaggtgtcca 900 ttcaaacaat atatcccatc caagccaggt aaatatggag ttaaaatttg ggcattatgt 960 gatagtgtta atcattatgc gcataacatg caaatttata ctggaaaaga tgaaggtcaa 1020 cctagagaaa aaaacaagga gaaagagttg taacagatct ggttgaaaat attgtgggct 1080 ctggtagaaa tataacgtgt gacaattttt ttacaagctt atcagtggca agaaaattat 1140 tagaaaaaaa attaacagtt gttggaacta tcagaaaaag tcgaatggag ttaccaaaag 1200 aaataatggt tgcaaaaaca agggaagttt attctacaaa gtttttattt caagatgatg 1260 ctcttctcgt ctcttattgt ccgaaaaaat ataaattagt tactcttttg agcacaatgc 1320 ataatcaacc agaaattcac agcagcgaac ctaaaaagaa accagaagtt atcaaatact 1380 ntaatgcaac taaagctgga gttgacagta tggaccaaat ggtgcgctgt tatagtgtaa 1440 aaagatctac caggcgttgg cccatggtaa tatttttcaa catgattgat atcactgctc 1500 ttaatgcttt ttattatatg gaatataatg aatccacttg aaggaaaaaa gaataaacga 1560 aggtcatttc ttattgattt gggaaaagaa atggcagatg ttgatttgac atttacacaa 1620 tcatctaatt tggtttcagt aaacccaact ccaaagaaaa gaagtcgttg taactattgt 1680 gacaacaaaa aggatcgtaa aacaaacatt ttgtgttcca aatgtaacaa atttgtttgc 1740 ggagaacatt atcaagcagt ttgtgtaaaa tgtatataaa tgtggaagac caaattgatt 1800 ttactatttt ttgttttaat gttaaaatac aaaaacctat gaaaacaaac tcatgtcttg 1860 cggtgtttga acaatataaa taagaaaaaa catgaaatgt ataatcaggg tcaatttgac 1920 ccggttggta aaacatgcta tgtga 1945 // ID P-21_HM repbase; DNA; INV; 3216 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-21_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3216 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 367-367 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(152..1450,1446..1940,1859..2650) FT /product="P-21_HM_1p" FT /translation="MPKKCCVTNCNGNYNKQNKEKVFRLPSDKEEYNRWIN FT VIPRDNIPESPDTVVCERHFPLGYPTVIKFGHKRPRDPPSVFTCVKPSLIP FT SVPSPLRTTFKASSSVRNIIPDELDMFIAADQITCFKSLHQKLQNKEINFE FT LVDVTWYFAGEHLLIIQSTDFEENTSIARFMLKIKNDFSFEGYFCGIKYTI FT ESLSKNRIFILNSSSKLQEAINFLNNLTKSIKLINLLQHISDMSPITIGKK FT KYSTETIVRSFEYFAISRSLYGRLREDYQLPSISLLTKLTSKLDSAMNDTE FT FFKSVYSKLENEMQKNCIILIDEVYVKSSLTYHAGSVFGKAVNKPDSLATT FT VLSFMLVSLYGGPQYLCRILPVCELDSKFLFEQTQFVLEGIKSAGGIPVAL FT ICDNNRVNQALFKMFDCVEPWLTKDNMFLLYDFVHLIKSSLKSVRNNWLTE FT KSQKLCYEXEGKTKYACWSNILNLFRLEENNIVKLSKLTNVSVAPKPIERQ FT KVSTCLQVFCDETVTALKSNSNIKEAEGTITLLEIFVKFWKIVNVKGLYAD FT IRYKDPDRAVISTSEDARLTFLLNLATMAEQMTSKQGKRQFQLTRDTGTAL FT SCNNGRTNDQQTRQKTISANQRYRYSTLAHTCRGLVAIXKFLLQHSFKYVI FT LGKFTTDPLEKMFGKLRQGSGGTYFINVQQVIEKVKIQRAKLSLQIGIDID FT SLSAESGHNCQKCGYLLSSNAIDIFDNLPELENSVQIDTKXGLVYIAGYVV FT RNDVEEQEDTKFYYEEFGDYTEEINRGGLKIPNDNVCQWTIFSYILFHEIV FT NKVCRKSLCNMLMLISESYNFNMERKHAVSLSNIYFKNYXSLYTPRSYKEP FT KQKILKLSK" XX SQ Sequence 3216 BP; 1123 A; 449 C; 500 G; 1134 T; 10 other; catggccctc tgtattaaca ggcctcgtaa acgcgtaaat cggagcctgg agctgggggc 60 cattgaactt caaaactatc ttcgataact gctagcgaca acaaagtttt tttaatatag 120 tacaattgtt atctagtttt cagacaaaaa aatgcctaaa aaatgttgcg tgacaaactg 180 taatggtaat tacaacaaac aaaataaaga aaaagtttty cggcttcctt ctgataaaga 240 agaatataat cgttggataa atgttatwcc acgagataat atacctgaaa gtccagatac 300 agttgtgtgc gaaagacact ttccattggg atatcctaca gttataaagt ttggtcataa 360 aaggccacgt gatcctccat cagtatttac atgtgtaaaa ccaagtttga taccaagtgt 420 tccatcgcca ttaagaacaa cctttaaagc ttcttcatct gttcgtaata ttatacccga 480 tgaacttgat atgtttatag ctgctgatca gattacatgt tttaaatcat tacatcagaa 540 gttacaaaat aaagaaatta attttgaatt agttgatgta acttggtatt ttgctggtga 600 acatttgcta ataattcaat caacagattt tgaagaaaac acttccattg ccagatttat 660 gctmaaaata aaaaatgatt tttcattcga rggatatttt tgtggaatta agtatacaat 720 tgaatctcta tctaaaaaca ggattttcat attaaattct tcctcgaaat tacaagaagc 780 tataaatttt ctaaataacc taacaaaaag cattaaactt attaatcttt tacaacatat 840 cagtgacatg agcccaatta ctattggaaa aaaaaagtat agtacagaaa ctattgttag 900 atcatttgaa tattttgcaa tttcgagatc gttatatgga agattaagag aagattatca 960 acttccaagt atttccttat taacaaagct aacttctaaa ctggattctg caatgaacga 1020 cacagagttt tttaaatcag tttattctaa gcttgaaaat gaaatgcaaa aaaattgtat 1080 tattttaatt gacgaagttt atgtcaaatc atctcttaca tatcatgctg gctcagtctt 1140 tggtaaagcg gtaaataaac cagattcttt agcaactact gttttaagtt ttatgttagt 1200 atcactttac ggtggaccac aatatctttg cagaatttta ccagtctgtg agcttgattc 1260 taaatttctg tttgaacaaa ctcaatttgt tttagaggga ataaagagtg caggaggaat 1320 accggtagcc ttaatatgcg ataataaccg tgtaaatcaa gcactattta aaatgtttga 1380 ttgtgttgag ccatggttaa ctaaagataa tatgttcttg ttatatgact ttgtccatct 1440 aataaagtct taaatctgtt cgtaataact ggcttacaga aaaaagtcaa aagctatgtt 1500 atgaayatga agggaaaact aaatatgcat gctggtctaa tatcttaaat ttgtttaggc 1560 ttgaagaaaa taatatagta aaattgtcta aacttacaaa tgtgtctgtt gcaccaaaac 1620 ctattgaaag acaaaaggtg tctacttgtt tacaagtctt ttgtgatgaa actgtaactg 1680 ctttaaaatc caacagtaac ataaaagaag cagaaggcac cattacttta cttgaaatat 1740 ttgtgaaatt ttggaaaatt gttaatgtaa aaggactgta tgctgatatt cgatataaag 1800 atccagatag agcagttata tcaacttcag aagatgctag actaacattt ttacttaatc 1860 ttgcaacaat ggcagaacaa atgaccagca aacaaggcaa aagacaattt cagctaacca 1920 gagatacagg tacagcactt tagctcatac atgtcgtggt ttagttgcca ttrttaaatt 1980 tttgctccaa cattcgttta agtatgttat acttggaaaa tttacaactg acccactaga 2040 aaaaatgttt ggaaaacttc gacaaggttc tggaggaaca tattttataa atgtgcaaca 2100 agtkattgaa aaagtaaaaa ttcaaagagc caaattaagt ctccaaattg gaatagacat 2160 tgattcttta tctgcagaaa gtggacataa ctgtcaaaaa tgtggatatt tacttagcag 2220 taatgctatt gatatatttg ataatttgcc tgaacttgaa aatagtgttc aaatagatac 2280 taaaryagga ttggtatata tagctggata tgttgtgcga aatgatgttg aagaacaaga 2340 agatacaaaa ttttattatg aagaatttgg tgattatact gaagaaatta atcgtggagg 2400 tttgaaaata ccaaatgaca atgtttgtca atggacaatt ttttcttata ttttgtttca 2460 tgaaattgtt aataaagtct gccgaaagtc actctgcaat atgttaatgc taatatcaga 2520 gtcttataac tttaatatgg aaagaaaaca tgctgtttca ctttcaaata tatattttaa 2580 aaactatgkt agtttatata caccaaggtc atataaagag ccaaaacaaa agattttaaa 2640 actttcaaag taatatattg cttttttaat tatgacgact ttaattttac gtatatattt 2700 ctttaaattc tatttttttt gttattttta tacacttcaa tattatattg tatagcttgg 2760 aaatattttc aaaatgtcat tacaattttg ttgttttaac taaaaatgtt taatttctag 2820 ttttacattt taaaatatag tttttgtgtg gcttcattgt tatgtttctt atatttccta 2880 tgtatttgtt ttaattatga tgacttttat gattattact atttcctgtg tatttgttta 2940 attaatgatt gagtattttg taaaatctct ctagcttggt aatttttttc tagatttagc 3000 cttattacaa ataaaccttt atggctagga aatgtttttt ttttaatctt tgttattata 3060 taaacaggtt ttattaaaat aaaaacattt aaacgttttt tacgaccaaa agtttatgtt 3120 atccttaact tttctttcgt tgttgttgtt ttaatggccc ccagccccag gctccgattt 3180 acgcgtttac gaggcctgtt agtatagacg gccatg 3216 // ID Copia-28_DPu-LTR repbase; DNA; INV; 313 BP. XX AC scaffold_92; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-28_DP_; KW Copia-28_DPu-I; Copia-28_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_92; Positions 255380 255068. XX SQ Sequence 313 BP; 89 A; 78 C; 48 G; 98 T; 0 other; tgttgaagtt caaacctagt cgtgcctcag aaatcaccgc taccaactaa gttgccgaac 60 cccagcgaag gcacgattcc taaaaggttg ctgtttaacc ctaatcccct tgtctgtcag 120 ctgaaagcct cgtttcaagt tccaagactc tcttactttc ctgacttgtt caagagaaga 180 ctatctctgt ttcgttaact aaattagtgc attattccac acacttgata aaaggtaact 240 agtccttcca tccatgtgtt aaatacaatg catttatttc tgaccacaat ttgacagtca 300 ttttacattg aca 313 // ID Gypsy-262_AA-LTR repbase; DNA; INV; 205 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-262_AA_; KW Gypsy-262_AA-I; Gypsy-262_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 205 BP; 50 A; 44 C; 41 G; 69 T; 1 other; tgtggggcga atgggttatc cccctacccc gaaataaaga aaaacttttt actgtgtttt 60 tgctacttcc gttatgatca gttcktagtt tgacgagatc gctcgcgccg tgcgtcattt 120 tttatcgctg tgcaaagtta tcaaataaat cgatttgaag agatttcggg cgttatttca 180 cttactccga ccaagacttt ctaca 205 // ID Copia-7_AA-LTR repbase; DNA; INV; 127 BP. XX AC AAGE02017938; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_AA_; KW Copia-7_AA-I; Copia-7_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-127 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017938; Positions 5850 5724. XX SQ Sequence 127 BP; 43 A; 29 C; 20 G; 35 T; 0 other; tgttggatgg caaccgaagg actacccata gcaacctaga ataacatatt gtagtaggct 60 tagacattag agcaataaaa ttttcttcac ttccactatt actcccaaca taaccagacg 120 tgtttca 127 // ID BEL1-LTR_Dmoj repbase; DNA; INV; 305 BP. XX AC scaffold_6489; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL7_Dmoj; KW BEL1-I_Dmoj; BEL1-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1006-1006 (2009). XX DR Genome; scaffold_6489; Positions 46344 46040. XX SQ Sequence 305 BP; 79 A; 67 C; 46 G; 113 T; 0 other; tgttgggcca aattcactta tcttattgtt aacttgaatt ttatacctat atttttacat 60 ttgggttcac cctaattact ctatctctct tgcgtggtga tttcttatta cttgcattaa 120 ttcgcctagc tctaagcaag ctgcgcttgc tcatacgttt gtacttttag tgtatatcaa 180 gaactctacg aataaacgtt gcacctacga tgcgctgaat ttcatatacg gctcatttgt 240 ttagttagaa cagcatcccc atactctgaa ctcatgctag acagaaaaac gcttctgatc 300 ttaca 305 // ID Chap1b_Cis repbase; DNA; INV; 727 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; Chap1b_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-727 RA Smit A.F.; RT "Chap1b_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000014,Ci000109. Typical NTCTAGAN target site duplications. CC Copies are 5% diverged from consensus. Non-autonomous; small CC match to transposase; closest to Chaplin4_Fur. XX SQ Sequence 727 BP; 223 A; 145 C; 147 G; 204 T; 8 other; caggggtggg caacatacgg cccgcgggcc gggtccggcc cgtgaaggga tttcgaccgg 60 cccccagact gtatctaaca ataacctgta atgtggccca ggtccggccc gctgtctgta 120 tccaaccaca cactaatgtg gtccgacgca tcattcctaa aagttgncta tcgctctact 180 cttaccgcac tgttantgcg agctggtgaa ataaacaatt gccttagtga acaaacaatt 240 gtgttagcac actttttaaa caaacaatat aaggtcaaag cagaaagcgt tgtaaacatc 300 acncactact gggtactgtg taacaacgtg actgttaggg ttagtttttc tttagctgtt 360 atcagttata tttaaattct aaatataaat ggattttatg aattattaag gattattaac 420 catggaatat cctangaaac gtaaattnga aagagagtgt cgagcattta acaaagattg 480 gataccgaag tattttttac aaaggttggc aacaaagcag tatgtttact gtatggtcag 540 agtgttgcgg tattaaaata gtacaacatt agtcgtcant atttgacgaa acatggaaac 600 tatggcaata atttagtgaa taaatatgtg gttaagttan tgaatattat tcatctaata 660 atccggccca ccaaggactt cattttccca catctggccc gccnactaga aaagttgccc 720 acccctg 727 // ID R1B_SS repbase; DNA; INV; 1484 BP. XX AC AF015821; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Scolopendra sp. retrotransposon R1 reverse transcriptase gene, DE partial sequence. XX KW R1B_SS. XX OS Scolopendra sp. WDB-1997 OC Eukaryota; Metazoa; Arthropoda; Myriapoda; Chilopoda; OC Pleurostigmophora; Scolopendromorpha; Scolopendridae; OC Scolopendra. XX RN [1] RP 1-1484 RA Burke D.W., Malik S.H. and Eickbush H.T.; RT "R1 and R2 Provide an Estimate of the Age and Stability of RT Retrotransposons."; RL Unpublished. XX RN [2] RP 1-1484 RA Burke D.W. and Eickbush H.T.; RT "R1B_SS."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015821; Positions 1 1484. XX SQ Sequence 1484 BP; 383 A; 330 C; 455 G; 316 T; 0 other; gcgtttgccg atgacacggc tattatcatc gaggagactt ccattaagaa gatcgaagac 60 agctggcaag aggcggaaag gagagtgcag cggtgggcaa caaggaataa aatcgttttt 120 aatgaaaaca agacggaggc cattttcttc ccagccaaac atgccaaaaa ggtaccaatt 180 atcagatttg gtctgaacct ggtaggtata aaagagtcac ttcgttactt aggtgtgtat 240 ctcgataaga gtcttttgtg gagggtacac ttacaggtcg tacagggcaa agttgggcga 300 atcgcactga aaatgcgtca ggttgtgggc acacattggg gcattgacca aatgaccatg 360 cgagtgctgt ataaacgagc tctgcttcct ctggtcatct acggctgtga gatatgggga 420 gagtgcgcac gcaagaagtg gggagtgaaa cgacttgaac aaatccagcg acccctcctt 480 ttggggggtg gtaaaagcgt gtcgaactac ctccacacac gccttacaag ttttggcggg 540 gctcccccca ctcagtgttg agataattgc gcgaaggcaa cattatgccc gcttgggggc 600 tgacgactat gaaagccata tcccggtgtg cgactgggtg catccagcac acatagagga 660 agaagcagtt gggacatacc cacttgaaat tgaggcgagg acagccctct ttacagatgg 720 gtcaaagact gaggacggtg taggttatgc ggtggtggtg atggaagggg gagaggaggt 780 gcgaactcat acgggttgcc tgccgagcta ctgctctatc gcccaagcag agatgagggc 840 tatttattgg ggctcttcag ttttggagtg attgtggaaa tgggccggcc tgtctgtgct 900 ctgattcaag atcgtctctt caacatatcg ctaacggtcg ccagggagat gcaacagtca 960 tggcagcgga cgcccttcaa cgctgccgag agcaacagag ggcggtctcc ctctcctgga 1020 ttaaggcaca cagcgggcag gccggcaacg aacgggcgga cgagttggca agacaggctg 1080 cgaagaacga cctgtcgaca ggtgaagaac atgttttgag accgaaggcc tactcgaggg 1140 atatcttgag aaatgaagtg cgtgcgttat ggcaacaaca gtgggcaatt tcgataaaag 1200 gacgttggac ttttgaggtt ctcccacagg tgtcgcagtc gccgccggtg cttacaagca 1260 agatggtgca gctcctaacc aatcatggga actttgcgca gtatcgaagc aggttccggc 1320 ttacagagga agatggcgtc tgtatctgca aacaagggca ggagacagcg cacgacgtcc 1380 tggtggaatg tggcctgcca cacagagccc gggcccgttc tcggatcgag caaatggctg 1440 tggtacaggg tgtccagacg tggcagccag agtgtatgga atag 1484 // ID Mariner-2_HM repbase; DNA; INV; 3601 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3601 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 219-219 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(510..1043,1435..1812,2064..3128) FT /product="Mariner-2_HM_1p" FT /translation="MHTIVLYHMYFIDKNIISNFAMGKPKFILKVRKEPFV FT VGKPNSKAEAKVQRKKINMKELQIQQAVSWCIEHKKRGYAALQTGMFSFIK FT DRGTIDRRLDGKTANIKKQHLRILTAEEEHSIVEFIKNKNRCHQGISRKQV FT TNLIIDVLKIRDYCNTTNQGGRNFGKLSDNAKHVVKTGKIYLLDSLAEELI FT KCEIFTNSKKLEPGVWKGDIDTKRIFNFDETPQFINYGVDGTNSGLAYVAR FT GEPCRKMIRENRESITICPVVSLAGNNNTNYLCKCTYAIICIYLKKKNNQS FT CFHHLRSISQSFYIYLSEENIQRPVVMLADGHSSRFDYNVLHFLHEKKINL FT FISPPDTTGGLLNCWIKVLIKTCIVNMTKKRDELFTTFQTINREGFMTILG FT EMWDKWASRDTIINAAKRVGISKEGLNVNNMQQDKFQQAANCMVQNQEQEP FT SSNLVLSTPKKICTRSSANVLLTPITPHSFPELAKTNHRYGSANYWKFMFE FT QSQMIMKESYEKSLILEDIPGLLTVNKVKPKDMNKTKNIRVTNIHGSMEAQ FT NIIEQVELIEIRKRKKKDQKEKKKKEKDKEKEQFYKCKLKCVCNGICAVKG FT FKECPVCHEIKKSVCSKISCRVDGMKPIMIKLATSTKTNAKYKGNLNDVES FT DNINWGNVLC" XX SQ Sequence 3601 BP; 1355 A; 498 C; 601 G; 1147 T; 0 other; tacaggtagg tacaccttaa aaactacaaa aaaaagtggt accctataac tgaatattgg 60 aaacttctga agtgctgaaa ctttctatac ttaaaaacta ctttaaatga atgtcttccc 120 tggccttatt ttttaatcta tagtttttag taaagttttg tgattaaaag aaaagtcgcc 180 ttttttgatt gttacccctt aaaaaccgca atcaggtttt ttgcttataa ttttagagcc 240 gttgatcagg ttccctcaaa attttgtaaa cattttgttt tcgggccata gtattggtac 300 acacattatt gtaaatctat tataaacgaa aagacataaa caaccccaaa actccttaaa 360 actacaaaat aaatccttac aaattacaac taaatactaa tatactgtaa ttaggggatt 420 actattaatt aaaatttgta gatagtttaa catatagggc actcagatat tatattttta 480 tttagtagga atattactta agtttattta tgcacacaat tgttttatac catatgtatt 540 ttatagataa aaatattata agtaattttg caatggggaa accaaaattt attttaaaag 600 tgagaaaaga accttttgtt gttggaaaac caaattcaaa ggcagaggca aaagtccaga 660 ggaaaaaaat aaatatgaaa gaacttcaga ttcaacaagc tgtgtcttgg tgcatagagc 720 acaaaaaaag aggctatgct gctttgcaaa ctggtatgtt ttcttttatc aaagatcgag 780 gaaccattga tcggagatta gatggaaaaa ctgctaatat aaaaaagcaa catctgcgaa 840 ttttgacagc agaagaagaa cactcgattg tagaattcat taaaaataaa aacaggtgcc 900 atcaaggaat atcgaggaag caagtcacta atctaattat tgatgtttta aaaattagag 960 actattgcaa caccacgaat caaggcggtc gcaattttgg taaactatct gataatgcaa 1020 aacatgttgt taagacagga aagtaagtag agtgtttagt tactaaataa attataatga 1080 aagtttcttt atgtaatata ttacactagt tatttgtttt ttaacactat tccatataat 1140 tctaacataa ctttattata aacacaagtt ttaaatttat tataggcttg gtcagtcgtt 1200 ttggcagaga tgggaagcag agcaccatag cttagttcta aaacgacaag gaaatgtacc 1260 tattaacagg gcattgaact gcacaagggg tatggctgag tcacatttag gtaagaaaat 1320 aattaataat atatttttgt actttatttg ctgatatgct ccataattaa aacgatgata 1380 tataaatgtc ttagattagc cctagggtgc agtgtactag caaaaacatt ttgaatttat 1440 cttttagatt cgttggctga agaacttatt aaatgtgaga ttttcacaaa ctcaaaaaaa 1500 cttgaacctg gtgtgtggaa gggtgatatt gatacaaaaa gaatttttaa ttttgacgaa 1560 acaccccagt tcataaatta tggggttgat ggaacaaatt ccgggctcgc gtatgttgca 1620 cgtggggaac cttgcagaaa aatgattagg gagaatcgtg aatcaataac aatatgccca 1680 gttgtttcat tagcaggtaa taataacaca aactatttat gtaaatgtac ctatgcaata 1740 atttgcattt atctgaagaa aaaaaacaac caatcatgtt ttcatcactt aagaagtatt 1800 agtcaatcgt tttagtgttt gtttttagtg aatgttttta tccccatggt tatgttaagt 1860 ataggtttaa tcataagtaa tattttttta ttttacattt caggcgaaat tgttgtttca 1920 caagttattt ttgctggtaa aggtatcaca agtcatatgg ccccaaaaac agctgtcaaa 1980 aatataaaaa atttaattat atcatctact gaaaaaggaa gccaggataa ccattcttta 2040 ttggacctac acaaaaagtt tgatatatat atttgtcaga agaaaatata caacgacctg 2100 ttgtaatgct agcagatgga cacagttcaa gatttgacta caatgttcta catttcctcc 2160 atgaaaaaaa aattaacttg tttataagtc ctcctgatac aactgggggg ttactcaact 2220 gttggatcaa agtcctaatc aaaacatgca tcgtgaatat gacaaaaaaa agagatgaac 2280 tttttacaac tttccaaaca ataaaccgag aaggcttcat gacaatttta ggagaaatgt 2340 gggataaatg ggcttcaagg gataccataa ttaatgctgc aaagagagtt ggcatatcta 2400 aagaaggttt aaatgtaaat aatatgcagc aagataaatt tcaacaggct gcaaactgta 2460 tggttcaaaa tcaagaacaa gagccttcta gcaatcttgt tctaagtaca ccaaagaaaa 2520 tatgcacacg ttcatcagca aatgttttgt taacaccaat tactccacat tcatttcccg 2580 aattagcaaa aacaaatcat cgttatggat cagctaatta ttggaaattt atgttcgagc 2640 agtcgcagat gattatgaaa gaaagttatg aaaaaagttt aattctagaa gatataccag 2700 gtttgcttac agtaaacaag gtaaagccaa aagacatgaa taaaacaaaa aatatacgtg 2760 tcacaaacat acatgggtca atggaagccc aaaacatcat tgaacaggtt gagttgattg 2820 aaattagaaa aaggaaaaaa aaagatcaaa aggagaaaaa aaagaaagaa aaagataaag 2880 aaaaagaaca attttataaa tgtaaattaa aatgtgtttg caatggcatt tgtgcagtaa 2940 agggttttaa agagtgccca gtctgccacg agataaagaa atcagtatgc agcaaaattt 3000 cttgtcgagt tgatggaatg aagccaataa tgattaaact agccacatca accaaaacaa 3060 atgctaaata taaaggcaat ttaaatgacg tggaaagcga taacatcaat tggggaaatg 3120 ttttatgttg atttgaaatg taacttattg attctggaaa gaaattttga ctttgtgtct 3180 caatgtttgt attactcttt gtacccctta aaaactacaa gcgtctctac tgctagtaaa 3240 aagcttattt tatcaatcag agaaggatat tgtgaaaact ttttttattt ttatttgtag 3300 caacaatata gttatctaaa aaaaaaggaa ttgcaatctt attttcaaat tggtgcttac 3360 ccatgccaaa aactgtgttt ttttagttta aaattaattt attttttatt tttaaattgc 3420 ttttttcgca tattattaat aagaaaatag tcaaacttgt atttttttgt agagagtatt 3480 tatatggagt ctcaaaacaa ataatattat gcgttaaagc tcatacagtg cttgacaaga 3540 acataaatta tgactttgta gtttttaagg tgtttgtgtt tttaaggtgt acctacctgt 3600 a 3601 // ID Gypsy-45_AA-LTR repbase; DNA; INV; 233 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-45_AA_; KW Gypsy-45_AA-I; Gypsy-45_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 995017 995249. XX SQ Sequence 233 BP; 77 A; 42 C; 48 G; 66 T; 0 other; tgttgtgttt agctcgagta gatcgaacta attttagagt gtctgatctg tgaacgcatt 60 ccgatgtaaa aggaagaact ggcatcattg taaactacaa tatcgaaagt tgatgaaatg 120 actaaatgag agaccgaagt ctcattcgtt aagtggatac caagctactc agatcacttt 180 tccctaacaa gagaaaatac ctctaagttt gacctcaggc tcgggattta aca 233 // ID Copia-1_AA-LTR repbase; DNA; INV; 226 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_AA_; KW Copia-1_AA-I; Copia-1_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-226 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 938-938 (2011). XX DR [2] (Consensus) XX SQ Sequence 226 BP; 69 A; 48 C; 34 G; 75 T; 0 other; tgttggcgta gacgacccta ttgccaaaaa tacgcgagca cacccctgca tatatgtgac 60 atttcttaat attactaata catcaacatt tatgtattct aagaacgata ataaacagta 120 tgatttattg cttcgccata gtaacagctc cgcactttat atatcggttt cgtgataata 180 tagctttttc gacggtttat cgttctatat tcctgaacag ccaaca 226 // ID BEL-141_AA-LTR repbase; DNA; INV; 376 BP. XX AC supercont1.294; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-141_AA_; KW BEL-141_AA-I; BEL-141_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-376 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.294; Positions 536314 536689. XX SQ Sequence 376 BP; 100 A; 79 C; 92 G; 105 T; 0 other; tggtcaagca ggacagaaag ggttaaaagt gagtgaccaa acacgttcca cactatccgt 60 accattggtt atcttgctcc atcgcattag aactcttggc ccgtttcgct tcacgagacg 120 agtaggtatg agcgtgtgat agtgtgcgaa gggtatttgt cccaatcgct ctcgatcggt 180 ggccagccag ctgtgggatt tttatgtata atcagagcag tcagaagtaa atataccgtc 240 gcggagacaa caagttgttg tccgtttaat ttacaataat aaaagtgtac tttctgttcg 300 cgcaatagag tgtacttttt cagtgaaaac cgccgcgagt ttttaatgtg tccgaaaccc 360 tgttgacgcg cgaaca 376 // ID Penelope-14_HM repbase; DNA; INV; 2132 BP. XX AC . XX DT 15-SEP-2009 (Rel. 14.09, Created) DT 15-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2132 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1938-1938 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 466..1656 FT /product="Penelope-14_HM_1p" FT /translation="MGSYDGAEICELVGIHILNEISDIIIKTDIGLYRDDG FT LLIIRSSNGQETERTRKNIIQKFKNIGFQIDIKTNLKSVDFLDVTFNLSQN FT TYKPFKKPNDNLQYIHTSSNHPPQILKQLPTSISNRLSQNSSNEAIFNQSK FT NEYEQALKNSGYKNNSLKYHIKQNPPQRNRKRNIIWFNPPFNKNVSTNVAK FT IFLKLLDENFPKTHKLHKVFNRNTVKVSYSCTENIEKIIKNHNKKIIFPKN FT ENNDKKCNCRKEAECPMNGKCKSTNVIYKCIIAAKNQPNKVYIGLTEGEWK FT NRYNNHKSSFNNVKYKKSTALSNYVWDLKNNFNETPIFTWSILKSVPPYSN FT ISKRCPLCLYEKFAIISYPKPDELLNKRNEIISKCRHENKFLLKNYKARYK FT PN*" XX SQ Sequence 2132 BP; 901 A; 432 C; 227 G; 572 T; 0 other; gccttaaaac taaatttgag taacagaata gaacacctag caaaatctcc tgcatttata 60 accctaaaag accataaaga aaattttaaa tcaaaaccaa catgccgcct cattaacccg 120 gcaaaaagcg aaatcggaat aataagcaaa aaaattttag acacagtaaa cagtaactta 180 agaattaggc ttaaagtaaa ccaatggaga aacacaaccg atgttatcaa ttggtttaat 240 aacattccag acaaaaataa ttgtttcttc attcaattcg atataaatga attttaccca 300 tcaataacaa ccgaaatatt taataaagcc atttcctttg caaagcaaca cactcccata 360 acagacgacc aacttagaat aataaaacac agccgaaaat ccctcctcta ctacaacaat 420 aacacgtgga taaaaaaaca gagtcgtgag agtttcgacg tcacgatggg cagctatgat 480 ggcgccgaaa tttgcgaact cgtaggaatc cacatactta atgaaatatc cgatataatt 540 ataaaaaccg atatagggct ataccgcgat gacggtttat taataatacg tagttctaac 600 ggccaagaaa ctgaacgaac tcgaaaaaac attattcaaa agtttaaaaa cataggcttt 660 caaatagata taaaaacaaa tcttaaatca gttgacttcc ttgatgtcac ttttaattta 720 tcccaaaaca cctataaacc attcaaaaaa cctaacgata atcttcaata cattcataca 780 tcctctaacc acccaccaca aatccttaag caattaccta cttcaataag caacagactt 840 tcacaaaatt cctcaaatga agctatattt aaccaatcca aaaacgaata tgaacaagca 900 ctaaaaaata gtggatataa aaacaactca ctaaaatatc atattaaaca aaatccaccc 960 caaagaaatc gtaaacgcaa tatcatatgg tttaaccccc cattcaacaa aaacgtctcc 1020 actaacgtcg caaaaatatt tttgaaatta cttgacgaaa attttcctaa aacacacaaa 1080 ctacataaag tctttaacag aaacactgta aaagtaagct acagctgtac tgaaaatatc 1140 gaaaaaatta ttaaaaacca taacaaaaaa attattttcc caaaaaatga aaataatgac 1200 aaaaaatgca attgtcgaaa ggaagcagaa tgccccatga atggaaagtg taaatccaca 1260 aacgttatat ataaatgtat catcgcagct aaaaatcagc ccaacaaagt gtatattgga 1320 ttaacggaag gcgagtggaa aaacagatac aacaatcaca aatcatcatt caataatgtt 1380 aaatacaaaa agtccaccgc cttatcaaac tacgtctggg atttgaaaaa caattttaac 1440 gaaaccccaa tatttacatg gtctattctg aaatctgtac ccccctactc aaatatttcc 1500 aaaaggtgcc cactatgtct atacgagaaa ttcgcaatta tatcctatcc taaacctgat 1560 gaactcctaa ataaacgaaa cgaaatcata tcaaagtgcc gccacgaaaa caagttttta 1620 ttaaaaaact acaaagctcg ctacaaacct aactaaccac taacattaac tacgcaaccc 1680 tatttcgtac acctctaggc taaccaataa attattcaca tacacagctt ttgtaatata 1740 tttcgtaata taatttatat tcttataccc acttttgaaa cttattttta tattcattta 1800 tttttaatac aataattttt aaaaacaatt cacattatta ttaaatcgta tttttataac 1860 agacattaaa acatacaaca acttcacata tattaaccaa aaacaaatta ataataacat 1920 taaccttaac tactaaacaa tttccgtccc ataacgatct aaaatactaa tataaaccgt 1980 aaattttttc taaacacatt gtagcaaaat taacttttga ctacttgcct gatgattgtg 2040 gtaacacatg aaactttaag tcgcaatata atattataaa tattagcatt tagctaattc 2100 ctgatactgc gggtaacagt aatatatata ca 2132 // ID Gypsy-217_AA-I repbase; DNA; INV; 6503 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-217_AA_; KW Gypsy-217_AA-LTR; Gypsy-217_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6503 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1037-1037 (2011). XX DR [1] (Consensus) XX CC Positions [3158-3619] - Reverse transcriptase CC Positions [4640-5116] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 388..2112 FT /product="Gypsy-217_AA-I_1p" FT /translation="MPPRVSVSGPDFSTLYRGMRVDYLSPEELNYELFLRN FT IIIGDDQSCCKRRRRLKQIMKNEREGHEFIIHYDQDPEDDLKTCRYLFREH FT EQTLSKSPSEQVKNTCKARLLHLGHRLAIVKNHAIGECKTQAIEAFRSVLD FT LFSEHFWSDDAFYAETGGDSNEDDLLDEAVGGSDDPPQTFQAQYVTTEQFN FT QAMTDVGSLIKALASQISGMREEIQKISQPAQPSNNNDTNPFRQPLERPGQ FT LPPSSRGSWANPNADRPAVPPKVSFKKSTGFPESTTIGGDNLNNSFLRRLD FT ELSFTSSVQDNPQRASNNAPTRVQYAPPALQAAPQRKVVPVSQWKIRKYSG FT TDQGLGLNDFLSHIHQLAISEHASADDLFDSAIHLFDGAALSWYTARRNQN FT TLFGWDHLLRELKYEFRHPDLDSVLRSKIYQTRQQKGESFQQFYLQIEKLF FT QAMDYPITEQEKVEILKANLRYDCRKALIGRNIRTLEELIKIGKDLDATDF FT SAFSKVFGTNKREACAITAGSSGSAGSRTFYSRNKPQEPKNVSDQFSKNFK FT AGASGQKGSSNKSNNSQFDSKNMEKSPC" FT CDS 2771..5500 FT /product="Gypsy-217_AA-I_2p" FT /translation="MDFWKIFGIYPAVASIATSSSEMPPSSESSKVDLTDN FT QHQILNRTKKLFKVATAENLDTTYLYEHTITIKEEYKGSKPVRLYPYPIAP FT KIQEGLFIELERMLDRGIIEESNSDWSLNIVPVRKPSGAIRLCLDARKLNE FT RTVRDAYPLPHPGRILGRLPKARYLSTIDLSEAFLQIPLAQESRRYCAFSV FT QGKGMFQFTRLPFGLINSPATLARLMNRVLGQGVLEPNVFVYLDDIVIVTE FT TFEEHVRLLEEVARRLREANLSIKLEKSHFCLDEIPFLGYILSSQGLTVNP FT EKIRPIVEYERPNTITKLRRFLGMSNYYRRFIADYSNTTSALSELLKTKSK FT NLKWTPEAEEAFQDIKAKLITAPILTSPNFNNEFIVHTDASDYAIAGVLTQ FT KVDGQERVIEYYSKKLATPERSYHATEKEGLAALLSIEHFRGYLEGSHFVL FT VTDSSALTFIMRSKWKSSSRLSRWSLSLQMYDMTIVHRRGKDNVVPDALSR FT SVMAVSVKSDCDWFNSLKQQVEDKPGEYPDFRLHNGQLLKYVFNDELADHR FT FDWKIIPSPEARLDIIKASHDGQMHIGVDKTLGKIRQQFYWPNMFREVREY FT VRKCSECRQIKPPNVSTTPLMGDMRQAFRPWQIIALDYIGPLPRTTCGKQY FT ILVTMDLFSKWVQLRAFPQISAKSLCTELVDHWFFRNSVPAIVLTDNATTF FT TSREFRNLLDRFEIQQWLTSRYHSQANPVERVNRGINTAIRSYARDNQRSW FT DSRLSEIEVVINSTIHSSTGYSPFRVARGEEVILKGSEHSRVASEGETTLE FT ERVARIKEQNPQLFALVQKNLVKAHATSTNTYNLRNRKPAKPFEVGNTVYK FT RNTKLSNSNEYYNSKLGAQYVPCTVVARHGSSSYELVDAHGRNIGVWPAQL FT LKAAGQ" XX SQ Sequence 6503 BP; 1804 A; 1487 C; 1496 G; 1709 T; 7 other; atggcgccca acgtaaaagt aggaacaggt tcctaggaga gaaaaaatat tctcgaatat 60 tttttcttca ctagtagtgg gggagtttca aatagcttaa tttgaaagtc actctggtca 120 ggctagagac ggaaaaataa cgttttcttg ttgttcgcgt tttcaaagcc gcttggcgaa 180 gtagaaaaac gttgtggaaa gcaacgcgag tgtgcatttt gcgtcacttc ggctatctga 240 tactttctaa actaatcacc aatttctata ggcctctggg tgtttgaagt cgaagattag 300 agttgtgagt tggggtttta acaaccaacg acagtagttt aaaagctccc attttgcatt 360 tagtatcccg ttgcagattc cgttaaaatg ccacctcgcg tatccgtgtc cggtccggac 420 tttagcaccc tttatagggg aatgagggta gactatttgt ctcccgaaga gctcaattac 480 gagcttttct tacgaaacat tatcataggg gatgaccaat cgtgttgcaa acgtcgccgg 540 cggttaaaac agataatgaa aaatgaacga gaaggtcatg agttcataat acattatgat 600 caggatcctg aagacgatct taagacctgt cggtatctgt ttcgggaaca cgaacagact 660 ctctctaaat ccccaagtga gcaggtgaaa aacacttgta aagctcgctt gttgcatttg 720 ggacatcgac tcgcgattgt gaaaaatcac gcgatcggtg aatgtaagac acaagctatc 780 gaagcatttc gcagtgtgct tgatcttttt tcggagcact tctggtccga tgatgcgttt 840 tacgcggaga ccggtggcga ttccaatgag gacgacttgt tggacgaagc tgtcgggggt 900 agcgatgacc caccacaaac ctttcaagcg caatacgtaa ctactgagca gttcaaccaa 960 gcgatgactg atgttggaag tctaattaaa gctttggctt cccagatcag tggcatgagg 1020 gaagagattc aaaagatttc tcaacccgcc cagccttcaa acaacaatga caccaatcct 1080 ttccggcaac cgctcgagag accaggacaa cttccccctt cttccagggg aagttgggca 1140 aatcctaatg cagacagacc tgcggttcct cctaaggtct cttttaaaaa gagtactggt 1200 tttccggagt ctaccacaat tgggggagac aacttgaaca attctttcct acgtagattg 1260 gatgaacttt ctttcacgtc ctctgttcaa gataacccgc aaagggcwag caataatgct 1320 cccacaaggg tgcaatatgc tccgcccgct ctccaagcag ctccacagcg caaggtagta 1380 ccggtgtctc aatggaagat tcgaaaatac tcaggaacag accaaggtct gggattgaat 1440 gattttctgt ctcacattca tcagttggcc atatccgaac acgcctccgc cgatgattta 1500 ttcgattcgg ccatccattt gttcgatggc gcagccctaa gctggtacac agctcggcgc 1560 aaccaaaaca ctctttttgg ttgggatcat ttgcttcggg agctcaaata cgaattccga 1620 catcccgatt tggactcagt actccgcagt aaaatttacc aaacgaggca acagaaaggc 1680 gaatcgttcc agcaatttta tttgcagatc gagaagcttt tccaggcaat ggattatccg 1740 atcacagagc aagaaaaggt ggagattctc aaagccaatc tacgctacga ttgtcgcaaa 1800 gcgttgattg ggagaaatat tcgaactctt gaagagttaa tcaaaatcgg gaaggatttg 1860 gatgccactg atttctccgc tttttcgaaa gtattcggta ccaacaaacg ggaagcttgt 1920 gcgataaccg caggatccag tggttctgcg ggatctagaa cwttctactc gcggaataaa 1980 cctcaagaac cgaagaacgt atctgatcag ttttcgaaaa acttcaaagc tggcgctagt 2040 ggacaaaagg ggtcgtcgaa taagtcgaac aattcgcagt tcgattccaa gaacatggaa 2100 aaaagtccat gtaamacctc aactccgaaa gaaacgtgtg ctggtccgag taagtcgagt 2160 gcattactga aaatggtatc gaattaccgt cctccatcaa atgacgaatg tttcaattgt 2220 ggagaagaac acgatttgtc agactgtccc atccctcgca gagtattttg cgatgcgtgt 2280 ggctttaagg gctacacgag gagtaattgt ccgtactgtt taaaaaacca gatgcggaag 2340 tcctaaagac gctggaactt ccgcaagcgc aaaacgaagc agatttcatc agctggctct 2400 cggaactggg tcagttttat gagccacgtt cggatgaatc ggaggatgaa tctgtttctc 2460 aacaaattca taaaattact cttgataaca gttacgatga tcgcccacat attaatatta 2520 acgttttcga tactacagta acggcgcttc tcgactgcgg aagcatctta cttttattaa 2580 ccgagctttg tacaacaaat ttcgacgagc taagatcagg gaaccttcaa cacmggtcga 2640 actcagaacg gccgatggct cccgtctaga gattatcgga gaagtgctcc ttccctatac 2700 attcaatggt aagacccgcg tgctaccaac tttggtggcg ccagcactga ccaagkaatg 2760 tatctgtgga atggattttt ggaagatatt tgggatctat cccgcagtcg cttccattgc 2820 tacttcctcg tcagagatgc ctccttcgtc ggaatcttct aaggttgacc ttaccgacaa 2880 tcaacaccaa atcctaaatc gcaccaaaaa gcttttcaaa gtggccactg cggaaaactt 2940 agacaccacc tacctttatg agcacaccat tactataaag gaggagtaca aaggctcaaa 3000 acccgtgaga ttgtatccgt acccgattgc tccaaaaatc caggaaggac tcttcataga 3060 actggaaagg atgcttgacc ggggtataat tgaagagtcg aactcagact ggtcattgaa 3120 cattgttccg gttcgaaaac cttcaggggc tatacgtctt tgtcttgatg ctcgtaagct 3180 caacgagcgg accgtacgtg acgcttatcc cctcccgcac cctggacgca tacttggccg 3240 cttaccgaaa gcccgctacc tgtcgacaat cgacctttcg gaagcattcc tgcagattcc 3300 attagcgcag gaatctcgtc ggtattgtgc gttcagtgtg caaggaaagg ggatgttcca 3360 gttcacccgc cttcccttcg gcttgatcaa cagtcccgct acactggcac ggttgatgaa 3420 ccgagtatta ggtcagggtg tgctggaacc gaatgtcttc gtctacttag acgatatcgt 3480 tatagtaacg gagacattcg aagaacacgt acggttgctc gaagaggttg cgagacgtct 3540 tcgggaggca aatttgtcaa ttaagctcga gaaatcacat ttttgcttag atgagatacc 3600 gttcctcggc tacatcctgt cttctcaagg attaacggtt aatccagaaa agataaggcc 3660 gatagtcgag tacgagcgtc cgaataccat tacgaagctt cgacgattct tggggatgtc 3720 gaactattat cggcgattca tcgccgatta tagcaacaca acctcggccc tttctgaact 3780 tctgaaaacg aagtccaaaa atttgaaatg gacccctgaa gccgaagagg cgttccaaga 3840 catcaaagct aaattgatca ccgcccccat tttgaccagc ccaaacttca ataacgaatt 3900 catagtccac acggacgcga gtgactacgc cattgctggc gttctcactc aaaaagtgga 3960 cggacaggaa cgagtgattg aatattattc aaaaaagctc gcgacaccag aacggtcata 4020 tcatgcgacc gaaaaggaag ggcttgcagc cttgctctcg attgagcact tccgagggta 4080 tctcgagggg agccactttg tcctggtgac agactcttct gctttgacgt ttattatgcg 4140 gtcaaagtgg aagagttcat ctcgtctcag tcgatggagt ttgtccttgc aaatgtacga 4200 catgaccatc gtgcatcgcc gtgggaagga caacgttgtc cctgatgcac tgtcgagaag 4260 tgtcatggca gtatcagtga aatctgactg tgattggttc aactccctta aacaacaagt 4320 tgaggataag ccaggcgagt atccagactt tcgcctgcat aacggtcaat tgttaaaata 4380 tgtatttaac gatgaattgg ccgatcatcg tttcgattgg aagattatac ccagcccgga 4440 agctcgcttg gatataatca aagccagtca cgatggtcag atgcatatag gagtcgataa 4500 aactctggga aagattcgcc agcagttcta ttggccgaac atgttccggg aggtgagaga 4560 atatgttcgg aaatgttccg aatgcagaca aattaaaccc ccaaatgtgt caaccacgcc 4620 gctaatggga gacatgcgtc aagcctttcg tccctggcaa attattgccc tcgactatat 4680 tggtcccctt ccacgcacta cgtgcgggaa gcagtacatt ttagtgacaa tggacctttt 4740 tagtaagtgg gtgcaattac gtgcgttccc ccagatttcg gcgaaatctc tttgcaccga 4800 gttagtggat cactggtttt tccgtaattc kgtaccagct attgtgctaa ctgataacgc 4860 tactaccttc acgtcccgag aattccgcaa tctgctggac cgcttcgaga tccaacagtg 4920 gttaacttca cgttaccact ctcaagcgaa tccggtggaa cgcgtgaatc gcggcattaa 4980 cacggcaatc cgctcttatg cgcgtgataa tcagagatcc tgggattctc gcttatcaga 5040 aattgaggta gttattaact ctacaataca ttcctctact ggctacagcc cattccgggt 5100 agcccgagga gaagaggtaa ttttgaaagg aagtgagcac tctcgtgttg cgagcgaagg 5160 tgaaaccacg ctggaagaaa gagtagcccg aattaaagag caaaaccctc agttgtttgc 5220 cctagttcaa aaaaacttag taaaggcaca cgcaacatcc accaatacat ataatttacg 5280 caaccgtaaa cctgctaagc cttttgaggt aggcaatacg gtgtataaaa gaaacacgaa 5340 gctttcaaat tctaacgaat attacaattc gaaattagga gcacaatatg taccttgtac 5400 ggttgttgcc agacacggtt catcgtccta cgaattagtt gatgcgcatg gccgcaatat 5460 tggcgtatgg cctgctcagt tgctgaaagc cgcaggacaa tgacctttga cgctattgtt 5520 gtagcctgtc tgatgatcgt gagaggatcc gttcccggtc actgagtccg gcttgtagcg 5580 aacctagaat aataaattat gaaaaatgac cttctctagt aattttcaaa ttgataagcc 5640 taccctctgt ccttggcgca tttttcgtgg gcgttcgctt accgcggagc gactcctgac 5700 gaagttcacg cttgttagga atctgttagg aacaatttta attacgttaa ttgtctgccg 5760 gaggagacgt cctcaaacgg tggcataaaa gattagacgc gcgcggactc cgttccacag 5820 tggacgagag tcaaccgaat actgaaatcc ggatttgaat cgatcgactc acgagaagtc 5880 ggtctaagca gttcgcgcaa ccgagcaact gttatacgat cgttcgtacg tacgacatac 5940 cgcaatacgc gagagtttgt ctcacgccat acgcataact ctcacgcagc ttgcgcacgt 6000 gtaagttatc acgaatacgt cttgtgatag tatttccggc cacagtagac ttcgttagsa 6060 aataggcaaa ttgtctctga tcacataatt agaaaattag tttcgtcgtg tcctccgtcc 6120 gagtcggtcc tgtcccgcga gacagtcggt ggcgtattat aatgtttagt ccaggatgtg 6180 taactgggat cacggacgta aagtggaggt ttgttctcat atagttttgt gtcgacaccg 6240 ccctcgaaag cctagggagg ttgtgagagg tgaagattgt attgtgatgc tgctagtttt 6300 gactttctgg taattagctt tctcgcgatg aattccctgc aatgtgcgtt tcgttcaatt 6360 tcgttttctt cttcaagtta ctattatttt tccgtattta gtgctattgt taaataagat 6420 ttgtacagtt taaatattag tttcctagaa caaaaaaaga acaattgtaa attatccttt 6480 tttttcactg accagagggg gac 6503 // ID P1_AP repbase; DNA; INV; 1498 BP. XX AC . XX DT 21-FEB-2009 (Rel. 14.02, Created) DT 21-FEB-2009 (Rel. 15.12, Last updated, Version 3) XX DE P-like DNA transposon - a consensus sequence. XX KW P; DNA transposon; Transposable Element; P1_AP. XX NM P1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1498 RA Jurka J.; RT "P-like DNA transposons from pea aphid."; RL Repbase Reports 9(2), 467-467 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(249..425,395..1168) FT /product="P1_AP_1p" FT /translation="MIMYIVYNIILITKQNNKYKKYYKFKFENSWLIGISF FT FIRILSGEFSSNITLNLYTQEEHNFKFIYTRRISQDCLENFFGSVRKQSGD FT CITPTPIQFTRAFKKLFTCCFLVHSGAENCADDFSIILAQLKDVNNVSVNN FT VLDNNTRPSNTDISISQNDLDFRGGSLSLVSKNIIKYKCGFLIKKCLTKHT FT CEVCENYSMAHQALDENNLYCYFKAYQTDKSLFGSLRMPNDDFFIFICKLE FT ILFQXNFSDFALKDKILNSFIELFKTEHLNHPCENFPIIYLLKLFTRMSIY FT YAIKTINQNLKNPTNKRKLIIWKH*" XX SQ Sequence 1498 BP; 588 A; 165 C; 207 G; 537 T; 1 other; gtatagaaga taacaagtag ggaaagcaag ctttatcaca gaatacaact attttatcag 60 caatcggagc tggcagacca cagttttgat aatttcaaaa taaataattg gcgggtaaac 120 taaaaaatgt aatattatta tacaccgtag gtatagaaga tctgaagatg atttatcacc 180 tagtaacatc gaaattaaaa attctattta taaataatgt acctaatgtt atgagggaat 240 gatgataaat gataatgtat attgtataca atataatatt aataacaaaa caaaataata 300 agtataaaaa atattataaa tttaaatttg aaaattcgtg gttaatcggt attagttttt 360 ttataaggat cctgagcggt gagttttcaa gtaacataac tttaaattta tatacacaag 420 aagaataagt caagattgtt tagaaaattt ttttggatcg gtgagaaaac aatctggaga 480 ctgcataaca cctacaccaa ttcaatttac tagagccttt aaaaagttat ttacctgttg 540 ttttcttgtt catagtggtg cagaaaactg tgctgatgat ttttctatta ttttagccca 600 attaaaagat gtaaataatg ttagtgtgaa taatgttttg gacaacaata ctagaccatc 660 taacactgat atttcaattt cacagaatga tttagatttt aggggaggtt cattatcttt 720 agtcagtaaa aatataataa aatataaatg tggtttttta ataaagaagt gtctgactaa 780 acatacctgt gaagtgtgtg aaaattatag tatggcccat caagcgttgg atgaaaataa 840 cttatactgc tactttaagg catatcaaac tgacaaatct ctatttggta gtttgaggat 900 gccaaatgat gattttttta tatttatttg taaacttgaa attttatttc aagamaattt 960 ttctgacttt gccttaaaag acaaaatatt aaactctttt atagaacttt ttaaaactga 1020 acatttaaat catccatgtg aaaactttcc tatcatctat ttattaaaac tttttacaag 1080 aatgtcaatt tattatgcaa ttaaaactat taatcaaaat ttaaaaaatc ctactaataa 1140 aagaaaacta ataatatgga aacattaaaa tattatttgt gtacataaac tatataagta 1200 tataactatt ttgtttttaa agagatttga ataaatatat taattgtaaa aatattgaat 1260 tgtttttagt tacttcattt actgtcatac ctatattata agttttgtta ttaatttaat 1320 attaattatt aacacaattt gaagttataa atttcatata aattatgaaa taaatcatat 1380 aatcatagct aaaagtggta attgatgcag tgataagcag aaatgataag ataaaaaagt 1440 aaggatccag tgaacaatgg aatttgtagt agctttccct acttgttatc ttctatac 1498 // ID Gypsy-232_AA-LTR repbase; DNA; INV; 504 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-232_AA_; KW Gypsy-232_AA-I; Gypsy-232_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-504 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1068-1068 (2011). XX DR [2] (Consensus) XX SQ Sequence 504 BP; 149 A; 112 C; 96 G; 147 T; 0 other; tgtgctatac acgcatcgac ccaaaaacat ccttccagca tctatcagcc aagcaataaa 60 tacaacacag aatgttttgt cacaaattga ctaggtcagg tcagcatgcg taagcaattt 120 gcatcaaggc actcgagtgc gctacatgtt atgctgactg catgtttact cgttggtggt 180 cgtccgtttg gacgatgcgc taggctcata tttcaaccca aattgttatt gccaaatatt 240 gtcaccgccg acccaggcat ctttgccatt agcataagat tgtatgattg tattatttgt 300 gttaatgtat tacagcaagc cactaaataa aaggagctga tgttccaatg atagaacagt 360 cttgttctga ctctgaatct atgcgtttca atgcacctct ggtgtatacc gaaacatccc 420 tctagcaacc tggtaatcag gtactgctaa ctatttagga atagactaag atcagtaaca 480 tatcgatgtg caatacatag tcca 504 // ID LDRP1 repbase; DNA; INV; 96 BP. XX AC M21009; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE L.donovani highly repetitive DNA. XX KW LDRP1; Repetitive sequence. XX OS Leishmania donovani OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania donovani species complex. XX RN [1] RP 1-96 RA Ellis J. and Crampton J.; RT "Characterization of a simple, highly repetitive DNA sequence RT from the parasite Leishmania donovani."; RL Mol. Biochem. Parasitol 29(1), 9-17 (1988). XX DR GenBank; M21009; Positions 1 96. XX SQ Sequence 96 BP; 19 A; 28 C; 22 G; 27 T; 0 other; taaccctaac cctttttctc tccacttatc cttgcacagc gacaccctct ccaacaggac 60 ggggcgtgac ggtgtactgg tgtatggtta gggtta 96 // ID BEL-101_AA-LTR repbase; DNA; INV; 504 BP. XX AC supercont1.335; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-101_AA_; KW BEL-101_AA-I; BEL-101_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-504 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.335; Positions 1037935 1038438. XX SQ Sequence 504 BP; 177 A; 69 C; 122 G; 136 T; 0 other; tgttgacggc gccgagctct accccaagct caggaataag agtgagaagt gagaagggtc 60 accgagacaa gtgatacgcg cgaccggtca actgtggtgg aaggtcgttg atgatgatgt 120 gttgtcaaac gaaaagtaca aaacataaac aaaaacaaac tcggaagata gttgatgtgg 180 agtttgagca aatcaataag tttttctgtg tgaggcaaat taattagttt tctgtgtgaa 240 tatatcacga aaaagaagta ggtgagatcg gtaattatcc gaaacgattt attcgtaagt 300 aattataatt tatgtataac aggaaggact tgtaatcgaa atcagtaccg catcgttgta 360 agagggaaag agagtgaact atttattgta agtttgaaat caattagggt ggactatgta 420 attaaaataa aatattttca gtttgagctg agcgtaaaac gactgctacg aaaaggaagt 480 tttcatctcg atcgattccg aaca 504 // ID BEL-127_AA-LTR repbase; DNA; INV; 740 BP. XX AC supercont1.130; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-127_AA_; KW BEL-127_AA-I; BEL-127_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-740 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.130; Positions 368110 368849. XX SQ Sequence 740 BP; 264 A; 106 C; 179 G; 191 T; 0 other; tgttgcgacg ccgcttggac acccctactt acggagcata agtggaacat gacagttcgc 60 aacgcggtgg tagtggttag cgtaagagca taaaaaacag gttcgaagta gtagtaggca 120 gcaggagtaa ataaaagcaa aagtgtagcg ggagaaatat agtgaattaa cgccattgca 180 aattaaagtc ggttgtttga gaattttatt gcgaaataca tattattccg gtagttaatt 240 agcgatcaac aggtcaaaat cggtgtgtaa agtgccttct gggaagtccg taagtgtacc 300 tgaaaaggaa gaaaagaaaa taagtgcgaa aatctaggag aaaataatat ttgatttgct 360 tgttatttac ttacctgtag atcgggattg ctacaaacta tttcaatagg aaatcgtagt 420 taatctaaca gtccggtgga ttagaacgta agtagacctg aaatggaaag aagagagaag 480 cattaattgg aaaattacaa cttaggcgaa gatcacacga ccaaaacagc tgattgaaga 540 tagggtagtt ggaccttagt tcgacacctg caaagtggaa gagtaaaagg gagtctagag 600 taaaaggaaa ccacttcttg tgagtagcaa ttatttggtc ttaataatgt aatttatata 660 agaataaata aatttgcagt ttgagcatcg ccgaaaacaa cttggctgct gaggaaaatt 720 ttggtttcca ctcgggaaca 740 // ID CR1-13_BF repbase; DNA; INV; 5072 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-13_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-13_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5072 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5072 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1584-1584 (2009). XX DR [2] (Consensus) XX SQ Sequence 5072 BP; 1439 A; 1162 C; 1025 G; 1446 T; 0 other; cgctaggtgg cgctttcggc agtagcagtg tttatgagtc ggtctgtaat agttctcaat 60 aaaagcttat ttctcgtcat ttgcgtgccc atccgatcta aatccacgtg cagagagtta 120 gcaggacagt atcgctacac tttgaagacg agacttgaac gttttgtgtt gtgtacaacg 180 atttctggat cttgtaagtt gccagcgtcg gctgagtggg gagggtgccg aacaaaggaa 240 tcctacacga cagcacacca gacgtcctac gatcgtcgag tggagttcta aatgcccaga 300 aaagcatcca tcaaagaagc tgatttctaa gtctacgggg ataaccgcac gatttgaccg 360 gtgagtcttc tgtgaagaaa ttggctataa gcacagtgtt gggctgtagc aagcctttgt 420 ttacattttc atgatgtccg aagtggatat atccgtgcca gatccgcttg cggacaagct 480 ggccgacacg aatctgtgga ttaaatgcta catgaaatca ggattcaggg caagggaaga 540 agacgttgcc ttgtccctag tagaaagtac gcaacatcat ggagccgatg gtgtgagacc 600 gtcagacata ttgggtgttc aagttattcc cccaatgatc atcataaact tgtcaaaccc 660 caaagtaaaa gccgcagctc ttaaacaggc tcatattcag ttacatggca aggatctgac 720 tttgctcgac ttcgtgacca cacccctgac aaaacctcgc ttaaccagga tatcaataca 780 cggggtgtat cactcggtac ctgacgtcgt tgtagctgat tgggttgaca agttttccat 840 acgtgccaca ccaatcgaga gacatctagt aaaaaacgct actgacatgt tcaagcacct 900 tagaacgggg cacaggttct gttatgtgga aaagttcact gagcgtccgc ccccgaggtt 960 caccaccata agtgtgccgc atccactgaa cccttctgaa tgtactgaca ttgatcttga 1020 ggtattttgt ggtcaagggc agatcgttaa ctgtcgacgt tgcaaagctc tcgaccacaa 1080 ggcttacgaa tgccctaaca tcacgtgctt ttcatgcggt atgactgggc acaccaaggc 1140 aaattgcgct tccattactg aaaccgagcg acatccaact cgctatcaaa gaaggtccgc 1200 tgaccgagat gattctgttg ggtttagtcc ccaaacccaa gactcggtag acatgatcaa 1260 ccgagccttt aacaagtatg tgagaccagc cgtacctacg tctaacagat tccagtcact 1320 cgctagacta gaggacgaat ctgacatcgc cattgcagga gctatacggg acattcgtgt 1380 tcaacacaca gatcagcgtc cgtcagcggt tgctgatgct gacaggcgcc acctctctaa 1440 tcacaggaat aagcggcggt ctagggcctc cagactcatg tcggctgagg aaccacatcc 1500 caacagcgac gtcacagcac caagctaccg cccctcaaac gatcgctaca acctgaggaa 1560 caggaaacgc tcgctcccct cacccgagca acgcacacat ccggcaacta aaaaacagaa 1620 cgcagaacga tttcaggtca gtcaaggagt cccgaaacct gctcgtgacc tacacaccaa 1680 aaccgtttcc cacataacca cccctccact caccaccgat gttccacctt tggacctccg 1740 gactacgtgt gaggaatccc tgtcggccat gcccgcgaca agtcctcctg tttcaaccgt 1800 ttcctgtgat actgtccgcg actgcgccac gaacactcac tcaaactcca gaccgttggt 1860 gggcgaatag caacgagcta agactctccc aactgacatt tcatgtctca atgtactata 1920 tttaaacgcc cgcagtgtca agtcagttag caagaaatac aacaaactcg ttcgctttca 1980 aaatcttgtt gctactgtcc agcctgacat catatctgtg actgaaacct ggctaactcc 2040 gtgtgtagcc gaccaggagc ttctgcccca ggactacacc gttcatcgga aagacagaag 2100 tgtcaccaag cctggagtga caggtggtgg tgtaatgttg gctgttaaat ctaatattct 2160 ctcacagaga agagaggacc ttgagcccat ggacgaggtt cttatttgtg acattcatcc 2220 cacaaatcat caaaaaattg tactcattgt gtattacagg ccaccgagtg gtgacttgtc 2280 tgtctttaat gctaatctga gttctattct aagctgtgtc tccaaggagt acagccaggt 2340 agtactgatg ggtgatctta actgcccaaa catagattgg tctaaaggag gaagccactc 2400 cgaacatgag agtcgcctat gtgatttact aaatgactac gctcttactc agataaataa 2460 tgtcatctcc aacagtcatg gtcatttgct agatgtagtg ttgactaaca tacctgacaa 2520 atgtttagaa atcgaacagg tgacatcaga gtatcctact gaccatgcta tattacagtt 2580 tggtttgatc atcaatagga agatcaaaag gcatgaagat aaacgatatg tttataatta 2640 taaaagaggt aactttaata ggctaaggtt agatctagaa acaaaacttc acaatatcaa 2700 cgatgatgtt aatcttgaca atatctggga gaggtggatt ttcgaagtta ccgaatctgt 2760 tgagctttgt gttcccaaaa tgtgtgttaa acgatcatct tcattacctt ggttcgacgg 2820 tgaagtcaga catgcgcgca acaaaaagaa gacggcatgg cgattagcca aacgaactaa 2880 cagtccttca cattggagca agtttagaaa actcagaaac gaccttcaaa gactcattaa 2940 gacaaagtac aactcgtttc tgaacaatct tggctcgcta gtacagacga acccgaaaag 3000 gttctggtct ttttttagat ctaaaaccaa atccaaatgc cttccggccg tcttgaattc 3060 agctagcgcc gtagcaagct ccgctataga gaaggctgac atgtttaacg attactttca 3120 ttcagtcttc actttatcta atgataccct ttgtctgcct gaaatcgctg ttttcgaaca 3180 tccatgtcta ggaaatattg tattttcaat tgaagaagtc caaaatgtac tgtctagact 3240 cgattgctca aaggctattg gtccagatgg actttctcct accgtatttc gagagtgcgc 3300 acatgtgatc gctcctcacc tgactgcatt gtttaacaga agtttgaacc aaggctgggt 3360 tcctgcgcag tggaaggatg caaatatttg tcctgtcttt aaaaaaggcc ggaaagattt 3420 agtagaaaat taccgcccaa tttctcttct tagtattgtt ggaaagataa tggaaaggtg 3480 tttgtttaat cgtatctttc ctcatttaaa gcacttgatc tatccgttac aacacggttt 3540 catcaagggc agatcatcat cgacacagct ccttgaaatt taccatgaaa ttggcgggat 3600 tttagatcga ggtggccaag tagacattat atatttggat ttttccaagg catttgactg 3660 cgtttctcac cagctgttag tccataagct caaaatgtat ggtattcatt ctaatttact 3720 ttcctggttc catagttatc tttcatgtag gaggcaacgt gtaatggtcg agggtacatg 3780 ttcggattgg ctcccagtag tatcgggagt gccccaaggg tcaattttag gtcccctcct 3840 tttcttatta tatatcaacg atcttccatc cgtagtaagc aacaaaatgg gcctttacgc 3900 tgacgactcc aaatgctaca aacatgtatc tactgtttta gattgtgtat cactccagag 3960 agatattgat tgcatgtacg attggagtaa tacttggcaa atgaatttta atcctgacaa 4020 atgtaaaatt ctcaggataa gcagatctag aaatcctatt acttttacgt acaaaatgaa 4080 tgacacggtc ctcgagtctg ttcctgaatt taccgactta ggggtattag ctacgtggaa 4140 tttaactttc gattctcatg tacataatat cacttcgaga gcaaattcag ctctcggctt 4200 tttaaaaaga tctgtaggtt tcaatgcccc tgtaaatgta aaaaaaatgc tatatctcac 4260 tttagtcaga agcagacttg agtactgctc tgtcgtttgg tcaccctata cacacaacct 4320 catttcctca gtcgaaaaag ttcaaagacg tgcctccaag tatattctta atgattacac 4380 agcagactac aagacgagat taatacattg ttcactatta cctctgtcct accgacgaga 4440 aatacttgat ctatgttttt tatataagtg cttactagga ctttatgata ttaatattga 4500 caactttcta aatctcccac atcccctttt aagagcacat tcacaagcta aacttatccc 4560 tggtaaatgt cacactacaa actttcagta ctcctatttt aacagaatcg tctacatatg 4620 gaactcctta ccacctgaaa ttagaaaact tcggctaaat ccaacattct caacaaatcg 4680 ttttaagcag tccgttattg atcattactt ctctcttttg gccacacatt ttgatgtcca 4740 tctaataagc acttggaccc attgttccaa gtgctgattg gatgcacatc taaatgtgtg 4800 gtcaacctat agttacatca ttctcatttg tttgttaacc ttagtttatc tgatcccttt 4860 tattgttgtt tcgtttgggt attgtttcgt atcttgccgt tttttgttag aatacctcta 4920 tgtaaagttc tttgttttta tttgcccttt ctctattgtt cattgttatg attttaactg 4980 ttgctggagg tgtgggtcat gtaaaggtat atcctgttac ccacaccttg cagaactttt 5040 tctgtaaagt ttaataaata aataaataaa ta 5072 // ID Copia-1_AC-LTR repbase; DNA; INV; 215 BP. XX AC AASC02061324; XX DT 27-JAN-2011 (Rel. 16.02, Created) DT 27-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_AC_; KW Copia-1_AC-LTR; Copia-1_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02061324; Positions 632 846. XX SQ Sequence 215 BP; 70 A; 38 C; 33 G; 74 T; 0 other; tgttaataga taaactcacc tcattattat tattatttta tgaaacattt taatgcgaaa 60 tgttctgctg atgtcttgaa tatgtaggat caggatgatc ccccttttgt ttgtattgta 120 acaataataa agaaaacttc ctgttgacag agggaagaag aacgaaccac gctctgctac 180 catttttatt tattttgtcg aacacccacc aaaca 215 // ID DNA2-6_TCa repbase; DNA; INV; 609 BP. XX AC . XX DT 22-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-6_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-609 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 667-667 (2009). XX DR [1] (Consensus) XX CC TSD is TA. Based on that, it is tentaively classified as putative CC non-autonomous Tc1/Mariner. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 609 BP; 214 A; 114 C; 93 G; 187 T; 1 other; cagggtgttt acggtcttgg ggttgaaact taaaggagat gatatgagtg ataraagaac 60 cccaaaacgc tatataaaat aaaaattcac attataacag aaaatcatta aatttacatt 120 atttgtttgt gaactgctgc ttcgactaca acaattacta ttaccttgtg cttcgccgta 180 agtcctaatc atgacaacca attcattatt tcaataattg tagtccaggg acgtaaattt 240 gatgaaaaac ctccacaaaa cacaaaccaa acttgctcga aacgatttcg gttgaagtga 300 tcaaacagag gaccagcgga tttaaatttt gatcaaaact aaagtaaagc caaatgaaat 360 gatgacagtg tcaaaactat gaagtgacgt ttttcactgt cgtggttaca atattcttat 420 ttgttgcaac aattaaattt tattttaaca gagaaattat tccagtgacc ctcaaaccct 480 gtaccaatac ttcggagtta aataacattc ttaaaaataa caattccctg agtttttttc 540 gcagttacct tatttctaca ctcttttagt accaggtaaa gtttcgactc ctagaccgta 600 aacaccctg 609 // ID hATm-8_HM repbase; DNA; INV; 3273 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3273 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 212-212 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(236..574,618..1760,2114..3025) FT /product="hATm-8_HM_1p" FT /translation="MSTSNGYGKKSHREVFLVGKRIGTIPITQLPTKRTVL FT QYYHYRDLMLRADFPNISSVSIASCPLGDFFTAQCAEIGGCLQKCLPCVLY FT EVKKIWQKAGIPFVNTDKHIRFLLYYYTSNNYKVITFYIFIINYNSFNAFF FT YFCFRDKIVDLEKKRKDLNKHRNRLSKNLVLKRETFSRMLEEVFDVSQNNC FT EHTIMKDKTKDLKTRREDVDFLNDQKGARVQKMLGKDKISQNIVKNKRKRE FT MKKKNQSIKESIRKQKELVTEKFDDTMKNLNTLDNDQDEQYLQPSIKIKKS FT NAVPLIVPKNIGKDLAPTASRYGISSTALSATLASLINNSSGTTDDFSISK FT RSILRQKKSSISTTACKIKDNFIILAKGKSLTVHFDSKIMVELREIGKEKV FT ERLAVLISSPDLSESQLLGILIVKDGTAESLGKAIKKTLQDWELFDYVDCL FT SFDTTVTNTGWLKGVCTLIEQWRGKALIWMACRHHVYELHIKVNIEKKSAS FT ESLTFLQNCLDNDIFPRNDYKYFIQLAVLWLGGKVKDFHFRFPMASHHARF FT MAQGVYYLTYDILQPQFSNLCPDIISLQEGKLIHEIGSFTALFYVPWFIKA FT PIPAIAPSLDLKAINEMIQYSELCLKPAEAVLKSLEKHNWYLNERFVVMCL FT ADKCLPVDQLHKLALKLAQTAKPTSYKMGKLSSEIFTDNEKKITKRIEDFI FT GPESWFFFDLLKMTLHETEWLKISSSNWETCSGYIRFKNYITGLLVVNDSA FT ERAIKLGQDFIETFRNEEDNQANLLVVADHRLKFQTTEKKSWLKALK" XX SQ Sequence 3273 BP; 1223 A; 453 C; 531 G; 1066 T; 0 other; taatgagggg ggctgggtca cttacaaata tttcaaagat atctttaagg ttatttttgt 60 agcctgataa atctaattta agattatatt ttgttttatt gcaaaaaatg aaatgaaatg 120 gtcagcccta cccatttaaa gacatgctgt gtatatttgg gagttagggg gggaggggtt 180 cagttaaaaa aaatccgtaa tttttaaaat tttcatttaa gtgtaaataa aaacaatgtc 240 aacaagcaat ggttatggta aaaaatctca tcgtgaagtt tttcttgttg gaaaaagaat 300 cggaacaata cctatcacac aattgccaac taaaagaact gtattgcaat attaccatta 360 cagagatctt atgctaagag ctgattttcc aaacatttca tcagttagta tagcttcatg 420 tccacttggt gatttcttta ctgcacaatg tgctgaaata ggtggttgtc tccaaaaatg 480 tttaccatgc gtcctttacg aggtgaagaa aatttggcaa aaagcaggaa ttccttttgt 540 taatactgac aagcatatca ggtttttatt atattaaaat gctttttaat ttttagtttt 600 atccagttta aaattgatat tatacaagta acaattataa agtcataaca ttttatatat 660 ttataataaa ttacaacagt tttaatgcat tcttttactt ttgttttaga gacaagattg 720 ttgacttaga aaagaagaga aaagatttaa acaaacacag gaaccgttta agtaaaaatt 780 tagttttaaa aagagaaact ttttcaagaa tgctagagga agtattcgat gtaagtcaaa 840 acaattgtga acatactatt atgaaagata aaactaaaga tttaaaaact agaagagagg 900 atgtagattt tcttaacgac cagaaaggag cgcgtgttca gaaaatgttg ggtaaagata 960 aaatttcaca aaatattgta aaaaataaaa gaaaaagaga aatgaaaaaa aaaaatcaaa 1020 gtataaaaga gtccataaga aaacagaaag agttggttac tgagaaattt gatgacacaa 1080 tgaaaaattt aaatacttta gataatgacc aagatgaaca gtatctacag ccttctataa 1140 aaattaaaaa atctaatgct gttcctctta tagttcctaa aaatatagga aaagatttag 1200 ctccaacagc atcacgatat ggaataagtt cgactgcact gagtgcaact cttgcttctc 1260 ttataaataa cagttctgga acaacagatg atttttctat ttctaaacga agtattttga 1320 ggcagaaaaa atcaagtata agtacaactg cctgtaaaat taaagacaac tttattatat 1380 tggctaaggg taaaagtctt accgtgcatt ttgactccaa aattatggta gaattgagag 1440 aaattggtaa agaaaaagtt gaaagattag ctgtgctaat cagttcacca gatctttcag 1500 aatcccaact tttgggtatt ttaattgtga aagatggcac tgctgagtct cttgggaaag 1560 caatcaaaaa aactttacaa gactgggagc tatttgacta tgttgattgt ctgtcctttg 1620 acaccacagt cacaaatact ggttggctaa aaggagtctg tactcttatt gagcagtgga 1680 gaggaaaagc tttgatttgg atggcttgtc gtcatcatgt ctacgaatta catataaagg 1740 taaatattga aaagaagtca taaaatattt ttatatttat ttatttattt aatatttctc 1800 gatgaattta ataacgattc accttattat taaaatttag ttaagtgatt tgaaataaat 1860 atattttagc acttttcaag tgttcttact ggaggtaaaa caagtggacc aaagactgag 1920 ttatttgaaa ggctaaagtg taattggaaa tcgattcttg aaaagaagat aaattatgat 1980 aacctaaaga gatttgactc agaaaaaaaa attaaatctt ttttggaaaa gcaggttggt 2040 ttgattgtcc taattagatt aatctgtttg attataagtt ttaataataa actttcttta 2100 ttttttattt taggcttctg aatcattaac attccttcaa aattgcctcg ataatgatat 2160 atttccaaga aatgactaca agtattttat acagttagca gttctttggt taggtggcaa 2220 agtaaaagat tttcacttta gatttccaat ggcttcacat catgccaggt ttatggcaca 2280 gggcgtttat tatttaacat acgacattct acagccacaa ttcagtaact tatgtcctga 2340 tataatatca ttacaagagg gaaagctgat tcatgaaatt ggttcattta cagcactatt 2400 ttatgttcca tggtttatta aagctcctat ccctgccata gcaccttctt tagatctaaa 2460 agccattaat gaaatgatac aatattctga attatgtctt aaaccagcag aggccgtact 2520 aaagtcactt gaaaaacaca attggtattt aaatgaaaga tttgtggtta tgtgtcttgc 2580 tgataaatgc ttacctgtag atcagctaca taaattagct cttaaattag ctcaaacagc 2640 aaagcctacc agttataaaa tgggaaaact aagttcagaa atttttactg acaatgaaaa 2700 aaaaataaca aaaagaattg aagattttat tggtcctgaa agctggtttt tctttgattt 2760 acttaaaatg acattacatg agacagaatg gctgaaaatc agttcatcaa actgggaaac 2820 atgttcaggt tatataagat ttaaaaatta tatcactggt cttcttgttg taaatgacag 2880 tgcagaacgt gcaattaaac ttggtcaaga ctttattgaa actttccgaa acgaagaaga 2940 taaccaagct aatttactgg ttgttgcaga tcatcgtcta aaatttcaga caacagaaaa 3000 aaaatcttgg ttgaaagctt taaaataact tttctttaaa aaataaatac aaatgtaaaa 3060 aaaaatttta tttaaaattg taagaaaaaa attttactcc ccccccccac ctcccaaata 3120 tacacagcat gtctttaaat gggtagggct gaccatttca tttcattttt tgcaataaaa 3180 caaaatataa tcttaaataa gatttatcag gctacaaaaa taaccttaaa gatatctttg 3240 aaatatttgt aagtgaccca gcccccctca tta 3273 // ID Mariner-32_SM repbase; DNA; INV; 2346 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-32_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2346 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1881-1881 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 270..1265 FT /product="Mariner-32_SM_1p" FT /translation="MCCYTICDNILFYSSYPFMLPSKSYSKRSKISIEKKK FT EIVKKHEMGICTTNISRDFNLPKSTVSTILKNKDLIKVADVAKGVNVLTKQ FT RPQIIEEVEKLLLIWINQKQLCGDSISELIIREKANQIYKDLFKQISEIND FT IDNGLFKASRGWFEKFRKRSGIHSVVRHGEASSSDLQAAEIFKKEFIEFIA FT NEGYVSQQVFNCDETGLFWKRMPNRTFITQEEKSMPGHKPMKERLTLLLCS FT NASGDFKLKPMLVYHSENPRVFKKKNVIKSKLPVVWKSNTKAWMTKLFLWI FT GCVIYLHLVCENICRKINYQLSVFLSWIMHLHILVILKLI" FT CDS 1139..2053 FT /product="Mariner-32_SM_2p" FT /translation="MDWLRDIFAPGVRKYLQENKLPIKCLLIMDNAPSHPC FT NIEVDLIDELNFIKVMYLPPNTTPLIQPMDQEVIANFKKLYMKELFSKCFE FT ITNDTDLTLREFWKEHFNILHCINFIDKAWIGVTCRTLNKAWEKLWPDCLI FT EICVGPYEVMSEVDAVKDIVSMAQNMGLEVNNVDVEEIVLDSDENYTTEDL FT QMLLCEKQGVFKKDMLFEAEQEEYITSATIKDICNKWTEVLKFVEKYHPNK FT FDVNRAINIMNDNAISYFQETLKRRKIQTTLDTFLTKISEEKEPKNQRKTT FT DEINVEEKYPFQQ" XX SQ Sequence 2346 BP; 882 A; 319 C; 408 G; 737 T; 0 other; cagtggtacc tcggtttacg aacaaaagtt tttgaaaaat cttaaccaat tttgtctcga 60 tttacgaaca tgcttcggtt tacgaacatc aaaatataaa atgacaaagc gtagatattt 120 aaaattttta gtgtttaaat aaacaatttt aatatgaaat attttaaaac atattacata 180 actttaaata aatacatgtt gtgatattta ttgtaattaa tagttaatct taataattta 240 atacatggtt tatcttaata atttaattaa tgtgttgtta tactatttgt gataatattt 300 tattttatag ttcttatccc ttcatgctac catcaaaatc ttattcaaag cgcagtaaaa 360 tatcaattga aaaaaagaaa gaaattgtga agaaacatga gatgggcatt tgtacaacca 420 atatttcaag agattttaat ttaccgaaat caacggttag cacaatttta aaaaacaaag 480 atttaattaa agtagccgat gttgcaaagg gagtgaatgt cttgacaaaa caaagaccgc 540 aaataataga ggaggtagag aaattattat taatttggat aaatcaaaaa cagctttgtg 600 gtgatagtat tagtgaattg ataatacgtg aaaaagctaa ccaaatatat aaagatttat 660 ttaagcagat ctccgaaatc aatgacattg ataatggact ttttaaggct agtaggggat 720 ggttcgaaaa gtttcgtaaa agaagtggta ttcatagtgt ggttagacat ggagaagctt 780 cgagttcaga tctccaagcc gcggaaatat ttaaaaaaga gtttatagag tttattgcaa 840 atgagggtta tgtttcacaa caagttttta attgtgatga gactggactt ttttggaagc 900 gaatgcccaa cagaaccttt attacacagg aagaaaaatc aatgcctgga cataagccga 960 tgaaagaaag gctaactctt ttactatgta gcaatgcaag cggagatttt aaattgaagc 1020 cgatgctagt ataccattca gaaaatccac gagtttttaa aaagaaaaac gtgattaagt 1080 ctaaattgcc tgtggtttgg aaatcaaaca ctaaagcttg gatgacaaag ctttttttat 1140 ggattggttg cgtgatatat ttgcacctgg tgtgcgaaaa tatttgcagg aaaataaatt 1200 accaattaag tgtcttctta tcatggataa tgcaccttca catccttgta atattgaagt 1260 tgatttgatt gatgaactta acttcataaa agtcatgtat ctcccgccta atacaacccc 1320 tttgattcag cctatggatc aagaagtgat tgcaaatttt aaaaaattgt acatgaaaga 1380 attattttca aaatgttttg aaattaccaa tgatactgac ctaactctaa gggaattttg 1440 gaaggaacat tttaatatcc ttcattgcat aaattttatt gataaagcat ggattggtgt 1500 tacttgtcgt accttgaaca aggcgtggga aaaactttgg ccggattgtt tgatagaaat 1560 atgtgttggc ccttatgaag tcatgtccga agttgatgca gtaaaagata tcgtgtcaat 1620 ggcccaaaac atgggtttag aagtaaacaa tgttgatgta gaggagatag tattagactc 1680 agacgaaaat tataccacag aagaccttca aatgcttttg tgtgagaagc agggtgtgtt 1740 taaaaaggat atgttatttg aggctgaaca agaagaatat atcacaagtg ctaccatcaa 1800 agacatttgc aataagtgga ccgaagtact taaattcgtt gaaaagtatc atccaaacaa 1860 atttgatgta aaccgagcaa ttaatattat gaatgataat gcaatatcat attttcaaga 1920 gactctaaaa agaagaaaaa tacagactac tctcgataca tttttgacga aaatctcaga 1980 agaaaaagaa cccaagaatc aaagaaaaac cactgatgaa attaatgttg aagaaaagta 2040 ccctttccag cagtaaattg ttttgaccta aataaaactg tttcttttta aaacctatga 2100 tttataaaaa tatataacat tacgatattt tttacacact aaattaaaaa taatttatag 2160 caaaacctcg gtttgcgaac gctaaagttt aaaaaaattc ggtttacgaa caaaaaatta 2220 tatattggaa cgaattaatt gcatttccat acaatttcta ttgtaattga tgctccggtt 2280 aacgaacatt tcggtttacg aacacaatct tggaacggat taagttcgta aaccgaggta 2340 ccactg 2346 // ID R1C_NLo repbase; DNA; INV; 6413 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia DE longicornis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1C_NLo. XX OS Nasonia longicornis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6413 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 515..2368 FT /product="R1C_NLo_1p" FT /translation="GRTASSDRRSSQVEKSLPYGRPGELCAGHRRRDGVRC FT GCPGSAWGRGRKKQMDTIKTQLRKMTRSSKGVGCSREDFLKKDSKGGGGDS FT EDTRSSVYSADSLRERSRSRSPTGLKSTKDFELVESKAARKQRKAEEKRRR FT RENANASVNDRMIKSVNCSVEDVSKKVNVSRSENDVSKVGATECMEVGMEE FT CVSMKESANVKRTVKRNAVGQDRFKGAKGELFVVCENLRQPLKRIKREVAD FT AEVPVVSTSECVNESAEISEVAVDELRAVTDGLRGHLLSDANKFTKWQASS FT VLDHASKFEGLVQRLMLENARLRGELTAHKSMKAELAKVSETVRRVDEGMN FT VVKTRVAAASRAPPAVAGAAPGRGLGANAGPKPSFALVVRGANEQLTCDEV FT RRRMIESTSEDVNVRVRTIRPARGGGVVVETASDGERKALSRCAGLAEAGL FT RAAEPKVMDPRVIVYGVPNEMTNEHLLRGLYEKSLREHVSVNEFTKRVKIV FT RRVDGQRLGNVIVELPLPWRDRLLQDGRVFVGWNSFKCCPYERIMCCFRCQ FT GYDHRAKECKSEPLCYKCGKSGHRMDACKAAEDCSNCRARKLPSEHLARSL FT QCPMYAWRLQLLRSRFVNNG" FT CDS 2361..5558 FT /product="R1C_NLo_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TMAESQTNGSDVCVNVRILQVNCQKSYAAMCDIANYA FT LEDGIEICLFQEPYVYKDRVCGLPAGSRMYLSKSGEAAVVVFGKRYECMLL FT NEGAHEDAVCVWVKGPVGEILVVSLYCRPNGSMQGCVDYLDRVVSTRNGRR FT LLVGMDANAASDLWHSKSMVRAWQAVRRGAVLRDWVVHAEMDVLNVPTLAY FT TFSGARGESDIDVTLYKGSECQFEWMLKDDWGISDHNPIVITMSTGENVDV FT NGERMHKWNARKCNWLLYRGLIETFASDYGYDEYSVLGAEEKLTLLYKWMT FT EANEVCMEKVVSRPAPKRKSVVWWNECLSEKKRMVRERRRAYQRERARTGD FT PDRMKWREWKECEREYRRMMKDAKESDWHGTVERKGETDPWGVISAFCMGK FT LNPVSLAGLRTANGCTKTWMESARVLLDEFFPADDGIPAEEVYGVQMDMNE FT FCMGELDEAVLGMKMRKAPGLDGLTNEMLRQVWKAAPLFLKGLFDTCLSEG FT LFPHKWKEARVVVLLKGADKDVAESRSYRPISLLGSPGKVMERMLVARLMR FT HMEGKWNECQYGFMKGKCTEDAWARAKENVRAAESEYVLGIFVDFKGAFDN FT LLWRVALQKLREAGCTYEELRVWHSYFSDRSVCMYNGMDVVEKCARRGCPQ FT GSISGPPVWNLGMNDLLNELSGLGVEVVAYADDLLLLVQGNRRNELEQSAS FT EALSVVYRYGMNIGVEVSDSKTVCMMLKGSLNMLNRVVHVSANGTDDKKIR FT CVDRVKYLGVNVGVGMDFSVHIDGMRKRVTTVIMRLRRVLRKSWGLKRGVV FT SMMVNGLILPAVMYGASVWYEQLHKRKLRGSRRLSEELVSCQRVVLYACTR FT VCRTVSTEAMQILFGSLPWDIECIRRADLHKLRKVLPMNENDLVADEDLNE FT LSLHECRELVDQRALAAWQDRWEATSNGRVTYEWIRDVGFSGRSMKYFEPS FT LRVCYILTGHGSLNSFLFSRNLSNSPACACGAEREDWIHVLCECDMYAAFR FT DLDSIGVRRTEEGWDVSGVLCDRAKYECLCAFVERAFSMRESIVQRMRERE FT RERKRTVRM" XX SQ Sequence 6413 BP; 1477 A; 1234 C; 2249 G; 1453 T; 0 other; ccccctcacg atcactggct tgtcgcgccg agggggcccg agttcaggag tttgcgaaag 60 cgaattcctg ttccgctaaa gtgatccaaa cccaggaagt ggtggatgat gtggtgcttc 120 ggctagccga ccccactctc aggaaccggt ccgcgaaagc ggaacggaga ccctgagaag 180 tgtgggaacg cgaactggag cgcctgatcg gaagccaccg ggcgtgacga ccccggggtt 240 cctagtccca agcgatgagc gccgtcgctt ggcccgtaaa ccttgcaccg tcttcaacaa 300 ccttcgcagg gcgtggcgtt ggtacgttgg cggtgtgggg agcgggggct tggcttaacc 360 gcccggccgc gaggtggggt cctctataaa accccccaat cctacgctta ggtgggcggc 420 tacgggatgc atggctcctg gagaggagtc ccgagctcac caaccccggc tggctgtggc 480 ggcttgttcg ggttgcgttg ccttctcgag gtaagggcgc actgcttcga gtgaccgtcg 540 ttcgtcccaa gtagagaagt cccttcctta tggcaggccc ggtgagctat gcgctgggca 600 ccgtcgtaga gacggtgtcc ggtgcgggtg ccctgggtct gcttggggga ggggacgtaa 660 gaaacagatg gatacaatta aaacacagct tcggaaaatg acgcgtagta gtaagggggt 720 aggttgtagt agggaggact tccttaagaa ggactctaag ggcgggggtg gagatagtga 780 ggacacgcgc tcctccgtat actctgccga cagcctgcgc gagcgttcac gctcccggag 840 tccaacgggc ctgaaatcca ctaaggattt cgagctcgtt gagtcaaagg cggctagaaa 900 gcagcgcaag gctgaggaga agaggaggag gcgtgagaat gcaaatgcga gtgtgaatga 960 tagaatgatt aagagtgtga attgcagcgt ggaggatgta agcaagaaag tgaatgtgag 1020 tagaagtgaa aatgacgtga gcaaggttgg agcgacggag tgtatggaag tcggtatgga 1080 agagtgtgtg agtatgaaag agagtgcgaa tgtaaagagg acggtgaaga ggaatgcggt 1140 gggccaggat aggttcaagg gggcaaaggg cgagctcttt gtggtgtgcg agaatttgcg 1200 gcagccgctt aagcgcatca agcgcgaggt ggctgacgca gaggtgcctg tggtcagtac 1260 gagtgagtgt gtgaatgaga gcgctgagat ttcggaggta gcggtcgacg agctgcgtgc 1320 cgtcactgac ggacttcgtg ggcatctgct ttcagatgcc aacaagttca cgaagtggca 1380 ggccagcagt gtgctcgacc atgcctcgaa attcgaaggg ctggtgcaac gtctgatgtt 1440 ggagaatgcg aggttgcgtg gtgagcttac tgctcacaaa agtatgaagg ctgagttggc 1500 gaaagtgagt gagactgttc gacgagtgga tgagggtatg aatgttgtaa agacgagggt 1560 agcagcggcg tcacgagcac ctccggcagt ggctggagct gctcctggta ggggtttggg 1620 tgcgaatgcg gggcctaagc ccagcttcgc gctcgttgta cgtggcgcaa atgagcagct 1680 cacgtgcgac gaggtgcgaa gaaggatgat tgagagcacg agtgaagacg tgaatgtgag 1740 ggtgagaacc atcagacctg cgcgtggtgg tggggtcgta gtggagacgg ctagcgatgg 1800 tgagagaaag gctctctccc gttgtgccgg actcgccgag gcgggactcc gtgcggcgga 1860 gcccaaagtg atggatcctc gagtgattgt gtacggtgtc ccgaatgaga tgacgaatga 1920 gcatctcctt aggggcctgt acgagaaaag tttgcgtgag catgtcagtg tgaatgagtt 1980 cacgaagcgt gtgaagatcg tcaggagagt ggatgggcag cgactcggca atgtgattgt 2040 cgagttaccc ctgccatggc gtgataggct gttgcaagat ggtagagtgt ttgttggatg 2100 gaacagcttt aaatgctgtc cgtatgaaag aataatgtgc tgtttccgct gccagggcta 2160 cgaccatcgt gccaaggaat gtaagagtga gcctctgtgc tacaaatgtg gcaagagtgg 2220 tcacaggatg gatgcctgta aggctgcaga ggactgcagc aattgcaggg caaggaagct 2280 tccctcggag catttggcga gatcgctgca atgcccgatg tacgcttgga gactgcagtt 2340 gttgcgttct cgtttcgtga acaatggctg agtcccaaac aaatggatca gatgtgtgtg 2400 tgaatgtgag gatcttgcag gtaaactgcc aaaagtctta cgctgcgatg tgcgatattg 2460 caaactacgc gcttgaggac ggcatagaga tatgcctatt ccaagagccg tatgtttata 2520 aagatagggt ttgtggttta ccagcgggat ccagaatgta tctcagtaag tctggagaag 2580 cggctgtagt agtgtttggg aaacggtatg aatgcatgtt gttaaatgag ggagcacatg 2640 aggatgccgt atgtgtctgg gtgaaagggc cggtggggga gatacttgtt gtctctctct 2700 actgtagacc gaatggtagt atgcaaggat gtgttgacta ccttgatagg gtagttagca 2760 ctaggaatgg acgtcggttg cttgtaggaa tggatgcgaa tgctgcgtcc gacctttggc 2820 acagtaagtc catggtgcgg gcgtggcaag cggtgcgtcg gggtgctgtg ttgcgtgatt 2880 gggtagtgca tgcggaaatg gatgtcttaa atgtccctac cctggcttac accttcagtg 2940 gagccagggg ggagagtgac attgatgtca ctctctacaa gggtagtgag tgtcagtttg 3000 aatggatgct gaaggatgac tggggcatta gtgatcacaa tcctattgtg atcacgatgt 3060 ctacgggaga aaatgtagat gtaaatggcg agaggatgca taagtggaat gcaaggaagt 3120 gcaattggct gctgtaccgg ggccttatcg agacctttgc cagcgactat gggtacgacg 3180 agtactctgt gttaggggca gaggaaaagt taacgctcct gtacaagtgg atgactgagg 3240 cgaatgaggt gtgcatggag aaggtcgttt cacgacctgc tcctaagcgc aagagtgtgg 3300 tgtggtggaa tgaatgttta agtgagaaga agcgaatggt gcgtgaacgg cgaagagcgt 3360 atcagcgtga aagagcaaga acgggtgatc cggatcgtat gaaatggcga gaatggaagg 3420 agtgtgaaag agagtatagg cgaatgatga aggatgcaaa ggagagtgat tggcatggca 3480 cggtggagcg gaagggggaa actgacccat ggggtgtcat ctcggcattt tgcatgggga 3540 agttaaaccc tgtaagcctg gctgggctgc ggacggcaaa tgggtgcacg aaaacatgga 3600 tggagagtgc aagagttctt ctggacgaat tcttccccgc agacgatgga attcctgcgg 3660 aggaggtcta tggagtccag atggacatga atgagttttg tatgggtgag ttagacgagg 3720 cagtcttggg catgaaaatg cgcaaggctc ctggattgga tgggttgacg aatgagatgt 3780 tgcgccaggt gtggaaagca gcccctttat ttcttaaggg gctgtttgac acgtgtctga 3840 gtgagggact ctttccacac aagtggaagg aggccagagt ggtcgttctc ctgaaaggag 3900 ccgacaagga tgtggccgag tctaggtcct acaggcctat cagcctgttg ggtagcccgg 3960 gcaaggtcat ggagcgaatg ttggttgcgc gcttgatgag gcacatggag ggcaaatgga 4020 atgagtgtca gtatggtttt atgaaaggga aatgtacgga ggatgcctgg gcgagagcga 4080 aagagaatgt aagggcggct gagagtgagt atgtccttgg aatctttgtg gatttcaagg 4140 gtgcgtttga caacttactg tggagagtag ctctacagaa gttgagagag gctggatgta 4200 cgtatgagga actgcgtgtg tggcactcct attttagtga taggagtgtc tgtatgtata 4260 atgggatgga tgtggttgag aaatgtgcgc gaagaggttg cccgcaggga tccatatcag 4320 gacctcctgt gtggaacctc ggaatgaatg acttgttgaa tgagttgtcc ggactggggg 4380 tggaggtcgt cgcgtatgct gatgacctcc tgctgctagt tcagggcaac aggaggaatg 4440 aactggagca gtcggcgtct gaggcactga gtgtggttta caggtacggt atgaatattg 4500 gtgtggaagt gtctgattct aagacagtgt gcatgatgct gaaaggtagt ctgaatatgc 4560 tgaatcgtgt ggtgcatgtg tcagcgaatg ggacggatga caagaagatt aggtgtgtag 4620 accgtgtgaa gtacctgggt gtgaatgtgg gcgtcggtat ggacttttcg gtccatatcg 4680 atggaatgag aaagagggtc actacggtga ttatgcgcct caggagagtc ctcagaaaga 4740 gctggggact caagcggggc gtagtgagta tgatggtgaa tggcctcatt ctgccggcgg 4800 ttatgtatgg agcgagtgtt tggtacgaac agctgcataa aaggaagttg cgtggatctc 4860 ggagactgag tgaggaactc gtcagctgcc agagagtggt gttgtatgcg tgcacgcgtg 4920 tgtgtagaac tgtctcaacg gaggcgatgc agattttatt tgggtcgctt ccgtgggaca 4980 tcgagtgtat caggcgggcg gatttgcaca agttgcgaaa agtcctgccc atgaatgaga 5040 atgacctggt ggctgacgag gacctgaatg aattgtcctt gcatgaatgc cgtgagttgg 5100 tggaccaacg tgctcttgca gcttggcagg accgttggga agccacgagt aacgggcgtg 5160 tgacgtatga atggatacgg gatgtgggat tctccggccg ctcgatgaaa tatttcgagc 5220 cgagcctgag ggtctgctac attctgacgg gccacgggag cttgaactcg tttctcttct 5280 cgagaaacct gagcaactcc ccggcctgcg cgtgtggagc tgagagagag gattggatac 5340 atgtgctgtg tgaatgtgat atgtatgcgg ccttcaggga tcttgactcc attggggtca 5400 ggagaactga ggaaggatgg gacgtgagcg gagtgctttg cgaccgtgcg aagtatgagt 5460 gtctgtgtgc ctttgtcgag cgcgcattca gtatgcgtga gtcgattgtg cagagaatga 5520 gagagagaga gagagagagg aagagaacgg tgagaatgta gatcttagat tagggtaatg 5580 gatgaggggg tgcgggggtg agggggctag tgggtagata taaggggtag tgggaagggg 5640 ggtagggtaa gggaacaggt gggttggggg gtagggtaac tgcgtgtgtg tgtgttcgga 5700 gtgagtgtgt taaacggagt gtggaaatga atatgtgtgt gtcggctggc cagctaccgc 5760 tggccggctc cacgcagggg gattccactg ttgctcttct aatcgaggct ggcctgtgcc 5820 ggactcgttg gaggaccaag agaggcatct ctgggttttg cgaacccacg gacccgagca 5880 gcccttccag aggcgggatg gtaagatccc aactggaacc ctcaccaggg ttaaaacggt 5940 accatgggcg accggggtgc ccgctggggg agaattgcct ccctcgcccg gtcacttggg 6000 tttggattcg tggtggcagt ggttgaaagc ccacatcgct tggggttagg ggttggcact 6060 gggtgaaaga ctccttgggt gctccgcacc atcggagact ggaacccttt gccgacctcg 6120 acgtgtgagt tgcggtctca actcggggag cggcctgctt aaaccgttag ggattggatg 6180 ggtcccggcc ccaaccgagg gtctccaaag gtcttaccaa cctgcggagg aatcggtagt 6240 cgcggtttag tagagggccc aattggctgg caatgtctcg gcattgccgt ctaattggtc 6300 tcaaagctac tccgcgatgc gttggccgag cgtatctcgg cccctcgccc cgtggggggc 6360 cgtgtgggta ggccgaaagg caggtactgc acgttaaaac aaagagacga tct 6413 // ID Gypsy-2_AC-I repbase; DNA; INV; 4786 BP. XX AC AASC02001731; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_AC_; KW Gypsy-2_AC-LTR; Gypsy-2_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4786 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02001731; Positions 5630 10415. XX CC Positions [3683-4111] - Integrase core CC 'ATACC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(357..2762,2766..3707) FT /product="Gypsy-2_AC-I_1p" FT /translation="MRPLPQCNCQCASEPLGDLILKADPNIQSKSVKEVKS FT VMREFAVIPVTIGVRRAELMQLRQAPDEQFRTFAARVKGKAETCMFTTIVK FT CTYNEVVQADYTIETVRDVLLAGIADLDIRRKALSTRDIQKKSVNDVIAFV FT EGREMARNATPSTSMSALSAFKKRQSYENDPTKRPPPLDTNRSNDLTKGPS FT PPDTNKPVPCPDCGKKFNVFKQRANGSWNTRPHTKCINCWRASRRREEPTS FT LSSLSTEPAVSQIGAVALHREVISKSDLKKSRSSCHPRAEFCITMAGLKKN FT TNPPEALVRGIVDSGAMSNLWGLLKNFTDSGFSKSDLLPVNMDMRAANKNP FT INIVGAFNALVKGKSPSGEIIASKSLVYISDCVNDFFLSYDTMLDLGIINK FT SFPTIGACMNKCFGSTANKPEKKEHQQHVRSINSGCTGEEENEYNCKCPQR FT ESIPQRPTSLPFAPIPENNEKMRDWLLERYSKSTFNTCPHRPLPCMTGPPV FT EIHVEETAKPKACHTAAPIPLHWQEQVHKDLVRDEALGVIEKVPFGEPVTW FT CHRMVVTRKHDGSPRRTVDLSPLNKFCKRETFSAVAPFPLARLITGKTRKS FT DTDAWNGYHSVPLREDDRHLTTFITPFGRWRYTCAPQGFLSSGDGYNRRFD FT AILSNFERKERCVDDTCHYDDNLDAHWWRTIDLLSTCGAAGIVLNPDKFQF FT AQREVDFAGFRTSEDKILPLPKYLDAIKSFPTPTSTTDIRSRFGLINQVAN FT YAQLRDALKLFRPFLSPKYKFLWSPILDRAFEESKEVIIDAIHKGVEIFDI FT RPTCLRPDWSKGGIGYVLLQKHCTCESAIPDCCAEGWRITLAGSRFLNGAE FT QRYAAIEGEALAIAWGLEQTRYFTQGCQDLLIVTDHKPLKKIFGDRTLDEI FT SNTLLFRLKQRTLPWHFHIEQMPGVTNSVADAMSRYPSPHANNTLSIADMS FT ESMIAASISNDLVDVTTISWDRIVAETQKDSVLSALLSALISSNDREWEDR FT PEFSESLYVLDGVILYRDRVVIPSSLRKTITEGLHAAHQGVSSMEIRAQSI FT VFWPGMTKDIQHVRARCTDCNKNAPSQAELPSNPAHPPSTPFEKVFADFFD FT YMEATTIW" XX SQ Sequence 4786 BP; 1366 A; 1251 C; 1142 G; 1027 T; 0 other; tgatgatttc atcgctttga gagcagccac actagtgttc ttcatgacgg tgtcagcgaa 60 aaagtttgaa agcaattgat ctcccaggaa aactaaccca ccagtcttca acccacaact 120 ccgtaacatg gtcatatgcc ctttcccagg ctgtaactat gaaaccggtg aaggagacgc 180 cccggtggtg gcggcactgc tcaatataca tgccctcaca cacacccaaa ctgcccaccc 240 tcctgcacca cctcaaccaa aactcgacgg tcccaaaatt gacttagggg tagaggagga 300 ggtatggaac gggttcgtaa ggagatggga agctttcaaa ataggatcgg gcataaatga 360 ggccactgcc tcaatgcaac tgtcaatgtg cgtcagaacc cctgggcgat ctgatactga 420 aggcagaccc taacatccag tcgaaatcag tcaaggaggt caagtcagta atgcgggaat 480 ttgctgtgat accagtcacc attggtgtga gaagggcaga actcatgcag ctacgccagg 540 cacccgatga acaatttcgc actttcgctg ccagggtaaa gggcaaggcc gagacctgca 600 tgttcacaac catcgtcaaa tgcacataca atgaagttgt ccaagccgac tacacaattg 660 agactgtcag ggacgtcctc ctggcaggaa tcgcagacct cgacattcgc agaaaggccc 720 ttagtacacg ggacattcag aagaaaagcg tcaacgacgt catcgctttc gttgaaggcc 780 gcgagatggc tagaaatgcc acgcctagca catccatgtc agcactctca gcgttcaaga 840 agagacagtc gtatgaaaat gaccccacca agaggccccc tcctctagat accaacagat 900 ctaatgatct cactaagggg ccttctcctc cagacacaaa caaaccggtg ccatgccctg 960 actgtgggaa gaaattcaat gttttcaagc aaagggcaaa cggctcctgg aacactagac 1020 cccacacgaa atgcatcaat tgctggcgag ctagtcgtcg cagagaagag cccacatctc 1080 tcagctcctt gtcaactgag ccggctgtgt cccagattgg tgcggtggct ctgcatcgcg 1140 aagtgatctc caaaagtgac ctcaaaaagt ctaggagctc atgccatcct cgtgcagagt 1200 tctgcatcac aatggccggg ctcaagaaga acacaaaccc gcctgaagcc ctcgtgagag 1260 gtatcgtgga cagcggggcc atgtcaaacc tgtggggtct actaaagaac ttcactgaca 1320 gtggcttttc aaaatctgac cttctgcctg tgaatatgga catgagggcc gctaataaaa 1380 accccataaa cattgttgga gcctttaatg cattagttaa gggtaaatca cccagtggtg 1440 aaattattgc tagcaaaagc ttggtgtaca tatcagactg tgtgaatgat ttcttcttgt 1500 catatgacac tatgcttgat ttaggcatca ttaataaatc ttttcctaca attggtgcat 1560 gcatgaacaa atgttttggg tccacagcaa acaaaccaga aaagaaggag catcaacagc 1620 atgttaggtc aattaattct ggttgcactg gtgaggaaga aaatgagtac aattgcaaat 1680 gccctcagcg agagtccata ccccaaagac ccacttcact gccttttgct cctataccgg 1740 agaacaatga gaagatgcgt gattggctgc tggagaggta cagtaaatcc accttcaaca 1800 catgccctca caggcccttg ccttgcatga caggaccccc tgtagaaata catgttgagg 1860 agactgccaa acccaaagcc tgccacactg cagcacccat accccttcac tggcaagagc 1920 aggtccataa agacctggtg cgggatgaag ctttgggggt aatagagaag gtccccttcg 1980 gagagccagt tacttggtgt caccgcatgg tagtcacaag gaaacacgat ggcagcccaa 2040 gaagaacagt cgatctctcc cccctcaata aattctgcaa acgagaaacc ttttcagcag 2100 tagccccttt tccccttgcg aggctcatca ctggtaaaac aaggaagtct gacactgatg 2160 cttggaatgg ctaccacagt gtgccactaa gagaagatga cagacacctc actactttca 2220 taaccccatt tgggagatgg aggtacactt gcgcaccaca gggtttcctt tcttcagggg 2280 atggatacaa cagacggttt gatgcgatcc tgtcaaactt cgagagaaag gagcgatgcg 2340 tagatgacac atgtcactat gacgacaacc ttgatgcaca ttggtggagg accattgatc 2400 tcttatccac atgtggagca gctggcatag tcctcaaccc agacaagttc cagtttgcac 2460 aaagagaggt tgacttcgcc ggattcagga catcagaaga caaaatcctc ccgctgccaa 2520 agtaccttga tgcaatcaag tcctttccca cccctacaag cacaacggat atccgcagtc 2580 ggtttgggct gatcaatcag gttgccaatt acgcccagct tagagatgca ctgaagctct 2640 tccgaccttt tctaagtccg aaatacaagt ttttgtggtc ccctatcctg gacagggcat 2700 ttgaggaatc caaggaagtg atcatcgatg caatccacaa aggtgtggaa atttttgaca 2760 tttaaagacc cacatgcctt cgcccagact ggtcaaaggg aggtatcggc tacgtccttc 2820 tacagaaaca ttgcacctgc gagtctgcca taccagactg ttgcgctgag ggctggagaa 2880 tcaccctagc aggatcacgc ttcttgaatg gtgctgaaca acgttatgca gcaatagaag 2940 gggaagcgct agcaatagca tggggtctgg agcagaccag gtatttcacc caaggctgcc 3000 aggacttgtt gatagtcaca gaccacaaac ccttgaagaa aatctttggg gacaggacat 3060 tggatgagat atcaaacaca cttctctttc gtttgaagca gcgaacgtta ccgtggcact 3120 tccacatcga gcaaatgcca ggggtcacaa acagcgttgc tgatgcaatg tcaagatacc 3180 cctcaccgca cgcaaacaac acactctcca ttgcagacat gtcagaaagc atgatcgcag 3240 catccattag caatgatttg gtagatgtga ccaccatatc gtgggaccgt atagtcgcag 3300 aaacccaaaa ggacagcgta ctctcagctc tactcagtgc ccttatatca tcaaatgatc 3360 gagaatggga agacaggcca gaatttagtg agtccctata tgtcttggat ggtgtcatcc 3420 tctacagaga tagagttgtc ataccatcgt ccctaaggaa gaccattact gagggccttc 3480 acgctgctca ccaaggagtc tcatcaatgg agatacgggc tcagtccatt gtgttctggc 3540 caggaatgac taaggacata cagcatgttc gagccagatg cacggattgc aacaagaatg 3600 caccgtctca agccgaactt ccttcaaacc ctgcccaccc accttcaacc ccattcgaga 3660 aagtctttgc agacttcttt gactatatgg aggccaccac tatctggtga tcggtgaccg 3720 tctgtctggg tggcccgagg tttattccac ccccacagga agtgcgcatg ctggagcaag 3780 aggccttata gcctgtctac ggaaattctt cgccacattt ggagtaccag aggagttgtc 3840 atcagacggc ggccctgaat tgactgccag ccagacaaaa aatttcctgt ccacatgggg 3900 tgtacaccat aagaaatcgt ccgcatacca cccacaactc aatgggcgtg ctgaggtagc 3960 tgtcaaatct gccaagcgcc ttctcagatc caacatcaac ccctccggaa gcctcgacag 4020 cgacaaattc ttgaaagccc ttctgcaact taggaacacc ccagaccctg attgcaagct 4080 ttcacctgcc gagattgtct tcaggcgtcc gatcagagac atgttctttt ttgtgaaaag 4140 gcttgaaaag ttatgtaata aagccatata acccatttgg agagaagtat gggcagccaa 4200 ggaaactgcg cttcggacaa ggttcgtgaa aacatctgag aagctcaacg agcactcaag 4260 aaacctgccc aagatgtcca ttggggacag atgttttgtc cagaaccaga ctggcaatag 4320 ccctaaacga tgggaccgaa caggattggt tgtcgaaacc ggccccaatg accaatacgt 4380 ggtcaagatt gatggctcag gacggctgac gtcaagaaac cggcgcttcc ttcgccagtt 4440 ccaacctgcc acttcgacca tacaaccggc accggcacgt aacagtccag atgacttcga 4500 ggttactcgg ccttcggagg aggagatcca tacgggacat gagtttgcga ctgagccaat 4560 caatgaggag caaaggattg ccccaagccc tttccctcag ccccaacccc ccagaggcac 4620 cccaggaggc aaagaaaagt cttcctacag ccctgaaacg ccttctccca cacaataaag 4680 aggggttaca ggaagcggtc aaaccacagg agcagggggg gggggggaga aaactaaggg 4740 gggggggggc taggaaagat ctgaacatgc catgaacttg ggggga 4786 // ID L1-30_AAe repbase; DNA; INV; 5537 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-30_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5537 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1383-1383 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 2326..5499 FT /product="L1-30_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MQMFSYKIATINVCNISNQTKLDALRSFIFSSELDII FT FLQEVQSEQFSMPGYTVIFNVDADQRGTAIVVRDQFNVHNIEKSLDNRIIS FT LRIGPITFLNIYAPSGAAQRTAREDFFNSSVAHYLHHSTESVVLGGDFNAV FT VNPKDATGCSNISPMCKRLMNSANLVDTWEALNGQRIEFSFIRSNTASRID FT RLLISNTMKFQLRTAHFAATSFSDHKAYIVRIVMPHLGTPSGRGMWRLQPH FT ILDDSEILNELSRKWVYWVRARSNYSSWVEWWMLHTKPKIISFLKWKTSTV FT YREFRDAIELYRRFLKEAYDNYHGNPQQVGEIHHIKAQMLRLQRNFSMNLK FT KCNETYISGESTTLFHVAQVHGKRSKTSIPTLLTENGIEIHDQSQIKESIR FT AYFENLYSSVDDEPSIDFRPSKSIPQNNTANENLMREITQDEILXGIKGSS FT SRKSPGADGLPKEFFLKTWRIICVEFTNVINDVLQGKAMKDFFNGILVLVK FT KKGSDKSARGYRPISLLNFDYKVLARILKQRMNCLLPLVLSENQKCSNGKR FT TIFEATSRIYDKICQLKHFRRSSLLVSFDLDHAFDRVSHTFLKQTMDQMNF FT NPSFINLLSTIWNQSYSKILVNGHLTQEIKITRSVRQGDPLSMHLFVLYLQ FT PLLDTISRKFPNAMMNAYADDISMFLDDEQAVMQVVTIFNDYGLASGAKLN FT KQKTVAVMIGSVNLTEDTEWLNIETFVNILGVRYGDNIKQAQKLNWQSVLN FT GLRTRLWFHHPRKLNLIQKIILINTYINSKIWYMASNIPLTKGYATKLKAE FT IGKFIWHGQTLQRIAFANLILPKARGGLNLHCPETKAKALLLNRMLNLXHL FT LPFAASLLQNSNNTVPPMFNHISVLKTELAQLPDSLVSTPTSKGIYLHFLE FT MKPDPGFTSSSQRNWKIVFKHLHSKNLTSTQRANWFSIMHKKIKHRELLFQ FT RGVSDNPYCEICPGEVETTVHKLFRCQSTRGIWRYLRRQILLQESSLMRLE FT PEEFMYPCLRNINQASKDYIIKLLGIFFSYHIETPENLIGLGNFIFYLNIN FT K" FT CDS join(145..1206,1185..2318) FT /product="L1-30_AAe_1p" FT /translation="MDDYRKDSVYLDFSGMPIRPKLEKVHELISKKIQLDM FT SKVNCIQPSMTKARVIIELKSQAYVEELVSEHSSKHTVEQNNKEYAIPIVP FT YDNAIEVRIADLPSYFSTEMIAKHLAPYGEVVSTQEEVWKNFWPGLSTGVR FT LVRMRIRKPIPSYIPMTTHTAYITYRNQIRTCRYCVRPLHIGRTCNEARKE FT QGQDINSRLTAAQVVQGIIPPSSSNVQTNVPQPSSNTPSTEKEIDVQTSDE FT EMNDVTNTNLVTPPTDRVMRSSTKAQSRPSSLSRFPPACTQAAKPSRTSSP FT PSNKATSENQDPTADFVISDDSEEALNAPKRHRSQQPTENPETPFLEIKRK FT SRSRTHNSSSRDTQFFFPVSFIMGICLAEAQSDTATNLKKLSTLSFTILRY FT SLVWRHNLPSLADITNDEGFSYCEVSLSICPAEVDTATNLKQIPHFFTSTY FT RHRGAAPHNTQESQHWPTSSHFSASKTPQSKVNINNGQTTVSEGYSLCEVS FT PSICPTEANVDAAALENLTLPVATPFLCCTRRITWRSHNMEWLSHFTLSLD FT LIVRLLYCSPLLFCNLGSTSSEITYHQHKSLNELVNECLTITQKRKSHNFF FT NDGGNGNGSYTVSEGYIPSKVSPSICPTEANVDTAALEENSLLVETQPTTH FT PEIYYCEFTFEIQTTSSSLRVCIETNSAIHANQVSENNVQNKNIAMTMNPI FT PEQLICIVSVLVTIRDRERTQERSIRWDSSKY" XX SQ Sequence 5537 BP; 1831 A; 1241 C; 1080 G; 1382 T; 3 other; agttgacatt tagctttcgt gtcggtcaga cgttttcgag attatcgcta agtgaaaaca 60 atcgaggtga tagtgccggc ggacgaagtc ctaaaggctt tacgcaaaaa gtgttaggcc 120 ctgtgggggt ctggtcattc cactatggac gactaccgca aggactccgt ttacctggat 180 ttcagcggaa tgccaatccg ccccaagcta gagaaagttc acgagctcat cagtaagaaa 240 atccagctag acatgtcgaa agtcaattgc attcaaccca gcatgactaa agcccgagtc 300 ataatcgaac ttaagtccca agcatacgtg gaagagctag tatccgagca tagttcgaag 360 catactgttg agcagaacaa caaggagtat gcaataccaa tagttcccta cgacaacgca 420 attgaggtta ggatagctga tttgccgtcc tacttctcta cggaaatgat cgccaaacac 480 ctagctccat acggcgaggt tgtatcaaca caagaagagg tgtggaaaaa cttttggcct 540 ggtttgtcaa ccggcgtccg gctggtgcga atgaggattc gaaagccaat tccatcctac 600 ataccgatga ccacccacac agcgtatatt acttaccgca atcagattcg cacctgtagg 660 tattgcgtaa gaccgcttca catcggtcgt acctgtaacg aggcacgcaa agaacaaggc 720 caagacatta acagtcgtct tacggcagcc caagtggtgc aaggaataat cccaccatcg 780 tcatcgaacg tacaaacaaa tgtcccacag ccatcttcaa atacacctag caccgaaaag 840 gagattgatg tgcaaacgtc agatgaagag atgaatgacg taacgaacac aaatctcgtt 900 actccaccaa cagatcgcgt aatgagatcg tccacgaaag cccaaagcag accgtcatcc 960 ctttcccgtt ttccacctgc atgcactcaa gcagctaaac cgagcagaac atcatcacca 1020 ccaagtaaca aagcaacatc agaaaaccaa gaccccacag ctgacttcgt tatctctgat 1080 gacagcgagg aagcactaaa tgcaccaaaa cgacatcgat ctcaacaacc caccgaaaat 1140 cccgaaacac cttttctcga aataaagcgc aagagcagga gtaggacaca caattcttct 1200 tcccggtaag ctttataatg ggtatatgtc ttgccgaagc acaatccgat actgccacca 1260 atctcaagaa actttcaact ttatcattca caatattacg atattcactc gtttggagac 1320 acaacctacc atctttggcg gacatcacga acgacgaggg cttttcctac tgtgaggtaa 1380 gccttagtat atgtcctgcc gaagtagata ctgccacaaa tctcaagcaa attccacact 1440 tcttcacatc aacatatcgt caccgtggag ccgctccgca taatacccag gaatcacagc 1500 actggccaac atcatctcac ttttctgcat cgaaaacacc acaatcaaaa gtcaacatta 1560 ataacggaca aactactgtc agcgagggct attctctctg tgaggtaagc cctagtatat 1620 gtcctaccga agcaaatgta gatgctgccg ctctagaaaa tttaacactc cccgtagcaa 1680 cacccttttt atgctgtaca cgacggatca cttggagaag ccacaacatg gaatggctat 1740 cgcacttcac tttatcactg gatttgattg tcaggctact atactgctca ccactgctgt 1800 tctgcaacct tggaagtaca tcatcggaaa tcacctacca ccaacataaa tcactcaatg 1860 aactcgttaa tgaatgcctc acaatcacac aaaaaagaaa gtcacacaac tttttcaacg 1920 acggagggaa tggaaatgga tcatatactg tcagcgaggg ctacattcct agtaaggtaa 1980 gccctagtat atgtcctacc gaagctaatg tagatactgc cgcactagaa gaaaactcac 2040 ttcttgtaga aacacaaccg acgacacacc cggagatata ctactgcgag tttacattcg 2100 aaatccaaac gacgtcttca tcactgaggg tctgtatcga aactaatagt gccattcatg 2160 cgaatcaggt gagcgaaaac aatgtccaaa acaaaaacat tgcgatgacg atgaatccga 2220 ttcctgagca gctgatatgt atcgtgtcag tgttagtgac tatcagagac agggaacgta 2280 cacaagagcg ctctataagg tgggactcct caaaatatta gggtgatgca aatgttcagt 2340 tataaaattg caacaatcaa tgtttgcaac atttcgaatc agacgaaact agacgcatta 2400 agatcgttca tattttcatc tgagctagat attatttttt tacaagaagt gcaaagtgaa 2460 cagttcagca tgccaggata tacggttata ttcaatgttg atgccgacca gcgaggtaca 2520 gctatagtag tcagggatca gttcaatgta cataatattg aaaaaagcct agacaaccga 2580 ataatttcac taaggattgg gcctataaca tttttaaata tctatgctcc ctctggtgcg 2640 gcacagagaa cagcaaggga ggactttttc aattcttccg ttgcacacta tctccatcac 2700 tccacggaga gtgtggtgct tggtggtgat ttcaatgcag tggtcaatcc aaaagatgca 2760 actggctgca gtaatataag cccaatgtgc aagcgactaa tgaattcagc aaacttagtg 2820 gatacttggg aagcattaaa tggacagcgg atcgagttct catttatcag atcgaataca 2880 gcatcacgta tcgatcgatt gctaatttca aatacaatga aattccagct acggacagca 2940 cattttgcag ccacatcgtt ctcggatcac aaagcatata tcgtacgcat tgtgatgccc 3000 cacttaggaa cgccatcagg gcgtggcatg tggcgccttc aaccgcatat tctcgatgac 3060 tccgaaattt taaacgagct ctccagaaaa tgggtttact gggttcgagc gagaagcaac 3120 tatagctcat gggtagaatg gtggatgttg catacaaaac ctaaaataat ttcctttctg 3180 aaatggaaaa cctcaaccgt ataccgagaa ttcagagacg caatagaatt gtatcgtagg 3240 tttttgaaag aggcttatga taactaccac ggaaatccgc agcaagttgg agaaattcat 3300 cacatcaaag cacaaatgct tcgtctccaa agaaatttct cgatgaattt aaaaaaatgt 3360 aacgaaacgt acatctccgg cgaatccacg actttgttcc atgtagctca ggttcacggg 3420 aaaagatcaa aaacgtctat cccaacgtta ctgacagaaa atggaattga aattcacgat 3480 cagtctcaaa tcaaggagtc gatcagagcg tattttgaaa acctatactc ctcggtggat 3540 gatgagccat ccatagactt caggccatca aagtcaatcc cccaaaacaa tacagcaaat 3600 gaaaacctga tgcgagaaat aacacaggat gaaattctck cgggcatcaa aggaagctca 3660 tctcggaagt caccaggcgc tgatgggctt ccgaaagagt tttttctgaa aacctggcga 3720 ataatttgtg tcgaatttac caatgtcatc aatgacgttc tacaaggcaa ggcgatgaaa 3780 gatttcttca atggcatctt ggttctcgtt aaaaagaaag ggagcgacaa gagtgcgaga 3840 ggataccgac caatttcact cctgaacttt gattataaag tcttagcgcg aatcctaaaa 3900 cagagaatga attgtctcct tccactagtt ttgtcagaaa accagaaatg ctcgaacggt 3960 aaaaggacta tttttgaagc aacaagccgt atttacgata aaatatgtca actgaagcat 4020 tttcgtagaa gttcgctgtt agtatcsttc gatctagacc acgcatttga tcgagtgagc 4080 cacacgtttt tgaaacaaac tatggatcag atgaatttca acccgagttt cattaaccta 4140 ctcagtacga tttggaatca atcctattca aaaattttgg ttaatggaca cttgactcag 4200 gagataaaaa tcactcgatc ggtaagacag ggtgaccctc tgtctatgca cctttttgtt 4260 ctgtatttgc aacctctact agataccatc tcacgaaaat ttcccaatgc tatgatgaac 4320 gcctacgcgg atgatatatc gatgtttctc gacgatgagc aagcagtaat gcaagttgtt 4380 actatattca atgattacgg actagcttct ggtgctaaac taaacaagca aaaaactgta 4440 gcagtaatga tcggtagtgt taatctcaca gaagatacgg aatggttaaa catcgagact 4500 tttgtcaaca tccttggcgt acgatacgga gacaatatca aacaagcgca aaagttaaac 4560 tggcagtctg ttctaaatgg attgcgaaca cgactttggt ttcaccaccc aaggaagctg 4620 aatctgattc aaaaaatcat acttataaat acatatatca actccaaaat atggtacatg 4680 gcttctaata taccccttac aaagggatac gcaacgaagt tgaaagctga aatagggaaa 4740 tttatctggc atggccaaac tttacagcgc atagcttttg caaatttgat tcttccaaaa 4800 gctagaggtg gtctgaactt acactgtcca gaaacgaaag ccaaggcact tcttctaaat 4860 cgaatgttaa atttgkctca cctgcttccg ttcgctgcaa gtctgttaca aaattcaaac 4920 aatacagtac cacctatgtt taaccatatt agtgttttga aaacagagct agcgcaacta 4980 ccggatagcc tagtcagtac gccaacatct aaggggatat atctacactt tttggaaatg 5040 aaacctgatc caggttttac atcgagcagt caacgcaact ggaaaatcgt tttcaaacat 5100 ctacactcta aaaatttaac atcaacgcaa cgagcaaact ggttttccat aatgcacaaa 5160 aagatcaagc atagagagtt actgtttcaa cgaggcgtct cagataatcc ctattgtgaa 5220 atttgtccag gagaagtgga aaccacggtg cacaagctgt ttcggtgcca aagcacaaga 5280 ggaatttgga ggtacctacg aagacaaata ttattgcagg aatctagcct tatgagatta 5340 gaaccagaag aatttatgta tccttgctta agaaacataa accaagcttc taaagattat 5400 ataatcaaac tgttaggaat ttttttcagt tatcatattg aaacaccaga aaacttaata 5460 ggactaggaa actttatatt ttatcttaat attaacaagt aattaagcaa atacaaataa 5520 ggttttacaa aaaaaaa 5537 // ID Copia-100_AA-I repbase; DNA; INV; 3345 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-100_AA_; KW Copia-100_AA-LTR; Copia-100_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3345 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1879-2406] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 632..3178 FT /product="Copia-100_AA-I_1p" FT /translation="MKSQARTNAGAGVAEKTCFFCKKKGHLRRNCRLLQEK FT QNKSGEEKQKGNPKAKQATMEHSNAVLFLAGEEQQESRWVIDSGASRHMTN FT SKSLYTLFVKDDTPNVVLANGATVKSVGSGDCKLNGVSGCGGITEITLKDV FT LFVPTLNSSLLSVAQLTERGFTVGFKRDGCEIVNVAGEVVAKGDRSGSLYV FT LRISESAMEVQCKRHSEWCQHQWHRRFGHREPSVVEKMYKEKLCNGMVLKD FT CGIRMTYEPCIEGKLSRTPIPKAATKKTKEVLDLVHTDLCGPMRTPTPSGN FT RFLMTVIDDHSRYTVVYLLEKKSDAPDRLMEYVRYVETLSGRKPKVVRSDG FT GGEYCNERLKSFFEREGIKAQYTTAYTPQQNGVAERRNRYLQEMATAMLLD FT AKLDKKYWGEAVMTAAHLQNRLPSRSVDSMPFEKWFGWKPDLSYLKVFGCK FT AWVHVPDTKRGKFDSKARKLTFIGYSEQHKAYRFVDLETEKVTISRDAKFV FT ELEEEEAVSQPKTVSEPTDDEVNWMPLRNSAPADEQQDMNEAEDDSSEVDS FT ECEEFFDMGSPQAFQEDPLAIKIEPSEEPVEQEDVPASVSKLATRKTRGKL FT PKHLGDYEVGIATHAMKEPVNFNEATKSEQAKSWRSAMDDEYQSHMVNGTW FT DLVDLPPGKKVVGSRWVYKAKLNEAGEVARFKARIVAQCAVCARPDIAASA FT SILGRKFAAPTENDWTAAKRVIRYLKGTVDWRLKLGGVETNELVAYSDANW FT AGDPQTRKSMTGFATFFAGGVVSWVSRRQDCVSLSFMEAEFIALGETCQEV FT LWMRRLLEDLGESQLTATVVHEDNQGCLAFAQSEKINKRSKHVETKR" XX SQ Sequence 3345 BP; 872 A; 699 C; 1042 G; 731 T; 1 other; ggttatgggc ccagtggaag ttgtgaagga atcgtgagtt ttccgtgaga aaagtgcgtc 60 gttcgcgaaa gtgattttcg gttcggtcag tcgggaaaag tttgactttt gtttgcgtcg 120 tcggcgtcgt cgtgagtgta ttggtgcagc gcagcatgga ttccatcagc aagtttgcca 180 aactaaacaa ccagaattgg caaacctgga agtttagaat ggaaatgctg cttaccagag 240 aggaactctg gtacgtcatc gacaccgtga agctggagcc ggaaacgccg cagtggacca 300 aggacgataa gaaggcgaag gctacgatgg ggttgtgcat cgaagacaac cagtacagtc 360 ttgtgaagac ggcagcatca gcgcgtgatt tctgggagca gctgcagaac tatcacgaga 420 aggtaaccat ggtccgatta tcttccgaga ttttcgctgc tgaaacggcc ctgcagtttg 480 caggagcctt cctgattcct actccggatt agtgacagcg ctcgaaggtc gccctgacgg 540 cgctttgacg gtacagttgg tgaaatccaa actactcgat gaatacaagc ggcgcaagga 600 gtgcggaagt ggatcatccg atgtgaaagc gatgaagagc caagcgagga ccaatgctgg 660 tgctggtgtt gcagagaaaa cttgtttctt ctgtaagaag aaggggcacc tgcgtcgaaa 720 ttgtcgtcta ctccaggaga agcagaataa aagcggtgaa gagaagcaga agggcaaccc 780 gaaagcgaag caggctacta tggagcattc gaatgcagtt ctttttcttg ccggtgagga 840 gcagcaggaa agccgctggg tgatcgatag cggtgctagt cgacacatga caaacagcaa 900 atcactttac acgctgttcg tgaaggatga cactcctaac gtggttctgg cgaatggggc 960 cacagtgaag tctgttggat ccggtgactg taagctgaac ggtgtgagtg gttgtggcgg 1020 cattacggaa atcaccctga aagacgttct cttcgtgcca acgctgaata gcagtttgtt 1080 gtcggtggca cagttgactg agaggggatt taccgttgga ttcaagcggg acggctgcga 1140 aatcgttaac gtcgctggag aagttgttgc caagggtgat cgcagtggaa gtttgtacgt 1200 tttgaggatt tccgagtctg cgatggaggt gcaatgcaag cggcatagtg agtggtgcca 1260 acaccaatgg cacagaaggt tcgggcaccg ggaaccgtcg gttgtagaga agatgtataa 1320 agagaagctt tgcaacggaa tggtgctgaa agactgcggc atacgtatga cgtatgagcc 1380 gtgtattgaa ggtaagctct ccaggacacc gataccgaag gcagctacga agaagaccaa 1440 ggaagtgttg gacctcgtgc acacggactt gtgcggtccg atgcgaaccc ccacgccgag 1500 tggaaaccgt tttctaatga ctgtcatcga cgatcacagt aggtatactg ttgtgtacct 1560 attggaaaag aagtccgacg ccccggatcg actgatggag tatgtcaggt acgttgaaac 1620 cctttccgga agaaagccga aagtagtcag gtcggacggc ggcggggaat actgcaacga 1680 acgtttgaag agtttctttg agcgagaggg tataaaggcg caatacacga ctgcctatac 1740 tcctcaacaa aatggcgtcg ccgagaggag gaaccgttac ctgcaggaga tggcaactgc 1800 catgcttctt gacgccaagc tggacaagaa gtattggggt gaggctgtga tgacggctgc 1860 tcacttgcaa aatcgattgc cgtcgcgatc ggtggacagc atgccttttg agaagtggtt 1920 tgggtggaaa cccgacctga gctacctgaa ggtgtttggc tgtaaggctt gggttcacgt 1980 tccggacacc aagcgtggaa aatttgacag caaggcgcgt aagctcacgt ttatcggtta 2040 ctccgaacag cacaaggcct accgttttgt ggatctggag accgagaagg ttacaattag 2100 tcgcgatgcc aagttcgtcg agcttgagga ggaggaagct gtgtcgcagc cgaagactgt 2160 ttcggagcct acagacgacg aggtgaactg gatgccgttg cgaaattcgg cgccggccga 2220 tgagcagcag gatatgaatg aagccgagga tgattcaagc gaagtggata gcgaatgtga 2280 agagttcttt gacatgggtt cgcctcaagc attccaagag gatccgttgg cgattaaaat 2340 tgagccatct gaagaaccag ttgagcaaga ggatgttcca gcgagcgttt caaagttggc 2400 gactcggaag actcgtggca agttgccgaa gcatttagga gattatgagg tgggtattgc 2460 tacccacgcg atgaaggaac cggtgaactt taacgaagca acgaagtctg agcaagcaaa 2520 atcctggcga agtgcgatgg acgacgagta ccagtcgcac atggtgaacg gcacttggga 2580 cctagtcgac ttgccgcccg gcaagaaagt agtaggtagt cgctgggttt acaaggcgaa 2640 gctgaatgaa gcgggcgaag tagcgcgatt caaggcacgc atcgtagcac agtgtgcagt 2700 gtgtgcaaga cccgacatag ctgcaagtgc gtcgatttta gggagaaaat ttgctgcgcc 2760 tactgagaac gactggacgg ctgctaagag agtgattcgt taccttaaag gcaccgtcga 2820 ctggagatta aagctaggag gcgtcgagac caacgagcta gttgcgtact cggacgctaa 2880 ttgggcagga gacccacaga cgcgtaagtc aatgaccgga tttgcaacgt tctttgctgg 2940 aggtgtcgta tcttgggtca gtcgtcgtca ggactgcgtg agcctttcgt tcatggaggc 3000 ggagttcatt gcgttgggag aaacttgcca ggaagttctg tggatgagac gcttgttgga 3060 agatttgggg gagagccagt tgaccgcgac tgtcgtccat gaggacaacc aaggttgtct 3120 agcgtttgca caatcggaga agatcaacaa gaggtcgaaa catgtggaga ctaagcgttk 3180 tttcgtcaag gatttgtgtg agcgaggaac cgtaaagctg aaatattgtc aaaccgacga 3240 aatgcagagc cgacgtgctg acgaagccac ttggggctgt gaaacaccat cggttctcgg 3300 agctacttgg acttggacct ccgagggctg accgttgagg aggag 3345 // ID DNA8-46_AP repbase; DNA; INV; 389 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-46_AP. XX NM DNA8-46_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-389 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1976-1976 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 389 BP; 170 A; 39 C; 35 G; 144 T; 1 other; tagtgtttga agtatgaaat atttcacaaa aacgtgaaac attgaaattt ttttaacaca 60 aataaaaaca cgatttattt atttttttaa cacaatgttc agtgttttaa attctaatgt 120 ataaatacag taggtatata atatacaata tacatgtata atattgtata cctacttgta 180 tcatttaaac atcaaaatat ttaaaaatnt tgaataccat tttaaatgta taattccaaa 240 ggttaggtat acatacaggt accatacagt tatgaaggta atttatgaca attatatgta 300 aatataaaac ttatttaaaa ataaaacaaa atttattaaa atactgaaat atttcaaaat 360 tttaaatttc atgaaatttt caaacacta 389 // ID Helena_Ds repbase; DNA; INV; 4912 BP. XX AC chr3R; XX DT 17-DEC-2007 (Rel. 12.12, Created) DT 19-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE The only complete copy of the non-LTR retrotransposon Helena from DE the complete genome of Drosophila simulans. XX KW Non-LTR Retrotransposon; Transposable Element; HELENA; Helena_Ds; KW LINE-like; ORF1; ORF2. XX OS Drosophila simulans OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4912 RA Rebollo R., Lerat E., Lopez Kleine L., Biemont C. and Vieira C.; RT "Losing helena: the extinction of a drosophila line-like RT element."; RL BMC Genomics 9, (2008). XX DR droSim1; chr3R; Positions 15006433 15011344. XX CC Helena belongs to the JOCKEY clade of non-LTR retrotransposons. CC The Helena sequence includes two open reading frames that CC encode the 579-aa ORF1p (positions 294-2033) and 907-aa CC ORF2 (positions 2030-4753). XX FH Key Location/Qualifiers FT CDS 294..2033 FT /product="ORF1p" FT /note="DNA/RNA-binding protein." FT /translation="MDTNNETSNSQLRNKPIDYEEMRQVAGKSPLDIRMEL FT LNQRSSCTESVNLSTITTTITFTSATCNTTSTTTAAISRPSSREKAATRLS FT DAQKLLTKERKRKTNLTPNHSSKRAVRDAKLEPYPAINSNSNSNNRFAMLD FT MELDETSDGIETPQVCHDVASTSCASAAPNIPNDDCVPNVTSNSPQQYTDK FT QNSYKQSKPPQIVLSLTNLNDLYELITEVTSLDNLTVKVNQGETVRILPKD FT SDTYRAIVNLFDNSGIEFHTYQMKEEKPHRIVVKGLHHSTLTYEIIDNFKT FT YGFDVLQVHNPRSRRNREEKLNIFFINIKTCAKINDIYDINTICRQKVRIE FT RMRKSSEIAQCIRCQEFGHTAKYCRRHPNCARCGENHLTKLCVLPNDQQPI FT CIHCGGNHTASYKGCQFYQEYLRRSMGTVKKQQPKQAKFDKSATNQQQPQQ FT KQQHAIASTPKDHTGSLSYADIARNGNTTAQPRLHNVQLKGTNIKQQHPLD FT VQSILAQQQEQFMKWQQQLQQQQQQQFLSWLQQQQQEQQQQNKLNSQRLER FT LENIVFEMANMLKQRTGDTSAPQLHSNALPSQ" FT CDS 2030..4753 FT /product="ORF2p" FT /note="endonuclease/reverse transcriptase." FT /translation="MNPLKILIWNVNGISGKAREVELFAHNNGIDILLLNE FT IRLNRGNTVKIYGYSFYPAYKPSSHNHGMGGAAVLVRSSLRHFPQRVIETR FT TIQMSSVKVSTGLGDMEFSAIYCPPRNRIEERHFSDILVSCGQRYFVGGDW FT NARHWLWGDTYNSPRGRELAEAISARGAYILATGSPTRYPHVPSHRPTCID FT FAVYHGINLDRTSISENWDLDSDHVALVATLQTEGAYVRPCSRLINSRTDL FT LVFRQHLENSLQLNTVLSSKEDIENAVDSLTQNIHRAASASTPSEPEIRPR FT SYGIVLTREARELIRTKRRLRRRAIRTQDPWDRILWNRAAKQLKVVLRELR FT SDFFEQKLSSMDYTVDANYSLWKCTKALKRQPLRWVPVRCPGGEFAKTEVE FT QANAFGFHLEDRFTPYDFATTEQIRETHQYLQMPLQMSWPIKPIRIEEILE FT IIKLLPKHKAPGIDRICHATLKVLPIKAIIYIALIFNAILRIQVFPRQWKM FT AAILMIHKPGKPEDDPESYRPISLLPSLSKLWERLIANRINDIIRQGNILP FT DHQFGFRKGHGTIEQVHRLVKHILQAFDDCEYSNAVFIDMQQAFDKVWHVG FT LLCKIKTLLPAPYFCILKSYLEERQFKITVRNSYSSIYPMRAGVPQGSVLG FT PLLYSLYTADIPCPSFEHMAAPNRTLIATYADDIAVVYNSRDSREAANGLQ FT EYINALAAWCKRWNLKINPLKTTNPCFTLKTLIPNTPPIRLEGVTLNQPLQ FT ATYLGITLDKRLTFGPHLKNTVKKCGHRSQQLRWLMNRRSTLSMRCKRAVY FT AHCIVPIWLYGIQIWGIAAKSNYNRIQVMQNRALRQITNCPWYVRNSTLHK FT DLNIHTVESQIGRHTSRYSDRLLSHSSLLARRLIPARPLRRLKRQGFAKTI FT GQQ" XX SQ Sequence 4912 BP; 1657 A; 1095 C; 981 G; 1179 T; 0 other; tgagctggtc acactgagcg gttgtttata gtttttgtct gtgaatttca ctttaaattt 60 aattcgcaaa cctgcattcg tagctgtgca aaacagcacg cattctaaca tttggactct 120 gccttgcaga gtcggcatgt gaacatcaac aaagaacggt ggataaggtg caaaaaaatt 180 tgcattctct gcccattggg cccctcacca cgtggcagcc gcacaaggga ggcaacaagc 240 aacgtactta ttagaaaaaa aaaaaataaa tagaccggat tagtaactgt aaaatggata 300 cgaacaacga aacttctaac tctcagctta gaaataagcc gattgattat gaagaaatgc 360 ggcaagtagc tggtaagtct ccactggata tcaggatgga attattgaat caacgctctt 420 catgcactga atctgttaac ctaagcacaa taactacgac gataacattt acaagtgcaa 480 cttgcaatac tacatctaca acaactgcag caatcagtag gccgagctcc agagaaaaag 540 ctgctactag attgtctgat gcgcaaaaac ttctgacgaa agagaggaag aggaagacca 600 atttaacacc caaccatagc tctaaacgag ctgtcagaga cgcgaagctg gagccatatc 660 cagccataaa tagtaattca aactcaaaca ataggtttgc catgctagac atggaattgg 720 acgaaaccag tgatggcata gaaactccac aagtttgtca tgatgtcgca tccacctcgt 780 gtgcatcggc tgctccgaat attcctaatg atgactgtgt accaaatgtg acatccaata 840 gcccacaaca gtatactgat aagcagaatt cttataaaca atcaaaacca ccgcaaatag 900 tactgagcct taccaatctt aatgatctct atgagctcat tacggaggtc actagcctag 960 ataatttaac agttaaagtc aatcaagggg aaacagtgag aatattaccc aaagactctg 1020 atacttacag agctattgtt aatctttttg ataattcggg aattgaattc catacgtacc 1080 aaatgaagga agagaagcct cacagaatag ttgttaaggg actccaccat agcaccctaa 1140 cctacgaaat tatcgacaac tttaaaacat atggctttga tgttctacaa gtacacaacc 1200 caagatccag gagaaataga gaagaaaaac ttaatatatt cttcattaat ataaagacct 1260 gtgcaaaaat taatgacata tacgatatta acacaatatg ccgacagaaa gtgcggatag 1320 aaagaatgcg taaatcatct gaaattgcac aatgcatacg ttgtcaggaa ttcggccaca 1380 cagctaaata ctgtcgtcgt catcccaact gtgctcgatg tggtgaaaat cacttaacca 1440 agctatgcgt acttcccaat gatcaacagc ctatctgtat acactgtgga ggaaatcaca 1500 cggcaagtta caagggttgc cagttttacc aggagtatct tcgacgatca atgggcactg 1560 taaagaagca acaacctaag caagccaagt ttgataaaag tgcaacaaac caacaacaac 1620 cccagcaaaa acagcagcat gcaatagcta gcactcccaa agatcatact gggagcttgt 1680 cctacgcaga tattgcaaga aatggcaata caacagccca gcctcgtcta cataatgtac 1740 aattaaaggg aactaatatt aagcagcaac acccgcttga cgttcaatca atattggcac 1800 agcaacagga acaatttatg aagtggcagc aacagcttca acagcaacaa cagcagcaat 1860 tcctatcgtg gctacagcag cagcaacagg agcaacaaca acaaaacaag ttgaatagtc 1920 aacgactcga aaggctggaa aatattgttt ttgaaatggc caatatgctg aagcaacgga 1980 ctggggatac atcggctccc caactccata gtaacgcttt accatcgcaa tgaaccctct 2040 gaagattctt atctggaatg taaatggtat ttcaggtaaa gccagagaag tagagctctt 2100 cgcacacaac aacggcattg acattcttct cctaaacgag atcagactca acagagggaa 2160 cacagttaag atatatggat acagctttta tcccgcatac aaaccttcaa gccataatca 2220 cggaatggga ggagcagcag tactggtgag aagttctctt cgtcatttcc cgcaaagagt 2280 tattgaaacg agaactattc agatgtcttc agtcaaggtc tccaccgggc tgggagatat 2340 ggaatttagc gcgatttact gtccaccaag aaatagaatt gaggaaaggc acttcagtga 2400 catacttgtc tcttgtggac aaaggtattt cgttggtggg gactggaacg cccgacattg 2460 gctatggggt gacacgtaca attcacccag aggtcgagaa ctagcagaag ccatttcagc 2520 cagaggggct tatatccttg caacaggttc accaactaga tacccacatg tgcccagtca 2580 cagacctacc tgcattgatt ttgctgtgta ccatgggata aacttagaca gaactagtat 2640 ttctgaaaat tgggatctag actccgatca tgtagccctt gtggctactc tacaaacaga 2700 aggtgcctat gttagaccat gctctcggtt aataaacagc agaactgatc tccttgtttt 2760 cagacaacat ctggaaaact ctctccaatt aaatacggtt ctgagctcta aggaagacat 2820 cgagaacgca gtagacagtc taacgcaaaa tatacataga gccgcttctg cttctacgcc 2880 gtctgagccc gagatacgcc ccagaagtta tggtattgtt ctaacaagag aggccagaga 2940 acttatcaga actaagagac gccttcgaag aagagcaatt cgaactcagg atccatggga 3000 cagaatcttg tggaaccgag cagcaaagca actcaaagtc gtcttaaggg aactcagaag 3060 tgatttcttt gagcaaaaat tatcctccat ggactacacc gttgatgcaa actattcgct 3120 gtggaagtgc acaaaagccc ttaaacgaca accacttcga tgggtacccg tacgctgtcc 3180 aggtggggaa tttgcaaaaa ctgaagtgga acaggctaat gcattcggct tccacctaga 3240 ggatcgcttc actccttacg acttcgccac gacagaacaa ataagagaga ctcaccagta 3300 cctacaaatg ccattgcaga tgtcttggcc tattaagcca ataaggatag aagaaatcct 3360 tgaaataatt aaattactgc cgaagcataa agctccagga attgacagga tttgtcatgc 3420 cacgctaaag gttttaccta taaaagcgat aatatatata gcactaatct ttaatgctat 3480 tttaaggatc caagtgttcc caagacagtg gaaaatggct gctattttga tgatccacaa 3540 gcctggaaag ccggaagatg atcctgagtc gtatcggcct ataagcctct taccctcact 3600 ttctaaatta tgggagagac ttattgccaa tcggataaac gacattataa gacaaggcaa 3660 tatcttgccg gatcatcaat ttggatttcg aaagggacac ggaactattg aacaggtcca 3720 cagactggtg aaacacatac tacaggcttt tgacgactgc gagtactcca acgccgtctt 3780 tatagatatg caacaagcct tcgacaaagt atggcatgtt ggattattat gcaagataaa 3840 gacccttcta cctgcgccct acttctgtat tttaaagtca tatctagaag aaagacaatt 3900 taaaatcacg gtgagaaata gctactcctc tatataccca atgagagctg gagtccctca 3960 gggcagtgtt ctcggaccgc tactgtattc cttgtacact gctgatatcc cttgcccgag 4020 tttcgaacac atggcagcac cgaacaggac tcttattgca acctatgcag atgacatcgc 4080 agttgtatat aactctaggg acagcagaga ggcagctaac ggactacaag aatatattaa 4140 tgctctggca gcctggtgta aacggtggaa cctaaaaata aacccactga aaacaacaaa 4200 cccgtgcttc acgttaaaaa cgcttatccc aaacacccct ccaattcggc tagaaggagt 4260 taccctgaat cagcccctgc aagcaacata tctaggtatc accctggata aacggctcac 4320 ctttgggccg catctaaaaa acacagtaaa aaaatgtggc cacagatcac aacagctgag 4380 atggctcatg aatagaagga gcactctttc gatgaggtgc aaaagagctg tgtatgcgca 4440 ctgtatcgta ccgatatggt tatacgggat ccagatttgg ggaattgcag ccaaatcgaa 4500 ttataatcgt atccaggtga tgcaaaatcg cgcattacga caaataacca actgcccctg 4560 gtatgtacgt aactctacac tccataaaga cctcaatatt cacacagttg agtcacaaat 4620 tgggagacat acaagtcgat atagtgacag attactgagc catagcagtc ttcttgcaag 4680 acgtctcatc cccgctcgac ctctaagaag acttaaacgg caaggcttcg ccaagacaat 4740 tgggcagcag taaaaactct tcatttatgt tcctactcca cttattttat tttatagtgt 4800 ttggtaattg ataagaatac ttctccacac tgtgatgtta agatacaata ttatgatgta 4860 cggattcttc acttaataaa tattaataaa aaaaaaaaaa aaaaaaaaaa aa 4912 // ID Copia-22_SI-LTR repbase; DNA; INV; 161 BP. XX AC AEAQ01023538; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_SI_; KW Copia-22_SI-I; Copia-22_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-161 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023538; Positions 20 180. XX SQ Sequence 161 BP; 37 A; 33 C; 30 G; 61 T; 0 other; tgttagagat aacctttttt tgttcctcgt gttatatact ccacattata tatgtatgta 60 tattgggaga ctctcttttg tattgctaca cgtggctctg gccaataaac actttttttc 120 ttttcaatag cgttgagcct gagaaccgcg acgcgcagtc a 161 // ID MULE-1_CapOwc repbase; DNA; INV; 5149 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MULE-1_CapOwc. XX OS Capsaspora owczarzaki OC Eukaryota; Ichthyosporea; Capsaspora. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5149 BP; 1193 A; 1464 C; 1363 G; 1129 T; 0 other; cgaaccagaa caatacgcct aaatccctac gggttccata atgaatggta aaatggtaat 60 catgagatat caccataatc gacccaccat tgtgatcgac ttgatgtggc caataatagt 120 gtcagaggag tggcggtgct cagatgagcg gttttctcag aaaatcgaaa aatcaaaatg 180 gcctcgaggc gaccgcaaaa cagcgccaca agcgttcttc caggaccagt gcagcagagc 240 atcataatct ttgctcaact ttgcaccccc cgcattttcg agatcgagca caccgactgc 300 ctagggtgca tcgcgtacta ttacggcgag gaagcgcgca ggatagaaac actgttttga 360 cccccagaag aacgccacaa cgccggtcgc cattgccgag ctcgatttcc gacgtgtcca 420 gtcggtgtcg gaccgcgcac ttttcagggg tgattacaga tgatatttac aaccgttttg 480 gctcaatggc ggggcgatcg cggtggggag cgattttgtg tgaagctgga tctttaccac 540 tttgggctgg attgagccac tcttttctgt tgttcttcag agccataggt ctcaggtcga 600 ttcttgtact tttccgtcct cacgagaccg accagacctg caacacacac acacacgtca 660 agcaattcat ccaagcataa gaacccgaca cacaagagcc agcgagcgca cgaggcgaga 720 tggaggagga cgaggagatc cagccgcaac caagctacga gacatgcagt gtgcggctgg 780 tggtccttca cgacctcgtt cccgacgcca gccaggatcg cttcatggat ctggatgtac 840 ctgttgccca gccagaggcg ctgccaacgt acggcgatgt ggagcgagcc ttgcagaccc 900 atgactttag agaagttgcc ttccgcccaa accctcagtt ttggtgtcca gtcgttggcg 960 tcaagaacaa gttctacccg tcaaacaaga cgattgcaca ccccgaatgc gttcatcttt 1020 ctgtcaggtt gcgttgcccc tgttgcaaat ccaacatccc gcggtcgcga tttcacagcg 1080 cgaacaaaac ttgagattgt cttccgtgca acccgcgtcc accgacttgg ccgacgacga 1140 aactagcgta atgcaagatg atagggacga cattcttgaa tctcttaacg accaagacgg 1200 gcacgacgaa aactgccgct caagaacgcc tcggaagcgc aggggagtcc atctcaccaa 1260 ggaggagcga gacagacttc cgcagtggag ccctagtacg acgctcattc ctgttcgaac 1320 acgccgttgg tattgcttca atagacgcga tccgtcgagg ccgcctacga ccgtgccaga 1380 cacaccaact ctccaacaac gatttattcg cgcctgctcc agcaagaagc gagttgacaa 1440 caactttcag ttacgtccgg agctcggcgc cggatcctgc agtttcggcc tgagattgac 1500 ggagcttggc caacacacgc acggcgagct gcaccttatt gggttcaagc tcgagcagat 1560 gggtcgccac gacatggctt gcaccgacgg ctcgctcgct ttcaagcagc gcgcgcatcc 1620 tgaaatcaaa gagtgggtca tggatctggt gcgcaagggg tgctcggagg cgtccattgt 1680 agcgcagttg gaagggccaa ctctcaaccc gccgcctcgc cctctttcaa gttggaacgt 1740 accgtacact tctggtcgct ggtatcccac caaagacgca attcgcatga tgatagtcaa 1800 ctcaagcagc gagaagctac atggaaccga ccagcagcgc atcgaccaat ttctcgcgtc 1860 aacctcgtcg tatgcctgct ctcaccctca ctcgctctac caagatgggc agtgccccgg 1920 cgaaagccaa tacggcgaca aaagcgatgt catttatcat caatttgttc actgcgactg 1980 ccagagctgc ctcgaaatcc ccctgaggga gtcgaacatt ctgattttaa tgacgccttg 2040 tcagcgggca acgtgggacg cccttcgtcc caaggtcctg ttcctggatt caaccttctc 2100 agtcaaccgc tacgactggg cgacaacagc actcgttgct gctgacgaat caggcgctgg 2160 tgttgtgccg ctggcgttca tgatctccga cagagatcgc gagcatgaat atgcgcggtt 2220 cttgtccgag gtcaagcacg ccgtaggtga agcgtttaat ccggtgcgca tcgtcatcga 2280 caaatgcggc gctgagcgaa actgcatcaa tcaggcggtg ccgggtgtgc aggtggcaac 2340 ttgctggttt cacgtctgcc aagcggtgga gcgccagtac ccgggcagac tagatcttct 2400 ctcgtgtatg gctaaaatgc ggtatgccag gaatgcacaa gagctgaagg acgcgagcaa 2460 cagcttgctc aatgccgccg gcgagttcca aggttatttt ctgtctcagt ggctcaacga 2520 acgagatatt aaaacctgga cgctcttcca ccttggcttg gtgcgtgatg cgctggatca 2580 aggccacacc aacaatttga ttgagcggta agcttgtgat gattgtgagt cgcttgtccc 2640 gtgccgaact ttggaacctt gttgacatgt cattgcgact tgcctccccc atagcttttt 2700 cggcacgttc aagtacaagt ttctgaaagg ttaccgcatg cacgatttgg caggtctcgc 2760 aatcaggatt gttcgggaca cgaaccagca cttctgtttg caacgccaac gccagctact 2820 cgacaagccg aatgtcgcct tccttccaaa gttccgctcc ttactgcttg agaaagagat 2880 gcacgcagag aagctcgtgg aaaccggtca ggtgttggag cttcatcgct actttggtgt 2940 aggacttgtt ctgcctcagc ggacgctttc tgagcttgat ctactccaaa ttcgcgaact 3000 cgcaaacgct caccttgagc cattttatca gacgcgtggc atgcccttga ctgagggttt 3060 gcagcagctg cacgaagcgc tgaagcaaca gtacccagat gttgtggtcg ttggtgtcga 3120 gcggcgcttt tgcagcgcca tgcacgatca cgctttcatt tgcagccata tccgtgccgt 3180 cgcgcggcac tttggaggca tcgagaactt tcgtcttcaa atccattctc tcttcgaaga 3240 gccaccagtg tgtcccgtgt cccgctcagc acccgtcaat cctgcaagcg aaactgctgt 3300 gtccgcagcc aatactgcgt atgccgatga tgccctgccg actggcgcaa ccgacatggc 3360 cgtagacgat cagcttgatg cagacggacc tgctcgcacc tctgacatgg ccgtagacga 3420 tcaccttccc tcgggcccgg atgcagacgg acctgctgac cgcacctcgg acctccccgc 3480 cgagctcgat ccgaatcacg gcgcgtcacc ttctgcgcgt aactccgatg atgctccaga 3540 gcacagctct ggacttgttg agacggtggc cagtcccgac gacctggtgg acgagctctc 3600 caagctgttc gagaaaggac tcgagcgtct caagaccatg gctcagggcg atgagttgca 3660 aagcctcgtt cgccagctca cacgcaattg cagcaacagc atgcggaatg cgacaggtca 3720 gcgagggatt ctttcattag atacccggcg cacgcgtggt gcaactcgta aggctgccaa 3780 atctacccga gctgcccaag ccaaggcaaa gcgaactgct gctgccaaag ccaaccgagc 3840 agcagcagca gtagcagcag cagcagcagc atcaacatcg tcagcctcaa ccgttcttcc 3900 aggtaacgca actgcatccg cagcagcgtc gttggcgacc agttcgtgga cacgacgcaa 3960 gaagtctgga gtgaaggggc ggcccaaagc taagcatcgt gttcctgctg cacttacccg 4020 agaagtcgca agatcctttt gcgccaaaga cagtcaagca ttcgagagcg caagggcggc 4080 ggtgactgcg tacaacaccc aacaatagta gtttgtcgac tacatcccga cttctcgcag 4140 caatcgcggt actcgagatc cgtggggcga cactacccct tcatttgcta attccgatcc 4200 cacatttggg aatgcgagtg atggtgcacg ttcagccagt cggaaacgac ctcgattgga 4260 gccagaggct gacccgcagg aacggaacca gcgcaaccaa cagaggattg gtttgaacat 4320 tttgcccctc aactgcggcc gcctggaggg cgctcagcgg ccatccgtgc gcgcgacaca 4380 attcgacggt tttcacgcta cgacacgagt cgggatgacg atgcagacgt gtttgaggcc 4440 cagcctgagg agatggacga cgatgacttc aggtacactc tggaggagtg tgtgcgtgct 4500 agacggcttg aaatgtttga ttcagctgta aacccagagt ttttgtgaag ttttgtatga 4560 ttgtgtgtgc ttggtgttgt actttgggtc attgtggtgc tgtcctcccc actgttttca 4620 cagcagcggc gcctgagagg tgtcagcatc cttgacccca ttcgatttta aattttgaca 4680 catggataaa cttgtctttg aagtttcgcg gcatagaacg tggcagaaga gcttgaaaac 4740 agctcgacag cgcaaaaagg aggaaagcgc aaaaggagga aaaaattttc taagtccaac 4800 ttttcctgtc gcccccttga tgctcactgc tacgcagcct gactgggtga cccgatccgc 4860 ccgcccttcg accccatgac tcaaacacca ataccattcc ataaatgccc tagtagtggg 4920 ttgctggtcc accgtggggg ttggtggctt ggccgttccc tgcacgctcg cggcgctccc 4980 aaccattccc tcccgtgctc ggtccagcgc agactttttc accggctcac agttgaaatc 5040 tacattatca tttatgggca acatgatgct ggcagagaga agaggctaga tgacactttg 5100 agattcggac ctggaacccg tagtgattta ggcggattgt tctggcacc 5149 // ID Gypsy-616_AA-I repbase; DNA; INV; 5033 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-616_AA_; KW Gypsy-616_AA-LTR; Ty3_gypsy_Ele25; Gypsy-616_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5033 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4084-4560] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 3274..4608 FT /product="Gypsy-616_AA-I_3p" FT /translation="MEKLKLALCDRPTLRLYCPNAVTELHTDASRIGYGAI FT LLQKFAGENAFHPVYFYSRKTSPAEANYPSYELEVLAVISALKKFRVYLLG FT ISFTIVTDCAAFKMTMEKKDISPRIAGWALLLEEYEYTLEHRPGSRMKHVD FT ALSRNPVVLLLQSNIIEQIKACQQKDERLSAIMKVLKTEPFYDYLVDHGIL FT YKYFNGGKLLVIPSSMQHSVIRNIHEQGHFGVKKMSELLTREYWIENLPFK FT LDKFVKSCVSCILAGRKVGKKEGYLYPIPKDEVPLSTYHIDHLGPLTASKK FT HYRYIFTVIDAFSKFVWIYPVKSTTADEVLKKLEIQRCVFGNPKRIISDRG FT AAFTSTAFQDYCETEGIEHLQITTGVPRGNGQVERIHGVIVPALAKLSVEN FT PENWYRYVGNLQRFLNNTLQRSIARTPFELLFGTKMKAPEDIKLAEIIEQE FT " FT CDS join(628..1500,1504..3246) FT /product="Gypsy-616_AA-I_1p" FT /translation="MKLNMMENTFTRLIDQLNFSNTTVTNLTQELSALRTE FT MNYMRAETTRLNSAVEEIRSGSMVPQQASTPNGDSSVGYDENRHIRSHANC FT AANNQVVINARVNAENDELNDGDGRADGSVLGRNIYPGVANGVANNDLLGN FT GPNNNSVTPENVQNQLDNRNRECENDNVGIGEGRHAIERVSGSCNGSVAYG FT GLRGDYHMEAHAHDMAREIEPRLSVSEAETAFSEFSGTEFYPVQKWIADFE FT ELSDSIGLTELRKFVIAKRKLTGLAKMSLNTTNNVSNWRTLKDFLMKEFES FT ENCAVVHERLRNRKRNLGENVLEYFLQMREIGAKANVDISSIITYTINGIN FT DNGPDKTILYGSKTLDEFREKLRVYQSIKDSKYGRDGNFSGKPNGYNRNFT FT NRFKSPPRSISCYNCGRDGHISRECPKKFGNKNANICRSNTAVSKGTYLTI FT MVGTIPFHAFFDSGSDVSLLKEDWKNKLNLKMDHNDRKRLLTLKGGIWTLG FT SVSLDIEVENSPLRITFDILRDEDLPHDILLGRNLLEFGDVSVTADGAKFS FT PKEEHFALRIRTDDVSDDCLEHIEDNFVRDKVKEIIENYSPQQVETTRVKL FT KIVVKNESPIQQLPRRLAPLEKKIVQEQIDKWQEDGIIRPSTSEYSSPIVL FT AHKKDGTRRLCVDYRKLNKIIVRDHFPIPLIEDIIDDLHSARVFSTLDLEN FT GFFHVPVEEDSIKYTSFVTPAGQFEFLRTPFGLCTSPTVFQRFVNDIFRDL FT IDKKIVVIYIDDILIPAENEEEALERLAMVLKVAQDHGLKVKWKKCQFLMR FT RIQHLGYEIEDGNIRPMSEKTDAVSKFPEPKNHKEIQQFLGLTGYFRKFIE FT GYSVIAKPLTDLLRKDVIF" XX SQ Sequence 5033 BP; 1708 A; 784 C; 1127 G; 1400 T; 14 other; aatttggggg ctcgtccggg aacgcgggaa gaaactgtcg cgatagtcgt taatgtgcgc 60 cggtgggcgg gaagtttaga agaattatcg tgaaatcgtg ccttgaaatt gcattgcaaa 120 gaaatattaa ctttaagtga agtgaattga attgaactta attttgaatt gaaaaaatta 180 gtgttaagat attaaattga attaattagt gataagtgaa tttggattat ttttttcata 240 atatagtggc agtgtaaaat atttcgccag acaagaatat aacaaaagtg ttctttgaat 300 ttaatttttc catcgtaaaa aaaaaccctt attgtgaact ccattacaag aagcagaatc 360 cactgatctg caaagttgac ctggggttaa gggtcattgt acggtgcaaa aaaaaataaa 420 caacaactcg gaagcgtgtg tttcagtgcc gataaacgga gaaaaaaaaa aagtgtatct 480 ctttcttcct tgtgactatt tcgttcggaa gtgatagtga cgatgcagga cagcagcatt 540 ccgccttttc ggcctagcag cggcgsgtca ccccctgctt ccaatgcgcc taatccccaa 600 gtgcaatttt ccgctgcttc atgtgaaatg aaattaaata tgatggaaaa cacttttacg 660 cgcttaatag atcaacttaa tttttctaat acgactgtga cgaacctgac acaagaactt 720 agtgcgttgc gcactgaaat gaactatatg agggcagaaa caacccgttt aaattcggct 780 gttgaagaaa ttcgctctgg ctcaatggtg ccccaacagg caagcactcc taatggagat 840 agttcggttg ggtatgatga aaataggcat attagaagcc atgcgaactg tgcggcaaat 900 aaccaagtag tgataaacgc tagagtgaat gctgagaatg acgaattgaa tgacggcgat 960 ggtcgtgctg atggtagtgt tttgggaaga aatatttatc caggtgttgc gaatggcgtg 1020 gcaaataatg atttgttggg caatggtcca aacaataatt cggttactcc agaaaatgtt 1080 caaaatcagc tagataaccg caatcgcgag tgtgaaaatg acaacgttgg gatcggtgaa 1140 ggccgacacg ctatagagag agtgagtggg agctgcaatg gctctgtggc gtacggtggc 1200 ttgcgtggtg attaccacat ggaggcccat gcgcatgaca tggctagaga aattgaaccc 1260 cgattatcgg taagtgaagc tgagactgcc ttttcagaat tttcgggtac tgaattttat 1320 cctgtacaaa aatggattgc tgattttgag gaattgtctg attctattgg cctgacagag 1380 ctacgtaagt ttgtgattgc aaagcgtaaa ctaacaggtt tagctaaaat gtcattgaac 1440 actactaaca atgtmtctaa ttggagaacg ttgaaggact ttttgatgaa agaatttgag 1500 twttcagaaa attgcgctgt ggtgcacgaa agattgcgaa atcgcaaaag aaatttaggt 1560 gaaaatgtct tagagtactt tttgcaaatg cgggaaattg gggcgaaggc caatgtggac 1620 atttcgtcga ttattaccta caccattaac ggaataaacg acaatggacc agacaaaact 1680 attttgtatg gatcgaagac cttggatgag tttagagaaa aacttcgtgt gtatcaaagt 1740 ataaaagaca gtaaatatgg cagggacgga aactttagcg gaaagcccaa tggctataat 1800 agaaatttca caaatagatt caaaagtcca ccaagatcaa taagctgtta taattgtgga 1860 agagatggtc atatctctag agaatgcccg aagaagtttg gaaataaaaa cgctaatatc 1920 tgtagatcta acactgctgt atcaaagggt acttatttga caataatggt tgggacaatt 1980 ccctttcatg ctttttttga ttctggttcc gacgtctctt tattaaagga agactggaaa 2040 aataagttga atttgaagat ggatcacaac gatcgtaagc gtttattgac gctcaaagga 2100 ggaatatgga cgttaggcag cgtttcacta gatatagagg tagaaaactc acctttaaga 2160 attacatttg acattttaag agatgaagat ttgccacatg atatattgtt gggtagaaat 2220 ttattggaat tcggtgatgt cagtgtcacg gctgatgggg caaagttcag cccgaaggag 2280 gagcattttg cgttgagaat aagaacggac gatgtttccg atgattgttt agaacatatt 2340 gaagataatt ttgtgcgtga taaagttaaa gaaattatag aaaactattc tccgcaacaa 2400 gtagaaacaa ctagagttaa actgaaaata gtagtcaaaa atgaatcacc cattcagcaa 2460 ttgccacgac ggttagctcc gctagagaag aaaatagtgc aagaacaaat agataaatgg 2520 caagaagatg gaatcatccg tccgagtact tctgaataca gtagccctat tgtattagct 2580 cataaaaagg atggaacaag aagactatgt gttgattatc gtaaattaaa caaaattatt 2640 gtgcgggatc attttccgat tcctttaatt gaggacataa tcgatgattt acattcagca 2700 cgtgtatttt caacattaga tttagaaaat gggttttttc atgtaccggt tgaagaagat 2760 agcataaaat atacgtcatt tgttactcct gcaggacagt ttgagttttt aagaacacct 2820 tttggattat gtacctcgcc aacagttttt caacgatttg taaatgatat ttttcgagac 2880 ctaatcgata agaagattgt tgtaatttac attgatgata tattgatacc cgctgaaaat 2940 gaagaagagg cattggagag attagcgatg gtgttaaaag tagcacaaga tcatggatta 3000 aaagtcaaat ggaaaaaatg tcagtttttg atgcgaagaa tacagcactt gggatatgaa 3060 atagaagatg gaaacattcg tccaatgagt gaaaagactg atgcagtatc gaaattccct 3120 gaacctaaaa accataaaga aatacaacaa tttttgggat tgacaggata ttttagaaaa 3180 tttattgaag gctattcggt aattgctaag ccattgactg atttgttgcg taaagacgtt 3240 atatttwgct ttggaccaga acaacgtaat gctatggaga aattgaaact ggcgttgtgc 3300 gatagaccaa cactaaggct gtactgcccg aatgctgtta ctgagcttca tacggatgca 3360 agtagaattg gatatggagc aattttgttg cagaaatttg ccggagaaaa tgcatttcac 3420 ccagtttatt tttacagcag aaaaacatca cctgctgagg csaactatcc gagttacgaa 3480 ttagaagttt tagcagtgat ttcagccttg aaaaagttta gagtttattt gctaggaatc 3540 tcgtttacta ttgtaacaga ctgtgcggct ttcaagatga cgatggagaa gaaagatata 3600 tccccaagaa ttgctgggtg ggcattgcta ctcgaagaat acgaatacac attagaacat 3660 cgcccwggtt ctcgcatgaa acacgtagat gcacttagta gaaatcctgt agtattgcta 3720 ttacaaagta atataataga acaaattaaa gcgtgtcagc agaaagatga gagactgtcg 3780 gcaataatga aagttttaaa aacagaacca ttttacgact atttagtaga tcatggaata 3840 ttatacaagt attttaatgg tggaaaactt ttagttattc ctagttcgat gcaacattca 3900 gttataagaa atattcatga acaaggtcat tttggwgtga aaaaaatgag tgaactctta 3960 acaagggagt actggatcga gaatttacca ttcaaactag ataaatttgt taaaagttgt 4020 gtttcttgca tattggcwgg tcgcaaagtt ggaaaaaaag aaggatatct ttaccctatt 4080 cctaaagatg aagtaccact atckacatat catatcgatc acttaggacc actgacagca 4140 tcsaaaaagc attatagata tattttcact gtaatagatg ctttcagtaa atttgtttgg 4200 atataccctg ttaaatctac aacagctgat gaagtattga aaaagttaga aattcaacga 4260 tgtgtatttg gtaatccaaa gagaatcatt tccgatagag gagctgcttt tacgtcwact 4320 gcttttcaag attattgtga aaccgaaggt attgaacatt tacaaattac aacwggggta 4380 ccacgtggca acggccaagt tgagaggatt catggagtca ttgtacccgc tctagctaag 4440 ctttcagttg aaaatccgga aaactggtat cgttatgttg gaaaccttca aagatttttg 4500 aacaatactc tccaacgaag catagcwcga acccctttcg aattattgtt tggcactaaa 4560 atgaaagctc cagaagacat aaaactagca gaaataattg agcaggaakg gataagaatt 4620 ttcgaagaat ctagggatga aattaggaag atggcaaggg ccagtataga agattcgcag 4680 agaaaaatga aaaaacaatt tgatcataat agaaaagatg cagaagatta tgaagttggt 4740 gatttagttg cgataagtcg tactcaattt ggcactcatt tgaaaatcgc tcggaaattt 4800 cttggaccat atcgcgtaat taagaagaaa agaaatgaca gatacgacgt tgaaaaggtc 4860 gggaatgagg aaggaccaaa gcggacatca acatgtgcgg aattcatgaa gcggtttgta 4920 ccaggcgaag atgaagacga agatatctac gactacgatg aagagcaaac cagaggggat 4980 gctgccgatc agattacacc atccgggtcg gatggtggtc aggatggccg att 5033 // ID Copia-8_AA-I repbase; DNA; INV; 1870 BP. XX AC AAGE02018254; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_AA_; KW Copia-8_AA-LTR; Copia-8_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1870 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018254; Positions 23293 21424. XX CC 'TTTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 343..1851 FT /product="Copia-8_AA-I_1p" FT /translation="MTEKVSIPLLNNSNYSTWKVRMQMLLMRDDWWFAIEE FT PRPDPATSSWKKGNQKALATIVLFLSDNQMHLVKDVTTAKDAWMKLKSFHE FT KVTMTSRVSLLKRICNMNMTEGQEMEKHFFELEELFDRLHCAGQQLDTSLR FT IAMMLRSVPDSYDGLVTALESRKDEDLTIELVKQKLMDEWQRRSERLGNPG FT ESDEKALKLYTKRQEQKVCYYCSKPGHFKRNCRLFKREQCGKEDEERTATA FT KQAAEAENPICFTVGSRQCKGCWFVDSGCSNHMTNDRRFFKKLDETVTVDV FT ILADGSRSKSAGIGEGLVKCLGSDGKLKEILIKEVMYVPELCSGLLSVRKL FT TRKGFRVQFGAKRCDIIGLSGKLVAVGELNDDLYELKTEEMANARNSSRRQ FT QSCRLVLLGRSCETGIEAHSNSTVRMKCVYDQLAEQYIQEELSNGNQYDRL FT DDDLKLLTDMRCLTKQNEFDSTRTLEQLRWHKGWCVRNKCVFSDTMHLMNR FT ATFPARG" XX SQ Sequence 1870 BP; 535 A; 329 C; 554 G; 452 T; 0 other; ggttatgggc ccagttgcgc gagtgtggtt ccccgtggaa acgaaagttg tggacattcg 60 atttcggaag agttagctgt cttgaaagct gtagcagtag gctgtgtggt gctcgtacgg 120 cgctctgatt tggcgtagtt cttgtccgga gtaagagcga gagagttggc tggctgaaag 180 ctgaaacatt tgatgagaag tgacttgtat cccgaatgaa gtgagaggaa tagtgttgtt 240 gctccggttc ctcactgtcg gttcgagttg agtgagatag ttggtaactc acagtgcgtc 300 aagaaaaagt cggtttggtt gtaacgaaaa caaaaacaga aaatgacgga gaaagtttcg 360 attcccctgt taaacaacag taattattca acctggaagg tgaggatgca gatgttattg 420 atgcgagatg attggtggtt tgcgatcgaa gaaccaagac cggacccggc tacctccagt 480 tggaagaaag ggaatcagaa ggcactggca acaatcgtgc tgtttttgtc ggataatcag 540 atgcatctgg tgaaggacgt aacgacggct aaggacgcat ggatgaaatt gaagtcgttt 600 catgagaagg tgacgatgac ttctcgggtg tcgctcctca agagaatttg caacatgaac 660 atgaccgagg gacaagaaat ggagaagcat tttttcgagc tagaagagct cttcgatcgc 720 ttgcattgcg cgggtcagca gttggatacg tcactcagga ttgcaatgat gcttcggagc 780 gtcccggatt cctacgacgg attggtaact gctctagaaa gtcgaaaaga tgaagatctg 840 accatcgagc tggtgaagca gaagctgatg gacgaatggc agcggcgatc ggaacgattg 900 gggaatccag gcgaatcaga tgagaaggcc ctgaagctgt atactaagcg acaggagcag 960 aaagtttgct actattgtag taagccgggc catttcaaaa ggaactgccg gctgtttaaa 1020 agggagcaat gtggaaagga agacgaagaa cgtactgcta cagccaagca ggctgctgag 1080 gcggaaaacc caatctgctt tacagttgga agtcgtcagt gcaaggggtg ttggtttgtg 1140 gatagcggat gctcaaacca tatgaccaat gacagaagat tttttaagaa gttggatgaa 1200 actgttaccg ttgatgtgat actggccgac ggatctcgtt cgaaatcagc aggaattgga 1260 gagggattgg tgaagtgtct aggcagcgat ggaaaactga aagagatttt gatcaaggaa 1320 gttatgtatg tacccgagtt gtgcagtgga ttactatccg ttcggaaatt aacacgaaaa 1380 ggtttcagag tacagtttgg cgctaaaagg tgtgatatca tcggtttatc aggaaagtta 1440 gtggctgttg gtgaactcaa tgacgacttg tatgagttga aaacggaaga gatggctaac 1500 gcaaggaaca gttctcgacg acagcagagc tgccgattag ttttgcttgg acgatcttgt 1560 gaaactggaa ttgaagccca ctcgaattca acggtacgaa tgaagtgcgt ttacgaccag 1620 ctggctgagc aatacattca ggaggagctg tccaacggaa atcaatacga tcgtttggat 1680 gatgacctga agctactgac tgatatgcga tgtttgacga agcaaaacga gttcgattcg 1740 accaggacct tggagcagct gcgctggcac aagggctggt gcgttcgaaa taaatgtgtg 1800 ttttcggata ccatgcattt gatgaatcga gcgacatttc ctgctagagg atgaaaacat 1860 cgaggaggag 1870 // ID Copia-109_AA-LTR repbase; DNA; INV; 376 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-109_AA_; KW Ty1_copia_Ele76; Copia-109_AA-I; Copia-109_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-376 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 376 BP; 131 A; 64 C; 70 G; 111 T; 0 other; tgttgaaata tcgcggaact attcgcgagg agatcgcgat gatcagattc gatcaaaata 60 ataacaatcc agcattgaag gattgatgac gtatgtcatg ttattccaat aatgcccaaa 120 cgagatccaa cgggaacttg catacgaggt tgtatgcggg cgctataaat gttctgctgc 180 agataggttc ataaatattc tcacttgaat tcagtgttgc gagtaatgct attggattat 240 catgtgaatc atattgaatt ataaacattg gattatttac atcgctatcg aacattgact 300 accaataatg acctaacaga cgaaataaag aatttttatc tacaaaccaa accaaaactg 360 agtttaagtt ttaaca 376 // ID BEL-56_AA-I repbase; DNA; INV; 6131 BP. XX AC supercont1.150; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-56_AA_; KW BEL-56_AA-LTR; BEL-56_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6131 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.150; Positions 1233993 1240123. XX CC Positions [5185-5742] - Integrase core CC 'CCCCAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 79..6129 FT /product="BEL-56_AA-I_1p" FT /translation="MATSSDLQNCIACNMPDREENLVACDACKDRYHFSCA FT SVDDGIREKCWSCSKCASAINVDDVSKSGKSGKSSRSSRSMRIQLQLMKIE FT EERKAEEKLMLERQQLEKERQEKALQEKAAMEKRFRDEKYALLMAEANEDD FT DESIASQKSRVSSKDKVNRWIKDHVGMLEDSDGQPYPVGGKSGSRDPPAKP FT ANRVEEGAKRLETPLTSLGENVGKSKATEVLKQGATTSTPNQPNRLGPPES FT FLQMQQAPERPSNQEDPPVPKQHALKSGPLLPHQIHQPVFQSVQQQVPRSA FT AWPPNQQLEPWLFSSKPPLVPPSQQSVLLPLGQQTRQPTSTSFPPWQQSVS FT QHAEGVPVVPPPRPPPGFPNPLSRSLSFQPDQSQAAWGQFQQQLAARQVVP FT KELPVFSGNPEDWPLFVSSYRNSTEMCGYSDAENLMRLQRCLKGSALEAVR FT SNLLLPTSVPQVMATLETLFGSPDRLLQTMLNKVRNVPAPKAERLETLVNF FT GITVQNLVGHLKGANQQAHLMNPSLLNELVEKLPANIRLDWALHKQRAGFV FT DLGTFSEYMMAITSAASDVTQFSDQDVHRANRSDRQRPKEKVFMNTHSTSD FT VRKSGIVGGKLVDDKARPCYVCQSLKHRIRDCPRFKTLAVGDRWKLVSTHL FT LCKICLVPHGKWSCKSNKVCEVGGCSKGHNTLLHPGQPEESSRSGASAPAV FT VSVHRHLQSTILFRIIPVTLYGKETHVSTFAFLDEGSQLTLMDADIADQLN FT IVGEVRPLCLTWTANIKRREAASQQVSLQISGSLSDAKFVLDEVRTVEGLA FT LPTQTLQYSELTRQYPYLEGLPIQDYKKAVPKILIGMDNIHLSLSLRRRER FT GAREPVATKTKLGWTVFGCVGDPAKMNESPSFHICECEELNGIHELVESFF FT TEESLGIVAPRTLEGEEERRARELLESTTVQRVDGRFETGLLWRYDEVQFP FT QSYAMAERRLACLHRRLKSSPELLANLNAQIADYQQKGYAHKASKQEIEEF FT EPKRTWFLPLGVVINPRKPGKIRMIWDAAAKVNGISFNGMLLKGPDLLSPL FT PKVLYHFRQRQVAVVGDVREMYHQFFIRQADRSAQCFLWSPDPSQTPEIFV FT MDAATFGSTCSPCQAQFIKNLNAKNWTNEFPEAAVALVENHYVDDYLDSRD FT TAAEMAKLATDVRTVHAKAGFEIRNWQSNSQQVLEKLGEKLTAQRKDFTSN FT KTSHVERVLGMAWLPSEDVFTFILTLPEDLQRLVSGDSAPTKRRILRFIMS FT LFDPLGLISHVLIHGKIIIQDLWRSKVGWDDPISEEVDRDWRRFVDVFDRL FT NQMQVPRCYFPEYDRNSYDTLELHMFADASLSAFSCVAFFRIIDRGQVRCS FT LVASKAKVAPLKPLSIPRLELNAALIGSRLAKSITDSHSLEIKKRYFWTDS FT RTVLLWIRSDARRYRQYVAVRIGEILDVADDATKWKGAPDLNPTNRWFRGP FT DFLYKMESEWPQQGIEYVEDPAELRAAAVHVHSEVINEVFIECSRFSKWER FT LLRSTAYVFLFIRQCRGTRSTGEASGRLDQEDFQKAERVLWRSAQKDAYSN FT ELKALEQIRSGNENEAISEVQKNSPLRKLSPFIDEGGVLRMGGRIGASPFA FT TYDAKFPVILPKRHKVTDLIIDKYHRRYGHQNGETVVNELRQKYHVSNLRA FT EVRRVARNCQWCRTYKAEPQIPRMAPLPKERVTPFVRPFSSVGIDYFGPYS FT IKIGRSLVKRWVELYTCLAVRAVHLEAAASLSTESCMLALRRFIARRGAPE FT KIFTDNGTNFVGAERQLAKQLKDVNLKLSESFTNRNTKWIFIPPASPHMGG FT AWERMVRAVKAAMNSMNHPRTPTEEVFLTVLCEAESMVNSRPLTYMPLEHS FT EQEALTPNHFILLSSDGVKQTEKIPTEMRTIQRSDWKLCQHILDQFWCRWI FT HEYLPAITRRTKWVAEAKAIAVGDVVFLVDGSVRNRWTRGVITAVLPGKDG FT RIRQATVRTSTGTFRRPVAKLAVIDILPESKAEEPEQLYGAG" XX SQ Sequence 6131 BP; 1684 A; 1409 C; 1666 G; 1372 T; 0 other; ttctttaaag aattccgacc gaggcgttgg tcgaagacgg atttccagca gaaccaccat 60 aggagaggct gtttgacgat ggcaacgagc tcggatttgc agaattgcat tgcatgcaat 120 atgccggacc gggaggagaa tttggtagcc tgtgatgcct gtaaggatcg gtaccatttc 180 tcatgcgctt cggtagacga cggtatccga gagaaatgtt ggtcatgcag caaatgtgcc 240 tctgccatta acgtggacga cgtgtcgaaa tccggaaaat ccggcaaatc cagccggagc 300 agtcggtcga tgaggattca acttcaactg atgaagattg aggaagagcg taaggccgaa 360 gaaaagctga tgttggagcg acaacagcta gaaaaggaac gccaggaaaa agctttgcag 420 gagaaagcag ccatggaaaa gcggtttcga gatgaaaaat acgcccttct gatggctgaa 480 gcgaacgaag atgacgacga aagcattgcg agtcagaaaa gtcgagtgag cagtaaggat 540 aaagttaacc ggtggataaa ggatcacgtt ggcatgttag aagacagcga cggtcaacct 600 tatccggttg gtggaaaatc aggatctagg gaccccccag cgaaaccagc aaatagggtg 660 gaagaaggag caaagcgcct tgaaacccca cttacgtcgt tgggcgaaaa tgtcggcaaa 720 tctaaagcaa cggaagtact gaaacaagga gccaccacat cgacgcccaa tcagccaaat 780 cgtttgggac caccagaatc gtttttgcag atgcaacaag cacccgaacg accatcgaac 840 caggaagatc cgccggttcc gaagcagcac gctttgaaat caggaccact gctgccccat 900 caaatccacc aaccagtttt ccagtcggta cagcagcaag ttccacgatc tgcagcatgg 960 ccaccgaatc agcaattgga accgtggcta ttttcttcga aaccaccatt agtgcccccg 1020 agtcagcaat cagtactact tccgctgggc cagcaaacac gacagcccac ctcgacgtca 1080 ttcccaccat ggcagcaatc tgtctcacaa cacgccgaag gtgtgccggt ggtgccaccc 1140 cctcgaccgc ctcctggatt tccgaatcca ctaagcagat cgctatcgtt tcaacccgat 1200 cagtcccaag ctgcatgggg ccaatttcag cagcagctgg cagcaaggca agtagttccg 1260 aaggagctac cggtgttctc gggaaacccc gaggactggc cgcttttcgt gagtagctac 1320 cgaaactcaa cggaaatgtg tggctattca gatgcagaaa acctcatgag gctgcagagg 1380 tgcctcaaag gcagcgcgct ggaggctgtt cgaagcaatt tactgttgcc tacgtcagtc 1440 cctcaagtca tggcgacact agagacgctg tttggaagcc ccgaccgact gctgcagacc 1500 atgttgaaca aggtgcgaaa tgttccggca ccgaaggcag aacggttgga aacattggtc 1560 aacttcggca taaccgtgca aaatctagta ggacacctga aaggagcgaa ccagcaggcc 1620 catctgatga acccttcgct gttgaacgag ctagtggaga agttgccagc caatatccgt 1680 ttagactggg cactccacaa gcagcgtgct ggattcgtag atctgggaac cttcagcgaa 1740 tacatgatgg cgataacttc agcagccagc gatgttacgc agttcagcga ccaagacgtc 1800 caccgagcca accgaagcga taggcagcgt cccaaggaga aggttttcat gaacacgcat 1860 tccacatcag atgttcgtaa gtcagggatt gtcggaggta agctagtaga cgataaggca 1920 agaccgtgtt atgtttgtca gagcttgaag catagaatca gggactgtcc tagatttaaa 1980 acattagctg tgggggatcg ctggaaattg gtcagtacgc atctactgtg taaaatttgc 2040 ttagtgccgc acggcaagtg gtcatgtaag tccaataaag tgtgtgaagt tggaggctgt 2100 tccaaagggc ataatacttt gttacatccg ggtcagcctg aagagtcttc acgttccgga 2160 gccagtgcac cggcggtcgt ttctgtgcat cgccatcttc agagtacgat cctatttcgc 2220 attatcccgg tcacgctcta cggcaaagag acacacgtat caaccttcgc atttctcgat 2280 gagggttcac aactgaccct gatggatgcc gacatcgcag atcagctgaa tatcgtcgga 2340 gaagtacgtc cgctgtgcct cacttggaca gcaaacatca aacgacgaga ggccgcatca 2400 cagcaagtta gtttgcaaat ctctggttca ttgagtgatg cgaaattcgt gttggacgaa 2460 gttcgcacgg tagaaggttt agctctaccg acgcagaccc ttcagtacag cgagctgaca 2520 aggcagtatc cgtatttgga aggacttcca atccaggact acaagaaggc ggttccaaag 2580 atccttatag ggatggacaa cattcatctc tctctttctt tgagaagaag ggagcgcggt 2640 gcacgagaac ctgtcgctac gaaaacgaaa ttaggttgga ctgtattcgg ctgcgtagga 2700 gatccagcga agatgaacga atcgccgtcg tttcacattt gcgagtgtga ggaactaaat 2760 ggaatccatg agctggtgga aagtttcttc acggaggaga gtctgggcat tgtggcgcca 2820 cgtacattgg agggtgagga agaacgccgt gcaagggagc ttttggaaag cacaacggtg 2880 caacgtgtag atggtcgttt cgagactgga ttactgtggc gttatgacga agtacaattt 2940 ccacaaagtt atgccatggc ggagagaaga ctagcctgcc tgcaccgtcg actgaaatcg 3000 agtcccgaat tgctagcgaa cttaaatgcc caaattgcag actaccaaca aaagggatat 3060 gcacacaaag cgtcgaagca agaaattgag gagtttgaac ccaaacgcac ctggtttttg 3120 ccgttgggag ttgtcattaa tccacggaaa cccgggaaga tccgcatgat ctgggacgca 3180 gctgccaaag ttaatggcat ttcgttcaac ggaatgctac tgaagggtcc agatcttcta 3240 tcgccactgc caaaggtgtt gtaccacttt cgccagcgac aggtagcggt ggtaggagat 3300 gtgcgggaga tgtaccacca atttttcatc cggcaagctg atcgttctgc gcaatgtttc 3360 ctttggagcc cggatccgtc gcaaaccccg gagattttcg taatggatgc agcaaccttc 3420 ggttcgacgt gctctccgtg tcaggctcag tttattaaaa atctgaacgc gaaaaactgg 3480 accaacgagt tcccggaagc agccgtagca ctggtagaaa accactatgt ggacgactat 3540 ttggacagta gggatacagc agcagagatg gcgaaactag ctacggacgt tcgaacggtg 3600 catgccaaag ccgggttcga aatcaggaat tggcagtcga attcgcagca ggtgttggaa 3660 aagctaggtg agaaactcac agctcagcgg aaagacttca ccagtaataa gaccagccat 3720 gttgagcgcg tactagggat ggcttggctt cctagcgagg acgtgttcac ctttattcta 3780 acattaccgg aggacttaca gcgactcgtg tcgggagatt ctgcgcctac gaaaaggagg 3840 attttacgat tcatcatgag cctgtttgat ccccttgggc ttatatctca cgtcctcata 3900 catggaaaga tcatcattca agatctgtgg cggagtaaag tcggatggga tgatcctatt 3960 tctgaagaag tcgatagaga ttggagaagg tttgtagacg ttttcgatag gctgaaccag 4020 atgcaagtgc cgcgctgtta ctttcccgaa tacgacagga acagctatga tactttggag 4080 ctccatatgt ttgccgatgc aagtttgtca gcgttttcgt gtgtggcgtt cttccgcatt 4140 attgatcgag ggcaggtacg gtgttcgttg gtggcttcga aggcgaaagt agcaccgctt 4200 aagccattat cgattccgcg actggagttg aatgctgcgc taatagggag tcgtttagcg 4260 aagtccatta ccgacagtca ttccctggag atcaagaagc gatacttctg gactgattcc 4320 agaaccgtgc tactgtggat acggtctgat gccaggagat atcgtcagta tgttgccgtg 4380 cgaataggtg aaatcctcga tgttgccgat gatgcgacga aatggaaagg cgcgccagat 4440 ctgaatccaa ccaaccgctg gttcaggggt ccggattttt tgtataagat ggagagtgaa 4500 tggccacagc aagggattga gtacgtcgaa gatccggcag agttacgggc agcagcggta 4560 catgtgcaca gtgaagtcat caacgaagtt ttcattgaat gcagccgatt ctcgaaatgg 4620 gaacgtcttc tgcgatcaac tgcgtatgtt tttctgttca ttcgtcagtg tcgtggcaca 4680 agaagtactg gtgaggcatc tggaaggctg gatcaagaag atttccagaa agcggagcgt 4740 gtattgtgga gaagcgcgca aaaggatgcg tattcaaacg agttaaaagc tttggagcag 4800 ataagatcag gaaatgaaaa cgaagcgatc agcgaagtgc aaaaaaatag tccgttgagg 4860 aagctatcgc cgttcatcga tgaaggtggt gtgctgcgca tgggcggtcg cataggagct 4920 tcgccatttg ctacatacga tgcaaaattt cctgtcatat tgcccaaaag acataaagtt 4980 accgatctca tcatcgacaa gtaccatcgg aggtacgggc atcagaatgg tgaaactgtg 5040 gtaaacgaac taaggcagaa ataccacgtt tcgaacttgc gcgctgaagt tcgcagggtt 5100 gctaggaatt gtcagtggtg ccgcacttac aaagcagaac cgcagatccc gcgaatggct 5160 ccgcttccaa aggagcgtgt tacgccattt gttagaccgt tttcaagtgt cgggatagac 5220 tattttggcc cgtattccat taagatcggt aggagtctcg tcaagcgatg ggtagaactt 5280 tacacttgcc ttgcagttag agctgtgcat ctcgaggccg cggcatctct gtcaacggag 5340 tcgtgtatgc tggcactccg tcgatttatc gctagacgag gagcaccgga gaagattttc 5400 accgacaacg gtacgaattt tgttggcgca gagcggcagc tagcgaaaca actgaaggac 5460 gtgaacctga agctatctga aagtttcacc aatagaaaca ctaaatggat ttttatccct 5520 cctgcttccc cgcacatggg cggtgcgtgg gagagaatgg tcagggcggt taaagctgcc 5580 atgaactcaa tgaaccatcc aagaacccct actgaagaag tgttcctgac cgttctgtgt 5640 gaagctgagt ccatggtcaa ttcacgacca ttaacctaca tgccattgga acattcggag 5700 caggaagcac tcactcctaa tcactttatt cttcttagct cagacggagt gaagcaaact 5760 gaaaagatac caacggagat gaggacaatt caacgaagtg actggaaact ttgtcagcac 5820 attctagatc agttctggtg cagatggatc catgagtatc taccagccat tacgagacgg 5880 accaagtggg ttgcagaagc taaagcgata gcagtcggtg atgttgtctt cctcgtagac 5940 ggaagtgtaa ggaaccgctg gactcgaggt gtgataacag ctgttttgcc gggaaaagat 6000 ggacgtatta ggcaagcaac ggtccggact agtacaggaa catttcgcag gcccgtcgcc 6060 aaactggcgg tcatagacat tcttccggag agtaaagctg aagaaccaga gcagctttac 6120 ggggcgggga a 6131 // ID R1N_NLo repbase; DNA; INV; 851 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE Non-autonomous SINE-like non-LTR retrotransposon R1 in Nasonia DE longicornis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1N_NLo. XX OS Nasonia longicornis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-851 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is less specific to 28S ribosomal RNA genes, compared CC to other R1 elements in Nasonia. XX SQ Sequence 851 BP; 200 A; 181 C; 274 G; 196 T; 0 other; cggaagaccc ccgagagggt ggcaattctg ccagggcagt tgggtgaacc ctgagatttc 60 ggtagcctcg gtatacctcc cacgaaagga agggggaagg ttcagtaatg ttccgagctg 120 gaactagtct agtgattgaa ggaccatcag gttggtgaag atcaaaacta gtatacccag 180 gcccttgtaa gctagtcggg ctctaactag ctgcttggga ctctagccct gggaacgctc 240 cggcgtactg tatggttggc caatgccaaa aacgagtggg acgtttagaa cgtggtagca 300 tgcttagtac gcttcggcgg aaacgtttgt gaactctcta ctggggagga taaaatcagt 360 accacggggt cggggagccg caggatagtg aatcctgtcc cgacagggta aaggactcgt 420 ggtggcagtg gttgcccgta tgctgggcag agagtttatc tttcgatgtg tgagttgcag 480 tgcacaactc gggtattgac ctgcagagtc atgagggatt agatgggtcc cgccgcccat 540 ccgagcggca ggttgacttc ccacttaagg tcacctggca tcggtaggcc gttgaatgta 600 tacgcgtaca ttctttggtc gcgttggccg agcgtacctc ggcccgttgt cctacggcat 660 accgaaagtc agtggaggct cgccggatat gttgcgaggt ggttaaagta gtgtggatga 720 gaaaagtgtg tgtgtgtgca taaggccgag tggcgtatac tggcgtgtca aagctgatta 780 gtaagtgtgc ctagagggcc atgtgggtga gccataaggc aggtactgca tgtaaaaaca 840 aaaacaggaa c 851 // ID CR1_Ele29 repbase; DNA; INV; 4480 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele29. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4480 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4480 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >97% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 591..1469 FT /product="CR1_Ele29_1p" FT /translation="MEICNSCAVNMTAPEVVCGGFCKATFHFKCAKISDTL FT YSQIVTNSATFWMCQGCREIMGNARFKNTLNSMNAATKEVNDMYQKLVDDL FT KTEIKDSLIAELKQEIQGGFNKLSPAILSPAPXRFQFNNRSASKRLRDRDE FT VTFQLPDQPSKVFRGTNQAINASEDSADRSXNRFWIYLTKISPEVTEDDVV FT NLAKNCLQTENVVAKVLVPKGRPLSSFSFISFKVGVSNDLKSRALDPSNWP FT REIQFREFVETSSSVRHFWKPGQAADPGVSNNPELRPAPASTSYQPHQQLT FT E" FT CDS 1355..4417 FT /product="CR1_Ele29_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RSTFLETRTGCGSWGFEQPRIASCSSVNLVSASSTTH FT RIDSSDNPLMLSTPYLIPHNTSPTCSTSGSQSFPTSLTVYYQNVRGLRTKT FT NNLLLSLLTCDFDIVVLTETWLHSEIASSELTCNYVIYRCDRNSSTSNLRR FT GGGVLIAVKSELNCKAANLTSPVDLEQTIVQIVLPHLSVYICCVYIRPNSH FT PDIYTKHVESVDQVLKEVNAHDIILCLGDYNLPNLTWRFDEDLSGFLPTNV FT STEQEIALVEPMLSTGLYQINDVLNVNDRLLDLVFVSDPCIIDLFESPSSL FT LKIDAHHKPIILSLDARSRHARLQSSPELSADYDFSRCDYHAINNLIASQD FT WNQLLDQDSVDNSVATFYHVINRIISETVPVKAKRSTKSFRQPWWTPQLRN FT LRNRLRKARKRFYRNRNPTTISNLKTAEDDYLRLQHNRFREYVNSLQSNFK FT QDPSSFWTYINKRKSSATIPADVSYRTRSAGSPEENANLFAEFFRSVYVEN FT PPPISQDYVNSLPSYNIQLHRVTFDGDSIRAALSKVDASKGPGPDLIPPSF FT IKQCAQSLAAPIASIFNRSLIEGVFPEAWKLASITPIHKSGSVHDVENYRP FT ISILCCLAKVFETLLHDALYPVVQPAISESQHGFVKKRSTTSNLMTFTHAV FT FDKLEKRSQVDAIYVDFSKAFDKVPHNLAIAKLNRLGLPCWTIQWLKSYLS FT SRKAFVKVQGARSDIFGISSGVPQGSHLGPLIFVLFINDLCDQLDCGKLLY FT ADDLKLFRTIKTLLDCCALQADVDRVSNWCDLNGMQMNANKCKVISFSRRQ FT STTVFDYSLNSSSLERVSSIKDLGVILDSKLRFHEHIATMTAKANAMLGFL FT RRNTQLFDDAYALKSLYCALVRSVLEYGVQIWAPYHAVHVESIERVQKRFI FT RYALRGLPWNDPAHLPPYEHRCALIGVQSLANRRTLLQRLFVFDVITGNID FT CSSLLANVQFHAPARQLRNRQLLWIPSHRSLYGYNNPLDSCCRLFNDVCDV FT FDFNISKLVFKNRIK" XX SQ Sequence 4480 BP; 1190 A; 1112 C; 916 G; 1259 T; 3 other; tcattaaaat aagtcaggta ggatagtgaa tttctacgaa aattccgagt cggcttgagt 60 gtggttcgag tttagtaagc ccactggcac agccgcctgc catatctact cccacctcgc 120 cgctctccac ctgaactgcc gttgtattac gatcaagtca tcgctgctct aataccatca 180 caaaatcacc acctccaccg ccggcctcga tcagcatttg tatgtcctgc gaaaaatcaa 240 attccgttcc tcaatccaaa tctacacccg attgtactgc caagcctgct gaataaagcc 300 tgattttcta ctgctgttct cactccgata tccgaaaacg acccagcata aacctgccct 360 cctgcctatt cgccgcatcg cctgcattcc caactgcata gaaggtaaac aaacatcatt 420 ccgtcccgct catcccgtgc gtagtttgag gctgtgtgtg cacactcata cattacccga 480 tcagattgtg attatccgtc atctcgccag tgctgccatt tgttgcgatt ttccgaagtt 540 ttttttttgc ggtaagagca tctccacccg ctgtttcaat tttcaccgat atggaaatct 600 gcaacagttg tgccgtcaac atgactgctc ccgaagtcgt ctgtgggggt ttctgcaaag 660 caactttcca cttcaagtgc gctaaaatat cggatacgct ctacagtcaa atcgtaacta 720 actccgctac cttctggatg tgccaggggt gtcgcgagat tatgggtaac gcacgtttca 780 agaatacttt gaattcaatg aacgctgcta ctaaagaagt aaatgatatg tatcaaaaat 840 tagtcgacga tctgaaaacc gagattaagg acagcttgat tgccgagttg aaacaggaga 900 ttcaaggcgg cttcaacaag ctttctccmg ctatcttgtc tccagcgccc gkccggtttc 960 agttcaacaa ccgttctgct tcgaagagat tacgtgatag agacgaggtc acctttcaac 1020 tgcctgacca gccttcgaaa gttttccgtg gaacaaatca agccatcaat gcgtctgaag 1080 actccgctga taggtcggaw aacaggtttt ggatatatct gacgaaaatt tcgcccgaag 1140 tcacggaaga cgacgttgtc aaccttgcaa aaaactgctt gcagaccgaa aacgtggtag 1200 cgaaggtttt ggtgcctaaa gggcggccgc tatcgtcgtt ttcattcata tcgtttaagg 1260 taggtgtcag taacgattta aaatcaagag cattggatcc ctctaactgg ccccgtgaaa 1320 ttcaattccg agagttcgtc gaaaccagtt ctagcgttcg acatttttgg aaaccaggac 1380 aggctgcgga tcctggggtt tcgaacaacc cagaattgcg tcctgctcca gcgtcaacct 1440 cgtatcagcc tcatcaacaa ctcaccgaat agacagttcc gacaatcctc ttatgttatc 1500 cacaccgtac ctaattccgc ataatacgtc tccgacttgt tctacgtccg gtagtcaaag 1560 cttccctacc tcactgacag tttactatca gaacgtccgt gggttgagaa caaaaacaaa 1620 caacttgctt ctctctctgc ttacttgcga cttcgacata gtggtgttaa ctgaaacctg 1680 gctccactcc gaaattgctt ccagtgaact aacttgcaac tacgtaatct atagatgtga 1740 tcgtaattct tccaccagca atcttcgaag aggtggcgga gttctaattg cagtgaaatc 1800 tgagctgaat tgcaaagccg ctaatctgac atctcctgtc gatttggaac aaactatcgt 1860 gcagattgtt ttaccacatc tttccgtgta tatttgttgc gtttatattc gaccgaacag 1920 tcatcctgat atttacacga aacacgtcga atctgttgat caagtactga aggaagtcaa 1980 cgcccacgat attattttat gccttggtga ttacaatcta cctaatttga cctggcgttt 2040 tgatgaggat ctttcagggt ttttaccaac caacgtatca accgaacagg agattgcctt 2100 agttgaacct atgttgtcga ctggtttgta tcaaatcaat gacgttctga acgtgaacga 2160 taggttactt gacttggttt tcgtaagtga tccatgcata attgatctct tcgagtcacc 2220 atcatcgctt cttaaaattg acgctcacca taagccaatc attttatcgt tggacgcccg 2280 ttctcgtcac gcaagactgc aatcatctcc tgagctcagt gctgactatg acttctcccg 2340 ttgtgactac catgctatta acaacttgat agcttctcag gattggaatc agcttcttga 2400 ccaagactct gttgataact cagtggctac tttctaccac gttattaacc gaatcatcag 2460 cgaaactgtt cccgtcaaag ctaagcggtc tacgaaatcg ttcaggcaac cttggtggac 2520 accgcagctt cgtaacctcc gcaaccgttt acgaaaagcc agaaaaagat tctatcgtaa 2580 cagaaatccc acaacgatca gtaatcttaa gacagcggag gatgattatt tgcgcctaca 2640 acataatcgt ttccgggaat acgtcaacag cctccaatcg aattttaagc aagatccatc 2700 atccttttgg acgtacataa ataagcgaaa aagttcagcc acaatccctg ccgacgtttc 2760 ttatcgcacg cgtagtgccg gctcacccga agaaaacgca aatctctttg cagagttctt 2820 tcgttcagta tacgttgaaa atccacctcc aatatctcag gactacgtaa acagccttcc 2880 atcgtacaac atacagctcc atcgagtcac gtttgacggc gatagtattc gtgcagcact 2940 cagcaaagta gatgcttcaa aaggtcccgg acccgacctt attccgccaa gcttcatcaa 3000 acaatgcgct caatcgcttg cggcgccgat tgcatcgatt ttcaaccgtt cgttgatcga 3060 gggagtcttc cccgaagcgt ggaagctcgc gtctatcact cctatacata aatcggggag 3120 tgtacatgac gtggaaaact accgaccaat atctattctc tgctgcttgg cgaaggtctt 3180 cgaaactttg ctacacgatg ctttgtatcc agttgtgcag cccgctatat ctgagtctca 3240 gcacggattc gtcaaaaagc gatcaacgac gtcgaatctg atgacgttca ctcatgctgt 3300 gttcgacaaa ctcgagaaac gtagccaagt tgatgccata tatgtggatt tctcaaaagc 3360 tttcgacaag gtacctcaca atttagccat cgcaaaactg aaccgccttg gactgccttg 3420 ttggaccatc cagtggctga aatcttattt atcctcccgt aaagcttttg tgaaagtaca 3480 aggcgcgaga tctgatatat ttggtatatc ctccggagtt ccacaaggaa gtcacctagg 3540 tccacttatt ttcgtgctct tcataaacga tttgtgtgat cagctggact gtggcaaact 3600 actgtatgca gatgatttga agctatttcg tacgatcaaa acattgctgg actgttgcgc 3660 gctacaagct gatgtcgata gagtttccaa ttggtgtgat cttaacggta tgcagatgaa 3720 tgctaacaaa tgcaaagtca tatcattcag ccgtcgtcag tccaccacag tttttgacta 3780 ttctttgaat agctcatcgc tggagagagt atcgtccatc aaggatcttg gagtgatcct 3840 tgattccaaa ctccgattcc atgaacatat cgccacgatg acagccaaag ccaatgcaat 3900 gcttggtttt cttcgccgca atacacagct tttcgacgat gcttatgcac ttaagtcact 3960 ttattgcgcc ttagttcgca gtgtgcttga atacggagtg cagatttggg ctccgtatca 4020 tgctgttcat gtcgaaagta tcgaaagagt ccaaaagcgt ttcataaggt acgctcttcg 4080 aggtctgcca tggaatgatc ctgctcacct gcctccatat gagcatcgct gcgcgctgat 4140 aggtgtgcag tctttggcta accgtcgaac tctgctgcag agattatttg tcttcgacgt 4200 tatcaccggc aacattgatt gcagcagtct gctggcaaat gtgcagtttc acgcgccagc 4260 tcgtcagctc cggaatcgac aattgctctg gataccttcc catcgctcgc tctatgggta 4320 taataatccg ctcgattcat gttgtagatt gtttaatgac gtttgtgatg tttttgattt 4380 taatattagt aaacttgtgt ttaagaatag aattaagtaa acctaagctc agtctgtgcg 4440 gcataatgtc gaagatgttg taaataaata aataaataaa 4480 // ID BEL-5_AA-I repbase; DNA; INV; 6233 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_AA_; KW BEL-5_AA-LTR; BEL-5_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6233 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 859-859 (2011). XX DR [2] (Consensus) XX CC Positions [4321-4905] - Integrase core CC 'ATCAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1138..5280 FT /product="BEL-5_AA-I_1p" FT /translation="MVKPEEFKELKKQEKQLMGIIAGIARFVEKFQKGRHE FT PQIDVRLETLEEAIKRFYSVRRKIELAYEDEDGAGEKIAENTRAEREKRER FT QFEETFEAVQEQYFEVKTALIQLRPKVENSRPAGGNNAAETQPASRVKLPE FT IKLPSFSGRIQDWTSFRDTFRSLIHNNQQLADVDKFTYLRSSTSGEALQEL FT NSVELTGDNYEVAWKLLEKKYENKKLIVKALLDALFAVEPMRKESCEALNH FT VISEFDKNLLMLNKIGEQTKGWSTILAYMVCSCLDSTTLRHWETHHNSKDV FT PKFEDLMKFLRDQCAVLQSIAPAKAPAGDNSRHRMSTTHTSSQLQRRCEFC FT GDSYHIPFKCGRLSNMTVAQRAVEINKKRLCRNCLTAGHFAEGCSRGSCTR FT CGRKHHTLLHFEGSSSGGRQPKQSGPANSPPSGTQAPNRSNPAPAASQQQS FT QQQNRSKPQGQTQTSTQPATQTQPTQAYPVIRSQDTHPQPTTNSQPAISLK FT STGSSHSGGTTRQVLMSTAIVKVVDRFGNTALARTLLDSCSEFCYMTCSFS FT RKLKFHEQRDFLQVQGIGNSSVTSTKAVEAIIEARSPSISSFGERMQFHLL FT PKITQTLPLKPVRVELCEIPTEVLLADPTFGEPGPVDMIIGAEFYFDVLRA FT GRRKLSESGPTLQETVFGWVVSGRVPETTSVMPTTASYVSTTVDLQDLIAR FT FWELETCHVNRTHSVEETACEEMFNKSTVRDQEGRFVVTLPKKEHMLERLG FT ESRNIAMKRFIGLERRFTMNAELKEAYKEFIHEYLLMGHMKEVDGEQPTAD FT PVYYLPHHAVLRPDSTTTKLRVVFDASCKTSTGISLNDALMVGPVVQNDLI FT STILRFRHHRIAVTADVAKMYRMVRVPEQDQHLQRILWRDTPEEPVKTFEL FT LTVTYGTASAPYLATRCLKKLGEDSVSTHPIASRVVQEDFYVDDMLSGADT FT VNQTRKLMNEVIELTNTVGFTLRKWNSNSAGLLSKLPKHLRDERTVLDLDS FT SNATVKTLLTSAVRGKFWPVHINSLVRKVIHDCIPCFRNKPKVLEQIMADL FT PSVRVNPAPPFMNVGVDYCGPFFVNYPNRRASPVKCYVAIFVCLVVKAVHL FT ELVMDLTSQAFLAALKRFTARRGRPKLIKCDNATTFVGAKRELTELHRLFH FT AQKFQDRLIKDTSSDGIEFRFIPPRTPNFGGLWEAQVKSFKGHFKKAIGTK FT VLKVDEMLTALAQIEAVLNSRPLTPISNDPNDYEALTPGHFLGQRPLTAIP FT ERDLQDVSTNRLDKWEDAQQVAQQVWSRWSTQYLSDLHNRTKWTKQRDNIR FT VGMMVLLKDENAPPLKWHLGRVTKIFKGSDGNIRVVTVRTKDGCFDRGISK FT VCPLPIRDNEDTQQLSSA" XX SQ Sequence 6233 BP; 1517 A; 1725 C; 1728 G; 1263 T; 0 other; tatggtcctt cgaaccggat tcgaccggtg ggacttagta tcgtagtcag tgcaagtggc 60 ggttaatctg agcccgccgt gaaagccgtt gattggcaat aaaaagtgca acaatcacgc 120 tcctatcgcg aagagaaaaa ccattccgca gagccgggat agggtttgaa caaaagtgct 180 actgttgtgt tgtgtacgaa ttcaaactcg cgccatcgaa acggctacgc gttccctgtg 240 ttgggataga acgtagaaag aaagtgaatt gtgtgcaaac cccgccgtct aaacggccag 300 cctatccagc agagatagag aaaaaagatt accgcgtgtg tgatatcgtc ggcattgtat 360 cgccgttgtt cgatctcccc gacggtggag tacagtaatc cccccggtag tgcgttcaag 420 tggggctgct gttcggcgaa ttatttcgcg aaattcggcc gacggcgatt atttgaagag 480 gccgttggtg aagtgattgt aaaaaatcga ttgctgcgga gtgtccgacg gcacacatcg 540 acgggggcca cctacggacg ggtggagcga gttgacggcg atcattggcg agtcgtcggt 600 gcaggccagc ggcgaggaga aagctacgta atcatcctga ttgccgtcag ggaagtactt 660 gcatggtttt tggggtggtt gggttgcatg agggttcgag taaatgagtg gaaaattcaa 720 taaaaattga aattcgcttg agcgttttca tttctgggtt tgtggcttgg ttgggtttga 780 tcgcccttgg aactagtgct tggccctttc tctggttcaa aaattgctgt ctttgtcatc 840 gattcattct gtcgcttgtc aattcgctgg ctttcggtcg gtcgcgcgcg tccgttgagt 900 gattaatccc agtagcagtg tctgcggtcg tgcggcgcat ggcgaataca gtgaagaata 960 gtgaagtgca gtgatcaaaa actcttttca cagtactgct ggccagtgat aagtgacact 1020 ctcggtgtgc gaaattgcac ctggacttgt gctgtgaact gtgaattgcc aaccggcaaa 1080 agtgactatt tcccagcagt attgtgaagt tgtgggttac agaagcattc tgtgaagatg 1140 gtgaagcctg aagagttcaa ggagctgaag aagcaagaga agcagctgat ggggatcatc 1200 gctgggatcg cccggttcgt cgaaaaattc cagaagggaa ggcacgaacc ccaaatcgac 1260 gttcgcttag agaccctaga ggaggccatc aagcggttct acagcgttcg gcggaagatc 1320 gagctggcgt acgaggacga ggacggcgct ggggagaaga tcgccgagaa tacccgggca 1380 gagcgagaga agcgggaacg gcagttcgag gagaccttcg aagccgtgca ggagcagtat 1440 tttgaagtga aaactgccct cattcagctg cgtcccaagg tggagaactc acggccagcc 1500 ggaggtaaca atgcggccga aacgcaacca gcgtcccgcg taaagcttcc ggaaatcaag 1560 ctgccgagtt ttagcggaag gatccaggat tggaccagct tccgagacac attccgtagt 1620 ctgattcaca acaaccagca actcgcagat gtcgacaagt tcacttacct gcggtcctcc 1680 acctcgggag aagcactcca ggagttaaac tcggtggagt tgaccgggga caactacgag 1740 gtggcttgga agcttctgga gaagaaatac gagaacaaga agctcattgt caaggccctc 1800 ctggacgcac tgttcgccgt tgagccgatg cggaaagaaa gctgcgaggc gctgaaccac 1860 gttattagcg agttcgacaa gaaccttctc atgttgaaca agatcggcga gcagacgaag 1920 ggctggtcga cgattctcgc ttacatggtg tgctcttgcc ttgactcgac cactctccgt 1980 cactgggaga cgcaccacaa tagcaaagac gtccccaagt ttgaagatct gatgaagttc 2040 ctgagggatc agtgcgcagt gctgcagtca atcgcccctg cgaaggcccc cgctggcgac 2100 aacagccgac atcgaatgtc tacgacgcac acgtcatccc agttgcagcg gcggtgtgag 2160 ttctgcgggg attcctacca tatcccgttc aaatgcggca gactgtcgaa tatgacggtg 2220 gcccagcgag cggtagaaat caacaagaag cgcctctgcc ggaactgcct gaccgctggt 2280 cacttcgctg aaggatgctc cagaggttcg tgtacccgct gtggacgaaa acaccacacc 2340 ctcctccatt tcgaaggttc ttcaagcggc ggtcgtcaac cgaagcagtc tggtcctgcc 2400 aattctcccc cctccggtac gcaagcgccg aaccgatcga acccagctcc agcggcaagc 2460 cagcaacagt cacaacagca aaaccgttcg aaacctcagg gtcagacaca gacatccact 2520 caaccagcca cacaaacaca acccacacaa gcataccccg tgattcgctc acaagacact 2580 cacccacagc ccaccaccaa ctcccaacct gcaatttccc ttaaatccac cggatcgtcc 2640 cacagcggag gcacaacgcg acaagtgttg atgtcaaccg cgattgtgaa ggtggtggac 2700 cgttttggca acactgcgtt ggccagaacc ctcctcgact cgtgctcgga gttctgctac 2760 atgacctgta gcttctccag aaagttgaag ttccacgaac agcgggattt cctgcaggtg 2820 cagggcatcg gcaacagcag cgtgacgtcg acgaaagcgg tcgaagccat catcgaagcc 2880 cgaagcccat cgatttcctc gttcggggag cgcatgcagt tccacctgtt gccgaaaatt 2940 actcaaactc ttcctctcaa accggtacgc gtggagctgt gcgagattcc cactgaagtg 3000 ctgctagcgg atcccacctt tggcgagccc ggtcctgtag atatgatcat cggtgcggag 3060 ttctactttg acgtgctgcg ggccggaaga agaaagttgt cggagagcgg tcccacgctg 3120 caggagaccg tcttcggctg ggtcgtgtcc ggcagagttc cggagaccac aagcgtcatg 3180 cccacgacag caagctacgt cagcacgacc gtcgatctgc aagacctgat cgccagattc 3240 tgggagctgg agacgtgcca cgtgaaccga acgcattcgg tggaagaaac ggcctgcgag 3300 gagatgttca acaagtcaac ggtccgtgat caagaaggca ggttcgtagt gacgctgccg 3360 aagaaggagc acatgctgga acgcctggga gagtccagaa acatagctat gaagcggttc 3420 atcggcttgg agagaaggtt caccatgaac gccgagttga aggaggccta caaggagttc 3480 atccatgaat atctgctgat gggccacatg aaggaagtcg acggagagca acccactgcc 3540 gatccagtgt attacctgcc gcatcacgcc gtgttgcggc ccgatagcac gacaaccaag 3600 ctgagggtcg tgttcgacgc gtcgtgcaaa acctctacag ggatttccct caacgacgct 3660 ctgatggtgg gaccggtcgt ccagaatgac ctgatctcca ccattctccg tttccgtcat 3720 caccgtatcg ccgtcactgc cgacgtcgcc aaaatgtacc ggatggtccg tgttcccgag 3780 caagatcaac atctgcagcg aatcctttgg agagacacac cggaggaacc tgtgaagacg 3840 ttcgagctgc tcacagtcac gtacggcact gcgagtgcac cgtacctggc cacaagatgc 3900 ctgaagaagt tgggagaaga tagtgtctcg acgcatccga tcgctagtcg ggtggtccaa 3960 gaggacttct acgtcgacga catgttgtca ggcgcggaca cggtcaacca aacacggaag 4020 ctgatgaacg aggttattga gctgacaaac acggtaggtt tcactctgag aaagtggaac 4080 tcgaattcgg cagggctgct ttccaagctg cccaagcatc tgcgggatga acgaacggta 4140 ctcgacctgg attcttcgaa cgcgacggtg aaaaccctgc tgacctcggc ggtccgtgga 4200 aaattctggc ccgttcacat caacagcctg gtgagaaagg tgatccacga ctgcatcccg 4260 tgctttcgga acaagccgaa ggttctggag cagataatgg ccgacctgcc gtcggtaaga 4320 gtcaaccccg cgccaccatt catgaacgtc ggcgtggact actgcggtcc attcttcgtg 4380 aactacccca accggcgagc aagtcctgtg aagtgctacg tggcaatctt cgtctgcctg 4440 gtcgtcaaag ccgtccatct ggagctggta atggacctaa cgtcgcaagc cttcctagcg 4500 gcattgaaac gcttcactgc ccgccgcggc aggccaaagc taataaagtg cgacaacgca 4560 actacgttcg tcggtgcgaa acgcgagctg accgagctcc accgcctctt ccacgcccag 4620 aagttccagg accggctgat caaggacact agttccgacg gcatcgagtt ccgtttcatc 4680 cctccccgga cgcccaactt cggcggcctt tgggaggcgc aggtgaagtc cttcaaggga 4740 cacttcaaga aggcgattgg caccaaggtg ctgaaggtgg acgagatgct gactgccctg 4800 gcccagatcg aggcagttct caactcccgt ccactgactc cgataagcaa cgaccccaac 4860 gactacgaag ccctcactcc agggcacttt ctgggccagc ggcccctgac cgctatcccg 4920 gaacgagatc tccaagatgt ttccaccaac cggttggaca agtgggaaga cgcgcaacaa 4980 gtggcgcagc aagtctggag ccggtggtcc acgcagtacc tctcggatct gcacaaccgc 5040 accaagtgga ccaagcaacg cgacaacatt cgcgtcggca tgatggtcct gctgaaggac 5100 gaaaacgctc caccactgaa gtggcacttg ggtagggtca ctaagatctt caaagggtcg 5160 gacggcaaca tccgcgtcgt cacggtccgc accaaagacg gatgcttcga tcgcggaatc 5220 tcgaaggtct gtcctcttcc catccgcgac aacgaagata cccaacaact ctctagcgca 5280 tgaggcttac gccgattgcc gctccggtgc ttcggcacca tccatttctc gaagtcgtca 5340 agtcacccca agggcaatcg tcgtaaaggt caagttgtat cgtccatgca tgttgtatat 5400 gtcgtcagtc cgtagtcaat gtggtgaagg gttcggcgcc cacggcgtcg gcgttgacca 5460 aagggtcatc gcgagaatca gaaaactcta atcacttcgt ctatcgccgg tagcaccacc 5520 gctaccaact tttcccggag gaagccatgg taacccctcc accccgtcgt cgtcaactgg 5580 ttggctctgg gcttggccag ctcatcctgt cggccagcat cccgatggct cgaagcgacg 5640 aagtgatggt cacccgacca cagaagccct ccgcccgatc cggacactac cacccaggcg 5700 cacactccac ggagaatcgt caggcaccat tcttgcctgg gtcggtaagt tgttccggtt 5760 ttacagaaat ctatgtcagc ccactacatc atatctttcc cgaggaaacc atcaccacac 5820 gaaccactcc acgcaccacg aaagacgaag gaagccaggc atcagccgat gcgctggtgt 5880 acacggtcga gacccgacga gactctaccg atgaagccac ccgaagaaga agccgccgtg 5940 cgtccggcac ggcaatcccg aaaaaggtca tccctaaccc caaggcagcg cacgtgagcc 6000 gtagctcgac gtgtgtcggc gctgaaggcg cctcgaagca accgcagcaa cttagcagtt 6060 cccaccccct cgcgcgagtg tggcgaggtc ctagtcgaca agggtcgacg aagttttgtt 6120 ctagtaggga accttgattc caatttgtat tgttaggggt agttgtaaaa tagtatgtgt 6180 agcaatgtat gttagtcttg tttgaaactc gtctggtttc aaggcggccg gca 6233 // ID Gypsy-78_AA-LTR repbase; DNA; INV; 183 BP. XX AC supercont1.245; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-78_AA_; KW Gypsy-78_AA-I; Gypsy-78_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-183 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.245; Positions 841225 841043. XX SQ Sequence 183 BP; 45 A; 45 C; 42 G; 51 T; 0 other; tgtgaagtgt accccaacct taacttccag tgtatggcac tcaatatgag tgccatcgca 60 gtaggcttat gcaggcagca aatatctctg tcctcacttt ctatcaccga ccggcgttga 120 ggtaagacat gctgtgcgaa gacactagtt aattcctttt cggctggtat cagggtatgc 180 aca 183 // ID Copia-15_DPu-LTR repbase; DNA; INV; 322 BP. XX AC scaffold_27; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_DPu_; KW Copia-15_DPu-LTR; Copia-15_DPu-I. XX NM Copia-15_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-322 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 694-694 (2010). XX DR Genome; scaffold_27; Positions 439156 439477. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 322 BP; 90 A; 63 C; 48 G; 121 T; 0 other; tgttggaaat acagatgcgt atgtatcctt tattccaaca atctggcaac actgtttatt 60 ctgtctgatc tcaattctgt ctgctattca cttgatatct attctgtacc atattgtgtg 120 tgatcttctc acctcgtgtt gaatagacaa gtaattcgtc aacatattgt attctactta 180 tatcttgttt gattcccaca atattattga ggtaagtttc aatatcttta cgccaaaagg 240 cgatacgtct atgagacgat tatatataca catatatgtg ctgttgctaa atggagtcac 300 atttcatacc atattcccaa ca 322 // ID RTE-7_PPac repbase; DNA; INV; 2391 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 2) XX DE A family of RTE non-LTR retrotransposons: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-7_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2391 RA Jurka J.; RT "RTE non-LTR retrotransposons from nematodes."; RL Repbase Reports 10(7), 1066-1066 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. CC This sequence was derived from sequence data generated by Genome CC Sequencing Center at Washington University School of Medicine in CC St. Louis. XX FH Key Location/Qualifiers FT CDS 187..2310 FT /product="RTE-7_PPac_1p" FT /translation="MMKDDMSGIKKKELMKVERRVKWWKLRDKEERDKFAV FT EVAIRGVLSADPLDASTNVDSLWNTMTGRMIECAREVLGETKGMKRKSDDR FT WFWSDEEVKKAVKEKRNAYWQWHRRKSKESWEAYKEKRRDCRRVVAIAKMK FT TFDDLYEKLNGPDGEKIVYRITKARDKESKDIQEVKSVKDEMGNILREEKE FT VKCRWEEYFRELLNVEKGQTRLNEGDKVQGVIPEWNELEVNLALSKCKWGK FT AMGPDGIPADCWKSIGDFGVRWLTRLMNRILEDGKMPDAWRKSEIVLIYKG FT KGDVRECGNYRGIKLLSHSMKLYEKMLDRRLRECVTLNESQFGFIPACSTA FT EPIFALRMLLERHYEFGMPVVAAFLDMEKAYDRTPRRQIWRSLRDRAVPER FT YIDLIKDTYEGASAMVRTPCGKTNEIPIQVGVHQGSALSPLLFITVLDSVL FT EGLICEAPQTMMYADDICIVGTDVKEVENGVRKYQTRLNEAGLTLNTGKTE FT FIGFECGNGLMTDVNHTAIKRVEQFKYLGAMITVDGSAEKDIEHRIKCAWM FT KWRGCGGVMNDRRISLKLKGKVYKSVIRPTLLYGSEMWPISERQMDKMMVV FT EMKMLRWACGLNVRDRVRNEAIRAMTKCAKLEGKIRERRLRWFGHVKRREK FT EHVCRRMMTYTIRGKRPVGRPKLRWSEMIKQDITQLRLTERHALNRDLWKT FT KTSFPDPV" XX SQ Sequence 2391 BP; 781 A; 360 C; 720 G; 530 T; 0 other; tccattctca acacacactt taagaaaaga aagtcacagc tgctgacgta tcattcgggt 60 aaacaccatt cgcagttgga ctatctccta gtgaggagta aagatcgaag attggtgaag 120 gatacgaaag tgtttccctc tgagtgtgtc gcatctcagc ataagcctgt gatttgcgat 180 gtgtggatga tgaaggatga tatgtcaggg ataaagaaaa aggaattgat gaaagttgag 240 cggagagtga aatggtggaa gttaagagat aaggaggaga gagacaaatt cgctgttgag 300 gttgccatta gaggagtgct gagtgcggat ccactagatg caagcacgaa tgttgactct 360 ctctggaaca cgatgactgg cagaatgatt gaatgtgccc gagaggtgct tggagagaca 420 aaaggaatga aacgaaagag tgatgaccga tggttctgga gtgatgaaga agtgaagaag 480 gcggtgaagg agaagagaaa tgcgtactgg caatggcata gaaggaagag taaagagtcg 540 tgggaggctt ataaggagaa gaggagggat tgtcgacgag tggtagcgat tgctaaaatg 600 aaaacattcg atgatctcta tgagaagctg aatgggccgg atggagagaa gatcgtatat 660 agaataacga aggcacgtga taaagaaagt aaggatattc aagaagtcaa atcggtaaag 720 gacgagatgg gtaatatctt gagagaggag aaggaggtga aatgccgatg ggaagaatac 780 ttccgtgagt tactgaatgt tgagaaagga cagaccagat tgaatgaagg agataaagtg 840 cagggggtga ttccggaatg gaatgaatta gaagtgaatc tggcactttc caagtgtaaa 900 tggggaaaag caatgggtcc agatggaata cccgctgatt gttggaaaag tatcggagat 960 tttggagtga gatggctcac acggctgatg aaccgaattc tagaagatgg aaagatgccc 1020 gatgcgtgga gaaagagtga aatcgtgctg atttacaaag gaaagggaga cgtaagagag 1080 tgcggaaact accgcggcat aaagttgctc tctcattcaa tgaagttgta tgagaagatg 1140 ctggatcgac gattgagaga atgcgtgact ctgaatgagt ctcaatttgg atttataccc 1200 gcttgctcaa cagcagagcc aatattcgcc ttgagaatgc tcttggagag acattatgag 1260 tttggaatgc ccgtagtcgc tgcttttctg gatatggaaa aagcttacga tcgcacgccg 1320 aggaggcaga tttggagatc tttgagagac agagcggtgc cggagcgata tattgatctg 1380 atcaaggaca cttatgaagg tgcttcggca atggtgagaa caccgtgtgg aaagactaat 1440 gaaatcccga tacaagtcgg cgttcatcaa gggagtgctc tctccccatt gttattcatc 1500 acagtactgg atagtgtact ggaaggactg atttgtgaag caccacagac aatgatgtac 1560 gcagatgaca tatgcattgt tggaacggat gtgaaggaag tggagaacgg tgtgagaaaa 1620 tatcagacac gattaaatga ggctggtctg actctcaaca cgggaaaaac ggagttcatt 1680 ggatttgaat gtggaaatgg actgatgacc gatgtgaacc atacagcgat caaaagagtg 1740 gaacaattca agtacttggg tgcgatgatc actgtggatg gaagtgctga gaaggatata 1800 gaacatcgaa tcaaatgtgc gtggatgaaa tggagaggtt gtggaggagt gatgaatgat 1860 agaaggatca gtttaaagtt gaagggaaaa gtctacaaga gcgtgatacg accgacactc 1920 ttgtatggca gcgagatgtg gccgatcagt gaaaggcaaa tggataagat gatggtagta 1980 gagatgaaga tgctgagatg ggcttgtgga ctgaatgtcc gtgatcgagt aagaaatgag 2040 gctattagag cgatgacaaa gtgtgctaag ctggaaggga agataagaga acggagattg 2100 agatggttcg gccatgtgaa gagacgagaa aaggagcatg tatgtagaag gatgatgaca 2160 tataccatcc gaggaaagag gccagtgggt cgaccaaagt tgagatggtc agaaatgatc 2220 aaacaggaca tcacgcagct gcgcttgacc gaacgccacg cactcaacag agacttgtgg 2280 aaaacgaaga cctcctttcc tgaccccgtt taacgtgaat gagaagagca tgccctaaca 2340 tcccaggcct gccctctcag atacaggaaa taaatgattt gatttgattt g 2391 // ID Crack-27_BF repbase; DNA; INV; 2915 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-27_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-27_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2915 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2915 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 832-832 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..2740 FT /product="Crack-27_BF_2p" FT /translation="NSVVEVSGYNLYRRDRDRRGGGAGVYVKRTIPSQRRS FT DLEHTDLEVCCVEIKPEKARKTLLACIYRPPISGRKWKDAADSFVHDLSTT FT AENENADVMIMGDFNVDLMKSTTESSSLEFLMGLYQLVPLIRQPTRITEHS FT ATCIDNIFVSNPDRHKQSQSVTWGASDHHLILTCAKAGVETGGARTCEYRN FT YKRYCQQAFIDSLKTVRWETVLDCVNASDAWNAFKDIFLHVAEVHAPIKEK FT QVKEKGWSAPWLTDSVKNLMGQRDAARRKAIKTGEATDWECYRSLRNQTTS FT AVRMAKKSHFASAISEAEGNHSLMWKIINTFTGKTKSKCQVQKVQRPDDSC FT TSEPTEMAEEFNKYFSSCASNLASDIPVSSDDPVRHVPESTSTFCLQPVEE FT TTVLNELLKLKTKKATGLDKIPSKLLKDSAPVIVKPLTHIFNLSMSTGQVP FT NDWKLARVSPIYKAGNRDNVSNYRPVSVLNVSSKVMERIVHNQVAHFMDRN FT GLLTAHQSGFRKHHSTGTAVQKVVEDIKSAFNAHEVTVALFLDLRKAFDTV FT NHEILLRKLQKMGFDIGTVNWFRSYLTDRLQCVDVQNKQSTLTQVTCGVPQ FT GSVLGPLLFCLYVNDLPQVVEKCNIHMYADDTILYYSASLLSECEEAVSAD FT MERAANWFKENRLSLHPDKTKSMVFGLSQKLRHTGKTVSVTDGVNTFEQVN FT TFTYLGVTMDSTLQWSDHIEKTTKKLLSGLGALCRAKPFVTKEILQVMCRT FT LLYSHVDYCSTAWMPSLLNSNKSKMMQLSRLLNRAARIITGSRRKDCIPTE FT TLLAEAGMESLSERAKGNILVNVYKAVHNKAPSYLQNMFKWMSPPVLHRRT FT RYATKQLSEWDPHMLNVPKAHVQSFRGCLQFQGPTLWNELSAVQRQQTTQA FT TFKNGLMQ*" XX SQ Sequence 2915 BP; 868 A; 683 C; 696 G; 668 T; 0 other; caactctgtg gtagaagtca gtggatacaa cctgtacagg agagacagag accggcgcgg 60 aggcggggct ggcgtctacg tcaaacggac tatcccctcc caacgccgat cagatctgga 120 acacactgac cttgaagttt gctgtgtgga gataaaacct gaaaaggcgc gtaagacact 180 gctcgcgtgt atctaccggc cacctatctc aggacgtaag tggaaggacg cagcagattc 240 ctttgtgcac gatctcagca ccactgccga aaacgaaaac gccgatgtga tgatcatggg 300 cgattttaac gtggatttaa tgaaatcaac aactgaatca tcttctttgg aatttcttat 360 gggtctgtac caacttgtcc cactcatacg ccaaccaacg cgtataacag agcactcggc 420 aacgtgcata gacaacatat tcgtgtcgaa ccctgaccga cacaagcaga gtcaaagtgt 480 cacctggggt gcgtccgacc accatctcat cctgacgtgt gctaaggcag gggtggaaac 540 cgggggtgcc cggacatgtg agtacaggaa ctacaaacgc tactgtcagc aagcttttat 600 tgacagcctg aaaacagtcc gctgggagac agtgttagac tgtgtaaatg cctcggacgc 660 atggaacgcc ttcaaggaca tatttctaca tgtggccgaa gttcatgctc ctatcaaaga 720 gaagcaagtg aaagaaaaag gctggtctgc gccatggctg acggacagtg ttaaaaactt 780 gatgggccag agagatgcag cgaggcgcaa ggccatcaaa acaggggaag caacagactg 840 ggaatgctat aggtctctca ggaatcaaac aacttcagct gtaaggatgg ctaagaaaag 900 tcactttgca agtgctattt ctgaagcgga aggcaaccac agtctgatgt ggaaaataat 960 caacactttt acggggaaaa ctaaaagcaa atgtcaagtc cagaaggtac aacgacctga 1020 cgatagctgt acatcagaac ctacggaaat ggctgaggag ttcaacaagt acttctcgtc 1080 ctgtgcatca aaccttgcaa gtgacatccc agtgtccagt gacgaccccg ttcgtcacgt 1140 acctgagtca acatcaacgt tttgcttaca gccagttgaa gaaacgacgg tgctgaacga 1200 gcttcttaag ctcaagacta aaaaggctac aggacttgac aagatcccat ctaaactcct 1260 gaaagattct gctccagtca tcgtcaagcc gttgacacac atcttcaacc tgtcaatgtc 1320 aacggggcaa gttccaaatg actggaaatt ggctagagtc tccccaatat acaaagcagg 1380 gaacagggac aacgtttcca actaccgtcc agtgtccgtg cttaacgtgt cttccaaagt 1440 catggagaga atcgtgcata accaggtcgc gcacttcatg gacagaaatg gcctacttac 1500 ggcccaccag agtggattca ggaaacacca cagtacaggt actgctgtcc agaaggtcgt 1560 agaagacatt aaatctgcat ttaacgccca tgaggtcacg gttgcgcttt tcctcgacct 1620 tagaaaagcg tttgacactg tcaatcacga aatcctacta cgaaaactgc agaagatggg 1680 ctttgacatc ggaacagtaa actggtttcg atcttacctc acggatcgcc ttcagtgtgt 1740 agatgtacaa aacaaacagt ccacgctaac tcaagtcaca tgtggagtac cacaggggag 1800 cgtcttggga cctttgttgt tctgtttgta tgtgaatgac ctacctcaag ttgtggagaa 1860 gtgcaacatc catatgtacg ccgacgacac tattctgtac tattcggcaa gcctgctgag 1920 cgagtgtgaa gaagctgtat ccgcagacat ggaaagagct gctaactggt ttaaagaaaa 1980 caggctatca cttcatcccg acaaaactaa atctatggta tttggtttgt ctcaaaaact 2040 acgtcacacg gggaagactg tttctgtaac agatggagtt aacacattcg agcaggtgaa 2100 caccttcacg taccttggag ttactatgga ctcaactcta caatggtcag atcacattga 2160 gaaaactacc aagaagcttc tgtcgggtct cggtgcctta tgtcgtgcca aaccctttgt 2220 gactaaggag atactgcagg tcatgtgcag aacgctgcta tactcacatg tggactactg 2280 ttcgacggca tggatgccat cacttttaaa ttccaacaag tcaaaaatga tgcagttgag 2340 caggttgtta aaccgggctg ccagaatcat tacaggcagt agacgcaagg actgtatacc 2400 caccgaaacc ctgttggcag aggctgggat ggagtcactt tccgaacgag cgaaaggaaa 2460 catactggtt aacgtgtaca aggcagttca caacaaggcg ccgtcctact tacagaacat 2520 gtttaaatgg atgtctccac ctgtgctaca cagacgaaca cggtatgcta caaagcagct 2580 gtcggaatgg gacccccaca tgttgaacgt accaaaggct cacgtccagt ccttcagagg 2640 ttgtctgcag ttccagggcc ctacgttgtg gaacgaactt tctgccgttc agagacaaca 2700 gacaacgcaa gcaactttta aaaacggact gatgcagtaa tactgttggc agaactttca 2760 ccgttatcgt attcctatag ttaagttata tgactgtgta gttataagtc gtgtatggtg 2820 tctgttttta tatgtgtcca gagagtcctg aaaaacaggc tgaagcctga gcgatgtact 2880 ctggtaaaac aaataaaaat gaaaaatgaa atgaa 2915 // ID BEL-5_DPu-LTR repbase; DNA; INV; 347 BP. XX AC scaffold_283; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-5_DPu_; KW BEL-5_DPu-LTR; BEL-5_DPu-I. XX NM BEL-5_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-347 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 658-658 (2010). XX DR Genome; scaffold_283; Positions 31047 31393. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 347 BP; 91 A; 74 C; 74 G; 108 T; 0 other; tgttcccgcc agcacagccc caaacggata attcagaaac cgaagtctgt ttttccctct 60 gatagatgtc atttcaatcc tttgtttacc cttccctttc tctctcttgc tgaaaggtgc 120 tcttgggcaa aacgaggcag ttacgtggct acactctcgc agagactagc taacagctcc 180 ctaacgtgtt aattttccat attttgattg attgtgtgct taacgtgtac gttgacttaa 240 tgtgtgatta ttctatgtaa ctgtgtgaga agaaatagac aaagccagaa gggcgaatta 300 aaatctacat tttgagaagt gtctcgtcaa ggtgttcgga gggaaca 347 // ID SMAR25 repbase; DNA; INV; 2603 BP. XX AC . XX DT 08-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR25. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2603 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1083-1083 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 514..2190 FT /product="SMAR25_1p" FT /translation="MLHNSPALKMRKAISLDTKIKILDQLATGQGATVVGN FT NFGIHEATVRTIKKNETAIRASVCSGTKLSAKSSSYVRDVVKEKMEKALVI FT WIEDKSQKRIPVDGLAIKQTALRIYKRIQEVDPDTSSQSKQHAFSASTGWM FT TGFLKRHALHNIKIKGETASADELAAKEFPEKLKKIIEDGEYTPDQVWNLD FT ESGLFWKRMPRRTYVAKSQKTAGGFKVAKDRITLLFCSNASGERILKPLLV FT HRALRPRSMKSVDFNKLPVHWMANKKAWVTSAIFTEWFQKHFIPEVRRYMK FT EKCLEFKVLLILDNAPGHPVLEHPNVQFCFLPPNTTSLIQPLDQGIIATFK FT TYYIRSSFHYVVEKLDNYEESSVIDEWKKFSIMDCINQVKIALDLLKPSTL FT NSCWKNIWPECVKCKDPVIDNNAELADIITMANAIGGDGFDDLSLADVEEL FT LVDESLSENEIIDLTLETRYDKEHSDSDERDRPILKASLIKEGLDLATKLG FT SHFEQHDHDEERAAKFQRELKSLMASYREVYNGLTRNKTQSKITDFVQKST FT DVPTQSARNDKD" XX SQ Sequence 2603 BP; 929 A; 400 C; 446 G; 828 T; 0 other; cacctaattc tgtttttaca cgaattattt ttacacgaat ttgaaataac acgagtaaat 60 acgaaaaaat atttctattt ttacacgaca taatttgaaa taacacgaat acaaaaaaaa 120 caaaattact ttttttacat gaattattgg atttaacacg ttttttttca atgaattttt 180 ttaaagtaaa aagagagcaa ctcaccacga tgacagtcaa ctcaccacgc taaaatagct 240 tgaactttac tagaacaatt ttgtacaagt aaatttcaag caagtaaatt tcaagcatca 300 gctgttttgt tttgtttgtt tacacacaat ttagttccta tttgacaatt ataattgaaa 360 gatcgttagt ttttgctgat ttttaacaag tgctaagtaa aataaagtag aagctcatat 420 aattttggta gaccaataac aggttagttt caaatattta attttagatt tttgcatttt 480 tgattaataa aatatatttt ggtagccaca cccatgttac ataattctcc tgcattaaaa 540 atgaggaaag cgatcagctt agataccaaa atcaaaattt tagatcaact tgcaacggga 600 caaggcgcaa cggttgtggg aaataatttc ggaatccatg aagctactgt aagaacaatt 660 aaaaaaaatg aaactgcaat tagagcatca gtatgttctg gaacaaaatt aagtgcaaag 720 tcatcatcgt atgtaaggga tgttgttaaa gagaaaatgg aaaaagcttt ggtaatctgg 780 atagaagaca aatcccaaaa gagaatacca gtagacggac ttgctatcaa gcagacagca 840 ttaagaatct ataaacgtat ccaggaagtt gatccagata catcatctca gtcaaaacaa 900 catgcatttt ccgcaagtac tggttggatg actggttttt taaaaagaca cgctctccac 960 aatataaaaa ttaagggaga aactgcatcc gctgatgaat tggctgctaa agaatttccc 1020 gaaaaactaa aaaaaattat tgaagatgga gaatatactc cagaccaagt ttggaattta 1080 gatgaaagcg gccttttttg gaagagaatg cctagaagaa cttatgtagc aaaatcgcag 1140 aaaacagccg gtggttttaa agtagcaaag gaccgtatta cgttgttgtt ttgttccaat 1200 gcttcaggag aacgtattct taagccactc ctagtacatc gtgccttaag accacgttcc 1260 atgaaaagtg tagatttcaa taaattgcct gtacactgga tggcgaacaa aaaggcttgg 1320 gtaaccagtg caattttcac agagtggttt cagaagcact tcatcccaga agttaggcgc 1380 tacatgaaag aaaaatgtct cgaatttaaa gttcttttga ttctagacaa tgcaccaggc 1440 catccagttt tggagcaccc aaacgtacaa ttttgttttt tgccgcctaa taccacttcc 1500 cttatacagc cactagacca ggggataatt gctacgttta aaacgtatta cataagaagt 1560 tcatttcatt atgttgtaga aaaacttgat aattatgagg aatcatcagt catagatgaa 1620 tggaaaaaat tttctataat ggactgcatt aatcaagtca agatcgcgtt agatttatta 1680 aagccatcaa ctttgaactc gtgttggaaa aatatttggc cagaatgcgt aaaatgcaaa 1740 gatcctgtta tcgataataa cgctgaactt gctgatataa taacaatggc caatgcaatc 1800 ggtggggatg gattcgatga cctatcgtta gcagatgtag aagaattgtt ggtagatgaa 1860 agcttgagcg aaaatgaaat tatagatctc acccttgaga ctcgttatga taaagaacat 1920 agtgatagtg acgaaagaga tcgtccaatt ttgaaagcat ctctaattaa agaaggtctt 1980 gatctcgcta caaaattagg tagtcatttt gaacaacatg atcatgacga ggaacgagct 2040 gccaaatttc aacgtgaact gaaatcatta atggcatctt acagggaagt ttataatggg 2100 ttaacgcgaa ataaaactca atctaagata actgatttcg ttcaaaaatc tactgatgtc 2160 ccaacacagt cggctagaaa tgataaagat taacaaaatc attccagtga tggcagtgat 2220 attgtagtct ttcgcaaacg tttacgttta ttatcagaca gtgacaataa ataaaattgt 2280 ttaatgttta aatttttctt atggttgtac tttatgttat ttaagttttt tggcaatgaa 2340 tttctggtat tatttttttt atgaaatcta aattttaatt tattttatta taatttagtt 2400 tatttaataa attaaaacga aaataaattc ttaaaataac tacaacaata tttttttata 2460 ttatttagtc taattttaac ttattttttg aggtctagaa ccaatctatt atttttgtac 2520 ttggccagta tcttttttta cacgattttt ttttacacga atttctcggg aaccattcta 2580 tcgtgtaaaa acagaattag gtg 2603 // ID DNA8-105_AP repbase; DNA; INV; 650 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-105_AP. XX NM DNA8-105_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-650 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2043-2043 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 650 BP; 221 A; 83 C; 84 G; 259 T; 3 other; caaggctggg cattaatgag ttaaaacgtt aagttaattt taattaagtt aattaactcg 60 ttactttttt tttcaaatta acttttaact taataagtta cttttttttc aagattaacg 120 tgttaacttc aagttaattt tatttcaaaa atatctaaat taagttaatt ttcgtccgaa 180 gaataatgca aatttttgaa tgtgttttta ctttgatgta tcattttaaa tttccccagt 240 aattttcatg gacgtaaaga aaaacggaac ttttctcctt tacgggacac cccttttttt 300 ggtttttttg gggggggact tagtcatttt ttgggtaaaa aaaatgttgg tgaaatcaga 360 atttggcgca cgaanttant ttcgaagtta tagctaatcg aagttttcan tttatattat 420 aaccagaaaa actatctaac aaaccatatt atgtgtctgg aatttttatt tttgcaaatt 480 agcaaatttt aacttaacac taagttaagt tacttttttt ttaaatttaa cttttaactt 540 aacgagttaa cgaaaaaaaa tacgagtaac ttttaactta acttaactta attattttaa 600 aataacttaa cttaacgagt taaaaaaaat cgttaacttg cccagccttg 650 // ID SAT-4_NVi repbase; DNA; INV; 102 BP. XX AC . XX DT 07-MAY-2009 (Rel. 14.06, Created) DT 07-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Nasonia vitripennis satellite repeat. XX KW SAT; Satellite; Simple Repeat; SAT-4_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-102 RA Bao W. and Jurka J.; RT "Satellite repeats from Nasonia vitripennis."; RL Repbase Reports 9(6), 1160-1160 (2009). XX DR [1] (Consensus) XX SQ Sequence 102 BP; 41 A; 28 C; 21 G; 12 T; 0 other; ggaaggcgag cgagagcgag cagcaaaact aaaaccaaaa tactcaaacc cacctacccc 60 acggacagcg agttcaataa taaaatatct cggcacatcg ga 102 // ID Dcardi9 repbase; DNA; INV; 473 BP. XX AC GU229938; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mellifera subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dcardi9. XX OS Drosophila cardini OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; cardini group; OC cardini subgroup. XX RN [1] RP 1-473 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229938; Positions 1 473. XX CC Clone Dcardi9. XX SQ Sequence 473 BP; 124 A; 122 C; 133 G; 94 T; 0 other; tgggtcccgc accgagctga cgccaagaaa aacccttctg gtgctgctga tacgcgaacg 60 aactagactg ctgattacaa gcggatggtg actggcggcg aaaagtgggt cacatacgac 120 aatattaagc gaaaacggtc gtggtcgaag gccggtgaat cgtcccaaac agtggccaag 180 ccgggattga cggccaggaa ggttttgcta tgtgtttggt gggattggaa gggtattatc 240 cactatgagc tgctcccata tggccagacg cttaattcta ccatctactg cgaacaactg 300 gaccgcttga agcaggcaat cgaccagaag cgtccagaat tggccaacag gaggggtata 360 gtgttccacc aggacaacgc caggcaacac acatctttgg tgacgcgcca gaagctacgg 420 gagctcggat ggaaggtttt atcgcatcca ccatattccc ccgacctagc ccc 473 // ID Tx1-4_BF repbase; DNA; INV; 5541 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-4_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-4_BF; KW Tx1-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5541 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5541 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 841-841 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 28..1671 FT /product="Tx1-4_BF_1p" FT /note="ORF1." FT /translation="MPQVTPSAYSKRRTASITFLTKQESDFDDVLELLRTH FT NINIRTEVDTIQSSGKNKSEVVFKTNNALNKWAPFLAADNDIDVELYGNGV FT TVVTVMGASIELDDNYVRHQLKEYGEIKDGRYLTYASRGYPEIKTGTRQYK FT IALKKHIPNSIRVGGDNVAVRYNGQPRMCHRCGGQDHFVAACTVDKCSRCH FT ELGHKAGDCTEQIRCNNCKGEGHTFRSCPLSFANKLSMTTAWGGTTSWTKP FT TETVSQEKSGGKEAKQNQAKNGSEAKQGGSGGTQSHKKDPPLIESVSGESE FT GEESDKEVMGEEETIPHKVVGGSSAANSGDKNKGVEVKPDGETSPLKEDDV FT SKAEAVVKTPARAVTPPSESPPKGSHEEGFWDLGSLPEDGANKMPEDEEEV FT EEMEVDKPTTTFKLALSESLAEKLSGDLGPGFTIGDGEEGPELTINLAEPL FT VIDDSDREENAPKRPHQSDPSESDDSDTSAKNPKEKLERKKVKKDPDNTAK FT FGSQIDMFASSGESQSEEPKNGSKTKKPGALKGAGNKGGGNKGGKTSRGKP FT R" FT CDS 1699..5178 FT /product="Tx1-4_BF_2p" FT /note="endonuclease and RT." FT /translation="MSMDLRILTLNVNGMKEKLKRSQVFDFCRNLKVDVAC FT LQECHISSLADKSSWSRQWGNKTVWSLGTNSARGTGILMSDKWSIVSSKVD FT MDGRIVSVLISDGSSRYNVVNVYAPNTARDRAVLFASLHEYMFPNATLIIA FT GDFNCVLDPALDRHTTSSASPGLNTQDVIELRSLCTDLGLKDTWRDEHPTL FT LEYTWRSPSNQTRTRLDRIYVSDDPSTKARIVSCPFSDHDAVVTDIEPTHP FT VQQGRGVWKCNVKTLSDPLFVQDLEEQYRGWQNQKSDHSSMREWWDEVKVK FT IRDLVVTHSKRRAKGARLVQKGLEKEVDMLRSRLNCGDCSTTTLREYEEAK FT DKANKWVRDQLEGQRLRSRIKHFEEGEKPTRFFFRSERDEGKKKLIREIRK FT NDGDITSETDEITEVFRDFYAKLFTKDQLDSQDRDYFLSKLEETLPEEVSD FT QLERPISAEELLTALKSMANSKTPGSDGLPKEFYLQFWDLLGPDMLEVFKE FT GLQDGLLAPSQRQGIITLLEKKGDQLNPANKRPISLLNVDYKILSKVLANR FT LKAVIGLVVHTDQSCGIPGRSISDSVCLLRDIAAYVNDKDLPCVFLALDQE FT KAFDRVDHDFMVSILERLGFGPTFRQYIATLYSEVSSKVLVNGSLSSSIPV FT ERGVRQGCPLSPLLYVLCIEPLAAAIRADTQIKGVHLPGGSGRDTKIVQYA FT DDNTVVLTDSRSIDRTFELISKYESGTGAKLNLGKTVALWLGCWRGRQDQP FT YPIGGWSSSHLKILGSPVGNQNLAEEAWLQRFAKFKATLEKWKDRKLTLFG FT KVLVINSLAAATLWYVAPVYPLSRSVQRKIEKLMFEFLWDGKTELVNRKTL FT YLSKEEGGLGLVCIPLKAKALLLKSVKKALTDPEAPTSRYAHYWLGLSLRQ FT LDPESWNNNIPHSMERPPHFESIFGHLKDLETKDVQIDWKSCTTKSLYDDL FT LKAENIVPRCVQNNPTVRWKDVWKAIHNPLIQKWDRMLAWHAAHNSLKTRQ FT KLKSWRGFVDSDVCPRRGCNMVESVTHLFWECPVVAEVWEWVEGLIYRRIL FT PQFNMPGAFAVYSLVFQSMPTKTRLVLETLAAMTRSLLWKSRCEVIFEKKH FT FTGQELITRLQKMISERLDFEFARLSPAAFYDTWVEGHTWIDFDISHIVVH FT L" XX SQ Sequence 5541 BP; 1519 A; 1189 C; 1536 G; 1297 T; 0 other; agaaaccgta gttaaacgtc acgaaacatg ccacaagtta caccttccgc ttatagcaag 60 cgaaggaccg catctatcac gttcctgacc aaacaggagt cagatttcga cgatgtgttg 120 gagttgcttc gcactcataa catcaacatt cgaacagagg tggacaccat ccaaagtagt 180 ggcaaaaaca agagtgaggt ggtgttcaaa accaacaatg ctctgaacaa gtgggctcca 240 ttccttgcag ctgataatga catcgacgta gagctgtatg ggaatggtgt cacggtagtg 300 actgttatgg gagcctccat tgagctggat gacaactatg tgcgtcacca gctcaaggaa 360 tacggcgaga taaaggatgg gcgatacctg acctatgcca gtcggggata ccctgaaatc 420 aagacgggga caaggcaata taagattgcc ctgaagaagc acatccccaa ctcgatcaga 480 gtggggggtg ataacgtcgc agtaaggtat aacgggcagc cgagaatgtg ccaccgctgc 540 ggcggccagg accactttgt ggcggcttgc accgttgaca agtgttcacg gtgccacgag 600 ctcggccaca aagctggtga ttgcacggaa cagatcaggt gtaataattg caaaggggag 660 ggacatacgt tcaggtcctg tcccctgagc tttgcgaaca agttgtccat gacgacggcc 720 tggggcggta ctacttcttg gaccaagccc acagaaacag tgagtcagga gaaatcaggt 780 ggcaaggagg ctaagcagaa tcaggctaag aatggctcag aggctaagca gggtggaagt 840 ggtggtaccc aatcccataa gaaagatccc ccactgattg agtcggttag tggtgagagt 900 gagggagaag agagcgataa ggaagtcatg ggcgaagagg aaactattcc tcataaggtg 960 gtagggggct cctctgctgc caactcaggg gataagaaca agggggtgga agtcaagccc 1020 gacggtgaaa cctcccctct caaggaagat gatgttagca aagcggaggc ggtggtaaag 1080 acaccggcta gggctgtcac accgccgtcc gaatcgccac cgaaaggtag tcatgaggag 1140 gggttctggg atctgggctc cctccctgag gatggtgcca acaaaatgcc agaagatgag 1200 gaagaggtgg aagaaatgga ggtagacaag cctaccacca ccttcaagct ggccctgtcg 1260 gaatcgctag ctgaaaaact gagtggtgac ttgggtcctg gattcaccat aggtgatggt 1320 gaagagggtc ccgagcttac catcaacctg gccgagcccc ttgtcataga tgacagtgat 1380 agagaggaga atgcgcctaa gcgtcctcat cagagtgacc cgagcgagtc ggacgactca 1440 gatacctcgg caaagaaccc taaagagaaa ctggaaagga aaaaggtcaa aaaggacccc 1500 gataataccg caaaattcgg gtcccaaatt gacatgtttg cctcatccgg ggagtctcag 1560 tctgaggagc ccaagaacgg ctccaaaacg aaaaagccag gcgccctcaa aggtgctgga 1620 aacaagggtg gtggcaacaa aggcggtaag actagcagag gcaagccacg ctaggattct 1680 ttgaccgacc taaaaagcat gtccatggac ttacgaatat tgacgcttaa tgtgaacggt 1740 atgaaggaga agctgaagag atctcaggtt tttgattttt gtcggaattt aaaggtagat 1800 gtggcttgtc tccaggaatg tcacatatct tccctggccg acaagtcgtc ttggagcagg 1860 cagtggggaa acaaaactgt ctggtcactg ggcaccaact ctgccagggg aactgggata 1920 cttatgtctg acaagtggtc tatcgtctcc tcaaaggtag acatggatgg caggatagtc 1980 tctgtcctta tctcggatgg ctcgtcgaga tataatgtag ttaacgtata tgcccccaac 2040 acggcaagag atagagctgt tctctttgcc tctctgcacg agtacatgtt tccaaatgcc 2100 accctcatta tagcagggga ctttaattgt gttttagatc ccgcccttga cagacacacc 2160 acgtcttctg caagcccagg cctcaataca caggatgtga ttgaacttag atccttgtgc 2220 acggatcttg gtttgaagga cacgtggagg gacgagcacc caactctatt agagtacacg 2280 tggcgttctc cctccaacca aaccaggaca aggttagata gaatctatgt ctcagatgac 2340 ccttccacga aggccaggat agtctcttgt ccgttctctg accacgatgc agtggttact 2400 gatatagaac ccacccaccc tgtccaacaa ggccgtggtg tgtggaaatg taacgttaaa 2460 actttgtccg accccctctt tgtccaagac ttagaggaac agtatagagg ttggcagaac 2520 caaaagagtg accactcgtc catgagagag tggtgggatg aggtcaaagt taagataagg 2580 gacctggtgg tgactcactc caaaagaaga gcgaaaggtg ctagacttgt tcagaaaggt 2640 cttgagaagg aggtagacat gctcagatct cgcctgaatt gcggagattg tagcaccacc 2700 acccttcgtg agtatgagga ggcaaaggat aaagctaata agtgggtccg ggatcagttg 2760 gagggacaac gcctcaggtc caggatcaaa cattttgaag agggtgagaa acccactcgc 2820 ttcttcttca gaagcgaaag ggacgaaggg aagaagaaac ttattcgtga gatcagaaag 2880 aatgatggag acatcacttc tgagactgac gaaattacgg aagtttttcg tgatttttat 2940 gccaagctgt tcacgaaaga ccagttagac tcccaagata gggattactt ccttagtaaa 3000 ctggaagaga cccttcccga agaagtatct gatcagcttg aaagaccaat ttcggctgag 3060 gagttgttaa cagcactcaa aagtatggcc aacagcaaaa cacccggatc agatggattg 3120 cccaaagaat tttacctgca attctgggac cttcttggac cagacatgtt agaggtgttc 3180 aaagaagggt tgcaggatgg tttgttagca ccatcacaac gtcaaggaat cattaccttg 3240 ttggagaaaa agggggatca attaaatccc gccaataagc ggccgatctc gctcctcaat 3300 gtggactata agatactgtc caaggtgtta gcgaatagac taaaggcagt gattgggctg 3360 gtggtccata cggaccagtc gtgtggtatc ccagggcgtt ccatctcaga tagtgtttgc 3420 cttctgaggg atattgctgc ctatgtcaat gacaaagacc ttccctgtgt ctttctagcc 3480 ttagatcagg agaaggcttt tgatagagta gaccatgact ttatggtcag tattttagag 3540 aggttgggtt tcggacccac gtttaggcaa tacatagcta ctttgtatag cgaggtatcc 3600 agcaaggtgc tggtcaatgg tagtctctcg tcctctattc ctgtggagag gggagtcaga 3660 caaggctgtc cattgtcccc tcttctttat gttctgtgta ttgagccctt agccgcagcg 3720 atcagagctg acactcagat aaagggtgtg catctgcccg gaggttctgg gagagacacc 3780 aagattgtgc aatacgcgga cgacaatact gttgtcctga cagacagtag gtccattgac 3840 cgtactttcg aattgatttc gaagtacgaa tctggaactg gggcaaagct aaacttggga 3900 aagacagtgg cgctgtggct ggggtgctgg cgaggacgcc aagatcagcc atatcccatt 3960 ggaggatggt cttcgtctca tctgaagatt ttaggcagtc cggttggtaa ccagaacttg 4020 gccgaagagg catggttgca aagatttgcc aaattcaagg caacactcga gaagtggaag 4080 gatcgcaaac ttaccttgtt tgggaaggtc ttagtaatca acagtttggc ggctgccact 4140 ctttggtatg ttgcccctgt ttatccgctg tctcgttcag ttcaaagaaa gattgagaaa 4200 ctgatgtttg agtttctgtg ggacggcaaa acagagttgg tgaacagaaa aactctctac 4260 ctctccaagg aagagggtgg tttgggactg gtttgcatcc cgctcaaggc caaggcattg 4320 ttgcttaagt ccgttaagaa ggccctcact gatcccgagg cgcctacgag caggtatgcc 4380 cactactggt taggtctgag tctcagacaa ctggaccccg agtcgtggaa caacaacatt 4440 ccgcactcca tggaacgtcc cccgcacttt gagtcgattt ttggacacct caaagacttg 4500 gagacgaaag atgttcagat tgactggaaa tcctgcacca ccaagtcgtt gtacgatgac 4560 ctcctgaaag cagagaacat tgtccctcga tgtgttcaga ataatccgac agtcaggtgg 4620 aaggatgtgt ggaaggccat ccacaatccc ttgatccaga agtgggacag gatgcttgcc 4680 tggcacgctg cacacaactc tctgaagact agacagaagt tgaagtcttg gagaggcttt 4740 gtcgactctg atgtctgtcc taggagaggt tgcaacatgg tggaaagtgt aacacatctc 4800 ttctgggagt gtccggtggt ggctgaggtg tgggagtggg ttgaaggcct catttacagg 4860 aggattcttc cccagtttaa tatgccgggt gcctttgccg tctatagttt ggtcttccag 4920 tctatgccaa ctaagacgag acttgtcttg gagacattgg cagctatgac caggtctctg 4980 ctttggaagt cacgttgtga ggtgattttt gagaagaaac actttacggg ccaggagctg 5040 atcactcgtt tacagaagat gatcagcgag aggcttgact ttgagtttgc acgtttaagc 5100 ccggctgcgt tctatgacac ttgggttgaa ggacacacgt ggattgattt tgatatctcc 5160 cacattgtgg tgcacttgta gttggcttgt gtttcctcct ggaagaaagg aggtctagag 5220 aatgaaaaag agaaaatcca aaaaaaatta taaaagagaa aacccaaaaa aagtagaata 5280 gtggggttgg gaagggggga atggtgggtg agtgtgtgtg tgtgcactta ccttgggttt 5340 gtgagggggt ggaccaagag tggtgggtgg gagtgaggtg ctgcgccgag ccgagccgat 5400 ccgctttatg ttttcgattt gatctggatt tgtgtctgag atatgtaagc cggggagtgt 5460 ttatcctccc gcggtatgaa ctgtataaga acaagtaaat gctctgaaat agtatataat 5520 tacttttgtc aaaagagtat c 5541 // ID Gypsy-2-LTR_LG repbase; DNA; INV; 601 BP. XX AC . XX DT 07-APR-2009 (Rel. 14.04, Created) DT 07-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; 4-bp TSD; KW Gypsy-2-LTR_LG. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-601 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Lottia gigantea."; RL Repbase Reports 9(4), 931-931 (2009). XX DR [1] (Consensus) XX CC TSD is 4-bp long. XX SQ Sequence 601 BP; 179 A; 114 C; 107 G; 201 T; 0 other; tgtaagaaaa taccaatttt cttactgttt gtttaatagt gtgaaacaaa gttcacacat 60 attgttcctc acagaccctt ccagtaacac tatttgtcca ggtgacaggt ctacgaccgc 120 taacctgaag ctcacctgaa aacatgtatc gtaacaatac cttccgaggt aatcctaatg 180 gctttaacaa tacctacata agataatcct aatggatata ataatacttt ttggtaatat 240 gtttaactat agttttaggg gtctgtggga aaagcaaaca tgatcaatat aacttatcat 300 attcttttat tctcttatta ttttatattg tattttttgt atgggtttgg ctatatcaga 360 cggcgacgag aacatcttag ttagtatgtt tcgaggatac cgagctgctc agactagtca 420 gtatgtttcg aggataccga gctgctcaga ctagtcagta tgtttcgagg ataccgagct 480 gctcagtcgt ctgtctggta ttgctgaccg ctatatttct aattgtaata aacagtgctt 540 aaacatcgca agtactttat cttcatttac cttcatagac acaacgtgag gaagtccaac 600 a 601 // ID R8Hm-B repbase; DNA; INV; 4265 BP. XX AC . XX DT 28-MAY-2010 (Rel. 15.05, Created) DT 28-MAY-2010 (Rel. 15.05, Last updated, Version 2) XX DE R8Hm-B - 18S rDNA-specific non-LTR retrotransposon from Hydra DE magnipapillata. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R8Hm-B. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4265 RA Kojima K.K., Kuma K., Toh H. and Fujiwara H.; RT "Identification of rDNA-specific non-LTR retrotransposons in RT Cnidaria."; RL Molecular Biology and Evolution 23(10), 1984-1993 (2006). XX DR [1] (Consensus) XX CC R8Hm-B and its close relative R8Hm-A belong to the R2-A clade (or CC R2 clade), but specifically insert into 18S rDNA instead of 28S CC rDNA. TSDs are GGTTGCAAAGCTGAAA. XX FH Key Location/Qualifiers FT CDS 266..3739 FT /product="R8Hm-B_1p" FT /note="includes RT and restriction-like FT endonuclease." FT /translation="MSNRITIGDVPSVGKGGLTVNKQTAGADGAEACVVIH FT PGAKGIWSSPACLRKFTIGKELRAHLAQIHKLAPSAVRYRCNKCPYEGDVQ FT LSVGTHLRYCKGIAGVVEEKKQFACAICNFSSDTFSGLQVHKQRKHVVEWN FT EQLKEKTEFAWTDRELRELAVKEVTIPFSVVNTETFAVLDITTRTKDAVRK FT IRYTDRYKSILAEVRAQVNAVAEEAPQASDESQITLLVNTGRGAELQPAVI FT NITDSIELVTDVNEVEMVTSNSTNEEQPINAPVEPAVIEADLGRQDAKLYL FT ASLRQSDCTNASDRWTLAYCRGEVDWCKTKSRLFKVSRHARGLRQPQRVEN FT WEFPEGFRPNRNLRKWRKYSFLQSCYRTKKKETVSKILDGTFKDTPEEEIR FT PELEEVQRVYVDRLEVRTQLDTTRTVHIDERFDLVSYGRITIREVQDAISA FT SKKDASGGPDGLLLQDVKKASPRQLCIIFNMWYLHGIPVVENRCRTILLHK FT GGEKHLTSNYRPVTIGNMLNRVYAKIWDRRIRKNLQLHVRQKAFVPLDGCF FT ENVKTIQCILQSYRRSRREHNVVFVDLAKAFDTILHDSIEKALLRKGIPRS FT VIKVVDSLYAGAVTSITVGKTKTRPICINSGVKQGCPLSPLLFNLVIDELA FT ERLEATGCGLDLEGHVISSMAFADDYVLLAKDSVEMNVLMNVCNTFFEEKG FT LAVNPAKCQSLRVLPVKGKRSMKVLTRTHRWWKINNQDVEIPSMTYESVGK FT YLGVMIDPAGKIALPIEEWKLWLTRLRECKLKPDQKVKVLKEVVCARANYV FT LRMSGCGICELRKWSRFVRGWVKSIIHFPAWCNSEWMHSSKGLGIPDVVSG FT IVIQRMRAAEKMAKSTDGVVRVVGARIVQTNRVLWKRAGLAGIELDAARKF FT CEVKRVNKIGNQTNGGALKTIAESSVSRHWLLEKNIRPGNKILVWKAMAGV FT IPTKINLSRGVADQTLKKCRCCGLTAETDCHILAGCPTSRDAYSKRHNLLC FT DKLAKELRLNGGPSRRVWRERMCLSGNGRRYKPDIVVKDDGVITVIDMACP FT YEKSERHLSQCEDAKVAKYEPLRLDRSWTQELEGNNGRSANEISVVGIAVG FT AIGTITRKTQRILSKLKLAKVGRPLQIIACNESAQIIRRHLSGSRLRNLR" XX SQ Sequence 4265 BP; 1247 A; 798 C; 1178 G; 1042 T; 0 other; cttggggtca ctgacacatt tttcggtagc catagttttt tgagaggaag agtggaagtt 60 tttccatgag tcgtctctcg tataaactgt ggtaaatccg gccatccagc ctctacgcgg 120 cgcaactaga aacttggatc agtgatcaag gctaatggat gacgggactc catggataag 180 gagatataaa gatcttattt gaacgcatct taaggggtta tggggctaac acccccttaa 240 ttctggtgca catttattga ccgttatgag caatagaatc acgataggtg atgtaccctc 300 ggtaggaaag gggggtttaa ctgtcaataa acaaacagca ggagctgatg gtgctgaagc 360 gtgtgtagtc atacacccag gtgccaaggg tatttggtcc tctcctgcgt gtttaagaaa 420 gtttacgatc ggaaaagaac taagggcaca tttggctcaa attcataaac ttgcaccgag 480 tgcagttcgg tacaggtgta ataagtgtcc gtatgagggt gatgtccaac tcagtgtggg 540 aacacatctg aggtactgta agggtattgc gggagtggtg gaggagaaaa agcaattcgc 600 ttgcgcgatt tgtaatttct cttcggatac cttttcagga cttcaggtgc ataagcaaag 660 aaagcatgta gttgaatgga acgagcagct gaaagagaaa acggagtttg cttggacaga 720 cagggaactg cgggagctgg cggttaagga agtaacgatt cctttctctg tggtgaatac 780 ggagaccttt gctgtgctag atattacgac gcggactaag gatgctgtga ggaaaattcg 840 ctacacggat agatacaaat ctatcctggc tgaagtacgc gcacaagtta acgctgtggc 900 ggaggaagcg ccgcaagcta gtgatgagag tcaaataacg ctcttagtta acacaggcag 960 gggagcagaa ttacaacctg ctgtgattaa tataactgat tcaattgaat tagttactga 1020 tgtcaatgag gttgaaatgg taacatcgaa ttcaaccaat gaagaacagc ctatcaacgc 1080 gccggtggaa ccggctgtaa ttgaggcgga cttgggaaga caggatgcga aactatatct 1140 cgcatcgctg cgtcaaagcg attgcacaaa cgcatctgat cgatggaccc ttgcgtattg 1200 caggggagaa gttgattggt gtaagacgaa aagcaggctt ttcaaagtat caagacatgc 1260 ccggggttta agacaacctc aaagggtgga gaattgggag tttccagagg gattcagacc 1320 taacaggaac cttcgtaaat ggaggaagta ttcattcttg caaagttgct atagaacgaa 1380 gaagaaggaa actgttagta agattcttga tggtactttc aaggacacac ctgaggaaga 1440 gattaggcca gagttggagg aagtacaacg tgtgtacgtt gaccggctag aggtaagaac 1500 tcagctggat accactagga cagtgcatat agacgaaaga ttcgatttag taagctatgg 1560 tcgcattacg atcagggagg tacaagacgc aatcagcgca agcaagaagg atgcctcagg 1620 gggtcccgac ggcttgctcc tacaggacgt gaaaaaggcg agcccacgcc aattgtgtat 1680 catctttaat atgtggtact tgcatggaat ccctgtagtg gaaaataggt gccgaacaat 1740 actcttgcat aagggtggcg agaagcatct aacgtcgaac taccgacctg tgacgatcgg 1800 caatatgctg aatagggtat acgctaagat ctgggacaga cggatcagaa aaaacctgca 1860 acttcatgtg agacagaaag cattcgtccc gctggatggg tgctttgaga atgtaaaaac 1920 catccaatgc attctccagt cttacagaag gagcaggcgg gaacacaatg tcgtatttgt 1980 cgatcttgca aaagcgtttg atacgatttt gcatgattcg atagagaaag cattgctgag 2040 gaaaggcata ccgcgaagtg tgataaaagt ggtagacagc ttatatgcgg gagctgtcac 2100 gagcattacg gttgggaaaa caaagactcg acctatatgt ataaattcag gggtgaagca 2160 gggttgtcct ctatctcctt tgctgttcaa tctagtaata gatgaactag cggagaggct 2220 ggaggcaact ggctgcggtc ttgatctgga aggtcacgtc atttcttcca tggcttttgc 2280 tgatgactac gtgttgttgg cgaaagactc ggttgaaatg aacgtgctaa tgaacgtgtg 2340 caatacgttc tttgaggaga agggtttagc tgtaaatcca gcaaaatgtc agtcgttacg 2400 cgttttgcct gtaaaaggca aacggtccat gaaagtcctt acgaggacgc atagatggtg 2460 gaaaattaat aaccaggatg ttgaaatccc atctatgaca tacgaaagtg ttggaaaata 2520 tcttggggta atgattgacc cagctggtaa gattgctctt ccgattgagg aatggaagct 2580 ttggctaact aggttaaggg agtgtaagct caaacctgat caaaaagtga aggtgctgaa 2640 agaggtagtt tgtgcccgag caaactatgt tctccggatg tccgggtgcg gaatctgtga 2700 gctccgtaag tggtcacgat ttgtgagggg atgggtgaaa tccatcattc acttccccgc 2760 atggtgcaat agcgaatgga tgcattcgag caaaggctta ggcattcctg atgtagtgtc 2820 aggaattgtc atccaacgaa tgagagctgc ggaaaaaatg gctaagtcaa cagacggagt 2880 agtccgagtt gtcggggccc gcattgtgca gacaaataga gttttgtgga aaagggccgg 2940 attagcaggc atagaactgg atgccgccag gaagttctgt gaggttaaga gggtgaacaa 3000 aattggcaat caaaccaatg gaggcgccct caagactata gcagagtcct cggtgagccg 3060 gcactggtta ttggaaaaga atataagacc tggaaacaaa attctagttt ggaaggcaat 3120 ggcaggagtg attccaacaa agatcaatct gtctagaggc gtagccgacc agactctcaa 3180 aaaatgtcgg tgttgtggtt taacagcaga aactgattgt cacatcttgg ccggatgtcc 3240 taccagtcgg gatgcgtact cgaaacgtca taacttgctt tgtgataaac tcgccaaaga 3300 gctaagactc aatggtgggc caagcagacg ggtgtggcgc gagaggatgt gtctctctgg 3360 gaatggcagg cgttataagc ccgatattgt tgtgaaagat gatggtgtaa ttactgtcat 3420 cgatatggca tgtccgtacg agaaatcgga aagacaccta agtcaatgcg aagatgcaaa 3480 agttgctaag tacgagccac taaggcttga taggagttgg actcaagaac ttgaggggaa 3540 taacggcaga agtgctaatg aaatatcagt tgtagggatt gcagtagggg cgattggaac 3600 aattacgcgt aaaacccagc ggatacttag caagttgaaa ctggccaagg tcggaagacc 3660 gttacaaata attgcatgta atgaaagcgc ccaaattata agacgacatc tttcgggatc 3720 gagacttaga aatttgcggt gaatgcccga ggtagttggg ataatgatgc acaagctcgt 3780 aaggcgactt gctgcacgta tgccgctaaa cgcttagctc gatgagtgca tgtcaagacg 3840 gtcgggagta tgatcagtgg agctgacttt ccagacaact cacgcggatt cgcgtgcggt 3900 ggatacaaca cctggtataa catatgaagg gttccatcta gtacagggat aacgatccat 3960 gggagcaaac taattagttg gaggtaatcc aacgccgctg ttgagtcagt ttttaaccgc 4020 cagtcaactc ttgtaggtta tcggtcttcg gcagaccttg gaccgcctag cgccggccaa 4080 cagtttgtcg tcgactaaca tgatgatttg cgagagaaac ccacgctttg tcacttatgt 4140 gaggataaaa tctcttgtcc atatgatcct ttgaagggaa cagcgctttg agcttgctcg 4200 gcgttggcac ctttagtctg taatattttc ttgatattat ggacgaaaaa ggtagtatgg 4260 ttgca 4265 // ID ITmD37E_Ele13 repbase; DNA; INV; 1296 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37E DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37E_Ele13. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1296 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1296 RA Kojima K.K. and Jurka J.; RT "ITmD37E-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >93% identical to consensus. TIRs are 27 bp CC long. TA TSDs. This consensus is ~94% identical to the original CC sequence in [1]. This family encodes a DD37E-type transposase and CC is similar to Tx_mos from Toxorhynchites amboinensis. XX FH Key Location/Qualifiers FT CDS 152..1168 FT /product="ITmD37E_Ele13_1p" FT /note="transposase." FT /translation="MASKQEENRRRILRAVMENPTASMRWIGRQLGITHTT FT VSRVVKSFKESQTTERRAGSGRKSKTADPVKARKVLDYFKRNPNLSIRDAA FT LKAKCSAWFVQQTMKRAGLRVFKVRKAPNRRDQQNTAAKSRARKLYREWLT FT KPKCVVMDDETYIKADAKQIPGLEFYVAKSRFDVPEDIRKKKMDKFAKKYL FT LWQAICQCGKLSKPYITTGTINSEIYRTECLQKRLLPFLRSHGGETLFWPD FT LASCHYARSTLDWYRDNNVVFVPKTANPPNSPEIRPVERFWAIMKAKVRKS FT SKVFKNELELKKAWMKVTKKVGSTIAQNVMRGLKGKVRAFSEGKDIE" XX SQ Sequence 1296 BP; 398 A; 268 C; 326 G; 304 T; 0 other; aggcctgtac tcgaaaaaaa tgcaccattt ttatccgtgt gtaacttttt tgtccgtggg 60 taaaaattga tgaaaatctg ggtataacta gattggagtg tattgtttac atgtgcaaaa 120 tttcataaca atcggttcag tgattattga gatggcatcg aaacaagaag aaaatcgccg 180 gagaattttg cgcgccgtca tggaaaatcc gactgcgagc atgagatgga tcggaagaca 240 actgggaatc acccatacaa ctgtctctcg ggtggtaaaa tcttttaagg agtcccagac 300 gaccgagcgg cgggctggaa gtggaagaaa atcgaagaca gctgaccctg tgaaggcccg 360 gaaggtgttg gattatttca agcgtaatcc gaacctttcg attcgcgacg cggctctgaa 420 ggccaagtgt tccgcctggt tcgttcagca aaccatgaaa cgagctgggc ttcgtgtatt 480 caaggtccgg aaggcaccga accggcgtga tcaacaaaac acggcggcca aatcccgcgc 540 aaggaagctg tatcgagagt ggttaacgaa accgaaatgt gtggttatgg atgacgaaac 600 ctacatcaag gcagacgcaa agcagatacc ggggctggag ttctacgttg ccaaatcacg 660 gtttgatgtt ccggaagaca taaggaagaa aaaaatggac aaattcgcca aaaaatactt 720 gctctggcag gccatatgcc aatgcggtaa actgagtaaa ccgtacatta caacgggtac 780 catcaacagc gaaatttacc gcaccgagtg cctccagaag cgtctgttgc ccttcctgcg 840 gtctcacgga ggcgagacat tattttggcc ggatctggca tcatgccatt acgcgcgatc 900 gacgttggat tggtacagag acaataacgt tgtttttgta ccaaaaaccg ccaacccccc 960 aaattcacca gaaattcgtc cggtggaaag attttgggct ataatgaagg ccaaagttcg 1020 aaaatcgtcc aaggtgttta aaaacgagct ggaattgaag aaagcatgga tgaaagtgac 1080 caaaaaggtt ggctccacca ttgcacaaaa tgtaatgagg ggtcttaagg gaaaggtgcg 1140 ggctttcagc gaaggaaagg acattgaata aagtaattat gccgaaacat aatccatatg 1200 tattagttca ttccctgaaa gtttcaagaa aatccgactt aaaataaatt tttggcgaac 1260 acttttgtct ggtgcatttt tttcgagtac aggcct 1296 // ID Gypsy-620_AA-I repbase; DNA; INV; 8288 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-620_AA_; KW Gypsy-620_AA-LTR; Ty3_gypsy_Ele185; Gypsy-620_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-8288 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [6176-6658] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1759..3516 FT /product="Gypsy-620_AA-I_1p" FT /translation="MVRKYYNIDLRTAADQVPVMRLATPQPEAMGATARGV FT QSKGSPASQKATEKPDMDTEVILIETPVKNRLSNISFPQDAEELISFSDDK FT RTLACPQGTPKDRTNQESLTGAFPKQSRIDEGQSDVGLGDVIEGVRPAVVR FT GKLNEFGVGLPLEVPADRVKVGMPGGTSEFRTPFSPLRLDCMWETPEHRSN FT IPNSMLGLQKTVGNPRATTASEAKVLPSASNPSQPMWNPANGKREMDEFVH FT KSEIEGYIKAYLNQVFVPERRYPGMNHPIVDTLVDRIANVGIHDPEVSQIS FT REIDQRQEGNLDPQIQQPGRNPSSGDAGMNNRYFVSGVGLSQPQTAPPLIS FT PFNNGVEPLYPQTNRWGPNDQLGGGWSNSGSRRRLPHQTCNIIEKWPKFSG FT DSNPVPVVDFLRQVEILSRSYQVTPDELRMHAHLLFKDDAYVWFTAYESQL FT ATWDMLLVYLKMRYDNPNRDRFIREEMRNRKQRPNELFSAYLTDLEAMSQR FT MTRKMTNEEKFDIIVENMKLSYKRRLALEPVNSIAHLAQLCYRFDALEANL FT YTPRSGLKPAVNAIGYDDELDRESGDDEEAFVLALQQKN" FT CDS 4559..7030 FT /product="Gypsy-620_AA-I_2p" FT /translation="MVSDCRKVIDEEVERMQKLGVVEECSGPIEFLNPLLP FT IKKSNGKWRICLDSRRLNSCTKRDDFPFPNMVGILQRIQKSRYFSVIDLSE FT SYYQVSLTDGAKDKTAFRTNKGLFRFVVMPFGLTNAPATMARLMTKVLGHD FT LEPFVYVYLDDIIITSLSFEHHCELIRKVAERLRAAGLTINLQKSKFCQKK FT IRYLGYVLSEDGLSMDASKIQPILDYAQPKTVKDVRRLLGLGAFYQKFIKN FT YSDLVTPISDLLKGKRKNFVWTQEANQAFEKLKNALVTAPVLANADFQSPF FT IIETDSSDLAVGAVLTQEQNGDRRPIAYFSKKLSSTQKRYSATERECLAVL FT LAIEHFKHFVEGSHFVVCTDAMSLTFLQTMSIESKSPRIARWALKLSKYDI FT NLKYRKGSENVPADALSRSIHSIAVELPDPYIVDLKRQIEKSPERFKDFKI FT VDGNVFKYISNSTIPEDVTFRWKMLVPKADRSKVIEEVHREAHLGYLKTLT FT KIREKWYWPRMSSDVKQFCSSCEVCKESKTPNINTRPLCGKPKQCARPWEM FT ISIDFLGPYPRSKKGNVWLLVVCDFFSKFVLIQCMKNATAPAVCAFLETMV FT FTLFGAPSVCISDNAQVFKSELFRKLLLKYGVNHWNLAVYHPSPNPAERVN FT RVIVTAIRCTLSKKINHRDWDESINTIAMAIRTNVHDSTGFSPYFINFGRN FT MVSDGQEYDNLRHLIPESGRDRKELDDETQRLYEIVRSNLHKAYEKYSRPY FT NLRANRRHDFQKGDVVYKKNVHLSDKAKNFVGKFGSRFSKALVSEKIGTNT FT YILTDMQGRRISGTFHGSFLKK" XX SQ Sequence 8288 BP; 2500 A; 1623 C; 1809 G; 2335 T; 21 other; taattggacc aataaatgaa tttgaaacta tctcgtgcaa aggtcccaaa ataaggcaca 60 tcgttaacgt ttttttgcat tggttgaaat ccttattgga gtgatctttt tataatttta 120 ataattaatt gaattggtga attgggaaag tagattgaag agttttcttg gtttaccggg 180 agggaattat caggtaagtt gggtcagcga atgggtttaa gatttgtcgt agtgtttgag 240 ccatttaact aactcgcccg agtagagggt ctcaaagccg ggaccataac mgcatcagca 300 gtcttcctcg gttgaggaag tggcgcataa gctgctgttt gtgttgagca agaacwatta 360 cgwtacattt tggcgcccaa cgtggggccc acgatctgaa agtttagaaa caaatttaga 420 aacaaattgg attgcaaatt ttattttgat taagttttgt tgagttttgg ttgagattca 480 aaaaaaaaat agctataaga attaggttca aatgtacata tattttgttc tgggtctgaa 540 ataaattagt tttagttttg atttttgaaa aattatttga aattctcaga attcttctat 600 tatttttttt attactaata attgtgatgg aaacctaccg aatttatgcg aaggatttgg 660 aggaggatga ggttgactat gagttgacga ttcgtggata tccattagag ggtccactgg 720 aaactcgtta taggtcctta cgattggcct tacgtcaacc agaagatgga gacgctgtat 780 taggtactga gctgggtgct gcgaattgaa tcccttttta ttttcaagca aatcttgaaa 840 catgcttctt gtgaagaact gagcttgtcg tattagtccc tcaaacgtca ctttgagttc 900 tctgtttatc aaatttcgct ctctgatctc tctctggaga gcttgaatgc gtcctagctc 960 gaatgacttt ctcattgtga aatcggctaa tagtgggtct gctagaagtc cgtggatcta 1020 attcattttt atgtgattgt gcctttggat tagaacgctg agttcattca aaagttgccg 1080 gagcagcaat tgaaaggatc ttgtcgatgg ttttattatt ccggccgtag caactcgatt 1140 cctagcttca aaccgaataa catcgaaaac gacgattctg attaatcaat ttcggaaaca 1200 ccgatggcgt cagaatctcg aacgctttga ttttgaagat tatatgggtg gccaaaagcc 1260 aaaaggagaa gaaatactca accgacaaaa tgttgcgcat gaagtgtcaa gtattttttc 1320 ttgtgtgtat attaggtaca ctgttgttga agtaagtctt aaatattatt tagactattt 1380 aacaatttgt tgcaatataa agaaaataac aaaaatagat atatttttta tctctgagat 1440 tacaatactt tctcacgaga tcacattact ctctcacaag taacataacc acaatctgga 1500 atgtttggct ccgacattca ggctcccgct tgacaagcaa attgagtcag accgcagcac 1560 ccaggtactg agtattcttt gctcggtgac tttgtagttg tccccataaa gttgaaagaa 1620 attgaagctc agttagaaac cgatgatttg aagtgttttt cgcgacttgt tcattatcat 1680 aaaagagtgc gacggtatgt tccggagacc gatggccaga gacgcaatca acaagagttg 1740 cttgacgtca tcgttcgaat ggttcgaaag tactataaca ttgaccttcg aaccgcagct 1800 gatcaggttc ctgttatgcg attggcgact ccccaacccg aggccatggg agctaccgcc 1860 aggggggtcc aatccaaggg ttcacctgcc tctcaaaaag ccacggaaaa accggatatg 1920 gataccgagg ttattttgat cgagactccc gtgaagaacc gattgtccaa tatttctttt 1980 cctcaagatg cggaggaatt gatcagtttt tccgacgata agcgtactct agcctgtcct 2040 caaggaactc ccaaggatcg gaccaatcag gaatcgttga caggagcgtt tcctaagcaa 2100 tctcgaattg atgagggtca aagtgatgtt ggtctgggcg atgtaatcga gggtgtacgt 2160 ccagctgtag tgcgcggtaa gttgaatgaa ttcggtgttg gtctaccgtt ggaggtccct 2220 gcagacagag tgaaagttgg tatgccaggc ggtaccagcg agtttcggac tccgttttcg 2280 ccacttcgcc tagattgtat gtgggagact ccagaacata gaagcaacat cccgaatagc 2340 atgttgggat tgcagaaaac agttggcaat cccagagcta caactgccag tgaagcgaaa 2400 gtgttgccta gtgcatcgaa tccatcacaa ccgatgtgga atccggccaa cgggaaaaga 2460 gagatggatg agtttgtgca taaatcggag attgaagggt atatcaaggc atacctcaat 2520 caggttttcg ttccagaacg ccgttaccct ggaatgaacc atccgattgt ggataccttg 2580 gtcgatcgaa ttgccaatgt tgggatccat gatccggaag tttcccagat atcaagggaa 2640 atcgaccaac gtcaggaagg taatttggat ccacaaatac aacagccagg gagaaatcct 2700 tcgagtggtg atgctggaat gaataataga tattttgtgt caggcgtagg gttgagtcaa 2760 ccgcaaactg caccaccttt aatatctccg ttcaacaatg gagtcgagcc actctaccct 2820 cagacgaacc ggtggggtcc taatgaccag ttaggtggtg gatggtctaa ttctgggtcg 2880 cgtcgaagat tgccacacca gacatgtaat attatcgaga agtggccaaa gttttccggg 2940 gattctaatc cagtcccagt ggtcgatttt ttgcgacagg ttgaaattct cagccgttca 3000 taccaagtaa cgccggacga gcttcgaatg catgctcatc tgctttttaa agatgacgct 3060 tatgtatggt tcacggctta cgaaagtcaa ctagcaactt gggatatgct cctagtatat 3120 ctcaaaatga ggtatgacaa cccaaatagg gaccggttta ttcgagaaga aatgaggaac 3180 cggaaacaga ggcctaatga gttgttcagt gcctatttga ctgatttaga ggctatgtcg 3240 caaaggatga cgcgcaaaat gactaatgag gaaaagttcg acattatagt ggagaatatg 3300 aaactctcct ataagcgcag gttggcctta gaacccgtaa actctattgc acatctggca 3360 caattgtgtt acaggttcga tgccttagag gctaatttgt acactccgag atctggatta 3420 aaaccggcgg tgaatgccat aggatatgat gacgaattgg acagagaatc aggagacgat 3480 gaagaggcct ttgtactagc tcttcaacaa aaaaattnca agaaaaacag ttcgagcggt 3540 ccaacatcta aaccaaaact tgggaaggaa agtaatcaat tactttgctg gaactgccgc 3600 aagagtggac acatgtggcg cgattgtgat cgtaaaaaag gaattttctg ccgtatatgc 3660 ggaacaccgg acactaccgc gtatcgctgt ccagagaatc ataatttgag acctcgagac 3720 tcatcacccg aggaatcaaa aaacgagtag atccagggat tcacgggaac aaatcgcctg 3780 agaacgccga tgatactgtt ccccctcgtt cgttcgaata cttctgtcac agttataata 3840 ttaacacctg cttcagacga tgtccacatc tacgagtcaa aattttagat gaggaaataa 3900 taggattggc tgacactgga gcaggagtta caataatcaa ttcggtcaat ctaattgaaa 3960 aattaggttt gaaaatccta aaatgcaata ttcgcattaa aaccgccgat aacacggggt 4020 acacctgcct tggatacgtg aatatccctt acacatacgg aacaagaaca tgtgtagttc 4080 caacaatagt agttcctgag atctcaaaac ctttgatatt aggtatagat ttcttgaaca 4140 cattcgattt taaactgatg gcacctggaa ttccagaaaa caccagttca gcccaagaac 4200 agaacccaac tttcgataga catgaagtac cgctcatgat agcagaagat ttcttctcag 4260 atgacgagca aactgtatgt tttcaaattg aatccgctga tcaagacttg tctgtcggac 4320 cttcggagat agatgggagt ttggagatgc ctacgataga gataccttca acaatcatta 4380 aatctccttg ggatatcgat acggaacatt atctaactga aaaacaacgc catgctttgt 4440 ttgaggcagt gacgaatcta cccgctacgg ttgaaggcag tttaggtcgc acagccatct 4500 tggaacattc aatagatctg ttatcaggta ccagaccccg tagactaccc atgtataaat 4560 ggtctccgat tgtcgaaaag taatcgacga ggaagtggag cgtatgcaaa agcttggagt 4620 agtggaggaa tgctcaggcc caattgagtt cttgaatcca ctactaccaa tcaagaagtc 4680 gaatggaaaa tggagaatct gtctcgattc aagaagactg aactcgtgca cgaaaaggga 4740 cgattttcca ttccccaaca tggtgggtat tctgcaaagg atacagaaat cgcgttactt 4800 ttcagtaata gacctgtcgg aatcgtacta ccaggtgagc ctgacagatg gcgcaaagga 4860 caagacagcc tttcgcacca ataagggact gtttcgtttc gtggtgatgc ctttcgggct 4920 taccaatgcg cctgcgacta tggcacggct aatgacaaag gtgctaggcc acgacctaga 4980 accttttgtc tatgtatatt tagatgatat aattatcacg tcgctttcgt tcgaacatca 5040 ttgcgagctg ataaggaagg tagccgaacg tttacgtgcc gcaggactca ccataaacct 5100 gcaaaagtcc aagttttgcc agaagaaaat acgttaccta ggatacgttc tttcagaaga 5160 tggcttgtcc atggatgcat caaaaattca gcctatcctc gactacgcgc aacccaaaac 5220 tgtaaaggac gtgagacgac tgctgggtct tggagcgttc tatcagaaat tcattaagaa 5280 ttattcggat ctcgtaacgc ctatttcaga cttactgaaa ggaaaacgaa aaaatttcgt 5340 ttggacgcag gaagctaatc aagcttttga aaagctaaag aatgcattgg ttactgcacc 5400 agttcttgcg aacgctgact tccaatcccc gttcatcatt gaaacagact catcggattt 5460 ggcagtaggt gctgtcctga cacaagagca gaatggtgac cgccgtccca tagcttattt 5520 ttcaaaaaaa ttgtctagta ctcaaaaaag gtatagcgcc accgagcgag aatgtttggc 5580 ggtgcttctt gctattgagc acttcaagca cttcgtggaa ggaagccact tcgtcgtttg 5640 tactgatgcg atgagtctta cattccttca gacgatgtcc atcgagtcga aatctcccag 5700 aattgctcga tgggcactga agttgtctaa gtacgacatc aatctaaaat atcgcaaagg 5760 gtcggaaaat gtacccgcgg atgcactttc gcgaagtatt cactcaatcg cagttgagtt 5820 accagaccct tatatagtgg atctgaagag acaaattgaa aaatcccccg agcggtttaa 5880 ggattttaaa atagtggacg gaaatgtctt taagtatatc tccaactcaa ccatccctga 5940 agatgtgact tttcgttgga agatgctcgt cccgaaagca gatcggtcaa aagttatcga 6000 ggaagtacat cgcgaagctc acttaggata tttaaaaact ctaacgaaaa tacgtgaaaa 6060 atggtactgg cccaggatga gcagtgatgt taaacaattt tgctcttcat gtgaagtttg 6120 caaagagtct aaaactccaa acatcaatac taggccactt tgcggaaaac ccaaacagtg 6180 tgctagacca tgggaaatga tttctataga tttcttgggg ccttaccccc gatctaagaa 6240 aggaaatgtt tggcttctcg tggtgtgcga tttcttctcc aagtttgtgc ttatacaatg 6300 tatgaagaac gccacggcgc ctgccgtatg tgcctttcta gaaacgatgg tctttaccct 6360 tttcggagct ccatctgttt gcatctctga taacgcgcag gttttcaaat cggagctgtt 6420 tcggaaactc ctacttaaat atggagttaa tcactggaat ttagctgtgt atcatcccag 6480 tccgaatcca gcagagcgtg taaatcgcgt tattgtaaca gcaattcgtt gtacgctaag 6540 taaaaagatc aaccaccgag actgggacga atcgattaat accattgcta tggctatacg 6600 cactaacgtt catgacagca cagggttttc accctatttt attaatttcg gccgtaacat 6660 ggttagtgat ggtcaggaat atgataacct gcgacatcta atacccgaga gcggacgaga 6720 tagaaaggaa ctagatgacg aaacccaacg gctttacgaa attgtgagaa gtaatttgca 6780 caaggcttac gaaaagtatt caaggccata caatttacga gccaatcgcc gtcatgactt 6840 ccagaaagga gacgtcgtct acaaaaagaa tgtacatctc tctgataaag cgaagaattt 6900 cgttggcaag tttggtagca ggttctccaa agcactagtt tcagaaaaaa tcggtactaa 6960 tacctatatc ctcacagaca tgcagggtcg taggatttca ggtactttcc acggctcatt 7020 tctaaaaaaa wcstaaaaga aataaacttc agctatgact gcattcccac cggagaatgc 7080 atacactaat ggaataacca taaaccaatt caaaatacac tttgtggtgc atcggtttaa 7140 catgacttga gaagtccttt gagtttcctc agtcagtata cggttcattg atcaagttgc 7200 agcaatgtct acaatgaacc ctttcgtagc tgcataagat aaagctatga atgagtcact 7260 acggctctga ccattgatga aaatcactaa aacactcatt gagtaaatgc cmaatggatc 7320 tagtcgagtg acaaacttgc atacggcaca atttcctcct agataccaat aatccmatcm 7380 ttcgtaggta gtaaaattgt tttaatttcc ccaacaaaac atcggagtca attcagtgcg 7440 tcacgcgttg ctccacatta aaaatcactt tctttacaca actttacctc actccttcgc 7500 aaaagtagat gaccttcgat tttccatagt agtagccagt tgaaactcca attttctccw 7560 tagcttccat agtttctcca taattttcca ctccgcgtgc attttttccc agcttgcatt 7620 tccatacctg ttaaaawttt tgtkttctgc wgataacagt gacattaagt tgacagmgcg 7680 atgttttctt tttgcaattg gcgatgatgt gtcctcgtca gttttaggtg acgggtgtac 7740 ccgttcatgg aatgagacga atgcttcaag kaagcgtttg tttgaaaggc acacaaagtg 7800 cagtaatgga agatgttttg ttaatttctt aaatcatttt gcgaaatttc tcagatcttt 7860 ctgaggtggt ttaggctagg aataatatgt tttttttttc aaatattttt ggagttagtc 7920 tgatatgwtg gttggtagtg ttaagtatat ttccaaatat ttttggagtg aatcggwaat 7980 tatagatggt agttgaattg aattaaggaa gcaaatgtta tggaaattag caaagcatag 8040 gattgatkgt attgagaaag taacwaatat tcgaacgtaw ttttaaggac acagaacaaa 8100 atatttgaag ttttctwttt gttgattaga tttttcaagg taaacatatg ttcatcgata 8160 gcgaaataag atcattaaat ttcaaaaata acaatattaa gctttaaaaa ataaataaat 8220 aataattaat aaaaattttg aaatttaaat aatttcaaaa tttttatttc atttcgtata 8280 ggcgagaa 8288 // ID DNAX-4_Tad repbase; DNA; INV; 207 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 3) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-4_Tad. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-207 RA Jurka J.; RT "DNA transposons from Trichoplax adhaerens."; RL Repbase Reports 9(10), 2146-2146 (2009). XX DR [1] (Consensus) XX SQ Sequence 207 BP; 83 A; 35 C; 35 G; 54 T; 0 other; ataagataga gaagacctgg acccccgcga aaaataataa caaaaagata ataaaaggaa 60 ataaaaaatg aaacactaaa aatcggatat atacgccgcg cgagcgcaaa gaaatatata 120 tatcgtttcg actctgaaat tttctatata tatttatgca ataatgtatg cgcgattcgc 180 gattggtcca ggtcttctct atcttat 207 // ID DNA8-20_AP repbase; DNA; INV; 454 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-20_AP. XX NM DNA8-20_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-454 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1762-1762 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 454 BP; 156 A; 79 C; 71 G; 147 T; 1 other; cagggactgg aaccgaacct aaaagaaccg gttctggaac cggaaccttt aaaatattaa 60 gaaccggaac cgaaaccttt attttaatat aataaagtac cggaaccgaa ccggaatttt 120 taaattaaat aggtnctttt aggttcaaaa ataaaataat tttatttatc ataatttaaa 180 ggtattactg ttttgtgtat aaactatgta tgatcatatt atgagttttc taatatttct 240 catactatta attaaacccg cattccggat aggtatttaa aattattata cttttcggta 300 cctaaattat ctggaaccgg taccgaaatt attcggaacc ggtaccttta ttttttttca 360 tcaaagaacc agaaccgaac cggaaccgtt attttatttt tggagaaccg gtaccggaac 420 cgataccgat aaattcaaaa ggttccagtc cctg 454 // ID Gypsy1-NVi_I repbase; DNA; INV; 5219 BP. XX AC DS265623; XX DT 03-NOV-2007 (Rel. 12.11, Created) DT 04-DEC-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy1-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5219 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1168-1168 (2007). XX DR Genome; DS265623; Positions 469892 475110. XX CC Positions [2641-3090] - Reverse transcriptase CC Positions [4129-4641] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2611..5061 FT /product="Gypsy1-NVi_I_1p" FT /translation="MALKKDGSYRPCGVYCALNAQTVPDKYPTAHLYDCTS FT ELHGKKIFSSLDLLKAFNQIPIAPEDIEKTAIITPFGLFEFMYMTFGLRNA FT SQTFQRYINRVLGDLKFVFIYIDDILIASESVEEHFEHLRIVFERLNKFSL FT RINVDKCTFVVEELVFLGYSINSQGIRPTQEKVKAVLNFPKPNTVVELRRF FT LGMVNFYHRNLPHAAEAQVPLNAYFRESRKNDKRPIEWTAETSKAFEQIKS FT DFANASLLVHPRCGAELRLVTDASDVAMGAVLEQKSLSNEWEPLAFFSQKF FT TPAQQLYSTYDRELTAMFEAVKYFSYIVEGCDFAILTDHKPLIYALVQNNE FT KAPPRRLRQLGYISQFTTRIEHVKGSDNVVADALSRVETIRFPLEFDLADL FT AAKQEADQQLKEICESPNYPLTLKRLQLGPEHTVIYCELTGESLRPYVPQS FT LRRSVFEFFHNTAHPGPKVSDRLIRQRYVWPNMHRDIAAWCKNCLACQQSK FT ISRHVKTFPEHFVAPDGRFDHVHIDIVGPLPVRDGYQYLLTMIDRFSRWVE FT AVPLQETSAQTVARAFFDTWVSRFGAPKVITSDQGAQFESRLFTALLSLIG FT CERIRTTAYHPAANGMIERWHRTLKAALMCHKDDDWLRTLSTVLLGLRCHV FT RADTEASPAEFLYGTTLRLPGEFFIPEDITPDPQIFVEEFREHMRLVRPIP FT VAHHYKKRIFYFKELHNCTHVHMRNMAKKSLERPYSGPYKILSRESDRVFK FT IEVNGAPRTVSVELLKPAFFIAENLVDSGGVSNDSSKGNQPPVPSPVLKTY FT PAKKIRFSSDTKAQA" XX SQ Sequence 5219 BP; 1255 A; 1405 C; 1181 G; 1378 T; 0 other; ttggtgaccc cgacttcaag aagacgagcg agtggactct cgccccgccg tctccaggag 60 caccgctttt tccggaattg catcatccac gccggcggct ttcttgccgc ggtcagcctg 120 gctaggcgat cgccgtgcac cgctgccgtg ttcatcattt tcttcagcgg ccttgttgcc 180 gcggtcagcc tctcaggcga ccgccttatc cagtcttcga gctcttcctc gtcaacggcg 240 ttcacgccgc ggtcagcctt tcaaggcgac cgcccctgca gcaacgtagt gcagcgtctc 300 gtcggcgtcc ttgccgcggt cagcctagag gcgaccgcct gtgcagcatc atcgtccagc 360 tcttcagcgg catcataagc cgcggtcagc ctctcctggc gaccgcattc tccggcttcg 420 ccgatcatcg ttagcggctt tttagccgcg gtcagcctct caaggcgact gcctcgcagc 480 catctcctgc agcatcacta ccagctactt ctcgtcggtc ggcggcgcac atctcgcagc 540 agttcgacct ggacatcatc cacgagtctt caagcatcgt tagcggcttt cagcgcattc 600 tcgcaaggta tctcagtgtt agctttaagt tcgattccaa gaattcattc aaattgaatt 660 tggttcgctt ccgagaattg ggcggccatc ttggatttta gcgcgcggtt ttctcgtggc 720 gctcgcgtct cacgggttgc ggtgttcagg cgcggcgctc gtgcatcacg ggttgcagct 780 gttaggcgcg tgctcgcgcg ggttgcgctc agcgctcact cttcgcggaa tttaattaga 840 gtagcggttc cccaaccatc gcaaatttca cgcagcggcg ttttaatttt aagaatttgt 900 aaccgggcgt gcgttcttgt tcacgatctc gtctgcgcac gttgatcgta aatttaaggt 960 tagttctagt tttaagttag cttttctcac atttgtgata cgcctgcgtg cgtgcagtag 1020 gtcaatttgt actacgtact gcgacgtata taggtagcaa tccataaaat gccggaatca 1080 tcagaggaaa gagcagatcg cctgcagggc agagcttgat gcggccaggg ctgctgcgaa 1140 ccaggcaccc ggtcgtgttg aggcgtaccg tactccaaaa ctcccggtgt ttattaaggc 1200 ggatccagcc atgtggttta cacaggtgga agcctcgttt cgtcacgcgc acataacggt 1260 tgagggaacc aaagcagatc acattatcgc agcgttagat catgaggcaa tgactgccat 1320 tagagacatt gctctcacag atccccaacc tcctgacgtt tacacgcaga ttaaacagag 1380 attagtttct tcctttgcgg cctcggcaga gagcaagctc cgtaggcttc tcaaaggcca 1440 agtccttaat gatggcaaac catcactcat tttaaacagg ctcagaagct tagacgacgg 1500 cgccaaatgc gatgacgcaa ttatcagatc tgtctttctt gaacagcttc agcccaacca 1560 cagagcaatc gttctcgcgt ccgacatcag cgatctcaac aaattagcga ccttggcaga 1620 caaaattgtt gagaactcac cgtccgattc tcgcttgtcg gcggtcaatg tcaactcaga 1680 catcgcgtcg ttagcttcag aagtcaagcg ccttgctgac tccttcgaca aagtttccac 1740 tagattagga aagctagaaa actcgtttaa agcttccaaa ggtgaggatc gatcaggttc 1800 caattcaggc gaattcaaac gtccttcacg ttcgcgttcc agttctcgac attcatcagg 1860 actgtgtttt gcacacagaa agtatccaga cagtcccact tcatgcaaga agtggtgtac 1920 agagtactct aagtgggctt caaaaaacta aatgaacctt tccttgctga gacgtcaggg 1980 gatggttcct ccggttcgaa acgtctcacc cttaaggacc taaaatcagg tcgcaggttt 2040 cttatagatt cgggtgcaga aatctctgtt ctgccagcca gttacaaagt caattcaaaa 2100 ccctcatccc gcaagcttta tgcggccaat gacacgacta ttgacacatt tggggaaacg 2160 ttcctattgc tagatttagg tctaaaccgt cccatctcgt ggaattttgt tatcgcgtcc 2220 gtgcctcatg cgatcatcgg ggctgacatt ttataccact acggtttaac tgtagatatt 2280 cgcaaccgtc gtctcgttga ctcggttacg tcactatcat ccgtaggttt aatcaaaata 2340 gttccagttc tcggaatcca ttcggtagct tccagctcca aatgcgctca gctgttagcc 2400 cagtttccga aaataactgg ctcaggtccg catgatcccc agttcaaacc ggacattgtt 2460 caccacatct ataccacagg ccctcctgtc tctgagcgtc cacgtcgact ttccgcggaa 2520 aagctcagag cagcgaaagc tcaattcaaa gcctggcagg atgccggcat ctgtaggcct 2580 ggcagcggtc cgtacgctag ccagttacac atggccttaa aaaaagatgg atcttatcgt 2640 ccttgtggcg tgtactgtgc tctaaacgca cagaccgtcc ctgacaaata ccctacagcg 2700 cacttatacg attgtaccag tgagttgcat ggaaagaaaa ttttttcttc gctagatctc 2760 ttaaaagctt tcaatcagat tcctatcgcc cccgaagaca ttgaaaagac tgcgattatc 2820 acgcccttcg gtttatttga attcatgtac atgacctttg gtctgcgcaa tgccagccaa 2880 actttccaaa ggtacatcaa cagagtgtta ggagatctga aatttgtatt catctatatt 2940 gacgatatcc tgatcgcctc agaatccgta gaagaacact tcgaacactt gcgtatagtg 3000 ttcgaacgtt taaacaaatt tagtttgcgc attaatgtcg ataagtgtac atttgttgtt 3060 gaggaacttg tatttttagg atattcaatc aactcgcaag gcattcgtcc tactcaggag 3120 aaggtcaaag cggtgttgaa ctttcccaaa cccaacactg tagttgagct gcgtcgcttt 3180 ctcggcatgg tgaactttta tcaccgcaac ctccctcatg cggctgaggc ccaagtccct 3240 cttaatgcct actttcgcga gtcacgcaaa aatgacaagc gtccaatcga atggactgca 3300 gaaaccagca aagccttcga gcaaatcaaa tctgattttg ccaatgcttc cttgcttgta 3360 catcctcgct gtggtgcaga gcttcgcctc gtgacagatg cttctgacgt agcgatgggc 3420 gccgtgctag aacagaaatc actctcgaac gagtgggaac ctttagcttt cttttctcaa 3480 aaatttaccc cagctcagca gttgtacagc acctacgaca gggaactcac tgcaatgttt 3540 gaagcagtga aatatttttc ttacattgtt gagggctgtg acttcgcaat tcttaccgat 3600 cacaagccac tcatctacgc ccttgtacaa aacaacgaga aagctcctcc gcgtcgcttg 3660 cgccagttag gatacatttc tcagttcact acccgcattg aacacgttaa aggttcagac 3720 aacgtagtgg cggatgctct ctctagggta gaaaccattc gtttccccct agaattcgac 3780 ctggccgatt tagcagccaa acaggaagcg gatcaacagc taaaagagat ttgtgagtct 3840 ccgaactacc ctcttactct taagcgtctc cagcttggtc cagaacacac agtcatttac 3900 tgtgaattga ctggcgagag cctgcgtccg tatgtacctc aatctttgcg tagatcagtg 3960 ttcgaatttt tccacaatac cgctcaccct ggtcccaagg tgtctgatcg tctcatccga 4020 caacgatatg tatggccaaa catgcatcgc gacattgctg catggtgcaa aaattgtctt 4080 gcctgtcagc aatccaaaat ctccaggcat gttaaaacgt ttcctgagca ttttgttgct 4140 ccggacggcc gctttgacca cgttcatatc gatatcgttg ggccgttacc ggtgcgcgac 4200 ggttaccaat atttgctaac catgatcgat cgtttctcta ggtgggttga agctgttccc 4260 cttcaggaaa cctcagccca gaccgtagct cgcgcattct ttgacacatg ggtgtcgcgt 4320 ttcggcgcac ccaaggtcat cacttcggat caaggagcac aatttgagtc gagactcttt 4380 actgcattat tatccttgat cggttgtgag cgcattcgca ccaccgcgta ccacccagcc 4440 gctaacggca tgatcgaacg gtggcatcgc acgttgaaag ctgcattgat gtgtcacaaa 4500 gatgacgact ggctcagaac tctctccact gttctattag gcttacgttg ccatgtgaga 4560 gccgacaccg aggcttctcc ggccgaattc ctgtacggta ccacgctacg gttacctggc 4620 gaattcttca ttccggaaga cattacgccc gatccgcaaa tctttgtaga ggagttccgt 4680 gagcacatgc gcctagttcg tcccatacca gtcgcacatc actacaaaaa acgcatcttc 4740 tattttaaag agcttcacaa ttgcacccat gttcacatgc ggaacatggc caagaagtct 4800 ctcgaacgac cttactcagg tccatataag attctgtctc gcgaatccga tcgtgtattc 4860 aaaatcgagg tcaacggtgc ccctcgtacc gtttcagtag agcttttaaa gcccgcattc 4920 tttatagcgg aaaatttagt cgattcagga ggagtatcca acgacagttc gaagggcaat 4980 cagccacctg tccctagtcc ggtactcaaa acttatcctg ccaaaaaaat aagattctcg 5040 tccgatacca aagcgcaagc ttagctttgc gattcagttc ataagcagcg tttgtatttt 5100 tatactcttg tattaacagt tgtaagcaac caatgtatat atgttcatgc gattcttggg 5160 tctcaaccct tctggtgcag agcttgcaag ttgatcagca agaatctcgg gggggagtg 5219 // ID Gypsy-133_AA-LTR repbase; DNA; INV; 1718 BP. XX AC AAGE02025031; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-133_AA_; KW Gypsy-133_AA-I; Gypsy-133_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1718 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025031; Positions 10285 12002. XX SQ Sequence 1718 BP; 580 A; 362 C; 391 G; 385 T; 0 other; tgtagagaca ggatgaaata aataaaaaat aataataata ataattataa taataaaaat 60 aataataata catatataat ataaaacatt ttgatttccg tgtaatcagc caaaagcgta 120 tcgacaataa accacccaat actaaaattg agcccactgc tatttcattc atgtcaaaag 180 aatcattccg attggccctc caataggaag cgaggaagaa aaatccacga gtataaatat 240 gaccctgtct tctgtggaac gatcactttt tcggcgaatg tggtagagtg aagaccacta 300 gcagcagact agtagcagca gaccagtagc agcagtccag cagaaagaag caagccgcaa 360 ccaccgttaa agagttcggt tggattgttc cgcttgcagc accgattttg cgacggtctc 420 atccacctcg cttatcggct tcggacggaa acaagctgca accatcagtg ttggttgttc 480 cgcttgcatc caccttggaa actctccgga attaccgttg attgtttcta agatggcgga 540 ccaacagtgg aaaatagaaa atgcgacggg tcgcgacaat ggcgcagcta acgaaccaac 600 aatccccaag aagtaagtat gaaaaaaaaa actgtttaag caaactctgg tggagataga 660 cagagttctg cagattaaaa ggggccccgg tagagataga cggggcgccc aatatttcac 720 gcaaaaaaaa aaaatgaata aatggcatgc tctgatggag aaagacagag ttttgcaatg 780 ttaggagctc cggtggagat agacggagct tcaataattc ttatttacat gaggtaaagc 840 aaaatattga caaagggata ttttctctgg ttttatttca gaaaaatctc caaaaagcag 900 attttgcgtg aattgaaaga ggcggtcgag aagaataatg ttttgcgtga gcaagtgatt 960 caactaacga ccggtacgga tcagaacccc gaaatcggac agcctagccc cagcgtagcg 1020 cggcaggatc caatgttgct atcgacgatg aacaactgga cacttagcac cctcagcatt 1080 ccagagtgta ccccatcaca aggcgaaacc gacatcgaca agcaagcgtt tgaatactgg 1140 aaagacatcc tagtatcttc tcttcagctg gccaatgctg tcgatgaaca cacaaaattc 1200 ggcgtattta agataaaatc cggaccgaag ctacgtgaga tcttccaagc cacatcatca 1260 tctgcaggaa tgcctgatga acagagagaa ccgttttcca acgcaatggc acgactgaat 1320 gggtatttcg gatccaggaa ttacacactg tcacagcgag ggaaactcat gatgatgggc 1380 caatcagata ccgaaagcag tattaatttc gttcgtcgtg tagctacggc cgctaagctg 1440 tgtaattatg agccagatca agaaatggag gccgcggtaa gagttatctc aaaaggtgcg 1500 aatgatgcga aagtccggaa actagcgtac cgcaactggt caaaacaagg ttccttgaaa 1560 gaccttattg atcttgtacg cgaccacgaa attgagaaga cgaatgaaga agagtttcaa 1620 agaacgcgga accgtagtga gaccatgaca ttatcggcgg tttctcaaga tcgcacagag 1680 ttccgaggac atcgtcaagc attcaactca aactggca 1718 // ID SMAR24 repbase; DNA; INV; 2695 BP. XX AC . XX DT 08-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR24. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2695 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1082-1082 (2007). XX DR [1] (Consensus) XX CC This family is highly divergent from other Mariner elements. It CC doesn't have typical TSDs (TA). XX FH Key Location/Qualifiers FT CDS 612..2396 FT /product="SMAR24_1p" FT /translation="MSTHKSHHTKKQYSVKEKLGVLDEIKTKSIRATGKAH FT NIPESNIRKWKQQEKELRALFQNNDIKVRERKRLIGGGKKPQFENLEQLLY FT EFVIDRNKLGLVVKSKYIQIHAKTLRDDYVEDINLTLNEEDLDEDLKKELN FT LKKTEFQNFKASDSWCANFKKRFNLVRRTQTGCRKLPENFQEISCDYVAEA FT REIIIKHKVKPENVLNFDQIPRLFEHEPSNTIAIKGSKEVFLKKASTSHKK FT FTYTPTINAKGEFVNQHLLFSKLKNAPKVSPGFSVDVNLTGMWSEDLLKKY FT LREIILMRPATAIFKQATLLILDSYPVHVKVVNELRDKYADLFKVYFLLVP FT PNFTGLLQGIDVSLGKSFQGSYDEFYDAYVAKALNDKSLQTKKGNIRMPTY FT IELSSWIKEWSKGRSADDIAKCFRVVGAHPDVNPLKNPESLHEPLRKCFDK FT SFDFDKWLEQHQDLLEVPSNNFFDLNNDEWGFAKKPDSLWMTLYECTKSTD FT KFKSWKQSHIEKIVEFIHGDPILHGLYDDEEMVKFEKEVLSDSRVELSAFS FT KVLGRKLIFIVLNDSMEEEEHLTYENEQESEPIKILLFNNSYAYKKDD" XX SQ Sequence 2695 BP; 968 A; 413 C; 494 G; 820 T; 0 other; ctaccgtccc ctcccgtgga gtttgcgcgc gcaatctagt cgagtttggt gctttctggc 60 aaaaattgaa aatcggaatt ttttttttca atagtttgcg catctattta ttttgcgcgc 120 gcaatctatt caaatttgtg tttatttttt taaataaaaa aaatcaatcg agtattaatt 180 tttgaacagt gttggaaaaa ataagtgaca aattaaattc atcagtgacg ataatgatcc 240 gatggggtat aagttttgga aaccatacgc tgttgttggt aggaaaaata cagcctggcg 300 atgaaatttc aagattctcg ataattaatt gaaactttta atagtttccg gaagttctga 360 taaaattttc aaacatttga tgaaattcag aagattttat gaaatttcaa agttggtaat 420 tgaatttcga taagttattg ttattttata aatttatgga ctttagatag ttttgtagaa 480 tttctgagaa agtttgtgaa tattttaaga actttcgagt tattcagaaa attttagatt 540 ttttaattgc atttcgaaaa ttttatataa aatcttcata acttcattaa aatttttcat 600 tttcaggcaa aatgagtaca cacaagtctc atcacactaa aaagcaatac agtgtaaaag 660 agaaattggg ggttttggat gaaatcaaaa ctaaatccat tcgtgctaca ggaaaagctc 720 acaacattcc cgaatccaac attcgaaaat ggaaacaaca agaaaaagaa ctgagagctt 780 tgtttcaaaa caacgacatc aaagtaagag aacgcaaacg attgattggc ggtggaaaaa 840 agccacaatt tgaaaatttg gagcaattat tatatgaatt tgttattgat cgcaacaagc 900 tgggtttagt cgttaagtct aaatacatac aaattcatgc aaaaactttg agagatgatt 960 atgtagaaga tataaatctg acactaaacg aggaagattt ggacgaggat ctgaaaaaag 1020 aattgaatct taaaaaaaca gaattccaaa atttcaaagc gtctgactca tggtgcgcaa 1080 acttcaaaaa aagatttaat ttggtgagaa gaactcagac gggatgtcga aaacttcctg 1140 aaaactttca agaaatttcc tgtgactacg ttgcagaagc tcgagagata ataattaaac 1200 acaaagtgaa gcctgaaaac gttctcaatt tcgatcaaat tcctcgtctt ttcgaacacg 1260 agccatcaaa tacaatcgca atcaaaggta gcaaggaagt ttttctgaaa aaagcttcca 1320 catcccacaa aaaattcact tacactccga caattaatgc aaagggtgag ttcgtcaatc 1380 aacacttgtt gttttctaaa ttgaaaaacg ccccaaaagt tagtccagga ttttcggttg 1440 acgtgaatct aacaggcatg tggagcgagg atcttttaaa aaaatatttg agagaaatca 1500 tcctaatgag acctgccaca gcaattttca agcaagcaac gttgttgatt ttggacagtt 1560 acccagtcca cgtaaaagtt gtgaacgaac ttagagataa atatgccgat ttattcaagg 1620 tgtacttttt attagttcca ccgaacttta ctggattatt gcaaggcata gatgtatctt 1680 taggcaaaag ttttcaagga tcatacgatg agttttatga tgcatatgta gccaaggcac 1740 tcaacgacaa atccttacaa acaaagaaag gcaacatcag aatgcctact tacatcgagc 1800 tttcaagctg gatcaaggaa tggtcaaagg ggcgttccgc cgatgacatt gcaaagtgtt 1860 ttagagttgt aggcgctcat ccagacgtaa acccattaaa aaatcctgaa agcttgcatg 1920 aaccattgcg aaaatgcttc gataaaagtt ttgattttga caaatggttg gaacaacatc 1980 aagatcttct tgaagtccct tcaaacaact ttttcgattt aaataatgat gagtgggggt 2040 ttgctaaaaa acctgattcg ctttggatga cactttacga atgtaccaaa tctacagata 2100 aattcaaatc gtggaaacaa tcgcacattg aaaaaattgt tgaatttata catggtgatc 2160 caatcttgca tggattgtac gacgacgagg aaatggtcaa atttgaaaag gaggtattga 2220 gtgatagtag agtggaactg agtgcttttt cgaaagtttt gggaaggaaa ttgattttta 2280 tcgttttaaa cgattcaatg gaagaggaag aacatttgac atatgaaaac gaacaagaat 2340 cagaacccat caaaattttg ttattcaaca acagttatgc ctacaaaaaa gacgattaaa 2400 gatattttgt aaaatttgta ctaaaacagg gaaatacaaa aaataaacag taaaaaagtt 2460 atttatttca aaatttagct ttttaattaa tttttgaaaa tgtctagatt tgaagttttt 2520 tttaccatat cgatagaaaa ataaattttt ctgcgcattc tattcaaatt ttttaaaaaa 2580 gagtttttca gaaaatcaat gttcacattt tatacttcgt ttagattgcg cgcgcaattt 2640 attcgaatag tttgcgcatg attgcgcgcg caaactccac gggaggggac ggtag 2695 // ID MuDR-7_TV repbase; DNA; INV; 2587 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE MuDR DNA transposon from Trichomonas vaginalis. XX KW MuDR; DNA transposon; Transposable Element; MuDR-7_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-2587 RA Kapitonov V.V. and Jurka J.; RT "MuDR DNA transposons from protozoans."; RL Repbase Reports 8(12), 1817-1817 (2008). XX DR [1] (Consensus) XX CC The MuDR-7_TV consensus sequence was derived from multiple CC alignment of 10 copies ~0.3% divergent from it. MuDR-7_TV copies CC are usually flanked by 10- or 9-bp TSDs. MuDR-7_TV contains CC imperfect 32-bp TIRs (3 mismatches) and codes for a 552-aa MuDR CC transposase. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 816..2495 FT /product="MuDR-7_TVp" FT /note="MuDR transposase." FT /translation="MEPVIFNTMELLQFGIQEEIKASMGYSQKCRKEYSTI FT EYKCKYPQCQAKFKVRIIDQDKYELLIHSDSHDHSAPPKSTKTISSLFVRN FT YLKHYFENNMNHATAQLKCFQDLNISYTDEELLSIFAQGPDALRKFAQRAE FT IVSKRFSSDEKVSLAIFVESVQESCPEDLIEFISEPGKLVFLYAPFEAKAL FT GHQISTFHIDSTYKLLRANIPLYALTGKTEHTIVFPFLYFFISPDTSENIQ FT YCVCRYFQYINREPEMISMDCAPQIAEAIERGIPNCHILWCGVHVLRAILR FT KADKFKSRDNFEEFYNLMKLLVFDTKEESPEEEQNQEEENPREEPAQSHLL FT QAEEIEEEEQYEEEFIENEFQPSQIFLPRESSDDPNDSEFIPEEETFETSE FT EVPEHLTQEEAADLDAVYDRILEILSTEPKAQKYFDRQWRHHLDRWNKRYR FT NGGDATNNVSESHFKVLKHSYFPERRNIRLDDFVVELYTTVVPALLIKFKV FT QLINSDKVVRILRKANDHEQILENKKVECLKMLEAIYSSLQQGSIDSNLVY FT CTLKVLLKKKQLM" XX SQ Sequence 2587 BP; 952 A; 441 C; 406 G; 788 T; 0 other; gagaaaggga caaactatcc agtgtcactt ttcaaaggaa taaaaaaaat aataaagatg 60 aaataataac atgaaatctt atataaatat aattttatct aaatagataa atgaattaag 120 cagtttacag ttatatatat ttaatgagag acaatatcta gttctatggt agaaatgact 180 cttagactca aataagtaac acagatagaa agtactcagt ataacaatta tatatgtaca 240 ctcaaagtta gtcgctgcta aaatcttaac cttgcatttc gggaaaatct gactaatggc 300 tcttagactc aaataagtaa cacagataga aagtactcag tataacaatt atatatgtac 360 actcaaagtt agtcgctgct aaaatcttaa ccttgcattt cgggaaaatc tgactaatgg 420 ctcttagact caaataagta acacagatag aaagtactca gtataacaat tatatatgta 480 cactcaaagt tagtcgctgc taaaatctta accttgcatt tcgggaaaat ctgactaatg 540 gctcttagac tcaaataagt aacacagata gaaagtactc agtataacaa ttatatatgt 600 acactcaaag ttagtcgctg ctaaaatctt aaccttgcat ttcgggaaaa tctgacaaat 660 ggctcttaga cttgttttga taattaattc ttcaattata ttataattta taacaaattt 720 cctcaaacta tctgtatgtc attcaattcc cggccactac attcaatgat ggaccatttc 780 cttcaaattt tgaaatacct tattcaattt gttgtatgga gcccgtgata ttcaacacta 840 tggagctcct ccagtttggt atccaagaag agataaaagc tagcatgggc tactctcaaa 900 aatgcaggaa agaatactcc acgatcgaat acaaatgtaa gtatccacaa tgtcaagcta 960 aattcaaagt ccggataata gatcaagata aatatgaact actaatccac tctgactctc 1020 atgatcattc tgcacctccg aaatcaacaa aaacaatatc ctcattattt gtcagaaatt 1080 atcttaaaca ttactttgaa aacaatatga atcatgctac agctcaactt aagtgtttcc 1140 aagacttaaa tatttcatat acagacgaag agttactttc aatatttgca caaggtccag 1200 atgcattacg taaatttgct caaagagctg aaattgtttc caaacgcttt agttcggatg 1260 aaaaagtttc tttagctatc tttgtcgaat ctgttcaaga aagttgtcca gaagatttaa 1320 ttgaatttat ctctgagccg ggaaaactag tttttttata tgcacctttc gaagcgaagg 1380 cccttggtca ccaaatttct acgtttcaca tcgattcaac gtataaactt ctaagggcta 1440 acattccatt atatgcactc acaggaaaaa cagaacatac tattgttttt ccttttttgt 1500 atttctttat ttctcctgac acaagcgaaa atattcaata ttgtgtttgt cgttattttc 1560 aatatattaa tagagaaccc gagatgataa gtatggactg cgccccacaa atagccgagg 1620 caattgagcg aggtattcca aactgtcata ttctttggtg tggcgtgcat gtccttcgcg 1680 ccatattaag aaaagcggat aagtttaaat cacgagacaa ttttgaagaa ttttataatc 1740 taatgaaatt attagtattc gatactaaag aagaatctcc cgaggaagaa cagaaccaag 1800 aagaagaaaa tcctagagag gaaccagctc aatctcactt acttcaagca gaagaaattg 1860 aagaagagga acaatatgaa gaagaattca tcgaaaatga atttcaacct tctcaaatct 1920 ttttaccacg agaatcttct gatgatccaa acgattctga atttattcca gaagaagaaa 1980 cgtttgaaac aagtgaagaa gttccagagc acctcactca agaagaagct gcagacctag 2040 atgcagttta tgatcgaatt cttgaaattc tttccactga accgaaagcc caaaagtact 2100 tcgataggca atggaggcat catttagatc gttggaataa gcgctatcgc aacggaggtg 2160 acgcgactaa taacgtcagt gagtcacatt tcaaagttct caagcattct tatttcccag 2220 agagaaggaa tattcgatta gacgattttg ttgttgaatt atacacaacg gttgttccag 2280 cattattgat taaatttaaa gttcaattaa tcaatagtga taaagttgtt cgtatattac 2340 gaaaagctaa tgaccatgaa caaatattgg aaaacaaaaa agtcgaatgt ttgaaaatgc 2400 ttgaagcaat atattcaagt ttgcaacaag gatcaataga ttcaaatctt gtttattgta 2460 ctttaaaggt acttttaaaa aagaaacaat taatgtaaaa gattcaaata attttttatt 2520 attattttat taatttttta aaataaggtt aataaaaaag tgacactgga tagtttgtcc 2580 cacactc 2587 // ID Copia-31_DPu-LTR repbase; DNA; INV; 279 BP. XX AC scaffold_50; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-31_DP_; KW Copia-31_DPu-I; Copia-31_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-279 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_50; Positions 866027 865749. XX SQ Sequence 279 BP; 83 A; 60 C; 46 G; 90 T; 0 other; tgttggaatt attccatgcg caccatctag acatgatgtg gcaccactgt atggtctgtc 60 gtctgctatc ctaaaaccac gcttcttgtc agttgaatag tgaaccagct atcagcaata 120 aacgctagct attttcccct gcactttgtt aagtaatcat tcaacatcac tacacattcc 180 aggtatataa tttattaaat gtgtgaataa ctatgagatg agtgtgccta ttaaagtgat 240 gtgttgcacc cacacttata attatgtcaa aatcctaca 279 // ID Gypsy-29_OD-I repbase; DNA; INV; 6120 BP. XX AC CABV01003676; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_OD_; KW Gypsy-29_OD-LTR; Gypsy-29_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6120 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003676; Positions 6607 12726. XX CC Positions [4325-4804] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 641..5803 FT /product="Gypsy-29_OD-I_1p" FT /translation="MSDNDPSTVYEAGLKISLLVEMNTDLENQITEDPSKE FT TALRPIIAKNNLERDRLNRMMGSLCKLELNRSQSYQQNQTPGSYADYREEA FT HVTQLFSRFMTDTAKLDGSSFAEFTTFLGRLQIFYKAHIDGHTNRTRPFFN FT CIYSHLDISVERQIGEELLRCTTWDEVKSLLEKNFGDRSHFYLSLSEVWDV FT EFDSSKPTAHYAAEVSKKMREAKGAIKAAFKSDKKRDLEADDLFEIVGAML FT LYLQLRSHEPQVYNQLSPSLSNVFSIKELARLAQQNIQNRVDTSVSVNFQK FT KKGGKWNKRKKSEKKDGNDKQYSGDSDKKIMKTSVEDQLQPFEYHQLKTKN FT SSKVYFASTSIMAGSKRFDAMLEIDSGATTSTISADCVPDPILKQLRPCPF FT SISGFDPKSPPQRPLGLLDCKFSFNNGPQIKVELVVTPIGFPNLLGRDILD FT NKAIDGFAIDNNKRTLTFNFDDSFTKTKTQTIQLKGVSKSRVKCFRITHNQ FT RSKATKVTSEETKFTLAASTTPNDHRCPVKRVKEVLNITFPDNSDKSHQVE FT VAKLLLEFRDIFGADGQALGEFPYEAEIRTNGQTRAVPQYSIPQAFHEPIT FT TQVNQMLKEGVVREISDPKGWNSPLIPIIKKDKTLRLCVNFKRTLNLCLSD FT KSDNFHLPSMDTVIGQLGSGNKFYSTCDLRSGYWQLRLKKEDQHKTSFTWG FT NKNYCFNRLPFGYKNSGNIFSRQVNQMLDSSPSRECTHSYIDDVIVAQTTF FT PAYMKALRDFFTALRKYGARLKSTKCTFLESSAKFLGRVITQSGIRPDPDN FT LSALRSMKPPRNVKELSSLIGSLNWVRSFLETRVGEKIATDCFSHRMVPLN FT TVRNSATETGIYKWSQEADSALRELQMRMASPPTISFPNYNETFHFYTDAS FT HFAAGGILLQEYDNRAHLVAAISHTFTRDERKWNVSEKETFAIVWGCERLA FT MMLKGRHFYLHTDHRSLVYLHNKRFKNSKISRWQTRLEEFDFTIIFIKGSD FT NQFADFMSRRPGVDDPTLDLQSDFPVGKEYEVEPNSNFRVFIPFWSENQFP FT DKLTLSRVKIHKVKVSAVAAPSETPIVSTNYGNLLEHQMQDYALSRIIHHL FT ESSSDPLDIPKLLKNNDHRVPQLREHQKSLFVDHVTGALMVQVKSGARAVV FT PEHVRSHFLYLAHDLSAHGGVPRTLERLEDVWWYDMISDVGNYVHSCVQCL FT RRKGATGKRTKPPHGQLFRGKAPGESLSIDFAHMPAVRGYRYYLICVCNFS FT RYVWCVPTKKDDAVACAKSLVDRIFLPFDFVPKHIHSDRGLHFISAVVDNL FT CKLNGIRHSISTAFHPESNSFAERNNRTVKNALFCTSNMKNGRNWVDSLPH FT VMRCLNSMTNKSTKVSPRECWFGRKSTFTHRNGEEMTAESPLIYGLNVKSL FT ARSVHNAVEISMAAADHAMQLRMKNESSPAPIPAGSCVYIKREMLSDPKRK FT NQKWVGPLKLVASNEHICLVEDAKGARDIVHRSHTCHASQRIQDLKYIEEF FT EMDYIFPHLTAIVKSKVPQTTQQRIPPSEQPNTVGSSSQGERSSTPIKTSQ FT VQTNNPDISPVKAAAGDVLSGTVTLDTSKQSTETVGSSKCQSDDAMELGAT FT GGTIWEETLFQSAISEAQPAEFSSAQSCDAANLEITPANVASPCRPTPVKE FT KRFQHQKKRARPEASPNSTVVELESKAPRTSSRAKKAVQPMNIQSSRSKSY FT D" XX SQ Sequence 6120 BP; 1808 A; 1572 C; 1326 G; 1414 T; 0 other; gtggtgtcaa aaagaccctc acgaacaaaa taaagacgtt tttctccgac gagtgtcatc 60 gattcttcag ccaatcagaa ggattgaaag gtacgaatcg acgcgaattc actttaaacc 120 gccgagattg gccgtttaaa gtcagaattt gcaacaaaca atattttttt tcgccgcacg 180 aaaattccgt tggctccgac gtcagcgcca accgaaaacg ccgcgagaga aatttgacgc 240 acccacgcac caagtcgagg aagcgcgccg cgccactcct caacacacac acctccaaca 300 ccgcgcgtgc gccaccgctg ccgccacggc cccgccgcag tacgtgacgc atcgccacgc 360 cacccacgca cacgaaatca ccactcgctc gaaacaagcg cacacatttc gacaaacaaa 420 taaatcacag aaaacagatt tttttacgaa gaatcaacat ctcctggaac tagagaaatc 480 ttagaagaag gagacaactt tacaagaact ccgagaattc ccatacgctc tccacggtac 540 ctcagcgtct aagttagtga cttttaagtt actggcaatt tctgagggcc gtaaaggtaa 600 cgcacaattg tttttctcgt gacttcgcac aactttgagc atgtctgaca acgacccttc 660 cacagtctac gaagctggct tgaagataag tcttctggtc gagatgaaca ccgatttgga 720 aaatcaaata acagaagatc cttccaaaga gacagccctt cgtccgatta tcgccaaaaa 780 caacctcgaa agagacagat taaacaggat gatgggcagc ctatgtaaat tagaacttaa 840 tcgctcccag agttatcagc aaaatcaaac gccaggcagc tacgccgact acagggagga 900 agcgcacgtc acgcagctct tttcacgatt catgacagac acagcgaaac tcgatggaag 960 tagcttcgct gaatttacga cattcttggg ccgcctacaa atattttata aagcacacat 1020 cgacgggcac acgaacagaa caaggccttt cttcaattgc atctacagtc acttggatat 1080 atcagtcgaa agacaaatcg gtgaggaact tttacgatgc acgacctggg acgaggtaaa 1140 atcccttctt gaaaaaaatt tcggcgatag aagccatttc tacctgagct tatcagaagt 1200 ttgggacgtc gaattcgaca gttcaaaacc gactgcacat tacgcagcgg aggtttctaa 1260 aaagatgcgc gaggcgaagg gagcaatcaa agctgctttc aaatccgaca aaaagagaga 1320 cctagaagcg gacgatctct tcgaaattgt aggagctatg ctactgtacc ttcaactacg 1380 atctcacgag ccccaagtct acaatcaatt gtcaccttca ctcagcaatg ttttcagcat 1440 caaagaactc gcacgtttgg cacagcagaa catccagaac agagttgata cttctgtttc 1500 tgtcaacttc caaaagaaga aaggcggaaa atggaacaag cgcaaaaagt ccgagaagaa 1560 agatggcaac gataaacaat attccggtga ctcagataaa aagataatga agacgtcagt 1620 tgaagatcaa cttcaaccct tcgaatatca ccaattgaag acaaagaact ctagcaaagt 1680 ctacttcgct tctacatcca tcatggcggg ctctaaacgt tttgacgcca tgttggaaat 1740 agatagcgga gcaactacgt caacgatttc agccgactgc gttcctgacc cgattcttaa 1800 acaactcaga ccctgtccat tctcaatttc cggattcgac cccaagtcac cgcctcaaag 1860 gccacttggt cttctcgatt gtaagttttc cttcaataat ggaccacaga taaaggtcga 1920 actggtggtt acccctatcg gttttccgaa ccttctgggc agggatattc tcgacaacaa 1980 ggcgatagac ggattcgcca tcgacaacaa caagagaaca ctgacgttta acttcgacga 2040 ctcttttacg aagaccaaaa ctcaaacgat acaacttaaa ggagtatcga aaagcagagt 2100 taagtgtttc cgaatcaccc acaatcagcg atcaaaagct acaaaggtta cttcggaaga 2160 aacaaagttt acgctagctg catcaaccac tcccaacgac caccgatgcc cagtcaagcg 2220 agtaaaagag gtgcttaaca ttacatttcc agacaacagc gacaagtcac accaggtaga 2280 agtggctaag ctacttttgg aattccgcga catttttgga gcggatggtc aagctttggg 2340 agaatttccg tacgaggcgg agattcgaac gaatggacag acacgggcag tgcctcagta 2400 ttctatacct caggcctttc acgagcccat aacgacccag gtcaaccaga tgctcaaaga 2460 aggagtagtg cgggagatat ccgaccctaa aggatggaat tctccgctta tcccaataat 2520 caaaaaggat aagacattac ggctctgcgt taatttcaag cgtacattaa atttgtgcct 2580 tagcgacaag agcgacaact tccaccttcc ttcgatggac accgttatcg gccaactagg 2640 atccggtaac aaattctata gcacgtgcga cttaaggagt ggttactggc agcttcgatt 2700 gaaaaaagaa gatcagcata aaacttcatt cacttggggt aataagaatt attgcttcaa 2760 tagactaccg ttcggataca agaatagtgg taatatcttc agtcgccagg taaatcaaat 2820 gctcgattca agtccgtcca gagaatgtac acacagctac atcgatgatg taatcgtcgc 2880 gcaaacgacg tttccagcct atatgaaggc attgcgcgat tttttcacag ctctccggaa 2940 gtacggggcg cgtctcaaga gtacaaaatg cacatttctc gaatcatcgg cgaagttcct 3000 cggccgagtc atcacgcagt ctgggatccg gccagatcca gacaatctgt cagcgctgcg 3060 ctcgatgaaa cccccaagga acgtgaagga actttcgtca ctcattggct cacttaactg 3120 ggtcagatca tttttggaaa cccgggtggg ggagaagatc gcgacagatt gcttctctca 3180 tcgcatggtg ccactcaata cagtcaggaa ctcggcgaca gagacaggta tctataaatg 3240 gtcacaagaa gccgattcgg cgctacggga gctacagatg cgaatggcta gccccccgac 3300 aatatcattc cccaactaca acgagacatt tcacttttac actgacgctt cgcactttgc 3360 ggctggtgga attctcctac aggaatacga taatcgggca cacttggtag cggcaatcag 3420 ccacacgttc actagagatg aacgaaaatg gaacgtctcg gagaaggaga cttttgcgat 3480 agtatgggga tgcgagcgtc tagcgatgat gctaaaagga cgccacttct acctgcatac 3540 cgatcaccgc agtttagtct atctccataa caaacgattc aagaactcaa agatttctcg 3600 ttggcaaacg agacttgaag agtttgattt taccattatc ttcataaaag gatctgacaa 3660 ccagttcgcc gactttatgt cgaggcgacc tggcgtcgac gaccccacgc tagatctaca 3720 atcagacttt cccgttggta aagaatacga ggttgagccc aactcgaatt ttcgagtatt 3780 catccccttc tggtcggaaa accagttccc ggataagctt acgctctccc gtgtcaagat 3840 ccacaaggtc aaggtctcag ccgtggctgc accgtcagaa actccaattg tcagcaccaa 3900 ttatgggaat ttactggaac atcagatgca ggactacgct ttgtcccgca ttattcatca 3960 cttggagtca tcatctgacc ccctcgacat cccaaaattg ctcaaaaaca acgatcaccg 4020 cgttccccaa ctcagggagc atcagaagag cctcttcgtg gatcacgtaa caggcgctct 4080 gatggtccaa gtaaaatctg gcgcgcgcgc agtcgtgccc gaacacgtaa gatctcactt 4140 tctttacctc gcgcacgatt tatcggccca cgggggtgtt ccccgcacac tcgagcgatt 4200 agaagatgtc tggtggtatg acatgatctc ggatgtagga aactacgtcc attcttgcgt 4260 ccaatgtctg agacgcaagg gtgcaactgg taaacgaaca aagcccccac atggtcaact 4320 tttcagaggt aaagcccccg gcgaaagtct cagtatcgat tttgcccaca tgcccgctgt 4380 ccgtggttac agatattatt taatatgcgt gtgtaacttc tcgagatacg tatggtgtgt 4440 cccgacaaaa aaagatgacg ccgtagcctg cgctaaatcc ctggtcgacc gtatttttct 4500 ccctttcgac tttgtcccaa aacacattca ctccgaccgc ggtttacatt tcatatcggc 4560 cgtggtggat aatctttgca aattaaacgg cattcgtcac tcaatctcga ccgcttttca 4620 tcccgaaagt aactcctttg ctgagcggaa taaccgtacg gtgaagaacg cacttttctg 4680 cacgtcgaac atgaaaaacg ggcgaaattg ggtcgactca ctcccacacg tgatgcgctg 4740 tttaaacagt atgaccaata agtctacaaa agtaagtccc cgagagtgct ggtttggtcg 4800 aaaaagcaca tttacacatc gtaacggcga agaaatgacc gccgagagcc ccttaattta 4860 tggtttgaac gtaaaatctt tggcccgttc cgtccacaat gccgtggaaa tttctatggc 4920 agctgctgat cacgctatgc agcttaggat gaaaaatgag tcatcgccag ccccgattcc 4980 agccggctca tgtgtttaca tcaagagaga aatgctctca gacccgaaac gtaaaaacca 5040 gaaatgggta ggccccctta aactcgttgc ttcgaacgag catatttgcc tcgtcgaaga 5100 tgccaaaggc gcccgagaca tcgtccaccg atcacataca tgtcacgcgt cacaacgaat 5160 tcaagatttg aaatacatcg aggagttcga gatggactat atttttcccc atctcaccgc 5220 tattgtcaag tcaaaagtgc cacagacgac tcagcagcga attccgcctt cggagcagcc 5280 aaatacagta ggttcttcgt ctcaggggga gaggagttca acgcccatca aaacgtctca 5340 ggtacagaca aataacccag atatttcgcc cgtgaaagcc gctgctggtg atgtacttag 5400 tggtacagtc actttggata cgtctaagca gtctacagaa accgttggct catcaaaatg 5460 tcaatctgat gacgcaatgg agctcggagc cacaggcgga acaatttggg aagaaacact 5520 cttccaatcg gcgatttcag aagctcagcc cgccgaattt tcaagtgcac agtcgtgtga 5580 tgcggcaaat ttagaaataa cgccagcaaa tgtcgcatca ccgtgtagac ccactccagt 5640 gaaagaaaaa cgtttccagc atcaaaaaaa gcgcgctcga cccgaagctt ctccaaatag 5700 tacagttgtc gaactagaat cgaaggctcc gcggacgtca tctcgagcga aaaaggccgt 5760 ccagccgatg aacatacaaa gctcacgcag caagagctac gactaaggcg taaaatgctg 5820 ccccttcagg tccttccatc gaccggggcc tgaaaaggtt agattttcat caatcagttc 5880 cgtctccgac aatatttgag caactcaacg atcctgccga tttaccgctc gccgacgcta 5940 gaagagctac tccacaaatt ttggtctagg ttcatgcttt ttccctcaat cgggtttttt 6000 aatgcagttc cgaacccaag atgtagcgtt ctactttgat cttgttcgtt tatgtgtctc 6060 agaaatcttt cctgcaactt ttcttatcat atgctcaagt ctcatgggga gtaaagggtt 6120 // ID Gypsy-52_CQ-LTR repbase; DNA; INV; 124 BP. XX AC AAWU01016480; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-52_CQ_; KW Gypsy-52_CQ-I; Gypsy-52_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-124 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 484-484 (2011). XX DR GenBank; AAWU01016480; Positions 1713 1590. XX SQ Sequence 124 BP; 38 A; 31 C; 18 G; 37 T; 0 other; tgttgtgatt actggcaaca ctgacactcg aatgtaacca acctaccttt acattacatg 60 taccttggat tactcaataa aagtcacttg aagtagacgt cacgcacttg ttttactccc 120 aaca 124 // ID Gypsy-627_AAe-LTR repbase; DNA; INV; 134 BP. XX AC . XX DT 27-MAR-2011 (Rel. 16.04, Created) DT 27-MAR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-627_AAe-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-134 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(4), 1427-1427 (2011). XX DR [1] (Consensus) XX CC Solo LTR. >88% identical to consensus. XX SQ Sequence 134 BP; 47 A; 20 C; 27 G; 40 T; 0 other; tggccgaaat gtagtaataa atgagtaggt agctatcacg gtaggcattg acttaggcta 60 gacattgaat gaatatgtaa aatgactgac attcaataaa ctatcaccaa actgattaga 120 tgttgtttcc ttca 134 // ID I_Ele4E_AAe repbase; DNA; INV; 5683 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele4E_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5683 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1374-1374 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >99% CC identity. CC The consensus is ~92% identical to I_Ele4F_AAe. XX FH Key Location/Qualifiers FT CDS 367..1593 FT /product="I_Ele4E_AAe_1p" FT /translation="METDGDTSSVVDKDSEKSSSTIKSIRIKTYPPTYLGP FT FVVFFRKKEKPINVLLISSEIYKLYKSVKEIKKISLDKLRVVFGSREDANA FT LLESKLFFNSYRVYAPCDSCEINGIIYDESLDCEDILKYGSGIFKNKAISP FT VRILECVRLSKLLFSDKGSSYIHSNCIKLTFSGSVLPDYVDIDNVKFRVRL FT FYPKIMHCDRCLLFGHTSHFCSNKPKCLKCGGVHSPSECKKQSDSCIYCGK FT KHDFLKECSVYIAHQKQFNLKIRNKNKLSYSEVIKTSDVFSTKNIFEPLSQ FT INDIENSNGEPNNFVYKPPIKRKRINQSNNHNHNLNPQPSTSYDQNFPPIK FT CSNTTQNIPGFQKTIPGFSENYNDNFNNVNNKSQNNDHDSDEGSILNILED FT IVDFLGLNDFWKKNH" FT CDS 1678..5370 FT /product="I_Ele4E_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MAPLKNTNLNILQWNCRSIIPKIDRLKALIANLEIDI FT FCLNETWLVDTKLFRIPSFNIIRKDRNIAYGGVMIGIRENIEFKFLNFTFD FT SPIEYVAIFVKHNGLEFSIVCLYIPPQVKFSLQNLKTILNNVPSPFYILGD FT LNAHNLAWGSDITDGRGELVMNLIDELNLNILNDGSFTRVAVPPAHHSCID FT LTLCSNSLSIKSSWTIINDPNGSDHLPIKISIHNPSCDQFQQEPFVPDLTK FT NVDWHKFSDLVSSALINFDNSLSPLQNYNSFSKILIQCLHKSQSKNIFVGP FT SKKRTPCFWWDHDCTLALKNKSDAFKKFRKSGSRDHYIFYCKTEAQFTRVT FT KFKKRNYWKTFIENLDSETSLTKLWSVARNLRNYNIPSASISEYSEDWIDQ FT FASKICPDFVPTPITFKSHQCYNYYPDLCNKFSIEEMELALTITNNTAPGI FT DNIKFIVLKNLPIDGKLHLLSLYNSFLFQNIFPLEWRSIKVISLLKPDKNP FT SLVDSRRPISLLSCLRKLMERMILNRLELWAEKNNIFSSSQYGFRKGRGTR FT DCIALLASHIELSFNKKQDVVSTFLDVSGAYDSVLIDLLFNKMNDCKIPII FT ISNFLCNLFSFKIMHFYHNGSSRMVRYSYFGLPQGSCLSPFLYNLFTRDII FT SIIPNGCYFIQFADDNVIFINGKNREIIRHYMQISLDNIHTWAHNNGFTFS FT VQKTKFILFSRKHSPVSIDLFLNNHQIEQVFDYKYLGIWFDSKLKWNNHIQ FT YVQKICSKRVNFLRMITGTWWGAHPNDLITLYKTTIRSVMEYGCFSFGSAV FT QTHFSKLEKIQLRCLRISLKLMNSTHTKSVEVLAGIIPLKNRILELNCKFL FT IHCFSINHPLIDILKSLFEINPSNRMLNSFIYCSSENIIPNYSPGFHDYNI FT DIHSFRPNIDFSLYEELKQFPCHVQSYYANMIFKRKFIGVENDQIFFTDGS FT LIENVAGFGVYNFHLAHFYKLDSPCSIFTAELTALYFTCSLIKNCAPNIFV FT VCSDSFSCLQAFNSNKFHFKTHHIVLSIKGLLHNLYSKGYLIKFVWVPAHC FT NIYGNEQADLLAKLGVSRGIIYNRGINYFEYFSNLKKSTINDWQISWNTSD FT QGRYCYSICPKVKIYPWFKYFSVGRNFICTFSRLMSNHYICNCYLYRMNII FT DSNICECNKTYEDIDHIVFKCSRFNVPREKFFDRISSLSNDIPVSVRDILG FT NYNLPILKILYQYLCEISYHV" XX SQ Sequence 5683 BP; 1828 A; 844 C; 853 G; 2158 T; 0 other; catcttcggt aagtaggcct tgaccgggtt acaggtcttt tttctgcgct tgtcattttt 60 tttttcgttt caggtgattt tctacccgaa gtattgttat ttgaagataa ttgcctttgc 120 ttgctgattg ctggttggat ttgcgtaaag agaggaataa gtgtactgca agcttcaaat 180 cacaagactc ctggttgttg ctggtgatac aagccgattg acgtttttga gtgctgttgc 240 tggtgtcact gaagtttatt ggcgtttgtt aagtatttgt tttcgttttt gattagatat 300 ttttttttat tttattattc atttggtgtt tgcttagttt aaattcgtcc ccgtcatttt 360 tccataatgg aaactgacgg ggatacatct tctgtagtag ataaagattc tgaaaaatct 420 tcatctacga taaaatctat tcgtatcaaa acatatcctc ctacatatct tggccccttt 480 gttgtgtttt tccgtaaaaa agaaaaacca atcaacgttc ttttaatttc atcagaaatt 540 tataaattat ataaatctgt taaagaaatc aaaaagattt ctcttgacaa attacgtgtg 600 gtttttggat ctcgagaaga cgctaatgcg ttattagagt ccaaattgtt ttttaattca 660 tatagagtct atgctccatg cgactcatgt gaaataaatg gtataattta tgacgagtct 720 ttagattgtg aggatatttt aaaatatggt tctggtattt ttaaaaataa agctatttcc 780 ccagttagaa ttttagaatg tgttcgatta tcgaaattac ttttttccga taaaggatcc 840 tcgtacattc attctaattg tatcaaatta acattttcag gatctgtcct tcctgattat 900 gttgatattg ataacgttaa atttcgtgtt aggctctttt atccaaaaat catgcattgc 960 gaccgttgcc ttttatttgg ccatacgtcg catttttgct ccaataaacc aaaatgctta 1020 aaatgtggtg gagtacattc tccatctgaa tgtaagaagc aatctgatag ttgcatttat 1080 tgtggtaaaa agcatgattt tttgaaagaa tgttcagttt acatagccca tcagaaacaa 1140 tttaatttga aaattagaaa taaaaataaa ttatcctatt cggaagttat taaaacttct 1200 gatgtgtttt ctactaaaaa tatttttgaa cctttgtctc aaatcaatga cattgagaat 1260 tcaaatggag agcctaacaa ttttgtttat aaaccaccta ttaaaagaaa aagaataaat 1320 caatctaata accataacca taatttaaat cctcaaccat caacttctta tgatcagaat 1380 ttccctccta ttaaatgttc aaacacaact caaaatattc ctggttttca gaaaactatt 1440 cctggttttt ctgaaaacta caatgacaat tttaacaatg tcaataataa atctcaaaat 1500 aatgaccatg atagtgatga gggtagtatt ttgaatattt tagaagatat tgtggatttt 1560 ttgggactca atgatttttg gaaaaaaaat cattaaaaaa tgtttaccct ttttagcgaa 1620 aattcttgaa aaattgaatt catttggacc cctcattagt tctttatttt gttcttaatg 1680 gctccattaa aaaatacgaa cttgaatatt ttacaatgga attgtcgtag tataattcca 1740 aaaattgata gacttaaagc tttaatagca aatttagaaa ttgacatatt ttgtttaaat 1800 gaaacatggt tagttgatac taaacttttt cgaattcctt ctttcaatat aattcgaaaa 1860 gatcgtaaca tagcttacgg gggtgttatg attgggattc gtgaaaacat tgaatttaaa 1920 tttctgaatt tcacatttga ttcgccaatt gaatatgtgg ctatttttgt caaacataac 1980 ggtttggaat tttcaattgt atgtttatat attccacccc aagtaaaatt tagcttacaa 2040 aatttgaaaa caattttgaa taatgttcct tcaccatttt atatacttgg tgatctaaat 2100 gctcataatt tagcttgggg tagtgacata actgatggta gaggtgaatt agttatgaat 2160 ttaattgatg aattaaattt aaatattctg aatgatggat ctttcactag agttgcggtc 2220 ccgcctgctc atcattcttg tatagattta acactttgtt caaatagttt gtccataaaa 2280 tcttcttgga ccatcataaa tgatccaaat ggtagtgatc atttacctat aaaaatatca 2340 attcataatc cttcgtgtga ccaatttcag caagaacctt ttgttccgga tttaactaag 2400 aatgtggatt ggcataaatt ttctgacttg gtttcgtctg ccttaatcaa ttttgataat 2460 tcactttcac ctcttcaaaa ttataacagt ttttcaaaaa ttttaattca atgtttacat 2520 aaatctcaga gcaaaaacat atttgtaggt ccttctaaaa aaagaactcc ttgtttttgg 2580 tgggatcatg attgtacttt agcacttaaa aataaatcag atgcattcaa aaaatttcgg 2640 aaatcagggt ctagagatca ctatattttc tattgcaaaa ctgaagctca gtttactcga 2700 gttacaaaat ttaagaaaag aaattattgg aaaactttta tagaaaatct tgattcagaa 2760 acatcattaa caaaattatg gtcagttgct agaaatttaa ggaattataa tattccttct 2820 gcatcaattt cagaatattc agaagattgg attgaccaat ttgcttctaa aatttgtcca 2880 gatttcgttc ctacccctat cacattcaaa agtcatcaat gttacaatta ttatcctgat 2940 ctttgtaata aattttcaat tgaggaaatg gaattggcat taactattac taacaatact 3000 gctccaggca ttgataatat taaatttatc gtcttaaaaa atttacctat cgatggtaaa 3060 ttacatttac tttcattata taattcattt ttgtttcaga atatttttcc tttagaatgg 3120 cgttctatta aagtaattag tttacttaaa cctgataaaa atccttcatt agtagatagt 3180 agaagaccca ttagtttatt atcgtgcctt cgtaaactaa tggaaagaat gattttaaat 3240 cgacttgaat tgtgggctga gaaaaataat attttttcat cctctcaata cggatttaga 3300 aaaggccgtg gaactagaga ttgtattgcc ttacttgctt cacatattga attatcgttc 3360 aacaaaaaac aagatgtagt ttcaactttt cttgatgttt ctggtgcata tgattctgta 3420 ttgattgatt tattgtttaa taaaatgaac gattgtaaaa ttcccatcat catttccaat 3480 tttctgtgta atttattctc cttcaaaata atgcattttt accataatgg atcatcaaga 3540 atggtccgtt atagttattt cggtttacca cagggctctt gcttgagccc atttttatat 3600 aatctattca ccagagacat catttccatc attcctaatg gatgttattt tattcaattt 3660 gccgatgata atgtaatttt tatcaatggc aagaatagag aaattattcg tcactatatg 3720 caaatttctt tagataatat tcacacctgg gcacataata atggtttcac attttcagtt 3780 caaaaaacaa aattcatatt attttctcgc aaacattctc cagttagtat tgatttattt 3840 ctcaataatc atcaaattga acaagttttt gattataaat atcttggtat atggtttgat 3900 tcgaaattga agtggaataa tcatattcaa tatgtccaaa aaatttgctc caaaagagtt 3960 aattttcttc gaatgattac tggaacatgg tggggtgccc atcccaatga tttgattaca 4020 ctttataaaa caactattcg ttcagtaatg gaatatggtt gtttttcttt tggaagtgct 4080 gttcaaacac atttttccaa acttgagaaa attcaactgc gttgcttaag aattagttta 4140 aaattaatga attctactca taccaaatcg gttgaagtac ttgctggtat tattccactc 4200 aaaaaccgca ttcttgaatt gaactgcaaa tttttaatac actgtttttc aattaatcat 4260 ccactaattg atatattaaa atccttattt gaaataaatc ctagtaacag aatgttgaat 4320 tcgtttattt attgttcttc agaaaacatt attccaaatt attcgcctgg ttttcatgat 4380 tataacatag atattcattc ctttcgtcct aacattgatt tttctttata cgaagaattg 4440 aaacaatttc cttgtcacgt gcagtcctat tatgctaata tgatatttaa acgaaaattc 4500 attggggtgg agaatgatca aatatttttt acagatggtt cgttgattga aaatgtggca 4560 ggctttggag tgtacaattt tcatttggcc catttttata aattagattc tccttgctcc 4620 atttttacag ctgaactaac tgctttatat tttacatgca gtttaattaa aaactgtgct 4680 cctaatatat ttgtggtgtg ctcagatagt tttagttgtt tgcaagcttt caattccaac 4740 aaatttcatt ttaaaaccca tcatattgtc ttgtcaataa aagggttatt gcacaattta 4800 tattctaaag gatatttgat taaatttgtt tgggttccag ctcattgcaa tatttatggc 4860 aatgaacaag cagatttatt agcaaaattg ggtgtttcac gtggtataat atataatcgt 4920 ggtataaatt attttgaata tttttctaat ttaaaaaaat ctactataaa tgattggcaa 4980 atttcttgga atacaagtga ccaagggcga tattgttatt ccatttgtcc aaaggtaaag 5040 atatatcctt ggttcaaata cttttcggtt ggacgtaatt ttatatgtac cttctctaga 5100 ttaatgtcca atcattacat ttgcaattgc tatttatacc gtatgaatat tatagattct 5160 aatatttgtg aatgcaataa aacatatgaa gatatagatc atattgtatt taaatgttct 5220 cgattcaatg tgcccagaga gaaatttttt gacagaatca gtagtttgag taatgatatt 5280 cctgtatctg tccgtgatat attgggaaat tataacctac caatattgaa aattttgtat 5340 caatatttgt gtgaaatttc ttatcatgtt tgatacttgc tgccttgttt tctatttcat 5400 tttcaggtac tccaagtttg gcctccattc gtgtcgttga ctggcatcgt gaatacgctt 5460 gggaggacct tcatgatgat ttatccagat aattcggctc tgtgatggat ccattccgaa 5520 tgagccttta tttttaattt tcattttata acgttttatt agaaaagata aagaggtttt 5580 gtgccttttt gagaacgatt tcgaaaagga aatcactcaa agaggctttt ccctctttca 5640 aaattattga gttaataaat aataataata ataataataa taa 5683 // ID BEL-59_CQ-I repbase; DNA; INV; 2313 BP. XX AC AAWU01004145; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-59_CQ_; KW BEL-59_CQ-LTR; BEL-59_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 271-271 (2011). XX DR GenBank; AAWU01004145; Positions 766 3078. XX CC 'CTTAG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 373..2313 FT /product="BEL-59_CQ-I_1p" FT /translation="MGPKIPRTPLKKEGEEEEDLENLVFLREEQTEVIDRL FT KATIDATLVEERTEAAAKVHRWRLDTCFAEFAAIKERIYRADPKKRNDHKK FT VVVEFEALFDQLAMTLGGWNAGAAPAASSALVERHDRPIVIQQPLPRVFPS FT FDGKYENWEKFKVMFVDVVDKTNESARMKLYHFEKALIGDAAGFIDAKAIQ FT DGNYAHAWKQLTDHYEDKRRMVDLHIGGLLSVKKLACEGHLELRALVDSVV FT GNVENLKYLGQEFTGVSEQIVVYLLGHALDDDTRKVWESTVAKGAFPKYDE FT MIKTLKDRISVLERCDTTNDESPRQQHQAKSTTTNQPYQISNTAVTSSPRI FT WCDFCNGRHLTFKCTAFSGLTVRQRMQEVKEKRVCFNCLRAGHSVKRCIRT FT SSCGKCQRRHHTLLHDFYQKSTTLQRPVPAVRQTQEPLSPAEVICNPTPML FT QTAIVNLRDDNNRPVPCRILIDSGSQVNFISNSMATRLNLRRVAVNVPICG FT IGGATFNAKETITVKLQSQHSDFSADVECLIVPKVSGKIPSLPVNSTDWPI FT PKGLQLADLNFHIPGSIDMLVGASWFFRLLKGGHIQLWDNFPELRETHLGW FT VVVGGAEGAISGQQYTHTATLDTRAAMFLKVPDRPEEVSAGSSPSPGGG" XX SQ Sequence 2313 BP; 591 A; 649 C; 647 G; 426 T; 0 other; ttggtccttc aacccggatt ggaccccggc gtggacgctg gaaagaagac tggtgaagtg 60 aggaggcaaa ccggcctcag cgcgaacagt taccgagtgt ctcgggaagc aaaccggctt 120 ccgcgtgaaa acaaaagaac attcgcgggg tgcaaaccgg cttccgccgc gagtgttagt 180 gcaaggcaaa ccggccttgg cacacgaacc gttccgtgga accatcgggg gaggcaaacc 240 ggcctcagac cgactccgga agaacaaaaa gtgccaccca gcgtggaaaa aagtgaaaca 300 aacacgaaag aacaaagtga aagtaccacg tgctgttcct aacctcaaaa gtgtgcaaaa 360 gtgacaaaaa aaatgggccc caaaatacca cgcacgccgc tcaaaaagga aggcgaggag 420 gaagaagacc tggagaacct ggttttcctc cgggaagagc agacggaggt gattgatcgg 480 ctgaaggcaa cgatcgacgc gactctcgtc gaggaacgca ctgaggccgc tgcgaaggtt 540 caccgatgga ggttggacac gtgcttcgcc gagttcgccg ccatcaagga acgcatctac 600 agggctgatc cgaagaagcg gaacgaccac aagaaggtgg tcgtggagtt cgaggccttg 660 ttcgaccaac tggccatgac gctcggaggt tggaacgccg gtgccgctcc agctgcatcg 720 tccgccctcg ttgaacgaca cgaccggccc atcgtgatcc agcaacctct tccacgtgtt 780 ttcccatcct tcgatgggaa gtacgagaac tgggagaagt tcaaggtgat gttcgtcgac 840 gtcgttgaca agacgaacga gtcggcgcgc atgaagctgt accacttcga gaaggctttg 900 atcggcgacg cggccggctt tatcgacgcc aaggccatcc aggatgggaa ctacgcccat 960 gcctggaagc agctgactga tcactacgag gacaagcgcc ggatggtaga ccttcacatc 1020 ggaggtctac tgagtgtaaa gaagctggcc tgcgagggcc acctggaact tcgggctttg 1080 gtggattccg tcgttgggaa cgtcgagaac ctcaagtacc tcggccaaga gttcaccggg 1140 gtgtccgaac aaatcgtcgt ctacctcctg ggccatgctc ttgacgacga cacccggaag 1200 gtttgggagt ctacggtcgc aaaaggtgcc ttcccgaagt acgacgagat gatcaaaacc 1260 ctgaaagatc gcatctcggt tctggaacga tgcgacacca ccaacgacga atcaccaagg 1320 caacaacatc aagccaagtc gaccactacc aatcaacctt accagatctc caacacagct 1380 gtcacgtcgt ctccgagaat ctggtgcgat ttctgcaacg gacggcacct gaccttcaag 1440 tgcactgcct tcagcggcct cacggtgcgc cagcgcatgc aggaggtcaa agaaaagcgt 1500 gtctgcttta actgcctgcg cgcagggcac agcgtgaaga ggtgcatacg gacgagctcg 1560 tgtggtaagt gccaacgccg acaccacact cttctacacg acttctacca gaagtcaacc 1620 acactgcaga gaccagttcc tgctgtgcgc caaacgcaag aaccgctttc gccagccgaa 1680 gtgatctgca atcccacccc catgctgcaa acagcgatcg tcaatctgcg cgacgacaac 1740 aaccgacctg ttccgtgtcg tatcttgatt gacagcggat cgcaggtgaa ttttatttcc 1800 aactcgatgg caacccgact taacctacga agagtagcgg taaacgttcc gatctgcggg 1860 atcggaggag caacgttcaa cgccaaggaa acgatcaccg taaagcttca gtctcagcac 1920 agcgatttct ccgctgatgt agagtgtttg attgttccaa aggtgagtgg gaagattcca 1980 tcgttgccgg tcaactcgac cgattggcca attcccaagg gactgcagct ggctgatttg 2040 aacttccaca ttccgggcag catagacatg ctcgtcggtg cctcctggtt cttccgcttg 2100 ctgaaaggag gacacattca gctctgggac aactttccgg agctacgcga gactcacctg 2160 ggctgggtag tcgttggagg tgcagaaggt gccatctctg gacagcagta cactcacaca 2220 gctacgctcg acacccgagc agccatgttc ttgaaagtac ccgatcgacc ggaagaagtt 2280 tcggctgggt cgtcccccag cccgggggga gga 2313 // ID BEL-122_AA-LTR repbase; DNA; INV; 371 BP. XX AC AAGE02028819; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-122_AA_; KW BEL-122_AA-I; BEL-122_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-371 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028819; Positions 9457 9087. XX SQ Sequence 371 BP; 111 A; 95 C; 64 G; 101 T; 0 other; tgttagggca caggccgtta tatttgtaat taccctaatg tatggaactc agacactcaa 60 tctcaatcaa cttcatatca tataaagcgg agaagagtaa cgtagatttt aagctctttc 120 tgtctaacac ttaactctca ctaccgatca ccagtatcgg cagttagaat acctatcagt 180 tgaataaaac cgttcaaaca gtgtacatgt gatcctctat cgctcgcaag aatcgtccga 240 gtaccctcag ttcagtgtaa acctcctacc actattaatt cgtcgcgatt tcgttttcgg 300 tgctaaaacc acatcctaag gtcactttac gccgcgagtg aaccacgaat ccggagaacc 360 agctccgaac a 371 // ID DNA2-2_CQ repbase; DNA; INV; 4736 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA2-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4736 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 69-69 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 22 sequences with >98% CC identity. 2-bp TSDs. XX SQ Sequence 4736 BP; 1703 A; 627 C; 674 G; 1732 T; 0 other; cagggccgta gccaggattt ttgttcgggg ggggcttggg tgcaaaaatt caaggagtat 60 aaaaatcagc aaatcattta taccatcact tggtttgtgg tataattatt gagatgaatt 120 gtaaaatttt aatgtaatgt atgcaaaaaa gttttcgaag attttcatat taagttatca 180 atgtatttat tgaagaaact cgttcgtcat tttttttctt tattttaaca gctagtttgt 240 attaaaggag tagagaaagt gttttttttt aacattttct tgaattgttt aattgtaatc 300 caatttcgtg ataaacgtcg ctaaatttaa aatatactta acttgaatca aatttttgaa 360 agcataaatt ttttttaaaa gaaatatcga tttatctaac cataagtatt atgttttctg 420 tattttatac tgagagaaat tttaaaccgt ctggatttac attttcagct gtgtgataat 480 tgtgatggtt tatctaacat ttttcatttt ttctagcaca acataaaact cataactgga 540 atatttgttt ttcttaaaat tttataaaca aacttaaaca atgttttatt ttaattgttc 600 taagctattg aagccatttc atattttgcg aagctcctgg tgctctcaaa aataaataat 660 tttaatgaaa tacaaaaaaa tataatcgtt gaaatttcag cttctgaagt gtctccatgg 720 caacaaaaaa aaattacgtc gaatctcaag tttacatcaa ctgagtttac atgtgattca 780 ttttttccgt gtcgagtaaa ttcacatttt ttttctgtgt aggcctataa taaatatata 840 tttgaagaaa aaaaatcagt gcctgtgttt aattttaatg tgttaaaaaa tttagataaa 900 atgtattaaa ttaattttta acgccaaaat attcactgga agagttgtta ataatgcata 960 tgtaaccatt tttaaatgca tatgtaaccg ttaaaatttt cctcgattgc atcaaacatt 1020 aaaacgaatc atgtttaact ttaatattcc gactaaacgc cattttcaaa ttcattactc 1080 attgtattct gaaaattggt ccagaatttg agcaaaaata aaattttaaa acatgacaaa 1140 aaaatttaat cagatctcaa ttctaaagat aattttttaa taaaatttta atctatatac 1200 atgtttaata atttaggatt gtaaccattt attaaaatcg atgtttgttc aagcaatatc 1260 ttgactctag gaacaaaatc ctgcacattt tatggtttag aaattttata tttacgaaac 1320 gttgctaaat taaaaataaa gaaaacttta tgtaagaatt cataaattga ttagttttat 1380 cctgtaaatc taatcttagt ctgcacttga attcaaacat accaatattc gaagcatcaa 1440 aaaaataaga ttattaaaac ataatttatt gaaatgattg aaaattattt tttgaatatc 1500 tttatttaaa aatgataaaa aatgtttttt ttttataatt taaggatttt tttagttgta 1560 attattttat atttttttat gtatttgatt tttttcattt ttggaatttc tgattttttt 1620 gaaattatca tgtgtaatta ttatcttaaa tttttgtttt aaaagtaaag aatactttat 1680 gtaagaattc ttaaatcaat tacatttttc ctgtaaatca aatcttagtc tgaattttgc 1740 tacaagaact atattttact ttcaaacgaa aattttaaaa aatgtcctgt agtacggaga 1800 ctcttaaaac tccatacaaa aaaatctcaa ggaacggtca agaaaaggtt tacgaaaaga 1860 cttctcttaa gcggtacgca gctgaaaagg aacagaaaca aaaagcgtcg tttgatattt 1920 cttcgggagc aatatgtgtt gatgagattt tgatttgttt atttggatgc gactgaaaat 1980 aacgtttgaa aataatgtat tcgcttaaat aatattgaag aattgcctca tgaagctgac 2040 tatcaaaacg ctgaatctta aaatgctgaa ataagggaaa agccattaca aaaatcactt 2100 aattttctgg taaatataaa ttaatttagg aatactagaa ttcaaacata aaaatattta 2160 aagcatccaa aaattaagat tgaaaaaaca taatttattg gattttgaat aacttaattt 2220 taaaatgata ttttttttag tttaggattt cttaagtatc aattatttta ttaatttttc 2280 agttttagat tttttaattt ttggaatttc agattttttg atattattct gttgaattgt 2340 tatcttaaat tttttatgtt tttattattt ttatttttga tttctggcat ttctttgtct 2400 tataccaaaa aaatttaaat caaaaattat gaaaatttag aaatttgaaa gtgtaaaatt 2460 tttaaatctt caattagccg ttgcgaatat tattctgaag tttatgtcac taaactctga 2520 ccaaagttga gagggttaca aatcaaaact acaagctatg gttttattat tttaaaagtg 2580 ttcaaaacac actttaaatc agtacagttg ttttgcatca ttagttttca aaatatctag 2640 ctgatggcga aatccttttt tcgtgagaaa aaactttccg cggttccgcg acatgttaca 2700 aaaattctaa atatttttaa acgagcccaa acatgtcgta tatgattatc aaagcaggaa 2760 aaacatttca aatttgtttt ccggaccgac ctggtcgctg gtcctaaaaa gacttaatcg 2820 aaataagtta tttccattaa aattgtgaag ctattttttt ttaaatattg tatttgctct 2880 ctgttttttg gtccattttg aaaaggtggc aacataaact ttggaaaata tttacaattg 2940 atattcagaa ataagaaatt ataaaaaaaa taataaagaa attttaaaat tttcaaaatg 3000 caaattctaa tctgtaaaat ccaaaaaata gaaattaaat agtttaaaaa tgaattattg 3060 atcagaaatt atataatttg aaagtgatgt ttaattaatt taatttcaga gtttttgaat 3120 ccaagcttaa atttttttaa cgttttgaat atttttatca ttttatttct tcctttaaaa 3180 atcgcaagct catgcacaaa gtgaaaatta gttgataata agtattcact tagttgatct 3240 tccgattttt gatatattaa ataaaaattg gagattggcc acggtggaaa ctaaaaaatg 3300 tgaccaaaag aaagcatttt ggacgagtgg tttcggaaaa aaggtttacg aaaaagtaat 3360 aaaaaaatac acacagatat ttctcaagca tttttctcta aggtcttaga tatgtaacta 3420 accatattct taaaaccaaa acgataaaat tagaattgta tttctgataa tagtcaaaat 3480 tcactcaaat gtttttcata ttaaatgctt tcgtttttga aaacaaaatc taaatatttt 3540 tgagaaatta tgataaattt tgaaaaatgt agagaaaaaa catgtattaa cagtttaaag 3600 gttttaatgc aaataaagga gggctattaa ttttacaatc caaattcgac tagccgaagc 3660 ctcgattatc caaagtttcg attatccgaa gttcgaagga cttcggataa ccgaatcagg 3720 aacaaatgaa atcggcatgt tttatttgct tgtttttaac atcaaattcg gcatctgttt 3780 ttttaatggt ccaatacacc aagtttttgc tttttgggtg ttttttaata cccctgacgg 3840 tttcaaaagc acccaaaaag caaaaactgg aaatttggtt tattagacct ttaaaaaaaa 3900 caccagaaat atttcttgac ctccattttg accgccacat tgaatccaaa aattctaaat 3960 tattttatgg ttcttcaggg gtcatactta agttctacaa tcaaaaataa gacaacaaga 4020 tctaaggatt tttgctgtcc ctattcagac tgtagtcccg attcgcccca gatgacggta 4080 attatttcta tgtagcccct tagagcagtc cgctcggaaa ttgctatttt cgaaggttaa 4140 taacttttta gcaaaaaacc caaaaaaata aggtgtcttc gggaaagttt ttccaaattt 4200 aaagaagata ttagttctga aaattgataa gaactggata gttattgcgc cttccattgg 4260 tcaaaaactg aaacagttgc ttttctccat acaatttgac gattttgccc aaacttggtg 4320 gtcagtggtc agaaaaagct caaattttgg aacttaggct agtgatgcgt aaatctttga 4380 tttcagggga tagccccaag taagcccaga tttggaccat cctaaccaat gtaatgctaa 4440 aactacaaga taaatgagat gttactgaat ggaacaatgt tcaaatctgc agctcaattt 4500 ttaaatatcc aaattttgga tccataatct acaaaaactg agcccggcaa tggacaggca 4560 aattacttag tttgatcaat caattcttga ttctctatct tacaaattat ttctgggaag 4620 agaacacaca atcattgatt taagattttt ttaaggtagt tcaattttaa aaaaattgga 4680 aaaaatttcg ggggggggct gcagcccgga agcccccccc cctggctacg ggcctg 4736 // ID I_Ele18 repbase; DNA; INV; 6817 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele18. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6817 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6817 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 14 CC sequences with >96% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 404..2092 FT /product="I_Ele18_1p" FT /translation="MSDQDQFPPFDDPGGGPSDDSARNIIHGSYAGRTVPT FT WLDPSDEHGKLTVLKLEGENGPLPSKPFLLRTSVEKWIGGKIEGAFRENRG FT ISYALKVRNPKQVEKLMAMKTLADGTKVKISKHPVLNKSRCVVSCLDVKDA FT SDDELMDCLAPQGVSEFRRIKRRTGKEQYENTASIILTIEGTVVPNHIDFG FT WLRCKTRPYYPAPMVCYQCWSFGHTKKRCQQSQPTCGNCSQGHVTDETNRC FT NTDPFCKRCNSASHSLSSRKCPTYVEENAIQRIRVDMGVTYHAAKQIFEQS FT NSSRSFAAVTSASKDHTIADLSSKVDRLLQETAAKDKRIAALESASIGNTT FT ENVGKAKLDQLIQKVNQLTKEIETKDQRILALEKAQHIGSRLDLVRQHGTI FT EDLIEEVAALKGDLAKKERENLLLRELYGKKVNQNKRSTQNLNSLSPKSSD FT SLFTQKSVQSLPSPDYEPIQSSNVSNDEVRKWLQTTAANNPVTDDRKYKNK FT SKKPTASNENEPICTPSVAPVDKTSKRIHSSTDSDDSIGRSKTKINVKCPD FT AIFVTSSDGDDETMQS" FT CDS 1995..6554 FT /product="I_Ele18_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTRLADLKLKSMSNAPTLYSSPLLMVMTKRCKVNHAY FT DLPALLFTNIELILSISSKLPMTNTHYNRAAHSHKEESTGTPTTTNDTDIN FT THENRTTCEASGSRGPCSAAVFSRPELQDSSRHSNFPTVFETQKGSTAYTP FT QFPLYSSGSRGPVSAEVRPPPEPADTPRHFRSSGRGTDKGHATHSPKDTVG FT AESLATNAQATTGSKNNQQSRRSARIRKRKSKISTPSADNENNLLSRIGLD FT AASLAHDKTAPLPEAACNAAIYDAIHLLRYGIEIPQQSSSSYLPSKRISSV FT EQQALTVSRCTPRRTTTDMRNSNTRFAVQWNINGFTNNLADLEIICKNNPP FT IALALQEVHRTNCQRLDRSLSGKYRWIYKCASNIYHSVAIGILAQLSFTQI FT ELNTDLPIIAVRLNAPFPISIVCFYLPCGNIANLEESLLNVWNEIPEPRIL FT LGDCNGHHQIWGSPKSNPRGMTMVSVAESTGSTILNDGTKTFIRGQIESIV FT DISMVSATIANRFLWSASDDPMGSDHFPITIYLNEQPPATSRRPRWIYDQA FT DWASFQTRMDSMLEANEPSNISDLIDTIHQVASETIPKTSSIPGRKALPWW FT SPEIKKIVKARRKALRRAKRIPDGNPQKEIAFSTYKILRNNCRQSIRDAKH FT QSWTKFLDTINDQQTSAELWQKVNALNGKRSFAGLAINHQGVTTRDPSEIA FT NILAEHFASMSAIGRYDHAFVRINNVTMDSLSSFRIPPESHHEINLPFSLA FT ELNFALQHGKGKSAGPDEVGYPMLKYLSESGRSLLLRLLNNEWTRKTLPAN FT WKHSIVVPIPKNSGYSNEPDSFRPISLTCCISKVLERMVNRRLIRYLEDGN FT YLDHRQHAFRPCHGTDTYFASLGQVLDDALNDNNHIEIATLDLSKAYNRAW FT TPSVLRQLANWGLSGNILHFIKNFLTGRTFQVLIGNHRSNVTGEETGVPQG FT SVIAVTLFLVAMNSIFKSLPKGIFIFVYADDIVLIAVGKHPVALRRKLQAA FT VNAVNKWAHSSGFQLSAEKSARSHICTFNHRPPKKAITIHNTPIPFKKSLR FT ILGVRVDRRLTFNDHCSEVKRNCRTRLNLLKILSRPHTTNNRNTRLKISRA FT TIESKLFYGSELTCRASDTLNRTLAPVYNNAIRIVSGLLPSTPAIAACVEA FT GMPPFNYALVTAVCRKAVSFLEKTCKSQSEVFILKEANRLLISVANQRLPP FT VAVIHWTGARNWRAPLPNVDLSIKNNFKAGDSSTILRSTFAHLLATKYAYH FT TQRYTDGSKARGAVGIGILSGNQNFYHRLPNMCSIFSAEAAAIYLAASLPS FT DDPIAIFTDSASVISALNSQSSRHPWIQAIQSILNINNNITFIWIPGHCGI FT RGNESADQLAAIGRSNRNLYNREVPGSDVKKWVANTVRLAWSQEWHNRRNP FT FIRKIKGETHKWEDVPKLKDQQILSRLRTGHTRLSHNMGGDTEFRRRCDPC FT GVHNSVEHFICECPIFNYPRELYNIGSIRDALKNDPSSESATICFLKDAEL FT YNNI" XX SQ Sequence 6817 BP; 2066 A; 1724 C; 1396 G; 1630 T; 1 other; cagtcgacag ctcggtcttg tgccgaatag acgtcttttg ttatcgccgc gcgcgttatt 60 ctttttgttt cgatttgttg acgtatatca actactaatt tcccaaattt ctccaacatc 120 tcctaactga aataccagtg gtgtataaaa gatcagtatt taccgtgaca agcaactaga 180 tactgtcctc gggggtttac tagcttggat tgtcggccct tcaccggggc ttgtaaacaa 240 gcagatccat ctattgttgc tgtagtggct attgttttgt tgctgcagga gatatcggtt 300 tcgtgtgaac ggtgtagttt tcgtcgcgag tgaacatttt tgcaagtcca acttaggcag 360 ctgtccttgc tatcgttgaa cgctatcttt cggtcctccg actatgtcgg atcaggacca 420 attccctccc tttgatgatc ccggaggagg cccctctgac gattctgcac gaaacattat 480 ccacggttcc tacgcaggga gaaccgtccc tacgtggcta gatcccagtg atgaacatgg 540 aaaactcacg gtactaaagc tggagggtga aaacggtccc ttaccaagca agccctttct 600 cttacgcaca tctgtggaaa agtggattgg tggtaaaatc gaaggcgcgt ttcgtgaaaa 660 ccgtggtatt tcttatgcgc tcaaggttcg aaatccaaaa caagttgaaa aactaatggc 720 aatgaaaaca ttagctgacg gtacaaaggt taaaattagc aagcatcccg ttctcaacaa 780 atccagatgt gttgttagct gcctggacgt caaagacgca agtgacgatg aattgatgga 840 ctgtcttgct ccccaagggg tatccgaatt ccgcaggatc aaacgtagaa ctgggaaaga 900 acaatacgaa aacaccgctt cgattatcct tactattgag ggaaccgtcg tacccaatca 960 tattgatttt ggttggttac gatgtaagac aaggccatat tatccggctc cgatggtgtg 1020 ctaccaatgt tggtcgtttg ggcataccaa gaaacgctgc caacaatcac aacccacctg 1080 cggcaactgc tctcaaggac atgtgaccga cgaaaccaac agatgcaata cggatccgtt 1140 ttgcaagaga tgtaattcag ctagccacag cttatccagc agaaaatgtc caacgtacgt 1200 cgaagaaaat gcgatccaac gaatcagagt cgacatggga gtcacctacc acgcagccaa 1260 acagatcttc gaacaatcca acagctcacg atcgtttgct gccgttactt cggcgagcaa 1320 ggatcataca atcgcggatc tgtcttccaa agttgatcgt cttttgcagg aaacggcagc 1380 caaagataaa cgaattgcgg cacttgaatc tgcatcgatt gggaatacaa cagaaaacgt 1440 gggaaaagcc aaactcgacc aacttatcca gaaggtcaat caactaacga aggagattga 1500 gactaaggat caacgaatcc tggctttgga gaaagcacaa catatcggat ctcgtcttga 1560 tttggtccga caacatggaa caatagaaga tcttatcgaa gaagttgctg cattgaaggg 1620 tgatttagca aaaaaagagc gagaaaatct actacttcgt gagctatatg ggaaaaaagt 1680 gaatcaaaat aagcggagca cgcaaaatct caactccctt tctcctaaat cctcagactc 1740 tctcttcact caaaaatcag ttcaatcgct accgtctccc gattacgaac caatccaatc 1800 ctccaatgtt tccaacgacg aagtacgcaa atggttacaa acaactgcgg caaacaatcc 1860 ggtgaccgac gatagaaaat acaagaataa gagcaaaaaa ccgacagcgt ctaatgagaa 1920 cgaacccatt tgtacaccaa gcgtcgctcc agtagataag acgtcgaaga gaatccattc 1980 gtcaaccgac tcggatgact cgattggcag atctaaaact aaaatcaatg tcaaatgccc 2040 cgacgctata ttcgtcacct cttctgatgg tgatgacgaa acgatgcaaa gttaatcacg 2100 cctacgatct tccagcactc ctcttcacga acatcgaact aatactatcg attagctcta 2160 aattaccaat gactaatact cattacaata gagctgcgca ttcccataaa gaggagtcaa 2220 ctggtacccc caccactacc aacgatactg acatcaacac acacgaaaac agaaccacgt 2280 gtgaggcttc gggtagccgg ggcccctgca gtgcggccgt cttttcccga ccggaactac 2340 aagactcctc tcgacattcc aactttccaa ccgtatttga aacgcaaaag ggttccacgg 2400 cctatactcc acaattcccg ctatactcca gcggtagtcg aggccccgtc agtgcggaag 2460 tccgaccccc accggaaccg gcggacaccc cccgacactt caggagtagc ggaagaggga 2520 cggataaggg tcatgcaacc cattccccta aggacaccgt gggagctgaa tcacttgcaa 2580 cgaacgctca agcaaccacc ggatccaaaa acaaccaaca atcaagacgc tcagcacgta 2640 tcaggaagag gaaatctaaa atcagcactc catcagcgga caatgaaaat aacttgctta 2700 gcaggattgg actggatgct gccagcttgg cacatgacaa aacggcacca ttaccggaag 2760 cggcatgtaa tgctgctatt tacgacgcca ttcatctact tcgctatggc atcgaaatac 2820 cccaacaaag cagctcttca tatctgccaa gtaagagaat ttcatccgtt gagcaacaag 2880 cacttaccgt cagccgttgt acaccacgaa ggactaccac agatatgagg aactcgaaca 2940 cccgcttcgc tgttcaatgg aacataaacg gtttcactaa caaccttgca gatttggaga 3000 tcatatgtaa aaataaccca ccaattgccc tagcgctaca ggaagtccac cgcaccaact 3060 gccagaggtt ggacagatcg ctctcaggca aatatcgctg gatttacaag tgtgcctcaa 3120 acatttatca ttctgtagcc attggaatcc tagcacaact ttctttcact caaattgagc 3180 taaacacaga cctcccaatc attgctgttc gmctcaatgc acctttcccc atttctatag 3240 tatgcttcta tctcccctgt ggaaatatag caaaccttga agaatcactt ctgaatgtct 3300 ggaacgaaat ccccgaacct aggattttgc ttggagactg taacggccat catcagatct 3360 ggggtagtcc aaagtccaat cctcgaggca tgaccatggt atctgtggct gaatctactg 3420 gatctaccat actcaacgac ggaacgaaaa cattcatcag aggccagatc gagtccatcg 3480 tggatatatc catggttagt gctactattg caaaccgttt cttgtggagc gccagtgacg 3540 atccaatggg aagcgatcat tttccaataa ccatctatct caatgagcaa ccaccagcaa 3600 cttcacgtcg gccgcggtgg atttatgatc aggcggactg ggcatccttc caaacacgta 3660 tggacagcat gctcgaggct aacgaaccaa gcaatatatc cgatttaatt gacaccatcc 3720 atcaagtagc ctctgaaaca attcctaaaa ccagttcaat cccaggtcgc aaagccctcc 3780 cctggtggtc accagaaatt aaaaaaatcg ttaaagccag gagaaaggcg cttaggagag 3840 ccaaaagaat cccagatggc aacccacaga aagaaatcgc ctttagcaca tacaaaattt 3900 tacgaaacaa ctgccgacag tctatcagag acgccaaaca tcagagctgg acgaaattcc 3960 tcgatacaat caatgaccaa caaacctccg ccgaattgtg gcagaaagtg aacgctttaa 4020 acggcaaacg cagcttcgca ggtcttgcaa tcaatcatca aggagtcact acgcgtgatc 4080 cctccgagat agctaacatt ctagctgaac attttgccag catgtctgca attgggcgat 4140 acgaccatgc tttcgttcgt atcaataatg ttaccatgga ctccctcagc agctttcgga 4200 tccctcccga atctcatcat gagattaatc tccccttttc attggctgaa ctaaattttg 4260 cacttcaaca tggaaaggga aaatccgccg ggccggacga ggtaggatac cctatgttga 4320 aatatctctc tgaaagcggt cgatctcttc tactccggct gttaaataat gaatggacta 4380 gaaaaaccct gccggccaat tggaaacata gtatagttgt tcctatccca aaaaactcgg 4440 gctactctaa cgaaccagat agctttcggc caatttcttt aacatgttgc atctcgaaag 4500 tgttagaacg aatggtgaac cgccggctga ttcgttacct cgaagatggc aattatctag 4560 atcatcgcca acacgcgttt aggccttgcc acggaacaga tacttatttt gcctcgctag 4620 ggcaggtttt agacgacgcc ctcaacgaca acaaccacat tgagattgca actctcgatc 4680 tgtctaaagc ctataatcgc gcatggactc caagcgttct taggcagctg gcaaattggg 4740 gcctctccgg caacatactc cactttatca aaaacttctt gactggtcga actttccaag 4800 tcctcatagg caatcatcga tcgaacgtta ccggtgaaga gactggagtt cctcaaggct 4860 ccgttatcgc ggtcactcta ttcctagttg ccatgaatag cattttcaaa tcattaccga 4920 aaggaatatt tatttttgta tacgcggacg atattgtcct aatcgctgtt ggaaaacatc 4980 cggtggcact cagaaggaaa ctacaagccg ctgtaaatgc agttaacaaa tgggcacatt 5040 catcaggttt ccagctatca gcggaaaaaa gtgctaggtc tcacatctgc acttttaatc 5100 accgtccacc gaaaaaggcc attactattc acaacacacc gataccgttc aaaaagtcac 5160 tacgaatcct aggcgtccgt gtggatcgtc gacttacctt taacgatcac tgcagtgaag 5220 taaaaaggaa ctgtaggacc aggttgaact tattgaagat tctttcaaga ccacatacga 5280 cgaataaccg caatacgcgt ttaaaaatct ccagggccac tattgagagt aaactttttt 5340 atggctctga gttaacatgc agagcttccg acacacttaa tagaacattg gctcctgtat 5400 acaacaatgc catacggata gtttctggcc tactaccctc aaccccagcg atagcagcgt 5460 gtgtcgaagc tggtatgcca ccatttaatt acgcgctggt caccgcagtg tgtagaaaag 5520 cagtcagctt tctagaaaaa acctgtaaga gccaatctga agtcttcatc ctcaaggagg 5580 ctaacagact tctaatatca gtggccaatc agagactccc tccagttgca gtaatccact 5640 ggacaggagc aagaaattgg cgcgctcctt tacccaatgt cgatctttca atcaaaaaca 5700 actttaaagc tggagatagc tccacgatac tgcgttcaac atttgcacat cttttggcca 5760 caaaatacgc ttaccacaca cagcggtaca ccgatggatc aaaagcacgg ggcgcagtcg 5820 gaatcggtat actcagtggg aaccaaaact tctatcatcg tctacctaac atgtgctcca 5880 tattctctgc tgaggccgca gcaatatacc tagcagcttc cttaccgtcg gatgacccta 5940 ttgctatttt cactgattct gctagtgtca tctcagcact caactcacag tcttctcgtc 6000 acccatggat acaggcaatt caatcgattc tcaacataaa caacaatata accttcatct 6060 ggataccggg gcactgcggt attcggggga atgaatctgc agaccagctg gctgccatag 6120 gtcgttcgaa tagaaatctc tataaccgag aggtcccggg ttccgatgtg aaaaaatggg 6180 tggccaatac agtacggttg gcttggtctc aagagtggca taatagacgt aaccctttca 6240 tacgtaaaat aaaaggagaa acgcacaaat gggaggacgt acccaaactg aaagaccaac 6300 aaatcttatc gagattgcgc actggtcata cgagattatc ccacaatatg ggaggtgata 6360 ctgaattccg tagacgctgc gacccatgcg gggtgcataa ctcggtcgag catttcatat 6420 gtgaatgtcc gatctttaat tatcctagag aactctataa catcggctca atacgggacg 6480 ctttgaaaaa tgacccatct agtgaatccg ccacaatctg ttttttgaag gatgcagaac 6540 tatataacaa catctgagat ctcgaaaacc cgacgacttc aagaaatcat cgagatatcc 6600 agcactatcc tgtccaaccg ataactacgg ccccctcata caaaccatag tatgaatatt 6660 ctgataatga taacaaatat gtgtaatgat attgtaaaac taaattgaaa tgtaatctac 6720 ccaggagggc accttttgcc ctgttttttt ctcaaagagg tgaacccgcc taaggctgaa 6780 agcctctata ataaagaaaa tcaaatcaat caaatca 6817 // ID hAT-42_SM repbase; DNA; INV; 2650 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-42_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2650 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1845-1845 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 608..2467 FT /product="hAT-42_SM_1p" FT /translation="MTQPAKKTKTYHYNTEWEQEFFFIMMKDKCVCLICNS FT NVALPKRANVERHFRTVHKHYEIDFPLDSEVRKDKVKQLKSQLGAQQTFFT FT KFNSKSKTATIASFKVVNILMKHKKPFGDGEIWKQAFVEAGEVLFKDFKNK FT SEILDCINNMSLTRNTSMQRMVDISKNLEDILRQDIQKCRFFSLQFDESTD FT FTDTAQLCIFIRMVFDDMSTREELLTILPMKGQTRGQDIYNLFKSFISRTH FT FPIHKLVSITTDGAPSMIGKHVGFVTLCKNDEEIPSFHSYHCIIHQQTLCS FT KVLNSNYVMNIAFKIVNSIRARSLQRRQFRTLLEQCEIENTELLLHTDVRW FT LSRAAFLKRFRDLLPEIKQFLTGREELYPELEDKSWLINLAFLCDITEHLN FT KLNLQLQGRHKSILDMIIAVKTFKEKLSLFVRQLERGDLKHFKNMEEESKN FT APGILYDKYTNQITTLLNEFNTRFADLSKIANIATFMSFPFNDFIDTEEIA FT TQINEFLDSDSACLEDEIIALKSDIYLKSRATTEDNFWNLISEEKYPILRR FT VVGYMSAFFGSTYLCESAFSSMNGIKTKHRSCLTDDHLASCLRIAISSYDP FT KYEELADKMQCHVSGASSSSK*" XX SQ Sequence 2650 BP; 959 A; 404 C; 439 G; 848 T; 0 other; caggggtggc aaaagcgtcg accgcgtagc tatcgaaagt agcccgcgag cactatggaa 60 aattgcatat aaaaatcagt tttttactta ttgaattttc aactttataa acatatttaa 120 aataagtttc gattcatatg acaaaaaatt ccaataagaa acataatatt agtcagataa 180 cactcagagt gtagtaaatt tttagtttga aattcaaata tatacagttt gttaataatc 240 agaataaatt tcatgtgttt gaacaagtag gccataaaat aaaagcaaaa cattgttgct 300 acaatcttcg ataaggaata aaaaaaataa aaaacaacaa aaataaaata caacaaacag 360 gtgcttgtct atttcaattt tgaaatcaaa ttgttttcgt tagaaaaatt tgaaagcttg 420 aagtattcag tattcaatac tgagtattga ataatacctg cttataatat taatttcaaa 480 aattttttat taaaaaaaat tctttacaaa agttaacatt tttaattttt gagtatgtaa 540 gtgtgtttaa cattaatttt catatttaat ttattttagc tttgattatt ttcttgtttt 600 agccacaatg acacaaccgg ccaaaaaaac taaaacgtac cattataata ctgaatggga 660 acaggaattt tttttcataa tgatgaaaga taagtgtgtt tgtctaattt gtaattcaaa 720 cgtagcacta cctaaaagag ctaacgttga acgacatttt cgaacagtgc acaaacatta 780 tgaaattgac tttccattag attccgaggt acgaaaagac aaagtcaagc agttaaagtc 840 tcaattaggt gcacaacaaa ctttttttac gaaatttaac tcaaaatcaa aaacagctac 900 catagcatca ttcaaggtag ttaatatttt gatgaaacat aaaaaacctt ttggagatgg 960 cgaaatatgg aaacaggcgt ttgttgaagc tggtgaggtg ttattcaaag attttaaaaa 1020 taaatctgaa atattggatt gtatcaacaa tatgtcattg acgcgcaata ctagcatgca 1080 aagaatggtg gacattagca aaaatttgga agatattttg cggcaagata ttcaaaaatg 1140 ccgctttttc tctttgcaat ttgatgaatc tacagatttc actgatactg cgcaactatg 1200 catcttcatt cgaatggttt ttgatgatat gtccaccaga gaagaattat tgactattct 1260 tccaatgaaa ggacaaacaa ggggtcaaga tatttacaat ttattcaagt cttttataag 1320 tcgcacccat ttcccgattc acaagttggt atcaattacc acggatggag ctccgtctat 1380 gattggaaaa catgtcggat ttgtaacact ttgcaaaaat gatgaagaaa ttcccagttt 1440 tcattcgtat cattgtatta tccaccagca aaccttatgc tccaaagtgt tgaactctaa 1500 ctatgtaatg aatattgcgt ttaaaatagt aaattcgata cgtgcaagaa gtctacaacg 1560 acgtcaattt cgaactttgc ttgaacaatg tgaaattgaa aatactgaat tattgttaca 1620 cacggatgta agatggctta gccgtgcagc atttttaaaa cgatttagag atcttttgcc 1680 cgaaataaaa cagtttttga ctggaagaga agaattatat cccgaactgg aggataaaag 1740 ttggctgata aatcttgctt ttctttgcga cattacagag cacctgaata aattaaattt 1800 acagcttcag gggagacaca aatcaattct tgacatgata atagcagtaa agacgtttaa 1860 agaaaaactg tccttgtttg tgcgtcaact agaacggggt gatttgaaac attttaaaaa 1920 catggaagaa gaatccaaaa atgcaccagg aattctttac gataaatata ccaatcaaat 1980 tactaccctt ttaaatgaat ttaacacacg tttcgcagat ttatctaaaa ttgcaaatat 2040 tgctacgttc atgtcattcc cgtttaatga ttttattgat acagaggaga tcgctactca 2100 gattaatgaa tttctggatt cagattcagc ttgcttagaa gatgagataa tagcgttgaa 2160 aagcgacatt tacttaaaat caagagctac aaccgaagat aatttttgga atcttatttc 2220 agaagaaaaa tatccaattt tgcggagagt agttggatat atgtccgcat tttttggatc 2280 cacctatctt tgtgaatcag ccttttcttc aatgaatggc attaaaacaa aacatcgatc 2340 ttgtttgaca gatgatcatt tggcatcatg cttgcgaatc gccataagct cctatgaccc 2400 gaaatacgaa gaacttgctg ataagatgca gtgccacgta tcaggtgcaa gtagtagcag 2460 taaataagat tatattaaga tatgtttaca ttctagtata gtattataag tttttttcta 2520 atttgttttg tatttaaata gaataaatag aataactaca ggtaacaatt gaaaatagtt 2580 tttctggtag accgccagat ttaatttagt taaaagtcgc ccgcaagact tgaaagtttt 2640 gccacccctg 2650 // ID Gypsy-157_AA-LTR repbase; DNA; INV; 195 BP. XX AC AAGE02018588; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-157_AA_; KW Gypsy-157_AA-I; Gypsy-157_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018588; Positions 42466 42272. XX SQ Sequence 195 BP; 55 A; 31 C; 41 G; 68 T; 0 other; tgttgtgtat atctaatggg tcgaacccgt tctcatagca ttatagcaga tcaagtgttc 60 actgtggtag tacacgttct gtatgattat agcaaaggtg tcttgatgag taataaagat 120 gaggtcggcc tcttttggtt taaaccttca agtgaatatc acgtatttta ataagtgctc 180 ttgtacaaca tatca 195 // ID DRE repbase; DNA; INV; 6428 BP. XX AC X57034; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 16-AUG-2009 (Rel. 14.09, Last updated, Version 3) XX DE D.discoideum retrotransposon DRE. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LTR; KW Repetitive sequence; long terminal repeats; retrotransposon dre; KW DRE. XX NM DRE. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-6428 RA Hofmann J., Schumann G., Borschet G., Gosseringer R., Bach M., RA Bertling M.W., Marschalek R. and Dingermann T.; RT "Transfer RNA genes from Dictyostelium discoideum are frequently RT associated with repetitive elements and contain consensus boxes RT in their 5' and 3'-flanking regions."; RL J. Mol. Biol 222, 537-552 (1991). XX RN [2] RP 1-6428 RA Marschalek R., Hofmann J., Schumann G., Gosseringer R. RA and Dingermann T.; RT "Structure of DRE, a retrotransposable element which integrates RT with position specificity upstream of Dictyostelium discoideum RT tRNA genes."; RL Mol. Cell. Biol 12, 229-239 (1992). XX RN [3] RP 1-6428 RA Schumann G., Z??ndorf I., Hofmann J., Marschalek R. RA and Dingermann T.; RT "Internally located and oppositely oriented polymerase II RT promoters direct convergent transcription of a LINE-like RT retroelement, the Dictyostelium repetitive element, from RT Dictyostelium discoideum."; RL Mol Cell Biol 14(5), 3074-3084 (1994). XX DR GenBank; X57034; Positions 1 6428. XX FH Key Location/Qualifiers FT CDS 391..1722 FT /product="DRE_1p" FT /translation="MKITNNTTNIKQHKCLQKISEIVDKTQLKKKQFKYVI FT NINHEPDDKIKKDLEKSLDKKDVLIKSNNTFKILTDSISTVIYFVNMFDAE FT LKGQFITTLRPKNMYTDFDWYKSLTEIEWVIENEGCKLIKSEIKGETLIIK FT TVKRVVEEYDTIIELDNLTLIGHSNKGWRDQTSLPENQEENKNKKQSDEQQ FT PKSTSEQIPKQGKRTIIIPPITSSSNDKTFMALDDTAQKVIDASLENVDEL FT TRIENEINTQSIKNQEIKTKSEKPIEPPSPYNTPIKQHGETKRLTTEEKLR FT EVNDIFDFTSPNTDAIRKIEFPLSIPSNQWFFSSPIKHIMSFSPTYGTPSK FT IIDSPSKLLNTPKNKQQDKTNRTNKTPVKPKLGEYGQKSIKTKMDEDIDNF FT NKEILKQISKSKNPTNTQDQQHNEQSKDQLEDKINQKLQKVDKNLPQIKL" FT CDS 2016..5156 FT /product="DRE_2p" FT /translation="MIAKTQIKCTTIYAPAKSNERHEWYKENLTEEILHSD FT IITGDFNVDCSVDNNLNKYIKTIFDEFEFTEIKNGITFPRNKSTIDRVFVS FT KKILHLNPIVTTKEIKLKSDHNMVIIELKIPEYEQQKKGERLWRQNLETLK FT MNSTSLKINKTIKYYNKKFEENTSKWYKLNICEQWLKLKDEIKKLSINIEI FT RESNKTKNKLKELAEKLETAKDSRAIFLKEEINNILKEQVRIKQANQTNTH FT INNKETPSKYLTRRLKVQRKTNEIPQILDPSNNCLVTKHEDILEVARRYYE FT NLYQKRECNEDTHHELLKTFNKRIEQKILDEINQPIEGYEIRLGIEKIQEG FT KAPGKDGLLPTFYKNHINEILPIISKLYNHFWNTTIPKDFKQGILITIYKN FT KGDPNNLDNYRPITLLNVDYKIYSKIINNRILKLLNKIISPFQTGFVPRRL FT LHDNIITLNSTIEIIKREINTKEDMEPIITFYDFEKAFDSISHNAILRTLA FT HLKLPLKMVLTIMNLLNESETSVYINNSLSKSFTSKRGTKQGDPISPTIFA FT LVVECMATTIINDRCINGVTKETIKILQFADDTATIAYNFMDHFLMNEWIK FT KFCQATSAKINQTKCSCITFKWNTRTLYTVIKSNERYLGFDFNNKGIKSKI FT NTISDNIRAKLVTWNSTSSTYMGRLIMAKTYALSQLTFHTYINTTPQHNSI FT ENNIVKFVFNTKSKNSLSLQRRQNNYINGGLNLWNLKTRELAQKAWLFERY FT LHQRVSNTPSSYIKLWEEELKNNNNNKTTTKQNQLQLHWQCKQAWTQLKTP FT QNKQTHYEHLPKLKKIYEDMMTTQSPEHNKFIPTPGQKEIMTKINSKHLPF FT KEIKKIINMKGRDLLWRYTLKALPKIYNMPCQQCGEDETSEHIFFNCKAHI FT KNTQEIFNYTLTKSGHTTHTWNVKILNHLQIALVANLIAIIFDKIWHKRNK FT LIHDEKEIIIHRQQVIRELIKTQRAAWDRTQAVINKTLRIKSKQRPEEQNK FT LDSLISLKLLQFSRQWNSPLHAIELPKHLKKYNNSLSTFYK" XX SQ Sequence 6428 BP; 3158 A; 1171 C; 736 G; 1363 T; 0 other; agatcgaaaa aaaacaaaca acaacaacaa tctctttttt tgattatttc atttttcata 60 taaaaaagac ttaaaaaaca aataaataaa taaataaact atcttaaaaa gccatctttt 120 tactttattg taatttatta tttgtttcgt tttgagaatt ttgcgtagca aaattttcaa 180 aaaaaagcaa aaaataatca gatcgaaaaa aaacaaacaa caacaacaat ctcttttttt 240 gattatttca tttttcatat aaaaaagact taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 300 aaaaaaaaaa cctgggaacc caagttaatc agaaaatttc cagattttta actttttaaa 360 gaaaaatata aaaaatagaa agaaaataaa atgaaaatca caaacaacac cacaaatatt 420 aagcaacaca aatgcctaca aaaaataagc gaaattgtgg ataaaactca attaaaaaaa 480 aaacaattca aatacgtcat aaacatcaat cacgaaccag acgataaaat aaaaaaagat 540 ttagaaaaaa gtttggacaa aaaagatgtt ttaatcaaaa gtaataatac attcaaaatt 600 ctcacagaca gtattagtac agtaatctat tttgtaaata tgttcgatgc agaattaaaa 660 ggacaattta taaccactct aagaccaaaa aatatgtata cggatttcga ttggtataaa 720 tcactcacag aaatagaatg ggtaatagaa aatgaaggat gtaaactaat taaatctgaa 780 attaaaggcg aaactctaat tatcaaaaca gtgaaaagag tcgtagaaga atatgacact 840 atcattgaac tagacaatct aacattaatt ggacactcca acaaaggatg gagagatcaa 900 acctcattac cagaaaatca agaagaaaac aaaaataaaa aacaatcaga tgaacaacaa 960 ccaaaatcta catcagaaca aataccaaaa caaggaaaaa gaacaatcat tatcccacca 1020 atcacctcct catctaatga taaaacattt atggccctag atgatacagc tcaaaaggta 1080 atagatgcat cactagaaaa cgtagatgaa ctaacaagaa tagagaatga gatcaatacc 1140 caatcgataa aaaatcaaga aattaaaaca aaatcagaaa aacctattga accaccatct 1200 ccatacaata caccaataaa acaacacggc gaaacaaaga gactaacaac ggaagaaaaa 1260 cttagagagg taaatgatat cttcgacttc accagcccaa acactgatgc aataagaaaa 1320 atagaattcc cactttcaat cccttcaaat caatggttct tctcctctcc aataaaacat 1380 atcatgagtt tcagcccaac ctatggtaca ccatcgaaaa ttatcgattc accaagtaaa 1440 ttgttaaaca caccaaaaaa caaacaacaa gacaaaacaa acaggaccaa caaaacacca 1500 gtcaaaccaa aattgggtga atatggacaa aaaagcataa aaacaaaaat ggatgaagat 1560 atagacaact tcaacaagga aattctaaaa caaatcagta aatcaaaaaa ccccacaaac 1620 acacaagacc aacaacacaa cgaacaaagt aaagatcagt tagaagacaa aataaaccag 1680 aaacttcaaa aggtagataa aaatctaccc caaataaaac tataaagaaa aacacaataa 1740 gaataggtgt ttggaacgtc caaggatcca acaccatcca atctgccagc ataataaaca 1800 ccgtattaga caacaacaag ctggacgcag cacttctaac tgaaacaaat ataaaaacaa 1860 ataaaattta ctcaataaat caacaatata aaaacaaaat aacacaccac gcaccaattg 1920 ataaaacagg acaaggggtg tcacaaatca tcatcaacac ccaaataaaa acaacaacaa 1980 aaacaataaa cgaaagaatt atatcttccg aatggatgat tgcaaaaaca cagatcaaat 2040 gcacaaccat ttatgcccca gcaaagtcaa acgaaagaca tgaatggtat aaggaaaatc 2100 tcacagaaga aatactacac agcgatatta taaccggaga cttcaatgta gactgttcag 2160 tggataataa ccttaacaaa tatattaaaa caatcttcga cgaattcgaa tttacagaaa 2220 ttaagaatgg aatcacattt ccaagaaaca aatcaaccat cgatagggtc tttgtctcaa 2280 agaagatact ccatctaaac ccaatagtta ctactaagga aatcaaatta aaatctgatc 2340 acaacatggt aatcattgaa ttaaaaatac cagaatacga gcaacaaaag aaaggagaaa 2400 gactatggag acaaaaccta gaaactctca aaatgaattc aaccagcctc aaaattaata 2460 aaacaattaa atactacaat aaaaaattcg aagagaacac cagcaaatgg tacaaattaa 2520 acatttgcga acaatggttg aaactaaaag atgaaataaa gaaactgtca attaatatag 2580 aaattagaga aagcaataaa accaaaaaca aactaaaaga actcgcagaa aaactcgaaa 2640 cagccaaaga ttcaagagca atcttcctta aagaagagat caacaatatt ctaaaggaac 2700 aagtaagaat aaaacaagca aaccaaacca acacccacat caacaacaaa gaaacaccat 2760 caaaatactt aaccagaagg ttaaaagtcc aaagaaagac aaatgaaatc cctcaaattc 2820 tagatccaag caacaattgt ttggtaacaa aacacgagga tatcctcgaa gtagcaagaa 2880 gatattacga gaacctgtac cagaagagag aatgcaacga agacactcac cacgaactcc 2940 taaaaacatt caataaaagg attgaacaaa agatattgga cgaaattaat caaccaatag 3000 aaggatatga aatcagacta ggcattgaaa aaatacaaga aggaaaagca cctggtaaag 3060 atggactcct tccaacattt tacaaaaacc acattaatga aattctacca ataatatcaa 3120 aactatacaa tcacttctgg aacacaacaa tcccaaagga cttcaaacaa gggatattga 3180 ttactatata taaaaacaaa ggagacccaa acaacctgga taattataga ccaataacac 3240 ttctaaatgt cgattacaaa atctattcga aaataatcaa caataggatc ctgaagttat 3300 taaacaaaat aatctcacca ttccaaactg gattcgtacc aagaagacta ctgcacgata 3360 acatcatcac cttaaactca acaatagaaa taataaaacg agagattaac acaaaagagg 3420 atatggaacc aatcataaca ttttatgatt ttgaaaaagc attcgattca atctcacata 3480 acgccattct acgtactctt gcacatctaa aactaccact caaaatggtc ctcacaataa 3540 tgaacttact caatgaatca gaaacatcag tgtacatcaa taactcatta tctaaaagct 3600 ttacatcaaa aagaggaacc aaacaaggtg atcctatctc tcccactatc ttcgcattag 3660 tggttgaatg catggcaact actataataa atgaccgctg catcaatggt gtaaccaaag 3720 aaaccattaa aatactacag ttcgccgatg atacagcaac aatagcatac aactttatgg 3780 accatttcct catgaatgaa tggatcaaga agttttgcca agcaaccagc gcaaaaataa 3840 accaaaccaa atgctcgtgt atcactttca aatggaacac cagaacgcta tacaccgtca 3900 taaaatcaaa cgaaaggtac ctcggctttg acttcaacaa taagggtatc aaatccaaaa 3960 tcaatacaat ctcagacaac atcagagcta aactagtaac atggaactca acttcatcaa 4020 cctatatggg cagactcata atggcaaaaa catatgcact atctcagtta acgttccaca 4080 cttacatcaa caccacacca caacacaata gtattgaaaa taacatcgta aaattcgttt 4140 tcaacacaaa atctaaaaac tctctttcac tacaaagaag acaaaacaac tatatcaatg 4200 gtggcctaaa cttatggaac ctgaagacaa gagaactagc acagaaagca tggttattcg 4260 aaagatacct ccaccaaaga gtaagtaaca ctcctagttc atatataaaa ctgtgggaag 4320 aggaactcaa aaacaacaac aacaacaaaa caacaacaaa acaaaatcaa ctacaactac 4380 actggcaatg caagcaagca tggacccaat taaaaactcc acaaaataaa caaacacact 4440 acgaacacct cccaaaatta aaaaaaatat acgaggatat gatgacaact caatctcctg 4500 aacacaacaa attcatacca actcctggcc aaaaagaaat aatgaccaaa attaatagca 4560 aacaccttcc atttaaagaa atcaaaaaaa taataaacat gaaaggtaga gatctattat 4620 ggagatacac actgaaagca ctaccaaaaa tatacaacat gccatgccaa cagtgtgggg 4680 aagatgagac ctcagaacat atattcttca actgcaaagc ccacatcaaa aacacacaag 4740 aaatcttcaa ctacacccta actaagtctg gacacaccac tcacacctgg aatgtgaaga 4800 ttttaaacca tctacaaatt gcactagtag ccaatttaat agctattata ttcgataaaa 4860 tctggcacaa aagaaataaa ctcattcacg atgaaaaaga aataataatt cacagacaac 4920 aagtcatacg tgaactaatt aaaacacaaa gagctgcatg ggacaggaca caagcggtta 4980 taaacaaaac attaagaatc aaatcaaaac aacggccaga agaacaaaat aaattagact 5040 cactaatctc gctaaagcta ttacaattta gcagacaatg gaactcacct cttcacgcaa 5100 tagaacttcc taaacatctc aaaaaataca ataattcact cagtactttc tataaataaa 5160 taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaacc 5220 tgggaaccca agttaatcag aaaatttcca gatttttaac tttttaaaga aaaatataaa 5280 aaatagaaag aaaataaaat gaaaatcaca aacaacacca caaatattaa gcaacacaaa 5340 tgcctacaaa aaataagcga aattgtggat aaaactcaat taaaaaaaaa acaattcaaa 5400 tacgtcataa acatcaatca cgaaccagac gataaaataa aaaaagattt agaaaaaagt 5460 ttggacaaaa aagatgtttt aatcaaaagt aataatacaa ttcaaaattc gaaacaaaaa 5520 aaaaaaaaaa aaaaaaaaag taaactaacc cctcttttag agaccctgta aaaaaaaaaa 5580 aaaaaataaa ataaaatgag atctagcatc tcaaggagaa gcaaataata attgcgttat 5640 ccaattcaaa aaaaaaaaag ttgaataaga agagaaaaaa aacaaacaac aacaacaatc 5700 tctttttttg attatttcat ttttcatata aaaaagactt aaaaaacaaa taaataaata 5760 aataaactat cttaaaaagc catcttttta ctttattgta atttattatt tgtttcgttt 5820 tgagaatttt gcgtagcaaa attttcaaaa aaaagcaaaa aataatcaga tcgaaaaaaa 5880 acaaacaaca acaacaatct ctttttttga ttatttcatt tttcatataa aaaagactta 5940 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaacc tgggaaccta agttaatcag 6000 aaaatttcca gatttttaac tttttaaaga aaaataaaaa aaatagaaag aaaataaaat 6060 gaaaatcaca aacaacacca caaatattaa gcaacacaaa tgcctacaaa aaataagcga 6120 aattgtggat aaaactcaat taaaaaaaaa acaattcaaa tacgtcataa acatcaatca 6180 cgaaccagac gataaaataa aaaaagattt agaaaaaagt ttggacaaaa aagatgtttt 6240 aatcaaaagt aataatacaa ttcaaaattc gaacaaaaaa aaaaaaaaaa aaaaaaaaaa 6300 aaagtaaacc aacccctctt ttagagaccc tgtaaaaaaa aaaaaaaaaa taaaataaaa 6360 taaaatgaga tctagcatct caaggagaag caaataataa ttgcgttatc caattcaaaa 6420 aaaaaaaa 6428 // ID DNA8-33_AP repbase; DNA; INV; 328 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-33_AP. XX NM DNA8-33_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-328 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1775-1775 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. Putative hAT element. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 328 BP; 96 A; 51 C; 49 G; 132 T; 0 other; cagggatggg caactggcgg cccgcgtacc atttttcact ggcccgtgct acatttcaat 60 ttagtttata aaaatataaa atgttaagat ttttctgtat tttattttat tatatttata 120 aaattattaa aaccgatata atgtttatcg taaaaagtag atataaattc ttatctagta 180 ttgaactatt gaatgtaaac aatattatat tatttatagg cgtatagtta acattaggtt 240 tttttttttg ctttctatgt tctctggccc gctagatttt cgcacattca aaattggccc 300 tctagtatca ttgagttgcc catccctg 328 // ID Gypsy13-SM_I repbase; DNA; INV; 4174 BP. XX AC Contig99; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13-SM_I; KW Interspersed repeat; LG_I; internal portion. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-4174 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-4174 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 750-750 (2007). XX DR Genome; Contig99; Positions 138707 134534. XX CC Positions [3423-3908] - Integrase core CC 'AATT' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..1008 FT /product="Gypsy13-SM_I_1p" FT /translation="MDSDVEDFILETERFFEMTNIVDEYRSKFVKAFLSIN FT STKKYELTNETLHYTERLRNAFSKNRSLIDDLNEALNYRQGNEAAEEFCSK FT IEKLAKNILKHKLDEEELKKFLLFNSLEDKEVKKDLKMQDLKSFDEMKERL FT KRHDQVSKEINVFHADVASMNKTQRYADAVKKSLPRYNYDNRSIREREANP FT RFKDEYTREPQHNPRFKGDLTREHRYAISNGWRPMNRSKPYNTREFHSRDH FT IRTNQILRCWACHEEGHRRSECSNIQCAHCKRSGHFKHQCHELFPGRRRKF FT HKSVAAFGDEDPEADFEYPNGNAPPHEEVIGAIN" FT CDS 1026..4157 FT /product="Gypsy13-SM_I_2p" FT /translation="MSKGCMILNDCDSLNDCDSLNDNCCDGIFDYRNYINI FT YINKFCMDEVFKDPPLTSNELNVLNRDKRNDVIVYDKMQGRPSVTVMINNV FT KLVCLLDTGAKISVIDYNIFKKLGNLNMTTSNLKLRCANDSPIEICGKTLL FT NVKVDKISDIKKVEFVVAKDINPKLIGGIDFLERFGMQLQIENRTDEINTI FT MENSFKNHISEDKIINNLISTFDGIFMKHKWDVGKTELLKHEILTKGSPIV FT INPRRQPIHLIEKIEENIQEMLKYGIISECTSPWNSPLVCVKKKNSSDIRI FT CLDYRALNNITERPIFPIPNTDELLDILDGAKFFSTIDLGNAYYQVELDDA FT SKLKTAFSTKSKQYYFNRMPFGIAAAPATFQRLMNLVLGELNWNGTIVYLD FT DILIFSKTLNEHYSTIEKVFQKIKLANLKISPAKCQFKRKEVKFLGHIIGE FT DGIRTDESKIETIKNFERPKCVKSLRGFLGLANYYRKFIENYTKDSKVLEA FT LCGSKQKKLMWTDECETSFKNLKLKLTSTPVLKFPDFSKPFILDTDASFDR FT IGAVLSQKDGGKERVIAYGSRAMNKHELGYCITRKELLSIFYFTGHFKHYL FT YGKKFILRTDHKAITFMMKTKKAITPQFQTWINHLSGLDMEIQYRKGDNHN FT NADALSRKECETCVQCQMIHENPKMGKLKTRLLTLMTDNIENIWQKNSKEI FT ENIKENISLMPDKTFYKYVNGVIMTNNNKIWIPEENKSEFIIEYHKKLCHA FT GANKVKMYVSNNFDMKDLEQSIRNVVQSCENCQKRKVDTTKTKEEIICKSD FT SKLFETIYVDFCGPLQLAMKGERYILGIIDHHSKFISLHAVKSQDEKTVVR FT LLKEEWIMKYGAPRIINCDRGKSFESKLMKDFAEFHNIEINYSSPYHHSAN FT GQIERQFRTIRDAIQITLKDKRHKSWVDILPEVQFMLNSTFQKTIRKTPAE FT MVFGKKIYSEWFSEIVEEEKINQPTNREFQIGDCVLIKVNVRNKGDNRYMG FT PGVITKKIHDRSYEIKLENGSSITRNIEWLKVFKKGGM" XX SQ Sequence 4174 BP; 1646 A; 552 C; 778 G; 1198 T; 0 other; aattttatca agccaccaga aacatatgac atggattcag acgtggaaga ttttatctta 60 gaaacggaac gattcttcga gatgacaaat attgtcgatg aatacagaag caaattcgtc 120 aaagcttttc tatcaatcaa ttcaacgaag aaatatgagt tgactaatga aactctgcat 180 tatacagaaa gattaagaaa cgctttttct aagaacagaa gtcttatcga tgatctcaac 240 gaagcgctaa attatcgtca aggaaatgag gctgccgaag aattctgctc aaaaattgag 300 aaacttgcaa aaaatatttt gaagcataaa cttgatgaag aagaactaaa gaaatttctg 360 ctattcaatt cgcttgaaga taaagaagtt aaaaaagatc tcaaaatgca agatttaaaa 420 tcatttgatg aaatgaaaga aagattaaag agacacgatc aagtttcaaa agaaattaat 480 gtatttcatg ctgatgtagc ttcaatgaat aaaacacaac gctatgctga tgctgttaag 540 aagagcttac caagatataa ttatgataac agatcaattc gagaacgaga agctaacccc 600 agattcaaag atgaatatac aagagaacct caacataatc caagattcaa aggcgatctt 660 accagagaac atcgatatgc tatttcaaat ggatggagac ctatgaatag gagtaagcct 720 tataatacca gggaatttca ttcgcgagat catatccgaa caaatcaaat tttgcgatgt 780 tgggcgtgcc acgaagaagg tcatagaaga tctgaatgct ccaatattca atgtgcccat 840 tgtaaaagat ctggacattt caaacatcag tgccatgagt tatttccagg cagacgtcga 900 aaattccata aatcagtggc tgcattcggt gacgaggatc cagaggctga ttttgagtac 960 ccaaatggga atgctccacc acacgaggag gtgatcggag caataaatta aaggtaatta 1020 atgatatgag taagggttgt atgattttga atgattgtga tagtttgaat gattgtgata 1080 gtttgaatga taactgttgt gatggtattt ttgattatag aaattatatt aatatataca 1140 ttaataaatt ttgtatggat gaagttttta aagacccacc cctaacgtct aatgaactaa 1200 atgttcttaa tcgtgataaa cggaatgatg taatagtata cgataaaatg cagggaagac 1260 ccagtgttac cgtaatgatt aataatgtaa aattagtatg cctattagat actggagcta 1320 aaataagtgt gattgattat aatattttta aaaaacttgg aaatttgaat atgactacaa 1380 gtaatttaaa attaagatgt gcgaatgata gcccgattga aatttgcggt aaaactttat 1440 taaatgtaaa agttgataaa atttctgata ttaaaaaagt agaattcgtt gtagccaaag 1500 atataaaccc aaagttaata ggaggtatag attttctcga aagatttgga atgcaattgc 1560 aaatagaaaa ccgtacagac gaaataaata caattatgga aaattcattt aagaaccaca 1620 taagtgaaga taagattata aataatttaa tttccacgtt tgacggtatt tttatgaaac 1680 ataaatggga tgttgggaaa accgaactat taaagcatga aatacttact aaaggtagtc 1740 cgattgtaat aaacccaaga cgacaaccta tacatttaat agaaaaaatt gaagaaaata 1800 ttcaagaaat gttgaaatat ggcattatat cagaatgtac atccccttgg aattctccgt 1860 tagtatgtgt taaaaagaaa aacagtagtg atataagaat atgcttggac taccgtgcat 1920 taaataatat aaccgaaaga ccgatttttc ccattccaaa tactgatgaa ttattagata 1980 ttcttgacgg agcaaagttt ttctcgacaa ttgatttagg caatgcttat tatcaagtag 2040 aactcgatga tgcatcaaaa ttgaaaacag cattttcgac aaaaagcaag caatattact 2100 ttaacagaat gccatttgga atagcagccg caccagctac atttcaaaga cttatgaatt 2160 tggttcttgg agaattaaat tggaatggaa caattgtata tctggatgat atccttatat 2220 tttcaaagac attgaatgaa cattatagca ccattgaaaa ggtttttcaa aagattaaat 2280 tagcaaatct taaaataagt ccagcaaaat gtcaattcaa aaggaaagaa gtaaaatttt 2340 tgggtcatat cattggtgaa gatggaatac gtacggatga aagcaagatt gaaactatta 2400 aaaattttga aagacctaaa tgcgtaaaaa gtttgagagg atttttaggc ttggcaaatt 2460 actatcgcaa atttatagaa aattacacca aagattccaa agtattagaa gcgttatgtg 2520 gaagtaaaca aaagaagctt atgtggactg atgagtgtga aacttcattt aaaaatttaa 2580 aactgaaatt gacgtctaca cctgtattga agttcccaga tttttctaag ccattcattt 2640 tggatacaga tgcaagtttt gaccgcattg gtgctgttct gtctcaaaaa gatggaggta 2700 aagaaagagt aatcgcatat ggatcccgag ctatgaataa acacgagttg ggttattgta 2760 taacaagaaa ggaattacta tctatatttt attttactgg tcattttaag cattatctat 2820 atggtaaaaa gtttattttg aggacggatc acaaagccat tacatttatg atgaagacaa 2880 agaaagctat taccccacaa tttcaaactt ggattaacca tcttagtggt ctggacatgg 2940 agattcaata cagaaaaggt gataaccaca ataatgccga cgctttatct agaaaagaat 3000 gtgaaacatg tgttcaatgt caaatgattc atgaaaatcc taaaatgggt aaattaaaaa 3060 caagattgtt gacactgatg actgataata ttgaaaatat atggcaaaag aactcgaaag 3120 aaattgaaaa tataaaagaa aatattagtt tgatgccaga taaaacattt tataaatatg 3180 tcaacggagt gattatgaca aataataata aaatttggat tccagaagaa aacaagtctg 3240 aattcatcat agaatatcat aaaaagctct gtcacgcggg tgctaataaa gtaaaaatgt 3300 atgtatcaaa caattttgat atgaaagatt tggagcaatc tattcgtaat gttgtacaat 3360 cctgtgaaaa ctgtcagaaa agaaaggttg atacaactaa aacaaaagaa gaaataattt 3420 gcaaaagtga ttcgaaatta tttgaaacaa tatatgtaga tttctgtggt ccattacaac 3480 ttgcaatgaa aggagaaaga tatattcttg ggattataga tcatcatagt aagtttattt 3540 ctttacatgc agtgaaatca caagatgaaa aaacagtggt aagattactg aaagaagagt 3600 ggataatgaa gtatggagca ccaaggataa taaactgtga cagaggtaaa tcatttgaat 3660 caaagctaat gaaagatttt gcagaattcc acaatattga aattaattat tcaagtccat 3720 atcatcactc agcaaatgga cagattgaga gacaattccg tacaatcaga gatgcaatac 3780 agataacact caaagataaa agacataaat catgggtaga tattcttccc gaagttcagt 3840 ttatgttgaa ttcaactttt caaaaaacta ttaggaagac tccagctgaa atggtgtttg 3900 ggaaaaagat ttatagtgaa tggttctcag agattgtgga agaagaaaaa atcaatcagc 3960 cgacaaatcg cgaatttcaa attggagatt gcgtattgat caaagtaaat gttcgaaata 4020 aaggtgataa cagatacatg ggtccaggtg tgattactaa aaagattcat gatcgaagtt 4080 atgaaataaa attggaaaat ggaagttcca tcacgagaaa tattgagtgg ttaaaggttt 4140 tcaaaaaggg ggggatgtga ggcgttttat aatt 4174 // ID hAT-54_HM repbase; DNA; INV; 5063 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-54_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5063 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2042-2042 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1009..4077 FT /product="hAT-54_HM_1p" FT /translation="IFYISSLIIIFYVYLRSQSTGNCLYSSISLRLTGNNC FT IVNDLRLQTSIELFLNAEFYSNHPTFHYAFINNTSSFLCYDNILPLSVSLN FT SLDSGKKSKDLVQEEAINNCNNENWSSFLCILALSSVCNKKIECLYPDFGL FT QKYKILFSSTILPRITSLNTIESLHLLFCYEGVLNDFSAPFQHNHYTPVII FT KENLKRKNLNKSCKMSQTKIKFYFETKHQDNNYSLDNLKSKPMEVSSQPPL FT SLSNFSKITCCESFSLPSNNYDIGYLVSNYTSYSSIAKTLSNSELQHLVKD FT VFVPCKTFSFPKNLNGRSFQFMWIEKFPWLTYSKIFDGAFCLPCVLFGCNF FT PSECSLVERLFCKPFDRWNDASRYFQKHAFGKNNNKSVCRNKGLHVKTAEV FT LFSLSSIWSSKTESIDITSQKIVQSQVSKNRQLLQPIIETIILCGRLGLSL FT RGHRDDSEFHPENSESFNHTVGNFIELLHFRVKAGDKVLEDHLKYHQQNAS FT YISKTSQNQLIRCCGEVITDTIIGEIKNSKYFSIIADEASDSSNKEQLSLV FT IRFVDSKFNIREEFISFLHCTNGVTGQGLFDILLKSISDFSLDIMNCRGQS FT YDGAGAMVGHTKGLSSRILILNEKATFVHCYSHRLNLVICGSCNVQYVKNL FT LAYVKEVSYFFNLSPTRQQKLEEHIECTVPIAVKKKLKDVCRTRWVEKVNG FT LDTFQELFIPLVSCLEEMSLNANKSFNHSTASSASSLLKLITGFDFIVAMC FT ITRNVFDLTLPITRMLQSKSNDIYDGLNLIQALKDVVISLRNIVDQHHKMC FT YEQALKIAQSINVAEAKPRTSFISKNRDNTPSESISDYFKLVITIPLLDHL FT SVELDTRFDDTTLKCYKALVLVPTKMISVVQCSHDTSWKDSIISFSDFYAT FT DLPNPLALSGELDLWEAFWLNFEGDVPSNISETLKAINFPGFENIKVCLKL FT LATFPLTSCECERTFSSLRRLKNYMRSTMVQDRLNGLALMNIHSEIVIDIN FT KVIDKFATNNRRLTFK*" XX SQ Sequence 5063 BP; 1745 A; 737 C; 740 G; 1839 T; 2 other; caggggcgga tctagggtac gtctagtacg tcttaagatg tacccaaaaa aaaattaaat 60 aaaaaaaaaa aaaattaatt tacgtttcaa ttataaaaat aacttttttg gttaactgtc 120 agctcaaata agatttaatt ttataggcac ttgtgaaaca agaaatacga acggttcttg 180 tttaaagttc ttgttaaaaa gatatctgtc tttgccggag gtttaagcaa ttcttgataa 240 ataatattta attaaataaa cgaatggacg tccccaattt tattttatta taaaataaaa 300 aattatttta aatttgtatt taatttgtta aaacaataaa taaataatat ctttactgat 360 aattattttt ggacgtcccc aaattggtac acccttatta aaatattgtc aaaatctttt 420 gttttawgac gatcaggagg taaaactaat gataatatcc tataaataat aagatacacc 480 aatagacgtc cccaatatta ttttatttta ttatttatct ttatatttaa tttagttttc 540 aatatttaat ttctaaatac tcggtttctt ttataaaaaa acactgccgg acgtcccttc 600 gttaatacac cctgaaatac atgccttatt cctaatattt aaaatgttgt ttaatgttaa 660 aaattcaagc atttatttgt gtgttcgagt attaatttaa aatgtcactc aatcaagaag 720 ttaaaaacat tatgctacaa aatgatgaaa caaaattgat cgctttgaaa gatatattac 780 aaaaaaatat tagttcttgt aataatacga ttcttgtcga ctctttcgaa gaagaaaaaa 840 ctcttaaaaa ttttgaaatc tatctgccaa agagttttaa agaaaaacat aatggcagtt 900 atgttgtttt aaggtaaagg tatttgttca aattccatga tttttttaat aactttttgg 960 ttagttgtaa tgttatattg actgtaagct agttgaaaat acttataaat tttttatata 1020 agttcattaa ttattatatt ttatgtatat ctcaggtctc aatcaactgg aaactgtctg 1080 tacagttcta tctccttacg tcttactgga aataactgta tagttaacga tttaagatta 1140 cagacctcaa ttgagctatt cttaaatgcc gagttttatt ctaaccatcc aacttttcat 1200 tatgctttta ttaacaatac tagctctttt ctttgctatg ataacattct tcctttatca 1260 gtatcactaa attctttgga ttctggaaaa aaaagtaaag atttagttca agaagaagct 1320 attaacaatt gtaacaatga aaactggtct tctttccttt gtattcttgc attatcaagt 1380 gtttgcaaca agaaaataga atgtttgtat ccagattttg gtctacagaa gtataaaatt 1440 ttattcagct caactatttt acctcgtata acttccttaa acactattga gtctttacat 1500 ctcctttttt gttatgaagg tgtattaaat gatttttccg ctccttttca acacaaccat 1560 tatacccctg tgataataaa agaaaactta aaacgaaaaa atttaaacaa aagctgcaaa 1620 atgtcgcaga ctaaaataaa attttatttc gagaccaaac atcaagataa taactattcg 1680 ttggacaatt taaaaagtaa accaatggaa gtctctagtc aacccccatt atcattatca 1740 aattttagta aaattacatg ttgcgaatca ttttctttac cttccaataa ttatgatatt 1800 ggttatttgg tttctaacta tactagttat tcaagtattg caaaaacctt gagtaattct 1860 gaacttcaac atttagttaa agatgttttt gtaccttgta aaacattttc ttttcctaaa 1920 aatcttaatg ggcgaagttt tcaatttatg tggatagaaa aatttccatg gctcacatat 1980 tcgaaaatat tcgatggagc cttttgtttg ccatgtgtac tttttgggtg taattttcca 2040 tctgaatgta gtttagtcga aagattattt tgtaagcctt ttgatcgctg gaatgatgct 2100 tctcgttact ttcagaaaca tgcatttggc aaaaacaata ataagtctgt ctgcagaaat 2160 aaaggcttac atgtaaaaac tgctgaagtt ttattttcat tgtcgtcgat ttggtcttct 2220 aaaacagaat caatagatat tacatctcaa aaaatagttc aatcacaggt ttctaaaaac 2280 cgacagttat tacaaccaat tattgaaaca attattctct gtggacgtct tggtctttct 2340 ttacgaggcc atagagacga ttcagagttt catcctgaaa atagtgagtc ttttaatcat 2400 actgttggaa attttattga attattacat tttcgtgtta aagctggtga taaagttctt 2460 gaagatcatc ttaaatatca tcaacaaaat gcttcttaca tatctaagac atcacaaaat 2520 cagttaataa ggtgttgtgg agaagttatt actgacacaa taataggaga aattaaaaat 2580 tctaaatatt tttctattat tgctgatgaa gcttctgaca gttcaaacaa agaacagcta 2640 tcattagtta tacggtttgt tgattcaaag tttaatataa gagaagaatt tatcagtttt 2700 ttacattgca caaatggtgt tactggtcaa ggattatttg atattttgtt aaaaagtatt 2760 tcagattttt ctttagatat catgaattgc agaggacagt catatgatgg tgctggagca 2820 atggttggtc acaccaaagg attgtcctct cgtattttaa ttctaaatga aaaagctaca 2880 tttgttcatt gctatagcca taggcttaat ttagttattt gtggctcgtg caatgtgcaa 2940 tatgtcaaga atcttctggc atatgttaaa gaggtatcat attttttcaa tctttcacct 3000 accagacaac aaaagttaga ggaacacatt gaatgtactg ttccaatagc tgttaaaaaa 3060 aaattaaaag atgtttgcag aacacgttgg gtggaaaaag taaatggttt agatactttt 3120 caagaacttt ttattccttt ggtctcttgt cttgaagaaa tgtctttaaa tgccaataag 3180 agttttaatc attcaacagc ttctagtgca tcatcccttc tgaaactaat tacaggtttt 3240 gattttattg ttgctatgtg cataacaaga aatgtgtttg acttaactct tcccataacg 3300 cgaatgttgc agtcaaagag taatgatatt tatgatggtt tgaatcttat tcaggcgtta 3360 aaagatgttg tgatttcact tagaaatatc gttgatcaac accataaaat gtgctatgaa 3420 caggctttaa aaattgcaca aagtattaat gtagctgaag caaagccaag aacttcattt 3480 atttccaaaa acagagataa tactccttct gaatctattt ctgattattt taagttagtt 3540 atcacgattc ctttattaga ccatctaagt gttgaactag acacaaggtt tgatgatact 3600 accttaaaat gttataaagc acttgtgcta gttcctacaa aaatgatttc agtagtgcaa 3660 tgttctcatg acaccagttg gaaagactcc atcatatctt tcagtgactt ttacgcaaca 3720 gacttaccga atccgcttgc tttgtctggt gagcttgatt tatgggaagc cttctggtta 3780 aattttgaag gagatgtccc ttctaatatt tccgagacgt taaaagcgat aaattttcct 3840 ggttttgaaa acataaaagt ttgtcttaag ttacttgcta cctttccatt aacttcatgt 3900 gagtgtgagc ggacattttc ttcacttcgg cgcctaaaaa attatatgcg aagcaccatg 3960 gttcaggacc gtctaaatgg tttagcatta atgaatattc attcagaaat agttatagat 4020 atcaataaag tcattgataa gtttgccact aataatcgta gattaacttt caaataaata 4080 taaaatatat taaaaaaaaa wtttttttta ataaaaaaat gcaaaatact aaaaaaaaac 4140 agtgcaaaaa aagtcagctt cttttgccaa tggtaaaaaa aacggctatt ttcatagcca 4200 cacactttta taaaacatta tttggcataa aggaattatt aacatacatt gagttgcata 4260 catgttggtt tcaggtatat cgtttttttt atgctatgtt gaaatgatta tattgtaaat 4320 gtttttatct tttgaactga ttttttcaga ttttatatat tttgcaagca tcaatagaat 4380 taagatttta aaatctactt ttgctatgca aaataaaata ctaagattgt ggcaatttct 4440 gtatatataa ttcaacttta attattataa tatatagtta tttctgttat ttatatttgt 4500 tacaatataa acaaattata taattattta caataaaaat ataaaatgtg taagtttata 4560 atttaataat gaaatgtgta aatataaatc tagtttgtca taataagttt gtaaatatta 4620 tgtaaacata attttactat ggtaaggtgc ccaagcgata tccttgcgta tttttaggcc 4680 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4740 tatatatata tatatatata tatattaaca gggccgtcac taggagaggg gggtgtctaa 4800 ccccttacca aaactttgtt gaaaaattag tcggcaaatt tggacttgta gtctgcaagt 4860 ttggaactat agtcggcata tgtcagtaaa cgagaatcat acgtcacctt ctcccccccc 4920 cccctcctcc ttttttttct tcgggacggc cctgtaatat atcccctcca actccagaat 4980 agttgagggg gggagggggg agaatgaaat gttgctcgta aaaaaatcag atgtacccaa 5040 ataatttcct ggatccgccc ctg 5063 // ID Gypsy-2_PPc-LTR repbase; DNA; INV; 321 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PPc_; KW Gypsy-2_PPc-I; Gypsy-2_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-321 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 997-997 (2010). XX DR Genome; chrUn; Positions 96507510 96507830. XX SQ Sequence 321 BP; 76 A; 85 C; 52 G; 108 T; 0 other; tgttgtatag taaataagta ttccaaatat tatgctaatt attaatgtga cccgttgaca 60 ctactacagt agtcagtaca tttcatgccc cctgggctcc cactacgatt tgcctgattc 120 ctatctgtta ctgttcctac tgtaagtatt gcaattcccc gctggcctac ttttcgtcag 180 tttgttctct ctccctcgtt gcggctagtg gttgctgcct attcttctgt caactattct 240 actctgcgat acttcaataa agccgattct attggagcta caagtactct ctctcaagac 300 acacaacggc cagacacaac a 321 // ID Chapaev-19_HM repbase; DNA; INV; 2748 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Chapaev-type DNA transposon - a consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-19_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2748 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1913-1913 (2009). XX DR [1] (Consensus) XX CC ORF mutated (not shown). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 2748 BP; 913 A; 497 C; 514 G; 824 T; 0 other; cactcttagc aaaaataggc gataactgtc tcgtccgcac ccgagaaaac cgcaacattc 60 ctgtgaaagc ttatgaccat gggattccaa aaatgtaaac attcaaatca tctgcctacg 120 tccgtgaaag aaaatcaatt ttgaaatttt cccttattac acatttatgt gaaatgacag 180 catttctatt ttttgcattt ttacagcttt tattaaatgt aaaaaacttc taacaaaggt 240 aaaatataga tttttgcaat ataatttttt taattttaaa ttgacaaatt tcaggaaagg 300 gcactttttc aaagaaacta aatctgcgaa ggttgcattt ttgagtctct taagaaaaaa 360 acgaaaaaat gcccaattca gctaggaatc atgatgaaaa caggaaagtt gtttgcctcc 420 tgtgtctcaa aaaggcaaac agacagctca caacgttttt gcatgaaaaa attcaacaag 480 tctacagaca acaaatcaat tttgaagatg taagagtacc gcaagggctg tgtgaaactt 540 gtagaactgc tctgaggaaa cgttatgaag gacgtcctgg tgaattacca gctctttttg 600 actttgaaac tatcattgtc aaaccactca caagaggaaa tccttgtgaa tgcttgattt 660 gtcgcattgg ccggttgaaa ttgactgaaa agcaccctct tgacactgac acgaaagaag 720 aaaagatttc tgacaaaagg tgctcaaaat gtctgtcact aattggtagg ggattgccac 780 atcaatgctc tgtatcacag tttagagaga atataagatc tctggcagcc aatgatggta 840 aagttgcgga acagattgca tcttcaacaa tcacaagcaa ggatgcatct ccacatggaa 900 cagttcgact ttcacaaacc aagggtggga gagcattgcc tataactcca ggtaaacata 960 tggttacatc atttaagtat tttggtaaaa tatttgctgt aaagttaatt ttcatatgtt 1020 ttaggaccat caagtgcaaa acagctattc aaggatcacc cagttgtaac agcgcaagat 1080 cttgctcagg ttcaaataaa cacagggttg tccaataatg gcatcagaaa gcttgcaaca 1140 acaatcaaca atgtaagcac aaccaaagtc ttggagccat acattgttaa aaaatttcaa 1200 aaccttggga aaaatcttga agagagcttt actcagacca cagcggcatt tacaaacgcg 1260 gacaaaacaa cttcaaatta cattgttgca cattgcaaga gtctcaatag tcttattgaa 1320 gatttacttt tgacacggaa gattttttca aaccacgtca ttaaattggg gatcgacggc 1380 ggtggaggat tcttaaaagt ctgtctcgga gtttgcatca aagttcattt cacatacatc 1440 acaagcatca aaggatacag gagtgaaaaa acaacttctt gtggctgtgg ctgaaggagt 1500 tcaagaaaat tacaacaatg tgaaagtcat tctttccctt ctcgacattc aagaggtcaa 1560 ttttgttatc tcttgtgaca tgaaattggc taacatcctg tgcggtctgc aggctcattc 1620 atcctctcat ccatgtacat ggtgcacagc ggaaagttct aatcttgcga attgtggtac 1680 tccaagaact tttggctctc tgaagcagtc attccatgcg ttctctgcag caggttcaaa 1740 taccaaaaat gcaaagaaat tcggaaacgt ggttcatgag cctgttcttt cactcgaaga 1800 ttctgcattg gtaattgatc taatacctcc aatggagttg catctacttc ttggagtggt 1860 caatcatctt tttaagattc tgaaagattc atggccaaag gcagatgaat ggcttcaagc 1920 attgcatata caacatcaac cctttcatgg tggacaattt gaaggaaatt cctgcaataa 1980 gttgctgaag aacttggatc ttttgcaaac attggctgaa aaagacaatg cttttcaagt 2040 ttttcccatc attgaaactt ttcgaaaatt tgcaaatgtt gtatcatcct gttttggcaa 2100 caagttgcac agtcagtttg aagaaaaaat tgatgagttt cgagcttctt tccttgcatt 2160 gccaattgga actgtcacac caaaaacaca tgctgtgttt ttccatataa aggaatttat 2220 cctgaggaaa aagatgccac ttggtattta tagcgaacaa acaacagaag caatgcacca 2280 ttgctttagt acaaattggg caaggtacaa acgcccaatc agtcatccag aatatggaaa 2340 aaggcttgtg atgtgccttg ttgattttaa cagcaagaat ctttgaagaa aacactgagc 2400 acaaaataaa tagaattgga gatttatgaa taaattatgt tgaccttaca ttagaaatct 2460 ttatgtgaga acagaatgta cctttttatc cctttagaat aatgtttttt tatttattgt 2520 atgtattttt gtttaaagaa tgctgtaaaa acgcaaaaaa tagaaatgct gtcattttac 2580 atatatgtgt aataaaggaa aatttcaaaa ttgattttct ttcacggacg taggcagatg 2640 attttaattt ttacattttt ggaatcccat ggtcataagc tttcacagga atgttgcggt 2700 tttctagggt gcggactgga cagttatctc ctatttttgc taagagtg 2748 // ID R1A_NVi repbase; DNA; INV; 6351 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia DE vitripennis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1A_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6351 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 1085..2626 FT /product="R1A_NVi_1p" FT /translation="KGRCCFYCVCVFACLLARLHVLSAATGMGAKSRRKKK FT RASSDAASSSRSDPSPEARIRAADEAGSESESHAHARCRSPIANGKRLWAK FT TPTNASVNASDSQSECEEMEVVPVSAKRGLPASESAVGPRAKKRQQTLLSP FT RVSVERMSAHVNEEPQVNASVNKNVQTCVNDSERVGDMARIGDKLREFILN FT DANKVSKHASKIVLGCVSEYEQFLMRLQCENAELRGRLHESEKLCASVVKE FT CARAPMAPYTTVAAARGVPEQRATTKPVSVAKPSYALVLKGNENEPNTAIL FT NRLKSASGCENVRVKNVREARNGGVIVETLSEKERECLKRAVSNVRVNLSA FT NEPRKLNPRVIVYDVPNEMTEERFLSSLYEKNLSGVIERENVGREVRVASR FT QSKPGVSVGNVILDVAGRIRDRLLEEKRVYIEWNAYRVSKYENVPRCYGCF FT SYGHLLRECKGERLCYRCGKPGHRGAGCREREDCGNCRARGLSATHSALSP FT ECPEYKWRLERLRGRISS" FT CDS 2622..5801 FT /product="R1A_NVi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="AHKMAHSREIRVLQVNAQRGYSVMCDLGERLLNESID FT VCLIQEPHCVNGRVVGLPYGTRVFVSKSGRAALVVAGDTIECLSLDDWDCE FT NGVCVWTKGVHGEVYMVSVYCRFGEDIEPYLLSLDRLLERIAGHECILGMD FT ANASSGLWHSKDSTSSREREARGHALERWILANEWGLNILNEPSEHYTFSG FT PRGQSDIDVTLSNIRRHDCKLEWEVRCEWGISDHNPILIRLWHGVNEASPA FT AVPPPLSWRSWKHGYGRGLYMDVLNDRADELGLTSFSDLDSSQMALKMTEW FT LTSANDECMKRAKVMRREWVVWWTDELEYMKRDVQRCRKKYQRARCRQDEN FT VADLKSEYRKCEREYKCSMREVKERDWREAVGRDANDDPWGRVYRICRGKN FT KKSDLAGLKTPDGCTKTWMESASVLMKDFYPRDEGTNVERMPEMLNVTNDG FT RGEEEYEWSEVNLAVDKLRQRKAPGLDGITSSVIKYAWLAIPMYMKCMYDR FT CLKDGNFPAIWKKARVVVLLKGGDRDKKDPGSYRNISLLSGLGKPLERLMI FT ERMMERMNGKWNDCQYGFREGRSCEDAWMRFKDSVNSVNSKYAVGILVDFK FT GAFNNLRWSVILRYLLECGCDEHEMRLWMSYFNERYASLSSEMNEVMVKEE FT RGCPQGSISGPYMWNLCMNGLLDRLSAMNERIVAYADDLMIVANGNSRMEM FT ERMADECMRIVYEWGREVGVSVSEKKTVCIMLKGKLNVDGRKMRVTVAEGT FT LSSIGYVKSARYLGVCMGERMSFAEHIRGLRTKVMGAIGGLRRVMRKEWGV FT KKKTACTWTKGLLLAGVMYGSSVWYESMKYKTMRESMNSIQRCAMYACLRV FT CRTVSTEAMQVLLGWLPWDLECVKRANAFKARRGIGMNESDLVTDAEIVEK FT GVRDACKLMKERVHEIWQRRWSESTKGRVTHEWMCDVNFACERMWFEPSLR FT VGYILTGHGTLNAWLYDRGLADSAACPCGAPREDWMHVLTRCDMYESFRRL FT DEMGVLEDANGRMDVSGVLSARSQYECLCTFIERAYRMRASVISRMNEEER FT PNE" XX SQ Sequence 6351 BP; 1546 A; 1302 C; 2057 G; 1446 T; 0 other; gtatacctcc agacgttgag ctgagcacac gtgtttttcc gcgagtctca gtgctgtgcg 60 tctctagctt tttccaccta ctccactgcc gcatttttac ggttttgggg ctctggtggt 120 gaatacgcgt gtttagtgtg tttctgtagt gtttgtttat tgctaaaagt gcctgcgtcg 180 agtgtgaatg gattaaattg tgtaaaagtg cgtgtgaaag tttctgtgcc gtggctgctg 240 cggaggcagc taagccccgc cccccttttt cattcggggc gcgatatctc gacttccggt 300 ctggggaggg gtctgaaatt tgtcaggggt gcttcctcac ataattgtga ggaacctgcc 360 gaatttcgga tttttcgacc aaccgtttcg cgagctacag aaacgacgtc gttttgacag 420 ccgcgcggga gctccgcccc tcgtctgcac ttgcgggtta atatctcggc ttccggtcga 480 cagaggggtc cgagattttt caggggtgtt tcctcacata attgtgggga acctgcgggg 540 tttcgaattt ttcgaccgac catttcgcga gttacggaaa cggcgttctt ctaacagccg 600 cgctgaaact aataagaggc tccgccccca gtctgcactt gccgcttgat atctcggctt 660 ccggtcgaca gaggggtccg aaatttttca ggggtgtttc ctcacataat tgtggggaac 720 ctgcggggtt tcggaatttc gaccgtccgg cccgggattg ctcaaaaggg gaagtgagtg 780 ggagttgcct gtagaagcgg gggaactttc tccccttctc tcgctcgtcg taggcattta 840 aaagtgcgat atctcggaca gtgctctata gaagcgagtg aaatttttct gtgtgtgcta 900 gtggacgttt acgcgatctg ccacataaat ttcacggcga tcacctgact cttccgccgt 960 tggcgggaaa agactttcgt ggcggtgtga tcgtgtatac gcgcccgtat aggcgcgaca 1020 ggctgcctcc cctgcaacct gccggctctg cgtgactgac cactttttgt ttgcacgccc 1080 gtaaaaaggt aggtgttgtt tttactgtgt gtgtgttttt gcgtgcctgc ttgcgaggct 1140 tcacgtgctg agtgccgcca ctggaatggg tgcgaaatcg cgacgaaaga agaagcgtgc 1200 gtcctccgat gctgcctcca gtagtcggtc cgacccctct cctgaggcgc ggatcagggc 1260 ggctgacgag gcggggtcgg agtccgagtc gcatgcgcac gctaggtgtc gctcgccgat 1320 tgcgaacggg aagcgactat gggcgaaaac ccctacgaat gcaagtgtga atgcgagtga 1380 tagccagagc gaatgcgaag aaatggaagt ggtgcctgtg agtgccaagc gcgggctgcc 1440 agcatccgag tcagcggtgg gtccacgggc caagaagagg cagcagactc tcttgagccc 1500 acgtgtgtct gttgagagaa tgagtgcgca tgtgaatgaa gaaccgcaag tgaatgcaag 1560 tgtgaataag aatgtgcaaa cgtgtgtgaa tgatagcgag cgtgttggag acatggcgcg 1620 cattggggat aaactccgtg agtttatcct caatgatgcg aacaaggtct ccaaacatgc 1680 gagcaagatt gtcctgggat gtgtgagcga gtatgagcag ttcctcatgc gcctccaatg 1740 cgagaacgca gaactgcgcg gcagactgca tgagagcgag aagctgtgtg cgtctgtggt 1800 taaggagtgt gcgcgtgccc ccatggcccc gtacaccacg gtcgcggctg ccagaggagt 1860 gcctgaacag cgtgcgacta cgaagccggt gagtgtggcc aagccgagct acgccctcgt 1920 actcaaaggc aacgagaatg agccgaacac ggccattctc aataggctca agagtgcgag 1980 tggctgtgag aatgtgcgtg tgaagaatgt gcgtgaagcc agaaatggtg gtgtgattgt 2040 tgagaccctc agtgagaaag agagagaatg tctgaagcgt gcggtaagca atgtgcgagt 2100 gaatctgagt gcaaacgagc cgcgaaagct caatccgaga gtgattgtgt atgacgtgcc 2160 caatgagatg actgaggagc gcttcctcag cagcttgtat gagaaaaatc tgagtggagt 2220 gattgaaaga gagaatgtgg gaagggaggt gcgtgtggcc tccagacagt cgaaaccggg 2280 agtttcggtc ggcaatgtca ttctcgatgt tgccgggcgg atccgggatc gactcctgga 2340 ggagaagcgt gtgtatatcg agtggaatgc ataccgtgtg agcaagtatg aaaacgtgcc 2400 gcgatgctat gggtgtttca gctatgggca tctgctacgt gagtgcaagg gtgagcgtct 2460 gtgttataga tgcggaaaac ccgggcatcg gggtgcgggc tgcagggagc gggaagactg 2520 cgggaactgt cgggcccgcg gtcttagtgc gacccactcg gccctctctc ctgagtgccc 2580 ggagtataaa tggagattag aacgactgag agggaggata agctcataaa atggcgcatt 2640 cccgggaaat tcgtgtgctt caagtgaatg ctcaacgggg atacagtgtg atgtgtgatt 2700 taggtgaacg actgctaaat gagagtattg atgtgtgcct gatccaggag ccgcactgcg 2760 tgaatggtcg tgtggttggg ttgccgtatg gcactagagt gtttgttagt aaaagtggca 2820 gggcggctct tgtcgtggcg ggtgatacca tcgagtgcct ctcattggac gattgggatt 2880 gcgagaatgg tgtatgtgta tggacgaaag gcgtccacgg cgaggtttat atggtctcgg 2940 tatactgccg tttcggcgag gatatcgagc cgtatctgct gagcctggac agactgcttg 3000 agagaatagc tggacatgaa tgcatcttgg gtatggacgc gaatgcctcg agcggtcttt 3060 ggcacagcaa ggattcgaca tcgagtagag aaagagaagc gagagggcat gctctcgagc 3120 ggtggattct agcgaatgaa tggggactga atattctgaa tgaacctagt gagcactaca 3180 ccttcagcgg ccctaggggg cagagcgaca ttgatgtcac tctgtccaat attcgacgac 3240 atgactgtaa gcttgagtgg gaggtcaggt gcgaatgggg aataagtgac cacaatccca 3300 tactcatccg attgtggcac ggcgtgaacg aggcttcgcc tgcggctgtt cctcctccat 3360 taagttggag gtcttggaaa cacggctatg gacgtggcct atatatggat gtactgaatg 3420 acagagcaga tgagcttggt ctgacctcgt tttcagacct tgactcatca caaatggctc 3480 tgaagatgac tgaatggttg acgagtgcta atgatgagtg catgaagaga gcgaaagtga 3540 tgaggagaga atgggttgtc tggtggacgg atgaactgga atatatgaaa agagacgtgc 3600 agcggtgccg aaaaaagtac cagcgtgcaa gatgccgtca agacgagaat gtggctgacc 3660 tcaagagtga atatagaaaa tgcgagagag aatacaagtg ttcgatgcgt gaagttaagg 3720 agcgtgattg gagggaggcc gttggcagag atgccaacga cgacccctgg ggacgcgtgt 3780 atagaatctg ccgcggcaag aataaaaaga gcgaccttgc tgggctaaag acgcccgatg 3840 gatgcacgaa gacatggatg gagagtgcaa gtgtcttgat gaaagatttc taccctcgtg 3900 atgagggtac gaacgtggaa agaatgcctg aaatgctgaa tgtgacgaac gatggcagag 3960 gtgaggaaga gtatgaatgg agcgaagtca accttgccgt agacaaactg cggcaaagga 4020 aggcaccagg gctcgatggg atcacctcct ccgtgataaa gtatgcgtgg ttggctatcc 4080 ccatgtatat gaagtgtatg tatgacagat gtctgaaaga cggaaatttc ccggcaatat 4140 ggaagaaggc gagggtagtg gtcctgctca agggaggcga cagggataag aaggatcccg 4200 gctcgtacag gaatatcagc ctccttagcg ggctgggaaa acctcttgag cgcctgatga 4260 tcgagcgaat gatggaacga atgaatggaa aatggaatga ttgccaatac ggcttccgag 4320 aggggcgttc atgcgaagat gcatggatgc gcttcaagga ctccgtaaat agcgtaaatt 4380 caaagtatgc agttggaatc cttgtcgact tcaagggagc gttcaacaac cttcgctgga 4440 gcgtcattct gcgctacctc cttgagtgtg gatgtgacga gcatgaaatg agattatgga 4500 tgagttactt taatgaaaga tatgcgagtc tgagcagcga aatgaatgag gtaatggtga 4560 aagaggaaag agggtgcccg cagggatcca tctcgggtcc ctatatgtgg aacctctgta 4620 tgaatggact actagatcgg ctgagtgcaa tgaatgaaag gatcgtagca tatgccgatg 4680 acctgatgat tgtcgcaaat ggaaacagca gaatggaaat ggagcgaatg gccgacgaat 4740 gcatgcgcat cgtgtatgaa tggggaaggg aggttggagt tagtgtgtct gaaaaaaaga 4800 cggtatgcat aatgttgaaa ggtaaactga atgtagacgg acgaaagatg agagtcactg 4860 tggctgaggg cactctctca tccatagggt acgtcaagag tgctaggtat ttgggtgtat 4920 gcatgggtga gcgtatgagt ttcgcggagc atataagagg acttagaacg aaagtgatgg 4980 gagcgattgg aggcttgagg cgtgtaatga ggaaagaatg gggtgtgaaa aagaagactg 5040 cgtgtacgtg gacgaaagga ctgctcttgg ccggagtaat gtatgggtcg agtgtatggt 5100 acgagagcat gaaatataag actatgcgtg aaagtatgaa ttcgattcag aggtgtgcga 5160 tgtatgcatg tctgagagtc tgcaggactg tctccacgga ggccatgcag gttctccttg 5220 gatggctccc gtgggaccta gagtgcgtga aaagagcaaa tgcgttcaag gctcggcgcg 5280 gcatcggaat gaatgagagt gacctagtga ctgatgctga gattgtggag aagggcgtgc 5340 gagatgcgtg taagttgatg aaagagagag tacatgagat ctggcagcgc cgatggagtg 5400 aatcaacgaa aggacgtgtg acgcatgaat ggatgtgtga tgtgaatttt gcctgtgaga 5460 gaatgtggtt tgaacctagt ctccgtgtcg gttatatcct aactggacac ggaacactga 5520 atgcatggtt gtatgatcga ggtcttgcgg actccgccgc gtgcccgtgt ggtgccccgc 5580 gtgaggattg gatgcatgtg ttaactaggt gtgatatgta tgagagtttt aggaggctgg 5640 atgagatggg tgtgcttgag gacgcgaatg gaaggatgga tgtgagtggt gtgctcagtg 5700 cgagaagcca gtatgagtgt ctgtgcacgt tcattgagcg tgcatatcgt atgcgagcaa 5760 gtgtgatcag tcgaatgaac gaagaagaaa gaccgaatga gtgagtgaat gtggaaccgc 5820 aaggggggct ctggtctcat cccctgggtc tagcgaaccc acggccccag ctaaccggtg 5880 ctaccgagcc gggcagagtg caggatctgc aaagacggaa gagacttatc tcgaacctcc 5940 aaccaggagg ctaaatggta ccacgggggt cgggggcccc agggctccga taggcttatc 6000 gggaccctgc ccgaccatgg tgttggactc gtggtggcag ctggttgata gcccaaaacg 6060 cgggcgagag gtatgggcgg tgacgtgctt cggcgcgggc tccgttttac gcctcgacgg 6120 gtgagctgcg gtctcagctc gggcgctagc ctgcagagct agcaggggca tagatgggcc 6180 ccgcctcatg cagagaggat cacctggtcc tgacaaacca ggaagagaaa tctgtagacc 6240 tgacactagt caggttgcgc tggttcgggc gtgatcccaa cccctcgccc atgggggccg 6300 tgtgaattaa gccggaaggc aggtgtcgca cgttaaatca acgagactca a 6351 // ID BEL-243_AA-I repbase; DNA; INV; 6636 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-243_AA_; KW BEL-243_AA-LTR; BEL-243_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6636 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [5654-6235] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 2228..3226 FT /product="BEL-243_AA-I_2p" FT /translation="MSGLVNLASELSTDVDAAQNHQKQSRGERVKQKEKLF FT THADESTIASKKESSVGESWSKACSYCDKDGHQILNCASFKALDVGGRWKA FT VRQRNLCRLCLIPHRKWPCRSKKECGVEGCRIRHHILLHSTRNDNTDTSMP FT GGTVHQNHHRTKSFSLFRYLPVTLYGKGTQVETFVFLDDGSSSTLLEAGIA FT TQLGIEGEPDSLWLSWTGKIGRHEKCSRRISVKISGAEKEEKYLLNDVRTV FT RELGLPTQTLNYAELAKVFPHLQGLPVSSYVNAKPGMIIGIEHVRMLTSLK FT TREGGNNEPVPQKRDWGGAFMVGIPQAKRPWNSCTYTQSNV" FT CDS 5135..6634 FT /product="BEL-243_AA-I_3p" FT /translation="MARTMAYVYRAIKIWKRVLWNNERSSEPQRDELVKAE FT ELLWRQAQASAYPEEVRDLMNNRRVSKRSPLYKLSAVIDEHGVIRIASRIG FT CAPHVPYAAKYPVVLPRDHRITFLLVDSFHRRFLHANNETICNELRQQFYI FT SRLRVLVRKVGQECQLCKIHKAKPEPPIMGPLPRVRLTPFIRPFTFVGVDY FT MGPFMVKVGRSSVKRWICLFTCLTIRAIHLELVHTLTATSCVMAFRRFVSR FT RGAPLEVFTDNGTNFVGAHRQLTEEKQKILKINDTCASTFTNAHTQWHFNV FT PAAPHMGGPWERMVRSVKTAMQALSDNPRHPSDEALETIMLEAEGIVNSRP FT LTYVSLEDANQEALTPNHFLLYGSSGIKQPTTSPVSTGNMLRDSYKLAQHI FT VDEFWIRWTREYLPMLTRRAKWFDPAKPLEPGDVVVVVDDIPRNRWERGRI FT LETYPDKSGQVRRAKVQTARGVFMRPAVKLALLDVTGPKMSADASPETEVI FT HGVG" XX SQ Sequence 6636 BP; 1832 A; 1493 C; 1752 G; 1549 T; 10 other; ttctactaag attgtcatcc taagttggat gatagattgt tgcccatcgg aggatcctac 60 gtcggagtat tgcgcagtct ggacagtggt ctgtccaaga agacatcttt catccctata 120 cgtgaagtac gggggaggaa cctaacctcg aattcaccga ctgaatgcgg cactaggacg 180 gagttcgagg agcagctaag tgcagtaaat ttagtctgta tcctccctat tccccgttat 240 tacatccgtt tggtcgcgaa tcgcttctct agtgataagt caatcatagt tagatctgcg 300 attatagtta gatacatctg tctcgcgatg tcggaaaaga atgtcaatct gtcgactccg 360 ggtggcgaga agcaatgtaa aaagtgcgat cgcatcgata tcgatgataa tatggtgtgt 420 tgcgacgttt gtgaagcatg ggtgcatttc cagtgcgccg gtgttaccga ttccatcgga 480 gagcccgata ggagttggaa gtgtagtgct tgtttagctg actggaatgg ttcctccaag 540 tcggaggtgt cattcaacga gtcgtcggtt agcaagagta gtcgtgtttc gcagaggctt 600 cagcttagtc tacaactgtt agaagaacag caaaagttga agaagaaacg ggctgcagaa 660 gaattgaaaa tccgtcgact ggaagaagag gccaagctga aggagatgga tgaagaacaa 720 gaatatttga ggaagaagta cgaactgttg atgcagatcg acgatgagaa agattcgtcg 780 accaggagga gcggaataag tgtgaaaagc aagcgagatc atctggagaa gtggctgact 840 aagggttcgt cagcagacgg agtaggaaaa ccgacagtta ctcaaacgat aacaacgctc 900 ccgataccaa tcgtcggtga cgttgcaaac aggagctaaa gcgccgtctt taaagacgat 960 ctcagcgaca gaaaaggccg agagtccttc cattccgctg aaccagggta tgttgtacgt 1020 agatccgact attccagtcc agggaggaag cgtaacagct ttggcgcaat cttacgagct 1080 cctagctacg acggcagtat cggggagcat gtcatgctgc cggtttcgtt ttcttcggcg 1140 ccaataataa ttgtatcgac agcgggagga accgccactc catcagcagg cacagttcca 1200 gtgttcgtac cgtctacaga catggtggtt caaggaacag atagagttag tgcgcctcag 1260 tcttctggtt ctggtatagc accggtctac tcgtatccac ggatgattaa ccacggcttc 1320 agcagctgta acaataccgc cggaaatcac gcacggttta aatggaaatc attcatctcg 1380 aggactttgt tccatcccat cgaactcgtt tcatttaccg gctggaagtc aacccgtctt 1440 gccaagtaca agttcattcc agtattctaa tgcttccgaa gctgtcgttt cgagaaatca 1500 gccgtggcac ccggtggacc cggaagcggt agtgccaacg agttcatcat tccgctctat 1560 cccggatgca gtggctgcac ctttcgacat tccagtcaac ccacgatgta ttcctgtaaa 1620 tcaagcgacg gcgcacctag catacgggcc ggcatcggcg ggaaccggag gtgccatgag 1680 taatctgcca gttccttctt taggaccaac tagcgctcaa ctggcggcac gacaagtgat 1740 gccacgcgaa ctgccgaagt ttagtgggga ccctcaagag tggcccatat tctatagctc 1800 cttcaagaac agttcggagg tatgcggtta cacggatgct gagaatctgg ccagattgca 1860 gcgaagtctt ggaggttcag cactggaagc cgttcgaagc cgccttctgc tgccagcttc 1920 cgttccattt gttatggaca ctcttcaaaa gctgtacgga cgacccgaaa cgctgatmaa 1980 ctctcttctg aagaaggtca ggagcgtccc ctcacckaag tcggacaatt taaatacgat 2040 cgtcgcatac ggattggcga tacaaaatct cgtcgatcac attgtcctcg ctgatcaaca 2100 agcgcacctt gsaaatccaa tgctgttgca ggagttaata gagaagttac ccacgtccct 2160 kaaaatgcag tggggtkcst acaaacagca gttcgcgagc gttaatttgg cgacgttcaa 2220 cgacttcatg tcgggattag taaatctggc ctccgagctt agcaccgatg tggatgcagc 2280 gcagaaccat cagaagcagt ctagaggaga aagagtcaaa caaaaggaaa aactgttcac 2340 ccacgcggac gaatctacga ttgcgtcgaa gaaggagtct tcagttggag aatcttggtc 2400 gaaagcgtgc tcatactgcg acaaagatgg acatcagatc ttgaactgcg ctagtttcaa 2460 ggcgttggac gttggaggtc ggtggaaagc agtamggcag agaaatctat gcmgattatg 2520 cttgataccc caccgaaagt ggccctgccg ttctaagaaa gagtgtggtg tcgagggatg 2580 tcgtattagg caccatatac tactacacag tacacgcaac gataacaccg atacgtccat 2640 gcccggcgga acagtccacc aaaaccatca tagaacaaaa tcgttttccc tttttcgtta 2700 tctaccggtc actctgtacg ggaaaggcac acaagtggaa actttcgtct tcctggatga 2760 tggatcgtct tcaacgttgc tggaagcagg aatcgcaaca caattaggta tcgaaggtga 2820 gcccgatagc ctttggctga gttggactgg gaagattgga cgacacgaga aatgctctag 2880 gcgaattagt gttaaaatat ctggagcaga aaaggaagaa aagtatctac taaatgatgt 2940 tcgaactgtg cgtgagttgg gactcccaac gcaaacgtta aactacgcag aactagcaaa 3000 agtgtttcct cacctccagg ggcttcccgt atccagctac gttaatgcca agccaggcat 3060 gatcattgga attgagcatg tacggatgct aacgagcctg aagacaaggg aaggcggtaa 3120 caacgaaccg gtgccgcaaa aacgcgactg gggtggtgcg tttatggtag gaattccaca 3180 agcgaagaga ccgtggaaca gttgcacata cacgcagtcg aacgtataag taacggtgag 3240 ctccatgatt ccatgaggaa gttctttgcg gttgaagatg ctgctgttac gcgtcagatg 3300 gagtccgacg aagacaaacg agcacgaagt attctggaaa caacaacggc gcgaaaggga 3360 gctcggatgg agacgggact tctctggcgg cacgataacg tctattttcc ggacagctat 3420 aagatggcgt ttagtcgatt gaaaggactg gaaaagcggt tgacaaatga tcctgattta 3480 cgtagaaagg caaatgaaca aattgaaagc tttgagcaga aacagtatgt tcgcaaagtg 3540 tcgccagaag agttggggag agctgatcca catcgtcgat ggtatctacc gctaggaata 3600 gtgacgaatc ctaaaaaacc gaacaaaata cgaatgattt gggatgcagc agcgaaggtt 3660 ggcggagtgt gtttcaatga catgctgcta aaaggacctg atctcctcgt accgctcgtg 3720 gaagttttgc tgcggttcag ggaaggaaaa atagccatct gttccgacat ccgtgaaatg 3780 tttctgagga ttttaatacg ggatgaggat aaatggtcac agtgctttct gtggcgtaac 3840 agtccggatg aggtagtcca aacatatgtg ataaacgtcg ccatgttcgg agcaaccagc 3900 tctccgtgta cggcgcagta cacgaaaaac aaaaatgctc tcgaattcgc tgagcagtac 3960 ccccgggcag tgagcgcgat atttaaaggg tcactacgta gatgacttcc tcgatagcgt 4020 caataccgtg gamgaagcag ttmgactagt ccaggaagtg cagtcacatc cacgccgtca 4080 gctggtttga attcggcaag attctttcta actcaccaga agtactcgca cgacttggag 4140 agactagccc ggcggctagc aagccactaa accttggtaa agatgcggtc cgcgagcgcg 4200 ttttaggagt agtctgggtt ccctctgctg atcatttcac cttcgatcga actgggctcc 4260 aggaagtttc gagaagcagt acagaagcac ctacaaaaag gcaggttctc cgaacagtaa 4320 tgaagctgta cgacccctta gggtttgttg cgcatttcgt agtgcacggg aaaattctta 4380 tgcaggagat ttggagatcg gggaccaatt gggacgagcc catagcggaa catctgctcg 4440 aactatggaa caggtggatt gaactgtatg agagcatcaa cgaagtcaag gttcctcgat 4500 gttttttcgg agatctcaaa ccagagaggg tagatgagat agaaattcac gtgttcacgg 4560 acgccagcgt agcggcgtgt gcatgcgtag cctatctaag aatgtcgacc aaagaaggta 4620 aatggtgctc gatggtagca gcaaaaacca aagtagcgcc acttcggacg ctttcaatcc 4680 ctcgcttaga acttcaagct gctatgatgg gatcacgttt gcttcataac atctgttctg 4740 ctctcaccct aaacatccgg aagcgtttct tatggacgga ttcagctaca gtcctagcat 4800 ggctccgttc agacggtcga cgataccatc agttcgtatc ttttcgagtt ggtgaaatcc 4860 tgtcgctcac aagtgttgac gagtggcact acgttccatc gaaactaaat gtggccgatg 4920 acgccaccaa gtggaattca ggaccgtctt ttgatcctga ggacagatgg tttcgtggtc 4980 caccattctt gcgagaatca gaggatagct ggccgaaaga atcgacgaag gaagtgaaga 5040 aaccgaggca gtaacggaag agattcgtgt tgcggcggcg catcgaacaa tagaggaagt 5100 ggttgcagtc gaacgctttt caagttggaa tcgtatggct agaacgatgg cctatgttta 5160 ccgggcaatc aagatctgga agcgtgtatt atggaataac gagcggagtt cggaaccaca 5220 gagggatgaa ttggtgaagg cagaagaatt actttggcga caagcgcagg caagcgcgta 5280 tccagaggag gttcgtgacc tgatgaataa tcgtagagtt tccaaacgta gtccattgta 5340 caagttgtct gccgttattg atgaacacgg tgttatacga attgcaagta gaataggatg 5400 cgctccccat gtcccgtacg cagcgaaata tcctgtagtg ttgccaagag atcaccgcat 5460 aacatttcta ctggtggact ccttccatcg acggtttctc catgccaata atgaaaccat 5520 ctgtaatgaa ctacgacagc agttttatat ctcaagacta cgagtactag tgcgaaaagt 5580 tggtcaagaa tgtcaactat gcaaaataca taaagccaaa ccagaaccac caataatggg 5640 cccgttaccc agagtgcgtc tcacgccgtt tattcgaccg tttacttttg tcggcgtcga 5700 ctacatggga ccttttatgg ttaaggttgg gcgcagcagt gtgaaacgat ggatttgttt 5760 attcacctgc ctcacaattc gtgccataca tctagagttg gtacacactt tgacagcaac 5820 gtcatgtgta atggcttttc gaaggtttgt gtcaagacgc ggggcaccat tagaagtctt 5880 tacggataat ggtactaact ttgttggcgc tcatcggcag ctaacggagg aaaagcagaa 5940 gatcctgaag atcaacgata cttgtgcatc cacgttcacc aacgcacata ctcagtggca 6000 ctttaatgtt ccggcagccc cccatatggg tggaccttgg gaacgtatgg tacgttccgt 6060 caaaacagca atgcaggcct tatccgataa tccacgccat cctagcgacg aagctctgga 6120 aacaattatg ttagaagcgg aaggcatagt gaactcacgc cctctaactt acgtatctct 6180 agaggatgcg aaccaggaag ctctcacacc caatcatttt ttgttgtatg ggtcatccgg 6240 tatcaagcag ccgacgacca gccctgttag tacagggaac atgctgcggg acagttacaa 6300 acttgcacag catatcgtgg atgaattctg gattagatgg acgcgcgagt atcttccgat 6360 gctcacgagg cgagccaagt ggttcgaccc cgcgaagccg ctggaacctg gcgatgtagt 6420 cgttgtagtt gacgacatac caaggaaccg ctgggagaga ggacgaatat tggaaacata 6480 tcctgacaaa tccggacaag taaggcgagc gaaggtacaa actgctcggg gagttttcat 6540 gaggcctgct gtgaagttgg ctctactaga tgttacgggc cccaagatgt ctgctgacgc 6600 tagtccggaa acggaagtga ttcacggggt ggggaa 6636 // ID EnSpm-N1_BF repbase; DNA; INV; 2810 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-N1_BF non-autonomous DNA transposon - consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N1_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2810 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2810 RA Kapitonov V. and Jurka J.; RT "EnSpm-N1_BF - a family of non-autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 790-790 (2008). XX DR [2] (Consensus) XX SQ Sequence 2810 BP; 825 A; 593 C; 631 G; 761 T; 0 other; cactgcacga aatcaataat ttccaggtgt ttcaagggac ttacacacct gtgaaatgtt 60 ctgtgtcctg gctgtgtaca tgactttcct ggttcggcgt gtaatgtcct tgtttttttc 120 ttacatacct ggactcgcgc ttgtacagga atttctagga cccttacacc agacgtgtag 180 gggaaatagg cagacaaagg cttggcggcg cctattctca aagccaagct tgggtcgcgg 240 gttattactg tttacggcaa ctactacccc tgcaacccag cctaggctat gcttggacaa 300 ttaataagca atgaggttta cattttaaac acaaaataag ggggagtgtc ggggggtatt 360 gaatggttgg tatgtctacc cgtgacatcg tctgacagaa caaagtggaa caaaatacaa 420 ctataaaaag aaacgacatc cacattttta ggaatacaat caaggtcaga aagaaaggtc 480 tacaaaatac taaaaaaaac aatcataatt aatgaagctg tgatttgttt ggcccgatat 540 ttccaactta tttggaaatc ctacggtcgg cgcgtattga acgccaaagg cagtatttac 600 tctgggggcc cacacgggcc attcatattc atttctacat gtcggaaaga aataaatgct 660 agggatagat cttggcttat aaataatagg aaagtatgtc ttaaaccaag gaggtttcaa 720 cttccttgct taaacaaact cgtcattttc attacataaa cagcagttcg ttagccggat 780 gtgtattgca ttgtctagcg cagacaaagg ccacaaacac ccagcgttca ttgaatcatt 840 cttattttct tctttaatca ttttgcactc acactgagat tctgtttcaa catacataaa 900 tctgaactgc tgcaatagac ttctgcgtgt atcgcgataa tcagttacgt gatctgcccc 960 ctttgtatga agagctagga aaataaacta accaaagaag tttaatttgg aaattcgtaa 1020 actagtgatg tacgatgttt ttatagctgc aaatacagag tgcctgtcgc gtcatagcga 1080 ggaagttgaa ctgaaacgac gttggacata gggaaaacat aggactgcga tctgatttat 1140 gggcagcttt ttgtcaaaag aatagagaga acacgggaca aaatacagtc cattttcttg 1200 gttactgtct ctaactttgg ctactgtcga cgcacaaact gagtgaaatg atctatgtta 1260 agatataacc tccttgactc gatgtactaa atttttgtcc cgggttatgt tcttgattgt 1320 acaaactaaa tgacaacgtt gaacgttaaa tttctaccga tgtttcaagc tgttgataat 1380 cacagcggca tgtttcacat ctttctaggc aggtattgtt ttactgtgaa gtgaaatacg 1440 ggtaatcagg cagctcaaac aacctccttg acgtgattgt aacaatagac atcatggctg 1500 gtactagcgg tagttcacag ctctttgtgt ctgtgcccat gcaaattctt cagaaaaagc 1560 caatagaatt tctcctcgca caattatagg gtcaaacctt cggccaatca acgtgccagt 1620 cacaattttc aaattcttca gttgagttgg tattgtgaat tccgcgtctt tgcgtgtgga 1680 tgtgcgttct tgctgtccaa gtgtgtctat ataccagggt gcgttatctg atggtaccga 1740 ggaacattgg tcattatttt acatcagcaa ggagtttcta ctcaagatcg cggaacagga 1800 cgacaggacg tcagctgcaa ctaataataa caggaccacc gtcgggccaa agttctctcc 1860 catggcttca aggatcaagg aacttccgta tacaactgta gcgatgatat tgcagtcggc 1920 atttgacttt gctttcgacg gaaatagtgc ggaggaatat cgaaacggct ctcatgtttg 1980 ggtagcgaaa ccaccctcat ctaggaacga caactgtatc acctagactc ccttgcagtc 2040 ttcggaacgg ggctgttttt tttggggggg ggggggggca aactcccttt cccggaagaa 2100 catagtcaac gttgaaagtc gcggaccaac gccccccccc ccccaaaaaa aaacccagcc 2160 ccgttccaaa gactgcaagg gagtctatgt actaccctgc atgccgtcaa gaagtgcata 2220 ctagtagcct acgaagacta gaagacaaca agagaccagc cggcattcta tgacatacaa 2280 aaggcactcg cgacttcaac ctatcataca cacgaacaaa cactaataaa aattgtcttg 2340 tttcaatatg aataaatata tggtgctatt acagaataag aaatgcgagt aagaaactgc 2400 cgaataaatg gggagggggg caaaatccag aagggggggg gggtggaggg cgtcagatct 2460 ctcgacgcca ctttccatga caaaagggat gcactgggtg aggcagaccg gggaaatata 2520 attaggtggt aattggccgc taatgtatgc atattccgcc aacacacaat gcaacacacg 2580 gctcatgtcc tgttgtgtaa ggttgtttca aggcaatctg taagggtatt tcaagccctt 2640 tacacgtcct gcacacagaa atggttcgtt tccgcgagat tctaggacat gtaaaggact 2700 gtacacgtct cgtttgcagg catgaaacct ccggattttg aaggacatgt ttaatttcca 2760 ggactgtgta agtcagtttc catcctggaa atgaggcttt tcaggcagtg 2810 // ID Crack-8_AAe repbase; DNA; INV; 5385 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5385 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1224-1224 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 22 sequences with >97% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 991..2061 FT /product="Crack-8_AAe_1p" FT /translation="MTDDLDNDMYCVECKKKEDDINKLATCLYCFSSAHYK FT CRNIVGFAVRKIKENLYFCSPKCSEIYKKIMEMQNERTTMINELKFELKKT FT VTNVVSAQLQDVKVEVNTIVKAIEESQEFLSSKFDGIVSDVQHLKKDNDRL FT KSEVADLQKSHTSLTTLVHKLEINCDKTNKESLSNNAVILGIPMHANECVP FT ELVSKVAECIGADFESDSIISASRVSVSSSALNKLVPIRIVFKDNAVKESF FT FSKKREFGKLLSSSIDQSMVMNGKPTNIAIRDELTPLSLQMLNELRSLQNK FT LNLKYVWAGRDGIILVKKDEQSKPYVIRDRNDLACFVNESASNSSSHVTPS FT PKRKRSERQDKHFM" FT CDS 2072..5011 FT /product="Crack-8_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MCSFLCISFYYFLFKMALVDNFSHNNIDEFNQSIHTV FT ESNTNYLRIFQWNIRGMNDLAKFDTILETLDQCNVPIDVIVIGETWVKEEH FT KQLYYIPGFNSVFSCRDNSNGGLAMFIRTDITFTPVRNVHSDGFHHISVEL FT AINGRMYDVHGFYRPPSFDVNVFMNKLEDCLDNSNHNRSCFFVGDVNIPIN FT AKNNNVVVKYMSLLESYGFACTNTMPTRPISMNILDHVVCRLNDIQNSRND FT TISTCESDHSMIIASFKLADKREKITISKTIVDHRKIAVEFNNFLGSVCEV FT TDVNLCFENIVSTYKNLLVKYSKTITKNVSTKNSFCPWMNFNVWSLIKLKN FT NYIKRVKRNPQDIHLKDLLKHVSNKVIIAKKQCKKLYFDNILNNTPHSKLW FT KHINNIFGKDAKTDKITLIEDGAKITDDQQISEIFNNYFSSIGQKLADSIQ FT VDAGVNPLSNVRRVSSSIFLNPATANEVTMIINDLERKKSPGPDKINVDLI FT KSNSDHFSRILSDCFNRIIETGSYPNCLKIARVIPILKSGNVCDTSNYRPI FT STLSIFNKILEKLLINRFIPFLKQHNVLYSFQYGFRQGSSTATATVELLDD FT IIKGIDEKQIVGALFLDLRKAFDTLNHAILLDKLEAYGIRGIANEIVRSYL FT QDRKQFVTIGDSQSSLKSLGVGVPQGSNIGPLLFLLYVNDLNRLQLKGTPR FT LFADDTALFYPHLDADAIVEHMTEDLSILSQYFSSNLLSLNVMKTKYMMFH FT SSRKIFQQRNQVLLNSSCIEQVSSFKYLGLMLDSTLSWANHIKNVEKKVSS FT LCGIMRRVSHFVSRRILLNFYFAHIHSRLSYLIIAWGRASKSSLKKLQTLQ FT NRCLKTIFSKPFLFPTLQLYSDVSHNILPIHSLCKIQTLIFVHDTLHNNLF FT HHNIQLPSISHGYRTRRTHNLHQNRALTMFGQKRISIIGPXLYNQLPDYLK FT QISDRNSFKQKLKQYLKLNLHDILG" XX SQ Sequence 5385 BP; 1696 A; 933 C; 985 G; 1770 T; 1 other; ctggcagcac tgtttctggt agctggttct attgtgctat attgtgaatt tcatttcatt 60 aattagtagt gataaattcg attttaattt gttatttttg tcaacaagta gatattagtg 120 ccggttcttt tataccatat tcgagtaata cgagttttgt gctcacatga acgtgaaaaa 180 ttgaattttt aatccgttgt gatcatcagt tgtctttgtt ctcttcaatt ggctatttat 240 attgatgaac gtatttcgtg ttggtactgt gctggtaaac tgtgatctga gtcactcact 300 atttgtctaa gtggcagtgc ttagttccgt gagacgaaag tgcaagagtt cttgtttgat 360 ttgtttaggt ggatctagta cgccctccag tagcacacac tcgtggtttc gcttgtttat 420 ggcaagggcc attgctgagt gcatctatat ccatagcgtg taggtaagca agttggtcgc 480 gatctagtac tgcagaattt ggcgaagttg gagaaatcca tcagtagcac actgtatagt 540 ggtttcgctt gtttacggca agggccatcg tcgcagttta ctcctatttt caaatagcga 600 gtaggtatgc atgtggttgc ggtataggat tgcagaattc ggtgaagttg ggaagaacga 660 tttgatgaat gaacgactcc tcttgcgcaa gtagatttat gtgaaccttg ccactctggg 720 cgaatctttg agagtcatac atcaccgctt atcgtatgct ttgcttatat ttattatcca 780 tgcgaatagt tctcattttt cggttgtatt gaagggtctc agtaattgat tttaatctgt 840 tttaacgtgt gcacgcttat atttactgat atgtgtattt tttacctgta tataatttct 900 gctgtgttgt agtttaatta aattcgttta tttattttta ttttcgttat ttttatttat 960 tttttcttgt gtcaccagtg tagttcaaga atgacggatg atctagacaa cgatatgtat 1020 tgcgttgaat gcaaaaagaa ggaagatgac attaacaaac ttgcaacatg cctgtattgc 1080 ttttccagcg cgcattataa atgccgcaac atagttggtt ttgctgttcg taagatcaaa 1140 gaaaatttgt acttttgctc gccgaaatgt tctgaaatct acaaaaaaat catggaaatg 1200 caaaatgaac gcacaacaat gattaatgag ctgaaatttg aattaaaaaa aacagtcacc 1260 aacgttgtat ccgctcagct ccaagatgta aaagttgaag ttaatacgat tgtcaaagcc 1320 attgaagaat cccaagaatt tctatcctcg aaattcgatg gtattgtgtc cgatgttcag 1380 cacttgaaaa aagataatga tcgtttgaag tctgaggttg cagatttaca gaagtcgcat 1440 acttcattga cgacccttgt tcataagttg gaaatcaatt gcgataagac caacaaggag 1500 tccttatcga acaacgcggt tattcttggc attccaatgc atgccaatga atgtgtacct 1560 gaattggtct ccaaagtagc cgaatgcatc ggagctgatt ttgaatcaga ttccataata 1620 tctgcttctc gagtttctgt ctctagttct gctctaaaca aacttgtgcc tattcgaatc 1680 gtttttaaag ataatgctgt gaaggaatcg ttcttttcaa aaaagagaga gtttggaaag 1740 cttttgtcat cttccattga ccagtcaatg gtaatgaacg gaaagccaac gaatattgct 1800 attcgggatg aattgacacc actttcgcta caaatgctta atgaattgag gagtttgcaa 1860 aataaactaa atttaaaata cgtctgggct ggaagggacg gaataattct ggttaaaaaa 1920 gatgagcagt caaaaccata cgttattagg gataggaatg acctggcttg tttcgtgaat 1980 gagagcgcat ccaattcatc ttcgcacgta actccatctc ctaagcgcaa aagaagcgag 2040 agacaggata aacattttat gtgagttcta tatgtgtagc tttttatgta tttcttttta 2100 ttacttctta tttaaaatgg ctctcgtaga taacttttct cacaataata tagatgaatt 2160 caatcaatct attcataccg tggagtctaa taccaattac ttacgtattt tccagtggaa 2220 cattagaggc atgaatgacc ttgcgaagtt cgatacaatt ctagaaacgt tagaccaatg 2280 caatgtgcca atcgatgtta tcgttatagg agaaacatgg gtaaaggagg aacacaaaca 2340 attgtattat attcctggct ttaattctgt attttcatgc agagataact ccaatggtgg 2400 attggctatg ttcatcagga ccgatatcac cttcactcca gtgcgtaacg ttcattcaga 2460 tggatttcac catatctctg tcgagcttgc aatcaatggc cgtatgtatg atgtccatgg 2520 tttctatcgt cctccgtcat ttgatgtcaa tgtttttatg aataaattag aagattgtct 2580 tgataattca aatcataacc gctcttgttt ctttgttggt gacgttaata tacctattaa 2640 tgcaaaaaat aataacgtgg ttgtcaaata catgtccttg ctcgagtcgt atggatttgc 2700 ttgcacaaat acaatgccta ccagaccgat cagcatgaac attcttgatc atgtagtttg 2760 tagattaaat gacattcaga actcaaggaa tgacacaatc tcgacctgtg aaagtgatca 2820 ttctatgatc atcgcttcct ttaaattagc tgacaaaaga gaaaaaataa ctatttcaaa 2880 aacaattgta gaccatcgaa aaattgctgt cgaatttaat aattttctgg gtagtgtttg 2940 tgaagtaaca gatgtcaacc tttgttttga aaatatcgta tctacttata agaatttact 3000 tgtgaaatat tccaaaacta ttaccaagaa tgtaagcact aaaaacagtt tttgtccttg 3060 gatgaatttc aatgtgtggt cgttaataaa gcttaaaaat aactacatca aaagagttaa 3120 aagaaaccca caagacatac atctaaaaga tctcctaaaa cacgtctcga acaaagttat 3180 catagcgaag aagcaatgca agaaattgta tttcgacaac attttaaaca atacaccaca 3240 ctcgaagttg tggaagcaca taaacaacat ttttgggaaa gatgctaaaa ctgataaaat 3300 taccctaatt gaagatggtg caaaaatcac tgacgatcaa caaattagtg aaatattcaa 3360 caattatttt tcgagcatag gccaaaaatt agccgatagc attcaggtgg atgctggagt 3420 gaacccactt tcaaatgtaa gacgtgtgtc tagttctata tttttgaatc cagctacagc 3480 taatgaagtg acaatgatta ttaatgattt ggagcgtaaa aaaagtccgg ggcccgataa 3540 aatcaatgtt gatctcatca aaagcaattc tgatcacttt tccagaatac tatctgattg 3600 tttcaataga attattgaaa ctggctcata tcccaactgt ttaaaaattg ctagagtcat 3660 tcctatttta aaatctggta atgtttgtga tacaagcaac taccgaccga tctcaacact 3720 atccatattt aataaaatat tagaaaaatt gcttattaac aggtttattc cttttcttaa 3780 acagcacaac gtgttgtaca gttttcaata tggttttcgt caaggcagca gtacagctac 3840 ggcgactgtt gaattgcttg acgatataat aaaggggatt gatgaaaagc aaatcgtcgg 3900 tgcactcttc ttagatttac gcaaggcttt tgacacattg aaccatgcaa tactcttaga 3960 taaattggag gcctacggaa ttagaggtat agcaaatgaa attgtgcgca gctatttgca 4020 agatagaaaa cagttcgtca caattggtga ctcacagagt tcgttaaagt ccttaggggt 4080 aggtgtacct caaggtagca atattggtcc attattgttt ctgttgtacg tgaacgacct 4140 aaataggttg caattaaagg gaacaccacg tttatttgct gatgacacag ctctgttcta 4200 tccgcactta gatgcagatg caatagtgga gcatatgacc gaagatttga gtattctgtc 4260 gcaatatttt tcttctaatt tactttcttt aaatgtaatg aaaaccaaat atatgatgtt 4320 tcactcatct agaaagattt ttcaacaacg aaaccaagtc ttactcaact cttcctgtat 4380 tgaacaagta tcatcattta aatacttagg tttgatgctg gattctaccc taagctgggc 4440 aaatcacatt aaaaatgtag aaaaaaaggt atcatcgcta tgtggaatta tgcgaagagt 4500 aagtcatttt gtatcgcgca gaatactttt aaacttctat tttgctcata ttcactcacg 4560 cctgagctat ctaattatag catggggtcg tgcctcgaag tcttctttga aaaagctcca 4620 aactctccag aatagatgtt tgaaaacgat atttagcaaa ccgtttttat ttcccactct 4680 tcaactttac tctgatgtct ctcacaatat ccttccaatt cacagtctct gtaaaattca 4740 gactctaata tttgttcatg acactctgca caataatctt tttcatcata atattcaact 4800 accttcaata tcccatggct atcgtacaag acgtacacat aatttgcatc aaaatagagc 4860 acttacaatg tttggtcaaa aaagaatttc aatcattggt cctaakctgt acaatcaatt 4920 accggattac ttgaaacaaa taagcgatcg aaatagtttc aaacaaaagc taaagcaata 4980 cttaaaacta aatcttcacg acattcttgg ataaaaaata atattcgtat tttgagcaat 5040 atgtcaacta ccagaaacca atcatagatt aattaaattc tttttttctt aaatatcttt 5100 ctttttttgt ttaaagttaa ttagtaatta tatcataagt gggtccctta aaaggaacat 5160 ttgttccact gggacgtcac gccatgtaat tctatttaaa ttaattaaaa tctagtttaa 5220 cgttaatttc aatctcagct catgttttgc ttgttttttc attagttttt tattgttcac 5280 gtgctgcaca gagtgagctg agatagttgc gtccactacc aggaagctac acgtgagctt 5340 tttggtgtgg gggaaagtgg cgggtaaaaa aaaaaaaaaa agaaa 5385 // ID BEL-194_AA-I repbase; DNA; INV; 6890 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-194_AA_; KW BEL-194_AA-LTR; BEL-194_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6890 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 879-879 (2011). XX DR [2] (Consensus) XX CC Positions [5953-6501] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 1336..5370 FT /product="BEL-194_AA-I_1p" FT /translation="MQSTSSLPFSTAVHRLPQIPENSSTDPTSSGAARVHV FT PAPPPSSITQRPPVVAHPFVSHHQHDNMVQSLFSVPIPSVVQGASTLPADP FT VAYRQPTYPMFPPMYQSATAAASSPSFGLVSAAPPAISSSSTSARSLVPPP FT NPSAPHAVPSREVLFPQPPGKPDNYPLQTNTPPNSNGDNLQHLMRQFGSIA FT LSPSQSPLLNTNLATFAPSPSQLAARQVMPRDLPPFSGNPADWPVFISSFM FT NTTLACGYSSAENLCRLQRSLKGAAYEAVQSRLLLPESVPHIMETLHMLYG FT RPELLISALLDKVRTAPSPKIEKFQTIIDFGMSIQSLCDHLEVAGQTAHLS FT NPSLLAELVAKLPPHLQMEWGSYLQGFSEVNLKTFGLFMSSVVKAVSKVTV FT FSSSSGRTNAVDKINSKSRGSINAHHSETNTNDEFNTPTNREEAKECPVCK FT NAGHRIQDCRTFKSLSVDGRWKCVQSNGLCRNCLNSHGRRSCRKISSCGTN FT GCEFRHHPLLHSTRSSTDTQMQPLVNTAENHTHRQVEQSLFFRVLPVTLHG FT PKGTVETFAFLDDGSSLTLIEDSLVKELGVEGVTMPLCLLWTANMTRTEKG FT SQQISLAVSSAGGKKYSLQEVQTVRELSLPVQTLSYEHLSEEFKHLRGLPV FT RSYTKAIPRLLIGINNLDLMVPMKVREGLRREPVAAKTRLGWCIYGGKSNG FT SQQPSVNYHACGCSSDQTLHDLVKQFFATEEIAVQSTTPLVSEAEKRAQQI FT LEDTTFRVGNRFETGLLWKFDHVELPDNYNMALRRLECLEKRMNRNPELRQ FT TLQRQLEEYQLKGYAHRATEDELASADMRRVWYLPLGAVVNPRKPGKVRMI FT WDARAAVNGISLNSVLLKGPDQLTSLPGVLMRFRQFKVAVSSDIREMFHQI FT YIRKEDRHSQRFLWRDTPTDKPEVYIMDVATFGSTCSPASAQYIKNKNAKE FT FQEMYPRAAEGIVKNHYVDDSLESYESEEEAIKTSQEMRFIHEQGGFELRN FT WLSNSRTTLSALGETDSREDKRFAADKQCEYERVLGLLWLTEEDAFGFSTE FT MKPEISEVAKKDECPTKRQMLKCLMSLFDPLGLLSIFVVHGKILLQEVWRS FT GAQWDEKVNDELHIRWKNWTRLFEAVRILEIPRCYFPGATKERYRELQLHI FT FVDASESAYCAVAYFRTLNANKSPECALVAAKTKVAPLKAQSIPRLELLAA FT VLGAHLSQFVEGNHALRITRKVFWSDSATVLSWLRADHRRYKQFVACRIGE FT LLTVTDVKNWRWVPSKQNPADIATKWGSGPDLSANSVWFNGPHFLRLSEAE FT WPTQRMYTPPTEEELRPCYAHKAIQIPEELVDV" FT CDS 5923..6888 FT /product="BEL-194_AA-I_2p" FT /translation="MAPLPLARMASFARPFTYTGLDFFGPLTVKIGRSTAK FT RWVAVFTCLTIRAVHVEVAHNLSTESCIKCIRRFICRRGSPAEFYTDNGTN FT FQGAERLLKEQIEQLAVTFTGTTTKWMFIPPGTPHMGGAWERMVRSIKTAV FT EAAYNNNRKLDDEALSTFMVEAESIVNSRPLTYLPLTSEESEAITPNHFLL FT GSSSGVRQPIVEVTEPAEALRTSWNQVQHQLDVFWRRWIREYLPTLTKRPK FT WCGEEKPIAEGQLVMVVGEGRRNEWTRGRIVETIKGADGRIRQAIIQTARG FT LARRPVARLAVLEVDEGVKTGPGGQCYGGE" XX SQ Sequence 6890 BP; 1919 A; 1692 C; 1692 G; 1559 T; 28 other; aattgtaagt aacttaaact tttttttttt tttttttgtt gggatttgaa ccactgcgac 60 cattaattga tctattgtta gacataacct aatttgaaag tgatcatgca ggaatcacac 120 aagacataaa gagggagaat tgtccttaaa tctacacatg caaaccgatt tatgtaagta 180 aaacttcaaa caaaacaaag atactaaagc taatcagaaa ataaaattta cagcaaaagc 240 tacgttacaa caaacaacaa acmcgttgta atttgctcga cgaaaatcga scttgtgaag 300 gacgttctta tcccctcaaa attttgggaa caatcttttg ataaatcgac agcagtttaa 360 ctcgtcgata tggaatcgat tcctccgcaa aatattccgg agatgatcag tgatcagaca 420 ccgctccatg cacacacgct cgacgaacaa cggtcagact gcgtacagtg cgatcgcctg 480 aacggcgagg agaacatggt gcagtgcgac agctgccaga cttggtggca cttttcgtgt 540 gccggcgtca cagattcgaw gagggatcgg tctggtcgtg ttmaaaatgc caggtkgatg 600 atcckggcag caccgcacgt asatccmgtg gattccaaaa tttcagccgt tcggggttcg 660 agattcgaga acatcagcgc gaataaattt gcagcttagt aaactacatt agcttcagga 720 ttttgaatag aagtttgtck agsagaagcg taagttgctc gastckcaac tggagctaga 780 wgaggaawac gagagtmctc ggaktaaacg awgtmgagwt agwcagcaaa cagccgcgcc 840 ggggcggggg ggcgaccaga aggggccccc ccggcgcagt tataccagcg ctgccgcccg 900 cttcaaacct tgccgattcg ataggctggt attgctggta atcacgctcc aactcaaacc 960 cagcaacctt ttaattggaa tcagccaaat cttctcaaac atcctgcktt gatagaccaa 1020 agatcgactc taagcatccg gtgctcgtcc tgcgataggt tcgacaacat tttcgaacca 1080 ggaaccgaag atgcaacmaa gcgtggcgtt ttcaataasg tggcagggaa cgtacccttg 1140 gcgcaatcaa ctccagcaca gttgcccact atagatcggc agtttcaacc agtagtcgga 1200 ttactacctc agccattccc accgggcaca acggctgtgc ctaactcaac atcggtaccc 1260 tacacggctt cagtaggtgt agagtacgct tcgacgaata ttcttacggc agcaccgtat 1320 ccgatttcga cagttatgca gtcaacaagc tcgctgccat tttccaccgc cgtgcaccgg 1380 ctaccacaaa taccagagaa ctcatctact gatccaacgt cgtcgggagc tgctcgggtc 1440 catgtaccgg ctccgcctcc gtcatcgatt acacagaggc caccagtggt tgcccatccg 1500 ttcgtttcgc atcatcaaca cgacaatatg gtgcaatctc ttttcagcgt acccatacca 1560 tcggtagttc aaggggcatc aacgctacca gcggatccgg tggcataccg ccaaccgacg 1620 tatccgatgt ttccaccgat gtaccaatcg gccacagcag ctgcttcttc gccatcattt 1680 gggcttgtgt cggcagcccc gcctgcgatc tcgtcttcgt caacgtctgc gagatcactc 1740 gtgccaccgc ccaacccctc agcaccgcat gcagtccctt caagagaagt attattccct 1800 cagccaccag gtaagccaga taattatcct ctacaaacaa atacaccacc gaacagtaac 1860 ggtgacaacc ttcaacattt gatgaggcaa tttggtagca ttgctttatc accctcgcaa 1920 tcaccactgc ttaatacaaa tttagccaca tttgccccct cgccatcgca gttagcggcc 1980 aggcaagtga tgccgcgcga tctgccccca ttttctggca atcctgccga ttggccggta 2040 ttcataagca gtttcatgaa cactacactg gcttgtgggt actctagtgc cgaaaatctt 2100 tgccgtctcc aacgaagcct gaaaggtgca gcgtacgagg ccgttcaaag tcgattactt 2160 ctgcctgaat cggtacctca tataatggaa actctgcaca tgctctatgg tcggccagag 2220 ttgctgattt cagcgctttt ggacaaggtg cgtacagctc cttcaccgaa aattgaaaaa 2280 tttcaaacaa taatcgattt tggaatgtcc attcaaagtc tttgcgacca cctagaagtt 2340 gcgggacaaa ccgcacatct gtcaaatcct tcccttctcg cggaacttgt cgccaaactc 2400 ccaccgcatt tacaaatgga atggggctcc tatctccagg gcttttcgga agtgaatctg 2460 aaaacattcg gtctcttcat gtccagtgtc gtgaaagcag tgagtaaagt gacagtgttc 2520 tcaagcagca gtggaagaac caacgccgtg gataaaatca actcgaagtc gagaggaagc 2580 atcaacgctc accacagtga aacgaacacc aacgatgagt tcaatacgcc taccaaccgg 2640 gaggaagcca aagaatgtcc ggtctgcaaa aacgccggtc atcgtatcca ggactgtcga 2700 accttcaaat ctctctcagt ggatggacgc tggaagtgtg ttcagtcgaa cgggctttgc 2760 cgtaattgtt taaactcaca cggcagaaga agctgtagga aaatcagcag ttgtgggaca 2820 aacggttgtg aatttcgcca tcatccgctt ctacattcga cgcgcagcag cacagatacg 2880 cagatgcagc ccttggtgaa cacggcagaa aaccacactc atcgccaggt ggaacagtcg 2940 ctgttcttca gagtactgcc agtgaccttg catggaccca aagggacagt ggagacattc 3000 gcttttctcg acgacggatc gtcgctaacg ctgattgagg acagcctagt caaagagctt 3060 ggagtcgaag gagtaacaat gccactgtgt ttactgtgga cggcgaacat gacacgcacc 3120 gagaaagggt cgcagcaaat atcgctggca gtttcatcag ccggaggaaa gaagtactca 3180 ctgcaagaag tacaaacagt tcgggaattg tcgctacccg tacaaacact ctcgtacgag 3240 catctctcag aggagttcaa acaccttagg gggctaccag ttagaagcta caccaaggca 3300 attccaaggt tgctgatagg aatcaacaat ttggatttga tggttccgat gaaggtgcgt 3360 gaaggcctcc gacgtgaacc tgtagctgct aaaaccagac tagggtggtg catctatggc 3420 ggaaaatcaa acggtagtca acagccatca gtgaattacc atgcttgtgg atgttcgtca 3480 gatcaaacat tacatgacct cgttaagcag ttctttgcta cggaggaaat cgccgtacaa 3540 tccaccactc cactcgtatc ggaagcagaa aagcgtgccc agcaaatact agaggatacg 3600 acgtttcgag tcggtaaccg cttcgaaacc ggtctacttt ggaagtttga tcacgtagag 3660 cttccagaca actataacat ggctcttaga cgacttgagt gcttggagaa acgtatgaac 3720 cggaacccag agctccgaca aactttgcag cgtcaattag aagaatacca gttaaaaggg 3780 tatgcgcatc gtgccacgga ggatgaacta gctagcgccg acatgcgccg cgtatggtat 3840 ttgccgctag gagcggtagt aaaccctcgg aaacctggca aggttcgcat gatttgggat 3900 gctagagcgg ccgtaaacgg gatttccctc aattcagtac tgctaaaggg tccggatcaa 3960 ctaacgtcac ttcctggtgt tctgatgcgc ttccgtcagt ttaaagtagc agtctcctcc 4020 gatataagag aaatgtttca ccaaatctac atacgcaagg aagatcgcca ttcacagcga 4080 ttcctttggc gcgacactcc taccgataaa cccgaggtat atattatgga tgttgccacm 4140 tttggctcca catgctcccc tgcgtcggcg caatacatca aaaacaagaa tgccaaggag 4200 tttcaagaga tgtatccccg agcagcagaa ggaatcgtga agaaccacta cgtggatgat 4260 tccctggaaa gctatgagtc tgaagaagaa gcaatcaaaa catcccagga aatgcgcttc 4320 atccatgagc aaggtggctt cgaattgagg aactggctat ctaatagtag aacgacccta 4380 agtgctctag gtgaaacaga ttctagagaa gacaaacggt tcgcagcaga caaacaatgc 4440 gagtatgaac gagttcttgg gctcctgtgg ttgacggaag aagatgcctt cggtttctcg 4500 actgagatga aacccgagat cagtgaagtg gctaaaaagg acgaatgccc wacgaaaagg 4560 caaatgctga agtgtctgat gtctctcttc gatccgttgg gcctgctgag catattcgtg 4620 gtacacggaa agattctcct gcaggaagtg tggagaagtg gtgcacaatg ggatgaaaaa 4680 gtcaacgacg agctccatat aagatggaaa aattggacac gactgtttga agctgttcgc 4740 atccttgaaa ttccccgctg ttactttcca ggcgccacaa aagagcggta ccgtgaacta 4800 caactacaca tatttgtaga cgccagcgag tccgcgtact gtgcggtcgc gtattttcga 4860 acgttgaacg ccaataaatc acctgaatgc gcacttgtgg cagccaaaac aaaagtggcc 4920 cctctgaagg ctcaatctat accgcgcttg gagttgctgg ccgcagtact tggcgctcac 4980 ttgtcgcagt tcgttgaagg aaaccacgct cttcgaatca caaggaaagt attctggagc 5040 gactctgcca cggtgctttc atggttgcgg gctgatcatc gacggtacaa acagttcgtc 5100 gcctgcagaa ttggggaatt gctgacggtt accgatgtca agaattggcg ttgggtaccc 5160 tccaaacaaa atccggcgga cattgcgacg aaatggggaa gcggtccgga cctatctgca 5220 aacagtgttt ggtttaatgg tcctcatttt ctgcggctgt ccgaagcaga atggcccaca 5280 cagcgaatgt atacgccacc aactgaagaa gaacttcgtc catgctatgc gcacaaggca 5340 atacaaatcc cggaggaact ggtagatgta wcccgctttt ctcggctaac aagagctgtt 5400 cgcactgccg cctgtgtgat tcgttttatt ggaaatatca aacaaaaggt tgctggacaa 5460 atacgaaatt caggcccctt gacttcagag gagctkcagg aggcagaaag tctgcttatt 5520 cgccaggcac agtggcaaag tttcccggat gagatgatgg ttctcgagag aaatcgttcg 5580 aagccgctga asgagcaaat accactggag aaaactagtg atatttttcc actgtgtcca 5640 atattagaca accaaggcat cattcgagtg gacggcagaa tcggagcggc tccaaatgtt 5700 gaaaacgaag caaagtttcc agttatcctc cccagaaaac acgtattgac tacgctactg 5760 ctgaactatt atcatcgtaa attcgttcac ggcaatacgg aaactgtagt gaatgagatt 5820 cgccagcgat actacgttcc acgactcaga acggcagtga gtagtgtagc cagagcttgt 5880 caatggtgcc gagtttacaa atgtttacca aaggttcccc gcatggcgcc gctaccgcta 5940 gctaggatgg cttcgtttgc acgaccgttt acttataccg gactggactt tttcggacct 6000 ttgacggtaa agataggcag aagtaccgcg aaacgctggg tagccgtatt cacgtgccta 6060 acaatacggg cagtgcacgt ggaagtggca cacaatctat ctacggagtc ctgcatcaaa 6120 tgcatccgtc ggtttatctg ccgtcgagga tcacctgctg agttctatac agataatgga 6180 acgaatttcc agggcgccga gagactactg aaggagcaga tagaacaatt agcagttaca 6240 ttcacaggaa ccactaccaa atggatgttc atccctcccg gtactccaca catgggcggg 6300 gcctgggaga ggatggtccg ctcaattaag acagccgtgg aggcggcata taacaacaac 6360 cgcaagctgg acgatgaagc gctgagcacc tttatggtgg aggccgaatc aattgttaac 6420 agccggcctc taacatacct accgttgacg tcagaagaga gcgaggctat aacaccaaac 6480 cacttccttc tgggtagctc aagtggagtc cgtcagccga tagtcgaagt aaccgaacca 6540 gcagaagctc tacgcacttc ctggaaccaa gtgcaacatc agctagatgt gttctggaga 6600 aggtggatta gggagtacct cccaacgctg actaagcgac caaagtggtg tggagaagag 6660 aaaccgatag ctgaaggaca gttagtgatg gtggttggcg aagggcgacg gaacgagtgg 6720 actagaggcc gaattgtcga aactatcaag ggggcagatg ggaggattcg acaagctata 6780 atacaaacgg cgagggggct tgcgcgtagg cctgtggcta ggcttgctgt cttagaggtt 6840 gatgaaggtg ttaaaactgg acctggtggc cagtgttacg ggggagagga 6890 // ID Sake2_BM repbase; DNA; INV; 3192 BP. XX AC . XX DT 30-APR-2010 (Rel. 15.07, Created) DT 30-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW Daphne; Non-LTR Retrotransposon; Transposable Element; KW AP-endonuclease; Sake2_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-3192 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1058-1058 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. XX FH Key Location/Qualifiers FT CDS 1526..2893 FT /product="Sake2_BM_1p" FT /translation="MKYSKIIPLFKSGSTFDPSNFRPISVLPTLSKIFEKI FT ILEQLLNHFNSNNLLHNKQYGFTRGRSTIDAGVDLIKNIFQAWEESHNALG FT VFCDLSKAFDCVEHNTLLRKLHHYGIRGVSLELIKSYLSGRIQKVDVKGKR FT SSGVLLNMGVPQGSILGPFLFLVYINDLPKFIETRHEVVLFADDTSLLFKI FT KRXLEDYDDVNDAISRVVHWFSVNNLLLNNKKTKCIKFTTPNVRQVDTNIT FT VSGETLELVDSTVFLGITVDSKLQWGPHICKLASRLSSAAFAVKKIRTYTN FT EDTARLVYFSYFHSVMSYGILLWGNAADVETIFILQKRAIRAIYNMHPRES FT LRDKFKEIKILTLASQYIFENLLYVRKNIEEFPRVCDLHNVNTRNKHRLAL FT PAARIKKISNSFRGLGVQLFNKIPXNVQLLPVHRFKKTVKERLCNKAYYKV FT KDYLLDGTTWE" XX SQ Sequence 3192 BP; 770 A; 800 C; 806 G; 796 T; 20 other; ttttttttcg ggccggggnc cgnacctcct acgaggcgnc cgggcntcgg gggcgcgcgg 60 ggtatgtgag actcaacgat ctgcaggtgt tgagagcaga ccgcgggccc aagganttta 120 gggcccaccc actaaangac tcccntgcac tcttacaccc ggcgtccgat cccctccggg 180 gtcagaaccc ggncggggta ggggggttnc cgcggtcaac actacaacca nacacgcggc 240 ccaccccgag gacgcccggc cgacgggccn tcgaggcgag tcganggccn gcggcgtcgg 300 ccgcctcggt acggcggccc gccaggcggc ccgggcggcg ccgctggtgt cccgggatac 360 cccgccgggc cggaaccggg ggtcgggacg cgatacgccg ccgaccgggt tgctccgttc 420 cgcggcctcc ttcggcgaga cggtgcactc gcagaagtcg agnatcgcct tccaggactc 480 atcgccgccg agcatcgcgg ccgcgacaag tcccgtccta tttttgcgac gaggacgcgg 540 cgccgcccca gccatgcggg gcagtacgcg agcgtatgct ccgccgtgtc cccgtcgcag 600 ccacggcggt ggcactccgc cgtcggctcc ctccgagcgg tgcaggtacc ggccgaagca 660 ccntgccggc gagcacctgc gtcaaccgga agccgagacg ccctcggtca cggttcgccc 720 attccgcgac gaccgggcgc accgcctcga cggtccgggc gcaccggccg acggtccgcc 780 gagcggcggg accgcgcctc cggcgcggac cgcgcctccg gcgctgaccg ccccttctcc 840 gggacgcggt ccgcccctgg cgcggaggtc acatcgccac gcgtagtcgg cggcgagcgc 900 ctccgcctcc aggtcccagg gaggcgtccc ggcgagnacg cacgccgcct cgaagtagac 960 ggtgcggtan ccacggatcg gtcctgaccg cgattgcgcg ctgcgggcgt cgcggcggcc 1020 gcgctgctcc cgcgagccag ggcgcggcac cacacgggcg ccccgtatag cgccatgaac 1080 cccaccaccc ccgcgtagag acgacgcact accgcgtcgg gccccccgac gttgggcggg 1140 agccggccga gccggccgtc cccatcagcc gagggaccag cttctcgaag cgagcgcgaa 1200 agttccaacg accgtccagt ncgaggccga ggtaccgcaa cccggctacc ccgacctcga 1260 cgcggacgcc tccaaccacc acgtgggccc cgggcggcgt cgacatgcca gcgctgcgaa 1320 acgcagggcg ggtttcccga actttaagtt cacaaatatt gatataaagg acataattga 1380 taatttcaag ctcattaata ttaagaaaac tggtgattta tggggaattt caattaaggt 1440 tataaaatcc attattgata taatcgcacc ttaccttgct acaatattta atgactgtat 1500 tgattgtggt gtgtttcctg atctaatgaa atatagtaaa ataatacctt tgtttaaatc 1560 tggtagtact tttgacccct ctaactttag gcccatttca gtactaccta cattgagtaa 1620 aatttttgaa aaaattattt tggaacaact tttgaaccat ttcaattcca ataacctgct 1680 tcataacaaa caatatggat tcactagagg tcgttcaact attgatgcag gtgtagatct 1740 tattaaaaat atttttcagg cttgggagga gtcgcataat gcccttgggg tattctgcga 1800 cttatctaaa gcttttgatt gtgttgagca taatacattg ttgaggaaat tacaccatta 1860 tggtattagg ggcgtttcat tagaacttat taaatcttat ttatccggaa gaatccaaaa 1920 ggtagacgtt aaaggaaaga gatcttccgg tgtcttactt aatatgggtg ttcctcaggg 1980 ttccatattg ggtcccttcc tatttcttgt gtatatcaat gatttaccaa agtttattga 2040 aacccgtcac gaggtcgtat tattcgctga cgatacatcc ttattgttta aaattaaacg 2100 acanttggaa gattatgacg atgtgaatga tgctatctcg cgcgtggtac attggtttag 2160 tgttaataat ctgttattaa ataacaaaaa aacaaaatgt ataaaattta ccacacctaa 2220 tgttagacaa gtagacacta atattactgt aagtggagag acactggagc ttgtggattc 2280 gacagttttt cttggtataa cagttgattc caagctgcag tggggtcctc acatatgcaa 2340 gttagcgagt agacttagtt ctgcagcgtt tgcggtaaaa aaaatacgaa cgtatactaa 2400 tgaagatacg gcacgtctgg tttatttcag ttattttcat agtgttatgt cctatggcat 2460 tttgctatgg ggcaatgctg ccgatgtcga aacgattttc atcctgcaga agagggctat 2520 tcgtgctatt tataatatgc acccgaggga atccttgagg gacaagttta aggaaatcaa 2580 aattctaact ttagcgtccc aatacatttt tgaaaattta ttgtacgtgc gtaaaaacat 2640 tgaagaattc cccagagtct gcgatttgca taatgtgaac actaggaaca aacataggct 2700 tgcgttgccg gcggctcgaa ttaagaaaat aagcaactct tttagggggc tgggtgtaca 2760 acttttcaac aagatcccac naaacgttca actactacct gttcatagat ttaagaaaac 2820 tgtcaaggaa cgtttgtgca acaaggcata ttataaagtt aaggattatt tactagatgg 2880 cactacgtgg gaatgaggcg ttcgctcctg gccttttcat ttttcattat tattattatt 2940 ttttaattgt attttttgac ttgttttttc gtttattgta aatattttta cttatttaaa 3000 aaaaaagaaa atatttgaga aaaaacaaaa aaaaaagccc gctgagtttg tttcgccggt 3060 tcttctcagg actgtggctt tttttggaac cggtggtaga gttaacattg ttatttatat 3120 tttgacattc aacaagtgtg ttatactgac atctaagttg aaataaatga ttttgaattt 3180 gaatttgaat tt 3192 // ID LanceleTn-2 repbase; DNA; INV; 201 BP. XX AC . XX DT 15-MAY-2006 (Rel. 11.05, Created) DT 15-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; LanceleTn-2. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-201 RA Osborne P.W., Luke G.N., Holland P.W.H. and Ferrier D.E.K.; RT "Identification and characterization of five novel miniature RT inverted-repeat transposable elements (MITEs) in amphioxus RT (Branchiostoma floridae)."; RL Int. J. Biol. Sci 2(2), 54-60 (2006). XX DR [1] (Consensus) XX SQ Sequence 201 BP; 61 A; 45 C; 44 G; 41 T; 10 other; ggctaaggtc acatttccam ammggggccc ggccgggcag ctttcrggaa cgataattat 60 gatgtaaaag acaacaavaa cacaaaacgg accgaaaata attcaacggc atgcaytgtg 120 catatttatt rgcataaacy taaatttttc gtttccrcaa acagcccggc cggaccccgg 180 kttggaaatg tgacgtaggc c 201 // ID Gypsy-17_AA-I repbase; DNA; INV; 4270 BP. XX AC supercont1.336; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_AA_; KW Gypsy-17_AA-LTR; Gypsy-17_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.336; Positions 894906 890637. XX CC 'ATATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 105..4112 FT /product="Gypsy-17_AA-I_1p" FT /translation="MTKTKSKSSESPVVRTPEFKALMPPSLPPPPPLQTAG FT SLHANWMRFKQAFVFWMKGAGHERHDDDVKIAILLSVVGPEGLEIFNILPL FT TAAEKEDFDDVMEAFDTYCGQKKNVIFERFVFTHRQQREGEKFDDFLRDIT FT TLVQTCEYGNLRDSLVRDQIVFGLHDRTFSDKLMGKEAKELTLEKVIEACK FT IEEQRREQLKVMSHGDPGPSGRVDQISRKGSAAGKGMSKSSSERQNRPKPN FT SSHKSTNQNSNTNTKKIVKSNTKANVSNCSYCGYDHVKGKCPAFGKTCSAC FT GRQNHFSTVCRNRSQTKHAGELSVCYMDVLESAIGSPWCEVIRIDNMDVSC FT KLDTGADVNVLPARVLSEIGVTELEDVSIVLQGFGTSSKIKPLGKKTLKVV FT SRGVLHDLDFVIVDLDVNPILGLKSCVDLGLINRVFDIRSNNTNQQFVEKY FT SDVFTGIGCFKEQHRLKIDPRATPSANPPRRVAFALLDKLKSELGRLCEDE FT IISPVNEPTEWISNLTIVEKPDKSLRLCLDPRELNKVLLKEPYLIPTIDDL FT RHKLANKKIFSVLDLKDGFFQIKLDKKSSFLCSFNTPFGVYRYNRLPFGLS FT ISPEVFQKFNEENFADIDGVFVYIDDLLIFADSVEEHDRIMEKVMERARER FT NIKFNPRKLQYRVEQVSYLGHEISRNGFRPDQGRLDDLQEIKAPSNKVELQ FT KILGMINYIRTFIPNLAEFSKPLRDLLKKNCIFTWTKAQEDCLEHIKTLIK FT NSPHLSAFNDKKPIKLQCDASQFGLGACLLQDGKPVAFSSRSLNDAEQRYA FT QIEKEMLSIVFACKKFHNYVYGRQFLVETDHKPLVAIMEKPLCGIPSARLQ FT RLRIKLLPYALTLKYLPGKYMYIADVLSRFSRRTIGPDDVEEDLTEYVHTI FT NISDNRKQQFVVATKNDPVLSDFLKIYHNGWPNDKKSLHENLRFLWKFKHD FT LHVEDDIVFMKDKIYVPTGLRRTVLDWLHVGHFGIEKSKARARELFFWPGL FT ATDVEDKVGGCKICLKFSKQNVKEPLMKHEIPELPFVKLGMDILDFEGKPY FT LVAVDYFSKWLEIIRLTSKTTSAIIDRLIDLFATHGHPRMIIADNMPFGSK FT EFRNFASEFEFNLITSSPHYPKSNGLAEKFVHIAKNILHKSKESNVDFRKA FT LLEYRNTPLKDLKISPAELLMSRKCNTYLPASNTLLVPKINNCVRELQDRH FT SSKQKQYYDRNSKCRGDLAVGDKVVYLDKNGKWLEAIVTSLADTPRSYWIR FT NSDGSVYRRNKHHLKKITDRHTHNPDLLSDRSKTDQNPLNNDKQSSSKVKT FT RFDGSHSGSDRELRRSERLKAKHQ" XX SQ Sequence 4270 BP; 1348 A; 794 C; 954 G; 1174 T; 0 other; tggcgctgta gcaaaaccgc cattgaaaag tgattgtgat cttcttagtg cgtgcatggt 60 gatcaccttc gtcaatataa ggtgaaaagt gatttctaag cagtatgact aaaactaaat 120 cgaaatcatc ggagtcacca gtagtgcgca ctccagaatt caaagcgtta atgccaccgt 180 cgctaccgcc gccgccgcca ttacaaacgg caggtagcct acatgctaac tggatgaggt 240 tcaagcaagc tttcgtgttc tggatgaagg gagccggcca tgaacgccac gatgacgacg 300 taaaaatcgc aatattactg tcggtcgttg gtccggaggg attggagatt ttcaacattc 360 taccgttaac agcagccgaa aaagaggact ttgatgatgt catggaggct ttcgatacct 420 attgcggcca gaagaagaat gtgatcttcg agcgatttgt cttcacacat cgtcagcaac 480 gtgagggcga gaaatttgac gattttctgc gggatattac gaccctagtg cagacgtgcg 540 aatatgggaa tttacgagat agtttggtac gagatcaaat tgtcttcgga ttgcacgacc 600 ggacgttttc cgacaagctt atggggaaag aggctaagga gctgacattg gagaaggtta 660 tcgaagcatg caaaattgag gaacaaagga gggagcaatt gaaggttatg agtcacggcg 720 atccgggacc gtcagggcgt gtggaccaga tcagcagaaa ggggagtgca gcgggcaaag 780 ggatgagtaa aagtagctct gagcggcaaa atcggccaaa accgaatagc tcacacaaat 840 ctactaacca aaattctaac actaacacga aaaagattgt caagtccaac actaaagcta 900 atgtctctaa ttgcagctat tgtggctacg atcacgtgaa aggcaaatgt ccagcatttg 960 ggaagacttg ctcggcctgt ggtcggcaga accattttag tactgtctgt cgaaatcgtt 1020 ctcagacgaa acatgctgga gaattgtctg tttgctatat ggatgtgcta gaatctgcca 1080 ttgggtctcc ctggtgtgag gtgattagga tcgacaatat ggatgtgtca tgcaagcttg 1140 acacgggcgc ggacgttaat gtgcttccgg cacgagtttt atcggagatt ggagtgaccg 1200 aactggaaga tgtctctatt gtgctccaag gattcgggac gtcgtctaaa atcaagccgc 1260 tcggtaagaa aacactcaaa gtggtgtccc gtggcgtttt gcatgatctg gattttgtga 1320 ttgtcgatct agatgtaaat cctatactgg gcttgaaaag ttgcgtagac ctcggactaa 1380 taaatcgtgt tttcgatatt cgatcgaata acactaatca gcaatttgtt gaaaagtact 1440 cagatgtttt tactggtatt gggtgcttca aagagcagca ccgtttgaaa atagatcccc 1500 gagccacccc ttccgcaaac ccaccccgtc gtgttgcttt tgctttgtta gataaactga 1560 aaagtgagct aggtcgttta tgcgaagatg agattatttc gccggtcaat gagcctacag 1620 aatggatatc taatctgaca attgttgaaa aaccggataa atcgttgagg ttatgcttgg 1680 atccacgtga attgaataaa gttcttttga aggaacctta tttgattcca accattgatg 1740 atttgcgaca taaattggcc aacaagaaaa tattttctgt gcttgatttg aaagatggat 1800 tcttccaaat caagttggat aaaaaatcaa gttttttgtg ctctttcaac acacctttcg 1860 gagtatatcg ctacaatcga ttgccattcg gattatcgat ttcccctgaa gtttttcaaa 1920 aattcaatga agagaatttc gcggatattg acggtgtgtt tgtctacata gacgatcttt 1980 tgatttttgc cgatagcgta gaagagcatg atcggattat ggagaaggtg atggaaagag 2040 cacgcgagag gaacatcaaa ttcaatccca ggaaactgca gtaccgcgtg gaacaagttt 2100 cctacttagg ccatgagatc tcgcgaaatg ggttcagacc agatcaaggt cgactcgatg 2160 atcttcagga aataaaagca ccatcaaaca aagttgaatt gcaaaagatt ttgggtatga 2220 tcaattacat tagaacgttc attccaaatt tggcggaatt ttcaaagcca ctaagagatt 2280 tgctcaagaa gaattgcatt ttcacttgga caaaggctca agaggattgt ctggagcata 2340 ttaaaacact cattaagaac tctcctcatc tgtcggcgtt taatgataaa aaaccaatca 2400 aactacaatg tgatgcatct caatttggat tgggggcatg tcttttacag gatggaaaac 2460 ctgtggcgtt ctcctctcgt agtttgaatg acgcggaaca acggtacgca caaattgaaa 2520 aagagatgct ttcaattgtg tttgcttgca aaaaatttca caactatgtt tatggtcgcc 2580 agtttttggt ggaaacggac cataaaccgt tagttgccat catggaaaaa ccgttgtgtg 2640 gaataccttc agctaggctg cagcggcttc gtatcaaatt acttccatac gcactaacgc 2700 tgaagtactt acctgggaaa tacatgtaca tcgccgatgt gctgtcacgt ttctctagaa 2760 gaacaatagg tccagacgat gtcgaagagg acctgacgga atacgtacac acaattaaca 2820 ttagtgacaa tcgcaaacaa caatttgttg tcgctacgaa aaatgatcct gttctgagtg 2880 actttttgaa aatttaccac aacggctggc cgaatgataa gaaatccctt cacgaaaatc 2940 tcagattcct ttggaaattt aagcatgact tacatgttga agacgatatt gtgtttatga 3000 aggacaaaat ttatgtaccg acaggccttc gtcggactgt cttggattgg ctccacgttg 3060 gccatttcgg aattgaaaag tccaaagcga gagccaggga attgtttttc tggccaggtt 3120 tggctaccga tgttgaagac aaagttggtg gatgtaaaat atgtcttaaa ttttctaagc 3180 aaaatgtgaa ggagccattg atgaaacatg aaataccaga attgccattc gttaaactgg 3240 gaatggatat tctagacttt gaaggaaaac cctatttggt cgcagtagat tatttttcta 3300 agtggctgga aataattagg ctgactagca aaacaacatc agcaataata gacagattaa 3360 ttgatctgtt tgctactcac ggacatccaa ggatgataat tgcagataat atgccatttg 3420 gttctaagga atttagaaac tttgctagcg aatttgaatt taatttaata acatctagtc 3480 cgcattaccc aaaatcaaat gggcttgccg aaaagtttgt ccacattgct aaaaatattc 3540 ttcacaaatc aaaagaatca aacgtcgatt ttcgaaaagc gttattagaa tatcgtaata 3600 cacctctaaa ggatttgaaa atttcccctg ctgagttact gatgagtaga aaatgcaata 3660 catacttgcc tgcctctaat acattgctag tacccaaaat caataactgt gttagagaac 3720 tgcaagatag gcactcttca aaacaaaagc aatactatga tagaaatagc aaatgtagag 3780 gtgatcttgc ggtgggagat aaggtggttt atttagataa aaacggaaaa tggctcgaag 3840 caatagttac gtcactggct gatacaccaa gatcctattg gatacgcaac tctgatggtt 3900 cagtgtatag gaggaataaa catcacctga aaaaaattac tgatcgtcac acgcataatc 3960 cagatctttt atcggatcgc tcaaaaactg atcaaaatcc tttgaataac gataagcagt 4020 cttcgagtaa agtcaaaaca agattcgatg gttctcactc tggaagtgat cgcgaactta 4080 ggagaagtga aaggttgaaa gctaaacacc agtgagatct aaaacagaat tctttttatt 4140 ttacccaaat tgaattggaa ataattactt taaaaattta catcttctct aaacaaaaat 4200 ttctagatta aaactgtaaa tagtaggtta atttaaataa taaataagta tcctttcttt 4260 gaggggaaga 4270 // ID RTE-4_PPac repbase; DNA; INV; 2907 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 2) XX DE A family of RTE non-LTR retrotransposons: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-4_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2907 RA Jurka J.; RT "RTE non-LTR retrotransposons from nematodes."; RL Repbase Reports 10(7), 1063-1063 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. CC This sequence was derived from sequence data generated by Genome CC Sequencing Center at Washington University School of Medicine in CC St. Louis. XX FH Key Location/Qualifiers FT CDS 121..2862 FT /product="RTE-4_PPac_1p" FT /translation="MSRHLYIGTINARTLASRDKQTELELALDRIKCDVLA FT VQEARIVGCASFNLTSSGTLVFHSGGPTATHGVAFLLRPHLAGGAVFRGLS FT PRLATLLLPNQRLFLVCAYAPTSSYDDKEYDDFMDQVEAALRSAPRGHTPV FT LVGDLNCRVAREPGNERFVGESASPTPNSRGRTFTEVCVRNRLRIWNTFPK FT RRHGRIWTWRSPNGSTYHQMDFIAAPPSARVVNCGVVGRFDFNSDHRLVRM FT CLSLPDKVKHKRCRERRDLDRSAFTVNANLLASVPLARPNTAADAYRTIRA FT FTETAATDCWRVRRTPPWISPATRNLLQSRSQLQSNPQAAVQYSIACKAAR FT SSLVTDIKNRKEAQARHAATMGRSVTRVMQNLQSSKKRLLVPDPATGELSQ FT EVTKAAVQRFYEDLYTPAVQLPLGVPTRVPDPFPPFLPDETRHAMSLLKCG FT HSPGSDGILPEMLFHSRDHLAPIIALLLNRLVAGDLVPSELTEAVVSLLHK FT KGDPTNIGNFRPISLLTVTLKVITRCILKRFEAMLEETESSTQTGFRRGRS FT TLDNLHSVKQVAEKASEYGIPVYLAFVDFRKAFDTVEWNACWQSLEKYGAH FT PILVTLLRRIYESSSTLIRVNEDLVRATVKRGVRQGDTLSPRLFNVVLRAA FT MDEIDWESDGIRIDGKNLCHLEYADDVTLIAKNRPELERMLKKLMEACSRV FT GLEINASKTHLLTSCTTTRSPILIDGMKFDFVSSATYLGGRISLPLDHSDE FT IEHRIRLGWFAWSRLSSLLTSRLLPMKTRTRLFESCVTSTVLYGSEVWALR FT ASDKERLSVTQRKMERKILGISLRDRWTNERVRDCTKLRDWIREGLKRKAR FT WALKIRQMDMEQWSRATTVWTPYNSKRPTGHPRTRWRDDLTRAIGAHWWNA FT TYEEFRPILI" XX SQ Sequence 2907 BP; 660 A; 922 C; 737 G; 588 T; 0 other; tcattccggg ttctcaagtt agtggcgagc ggttgggtgc gattttgtcc cgccgttttc 60 cctacgcccg gcgtccctct cccagaagcg ttggcccgcg ggagttgcat cgccaccacg 120 atgagtcgcc atctctacat cggcaccatc aacgccagga cgctcgcctc ccgggacaag 180 cagaccgaac tggagctcgc cctcgaccgt atcaagtgtg acgtgctagc agtgcaggaa 240 gccaggattg tgggttgtgc ctcgttcaat cttacatcct caggcactct agtcttccac 300 tcgggcgggc ccactgcgac ccatggcgtg gccttcctgc tccggccaca cctggcaggc 360 ggagctgtgt ttcgtggcct ctcccctcgt ctggcaaccc tcctcctccc caaccaacga 420 ctcttcctgg tctgcgcata cgctcccacg tcttcctacg atgacaagga gtacgacgac 480 ttcatggacc aggttgaggc tgcactgagg agcgccccta gaggccacac gccagttctg 540 gtcggggatc tcaactgtcg agtcgcgagg gaacctggca atgaaagatt tgtcggtgaa 600 tccgcttctc ccaccccgaa ctcacgtggg cggaccttca ctgaagtctg cgtgaggaac 660 agactgcgca tctggaacac gttccccaag agaagacacg ggagaatttg gacttggcgt 720 tccccaaacg gctctaccta ccaccagatg gacttcatcg ctgcccctcc atcagcgcga 780 gttgtcaact gtggtgtcgt gggccgattc gacttcaact cggaccatcg cctcgtccgg 840 atgtgtctct ctcttcccga caaggtgaag cacaaaagat gcagggagcg gagagatctc 900 gaccgatccg cttttactgt caacgctaat ctcctagcgt cagtgcccct tgctcgcccc 960 aacactgccg ccgacgccta ccgtaccatc cgggccttca ctgagaccgc ggcaacggac 1020 tgttggagag tgcgccgaac acccccctgg atctcccctg caacccggaa cctcctgcaa 1080 tcacgaagcc aattgcaatc caaccctcaa gcagccgttc aatactccat tgcctgcaaa 1140 gctgcccgat cgagtctcgt cactgacatc aagaaccgaa aggaagcaca agcacggcat 1200 gctgcaacga tgggaaggag cgtaacacgg gtgatgcaga atctacaatc ttcaaagaag 1260 cgccttctcg tccccgaccc tgccactggt gaactatcgc aagaagtcac taaggcggcg 1320 gttcagcgct tctacgagga cctctacacc ccggcagtgc agctccccct cggagtcccc 1380 acgagagttc ccgacccgtt tcctcccttc ctaccggacg agacaaggca cgcgatgtcc 1440 ctcctcaaat gcggacactc ccctggctcc gatggcatcc tccccgagat gctcttccac 1500 tctcgagacc acctcgcgcc catcatcgcc ctactcctca atcgcttggt agccggtgac 1560 ctggtgccta gtgagctgac ggaagccgta gtctccctac tacacaagaa aggagacccc 1620 accaacatcg ggaacttccg gcccataagc ttgcttaccg taaccctgaa agtcatcacc 1680 cgatgcattc tgaaaagatt cgaggcgatg ctagaagaga ctgagtcgtc cacccaaacc 1740 ggtttccggc gcgggcgaag caccctcgac aacctgcact ccgtcaagca ggtcgcagaa 1800 aaagcatcgg agtacggtat cccggtctac ctcgcctttg tggatttccg caaggcattc 1860 gacaccgtgg aatggaacgc atgctggcaa tcactcgaga agtatggagc ccatcccatc 1920 ctcgtcactc tactccgtcg catctacgag tcctcctcca ccctcatccg agtcaatgag 1980 gatctcgttc gcgctacagt caagagagga gtccgtcaag gggataccct ttctcctcgc 2040 ctcttcaacg tagttctgcg agcggcaatg gacgaaatag actgggaatc cgatggaatc 2100 cgcatcgatg ggaagaatct gtgtcatctg gagtacgcgg atgacgtcac actcatcgcc 2160 aagaaccgac cggaactgga aaggatgttg aagaaattga tggaggcctg cagccgagtg 2220 ggactggaga tcaatgcatc aaagacccat ctcctcacct cgtgcaccac taccagaagc 2280 cctatcctga tcgacggaat gaagttcgac ttcgtctcct cagcaaccta tctcggagga 2340 aggatttccc tccccctgga tcactcggac gagatcgaac accggattcg actgggatgg 2400 ttcgcatggt ctcgtctttc ctccctcctc acatcccgcc ttcttcccat gaagaccagg 2460 acgcggcttt ttgagagctg cgtgacttcc accgttctct acgggtcgga ggtctgggca 2520 ctgagagcta gcgacaagga acgactaagt gtgacccaac ggaagatgga gcggaagatt 2580 cttggaatct ctctaaggga ccgctggaca aacgagcgcg tgcgggattg tacgaagcta 2640 cgggactgga tccgggaagg actgaaacgc aaagcgcgct gggctctaaa gatcaggcaa 2700 atggacatgg agcaatggag ccgcgccacc actgtttgga ccccctacaa cagcaaacgc 2760 ccaaccggcc atcctcggac acgctggcgg gacgacctaa cgcgagctat tggcgcacat 2820 tggtggaatg caacttatga agaatttcga cctatcctca tctgagcgta aacgcctgac 2880 atgaataaat tattattatt attatta 2907 // ID DEC1 repbase; DNA; INV; 502 BP. XX AC AJ132474; XX DT 24-APR-2000 (Rel. 5.03, Created) DT 24-APR-2000 (Rel. 5.03, Last updated, Version 1) XX DE DEC1 is a putative nonautonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DEC1; DEC2; KW MITE; TIR; nonautonomous DNA transposon. XX OS Tenebrio molitor OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tenebrio. XX RN [1] RP 1-502 RA Braquart C., Royer V. and Bouhin H.; RT "DEC: a new miniature inverted-repeat transposable element from RT the genome of the beetle Tenebrio molitor."; RL Insect Mol. Biol 8(4), 571-574 (1999). XX DR GenBank; AJ132474; Positions 1 502. XX CC 28 bp terminal inverted repeats; about 87% identical to DEC2 in CC the CC first 428 bp. XX SQ Sequence 502 BP; 168 A; 65 C; 72 G; 197 T; 0 other; tacactgacc ggcacaaaaa acgactcatt atgatttctt aataaatgat cttgaaatta 60 ctttgtgctt atttttcttt attatagtta aattgggata ttttgaatga tcgggagtgc 120 catggtcacc atggcaaccg aatgttttgt ttaaataaat tataataata aatttgcgct 180 acttccacgt tgtgtttttg atggttgatt tacgtgttaa aagatctaat gtgacaatac 240 gagatactaa ctcaaaatgc tgttcaaatt ttggcaattt ttcataacat aaaatttttt 300 ctgaaaaatt acttttattt ttttttcact aaaattttat ttgctgtgtt taatgttttg 360 tttgtcggat atctttgaca aagaataact taaataacac ttatcaatac aacatgatat 420 caaaaaaaaa ttctaagaca atgaattgct taattatatt aataaaattc aaagtgagtc 480 gttttctgtg ccggtcagtg ta 502 // ID Copia-21_CQ-LTR repbase; DNA; INV; 143 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_CQ_; KW Copia-21_CQ-I; Copia-21_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-143 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 358-358 (2011). XX DR [2] (Consensus) XX SQ Sequence 143 BP; 32 A; 39 C; 29 G; 43 T; 0 other; tgttggaagt caaccctaag tgcggccctg gtgattagtt tagtttagct ttaggcgcag 60 taggctttgc cacgcgcatc ttttatataa acaccaccca ctctgtgtta gccctctttt 120 cccgaataaa cgccaactcg tca 143 // ID Polinton-9_NVi repbase; DNA; INV; 17121 BP. XX AC . XX DT 02-JUL-2009 (Rel. 14.07, Created) DT 02-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-9_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-17121 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1551-1551 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 5453..9370 FT /product="Polinton-9_NVi_2p" FT /translation="MISAEYYIFTIDNLRACSRLDANAEDIQRWMDIARAA FT IDSLNTLLEQPTNSVGKIRQLQNFYAHLQSVWRVLEGGLKRGGGLEQQPRV FT TWDDVHSAFQSRIRSGIITNVQHIDIKSFMKDAASLFKEQIELALAEHKSI FT KVNTELAAEYVITXTEEETTAIKYFNTKSQSIFQTTNLDTWFSSNVSQSIE FT RDMEEFQEEGSGWSLRSILHLTVNINKFNPLRGSSYIQLPEPIERKKACVN FT VQNDADDQCFKWAILSALHPAEKNSSRVGKYKAFENELDFTGIEFPVMPKQ FT ISKFERQNEISVNLYMLKKNGGGQFDVSPCHITASKREKHVNLLFVQDHYE FT DEDEQNDDDDDDENVPSLKFHYVWIKYLSRLVSSQLSSRKAKQFICDRCLH FT YFWTADKLAEHERDCTRLNSFKVKLPNETNNRLKFKNYRYKERSPFVVYAD FT FECLLKPVEEDKRAYQEHEAFSVGYYVKCSFDESKSTYKSYRKTAVGEQEA FT AKWFATSLHELGQELEQLYEHPVPMDEMSSEEKREYTWARVCHVCARPFSE FT QDKKVKDHCHFTGKYRGPAHTGCNINYKDSRTIPVIFHNLGGYDSHLFIRE FT ISNAFPGRIVLLPQTKERYISFSKYMGKSEVNFRFIDSIRFMASSLDKLAS FT YLQELKTVEAVFEKDDAYSAQQIDLLKRKGVFPYEYVSTLEKLDETCLPAQ FT SEFHSRLTDSDISSDDYEHAKAVWQAFDIRTLGEYSDLYLKTDVLLLTEIF FT ENFRDNCMKAYGLDAAHYYTTPGLSWDAMLKFTGVELELLTDIEMLLFIER FT GIRGGVSQCCNRYAEANNKYMAENYDPTKESKYLMYFDVNNLYGWAMTQSL FT PCSGFKWIEDIENASFYDVADDSPIGYILEVDLEYPEELHDSHKDLPMCPE FT HRVAPESKQEKLMTTLYDKERYVIHYRSLKQALKHGLRLKKIHRALSFNQK FT AWLKDYIDLNSEMRKRAENEFEKNLFKLMNNAVYGKTMENERKRVDVKLVN FT KWKGRYGAEALIAKPNFHSCSVFDENLVAIQLARTEISITKPIYVGLSVLD FT LSKSLVYRFHYDYMRSRFGDNCKLLYTDTDSLVYEIGGRDVYEIMKEDLHE FT FDTSDYPQDNPFEMPRANKKIVGLMKDECNGKIMTGFVGLRSKMYAVRIQG FT QQMSIKKAKGVKSAVVKKTLTYEDYVHCLLENATISREQCNIRSRLHVLRS FT EKESKVALSPNDDKRYLLEGENTDTLPWGHYKIPENARKRAASDEVEGASK FT RAASDEXEGASKRARVEGXSRVEGSDGANISRAVXGENSFASQH*" FT CDS 11195..10341 FT /product="Polinton-9_NVi_3p" FT /translation="MSKKAIVEELHRPARRTYTRRKFDVRGLDETWQADLV FT EMQPYSRENKGYRYMLTVIDTFSKFAWAIPVKRKTGEDVAAAMKSILLSEG FT RVPKNLHTDRGKEFYNSTFRNLMKQYEINLYSTYSNLKAAICERFNRTLKN FT AMWKQFSLRGSFKWYDVLSKLLTAYNSRKHRTIGMKPKDVTAANVELVMKK FT FAHEAAAAKQQKPKFKVNDKVRVSKAKNLFEKGYTANWSTEIFTISRVRRT FT KPVTYELRDFRDEPIAGCFYEQELSKTAHPDVYLIEKGLEETR*" FT CDS 14114..14800 FT /product="Polinton-9_NVi_7p" FT /translation="MDSSLTLSLSGTSSVLEAQYFPPIELSKQKIYVLGLV FT ELLTFNSIPNIDTDNNKFYVGGEEIILPTGSYEIEDIDVYLREALTPKGIS FT FSLKPNNNTLRSVIKCSHPIDFRPRDSIGSLLGFTERILPENVSHSSDRPV FT AILKVNALRVECNITSGAYINGQRVHTIHEFFPAVPPGYKIIEIPSQVIYL FT PITVRSIDQLQVRIVDQDNRLVNFRGEIITIRLHLKQI*" FT CDS 12790..12083 FT /product="Polinton-9_NVi_5p" FT /translation="MNFREQAVKLPVVNFDTITEHGERRVKRHGALLPDSI FT RAVICGPSNCGKTNSLLALITHPNGLRFENVYVYSKSLNQSKYKFLKELLE FT PIDGMRYFAFSEHEEVVPPDKALPNSIMIFDDVACEKQNNIRAFFCMGRHK FT SVDCFYLCQSYAQVPKHLVRDNVNLLVIFRQDDVNLRHLYADHVNCDMTFA FT QFKELCSSCWADDKHSFLMIDKDSPINEGRYRKGFDCFAINIGRC*" FT CDS join(12042..11488,11539..11198) FT /product="Polinton-9_NVi_4p" FT /translation="MSDSNSQQKHILEQLVKARSAVKRKFDLLKYSKENFE FT RALGETFKPIVDPLQQLVVNSSVAREQKLPAAAANKRIIKSDDDLYTDSTA FT KENKTSQANETIQAAAADTTFESANSDSDNDDGAKSLSLLLRRKNVDKVYG FT VRKDQDGGGGYKLGKSRIXYDGDIITVDDXTFPKTDRIVLWSRLQIXNFPE FT NRPNRALVSAADLKNYQEILELSSVHRRNFRPDEILRYHNSNKNTXTLSLL FT SFYPTTRRPRPQSRXRRRKVAVHCFRRTKSPGEIQKWITFTGTIQINRNIH FT SLNDEL*" FT CDS 13197..13754 FT /product="Polinton-9_NVi_6p" FT /translation="MKIKSKLGMGMKSRRRMATTKRRPSTRLNAIVKAASR FT AMIKGTNSHEVIASALKGARDAVKKAGGKDNVAIPRVLPLPHKVGGFLSLL FT IPIFAGLSAAGALAGGAAGIAKAVNDSKAAKEQLEETRRHNRAMENISVGK FT GLCLKASKTGLGLHLKPYRGSSFKKKLLRDKLAREGADRRGHREVR*" FT CDS join(3860..4432,4436..5263) FT /product="Polinton-9_NVi_1p" FT /translation="KSERQREKVKLFSTSRERTSSRYTQKMELIIDFQGFQ FT SFKGEFIAKEIGIMPSRFSDENAEPQSILLSPPICCTKHIAAEYLPTNQWL FT TNNFHGIGWDDGWVPFLRMEEELTDRVAYATKVYVKGAEKKGIMKLLCPNV FT NVVDLTDYGCPPLRELCSGHFVDCPDHKNCSNPVCAKLNVKVLAEWLEKYW FT MIYVFDIIKFEKKICRFHDKNISKNVRNNCLRARTFLAHLLLAIRRTSRAI FT ASHYSYSSQTSSMWAYSVFSLPQSVPDLCTPTAAVASSISEKLRNFYHYYE FT DNDVDYVLDLQSFKVGSHFIHKELAIASLEGDASPIDVYLFEPPFEWTSLS FT SEQRRENSWIERKLIGIPWSAGTFGYDDVVKILEKSLKNARNVYVKGFEKK FT RWLEERLAPSTAVRVIDMEVLGCPSLKHFPKTGDCENHPLLSSDPQCAGRN FT VHALRSWMQTYRGVFGEDVCY*" FT CDS join(14790..15551,15555..16205) FT /product="Polinton-9_NVi_8p" FT /translation="NRYNGYRLQQQQSRSWLYKQADETQRDQSSANRQSAN FT TTKRRLPEESQIQSTSSWKIIMQEILSIQAPVAFDESVAHYELHAHQPYTP FT SGYNNSDEIRIAIQHQDLYVLPSRSSLHIHGKLTKADGSAVERTRLVNNAI FT CHMFEEIRYEMNAVEIDRSKNVGLSTLMKGWISFNPNQSSMLENAGWLDVE FT ETFQVTNNAGYFDINIPLSMILGFAEDYRKIVVNVKHELILTRSRSDLNAV FT TQTAGRNAAGQDVEDINIGLLKIEWLMPYVVASNVQKIRLLQSVEKNKPIS FT MSFRSWELYEYPLLPVSTKHVWTVKTSNQLEKPRFVVLGLQTNRKSLRTAN FT ASRFDHCNLSNVKLFLNSQYYPYGNLNLDINRNQYSVLYDMYANFQNAYYD FT KNPKPLLKKADFITHIPLFVIDCSKQSETLKSAPVDVRLEFESRDNFLANT FT SAYCLILHDRIXQYSPVSGDVKKLV*" XX SQ Sequence 17121 BP; 4645 A; 4213 C; 4272 G; 3976 T; 15 other; agtaggtaag ctgtgcgggt accagctcat gcgggtccta gctcaccccc actctttgcg 60 gctaccagct cactcagtcc cctctccgag cggctaccag ctcacccagg gtaccccctc 120 agccacgctc gcgtaataaa gagcgcactc ttgcgtcaaa gtgtcatttc gtgtcgagca 180 gcatcatcat gagagtgggc gacaagagga gcgaaaacgc acgaggcagc tatacagcag 240 cagtcgaagt tagccagcaa acggctacga tagagaggcc gccccttaaa gagcctaagg 300 aagcgcgacg ctggtcgagg ccggaacaaa atattacaat cagtccttat aaggactaga 360 agtttataaa acgtagtaat gaaatacttg tgtgtgtgtg agagagagag agagagagag 420 agagagagag agagagagag agagagagag agagagagag aatgaaacag ttcttttcca 480 taaccgcggc gccgcgattt ttaccgcgag aatctttacc gagcaagagc gcgctaggcg 540 gcaaatacgc gcgcacgaac cgagcagcaa agattttacg gagagagaac gtttgagaat 600 ctttaccgcg cgcgctatgc tgaaagagcg cgctattggc aaagatgcgc acgcagtaaa 660 gatttcttac agcgagaaag ttctctctgc cgcgcgcgaa agagcgcgcg tacgcccact 720 cctcagcgtg cgcgctattg gttcggcaca gatgcgcgcg aaggcataag acagagcgcg 780 ctttgacgag cggtatcagt cattcatcgt tcgtcatggc agaagcagca gcagcaaagc 840 aacggcggcg gcagattccc ctatctcgcg agctgcgcag cgggactaga gtcgtatttt 900 cgacagcgga cgacagcagg agcaataggc tttctcaact acctcgcgcg tggagacaaa 960 gttccattta ctacggagta gcgagccgac gatgacagca gcggtgacct tcgtcctaag 1020 acggcacaac taaccaatca gtctttttga ggactacaaa aaattttaac gtagaaataa 1080 aagacttgtg tgagaatgaa agtgagaatg aaacagttct tttcagaact atagcgtagg 1140 atagaacgcg ccgcgatgag cagtcaacgg cagatgtccc ccactatctt gccagccgca 1200 acataaagag cgagtgcacg ggacaagagc cccaacagcg gcggcggcag attcttccac 1260 tctcttacga gtcgcgcagc ggggactaga gccacaaaga tgaagcgacg gcggcagcaa 1320 ccactcagtt ccctcactat ctcgcgagtc gcgctctctc aataaaacgg gagcgcgcgc 1380 ggactagaga tacattctta acggtggtgg cgagcagacg accgcggtag agagagagag 1440 agaatcatcg caggcggaga tgcgagcggc cggctcttac cagcgcaatg cgtctcatcg 1500 acccaaggtc ccatgaaatt tccatcagta cttttcagga ctagaaatat aattcaaacg 1560 tgaaacaaac tgtctttctc tctctctctc tctctctctc tctctctctc tctctctctc 1620 tctctctctc tctctcatca gctcgcacgc gaacgagagt tttaaagaag cggtagtgag 1680 cagtttactg cgactcgacg tctcataatt tttcaaacag tccttttcag gactataata 1740 tcatgtgtta tactatgctc tctcgagagg ttcaattctt ttcaaaaaaa cgatagcgcg 1800 atatatagcc gcagcgagaa tctttgccga gcactgaaag agcgcgcaaa gatgcacgcg 1860 ccgcagtaaa aatttcttac agwatgttcg agaatcttta ccgcgcgcta ttggcaaaga 1920 tgcgcgcgca gtaaagattt cttacagcga gaacgttctc tctgccgcac gcgaaagagc 1980 gcgcgtaccc ccactcctca gcgcgcgcgc tattggctcg gcacagatcc gcgcaaaggc 2040 ataagacaga gtgcgctttt acgagcgcta tcagtcattc attgttagtc atgacagaag 2100 catcagcagc agcagcagca gcagtagcaa gtcgcggcgg agatggcagt cagtagtccg 2160 tagtataacc ggcttcgaaa gaaccattcg ccgagacgcc gagaaatgca cataaggctc 2220 gacggcgaat gccgaaacct gctcctcgag tcgaagaaat tcactggacg ccttcagtcc 2280 atgaaattcg taacaggaca atcgagatag agcgtcagag agcacctcgc agccgcagca 2340 aggagcggca gcaagagttg cggaagctca cgcgcaagcc acgctacgtg agcccgaagc 2400 gaaagccatg agcgacgatg aatatgcaca atcggccctt tacagggcca ccaaaaattt 2460 tcaaaacgta gaaaacaaac tgtttccctt tctcctctcc ctctctctct ctctctagat 2520 agttaagaat ggctctagaa aaataaggct cattcgatcg tcaacgttcg agatgtgctg 2580 tccacgaggc ataagcgcag tcgaaagatc gacgagtgag atcaagcgag aatttcaaat 2640 gatctgcatc ttagcgatga tgcaagttca tttggaagcg ctcgctcaca ttgaacgaga 2700 cgacattcgt gatcgtgagg aaatgtttga ggtgttgtac cacattcaaa ctttctatct 2760 cgacatgaac gaggacgacg acgacgacga cagttagcag cagcagtaaa aaaaagcaaa 2820 tcggcccttt tcagggccac caaaaatatt aaacgtagaa tattctatta ttcccctcca 2880 ctcttctacg agcatacact tatattcagc gatgacagct gcatactgtc attcgctttt 2940 tgactgccaa catgtgctgt ccaagacgat tgagcatcag agagcgaaag gaacgtcaag 3000 agcttgctag tcttaaactc gcgatgattg tggaaattgt gagaatttac tcggaagctg 3060 ctgctctgct agctcgacat cgagcctcgc caaactcgag tcgagttaga acattctggg 3120 atctcatcga ccagagattt ttccagtagg aaacaatcgg cccttttcag ggccactaaa 3180 ttctcacaac gtagaatacg actatatgcc cccaccacca ccaacacctt aaagtataag 3240 tatctgcgcg agctcgagag agagctcatt cttcgcagac atgccgagac gacgacgacg 3300 agtcatcaga ccctgcgact ctccaagagc acgaccgcga caagagcaag attcgacgag 3360 acgacgactc ttcacgacat ctgatcggga ggcgtcagtg gagatcgaga ggcctcattc 3420 accgttccta gcggcttctt ggctcggtac gccaagagta gagtcaccgc caccgactga 3480 ctcatcggcg tcctcgtcgt catcgtcgtc gtcatcctct tcatagagtc tctataatat 3540 ttgaaaaaat cagtcctttt aaggactaca aatctatctt aacgtaagac agtgactatt 3600 tttttaacca tctgcgtgac cttgaaaatg cttgaagaag aatatgttgg tagtgcgagg 3660 ccgtggatac gaaatcccac gagacctaag aaacttccaa cgaattctct cagacctcct 3720 tctcgccaag taaaagagaa ggagtcgcgg aaggtgccgg tgctgccgct ctcccctccc 3780 tctcatggtc tgtggatcat agaataatta cccccaccaa tcgttggact gttcgccatt 3840 tttgtaattt aatgtataaa agtcggagcg gcagcgcgaa aaagtcaaac tattctcaac 3900 ttctcgagag agaacatcra gtagatacac gcagaaaatg gagcttatta ttgacttcca 3960 aggctttcag tctttcaaag gcgaatttat cgctaaagaa atcggcataa tgccaagtcg 4020 ctttagcgac gaaaatgccg agccgcaaag tattttgctg agtccgccga tatgctgcac 4080 gaagcacatt gcggcagagt atctgccgac aaaccagtgg ttgaccaaca acttccacgg 4140 catcggctgg gacgatggat gggtcccctt cttgcggatg gaagaggagc tcaccgatcg 4200 tgtggcatat gccaccaaag tatacgtcaa gggtgcggag aagaaaggga tcatgaagct 4260 gctatgcccg aacgtcaacg ttgtggacct gaccgactat ggctgtccac cactccgaga 4320 attatgttca ggccacttcg ttgactgccc ggaccacaag aattgctcga acccggtttg 4380 cgctaagttg aatgtaaaag tcctcgcaga gtggctcgaa aaatattgga tgtagatata 4440 tgtatttgat ataataaaat ttgaaaaaaa aatctgtcgt tttcacgaca aaaatattag 4500 taaaaacgta cggaataact gtttgcgcgc gcgcaccttt cttgcgcatc tccttctcgc 4560 tataagaagg acgtcgcgag cgatagctag tcattattcg tatagctctc aaacaagcag 4620 tatgtgggcc tacagcgtgt tctcgctacc tcagagcgtc cccgatctct gcacgccgac 4680 agcagccgtg gcgagcagca tctccgagaa gttgcgaaac ttctaccact actacgagga 4740 caacgacgtc gactacgttc tggatctgca gagtttcaag gtcggcagcc acttcatcca 4800 caaggagctc gctatagcca gtctcgaggg tgatgcatcg ccgatcgacg tatacctctt 4860 cgagccgccg ttcgagtgga ctagcttgag cagcgagcaa cgtcgtgaga acagctggat 4920 cgagaggaag ttgattggaa tcccttggtc cgctggaaca ttcggctacg acgacgtagt 4980 gaagattctc gagaaaagtc tcaagaatgc ccggaacgtc tacgtcaagg gattcgagaa 5040 gaagaggtgg ttggaggaga ggttagcgcc gtcgactgct gtacgagtca tcgatatgga 5100 ggtcttgggt tgtccatcgc tcaagcattt ccccaagaca ggcgactgcg agaatcatcc 5160 tctgctctcc tccgatcctc aatgtgctgg acgaaatgtt cacgcattga gatcgtggat 5220 gcagacgtat cgcggtgtct ttggagaaga tgtgtgttac taatatgtac tttcttttga 5280 aataaaatca taaaaactgt cgttttcacg gcaaaaaaaa ataacgtaga acaagcttat 5340 acttgtcacc tctccagcgc gtgcgcaggc tgacggcgtg tatttaagtc gtcgcgagcg 5400 ctagcgctca ttctgaagta ttcattcttg aagctatttc cgaagcgaca acatgatctc 5460 cgcggagtat tatattttca caatcgacaa tctgagagcg tgctctcgtc ttgacgcgaa 5520 cgccgaggac atccagcgtt ggatggatat tgcgcgagca gccatcgatt cgctgaacac 5580 tctgctagag cagccaacga attccgttgg caaaattcgg cagctgcaga acttttacgc 5640 tcatctccag agcgtttgga gagttctcga aggcggtttg aagcgaggag gtggcctgga 5700 gcagcagccg cgagtcacct gggatgatgt gcattcggct tttcaaagcc gcatcagaag 5760 tggcatcatc acgaacgtgc agcacatcga catcaagagc ttcatgaagg acgctgctag 5820 cctcttcaaa gagcagatcg agctcgcttt ggccgaacac aagtcgatca aggtcaatac 5880 cgagctcgcc gccgagtacg tcatcaccrc caccgaggag gagacgactg cgatcaagta 5940 ttttaatacg aaatcacagt cgatcttcca gactacgaat ctcgatacgt ggttctcgag 6000 caacgtaagt caatccatcg aaagagacat ggaggagttt caagaggagg gttcgggctg 6060 gtcgcttcgt tccattctac atctcacggt gaatatcaac aagttcaatc cgttgcgagg 6120 cagctcctac atccaactcc cggaaccgat cgagaggaag aaagcctgtg tcaacgtgca 6180 aaacgacgca gacgaccagt gtttcaagtg ggccatactc tcagccttgc atccggctga 6240 gaaaaattcg agccgcgttg gcaagtacaa agccttcgag aacgagctcg atttcacrgg 6300 tatcgagttt ccggtcatgc ctaagcagat ctcaaagttc gagagacaga acgagatctc 6360 ggtcaatctt tacatgctga agaagaatgg cggaggacag tttgacgtgt ctccctgtca 6420 cattacggct tcgaagagag agaagcacgt caatcttctc ttcgtacaag accattacga 6480 ggacgaggat gagcagaacg acgacgacga tgacgatgag aacgtaccct ctctgaaatt 6540 ccactacgtg tggataaaat atttatcgcg attggtctcg tctcagctga gctctcgcaa 6600 agcgaagcaa ttcatttgcg atcgttgtct gcactatttt tggactgccg acaaactagc 6660 ggagcacgag agagactgca cgaggctcaa cagcttcaaa gtcaaactgc cgaacgagac 6720 gaataatagg ctgaaattta aaaattatcg ctacaaggag cgttcaccat tcgtagtgta 6780 cgctgacttt gaatgcctgc tcaaacctgt cgaggaagac aagcgcgctt atcaggagca 6840 cgaagcgttc agcgtcggct actacgtaaa gtgcagcttc gacgaatcca agtcgacgta 6900 caagagctac aggaagacgg ccgttggtga acaggaggca gctaaatggt tcgctacgag 6960 tcttcacgag ctcggacaag aactcgagca gctctacgag catccagtac ccatggatga 7020 gatgagctcg gaagaaaaac gcgaatacac ctgggctaga gtttgtcacg tctgcgctcg 7080 acctttcagc gagcaggaca agaaagtgaa ggatcattgc cacttcacgg gaaagtatcg 7140 cggtccggct cacaccggct gcaatataaa ttacaaggac tcgcggacga ttccagttat 7200 ttttcacaac ttgggcggct acgactctca tctcttcatc agggagatct ccaacgcctt 7260 tccgggccgt atcgtgctgt tgccgcagac gaaagagcgt tacatctctt tcagcaaata 7320 catgggaaag agcgaggtga actttcgatt catcgactct attcgcttca tggcctcatc 7380 cttggataag ctggcatcct atttgcaaga gctgaagact gtcgaggcgg tgttcgagaa 7440 ggatgacgct tacagcgcgc aacagatcga tctgctgaaa cgtaagggag tgtttcctta 7500 cgagtacgtg tccacgctgg aaaaactcga cgagacttgt ctgccagctc agtcggaatt 7560 tcacagccgc ctgactgaca gcgacatctc cagcgacgac tacgagcacg cgaaggcggt 7620 atggcaggct ttcgacattc ggacgctcgg cgagtactcg gatctgtatc tcaagacaga 7680 tgtgctacta ctcaccgaga tctttgagaa ttttcgagac aactgcatga aggcttacgg 7740 tctggatgcg gctcactact acacgacacc cggtctctcg tgggacgcga tgctcaagtt 7800 tacgggcgta gaactcgagc tgctcaccga tatcgagatg ctactcttta tcgagcgagg 7860 cattcgcgga ggtgtgagtc agtgctgtaa tcgctacgca gaggccaaca ataagtatat 7920 ggccgagaac tacgatccga caaaggagtc caagtacttg atgtactttg acgttaataa 7980 cctctacggg tgggccatga cgcaatccct gccttgctcc ggcttcaagt ggatcgagga 8040 catcgagaac gcgagtttct acgacgtggc cgacgacagt cccatcggct acatcctcga 8100 ggtggatctg gaatacccgg aggaactcca cgactctcac aaggatttgc caatgtgtcc 8160 ggagcacagg gtagcgccag agtcgaagca agagaagctg atgactacgc tctacgacaa 8220 agagcgttac gtcattcact acagaagtct caagcaggcg ctgaagcatg gtctccgctt 8280 gaagaaaatt cacagagccc tgagtttcaa tcagaaggcc tggctcaagg actacatcga 8340 tctcaacagc gagatgcgca agagagccga gaacgagttc gagaagaatt tgttcaaact 8400 catgaacaac gcggtctacg gcaaaacgat ggaaaacgag cggaagcgcg tcgatgtgaa 8460 gctcgtgaat aagtggaaag gtcgctacgg cgctgaggct ctcatagcta agccgaattt 8520 ccacagctgc tccgtgttcg acgagaatct cgtagcgatt caactggctc gcaccgagat 8580 cagcatcacc aagccaatct acgtgggact cagcgttctc gatctctcca agtcgctcgt 8640 gtacagattc cactacgact acatgaggag tcgattcggc gacaactgca agctgctcta 8700 taccgatacg gatagcttgg tgtacgagat cggtggccgc gacgtctacg aratcatgaa 8760 ggaggatctc cacgagttcg acacgtcgga ctatccgcaa gacaatccgt tcgagatgcc 8820 gcgcgccaat aagaagatcg tcggcctcat gaaggacgaa tgcaatggga agatcatgac 8880 aggattcgtc ggcttgcgca gcaagatgta cgccgttcgg attcaaggtc agcagatgtc 8940 catcaagaag gctaaaggcg tgaagagtgc tgtggtgaag aagacgctca cctacgagga 9000 ctacgtacac tgcctactcg agaacgctac gatctcgaga gagcagtgta atattcgctc 9060 gcgacttcac gttctcaggt cggagaaaga gagcaaggtg gctctaagtc ccaacgacga 9120 caagcgatat ttgctcgaag gggaaaacac cgatacgctt ccttggggtc attacaaaat 9180 ccccgaaaac gctcgcaaaa gagcggcttc tgacgaggta gagggcgcca gcaagagagc 9240 ggcttctgac gagktagagg gcgccagcaa gagagcgcga gtggaggggg astcgcgagt 9300 ggaggggagc gatggagcga atataagtcg agccgtcgyc ggtgagaatt cattcgcttc 9360 gcagcactag tagcagcagt agtaagctac atataccagc aagtaagaaa gatcagcagc 9420 agcagcagta aaaatggatc tacaaaagct gaacgaagtc gccaagacta aggagtttct 9480 cccgacgaag aagatcgccg agctcgagga aggccgagtc tacaaggtca ccaagctcag 9540 gatggtcaac acgagatttg gaaggagaac cgtggcggag ctcgacgatg ctgtgcaggc 9600 gtttctaccc caacgcttcg ccttggcttt cgacaaggat gaggagttct tcaacagaac 9660 ggcggaagaa gctagcagct ccaagctatt cctaaccaag aacgcaggca gcagcaacat 9720 tgcattctcc tcttctccct gaacattgta ataataataa tattgtttga aataaaattg 9780 aaaaaatgtc gttttaacga caaaaaaata acgtagtcac gaggaacaaa caaaatattt 9840 ggagaggggc gagctgtgaa tataagccag gcatctgcct actctgaatc attagctcaa 9900 cagtcgcaga acatycaaca acatgctcgc cgacaaaacg atagccgact tggaggaaga 9960 tgtaacgtac atggttacag agctccaact catcagcacc gagtcaggaa cgcaagctgt 10020 ggctcgtctg aacgactcat ctcagcttct tctaccacaa cgtttcgccg tagccttcga 10080 gaacgaggag gaatacttct acatactgac agctgcagct aatacctaca ggctattcgt 10140 cactaagaag cagaacggaa gccttgaatt ttcttttctt taaacatgta tatgtattta 10200 ttgtgaataa actatgtgtc gttttcacga caaatacaaa gagaaataac gaggaacaaa 10260 atattttttc cactgactga gatttgtcga tccacgtatt gtgagtgttg tcaaatccaa 10320 gccaatttaa gaatacctcc ttacctcgtc tcttcaagac ccttctcgat gagatagacg 10380 tccggatggg ccgtcttcga gagttcttgc tcgtaaaagc agccagcgat aggctcatcg 10440 cgaaagtctc ttagttcata agtcactggc ttcgtgcgtc tgactcgaga gatagtaaat 10500 atctcggtcg accagttggc cgtatagcct ttttcgaaca aatttttcgc tttgctcact 10560 cggactttgt cattgacttt gaactttggt ttctgctgct ttgctgctgc tgcctcgtgt 10620 gcaaactttt tcatgactag ctcgacattg gcagccgtca catccttagg cttcataccg 10680 atagtacgat gcttgcgact gttgtaggca gtcaggagtt tactcaacac atcgtaccat 10740 ttgaagctgc cgcgtaaact aaactgcttc cacatggcat tttttaacgt acgattgaag 10800 cgttcgcaga tcgcagcctt gagattacta tacgtcgagt acagattaat ctcgtactgt 10860 ttcatgagat tccgaaaggt ggagttgtaa aattctttgc cacgatccgt atgcaagttt 10920 ttaggcactc gaccttccga cagcagtatg cttttcatag cagctgctac gtcttcgccg 10980 gtttttctct tcacgggaat ggcccaagcg aatttcgaaa acgtgtctat gacagtgagc 11040 atatatctgt agcctttgtt ctctcgcgaa taaggttgca tctcaaccaa atcagcttgc 11100 caagtctcgt ctaatccacg aacatcgaat tttcgacgcg tgtacgtacg acgagctggc 11160 ctatgtagct cctcaacgat tgctttcttc gacatgttca cagctcgtca ttyagagagt 11220 gaatgtttcg attgatttgg atcgtcccag taaacgtaat ccatttttgt atctctcctg 11280 gcgattttgt acggcggaag cagtgtaccg ctacttttct tcttcttytt cgactttgag 11340 ggcgaggacg acgtgttgtt gggtagaaag agaggagcga taaagtcrtc gtatttttat 11400 tactattgtg ataacgcaaa atctcatcgg ggcggaaatt cctcctgtgc acgctcgaca 11460 attcgagtat ttcttgataa ttttttagat ctgcagccga gaccagagca cgattcggtc 11520 ggttttcggg aaagttstat catcgactgt aatgatatct ccatcatact ytattcttga 11580 tttgccgagt ttatatccgc cgccgccgtc ttggtccttc cgcacgccgt aaaccttatc 11640 cacattttta cgacgcaaaa gaagcgaaag cgattttgct ccgtcgtcgt tgtcgctgtc 11700 gctgtttgcc gattcgaacg tcgtatccgc agcagcagcc tgtatagttt cattcgcttg 11760 agaagtctta ttctccttag cagtagaatc agtatacaag tcatcgtcag attttattat 11820 tcgtttattt gctgcagcag caggcaattt ttgctctcta gcaactgaag aattgaccac 11880 taactgttgc aagggatcga cgatcggctt gaaagtctca cccaacgctc tctcaaagtt 11940 ttctttactg tatttcaaca gatcaaactt tcgcttaact gcgcttcgcg ctttcacgag 12000 ctgctcgagt atatgcttct gctgactatt actatcagac atgttgacat atcaacggtt 12060 agtgatagac tgttatagat cgctaacatc gtcctatatt tatagcaaag cagtcgaaac 12120 cctttcgata acgaccctcg ttgatcggac tatctttgtc gatcatgaga aaggagtgct 12180 tatcgtcggc ccagcagctc gagcataatt ccttgaactg cgcgaatgtc atgtcgcaat 12240 tgacgtgatc tgcatacaag tggcgtaaat tcacgtcatc ttgtcgaaat ataaccagca 12300 aattgacatt atctctcacc aagtgcttcg gcacttgagc gtacgactga cagaggtaga 12360 agcagtcgac gctcttgtgt cgacccatgc agaaaaatgc gcgtatgtta ttttgcttct 12420 cgcatgcgac gtcgtcgaat atcatgatcg aattgggtag tgccttgtca ggcgggacta 12480 cttcctcgtg ctcgctgaaa gcaaagtagc gcataccatc gatcggttcc agcaactctt 12540 tcaaaaattt atatttcgac tgattcagcg acttggagta gacgtacaca ttctcgaatc 12600 taagtccgtt cggatgcgtt atgagagcga gaaggctatt ggttttaccg cagtttgatg 12660 gtccgcaaat gactgcgcgt atactgtcgg gaagcagagc tccgtgacgt ttcacgcgtc 12720 tctccccgtg ctcggtgata gtgtcaaagt tgaccacggg gagcttcaca gcctgctctc 12780 tgaagttcat tctctcaaat ctcggctaag actgatcgat ttcctataaa taggctgctt 12840 ttatagccgc ctgcatcagt tatcattgga ccttcgtgga gtaatgatcg tacacattga 12900 gagcggacgt ggtttggtga acaaagtgct caacaaaatt cctgttgaac ttcatttgcc 12960 gggctatcag tactgcggac caggaacaaa gttagctaaa cgtctagcta gaggagatcg 13020 aggcataaat cctctcgatc aagcttgcaa agagcacgat atagcgtact cgcagaatcg 13080 cgagaacgta gaggctagaa atcaggccga tagagtatta gccgacaaag cttggcagag 13140 agtcagcctg acgcgaaatt agatgagaaa tttccgctta cgccgtatcg aaagcgatga 13200 aaataaaatc gaaattggga atgggtatga agagtaggag gagaatggct acgacgaaga 13260 ggagaccatc cacgaggctg aacgctatag tgaaggcagc ctcgagagcg atgatcaagg 13320 gcactaatag tcacgaagtc atcgcatcgg ctttgaaagg agctcgcgat gccgtgaaaa 13380 aggctggtgg caaagataac gtggccatac cacgagtcct gccactacct cataaagtag 13440 gtggctttct ttctctcctc atacccattt ttgctggact gagtgcggcg ggtgctttgg 13500 ctggaggagc tgctggcatt gccaaggcgg tcaacgactc caaggctgcg aaagagcagc 13560 tagaggagac tcgtcgacat aacagagcga tggaaaatat cagcgtgggt aagggactct 13620 gtctgaaagc cagcaagact ggattgggcc ttcatctcaa gccctatcgc ggaagcagtt 13680 ttaaaaaaaa acttcttcga gataagcttg cccgagaggg cgctgacaga cgtggacatc 13740 gtgaagtacg ctaaaatctt gaaaattcca tattttcgag gtgtattcat gcgtaacgcg 13800 atgcctgcga gtggtccacg aaagcgcgaa tcagccgttg tgaacttgga cgatgcgagc 13860 gggcctggaa ctcactgggt agcgtatcgc aaacgagacg acaacgtcgt ttacttcgat 13920 agctttggag atctacagcc tcccttggac ttgatgctgt atttgggagt gaacgagatc 13980 cagtacaatc acgagagata tcaagattac aacaccttca actgcggaca tctctgtttg 14040 cagttcttga gtaataagct atataagagg cgactgccgt gaatgcacat tcattcagca 14100 ttcgacagtc agtatggata gctcactcac tctgagtctc tccggaactt catctgtact 14160 cgaagctcag tactttccac cgattgagct ctcgaagcaa aagatctacg ttctcggtct 14220 agttgaactg ctcacgttca actccattcc caatatcgac acggataata acaagttcta 14280 cgtcggtggg gaagaaatta tcttacctac ggggagctac gaaatcgagg atatcgacgt 14340 ttaccttcgc gaagctttga ctcccaaggg tatatcgttc agtctcaagc ccaacaataa 14400 tacgcttcgt agcgtgataa agtgcagtca tccgatcgac tttcgtccgc gagactcgat 14460 cggctctctg ctgggcttca ccgagcgcat cttaccggag aacgtgagcc acagttcgga 14520 tagacctgta gctattctaa aagtcaacgc tctgcgagtc gagtgtaaca tcacctcggg 14580 tgcctatata aacgggcagc gcgtgcacac tattcacgag ttctttcctg ctgtacctcc 14640 aggatacaag atcatcgaaa ttccctcgca agtcatctat ctgccaatca ctgtgagaag 14700 catagatcag cttcaagtgc gcatagtcga tcaagacaat cgcttggtta attttcgcgg 14760 agagataatc accatcagat tacacctaaa acagatataa tgggtatcgt ttacaacagc 14820 agcaaagtcg gagctggcta tataagcaag ccgacgagac gcagcgagat cagtcgtcag 14880 cgaatcgcca aagcgctaac acgacaaaac gtagacttcc tgaagagtct caaattcaga 14940 gtacgtcctc gtggaaaata ataatgcagg agattctgag tatacaggcg ccggtggcct 15000 tcgacgagtc tgtagctcat tacgagttac acgctcatca gccctacacg ccatctggat 15060 acaacaacag cgacgagata cgcatagcca ttcaacatca agacctatac gttcttccct 15120 cgcgcagctc gctgcacatt cacggcaaac tgacgaaagc cgacggcagc gccgtggagc 15180 gcacgcgatt ggtcaacaac gcgatatgcc acatgttcga ggagattcga tacgaaatga 15240 atgccgtgga aatcgacaga tctaaaaatg tcggtcttag cacgctcatg aaaggctgga 15300 tttctttcaa tcccaatcag agctcgatgc tcgaaaatgc tggctggtta gacgtcgagg 15360 agacgtttca agtgaccaac aatgccggct atttcgatat caacataccg ctcagtatga 15420 tactcggctt tgccgaggac tatcgcaaaa tcgtggtcaa cgtcaagcac gagctcatac 15480 tgacgagatc gcgaagcgat ctgaacgccg tgacgcagac tgctggcaga aacgcggctg 15540 gacaagacgt ttaagaggac atcaatattg gactattgaa aatcgagtgg ctcatgccct 15600 acgttgtcgc gtcgaacgtg cagaaaattc gtctgctgca gtccgtggag aagaacaagc 15660 cgatcagcat gagttttcgc agctgggagt tgtacgagta tccgctcctt cccgtctcca 15720 cgaagcacgt gtggacggtg aaaacgtcga atcagctgga gaagccgcga ttcgtcgttc 15780 tcggtctgca gactaatcga aagagtctga gaactgcgaa cgccagtcgc ttcgatcact 15840 gcaatctcag caacgtcaaa ctcttcctca attcccagta ctatccctat ggaaatttga 15900 atctcgatat caatcgaaat cagtactctg tactgtacga tatgtacgca aatttccaga 15960 acgcctacta cgacaagaat cccaagccgc tgctgaagaa ggcggacttt atcacgcaca 16020 tacctctctt cgttatcgac tgctccaagc agagcgaaac gttgaagagc gcgcctgtcg 16080 acgttcgtct ggaattcgag tcacgtgata attttctcgc caacacttcg gcctactgtc 16140 tcatcctgca cgatcgtatc rttcagtata gtcctgtcag cggagacgtg aagaagctcg 16200 tgtaatggag tacgctgtgg atatgcaagg cttcaagcag cctggaaacg atttcgttct 16260 caaggagctg gctattgtct cgctcacgga cgagagcgag cctctagtcc tgctgtttcg 16320 agaaccgttt ccctggagaa gactcacgga aaagtatagg aaggagaatg agtggctaga 16380 gcgcagccat cacgggctat cctggtcatc gggcaacata gcctacaccg aggtcggcaa 16440 gctgctccgg gaagctcttc gagacgccag caagatcttc gtcagaggag aactacgccg 16500 acgatggcta gaacgtttcg atctcaccgc cactgatatc tgcgattacg gatatccctc 16560 ggaagatttg ccgaaaatcg tgactgtatg caccaatcac aatggagcct acaagtctac 16620 ctgcgctctg caaaatgtca aaatcatgaa actctactat ctcaccaacg ttcacatgga 16680 gtgggaagat gtttcggaga ctgaggagta cgcttaatat gcgagcgact gaaaagtgcg 16740 gcggtgggtc gagagtaaga tctcggcctg ctgtactggg ggtaccgggt tcaaatctcg 16800 gccgcggata aattttttgt gatattccgc tgggtaagct gggacccgca ctggcttgcc 16860 gactacgagc gcgcgctctt tattaccggc tggcagagct gggacccgca tactgtctga 16920 ctcgccgact gacactctga cgcgagagcg cgctctttat tacgcgaccg aaactggcaa 16980 aaaggggggg taccctgggt gagctggtag ccgctagaag tgggagctgg tagccgctcg 17040 gagagggagt gagctggtag ccgcaaagag tgggggtgag ctgggacccg catgagctgg 17100 tacccgcata gcttacctac t 17121 // ID Helitron-2_NVi repbase; DNA; INV; 6512 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Helitron DNA transposon - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6512 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 763-763 (2009). XX DR [1] (Consensus) XX CC The consensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS join(1373..1708,1712..2152,2140..2721,2721..3248, FT 3209..3589,3593..4069,4143..4637,4628..5485) FT /product="Helitron-2_NVi_1p" FT /translation="LNNDKIGFIANSEYDDKTYTLNYLGKMDQECEFCSAK FT HFLLELPVDKKYSNCCHKGKVIIKDKFQYPICLLQLTNRKSQESKQFMRLI FT RNYNNALAFASFGAKLDIVPGHGPVFRICGQTYHNAYSLNPVTNENRKFGQ FT LYIIDNEIANEIRFSTDIKCNIKLLNENDNILRKINPYAEAYKMMHEVEEE FT ELKRARNANELPREVKMFITRQNYSNNKTFELPSCNEVAVVFVGEEGEPPF FT NRDFCIDSKNSKPEIFQLNIPIISKHVDPMTYPLIFPFGQGGWQPNYESEI FT KNKTQVTALLYYSFXLSVRNEFNPCLNLGKLTQQFIVDSWCKVEGTRLFYI FT RNQQAKLRTETYRGLMDYVTNKAQSINVNIGKMVILPSTFIGSPRSLQQNY FT IDAMTVVQTFGKPDLFVTMTCNPKWIEIVENLNENENTLDRPDLVARVFHL FT KVAAFWNEIXKKIFGKIIGYIYVIEFQKRGLPHMHALLYLDFKDKFHSSDQ FT IDNIISAEIPDESNYPILYGLFKQHMIHGPCEIQNKNSPCMDKSKFICTKK FT FPKPFASSTNYNVENGYPIYRRRNDGRIIKYGSNRTANNQFVVPYNPYLLL FT TFNCHINVEICSTVKSINICLNIYIKVPIPLYLFKYLYKGPDSALVGFRKE FT NDLNNESDITSNNSNEITNVLNYDEIEQYLAMRYVCPPEAMYRLLEFKLYD FT QSHVIYRLAVHLENEQFVYFKQGSETELIDRNVNTTLTAWFELNKINSLAR FT DLLYIIPNHYVFDKKLKIWTPRKRIYKPILSRMYFVDPKKRELFYLRMLLL FT HVKGATSFENIRTVEGIVYSSFYEAALALHLISADNEWDKCLQEAINYQFP FT NALIQLYSFILIFHHPCNAKELYNKYKTYFYDPKLNDEIAEQTCLTKINNI FT LISHGITLRDSHLHMIRTLFTHLIRDNVIFLTRFNHQISVISVAWTGIAAN FT LLHGGKTVHTTFKLPLNLSELSTSNVSPNSNYGKFLKTIKVLKYAFDAIDK FT LFKDLHNSVEPFGGIVVVLSGDFRQILPVVRRGSRAQILESCVKKSSIWNK FT FKIMKLIDNIRVTNNDQLFNLGYSWLLNVGEQNCTLKFEKQNHVTLISNDM FT LSRGNLIYEIFGSKIDPNDKTLYQKVILAPTNIDVLKINDEILDQLEGESH FT TYLSIDLVNTETNEDMFNSISIEFVHXLTPNGLPPHKLTLKVDAIVILLRN FT LNLDGGLCNGTRLMIQELRSYVVKAEIITGTSKGKIILIPRIDLSPSIEEV FT PFTMVRRKFPFRLGFAMTINKSQSQSFDNVGLYLPSSVFSHGQLYVALSRV FT RQKNNLKILLAXDSVITENQDDKINYLNKRFNDNLKIYIKNVVYREIFELD FT NF*" XX SQ Sequence 6512 BP; 2508 A; 789 C; 831 G; 2376 T; 8 other; ttgcgaggtt atgatcactt gttttggtat aactttcaaa gcttccgaat atatccacca 60 ggtgtgccac ttttgcctta acttgcgaat acttaaaacg ggatcaatta ccatattatg 120 gcgcgttttt caaagttaca actatgtcag atttatctat ttattagttt atactacaaa 180 atttaattgt aaactcatat aaaaatttga gttaataaca ttgaaatttt taatttcaat 240 tgaatgtaat tataagatca ttttaaattt ataagttaaa aaatgttaaa tgaaaattat 300 attaatgtat tattataagt catgcttcgt ggggtgcgtg gtagcatatg gccgggtaaa 360 ttgagtacgt tattataata ataataataa tatttatcat tttattataa tggtttaatg 420 ttgtaactgt tgaataaata taagtgtatt tatattttat tttaaatata tcgtgtttta 480 aaatattatt tattgtttta cattattaaa tataattatt tttaattaaa atgatatatt 540 ttatacttga gactaaaaaa attaatagaa ttttcacttg ttctgttttt ttagattgtt 600 aaacgaaata aatataatta cttaatattg attgaatgta tgttttaaga tataagttcc 660 aatgaattat tattaacttt atatattgat aaaattttat ttttaatact aactaattct 720 aatctcttat ttattatact aggattagat cagcgtgctt tgcgcttgat aacttagatt 780 ttatctggta ataatttaat cattttttaa tcattttttt taatacttag aaaaaaattt 840 aattcattat tctttagaat catatttata aaaaaaatgt atatcttcca ctctttgaaa 900 atttacaaca ttataaaata tattccaaag agtttaagta taaagtttaa aatatacata 960 atcaatcaaa ctaagtgata acgctaaatg taaaatttct ttctaaaaat atcattaatt 1020 ccatattart actacattca aatttatata ttttatataa ctccttagta catattttat 1080 aataattgca tgaagagttt cataagtttg tggagttttg ttatcattga atttaaaatg 1140 tttttttaaa tgtcattatt tttatactga cgctagatct agattttata ctttattctt 1200 gactccttgg tatatgtttt acaatattta agagatttgt caagttttta gaaagtagaa 1260 aatttacttt gttctaaagc atagacgttg attccttatt tactcatcaa aataaatctt 1320 cctcattatg aatcatgatt aatcttaacg aatataataa aacacacatt aattaaataa 1380 tgataaaata ggatttattg ctaattctga atatgatgac aaaacatata ctttaaatta 1440 cttaggcaaa atggatcaag aatgtgaatt ttgtagtgct aaacactttt tattagaatt 1500 accagttgat aaaaaatata gcaattgttg tcataaagga aaagttataa taaaagataa 1560 atttcaatat ccgatttgtt tgttgcagtt gactaatcgc aaatcacaag aaagtaaaca 1620 atttatgcga ttaatacgaa attataataa tgctttagct tttgccagtt ttggggctaa 1680 actagatatt gttcctggcc atggccctta agtttttcgc atttgtggtc aaacttatca 1740 taatgcttat tcattaaacc cagtaacaaa tgaaaataga aaatttggac aattatatat 1800 aatagacaat gaaatagcta atgaaattcg gttttcaact gatataaaat gcaacataaa 1860 attactaaat gaaaatgaca atatcttacg taaaataaat ccttatgctg aagcatacaa 1920 aatgatgcat gaagttgagg aagaagaatt aaaacgtgct cgaaatgcca atgaattacc 1980 tagagaagtt aaaatgttta ttacacggca aaattattct aataataaaa cctttgaact 2040 tccatcatgt aatgaagttg cagtagtttt tgtaggtgag gaaggagaac ctccatttaa 2100 tcgtgatttt tgtattgatt ctaaaaattc aaaacctgaa atattccaat tataagtaaa 2160 catgttgatc ctatgacata tcctttaatt tttcctttcg gacaaggagg atggcaacct 2220 aattatgaaa gtgaaattaa aaataaaacc caagttacag cgttgctata ttattccttt 2280 cytttaagtg tacgaaatga atttaatcct tgtttaaatt taggtaaatt aactcaacaa 2340 tttatagtcg attcatggtg taaagttgaa ggaacgagat tattttatat tagaaatcag 2400 caagcaaaat taagaactga aacgtatcgt ggattaatgg attatgtaac aaataaagct 2460 caatcaatta atgtcaacat tggcaaaatg gttattttac catcaacatt tattggaagt 2520 ccgcgttctt tgcaacaaaa ttatatagac gcaatgacag tcgtccaaac atttggaaaa 2580 cctgatttat ttgttactat gacttgtaat cctaaatgga ttgaaattgt agaaaattta 2640 aatgagaatg aaaatacatt agatagacct gatttagtag ctcgagtttt tcatcttaaa 2700 gtagctgcat tctggaatga ataamaaaaa aaatttttgg taaaattatt ggttacatct 2760 atgtaattga atttcaaaag cgaggattac ctcatatgca tgcactacta tatttagatt 2820 ttaaagataa atttcatagt tctgatcaaa ttgacaatat tattagcgct gaaataccag 2880 atgaaagtaa ttatcctatt ctgtatggtc tttttaaaca acatatgata catggaccat 2940 gtgaaattca gaataaaaat tcaccatgta tggataaatc aaaatttata tgtactaaaa 3000 agtttcctaa accgtttgct tcatctacta attacaatgt agaaaatgga tacccaatat 3060 atcgtcgacg taatgatgga cgaataataa agtacggatc gaatcgtact gctaataatc 3120 aatttgtagt cccatataat ccttatttat tgctaacatt taattgtcat attaatgtcg 3180 agatatgcag cactgtaaaa tcaattaata tttgtttaaa tatttatata aaggtcccga 3240 ttccgcttta gtaggtttta gaaaagagaa tgatttaaat aatgagtctg atattacatc 3300 taataattca aatgagatca ctaatgtttt aaattacgat gaaatagaac aatatttagc 3360 tatgcggtac gtatgtccgc ccgaagcaat gtatagatta ttagaattta aattatacga 3420 tcaatcccac gtaatttatc gattagcggt ccatttagaa aatgaacaat ttgtatattt 3480 taaacaagga tctgaaacag aattaataga tcgtaatgtg aatactacat taacagcttg 3540 gtttgaatta aacaaaatta attctttagc acgagattta ttatatattt aaattcccaa 3600 tcattacgtg tttgataaaa aattaaaaat ttggactcca cgaaaacgaa tatataaacc 3660 catattgagt cgtatgtatt ttgttgatcc taaaaaacgt gaattatttt acttacgtat 3720 gttgttatta catgtaaaag gtgccacatc atttgagaat atacgaactg tagaaggtat 3780 cgtatattct tcattttatg aagcagcatt agctttacat ttaatatcag cggataatga 3840 atgggataaa tgtttacagg aagctataaa ttatcaattc cccaatgcat taattcagtt 3900 atattctttt attttaatat ttcatcatcc ttgcaatgct aaagaactat acaataaata 3960 taaaacttat ttttatgatc ctaaattaaa tgatgaaatt gcagaacaaa cttgtttaac 4020 gaaaattaat aatattttaa tttcacatgg aataacgtta cgtgattcat gattatacct 4080 ttaattgacg aaactattca aattttcaat tactctaatg atttgcgaaa ctactccatt 4140 agcaccttca tatgattcga acattattca ctcacttaat aagagacaac gtgatatttt 4200 taacgaggtt caatcatcaa atttcagtta tatctgtagc ttggactggt atagccgcaa 4260 atttgttaca tggaggtaaa acagttcata caacatttaa attaccatta aatttatcag 4320 aattatcaac aagtaatgta agtcctaatt caaattatgg aaaatttcta aaaacaataa 4380 aagtattaaa atatgcgttt gatgctattg ataaattatt taaagatttg cataattcag 4440 tagaaccatt tggaggaata gtagtcgtat tatcaggaga ttttcgtcaa atattacctg 4500 ttgtacgacg tggaagtcga gctcaaattc tagaatcatg tgtcaaaaaa agttcaatat 4560 ggaataaatt taaaattatg aaattaattg ataatattag agtcacaaat aatgatcaat 4620 tgtttaatct tggttattaa acgtaggcga acaaaattgc acacttaaat ttgaaaaaca 4680 aaaccacgta actttaatat ccaatgatat gttatctaga ggaaatttaa tatatgaaat 4740 tttcggaagt aagattgatc ccaatgataa aacattatat caaaaagtaa tattagcacc 4800 gactaatatt gatgttttaa aaattaacga cgaaatatta gatcaattgg aaggagaaag 4860 tcatacatat ttaagtattg atttggtaaa cacagaaaca aatgaagata tgtttaattc 4920 aatatcaata gaatttgtcc attyattaac acccaatgga cttcctcctc ataaattaac 4980 attaaaagtt gatgcgattg taattttatt gagaaattta aatttagatg gtggtttatg 5040 caatggaact agattaatga tacaagaatt acgatcttat gttgttaaag ctgaaattat 5100 tactggtacc tctaaaggta aaattatttt aataccccga attgatttaa gtccttcaat 5160 tgaagaagtt ccatttacca tggtacgaag aaaatttcca tttagattag gatttgcaat 5220 gactataaat aaatctcaaa gtcaatcgtt tgacaatgta ggattatatt taccgtcttc 5280 agtttttagt catggtcaat tatatgttgc tttatctaga gtacgtcaaa aaaataattt 5340 aaaaatatta ttagcaraag attcagtaat aacagaaaac caagatgata aaattaatta 5400 tttaaataaa cgtttcaatg ataatctaaa aatttatatt aaaaatgttg tttatcgtga 5460 aatttttgaa ttagacaatt tttaactaat tttatattac gtatcttcta atattctcga 5520 taattaatat ttcatttatt tatatatgtt aatactgaac attaatttaa tttttctata 5580 cattacaatt tttaatatgg ttatgtataa gtatgttctg cctaacaatt aaayaataag 5640 catcaaagaa ataaaaatag cagcaaccaa tttcattgcc attggtagcg cttgtatatt 5700 atcattttta tataataaat taattaaata gcaaaatata attatttaat tataaaagtt 5760 ttatattgta agcatcaaac aaataaaata gagcaaccaa tttcattgcc gttggtagcg 5820 cttgtacatt atatttatgt atttaacaaa ttatttaaat aataaaatat cartatttga 5880 ttataaaaac tttacataat aagcatcaaa caaattaaat agcagcttcc aattccatca 5940 cagttgatag cgctcgtaca ttttcattta yatatttaac gaattattta aataacaaat 6000 atgagtattt gattgtaaaa atttacgtaa taagcatcaa aaataaatag caaccaattt 6060 attgcattgt agcgcttgta tttatcattt tctaaaatta tttaaatagc aaaatataat 6120 tatttaatta taaaagtttc atattgtaag catcaaataa ataaaaagca gcaaccaatt 6180 ttattgacgt tggtagcgtt gtacattatt atttattatt taacaaatta tttaaataac 6240 aaaatataat tatttaatta taaaaattta tatagtaaac atctaagaaa aaaaaataga 6300 atcatccaat ttcattgcca ttggtagcgc ttgtgcatta taatttataa attcaataaa 6360 ttaaaataag ttattaaatt tattaaattt acagtacaaa actacatttt aatttaatat 6420 aaagctactt ctacatctca aatttgtagc gctcccttat tatattacaa attatggtat 6480 aaatatgtat ataaaaactt caaaataaac ca 6512 // ID hAT-6_HM repbase; DNA; INV; 4479 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4479 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1995-1995 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1384..3957 FT /product="hAT-6_HM_1p" FT /translation="MSEKKRLSVKKNKESQRKAILKYFRAETHSTGTDSAT FT ESDSGVSKKSKVIVDDKTVNNANISNQQNEVVNINENQIYLPEATPAVNIS FT RQIETETQISILSSADYSDPANWPTHFSDNFRQFVIKNRPQQIEKYNFPRD FT RDEQKRKFSSIYYKRTLPNGEEVSRSWLIYSVSKNVVFCLCCKLFGKSKSS FT LPALEGSGLNDWKNIGALLSSHEKSNSHLVNFQMWKELDIRLSTEKTIDHI FT NQQKINEKEQYWQRILERLIALVRVLATQNLAFRGTNEKLYNNANGNFLKF FT VEYLALFDPLMNEHVRKIKNQEIKTAHYLGKEIQNELIQILANAIKDNILT FT RVKFAKYYSIILDCTPDNSHTEQMTIIIRFVDLESPTSHDGDLVKIKEHFL FT GFVPVEKSTGGFLAKTLIEQLENFNLPIENLRGQGYDNGSNMKGKENGVQR FT KILDINPRAFFIPCSAHSLNLVVNDACKCSLDAISFFGVVQQIYSYFSGSP FT SRWQVFRSYFPDFTVKSLNDTRWESRINALKPLRYDLGKIYDALMQIFEDP FT RLQTTSVGNSSRNEAKGLANSICIFKFMVALVSWYDILFEVNISNKILQNK FT KVDLNVATQQLNITKNKLVKMRNDEGFQRIIVDAAEIAKELETVTNFEEKH FT VGRRRKKRQFDYETQDEALQDPKEKFKVEFYFKILDTAIQSIAERFEQMRQ FT YNSMFGFLHDIYSISSKSSAELLKNCRNLEEILTHGSQKDISAADLCNEIK FT VLSGRLPQQMTPHEVLTFIVEQRLIDCLPNICISLRILLTLPVSVASGERS FT FSKLKIIKNYLRSTMLQERLVGLSIISIEHEESSILNLKEIVKTFATKKAR FT KIKI*" XX SQ Sequence 4479 BP; 1621 A; 626 C; 688 G; 1544 T; 0 other; cagagccggc gctaggcact aggcagacta ggcagctgcc tataaaatat aaaaaatcta 60 actaaaatta taattttata ataattataa aaattatata taacgatgag aaattttcca 120 tttccagtaa ttgagcataa actgcaaggc aaattagcgt atcactcgca ggctttaggc 180 aaaggcaaca tgtagtgttg taatgttatt aaatagtgac tcctgcttct tgtctctcgt 240 ctgtgtttat attaataaaa tgagagataa ttaacaactt gagttgcgtt ttccaaataa 300 agtaaatatt ttctttttaa attataaaca caacaaacaa aattgtatat tatccgttcg 360 ctgttcctca tgttcattcg atcgcccgca tattcaatca gttatttcac attcattcgc 420 ttgcgtttgg ttaacgagta ctagatactt cttctgtctg tctctgattc tttgaataaa 480 tgtcaattta actctttccc tccgcaagcc gctgatagcg gcttttgatt ttttttttat 540 tatgactgca agccgctggt agcggctttc gagtatgctt cacaatttat cctgttttgt 600 gcataaatct agacacataa tttgtttttt cttaaagttt aattagttct ctacataatc 660 aagtacaagt atgagccata atgtgtaact ataattagtt atcacttgac aaagttaaaa 720 aatctcaagt aatccttgat aaaaaaaatt tctttaaatt tagttcgtaa ttagtaaaat 780 aaagcagcgt taacactgta aaatttaact tgtagaagcg tgtgagtatt tagctaagta 840 attacaaagg ttttttttca agaaaaacgt tctttttata cattaaataa gttttatttc 900 ggactttatg ttttatttat tttattttta tttttttatg ttactattta taaaaacgat 960 ggtttaagta ccttgcattt cttactgttc aacgatgcgt gttaaaaact tatttaatta 1020 aaaagttcat ttgtttatct ttcaattaat atatagtttg taacaaacta tatattaatt 1080 gaacaaagct aacaaattgt tacgctttta aaaagatata ctattcgaat tttaattagt 1140 tttttatgtt atttcatcca aattttattc aaaaaaacgt gcggtcacag cgaaaagatt 1200 tttgttaaaa cggcggtagc gaaagagtta agtgttgact attatatttt atttataata 1260 agtatttaac tgacaatatt cttatcgatt gaaaaggtag tgtattactt caattctact 1320 attgtacgct agcaagtaat aagtaaatat tttataatta cacatttctt acgtttttta 1380 accatgtctg agaaaaagag actatcagta aagaaaaata aagaaagtca aagaaaagct 1440 attttgaaat attttcgtgc agaaacacat tctacaggta ctgattctgc tactgaaagc 1500 gatagcggtg tgagtaaaaa atctaaagtg atagttgacg ataaaactgt taataatgct 1560 aatatttcaa atcaacaaaa tgaagttgta aatatcaatg aaaaccaaat ttatctacct 1620 gaagcaactc ctgctgttaa tatttcaaga caaattgaaa ctgaaactca aattagcata 1680 ttatcatctg cagactacag tgaccctgca aattggccta ctcattttag tgacaatttt 1740 cggcaatttg ttataaaaaa tagaccgcaa caaattgaaa aatataactt tccacgagat 1800 cgagatgaac aaaaaagaaa attttcatca atctattaca agcggacatt accaaatgga 1860 gaagaagtct ctcgttcttg gcttatatat tcagtttcaa aaaatgtagt tttttgttta 1920 tgttgtaaat tattcggtaa aagcaaaagt agtttgccgg ctttagaagg ttcaggttta 1980 aacgattgga agaatattgg agcacttctt tcttctcatg aaaaaagtaa ttcacatctc 2040 gtaaattttc aaatgtggaa agaacttgat atacgtttat ctacagaaaa aaccattgat 2100 catattaacc aacaaaaaat taatgaaaaa gaacaatatt ggcaacgaat tttagaaaga 2160 ttaattgcct tagtcagagt actggctacg caaaacttgg catttcgtgg aacaaatgaa 2220 aagctttaca ataacgccaa tggtaatttt ttaaaatttg ttgagtatct cgctttattt 2280 gacccattaa tgaatgaaca tgtgcggaaa attaaaaatc aagaaattaa aactgctcat 2340 tatcttggaa aagaaattca aaacgaatta atacaaatac tagcaaatgc tataaaagat 2400 aatatcttga ctagagtaaa atttgctaaa tattattcta taattttaga ctgcacccca 2460 gataatagcc atacagaaca aatgacaata ataattcgtt ttgttgattt agaatcaccg 2520 acatcgcatg atggggattt agttaaaatt aaagaacatt tcttaggatt cgtacctgtg 2580 gagaaatcta caggtggttt tttggcaaaa acattaattg aacaattgga aaattttaat 2640 ttaccaattg aaaatctccg tggacaagga tacgataacg gaagcaacat gaagggtaag 2700 gagaatgggg tacaaagaaa aattttggat ataaatcctc gggcgttttt tattccatgc 2760 agcgcacatt cactaaactt agttgtgaat gatgcatgca agtgtagctt agatgcaata 2820 agtttttttg gtgtggtgca acagatttat agttattttt cgggatcacc ttctcgttgg 2880 caagtttttc gtagctattt tccagatttt actgtcaagt cattaaatga tacaagatgg 2940 gaaagccgca ttaatgcttt aaaaccatta cgatacgatc tcggaaaaat ttacgatgct 3000 ttaatgcaaa tatttgagga tccaagatta caaactacat cagtaggaaa ttcttcccga 3060 aatgaggcta aagggcttgc aaattctatt tgcatattca agttcatggt tgcccttgtt 3120 tcatggtacg acattctttt tgaagttaat atttctaata aaatattaca aaataagaaa 3180 gttgatttga atgtagctac acaacaatta aatattacta aaaacaaact tgtaaaaatg 3240 agaaatgacg aaggctttca aagaattatt gttgatgctg ctgaaattgc aaaagaattg 3300 gaaacagtaa caaattttga agaaaaacat gtcgggagaa ggagaaaaaa gcgacagttc 3360 gattatgaaa cacaagatga agcgttgcaa gatccaaaag aaaaatttaa agttgaattt 3420 tattttaaaa ttttagacac tgctatacag tctattgcgg aaaggtttga acaaatgcga 3480 cagtataata gtatgtttgg cttccttcat gatatttatt ctataagttc aaaatcttca 3540 gcagaattat tgaaaaattg tagaaatcta gaagaaattt taactcatgg ttctcaaaaa 3600 gatattagtg cagcggatct ttgtaatgaa atcaaagtac tttctggaag actcccacaa 3660 caaatgacac cacatgaagt actaactttt atagttgaac aacgtttaat tgactgtcta 3720 ccaaatatat gtatttcttt aagaatacta ttaacacttc cagtgtccgt tgcaagtggc 3780 gagcgaagtt tttcgaagtt aaaaataatt aaaaattatt tgaggtcgac aatgttacaa 3840 gaacgactgg tgggattatc aataatttcg atagaacacg aagaatcatc aatattaaat 3900 ttgaaagaaa ttgtaaaaac atttgcaaca aaaaaagcca gaaaaataaa aatttaaata 3960 tgcttttatt tttattatta catattttca aaatacaaat gaactttcta gtttctgaaa 4020 ttcaaattct ctaataattt aaacatcaaa tcagaaacaa gtttttttat tctttgtaac 4080 tatattatat tccaagtttt caatttttat tcttaattct agattattac ttattattat 4140 ttcttgagat aagctaggtt aattattatt attttggtgc gattaaatat taggaatagt 4200 cttcattaat tctattaaat gttactttta ttctttgaat gtcataaaat ctcaaatatt 4260 tttattcttg taaataaaag ttgagtggct gaaatcctaa aatgtcaaaa ataaatattt 4320 aatacttaaa tatgtccacc attttataat gtttatgcca ttttaaataa aattaatatt 4380 tatgtttttg ctctttcaaa ttaaaatatt ttttttcgga gggggggggg ggttgaattt 4440 tttttgccta ggacgccaaa taccctagcg ccggctctg 4479 // ID LARRP1 repbase; DNA; INV; 212 BP. XX AC L42496; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 2.02, Last updated, Version 3) XX DE Leishmania arabica DNA repeat. XX KW LARRP1; Repetitive element. XX NM LARRP1. XX OS Leishmania arabica OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania. XX RN [1] RP 1-212 RA Piarroux R., Fontes M., Perasso R., Gambarelli F., Joblet C., RA Dumon H. and Quilici M.; RT "Phylogenetic relationships between Old World Leishmania strains RT revealed by analysis of a repetitive DNA sequence."; RL Mol Biochem Parasitol 73(1-2), 249-252 (1995). XX DR GenBank; L42496; Positions 1 212. XX SQ Sequence 212 BP; 50 A; 68 C; 63 G; 31 T; 0 other; gcaagaatca agaggccgtg tcagagatgg acgaaggggg acggtgggag cgggaaagag 60 acgacgggta cgtggcgacg cccgtggaaa gaaagataag ccaaagacgc gtattccctt 120 gtgctgatgt gtgcccccct ctctgccaca gaccacgagc tcagctccac tccaccctaa 180 cgccccctcg ccgtcccctg tcacaggctc cc 212 // ID Gypsy-21_IS-I repbase; DNA; INV; 4173 BP. XX AC ABJB010901549; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_IS_; KW Gypsy-21_IS-LTR; Gypsy-21_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4173 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010901549; Positions 4491 319. XX CC Positions [3172-3627] - Integrase core CC 'TCACC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 137..2164 FT /product="Gypsy-21_IS-I_2p" FT /translation="MAAPVPAALIGKLDSFDESAEDWTSYIERVDEYFVLN FT GLPDEKKVAAIITSMGAKTYSILRKLTTPSKPSEKTYEEIKKHLSDYFSPA FT PLEMSERHRFYKRVQKEGETANEYMAELRRLSQNCNFGSFLDQALRDKFVC FT GLRSVQVQQRLLTMKKLTLKHALDEAVAHELAAKDIAEFKNKEETPTATVA FT QVAHVDKDKQKCFRCNRKGHAPAKCRFLEAVCHKCNKVGHISPACRSKASD FT ATKTSHRPISTSKKASRRPLKSQVHVVRDEDSAEFEFLAHIDNGNRSNPIW FT LKPRVEGQTLEMEMDTGSKYSLLPKALYEKHLLHVPLQATPVRFKTLTGET FT FSPEGVATAEVTFGKSTGQLQLYVVDTPGPPLFGRDWINHFNLLELSNVLK FT VDSQLGLKWQDQLDKRLAEFKVVFDDSIGKLKDFKLKLHLKSDAHPKFCRP FT RQVPYALKEKVTTELDRLESEGILTRVNHSDWATPIVPVVKSNGTVRICGD FT FKSTINPHLVVDQYPLPRIEDIFAKLSGGRKFSKIDLRQAYLQMEVDEESK FT PLLTINTEKGLYRYNRMIFGISPAPAVWQRTIDQILQGIPMCHATQDDILV FT TGVSEDSHLHNLTEVLRRLHEYGLKANLQKCSFFKDSVTYCGYKLDGTGLH FT NTHDKIRAVVEAPTPKSTTELRSFLGL" FT CDS 2374..4056 FT /product="Gypsy-21_IS-I_1p" FT /translation="MGAVLSHVMPDGSERPIAFGSRTLKPSERNYSQIDKE FT ALAIVWGIQKFHVYLYGRHFKLITDHQPLTRIFRPSSSIPAMQSARIQRYA FT LFLAGFNYEIEYRKSEDNANADSMSRLPLPEVPVDKPDEAEIFHMTQIEQL FT PVTAQQVRLNSQRDRVLSRVHFHILNGWDTCSQDDDILPYYRRRDELSTHQ FT GVITWGIRTVIPTKLQEKILTELHSGHIGIVRMKMLARSYVWWPTLDKDIE FT EQGKKCGSCRAQQNLPAKASPHPWELPSGPWKRIHVDFAGPFMNKMFLVVM FT DAYSRWPEVVIMNSTTATHTIEVLRAIFSRNGLPTRVVSDNGPQFASKEFA FT EFLKANGVQHTFSAPYHPATNGSAERLVQTLKNSLKSMKGEGSLYKCVATF FT LLKYRTTPHTVTGETPAKLFMGRELRTRLSLLQSDPVQRQLQGQERTLLNC FT RRRTRSFRQGDLVSVKDYRGNQEWIGGKIKCRNGPVSYEVQVGDAVWRRHI FT DQLRDSRVIESTPHPKPLGNQPTREEPQLNPVSPERPGVPGHEETSQATTR FT GVVHPAECVNTGQI" XX SQ Sequence 4173 BP; 1195 A; 1064 C; 1048 G; 866 T; 0 other; aattggcgac gaggaccgga tgagtatgta ctccgtccat ggggacattg cccccttctc 60 agcggtcggc ggaatcgaag aagagcgcct gtgaaacgag gcttagcgga cggcaggaat 120 ccgcgccaaa accacgatgg cagcaccggt accggcggca ctcatcggta agctcgacag 180 ctttgatgaa tcggcggagg attggacctc ctacatcgaa cgagtagatg agtacttcgt 240 gctgaatggc ctaccggacg aaaagaaggt cgcagccatc ataacaagta tgggagccaa 300 aacctactcc atcctaagga aactaacgac gcctagcaaa ccgtctgaga aaacttatga 360 agagatcaag aagcacctga gtgactactt ttcgcctgcg ccgttggaaa tgtccgagcg 420 tcaccgattt tacaaaagag tgcagaaaga gggggaaact gccaacgagt acatggctga 480 acttcggcgt ctctctcaaa actgcaactt tggatccttt ttggatcagg ccctgcgaga 540 caagtttgta tgcggccttc gctccgtgca agtccagcag cgcctgctga cgatgaagaa 600 gcttaccttg aagcacgcac tcgacgaggc tgtagctcac gaacttgctg cgaaggacat 660 cgcagagttt aaaaacaaag aagaaactcc gaccgccacg gtggcgcagg tggcgcacgt 720 tgacaaggac aagcaaaagt gcttcaggtg caatcggaaa ggccacgctc cagcgaagtg 780 cagatttttg gaagctgttt gccacaagtg caacaaagtg ggtcacatca gtccagcctg 840 tagaagcaag gcaagtgacg ccaccaagac ttcgcacagg ccaatttcga catcaaagaa 900 agcaagtcga cgacctttaa agagccaagt tcatgtcgta cgagacgaag atagtgctga 960 gttcgaattc ctggcacaca tcgacaatgg gaacaggagt aaccctatct ggctcaaacc 1020 aagagttgaa gggcagactc tggaaatgga aatggacaca ggctccaagt actcccttct 1080 accaaaagct ctctacgaga agcatctttt gcacgtgcca ctgcaagcaa caccagtgcg 1140 gttcaaaacc ctgaccggag aaactttcag tccagaaggt gtggccacag ctgaagtgac 1200 atttgggaag tcaacaggcc agctgcaact gtatgtggtg gacaccccag gacctccact 1260 cttcgggaga gactggatta accatttcaa cctcctggag cttagcaatg tgctaaaggt 1320 tgacagccaa cttggactga agtggcagga tcagctggat aagagacttg cggagttcaa 1380 ggtcgtcttc gatgacagca tcgggaagct caaggacttc aaattgaagc tgcacctgaa 1440 gagcgatgct cacccgaagt tctgtcgtcc acgtcaggtt ccatatgcct tgaaagaaaa 1500 agtaacaacc gaactggacc gtctggagtc ggagggcatc ctgacaaggg tcaaccacag 1560 tgattgggcc acacccatcg taccagtagt gaagtcgaat ggaactgtcc gtatttgcgg 1620 cgacttcaaa agcacaatca atccccacct tgtagtggac cagtatccac tacctcgcat 1680 tgaagacatc ttcgcaaagt tgtctggtgg aaggaaattc tccaagattg atctgcgaca 1740 agcttatctt cagatggaag tcgacgagga gtctaagcca ctgctaacta taaacactga 1800 gaaaggactg taccgttaca acagaatgat tttcggtata tcacctgctc cagctgtctg 1860 gcaacgaacg attgaccaga ttcttcaagg aattccaatg tgtcatgcaa cccaagatga 1920 cattttggtt actggtgtgt cggaagactc gcatctgcac aaccttactg aggtactgag 1980 gcgactgcac gagtatggac tgaaggccaa cctgcagaag tgctcctttt tcaaggactc 2040 tgtgacttac tgcggttaca aactcgacgg aacgggtctg cacaatactc atgacaagat 2100 cagggccgtg gtggaagctc cgacgcccaa gtccacaact gaactgcggt cgttcctcgg 2160 cttataaact actatggaaa gttcatccct gactgtgcga actttctacg cccactacat 2220 gcactcctag agaagtcatc cacctggaaa tggaccaagg agtgtgacaa ggcattcaag 2280 caagccaagg agatcatcgc gtctgacaca gtgcttgtct actatgaccc caacctggaa 2340 gtgcgtcttg catgtgatgc tggaccccat ggaatgggag cagttctctc tcacgtgatg 2400 cctgatggaa gcgaaagacc catcgcattt ggttcacgga cactaaagcc atcagaaagg 2460 aactactcac aaatagacaa agaggctcta gccattgtct ggggaattca aaagttccat 2520 gtgtacctgt atggaagaca cttcaaactg atcactgatc atcaaccact cacacgaatc 2580 ttcagaccaa gcagcagcat acctgcaatg cagtcagccc gcattcagag atatgcactt 2640 tttttggctg gattcaacta cgagattgag tatcggaagt cagaagacaa tgccaacgca 2700 gacagcatgt ccagacttcc tctgcccgag gttccagtgg acaagccaga tgaagcagaa 2760 attttccaca tgactcagat tgaacaactt cctgtcactg ctcagcaagt acgactgaat 2820 agccaacggg accgagtgct gtctagagta catttccata tccttaatgg atgggacaca 2880 tgcagtcagg atgatgacat cttgccttac taccggcgaa gagatgaact ctcgactcac 2940 caaggggtga tcacatgggg cattagaact gtcataccga cgaaactgca agagaaaatt 3000 ctcaccgaac ttcacagcgg ccacattggt atcgtgcgga tgaaaatgct tgcccgctcg 3060 tatgtctggt ggcccactct tgacaaggac attgaagagc aaggaaagaa gtgtggcagc 3120 tgccgtgccc aacagaacct tccggcgaag gcatcacctc acccgtggga acttccaagt 3180 gggccttgga agagaatcca cgtcgacttt gctggcccat tcatgaacaa aatgttccta 3240 gtcgtcatgg acgcttactc gcggtggccg gaggtagtga tcatgaacag cacaaccgcc 3300 acacatacca tagaagttct ccgtgcgatc ttctccagaa acgggctacc caccagagtt 3360 gtcagtgaca acggacctca atttgcatcg aaggagtttg cagagttcct caaggcaaat 3420 ggtgtgcaac acacattttc agctccttac caccctgcaa cgaatgggtc tgcggaaaga 3480 ttggttcaga cactgaaaaa cagcttgaag tccatgaagg gagagggaag tctatacaag 3540 tgtgtggcaa catttttgct caagtatcgc acgacacctc acacagtcac tggggagacc 3600 cctgcaaaac tgttcatggg acgggaactt cgcactcgtc tgtcactact tcaaagtgac 3660 ccagtgcagc ggcaacttca aggccaggag agaacactgt tgaactgccg acgacgaact 3720 cgttcatttc ggcagggaga cttagtctcc gtaaaagatt acagaggaaa tcaagagtgg 3780 attggaggaa aaatcaaatg tcgtaacggg ccagtatcct acgaagtgca ggttggtgat 3840 gcggtgtggc gccgacatat tgaccagctt cgcgattcaa gagtcatcga atccacacca 3900 catccaaaac ccttgggaaa tcagcctact cgagaagaac cacaactgaa cccagtatct 3960 ccagagaggc cgggagtgcc aggacatgaa gagacttcac aggcaaccac gagaggcgtc 4020 gtccatccag cggaatgcgt caacaccggc cagatttaat gtcgtgcccg acaagtcgtc 4080 gcacaaccgt gtgcgcgaca cccaaagatt cgaccccagc gtgtttcgac gcccacccat 4140 taagttggac ttgtaattta agaagggagg agt 4173 // ID BEL-53_CQ-LTR repbase; DNA; INV; 228 BP. XX AC AAWU01015731; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-53_CQ_; KW BEL-53_CQ-I; BEL-53_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 260-260 (2011). XX DR GenBank; AAWU01015731; Positions 17364 17591. XX SQ Sequence 228 BP; 83 A; 39 C; 51 G; 55 T; 0 other; tgttggacga gttgggaaat gaagtgcacc ttctgtgcaa aactttggtt tgttttcata 60 gcgtgtaaat tttccgtgta cattcaaatg acacaaataa taaagaaaca gagtttagcg 120 tgtaacagaa aaggcaagac gcgcgcgttt ttactcgatc cgttcgcgga aagaaaaaat 180 ccgaaaaatc gagaaaaata cagtccactc gaagtaaaac gggaaaca 228 // ID Copia-115_AA-I repbase; DNA; INV; 3988 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-115_AA_; KW Copia-115_AA-LTR; Ty1_copia_Ele6; Copia-115_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3988 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1414-1917] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 25..3306 FT /product="Copia-115_AA-I_1p" FT /translation="MGPSMKEGSGIPQLTGPNYENWKFRVKLYLDAAEVSS FT VLKAEAPAADHATRPKWDQLDRKAKSLLVGFLSDECLEVVREKDTAWSMWK FT ALEETFAKRSVTSQTLLRKQLARLRMTEGSSMRNHFVAFDDLVRQLKSAGA FT KLEESDLVSQLFLTLPDSFDPLVTALENLDEANLTLETVKQRLLAEESKRI FT DRQDYASDDKGAAFVGGKKNERKKVIREFTGRCHRCKKTGHMKKDCPWKKS FT DGNARYAVGNGKAVAFMGRSDESKTSSGGKILFCVDSGCSDHLVHDASRLK FT SSRKLKEPFVVEVAKEGVSLVGKSTGELTGFSNKNVEFRMKNVIFIPDLRE FT NLLSVKKLSQAGIDVLFTGRGGIERAEFKRNGEVIGVAYLRGNLYQLELEV FT GYVGQANITAAEASRLWHRRLGHASQQAMETIAKHEMATGFTNKVDRIGFC FT DTCVMGKQCREPFDGVRERATRPLERIHSDVCGPIDPPAWDGSRFFVSFID FT DWTHFAMIYPIKRKSEVFSCFKQYEALVTAHWETKISKLTVDQGREYFSNE FT QKRYYQKRGIQVQPTVAYSPQQNGVAERFNRTLVEKVRSMLIDSQMPKRMW FT SEAAMAATYILNRSPTTALDGKVTPVEMWTKTKPDLSKLRIFGCKAMMWVP FT NQLRKKLDPKSREAIMIGYAPNGYRLWDRSSRKVIIARDVKFDEDSFPLAC FT NDAEAKQPLVIRTFYEPEGEPLDTTRADEVEVEEANDSNSEYGEADSDEQM FT SDTSNPGALPSQERREDCLESTTRRSERERKLPGKLLDFFTGFRASTASEI FT SASSDPPSTYKEIEHREDRDAWLAAISDELQSMEANNVWKFVPCPVGVKLL FT QSKWVFCVKEDADGNVVRHKARLVVKGFLQKPGIDYHETYAPVAKLTTIRV FT VLAVALQKGYYIHQTDVKTAFLHGELKENIFMAVPDGVQAAPGTVCQLRKS FT LYGLKQSPRCWNEKLNEVLLKLGFKRSQHDYCVYTRIDERGSDVIYVILYV FT DDLLIVGAKLNTIEDVKKKLSQHFKMTDCGEAKHFLGMKISYDRKQGRMEL FT SQEASIKKILRNFSMEDCNTTKTPMEKGLQLFQATSE" XX SQ Sequence 3988 BP; 1086 A; 840 C; 1151 G; 910 T; 1 other; tccgaagaga acctccttta ggttatgggc ccctcaatga aagaaggttc cggtatcccg 60 cagctgactg gtccaaatta tgaaaattgg aagttccgag tcaagcttta cctggatgcg 120 gcggaagtat cttcagtgtt gaaggcggaa gctccggctg cagaccatgc gacaagaccg 180 aagtgggatc agctggacag aaaggcaaaa tcgctgctcg tcggcttcct gtcggatgaa 240 tgtctggaag tagtccgtga aaaggacacc gcctggtcca tgtggaaggc ccttgaggag 300 acatttgcga agcgatccgt gaccagccaa acccttttga ggaagcagct ggcaagatta 360 cgtatgacgg agggatcttc tatgcggaat catttcgtgg ccttcgatga tcttgttcgc 420 caactgaagt ctgcaggtgc caagctggag gagagtgact tggtgtcaca attattcttg 480 accctaccgg atagcttcga cccgttggtg accgcacttg aaaatctgga cgaggcgaat 540 ttgaccctgg aaacggtgaa gcagcggctg ctcgctgaag agtcgaagcg aattgatcgt 600 caagattatg caagtgatga caagggtgca gcattcgtcg gagggaagaa gaatgaaaga 660 aagaaggtta tccgtgaatt tactggaaga tgccaccggt gtaagaagac cggccatatg 720 aagaaggact gtccctggaa gaagtccgat ggaaacgctc ggtatgcggt cggtaatggg 780 aaggcagtcg cttttatggg aagatctgat gagtcgaaga cgtccagtgg cggtaaaatc 840 ctgttctgtg tggactcggg gtgcagtgac caccttgtgc atgatgcaag ccgtctgaag 900 tcgtcacgga agttgaagga accgtttgtc gtagaagttg cgaaggaagg cgtgtccttg 960 gttggtaagt ctactggaga gctgacagga ttttcaaaca agaacgtgga attccgtatg 1020 aagaacgtta tattcattcc ggatttgcgg gaaaatttac tttccgtgaa gaagctttct 1080 caagccggaa tcgatgtcct gtttactggt cgaggaggta tcgagcgtgc ggaattcaag 1140 cgtaacggtg aagtcattgg cgtggcatac ttgagaggta atctgtatca gctggagctg 1200 gaagtcgggt acgtcggtca agctaacatc accgcggcgg aggcaagtcg attgtggcat 1260 cgtcgattgg gacatgctag tcaacaggcc atggaaacca tcgcgaagca tgaaatggcg 1320 accggtttta cgaacaaggt ggaccgcata gggttctgtg atacgtgtgt gatgggcaag 1380 caatgccgtg agccgtttga cggagttcgt gagagagcca ctcgtccgtt ggagcgaatt 1440 cactcagacg tgtgtgggcc gatagatccc ccggcctggg acggatcaag atttttcgtt 1500 tccttcatcg atgattggac ccattttgca atgatttatc ctatcaagcg caagagtgaa 1560 gtcttcagtt gcttcaagca gtatgaagct ttggttaccg cccactggga aacgaagatc 1620 agcaagctta cggtggatca aggtcgcgaa tatttttcga atgagcagaa gcgatactat 1680 cagaagagag gcatccaagt tcagccaaca gtggcgtact ctccgcagca aaacggagtt 1740 gctgaacgct ttaatcgaac cctagttgag aaggtaaggt caatgttaat tgattcccag 1800 atgcccaagc ggatgtggtc agaagcggcc atggcggcaa catacatcct gaatcggagt 1860 cccacgacgg cgctcgatgg aaaagttacc ccagtggaaa tgtggacgaa aacaaagccg 1920 gacctgagca agcttcgaat tttcggttgc aaggccatga tgtgggtccc gaaccagctg 1980 aggaagaaac tagatccaaa gagccgtgaa gcaattatga ttggctatgc gccaaatggt 2040 tatcgcttgt gggacaggtc gtcaagaaag gtcattattg caagagatgt caaatttgat 2100 gaagacagtt ttccgcttgc ctgcaacgat gctgaagcaa aacagccgct ggtgattcgt 2160 acgttctatg agccagaggg ggagcccttg gataccacta gagcagatga agttgaggtt 2220 gaagaggcaa acgattcaaa cagcgaatat ggagaagctg attcggatga acaaatgtcc 2280 gatacatcaa atcctggggc gctcccttcg caagaaagac gtgaagactg tttagagtca 2340 accacgaggc gcagcgaacg ggagcgcaag ctcccaggta agctattaga ttttttcact 2400 ggatttagag catccactgc ttctgagatt tccgcatctt cagatccacc atccacgtac 2460 aaggaaatcg agcatcgtga ggatcgtgac gcctggttgg cagcgattag cgatgagcta 2520 caatccatgg aggccaacaa cgtatggaag tttgtcccgt gtccggtcgg agtgaagctc 2580 ctgcaatcca aatgggtgtt ctgcgtgaag gaggatgcgg atggaaacgt tgttcgtcat 2640 aaagcgagac tggtcgttaa gggatttctc cagaagccgg gaatagacta tcacgaaacc 2700 tacgcaccag tggccaagct gacgacaatc cgagtggttc tggcagttgc tttgcagaag 2760 gggtactaca ttcatcaaac ggatgtgaaa acggctttcc tgcacgggga actgaaagag 2820 aatattttca tggcggtccc tgatggcgtg caagcagcac ctggtacggt gtgccagctc 2880 cggaaatcat tgtatgggtt aaagcagagc ccgcgatgct ggaacgagaa gcttaatgaa 2940 gttttgttga agctcggttt caagaggtcg caacacgact attgtgtgta cacccggatc 3000 gacgagaggg ggagtgacgt aatttacgtg atcctgtatg tcgacgacct gctgatcgtt 3060 ggagcgaagc taaacaccat tgaggacgtg aagaaaaagt tatcccagca tttcaaaatg 3120 actgactgcg gcgaagcgaa gcatttcctc ggaatgaaga tttcttatga taggaagcaa 3180 ggacgaatgg agctgtcaca agaggccagt atcaagaaga ttctgaggaa cttttctatg 3240 gaagactgta atacgactaa aacaccgatg gaaaagggcc tgcagctatt ccaagctacg 3300 tcagaaktaa cagaagcacc atatcgagaa ttactgggga gtttaatgta cattatgatg 3360 gctactcgac cggacatatg cttcccagta ggataccttg gacgttttca gcagcgacct 3420 gatgcttcac gctggacagc tttgaagagg gtggtgcggt tcctgaaggg caccatgcag 3480 acgtccctga agttctcacg caatgaacaa atggaaccct tggttggata tgccgatgct 3540 gattgggcaa ctgaccaaca ggacagaaaa tcggtgagtg gttttctatt tcagatattc 3600 gggaacgctg tttgttggtc aagtaagaag cagacgacgg tagcgacatc atccagtgag 3660 gccgagtatg tggcactagg ttctggcgtg acagaagcta tatggctggc tggactgctt 3720 gacgaccttg gcattcgagg cacaacaccg gtgacaattt atgaagacaa tcgtgggtgc 3780 ataggaatgg ccaagaactt ggagagcaag cgtgcgaagc acattgatat caagcaccat 3840 tttataaggg accatgtagc agccggtaac gtcaaggtgg agccaatcgg gactgagaac 3900 caactcgctg atattttcac taagtcgttg gatgtcgggc gtttccagaa actatgtcga 3960 actatcggac tccacgatcg agaggggg 3988 // ID Crack-3_HM repbase; DNA; INV; 4917 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4917 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1934-1934 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 192..971 FT /product="Crack-3_HM_1p" FT /translation="MAVTLKEIKKMMKEMLNDFKKETLVLLKNQEKTVIDI FT ISANTKILNERLDKFEKNITQNANKIKKIEIELEDIKESLNFNEHVIDKKI FT ESNNKHFLNKIDINKQLEKERTNNEFLNEKIRNLEDRSRRNNLRIEGICED FT EVESWDKTEEKVHSFFLQKLGIKSIEIERAHRTGPKKDGRSRVIVLKLQQY FT KDKAKILKESARLRGTNIYVNEDFSRETVAIRRKLFAEVKERRLNGENVSV FT RYDKIISFKTNILKNVPNK" FT CDS 1023..4544 FT /product="Crack-3_HM_2p" FT /translation="MAFMNDFESISFNFFEGNHFSSDKNIDPDNNFYYDTF FT TDCAYFYPKELEGFLFQNAIKKSSNQIRILHLNIRSLNRNFEKLLNLLEET FT KNLFNIICLTETWVTLNDFDNNFNIPHFKIISQERKTSKRGGGLLIYVHES FT IFYNMRNDLSVSDGDKEVLTIELLLEKTKTFILSCCYRPPDGVSENLSMFL FT QHIVNTSDAEKKKSFVIGDFNMNCFLYNDEYKVKKFYDTIFETGSIPLINR FT PTRVTINSATLIDNIITTDIFNKDIQKGILKTDISDHFPIFLIINSNLEKN FT INTPTVVRKRVFKQKNFELFKDQLSLLHWNHIDFNDNANNIYEAFYKTLFS FT VYDANFPIVEKIIKINKSSKSPWITKGLKKSSKIKQKLYIKYLKTKTSASE FT KIYKNYKYLFEKIRKNLKRNYYSELINKFKNDLKRTWQIMKEITGKQKSCL FT GFLPQMIKVDNISLYDPRAISHEFNKFFIGIGPSLSNRISNTTASFNDFLV FT PMDNCICSDELSSELSFEEFEVALNSVKRNKATGADEINGNVIIDCFDILK FT HILFKVFRASINQGIFPEQLKVAKVTPIFKEGDRSNISNYRPISVLSVFSK FT ILERIMSNRIYKHLNKNNLLYNKQFGFKKDNSTEHAIIQFVHEISKSFEKS FT QYLLGIFIDLSKAFDTVDHHILLQKLEYYGINGIALKWFRSYLTKRKQFVI FT SNDGYQNNCLDIVCGVPQGSILGPLLFLIYINDLNKASNLISIMYADDTNL FT FLSNSNIYELFTTMNKELKHISNWFKCNKLTLNINKTKWTLFHSHAKKRFL FT PINLPQLFIDEIEIKNDSVTKFLGVYLDENISWNQHINYISTKISKNIGIL FT YKARNYLNKNCLKQLYYSFIHSYINYANTAWGSTVKSKLKLLYRRQKHAIR FT IINYADRFTHTKPLFTEMRLLNLFELNVFKVLCFVYLWRNNQSPTVFNNTF FT KLKPNNKYDMRNINFLKEPFCHTKFNQFCIAYRGPHLWNNIVLLYKARNYL FT NKNCLKQLYYSFIHSYINYANTAWGSTVKSKLKLLYRRQKHAIRIINYADR FT FTHTKPLFTEMRLLNLFELNVFKVLCFVYLWRNNQSPTVFNNTFKLKPNNK FT YDMRNINFLKEPFCHTKFNQFCIAYRGPHLWNNIVLQNFDLPTSFYLFKLN FT LKKCIISLDNILRYF" XX SQ Sequence 4917 BP; 1928 A; 735 C; 657 G; 1597 T; 0 other; cacgcaagct ttcgaagtga acagacgctt ttgtgcgcag aaaagtatta aaaattctac 60 tgaattaaat aaaagttatc gactagaaaa aagtaaataa ataaataaag tctacggaat 120 tgaagtataa taaagaaaca actaaaaaac ctacgaatct actaactaaa aagatcctat 180 caacatcagg tatggctgta acacttaaag aaataaaaaa aatgatgaaa gaaatgctta 240 atgattttaa aaaggaaact ttagttttat taaaaaacca agaaaaaaca gtaattgata 300 ttattagtgc aaatacaaaa atattaaacg aacgattgga taaatttgag aaaaatatca 360 ctcaaaatgc aaacaagatt aaaaaaattg aaatagagct tgaagatata aaagaaagtt 420 taaacttcaa cgaacacgta attgataaaa agattgaaag taacaacaaa cattttttaa 480 acaaaatcga catcaacaaa caattagaga aagaaagaac taataacgaa tttttaaacg 540 aaaaaatccg taacctcgaa gatagatcgc ggagaaataa cttaagaatt gagggaatat 600 gtgaagacga agtagaatct tgggacaaaa ccgaagaaaa agttcattca ttttttctac 660 aaaaactggg tataaagagt attgaaattg aacgagcaca tcgcactggg ccaaaaaaag 720 atggacgttc gagggtgata gttttaaaac ttcaacaata caaagacaaa gctaaaattt 780 tgaaagaatc agctcgtctc agaggaacaa acatttatgt aaacgaggat ttttctcgcg 840 aaacagttgc aatcagaaga aaactatttg cggaagtaaa agaacgacgt ttaaacggtg 900 agaatgtttc ggttcgttac gataaaatta tttcttttaa aactaatatt ttaaaaaacg 960 ttccaaacaa ataatcgaat tacgcattct aaaaaagaaa ccttaaccta attttaataa 1020 tcatggcttt tatgaacgat tttgaatcca tttcttttaa tttttttgaa ggaaaccact 1080 tttcctctga taaaaacata gacccagata ataattttta ttacgatact tttacggatt 1140 gtgcttattt ttatcctaaa gagcttgaag gttttctttt tcagaatgca attaaaaaaa 1200 gttccaatca aattagaatt ctccacctca atataagaag cttaaatcgg aattttgaaa 1260 aacttttaaa tttattagaa gaaacaaaaa atctatttaa tatcatttgt ttaactgaaa 1320 catgggttac attaaacgat tttgacaata attttaatat tcctcatttt aaaattatct 1380 cacaagaaag aaaaacgagt aaacgaggtg gagggcttct aatttatgtt catgaaagta 1440 tcttttataa tatgagaaac gatttaagcg tctccgatgg cgataaagaa gttttaacaa 1500 ttgaactttt actcgaaaaa acaaaaacct ttatactaag ctgttgttat cgaccacctg 1560 acggcgtgag tgaaaactta agcatgtttt tacaacatat cgtgaatacc agcgacgcag 1620 aaaaaaaaaa gagctttgta attggggact ttaatatgaa ttgttttctt tataacgatg 1680 agtataaagt taaaaaattt tatgacacaa tttttgaaac agggtctata cccttaatta 1740 atcgccctac tagggtaaca ataaactcgg ctaccctaat agataatatc attacaactg 1800 atatttttaa taaagatatt caaaaaggta ttttaaaaac tgatatatcc gatcattttc 1860 ctatattcct tattataaac tcaaatttag aaaaaaatat taacacccct acagttgtaa 1920 gaaaacgtgt ctttaaacaa aaaaactttg agctatttaa agatcaacta tctttgctcc 1980 attggaatca tattgatttt aatgacaatg caaataacat ctacgaagca ttttataaaa 2040 cattgttttc cgtttacgac gctaattttc caattgttga aaaaataata aaaattaaca 2100 aaagctcaaa aagtccctgg attacgaagg gattaaaaaa atcgtcaaaa attaaacaga 2160 agttgtatat taaatattta aaaacaaaaa catctgctag cgaaaaaata tacaaaaact 2220 acaaatatct atttgaaaaa atacgtaaaa acttaaaaag aaattactac tcagaactaa 2280 ttaataaatt taaaaacgat ttgaaacgca catggcaaat tatgaaagaa attactggaa 2340 aacaaaaatc gtgcttaggt tttctgccgc agatgattaa agttgataac ataagtttat 2400 atgacccaag agccatatcc catgaattta ataaattttt tattgggata ggtccttcct 2460 tatcaaatag gatttcaaat acgacggctt cttttaatga ttttctcgta cctatggata 2520 attgcatttg ctctgatgag ttatcttcag aattatcttt tgaggaattt gaagtagctt 2580 tgaattctgt gaaaagaaac aaagcaacag gagcagatga aattaatgga aacgtaatta 2640 tagattgttt cgatatatta aaacatattc tttttaaagt ttttagagct tctataaatc 2700 aaggaatttt tccagaacaa ttaaaagttg ctaaagttac tcctatcttt aaagaaggtg 2760 acagatcaaa tattagcaac tatcgaccca tttctgttct ttcggtattt tcaaaaattt 2820 tagaaagaat aatgtctaat agaatataca aacatctcaa caaaaataac ttactgtaca 2880 acaaacaatt tggttttaaa aaagataact caacagagca tgcaataatc cagtttgtgc 2940 atgaaatctc aaaatctttt gaaaaatcac agtacttgtt aggtatcttc attgacctat 3000 caaaagcttt cgacacggta gatcatcaca ttctgcttca aaaacttgaa tattatggaa 3060 ttaatggtat tgccttaaaa tggtttagaa gctatttaac aaaaagaaaa caatttgtaa 3120 ttagtaacga cggctatcaa aataattgtc ttgatatcgt ctgcggtgtt cctcagggtt 3180 caatcctcgg gcccctactt ttcttgatat atataaatga tctgaataaa gcttctaacc 3240 ttataagcat tatgtacgca gatgatacca atctattcct ctcaaacagt aatatatacg 3300 aactctttac tactatgaat aaagaattaa aacatatatc aaactggttt aaatgtaata 3360 agctaacttt aaatattaat aaaaccaaat ggactctatt ccattcgcat gcaaaaaagc 3420 gatttttacc aataaatttg ccgcaacttt ttattgacga aattgaaata aaaaatgatt 3480 ccgttacaaa attcttaggt gtttatctcg atgaaaacat ctcctggaat caacatatta 3540 attatatatc tactaaaatt tcaaaaaaca ttgggattct atataaagct cgtaactacc 3600 taaataaaaa ttgcttaaaa caactttatt actcttttat ccacagttac ataaattatg 3660 ccaacactgc ttgggggagt actgtaaaaa gtaagttaaa acttctttat cgtcgtcaaa 3720 agcatgcgat ccgcataatt aactatgccg atcgttttac acacacaaaa cctcttttta 3780 ctgaaatgag attgttaaat ttatttgaac tcaatgtatt taaagtttta tgttttgtat 3840 atctatggag aaataatcaa tcccctaccg tttttaataa tactttcaaa ctcaaaccta 3900 acaacaaata cgacatgaga aacataaact ttttaaaaga accattttgt cacacaaaat 3960 ttaatcaatt ttgcattgct tatcgtggac cccacctatg gaataatata gttttgctat 4020 ataaagctcg taactaccta aataaaaatt gcttaaaaca actttattac tcttttatcc 4080 acagttacat aaattatgcc aacactgctt gggggagtac tgtaaaaagt aagttaaaac 4140 ttctttatcg tcgtcaaaag catgcgatcc gcataattaa ctatgccgat cgttttacac 4200 acacaaaacc tctttttact gaaatgagat tgttaaattt atttgaactc aatgtattta 4260 aagttttatg ttttgtatat ctatggagaa ataatcaatc ccctaccgtt tttaataata 4320 ctttcaaact caaacctaac aacaaatacg acatgagaaa cataaacttt ttaaaagaac 4380 cattttgtca cacaaaattt aatcaatttt gcattgctta tcgtggaccc cacctatgga 4440 ataatatagt tttgcagaat tttgatttac caacttcctt ttaccttttc aagctcaatc 4500 taaaaaaatg tattatttca cttgataata tactcagata cttctaatat tttgaaaatt 4560 gttatacaac ttgaattcag tactttcttt attttaattt ttactttatt ttttagtatc 4620 ttatactggt atttattttg cttgtttgtg ctttataagg ttctgatgac aagatcttgt 4680 gatcttcttt cagatcctag cttatgtttc attgtaagca gtttatgtat gtgcgcatct 4740 acgcatttgt atgtgtatat atgcttatat ttatatatgt atgtccgtgt tttttcttat 4800 tttaatttag ttgataaatg ttgtaacacg aatttaattg taacacgata tatatggcat 4860 atataattgt actttgacat tttaatgtaa gcaaaattaa aaaaaaaaaa aaaaaaa 4917 // ID Gypsy-59_AA-I repbase; DNA; INV; 5155 BP. XX AC supercont1.119; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-59_AA_; KW Gypsy-59_AA-LTR; Gypsy-59_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5155 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.119; Positions 1702987 1708141. XX CC Positions [4119-4580] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1575..2861 FT /product="Gypsy-59_AA-I_1p" FT /translation="MPMVVQQIADPKSRSFGNGDPNNSKKCYNCNRYGHIA FT RETDKCAARDFTCFNCGTKGHFKVCCRKRKHNEPRNEHSFKKAQKYKRIHA FT IQGNHEDHKAVFFVDDKQLNEVLRMDVGGVEIALLVDSGSPANIINAQTYE FT FLKRNNAHILNERNPDHEDMKLKSFASDGNILFSSAFEAQIKIPGEDSGIW FT SHILVAPQGQTNLLSKSTAFALGVLKIGYYVNSVSSENISGSLSEFPKVPD FT VSLRIQIDESVQPIAQAARRFPIAMEADVENAIQELLQKNIIERAEGPLTW FT VSPLVPIRKTDGRIRLCVDMRAANKAVRRENFPMPNIDVAMASIRKVSRLS FT KIDLEAAYYHFELDRQSRNITTFVARSGVYRFCRLMFGIKSAPELFQREME FT NLFRGIKGLIVYMDDLLIYAETDEEHDEILKQVTI" FT CDS 2964..5054 FT /product="Gypsy-59_AA-I_2p" FT /translation="MNMKINEQKSRFGVKEVIFLGHHVSTEGIKPTDEKIR FT AILNLQPPSSTTELRSLLGLINFVGKFVPNLATMTRHMRSLLLKQNSFVWG FT EEHSRELEAIKTVLGKVETLGYFDPTDETRLITDASPFGLGGILVQIKNGV FT SRTISCISKSLADHEKKYCQTEKECLAIIWAMEKLFIYLYGIRFTLVTDCK FT PLEYLFNKSQSKPSARIERWILRLLSFDFTVEYEPGQKNLADPLSRLTQGS FT AIAETEPDVIAWLSEQVRPTAISLDDLENASEVDEELQKVKEAVHSRNWDD FT VPSEYKNATIKEDLSLFGELILRGDRIVIPKALRERVINLAHLGHQGSTSM FT KAQLRAKVWFPAMDKMIDSVVKNCKPCRMTSLPDKPNPMSRRLPTEPWQDV FT AIDFKEGLPGDMSLLVVVCYTSRFIQVEPMKPATTQRVIGALLKMFSSFGI FT PRSITADNGPQFKAVEFKNFCFSYGIHLNISTPYWPEQNGAVERQMRNIGK FT RIKISSIQNTDWKTDLNEYLMLYHSTPQETTGLSPYKMIFGREVYNGIPSI FT HQPPTLALEAAKDRDMALKEHQKELADTKRQAKHHDLESGDTVLQRNLKKG FT ALQPNFTAEEFKVIGVNKGAVQIESNETGNVYVRNSAHLKKLKGNSTSNDY FT TISETAGPSNTTGLDEDDGDKPEASSTNNRQMLERRRKIPAKYEGFVL" XX SQ Sequence 5155 BP; 1694 A; 949 C; 1205 G; 1307 T; 0 other; atggcgacga gaatggaaat cggtacgtga aaattatcag tttagtgcaa attgtagttt 60 tagtgttaat attacagtat ggtgtgatca tttgagccaa gaaaaaaaaa atatctggaa 120 ataaattgca tgaaaagtga acaaaatcat aagaggaatt gaaaaaaaaa tgaaaccaac 180 gcagagtgaa gaaaaacaaa atggcggtaa aagtgagaga ggaataatgg tgaaaaaaat 240 tctttggcaa gagtgacgcc atcgataaag ccagaaaaca tgaaaaaaaa aacagcggca 300 atggctcagg gagacttaga aaattttgtg cgttttacat acaacgaaga taattaagca 360 cagttgcaga tttgttttgg ccagtgatca tttttcttga cgattccaat atgttttttt 420 tgcgatcgaa tcaaacaaat gtaacgtaga tgagcttcgt gcagtagtag taaatggttt 480 ttgtgttgct tcgtgcattc gatgctcttg cagggaatcc tgaaagatgg agttgctttg 540 tgctgtttat tgtgctgcgt gtgcattttt tttgctacgt gctggatatg ctgtgtgatt 600 gctacgtgct gggtttgcta cgtgctgggg taattgtcca gtttcgatga gaagtaatat 660 ttgaggttat gcttcgtgcc gagcttatgt gtgtgtgtgt ttggttatgc ttcgtgccga 720 gcttcgtgca aaaatagaaa caagaagaaa cacagaagag tgaatataga aaaacgccgg 780 cattatgtat tgacccgaag aaacgacatg aaatcgaaaa aaatgagtta tatatgataa 840 gtgcacgcat gataaataaa aataagtgat aatggaagca aaaaaaatgt atcgggagag 900 ttatttatga tgtttggttt cagatgcatc ggtcatgcca ccgttcaaaa ttggcagtga 960 tcccagaaat gattggctca aatggaaaag agcgttcgaa aggtttctca aagcaaattc 1020 agttgaagaa gatggagaaa aatgcgatct gttactggtg ctgggtggca tggagctaca 1080 atcctactat gataaggtat tcaaatggga ggtgcaaacg cctacggctg atggaaatgg 1140 ggatcttatg caagtgaagt atgattcagc tgttttgtcc ctggacaaat attttgcacc 1200 tcaatcgaaa aagcgtttcg aaaggcatct tctcagagcg atgaaacaag aagaacagga 1260 accgttcgag gaatttgtct tcaggctgca ggatcaagca aaccggtgtg cgttcgctga 1320 tgtcgacgac atgattgtcg accagataat cgaaggctgc aaatcatcag atctaaggaa 1380 gcgcctgctg acagaggatg tgactttgaa cgattcgatt atactcggca agaccctcga 1440 ggaagtacag aagcaaacca aggaatacga aagacatcca ctggcggtaa aaacattaga 1500 attacgaaat gagtaagtac tatttttttt attaatttat tgttgtattt attattcacg 1560 gatagatacg agagatgccc atggtggttc agcagattgc agaccccaaa tctcggtctt 1620 ttggtaatgg tgacccaaat aactctaaga aatgttacaa ctgcaataga tatggtcaca 1680 ttgccagaga gacagataaa tgcgctgcta gggattttac gtgtttcaat tgcggaacta 1740 aagggcattt caaagtatgc tgtcgaaaaa ggaagcataa tgaaccaaga aatgagcaca 1800 gtttcaaaaa agcgcagaag tataaacgga tccatgccat tcaaggaaac catgaagatc 1860 acaaagcggt gttttttgtc gatgacaaac aacttaatga agtgcttcga atggatgtgg 1920 gaggggttga aattgcattg ttggttgatt ctgggtctcc agctaatatt atcaacgctc 1980 aaacatacga gtttctcaaa aggaacaacg cacacatttt gaacgaaaga aatccagatc 2040 acgaagatat gaaactgaaa tcctttgcct ccgatggaaa cattttgttc agtagcgctt 2100 tcgaggcaca aatcaaaatt ccaggagaag attctggaat atggtcccac atccttgtcg 2160 ctcctcaagg tcaaacaaac ctgctaagta aaagtactgc atttgctttg ggagtactta 2220 aaattggcta ttatgtgaac agtgtaagct ctgagaatat ttcaggaagt ctttcagagt 2280 ttcccaaagt gccagacgtg tcattgagaa tacaaattga cgaaagcgtt caaccaattg 2340 cacaagctgc caggagattt cctattgcaa tggaggcaga tgttgaaaac gccatacaag 2400 aacttcttca aaagaacatc atcgaaagag cagagggtcc attgacgtgg gtttcaccgt 2460 tggtgccaat tcgaaaaact gatggacgga ttcgtttatg tgtggacatg agggcagcaa 2520 acaaagcagt acgacgagag aactttccaa tgccgaacat cgacgtggcg atggcatcca 2580 ttcgtaaggt atcaaggctc tctaaaattg atcttgaagc ggcgtattat cattttgagc 2640 tggatcgcca aagccgaaat atcaccacat tcgtcgcacg cagcggcgta tatcgctttt 2700 gcagactgat gttcgggata aagtcggcac ctgaactatt ccaaagagag atggaaaatc 2760 tcttcagggg aattaaagga ctgattgtct acatggacga cttactgatc tatgcagaaa 2820 cggatgagga gcacgacgaa atcttaaagc aggtaacaat ttgaatcttt gtcttttgta 2880 gggtaataga tgagaatata cagagatgtt aagcatatgt acatttgatt ctgtttgaac 2940 aggttttgga acgtattgaa caaatgaaca tgaagataaa cgagcaaaaa tcacggtttg 3000 gtgtcaaaga agtcatcttt ctgggccatc atgtttcaac agagggaatc aaacctactg 3060 acgagaaaat cagagcaatt ttgaacctgc aacccccctc atcaacgacg gaattaagat 3120 ctcttttggg tttgataaat tttgtcggta aattcgttcc aaatctggcg actatgaccc 3180 ggcacatgcg atccttattg ctaaaacaaa actcttttgt gtggggcgaa gaacattcac 3240 gggagctaga ggctatcaag acagtcctag ggaaggtgga aaccttagga tattttgacc 3300 cgactgacga aacgcgatta atcacagatg cgagtccatt tggattagga ggcatactag 3360 tccaaattaa gaatggcgta tccagaacca tttcatgcat ttctaagagt ttggcagatc 3420 atgagaagaa atactgccaa acagaaaagg aatgtcttgc catcatatgg gcaatggaaa 3480 aactgttcat ctatctgtac ggaatacgtt tcactctagt aacggactgt aagccgcttg 3540 aatatctgtt taacaaatca cagtctaagc cttccgctcg tatcgagcgt tggattctgc 3600 gtctattgag cttcgatttc accgtggaat atgaacccgg acagaaaaat ctagccgacc 3660 cactatccag attaacacaa ggatcggcca ttgctgaaac tgaaccggat gtgatagcgt 3720 ggttatcgga acaagtacga ccaaccgcta ttagtttgga tgatttggaa aatgcctctg 3780 aggtggacga ggagcttcaa aaggtcaagg aggcagtaca ttcgagaaat tgggatgacg 3840 ttccgtcgga atacaaaaac gcaactatta aggaagattt gtctttgttc ggagaactaa 3900 tccttagggg cgacagaatt gttataccca aagcgttgag ggaaagggtg atcaatctgg 3960 cccaccttgg acatcaagga agtacttcca tgaaagccca acttcgagct aaggtgtggt 4020 ttccagccat ggacaaaatg atagattctg tcgtaaaaaa ttgcaaacca tgcagaatga 4080 catctcttcc cgacaaaccc aatccaatgt ccagacgact tcctacagaa ccctggcaag 4140 atgtagcaat tgattttaaa gaaggccttc ctggagatat gtccctgctg gtcgtggttt 4200 gctacacttc acgtttcatt caggttgaac cgatgaaacc ggccaccact caacgtgtta 4260 tcggcgccct tttaaaaatg ttcagttcat tcggaatacc aaggtcaatc actgctgaca 4320 atggcccaca gttcaaagcc gtcgagttca agaatttctg cttcagttac gggatacacc 4380 taaacatttc taccccatat tggcccgagc aaaatggcgc tgtagaaaga caaatgcgca 4440 acataggaaa acggattaaa atcagctcca ttcaaaatac tgactggaaa actgatttga 4500 acgagtattt aatgctgtat cactctactc cacaagagac cacggggtta tccccataca 4560 agatgatatt tgggagggag gtatacaacg gcataccgtc catacaccag ccaccaacac 4620 tggccttgga agctgccaaa gatagggata tggcattgaa agaacatcaa aaggaactcg 4680 cagatacgaa gcggcaggca aaacatcatg acttggagag cggagataca gtattacagc 4740 ggaatctaaa aaagggtgcc cttcaaccga atttcacagc cgaagagttc aaagtgattg 4800 gtgttaacaa aggagcagta caaatcgagt ctaacgaaac tggtaatgtt tatgtgcgga 4860 acagcgctca tctgaaaaag ctgaagggaa attcaacatc gaacgactac acaatttcgg 4920 aaacagcggg tcccagtaac acgacaggac ttgacgaaga cgatggagat aaaccggaag 4980 cgtcgtcaac gaacaatcga caaatgctgg agagacgccg aaagatacca gctaagtatg 5040 aaggattcgt attgtaagta acgataagga tgcgggtaag tcaatcttta ttgtaaataa 5100 atatcgggta attcaaaaga gttttatagt tttcttgact aaaaaaaagg aggga 5155 // ID Gypsy-36_CQ-LTR repbase; DNA; INV; 163 BP. XX AC AAWU01012843; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_CQ_; KW Gypsy-36_CQ-I; Gypsy-36_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 452-452 (2011). XX DR GenBank; AAWU01012843; Positions 93223 93385. XX SQ Sequence 163 BP; 45 A; 43 C; 30 G; 45 T; 0 other; tgtggtaatg tgctgattct cgtgccaatt cataaacgtg caagcccact gggtcgtctc 60 gctctcggca ctcgacgtgc gtaaataaac aatgtgctta aatctactcc gagttagtat 120 tcatttcacg acgataatcc aactagtacc aaaccaccta tca 163 // ID BEL-46_AA-I repbase; DNA; INV; 6638 BP. XX AC supercont1.246; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-46_AA_; KW BEL-46_AA-LTR; BEL-46_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6638 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.246; Positions 164315 157678. XX CC 'ACACT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(25..2457,2461..6636) FT /product="BEL-46_AA-I_1p" FT /translation="MPGASKKTGNRRSVIRQLTEEESSETPRCPSCDRPDV FT ADKWMVQCDLCERWFHFSCARVDESVKDRSFACVACALRHGRESTRSIASS FT TSSARRRLELLRLEEEKELQERLLAEQAAAEAALKKKAMEEEMERMKSTMH FT ERLEIAKRYLTKKYEVLQEEEYSNDGRSRKSHSSSRSKLSDVRSWVENHDM FT VLETGFGINGVPTNSMPVTDADPNSISAGSSGNGPPLKTSTKSDVLNEHGV FT GVATGSYNREDIASWGILVPRSGQNAADSVPVPPQSLPTTSGQDSSTPVYE FT TRAARIVNALAATSISPLTASQQGEPLTSNINISSPTASEKSRAELEQELQ FT EVLQQLADLKRSTAGQNRPSSASGNCSGSIVVTTSLGIGTGTSRLHAPLQS FT APVVGREAYIPRSNPIGTQASFTLATNCTTQDTISRSSFLPHWNQMSSSNY FT SSNNQPGFPHYTANFAYPSVSIGVASQQRESNSHWPPFPVGAPSCPYSASE FT RSVHYSINQRSAFPVGASSHPFVTSVPSLPNPIVTPMITCREDNDPPSSGY FT GPNSQQIAARHVVPKELPEFAGDPMDWPLFVSSYSNSTRMCGYTNDENLMR FT LQRCLKGNAKEAVRGHLFHPSSVPQIMTTLETLYGRPELIVKCLMAKVYST FT PAPKSDKLETLINFGLVVQNLCSQLQSMGMETHLSNPSLLQELVDKLPANV FT KLDWAMYQRHLPVIDLRAFGAYMTTIVSAASNVTLYVEPPRKLEKSKGFVN FT THAAEPELRADSEAKKEEVGVTATYQPKPCLMCKELGHKVRDCNSFKQLSL FT ENRWKTAHQSLCRRCLIPHGKWPCKATTCGIESCEARHHKLLHPGEPQAST FT TGVAVPTRSQSTAVPSATVNVHRQLPCPVLFRILPVTLHGKHTSINTLAFL FT DDGSSHTLVEKDLANELGVEGEFEPLCLQWTSNIKRTEKYSQNVQLEIAAI FT GKTRKHTLEYVRTVDSLNLPPQCLNYEELARKFDYLKGLPITSYSTAAPRI FT LIGADNAKLLLTLKKREGKYSEPVAAKTRIGWTIYGKAEGIPVSSEHRLLH FT VYVHSADQQLHDAVKDFFSIENVGVALVPQVEADEDRRSREILERTTIRIP FT SGRFQTGLLWKFDYFEFPESRFVAEKRLTCLERSLSRKPEVYANVRQQIHD FT YQTKGYAHKLTEEESNSSYPRKVWYLPLGVVQNLNKPGKIRVVWDAAAKAS FT GVSLNSMLLKGPDLLTSLPSVLFRFRQREVGITGDIKEMFHQILIQPEDRQ FT AQRFLWRNHPEDTVEEYVMDVATFGATCSPCSAQYVKNKNAEEWKDVYPEA FT AAAIVENTYVDDYANSSDTVEEAVRIALEVKQIHASAGFEIRNWLSNSKQV FT LRRVGGQGSVTSKCFAVEKLTDSERVLGMTWLPESDVFTFALKFRADVLQL FT LTEEIIPTKRQVLRVVMSIFDPLGLVAPFVVHGKCLIQDIWRAGVDWDERI FT PRELLIQWRRWVEVLHHLNQVQVPRCYFPGYDSHSFRSLELHVFVDASEAA FT YACAAYFRIVDLGQVRCALVSAKTKVAPLKPMSIPRLELQAAVIGVRLMKS FT VQENHTLPINRRVIWGDSKTVQSWIRSDQRKYRQFVAYRISEILNETNLEE FT WRWVPTRFNVADEATKWGKGPCFEPESRWFIGPEFLYNHESEWPKQEFSPP FT DTPEEMRSVHAHHRVAQESLIDYHRFSKYQRLLRTVCYVLRFVGHCRKRTD FT YVECRDASVLSKAELQNAELALWKLVQAEAFPEEMSTIRKNQELPLEQAKR FT LAKSSPLSKLSVFLDKNGVLRIESRIGSAKFVEYGAKYPVILPKQHRFTEL FT LVEYFHQRYAHGYRETVANEIRQLFYIPSLRTIVRKVAKKCAWCVVYRAVP FT KAPRMAPLPAARVTPYVRPFTFIGIDYCGPFLIRIGRNNVKRWVVLITCLT FT VRAVHLEVASSMSSESCKLAIRRFIARRGAPQEIYSDNGTNFVGVSKELRA FT ESVTVNRTLAETFTNRDTQWRFNPPAAPHMGGVWERLVRSVKSSLETLSVS FT RTPDEETFRTLLIEVEGIVNSRPLTFLPLESEDDEALTPNHFLMLSSSGVL FT QPPQAPVENSGRTLRANWNQLRSLMDEFWTKWIKGYLPTICRRTKWFDDTK FT PVKIGDLVVIVDEGVRNSWTRGKVVRTIPGNDGRIRRIDVQTVSGVMQRPV FT TKVAVLDVAIEGIAADDQQQYGSG" XX SQ Sequence 6638 BP; 1823 A; 1583 C; 1702 G; 1530 T; 0 other; aactttaagg tttataacca aacaatgcca ggtgcctcga agaagaccgg caatcgtcgc 60 agtgtcatcc gacagctgac ggaggaagaa agctcagaga cccctcgttg tccgtcttgc 120 gatcgtccag atgtggctga taaatggatg gtgcagtgtg acctgtgtga gagatggttc 180 cacttctctt gtgcccgagt tgatgaaagt gttaaagacc gcagctttgc gtgtgttgcc 240 tgtgcactcc gacacggccg agaatccacc agatcgattg cgtccagtac cagctctgca 300 aggagacggc tcgagcttct tcgattagaa gaagaaaagg aactgcagga aaggctccta 360 gcagaacagg cggctgccga agctgctctc aagaagaagg ccatggagga ggaaatggag 420 cgaatgaagt cgacaatgca tgagaggcta gaaatagcga agcgttacct cacgaagaag 480 tacgaagttc tgcaggagga agaatattca aacgacggta gaagccggaa aagtcattcc 540 agctctcgca gcaagctcag cgacgtgcga agttgggtcg agaatcatga catggtcttg 600 gaaacaggat tcgggatcaa cggagtcccc accaattcca tgcccgtcac tgatgctgac 660 cccaacagca tatccgccgg aagctcggga aatgggccgc ctctaaaaac atctaccaag 720 tctgacgtcc tcaatgagca cggagtaggt gtagctactg gttcgtacaa tcgggaggat 780 attgcatcgt ggggaatact ggtgccccgt tctggacaga acgcagcaga ttcggtgcct 840 gttccacccc aatcgctacc aacaacttct ggtcaagatt ccagcacgcc ggtatatgag 900 acacgcgctg cccggatcgt taatgcactt gcagcaacta gcatctcacc gctcacagcg 960 tcacagcaag gagagccgtt gacgtccaat atcaatatat cttcacccac agccagtgag 1020 aagtctagag cggaactgga gcaggagctt caggaagtgc ttcagcaatt agcggacctg 1080 aagcgatcga cggctggcca aaatcgtccg agttctgcct ccgggaactg cagtggttct 1140 attgtggtta caactagtct aggaataggc acaggtacgt caaggttgca tgctccgtta 1200 caatcggccc cagtagtagg tagagaggca tatattccgc gcagtaatcc aataggaact 1260 caggctagct ttactttagc aacaaattgt accacgcaag acactatctc aaggtccagc 1320 tttttacccc actggaacca aatgagttcg tcaaactatt cgtcaaataa ccagccagga 1380 tttccccact acactgctaa ctttgcgtac ccgtccgttt ccatcggcgt tgcgtcacaa 1440 caacgtgaat caaattctca ttggccgcct tttccggttg gtgctccttc gtgcccgtac 1500 agtgcaagtg aaagaagtgt tcattattcg atcaaccaac gatcggcatt tccggtcggt 1560 gcttcatctc atccattcgt cacaagtgtt ccgagtctgc ctaatcccat tgtcactccg 1620 atgattacgt gtagagaaga caatgacccg ccatcttcag ggtatgggcc gaattctcag 1680 caaatcgccg cgagacacgt cgtgcctaag gagttaccag aattcgccgg ggatcccatg 1740 gactggccgc tatttgtgag tagttacagc aactcgacgc gcatgtgtgg gtacaccaac 1800 gatgaaaacc tcatgcgact tcaacgttgt ctgaagggga atgcgaaaga ggcagttaga 1860 ggtcatctgt tccatccatc gtcagtacca caaattatga ctaccctgga gacgctgtac 1920 ggtcgcccgg agttgattgt gaagtgtttg atggctaaag tgtactcaac cccggcgcca 1980 aaatcggaca agttggagac gctcatcaat ttcggtcttg tagtgcagaa tctctgcagc 2040 cagttgcagt ccatgggcat ggaaactcat ttatccaatc cttcactact tcaggaactg 2100 gtagacaagc tgccggccaa cgtcaagctg gattgggcca tgtaccaacg acatctacct 2160 gtaattgact tacgcgcttt cggcgcatac atgacgacaa ttgtgtctgc agcaagcaac 2220 gttacgctct atgttgagcc tccaagaaag ctagagaagt ctaaaggttt cgtcaacacc 2280 catgcagcgg aacctgaact gagagctgat agtgaagcta aaaaggagga agtaggtgtg 2340 actgcaacat accaaccaaa gccttgccta atgtgcaaag agcttgggca caaagtaagg 2400 gattgcaaca gtttcaagca attgtcgcta gagaatcgat ggaagaccgc tcatcagtaa 2460 agtctttgcc gtcgctgctt gattccccac ggaaaatggc cgtgcaaggc tacgacttgt 2520 ggcatagaaa gttgcgaagc acgtcatcac aagctacttc accctggcga accccaagct 2580 tcaactacag gagtagcagt accaacgaga agccaatcta cagcagtgcc atcggcgacc 2640 gtgaacgttc accgacagct cccatgtccg gttcttttcc ggatacttcc tgtaactctt 2700 catggaaaac acacgtcgat taatactttg gcatttctcg atgacggttc atcgcacaca 2760 cttgtcgaga aggatttggc taacgagcta ggtgttgaag gtgaattcga acctctatgc 2820 ttacagtgga ctagcaacat caaacgaacg gaaaaatatt cgcagaacgt ccagttagaa 2880 atcgctgcta ttggcaaaac tcgcaagcac acgttggaat atgtgcgaac ggttgacagt 2940 ttaaacctcc caccacaatg tttgaactat gaagagctgg cgcgtaagtt cgactatctc 3000 aaggggcttc caatcaccag ctattcgacg gcggctcctc gcatcttaat cggtgccgac 3060 aatgcaaagc tgttgctgac attgaagaaa cgagagggga agtacagcga gcctgttgcc 3120 gcgaaaactc gaattggatg gaccatttac ggcaaagcgg aaggtatacc tgtttcgtct 3180 gagcatcgtc ttctccatgt ttatgttcat tcggccgatc agcagttgca tgacgcagtg 3240 aaggactttt tctctataga gaatgttggc gtagccttag ttccacaagt tgaagctgat 3300 gaggaccgta gatcacgtga gattctggaa cgtacaacta ttcgcattcc atccggacga 3360 tttcaaacag gacttttatg gaaattcgac tattttgaat tcccagaaag ccgattcgtg 3420 gcagaaaagc gactgacgtg tttggagcga agtttgtcga ggaaaccaga agtctacgct 3480 aacgttaggc agcagattca cgactaccaa accaagggct acgcacacaa actaacagaa 3540 gaagaatcga atagcagtta tccaaggaag gtctggtact tgccgttagg agtggttcag 3600 aatctgaaca aacccggaaa aattcgagtt gtgtgggatg cagctgccaa ggcgagtggt 3660 gtttcattaa actccatgct actgaagggc ccggatctcc tcacatcgct gccatccgtg 3720 ttattccgtt ttcgtcagcg agaagtggga atcacgggag atataaagga aatgtttcac 3780 cagatcctga ttcaacctga agatcgacaa gctcaacgct tcttgtggcg taatcatcca 3840 gaagatacag tagaagaata cgtcatggac gtagcaacgt ttggggcaac atgttcaccg 3900 tgctctgccc aatacgttaa aaataaaaat gccgaagagt ggaaagatgt ctacccggaa 3960 gcagctgcgg ccatagtgga aaatacctac gtagacgatt acgcgaacag ttccgataca 4020 gtagaagaag ctgttcgcat agcactcgag gtgaagcaga tacatgccag tgctggtttc 4080 gaaatccgaa attggctctc aaattccaaa caagttcttc gacgggtcgg gggacaggga 4140 tcggtaacat ccaaatgctt tgctgtggag aagttgaccg attcagaacg agttttggga 4200 atgacctggt taccagaaag cgacgtcttc acgtttgctc taaagtttcg agccgacgtt 4260 ctccaactgt tgacggaaga gattatacca acgaagcgcc aggtccttag agtagtaatg 4320 agtatttttg atccacttgg actcgtagca ccatttgttg ttcacggaaa gtgcctaatt 4380 caggatatct ggcgtgccgg tgtggattgg gacgagagaa ttcctcgcga gttgttgatt 4440 cagtggcggc gttgggtgga ggtgcttcat catctcaacc aggtgcaggt acctcgctgc 4500 tactttccgg gatatgattc ccatagtttt cgcagcttgg aactccatgt atttgtggac 4560 gcaagtgagg cagcatatgc atgcgcagct tacttccgaa tagtcgacct tgggcaggtg 4620 agatgtgcac tcgtttccgc taaaacaaaa gtagcaccac taaaaccaat gtcaatccct 4680 cggctagagc tacaggctgc cgttatcggt gtacgactaa tgaaatcggt acaggagaac 4740 cacacgctac caatcaaccg tcgtgtcatc tggggtgact ctaaaacagt acaatcctgg 4800 atccgatcag accagcggaa gtatcgacag tttgtagcgt acagaatcag cgaaattctt 4860 aacgaaacaa acttggaaga atggagatgg gtcccaacgc gatttaacgt agcggatgaa 4920 gccacaaagt ggggcaaagg tccctgtttt gaacccgaga gtcgttggtt tatcggaccc 4980 gaatttcttt acaatcatga atcagagtgg ccgaagcaag agttttcgcc accggatact 5040 ccagaagaaa tgcgatccgt tcatgctcac catcgtgttg ctcaggaatc tttgatcgac 5100 taccatcgct tctccaaata tcaacggctg ctgcgaacgg tatgttacgt tcttcgcttc 5160 gttggacact gtcggaaaag aactgactac gttgagtgcc gtgatgctag cgtactctca 5220 aaagcggaac tacagaatgc agaactcgca ctttggaagt tggtgcaagc tgaagcgttt 5280 cccgaagaaa tgtcaacgat acgcaaaaac caggagttac cactggaaca agcaaaacgg 5340 ttggcaaaga gcagtccact gagcaagttg tcagtgtttc tcgacaaaaa tggagtcctg 5400 cgtattgaaa gtcggattgg cagcgcaaaa ttcgtcgaat atggagcgaa atatcccgtc 5460 atccttccga aacagcatcg atttaccgag ttgcttgtag agtatttcca ccagcgatac 5520 gctcatggat atagagaaac agtggccaac gaaatacgtc agcttttcta cattcccagc 5580 ttaaggacta tagttcgaaa ggtagccaag aaatgtgcgt ggtgtgttgt gtaccgagct 5640 gttcccaaag cgccgcgaat ggcacccctt cccgccgcaa gagtgactcc gtatgttcgt 5700 ccgtttacgt tcattggcat cgattattgt gggccttttc tgatacgcat cggaaggaac 5760 aatgtgaagc gttgggtcgt actcatcacc tgtttgacag tacgagccgt acatctggag 5820 gtggccagca gtatgtcgag cgagtcgtgc aaactggcaa tcaggcggtt catagccaga 5880 cgaggtgccc cgcaggagat atacagtgac aacgggacga attttgtcgg agtaagcaag 5940 gagttgcgag cagaatcagt gacggtcaat cgtaccttgg ccgaaacatt caccaaccgt 6000 gacacgcaat ggcgattcaa ccccccggcg gcccctcaca tgggtggtgt atgggaacgc 6060 ctggtacgat cggtaaaatc ttcactggaa actctttcgg ttagcagaac accggatgag 6120 gagacattcc gaacgctgtt gatcgaggtc gaaggtatag tcaactccag gcctttgaca 6180 ttcttaccat tggagtcgga agacgatgaa gcgttgacac ctaaccactt cctcatgcta 6240 agctcgagtg gtgtacttca gccgccacag gctccagtgg aaaatagcgg acgaacactt 6300 agagcaaact ggaaccagtt acgaagcctg atggacgagt tttggaccaa gtggattaag 6360 ggatacctgc caaccatctg tcgccgcacc aaatggttcg acgacaccaa acccgttaag 6420 attggagacc tggtggtgat agtcgatgaa ggtgtgcgaa acagctggac tcgtggaaaa 6480 gttgttagga cgattcctgg caacgatggt agaatacgca gaatagatgt acagactgtt 6540 agcggagtga tgcagcgacc cgttacaaaa gtggccgtgc ttgatgtggc tattgaaggt 6600 atagctgcgg atgaccagca gcaatacggg tcggggaa 6638 // ID SINE1b2_Cis repbase; DNA; INV; 304 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE SINE Non-LTR Retrotransposon from Ciona savignyi. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; L1-99ext_Cis; KW SINE1b2_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-304 RA Smit A.F.; RT "SINE1b2_Cis - SINE Non-LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC Ci000003. XX SQ Sequence 304 BP; 68 A; 78 C; 78 G; 79 T; 1 other; ttgtccgacc gattagtctg gtgggttagt gcgtcgcctt tcagccgaag ggttgcaggt 60 tcgaaactgg tcgctagcta gtcggttgtg tccttgggca aggcacttaa cggacattgc 120 ctgaacccag cggattaatg ggttctacca aattgaagga acgtctnaat catacacaac 180 acactgcaat agctccggta acccgacggt gggcgcgagg tgatctgacg attgcccgtg 240 tgttaacccc cttgcgtttt cccattcacg gggataaaca tgaatatcct atcctatcct 300 atcc 304 // ID Penelope-4_AAe repbase; DNA; INV; 4963 BP. XX AC . XX DT 22-DEC-2010 (Rel. 16.01, Created) DT 22-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Penelope-like element family from Aedes aegypti. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope_Ele3; Penelope-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4963 RA Jurka J.; RT "Penelope-like elements from the yellow fever mosquito."; RL Direct Submission to Repbase Update (16-DEC-2010). XX DR [2] (Consensus) XX CC ~98% identity to consensus. XX FH Key Location/Qualifiers FT CDS 2381..4591 FT /product="Penelope-4_AAe_1p" FT /translation="MQLLRRSIKALYGRINVKTLECYSLHLKLAKEFQDNF FT QIFLTKVKIAEFCESERKRKSLSEKFNKLKYQNPKFSRPSQRPNIHNIDGI FT VVNRSSEQFTDEQLTLLNKGLTYAIPKKPDTTQTVIDIETAINNNNLQTSQ FT IQEIRTHTANIIRNTNTTPQNKEHNIIKQLNTKPVFYLRADKGNSTVIFDK FT SKYEEIMQNKIKHGPYRIQRTDPLPGMIRRVDKALKEAKPVLGNVVNSLKE FT SNPSLPRIKGLPKVHKPGNEMREIVSAIGSPTHKIAKYLVQKFQNFPKQFH FT SKTVKNSKHFTEVIENLKIADDEVLVSFDVKSLFPSIPVKEAISLLEDWLL FT SQYEGTDWIKQVRTYIKLTKLCMEENYFSFRDETYKQLNGAPMGNPLSPFL FT SEIFMANLEKRLEDKNQLPEIWWRYVDDVFSIVKKHILPQTLETINKAHKN FT IEFTYEVEQDNKLPFLDLLIVKEHSSLTFEIYRKPTNTQRIIPNTSNHSHQ FT HKMSSFNHMIHRMLTIPLSETGRRKELNYIFETGQLNGYTRQTIQQLIDKR FT TREQYKRSLTTLTQEQPTLKHIAVNYNDETKKLRPILRKFGFELVFTSRNN FT QLQTLLGSTKDPIDSLYKSGVYQITCDHCNKIYIGQTKRTLNTRFKEHVAE FT VTKAHKDTEKGHVHHFKSKVAEHIYNEQHFLNTKNIKLLRHISNPWKLDVA FT ESIEIFKQNHNKLLNKDQGNGYSWLFRLIPNLHTHNT" XX SQ Sequence 4963 BP; 1670 A; 851 C; 835 G; 1607 T; 0 other; ttaaaactac aaatgaactt taacacttca ctttttgcaa aaaaaaaata aaatactaaa 60 atcactaagg tgagcaaata cctttaaact ctaatatgtg ttgcttatct tgacagatag 120 gcctatttcg tctgcgactt acagacttct tcagtgtcga gtgctcgact tgaagagaac 180 atcgcatgga tctattattt atactagaaa aagtgtgtgg gtatgttaga actagattct 240 acctgtaatc gtatacagga ggtcattttt taattgtgta cctaatcaaa tgaaaggatt 300 gtcaaaatta caaattcctg cttgtttggc aggtggtcaa tcaatgtgtt tatgtattgt 360 gtgtgtgaag gttaggtata agtctaaaca gccaagagta cccatttcct tgatctttgt 420 ttagtaattt attgtggttt tgtttgaaaa tttctatgct ttccgctacg tcaagcttcc 480 aaggatttga aatatgtcgt aagagtttga tattttttgt gttaaggaaa tgttgttcat 540 tgtaaatatg ttctgcaact tttgatttga aatggtgtac atgtcctttt tctgtgtctt 600 tgtgtgcttt agtaacttct gctacgtgtt ctttaaaacg tgtattgaga gtccgttttg 660 tttgtcctat gtaaatttta ttgcagtggt cgcatgtaat ttggtaaacg cctgatttat 720 ataaactgtc aattgggtct tttgttgaac ctaggagagt ttggagttgg ttgtttctac 780 tggtaaaaac taactcgaaa ccaaactttc gtaggattgg gcgtagcttt tttgtttcat 840 cgttgtaatt gacggctata tgtttgagtg ttggttgttc ttgtgtcaga gtagtaagtg 900 aacgtttgta ttgttctcgt gttcttttgt cgatgagttg ttgtattgtt tgtcttgtgt 960 aaccgtttaa ttgtcctgtt tcaaaaatgt agttaagttc ctttcttcta ccagtttcac 1020 tgagggggat tgttagcatt cgatgtatca tgtgattgaa ggaagacatt ttgtgttggt 1080 gtgagtgatt tgatgtgttt ggaataatac gttgagtgtt agttggtttt ctatagattt 1140 caaatgtaag tgatgagtgt tcctttacaa taagtaagtc caagaaaggc aatttgttat 1200 cctgttccac ttcgtaagta aattcaatgt ttttgtgagc cttattaatt gtttctaaag 1260 tttggggtaa gatgtgtttt ttaactatgc tgaaaacatc atcaacgtaa cgccaccaga 1320 tttcagggag ctggttttta tcttctaatc gtttctctaa atttgccata aaaatctcac 1380 ttaaaaatgg agaaagaggg tttcccattg gggctccatt taactgttta tatgtttcat 1440 ctctaaagga aaaatagttt tcttccatgc aaagtttagt taatttaatg tatgtacgta 1500 cttgttttat ccaatctgtg ccttcatatt gactaagtag ccaatcttct aataggctta 1560 tggcctcttt aacaggaatg ctagggaata aagatttgac atcgaaagaa accaaaactt 1620 catcatcggc tatttttaaa ttttcaatta cttctgtaaa atgtttggaa tttttaactg 1680 ttttactatg aaactgtttt ggaaaatttt ggaatttctg gactaaatac tttgcaatct 1740 tgtgggtggg tgatccaata gctgaaacaa tctcacgcat ttcattacca ggtttatgaa 1800 ctttaggaag tcctttaatt cttggtaatg aagggttcga ttctttcaga gagtttacaa 1860 cattacccaa aaccggtttt gcttctttta acgctttgtc aacacgtctt atcattcctg 1920 gaagaggatt tctttgaata cggtaaggtc catgtttaat tttgttctgc attatttctt 1980 cgtatttaga tttatcaaaa attacagtag aattaccttt atctgctctt aaatagaaaa 2040 ccggttttgt gttgagttgt ttgattatgt tgtgttcttt gttttggggt gtagtgttcg 2100 tgttgcggat gatgttggcg gtgtgtgttc tgatttcctg tatttgtgaa gtctgtaaat 2160 tattattgtt gattgctgtt tcaatgtcta tgactgtttg tgtcgcgtca gtttttttgg 2220 tatagcgtag cagtcctttg ttgagtaggg tcaatgttca tctggattat acaaggacat 2280 tgcttttcta aagcaatgta aaaaacacag gcttacaccc gtaagtcata aggtaattgt 2340 caaaacagct aagaacccgg acataattaa agacatagag atgcaattat tgagacgatc 2400 aataaaggca ctctacggta gaattaacgt gaaaactttg gaatgctatt ctctacatct 2460 aaaattagca aaagaattcc aagataattt tcaaattttc ctaactaaag taaaaatagc 2520 agaattttgt gaatcagaaa gaaaacgtaa atctttatca gaaaaattta ataaattaaa 2580 gtatcaaaat ccaaagtttt cacgtccttc tcaaagacct aacatacata acatagatgg 2640 aatagtggtt aatcgttcat ctgaacaatt cacagatgaa caattgaccc tactcaacaa 2700 aggactgacc tacgctatac caaaaaaacc tgacacgaca caaacagtca tagacattga 2760 aacagcaatc aacaataata atttacagac ttcacaaata caggaaatca gaacacacac 2820 cgccaacatc atccgcaaca cgaacactac accccaaaac aaagaacaca acataatcaa 2880 acaactcaac acaaaaccgg ttttctattt aagagcagat aaaggtaatt ctactgtaat 2940 ttttgataaa tctaaatacg aagaaataat gcagaacaaa attaaacatg gaccttaccg 3000 tattcaaaga acagatcctc ttccaggaat gataagacgt gttgacaaag cgttaaaaga 3060 agcaaaaccg gttttgggta atgttgtaaa ctctctgaaa gaatctaacc cttcattacc 3120 aagaattaaa ggacttccta aagttcataa acctggtaat gaaatgcgtg agattgtttc 3180 agctattgga tcacccaccc acaagattgc aaagtattta gtccagaaat tccaaaattt 3240 tccaaaacag tttcatagta aaacagttaa aaattccaaa cattttacag aagtaattga 3300 aaatttaaaa atagccgatg atgaagtttt ggtttctttc gatgtcaaat ctttattccc 3360 tagcattcct gttaaagagg ccataagcct attagaagat tggctactta gtcaatatga 3420 aggcacagat tggataaaac aagtacgtac atacattaaa ttaactaaac tttgcatgga 3480 agaaaactat ttttccttta gagatgaaac atataaacag ttaaatggag ccccaatggg 3540 aaaccctctt tctccatttt taagtgagat ttttatggca aatttagaga aacgattaga 3600 agataaaaac cagctccctg aaatctggtg gcgttacgtt gatgatgttt tcagcatagt 3660 taaaaaacac atcttacccc aaactttaga aacaattaat aaggctcaca aaaacattga 3720 atttacttac gaagtggaac aggataacaa attgcctttc ttggacttac ttattgtaaa 3780 ggaacactca tcacttacat ttgaaatcta tagaaaacca actaacactc aacgtattat 3840 tccaaacaca tcaaatcact cacaccaaca caaaatgtct tccttcaatc acatgataca 3900 tcgaatgcta acaatccccc tcagtgaaac tggtagaaga aaggaactta actacatttt 3960 tgaaacagga caattaaacg gttacacaag acaaacaata caacaactca tcgacaaaag 4020 aacacgagaa caatacaaac gttcacttac tactctgaca caagaacaac caacactcaa 4080 acatatagcc gtcaattaca acgatgaaac aaaaaagcta cgcccaatcc tacgaaagtt 4140 tggtttcgag ttagttttta ccagtagaaa caaccaactc caaactctcc taggttcaac 4200 aaaagaccca attgacagtt tatataaatc aggcgtttac caaattacat gcgaccactg 4260 caataaaatt tacataggac aaacaaaacg gactctcaat acacgtttta aagaacacgt 4320 agcagaagtt actaaagcac acaaagacac agaaaaagga catgtacacc atttcaaatc 4380 aaaagttgca gaacatattt acaatgaaca acatttcctt aacacaaaaa atatcaaact 4440 cttacgacat atttcaaatc cttggaagct tgacgtagcg gaaagcatag aaattttcaa 4500 acaaaaccac aataaattac taaacaaaga tcaaggaaat gggtactctt ggctgtttag 4560 acttatacct aaccttcaca cacacaatac ataaacacat tgattgacca cctgccaaaa 4620 caagcaggaa tttgtaattt tgacaatcct ttcatttgat taggtacaca attaaaaaat 4680 gacctcctgt atacgattac aggtagaatc tagttctaac atacccacac actttttcta 4740 gtataaataa tagatccatg cgatgttctc ttcaagtcga gcactcgaca ctgaagaagt 4800 ctgtaagtcg cagacgaaat aggcctatct gtcaagataa gcaacacata ttagagttta 4860 aaggtatttg ctcaccttag tgattttagt attttatttt ttttttcaaa agtgaagtgt 4920 taaagttcat ttgtagtttt aaacatactc taatactgta att 4963 // ID Gypsy-1_IS-LTR repbase; DNA; INV; 152 BP. XX AC ABJB010258749; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_IS_; KW Gypsy-1_IS-I; Gypsy-1_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-152 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010258749; Positions 2582 2431. XX SQ Sequence 152 BP; 35 A; 40 C; 45 G; 32 T; 0 other; tgttgtgtac gaggtgaccg gcgcattggc atgagcaggg agcgcgcacc agccacgagc 60 agagcgttcg gacactgccc tcttggagac attccagtga acccgctgtc gcacaggctc 120 cgttctggat ctattaaatg aagtatgtaa ca 152 // ID Gypsy-10_SI-LTR repbase; DNA; INV; 295 BP. XX AC AEAQ01022779; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_SI_; KW Gypsy-10_SI-I; Gypsy-10_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-295 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022779; Positions 213 507. XX SQ Sequence 295 BP; 78 A; 74 C; 84 G; 59 T; 0 other; tgtaaggagg cggccctgag caggacttag ccgtgggaga agctgtccga aagtcggagc 60 gccgagcgcc tgtgattagg cgaatagcga tcgctcctgt agagccgcgc gcaggtgtta 120 gcccccgttt ccggaaaagt cgtattgtaa ccggagcgga gcagagagac actctcgatc 180 gtgtcgccgt gcgaacaaga taaccgcgag tctgtcgaat caactgtata agaaaataaa 240 gtgtcttgcc aaccattaac cacgagtcga ttcataccat cctgaaagtc ttaca 295 // ID BEL-103_AA-LTR repbase; DNA; INV; 405 BP. XX AC supercont1.294; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-103_AA_; KW BEL-103_AA-I; BEL-103_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-405 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.294; Positions 168106 168510. XX SQ Sequence 405 BP; 115 A; 107 C; 77 G; 106 T; 0 other; tgttcagaat tagatcaaac agcaatactg gcaacccaaa catgttccac acgatcggag 60 ctatatcact ctcgctcatc cttatcatta ttatcattac catctcacat ttcctatcag 120 cataggtagc tcatataccg gtgagaccga atcgaccgcg atcggttata ttcgcgcgta 180 tactcactac actgacccta gagtatcgtt ctatatactc agggcagtca gttagctctc 240 gatcgcgcgc gcatgagcat cgcaataaag ttcacttcaa accgaaccta acgagtctca 300 ataaagtttt tcgatcacca agtttaacgc gtgtgtagtt ctgtgcaaga aacgaattcc 360 gcagtgtccg aactagggtc gtctccaacg gtaccggtag gaaca 405 // ID BEL-199_AA-LTR repbase; DNA; INV; 518 BP. XX AC AAGE02030545; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-199_AA_; KW BEL-199_AA-I; BEL-199_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-518 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02030545; Positions 9733 9216. XX SQ Sequence 518 BP; 178 A; 94 C; 108 G; 138 T; 0 other; tgatacgaac gcgctgttaa ttaaatcaat caatgtagtt acagtagacg gaattagcaa 60 tcattatgta catgaatcat ttgtgcgtca ttgggctttt gaaaactact atcagcgcgc 120 cgcatggcat tttattgtca ctttatgttt agtttatcat ccctacagta agggtacgta 180 caaagtaagt aagcagacac gaataatcat taagaaaata aagacccaaa atcgataata 240 atagagcgac ccaaagatgc ggtcgtaaat cttggcgacg gtgagtcgtt gatagttaag 300 aaaatagaat tgcgataagc agactgagaa agtaaaattg ccgggtacag atgcaagtgc 360 gtcatggcac agtaagaccc aggcagaaaa ttgtagtata taagacctta ggtaagaaat 420 aaagttagtt cttagcttca tctcatgtga gtcggtcctt gaaatccgaa agcaccataa 480 cacacttcct cttgtcaatg ttcggttcga cccaaaca 518 // ID I-60_AAe repbase; DNA; INV; 4482 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-60_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4482 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1331-1331 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >91% CC identity. CC Both termini are truncated. XX FH Key Location/Qualifiers FT CDS 934..2127 FT /product="I-60_AAe_1p" FT /translation="MILTISGTVIPEHVDFGWTRCKTRNYYPSPMLCFRCW FT TFGHTGKRCTAPQRICGRCSKVHPENQTRNPTAKETEADVTATAGSSGENQ FT DISRFVCTDSTFCINCKSDDHAVSSRKCPIYLKEIDIQRIRVDNNISYPQA FT RKDYEARQNASASTGTYSGVANASKDAEIETLKAAIRRMQMDSKAKDERMA FT EMEAALRGPSVGERLDRVREHGTIDQLIKQVADLTATVQKLQQTVEMKDQI FT ISKLMAERKSAPVLEAEIPREAFSITDTQDEIPASQESIATTDPKLAAKIQ FT EWIGRNTLSSVKNVILPNDTTKTENKKDQGRKKRNKKSNEDVTSTDESMSS FT VVSQTSRRTTETSTSMPNKRNHDGSDSSNDPNNSPNLRRNKKVPKQEEVKR FT GKKK" FT CDS 3455..4267 FT /product="I-60_AAe_2p" FT /note="endonuclease." FT /translation="MNNTLGRQYKWYVKTGVNVYQSVAIGVSNDIPVTVLD FT IETDLPIIGINISWPFPVSIVSIYLPNGKIPNLKKSLGRVLEQIPGPMILL FT GDVNAHHKAWGSRHSTARGSILVGLANKHDLTILNDGSPTFSRGNSETSID FT VSLISACITNRLLWSTELDLHGCDHTPILLTLDGSAAPETTRRPRWLFDQA FT DWTTFQKMTEAKIETSPPESLQDLTELICSTAAQTIPKSSPNPGRRALHWW FT NEETRKAVKLRRKALRRVQKLQKKSSRKSP" XX SQ Sequence 4482 BP; 1297 A; 1148 C; 1067 G; 950 T; 20 other; agccaaagca gtctctcggc tgactagtgc tccccatcgg ggctgatacg gtgccggatt 60 gttcgagtcc tctctcggaa gatttttcgg gcttcacaga ggggcccacg gtgtggtttg 120 tttgagtgag aaacacgtgg tgatcaacag attcaaattc ttgtgcttgg aaagagagaa 180 aggaaactgc cgagtgagta gtgccgaggt ggacccagtg gtctccaatt tcggagacgg 240 accacgtagt gaccttgtga ttttgtgaaa gggggaaagt gcaataaccg aaaaactagt 300 gcgtagtcat cgattgtctg ttagtgaact tgtcgtgggt ttcgcagaga cgacaaaacc 360 gcgccgataa acagattaag tgtagtggta aaatcgctgc tctgccagca tggagctggc 420 gagcggtcag ctcttccgcc caggggaccc ggggggccca ggcggtggac aaaaaactgg 480 tatgaactgg agacaaggtg agtacatggg accaacactc ccatcgtata tggaccggga 540 tggaactgca ggttctctgc agtacttaaa aatgcaagcc acatccggat cgatgcctca 600 ggatccgttc ctgctgcgca tctcggttga aaagcatatt ggagctcgga tcgagggtgc 660 cttcaaggag aaccgtggcg tttcgtatgt tctcaaggta cgaagtgtgg ctcagttcaa 720 caagttgctc cgcatggaaa aactttgcga tggaacacct gtatccatca ctgagcaccc 780 gcaactaatc agaggcagtg tgtggtttcc aatcctgacg tagtgggtct tgacgatgat 840 tacctgaaga tccaactcgc cactcaaggg gttaaggacc tccgccgaat ccgacgccga 900 ctgccggatg gatcgttcgt gaacacgccg acaatgatat taacgattag tggtacggtt 960 attccggagc atgttgactt cggctggaca cggtgtaaga ccaggaatta ctacccatcg 1020 ccgatgttat gcttccgttg ctggaccttc ggtcataccg gcaagcgatg cactgcaccg 1080 caacgtattt gcggtagatg cagcaaagtg cacccagaaa atcagaccag aaaccccacc 1140 gcaaaagaga ctgaagccga cgtaactgcc acagctggtt cctctggtga aaatcaggat 1200 ataagcaggt ttgtctgtac agattcgacc ttttgtatta actgcaaatc cgatgaccac 1260 gccgtttcaa gccggaagtg tcccatctat ttgaaggaga ttgatattca gcgaataagg 1320 gttgacaaca acatctccta tccccaggca agaaaagatt atgaagcccg ccaaaacgcc 1380 agtgccagta ccggaaccta ttctggggta gccaatgcga gcaaggacgc agaaattgag 1440 accctgaaag cagctatccg ccgaatgcag atggactcta aggcgaaaga tgaaagaatg 1500 gcagaaatgg aggctgctct tcgaggccca agcgttggtg aacgcctcga tagagtccgg 1560 gaacatggca caatcgacca gctcatcaag caggtagctg atttgactgc tactgtccag 1620 aagttgcagc aaactgtaga aatgaaagac cagataatct ccaaacttat ggctgaacga 1680 aaaagcgcac ctgttctcga agcggaaatt cctcgagaag ctttttcgat cacagatacc 1740 caggacgaaa tacctgcatc ccaggaatcg atagctacta ctgatccgaa attggcagca 1800 aagatacagg agtggattgg ccggaatacc ctcagcagtg tgaaaaacgt catactaccg 1860 aacgacacaa caaaaactga gaacaaaaag gaccaaggac gaaaaaagcg aaacaaaaaa 1920 tccaatgaag acgttacatc aaccgacgag agcatgtctt cagttgtctc tcaaacatca 1980 cgtcgaacta ctgaaactag cacttccatg cccaataaac gaaaccacga tggatcagac 2040 tccagcaacg atccgaataa ctctcccaat ctgcgccgga acaaaaaggt tcccaagcag 2100 gaggaagtca aaagaggtaa gaagaagtaa gttcaccgaa acagctcagt ccctccattg 2160 aacggcccta cacagacggt ttacatttca aacaccattt tcacttccct ttaccccctg 2220 gaagatgaag taaacgtgga gtggactgag caaacaatca aaaaaacaga gaaagcctcg 2280 ggtagccggg gccccgaagt gcggacgtta cacccccacc ggaactagcg gaaacccccc 2340 gacacccgaa tgtaaatggg acgcaaaagg gccaaaaaac ccgttcccca ccatccccag 2400 tctcggatca gaacagccag ggccccgtcg gtgcggaagc caaggccgta ccggaaccgc 2460 cggacaacct ctggcattct gattcaactg gaagagggac gcacaagggt gtaaaaacct 2520 gttcccctac gcacaacgag ggagtcaagg ccacggatca aaacatttgg acttctacgc 2580 ccaaatcaaa tacgcgctga aggacggtac gttggagtca agcttcctac tagaagaaat 2640 ccgccacgca actgccgcat gacaaccgag tacatagccg agtcggaaac cgccggaagc 2700 gaatcttctg agttcgtttc ctgcacttgg gcggccagag gcwtatgatt ccgagctata 2760 cgacgccctt cacaagactc gatacgagck gtcggtcgat atcgtcgcag gtagttctgg 2820 aaccacacac gagtcggcgt cagctcatca aggggtacat ccggcccaaa agtgttccga 2880 tccgaawgat gcgacgccca caaatagcgg taagtccgat tttaatcatt gtctgcctga 2940 aatcagcctg taactscasc gtcgaacaag ctccgccatt aggcgggtsg cacktcgctc 3000 acmtcccact sgtatgacac tgtctcgtca gtcttgccmt tccgaaacac gcccccaaac 3060 gaascacatg gttccacgag tcaccaacga aatagaaagg ttaatggctt gaatctcaak 3120 gcatgcaatk tacttttctk tatttatmga ttccacacgg attcccccaa ctcaacattt 3180 ttggcgttct aacgatacat ccgagccatc aaatggtggc actttacgcg tctatcgctt 3240 tcggagamct gtctsgttcg kaatactctt cccttgtccg gaaaatcaac gcggcgtatt 3300 tcccgaaacc gaaaacactc tggcakcacg atatcgtggg tcataaacgk atttttccac 3360 aacctaccgg acttggagat gttagtttct acccgtgcgg caccggccag tagtactcgc 3420 cctgcaagag gtacataggg tcgacccagc ttttatgaac aacaccctgg gccgacagta 3480 taaatggtac gtaaaaacgg gtgtcaacgt ttaccaatcg gttgccattg gagtgtcgaa 3540 cgatatccca gtaacggtac tggatatcga aacggaccta cccatcattg gaataaatat 3600 ttcatggcct tttccggtct ccatcgtatc tatatatctc ccgaatggaa aaatcccaaa 3660 tcttaaaaaa agcttaggtc gagtgctgga gcaaattcct ggtcccatga tcttgcttgg 3720 agatgtaaac gcccatcaca aggcttgggg aagccgccat agtaccgctc gtgggtccat 3780 ccttgttggt ttagccaaca agcatgacct aactattctc aacgatggtt ctcctacctt 3840 ctcacgtggg aatagcgaaa catctattga cgtctctctc atttcggctt gtataaccaa 3900 tcgccttctg tggtctacgg aattagatct tcatggctgc gatcatactc caatattgtt 3960 aacattggat ggctctgccg ctccggaaac aacccgtcgc cccagatggc tatttgacca 4020 agctgactgg acgaccttcc aaaaaatgac cgaagcaaaa atagaaactt ctcctccgga 4080 atctcttcaa gacttaaccg aactaatatg ctccactgct gctcagacca ttccaaaatc 4140 gagcccaaac ccaggtcgtc gcgcattaca ctggtggaac gaagaaactc gtaaagccgt 4200 aaagctgcga cgaaaagctt tgagaagggt tcagaaactt caaaaaaaat cttccagaaa 4260 gtcaccctga tcgtgctaag gctcaagaat cataccaaaa tgcgagaaac gagtgtcgac 4320 aaatcataag aggcgcaaag gaagcatcct ggactgaatt ccttaacgga atcaatgagg 4380 agcaatcgtc gacggaactg tggaggcgaa tcaactgtct tcaaggaaaa agacgagcac 4440 aaggaattgc cctcaaggta aatggatcct tgacacgtga tc 4482 // ID Gypsy15-LTR_Dpse repbase; DNA; INV; 2568 BP. XX AC Unknown_group_825; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy15-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-2568 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1098-1098 (2009). XX DR Genome; Unknown_group_825; Positions 14877 17444. XX SQ Sequence 2568 BP; 732 A; 571 C; 555 G; 710 T; 0 other; tgtagtggtt gcttgtcggg cgtataggaa agcctatcac ttttcttcgc ctgaaagtag 60 gccgatgtta tttgtacaat ttatatattt atatttatat atattatatt agttatataa 120 ttagatatca cacctcaccg agatacttat taattattaa gtttatttcc gatagttagt 180 gtatagttag gttggtttaa tttcattttg gtggtgttac atgttgaatg ttccttgtca 240 tgtggggcta gcgccaccta ggggtgacat taagaatagg cgaaaaacga aaaaaaaaat 300 ctcaccacca ttcccataac cgttcaaggt gaatggcgaa ggatcgtgca aaggcagatc 360 cggagaacca aatttttgtt ggtcggtaga tccgcccaaa tctttctttc tcctcgtggc 420 cgtacaagaa ttcagttgcg cggtgagaaa aaaaataata aaaataaaat taaaccgccg 480 tgtgtgaaac aaaagtgcac cagatcgaaa aaaaaagaac ccaagaaaac agattggggt 540 ccaaacgaaa acgtgagtac cggccttcga tagtgctagg agcctcgaag ttagattttc 600 cgaattattc tgtattttcc gcttggaggt cagcagcccc aacacaaaaa cacctggaga 660 aagagggaat tagcacctca accccatccc cctattttcg cagtgccagt tccaacaggg 720 taaatatgtg cattaagaac tgttagaaaa agtagcctga tcagtatttt caaattccac 780 attagaatcg gcacttagtt tgggaagtat tccaaagctg cagcccgagc tttaagcccc 840 cgcactacga accagctgat ccaccaagtc gcgggcagtg aactttggct gcgcgaggcg 900 aagacagccg tctgaccggg cttagtgcgg cgaattggcg taaccgtttc gcgttcaaaa 960 ccgggccaag aagcgcggta acccggaact tgcagtcagc gtcggacgga agcagcggaa 1020 attttcctac cgggctgaaa aaagatagtg cgccacgtgt ggacataacc tcccggcgac 1080 cctgagtggt atcgcctctc ttgttcttct cgtgattcgg gcctagctgt tcaacgtgta 1140 gttaaagtgc agctgaggcc agtgtttgtc ataggtgctc gttagggtca ttatgtaggc 1200 tgaatgtacc caaaaacaaa aaagtaaaga gatgtgctca catccatatg tacacacagc 1260 ccattgaaag ccactgaggc cagtaagatc ccccccgtga ctcgcattcc cttgaccctc 1320 gcagtccggt gatatgtccg tgtttgtgta gttggatata cttagtgata attgtagaca 1380 gaaaagaaaa acagaaaaga aaaaccaacc acacctgaaa agtttgaaaa tgcattttgt 1440 acaagtttaa aagattgagc ttaaaagcgt ccgctcggcc gggtcacatc agcgagctta 1500 tgccatgcag cacgtgctgt tcgattttgt ttttggtttg caggtaaaag cgtaatgcac 1560 ccacacacag cgacacgccc agctacgaaa gatctttcgc cgcgggaaca caaacaacaa 1620 ctgagtgagc tttatactat accaacaaca ctacattgct ttgcctccca aaattccgcg 1680 agcgtggcaa aagcttcgtt tttttttgct agagctgttt gtcggtcgtt gacctcacac 1740 ttttaaacta gtcacagctg attggccttc tgcctttaca gtaagattgg taaaacgaat 1800 agtatgtcca actactccct gaagccgatg tgctgggaca tgtgtaccta tatggcgctg 1860 aattattatc tttctattta aggaaaaatc attagtttat atcgcaatat aaaatatcgt 1920 atatgcactt ataactattt atacttatcg tatttattaa tctaaatatt ccctagaatt 1980 aatagctata tctcttgtga cgcttttcgt tcctactttt gtccatttag ccttgtcttg 2040 gagttctctg ttagtgtttt ttttgtaaca tgagggtatc tgaatatagt ttctaataaa 2100 atgattgtta agatgaatat gtgctagacg ttaccgtatt tgggggaacc atgccgctcg 2160 ggagcgaaat aggatgacca ataccgggca ctccagggat cctgatatta ttattggcct 2220 tttcaggatc atggtagggt gggcgtggca cgactccgtg ccagcaaaac agagatgtcc 2280 gacgcggtgg tagatcctta ttccttcact atactatctt tgcccaaata tacttaccga 2340 acccctaatg atcctttcct tgtctgtccc cctcccattc ccaaaaagag tacgcgaatt 2400 ttaataaaat aataccagca ggaacgaagc aacctaactg gagggttacc ccaaaggacc 2460 catccctatt tccgaccaga gaccagccgt ggaagcgagc gcgtactcct tctcagttag 2520 ccgaccacca gcttcattgt gcgaagattc tcttcgtgac tcgttaca 2568 // ID hAT-42_HM repbase; DNA; INV; 4536 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-42_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4536 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2030-2030 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(962..3058,3039..4025) FT /product="hAT-42_HM_1p" FT /translation="MEIASRGCLYSSASLLSNSSLKTVNVLRLLTSIELFE FT KASYYARYSLFVKQVENNNFSNLSSVLVACVSFECSDQFKLINSESASEII FT IMEALINCRANKFSSFLCLIALSSVLNIPIRSIYPDFGLEKFKKLFNTIIF FT PFRSIKNQDQNVLNILWCNTDIMSVMQNNNNWTPNHFVPIIKKPKLISSNN FT KNLAIKNDATYPLHFFSPKTETNKKKQLDSAKHVKIQSKINFKKDNAFIEL FT PFEPSVDSKIVKSLDSNSVIIPSYNELNSLYDVSTYLVRGKLLTAGKDDSI FT LLDMVNKVFCPNSMFVFPISFNTKRSFKFEWLSQFSWLAYSSKEDGAYCLP FT CTLLGASIPNKSGILNLVFKPHQEWGNAVRDYRKHEENCVLHEKSMLSFNA FT LLSRCNSKSNVIEVDLNNSRLKLLSDNRKKLVPIIKTIIFLGRNDLAFRGH FT RDDSKYHPDIGESSTQKVGVGNFVELLNFXVDAGDQILANHLSSSPKNATY FT ISKTTQNQLIDSCGKAIEEVLIKNVKHSIFFSILCDEAVDCSNTEQMSLVL FT RYVNSNNEICEDFLRFIDCKTGTSGLSLSLNVLNALQEFGLDIQNCRGQGY FT DGAGCMAGEYKGVASRIKALNHKAIFVHCASHRLSLVVAAACQVQKVKNLL FT GQVKEISYFFNLSPKRSNCLKKYLSPNQEKIIDTCRTRWVQKLRGLDVFFE FT VLMFFFDNFIPIIFIHALEEMGLNESKEYNSETASKSSSFLRLLTDFSFIV FT SLVITKQLMDFFYAITVTLQTKSFDISQQCFEITNLKNLLLEIKNKIDIYH FT TEWYAIALSLAKTLDIQEVRPRLCNVQVYRDNYPTNTVCDYFKHSITSPLI FT EHLINELDNRFPENGMFVYKGLAAVPSTVLSRIQVKKPWKSDFYEFLNFYS FT SDMPHFTSIHAELDLWELFWKNQSSIPSTVAGTLKSIDMRGFPNIRTAFII FT LGTIPITTCECERSISVIRRLKTYSRSNMIESRFNSLALMSIHQEIFPDVE FT RVIDIFSKSGERRLDFYLVVY*" XX SQ Sequence 4536 BP; 1534 A; 676 C; 706 G; 1616 T; 4 other; caggggcgga tccagggtgt cgcgaatgat gcgcatgcat cagtcttttt tcamcgctaa 60 atttaagtgt atctcttttt cacttttttt tcttaataaa aagtaaaagt agaaattgga 120 atctggtatt cacaaaaaat gaaawacttg ttttaaactc gaaaaaaatt catttaaact 180 cgattttatt tgatctctat agttacttat tcgttactta kacaaaattc ctgcgttttg 240 gagcaactga tgcaattgtt cttcattaaa tacaaatgtg attggtttaa aaaaaaaaaa 300 gtacgtttta accaatcata tttgtatttt ttcacaaatt tttccgatca ctttgtagca 360 actgatgcaa ttgtactcta aatcattaat gattggttaa aaaacagttt aaaccaatca 420 ctatttaata aaataacaat tgcatcagtt gctacaaaac ttccgcaaaa aactttacaa 480 aaaagataag aaatgttatt tataaccaat caaatttgta gaaagtttag aacaattgca 540 tcagttgctc caaatcgttc aaattttaca aaattctgat atgccacctg atatacgttc 600 ttcgctcgta attttcgtcg ttggactttg cttgtaaact aaagtttaaa aaaaaaatga 660 attttttagt ctcaaatgca attaaggcta gacatttgcc aaatattcgc aaatataaaa 720 accagttgac tgaagaattg aaagttttaa ataataagag tccagttcag ccacatagtc 780 agtcttctga aaatactgaa atggatcgtt ttatcccaaa gcaagaagat actaaagatt 840 atgttgcttt aaggtaaaac cctattatta aagttaatgg tattttatat ttccagctag 900 tatttaatat aaactgagct tgttttagta tactttttat tttatttaag gtcatctgaa 960 gatggaaatt gccagtagag gctgcctcta cagttcagct tctcttttaa gcaattcgtc 1020 tctgaagaca gttaatgtat tgaggcttct aacttcgatt gaattgtttg aaaaagcttc 1080 ctactatgcg aggtactccc tttttgttaa acaggttgaa aacaataatt tttctaatct 1140 ttccagtgtt ctagtagcct gtgtttcttt tgaatgttca gatcaattta aattaattaa 1200 cagtgaatca gcttctgaaa ttataataat ggaagcattg atcaactgcc gagctaacaa 1260 gttttcttca tttttatgtc taattgcttt gtcatcagtt ttaaatattc ctattagatc 1320 aatttatcct gattttggct tagaaaaatt caaaaaactt tttaatacaa ttatttttcc 1380 atttcgttca attaaaaatc aagatcaaaa tgttttaaat attttatggt gtaatactga 1440 tattatgtca gtgatgcaaa ataataataa ctggacaccg aaccattttg taccaattat 1500 taaaaagcct aaactaattt ctagtaataa taagaatttg gctataaaaa acgatgctac 1560 ttatccttta cactttttta gtcctaaaac tgaaacaaat aaaaaaaaac aacttgattc 1620 tgcaaaacat gtcaaaattc aatctaaaat taactttaaa aaagacaatg ctttcataga 1680 gttgcctttt gaaccttctg tagattcaaa aatagttaag tcattagact ccaactcagt 1740 tattattcct tcatataatg aattaaattc tttgtatgat gtttcaacat atcttgttcg 1800 aggtaaactt ttaaccgctg gcaaagatga ttctattttg ttagatatgg taaacaaagt 1860 attttgtcct aattctatgt ttgtttttcc aatatctttt aatacaaaaa gatcatttaa 1920 atttgaatgg ctcagccagt tttcttggtt agcttattct agcaaagaag atggagctta 1980 ctgtcttcct tgtactttgc taggtgctag tattccaaat aagtctggaa tattaaattt 2040 agttttcaag ccccatcaag agtggggtaa tgctgttcgt gattatagaa aacatgagga 2100 aaattgtgtt ttacatgaaa aatctatgct ttcatttaac gcgttgcttt cacgttgcaa 2160 ttccaaaagt aatgttattg aggttgattt aaataattct cgtttgaaac ttctttctga 2220 taaccgaaaa aaattagttc caataattaa aacaataata tttcttggta gaaacgatct 2280 agcatttcgt ggacatcgtg atgatagcaa atatcatcct gatattgggg aatcttctac 2340 acaaaaagtt ggagttggta acttcgtaga gcttttaaat tttygtgttg atgctggtga 2400 tcaaatatta gctaatcatc tttcttcaag tccaaaaaat gcaacctata tatcaaaaac 2460 tacccaaaac caacttattg attcttgtgg gaaagcaatt gaggaagttt taataaaaaa 2520 tgtcaagcat tcaatttttt tttcaattct ttgtgatgaa gcagttgact gttcaaatac 2580 tgaacaaatg tcacttgttt taagatacgt aaattcaaat aatgagattt gcgaggattt 2640 cttaagattt attgattgca agactggtac atctggtctt agtctatctc taaatgtact 2700 taatgcactt caagagtttg gacttgatat tcaaaattgt agaggccaag ggtatgacgg 2760 tgcgggttgt atggcaggtg aatataaagg agttgcttct cgaataaaag ctcttaatca 2820 taaagctata tttgttcatt gtgcaagcca tagacttagt ttagtagtag cagctgcatg 2880 ccaagttcaa aaagtaaaaa atcttcttgg ccaagtaaaa gaaatttctt atttttttaa 2940 cttatctcct aaacgtagta actgtctcaa aaaatattta agtccaaatc aagaaaaaat 3000 aatagatact tgtcgaacaa gatgggttca aaagctgaga ggtcttgatg tttttttttg 3060 ataattttat acctattatt tttattcatg ctttagagga gatggggtta aatgagtcaa 3120 aggaatataa cagtgaaact gcttccaaat cctcttcttt tttaagatta ttaacagatt 3180 tctcttttat tgtaagttta gttattacaa aacaattaat ggattttttt tatgcaatta 3240 ctgtaactct tcaaactaaa tcttttgaca tatcccagca gtgttttgaa ataactaatc 3300 taaaaaatct gttattagaa attaaaaata aaattgatat atatcacact gaatggtacg 3360 ctatcgctct tagtttggcc aaaactcttg atatacaaga agtgagacct agattgtgta 3420 atgtccaggt ttaccgggat aattatccaa caaatacagt ttgtgactat tttaaacata 3480 gcattacttc tccattaatt gaacatctta ttaatgagct agataatagg tttccagaaa 3540 atggcatgtt tgtttataaa ggtctagcag cagttccttc aactgtactg tccagaatac 3600 aagttaaaaa accatggaaa tctgattttt atgaattttt aaatttctat tcatctgata 3660 tgcctcattt tacttcaata catgctgagt tagatttatg ggaattattt tggaaaaacc 3720 agtcaagtat tccttcaact gttgctggta ctttaaagtc cattgacatg agaggttttc 3780 caaatataag gacagcattt ataattttag gaacaattcc aataacaact tgtgaatgtg 3840 aaagaagtat ttcagttata cgtaggctaa aaacatactc cagaagtaat atgattgagt 3900 ctcgctttaa ctctttagca ttaatgtcca tacatcaaga aatttttcca gatgttgaaa 3960 gagtcattga cattttttca aagtcaggtg aaagacgttt agacttttat ttagtagtgt 4020 actaactttt attttacttt tttcttattt ctaatttgat ttttactaat attgtgtgtg 4080 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtatata tatatatata tatatatatc 4140 cagaagtgga gggccaacct taggcaatat tactttacac tccagaaatg gagggctaac 4200 cttgggcaaa atgacagtat acagatgaag tgcaaagtcc tatggcttac cttctcgttg 4260 gtgcactttg ttaagtaaat taggcttaaa attacatgtc taatttatga acaaaatagc 4320 caaggcaaca tttgtaaaaa tttctgctaa aaatggatac tgcaaaagca atacaaaatt 4380 tagttattat tcaatgatat aaacagcggc gccactcgta aaaaaagctg tttctccact 4440 ttttatattg actatactta aaccctagag aggttgcggg ggggcgtcga taaaaaaatt 4500 tcgcatcacc catttaaaat gctagatccg cccctg 4536 // ID Copia-135_AA-LTR repbase; DNA; INV; 151 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-135_AA_; KW Ty1_copia_Ele191; Copia-135_AA-I; Copia-135_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-151 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 151 BP; 43 A; 29 C; 32 G; 47 T; 0 other; tgttggaagc gaagcgacta tgaaactcca gtaggctatg gctagctgac gtataattgt 60 aaacgagtag aaaaataaat ttgttcattc tgttcttgcc cactgaagta gaaggttgtt 120 tttattctta cgactacggc ccactttccc a 151 // ID BEL-220_AA-I repbase; DNA; INV; 6715 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-220_AA_; KW BEL-220_AA-LTR; BEL-220_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6715 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 897-897 (2011). XX DR [2] (Consensus) XX CC Positions [5664-6242] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(3350..4363,4367..5518) FT /product="BEL-220_AA-I_2p" FT /translation="MTRFWSCEEVESGKAYSPEEARCEKWFSSTVQRDSGG FT RYTVALPRMEDAVSRLGESKDIAFRRFQGTERRLARDPSLRQQYVAFMAEY FT LELGHMRIVSETDDTSTKRCYLPHHPVVKESSTTTKVRVVFDASCKTATGV FT SLNDVLLVGPVVQEDLRSIILRSRTKQIMLVSDVEKMFRQINVRPEDRQLQ FT CILWRNSQSEDVRTYELNTVTYGTKPAPFLATRTLHQLALDDEDRFPLAAR FT AAIEDTYMDDVITGADDVETATELQLQLNAMLSGGGFLLRKWASNCPSALD FT GISQENLAIRSMDGINLDPDPSVKTLGLTWFPTTDILGFQFSIPPVTGVIL FT TKRSVLSVIATLFDPLGLLGAVITTAKVFMQQLWTIRDENGQPLDWDKPLP FT PTVGEAWWKFHEQLPLLNHPRIERCVIIQKAVGVEIHCFSDASEKAYGACL FT YIRSTDEDGRVKVRLLSSKSKVAPLKCQTIPRLELCGAVLAAQLFEKVRES FT IRMTTTCYFWTDSTCVLRWIQAAPTTWTTFVANRIAKIQIITEGCRWSHVP FT GKQNPADLISRGVTPEEILENHFWWQGPTWLHESSDRWPRSTEISSVEEVE FT EEKRRVVAGTVSTITEFNQEYFRKFSSYSDLVRRTAYWLRLMKLLQLPATQ FT RSNQGFLSTSELREAEKTLVRLVQKETFVEELKALSKGETVPRGSHLRWYN FT PFISEDHLLKLGGRLKPTRVPS" XX SQ Sequence 6715 BP; 1706 A; 1642 C; 1667 G; 1692 T; 8 other; ttttggtcct tcgagccgga tcaggaccac tccggaccct ggaagtattc gtattctcgg 60 cattgggccg aaactggaac tttcgagtag atcgccatca cccattggat tgcgaacgcc 120 atcacttcga ggaacaattg aggcgttgtt ccattcagga catcagcgcg cccaaacttg 180 gcttcacttg tgtcgatcgc catcgctaat gatcccgctt ggcttggctt ggacttcagc 240 cgggaggatt attgtgcagg ttagtgaact ctagtactat atgtccagcc aagctaggga 300 attacttcaa actaacacac gattttctct gtggtaattt ggattgccca ctcaacaacg 360 gtgcgtttga cgaggatacg acgtttggaa ggtttgccta tcatggttcc tgatttgccg 420 cccattcaaa accgtcgata ttggacacga aaaccacgga cgtggtagga atccctacgc 480 tatagttgga tattttcgcc gacttctgtt ctatccggac cactgttcag agatcatccg 540 aggtcattga ccggctacct agctgatgga ggtcatcgac tggtattgtg actgatagaa 600 tccaacgtcg acttcgacgg gcggccaccc acgcagacga ggagtatcgc ttcgacagcc 660 agaacgtttt cggcgcagag agggagcatc acaacgaatt tagtccaggt aatttaacat 720 atttagtgta ctgccgaagc tagacactca ggaaatacac cattttgact tggattcaat 780 cctcttcgtg ttacctctct gaacgctgag ccctgcttgc ccttttgggc tcctccgtct 840 ttgaatttgg attcgttgat tgaagttgga gctggacgtc ttctttgatt ggtacctatc 900 gctaggattg attgataatt tagagcatca tttgtttcct gtggtgagtt gctgcagtat 960 atgcctggcc gaagccattc cctttggata ctacataact gttctcctct tcgatcatcc 1020 cttgcttgag tgctggtttt tgagacgatt tggattcaag gctgcttgcg aattgatgaa 1080 gctaaacttg agtaccgctt gctgttttgt cgggtaaatt aatgcagtat atgcctagcg 1140 aagctaggtc tacactttgg ctactacact gcatcttttc ctcttcggtc cactaattat 1200 tgctgagttg tcgtcgatcg aaawtcggac gcgtgacgtc acgctactgc ggtcgatttc 1260 cttgaggttt tggagcgcmw ctactcgttt ctccgggtaa attaagatag tatatgcctg 1320 gcgaagctag gccattgcag tatactacac cgcatctctc tctttctgct gcattgcacc 1380 gacgacgaca acatggtatt acctggcgga gcatcagcaa cgaaggcgcc gacgatgagg 1440 gcattgacgg cgaggctgaa ggaaatccag ctgtccttca acgacatcta tcgcttcact 1500 cagaagttct ccgaagccaa cagtatcacc gaggtggaaa tacggatggg caaactggat 1560 gagctgtggg agagttacag tgcaaccatg gtggaaatct tcgcgcacga ggactacaac 1620 ccagaaaaaa tgtcgctgga gaaggaacgt gtggmgttca gcgaccaata ctatggggtc 1680 aaaaccttcc ttctggacaa ggtcaaggag cttcagaaac cttcagtttt ggagcaatcc 1740 agccgagctg gtgatgggcc agcatcgaat accagcgacc acgtccgtct tccgcagatc 1800 caactgcaga cgttcaatgg ggacatcgac gattggctga gttttcgcga tttattcacc 1860 tcactcattc actggaaggt ggatctcccg gaggtggaga aatttcatta tttgaagggc 1920 tgccttcaag gtgaacccaa ggccctaatc gatccactcc caattacgaa agccaactat 1980 caggtggcat gggactcgct gttaaagcgc tataataata gcaagcagct gaggaagcgg 2040 caggtgcaag ctatcttcaa gctaccaact ctatccaagg aatctggagc cgatctgcac 2100 attctactag aaggcttcga acgcatcgtg cagaccctcg atcaggtcgt tcaacaaaac 2160 gaatacaagg acttgttgtt cgtcaacatc ctcacagcac gtttggatcc ggtcacgcgt 2220 agggggtggg aggaagtttc ttctacgaag gagcaggaca ctttggagga tttgttcgag 2280 tttttacgtc gaagaatcca ggttttggac tgtcttccac caaaaccaac cgacactagg 2340 ggtgccggcc agttccagca gcagccaaag ccaaggacac aacccatgaa ggctagctac 2400 agttccaccc aatcaatcca ggcgtctagg ggacgctgtg tggcatgctc ttcggatcat 2460 ctcctctacc agtgcagtga attccagagg atgacagtat cggataggga cagtctcttg 2520 aagtcccatg gactttgccg caactgtttc cgggtggggc accaggccaa ggattgtcat 2580 tcaaaatact tctgccgaaa ctgcaaggcg cgtcaccaca ctctggtctg tttcaagcaa 2640 gaaagggaaa aggaaaccaa ggttgcgacc gttgctgggg gcaacaagtc gtgcaattcc 2700 aaggatcacc aggaatcatc caatatccaa gtggcaaatg tggcagccac ggaaactgca 2760 gtgtccgctg cagctcacca acatccaacg caggttctct tggcgacagc agttgtgatg 2820 atcgaggacg acgtaggcaa tcgtcatcct gctcgtgcta tgctggattc cggatccgag 2880 agtaatttta ttacggaacg actgagtcaa cggctacaag tccatcgaga tcgggtagac 2940 atttcggtgg caggaattgg ccaggtggcc accaaggttc gacaaaggct tagagcggtt 3000 cttcggtcac gagtttcgga cttttctcgt gagttaggct tcctagtcct tccaaaggtc 3060 acagtgaacc tcccaacaac cactatcaaa acggatactt ggacactccc aagtgggatt 3120 caactggcgg atcccacatt cttcgaatcg agcgcggtgg atctcgtact cggtatcgag 3180 tgtttttttc gaatttttcg aaacaggaaa cagaatttcc ctgggggaac aacttccagc 3240 tttaactgaa tcagtgttcg gttggatcat cagtggtgga aatgcatttc ctgcacgttc 3300 tctccacata agctgtaatg cgtccaccct tgacagtttg gataccttga tgactcgctt 3360 ttggtcctgc gaggaggtgg aatcaggcaa ggcttattct ccagaagagg cacgctgcga 3420 aaaatggttc tccagtacgg ttcaacgaga ttccggtggt cgttatacgg ttgcattacc 3480 aaggatggaa gacgctgtgt caaggttggg cgaatcgaag gacatagcgt ttcggcgctt 3540 ccaaggtact gaaaggagac tggctcggga tccatcactc cgacaacagt acgtcgcctt 3600 tatggcggag tatcttgagc tcggccatat gagaattgta agtgagacwg acgatacctc 3660 aacaaaacgg tgctatttac cacatcatcc cgtggtgaag gaatcctcaa cgacaactaa 3720 ggttcgmgtt gtcttcgacg cctcctgtaa aacggcaaca ggagtttcgc tcaacgatgt 3780 actattggta gggccggtag tacaggaaga tcttcgatcc ataatccttc gcagtcggac 3840 caagcaaatc atgttggtat cggatgtcga aaaaatgttt cgtcaaatca atgttcgccc 3900 ggaagatcgc caacttcagt gtattctatg gcgaaactca caatcggaag acgtccgcac 3960 gtatgaactg aacacagtga cgtacgggac gaagccagcc ccctttttgg ccactagaac 4020 actccatcaa cttgctttgg acgatgaaga tcgatttccg cttgcagcac gggcggccat 4080 tgaagatacc tatatggacg acgtcataac gggagcagat gacgttgaaa ccgctacgga 4140 gctacagttg caactgaatg ctatgctatc gggaggaggt tttctacttc ggaaatgggc 4200 atcaaactgt ccgtccgcgc ttgatgggat ttcccaagag aacttagcta tccgttccat 4260 ggatggtatt aatttggacc cagatccttc cgttaaaacc ttagggttga cctggttccc 4320 taccacggac attcttggat ttcaattttc catccctcct gtggwcacag gagtaatcct 4380 caccaaacga tccgtattat cagtgattgc aaccctattc gaccctttag gcctcctggg 4440 agccgtgatt actacggcga aggtttttat gcaacagcta tggaccatac gtgacgaaaa 4500 tgggcaacca cttgactggg acaagccgct tccgccaacg gtgggtgagg cttggtggaa 4560 attccacgaa caactaccgc ttcttaacca tccgcgaatt gaacgatgcg taatcatcca 4620 aaaggcagta ggcgtcgaaa tccactgttt ttctgacgcc tcggagaaag cttacggagc 4680 ctgtctgtac atccgaagca cagatgaaga cggaagggtg aaggttcgac ttctttcatc 4740 caaatcgaag gttgctccat taaaatgcca gaccattcca aggttggaac tgtgcggagc 4800 agttttggcc gcacaattat tcgaaaaggt tcgcgaatcg attcggatga ccacaacttg 4860 ctatttttgg acggattcca cgtgtgtcct tcgttggatt caggctgcac ccactacatg 4920 gacgacgttt gtcgccaatc gtatagctaa gattcagatt atcaccgaag gttgtcgatg 4980 gagtcacgta cctggaaagc aaaacccggc agacctgatt tcccgaggcg ttactcccga 5040 agagattctt gaaaatcatt tctggtggca agggccaacc tggttgcatg agagttcgga 5100 tcggtggccc agatcaacgg aaatctcatc tgtagaggag gtcgaagaag aaaaacggcg 5160 agttgttgct ggaacggttt ctacaatcac cgagttcaat caggagtatt tccggaagtt 5220 ttcgtcatat tcagatctcg ttcgtcgcac tgcttactgg ttgcgcctaa tgaaactact 5280 tcaactaccg gctacacaaa gaagcaatca aggatttctc tccacatctg aactacgaga 5340 agcggaaaaa accttggttc gtttggtaca aaaggaaaca tttgtcgagg agttgaaggc 5400 gctctccaag ggtgagacag ttccaagggg ttcacatctt cgctggtaca accccttcat 5460 ctcggaggat catttactca aactaggagg acggttgaaa ccgacgcggg tccccagctg 5520 atgttgagtg tagtcaggct ccgattttgc cgttaggagg aagaagcgtt gcccgaaaca 5580 ttgtcaataa gtgccaaaaa tgctttcgct caaaaccgtc tccaattcaa caattcatgg 5640 gagaattgcc ggcttcccga gtcaccatag cacgtccatt tttgaacact ggtgtggatt 5700 actttggccc tatttaccta cgccccgttc cacgccgtgg tgtagttaag gcatacgtgg 5760 caatcttcat ttgtctgtgt acgaaagcag ttcatctaga gctagtatcg gatttgtcca 5820 cagaccgttt tcttcaggca cttcgacggt ttgtggcacg gagaggacgt tgtgcgaaaa 5880 tgtattccga caatggcaca aatttcgtcg gcgcgcggaa caaactacag gacctgttga 5940 agttgctgaa aagccgagac catcatgatt cggtttcaaa ggagtgtgcc agggatggta 6000 tccagtggca tttcatccca ccaagcgcgc ctcacttcgg aggtttatgg gaagcagccg 6060 ttagatcttc caaacatcac ctgttacgag tattgggaga aacaccggta tcacccgaag 6120 atttcaatac gctgcttacc caggttgaag cttgcctgaa ctctcgcccc cttacgccac 6180 tgtccgacga ccctaatgat ctcgaaccac tgacgcctgc ccatttcctg atcggttcat 6240 ctcttcagtc gatccctgaa ccagatttag gcgaacttcc aatcaatcgt ttagatgctc 6300 tgcaactcat tcaaaggagg ctgcwggatt tttggaagag atggcgcaga gaatatctct 6360 gccaattgcc aaggaccaag cgatggcagc ctgcaattca agtcgaagtt ggcaagctgg 6420 tagttataaa ggatgaaaat caaccaccca tgaaatggag aatgggacgt atcacggagg 6480 ttcaccctgg aagcgataag attgtgaggg ttgtgaccct taaaactgct accggtttct 6540 taacccgacc ggtagaaaag ctttgtatcc tgccgcttcc agaagacgat agtgcacaac 6600 aaattacaac agccgatcca tctcagtagc cctttcccat tccatcatcc cattccgtcg 6660 aagaggattt cctttctatt tgcagaaatt cgacgaattt cagggtgggt gaggg 6715 // ID RTE_Ele4 repbase; DNA; INV; 3435 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An RTE clade non-LTR retrotransposon family from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE_Ele4. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3435 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3435 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 298..3417 FT /product="RTE_Ele4_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="PHQRDRPSVSWDVKQSWHDGAPARQECWRRPNKPPVK FT NLIANNIGDNTTRNNRQRPTRRNKDYDWKLGTWNCKSLGFAGCDRIIYDEL FT HPRNFDIVALQELCWTGQKVWKSGHRAATFYQSCGTTNELGTGFIVLGKMR FT QRVIGWQPINARMCKMRIKGRFFNYSIINVHCPHEGRPDDEKETFYAQLEQ FT IYDGCSPRDVKIVVGDMNAQVGREEMYRPVIGRNSLHAVSNDNGQRCVNFA FT ASRGMVVRSTFFPRKDIHKTTWRSPDNLTENQIDHVLIDGKFFSDITNVRT FT YRSANIDSDHYLVAICMRSKLSTVTNSRRSRTPRLNIERLRDSEVAQEYAQ FT QLEVALPTEEQLGAATLEDGWRHIRSAIGSTATAALGPAAPNQRNDWYDGE FT CEQLKNEKNAAWARMLQHRTRANEARYKQARNRQNSIFRRKKRHQEERDRE FT AMEELFRAKDTRKFYEKLNRSRKGFVPQADMCRDFDGNLLTNEREVVERWR FT QHYDEHLNGDVASSGGGTEITLGARADDERLPPPDLQEVETEIRRLKNNKA FT AGVDQLPSELLKYGGDALARALHWVISKIWEEEVLPEEWMEGVVCPIYKKG FT DKLDCANYRAITILSAAYKVLSQILCRRLSPIAREFVGQYQAGFMGERATT FT DQIFAIRQVLQKCREYNVPTHHLFIDFKSAYDTIDREQLWQIMHEYGFPDK FT LIRLIKATMERVMCVVRVSGTLSSPFESRRGLRQGDGLSCLLFNIALEGVI FT RRAGIDTSGTIFRKSVQLLGFADDIDIVARNFETMADTYIRLRAEARRIGL FT NINVSKTKYMIARGSREDTARPPPRVHIDGDEIEVVEEFVYLGSLVTADND FT TSREIQRRIVAGNRAYFGLRRTLRSSKVRHRTKLTIYKTLIRPVLLYGHET FT WTLRAEDQRALGVFERKVLRTIYGGVQMEDGTWRRRMNHELHQLLGEPTIV FT HLAKIGRLRWAGHVTRMSDTNPVKMVLENNPTGTRRRGAQRARWVDQVEDD FT LRTLRRMRGWQRAAMDRVEWRRLLRTAEATQALA" XX SQ Sequence 3435 BP; 969 A; 834 C; 967 G; 665 T; 0 other; gggctgcaaa acggtgagtc gacaacaagg aaggagcgtt caacacagct ctggtcctca 60 caagttccta cctcacgctt ccacgggtca aacgatgaca aagaccgcca gctaagggtt 120 gcgtacttag ctggtagtgc agcctgggca ctgttgtcct tctgacatca gctagattga 180 ggaggtacgt ctcgagcgtc tgttcaccag gaggtgcggc tcaaacagcg tctgttctgg 240 tatccagcgg ctgagcaaga aatgctgaat cgcgcacagc taaatccaag gtggtagccc 300 catcagcgcg atcgtcctag tgttagttgg gacgttaaac agagctggca cgatggcgct 360 ccggcgagac aggagtgttg gcgtaggccc aataagccac ccgtaaaaaa ccttattgcg 420 aataacatag gagataatac gacccggaac aatcggcaaa gacccacgcg acgaaataag 480 gattacgatt ggaaacttgg aacatggaac tgcaagtcac ttggtttcgc aggttgcgac 540 aggataatct atgacgaact acatccccgt aacttcgata tcgtagcgct gcaggaactt 600 tgttggactg gacagaaggt gtggaaaagc gggcatcgag cggctacctt ctaccaaagc 660 tgtggcacca caaatgagct aggaactggc ttcatagtgt tgggcaagat gcgacaacgc 720 gttatcgggt ggcagccgat caacgcaagg atgtgcaaga tgaggataaa aggccgtttc 780 ttcaactaca gcatcatcaa cgtacactgc ccacacgaag ggagacccga cgacgagaag 840 gagacgtttt acgcgcagct ggagcagatt tatgacggat gctcgccgcg tgacgtgaaa 900 atcgttgttg gcgacatgaa cgcacaggta ggacgagagg agatgtacag accggtgatc 960 gggcggaaca gtctgcacgc cgtatctaac gacaacggcc aacgatgcgt caactttgca 1020 gcctctcgtg gtatggtagt ccgaagcacc ttcttccccc gtaaagatat ccataagacc 1080 acctggagat cacccgataa cctaaccgag aaccaaatcg accacgttct aatcgacgga 1140 aaattcttct ctgacattac caatgttcgc acataccgca gtgcgaatat agattcggat 1200 cactacctag ttgctatatg catgcgctca aaactttcga cggtgacaaa ctcgcgtcga 1260 agccgaacgc cgcggcttaa catcgagcgg ctacgggact cagaagtagc ccaagaatac 1320 gcgcagcagt tggaagtggc cctaccaacg gaagagcagc tcggcgccgc cactcttgaa 1380 gatggctgga ggcacataag atccgccata ggtagcaccg caacagcagc actaggtcca 1440 gcggccccga atcagagaaa tgactggtac gacggcgaat gcgagcagtt gaagaatgag 1500 aagaatgcag catgggcgag aatgctgcaa caccgcacga gagcgaacga ggcacggtat 1560 aaacaggcgc ggaacagaca aaactcgatc ttccgaagaa aaaagcgcca tcaagaagaa 1620 cgagatcgcg aagcgatgga agagctgttc cgcgctaaag acacacggaa gttctacgag 1680 aagcttaacc gttcgcgcaa aggctttgtg ccacaagccg acatgtgcag agactttgac 1740 ggaaatcttc tcacgaacga acgtgaggtg gtcgaaaggt ggcggcagca ctacgacgag 1800 caccttaatg gtgatgtggc cagtagcgga ggtggcacgg aaataacttt gggagcacgc 1860 gcggacgacg aaagacttcc gcctccagat ctccaagagg tagaaacgga gattagacgg 1920 ttgaaaaaca acaaagccgc tggagtggac caacttccta gcgagttgct aaaatacggt 1980 ggtgatgcac tggcaagagc gctgcactgg gttatttcca agatttggga ggaggaagta 2040 ttgccggagg agtggatgga aggtgtcgtg tgtcccatct acaaaaaggg cgacaagttg 2100 gattgcgcca attatcgtgc gatcacaatt ttgagcgccg cctacaaggt actctcccaa 2160 attctatgcc gccgtctatc accaattgct agagaattcg ttggacaata tcaggcagga 2220 tttatgggcg aacgagcaac aacggaccag atattcgcca tccgccaggt gttgcagaaa 2280 tgccgcgaat acaatgtacc cacacatcat ttattcatcg attttaaatc ggcctatgat 2340 acaatcgatc gagaacagct atggcagatt atgcacgaat acggtttccc ggacaaactg 2400 ataagattga tcaaggcgac gatggagcga gtgatgtgcg tagtccgagt atcagggaca 2460 ctctcgagtc ccttcgaatc tcgcagaggg ttacggcaag gtgatggcct ttcgtgttta 2520 ctgttcaaca ttgctttaga gggtgtgata agaagagcgg ggatagacac gagtggcacg 2580 attttcagaa agtccgttca gttacttggt ttcgccgacg acattgatat cgtagcacgg 2640 aactttgaga cgatggcgga tacgtacatc cgactaaggg ctgaagctag gcgaatcgga 2700 ctgaacatca atgtgtcaaa gacgaagtac atgatagcga ggggctcaag agaagacacg 2760 gcacgccccc cacctcgagt tcatattgac ggtgatgaaa tcgaggttgt cgaagaattc 2820 gtgtacttgg gctcactggt gaccgccgac aacgacacca gcagagaaat ccagagacgc 2880 atcgttgctg gaaatcgtgc ctactttgga ctccgcagaa cgctccgatc gagcaaagtt 2940 cgccatcgca cgaagttgac catctacaag acgctgatta gaccggtact cctctatggg 3000 cacgaaacat ggaccctacg tgcagaggat caacgcgccc ttggtgtttt cgaacggaag 3060 gtgttgcgga ccatctacgg cggagtgcag atggaagacg gaacgtggag aaggcggatg 3120 aaccatgaat tgcaccagct gctgggagaa ccaaccattg tccacctcgc aaaaattggg 3180 aggctgcggt gggccgggca tgtcaccaga atgtcggata ccaacccggt gaaaatggtt 3240 ctcgaaaaca atccgaccgg cacaagacga cgtggtgcgc agcgagcaag atgggtcgat 3300 caagttgagg acgatctgcg gacccttcgc agaatgcgtg gctggcaacg ggcagccatg 3360 gaccgagtcg aatggagacg tctcctacgt acagcagagg ccacccaggc cttagcctga 3420 ctggtaaggt aagta 3435 // ID Gypsy-13_OD-LTR repbase; DNA; INV; 430 BP. XX AC CABV01000243; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_OD_; KW Gypsy-13_OD-I; Gypsy-13_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-430 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000243; Positions 6254 5825. XX SQ Sequence 430 BP; 118 A; 104 C; 84 G; 124 T; 0 other; tgtaaggttc gccccgtccg aaccaagagc ctcttctcca caacattgcc gcgggcgcgc 60 attcgctgag agcgctcgcc tccttcttac gttaaaccct ttactgccca cgcattctgt 120 ctatataacc cgccaaactt gaattccctc tgactcgcaa aaacaaaatc ttcgttttta 180 cttcaacaaa cttaggcaag tgacttctga ttagagagac attattgcat gtgacttgaa 240 taaaagagat aatgtgaaag atttgactaa ctttgaccac agatgtattt gtatagtagc 300 tcaacatgag gatgctcacc ttgtaagcct tcaaattgcc acagttgatt acagttagta 360 ggactccggt cctcagctgt aatcagttag ttggtttagg agaaggggaa ccttatagac 420 tcttttgcca 430 // ID Gypsy-12_DWil-I repbase; DNA; INV; 6088 BP. XX AC scaffold_180716; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_DWil_; KW Gypsy-12_DWil-LTR; Gypsy-12_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6088 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180716; Positions 27256 21169. XX CC Positions [5102-5590] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 14..6016 FT /product="Gypsy-12_DWil-I_1p" FT /translation="MTTRNDKYFGTRQATRSTSRMAASTPTAAGAPTALTP FT AAVPAATGAAAAVPRAAVPTADILAPAGPAAAGPTADILAPAGPAAAGPTA FT ATLAPAGHAAAVLAQVGPTADMLAPAGPTAAIRAPAGLTDATFAPAGPTAD FT IFAPAGPTADMLAPAGPTAAIRAPAGLTDATFAPAGPTTDMLTPAGPTAAA FT SAPAGPAPAGPTATAHVPAGPTATTHAPAGPAAAASAPAGPAAAASAPAGP FT AAAASAPAGPAAAASAPAGPSAAAYAPAGPTTAASAPAGLTAVTLAPAGPT FT AVILAPAAHAAAVPAAYKPPRQTAAAAASAPAATAVPAATVVPTTAAIAVP FT TTAAIAVPAAAAAASITVPAAAATSTVPADAVPADYMSAMRVAALLAAPVT FT GLDAEPWEEMPALVPLVPPVPTSAAAITPATSASRSLDEQLATLQKQHELL FT QLQHQILKLQQGIDEFQAPRQQVSDVSVIESMVARFSADNAYDVKKWLNDL FT EDAFGILQLNDRLRLIACRRMLEGTAAVLLRTVSVHSYEELKAILLREFGR FT SCSVEEVYHALRGRRIKAGEGCLRFAIEMQEIAMNAPIPENELVDLIIDGL FT RDNSTRVGMLYSARTVAELKPLLERYERLRMQAAASRQSNSVKAQSTTTAV FT APGNKSSGTATSETRCFNCSGWGHYKSQCPKPIRPPNSCFKCGIVGHIYKN FT CPSRIGNATNITAAAADEAGDRLDHLQLVSVAFVCDEVQDRKFIDMFSLLD FT SGSPKSFIRESEVPSNCRLSPRTLNCRGLGNSQLLSLGQVQCRVRLRNTQI FT IHTFEVLTNQATELPMIIGRDLLYKINIRLCKLKELKYEKESLILMNKQTE FT CINPKLTAVLKSFDIFMEKRVILPCDRNEIVPAELLSDSPLPKTSAVPSET FT LNDALMAINSIELPAVSELDFDIECDRNEIVTAELLSDSPLPKTSAVPSGT FT LNDALMAINSIELPAVKELDFDIEPALREEQTAALKSIISASYVQVDPDLI FT KPMNYEMKIELTSQTPIFCRPRRLSHHERIEVREICDDLLKAGIIRHSSSP FT YAAAIVPVRKKDNTLRKCVDYRPLNKITVRDHFPIPLLDDCIEHLGGKSFF FT TILDLKNGFHHVKMHPDSIKYTAFVTPDGQFEYVRMPFGLCNGPSVFARFV FT YFVLKEFINRGQLVIYMDDILVASHDFETHLEILKEILYVLRVNGLCLKLT FT KCRFAYRELEYLGYLANSAGITPKPAHIQAIRDLPIPRDSKELERCLGLFS FT YFRRFVSGFAKIACPLSRLLRKDTPYVFSDECLEAFRTLKLKLIESPVLSI FT YNPKSETELHCDASSIGFGAVLLQKQTDGKLHPVSYFSKMASAAESKLHSY FT ELETLAIVYSLKRFETYLKFIPFKIVTDCNSLALTLRNGGHSAKIARWALL FT LENYNYTIVHRSGLGMPHADALSRIETAAYVDDINLDFQLQLTQNRDPEIV FT KLRDELEKYEIANFELRDGVVYRLSSTSIPQLYVPSEMVNHVIRVVHEKLG FT HMATEKCCAQIKKHYWFPCMKSYVNNYIKNCLKCIYYSASASDHRVTLHSI FT PKVPLPFDTVHIDHLGPLPSIRSKKKYLLVVIDAFTKFVKLYPAATTNSRE FT VCQTLESQYFANYSRPRRIISDRGTCFTSSEFEEFLNANNIIHVLNATAFL FT RYNYIKNCLKCIYYSASASDHRVTLHSIPKVPLPFDTVHIDHLGPLPSIRS FT KKKYLLVVIDAFTKFVKLYPAATTNSREVCQTLESQYFANYSRPRRIISDR FT GTCFTSSEFEEFLKANNIIHVLNATASPRANGQVERVNRVLGPMLGKLTEP FT VDHADWVQQLPNIEFALNNSVHSTTKFAASVLLFGVEQRGVVVDELTERLG FT ERTEEVDASEVDVRRLAFRNIVKSQEYNSEYYSKHHLPAVSFEEGDFVVIK FT NVDTSAGTNKKLIRRYRGPYVVHKKLPNDRYVIRDVEGSQITQLPYDGVLE FT ASKLKLWVAPDQDCTSDSQTKATPAT" XX SQ Sequence 6088 BP; 1461 A; 1717 C; 1432 G; 1478 T; 0 other; actcagaagt gggatgacca cccgaaacga caaatatttc gggacacggc aggccacacg 60 ttccacatcg agaatggctg cctccacgcc caccgctgct ggcgcaccca ctgctttgac 120 tcccgctgca gtaccggcgg ccactggagc cgccgctgca gtaccgagag ccgctgtgcc 180 aactgccgac atactcgccc ctgccgggcc cgccgctgct gggccaactg cggacatact 240 tgcccctgcc gggcccgccg cggctgggcc aactgccgcc acacttgctc ctgctgggca 300 cgctgctgcc gtactcgccc aggttgggcc aaccgctgac atgctcgccc ccgccgggcc 360 aactgctgct atacgagccc ctgctgggct aactgatgcc acattcgccc ctgccgggcc 420 aacggctgac atattcgccc ctgccgggcc aaccgctgac atgctcgccc ccgccgggcc 480 aactgctgct atacgagccc ctgctgggct aactgatgcc acattcgccc ctgccgggcc 540 aacgactgac atgctcaccc ctgccgggcc aactgctgcc gcatccgccc ctgctgggcc 600 cgcccctgct gggccaactg ctaccgcaca cgtccctgct gggccaactg ctaccacaca 660 cgcccctgct gggcccgccg ctgctgcatc cgcccctgct gggcccgccg ccgctgcatc 720 cgcccctgct gggcccgccg ctgctgcatc cgcccctgct gggcccgccg ctgctgcatc 780 cgcccctgcc gggccctccg ctgctgcata cgcccctgct gggccaacca ctgctgcatc 840 cgcccctgct gggctaaccg ctgtcacact cgcccctgct ggcccaactg ccgtcatact 900 cgcccctgct gcacacgccg ctgctgtgcc cgccgcttac aagccacctc ggcagactgc 960 cgctgctgcc gcctctgcac ccgctgccac tgctgtaccc gctgccactg tcgtacccac 1020 caccgccgcc attgctgtac ccaccaccgc tgccattgct gtacccgctg ccgctgccgc 1080 cgcatccatt acggtacctg ccgccgctgc cacatccact gtacctgcgg atgctgtgcc 1140 cgccgactat atgtctgcca tgcgggttgc ggccttgctc gccgcccccg tcaccggtct 1200 cgacgcggag ccatgggaag agatgcccgc tctagtacct ctagtacccc ccgtaccgac 1260 ttccgcggcc gccatcacac cggccacttc cgcctcccgc agcctggatg agcaactggc 1320 gacgctgcag aaacaacatg agttgctaca gctgcagcat cagatcctca agctgcagca 1380 agggattgac gaattccaag caccgaggca gcaagtcagt gacgtgagtg tcattgagag 1440 catggtggct cgattttcag cggataacgc atatgacgtg aagaaatggc tgaatgacct 1500 ggaagacgca tttggaattc tgcagttgaa cgaccgccta cgtttgatcg catgccgtcg 1560 catgctggaa ggtacggcgg cggtgctgct gcgcacggta tcggtgcaca gctacgaaga 1620 attaaaagca atcctcttgc gagaatttgg acgctcgtgt agcgtcgagg aagtgtacca 1680 cgctctgcgt ggccggcgca tcaaggcagg cgagggatgt ttgagattcg caatcgagat 1740 gcaggagatt gccatgaatg ctccgatacc ggagaacgag ctggtcgacc tgataattga 1800 tggattgcgg gacaattcta ctcgagtcgg catgctgtac tccgcccgga cggtggccga 1860 actgaagccc ttactggaac gttacgaacg cctccgtatg caggccgctg cctcccgaca 1920 gagcaacagc gtgaaggcgc agtcaacaac aactgccgtg gcgcctggaa acaaatcgtc 1980 gggcaccgct acgagtgaga ccagatgttt caactgctca ggatgggggc attacaagag 2040 tcaatgcccg aaaccaataa gaccgccgaa ttcgtgtttc aagtgtggca tagtgggcca 2100 catctacaag aactgcccgt ctcgaattgg gaatgccacc aacatcactg ccgccgccgc 2160 tgatgaagcg ggagatcgac tagaccattt acaactggtg agtgttgctt ttgtttgtga 2220 tgaagtgcaa gataggaaat ttatagatat gttttctctc cttgattcgg gcagcccaaa 2280 gagctttatc cgtgaatctg aagtcccctc caactgccgt cttagccccc gtactttgaa 2340 ctgccgtggc ctcggaaata gccaacttct gtccttaggc caagttcagt gtagggttcg 2400 cctccgcaac acacagataa tacatacgtt tgaagtcctg acgaaccaag caactgaact 2460 gccgatgatt ataggtagag atcttctgta taaaattaat attagactat gtaaactgaa 2520 agaactgaaa tatgaaaaag aatcgttgat actcatgaat aaacagactg aatgtataaa 2580 tccaaaactt actgctgttc tgaagtcctt tgacatattt atggaaaaga gagtgatact 2640 gccgtgtgat aggaatgaga ttgtgcctgc cgaattgttg tctgattctc cccttccgaa 2700 aacttctgct gtaccttcgg aaacattgaa cgatgcactt atggcgataa attccattga 2760 gctccctgct gtcagtgaat tagactttga tatagagtgt gataggaatg agattgtgac 2820 tgccgaattg ttgtctgatt ctccccttcc gaaaacttct gctgtacctt cgggaacatt 2880 gaacgatgca cttatggcga taaattccat tgagctccct gctgtcaaag aattagactt 2940 tgatatagag cccgcgttac gggaagagca aaccgcagcc cttaagtcta ttatttccgc 3000 ttcgtatgta caagttgatc cagatttaat caagcccatg aactatgaga tgaaaataga 3060 attgactagc caaactccga ttttctgccg cccccgccgt ctgtcgcacc acgaacgcat 3120 cgaggtgcgt gagatttgtg acgatctctt aaaagctggc attattcgcc acagtagttc 3180 cccttacgct gccgcgattg tccctgtacg caaaaaagac aacacgctaa gaaagtgtgt 3240 agattaccgc ccgttgaaca aaattacagt tcgagatcat tttccgattc ctctgttaga 3300 tgattgcatt gaacatctgg gtggcaaatc attcttcacc atcttggact tgaagaacgg 3360 tttccaccat gtgaaaatgc acccagactc aatcaaatat actgcctttg taacccctga 3420 tggccagttt gaatatgtac gaatgccctt cggactttgt aatggccctt ccgtgttcgc 3480 tagattcgtt tattttgtcc ttaaggaatt tataaaccga ggtcaactgg tgatatatat 3540 ggacgacatt ctagtcgcgt cgcatgactt tgaaacccac cttgagattc tgaaggaaat 3600 attgtatgtg ttaagagtga acgggctatg cctgaagctg acgaaatgcc gttttgcgta 3660 tagggaacta gagtacttgg gctacttagc caattccgcc ggtatcacgc ctaaaccagc 3720 tcatattcag gcgatacggg atttgcccat tccaagggat agcaaggaat tagagcgttg 3780 cctaggccta ttctcatact tccgccgatt tgtatcaggt ttcgctaaga ttgcgtgtcc 3840 gctgtcccga ttgttaagaa aagatacccc ttatgtcttc tccgatgaat gtcttgaagc 3900 cttccgaaca ctgaagttga aattgatcga atcccccgtg ttatctattt acaacccgaa 3960 gtctgaaaca gagttacact gcgatgctag ttcaattggt ttcggagcag tgctcctcca 4020 gaaacaaaca gacggtaaat tacacccagt gtcctacttc tctaagatgg cctccgccgc 4080 tgaatcgaaa ctacacagtt atgagctaga gacgttagct atagtgtatt ccttaaaacg 4140 ttttgagaca taccttaagt tcatcccgtt caaaatcgtg accgattgta attcccttgc 4200 gctgacactg agaaatggtg gacattccgc caagattgct cgctgggccc tgctgttgga 4260 aaactataac tacacgattg tacaccgctc tgggttgggt atgccccatg ctgatgctct 4320 aagtagaata gaaactgccg catacgttga cgatataaac cttgacttcc agctgcagtt 4380 aacccagaac cgggaccctg agattgtcaa actccgagat gagttagaga aatatgagat 4440 agcgaatttc gaactgcgag atggagtagt gtaccgcctg tcctcgacga gtatacccca 4500 actttatgtc ccttccgaaa tggtaaatca tgtcatacga gtagtgcacg agaagcttgg 4560 ccatatggcc acggagaaat gttgtgcaca gatcaagaag cattactggt tcccgtgtat 4620 gaagtcttac gttaacaatt acattaaaaa ttgcctcaag tgtatctact actccgcgtc 4680 cgcttctgac caccgtgtca cgctacatag tattccgaaa gtgccccttc cgtttgatac 4740 cgtccatatt gaccatttgg gcccccttcc gtccatacgt tctaagaaaa agtatcttct 4800 ggttgtcatt gacgctttca ccaaatttgt gaagctgtac cctgccgcca cgactaattc 4860 acgtgaagta tgccagacat tagaatctca gtatttcgca aattacagta gaccccgccg 4920 aatcatcagt gatcgaggta cctgctttac ttcctccgag tttgaagaat ttctgaacgc 4980 caataatata attcatgtgc tgaatgccac tgccttctta cgttacaatt acattaaaaa 5040 ttgcctcaag tgtatctact actccgcgtc cgcttctgac caccgtgtca cgctacatag 5100 tattccgaaa gtgccccttc cgtttgatac cgtccatatt gaccatttgg gcccccttcc 5160 gtccatacgt tctaagaaaa agtatcttct ggttgtcatt gacgctttca ccaaatttgt 5220 gaagctgtac cctgccgcca cgactaattc acgtgaagta tgccagacat tagaatctca 5280 gtatttcgca aattacagta gaccccgccg aatcatcagt gatcgaggta cctgctttac 5340 ttcctccgag tttgaagaat ttctgaaggc caataatata attcatgtgc tgaatgccac 5400 tgcctcaccc cgggccaacg gccaagtcga gcgggttaac cgtgtgttgg gccccatgtt 5460 gggtaaactg acggaacctg tggaccacgc agattgggtc caacaactgc cgaacataga 5520 gtttgcattg aataactccg tccactccac aaccaaattt gccgcttccg tacttctatt 5580 cggtgttgaa caacgaggtg tggttgtgga tgaactaacc gaacggttgg gtgagagaac 5640 tgaggaggtc gatgcctcag aggtagacgt ccggagatta gccttccgta acattgttaa 5700 gtcccaagaa tataattcgg aatattattc gaagcatcac ctccctgccg ttagttttga 5760 ggagggggat ttcgtagtta ttaagaatgt ggatacttcg gcaggcacga acaaaaagct 5820 tatcaggcgg tatcgtggtc cgtatgtggt tcataaaaaa cttccgaacg accgatatgt 5880 tatccgggat gtagagggct cccaaatcac ccaactacca tatgatggtg tcctagaagc 5940 ctccaaatta aagctatggg ttgcccctga tcaagattgc acttctgata gccaaacaaa 6000 ggcgacacca gccacttaag attcttattt accccctccg ttgttaccaa ttggaatcga 6060 ggtcgattcc tcgtcaggac ggccgagt 6088 // ID BEL-9_AA-I repbase; DNA; INV; 4082 BP. XX AC supercont1.344; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_AA_; KW BEL-9_AA-LTR; BEL-9_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4082 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.344; Positions 978899 974818. XX CC 'AAGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 715..3192 FT /product="BEL-9_AA-I_1p" FT /translation="MVNKTEKKLEGNILTLKRVLAIRDVVERFVAEYNHDR FT DAIQVAVRLESLDRINQEFQRAQGEIERLDSEKFEEHIEVRTAFEDKYCVL FT KGFLLSKMNPGQRSMDSTFFNNLTHQSAASFHHRLPKIDLPKFSGDESRWI FT SFRDNFISMIHSNDDIPTVNKLHYLLQSLEGEAKKPFESVDVQANNYASTW FT DALMKRYDNKRFLRKELFRSLFDLPMMKQESAQDLNTLVDDFQRRVKALAK FT LGEPVVHWDTPLIFILSNKLDAATLRSWEHETRQKDDVKYEELIEFLSQHV FT RMLKSVSSDLQQRFSSTNSKVAGPPARITASVKSVANTATTDPIQSIPQCL FT VCSETHLLHQCPTFIKLSVTQRRELVSQRSLCWNCFRPNHQARFCKSRFSC FT RTCHARHHSMLHDQAETAPISHVDTTKPPTQQPDSSAIAIAGPSGTSHPPE FT VSMAVQSNNSTVLLETVSLLVIDQHGKEIPVRALLDSASMSNFVSKKLANS FT LGIPRATVDVTVSGLGKSIKQVRGRITAMVKSKICNFSTTLDFLVMQKPTA FT NLPTVTVNTDSWNLPNVPLADPQFNIPGMIDIIIGGEHYHKLHTGHRLSIS FT DTMPMFINTHFGWTVSGKVSVNSAVSSPVCYLATVNHPIQPIHYQEPEAVD FT HDPINPPNQSIDLKTNTTTRIHDEGSVVHPFRSDYPQVTNTKSQKIESPHS FT YDLMQQRNRKPVTTEQTKRKHVEYVQEESPLYLLTAKHVAEPATKKTPTGT FT SATPNQRSTTTIRYHTSMQLPRFKSKQIQSVRSPNSSPRSIFRQELQVQPL FT QKIIHQYPSFIYQKLKCCSSVTIS" XX SQ Sequence 4082 BP; 1140 A; 1012 C; 899 G; 1031 T; 0 other; ttggtccttc gagccggatg aggaggacaa cccaagccag gagccagcgt ctaggaggca 60 agttggcgcc atcgtggtaa gtgtggtcaa aggcgaagac cacacaaagg taggccattc 120 attttattcg ttctatgtgg agatcagttg ggccttttac acacattccg tgcttctctc 180 cgataaagag gattgacttg cttgtaattt gcatgatcta ttgaaataaa ttgagcacaa 240 aggcataata cgatcagtgg aatagctgca gaaagacggc aaatcgtttt tgcaattttg 300 tcagtggtgg ctgataggca ttgattgcca tatttttttc aatttgcttt gcgccaacag 360 tgtcgcacat tgaaaatagt acaccgtact gggtggtgga aaatcgattg ggataagaat 420 tagatcatct tcgtatttac gatattttgc gatatcagtg gcgttcagcg gacacgcgta 480 tcgcaaaggt ggcacatcgc gacgggaagg attgatcgta tcgatttcac gtgtcacttt 540 cggtcgcacg cgtgcgtgta ggggtatcac gacgtgacgg gaaggatttt atcgatcgat 600 tgcgtggtac ctccgtcgtt tctcgcgttg aatagtgtat cgcgacggga aggagaactt 660 cgttcgattt gcactgttca cgttcggttt cgcgagtgat tcgagtggtg agtgatggtg 720 aacaaaacgg aaaagaaatt ggaggggaat atcttgaccc taaagcgagt gttggctata 780 cgagatgtgg tggaaaggtt cgtggcagag tataaccatg accgggatgc aatccaagtg 840 gcagttcgtc tagagagttt ggacagaata aaccaggagt tccaacgtgc acaaggggaa 900 atcgaaagac ttgacagtga gaagttcgag gagcatattg aggttcgcac agcttttgag 960 gacaagtact gtgtgctcaa gggtttttta ctgtctaaaa tgaaccccgg ccagcgttca 1020 atggattcaa cgtttttcaa caacctcact catcaaagcg cagcaagttt ccatcatcgg 1080 ctaccgaaga tcgaccttcc aaaattcagc ggagatgaat ctaggtggat ttcattccga 1140 gacaacttca tttcgatgat ccacagtaac gacgacatac cgaccgtcaa taagctccac 1200 tacttattgc aatcgttgga gggagaagcg aagaagccat tcgagagcgt agacgttcaa 1260 gcgaacaact atgcatctac gtgggatgca ctaatgaagc gatacgataa caaacgtttc 1320 ttgcggaaag aactgtttcg aagcttgttc gatttaccga tgatgaagca ggagtcagca 1380 caggatctca atacgttggt agacgacttt cagcggcgcg tgaaagcttt ggccaagcta 1440 ggtgaacctg tcgttcattg ggatactcct cttatcttca tactttcgaa taagctggat 1500 gcggctaccc ttcgatcctg ggagcacgag actcgccaaa aggatgacgt caagtacgaa 1560 gagctcatcg aattcctttc tcaacacgtg cgaatgctga aatctgtgtc aagcgatctg 1620 caacagcgct tttcatcgac caattccaag gtggctggcc ctccagcaag gataactgct 1680 tcagtgaagt ccgtagccaa cacagctact accgatccca ttcaaagtat ccctcagtgc 1740 cttgtctgtt ccgaaacgca tttgctacac cagtgcccaa cattcatcaa gctgtccgta 1800 acccaacgaa gagagttggt atcgcagaga agcttgtgct ggaactgttt tcgtccgaat 1860 catcaagcga gattttgcaa gtcaaggttt agttgccgaa cctgccacgc cagacaccat 1920 tcaatgctac atgatcaagc cgaaacagcc cctatttctc atgtagatac gacgaagcct 1980 cctacacagc aaccagattc ctccgcaata gcaattgcgg gaccctcagg aacatcgcat 2040 cctcctgaag ttagcatggc tgtccaatcc aacaatagca cagtccttct ggaaacagtt 2100 tctctcctcg ttattgatca acacggaaag gaaatcccag ttagagcact ccttgattcc 2160 gcatcaatgt ccaattttgt gagcaagaag ctggcaaatt cacttggaat tcctcgagcc 2220 accgtggacg ttactgtttc gggactagga aaatctatca agcaggtcag gggtaggatc 2280 acagccatgg tgaaatcaaa aatttgcaac ttctccacca cactcgattt cctcgttatg 2340 cagaagccca ccgcaaacct tccgacagtc accgtcaata cagattcatg gaacctgcct 2400 aacgttccat tggccgaccc gcaattcaat ataccgggca tgatcgacat catcataggt 2460 ggagagcact atcacaaact ccacactggt catcgtttgt caatcagtga taccatgcca 2520 atgttcatca acacacactt cggatggacg gtatccggaa aggtttcagt aaactccgct 2580 gtttcttcgc cagtttgcta tctagctacg gtgaatcatc ccattcaacc aatacactat 2640 caggagcctg aggccgtcga tcatgaccca ataaacccac caaaccagag cattgatcta 2700 aaaacaaaca ccaccactcg aattcacgat gaagggtccg tcgtacaccc ttttcggtcc 2760 gattatccgc aagtcaccaa caccaaatcc cagaaaatcg aatcacccca ctcgtacgat 2820 ctcatgcagc aacgaaatcg taagccagta actacagagc aaacaaaacg caagcacgta 2880 gaatatgttc aagaagaatc tccactctac cttctgacgg ccaaacacgt tgccgaacct 2940 gccaccaaga aaacgcctac cggaacctca gccactccga atcaacgttc taccactacg 3000 ataagatacc acacaagcat gcagttgcca agattcaaat cgaagcagat acaaagcgtc 3060 agatcaccaa attcgagtcc tcgaagtata ttcagacaag aacttcaagt ccagcccctt 3120 cagaagataa tccaccagta tccttcgttc atctaccaaa agctgaagtg ctgctcatcg 3180 gttactattt cctgaaactt caaagaccgg tagagcagtc cgaaactggc tcaaggcgca 3240 tataaggcgc aatcaggtga ttggcgttcc tttctttcca gctgagtcag ttcgatctac 3300 cgggtctttt attgttcaca gatcatccct tgttacacca tctttcatcg ctcgtattat 3360 catcaacatc ttagttccgg agactacaaa gttcggcaga gcagtccgaa actgactcaa 3420 cgcgcttttg gagcgcaatc aggtgattag cgttccaata atcttatctc ctcagttaga 3480 tctaccggat ctttcgttgt tccagttcaa catgtcatcg aagtcgtcaa cattgttttc 3540 atcccgcctc tcattgatcg acgaatattc aaagtcgaga agagcttcag ccgaatgatc 3600 tacgtcctag ttaaccgccg acttgcattc aacacaaggc gtgtttgtcc ctttcgaatt 3660 aattcatcgc atccaaccat tgttgtccat tagtgctatg cagttgctga ccattactgc 3720 aaccactaag atcggtagag tggtccgaaa ccatctcaac gcgcattcag agcgcaatca 3780 ggtgattagc gttccatttg attttactct cgtcgttcgt tctaccgggt ctttcttggt 3840 tccagatttt ctactctcat ggcatctcca atgctcgttt tgtacggatc acattcaccc 3900 atacaactct caactctact agtcgtcaat caatgttggt aaccgtgaat gggtccattc 3960 agcagtactt cgtgtcgaac caccgaagat cacccgatcg agagtacgag agaaagagta 4020 tcacgagaag taacgataga gttgtttgtt gaaacgaagt gatcgtttca aggtggccgg 4080 aa 4082 // ID Gypsy-18_DPu-LTR repbase; DNA; INV; 580 BP. XX AC scaffold_318; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_DPu_; KW Gypsy-18_DPu-LTR; Gypsy-18_DPu-I. XX NM Gypsy-18_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-580 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 752-752 (2010). XX DR Genome; scaffold_318; Positions 3194 3773. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 580 BP; 120 A; 127 C; 128 G; 205 T; 0 other; tgttatgatc gtcgccgact tgtcgacgtt cacaccgacg gattacccgg aattacctcc 60 ctttctgctt ccctattata tttgtatttg tttcaccttg ttcgacggaa ctcgacgata 120 taagatcgtc gagtgagttc ttccctttca cgtctctctg acgttgagtt ggagttttcc 180 tctttttgtt cctttcaatt gatttgaggc cactctggtg tacccttttg aaccaagacg 240 acgtcgccat aagctcggta ctcctcccaa taaccaggtt atctcgtgtt ctttttcgtg 300 tgtgagtacc ttacttttat ttgtctgtct tcgtgttgct tattgtttta aaacgcatat 360 gtgtatttgc tggtaactgg atgtcacata catgaccagt attattctat gtatgtgtga 420 agagccggaa taccataact atggccggtg gggcgagtcc aggcagacac ggtgtagtgt 480 ggccggtata ggacttgtta agatttgtgt caggaataaa gaccgagcta ctgagagtta 540 ttgtatttgt tttctatgcc cgagtccccg tttcataaca 580 // ID Copia-95_AA-LTR repbase; DNA; INV; 1042 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-95_AA_; KW Ty1_copia_Ele173; Copia-95_AA-I; Copia-95_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1042 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 1042 BP; 291 A; 231 C; 271 G; 248 T; 1 other; tgaagatcca atagtaaaca ataacaactc tcgtgaggtt atcacggttg tttatcaatt 60 ctcttttcac ccgttcccac agatatcaac tgtggtaaaa agctctcgta gaacatttgt 120 aaaggcctaa tctaatgatt tgaatttgag ctttgccaat gcggctagca gtgtggccag 180 attgaattat ataataatat catatcattt ctgtatatat cgctaatata ttatgttacc 240 ttttaacgac ttcggacgaa taaacttcaa tacagtatca gtatccaaac caagtctatc 300 gttgtgttat catttaactt aacgaacttt aacaggttat aggcccagcg gctagctgtg 360 gatcaacaga tctggttttg aagaagaaga caatgcctgg tcaaggaagt tctggaagcg 420 gtagtggaag tggcagcgga agcgggagcg gagcaggtaa catccttgac ggtcagtcaa 480 gtgatcgtac cggcagggtt actgccgaga gatcctacgc aacgctggga ttcgtccgtg 540 gacgaccgtt cccacgtctt cgacggaacg gagagccagt tcgcggcgtg gaagtggagg 600 atcgagcatc atctggccaa gatgaagctc gcccactgcc tgaagcggac gcccgacgag 660 gaaacgtttg ccaccgctgc accaggcgag acaccggagc aggagaatgc tcggttggag 720 aaataccgca agcgcgtgga agatgatgtc gacgccatcg acgagatcgt tttggccgtc 780 agcaacgacg tactgaacaa gmtcgtaggg gttacctacg caaaagaggc catggatatt 840 ctggtacgta cctatcagaa gtccgggact gcggcattga tgagtatgcg tcaacggctg 900 ttcatgctca gaaacaggaa cttcgagagc ctggagagac tgtttgacga atacgacctc 960 atcatacggg agcttgatcg tatgaatgcg aacttaacca gcagcgaaaa agtccacgct 1020 ctgttgattg cattgccgga ca 1042 // ID CR1-20_CQ repbase; DNA; INV; 5074 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-20_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5074 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 24-24 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 114..701 FT /product="CR1-20_CQ_1p" FT /translation="MGCKQCQKKVSDSERIICQGFCGASFHMICAKVDIPL FT KDALGQRPNNAFWMCDDCAKLFXNGHFRLIAKGHDEGNSEIADAVKTMQNE FT ITKLTSTVIAFTEKVAENSASTTTWPSKRPRDSETPTKVSIPSASRGTKAM FT TTVPVLASAEQDDLWYIWLSSFPPSVTEDDIRLMVKECLSVDDDDPIAVKM FT LVTFLH" FT CDS 1273..4782 FT /product="CR1-20_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MDGSQSTDTVAPPLCCNQQHCSCRPSRPGPVFGTGNG FT AIHTGNSGKYLYLPDDIARPDVLPNFSPRHATTTTAIRDIDAEHADPGRST FT YRAMDDSRSTDTVVPPLCCNQQHCSCHPSRPGPVFGTGNGVIQNLFSGEYS FT RSADIDACPDVSPSFSQQLSMTSQHGCSSDPPEVHPPVQLIPPNSSLKPLS FT VYYQNVRGLKSKIDEFFLACSELEFDIIILIETWLDDSVTFVQLFGDEYHV FT YAVNRGKKNSVRIVGGGVLIAVRRLLDSSVCKEGMVDILEQVFVTVRLAHI FT NLHICAFYLPYEKRLEHNLLEKHINSVTTVCELARSEDIIIAAGDYNQSYL FT VWRDDGRGYATADLAETHARLRTERASHETLLDGMAICNLRQIQTVRNANN FT RILDLIFISDDESLPTVHCVEDPLVPLDSHHPAVWFEIEAVQRVAFQDDFD FT PNSLNYRRTDFATMCTNLAAVDWSPVLTCTNVNDAVTQFGIIVRLHLQRLT FT PRRRPPRKPPWSDAHLKWLKKIRSAALRQYSHDHCTICRRRYKKACSTYKR FT YNKLRYRQYVHKTQCDLKRNPKKFWSFVKTKYKENGLPSSMVYGSRVASNS FT EEKCTLFAEHFASVFNKPDVQNDIDPACLDAVPSNLVSVDTFDITRSMLRK FT AIAKLKSSFVAGPDGIPAAVIKRCGSMLEVPLLKIFNMSMTEATFPEEWKK FT SYMFPVFKKGEKRRVENYRGITSLCSCSKVLELIMCDYLMFNCRNYITFEQ FT HGFVAGRSVTTNLMEFVTTCFNSMSEGLQVDAVYTDLKAAFDRIDHDIALM FT KFAKLGFSSRICSWLESYLKGRRLQVKIGSSLSVEFSNWSGVPQGSNLGPI FT GFALFYNDAGLSLHGNCKLIYADDLKLFTTIESLADCYVLQSHIDAFAEWC FT RLNRMTLSIEKCMVISFHRKTKRHVFTFDYTLLNQPIQRVDCVKDLGALLD FT FNLTFRQQQASVIEKANRQLGFMFRATREFDDPACLRTLYYALVRSHLETS FT CIIWAPYHQNWIDRIERIQRKFVWFAARNFNWRDPSNLPPYEHRCELLGLM FT TLENRRKLSKAMFAAKLLTGELDCPNLLEQLGAAVQARSLRTRSGFLHRTL FT STTEYMRHSPFRSMCSIFNDFFDLFEFGESTTSFRTKLTSRLSIRGRRDNA FT GENERPRRSTR" XX SQ Sequence 5074 BP; 1306 A; 1303 C; 1149 G; 1311 T; 5 other; caggagcccc cacacaagca ctgcgaacaa tacagctagc tggacgtcat cacttgttta 60 catccagcta cacactgccc aacttgaagt cgccctgagt ggggttgctc aaaatgggct 120 gcaaacagtg ccagaaaaag gtctcggact ctgagcgcat catatgccaa ggwttttgcg 180 gggccagctt ccatatgatt tgcgccaagg tggatatccc gttgaaagac gctctgggac 240 aacgtcctaa taatgcgttt tggatgtgcg atgactgcgc gaagttattc mtgaacggtc 300 acttccgtct cattgccaag ggacatgatg agggcaattc ggagattgct gatgctgtga 360 agacgatgca gaatgaaatc accaaattga cttccacggt catcgcattc acmgaaaagg 420 ttgctgaaaa ttcggcgtct acgaccacct ggcccagcaa gcgtcctagg gacagcgaaa 480 cgcccaccaa agtcagcatc ccctccgcct cacgcggtac gaaggcaatg accacagttc 540 ctgtcctcgc cagcgcagaa caggatgact tgtggtatat ctggctatcg agtttcccgc 600 cgagcgtcac ggaagacgat atccgcctga tggtgaagga gtgcctgtcc gttgatgacg 660 acgaccccat cgcggttaag atgctggtga catttctaca ctgacatcaa tcacgttcaa 720 ggtcggagtc agtcgggatt accgcgagtc atctctggac ccgagtaact ggccagaggg 780 cctggcattc cgggaattcg tcgacatgaa caatcgacct gcgcctagcg taactccaat 840 ggggttctca aggcgtcggc tagattagtg tgtccacctc gctgtcgcaa actcatcact 900 gaacctcatc acttgaccgc tgccgttcta ccatcatcgt ctcgaaacca tcctgctgct 960 gttactgctg acccgggacg cacgttatac agaattaggg atgacccccg atccaccggc 1020 acagtcgcgc catcgctctg cggtattcct gccaattgta cctgccatcc aagccgtccc 1080 ggccctgtgt ccggtcgcgg aaacggggtc atccaaactg agaactcagg caagtacacg 1140 tctcgttttg atttacttgc tcgccctgat gtcctcccaa gtttcagcca acgatctaca 1200 ctatctacaa cgggtcactc tagcggcttc agcgacaatc aggctgaccc gggacgctcg 1260 ttcaaaagcg acatggatgg ctcccaatcc accgacacag tcgcgccacc gctctgctgt 1320 aatcaacaac actgttcctg ccgtccaagt cgtcccggcc ctgtgttcgg taccggaaac 1380 ggagccatcc acactgggaa ctctggcaag tacctgtacc ttcctgatga cattgctcgc 1440 cctgatgtac tcccgaattt cagtccacgc catgcaacaa caacaacagc gattcgcgac 1500 attgatgctg aacatgctga cccgggacgc tcgacttatc gcgctatgga tgactcccga 1560 tccaccgaca cagtcgtgcc accgctctgc tgcaatcaac aacactgttc ctgccatcca 1620 agtcgtcccg gccctgtgtt cggcaccgga aacggagtca tccagaatct gttctctggc 1680 gagtattcac gttcagctga tattgatgct tgccctgatg tgtctcccag tttcagtcaa 1740 caactctcca tgacttcgca acacgggtgc tcgagcgacc caccggaagt tcatccgcca 1800 gtacaactaa ttccgccgaa ctctagctta aagccgctct ctgtatatta tcagaacgtg 1860 cgaggcctaa aaagcaaaat cgatgaattc tttcttgcct gcagcgaatt ggagtttgat 1920 ataataatac taattgagac ttggctggat gattcggtca cttttgtgca actgtttggg 1980 gacgaatatc atgtttacgc tgtaaatcgt ggcaagaaga acagtgtaag aatcgttgga 2040 ggtggtgtat tgattgctgt caggcggcta ctcgattctt ccgtatgcaa agaaggaatg 2100 gttgacatcc tcgagcaagt ttttgtcacg gtacgtttgg cacatatcaa ccttcacatt 2160 tgtgctttct acctgccgta cgagaaacga ttggagcata atctgctgga aaagcacata 2220 aattctgtga caacagtttg tgaacttgct cgatcwgaag acatcatcat tgcagccgga 2280 gactacaacc aaagttatct wgtttggcgt gatgatggca gaggctacgc gacggctgat 2340 ttagctgaaa ctcatgctcg tttacgtact gagcgtgctt cacatgaaac acttttggac 2400 ggtatggcca tttgtaacct acgccagatc cagacggttc gcaatgccaa caaccgaatt 2460 ctggacctta ttttcatatc agatgatgaa tcgctaccaa ctgtccattg tgttgaagat 2520 ccactggttc ctctggactc ccaccacccc gctgtttggt ttgaaatcga ggctgtgcag 2580 cgcgttgcat ttcaagacga ttttgaccca aattcgctca actaccggag aacagacttc 2640 gcaacgatgt gcaccaacct ggctgccgtg gactggtcgc ccgttcttac ctgcacgaat 2700 gttaatgatg ctgtcacaca gtttggtatt atcgttcgct tacatctgca acgtctgacg 2760 ccacgacgtc gtccaccacg taaaccacct tggtcggacg ctcaccttaa atggttaaag 2820 aagattcggt ctgcagcttt acgtcagtac tcccatgacc actgtaccat ttgtcggcgg 2880 aggtacaaaa aggcatgttc aacttacaag cgttacaaca aactccggta caggcagtac 2940 gtccacaaaa cacaatgcga tctcaagcga aacccgaaga aattctggtc atttgtaaaa 3000 actaaataca aagagaacgg tctgccctcg tcgatggtgt acggaagtcg tgtagcgtca 3060 aattcggaag aaaaatgtac cctgttcgct gagcacttcg cgtctgtttt taataagcca 3120 gatgtccaga acgacattga tcctgcttgc ctcgatgctg ttccgtccaa tctcgtaagc 3180 gtggacacgt ttgacatcac cagaagtatg ctgcgcaagg cgattgcaaa gctgaagtca 3240 tcattcgttg ctgggcctga tggtattcct gccgctgtaa tcaaacgttg cggaagtatg 3300 ctggaggtgc ctttgctcaa aatattcaac atgtctatga ctgaggcaac tttcccagaa 3360 gaatggaaga agtcgtacat gttcccagtt ttcaagaaag gagaaaagcg gcgtgttgaa 3420 aactatcgag gaataacttc actgtgctcg tgctcaaagg ttctagaact tatcatgtgt 3480 gattatctca tgttcaactg ccgtaactat ataacttttg agcaacatgg gttcgtcgcc 3540 ggcagatcgg ttacaacgaa cctgatggag tttgtcacca cttgtttcaa cagtatgtct 3600 gaggggttac aagttgatgc tgtttatact gatctcaagg ccgcctttga ccgcatagac 3660 catgatattg cactgatgaa atttgccaaa ctaggattct cgtcccgcat ttgcagctgg 3720 ctagaatcgt acctcaaggg aagacggttg caggtgaaga ttggaagcag tttatcagta 3780 gaattttcca attggtctgg cgttccccaa ggtagcaatc ttggaccgat cggctttgca 3840 ctcttctaca acgatgctgg gctatcactt cacggaaatt gcaagttaat ttacgcagat 3900 gacctcaagt tattcacaac catcgagagc cttgctgatt gttacgttct ccaatcgcac 3960 attgatgcct tcgctgaatg gtgtcgcctt aatcgcatga cattaagcat cgaaaagtgc 4020 atggtcatct cattccatcg caaaacgaaa cggcacgttt tcacattcga ctacacactc 4080 ctcaaccagc ccatacagcg agtcgattgc gtgaaggatt taggagcctt gctcgatttt 4140 aacttgacat ttcgccagca gcaagcatca gtcatcgaaa aggctaatcg ccaactcggc 4200 ttcatgttcc gagctacccg tgagttcgac gatcccgctt gccttcgtac tctgtattat 4260 gctctcgtac gatcccatct ggaaacctcc tgtattatct gggcacctta ccatcagaac 4320 tggatcgaca ggatcgaacg tattcagcga aagttcgtct ggtttgctgc tagaaacttt 4380 aactggcgtg acccttcaaa tcttcccccg tatgaacaca gatgcgagct gctggggttg 4440 atgacgctcg agaacagacg gaagctgtcg aaagccatgt ttgctgccaa gctgctcacg 4500 ggtgaactgg actgtcctaa ccttctcgaa caactaggcg ccgcagttca agctagatct 4560 cttcgtactc gcagcggatt tttgcatcgt acgctcagca ctaccgagta catgcgacac 4620 tcgccattcc gttcgatgtg ttcgatattc aacgatttct tcgacttgtt cgagtttgga 4680 gaatcaacta catcatttcg aactaaacta acatcgagac taagcatcag gggacgacgt 4740 gacaacgccg gggagaatga acgtcctcgt cgatctacca ggtgattacc gttccttaag 4800 ttatgctcta cattcagtat tctaccggac cttttttttg taggtggtga tgatcgtgca 4860 ttcgtcgttt agcgttttct gtttagtgtt tacatgctgt tttttttttt tttaattact 4920 tgaccatttt gtactatatt gtaaagcgaa caaacaacaa caaaacttca ttaagaccat 4980 gtggtcggat gattttaata aataaataaa taaataaata aataaataaa taaataaata 5040 aataaataaa taaataaata aataaataaa taaa 5074 // ID BEL-70_AA-I repbase; DNA; INV; 3893 BP. XX AC supercont1.276; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-70_AA_; KW BEL-70_AA-LTR; BEL-70_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3893 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.276; Positions 188683 192575. XX CC Positions [2914-3495] - Integrase core CC 'AATAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1504..2280,2284..3585) FT /product="BEL-70_AA-I_1p" FT /translation="MKTLGLTWNPRIDEFVFCQSISSDDTVVTKRQLFSEV FT AKMFDPLGLLAPVTVLAKRLMQQTWAAKLDWDESLTGDLLDDWLQLRKSLV FT NIRDIKIPRPVIGSNYVSLELHGFADASGIAYGACLYVRSVFTNDICIVRL FT LCSKSRIAPLQELTIPRKELCAAVLLSRLVKKVHATLQISFSSVHLWSDSQ FT IVLSWLQKAPTKLQPFVQNRVIEISKDSGHFNWLYVRSKDNPADVISRGQF FT PGSLKNNSLWWNGPSFLQKVYESSTVDLILDALTPEMKPASVTSMPVINCD FT DLPLFKKFGSLRKLQRVLAYVQRFLKNCWVKNPTDRVKDNRSALNTIVIVI FT QRESFSDEMDRIENREPSKRLKALGPFLHNGALRVGGRIQKSQMPFDAKHQ FT YILPKHPLTDLIIRAYHLEHLHMGPSGLLSALRQRFWLLGSRSAVRKITRN FT CVECFRVRPTGVAQYMGNLPQNRVTPSAPFEVSGVDYAGPFQIKQPGRKSA FT MVKAYLCVFVCLVTKSLHLELVSDMSTAVFIAALQRFISRRGIVRELHSDN FT GSNFRGAKAELHELYLMFRENAAINQIELFCQNKEIRWHFIPPNAPEFGGM FT WEASVKSTKYHLKRILKGSPLTFEELYTLLTKVEAILNSRPLFSHSDDPAE FT GEVITPAHFLIGRPMTAVPEPSYEGINQNRLSKWQHIQQMREYF" XX SQ Sequence 3893 BP; 1035 A; 845 C; 900 G; 1113 T; 0 other; tattgtggtc catcggaacc agattagtgt gtgaagaaaa aagttcagtc cagaagtgtg 60 aacaaaagta cagtccaaga gtgattgaat aacagtacag tccattgaac tcaaaagtga 120 ttgaacagtg tgaatgtttc aaaaactaga actgtggaaa gtgaagaact aaaaccaatt 180 tacgaaactg atgattgatg gatttcctgt gaacgtcaat atagtcggtg tcagttccac 240 tcgtagcaag tccaatcgtt tggtcgatgt gaatcttcgt tcacagtaca gtggctatga 300 aacgactctt aagtgcctgg tcaccgctaa gattacgaac cctttgcctt ctaagtcaat 360 tgatattgct cagtggaata ttccttccaa tatcaaactg gccgatccta acttccattt 420 cccagctacg gtagatttac ttattggcat gggtaatttc tttgagcttt tgagggttgg 480 acatgtcgtg cctgctgagg gccttcccga attacgtgag actgaattgg gatgggtcat 540 cgctggagaa atccgagaat gaactaccag ctctggcgaa cgtccaacag gtgaattgtg 600 tttctatcga gtcgcttaat gagacagtca aacgtttctg ggaaatcgag gagatagata 660 ctagtcctat ttccagagac gaggatgaat gtgaggagtt gtttcgtagg tcttgtcgac 720 gcgattcaac cggtagatac atagttcagc taccagtccg tgaaaatgtg cacgatcttt 780 ctgataatcg caacttggcc cttcgtcgtt ttttcttttt tgaaagacgg ttgttgaagg 840 atccagagct ccatctacag tactccaatt tcattgagga gtataaatcc ctaggccact 900 gcaaggaaat cactgaagcg aatgatgtcc ctgggaagtt gaagtactac atgccgcatc 960 atgccgttta caggccgtcg agctcaagca ctaaactgag ggtagtgttt gatgcatcgg 1020 cgaagccccc atcgggagta tccctcaatg aagtgttgaa aatcggtccc gtcgtgcaga 1080 gcgatttatt atcgattctt ttgagattcc gtaaacatcc tttccttttt gaagagagtt 1140 ttctggcgat ctaatccaac tgatccgatc aaggtgcttg aattggttac ggttacgtat 1200 ggtacttcta ctgctccctt tttggcgacc agatctctga ttcagctaag tattgatgaa 1260 ggggcagaat ttccccttgc ggctcgtata attcacgaag attgttatgt cgatgatgtg 1320 ttgtcgggtg cagaaaccat tcaagaagca attgaatgtc gtcaccaact tcaaacttta 1380 ttgtcgaagg gtggctttcc cgtgcataag tggtgcgcga acgatgaagc gattctacag 1440 gatgttcctg agtctgagcg ggaaaagcta gttctgcttg acgatctctc cgccaacgaa 1500 gtgatgaaga cactcggatt aacatggaat ccgagaatcg atgagtttgt gttttgtcag 1560 tctatttcat ccgatgatac tgttgtgacc aaacgtcaac ttttttccga agtcgcaaaa 1620 atgtttgatc cactgggact actcgcaccg gttaccgttc tagccaagcg gctgatgcag 1680 caaacttggg ctgctaaatt agattgggac gaatctctga ctggtgatct tttggatgat 1740 tggttgcagc tccgaaaatc tcttgtaaac atccgtgaca tcaaaatccc aagaccagtc 1800 atcggttcaa actatgtatc tctcgagtta catggatttg ccgacgcttc agggatagca 1860 tacggtgcat gcctgtatgt tcgtagtgtc ttcacaaatg atatttgcat cgtcagacta 1920 ttgtgtagca agtccagaat tgcaccactt caagagttga ctattccgcg aaaggaattg 1980 tgtgcagccg tgttgctgtc tcgactggta aagaaagtcc atgcaactct gcaaatctcg 2040 ttctcttctg ttcatctctg gtcggacagt cagatagtcc tttcgtggtt gcaaaaagca 2100 ccaacaaaat tgcaaccgtt tgttcaaaat cgtgttatag agatatcgaa ggatagtggc 2160 catttcaact ggctttacgt tcgttctaaa gataacccag cggatgtaat atcccgcggt 2220 caattccccg gtagtcttaa aaataactcg ttgtggtgga acgggccttc gtttctccaa 2280 taaaaggtgt acgaatcgtc aactgtagat ttaattcttg atgctttaac acctgaaatg 2340 aaacccgctt ccgtcacttc gatgccagtg ataaattgtg acgatcttcc cctcttcaag 2400 aaattcggat ctctgcgtaa gctgcaacgt gttctggcct atgtgcagcg ctttctcaaa 2460 aactgttggg tcaagaatcc taccgatcgt gtcaaggata atcgatctgc tttaaatacg 2520 attgtgatag ttatccaacg tgaatctttt tctgacgaaa tggatcgaat cgagaatagg 2580 gaaccgtcta aaagattgaa ggcacttgga ccgtttctgc ataatggagc cttgcgagta 2640 ggtggtcgca ttcaaaagtc tcaaatgccg tttgatgcca aacatcaata cattttgcca 2700 aaacatccac ttaccgattt gatcattcgt gcataccatc tggagcattt gcacatgggt 2760 ccttctggtc ttctgtctgc tcttcgtcaa cgcttctggc tgttaggttc tcgctcagcc 2820 gttcggaaaa ttaccaggaa ttgtgtcgag tgttttcggg tgagaccaac gggtgtagca 2880 cagtacatgg gcaacttgcc acaaaaccgc gttaccccat ctgcgccatt cgaggttagc 2940 ggggttgatt acgcggggcc cttccaaata aagcaaccag gtcggaagtc cgccatggtc 3000 aaggcatatt tatgcgtgtt tgtatgttta gtgaccaaat ctctccactt agaactcgtt 3060 tcggatatgt cgactgccgt ctttattgct gctttgcaga gatttattag tcggagagga 3120 atagttcgtg aacttcactc tgacaacggc agcaattttc gtggggccaa agctgagctc 3180 catgagttat atctgatgtt tcgtgagaat gccgcaatca accaaattga gttattttgt 3240 caaaataagg agattcgctg gcacttcatt ccgcccaacg ctcctgaatt tggcggaatg 3300 tgggaagctt ccgttaaaag cacgaaatat catttaaaac gcattctcaa gggaagtccg 3360 ttgaccttcg aagaattgta tactttgctc acaaaagtag aagcaatcct caactcgcga 3420 cctctttttt cgcattcaga tgatccggct gaaggtgagg ttatcactcc agcccatttt 3480 ctaatcggcc gtccgatgac agccgttccc gagccttctt acgaaggtat caatcaaaac 3540 cgtttgtcca aatggcagca tatccaacaa atgcgggagt atttttgaag atcatgggta 3600 cacgattacc tcgcgagtct tcaaccaaga gggaagaatt tcgtccgtct tccaaacgtt 3660 catcccggta cagttgtatt agtggaagat aaaacacttc ctcctcaaca atggaaactg 3720 ggtagaatta ccaaaatcta cccaggcgaa gataatcttg taagggtggt tgatgtcaag 3780 gtgggaaatg ctgtatttag acgcccgata accaaacttt ctattcttcc aatagaggac 3840 aatattccac atgggggggc tccggacagt accgactatc ggccggggga tga 3893 // ID L1-4_HM repbase; DNA; INV; 5723 BP. XX AC . XX DT 02-JAN-2009 (Rel. 14.02, Created) DT 02-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5723 RA Bao W. and Jurka J.; RT "L1-like retrotransposon from Hydra magnipapillata."; RL Repbase Reports 9(2), 428-428 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 108..1433 FT /product="L1-4_HM_1p" FT /translation="MASCVNRRETVFFKTNDDALLPLMDNQRTEEISVILL FT EKMAKASAGESIEQVEISYSSDEPSDDEDEDKPKNKKQPKNYAEAAKPETK FT KRNLPKTLICEIEKHFTLNELLKAIEDENLDEELEGAQFLRRNTAIEIVMK FT TNEAKIKLLENGLYINNYHHVFKISNNNRRPKTEGKRLNQTHVSVFGLPIE FT AKLFDIGKHFEEIGYGRHVYTKPVMRTTPGKGTPYYSGILVAVMEDMPKPM FT PTSLNILGYKIRTKHNGQEINNLIERRQPCDTINEXSSTIIHQQLTYERET FT SNSKPEEIPSNVTNEENINKITDTNKKTNNIIENEEKTNKEEEKKQPTKPK FT NETQHEKTKEEERKTLEKRRKEYYFDKPTTKEAEEQKIHTKDVEESKRAED FT TNEKSREKMDEDNTWQKKRSAPPPTPYKNDRKKRMGTSKESTTLENG*" FT CDS 1624..5487 FT /product="L1-4_HM_2p" FT /translation="MDILTIFSVNVAGLINKQKQDTVLNNLKKLNYDFYLL FT QDTHLNKLQNDELSKKWKGSVFTSAGKTRTGGTAIITKHVLKPYEKFDDEN FT GIFNYVLTKVSEQKILLLNVYAPSGHEFSQNRKKLFELIEDKIKHINLENV FT LVILAGDFNMVLNEIDRHPQIYRKKCASTEKFKKLINKLEVEDTWRLFNPN FT KQEFTYKATNNISYSRLDRIYLSKKARQNVEIQHEPMAHSDHYNACVASIT FT LDKIEIGKSLWIFNNKHLKNKNYLDQIEILIKNKTSQNLDSKKETWENLKS FT ELKEHAKKFCKQQAKNSRIEEYKLKKKFKNAIKKAHLNPRMQVLSADIKNK FT LKIFETNRAYGASIRAKIKWRTEGEKCSRAFFQLEKLKPAKQVTTKIKDKK FT NNIKSEKDEILKEFENYYKELYTSEKCDIKSQDILFSKTSPIQKIPKNLSK FT ELELPITITEIKNALKTMQNNKSPGSDGLTIEFYKQNIYLFGELLAEAINE FT TFTTGEMSESQKKAIIICLFKKGDKTDINNWRPISLLNTDYKIITKIIANR FT LKNVLPLIINENQTACVPRRSIYYNLSYTRDIIRICNKNQLDASIISIDQV FT KAFDRVDRNYLYDTLSFFGFNTVFINYIKTLYYDISAQIKINGFLSEKIEI FT NRGVRQGCPLSMILYIIQAEIFSGYIRANKNIKGITVNQKETKIQQYADDT FT NFYLIGDESIKELGKALELNERATGAKLNVTKCQGLWLGKNKTKNKENFLX FT FNWEENTLKSLGIYFSNRDSLINIKQWNESISIIQKKIKRWQMFNLSFKGK FT RIIINQILLSNLWHLAFTLPIPNERIIKXIEKIVENFLWSNSSIKTNQKVS FT KLPINQGGLSIIDIGEKLKSIQLTWVAKIFNHTLTGAWKDAATHIFNSYRK FT TNQGENIFITTHSSSAIDTLPVFYQQLIKNWSHIFNGESDIXNMSIEEILN FT QPXFYNNFIRDKNHQTLKPKKETTLNQISKISDLAKVFVPGFLDYQLTGLK FT QTTFKKIKTSIPRVWKTKIKKECQSYDHNKSNFTLRNFIHPLKPVLITDLT FT SKSIYDFLMKKKYSLGVTTLFTRWNSVFEKSMTGTSSKEWSLLFQNIFKRN FT SNNKANEIKYKAIHFALPTNSIFKRRGHAPDDLCPQCMKDRETLSHMIYSC FT EKVQPLIKYTIFLLNKIYPSSKPFKNTFKFYLFGFVENAPKFYIGNLLLDE FT LLYYVYXIRMRAYHERSMSARISLLKAYISKIKKILQIEHDIAKENNQLNS FT FLNETKEIRDINGNIKLENKLNQFLN*" XX SQ Sequence 5723 BP; 2406 A; 1018 C; 843 G; 1430 T; 26 other; tatacttacc tgactgggaa gtaaagtccg atcacgaagc ggctttttta gtatargtat 60 tttagtattt gggatatgat ttaattagaa gttgaattta yctttawatg gcgagttgtg 120 ttaatcgacg agaaacagtt tttttcaaaa ctaacgatga tgcactacta cccttaatgg 180 ataaccaacg aaccgaagaa atctcagtta ttttactcga aaaaatggcg aaagcttctg 240 ctggcgaatc aatcgaacaa gttgaaataa gttatagtag tgatgaacct tcggatgatg 300 aagatgaaga caaaccaaaa aataaaaaac aaccaaaaaa ctatgctgaa gcagcaaaac 360 cagaaacaaa aaaacgaaac ctaccaaaaa cgttgatctg cgaaattgaa aagcatttta 420 ctctaaatga actcctgaaa gcaattgaag acgaaaacct agacgaagaa cttgaaggcg 480 ctcaattttt acgaagaaac acagcaatcg aaatagtaat gaagaccaac gaagctaaaa 540 taaaactatt agaaaatgga ctatacatta acaattacca tcatgtcttc aaaatatcca 600 acaacaaccg aagrccaaaa acagaaggaa agcgtttaaa ccaaacacac gtatcggtat 660 ttggcctccc aatagaggca aaactttttg atattggaaa acactttgaa gaaataggat 720 acggtagaca tgtctacact aaaccagtaa tgagaacaac gccaggaaaa ggcacaccct 780 actattccgg catcctggtc gccgtcatgg aagacatgcc aaaaccaatg ccaacctcct 840 taaacatcct cggctataaa ataagaacta aacacaacgg ccaggaaata aacaacctta 900 tagaaagaag acaaccctgc gatacaatwa acgaaamctc gtctactata atccatcaac 960 aactaacata tgaaagagaa acaagcaact caaaacccga agaaattcca tctaacgtaa 1020 caaatgaaga aaacataaac aaaataaccg acacaaataa gaaaacaaac aacataattg 1080 aaaacgaaga aaaaacaaat aaagaagaag agaagaaaca accaacaaaa ccgaagaacg 1140 aaacccaaca cgaaaaaacc aaagaagaag aaagaaagac actcgaaaaa agaagaaaag 1200 aatattactt cgacaaacca acaaccaaag aagcagaaga acagaagatt cacacgaaag 1260 acgtagaaga aagcaagcga gcggaagata cgaacgaaaa atctcgagaa aaaatggacg 1320 aagataacac atggcaaaaa aaacgcagcg caccaccccc gacaccatac aaaaatgatc 1380 gaaaaaaaag aatgggaacg agcaaggaaa gtaccacgct ggaaaatggc taagttttta 1440 ctattatttt ttattttttc cttaattttt attaacggtt ttttwaaaaa aaatttamtt 1500 ttaaacrawc atcataacaa ttaaaccacc tgaagttggc gttctattta agaatgtgct 1560 tttttgcata ttcttaatca cggaactcca acgtagggtt ggaaaatgcc ttgacataga 1620 aaaatggata tattaacaat tttctcagtc aatgttgctg gtctaataaa taaacaaaaa 1680 caagatacag ttttaaataa tcttaagaaa ctaaactacg atttttattt actacaagac 1740 actcatttaa ataagttaca aaacgatgag ttatcaaaaa aatggaaagg cagcgttttc 1800 acctctgcgg ggaaaacccg cacaggtggg acagcaatta taacaaaaca cgtgctaaaa 1860 ccttacgaaa aattcgatga cgagaatgga atttttaatt atgtattaac taaagtaagc 1920 gaacaaaaaa ttttattact taatgtatac gctccctcgg gacatgagtt ttctcaaaac 1980 agaaaaaaac tgttcgaatt aattgaagac aaaattaaac atataaattt agaaaacgtt 2040 ttagtaattc tggcgggtga ctttaatatg gttttaaacg aaatcgatag gcacccacaa 2100 atttacagaa aaaaatgcgc ttcgacagaa aaatttaaaa agttaattaa caaactcgaa 2160 gttgaagaca cgtggcgtct ttttaaccca aataaacaag aatttactta taaagccaca 2220 aataatatat cttattctcg actcgataga atttacttaa gtaaaaaagc gaggcaaaat 2280 gtagaaatac aacacgaacc aatggcacac tctgaccact ataacgcgtg cgttgcctcg 2340 attaccctag acaaaataga aataggtaaa agtctgtgga tttttaataa taaacaccta 2400 aaaaataaaa actatttaga tcaaatagaa atacttatta aaaataaaac atcccaaaac 2460 ttagattcta aaaaggaaac gtgggaaaat cttaaaagcg aattaaaaga acacgcaaaa 2520 aaattttgca agcaacaagc taaaaacagc agaattgaag aatataaact aaaaaaaaaa 2580 tttaaaaacg caatcaaaaa agcgcaccta aacccgcgca tgcaagtttt aagcgctgac 2640 atcaaaaaca aactaaaaat ttttgaaaca aacagagcat atggcgcgtc gataagagca 2700 aaaattaaat ggcgtactga aggcgaaaaa tgcagtcgcg cgttttttca attagaaaaa 2760 ctaaaaccag ctaaacaagt aacaacaaag ataaaagata agaaaaataa catcaaatcg 2820 gagaaagatg aaatcctaaa agagttcgaa aattattaca aagaactcta cacaagcgag 2880 aagtgcgaca ttaaaagcca agacattctt ttttccaaaa catctcctat tcaaaaaata 2940 cccaaaaact taagcaaaga actagaactg ccaataacaa ttacagaaat taaaaatgct 3000 ctaaaaacaa tgcaaaataa caaatcaccc ggaagcgacg gcctcaccat cgaattctac 3060 aaacaaaata tttatttgtt tggagaatta ctggccgaag ctataaacga aacgtttacc 3120 accggtgaaa tgagtgaaag ccaaaaaaag gcgataatta tatgtctgtt taaaaaagga 3180 gataagacag atattaacaa ctggcgtcct atctcccttt taaatacaga ctataaaatc 3240 ataactaaaa tcatcgcgaa tagactaaaa aatgttctac ccttgattat taacgaaaac 3300 caaacggcat gcgtcccgcg tagatccata tattacaatt tatcctacac cagagatatt 3360 ataagaatat gcaacaaaaa ccaacttgac gcgtctatta tatccattga tcaggtcaag 3420 gcatttgata gagtagacag gaactattta tatgacacgt tatctttttt tggttttaat 3480 accgttttta taaattatat taaaacgcta tattacgaca taagcgcgca aattaaaata 3540 aacggatttt tatctgaaaa aattgaaata aacagaggag taaggcaggg ttgtcctctt 3600 tcgatgatac tatatatcat acaggccgaa attttctccg ggtacataag agcaaataaa 3660 aacataaaag gaataacagt aaaccaaaag gaaactaaaa tccagcaata cgctgacgat 3720 acaaacttct atttaatagg agatgagtcg ataaaagaac tcggaaaggc tttagagctt 3780 aacgagcgag ccactggagc caaattaaat gtaaccaaat gccaaggttt gtggcttgga 3840 aaaaataaaa caaaaaataa agaaaatttt ctaaamttta actgggaaga aaataccctt 3900 aaaagcttgg gtatmtactt ctccaataga gacagtttaa ttaacatcaa acaatggaac 3960 gaaagcattt ccattattca aaagaaaatt aaacgctggc aaatgtttaa tttatccttt 4020 aaaggaaaaa gaataataat aaatcaaata ctcttragta acctttggca tttagctttt 4080 acgctgccaa tacccaacga aagaataatt aaatwtattg araaaattgt tgaraatttt 4140 ctctggagca acagctcaat taaaacaaat caaaaagttt ctaaactccc tattaaccar 4200 ggaggtttaa gcattataga cataggcgaa aaactaaaat ccatccagtt aacttgggtg 4260 gcaaaaatat ttaatcacac tctaacaggc gcrtggaaag acgctgcgac gcatattttc 4320 aatagttata gaaaaactaa tcaaggagaa aatattttta tcacaacyca ctcaagcagc 4380 gcaattgata ccctccctgt gttctaccag caactaatta aaaactggag tcacattttt 4440 aacggcgaga gcgacataam aaatatgtcc attgaggaaa tcctcaacca gcccmttttt 4500 tataacaatt ttataagaga caaaaaccac cagactctaa aacctaaaaa agaaacaact 4560 ttgaaccaaa tatctaaaat atctgatctt gcaaaggttt ttgtacccgg attcttggat 4620 taccaattaa ccggtcttaa acaaacaaca tttaaaaaaa ttaaaacctc cattccacga 4680 gtttggaaaa caaaaatcaa aaaagaatgt caatcttatg accacaacaa atcaaatttc 4740 accttaagaa actttataca ccccctaaaa ccggttttga tcaccgattt aacatcaaaa 4800 tcaatttatg attttcttat gaaaaagaaa tacagcttgg gtgtaacaac cctatttacc 4860 agatggaact ccgtttttga aaagagtatg acgggaacct cctcaaagga atggtccctc 4920 ctctttcaaa acatatttaa aagaaatagt aacaacaaag caaacgaaat aaaatataaa 4980 gccattcact ttgcgctacc sacaaactcc atctttaaac gcagaggtca tgcacckgat 5040 gacctttgtc cccaatgtat gaaagataga gaaactctat ctcacatgat atatagttgc 5100 gaaaaagtgc aacctttaat taaatacaca atttttttgc taaataagat ttatcctagc 5160 agtaaacctt ttaaaaacac ttttaaattt tatctttttg gttttgttga aaacgcgcca 5220 aaattttata ttggtaattt attgcttgac gagcttttat attatgtcta twcaatacga 5280 atgagagctt atcatgagag aagtatgagc gcaagaataa gccttttaaa agcatatata 5340 tctaaaatta aaaaaatact tcagatcgaa cacgacattg caaaagaaaa caatcaacta 5400 aactcttttt taaatgaaac taaagaaata agagatataa acggaaacat aaaactagaa 5460 aataaactaa atcaattttt gaattaaatc aaaaaacctt atactttttg aaagacccwa 5520 actttttatt tttttatttt ttaattttcg ccttttcttt ttactttttt ctaacttttc 5580 taaactcgct tattcttgct gcgctcygtt tctcgacttg aaaagcatag aaaaaacttt 5640 gtaaatattt ctatgtaatt cttgtcaagt agtccacaag gcaattagct ggcggactac 5700 tgaaatataa acacaatacc atc 5723 // ID WUJIN repbase; DNA; INV; 185 BP. XX AC U88306; XX DT 21-AUG-1997 (Rel. 2.07, Created) DT 21-AUG-1997 (Rel. 2.07, Last updated, Version 1) XX DE Mosquito miniature inverted-repeat transposable element Wujin. XX KW DNA transposon; Transposable Element; Nonautonomous; MITE; KW nonautonomous DNA transposon; Wujin. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-185 RA Tu Z.; RT "Three novel families of miniature inverted-repeat transposable RT elements are associated with genes of the yellow fever mosquito, RT Aedes aegypti."; RL Proc. Natl. Acad. Sci. U.S.A 94(14), 7475-7480 (1997). XX RN [2] RP 1-185 RA Tu Z.; RT "WUJIN."; RL Direct Submission to Genbank (04-FEB-1997)Entomology, University RL of Arizona, Tucson, AZ 85721, USA. XX DR GenBank; U88306; Positions 1 185. XX CC 26 bp terminal inverted repeats; TA target site duplication. XX SQ Sequence 185 BP; 55 A; 36 C; 43 G; 51 T; 0 other; cagtgaaacc tccatgagtc gatattgaag ggaccatcga ctcatggaaa tatcgagtca 60 tggaacagca atcctttgga aagctgcttc tagggaccat catagtaacc atgaaatttt 120 gtttttagta tggttccatg agtcgatatc gagtcatgga acatcgactc atggaggtat 180 cactg 185 // ID Tx1-11_CQ repbase; DNA; INV; 1386 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-11_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1386 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 643-643 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >98% CC identity. CC 5'-truncated. XX FH Key Location/Qualifiers FT CDS 3..1277 FT /product="Tx1-11_CQ_1p" FT /note="reverse transcriptase." FT /translation="LYENIFGCLIANNFVKVVAYADDLNILVRNNHEFDTV FT LQLVSYFSIYSKIKLNFAKSQYLRFNGCLSGPHQIKEVESLNVLGVLICQN FT FNKLVEINYDSYLNKLKYTLSLHQKRILNLFQKAWLLNTYVLSKIWYICQI FT FPPNNKHLAAIKSICGQFIWQGFIFKVPRNELYLPVIKGGLALTDVESKAK FT SLFIKNILYSNADGNAPLDNFMLGQINNSTLTRNMREWIVKADELKLESHL FT NSSKKIYDFFIHSLNIKCKTMEDMPNLMWSNLLNNLDSNFLNSKCKTVLFC FT IFKDIVLNKKKLHLYGIRGTDSKFCENCGLVESVSHLVKYCQTSEEIWRWL FT KNILNKRLKILVKDPDELLSLSIGDRDFKYKAALWLTVQTIVYNLEKRGFG FT DLNEFKQGIRIVRFNNKAIFERHFKHFLNIF" XX SQ Sequence 1386 BP; 500 A; 186 C; 236 G; 464 T; 0 other; tgttatatga aaatattttt ggttgtctca ttgctaacaa ttttgtgaag gttgttgcat 60 atgcagatga tctgaatatt ttagtaagaa acaatcacga gtttgatact gtcttgcagt 120 tggtgagtta ttttagcata tactctaaaa taaaactcaa ttttgcaaaa tctcagtatc 180 ttcgttttaa cggttgctta tctggtccac atcaaattaa agaagtggag tctctaaatg 240 ttttaggagt tttaatttgc caaaatttca ataaattagt tgaaattaac tatgatagct 300 atttgaacaa attaaaatac acattaagtt tacatcagaa acgtatttta aatcttttcc 360 agaaagcgtg gttgttgaac acttacgttc tgtccaaaat ttggtatatt tgtcaaatat 420 ttcctccaaa taataaacat ttagcagcga taaaatcaat ctgcggacag tttatttggc 480 aaggatttat ctttaaggtt cctagaaatg aattatattt accggttatt aaaggtggtc 540 tcgctctgac agacgtagaa tcaaaagcaa agtctctttt tattaaaaac attctttata 600 gtaacgctga tggaaacgca cctttggaca acttcatgtt aggacaaatc aataatagta 660 ctttaactag aaatatgcga gaatggatcg taaaagctga tgaactcaaa cttgaaagcc 720 atttaaacag cagcaaaaaa atttatgatt tcttcattca cagtttaaac atcaaatgta 780 aaacaatgga ggacatgcct aatttaatgt ggtctaattt gttaaacaat ttagattcaa 840 attttttaaa ttcaaagtgt aaaaccgtac tgttttgcat ctttaaggac atagttttaa 900 acaaaaagaa attgcattta tacggaataa gagggacgga ctcgaagttt tgtgagaatt 960 gcggtttagt agaatcagtt agtcacttag taaaatattg tcaaacatcc gaagaaattt 1020 ggagatggtt aaaaaatatt ctgaacaaaa gactcaaaat tttagtaaag gatcctgatg 1080 agcttctatc attgagtatc ggtgataggg actttaaata caaagcagca ctttggctga 1140 ctgtgcaaac gatagtttat aatttagaaa aacgaggttt tggggatttg aatgaattta 1200 aacaaggcat aagaatcgtt cgtttcaata ataaagctat ttttgagcgc cattttaagc 1260 attttttgaa catcttctag aaacatccaa agttaaatta tctctaccga gcttatttgt 1320 cgtcttatta gcaagtggtg aaatgtaaat tcaagaaaaa tgttgtaaaa taaagtgaaa 1380 aaaaaa 1386 // ID ECORI_Hm repbase; DNA; INV; 334 BP. XX AC D38085; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Hemitaxonus minomensis, EcoRI family (pHME family) tandem repeat. XX KW ECORI_Hm; EcoRI family tandem repeat; pHME family. XX OS Hemitaxonus minomensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Tenthredinoidea; OC Tenthredinidae; Selandriinae; Hemitaxonus. XX RN [1] RA Sonoda S., Yamada T., Naito T. and Nakasuji F.; RT "Repetitive DNA sequence families in Hemitaxonus minomensis and RT H. athyrii (Hymenoptera; Tenthredinidae)."; RL Jpn. J. Genet 70(1), 7-16 (1995). XX DR Genbank; D38085; Positions 1 334. XX SQ Sequence 334 BP; 118 A; 52 C; 56 G; 108 T; 0 other; aattctgtat tttctcagaa tagactctga aaaatggaat cgcaataccg ctattttatg 60 caaatcaatc ttgtggattc gaggacgtta gtaaaataat gtgagagcaa taaaaaaaag 120 aaatcaatga tttcatacat tttcatttca tgcaattttt tttttcacga aaatattccg 180 aggaatgggt ttgtgacgac actttctcat gaagaatgat tccagtaatt cgaaaatatg 240 atcaaaatca ccgtacgacc aaaagaactg gagtattcga tgatttcatg tcttttcgta 300 tttcacgcaa aatatttcaa aaaatgggtc gctg 334 // ID R1C_NGi repbase; DNA; INV; 6405 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia giraulti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1C_NGi. XX OS Nasonia giraulti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6405 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 515..2368 FT /product="R1C_NGi_1p" FT /translation="GRTASSDRRSSQVEKSLPYGGPGELCAGRRRRDGVRC FT GCPGSAWGRGRTNQMDTIKTQLRKMTRSSKGVGCSREDFLKKDSKGGGGDS FT EDTRSSVYSADSLRERSRSRSPTGLKSTKDFELVESKAARKQRKAEEKRRR FT RENANASVNDRMNKSVSCSVQDVSKKVNVSRSENDVSKTGATESMEVGMEE FT CASMKESANVKRTVKRNAVGQDRFKGAKGELFVVCENLRQPLKRIKREVAD FT AEVPVDSTSVSVNESAEVSEVVVDELRAVTDGLRGHLLSDANKFTKWQANS FT VLDHASKFEGLVQRLMLENARLRGELSAHTGIKADLVNVRETVRRVDESMN FT VVKTRVAAASRAPPAAAGAAPGKGSGANVGPKPSFALVVRGANEQLTCDEV FT RRRMIESTSEDVNVRVRTIRPARGGGVVVETASDGERKALSRCAGLAEAGL FT RAAEPKVMDPRVIVYDVPNEMTNEHLLRGMYDKSLSEHVSVNEFTKRVKIV FT RRVDGQRLGNVIVELPLPWRDRLLQDGRVFVGWNSFKCCSYERVMRCFRCQ FT GYDHRAKECKSEPLCYRCGKSGHRINVCKAAEDCSNCRARRLPSEHSARSP FT ECPVYAWRLQLLRSRFVNNG" FT CDS 2361..5567 FT /product="R1C_NGi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TMAESQTNGTNVCVDVRILQVNCQKSYAAMCDIANYA FT LEDGIEICLFQEPYVYKDRVCGLPAGSRMYLSKSGEAAVVVFGKRYECMLL FT NEGAHEDAVCVWVKGPVGEILAVSLYCRPNGSMQGCVDYLDRVVSTRNGRR FT LLVGMDANAASDLWHSKSMVRAWQAVRRGAVLRDWVVHAEMDVLNVPTLAY FT TFSGARGESDIDVTLYKGSECQFEWMLKDDWGISDHNPIVITMSTGENVDV FT NGERMQKWNARKCNWLLYRGLIETFASDYGYDEYSVLGAEEKLTLLYKWMT FT EANEVCLEKVVTRPAPKRKSVVWWNESLSEKKRMVRERRRAYQRERARTGD FT PDRMKWREWKECEREYRRMMKDAKESDWHGTVERKGETDPWGVISAFCMGK FT LNPVSLAGLRTANGCTKTWMESARVLLDEFFPADDGIPAEEVYGVQMDMNE FT FCMGELDEAVLGMKMRKAPGLDGLTNEMLRQVWKAAPLFLKGLFDTCLSEG FT LFPHKWKEARVVVLLKGADKDVAESRSYRPISLLGSPGKVMERMLVARLMR FT HMEGKWNECQYGFMKGKCTEDAWARAKENVRAAESEYVLGIFVDFKGAFDN FT LLWRVALQKLREAGCTYEELRVWHSYFSDRSVCMYNGMDVVEKCARRGCPQ FT GSISGPPVWNLGMNDLLNELSGLGVEVVAYADDLLLLVQGNRRNELEQSAS FT EALSVVYRYGMNIGVEVSDSKTVCMMLKGSLNMLNRVVHVSTNETDDKKIR FT CVDRVKYLGVNVGVGMDFSVHIGGMRKRVTTVIMRLRRVLRKSWGLKRGVV FT SMMVNGLILPAVMYGASVWYEQLHKRKLRGSRRLSEELVSCQRVVLYACTR FT VCRTVSTEAMQILFGSLPWDIECIRQADLHKVRKGLPMNENDLVADEDLNE FT LSLHECRELVDQRALAAWQDRWEATSNGRVTYEWIRDVGFSGRSMKYFEPS FT LRVCYILTGHGSLNSFLFSRNLSNSPACACGAEREEWIHVLCECDMYAAFR FT DLDSIGVRRTEEGWDVSGVLCDRAKYECLCAFVERAFSMRESIVQRMRERE FT EENGENEELRLG" XX SQ Sequence 6405 BP; 1471 A; 1227 C; 2256 G; 1451 T; 0 other; ccccctcacg atcactggct tgtcgcgccg agggggcccg agttcaggag tttgcgaaag 60 cgaattcctg ttccgctaaa gtgatccaaa cccaggaagt ggtggatgat gtggtgcttc 120 ggctagccga ccccactctc aggagccggt ccgcgaaagc ggaacggaga ccctgagaag 180 tgtgggaacg cgaactggag cgcctgatcg gaagccaccg ggcgtgacga ccccggggtt 240 cctagtccca agcgatgagc gccgtcgctt ggcccgtaaa ccttgcaccg tcttcaacaa 300 ccttcgcagg gcgtggcgtt ggtacgttgg cggtgtgggg agcgggggct tggcttaacc 360 gcccggccgc gaggtggggt cctctataaa accccccaat cctacgctta ggtgggcggc 420 tacgggatgc atggctcctg gagaggagtc ccgagctcac caaccccggc tggctgtggc 480 ggcttgttcg ggttgcgttg ccttctcgag gtaagggcgc actgcttcga gtgaccgtcg 540 ttcgtcccaa gtagagaagt cccttcctta tggcgggccc ggtgagctat gcgctgggcg 600 ccgtcgtaga gacggtgtcc ggtgcgggtg ccccgggtct gcttggggga gggggcgtac 660 gaaccagatg gatacaataa aaacacagct tcggaaaatg acgcgtagta gtaagggggt 720 aggatgtagt agggaggact tcctgaagaa ggactccaag ggcgggggtg gagatagtga 780 ggacacgcgc tcctccgtat actctgctga cagcctgcgc gagcgttcac gctcccggag 840 tccaacgggc ctgaaatcca ctaaggattt cgagctcgtt gagtcaaagg cggctagaaa 900 gcagcgcaag gctgaggaga agaggaggag gcgtgagaat gcaaatgcga gtgtgaatga 960 tagaatgaat aagagtgtga gttgcagtgt gcaggatgta agcaagaaag tgaatgtgag 1020 tagaagtgaa aatgacgtga gcaagactgg agcgacggag agtatggaag ttggtatgga 1080 agagtgtgcc agtatgaaag agagtgcgaa tgttaagagg acggtaaaga ggaatgcggt 1140 gggtcaggat aggtttaagg gggcaaaggg cgagctcttt gtggtgtgtg agaacttgcg 1200 gcagccactt aagcgcatca agcgcgaggt ggctgacgcc gaggtgcctg tggatagtac 1260 gagtgtgagt gtgaatgaga gcgctgaggt ttcggaggta gtggttgacg agctgcgtgc 1320 tgtcactgac ggacttcgtg ggcatctgct gtcagatgcc aacaagttca cgaagtggca 1380 ggccaacagt gtgctcgacc atgcctcgaa attcgaaggg ctggtgcagc gtctgatgtt 1440 ggagaatgcg aggttgcgcg gggagctttc tgcccatacg ggtataaagg ctgacctggt 1500 gaatgtgcgt gagactgtcc gaagagtgga tgagagtatg aatgtcgtaa agacgagggt 1560 agcagcggcg tcacgagcac ctccggcagc ggccggagct gctcctggta agggttctgg 1620 tgcgaatgtg gggcctaagc ccagcttcgc gctcgttgta cgtggcgcaa acgagcagct 1680 cacgtgcgac gaggtgcgga gaagaatgat cgagagcacg agtgaagacg tgaacgtgag 1740 ggtgaggacc atcagacctg ctcgtggtgg tggggtcgta gtggagacgg ctagcgacgg 1800 tgagagaaag gctctctccc gttgtgccgg actcgccgag gcgggactcc gtgcggcgga 1860 gcccaaagtg atggaccctc gggtgattgt atacgatgtc ccgaatgaga tgacgaacga 1920 gcatctcctt aggggcatgt atgataaaag tttgagtgag catgttagtg tgaatgagtt 1980 tacaaagcgt gtgaaaattg tcaggagagt ggatgggcag cgactcggca atgtgattgt 2040 cgagttaccc ctgccatggc gtgacaggtt gttgcaagat ggtagagtgt ttgttggatg 2100 gaacagcttt aaatgctgtt cgtatgaaag agtgatgcgc tgtttccgtt gccagggcta 2160 cgaccaccga gccaaggagt gtaagagtga gcctctgtgc tatagatgtg gtaagagtgg 2220 acataggata aatgtgtgta aggctgcgga ggactgcagc aattgcagag cgagaaggct 2280 tccttcggag cattcggcga ggtcgccgga gtgcccggtg tacgcttgga gactgcagtt 2340 gttgcgttct cgttttgtga acaatggctg agtctcaaac gaatggaaca aatgtgtgtg 2400 tggatgtgcg gatcttgcag gtaaactgtc agaagtcata tgctgcgatg tgcgatattg 2460 cgaactacgc gctcgaggac ggcatagaga tatgcctatt ccaagagccg tatgtttata 2520 aagatagggt ttgcggttta ccagcgggtt ccagaatgta tctcagtaag tctggagaag 2580 cggctgtagt agtgtttggg aaacggtatg aatgtatgtt gttaaatgag ggagcgcatg 2640 aggatgctgt atgtgtctgg gtgaaagggc cggtggggga gatacttgct gtctctctct 2700 actgtagacc gaatggtagt atgcaaggat gtgttgacta ccttgatagg gtagtgagca 2760 ctaggaatgg acgtcggttg cttgtaggaa tggatgcgaa tgctgcgtcc gacctttggc 2820 acagtaagtc catggtacgg gcgtggcaag cggtgcgtcg gggtgctgtg ttgcgtgatt 2880 gggtagtgca tgcggaaatg gatgtcttaa atgtccctac cctggcttac accttcagtg 2940 gagccagggg ggagagtgac attgatgtca ctctctacaa gggtagtgag tgtcagtttg 3000 aatggatgct gaaggatgac tggggcatta gtgatcacaa tcctattgtg atcacgatgt 3060 ctacgggaga aaatgtagat gtaaatggcg agaggatgca aaagtggaat gcaaggaagt 3120 gcaattggct gctgtaccgg ggccttatcg agacctttgc cagcgactat gggtacgacg 3180 agtactctgt gttaggggca gaggaaaagt taacgctcct gtacaagtgg atgactgagg 3240 cgaatgaggt gtgcttggag aaggtcgtta cacgacctgc tcctaagcgc aagagtgttg 3300 tgtggtggaa tgaaagttta agtgagaaga agcgaatggt gcgtgaacgg cgaagagcgt 3360 atcagcgtga aagagcaaga acgggtgatc cggatcgtat gaaatggcga gaatggaagg 3420 agtgtgaaag agagtatagg cgaatgatga aggatgcaaa ggagagtgat tggcatggca 3480 cggtggagcg gaagggggaa actgacccat ggggtgtcat ctcggcattt tgcatgggga 3540 agttaaaccc tgtaagcctg gctgggctgc ggacggcaaa tgggtgcacg aaaacatgga 3600 tggagagtgc aagagttctt ctggacgaat tcttccccgc agacgatgga attcctgcgg 3660 aggaggtcta tggagtccag atggacatga atgagttctg tatgggtgag ttagacgagg 3720 cagtcttggg tatgaaaatg cgcaaggctc ctggattgga tgggttgacg aatgagatgt 3780 tgcgccaggt gtggaaagca gcccctttat ttcttaaggg gctgtttgac acgtgtctga 3840 gtgagggact ctttccacac aagtggaagg aggccagagt ggtcgttctc ctgaaaggag 3900 ccgacaagga tgtggccgag tctaggtcct acaggcctat cagcctgttg ggtagcccgg 3960 gcaaggtcat ggagcgaatg ttggttgcgc gcttgatgag gcacatggag ggcaaatgga 4020 atgagtgtca gtatggtttt atgaaaggga aatgtacgga ggatgcctgg gcgagagcga 4080 aagagaatgt aagggcggct gagagtgagt atgtccttgg aatctttgtg gatttcaagg 4140 gtgcgtttga caacttactg tggagagtag ctctacagaa gttgagagag gctggatgta 4200 cgtatgagga actgcgtgtg tggcactcct attttagtga taggagtgtc tgtatgtata 4260 atgggatgga tgtggttgag aaatgtgcgc gaagaggttg cccgcaggga tccatatcag 4320 gacctcctgt gtggaacctc ggaatgaatg acttgttgaa tgagttgtcc ggactggggg 4380 tggaggtcgt cgcgtatgct gatgacctcc tgctgctagt tcagggcaac aggaggaatg 4440 aactggagca gtcggcgtct gaggcactga gtgtggttta caggtacggt atgaatattg 4500 gtgtggaagt gtctgactct aagacagtgt gcatgatgtt gaaaggtagt ctgaatatgc 4560 tgaatcgtgt ggtgcatgtg tcaacgaatg agacggatga caagaagata aggtgtgtag 4620 accgtgtgaa gtacctgggt gtgaatgtgg gtgtcggtat ggatttttcg gtccatatcg 4680 gcggaatgag aaagagggtc actacggtga ttatgcgcct caggagagtc ctcagaaaga 4740 gctggggact caagcggggc gtagtgagta tgatggtgaa tggcctcatt ctgccggcgg 4800 ttatgtatgg agcgagtgtt tggtacgaac agctgcataa aaggaagttg cgtggatctc 4860 ggagactgag tgaggaactt gtcagctgcc agagagtggt gttgtatgcg tgcacgcgtg 4920 tgtgtagaac tgtctcaacg gaggcgatgc aaattttatt tgggtcgctt ccgtgggaca 4980 ttgagtgtat caggcaggcg gatttgcaca aggtgcgaaa aggcctgccc atgaatgaga 5040 atgacctggt ggctgacgag gacctgaatg aattgtcctt gcatgaatgc cgtgagttgg 5100 tggaccaacg tgctcttgca gcttggcagg accgttggga agccacgagt aacgggcgtg 5160 tgacgtatga atggatacgg gatgtgggat tctccggccg ctcgatgaaa tatttcgagc 5220 cgagcctgag ggtctgctac attctgacgg gccacgggag cttgaactcg tttctcttct 5280 cgagaaacct gagcaactcc ccggcctgcg cgtgtggagc tgagagagag gagtggatac 5340 atgtgctgtg tgaatgtgat atgtatgcgg ccttcaggga tcttgactcc attggggtca 5400 ggagaactga ggaaggatgg gacgtgagcg gagtgctttg tgaccgtgcg aagtatgagt 5460 gtctgtgtgc ctttgtcgag cgcgcattca gtatgcgtga gtcgattgtg cagagaatga 5520 gagagagaga ggaagagaac ggtgagaatg aggaacttag attagggtaa tgggtgaggg 5580 ggtgtggggg tgaggggggt agtgggtaga tataaggggt agagggaagg ggggtagggt 5640 aagggaacaa gtgggttggg ggtagggtaa ctgcgtgtgt gtgtgtgttc ggagtgtgtg 5700 tgttaaaagg agtgtgggaa tgaatgtgtg tgtgtcggct ggccagctac cgctggccgg 5760 ctccccgcag ggggattcca ctcttgctct tctaatcgag gctggcctgt gccggactcg 5820 ttggaggacc aagagaggca tctctgggtt ttgcgaaccc acggacccga gcagcccttc 5880 cagaggcggg atggtaagat cccaactgga accctcacca gggttaaaac ggtaccatgg 5940 gcgaccgggg tgcccgctgg gggagaattg cctccctcgc ccggtcactt gggtttggat 6000 tcgtggtggc agtggttgaa agcccacatc gcttggggtt aggggttggc actgggtgaa 6060 agactccttg ggtgctccgc accatcggag actggaaccc tttgccgacc tcgacgtgtg 6120 agttgcggtc tcaactcggg gagcggcctg cttaaaccgt tagggattgg atgggtcccg 6180 gccccaaccg agggtctcca aaggtcttac caacctgcgg aggaatcggt agtcgcggtt 6240 tagtagaggg cccaattggc tggcaatgtt tcggcattgc cgtctaattg gtctcaaagc 6300 tattccgcga tgcgttggcc gagcgtatct cggcccctcg ccccgtgggg ggccgtgtgg 6360 gtaggccgaa aggcaggtac tgcacgttaa aacaaagaga cgatc 6405 // ID Mariner-8_SM repbase; DNA; INV; 678 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-8_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-678 RA Jurka J.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 152-152 (2008). XX DR [1] (Consensus) XX CC This is a relatively ancient family with incomplete ORF. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 152..544 FT /product="Mariner-8_SM_1p" FT /translation="MVVIPGGLTPYLQAGDIGIFKELKDKISGFINTWKTS FT EEIVYTRGGNPKAPNKNLVNQWVNSAWGCVSIQNIKNSIRSAGFAENFNDW FT HISKHDIYGDRFKXAYEASIRPNLVPVQDDFEFETDVFDVIDE" XX SQ Sequence 678 BP; 236 A; 90 C; 123 G; 227 T; 2 other; tcctaatagt attaccatta aaggtcatac taataaatag tatgactatt ctttatttga 60 tgtcagtcca ggcaagtgta ttgtgtggga ttcttgccgt gcwcatattt cggcaaatgt 120 aaaagagcat tgcagaagaa gaaatattaa aatggtggta ataccgggtg gattaacacc 180 ctatttgcaa gctggagata ttggaatttt taaagaatta aaggataaaa tttctggctt 240 cattaacact tggaaaacct cggaagaaat tgtatatact agaggcggaa atcctaaagc 300 tcccaataaa aatcttgtca atcaatgggt caatagtgct tggggttgcg tgtctattca 360 aaacattaag aattcaattc gatctgctgg gtttgcagaa aattttaatg attggcatat 420 atccaagcat gatatttatg gcgataggtt caaaratgca tacgaagctt ccatacgacc 480 aaatttagtt ccggtacaag atgattttga atttgaaacc gatgtttttg atgttattga 540 tgaataaata aatttttata tttaactgat tgaatatttc atttatttta atccaaatag 600 tatgaccaaa ttaaatttaa aactaagaaa acattttttg ggtcatacta ttaaatagta 660 tgagtcatac tattagga 678 // ID Gypsy-1_RP-LTR repbase; DNA; INV; 695 BP. XX AC ACPB02032162; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_RP_; KW Gypsy-1_RP-I; Gypsy-1_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-695 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02032162; Positions 546 1240. XX SQ Sequence 695 BP; 206 A; 95 C; 161 G; 233 T; 0 other; tgtaaaggga gagcatttaa tttgtaaata ataattgaat aattagtagg aaaaaatgca 60 attgttaagt taaactgctg acggcacggc tggtagcggc cgagagcaat gtaccgcgtg 120 agagagagag agagaatagc gagcgcgtgg tgtatagtgt atagacaccg gcagagttag 180 ttagagtgtg tgagaggaaa caagtgttat gcgtttgatg ggattgtttt gctgtgagta 240 ttgaacataa atttttacgc attgcttccg gccgaggcta actgttgccg tagtgaggga 300 ggagtacatt tgctagagtg tagattaaca aaagataacc gatagtatta ataataataa 360 taataataat aataataata gtaataataa tggaatgttt gagaataagg gcccaacttt 420 gttgatttaa tttataattt gcctatttaa tcttaatttt atgttatctt atagtttctt 480 cagtttcgtt tatatttcta ttgttaatta tgtttttgga agtattagtt tggttttgta 540 ttaatgagct cgactctttg tatgcgatta tctaattggt gtattgggac caacattaat 600 cagtgtattc atagccctga cgagttattt gtacccccct gctgagcgag cgaaattccc 660 cttgccctga tcgcccttaa cggcgctcgg tcaca 695 // ID DNAX-1_AP repbase; DNA; INV; 796 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A putative family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-1_AP. XX NM DNAX-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-796 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1778-1778 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 2 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 796 BP; 279 A; 95 C; 110 G; 312 T; 0 other; gacgtttcgt ccccggcaat tattgagata ggacgtttcg tcccggagag ttattagcca 60 taggacgttt ggtccccggc atactttgat aatctaaatg catttatttt taatgaacaa 120 ttctgtaatg tctaatatat agtgttgtaa agtatttgag taaaattatt caaatatttt 180 attaaatact tttttttatt cttttaattc ttaatttgtt tctacttata attattcttg 240 ttaaataaaa ataccatgcc gtcgtggttt tactttaaca ttataacaat gtaatattat 300 ataattattt actactttat tttaagagta tttgagtaat taaaagtttt acaacactgc 360 taatacagta tggctatagg taagtaataa agatctgaaa ttctaaattt aattaatata 420 aattaaaatg taccgaaaat agttttaagg aaaagaaatt gcaatatatt gcacgggatt 480 tcaatataat gatgataatg acattttata ctatttaaaa aacataagtc agttgcctaa 540 aaataaatat ataatttatc ttaatattat agtacctatt taattttaat ttattaattt 600 ttactacatt ttgtgttgac ttaatacaat tgtgattatg tatttgtata gtttgtagta 660 ttgtcattta aagattaaat tgttcattaa aaattaatgc atttagatta tcaaagtatg 720 ccggggacaa acgtcctatg gaataactct ccgggacgaa acgtcctatc tcaataattg 780 ccggggacga aacgtc 796 // ID DNAX-5_AP repbase; DNA; INV; 155 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-5_AP. XX NM DNAX-5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-155 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2058-2058 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 155 BP; 39 A; 23 C; 41 G; 52 T; 0 other; tcccgttaca acgaactcca agggactgaa ttttttttcg ttataacgga aatttcgttg 60 taacgaaggt agaaggtagg taaggtcggt aggggacttc tatgactttc gttataacgg 120 gatttttcgt tgtatcggta ttcgttgtaa cggga 155 // ID Gypsy-209_AA-LTR repbase; DNA; INV; 217 BP. XX AC AAGE02026967; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-209_AA_; KW Gypsy-209_AA-I; Gypsy-209_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026967; Positions 1858 2074. XX SQ Sequence 217 BP; 61 A; 51 C; 36 G; 69 T; 0 other; tgttatacct tcattagcaa acctttaatt acattcacct tgaaatccaa atatgtgctt 60 acctttataa ttacaatact ttggattgaa ctgtcataag taaatagacg agcggagagt 120 acaaacgtgt tttaactcgt ccgaaataga aattccgcgt tccacgttct tgctgttcac 180 cgtcgtctcg gccttgggat tccgccccgt tacaaca 217 // ID Copia-11_DPu-I repbase; DNA; INV; 4438 BP. XX AC scaffold_233; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_DPu_; KW Copia-11_DPu-LTR; Copia-11_DPu-I. XX NM Copia-11_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4438 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 685-685 (2010). XX DR Genome; scaffold_233; Positions 16480 12043. XX CC Positions [1651-2181] - Integrase core CC 'ATCAT' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1054..2244 FT /product="Copia-11_DPu-I_1p" FT /translation="MISICCLSAIEHDTIVLDSGATRHMSGERSFFVELSD FT ITPGSWPINGIGGKILHANGIGNIKLTTCVNDVTTNGELKNVLYVPGLGVT FT LISIACLSISGYSVSFSGLNATVGRDSSIIMTASRSGETLYKVNAVVVTHS FT ESAMAASTCQTTVNVWHKRLGHVNGRTIQRMAAGTGVLGMLITPGATTLNE FT CCHGCELGKMHKLPFTKSDTVYHNVGDCIVSDLVGPMQVDSVGGARFYVLF FT KDVYSKYKTVYFLKHKSETADCFLDYVKTVFTATGRRVHRLRCDGGTEYIN FT NYLKNELATLGIQLQTSATYTPEQNGIAERDHRSTVESARSQMHAKEIPLK FT LWAEAINYSVYVLNRTISQSETVTPYQRWFGKAPDISNLRVFGSVAYFFFV FT EHT" FT CDS 2597..4267 FT /product="Copia-11_DPu-I_2p" FT /translation="MGGACAVHQSPDVSMAAFISMAFKAISLFYEPKTFTE FT AMSGTEGDLWHKAADHEMEAHIKNHTWTLVPLPAGRTCIPSGWDFKLKTDK FT LGLPCRRKARFFAKGYRQVKGVDYQESFASVVRYDSLRVIIAIAAARDLEL FT IQLDVTTAFLNGLIDEVGFIAQPEGYIVPGRESEVCRLNKGIYGICQASRI FT WNKTLHEALINYGLVQSTADPCVYYRITLTSFLIIAVWVDDGLVAGSSQLL FT LDTLVSYLNQKFEITAVPADLFVGIVLTRNRAKRQISLSIPQFIDKILTKF FT QLSTAPAISLPVLKGSPRLSTMASPSSPADVLSMAGLPFREVVGCLMYAAL FT TVRFDIAFMAGQLAQHCQNPGLEHWKAAVRVLRYLKGTRIHGICFGGKDSN FT HHVLVGYSDADYAGDPDTRRSTSGYVFILNGGAVTWSSRRQQIVSLSTMQS FT EYIAASEATREAVWLRRLLDNLGTTQILPTLLRCDNESAIGLAYNPLAHKG FT AKHIEVRYHYIREQVADETIVVAYVETKKQFADMLTKAVDGETFQFCIKGC FT GLDAVPEIGI" XX SQ Sequence 4438 BP; 1182 A; 988 C; 1034 G; 1234 T; 0 other; ggttatgggc ccagttgacg attcttttat tattaaccac tctctagaat ggccgacaac 60 ttctcattag aagtgattaa gcacacaaga aagtttgatg gtaaagactt taccacttgg 120 aaacacaaca tggagatgat gttctatctg aagaacttga aacccattgt tgaggtatta 180 cacccaatcg tttatacatt ggaagagaat tgattattct aatgtctgtt ttgttctttg 240 tcaatatttc tagggtgagc ttcaacttcc acccgaagaa tttgaaaaaa atgtagaaaa 300 tcctatgctg gtcaatggtg acgagatagt tgaatggaaa atgagagatt gctatgcaag 360 attcctcatc tttaatagtt gcgacgaagt cagaaaactc gcactactta atagcagaac 420 tagccacgaa atgtggacac gattggagac ccaatattta cagcgtgcag ctgataacaa 480 gcatctatta catcgagatt ttctcaatct acggcctaca tcgggtcaag acatcatgat 540 tcacatcacg gcattagaat cgatggcgac agaactcaac gatcttggtg tacacatcac 600 tgaacatgat ttgattacca ctatcatgtg tagtcttcct gcaagatttg gatttctctc 660 gtcgtcatgg gacaatgtac ctaacaacga aagaacaatg gacgctctac gagctagaat 720 tgtttctgag caaagacgaa ttgaagtcag acgacttgaa gaagaagcaa aagccgttct 780 ccagctacaa ccgtcgacaa caatgctctc caagcatcga cgagtcaaag aggattcggc 840 aggtaccgtg gtggtcggac tcgtggtagt ggtacaccac gagatacaat caatagagac 900 aacgcaaaat gtacctattg tggaaaatca agacaatacg aatttgaaag ccgcctgcgc 960 attgctactc aaggagagaa gcagcccaat cagcctaaaa gacccaataa cgatgaagat 1020 ggaactggct ctcaacgacg agtggatttc tccatgattt ccatctgctg tctgtctgct 1080 atcgaacacg ataccattgt gctagattca ggtgccacga gacacatgag tggagaacgt 1140 tcattttttg tggaactgag cgacatcaca ccgggtagct ggccaatcaa tgggattggt 1200 ggtaaaatcc ttcatgcaaa tggaatcgga aacatcaaat tgactacctg tgtgaatgat 1260 gtaaccacca atggagaact caagaatgtg ctttacgtcc caggattggg ggtgactctg 1320 atctcgattg cttgcctttc catcagtggg tactctgtgt ctttctcagg actcaatgcc 1380 actgttggta gagatagctc aatcatcatg actgcgtcca gatcaggtga aaccctctac 1440 aaggtgaatg ctgttgttgt cacacattct gaatcagcta tggctgcctc cacctgtcaa 1500 actacagtca atgtatggca taaacgcctt ggccatgtta atggaagaac aattcaacgg 1560 atggctgctg ggactggagt attgggtatg cttattacac caggagctac caccctaaat 1620 gaatgctgtc atggctgtga attgggcaaa atgcacaagc ttccattcac caagagcgac 1680 actgtctacc acaacgtcgg ggattgtatt gtatcagatc ttgttggccc tatgcaagtc 1740 gattcggttg gaggagcacg tttttatgtt ctttttaagg acgtctacag caagtataag 1800 actgtgtatt ttctcaaaca caagtctgag accgccgact gttttcttga ttacgtgaag 1860 actgttttca ccgcaactgg ccggcgagtt catcgactac gctgtgatgg tgggacagaa 1920 tacatcaaca attatctgaa gaatgaattg gcaactttgg ggatacaact gcaaacaagc 1980 gctacgtata cgcccgagca aaatggtata gctgaacggg accatcgatc taccgtagaa 2040 tctgcgagga gccagatgca tgctaaagaa atccccctga aattgtgggc ggaggctatc 2100 aattactctg tgtatgtttt gaaccgtaca atatcgcagt ctgaaactgt tactccgtac 2160 cagcggtggt ttggaaaagc acccgatatt tcgaatcttc gagtttttgg atcggtggcc 2220 tatttcttct ttgtagaaca cacctgatgt tctacgtcaa aaactcgacc ctaaggcaac 2280 aaaaggcgcc tatgtcggtg aaagcgagga acaaaaggcg agtcgcatat ttgttgaagc 2340 tactggacgg actcatatat cacggcatgt caaagtctat gaaaatttgc cgtattggtc 2400 tgtcactccg ttgggaaatg aaatccaaac tacgacgcct gatgcacccg tgtccacccc 2460 cagcacgtct gaggatgctg ttagtcctgt atccaacgat acaccggtca tcacgcagca 2520 acttgatcgt cctgcagctg tacccgttcg gaaatctctt cgtggattgg tccccaagaa 2580 actgtttgca atcgagatgg gaggtgcctg cgctgtgcat caatcgcctg atgtgtccat 2640 ggcagctttc atctccatgg cgtttaaggc catttctttg ttctacgaac cgaaaacgtt 2700 tacagaggcc atgagtggaa ctgagggtga tttatggcac aaggctgctg atcatgaaat 2760 ggaggctcac atcaagaatc atacttggac cctcgtacct ctccctgctg gtcgtacatg 2820 tattcctagt ggatgggatt ttaaactaaa aacggacaaa cttggattgc cgtgtcggcg 2880 caaggcgcgt ttttttgcta aaggctaccg ccaggtgaaa ggagttgact atcaggaatc 2940 gtttgcatcg gttgtgcgat atgattcact tcgtgtcatc atcgccattg ctgctgcgcg 3000 tgatctggaa ctcattcaac ttgatgttac cactgccttc ctcaacggac tcattgatga 3060 agttggtttc atagcccaac ctgaaggcta tattgttcct gggcgtgaat ctgaagtttg 3120 ccgattgaat aaaggcattt atggaatttg ccaggcgtcg cgcatctgga acaaaactct 3180 ccacgaggcc ctcatcaact atggtctcgt gcagagcacc gctgaccctt gtgtctatta 3240 ccgcatcaca ctcaccagtt ttctcatcat cgcagtttgg gtcgacgatg gactagtggc 3300 tggcagttct caactgctcc ttgatacttt ggtctcctat ctcaaccaga agtttgaaat 3360 caccgcggta ccggctgatc tatttgttgg cattgtacta actcgtaatc gcgccaagcg 3420 ccaaatttcc ctctccatcc cccaatttat cgacaagatt ctgactaaat ttcagctctc 3480 tactgcgcca gcaatatcgc ttccggtgct gaagggctca ccacgcttgt caacgatggc 3540 ctcaccttct agtcctgcgg atgtcctgtc catggctggt cttcctttcc gcgaagtcgt 3600 ggggtgtctt atgtatgcgg cccttactgt acgttttgat attgccttca tggctggaca 3660 attggctcaa cattgtcaaa acccgggatt agaacactgg aaagcagctg tacgggtgtt 3720 gaggtatctg aagggaacac gcattcatgg gatctgcttc ggtggaaaag attccaacca 3780 ccatgtcctc gtcggctatt cagacgctga ttacgctggc gaccctgata ctagacgttc 3840 cacttcgggt tacgttttca ttttgaatgg tggtgcggtt acgtggtcga gtagaagaca 3900 acaaattgtc tcgttgtcca caatgcaatc agaatatatt gcggctagtg aggctaccag 3960 agaggctgta tggctacgtc gcttgctgga caatttgggt acgacccaaa ttctacccac 4020 actactccgt tgtgataatg agagtgcgat tggacttgcc tataatcctt tagctcacaa 4080 gggtgctaag cacattgagg tccggtatca ttatattaga gaacaggttg cggacgaaac 4140 aattgtggtg gcgtatgttg agactaaaaa gcaatttgcg gatatgttga ctaaagctgt 4200 agacggagag acatttcagt tttgcattaa agggtgcggt ctcgatgccg tccctgagat 4260 cggtatttaa aggaatgttg ggtctctgag agttctggta tcctacaaat cagctgaatt 4320 ttgtttattg tcttttcttc tactttgatt taaacttgga gtcatcttta caattttcat 4380 tcgaattata ttatgtattc tttgtgtctg tatttgttgt gtctggaatg aggaggtg 4438 // ID PrTip repbase; DNA; INV; 3483 BP. XX AC DQ138289; XX DT 18-AUG-2005 (Rel. 10.08, Created) DT 18-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT-type DNA transposon from Philodina roseola. XX KW hAT; DNA transposon; Transposable Element; PrTip. XX OS Philodina roseola OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Philodinida; OC Philodinidae; Philodina. XX RN [1] RP 1-3483 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR Genbank; DQ138289; Positions 2889 6371. XX FH Key Location/Qualifiers FT CDS 445..1194 FT /product="PrTip_1p" FT /translation="MKRHQQDKNHLSRQVATLESLWAKKQKTNELSNNASI FT TPEQYSTSTAEAETQSSSMILEECSTILVETQTDSTLLLSNQQHFTASSVI FT TLCDISASGDEPPAQPILSVYPINEEKRCFQSQWYQNRPWLEYSIKNNSAY FT CYYCRHFGGSGMSKRIQSDAFISGFNAWRRALERDRGFDKHAKSMFHVQAM FT SSYEEYKRRLQTSSSVMNLLEKSRIEQINQNRAKLIKICSTVLLCARQMIA FT LRGHEEGLE" FT CDS 2449..3177 FT /product="PrTip_2p" FT /translation="MRNEATSSELFNQVVDFAKKHKINLDKPTRSRRKTSI FT PTRFKDSIIITTTVGQRDRGNEESFISNESKFRDELFYSLIDSILIELNDR FT FGPENVWLLSSISAAHPNNEHFLDIEHLKPLASHLSINNIQLANELGVVKH FT FLRDKVESIKTIKDTLIILAPLNEAFPATTALLRGALCFPVSSTTCERSFS FT RMKNIKTYCRNTMGDKRLSDLTLLAFERNFCIDLQETVDIFSVNHKNSRIL FT LR" XX SQ Sequence 3483 BP; 1053 A; 654 C; 684 G; 1092 T; 0 other; tagtggcgta accaccgggg gggggggggg ggctaggagg ggctgagcac cccccagaac 60 tggctgagcc cctcctagat tagttttttg aatataatcg tatatcgttt atcttcgctg 120 ttttttttaa agaaatcgac acagattcct ttttttctct ccattaaatg aagaacatct 180 catcagagaa gaagattgcc ttctttctat tctcttgtgc tttaatcttt gctcactcta 240 cacaatatgt ttgttcgatt gttgattagc tatcgtcatt cgagaaacaa tcaagctgct 300 attggttgaa tgaccaaaac agaacgtcca tcatcgacat gattcatgac tattatatga 360 acgagacaga cgattcaaaa tgtctctttt gtgcatttct tccatactct cattgttcca 420 tattgaaaca tcccaagcag gtgaatgaaa cgacaccaac aggacaagaa tcacttgtct 480 cgacaagttg ctacacttga aagtctatgg gccaaaaagc aaaaaacgaa tgaattgtcc 540 aataatgcgt cgattactcc agaacaatac tccacgagta cagccgaagc agaaacacaa 600 tcgtcatcga tgatcttaga agaatgttct acaatcttgg ttgaaacaca aacagattct 660 acgttattat tgtcaaatca acagcatttt actgccagca gcgttattac attatgtgat 720 attagtgcct ccggtgacga acctccagct caaccaattc tatctgtcta tccgatcaat 780 gaagagaagc gatgcttcca atcacaatgg tatcaaaatc gtccatggct cgaatattcc 840 atcaagaata attcagcata ttgttattat tgtcgtcatt ttggtgggtc aggtatgtct 900 aaacgaattc aaagtgatgc ttttatttca ggtttcaatg catggcgaag agcgctcgaa 960 agagatcgag gctttgataa gcatgcgaaa agtatgtttc atgttcaagc aatgagcagc 1020 tacgaagagt acaagagacg tcttcaaacc agttcaagtg tcatgaacct tctcgaaaag 1080 tcaaggatcg aacaaatcaa tcaaaaccga gccaaattaa tcaagatctg ctcgactgtt 1140 ttgttgtgtg ctcgacaaat gattgcccta cgtggacacg aagaaggttt ggagtgagta 1200 ttatagcgag tttgttcaat gtagatgcat gtaacgtttt attcttcatt tatagatcca 1260 caaatcgtgg aaattttgta gaattgttgc agtgggcgtc ctcgaccgat ccattagtcg 1320 attcaatcat aaacgattcg agttgcaatg ccacatattt atcaccaaca attcaaaacg 1380 aaattttgag tattttggcc aatcaaatac ggagcgcgat tgctgaagaa gtaagtagac 1440 taaaaacgaa cctgtaacaa aaccaatgga attttttact tctcagatga agaaccaacc 1500 attcgtgctt atggcagatg aagctcgtga tatcagtggc aaagaacaat tgtcaatcgt 1560 cctgcgatac gttgatgata aaaatgaaat ccaagaacgt tttatgggtt tcacaaagct 1620 agaccaattc gatgcgaata gtttagcaga gaagttgttt gaatttcttg agaaatggaa 1680 tgttccagct catcattgca ttgcacaatg ttacgatggg tatgtatgct acacctgtaa 1740 tgcaacttgt tataaaactt gtttttgtga ttaactttag tgccagtgtg atgagtggta 1800 agaatgctgg tgtacaaact ctaatgcgac agaagtacat gcccaaaggg atttatattc 1860 attgttttgc gcatcgcctg aatttggtca tcggtgatgt gtgtaaggta gtgtcttaca 1920 tcgatgaatt catgtctatt ttatcgaaaa tacatgaata ttttacttgt tctgctgtca 1980 ccaatgaata ttttcatcga gctcaacgat tgctggaact gggtgagtat tttaaaaaaa 2040 ttctctggac aattccagat ttgacttcat tctgatttct agatgtttcg tcgagtctca 2100 aactctgggc tccgacgaga tgggacagca tatggttttc catcgatgcc gtcaaaaata 2160 actatacggc tgttgtggct tcattgatcg atttggttga acaaggtgga catcgagctg 2220 tggacgctcg aggactgcta ttgtcgatcg aagaaccttt gtttttagtc tcgatgttca 2280 ctcttcatac gctgcttggt ccagtgaaaa ttttaagtga tcaattgaaa tgtaaagttt 2340 actgttggtt tcattgaaat tatttctctc gttcgtattt tagcttcttc tttggattat 2400 gttagtgcca gagcattgat caaatcagtg atcaatcaaa ttcagggcat gagaaatgaa 2460 gccacttcta gcgaattgtt caatcaagtc gtggattttg ccaaaaaaca taagattaac 2520 ctcgacaaac ctacccgatc tcgtcgaaaa acatcgatac caactagatt caaagattca 2580 ataatcatta caacaactgt tggtcaacgg gatcgaggta atgaagaatc atttatttcg 2640 aatgaaagta agtttcgtga tgaattattc tattcgttaa ttgattccat tctcattgaa 2700 ctgaatgatc gttttggtcc cgagaatgtt tggttattgt caagtatttc agctgctcat 2760 ccaaataatg agcatttctt ggatatagaa catcttaaac cactggcatc tcacttgtcg 2820 atcaataaca tacaattagc caatgaactc ggtgttgtta aacatttctt acgtgacaag 2880 gtggagtcga tcaaaactat caaagataca ttgattatat tggcaccttt gaacgaagca 2940 tttccagcaa caactgcatt attaagaggt gcgttatgtt ttcctgtttc atcgactaca 3000 tgtgagcgaa gcttttctcg tatgaaaaat atcaaaacat attgtcgaaa tacaatgggc 3060 gacaaacggc tcagcgacct tacacttttg gcttttgaac gaaatttttg cattgatctg 3120 caagaaacag tggatatatt tagcgtcaat cacaagaata gtcgaatttt attaagatga 3180 aagacgcata tgagttattt cttctttttt cccaagttct tctttattca tttttcgcat 3240 ttttcttttt atgtcgatgg tgtttatttg aattctagtt gtcgttaact taagtgcaat 3300 tgtaccgcat aatgatgttc atttttatat cgaaataaac gagtgaatga atatcaacca 3360 aagagaggcc gagccatgtt tacaaacaaa aagaaattca ttgtattttt cgttagctca 3420 atggagctgg tggggcttga accccccccc cccccctagg atgaaaagct ggttacgccg 3480 cta 3483 // ID hATx-24_SM repbase; DNA; INV; 2349 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-24_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2349 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1859-1859 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 240..2255 FT /product="hATx-24_SM_1p" FT /translation="MYSSNVQSTDKSKTSSELHNIWHFFQHDKLNQSAKCN FT KCMKVLKCYGKSTSSLHSHLKTAHNTNALKRTTSDTKCMEVESSVCSSLRE FT TVQITNFFRSQTDDSLAAIVSRMTAKDGIPLSMFCTSSDLRKLLIAKGFQK FT IPKSPNSIRQMIFGYSKKIRDEVVNEISCYKQKGGRLSLSFDEWTSTRNRR FT YLNINAHSNSGGRFWNLGLARVVGSLPAGRCVDLLLQKLSIFNISMEKDVV FT CITTDGAAVMKKVGKLVNASQQLCFAHGIQLAVLSVLYRNDTSTEVELLAD FT LPENEEHSSDEDSADADLADAGLGAGNQNYNFDVGIENDHNIVSILHHRLM FT PLIAKVRKVIRLFRLSPTKNDDILQKYVKTEFGKELMLILDSKTRWNSLLT FT MLERFLKLKSAIQKALIDIKSSYQFSADEFNLLSETVATLTPIKYALEAIC FT RQDANLLTADATLKFLICTLGKINCVLANDMKNALEFRIKERRTDLSGLLQ FT YLHSGYANSSSSSTDLFPALKKITIKRLLLNMVQRLFISESDNENDELRQQ FT HSDENVSMEVSIVEETENDADTIKQQMQRAIDQHLAVPDKRNTFSSTSSII FT EKTLKRELAQFEDDGNRSKWLDLCYNTLKSIPPTSVESERAFSVSGNFCTK FT LRSRLSDETIDNLCFLRAHFNKI" XX SQ Sequence 2349 BP; 810 A; 393 C; 425 G; 721 T; 0 other; ccaattaatc ggttttagca tcataaaaac cgattcttta ttaatgaaaa ttttcttata 60 aaatttttaa aaacaaacgg tcataatgtt attttttgtt gctctccata caaaatttat 120 agcgaaaata aatatactta tatagatcta taatattaat taatcgcttt aattttttta 180 atcgaatcga atcgaaatat ttattatttt aaataaaaaa acttttggcc tggttaataa 240 tgtattcatc aaacgttcag agtaccgata aaagcaaaac gagttctgag ttacataata 300 tatggcattt cttccagcat gacaaactta atcagtcggc aaaatgtaat aaatgcatga 360 aggtgctaaa gtgttatggt aaatcaacaa gttctttgca ctcacattta aaaactgcac 420 acaataccaa cgcattaaaa agaacgacat cggatacaaa atgcatggaa gttgaatctt 480 cagtttgctc atcattacgc gaaaccgtgc agataacaaa tttttttcga agtcaaactg 540 atgattcttt agcagcaatt gtgtcaagaa tgactgcaaa ggacgggatt ccactatcca 600 tgttttgtac ttcgtccgat ttgagaaaac ttctaattgc aaaaggattc cagaaaattc 660 caaaatctcc taactcaata agacaaatga tttttggtta ctcaaaaaaa atacgagatg 720 aagttgttaa tgaaatatcg tgctataaac agaaaggtgg aagattgagt ctctcattcg 780 atgagtggac atcaactcga aatcgcagat atttaaatat aaatgctcac tcaaattctg 840 gaggacgatt ttggaatttg ggtctcgccc gtgttgtagg aagcttgcct gctggaaggt 900 gtgtagacct tcttttgcaa aaattaagca ttttcaatat ttcgatggaa aaagatgttg 960 tttgcataac cacagacgga gccgctgtca tgaaaaaggt tggaaaacta gttaatgctt 1020 ctcagcagct ttgttttgcg catgggatac agttagcagt cttaagtgtt ttgtaccgta 1080 atgacacatc tactgaagtg gagttgcttg cagatttgcc ggaaaacgaa gaacattcaa 1140 gtgatgaaga ttcagctgat gcagatttag ctgatgcagg tttaggtgct ggcaaccaga 1200 actacaattt cgatgtaggt attgaaaatg atcataatat tgtttcaatt ttgcaccacc 1260 gtttgatgcc actgatagca aaagtgcgaa aagttattag attatttcgg ctttctccta 1320 ctaaaaacga tgacatcttg caaaagtacg tgaaaaccga atttggaaaa gagctcatgt 1380 tgattttgga ctcaaaaaca cggtggaaca gcttactaac tatgttggag cggttcttaa 1440 aattgaagtc agctattcaa aaagctttga tcgatattaa atcaagctat cagttttcgg 1500 cagatgaatt caacctgctg tctgaaactg ttgctacgct tactccaatt aaatatgccc 1560 tcgaagctat ctgccgtcaa gacgctaatc tattaactgc agatgcaacg ttgaaattcc 1620 taatttgcac actgggaaaa ataaactgcg ttcttgcgaa cgacatgaag aatgcattag 1680 aattccgaat taaggaaaga agaaccgatt tatcaggttt gttgcagtat ttgcacagtg 1740 gttatgctaa tagttcatct tcctccactg acctattccc tgctttgaag aaaatcacga 1800 taaagagact cttattgaat atggtgcaac gactatttat atcagaatca gataatgaaa 1860 atgacgaact acgacagcag catagcgacg aaaatgttag tatggaagtt tcgatagtag 1920 aagaaaccga aaacgacgcc gatacgatta agcaacaaat gcaaagagca attgatcagc 1980 atttggctgt acctgacaaa cggaacacat tttcttccac atcttcaata atcgaaaaaa 2040 ctttaaaacg tgagcttgct caatttgaag atgatggaaa tcgcagtaaa tggttggatc 2100 tttgttataa caccttaaaa tcaattcctc ctaccagtgt ggaatcggag agggcgtttt 2160 ccgtatccgg aaacttttgt acaaaactga gatcacgatt aagcgatgaa acgattgata 2220 atttatgttt tctacgagct cattttaata aaatttaaaa acatattttt ccagaatttg 2280 tctaataaaa actttaatat tgttttgtta atgaattata tttttttata ttaaaaaccg 2340 attaatcgg 2349 // ID BEL5b_Cis_LTR repbase; DNA; INV; 393 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL5b_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-393 RA Smit A.F.; RT "BEL5b_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000121; 85% similar to BEL_Cis5_LTR. On average 2-3% divergence CC of copies. XX SQ Sequence 393 BP; 96 A; 81 C; 96 G; 118 T; 2 other; tgttacggtc attttagagc ccaatcccaa gtacctaaat tcccgatttc tgtgtgccca 60 atgaagcgtc acggcgtcgc tgccgatttt aaaggtcata gtaatttcgg ccttttctgt 120 tcttactacg tggtagaagg ttgttattga ttgagctgtt ctggatcgca cgaaagcaaa 180 cccagaaatt gtgtctntgc tgctatgttt gtatgtgtct agaaattctg cccattttgt 240 ctgtgttagc gcaattaaaa gacatcacag aagggtggtt tgttncgaac atagaggata 300 gagcaatcac tggtcaagtg cgtggagcag ctcaaattcg ccaacacagg gttggtaacg 360 tagggtaggg cttcgcccct gttgaatcct aca 393 // ID TCKIN2 repbase; DNA; INV; 122 BP. XX AC M18815; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.cruzi kinetoplast minicircle DNA, clone pTc-21. XX KW Minicircle; TCKIN2; repeats. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-122 RA Degrave M.W., Fragoso P.S., Britto C., van Heuverswyn H., RA Kidane Z.G., Cardoso A.M., Mueller U.R., Simpson L. et al.; RT "Peculiar sequence organization of kinetoplast DNA minicircles RT from Trypanosoma cruzi."; RL Mol. Biochem. Parasitol 27(1), 63-70 (1988). XX DR GenBank; M18815; Positions 315 436. XX SQ Sequence 122 BP; 23 A; 19 C; 38 G; 42 T; 0 other; tggtttttgg gaggggcgtt caacttttgg ggcggaaatt catgcatctc ccccgtacat 60 tattttggcc aaaatcctaa tttttcgggg aggtggggtt cgattggggt tggtgtaata 120 ta 122 // ID Gypsy-11_OD-LTR repbase; DNA; INV; 222 BP. XX AC CABV01000403; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_OD_; KW Gypsy-11_OD-I; Gypsy-11_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000403; Positions 1244 1465. XX SQ Sequence 222 BP; 56 A; 62 C; 37 G; 67 T; 0 other; tgtgggatta gatatctgta cttattcaca gttctttatt acgaaacacc cgctcgctca 60 tttccttaaa aggaaacttc acctcgcagc tgccagttgc acctgcaacg tttctgtccg 120 catctccggc aagtagccgg cattctcgtt ttaccaacaa taaagtcact atcgttctgc 180 aaaagctaac gcctttttat ttgcagtatc gcagatccca ca 222 // ID Copia-9_SI-LTR repbase; DNA; INV; 298 BP. XX AC AEAQ01030799; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_SI_; KW Copia-9_SI-I; Copia-9_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-298 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030799; Positions 440 143. XX SQ Sequence 298 BP; 83 A; 47 C; 63 G; 105 T; 0 other; tgattaacta tctttatatt gttttttgag tctattcatg taggttgagt cagttcacca 60 ggaagtctgt taatgtctaa ctatgttttt ggaatgggtt cagtagatgg cggtgtagga 120 tggagtgtcg gagtgtactg aaaatttttg taaccacgag ggggtcgttc attcgtccat 180 gaaacttgta acggaaaaca cgcgatatta cttcatgttc acaaatctca atacatatat 240 gttttatatc agcaatagaa ggaagaattt tattaatcat tccctgcctt cgatccca 298 // ID Ginger1-11_HM repbase; DNA; INV; 4645 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4645 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 109-bp long. Tpase gene contains one intron: 2085-2157. XX FH Key Location/Qualifiers FT CDS join(1031..2084,2158..3161) FT /product="Ginger1-11_HM_1p" FT /translation="MADKNSSEIDDSRSSKKIIKEPNFYDIENYLLSGIFP FT EHLQGRENIGIKTNFKKQSDRFVIVNGMLHYKYRKIMKVTSGPETLCVVKD FT QMSRNQYKKAAHDGLGTSTESKAIGGHVGRDKTIWKLIDSGYWWPNMNKDV FT RNYVATCESCQKSNSKMKKVSPELHPIPVPTKIWHQVGVDLCSLPKNPEGY FT VGICVVVDYFSKWIEAKPIYNKSAEEVSRFLYELICRHGCASIQINDQGRE FT FCNKVSENLLNLTGTCQRITSAYHPQANGLVERANRTIQGSMLKVLNGEQE FT KWPHSLDGILFAFRTTRHKSTGVTPFQVMYAREPVLPLHCINNDQSLIHDV FT PVDLDIEEGILEQADLLENLKGIKNIQSIIHDEVEKNIKKSQLRQKKDYEK FT RHKNQIQFNIGDEVFVYNLRRADRKGDKGKSVWDGPYEVIETFIDKGLYQL FT KQKNGETLHTKHHGSNMKLSKKRSEVEILQKSCDEVSDIKSESLEIVEVVI FT SKTNSFIPTNSKWRTKKCSLLKISLTNKTPFKKGFAVRKKKLQLGKPISVD FT KIEGDGNCLFRALSQEVCLSQEFYKILRQLAIDTLLKAEYRNAFDAYISKP FT VLKYISDSKMETDQIWGTDVEIYALATALDTPIAVYYKPPEASVFSWFFYK FT PLLQNTNGFKLMEIKENDQIVYLQNTGVHYDRVLAVS" XX SQ Sequence 4645 BP; 1704 A; 663 C; 757 G; 1518 T; 3 other; tgtaggaggt atgaagtact taaccggtac ggatcactgt acagtaattc ctacctgacg 60 gtaccgatca cttgacaagt gattcctacc taggtgccga acactgtact tgggtaccga 120 acacttaaca ttttaatttt gccgtgtaaa tgttttaaaa agaggtaaga agcatggaaa 180 ataatacaaa aaacgtttgt tcactatagt aaacatactg ctatagcagt atagtattag 240 tttaaacaaa atttcaaaat caaacgttta aaatttaata tttatctttt cttttttgta 300 tgtagtaact tttttagatc aaattaagat tattattatt attattatta atattattat 360 tattagcaag attattatat agtaaattat tatataacag ttgttaacaa ggccaaaagt 420 agcaaggcca ataccacaag ttacaagacc aaagccaaaa gttacaaggc caagagtttt 480 aaggtcaagg ccaagattta gagtttcaag gcaaagacgg acgcaaagat ttttaagact 540 taagacttca atttttgcct tacgttaagg ccaaaactgt aaatagcaag tgctgtgaaa 600 ataaaaccac attttgtaaa aatagaccaa gtttatgttc agatttcaac aaaatcaata 660 agaaaatgtg ttgttttttt ttaaggccta aacaatcaag gccaaggcca aaatttttaa 720 ggccaaggcc aacagtttca aggctaaggg ccttggcctt gaaaatggaa attttaaggc 780 caggccaaga ccaaggacta acaactttgt tatataataa ccgctttgaa ataattcaat 840 agattatcat ttcaattaat taacttattt aactaatatt gtttacatgc attttattat 900 aaatttatga taattatatt ttatgctata ggtgctatgg gcatatgatg aatgcactaa 960 tagcacataa tataagccta tagcactgct ataaacactt tttttattat tttagatatt 1020 ttagtataaa atggcagaca aaaattcatc agagatagac gactcaagat catctaaaaa 1080 aataattaaa gaaccgaact tttatgatat tgagaattat ttattatcag gaatttttcc 1140 tgaacattta caaggaagag aaaacatagg tattaaaaca aactttaaaa aacaaagtga 1200 caggtttgta attgttaatg gtatgctaca ttacaaatat agaaaaatca tgaaagttac 1260 ttctggtcca gagactttat gtgttgtcaa agatcaaatg tctcgtaatc agtacaaaaa 1320 agctgcacac gatgggttag gcacatcaac tgaatcaaag gctataggtg gacatgttgg 1380 gagagataaa acaatatgga aattgattga ttcaggatat tggtggccga atatgaataa 1440 agatgtccga aattatgttg ccacttgtga atcttgtcaa aaatcaaatt caaaaatgaa 1500 aaaagtttct ccagagttgc atccaatacc agttccaaca aaaatatggc atcaagttgg 1560 tgttgacctc tgtagtcttc ctaaaaatcc agaaggttat gttggaattt gtgtagttgt 1620 tgattacttt tcaaaatgga ttgaagcaaa gcctatttat aataaaagtg ctgaagaggt 1680 atccagattt ctttatgaac ttatatgtcg acatggttgt gcctcaattc aaattaatga 1740 tcaaggtcga gaattttgta ataaagtttc tgaaaatttg ctaaatctta ctggaacctg 1800 tcaacgtatt acttcggctt accatcctca agcaaatggt ctggttgaaa gagctaatag 1860 aactatacag ggttctatgc ttaaagtgct gaacggagag caagaaaaat ggcctcattc 1920 attagatggc atactgtttg catttcgaac aactcgccac aagtcaaccg gtgtcactcc 1980 ctttcaagtt atgtatgcaa gggagccagt tttaccctta cattgcataa ataatgatca 2040 aagcttaatt catgatgtac ctgttgattt agacatagaa gaaggtattt attttaataa 2100 acgaatggaa taaatttatt aaattttatt ttatacattt agttgttttt ttttaaggta 2160 ttttggaaca ggctgatttg ctcgaaaact taaaaggaat taaaaacatt caaagtataa 2220 ttcatgatga agtagaaaaa aatataaaga aatctcaatt gcgacaaaag aaggactatg 2280 agaaacgtca caaaaaccaa atccaattca atataggcga tgaagttttt gtttataatc 2340 ttagaagagc tgatagaaaa ggtgataaag ggaaatctgt ttgggatgga ccgtacgaag 2400 ttattgaaac atttattgat aaaggacttt atcagctaaa acaaaaaaat ggtgaaacac 2460 ttcataccaa gcatcatggt tcaaacatga aactttcaaa gaaaagatct gaagttgaaa 2520 tacttcaaaa atcatgtgat gaggtgtctg atattaaatc tgaatcttta gaaattgttg 2580 aagttgttat tagcaaaaca aactctttta tacctactaa tagtaaatgg agaacaaaga 2640 aatgctccct gttgaagatt tctttaacta ataaaacacc ttttaaaaaa ggttttgctg 2700 tacgaaaaaa aaagttacag ttgggaaaac cgattagtgt tgataaaatt gaaggtgatg 2760 gaaattgctt atttcgagct ttaagtcagg aagtttgtct aagtcaagag ttttataaaa 2820 tactgcgtca actagcaatt gatactcttt taaaagcgga atatagaaat gcttttgatg 2880 cttatatatc aaaaccagtt ttaaaataca ttagtgattc aaaaatggag actgaccaaa 2940 tatggggcac tgatgtcgaa atatatgcat tagcaactgc cttggataca ccaattgctg 3000 tttactataa gcctcctgaa gcttctgtat tttcgtggtt tttttacaaa cctcttcttc 3060 aaaatacaaa tgggtttaaa ctgatggaaa taaaagaaaa tgaccaaata gtttatctgc 3120 aaaatactgg tgtacattat gatagagttt tagctgtcag ttgatttttg ttcacattaa 3180 gttgtgtttt tattgacttt ttaaaaagtt ttatatattt atatttctta ttgactttat 3240 tttaagattg ttttgactat tcttgtaaat ccaaaatgtt atgcaattta tcattttaaa 3300 actaatcatg gacttgtttt ataactctcc cttattaagt tttgatttaa gttttagcat 3360 acaagaaatt gaatttattg atactatgct aatttaaaat actattgtaa atttattctt 3420 ttctgtaaaa acttttaatg aaagcttctt aatttttaag tagtttaggt ttgcttttta 3480 aacaacaaaa ttaatttgat tgaggaatac aattgaagtt tcttgtcttg cagatggcct 3540 cttttctttt ttaatcgcac gcgttttttt ttttaaatgc caacacagat tttaaaatac 3600 caatgagtta attataaaaa attaaaatta taaaatttta gaagacagcc aacctcaact 3660 gagatttgta gtcttctaaa atcaatgctt tttttaccct tgacaggtac agttgtccag 3720 gaactaaaaa agggtcacta gcgtgtcaca acggaatttt tactgaatga aaaaattaaa 3780 atgtagatga actttgagcg ttaattaaaa tgttttatat taggaatttt acccacattt 3840 gggtcctcat aatagtattt ttatgaaaay aatattactc taatcaatta ataaagcata 3900 gaaaacaatc aaagaagtga gtggccaaca aaacttacta caccagggtc cacgaagggc 3960 taaaagcagc ctygacaaaa atatatttta tttgtaaaac ttttaccaac attgtggtta 4020 caacagcaat aagacttatc gaattgattt aactactcgg gtcaggtcac cctgcttctg 4080 aaatacatgt aatccaatac aacttgcttt atatagagca agtacgactc cttagttgta 4140 gtgaaatcaa ctaactaagg agaagaaaaa tctgaaataa aacccctaat ctatatgtca 4200 ataagggtag tagaacctcg taaaccttga aagataacct tacgagtgtt agtgaaaata 4260 agaatactca gtcataaact tcaaaacagg tcaaatttaa gaaggagatc atgttgtaaa 4320 aactcttmtg taatgttgtt taattagaaa aaagaagttt aaaaaattta acctaaatcg 4380 tattataaaa ttaacttaat ccaattattt taaaagttat taaaaaaaaa aaaaggttta 4440 aactaaaaaa agtaaatact ctctttaaaa acgttttctt tgccgttagt acatgttacc 4500 tgagttaagt gtttggtgcc tagccggtag gaattactgt acagtgcttt gcacctaggc 4560 agcaagcact gtacagtgtt tgttaccttc cggtacggct tactgtacag tatacgctac 4620 cgttaagtca atcctgccta ctaca 4645 // ID Gypsy-163_AA-LTR repbase; DNA; INV; 186 BP. XX AC AAGE02017816; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-163_AA_; KW Gypsy-163_AA-I; Gypsy-163_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017816; Positions 19651 19466. XX SQ Sequence 186 BP; 52 A; 27 C; 34 G; 73 T; 0 other; tgtaataagt tgtaataagt ttcagaaaat ctggcaacac ttgtattaag gcttttcgtg 60 ttgtatttag aatgttctgt tgtatttgca tagtcagttc taatctagcc atcaggttga 120 tcgggtcgaa taaaattgtt ctcgttaaat cacgttgttt tcctaataag tccgatactt 180 attaca 186 // ID DNA8-35_AP repbase; DNA; INV; 237 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-35_AP. XX NM DNA8-35_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-237 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1777-1777 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 237 BP; 78 A; 43 C; 38 G; 78 T; 0 other; catagatatt aaagaatgga taggccattt acgggaacga acatgaacat ttgttgaggt 60 tatagaggtc ttatcacaaa tagatatatg tataattatt gtatataatt tatgataaga 120 cctctgtcag aatgagaagc aatgataaaa aacagtcttt ctctcttccc catcccggcc 180 ccatgaccct ctattttgtt aatatagcca tggcctatcc attctttaat atctatg 237 // ID Gypsy-14_RP-LTR repbase; DNA; INV; 208 BP. XX AC ACPB02011988; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_RP_; KW Gypsy-14_RP-I; Gypsy-14_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-208 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02011988; Positions 2071 1864. XX SQ Sequence 208 BP; 69 A; 45 C; 26 G; 68 T; 0 other; tgttcatacc gataaccacc tgcatagata agatgctcgc aaatatgata tatacccctg 60 gaatcaagag atgacagaca cacccagact atacgttata cgtgacacat tctactttta 120 attactttaa attgttgtac ttattcccca gtgttccttt caaaataaag ttctttttta 180 aaaaacggac tttcagtctt cttaaaca 208 // ID RTE_Ele2 repbase; DNA; INV; 3406 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An RTE clade non-LTR retrotransposon family from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE_Ele2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3406 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3406 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >97% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 334..3351 FT /product="RTE_Ele2_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="GSIPNFSPTTKNGERRNNERYFRQPTWQRNKDYEWKL FT GTWNVRTLNEPGRASLLARELQKVGVGVAAIQEVRWPRSGEREFRAVDPIA FT NTTFKYNIYHSGGDKAEHGVGFIVIGKQMKRVMKWKPISERICVLRIRGKF FT FNYSLINIYAPTNDKPDDVKDAFYEGLDKAYGECPKHDVKIVIGDANAQVG FT REDFFRPIIGKESLHSVTNDNGLRLVTFAAARGMAICSTYFARKDIRKHTW FT RHPNGDICNQIDHILVDGRHFSDVIDVRSFRGPNIDSDHYLVVSKIRARLS FT TVANSRPQRTMRFNIQRFSADGVTAEYHQKLDERISEVNVSENLNNLWDAI FT HGVVSETAREVIGTARRRPRSGWFDEECQRVTDEKNVARRRMLVSGTRLNR FT ERYREARSAEKRIHRRKKKEFDENVIAEAQNCMEQNDMRRFYETVNGVRRK FT SAPSPVMCNDREGNLLTDKTMVAARWKEHFETLLNGGNDRASENRMNIDND FT GQAVEPPSLDEVKKAIKELKNNKAAGKDELPAELLKHGSEQLHRIIHHILL FT RIWEEEELPASWMDGLICPLFKKGHRLECANYRGITLLNSAYKIMSRILFN FT RLRPLEESFVGEYQAGFREGRSTSDQMFTLRQILDKFREYNLQTHHLFIDF FT KAAYDSVKRNELWEIMMEHGFPTKLIRLIRATLEGSKSSIRVADEISTSFV FT TLDGLKQGDALSNLLFNIALEGAIRRACVHRNGTIITRSYMLLGFADDIDI FT IGIDRRAVEEAYVPFKRETTRIGLIINTTKTKYMVAGRHRGPIGDVGSEVV FT IGGDIFEVVEEFVYLGTLVTCDNDVTREVKRRIAAASRAFYGLRNQLKSRS FT LQTKTKLALYKTLILPVALYGHESWTLKEVDRRAFGVFERKVLRTILGGKL FT DNGIWRRRMNHELYQVYNEMDIVKRIKHGRLRWAGHVARMSEERQAKQIFS FT REPGRGRRLRGRPRTRWLFAVEEDLRSLNVQGDWKRLAQDRVQWRRLLHSA FT " XX SQ Sequence 3406 BP; 971 A; 727 C; 952 G; 755 T; 1 other; tgggttgtaa aatggtgagt cgacaaccag gaaggagcgt ccaacatagc tctggtcctc 60 gcaagtccct acctcacgct tcctcgggtc taacgatgac aaagaccgcc agctaagggt 120 tgcgtactta gctggtagtg caacctgggc actgttgtcc ttctgacatc agctagagtg 180 agggggtgcg tcccgagcgt ctgttcacca ggaggtgcgg ctcaaacagc gtctgttctg 240 gtatccagcg gctgagtatg aaatgttgta tcacgtcaac taacctaagg tggcagcccc 300 atcaacgtga tgtaggtagc gcaaccccgg taaggtagca taccgaattt ctcacctacc 360 acgaaaaatg gagaaagaag aaataacgaa cgatattttc ggcaaccgac ctggcaacga 420 aataaggact atgaatggaa acttggtacc tggaatgtca ggacgttaaa tgaacccgga 480 cgagcgagcc ttttggctcg tgaactgcag aaggttggag tgggcgtggc cgccattcaa 540 gaagtgcggt ggcctagatc tggagaacgt gaattccggg cggttgatcc catcgccaac 600 actacgttca aatataacat ctaccacagc ggcggcgata aggcagaaca tggagtcggt 660 ttcatagtga tcgggaagca gatgaaacgc gttatgaagt ggaaaccgat tagcgaacga 720 atctgtgtac tgaggatacg gggcaaattc ttcaactaca gcctgatcaa catctatgca 780 ccgacgaacg acaaacctga tgacgtgaag gacgcgttct atgaaggcct tgataaggcc 840 tatggagagt gcccaaaaca cgacgtgaaa atcgttatcg gagatgcgaa cgcgcaggtc 900 ggaagagagg actttttccg tccgatcatc ggtaaggaga gtcttcactc cgttaccaat 960 gacaacggcc tacgactagt gactttcgct gctgccaggg ggatggccat ctgcagcacc 1020 tactttgcac gcaaggatat tcggaagcac acctggaggc atccaaatgg tgatatttgc 1080 aaccagatag accatattct ggtggatggc cgacatttct cagatgtcat cgatgttagg 1140 agtttcaggg gtcctaacat tgactcggat cactacctcg ttgtcagtaa aattcgagcg 1200 cggttatcaa ctgtagcgaa ttcaagacca cagcgaacaa tgcgattcaa tatccagcgc 1260 ttctcagcag acggtgttac agctgaatac caccaaaagc tggacgagcg gataagtgag 1320 gtcaatgtga gcgaaaacct caacaatcta tgggatgcaa tccatggagt ggtgagtgaa 1380 acagcgcgag aagtgatagg tactgctcga agacgcccca gaagcggatg gttcgacgag 1440 gagtgccaga gggtgacgga tgagaagaac gtggccagac gtcggatgtt agtgtctggt 1500 accaggctga acagagagcg gtacagggaa gcaagatcag ccgaaaagcg aatccatcgc 1560 aggaagaaaa aagagtttga tgagaacgtg atagccgagg cgcaaaactg tatggaacag 1620 aacgatatgc gacgattcta tgaaactgtc aatggcgtgc ggagaaagtc agcgccgtct 1680 cccgtcatgt gcaacgaccg tgaaggtaat ttgctgacag ataaaacgat ggtggctgcc 1740 aggtggaaag agcacttcga aacgttactg aatggaggga acgatagagc atcggagaac 1800 agaatgaaca tcgacaacga cggtcaagct gtggagcctc catctctaga tgaggttaag 1860 aaggcgatta aagagctgaa gaacaacaag gctgctggga aggatgagct cccggccgaa 1920 cttctcaagc acggtagtga gcagctgcat agaataattc accatatact actaaggata 1980 tgggaggaag aagaactgcc tgctagctgg atggatggcc tcatttgccc tctcttcaaa 2040 aaagggcata gattggagtg cgccaattac cgaggaataa cactcctcaa ttcggcgtac 2100 aaaataatgt cacgtattct gttcaacaga ttgagaccgc ttgaagagtc cttcgtcggc 2160 gaataccaag ctggttttcg tgagggccga tcgacgtcgg atcaaatgtt taccctgcga 2220 cagatcctcg ataaattccg ggagtacaac ttgcagactc atcatctgtt cattgatttc 2280 aaggcggcgt acgattcagt taaaagaaac gagttgtggg aaataatgat ggaacatggc 2340 tttccgacga agctgattag actgattcgt gcaacgttgg aaggatcgaa atcaagtata 2400 cgtgttgcgg atgagatttc cacatccttc gtaaccttag acggattgaa gcagggcgat 2460 gcactttcaa acctattgtt caacatagcg ttagaaggag caataaggag agcttgcgtg 2520 cataggaatg gcactattat cacacgttcg tatatgctcc ttggttttgc ggacgatatc 2580 gacataatcg ggattgatcg ccgagccgtg gaagaggcat acgtgccttt taaaagggag 2640 acaacgcgaa ttgggctcat tatcaatacc acgaagacaa agtacatggt cgctggtaga 2700 catcgtgggc ccataggtga tgttggtagt gaggtggtga taggtggtga tatwtttgaa 2760 gtagttgaag aatttgtgta ccttggaact ctagtgactt gcgataatga tgttacccgc 2820 gaggttaaaa ggcgtattgc agctgcgagt cgggcttttt acggacttcg taaccagcta 2880 aagtcccgca gcttacagac gaagaccaaa ctcgcgttgt acaagactct aattcttccg 2940 gtagctctgt acggccacga atcttggacg ttgaaagagg tcgaccggag agcttttggg 3000 gtctttgaac gtaaagtgct gcgaacaata ctcggcggta aattggacaa tggcatctgg 3060 cggcgtcgca tgaatcacga gttataccaa gtctataatg aaatggatat tgttaagcgt 3120 ataaaacacg gcaggctgcg gtgggctggg catgtagctc gcatgtcgga ggaacgtcaa 3180 gctaaacaaa tattcagcag ggaaccaggg agaggccgtc gcctccgtgg aaggccgcgc 3240 accagatggc tttttgcagt tgaggaggat ttaaggtcgc ttaacgttca gggcgactgg 3300 aagcgattgg cccaggaccg ggtccaatgg agaaggcttc ttcattcggc gtagattcaa 3360 cgcaatagca aggaattgtt gcccatcaag tatcaagtaa gtaagt 3406 // ID Gypsy-7_BM-I repbase; DNA; INV; 4963 BP. XX AC nscaf2766; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_BM_; KW Gypsy-7_BM-LTR; Gypsy-7_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4963 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 989-989 (2010). XX DR Genome; nscaf2766; Positions 734699 739661. XX CC Positions [3489-3998] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 117..4394 FT /product="Gypsy-7_BM-I_1p" FT /translation="MMSGNAQRGRRRITRRERSSSPSPIEEILTRLRALED FT RSRAHVETQVTTPPVLVPVTVSETPPSPREMVSAEMGTPTLTSRASNMESA FT QIEDMVTHTAERLLSAISRAQVRSNHYFVSEFDPSVHDFDTWSDEVEKARI FT LNRWDDGECLARVGHCLKGEARTWLSQWTSNVRTWSNFKLELKSLCPRTVD FT VANILYEVMRTESDKYSTYAEYARKSLLKLRIVKGLSDELLTAIIVRGITD FT PQIRAMATNAKLSSDSIVEFLSSFVKPSNANNYNKRVYSNAPNRLNKHFNH FT SPLKRQHDSRDTKTVRIKCYRCHQFGHTFSNCYKRPNNITPNSPSSDEQAI FT ALPSTSRGDISKCAFCKKAGHSEDDCFAKERSRSRNKSGVNFCREQAGSYK FT NRDVTTAVISGIPLDVLIDSGSCVSLMSENLVKYFPCKMKPTTQLLCGLGG FT KEVKSQYLVTLPVEFSDITLETDFYIVDSNHLNVPVIIGTDVLNRDGVAYV FT RTSSCQRVVRSEGITNNVMQVNSISLSDRVNTPVTGYNREQLLAVLDRYSK FT YFLSGTATTTVKNSEMHIKLTTNVPVYYRPYKLSHDEKLKVRSIVRDLLDK FT GIIRESDSEYASPILLIKKKDGSDRMVVDYRALNRITVKTRHPLPLINDYI FT DRLGNARWFSALDMVSGFHQLRMAEESIHKTAFVTPEGHYEYCKMPYGLAN FT APIVYQKTITKSLKTFIDAGQVLVYIDDVLLMTDGLDENLALLESVIKTLT FT EAGFSINLKKCTFLTNEIEYLGRIISNGQVRPSTYKIDALVKSPRPNNVKQ FT VRQFLGLAGYFRRYIPNYATKTALIAALTKKGVNFHWSNEHEKARQEIISH FT LTNEPVLAIFDPELSTELHTDASSIGYGGIVMQVHNDGRRRVVAYFSKLTV FT GAESRYHSYELETLAVVKSLQHFRQYLIGKPFKIITDCNALKMTQRKKDLQ FT PRVARWWIYMQDFDFTLEYRKGCLMSHVDYLSRNPVNVVDLVQKPQNWAQV FT AQAGDEETLTLLEKLNNGQLDNTRYVKRNDLLYYKYDVTGEQARYLCYVPK FT AFRLSLLRVFHDEHEHIGVDKTVDLILQHFWFPGLRQFVSKYVTHCVVCIS FT HKQVPRAPLQPISSWHKAPVPFDTIHVDVLGPLRESEGHKHVLIMIDAYTK FT YCLLHAVTKQNCSELQRVMTQVISLFGTFKRLVCDRSRMFDNHEFINWISN FT FGIEMHFITPEMHQENGQVERYCRTVLNMIRVEVNYNDNDWSKVLWKLQLV FT LNVTKQKTTQYSALNLLVGTDSTTPLINALIKDITCEGSNPNRDAIREIRR FT QRAEGLIRENKDKQDSYVNKNRKPPKKFNVNDLVYVIKSSQSTGKLDSGMR FT GPYKITKVLPNDRYELQLLAGAYGKKSQAAAGYMILWKGEWTPETCAAFFE FT GICFIIDALFAL" XX SQ Sequence 4963 BP; 1508 A; 970 C; 1112 G; 1373 T; 0 other; tcagaagtgg gattcgaagg ctgaaaagct gttgaggcca catctgcttt agtaaacgaa 60 tcgtagcttt cctcagccga atttagtggg ctgaagttcg gacggagttt agctacatga 120 tgtctggtaa cgcccagcgt ggtagacgca gaatcactcg ccgtgagcga agttcttccc 180 caagcccgat tgaagaaatc ctgacgagac tgcgcgcact ggaggaccgg tcacgtgccc 240 atgtggagac gcaagttact acaccaccgg tgttggtgcc agtcacggtg tcagagacgc 300 cgccttcgcc tcgtgagatg gtgtccgcag aaatggggac gccgacattg acgtcacgcg 360 ctagcaatat ggaatcagcc caaattgaag acatggtgac acacactgca gagcgtctct 420 tgtccgctat aagccgtgct caggtcaggt ccaaccatta ttttgtctca gaatttgatc 480 cgagtgttca tgacttcgac acgtggtctg atgaagtgga aaaagctcgt attttgaacc 540 gttgggacga tggagaatgc cttgctcgcg tcggacattg tttaaaaggg gaagctcgca 600 cctggcttag ccaatggacg agtaatgttc gaacatggtc aaatttcaaa ctagaattaa 660 agtcgctatg cccacgtact gtcgatgtcg cgaacattct atacgaagta atgcggacag 720 agtctgataa atactccacg tatgctgagt atgctcgtaa atcgctttta aaattacgca 780 tagttaaggg actcagcgac gagcttctaa ccgctattat tgttcgtgga attacagatc 840 cacagattcg tgctatggcc actaatgcta aactatcttc tgactcgata gtggagtttt 900 tatcaagctt tgtcaaacct tcaaacgcaa acaattataa caagagggta tattcgaatg 960 ctccgaatcg gttaaataaa cattttaatc atagtccgtt gaagcgtcag cacgattcta 1020 gagatacaaa aactgttaga attaaatgct atcggtgcca tcaattcggt catacatttt 1080 ctaattgtta taaacggcct aataatatta cacccaattc tcctagcagt gatgaacaag 1140 caattgcact accgtcaaca tctcgcggcg atatttcaaa atgtgcattc tgtaaaaagg 1200 caggtcactc agaggacgat tgtttcgcaa aagaacgctc taggtcacga aataaaagtg 1260 gtgttaattt ttgccgtgaa caagcgggta gctataagaa ccgcgatgtg acaacagccg 1320 taatttctgg gataccgtta gatgtattaa tagatagcgg atcgtgtgtc tctttaatgt 1380 cggaaaacct agtgaaatac tttccatgta aaatgaagcc tactactcag ttgttatgcg 1440 gcctaggcgg taaggaggtt aagtctcaat atttagttac gctacccgtc gagttcagcg 1500 acataacttt ggaaactgat ttctatattg tagacagtaa tcatttaaac gtacccgtta 1560 ttatcggtac tgacgtatta aatagggatg gggttgccta cgttagaact agtagctgcc 1620 aacgtgtggt gcgcagtgag gggatcacta ataacgttat gcaagtaaat tcgatttcat 1680 tatctgatcg agtaaataca cctgttactg gatataatag ggaacagtta ttagccgttc 1740 ttgatcggta ttccaaatac tttttaagtg gtaccgcgac cacaacagtg aaaaatagtg 1800 agatgcatat aaaattaaca actaatgtcc ctgtgtatta tagaccctat aagctgtcgc 1860 acgacgaaaa gttaaaagtt cgctccatcg ttagagattt gcttgataaa ggcattattc 1920 gcgagtccga ctcggagtac gccagtccga tcttactaat aaaaaagaaa gatggttctg 1980 atagaatggt ggtggactat cgcgctttga atcgaatcac cgtcaaaact cgacaccctc 2040 tccctttgat taatgactac atagatagat taggtaatgc gcgctggttc agtgcgctgg 2100 acatggtttc tggatttcat caacttagaa tggccgagga gtcaattcac aagaccgcat 2160 tcgtaacccc cgaaggtcac tacgaatatt gtaaaatgcc ctatggtcta gcgaatgccc 2220 ctatcgtata tcagaaaacc atcacaaaat cactcaaaac cttcattgat gctggacagg 2280 tcttagtgta tatcgatgat gttttattaa tgactgacgg actggacgag aatcttgccc 2340 tgttagaatc tgtcataaaa acactaacag aagcgggttt ctctattaac ttgaagaaat 2400 gtacattctt aacaaacgaa atagagtacc taggaagaat tataagtaat ggtcaggtta 2460 gaccgagcac ttataaaata gacgcgttag ttaagtctcc acgaccgaat aatgtcaaac 2520 aggttcgaca attcctgggt ttagcaggat attttcgaag atacattccc aactatgcta 2580 ctaaaactgc actcattgcc gctttaacaa agaaaggtgt taactttcat tggagtaatg 2640 aacatgaaaa agcgcgccaa gaaataattt ctcacttaac caacgaacca gttttggcaa 2700 tattcgatcc ggagctgtca actgaattgc acacggatgc gagttcgata ggttatggag 2760 gaattgtcat gcaggtacat aatgatggac gtagacgcgt agtcgcttat ttcagcaagt 2820 tgacagtggg tgctgagtca aggtaccaca gctatgagct ggagaccctc gcggtagtaa 2880 agtcattaca acattttaga caatatctaa ttggtaaacc ttttaaaatt atcactgatt 2940 gtaatgcttt aaaaatgacc cagcgtaaaa aggatctcca acccagagtt gctagatggt 3000 ggatatatat gcaagatttc gatttcacgc tagaataccg taaaggctgt ctcatgtcac 3060 acgtagacta tctgagtagg aatccagtca atgttgttga ccttgttcaa aaaccacaaa 3120 attgggctca ggtggcgcag gcaggtgacg aagaaacatt aacattgctt gaaaaattaa 3180 ataacggtca attagataac acgcgttatg taaaacgtaa tgatctcttg tactataagt 3240 atgacgtcac gggtgagcaa gcgcgatact tatgttatgt accaaaggca tttcggttaa 3300 gtttgctacg cgttttccac gatgaacatg aacacatcgg tgtagacaag acggttgatc 3360 tgattctgca gcatttttgg tttcccggac ttagacaatt tgtgtcgaaa tatgtcaccc 3420 actgcgtagt atgcatctca cataagcaag tcccacgcgc acccttgcaa ccaatatctt 3480 cgtggcataa ggctccagta ccatttgata cgatacatgt tgatgtactc ggacccttaa 3540 gggaaagtga aggtcacaaa catgtactca tcatgattga cgcatatact aagtattgtt 3600 tgctacatgc tgtcacgaaa caaaattgta gtgaactaca gcgcgtaatg acccaagtga 3660 tttcactttt tggtaccttt aaacgactag tatgcgatag gagtcgtatg ttcgacaatc 3720 acgaatttat taattggatt tcaaattttg gaatcgagat gcattttatc actcctgaaa 3780 tgcaccagga aaacggtcaa gtggagcggt actgccgaac cgttctgaac atgataaggg 3840 ttgaagtaaa ctacaatgat aatgattggt cgaaagtgtt gtggaaatta caactagttc 3900 taaatgtcac aaaacagaag actactcaat actccgcatt aaatcttctt gttggcactg 3960 actctactac tccattaata aatgcactaa tcaaagacat tacatgcgag ggctctaatc 4020 caaacagaga tgcgattcga gaaataagac ggcaacgcgc tgaaggactt attcgcgaaa 4080 ataaggataa acaagactct tatgtaaata aaaataggaa accgccaaaa aaatttaatg 4140 ttaatgactt agtctatgtg attaagagct ctcagagtac tgggaagctc gatagtggga 4200 tgagaggtcc ctataaaatt actaaggttc tgcctaatga cagatatgag ctacaactac 4260 tagcaggcgc ctatggaaag aagagtcaag cagctgctgg atatatgata ttgtggaaag 4320 gtgaatggac gcctgagaca tgtgcagcat tttttgaagg tatttgtttt ataattgatg 4380 cattatttgc tttataagct tgtttacaag ttttttctca ttttttaatt aaataaattg 4440 tgtaccttac taacataatt taattaacga tcattaagta agctgctcgg tgctcagagg 4500 aagagactca tgtaaatcct aagtgcgcac ggtattgggt tgaccctgac actcctggga 4560 aggtatcggg ttgagtcagt agcgccgctg acaagagacc tgtataaatc cgacccttag 4620 ctatgggtat ctgggttgac ctacaagaga ctgatgtaaa tcctgtttgc acttgctatg 4680 tgcagggtat ttggttgcgt gggaatagac ctgtataaag cctatgtgac ttggacgctt 4740 gtgcatacca ttgatttgtt tggctaattg gtgcataatg tgctcataat actgctgaat 4800 tcataaattt catggtaaat gttaagtgtg ttttctatgt ttgaagaagg cgacgaggag 4860 caaaatacgt tgtacgcgca gggagttgcc ggtccgtcga cgagttcctg ttaagggctt 4920 cgaggacgta gccccgtcag gagaaggccg tgttggaaga aaa 4963 // ID BEL-16_DWil-I repbase; DNA; INV; 5112 BP. XX AC scaffold_181134; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_DWil_; KW BEL-16_DWil-LTR; BEL-16_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181134; Positions 1050218 1055329. XX CC Positions [4427-5008] - Integrase core CC 'GACAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 117..3275 FT /product="BEL-16_DWil-I_2p" FT /translation="MQTVMQKRGYCKSQITRAYHNANTFIEKTHAVASYDT FT RLQQLQENYLRFLQLTEELHTYKDTADWEDPDVDIDPYEEKHYATYSMLSV FT AREELHVQYARSLNNRFPVTEAREGHVSQTHVNLHYERIKLPTFSGNYEDW FT QHFSDMFVGSIGSDPHLTSCQKFHYLKSYLSGEAFSLIKHIAVTNDNYSEA FT WDRLVQRYNKRHVIIRSFIDSFLNLPTASVLNSGAIRKLADGADEVIRGLR FT ALECENRDPWLIHVLLAKVDNNTRQAWAEHSESEDYTVTIDSFIKFLLTRG FT DTLESSQLSRANHSRRIATTHHVNTETQPLIEPCTLCQHAHPLSRCEQFRA FT LDAVARHAHVRSANLCFNCLSPNHRIATCPSKNTCRVCQRKHNTLIHAAQQ FT TTGREFRDSAVRVHQPESPGSNDESPAIISHHVQTTRGHQIAVTKHSKRVS FT LHTLENIPYAGSQTLLPTILTSVKDARGKTFTCRALLDTGSTVSLATESFV FT QRIGMRRTHAKVPVRGLAANRAGVTRGLVKFRICSCNSDHNIEVETYILST FT LTSALPAQNVDMSSTQWKEILKLPLADPSFGTVGAVDVIFGSDQLWKFCTG FT EQRSFGNNVPIALNTIFGWTLAGPYSTFGENQSLANTHHVHLDLDSQVRSF FT MEMDSIPSKTSTLEIFDPTEQHFINTHTRSQTGTYVVEYPFKDPIPPIGET FT LPQAVSRLQSLERKFSRDPSLKREYIAFMDEYLKSGHMELLNTQQIDEQAD FT GCFYLPHHAVLKPESTTTKCRVVFDGSGKDSSGVSLNERLHIGPPIQRDLF FT GVCLRFRQHRYVCCSDIEKMFRKIEVAPHHTNYQRIIWRSSTIDPMQHYRL FT RTVTYGLAPSPFLAVRVLKQLAIDYEAKYPLAARVLARDAYVDDIPTGCNS FT VGELMKLKEQLISLLDEGNFKLRKWSSNSWSFLKSIPKEDCLHGHASIEDN FT VKILGIPWNPGRDEFLVNTPMIATKQAPSKRALLSDLSKVFDPLGFVAPTT FT VLLKLIFQECWFSAISWDDPIPETLRRQWQAIKRNYPCYLTAEFHDT" FT CDS 3218..5110 FT /product="BEL-16_DWil-I_1p" FT /translation="MASNQKELPLLSHCRISRYIAASSPHIELCGFADAST FT QAYGAVIYSRVRTTNGYRSRLVAAKTRVAPLKPISIPRLELNAALLLSRLL FT KLVTESLTIPISSSHCWTDSEIVLHWLSSPPRRWNTYVCHRVAEIAEDYPR FT RCWNHVRSEENPADCASRGLSPSQLLNHELWWNGPSWITKPQSEWPLSEPH FT GQSFTDLATEERRQPITVLHGMLEDSIHELLINKLSSWSRLLRVTSYCYRF FT IERLRCKGAHSDKTFLTSVELQSARCLLLQHAQRKSFSKDYEKLNKQEPLS FT IKSRLIRYSPFLDERGIVRVGGRIERSSLHYNVKHPILLPKESPISLLLIR FT HIHQTHFHTGVDATFTILRQQYWILGARNLVRRVVFQCKLCFLQRRATHTQ FT LMGNLPLPRVQATPCFQHTGLDYAGPIAIKENKGRTPRIGKAWFAIFVCLA FT TKAMHLEVVSDLTTNAFIAAFKRFISRRTRPTDLYSDNGTTFHGSRRALDE FT MRRLAIDHMKDKELASFAASEGLTWHFIPPSAPHFGGIWESGVRSVKLHLR FT RVIGANALTFEELSTVLAQIEAVLNSRPLCANGDSSLDPLTPAHFLVGSPY FT TALPEPCYLDMTFNRVERWNQLQAMVQGFWKRR" XX SQ Sequence 5112 BP; 1418 A; 1253 C; 1139 G; 1302 T; 0 other; tataaaattt ggtccttcga gccggatact agctgctgat ttatttcaat gttttgcaag 60 ttgtgtaaag ttagacggta ctcagttgga agagaccgtg tctaacctca ttcgcgatgc 120 agacagttat gcaaaaacgt ggttactgca aaagccaaat cacgcgggct taccacaatg 180 caaatacatt catcgagaaa acccacgcag tggcaagtta tgacacacga ttgcaacaac 240 tgcaagaaaa ctaccttcgg tttctacaac tcacagagga gttgcacaca tataaagaca 300 ctgcggactg ggaagatccg gacgtggaca ttgatccata tgaggaaaag cactacgcaa 360 cttactcgat gttgagtgtt gcgcgggagg agctgcacgt gcaatacgcc agatcgctca 420 acaaccgttt tcctgtaaca gaggcacgtg aaggtcacgt atcacaaacc cacgtgaact 480 tgcactatga acggatcaaa ctaccaacat tttcgggcaa ttatgaggac tggcaacatt 540 tttcggatat gtttgttggt tctatcggca gcgatccgca tctcaccagt tgtcagaagt 600 ttcactatct gaaatcttat ttgtctggcg aagcgttttc gttgattaaa cacattgccg 660 tgaccaacga taactatagt gaggcttggg acaggctggt ccaacgttac aataagagac 720 atgtaatcat tcgttccttc attgacagtt ttctaaatct cccaacggca tctgtactca 780 actctggcgc gattcgcaag ctggctgacg gagccgatga agtaattcgc ggactaagag 840 cattagagtg tgagaatcga gatccctggt taattcatgt gctactggca aaggtggaca 900 acaacactcg gcaagcatgg gcagagcaca gtgaatctga ggattacact gtaacgatcg 960 acagttttat caaatttctg ctcacccgag gtgatacgtt ggaatcgagc caactttcac 1020 gagcgaatca cagccgacga atagctacta ctcatcacgt caacacggaa acccagccat 1080 tgatagagcc atgcaccctg tgccagcacg ctcatccgct atcaagatgt gaacaattca 1140 gagcacttga tgcagttgca cgacacgcac acgtgaggag tgccaacttg tgcttcaact 1200 gtttaagccc aaaccacagg attgcaacct gtccttcaaa gaatacatgt cgagtatgtc 1260 aaagaaagca taacaccctg attcatgcag cgcagcagac gactggccgt gaattcagag 1320 atagcgcggt gcgagttcat cagcctgaga gcccaggatc taacgatgaa tccccggcta 1380 ttattagtca tcatgttcaa actacacgag gtcatcaaat agcagttaca aagcacagca 1440 agagagtttc tcttcacacc ctggagaata ttccgtatgc aggttcgcaa acgcttttgc 1500 caaccattct tacatcagtg aaagacgctc ggggcaagac cttcacttgc cgagctctac 1560 ttgacactgg gtctacagtc tcattggcaa cggaatcatt tgtccagcgg attgggatgc 1620 gacgcaccca tgcaaaagtt ccagttcgtg gtctcgcggc taacagagcc ggcgtcacca 1680 gaggactcgt aaagtttcga atctgctctt gcaattcgga tcacaacatt gaggtcgaga 1740 catacatcct cagcaccctt acctcagctc tcccagctca aaacgtggac atgagctcaa 1800 cgcaatggaa ggaaattttg aagttaccct tagctgatcc atctttcggc acagttggtg 1860 cggtagatgt catttttgga tctgatcagc tctggaaatt ctgtactgga gagcaaagat 1920 cttttggtaa caacgttcct atcgcactaa atacaatttt tggttggact cttgcaggtc 1980 catatagtac attcggggaa aatcaatcac tggcaaacac tcatcatgtt cacctggacc 2040 tcgattcgca ggtgcgatcg ttcatggaga tggatagcat tccatcaaaa acatcgacat 2100 tggagatatt tgatccgact gagcaacact ttatcaacac tcacacccgt tcccaaactg 2160 gcacgtatgt agtagaatac ccattcaaag acccgatacc accaattggc gaaacccttc 2220 cacaagctgt tagccgattg cagtcactag aacgcaagtt tagtcgcgac ccaagtctca 2280 agagagagta tatcgctttc atggacgagt atctgaagag cggtcatatg gaattactca 2340 acacacagca gatcgatgag caggccgacg gttgtttcta tctgccgcat cacgccgtct 2400 tgaaaccgga aagcaccact accaaatgcc gagtggtgtt tgatggatct ggaaaagata 2460 gtagcggagt gtcactcaat gagcgtcttc acattgggcc tcccattcaa cgagatctat 2520 ttggtgtctg ccttcgattc agacaacatc gttatgtttg ctgttcggat atagaaaaga 2580 tgttccgcaa aatagaggtt gctccccatc atacaaacta tcaacggatc atctggcgca 2640 gttccacaat tgaccctatg cagcattatc gattgcgtac tgtcacctac ggattggcgc 2700 cttcaccctt cttggctgta cgggttttga aacaattggc catagactat gaagcgaaat 2760 atccactagc cgcgcgagtc ctagcgcgcg atgcatacgt ggatgacatt ccaactggat 2820 gcaattcagt tggcgaactt atgaaattaa aagagcaact gatttcttta ttagatgaag 2880 gaaatttcaa gttgagaaaa tggagttcga acagctggag tttcttgaag tcgataccaa 2940 aagaagattg tttacatgga catgcgtcga tcgaggacaa cgttaaaata ttgggcattc 3000 cttggaatcc tggtcgagat gagttcttag ttaatacacc gatgattgca accaaacagg 3060 cccctagcaa acgggcgcta ttatcggatc tgtcaaaagt cttcgaccca ctgggctttg 3120 ttgcgcctac cactgttttg ttaaagctaa tattccaaga atgctggttc agcgctatta 3180 gctgggatga cccaataccg gaaactcttc gaaggcaatg gcaagcaatc aaaaggaatt 3240 acccctgcta tctcactgcc gaatttcacg atacatagcc gcttcatcgc cgcacattga 3300 actttgtggt ttcgcggatg catccactca agcatatggc gctgtcattt atagtcgcgt 3360 tcgtacaacg aatggttacc ggagtcggtt ggtggctgcc aagacccgag tagccccgct 3420 taagcctatc tctattcccc gtcttgaact caacgccgct cttctactta gccgattact 3480 taaacttgta acagaatctc taactatacc catttccagc tcacattgct ggactgactc 3540 ggagatagtc ttgcactggt tatcatcgcc accccgtcgt tggaatacct atgtttgtca 3600 tagagttgct gaaattgcgg aggactaccc acgtcgctgc tggaaccatg ttcgctctga 3660 agagaatccc gccgactgcg catcacgagg tctaagtcca tctcagttgt tgaatcatga 3720 gttatggtgg aacggaccct cttggattac caaaccacaa tcggaatggc cactatctga 3780 accacacggc caaagcttta ctgacttggc gactgaggaa cgacgccagc caattactgt 3840 gcttcatgga atgcttgagg acagcattca cgagttatta atcaacaagt tatcgtcctg 3900 gagccgtctc ttaagggtaa ccagctactg ttaccgattc attgagcgac ttaggtgcaa 3960 aggtgcacac tccgataaaa cattccttac atctgttgag ttacaatcag cacgatgcct 4020 actgctgcaa cacgcacaac gcaagagctt ttccaaggac tatgaaaagc ttaataaaca 4080 agagccgtta tcaattaaat ctcgattgat ccggtacagc ccattccttg acgaaagagg 4140 aattgtacga gtcggaggcc gaatcgaacg atcatcccta cactacaatg taaagcatcc 4200 cattctactg cccaaggagt caccgatcag cttgttatta attcgacaca ttcatcaaac 4260 acattttcat actggagtcg atgccacatt cactattctt cggcaacaat attggatttt 4320 gggtgcacgt aatctagtgc gccgagtggt ttttcaatgc aagctctgct tcttgcaacg 4380 acgagcaact catactcagc taatgggtaa cttgcctcta cctagagttc aagcaactcc 4440 gtgtttccag catactggcc tagactatgc tggtccaatt gcaatcaagg agaataaggg 4500 tcgaacacct cgtattggaa aggcttggtt tgctatattt gtctgtctcg caacaaaggc 4560 aatgcacttg gaggtcgtaa gtgacttaac aaccaacgct ttcattgcag catttaaacg 4620 atttatttct cgccgaacga gacccactga tctctactcg gacaatggta ccacctttca 4680 cggaagtaga cgtgcgctag acgagatgag acgtttggcc attgatcaca tgaaagataa 4740 ggaacttgct agttttgccg cgagtgaagg cctcacatgg catttcatac cgccgtcagc 4800 accacacttt ggcggcatat gggaatctgg agtgcgatcc gtgaaactac atctccgtcg 4860 ggttatcggc gctaacgcac tgacctttga ggagttgtct actgtgttag cgcaaattga 4920 agctgtatta aactcaagac cactatgcgc aaacggtgac agctctctgg accctctcac 4980 acctgctcat tttcttgttg ggtcacctta cactgcgcta cctgagccct gttatttgga 5040 catgacattc aaccgtgtgg agagatggaa tcaacttcaa gctatggtac aggggttctg 5100 gaagcgccgg ta 5112 // ID Gypsy10-NVi_LTR repbase; DNA; INV; 178 BP. XX AC AAZX01006256; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10-NV; KW Gypsy10-NVi_I; Gypsy10-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-178 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1148-1148 (2007). XX DR Genome; AAZX01006256; Positions 2635 2458. XX SQ Sequence 178 BP; 48 A; 45 C; 50 G; 35 T; 0 other; tgtagcgaag tggcgccctc tgcgaggagc tgaatatact aggagaacgt ggggggcaga 60 acgaaagggt ccggcctagg acgcggacgt tgaccagagc ttgtaccata acctatataa 120 taaagacgag tattcctcag acgcgtggct attcttatta cccgaccacc ccgctaca 178 // ID BEL-631_AA-I repbase; DNA; INV; 6481 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-631_AA_; KW BEL-631_AA-LTR; Pao_Bel_Ele198; BEL-631_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6481 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5534-6094] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(38..1177,1181..6481) FT /product="BEL-631_AA-I_1p" FT /translation="MLGRTPIKTRSTTASRDGQPVGGSVRNVDGGNHGNHH FT CKACSGEDNTRMVQCDKCDDWYHFECVEVSQGVAYRDWNCPTCLSASNEVK FT KKDSAKVTSSAVSTSAISNPSIRVTSVPLNLVPQNPTSLTALPNVTTSTHI FT NASALVNVPSSTIFPPMSVIDNSSTLNPPNAYFSSSTCFAPNMLLPPSIFT FT LAGGTISHPHTIPQLNPSVVCAPNMFPEMTDIRQCLPSTSQPRLLPTSVTA FT NVPLASSTIIPTTVPLVGQTSNELLKVPSNKPGKNAKQVFGETSSQHSGTS FT KRSTKHKQLELELKMLDEERKLQEEENRRKREYLQKRYALLLEIASESSSV FT AEIEEDKNEDRVNEWIENGLTAGQVAGDQDPIPEQLRNVNPQPPPPNSMPP FT ICNQFGRPRPARCEAVPLSAPRMFPTLATERYSNIPAHQPQTFLPQHQVVH FT SRTQRPNMSHQLDREPRIANSLGLGRSSLRYDMFEHDEGNNLTRSQVVARQ FT AVSRELPTFSGTPEEWPLFFSSFTTTTAMCGYTAEENLVRLQKCLTGKAFE FT AVKCMLMHPMNVTQIISTLKMRFGNPEIIVHNLMAKISSTPAPKAEKLDTI FT IEFALSVQNLCATIEACQLEEYSYNVALLRELVDKLPTTIKLDWARYRRNF FT AAVNLSVFSSWLYELAEDICPLAGASSDSKLQRSGKNNPAFLNAHAEDVED FT EPKKPPHQSPSGQFKHAEVTCVVCKCDCPTIDKCQRFYELGYNSRWAVVKE FT FGLCRKCLKNHKGSCKVQKLCGKNGCEFKHHQLLHNNQRDLVAAPKTEVKS FT AASPEHREPNATRKFESECNIHHQITNKTLFRVVPVTLYGPDKHIKTFAFL FT DDGSSYTLMDATLATELEVEGKRSPLCLKWTGNMGRSENDSIKLNIQISGT FT GNNVKRYWLHGVHTVSSLDLFRQSVDAQEMANHYKHLRGIPVESYQNAQPR FT ILIGMKNASLSYPLKAREGKLNEPIATKTRLGWVLHGGSDEDDFLLAYHGV FT QVCICTERSDEFLEKAVREYFSLEGLGVTKPDKPLLCKEDQRALNILDGVV FT QTDDGHYETRLLWKYDEFRLPNSKPTALRRFHCLDSRMKKEPELAAALREK FT MKDYEEKGYIRKISEAELQAWMQRIWYIPIFPVFNVNKPGKLRLVWDCAAK FT TGNISLNSMLIKGPDQLTPLNSVLYRFRENKVGITGDIREMFLQVRIAKED FT QGCQLFLWKDNPNDETPSTYVSQVMTFGASCSPTCAQFVKNLNAEKYAGQF FT PRAVEAITKQHYVDDMLVSVESEEEAVQLAKDVRSIHAEAGFEIRNWISNS FT PAVLQALHQEKTADKSLYVGSELGMEKVLGMWWCTNSDCFTFKLSPKHDAM FT LLTGERKPTKREVLRTLMSIFDPLGLLSHVLVGLKILFQEIWRSAIGWDDE FT IPDNLYEKWENWLRILPKLEDVSIPRCYRLLTSMSPNTLVQLHTFVDASEL FT GYAAVVYLRFQQGSTIECAIVGAKSRVAPLKFVSIPRLELQSAVLGSRLAK FT TISEALSFKINERFFWTDARDVLCWLRSDHRQYSPYVAWRISEILDVTNVD FT NWRWISSQDNIADDATKWKRSPDLGCQSRWFKGPEFLWRAADTWPCEPFST FT NTTREELRANLLHHKDEFQPIFQTDKFSEWNRLLHVAATLIRFIANIRQKI FT AGIERNTGPFSKDELVCASNVLYREAQSSSFTEEMDILMNRTSSRAIPKSS FT SLFTLNPFLDGERVMRMHGRISACQYASTDAQNPIILPKDHHITKLIVKSY FT HQRYHHRNHETVLNELRQVFRIPKLRGVFRKVKADCQTCKIQRANPMPPPM FT ADLPNGRLAAYTRPFSYVGVDYFGPMTVAVGRRNEKRWGVLITCLTIRAVH FT LEVAHSLSADSCIMALRTFMARRGVPIQIYSDRGTNFVAANKELSEALREM FT DQQQVIQEITSQHTEWTFIPPASPHMGGAWERLIQTVKANLQKMLPMRRPS FT DEVLRNTLAEIENLINSRPLTYVPVDDPDAPVLTPNHFILGSSSGLKPASR FT MDDRALILRRSWRESQREADLFWQRWLRDYLPELTKRTKWFSDVKAIEVDD FT IVVIVDPDLPRNCWLKGRIIAVKQAKDGHVRSATVQTKMGIYERPATKIAV FT LDVRREELVDQKTGVPGEE" XX SQ Sequence 6481 BP; 1901 A; 1444 C; 1527 G; 1605 T; 4 other; atttaaaaaa ttcgtttatc gccatactaa ctgggcaatg ctgggaagaa cgccaatcaa 60 gacgaggagt actactgcgt cccgcgatgg tcagcccgta ggaggctctg tacgaaacgt 120 tgatggtgga aaccatggca accaccactg caaagcttgc agcggcgaag acaatactcg 180 aatggtccag tgcgacaagt gcgatgactg gtaccatttc gagtgtgtgg aagtgagtca 240 aggtgtcgcc taccgcgatt ggaattgtcc tacgtgccta tctgcctcta acgaagtcaa 300 gaagaaggat tctgcgaagg taacatcttc agctgtgagt acgtcggcta tcagcaaccc 360 ttcgatcagg gtaacatcag tacctttgaa tctcgttccc caaaacccaa caagtttaac 420 cgccttgccc aacgtaacaa ccagtactca tattaatgca tctgcattgg taaatgttcc 480 gtcgagtaca atatttccgc ctatgtctgt tattgataat agcagtacac tcaatccgcc 540 aaatgcctat ttttcatcta gcacgtgctt tgcgcctaat atgctattgc caccaagcat 600 atttacactc gctggtggca caatatccca cccccacaca attccacaat tgaacccgtc 660 tgttgtgtgt gctccaaata tgtttccaga gatgacagat ataagacaat gtctaccatc 720 aacatctcaa ccaaggctcc tgccgacatc agtaaccgcc aacgttcccc tggcatccag 780 caccatcatt ccgaccacgg ttccgttagt tggacaaact tcgaatgagc tgctgaaagt 840 accgtcaaat aaacctggaa aaaacgcgaa gcaagttttc ggggagacat caagtcagca 900 ctcaggaacg tcgaagagat ccacgaaaca caaacaattg gagttggagt tgaaaatgct 960 ggacgaggag agaaaattgc aggaagaaga aaaccggagg aaacgtgaat accttcaaaa 1020 gcggtacgcc cttttattgg agattgccag tgaatcatcg tcggttgctg aaatcgaaga 1080 agataaaaac gaggatcgag tgaacgagtg gattgaaaat ggtttgacag ccggacaagt 1140 agccggggat caggatccga tcccggaaca actgcgtwkg aatgtgaacc cgcagccacc 1200 acctccmaac agcatgcctc caatttgtaa ccagtttggt aggcccagac ctgctcgatg 1260 tgaagctgta cccttgagcg ctcctagaat gtttccaaca ttagcgactg aacggtactc 1320 taatatccca gcccatcaac cacaaacgtt tcttccgcag catcaagtcg tacactctag 1380 gacgcaacgc ccgaatatga gccatcaact agatagagag ccgagaatcg caaattcatt 1440 aggactcggt cgttcttcgt taaggtacga tatgtttgaa catgacgagg gtaataattt 1500 gactcgtagc caagtagtcg ctcgacaagc cgtctcacgt gagctgccaa ctttcagtgg 1560 cacccctgaa gaatggccac tattcttttc ctcgttcaca actacgactg ccatgtgcgg 1620 atatacagcg gaggaaaatc ttgtccgttt gcagaaatgc ctcaccggaa aagccttcga 1680 agcggtaaaa tgtatgttaa tgcacccgat gaacgttact caaattatct caactcttaa 1740 aatgcgcttc ggaaatcccg agatcatcgt gcacaattta atggccaaaa ttagctccac 1800 gccggcgcct aaagccgaga aattagatac tataatcgaa ttcgctctct ctgttcaaaa 1860 tttatgcgcc acaatcgaag cgtgtcaatt agaagagtat tcctataacg tggctcttct 1920 ccgtgagctg gttgataaat tgccaaccac tatcaagctt gattgggcca gatatcgccg 1980 taattttgcc gctgtgaatt tgtcagtttt ctcaagttgg ttgtacgagc ttgcagaaga 2040 catttgtccc cttgcagggg cgtctagtga ttcgaaactt cagcgaagcg gaaagaacaa 2100 tccggcgttt ctaaatgctc atgccgagga cgttgaggac gaaccaaaga agccgcctca 2160 tcaatctccg tcaggccagt ttaagcatgc agaagtgacc tgtgtagtat gtaaatgcga 2220 ctgtcctacc atagacaaat gtcaaaggtt ttacgagctg gggtataatt cacgatgggc 2280 tgttgttaaa gaatttggac tttgccgaaa gtgccttaag aatcacaagg gctcgtgcaa 2340 ggtacagaag ctctgcggta aaaacggttg cgaatttaaa caccatcaac ttctacataa 2400 taatcaacga gatttagtag ctgctccaaa aactgaggtc aaatctgcag catcacccga 2460 acaccgcgaa cctaatgcaa ctcgtaagtt cgagagtgag tgcaatatcc atcatcagat 2520 cactaacaaa accctgtttc gagttgtgcc tgtaactctt tatggtccag ataaacacat 2580 aaaaactttc gcgtttctcg atgatggatc ttcctatact ttgatggatg caactctggc 2640 tactgaattg gaggtagaag gaaaacgttc accactctgt cttaagtgga caggaaacat 2700 gggtcgttca gaaaatgact caattaagtt gaacattcag atttctggta ccggaaataa 2760 cgttaaaagg tactggcttc atggagtaca tactgtctct tctctagatc tattccgcca 2820 gtctgttgat gcacaagaaa tggcgaacca ttacaaacat ttgcgaggaa ttcctgtaga 2880 atcatatcag aatgcgcagc ctcgcatact gattggtatg aagaatgcaa gtctgagcta 2940 ccctttgaaa gcacgagaag ggaaattaaa cgaaccgata gcaacaaaaa ccagactagg 3000 gtgggttctt cacggtggat ctgatgaaga tgactttttg ttagcctatc atggtgtaca 3060 agtatgcatt tgtactgaaa ggtctgatga atttctggag aaagcggtac gagaatactt 3120 ttcactagaa gggcttggtg ttacaaaacc cgataaacca ctattgtgta aggaggatca 3180 gagagcgcta aacatcttgg atggagttgt ccagacagac gacggacact atgaaacacg 3240 tctactctgg aaatatgatg aatttcgttt gccgaatagt aaaccgactg ctttacgacg 3300 attccattgt ttggattcca ggatgaaaaa ggaacctgaa ctagctgctg ctcttcgtga 3360 aaaaatgaaa gattatgagg aaaagggcta cattaggaag atttcggaag cagaacttca 3420 agcgtggatg caacgtatct ggtatattcc gatatttccc gtattcaacg ttaacaaacc 3480 gggaaaatta cgcctcgttt gggattgtgc agcgaaaacc ggaaatattt ctctgaattc 3540 tatgttgata aagggtcctg atcaattaac tccgttgaat agtgttctgt accggtttcg 3600 agaaaataag gttggaatca caggcgatat tagagaaatg tttctgcaag tgagaatcgc 3660 aaaagaagat caaggttgcc agctgttttt atggaaggat aatcccaatg atgaaacacc 3720 aagcacatac gtgtcacaag tcatgacctt tggggctagc tgctcaccaa cctgtgctca 3780 atttgtcaaa aacttgaatg cggagaaata tgccggccaa tttcccaggg cagttgaagc 3840 gatcactaag cagcattatg tggatgacat gttggtcagt gtcgaaagcg aagaggaggc 3900 tgttcagtta gcgaaagatg ttagatccat tcacgctgaa gctgggtttg aaatacggaa 3960 ttggatttcg aactcgccag ctgtacttca agctctgcat caagaaaaaa cagcggacaa 4020 aagcctttat gttggatcag aattgggaat ggaaaaggtc cttgggatgt ggtggtgtac 4080 gaattcggat tgcttcacat tcaaactctc gccgaaacac gacgcaatgc ttctgacggg 4140 agaacgaaag ccaactaaaa gggaggtact gcgtacgctc atgtccattt ttgacccgct 4200 gggcctgcta tcccatgtac tcgtcggtct caaaattttg ttccaggaga tttggcgatc 4260 tgccataggt tgggacgatg agataccgga caatctatat gagaaatggg agaactggtt 4320 gcgtattctt cctaaattgg aagacgtctc catccctcgt tgctaccgtt tgctgacgtc 4380 gatgagtccg aatacgttgg tacagttaca tacctttgtc gacgctagtg agttgggtta 4440 tgcggctgta gtatatttgc gttttcaaca aggcagtacg atagagtgtg cgatcgtcgg 4500 ggctaaatca cgtgtagcgc cactcaaatt tgtctccatt cccagactgg agttacagag 4560 tgctgtcctc ggctcaaggc tcgccaaaac aatttctgaa gccttgtcct ttaagatcaa 4620 cgaacgattc ttttggactg acgctcgcga cgttctttgc tggttgcgct cagaccatcg 4680 tcaatattcc ccgtacgtgg cttggcgaat cagcgaaata ctggatgtga caaatgttga 4740 taactggagg tggatcagtt cgcaggacaa catagcagat gatgcaacta aatggaagcg 4800 gtcgccagat cttggctgcc agagcaggtg gttcaaagga ccagaattct tgtggcgcgc 4860 agcagacact tggccatgtg aacctttcag cacgaataca accagagaag aacttcgagc 4920 aaatttgctc catcacaagg atgaattcca gccgatattc cagacggata aattttcgga 4980 atggaatcgt ttgctacatg ttgcagcaac attgatcaga ttcatcgcca atatccgaca 5040 aaaaatagca ggtatcgaac gtaacactgg gcctttctcc aaggacgaac ttgtatgcgc 5100 atcaaatgtg ctttaccgag aggctcaaag ttcgtcattc acagaagaga tggacattct 5160 gatgaatagg acgtccagta gggcaattcc gaaatcaagt tccttattca cgttaaaccc 5220 attcttagat ggagagcgag tgatgagaat gcatggacga atcagcgcat gtcagtatgc 5280 cagtactgat gcacaaaatc ctataatact tcctaaggac caccacatca caaagcttat 5340 cgtaaaaagt taccatcagc ggtatcacca ccgcaaccac gagactgtgc tgaacgagtt 5400 gcggcaagtt tttcgaattc cgaaattacg aggagtgttc cgaaaggtca aggctgattg 5460 tcaaacatgt aaaattcaac gtgccaatcc catgcctcca cctatggcag accttccgaa 5520 tggaagattg gccgcgtaca ctaggccttt ctcctacgtc ggcgtagatt attttgggcc 5580 catgactgtg gcagtaggac ggagaaacga aaagcgatgg ggagtcctta taacatgctt 5640 aaccatcaga gcggtacatt tagaagtcgc acattccctt agtgcagact cttgcattat 5700 ggcattacgg acgtttatgg caaggcgggg ggttcctatt caaatttata gcgatagggg 5760 aacgaatttt gtcgccgcta acaaggagtt gagtgaagca ttgcgagaaa tggatcagca 5820 gcaagtcatc caagagataa caagtcaaca cactgaatgg accttcattc caccagcatc 5880 tccacacatg ggtggtgcat gggagaggct catccagaca gtgaaagcaa accttcagaa 5940 aatgctgcca atgcgacgtc cgagtgatga agttctgcgc aatactttag ccgaaattga 6000 gaacctcatc aattctcgac cattgactta tgttcctgtg gacgaccctg acgcaccggt 6060 gttaacaccg aatcatttca tcctaggatc ttccagcggt ctgaagcccg cttctmgaat 6120 ggatgacaga gcactaattt tgcgaagatc ttggcgagag tcacaaaggg aagcagacct 6180 attttggcaa cgttggcttc gtgattacct gcctgagctt accaaacgga cgaagtggtt 6240 ttctgatgtc aaagcaattg aagtggacga tattgtggta atagttgatc cagatcttcc 6300 aaggaattgt tggctaaaag gcaggataat tgcggtaaag caggcgaagg acggacatgt 6360 gagatcagct acagtacaaa ctaaaatggg gatctatgag cgacccgcaa ccaagattgc 6420 tgttctggac gtacgccgcg aagaattagt agaccagaag actggcgtac ctggggagga 6480 g 6481 // ID Gypsy-7_AA-I repbase; DNA; INV; 4375 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_AA_; KW Gypsy-7_AA-LTR; Gypsy-7_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4375 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 983-983 (2011). XX DR [2] (Consensus) XX CC Positions [3339-3800] - Integrase core CC 'CATA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..4169 FT /product="Gypsy-7_AA-I_1p" FT /translation="MANVNAELLGQLSAMIAEALQASIGNAIQQVNAGAQN FT QANDAAAPTPKIPTFSMSEYRPSDGSSVADYFSRFKWALELSQIPQVQYAN FT YARVHMGVELNNALKFLVSPQDPATIDFEVLQTTLVNHFDRKKNKYVESIK FT FRQIVQQTGESVAQFVLRLRQGAADCEYEDFLDRMLIEQLLHGLEAREMCD FT EIIAKKPATFKEAYEIAYTLEATRNTAREVNTGATSSAPEETNRLGYESLR FT TRRKSTFNHPKNVHGEQRTNIQPDNEVGPCKGCGGQHLRSQCRFRDVRCHN FT CNIKGHIAKVCRSRKLSARDSSADQVESQESPSPEIDVVQSLSQVHDITST FT GKKMIDVKIDGRSLRMELDTGAPCGIIAESKLRLIKPNFALQKTNRRFSSY FT TGHPIECLGRIPVNVTIGATNRRLNLYVVAGESDSLFGREWISQFVEQINL FT NRLFATEVPVNTITSSEITPDQATRLSALLGGFEEMFSEVPGKLVGPPAKV FT HLKPDACPVFVKARDVPHALRERYAAEIDKKLASGFFEKVEYSEWSSPTHI FT VVKKNGDLRITGNYKPTVNPRMIVDEHPIPKIESIFNKMNGATMFCHLDVT FT DAYTHLPIDEEFSHVLTLNTATHGLIRPTRAVYGAANIPAIWQRRMESVLQ FT GMDDVVVSFYDDIIVFAKDFDSLLQALSVTLDRLRLNGLRLNRTKCVFATS FT SLDCLGHKIDRHGLHKSDKHVEAIRDAPRPTTPEELQLFLGKATYYSAFIP FT NLSERARSLRDMLLSDSFQWTPEADKAYRDLKAVLTSPQVLMQYDPALPLV FT LATDASKTGLGAVLSHRLSNGLERPIAFASTTMSSTEQRYPQIDKEALAIV FT WAVKKFFHYLYARKFTLVTDHKPLSQILHPDKSLPTLCISRMANYADYLAH FT FDFNIVYKPTKQHTNADYCSRIPSKTTSSGINMLGMQEGGSIEDEFDGFVL FT HQIQQLPVRAEHIARETRKDDHLGKIVQTLETGRDLSQCGYKSPENKYTLA FT ANCLHFEHRVVIPPSLRQSILTDLHVAHLGVVKMKSLARSFVYWPGIDSDI FT EVAVRSCCDCARESPAPPKFNRHHWEYPKGPWERIHVDYAGPVADMMLLII FT VDAYSKWVEVKVTSSSTTPATIAILDELFSTYGAPTTLVSDNGPQFTAAEF FT KHFLQMSGVKYHKLSAPYHPATNGQAERYVQTVKQALKSMGTTRSTLKTNI FT NEFLRQYRKATHSETAESPAKLFLGRNIRTRLDLVRPLDAGTRITEKQQSN FT FNPTFRSFIQGQKVYFLSGNARLSKWIPGTIAEQLGDLHYRIDYHGKLVRR FT HVDQIQRFHDSDQNNVPPTSPTAPTQQQESKSSEYKGVVQRRVRYYGE" XX SQ Sequence 4375 BP; 1236 A; 1130 C; 1032 G; 974 T; 3 other; catacacgac aatttggtgt cagaagtggg atcgtggatt cgcgtcgcgg tagttctcca 60 aaaccgcgaa cttcttcccg agatacaaaa tggcgaatgt aaacgctgaa ctgctaggcc 120 agctctccgc catgattgca gaggcgctcc aggcatcgat tggcaatgcc attcagcaag 180 ttaacgctgg agcccaaaat caagcgaatg acgctgccgc tcccacaccg aaaattccca 240 ctttctcaat gtcagagtac cggccatctg acggatcatc cgttgccgac tatttcagtc 300 ggttcaagtg ggccttggaa ctgagtcaaa ttcctcaggt tcagtacgcc aattatgcga 360 gagtccatat gggagtggag ctgaacaatg ctctgaaatt cttggtaagc ccacaggatc 420 cagctaccat tgacttcgaa gtgctgcaaa ctacgctggt caatcatttc gaccggaaga 480 agaacaaata cgtggaaagt attaagttcc ggcagatcgt ccagcaaacc ggtgaatcag 540 tggctcaatt tgttctccgg ctgaggcaag gagctgcgga ttgtgagtat gaggattttt 600 tggaccggat gttgatcgaa caactcctcc acggtctgga agctcgtgag atgtgcgacg 660 agatcatagc aaagaagcca gctactttca aggaagctta cgagatagca tacacgcttg 720 aagcaacccg caacacggck cgggaagtga atactggtgc gacgtcttcg gcgccagaag 780 aaaccaacag acttgggtac gaaagcttga ggacaagaag gaagtcgacg ttcaaccatc 840 cgaagaatgt ccatggagaa cagcgaacca acatccagcc cgacaacgaa gtagggccct 900 gtaaaggctg cggtggacaa cacttacgca gtcagtgtcg ttttcgtgac gtsagatgtc 960 ataactgcaa catcaagggg cacattgcga aagtgtgccg ttcaagaaag ctatccgctc 1020 gggactcatc agctgatcag gtggagtcac aagagtcgcc ctcaccggaa atcgacgtcg 1080 tgcagtcact gagtcaagtc cacgacataa catccaccgg aaagaaaatg attgacgtca 1140 aaatcgacgg tcgttccctg cgcatggaac tcgataccgg tgcaccctgc ggcataatcg 1200 ccgaatccaa attaaggttg atcaagccga acttcgcact acagaaaacc aacaggcggt 1260 tttcgagcta caccgggcac cccatcgaat gcttagggcg tatcccggtt aatgttacca 1320 tcggagccac aaatcgtcgg ttgaacttgt atgtagtagc cggcgagtca gattcactct 1380 tcggacgcga gtggatttcc caattcgttg agcaaatcaa tctcaatcga ttgtttgcaa 1440 cagaggtgcc ggtcaataca attaccagtt ctgaaattac accggatcaa gctactcgtc 1500 tttcggcgct gctaggtggc tttgaagaaa tgttcagcga agtcccaggg aagttggtag 1560 gtcctccagc taaggtgcac ctaaaacccg atgcatgccc ggtatttgtc aaggctcgtg 1620 atgtaccaca tgcactacgc gagcgatacg cagctgaaat agacaagaaa ctggcgtctg 1680 gttttttcga gaaggtggag tattctgagt ggtcatcccc cacccacata gtagttaaga 1740 aaaacggaga cttgcggatc accggcaact acaaaccaac cgtcaaccca agaatgatag 1800 tagatgaaca ccccatacca aaaattgagt ccatattcaa caagatgaac ggcgcaacga 1860 tgttttgtca tctggacgtg acagacgcct atacccatct tccaatcgat gaagaattca 1920 gccatgtatt gaccctcaac acagctacac atgggctcat ccgtcccact cgagccgtat 1980 acggagcagc aaacatcccc gcaatttggc aacgccgtat ggaatcagtt ctccaaggaa 2040 tggatgacgt cgtcgttagt ttctatgacg acataattgt attcgccaag gacttcgaca 2100 gccttctaca agcgctatcg gttacactag ataggttgag gctgaacgga cttcgtctga 2160 accgaactaa atgcgttttt gcaacgtcat cgctcgattg tttgggccac aagatcgatc 2220 gccacggatt acacaaatcg gacaagcacg ttgaagccat ccgggatgcg ccgcgaccta 2280 caacaccaga agaattgcaa ctattcctag gtaaggcaac ttattacagc gcattcattc 2340 cgaatctatc tgaaagagca aggagcctac gcgacatgct tctttccgac tcgtttcaat 2400 ggacccccga ggccgacaag gcctaccggg acttgaaagc agttttaaca tcaccccaag 2460 tgttaatgca gtatgaccct gcactaccat tggtgctcgc cactgatgca agcaagacgg 2520 gtttgggtgc cgtgctctca caccgcttga gcaacgggtt agaacgacca attgcttttg 2580 ctagcaccac catgtctagt acggaacaac gctatcccca aatagacaag gaagctctag 2640 ctatagtgtg ggcggttaag aaatttttcc actacctgta cgccagaaag ttcactctgg 2700 taacggatca caagccgctc tcacagattc tgcatcccga taaatcgctt ccaacgttgt 2760 gcatcagtcg catggcgaat tacgcggatt acctagcaca ttttgacttc aacatcgtgt 2820 ataagccaac aaagcagcac acgaatgctg attattgctc tcgtatacca agcaaaacta 2880 catcatcggg gataaacatg ctgggtatgc aagaaggagg aagcattgag gacgaattcg 2940 atggatttgt tctacaccaa attcagcaac tgcctgtgcg agcagaacat attgcacgag 3000 aaacgcgtaa ggatgaccat ctagggaaaa tcgtgcaaac gctggaaacg ggtcgtgacc 3060 tctcccagtg tggctacaag tctccggaaa acaagtacac cctagctgca aattgcctgc 3120 acttcgagca cagagtagtc atcccaccat cgctaagaca atcgatattg accgacctac 3180 atgttgctca tctgggagtc gtgaaaatga aaagccttgc caggtccttc gtctactggc 3240 caggaatcga ttctgacatc gaagtagctg ttcgttcttg ttgtgactgt gctcgagaat 3300 cccctgcacc acctaaattc aatcgacatc attgggagta tccaaaaggc ccgtgggagc 3360 gcattcatgt cgattatgcg ggcccggtag cggatatgat gcttcttatt atcgtggatg 3420 cctacagtaa gtgggtcgaa gttaaagtca cttcttcgtc aactacccct gcaacgatcg 3480 ccatcctcga tgaacttttc tcgacttacg gagcacccac aacactagtg tcggacaacg 3540 gtccgcagtt taccgctgcg gagttcaagc acttccttca gatgagtgga gttaaatatc 3600 acaagctgtc tgcaccctac catccggcca ctaacggtca ggctgagcga tacgtccaaa 3660 ccgtgaaaca ggccctgaaa tccatgggaa caaccagaag cacactaaaa acgaacatca 3720 acgagtttct tcgccagtac cgaaaagcga cccattcgga aacagctgag tctcctgcaa 3780 aactattcct ggggcgcaac atccgaacac gcttggattt ggtccgacca ctagatgcag 3840 gcacgagaat cacggagaag caacaatcca actttaaccc gacgttccga agcttcattc 3900 aaggacagaa ggtatatttt ctatctggaa atgcacggtt gagcaagtgg attcctggaa 3960 ctattgctga acaactggga gatttgcact acagaatcga ttatcatgga aaactcgtaa 4020 gacgacacgt agaccaaatt cagcgtttcc atgacagcga ccaaaacaac gtgccaccca 4080 cttcacccac tgcgcctact caacaacagg aatcaaagtc atccgaatat aaaggtgtgg 4140 tgcagcgacg agtgcgttac tatggtgaak ttatgacgtc atcaaaccgg tcaccctcaa 4200 cagcgagtga tggttctgca gttatgttag gatctccgga tgcacgtttt tcaactccgt 4260 gtggcactcc ggaactacag acaagacaca tcgatcctcc tatgctgcgc cgttcggaaa 4320 gacagcgccg cccgcctttg aagttctctc catagggagg aagaaatgtt gcata 4375 // ID PST1 repbase; DNA; INV; 168 BP. XX AC X56472; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE S.scalaris Pst-1 major satellite family unit. XX KW SAT; Satellite; Simple Repeat; PST1; Pst-1 repeat; KW Pst-1 satellite family; Repetitive sequence; satellite DNA. XX OS Stauroderus scalaris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; OC Acridoidea; Acrididae; Gomphocerinae; Stauroderus. XX RN [1] RP 1-168 RA Rafferty A.J.; RT "PST1."; RL Direct Submission to Genbank (05-NOV-1990)J.A. Rafferty, PATERSON RL INST. FOR CANCER RESEARCH, CHRISTIE HOSPITAL, WILMSLOW RD, RL MANCHESTER M20 9BX, UK. XX RN [2] RP 1-168 RA Rafferty A.J. and Fletcher L.H.; RT "Sequence analysis of a family of highly repeated DNA units in RT Stauroderus Scalaris (Orthoptera)."; RL Unpublished. XX DR GenBank; X56472; Positions 1 168. XX SQ Sequence 168 BP; 51 A; 27 C; 37 G; 53 T; 0 other; ctgcaggtgc agtgtgcagc ctggtaaacg taaaataata aattacattg tctaagagga 60 ctagagccgt ttaattgtgt atatatcgtg ttttgtgcat ttccatggaa acgacttatt 120 gtcatttcta acttctaaaa gcatggatac tagtgccata caggagag 168 // ID Gypsy13-NVi_I repbase; DNA; INV; 4260 BP. XX AC NW_001814827; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 20-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13-NV; KW Gypsy13-NVi_LTR; internal portion; Gypsy13-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4260 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1209-1209 (2007). XX DR Genome; NW_001814827; Positions 22706 18447. XX CC Positions [3340-3624] - Integrase core CC 'TTAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 243..1880 FT /product="Gypsy13-NVi_I_2p" FT /translation="MANQIINVGASLGSLREFKSGEDWNVYEERLEQYFIA FT NFVQEDRKVSVLLTSIGEEVYKTLRDLCDPVLPKGKTYTELCAILKRQFSP FT RISVFKERKNFYDLKQSANETVSLWYARVKKRAVNCEFGANLDERLKDKFV FT TGLRPGKILDRVCEESHSVSLKDLLEVALKKEATIDTSSIAAAAEDAASVH FT AVSAKSGGGHGRKEGRGSNSSAKPKQTSKQEATCWHCGGTNHDFSRCKYRL FT YDCSKCGQRGHLQKICKKKDASGKQNASVSTSKGSGRKGTTNFVEKEAGEA FT SSDESCEDILNVYAIAKVCKTRIPPETIDVKIAGRKVTMEVDSGAVLSVIP FT EALYRNKLAGCKLESSSKLLRMYDGTKVTPLGEIKVEVVYKNKSVKCTMVV FT VKRNDETALMGRELMQAFGLRICEVNALSVEDVREQIFKEFKEVFDDQLGR FT FKGEPVDLKLKADARPIFMKPRPVPFAFKKDMDKELDRLEKANIITKIDNC FT EWGSPLVPVLKPDGTLRACADYSVTVNSMLEDVIHPGSANRGIVYRVARR" FT CDS 1987..3624 FT /product="Gypsy13-NVi_I_1p" FT /translation="MNRLSYGTKPAVAIFQRLMEKVLLGCKNTINFLDDIC FT VTGETLEEHVNNLKEVLGRLQKFGLKLNFKKCKFFQSKVEYLGHVIDKNGL FT HKKTDKIEAITKAPRPENATQVKAFVGMVNYYGKFIPNLAKKLSPMYKLLK FT KNVVFKWNEECEKSFKEIKKEIVSENCLVHFNPDLKIKLACDACRDGIGAV FT LLHIFPDGTERPICFASRTLSKAEKNYATIHLEALAIYWGVKKFYQYLLGN FT HFVLESDHKPLLALFGEKKGIPLMAAGRLQRWALFLSGYNYVFKHVKGMSN FT GGADGLSRLPIKSTEKEGEERSEYFNFYVEERLPIDARLIKKATRTDQILS FT KVLLYTMTEWPNKVDEGLKAFFVRKNEISVENGMLFWGYRVIIPGKYTAEI FT LDEIHATHFGASKMKALARQYFWYPNMDSEIEKLAQSCDICRRNADNPNKA FT TLLKFEETKQPLDRIHIDFLGPFEGCTYLIITDAYSKWPEVYKMNKCNATI FT TIEKLRDYCARFGLPKKIVSDNGAQFTSEEFQKWIKMNGIKHVRTAPAQA" XX SQ Sequence 4260 BP; 1430 A; 663 C; 1056 G; 1111 T; 0 other; ttttgcgacg aggtaaatat ggatctagtg agagtgcaaa atgtcgaata atagccttat 60 ggacgctcca agaaggtgcc ggtaaccaac agtattcgtg gccggctaac acaacaacga 120 gtgcaacgcc gtctaatcca gcagtgacaa gttcgacaat agtaaacgcg tcttctccgg 180 ggggtggagc cagacctagg aattacttgc agcaaggaat tttcaacaca cagcgaaacg 240 ggatggctaa tcagattata aacgtgggag ccagtttagg ctcgctacga gagtttaagt 300 cgggagaaga ttggaacgtg tatgaagagc gtttagagca gtattttatc gcaaattttg 360 tgcaagaaga taggaaagtt tcagtgctcc taactagcat aggtgaggaa gtgtataaga 420 cgctaagaga cctgtgtgac ccagtcttac ccaagggtaa gacgtacacg gagttatgcg 480 cgattttgaa gaggcagttt tcgcctcgca tctccgtttt caaagaaaga aaaaactttt 540 atgatttaaa acaaagtgca aatgaaactg ttagtttgtg gtatgctaga gttaagaaac 600 gggccgtaaa ttgtgaattc ggtgctaatt tagatgaaag acttaaggat aaatttgtca 660 cgggactaag acctggaaaa atactagatc gagtttgcga agaaagccat agtgtttccc 720 ttaaggacct tttagaggtc gcactgaaga aagaagcgac gatagacacg tcatcaatag 780 ccgccgcagc cgaggacgca gctagtgtac acgcggtttc cgcgaaatcg ggcggcgggc 840 atggaagaaa ggagggaaga ggctccaaca gttcggcgaa acccaaacaa acatcgaagc 900 aggaagcgac atgttggcat tgcggtggca cgaaccacga cttttcgcgt tgtaagtatc 960 gcttgtatga ttgttccaag tgcggtcagc gaggtcattt acaaaaaatt tgtaaaaaaa 1020 aagatgcgag cggtaagcaa aacgcttcgg ttagtactag taagggtagt gggcgaaaag 1080 gcaccacgaa ttttgttgaa aaagaggccg gtgaggctag ttcggacgaa agttgcgagg 1140 acattttaaa tgtatatgct atcgcgaaag tttgtaaaac cagaattccg ccagaaacga 1200 ttgatgtaaa aatcgcggga aggaaagtta caatggaagt tgacagcggt gcggtgttgt 1260 cagttattcc cgaagccttg tatcgcaaca agctcgctgg ctgtaaatta gaatctagta 1320 gtaaattgtt gagaatgtac gacggaacta aagtaacgcc cttaggtgaa attaaggttg 1380 aagtagttta taaaaataaa tctgttaagt gcactatggt agtagttaaa aggaacgatg 1440 aaacagcgtt aatgggtcga gaattaatgc aggctttcgg tctccgtatt tgtgaggtta 1500 atgccctttc agtagaagac gttagagaac aaatatttaa agaatttaaa gaagtttttg 1560 atgatcagtt aggtagattc aagggggagc ccgtagattt aaagctaaaa gcggatgcta 1620 ggccgatttt tatgaagcca cgcccggtac cgtttgcttt caaaaaagat atggataaag 1680 aattggatcg tttagaaaag gcaaatatta ttacaaaaat cgataactgt gagtggggat 1740 cgccccttgt acctgttttg aaaccagacg ggactctgcg ggcttgcgcg gattactcag 1800 ttacagtaaa tagtatgctc gaagacgtga tacacccggg ttccgcgaat cgaggaattg 1860 tttaccgcgt tgcaaggagg taagttattt accgtgcttg atctggctca cgcttacaat 1920 cagctagaag tatcggatag tacaaagcga ttgttagcat ggagcacgca taaaggcatt 1980 tttgcaatga atagactttc ctatggaacc aaaccagcgg tagctatttt tcaaagactc 2040 atggagaagg tgcttttagg ttgtaaaaat acaattaatt tcttagatga catttgtgta 2100 acgggagaga cattagagga acacgttaat aatttgaaag aagtgttagg gagattgcaa 2160 aagtttggac ttaagttaaa tttcaagaag tgtaaatttt ttcagagcaa agtagagtac 2220 ttaggtcacg tgattgataa aaatgggtta cataaaaaaa ctgataaaat agaggctatt 2280 acaaaggcac ctaggccgga aaatgcaacg caagttaaag ctttcgtggg catggttaac 2340 tactatggga aatttattcc aaatttagca aaaaaattga gtcctatgta caaactttta 2400 aagaaaaatg tagtttttaa atggaatgag gaatgcgaaa agagttttaa agaaattaag 2460 aaagaaatag tctcagaaaa ttgtctagtt cattttaatc cggacttaaa gattaagtta 2520 gcttgtgatg cttgtagaga tggtatcgga gcagttttat tgcatatttt tccggatggt 2580 accgaaaggc cgatttgctt cgcatctaga acgctctcga aagcagaaaa gaattacgcg 2640 actattcatt tagaggcgct agctatctat tggggagtaa agaaatttta tcaatatttg 2700 ttaggaaatc atttcgtttt agagtccgat cacaagcctc ttttagctct tttcggtgaa 2760 aagaaaggta ttccgctgat ggcggcaggc cgtctgcaaa gatgggcttt atttttgagt 2820 ggttacaatt atgttttcaa gcacgttaaa ggtatgtcaa atgggggagc ggatgggtta 2880 tcaaggcttc caattaaaag cactgaaaaa gagggagagg aacgaagtga atattttaat 2940 ttttacgttg aagagcgatt accgatagat gcgaggctga taaagaaagc cacacgaact 3000 gaccagattt taagtaaagt actcctgtat actatgacgg aatggcctaa taaggtcgac 3060 gaaggcttga aagcattttt tgtgaggaaa aatgagatta gcgttgaaaa tgggatgctt 3120 ttttggggat atcgagttat catacctgga aaatatacag ccgaaatctt agatgagatt 3180 catgccacac attttggtgc tagtaaaatg aaggcgctgg ctagacagta tttttggtat 3240 ccaaacatgg acagtgaaat tgaaaaattg gcgcaaagtt gtgatatttg tcgtcgtaat 3300 gctgataacc ccaataaagc aaccttgctc aaattcgagg aaactaaaca gcctttagat 3360 agaatacata ttgacttttt aggaccattt gaagggtgta cctatctaat aattacggat 3420 gcgtattcta agtggccgga agtttacaaa atgaataaat gtaacgcgac cattactata 3480 gaaaaattga gagattattg tgcgaggttc gggttgccta agaaaattgt gtcggacaat 3540 ggtgcacaat ttacgtcgga ggaatttcaa aaatggataa agatgaatgg aattaagcat 3600 gtaagaacag cgccagcgca ggcgtaacga gagtgagtta aatagagaga ggcaaatcga 3660 aaattataaa gggaagcgag agttatactt tgaacctaat gattttgtgt acgtgcgaga 3720 ttatagaacg ccgaataaac caaaatgggc caaggctact gtttgtgagg ccctaggacc 3780 tagaaattat ctatgtaaaa tacacgagga tgaaaattta atttggaaaa gacatttgga 3840 tcaaataata ccaaaaggcg gattttttga aaatgtatgc gaagaagtac cggcggaggt 3900 gttagataaa gataagataa tgttagaaaa tgcacaagag ttggtacccg agattgaaag 3960 ttgtagagca gaaattcctg taccaatgga atccgctgta ttaatagaga gtagcgaacc 4020 ggaaagtaca agcgaaacga ctacagctgt aacaccaccg aatgaagggt ttaaaaatgt 4080 aaatgaaggg atgcgtgaaa gtgacaaaag aaatgactct ccgaaaacaa caaatatggc 4140 actggcgatg gttaataaac ctcttactct taatgttaat gaacgcccta agcgtactat 4200 taaaagaccg gacagattga atttataacc agtagtattt gttgtcttgg gagggaggag 4260 // ID R2_PS repbase; DNA; INV; 3358 BP. XX AC AF015818; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Porcellio scaber retrotransposon R2, complete sequence. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_PS. XX OS Porcellio scaber OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; OC Eumalacostraca; Peracarida; Isopoda; Oniscidea; Crinocheta; OC Porcellionidae; Porcellio. XX RN [1] RP 1-3358 RA Burke D.W., Malik S.H., Lathe C.W. and Eickbush H.T.; RT "Are retrotransposons long-term hitchhikers?."; RL Nature 392(6672), 141-142 (1998). XX RN [2] RP 1-3358 RA Burke D.W., Malik S.H., Jones P.J. and Eickbush H.T.; RT "The domain structure and retrotransposition mechanism of R2 RT elements are conserved throughout arthropods."; RL Mol. Biol. Evol 16(4), 502-511 (1999). XX RN [3] RP 1-3358 RA Burke D.W. and Eickbush H.T.; RT "R2_PS."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX RN [4] RP 1-3358 RA Burke D.W. and Eickbush H.T.; RT "R2_PS."; RL Direct Submission to Genbank (09-SEP-1998)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015818; Positions 1 3358. XX SQ Sequence 3358 BP; 730 A; 790 C; 1065 G; 773 T; 0 other; ggcaggagta actatgactc tcttgggcta tctgtaagga cattgcgtac ttgcggagca 60 ctcaagtgat taatagatta ctgatagaag cgttcccaac agtcccgctg gatactgtct 120 atttttgtgg aactcaagta ggctagggtc agccctagtt gtacgggttc accaggggta 180 ggccgtgcta acttctatcc ccgttagtga ccgagagctt cgaaaaagca acgctagatc 240 accagcagcc cccgggggga attaaccgat cgggggtctc ttggtcacgt atcggaatgg 300 cggagcgaat ggttagttgt gtcgtttgcc gaagggaatt tcgaacggcc agggggcttg 360 gcgtccatat gcagagggcc caccgcgaag aatacgatca acgcgtgggc caggaggctg 420 tggacgtcaa ggtccgctgg tcgaatgagg aaaccatcct cctcgcttcg aaggagcttg 480 agctgcaggg ccagggaatt caaaatctga atcaggaatt ggctcgtctc tttcccagat 540 cgctctaagc gattaaaggt aaaagaaagc aggtggcata tcaacgaatc cgtgatgaag 600 tggctcgggg ggtcggggtc cctgctgacc ctgaacctcc tgagccggtg gaggttgaga 660 tcccatgggt ggtctcgccg cctagggggg gattgtaagg gcgccgggaa tttccggtga 720 ggatctcgcc cggctgctgc atcaatgcgg tttttcggcg ggcgagatcg ggccaaatct 780 cccccggggc cgcgctcgca acgctcctac tccggagtgg gagctgaaca acagagtcaa 840 gcgtaagcgt cagaagtttg ctcttacgca gaccctctac tacgaggata agtccaaagc 900 gcttgacttc ataatggatg ggagaaatcc attcgaggcg ggcgagccct ctcgtgcggt 960 gagggatgag tggagggagt attttgagac acgtgtggac ctgggcaggg tggcggatgg 1020 tgcttgcttg agggtcggtc agcccgtgga cagatttcct gccgtgggtg acgacataac 1080 actggcggag ctcgagaatg cgttgaggag caccccctct aagaccgcaa agggggtgga 1140 cggtgtgggg ttggaggagt taaagcgagt gcctagaaga acccttttta acatcctcaa 1200 cggactttgg acccacgtcc cggagtcgtt gtacaaaggg agaatcactc taatccctaa 1260 gaagtcgcta ccggagttgg ctggagattt tagacctatt tgcgtgctcc ctgtggtggt 1320 tagactgcta cataggatac tcgcaaaacg gcttgcgatt gtccaacata ccgaattcca 1380 agccggtttc cagtctggca ggtctacttc ggagaacatc ttccttctga ggaccatttt 1440 ggagtcgttg cctgcgggga aggagtctat gtatatagca ctccttgact tccgtaaggc 1500 ctttgactcg gtcaaccaca cggtgttgtg tgggctgttg agggatttgg gcttgcccga 1560 gcgcctgacc gggtacgtag agtccatcta ccggtcagtg catctgactc tcggtgacga 1620 ctggttcgtt caggggcgtg gagtgttaca gggtgatccc atatccccgt tcctgttcaa 1680 cctgatgatc gattacatcc tgagtggcac tcaggcggga gtaggtgtcg gggtcggaga 1740 gcgcttagtt tccagcctgg catacgctga tgaccttgcc ctgcttgcgt cctctaggag 1800 gggtttgaat gcgaatctgg agtcggtcct cgcccgggca cgctcggtca atctggcact 1860 tggaattaac aagtgcgcga cgatcggcaa gaggtggctg ggccgagaga aaaagatgat 1920 tcttgaccga gaacccttcc tgttggaagg ggttgcgatt ccggtatatc gttggaacaa 1980 catatacaag tacttgggga tagaaacctc tccgggggcc gcggcccgct ggagtgtcac 2040 tggattgaga aaccgtctaa gtaagctcga gtccgcgtgt ttaaagcctc agcagagaat 2100 gcacctgctc aggtgttacc ttcttcctgg gctgtattac ggtttgatac accagggaat 2160 ctcggttggc ctgcttcgaa gtgctgataa gcaggtcaga gccgcagccc ggaagatcct 2220 tcaccttccg ggggacgtcc ccgtcccgtt gttcaatgaa aagtcggcgc tcggcgggct 2280 ggggctcgtc gagctagaag tcaaggtgcc ggagctgtta gacaagatca gaactggact 2340 gttggggcgg ggcggcctct ggagggtggt ggctgatcgg attaagatcc cgatctcgaa 2400 tagggaaaga cggctccttg tcacggagtt ttgtgacggc cgaggctttt gccaagcaag 2460 gaaaagaccc ggatgctgcg cgtgggtgtc gaacggcaaa cttctcatga agggagccga 2520 ttacgtagag gcagtcaaga tcaggttggg cgttgtgtcg accaaggaga gggcagcaag 2580 gggaaggaac gtgccccttc accaagtaac atgtgaccta ggctgcccga gggtggagtc 2640 gctggggcat atcctgcagg agtgcccttc gatttccaac ctccgaatct cccggcacaa 2700 cgcattggcc tcaaggatac gccgggctct cgcgggtaag gggtggagca cggtcgaaga 2760 accgactatt ccaacgatac caggaaaccg acgccccgac ctggtggcct ggcgtggaga 2820 aagggctatg gtctttgata cggccattgc gcgagctcac gcggcggttt tgaacgaggt 2880 gttcgatagg aaggttgtta agtataatac acctgaaata catcagtggg ttattggacg 2940 ttcgcgggca agagtggttg aggtctcggc gctggtcctc aactggcgtg gggatatggt 3000 ggagaagtcc tatgttatgc tgagggctct cgggttaacc tcggctatta aattcatgga 3060 ggtggcttgt atggaggggt ctgttagatc cctaaggttt ttcaaacaag cctccggcag 3120 atggggttga ggcgtgaatg tggttcgtcg ttcagttgaa cgggggtgga agtctgtcca 3180 aagactctcg cccggcatga acccgcccat gtcgttatcg tgtacataac agattaacgt 3240 tccctccggt gggacggttt actgttccgc cgggggtgga gattgctttt gtacatctcc 3300 cctctattaa gtttctgcgg tggtggaggc ggggaattat ttaaaaaaaa aaaaaaaa 3358 // ID Gypsy-236_AA-LTR repbase; DNA; INV; 214 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-236_AA_; KW Gypsy-236_AA-I; Gypsy-236_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-214 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1074-1074 (2011). XX DR [1] (Consensus) XX SQ Sequence 214 BP; 69 A; 46 C; 33 G; 66 T; 0 other; tgtagtatcg gtcctatcta catatatata attaccttgc tttatataaa tcttgagata 60 atatcgacaa tcctaagaac attgatactt gtacacccct acctgagtga gcgatcagac 120 caagcttccg gaataaagaa cattgttatt ttgtctatca accatacaag acgtgcgtgt 180 ctttatacgg tatccgaaac aacgcgattc taca 214 // ID CR1-72_HM repbase; DNA; INV; 3978 BP. XX AC . XX DT 29-DEC-2008 (Rel. 13.12, Created) DT 29-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-72_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3978 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1899-1899 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(107..766,770..3874) FT /product="CR1-72_HM_1p" FT /translation="MVNLQLGDFNTYKCEQEAIIVDLKKLIETLRTDNANL FT TKRVLDLENSKSKSSSAVTGDWASKVKKTPDQMQIINTIANETKERDKREN FT NLVIFGLEESTLSDKVKAIEEDKQNIETIMKKLKVNVKIKNVIKLKSKNGY FT KAPYIVVLNDKLERNSVLKNAKELKKDKTYESVFINPDLTESERYKSKLLR FT EECKKLNNDNINKANFYFGIRNDKVVKIKKQLNAKKDLNHIRCFNYETNNK FT INLFSGGGQNLNKLLSNKRQLDIEIVNFHERKKCDLSINKEHDTQKCNELM FT IWYTNATSLNNKIDELRLQCEIYKPDVICVTETWFKAESDVNIISYNIFRS FT DRKTHGGGVCIYVKQCINQFETNNLQLNNNFIEQIWTCICFKNNKMLIGCI FT YRPPDISHEYNNYVLLSINAAKMSIYQKEYSDLIITGDFNYNTIDWSDNNC FT PFSATDNESQTNKFIECLEDCFLFQCVCEPTFQIANYKSTNILDLIITNDK FT NKIVSITHSAPLGNVKQGHHLLKCCYLMNDYKFSTEHKKSIRLYQKGNYEL FT FSCHFNNVDWENGFKSKSINEMYDLFLYHYNYATNLFIPRINYQNRRNRNK FT WITPELKHKIRTKNNIWLANKRNGWNEESTKIQYNKLKIRIKNDIKNNIKQ FT FESTLASKFKSNPKLLYKYINEKKSVKTHIRALEIDNGEIINDQFAIATEL FT NKYFHSVYVNDNCTNIIEFHNIPSNLLLNTVLITQNDVQTRLQALDKSKSV FT GFDFVHPYVLKECSSSISYPLYLIFKKSMETGTLPDMWKKANVMPLFKKGS FT KLKASNYRPVSLTSIPCKVLERIIADCIMQHLIKNNLLSKKQHGFMKSKSC FT TTNLLEYLDILTDAFHNRTPVDVLYTDFKKAFDSVSHKKLLSKLLTFGISG FT KLLTWIVCFLANRKQRVVLGKSVTDWVDVLSGVPQGSVLGPLLFLVFINDL FT PENFINECRLYADDNKIIAPISSQNDSQLFQNDINKLDEWSTKWKLGLNFE FT KCKIMHFGSNNNEYSYFMNNNNTLMEIEKSKLEKDIGVYISSDLKWAKQVK FT YAASKANSILSLLNNTFKYKDKDLIKTLYCTYVRPNLEFSIQAWSPYYTKD FT INELEKVQRRATKMIPELRHLEYKKRLEILGLTTLETRRLRCDLIQQFKIF FT KGIEIIDIKQNQTMNSICSSGPAANIRGAKHRMKPELVRNCLKRQYFFSNR FT VVDGWNSLPAEVVQSTTVKSFKLNLQSVDLEAITNKIKTKKAYY*" XX SQ Sequence 3978 BP; 1617 A; 521 C; 617 G; 1223 T; 0 other; agtccctgtg taataagcaa aactaataat aataacatta caatttcact gacaggaggt 60 aatccactgt tatttagggt ggtagccctc ctatccagta atccaaatgg ttaacttaca 120 actaggtgat ttcaatactt acaaatgtga gcaagaagca ataatcgtag atttaaaaaa 180 gttaatcgaa actttaagaa cagataatgc aaacttgact aagagagttc tggatttaga 240 aaattctaaa tcaaaatcta gttcagcagt gacaggtgat tgggctagta aagtaaaaaa 300 aactccagat caaatgcaaa ttattaatac tatcgcaaat gaaaccaaag aaagagataa 360 aagagaaaat aacttggtta ttttcggttt agaggaatca acattaagtg ataaagtcaa 420 agctattgaa gaagataaac aaaatattga aacaatcatg aaaaagttga aggtcaatgt 480 caaaataaaa aatgttatca aattaaagtc aaaaaatgga tataaagctc catatattgt 540 tgttttaaat gataaattgg aacgtaattc tgttttaaaa aatgccaagg aattgaagaa 600 agataaaaca tacgaaagtg tatttataaa tcctgatttg actgaatctg aacggtacaa 660 atcaaagcta ttaagagaag agtgtaaaaa actaaataat gacaatatta acaaggcaaa 720 cttttacttt ggtattagaa atgacaaagt tgtaaaaata aaaaaatagc aactaaatgc 780 caaaaaagac ttaaaccaca ttcgatgctt taactatgaa actaacaata aaataaatct 840 gtttagtggt ggaggtcaaa atcttaacaa gttgctgagc aataagagac agctagatat 900 tgaaatagta aactttcatg agcgtaagaa atgtgattta agtataaaca aggagcacga 960 tactcaaaaa tgcaatgaac tgatgatttg gtatacaaat gcaacgtctc tgaataataa 1020 gattgatgag ttaaggttac aatgtgaaat atataaaccg gatgttattt gtgttacaga 1080 aacttggttt aaagcagaat cagatgtaaa tataataagt tataatattt ttagaagtga 1140 tagaaagact catggaggag gtgtttgtat ttatgtaaaa caatgtatta atcagtttga 1200 aacaaataat ttacagttaa acaataattt tatagaacaa atatggactt gtatatgttt 1260 taaaaataac aaaatgctaa ttggatgcat ttatagacca ccagatatat ctcatgagta 1320 taataactat gttttgctat ctattaatgc tgctaaaatg agtatttatc aaaaagaata 1380 cagcgatctt attattacag gtgacttcaa ctataataca atagattggt ctgataataa 1440 ttgtcctttt tcagcaactg ataatgaatc tcaaactaat aaatttattg aatgtcttga 1500 agattgtttt ctgtttcaat gtgtatgtga gccaacattc caaatagcaa actacaaatc 1560 aactaatatt ctagacctta ttatcacaaa cgataaaaac aaaattgttt ctataactca 1620 ctcagctcct ttagggaatg tcaaacaggg tcatcatttg ttaaagtgtt gttatctaat 1680 gaatgattat aagttttcaa ccgaacataa gaaatccatt agattatatc aaaaaggaaa 1740 ttatgaacta ttttcttgcc atttcaataa tgtagattgg gaaaatggat ttaaaagtaa 1800 atctataaat gaaatgtatg atttattttt gtatcattac aattatgcaa ctaatctgtt 1860 tatcccaaga attaattatc aaaatagaag gaacagaaac aaatggatta ctcctgaatt 1920 gaaacataaa ataagaacta aaaacaatat atggcttgct aataaacgta atggctggaa 1980 tgaggaatca acaaaaattc agtataataa gctaaaaatt agaataaaga atgatattaa 2040 gaacaatata aaacaatttg aaagtacttt agctagcaag tttaaatcaa atcctaaatt 2100 gctttataaa tatatcaatg agaagaaatc agtgaaaact catattcgag ctctggaaat 2160 tgataatggg gaaatcataa atgatcaatt cgccatagct actgaactaa ataaatattt 2220 tcattctgtg tatgttaatg ataattgtac aaatattatt gagtttcata atataccttc 2280 aaatttatta ttaaacacag ttttaataac acaaaatgat gttcaaacta gacttcaagc 2340 acttgataaa tcaaaatcag tcggttttga ttttgttcat ccatatgttt taaaagaatg 2400 cagctcgtca atatcatacc cattatattt aattttcaaa aaatcgatgg aaacaggtac 2460 cctaccagat atgtggaaga aggcaaatgt aatgccactt tttaaaaaag gaagcaaatt 2520 aaaagcatct aactacagac cagtctcgtt aacttcaata ccatgtaaag tgttagaaag 2580 aataatagct gattgtatca tgcaacatct cataaaaaat aatttacttt caaaaaaaca 2640 acatggtttt atgaaaagca agagctgtac aacaaaccta ttagaatacc tagatatttt 2700 aacagatgcg ttccacaata gaacaccggt agatgtattg tacacagatt ttaaaaaagc 2760 ttttgatagt gtttcacata aaaagttatt atccaagtta ttgacttttg gtatttctgg 2820 taaacttctt acgtggatag tctgtttttt agctaacaga aaacaaagag tggttttagg 2880 taaaagtgtg acagattggg ttgatgtact tagtggtgta ccacaagggt ctgttctagg 2940 tcctctcctt tttttggtat ttataaatga tttaccagaa aattttatta atgaatgtcg 3000 tttatatgct gatgataata agataattgc acctatttct tctcaaaatg attcacaatt 3060 gtttcaaaat gatataaata aacttgacga atggtcaaca aaatggaaat taggattaaa 3120 ttttgaaaaa tgtaaaatta tgcactttgg ttcaaataat aacgaatatt catattttat 3180 gaacaataac aatacattaa tggaaattga aaagtcaaaa ttagaaaaag acattggtgt 3240 ttatatatca agtgatttaa aatgggccaa acaagtaaaa tatgcagcta gtaaagcaaa 3300 tagtatactc tcactactaa ataatacatt caagtacaag gataaagatc taataaaaac 3360 actgtattgt acatatgtta gaccaaattt agagttttca attcaagcat ggagtccata 3420 ttatacaaaa gatattaatg aactagaaaa agtccaacga agagctacca aaatgatacc 3480 ggagcttaga catcttgagt acaaaaaaag attggaaatt ttgggtttga ctactttgga 3540 gactagaagg ctaaggtgcg atttaatcca gcaattcaaa atatttaaag gtattgaaat 3600 cattgatatt aaacaaaatc aaacaatgaa ctctatttgc tcatcaggtc ctgcagctaa 3660 cattagagga gcaaaacatc gaatgaaacc tgaattggtc agaaattgtt tgaaaaggca 3720 atattttttc agcaatagag ttgtcgatgg atggaatagt ctaccagccg aagttgttca 3780 atctactaca gttaaaagct tcaaattaaa cttgcaatcg gtcgacttag aagcaataac 3840 aaacaagata aaaaccaaga aagcttacta ttaacgtttt ttattattta gtgactgaac 3900 taacttgttc attaattaat ttataccggc tgtcatagat attgcctggc gctttatctc 3960 aacagcatta attaataa 3978 // ID Gypsy-2_SI-I repbase; DNA; INV; 4390 BP. XX AC AEAQ01005246; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_SI_; KW Gypsy-2_SI-LTR; Gypsy-2_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4390 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01005246; Positions 4623 234. XX CC Positions [3237-3722] - Integrase core CC 'GGTGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1046..2839 FT /product="Gypsy-2_SI-I_1p" FT /translation="MEVDSGASRSVIHVNDYKELFRDLELEPVSFKLKVVT FT GENVTIVGQITVSVSYHKKQFMLPLVILYSKSRFMPLLGRNWLNIFNPEWR FT NVIDTKLVIKTQDINSLTNSKSIKHNVSQVKENNKVKNMVTQIKKEYAEVF FT KDEPNAHIKKFKAEITLKEGVSPIFHRAYEMPYALKPKVEEEISKMVKSGI FT LTKVTHSDWASPVVIVPKKNSSDIRICVDFKKTLNRVIDRDHCVLPLPEDI FT FACLSGSVVLCVLDLKRAYQQLVIGENSKKLFTINTHLGLFKFNRLTYGVS FT AAPGIFQSCMETILAGISNAKCYLDDILIYGSSLTECYDYVRQVLERLREY FT NVKVNESKCKFFRESVEFLGHRIDAQRVHPLDDKVECIKKAPSPQNRTQLK FT SFLGMLNYYGKFIPMLSSILKPLYDLCGNSNGKFQWSEECERVFQESKKLL FT TESNVLIHFNPKLPIIVTYDASGYGVGAVLSHKIDGDLRPVLFASSTLSKA FT EQRYSNIERESLALIFALKKLHKYLYGRKFTLITDHQPLQFIFGKNKSIPV FT TAAARITRWALSLSAYDYELEYKPGKLIANADSLSRLPVEGRTDFRIFEFI FT " FT CDS 3351..4286 FT /product="Gypsy-2_SI-I_2p" FT /translation="MEEGTNAKETIMRLKEIFAVFGLPVELVSDNGPPFGS FT IEFVNFCQTNGITIIKAPPYHPQSNGIAERGVQTVKKSLSRALFHESGKSV FT SKNIIVSCLLNFLFTYRATPSSVTGISPAQDLFKVRPRTRFDLIKPSCCKS FT KLREGTLLSKKIHSFAVNQNVYVKNNQTKLWQIGKIVKIVSHSTYLVQVGN FT SIRFIHANDIRPNISNNIENQQGTNQKEGVPSSRPISSSKILNNNNDIIAL FT DTPLFNAVSPQQQDREIERKEESAEVVPIKVEENKKTVNSPTVKDVTVNTP FT KVFKTTRSGRIIRSPVKLNL" XX SQ Sequence 4390 BP; 1630 A; 637 C; 882 G; 1241 T; 0 other; taatattggc gacgagtaag aaaagaatcc acataatagt taaccaaaac aatttaagca 60 agaaacacaa ctcaagttgt aagtagaggt tagaatgtcg ctaatcggga atatagattc 120 aatcctaaag aggcggaaat aacagcgtat ctagaacggc tggagatgtt atttcaatgc 180 aatgatgtgg aggatacaaa aaaagtatca ttgctattga ctctaatagg aggagaagtt 240 tatgggacgc tcaaacatct tttagcgccg gcacttccaa gcagtaaatc attcggtgaa 300 ttgaaaactg tgttaatgaa tcattatagt cccaagaggc tagtgatagc agaaagattc 360 aaattctatt cggcaagcca agaagagaac gaagatataa aatcgttctt agccagactg 420 aagagtttga cacaatactg tgagttcgga ggattcttag aagaaacgct acgagatcgt 480 ctagtgtgtg gagttcgttc agaagggatt aaacgcaagt tattaacgga agaaaatctg 540 tcattcgaac gagcaatcca aattgctgta ggtttggaaa tagccgaggg tcaaatcaaa 600 ataatgggat cggagtctgg aacaatacac aaggtcaatt ttcaaaaaga caacaaaaat 660 aaaaagtcaa gtagaggtac tctacagttt aaatcgtcac ctaaccaatt caatagcggc 720 tataaggcta atcgccctgc atttgtccga aacaaaagtt gtaaaagatg tactagatat 780 catccggaaa atgttagctg tccggctatt aattgacagt gctactcgtg tcatcagaaa 840 ggtcatacag cgaagtcagt tctatgcaag aatagggtcc atgaattatc tacaaaagaa 900 gatgaagatg agtcagaggt cgttgaagaa gatacgttgg aattagggtg gatacaggaa 960 gaggacagaa aatattatat aaaccaaatg tagggacact gaaagtttaa agattaagct 1020 tccgatagaa ggtaaatatt tgttaatgga agtagattca ggagctagta gatcggtaat 1080 acatgtcaac gattataaag aattatttag agatttagaa ttagagccag tcagttttaa 1140 attaaaagta gttacaggag aaaatgtgac gatcgtaggg caaattactg taagtgtatc 1200 atatcataag aaacaattta tgttgccttt ggtaatttta tatagcaaaa gtagatttat 1260 gcctttacta ggtagaaatt ggttgaatat ttttaatcct gagtggagaa atgtcataga 1320 tacaaagctt gttattaaga ctcaagatat taattcgtta accaatagta aatctataaa 1380 acataacgtg tcacaggtaa aagaaaataa taaagttaag aatatggtaa cacaaattaa 1440 aaaagagtat gcagaagtat ttaaggatga accaaacgca catataaaaa agtttaaagc 1500 agagattaca ttaaaagaag gtgtcagtcc gatatttcat agagcatatg aaatgcccta 1560 cgccttaaaa ccaaaagtcg aagaggaaat atcaaaaatg gtcaaatctg gtattttaac 1620 aaaagtaaca catagcgatt gggcaagtcc ggtagtaata gtaccgaaga aaaatagttc 1680 agatattcga atttgtgtag actttaaaaa aactctaaat agagtaatag atagagatca 1740 ctgtgtgttg cctttgcccg aggatatatt tgcatgtttg agtggtagtg tggttttatg 1800 tgtgctggat ctaaaaagag cgtatcaaca gttagtaata ggagaaaatt caaaaaaatt 1860 gtttacaata aatacgcact tagggttatt caagtttaat cgtctgacat atggagttag 1920 cgcagctcct ggaatttttc agtcgtgcat ggaaacgata ctagcgggta tttcaaatgc 1980 taaatgctac ttagacgata ttttaattta tggttcctct ttaaccgaat gttatgatta 2040 tgtacgacaa gtgctggaac gtttgagaga gtacaatgta aaggtgaacg aatcaaaatg 2100 taagttcttt agagaaagtg tagaattttt aggtcacaga attgatgcac agcgagttca 2160 ccctttggat gataaagtcg aatgcataaa aaaagcacct tcacctcaaa atcgtactca 2220 actaaagtca tttctgggta tgttaaatta ctatgggaag tttataccta tgctttcgtc 2280 aatacttaaa ccactctatg acttgtgtgg aaattcaaat gggaagtttc agtggtccga 2340 ggaatgcgag cgagtatttc aagaaagtaa gaaattactt acagagagta acgtactcat 2400 acactttaac ccaaaattac caattattgt aacttatgat gcgagtggat acggagtagg 2460 agccgtgtta agtcataaga tagatggaga tttgagacct gttttgtttg cgtccagtac 2520 attatcaaaa gccgagcaac gttattctaa cattgagaga gaaagtttag cacttatatt 2580 tgcgttaaaa aaattacata aatatttgta tggaaggaaa tttacgttaa taactgacca 2640 ccaaccgcta cagtttattt tcgggaaaaa taaaagcata ccagttactg ctgcagcacg 2700 aattacacgt tgggcattat cgttatctgc ttatgattac gaattagagt acaaaccagg 2760 aaagttaata gcaaatgcgg atagtttgtc tagattacca gtggaaggga ggacagattt 2820 cagaatcttt gaattcattt agtttagtta atgaactccc ttttgatgca aatgatattg 2880 ctgacatcac aaaaaaagac ataattttag ctaaagtatt agaactaaca ctttcgggat 2940 ggccaaacac tattaagaat aaaattgaac tgggaccata ttttaagagg agacatgaat 3000 tattggtaga gaacaattgt cttttagtag gtaataaagt cattattcca aagatattac 3060 aaaaggaaat attacaacta ttccatgagc aacatttagg tatagtccga acaaaaatgt 3120 taatgagagc ttactgttgg tggccaggta ttaacgaaga tatcgagtga tttataaata 3180 attgtaaggt ttgtcaacaa acacaaaact ttacgagtaa gggtgcactt ctaccttggc 3240 cgagcgcacc tcataacttt tatagagtgc acatagattt cttccataaa tatgggtata 3300 catttcttat attgatagat tgtaagtcaa aatggctaga agtaaaatta atggaagaag 3360 gcaccaatgc gaaagagact attatgcggc taaaagaaat atttgcagtt tttggtttgc 3420 ctgttgagtt ggtatcagat aatggaccac ctttcggttc tatagaattt gttaattttt 3480 gccaaaccaa tggaataaca ataataaaag caccgcctta tcacccacaa agtaatggaa 3540 tcgcggaacg gggagtacaa acggtaaaaa aaagtctgtc aagagctctt ttccatgagt 3600 caggaaagag cgtcagcaaa aacataatag tctcttgttt actaaatttt ttgtttactt 3660 atagagcgac tccgtcaagc gttacgggta tatccccggc acaggatctt tttaaagttc 3720 gtccgagaac tagatttgat cttattaaac cttcatgttg taaaagcaaa ttgagggaag 3780 ggactctttt atcaaagaaa atacattcat ttgctgtaaa tcaaaatgtt tacgttaaga 3840 ataatcaaac taaattgtgg cagataggaa aaatagtaaa aattgtaagt catagtacat 3900 atctagtgca agtaggaaat agtataagat ttatacatgc gaacgatata agaccgaaca 3960 taagtaataa catagagaac caacaaggaa caaatcaaaa agaaggtgta ccttctagtc 4020 gtccaatatc tagcagtaag atattgaata ataacaatga tattatagcg ttagacacgc 4080 ctttatttaa tgcggtatca cctcaacaac aagaccgaga aatcgaacga aaggaggaat 4140 cagcagaggt ggtcccaatt aaggtagagg aaaataagaa gactgtaaac tcacccacag 4200 ttaaggatgt aacagtaaac acccctaaag tttttaaaac tactcgttcg ggaagaataa 4260 taagatcacc agtaaaatta aatttataag cgtcatactt tgttaataat aatactatat 4320 tttatatgta agttgtgtgt atatgcattc tatataatta ttatagtttt tatttaatca 4380 agggaggaaa 4390 // ID TTAA8_AP repbase; DNA; INV; 438 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA8_AP. XX NM TTAA8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-438 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1788-1788 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 438 BP; 154 A; 70 C; 66 G; 148 T; 0 other; gaggctgtca gcgcactatt tgttttctct ctctggctca cacgcaacat agacaaaatg 60 catttacgca aaatcatttt ttctatgtgt ttaagtaatc ttagagtaaa gtcacccatg 120 acaaaaaaga tagagaataa tacttttgag ggaatgacat atcgatttgt ctaaatattg 180 tctcaaaaca atttaaacat catttaaatt tacaacattt ttttatttat tttgaaagtc 240 gaatacaaag tcaaataata ataaaatata aaatcgatat gtcattccct caaaaatatt 300 attctctatc ttttttgtaa taggtgactt tactctaaga ttacttaaac gcatagaaaa 360 aatgattttg cgtaaatgcg ttttgtctat gttgcgcgtg ggccagagag agaaaacgaa 420 tagtgcgctg acatcctc 438 // ID EnSpm-2_HM repbase; DNA; INV; 5983 BP. XX AC . XX DT 12-AUG-2008 (Rel. 13.08, Created) DT 12-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE EnSpm-type family - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5983 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 8(8), 787-787 (2008). XX DR [1] (Consensus) XX CC The consensus is construct from several copies which are ~94% CC identical to the consensus. The TIR is 12-bp in length. the TSD CC is 2-bp. The transposase is quite different to most Enspm CC transposase found in plants. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1065..2888 FT /product="EnSpm-2_HM_1p" FT /translation="MSDSDNMKWESESEIFTDTDESDNYGNEGLLVNDLAQ FT WAVEENISHKSLKSLLHILQPHHSFLPLDPRTLLSTPRYVSYEKLNSGGIY FT HHFGIASCLQKLFKNEQSFVNLSGCSQLSMQCNIDGLPIFKSIGLQLWPIL FT GLLKQPSYTSKPFLIGLYAGYNKPSNLIEFLNSFVVETTQLEKNGIILKDK FT LYQFRIHSFVCDAPARSFLKNTKLYSGYYSCDRCIQSGKYIGRMTFPEVNS FT PLRTNEAFNDMLYEDHHKNNIKSPLTQLSIGMVTDFVLDYMHLVCLGVMRK FT LLQFWVKGPLKTRLSTQQVMKISEHQKLLRSSIPNEFMRKPRSLTELDRWK FT ATEFRLFLLYTGPLVLKYNLHPILFKHFLLLHVAISCLVSSKLCQNMCNFA FT QKLLTIFVQHSQKLYGPEFITYNVHCLVHLPDDVSRFGALDEISAFPFENY FT LQSVKKLIRKPSLPLEQVIKRVSEKMLISVSDKDTKNDYPMCKYEHNNGPI FT CPELFNXIQYKSIKLQSYMFSLTKNNNCVFLNTSKVGILQNIIKKDLDVYI FT VVKLFCNTDSFYEYPLDSKLLNTFKVSQVEDKVSIYSINDVVSKCVCLPIN FT ESEHLIISFL" XX SQ Sequence 5983 BP; 2155 A; 825 C; 894 G; 2104 T; 5 other; cccagataac accaaatgtt ggcccgacgt cggaatatag tatatgggcc accgaactca 60 ataggacgtc ctgcctacgt ctatatcttt atgttgggcc aaccatttaa aaaaacgtct 120 tttttacttc ggtacgataa cgtcgggccg accgtattat ttttacctcg tgcctacatc 180 tataggatgc gtaatgcgga tgtcagtgca aagcattaat gcttttatat attaaaaagc 240 attaatgctt tgtacttact gtttgtactt acatccattt gtagctaaaa aagataatct 300 gaaagaaaaa gaacctgtaa acgcaaatat tcttcaaaaa aatgatgact agcaagtaag 360 ttacttaaac attacattaa aatatattac aaaatttaat tattgattaa atcaataaag 420 tttaaaagtt ttgttgctaa atatttataa agtgtttatt ataaaaatga atataattat 480 attggttaca atttgtaact agtataataa taatattaac atttgtaata caaatgtcat 540 ttgttgtgaa aaccgatgta attatattga tttttgtaac agatgacaga ttaatataga 600 ccacatagat tattaataat atttcaatat aattaaagtt ttttgacaat attattgaaa 660 taaacttatc agtactgtca tatttatcaa ataaaatagt tgcataaaaa attcactaat 720 ttattaaagt taaattgtta aattttatta gatccaagtc atattcttcc ataaggcgtc 780 gtattcgtgc cagagttcaa gcacacttgg aagacattca aagttcaagt gccaaaaata 840 aagataatgg cctcatacat aattatttat cagttgaaca atgtaacagt aaactcactt 900 ttacatttca tgcagatgta aataagaagg tctattttga aaaagaaagt gatcttgata 960 aacctagtaa aattgcagat gtgtttgatg ataatcaagt tttagataat aatgaaatat 1020 ataataatca cccatctgaa tcggatgaca atgtatatct ccaaatgagt gacagtgata 1080 acatgaaatg ggaatctgaa tctgagattt ttacagatac tgatgaaagt gacaattatg 1140 gtaatgaagg actgcttgtt aatgatcttg ctcaatgggc tgttgaagaa aatatttctc 1200 acaaatcatt aaaatctctc ttacatatac ttcaaccaca tcattctttt ttgcctcttg 1260 accccaggac tttattgtca acaccaaggt atgtgagtta tgaaaaatta aatagtggtg 1320 gtatatatca tcattttgga attgcttcat gcttgcaaaa attgtttaaa aatgaacaaa 1380 gttttgtaaa tctatcaggc tgttctcaat tgtcaatgca atgcaatatt gatggtttgc 1440 ctatatttaa aagcattggt ttacagctgt ggccaattct tggtttatta aaacaaccaa 1500 gttacacaag taaacctttt ctaattggat tgtatgctgg atataacaaa ccttctaatc 1560 ttattgaatt tttaaactca tttgtagttg aaacaactca attagaaaaa aatggaataa 1620 tactaaaaga caaactttat caatttcgca ttcatagttt tgtgtgcgat gctcctgcac 1680 ggtcattttt aaaaaataca aagctatata gtggatatta tagctgtgat cgttgcattc 1740 aatcgggaaa atatatagga agaatgacat ttccagaagt aaatagccct cttcgaacra 1800 atgaagcatt taatgatatg ttgtatgaag atcatcataa aaacaatatt aaatcacctc 1860 taacacagtt gtctattggc atggtaactg actttgtgtt agattacatg catcttgtct 1920 gtttaggtgt catgagaaag ttattacagt tttgggtcaa gggtccttta aaaactagat 1980 tatctacaca acaagtcatg aaaataagtg aacaccagaa gttgttacgt agttcaatac 2040 caaatgaatt tatgcgtaaa ccgcgttccc tcactgaatt agatcgttgg aaggcaacag 2100 aatttcgttt gtttttgttg tatactggtc cattagttct taaatacaat ttgcatccta 2160 ttttgttcaa acattttttg cttcttcatg tagccatttc atgtcttgtt agcagtaagc 2220 tttgtcaaaa tatgtgcaac tttgctcaaa aattgttgac aatttttgtc cagcattcac 2280 agaagttata tggaccagaa tttattactt acaatgttca ctgcctggtt catttaccag 2340 atgatgttag taggtttggt gcattagatg aaataagcgc ctttcctttt gagaattact 2400 tacaaagtgt taaaaaactt atacgaaaac catcattacc acttgaacaa gttataaaac 2460 gagtttcaga aaaaatgctt atctctgttt ctgataaaga tacaaaaaat gattatccta 2520 tgtgtaaata tgagcataac aatggtccaa tatgtcctga gttgtttaat rtcattcaat 2580 ataagagtat taaacttcaa tcatatatgt tttcattaac aaaaaataat aactgtgtat 2640 ttttgaatac ttcmaaagtt ggaattcttc aaaacatcat taagaaagat ttagatgtat 2700 atatagttgt aaaattattt tgcaacacag actcatttta tgaatatcct ttagactcaa 2760 aattgttaaa tacatttaaa gtgtctcaag ttgaagataa agtttctata tatagcataa 2820 atgatgtagt ttcaaaatgt gtgtgtttac ctataaatga aagtgaacac ctcattatat 2880 cttttttgtg aaaaaaatat ttaactgaca tgttgttttt aattatataa ctgaatcggt 2940 taaaaagttt ttttgtttat taaagccttt aatctataga gtcttgctaa tatatttaga 3000 ttatattgta ttgttcactt tctggacact gatgaagtgg aaattgctcc taaaacctgg 3060 attaaatgta ctgatgttga tgaacatctt gaatgttatt ggccactatt taaatctcag 3120 caaaaaattt taaaagctgt tcagacatgc gttgatgttg aaactgataa atggaagagt 3180 tacaaagcaa gaattttgaa aaattatggt attgttttgc atcttttaat gctgcaattg 3240 ttgaaaaaag tttataacar ttctaatatt tataaaaatg ttttacataa gtttgatagc 3300 ttatgagcaa ttatacaaat aaatcatgtg ctgtcatact tacgaaataa atacttattg 3360 tttgtagtat ctcataactg acatatcagt taaaatataa tatagaactt catttaaacg 3420 ggtttagatt catatgaaag tgcagtaaaa gatattacaa aagcaaaaga aaaatctgat 3480 ttggggactg atgtttcaga cgtagacgat ggaacaatgc aatgtaaaca aaaacgagca 3540 aggtatgtaa aactaaatta agtttaatga tttatcacaa ttatttgtgg ttcaacaaaa 3600 attgtttcag ttctgaaatt ttgctgaaat atcttttaaa ttttgtgtat tgataaaagc 3660 ttatttttag gaattcatac ttgcctttat cagacactga tatatgcagt agtccagaaa 3720 taaatttttg tttgcaaaaa gggaactcgc aactagctaa accgaaaaaa aatctgagcg 3780 aaagtcttgc acaaaaaagc cataaaaagt taagtacaga gtacgaagat cttcctgagc 3840 cacctgtttt ttcaaattct cacttaaatt gtaacactcc agaacatgtg acaagcagat 3900 caagtggcct gtctagcatt ctgtcatcat tttctaaacc tcatggaaag ccacttttaa 3960 aagaatttcc attatgctct tcagatggat ggctgtattc agttgcaaca cgtatatatt 4020 aaattattaa ctgctaattt ttattacatc agaataatag attacatata tatcacatta 4080 tagattatat atatagatta cagaatgtca atggttattt tgttttaatt tttaatttat 4140 gttttacatc attttaaaaa ttatattgaa tttttatttt aaataatatt tataattata 4200 gtatttcttt gccaatatta aattatttag ctcactagtt ctaatatttt ttagctgtag 4260 aacagaaact aattcttctt cttgaagaaa ttaaagcgca aggagacgag cttaaggaac 4320 aaggagaaaa aactataaac ctgttacagc accttgcaaa aacaaatgac ggagagagtt 4380 tagtaagagg attacctttc caattaccag caacaactgt tgaagagttt aaccttatta 4440 acgatcaagc taaagaaagc gacattaaag gaagactggt aaagctctat ttaaacttaa 4500 tagttaatac ataaattatt tatcaaataa ataaattata tattcatgaa attattaatt 4560 attttaataa tactaattaa gttaacaata aaacattaaa tcaaatcata aaagatatgt 4620 atatgcaaac ctaattcaat attttttaat ataaaattgt attttttgta taacataaat 4680 tacttttaaa gtaatattca atcattaatc aaaaatttta gcttagtgaa acttgttttt 4740 tcaagtatta tgataaactt taccctatat cttttacatc cttattacct tatactactt 4800 tatatagaca agatttcttg gattggttgg cgggctcaat gtaaagtcaa ctaccagagc 4860 agtgctaagt cgtttactga tcacagatgt agcaaagtya tgtaattgga agggaaaagg 4920 aaataagatt gcatttagca aaatgtacct tgttggaatt gtttgtggta aaattactta 4980 ttttatattt gtgatatata aaaatatatt tgctggtaaa agtcaccttc actaggttat 5040 tgatgaacct ggtgaaacag actatagatg aaatatatat atatatatat atatatatat 5100 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 5160 atatatatac acacacacac acgtattcat atatactttg tattttaaga tgcagtaaga 5220 agaaatccag caacaagtca agcaacagat aacgagattg gagttgtttg tcgggactgg 5280 tttaaacatg caagtgatcg agatggagga agacaacgtc gtgccaatcg ttcatgtcta 5340 cattaactat tatataaagc ctttataata tatataaata aatctttttt attttttagt 5400 tgttgttttt ttagttatac ttttttgact aatagataaa atcattattt ataagtcatt 5460 taaacaattt atttattaat aaatttattg tttattaata aagtaatttt aaacagttgt 5520 caactgtaaa tgtcaactgt tgatatatca atcagttgat aagcaagcaa tcattcatga 5580 ctaattaaca tttaataatt aacaaaatgt tttcattagt catgttttta gtaaattttt 5640 gagtggtctc tttctgtgtc cgaaaaatct tgcttttcca tcagatattt gtctaattgg 5700 tacaacaata ttaatattgt gctaacgtag taccgataat gaaatcgttg tgccagcgac 5760 gttttcacac aaagttggat aaaactgctc ttttccaact tcacaatgat gcccaacatc 5820 aacgtcggca atcaagtaat ttctgtacgt tgggccggtt agatagggcc gacgtggaac 5880 cggtgatcaa atcgcaggca atccgacgtc ttttgccgac gttgggccaa ccataacaaa 5940 gacgtctaaa tgacgtcggc tggacgtcaa tgtgttatct ggg 5983 // ID Gypsy-184_AA-LTR repbase; DNA; INV; 234 BP. XX AC supercont1.139; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-184_AA_; KW Gypsy-184_AA-I; Gypsy-184_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.139; Positions 358850 358617. XX SQ Sequence 234 BP; 74 A; 33 C; 55 G; 72 T; 0 other; tgtaacgtca gggtttgttg aaggaagtga atacagccga cgattgcgcc ggtctatagt 60 ttatgcatca tttgtaatgc tggtacataa gtaaacaaag agggcattga tctatgtaac 120 ttgttattgt tttgatagct ataaataggc tcagcaataa agacagtgga agtcagttga 180 atcccagaag ctaataactt tagtgttgtt tagtggtgaa atccgaacat atca 234 // ID Sola2-4_HM repbase; DNA; INV; 4854 BP. XX AC . XX DT 11-FEB-2009 (Rel. 14.02, Created) DT 11-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola2 DNA transposons from Hydra magnipapillata, consensus. XX KW Sola; DNA transposon; Transposable Element; Sola2; Sola2-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4854 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1587..3503 FT /product="Sola2-4_HM_1p" FT /translation="MHTVLEAETIGLNLIPGKKLCPTCRSKVMTCLSKSIS FT ENKNDEDFDIVNERSIQNKKESVDTHFASAGVSPIKVKGLHKSGKIREGKR FT KLTALTTSIKKKVAISLNISEESFEEQSSFNNEYMEKANLFDNMMVLLTNK FT IAESTSTSKKVQLATLAPTDWSIKKVSETLHVTNYVARTARKLTLEKGILT FT MPNPKLGKALPDVTVQLVKTFYEDDEHSRIMPGKKDYVSIKKNVHMQKRLL FT LCNLKELFVAFKTKNPEIKIGFSRFCTLRPKWCITVGASGTHSVCVCAIHQ FT NLKLLLHPLGVTYKELLPYIVCDITNKDCMMRKCEKCPATMNALVEKLYDI FT LGEFEDDDEIEFDQWTTTDRSNLTHNKEPVFEYIELVCRKLEKLAPHSYLA FT KSQALYLRSRKEDMNEVTALFLGDFSENYKFVVQDEVQSFHWTNLQCTLHP FT VIIYFKKEGILKHKSYCIISDDMIHDVNMVYKIIEVVCNDIKTNLPQIKNI FT EYFSDGCAGQYKNCKNMLNLCFHCEDFGFTAKWNFFATSHGKQPCDGIGGT FT VKRLATLASLQRDMGNHILSPQTMYKYCNENIEGINFIYISSEDLTLVRTA FT LKERLDTANTIPGTRSYHQFVPLGHQKVNYYFLIIPFTIF*" XX SQ Sequence 4854 BP; 1793 A; 629 C; 757 G; 1675 T; 0 other; gggtaaaaat ttctcaaaaa acaaggggtt gcaccctgca tcaccaaatt cgctgaaatt 60 tggcacaaat gtttaaagat ataagttagg aaagactgta aaattttttc aaaaaattct 120 caacccttta agatttacag caacttgaag ttagttattt tttgactgaa aaaagagggg 180 ggataaagcc ggaaatatgt caatttacca ttaccctttt aacaaacttg tagtaatgtt 240 gcgaagctaa aacttggtac caaaggagtt taaaacctgc tgaatctgaa aatcataaac 300 ttatatctct caaaacaata aaatatagtt tatttttaga tccacaaaac gcacttgtta 360 atgacaataa aaaaggccat tgacaactat agttctagaa aaaatctgac tgatgaaaaa 420 aagttttggt tttcgttatt ttaaatatta aacatatatc tactttgcga accaattttt 480 atttggcctt atgttttatc aaggagttat tgatgtttta gtttttcaat ttttctaaag 540 atattttgca aaaaaaaaaa aaattgcaaa tatattttga atacaaatac aattattttt 600 gttttgtaat agctttgtaa aaaatttaca aagttattac aaaacatagt tgaacatttt 660 ataaatttgt ttctttttac tttcattaca taataataat gtcattttca ttacataata 720 tcatgtcatt aaatttaaca atattatgta acttttgttg ttgattgttt tgttgtaata 780 aaacatttat tatcatcaat aaaaaaagta ataaaatgat ataataatgt ctaattaaca 840 ttaccatgtt aagttacata actaaataaa attatattta gttatctttt acttttacac 900 aactttttgt aagtagttaa tggagaattt acaataatta gtagatagtt aggcatttaa 960 aatatgtcta ttaaactaat agctacttgt tcaagcaata tgttctttat aaatacatat 1020 tgatatgata ttgtttgtgt tcaagtgtac ttttcttgtt agggacatta agattattta 1080 agatttcttg tattaagtgt ttagacagag attcctcatt atatttttat atggataaca 1140 aaattaattg ctgctttgta aataattatg gagcttgtta taaacaatca tatactcaag 1200 aaagaaatct gattgaagat attgacagca atgatcgtaa gctacttgaa aaacgaacaa 1260 atctgaaaca agaacaaatt actaatatat gcacacacca tcatgatatg ttagttgtaa 1320 gatatgaggg taatcagaaa agttgctgta atccatttct gagtcaccaa aaggtgtgta 1380 gaggtaaaaa tttatatatt tatatttcat taggtaatat cattattaat aagtcatata 1440 agtttttttg gtaataagat ttgttcaatg tttcatagtt attaatataa caaattgtat 1500 agtaatataa ttacataata tatataatta atgttattta agtaatgtaa ttactttctt 1560 tttaagcatc tctacgtgtg attacaatgc atactgtgtt agaagcagaa accataggtt 1620 taaatcttat acctggaaaa aaactgtgtc caacctgtcg aagtaaagtt atgacatgtt 1680 tatcaaaaag tataagtgaa aataaaaatg atgaagattt tgatattgtt aatgaaagat 1740 ccattcaaaa taaaaaagaa agtgtagata cacactttgc atctgccgga gtatctccaa 1800 ttaaggtcaa aggattgcac aagtcaggta aaatcagaga aggtaaacga aaattgacag 1860 cattaacaac ctccattaaa aaaaaggtag ccatttccct taatatatca gaagaaagtt 1920 ttgaagaaca gtcaagtttt aataatgagt atatggaaaa agcaaacttg tttgataaca 1980 tgatggtatt gttaactaac aaaattgcag agtctacatc aacatcaaaa aaagtacaac 2040 ttgcgacact tgctccaaca gactggtcaa taaaaaaggt atctgaaaca ttgcatgtaa 2100 caaactatgt agctcgaact gcacgtaagt taacattaga aaaaggtatt ttaaccatgc 2160 ctaatcctaa attaggaaaa gctcttcctg atgtaacagt tcaactggtc aagacctttt 2220 acgaagacga tgaacatagc cgtataatgc ctggtaaaaa ggactatgta agcataaaga 2280 aaaatgtcca catgcaaaaa cgtttgctct tatgtaattt aaaggaattg tttgttgcct 2340 ttaaaaccaa gaacccagaa ataaaaatag gattttcaag gttttgcacc ttgcgaccta 2400 aatggtgtat tactgttggt gcctcaggaa cacattctgt atgcgtctgt gctattcatc 2460 agaatttgaa actgctttta catcctcttg gtgtgacata caaagaattg ttaccttata 2520 ttgtctgtga tataactaat aaagactgta tgatgaggaa atgtgaaaaa tgcccagcaa 2580 ctatgaatgc attggttgaa aaactttatg atatacttgg agaatttgaa gatgatgatg 2640 agattgaatt tgatcaatgg acaacaacag ataggtcaaa cttgacacat aataaagagc 2700 cagtgtttga atacattgaa ctagtatgta gaaaactgga aaaacttgca ccacattcgt 2760 atttagcaaa gagtcaagca ttatacctga gatcaagaaa ggaagatatg aatgaggtaa 2820 cagcattatt tctgggtgat ttttcagaaa actataaatt tgtagttcag gatgaagtgc 2880 aaagctttca ttggacaaat ttacaatgta cacttcatcc agtgattatt tatttcaaaa 2940 aagaaggcat tctcaagcac aaatcctatt gtattatttc ggatgatatg atccatgatg 3000 tgaacatggt ttataaaatt attgaggttg tttgcaatga tataaaaaca aatttgcctc 3060 aaataaaaaa tattgaatac ttttcagatg ggtgtgcagg tcaatataag aattgcaaaa 3120 atatgttaaa tctttgtttc cattgtgaag attttggttt tactgctaaa tggaactttt 3180 ttgcaactag tcacggtaag caaccttgtg atgggatagg tggtacagtc aaacgtctgg 3240 caacactagc aagtttgcaa agagatatgg gtaatcatat tttatctcca caaacaatgt 3300 ataaatattg caatgaaaac attgaaggca taaatttcat atatatttca tcagaagatt 3360 taactttggt gagaaccgct cttaaagaac gattggatac tgctaataca atacctggaa 3420 cacgtagtta ccatcagttt gtaccacttg gccatcaaaa ggtaaattat tattttttaa 3480 ttataccttt taccatattt taaaatacac gcataacttt tattgataaa ctcaaataga 3540 tattagattt ttgcagaatc attatcaaaa atatttgtat tttattttat ttcctctgca 3600 tttgctatct atgcatttaa taaaacatgt ttgataagta aaacggaata attgcatatg 3660 ttactatttt gtttaggtag gcactaagct atgcagtgtt gatgaagaat ttgctttaat 3720 tcatgacttt ggaaatatcc agataacaac tgtaaaccta gtaccaggtg attatgcctg 3780 tgtttcctat gagcaaaagt ggtggatagg agtcattgaa gatattaatt tagaagaaaa 3840 agatgtgctt gtaaaattta tgtgtcctaa tggtcctgca cggtcgttta aatggccacc 3900 aaaggaaagc caatgctgga ttcctgatgt ccacattatt tgcaaagttg aggtaccgat 3960 aacaaaaaca ggactttgct actatttagc caaagaggac ttaaaaaaaa ttattttcct 4020 aaggaagtag agatttttat atttatgagt ctattatgtc ttatccattc tttttttaaa 4080 tattatgttt tgtgaaccaa cttgtaaata atgtgaatgt aaagcatata taaaatcact 4140 aaagtttagt atttttttaa aaaaaaaaaa ttatatacgt ttaaattttg ttgtaaatat 4200 catatgaaaa tgaaaaagat aataaagtca ctaaattttt agtttctaat ttgattgatt 4260 ggtaaaaggt ggggttatat taagtgtcta atttaaaaaa tattaaaaaa ttttgtgtct 4320 atatcaaaca tcaataattc ctcaagtaaa tataaggcca aaacaaaatt ggttcgaaaa 4380 gtaggtgttg atcttatttt taagaaaatc gaaattaaat tttttttata tcagtcagat 4440 tttttttata gctataattg tcaatgccct tttttctcgg cgttataaag tgtattttat 4500 ggatcttaaa ataagctaca ttcccttgtt ttgacatata aaaaactata tttttcggat 4560 ttaacaggtt ttaaactctt ttggtacaaa gtttcagaat tgtggcatca ctacaagcac 4620 catattaagg tgaaagtaaa cggaatgatt ttcggcactt ttcccccctc agtttttagt 4680 caaaatcaag gaaacttcaa attgctgtaa attataaacg gtaaggaatt ttatgaaaaa 4740 aaattacaat ctttcctaac atatgtctat aaacatttgt gccaagtttc aaaaattttg 4800 gtgatatagg gtgcagattt gtgttttttt gtgttttttg agaaattttt accc 4854 // ID DNA-TA-9_CQ repbase; DNA; INV; 419 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-TA-9_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-419 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 59-59 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >99% CC identity. TA TSDs. XX SQ Sequence 419 BP; 104 A; 97 C; 87 G; 131 T; 0 other; cacccaaaat tcagagtcga gataatcgtt ctctcaaaat ttattgaggg ctcagtgcca 60 aaatcgatag aataatggta ccaactcaca ccatcataac aaatgagttc attcgttagt 120 tgaatcaata aatctccaac aagtgtcata catcgacttc tgacttctcc ttggtcacca 180 agctgtggtg gccgaggcag ctaagtcatt ggattggttt gtcaaaggtc tctggttcga 240 ttcccgttgt cgacactttt ggttttttgt ttgacggatg aacttttttt gcaaatgaac 300 ctcgagagaa tagttctatc gcccatctcg agctgtgttc tctgcgtccg tgaatggtac 360 cactattctc tcgactcgag accatcattc tctcggacta gcgctgccag ttttgggtg 419 // ID Kiri-4_CQ repbase; DNA; INV; 4383 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4383 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 123-123 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 299..1345 FT /product="Kiri-4_CQ_1p" FT /note="PHD zinc finger at the N-terminus." FT /translation="MAKNRSSDEICDVCNRGAPPGSKGKRKPKTITWIGCD FT ACPRWFHPLCVRISDSQMEEIDDYQYFCESCAVRGCLIPKPQPVMTASGGV FT SELKKVIQELSAELTKLRAELDAARETNKKQLDRLRNVVNSNTRSEVTTTR FT LASDLSEKLEKIERGAQLANTCSQTINSCRLAINKIPFKVGENVRQLVADV FT LTLVGCSDEQASITDCFRVPAKPSKWSDRSLTPTIVAVFSSLEARQKVLRK FT YFERHKDAALRNLKHGPALDYRFTVNEVLSINTFRIRNHALRLKQRGAARS FT VFVRNDKVSVLLPGQVRYTPVNSVEHLQELVSSGSSSDSSSLFFDALSANV FT SASSRC" FT CDS 1349..4225 FT /product="Kiri-4_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="HETSLAFLLTKMTNIANLRIGHLNVRGLEHHIDGVKL FT LLDAQQYHFFGVTETKMRASAPVGPVRVPAYNLIRHSLPSGRGRGSKACGG FT VGLYVRQGLKATPVIKSTHDIAAPIATRVEYMVVQIRINELNVGVAVVYNP FT SCSNPSFAVLYERVLLEMLDLGFDRTFIMGDFNINVASVLPSLNLTSLRRI FT HSTFNLTVLPTGPTRITDTSSTTIDLLITDCPQSVRKAKAVSASAISDHEV FT IYMLADVQVRNQNRRTTRIRNFRNVDVLQLQAEYRTKDFQRFYESNDVEEK FT ATLLTADLQELLDRHAPERMIYVRDERTPWITHQIKQTIELRDLALKLYKR FT NPNRRRGDQQWLEHSRLHDRANSLIHAAKKRYADQHFNHELPAKKLWSNLR FT REGIHNSAKADPPAEGVDADDLNRFFSDGHRELGTNRGSTRSRTPAHRTVV FT DHGENGFNFRQTNSEEVNRKIHDIQTNATGTDGIPISFVKMLCPLILPELC FT HLFNAIITARAFPSLWKTAIVTPIPKQPSPSLPKHFRPISVLPAMSKVLEK FT ILLDQITDHLDNPNAPLLAKHQSGYRKGFGTTTALAKVTHDIYSNLDNNSC FT TVMVLVDFSLAFNCVDHHLLKDKLNREFRFSSASCELVASFLGQRKQSVRL FT GDAVSAVRDVTDGTPQGSCLSALLFSLFINSLPGVLRCEYQLYADDLQIYV FT SGPIEQIEDLIASINNDLAAITNWANQNRLHPNPKKTQAIVFTKAGSVRPH FT TDIVFSGEVVPLSDRVVNLGLLLDNNMTWKHQVNSVTQKVYNTLRTFRRFA FT AVLSQPTRRKLVQAVIMPMFTYCDVVYYHGLPAAHKEQLHRCFKSAVRFVY FT NLRRRETTAAVRSTILGHDLPINYQLRTSCFMKKGYDGNLPEYLQQHLVRG FT QLERTRSFIIPRHTTTSGRSVLVAGTTCWNNLPLELKTQPTLSAFKSAFKR FT SFDD" XX SQ Sequence 4383 BP; 1162 A; 1206 C; 1054 G; 961 T; 0 other; aaaatattga aaagacggca acactggcag ccggctgcaa ccaaaacaaa gccgctgaaa 60 aagaaattac aaattctcgc tgttttctac gatttacgtg cggaaaaaag tgtccagtgc 120 cgaaacaaaa ccgtttgcta cacaattaca gtgtttgcaa gtaccggcta gtgtttgaca 180 aagttttttg aacatttgtt ttccctcctg caaagtgagg tgatgtgcta agtgtgtgtg 240 tgtgttttga aacgccagcg ccaatagtgt ccggtagcag tactatcaca atttcgacat 300 ggcaaagaac cgctcgagtg acgagatctg cgatgtctgc aaccgcggtg cgccacccgg 360 gtccaaagga aagcgaaagc caaaaacgat tacctggatc gggtgcgatg catgcccgcg 420 ctggttccac ccactatgtg tgagaatcag cgactcgcag atggaggaaa tcgacgacta 480 ccagtacttc tgcgaatcct gcgcggtacg tggttgtctc atcccgaaac ctcaacccgt 540 gatgaccgct tccggagggg tcagtgagct gaagaaggtc atccaggaac tgtcagcgga 600 gctgaccaaa ttgcgggccg agctcgatgc tgctcgcgaa accaacaaga aacagttaga 660 tcgtcttcgc aacgtcgtca acagcaacac ccgttcggag gtcaccacga ctcgcctagc 720 aagcgatctc agcgaaaagc tagagaagat tgagcgagga gctcaactcg cgaacacctg 780 ctcacaaacg atcaactcct gcaggctcgc gatcaacaag atcccgttta aggttggcga 840 aaacgtcagg cagctggttg cagatgtact gaccctagta ggatgtagcg atgaacaggc 900 gagcatcacg gactgctttc gtgttcctgc gaaaccatca aagtggtcgg accgttcact 960 cactccgacg atcgttgcgg tcttcagcag tctcgaggca aggcagaagg tgctacggaa 1020 gtacttcgag cggcacaaag acgctgccct acgaaacctg aaacatggac ccgccctgga 1080 ctaccgcttc accgtcaacg aagtgctctc gataaacacg ttccggatac ggaaccacgc 1140 gcttcggctg aaacaacgag gtgcggctcg gtccgttttt gtgaggaacg acaaggtttc 1200 tgtgcttctc ccagggcaag tcagatacac ccccgttaac tctgtcgagc atctacaaga 1260 actggtaagt tcggggagtt cctcggactc ttcgtctctg tttttcgacg ccctctctgc 1320 caatgtctca gcatcttcgc gctgctgaca cgaaacgtcg ctggcgtttc tactcaccaa 1380 aatgacgaac atcgctaacc tccgaatcgg ccacctgaat gtcaggggcc ttgagcacca 1440 catcgatggt gtaaaacttt tgctcgacgc tcaacaatac cacttcttcg gagtcactga 1500 aacaaaaatg agagcttccg ccccggttgg accggtgcga gttccagcgt acaaccttat 1560 ccggcactca ctgccgtctg gccgcggccg gggatcgaag gcctgcggcg gcgtgggact 1620 ttacgtgcga cagggtctca aagcaacgcc ggtcatcaaa tcgacccatg acatcgcagc 1680 tccgatcgca acgagggtgg agtacatggt agttcaaatt aggatcaatg aacttaacgt 1740 cggggtggct gtggtgtaca atccgtcgtg ctcgaatccg tcgttcgccg tgctctacga 1800 aagggtcctt ctcgagatgc tcgatcttgg ctttgacaga acatttatta tgggagattt 1860 caacatcaac gtagcgtctg tcttgccttc gctgaatctg acctcgttga ggcggattca 1920 ctcaacgttc aacctcaccg ttctgcctac tggtcccaca agaatcaccg acaccagctc 1980 aactaccatc gacctgctga tcaccgactg tccacagtcc gtccgaaaag caaaggcggt 2040 ctccgcaagc gctatttctg accatgaagt gatctacatg ctggctgacg ttcaggtgag 2100 gaatcaaaac cggcgtacga cccgaatccg gaatttccgg aatgtcgacg tgctacagct 2160 gcaagcagag tatcgaacca aggattttca acgcttctac gagtcgaatg atgtagaaga 2220 aaaagcaacg ctgttgactg ccgacctgca agaactcctc gatcgacatg cgccggaaag 2280 gatgatctac gtccgtgatg aaagaactcc ttggataacg catcaaatca agcagaccat 2340 cgagctgaga gacctcgcgc tgaagctgta caagcgtaac ccgaaccgga ggcgaggtga 2400 ccagcaatgg ttggagcact cgcgcttgca cgatcgtgcg aactcgctga ttcatgcggc 2460 caagaagcgc tacgctgatc agcacttcaa tcatgaactt cccgccaaaa agctctggag 2520 caacctcaga agagaaggca tacacaactc agccaaggca gacccaccag cagaaggagt 2580 cgacgctgat gatctgaacc gtttcttttc tgacggtcac cgggagctag gaacgaacag 2640 aggcagcacg cggagtagaa caccagccca cagaacagta gtagatcacg gagaaaacgg 2700 attcaacttt cgacaaacca actcagaaga agtcaaccgc aaaatccacg atatacagac 2760 caatgccaca gggaccgacg gcatacctat ctcgtttgtc aaaatgttgt gcccgctgat 2820 cttgccggaa ctctgtcacc tcttcaacgc cattatcaca gctcgagctt tcccgtcttt 2880 gtggaaaaca gccattgtaa caccgatccc caaacaacca agtcctagcc tgcccaagca 2940 tttccgcccg atcagtgtcc ttccagcgat gtcgaaggtg ttggaaaaaa ttctgcttga 3000 ccaaatcacc gatcatctcg ataacccgaa cgctccgcta ttggccaaac atcaatcggg 3060 atacaggaag ggattcggga caacgactgc tctcgccaag gtaacgcacg acatctacag 3120 caacttggat aacaacagct gcaccgtcat ggttcttgtc gatttttcac ttgcgtttaa 3180 ctgcgtcgac caccatctgc tcaaagacaa gctgaatcga gagttcaggt tctcgagcgc 3240 ttcctgcgag ttggtcgcat cgttccttgg ccaaagaaag caatccgtcc gccttggtga 3300 tgcagtctca gcagttcgtg atgtaactga cggtacaccg cagggatcct gtctcagcgc 3360 tttgctgttc agcctcttca tcaacagttt acccggagtc ctgcgatgcg agtatcaatt 3420 gtatgctgac gatcttcaaa tctacgtgtc cgggccgatt gaacagatcg aagacctgat 3480 tgccagcatc aacaacgact tggctgcgat caccaactgg gccaaccaaa atcgacttca 3540 cccgaacccc aaaaaaacgc aggccatcgt tttcaccaaa gcaggctcgg ttcgacccca 3600 cacggatatc gtcttcagtg gagaagttgt tccactgtcc gacagggttg tcaatcttgg 3660 tctcctgctg gacaataaca tgacatggaa gcatcaggtc aacagtgtga cacagaaggt 3720 atacaacact ctaaggacat ttcgccgctt tgcagcagtc ctctcccaac cgacacggcg 3780 caagctggtc caagctgtga tcatgcctat gttcacctac tgtgacgtag tctattatca 3840 cgggcttcct gcggcacaca aagagcaact ccaccggtgc ttcaaatcag cggtgcggtt 3900 tgtgtacaat ctgcgacgtc gcgaaacaac tgcggccgtg cgaagcacca tactcggaca 3960 cgatttgccg atcaactatc aactgcggac gagctgcttc atgaagaagg ggtacgatgg 4020 caacctgccg gagtacctcc aacaacatct ggtgagagga cagcttgagc ggacccgatc 4080 attcatcatc ccgcggcaca caacgacgag tggaagaagt gttcttgtcg ctggaaccac 4140 gtgctggaac aacctacccc tggagctgaa aacccagccg acgctgtcgg catttaaaag 4200 tgcttttaag agatcgtttg acgattagtt ttctttagct tacttgtacc tgttaccctt 4260 tatttggatc ctttgtaact tttctaaatt tcctaaagtg tcttccttga cctgattaac 4320 agtgtaaact aacaagttgt cgtacccaat aaaccaacaa attacaaatt acaaattaca 4380 aat 4383 // ID BEL-1-I_NVi repbase; DNA; INV; 5568 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5568 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 742-742 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(389..3166,3028..5346) FT /product="BEL-1-I_NVi_1p" FT /translation="MAPPLRKLVASQARRLTEIRGLRSRLLAKPPASLNKN FT ILQARVDALKEVWDEARRTHSEIVARDDAEADAYVTEDNFGDLQGAYEEAL FT DEFLTLLSQFEVADQSTLPGGLLNSTASDAGFTKLPKINLPSFSGNYEDWA FT SFRDNFRNMVHDLPRISDATRLQYLKMCLTGSAADLVKEIPTTNANYASTW FT KALELRYHNPRLIITRYLTAFMALPHLKKESGDELRFFIDEATRIVRALEN FT LKMPIDQWDVWFVFLLSERLDPESRSRWESLLSEKERKKIESGAGVGQEVT FT ESSYTPATFHEFIEFLETRAQTLGVLAPDRRGEKRSAAPLKSIHPRKVFHA FT NSSQCPLCAGPHALMRCSQYKEKKPQDRQKEVKRLRLCFNCLGHHRANSCS FT SSGRCSECKQKHHSSLHDPSRSSQGSSSTAPKKDQGNSGSGSAVVLHAANS FT LVSSRKILLATARVLVVGPKNKGTYVRALLDQGSEASFVSEAIVQLLELPK FT RRTHVPLSGLGASAAGTARSIVPLTLRSAVDPGFQVETEMLVLXKLTSLLP FT SYEVSEKLAREQFSGVTLADPEFAISKKIDMILGADLYGQLLRSGVKKSST FT SQLVAQNTVLGWIVSGPMDSGMTRRAAGAATQAPVQAFHCAPQDDLENLLQ FT RFWTLEEFLSPSFKMKPEDERCERIFQDTHRRDAQGRYVVKLPLKPDLPSV FT GAETRRMALGSLSNLHRRFSRDSRIANSYREFMEMYEKLGHMTRVPPSEVH FT KADAWYLPHHPVVQMLLLIWKLRVVFDASRRTKDGHSLNAFLLTGPPLQSD FT LSLILLNWRRYRFVFTADIIKMFRQIQVEPEHQDYQRIVWSPDAASEPVDF FT RLTTVTYGTACAPYLAIRTLSQLVKDEGHRFPLGARCLESSWTSGLRTIRS FT CFQIQHSRYRRSRSVKMSRSRLRSSVSSRGSLSRIELDKWAANHQELLPDS FT AQQVSEKQIGEDESVKALGVHWVPSQDNFKFNAVNLENVAAAYTKRSVLSN FT IARLFDPLGWLAPVTVMAKILMQDMWILKCDWDSPLPAEIRERWYDYCKGL FT SALPSLSIERWLGGTANCSYQIHGFSDASSRAYAAVVYLRIDEGNGRFRVS FT LLAAKSKVAPVKTVSIPNLELCGAALLVKLILHVTKLEFLRGLPIFAWSDS FT QIVLTWLRKHPCHWKTFVANRVSLIQTELPSATWAHVPTKENPADLATRGV FT QPGELANCALWWQGPAWLSLSPTEWPRPADPVRVNHARPRSEESEILTRFS FT SLTRLVRVVAFCRRALDRIRIRGTPAESASSPFLTTAELSAARVAVIRLAQ FT AGAFAEEIKLLRSGKRLPKGNRLSXLNPFLGEEDGLLRVGGRLAHSGLSFD FT RKHPPILPQNSALSLLFVRHAHHQCLHGGPTLTSSILMPQVWILGRNRIVK FT STIRACVPCQRVKPHSAQQLMGELPADRVTSSRAFSHSGLDYAGPFQIRMS FT KGRGNRSFKGYIALFVCFATRAIHLELVGDLTSASFICAYRRFVGRRGICQ FT NLYSDNGTNFQGADKELRSMFQRASDFYKKVGAVLANDGTSWTFIPPSAPH FT YGGLWEAGVKSVKHHLKRVVGEHTLTFEEFSTVLVEIEACLNSRPLGALSA FT DVNDLRALTPSHFLNEGVSVLLPEADCPKLPENRLSRFQLLQRIRNNFWKR FT WSTEYLLHLQEREKWRDPSENFAVGQLVW*" XX SQ Sequence 5568 BP; 1168 A; 1301 C; 1553 G; 1542 T; 4 other; tggtgccccg ggtgaggagt gagtgattgt ttgggttttt aaaatcaata atttcaaagc 60 gaacttgtaa gcgagcgagt cgctgccggc gtcggtcggc agttcgcgga agcggccatt 120 gtcggcgagc gggctacggc ctcgtgcgtt attttcgcgt ttcagcttcg cgccgtagac 180 ttgcggcgta aacacagcga ttctgggaat cgagtcgtga gtagcacact catacagcgc 240 attctcaatt attatttcga atttcactct cattgttgca tttgataatt cctctcgagt 300 tcgtttgtcg attatatttc ggtgttttct tttcttgttc gaaatttgat tttgtcctct 360 cgtttgcgta tttcaaggcc agtctaccat ggctccaccg ctgaggaagt tggtagcttc 420 gcaggcaaga cgcttgacgg aaatcagggg cttgcgtagt cgtttgctcg ctaagccccc 480 tgcgtcgctc aacaaaaaca tcctccaggc tcgagtcgac gctttgaagg aggtctggga 540 tgaggctcgg aggactcact cggagatcgt cgccagagat gacgctgagg cggatgcgta 600 cgtcaccgaa gataacttcg gggatctcca gggggcttat gaggaggctc tcgatgaatt 660 tttgacgttg ctgtcgcagt ttgaggtcgc tgatcagtca acacttcctg gcggattgct 720 gaattctact gcgtcagatg ctgggttcac gaaacttccc aaaattaacc ttccttcgtt 780 ttcaggcaac tacgaggatt gggcgagttt tagggataat tttaggaaca tggtgcatga 840 tttgcctcgg atttctgatg ctactcggtt acagtacttg aaaatgtgct tgaccggtag 900 cgcggctgat ttagtgaagg aaattccaac tactaacgct aactatgcaa gcacatggaa 960 ggcgttggag cttcgttacc ataatccgag gctcatcatc accaggtact tgacggcttt 1020 tatggctctt cctcatctta agaaggaatc aggtgatgag ttacgctttt tcattgatga 1080 ggctacgcgt atcgttcgtg ctctggagaa tttgaagatg ccgatagacc agtgggatgt 1140 atggttcgtt tttctcttgt ctgagcgctt agatccggag tctcgtagtc gatgggagtc 1200 cttgcttagc gagaaggagc gaaagaagat cgagtcagga gcaggcgtag ggcaggaggt 1260 cactgagtcg tcgtacactc cggcgacatt ccacgagttt atcgaattcc ttgaaactcg 1320 agctcaaact cttggcgtgc tggctcctga tcgacgtggc gagaaacgat cggctgctcc 1380 gcttaaatcg attcatcctc gtaaggtgtt ccacgcgaac tcttcgcagt gtcctctctg 1440 tgctggtcca cacgcgctga tgcgatgctc tcagtacaag gagaagaagc cccaggatcg 1500 ccagaaggag gttaagcggc tccggctctg tttcaactgt ctgggacatc atcgagcgaa 1560 ttcttgttcc tcttcgggtc gttgttcgga atgtaaacaa aagcatcatt cctctcttca 1620 tgatccgagt aggagcagtc aaggatcatc gtctactgcg ccgaagaagg accaagggaa 1680 ctccggtagc ggctcagcgg tggttttgca tgctgctaat tccttggtgt cttcgaggaa 1740 gattcttctt gctactgcgc gtgtgttagt ggtaggccct aagaacaagg gcacgtatgt 1800 acgggcgctt ttagatcaag gctcagaagc gtcgtttgtg tccgaagcca ttgtgcagtt 1860 acttgaactg cctaagcgac gtacacacgt gccactgtct ggcctgggag ctagtgcagc 1920 tggcactgct cggtcgatcg ttccactcac tcttcgttca gcggtagatc caggattcca 1980 ggttgaaact gagatgctag ttctcyctaa gcttacgtcg cttcttcctt cttatgaagt 2040 cagtgagaag ttggcgagag agcagttttc aggcgtcact ttagctgatc cggagtttgc 2100 gatctcaaag aagatcgaca tgattttggg tgctgatctt tatggtcagt tgcttcgctc 2160 tggggttaaa aaatcctcaa cttcgcagct tgtcgctcag aacaccgtcc ttggttggat 2220 cgtttccggg ccgatggayt cagggatgac acggcgggcg gcaggtgctg ctacccaagc 2280 tccagttcag gcgtttcact gcgctcctca ggatgacttg gagaatttgt tgcagcggtt 2340 ttggactctc gaggagtttt tgtcgccatc ttttaagatg aagccggagg atgagagatg 2400 cgagagaatc ttccaggaca ctcatcgtag ggacgcacag gggcggtacg tggtgaagct 2460 gccgttgaag cctgatcttc cgtcagtggg agcagaaact cgacgtatgg ctcttggttc 2520 tctctcgaat ttgcatcgtc gtttttcacg tgattcgagg attgccaatt cgtatcgcga 2580 attcatggag atgtacgaga agttagggca catgacgcgc gttccaccgt cggaagtcca 2640 caaggctgac gcctggtatc tgcctcacca cccagtcgtg caaatgcttc tgttgatatg 2700 gaagttgaga gttgtgttcg atgcttctcg gcgtacgaag gacgggcact cgctcaatgc 2760 ttttttgttg acaggacctc cattgcagag tgacctctct ttgattcttt taaattggcg 2820 taggtatcgc ttcgtattta ctgccgatat aatcaagatg tttcggcaga ttcaggtgga 2880 accagagcat caggactatc agaggatcgt ctggtcaccc gatgcagcat ctgaaccagt 2940 tgattttcgt ctgactacgg tcacctatgg gacggcttgt gctccttatt tagcgattcg 3000 cactctgtct cagttagtta aggatgaagg tcatcggttt cctctcgggg ctcgttgtct 3060 agaatcgagt tggacaagtg ggctgcgaac catcaggagc tgcttccaga ttcagcacag 3120 caggtatcgg agaagcagat cggtgaagat gagtcggtca aggctttagg cgttcactgg 3180 gttccatcgc aggataattt caagttcaac gctgttaacc tcgagaacgt ggctgcagct 3240 tatacgaaaa ggtctgtcct ctcgaacatc gctcgtttat tcgacccgtt aggttggctc 3300 gctccagtca cggtcatggc gaagattttg atgcaggata tgtggattct gaagtgcgat 3360 tgggattcac ccttaccagc tgagattcgt gagcgctggt atgattattg caagggttta 3420 tccgcattgc catctttgtc catcgagcgt tggttaggtg gtacggcgaa ttgctcttac 3480 cagatccatg ggttttcgga tgcttcttct cgagcgtatg ctgctgtggt gtatctgcga 3540 atcgacgagg gtaatggacg ttttcgggtt tcgttattag cggctaagtc aaaggtagct 3600 ccggtgaaaa cggtgagcat tcccaatttg gagttgtgtg gtgcagctct tctggtcaag 3660 ctcatcctgc acgtgacaaa gctggagttt ctgcgagggt taccaatatt cgcttggtcc 3720 gatagtcaga tcgtgcttac gtggcttcgt aagcatccgt gtcactggaa gaccttcgta 3780 gctaatcgag tttcgttgat ccaaactgag ttgccgtcgg caacgtgggc tcacgttccg 3840 acaaaggaga atccggctga cttggcgacg cgaggagttc agccgggaga gttggctaac 3900 tgtgctcttt ggtggcaggg cccggcgtgg ctatcactgt ctccaactga gtggcctcgg 3960 cctgctgacc cggttcgagt taatcacgct cgtccaagat ccgaggaatc cgagattctg 4020 actcgatttt cgtctttgac taggcttgtt cgggttgtag cgttttgcag acgtgctctc 4080 gatcgaattc ggatcagagg aacgccggca gagagcgctt cgtctccttt tctgaccact 4140 gctgagcttt ctgcagctcg ggtggcagtc attcgtctgg ctcaagcagg ggcgtttgct 4200 gaggaaatta agttgttgag atcgggaaag agactgccga agggcaatcg gctcagcawt 4260 ttaaacccat ttttgggcga ggaagatgga ctcttgcgtg taggaggtcg attggctcac 4320 tccggtctct ctttcgatcg aaaacaccct ccgatcttgc cgcaaaattc tgctcttagt 4380 ttgttgtttg tccgacacgc tcaccatcag tgccttcacg gcggtcccac cttgacgtcg 4440 agcattctaa tgcctcaggt ttggattctg ggtagaaatc ggattgtaaa atcgacgatt 4500 cgcgcatgcg tgccttgtca gcgggttaag cctcactcgg ctcaacaact catgggtgag 4560 cttcctgctg atagagtgac aagtagtagg gctttctctc attctggact cgattacgcg 4620 ggtcccttcc aaattcggat gtcaaagggt cgtggaaatc gttccttcaa gggctacatc 4680 gcgttgttcg tatgtttcgc tactcgtgct atccatctgg agcttgttgg tgacctgact 4740 tcagcgtcgt ttatctgtgc gtatcgtagg tttgtgggac gtcgaggcat ttgccaaaat 4800 ctttacagcg acaacggcac caacttccaa ggagcagata aggagctaag gagtatgttc 4860 cagcgtgcct cagacttcta caagaaggtt ggcgctgtcc ttgccaacga tggaaccagc 4920 tggacgttca ttccccctag tgcgcctcat tacggtggtt tgtgggaagc aggtgtcaag 4980 tcggtgaagc atcacttgaa gagagttgtt ggcgagcaca ctctcacttt cgaggagttt 5040 tctacagttc tggtagagat cgaggcctgt ctcaactctc gtccactagg tgccttgagt 5100 gctgacgtca acgatctacg ggcgctcact ccctctcatt ttctcaatga aggagtctcg 5160 gtcttacttc cggaagctga ttgtccaaag ttgccagaaa atcgtttgtc cagattccag 5220 ctgcttcagc gcattcgaaa caacttctgg aagcgttggt ccacggagta tttgcttcat 5280 ttacaagagc gagagaagtg gagggatcct agcgagaact ttgcggtagg tcagctcgtc 5340 tggtgaagga tgatcggtat cctccatcaa aatggccatt gggcagggtc cttgaggtgc 5400 accctggccc agatggtttg gtgcgagtcg ttaccatcaa gacggctact tcttctctcc 5460 gacgacatgt tgctcggctc tgccctctag cgttggacga gaaggtgaag aagggctctg 5520 tgaacgcgst tagggtctga ttcgtctgtc ggacgaaggc gggcggaa 5568 // ID Gypsy-220_AA-I repbase; DNA; INV; 7602 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-220_AA_; KW Gypsy-220_AA-LTR; Gypsy-220_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7602 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1043-1043 (2011). XX DR [2] (Consensus) XX CC Positions [4689-5165] - Integrase core CC 'TTTA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1153..2319 FT /product="Gypsy-220_AA-I_1p" FT /translation="MAKMNNRIVDVRQTNRNLRSQRPNIMTDESDTEYDTR FT QRNNWQFQDHFRNRLQRNGNRRDRERLVERRSTPSGDRFSTRGSHTSDSYR FT PRFTRQSRRIEHWKVSFSGDNRTVSVENFLYKLKKIAAREEVSQQSLLRDI FT HLILEGQASDWFFTYVDEFEDWDDFEEKIRFRFGNPNQDQGIRQRIHERKQ FT QRGETFSAFVSEIERLNKMLSKPLSRRRKFEVVWDNMRPHYRSKTSIVRVR FT DLEHLIQLNHRIDAADGSFQTHLEARNVESRTQRNVHQIDCASSQDEREEC FT EDLDAIDARLNQRGQHPNVRNQPTTIRNTNTQGQQQGGPAMGGVCWNCRKS FT GHNWRDCKEPRAIFCFGCGELGRTIRSCQRCTDSSRRWAGQNQGNQ" FT CDS 2262..5552 FT /product="Gypsy-220_AA-I_2p" FT /translation="MPTMHRFQPQVGWSESGKPVKECLLGNSNIPSNNVPK FT EAINFDPIDRIHTIKIQTRRCPYLRVNVFETELVALLDSGAGISVINSLEL FT SEKHGLRLQPTRLRVCTADNTEYKCLGYLNIPYTYKNITKVVPTVVVPEIS FT KPILGCNFWRSFGIAPMADLGNGPEQVGQFTDSEEMIAFTLEPIGDLPKVQ FT LSGTDSTLDVPTFDIPEGTQEPTVETLEVEHELTEGERQELVQVMKGFEFT FT STNKLGRTHLIEHEIVLKQDAKPKNQVMYRSAPAIQREIDAEIQRFKDMDV FT IEECYSEWTNPLVPVRKSNGKIRVCLDSRRLNSMTVKDSYPIRNMLEIFQR FT LENAKYFSIIDLKDAYFQIPLKEESRNYTAFRTRQGLFRFKVLPFGLTNAP FT FSMCRLMDKVIGFDLMPSVFVYLDDIVIASKTLKEHFRLLKIVAERLRDAN FT LTISLEKSRFCRKQVKYLGYLLTERGVAIDGSRIAPILDYARPKTMKDARR FT LLGLAGFYQRFIQNYSRITAPITDLLKKSKKKFEWTEAAESAFSELKSVLT FT SAPILANPDFSKSFTIESDASDTAVGAALVQELDGENRVVAYFSKKLSRTQ FT RAYSSVEKECLGVLLAIENFRHYVEGSRFKVVTDARSLLWLFTIGVESGNS FT KLLRWALKIQSYDIQLEYRKGASNITADCLSRSVNLLDLSSQDDEYEELVE FT KVQAEPDKFADYRVTDGNVFRYLKSTNRLEDGRFRWKLFPRNKERAEILRK FT EHDVAHFGFEKTLQNLKRRYFWPGMAKDVRKFCQSCLKCQTSKAGNVNVTP FT PMGAQKEFVEHPWQFVTLDYVGPLPPSGKGRHTCLLVATDVFSKFVLIQAF FT REAKASSLIEFVRNMIFLLFGVPEVILTDNGTQFTSKAFRDLLAEYGVNHW FT LTPSYHPQVNNTERVNRVVTTAIRATIKQHRDWADNLQTIACAIRNAVHES FT TKYTPYFVVFGREMVSDGQEYQKMRQADSGDSMQQPFGSGRREKLYAEIKQ FT HLAKAYERHKRTYNLRSNSECPTYTAGETVLKRSFEQSSKSKNFCAKLAPK FT YELAVVRKILGKHCYELENLQGKRLGVFYGSHLKKMHPPT" XX SQ Sequence 7602 BP; 2327 A; 1503 C; 1715 G; 2055 T; 2 other; tttagcgtgc tacattttgg cgcccaacgt ttagaaggca aacacatttt tttcattttt 60 tggcgcttat tgaataattg cacgttttag aattttcctt catttattga atacaataaa 120 ataagtacat aactaaattt gaaccgtata atttatagac aaatgcggta ttttccaacc 180 attttccttc tagtcatcat ttattagatt aaaatacaaa cagaaatata ttatttttca 240 tttatttatt tatttattta tttatttatt tatttattta tttatttata catatattat 300 ttcgtgctgt acattgatat tgtgttaaaa ttgtgcgttt tcctgtatat tttgcattaa 360 agatacgtat tagtgaactt gctagtgttt atataattat taagatctgt ttacactcct 420 attttgaaca acatgctttt tccccacgct gattatttga cgaatgacga aatcaattat 480 gagcttatat tgcgaaatca tcaagaagat gttcataaag atgttcaaac taaattgcga 540 ttgttgcgaa ggttgtttca aaccgatcag aaagagggat gggactacaa gtctcctttc 600 aaattcgaac aggaagctca tatcataaca aatacggttg aacaaatacg agaagcatgg 660 caaatgaggg agggagatcc tagactcgta tcacggttgc gacattacta tttacgagtt 720 agaagagggg aaacgtctga tagggatgtg gaaaatacga gaagagaact tcttaaatca 780 atctctgaga ttcttcgtag tacgactgag aagactccag aggacgagaa cgagataggt 840 gaagctgcag ccaaagaaga cggcaagtca ggtcacggca gtcaggaggc taaacggttg 900 ggggcatatc cgaaaaacag actttcacct tccaaaccag aaaccgttcc aacaaagaag 960 agtgttgttg aaccgacata cgaggagttg ctcgataaag ttcgggaact gcagacaact 1020 cttcaacaaa tgcgaattcg tagcgggcgt gaccagcaac ggaggcagga ggaattaaga 1080 acgtcacagt cgaaagaggc aaggcagtcc gagaacgtag kaacagatgc tgatagcagt 1140 gaaccggaaa gcatggctaa gatgaacaac cgaatagtag atgttaggca aacgaatagg 1200 aacttaaggt cacagagacc taacattatg accgacgaat cagatacaga atacgatact 1260 aggcagcgta acaactggca atttcaggac cattttcgga atagacttca gcgtaatggc 1320 aacagacgag atcgagaaag gttagttgaa cggcgaagca ctcctagtgg cgatcggttc 1380 agtacaagag gttcccatac atcagatagc taccgaccma ggtttacgcg ccagtcacga 1440 aggattgagc actggaaagt ctccttttct ggtgacaata ggacagtgtc agtagagaac 1500 ttcttgtata agttaaaaaa gattgctgct cgggaggagg tttctcaaca aagtctgctg 1560 cgggatattc atctgatctt ggagggacag gcctcggact ggttttttac gtatgttgac 1620 gagttcgagg actgggacga cttcgaggaa aaaatcaggt tcagatttgg aaatccaaat 1680 caggatcaag ggattcgaca gcgaattcac gagcgaaagc aacagcgggg ggaaactttc 1740 agcgcctttg tctcagaaat agagcgtctg aacaaaatgc tctctaaacc actctcaagg 1800 cgacggaagt tcgaggttgt gtgggacaac atgcgcccac actaccgctc aaaaacgtct 1860 atcgtccggg tacgcgatct agagcatctg atacagctaa accacaggat tgacgcagca 1920 gacggaagct ttcaaacaca tcttgaagca agaaatgtag aatctcgaac tcaacggaat 1980 gtacatcaaa tcgattgcgc ttcgtctcaa gacgaaaggg aggaatgtga agatcttgat 2040 gcgatagatg caaggttgaa ccaaagagga caacatccaa acgtaagaaa ccaaccaact 2100 acgatcagaa acaccaacac acaaggccaa cagcaaggag gtccagctat gggaggagta 2160 tgttggaact gcaggaaaag tggtcataac tggagagact gcaaggaacc cagggctatc 2220 ttctgttttg gctgtggtga gcttggaaga accattcgtt catgccaacg atgcaccgat 2280 tccagccgca ggtgggctgg tcagaatcag ggaaaccagt gaaggaatgt ttgttgggga 2340 actcgaacat tccatcgaac aatgttccca aagaggcaat caactttgac ccaatagatc 2400 gcatccacac catcaagatt caaacgagac gatgcccgta tttacgcgta aacgttttcg 2460 aaactgaact agttgctcta ctagattcag gggccgggat aagcgtaata aactccttag 2520 agttgtcaga aaaacatggt ttgcgacttc aaccaacgag actcagagtt tgtacagccg 2580 acaacacgga gtataaatgt ttaggatacc taaacatacc atatacttac aaaaatatta 2640 ccaaagtagt gccaactgtg gtggtaccag aaatctcgaa acccattcta ggttgtaatt 2700 tttggcgtag ttttggtatt gcaccgatgg ctgacttggg gaacggacca gagcaagttg 2760 gacaatttac cgattcagag gagatgatag cgtttacctt agaacccatt ggagatctcc 2820 caaaggtcca gttatcggga acagactcaa cgcttgatgt tccaactttc gatatcccgg 2880 agggaacaca agaaccgact gtggaaactt tggaagttga gcacgaattg acggaaggcg 2940 aacgacaaga gttagttcag gtaatgaagg gcttcgagtt cactagcact aacaaattgg 3000 gacgaacgca tcttattgaa catgaaattg tgctcaagca agatgcaaaa ccgaagaatc 3060 aagtgatgta tagatcggct ccagcaattc aaagggaaat agacgcggaa atccaaaggt 3120 tcaaggatat ggatgttatc gaggaatgct atagcgaatg gacgaatcca ttagttccag 3180 tcaggaagtc caacgggaaa atacgagtgt gtctcgattc tcgtcgactg aatagcatga 3240 ccgtcaaaga cagctatcca ataaggaata tgctggaaat ttttcaacgg ttggagaacg 3300 caaaatattt ctctatcatt gatttgaagg atgcgtattt ccaaattccc ctaaaggaag 3360 aaagtaggaa ctacacagcc tttcggacac gacaaggtct atttaggttc aaggtgttac 3420 cattcggttt aacaaatgca cctttctcga tgtgtagact gatggacaag gtcatcgggt 3480 tcgacctgat gccctcagtg tttgtttacc ttgatgatat agtcattgcg tctaaaacat 3540 tgaaggaaca ctttcgtcta ctaaagatcg tggcggagag gttgagagat gcaaatctaa 3600 cgatctccct cgagaaatca cggttttgtc gaaaacaggt caagtacctc ggttatctgt 3660 taaccgaaag aggggtagcc attgatggat cgagaatagc accaattctg gactacgcaa 3720 gacctaaaac tatgaaagat gcacgcaggc tgttaggatt ggcaggtttc taccaaagat 3780 ttattcaaaa ctacagtaga ataactgcac ccattaccga tctgcttaaa aaatcgaaaa 3840 aaaagttcga gtggacggaa gcagcggaaa gcgctttttc ggaactcaaa tcggtattga 3900 catcagcgcc aattctggcc aacccggatt tctcgaagtc atttacaatt gagagtgatg 3960 catctgacac ggctgtcgga gcagctctcg tacaggagct ggatggagaa aatcgtgtag 4020 ttgcatactt cagtaagaag ttaagcagaa cgcaaagggc atattcaagt gttgagaagg 4080 aatgcctggg agttttgcta gcaatcgaga atttccgaca ctatgtggaa gggtcacgtt 4140 tcaaggttgt tacggatgca cgaagtcttt tgtggctgtt cacgattgga gttgagtctg 4200 ggaactcaaa gctcctccga tgggcattga aaattcaatc gtatgatatt caactggaat 4260 accgcaaagg ggcgagtaat attactgccg attgtctctc acggtcggta aatctcctgg 4320 atttatcctc tcaagatgac gagtatgaag aactggtcga aaaggtccag gctgaaccgg 4380 acaaatttgc tgattaccgc gtcacggatg ggaacgtttt tagatacttg aagagtacga 4440 atcgattaga ggacggtcgt ttcagatgga agttgttccc acggaacaag gaacgagctg 4500 aaatccttag gaaagaacac gacgtagccc actttgggtt cgaaaagaca cttcaaaacc 4560 ttaaacgtcg ttatttttgg cctggaatgg ccaaagacgt aaggaaattt tgccagagtt 4620 gtttgaagtg ccagacttcc aaagcaggaa atgtaaatgt tacacctcca atgggagctc 4680 aaaaggagtt cgtagagcat ccctggcaat tcgtaaccct agattacgtc ggtcctctgc 4740 ccccatcagg aaagggcaga catacatgcc ttctcgttgc cacagatgtc tttagtaaat 4800 ttgttctaat ccaagctttc cgggaagcta aagccagttc gttaatagaa ttcgtaagaa 4860 acatgatctt cctacttttt ggagtaccgg aagtgatctt aaccgataac gggacacagt 4920 tcacgtccaa agcctttcgt gatcttctag cggaatatgg tgtcaaccat tggctcaccc 4980 cttcgtatca cccacaagtg aataatacgg agagggttaa ccgtgtagtc acgaccgcaa 5040 ttcgagcaac catcaagcaa caccgagact gggcagataa tcttcagacg attgcttgtg 5100 caatacgtaa tgcggtgcat gaatcaacaa agtacacacc atacttcgtt gtattcggca 5160 gggaaatggt atcagatgga caggagtacc agaaaatgcg gcaagcagat tcaggtgact 5220 caatgcaaca accgttcggt tctggtagaa gggagaaact gtatgcggag attaaacagc 5280 accttgcaaa agcctatgaa cgccataaac gcacctataa ccttcgttcg aattcggagt 5340 gcccgacata taccgcggga gaaacggttc tgaagagatc atttgaacag tctagcaaat 5400 ccaaaaactt ctgtgcaaag ttagccccga agtatgaact cgccgtagtc aggaagatac 5460 tcggaaaaca ctgctacgag cttgagaatc tacaagggaa acggctgggt gtcttttacg 5520 gaagtcacct caaaaagatg cacccgccga cctagacgta tttttcttca gctatgaacc 5580 tcttcggagt gaccacttag ttagccaaaa cacctacggg ttcattaaaa ctcctgtcac 5640 taaggtgact tacaacgttt tccagctatg tatagtgaga gacaacttct tacaaacacc 5700 tacggggaat gtcgtacgtt gtacagtcga cctcacagca gggacataat tgacaatcac 5760 ctaaaacagt taattgaaga gtgtcacctg ggcgatgaat tctcattgaa gtaacaagaa 5820 ccatgaatca tagaaagagc tgggtgatga cctgactgag agtagtgaca ccatgaagtc 5880 tatagtgagc ccagtgatga tttccgattc tctgtaccta gtactgagtt agtagtggtt 5940 cttgccaagc ctaggaagga gcccatgatg gacctcgaac gtcaaatgac aaatttaggt 6000 gggttgtgaa tcgctcagct gtgtgatgaa tcgacttaca acacaatctc acatggaaaa 6060 gttcgaagga cggtttgaac aagctatgcc acttcccctg aagcaactaa ggtagttcca 6120 aagaaaaact tcccacgaaa acacttttca aaggcaaaaa aatcattcaa ataggtagca 6180 gagacaacaa attactctcc cttgtaaata ttgtacatag ttagtgatta gtccgcgcct 6240 aacgcctacg gtaatactag aattatttct taagattact attagttaga taacgtcttc 6300 aactcacatt tccaataaga tttgccgtgt tttcgttctt caatgtaatc tcgttcctta 6360 gttgcataat ttcccatcat atttgtccaa gcttgtagtc cttctttatc gatagtgcag 6420 ttgttttcat tccatccatt gttgttgatt atcattccgt tcatcgtcag gttccttgtg 6480 gtttattttc gtccgctttg tatgtctagg tgagcgaggt tgtagtttgc tgtccagttc 6540 gaaatcttcc atagtataat ccaacttgtt tcccgtttgt gcatcgctag acccaagttc 6600 caattagaat tttccattcg ccatcttctg attgtaccaa gaaatgcttc ccatgacttc 6660 gacctaaaat tagacaaaaa tagccctccc acaatatccc caacagttct tttagaaaaa 6720 catcaagcga tacagtttag ttaatacttt ccgcaacaat aaaccgctct ttagtccttc 6780 tcatcttcac ttttgtcgga ttttgccagc gaaatttaat tttcgtcttg cagtgtttac 6840 tttttttttc acttgtttga catttggtta gtttcagtcc gttctgttgg cagcgcaatg 6900 agtgggtgag ttgcaaggga gtgagtgacg attcgttccc ggagtgaccg ttcgaggggt 6960 gattcattgt tgtttgtgtt attgtacgtt ctcgtaatat ttcgcaagaa gtattagcga 7020 cgaagcttaa tatttatgtt gtgttgtggg aaacgttttc ggacattaat gtctggagtt 7080 gtttgtcaca acggatgtca ttgtttgtga agaataaata caatcatata cctgagtaac 7140 atacttggga agattatatt taggattcac tagttgtcat gtttaatgtg gaaagtgagc 7200 gcaatcataa attcggaaca ggttctggag tggattgtcg ttcatgatcc acaagttcag 7260 tgaagtttaa ggattctgtc ttagtattcg gtctcgaaga ctggagttaa agatagatat 7320 ccaagtatat gtttacttct ctggagttgc ggccgactcc agatgcaagt atatttcatt 7380 gtttcgtttt aagatgtgta tgtgtgtggg ttttgggagc gtctgccaac atgtttcttt 7440 tcttaattta actccatagc tcaaacatat ttcatcaata gtcaacagtc attcgattgg 7500 gaatcagcat tagtttaacc ctacgaaaat ttgattgaaa ttctcaaatt tcaatcaaat 7560 tttcgtaact ttagtagagg atgatgtaac atggtcattt ta 7602 // ID CR1_Ele5 repbase; DNA; INV; 4866 BP. XX AC . XX DT 25-OCT-2010 (Rel. 15.1, Created) DT 25-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele5. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4866 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4866 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (25-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 18 CC sequences with >94% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 182..931 FT /product="CR1_Ele5_1p" FT /translation="MEQTVVIKPKVSQDVNTTKSEIRHKLDPVAHSVKDVF FT YSNNGNVAIRCDSHSAAMKLLDSAKSTMSANYEIDIQKALRPRLKIVGFST FT DFDADSFLTKFRKQNDLPDCSFIQIVRFTKSKRDKENPMSVILEVDALSFK FT QLIKAKIAYVGWERCPVSESIDVLRCYRCSEFGHIASICTKPLCCPKCTEC FT HEVSECTSEYEKCINCTIVNKDRKLPADQQLEVNHCSWSTECPMYLKRLNK FT SRQRIDYSS" FT CDS 935..4453 FT /product="CR1_Ele5_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QYISEGDPHGCTLVAIQLPSGKCVNSICTAEAIPDTA FT LRSSKIDFQTKFPASFDIPTTCSPIGNTVDNSPSGSNYLLNDLTGSAHPKL FT FPTDESSLDGFSYQLNELAGPGKDNTCICPAEAGNDATRTLIRPVLQINSV FT VPTDLDLLSSGANFVSSGKHASSICPAEAGEQSTPNLTHSTLQTCQASSCS FT AISTDLHQLTEPHNTNVTSANRITVYYQNIRGLRTKIDDFFLAAYESSYDV FT IVITETWLNDQIFSPQLFGDRFTVYRTDRNSSTSSKKRGGGVLIAVTNRLS FT SALPSIAIDSTVEHLWVKIFVADNVVAVGAVYLSPEIANDATVIEKHLDSA FT LNITNSLKPGDCHLLLGDFNQSSIIWTHSSSGYLYADPRDSHFSPGSCSLL FT DGFSQLDMKQLNYIEHHRGRILDLIFANSDIASLCSVQRANDPLLEEDPYH FT PALVTSLTCNCPVLFDEIETPNEFDFQKANFSTLDYDISHIDWTPLLDASD FT VDGSVDFFTYTLKQLFINNVPVPRPRRKPPWSNNHLRNLKRHRSNALRKYT FT NLKTTESKRHFNIASNRYRSYNRFLHARYTNRIESNLKHNPKKFWSFMNAK FT RKEAGLPTSVFLNEEVADSCDDMCNLFAKHFSDVFRKESVDIHRINNALAD FT VPLNALNMEYFQFSENEVLNALNKLKSSSTAGPDGIPSIVLKKCASSLCAA FT LTIICNRSIAESRFPTKWKESIMFPVYKKGDKRDVKNYRGITSLCAGSKLL FT EILVSDVLLSASKSYISPDQHGFFPGRSSETNLVQFTSFCLRNIEEGNQVD FT AVYTDLKSAFDLVDHLILIAKMERLGAPSFFTRWLESYLRNRRLSVKIRAV FT QSYYFTNNSGVPQGSNLGPLLFSIFINDVCCLLPAGFRLVYADDLKIYLVV FT KNHEDCIELQRLVDLFEDWCIRNHLVISAQKCTVISFNRKKCPITWDYQIC FT GTLLSRVTVVKDLGVLLDTNLSFRNHFSNIIARANRNLGFIMRSTGDFSDP FT FTLRSLYYSLVRSTLEYACVVWSPYTNVWILRIEAIQAKFTRYALRLLPWN FT DIANMPAYEIRCRLLGMESLRNRRNSLRALFAGKLLLAVIDAPNILERLNI FT NAPSVSLRSRDPLRPDFRRTDYGINEPITALCREFNKVYHLFDYSISASVF FT KSRLRRFSLHPESE" XX SQ Sequence 4866 BP; 1396 A; 1070 C; 942 G; 1458 T; 0 other; aactgccaca tctaatgatt tgcgtcaagc tcttgatgac ggccagtggc tgcgctctgg 60 aaaaaggcgc gtgcggtccg gtcagaacaa cactttcaaa tctcccgtta tagtaacacc 120 atctgctcgt cctgctgata atgcgatatt gccgaatggt aaagccaaat ctgttatttt 180 gatggaacaa actgttgtta tcaagcccaa ggtttcacaa gacgtgaata caactaaatc 240 tgaaatccga cacaagctcg atccggttgc tcattcagtt aaagacgtct tttacagcaa 300 caatggaaat gttgctatcc gatgcgattc tcactccgct gcgatgaaac tccttgattc 360 ggctaaatcc actatgagcg ccaactacga aattgatatc caaaaggctc ttcgaccaag 420 actgaaaatt gtcggctttt ctactgattt tgatgctgat tcatttttaa caaaattcag 480 aaaacaaaat gatctgcctg actgctcctt catacaaatt gtgcgattca caaaatccaa 540 aagggataag gaaaatccta tgtcagtcat tcttgaagtg gatgcattaa gttttaaaca 600 gctgattaaa gccaaaattg cgtatgttgg atgggaacgc tgcccagtct ctgagtcaat 660 tgatgtttta cgttgctatc gctgctcaga attcggtcac atcgcatcta tctgtacgaa 720 gcctctatgc tgtcccaaat gtaccgagtg tcatgaagta tcagagtgca cttctgagta 780 cgaaaagtgc atcaattgta ctattgtgaa caaagacagg aaattgccag ccgatcagca 840 gctcgaagtt aatcattgtt cttggagtac ggaatgcccg atgtacctaa aacgtttgaa 900 caagtcccgc caaaggatcg actattctag ctagcaatac atcagcgagg gagatcctca 960 tggctgtact cttgtagcta ttcagctgcc atcaggtaaa tgcgtgaata gtatatgtac 1020 agccgaagct attccagata ctgcactgcg ctcaagtaaa attgacttcc agacaaaatt 1080 tcccgcttcg ttcgacattc caacaacttg ttccccaatt ggtaacaccg tagacaattc 1140 tccctctgga tccaattatc tgctgaatga tctaactggt tctgcccacc cgaaactatt 1200 ccctactgat gagtcatcgt tggatggatt ttcttaccag ctgaacgaac tggctggtcc 1260 aggtaaagat aacacatgta tatgtcctgc cgaagcaggg aatgatgcta cacgcaccct 1320 catacgacct gtgttgcaga tcaactccgt tgtccccact gacttggacc tactatcatc 1380 aggcgcaaac ttcgtttcct caggtaaaca tgcatcaagt atatgtcctg ccgaagcagg 1440 tgaacagtct acacccaacc tgacccacag cacattgcag acttgtcagg cgtccagctg 1500 ttctgctatt tcgaccgatc ttcatcagct cactgaacca cacaacacca acgtgacaag 1560 tgcaaatcga atcaccgtct actatcaaaa tatcagaggg ctgcgtacaa agattgatga 1620 cttctttctt gctgcatatg aatcttctta tgatgttatc gtcattactg aaacgtggct 1680 taatgaccag atcttttcac cgcagctgtt cggtgatcgg ttcaccgtat atcgtacaga 1740 taggaactca tcaacctctt ctaaaaaaag aggtgggggc gtacttattg ctgtaaccaa 1800 tcgtcttagc tcggctctac cttcaattgc aatcgatagt actgttgaac atctttgggt 1860 gaaaattttt gttgctgata atgtcgtcgc tgttggcgct gtttacttgt cgccggaaat 1920 cgctaatgat gctaccgtaa ttgagaaaca tttggattca gctttgaaca ttaccaactc 1980 tttaaagcct ggtgattgcc atcttcttct tggtgatttc aatcaaagta gtataatttg 2040 gactcattca tcttcgggat atttgtatgc tgacccaaga gattcgcatt tctctcctgg 2100 tagctgctcc cttctggatg gattctccca gttggacatg aaacagttga actacattga 2160 acaccatcgc ggacgaatcc tggacttgat atttgccaat tcagatattg cttcgctttg 2220 ttctgtacaa agagcgaatg atcctttgct agaagaagat ccgtatcatc ctgcgttagt 2280 aacttcgctt acttgcaact gtccggtact attcgatgaa atagagactc ccaatgaatt 2340 tgacttccag aaggcaaact tttcaacact ggattatgat atttcgcata tcgactggac 2400 tcctttactt gatgcttctg acgttgacgg ttctgtggat ttcttcacct atacgctgaa 2460 acagcttttt attaataacg ttcctgtacc tcgaccacga agaaaacctc catggtccaa 2520 caaccattta cgtaacttaa agcgccatag atccaacgca ctgagaaaat acactaacct 2580 gaaaacaacg gagtcaaaga gacatttcaa tattgccagc aacaggtaca gatcgtataa 2640 tagatttctt catgctcggt acaccaatcg aattgaatct aacctgaagc ataacccaaa 2700 aaaattctgg tcgttcatga atgctaagag aaaggaagct ggcttaccaa catctgtatt 2760 tttaaacgaa gaagtagcag attcttgtga cgatatgtgc aatctttttg caaaacactt 2820 ttcagacgtt ttccgtaaag agtctgttga tatacaccgg atcaataacg ctctggctga 2880 cgttcctcta aatgctctca atatggaata tttccagttt tctgaaaatg aagtcctgaa 2940 cgcgctgaat aaactcaagt catcttctac agcgggtcct gatggtattc cttcaatcgt 3000 attaaaaaaa tgtgcctcat cgttgtgcgc tgctctcacc attatctgca acagatcaat 3060 tgctgaatcc agattcccca ccaaatggaa agaatcaata atgtttccgg tatacaaaaa 3120 gggagacaag agagacgtaa aaaattacag gggaataact tcattatgcg ctggatccaa 3180 gctgcttgaa atcctagtca gcgatgtttt gttatcagct tctaagtcgt acatttctcc 3240 tgaccaacac ggattttttc cgggacgctc cagtgaaaca aatctggtac aatttacgtc 3300 gttttgcctg agaaacatcg aagaaggcaa tcaagtcgac gcagtctaca cggacctgaa 3360 atcagctttc gatctggtcg accatttgat tctaatagct aaaatggaaa gattgggagc 3420 accttcgttc ttcactcgtt ggcttgaatc gtacttgagg aatcggcgat tgtctgttaa 3480 aatacgtgcg gtacaatctt actatttcac taacaactct ggcgtacccc agggcagtaa 3540 tttaggtccg ttactgttct cgatattcat caatgatgta tgctgcttgt tacctgctgg 3600 atttcgttta gtgtatgcgg atgatctaaa gatctatcta gttgtaaaga atcacgagga 3660 ttgcatcgag ctgcagaggt tggttgatct ctttgaagac tggtgtatca gaaatcatct 3720 ggttataagt gcccaaaaat gtacagtgat atcattcaac cggaaaaaat gtccaataac 3780 atgggattat caaatctgtg gtactctgct gtcgcgtgtc accgtcgtca aagaccttgg 3840 agtccttctt gacacaaatc tgtcttttcg taaccacttt tctaatataa ttgcaagagc 3900 aaatagaaac ttaggtttca tcatgcgatc aaccggagat ttctctgatc ctttcacttt 3960 aagatcacta tattattcgc tagttcgctc cacactggaa tatgcatgcg ttgtttggag 4020 tccttacaca aatgtatgga tattgagaat agaagcaatt caggccaaat tcaccagata 4080 tgctttgaga ctgctcccct ggaacgatat tgccaacatg ccagcatatg aaatacgatg 4140 ccgtttgctt ggtatggaat cgcttcgcaa tcgacggaat tctttgagag cattattcgc 4200 cggtaaactg ttattagcgg ttatagacgc tccaaatatt cttgagcgac tgaacatcaa 4260 tgcgccttca gtatccctgc gttccagaga tcctctcaga ccggattttc gtagaacgga 4320 ttacggaatc aacgaaccca tcactgcttt gtgccgggaa ttcaacaaag tctatcatct 4380 attcgattat tccatctcag cttcagtatt caaaagcaga ctaagacgtt tctctctaca 4440 tccggaatct gagtgagcaa aattgtgtcg caattaggtg agtcccagtc caataatttt 4500 aaaagtttta aatgatgctt cttgagtgtt cttttctttt ttgttctcgg tcatctttct 4560 tcaatacaac gagtgacgaa tacagctgcg tatgaacttg acaaatgatt acttctctca 4620 tagctctgtt taagtattta ttgcgtagaa atgtcgtttt gtattttata ttttttatgt 4680 attattttta atctgtatgt aatccattgt attttgtttt gaaaagatat ggggttttta 4740 cgcccttttg attggactac agttcatgca aaagggcttt tccccatctt aatcctcatt 4800 aagacatcga tcagttgagg tatgaaaata aataaataaa taaataaata aataaataaa 4860 taaata 4866 // ID DNA3-1_AAe repbase; DNA; INV; 3296 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA3-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3296 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1278-1278 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. 3-bp TSDs; usually TWA. TIRs are CC 71 CC bp long. XX SQ Sequence 3296 BP; 1020 A; 562 C; 611 G; 1103 T; 0 other; gggtctgtct caattgcgtt gttaaaacga gtccgaactt aaatttgaca gttcgggaat 60 tatttcccct ctgatttcga tctgtcaaaa ataagtctgc tccagaggag acttattttt 120 gacagatcga gaattatttc tctagttcgt gtttcttgtt atggatgtcg tccacgtttt 180 tcgtcccgct tcgattaccg atgtggccat cggtccgacc tcccagctgc accaaggaac 240 tattgtaagt tagaaaatgc aattcgccgg atttatctcg gtatccatgc aagataccct 300 acttcggaag cagggcgcat tagagcaaat ccggatgcga atcgaatgct ccacttccga 360 agttaggtat ctagcataga ttccgagaga aatccaactt cggagaaaaa ccgtttctaa 420 cttttgccgc gtccaaaata taaagcttcc gtgtgtgttg tgtaggttgt tcaatccgga 480 tggtcaatca gaaaatttct gtgttggcca aaaggaatag ttgcagggct gttagcgagt 540 cacttttgac ctggtaacta aaaagtcact atttgcaact aaaattgaaa ggcctaaaag 600 gtagggattt ttgcctcttt ctggaaagtt tcaggtgaat agtcctcaaa catttttgtc 660 gagaagtccg aatgttttgt cggggctctt tgaggaattt ttgatgccaa tccggataaa 720 tttttgaaaa caattataac atccattttc gtttttaggc attcttttga gtgaaaactc 780 gtagttttga aatcacttat taatctctat cttgatcatt taagtccagg tctttgcata 840 ccccaaacaa acaaaatcta gtttatttca accaaaattc tacgtcgaat attgcacaaa 900 caaaaaatta gtttgctttc acaatgtcac ttggttgaaa caaactaaaa aactgttttg 960 atagttttgc gataatcaaa ttaaattcaa aaaatttttg ttagcaacaa ccaataattt 1020 ggttatttca tttgacagtt ctctattttt tcatattatt tttgtttcat tggagaggta 1080 tcaaatgttt atcattgttc aaatattgta taattacagt gcattttttg tccaatagaa 1140 ctagatcgtt caaaatgtgc tgcgatcttt gtcatcagca tcatgagctt gaagcgccat 1200 attttgggta gtatttagca agcggaaatc atcagaacct gtgtgggaga caatttttgt 1260 ttgtttttgt atcgatttat aacagttgta ttataattac ctaattagtt gtattataac 1320 tacctacgtt tcaagcctta accttccagt cgtcgcgcgg tttgccaccg tcagaaccac 1380 tacgctactg ttgggcacga aaagcgaggt tttttcaaca gtattgtaca aaatacaaca 1440 gcgcgacgac tgaagtgtta agttacaatt acaaagatca ggcaattcag agagacattc 1500 aataccctac acaaaaccat tgatgaggct cgcatccatc gttcattgtc tgaaaaaaga 1560 ttatttacct tattaaagct gggtcactca ataaaaaaga gttatgtgca tcacttaccg 1620 tacaaaatct tatgcatgag gtttccaaga aaaatgtaac agtttgttta gtttgatatt 1680 caaaattgtt tgaatctacc taaagctttg gtttgaatct cagatatcca aacaaaaata 1740 aacaacctaa taaatgatac caataaacca taacttagtt gacataaact gacaaattgg 1800 tttaaaacta tcagaattat tgttagattt tactaaaacc aaagatagaa acaacaatgt 1860 tattagttta tttcaaatgc attatggttt tatttcaact ttgtttgtca agagttgagt 1920 gcttcgtggt cttcgataat gttcttcatc ataatatgtc gtatctggtc aggatcattt 1980 gtcacgacga ttatttgcaa tgggttttgc caaatttgga gggagatatt gattttctca 2040 actaaaattt tgaaacgaca aaggtttttc gtgacctgtt ggacaaaact gacaaattgt 2100 tgaaacttca agaattattt tgattttaaa ccaaaataga aatgaatgtt attattactt 2160 caaatgcatt catgagtttg aagcaattaa attagaaaaa tagatatttc aaactttgct 2220 tttgtcaaga gttggagtgc ttcgtggtct tcgataatgt tcttcatcat aatattacat 2280 atcgtggtct gcgttcaatt ttttttatag aagtcaatga gcgtcctctg acgattttta 2340 tttgcaaaag tgagttttgc catagtaaat cccatacaaa cttttaacgg cttgcgctaa 2400 attatagttt caccgatcga gatgaaattt tacacagttg ttataggacc caaacgggac 2460 ctgaaaagtg ggctggattg agaatcagaa tttcgtccca cactaataat tcattcgaaa 2520 aaatcgtaag gcttttaggg ggagagggtg tatcaaaatt caaaatattt ctttcgaggg 2580 agggaggaca tgaaatctta cgtaagactc cctagaaaat ttaaaaaaag caataatttt 2640 gttgctcaaa ttgaaaaatc ctttgtgaaa aacttttcac tgtatccagc aagtgtcgtg 2700 cagtgataga ttttcaagac gtaaaacgtt gacgtttgtt ttgatggaaa ttttgaatac 2760 gtttgttttc ctctttaggg tattaactat attcgcccta agaaatgttg cgctcacgct 2820 ggctatgcaa tagtacaaac gtagaaaact ttatttcacg gtctcttctt tggtatcaca 2880 ctaatgacat tatggtaaac cattacgaaa aaactctcct tcaaacgatt ttttttgatg 2940 atgttgtgaa tccggtataa agaaggacaa acacgttttt taattgttta attctcattg 3000 tctgagcgtc gcactaatat gtgattagtg cactgtgttc ataaaaaatc gttagaatgc 3060 agcgccgcgc ttgaatcatt tgatttcgaa gcgcggcgag tgtgagttgt gcgctgtccg 3120 gtttggtgtc ggtgcgacaa tattttgttt tgaaatttcc tgaatcgaca acgattgcga 3180 attggaccag acaaaaatta aatttgacag tccattgatt aattggaggg gaaataattt 3240 ctgaactgtc aaatttaagt tcggactcat tttaacaacg caaatgagac agaccc 3296 // ID L1-33_AAe repbase; DNA; INV; 4410 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-33_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4410 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1386-1386 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 146..1156 FT /product="L1-33_AAe_1p" FT /translation="MSIRRENTFRIDYANVPKKPSFEELHDFIGSVLGLQY FT EQVVRLQPSRALGCAFVKVVNLELAQKIVAEHDDKHETEVDGKVYKLRITL FT EDGSVEVKLTDLSEDISNEQISEYLSSYGEVLSVTEQLWDSKYRFAGLPTG FT TRIVRMVVKRNIESYITIDGQTTNVVYFGQLHTCRYCSEFVHNGISCVQNK FT KLLVQKTYANVAKQTDVSPSATKPVATKNKMPFSKLFGAKSREATSSRKVN FT SNDSLSAVAKPSIPDPVDRATKPNPTEQSNSLMPPPAVSQGNQSTCRKSPR FT QASDGNDTDNSTTSNNGKRRSARVPGKKLRQNYDDDTNEEIDVQI" FT CDS 1159..4329 FT /product="L1-33_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MAFHSYNLVSININTITNPTKINALQTFLRNSEIDIA FT FLQEVENEQLLLPGYNIVCNVDHTRRGTAVALKEHIQYTNVEKSLDGRIIA FT LRVQNTTLCNVYAPSGTALRNERERFYENTLAYYLRHQTEHTVIAGDFNCV FT LRQCDATGHNTSPALLRTVQQLQLLDIWEKVCPNMPGHTYVSHNSSSRLDR FT IYVSQSLRTHLRSACLHVCSFSDHKALTARICLPYLGREAGRGFWSLRPHL FT LSPEHLNEFRMRWQYWTRQRRNFPSWIQWWVSYAKPKIKSFFRWKSKTVFD FT GFHWAHQRLYVELRQAYDGYYMNPGMLSTINRVKSQMLTLQRNFTKTFMRL FT NESYVAGEPISTFQLGEKRRKRTVITQLRDDNNVPIEGSEAIERHMLNFYR FT DLYAAGATEDTTANEFVCERVIPENDQMNECCMSEITVADVFTAIKSSSPN FT KSPGGDSIPREFYLRAFDIIWREITLIMNDALAGNFPANFVDGVIVLVKKK FT GNDQTAHSYRPISLLNCDYKIFSRVLKYRLENIMRTNHIISDSQKCSNFGR FT NIFQATLALKDRIARLKKTKQRGLLASFDLRHAFDLVDRVFLARNMCSIGF FT NPNLVRLLVRIGELSTSRLLVNGYLSESFPIERSVRQGDPLSMHLFVLYLH FT PLIKRLEEVGGPDLVVAYADDVSVISTCSERLERMRETFRRFERVSGSRLS FT LTKSSSISVGYTDVMPLTVPWLRTENTIRILGVTFANSVRLMNKLNWDEQV FT SKFARLVYLHSARTLTLHQKVTLLNTFITSKIWYLASTLAPSAVHTAKLTS FT TMGSFLWRGIPARVPMQQLARSKEAGGLKLHLPTLKCKALLLNRHMRDIDS FT LPFYKSFLTQNNPDLPADCPCLKLILSNFPILPPRVQENPSADSIHNVYIE FT QTEEPKISREHPTNNWRQIWGNISMRQLASSQRSSLFLLVNAKLDHRKLMY FT RINRTDGDNCLHCNRATETLEHKLSECVRVAPAWRLLQQKISSLFNGWRLL FT TIGELLRPQLNGIRRWKKVKILQMFIQYVLFILQCNNAIDLGSLEFEIQLV FT " XX SQ Sequence 4410 BP; 1276 A; 1086 C; 952 G; 1096 T; 0 other; attagttagc gctcaacttc catgcagagc agtcgtgttt ctcgctggaa gccccacgca 60 agctattgtt acattttttt cgatctttta catcttaatt tcgcgttctg tgctcgctgt 120 gaatagtgtc gcggatctcg ccgcgatgag tattcgacgc gaaaacacgt ttcgcatcga 180 ttatgcgaac gtgccgaaga agccgtcttt tgaggagctt cacgacttca ttggctctgt 240 gttaggcctt caatacgaac aagtcgttcg tcttcaaccg agtagggcac tcgggtgcgc 300 cttcgtgaag gttgttaacc tcgagctcgc gcagaaaatt gtagccgagc acgatgacaa 360 gcatgaaacc gaagtagacg gcaaggtcta caaactacgg atcacgcttg aagatggatc 420 cgtggaggtg aaactaaccg atctctccga agatatctcg aatgagcaga tatccgagta 480 tctcagcagt tacggtgaag ttctctctgt caccgaacaa ttatgggaca gcaaatatcg 540 ctttgctggg ctcccgaccg gtacacgtat cgtgcgcatg gtggtgaaac gcaatattga 600 aagctacatc accatagatg gccaaactac gaacgtagtt tatttcgggc agctacatac 660 atgtcgttac tgtagcgagt ttgtgcataa cggcatctct tgcgtacaga acaagaagct 720 gctcgttcaa aagacgtatg ccaatgttgc gaaacaaacc gacgtctccc ccagtgccac 780 aaaacccgtg gcgacaaaaa acaaaatgcc gttttccaag ctgtttggag cgaaatccag 840 ggaagctaca tcgagtagaa aagtaaactc aaacgattcg ttgtctgctg tggcaaagcc 900 atcgattccg gaccccgtcg acagagcaac aaaaccgaac cctaccgaac aaagtaacag 960 cctcatgcct ccgccagctg tatcgcaagg caatcaatca acatgccgaa aatctccacg 1020 ccaagcaagc gatggaaatg acaccgacaa ctccacgact tcaaacaacg gcaaacgtcg 1080 aagtgctcgt gttccaggga aaaagctgcg acagaactac gatgacgata cgaacgaaga 1140 gatagatgtc cagatctaat ggctttccac agctataatc tcgtatcgat caacatcaac 1200 accatcacaa acccgaccaa aatcaatgcc ctacaaacct tccttcgaaa ctcggagatc 1260 gacatcgctt ttctgcaaga agtggagaac gagcagctcc tcttgcctgg gtacaatatc 1320 gtatgtaacg tagaccatac taggagggga acagcagtag cactgaagga gcacatccaa 1380 tacactaacg ttgaaaagag tctggatgga cgcataatcg cgttaagggt gcaaaacact 1440 acactctgca atgtatatgc gccgtcgggt actgctcttc gaaatgagag agagcgtttc 1500 tacgaaaaca cactcgctta ctatctccgt caccaaacag agcacacggt aatagctggc 1560 gacttcaact gtgtcttgcg acaatgtgat gcgacaggtc acaacaccag ccccgctctc 1620 ctcagaaccg tgcagcaact gcaacttctc gacatttggg aaaaagtgtg cccaaacatg 1680 cccggacaca cgtacgtctc gcataactcg tcatcccgtc ttgacagaat ctacgttagt 1740 cagagcctac gaacccatct aaggtcagca tgtttacatg tttgctcatt ctcagaccac 1800 aaggctttga cagcacgaat ttgtctcccg taccttggca gagaagctgg tcgtggtttt 1860 tggtctctac gtccacatct gctatcacca gaacacctca atgaatttcg catgcgctgg 1920 caatactgga ctcgccagcg tagaaacttc ccttcctgga tccaatggtg ggtttcgtat 1980 gcgaaaccga aaattaaatc atttttccgt tggaaatcga aaacagtttt cgatggattt 2040 cattgggcac accaacggct ttacgttgaa ctgcgacaag catacgacgg ctactatatg 2100 aacccaggaa tgctgtcaac aatcaaccgt gtgaagtcgc aaatgttgac actgcagcgt 2160 aatttcacta aaacattcat gcggttgaac gaatcgtatg ttgctggtga acccatatcg 2220 acgtttcagc taggcgaaaa gcgcaggaaa agaactgtaa taacgcagct tcgagacgat 2280 aacaacgttc ccattgaagg atctgaggct atcgaaaggc atatgcttaa cttctaccgt 2340 gatctgtacg ctgctggtgc aaccgaagat acaacagcga acgaatttgt gtgcgagaga 2400 gtgatcccgg aaaatgatca aatgaacgag tgctgcatga gtgagataac ggttgccgac 2460 gtttttacag caatcaaatc aagcagcccg aacaaatccc ctggtgggga ctcaattccg 2520 cgagaattct atctgagagc gtttgacatc atttggagag aaattactct tatcatgaat 2580 gatgccctgg ctggaaactt cccagctaat tttgttgacg gagtgatagt tctcgtgaaa 2640 aagaagggta atgaccaaac tgctcactct tatagaccga tatcactcct taactgcgat 2700 tataaaatct tctcacgtgt cctgaaatat cgcctcgaga acattatgcg taccaaccat 2760 ataatcagcg atagccaaaa atgctcaaat tttggccgca atatctttca ggcaaccctt 2820 gctctaaagg atcgcattgc gagactgaaa aaaactaaac aacgtggcct cttagcatct 2880 ttcgatcttc ggcatgcctt cgatctggta gaccgtgtat ttcttgctcg gaacatgtgc 2940 tcgatcggtt ttaacccgaa tcttgttcgt ctgctggtca gaataggaga actctcgacg 3000 tcgcgactac tggtgaacgg atacttgtcg gaatcattcc caatagagag atcggttcgg 3060 cagggagacc cactctcaat gcatctattc gttctctatc ttcaccctct catcaagcga 3120 ctcgaagaag ttgggggacc tgatctcgta gtagcatatg ccgatgatgt gtcagttatc 3180 tccacctgca gcgaaaggct ggagcggatg agagaaacgt tccgtcgctt tgaacgtgta 3240 tctgggtcca ggctgagttt gacgaaatcc tcgtcgatat cggtgggtta caccgatgtt 3300 atgcctctca ccgtaccgtg gctgcggacc gaaaacacca ttcggatttt gggagttact 3360 tttgccaact cagttcgtct gatgaacaag ctgaactggg acgagcaagt cagtaaattt 3420 gctcggcttg tttatctcca ctcagcacga acactcaccc tgcaccaaaa agtaacccta 3480 ctcaacacct tcatcacatc gaagatatgg tatttggctt ctacgcttgc tccgagtgcc 3540 gttcacacgg caaagctgac gtcaacgatg ggatctttcc tatggcgggg tataccagcg 3600 agagttccga tgcaacaact tgcgcgcagc aaagaagctg gtggtttaaa actgcacctc 3660 cctaccctca aatgtaaggc gcttctcctg aaccggcata tgcgcgatat cgattccctt 3720 cctttctaca aatccttcct tacccaaaac aatcccgatc taccagcaga ctgtccttgc 3780 ctaaaactta tcctttcgaa ctttcccata ctccctccta gagtccaaga aaacccctcc 3840 gccgatagca ttcacaacgt ttacatcgaa caaactgagg agcccaaaat ttcgcgcgag 3900 catccaacta ataactggcg gcaaatttgg ggtaacatct ctatgcgcca actagcttca 3960 agtcagcgta gttcactgtt tctgttagta aacgctaaac tcgatcatcg taaactgatg 4020 taccgtatta acagaaccga tggagataat tgtctgcatt gcaatagagc aaccgaaaca 4080 ttggagcata aactcagtga atgtgtgcgc gtcgcaccag cttggcgctt gcttcaacag 4140 aaaatttctt ctctgtttaa tggatggcgt ttattgacta taggagaact cctgcgaccg 4200 caactaaatg gtattagaag atggaagaaa gtgaaaattt tgcaaatgtt cattcaatat 4260 gtcttattta tattgcaatg taataacgct atagatttag gctctctaga atttgaaatt 4320 caactcgtat aaataattat tatgtaatta attttttacg ctgaatacaa ataaaaaaaa 4380 ttttatgtta aaaaaaaaaa aaatcaaaaa 4410 // ID R4-1_ED repbase; DNA; INV; 4735 BP. XX AC . XX DT 30-JUL-2008 (Rel. 13.1, Created) DT 30-JUL-2008 (Rel. 13.1, Last updated, Version 1) XX DE Autonomous non-LTR retrotransposon from the R4 clade - a DE consensus sequence. XX KW R4; Non-LTR Retrotransposon; Transposable Element; Ed_LINE2; KW R4-1_ED. XX OS Entamoeba dispar OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-4735 RA Lorenzi H. and Caler E.; RT "Genome wide survey and discovery of repetitive elements in three RT Entamoeba species."; RL Repbase Reports 8(10), 1685-1685 (2008). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 15..1454 FT /product="R4-1_ED_1p" FT /note="ORF1." FT /translation="MQNIQQILLPWLLSNTIGSCKENQTRHRYSSSKKRKD FT FNSNKMHNNYDKRNNRKKGFRKRRTQRKTNKKQGKYNDNKHYTNKKPKWLT FT TKEIQKKIKRITRKWEKGEYEELLKKYEGKEKKWYNILQEVYSIRKKKTLE FT KIKEEFKDKTKDWKSEDIKTIEEEIKNKNPIKEMNDRCKAARIINEKENKG FT KTQDKEEKMEEEGNKEEINTLQNKIKELEETIKELQTKKMEEERTISDEMI FT QKARETVRKEFEEEIEKIQTQINNIQNTCNEIRKENSKLKEENNTLQGKIN FT EANGRRIIEVGGKDEIIRTLRASYERLKKEKDEVLKGAELEKKRADELEKS FT AELLEKNIRDKENQISVYQKSAIQNYEIAEITRKELDHTQRENEEQKQLIS FT KQQESIENFLSQKELEKKKTMTLVINTQPILRTSSSGTIQMNQTTPMKKTI FT LEIEEEESKTLSIKKQEMIISQEEKKTQQIKQIEF" FT CDS 1852..4719 FT /product="R4-1_ED_2p" FT /note="ORF2." FT /translation="MGMVRRIENLQEDMKMISMCMHYYLQNKFIGDKFDEV FT KEILRKINRLEDLVKGKWYRIKCIMRLKEQDADEIKMKAQERIKYLKYMMT FT MIKERIKFNKNERVRINTNKIMFDSNNKIKIKKKDPQNEIYPNNDITLEYW FT KSLYETQVILNKEHWKIKQITSWNERYRNYDSVIITMDDLNLSLMKISNWK FT APGIDTIYGYYWKRMSSSRDSILNIFNEWLNFNENIPLDMVSGRTILIHKG FT GDRNDVTNYRPISCTNVIMKVFTSILKEKIHNRINMNNESLKISKNQLGCK FT LQSLAAKEGLINSYMLKQQKEEKYPKYVESYYDIKKAYDSVNHEWVIEALK FT YFNVEGVIIDIVESMMNRWKIFIGYKYNEYLGCIKLKRGILQGDSLSPLLF FT IIQMNIISQIIEEKFPKANHTLYMDDLRIMTQSSEEMGTIHNEIKEIISGI FT GMEMKNKSGMVLKNINKIPDGMQNIPIIEGEDLYKYLGVWQSDEINDTFNC FT KIIKEKMMKKMEEINTDESSNRSYITRINTEVIPIFRYSASVVNWKVTDLD FT RMDLEIRKYLRKAKYIGAGNSKDRLYVPINEMGEGLISLRDEYVIELIRTV FT MYYCTRESEIGNNIIDRWYSKKGIWNRIKKALGHKIPETKIEEIINEGIEN FT KKEKEVVVKIKKLMTNQYIKKWKSHQTSGHFRRWFESKGVDKTTTINAWKS FT INIKKNAFLQLTKMQDGAIFSGYRKAKILKNERLKYCPLCKDKIATVEHIL FT LSCIGHKKSQMEKHDHIGIIIWEGLIRKYTGNKQYKKPPYETVFQYNDITM FT IWNKQIMPKSDGLYHKRPDIYVLDKKNKTGLIFDMTIVADHNINGAYWKKR FT NMYKELKNRVMKIEKLKDVKIIAVVISINGLVNAESIKLIKQLKIEIDITK FT EIKNLVIKNMMDVMEHCGDHNQTYVVELQDEDEGTGLVSPEPGTIDTSSIN FT T" XX SQ Sequence 4735 BP; 2262 A; 546 C; 834 G; 1093 T; 0 other; agatcctttt atcaatgcag aatatacaac aaattttgtt gccatggcta ttgagtaata 60 ccatagggtc ttgtaaagaa aatcaaacaa gacacaggta ttcaagttca aagaaaagaa 120 aggacttcaa tagcaataaa atgcataata attatgataa aagaaataac aggaaaaaag 180 ggtttagaaa aagaagaaca caaagaaaaa caaataaaaa acaaggcaaa tataatgata 240 ataaacatta tacaaataag aaacctaaat ggttgacaac aaaagaaatt caaaagaaaa 300 ttaaaagaat aacaagaaaa tgggaaaaag gagaatacga agaattactt aaaaagtatg 360 aaggtaaaga aaagaaatgg tataatattc tccaagaagt atattcaata agaaagaaga 420 aaacactaga gaaaatcaaa gaagaattca aagataaaac aaaagattgg aaaagtgaag 480 atattaaaac aatcgaagaa gaaatcaaaa acaaaaatcc aatcaaagaa atgaacgata 540 gatgtaaagc tgcaagaata ataaatgaaa aagaaaataa agggaaaaca caagataaag 600 aagagaaaat ggaagaagaa ggcaacaaag aagaaatcaa cacacttcaa aataaaatta 660 aagaattaga agaaactatt aaagaacttc aaacaaaaaa gatggaagaa gaacgaacca 720 tttcagatga aatgattcaa aaagcacgag aaacagtcag gaaagagttt gaagaagaaa 780 tagaaaagat tcaaacacaa ataaacaata ttcaaaatac atgtaatgaa ataagaaaag 840 aaaactctaa attaaaagag gaaaataata cattacaagg aaaaataaat gaagcaaatg 900 gtagaaggat catagaagta ggaggtaagg atgaaatcat ccgaaccctc cgagctagct 960 atgagaggct gaagaaagag aaggacgaag tccttaaggg tgctgagctg gagaagaaga 1020 gggctgatga gctggagaag agtgctgagc tgctagagaa gaacataaga gataaagaaa 1080 atcaaatctc tgtatatcaa aaaagtgcga ttcaaaacta tgagattgca gaaataacaa 1140 gaaaagaatt agaccataca caaagagaaa atgaagaaca aaaacaatta atatcaaaac 1200 aacaagaaag tatcgaaaat tttttatccc agaaggaact ggagaagaag aaaacaatga 1260 ctctagtgat aaatacacaa ccaatactga gaacatcatc atctgggaca atacaaatga 1320 accaaactac ccctatgaaa aagacaattc tggaaataga agaagaagag agcaaaacac 1380 tgagtatcaa aaaacaagaa atgataatca gtcaagaaga gaagaaaaca caacaaatca 1440 aacaaataga attttaattg atgtgttgta ttgttttgca attaatttgt taaataagaa 1500 attaaataag aaattaagat gggaacaaat agaaagcata ataaaagtaa taaacattaa 1560 tgtaaataat gataaagaat acattaataa tattaaaaag tcacatagaa taaaagaaac 1620 aactaaaaga tatattaaaa tgaaagtgaa ctccatcata gaagggaaag tagagtgtaa 1680 tccaataata aaaaacatta tcataaaaaa taactctact gatattgatg atgaatatct 1740 taactctatc atcactcaag ttaaagaaat gaaaatagat aaatgtgtaa atattaatta 1800 ctactggaaa aggaaagaat cagataaatt aataagagaa ataaataata aatgggtatg 1860 gtcaggagaa tagaaaatct gcaagaagat atgaaaatga tttcaatgtg tatgcattac 1920 tatctccaaa ataaatttat aggagataaa tttgatgaag taaaagaaat cctaaggaaa 1980 ataaatagac tagaagatct agtcaaggga aaatggtaca gaataaaatg catcatgaga 2040 ttaaaggaac aagatgcaga tgaaattaaa atgaaagcac aagaaagaat aaagtaccta 2100 aaatatatga tgacgatgat taaagaaaga attaaattta ataagaatga aagggtcaga 2160 atcaatacaa ataaaatcat gtttgattct aataacaaaa tcaaaataaa gaagaaagac 2220 ccacaaaatg aaatataccc aaataacgat attactttag aatattggaa atcgttatat 2280 gaaacacagg tgatattaaa taaagaacat tggaaaataa aacaaatcac ctcgtggaat 2340 gaaagataca gaaattatga tagtgtcatc atcacaatgg atgatttaaa tctaagcctg 2400 atgaaaataa gtaattggaa agcaccaggt attgacacta tatatgggta ttattggaag 2460 agaatgtcat catcaagaga ttccattctt aatatcttca atgaatggtt aaacttcaat 2520 gaaaacatcc ctctagacat ggtaagtgga agaactatac taatccacaa aggaggtgat 2580 aggaacgatg tcactaatta tcgtcctatc agttgtacaa atgtaataat gaaagtattt 2640 acctccatat tgaaagagaa gattcacaat agaattaaca tgaataatga atcattaaaa 2700 attagtaaaa atcaattggg atgtaaactg caatctcttg ctgctaaaga aggattaatt 2760 aacagttata tgttaaagca gcaaaaagaa gagaaatacc caaaatatgt agagtcatat 2820 tatgacatta agaaagcata tgactcagta aatcatgaat gggtaataga agcacttaaa 2880 tattttaatg tagagggtgt catcatagac attgtagaaa gtatgatgaa cagatggaag 2940 atattcatag gatataaata taatgaatat ctaggatgta ttaaattgaa gagaggaatt 3000 ttacaaggtg attccttatc ccctctactc ttcatcatcc aaatgaatat aatttctcaa 3060 atcatagaag agaagttccc aaaagcaaac catacattgt atatggatga tttgagaata 3120 atgacacaaa gtagtgaaga aatgggaaca atccacaatg aaattaaaga aataataagt 3180 ggaataggga tggaaatgaa aaataaatca ggaatggtat taaagaacat taataaaata 3240 ccagatggaa tgcaaaacat cccaataatt gaaggagagg acttgtataa atatttaggt 3300 gtatggcagt ccgatgaaat taatgatacc ttcaattgta aaatcatcaa agaaaagatg 3360 atgaagaaaa tggaagagat aaatactgat gaaagtagta atagaagtta tattacaaga 3420 attaatactg aagtaatacc catcttcaga tattcagcat cagtagtaaa ttggaaggta 3480 actgacctag atagaatgga tttggagata agaaaatatt taagaaaggc aaaatacatt 3540 ggtgccggta actccaaaga ccgtctatat gtccctataa atgaaatggg tgaaggatta 3600 ataagcctaa gggacgaata tgttattgaa ttaattagaa cagtaatgta ctattgtaca 3660 agagaaagtg aaataggaaa taacataata gatagatggt attccaagaa aggaatatgg 3720 aatagaatta aaaaagccct aggtcataaa atcccagaaa cgaaaataga agaaataata 3780 aatgaaggga ttgaaaataa aaaagaaaaa gaggtagtag taaaaataaa gaaattaatg 3840 accaaccaat acatcaaaaa atggaagtcc catcaaacaa gtggacattt tagaagatgg 3900 ttcgagtcaa aaggggtaga taaaactact acaataaatg catggaagtc catcaatata 3960 aagaaaaatg catttctcca attaactaaa atgcaagatg gagcaatatt tagtgggtac 4020 aggaaagcaa aaatattaaa aaatgaaaga ttgaagtact gcccactttg taaagacaag 4080 atagccacag tagaacacat tctactatct tgtattggtc ataagaaatc acaaatggag 4140 aaacatgacc atattggcat tatcatatgg gaggggttaa taagaaaata cacaggaaat 4200 aaacaataca agaaaccccc atatgaaaca gtgttccaat acaatgacat cactatgata 4260 tggaacaaac aaataatgcc aaaaagtgat ggactatatc ataaaagacc agatatatat 4320 gtactagata aaaagaataa aactggtcta atatttgata tgactattgt ggcagatcat 4380 aatattaatg gagcctattg gaagaaaaga aatatgtata aggaattgaa aaatagggta 4440 atgaaaatag agaaattaaa ggatgtcaaa ataattgcag tagtaatttc gattaatggg 4500 ctagtaaatg ccgaaagtat taaattaatc aaacaattaa aaatcgaaat tgacatcact 4560 aaagaaatta aaaaccttgt aattaaaaac atgatggacg tcatggagca ctgtggagat 4620 cacaaccaaa catatgtagt tgaactacaa gacgaagacg aagggacggg attagtctcc 4680 cctgagccag gaacaataga cacaagttct attaatactt aattaacttc ttttt 4735 // ID Gypsy-17_SI-I repbase; DNA; INV; 4594 BP. XX AC AEAQ01023932; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_SI_; KW Gypsy-17_SI-LTR; Gypsy-17_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4594 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023932; Positions 10018 5425. XX CC Positions [3280-3765] - Integrase core CC 'CACC' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 370..1437 FT /product="Gypsy-17_SI-I_1p" FT /translation="MRLIPGRLKGHARLWYDTRPRLAITWKEAKKTLMKQF FT RKSAPFSKLFKEAANYESAPGQALGDYCFHKLNKMRKLEIDIPDKYVIDAV FT IGGITDENVARTVRSAQHRDANKLYVYMTTLGNLPARDERHKAAEKNNRQD FT GGSKSAFKNKTHATKNETKTSLSDTTTDDKRADEKPRVECFNCGKPGHIAK FT KCRKPRVECGKCNRLGHTADQCIVKKDVNVVNETHDTPNSYERPIFINGRK FT LRGLIDTGSSSTLLRESIVNKYDMAVSTTSSRVLRGFAGQTTRSNRSALCD FT IRIMNATARVDAIVVPDNYLIRDVIIGRDFLEQEHIVMIKKGNTLTFRQLT FT YGRRKTTLNASST" FT CDS 2623..4140 FT /product="Gypsy-17_SI-I_2p" FT /translation="MLPLLRVGNNGVLAIRYFRVYLLGITFKVVTDCNALR FT TTFAKRDLLPRVGRWWLEVQEYTFEVEYRAGTKMTHADALSRNPVLLSLEI FT AQVNVTEGDWVLAAQLQDDQLSRIREILLKGRPSHERKHYFDEYVIRDDKV FT YRRLDDKTSAWVVPRDARLQICKLCHDDAGHLGIEKTLEKIKRNYWFSHMR FT CFVTKYVRACLSCAYYKHTPGKKQCKLNIIKKIPTQFHTLHIDHVGPFKTS FT RKHNKFLLVIVDAFTRFVIIEPVKSQKTSYVTKVLTSLIYLYGVSTRIISN FT RGTAFTSQTFSTFCKSYGVKHVLNAVATPRANGQCERYNKTIVQALATTTV FT GWDPRKWDSAAKQVQSALNTAHNKGINTTPMKALIGCETKCATEGLLLSQV FT QDVLHRLDLDEIRADIEAHVNKEQQAQKERYDKTRRDATTYENGNLVMVQI FT TSEPATSSSGKLHPKFKGPFRVRKVLVNDRYEVEDLREGYRRGRTVVAADR FT MKPWITIQGE" XX SQ Sequence 4594 BP; 1473 A; 1041 C; 1121 G; 959 T; 0 other; tatcagaagt gggattaacg acacacgtgc ttaaggaccg gtttgattac gacgatacag 60 tgacaagcaa ttatgggacg ccataagtct cgttacgagt ccgatcgaag tggttcgaat 120 cagaagagac gccatgactc ttcgtcgtcg aacgaggaga ggcaaagaag agacgtccgt 180 cgagagaaat caagacacga ggcgggcggc cgacatcgac gatcgcattc gaggggctca 240 tgcatacacc gaggagatga gttaatgatt ccgatcttcg atccctcaac ggacgactta 300 gtcattgaaa aatagattga gtacgtagac gatcttgccg aacaatacga ttgggacgac 360 cactcgatta tgcgcctaat accggggcgt ttgaaaggac acgcaaggtt atggtacgat 420 acaaggccac ggcttgccat cacgtggaaa gaagccaaaa aaacgttgat gaaacaattt 480 cgtaagtccg ctccctttag taaacttttt aaggaagcag caaattatga gagcgctccc 540 ggccaggcgc tcggagacta ttgtttccat aagcttaaca aaatgcgtaa attagaaata 600 gacatacccg ataaatatgt catagatgcg gtgataggcg ggattacgga tgaaaacgtt 660 gcaagaacag tacgatccgc acaacaccgt gacgcaaaca aattgtacgt atatatgacg 720 acgctcggta acctaccggc aagggacgaa agacataaag ctgccgaaaa aaacaaccgg 780 caagacggag gttcgaagtc cgcgttcaaa aataagacac acgcgacaaa aaacgaaaca 840 aagacatcgc ttagcgacac tactacggac gataaacggg cggacgaaaa accacgtgtc 900 gaatgcttca attgcgggaa acccggacat atcgcgaaga aatgcaggaa gccgcgcgtg 960 gagtgcggga aatgcaatcg attgggacac accgccgacc agtgtatcgt aaagaaagac 1020 gtgaacgttg tcaacgaaac gcacgataca ccgaattcat acgagcggcc gatattcatt 1080 aacgggcgta aactccgggg tttaatagac acgggtagca gtagcacact gctgcgagaa 1140 tctatcgtaa ataaatacga tatggcggtg tcgacgacgt cgagtcgcgt attgagaggt 1200 tttgcgggac aaacgactag gagcaatagg tcggcactat gcgacatcag aattatgaat 1260 gcgacggcac gcgtcgacgc catcgtggta cctgacaatt acctcattcg tgatgtcatt 1320 attggtcgcg attttctcga acaagaacat atcgtaatga ttaagaaagg aaatacgtta 1380 acgttcaggc aactgacgta cgggcgaagg aagacaacgc taaacgcatc gtcgacgtaa 1440 atctttctca cgtaaccaaa gatactacga ttacatcgct acgggtaaaa aataagcggg 1500 gagacgagac gacagtgcat aggtttatta cacgaattta gggactgcat ctcattctcg 1560 ctggccgatc taggcaagac agacgcggca tcgttaagca ttcgttgcct atcagatgcg 1620 ccgatcgtat atcgcccata ccgtctaccc gaaagcgaaa aacagatagt acagggcatt 1680 acccacgacc tcaaaattaa taacgtgata cgcgattcaa actcaccata cgccagcccg 1740 atattgctag tgaagaaaaa gaacaacgaa taccgcatgt gtatagatta caggaaactc 1800 aacgcaatca cgataaagga taagtaccct atgccattga tcgaagaaca gatcgataaa 1860 ttgggaggct gttgatacct cacaggattg gatctagcgt ccggatatta tcaagtgccg 1920 atggcagagg actcgatcaa gaaaacagct ttcgtgacgc ctcaaggtca ctacgaattt 1980 ctacgcatgc catttggcct gacaaacgca cccgcagtat tccaacgact gatggacagg 2040 gtgttaggcg acctgaaaaa ttcgatagcg tttccatatt tggatgacgt gattatacca 2100 tccaaaaccg tggaggaagg tttgacgcct acgacaagtg ttaaacgcat tccgcaagca 2160 tcacttgaca ctcagattaa ataaatgttc tttctttgca gactcaattg aatatttagg 2220 tcgagagatt agtgaacaag gggtccaacc tggtcggcgc aaggtagagg cactgataca 2280 catggcagca ccccagtctg taaaacaagt tcgccagttc ctgggactag ccagttactt 2340 caggagattt atcaaaaatt ttgcaacact actggaacct ttgacgagac tcactaagaa 2400 gaacgtaccc tgggagtgga aggatccaca agaacaggcg tttaacataa tcaaggacaa 2460 actggctact cgaccagtat tgacgatatt caatcctaac agacccactg aattgtacac 2520 ggacgcgagc gcaataggtg tgggagcgct tctgttgcag caagtcaatg gtaagatggc 2580 agcggtggcc tatttcagca agcaaaccac cgccgatcaa cgatgctacc actcctacga 2640 gttggaaaca atggcgtttt agccataagg tactttcgag tatacctact gggtataacc 2700 ttcaaggtgg taacagactg caacgcgtta agaacgacat ttgctaaacg ggatctacta 2760 cctcgagttg ggcgttggtg gctcgaggta caagaatata cattcgaggt cgaatatcga 2820 gctggaacta agatgacaca cgcagacgcg cttagccgaa accccgtgct gctctcgctg 2880 gagatagcac aggtaaacgt aacggaggga gactgggtat tggcagcaca attacaggac 2940 gatcaactgt cacgaattcg ggaaatcctc ctgaagggaa ggccctcaca cgagaggaag 3000 cattattttg acgaatacgt aattagagac gataaggttt acagaagact tgacgataaa 3060 acttctgcct gggtagtacc acgagacgcc cgtctacaaa tttgtaaatt gtgtcacgac 3120 gacgctggac accttgggat tgaaaaaacg ttggaaaaga tcaagcgtaa ctactggttc 3180 tcgcacatga gatgcttcgt gacaaaatat gtacgggcct gcttaagctg tgcttattat 3240 aagcacacgc ccggaaagaa gcaatgcaag ctaaacataa tcaagaaaat accgacgcaa 3300 tttcacaccc tacacatcga tcacgtcggt ccctttaaga cgagccgcaa acacaataaa 3360 tttctcttgg taatagtcga cgctttcact agattcgtca tcatagaacc tgtaaaaagt 3420 caaaagacaa gctacgtaac caaagttcta acgagcctta tttatctgta tggggtttcc 3480 acaagaatca ttagtaatcg agggacggct ttcacatctc aaacgtttag cacattttgt 3540 aaaagttacg gagttaagca tgtattaaac gcggtagcaa caccgcgtgc taatggacaa 3600 tgcgaacgat acaataaaac aattgtacag gccctggcta ccacaacagt tggatgggat 3660 ccacgaaaat gggactcagc tgccaagcag gtacagagtg cactaaacac ggctcacaac 3720 aaaggcatta atacaacccc gatgaaagca ctcatcgggt gtgaaacgaa gtgcgcaaca 3780 gagggactgt tgttgtcaca agtgcaggac gttttacatc gcctagattt agatgagata 3840 cgtgctgaca ttgaagctca tgtcaacaaa gaacaacagg cgcagaaaga acgctacgat 3900 aagacacgcc gagacgcaac tacatacgaa aatggcaact tggtaatggt acaaattacc 3960 agcgaaccgg caaccagtag tagtggaaag ctgcatccga aattcaaggg accattccgc 4020 gtccgaaagg tactcgtcaa cgaccgctac gaagtagagg atttgcggga aggatataga 4080 cgaggcagaa cggtagttgc tgccgatcga atgaagccgt ggatcacgat ccaaggcgaa 4140 tagtgagcaa acagcgattc acttcacgaa ctggtatcag agcacagtac ggtgcttatc 4200 taaaccagtt actacgatct ggtcttagtg ggatacaacg gtactctgct agatcagata 4260 acacgaactg attttggcca gaccgtgtgc tcgaccgtgt gcgaatcagt tataaacgga 4320 ctaatttcaa cgggtgcaac aacatctgtt tgaattagtt aagacgacct aactccggcg 4380 ggcgcgacgg cgtccacctg agctagtaaa ggacgagcta acctcgggca acacgacggt 4440 acccaaggtt ggctactaga cgtcaacacc gacgagcgtg acaacactta ttcggctcaa 4500 ccaagacgaa accggttaag gtcattcgac tgaaattgaa gaaaccgcgc aattcatgtc 4560 tgaggacaga cactgggagc aggaaaggcc gaaa 4594 // ID I-1_AA repbase; DNA; INV; 5258 BP. XX AC . XX DT 27-JUL-2009 (Rel. 14.07, Created) DT 27-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Aedes aegypti Nimb non-LTR retrotransposon - consensus sequence. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; I group; KW I-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5258 RA Kapitonov V.V. and Jurka J.; RT "Nimb - a novel clade of animal non-LTR retrotransposons."; RL Repbase Reports 9(7), 1535-1535 (2009). XX DR [1] (Consensus) XX CC Nimb is novel clade of I-like non-LTR retrotransposons. It CC includes families of retrotransposons present in fish, molluscs, CC sea squirts, sea urchins and insects: I-1_DR, I-3_DR, I-5_DR, CC nimbus, I-3_AC, I-4_AC, I-1_CI, I-1_SP, I-1_AA, I-1_BM. I-1_CI is CC a family of tunicate Nimb non-LTR retrotransposon. The consensus CC sequence was derived from multiple alignment of 10 copies ~99% CC identical to each other. The 3' terminus is composed of the CC (TAAAA)n microsatellite. XX FH Key Location/Qualifiers FT CDS 181..1344 FT /product="I-1_AA_1p" FT /note="ORF1." FT /translation="MAPPKGGGKKKKATEVQPKLDGHREGPKFIVIKRKDL FT KDGSFEKVSPIFIHKGIKSICGEPLNIIKLRDGTLLVKTKNISQANQLLRG FT KMMFDMEIAVEENAKLNQTKGIVTCADLRYATEEEIMEDLQEQGVSNVEVM FT KRKRDGKLVATNSYILTFKSSGVPESVKIGYHVLSVRLFIPRPMRCYKCQF FT FGHSSKFCLKEEVCSNCCQVGHTSNDCSGKTHCRNCSSSEHASWSTKCKVF FT EAEQEITRIKVTENVSIREARQLYKARYPTITHAFSEIVRNSQVQVQNSIS FT QIPVDEPKALSISSPSQQQRVDSQFALPSSVHAGSDHVGAIGMEYQVAKAS FT RPLSQDTNELDEHELDVKRMRAAPSSSSGSSRSSDYDISLASNGL" FT CDS 1316..5014 FT /product="I-1_AA_2p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="MISPLPVMDCKNSIVQWNCNGFFNSFQEIKVLDAQYN FT PMVICLQETHLKHRNSASFKNFDLYRCDFTSDNDRARGGVMTLVRSQFVSR FT RFNVDTVLQVVVVELLYPFKFTLCNIYLPPNELIEYDELAKLIIQLPKPFI FT ILGDFNAHHSLWGSKVNSTRGKLIENLLSNFDLNIFNDGKGTRLNSSNGNE FT TCIDLTICSASLSTLFSWNVLDDTCNSDHYPIFITPLNTRTVDEKRPHWIL FT KTADWEKFSLIANFDDAFFNKSIDDQNAHVVETIINAANHCIKKSSAKAGR FT KSVPWWNDTISKAVKDRTHSLNYFKKHPTSENEKRYKILKSKARALMRYHS FT KQSWRNMVASINSNVPQTLIWKQIRSISGRHQSNLITALKHNDSILSSRTD FT IVELLGKHFSEISSSKNYSSSFQRHKSIIEKSPLIITNNEKAYNKSFTTFE FT FKSALNSTKGSSPGFDMIHYEMLKHLSESGKRYLLKFYDRIWTQQIFPNEW FT RLAILIPINKPGKDSSLASSYRPISLTSCLCKLLERIVNKRLIWHIETNHI FT LSNYQFGFRRGRSTTDSLTIIESDIQEALLKKHHTYAIFFDLEKAYDLTWR FT RHVLNKLKYIGIDGNMLCFINNFLKNRKFRVLQGNTFSAPYELENGIPQGA FT VLSVTLFLLAIDDIFQQIDSKCKMLLYADDIVLYMSSNFLNSIKKRIQSSI FT NKMQKWLSNHGFRISISKTSCVHFCRRRKHSTNLVFRINDENIKEVEEVRF FT LGMIFDRKLSWRTHIANVKKRCLKAMNIIKTLSNIKWGSDRKTLIKLHNAI FT VLSRLEYGDCVYSSASSNLLKSLESVHHQGLRLASGAFRSSPVASILVDTG FT FLPLSSRRDQHTLFQMSRILKNQTHPMLELMQNLSAGRNVKDRTFYSRCTK FT ILNRLNVELPTIKTEHYSQVSPWLLSEPDINIDFSVFRKTTTNPKVYMSMF FT NQIVHKYSNYRFLFTDGSKLNSITSCAIFDESKQITKKIRLPNNCSVYNAE FT LTAILHAIEEIIFTDYDKFIIITDSLSALLSLSDPFSNHPIIQNIFDNISK FT CQQKNKTIKFYFVPSHMGIRGNEIADKAAKEALSESDQQGELLAKDFKNFT FT KMRFSQFWENVWKNITDNKYRQIRDSTHYWPCINTLGKRDSVVITRLRLGH FT THYTHSYLMSKDDPPICDSCKQTITVRHIFDECSKYSMIRRKLCIHGIDTL FT KDDLVGMKNVIRFMKETELYNLI" XX SQ Sequence 5258 BP; 1760 A; 992 C; 931 G; 1575 T; 0 other; cattcgcagt taaactctcg ttcggatagg acgtatatca agtgtgctcc tctattgatt 60 ggaaatatcg ttctctactt tttttattcc gacactccgc tttgttaagt attactctat 120 tgtttttgtg aaggtcgtga cattccaact gctgtcgccg agaacagttc cgccaataaa 180 atggcccctc caaagggagg aggtaagaaa aaaaaggcaa ctgaagtgca acccaagctc 240 gatggtcatc gggaaggtcc taagttcatc gtaatcaaac gcaaggattt gaaggatggt 300 tcttttgaga aggtcagccc gatcttcatc cacaaaggga tcaagagcat ctgcggtgaa 360 ccacttaaca ttatcaagtt gagggatgga acgttgttgg tcaaaacgaa aaacatcagc 420 caagccaacc agcttcttcg tggtaagatg atgtttgaca tggaaattgc tgtcgaggaa 480 aacgccaagc tcaatcagac aaaaggaatc gtaacatgtg ctgacctcag gtacgctacg 540 gaagaggaaa ttatggaaga tcttcaagag caaggagtat ccaatgtcga agtcatgaag 600 cggaaaaggg acggtaagct ggtcgccacc aacagctaca ttctcacatt taagtcttcg 660 ggagtacctg agagtgtcaa aatcggatac catgttctga gtgtgcgttt gttcattccc 720 cggccgatga gatgctacaa gtgtcaattc ttcggtcact cgtccaaatt ttgtttgaag 780 gaagaggtat gtagtaactg ttgtcaggta ggtcacacgt ccaatgattg ctcaggaaaa 840 acacactgta ggaactgtag ctcctcagaa cacgcgtcat ggtccaccaa atgcaaggtg 900 tttgaggccg aacaagaaat cacacgtatt aaggtcaccg agaacgttag cattagggaa 960 gctcgacagt tatacaaagc taggtacccc accatcacac acgcttttag tgagatagtc 1020 aggaacagtc aggtacaggt acagaactcg atctctcaaa ttccagtgga tgaaccaaaa 1080 gccttatcga ttagctctcc ttctcaacaa caacgagtag actctcaatt cgctttgccg 1140 agctcagtgc atgctggctc agatcatgtt ggtgcaatag gcatggaata ccaagtagcg 1200 aaagcatccc gtcctctctc tcaagatacg aatgaattag atgaacatga actcgatgtg 1260 aagagaatga gagctgcccc atcttcttca agcggtagta gtagatcgtc tgattatgat 1320 atctcccttg ccagtaatgg actgtaaaaa ctcaatcgta caatggaatt gcaatggatt 1380 cttcaatagc tttcaagaaa ttaaagtctt ggatgcccag tataatccaa tggtaatttg 1440 cttacaagaa acacacctta aacatcgtaa ctcagcttct tttaaaaatt ttgatctcta 1500 tagatgtgat tttacatctg ataatgaccg agcgcgtggc ggtgtaatga cattggttag 1560 atctcaattc gtctctcgtc gtttcaatgt agatactgta cttcaagtgg ttgttgtcga 1620 actattatat ccatttaaat ttaccctatg taacatttat cttcctccca acgaacttat 1680 cgaatatgat gaacttgcta agttaataat tcaattgcca aagccgttta ttattctcgg 1740 agattttaat gctcaccatt ccctttgggg ttcgaaggtg aattcaaccc ggggaaagct 1800 tatcgaaaat ctgctttcta atttcgacct taatatcttc aatgatggca aaggaacacg 1860 tttaaattct tccaacggta acgaaacatg catagatctt actatctgct cggcttcttt 1920 aagcaccctt ttctcatgga atgttcttga tgatacctgt aacagtgacc attaccctat 1980 tttcataacg cctctcaaca ctcgtacagt tgatgaaaag cgacctcatt ggattttaaa 2040 aacagctgat tgggaaaaat tttcactaat cgcaaatttt gatgacgctt tcttcaataa 2100 atcaattgat gatcagaatg cccacgttgt tgaaacgata ataaatgcag caaatcactg 2160 tattaaaaaa tcttctgcca aagctggacg aaagtcagtt ccatggtgga atgatacaat 2220 atcgaaagca gtcaaagata gaacacatag tttaaactat tttaaaaaac atccaacatc 2280 tgaaaatgaa aaacgatata aaatccttaa atctaaagca agggctctca tgcgatatca 2340 cagtaaacaa tcatggagaa atatggtagc ttcaattaac tctaatgtcc cgcaaacact 2400 tatttggaag caaataagga gtataagtgg gagacaccaa tccaatttaa taacagcatt 2460 aaaacataat gactccattc tctcaagccg tactgatatt gtcgaactct taggaaaaca 2520 tttctcagaa atatcttcat ctaaaaatta ctcctcttca tttcaaagac acaaatcaat 2580 aatcgaaaaa tctccactaa ttattacgaa caatgaaaaa gcctataata aatcatttac 2640 gactttcgaa tttaaatcag ccctaaactc gactaaagga tcctctcctg gatttgacat 2700 gattcattat gaaatgttga aacatttatc tgaatcagga aaacgatatt tattaaaatt 2760 ttacgatcga atatggacac aacaaatctt tcctaatgaa tggcgactag ccatattgat 2820 tcctattaat aagcctggaa aagactcttc tttggcttcc agctatcgtc cgatatcgct 2880 gacaagttgt ctttgcaagc tccttgaaag aattgttaac aaaagattaa tctggcacat 2940 tgaaacaaat catattttat ccaactacca atttgggttt cgccgcggac gatcaacaac 3000 agacagttta actatcattg aatccgatat ccaagaagct cttctcaaaa aacatcatac 3060 ttatgcgata ttttttgact tggagaaagc ctatgacttg acttggcgtc gccatgtttt 3120 gaataaattg aaatatatag gaatagatgg taacatgtta tgctttataa ataattttct 3180 caaaaatcga aaatttcggg ttctgcaagg aaacacattt tcagctccat atgaattgga 3240 aaacggtatc cctcagggtg ctgttcttag cgttactttg tttctactag caattgatga 3300 tatattccag caaatcgact ctaaatgtaa aatgttgctt tatgccgatg atatcgtgtt 3360 gtatatgtca tcgaacttcc tgaattcaat taagaaacga atccaatctt caatcaacaa 3420 aatgcaaaaa tggctctcaa atcacggttt tagaatatca atatcaaaaa catcttgcgt 3480 tcatttctgt agaaggagga aacacagcac aaatttggtt ttccgtatca acgatgaaaa 3540 tatcaaggaa gttgaagaag taaggtttct tggaatgatt tttgatagaa aactatcgtg 3600 gcgaactcac attgcaaatg ttaagaaaag atgtttaaaa gctatgaata ttatcaagac 3660 attgtcgaac attaaatggg gatctgatcg aaaaacatta attaagttgc acaatgctat 3720 tgtattatca cgtttagagt atggagactg tgtatattcc tctgcatcat caaaccttct 3780 gaaatctcta gaatccgtcc atcaccaagg ccttcgctta gcaagcggag ccttccgatc 3840 atctccagta gctagtattt tagtggatac tggcttttta cctttgagtt caagaagaga 3900 ccagcatacc ctatttcaaa tgtcaagaat tttgaaaaat caaacccatc ctatgctaga 3960 gttaatgcaa aatttgtcgg caggccgtaa tgttaaagac cgaacttttt actctaggtg 4020 cactaaaatt ttaaatcgtt taaatgtaga gttaccaact attaaaacag aacattatag 4080 tcaagtttct ccatggttgc tttctgaacc cgatatcaat attgatttta gtgtttttcg 4140 taaaacaact acaaatccta aggtatacat gtccatgttt aatcaaattg tccataaata 4200 ttccaattat cgttttttat tcacagatgg gtcgaaatta aattcgatca cttcttgtgc 4260 aatatttgat gaaagcaaac aaataacgaa gaaaatcaga cttccaaata attgttctgt 4320 gtataacgct gaactaacag ctatactaca tgcgatagaa gaaatcattt tcactgatta 4380 tgacaaattt attataataa ctgattcatt gagcgcattg ctatcactat ctgacccctt 4440 ttctaatcat cctatcatac aaaatatttt tgataacatt tctaaatgcc aacaaaaaaa 4500 taaaacaatt aaattttatt ttgttccaag ccatatggga attcggggca atgaaatcgc 4560 ggataaagcg gcaaaagagg cattatctga gtcagatcag caaggagaat tattagctaa 4620 agatttcaaa aattttacta aaatgcggtt ttcacaattt tgggaaaatg tatggaaaaa 4680 tatcacagac aataaatatc gccagattcg tgattcaaca cactattggc cttgtattaa 4740 tacgcttggt aaaagagatt cggttgtcat aacgagatta agattaggcc atactcatta 4800 cacccattct tacctgatgt ctaaagatga tccgccaatc tgtgattctt gtaagcaaac 4860 cataactgtc aggcatatct ttgatgaatg ctcaaaatac tccatgataa gaagaaaatt 4920 gtgcatccat ggtatagata ctctcaaaga tgatttagtt ggtatgaaaa atgttattcg 4980 ttttatgaaa gaaactgaac tgtacaatct aatatagaac aaaacattga aaaaagaata 5040 tatgaaatta atagaaagca ttgtatgttt cctaaagtta aaggtgttaa gttaggtttt 5100 taattattca ctttgttatt attatagtat tagtgttatt gtattattag tgtatcttta 5160 tatttattgt attaatcatt gaaatattgt atatataact gccctctgac ggcccatata 5220 gccctagttg cgctggtggc cgaccaaata aaataaaa 5258 // ID BEL-124_AA-I repbase; DNA; INV; 6364 BP. XX AC supercont1.251; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-124_AA_; KW BEL-124_AA-LTR; BEL-124_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6364 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.251; Positions 1417270 1410907. XX CC Positions [5359-5976] - Integrase core CC 'ACTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 37..6330 FT /product="BEL-124_AA-I_1p" FT /translation="MPSRGCIRSGVKGDKGARKKSDEGGAVGGQSAGDDEI FT PEVLVVDETLEGYSCKTCREADTKEMVQCDKCDKWFHFSCVGVSGDIADKS FT WSCTNCVTATWIQRTKAALENVTKRDDDNLDRKSTKSQPIGSIATSTRANL FT VIPSEVHNTASLDRNLVNSMHKPPTEVASGQTNPDQQKQQSDRANRIADVS FT AVDAAKALSEISFSSSQKSAVSRAKLRLMRLQEERDFQQQQEERRKIAEEK FT AAQEHRTYLEEKYRILEEALSEKGSRRSEGGSKSCSDASSRVSEWIQRARQ FT VQQIQPEVDVIRSLQFPIGSSTEPTHASGRFGQQRIRFEPSAQTGLGSHAE FT TGQRINQAFHSAAETITATGQPSQSRLSNHFRQEVLQTNPVGNPFASSTRP FT PFNSTTFDGQVLAPPACSTGVNQQNLPSGVRASQRSFQEESQDEFQLSRSQ FT VAARQAVPRDLPTFSGNPEEWPVFLSMFNRTTTMCGFTEEENLVRLQKSLK FT GKAYEAVKSRLMFPGNVSGILATLRMLFGQPEVIVQSLIEKVSSLPAIRED FT KLDSLVDFAVHVQNFCATVDACGLQEYMYNVTLLHQLVSKLPPSLKLNWAQ FT YRLTMAAVNLATFSSWIYALAEAASIVSFPPSVQPEKPRNEIRGSKKGNGY FT LNAHSETPSSEENIETSPKELSTLKPVPAKQVCPICNGSCKSADKCKRFVD FT LSREARWAAIREFSLCRTCLRMHKGNCNGKPCGKNGCTYRHHELLHNDSKD FT KPQKTAAAVATGNESEPTALPCSNLSGCNTHRSVASTVLFRYLPVVLYGKT FT TVVRTHAFLDEGSELTLLDQEIADILELEGAERPLCLRWTGGTERCEPNSQ FT AYNLQISGAKEGSKRFDLNDVRTVKDLKLPQQTMDMVKFGKEYPHLRDLPI FT DSYRDIRPRILIGTKHAHLGLVLKSREGEFGQPIAVKSRLGWTICGGQGAD FT RGDNLHHYSFHICPCNTEADENLHQAMKDYFSLDSLGVTKPCKPMLSIEDQ FT RAMTLLQSLTHRRDDRYESGLLWRYDDSCLPDSRSMALRRLQCLRKRMERE FT PDLRTTLQTKIEEYLAKGYIRLLSEEEINEKVPRRWYLPVFPVTNPNKPGK FT IRIVWDAAASAHGTSLNSALLKGPDLLCSLLTILLQFRERRIGLTGDIREM FT FHQVLIRAEDQFCQCFYWLNQNGETQVYVMQVMTFGACCSPTTAQFVKNTN FT ADRFVNEYPSAHQAITKSHYVDDMLVSLDTEEQAIQLAKDVRHVHKQGGFE FT IRNWISNSIVVLAALQETDTDEKCLDLSSELATEKVLGMWWNTTDDIFIYK FT VGWNRYDPALLGGERRPTKREVLRVLMTIFDPLGLISHFLSYLKILLQQIW FT RSGVQWDEEIDDDAYDKWLTWLKVLPCVEQVRIPRCYNSKYLMSEADEVQL FT HTMVDASENGTAAVCYLRFIREASVSCSIVAAKCRVAPLKFTSIPRLELEA FT AVVGARLSRSVQESLTFRIHRKLYWSDSRDVLCWINSDHRRYSQYVGHRIS FT EILETSEAQEWRWVPGKKNPADDATKWNSLPELSSEDRWFKGADFLWRPEA FT DWPTMHSRHDSTDTELRPSLLAHHAPPEPVICVNNFSTWERLRKVVAYVHR FT FFNNCRHKRRGKTTETGPFSVKQLLIADTFLIQLCQREAYPDEIAVLQNTP FT QRPETTKTIPKTSALYQLTPWLDGNGIMRMRTRIAACHYASDDAKNPIILP FT HKHHITHLIVAHYHRKYHHQNHETVINEIRQKFRIPRIRVCYQQVRRDCQQ FT CKNQRAAPNPPFMADLPPARLAAFSRPFTHVGIDYFGPVEVTVGRRVEKRW FT ILLATCLTIRAIHLEVVHSLNTSSCIMAFRNFMARRGTPQKVYSDRGTNFV FT GANKVLKEVNEVMNEREFMNEFGKSAIEWVFNPPLTPHMGGSWERLIRTVK FT NNLAVVCLSARPSDEVLRNLLTEIENTVNSRPLSHVPIDDDSAPALTPNHF FT LLGSSNGSKPLTDIDDSSAALRQNWLTSQILANQFWKRWVTDYLPEITRRS FT KWFQRPQPIAVGDIVVIADPKLPRNCWPTGKIIAAKVSRDGQVRSATVRTS FT SGIYDRPATKLAVLDVRCDGE" XX SQ Sequence 6364 BP; 1765 A; 1636 C; 1593 G; 1370 T; 0 other; tcatataatt ttcgtttaac gaacaaacta agcaagatgc ctagtcgcgg ttgcattaga 60 tcgggtgtga agggagataa aggcgcgcgc aaaaagtctg atgaaggtgg tgcggtcggt 120 ggtcaaagtg ctggtgatga tgagattcca gaggtgttag ttgtggatga aactttggaa 180 ggatattcgt gtaagacgtg cagagaagcg gacaccaagg agatggtgca gtgtgataaa 240 tgcgacaaat ggttccattt ctcatgtgtg ggcgtttctg gagacattgc cgacaagagc 300 tggagttgta ctaactgcgt tacagcgaca tggatccagc gaactaaggc agctctggaa 360 aacgtaacta agcgggatga tgacaaccta gatcgcaagt caactaaaag ccagcccatc 420 ggaagcatcg ctacgagtac acgtgccaac ctagtgattc cttcagaagt tcacaacact 480 gcctccctgg atcgaaatct tgtcaacagt atgcataaac ctccaacaga agtggctagc 540 ggtcaaacta atcccgacca gcagaagcag cagagtgaca gagctaatcg aattgcagat 600 gtttcggctg tggatgctgc taaagcactg tcagaaattt cattttcttc gtcgcagaag 660 tcagcggtga gccgcgcaaa gctgcgactg atgcgactac aagaggaaag ggatttccag 720 cagcagcaag aagagcgtcg taaaattgcc gaagaaaaag cagcacaaga gcatcgaacg 780 tatttggaag agaagtaccg tatactggaa gaggcgttga gcgagaaagg atcacggagg 840 agtgaaggag gatcaaagag ctgcagtgat gcgtcaagcc gagtgagcga atggatacag 900 cgggccaggc aagtacagca gatccagcca gaagtggatg ttattcgtag tctacaattc 960 cccataggca gtagcacgga gcctacccac gctagcggtc gcttcggaca gcaacgaatt 1020 cgcttcgaac caagtgcaca aactggtctc ggcagccatg cagaaactgg ccaacgcatc 1080 aatcaagctt ttcactcagc tgcagaaacg atcacagcta caggacagcc atcgcagtcc 1140 agattgtcca atcattttcg tcaggaagtg ctgcaaacga atcccgtcgg caatccattt 1200 gcatcgtcaa ctaggcctcc attcaattcc actacctttg atggacaggt cctagccccg 1260 ccggcttgtt ctaccggtgt taaccaacag aatcttccaa gcggtgtccg cgcatcccaa 1320 agatcgtttc aagaggagtc tcaggatgaa ttccagcttt ccaggtcaca agtggcggcc 1380 agacaagcag tcccgaggga tttgccaaca ttttccggaa atccggaaga atggcctgta 1440 tttctgtcaa tgttcaatcg tacgaccacg atgtgcgggt tcaccgagga ggagaaccta 1500 gttcggctgc aaaagagcct gaaaggcaag gcttatgaag cagtcaagag ccgtctgatg 1560 ttccccggaa acgtttcagg cattctggct accttgagaa tgctgtttgg gcaacccgaa 1620 gtcatcgtac agtcgctaat cgaaaaagtc agctctttac cagctattag agaagataag 1680 ttggattcgc tagttgattt cgccgtacac gtgcaaaact tctgtgccac cgtagacgcg 1740 tgtggattgc aggagtacat gtacaacgtc acgttgctcc atcagctggt cagcaagttg 1800 ccaccatcgc tgaaacttaa ttgggcacaa taccgtctaa cgatggctgc cgtcaacctg 1860 gcaaccttca gtagctggat ctatgccttg gcagaagctg caagtatcgt cagctttcca 1920 ccgtcggttc aaccagaaaa gccccgcaat gagattcgcg gctccaagaa aggaaatggg 1980 tatctcaatg cccattcaga gacaccgtcg tccgaggaga acatagaaac ttcgccaaaa 2040 gaactctcga ccttaaagcc agtaccagcg aaacaagtct gtccaatttg taacggcagc 2100 tgcaagtccg ccgataagtg taagcgcttt gtggacctat ccagagaagc gcgatgggct 2160 gcaataagag agttcagcct atgccgtacc tgtctgcgga tgcacaaggg aaactgcaat 2220 ggcaagccct gcggtaagaa tgggtgtacc taccggcatc atgagttgct gcacaacgat 2280 tcgaaggata aacctcaaaa aaccgcagca gcagtcgcaa ccggaaacga aagtgagcca 2340 actgctttac cctgttccaa cctatccgga tgcaatactc atcggtctgt tgccagtaca 2400 gtgctgttcc gctaccttcc agtcgtattg tacggtaaga cgacagtagt tcgcacgcac 2460 gccttcttgg acgagggctc agagctcacc ctgcttgatc aagagattgc tgatatatta 2520 gagttggagg gtgccgaaag accactatgc ctacggtgga cgggaggaac tgaacgctgt 2580 gaaccgaatt cccaagctta caatctacaa atctctggag ccaaggaagg cagtaagcgg 2640 ttcgatctca acgatgtgcg aacggtgaag gacctaaagt taccgcaaca aacgatggac 2700 atggtcaaat tcggcaagga atatccacat ctccgagatt taccaatcga ttcctaccga 2760 gatatacggc cgcgtatact tatcggaacc aaacacgcac atcttggcct cgtcttaaaa 2820 agccgagaag gagagtttgg gcagccgatt gcagtcaagt caaggctggg atggaccatc 2880 tgtggagggc aaggtgctga tcgtggagat aatttgcacc attacagctt ccacatatgt 2940 ccgtgtaata ccgaagcaga cgaaaatctt caccaggcaa tgaaggacta cttctctctc 3000 gatagccttg gagttactaa accgtgcaaa ccgatgctat caatcgaaga tcaacgggcc 3060 atgacgttgc tacaatcgct tactcaccgc cgggatgatc gatatgaatc cggcttactc 3120 tggcgatatg acgactcctg tctaccggat agtcggtcca tggcgctgcg gcgattgcaa 3180 tgcctgagaa aacggatgga aagagaaccg gatctgcgaa cgactttgca aacaaagatc 3240 gaggaatacc tagcaaaagg ctacatcaga ctgctcagtg aggaagaaat taatgaaaag 3300 gtgcctcgtc ggtggtatct acccgttttc cccgtaacca atccaaacaa acctggaaag 3360 attcgtatag tatgggacgc agcggccagc gcccacggca cttcgttgaa ttcggctctc 3420 ctcaaaggac cagacctatt gtgctctctt ctcacaattc tactccagtt tcgggaacgt 3480 cgcatcggac tgaccggtga catccgtgaa atgtttcacc aggtattgat tcgtgcagaa 3540 gatcagtttt gccagtgttt ctactggctc aaccaaaatg gagaaaccca agtgtacgtc 3600 atgcaggtga tgacattcgg agcatgctgt tcaccgacca ctgcacagtt cgtgaagaac 3660 accaatgctg atcgattcgt gaacgaatat ccatcagccc atcaagccat tacaaagtcc 3720 cactacgtgg acgatatgct tgtgagtctg gacactgaag agcaagccat ccagctggcc 3780 aaagacgtta ggcacgtcca caagcagggc ggtttcgaga ttcggaactg gatcagcaac 3840 tcgatagtgg ttctggcggc gctgcaagaa actgacaccg acgaaaaatg cctcgatttg 3900 tcttccgaac tggccacaga aaaggttcta ggcatgtggt ggaatacgac cgacgacatc 3960 ttcatctata aagtgggatg gaatcgctac gatccagctc tgttgggagg tgaacgtagg 4020 ccaacaaagc gggaagttct tcgcgtcctg atgaccatat tcgatcctct aggactaatc 4080 tctcacttct tgtcatatct gaagatactg ctgcagcaaa tctggcggtc aggagtacag 4140 tgggatgaag aaatcgacga cgacgcatac gacaaatggc ttacatggct gaaggtgctt 4200 ccatgcgtcg aacaagtaag gataccaaga tgctacaact ccaagtacct tatgagcgaa 4260 gccgatgaag tacagctgca cacgatggta gatgcaagcg aaaacggcac ggccgccgtt 4320 tgctatctcc gctttatccg cgaagcatcc gtttcctgtt ccattgtcgc ggctaaatgc 4380 cgagttgctc cgcttaagtt cacttcgatc ccacgactag aacttgaagc tgccgtagtt 4440 ggcgcccggc tatctcgctc cgttcaagag tctctaacct ttaggataca tcgaaagctg 4500 tactggtctg attccagaga tgttctctgt tggatcaact ctgaccaccg acgctacagc 4560 caatacgtcg gtcataggat cagcgaaatt cttgaaacat ccgaggcaca agaatggcga 4620 tgggttccag gaaagaaaaa tccagccgac gacgctacga agtggaacag tctaccagag 4680 ttgtcatctg aggacaggtg gttcaaaggt gctgactttc tttggcgccc agaggcggac 4740 tggccgacaa tgcacagccg ccatgattcg accgacaccg aactacgtcc gtcgctttta 4800 gcacaccacg ctcctccgga accagtgatt tgcgtcaaca acttctcgac ttgggagaga 4860 ttgcgcaagg ttgtcgcata cgtccaccgt ttcttcaata actgccgtca caaacgccga 4920 ggaaaaacga cagagacagg tcccttttcg gtcaaacaac ttctcattgc cgatactttc 4980 ctgattcaac tctgtcaacg ggaagcttat cccgacgaaa tagcagtgct tcaaaataca 5040 ccccaacgtc ccgaaactac gaagaccata cctaaaacca gtgccctata ccagctcaca 5100 ccgtggctag acggtaatgg gatcatgcgc atgcgaaccc gtattgctgc atgtcactac 5160 gcctctgatg acgccaaaaa cccaatcatc cttccgcaca aacatcacat cacccacctg 5220 atcgttgcac actatcaccg caaatatcac catcaaaacc acgagactgt gattaatgaa 5280 atcaggcaga aattccgcat ccctcgcata cgtgtgtgct accaacaagt gcgtagggac 5340 tgccagcagt gcaagaacca gcgagcagct ccaaatccac cattcatggc tgatcttcca 5400 cccgcccgcc tcgcagcctt ttcacgaccg ttcacgcacg tcggaatcga ctacttcggt 5460 ccagtcgaag tcaccgtcgg cagaagagta gagaaacgat ggatactgct tgctacatgt 5520 ctcactatca gagcgattca cttggaggtc gttcactccc tcaacacgag ctcctgcatt 5580 atggccttcc gaaacttcat ggcccgtcgt ggtacgcccc aaaaagtcta cagcgaccgt 5640 ggaacaaact ttgtgggtgc aaataaagta ctcaaggagg ttaacgaagt gatgaatgag 5700 cgcgaattca tgaatgaatt cgggaagtct gcaatcgaat gggtcttcaa tccgcctctc 5760 acaccgcaca tgggtggtag ttgggagcgg ctgatccgaa ccgtgaaaaa caacctggca 5820 gtggtgtgtt tgtcagcgag gccatcggac gaagtactcc ggaatttgct gactgagatc 5880 gagaacaccg tgaactcccg ccccttatct cacgtgccaa tagatgacga ttcggcacca 5940 gcgctgaccc cgaaccactt tctgctcggc tcgtccaacg gctcgaagcc gctaaccgac 6000 atcgacgaca gtagtgcagc actcagacag aactggctta cgtcccaaat cctggcaaac 6060 cagttttgga aacgctgggt aacggactac ctcccagaaa ttacccgaag aagcaagtgg 6120 tttcaacgtc cgcagcctat cgccgtcgga gatatcgttg tcatcgccga ccctaagctg 6180 ccacgtaatt gttggccgac gggaaagatc atcgcggcaa aagtaagtag agacggacag 6240 gttcggtccg caacagtgag gacgtcaagt ggaatctatg atcgcccagc aacgaagttg 6300 gcagttctgg acgtacggtg cgacggtgag taagccaatc tacagttggc gtacccgggg 6360 ggac 6364 // ID Copia-12_AA-I repbase; DNA; INV; 3067 BP. XX AC supercont1.351; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_AA_; KW Copia-12_AA-LTR; Copia-12_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3067 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.351; Positions 779920 782986. XX CC Positions [1777-2304] - Integrase core CC 'GATTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 439..2805 FT /product="Copia-12_AA-I_1p" FT /translation="MSEKYIMPRLGNDNYQTWKLRMTMLLKREELWHTIGD FT AKPEPVTAVWTNQDQKALATIVLFLEDGQLSLVKDAVCAKGAWDQLRDYHE FT KATTTSRVSLLKKVCSLDMSDGGNMQEHLFELENLFDRLSCAGLPLEEPMK FT IAMIFRSLPDSYGSLVTALEGRPDADQTIVLVKQKLLDEHQRRAERGGVSE FT KVLKSRIEQKKERVCYHCRKPGHFKRNCRLWQQSQQSSDDEQKSSKKSDTK FT AKQVTEAQSPICFTVSSKRQKNYWYVDSGCSSHMCNSRKFFRELNEDISVN FT VVLADGSVAAAAGIGEGVISCLDGDGEVRKIVMKEVLYIPSLDSNLLSVRK FT ITQRGLKVRFVKAKCEIISDDGKVVAVAKASENLFQLVVPESARLSEEVRH FT NENCPHTWHRRFGHRDPRVFDRLQTEGFVTGFTMRDCGLKQVCEDCLKGKL FT PRNSFPKFSQSRAKRVGDLVHTDVCGPVKDVTPGGYAYFMTLIDDFSRYTV FT VCLLRNKSDVAGCIKKYVAHVKNRFGRAPCVIRSDGGGEYVNHELRMFYEK FT EGIEAQLTAAYSPQQNGVAERRNRSLQEMSSCMLLDAGLPRKYWGEAIMTS FT AYIQNRMPSRSVDTTPYQKWFGEKPSVEHMKVFGSVAYVHIPNVKRSKMDA FT KSEKLVFVGYCSDRKAYRFVDPATDKLTISRDVRFLELPNVSIERSASEIK FT NNSSSIELILDGGKGKIQEEPVIADDPVQEERTSEDEEGEEFYYDSTEDPI FT EEEGAKRRTRGVLPKRLTDYVVGYTHLAIAAAKSTDKR" XX SQ Sequence 3067 BP; 897 A; 569 C; 858 G; 743 T; 0 other; ggttatgtgg cccagttttg tgacggaaga ttcgtgaagc tagtgatttt tcattccgcc 60 gccattgcat atttgctcgc ccggtgtgcg cttgcaagaa gtgattattg gcagtaaatt 120 gctcgagaag tgatattccg aaactcgagc agtgaaaagc ggaaggctgt agtttgtgag 180 aaaatcgata atttccggcc gatgagcaga tttgttgagg taaagtcccg gctcgtttct 240 cgacattttc acccgaccag cagaacagaa aacgtccggt agtgttgtga ttgtagtggt 300 gtcggaattc cagctgctga ttcgatccag taattagtag tgagagtgtg agtttcctcc 360 cgtctcgaat cgacgcgatc gcggttagat tgcgaaagtg ttgctagttg cgtggtattg 420 tttacgaata gcagaaaaat gtctgagaaa tatatcatgc cgcgtctcgg caacgacaat 480 tatcagacgt ggaagctgcg gatgacgatg ctgctgaagc gtgaagagct ttggcatacc 540 attggcgatg caaagcccga accggtcact gcggtgtgga cgaatcagga ccagaaagct 600 ctggcgacaa tagtgctttt tctggaagac ggacagttaa gcttagtgaa agacgccgta 660 tgtgcaaagg gtgcgtggga tcagcttcga gactatcatg agaaagccac cacaacgtcg 720 cgagtgtcat tgcttaagaa agtgtgtagc ttagacatga gtgatggtgg taatatgcaa 780 gaacacttat ttgaacttga gaatttgttt gacagactgt cgtgcgcggg tttaccgtta 840 gaagagccga tgaaaatcgc aatgatcttt cgtagtcttc cggactccta cggaagtttg 900 gtgacagctc tagaagggcg acccgacgct gaccagacaa ttgtgttggt taaacaaaag 960 ctcctggatg aacaccaacg tcgtgccgaa cgtggtggtg taagtgagaa agtgctgaaa 1020 tcgcgaatcg agcaaaagaa ggagcgggtt tgttaccact gtcgaaagcc aggacatttc 1080 aaaagaaatt gtcggctttg gcagcagtcg cagcagtcaa gtgatgatga acagaagagc 1140 tcgaagaaaa gtgacactaa agcgaagcaa gtaacggaag cccaatcgcc gatatgtttt 1200 actgtgagca gcaagcggca gaagaattat tggtacgttg acagcggctg ctcgagccat 1260 atgtgtaaca gccgaaagtt ctttcgcgag ctgaatgaag acatcagtgt gaatgtggtg 1320 ctggccgatg gatcggttgc agctgccgct ggtattggtg aaggtgttat ttcgtgtttg 1380 gatggagatg gcgaagtgcg taaaatcgtc atgaaagaag tactgtatat cccgagcttg 1440 gacagtaatt tgctctcggt gaggaagatc acacaacgag gactaaaagt gcggtttgtg 1500 aaggcaaaat gtgaaataat aagcgacgat gggaaggtgg ttgcagttgc aaaagccagt 1560 gaaaatctgt tccagcttgt ggttccggag agtgctaggc tgagtgaaga agtgcggcac 1620 aacgagaatt gccctcacac ctggcatcga cgtttcgggc acagggaccc gagagttttt 1680 gatcggcttc aaacggaagg atttgtcact ggatttacga tgcgagactg cggattaaaa 1740 caggtatgtg aggattgttt gaagggaaaa ttgcccagaa atagtttccc gaagttttcg 1800 caaagtagag ccaagagagt tggcgattta gtccatacgg atgtttgcgg accagttaag 1860 gatgtcacac ccgggggata tgcgtacttc atgacgctaa ttgatgattt ctcgaggtac 1920 actgtggtgt gtttgttgag gaataagtca gatgttgccg gatgcataaa gaaatacgtg 1980 gcacacgtca agaatcgatt tggccgagca ccgtgtgtca tacgttcaga cggtggtgga 2040 gagtacgtga accatgagct aaggatgttt tacgaaaaag aaggaattga agcacagctc 2100 actgccgcgt actcacccca acagaatggg gttgcagagc gaaggaaccg ctcattgcaa 2160 gagatgtctt cgtgcatgtt acttgatgca gggcttccta gaaagtactg gggcgaggcg 2220 atcatgactt cagcgtacat acagaaccga atgccctctc gttctgtgga cacgactcca 2280 tatcagaagt ggttcggtga gaagccttcg gtggagcata tgaaggtctt cggcagcgtg 2340 gcgtacgttc acatacctaa tgttaaacgt tcgaaaatgg atgcgaaatc agaaaagcta 2400 gtatttgtcg gatactgcag tgatcgcaag gcatatcgat tcgtcgatcc agctaccgac 2460 aagttaacaa ttagccgcga tgttcgcttt cttgaattgc cgaacgtttc cattgaacgc 2520 tcagcatcag aaatcaaaaa caacagcagc tcaattgagt taattttgga tggcggtaaa 2580 ggcaagattc aggaggagcc tgtgatcgcc gatgatccag ttcaggagga gcggacgtct 2640 gaggatgaag aaggtgaaga attctactac gactctacag aagacccgat tgaagaagaa 2700 ggtgcgaagc ggcgcaccag aggcgtactt ccaaagcggc ttacggacta cgtagtgggc 2760 tacacccatt tggctatagc agctgccaaa tcaactgata aacgttaacc aacatcttgt 2820 gtggagaaca ataatagttg tttgtagcag tgagaacatt gtcatcagga acattttaaa 2880 ttaaatacag aaatgaagaa atatgaagta ataatcaaaa agtagcaggg aatgaaataa 2940 aagattactt cgaacaatca tagctaaggt taggaagagc agaaaagtta agttaatcaa 3000 tcgtaaaggt aaaggaaaag aagtgaagat gtggctattg aatatgttta aaaatattga 3060 ggaggag 3067 // ID Gypsy-170_AA-LTR repbase; DNA; INV; 1281 BP. XX AC supercont1.294; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-170_AA_; KW Gypsy-170_AA-I; Gypsy-170_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1281 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.294; Positions 667846 666566. XX SQ Sequence 1281 BP; 353 A; 256 C; 296 G; 376 T; 0 other; tgtaaccagt cactttgtag gttacaatat gaaacttgaa tgctggacca atttggttca 60 aaattatgaa ataatgtaaa ttcaatattg taagtaaatg tctaaatata aaataaatgt 120 ttgtgctccg tgaaatcatg tttgagtgaa tatttatttt atgtttaaaa gcccttggtt 180 gacatagaac gatagttttc ataaacaatt agggaattag ttcttttgat atattgtgtc 240 ttaaagtttt cgtcaagttt tgaatttaaa tgttcgtctt gtcagccatt tgtatataat 300 gttcgaattc gaaaatgtat gtgttatacg ggttgcccgc ctaggagatc tatagcccga 360 catcgaaaat ttggaatggc ggtttggatg cgatgagagg gcaagaattt gatggcgaaa 420 tcggatataa aaggacatag cgaacttcag ggggtctctt gtattgttag aagtttgatc 480 cggaggagtg aagaacagtg acagtgccgg ttacgcccgt ctctatgcgt atcggtagtt 540 ccgcgtcggt tctagtgtgt aaccgacaaa tcgaagtccc taaggtgaaa cagttaaatc 600 gtctcccata acaccggact taaagttaaa aggtacggaa ctcggtatga ccgcaccaca 660 acggttagcc aaaagacacc ccagtagaag ttagtgctag tcgagttagg tcatccgggt 720 tcaatacgtg ccaaattaag gaccaagcga gtgtggtgca aaagactgca gtgtgagtgt 780 ttgaaacggt cagaagtggc atggtgaggt ccttaccgat gtttggaagg cctcgagacg 840 atacgctaaa actgtgacct gccagagata gtgtcgtgac tcgcttggtt tcctgaccga 900 ataacagtgc gcgctcggta ccactccccg gatccgcgga acgtacgttg gccgtccgtt 960 acaataggcc tagtgtcacg tgtcgggaat agccccggtt catgggttat atcctgagac 1020 ctaccaaagt tacctaccct gctcggattc cctcgccgcc attgtccacc accgagaagc 1080 cccgcattcc atcatctgcc tgctgtatgt cgaccaccgt tcatcatggc cacaacgaaa 1140 gctaagtgtt accttttgtt caaatttcgt cctaataaac tgtccagaag cttgttctca 1200 ataaattcct tagtctattt gttaaatatt gtattgtgta aattttgagt agtttgttga 1260 accgaattgt ctagtcctcc a 1281 // ID BEL-3-LTR_NVi repbase; DNA; INV; 795 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-795 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 747-747 (2009). XX DR [1] (Consensus) XX SQ Sequence 795 BP; 160 A; 249 C; 183 G; 203 T; 0 other; tgttccgtcg agacccgagg tgtaagttcc gccgtgaaaa cggtgtcgag cgaactcagc 60 gaggtcgacg tagagcatgt ggctccatct cccgtgctcg agttgaacaa gcggagtcgc 120 gacttgccgc gctcgccggc agcgccgccg gcctcgtgtc gcgcgcggcc ggacgagcgc 180 cgagtcagtt gacgataaga cgccaagctt gacagctcgt gcagtcaagc tagcgaacga 240 tcgaacgaca ctttggctct tgcgaatccg tccgtacgtc catcgagtgc ttctggtgct 300 ggtgcttgtg ctactctctc tctctctctc tctctctctc tctctctatc tctctctctc 360 attctctcta aatctctcta aatctctctc ctttgtctct ttcgctgcct gctactcgcg 420 ttcggctcgc gcccggttcc gcgaatcgta tctcgcgcgc gtcgaaacgt gcaagtcggt 480 agaactgcgc taccgacttg agtcagtgga actgcgccac ggacacgaag cgaagattct 540 ctctgagaat cgcacgaaca cgacacacac acgaacacga cacacgttct tctttttctt 600 cgcagcttgt aagaacttcg agaggggttt tatgcctcac ggctttcccc tctcaatttg 660 agcgtgatta aatattctcg attttcgctc aatcgacttc tttatttctc ccacacctca 720 ctccttgcga actcgaacac acacacaaaa aaagcgggaa tattcgcctc agaatattcc 780 gcgcgcgaca attca 795 // ID DNA4-11_AP repbase; DNA; INV; 277 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-11_AP. XX NM DNA4-11_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-277 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1960-1960 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Putative TATA TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 277 BP; 64 A; 68 C; 73 G; 72 T; 0 other; ctcggcgcgg aataagaaca tgaccaggcg gcggaccgat tagcgtccgt tacgcgcagc 60 cgatttctca ttggttcccc aaacatctgt tcacagttat acagcactgt tcaaaatgtg 120 aaccgcatac actgtgttgt tttataaaca cgtgcacgtc tctgttgaca catgaggcgc 180 tccatataca aagccggtaa ttgccactat ggtgggcggt gcggtggagg aagttgtttt 240 gaatgcgcga ctggtcatgt tcttattccg cgccgag 277 // ID Gypsy1-I_Dpse repbase; DNA; INV; 4010 BP. XX AC Unknown_group_134; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_Dpse; KW Gypsy1-LTR_Dpse; Gypsy1-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4010 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1027-1027 (2009). XX DR Genome; Unknown_group_134; Positions 218 4227. XX CC 'ACGC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 89..2161 FT /product="Gypsy1-I_Dpse_2p" FT /translation="MVQQRRCRMVTFSDESPATASAAPTRSADPTATPRTK FT TTHRNPDHARPSNVPVTTHQNPNHARPINVPVTTHQNPNHARPINVPVTTH FT QNPNHARPSNVPVTTHQNPNHARPINVTMTTHQNLDHARPSNVPVTTHQNS FT NHARPINVTMTTHQNLDHARPSNVPVTTHQNPNHARPINVTMTTHQNLDHA FT RPSNVPVTTHQNPNHARPINVPVTTHQNPNHARPINVPVTTHQNPNHARPI FT NVPVTTHQNPTHARPINVPVTTHQNPNHARPSNVPVTTHQNPNHARPINVT FT MTTHQNLDHARPSNVPVTTHQNPNHARPINVTMTTPQNLEHARPSNATRTQ FT PTIRYDHARTINEDSTETTTTPLDTPRASPSASKNTQSNEDSTETTATPLD FT TPRASPSASKNTQSHEDSMETTATLLDAPRASPSASKTTQSIPTKTTELSH FT DHARTSSNIRTTFLSFDHARLSSNIRKTLLSPDHARPSRNTRTTHRSFDHA FT RPSSNTRTTHLSPDRDGPGGAKNTTRSTRAKSEVRSSAAQSERTTPRETSI FT VLHRDPLRQEDHGPEGQERVKVDQLASIPPTDYQLAPLADAAAGRQRNPFR FT RSDSTDPITVNLAIITCALLEEISPPTAPITEPPIPPWVAEFLKQKLAKFE FT GVSHIAEHTITLKDDKPIKQRYFPKNPAMQKIINAQVDELLH" XX SQ Sequence 4010 BP; 1185 A; 1326 C; 945 G; 554 T; 0 other; acgcggtcgt tcgtcagtca acgcatcgct catgaagtaa gtaccggagg gaacgcacgg 60 aatataagga ctaccgtccg cctggcggat ggtacagcaa aggaggtgcc gcatggttac 120 ttttagtgac gaaagtcccg cgactgcatc cgctgctcct acccgctcgg ccgatccaac 180 agcgaccccc cggacgaaga caacgcaccg gaatcccgac cacgccagac ccagcaatgt 240 accagtgaca acgcaccaga atcccaacca cgccagaccc atcaacgtac cagtgacaac 300 gcaccagaat cccaaccacg ccagacccat caacgtacca gtgacaacgc accagaatcc 360 caaccacgcc agacccagca acgtaccagt gacaacgcac cagaatccca accacgccag 420 acccatcaac gtaaccatga caacgcacca gaatctcgac cacgccagac ccagcaatgt 480 accagtgaca acgcaccaga attccaacca cgccagaccc atcaacgtaa ccatgacaac 540 gcaccagaat ctcgaccacg ccagacccag caatgtacca gtgacaacgc accagaatcc 600 caaccacgcc agacccatca acgtaaccat gacaacgcac cagaatctcg accacgccag 660 acccagcaat gtaccagtga caacgcacca gaatcccaac cacgccagac ccatcaacgt 720 accagtgaca acgcaccaga atcccaacca cgccagaccc atcaacgtac cagtgacaac 780 gcaccagaat cccaaccacg ccagacccat caacgtacca gtgacaacgc accagaatcc 840 cacccacgcc agacccatca acgtaccagt gacaacgcac cagaatccca accacgccag 900 acccagcaac gtaccagtga caacgcacca gaatcccaac cacgccagac ccatcaacgt 960 aactatgaca acgcaccaga atctcgacca cgccagaccc agcaatgtac cagtgacaac 1020 gcaccagaat cccaaccacg ccagacccat caacgtaacc atgacaacgc cccagaatct 1080 cgaacacgcc agacccagca acgccacccg aacccaaccc acaattcgat acgaccacgc 1140 cagaaccatc aacgaagatt cgacggagac gactacgaca ccgctggaca cacccagggc 1200 cagtccgtcc gcctcgaaga acacccagag caacgaagat tcgacggaga cgactgcgac 1260 accgctggac acacccaggg ccagtccgtc cgcctcgaag aacacccaaa gccacgagga 1320 ttcgatggag acgactgcga cactgctgga cgcacccaga gccagtccgt ctgcctcgaa 1380 aaccacccag agtatcccta ccaagacaac tgagctaagc cacgaccacg ccagaaccag 1440 cagtaacatc aggacaacgt ttctaagttt cgaccacgcc aggctcagca gcaatatcag 1500 gaaaacgctt ctaagccccg accacgccag acccagccgc aatacaagga caacgcatcg 1560 aagcttcgac cacgccagac ccagcagtaa tacaaggaca acgcatctaa gccccgaccg 1620 agacggaccc ggcggcgcta agaatacaac acgatcgacg cgagccaaat cggaagtgcg 1680 gagcagtgcg gctcaatcag agcgaacgac cccacgggaa accagcatcg ttctacaccg 1740 cgacccgcta cgccaggaag accacggccc cgaaggacag gagcgggtca aggtagacca 1800 actcgcatct atccccccaa cggattatca actcgcccct ttagccgatg ccgcggccgg 1860 tcgacaacgc aacccttttc gaagatcaga ctcgactgac cctatcaccg tgaaccttgc 1920 catcataaca tgtgcccttc tcgaagaaat ctcgccaccg acagccccca tcaccgaacc 1980 acccatcccg ccgtgggtag ccgagttttt gaagcaaaaa ttggccaagt tcgagggcgt 2040 gtctcacata gctgagcata ccatcactct gaaggacgac aagccgataa agcagaggta 2100 cttccccaaa aatccggcca tgcagaaaat catcaatgct caggtcgacg agttgctgca 2160 ctgaagaagg agaaaaggtg gatctggagt gacgaacagc aggccgcttt cgaggagttg 2220 aaggagaagt taacccgagc acccgtctta gcctgcccgg atttcgcgga gaggttcgct 2280 ctacaaactg acgcaagcga ctacggtcta ggggccgtcc tgacgcaaca gatcagtaga 2340 gaggccagtc gccagtcgaa ggcttttgaa agcagaagag aactactcag ccacggaaaa 2400 ggagtgctta gccatcgtct gggccatcag gaaattacga tgctaccagg agggctacca 2460 cttcgacgtc atcaccgacc atctcgcctt gaagtggtta aactcgatca ataatccaac 2520 cggtcgcatc gcccgttggg cgctggaact ccaacagtat cgctttgacg tgctttaccg 2580 acgcggaagc caaaacatcg tagcagacgc tctttcccgg caacctatcg acaccataca 2640 gcaagcccta gaagacctgt ccacctgccc ctggatacaa aggttgctca aacgtatcca 2700 gtcacgacct gaggaatgca gagaatttat catcgagaac ggacagatct accggaatgg 2760 acaggaatgg aactcgtgca caccgcgccg tatacccctc agcaaaaccc gacggagagg 2820 gcaaaccgaa ccatcaagac aatgattgca caatacgtcg acggggacca acggacctgg 2880 gacgatctcc tccccgagtt aagcttggcc ataaacagca gcacgtatta caccacggcc 2940 ttcagcccgg cctttttggt actcggacga gaaccaaggc taccaggagc cttgtacggc 3000 gaagtgaccc caggattagg aaatgaacca gagcctgccc acgagaaggc gacacgactg 3060 caggaagtgt tcaaggtcgt gcaggagaac acccaacggc atcccaagag cagaagagaa 3120 catacgacct ccgacggcgc gagtggcgac ccgaactagg caccctagtc ctccttcggc 3180 agcatcactt atccaaagcc gccgacggtt tcgcgtcaaa attggcccct aagtacgacg 3240 ggccgtacaa agtaacctga ttcttatctc ccaacgtagt tcgactacag attcaacgag 3300 gcaaacaaag gaaaacagcg agtttagctg acttaaaggc gttccacgga gtggaccacc 3360 ctgaggagaa cccgacgacg atggagagat aagtactaca cccaatgtaa acatcttctc 3420 actcgcaccg ctccagcagg atagggggag ggtattaata cttctagtta agacaggcag 3480 gttctaactg ataggcattt tctccctaga atcgactgaa gacatgaagc gtctaagaga 3540 cggaaggcaa gctcggcccc taggggaaaa tggtgatacg acgggacacc ctagcggaga 3600 ccagctgcct gaagtcccgg catctgccga tgaaccaacg acaaaccggg ctcgcggctg 3660 gatcgtcgac cccaatgggc catctgcctc ccgtcaagcc cagctgaggc gggtcggaga 3720 ggcgacactt cgacaaccgg acaggatgcc atggggccag cgcgcggcag ccatccaagg 3780 gtggatcgac cgacaaaccc aggacccggg gaccgacgcc gaagaggaag tagagcccga 3840 tggccgcttc gcaagcgcag gtgtccgctc ggcgcccaat cacgacgtcg ccgccctgaa 3900 tcgccgttgg ttcgggagtt atggggctgc tatcgacgac gacaacgcga caacggacac 3960 cgacgagcac gagacaataa tacgcgacgc tcaaagcctg gcgtccacgc 4010 // ID Gypsy-155_AA-LTR repbase; DNA; INV; 128 BP. XX AC AAGE02017419; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-155_AA_; KW Gypsy-155_AA-I; Gypsy-155_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-128 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017419; Positions 60796 60669. XX SQ Sequence 128 BP; 49 A; 35 C; 28 G; 16 T; 0 other; tggcaaccac aacgaagagg aacgcattct ccaacaacag gtagcgaaac tgcaacacca 60 gcagtctgaa ctgcagcgca taattcaaca attggaagaa caacagtcga cgacggcgta 120 acacgaca 128 // ID RTE-7_BF repbase; DNA; INV; 313 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-7_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-7_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-313 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-313 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1705-1705 (2009). XX DR [2] (Consensus) XX SQ Sequence 313 BP; 120 A; 57 C; 88 G; 48 T; 0 other; gtcaagcaaa acatcggtcc gtacgaagac ctcctaacca cagtaaagaa aagaaaactg 60 aagtggtacg gccacgtgtc acggtcatcg ggactagcca aaactatcat gcaaggaacc 120 gtccagggag agaagaaaga ggggtaggca acggaaaaga tgggaggata atatcagaga 180 atggacaggc atgacactaa gtgaaacact aagaaaaaca gaaaaccgag aagggtggag 240 aaaactggtt gccagatctt ctgtggtgcc ccaacggtca aatagttaga ctaagggata 300 gatgagatga gat 313 // ID hATm-20_HM repbase; DNA; INV; 2827 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 08-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-20_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2827 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1914-1914 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 242..2500 FT /product="hATm-20_HM_1p" FT /translation="MATTRSSAEVWLVGKPTQTLSTARLPSRGDVLRRLLF FT HHIEEKQEVKKSIRATIEAVLVIWERGKIPTQRIDNAERKLKKLYDEYLLL FT KKHRGSKLDSCQMKEGIFHADLEELFDISTKNASEVMKNDEDKAFLRQQQE FT DPLSCSMAGIDRNLTAREARKRVRETKMELRRQRSQAELMEKTELISSSSA FT VDIVSESSSSSSDDDVYQTPSASSTLSTSGSTKKKMKVLTNNVVSSLDRVN FT ISDRRALFVVGTVSQALGHSVKDLSLSYSTIRRKRRSVREAVTTADKVDFS FT PDDPLLLHWDGKLLPDITGGQKKVDRIAILVTGGGVEKLLGVPKIDRGTGE FT QQADACMKALKDWKLKELVRGLVFDTTSSNTGLNIGACTIMEQNLDRNLIW FT IACRHHMFEVMLSTIFTTAFGTSGGPEVGIFNRFQKQWPAINKEVFTIGNK FT EFYNCDFFLKLREEMVVYYSEAIKGQQPRDDYLELLNLCLIFLGGSTTTSE FT FKVKFRAPGATHNARWMAKAIYCIKIYLFQDQFVITAKEKKGITDISLFVA FT FVYGQFWNEAPLAERAPFNDAKLLQRIQEYPNRTIAINAAKAFHRHLWYFS FT EHLIGLAFFDSRVDNNTKRDMANNLKQQPRQKSLKRLDAATFDCHSPLPSF FT VTQRTAELFDLILKNGKEKAESFLMKESSEWEMDSTYLEMKHKVGQMKVVN FT DCAERGIALITSFNSSVTKDENQKQFLLRLVDLHRKEFPVASKSTLMKMTV FT E*" XX SQ Sequence 2827 BP; 949 A; 499 C; 569 G; 810 T; 0 other; ttagggtgat tcagattgta atagaaaaaa aaaatgtcac aaaaaacatt ctcctcaagg 60 ttgcctgtgt tcctttatca cctaggatca tgtgtaccaa aaattagact aaaagaagca 120 attttagggg ttgccatgtt ttataagatg aattaaccat tggtaaaatt ttgtgatttt 180 ctcaaatttc gcaaatttgt tttttcattg tttatttatt tataatttag aaattgataa 240 tatggcaacc actagatctt ctgctgaggt gtggcttgtt ggtaaaccta cacaaactct 300 gtcgactgcc agacttcctt caagagggga tgttcttcgg aggttgctct tccatcatat 360 agaagagaaa caagaagtga aaaaaagtat tcgggcaaca attgaagcag ttcttgttat 420 atgggagaga ggaaaaattc caactcaaag aattgacaat gcggaaagaa agttaaaaaa 480 actttatgac gaatatctgc tgctgaaaaa acaccgagga tcaaaattgg acagctgtca 540 aatgaaagaa ggaattttcc acgctgacct tgaagaactt tttgatattt caaccaagaa 600 tgcatcagaa gttatgaaaa acgatgaaga taaagcattt ttacgtcagc agcaagaaga 660 tccattaagt tgtagtatgg caggaattga tcggaattta acagctaggg aggcacggaa 720 acgagtccga gaaactaaaa tggaattgag gcgtcaaaga agtcaagcgg aactgatgga 780 aaaaacagaa ctcatttctt cttcatcagc agttgacatt gtgagtgaaa gtagctctag 840 ttcctcggat gatgatgtat atcagactcc ttccgcttcc agcacactga gcacaagtgg 900 ttcaacgaaa aagaagatga aagttctaac aaataatgtt gtatcatctc ttgatcgtgt 960 taatatttct gatcgccgtg cactcttcgt ggttggaaca gtgtctcagg ctcttggtca 1020 ctcggtaaaa gatttatcat tgtcgtacag cacaatccgc agaaaaaggc gatctgttcg 1080 tgaagcggtg actacagctg acaaggttga tttctctcct gatgatccac ttcttctgca 1140 ttgggatgga aagctcttac ctgatatcac tggtggtcaa aaaaaagtgg atagaattgc 1200 catccttgtg actggtggtg gggtggaaaa attgctgggc gttccaaaga ttgatcgagg 1260 tacaggtgag caacaggctg atgcttgcat gaaggctctt aaagactgga aattgaaaga 1320 acttgttcgg ggactggttt ttgacactac ttcatctaac actgggttga acattggtgc 1380 ctgcacaata atggagcaga atcttgaccg taatcttata tggatagcat gcaggcacca 1440 tatgtttgaa gtcatgcttt ccaccatttt cacgactgca tttggaacca gtggaggacc 1500 tgaagttgga atttttaacc gtttccagaa acaatggcca gcaatcaaca aggaggtgtt 1560 cactataggg aataaagaat tttacaactg cgattttttt ctcaaactcc gtgaagaaat 1620 ggtagtatat tatagtgaag caattaaagg tcaacaaccc agggatgatt acttagaact 1680 cttgaatttg tgccttattt ttcttggagg atcaactaca acatcggagt tcaaagtgaa 1740 gtttcgggct ccaggtgcaa cacacaatgc acgttggatg gcaaaagcta tatattgtat 1800 aaagatatat ttgtttcagg atcaatttgt tattactgca aaggaaaaga aaggaataac 1860 agatataagt ctttttgtgg catttgtata cggacagttt tggaatgaag ctccgttagc 1920 tgaacgagca ccattcaatg atgccaagct tcttcaacgg attcaagaat atccaaatcg 1980 tacaattgca attaatgcag caaaagcttt tcatcgccac ctctggtatt tttctgaaca 2040 tctaatagga ttggctttct tcgattcccg tgtcgacaac aacaccaaga gagacatggc 2100 aaacaacctg aaacaacagc caaggcagaa atctctcaaa agacttgatg ctgccacatt 2160 cgattgtcac agcccattgc cttccttcgt cacacaacgc accgcagaac ttttcgatct 2220 aattttgaaa aatggaaaag aaaaagctga atcattttta atgaaagaat cttcagagtg 2280 ggaaatggat tcaacatatc ttgaaatgaa gcacaaggtt ggtcaaatga aagtcgtcaa 2340 cgattgcgcg gaacgaggca tagcacttat aacatccttt aactcaagcg ttacgaaaga 2400 tgaaaaccaa aagcagtttc ttcttagatt ggttgatctt caccggaagg aatttccagt 2460 tgcatcaaag tctaccctga tgaagatgac tgttgaatga tgaagatgat tgatttatta 2520 ttgcctagtg caattacata gtagtactat agtattgatt tttttaacat tgaacatcca 2580 caatccacca cataataata agaaattgat aataaataat tcagtaatat aatataaaaa 2640 atgtaaaaat aaaaaatatt ttttttactt tttgtaacct tataccaaaa tgtgacaaaa 2700 catggcaacc cctaaatatc atccgatttg attcaaattt tggccaataa ctactattgt 2760 catatagaac aagggcaagc ttgaggacaa ctaaagtttt gaacctacaa attttgcatc 2820 accctaa 2827 // ID BEL-79_CQ-LTR repbase; DNA; INV; 350 BP. XX AC AAWU01022182; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-79_CQ_; KW BEL-79_CQ-I; BEL-79_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-350 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 300-300 (2011). XX DR GenBank; AAWU01022182; Positions 3666 4015. XX SQ Sequence 350 BP; 75 A; 78 C; 75 G; 122 T; 0 other; tgtttattgt ggaacaacct aaagtaaatg tagtctagat aaatttgtgt tcagtgttct 60 atgtactttc tctcttgatc gatttgtgta gtgtaacaat atttgtgtga ccttatcagc 120 gagagaattg ataagagagc gagatcttaa tctcctctct aggctagcta ttaaatttgg 180 actaataaat ctgcctcttt cggctctgac cacccgacga ctcggtcatg tgtcctgact 240 tctagtccgc gcgttctttt ttatttcgat ccgttgcgat ctgttcttgc cggtccccgt 300 tttccgtggc cggttcctgt ccgctctcga acgagcatcg aaggtgaaca 350 // ID Waldo-1_CQ repbase; DNA; INV; 5409 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A (AC)n-specific Waldo non-LTR retrotransposon from Culex DE quinquefasciatus - consensus. XX KW R1; Non-LTR Retrotransposon; Transposable Element; Waldo-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5409 RA Kojima K.K. and Fujiwara H.; RT "Evolution of target specificity in R1 clade non-LTR RT retrotransposons."; RL Mol. Biol. Evol 20(3), 351-361 (2003). XX RN [3] RP 1-5409 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 648-648 (2011). XX DR [3] (Consensus) XX CC This consensus is generated from 10 sequences with >99% identity. CC Both sides of this transposon are (AC)n microsatellites, CC similarly to other Waldo elements such as WaldoAg1 in Anopheles CC gambiae (malaria mosquito) and WaldoFs1 in Forficula scudderi CC (earwig) [2]. Waldo is a group of (AC)n microsatellite-specific CC non-LTR retrotransposons belonging to the R1 clade. XX FH Key Location/Qualifiers FT CDS 451..1926 FT /product="Waldo-1_CQ_1p" FT /translation="MKTSETNQQKGPDAPQASENTEVEEDANVEMAGGESN FT GGDTAGGVASAFRGSGKVLRSPVLNQAAASSQQIGVIGEETPKSSLLNFAG FT STPQDGVLLGRTALQEVRRRVNELFDFIKDKNNVHTRIKQMVNGVKAAMNA FT AERENNSLVVTRNSLKLRAERAEETLNAKLEEEALREKEPKTPPGPSSKRD FT RETPGEEEDAKKQKQGNGDSPDPAKEPEPTPGKEKEWEKVKKKKRKKKGKQ FT NEDTQKPKFRRERNKGEALVVEVKEGVSYADLLRKVRTDPELKELGENVVK FT TRRTQTGAMLFELKKDPAVKSSAFKSLVEKAVGYESKVRALSPETTIECRN FT LDEITTEEELEDALIVLLDDRTTPMAIRLRKAYGGTQIASIRLSTPSASKL FT LETGKVKVGWSVCPLRPVPRVTQQMTRCFRCMGFGHQARNCDGPDRTNSCR FT RCGREGHMARDCKNQPKCVLCKEGDGNSHATGGFNCPVYKKLASGKK" FT CDS 1929..4895 FT /product="Waldo-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MEVSQVNLNHCDTAQQLLWQSTAETGCDVAIIAEPYR FT VPHDNGNWAADTARMAAIHVMGRYPIQEVVSRAFEGFVIAKVNGTFFCSCY FT APPRWTLEQFQQMLDSLTDELIGRSPIVIGGDFNAWAVEWGSRCTNARGHS FT LMEALAKLDVRLANRGTSSTFRKDGRESIIDVTFCSPRLAADMNWRVSEDY FT THSDHQAIRYSIGRRAPVPDRSSRSYGRKWKLQYFDEGLFVEALHWCDGPQ FT DLSADVLTAQLVTACDTTMPRRLEPRNCRRPAYWWNEELGTLRASCLSARR FT RVQRARSEATREECREEYRSAKAALKKAIKCSKTNCFKELCQDADANPWGS FT AYRVAMAKIRGPSMVAETCPDKLKVIVEGLFPRHDPTTWPPTPYNDEGGSN FT AEGHLITNEELVAVAKRLKVKKAPGPDGIPNFALKSAVLAFPDRFRTVLQE FT CLDEGHFPDPWKVQKLVLLPKPGKPPGDPSSYRPICLLDTLGKLLERIILN FT RLTKYTESEHGLAARQFGFRKGRSTVDAIRKVVEKADEARRKKRRGNRCCA FT IVTIDVKNAFNSASWAAIAAALHKMKVPDYLCRILKSYFENRVLVYDTADG FT QKTVVVTAGVPQGSILGSALWNGMYDGVLTLGLPNGVEIVGFADDIVLTVT FT GENVEEVEVLAMEAIAMIENWMLEVKLRIAHHKTEMVLVSNHKKVQQAQIH FT VGEHVVHSKRALKYLGVMVDDRLNFNSHVDYACEKAAKAIMALSRMMPNNA FT GPRSSRRRLLASVATSILRYGGPVWWTALGTKRNRALLDRTQRLMAMRVAS FT AYRTISSEAVGVIAGMIPIGITLEEDTVRYTRRGTRGIREAARAESLARWQ FT REWDTTEKGRWTHRLIPSVSTWVSRRHGEVTFHLTQFLSGHGCFRKYLHRF FT GHAESPLCPDCVDCEETPEHVVFACPRFEAARSEMLAIIGADTSPDNVVRR FT MCSDIAKWNAVVGAVTQITSALQRKWRDDQRRND" XX SQ Sequence 5409 BP; 1448 A; 1326 C; 1696 G; 938 T; 1 other; cccgtgaacc gtggggacag gtgagggtcc ctgcggagct tagctgccag ctaccgggcg 60 ggttgcagta ggcggatagc tgtcggcgat tgcatacatt cattgcatcg cccccggacc 120 agcagcggga ggatgtctag gacgtggcgg aattgaacaa gggcactgtt taatttcttc 180 gaaaaaacca catgggtccg taaacacctc tgtcaagcga tcgaacgccg ctataagtgt 240 tttagcccaa aacctcacca aaccccgaat ccaagatgtg atgcgacccg tgtcgaggga 300 tgcatggctg ggggggttca acaaattccc agtcgataac ggagcctgtg gggcacaggg 360 gcgaacccca cacgtaattt gcccttactg cgtcacggca gggcactggc gcagcggacc 420 gtatttccct agcgactcgt gggattcaaa atgaagacaa gtgaaacaaa ccaacaaaag 480 ggtcccgatg cgccccaagc atcggaaaac acggaggtag aagaggacgc aaacgtcgaa 540 atggcaggag gcgagtcaaa cggcggcgac acagcaggcg gagtggcgag cgcgttccgt 600 ggaagcggga aggtgttgag atccccagtg ttgaaccagg cggctgcttc gagtcagcag 660 ataggagtaa ttggagagga gactcccaag tcatccttgt tgaacttcgc cggcagtacc 720 cctcaggacg gagtcctgct tggaaggacc gcgttacagg aggtcaggag gagggtcaac 780 gaactctttg atttcatcaa ggacaaaaac aacgtccaca ccagaatcaa gcagatggtg 840 aatggagtca aggcagccat gaatgccgca gagcgcgaaa acaactcgct ggtggtgacg 900 cggaattcac tgaagctcag agctgaaaga gccgaagaaa cgctgaatgc aaaactggag 960 gaggaagcgc tacgggagaa agaaccgaaa acgccgcccg gcccaagctc taaaagggac 1020 agggaaacgc ctggagagga ggaggacgca aagaagcaga agcaggggaa tggagacagt 1080 ccggacccgg cgaaggaacc agaaccaacc ccagggaagg agaaggaatg ggagaaggtc 1140 aagaagaaga agcggaagaa aaaagggaag cagaacgagg acacccaaaa acccaagttt 1200 cgcagggagc gtaacaaagg cgaggctttg gtggtcgagg tgaaggaagg tgtttcgtac 1260 gcagacctcc tccggaaagt acgaaccgat ccggaactca aggagcttgg cgagaacgtg 1320 gttaaaacca ggcgcaccca aaccggagcg atgctttttg agctgaagaa ggatcccgcg 1380 gtcaagagct cagcttttaa gtccctcgtc gagaaagccg taggctacga gtcgaaggta 1440 agagcgctat caccggagac aacgatcgag tgcaggaacc tggacgagat cacgacggag 1500 gaagagctag aagatgcgct gatcgttctt ctggatgacc gtacgacacc gatggcaatc 1560 cggttgagga aagcctacgg cggcacgcaa attgcgtcga tccgactatc gacgccttcg 1620 gcgtctaagc tgctggaaac cggcaaggtc aaagtagggt ggtcggtgtg cccactgagg 1680 cctgttcctc gagtgaccca gcagatgacg aggtgtttcc gctgtatggg cttcggccac 1740 caggcgagaa attgcgacgg tcccgatcga accaacagtt gcagaaggtg tggtagagaa 1800 ggccacatgg caagagactg caaaaatcag ccgaagtgcg tgctctgtaa agaaggcgac 1860 ggcaatagcc atgcgacggg tggctttaat tgcccggtgt acaagaagct ggcctcgggc 1920 aaaaagtaat ggaggtgtcc caggtgaacc tcaatcactg cgacactgca cagcaactgc 1980 tgtggcagtc gaccgcggag acggggtgtg acgtggcaat tattgcagaa ccgtaccgag 2040 ttccacacga caacggaaac tgggccgcgg atacagcaag aatggcggcg atacacgtga 2100 tggggcggta ccccatacag gaagtggtct cgagggcgtt tgaaggattc gtgatcgcca 2160 aagtaaacgg aaccttcttc tgtagctgct atgctccccc aagatggacc ttggagcagt 2220 ttcagcagat gctggatagt ctgaccgatg aactgatcgg acgaagcccg atcgttatcg 2280 gaggtgactt caacgcgtgg gcggtcgagt ggggtagcag atgcaccaat gctagggggc 2340 atagcctaat ggaagctctg gcaaagctag acgttaggct ggcgaatcgc ggaaccagca 2400 gtaccttccg caaagacggt cgtgagtcca ttatcgacgt tacgttctgt agcccgcgac 2460 tggcggccga catgaactgg agggtgagtg aggactatac ccatagcgat caccaagcga 2520 tccggtacag catcgggaga cgagcccctg taccagatag gagcagccgg tcctacggaa 2580 ggaaatggaa gctgcagtac ttcgacgagg gtctcttcgt ggaagcgctc cattggtgtg 2640 atggtcccca agacttgagt gccgacgtgc taacagcaca actggtgaca gcatgcgaca 2700 caaccatgcc gcggagactg gagccaagga actgtcgtcg tccagcctac tggtggaatg 2760 aagaactcgg tacccttcgg gcaagttgcc tcagcgccag aagacgagtc cagagagcaa 2820 gatccgaagc aactagagag gagtgcagag aggagtaccg gtctgcaaag gccgcgctca 2880 agaaagcgat caaatgcagc aagacaaact gcttcaagga gttatgccaa gacgctgatg 2940 caaacccttg ggggagcgca tatcgtgtag cgatggcgaa gatcagaggc ccatcgatgg 3000 tggctgaaac gtgtcccgac aagctgaagg tcattgtgga agggctcttc ccaagacatg 3060 acccaacgac atggcctcct acaccgtaca acgacgaagg gggtagcaac gccgaaggtc 3120 atctgatcac caacgaggaa cttgtggcag tagcgaagag attgaaggtg aagaaagctc 3180 ccggcccgga tggaatcccg aatttcgccc tgaaatcggc ggttctagca ttcccggaca 3240 ggtttcgaac agtcctgcag gaatgcctgg acgaaggaca cttccccgac ccgtggaagg 3300 ttcaaaagct cgtgttgctg ccgaagccag gcaaaccacc gggggaccca tcatcgtata 3360 ggcctatatg tttgctggac accctcggaa agcttctgga acggatcatc cttaaccggc 3420 tgaccaagta cacggagagc gagcatggct tagcagcgag gcagttcggc ttccgtaaag 3480 ggagatccac ggtggacgcc atccggaaag tggtcgagaa agccgacgaa gcgcggagga 3540 aaaaacgcag ggggaaccgt tgctgcgcaa tagtcacgat tgacgtcaag aacgcgttca 3600 acagtgcgag ctgggcggcc atagcagcag cgctgcacaa aatgaaggtg cctgactatt 3660 tgtgcaggat cttgaagagc tacttcgaga accgcgtgct ggtctacgac actgccgatg 3720 gacaaaaaac cgttgttgtt accgcgggag ttccgcaggg atccattctg ggttcagcac 3780 tgtggaatgg aatgtatgac ggagtgttga cactgggact acccaacggc gtagagattg 3840 ttggctttgc agacgacata gtgctgacgg taaccggcga aaatgtcgag gaggtcgaag 3900 tgctggctat ggaggcaatc gcaatgatcg agaactggat gctcgaggtg aagctgcgga 3960 tcgctcacca caagacggag atggtgctgg ttagtaacca caagaaggtg cagcaggccc 4020 agatwcacgt tggagaacac gtcgtgcact cgaagagagc gctcaagtac ctcggggtga 4080 tggtggatga ccggctgaac ttcaacagcc acgtcgatta cgcctgcgag aaggcggcta 4140 aggcgatcat ggcactgtcg aggatgatgc cgaacaacgc tggacccagg agcagtaggc 4200 gccgcctctt ggcaagtgtc gcgacgtcca tacttaggta cggcggaccg gtatggtgga 4260 cggcgctggg gacgaagcga aatcgagcgc tgctcgacag aacgcagaga ctgatggcca 4320 tgcgggttgc aagcgcgtac aggaccattt cgtcggaagc agttggcgtc atagccggaa 4380 tgatccccat cggcatcaca ctggaggagg acaccgtgcg ctacacccgg agaggcacga 4440 gaggtatccg ggaagctgcg agagccgaat cgctggcaag gtggcaacgt gagtgggaca 4500 ccacggagaa aggcagatgg acgcatcggc ttatcccgtc cgtatccacg tgggtgagca 4560 gaaggcatgg agaggtcacc ttccacctca cacagttcct gtcgggccat ggctgcttca 4620 ggaagtacct gcacaggttt ggacatgcag agtctcctct ctgtccggac tgcgtcgatt 4680 gcgaggaaac accggagcac gtggtgttcg cctgccctcg cttcgaggca gcgcgaagcg 4740 aaatgctggc cattatcgga gcggacacca gcccggataa tgtggtgcga agaatgtgca 4800 gcgacatcgc caagtggaat gcggtcgtcg gagcggtgac gcagatcact tcggctctcc 4860 agcggaaatg gagagacgat cagaggagga acgactagga gcctagtcga aaacccacga 4920 gtgtggctgt gaaggagagc acgttatgat ggtcggctct accaaatcgg tacacgtctc 4980 gatggtcaca ggagtcgaga acccacgagt gtggctgtga aggagagcac gttatgacgg 5040 ttggctctac caaatcggta cacgtctcga tggtccaagg agagggctgc atatgactag 5100 ccgatcaaaa gcaacgcgat tcttgggcgc ggttaaaccc tcgcatggac tcatatgtat 5160 gtaggacagg aaatggttct agcacccggc atggatcctg taagtagact agtgcagaaa 5220 atgcaacgcc tccccccgaa gttataccga aaggtggtcc cggggggaca agggcacggc 5280 gttcaaggac tggtttagtg ggtcgggaaa actctttttt tgttttccca accccacact 5340 acctgaggaa tgaattctca ggtgtctggt agcagattcc gaccttgtaa aaaaaaaaaa 5400 aaaaaaaaa 5409 // ID Penelope-5_AAe repbase; DNA; INV; 3346 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Penelope-like element family from Aedes aegypti. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-3_AAe; Penelope-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3346 RA Jurka J.; RT "Penelope-like elements from the yellow fever mosquito."; RL Repbase Reports 11(4), 1438-1438 (2011). XX DR [2] (Consensus) XX CC ~99% identical to consensus. XX FH Key Location/Qualifiers FT CDS 610..3045 FT /product="Penelope-5_AAe_1p" FT /translation="MSEIKVTTSKCFRLYGQXIMKKRYLLLDIRFLTQCNR FT KGLVPRFICIKPTVKNKASEKAVKSAQKIWLTEEIRCKHQQLSMVEEQLYS FT LHLQLLKQMEYHTAAECIVHKCSEEKKTCWQSVLDTVNRNLSVCVKRKKQK FT HRKKLLSLADAQATRNARRLPQIIDNFVVNLSSAQLTKWEHDLLNKGLNFA FT LPPQCAPVAEMVNNIESAIQYNSFPTKSALRHDIQRCILDAAGKQNNETQN FT DSNTERVVRQLKTHDVIYSRADKGNAVVMMDKEDYDARVLDMINSGPYDEC FT KFKNGKPKDPLNALIEEANSTRQKIARLMGEDKLERRLNVPNSKVASLYCL FT PKIHKNPIAMRPICSNICTPTEKMAAWLVNEMKGYPVTHGKSVKNSVELVE FT KLEKVEIRRGEILVSFDVAALFPNVPVPDALCSLRRHLERHRAPPNHINAY FT LTVAEVCMNQNYFMFRGKFYKQTFGLSMGSKLSPLLADVFMSDFETDLQKE FT KLFPRIWRRYVDDIFAVVKERYLSQILEMLNSRHTTIKFTVEKEMDGKLPF FT LDLMITKKEDNTLRFGIYRKPTSTDRYITADSNHYGAQKQAAFHSMAHRLF FT NIPMEKEEFVEERMKIHEAAAVNGYDEEFVNKILRKHERKKHRQIATTLQP FT HKEEPLRISLPFYPKLTNPIQGILKQYGMQAAYKSGHTLKEYLCALKDKTP FT AEDLSGIYEIPCKDCPSVYIGQTRRKFKIRLREHRNAVDNERVHESSVAVH FT SVELKHNIDWDKAKLKKSVRKVSHLNAWESMFISNAEQPLMNDDDAPISSP FT LFHLTKLDFK" XX SQ Sequence 3346 BP; 1041 A; 706 C; 720 G; 878 T; 1 other; ctgaatggcg ctttcaatat tattgaccat ctcagctaca ggggcgcatt ggggcggtag 60 ggcaaaattg agacccttat tgagaagatc atgttcgcat tcggttaatt gggcagatga 120 tagattcact acaaagttat caatgatctg cggtaaacga cgtgcattcc tagtcgcctg 180 tgcatcggca agcgagagca attttttacg atgcttttgc ttcttccttt tcacacacac 240 tgagaggtta cggtttaccg tatctaacac agattgccaa cacgttttct tctcttcgct 300 acatttatga acaatacact cagctgctgt gtggtactcc atttgcttta acagctgtaa 360 gtggagcgaa tacagctgtt cttcaaccat tgacaattgc tgatgtttgc acctaatttc 420 ttcggtaagc cagatttttt gcgcactttt cacagccttc tcactagctt tatttttgac 480 ggttggctta atgcaaatga actttggaac aagacctttt ctgttacact gtgtaagacc 540 aatgtgtccg gtctcgcgtc gcgtcttgtg agtgtactat agttcttacg ttagcacaaa 600 atcttaaaaa tgagtgaaat aaaggttaca acctcaaagt gctttcgcct ttatggccaa 660 ncgattatga agaagagata tttgctatta gatattcgat tccttacaca gtgtaacaga 720 aaaggtcttg ttccaaggtt catttgcatt aagccaaccg tcaaaaataa agctagtgag 780 aaggctgtga aaagtgcgca aaaaatctgg cttaccgaag aaattaggtg caaacatcag 840 caattgtcaa tggttgaaga acagctgtat tcgctccact tacagctgtt aaagcaaatg 900 gagtaccaca cagcagctga gtgtattgtt cataaatgta gcgaagagaa gaaaacgtgt 960 tggcaatctg tgttagatac ggtaaaccgt aacctctcag tgtgtgtgaa aaggaagaag 1020 caaaagcatc gtaaaaaatt gctctcgctt gccgatgcac aggcgactag gaatgcacgt 1080 cgtttaccgc agatcattga taactttgta gtgaatctat catctgccca attaaccaaa 1140 tgggaacatg atcttctcaa taagggtctc aattttgccc taccgcccca atgcgcccct 1200 gtagctgaga tggtcaataa tattgaaagc gccattcagt acaacagctt cccgactaaa 1260 tccgcattac gtcacgatat ccaacgttgt attttggacg ctgcaggaaa acaaaacaac 1320 gaaacacaga acgattccaa tacggagaga gtagttcgac aattgaaaac acatgatgtg 1380 atctactctc gagccgacaa ggggaacgct gtggtgatga tggataaaga ggactatgac 1440 gctagggttt tggatatgat caattccggc ccatacgatg aatgtaaatt caagaatggt 1500 aaacccaaag atcctctcaa tgcgttgatc gaggaagcaa acagcacacg gcaaaagatt 1560 gcacgtttga tgggtgagga taagcttgag aggagactga atgtcccaaa ctcaaaagta 1620 gcatcgttgt actgcctccc aaaaatccac aaaaatccta tagcaatgag gcctatttgt 1680 tccaatattt gcactccaac ggaaaaaatg gcagcgtggt tagtaaacga aatgaagggc 1740 tatccggtca cccacggaaa gagtgtgaaa aattcagtgg aattagtgga aaaactggaa 1800 aaagttgaaa ttcgtagagg agaaattttg gtttcattcg atgtggccgc actgtttcct 1860 aacgtacctg taccagacgc gctttgcagt ttacggaggc atttggaacg acatcgtgcc 1920 ccacctaacc acattaatgc ctatctcacc gtggctgaag tgtgcatgaa ccaaaactac 1980 ttcatgttta ggggtaagtt ttacaaacaa acctttgggc tcagcatggg tagtaagctc 2040 tcgccactat tggctgacgt tttcatgagc gattttgaaa cagaccttca aaaagaaaaa 2100 ctttttcctc gaatttggcg acgctacgtt gacgatattt tcgcagtagt gaaagaacgc 2160 tatttgtcac aaattctcga aatgttgaac tcccgacaca ccaccatcaa attcacggtt 2220 gagaaagaga tggatggaaa actccctttc ttggatttga tgataaccaa gaaagaggat 2280 aacaccttga gattcggtat ctaccgaaaa ccgacatcca ccgatcggta tataacggcc 2340 gattctaacc actacggagc acaaaaacaa gccgcgttcc actcaatggc acaccgcctt 2400 ttcaatattc ccatggaaaa agaagaattc gttgaggaga gaatgaagat tcatgaagct 2460 gcagcggtaa atggttacga tgaagaattt gtcaacaaaa tacttagaaa acacgaacgg 2520 aaaaaacatc gccaaattgc tactacactt caaccccata aggaagaacc cctacggata 2580 agcttgccgt tttatccaaa attaaccaac cccattcaag gcatactgaa acagtacgga 2640 atgcaggcag cctacaaaag tggccacaca ctgaaggagt atttgtgtgc cctcaaagac 2700 aagacccctg cggaagatct gtctggaatc tatgagattc catgcaaaga ctgcccatca 2760 gtttatattg gccaaacgcg aaggaaattt aaaatccgtt tgagggaaca ccgaaacgct 2820 gtggataacg aacgagtcca tgagtctagt gtggcagtac attctgttga actcaaacac 2880 aacatagact gggacaaggc gaagcttaaa aagagcgtcc gaaaagtttc ccatttaaat 2940 gcttgggaat ctatgttcat ctccaatgcc gaacaaccgt tgatgaatga cgacgatgcc 3000 ccgataagtt cgccgctttt tcacttgacg aaactggact tcaaatagtg ttatctcttt 3060 ctgacgtgta ctggagaaag tacgaagctg ttttgttctc ttctatgtaa tcagttggac 3120 atcgtcctgt agatgggcaa catcgagccc gaaaccggtc gacgaaaaag gtaaatgttt 3180 aaatccttta tatgtttgta agaacaataa gtgttgtgtc cagtctcgcg tcgcgtcttg 3240 tgagtgtact atagttctta cgttagcaca aaatcttaaa aatgagtgaa aactaagttg 3300 gaggttggat gtaaacttca actccaaaca ctagcgtttt ctcggt 3346 // ID BEL-230_AA-I repbase; DNA; INV; 5268 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-230_AA_; KW BEL-230_AA-LTR; BEL-230_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5268 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 915-915 (2011). XX DR [1] (Consensus) XX CC Positions [4315-4899] - Integrase core CC 'GGCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 49..5244 FT /product="BEL-230_AA-I_1p" FT /translation="MDSLVRKRTALMGRLAGIRSRVSAMDPAMARSDDLDA FT EMECLKEWWLDYRKLQEGILDLCEDDDLLEDIVKIGNKADQEHNSVKAMIT FT QFQRVVQHREHPIVAASTSTVSQAPEPNATTVGLPELNLPKGILPTFSGDY FT GEWTSFYDLFTSSVHNNPRLTDAQRLLYLKTYTSGAAAALLRHVKVEDRAY FT QGALEALRKRFDRKDQIVSHQIQRYLDIPTTNVATAANLRRAYSTADDVVR FT ALKASEREERDCWLIHLLLAKLDPETRQLWANKTSITDVSADPSTINDFLE FT FLDQRAYTMESAHRPSSHVSGKVPPKTTSFRQSSSFIATNEPSTTNCVLCK FT ERAHLLYMCRRFQNLQPSERLQLAYQHGLCLNCLREGHGRNSCTSIKCKKC FT SQQHHTLLHDASHRFHERTHQTPVRTINNLSYNNAYTSVFLATAVVHIIDA FT QGKPHAARALLDSASQACFITTALKSRLGLRSEQIDIPLQGISGLSTRITE FT AVTIEMRSRTSKYRQQLQCAVLYKISDQIPQKPVDISEWDLPSSKQLADTG FT FNIPSNVDLLIGAGVFYKLLGNERISLGETRPILQSTQLGWVVAGIYDTET FT SANTSSTSSAALCFTTSLHQEETNELNNLVARFWELEDLQPTKHLTEEERL FT CEEHYTQTTVRDSSGKYIVKLPFRKDPAVIGETAYSALHQFNAVSRKLTRN FT PRLSELYHDYINEFIEAGYISKAEPASPPDRIIYLPHHGVLKEQSTTTKLR FT VVFNASAKSSNGVSLNDLLMVGPVVQTNLYAILLNFRTPKIAITGDIERMY FT LQINVSDADRHRIRMLWQERDQHLAEYTLNTVTFGMACAPFLATRTLKQLG FT DDDGHLYPLAQEAMEDFYVDDCLTGASSIEEAIEKRRQITELLQRGGFSIR FT KWATNEPAVLHDIPPPDRAVSLSQELDPDASLKTLGIRWHFGEDVFTFSST FT CDKQKTVSCKRDILSTIAKIFDPLGLIGPAIIPAKAFMQELWQLQIDWSDP FT LPESMKQRWQNYVENLHDVESIKIPRRCMNIDHPMRLLLHGYCDASNIAYG FT AVLYLRAIDKAGNVSSRLLCSKSKVVPINRPTIPRLELCAAVLLARLIDTV FT KAALRIPIHQTVAWSDSTTTLAWIAGSPTRWKTFVSNRVAEINASLPAVNW FT RHIPGTDNPADLLSRGLTPSALESSQLWWEGPQWNDASCMESTGIQLTTEE FT EAIVQKEMRKQDSVQTFVVITNEVIDQLMHRQSSFIKLVRIVAWVNRFIKN FT CQRSDEAPSVGPLSVEEHETAKAILVRYSQQLAYPAEIQALAQQKELPKHS FT KLLSLSPFLDDGLLRVGGRLARSKLAYDIKHPLILSPTSRLAQLIFHHEHL FT QNHHLGAQSLLATVRRTYWIPHGRNLARNTVWKCVPCFKFNPKRGLQQQIM FT GQLPPERTQPVPPFYISGVDYGGPITLVQRRGPGSPTTKGYIAMFVCFVTR FT AVHIEAVSKLSSKAFLAALRRFVARRGHCGHLYSDNGTNFVGAAKEMIQWY FT RTIQSPDHNNSVADMLARGGTQWHFIPPGSPHLGGLWEAAIKSAKHHLNIV FT TRNVRLTFEEFSTLLTEIEGILNSRPISPASSDPNDVQPLTPGHFLIGRPL FT TTPNQQQIIPSSDETYDIRFKYIHELRQHFWDRWSREYVPELQIKGKWHQS FT TKGLQVGDLVLLRHQRLAPTQWALGRVVELKPSSDGHPRLVKLKVKDGELL FT QSIHNLHKLPMN" XX SQ Sequence 5268 BP; 1517 A; 1400 C; 1196 G; 1155 T; 0 other; ttaaaatttg gcgctgtaga caggatagta agtgaaagtg aagtagtgat ggatagccta 60 gtgagaaaac gtaccgcctt gatgggccgt ctcgccggta tacggtcccg agtgagtgct 120 atggatcccg ctatggcacg aagtgatgac ctagatgctg aaatggagtg cctgaaggaa 180 tggtggctgg attataggaa gcttcaagaa ggaattttgg acctctgtga agacgacgac 240 ttgctagagg acattgtcaa gattgggaat aaggcagatc aagagcacaa ttcggtgaaa 300 gccatgatca cacagtttca gcgtgtggtt caacaccgag aacatccgat cgtcgcagcc 360 tcgaccagta ctgtgagtca agcacccgaa ccaaacgcta caacggtcgg tctacccgag 420 ttgaaccttc cgaaaggaat tctcccaaca ttctcgggag attacggaga atggacgtcg 480 ttctacgatc tattcaccag ctcggtccat aacaatccac gattaactga cgcacaacga 540 ctactgtacc tcaaaacgta tacctcaggt gcagcagcag cgttacttcg tcacgtcaag 600 gttgaagacc gagcctacca aggggcgtta gaggcattga ggaaacgttt cgacagaaag 660 gatcagatcg tgagtcacca gattcagcgt tatctcgaca tacccactac caacgtcgca 720 acggccgcca atctccgacg agcgtacagt accgccgacg acgtagtacg agctctcaag 780 gcgtctgaaa gagaggagag agactgttgg ctcatccatc tactactcgc caagctggac 840 ccagaaacac gtcaactctg ggcgaacaag acttccatta ccgatgtatc agctgacccg 900 tctaccatca acgacttttt ggaattcctc gatcaacgag cgtataccat ggaatcggct 960 catcgaccat cgtcgcatgt gagtggcaag gttccaccga agacaacgtc atttcgtcaa 1020 agctctagtt tcatcgcaac caacgagcca tcgacaacga attgtgtttt gtgcaaagag 1080 cgagcacatc tattgtacat gtgcagaagg ttccaaaatc ttcaaccatc ggagagacta 1140 cagttagcgt atcaacacgg cttgtgcctc aactgcctcc gagagggaca cggaaggaac 1200 tcatgcacgt ccatcaaatg taaaaaatgc agccaacagc atcacaccct tcttcacgat 1260 gccagccatc gtttccacga acggacgcac cagacacccg tccgtaccat caacaaccta 1320 tcgtacaaca atgcgtacac atcggttttc cttgccactg cagtcgtaca catcatcgat 1380 gcgcaaggga agccacatgc agcacgagct ctgctcgatt cagcttcgca agcttgtttc 1440 atcaccacag ctttgaaatc acgcttgggc ttgcgaagtg aacaaatcga cataccacta 1500 caaggcattt ctggactatc taccagaatc accgaagcag taacgatcga gatgcgatcc 1560 cgcacaagca agtatcggca gcagttacaa tgtgcggtgc tctacaagat atcggatcag 1620 atcccgcaaa aaccggtcga catctcggag tgggacttgc cctccagtaa acagctagca 1680 gacactggat tcaacattcc cagcaacgta gatctactca ttggagcagg agttttctac 1740 aaacttctcg gcaacgaaag aatatcgctt ggagagacgc ggcctattct gcaaagcaca 1800 cagcttggat gggtcgtggc cggcatctac gacactgaaa cttcagcaaa tacttcatca 1860 acttcttcag ccgcactttg tttcaccact tcgctacacc aggaagagac aaatgaattg 1920 aacaatttgg tggcacgatt ctgggagttg gaagatctac aaccaacgaa acatctcacc 1980 gaagaagaac gtctctgtga agaacactac acacaaacca ccgttcggga ttcatctgga 2040 aaatacatcg tcaagcttcc gtttcgtaag gaccctgcgg tcatcggaga aactgcttac 2100 tctgctttac accagttcaa cgcagtatca cgtaagctga cgagaaatcc aaggctaagc 2160 gagttatatc acgactacat caacgagttc atcgaggctg gatacatatc caaagcagaa 2220 ccagcaagtc caccggatcg catcatctat ctaccccacc acggagtatt gaaggaacag 2280 agtaccacta ccaagttgcg ggtggtgttc aatgcatcag caaaatcctc gaacggcgtg 2340 tccctcaacg atctcctcat ggtgggccct gttgttcaaa ccaatctcta cgcaattttg 2400 ctgaacttcc gtacgccaaa aatagccatc actggggaca tcgagcgaat gtatcttcag 2460 atcaacgtca gtgacgcaga caggcatcgc atcagaatgt tgtggcaaga gagggatcag 2520 catctcgccg aatacactct gaacactgtg accttcggaa tggcttgcgc accgtttctg 2580 gcaacacgaa cattgaagca actgggcgat gacgatggac acttgtatcc tttggctcag 2640 gaagcaatgg aggactttta tgtcgacgac tgtctgactg gtgcatcatc cattgaagaa 2700 gcaatcgaaa aacgtcgaca gatcactgaa ctcctgcaac gcggcggctt ttctattaga 2760 aaatgggcaa ctaatgaacc agcagtactg catgacattc ccccacccga cagagcagtc 2820 agcctgagtc aagagctcga tccagatgcc agtctcaaga cactaggcat ccgttggcac 2880 tttggagaag atgtcttcac gttcagttca acctgcgaca agcaaaaaac ggtctcgtgc 2940 aagcgagaca tattgtccac gattgccaag attttcgacc ctctggggct catcggaccg 3000 gcgatcattc ctgcaaaggc attcatgcag gaactctggc aacttcaaat cgactggtca 3060 gatcctttgc cagaatctat gaagcaacgg tggcaaaact acgtagaaaa tctccatgac 3120 gtcgaaagca tcaaaattcc acgccgatgc atgaacatcg atcatccaat gagactactt 3180 ctgcatggct actgcgacgc atcaaacatc gcatatggag cggtgctata tcttcgagcc 3240 atcgacaagg caggaaacgt cagctctcgg ctgctatgct ccaaatctaa ggttgtacca 3300 ataaacagac ctacgatccc tcgactagag ctttgcgcag ccgtgctact agcccgtctc 3360 atcgataccg taaaggcagc cctgcgaatc ccaattcatc aaacggttgc ctggagtgat 3420 tctacgacta cactggcctg gatagctggt agcccaacac gatggaagac ctttgtctca 3480 aatcgcgtcg cagaaatcaa cgcttctctt cccgccgtga actggaggca tatcccagga 3540 acggacaatc cagcggactt gctaagtcgt ggactaacac caagtgcctt agaatcgtct 3600 caattatggt gggaaggacc ccaatggaat gacgcctcat gcatggaatc gacgggtata 3660 caactaacga cagaggaaga agctattgtc caaaaggaaa tgaggaaaca agattcggtt 3720 caaacctttg tcgtcatcac caatgaagtt atcgaccaac taatgcatcg tcagtcatcc 3780 ttcatcaagc tagttcgcat tgtagcttgg gtaaatcgat tcatcaaaaa ctgccagcgt 3840 tctgacgaag cacctagcgt tggaccgctt tccgtggaag agcacgaaac agcaaaggca 3900 atactcgttc gatactctca acaattagca tacccagcgg agatacaagc cctagcacag 3960 cagaaggagt taccaaaaca cagcaagctg ctttcgttat ctccattttt ggacgacggt 4020 ctgctcaggg tgggcggtag gcttgcccga tcgaagttgg cgtacgacat caagcatccc 4080 ttaattcttt caccgacaag tcgtctagct caactaattt tccaccacga gcatcttcaa 4140 aaccatcatc ttggcgcaca atctctcttg gctacagtca gacgcaccta ctggatccca 4200 catggtcgaa atcttgcgcg caacacagtg tggaaatgcg tgccttgctt caaattcaat 4260 ccaaagcgtg gcctacaaca acaaataatg gggcagctac caccagaacg aacccaacct 4320 gtgccaccct tctacatttc cggagtcgat tacggaggac cgatcacgct agtacaacga 4380 cgaggaccag gatcaccaac cacaaaaggg tatattgcca tgttcgtttg tttcgtcact 4440 cgagccgtcc atatcgaggc agtcagcaaa ctttcatcga aggctttctt ggcggcgtta 4500 cgaaggtttg tcgctcggcg aggccattgt ggacatctat acagcgacaa cgggacaaac 4560 ttcgtcggcg cagcgaagga gatgatccag tggtatcgca caatccaaag ccctgaccat 4620 aacaactctg ttgccgatat gttagcacga ggaggaacac agtggcactt tatcccacct 4680 ggttctccac atctgggtgg tctctgggag gccgccatca agtctgcaaa gcaccatttg 4740 aacatcgtca ccagaaacgt ccggctaaca tttgaggaat ttagcactct cttgacagag 4800 attgaaggta tcctaaactc acgtcccata tcaccagctt caagcgaccc aaacgatgtc 4860 caaccattga ctcctggaca tttcttgatt ggacgaccat taaccacacc gaatcaacaa 4920 caaatcattc caagttcaga cgaaacctac gacatacgct tcaaatatat ccacgaactg 4980 aggcaacatt tctgggatcg ttggagtcgt gagtatgttc ctgagctaca aattaaggga 5040 aaatggcatc agtcaaccaa ggggctacaa gttggtgact tagtcctgct tcgacatcag 5100 cgattagcgc ccactcaatg ggcattggga agagtcgttg aacttaagcc ttcatcggat 5160 ggccatcctc gactcgtcaa gctaaaggta aaggatggcg agctgctaca atcaatccac 5220 aatctccaca aactaccgat gaattgactg cgcaattgat gggcggaa 5268 // ID TRAS3_SC repbase; DNA; INV; 1923 BP. XX AC AB046673; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 30-JUN-2010 (Rel. 15.07, Last updated, Version 3) XX DE Samia cynthia TRASSc3 gene, non-LTR retrotransposon, partial cds. XX KW R1; Non-LTR Retrotransposon; Transposable Element; KW endonuclease domain; reverse transcriptase domain; TRAS family; KW TRAS3_SC. XX NM TRAS3_SC. XX OS Samia cynthia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Saturniidae; Saturniinae; Attacini; Samia. XX RN [1] RA Kubo Y., Okazaki S., Anzai T. and Fujiwara H.; RT "Structural and phylogenetic analysis of TRAS, telomeric RT repeat-specific non-LTR retrotransposon families in Lepidopteran RT insects."; RL Mol. Biol. Evol 18(5), 848-857 (2001). XX DR Genbank; AB046673; Positions 1 1923. XX SQ Sequence 1923 BP; 445 A; 473 C; 591 G; 414 T; 0 other; gggactgtta aagcggctat cgttgtttat ggagataagt tcggggtcac tgttgatccc 60 ggactcgtcg acaaaaacat cgccgctgca gtgctgcacg cgggccacct gtcgctaggg 120 gtgatctccg tctacttcga gccgaacgaa cccatagaga cgtacattgt acgacttgag 180 agaatctgcg acaaattagg agcgctgaac ctgataatcg gtggcgacgt taatgccaaa 240 agcctgtggt ggggatccag ttccgaaggt cataggggtg aggcgtaccg ctcctttctg 300 gatgccactg gactgcagat cctcaatgaa ggcgacttac ccacattcca ggtcgttagg 360 gggggccgct tgttcacaag cattgtagac gttaccgtct gcagccccac tctcctcggc 420 aggatcgatg actggaaggt cgatatgaat ttaacatctt ccgaccataa ctccatcacc 480 ttctcgatac gcgtggacca gcctctgcct agtcagaggc cggtcactac gcgtatctac 540 aacacaaaga aggttgtatg gtcggaattc atatccacat tccaagagaa actgtccgag 600 aggagtctga cggcatgtaa tgtggataag gtcgaggaca ttgagggtct ggagacagtg 660 gtgtccgatt acgtcatctg cattgaggaa gcctgtaata aggtagttcc caaggcaggt 720 ggcgtaaaga aagtggcacg tccaccctgg tggtctgaag atctggaccg cctaaaaagg 780 gaggcgacgc gacggaagcg caggatccgc tgcgccgccc ctagcaggag gcagtacgtc 840 gtcaaggatt atcttgatgc tcttgagttg tacaagaggc aggcagcgga cgcccagacg 900 aggagctgga aggagttttg tacgacacag gaaagggaga gcctctggga cggaatctac 960 agggtactta gacggacaga gcggagacgt gaggaagtgc tgctcaagga ccccaacggg 1020 gtcattcttg acccgcaggg gtccgtggag cggctagcct ccgtgctctt tccggaggat 1080 actgtagagg atgaaacaga ggatcaccgg gccgtgaggg aaggaacgga ggggatcctc 1140 ccggccgata ttcgggggct atccgcggac gatccgccgt ttacccggga agaagtgatg 1200 cgggtgtgta aagagattca cccgctgaag gctcctggga acgacggatt aaccgcggac 1260 atatgcacac gtgcgattct agggggggga gaggacgtgt tcctggctct tgcaaacaag 1320 tgcctggagc tgtctcactt ccctagaccc tggaaggtag cccacgtgtg catactccgg 1380 aaacccggca gggaggacta ctgcgaccct aagtcgtacc gcccgatcgg cctacttcct 1440 gtgttgggaa agctcctgga aaagctcttc gtccgacgtt tacgctggca tctgctgcca 1500 aaactcagcg tgcgccaata tgggtttatg ccccagcgtg ggacagagga ctcgctctat 1560 gatctggtga atcatatcag gacccgtgtg gtggccaagg aggtcgttac cctggtatcg 1620 ctagacatag agggggcctt tgataacgct tggtggccgg gcttgaagtc ccaactcata 1680 gccaaggagt gcccaaggaa tttgtacggt atagtctctt catacctaga ggaccggaga 1740 gtagagctca attacgccgg tataaaggtg agtcgggaaa gttccaaggg atgtgtccag 1800 ggttctatag ctggaccgtc cttttgggac gttatcctag actccctttt ggtggagctc 1860 gactccgcgg gagtgtactg tcaggctttt gctgacgacg tggtcctggt tttcgacgga 1920 gac 1923 // ID Copia-2_BM-LTR repbase; DNA; INV; 190 BP. XX AC nscaf3015; XX DT 19-MAR-2010 (Rel. 15.04, Created) DT 19-MAR-2010 (Rel. 15.04, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; LTR-1_BM_; KW Copia-2_BM-I; Copia-2_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(4), 586-586 (2010). XX DR Genome; nscaf3015; Positions 2279247 2279058. XX SQ Sequence 190 BP; 65 A; 29 C; 23 G; 73 T; 0 other; tgttgaaata attttaaatt tggcgccttg acactcaatg tgaatatgag atgatataat 60 ataatctatg gcactacttt ttttctctac tccgtgacat ccaccgcatt gtattttcat 120 ctacatttaa atattaaata aatacaaagt ataactgtgt ataacttagt gttatttata 180 taatcgaaca 190 // ID Crack-8_HMa repbase; DNA; INV; 4327 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE Crack-type non-LTR retrotransposon: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-8_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4327 RA Jurka J.; RT "Crack-type non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 787-787 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 97..711 FT /product="Crack-8_HMa_1p" FT /translation="MSNDNVNIIKAISKDVEELKTSLNYHEELVEKKIKTA FT VITVEKNKNYNETKNNKSEFTYVKKKLREMEDRSRRNNLRVDGIKEEDNET FT WCDSEAKVHKLFDEQLGLKSIKIERAHRTGLRNNKKPRTIVLKLLDFKDKI FT AILTKSSTLKGKNIYINEDFCAETNLIRKDLKEKMKVERQSGKFAYISYDK FT LIVRDWNQKKSNLCS" FT CDS 882..3977 FT /product="Crack-8_HMa_2p" FT /translation="MVSNMTENENYFENSSFNVFRLNNLNTLDEHDPDKII FT FDDYSLMNTEAQYLFPDEIKQYLSDIEPFENLSLLHLNIRSAKANFENFKI FT FLEESNFIFNIICLSETWLTDDAYSESCRFDLLEYDAIHLQRKSKKRGGGV FT VIYVKNSLRFKMRNDMCISECNGEFVSIEIVNDKTKNILVTCCYKPPNAST FT ENFSNHLQNIIQKVSLEKKKLFVLGDFNINALNYDNDIESQNFYNDLFRYG FT VIPLINRPTRITRNSATLIDNILTNFLFENSLKKGIVKAPISDHMPIFISA FT NTSNKQKQKQNKVTFTKRLLSLNNQLAFQNELGNIDWSPLESFNDANSMFN FT SFHHNFLNLYEKHFPEKEVTIKIKNLNSPWFSKGLKKASKRKHRLYDKYLK FT NKCIKTKAEYKNYRTLFEKIKRTAKSNYYKKQLEKCQLNSRKTWQVLNEII FT GKPKINKSFFPKILHIKNKTIANENNIANEFNNFFVNIGPKLAAKIPNVNK FT SFKEYLHYNKNQFKNENLTFKEYETAFKSLKRNKSSGIDGINSNIVIDCYN FT ELKVPLFKICKRSLNEGIFPDILKSAKVKPIYKSGDKTDIGNYRPISILSI FT FSKIFERIMFNRLYAFFKDNDLFYSKQFGFQKSTSTEHAILHLINEIKNSF FT ANGEFTLGVFIDLSKAFDTVDHKIMIKKLKMYGIRGVSCRWIADYLSKRTQ FT SIYYGNKKLTNSSLISCGVPQGSILGPLLFLIYVNDLWRASNISTIMFADD FT SNFFISGKDIPQLFVQMNNELDKISLWFNANKLSINSNKTKFSLFHPLKKK FT KIPISFPKLVIENTEVSRERVTKFLGVLIDENITWNKHIAYIGSKISKSIG FT VLYQSRDLLNKHHLKQIYFSFIQSYLNYANIAWCSTSKSKLETLYRKQKHA FT LRIMNFKNKNEHAKPLFEIMNIFTLYEINVYKILCLMYKSKFQQCPSIFND FT LYEIKPNIKYTLRTTGLIEPLCKTKQEQFKITYRAPHIWNKILAKYSNLPG FT VNNINCFQKLIKKNISKYTNLINDFF" XX SQ Sequence 4327 BP; 1756 A; 593 C; 606 G; 1372 T; 0 other; attgaagcaa tactgaaaaa acaagaaaga aattttgcaa atataattaa cgctaatcta 60 aaaataacaa atcagcgtct tgataaagtg gaaaagatgt cgaacgacaa tgttaatatt 120 attaaagcta tatcaaaaga tgtggaagag ttaaaaacga gtttaaatta tcatgaagaa 180 cttgttgaga agaagataaa aaccgccgtt atcacagtgg aaaaaaataa aaattataat 240 gaaacaaaaa acaataaaag tgaatttaca tatgtaaaaa agaaattaag agaaatggag 300 gacaggtcca gaagaaacaa cctaagggta gacggtatca aagaggaaga taatgaaacc 360 tggtgcgata gtgaagcgaa ggttcataaa ttgtttgatg agcaacttgg cttaaaaagt 420 ataaaaattg agcgtgcaca tcgcactgga ctgagaaaca ataaaaaacc tagaactatt 480 gttttaaagt tattagactt taaggataaa attgctatct taacaaagtc atcaacatta 540 aaaggaaaaa acatttacat taatgaagat ttttgcgcag aaacaaacct aataagaaag 600 gatttgaaag aaaaaatgaa agtggaaaga cagtcgggga aattcgctta catatcgtac 660 gataaattaa tcgttcggga ttggaatcaa aagaaaagta atttatgttc ttaatatctt 720 ctatcttttt atatttttat atctttttat atacataagt gtcctatttt atatttttac 780 gaataaaaga gataagtttt ctaaaggaag agagaaataa attaaaaaaa aaaatttaaa 840 taatttaaat aatttaaaat aatattaatt aaaaattaaa aatggtttcg aatatgactg 900 aaaacgaaaa ttactttgaa aatagttcat ttaatgtttt tagattaaat aatttaaaca 960 cgctcgatga gcacgatcct gataaaatta tatttgatga ctattcatta atgaatacgg 1020 aagcacagta tttatttcca gatgaaataa aacaatattt atcggatatt gaaccttttg 1080 agaacttatc tcttcttcat cttaacataa gaagcgctaa agcaaatttt gaaaacttta 1140 aaatattctt agaggaaagc aattttattt tcaatataat ttgtttaagt gaaacttggc 1200 taactgatga tgcatatagc gaaagttgtc ggtttgactt actagaatat gacgcaattc 1260 acttacaaag aaagtctaaa aaaagaggtg gtggtgttgt tatttacgtc aaaaatagct 1320 tgcggtttaa aatgcgaaat gacatgtgca tatctgaatg caacggcgaa tttgtttcga 1380 ttgaaatagt taatgataag accaaaaata ttttagtaac ttgttgttac aaaccgccaa 1440 atgcgtccac agaaaatttc tcaaatcatc tccagaatat tattcaaaaa gtttctctag 1500 aaaaaaaaaa actgtttgtc ttaggtgact ttaatattaa cgccctaaat tatgacaatg 1560 atatagaaag tcaaaacttt tacaatgatt tatttcgata tggtgtgatc cctttaataa 1620 acagaccaac acgaataaca agaaattctg caacattaat agacaatata ttaactaatt 1680 ttttatttga aaactcgtta aaaaaaggaa tagttaaagc accaatttca gatcacatgc 1740 caattttcat atccgcaaat acgtcaaata aacaaaaaca aaaacaaaat aaagtcacat 1800 ttactaaacg cctcctatca ttaaacaatc aattagcgtt tcaaaatgaa ctaggaaata 1860 tagactggtc tccattagaa tcattcaatg atgctaattc catgtttaat agcttccatc 1920 acaatttttt aaacttgtat gaaaaacact tcccagagaa agaagtaaca attaaaatta 1980 aaaacttaaa ttcgccatgg tttagtaaag gtttaaaaaa agcttcgaag cgaaaacatc 2040 gactgtacga taaatatcta aaaaataagt gcataaagac caaggcagaa tataaaaact 2100 acaggacttt atttgagaaa atcaaacgaa ctgcaaaatc aaactattat aaaaagcaac 2160 ttgaaaagtg tcaattaaat tctaggaaaa cttggcaagt actaaatgaa ataattggaa 2220 aaccgaaaat caataaatct ttctttccga aaatcttaca tattaaaaat aaaactattg 2280 ccaatgaaaa caatatagca aatgaattta acaatttttt tgtgaatata ggacctaaac 2340 ttgcggcaaa aattccaaat gtaaataaat catttaagga ataccttcat tacaataaaa 2400 atcaatttaa aaatgagaat ttaaccttta aagaatacga aacagctttt aaaagtttga 2460 aacgtaataa atcttctgga attgatggta taaacagcaa cattgttatt gactgttata 2520 atgagctaaa agtgccttta tttaaaattt gtaaacgttc tctgaatgag ggtatctttc 2580 ctgatatact taaatcagca aaagttaaac ctatatataa atcaggcgat aaaacggata 2640 ttggaaacta tagaccaata tctatacttt ccattttttc taagattttt gagcgaatta 2700 tgttcaatag gttatatgca ttttttaaag ataatgactt attctattca aaacagtttg 2760 gattccagaa aagcacttca actgaacatg caatacttca cttaattaat gaaataaaaa 2820 actcttttgc aaacggagaa tttactttag gcgtgtttat tgacctctcc aaggcctttg 2880 acacagtgga tcataaaata atgataaaaa agcttaagat gtacggcata agaggcgtat 2940 cctgcaggtg gatagcagat tacctgagta aacgtacaca gtccatatat tacggaaata 3000 aaaaacttac aaattcatct ttaatttctt gtggagtgcc ccaaggatct attcttgggc 3060 ctctactttt cctaatatac gtgaatgatc tctggagagc gtcaaatata tcaacaataa 3120 tgtttgctga cgacagtaac ttttttattt caggaaaaga tatacctcaa ctcttcgtac 3180 aaatgaacaa cgagttagat aaaatttcac tatggttcaa tgcaaataaa ctttctatta 3240 attcaaataa aacaaagttc tcattattcc accctttaaa aaaaaaaaaa attcctattt 3300 cttttccaaa actagttatt gaaaacacag aagttagtcg cgaaagagtc acaaaatttc 3360 ttggtgttct cattgacgaa aatattacct ggaataaaca tattgcctat attggcagca 3420 aaatatcaaa aagtattggc gttctatatc agtcacggga tttacttaat aagcatcatc 3480 taaaacaaat ctatttcagt tttatacaaa gttacttgaa ttacgcgaat attgcgtggt 3540 gtagtacatc taaaagcaag cttgaaacac tctatcgaaa acaaaaacat gccttaagaa 3600 taatgaattt taaaaataaa aacgaacatg caaaaccact gttcgaaatt atgaatattt 3660 ttacgttata cgagataaac gtttataaaa ttttatgtct catgtacaaa agtaaatttc 3720 aacaatgccc ctcaattttt aatgatcttt atgaaataaa acctaacatt aaatatactt 3780 tacgcacaac aggcttaatt gaaccattgt gcaaaactaa acaagaacaa tttaaaatta 3840 cgtatcgtgc tccgcatatc tggaataaaa tcttagcaaa atatagtaat ttaccaggtg 3900 taaacaatat aaattgcttt caaaaactaa taaaaaagaa tatttcaaag tatacaaatc 3960 ttattaatga tttcttttaa actgctttgt ttttttgttt gtttgttttt ttgtttttgt 4020 ctttttacat tgaaaagttt ttggttttaa cactatttac cttagttaat taagttttga 4080 ttttaacact attataggaa tttaatattg taaaaacata attattacaa cggttcttga 4140 tgataagact taatggtctt ctgcaagttt cccgcgtttt tatttatttt tgtataacta 4200 actcttatta ttttgtcttt gattttttat atttataact ggtttatttt tgaaagaata 4260 agagtatctg gtattatatt acggaattta attgtaacaa aaacggaaaa ttaaaaatta 4320 aaaaaaa 4327 // ID Howilli4 repbase; DNA; INV; 2631 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Howilli4 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Howilli4. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2631 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (0001)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1309..2304 FT /product="Howilli4_1p" FT /translation="MIRAFEDFEKIHCVNHLLNNAVEKAIAAVPEMGHIVS FT TCTKIVKYFIKSGMNTSLGFSLKSFCPTRWNTVYYLLKSVETNWIEVTTIL FT KDRNQTGRIEGININHLGSIVRVLEAFEQTSKKLEASNSPTIHLILPNINK FT LKKTCQSDCSDIHLIQSLKSALHIQIFSTVVPNLSKYHYLALFLFPPANKL FT IKFSSTEKETVIKDCKTIMQNFVQGSNSSTTETPVTMDEFADFVEVQQVEI FT ADQVKHEITSYANINVSYTLDFDPLAWWNLHANCFPLLYKTSCKIFCIPAS FT SAASERTFSNARNLISEKRCLIAGNSENINKIMFLHSNIK" XX SQ Sequence 2631 BP; 865 A; 452 C; 472 G; 842 T; 0 other; tagagagctg ctcggattca ctcaatacat ataaaaatat gaatggactc gaattgaatt 60 gaattgagtg agtgcttgct aacactggag tgaatgtgtg tgagtgtgaa tgtatattgc 120 ttcttataac tcacagcatc catcgaaaaa agttcgaatg aatgaagtat aaagccgaat 180 gaaaaatttg gctaacacaa ttcgaattcg aattcattcg agttatagct tacaaatttt 240 gtacagtcat tccgtttgtg tcgctgaaaa gaaatggaag tggaaacaga aaccccaggc 300 aactctgtat gtaattttca gtgttttttg tcttttgttt tgtgttgatt attgtattgt 360 agtaatatgg ttaataaatt attttctttg tgcttagata ttatctgtaa ctgtgacctg 420 tccataaatg ttacctatat aatttttttc agatttaatt tagttttatt gaaatatata 480 ggttgtgaaa agcgcagaga aaaatgcaga tgaaataaag aagaagattg atagtggcat 540 gtataagcta attgagaaga aggggcgcag cgaagtgtgg aaattttttt ccaaaataat 600 taatgctgat ggagaggagc tggctgagtt actagcctgt aatatgtgtt tttcggtgtt 660 taagtttaca ggaagcacgt ctaacatggt aaaacacagg tgttatttgg ttaactccag 720 caaatcgatg aaaactatgc ctgtggacgt taatgcagca acgaaaaagg aaggcatact 780 tattatgact gaatgggttg taaagaactg tcgtccttta aaaattgtag atgactccgg 840 ctttaaaaag gtggcgcagt ttttaattaa cgttgggtcg tcctttggat ccaacgtaga 900 cttggaaaaa ttgttaccgc atccaaccac tgtttcccga aacatagcag ctatctacga 960 ggcacatttt ggcccaataa acggtgaaat cgaaaaatat aaggcctccg gatacgctct 1020 cacaagtgac atatggaccg acaattactt aaaagtatcg tatttgtcat gcactatcca 1080 ttacataaaa gatggaatcc ttgttgatcg actcatggct atgaagtcaa tgaagatgtc 1140 ctgtacaggt atatatctca tgaccataag ttgtatagtt gaattcaaat gaattttttt 1200 taattattag gtttaaaaat tcggtctaaa attcaagaga tactgcaaaa cttcggatgt 1260 gatctagaaa tcgacaatcc agttatagta accgatcgtg ggtcaaacat gataagagcc 1320 tttgaagact ttgagaaaat acattgtgta aaccatcttc tcaacaacgc tgtggaaaag 1380 gctattgctg ctgttcctga gatgggtcac atcgtttcta cttgtaccaa gattgtgaaa 1440 tatttcataa agtctggaat gaacacctcg ttaggatttt ccttaaaaag tttttgccca 1500 actcgttgga atacagtgta ttacctttta aagtcagttg aaactaattg gattgaggta 1560 acaacgattt taaaagacag aaatcagaca ggtagaattg aaggcatcaa catcaaccat 1620 ttaggttcca tagtccgagt gctggaagct tttgaacaaa cttcaaaaaa actggaagca 1680 agtaatagtc caaccattca cctcattctc ccaaatatta acaaattgaa gaaaacttgt 1740 cagtccgatt gttccgatat tcacctcatt cagtcactga aatctgcttt acacattcaa 1800 atattttcca cagtagttcc gaatttatct aagtaccatt acctagcctt gtttcttttt 1860 ccacccgcaa acaaactaat taaattttca tccactgaaa aagagactgt aattaaagac 1920 tgcaaaacta ttatgcaaaa ttttgttcaa ggcagtaatt ccagcacaac tgaaacacct 1980 gtaactatgg acgaatttgc tgatttcgta gaagtccaac aagttgaaat cgcagaccaa 2040 gttaaacacg aaataacctc atatgcaaat ataaatgttt catatacgct agattttgac 2100 ccattggctt ggtggaattt gcatgcaaat tgttttcctc tgctttataa gacaagttgc 2160 aaaatattct gtattccagc gagcagcgca gcttcggaac gaacattttc caacgctaga 2220 aatttgattt cagaaaagcg ttgtcttatt gctggtaact cagaaaatat aaataaaatt 2280 atgttcttgc attccaatat aaaatagaaa taaatgtaat aaaatatatg caaaaaaaaa 2340 taagagttta catttcattg gtttttgtct ctttctctct ttcttttcta ttatacaaat 2400 aaatgaaatg aaaaactact tggcaactcg aatccatttg aaaccaactc atttttcttt 2460 acctgtcaca attcattcaa ttctcttttg tgttttagct ttgtgtgtat gagtcctgtg 2520 ctagagcact cactcactca tactttttcg gcctcgcact cgaactcact caactcattt 2580 tgagcattgt caattcgaat aaggtttttt gactactcat gcagctctct a 2631 // ID BEL-160_AA-LTR repbase; DNA; INV; 724 BP. XX AC supercont1.160; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-160_AA_; KW BEL-160_AA-I; BEL-160_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-724 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.160; Positions 670332 669609. XX SQ Sequence 724 BP; 264 A; 117 C; 129 G; 214 T; 0 other; tgtcacgacg agacccccag ttttgagtca actgtggccg actagttggt tggtcttagg 60 tgtatacggt tcaacgattg tcagtgattg agcgtatgga acgtcattga aattggagaa 120 acggagaagg acaaaaacaa tgataaagct ttttggcaag tggtattacg attacgagcg 180 gattgaatta gaaaaatatc ctagctaaat taaatataaa gacaaagtta tgtttgaatt 240 atcttaccta tattattaat acaaagtgag taattatcct aaaacttaat tacatctaag 300 tacgatagat tgcgaaggta tcatgcatta cttattatta gtactttgag aattcgttat 360 tcacacatat tcacttaaat ctagataaaa atactattta caagcgtcag cgtagtttgc 420 agtcgtggaa accgattaac ttaattaatt gcggatagaa gtatattgaa atcagaattg 480 taagtacatt aaacctaaat attaataatg taacctaaaa tttgccaatt catacagtgc 540 accctaaatt cggtaccgat agagtcgcga tacgatccgt gaaatcgaaa ggtcaccaaa 600 ctttgtaagt agtagtaaaa ataatgaaat attaatctaa taaaatacta ttttagctaa 660 aagctaaccg acagcacaaa cgcggtttgc tatatagatc ttggacgatc ccacccccac 720 aaca 724 // ID Gypsy-33-I_NVi repbase; DNA; INV; 6762 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-33-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6762 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 1002-1002 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 214..1929 FT /product="Gypsy-33-I_NVi_1p" FT /translation="MADAGKNIFSSTLRKSGTVTRSASAEILKSQLNDTLN FT AVLTTDDANQSMIDDPSLNANWARKVHEALKYLLEKADETSEIVNDLNKKH FT KKIDKXEQXEQNGLKYTALNINYDTLXTXVDEIDSSIINKPDNDDLEKFKE FT NVENSIDSKVALKFSELRLEAQPETSPSANVEALINMAIRKHEEEAEVLLT FT NMKXDITTKLTALTNTSQNLLKITKSNETRIKSLEVEHTTYNNALLKCDES FT LQNLKAYIENSNKSFSEEFQTLTSKINKIRSSSHEITLDGNNTTLSLASKS FT PKLKPPTFKADSKEKPMRYLRDLKRYIDVLNVDGAEMNIIISQTLENTASS FT WFDIAQQTINTISDFERKFKARFWNEDIQDEWSRKVEYNRYNXGSKYSHLE FT YATYVWGFAQDLERNFSEAELVRKISNHFDWDIRFTVKSQNINTQDKFFEL FT LAFKDNGHNERQSLFKKKFNSDDSNTESSGNYQTNKATNQQSNFQNKNFXK FT KTPQIQTLNVIEAKTSNPAKPTIKETGTKPKTIISKPTTTKFLLEDSLNDC FT LEDTWGTLTLSIKTNNSCFLCLGK*" FT CDS 5835..6428 FT /product="Gypsy-33-I_NVi_3p" FT /translation="MGRTFKRSAQLTRNSVVYPRKLISGVDVLVQTVPLGQ FT ETSHPGDRVITPRSAEERARLAHTWKSRGVRMPFGRAVQCDSTPPPPPRTG FT ETPAPRAVSVPPAPATHAISVSPAPAPHLHSHRRALRLLXRHHLHQLQQSP FT ASPTSKQSRSTWRNTXDDARSQEARGLRRRGFAGSRRSCVSYSTRQLSPPP FT PPTKQPW*" FT CDS 1892..5353 FT /product="Gypsy-33-I_NVi_2p" FT /translation="KPITVVSCASENNLNNISDIQGTTDKQISVCTIFNNP FT REDLLYESDDDSKNDEICACPEIKIKINNIPIKALVDTGSQITCISENFYL FT INIDKFKNIXTLPIVGTIIRGATGLKSTRLNKQLMXTTQIGTCTKDLIFLI FT IPKLIRDCILGIDAIKELGFLIDTKYDEITIRINNKIEKLSYKSKSLLESN FT MDHRYIEILEEDTTMNVDSHSPLRPVLNKARCQKSLNELTYRQNDRNSLVD FT PYDITDMEINEKIKSCVVKNPETINKLRNLIQKYRKVFYKKPGRLVGFEYK FT LKLKENPEPFFIPPYPVNINMKSKLREQIEVMLNWKIIRRSSSNFISPFVP FT VIKKNDTIRLCLDARKVNEMLEDDLESPQNIEELLQQCHGVEVMSSLDLTS FT SFWQIPLSEESKQYTAFMFEGKLYEFEVVPFGTKISSAALIRGLFHVTKNL FT GDFILNFVDDFLCISKSEKDHLDHLERLFESLIKWNFTLNFKKSEFFKKET FT RFLGFILSTKGIKPQENKIEAIKNYPIPKNIKQLQAFLGTINFYAKFAKNH FT AHELLPLLELLKKDKQKSWGWGDRQQKAFDKVKELFEKDVCLAYPDPSKPY FT ILRCDASDFAIAGALSQIDDNGDERVIIYISRTLKAAEITYFTTEKELLAV FT VWSLQKLSTYLRAAKVYIKTDHKALIFLFKCKFVGNRIRRWILGTQDYNLV FT IDHVSGKENVVADMLSRHPNFNXEEQRKPGEIIIAIFKRXKFTKDLMYKFK FT HLSKLQHDDEYLNQLFTKNSDQKNQSNNFTYLIENDIIYKKNQSNQYKIMV FT PKEMIKILILETHETYGHVGAKKVIRMLQEDFYIKNLRHKAQKLISFCDSC FT QRNKISTRPCTAPMESIIPEAPHDLLSIDFYGPLPTGRGGMKYILVTIDAF FT SKFVVLYPIRRQTCTTAIKKIFNDYVLLYGKPKRISCDHGSQFTANAWIEK FT LREENIDLVFSSIRHPQSNIVERVNKELGRFFRTFTSEKHTGWVGYINIIQ FT TIINETYHETTSFTPIELHLMGIKPKRAWKNWIHPINSEVEPPLEQKIYLA FT RERIISKAQKRAEKYNLQKKXFFVEFKEGDEVLLKSNNISDLDNKKIAKFF FT SIYEGPYEIQKKIGNTTYLLFDRDKQKEKGRFHINDLKEYKREPK*" XX SQ Sequence 6762 BP; 2505 A; 1278 C; 1164 G; 1769 T; 46 other; ttggcgatcc tgccaggata ccgggttttc aaagtggaac ttattacaat aataatcaaa 60 tcctcctatg acacatatat tgcaatttac gcaataactc aaatgcaatt tatgcaatac 120 aaacaactct gatttgcatt attaagcaat ctacacaaac aaacagactt gcaaccattt 180 tacgcagtca caaataattc agacgaacac aaaatggcgg acgcgggcaa gaacatattt 240 tctagcacgc ttagaaagag cggtacagtc actcgaagcg cttccgcgga aatmttaaaa 300 tcacaattaa atgatactct taacgcagtt ctcactacag acgacgcgaa tcaatctatg 360 atcgacgatc cctcgttaaa cgccaactgg gcacgtaaag tccacgaagc gttaaaatat 420 cttttagaaa aagcagacga aacgtcggaa atcgtcaatg atttaaataa aaaacataaa 480 aagatagaca aacragaaca artcgaacag aacggtttga aatacaccgc tttaaacata 540 aactatgata cacttaraac aaragttgat gagatagatt ctagcataat aaacaagcca 600 gataacgatg atttagaaaa atttaaagaa aacgttgaaa attctatcga tagtaaagta 660 gcycttaaat tttccgaact tcgactcgag gctcaaccgg aaacaagccc ttcggcaaac 720 gtagaagcct taataaatat ggctataaga aaacatgaag aagaggcaga agtyttatta 780 acaaatatga aamaagacat aaccacaaaa ttaactgctt tgacaaatac aagtcaaaat 840 ttattaaaaa taactaaatc gaacgaaaca cgtatcaaat cmcttgaggt agaacacaca 900 acatataaca acgcacttct taaatgcgac gaatcgttac aaaatttaaa agcatatatc 960 gaaaattcta acaaatcttt ttcagaagaa tttcaaacct taacaagtaa aatyaacaaa 1020 attcggtctt caagtcacga aattactctt gatggaaata acacaactct gagtctcgcg 1080 tcgaaatcyc ctaarttaaa accgccaacg tttaaagccg acagcaaaga aaaacctatg 1140 cggtatttgc gcgatcttaa aagatatata gacgttttaa atgtagacgg cgctgaaatg 1200 aatataataa ttagccaaac attagaaaac acrgcttctt cgtggttcga cattgcacag 1260 caaacaatta atactatatc ngatttcgag agaaaattta aagcacgatt ttggaacgag 1320 gatattcaag atgagtggtc wcggaaagta gagtacaacm gatataatwc tggttcaaaa 1380 tactctcatc ttgaatatgc tacgtatgtc tggggattcg cgcaagattt agaacgaaac 1440 ttttcagaag ccgaacttgt aagaaaaatt tcaaaccatt ttgactggga catccgcttt 1500 acggttaaat cacaaaatat aaacacacaa gataaatttt ttgaattact tgctttcaaa 1560 gataacggtc acaatgaaag acaatctcta tttaagaaaa aattcaattc ygacgattcg 1620 aacacagaat cctctggaaa ttatcaaaca aataaagcta caaatcaaca atcaaatttt 1680 caaaacaaaa attttscaaa gaagactcct caaatacaaa cgttaaacgt aatcgaggca 1740 aaaacgagca acccggcaaa acctactatt aaagaaacgg gyacaaaacc yaaaacaata 1800 atttcgaaac caactacgac aaaattttta ttagaagatt cgttaaatga ttgtttagaa 1860 gatacttggg gtaccttgac attgtcaata aaaaccaata acagttgttt cttgtgcctc 1920 ggaaaataac ttaaataata tctcagatat acaaggaaca actgataaac aaatttcagt 1980 ttgcacaatt ttcaacaatc ctcgagagga cttattatac gaatcagatg acgacagtaa 2040 aaacgatgag atttgcgctt gcccggaaat taagattaaa attaataata ttccaataaa 2100 agctttagtt gacaccggaa gtcaaattac atgcatttca gaaaatttct atctgataaa 2160 cattgataaa tttaaaaata ttsytacctt rcctatmgta ggtacaataa tymgaggggc 2220 gactggcyta aaatcaactc gtytaaataa acaattaatg tgsactacac aaatcggtac 2280 ttgtacaaaa gacttaattt tcttgatwat tccaaaatta atmcgagatt gtattctcgg 2340 aatcgatgca atcaaagaac ttggttttct tatagataca aaatatgatg aaatcacaat 2400 acgtataaat aataaaattg aaaaattatc ttataaaagc aaaagtttat tagaatcaaa 2460 catggatcat cgctatatag agatactcga ggaggataca acgatgaatg ttgattctca 2520 ctcaccactc cggccagtgt taaataaagc cagatgccaa aaatctttaa atgaattaac 2580 gtatcgacaa aatgatcgta attcattagt tgatccatac gatattactg atatggaaat 2640 caacgaaaaa attaaatcct gtgtagtaaa aaatccagaa acgatcaata aactcagaaa 2700 tttaattcaa aaatatcgta aagtattcta taaaaaacct ggaagattag tcggttttga 2760 atataaatta aaattaaaag aaaatcctga acctttcttc attcctccgt atccagtaaa 2820 tataaatatg aaaagtaagt tacgggagca aattgaagta atgctaaatt ggaaaattat 2880 acgtcggtca agtagtaatt tcataagtcc atttgtaccg gtaataaaaa agaacgatac 2940 aattcgttta tgtttagatg cacgcaaagt aaatgaaatg ttagaggatg atcttgaatc 3000 accccaaaac atagaagaac ttctacaaca atgtcacggt gtagaagtca tgtcgagtct 3060 cgatttaacg tcaagttttt ggcaaatacc tctaagcgag gaaagtaagc agtacactgc 3120 attcatgttc gaaggaaaat tatacgaatt tgaagtagtt cctttcggta caaaaataag 3180 cagtgcagct ttaattcgcg gtttatttca cgttacaaaa aatctcggtg atttcatatt 3240 aaatttcgtg gatgattttc tgtgcatctc aaaaagtgaa aaagatcact tggatcattt 3300 agaaagatta tttgaaagtt taattaaatg gaatttcaca ttaaatttta aaaaatcgga 3360 attctttaaa aaagaaacaa gattcttggg ctttattctc tcgactaagg gaataaaacc 3420 acaagagaat aaaattgaag caataaaaaa ttatccaatt cctaaaaata taaaacagtt 3480 acaagccttt ctaggcacaa taaattttta cgcgaaattc gcaaaaaatc atgcgcatga 3540 attactaccg ttattagagt tattaaaaaa agataaacaa aaatcctggg ggtgggggga 3600 cagacaacaa aaagcattcg ataaagtaaa agagcttttt gaaaaagacg tttgccttgc 3660 atatcctgat ccatccaaac catacatcct ccgttgcgac gcgtctgatt tcgcaattgc 3720 gggtgcacta tcacaaattg aygataacgg agatgaacgg gttattatat atatcagtcg 3780 aacgctcaaa gcagcagaaa taacatattt tactacagaa aaagaactac tcgcggttgt 3840 atggagtttg cagaaattaa gtacctattt acgagccgca aaagtttata taaaaactga 3900 tcacaaagct ttaatattct tattcaaatg taaatttgtc ggcaaccgaa tccgtcggtg 3960 gattctgggc acacaagatt ataatttagt aattgatcac gtatcgggta aagaaaacgt 4020 tgtagccgat atgctaagtc gacatcctaa ctttaatyca gaagaacaac gaaaaccggg 4080 agaaattatt attgcaattt ttaaaagaya taaattcacg aaagatttaa tgtataaatt 4140 taaacatttg tcaaagttac aacacgatga cgaatattta aatcaattat ttacaaaaaa 4200 ttcagatcaa aaaaatcaaa gcaataattt tacttattta attgaaaatg atataattta 4260 caaaaagaat caatcgaatc aatacaaaat aatggttcca aaagaaatga ttaaaatact 4320 tattcttgaa acacacgaga cttatggcca tgtaggcgcg aaaaaagtca ttcgtatgtt 4380 acaagaagat ttctatataa aaaatttaag acataaagct caaaagctta tttcattttg 4440 tgactcatgt cagagaaaca aaatttccac tcgaccatgt acagcgccaa tggagtcgat 4500 aattccagaa gcgccacatg atttattatc cattgatttc tatggtccgt taccgaccgg 4560 gcgcggcggt atgaaatata tacttgtaac tatagacgcg ttttcaaaat tcgttgttct 4620 atatccgatt cgtagacaaa cttgtactac agcgataaag aaaattttta atgattatgt 4680 tttgttatac ggaaaaccga agcgtatttc atgtgatcac ggctcacagt tcactgcaaa 4740 tgcgtggatc gagaaattaa gagaagaaaa tattgattta gttttttcat ctatccgaca 4800 tccacaaagt aacatcgtcg agcgagttaa taaggaactc ggtagatttt ttcgtacatt 4860 tactagtgag aaacataccg gttgggtcgg atatattaat atcatccaaa ccattataaa 4920 tgaaacgtat cacgaaacaa caagttttac tccgatcgaa ttacacttaa tgggtatcaa 4980 acctaaacgt gcgtggaaaa attggataca tccaattaat tcagaagtcg aacctccgct 5040 tgaacaaaaa atttatctcg ctcgggaacg tataatttca aaagcgcaaa aacgtgcwga 5100 aaaatataat ttacaaaara aamcattttt tgttgaattt aaagaaggag acgaagttct 5160 attaaaatca aataatattt cagatttaga caataaaaaa attgcaaaat ttttttcaat 5220 ttacgaaggt ccatatgaaa tacaaaaaaa aattggtaat acaacttatt tattatttga 5280 tcgcgataaa caaaaagaaa aaggaagatt tcatataaat gatctaaaag aatataaaag 5340 agaacctaaa taaatagaat attatttgat tacacaatag aaaaattcgt tactctcact 5400 catsaaayat tgtaaagaaa aaatgatcac gctcaaaaag artcgcgttt ctatttcata 5460 cattaktttt gaacactcag agctgtatat atacatgtaa acaagccagc acggctagtc 5520 ccgactcaag aatgcagagg ggaggtatag tccgtctttt tctctttccc cactctcccg 5580 cggcaacaca actcgtgtcc gcaaagccgc gagttccccc tctcttacta cacgcatagc 5640 gttgccacgt cactttctca gcgaacctaa gattttaggt atacgtgaga agcgggaaaa 5700 ttcaaaaata tgacaacatt gtcatttctc ctctttcgga gctgctccag ctttcyccat 5760 aaaagggctc gcgctaggct gacgccgtca gtttttcgac ggtcgcaatt taagcgcctt 5820 agaaaaatct agccatgggc cgcacattta agcgctcagc gcaacttacg cgcaactccg 5880 tggtttatcc acgcaagctc atctccgggg tggacgtttt ggttcagact gtcccgttag 5940 gacaggagac gagccacccc ggagatcgcg ttatcacgcc aaggtcggcg gaggagcgag 6000 cgcgtctagc gcacacgtgg aagtcgcgcg gtgtacgcat gccattcggc cgcgccgtgc 6060 agtgcgactc cacacctccg ccgcctcctc gcacaggtga aacacctgca ccacgcgcag 6120 taagcgtgcc accggctcct gcaacgcacg caataagcgt atcaccggct cctgcgccgc 6180 acctacactc tcacaggaga gccttacgcc tcctccytcg ccaccatcta catcagctcc 6240 aacaaagccc cgcatcacca acatcgaagc agtcaagatc gacttggmgg aataccascg 6300 acgacgcgcg aagccaagaa gcccgaggtc tgcgcagaag agggttcgcc gggtcgagga 6360 gaagctgcgt gtcttattcg acacgccagc tatcgccacc accaccaccg acaaaacaac 6420 cttggtgaac gggaagagcg aggtcctgga tgcccttcta cgcaagccaa cgcaaaagct 6480 gaacccatac gccacctacg ggtacggcaa cagcgtctgg cagcggcagg tcaggcttga 6540 ctttgaggac acctggaagt cagtgggtaa ggaggaggcg aggctgaaga ggagagtaag 6600 tgccaaggtc accgcaagag gtatatacag ctctgagtgc atgcatttta agcagcactt 6660 tttatctctt tattaaaact tataaatata tttttaatca ttttttctcc gggagtatag 6720 tgaaaatttt tcaatttttg aaaaattttc actaggggac ta 6762 // ID Daphne_DS repbase; DNA; INV; 3141 BP. XX AC . XX DT 17-FEB-2006 (Rel. 11.02, Created) DT 13-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Non-LTR retrotransposon Daphne_DS a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; reverse transcriptase; AP-endonuclease; KW Daphne_DS. XX OS Darwinula stevensoni OC Eukaryota; Metazoa; Arthropoda; Crustacea; Ostracoda; Podocopa; OC Podocopida; Darwinulocopina; Darwinuloidea; Darwinulidae; OC Darwinula. XX RN [1] RP 1-3141 RA Schoen I. . and Arkhipova I.R. .; RT "Two families of non-LTR retrotransposons, Syrinx and Daphne, RT from the Darwinulid ostracod, Darwinula stevensoni."; RL Gene 0, 0-0 - (2006). XX DR [1] (Consensus) XX CC Daphne is a non-LTR retrotransposon from a Darwinulid ostracod, CC Darwinula stevensoni, and belongs to the Daphne clade, which is a CC sister clade to the L2 clade, and includes representatives from CC both CC protostomes and deuterostomes. The consensus sequence is CC assembled CC from sequences of 26 PCR-generated clones, which diverge from the CC consensus by 1.5-6%, and is 5 truncated. Daphne codes for a CC protein CC containing the AP-endonuclease and reverse transcriptase domains. CC The CC 3' UTR of Daphne is 440 bp in length and ends with a (TTA) CC microsatellite sequence. XX FH Key Location/Qualifiers FT CDS 1..2706 FT /product="Daphne_DS_1p" FT /note="AP-endonuclease/reverse transcriptase." FT /translation="PSFQYSLQGYSSEFFCRSSTRGGCVALYVRDDHPAIP FT LPELSQLSSENNIEITAVRILNPNWKHLPDLPLIALSVYRPPNGNFTVFFN FT TLELALSKLSPAHNSVIIMGDFNIDLLKNSKNSKRCLDLFTCFGLFPCFFL FT PTRVSRSTETLLDNMFTNISPCNLYTDVVPNDDSDHRQLCLRIPLEIKTQS FT YPPTNFRRLLIPSAVKKASNFLESINWTSLMSSKSVDDQLDTLMGKIHASQ FT ARSLPTKRITPRKPSKSWLTTGIRISSARKNALFLILYYGPSNLSLLKYYK FT RYRSILHRVIRKTKQMSVISEYEQCSRDKNPRGLWRLTNRILGRDATHTIV FT GLRENDLLLTNGEDIANRMRSHFSSLFSPHTPPISSVPVGSSPQFSFFLQP FT ITELETLTIISHLSSTKATGPDNVSAKFIKKIAQSIALPLTLIFNNSLSTG FT VFPSALKQGKLIAIHKKGEKDIVSNYRPITILPVISKVFEQLVQQRLDSYL FT NCISFISQSQFGFRSNRSTQDALLHFLKEVQSILNRNQAAVGIFYDIAKAF FT DSISHKLLLQKLESLGVRGVANSWFQSYLSNRQAAVHHRDHTGRVHVSEPI FT NITSGVPQGSVLGPLLFLIYINDLPTNAPKIHLTLFADDTTACLPVSRNST FT PTSVSTSCDIAIDSWTTNNGLRLNASKTTRVLFSTLRRTDPPLTFATSTTF FT LGIRFDPYFRWDNQVDHVCRLLKYAAASIYRLSSILSQEDLKKMYYALAFP FT HLSYGLLAWGHCSERHVCRILSLQKRIVRTILHKSIRTTCRPLFPKLKFLT FT FPSLVLFHTSVYFHAIVSKNEAISNAGVHSYDTRSKNDYHRSIDSSALASR FT LIFSWGPHYFNSIPRSIRQLNHTAFRRQLKELFLSHPLYSFSEFFDITW" XX SQ Sequence 3141 BP; 788 A; 926 C; 495 G; 932 T; 0 other; ccatcatttc aatattccct tcaaggatat tcatctgagt ttttctgtcg ttcctcgacc 60 aggggaggat gcgttgccct ctatgtcaga gatgatcacc ctgcaatccc tcttccagaa 120 ttatcccagc tttcctccga aaataacata gagataacag cggtgagaat cctcaatcct 180 aactggaagc atttgcccga tcttccactg atcgctctct cggtctaccg acccccaaac 240 ggcaatttta ccgtattctt caacaccctc gaacttgccc tcagcaaact atctcctgct 300 cataatagtg taattataat gggtgacttc aacattgatc tcttgaaaaa ctcaaaaaac 360 tcaaagcgat gtctggatct attcacatgc tttggcttgt ttccctgctt cttcctacct 420 actcgcgtat cccgatcgac agagactctt ctcgacaata tgttcaccaa catatctccc 480 tgtaacctct atacagacgt tgtccctaat gatgactctg atcatcgcca actttgcctc 540 cgcatccctc tggagatcaa gacccaatcc taccccccta cgaatttcag acgcctctta 600 atcccatctg ccgtcaagaa ggcgtccaac tttcttgaga gtataaactg gacttcactt 660 atgtcctcca agtctgttga cgaccaatta gacaccttaa tggggaaaat ccatgcttcg 720 caggctcgtt ccttacccac caaacgcatt actcctcgaa agccctcaaa atcctggctg 780 acaactggta taagaatctc gagcgctcgt aagaacgccc tgtttcttat cttgtattat 840 ggcccttcaa acctatctct tctaaagtac tacaaacgct atcgatccat actccatcga 900 gttattcgga aaaccaagca aatgtccgtc atttccgaat atgaacagtg ctcgagagac 960 aaaaacccaa gaggcctctg gcgtctgaca aacaggattc taggcaggga tgctacccat 1020 acaattgtgg gacttagaga aaatgacctt cttcttacta acggcgagga tatcgctaac 1080 cgcatgagga gtcatttctc gagtctcttc tcccctcaca cacctccaat atctagtgta 1140 ccagttggct cctcccctca attctctttt ttcctccagc ctatcacaga acttgaaacc 1200 ctcacaatca tctcacatct atcgtcaacc aaggccactg gtcctgacaa tgtcagtgct 1260 aaatttataa aaaagatagc ccaatccatc gctctccccc tcacccttat cttcaacaat 1320 tcattaagta cgggcgtttt tccatcagct ctcaaacaag gcaagctcat cgcgatccat 1380 aaaaaaggag agaaggacat tgtctcaaac tatcgcccaa tcacaattct accggttata 1440 tcaaaggtct ttgaacaatt ggtgcaacaa cgtctcgact cgtatctcaa ctgcatctcc 1500 ttcatctctc agtcacagtt tggatttcgt agcaatcgca gcactcaaga cgcactgctt 1560 catttcttga aggaggtgca atctatccta aacagaaatc aagcagccgt aggcattttc 1620 tacgatatcg ccaaagcatt cgatagcatc tcccacaaat tattgctaca gaagctagaa 1680 tcgcttggcg tccgcggtgt cgcaaactcg tggtttcagt cttacctctc caaccggcaa 1740 gcagcagtac atcacaggga tcatacaggt cgcgttcatg tctcagaacc gatcaacatt 1800 actagcggcg ttccacaggg ctcggtcctt ggtccactcc tctttctaat atatattaac 1860 gacctcccca ccaacgctcc caagattcac ctcacgctct ttgccgacga caccacagcc 1920 tgccttcctg tttcacggaa ctccacgccc acttccgtct cgacatcttg cgacatagcc 1980 atagattctt ggacaaccaa caacggtcta cgcctcaatg catcaaaaac aactcgagtc 2040 cttttttcca cactccgacg tacagatccc ccactcacct tcgccacctc aaccactttt 2100 ctcggtattc gtttcgaccc ttattttcgc tgggacaacc aggtggacca tgtctgtcgc 2160 ctcttgaaat atgctgcagc atctatttac cgtctctctt ctattctctc tcaggaagat 2220 ttaaaaaaaa tgtactacgc tcttgccttt cctcacttga gctatggact tcttgcctgg 2280 ggtcattgct ctgagcgaca tgtctgtcgt atcctgtcgc tccagaaacg catcgtcaga 2340 accatcctcc acaagtcaat ccgtaccaca tgtcgccctc tcttccctaa actcaaattc 2400 cttaccttcc ccagcttggt tctctttcac actagtgtct atttccatgc tattgtctct 2460 aaaaacgaag ccatctcaaa tgcaggcgta cattcctacg acacccgaag taaaaacgac 2520 taccaccgtt caattgattc atccgccctg gctagtagac taatcttctc ttgggggccg 2580 cattacttca attctatccc caggtcaatt cgacagctca atcatacagc cttccgaagg 2640 cagttgaagg aactctttct ctcacacccg ctctactcat tctcagaatt ctttgatatt 2700 acctggtaaa ctcccctcct accccacatc agggttactt acccaccgtc ctcttactcg 2760 agtctcctct tatgtaaagg tgcagtcaca cctccctccc ccccctccct tttttttttt 2820 cgactctcct catttcgtca ttctcttata cctcacccct tttgatttta tacttatgtt 2880 catctgttca gtttcggtgt gtccaattca gttctgtatg ttctgagtgt tctgtccagt 2940 tctgtgtgtc ctgtccagtt ctatttactt tcttacttat tttattttat ctatttctgt 3000 ttatacattt atctattttt ctcttcacct tctgcctctt tatgagagta tgtatatttg 3060 tagtttttca gatatgacga actcaacgga ccaattggtc ttgaaagagt gatggaataa 3120 atattattat tattattatt a 3141 // ID Harbinger-N14_BF repbase; DNA; INV; 332 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N14_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N14_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-332 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-332 RA Kapitonov V. and Jurka J.; RT "Harbinger-N14_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 806-806 (2008). XX DR [2] (Consensus) XX CC This family constitutes ~0.15% of the genome. It is characterized CC by 30-bp TIRs and TWA TSDs (verified by insertions into other CC transposons). XX SQ Sequence 332 BP; 68 A; 94 C; 93 G; 77 T; 0 other; agctaagggc acaacccgcc gtacgtgcaa tttgtccgta cgtttttttg ggaggccacg 60 tccgccacgt atttctgaaa acgttcaggc tgtacagggc aggcgcgtcg cccgtactgg 120 cacgtacggg gggcgaatcc gcagcccgat ggcacacaaa gttttgcctg cacgtaaagt 180 tctacggcgt gtctgcgtgc tccaaaatcc gtcggattcc ctacgagttc gtacggaggc 240 ggtacttacc acttacggat cccaaaagat tgcccacgtt ttggcacgta gacagcacgt 300 atttgcacgt acggcgggtt gtgcccttag ct 332 // ID Kiri-25_AAe repbase; DNA; INV; 3882 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-25_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3882 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 720-720 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >96% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 33..713 FT /product="Kiri-25_AAe_1p" FT /translation="MRTMMQQFTETRELIATVRXEIHDVNIKIDTVKTELQ FT XDIKAVRDECAAKFSQHDTALASLHERVDIVTQKIGALGNRNELIISGIPY FT RTGENLDSMLKAIGRHLQVKETSTLMAESRRMISGRNSDTTGLIVVEFAIR FT ATRDEFYSAYLRKRDLKLRHIGLDSDRRIYINESLTIEARKLKSRALHLKK FT EGRLTSVYTKQGVLYVKPATNESSVVIQSERDLNEYT" FT CDS 888..3740 FT /product="Kiri-25_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLPPHNHTSTNIPGAVLNAALFPDQLNICHGNAQSLC FT ARNSNKLDEVRNLLSNSKIEIACFTESWLSSKNSDRSISIPGYSVVRNDRV FT FKRGGGIVVYYKEHLSCYKIFGTVLTSESLDKTECLALEFRVGGQKFMLMT FT VYNPPGNDCSSFLADKLTDLSVRYDSIFLVGDFNIDLHRPSSKREQLEAIL FT CAYSLTSISSEPTFFHNGGCSQLDLFLTNRTDKVLRFGQVSFPGLSHHDLI FT FASMDFAISRPTGRYTYRDYTNFDSHALENAVLSIPWNRFYQMDDPNEAIE FT FFYDHMKTVHDSCIPLRTGTRRHQHNPWFSDAVRQVLLERDLAYKDWLQAP FT LHVKNAKRQRYKILRNRANTKITQAKQQYLNQFLNINVPSKTLWKRVKNLG FT VGKDTTSKPCEFDPETVNRTFLANYIKSNHRASRPLRTTPPSPYNFSFRSV FT QYWEVVNAIWDIHSNAVGTDDLPIKFIKIALPLIVHHITHLFNVFIRTSTF FT PECWKHAKIIPLKKKAYMNDINNLRPISILCALSKAFEKLLKQQMTSYIED FT NKLLSDCQAGFRRGQSIKSAVLRVHDDLGATVDKKGAGILLLLDFSKAFDT FT ILHSKLLSKLEAQFNFSSPAIRLIESYLRGRKQTVFCGDHHSSSAEVSSGV FT PQGSVLGPLLFCCHINDLPTVLKYCSIQIYADDVQLYVRRYGPSTRELVRM FT INSDLQAVEDWSRRNSLYLNPAKSKAIYVTGNQRRASSFPVSPVVMNGQFI FT EWSESASNLGFIFQSNLQWDALVAQQCGKIYASLRTLYNCTLAAPTATKLK FT LFKSLILPHFLFGDVLHVHPSANSFERLRVALNCCVRYVFGLHRLDPVSHL FT QQNLIGSTLRGFLAHRSCIFLRNLLTTQSPVTLYQRIIQTRGRRLNNLIIP FT ANNTTCYARSLFVRGVVNWNKLPPEIKRSSSGAIFKRGCINFWNRNS" XX SQ Sequence 3882 BP; 1123 A; 847 C; 802 G; 1107 T; 3 other; caatttcaaa ttcgttatcc ctggaatgcg tgatgaggac gatgatgcag cagttcacgg 60 agacccgaga gttgatagcc actgtccgaw gcgaaatcca cgatgtaaac atcaaaattg 120 atactgtcaa gacagagctt caakgcgata ttaaagcagt gagagatgaa tgtgctgcaa 180 aattctcgca acacgatact gcgttggcat cgttacatga acgagtggat attgtgaccc 240 agaaaatcgg tgctctaggt aaccgcaacg aactcatcat aagtggtatt ccatacagaa 300 ccggagagaa tcttgattca atgctcaaag ctatcggtag gcatcttcaa gtcaaagaaa 360 cgtccactct gatggcagaa tccaggcgga tgatttctgg tagaaactcg gataccactg 420 gtcttattgt cgtcgaattt gcgatcaggg ctacacgaga cgaattctat agtgcttatt 480 tgcggaagcg ggatctcaag ctgagacaca ttggactgga ttccgatcgt cgtatctata 540 ttaatgaaag ccttaccatc gaagcgcgca aactgaagtc tagggcacta cacctgaaga 600 aagaaggtcg actgacatcg gtgtacacca aacaaggcgt tctttatgtg aagcccgcaa 660 ccaatgagtc atcagtcgtc attcagtcag agcgagactt gaatgaatat acatagcgat 720 atttttcttt tttttccttt tgttttcccg tcacttttga agttaagttt atataattgt 780 tccaaatttg ttgtattgtg agttgaagat tgtatgattg tgattaatgt tttatgtacc 840 tctgtgataa tgcagccctc tgagaattgc ttcccatatt acacttgatg ttaccaccac 900 ataaccacac ctccacgaac attccaggag cggtattgaa tgcagctcta tttcctgatc 960 agctaaatat ctgccacgga aatgctcaaa gtctttgtgc acgtaactct aacaaacttg 1020 acgaagttcg aaatctctta tccaactcca aaatcgagat agcatgtttc actgaatcat 1080 ggttatcctc taaaaacagc gaccgtagca tcagtattcc ggggtactct gttgttagaa 1140 acgatagagt tttcaagcga ggtggaggga tcgttgttta ctataaagaa cacttatctt 1200 gctacaaaat cttcggcact gttctcactt cggaatcgct ggataaaacc gagtgtttgg 1260 cattggagtt tcgtgtaggc ggtcaaaaat ttatgttgat gacggtttac aaccctccag 1320 gaaatgactg ttcatctttc ttagcagata aactgactga tttgtctgtt cgctatgaca 1380 gcattttcct ggtaggcgac ttcaacatcg atctgcatcg tccgagcagc aaacgtgagc 1440 aattggaagc gattctctgt gcttattcat tgacatctat cagtagtgag ccgacatttt 1500 tccacaatgg cggatgctcc caactcgatc tgtttcttac caaccgcacg gataaggttt 1560 tgaggtttgg ccaggtcagc tttcctggac tgtcgcacca tgatcttatt tttgcatcga 1620 tggattttgc catctctcgt cccaccggcc ggtacaccta ccgtgattac acgaattttg 1680 actcccatgc tttggaaaat gctgttctat ctattccctg gaaccgtttt tatcaaatgg 1740 atgacccaaa cgaagcaatc gaatttttct acgatcatat gaaaacagtt catgattcct 1800 gcattcctct ccgtactggt accagacgcc atcaacataa tccgtggttt tctgatgccg 1860 tacgccaagt gttactcgag cgggacctgg catacaagga ctggttacaa gccccattac 1920 atgtgaagaa tgctaaacga caaagataca agatcctaag aaaccgtgcc aacacgaaaa 1980 ttactcaagc taaacagcag tacttgaacc agttcttgaa catcaatgtc ccgtccaaga 2040 ccctctggaa gcgcgtgaag aaccttgggg taggcaaaga tacaacatct aaaccgtgtg 2100 aatttgaccc ggagactgta aatcgtacct ttttggctaa ctacataaaa agtaatcatc 2160 gggcgtctag acctttaaga acaacaccac cttcaccgta caatttttct ttccgatctg 2220 ttcaatactg ggaagttgtc aacgcaatct gggacataca ctctaatgca gttgggacgg 2280 acgatttgcc gattaagttt attaaaatcg ctctgccatt gattgtccat cacataacgc 2340 atttattcaa cgtattcatc agaacttcta cgtttcctga gtgctggaag cacgcaaaga 2400 ttatcccact gaaaaagaaa gcttacatga atgatatcaa caatctgcga cctattagta 2460 tactttgtgc attatcgaaa gcattcgaga agctgctcaa gcaacaaatg acatcgtaca 2520 tcgaggataa caaattgttg tctgactgtc aggctggatt tcgtagaggc caaagtatta 2580 aaagcgcagt tctgcgagta cacgatgacc taggtgctac cgtggataag aagggtgctg 2640 gaatacttct tcttctggat ttttccaaag ccttcgatac tatccttcat agcaaacttc 2700 ttagcaaact ggaggcacag ttcaactttt cttctcctgc cataagactg atagagtcat 2760 acttgcgagg gagaaagcag acagtttttt gtggcgatca tcattcaagc agtgcagagg 2820 tctcatcggg tgttcctcaa ggatcagtac taggccctct tcttttctgc tgtcacatta 2880 atgacctacc aactgtattg aagtactgtt ctattcagat ttatgcagat gatgttcagc 2940 tttacgtcag acgatatgga ccttccacga gagaactagt taggatgata aattcagatc 3000 ttcaagcggt ggaggattgg tcacgacgaa acagccttta cttgaatcct gcgaagagta 3060 aagcgatata cgtcacaggg aaccaacgga gagcttcaag ttttcctgta tcgccagttg 3120 ttatgaatgg tcagttcatc gaatggtcgg agagtgcaag taacctaggg tttatttttc 3180 aaagcaattt gcaatgggat gccctagttg ctcaacagtg tgggaaaatc tatgctagcc 3240 tgcgtacgct ttataattgt actttggcgg cgccaaccgc aactaaactg aagctgttta 3300 aatcgttgat tcttccacat ttcctattcg gtgatgttct acacgttcat ccaagtgcaa 3360 attcctttga aagactccgt gtggcgctaa actgttgcgt acggtacgtg tttggattac 3420 accgtcttga ccctgtaagt cacttacaac agaacctaat tggttctact ctacgaggtt 3480 tcctggctca tcgatcttgc attttcttgc gaaacctgct aacaacacag tctcccgtta 3540 cactgtatca gaggatcatt caaacccgag gtcgccgttt aaataactta ataattccag 3600 caaacaacac cacatgttat gctagatcat tgttcgttcg aggtgtggta aactggaaca 3660 aattaccacc cgaaataaag cgttcatcgt cgggggcaat tttcaaaaga ggctgcatca 3720 acttttggaa ccgcaatagt tagagttaaa aatagtttat ttatttaaat aagaactgca 3780 atagtataat agttatttct ttgcaaaacc ggaggtagca attttaaaag atttatatct 3840 tacgctgctg gacaataaaa acaaacaaam aaacaaacaa ac 3882 // ID Gypsy-10_RP-I repbase; DNA; INV; 4127 BP. XX AC ACPB02036805; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_RP_; KW Gypsy-10_RP-LTR; Gypsy-10_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4127 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02036805; Positions 105611 109737. XX CC Positions [3154-3669] - Integrase core CC 'TGTCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 29..973 FT /product="Gypsy-10_RP-I_2p" FT /translation="MVGKRQRARRALSSSPNALGETHDAISAAVATPPWSQ FT HAVPAAQVYNPRALTPPFTAEHVETWLEQFENAMNLGAVQDDQKKYAYLTA FT LLPTDVMAQAAGAIRTTTGGKYLAMRSYLITRYGVTSERRLDRLFAEGELG FT DRRPTQLLDDLQRLASGTEVGATVIRRLWLQRLPTNLRTIVVAHSAPLEEL FT AAIADRVLTAQQPSYAAPTNMPPVMAPVDSVPETSNLEEQGATALYGLAQA FT EVADARTYIRQRKNSANASRITKLEVQIKQLSDQLHTLKARETLTCYYHTR FT FGARAQKCKPPCQFRSENLSANR" FT CDS 937..4128 FT /product="Gypsy-10_RP-I_1p" FT /translation="MSIPLGKPFGQSLEAASGLASISSRLYITDRVSGTSF FT LVDTGAEVSAIPPNDNNTTHNTPTTRLIAANGTSIPVFGQKTLRLNFGNDL FT IACWRFYIAGVSRPILGADILRAQGWIVDLHQQCLLSSRPQTHHGPGIIAA FT SIETSTIPPKRPTNAYLSMLDRFPRVVGLLPPEPQPQLKIAHTIQTSGPPA FT FARARRLPPDKLEAAKAEFDRMCKEGTCRPSNSHWASPLHMVRKKSGEWRP FT CGDYRQLNALTIPDRYPLPHIQDFTSSLAGKTIFSTIDLQKAYNQIPVATQ FT DIPKTAIITPFGLFEFPCMTFCLRNAAQTFQRFMHEVTRDLPACYVYIDDI FT LVASTTEDEHKQHLHELLQRLDNYGLTINPQKCHFGLPEVQFLGHLVNDQG FT IRPLPAKCTTISSFPKPRTVEQLRRFLGLFNFYRRFIPSAAHIQAPLLRFL FT KNSRKRDTRPIDWDSDTEAAFNTCKRAIAETTQLAHPEKNPNLELSVDASD FT VAIGAALHQLTAQGRQPLAFLSRRLSDTERRYSAYDRELLAAYAAVRQFRH FT WLEGRQFVIRTDHKPLTYAFRQPPDKASPRQVRQLDFIAQFTTDVRHIKGE FT ANTVADTLSRIDEVSAFEGLNYTQVAQEQAADSQLQSYLQDTPDTPHSTAM FT HLSSIPIGNTGQTLVCDTSTGRLRPYLPFQFRESVIRRIHGLAHIGPKATT FT RTVAQRFVWPKMKTDCREWARACIECQKNKVARHAKAPLVPYVPPEERFAH FT VHMDIVGPLAVSRGYTYCLTCVDRFSRWPEAIPLRSVTAAEVAAAFYQHWI FT ARFGCPATITTDQGTQFTSRLYQELAALTGAQIKHTSAYHPQANGIIERWH FT RTLKSALRSHDSTNWAVHLPTVLLGLRAAVREDTGLSPAEYTYGTTLRLPG FT DFFADQPTIMPQTEFVAQLKQTISQLRPVPAEWHATQRPFVHKALASCSHV FT FLRKEGHRRALTPPYEGPFRVIARDHKTITLEKQGKQKTVPIDRVKPAFLT FT YITERSPQQATQQDQQRPAPQEAGNLQQSRSQTGYRSRAGRLILPPHRYGQ FT PTTEGTG" XX SQ Sequence 4127 BP; 1161 A; 1228 C; 924 G; 814 T; 0 other; ctggtgaccc cgccttggac aaccggaaat ggtgggcaaa agacagagag ccaggagggc 60 gctctctagc tcaccaaacg cactaggtga gacgcatgat gccatctctg ctgccgtagc 120 aacaccacca tggtcacaac acgctgtgcc ggcagcgcag gtctacaatc ctcgcgctct 180 aacaccgcct ttcacggctg aacatgtaga aacgtggctg gagcagtttg aaaatgctat 240 gaacctcgga gcggtgcaag acgatcagaa gaaatatgca tatctaacgg cacttctacc 300 cacggacgta atggctcagg cagcaggcgc tatacgcaca accacagggg gtaagtatct 360 ggccatgcgc agttatttaa ttactagata tggagtaacg agcgaacggc gcctcgatag 420 attattcgcc gaaggcgaat taggtgatcg acgccccacg caactgctgg acgaccttca 480 gcgactagct tctgggacgg aagtgggggc aaccgtcata cgacggttgt ggctccaaag 540 actcccaact aacctacgga ctatagtggt cgcgcattcc gcgcctttag aagagttagc 600 ggccattgct gaccgtgttt taaccgccca acaacccagc tatgccgccc ccacaaacat 660 gccaccagta atggcgccag tcgatagcgt cccagaaaca tcgaacctcg aagagcaagg 720 tgcgaccgca ctgtacgggc ttgctcaagc ggaagtcgca gacgctcgca catacatacg 780 ccaacgaaag aacagtgcta atgcctcaag gataaccaag ctagaagtgc aaataaaaca 840 gctctctgat cagttgcaca cccttaaagc gcgagaaaca ctcacctgct actaccacac 900 tcgttttgga gctcgtgctc aaaaatgtaa acccccatgt caattccgct cggaaaacct 960 ttcggccaat cgctagaggc ggcaagcggt ttggcctcaa tatctagccg cctctatatc 1020 acagatagag tctcaggaac atccttttta gtagatacag gcgcagaagt ttctgcaata 1080 cctccaaacg ataacaatac aactcacaat acccctacta cgcgtttgat agcagctaac 1140 ggcacctcca taccggtctt tgggcaaaaa acactacgcc tcaatttcgg aaatgaccta 1200 atagcatgct ggcgtttcta tatcgcggga gtatctcgcc ccatactcgg agcggatatt 1260 ctgcgggctc agggatggat agttgacctg caccaacaat gcctactttc atcccgaccg 1320 caaacgcacc acgggcctgg tattattgca gccagcattg agacgtccac tataccaccc 1380 aagcggccta cgaacgctta tttatccatg ctagaccgct tccctcgagt cgtcgggcta 1440 ttgccaccag agccacagcc acaactaaaa attgcacaca ccattcaaac ctctggaccg 1500 ccagcattcg caagagcgcg acgccttccc ccagacaaac tggaagccgc caaagcggaa 1560 tttgaccgca tgtgcaagga aggcacctgt cgcccctcta atagccactg ggctagcccc 1620 ttgcacatgg taaggaagaa atcgggagag tggcgcccct gcggcgacta ccgccaacta 1680 aacgcactga ctataccaga ccgataccca ttgccacaca tccaggactt cacaagctct 1740 cttgcgggta aaacgatatt ttctacaata gacttacaga aagcctacaa ccaaattccg 1800 gtagcaaccc aggacatacc caaaaccgct attatcaccc cattcgggct ttttgaattc 1860 ccgtgcatga ccttttgcct ccgcaatgcg gcacaaacat tccaaagatt catgcacgag 1920 gtaacacgag atctccccgc ctgttacgtc tacatcgacg acattttagt tgcttccacg 1980 acagaggacg aacataaaca acacctacac gaactacttc aacgactgga caactatggg 2040 ttgacaataa accctcagaa atgccatttc ggactccctg aggtgcaatt tctagggcat 2100 ttggtaaatg accagggaat acgccccctt ccagccaaat gcaccacaat ttcctcattc 2160 ccaaaacccc gtacagtaga acaacttagg cgatttctgg gattattcaa tttttaccgt 2220 aggttcatcc caagtgcagc acacattcaa gccccacttc tacgtttcct aaagaatagt 2280 cgcaaacgcg atactaggcc aatcgactgg gactctgaca cagaagccgc tttcaacacg 2340 tgcaaacggg cgatagccga aactacccaa ctagcacacc cggaaaagaa ccctaatctc 2400 gagctatctg tagatgcttc agacgtagca atcggcgctg cgctacatca gctcaccgca 2460 caaggcagac aacccttagc cttcctttcg agacgcctta gtgatacaga acgaaggtac 2520 agcgcatatg atagggaact cctggctgca tatgccgcgg ttcgacaatt ccgccattgg 2580 ttagaagggc gccagtttgt aatacgtaca gatcataaac ctttaacgta cgccttccgg 2640 cagccaccgg acaaagcttc ccctaggcaa gtaagacagc ttgatttcat cgcgcagttt 2700 acaaccgacg tgcgccatat caaaggcgag gctaatacag tagcagatac tctctctaga 2760 attgatgagg ttagcgcatt cgaaggacta aattacaccc aggttgctca agaacaagca 2820 gcagactcac aactacagtc atacttgcaa gacacgccgg acaccccaca tagtaccgcc 2880 atgcacctgt catccatccc cattggaaat accggccaaa cactcgtttg cgacacttca 2940 accggacgct tgcgccccta tttaccattc caatttagag agtcagtaat acggcggatt 3000 cacggcctgg cacacatcgg tcccaaggcc acgacacgga ctgtcgcgca gcgctttgtc 3060 tggccgaaaa tgaagacaga ttgcagggaa tgggcccgcg cttgcattga atgccaaaaa 3120 aataaagtcg cgcggcacgc caaagcccct ttggtaccct acgttccacc agaagaacgt 3180 ttcgcgcacg ttcatatgga cattgtaggc cccctagcag tgtcgcgagg ctacacttat 3240 tgcctcacct gcgtggatag gttttcgcga tggccagagg cgatccctct gcgcagcgtc 3300 accgccgcag aagttgccgc agctttttac cagcattgga tcgcaaggtt cggttgccca 3360 gcaaccatca ccacagatca agggacacag tttacgtcgc gactatacca ggagttagca 3420 gcattaacag gagcacaaat taagcacact tcagcatatc atccgcaagc caatggcatt 3480 atagaacgct ggcaccgtac gttaaaatca gcactacgct cacacgactc tacaaattgg 3540 gcagtgcact tgcccactgt gttgctgggt ctccgagccg ctgtcagaga ggacactgga 3600 ctatcaccgg cggagtacac ttacggcacc acgctgcgac tgccaggaga ttttttcgca 3660 gaccaaccaa ccataatgcc acaaacggaa ttcgtcgcac aacttaagca gacaataagt 3720 cagttgcgcc cagtaccagc tgagtggcac gcaacacaac ggccatttgt acataaggct 3780 ttggcctcat gctcgcacgt ctttctacgc aaagaaggcc accgccgtgc acttacacca 3840 ccatatgaag gcccatttcg agtgattgca agagaccaca agaccatcac tctggaaaaa 3900 cagggaaagc aaaaaacagt gcctatagac cgcgtcaagc cggcgtttct gacctacatc 3960 acagagcgaa gcccacaaca agccacacaa caggaccagc agcgacccgc gccacaagaa 4020 gcaggaaacc tacagcaatc gcgcagccaa acaggatatc ggtcgcgggc tggccgtctc 4080 atcttgcccc cgcacagata cggccaaccc actacggagg ggactgg 4127 // ID hAT-4N1_BF repbase; DNA; INV; 561 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-4N1_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-4N1_BF; hAT-4_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-561 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-561 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 924-924 (2008). XX DR [2] (Consensus) XX CC It shares ~100-bp termini with the autonomous hAT-4_BF. XX SQ Sequence 561 BP; 139 A; 147 C; 132 G; 143 T; 0 other; catgcccgta gccagggggg gttcgggggg ttcggaagaa cccccccaca cgctcaaaag 60 gtccgctttt tcaggctcga aggtccactt tttcatgtcc aagatttttt tttcaaaggc 120 atgactgatt tgcatttttc actgatagac atttcattca atcttagtga gtctatcagc 180 aaaatttgcc ccagaaagtg caggaaatgg cttttcagag ggtccagatt tcaaaatttc 240 cccggacctc cattgcgacg cctcgcgcct ttggtgtcgg acatggtgaa aatatgtcga 300 gggagttggc ctcaaagact tatgccaaaa gccttgtcta ttttaataga aattccatta 360 aacttagcca gcgaattttg cccgtcaaaa tgcaggaaat ggcgtttcag agggtcaaaa 420 tttcaatttt tccgggggag catgcccccg gaccccccta ggcagctcac gccttcggcg 480 tgagtctcgc gcctgcggcg ctcgatggtc ccacaagtac taaaaagaac ccccccaacc 540 aaaatcctgg ctacgggcat g 561 // ID Sola2-1_Hrobusta repbase; DNA; INV; 1794 BP. XX AC . XX DT 10-MAY-2011 (Rel. 16.05, Created) DT 10-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE Sola2-type DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola2-1_Hrobusta. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-1794 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC Incomplete. XX FH Key Location/Qualifiers FT CDS join(113..823,1269..1793) FT /product="Sola2-1_Hrobusta_1p" FT /translation="SLLKESLQHSEAVLHIDFSENYGFKNASAIQTCHFGA FT SNQQATIHTGVLYKYNGLISFASISESLRKDPSAIWAHIEPILQNLRESYP FT GITTLHFFSDGPTTQYRNKQNFYLLSTQIYKLGFTDASWNFFEAGHGKGAP FT DAIGGALKRRADDKVNMGCDITTAQSLFKVMSESDSQIKLYYIEGSDIGRI FT TSYCHQSLTAIAGTMKVHQLYTDTELCISSRHSDKEISSYYKLGEIKFQIG FT MNPDASQFQIGMNPDASQFQIGMNLNTSQPSIGMNSDASQFQIGMNPDASQ FT FQTGMNPNASQFQIRMNLDASQFQIGMNSDTSQLSIGMNSDTSQPSIGMNP FT DAIQFSIGMNPETFYNKIRKSNISCASNAGGDIKTGDYVLVELKSLKNKCH FT YIARIDNTTSDDGELEIT" XX SQ Sequence 1794 BP; 632 A; 275 C; 301 G; 586 T; 0 other; ggtaacagaa aaagttttaa agaaagggac tgaatgaatt gaagacatta ttttccaatc 60 agattcgtga tgaactgtgc cgtcatgtct ttataataag gcaccaattt agagcctact 120 gaaagaaagt ttacagcatt ctgaagcagt attgcatata gacttctcag aaaattatgg 180 ttttaaaaat gcatctgcaa tacagacgtg ccattttgga gcaagtaatc aacaagcaac 240 tatacataca ggtgtgttgt acaaatacaa tggcctaatt tcatttgcca gtatttctga 300 atccttgcgt aaggatccat cagcaatttg ggctcatatt gaacctatac ttcagaattt 360 acgtgaaagt tatcctggca ttaccacatt gcatttcttt tctgatgggc caacaacgca 420 gtaccggaac aaacaaaatt tctaccttct ttcgactcaa atctacaagc tcggattcac 480 agatgcaagt tggaattttt ttgaagcagg gcatggcaaa ggagctccag atgccattgg 540 aggtgcctta aaacgccgag ctgatgataa ggtgaacatg ggttgtgata ttacaacagc 600 acaatctctt ttcaaggtga tgagtgaatc tgattcacag ataaaactgt actatattga 660 agggtcagat attggtagaa taacgagtta ttgtcatcag tcattaacag caatcgctgg 720 tacgatgaaa gttcatcagc tgtacaccga tacagagctg tgtatatcat cacgacattc 780 agacaaagaa atttccagct attataaact cggtgagatt aaataatatc atttattttt 840 ttataatttt atttaattta atcatgaaca ttttataaat ttaattaatt tgatttggaa 900 gttattgcat cataatttca atccaataat ttacttttta aagaaattgg ttgtcagata 960 ctgaatttaa taaaagtttg ctaattagta atttaaagtt gaggttaaga taatttaaaa 1020 tagttttcac aataataaaa taaaaaagtc gacttgcttt tatctttcat tttttttttg 1080 attgaatttt attatctatt gctagtttat tttgttaatt taatttctta ttttagccat 1140 cagtcgaatt gaatccagat gctaatcaat cttcaattga aataaatcca gatgctagta 1200 aatcttcata cggaatgaat ccagatgtta gtcaatttca aatcggaatg aatccagatg 1260 ctagttaatt tcaaattgga atgaatccag atgctagtca atttcaaatc ggaatgaatc 1320 cagatgctag tcaatttcaa attggaatga atttaaatac tagtcaacct tcaattggaa 1380 tgaattcaga tgctagtcaa tttcaaattg gaatgaatcc agatgctagt caatttcaaa 1440 ccggaatgaa tccaaatgct agtcaatttc aaatcagaat gaatctagat gctagtcaat 1500 ttcaaatcgg aatgaattca gatactagtc aactttcaat cggaatgaat tcagatacta 1560 gtcaaccttc aatcggaatg aatccagatg ctattcaatt ttcaatcgga atgaatccag 1620 aaacatttta taacaaaata agaaaatcaa atatcagttg tgcttctaat gccggtggag 1680 atataaaaac tggcgattat gtcttggttg aattgaaaag tttgaaaaac aaatgtcatt 1740 atattgctcg aattgataat acaacttctg atgatggaga attagaaata acat 1794 // ID Gypsy-115_AA-I repbase; DNA; INV; 5442 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-115_AA_; KW Gypsy-115_AA-LTR; Gypsy-115_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5442 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 473267 478708. XX CC 'GGAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1088..2671 FT /product="Gypsy-115_AA-I_1p" FT /translation="MPPFKIGADPRNDWIKWKRALERFLNANKVEQDEEKF FT NLLLVLGGIDLQTYYDKVAKWEVQKTVDENEEVLEEVIVLKYESAILSLDE FT YFAPQLKKRFERHILRSMRQNNQEPFEEFVYRLREQANRCIFSDVDDMIVD FT QVIEGCSSTELRKRLLTTEISLAEVISLGKTLEEVQKQTREYERPSTSHGE FT SDLIQKVVGKVSFSARSADNNRKCYNCNRPGHLARDIEKCPAKNVECYGCH FT TKGHFKICCRKRKHDEHANRRQVGAKRIHAIVENSNETDKGVFFVKSEEMT FT EVLEMDVGGVLIKMIIDSGSPANIIDAKTYKRLKDQGAQIMNEREPQTNEK FT TFKAFASDRDIFFNAVFETEIKIPGDETGIWSHVMVAPHGQVSLLSKGTAF FT ALGILKIGYHVNSISARDRSNVEVQGHEEFPKIPDVKLPIQVDDSVPPVVQ FT AVRRFPMSMEADVEKTIQDLLDKNIIERAEGPITWVSPLVPVRKSDGRIRL FT CVDMRAANKAVKRENYPMPNIDDAMSGIKKVC" FT CDS 2856..4205 FT /product="Gypsy-115_AA-I_2p" FT /translation="MFGIKSAPELFQREMENLFRGIKGLIVYMDDFLIYGE FT TVEEHDETLREVLRRIKKFNMKINKEKSVFGVPSVEFLGHIVSVEGIRPTD FT GKVKAILDLQPPSSVSELRSLLGLINFVGKFIPNLSAHTFNMRSLLNKGTL FT FIWKDKHSQELDAVKGMIASAGCLGFFDPNDETVLVTDASPYGLGAILIQT FT KDGVSRTISCISKSLAEFEQKYCQTEKECYAIIWAMEKLYVYLYGKHFTLV FT TDCKPLEYLFNRVKSKPSARIERWILRLQSFDFTVKYEPGEDNLADSLSRL FT SQGVKDYSTRSDVISWLADEIKPSVLSIEELEKATLADNDLQKVKEAIYSG FT IWDEVPNEFKTATVKDDLTLYGELILRGDRIVIPTALREKIVNLAHLGHQG FT GISMKALLRSKVWFPFMDKLVDTIIRNCKPCKMTALPDKPNPMTTNYAVAR FT SGHRF" XX SQ Sequence 5442 BP; 1724 A; 980 C; 1328 G; 1410 T; 0 other; atggcgacga cggtaaaaat aggtaagttt cgatcttctg agttgcaaaa ttggtgaaaa 60 acgggagcga tactatggat aaaatgtgaa attaccaaaa atgtccgaat tggaattttt 120 ctaccctgaa tgcttacggc gtggttagag cggaaaaaaa acaagatggc gaacaaagga 180 aaagaaatca acagtgaaaa ttaaaaggag aaaatggcgg cagtgagata atcagcattt 240 tgaacatcta tctggatttt atggaagaaa aaaatatatc acttcgtgtg tgttaaacag 300 cagaaaggca gagtggagaa ttcatcattt gttttttggg tgaaagattg tctccgtgac 360 tagcggtaca aaggaacgct gcgtgcgaag aaaagtgtac ctcgttggaa aggccatttt 420 gtgccaagct tcgtgctggt ggacgcttcc cgaccaacca gtcttcataa ttctttatag 480 atctttgttc gacgatggca atcgctaacg gttcaaaacg aatcgatatc gatcgctatc 540 gaaaatcact gactggttga ccgggttagt gcgaagtaat gcttagtccg aagtaacgct 600 tagtgcgaag gaacgcttag tgcgaaggaa cgcttagtgc gaaggaacgc ttagtgcgaa 660 gaaacgctta gtgcgaagaa agctcagtac ggaagaacgc ttagtgcgaa ggaaacttac 720 tgtgaatatt ttggggaaat cggcagaaaa tgctacgtgc tcgtgtacat tgtgtcgaac 780 tgactgagtt tattgcattt gttgatcgat ggacacaaat tctggagtga tgatgatcgg 840 agtggacatg ttattgtgga tcggagtgga aactcaattt ttggattaca gtggacgtag 900 caaggtgatg tttataaacc ataaataaaa gaaatttgaa agacgaaagt ttattgcaat 960 ggatgaaaac ataagatgct taaaaaaaag tcaagttatt ggaaaatatt cgttgtgatt 1020 tatcaagtgg aaagaaacta agggaggcat attttttgtt ttgtttttgg ttgtagatgc 1080 gtcggcgatg ccacctttca agataggtgc agaccccaga aacgactgga tcaagtggaa 1140 gagagctctg gaacggtttt taaatgcaaa caaagtggaa caggacgaag agaagttcaa 1200 ccttttgtta gttctgggag gcattgatct tcagacatat tacgacaagg ttgcaaaatg 1260 ggaggttcag aaaacggtcg atgaaaatga agaagttttg gaagaagtga tcgtattgaa 1320 atacgaatca gcgattctct ccctcgatga atactttgcc ccacagttga agaaaagatt 1380 cgaacgacat attttgaggt caatgcggca aaacaatcaa gaaccctttg aagagtttgt 1440 gtatcgattg cgggaacaag ctaaccgttg cattttttcc gacgtggacg atatgatagt 1500 cgatcaagtt atcgagggtt gtagttcgac tgaactccgg aaaaggcttt tgacgaccga 1560 aatatcattg gctgaagtga tatcactggg taagactttg gaagaagttc aaaaacaaac 1620 gagagagtat gagagaccgt ccacgtctca cggagaaagt gacctaatcc agaaagtcgt 1680 tggcaaagta tcattcagtg caaggtcggc agacaataac aggaagtgct ataattgcaa 1740 cagaccaggt catttggcaa gagatatcga gaagtgccca gcaaagaatg tggaatgtta 1800 cggttgccac acaaaaggcc actttaaaat atgttgccgg aaacgaaaac acgacgaaca 1860 tgcgaacaga cgacaggtag gagccaaacg gattcacgcg atcgttgaga acagtaacga 1920 aacggacaag ggagtgtttt tcgtgaagtc ggaggagatg actgaagtac ttgagatgga 1980 tgtcggtgga gttctcataa agatgatcat cgattcgggg tctccagcga acatcatcga 2040 tgccaagacc tacaaacgtc taaaggatca gggagcgcaa attatgaacg aacgggagcc 2100 tcaaaccaat gagaagacct tcaaagcttt tgcatccgat cgtgatatct tcttcaacgc 2160 agtattcgaa acggagatca aaattccggg tgacgagaca ggtatttggt ctcatgttat 2220 ggttgctcca catggccaag tcagcttgtt aagcaaagga actgcttttg cattggggat 2280 tttgaaaatc ggctatcatg tgaacagtat ttctgctaga gatcgttcaa atgtggaggt 2340 tcagggtcat gaagaattcc cgaaaatacc ggatgtaaag ctgcccatcc aagttgatga 2400 ttcggtccca ccagttgtgc aagcagttcg acgttttccc atgtcaatgg aggcggatgt 2460 ggagaaaaca attcaagatt tgctggacaa aaacatcatc gaacgagcgg aaggaccgat 2520 aacttgggtt tcgccactgg ttcccgttag aaaatcggat ggtagaatca gactgtgcgt 2580 agacatgcgg gctgccaaca aagcggtgaa acgtgaaaac tacccgatgc ctaatatcga 2640 cgatgcaatg tccggaatta aaaaggtatg ctaataaata agaatgttgt agttcgtaga 2700 aaacatacgt catggtcttt ttttatttca attaggttgc taaattgtcc aagatcgatc 2760 tcgaggccgc ttattatcac tttgagcttg atagcaacag taggcatatc actacatttg 2820 tggccaggag cggggtatac agattctgca ggttgatgtt cgggatcaaa tccgccccag 2880 aactattcca acgagaaatg gaaaacttgt tcaggggtat caaaggactg atagtttaca 2940 tggatgattt cctaatttat ggggagacag tcgaagaaca cgacgaaact ttgcgtgaag 3000 ttcttcgccg aattaagaaa ttcaacatga agattaacaa agagaaatcg gtctttggag 3060 tacctagcgt ggagtttctg ggtcatattg tgtccgttga aggtattcgt ccgacagacg 3120 gaaaagttaa ggctattttg gatttgcaac caccgtcctc cgtttccgaa ttgagatctt 3180 tgttaggtct cataaatttc gttgggaagt ttattcctaa tttgtcggcc cacactttca 3240 atatgagatc ccttttgaat aagggaacct tgttcatttg gaaggataaa cacagtcaag 3300 agttggatgc ggtgaagggc atgattgcaa gcgcaggttg cctcggtttt tttgacccta 3360 atgacgaaac ggtgttagtg acagacgcca gcccatacgg tttgggggca atactgattc 3420 agactaaaga cggtgtatcg cgaacgattt cctgcatatc gaagagtttg gctgagtttg 3480 agcaaaagta ttgtcaaaca gaaaaagaat gctatgccat catatgggct atggaaaaac 3540 tttacgtata cctatacggt aagcatttca cgctcgtcac agattgtaaa cctctggaat 3600 atctattcaa ccgggtaaag tcgaaaccat ccgcgcgtat agagaggtgg attcttcggc 3660 tgcagagctt cgattttact gtaaaatacg aaccagggga agacaatttg gctgattcac 3720 tctctcggtt gtcccaaggc gtgaaagatt acagtactcg ctcggatgtc attagttggt 3780 tagcagatga gattaaaccg tcggttttgt caattgagga gctggaaaaa gcaaccctgg 3840 ccgataatga tctacagaaa gtcaaagaag cgatatactc ggggatctgg gatgaagttc 3900 cgaatgagtt caaaacagca acagtcaagg atgatttaac actgtacggg gaattgatat 3960 taagaggaga caggattgtc attcccactg cgttgagaga aaagatagtc aacctggcac 4020 acctaggaca tcaaggagga atatctatga aagcactgct tagatccaag gtttggttcc 4080 cattcatgga taaactggtt gatacaatta ttcgcaactg taagccatgt aagatgactg 4140 cgctaccgga taagcctaat cccatgacta ccaactatgc cgtggcaaga tctggccatc 4200 gattttaaag aaggactccc gggggagatg tcactattgg ttgttgtttg ctacacatgc 4260 aggtttgtac aagtagaacc aatgaaaccg gcaacgacac aaagggttat cggaactctc 4320 ctgaggatgt ttagtgcttt tggcattccc cggtcaatta catcagataa cgggccacag 4380 tttagagcga ttgagtttcg gaacttttgt ttaagctacg gcatacacct caatctttct 4440 accccatatt ggccggaaca aaatggggca gtagaaagac agatgcggaa tattgggaaa 4500 cgaatcaaaa tcagcattat tcagggaaca gattggaagg ctgatcttta tgaatatttg 4560 acgctgtacc attcaacacc tcaagaggct acaggagtgt ctccaggtca aatgatgttt 4620 ggtcgtgaga tacggaaccg aatcccgtcc attcaccaac caccaaattt gaactgcaca 4680 gttaaaaata atgaagatta cacctcatgt aaacttcatt tatgcgatgt aaacatgacg 4740 tcatgtaaat tttagtctga aatcatgtaa aaatgtgtta tatgtcatgt aaccacacag 4800 aatcctgatg ggttacatga catataattg aagtttacat gacatattat gtaaatatcc 4860 atgattttcc acgctccaat tatgtgcatc atatgtctca gaaatttaca cgtttcgttc 4920 gaactgtgtg ggaatcttcc agagatagag acatgcatca gaaggagcag cataaactac 4980 gagcagatac atcccgacaa gccaaggcgc ataatttggg gagaggagat gtagtgttga 5040 tgaagaactt caagtctgga tcgtatgaac caaacttcgg ggatgaggaa tttgaaacaa 5100 ttgacgtcaa agggaacgaa atcactgtac gctcaaatcg aacaggtaag gtgtatcgac 5160 gcaacagctc ccacctcaag aaattgttga cagcaggctc agcacttttc gatgagaaca 5220 ggtcttctaa tacacatctt agtattacgg atcgctcggt gacctccact gaagatcagg 5280 ggaaagataa cagttcctca aaacgaatta gcagacgacc ggttaggttt aacgattttg 5340 taacttagtt aatgtaacag tttgagagga tatcaataaa atgtttgtaa cacttgtaat 5400 atggatctta agatatatta tgaatccaaa atatgggagg ag 5442 // ID BEL-23_CQ-I repbase; DNA; INV; 6708 BP. XX AC AAWU01010308; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-23_CQ_; KW BEL-23_CQ-LTR; BEL-23_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6708 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 199-199 (2011). XX DR GenBank; AAWU01010308; Positions 22655 15948. XX CC Positions [4816-5424] - Integrase core CC 'ATGTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 445..5757 FT /product="BEL-23_CQ-I_1p" FT /translation="MASLRDLFKFERGLLDSLANLEDFVENFDAKRDAARV FT ESRLERLETVFKDFMENRARLESKHEQDLSSASEPKDPEAKEKKQQARETN FT KVTRLDFENRYLDLKDFLVSKRVKPTAEASSLAPPPCQPLSSRFQLPSFKI FT PSFDGDLTKWMSFRDAFKSVVDKDPSLSGVDKLNYLRSLVQFGASTIVEGV FT DVKETNFDVAWDLLTLRYENNKLISRALMDDLLNTEPIKRESYEALITLID FT SFERNLMQLKKLGLKTDDWSPILAHLLYSRLDTETQRHWENHHKSREVPDY FT KEMLKFLQDHLTTLQPLVNKKPRVQEHRQEASKPTPKTKFGSTLTTTTSSQ FT NKSCPFCQKPTHSPFKCESFLKLNPFQRYELAKKIGLCLNCLNPFHLARAC FT PSSACRVCGQRHHTMLHRRSTNTPNQELTLPHSTTNAQFQAQPSRASNSQS FT QSQTSAPPMLPSTSDQPSLSLAACTPNNANVVFLSTAVVKIADSHGNTYLA FT RALLDCCSERNLMSERLAQKLVLRRQNDPLSLQGVGSSQAVSKQSTLAKIL FT SRWTDYSIDLKFHILPEFRLIVPSDCVQTQTWTHPIPSSLFLADPRFFEPN FT QIDLIIGAEVHYRLLLNGFIDLGPNLPILKETVFGWIVSGKCSTSSRPSSI FT TMVCSNADLEKQVARFWELESCHSESVLSVEEKQCEEYFAKTTFRDSTGRF FT VVRLPKKEAVLEKLGESRAIAERRFHYIERRLQNKPTLGKSYAAFIREYVD FT LGHMSIVDPRNDALPAQAKPYYMPHHCIERPESTTTKLRVVFDASCATDTG FT ITNDALAVGPVVQDDLFSLILRFRLEPFVLVADIEKMYRQVLVHPDDRHLQ FT RILYRESPTEPLQTYELRTVTYGTASAPFLATRCLKQLALEEEHRFPEAAN FT VLARDFYMDDLLCGVETAREGIELSRQLIELLGSAGFKLCKWASNSSTILQ FT NIPAELRENRSEFELDSSSAPIKTLGLLWQPADDVFLFKVPIWDVRAPITK FT RLVVSDTARLFDPQGLVGPVILRSKTFVQQLWKAKVAWDRILGEKQQQYWK FT EFRTDLDALNTFTVPRWAAPGSQPVETQLHIFCDASELGYGACAYLRTESG FT NGTVSVNLLTAKSRVVSLSKQINLPRLELCGALLAAHLYEKIAPNLPPNIR FT VFFWTDSMLVLHWLAASPSRWKKFVANRVSEIQEVTAGHVWLHVPGVDNPA FT DVISRGMTATQLIDCMLWWQGPSWLRQSSRFWPCIVRTSDDLFDKEQLEER FT PVVALPAVAETSIFGVTSSLSELIRKVAYYRRFSYNARGCNQSNRRVGPLS FT TAELNESLLCLAKLAQQESFPKDLHAICTTGQVESTSKLKSLSPILVEGVL FT RTRGRLQHADVPYDRKHPIILDNKHPFTLMIMRYNHLKYLHAGPQLLIAAV FT RAKFWPLRARDLARKVVHECVDCFRCKPRTSDQIMGDLPAVRVLPILPFLN FT SGVDFCGPFYLRPPHRKAAPQKCFVAVFVCMATKAVHLELVGDLSADSFIA FT ALKRFAARRGVPKTLFCDNGTNFVGARRTLNEFLRLLKAQQVQDQVVRSCS FT ADGIQFQFIPPRTPHFGGIWEAAVKSLKNHLRRTVGNANLTAEQMSTFLAQ FT AEACLNSRPLTPISNNPDDLDGHFLVHRPLIAIPEPSYEEIATNRLSQWQM FT IQEYLRRLWKRWSTEYLSGLQQRTRWTRERDNIRIGTMVLVKEDNLPPQKW FT RIGRVIEIFPGNDGHVRVVKLRTPDGVYTRAITRLCVLPIDDNQQPAQDSQ FT " XX SQ Sequence 6708 BP; 1566 A; 1867 C; 1661 G; 1614 T; 0 other; tttttatggt ccttcgagga ccggattgta acaaccgttt tcgccgaaaa aattgatctt 60 gctggtgacg ctatcaccga acacgttgct gattgattgc tgtggccctc ttcggcgtcg 120 atctccagca tcggcgtgac ctgttcctgc ctcgtcgtgg gacaccgacg gcctgcagca 180 tcttcgagac gcccgtggag cacagcgaag gagcatcaac gaacgagatc aagccatcct 240 ccaaaacgcg cgctcggcgg cgtcgctcgg gtatagcgta caataggctg gtagcgtgag 300 tacacccaga agggaagtgc acttttttac tacaaaaatt aaaagttgca tgcgaaattc 360 gataaatttg cgattttggg gaaaatttgg gaaaattgga ctttggattt tccaaattaa 420 acgtgggatt gcgacatcta caaaatggcg tcgcttcgtg accttttcaa atttgagcgt 480 ggccttttgg actcactcgc caatctggaa gattttgtgg aaaatttcga tgcgaaacga 540 gacgcggcca gggttgaatc tcgtctggag cgactggaaa cggttttcaa ggattttatg 600 gaaaatcgtg ccagacttga gtcgaagcat gaacaagatt tgagttcggc cagcgaaccc 660 aaagaccccg aagcaaagga gaagaagcag caagcacgag aaacaaacaa ggttaccaga 720 ttggatttcg aaaatcgtta tttggacctc aaggactttc tggtctcgaa gcgtgttaaa 780 ccgactgctg aagcttcttc ccttgcccct ccgccctgcc aacccctgtc ctctcgattt 840 caactcccta gtttcaaaat cccatccttt gacggtgatc taactaaatg gatgagcttt 900 cgtgacgcgt tcaaaagtgt ggtcgacaag gacccgtcgt tgtcaggtgt ggataagctg 960 aattaccttc gctcgttggt tcagtttggt gccagtacaa ttgtggaagg agtggatgtg 1020 aaggagacca actttgatgt cgcgtgggac ctgctgacac tgcggtacga gaacaacaag 1080 ctcatctccc gcgcgctgat ggacgacctg ttgaacaccg aaccgatcaa acgagaatcc 1140 tacgaagcac taatcaccct tatcgactcg tttgaacgca acctgatgca gctgaagaag 1200 ctcggactca agacggacga ctggtcacct atacttgcac acttgctgta tagtcgtttg 1260 gatacggaga ctcaacgcca ctgggagaac caccacaagt cgcgagaagt acccgattac 1320 aaggagatgc tcaagtttct tcaagatcac ctgaccacgc ttcagccact ggtcaacaaa 1380 aagccacgtg tgcaggagca tcgccaggaa gctagcaaac caacccccaa gacgaagttc 1440 ggctctacgc tcacgacaac aacaagctcc cagaacaaaa gttgtccatt ttgtcaaaaa 1500 cccacccact ctcctttcaa atgcgaatcc ttcctcaagt tgaatccctt ccagaggtat 1560 gagttagcca agaaaattgg gctttgtttg aattgcctca atcccttcca cttggcacgg 1620 gcttgtccta gctcagcatg tcgagtttgt ggtcagcgac accacaccat gctccaccgt 1680 cggtctacta acacaccaaa ccaggagctc acgttgccgc attccactac aaatgcccag 1740 ttccaggccc agcctagtag agcctcaaat tcccaatcgc aatcgcaaac ctccgctccg 1800 cctatgctac cctctacctc cgatcaaccc tcgctgtcgc tagctgcttg tacgccgaac 1860 aatgccaacg ttgttttcct ctcgacggcc gtggtcaaga tcgcagactc tcatgggaac 1920 acttacctcg ctcgcgcgct cctggattgc tgttcggagc ggaatctcat gagtgaacgt 1980 ctggcgcaga agctggttct tcgccggcag aatgacccgc tgtcgctaca aggtgttggt 2040 tccagtcaag cagtttcgaa gcagtctacg ttggccaaga ttctctctcg gtggacagac 2100 tactccattg atctgaagtt ccacattttg ccggagttca ggctcattgt tccgtcggac 2160 tgtgtgcaga cccagacctg gacgcaccca ataccatcgt cgttgttcct cgccgatcct 2220 cgctttttcg aaccaaacca aatagatttg atcatcggcg ctgaagttca ctaccgtctc 2280 ttgctgaatg gtttcattga tcttggaccg aacctaccca ttctcaagga gaccgtcttt 2340 ggttggatcg tgtctggcaa gtgcagcacc tcgtccagac catcgtcgat cactatggtc 2400 tgtagcaacg ccgatctgga gaaacaagtc gctcgttttt gggagttaga gtcgtgccac 2460 tcagagagcg ttctttcggt ggaagaaaag cagtgtgagg agtacttcgc aaaaacaaca 2520 tttcgggatt ctactggtcg ctttgtcgta cgtctgccca agaaggaagc tgtattggag 2580 aaattgggcg agtcacgtgc gatagcagag cgtagattcc actacatcga gcgtcgactg 2640 caaaacaaac ccacacttgg aaagtcgtac gcagcattca tcagggaata cgtagatctt 2700 gggcacatgt cgatagttga tcctcgtaac gatgctcttc cagcccaagc taaaccgtac 2760 tacatgccac accactgtat cgaacgaccg gagagcacca ctaccaaact acgcgtcgtc 2820 ttcgatgctt cgtgtgcgac tgacactggt ataacgaacg atgcgctcgc agttggacct 2880 gtcgtacaag acgacttgtt cagcctgatc ttacgctttc gcctcgagcc gttcgtgttg 2940 gtagcagaca tcgaaaagat gtacagacag gtgttagtac atcccgacga tcggcatctt 3000 cagcgtattt tgtatcgaga aagtcctact gaaccactac agacgtacga gctccgaact 3060 gtaacgtacg gcaccgcgtc cgcgcccttt ctcgccaccc gctgtttgaa gcaactcgct 3120 ctggaagaag agcacagatt ccctgaggcg gcaaatgtgt tagcaaggga cttttacatg 3180 gatgatctac tttgcggggt ggaaacggca cgagaaggga tcgaactcag ccgtcagctt 3240 attgagcttc tcgggtcggc cggctttaag ttgtgtaagt gggcttccaa cagctccaca 3300 attctccaaa acatcccggc tgaacttcgc gaaaaccgca gtgagttcga actagattcg 3360 tcgtcggcgc cgatcaagac cttgggactg ctgtggcaac cagcggacga cgtgttcctg 3420 ttcaaggttc caatttggga tgttcgagcg ccaatcacca aacgtctggt cgtatcagat 3480 accgctcggt tgttcgaccc gcaaggctta gttggaccgg ttattttgcg ctctaaaact 3540 ttcgtgcaac agctgtggaa ggccaaggtc gcctgggacc gtattctggg cgaaaagcag 3600 cagcagtact ggaaggagtt tcgcacggat ttggatgctc tcaacacgtt tacggttcca 3660 cggtgggcgg cgcctggctc gcaacctgtc gaaacgcaac tgcacatctt ctgtgatgca 3720 tcggaacttg gctacggagc gtgtgcgtac ctgcgaaccg agtcgggcaa tggaacagtc 3780 tcggtcaacc tactaactgc caagtcaaga gtcgtttccc tcagcaagca aattaatctc 3840 cctcgcctcg aactgtgcgg cgcactactc gctgctcatt tgtacgaaaa gatcgcaccc 3900 aatctcccac caaacatccg agttttcttc tggacggatt caatgctagt cctccactgg 3960 ctagccgcat caccctcccg ctggaagaaa ttcgtggcca atcgagtgtc ggagattcaa 4020 gaggtcactg ctggacacgt atggctgcac gtacctgggg tggacaaccc tgctgatgtg 4080 atctcccggg gaatgaccgc tacccaactc atcgactgta tgctgtggtg gcaaggacca 4140 tcgtggctga ggcagagtag cagattctgg ccctgcatcg tgcgaacttc ggacgacctt 4200 ttcgacaagg aacagctgga agaacgaccg gtcgtcgccc tgcctgctgt ggctgaaacg 4260 agcatttttg gtgtcacttc gtcgctttcc gagctgattc gaaaggtagc ctactacaga 4320 aggttttcct acaacgctcg tgggtgcaac caatcgaacc gcagagttgg acctctctcg 4380 actgcggaac tgaacgaatc tctcctgtgc ctcgccaaac ttgcccaaca agaatcgttc 4440 ccaaaagatt tgcacgccat ttgcaccaca ggtcaagtag aatccacatc caagctcaag 4500 tcgttgtcac caatacttgt ggagggagtc cttcgtacgc gtggtcgtct tcaacacgct 4560 gatgtgccct acgatcgcaa acacccgatc atcttggaca acaaacaccc tttcacactg 4620 atgatcatgc gctacaacca tctcaagtac ctacacgctg ggcctcagtt gttgatcgcc 4680 gctgtacgcg ccaagttctg gccacttcgc gctcgtgacc tggctcgaaa agttgtccat 4740 gagtgcgttg actgttttcg ttgcaaacct cgcacatcgg accaaatcat gggggatcta 4800 ccagctgtac gggtcttgcc aatcctgccg tttttgaact ctggtgttga tttctgcggg 4860 cctttctacc tacgaccacc acatcgcaag gccgccccac agaagtgttt tgtggccgta 4920 ttcgtttgta tggcaaccaa ggcagttcat cttgaacttg ttggcgacct ttctgctgac 4980 tcattcatcg ccgctctgaa gagatttgcg gcccgtcgtg gagtgccgaa gacgctgttt 5040 tgcgacaacg gcacaaactt cgtcggtgca cggcgcacgc tgaacgaatt cctgcggctg 5100 ctcaaggctc agcaagtaca ggatcaggtc gtgcgcagct gttcggccga tggtatccag 5160 ttccaattca tccctcctcg gacgccacac ttcggaggca tctgggaagc ggccgtcaag 5220 tccctcaaga accatctacg ccgtactgtc ggcaacgcaa acctgactgc tgagcagatg 5280 tccacctttc tcgcccaagc tgaagcctgc ttgaactcgc ggccacttac gccgatctcc 5340 aacaacccgg acgatctgga cggacatttt ctggtgcaca gaccgctgat cgccataccg 5400 gaaccttcgt atgaggagat tgctacgaac cgcctgtctc agtggcagat gatccaggag 5460 taccttcgtc gtctgtggaa acgctggtcc acggaatacc tctccggcct tcagcagcgc 5520 actcggtgga ctcgcgaaag agacaacatc cgcatcggga caatggtcct ggtgaaagag 5580 gacaatcttc cgccgcagaa gtggcgcatt ggcagagtca tcgagatctt ccccggcaac 5640 gacgggcatg ttcgtgtcgt caagcttcgc acaccggacg gcgtctacac gcgagccatc 5700 actcggctct gcgtcctgcc gatcgacgac aatcagcagc cggctcagga ctcacagtag 5760 ttcctgtcgt tcgtggttcc agaaaagaac gctttgttgg atgtggaggg ggcctgatgg 5820 tcctctcaca agttaagtat ctaccgtttt tctggaccac tcctaccgac ctaaagtcct 5880 gcctcgtctc tgtagaacct cgtaccaact gtcctgccct gctgatttgt tcgtgttgtt 5940 cgcggtgcca gaaaaccctg cattgttgga ttgggagggg gtctgttgac cctctcacaa 6000 gttaagtttc taattaattt tctggaccgt gccttctcgc cctgaagtcg tgtctcgtat 6060 ctctagatgg actgcccgac ctgcccgacc tgcgcggcct gcgcgacctg ctcgacctgc 6120 tcgacctgcc tgacctgctc gatctgctca accagcacag ccagcctgga ggacctgctt 6180 gatctgctcg acctgctcca ccggctcaac cgacctgttc gacctgctcg agctgcccga 6240 tctgccagat cttcctgatc tgcacagctg ctgacgttcg cggtgccagc aaacaacgca 6300 ttgtttggat gtgagggggc ctattggtcc tctcacaagt taagtatcta attatgtttc 6360 tggaccgctc tacccgacct aaagtcgtgc ttcgtatcgc taggaaatcc aactgcactg 6420 cggaaaagtg ctcgaacggg acatcgacat cttcgagagt tccgaccatt caacccccag 6480 ccgttggtcg tggaacctgg tgacgtctcc ggcttttcta cccacaacct ttggccgagg 6540 agaagctgtg ttccagatca tcgccagaag attgtgaagt tattgaaaat tgatttgtgt 6600 ttagattaag cagaaattag atcatgttta ggtgttagtt tagaattccc caccaccaga 6660 ttagatttag attttcgatt gaaacttgca tttcaatggt gggcggaa 6708 // ID CR1_Ele39 repbase; DNA; INV; 5224 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele39. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5224 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5224 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >94% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 360..1259 FT /product="CR1_Ele39_1p" FT /translation="MTAITACDRCAKTVKRNENFIECMGFCNNVVHMKCDN FT LNIEFIATIRERPNLFWMCDECTKLMKVTRFKSVVSSLGCAISTVIENQMT FT GISELKAEISKNNQQVAEVIVENNNQVAQLANIVNASTPVPAPNRERPSKR FT RREDTETSNAITLGTRNIASDIAVAVKPAANLFWMYLSRFHPSVKEDVVEN FT LAREGLQTREPLKVVSLVKKGADLSSMNFISFKVGVPVELKDVALNPNTWP FT EGILFREFEGQSKNSVWLPPAMSTTTRDGTPFGTPAPVPTPTFDAPISITI FT PTTISDEA" FT CDS 1313..5107 FT /product="CR1_Ele39_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MMEAPNLPNSVAPSAVFAHHSRLGPEVGGGNGVFHPA FT CPGKYFSNKSSSAELSLPQIESNSRTVSTADSILNFWTLRRTIGHASTICG FT HPGRTNESILGGSCSSNSVAPIAASVLHSRPGPVVRCGFGVPNLSCQASTT FT STVHRAIILRLILVRLXVLYDQLIPHHIRSTFTAPASSHELHCGPPGRTVE FT SFLGAPSSSISVVPIAASVPHSRPGPELRCGMGVSQLAFLGKYNGFCDNSL FT LDSDSHSSTTEITPTPTSHQAGIINRSLPTSTVSSNLSGRDSKEADSHRFS FT RRSDIDNHRSAVCRKHXLVIFYQNVGGINSEVNNYYLAVSDTCYDVIVMIE FT TWLDDRTLSNQVFGSDYEVFRCDRNPHNSRKSTGGGVLIAVRRGLKAKIVE FT DDEWISLEQVWIAVELNDRKLYLCAVYIPPDRIRDIDLVRTHCRSISSVVE FT MSSPLDELMVLGDFNLAGISWKPSHGGFLFPDSRQSVLYPAAIHLLDSYSA FT STLSQINHVLNENQRSLDLCFVSTQDTAPFISEAPVPLVKHVSHHPPLIVN FT IPKCLDRDFIDFPEPVSYDFRKADHRSITDVLTSLDWDVIFDTDDANSAAL FT TFSHVLGYVIDRHVPKRSQQLNRLPWQTSELKQLKTIKRAALRRYTKYRTV FT SLKIQYVRLNYEYKRISRLCYSRYQQNIQGKLRSHPKSFWKYVNQQRKETG FT LPSSMTWNNLTASDPQRICQLFSSKFADVFTDETLTDGQIAIAATNVPLYA FT QTLSTIDISVEMISTAASQLKPSFNPGPDGVPSAFLKKHISNLLMPLHRLF FT RSSVSSGIFPSCWKIAHMFPVHKKGSKRNMDNYRGISSLSAVSKLFELIIM FT QPLLSHSKQYLSDDQHGFMVGRSTTTNLLCLTSFITESMVERMQTDVIYTD FT LSAAFDKLNHSIAIAKLHRLGVSGTLLQWFRSYLSGRQLMVTIGDCTSETF FT PATSGIPQGSHLGPLIFLLYFNDVNLVLEAPRLSYADDLKLFLRIQSVNDC FT HFLQRQLGIFANWCSINRMLVNPEKCSVITFSRKKEPIYFDYELSNTTIKR FT TTHIKDLGVILDSQLNYKQHMSYAIDKASRALGFIFRMTKHFSDVHCLKSL FT YGSLVRSILEYCSPVWNPYYNNGVERVESVQRRFLRFALRKLPWSNPFRLP FT SYEDRCRLIGLELLRVRRDTSRALLVADILQGRIDCSYLLQQININVQPRA FT LRNSVMLRTPLHRTNYGLNGAIGGIQRVFNRVSASFDFHLPRSTLRRRFVS FT HFAE" XX SQ Sequence 5224 BP; 1347 A; 1262 C; 1115 G; 1494 T; 6 other; gttcstwtac attccggacc gacagttttt gtgtttttta tcgttgtaat atcgagttat 60 tagtgtctag ctgtgtgtgc tcttgaacat taaatgttta tgctccttag cggattggaa 120 tacctcaagt gattgatgac tttatagaga aatcgcgctc cacatatttt ctcgtcatct 180 gaaccgcgag tggcactttt ccactttaac gcaactgatc ccttgcctca ctttcctggt 240 taccgtactc tatagagtta taccagaatc ctattctgtt gcgtattgcg acgctcggtg 300 cgccatctat atctgaaacg atcaacgttg ctgattgggc atctaagtgt attggcgata 360 tgactgctat cactgcctgc gatcgttgcg ctaaaacggt gaagaggaat gaaaacttca 420 tcgaatgcat ggggttctgc aacaatgttg tgcatatgaa gtgcgataat ttgaatatcg 480 aatttatcgc caccattcgg gaaagaccca atctgttttg gatgtgtgat gaatgcacca 540 aactcatgaa agttacccgc tttaaaagcg ttgtttcttc tctcggctgt gcaatctcga 600 cagtaatcga aaaccagatg accggaatat ctgaattgaa agccgagata tcgaagaata 660 atcagcaggt tgccgaagtg attgtggaga ataataatca agtcgctcaa ctcgccaaca 720 tcgtgaatgc atccactcca gttccagcac ctaataggga gcgtccctca aaacgccgtc 780 gggaagacac ggaaacctca aatgcgatta ctctcggaac aagaaacatt gcctctgaca 840 ttgcagtcgc tgtcaagcct gcggctaatt tgttctggat gtatttgtca cgttttcacc 900 ctagtgtgaa agaagatgtc gtggaaaatc tggcgcgtga aggcttacag actcgggagc 960 cwctcaaggt ggtgtctctg gttaagaagg gcgctgacct tagttcgatg aactttatct 1020 cgttcaaggt tggtgttcct gttgagctca aggatgttgc attgaatccc aacacwtggc 1080 ccgaaggtat tttgttcagg gaatttgagg gtcagtcaaa aaactcagtg tggcttcctc 1140 ctgctatgtc aactacaacg cgcgatggaa cgcctttcgg tactccagca cctgtaccaa 1200 cgccaacttt cgacgctccg atttccataa cgattcccac taccatttcg gacgaagcat 1260 aatcaccgct attgacaacc aacaacaaca acccagacgc gccgtagaaa gcatgatgga 1320 agcccctaat ctccccaact cagtcgcgcc ttctgctgtc ttcgctcatc atagtcgtct 1380 gggccctgag gttggcggcg gaaatggggt cttccaccct gcttgcccag gcaagtactt 1440 ttcaaataaa tcttcttcgg ctgaactttc gctgcctcaa attgaatcga attccaggac 1500 tgtatcaact gctgactcaa tactgaactt ttggactctt cgacgcacta tcggacacgc 1560 ttcaactatt tgtggacatc caggacgcac caatgaaagc attttggggg gctcctgttc 1620 ctctaactca gtcgcgccca ttgctgccag cgttctacat agtcgtcctg gccctgtagt 1680 tagatgcgga tttggagtcc ccaacttgtc ttgtcaggca agcacaacat caactgttca 1740 tcgagcgatc attctgcgcc tcatactggt tcgccttkca gtactttacg atcaactaat 1800 cccacatcat attcggtcaa cgttcaccgc acctgcttcc agccatgaat tgcattgtgg 1860 accaccagga cgcaccgtgg aaagcttttt gggagcccct agttcctcta tctcagtcgt 1920 gccgattgct gccagcgttc cacatagtcg tcctggccct gagcttagat gtggaatggg 1980 ggtctcccaa cttgcttttt taggcaagta taacggtttt tgcgacaatt cgctgcttga 2040 ttctgattcg cattctagta ccaccgaaat cacgcctaca ccaacttctc atcaagctgg 2100 tatcattaac agaagcttac cgacaagcac tgtttcttcg aacctttccg gacgtgactc 2160 caaggaagca gattcccatc gcttttcacg cagatcagac atcgataatc atcgttcagc 2220 cgtgtgtcgt aaacatsaac ttgttatttt ctaccagaac gtcggcggaa ttaattcgga 2280 agtcaacaac tactatttag cagtttcgga tacatgttac gacgtgattg tcatgatcga 2340 gacctggctc gacgatcgaa cactatccaa ccaagttttc ggatccgatt atgaggtttt 2400 tcggtgcgat agaaatccgc ataacagcag gaaatcgact ggaggtggtg ttctaattgc 2460 tgtccgccgt ggattgaaag cgaaaattgt agaggatgac gaatggatca gtttggaaca 2520 ggtatggatt gccgttgaac taaatgaccg taagctatac ctatgtgcgg tgtacattcc 2580 ccccgatcgt attcgagaca ttgatctggt tcgaacacat tgtcgctcga tttcctcagt 2640 agttgaaatg tcgtccccac tcgatgagct catggtgctg ggtgatttca acctggctgg 2700 tatttcgtgg aaaccttctc acggcggttt tctctttccg gatagccggc agtctgtact 2760 ctatccagcg gccatacatc ttctcgatag ctacagtgca tcgactctct ctcaaatcaa 2820 ccacgtattg aatgaaaacc aacgcagctt ggatctctgt ttcgttagta cccaggacac 2880 agctccattt atttccgagg ctcctgttcc tttggttaag catgtgtccc accaccctcc 2940 gctgatagtt aacattccaa agtgtctgga cagggacttt attgatttcc ctgaacccgt 3000 atcgtacgac ttccggaagg ctgaccatcg tagcataacc gacgtgctga ctagtctaga 3060 ttgggatgtc atctttgaca ccgatgatgc gaatagtgct gctcttactt tttcacatgt 3120 tctgggatat gttatagaca gacatgttcc aaaaagaagt caacagctaa atcgtcttcc 3180 gtggcagaca agtgaattga aacagctcaa aacaatcaag agagccgctc ttaggagata 3240 cacaaaatat cgcaccgttt ccttgaaaat tcaatacgtt agactgaatt atgagtacaa 3300 gcggattagt cgtttatgct attctcgtta ccaacaaaat attcaaggaa aactcaggtc 3360 gcatccgaag tcgttttgga agtacgttaa ccaacaacgc aaagagaccg gccttccttc 3420 gtcaatgacg tggaacaatt taaccgcatc tgatcctcag cggatttgcc agttattctc 3480 ctccaagttc gctgatgtgt ttaccgacga aactctaact gatggtcaga tcgctattgc 3540 ggcgactaat gttcctctgt atgctcagac tctgagtact atcgacatca gcgtcgagat 3600 gatctcgact gccgcatcac aactgaagcc atctttcaac ccaggtccag acggagttcc 3660 atcagccttt ctcaaaaagc atatctccaa tctgctcatg ccactgcatc gattattccg 3720 gagctctgta tccagtggaa tctttccctc ttgttggaaa atagctcata tgtttccggt 3780 gcataagaaa ggtagtaaac ggaacatgga caactatcgg ggcatttcct ctctgagcgc 3840 tgtatcgaaa ctatttgagc ttatcataat gcaaccttta ttatcgcatt ctaaacagta 3900 tctgagtgac gaccaacatg ggttcatggt tggccggtca actaccacca atctgctatg 3960 tctcacttcg tttattaccg aaagcatggt tgagcggatg caaacagatg tcatatacac 4020 cgacttgtca gctgctttcg ataagctgaa ccacagtatt gctattgcta aactacacag 4080 actgggggtg agcggtacat tacttcaatg gtttcgatcc tacctctctg gtcgccagct 4140 gatggtcacc atcggcgatt gcacatcaga aacctttcct gccacgtctg ggataccaca 4200 aggcagccat ttaggacctt taatcttcct gctatatttt aacgatgtga acctagttct 4260 cgaagcccca cgtctttcgt atgctgatga cctcaagctt ttcctgcgaa tccagtctgt 4320 aaatgattgt catttcctac aacgacaact cggaatattt gcgaactggt gcagcatcaa 4380 ccgtatgttg gttaatccag agaaatgttc ggtgattact ttttcaagga aaaaagagcc 4440 catctacttc gactatgagc tgtccaacac gacaattaaa cgtacaactc acataaaaga 4500 cttaggagtt atccttgatt cgcagctgaa ctacaaacaa cacatgtcat acgccataga 4560 taaagcatct agggctttag gcttcatttt taggatgaca aaacattttt ccgatgttca 4620 ttgcctgaaa tctctgtatg gatcgttggt gcgctcgatt ctggaatatt gttctcccgt 4680 ttggaatcca tattataaca acggcgttga aagagttgaa tcggtccagc gacggtttct 4740 tcgattcgcc ttgcgcaaac tgccttggag taacccgttc cgtctgccaa gttatgagga 4800 tcgttgccgg ctgattggtc tggagctgct tcgagttcgt agggatacgt caagagctct 4860 cctggttgct gacatcttgc aaggccgaat agactgcagc tatttactac agcagatcaa 4920 cataaacgta caaccgcgag ccctgcgtaa cagtgttatg ttgagaacgc cgcttcatcg 4980 aactaattat ggcttgaacg gtgcgattgg tggaattcag cgcgttttca accgagtttc 5040 tgcatcattc gattttcacc tacctcggtc aactttgcgt cggcgctttg tgtcacattt 5100 cgcagaatag tatatttaag ttttaaaatg tattttaatt ttaattttag ttcattctat 5160 cattagggct tttgaagtct gttgatattg actgttaaat aaataaaata aaaaaataaa 5220 aaaa 5224 // ID Transib-10_HM repbase; DNA; INV; 3745 BP. XX AC . XX DT 31-JAN-2008 (Rel. 13.01, Created) DT 31-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3745 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 10-10 (2008). XX DR [1] (Consensus) XX CC Transib-10_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome less than a few CC million years ago (copies are ~1% divergent from their consensus CC sequence). The consensus sequence was obtained based on multiple CC alignment of 10 copies; it codes for a 561-aa Transib CC transposase. Like other Transib transposons, Transib-10_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. The Transib-10_HM transposase is less than 30% CC identical to other Transib transposases. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1343..3025 FT /product="Transib-10_HMp" FT /note="transposase." FT /translation="MYFFIIYSVPCRVPKSSAVNTSCTPKPRKSFIKLTNR FT MKRKRTDSLREREVEELQHAVKRFKSNSSTDGRAILYGSNLSLQALCNLKL FT TDRTWTEFKNIVTHNRHLVPPGLTKLKQYKKLTYPEQVEYSETEVKCSLFH FT MIKHSVSRVFEYLLDYESNCIMSLSQEERDNVQIICKVGGDGQSDQSEFQN FT SATMKRGVDDRSVYSIVFVLLEIRSGEKIIWKNLIPNSPDATRHLLFCFAK FT ETASFIQEKFEHLKNEIFLLKNIEFKLGESTFVVKSSLSTIYTTMNDGKTL FT NAVVSNMLKKRISSQSCHVCLANSKEFNLKTIWKKNINLNKHVLKLGICSL FT HLWMRCMEWIFNTACKLPAVKQGRNIPSQKSKEFIENKKKFQHLFKIKLNI FT RVSFVRKGCGTTNTGNVARKFFNNPIVTGRILNLDIRMIKLFRDLLIDINR FT TTTRPDIKVFKQKSEHLFALITNDKFLKTIPMSQSVHRVIVHGTSFIRYFK FT YPIGSLSESAIEARNKENKTARIGHARLTSFAENTRDTANYLFMTSDMYLF FT CKKNKMDKKRVKTK" XX SQ Sequence 3745 BP; 1374 A; 467 C; 583 G; 1320 T; 1 other; cactatgcgt cgcgaagcgg aaataggatt ttacttactt ttcagaaaaa aattttcaat 60 tttaatattt taaatatttt ttaactgaaa caagttagta ttagttaata aaaaaaagtt 120 tttttttaat ttttttaaat ttgccttttt aggtgcggga ttttcagggg taaaaaaacc 180 catgttttca ctaaatcttt aaattacata agataatttt caaatttctt catttttttt 240 gcacattttc taataaatga tacatttcgt aaattaaaaa ccaagaaaag aaaaaaaaaa 300 tgtttggatc attgtttttg ggtgtctgta aagattgatc gattagacca tgttactaca 360 gaggtagcca gttttttatt tttttaaaat tttctcaaac aaattgactt tttataagtg 420 aaataatatt ctggctacct ctgtagtaac atgctataaa atcaattcaa aagaggtgga 480 tttattaaaa accggttagt tagtttttgt caagaagaaa ctggatgcgg gcggaacatt 540 cttttctagt ctttaagtta taacctcagt ttggctctca tcgtgattta gaatttttta 600 aaaatggtta cttttaatct tagcaaactt ctggctgata attttcctcc aaggcatatt 660 ttcacgagta aaaacgtatc tgaatgtgct aaatttttgc agtccaagtt tttatcactt 720 tatgctgatt gcactgtgaa tttcaacatt ttcaaaagtc tggcttttaa cttaataaaa 780 aaatttaatg agtgtggaaa aagaaaaaag ttctttctta ctaaatataa acattttttg 840 gatcaaaatc aacaatgcaa atatctcttg tgagtataat atattctata ttgatttatt 900 ataaataagt aataaatatt gcattatagt atatatatat awatatatat atatatatat 960 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 1020 atatatatat atatatatat atatacatat atatattata cttgtgtgtg tagattagaa 1080 aaagattctg aaaaagagat tagaacacat gaggaagaac tgctattaat tagaaaaggt 1140 gaaaatctga ttaaacaggg taatcaactc atcaagcagg gtgagcgtaa aataaaagaa 1200 ggggaattac tgcttcgtac tcatggatcc aatgtaaaca ggtatgcata gataaatgta 1260 tctccctctt tttattttgc acacacacat acatattgtt tgactatgtt tcacaaaacc 1320 ttcaaaatgt atatatgttt caatgtattt ttttattatt tatagtgtac catgtagagt 1380 gcctaaatca tctgcagtca atacatcttg tacaccaaaa cctagaaagt cttttattaa 1440 gcttacaaac aggatgaagc gtaaaagaac agatagtctt agagaacgag aggtggaaga 1500 actgcaacat gctgtcaaaa ggtttaaatc taattcttct actgatggta gggctatttt 1560 atatggttca aatttgtcat tgcaagctct ttgtaatttg aaacttacag atagaacatg 1620 gactgaattt aaaaatatag tgactcataa tcgtcatctt gttcctccag gtttaaccaa 1680 gttgaagcaa tacaaaaaat tgacatatcc agaacaggtt gaatattctg aaaccgaagt 1740 aaaatgtagt cttttccata tgataaagca ttcagtttct agagtgtttg aatatctttt 1800 ggattatgaa agtaactgca taatgagttt atcacaagaa gagcgtgata atgtacaaat 1860 tatttgcaaa gtaggaggtg atggtcaaag tgatcaatcg gagtttcaaa attcagcaac 1920 tatgaaaaga ggtgtagatg atcgctctgt gtatagtata gtgtttgtac tgctggagat 1980 ccgaagtgga gaaaaaataa tatggaaaaa tttaattcca aatagtccag atgcaacccg 2040 tcatcttctt ttttgttttg caaaagagac tgctagtttt attcaggaaa aatttgaaca 2100 tttgaaaaat gaaatttttt tattgaagaa tattgaattt aaacttggtg aaagtacttt 2160 tgtagtaaag tctagtttgt caactattta taccacaatg aatgatggca aaacactaaa 2220 cgctgttgta tcaaatatgc ttaaaaaaag aatttcatct cagtcctgcc atgtatgcct 2280 tgcaaattct aaggagttta acttgaaaac tatttggaaa aaaaatatta acttaaacaa 2340 gcatgttttg aaacttggaa tttgcagtct tcatctatgg atgcgctgta tggagtggat 2400 tttcaatact gcttgtaaac tcccagctgt taagcaagga agaaatattc catcacaaaa 2460 gagtaaagaa tttattgaaa acaaaaaaaa attccagcat ttattcaaga taaaactgaa 2520 tataagggtt tcttttgtca ggaaaggatg tggaacaacc aatactggaa atgttgctcg 2580 caaatttttt aataacccca ttgtgacagg gagaatttta aacttggata tcagaatgat 2640 taagttgttt cgagatttac tcattgatat aaacagaaca actacaagac cggatataaa 2700 ggttttcaaa cagaaatctg aacatctatt tgcactcata actaatgata aatttttaaa 2760 aaccatacct atgtcacaat ctgtgcatag agtgatagtt catggtactt cttttataag 2820 gtattttaag tatccgatag gttctttgtc tgagagtgct attgaagcca gaaataagga 2880 aaacaaaact gctcgaattg gtcatgctcg tttaacctct tttgcagaaa acactaggga 2940 tacagctaat tatttgttta tgacatctga tatgtacctc ttttgcaaaa aaaataaaat 3000 ggataaaaaa agagtgaaaa caaaatagtt taagaaagca aagtgacagt tcttctgata 3060 actgcttaaa ataaaggtaa aatagtttta ttattaatat tgttaatagt tttgattgat 3120 tttttttttg gttaggcaat ctcataattt tggttaatta acaattcaaa gttgcgtttt 3180 tttttttgtt ttttttatgt ttttaaaaaa aaatttaaga aaaaatttaa tcaattttaa 3240 caagtaatta actattttag gaagtaagac catgaagaat catactgatg agctgttctg 3300 cgttgaattt ttaaaaaacg gctattttca tagccacaag ttattaaaaa catctattag 3360 aatagaatgt tgagaaactg taaaagaatg caaagagcca agtatcatat gatagcagag 3420 ttgtttatta tattcaatga tactgtttat tttgaaatag tgttcatata catagtttgt 3480 aaaattctat aaaatatagg cgcctacttt tttttgaata aaatcatccg gacttgtgta 3540 gttttccgga caaatctaga attagtccgg actaaccgaa aaaaaataca ataaattgtt 3600 ttttaatgaa aaccgaatat gattctttta aatatagaaa tataaataaa aatttgatgt 3660 cttgtgaaaa aataagttaa atattttaag tccttcttta tttatatgcc tatttcctca 3720 ttttgtataa gagcaacgca tagtg 3745 // ID Gypsy-60_AA-I repbase; DNA; INV; 4734 BP. XX AC supercont1.279; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-60_AA_; KW Gypsy-60_AA-LTR; Gypsy-60_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4734 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.279; Positions 1278383 1283116. XX CC Positions [3739-4221] - Integrase core CC 'CAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 445..4653 FT /product="Gypsy-60_AA-I_1p" FT /translation="MSGSEHDSDNFEDPIGRLSVEGRVLPPFVTDRKSKLK FT HCVEMEEMKAQVAALQKELAELRHRSLTVSDGPSGSNTTLTERVPDFREIR FT EYVTPFDPKLPSCPSAEVWVKSIDETGDVYTWSAAMRLHCARLSLSGCAKL FT WFAGCHAVIKNWDEFKAEIVKGFPSAKNPIYYHNLLSSRKWKSGETVEEYV FT YEMLAMGRKGGFTEETIVTYITSGLRSYVRRSGMTIGKVFTVQGLLEELRW FT IDSVDAVATTSRASDAGAGMATNVGDGRGEGRRKTETVGEPCYHCREMGHI FT ARNCPKVKCHLCKREGHMKRDCRGQSGKKEIREQNPKMMRVVDQKSMFVKN FT VVISGLTLKALVDTGGKVSTIQEKFAKKVGELRPSHKVLRQFGKKEVTVSS FT KVCAELKVDGINMPVELQVVPTWVQDTAIILGEDVIDQDRLIMVKRKGDVK FT FAWERESEAGSPQPNSDEEAVESSVARMYTIAVESERVIDEQSLNSDGGHL FT YDQKLVRLIEQYRDCFALNMKEMGCAKSTEMKINLVNEDPVYMKPRRLEYA FT RANVLCEIVSELLDAGIIAETESPYNSQVVLVPKKNNQFRMAVDYRLLNAK FT TIKDKYPMPDIESCLQKLAGARLFITIDLYSGYYQIPLDPESQNYTAFSTM FT DGHYRFLRMPFGLANGCAVFQRAMNRVVEVLKKKKIVVVAYIDDLIIPGKN FT EEELLWKFEELLVVLRDEGFTINLNKSHFFKTLVCFLGFEVSERGVRPGEL FT KTKAVSEFPTPTSVHSVQQFLGLSGFFRRFVQNYSIIAGPLFRLLKKDAEF FT VWQEKEESAFRKIKQLLVERPLLVLYDPKAEIELHTDASSVGLAGILLQKC FT EDVWKPVSYFSRKTSSAEAVYHSYELELLAVVASVERFRQYLIGRFFILRT FT DCSAIRDTYTKKEMNPRIARYFLKMLEYDFQIEHRSGNKMQHVDALSRSPI FT EEPVEIETVAENIMVLEISNTDFLVSMQRQDERVLEIVNKLSRDPLSDEDK FT QIHDNYVLDNHRLYRKVDGRKCWVVPNNVRWRMMRSYHDEKGHFGEEKVVS FT MMGEMFWFPKMRKYVRAYIAACPKCAFFKAKQGRPEGFLNPIPKTPIPFYT FT VHIDHLGPFPRSSKGNEHILAIICGFSKFLLAKPVKSTNTQPVITMLEEMS FT SVFGLPSRIITDRGTAFTSRVFKQFCEDYSVEHVLVAVGTPRGNGQIERSN FT RTILTALRTMVDDADRRWDEKVKFVQSAINTAPNSTTRLAPTTLVLSFKPR FT DVVQNEIISVISTENPIIPASIQELRDRVETATRENQAKQKIYYDKRRRAA FT VNYHVDELVLVVKDQYIAGGSRKLEPRFKGPFVVTEVLPNDRYRVSTIPGF FT SGRALSTVYAADHMKRWCDTVDLEGDALVGSELSTDEEDY" XX SQ Sequence 4734 BP; 1336 A; 886 C; 1368 G; 1144 T; 0 other; ttctgtagac aggattgggc ctcgagtcgc gtgccggttg aaaatcacga agagttgtgc 60 agatttcaac ggtagtgagc cacagtgagt gtggagggta gttttcaacg tgacgggcaa 120 gtaacgtggt taagaaagcg gtgcgtctgg gtcagaaatc gtgcgaaatt tctgaaaagc 180 ggttatccca gtggaggtgg cttgagcaag cacttatcgt gaattcagta ctgtagacta 240 ccaggaccat gtggtgagca cgtggttgag gatttgtaag gctatctgag gccaagactt 300 ggacaatagc cagtaagtgg acatattgtt cactgcgaag agtgttcgtc cactcagcga 360 aagtgttcgt aaaagattcg tcgtcgggaa gagtgactac ttagctgagc ccatacgaat 420 ggcgtagtga gtgctttgta tggtatgagt ggatctgaac atgatagtga caatttcgaa 480 gaccccatcg gtcgcctaag tgtagaaggc agagtgctcc caccgtttgt taccgatcga 540 aagtcaaagc taaaacactg tgttgagatg gaagaaatga aagctcaagt ggcggctttg 600 cagaaggagt tggcagaatt acggcacaga tcattgacag tcagtgatgg tccgtcgggc 660 agtaacacca cactgacgga aagagtaccc gatttcaggg aaataaggga atatgtaaca 720 ccctttgatc ccaaactgcc atcctgtccc agcgctgagg tgtgggtgaa aagtattgat 780 gaaaccggag atgtgtacac gtggtctgct gccatgcgcc tccattgtgc ccgtttaagt 840 ttgagtgggt gtgcgaaatt atggtttgcg gggtgccatg cagtgattaa gaactgggat 900 gagtttaagg cggaaatagt taagggtttc ccttcggcga aaaatccaat ctattaccat 960 aatttgctct catctcgaaa atggaagagc ggtgaaactg tagaagagta cgtgtacgag 1020 atgctagcca tgggacgaaa gggtggtttt actgaagaga cgattgtaac ttacataacc 1080 agtggactaa gatcgtatgt gagacgctcc gggatgacga tcgggaaagt gtttacagtg 1140 caaggtctgt tagaagaatt gcgctggatc gacagtgtgg acgcagtagc aactactagt 1200 agagcaagtg atgctggcgc gggtatggcc acgaatgttg gtgacggacg cggtgaagga 1260 cgaagaaaga ctgaaaccgt tggcgaacct tgctaccact gccgtgaaat gggacatatt 1320 gcccgaaatt gtccaaaggt gaagtgccat ctgtgtaaac gcgaagggca catgaagaga 1380 gactgtagag gacagagtgg taaaaaggaa attcgagagc aaaatcccaa aatgatgcga 1440 gtggtggacc agaagagcat gtttgtgaaa aatgtggtga tatctgggct caccctgaag 1500 gccttggtgg acactggcgg aaaagtttca acgatccagg aaaagtttgc caaaaaggtt 1560 ggcgagctta ggcccagcca taaagtgctg cgtcaattcg ggaaaaaaga agtgaccgtt 1620 tcctcaaagg tatgtgcaga gctaaaggtg gatggtatca acatgcctgt tgagctacaa 1680 gtggtgccta cgtgggtgca agatacggca atcatattgg gagaagatgt gatcgaccaa 1740 gatcggttga ttatggtgaa aagaaaaggt gatgtgaaat tcgcatggga aagagagtca 1800 gaggcaggat ctcctcaacc aaactccgat gaagaggccg tagaatcgtc tgtggctcgg 1860 atgtatacca ttgcggttga atctgagaga gtgatagatg aacagtcgtt gaacagtgat 1920 ggcggacacc tatacgatca gaagctggtc cgactgatag agcagtaccg agattgcttt 1980 gctcttaaca tgaaggagat gggatgtgcc aaatcgacgg agatgaagat caatctggtg 2040 aatgaagatc ccgtgtatat gaagccgaga agattggaat atgcgagagc gaatgtgctg 2100 tgcgagatcg tgagtgaact gctggatgct gggattatcg cggagactga gtccccgtat 2160 aacagccagg tggtgttagt gccgaagaaa aacaaccaat ttcggatggc cgttgattat 2220 cggctgttga acgcgaagac catcaaggat aaatatccca tgcctgacat cgaatcatgc 2280 ttacagaagc tggctggagc acggttgttt atcaccatcg atttgtacag tgggtattac 2340 cagatcccgc tggacccgga aagccagaat tacactgcgt tttctacgat ggatgggcat 2400 tatcggttct taaggatgcc attcggccta gccaacggct gtgcggtttt ccaaagagcc 2460 atgaacagag ttgtggaagt gcttaagaag aagaaaattg tggtggtcgc ttacatcgac 2520 gacttgatca ttcccgggaa aaatgaagaa gaattgctgt ggaaatttga agagctgtta 2580 gtggtcttac gagacgaagg attcacaatt aatttgaaca agagccactt tttcaaaaca 2640 ttggtgtgct tcttgggctt tgaagtgagt gagagaggag ttcgtcctgg tgaattgaag 2700 actaaggcgg tctcggagtt tccaaccccc acctcagtac attctgtgca acagtttctg 2760 gggctgtctg gattttttcg acggtttgtg caaaattaca gcataattgc tgggcccttg 2820 tttcgattgc tgaaaaagga tgctgaattt gtgtggcagg agaaagagga gagtgcattt 2880 cgtaaaatca agcaactttt ggtggagcga cctttgttgg tgctatatga tcctaaggct 2940 gagatcgagc tccacaccga tgcgtccagt gtggggttgg caggaatact gctgcaaaag 3000 tgtgaagatg tgtggaagcc cgtgagttac ttcagtcgta aaacgtctag tgctgaagcc 3060 gtataccata gttacgagct ggagttatta gctgtagtgg cgagcgtgga gagattcagg 3120 cagtatttga tcgggagatt cttcatactg cgtacggatt gttcggcgat acgtgatacc 3180 tacaccaaga aggaaatgaa ccctagaatt gctcgctatt ttttgaagat gttggagtac 3240 gatttccaaa ttgagcaccg cagtggaaat aaaatgcagc acgtagatgc tctcagtcga 3300 tcaccgatcg aggaaccagt cgagatcgaa acggttgcgg aaaacatcat ggtgctggag 3360 atctcgaata cagactttct agtgagtatg cagaggcaag atgagcgggt attggagatt 3420 gttaataaac tctcccgaga tccgctgagt gacgaagata agcagattca cgataactat 3480 gtgttagaca atcatcgctt gtaccgcaag gtggacggaa gaaagtgctg ggtggtaccg 3540 aataacgtac ggtggagaat gatgaggagc tatcatgacg aaaaggggca ttttggagaa 3600 gaaaaggtgg tttcgatgat gggtgaaatg ttttggttcc caaaaatgcg gaaatatgtg 3660 cgagcctaca tcgctgcttg ccccaaatgt gcatttttca aggcgaagca ggggagaccc 3720 gaaggttttc tgaatcccat tccgaagaca ccgataccgt tctatacggt acacatcgac 3780 catctgggac catttcctag gtcgtctaag ggaaatgaac atatccttgc tatcatttgt 3840 ggattctcga aattcttgtt agccaagccg gtgaagtcca ctaacacgca gccggtgata 3900 acgatgttgg aagagatgtc cagtgtgttc gggcttccat cccgaataat aacggatcgt 3960 ggaactgcat tcacgtccag agtatttaaa cagttttgtg aagattacag tgttgagcac 4020 gtgttggtag ccgttggtac acctcgtggg aacgggcaga tcgagagaag caatcgcact 4080 atactcactg ctctccgtac gatggtcgat gacgcggata gacgatggga cgagaaggtc 4140 aagttcgtgc agagtgctat caatacggca cccaattcta ccacaaggtt agcgccaact 4200 acgcttgtgt tgtctttcaa accgagggac gtggtgcaga acgagataat ctcagtaatt 4260 tcgactgaga accccatcat accagcgtcg attcaggagc tacgtgatcg tgtggaaaca 4320 gctacgcgcg agaatcaagc caagcagaag atatattacg ataaacggag acgagcagcg 4380 gtcaactacc acgtggacga attggtgttg gtagtgaagg accaatacat tgccggaggt 4440 agccgtaagt tggaaccacg tttcaaaggg ccgtttgtgg tcactgaggt gctaccgaac 4500 gaccgttaca gagtgagtac gatcccagga tttagtggtc gagctctctc gacagtgtat 4560 gcggcagacc acatgaagag atggtgtgat acagtggact tagagggaga tgctttagtc 4620 gggagtgaac tatcgaccga tgaggaagac tattagatta gagagatatg agatgtagag 4680 ttgagtaatt gaaatgaatt cgatgcggac atcttgaatg tctgggtgtc cgat 4734 // ID Crack-37_AAe repbase; DNA; INV; 3367 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-37_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3367 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1253-1253 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. It is positioned at the deepest branch of CC Crack/Daphne CC in the RTclass1 phylogeny. XX FH Key Location/Qualifiers FT CDS 61..2946 FT /product="Crack-37_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MYTNKAFDCVNSWLRNKDSINKITSLRIIQLNIRGMN FT DVSKFDCIGELLERYRKRVDIVVLGETWLKRDITMLYSPNGYVGIFSCRDD FT SHGGLVMYIRNDLHFNLCTNRCIDGFHHIHVTLQNKGKPLHLHAMYRPPSF FT DFRRFLSEIEHTITAKGSREACVLLGDMNVPLNLTSNNAVCEYIRLLESYG FT MFATNTITTREASGNILDHVVCPGRIASDVTNETITTDLSDHCFIVSHMNV FT LNSATKHAFTKKIINRDRLNELFNEQCVRVPLTGSANDKLVHVIRTYTSIL FT EQCTKTVNLEAKIKDHCPWMTFDLLKLIRIKDNALSKHRRHPENVAAKELL FT ERVSKLLQQKKAQCKKDYYYKQLERVDTKTAWRFINQNLGKHKKDSRISKI FT ELDGQVLNDTNSICGAFNKYFCEIGPSLASNIVSDSNIYKFGTLVPQQNSL FT YLYPTSTNEVLILINKLDIKKSSGSDNIPASFVKTHHRFFARLVTDVFNEI FT IHTGQYPECLKLARVVPIFKSGSNKELTNYRPVSCLSVLDKLIEKLLTSRI FT IEFTQRFNLIYAHQYGFRSGSSSLTACCDLVDNVYDSLDRKKIAAALFIDL FT KKAFDTIDHRMMVEKLDVLGFRGVTKSLLESYLTDRYQFVAIGNNKSLPGA FT VRTGVPQGSNLGPILFLLFINDVSKLRLNGKLRLFADDTSLVYEANNVDEL FT LNQIKEDVILLKNYYDTNLLSLNLNKTKYIVFHSPRKRVPPRTALDIQGCI FT IEEVTEYSFLGLKLDSTMKWSGHINDLKSKLSSICGVMRKLSNFVPRNWLI FT KLYHALFNSRLQYLVACWGSANKSTLKELQVIQNRCLKIIYGRPWLYPTIM FT LYKESSDSMLPIKAMYEYQTLVQVWKILTESSTHHNTELRRVPRIRESRQH FT GSLVLGRPHTEFGRKKFFYTGSKLYNDLPPVCKQSRSLNCLKANLRKHLKE FT NLSKYIL" XX SQ Sequence 3367 BP; 1084 A; 697 C; 671 G; 914 T; 1 other; tcaggaacca gcaagcacgt tgaatacacg ttacatcgac gttctggata atagattaca 60 atgtatacga ataaggcttt tgattgtgta aattcatggt tgagaaataa agattcgatt 120 aataaaatta cttctcttcg aataattcaa ttgaacatca ggggcatgaa cgatgtatca 180 aaatttgatt gtataggaga gctgttggaa agatacagga aaagggtgga tattgttgtc 240 ttgggagaaa cgtggttgaa gagagatata acaatgttat attcgccgaa tggctacgtt 300 ggtatcttct catgcaggga cgattcgcat ggaggactag tgatgtacat tcgaaatgat 360 ttacatttca atctttgcac gaatagatgt attgatggtt ttcatcacat ccatgttacg 420 ttgcaaaaca agggcaaacc attacacttg catgccatgt acagacctcc aagcttcgac 480 tttaggcggt ttttgtcgga aatcgaacac acgattactg ctaaaggaag tagagaagcc 540 tgcgtccttc ttggagatat gaatgtcccg ttgaatctta cttcaaacaa cgctgtgtgt 600 gagtatatta ggctcttgga atcctacggt atgttcgcaa cgaacacgat cacaacaaga 660 gaagcaagtg gtaatatttt agatcatgtt gtttgcccag gtcgaatcgc gagtgatgtc 720 acgaatgaga caattacgac tgacttaagt gaccattgtt tcattgtgtc ccacatgaac 780 gtactcaatt ctgcgacaaa gcacgctttc acgaaaaaga taataaatcg cgaccgattg 840 aatgagctgt ttaatgagca atgcgtacga gttcctctca ccggcagcgc aaacgacaaa 900 ttagtgcatg tgattaggac atacacgtca atactggaac aatgcactaa gacagtgaat 960 ctggaggcaa aaataaaaga tcactgccca tggatgacct tcgaccttct gaaactgatt 1020 cgcattaagg ataacgcact aagtaagcat cgccgtcatc ctgaaaatgt agcagcaaaa 1080 gaactattgg aacgtgtatc gaagttactt caacaaaaaa aagcgcagtg caaaaaggat 1140 tattattaca agcaactaga aagagttgac accaagaccg cgtggagatt tataaatcaa 1200 aatctgggaa agcataaaaa agatagcaga atttcaaaaa tcgaacttga cggtcaagta 1260 ctaaacgata caaacagcat ttgcggtgca ttcaataaat acttttgtga aattggccct 1320 tctcttgcgt caaacatagt cagcgattcc aacatataca agtttggcac tctcgtacca 1380 caacaaaatt ctctttatct ttatccaacc tccaccaatg aagtccttat tcttataaat 1440 aaactggata tcaagaaatc atctggatct gataatatac cggcaagttt cgttaaaacc 1500 catcataggt tcttcgcacg attagtaaca gatgtattca atgaaataat tcacaccggc 1560 cagtaccctg aatgcctcaa gctcgctcgt gttgttccaa tcttcaaatc cggtagcaac 1620 aaagaactca ccaattatcg ccctgtgtcc tgtttatccg tgctggacaa gcttattgag 1680 aaattgttga cttcgaggat aatcgagttt acacagcgct ttaacttgat atatgcacac 1740 cagtacgggt ttcgtagtgg atcaagctcg ctaacggcat gctgcgacct tgtcgataat 1800 gtttatgact cactggaccg caaaaagatt gctgctgcgc ttttcatcga cctcaaaaag 1860 gctttcgata ctatagatca tagaatgatg gttgaaaagt tagatgttct gggattcaga 1920 ggagttacta agtcgttact ggagagctac ctaacggatc gctatcaatt tgtagctatt 1980 ggaaataaca agagtcttcc tggtgcagta aggacaggtg ttccacaagg tagtaatttg 2040 gggccgatat tatttttgtt gttcattaac gacgtgtcta agctacggct gaatgggaag 2100 ctgcgactct ttgcggatga tacgtcgttg gtctatgaag caaacaacgt tgatgaactt 2160 ctaaaccaaa taaaagaaga tgttatactg ctgaagaatt attatgatac aaatttgttg 2220 tcgcttaacc ttaataagac gaagtacata gtgtttcact cacctcgaaa acgggtccct 2280 cctcgaactg ctttggatat tcaggggtgc atcatcgaag aggttacgga atactcgttt 2340 cttggcctta aattggattc aacgatgaaa tggagtggac acatcaacga tctcaaaagc 2400 aaattgagtt caatctgcgg agtcatgaga aaacttagca acttcgttcc gagaaattgg 2460 ctaatcaaat tataccatgc gctattcaat agtcgacttc agtatcttgt cgcatgttgg 2520 ggatcagcaa ataaatcaac gttgaaggaa cttcaggtta tccagaatag gtgccttaaa 2580 atcatatacg gtagaccctg gttgtaccca acaatcatgc tctataagga atcatctgat 2640 tcaatgcttc cgattaaagc tatgtacgag tatcaaacac ttgttcaagt ttggaaaatt 2700 ctaaccgaga gctcaacaca tcacaacaca gaactaagaa gagtcccacg tattcgagaa 2760 tctaggcagc acggtagtct tgttctagga cgtccgcaca cagaattcgg ccgtaaaaag 2820 ttcttttaca ccggaagtaa actgtacaat gatcttcccc cagtatgtaa acagtctcga 2880 agcctcaact gcttgaaagc caatctgagg aaacatttga aagaaaactt gtcaaaatac 2940 atcttataag gagtcacctg ccactatgtt ccaaaaaatt gtcactatca aatcgtttac 3000 ttgccatgtg tctgctcttt tcttctccac cagccgcctc ccgccaccac cgccaacagc 3060 ccaccaccca tcacccaccg cccaccgcca accgcccacc gcctcaactc atcacaaatc 3120 gcctatcatc aacctgtatc aacgattgtt gttagatttt agaaagaaat tgttgtaaaa 3180 tgatwttaat atgcacttcc ttcaaagaga aaataacatt ctcactggaa gtgtccatgt 3240 gttgtatatt aagatcgaaa agaagaggag gttttatgcc tctgggagaa ggatctcgca 3300 aagaatttcc actcccaggg gcttttccct gctccaaata aagaaaaaaa atgtctaaaa 3360 aaaaaaa 3367 // ID Gypsy-591_AA-LTR repbase; DNA; INV; 1336 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-591_AA_; KW Ty3_gypsy_Ele115; Gypsy-591_AA-I; Gypsy-591_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1336 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 1336 BP; 374 A; 336 C; 346 G; 256 T; 24 other; tgttacgtta ggtaataatt agtccggtta tgaggtaaat ttgaggtgta tgcaaacttt 60 attccagtcg tgtagagtgt aaacccacct gaatgwaaac aaactatcac actactgaag 120 ggcgcttgct tgaaatagaa agctcaaaag aaaatagctt gtaaaacaaa gtgacagatc 180 cccacttccg ctacgagact gctcaccctc gcacactcgc gcataccagt tggaaaagcc 240 tcakaagtga ctagaccgtg cggtagcaaa gtcgcccgck gcggccgtcg aamcggcatc 300 gawcttggww gcgctgtgca gagtggcacg cacgcttttg gctggcgaga aamcctcagc 360 waattacatc ccgcgacacg aggcacgctg acacccctca aggacgtaac agctacmgtc 420 agcaaagaag agwtcaccaa tcggaacccc ctgcmgsgaa aacgakcgcc gcaggagaat 480 tcggcttgcc aattggaatc cgaagccatc gcaccagcgg gtggtaacaa sccgggcwtc 540 gacagagccc gaaacgwcac ctcaagggcg atcaggcagc kcaggaagcg awacgagaca 600 tcaaacwgat ccgaccgcat tgaaagggca ccgwtttccw tcggacatca tcaaccacca 660 ckggaggaaa aaaggaagag ccttcaaaag aagtgagtga ascactaatg gacgaatgac 720 gctgatccta actgaggaaa cgcctcaggg ctcttttctt tttattttgt gcagttttcg 780 atcaccccgc aaaagtcccg atcaggcggc ggctcaattc gtcgatcagg tgctgactgc 840 ctgactagca aacaaccgca atcatccctg gggatcagcg acgagcgccg gcagcagcat 900 atttgcatcc atcaccggga agaaaggcca gtttgcgttt agccgtgagg agtgaaccaa 960 ttcgccagct cgcggacgag aagaccgtcg agggtgcagt tatgcaccgt ggaaggctgc 1020 tctggtcccc gccaagcaag cagagcacgg aacgggcacg ttcaccaggc cgctccggcc 1080 cctgagaagc atgcacggaa aaggagagac acgtcggaag cagtgtgcgt gagtagccta 1140 agcggtttta cctaaatgta gataagaaga agtgccatga tttggaagat atctaagaat 1200 cctcagtagg gtagctttaa ggagcaggaa tgtaaaataa atgtttgcat acagattttg 1260 gttttgaggc gtttctgatg ttgctagccg gtccgatcaa agcccctttt attccggcag 1320 agatcgtttc acaaca 1336 // ID BEL-132_AA-LTR repbase; DNA; INV; 700 BP. XX AC supercont1.254; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-132_AA_; KW BEL-132_AA-I; BEL-132_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-700 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.254; Positions 994690 993991. XX SQ Sequence 700 BP; 250 A; 112 C; 143 G; 195 T; 0 other; tgttgcggaa agcgttacga ccgttgcagc tgcgggagaa attctccgca gggatgctag 60 cagggacgga tctaacagtc gagctaggtt cattggtatg ggtgaagaag atgacagacg 120 tcattaatgg aatatttgga aaaagtgaaa tgtgattaga tgccagggag tttctgatta 180 gaatgaggca gttattattg aaatctacgt ctaaaataaa acatcagttc gaaacatgaa 240 ttgatgaata ttgtgagtta acatgcaaat aatttaacat acaatgaatt tacttaaaat 300 tatacatgcg aatcaaaact tataggttag agacaataac aacttacaac tatctttggt 360 gaattagcat aaatttgatc ctaaaacaat tgattacgaa cataaacagt gagtagaact 420 tatttcacaa acatatactt atgactaaga cgtaaattaa cgaaaactag gtcgacagac 480 tacacaagac actactggac ggctacaagg acgttgagga cgtagaccgt acacaagaga 540 gtcacagagt tgtgagtaat tatagggtaa atgtattgaa tttgaacaac attataatta 600 aatttaactt ttagctttag agccttttcc cctactgaat aaatggattt gctacaaaaa 660 tctgtgacct gttccttttg cgtcctgcgc cgcgccaaca 700 // ID CR1-52_BF repbase; DNA; INV; 1596 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-52_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-52_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1596 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1596 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1623-1623 (2009). XX DR [2] (Consensus) XX SQ Sequence 1596 BP; 522 A; 442 C; 335 G; 297 T; 0 other; gtattttggc cagagcaagt acaaaacgtt ttagctaagc tcgacatcaa caaggccaag 60 gcccggactc aatacctgcc agggtcttga aacatgctgc aaccgagcta tctaaacccc 120 tagcaaagct ctttcagctc ttgctagaca aacactatat gccgaagcag tggaaagttg 180 cccacgtgat accagttcat aaaaagaaca acaaacacga ccctagcaac taccgcccaa 240 tatcacttct cagcattata agcaaagtga tggaatcaat cataaacaaa gccctgtgga 300 aacacatctc cgagaaccat ctcataagtg acaagcagtt cggattcagg gctgggcact 360 ccacagcaga tgcactgacg tacgtcattc aacacctgca tgacgccaag gacaaaagac 420 aggaaagcag actgatatgc ctggacataa gcagagcgtt cgaccgcgta tggcacaagg 480 ggttgattgc caagctgaaa gccatcggcg tcgatgggaa ggtgctaaac tggattgaga 540 actaccttgc agacagagaa ctgaaagcca ttatcagcgg aaaaacatcc accgccaggg 600 tcattaacgc gggagtcccc caaggctcaa tccttggccc cttgctcttc atcatttata 660 tcaatgattt gacagaaaag ctggaaaaca catcgatgct gtatgccgat gactcatcaa 720 tcatgagcat tataaaagaa agacgtgaaa gagctagggc agcccagtca ctgaactcag 780 acctggaaga aatacaaaat tgggcaaaca gctggaacgt catattcgga gccacaaagt 840 gtaagtgcat gacaattagc aacttgaagg acgcagtagg caaccaccca gaactcagct 900 tcatggacac tacactcgca gaggtagatg aagttgatct cttaggcgca acaatccgaa 960 aagacctgac atggaatcac cacataaaga agatggccgc ggacgcagga aaacgcctgg 1020 gtctactgcg aagagccgcc ccctacctca accctcaaca gagagccatc atatacaaga 1080 gtatggtgcg atccagcatg gaatatgcct caaccgtctg gatgggagct tgctccacct 1140 cactcgacct actcaacgcc atccaaagaa gggccaccaa gatcatagac ctccccgaaa 1200 ctgacctgta caaatatcag attcagcccc tggaacaacg cagaaacgta ggggctctca 1260 ccctcctgca ccgcatgtac aaccacgacg cacctgcacc actcaacagt ctacttcctg 1320 aaccctatac acatcgacgt gaaacccgcc tgtctacatc tcaacactgc aacgccttgg 1380 agcctgtgaa atcagctacc acatcgcacc gccgtacctt cctcccggcc acaacaaaac 1440 tctggaactt actgcctcag cacatagtga cacttaggga caaacagaag tttaagacaa 1500 gtatcaacaa ccacttcagc gacctacgtc agcgccctta aggacagctg ctaatgctga 1560 agatcggagt atgactacat agataaaaaa aaaaaa 1596 // ID TBE2 repbase; DNA; INV; 230 BP. XX AC U85403; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Oxytricha fallax transposon TBE2. XX KW OFU85403; TBE2. XX OS Oxytricha fallax OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Spirotrichea; Stichotrichia; Sporadotrichida; Oxytrichidae; OC Oxytrichinae; Oxytricha. XX RN [1] RP 1-230 RA Herrick G., Cartinhour S., Dawson D., Ang D., Sheets R., Lee A. RA and Williams K.; RT "Mobile elements bounded by C4A4 telomeric repeats in Oxytricha RT fallax."; RL Cell 43 (3 Pt 2), 759-768 (1985). XX RN [2] RP 1-230 RA Williams K., Doak G.T. and Herrick G.; RT "Developmental precise excision of Oxytricha trifallax RT telomere-bearing elements and formation of circles closed by a RT copy of the flanking target duplication."; RL EMBO J 12(12), 4593-4601 (1993). XX RN [3] RP 1-230 RA Doak G.T., Witherspoon J.D., Doerder P.F., Williams K. RA and Herrick D.G.; RT "Conserved features of ciliate TBE1 transposons."; RL Unpublished (1997). XX RN [4] RP 1-230 RA Doak G.T., Witherspoon J.D., Doerder P.F., Williams K. RA and Herrick G.; RT "TBE2."; RL Direct Submission to Genbank (14-JAN-1997)Oncological Sciences, RL University of Utah, Rm5C334 School of Medicine, 50 N Medical RL Drive, Salt Lake City, UT 84132, USA. XX DR GenBank; U85403; Positions 94 323. XX SQ Sequence 230 BP; 70 A; 28 C; 42 G; 90 T; 0 other; caaaacccca aaacccctac agtagtttga ttgagttttt gattgatgaa agtagactat 60 tttcgcatgc tttattaggg ttttaatagg gttatgtagg ggtttaatgt ttaaatatta 120 gtaatttaag tgagtataac aagcgagttt ttaaaatatt tcacacaaac ataagcttag 180 agtcatgtgg cagttctctg aattttatct tccttagttt atgtttgatt 230 // ID BEL-225_AA-LTR repbase; DNA; INV; 571 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-225_AA_; KW BEL-225_AA-I; BEL-225_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-571 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 908-908 (2011). XX DR [1] (Consensus) XX SQ Sequence 571 BP; 225 A; 88 C; 91 G; 167 T; 0 other; tgtacagaaa accacagggc aatcatttaa cggtctagtc tgtattacag atcgttgatt 60 aaggtgacag tggataagga ataggagaga taaaaacaaa aacaaggata gagtatttca 120 tgcatacgat cgctaaagta ttcggaaatt cccaaagcac gctaactaat tgaaaattat 180 tatctaaaat tgaaatttaa agttttctta agtcctaaag tgaaattgtg gtaaactggc 240 tattgaatta tatatctaaa ttggaaataa catgctatct atctacttat catagggact 300 aggttttggt taaaatccag tgaattcacc tagaagatta aaactaaagt taaaacttgg 360 tctatacata atctacgtaa gtaaacataa attatacatc taaacaacat ttgtaaaatc 420 ctaaatacga attatgtaca gactgaacag gtatctagca agccctgctg accacttcgt 480 agcacgaggt cctagagtca aactatctat tgtaagtgaa actcattaaa cttattgtga 540 aacactaata cgaacaaaat atttaattac a 571 // ID Gypsy-28_AA-LTR repbase; DNA; INV; 192 BP. XX AC supercont1.6; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_AA_; KW Gypsy-28_AA-I; Gypsy-28_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.6; Positions 749089 749280. XX SQ Sequence 192 BP; 63 A; 32 C; 34 G; 63 T; 0 other; tgtagtgttt taaactaaac taatgtaata cctccttaat aatgatcttg tcgtaaagca 60 tccattttat atactatgga tacatgttag atgtatgggg gagagtatac tcgaggagac 120 aataaagggc attcgtattc taactagtaa cctgacttca cgtgttttac tccagcttac 180 gaaaatacta ca 192 // ID PENELA_Smed repbase; DNA; INV; 1962 BP. XX AC . XX DT 10-NOV-2007 (Rel. 12.11, Created) DT 10-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE Penelope-type element: consensus sequence. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW PENELA_Smed. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1962 RA Jurka J.; RT "Penela1_Smed: Consensus sequence of abundant Penelope-type RT family from Schmidtea mediterranea."; RL Repbase Reports 7(11), 1177-1177 (2007). XX DR [1] (Consensus) XX CC Individual elements are >7% divergent from consensus. XX FH Key Location/Qualifiers FT CDS join(238..954,958..1806) FT /product="PENELA_Smed_1p" FT /translation="MDNLRKNFKYPPQKDEEILNKLVFRLMKDKHLDCDLG FT KQLYAVGSQPAKLYGLPKTHKKDVPMRPVLSMIGSAQYKVAKVLDSLLKPL FT ISKEYECRDSFEFVDFITNQAVDSSDYMVSFDVTSLFTNVPLNETIDLCCD FT LWREKMLDYQTLDEVAFRELLRFATSNVAFLFNDKWYQQVDGVAMGSPLAP FT TLASIFLSKLEKQISEFPFVKPKIYKRYVDDIFLVFSSSDDVVPFFKIIMG FT FILNIKFTVEEEVNSFLPFLDLLVEKTAKGFETEIYRKPTDTGLYNTLNSF FT CDFKYNRNMIKGLIFRSWSLSSTFEKSNKSIQKLVSLLNKNGYSKAFVERL FT TKETIDRLMIAPDENNGNHSPPNDGEPIKSHVFVIPYSEGFRNFKLSVTKS FT LDNSNLFKIVSKSLKSINMFSNKSQTPIGLVSDLVYKFSCNGCNATYIGET FT SRHLRTRVQEHSRLRGVSNIIEHRKMCKGTTDLKDFKILYKNFNNKWERIL FT CEALVIKSQSPKINVQSATNLLRLFN" XX SQ Sequence 1962 BP; 658 A; 266 C; 338 G; 698 T; 2 other; ttattttggt tacttgacaa attcttcata tagcaaattt attaatatag ataaagttaa 60 gtacggctta caatcaagtg ctgttgagta tcttaagaaa aataaattac aaaatattag 120 tgattttgca gttttaaaga acctttctaa aaaagctgtg aaagtttgta attttgatac 180 gaggttgatt attttgataa attgtataag gtacttgaag gtccacaatt cgttaagatg 240 gataatttac gcaaaaactt caaatatcct cctcaaaaag atgaagaaat tcttaataaa 300 cttgttttcc gtttgatgaa agataaacac ttagattgtg atcttggtaa acaactatat 360 gcggttgggt ctcaacctgc aaaattatat ggtytaccaa aaactcacaa aaaggatgtt 420 ccaatgcgcc cggtgctttc catgataggt tcagcacagt ataaggttgc taaggttctt 480 gattcattgt taaaaccttt gatttctaag gaatatgagt gtcgagattc atttgaattt 540 gttgatttta taactaacca agctgttgat tcttctgatt atatggtgtc tttcgatgta 600 actagtttgt ttacaaatgt yccattaaat gagactattg atctttgttg tgatttgtgg 660 agagagaaga tgttggatta tcagacatta gacgaagttg catttcgtga attgttgagg 720 tttgccactt ctaatgttgc ttttcttttt aatgataagt ggtatcaaca agttgatgga 780 gtagctatgg gatcaccttt agctccaact cttgcgtcaa tctttctttc aaaattagag 840 aaacagattt cggaatttcc ttttgttaag cctaaaatat ataaacgtta cgttgatgat 900 atctttttag tattttctag ttctgatgat gttgttcctt tttttaaaat tatatgaatg 960 ggcttcattc tgaatattaa gttcacagta gaagaagaag tcaattcatt tcttccattt 1020 ttagatttat tagtcgaaaa aacggctaag ggttttgaaa ccgaaatata tagaaagcca 1080 acagatacag gcctatacaa tactttgaat agcttttgtg actttaaata taatagaaat 1140 atgattaagg gtttgatttt tagatcttgg tcattatctt cgacgtttga aaagtctaac 1200 aaaagtatac aaaagctggt ttctcttttg aataaaaatg gatattctaa ggcgtttgta 1260 gaaagactga ctaaggaaac cattgatcgc cttatgattg cgcctgatga aaacaatggc 1320 aaccatagtc cacctaatga tggtgaaccc ataaaatcgc atgtttttgt aattccatat 1380 tcggaaggtt ttagaaattt taaattgtct gtcacaaaat cattagacaa ttcaaatttg 1440 tttaaaatag tatcaaaatc gttaaaatca ataaatatgt ttagtaataa atctcaaacc 1500 ccaataggtt tagtttctga tctagtgtat aaattttcat gtaatgggtg caatgccaca 1560 tacattggag agacttcccg tcatcttcgc acacgggttc aggaacatag ccgattgaga 1620 ggtgtgagta atattattga gcatagaaaa atgtgcaaag gcacaactga tttaaaagat 1680 tttaaaattt tatataaaaa cttcaataat aagtgggaaa gaattttatg tgaggctttg 1740 gtgattaaat cacaatcccc taaaataaac gttcaatcag caacaaattt attgagatta 1800 tttaattaaa tcattaaatc gtaactatta catcattatt gtttccgtat ataacattta 1860 acaaagatta tgatggcaga ctgaagatga ggataatttc ctcgaaatat atttctttta 1920 tttggcgttt tcgagttttt attaataatg ggccatcatt tt 1962 // ID P-34_HM repbase; DNA; INV; 4723 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-34_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4723 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 9(2), 446-446 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 170..2863 FT /product="P-34_HM_1p" FT /translation="MPGANCSIFGCGTSRRNKGVSLFKIPSSDDDFNKEWR FT IKVLNIITKDRVVDESLRSQIKKKLLFICEKHYEANMLIINPNKTTLKPGS FT LPTLNLPVKTISNSSSCIFNPKARASASSISLKKSTFPSILDEAVHIKSYK FT SFKELVKRISLQKLDWDVNISDTFLKLSKMDDIHTVPKYEIYIDDSLSFTI FT RIFLWALPDNHEIYLHLKRSMFNTTVCNLISVLLSYIVCPGIPDKLISSHG FT YLLHVIPKNFDLFNSYSLPLNQTEHTRALSCEVVIKKNDICKGCQCLQSQN FT KLSVKKNSDKLLIPAKLRAPISKTAPERVKLTLQSYRLENKLLKDKIKEMQ FT QELNKSSHLVSDDLHKDFVSIISKNDSTVTPFMKLFWDEQQKYLNTSKQGI FT RYHPMIIRYCLGLVSKSPAVYEEIRFNEKTNSGFVILPSQRRLRDYKNYIR FT PQRGFNHEIVNELSLKVKDFSDQERFVTLLLDEMKIQENLVWDKYSGDLIG FT FVDLGDTNINYATLNKADEIATHVLVFLVRGIVNPFKYSFANFATTNIQAI FT QLFPLFWKAVSILELSCNLKVIATTSDGASCNRKFFSMHFHMTEIQDNNKN FT VKFTYRIKNKFADDDRYIYFIADPPHLIKTARNCLFHSASGKGRYMWNTDN FT HILWNHISDLFYEDLNCGLHTCPNLSLEHIRLSPFSKMNVRLAAQVLSSSV FT SIALKTFGPAEASRTAEYCQMFNNFFDCVNVKNIVDCTRKQNPFSEPFYLI FT SDERLTWLTEVFLKYFEDWKLSIDQRPGKFTKADRXNMFLSWQTYEGINIT FT VHSVVDLVKYLLNSGVQYVLTERLCQDSLENYFGQQRAIGRRKDNPSLHDI FT GYNDNTIRNQCTFXPIVGTNSLXNDPEPKTFIKEKLPRRNWNKK*" XX SQ Sequence 4723 BP; 1635 A; 692 C; 692 G; 1676 T; 28 other; catagtcata aaatacatag tcyggccagt taaaragcgc tattgtttat cttwttgtta 60 gcattttttt agaacccagt ctaccaatga gtgcacactt tttgtaaaca aagacgcgtt 120 aaaaayaata tttacaaaaa agttatttat ttaaccttgt tatttgcaaa tgcctggtgc 180 taattgttcc atttttggct gtggtacttc tcgaagaaat aaaggtgttt cactttttaa 240 aataccatcc tccgatgatg atttcaacaa agaatggaga ataaaagttt tgaatattat 300 aactaaagat agagtagttg atgaaagcct taggtcccaa ataaaaaaaa agttattgtt 360 tatttgtgaa aagcattacg aggctaacat gttaattatt aacccaaata aaactacttt 420 aaaaccaggt tcactaccta cattaaatct tccagttaaa acaatttcaa attcttcttc 480 atgtatattt aatcctaaag cacgagcatc agcttcttct atatccctaa agaaatcaac 540 ttttccttct attttagatg aagcwgtyca tattaaatca tataaatctt ttaaggagtt 600 ggtcaaacgt attagtctac aaaaacttga ttgggatgta aatattagtg acactttttt 660 aaaattgagt aagatggacg atattcacac tgttcctaag tacgaaattt atattgatga 720 tagcctctcg tttactattc gtatattttt atgggctctg ccagataatc atgaaatata 780 cttgcatcta aaaagatcaa tgtttaatac aactgtttgt aaccttattt cagttttgct 840 gtcatatata gtatgtccag gtatcccaga taaattaatt agctcccacg gctatcttct 900 tcatgttata cctaaaaatt ttgatttatt taatagttat tctttacctt taaaccaaac 960 agaacatacc cgagctcttt catgtgaagt tgttattaaa aaaaatgata tttgtaaagg 1020 atgccaatgt cttcagtctc aaaataaatt atctgttaaa aaaaatagtg ataaattact 1080 tattccagct aaacttagag ctccaatttc taaaactgca ccagaacgtg taaagctaac 1140 cctacaaagt taccgtttag aaaataaatt gcttaaagat aaaatcaaag aaatgcaaca 1200 ggagttaaat aagtcatcac atcttgtcag tgatgaccta cacaaagact ttgtttcaat 1260 catatcaaaa aatgatagta cagtcactcc atttatgaag ttattttggg atgaacaaca 1320 aaaatactta aatacttcaa aacagggaat tcgttatcat ccaatgatta ttcgctattg 1380 tttaggtctg gtttcaaagt cgcctgctgt ttatgaggaa attcgtttca atgaaaaaac 1440 aaattctgga tttgtcattt taccaagtca aaggcgtttg agagactaca aaaactatat 1500 ccgaccacaa agaggtttta accatgaaat tgtaaatgaa ctgagtttaa aggtaaagga 1560 tttttctgat caagaacggt ttgttacact tttattagat gaaatgaaaa ttcaagaaaa 1620 tttggtatgg gataaatatt ctggtgatct tataggattt gttgaccttg gtgatactaa 1680 tattaactat gcaactctga ataaggctga tgaaattgca acacatgttc ttgtttttct 1740 tgtcagaggt attgtaaacc cttttaaata tagttttgca aactttgcta caacaaatat 1800 tcaagctatt cagttattcc ccttattttg gaaagctgtc agtattttag agctttcatg 1860 caatctaaaa gtaattgcta ctacaagcga tggtgcctca tgtaatcgta aatttttttc 1920 tatgcatttt catatgacgg aaatacaaga taataataaa aatgttaaat ttacttatcg 1980 catcaaaaat aaatttgctg atgatgatcg atatatatat ttcatagctg atcctccaca 2040 tttgataaaa actgctcgaa attgcctttt tcattctgct tcaggtaaag ggagrtatat 2100 gtggaatact gataatcaca ttttgtggaa tcacatatca gatttatttt atgaagatct 2160 aaactgtggc cttcacactt gtccgaatct ttccttagaa catattaggt taagcccatt 2220 ttccaaaatg aatgtaagat tagcagctca agtactcagt tcatctgtta gcattgcttt 2280 gaagacattt ggtccagctg aagcctctag aactgctgag tattgtcaaa tgttcaataa 2340 tttttttgat tgtgtcaatg tgaaaaatat tgtagattgc actagaaagc aaaatccatt 2400 ttctgagcca ttttatttga taagtgatga aagactaaca tggttgactg aagttttttt 2460 aaaatatttt gaagattgga aattgtctat tgatcaaaga cctggaaaat tcactaaggc 2520 tgatcgargg aatatgtttt tatcttggca aacatatgaa ggtatcaata tcactgtcca 2580 ttcagttgtt gatttagtta aatacctttt aaatagtggt gtccaatatg ttttaactga 2640 acggctttgt caagattcat tagaaaacta ttttggccaa cagcgagcya taggaagaag 2700 aaaagataac ccatctttac atgatattgg ctataatgac aatacaataa gaaaccaatg 2760 taccttcmaa ccaatagtag gaactaatag tcttrataat gatcctgaac caaagacttt 2820 tatcaaagaa aaattaccac gccgaaattg gaataaaaaa taagctcatc aaaaaaagga 2880 aattaaaaac ggctattttc atagccacaa acattataaa ggctttattt ggtataaaat 2940 aatttatttt aattgttgtt gtttttaaca gggtgttrat agggagggat ctctcataaa 3000 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 3060 tatatatata tatatatata tatatatata tatatwactc aatctatata ttaggtcata 3120 tcatttgtgc gctttattta tatatatgtg cgctttaatt ttttagaaaa aaaatgaaaa 3180 aatgaagaat aatgaaagaa aagattgctg tcaataatga ttacattatt tgatattatt 3240 acttaaaata aaaacttaca taatttcgtt tggttttcac ttaattactt attgcatctc 3300 ttcattgtgc tgcaatatag ttcatatgct agatatgttt ttaataaaat ttgtggctat 3360 gaaaataacc ttttttaaat gtctaatctt tattcagtag agtttaaaga tacttttttg 3420 atttcttttc ttaacgaatt tgttcttttc ttcttagtct tgattttatg caagtttaca 3480 agctccttgg aataggaaaa cgagcgaaca cgaaaataaa gtgttaaaag gtggtctaga 3540 agattgagag acatttctgt agatacctta gtagaactag cataacaaat ttttgaaaag 3600 taagataaaa cctcaacatc ttgaagcaat tctattgtca tcttgctgca gtcaatattt 3660 ttactcaaaa tttttgttga atttctgaat gctatttctg ctattctaaa tagtgaaaaa 3720 gcatcagatg atatagacca aagaccacct ctatttttaa aattaataaa ttttctatta 3780 gtttcatgat actcttcaaa ttcattttct acaatttttc cagccaacaa aattaaaagt 3840 gtttcttgac caaatctact ttgccaaatt tttgaataac gtatccttcg ataaaaggta 3900 ctaaatacat accctcccaa atattcaatt aacgtttttt ctttttctgt aaactgaatt 3960 gcattgtcaa aaactaaaat ygaggytgcg tttttagact tcgttaagtg agccaaaata 4020 tgattagcaa ggtcatatcc aacaagagtt gaggttttrt ctgtaagatt tttcaaaaat 4080 gmtttytgac cagcaacttt gtaaaacttt ggataaaaaa cttctgcgtc accattaaaa 4140 ttttcaataa catcttttat yaagtcgaat gaatgytttg catcatcttc tgagaatatg 4200 tattttgaaa gttcatctct aatayrtgga gggtagcaaa ratcgttttt aagaatgtta 4260 atgcttttta taacaaaakt tttgaactca tctacagaac amgtttgttt aacattagtt 4320 gttggattta ttggytgtgt tagagtgtca ctatttgagt gttgagttgt tatgtgacgc 4380 ttgagccctt tctctgattt acatttttta acacactgct ggcaattaaa actgattaaa 4440 gaatttccat tttcaatttc actaagaaga gcatcaacat cactaagraa cttatcgttt 4500 tcgaaaaaaa tatctgactc taratctaat atgagatgat caatggattg ttcgtctgct 4560 gatttcgcca tttttttaga tttttgtgca cttattagta gcctgggcac gataaataaa 4620 cagttttcaa cataaattca aaattttaac tttgattgtt tgaaaacaat agaatcttgc 4680 gaaaacggcc caactggcca gactatctgt tattatgact atg 4723 // ID BEL-80_AA-LTR repbase; DNA; INV; 403 BP. XX AC supercont1.324; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-80_AA_; KW BEL-80_AA-I; BEL-80_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-403 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.324; Positions 202287 201885. XX SQ Sequence 403 BP; 103 A; 87 C; 99 G; 114 T; 0 other; tgttcgagcc gttcgccccg aatgaatttt ttgtcagtgc atcgcccagc cgcagcctgc 60 tgagttgcgt gatgcacgaa aggacatcgt gtgatgcaac cagccgcagc cgcacgaaac 120 gagcgcgcaa aaaatgagcg cgtaaatttt gaggatgttt tctacccgcg attggaagac 180 atctttgttc gccgtgcttc caacgtgtta agacgtgtag tcgcaattaa agcaaagtga 240 attctcttaa tattaaagtt atgtaaatag gtgtaaatag aagtttgaac tgttaagcaa 300 gtgttttcat ccggtatcgg ttccctggtt aagtggtcgt ctcggagatt cgtcggtccg 360 cctatgtcga agttttttaa gaaatacagt ccactctcgt aca 403 // ID hAT-6_SM repbase; DNA; INV; 2615 BP. XX AC . XX DT 08-OCT-2007 (Rel. 12.1, Created) DT 00-0000 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-6_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2615 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1034-1034 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 561..2219 FT /product="hAT-6_SM_1p" FT /translation="MNSKKRKVSEENRKYSDHWEENYFFIMQNSKLTCLIC FT RDTIAVFKEYNVKRHHETKHDVYVKSNKELRKIKLAELKSELKSQQKMFSS FT SLQLASNIVKASYSVAMLIAKKMKPFSDGEFVKECLAAVIEDVMPDKHKMI FT SSISLSRQTICRRINDISDEIVVTLRERIQQFKAYSLAFDESTDISDTSQL FT AVFIRGVNDSFGVTQEMLNLLNLKDTTRGEDVFHAIKKCLSCNSLHLEKLS FT GLTTDGAPAMIGKNKGTVKLFMDELEATSVKPSEIFVIHCLIHQQNLCAQV FT LSMNHVMDVVIKTVNYIRSHALQHRKFKAYLDELSSEYGDIVYFTKVRWLS FT RGTCLKRFFELRVEIENFMAEERNPVSNLNDEMWLLDLCFLVDITEKINQL FT NKELQGQDNLIIDACNHIKAFQTKLILFESQLRNNNLHHFPLLKEFNSSQT FT INFAKYADEIRKLSAEFTRRFSELTKYDKTFEIFSSPFQVDVFTVPEALQM FT EIINLQCNNELKQVHKTTSKIEFYNTYITAERYPNLRLFAQKIVSAFGSTY FT VCEAFF" XX SQ Sequence 2615 BP; 989 A; 366 C; 429 G; 831 T; 0 other; caggcgtggc caagccgagg cccgcgggcc gcatgcggcc ctttatgtat attatgcggc 60 ccgtggcatg attattattc gctttaaatt ttttttacag aatctataaa aaattaatat 120 attacaaaat ctactaaaat tattaaatac agactttttt gaaaaagtat tgactacaaa 180 attcaatatt aaaaagtttt tattatgttt tttagttggc agaaaaatcc gttttccgtg 240 catatatatt tattcataag taaaaaattt aaatgagtaa ttcaaaaata tttatatagt 300 aggctttgcc agaaaaatat aactaaagaa gccgacagtt attgaaaaac ttttttagtt 360 tggctttgtt ttagttagga tatgaagttg cgtattttta agtgataaaa aatatcaaat 420 tatataacac aaaaaaagta atataaaaaa tctaattaat tatatttcta taccagaaaa 480 attaaataaa taagtaagta acagtaaatg cacttatttt gaataattat acataacttt 540 aattatattt ggcctttaga atgaattcca aaaaaagaaa agtttcagaa gaaaacagaa 600 aatacagcga tcattgggaa gaaaattact tctttataat gcagaactca aaattaacat 660 gtttgatttg ccgagataca atagctgttt ttaaggaata caatgttaaa agacatcatg 720 aaacgaaaca tgacgtgtat gtgaaatcta ataaagaatt gagaaaaatt aagttagccg 780 agctaaaatc cgaattaaaa tctcagcaga aaatgtttag ttcttcatta caactagctt 840 ctaatattgt taaagcaagc tactctgtcg caatgttaat cgctaagaaa atgaaaccat 900 tttcggacgg agaatttgta aaagaatgtt tggcagctgt tatagaagat gttatgccag 960 ataaacataa aatgatttca agcattagtt tgtcgcgtca aactatttgt cgtcgcatta 1020 atgatatttc agatgaaata gtggtgactt tacgagagcg gattcaacaa ttcaaagcat 1080 attctctagc tttcgacgaa agtacagata tttcagatac atcacaactt gctgtattta 1140 ttcgtggagt taacgattct ttcggagtga cacaagaaat gttaaatctt ttaaatttaa 1200 aagacaccac tagaggagag gatgtttttc acgctataaa aaaatgttta tcctgtaatt 1260 ctttacattt agagaagctt tctggactaa caacggatgg tgcgcctgct atgattggta 1320 aaaataaagg aactgtaaag ttgtttatgg atgaacttga agctacatct gttaaaccca 1380 gtgaaatatt tgtaatacac tgtttgatac atcagcaaaa tttatgtgcc caggtattgt 1440 ccatgaacca tgtcatggat gtggtaataa aaacagttaa ctatattcga tcacatgcgc 1500 ttcagcatcg caagttcaaa gcataccttg acgaattaag ttcggaatac ggcgatattg 1560 tctattttac caaagttaga tggttgagtc gcggaacatg tttaaaaaga ttttttgaat 1620 tacgggtgga aatagaaaat tttatggctg aagagagaaa tccagtttca aatttaaatg 1680 acgaaatgtg gttgttggac ctgtgtttct tagtagatat aaccgaaaaa attaaccagc 1740 taaacaagga gttacagggg caagataatt taattattga tgcttgtaat catataaaag 1800 cattccaaac caagctaatt ttatttgagt cccaacttag aaataacaat cttcaccatt 1860 ttccgttatt aaaagaattt aatagcagtc aaactattaa ttttgcaaag tacgctgatg 1920 aaattagaaa actttcagct gaatttactc gtcgtttctc agaacttaca aaatatgaca 1980 aaacgtttga gatattttct tcaccatttc aagttgacgt tttcactgtt ccagaagctt 2040 tacagatgga aataatcaat ttgcagtgta acaatgagtt gaagcaggtt cataaaacga 2100 catcaaaaat tgagttctac aacacgtata taacagcaga aaggtatcct aacttaagat 2160 tatttgctca aaaaattgta agcgccttcg gatctacata tgtttgtgaa gccttttttt 2220 aaaaaatgaa atttaataaa agtaaatttc gcgcttcatt aacagacgaa aaccttgaaa 2280 gccaattacg atgtgcaact accaaaattg atgtagattt aaaaacattg agcgatcgca 2340 aagaaaagca aatatcccat taataaatat tatacttaag tgtactgaaa taataaaaat 2400 aaatggttca ataaaaatgt tcttgaaaca ataaaaaaaa tctactgtaa atataaaaat 2460 aaatgttttt cttggaatag ttcaaataaa tctatgtact gaaataattc aaataaatgt 2520 ttcaataaaa attgggtttt catataatat atagcccgct gaaatttttg gataaaaagg 2580 ttggcccgca aagcgatttg ggttggccag gcctg 2615 // ID Sola3-3_BF repbase; DNA; INV; 6471 BP. XX AC ABEP01035506.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Branchiostoma floridae. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6471 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR EMBL/GenBank/DDBJ; ABEP01035506.1; Positions 24150 17680. XX CC The left portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1517..5050 FT /product="Sola3-3_BF_1p" FT /translation="MRFYVLAANMSCSFGSPDSECGLSRYYPEDISVIPLS FT ACNKDVTNYLATLGLSGRGHGQTDLSEQDLILNRSGRFGLSEEEKNNLTVC FT PKHRYDLTTRWTGGRRNTCAHPEHVGRKTALKEPRRVPMFLSTQMYRTYNT FT VVPVGSAMCNPCRKRAVKDAERDVERPTGSQDLSSDQPGPSSYPQSTMGPP FT SQPVGTAASTSTALKATSQVSSSESSSVISETPIVETTFESQSSSTGEGEA FT NISTWVDEQHQQEQNREALNKAVDQITEGRISPLRSSLNTPWEEVTERQRN FT YYLKKTREVMSATMSVIAPGQESDLWRSLVQAPPLYEEPPAKKKMWDQQMV FT QTLVKAYQEADTWQTRQQILSLFADDYTKDELQELIPGLSKWRIDQARAHA FT SESGPGKPVQQQPIYRTRLNPVKTDHFLAFLTQPHLLQDVAYGTKTMKLDS FT GEVITIPAAIRTIIPSRIIQQYSQYCTSANFQPLPDRTLFKIIEVCAARQQ FT KSLQGLDYISTEGAEAFDAMCSVVDSLVQNGASADWAKTVTHKLKEGKRYF FT KTDYKTHVKLEDRCGDHCIAFALSEKGKETLSVRCNHSHEIACERCSEIDS FT SVLEVQKMIEEEGLLTEEHKSRLCFTFEQSSSAIYSWKSHLLRAVNQDLAK FT QSALQQLDTKSALIVMDWAMKFLPLKYREQMTDFFGKRGRSWHVSAVITKG FT DEGKYDVECFVHIFNNCRQDWSAVCAILQSVLQTIKSENPSLTNVFLRSDN FT AGCYHCAPLILSLPAIGKKTGISVLRFDFSDPQAGKDICDRKIASMKTHIR FT RYVNEKHDVLTAEHMKEALESHGGVKGCRIAVAQTDEANKEQARQKWEGVK FT SWNNFMFETDEDIRVWKAFGVGVGEVVTFATVPQTHDLPTLTITAPFGPPQ FT SHTSHITTSAGGTKSALFCCPEPGCVAAFSSANKLEDHMDVGEHCKELERE FT STYDKIRKQWAQAVTGIAHGPGLASQSLGEGSGVGVGGSARVVNEGWAIRT FT VKGGQRMSDKMKTYLTDIFNQGAKSGQKADAVQVAHEMQHVRGADGKLLFQ FT PSEWRTAKQITSYFARLSAAQRQKESTESENTDMEAIDEESEWQELRKTVY FT TSIDYSHPVMFEDINLCELAKTKGLKSLKLKQLKCISDHFHLPITGSMARK FT VSFIQALENILSTCTCAS*" XX SQ Sequence 6471 BP; 1913 A; 1368 C; 1454 G; 1736 T; 0 other; taacgcgagc cgtgagaggg gggtctgtat ggggattttt cgggttttgg tctatatcaa 60 atcatgtaat gatgtgttgc tatggggata agttatggaa aattttgcat agctttgtga 120 aaaacatgca cagctgtcat taaatgcaac aataagttag caataataat ttgggactct 180 gcatggggga taaaaggtta cttgaaaatg aatttcacct cccatgcttt acacatatat 240 cgttgctttt tcatttggac gagtgcgtta cttactaaat aactgtgtac atttacgttg 300 aggtaaataa ttttaaaggg ctgtgtatgg ggtttatcag aactataata ctgtaaatgc 360 agaaatgttc gcggtggatt attgttcgcg gtttttgcag tgaccacttt accgcgaact 420 taaacaaccg cgaacatttt tctattatgg tattagactg cagtctatgg tattaccgcg 480 aaattaaatc ccctcgaact taaatgcatt tacagtatac aggtcacatg ggtgcagagc 540 tacctcatgt tgacaattca tttaaaaaga tcataacaaa cttttaagtt tgttatgatt 600 cttatctatg acattgtctg tgcacatttc ctgtcagatt aggattagca caggtatact 660 gccagtgtga tggcccagta ttgaataatt gtcttgcact cgttagtttt tgggtttgat 720 ccttgtctgt gtcataccaa agacattaaa aatgggaaaa acgttaaaag atgtaattat 780 tagatgtgta ctattacttt ggcgatattt aattgaaacc acttttttat gccttagagt 840 aaatcctttt atcttaggtt aggaactcat gggttgtcag caaagcattc cactcgtttg 900 ataaattgca aaaagtaatg tcaacccttc cacactcccc ctgtctggga ctgtgcagtg 960 tatgttgttc tttggtctgg gttagcagac attggggtca tgctaaagat aaggatctga 1020 caagctcttt ggctgccatg tggtttcttt aggggattct gagcacagct acatgattaa 1080 acgcaactct gatctaactc tgataaatgt gtcagatgga aagaatattt gttttgccca 1140 atttgcattg aatttttgga cgtatatatt tcttaatttc caccggttaa aatgatgtcc 1200 gaagattact ccctacttgc aaacatggca ataccccaaa gatacccttc gacagggtgt 1260 gggccatgca ctgaagcttg aaaatagtgt agctgtcctc taggtaagtt tcttagattt 1320 ttgtcatctt aattttgaac gttttctgtg tgatgatcag aatagcttaa aaggaatcaa 1380 ggtgctatta gaactgtgac atttcaaaat ggccttacag atcttgcatc ccattgaaaa 1440 cttgtattta ccattggtgc cctgttaaca gcttacatgc ataaaatctt attctattct 1500 tttctattct aaaagtatgc ggttctatgt tcttgcagcc aacatgagtt gttcatttgg 1560 aagcccagat tcagagtgtg gtctgtcacg ttactaccct gaagacatat cggtcatacc 1620 actgtctgct tgtaacaagg atgtaacaaa ttacctagct acattaggtc tcagtggacg 1680 tggccacggt cagacagacc tgagcgagca ggacctcatt ctgaatagat cagggcgatt 1740 tgggttgtct gaggaagaaa agaacaactt gacagtatgc ccaaagcatc ggtatgacct 1800 gacaacacgc tggaccggag gcaggagaaa cacatgtgct caccctgaac acgtaggcag 1860 gaaaactgcc ctgaaagaac caaggcgtgt gccaatgttt ctgtctacac agatgtacag 1920 gacatacaac acagtagtgc cagttggctc tgccatgtgc aatccatgca ggaagcgagc 1980 agttaaggat gcagaaaggg acgtggaaag acccactggt tcccaggatc tttcgtccga 2040 tcagcctgga ccaagctcat atccgcaaag taccatggga ccaccgtcac aaccagtagg 2100 gactgctgca tctacttcaa ctgctttaaa ggccacctcc caagtaagct ccagcgagtc 2160 gtcatcagtc atatcggaga ctcctatcgt tgaaacgacg tttgagtcac agagcagctc 2220 aacaggcgaa ggagaagcga acattagcac ttgggtcgat gaacaacatc aacaagagca 2280 gaacagggaa gccctgaaca aagctgtaga tcaaatcacc gaggggagaa tcagtcctct 2340 gcgatccagt cttaacactc catgggaaga agtaactgaa aggcagcgaa actactatct 2400 gaagaaaaca agagaagtta tgtcagcgac aatgagcgtc attgctccag gccaagaatc 2460 agatctgtgg cggagtcttg ttcaagcacc gccgctatat gaagaacctc ctgcaaaaaa 2520 gaagatgtgg gaccagcaaa tggtgcaaac cttggtgaag gcctaccaag aagcagacac 2580 gtggcagaca aggcagcaaa tcctatcgct gtttgctgac gattatacta aggacgagct 2640 tcaagaactc atacctggac tttccaagtg gcgcatcgat caggctaggg cgcatgcaag 2700 tgagtcaggc ccaggaaaac ctgtccagca acaacccata taccggacca gactgaaccc 2760 ggtgaagact gaccattttc tggcttttct cacgcagccc catctcctgc aggatgttgc 2820 atatggaact aaaaccatga agctggactc aggagaagtg ataacaatac ctgccgctat 2880 ccgcacaatc atcccttccc gaatcatcca acagtactca caatactgca ccagcgccaa 2940 cttccaacct ctgccggaca gaaccctttt caagattata gaggtatgtg ctgcacggca 3000 gcagaagtcc ttgcagggtc tggattacat ctctactgag ggagccgagg catttgatgc 3060 aatgtgcagt gttgtagatt cccttgtcca gaatggcgcg tctgcagact gggcaaagac 3120 tgtcacgcac aagctaaagg aaggaaagcg ctacttcaaa acggactaca agacgcatgt 3180 aaagctggag gacagatgtg gtgaccactg cattgcattt gccctgagtg aaaagggcaa 3240 ggaaacattg tcagtgagat gcaaccacag ccatgagatc gcatgtgaac gatgtagtga 3300 gatagacagt agtgttcttg aagtgcaaaa gatgattgag gaagaaggcc tcctcacaga 3360 ggaacacaag agtcgactat gttttacctt tgagcagtca tcttccgcca tttacagctg 3420 gaagtcccat ttactgcgag cagttaacca ggatcttgcc aaacaatctg ctttgcagca 3480 gctagataca aagtcggctt tgatagttat ggactgggca atgaagttcc tgccattgaa 3540 gtacagggag cagatgacag acttctttgg aaagcgtgga aggagctggc atgtgtcggc 3600 agtaatcacg aagggagacg aaggcaagta tgacgtcgag tgctttgtcc acattttcaa 3660 caactgccga caggactggt cagctgtctg tgctattctg caaagtgtgc ttcagaccat 3720 caagtccgag aatccttcgc tgacaaatgt gtttttgaga tcggacaatg cagggtgtta 3780 tcactgtgct ccgctgatct tgtctctgcc agcgatcggt aagaaaactg gcatctcagt 3840 tctgcgcttc gacttctcag accctcaggc aggcaaggac atttgcgacc gcaagatagc 3900 atccatgaag actcacataa gaaggtatgt caatgaaaaa catgatgtct taacagctga 3960 acacatgaaa gaagcgctgg agtcccatgg aggcgtcaag ggctgtagaa ttgctgtggc 4020 gcaaacagat gaggcgaaca aagagcaagc aagacaaaaa tgggagggtg tcaaatcatg 4080 gaacaatttc atgtttgaaa ctgatgaaga catacgagtc tggaaagcct ttggcgttgg 4140 ggtcggagaa gtggttacct ttgcaacagt gccacagaca catgaccttc caaccctgac 4200 aatcacagca ccgtttggtc ccccacagtc acacacgagc cacataacaa catcagcagg 4260 tgggacaaag tcagcgttgt tctgctgccc tgagccagga tgtgtggcag ccttcagctc 4320 agcgaacaag ctagaagacc acatggatgt tggagaacac tgtaaagaac tggagcgaga 4380 gtctacatac gacaaaatac gcaagcagtg ggctcaagca gtgactggaa tagctcatgg 4440 ccctggccta gcctcacagt cacttggaga aggatccgga gttggcgtcg ggggatccgc 4500 ccgagttgtc aatgaaggct gggcaatccg gactgtgaaa ggcggccaac ggatgtcaga 4560 caagatgaag acctacctaa ctgacatttt caatcagggg gcgaagtctg gacagaaagc 4620 tgatgctgtg caagtggccc atgagatgca gcacgttagg ggagctgatg ggaagctgtt 4680 gttccagccc agcgagtggc gcacagccaa gcagattaca agctattttg caagactttc 4740 agcagcacaa agacagaaag aaagcacaga atcagagaac actgacatgg aagcgataga 4800 tgaggaaagc gagtggcaag agctgcgaaa gacagtgtat actagcattg actacagcca 4860 tcctgtcatg tttgaagaca ttaacttgtg cgaactggct aaaaccaaag gtctcaaatc 4920 gctaaagcta aagcaactaa agtgcatatc agatcatttc catctaccca ttacaggttc 4980 catggcgagg aaagtctctt tcatacaggc attagagaac atcctatcca catgcacatg 5040 tgcaagttga tttgttttta atttgacagc ataatgacag catcaaggtt tgtagcataa 5100 cttttacaac ctttaacaac atgtcaagtt caaggcctgg ccagaatctg tcaccaatta 5160 caatttaaaa tttacttcag atccagctta ttgtaccatt ccaacaaaac tgaacttgtg 5220 catttgtgaa aatgcgtttc aacagtgtgt tggtgtattt ttcagatagt gcgctccatt 5280 gtaatatggt tatacatgta tatgtttgtt gtttgtttgt gtctatattt agaggcttgt 5340 aggtttgtca taaacttaat gtttatggtc tgtccttatg gtgataaatg aatgaacagt 5400 catagcaaat cattggtata ctacagtttg tccttagtag tatacctgta cgtaaggggt 5460 aaagctaaag tacatgtaca aatgacttat gtacatgtat tcttttcaaa atagccttat 5520 aattgacctt tgttatggta ttatttctag aattttcaat atgtatgaag tagttctgca 5580 tccatgtgcc ctgtgtatta aacagttttg cattcttgga gtactggtaa cccccatgta 5640 cagccttttt tcattgtgta ggacatataa gttataagca ttgatttggc aaggcaccta 5700 cttaatgaca ttagaaagca aaaatgaaaa tgtgtaaaat cagcataatt cctaagtcaa 5760 gtaaccttat gtcccccatg cagagcccca aatttatatt gctaaattat tgtagcattt 5820 tatgatagtt atgtatggtt tccaaacact tacacaacaa ttttccataa cttatcccca 5880 tagcaacaca tcattacatg atttgatata gaccaaaacc cgaaaaatcc ccatacagac 5940 ccccctctca cggctcgcgt tatgtatggc ccaattagat gacgattgtt ttttaataag 6000 ttgatgttcc tctgtccaac aacattgaaa ccattcacca gacccactgg gtatagaaaa 6060 tattccgcct taaaaagaac taaaattcac agagatgaaa attctgcttt tagtggttct 6120 agtaaccccc aagtacaccc tctcaagttc aatgtttcat cttcaacttt tggtctgtga 6180 aactagggag tccatgctgt atgaaaatac aaagattata ttctcatcgt tatcacattt 6240 attagcgtag ccatcgcaag ttcggtaatt ttcagtcata aattgttgga ttttcgaggg 6300 tctctgaggg gggtcccata tctccacaat gaaaaatagc tggagggttt taactatatg 6360 gctgttcata gttccacata cctatcacac atacaaagta tgagccgatt tggccgaccc 6420 ccatcttccg gcccctgcct ggttttgcct ggttttctcg tgcaatgact c 6471 // ID BEL1-I_DV repbase; DNA; INV; 5575 BP. XX AC scaffold_13049; XX DT 15-OCT-2009 (Rel. 14.12, Created) DT 15-OCT-2009 (Rel. 14.12, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_DV; KW BEL1-LTR_DV; BEL1-I_DV. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-5575 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(12), 3092-3092 (2009). XX DR Genome; scaffold_13049; Positions 228922 223348. XX CC Positions [4613-5194] - Integrase core CC 'CTTTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 290..5575 FT /product="BEL1-I_DV_1p" FT /translation="MFDEGASANGNDEATSASQVAVGTQAAVFKIKIKNLT FT DRLNRLSSELDPARLRDVDDYELQDYISMASDLQAKFEIVCDGLLEVDHAS FT VDEDLQTSFESTIRQLRLSLQRERGNRSKVQQIPHCSTFNSAAADDSRSTF FT VVPNHSRLPQLKLPEFSGGYTEWADFSNLFTTVIDKDPYLTNIEKLQHLRS FT CLKGTALDTIRSLEISNANYAAALELLDKRFNNKRLIFQAHISEILGLRKV FT DKGATAQLREFSDKLNSHLRALKSMGSVEQIAGCVIVHTLLQKLDSVTQAS FT WEDDAPLDVIPSCERFTTFIERRCQRLENADHATAMYTPSSQVGQNNSSRR FT TFVVTRNGTSACVFCEVAGHSIYKCLQFANLSPLLRLHEAKRLALCLNCLQ FT RGHQLRVCGSSACRVCGSKHHSLLHLGNTSSHIAASSPNNAQDTETYSSSQ FT NTLAALLSSPLTTAQHLKHDVVLLATAVINVKNRAGSLVPCRALLDSGSQL FT HIITSRLAHQLQLRKFKSTAIVSGIGDAAFASDGFSVNINVKSRVSEYSTC FT IPALIAPSITDNQPGFTLDPASWNIPSNIQLADPEFFKSQQIDMLIGASLF FT FDLLCVGQIKLAAGLPILQKTRLGWVATGGASHAGKSSFMAMRSMENPDLL FT VDSHLQPNTQIDELIRRFWELECCTDPESLPNKEERDCEAHFQANFKRLST FT GDYSVRLPLRLGMYPLGDSYQQAVRRFLNLERKLDRNPLLKPQYAAFIKEY FT LDLGHMSLVTSAALGQCKYYLPHHCVLKEDSTTTKLRVVFDGSAVTTSGHS FT LNDALMAGPTIQPKLFSILMRFRTFAVALTGDICKMYRCVRVEPADSYFQC FT ILWRESQHQKIQIYKLDTVTYGTKPASFLSVRAMHQLAMDEQKTFPIGSDI FT VKRDFYVDDLISGGSCVQEAIEILKQTSGLLAKGNFRLRKWCSSDTSVLQN FT IPEEDRETLLKFDDGSDITKTLGLVWDPASDCFLFSFSPLRLPSRLTKRSI FT LSAIARFYDPLGLVGPVITKSKIFMQDLWREKLDWDESLPVHLSTAWVNFC FT ADFEYTQQFQYPRRALSSDSTVEIHGFCDASLSAYGACVYTVSKCNGNTSV FT RLLCSKSRVAPVKTITVPKLELCGAALLAQLLSEICQMKVFDCRYYCWSDS FT AVTLAWIRNDASKFNVFVANRVAAIQELTTGMEWHHIPTELNPADIISRGA FT LPSELFRSPLWAHGPSFLSKGKEEWPASCVPVESLPELRHKVLLGTAAQPD FT LSIGCKFINSFSKLQRVFAYVYKFVNRIRGAELTVDHLHHGTHWLLRSVQM FT ATLSDDYKALKEGRHVKPSSSMASLAPFLDDFGLLRIGGRLKNSSLDFSAR FT HPIILPRQHPVTRAIIVYFHKRNLHAGPRALLSSIRLQYWPIGGRKTVSSI FT VAKCIICFRAKPRLAEHIMADLPADRLNTSYPFMVTGVDYCGPFYYKNEVR FT NRPPVKCYISLFICFATKAVHLELVKDLSTTSFLNALKRFILTRSRPSRIW FT SDNATNFVGAKNELADLNRLFLRDEHVKAVNEFCLTESIEWLFIPPRSPHF FT GGLWEAAVKTAKHHFYRSVCSSILDFDSLRTLVCHITAIINSRPLLPLSEH FT PGDLDVLTPAHFLGTAPSSSYIEPDLRQLNFNRLNYFQRVTYLQQVFWARW FT REEYLTLLQQRSKWRTPQPGLSINDVVLVKDENLPPLKWPLARVQELISGS FT DGVSRVAVLQTATGVIRRAVRKLCLLPKQDDVESPCLPTGGE" XX SQ Sequence 5575 BP; 1384 A; 1401 C; 1288 G; 1502 T; 0 other; ttttggtgac cccgacgtga tttttctttc tttcattttt tctatttttt cttgtacttc 60 gttcttgtcg tgtcaacgga cagttatcag ttcgggcaaa gctgtgtggt gagccgtttg 120 cttgcgcatc tgcatacagt tggttgttat acataaacgc cgtagcaaat ccccgctaat 180 cgttgcgcgc tataacccgc tatacgctat ctcgctcgaa cgaacacttg cgcgatcaga 240 cttgtattgt agttcggctg tgtgaacctt gttgtgcgct acattcataa tgtttgatga 300 aggtgcaagt gcaaatggaa atgatgaggc gacatcagct agtcaagtgg cggttggaac 360 ccaggctgct gtttttaaaa taaagatcaa gaatttaact gatcgactca atagattgtc 420 ctcggaactt gatcccgctc gacttcgcga tgttgatgac tacgagctgc aagattacat 480 aagcatggca tctgacttgc aggcgaaatt tgagatagtc tgtgatggtt tgttggaggt 540 ggatcatgcc agcgttgatg aggatcttca gacaagtttt gagtcaacta ttaggcagct 600 acggctgtcc cttcaacgcg agcgcggaaa tcgaagcaag gttcagcaga ttccgcattg 660 ttccaccttc aattcagccg cagccgatga ctcgcgttct acctttgttg ttccaaacca 720 ctctcgattg cctcaactta aattgccgga gtttagtgga ggctacacag aatgggccga 780 tttctcgaac ctgttcacca cggtcattga caaggatccg tatttgacca acattgaaaa 840 actccagcat ctacggtcat gccttaaagg aacagcgctg gatacaattc gctcattgga 900 aatttcaaac gcaaattatg ctgccgcttt agaactgctt gataagcgtt ttaataacaa 960 gcgtcttatt tttcaggcac acatctctga aattttgggt ttgagaaagg tggacaaggg 1020 cgcgactgca cagctgcgcg aattttcaga taagctcaac tctcatctac gtgctttaaa 1080 atcgatgggc agtgtggaac agatcgccgg ttgcgtcata gtacatacgt tgctgcaaaa 1140 actagatagc gttacgcagg ctagctggga ggatgatgcg ccgttggacg tcataccatc 1200 atgcgagcgg tttacaacct tcatagagag gcgttgccaa aggctggaaa atgcggatca 1260 cgctacggca atgtacacgc ctagctccca ggtgggccag aacaacagta gtagaagaac 1320 gtttgtagtg actaggaatg gaacgagtgc ttgtgtgttt tgtgaagtcg caggccactc 1380 tatttataaa tgtttgcaat tcgcaaattt atcgcccttg ctgcgccttc acgaagccaa 1440 gcggcttgcg ctgtgcctaa actgcctgca aaggggacat cagctgagag tctgcggctc 1500 cagcgcttgc agagtttgtg gaagcaaaca tcatagcttg ttgcatcttg gcaacacaag 1560 cagtcacatc gctgcttcta gcccaaacaa tgctcaagat accgaaactt attcgtcatc 1620 ccaaaacacc ttggcggcac ttttatcttc gcctctcact accgcccagc atctcaagca 1680 cgatgtggtc ctgcttgcca ctgccgtcat caacgtgaaa aatcgcgctg gctccttggt 1740 gccttgccgt gcgttgctcg actctgggtc gcagttgcac atcatcacct ctcgtcttgc 1800 tcatcagctc cagctgcgca aattcaagtc aacagcaatc gtctctggca ttggtgatgc 1860 agcatttgcg tccgatgggt tttcggtcaa catcaatgtc aaatctcgag tgtcggagta 1920 ctccacatgc atcccggcct tgattgcacc atccatcacc gataatcagc ctggcttcac 1980 tcttgaccct gcatcatgga acattccatc aaatatacaa ctagctgatc ctgaattctt 2040 taaatctcag caaatcgaca tgttgattgg agccagtctg ttcttcgatc tgctatgcgt 2100 cggccagatt aaactagctg ctggactgcc gatattgcaa aagactcgcc ttggttgggt 2160 ggctacggga ggtgcctcac atgctggaaa atcatcattc atggccatga gatcaatgga 2220 gaatccagat ctgctggttg actcgcatct gcagcctaat acacagattg atgaattaat 2280 ccgtcgtttt tgggaattgg aatgctgcac cgaccctgag tccttaccaa acaaggagga 2340 acgcgactgt gaggcacact ttcaggccaa ttttaaacgt ctgtcgactg gcgactattc 2400 agttcgttta ccgctacgtc taggcatgta tccgctgggt gactcctatc aacaggcggt 2460 tcgtcgattc ctgaacttag aaagaaaact agaccgtaat ccactattga aaccccagta 2520 tgcagcattt atcaaggagt atcttgactt gggtcacatg tcacttgtta cctcagctgc 2580 actgggccaa tgcaagtact acctgccaca tcactgcgtg ctgaaggagg atagcactac 2640 aaccaagcta agggtcgtgt ttgatggctc agcagttact acctctggac actcgctgaa 2700 tgatgcactg atggcaggtc ctaccatcca accgaagctg ttttcgatac tgatgcggtt 2760 tcgtacattt gcagtcgccc tgacaggcga tatatgcaaa atgtatcgat gcgtacgagt 2820 cgaacccgca gacagttatt ttcaatgtat cttgtggcgt gagtctcagc atcagaagat 2880 acagatttac aaattagaca ccgtcaccta cggcacaaaa ccagcatcgt ttctctcagt 2940 gcgagctatg caccaactgg ccatggatga gcagaaaact ttccctattg gctctgacat 3000 tgttaaaaga gatttctacg tggatgacct catttctggt ggtagctgtg ttcaagaagc 3060 aattgagata ttgaaacaaa catctggact actcgccaag gggaacttta ggctgcgcaa 3120 atggtgttct agcgacacat ctgtactcca aaacataccg gaagaggata gagaaacgct 3180 gcttaagttt gacgatggca gtgacatcac gaaaacgtta ggcctcgttt gggatcccgc 3240 ttcagactgt ttccttttct ccttctctcc actgaggttg ccctccagac tgacaaaacg 3300 gtcaatactc tccgcaattg ctcgctttta cgaccccctt ggtcttgttg gtcccgtaat 3360 aacaaaatcg aaaattttta tgcaggatct ctggagagaa aagctggact gggatgagag 3420 cctgcccgta cacttaagca cagcctgggt caacttctgc gctgattttg agtatactca 3480 gcaattccag tatccccgtc gagcactctc atcagacagt acagtggaga ttcacggatt 3540 ttgtgatgcc agcctaagcg cttatggagc atgcgtctat acagtctcaa agtgcaatgg 3600 caacaccagc gtgcgtctct tatgctccaa atcgcgtgtc gcgcctgtga aaaccatcac 3660 ggtaccaaag ctggaactct gcggagctgc attactggct caacttctca gcgaaatatg 3720 ccaaatgaaa gtgttcgact gtcggtatta ctgttggtca gactctgccg tgactttggc 3780 ttggattcgc aatgatgctt cgaaattcaa tgttttcgtt gccaaccgag tcgcagccat 3840 ccaagagctc accactggaa tggaatggca tcatattccc accgaattaa acccggcgga 3900 tataatttca cgaggagcgc tgcctagtga gctcttccgg tctccattat gggcccatgg 3960 tccgagtttt ctcagtaagg gaaaggaaga gtggcctgcc tcctgtgtac ctgtcgaatc 4020 cttaccagaa cttcgacaca aggtactcct cggaactgca gcgcaaccgg atctctcgat 4080 tggttgcaag ttcatcaatt cgttttccaa gctgcagcgg gtctttgctt atgtttacaa 4140 atttgtaaat cgaatccgag gagctgaact aaccgttgac cacttgcatc atggcacaca 4200 ttggttgctg cggtcagtgc aaatggctac tctcagcgat gactacaagg cattgaagga 4260 aggaaggcat gtcaaaccgt ctagctcaat ggcctctctt gcgccattcc ttgacgactt 4320 cggcctactt cgcatcggtg gacggctgaa aaactcgtcg ttggacttct cagcgcgaca 4380 tccgattata ttgccccgtc agcatcctgt gacgcgagca atcatagttt actttcataa 4440 acgaaactta cacgctggac ctcgcgcttt attgtcttcg attcggctcc agtactggcc 4500 cattggcggt cgaaaaacag tctccagtat tgttgccaaa tgcataatct gttttcgtgc 4560 caagccacga ttggccgagc atataatggc agacctacca gctgatcgtc ttaatacatc 4620 gtatcccttt atggtcactg gcgttgatta ctgtggacca ttttactaca agaacgaagt 4680 gcgcaacagg ccaccagtga aatgctatat cagtctgttc atatgcttcg ctacaaaggc 4740 ggtgcactta gagcttgtaa aggacttgtc cacaacatcg tttctaaacg ccctaaaacg 4800 tttcatcctg actcgctctc gaccttcaag gatctggtct gacaatgcga caaacttcgt 4860 aggtgccaag aacgaactag cagatctgaa ccgcctgttc ctgagagacg agcatgtcaa 4920 ggccgttaac gagttttgcc taaccgaatc aattgaatgg ttgttcatcc ctcctcgctc 4980 tccgcacttt ggtggactat gggaagccgc tgtgaagact gctaagcacc acttttatcg 5040 atctgtttgc tcttccattt tggatttcga ttcgctgcgt acgctcgtct gccacatcac 5100 ggcaattatt aattctcgac cattacttcc actttcagag catccaggtg atcttgacgt 5160 cttgacaccc gcacacttcc tgggtacagc gccatcttca tcatacattg agcccgatct 5220 aaggcagctt aacttcaatc ggcttaatta ttttcagcgc gtcacatatc tccaacaagt 5280 attctgggcc cgttggcgcg aagagtattt gacgcttctt caacagcgct ccaaatggcg 5340 cactcctcag cctggactct ccattaacga tgttgtcctt gtaaaggatg agaatctacc 5400 gccgctgaag tggccactcg ccagagtgca agagctgatc tctggatctg atggggtatc 5460 cagagttgct gtgcttcaaa ctgcgactgg agtcatacgt agagctgtac gaaagctgtg 5520 tttgctgccc aaacaggatg atgttgaaag cccttgcctt ccaacggggg gagaa 5575 // ID CR1-5_NVi repbase; DNA; INV; 5946 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.04, Created) DT 20-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-5_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5946 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(4), 752-752 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 277..1509 FT /product="CR1-5_NVi_1p" FT /translation="MLSDVAKKTIALIKMKTSEEIREQLLKELHNKTAEKH FT NTTIMDEENEMENLVAENTLLKQLVAELQDKNQLQKEIIEMQKQKSTESQT FT NIKSYAEVIRDIKPKPKRVPSIMIKANDPNQTNTMELLTKCLVTEKNVQTK FT YIRKKNESEIEVSCMNSKSLETLENVLTSKLKNCQVKVEQQGNPKIKIVGI FT DNATNMDELDIESDINTRNFKNFNSSCKVLHMYTNKRNKTSTVLMECSPDI FT YKSIRENNNKVFVGHQCCKAYDLINITPCYNCGRYGHSASKCRNDPACIKC FT SDKHNVSVCNNNKIECVNCIFNNNKFKTSLPTDRLKCSILKKKIKKYINSI FT DYPIAPVLPAWDETSAIKQIHTIQTNRLLYQRSVAPGKNVGKFQRATQLQG FT TSQQNYPANTAAEDIQ*" FT CDS join(1509..3161,3101..4384,4374..5300) FT /product="CR1-5_NVi_2p" FT /translation="MEAQINLIEEIQNSLNEQERIYRDVETFNKDRSSNND FT LILYLNIRSLNANFEKLQILLKSLKIKPYVIVCTEVWKLTHYQYYRIKDYK FT LYYNHGDINKNDGVVVYVNENVKHTNDTIVIGKLKILNTIISLNNKKQLEI FT SSLYRSHGLSCIEFNQNLKIYLDQKKNVKNHIIIGDFNINIQDYNHISVEF FT LNNLLENGFLPGFTNTTRPTNITTNEGTCIDNIFIKTDFLNTKSLTLMTPI FT TDHYPLFLELKNRTLKQDNKQLEGDFYRYNYKKLNNTAKQINWTKFKNTKD FT PNENINEIIKEIQYCMEKSKYKKPNLHNSKKTPRKDWITKAIVKSCITKET FT LYNKLKQHPDNEELKENYKQFVKTLDKVIKAAKIDHEKKTIEKNSDDPRKL FT WACINSKIGKNKNKTETTISQLKTEDNTIITNRIEIANKMNEYYCKLGETL FT SNKIITPSNKKLELPKSNPKTIYIKPTNESEIIKIIDNMKLKSGGIDKISA FT RILKTIAAIIADPLAHVINQCIEMAIWPDMLKAAEVIPIYKAGKKMKWVTI FT DRYHSNTNIQSWEKNEMGNYRPISLISNIAKIFEKVIHRRIIDFVSQSKIL FT SKNQYGFMKNIGTKDALNMISKTIYENLDQSTPIAITFLDLAKAFDTVNHK FT ILLDKLYAYGMRGKGHQLITSYLNNRKQKVKIGSHTSDFENVNTGVPQGTI FT LGPLLFILYVNDLLMNMPENSIISYADDTAVIANGKTWSEVEVKMNKHLED FT ISTWLRLNKLTLNIQKTVYITFGNYCDSVPKKIEFKINHEILKRVNETKYL FT GLIFDSNMRWDKHIQYLINKTRYLIFIFSKISKFMDTSTLRIIYYAFFHSL FT INYGIVAWGGAYQNNLQLLQTVQKRILKIINKNTFQRANPLNLQQLFAYES FT LRYHYTNFKDQFINSTSRTRNKLIILPRYNKTVSTKNSYIKAINIYNKLPN FT DLKIIDMRRKTNIRKIQDWLRINEELMSEKTNTTTIDCQQLKIWLSEIGNQ FT DQCERRCVISSKIQDLIQQRKEMNNNRQGTAENSVYLDIRTYICAKGKCNS FT YNIRKHRRISTYNYNKINNTYPNIIDSLDDITNDIINTWMLKTKNNKANIK FT FIKFIKGTCNDLENIKNPNIEKYKPIETVFSMIDGMAAENSNKARNREATA FT APGQKITESASHRMTYGDINYLGSNKAFFYFDRGIRGKQVFTNSVSSLKLQ FT SPYKDIVIIDEIIFTSDCKCDSSNQHKEMFPHYRVTCAVGLTCPARLNSTE FT KMSYIREKNATTDRFMTARQMLVRNW*" XX SQ Sequence 5946 BP; 2530 A; 957 C; 941 G; 1518 T; 0 other; aaatgaaatt taatcatgtt atttttgttt aaataacaac atataatact gtcaatatgg 60 cgccaattgt gaaaacacct ccggataagc cggatcgcag tttttacaag tgtcatccaa 120 gtaaaatagt gaaaacagtg atttgtataa tttgtgagaa ctgcatatca taaaagtgat 180 ttcgagaaga ctaaaaatcc cgtgtacgtt ggtgaacatt tgattatatg ccaagatcat 240 gtcctaggag acctaaccac taataccgaa caacatatgc taagtgatgt ggcaaagaaa 300 accattgcgc ttataaaaat gaaaacatca gaagaaatca gggaacaact actaaaagaa 360 ttacacaata aaacagctga aaaacataat acaacaatta tggacgagga aaatgaaatg 420 gagaatcttg tggcagaaaa caccctattg aaacaactag tagcagaact gcaagacaaa 480 aatcagcttc agaaagaaat aattgaaatg caaaaacaaa aaagcacaga gtcacagaca 540 aatataaaat catatgcaga ggttataaga gacataaagc cgaaaccaaa gcgagtccca 600 agtataatga ttaaggccaa tgaccccaat cagacaaaca caatggaact gctcacgaag 660 tgtctagtta cagagaagaa cgtgcaaact aagtatatac gtaaaaagaa tgaaagcgaa 720 atagaagtca gctgtatgaa ttcaaaaagt ctagaaacac tcgagaacgt gttaacaagt 780 aaacttaaaa attgtcaagt caaggtagaa caacaaggta atccaaaaat aaaaattgtt 840 ggtatagaca acgctacaaa catggatgaa ctggacattg aaagtgatat taatacacgc 900 aactttaaaa actttaacag ctcatgtaag gtactgcaca tgtacaccaa taaaaggaac 960 aaaaccagca cagtactaat ggaatgctca ccagatatat acaagtcaat acgtgaaaac 1020 aacaataagg tttttgtagg acatcagtgc tgtaaagcct atgatctgat aaacattacc 1080 ccttgctaca actgtggaag atacggacat agtgcatcaa agtgtaggaa tgatcctgca 1140 tgtataaagt gctctgataa acacaatgtg agtgtatgca ataacaataa aattgagtgt 1200 gtaaactgta tatttaacaa taataaattc aaaacaagtc taccaacaga cagactgaag 1260 tgttctatac ttaaaaagaa gataaaaaaa tatattaact ctattgatta cccaatcgca 1320 cctgtattac cggcttggga tgaaacaagt gcaattaaac aaatacacac aatacaaact 1380 aacaggctac tataccagag gtcagtagca ccaggaaaga acgtgggaaa attccaaaga 1440 gctacgcaac tccaaggcac cagtcagcag aattatccag caaacacagc agctgaagat 1500 attcaataat ggaagcacaa ataaatttga ttgaagaaat acaaaatagt ctaaatgaac 1560 aagagcggat ttacagggat gtcgagactt ttaataagga tagaagtagt aataatgatc 1620 ttatcctgta cttaaacatc agaagcctaa atgccaattt cgaaaagtta caaatccttc 1680 ttaaaagtct aaaaataaaa ccatatgtta ttgtatgcac agaggtatgg aaattaacac 1740 actatcagta ttatagaata aaagactaca aattatatta caatcacggc gacatcaata 1800 aaaatgatgg tgttgtggtt tatgtaaatg agaatgttaa acatactaat gatacgattg 1860 taataggtaa gcttaaaata ctaaacacaa ttatctccct taataataaa aaacaactag 1920 agatctcatc gttatacaga tcacacggac taagttgtat agaatttaat cagaatctca 1980 aaatctattt agatcagaaa aaaaatgtaa aaaaccatat tataattggt gattttaaca 2040 taaatataca ggattacaat catataagtg tagagttcct aaacaacctt ctagaaaatg 2100 gttttctacc aggatttaca aacaccacca gaccaacaaa cataactaca aatgaaggaa 2160 cgtgtattga taatatcttt attaaaactg acttcctgaa tacaaaatcg ctaacactta 2220 tgacgcctat tactgatcat tatccactat tcttggaatt aaaaaatcga acactaaaac 2280 aagataacaa acagttagaa ggagactttt atcgttacaa ttacaaaaaa cttaataaca 2340 ctgctaagca aataaattgg acaaaattta agaatacaaa agatccaaac gaaaatataa 2400 acgaaataat aaaagaaata caatattgca tggaaaaatc gaaatacaaa aaaccaaacc 2460 tgcataattc caaaaaaact cccaggaaag actggataac aaaagcgata gtaaaatcct 2520 gtattacgaa agaaacatta tataacaaac taaaacaaca tccggataat gaagaactaa 2580 aggaaaatta taaacaattc gttaaaactc tagataaagt tataaaggca gcgaaaattg 2640 atcatgaaaa gaaaacaatc gagaaaaata gtgatgatcc cagaaaactc tgggcctgca 2700 ttaatagcaa aataggcaaa aataaaaata aaactgagac tactattagt caactaaaaa 2760 ctgaagacaa tacaataatc acaaatagaa tagaaattgc aaataaaatg aacgaatatt 2820 attgcaaact tggcgaaaca ttgagtaata aaataattac accaagtaat aaaaaactag 2880 aattacctaa aagcaatcca aaaacaatct acattaaacc aaccaacgaa tcagaaatta 2940 taaaaattat agacaacatg aaactaaaaa gcggaggtat agataaaata agtgcaagga 3000 tactcaaaac tattgctgct attattgcag accccctggc ccacgttata aatcaatgta 3060 ttgaaatggc aatttggcca gatatgctga aggcggctga agtaatacca atatacaaag 3120 ctgggaaaaa aatgaaatgg gtaactatag accgatatca ctaatatcaa atattgctaa 3180 aatctttgaa aaagttatac acaggagaat tatagacttc gtaagtcaaa gtaagatatt 3240 aagtaaaaat caatatggat ttatgaaaaa cattggaacc aaagatgctt tgaacatgat 3300 ctctaaaaca atatacgaaa acctagatca gagtacaccg atagctatta cattcctaga 3360 cctggcaaag gcatttgaca ctgtaaatca taaaatacta ttagacaagc tttacgcata 3420 cggtatgaga ggtaagggtc atcaacttat tacgagttac ttaaacaata ggaaacaaaa 3480 agtaaaaata ggttcacata caagtgattt cgaaaacgta aatactggtg tcccacaagg 3540 gacgatactt ggcccattgc ttttcatact ttacgtaaat gacttactga tgaatatgcc 3600 agaaaactct ataatatcct atgcagatga cacggcagtg atagctaatg gaaaaacctg 3660 gtctgaagtt gaagtaaaaa tgaacaaaca ccttgaagat atctctacat ggctcaggct 3720 aaataaacta acattaaata ttcaaaaaac agtatatatc acttttggaa actattgcga 3780 tagcgtgcca aagaaaattg aatttaaaat aaatcacgaa atactaaaga gagtaaatga 3840 aacgaaatac ctaggcctaa tatttgattc taatatgaga tgggataaac atattcaata 3900 tctcataaat aagactagat acctcatttt tattttcagt aaaatttcca aattcatgga 3960 tacatctact ctaagaataa tttactacgc atttttccac agtctgatta actatggaat 4020 agttgcgtgg ggtggtgcgt accaaaataa cttgcaactg ttacaaacgg tccaaaaaag 4080 aatcctgaaa attataaata aaaatacctt ccaaagagct aatccactta acctacagca 4140 attattcgca tacgaaagtc tacgctatca ttatactaac tttaaagatc agtttattaa 4200 ttcaacaagc agaacaagaa ataaactaat aatcctacca aggtataaca aaactgtaag 4260 cacaaaaaat agttacatca aagctattaa catctataat aaactaccga atgatctcaa 4320 aatcatagac atgcggagaa aaacaaacat cagaaaaata caagactggt taagaattaa 4380 tgagtgaaaa aacaaatacc acgacaatag actgccagca actgaaaata tggctctctg 4440 agataggcaa ccaagatcaa tgtgagcgta gatgtgtaat atcctcaaaa atacaggatt 4500 taatacagca aagaaaagaa atgaataata atcgacaagg aactgcagaa aactctgtat 4560 atttagatat tagaacctac atttgcgcaa aaggcaaatg taattcatac aatatacgca 4620 aacatcgtcg tatctccacc tacaattaca acaaaattaa taatacatat ccaaatatta 4680 ttgatagttt agacgatata acaaacgata ttataaatac ctggatgctc aaaacaaaaa 4740 acaataaggc taatatcaaa ttcattaaat ttattaaagg aacatgtaat gacttggaaa 4800 acatcaaaaa tcctaatatt gaaaaatata aaccaattga aacggtcttc tccatgatag 4860 atggaatggc tgcagaaaat agcaacaaag caagaaatag agaagcgact gccgcacctg 4920 gacagaaaat cacggaatca gccagccata gaatgaccta tggcgacatt aactatctag 4980 gatcaaacaa agcctttttc tactttgaca gaggaataag agggaaacag gtttttacaa 5040 actcagtgtc ttctctcaaa ttacagtcac cgtataagga catagtaata atagatgaaa 5100 taatttttac atcagactgt aaatgtgact catctaatca gcacaaagaa atgtttccac 5160 actatagagt tacctgtgcg gttggactta cgtgcccggc taggctcaac tcaacagaga 5220 aaatgtcgta cataagagaa aaaaatgcaa caacggatag atttatgaca gcaagacaaa 5280 tgttagtgcg gaactggtga aatagtgcga aatatgaaaa gtgccattat acaaagttag 5340 aatataatat tttaaatgtt aagttttttc tttacaatac caaattacaa tacaaaaaca 5400 tagaatctaa aaaattgtgc aatataatct tatttaatga gatctttaat gtcgtcgtag 5460 cattactcgt ataaaagacc aaatcataga atcacttact aataattaag aaaatgttaa 5520 caaaattttt atatattaat attaaaagga ttatatgtca ctgttaattt tttttattag 5580 aaaataatat atattgtcat cagagttggg acgaagagag tctaaggatt cttgtaaccc 5640 gacctacgtt tgtatgaatg agaagggtgg tatgaactgt gtctggaagt ccgacatggt 5700 ttcatgttat ccttaagcat gaatggagat tgtatggtat gttgtatgac tggatgttat 5760 tgctgaagaa acagcaataa ctaaccagta ctcatgtatt tttagtattg ttaatccgga 5820 cctgcgaaca ggaaatattg tttttctctg caggattgcg tcgttatagt gtgttattgt 5880 gcaaataata tcaactctgt atcgaatgca attgttttgt aactctggag ttaaataaat 5940 aaataa 5946 // ID Gypsy-5_DGri-I repbase; DNA; INV; 7661 BP. XX AC scaffold_4666; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_DGri_; KW Gypsy-5_DGri-LTR; Gypsy-5_DGri-I. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-7661 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_4666; Positions 787 8447. XX CC Positions [3363-3788] - Reverse transcriptase CC Positions [5038-5517] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 1252..2664 FT /product="Gypsy-5_DGri-I_1p" FT /translation="MQSNELSESLANLKLGAASTSSQRNTQSGNTVYTSQS FT LKDTIVSLVKETLQGGDIIRQAINTEGNDCLNDIPFTSPRLGKEQEKVPEI FT TRLIRDFSGDRTTFGSWKKNVERIMTIYNGQEETYKYYLITLSIRNKIIGE FT ADAVLESYNTPMNWKAISKCLTEHYADRRDLKTLEYQMFYLTQGRLSVQEF FT YQMVYHQLSLILNQISVQEDAPEVIRALTNNYRGKALDTFVRGLNGDLPRL FT LAIKEPRDLSHALHLCNLVENQEHRNAFLPKPRQQIPPLPPRNHIPLHKLP FT QNNNSVQWRNFNPNLYHTPKPNMNHQMAQQHNQYPQPFRGQYTLPPQSFNP FT RQGYQAPPRPFFQKPEPKPVPMEIDNSIRTRNVNYMNKPRTNGFVGKHPSN FT APSGQPPKIQRQFHISTGQSEGGGEQPNPYGMMDEEQIYLEDAEPEGAHAL FT SDEERSGTYYGCQPTTIDASDIHFLD" FT CDS 2811..4187 FT /product="Gypsy-5_DGri-I_4p" FT /translation="MSVGGQTKISHHTFLTLFGQRNRPIQFFLLPTLKTFH FT GILGNDTLKQLNAVIDVRNETLAIDGNYRIKIRQLETQAVNNIKVRTEHMT FT NKQKHFIDHVTKTRSTLFAEPDEKLTYTTRVLGEIRTTSDSPVYSKHYQYP FT TSLRGEVEKQIDQLLRDGIIRPSRSPYNAPVWVVPKKSDASGEKKFRIVID FT YRKLNSVTISDRYPIPEIGEIIAQLGENKYFSVLDLKSGFHQIPLMARDIE FT KTAFAVNGGKYEFTRLPFGLKNSPSIFQRALDDILRDHVGKICYVYIDDII FT VFSKSEDEHINHLNTVITTLDDANMKIQIDKCEFFKTEVDFLGFTISADGI FT KTNCEKVRAIKNFPTPKSLKDLRSFLGLSSYYRRFIRDYAKLAKPLTTLLR FT GENGRLSKNLSHKAEVTLDNNAIKAFDKIKNTLSSNDVVLAYPQRIPLDYR FT CIKFRHRSSPRARR" FT CDS join(4258..5787,5791..7284) FT /product="Gypsy-5_DGri-I_2p" FT /translation="MLAIVWALQNLRMYLYGTSRVIIFTDHQPLTFALSSK FT NHNGKLKRWKSYIEEYNHELRYKPGTSNVVADALSRNPPTPEINNTTTSTV FT QSDDSSSHNLIPVMEVPINVFKNQLFLLTSETESYEFQLPFPTHHRHIVER FT PDYSEDDLIDILKKRLDPKIVNGLFTTENIMAKIQNLYPYHFASYKVRYTQ FT KQVQDVTCESEQLEIIAKEHTRAHRNAIENKIQILNNFFFPSLKKKIEKIT FT KACETCKESKYDRHPQKGELQPTPIPTYPGQIIHIDLYTTERTIVLTAVDK FT FSKYAQIKILNSKAAEDIKHPLREVMIAFGMPKLVVMDNERALNSASIKFL FT IENQLKSKIYTTPPYSSTSNGQVERFHSTLTEIMRCVKKDDGITNFEDLIF FT KSTQLYNQSIHSTVNAKPIDIFFGTRITNDPQELEKNRNKVTDKLKNKQAN FT DLEYHNNKKKAIKQYKKGDTVYCKIDKRLGSKLTPKYKKEIVEENRNTTIK FT TQGGRVIHKNNLKTYPYRFFLPTCFGLATILDYSHSQIIPIKTGVTALENG FT TYRIIHVFNIEEYETAFGNLSNELNEINKTHPLFPLLTFEFAKVESDLKTL FT KPTQRKARAVNALGTVWKWLAGTPDHYDYEILVSKTDEQVENNNKQLIINK FT ILSKRMNELTENTNKIVKDTIDNKNFQSEIATLLKYKIQIFKEELNNLIYA FT IHWAKTNTINSIILSKIETEKIANLLEEKGTFFINPEELLEFAKIKIATNG FT KKILYIINIPILSNEICTTLLLIPIKKYNKINKLKFNNIINCNQKIFGVKE FT NCENHNDKSICKKTKLVDLSNDTCVTRILKNKKADCKFINNQHVPDHEELF FT TGTILLNDFKGEIKIDEETTELRGTFLIQFENSTVRIGEDLYLSKVMTSVE FT ATPPLLQIIDGKSEVEEVLSLQMLKEININNTKELKTLRNSSTTTTIVNFG FT LSTTLFIILTIIVTCWIKQRSGTRKINRQEDNPPQQSQTIETRFYIASEDV FT RI" XX SQ Sequence 7661 BP; 3026 A; 1691 C; 1277 G; 1667 T; 0 other; tcataacaaa taactaaacc caacgcattc aaataaataa gccaacacta cgaatatcca 60 aagtcaatgc actcgaacga attccataaa acacgcgcgc cccgtcagca tatttcacag 120 ctacagcgca tggacaagcg ccccgtcagc agatcccacg ggaagaccat cgtgggaccc 180 acaacgggca actgacaccg tcggcgacaa aaccgcaacg atcagcagaa gcagcagtcc 240 gaggttaagt tatgataagt ttcagtttat aaatttagtc ttaataaaac attcaaatgg 300 aacaaattgg ttttttttat taaaaacctc gaggactcgt ttaattggcg cccgagcagg 360 gaccggtacg gaagtaaaag ttatagtgcg ctagtgaaag tgttttaact actgcgcaaa 420 tacgaacata actccgcgaa acaggacagt gtctaaaagt gcaagttcta tcagaagcgc 480 ctcaagtagc cgcagatacg aaccaacaaa gtgcaagttc tatcaaaagc gcctcaagta 540 gccgcagata cgaacaaaga ccagtgccag ttctatcaga agcgcctcaa gtagccgcag 600 atacgaataa gacaggttga tcaaaagcgc ctcaagcagc cgcagatcta aacccaaaaa 660 agtttcaaaa atacattcgt accggtgcca caataacacc gccgacaaac cctgtagtgt 720 ccgtccgcag ggtaccgtga gtgaagtatc cggaggaacc caagtcaccc agctcgactt 780 gaaatcgtcg tgggacaaaa aggagcaatt ctaaaggatc atcgcgggag attcgccgga 840 atgctgatca tattgtaagc atagtgatta agaaaaattt taacgattaa gaaaaaatag 900 aaaaactaaa ataaaaaaaa aaaaaataat aataataata ataataataa taataattat 960 aactatagaa aaaataatta cgaagaaaac caagaaagaa taacgactta caaattaaga 1020 tccaaaaaga aaaaaacata aattaagaaa gcaaaagaag tttttaaaca aagaatattg 1080 acaattttta cgaatcaaag caaccataga ttaagtaaca agttttcgaa aaatcattca 1140 caactaaata actttccaaa aatatcacca atataaaaat tcaaaacaat ttttaaaaaa 1200 aagtggtccg aatactaatt caccataatt cgttaaagac aacatcacaa gatgcagtca 1260 aacgaattgt ccgaatccct agccaatttg aagctgggag cagccagtac cagctcccag 1320 agaaataccc aatcaggcaa cacagtttat acaagccaaa gcctgaaaga tactatcgta 1380 tcactggtaa aagaaactct ccagggtggc gacataataa ggcaagcaat caataccgaa 1440 ggtaacgatt gtcttaacga tatcccattc accagtccga gactaggcaa agagcaggaa 1500 aaggtgcccg aaataacgcg gttgattaga gatttttccg gagaccgaac cacatttggg 1560 tcctggaaga aaaacgtaga aagaataatg accatttata atggtcaaga agaaacgtac 1620 aaatactact taataactct gtcaattcga aataaaataa taggagaagc cgacgctgtg 1680 ctcgaatcct ataacactcc tatgaactgg aaagcaattt ccaagtgcct tacagaacat 1740 tacgctgata gaagagactt aaaaactctc gaataccaaa tgttctactt aactcaggga 1800 cgtctgtctg ttcaagaatt ctaccagatg gtataccacc aattatctct tatccttaac 1860 caaataagcg tacaagaaga tgcaccagag gtaattagag ctctgacgaa taattatcga 1920 ggtaaagcct tagatacgtt cgtgagaggt ctaaatgggg atctacctag actcttagca 1980 atcaaagagc caagggacct gagccatgcc ttgcatttgt gtaacttggt ggagaaccaa 2040 gaacatagaa atgccttctt acccaaacca agacaacaaa taccacctct accgcccaga 2100 aatcatattc cgctacataa attgccccaa aacaataact ccgtacaatg gaggaatttc 2160 aacccaaacc tataccacac ccctaaaccg aatatgaacc accagatggc ccaacaacat 2220 aaccaatacc ctcaaccatt ccgtggtcaa tatacattgc cgccacaatc tttcaaccct 2280 agacagggat accaagcgcc cccgcgcccc ttcttccaaa aaccagagcc aaaacctgtg 2340 ccaatggaga tagataactc cattagaacc agaaacgtta attacatgaa caagccaagg 2400 accaatggtt tcgtaggcaa acaccccagc aacgcaccca gcggtcaacc tcccaaaatt 2460 caacgacagt tccatatctc aaccggacaa tcagaaggag gaggtgaaca accaaaccct 2520 tacggcatga tggacgagga acaaatttat ctcgaagatg ccgaaccgga aggagctcac 2580 gccctatcgg atgaagaacg aagtggaacg tactatggat gccaaccaac aacgattgat 2640 gcctccgaca ttcatttttt agattaaata gttcctcttt gccctatttc cagtgcataa 2700 cgaggaacgg caacacatta aattttttaa tagatacagg ctcaaaccaa aattacatac 2760 aaccaagtct agttccaagt ccgagaccga accaaaatgc atttaatgca atgtccgttg 2820 gtggacaaac aaaaatatcc caccacactt tccttacttt atttggtcaa agaaatagac 2880 caattcaatt cttcttatta ccgacattaa aaaccttcca cggtatccta ggtaacgata 2940 ccttgaaaca gctgaacgct gtcattgacg tcaggaacga aacattagct attgacggaa 3000 actataggat taaaataaga caactagaaa cacaggcagt gaataacata aaagtacgga 3060 cagagcacat gacgaacaaa caaaaacatt tcatagatca cgtcactaag acccgatcaa 3120 cactgtttgc agaaccagac gagaaactga cctacacaac cagagtgtta ggcgaaataa 3180 gaaccacatc agattctcca gtatatagca agcactacca ataccctact tcgttaagag 3240 gagaggtcga aaaacagata gaccagctgc tgcgagacgg cataataaga ccttcacggt 3300 caccgtacaa cgcccccgtg tgggtagttc caaaaaagag cgacgcctca ggcgaaaaaa 3360 agtttcgaat agtcatcgac tataggaaac ttaactctgt aacaatttcg gatcgatacc 3420 cgatccctga gattggagaa atcatcgcac aattgggcga gaataagtat ttttccgtac 3480 tagatttgaa gagtggcttt caccaaatcc ctctcatggc aagagacata gaaaaaactg 3540 cattcgcggt taatggcgga aaatatgaat tcaccagact tccgttcgga ctaaaaaatt 3600 cgccatcaat atttcaacga gcattagacg atatcttaag ggatcatgtc ggaaaaattt 3660 gctacgttta catcgacgat atcattgtct ttagcaagtc ggaagacgaa catatcaatc 3720 atctaaatac ggtcatcacg accttagatg atgcaaacat gaaaattcaa atcgacaaat 3780 gcgaattttt caagaccgaa gtcgatttcc taggtttcac tatttctgca gatggaatta 3840 aaaccaattg cgaaaaagta agagctatca aaaattttcc aactcccaaa agcctcaaag 3900 acttacgttc tttcttgggt ttatcaagct attacagaag attcattcgg gattacgcca 3960 agttggcaaa gcccctcaca accctcctaa gaggagaaaa cggtagactt tcaaagaatc 4020 tctcccacaa agcggaagta acacttgaca ataatgcaat taaagcattt gacaagatta 4080 aaaacacgct cagctcaaac gacgtggtac tagcgtaccc acaaagaatt ccacttgact 4140 accgatgcat caaatttcgc catcggagca gtcctagagc aagaaggtag acccattacc 4200 tacatatcgc gcacactctc gaaaacagaa gaaaactacg caactaacga gaaagaaatg 4260 ttagctattg tgtgggcact tcagaatctg cgcatgtacc tctacggtac atcaagggtg 4320 ataatcttta cagaccatca acctctaacc ttcgcactga gtagcaaaaa tcataacgga 4380 aaactaaaaa gatggaaaag ttacatagaa gagtacaacc atgaactaag atataagcca 4440 ggaacttcta acgttgttgc agacgctcta tccagaaatc ccccaacgcc agaaattaac 4500 aacacaacaa catcaactgt acaaagtgat gacagttcat cacacaacct tattccagta 4560 atggaagtcc caattaacgt ttttaaaaat caacttttct tactaacatc agaaacggaa 4620 tcatacgaat tccaactacc ttttcccaca caccacagac acatcgttga aagaccagac 4680 tactcagaag acgacttaat agacatcttg aaaaaacggc tcgacccaaa aattgtcaat 4740 ggattgttca cgacggaaaa tataatggca aaaatccaga atttataccc ctaccatttt 4800 gctagttaca aagtcaggta cacccagaaa caagttcagg atgtcacatg cgaaagcgaa 4860 caattagaaa tcatagcaaa agaacacact agagctcata gaaatgcaat agagaataaa 4920 attcaaatac ttaataattt cttcttccct agcctaaaga aaaaaattga aaagataaca 4980 aaggcatgcg aaacttgcaa agaaagcaag tacgacaggc atcctcaaaa aggtgaattg 5040 caaccaactc ctattccgac ttatcccggc caaattatcc acattgacct atatacgaca 5100 gagagaacta tcgttctcac cgccgtagac aaattctcaa agtatgcaca gattaaaatc 5160 ctaaattcaa aagcagccga agatataaaa cacccactac gtgaggttat gatagctttc 5220 ggtatgccca aactagttgt catggacaac gaaagagccc taaactccgc atctataaaa 5280 ttcctaatag aaaaccaatt aaaatcaaaa atatacacga caccaccata tagctcaaca 5340 agcaacggtc aagtcgaaag atttcattca acactcactg aaataatgag atgtgtgaaa 5400 aaggatgacg gcataacaaa tttcgaagat ctaatattta agtcgactca gttatacaac 5460 caatccatac actccaccgt gaatgcaaaa ccgatagaca tatttttcgg aactcgaatc 5520 accaacgacc cccaggagtt agagaaaaat agaaacaaag tcacagataa attaaaaaac 5580 aaacaagcta atgatctaga atatcataat aacaagaaaa aagcaataaa acaatacaaa 5640 aagggcgaca cagtttactg caaaatagac aaaagattag gttccaaact gaccccgaaa 5700 tacaaaaaag aaatagtcga agaaaatcga aataccacca taaaaacaca aggtggacgg 5760 gtaatccaca aaaataactt aaaaacataa tatccttaca gattcttcct accaacatgc 5820 ttcggcttag cgaccatatt ggattactcc cattcacaaa tcatacccat aaaaacagga 5880 gtcaccgcac tagaaaacgg cacctacaga attattcacg tattcaatat cgaagaatac 5940 gaaacggcgt tcggcaactt atcgaacgaa cttaacgaaa taaataaaac acacccacta 6000 tttccattac tcacatttga atttgcaaaa gtagaatcag atctgaaaac attaaaacca 6060 acacaaagaa aagccagagc agtcaatgca ttaggcacag tttggaaatg gctggccggc 6120 acgccggacc actacgatta tgaaatttta gtttcaaaaa cagatgagca agtagaaaat 6180 aacaataaac aattaataat aaataaaatt ctaagcaaaa gaatgaatga attaaccgaa 6240 aacacaaata aaattgtaaa agacaccata gacaataaga actttcaaag cgaaatagct 6300 acattactga aatacaaaat acaaattttt aaagaggaat taaataactt aatatacgcc 6360 attcattggg caaaaaccaa cacaataaac tcaataattc tgtcaaaaat tgaaaccgaa 6420 aaaattgcaa atttactaga agaaaaaggt acatttttta ttaatccaga agaattgtta 6480 gaatttgcta aaattaaaat tgccacaaat ggcaaaaaaa tattgtatat cattaatatc 6540 ccaattttat caaatgaaat ttgtacaacc ttattactaa taccaataaa gaaatacaac 6600 aaaatcaata agctaaaatt taataacata attaattgta atcaaaaaat atttggagtt 6660 aaggaaaatt gcgaaaacca taatgataag agcatttgta aaaaaaccaa gctagtagac 6720 ctaagtaacg acacttgtgt aacacgaatt ctaaaaaaca aaaaagccga ttgtaaattt 6780 ataaataacc aacatgtacc cgaccacgaa gaattgttca ctggaacaat actgctcaac 6840 gatttcaaag gagaaataaa aatcgacgag gaaacaacag aactacgggg aacattcctc 6900 atccaattcg aaaattccac ggtcaggatt ggagaagacc tctacttatc gaaggtaatg 6960 acgtccgtag aagcaacacc accactgctg caaataatag acgggaaatc agaagtcgag 7020 gaagtactct cattacaaat gttaaaggaa attaatatta acaacacaaa agaactaaaa 7080 acactccgaa acagtagcac aaccacaaca atagtcaatt ttggactctc aaccacgctg 7140 ttcatcatac taacaataat agtcacctgt tggataaagc aaagatccgg cacaagaaaa 7200 attaatagac aggaagacaa tcccccacag cagagtcaaa ccatcgaaac aagattctac 7260 atcgcgtccg aggacgtccg catttaaagg gggaggagtt aacatccatc ataacaaata 7320 actaaaccca acgcattcaa ataaataagc caacactacg aataaccaaa gtcaatgcac 7380 tcgaacgaat tccataaaac acgcgcgccc cgtcagcata tttcacagct acagcgcatg 7440 gacaagcgcc ccgtcagcag atcccacggg aagaccatcg tgggacccac aacgggcaac 7500 tgacaccgtc ggcgacaaaa ccgcaacgat cagcagaagc agcagtccga ggttaagtta 7560 tgataagttt cagtttataa attcagtctt aataaaacgt tcaaacggaa caaattggtt 7620 ttttttatta aaaacctcga ggactcgttt aatttatatt t 7661 // ID Gypsy-207_AA-I repbase; DNA; INV; 5821 BP. XX AC supercont1.1553; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-207_AA_; KW Gypsy-207_AA-LTR; Gypsy-207_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5821 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1553; Positions 33694 27874. XX CC Positions [2817-3242] - Reverse transcriptase CC Positions [4305-4781] - Integrase core CC 'GGTA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 778..2001 FT /product="Gypsy-207_AA-I_1p" FT /translation="MMPPWMFGNPQMQMWAAQQMWGAKSVEQFSKHKNTHN FT DQEHVDRSKSKSRQSSRRVRPKITSTSSESKDSSDSESGWPSDDPPKPVRQ FT KKRGFKSGKRPVSDWRLKYDGADNGQNLMKFLKEVEFYAKSEDMSPKDLFR FT SAIHLFNGAAKTWFMTGFENEDFMSWEELKEELKREFLSPDHDHTSEIRAI FT ARKQGPRETFQDYFIELQKIFNSLTKPMTERRKFEIVFRNMRADYKGHVVS FT SEIDNLADLKKFGRRLDATYWYKYHTSNNDSNTRGKPAQVNEINTGTKPKS FT KAKSEDKQKSRTFHNSALDRRGSDEEGESRPRKSKSPLRFNEQKEGLQILV FT EKYKPLKDGHCFNCRLQGHHARDCDRPKHKYCQKCGFMNVDTSSCPWCAKN FT ASKTVPEGRQFDRQ" FT CDS 1908..5162 FT /product="Gypsy-207_AA-I_2p" FT /translation="MRFHECGHLFVSVVRKKRVENCPRGQTVRSTVKSLNN FT IYDIKDALQTSGFDKVSNIEYVHSHNFQINEAIISLENDNRPFVKVSVFET FT PLVGLLDSGAHLSILGIGALKLIRKCSLKVFPSDTTLRTANGEALEVTGLV FT YLPLTFNGATKMIDTLVVPSLKRRILLGMNFWRAFEIRPTVGQTQVEEVVV FT DADDPDSHSTLTGSDTDLSEEQIARLDQIKVQFKIATEGVLDTTDWISHRI FT ELTEEAKKLAPARINPFPVSPKKQELINAELNKMLECKIIEKSYSDWALRL FT VPVDKSDGTVRLCLDARKLNERTVRDSYPLPHSDRILSRLGKANFITTLDL FT SKAFLQVPLHPRSRKYTAFSVLGRGLFQFTRMPFGLVNSPATLSRLMDRVL FT GSGELEPNVFVYLDDIIVVSETFEEHLTILQEVASRLKAANLSINLDKSHF FT CVQEVTYLGYILGRTGLRPNPDRVAAIVNYERPTSLRALRRFLGMCNYYRR FT FLANYSAYTQPLTDLLKNKPKSVSWNDKAEASFNKIKELLISAPILTNPDF FT TRPFTIHCDASDAAIAGVLTQTHDGIEKPIAYYSQKLSTTQQRYFATEKEG FT LAVLNSIEKFRCYIEGSKFTVYTDASALTYIMRSSWRTSSRLCRWSIELQR FT YDMTVLHRRGADNIVADALSRSVEELTVSKQDQGWFTDMVRKVQADPEKFK FT DFRYENGILKKLVSSQEDSLDYRFAWKTCVPENLRDKVLVEEHDDKMHLGA FT DKTLALVKKKYFWPKMLNDVKTYIRKCSLCRQNKPANQSQIPEPGRQRLTN FT KPFQIIALDYIQSLPRSKNGNAHLLVIMDLFSKWCLLTPVRKISASSVVKI FT LEESWFRRYSVPEFLITDNASTFLSSEFKTLLTKHKIQHWKNARHHSQANP FT TERLNRTINACIRSYVKENQKLWDTRVSEIEYCLNSTPHTATGFSPYKVLF FT GHEIVGSGEEHKIDRETDEVSDEERLERKRKIDERIYSIVERNLKKSHDKN FT IRTYNLRSKSSAPVYTIGQRVFRRNFRQSSAADSYNAKLDALYVPCTVLAR FT VGTSSYELADESGKPIGVFSACDLKPEA" XX SQ Sequence 5821 BP; 1792 A; 1307 C; 1316 G; 1406 T; 0 other; attggcgccc aacgaaaatt taaatttgat atccttctcg tcccccggaa ggggtgataa 60 aaacagctcg ctgttttaga gtatatcttc gtcctccaaa cgaaggaaaa ttgggtttga 120 gaactgtaaa tctaaattca cttccaagat ggatgttgaa ttcgaaacgt tgtatgcagg 180 tttggcgatt catcacctcg atcgcgagga gattgaatat gaattgaaaa ttcgcggact 240 accgtttacc gaaaccgaaa cgcgcgcggc tttgatgcga cggttgaaag accaactgaa 300 acaggataaa ggaagggtca atcaggacct ggacttcgac cggttggata ctaccgtaga 360 caatgaaatt aaaattatcg atgctaaagt gaatcaaatt aaagattttc ttacgaaccg 420 ttcaaaattc gagggaattc gcgaaagtct caaaacgagg cttacacatt atttcgcacg 480 ggcgaaacga ttaccggaga acaccgaaag agacgaagat ctcgcggaca tagatgcgtt 540 gattgcatgt attcgaggag catttaatac ccatttttca ttatttgcgg gacagcgaga 600 tgttatcgaa caactcaatc aatcgttttc acagttgtta gcgaacaagc ggctgaagaa 660 aagcaagatg aagaggtaga aacccttgat tcttctaatc gatcgaataa gagaaaccta 720 cagtcagtcg acaaccccgt cgcgaatccg aatttagcag caaatttcgt accgtttatg 780 atgccaccct ggatgtttgg gaacccacag atgcaaatgt gggctgctca gcaaatgtgg 840 ggagcaaaat ctgtagaaca attttcaaag cataagaata ctcataacga tcaagaacac 900 gtggataggt ctaaaagcaa gtcgcgtcag tcgtctcgtc gcgtgagacc gaaaattacg 960 agtacaagtt ctgaatccaa agattcgtct gattcagaat cgggttggcc atcggatgac 1020 cccccgaaac cggttcgtca aaagaagcgc ggttttaaat cgggtaagcg tccagtgtcg 1080 gactggcgac tgaagtacga cggtgcagat aacggtcaga acttgatgaa gtttctgaaa 1140 gaggtagagt tttacgcgaa atcagaggac atgtcgccga aagacttgtt cagatcggcc 1200 atccacttat tcaatggagc cgccaaaaca tggtttatga ccggttttga aaatgaagat 1260 tttatgtcat gggaggagct caaggaggag cttaaacgcg aattcttaag tcccgaccac 1320 gaccacacgt ccgaaatacg cgcgatcgcg aggaaacaag gtccacggga aacctttcaa 1380 gattatttca tcgagttgca aaagattttc aactcattaa ctaaaccgat gaccgagcga 1440 agaaaattcg aaatcgtgtt tcgtaacatg cgagcggatt acaaaggcca cgtggtgtcg 1500 tccgaaatcg acaatctggc cgatttaaag aaatttggca gacggttaga tgcgacgtac 1560 tggtacaagt accatacgtc caataatgat tcgaatacac gcggtaaacc ggcgcaagta 1620 aacgaaatta acacggggac taagccgaaa tcgaaggcca agtccgagga caagcagaaa 1680 tcgcgtacgt ttcataattc cgctctagat cgtcgcggtt ccgacgaaga aggtgaatcg 1740 cgtcctagaa agtcaaaaag tcctctccga ttcaatgagc aaaaagaagg tttgcaaatt 1800 cttgttgaaa agtataagcc acttaaagat ggtcattgct tcaattgtcg tcttcaaggt 1860 caccatgctc gcgactgcga tcgaccgaaa cacaaatatt gccagaaatg cggtttcatg 1920 aatgtggaca cctcttcgtg tccgtggtgc gcaaaaaacg cgtcgaaaac tgtccccgag 1980 ggcagacagt tcgatcgaca gtaaagtccc ttaacaatat ttacgacatt aaagatgctt 2040 tgcaaacctc tggatttgac aaagtgtcca acattgaata cgtccattct cataacttcc 2100 agataaatga ggcaataatc agtctagaaa atgacaatag accgtttgtc aaggtttccg 2160 tgtttgaaac gcccttggtt ggtctattgg atagcggagc tcacctgagt atcctcggaa 2220 taggagctct gaaattgatt cgaaaatgca gcctaaaagt ttttccctcc gatacaacac 2280 tgcggaccgc caacggcgaa gcgttggaag tcactggctt agtatactta cctctaacat 2340 ttaatggagc tactaaaatg atcgacactt tagtggtgcc ctcgctcaaa agaaggattc 2400 tgttgggaat gaacttctgg cgagctttcg aaatacgccc cactgtcggc cagactcagg 2460 tcgaagaagt ggtggtggat gctgatgatc cggatagcca ttccacctta acaggttcgg 2520 acacagattt gtccgaagag caaattgcta gactagatca aattaaagtg cagttcaaga 2580 tcgccacaga gggcgtgtta gacacgacag actggattag ccaccgtata gagcttacgg 2640 aggaagctaa gaagcttgcc ccagctcgaa taaatccatt tcccgtttca ccgaaaaaac 2700 aggagttaat taacgccgaa ctcaataaaa tgctcgagtg taaaatcatt gaaaaatcct 2760 acagcgactg ggctctgcgt ttagtccctg tggacaagtc tgatggaaca gtgcgccttt 2820 gcctggatgc acgcaagctg aacgaacgta cggtcaggga ttcctatccc ctcccacact 2880 ccgaccgaat tttaagccga ctaggaaaag ctaacttcat taccaccttg gatctttcca 2940 aggcctttct ccaagtaccc ttgcacccta gatcgcgcaa atacacggcc ttttcagtgt 3000 tgggaagggg attgttccag ttcacccgta tgcctttcgg tctggtgaat agcccagcaa 3060 cgctatctcg cttgatggac cgggtcttgg gcagtggtga actggaacca aacgttttcg 3120 tatatcttga tgatataata gtcgtcagcg aaacgtttga ggaacacctg accatacttc 3180 aggaggttgc atccagacta aaagctgcca atctatccat caatttggac aaatcacact 3240 tctgtgtgca agaagttacc tatcttggat acatcttagg ccgtactggt ctgagaccaa 3300 atccagatag agtagctgcc atagtcaact atgagcgacc aacttcctta agagcgttga 3360 gaaggttctt ggggatgtgt aactactata ggaggttttt ggccaactac agtgcttaca 3420 cacaaccctt aacggacctt ctcaaaaaca aaccaaaatc agtcagttgg aacgataaag 3480 ctgaagcttc cttcaataaa attaaggaac ttctaattag cgccccaata cttacaaacc 3540 ccgattttac tcgaccattc acaattcact gcgatgcgag cgacgccgca attgcgggcg 3600 tactcacgca aacgcacgat gggattgaaa agccaatcgc gtactattcc caaaaacttt 3660 ccactacgca gcaacgatat ttcgcaacag aaaaagaggg cctcgctgtc ctcaattcta 3720 ttgagaaatt tcgttgctac atagagggaa gcaagtttac ggtctacaca gacgcctctg 3780 cgttgaccta tattatgaga agtagctggc gtacttcatc tcgtctatgc cgttggagca 3840 tagagcttca aaggtacgac atgactgtct tgcaccgtcg aggcgcagac aacatcgtag 3900 cggacgctct atcccggtcg gttgaagaac tgacggtctc taaacaggac cagggttggt 3960 tcaccgacat ggtaagaaaa gtccaggctg accctgaaaa gttcaaagac ttccggtatg 4020 aaaacggaat tcttaaaaag ctggtctctt cgcaagagga ttccttagat taccgctttg 4080 cctggaagac atgtgtgcct gaaaacttgc gcgacaaagt tcttgttgaa gagcacgatg 4140 acaaaatgca ccttggtgcc gataagacgc tagcattagt gaagaaaaaa tatttctggc 4200 ctaaaatgct caatgacgtc aaaacttaca tcagaaaatg ttcactatgt cgacagaaca 4260 aaccggccaa ccaatctcag attccagagc cgggtcgaca aaggttaacg aataagccat 4320 ttcaaattat tgcgctagat tacatacagt ccctcccacg cagcaagaat gggaatgccc 4380 atttgctcgt gattatggat cttttttcga agtggtgtct actcacaccg gtgaggaaaa 4440 tttcagcatc atcagtcgtc aaaatcttag aagaatcctg gttccggaga tactcggtcc 4500 cagaattctt gataacggac aatgcatcca catttttgtc gtcggagttc aaaactcttt 4560 tgacgaaaca taaaatccaa cattggaaaa atgcacggca tcacagccaa gcaaatccaa 4620 cagaacgcct taatcgtacg ataaatgcat gcattagatc gtacgtgaaa gaaaatcaga 4680 aattgtggga cacccgtgtg tccgagatcg aatattgcct aaacagtacc ccccatacag 4740 ccacaggatt ctccccctac aaggttcttt tcggacatga aatagtggga tcgggagagg 4800 aacacaagat agacagggag acggacgaag tgtccgacga ggagaggttg gaaaggaagc 4860 ggaaaatcga cgaaagaatt tattccattg tggaaagaaa cttaaaaaaa tcacacgaca 4920 aaaacattcg cacctacaat ctacggtcga aatcttccgc tccagtctac acgattggac 4980 agcgagtctt ccgacggaac ttccgtcaat cttcagcagc ggactcgtac aacgccaagc 5040 tggatgcctt gtacgtgccc tgtactgtcc tcgcacgggt tggcaccagc tcgtatgagt 5100 tggctgatga atcgggaaaa ccaatcggtg tgttttccgc ctgcgatttg aagcccgaag 5160 cctgaaagaa acgattcaat tcgatcttca aaactttcgt tatatttgat tgataactca 5220 ccattgagta gaaatcgtcc ggacgtccat tcataaaatt gatcgcagca ggatcgtcga 5280 gtcctgaaaa taagatagag aagcaaagcc aaggtcagag tcgaccaatg taaacattag 5340 aaaagtaaac ttacctggca aaacataatt ccctcctcca aaagaatcat ctctccatca 5400 gctgatccca gaaacatcga gtggcatcac aaccggtata ggctcctctt ccatcgctga 5460 tattgcgcgc ccgagatttt tctttcgccg gacgaaattc tagtgaaaca gccggaaaaa 5520 ctcgtgaaaa cttcgtaata acccgcggaa taaatcgacg actcgcatgc accgtaggta 5580 aattcgcgtc gggatgtaaa caaacaggtt gacggtttga cagttctcaa cacccagtgg 5640 tgttggttga gttactttga gttccttttt ctttttcgtg agagcgaaaa aggagaaagc 5700 gtttagaaaa gggcctattt tggtcaattt gttttaatca aagaaaattt gattatgaat 5760 cagattagaa tcatacatcc gaatgaaaat tcttttcatt cgaagaagga tgatggggta 5820 a 5821 // ID Gypsy-25_AA-LTR repbase; DNA; INV; 197 BP. XX AC supercont1.308; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_AA_; KW Gypsy-25_AA-I; Gypsy-25_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.308; Positions 25477 25673. XX SQ Sequence 197 BP; 56 A; 36 C; 36 G; 69 T; 0 other; tgttgtatta ccacttttgt ttaaattcat ccattgatat gcccctccca gtatgtaaca 60 gatttgataa agattttcag tgtaggttca gttgatcagc gacgcgcatc ttgagcggac 120 ataataaatg tactcggtta tacattatgt tttatttgat gctacgaaag ttcccatgct 180 tgtaacggaa tacaaca 197 // ID Chapaev-8_HM repbase; DNA; INV; 9037 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-9037 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 34-34 (2008). XX DR [1] (Consensus) XX CC Chapaev-8_HM is a young family of autonomous Chapaev DNA CC transposons that were active in the hydra genome less than a few CC million years ago (copies are ~2% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of 15 incomplete copies; it codes for a 841-aa CC Chapaev transposase (ten exons). The consensus sequence is CC incomplete (its 3'-terminal portion in unknown). The Chapaev-8_HM CC TPase forms a distinctive group of Chapaev TPases (including CC Chapaev-3_HM, Chapaev-4_HM, Chapaev-6_HM, Chapaev-7_HM), whose CC 240-300-aa N-terminal portion composed of the Chapa zinc and RING CC fingers is similar to the N-terminal portion of RAG1 (pos. CC ~100-380). For instance, the N-terminal portion of the CC Chapaev-8_HM transposase (pos. 4-245) is 27% identical to the CC RAG1 N-terminal portion (E value <1e-14). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(675..1694,1778..2070,2150..2359,2528..2770, FT 2951..3027,3470..3672,3760..3900,4500..4568, FT 5096..5270,5356..5660) FT /product="Chapaev-8_HMp" FT /note="Transposase." FT /translation="MEIHNKNLTLLCRVCGFLVGKKSYPIGTNQKNKIEKV FT FHVMLSDEQENVHPNKICHKCYNTINNVIKRMTSTTLHLSMNWKPHCDECF FT CCQQVEKLCKGLNFAKLLKQNKKLIGRPGIGEKVWSFSMIANIKKSIVTTH FT DSEIDIEELKNEFNPHLQLCQCNLCGKIPKQPVTLKKCEHLFCFFCIVENI FT KVKKLNETSCPKCKELILPEDLVTSVKTNSLLNMLTVECICKKKYNVMKEY FT DLYTNHKSVCIDKSIVQAAPLLSPSISSFASNLASSSSFTVSTSSSTSYLF FT NNNISEIFNLTVDNDIPRIVEDAALHVLKQKMAKDGGQVVEFKSGGSRPVL FT FSLTPKAYVSSNQASNTTIRVRNASIKRHMKVISGTSNDAVCCQTSKLINS FT FQAETKGLILDNLNTERVVISATNMVAMKADLCIPWEKLKTISKWLKSFNI FT NTASHSSQRIVAEKLSGDDLVVENAPFTFEKEEKGTFEIKYVSWGYIENLP FT MHILRHLDQLESCKRLHHHEFIPEKEIQIKIGGDYGGGSFKMTYQVANTLN FT PNSKDNTIVFSIFEAKDYRVNVKVAMSRFEKQIEDLQKMKYKDNNIRVFVF FT GDYQFLCALYGISGASGRHCCLFCYATASDMKFGEHKSSEIKDRTLEDLFL FT DHERFIENGGLKKNAKNFNNVITEPILKIPLDQVSLPSLHMALGIYLNFFN FT LFEEEVHQLDILIAAEAVKSNINFSEAYTTNIYNTLFGKHSFEFGQRPCTK FT MLCNCIPKLVHEEGYSGTSVHQFAVEISNKYKQLFDKFAQCYKIFSSKNTI FT TQDDLILLKKNINNLMQFYRLNWPEASVTPKLHMLEHHAIPFMEKWGAGFG FT FYGEQGGESIHMEFNKLKTIYQSIPCPTLRLKSILKSHYQKTNPENMRLKP FT CLKKKKRS" XX SQ Sequence 9037 BP; 3334 A; 1159 C; 1221 G; 3323 T; 0 other; cacggtcgtt caaactcact aaataaacga caagttattg tacgcaagcg cagttagata 60 ataaaaattt taagactttc aagatggctt cgggccttag ctacaaagtt aatttttagt 120 tgacgaccaa tttttttatt acttactgcg ttttttatca ttttatacga aaaaaccagt 180 cgtttggcta tcagtagtgt tttggtaatt tagtttttac tttctagtaa gaaaaataaa 240 aggagaacat tattttatga atgatgcagt tttacttaga aaattttttt tcttcgattt 300 gtttataaat ttataagtgt tttttttttt tttttttgag cggtgaaact taaaactttt 360 tataattttg tttatatgaa tcttatatct atttattatt aataaataat cttagcactc 420 aatttttttt cttgttcaaa atttacatca aagatttgtg ataaattgac accattttag 480 ttaatgaaaa ttgtccttac gtaaaaaagt taacaaagtt tttatatttt aattttaaac 540 taagtctttt attttgtatg cagtttatta tttttttgtt aatttaattt agtttttgaa 600 tatttccact atttaataga taaagttaaa gccgttattc agtacacttg tccaaaaact 660 tgtctttttc ttagatggag atccataaca agaatttgac tcttctttgt agggtctgtg 720 gatttttggt tggtaaaaaa agttacccta ttggaacaaa ccaaaaaaat aaaattgaaa 780 aggttttcca tgttatgcta tcagatgagc aagaaaatgt tcatccaaat aaaatttgcc 840 ataaatgtta taataccatt aacaatgtta tcaaaagaat gacatcaaca acacttcatt 900 taagtatgaa ctggaaacct cattgtgatg aatgcttttg ctgtcaacag gtagaaaaac 960 tctgtaaagg tctaaatttt gcaaaattac ttaaacaaaa taaaaagctt attggtcgcc 1020 ctggtattgg tgaaaaagtc tggtcatttt cgatgattgc taacattaaa aaaagcattg 1080 tgacaacaca tgactcagaa atagatattg aagaattaaa aaatgaattt aatcctcatt 1140 tacaactttg tcagtgtaat ttatgtggta aaataccaaa acaaccagta actctaaaaa 1200 aatgcgaaca cttattttgt tttttctgca tagtggaaaa tataaaagta aaaaaactca 1260 atgaaacatc atgtccaaaa tgtaaagagc tcattttacc tgaagattta gtgacaagtg 1320 ttaaaacaaa ttctctttta aatatgttaa ctgttgagtg tatatgtaaa aaaaaataca 1380 atgtaatgaa agaatatgat ttatatacta atcataagag tgtctgcatt gataaatcaa 1440 tagtacaagc agccccattg ttatcaccat caatatcttc atttgcttct aacttggctt 1500 catcatcatc ttttacagta tcaacatcat catcaacatc ttatttattt aacaacaaca 1560 tttctgagat tttcaatctc acagtggata atgatattcc aagaatagtt gaagatgctg 1620 cactccatgt tcttaagcaa aagatggcta aagatggtgg acaagtagtt gagtttaaaa 1680 gtggaggatc tagagtaatg atatttttta attttagctt aatttgtaat aactataaat 1740 taaataggtc ttacttttca aagatctttc tttttagcca gttttgtttt cgcttactcc 1800 aaaagcatat gttagcagta accaagcatc aaatacaact ataagagtca gaaatgcttc 1860 tatcaaaaga cacatgaaag ttatttctgg tacatccaat gatgcagttt gttgccaaac 1920 atcgaagctg ataaattcat ttcaagcaga gacaaagggg ttaatattag ataacctaaa 1980 tactgaaaga gtagtaatta gtgcaaccaa catggtagca atgaaagcag atctgtgcat 2040 accttgggaa aagcttaaaa caatatccaa gtttttattt ttttatatga attttattat 2100 atactagttt tttgttgcat gaaatataac tttttatgtt tgtttaaagg tggctgaaga 2160 gctttaatat aaatacagca tcacattctt cacaaagaat agttgctgaa aagttatctg 2220 gtgatgacct agttgtagaa aatgcaccat ttacctttga aaaggaagaa aaaggaacat 2280 tcgaaataaa atatgtatct tggggctata ttgaaaattt accaatgcat attttaagac 2340 atcttgatca attagaaagg tagttttaaa ccatcaatta acaattcttt taaaagcata 2400 atttataaag caaattgaaa tcttttgtgc ttatttgtta ttatttaaaa tgaggccatt 2460 gtctataaaa tcagtttatt attttataat atattcaatt acttaataat gtattgtttt 2520 tatgtagttg taaaagactt catcaccatg aatttattcc tgagaaagag atacaaatta 2580 aaataggagg tgattatggt ggagggagtt ttaaaatgac ataccaggtt gcaaatacct 2640 taaacccaaa tagcaaagac aacaccatag tttttagcat tttcgaggca aaagattaca 2700 gagtcaatgt caaagtagcc atgtcaaggt ttgaaaagca aatagaagat cttcaaaaaa 2760 tgaagtataa gtaagttaat aaaactgttt ttctattaac tgtttttttt tttatttata 2820 gctgaatttt taaatatttt aattagtttt gataatttat tttggtaaac tatactttaa 2880 tataaatata atacattaca aatgttatct taattaaatt tatgttaaac attttaacac 2940 acctctacag ggataacaat attagagtat ttgtttttgg cgattatcag tttctatgtg 3000 ccctctatgg aatatcagga gcatcaggtg attttctatt tttattttta tctctataaa 3060 tgacaaatta tcagcagttt tatagttttt actacttaaa cagaaatata ttaatttcaa 3120 aaaaaaaaaa aaaatggttg gcactttttt gctgaaagaa aaaccacaat attgtatata 3180 atatttacat aaagtattac aaattatata tacatatata tatatatata tatatatata 3240 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 3300 tatatatata tataattaaa aagtttacaa tactaataat aaccaaaaaa ttaaataata 3360 attattatta tttataagaa aattaaattt atgaacatat ttaatttttt tatgataaat 3420 aattattctt ctttaatttg taattatttt ttattgattt taattttagg gaggcattgt 3480 tgtcttttct gttatgcgac agcaagtgac atgaaatttg gtgagcataa aagttctgaa 3540 ataaaagatc gtaccttaga agatctgttt ttggaccatg aaagatttat agaaaacgga 3600 ggtttaaaga aaaatgcaaa aaatttcaat aatgtcatca cagagccaat tttgaaaata 3660 ccactagatc aagtaatgtt gtttttattt gtcgtattaa ttttgcagta cggctaaata 3720 aaagtagatt cattatgcac tttcaatata tatttttagg tatctttacc tagtctccat 3780 atggctcttg gtatttattt gaattttttt aatttgtttg aagaagaagt ccatcaattg 3840 gacattttga ttgctgcaga agcagtcaaa agcaacatca acttttcgga agcatataca 3900 gttttcgtat caaaacaaaa acaactgtta aatcttcaaa ctaaaattct taaccttgac 3960 aatcaaattc aattaatcaa tgattttatt ttgttagctg ctgttaacaa ttcatataat 4020 ttagaaaata tttaagctct atactttaca gaaattgagt tacttgatgg tgaaaaagca 4080 aaaaaggtta aacaatttat tagtttaatt aatttgatat tatgttttat tgatttcata 4140 atttttgttt catattattt ttatattatt atgtgtagtg aaatttattt aaaatagagt 4200 ttgtgtttgt aattgatttg gtatgcatta aacagttttg caagcatttg gtatggtgca 4260 atttaattgt agaaaacagc ttaatataag cattattaat aatagcaatt ttgtaaactg 4320 tttatatgtg tgcacatata aacagtttgc aaaccagggt ttaaaaaaac cagtgtttta 4380 aatgtcatcc ctgtcacaca caattttgcc tttgttgtta tggaatgaat aatattataa 4440 tttttgctgc tatgcaagta ttgttggttt atgttattct atttgaagtt tttttttaga 4500 caaatatata taatactctg tttgggaaac attcctttga attcggtcaa agaccatgca 4560 caaagatgat cgagacggta ctgcaaaagc ttaaagtaca acgtcaagct taccacggaa 4620 aaagttttat aggaaaccat gtgcataaaa tgctaaaagt aatttaatta aagattagta 4680 taaataaata aaaaattact agtaagtaaa taacaaatta ggtttttact ggataaataa 4740 taatttatta tctatctttc atgagttatt ttctttactt ttaatgcaga tgttgttata 4800 aaaacaaatg aatttaatgt aattacaatg aataagacaa taaataaata agtcatttaa 4860 ccttatataa atcttttaaa agcaacaaaa tcaaaaattt ttattaaatt aataattaat 4920 tataacaggt taatccattt tatttttaat tattaagtta attaaacata aaaaacgggc 4980 acaatagcaa aatagttcat atcgcatttt gattataaac ttataatata tctaaactgt 5040 ctcttaacaa gactagtgaa gtatatttgt tatttagaaa tcttcaatac ttcagctatg 5100 caactgcatc ccgaagcttg ttcacgagga aggatattca ggaacaagcg tgcatcagtt 5160 tgcagttgaa attagcaata aatataaaca actatttgat aaatttgccc agtgctataa 5220 aatattttct tcaaaaaata caattacaca agacgattta attcttttaa gtaagtcgca 5280 ttattttata aaatgtttta taaggatatc tactgttttt tttttttttt ttaattgtac 5340 ttttaatttt tttagaaaaa aatataaaca acttgatgca attttatcgt ttgaattggc 5400 ccgaggcttc agtgacacct aagctgcata tgttagaaca tcatgctatt ccgtttatgg 5460 aaaaatgggg agcgggattt gggttttatg gagagcaagg cggagaatct atccacatgg 5520 agttcaacaa actcaaaact atttatcaat cgattccttg cccaactttg cgattaaaaa 5580 gtattctcaa atcccattac caaaagacaa atccagagaa catgcgtctt aagccatgtt 5640 taaaaaaaaa gaaaagaagt taataagaaa gttattttaa tatctctgat tccgctgaga 5700 aagaagtatc aatttgtcaa taaaaatatt tctaatgacc tggttttacc aaagtcataa 5760 aattcatgaa ccaaaagaaa gaaagaaaaa aaaacggcta ttttcatagc cacaagcata 5820 ataaaggcat tatttggtat aaaataattt ttgttatagg taatacatat atatcacccc 5880 tgctatctta attttttttt ctcacacaga tcagatatat aacatattca aaatatggtg 5940 tgtatctatc tagatgtatg tatattcaaa taatttaaaa gaaaccaatt taaaaactaa 6000 agcgacaaag cttaagtaaa aatatctcac gatttaaatg ttagcaaacc caaaaacaaa 6060 ttacgttgac gtttgtgcct agtattcttt tttttaattc taggtgcccc aagcagtccc 6120 tacggtctta tcacagagca ccgcggaaga gcatttgaag aaaagttcac gccttctttc 6180 ttaccgatgt tacaaaactt gctcagagat ggtgttgaac cgtgtttaaa catctctagc 6240 ttctgaggca agtgcgctta ccactgcgct acggctgctc accactgcgt tatgtcaatt 6300 tatttaatac agcctgacat tattatttac aaattcaaaa ttttattggc aaataagacc 6360 cgtgttttat tttaaatata tgctaggaat atactcttgg gtatctaata gttagatgcg 6420 tgtatttgga aaaatatata ggtaaataaa ctactaatcg aggggtgatc atatatataa 6480 tatataaaga ctcagacgcc ccgcaaatac tacggctctc tatatatatg tatatatata 6540 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 6600 tatacatata tatatatata tatacgcaca cattatataa gccccacccc tcgattagta 6660 gtttatatta taaatattta tatatcatat atatgtccca accctcgatt agcagttttt 6720 ttacctatat atttttccaa atacacgtac ccaacattta aaaacccaag agtatattcg 6780 tagcatatat ttaaaaaaaa aacacgggtc ttatttgaat ataaaatttc gaatttgtaa 6840 aaaattatgt caaaattatg taaaagtaat tcaatagata ttattgaaat gcggtttttg 6900 aaattaaaaa ataatcgtta aagcatatta tttggagcat ttcatcattt tatttaacca 6960 aggagtcttc catacgtatg cagatatgac aagggggagg ggggggatta gttgttttac 7020 tgcgtatgta ctttatagat gccccccaat gcttaactag aaatcaagat tgctgtggta 7080 gtggttaaac taagaggtaa ataaggcctc tttaagacat cggcatgttt aaaggaagaa 7140 gtgggtaatg attttaacat ttttatttag atttatatgg aaaaacaata tatatatata 7200 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 7260 tatatatata ttatttttaa aatcaaatac gtaaaaaatt aaagtgtttt tttttttcag 7320 tttgatagac aattaagtgc attagtttag ttcttcaatt tctttttttc aattttaaat 7380 taggtctctt ctcattcctt acacttcggg gtttgcttta tattactccc ctctttttac 7440 tttgtattca taactggcct atttttttgc aacgaaaaac atgtatgctt tttagaataa 7500 atatgtgtga tatattgatg caatgctttg atctgtttgg caattgatct acttaaatag 7560 aaatgttctt tttagcttaa tattttcttg agttgctact ttgatcaaat ttgtttaaaa 7620 atgtgctatt aacgttttct gcgaaaacac tccaacattt tatctctgaa ttgtggggtt 7680 atcctttgaa tatgcaaaca tttatataca tatgtacaaa cataaatgtg tatatatgta 7740 tgtgtgtata taatatatat atatatatat atatatatat atatatatat atatatatat 7800 atatatatat atatatatat atatatatct tcaaagatat cccccttacc ccaaaaatta 7860 tgtcatagtg ctgacctaac tatactcctg tgtataaata aatccccaaa cttgtcagtc 7920 aatgcatgta caattttctg caataaacgc atgcacaatt ttctgcaata aaatatataa 7980 gtcagttgat gtatgtacag agatatttgc ctttaaaata agtaatgcca gtatgcagaa 8040 tgaattattc atatatttaa atcacagttc gactgcaaaa gaattatata acttttcata 8100 tactgaatga taagtgtcaa gttgctagag tttaattgtt aaggttttcc actatcaaat 8160 ttgccaaaag caaaaatcta gagtgaaatc aaattactct taccaactta aaatgtctct 8220 taacaactga acaaaatcat cattttgtta actgatattt tttcaatata aaaaagttta 8280 aatcatttag aattcgatag gttgtaagct agtaaaagaa caacatcttt atataagtaa 8340 gaattgtgga ttttttatta atagccaaaa tggatacaag ttatgcttaa gaattttaaa 8400 gtaattttga tattgtaaca aagatcaaga ttagcattgc ttcttttgaa tgtttattga 8460 tcgaaaattt taatgagacc taagactgtt aaataaagta aagctgtaaa aatttaagta 8520 caattctttg aaattacttt gaaaaaaggt tgtaattatg agactatttg ttacacatta 8580 agatacgtct aaaaagtatt ttttagagtt gtgcatataa aaaaaacatg catatctctc 8640 tgatatttta tgcttatatt tcaactgatg atgactatat gtaatgaaaa catgacttat 8700 attatgtttt aaatcaaatt taaaaactta tcaaatattt aaaagtactt acacattttg 8760 tgctataaaa ttttaggaaa caggtaagtg tacacatcca cacacacata tatatctatt 8820 ttttcatctg ctgtcagttt cccttttcta gcatcagtgg gaagctttcc tgaatattat 8880 aggctcatgt caaagattat tacaaaaaaa agcaaaccac acattttatt agacccttca 8940 aactcgaatc tttatttctt agaagttaac aactcaatta agagcatatt ttataacaaa 9000 tttgactaaa gttgcaattg aaaaaactgt gtatatg 9037 // ID BEL-6_CQ-I repbase; DNA; INV; 5650 BP. XX AC AAWU01028809; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_CQ_; KW BEL-6_CQ-LTR; BEL-6_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 165-165 (2011). XX DR Genome; AAWU01028809; Positions 43132 37483. XX CC Positions [4665-5249] - Integrase core CC 'TCAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 240..5648 FT /product="BEL-6_CQ-I_2p" FT /translation="MSEFYTPLKNPQNPDDSAFLGFAESTNGNGGNSARST FT RTATKIVELASTVDDPELAKQIKSLARQRDQAMNKLLRIYSSVNVPTPIDP FT AQVKGHATKLQAAYNEYSRFHSEIVALIPEEEMDTQERVYVRFEKYYDFTY FT SVIEKRIMRQPPAPAQVIIHQQPLKAPIPTFDGEYTKWPKFKAMFLELMAQ FT SRDSETIKLYHLNKALVGAAEGALDAKTINEGNYERAWEILNEQFGNQRVI FT VETHIRGLLSLKRMLSESHSELRTLLNECTNHVESLDYLKQPLTGVSELMT FT VHLLTAALDMTTRKLWEQTLKPGELPNYAATIAFLKSQCQVLERCESACPP FT ATTKLTTKQLVPASKVGQRSHAVTASSEEFTEKCDFCGYPHRNYQCNKLNG FT LTKLEKDEKVRALGLCFNCLQRGHRLSSCPSTKTCQKCHKRHHTQLHDDGT FT HPEQETNIAVTEPPADDPGHNFLPTMAIPLAPQSPGTNVTTTCSLSASAPK FT TVLLLTAVVLVTDRDNQAYQCRALLDSGSQANFITEAMANTLGLERKRANV FT PISGINNVKSLARDKVEVQFKSRCSEFCATLECLVTPRVTGTIPTTDIDIS FT GWVLPDGIHLADPQFFKTNKVDMLIDAELFFALMKPGHITLDDDLPELRNS FT HLGWLVTGAYRPARHNEAVQYSHVASLESVEEAIRKFWELEEVPQAATMSS FT EEQRCENHFTSTYSRDRTGRFVVRLPLRENADDVDSCRNLALRRFFMLENR FT LQRNPSLKVQYVDFVREYLQLGHCREVNLADDNPNLKPYYLPHHAVLRPGS FT SSTKCRVVFDASAKSAPTNLSLNDTLLIGPVVQSDIFSIMLRFRQHRYVFT FT ADIQKMYRQIRIHPDDTHKQRIYWREQPNEPLKILELLTVTYGTASAPFQA FT TRSLVQLANEEAENFPVAAKITRSDCYVDDVLSGAPTLNDAIGALNQLKEM FT LARGGFPIHKWCSNSPEVLAHIPEEEREPLKPLADRHVNEVIKVLGLLWDP FT SADQLLIADCSKAATERDPPATKRIIYSEVAKHFDPLGLFSPSILLGKLLV FT QGLWQCKLDWNTPISDNTQARWNELKCELPKLLQIKIPRRVTHDGAVYYEL FT HGFGDASNVAYGANAYIQSVMKDGTVRSRLLATKTKVAPLHSVTIPRKELC FT AALLLVRLICKILDALTVPLRKVVLYSDSEIVLAWLKKHPYQLETFVGNRV FT AQILALSRNFEWRYVRTHHNPADISSRGMLPGELMVCEPFWTGGEYINTAG FT VAQDLPDEILDEDLPELKANVVVMAALIQEPLEIFSKSSSYRRLQQTVAWI FT MRFFDNARKPKAERNYKRHLTVRELRRSTNVVVRVIQHVELGDEIQRVRTN FT TPCKRIGELNPVLNDDGLLRVGGRLKHAKLSDEAKHQLILPNSHPVTHNLI FT RDMHDELLHVGPAGLLSAIRQRFWLIRARSTIRQVTGSCVKCFRTNPSGIS FT QLMGDLPKQRVTPSPVFSITGVDYAGPILVKQGTYRPKVVKAYIAVFVCMA FT TKAIHLELVSDLTTDAFLAALQRFVSRRGLVSEMHSDNATNFHGADNELHR FT LYEMFCNQAEIDRIVRFCQVKEIQWHFIPPDSPEFGGLWEAAVKCTKTHLK FT RVIGNNTLNFEEMTTILCEIEAILNSRPLFAISGDPADPEVITPAHFLIGR FT PMIAVPEPSYQDLKIGRLSRWQHLQLLREQFWRAWSRDYLSSLQPRKKNRI FT TTTNVRPGMIVLLQDKNRPPLHWKLGRITAVHPGSDNLVRVVEVFSEGSTY FT TRSITKLSILPIEENQGQPDQRGKDDLNCPGE" XX SQ Sequence 5650 BP; 1499 A; 1562 C; 1440 G; 1149 T; 0 other; aatggtccgt tacgatccgg attgcggctt ctgagtgcgt gttgtccgag gacaattggg 60 ttaaagtgcg aaaaaataaa ctcaaaaagt gaaatttccg tgcgaaaact aaacgaattc 120 gaacaatcga acgtgtaaaa aaaaaaatat ccctgccaaa aggtagaaca gtgaaaataa 180 gaatcctgcg caaagcagaa cagtggaaaa aaaccctgca agaaaaaaaa aactgcagaa 240 tgtcggagtt ttacacgccg ttgaagaacc cgcaaaatcc ggatgattct gcgttcctgg 300 gatttgcaga atcaacaaac ggaaacggtg gcaactctgc gcgctcgaca agaacggcca 360 caaaaattgt ggaactggca tcgactgtcg atgaccctga actggcaaag caaatcaagt 420 cacttgcacg ccagcgggac caagcgatga acaagttgct tcgtatttat tcaagcgtaa 480 atgtgccaac cccaattgat ccggcgcagg tgaagggaca cgctacaaag ctgcaggccg 540 cctacaacga gtactcacga ttccacagcg agattgtggc gctgattcct gaggaggaga 600 tggacaccca ggagcgagtg tacgtccggt tcgagaagta ctacgacttc acatactctg 660 tgattgaaaa acgcatcatg cggcaacccc cagctccagc acaagtgatc atccatcaac 720 agccactcaa agccccgatc ccgacttttg acggcgagta caccaagtgg cctaagttca 780 aggccatgtt ccttgagttg atggcgcaat ctcgcgactc ggaaacgata aagctttacc 840 acctgaacaa ggcgttggtt ggagcagccg aaggagcgtt ggacgctaaa acgataaatg 900 aaggcaacta cgagcgggct tgggaaattc ttaacgagca gtttggaaat caacgggtga 960 tcgtagaaac tcacatccgt ggcttgttgt cccttaagcg aatgctctcc gagtcgcaca 1020 gcgaactacg cacccttctc aacgagtgca ccaaccatgt tgagagtcta gactacctga 1080 aacagccact gactggagtg tccgagttga tgaccgtcca cttgctaacc gcagctttgg 1140 acatgactac tcgcaagctg tgggagcaga cgctgaagcc tggtgagttg ccaaactacg 1200 ctgcgacaat tgctttcttg aagtcccagt gccaagtact ggaaagatgc gaatccgcct 1260 gtccgcctgc cacaacgaag ctaacaacta agcagctggt cccggcgtcg aaggtcggcc 1320 aacgatctca cgcggtgacg gccagctccg aagaattcac cgagaagtgc gacttctgtg 1380 gatacccgca tcgaaactac cagtgcaaca agctcaacgg actcaccaag ctggagaagg 1440 acgagaaggt tcgagctctt gggctttgct tcaactgtct ccaacggggc catcgcttga 1500 gtagctgtcc ttcaacgaag acgtgccaga agtgtcacaa gcggcaccac acgcagctcc 1560 acgacgacgg cacgcatccc gagcaggaga cgaacatcgc tgtgactgaa ccgccagccg 1620 acgaccctgg gcataacttc ctgccgacga tggcgatccc gttggcaccc caatcgcctg 1680 gaaccaacgt cacgacaacc tgctccctga gcgcatctgc ccccaaaaca gttctgctgc 1740 ttacagcggt ggttctggtc accgatcgcg acaatcaagc ttaccagtgc cgagccctct 1800 tggacagtgg ttcccaagca aacttcataa ctgaagccat ggccaacacc cttggcttgg 1860 aaaggaaacg tgctaacgtt cccatctctg gaatcaacaa tgtgaagagt ctggccagag 1920 acaaagtgga ggtccagttc aagtcccgat gcagcgagtt ttgtgctact ctcgaatgct 1980 tggtgacccc cagagtcact ggcaccatcc cgacgaccga catcgacatc tccggatggg 2040 tgcttcccga cggcatccac cttgccgacc cgcaattctt caagactaac aaggtggaca 2100 tgctgattga cgccgagttg ttctttgcgc tgatgaaacc aggacacatc acactcgacg 2160 acgatttgcc tgaactacga aactcgcacc taggatggct ggtcacaggt gcctaccggc 2220 ctgcaagaca taacgaagct gttcaatact cccatgtggc atcacttgag tcggtggaag 2280 aagcgatccg caagttctgg gagcttgaag aagttcctca agccgccaca atgtcctctg 2340 aagagcaacg ctgtgaaaac cactttacat ccacctactc aagagaccgt actgggcggt 2400 ttgtggttcg attgccgctt agggagaacg ccgacgacgt ggacagctgc cggaatctag 2460 cattgcgacg cttctttatg ctagaaaacc ggctgcaacg taacccttcc ctgaaggtgc 2520 agtacgtgga ttttgtgcgc gagtatctcc agctcggcca ctgccgggaa gtaaatttgg 2580 ctgacgataa cccgaatttg aagccatatt atttgccgca tcacgcggta cttcgccctg 2640 gcagctcgtc cacgaagtgt cgagttgtct tcgacgccag cgcgaagtcg gcaccgacaa 2700 atctgtcact caacgatacg ctgctgattg gcccggtggt acaaagtgac atcttctcga 2760 ttatgctccg ctttcgccaa catcggtacg ttttcacggc ggacattcag aaaatgtacc 2820 ggcagattcg aatccaccca gacgacacac acaagcagcg gatttactgg agagagcagc 2880 cgaacgagcc gttgaagatt cttgagctgc tgaccgtaac ctacggcaca gcttcggctc 2940 ctttccaggc tacccgaagt ttggtccagc tcgccaacga agaagcggag aactttccag 3000 tagctgcaaa aataacaagg tccgactgct acgtggacga cgtgttgtca ggtgcaccaa 3060 cactcaacga cgccatcgga gccctaaatc aactgaagga gatgctggct cgaggtggat 3120 tcccgatcca caaatggtgc tccaactcgc cagaggtcct agcgcacatc ccggaagaag 3180 aaagagaacc gctgaaacca ctcgccgatc gacacgtcaa cgaagttatt aaagttctcg 3240 gcctgctgtg ggatccgagc gccgaccaac tgttgattgc tgattgctcg aaggcagcaa 3300 ctgaacgcga tccgcccgct acaaagcgaa tcatctactc cgaggtggcc aagcacttcg 3360 atccgctcgg gctgttctca ccttcgattc tgctgggaaa gttgctggtt caaggcttgt 3420 ggcaatgcaa actagattgg aacacaccga taagcgacaa cacacaagca aggtggaacg 3480 agctgaaatg cgagctgccc aaactgttgc agatcaaaat tcctcgccgc gtgacgcacg 3540 acggcgcagt ctactacgaa ctacacggat ttggagatgc gtcaaacgtg gcgtacgggg 3600 cgaatgccta cattcaaagc gtgatgaaag acggaaccgt cagatctcgc ctactagcga 3660 ccaagaccaa agttgcccct ctccactccg tcacgattcc acgcaaggag ttgtgcgctg 3720 ccttgttgct cgtacgattg atctgcaaaa ttctggacgc tcttaccgta ccccttcgca 3780 aggtagtctt gtactccgac agcgaaatcg ttttggcatg gttgaagaag cacccgtacc 3840 agctcgaaac gtttgtaggc aatcgcgtcg ctcaaattct agccctatcc agaaacttcg 3900 agtggaggta cgtccggaca caccacaacc cagcagacat ctcatctcgt gggatgcttc 3960 ccggagagct catggtttgc gaaccattct ggaccggcgg agagtacatc aacacagcgg 4020 gcgtcgctca ggatctaccc gacgaaatcc tcgacgaaga cctgcccgag ctgaaggcca 4080 acgtggtggt gatggcggcc ctcatccaag agccgctaga gattttcagc aagagcagct 4140 cgtaccggcg tcttcagcaa acagtcgcat ggataatgcg attcttcgac aacgctcgga 4200 aaccgaaggc agaacgcaac tacaagcgac atctcaccgt acgagagcta cgacgttcaa 4260 ctaacgtcgt ggtgcgagtg attcagcacg tcgaacttgg agacgaaatt caacgagtga 4320 gaacgaacac gccatgcaag cggattggag aactgaaccc ggtcctcaac gatgatggct 4380 tgctgagggt tggcggacga ctgaagcatg ctaaactttc cgacgaagcg aagcaccagc 4440 taatcctccc gaattcccat ccggtcacgc acaatctgat tcgggacatg cacgacgaac 4500 ttctccacgt tggacctgct gggctactgt ctgcaatcag acagcgattt tggttgatcc 4560 gggcccgttc gaccatacga caagtaactg gatcctgtgt gaagtgcttc cgtacaaacc 4620 cgtctggaat ctcgcaactg atgggtgatt tgccaaaaca acgagttacg ccgtcacctg 4680 tcttcagcat cactggagta gattacgccg gcccaattct ggtgaagcag ggaacctacc 4740 ggccgaaggt ggtgaaggcg tacatcgcgg tcttcgtttg catggcaacg aaggccattc 4800 atttggagct ggtgtccgat cttacgaccg atgcatttct cgcagcactt caacgattcg 4860 tgagccgcag gggactggtt tcagaaatgc actccgacaa cgcgacaaat ttccacggcg 4920 ccgataacga gcttcaccga ctgtatgaaa tgttctgtaa tcaagccgaa atcgacagga 4980 ttgtgcgttt ctgtcaagtg aaggagatcc agtggcactt catcccgcca gactcgcccg 5040 agttcggtgg cctgtgggag gcggccgtca agtgtacgaa aacgcacctg aaacgagtga 5100 ttggaaacaa tactcttaac ttcgaagaga tgaccacaat actctgcgaa atcgaagcca 5160 ttctaaattc acgaccactt tttgccattt ctggagaccc ggctgatcca gaagtgatta 5220 ctccggctca tttcctaatc ggccgcccga tgattgcagt gcccgaacca tcctatcagg 5280 acctcaaaat cggccgattg agtcgctggc aacaccttca gttgctccgg gagcaatttt 5340 ggagggcttg gtcccgcgac tacctgagca gcttgcaacc aaggaagaag aatcggatca 5400 ccaccaccaa cgtccgaccg ggaatgattg ttttgctgca ggacaaaaat cgaccaccgc 5460 tgcactggaa gcttggtcgt ataaccgctg tccatccggg ttcggataac ctggtccgcg 5520 tcgtagaggt gttcagcgag ggcagtacgt acacccgatc gatcacgaaa ctgtcaattc 5580 tacccatcga ggaaaaccaa ggccaacctg atcagcgagg aaaggacgat ctcaactgcc 5640 cgggggagga 5650 // ID DNA-TA-10_CQ repbase; DNA; INV; 923 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-10_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-923 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 60-60 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. TA TSDs. 26-bp TIRs. XX SQ Sequence 923 BP; 305 A; 158 C; 163 G; 297 T; 0 other; gggtagagta gtcatcaatg agacacgggg aacaatgata aaatggctct cacaagtcgt 60 agtttcaacc aatcaggctc atatttgggg gaaaggtgtg tctactggat acacgtctgc 120 catattagtg gctttggtta tggacgctcc cttgaaaagt tattcataaa tgtttgattc 180 tggggtgtaa aagtaaatta tggacaaaaa atactttttc gctcgtaggc tgccatttac 240 accaaaacta atatttcttc aaatttcttt cgacgttcca tagggattga gttgggctac 300 aatgtccttt cattaggttt gaccaaaatt tagaatatac ccagaatcca gggctgtctc 360 attgttcccc actcatttca gcccatgggt aacaatgaga cactttgatt tttcttcatt 420 aaacatttca aaatccatgt aaatctttca aaacatgaat tgaaagtgat ttttggcata 480 tttatagagt tttaacttca tttaaccaaa catacaaagt tatttggtat aaatatatgg 540 attttacaaa atttaaatac tttttagtat aacttttgtt aattaaagtt tcatgttggc 600 aaaattgctc taaaatgttc aaggcaagtc acctgcaatg gaacaataca aaaacatgat 660 gttttactag aaaagtattg attttcgtag agtgtctcat tgttccccag ctgtctcatt 720 gttcctgcca agtgcgtcta caatgagaca gttgaataac tctggctgta gatgtcggat 780 cgatctcata ttttggtcaa tgttagaaca cactaaaaga aagaaaatgc aacaaaaagc 840 tcattaaaac atgctgatga aaaaattaga aaaatcgttg aaagttgaaa accaaaagtg 900 tctcattgat gactacccta ccc 923 // ID BEL-64_AA-I repbase; DNA; INV; 5696 BP. XX AC supercont1.274; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-64_AA_; KW BEL-64_AA-LTR; BEL-64_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5696 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.274; Positions 1237570 1231875. XX CC Positions [4725-5285] - Integrase core CC 'GTGGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 383..2413 FT /product="BEL-64_AA-I_1p" FT /translation="MKRNLMEEETKLREAELSKQKALQEKMMMMRRESMEK FT KKELMRQQAELSESSSSSNVSKSERIKDWITSQQQTEGDNLVNHTPRISTS FT NHMLNPAASQQQSKLVAPSVPTSQLAKLSLHDEVMHPILPNAYMQIAARQV FT TGKDLPVFSGNPEDWPMFIRTYEETTLACGFSDVENLVRLQKCLRGNALET FT VRSRLMMPAGVPHVIKTLQMRFGRPELIIRSLLERVRSVSAPKPERLDTLI FT DFGLAVENLVVHLQAAKQDNHLNNPVLLQELVSKLPAQLRLDWARYKVLHQ FT DSTLVAFGEFMNELIQAASEVTFDLPINLTAKTETSRNREKAFVHTHDTID FT IETRITGAVRKLPKPCIVCSATGHRVAECEEFKVKSVEERMKIVRQHNLCR FT TCLNFHQKWPCRTWKGCSIEQCHEKHHFLLHPLTPTTSTHLSTSHNSLFNK FT SCERYPYFRILPVIVSFANKQQTIFAFIDEGSSSTLLDRSIAEQLGLDGPK FT EPLTLQWTGNVARRESNSKRVRLQIAGTGKTNMFPIEEAHTVERLLLPKQS FT LSYGTLAERYPHLRGLPIADYEHAEPKILIGLDNLRLCIPLKIREGQHNDP FT IAAMCRLGWTIFGYGARTSMPTVSANFHAPAAQDVDHELNEQLSDFFSLDQ FT LGTKPPFESRCESQKDFGGNNSTSFRQI" FT CDS 2823..5654 FT /product="BEL-64_AA-I_2p" FT /translation="MFLRILIRPQDSQSQRFLWRENSGDEPSVYIIDVATF FT GSTCSPSSAQFVKNANAKEHMESFPRAASAIIRYHYVDDYLDSFETEEEAI FT AVVNQVKFIHSTGGFNLRNCLSNSEEVLLAIENTSENAAKQLSISRAEKIE FT SILGMKWNPSKDVFIYTLTLRDDLLKVIEPSHVPTKREMLKLVMSLFDPLG FT FMTFYLIHGRALIQDVWATGVDWDVPINHDLCKRWWQWISFLPSVSNLKIP FT RCYFRGQINDRKQLHIFVDASDAAYACVAYLRGNGDQGVEVALVGAKSKVA FT PLKVLSIPRLELMAAVIGSRLADSVIASHSYDIAETYLWTDSATVLAWINS FT DHRRYHKFVGVRIGEILSLTKVNQWRWVPTRQNPADDATKWGIGPSFEPCS FT RWFCGPTFLSQHEDEWPRKQRTTSTNDELIGNFNVHLVMSTSLIEFTRFSK FT WERLLRTQAYVLRFISNLGCRKTGKSKVTGILNQEELKQAERELWKQAQSE FT AYSQEQKVLLDSQGSPDIRHRLVPKSSPLYKLWPYIDLDGVIRMRGRIGAA FT WYAQPDAKYPVILPKSHRVTLLLVDYYHRHFQHANQETIVNEMRQRYEIAH FT LRTIVKQTARNCAKCRIKAAVPRSPPMAPLPVQRLEPFVRPFTFVGSDYFG FT PLTVKVGRSEVKRWVALFTCLTVRAIHLEVVHSLSSESCILAIRRFIDRRG FT APAEFFSDNGSSFIGANKQLQLEIAAMNEVLASTFTNTNTRWNFNSPSAPH FT MGGVWERLVRSVKSAICTIIDAPRRPTDEVLETILFDAEAMINSRPLTYIP FT LETADEESLTPNHFLLGSSSGIKQPPSEIQNPHINLRSCWNLVQHMTNTIW FT KRWIKEYLPVITRRCKWFDEVREIQEQDLVLIVEPSIKNRYIRGRVEKVFP FT GRDGRVRQALVRTATGVYKRPAVKLALLDLGQHGKPGKDS" XX SQ Sequence 5696 BP; 1662 A; 1277 C; 1379 G; 1378 T; 0 other; aaatctttaa gaattttgcc accaaggatg ctggctggga atacgtccga ccacaactgc 60 cattcgtgca acgagcatga cgtcatggat tcaatggtgg cctgcgacaa ttgtggtgat 120 tggcaccact ttaagtgcgt tggcgtagat aacacggtca aggaccgtaa ctggatctgc 180 aagaagtgtg aacaaggagc cttttctggt ctgcttaacc tcccgcaagc gaaagataaa 240 ccgataaaaa gtggcggcag taagtcatcg aagccgcgca gcagaaaagc ggaaaaaacg 300 gttggatcta aaatgagtat cacgtctagt gctcgtgccg cggcattcgc tcgagttcgc 360 cgaggagcag cgtaagcttg agatgaagag gaacttaatg gaggaagaaa cgaagctccg 420 agaagcggaa ttatcgaaac agaaggcatt gcaagaaaag atgatgatga tgcgaaggga 480 atccatggag aagaagaagg agttgatgcg acagcaagct gaactaagtg aatcttcgtc 540 atcgtctaat gtttcgaaat ccgaaaggat aaaagattgg attacttccc aacaacagac 600 agagggagac aatttggtta accatacacc gcggatctcc acgtcgaatc acatgttgaa 660 cccagctgca tcacaacagc aaagtaaact tgtagcacca agcgtaccta cttcacaact 720 cgcgaagctt tcgttacatg atgaggtaat gcacccgata ttgccgaatg cttatatgca 780 aattgccgct cgccaagtaa cggggaagga tcttccggtt ttcagtggta atccggagga 840 ctggccgatg tttattcgga cctacgaaga gaccaccctt gcttgtggtt tttcagacgt 900 cgaaaattta gttcgtctcc aaaaatgtct acgcggtaac gccctagaga cagtacgcag 960 tcgtcttatg atgccagctg gtgtgccaca cgtgataaaa acactgcaaa tgcgctttgg 1020 ccgaccggag ctcatcatcc gctcactact tgaacgagtt cgtagtgtgt cagcacctaa 1080 accagaacga ttagatacgc ttatcgattt tggactagcc gtggagaatc tggttgtaca 1140 tcttcaagct gcgaagcaag acaaccattt aaacaaccca gtgttactac aagaactggt 1200 ctcgaagctg cccgcgcagt tgagattgga ttgggcacgg tacaaggtac ttcatcaaga 1260 cagcacactc gtggcatttg gcgaatttat gaacgagctc attcaagcgg caagtgaggt 1320 gacgtttgat ttgccgataa atctgacagc gaaaaccgaa acgtcgagaa atagggaaaa 1380 ggcatttgta catacccatg acaccatcga tatcgaaacg aggatcacag gcgcggttag 1440 gaaattgcct aaaccatgta tcgtatgcag cgcaactgga catcgtgtcg cggaatgcga 1500 ggaatttaag gtgaagagtg tcgaagaacg gatgaaaatt gtgcggcagc ataatctgtg 1560 ccgtacttgt ttgaattttc accaaaagtg gccctgtaga acatggaagg gctgcagtat 1620 cgagcaatgt cacgagaaac atcattttct gttacaccca ctgactccta ctacttctac 1680 gcatttgtca acaagccata atagtttgtt caacaagagt tgtgaacgct atccgtactt 1740 ccgcatactg ccagtcattg tatcattcgc caacaaacag caaacgatat tcgcctttat 1800 cgacgaggga tcctcctcaa cactcttgga ccgatcgatt gcagaacagc tgggcttaga 1860 cggtccgaaa gagcctttga cgctacagtg gacgggcaac gtcgcacgtc gagaatcaaa 1920 ttccaagaga gtgcgtctac aaattgctgg taccgggaag actaatatgt tcccaataga 1980 agaggctcat acggttgaac gacttctatt gccgaaacag tcgctgtcgt acggaacctt 2040 agctgaacga tatccgcatt tacgtggctt acccattgct gactatgaac atgctgagcc 2100 gaaaatatta attggactgg acaatttaag attgtgcata ccactaaaaa ttcgggaagg 2160 acaacacaac gatccgatcg ccgcaatgtg cagattgggt tggacgattt tcggatacgg 2220 tgcgcgaaca tcaatgccaa cggtttcggc aaattttcat gcaccggctg ctcaagatgt 2280 tgaccacgaa ctgaacgagc aactcagcga ttttttctcg ttggatcagc taggaactaa 2340 gccacccttc gaaagcagat gcgagagcca gaaagatttt ggaggaaaca actcgacgag 2400 ttttcgacag atttgagaca ggtctactgt gtaaatcaga cgacattgaa tttcctaaca 2460 gttttccaat ggcgttcaga agactacaat cgttggaaag gaaattagcg aaggaaccga 2520 ccctcaaagc tcgtgtccat caaaaattga atgagtacgt tgcaaaagga tattgtcacc 2580 gggcaagtgc gagtgaattg agcttctccg acagcaaacg tgtctgatat ctcccgttgt 2640 ccgtggttgt taaccctaaa aagcctaaca aggtgagagt cgtctgggac gcagcagcaa 2700 aggtggacgg tacatcgttc aattcggtct tgcttaaagg ccctgacttg ctgactagcc 2760 ttgttgcaat tctttatcac tttcgagaac ataggatcgc agtaactggt gatatagaag 2820 aaatgtttct tcgcatccta atacgtcccc aagatagcca gtcgcaacga tttttgtgga 2880 gggaaaactc tggcgatgaa ccatcagtgt atattatcga cgttgccacc tttgggtcga 2940 cttgctcccc aagctcagcc caatttgtga agaacgctaa tgcaaaagaa catatggaat 3000 catttcccag agcggcttca gctatcatta gatatcacta tgtggatgat tatttagata 3060 gcttcgaaac ggaagaggaa gctatcgcgg tagtcaacca ggtaaaattc atccactcca 3120 ccggaggttt caaccttaga aattgtttgt ccaactcaga agaagttctg ctagcaatcg 3180 agaacacttc agaaaacgca gcaaagcaat tgagcatatc gcgagctgaa aaaattgaat 3240 ccatattggg aatgaaatgg aatccttcca aagatgtctt catttacacc ttgactctac 3300 gtgacgatct tctaaaagtg attgagccct ctcatgttcc caccaaacga gaaatgctca 3360 agcttgtaat gagccttttt gatcccctgg gatttatgac attttacttg atccatggaa 3420 gggctctgat ccaggatgtg tgggcgactg gagtggactg ggacgttcca atcaatcatg 3480 atttgtgtaa aagatggtgg caatggatca gttttctacc ttcggtgagc aacttgaaaa 3540 ttcctagatg ctactttcgg ggtcaaatca acgatagaaa acagttgcac atattcgttg 3600 atgctagtga tgcagcatac gcatgcgtgg cgtatttacg aggtaatgga gaccagggag 3660 tagaagtagc attggtaggt gcgaaaagca aagttgcgcc acttaaagtg ttatccatac 3720 cgcgacttga gctaatggca gcagtgattg gatctcgttt ggcagattca gtgatcgcct 3780 ctcattccta tgatattgcg gaaacttatc tttggacaga ttccgccacg gtcttagctt 3840 ggatcaactc agaccacagg aggtatcata agtttgtcgg agttcgaata ggcgaaatac 3900 tttcgctaac aaaggtgaac caatggagat gggttcccac gaggcaaaat ccagcggatg 3960 atgcaacaaa gtggggaatc ggtcctagct ttgagccctg tagtcgttgg ttttgtggac 4020 caacattcct ttcccaacac gaagatgaat ggccaagaaa acaaaggact acatccacca 4080 atgatgagtt gatcggtaat ttcaatgtgc accttgttat gtcgacatcg ttgatagagt 4140 tcacacgctt cagcaaatgg gagcgactgc taagaacaca agcgtacgtg ctcaggttca 4200 tttctaacct tggctgtcgt aaaacaggga aatcaaaagt aaccggaatc ctgaaccaag 4260 aagaactgaa acaagcagaa agggagctct ggaagcaagc acaatctgaa gcgtatagtc 4320 aagagcaaaa ggtgctgctt gattcccagg gaagcccgga tattcgtcat cgtttggttc 4380 ctaaatcgag tcctttgtac aagttgtggc cctacattga cctagacggt gtgattagga 4440 tgcgaggaag aatcggcgcc gcctggtatg cgcaacccga cgcaaagtac ccagtaatac 4500 taccgaagtc acatcgagta acattgcttc ttgtggatta ctatcaccgt cattttcaac 4560 acgctaacca agaaactatc gtgaatgaaa tgaggcaacg ctacgaaata gcccatttac 4620 gaacgattgt gaagcaaacg gcaagaaact gtgctaagtg tcgcataaaa gcggcagtac 4680 ctcgatctcc acctatggct cctcttccag tacaacgcct tgaaccattt gtaagacctt 4740 tcacgtttgt gggctcagat tactttggcc cgttgacagt gaaagtcggc cgttctgaag 4800 tcaaacgatg ggtggcactt ttcacgtgcc ttacggtacg tgctatacac ctcgaagtgg 4860 tgcatagctt atccagcgaa tcatgtatac tggctattcg acgtttcatt gatcgtcgtg 4920 gagctccagc ggagttcttt agcgataacg ggagtagttt tataggagct aataaacagt 4980 tgcagctgga aatcgcggcc atgaacgaag tgctagccag cacgttcacg aatacgaaca 5040 ctcgatggaa ctttaattcg ccgagtgcgc cccacatggg tggcgtatgg gagcgcctag 5100 tgcggtcggt gaaaagtgcc atctgcacga ttattgatgc tcctcgtcgg cccaccgatg 5160 aagtacttga aacgattctc ttcgatgcgg aagccatgat aaattcacga cccctcacat 5220 atataccact tgaaactgct gatgaagaat cgttaactcc gaatcacttt ttgctgggaa 5280 gttcttccgg cattaagcaa cccccctcgg agattcaaaa ccctcacatc aacttgcgta 5340 gctgttggaa tttggttcaa catatgacca acaccatatg gaaaagatgg atcaaggagt 5400 acttgccagt aatcacccga cgctgcaaat ggttcgacga agttcgcgaa attcaagaac 5460 aagaccttgt gctgattgtc gaaccatcga tcaaaaaccg atacattaga ggacgagtcg 5520 aaaaggtatt cccgggacga gatggtcgtg tgcgccaagc tttggttcga acagcaaccg 5580 gagtgtacaa gagaccagcc gtgaagcttg cactgctcga cctaggacag catggtaaac 5640 ctggtaagga ctcttagaac tgatcaatcg ccttaccagc ctttacggtc ggggga 5696 // ID Mariner-6_BM repbase; DNA; INV; 1264 BP. XX AC . XX DT 28-APR-2010 (Rel. 15.07, Created) DT 28-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-6_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1264 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 941-941 (2010). XX DR [1] (Consensus) XX CC >94% identical to consensus. XX FH Key Location/Qualifiers FT CDS 162..827 FT /product="Mariner-6_BM_1p" FT /translation="MLKFEPNKRHLRELLIYFFNLKKSAAEAHRLLVKAYN FT EAALSERTCREWFKKFKNGDFDVEDKDRSGRPKIYEDAELTELLEEDSSQT FT QKELALTLEVTQQAVSHRLKSLGMIHKQGNWVPYELKPRDVERRLCMSEML FT LARHQKKVFYIESSLAMKSGYITIIQKEENHGDYPATRQHPQQNRIFMEKS FT SCCVFGGISWVWCITSCLIRAKQSLELSTERN" XX SQ Sequence 1264 BP; 416 A; 238 C; 269 G; 339 T; 2 other; caaataagtt tccgccgttt tgagaaagat ggtgtggcac gaggtttacg ttcgaattca 60 cagatgataa tcggcttgaa aattaagttg atcatttaag aacaacttaa ccctatcata 120 ttattaactc cgtattagca tcatattttt tttatccaaa aatgttaaaa tttgaaccaa 180 ataagcgtca tttgcgggaa cttttaattt acttttttaa tttaaaaaaa tctgcagccg 240 aggcgcatcg attgctcgta aaagcatata atgaggctgc cttgagtgag agaacatgtc 300 gtgagtggtt taaaaagttt aaaaacggtg attttgacgt agaagacaaa gatcgcagtg 360 gaaggccaaa aatttatgag gatgcagaat tgacggaatt attggaggaa gattcgtctc 420 aaacacaaaa agaacttgca cttactttag aagtcactca gcaagcagtc tcacatcgtt 480 taaaatcgtt aggaatgatt cataaacaag gtaattgggt tccatacgaa ttaaagccga 540 gagatgttga acgccgatta tgcatgagtg aaatgctgct agctaggcac caaaaaaaag 600 ttttttacat cgaatcgtca ctggcgatga aaagtggata cattacgata atccaaaaag 660 aagaaaatca tggggactac ccggccacgc gtcaacatcc acagcaaaac cgaatattca 720 tggaaaaaag ctcatgctgt gtatttggtg ggatcagctg ggtgtggtgt attacgagct 780 gcttaatccg ggcgaaacaa tcactggagc tctctaccga acgcaattga tgagattgag 840 tcgagctctg aaggaaaaac gccctcaata ctactccaga cacgacaaaa ttattctgct 900 ncatgacaac gctcgtccgc atgtagcggt accggtgaaa anttacttaa aaacgcttaa 960 ttgggaagta ctacctcacc cgccgtattc accagacatt gccccgtctg actatcatct 1020 gttccggtcg atggcacatg ctctgtcgga gcagcggttt acatcatatg aagataccaa 1080 aaattgggtt gattcgtgga tagcctcaaa agacgaggag ttcttcagac gcggaatccg 1140 aactctgcct gagagatggg aaaaagtagc tgctagtgat ggacaatact tcgattaaat 1200 ttaaattacc gttctttcat aataaatatt attttttcga caaaaaacgg cggaaactta 1260 tttg 1264 // ID Crack-22_AAe repbase; DNA; INV; 5086 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-22_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5086 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1238-1238 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 19 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 743..1504 FT /product="Crack-22_AAe_1p" FT /translation="MSSRSIEELPDLISEYLSNHADKVLANIERSQQFLSN FT KLDDFVDQLLCLRAEISKLKIENEQLKKSLALVSNETAAVSDAVYKNEVDL FT DMHQREKLTANAIVVGIPRVPNENTKSLFDETCRTLGLEFSNNSIVSCDRV FT SAAKGENQPIKITFNDIRDKEALITKKKQFGLLTVTMIKGVRWPHGWTNKV FT HIRDDLSPLSMDIFRELKKHQTSLKLQYVWPSRNGIILVKQTKTSKHIKIQ FT SRADLNKLLTNKP" FT CDS 2073..4955 FT /product="Crack-22_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEQINLAVNRQLDNASIPIASFVGITVLQINIRGINS FT YEKLDSLCIFLRNLSSTIDVLVIGETWIKQGRSNFYNIPGYNSTYSCRNNS FT SGGLAIFVRDGLNFHVKNNSTDNGYHHIAVELLSSVRIIVHGIYRPPGYDV FT NHFVSNLENILSSVDSKCPCFILGDMNVAVNDVESLGTQRYLQLLSSYDMI FT VTNTHMTRPASNNLLDHVITQANHSEFVENFTIDCNLSDHCYILTQFSTKI FT HKTSKTLNKRMVNHRLVNSNFQMFLQSFDFNSLNPNDRISLITDHYAHLVN FT RFSSTVSVNVKVKTTTCPWYNFDIWKLSKISNNLFQRWKRDRLNQQLRSLL FT DHANKMLAKAKHRAKSAYYQRIFSSSNPKQLWSKINELIGNTSSKEKEPVL FT ETQGVETTNPNDIGRIFNNFFSSIGGSLASSLVSDGNINKFNTMTISNRTM FT FLRPTSQVEVSTIISGLDVSKATGIDGFSVMALKQNGVALSTIICNCFNDS FT ISMGIYPDCLKKALVFPVFKGGNAKNPTNYRPISVLPSINKVFEKLLSMRL FT NSFMDATGLLYQRQFGFRQGSSTEVAVLELADDIASSIDRKLSAGVVFLDL FT SKAFDTINHHILLQKLDAYGIRGAANDYLRSYLTNRQQQVVVSGIRSTTCS FT ISCGVPQGSNLGPLLFLIYVNDIAKLELKGKLRLFADDTAISYEATNVYEL FT SEDMSTDLHIVTSYLENNLLALNLQKTKTMLFGAKDTQGHPVLTINGVVIE FT EVDQFKYLGVLIDSQLKWDAHIREVVAKCSSLCGILRRLSSFVPQQVLLKM FT YYAFIHSRYQYGISAWGSTYNTYLKEIQIQQNRCMKAIYRLPYLQPTNTLY FT TEMEHNILPIAGLYTMRAGIIMFKIVHNINLHHNWIFNTAAHQYRTRQAHL FT LQRSGFRTEIGRRRFENMGPRIYNQLPESIKNSHSINMFRKCLRIHIKNNI FT NNFIIR" XX SQ Sequence 5086 BP; 1632 A; 1060 C; 957 G; 1434 T; 3 other; ctgctggcaa ccctgctgtg aactagctgt tgtgttctgt gctccgccta aatcgagaaa 60 attttgattg aaatatcaga taaaaaatct ctctgctgca taaaaatcgc tccaaaatcg 120 ctgctgtcct gctatcacaa agattagcac atacgtgagt agaaactcct gggaaattcc 180 tccgtaaagt tgctgtgctg ctgctagata ccatcggcca tactagtgct cgtatgtaaa 240 cacaaacccg ctttgtgaat tctgctgctg ctgttgttga tatgctgttt gttgatatgc 300 tgttgttacc aagttgaagg ctcaaattaa gctttgcgcc actcttagag aacgatcaat 360 ggtcctgctg ctataaatca aagaatcttg ataatgacga tcattttgta ctgccgaata 420 tgctgaaata aacggagttt tttgctggtt gtttttattt gcccaccgcc gccctgccac 480 cactgccaac tgctccaccg ctaccgtcca accaccgttc tattgctctg ctgtaatcat 540 ctgctgaata cacctgcgta cccatcatcg tcctactccg acacatctga agaagtccat 600 cactatccgt gtaagtgtgt aggtaagcat cacacaactc gatcggatta cgttgacgtt 660 tggccacatc agggaaccac tacaccattg tttggttttt caaaagcgcc atcttttgac 720 gagcatttga actacaagca caatgtcttc tcgtagcata gaagagttgc cagatttgat 780 ttctgaatac ttatcgaatc atgctgacaa ggtactcgct aatatcgaac gaagtcaaca 840 gtttttatcc aacaaacttg acgacttcgt tgatcagcta ctgtgtttaa gagcagaaat 900 aagtaaactt aaaattgaaa atgagcaatt gaaaaaatcc ttggcgttgg tttcaaatga 960 gacagctgca gtttctgatg cagtttacaa gaatgaagtt gatttggata tgcaccaacg 1020 agaaaaatta acagcaaatg caattgtagt aggcattcca cgtgttccga atgaaaacac 1080 aaaatctctc ttcgatgaaa cttgccgtac attgggtctt gaattcagca acaactcaat 1140 agtttcatgt gatagggtat cagctgccaa aggtgaaaat caaccgatca aaatcacttt 1200 caatgatatc cgtgataaag aggcactgat cacgaagaaa aaacagtttg gtctgctaac 1260 cgtaactatg atcaaaggcg tacgttggcc ccacggttgg actaacaaag ttcatattag 1320 ggatgatttg tctccactat cgatggacat cttccgagaa ctgaagaagc atcaaacttc 1380 gttgaaactc cagtatgttt ggccaagtag aaatggtatc atacttgtta aacaaacaaa 1440 aacttcaaag catattaaaa ttcaatctcg agctgatttg aataagcttc tgaccaacaa 1500 gccgtaattc cttcatccag ttggacatag actaatacga caagaagcaa atcaatcttg 1560 ccataaatca acagttagac aacgcatcca taccgattgc ctccttcgta gaaatcacta 1620 tgttccaatt atacgtggta tcaacagcta cgtatgcaaa agcctcatta gaatagctaa 1680 aagacttttc tggtcgactt tctcaaaaat tcgactttcg acggtcttgc tatatgccat 1740 ttcaaacgga aaaatttgga gtcaattttg aagaacgttg aagcaaaaat caaattttat 1800 accgtaaaca tttcaagtgc attacagttc gcgcaatgaa tgagcttttg atgatacgct 1860 tgtcagacgt tgagagggtg tcatttcgcc ccgatgttca ttatgccccc aatccccsta 1920 cgttcggacg agtagaattg crncatactt gttaaacaaa cgaaaacttc gaagcctatc 1980 aaaattcgat ctcgagctga tttgaataag cttctgacca acaagctgta attccttcaa 2040 cgctccggtt ggacatagac taatacaaca agatggagca aatcaatctt gccgtaaatc 2100 gacagttaga caacgcatcc ataccgattg cctccttcgt aggaatcact gtgttacaaa 2160 ttaatatacg tggtatcaac agctacgaaa agttagactc gctatgtatt tttctacgaa 2220 acttaagttc aacaatcgac gtcttggtaa taggagaaac gtggatcaaa caaggcagat 2280 caaattttta taacattcct ggttataaca gtacctattc ttgtcgcaac aactcatccg 2340 gtggcctagc tatatttgtt cgagatggtt taaacttcca tgttaagaat aactctacgg 2400 ataacgggta tcatcatata gcagttgaat tactgtctag tgtacgtatt atcgtacatg 2460 gcatctatcg tcctcctggc tatgatgtta atcactttgt tagtaatttg gaaaatattc 2520 tatcttcggt ggattccaaa tgtccctgct tcatactggg ggatatgaac gtagcagtca 2580 atgatgtgga gtctctgggg acacagagat acctacaact attatcatct tatgatatga 2640 tcgttacaaa cacgcacatg acacgtcctg ccagcaataa cctgctcgat catgttatta 2700 ctcaagccaa ccactccgaa tttgtcgaaa acttcacaat tgactgtaat ctgagcgacc 2760 actgctacat tctgactcag tttagtacca aaatacacaa gactagcaaa acgctgaaca 2820 aaagaatggt gaatcatcgg ttagtcaact ccaactttca aatgtttctc caatcattcg 2880 atttcaactc attgaatccc aatgatcgta tatcgcttat tactgatcat tatgcgcatc 2940 tggtaaatcg tttctcgtct acagtatcag ttaatgtcaa agttaaaacc actacctgtc 3000 cttggtacaa cttcgacatt tggaaactaa gcaaaatctc gaacaatttg tttcaaagat 3060 ggaaaagaga taggctgaat caacaactac gaagcctttt agatcacgca aacaaaatgt 3120 tagctaaagc caaacatcga gccaaatcag cttattacca acgaatcttt tcctctagca 3180 accctaagca actatggagt aaaataaacg aacttatcgg aaatacttca agcaaagaaa 3240 aagaacccgt tttagaaacc cagggcgtcg aaaccaccaa cccaaatgac attggacgca 3300 tcttcaacaa cttcttttct tccatcgggg gaagtctcgc tagcagcctg gtttcagatg 3360 gaaacatcaa taagttcaac acgatgacaa tatctaacag aacaatgttc cttcgaccaa 3420 catcgcaagt ggaagtttct accataattt ctgggctaga tgtgtcaaaa gctactggaa 3480 tagacggttt ctcggttatg gcattgaaac aaaatggcgt agcgttgtcc actataatat 3540 gcaattgctt caacgacagt atttctatgg gaatctatcc agattgtctc aaaaaggcgc 3600 ttgtatttcc agttttcaaa ggaggaaatg caaaaaatcc aacaaactat cgaccaattt 3660 ctgttctacc gtccatcaac aaagttttcg agaagctgtt atcgatgcgt ttgaatagct 3720 tcatggatgc cacaggattg ctctatcagc gccagtttgg ttttcgtcaa ggatcgtcaa 3780 cggaagtagc agtattggaa ttggctgatg atattgctag ttcaatagat cgtaaactgt 3840 ctgccggagt cgtcttttta gatctttcaa aggctttcga tacgataaat catcacattc 3900 ttctccagaa gctagatgcg tatggtattc gaggagccgc aaatgattat cttcgcagct 3960 atttgactaa ccgccaacaa caagttgtag tatctggaat aagaagcacc acctgttcca 4020 ttagttgcgg tgtcccacaa ggcagcaatc ttgggccatt gttgttcttg atatacgtca 4080 atgatatagc aaagctggaa ttaaaaggaa aactaagact gtttgcggac gacacagcga 4140 tatcttacga agcaaccaat gtatacgagt tgagcgagga tatgtcaact gatctacata 4200 tagttaccag ttatctggaa aacaacctat tagcgcttaa cttacaaaaa accaaaacga 4260 tgctatttgg agctaaggat actcaggggc atccagtttt gactattaat ggagtcgtta 4320 ttgaagaagt agaccaattc aaatatctgg gtgtgctaat tgatagccag ttgaagtggg 4380 acgcccatat acgtgaagta gttgcaaagt gttcatcatt atgtgggatt cttcgaaggt 4440 tatcatcgtt tgtaccacaa caggttctct tgaagatgta ctacgccttc atccacagcc 4500 gataccagta cggtatatct gcttggggtt caacgtacaa tacgtacttg aaggaaatac 4560 agatacagca gaatcgatgt atgaaagcca tttacagatt gccatatctg cagcctacaa 4620 acactctgta cactgaaatg gagcacaata ttcttccgat agcaggactt tacacaatga 4680 gggcaggaat aataatgttc aagatagttc acaacataaa cttgcaccac aactggattt 4740 tcaataccgc tgctcaccaa tataggacca gacaggctca tctattacaa agaagtggat 4800 tcagaactga gatagggagg aggagatttg agaatatggg gccaagaata tataatcaat 4860 taccagaaag tataaaaaat tctcattcaa ttaatatgtt taggaaatgt ttaagaattc 4920 acattaaaaa taatataaat aactttataa tacgctagaa gttagaattc cacagctagt 4980 tcgctagaca atcataaacg accctttaaa agaactatgg ttcattaggg attgtgtata 5040 atagactaag tgtgaatcta tgtgaaataa aatgaattga attgaa 5086 // ID Gypsy14-I_Dya repbase; DNA; INV; 3663 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14_Dya; KW Gypsy14-LTR_Dya; Gypsy14-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-3663 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1093-1093 (2009). XX DR Genome; chr3R; Positions 6836532 6840194. XX CC Positions [2444-2914] - Integrase core CC 'TAAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 101..2302 FT /product="Gypsy14-I_Dya_1p" FT /translation="MACLKDLIANNIKNASKAAICTLHKFVFEEEGDRNNR FT KRLREFTGFKYDENAAAYIRKCTYIDEHLNERDLTSICVVLGISHDVENAP FT EHIFRNLQYGNLLAYDENGQNDESDEESENDEDNKSEEENQIDEGNSAHGI FT GRDRRNTQDETPRFAISFRDIEESIKQFDGGDETPIEVWISDFEDQALLMG FT WNELQKLIFAKKSLKGVAKLFVLSEKRLNSWNALKKSLLSEFKSTVSSKQI FT HKQLGESKRRANENAQEYLYRMKDIASRGKVEEEALIQYVIDGIDELSLNK FT SVLYGARNLSDFKRKLKDYQIINIKSKFSDGSVKGKKWQPMFSDAKSDKKE FT IACYNCGEKGHLANSCNNKEKGRKCFKCHKFGHISKNCSQDEKSVKEHEPN FT TRVLKENSGLTSKIVSIKHQHTVPESEDLKKTTTDTIMNPEDEQKIAQLEK FT KALQWDKNRALGDLESDSDLDVHNEGIADPPSGDDMDVGDERTTLDSILGS FT TMDSESVPNLEDGLPDPELAYNEGKMYARMAANTPHKETQGMDDDKLVSCS FT TPVPNTKYHHKELVNGRSHKMVYIGPTSDMGLVNRLQDEFDALYETCPNSN FT GQGIKYQLMVMWSTPHNPTARFSLSPNQPSGKKRKIVNYAWNPEPASVFVP FT EPEVSEQSLPERQIKTYLRKGIELRGKTSDIPNFKITNETLGFSNEDLKVL FT YPDTSMYPDNPVFVLEDVFTYTKQLNKCSLHYDA" XX SQ Sequence 3663 BP; 1376 A; 687 C; 782 G; 808 T; 10 other; ttttgggggc tcatccggga taaaaccaaa aaacagtgaa aacgtgtaga aaagtgaacg 60 acgaaaaaat ataaaatttt gtttataaat tcaacgaaaa atggcgtgtt taaaagactt 120 aatagctaat aatataaaaa acgcttcgaa agcagctatt tgcacattac ataagttcgt 180 tttcgaagaa gagggagaca ggaataatcg caagcgtctg cgagagttta ctggtttcaa 240 atacgacgaa aatgccgcag cctacatacg caaatgcacg tacattgatg aacatttaaa 300 cgagagggat cttacatcaa tatgcgtagt tctcggaata agtcatgatg tagaaaacgc 360 accagaacac atatttcgga atttgcaata cggcaacctt ctcgcctacg acgaaaatgg 420 acaaaacgat gaatcagacg aggagagcga gaacgacgaa gacaacaaat cagaagaaga 480 aaaccaaatc gacgaaggaa acagtgctca tgggataggc agagaccgga ggaatacaca 540 agatgagaca ccccgttttg ccattagttt tcgggacata gaggaatcaa taaaacaatt 600 cgatggcggt gatgaaactc caatagaggt gtggataagc gactttgaag atcaagcctt 660 gcttatggga tggaacgaac tccaaaaatt aatttttgct aaaaagtcgt taaagggtgt 720 cgcaaaactc tttgttttaa gtgagaaacg gctaaattcg tggaatgcgt tgaaaaagtc 780 cctcttgagt gaattcaagt cgacagtgag cagcaagcag atacacaaac aactcggaga 840 aagcaagcga cgtgcgaacg agaacgcaca agaatatttg taccggatga aagacattgc 900 atcacgagga aaagtcgaag aagaagcact tattcaatac gtgatcgacg gtattgacga 960 actttctcta aacaaatccg ttttgtatgg tgcaagaaat ttatctgact ttaaaagaaa 1020 gctaaaagat taccaaataa taaatatcaa atcaaaattc tccgacggaa gtgtgaaagg 1080 gaagaaatgg cagccaatgt tcagtgatgc aaagagtgac aaaaaagaga ttgcttgtta 1140 caactgcggt gaaaagggac atcttgctaa tagctgtaac aataaagaga aaggaagaaa 1200 gtgtttcaag tgccacaaat ttggacatat ttcgaagaat tgttcacaag acgaaaaaag 1260 tgtcaaagaa catgaaccaa acactcgagt tttgaaagaa aacagtggtt taacgagcaa 1320 gatcgtttcg attaaacacc aacatacagt cccagaatcc gaagatctga agaaaacaac 1380 aacagatacc atcatgaatc cagaagatga gcagaagata gcacaacttg agaaaaaagc 1440 actccagtgg gataagaaca gagcactagg tgacttggaa agtgactcgg acctggatgt 1500 acataacgag ggcatagcgg accctccttc aggcgatgat atggatgttg gtgatgaacg 1560 aaccactcta gatagtatat tgggctctac tatggactcg gaatcagtcc cgaacttaga 1620 ggacggactc cccgatcctg aattagcata caatgagggt aagatgtatg cgaggatggc 1680 cgccaatact ccacataaag agacacaagg gatggatgac gacaaactgg tgtcttgttc 1740 tactcctgta ccgaatacta aatatcatca caaggaatta gttaacggtc ggagtcataa 1800 gatggtttat ataggtccca cttctgacat gggactcgtc aataggttac aggatgagtt 1860 cgatgcactt tatgagactt gccccaattc taacggccag gggataaagt atcagttgat 1920 ggtcatgtgg agcacccccc ataacccgac tgcgcgcttt tctttatcac ccaatcagcc 1980 atcagggaag aaaaggaaga tcgtcaacta tgcatggaac cctgagccgg cctcggtgtt 2040 cgtacctgag cccgaggtgt ctgagcaatc ccttcctgaa cggcagatta aaacatatct 2100 caggaaagga atagagctta gaggaaaaac atctgatatc cctaatttta aaatcaccaa 2160 tgaaaccttg ggcttttcga atgaagatct taaagtatta taccctgaca catctatgta 2220 cccggacaat ccagtattcg tcttggagga tgtattcaca tataccaagc agttgaataa 2280 gtgctccctc cactatgatg cttagattga cttagattag atttaaccga gcatgaaaaa 2340 aaaaaaaaaa aaaaaaaaaa atatatatat tgactgctgc ataccgtgca ttttaagcaa 2400 ccgcaaacaa ggcaagcagg agggcttact acatccatta caaaaagaag attttccatt 2460 gcacacttac catattgact ttttaggccc attagaatgg acaaacaaaa attacaagca 2520 cattctagct gttgtagatg cttttacgaa gttctgctgg ctttacccaa caaagtcaac 2580 aactgctact gaagtcatag caaaattacg aggacaaagt aacgtttttg gcaaccccgt 2640 ctgtataatt tcagacagag gatcagcatt tacatcccaa gattttttac aatactgcga 2700 ggaagaagac ataaaagtag taaaatcaac gacaggttta ccgcgtgtaa acggacaggt 2760 ggaaaggatt aacgcaatta ttataccagt tctttccaag ttaagtgtag atgatccaac 2820 gaaatgtata gacacgttga tcgagttcaa caagccatta actcaacttt cgcgagaagt 2880 atcaacacaa caccctttga acttcttatc ggtgtaaaaa tgagaagcaa agatgataag 2940 caacttcacg acttaatcag gaaggaatca ataagcctgt ttatcgacaa tcgcaacgac 3000 cttcgcctta aagctaaaac acagattttg aaattgcaag ccgaaaacaa aaagacgtat 3060 aacttacgac gtaaacctgc acgaatgtac aacgttggcg acctagtagc cattaaacgt 3120 acacagtttg gcggcggact caaattaaaa tcaaaatatc tgggcccgta tgaaatcacc 3180 aaagtaaagc acaacaatgc gtacgatgtt caaaaaattg gaaattcgga cggcccaaaa 3240 aattcatcaa cttgtgctga atatatgaag tcatggccca ttcatgatgt aaatgatgaa 3300 aacgagtccg aaacgatgta aaaactcgac gagaacgaca taacgaataa agatggcgaa 3360 acgacaagaa aatacggcga atatgagaat gaaataacga atgaaagacg aaagatgtaa 3420 agacggtgaa aaagagcgta gggaagttaa gccaaaaaaa aaaagacgaa aaataaagac 3480 gaaacgatgt atagnnnnnn nnnncagagc cctgtgccga aaaataaaga cgataaagat 3540 gtaaaaacga cgacgacgaa tactagagag aaaacctatg aagaagacaa ccaagcgatg 3600 aaaacgacga cgaataagat gacgacgaaa cattcgagac gaatgtttgt caggatggcc 3660 gaa 3663 // ID Mariner-22_HM repbase; DNA; INV; 3049 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-22_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3049 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1956-1956 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 605..2347 FT /product="Mariner-22_HM_1p" FT /translation="MPQNYKPNRGRRKYVTYTLEQLDNALKDLKNGLITQR FT QAALKYDIPRSTLKNKIKQTYPKKYGGQQIFTSKEEDMFKAYAIKSSEFGF FT PVDKYDLRCIVKGYLEKKGVIIPQFKNNFPGGDWIRSFLKRNPELTVRFAS FT NIKRKRAEIGTTVIDNYFINLSNEISNISPDNIWNYDETNLSDDPGKKKIL FT TKRGCKYPERIINSTKTSFSVMFCGNANGELLPPYVVYKAESLWNTWMEHG FT PPKARYNRSKSGWFDSTCFEDWFFSLLLPRLKKAQGRSVIIGDNVSSHLSI FT AVLDACQSNNIGFVALPANSTHLTQPLDVAYFRPMKINWRKILCEWKEKGK FT GRRVASLPKDEFPRLLDRLITNLNEHGNDNLRAGFRKTGIFPLDKSQVLSR FT LPCCNVGLDSTTDLVSQSFLDHLCKSLDDPSDGSKKPKRRKVCAVPGKSLC FT STDISLCVINENNRLNSNNVDTPISIINTIETNNIDFAGPSTNTGTTETSN FT LIFADVSLVVESHTESLDHSALGKKNFQNIDKTKNVCKKEKLLNLVENCYV FT IVKCNGELNPGQVCLFILSYLYLANQLVNFITN*" XX SQ Sequence 3049 BP; 1066 A; 435 C; 519 G; 1029 T; 0 other; ccgtaaaatt gggtggcttt ggccctacgg gatgacttta gcccaacaga aatttttgtt 60 taaaaatggt ttaaaatcga taccttaagg tatgccggtt gatgtttagt agcaaattgt 120 gggaaattta ttatttattt agtttaactt agtttactta actcgagtct ttgtgatatt 180 attgtaaatt gcactttaga agaagcacct aaaatccgaa tttttttagg tgtgtaaact 240 tttacatcga tagcagtacc tgtagaaggt tttttattgg aatatacact gaattaattt 300 taaaccatta gtgtttgatt ctatcttgtt ttgaaagctt aacagtttat tacttttttg 360 taattgaggt taaaaaaaaa taggttgtct ttggcccact acatagggtg acattggccc 420 gggctaaagt caccccatgt attagagtaa acctttttta gtgatcattt atagatatta 480 tggtagttac tatgttggaa tgaaaaattt tgtagtagta atgcaataga aaatcagtat 540 aacagtttgt attagtaaat cagtataaaa ttatagtgaa atttgtttag ttcttagttt 600 aaatatgcca caaaattaca agccaaatcg tggaagaaga aaatatgtta cttatacttt 660 agaacagtta gacaatgcat taaaagattt aaagaatggt ttaattactc aaagacaagc 720 agctttgaaa tacgatatac caagatcaac attaaaaaat aaaataaaac agacatatcc 780 taaaaagtat ggtggtcagc aaatatttac atcaaaagaa gaagacatgt ttaaggcata 840 tgcaataaaa tcatctgagt ttggttttcc tgttgataaa tatgatttaa gatgcatagt 900 aaaagggtat ttagagaaaa aaggcgtcat tatacctcag ttcaaaaata attttccagg 960 tggtgattgg attcgatctt ttttaaaaag aaatccagaa ctcacagtaa gatttgcaag 1020 caacataaag cgaaagagag cagaaatagg aacgactgtg attgataact actttataaa 1080 tctatcaaat gaaataagta atatatcacc tgataatata tggaactatg atgaaaccaa 1140 tctcagtgat gatccaggta aaaaaaaaat attaacaaag cgtggatgta agtatcctga 1200 acgcatcatt aattctacaa agacctcatt ctctgtaatg ttttgcggta atgctaatgg 1260 tgaattgcta ccaccttatg ttgtttacaa agctgagtcg ttatggaata catggatgga 1320 acatgggcca ccaaaagcac gttataacag atcaaaaagt ggttggtttg actcaacatg 1380 ttttgaggac tggtttttct ctttacttct tccaagactt aagaaggctc aaggaagaag 1440 tgtcattata ggtgataacg tatcttcaca tttgagcatt gcagtactgg atgcatgtca 1500 aagcaataac ataggatttg tggcattacc agcaaactct acacatctca cgcagccatt 1560 agatgttgcc tactttagac ctatgaagat taactggcgc aaaatattat gtgaatggaa 1620 ggagaaagga aaaggaagga gagttgcttc tcttcctaaa gatgaatttc ctagactttt 1680 agaccgttta attaccaact taaatgaaca tgggaatgat aaccttagag ctggttttcg 1740 caaaactgga atttttccat tagacaagtc tcaagtgctt tcacgactac catgttgtaa 1800 tgtaggttta gactcaacta ctgatttagt aagccaatca tttttagatc atttatgtaa 1860 gtcactggat gatccctcag atggttctaa aaaaccaaag cgacgtaaag tatgtgctgt 1920 cccaggtaaa agcttatgtt caacagatat atcattgtgt gttataaatg aaaataatag 1980 attaaattca aataatgtag atacacctat cagtattata aatacaattg agaccaacaa 2040 cattgatttt gctggcccaa gtacaaatac tggaactact gagacttcta atctaatttt 2100 tgcagatgta agtttagttg tagaaagtca tacagaatct ttagatcatt cagcattagg 2160 caaaaagaac tttcaaaata ttgataaaac aaaaaatgtt tgtaaaaagg agaaattatt 2220 aaatcttgta gagaattgtt atgtaattgt taaatgcaat ggagagttaa atcctggaca 2280 ggtttgttta tttattttga gttatttata tttagctaat caattagtaa actttataac 2340 taattaattt aataactaat attatttaat gaaactactg aaaaatttta aaaaaacttt 2400 cgaaattttt ttttctgcag ttgaaaaaat taagaaaaaa tggagcagaa attactatta 2460 tgaaaaaaaa tggacttaac tggaaatggt cactaccagc agagcaactt ttttttcccc 2520 agtctgaaat agtcgatttt attgaggaac caaagaaaat ttcgaaaaga ggaatttatt 2580 cagttccaga actttttcac taaatttttt aaaacattat ttttgtcatt tttgtttact 2640 ttatacctaa actgttttgt ttacttggta cctatacttg tttagtttgt acctatattt 2700 tatttactgt acacaagaac tttgttacta aatctttttt tgtcacttta tttttgtaaa 2760 tttattttct ttgcctaaat attgctgttt ttattgttgt tttgcttcct ttagttcctt 2820 tttacagggt tgggccaaag tcaccctaac acatagggcg actttggccc accatttaaa 2880 ccacccaaaa gatacttcag gtaaataatt ttcaaatggg agacagatag ttctaactgt 2940 cctttctatg cgcttctaaa agtacttagt gtttgaggaa ataacttttg tggctttcat 3000 gcaatagaca ttaagatgca aaatgggcca aagccaccca atcttacgg 3049 // ID BEL-178_AA-I repbase; DNA; INV; 6256 BP. XX AC AAGE02028718; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-178_AA_; KW BEL-178_AA-LTR; BEL-178_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6256 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028718; Positions 1462 7717. XX CC Positions [5303-5863] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1013..6256 FT /product="BEL-178_AA-I_1p" FT /translation="MTNSEDKVRSWLHRSGEQTEGKISSTSAAKDISNIPK FT NLGDTVINKVRVPRITRKSNYLFELPNEAQAVGNNKDVSYSLHPELTGRNA FT AQGDSGLYDLAENSPTTRQLAARQVMGKDLPTFSGNPEEWPIWISNFERST FT VTCGFSLDENLIRLQRSLKGPAMEMVRCRLLSPASVPHVIKTLQMRYGRPE FT TLIRALTEKIRQLPPPKMDNLNSIVDFGLAVDSLVEHLKTARQQAHLSNPS FT LLHDLVAKLPVEYRLKWSSYKSDIPVVNLATFGSFMSSLVELAFEVMDDQP FT TAAKTVQQKPNNRSFVQTHVEITSADFDSVQSNSNTPANNPAKKVCVVCQK FT EGHKVAKCYTFQAMDIDHRLKVVGQHMLCRTCLNQHGKWPCKTWKGCGIEG FT CRLRHHTMLHTATPAVPFVVSTSHSEPLSTGSPIFRIIPVTLHAKERRMKV FT FAFVDEGSEISLLDVAVAEQLGITGPSSTLSLQWTGNVTREETNSRVIQME FT ISGETSNDRFKIIDARTVKGLQLPSQSLWYQNLSRKYPHLRGLPIADYENV FT TPKLLIGLDNLKLTIPLKVREGGWGQPMAAKCRLGWSIYGCSRSAKESYTC FT GFHVGGWTNLEGELNQLVRDYITLDNTGVQPPLTPLVSEESKRAQQLLLST FT TRRVGNRFETGLLWKWDDIQLPDSYGMALRRLHSLEKRLNKQPILYDSVRQ FT QIKEYQEKGYAHQATEEELANTRPEKSWYLPLGVVLNPRKPNKVRIIWDAA FT AKVNGVSLNSVLLKGPDYLTSLIEVFYHFRLYAVALTGDIKEMFHRLFIKR FT EDRQFQRFLWRDNGSSNVEVFVMDVAIFGATCSPSSAQHVKNMNSKEFENE FT CPRAAAAIIRFHYVDDYLDSFPTAAEAVEVGSEVRRIHAAGGFEIRNFLSN FT DASVAERVGAISNEAGKTIKPEKNDNVESVLGMKWIPAEDVLTYTFLVRAD FT LGHVLDPSHTPSKREVLRVVMSLFDPLGLISYYVIHGKTLMQDIWASGVNW FT DEPISIELCEQWRRWTALLPELNSIRIPRCYFVAADYSSYASLQIHVFVDA FT SKSAYACVVYFRVESEEGPAVSLVAGKAKVAPLKIMSIPRLELQAAVLGTR FT LLNSVVAMHNLPISRGVLWTDSQTVLAWLQSDQRRYQQFVGFRVAEILSTT FT DVQEWRKVDTELNVADLATKWGKGPKFEADNPWFRGPKFLLEPENLWPKQE FT AVPCTTNEEVRLVGFHVEPTPPLIEVGRFSKWEKIHRSTAYVHRFINNLKR FT KKLGEQLELGVLKQQELADAEITLYKQAQQESFSAEIKQLNGMEKSMQLRH FT SAVPRSSNIFKLWPFQDQLGVLRMRGRIGAAPHVPFAAKFPTILPQQSLIT FT FLLVDKFHRRFRNANRETIINEMRQEFYIPKMRALVAKVARNCMFCRLRKA FT VPQLPPMAPLPKARLTSFVRPFTFVGLDYFGPVLVKVGRSNAKRWIALFTC FT LSIRAVHMEIVHSMSTESCIQAIRRFVSRRGSPAEFFTDNGTNFHGANNQL FT KREIEERNRKLASVFTNSNTRWSFNPPGTPHMGGVWERLVKSVKDAIKLTL FT DESRNPDDETLETVIIEAEGMINTRPLTYIPLESADQEALTPNHFLLGSSS FT GVKQLPVLPTDFRATLRSSWNLAKHLADEIWRRWITEYLPVISRRCKWFDN FT VRDLKEGDLVLVVDGAIRSQWTRGQVERVVPGADGRVRQAWVRTSGGIDRR FT SVAKLALLEVVTESEPNQEDGSDSRVGG" XX SQ Sequence 6256 BP; 1783 A; 1375 C; 1605 G; 1493 T; 0 other; gttaaattaa ttattcctaa ataatttgct aaagcctact taattctacc ttaattgaat 60 tacaagtgag tacaattaga atttgttaat tataactaag gatcatgcaa atttattatt 120 ctagtaattg gataacgtaa cttggaagac acgatagagg aagaattttg ttgtgaagat 180 tactgaaaat ttgtaagtcc taatgaattc atgcacatgc aaatatcatc taactaaata 240 aacttacagc taaagtatat cccacaatca cacttgtttg gattggcctg gggttgctat 300 aagagtttcg gtgacaacaa tttctttaga aattttcaac gcaaaatccc catctgcgag 360 atgagtcttc ctggagaaaa gggtggatca ccttcgaatt gcgtggcgtg tgatcgcccg 420 gacagcagcg aggacatggt ggcgtgtgat ggttgcgggt cctggtacca ctacagctgc 480 gcggaagttg atggaagcgt cgccaacctt ccgtggacgt gtggatcgtg cgggatatta 540 aaccaaacac cgaatccatc gaagggcatc cggaaggctg acggaaagaa aggaggtaaa 600 cggctcacgg tccctggcgg gacagcgagg tcatcgaaag cgtccgtcgt cagcaaagga 660 acttcgcgga agagtaagaa gaccgtagcg gatgataacg tgagttctac ttctagtgct 720 agggcaaggc tcgcactcga gctgaagatg gtagacgaac aggaccagat tagggagcaa 780 gaacttcagg ctgagcttga gctaaaaaat aaaaagttat tgctagagaa gcagagcagg 840 atcgcgagtt ggctctagaa gccaggaggt tagctgagaa aagcctttca agaaagacat 900 tagaagaaga ggagaaaatc gtaagcgcta tagaatgaga cgctgtcgct gaacgagaaa 960 gcatcgttag gcatttctca cgcggtagtg agtttcagtg tagtgcagtg agatgactaa 1020 ttccgaggat aaagtgcgaa gttggctgca tagatccggc gagcagacag agggaaaaat 1080 atcatccacg tcagcagcaa aggacatcag caacattccg aaaaatctcg gagacacagt 1140 tatcaacaaa gttcgtgtac ccaggatcac tcgaaaatcc aactacttgt ttgagcttcc 1200 caatgaggca caggcggtag gaaacaataa ggacgtatcg tacagcctac acccagaact 1260 gacaggcagg aatgcagccc aaggtgatag tgggttgtat gatttagccg aaaatagtcc 1320 gactactcga cagttagcag cccgccaagt aatgggaaag gatctaccaa cgttttccgg 1380 gaatccagag gagtggccta tctggataag caactttgaa cgctccactg tcacctgtgg 1440 attttctttg gatgaaaacc tgattcggct ccagcgcagt ctcaaaggcc cggcaatgga 1500 aatggtacgt tgcagactgc tttctcctgc tagcgttccc cacgtcatca agacgcttca 1560 gatgcgatac ggtcgtccag aaacgctgat aagagcacta actgaaaaga ttcggcaact 1620 tccccctccg aaaatggata atttgaacag tatcgtcgac ttcggtttgg cggtggacag 1680 tctggtggaa cacttgaaga ctgcgaggca acaggcgcac ttgtcaaatc cttcattatt 1740 gcatgaccta gttgcgaagc ttcctgtgga ataccggttg aagtggtcgt cgtacaaaag 1800 tgacattcca gtagtaaatc ttgccacatt cggaagcttt atgtcttcgc ttgtagaatt 1860 agctttcgaa gtcatggacg accagccaac agctgcaaaa acagttcagc aaaagccgaa 1920 caaccgaagc tttgttcaga cacacgtgga aatcacttcg gcagattttg attcggtcca 1980 gagcaattcg aatacgccag ccaataatcc agcaaagaaa gtctgcgttg tctgccagaa 2040 agagggacac aaagtagcaa aatgttacac ttttcaggcg atggacattg atcatcgcct 2100 taaggtcgtt ggtcagcata tgctctgtag aacctgcttg aaccaacacg gcaaatggcc 2160 ttgtaagaca tggaaagggt gtgggataga aggctgccgt cttcgtcacc atacgatgct 2220 gcatactgcc acgccagcgg ttccttttgt agtgtctact agtcattcgg agccattaag 2280 tactggaagt ccgattttcc ggattatacc tgtaacattg cacgcgaaag aacgccgaat 2340 gaaagtgttc gcatttgtag atgaaggctc ggaaatttcg cttcttgatg ttgcagtagc 2400 tgagcagtta ggaatcactg ggccatctag cacgttgagc ctacagtgga caggaaatgt 2460 tacgcgggaa gagacaaact ctcgtgtcat ccagatggag atctctggcg agacgtcgaa 2520 tgaccgcttc aagattatag atgcacgaac ggtaaagggt cttcaattgc cgtcacaatc 2580 gctgtggtac cagaatttgt ctagaaagta tccacacctg cgaggcttac cgatcgccga 2640 ctacgaaaac gttacaccga aacttcttat tggtcttgac aacctgaaac tcacgattcc 2700 attaaaagtt cgagaaggag gttggggcca accgatggca gccaaatgcc gtctaggttg 2760 gagcatctat ggctgctcac gttcagccaa ggaatcatac acttgtgggt ttcacgttgg 2820 aggatggacc aatttggagg gtgagttgaa ccaactcgtc cgtgattaca taacgctgga 2880 taatactggt gtgcaacctc cgctcacacc tctagtgtcg gaggagagta aacgggctca 2940 acagctactt ctgtccacaa cccgtagagt tgggaaccga tttgagaccg gcttattgtg 3000 gaagtgggat gatatccagc ttcctgatag ctacgggatg gcgcttcgtc ggctacactc 3060 tttagaaaaa cgactgaaca agcaaccaat actctacgat agtgtccggc agcaaattaa 3120 ggagtaccaa gaaaaaggct acgctcatca ggctaccgaa gaagaactcg ccaatactcg 3180 cccggagaaa agttggtacc tcccactagg agtggtgctg aaccctagaa agccgaacaa 3240 ggtgcggatt atctgggatg ctgctgccaa agtgaacggg gtgtcactca attcggtact 3300 cctaaagggt cccgattatc tcacatcgtt gatcgaagtc ttctaccact tccggcttta 3360 tgccgtagct ctaactgggg acatcaagga gatgtttcat cgccttttca tcaagcggga 3420 agaccgccaa ttccagagat ttctatggag ggataatgga tcatcaaatg tagaagtctt 3480 cgtcatggac gtcgccattt tcggtgcaac gtgctcccca agctcagcgc aacacgttaa 3540 aaacatgaac tcgaaggaat tcgagaacga atgtcccagg gcagcggcag caatcatccg 3600 atttcattat gtcgacgact acctcgatag ttttccgaca gcagctgagg cagttgaggt 3660 cggaagtgaa gtacgaagaa tacacgctgc cggcggattc gagatcagga attttctttc 3720 caatgatgca tcggttgcag agagagtagg agcaatatcc aacgaagccg ggaagacaat 3780 caaaccggag aaaaacgata acgtggaatc tgttcttggc atgaagtgga taccagctga 3840 agatgttctc acctacacct tcctagtacg tgccgatctt ggacatgtat tggatccttc 3900 ccacacgccg tccaaacgag aagtgttaag agtggtaatg agtttgttcg atccgctagg 3960 gttaatcagt tactacgtaa tccacggtaa aactctaatg caggatattt gggcatccgg 4020 agtgaactgg gacgaaccta taagtatcga attgtgcgaa caatggcgtc ggtggacagc 4080 gcttctaccg gaacttaatt ctatccgaat tcctcgttgc tatttcgtgg cagctgacta 4140 ctcctcgtat gcttcactac aaatccatgt tttcgttgat gccagtaaat cagcgtacgc 4200 atgtgttgtg tactttcgag tagagtccga agaaggaccg gcggtatcat tagtagccgg 4260 gaaagcgaaa gttgctccac tgaaaattat gtccatccca cgattggagc ttcaagctgc 4320 agtgttgggc acaagactgc tcaatagtgt cgttgccatg cataatcttc ccatcagtcg 4380 tggtgtattg tggacggact cgcaaacggt tctcgcttgg ttgcaatctg accagcgaag 4440 ataccaacaa ttcgtgggtt ttcgggtggc agagatactg tctacgacgg atgtacagga 4500 atggaggaag gtagacactg agctcaatgt ggcagatctt gcaacgaaat ggggcaaagg 4560 accgaaattc gaagcagata atccctggtt ccgaggacca aaattcttat tggagccaga 4620 aaatctctgg ccgaagcaag aagcagtgcc atgtaccacg aatgaagaag ttcggcttgt 4680 aggattccat gtggaaccta ccccaccttt gatcgaagtt ggcagattca gcaagtggga 4740 aaaaattcac cgtagcacag cgtatgttca tcgcttcatc aacaatctca agcggaagaa 4800 acttggtgaa cagttggagt tgggagtgct gaagcaacaa gaattagccg atgctgaaat 4860 cacattatac aagcaagcgc aacaagaatc cttttccgct gaaataaaac agctcaacgg 4920 aatggagaaa tcaatgcaac ttcgtcattc tgcggtgcct agatccagta atatcttcaa 4980 gctgtggccg ttccaggatc agttaggggt gctgcggatg cgaggcagga ttggtgcagc 5040 accgcacgtt ccgtttgcag caaagtttcc cacaatttta cctcaacagt ctctcatcac 5100 gttcctcctg gtggataaat tccaccgtcg ttttcgtaac gctaataggg aaactatcat 5160 aaacgagatg agacaggagt tttacatccc gaagatgcga gcgttggtgg cgaaagttgc 5220 gcgaaattgc atgttctgcc gtttacgcaa agctgtacct cagctacccc caatggcccc 5280 actgccgaaa gctcgactaa cgtcgttcgt gcggccattt actttcgttg gcctagacta 5340 tttcgggcct gtgttagtta aagttggacg tagtaatgct aagcgctgga ttgcactctt 5400 tacgtgcctg agcatccggg cagttcatat ggagattgtt cacagtatgt ctacggaatc 5460 gtgcatccaa gcaatacgtc gttttgtctc ccgtcgaggt tcacccgctg aattttttac 5520 tgataacggg acgaacttcc atggagccaa caatcagttg aagagagaga tagaggagcg 5580 taaccgtaaa ttagcatcgg tgttcaccaa ttccaatacc cggtggtcgt ttaatccacc 5640 tggaacgccc cacatgggcg gggtctggga gaggctggta aagtcggtaa aagatgctat 5700 caagttgacg ttggacgaat ctcggaatcc ggacgatgaa actctggaaa cagtaatcat 5760 cgaagcagaa ggaatgatca atacacgacc gttaacatac attcctttag aatccgcaga 5820 ccaagaggct ttaacgccga accatttttt gctggggagt tcatcgggtg ttaaacagtt 5880 accagttcta ccaacagact ttcgtgctac tttgagaagc agttggaact tggcgaaaca 5940 cctggctgat gaaatttgga gacgatggat cactgaatat ctaccagtga tttcgagacg 6000 ctgcaagtgg ttcgacaacg tgagagattt aaaagagggt gatttagtgt tggtggtaga 6060 tggcgcaata cgaagccaat ggaccagagg acaagtagaa cgagttgtgc caggagcaga 6120 cggacgtgtt cgacaagcat gggtccgtac gagcggcgga atcgatcgca ggtcggtcgc 6180 taaattagcg ctgcttgaag tagtgacgga aagtgaaccc aaccaagagg atggtagcga 6240 ttcacgggtg gggggg 6256 // ID Gypsy-168_AA-I repbase; DNA; INV; 5246 BP. XX AC supercont1.332; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-168_AA_; KW Gypsy-168_AA-LTR; Gypsy-168_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5246 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.332; Positions 1237613 1232368. XX CC Positions [4037-4507] - Integrase core CC 'CATGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 947..5083 FT /product="Gypsy-168_AA-I_1p" FT /translation="MNKVPDEEKRDHFITLSGPTIFRELKLLYPNSNLAEV FT PYKEMVDKLKARLDKTESDLVQRLKFNVRVQQPDESLEDFVLSVKLQAEFC FT NFENFKQMAIRDRIVAGVRDKSLQQRLLNEEKLTLETAEKLIATWEIARNN FT AKNMDYSNSTEQIASLKASGFPGARLHKLATTMEAAARANSSAKVDQSRGP FT VKSRLGYSPYKKDRWQHKQWQTRSRGNGKEERNRPDYSQMVCDFCGVKGHI FT KRRCFKLKNMQRDAVNMIDPDTSGSNPDDFLSNIVSRMRADSDSENDAEGE FT KNVFKCMHVLSIGKISDPCLLSLKIENVFVEMEVDCGSSVTVMSKQQYFEK FT FSKKLTHSQRKLVVVNGTSLVIEGEVEVLVHFKGISTKLKLLILNCENNFT FT PLLGRPWLDVFFPDWRTFFVNSVGSSPQTNQPIVDEIKVKYKDVFVKNFSS FT PIQGFEADLVLKSEVPIYKKAYDVPYRLRDKVLEHLTKLENEKVITPIKTS FT QWASPVIVVMKKNNEIRLVIDCKVSINKFIIPNTYPLPTAQDVFAGLAGCK FT VFCSLDLEGAYTQLSLSEMSKKFMVINTVKGLYTYNRLPQGASSSASIFQQ FT VMDQVLNGIEHISVYLDDVLIAGKDFDDCKEKLFLVLGRLQNANIKVNWNK FT CKFFVTELTHLGHVISEKGLMPCLDKIETIEKAKVPQNVTELKSFLGLINY FT YHKFIPNLSSKLYYLYRLLKNDVKFCWDANCNKAFEESKHELVEAKFLEFY FT DPNKPIVIVPDASGYGLGGVMAHLVEGIEKPIYFTSFSLNAAQQKYPILHL FT EALALVCTVKKFHKFLFGKKFFIYTDHKPLVGIFGKEGRNSIYATRLQRFV FT LELSIYDFEIQYRQSSRMGNADFCSRFPLDQAVPVECDVELVNGINFGREL FT PLDFSVIASKTKEDMFLQNVISFMTKGWPTKVNKQYKDVYANQQDLELVDE FT CLLYQNRVVIPVSMKIKILKLLHANHAGIVKMKRLARQCVYWFGINSDIED FT YVTACDTCNSMMIVPKTKTVSKWIPTTRPFSRIHIDFFYFGHRTYLLVVDS FT YSKWVEVELMRNGTNCDKVLKKLVVLFARYGLPDVLVSDGGPPFNAHSFVN FT FLKRQGINVLKSPPYNPSSNGQAERLVRTVKDVLKKFLNDPEFSHLDVEDQ FT INLFLINYRNSCLTSEGNYPSRLIFSYKPKTILDLINPKTHYKKFLQGGTV FT HEDLTTSIKLFPGHDDINTKDCGNVPKNSIDPFENLMPGEELWYKNHNPHH FT TAKWLKANFIKKHSRNTFQIQIGSVVTMAHREQLRVYRGGDSHEKPNIRVV FT RRQPGADKDKDSSQEFFGFPEDEVQKEKRYVRKRKCTEKVEHTPELAPRRS FT KRQRTDKRNDDFVYAK" XX SQ Sequence 5246 BP; 1659 A; 869 C; 1151 G; 1567 T; 0 other; gttggcgacg agggtaaaag tgtatagtga aagtgctggg gaaatagtga atttcttcaa 60 caccgacgga gtgacggacg tcattacgca gcagtagcag cagcagcaga aaatttcgtg 120 gagtcccctt gtgagcgtga ttttccgacg gccaccatcg ctgcgattgg taattgattg 180 tccgcttcct cgcaccgtgc agtggtgaca cttcattccg gtgtggtaag ttttgttttc 240 gccgaaaagt taagaatttg gtgaattagg cgtgatttag tcatcggtgg cgtgagttcg 300 acaccatttt gaacaacgag catctgctgc ttatggtttc ttattttcta ttcataaaag 360 aaataataaa ttgacgaatt gaaaagaaat tgtatgaaac aaaggtgtca ttatattctt 420 ttattaagag agttttttct ttttatttta ctagacagtg ataattgttc accgggatcc 480 gccatcgctg tgcacccttc cgtgcttctg gcattccaat aaaaccatca aggctgtcgt 540 attgaaccag ttcgatcgac ctctggatcg accgattctt cgtttatccg ccttggccat 600 ttctggtttg taaaccttca cgtcaactga cggcttgcgt ggttgccgaa gggcgaagac 660 cgctaacagc aaacccttat ctctaccctt gtcggcaaag gctgaaggac attccgggtg 720 agtttttaat ccttttatct attggggaat caaaagtgca tttcattaca gtagacactg 780 ttttctttta gaaacaattg ttcttttcgg tagttgcaac cattttcctt agattttttt 840 tggtaaatta aaatggccca aaccaacatc aacagtacaa tagaaccttt tcgtaagggc 900 aattcgtttg gggattggat tgaacgtttt ggggtttttt tttaacatga ataaagttcc 960 cgacgaagaa aagcgggatc atttcattac cttaagtggc cctacaattt ttagggaact 1020 caaattgcta taccctaata gcaatttagc tgaagttcct tataaggaaa tggttgacaa 1080 attgaaagca cgtcttgata agaccgagtc cgatcttgta caaagattaa aatttaatgt 1140 aagagtgcag cagccggatg aatctttaga ggatttcgta ctgtctgtca aactccaagc 1200 agaattctgc aattttgaga acttcaaaca gatggccata cgcgaccgta ttgttgctgg 1260 cgtccgggac aaatctctgc aacaaagatt gttaaatgag gagaagctaa cattagaaac 1320 ggcagaaaaa ctgattgcca cttgggaaat cgccagaaat aatgctaaga atatggacta 1380 cagcaacagt acggaacaga ttgcatcttt gaaggcttct gggtttcctg gggcaagatt 1440 gcacaaatta gcgacgacaa tggaagcagc ggcgagggcc aattccagtg cgaaagttga 1500 tcaatctcgt ggaccggtca aaagccgatt aggatattcc ccttacaaaa aagataggtg 1560 gcaacacaag cagtggcaga ccagaagtcg tggaaatggc aaggaggaaa gaaaccgtcc 1620 tgattactca cagatggtgt gcgatttttg cggtgtcaag gggcacatta aacgaaggtg 1680 cttcaagctg aaaaatatgc aaagggatgc cgtgaatatg attgaccctg atacttctgg 1740 ttccaacccg gacgattttt tgagtaatat tgtcagcaga atgcgagcgg attcagacag 1800 tgagaatgat gcggagggtg agaaaaatgt ttttaaatgt atgcatgtgt tgtctattgg 1860 caaaataagt gatccttgtc ttttgagttt gaaaattgaa aatgtttttg tggaaatgga 1920 agtggattgt ggttcgtctg taactgttat gagcaaacaa caatattttg aaaaattttc 1980 aaaaaaattg acgcatagtc aaagaaagct agttgttgta aatggcacga gtcttgtgat 2040 agagggagaa gtagaagttt tagtacattt taagggaatt tcgacaaaat tgaagctttt 2100 gatactaaac tgtgagaaca attttacgcc tttattaggt aggccatggc tggatgtgtt 2160 tttcccagat tggaggacat tttttgtaaa ttcagtgggt tctagtccac agacaaatca 2220 accaatagtt gatgaaatta aagttaagta caaagatgtt ttcgttaaaa atttttcgtc 2280 cccaattcag ggttttgagg cggatttggt tttgaaatct gaagtaccga tttacaagaa 2340 agcttacgat gtgccgtata gattacgtga taaagttttg gaacatttga cgaaattgga 2400 aaatgaaaaa gtaataacac caattaaaac cagtcaatgg gcttctccag taattgtggt 2460 aatgaagaag aataatgaaa taagattggt gatagattgt aaagtttcga ttaataagtt 2520 catcattcca aacacatatc cattacccac tgctcaggat gtttttgctg gtttggctgg 2580 ttgcaaggtt ttttgctcat tagaccttga gggtgcctat actcaattgt ctttgtcgga 2640 aatgtccaag aagtttatgg taataaacac tgtaaaagga ctttacactt acaaccgctt 2700 gccacaaggt gcatcatcca gtgcttcgat ttttcagcag gtgatggacc aggttttaaa 2760 tggaattgaa catatttcgg tatatttaga tgacgttttg atcgctggaa aggacttcga 2820 tgattgcaag gaaaagctat tcttagtact tggtagactt caaaatgcca atattaaagt 2880 aaattggaac aaatgcaagt tttttgtaac agaactgact catttggggc atgtgattag 2940 tgagaagggt ttaatgccat gcttagacaa aattgaaaca attgaaaaag caaaagtgcc 3000 tcaaaatgta acagaactta agtcttttct agggttgatc aattattacc ataagttcat 3060 cccaaatttg tcatccaagc tctattattt gtatagatta ttgaaaaatg atgtaaaatt 3120 ttgttgggat gccaattgca ataaagcctt tgaagagagt aaacacgaat tggtggaagc 3180 aaaatttttg gaattttatg acccaaataa accaatagtt atagtgccag atgcctcagg 3240 atatggttta ggtggtgtaa tggcacattt agttgaagga attgaaaaac caatctattt 3300 tacatcgttt tcattgaatg cagctcaaca aaaatatcca atccttcatt tggaagcact 3360 agctttggtt tgtacagtga aaaagtttca taagtttttg tttggtaaaa agttttttat 3420 ttacacagac cataagccat tagtcggaat tttcggtaaa gaaggaagaa actcgattta 3480 tgccaccaga ctacaaagat ttgtgctaga actatcaatt tatgattttg agattcaata 3540 tagacaatca agtcgcatgg gaaatgctga cttttgctcg cgtttccctc tagatcaggc 3600 agttcctgtc gaatgcgatg tagaattggt caacggtatc aattttggta gggaactccc 3660 tctcgacttt tcggtgatag ccagtaaaac gaaggaagat atgtttttac aaaatgttat 3720 atcattcatg acaaagggtt ggccaacaaa agtaaacaag cagtacaaag acgtgtatgc 3780 taatcagcaa gatttggaat tggtagatga atgtctgtta tatcaaaata gagtggtaat 3840 acctgtatca atgaaaatta aaattttaaa acttttgcat gccaatcatg cgggaatcgt 3900 taaaatgaaa cgacttgcta ggcaatgcgt gtactggttt ggaattaatt ccgacattga 3960 ggattatgtt actgcttgtg atacttgtaa tagcatgatg atagttccaa aaacaaaaac 4020 agtttccaaa tggataccta cgacgagacc atttagcaga attcacattg atttctttta 4080 ctttggacac cgtacatatt tgctagtagt tgatagttac tcgaaatggg ttgaagtaga 4140 gctaatgagg aatggcacaa attgtgataa ggttttaaag aaattagtgg tgttatttgc 4200 tagatacgga ttaccagatg ttttggtatc agacggtggt cctccattca atgcacactc 4260 ctttgtaaat tttctaaagc gacagggaat caatgttcta aaaagtccgc cttataatcc 4320 atccagcaat ggacaggctg aaaggttggt gagaaccgta aaagatgtgc tcaaaaagtt 4380 tttaaatgac ccggagttct ctcatttgga tgtcgaggat caaatcaacc tgtttttgat 4440 caattacaga aatagctgcc tgactagtga gggaaattat ccatctcgat tgattttctc 4500 atacaagcct aaaactattt tggatttaat aaatcctaaa actcattata aaaagttttt 4560 acaaggagga actgttcatg aagatttaac tactagtata aaactatttc ctgggcatga 4620 tgacattaat acgaaggact gcgggaatgt cccgaaaaac tcgatagatc ctttcgaaaa 4680 tctgatgcct ggtgaggaat tatggtataa gaatcataac ccacatcata cagctaaatg 4740 gctaaaggca aatttcatta aaaagcactc tcgcaatacc tttcagattc aaattggaag 4800 cgtggtaaca atggctcatc gagaacagct tcgcgtgtat agaggtggcg attcccacga 4860 aaaacctaac ataagggtgg tccggcgaca gccgggggca gacaaggaca aggattcgag 4920 ccaagaattt ttcggatttc ccgaggatga ggttcagaag gagaagcgat acgtcagaaa 4980 aaggaagtgc accgagaagg tagagcatac gccagaactt gctccgcgac gttctaaacg 5040 acaacgaacg gacaagcgca atgatgattt tgtgtatgcg aagtagtgat agtcgtgatg 5100 tagtcataat gattcttaag ttcgatctga atattttttt tatttgtatt tgaaattgta 5160 attctatctg aacatgaatt aacatagctc taaatgttga ttgatattgt aatatgaatt 5220 cgaaacttct tcaaaggggg aaggac 5246 // ID Polinton-2_HM repbase; DNA; INV; 42550 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Polinton DNA transposon; Maverick; Polinton-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-42550 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 456-456 (2009). XX DR [1] (Consensus) XX CC TIR is 219-bp long, and TSD is 6-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2657..2040 FT /product="Polinton-2_HM_1p" FT /translation="MILILNLVIFINMILFRMLRKNLLRHARIHHDLEEFF FT SIKENIVKQLWINNYEIGXHDKYYRESWTLKDLCKRKLASEKLLRIVIPNL FT RKQINNLMYSFFQNNQKILSQYSVYKSHEDDCFELIDRRTLIRPFYLNIQI FT SIVDNIYKLFVNTYYINCFSATKQHKLVMKSFFNIYMECIDWLFKQHEYVI FT NPYFKPNRNDMYDFI*" FT CDS 36002..37921 FT /product="Polinton-2_HM_18p" FT /translation="MIYVKHPQVAPGDIANLTPERXVILNNTEETFVRPFI FT HIMVTPKYGGALVQNSQSAPLVPDTRDANILSVDLDVYVNNLLPSSESVKT FT SKMKHIVPWQIAFFLLKIMTNTNTIKSEGNIYNIETDLNEIPIQINALWKI FT WNDQVNVKDANNNYTEEKMSLTSFIDILKNAFGTIXTKLLYISPSAIGIEP FT TELFACFYQIFFPKTSQRKYYRFTLQNIPLASLPIPFFNVDQIIWPERELK FT MMDIFTESINYNLSNNLLNLKLNNLTFNEFIYEGTNIYSYQKFLNQYNFSS FT IPNDYYKNYYTFWCLPSFEQNTYEINLKSPLQISKAIIASTNTYNNYFEMK FT GNQISPYHAQFGSLNTFTTSPSNLPEDIYKIITADWSLIAKYNSETFIEVL FT MYCEQTKSIFSPTLKVPLVTTAAFERATTPSYKTWLNGSKSILQGNNINEL FT VFYCTSLNGQLIYQKKSDGTIINSWSNSLPLKNIILKMYQPIGNNPVPYTL FT LLQRDQYSLTNEINPFNIEIDQNMVYTMAQDFKIQKRKLAFLRCTLMQISN FT FNNSVSNFPTTTDNLIFIKCDQATNCGMYMNRRYETGIVSYFNLSDYNNAT FT QVTVNIDFTKNRKTEIQGQLTTFDIVDVEDWKKTKMXFFXH*" FT CDS 21716..22969 FT /product="Polinton-2_HM_12p" FT /translation="MTLSAKQKLAFQWFDEGLNFFITGSAGCGKSYILDAI FT ASSDQIFKNIAVTASTGKAAHQIKGMTVHSFAGIEIGTKSVDYYYKHMQVD FT VLERWRNTHVLIIDEISMLNAETFDLLHHLACKINQCNDELFGGIQVITCG FT DFRQLPPVKGEYVFKSAIWKKYMFNVIELTESFRQENIEFFNALNEIRIGK FT VSDKTVDLLMTRHYEXDHNINSNFIRLFFTNMEVDFYNLRQLMSIPSDERW FT FHSKDVIKNLKINPLFQIPISINIKIGAVVMLVKNINVEEGLCNGTIGIVT FT FIETDGVWVNINGREFKIENVREDILDCTHSIIASRIGLPLQLAFSLTVHK FT AQGCTLNKTVLNFNSKIFIMSLPYVALSRVKDLNDLYIIHNNKNELRKILK FT NITSDKDVDLFYKNLNEKNFVTC*" FT CDS 29611..30864 FT /product="Polinton-2_HM_15p" FT /translation="MEEKEEKIETNPVENEPPKNEKKDSRTLTEILNEKGF FT HSFYLKGKNEEPEIKDDVSPTQIKFNIFKPSKDVYPENTIITGPTNSGKST FT MARELLKYGKYPDAEMLFVIQMREPSESQHKTWRDIINSNKNNLEAYDIIT FT VDSAENLDPVIQRAISFSKDKYGIQNSDHRLLKTILLFDDISAFCLKSQQY FT TSACSSGSHHGVGTITVFHAVPSKSSQAWNNLVSHYKCIVSFDNNSEIEKI FT FKNDFPSAGKMHHVISDLLNKETSSQHGHLALFKRNCSVARLRTQITNSKE FT QKVFIPVDAQGKLDFDYNEMSVNKKIVTFQSFPYVKAQNCKNYYYLKSHHN FT PFCQEKEKENQQTNDEEDKNNINSEKENQIKKRKWSESERAALMLEEIKRR FT KRYGLCESSDEFSDSGDESNESE*" FT CDS 18468..19058 FT /product="Polinton-2_HM_10p" FT /translation="MSDPICEPSDQKKQKLEEEDEVQLEPKLLKEIQTLQQ FT LADIDDVDVMMLLKSSIDYNGLIKSIKXKYNNLLLGKCYYVSTLNQKRIYD FT LHHKCLFDLYQKFSFSTARLHLEMKELRDICYSEKFSFSTASLHLEIKKLL FT VEEKLKLKNNLAYNRYLVICKLYKMLXKFDKQVRYLNDDYISDIKYFGCDL FT KSLEMW*" FT CDS 3165..2608 FT /product="Polinton-2_HM_2p" FT /translation="MKKEQFNKILNELHEFYSMKQNKLKEWNYNYDLVFNL FT VTNVETWSLQDLCKRKLAYEKLLNTVIPNLQKQINDLLHTFFDGNEKLLFY FT HQIFXDNCFDNIINDINNANYIYKKHYLFVQKLIIHYVYRFFEQTYFMFRY FT KTKQDKLIMSSLFNIYMQCIDWLIEQCKFVYDPHFKPCYIYKYDFI*" FT CDS 31474..33063 FT /product="Polinton-2_HM_16p" FT /translation="MNRYKPYDFSYSPLLALRKVLAQNAYNTDVYKKTRLE FT MEKETEEKETETVPQNSGMPETTMKSDISHLIERYDLPSSLIYADPINXEA FT QIEGELRFQQELNNLLPIETQLNGDLTTSLIKDQEKWKTYLNVGGDAYAFS FT KLIGQNKYNEFNAQVDQRTISETLRTMRNIKKYVPQEENNSLNPMPPNSVQ FT SSLNNLSQKTTLNTVQPTNKKVYLLTPLNRFLTFYEKPNEAEVTKLMANLA FT RSNMAKLARNLLDMCKNQVNHTFGSNQISLRVYKPRGEEATLNNLKIYIQF FT MNNVVDVEESLASIINNVNATSYEIINDPKQYSYYANEFLTLLASPQSVGS FT LRMVADYYQIKHLTFAKLWNYLKKYNIPFRNENKLTAESAYQETFSSRPYT FT VMTSAFFESDSLVDKSFSGNIAFDLLSETKKVVNSWLIQYTEQFKGYNFNI FT LTDYIKSNSVYMLDKNIYNLMINNYNENAMQSISIGEHMLYLTFKTFFANP FT PSLPYRDVDDVVQLQDNVVTACIQANMQTCKIF*" FT CDS 33643..35928 FT /product="Polinton-2_HM_17p" FT /translation="MEDALRDKIDRLIEIQLDDLSSRLFGLTNLSKIKKIM FT SQFNTLKKNVSDLKFYEKLKTILENSVFAKWDGMITAWSHQTLLFKKTANA FT NVKKQAPRKPYVYETHRFSYEKPHQYFQADLADMESFNFNPGYRPDYCLVV FT VDGVSGKVYLRVMKKKTAEKTVSSFRHIFKDIKRDDDKSFIYLQTDEGTEF FT FNSKMKEWCESENVHHFHSKNYGKAYLAESVIGRLKLLYQKLRDKGQLKKY FT NWSYYIEEMEQKLNEKEKTSSGISPEELDADDNKGKALRMIDQYVSNKKRK FT HDFTSNMLHRVEKEHKNQKLKILKPLSVGTECYLRKYKKDAKDTFSKSSTR FT NRPYWDTTKRVIIMKVSKRTNPSVQGYLPVHIYKVGQVDDLNTTYKVKRED FT LLPINEEDKNIYFKNFNEYVDTFYKPLKKSLQKKIIVKKMNNLVFDDVLDV FT NKYDCSTYFFSEIKSKEQENVHYNVLNDFSLPLSTNDLDKPFFKNNTIMHL FT SSMDVPTNLTRIDENISLTENSYGTRMETKPSIKQMMTDIQNDMISNPNRK FT IPFYLVTKKIILYTDFAXLPGDTFQNYDLLRVEHMDAVHLEYMCSIVGYGK FT LFGVTSPSPYQDDFLIRRLELFNTSLNRRLTFADFFSNYQNKSNNITNQKY FT FNLKYLFDTNTHLRYVTNNENTIISKLSIEPNGNVKQYIQNYSYINAPNYN FT FVWYLKKKTNENLNNPSNLYPIRSSISTKSNERGLSYFGFFFKPNKCITIR FT STRFYIINTI*" FT CDS 13228..12245 FT /product="Polinton-2_HM_5p" FT /translation="MTSKQVEQSLEKAVKLFYPDNELHPMLQEFHHLENSY FT EDWEIIINDYLTKWNLQLLLPEEEYIDLYFIFIIKDYQNHQNLFEKTTRHY FT FQFNEQGSKWKSDMYDFCKLINTDFLKCTEKNKELQNVYTEDSNFTQRYLI FT LKRKTDHFIKIYKRYYRSTIDNKCTSWKNKIKLSLQYYFLFPYLIEXHLYE FT YDLLKRNSLILREEYPSCLQRLVFSYNDKLFHNGFYTLYTPSPIYNYFKVE FT KVKTXNDKVYIYKSFNQCNFYEKSIMEMFPYANWFYFFKRNFTISKILNRR FT DNRLYDVIKQEIVNDKPIHKAEHFNMYYINAKECF*" FT CDS 14131..13409 FT /product="Polinton-2_HM_6p" FT /translation="MENEIQVNYCNLMQNYLKYGDMAAATLWHIDLANEKD FT LKVNQYSKVPVKPVAQFNFLKKELVYSNKSKEEVILLPASIRKQCSLAEDA FT CNYLYRAEMKLNKDDTHFKNKTGEGEITFNAILITTQTKLNKKLLESMLGK FT IEEIQNSGYNYAYSKHKVLKSQPISTFNVRMLKEDEEIYYTEYDRQTSVEP FT VMINTLLKNGQTRSFNQNMKRAAQDLAAQESKEYKKSKQHEIKDEKNDDE* FT " FT CDS 4528..3815 FT /product="Polinton-2_HM_3p" FT /translation="MEKQKNIIMDQKQSLTFLNMTTNNILNSLETVNKQIN FT ALTKRMDNMGLEMKEFVNKNTFEKILNNISLEMKEFVNKNTFEKELKLYDN FT ILNYNKDINDKFMSDNLIIELDQMDLRLQSNESYHTQPSYTVDGYFYRFKI FT FGYSNQEDKMGIYFQLIQSPKDHSLKWPFMKTVTCTVKNEYIENTKIIANY FT SEPLKECFEKPAQAYNRAIGTDSFATHDQLKRHYIKNNKLIIHVSFI*" FT CDS 16596..17471 FT /product="Polinton-2_HM_9p" FT /translation="MVISVKVINVKWMTIKLIFVIENKMQVIMKKSMQVLM FT KKMKNVYTVIQMTKFLNVYKVIQMTKFCYKVIQMTKFLTKYLIIYYQINIK FT KSNLKEEIKNLDLRVSLLEDLVKGSDEDNEENENLRILLLLRNLDIRVSIL FT EEPTGLPDGQKIFVIKDIVKRMNINSNGYCSTXYSAPFFYKNYKMSIRINF FT NIDINXNNEMIHGDYIGIFVVLMXHIYDAVLKWPFSNKKGTISLLGEKKFK FT KSFITDDSVYFQRPIKDQMNPAFGFGRFINKTDLLNYIVEDSLFIKCKIN* FT " FT CDS join(38375..39028,39032..39595,39519..40829) FT /product="Polinton-2_HM_19p" FT /translation="MSLFRARGRYEKNLINEKIPETLINPQNSEINEIQNK FT FYNLQNQXSHFSTKADVLRIKSELDEKCEQLKKLKQQQQENIQNTYEEKPQ FT RKKNVIKKNEIEKPKRKKVKYSEEESESSEEEYVEKKPSKTKVALNSHVCA FT QCVQEVKAVDAIHGLCRNCIIQQRAILSSPFMQNNIEKKSKKEKINKKSLI FT ELGYTRALKDVKNKKIPFKKNTSDSEDEIELKKYYNKKFYLFYFSKKKKYY FT IKMGSLAKYMRAGDGNRYTPNHNGPVTIDDVLQKAEDENRLLQEATSGNLT FT LEMKAALNEKYPGFMTAVAQDEVIKTDIQKFTPDLTSELKSLITFTMKFNE FT WTIPEDYIYEFDVHLFKNINTKTRFNTTANHADLALLETILFGRKFCFSYI FT QKYKSISKLSKKKLFCLVENFVFHIYKNIRVYPNYQKNNSMPANNVINNRA FT YQRFKTMFVKKSYMKAYREHMLNHNLAFTEKTINRPTWDETRNLMKSNDXN FT AVIRALNGNVIPAPEYQTFLQDTYNALMNGEKGFRFRVPLFLLCEFFQIKS FT YYGKGKELKIEFTVEQDTNQLLEYIRPITDAQIESLANVGLEIRNPVIWVN FT TYRIKASLPLEKSLYINSLPDQPYTIARFAHYEQVITNVEGTDTVTISLGN FT IKTADMVPHCVLFSVQSKKESDHLTKSSMTPVEIACTMIKKITFHNFRRTF FT VNSSGDTYVIDLENKPEDQQLCYRSYVSWCKGNTPSTLSINNFADFYTNDD FT DSFIRNPNDYFSKTQNLTEPLAIDTTSDYNYVNDKPPATIQGNNSLQITVD FT FVTNFKGIITLYLQHAAQLIQTGTGEDENYTYYPFKFKSS*" FT CDS join(23038..24540,24544..28353) FT /product="Polinton-2_HM_13p" FT /translation="MHFKVKNIEEKLDNESYYREILYFVNDNYKDEDFIEE FT IELNDQIQTPEEITHHINQKTQDSFLIETNKALSFADLFQNFLIGYNIVTL FT CAKSTEDGYCKIIKRCYNSPVYYIKPLEGSINLQYIKSNNIRSVFNILEND FT SGWRIIRLSSFKVKTLSNLTEKSMMKLRNYELAGYKRVSELSPEELMMRRE FT YERKRKKKISELSPEELLKKRERERKRKKKVSELSPEEIMKRRECERKREK FT KYYTKNKNKILHKQGRKYIREENQDEWYGEWYRPHSLTQWNKHKSSSFKVL FT SYDQKRKMFLDYSILFENGLCFYSAFVISLLIRKWKVNTIKEWNTKVLEKN FT GQDFYSEENIMQFLIEDQNDSWIHALSLLKEKSKNLSFLELIDLCHSFAYE FT NTFDRINIKIFSINKKNKNVRLLHPKMKSQYKSVLSELLKVVYESANITKE FT DQNPQREFLNEDERKEANERKKTVNTIKICLYEKNQSFHAVSMLNERFSVI FT YNNHXLYQCSECGLKYTSQIHKQECKGFYVADKFAYGKTRHFYEPQNQFIN FT TSRNVFPFFITFDCETSVQTSNLKETEHVKPLSDLSYYCHGIVIHCLSENL FT ARECFGLDERNKVITQWIFYYENTIESLQANPLTLLPACFQENVSLLHKTH FT RQILWQKLDANDENVEKYKSELRVYDLLILADTLVRFTYEKCLPRFQNLNL FT SEHEKCQVWQEENPLCSICRCPADKIKKSDVYSYKLTTMLEPEINLLNHNE FT IRFYNDKRVQVVNHIFKRVSKLELFMLKPSTQNYLITNKVEKIKNDLFQIS FT SLFYICCRWWEKFDRTLITNGFLSDYVRINSLDLVDHMKCLSQLVMNEQQI FT NIWFENDPIELLEDSYFLIRIFQKCINEVNVHWKYERGMGIKDSITFLHLM FT SRLNHPSAHLWFHCMLEPLNREFEKKRTLKELIQDDKGRDDVLKECYGLLS FT CFEDYLVIDHDHFSGTVYGLAHDYCNKNLARYSSKLACPIFAHNASFDNRI FT MIVEGLKKILDVPLYSYGEDTEDKPFNAFLKCCNMENLFVISKGVNKFNII FT CIGKHIKIVDTLKILNCSLDQASSMLSKKAQNQIIDSMLHYLLPFYPHINQ FT LVDNEEFKNCFIKKTLFPYSQISDKLLNIEYKNIPSIEIFAKNDLMKSKTL FT EDEIKKIKDSYETVSKLWNFLNLKNLKSLLHIYTVLDTVVLGSVMIDFDER FT TFQFTQFRMLPHLSVFSYCSKLNSXMSQIQIQYPPSEEHERMIEKSVRGGM FT SSNGSQRYAIDTDYFENKIEELGLKSNIKGCLFFLDENGQYGGAQLKPLPF FT QMKYSFDLECQTLEQVIEKYHRVNPDGFSTSALVHCQIKMEPEYQDKVGGF FT SPMVEKDVIPLKDYSPTEIVAKRILKKNGQYTIEKDKTVKLLMKLSSHETC FT EFLDTLIWLTTCCGCVVEKIYKVLPLWRYPLAEQMVRKNIEKRKEAVKNGD FT KVVDATAKLHTNGHYGFKSQNTAIKQTSQFINHEEKAFQNLIKHFQDIRLV FT LLQEEGLSEEKIKPLFPIHFQNLLYENGIFVNIEPERYKMDMFEKKICIDS FT YVTDQFTYDEYYKDFIFNKLIDVKTKTIMFSGKHNEEFQDVFELLKSDINM FT YKDGLVFVSSEKSNLVNKQMRYVASQILSYAKLNVTRFAWELGEVLRNHNA FT EKHLVVTDTDSGCFFITKKTDDKQEIFRSKVEEWIYFSKIGKQWMDYSNYP FT KTHEFYSSLYAKESDHFQNEVPPPSYIGQIFAMAPKCYSAHIIKKKRNSR* FT " FT CDS 14971..15444 FT /product="Polinton-2_HM_8p" FT /translation="MSYDLNDYCDFIEYLNDCYDNCFRFLNVKMFIDMMER FT YFQFNKVHKFTLMEMKNRAQFFRLCHRNNILETYKQWFDFYQNEQKYMPRY FT AIIIILDKITEFPNIEFIDFYNRWHTINSNHSLFNQTLFEYCFTDNFVSSI FT NKILDEKYKHWICKSFAN*" FT CDS 14336..14923 FT /product="Polinton-2_HM_7p" FT /translation="MLIVTKYRILLCYVLYKRNKFLFQNHFKASPKMSLER FT FNTVYEIAYNEYNNSFTRMIKLQEDCSELIKKEKCLTQEIINLLIEIEEER FT KLCCIHPNKELSIRYYKISKFLWKNLESGNSISKEEETKYLPLTELEMKKL FT DKDELALFFTCQKCQKLNRCLHDCFWCEDRMCYLCLLDHSLQTMMKFDTNN FT CKFNC*" FT CDS join(11406..10768,10764..9607) FT /product="Polinton-2_HM_4p" FT /translation="MNRVDSIIKKIHNNCAVCALDEIAKRYQSQENKSGNS FT SPSSLVIDEEHYEDASSNDFLENILNSKRITLNSYSIEKCFKNSKFKNLLD FT QPQLCQFHLNKFDNMKKDHLDENVIADYYKSKLKVFLLERDVKPVVPFEGD FT QTDGFIDTFTAKILPTASTIIEKLDGVLKKKSVKRNIEGIIKDGTSDKKTL FT FVNILHAIVTLFISSFFSLEKEEMCTNESLSSYFNNEPFLKDYFIGVYAFD FT ELSQTQTHVNIIKRMVKNGSFFIYNNETSKENGEHWRLLLKLDANHFFCFD FT SLGTESCLKQIPPFLRSNRHLFEAVEEIQTNIQINTLNINYNEIEISSYXR FT QTNHFPYEWTSDNHEFYWFTKFILKYSSMTGQQTILFSYNKTCFQWENSSL FT CGAFAAYFISLLPNYLINGQIYLPLLCRLLNYHFYEVSSLHASELAQINLL FT PFFHFVRTKLFKHLDVTAKQLFEKDMKLHLYLRKTESLVRNNVTMKTMKIP FT IENDIFKNEVMTNNKIRSFLNDARLLRTNNECQWDKISIQDINAIRDVMNE FT ENLALKHYENMNDMYFFDYRRLKDINNFIDVPQDDVMTKKEIKTRFVQNLI FT *" FT CDS join(28229..28822,28826..29188,29192..29503) FT /product="Polinton-2_HM_14p" FT /translation="MLKKVIIFKTKFLLLLILVRFLQWHLNVTVHILSKKK FT EIQDKGQDEHKNRAKGLPNYKLCYTHYINGFNTHRAEKIFRNMPNQLENEL FT QPFLQNRLDLNFKNVTIKEHVIQFLKKFTQKNYVFDDDLFTEIRDCYRLDP FT IREANLKVLPDIEKFCSDEHIQTLLEIEKNIIKSSERYNTMITFNREVLIP FT LLFDLKKLIVIKNFFFNCKKNCKKKMEYNLFDEMLTVKQLENKLKEIGFEN FT FYLIENTEGEKSEYYTSKLICVNLNGRILIPDSFLECTKCFVCFFKKISFV FT GCFEDSCDICISQSQSKKVNYNCNHLEKMNKENYSQDKVEENKKLIYNNLF FT LLELSREIGSDWKRCGRLLNVKEGDLELIENDYDKTINRCFEMFKISVYQN FT NLTMDQLFQVLESLPRNDIKRNILKIYY*" FT CDS join(19152..19586,19590..20114,20136..20951) FT /product="Polinton-2_HM_11p" FT /translation="MELTVKIVNEFMKKFLSHNEICPFTLEELIIQAKEFQ FT PFVCEECELLNVRCQNKIFDNDEWNMFFRYDFMRSEDQFVVMLTQGFKKHL FT LHTIDRFLPQNFKIFYNFFKNYINVDVFVSHVNQLNNENKNNKVDNERKCY FT YCNNLYFVIFFSVIMNSKEKHDKDLQIFGRCIQINTDIQTDEYIDESVINL FT EHKKAYKSLKEILLEDVEIKYLREKFRYQTIANYFVTSLKEVFWTAQSLSR FT EKRNTIIQTHLIWIKTMNELCDKGIIISEDIIKKYNYCTESHKRFLKFKIE FT LKNDMIECLNLYTCCPYCEYNKICQYCIMITEAKLNNILELLEMDAISIRK FT IYQFVSMEYMLHELHKKMDCEFKDEYNWGSLISTNKRQVPMEIASFVLILA FT LSKFKKGEIPTDDFFEFYDFFKNYHDKEKFICQLNEKIEDSRKWYYDKRKL FT TVLKLNDVMKTFSERQWIESTVHNTYSVGMMKFTLSDLYDKALAYKEICGK FT CKIIKNDEQLEKLNVPIPKDSKLNVPTDWIDFFYYQFPHNYVILFLLEVLL FT TNFPYYKDKVFYTFYMFFENCDKNLIIDQVNLKVYDYVNKLFYY*" XX SQ Sequence 42550 BP; 16551 A; 5176 C; 5881 G; 14854 T; 88 other; tgatgatcta cccatttagg atttattcac ttagttctat cccgacctta atttccatcc 60 cgaccttcgt cataactaac gtcataacga taacgtcata actaatgtta taatccgata 120 atgttttttt aaaaaaaaaa aagattaatt ttttttgttt aaagttaatc tagcgttata 180 aaaaaaatat ataaaaaagc aaatgctttt taattatatt cgttataaaa aatataaaaa 240 aagttttttg tcaaattata taattgtttt acaatataaa aaaatggtta aagtaaaaaa 300 aattgatgaa agtaaaaaaa aaaaagtcaa gcgaaaactg tttgaaaaac ctgtagctgt 360 aagtttattg ttttcttttt ttaaaaaaaa tgttttgtat ttttttaaag tggttgtgtt 420 ttttttctag attgttaaac cttatgtgcc wgaaaacgta tataaagcgc ctcaaattta 480 tgtmcctcaa gtggttgaaa caagaccact tgaggaatca gatgaggaat tgtatgaaaa 540 tgaattggaa atctacgacg atattccaga aatagacgat ggtaaattaa aatttattat 600 aaacataatt tttttttaaa aaaaatatat agtttaaaat aattttttta gaacaagttc 660 gyatacttga tagaattatg ataagaatag atcatcattt aaatgatgac akaatgaata 720 gaaacgaacg atattttgca cgtaatggag attatagacg tgtagtatgt ataggttgtt 780 gtttgtgcga agaacaaagc gagcacaatc attgttgtgt aacaaccgtg tgcccgtacg 840 tacaagaact atttgaacgg tgccgcggtt gttgttattg tatataaaaa taatttgtaa 900 ataaattgta tatatgtaaa taaaaaattc atttatcaaa atgtcttttg atgcgacgga 960 cgattgtttg ttatatttat atcaactcaa tttagattta aaaaaacatt tacaggaaac 1020 gcatttaaat caaggtattt ggatggataa agttctttta gctgaagaaa gcgaattgga 1080 tagttgtaaa agagtatacg gtttttatat taaagattta aaattgataa agacatatca 1140 aatagagttg atagaagttc ttgatttgtt atttaaatgt cttccgcatc gcgtatttta 1200 tcccgacacg tgttttcata attatagtga agtcattgaa cattgtcttg atgtgttaaa 1260 aacattttat tattatttta cgtgtataga taaattgtat aaaaaaataa actatgtaaa 1320 tagtttaaaa ataatagttc aacaattgtg ttattattgt aaaggattag ttggaggatt 1380 atatttgcaa aaagatgtat tggaatgtta tcatattata aaaaaggaaa gagacaatat 1440 attaaatgaa aaaatttatt aatgtaaaat tttgtttttt ttctcatact aaacttttaa 1500 caaarttttt aaaattatta ttaattaaat aaaaaagttc ttcatttgta aaattaaatg 1560 tttttttacg tatttcgtag tcaatcaata cagcaatacg atgaaattct tttaaaactt 1620 taaaattatt gtttccataa cgatttatag ctatttycat aaraattcta tattcttggc 1680 ggttaaagct gatatgagta ttggtatcgt ataaattata attttttaat ttcattatga 1740 atctaaaaaa aataaagaat aatatttatt ttgataatac ttgttttaca aaattataaa 1800 agtctcgttt aatttcttct gttttctcaa tgttttttaa atagtttgtt tgttttaaaa 1860 cttcttcttg taccaataat aatatattgc caaacgcttt taatgttgat ggatcattaa 1920 tcttttcatg ttcaacattt aaaaattttg tcattatatc ggattctttt ctgttaaaaa 1980 ataaaaagtc ttgatcgtct acagtaatta gtttcatttt tttattctaa aataaaaagt 2040 taaataaaat catacatgtc atttctatta ggtttaaaat aaggattaat aacatattca 2100 tgttgtttaa ataaccaatc tatacattcc atataaatat taaaaaaaga tttcatgact 2160 aatttatgtt gcttagtagc agaaaaacaa tttatataat aagtatttac aaaaagttta 2220 tatatattat caacaatcga tatttgaata tttaaataaa acggtctaat taatgtacgt 2280 ctgtcaatta attcaaaaca atcgtcttcg tgtgatttgt aaacagaata ttgtgataaa 2340 attttttgat tattttgaaa aaacgaatac atcaaattgt taatttgttt acgtaaattt 2400 ggtatgacta ttcgtagtag tttttcagaa gctaactttc ttttacacaa atctttcaaa 2460 gtccatgatt cacgatagta tttatcatga wagcctattt catagttatt aatccacaat 2520 tgtttcacta tgttttcttt tattgaaaaa aattcttcta aatcatgatg tattctagca 2580 tgtctaagta aattttttct taacattcta aataaaatca tatttataaa tataacaagg 2640 tttaaaatga ggatcataaa caaatttaca ttgttcaatt aaccaatcta tacattgcat 2700 ataaatatta aaaagagatg acataattaa tttgtcttgt ttagttttat aacgaaacat 2760 aaaataagtt tgttcaaaaa atcgatatac ataatgaata attaatttct gaacaaataa 2820 ataatgtttt ttataaatat aatttgcatt gttaatatcg ttaataatgt tatcaaaaca 2880 attatcawta aatatttgat gatagaataa taatttttcg tttccatcaa aaaacgtatg 2940 caataaatcg ttaatttgtt tttgtaaatt tggtataaca gtgttcaaaa gtttttcgta 3000 agctaatttt cttttacata aatcttgtaa agaccatgtt tcaacatttg taacaagatt 3060 gaaaacaaga tcatagttgt aattccattc tttcaacttg ttttgcttca tcgaataaaa 3120 ttcgtgtaat tcattaagta ttttattaaa ttgttctttt ttcattgcta aaaaaatata 3180 caaattaaaa aaaggtcata aaaaatttta aaaaaaaaaa ttatttttta ttataatagt 3240 tcaaaagttc tttttgtttt aaactaataa cttcaggcgg ataatacgac ggagtattac 3300 attttgcgtt ttctgcccaa aatggatctg ccactttaaa gtttggatat ctgtctcgat 3360 tttccagttc tgtatacggt ggaggattta cactttcttc gtatgatggt ggtttttcaa 3420 atttaactat aaacttttta tagatatcct ttctaccgat ttcatttaac aaatttaaaa 3480 taagtttgat tttagcstcg ttagtattgt attcatagtg ttgatatacg aaatctttga 3540 actgtatata ttccattaaa tctagtccat ctttaatttt ctctcttatt ccgtaaggaa 3600 tttctaaaaa catttttaat ttttttacat cagacgtagt gatgtagtca gctaaatctc 3660 tcatgattgc actgttcatt tttttaatat tctaaaaaaa atataacatt atataaaaaa 3720 atatataata taaaaaatac attatataaa agaaacgtgt ataataagtt tattaattat 3780 atgattaatt aaatacaaaa aaatacataa caaattatat aaaagaaacg tgtataataa 3840 gtttattgtt ttttatataa tgtcttttta attgatcatg tgttgcaaaa ctatctgttc 3900 caatagctcg gttataagct tgagcaggtt tttcaaaaca ttcttttaac ggttcgctat 3960 aattggctat tatttttgtg ttttcaatat attcgttttt aactgtacat gttacagttt 4020 tcatgaacgg ccattttaaa gaatgatctt ttggagactg tatcaattga aaataaatac 4080 ccattttatc ttcttgatta ctgtaaccaa aaattttaaa tcggtagaaa taaccrtcta 4140 cagtataaga aggttgagta tgataagatt cgttagattg taatcttaaa tccatttggt 4200 ccaattctat aattaaatta tcagacatga atttgtcgtt gatatcttta ttataattta 4260 atatattatc atatagtttc aattcttttt caaatgtgtt tttattcaca aattctttca 4320 tttccaagct gatgttgttc aatatttttt caaatgtgtt tttattcaca aactctttca 4380 tttccaagcc catgttgtcc attcgtttag ttaatgcgtt tatttgtttg ttgacagttt 4440 ctaaactgtt tagaatgtta ttagtagtca tgtttaaaaa agttaaactt tgtttttgat 4500 ccataataat atttttttgt ttctccatta cttctttcca ttcattttca aaacgatgta 4560 aacaatcgat aacgtgttga tcatagttag tttttttgaa tttgatttcc acaaaataca 4620 cataatttag ttcgatttct ttctacgtgt tcctcgtgat aaatttcaca ttcatgttcc 4680 caagtatctg tcaatttaat tttttttcca cagctataac aatcataatc ttgtgatgtg 4740 ttacatttta gttgatgatt aagtaattga tcaatagtaa agggtaactt gcaataataa 4800 cataattctt tttccatagc ttcagccatt ttttttgtat aattagttgc taaaaaaaag 4860 tattaatagt attataaaaa tgtttttttt taaaaaaata ttatgggtaa tattatgtaa 4920 tattacgggt aaaattatag acaaataggg ttaacatatt ataaaaaaaa caatatgaaa 4980 aaaaatattt attattaaaa gcaaaaaaaa atacaaatta ttataaattt ataaaaaaat 5040 tttttaatgt atataatcga ttgcaatgac agctgctcag taaaatatgt tacaatgtta 5100 tgaatacgaa attgcctttt ctattgaata gaaaaagcaa tctcatattc ataataattg 5160 taaacaaaag tgtttccatt gttttttttt ttttatacct ttagttttgg ctgcyttgag 5220 aaaataaatt tttaagaaca gtctaaacta aagggacatt aaaaataaaa aagttccatt 5280 ccattccatt gtttttttta ttttttatac ctttagtttt ggctgcttta agaaaataaa 5340 tttcttaaaa cactcaaaac taaagggaca ttaaaaataa aaaaatacaa aataatttaa 5400 aaaaaagtca attgtctatt tccataggtt cattttcatc attatcatta ttttctttat 5460 catcatcatc attatattca tcatattcat cattatattc atcatcatca tcatcatatt 5520 catcatcatc atcatcgtct tcatcatcat catccatctc gtagtcttct ttttcatcga 5580 taatgtcgat gttttttaaa ttaaaaacag agtcgccatc tacaaacatg gcctcgtatg 5640 gttttacata ttcaatattt gtgcacattt caggtttgcc atatttgtag atggctttgt 5700 ttacatcagc rtaaatggct tccattttgc tttccatttt ttgttttttt ttagttaatc 5760 taaaaaaatt acaacatgtt atacaataat ataaaaaatt aatattccat gggttctata 5820 atacaatttt taactttaat aattaattcg tttaaattgt ttattaattt aaaattattt 5880 ttttgttgtt cgtcaatcaa ccatctttgt aatgttttta atataaattg cggattaaat 5940 ttttcttctg gattaacttg aaaaaatagc gtaatgactc gacatatttt taaataactt 6000 ctattgtaat cgttgtcaat ggcatcaata acacagcgat gcatattcaa ataattagca 6060 aattgttgcc aataggcttc aagatagaat gacaagatat ttgttaattc attttcactc 6120 attttatttt atataattaa atctgtaaaa aaatttaaaa aaaatgttat tatataaata 6180 tatatttttt taacaacaaa aaatattaaa aaaaaatgcc tcgcacaaga tatacaccta 6240 gatcaaaaaa aactgtagat aattatttaa ataatcaaat tgattttgta cgtactacaa 6300 caaactcgca acaaacgcaa acagaaaata gagggatacc aagcaattgt aaaaatatat 6360 tagtatgttt acgaaaaact aaaaaaacat ataaagataa caaatctaat cgtaaattgg 6420 gacgcgttgg taaaaaagta aaacgatgga aaatatcaaa cgaagcttac gttaatcgaa 6480 aacacgaaag aaatgcagcg ttaaaaaaaa gaaaaaagaa ataaacaata tattttatta 6540 taataaaaaa actaattaca ttgttgttgt aaaaataaaa gtccttcttg tatttttttt 6600 gttaaaaatt ttatttttct ttctttattt cgtttgagtt tgtatttgct ctttttctaa 6660 aaaaataaaa aaaattattt ttttaayaag ttaaaacaat ttaaaaaaat aattttttta 6720 tataaaaata tcttactttt ttaacttgaa aatgaacatt tttttctggt gktccttttt 6780 ctttaayttc ttttttaatt ttattttgtg tttttttatt taaagtttcc ataaattcat 6840 ccacsgttaa atcgattgga ttaaattcta actcttgttc atcttcaata acgatagttt 6900 caggcataat ttttttttcc cacatttttt ttctacaaac aagtttagtg attgccattt 6960 gatttttgat ataaaaaaat gcagctttta tacaaaaata tttaatcttt atagattaag 7020 taaaataaaa gatttttgtg acaaacaagt tatatttaga aacttttaat cttgcttttt 7080 aygtaggatt atatagggaa tttctcaggg aatttccctt aaaatgtgcg taaacaactt 7140 taaaaaaaac tctgtgtaga aaaaaatctc tacgtaaagt gttaaaatga caatttaaat 7200 gaattcaaaa cacgcttaaa caacacgtcg acgtagcata gtgcatcgta tcatttccgc 7260 actatgttgt aaaccgccat gaaccgatcg tctagggttc gaatctctct ataactttac 7320 ggctttaaca ttatatgtat aaacacattg agcgaacgct tgatgacgtc tgtctagtta 7380 gtttagttag ttatatgtta amcacattga gttacggtgt tactactgtt acttgatgac 7440 tgtccccact gagagctcat actttcggtg agtcctatat gtgaaccaca ttgagtatca 7500 ttgatacttg atgattgtac cccactgaga agcatactct cggtgagtcc tgagcgtgtg 7560 tgaacgagcg tgtatgtacg aaygggtgag caggcataag taaggcaaaa ataatagact 7620 aagtaagtaa ggttaagtaa gtaagtaagt aagtaagata caaacaaccg gttgtaaaaa 7680 aaacaacaga gtaaaaataa taagtaggcc tactaagaaa aacaaaataa gttaggtaag 7740 gcaggttaag taaggacagt gtcgggtgac acaaaatatt ttttaaaaaa tatataagtt 7800 tttttttcaa agttttttta caagtcaaaa aaaaatgaga aggggacaca gacgtaataa 7860 acgtgcaaaa agatatattt ttataaacaa aaaatcaaaa ccgcatacac gtttaagtaa 7920 tatgatgggt agaggtaaaa gtaatatgtt ggttaaaaaa tktgtcttag taaatatttt 7980 atattttaat aatgaaattt tttaaggtaa aaaaacatgt aaacgtttaa atatttgaca 8040 tatatttaaa taaaaaatat taatttttta ataaaggtaa aaaacaccgt ggtggtaacg 8100 tttttagaag attaagaggt ggtttaagta atccattcat tgatcctgtg gacaagttat 8160 ttaaacaatt tataaacttt agaaataaat atttagttta acaaagaatg tcattagatg 8220 aaggacgttt gtttaaacaa agttattgcg aacatgctaa taaaagatat cacagaaaag 8280 aacattgtag tcarttgtta tactaacgca tacgttaata ttattaaaga agatctagag 8340 ttttataaat tattacttga agatatgggt gaatattctt taaataacat aagagccgat 8400 gtaacaagtt atggcgaata tatattaact tttaaatatt ttaatgaata tctatgtcaa 8460 ataataagtg gaaataaaaa agaaggacat acagacattt ctactttttt aggaggattt 8520 gttattgatt ataaagtacg ttctggatta tatactatta gacatgtaaa taaaggcgat 8580 atatatattg gttatgaatt acctataata aaataatata tttaatactg ttgttttttt 8640 tatttttttt taaaaagggt atatataaaa gaataaaaaa atgtttattt aaaaaaaaaa 8700 ttttagtttt atttaaaaaa aatggacgta actgctttta aagaatctct tttacaattt 8760 tataatgtta aaaatttgga aactttacaa acaaacgata gtttgaaaga tgtaaatgga 8820 aatttcaagt tattttttcg cttggacggt agcaatcaaa taaaagtaac tcgttcaaac 8880 gatatagcta acattgattt tatatatgaa gtatttgata tgagtgttaa aaaagtattt 8940 agtatgaaat atgatgattt tgtatcagtt ataaaggatt cattatcaat atggttgtta 9000 gaagaatgcg aataaaaaaa agacgtactc gtaaagttaa acgaaaacaa cgcgctcgtg 9060 gtttatcaga ctatgttaca cctcaaaatg taactaatgt tttaaaagcg ggttcgttat 9120 tatatagttt aagaaaagca tggaaaggaa aaaaatcaac gcctgctgta acaacaaaaa 9180 cagttgcaat tcaaccaccc gttaacttat ataaaccagc ggataaaagt tttgaagtat 9240 attaaacaaa attgatgtaa aaaaacattt attttttaaa taaaattaaa atacaatgta 9300 aaatttattt tcttttcttg tgtcttgtaa ttttttttta ataaaagttt tttttttcct 9360 ttacaaacat cgcatagtag tttatttgtt tttatttcga attttgtttg agttggttta 9420 tgtagaagcc aaacttcatt gttaaaaacg gtactcgtac attcttttaa ataatttaty 9480 aatttcggat cttgtaaatt aacratatat aatattccgt ttccagcaca cgattcgcaa 9540 atttcttcta taactttttc tttaattttg cataaagctg ctttctgtat acttgttgac 9600 attttattat attaaatttt gaacaaagcg agtttttatt tctttttttg tcatgacatc 9660 atcttgaggt acgtcaataa aattatttat atcttttaat cttctgtaat cgaaaaaata 9720 catatcgttc atattttcgt aatgttttaa agcaaggttt tcttcgttca taacatctct 9780 aatagcattg atatcttgaa tagatatttt atcccattga cattcattgt ttgttctcaa 9840 taatctggca tcgtttaaaa agcttctaat tttgttattg gtcatgactt catttttaaa 9900 aatatcattt tctataggaa ttttcattgt tttcattgtt acattattac gtaccagact 9960 ctcagttttt cttaaataga gatgtagttt catgtctttt tcaaaaagtt gtttagcagt 10020 aacatccaag tgtttaaata atttagttct aacaaagtga aaaaatggaa gcaaatttat 10080 ttgagctagt tcggaagcat gaagagacga aacttcatag aagtgatagt tcaataatct 10140 gcacaacaaa ggtaaataaa tttgaccgtt aataagatag tttggtaaca aggatataaa 10200 ataagcggca aaagctccgc acaaactact gttttcccat tgaaaacaag ttttgttata 10260 agaaaacaaa atagtttgtt gaccagtcat ygaactatat tttaatataa attttgtaaa 10320 ccaataaaat tcatgattat cagatgtcca ttcgtaagga aagtgattag tttgtcgyat 10380 gtaagaactt atttcaattt cattataatt aatgtttaaa gtgtttattt gtatgttagt 10440 ttgtatttct tctacagctt caaacaaatg tctattgctt cttaaaaaag gtggaatttg 10500 ttttaaacag ctttcagtac ctaaactgtc aaaacaaaaa aaatgattag cgtctaattt 10560 gagtaaaagt ctccaatgtt ctccattttc tttgctagtt tcgttattgt aaataaaaaa 10620 agatccattt tttaccattc gtttaataat gtttacgtgt gtttgagttt gactcaattc 10680 gtcaaaagcg taaacaccta taaaataatc tttcaaaaag ggttcattat taaaataact 10740 agacaaagat tcgttagtac acatttattc ttctttttct aaactaaaaa aacttgaaat 10800 aaataatgtt acaatagcgt gtaaaatatt aacaaataat gtttttttat cggacgtgcc 10860 gtcttttatg attccttcaa tgtttctttt aacagatttc ttttttaaaa ctccatctag 10920 tttttctatg atagttgaag cggtaggtaa tatttttgct gtaaaagtat caatgaatcc 10980 gtctgtttga tcaccttcaa atggaacaac aggtttaaca tctctttcga gtaaaaaaac 11040 ttttagttta gatttgtaat aatcagctat aacattttcg tcaagatgat cttttttcat 11100 attgtcaaat ttattcaagt gaaattgaca tagttgtggt tggtcgagta aatttttaaa 11160 tttactgttt ttaaaacatt tttcaataga ataagaattt aatgtaattc tttttgaatt 11220 taaaatattt tcaagaaaat catttgaaga tgcgtcttca taatgttcct cgtctataac 11280 tagagatgaa ggtgaagagt ttccagattt gttttcttga gattgatacc ttttggctat 11340 ttcatctaat gcgcaaacag cgcaattgtt atgaattttt ttaatgatgc tgtcaactcg 11400 attcattttt ttattttttt aaaaaatatt tatttttttt tgttttttat acatgtttaa 11460 taaattttaa aaaaagaatg cgtttcattt ttattaaaaa atgttgttat tttttttagw 11520 tttggttgtt agataytgaa catattgata atgaatttaa aaatgatcat ttttggggta 11580 aaaattgtat aagagaaata tgtttacgag aattaaacga wgatttttgt twtcatgyac 11640 aagtgtggcc gtgtatyamc tatcaaacac ttaaatsaaa aatatcaaaa aacatttaat 11700 tattgtaaaa aatatattca cggattggaa tatgatccaa gttcaatgga agcaaactct 11760 gatattattt tttgtacaga tgtaaacgat tatttggaaa catattttaa aaaatatcca 11820 ttaaatgtag ttttttataa aggtggcgaa atggaartta aaattttaaa aaattttaaa 11880 aatgtttaat attttgattt waacaaatta aaatttccwa aagttraatt gatggattat 11940 ttttatgatg aaacaatgtg taacgctcat tttaaaaatg aaatttttaa aaaagatcat 12000 ttggatcatt gttgcaaata cgaaactaaa atgttaaaaa attatttaat tgattttata 12060 tattatgtta aaggtaataa tgaaaaaaga gaagaatata aagatttggt taatttaata 12120 cgagaaaata atcgatgtga agaagtmaat aaaacaaatt gtaaatgttt tgataatgta 12180 tttcttgtaa ataaaacata ttatacagta aatataaaaa aataaaaaaa ataatttktt 12240 ttttttaaaa acattccttt gcgtttatat aatacatatt aaaatgttca gctttatgaa 12300 taggtttatc gttgacaatt tcttgtttaa taacatcgta gagtcgattg tctcgtctat 12360 ttaaaatttt tgaaatggta aaatttcttt taaaaaaata aaaccaatta gcataaggaa 12420 acatttccat aattgatttt tcataaaaat tacattgatt aaacgatttg tatatgtaga 12480 ctttatcgtt matagttttt actttttcaa ctttaaaata gttataaatg ggtgacggtg 12540 tataaagagt ataaaatccg ttatgaaata atttatcatt gtacgaaaat actaatcttt 12600 gtaaacaaga aggatactct tcccgcaaaa ttaaactgtt acgctttaat aaatcgtatt 12660 cgtataaatg atyttcgatt aaatagggaa acaaaaaata atattgtaat gataatttaa 12720 ttttattttt ccaagaggtg catttgttgt caatagtgct acgataatat ctcttgtata 12780 tttttataaa atgatcagtt ttgcgtttca atattaagta tctttgtgta aaattgctat 12840 cttctgtgta cacattttgc aattctttgt ttttttctgt acattttaaa aaatcagtgt 12900 tgattaattt acaaaaatca tacatgtctg atttccattt agatccttgt tcattaaatt 12960 gaaaataatg acgcgtagtt ttttcaaata aattttgatg attttgataa tctttgatga 13020 taaaaataaa atacaaatca atatattctt cttctggtaa taacagttgt aaattccatt 13080 tagttaaata atcgtttata ataatttccc aatcttcata actgttttcc aaatgatgaa 13140 attcttgtaa cataggatga agttcgttgt ctggataaaa caactttact gctttttcta 13200 aagattgttc aacctgcttg ctagtcattt ttttttgaaa tgatatataa aaaattttga 13260 tttgtattta tatttttttg ctaatctata tgacttcata ttaaaacctt ataaggaaat 13320 aaaaaacatg ttttttttat aataactggt ttttttatta aatttaaaaa tagaaaaaaa 13380 tacaggtttt ttttaaaata aaaaaagttt attcgtcatc atttttttcg tctttaattt 13440 cgtgttgttt tgatttttta tactctttac tttcttgtgc agctaaatct tgtgcagctc 13500 ttttcatatt ttgattaaaa gatcttgttt gtccgttttt aagtaaagtg ttaatcataa 13560 caggttctac agaagtttgt ctatcgtatt ctgtataata aatttcttcg tcttctttta 13620 acattcgtac gttaaaagtg ctaataggtt gtgattttaa tactttgtgt ttgctgtaag 13680 cataattata tccactgttt tgtatttctt caatttttcc taacatagat tctaaaagtt 13740 ttttatttaa tttggtttga gttgtaatta aaatagcatt aaatgtaatt tccccttcgc 13800 cagttttgtt tttgaaatgg gtatcgtctt tgtttaactt catttctgct ctatacaaat 13860 aattacatgc atcttctgca agggarcatt gttttcgaat agatgctggt aacaatataa 13920 cttcttcttt gcttttgtta gaataaacca attctttttt taaaaaatta aattgtgcta 13980 cgggttttac gggaacttta gaatattgat ttacttttaa atctttttcg tttgccaaat 14040 caatatgcca caatgtagca gctgccatat caccgtattt taaataattt tgcataagat 14100 tgcaataatt gacttgaatt tcgttctcca ttttattgta tatgtttttt taactgttgt 14160 aaagttgtcg aatgatataa aaatgaaaaa ttttttttgt ttatatatcc twttttaaca 14220 atagagatta aaaaaaacaa gatcaaaaca agtttatata aaaaaattgt caacgtgtca 14280 ttttaatttt ttatgatatc atatttttac tttataaggt aatataatag aaaaaatgtt 14340 aattgttact aaatatagaa ttttattgtg ttatgttctg tataaaagga acaaattttt 14400 atttcagaat cacttcaaag catctccaaa gatgagttta gaaagattta atactgttta 14460 tgaaattgca tataatgaat ataataattc atttacaaga atgattaaat tacaagaaga 14520 ttgttcagaa ttaataaaaa aggaaaaatg tcttactcaa gaaataataa atttattaat 14580 tgaaattgag gaagagagaa aactttgttg tatacatcca aataaagagt tgtcgattcg 14640 ttattataaa atatcaaaat ttttatggaa aaatctkgaa tcaggaaact caatttcaaa 14700 agaagaagaa acaaagtatt taccattaac agaactagaa atgaaaaaat tggataaaga 14760 tgaattagct ttgtttttta cgtgtcaaaa atgtcaaaag ttaaatcgtt gtttacacga 14820 ttgtttttgg tgtgaagata gaatgtgtta tttatgttta ttagatcatt cactgcaaac 14880 aatgatgaaa tttgatacaa acaattgtaa atttaattgt taatttgtaa tataattatt 14940 ttttgtattt gtttttagat tagtaaaaaa atgtcatatg atttgaatga ttattgcgat 15000 tttattgaat atttgaatga ttgttatgat aattgttttc gatttttaaa tgttaaaatg 15060 tttatagata tgatggaacg ttattttcaa tttaacaaag tgcacaaatt tactttaatg 15120 gaaatgaaaa acagagctca gttttttaga ttatgtcata gaaataatat tttagaaaca 15180 tataaacaat ggtttgattt ttatcaaaac gaacagaaat atatgccaag atatgctata 15240 atcattattt tggacaagat aacagaattt cctaatattg aatttattga tttttataat 15300 cgatggcata ctattaattc caatcattca ttatttaatc aaacattatt tgaatattgt 15360 tttacggata attttgttag ttctattaat aaaatattag atgaaaaata taagcattgg 15420 atttgcaaat catttgcaaa ttaaacgaat ttattaaaca taaaacgagt atatttttta 15480 tggtaatttg taataaaatt ttgtatttgt ttttagatta gtaaaaaaat gaacaaaagt 15540 aaagaattac aagaagcttt ggataattta acaaatttaa cattatataa aataagaaca 15600 aataattaca aaatgaatga ttcttgtagt ataatgaata tagatcaaga tggtattaag 15660 attgaaagag ctattgaaaa aattaaaaat gaaacaaatg ttagagaaga tttgaaaaaa 15720 gagttatacg atgctcgtga ttatttagaa ttatgtaaaa ctatgaacgt tatgctwcaa 15780 aataaattat ttcattatga acgaatgttg caaaataatg aacaatttat aaaatgttat 15840 tattgtcaaa tttattattt aaaaagtgat gtttattatt atcctaacga tttttttcat 15900 tgttattgtc caacatgtaa actattatac ggttgttaat aaatttttaa tattttttat 15960 agatgaaaac agtaaaaata atagattcta aaggatggtc tttatatgct accaaaggaa 16020 gtgctgcttt tgatttacgt agtaccgaaa gaaaaataat attaccgttt caacgtatat 16080 taatttctac tggtgttttt atatcamaaa tggatcccga tcttcaaggt tggattccct 16140 ccaagtctgg tttagcyctc ttcctccccc tgggggtata acaggatgga attcccctgg 16200 aattatcgat tccgattata aagacgaaat ctctttctcc ctttttttga agcatatact 16260 gttccggagg ggcaatctat tgcacaaaaa ayttttttag gcccctgttt ttttttgaga 16320 aagaatttca tggagggtag ataaaaataa aaatagtcaa atggaaatac aatccatgtt 16380 gtctgatgaa gaaagaaagg gaggttttgg ttcaacagga ctttaatatt tttttatata 16440 attatttttg taatttattt tttagatgtc taataaattt attgaaatgg ttgagcaagt 16500 tattaataat atggagaatg gtgacaattt gataaaagat gtggaaaatg taacactgat 16560 cgagatggac aataattgca aatttattgt atataatggt aatcagtgtc aaagtgatca 16620 atgtaaaatg gatgacaata aaactgattt ttgttatcga aaacaagatg caagtgataa 16680 tgaagaaatc aatgcaagtg ttaatgaaga aaatgaagaa tgtgtacaca gtgattcaga 16740 tgacgaaatt cttgaatgtg tacaaagtga ttcagatgac gaaattctgt tacaaagtga 16800 ttcagatgac aaaattcttg acgaaatatt tgataatata ttatcaaata aatattaaaa 16860 aatctaattt aaaagaagaa ataaaaaatt tagatcttag agttagtctt cttgaagatt 16920 tagtgaaagg aagtgatgaa gataatgaag aaaatgaaaa tttaagaatt cttttgttat 16980 taagaaattt agatattaga gttagtattc ttgaagaacc aactggttta cctgacggtc 17040 aaaagatctt tgtgattaaa gatatagtga aacgaatgaa tattaattcg aatggttatt 17100 gttctacttw ttattctgca ccgttttttt ataaaaatta taaaatgtca ataagaatta 17160 atttcaatat tgatattaat tawaataatg aaatgataca tggcgactat attggtatat 17220 ttgttgtttt aatgycacac atttatgatg ctgttttaaa atggccgttt agtaataaaa 17280 aaggaactat atcattattg ggagagaaaa aatttaaaaa atcgtttata accgatgaca 17340 gtgtttattt tcaaagacct attaaagatc aaatgaatcc tgcttttgga tttggaagat 17400 ttataaacaa aactgatctt ttaaattata ttgtagaaga ttctcttttt attaaatgta 17460 aaattaatta atttttttat atttcatttt ttttttaaat ttgtaatttt ttttaggtaa 17520 ctaataaaaa atgttgtata acatgatagc aaaattaaaa gaaaaagata aagagttggc 17580 tcatcaatgt gaaaagttgg acgattattt aaaaggaata cgtaatgaat attctattga 17640 agatatacaa aaattcaaat cttctgtttg tgataatttg aatagcgata aacataaata 17700 tacatacgat aaaagtacta ttaatctata tttgaaagta tgtacaaagt tgattatcaa 17760 atgtaataaa tctattgatc gtttacaaaa tcaaataaaa aaagaaaaag aatatcaaga 17820 ggaaagaaat gattcgttta ttgaatttat aaaagataaa gacgattggc cgtgttgtgc 17880 taaaaaacaa aaattagaaa attatgtata cgaatttcga tatccgacaa aaacagaact 17940 agttgaaaaa ttaaaaagaa aaatacgtcg cgaattattt tgtttaatta ataaaataga 18000 gttgatatta attgctgatt cttaattatt ttgtaatatg tttttataat aatttttttt 18060 agatgataaa gataattaaa aatatgataa aagatgaaca tattgaagaa ttgcaacagt 18120 tacaactatt tgctgttagc gaaagttttg atcgtgatca atacaaatgt gttaaagaaa 18180 cttttgaaaa atttgataaa gatgtggata tgatcaagtg tgaacataat gaaattgttg 18240 aaaaacttga acttttacat ttatatagca agattgtatt agatttgtat catcagtata 18300 caatttgtat acaatttaca aaatgtaaaa aattacttga aaaaaagata aatgtaatga 18360 atagaatagg aaaattgtta agggaattgt tacattatga ttttgaaaaa tgtcgttaat 18420 tatcttatct ttattatmtt tgaaaataat tttttagata actaataatg tcagatccta 18480 tttgtgaacc atcagatcaa aaaaaacaaa agttagaaga agaagatgaa gttcaattag 18540 agccaaaatt actcaaagaa attcaaactt tacagcaatt ggctgatatt gatgatgttg 18600 atgtaatgat gttattgaaa tcatcgatag attataatgg tttaataaaa agtattaaaa 18660 yraagtataa taatttatta cttggyaaat gttattatgt tagcacgtta aaccaaaaac 18720 gtatttacga tttgcatcat aaatgtttat ttgatttata tcaaaaattt agtttttcta 18780 ctgcaagatt gcatcttgaa atgaaagagt tacgagatat ttgttattct gaaaaattta 18840 gtttttctac tgcaagtttg catcttgaaa taaaaaagtt acttgttgaa gaaaaattaa 18900 aattaaaaaa caatttagcg tataatagat accttgtaat ttgtaaatta tataaaatgt 18960 taartaaatt tgataaacaa gttcgatatt taaatgatga ttatataagt gatataaaat 19020 attttgggtg tgatttaaaa tctttagaga tgtggtgaat tttttgagac gttattgagc 19080 aaaaatataa agctatgatt tttattatat tgtattgata attatataaa tgtaaaaaaa 19140 tattttttta gatggaatta acagtaaaaa tagtaaatga atttatgaaa aaatttttat 19200 cacacaatga aatttgtccg tttacgttgg aagaattgat tattcaagct aaagaatttc 19260 aaccatttgt atgtgaagaa tgtgaattat tgaatgtgag atgtcagaat aaaatttttg 19320 ataacgatga gtggaatatg ttttttcgat acgattttat gagatcagaa gatcagtttg 19380 ttgttatgtt gacacaagga tttaaaaaac atttgttaca tacaatagat cggtttttac 19440 cacaaaactt taaaatattt tataattttt ttaaaaatta tataaatgta gatgttttcg 19500 tttcacacgt taaccaattg aacaatgaaa ataaaaataa taaagttgat aatgaaagaa 19560 aatgttatta ttgtaataat ctttattaat ttgtaatttt tttttcagtt ataatgaata 19620 gtaaagaaaa acatgacaaa gatcttcaaa tatttggaag atgtatacaa ataaatactg 19680 atatacaaac cgatgagtat attgacgaat cagtaataaa tttagaacat aaaaaagcat 19740 acaaaagttt aaaagaaatt cttttagaag atgtggaaat caaatattta cgtgaaaaat 19800 ttcgatatca aactatagct aattattttg ttactagttt gaaggaagta ttttggacag 19860 ctcaatcttt atctagagag aaacgtaata caataataca aactcatttg atttggataa 19920 aaactatgaa tgaactttgt gataaaggaa taataatatc agaagatata attaaaaaat 19980 ataattattg tactgagagt cataaaagat ttttgaaatt taaaattgaa ttaaaaaatg 20040 atatgataga atgtttaaat ttgtatactt gttgtcctta ttgtgaatat aataaaatat 20100 gtcaatattg tatataaagt taaaattttt tttagatgat aacagaagct aaactaaata 20160 atattttgga attgttagaa atggatgcca tttctattcg taaaatatat caatttgtat 20220 ctatggaata catgttgcac gaattacaca aaaaaatgga ttgcgaattt aaagacgaat 20280 ataattgggg ttcattgata tcaacaaata aaagacaagt accaatggaa atagctagtt 20340 ttgtattaat tttagcttta tctaaattta aaaaaggtga aattccaact gatgattttt 20400 ttgaatttta tgattttttt aaaaattatc atgacaaaga aaaatttatt tgccaattaa 20460 atgaaaaaat tgaagatagc aggaaatggt attatgataa acgaaagtta actgtattaa 20520 aactcaatga cgttatgaaa acttttagcg aacgacaatg gattgaatct acagttcata 20580 atacatattc agttggtatg atgaaattta ctttatcaga tttgtacgat aaagctttag 20640 catataaaga aatttgtgga aaatgtaaaa ttattaaaaa tgatgaacaa cttgaaaaac 20700 ttaatgttcc aattccaaaa gattctaaat taaatgtacc aacggattgg atagattttt 20760 tttattatca atttcctcat aattatgtaa ttttattttt gttagaagta ttgttgacaa 20820 attttccata ttataaagac aaagtttttt atacgtttta tatgtttttt gaaaattgtg 20880 ataagaattt aattattgat caagttaatt taaaagtgta tgattatgta aacaaacttt 20940 tttattatta ataaaagatt atttttttta gatggaattt acagaaataa tattaaattg 21000 tgtaatggaa tcttttaaaa aaatttgtga aaaagaaaat tggacagtgg cagatttaca 21060 atttaaatat atgagatata acgcagattg tactttagga aacgtatctg aaatggaaaa 21120 tatttttgaa gaaacaaata gctttgttaa caatcatcgt gttgttgtgt tgatgatttt 21180 acaatatata agacgaaatt ttcatgttta tttaaatgta aattttaaca tgttttatca 21240 attttatgaa aatatgagcg aatcaataaa aaaaatatgg attraagaaa tacaaatgta 21300 tattgatcta tgcataagtt gtagatatta tttaaaagaa gatacgaacg aagacgattt 21360 ttattaattt ttgttatatt aatataatta ttattttaga taaaaaatat aaaatatgga 21420 tatgcgaaga aaatatcaac aacaagattc attaaaaaaa atattaattg aaaaacaatg 21480 cgaagaagtt acaaatgtat tatcgtctac attatgtgaa atgagtaaag aaatagattt 21540 gataataaaa ttacataacg aaagaaaaaa tcaacaaaaa aaaacgaaaa attataatat 21600 ttgaactgac aaataacgaa atgatatcaa ttatgatgtg gtttcataaa tttgttatat 21660 tatatatgta ttcttaaaaa tttttttgta tttttttttc agaaaaataa aaaaaatgac 21720 tttatctgct aaacaaaaat tggcttttca atggtttgat gaaggattaa atttttttat 21780 tactggtagt gctggatgtg gtaaaagtta tatacttgat gctattgctt ctagcgatca 21840 aatatttaaa aatattgcag taacagctag tacaggaaaa gcagctcatc aaattaaagg 21900 tatgactgtg cacagttttg caggtattga aataggtaca aaatctgtcg attattatta 21960 caaacacatg caagtagatg ttttagaaag atggagaaat acacacgttt tgattattga 22020 tgaaatttca atgcttaacg cagaaacctt tgatttatta catcatttag catgcaaaat 22080 aaatcaatgc aatgacgaat tgtttggtgg gatacaagtt attacwtgcg gtgatttccg 22140 tcaactgcca cctgtaaaag gtgaatatgt tttyaagtct gcaatatgga aaaaatacat 22200 gtttaatgtt atcgaactta ctgaatcttt tagacaagaa aatattgaat tttttaacgc 22260 tttaaacgaa atacgaattg gaaaagtatc tgataaaacg gtcgatttat taatgacaag 22320 acattatgaa gytgatcata atataaattc aaatttcata cgattgtttt ttacaaatat 22380 ggaagttgat ttttataact tgcgtcaatt aatgagtatt ccttctgatg aacgttggtt 22440 tcattctaaa gatgttatta aaaatttaaa aataaatcca ttgtttcaaa tacctattag 22500 tataaatatt aaaattggtg ctgtagtcat gttagttaaa aatataaacg ttgaagaagg 22560 tttatgtaac ggtacaattg gcattgtaac atttatagaa acagatggtg tttgggttaa 22620 tataaacggt agagaattta aaatcgaaaa cgtgagagaa gacattttag attgtactca 22680 tagtattatt gcatctagaa tcggtttacc tttacaatta gcattttctt taactgttca 22740 taaagcgcaa ggttgcacgt taaataaaac tgttttgaat tttaacagta aaatatttat 22800 tatgtcgtta ccttatgtcg cgttatcacg tgttaaagat ttaaatgatc tatatataat 22860 tcataataat aaaaacgaat tgagaaaaat attaaaaaat ataacaagtg ataaagacgt 22920 tgatttattt tataaaaatt taaatgaaaa aaattttgta acatgttaaa aaaatatatt 22980 ttttttatta tatataaaat tttctwtaaa aatttttttt tagattatta ataaaaaatg 23040 cattttaaag ttaagaacat tgaagaaaaa ttagataacg aaagttatta tcgagaaata 23100 ttatattttg taaatgacaa ttataaagat gaagatttta ttgaagaaat tgaattaaac 23160 gatcaaattc aaacacctga agaaattact catcacatta atcaaaaaac acaagatagt 23220 tttcttattg aaacaaataa agcgttatca tttgctgatt tatttcaaaa ttttttaatt 23280 ggttataata ttgtaacact ttgtgctaaa tctacagaag acggttattg taaaattata 23340 aaaagatgtt acaattcgcc tgtatattat ataaaaccac tagaaggatc aataaattta 23400 caatatatta aaagtaataa cattcgttca gtttttaaca ttttggaaaa tgatagcgga 23460 tggagaataa ttcgtttatc tagttttaaa gtaaaaacat taagcaattt aactgaaaaa 23520 tcaatgatga aacttagaaa ttatgaactt gctggatata aaagagtttc tgaactgagt 23580 ccagaagaat taatgatgag acgagaatat gaacgtaaac gaaagaaaaa aatttctgag 23640 ctgagtccag aagaattatt gaaaaagcga gaacgtgaac gtaaacgaaa gaaaaaagtt 23700 tctgagctta gtccagaaga aataatgaaa agacgagaat gtgaacgtaa acgagagaaa 23760 aaatattata ctaaaaataa aaacaaaatt cttcataaac aaggtcgcaa gtatataaga 23820 gaagaaaatc aagatgaatg gtatggagaa tggtatcgac ctcattcttt aactcaatgg 23880 aacaaacata aaagtagttc gtttaaagta ttatcatacg atcaaaaaag aaaaatgttt 23940 ttagattata gtattttgtt tgaaaatggt ttgtgttttt attcagcgtt tgttatttca 24000 ttattaataa gaaagtggaa agtaaataca attaaagaat ggaatacaaa agtattggaa 24060 aaaaacggac aagattttta ttctgaagaa aatataatgc aatttcttat cgaagatcaa 24120 aacgattcgt ggatacatgc attaagtctc ttaaaagaaa aaagtaaaaa tttatcattt 24180 ctagaactca ttgatttgtg tcactctttt gcatacgaaa acacttttga tcgaataaat 24240 attaaaattt tttcaataaa taaaaaaaat aaaaacgttc gtttgctaca ccctaaaatg 24300 aaatcacaat ataaaagtgt actatcggag ctattaaaag tggtatacga gagtgcaaat 24360 ataactaaag aagatcaaaa tcctcagaga gaatttttaa acgaagacga aagaaaagaa 24420 gctaacgaac gtaaaaaaac agtaaataca ataaaaatat gcttgtacga aaaaaatcaa 24480 tcttttcacg ctgttagcat gcttaacgaa cgtttttctg ttatatataa taatcatamt 24540 taactgtatc aatgttcaga gtgtggtctc aaatatacta gtcaaattca taaacaagaa 24600 tgtaaaggat tttatgtagc agacaaattt gcatacggaa aaactagaca tttttatgaa 24660 cctcaaaatc aatttataaa cacttctcga aacgtttttc ctttttttat tacatttgac 24720 tgtgaaacaa gtgttcaaac atcaaattta aaagaaacag aacacgttaa acccttgtca 24780 gatctttcgt actattgtca cggcattgtt attcattgtt tatctgaaaa tttagctaga 24840 gaatgttttg gtttagatga aagaaataaa gttataacac agtggatatt ttattatgaa 24900 aatacaattg aaagtttgca agcaaatcct ttaacattac tacctgcatg ttttcaagaa 24960 aatgtttctt tactacacaa aacacataga caaatattgt ggcaaaaatt ggatgcaaat 25020 gacgaaaatg ttgaaaaata taaaagcgaa ctgagagtat atgatttgtt aattttggct 25080 gacactttag ttcgttttac ctacgaaaaa tgtttacctc gttttcaaaa cttaaacctt 25140 tctgaacacg aaaaatgtca agtatggcaa gaagaaaatc ctttgtgttc aatatgcaga 25200 tgtcctgcag acaaaataaa gaaatctgat gtgtacagtt acaaattaac tacaatgtta 25260 gaaccagaaa taaatttatt aaatcataac gaaattagat tttataacga caaacgtgtt 25320 caagttgtaa atcatatatt taaacgagta agcaaattag aattgtttat gcttaaaccg 25380 tcaackcaaa attatttaat aacaaacaaa gtagaaaaaa ttaaaaatga tttgtttcaa 25440 atatctagtt tattttatat ttgttgcaga tggtgggaaa aatttgaccg cactcttata 25500 acaaacgggt ttttaagtga ttatgtaaga ataaactcgt tagatcttgt ygatcacatg 25560 aaatgtttaa gtcaactcgt tatgaacgag caacaaatta acatatggtt tgaaaatgat 25620 cctatagaat tactagaaga ttcttatttt ttaatacgaa tttttcaaaa atgtatcaac 25680 gaagttaacg tgcattggaa atatgaacgt ggaatgggaa taaaagattc tattactttt 25740 ttacatctca tgtctcgttt aaatcatccg tctgctcatt tatggtttca ttgtatgttg 25800 gaaccgttaa atagagaatt tgaaaaaaaa agaacgttaa aagaattaat tcaagatgat 25860 aaaggaagag atgacgtctt aaaagaatgt tacggtttgt tatcatgttt tgaagattac 25920 ttggttattg atcatgatca cttttcaggt acagtttacg gattagctca cgattattgc 25980 aacaaaaact tggcacgtta tagtagtaaa ttagcttgtc ctatctttgc ccacaatgct 26040 tcttttgata atcgtatcat gattgttgaa ggtttaaaaa aaatattaga tgttccttta 26100 tatagctacg gagaagatac tgaagacaaa ccttttaacg cttttttaaa atgttgcaat 26160 atggaaaatt tgtttgttat ttctaaaggt gtaaacaaat ttaatattat ttgcattggc 26220 aaacacataa aaattgtaga cacattgaaa attttaaatt gttctttgga tcaagcgagc 26280 agtatgttat ctaaaaaagc acaaaatcaa attattgaca gcatgttaca ctatttgtta 26340 cctttttatc cccatattaa ccaattggta gacaacgaag aatttaaaaa ttgtttcatt 26400 aaaaaaactt tgtttcctta ttcacaaata tcagataaac ttttaaacat tgaatacaaa 26460 aatatacctt ctattgaaat atttgctaaa aacgatttaa tgaaatcaaa aacattagaa 26520 gacgaaatta aaaaaattaa agattcttat gaaactgtta gcaaattatg gaatttttta 26580 aatttaaaaa atttaaaatc tttattacac atttatacag ttttagatac tgtggttttg 26640 ggttctgtaa tgattgattt tgacgaaaga acatttcaat ttactcaatt tcgaatgtta 26700 cctcatttaa gtgtttttag ttattgtagt aaactcaatt carctatgag tcaaattcaa 26760 atacaatatc cgccctctga agaacacgaa cgtatgattg aaaaaagtgt tagaggtggc 26820 atgtcttcaa acggttcgca aagatatgca attgatacgg attatttcga gaataaaatc 26880 gaagaacttg gattgaaaag taacatcaaa ggatgtttat tttttttaga cgaaaatgga 26940 caatacggag gcgcacaact taaaccgcta ccatttcaaa tgaaatattc ttttgatttg 27000 gaatgtcaaa cattagaaca agtgattgaa aaatatcatc gcgtcaatcc tgacggtttt 27060 tcaacaagtg ctttagttca ttgtcaaatc aaaatggaac ccgaatatca agataaagta 27120 ggaggctttt cacctatggt cgaaaaagat gttatacctt tgaaagatta ttctcctact 27180 gaaattgtag ctaaaagaat attaaaaaaa aatggacaat atactatcga aaaagataaa 27240 actgtcaaac tgttaatgaa attgagttct catgaaactt gcgaattttt agatactttg 27300 atatggttga caacgtgttg cggatgcgtc gttgaaaaaa tttataaagt cttaccttta 27360 tggagatatc ctttagctga acaaatggtc agaaaaaata ttgaaaaaag aaaagaagcc 27420 gtraaaaacg gagacaaagt agtagacgct acagctaaat tacacactaa cggtcattat 27480 gggtttaaaa gtcaaaacac ggcaatcaaa caaacgagtc agtttattaa tcacgaagaa 27540 aaagcctttc aaaatttgat aaaacatttt caagacattc gtcttgtttt gttacaagaa 27600 gaaggtcttt ctgaagaaaa aataaaacct ttgtttccca tacattttca aaatttgttg 27660 tacgaaaacg gtatatttgt taacatagaa cctgaaagat ataaaatgga catgtttgaa 27720 aaaaaaatat gcatagacag ttatgttacc gatcaattta catatgacga gtattataaa 27780 gattttattt ttaataaatt aattgacgta aaaacaaaaa ctataatgtt tagcggaaaa 27840 cataacgaag aatttcaaga cgtttttgaa cttttaaaat cggatataaa catgtacaaa 27900 gacggtcttg tttttgtgag tagcgaaaaa agtaatttgg taaacaagca aatgcgttat 27960 gttgcttctc aaatattatc ttatgccaaa cttaacgtta ctcgttttgc atgggaacta 28020 ggagaagtgt taagaaatca taacgcagaa aaacatttgg tagttacaga cacagattca 28080 ggttgttttt ttattactaa aaaaacggac gacaagcaag aaatttttcg ttcaaaagta 28140 gaagaatgga tttatttttc aaaaattgga aaacaatgga tggattattc caattatcct 28200 aaaacgcacg aattttattc gagtttatat gctaaagaaa gtgatcattt tcaaaacgaa 28260 gttcctcctc cttcttatat tggtcagatt tttgcaatgg cacctaaatg ttacagtgca 28320 catattatca aaaaaaaaag aaattcaaga taaaggtcaa gacgaacaca aaaacagagc 28380 aaaaggatta cccaattata aattgtgtta tacgcattac attaacggat ttaatacgca 28440 tagagccgaa aaaatattca gaaatatgcc taatcaatta gaaaacgaat tacaaccttt 28500 tttacaaaat agattagatt taaattttaa aaacgttacc attaaggaac acgttattca 28560 atttttaaaa aaatttacac aaaaaaatta tgtatttgat gacgatttgt ttactgaaat 28620 acgcgattgt tatcgcttag atccgattcg agaagctaat ttaaaagttt tacccgatat 28680 cgaaaaattt tgtagcgatg aacatataca aacgttactc gaaattgaaa aaaatattat 28740 aaaatcatca gaacgataca atacaatgat tacgtttaat cgtgaagttt taataccttt 28800 attatttgat ttaaaaaaac tataaattgt aataaaaaat ttttttttta attgtaaaaa 28860 aaattgtaaa aaaaaaatgg aatataattt gtttgatgaa atgttaactg ttaaacaact 28920 cgaaaacaaa ttaaaagaaa ttggttttga aaatttttat ttaatagaaa atactgaagg 28980 agaaaaaagc gaatattata cttctaaatt aatatgtgtt aatcttaacg gacgaattct 29040 catacctgac tcatttttag aatgcacaaa atgttttgtt tgttttttta aaaaaataag 29100 ttttgtaggt tgttttgaag acagttgtga tatatgtatt tctcaatctc aatctaaaaa 29160 agtaaattat aactgtaatc atttagaata aaaaatgaat aaagaaaatt atagtcaaga 29220 caaagtcgaa gaaaataaaa aattaatcta caacaactta tttttgttgg aattatcaag 29280 agaaataggt agcgattgga aaagatgtgg aagactttta aacgttaaag aaggagatct 29340 tgaactcatt gaaaatgatt acgataaaac tattaatcga tgttttgaaa tgtttaaaat 29400 aagcgtatat caaaacaatt taacaatgga tcaattgttt caagttttgg aaagtttacc 29460 acgtaacgat attaaaagaa acattttaaa aatatattat taacattatt tattttgtaa 29520 aaaatttttt tttatttttt ctaataaaaa aaaaattttg ttataaaaaa attagtattc 29580 atttgttttt ttattaacga tataaaaaaa atggaagaaa aagaagaaaa aattgaaact 29640 aatcccgtag aaaacgaacc tcctaaaaat gaaaaaaaag attcaagaac gttaactgaa 29700 attttaaacg aaaaaggttt tcacagtttt tatttaaaag gtaaaaatga agaacctgaa 29760 attaaagacg atgtttcacc aacgcaaatc aagtttaata tttttaaacc ttcaaaagat 29820 gtttatcctg aaaacactat tattacaggt cctaccaata gcggaaaatc aacaatggca 29880 cgcgagcttt taaaatatgg taaatatccg gatgcagaaa tgttatttgt tattcaaatg 29940 agagaaccta gcgaatctca acacaaaacg tggcgtgata ttatcaattc taataaaaac 30000 aatttagaag cttatgacat tataacagta gacagtgctg aaaatttaga tcccgttata 30060 caacgcgcta ttagtttttc taaagacaaa tacggtattc aaaacagcga tcatcgttta 30120 ttaaaaacta ttttgttgtt tgacgacatt agtgcttttt gtctcaagag tcaacaatat 30180 actagcgcat gttctagcgg atcgcatcac ggagtaggta ccattactgt atttcatgca 30240 gtaccttcta aatcgagtca agcttggaat aatttggtat ctcattacaa atgtatcgtt 30300 tcttttgaca acaacagcga aattgaaaaa atatttaaaa acgattttcc ttctgctggc 30360 aaaatgcatc acgtcattag cgatttactt aacaaagaaa cttctagtca acacggacac 30420 ttggcattat ttaaaagaaa ttgttctgtt gccagattaa gaactcaaat taccaattcc 30480 aaagaacaaa aagtgtttat acctgttgac gctcaaggaa aattagattt tgattacaac 30540 gaaatgagcg taaacaaaaa aattgttact tttcaatcgt ttccttacgt taaagcgcaa 30600 aattgtaaaa attattatta tctcaaatct catcacaatc cattttgtca agaaaaagag 30660 aaggagaayc aacaaactaa cgacgaagaa gataaaaaca atataaatag cgaaaaagaa 30720 aatcaaataa aaaaaagaaa atggagcgag agcgaaagag ctgctttaat gcttgaagaa 30780 attaaaagaa gaaagagata tggattgtgc gaatcatcag acgaattttc agattctgga 30840 gacgaatcaa acgaatcaga atgattggga cagcgaagaa ctgttgcaac ttttacaaag 30900 cgattacgaa gcgtgcaatc tcaaaacatc aaaacgaaaa gacatgagac aaaaattagc 30960 cgttcgcagt cacattttta tacctaaaat gaacaaacgt aaagatatag acgaaaacga 31020 aattaaaata tggatcaaag aaatttcgac tcgatttaac gctgtcaaaa tgttaaaatg 31080 cgaacttttg tctgatcacg aaagattaca aatcaaaaac gcttttcaag attacgtatg 31140 tttagaaatt aaattgtttg tcaaagaaat tttatatcct tatcccaaat ttaacgaaaa 31200 cggaaaaatt attttaaaaa aagaaaaaat gaaagacaaa aacaaaacca tatcggattt 31260 cgtttacaat aattacgacg ttttaaaaga atatttcaac tctactcgtt ttaaacttta 31320 tcacgaaatt ataaattatt actctaataa attaatctca tttgacacgt tacaacctac 31380 aaaaactgtt attgatttat taagacaaca atttgacatt gattattgtc aaaaagaatt 31440 gtattctcaa aaagatttgt attattttga aatatgaata gatataaacc gtacgatttt 31500 tcttattccc ctttgctcgc tttgcgaaaa gttttggcgc aaaatgctta caacactgac 31560 gtctacaaaa aaactagatt ggaaatggaa aaagaaactg aagaaaaaga aacagaaact 31620 gttccacaaa attctggcat gccagaaaca acaatgaaat ctgacatttc tcatttgatc 31680 gaaagatacg atttaccctc ttcccttatc tatgccgatc cgataaatgy agaagcgcaa 31740 atcgaaggcg aattgcgctt tcaacaagaa ttaaataatt tattacctat tgaaacacag 31800 ctgaacggtg atttgacaac atcgctgatt aaagatcaag aaaaatggaa aacttatctc 31860 aacgtaggag gtgacgctta cgcttttagc aagttaatcg gacaaaacaa atacaacgaa 31920 tttaacgctc aagttgatca aagaaccatt agcgaaactt taagaaccat gagaaatata 31980 aaaaaatatg ttcctcaaga agaaaacaat tctcttaatc ctatgcctcc taattctgta 32040 caatcaagtt taaataactt gtcacaaaaa actactttaa atactgttca acctactaac 32100 aaaaaagttt atcttttaac tcctcttaat cgatttttaa ctttttacga aaaacctaac 32160 gaagcagaag taaccaaact catggcaaat ttagctagat caaatatggc taaattagct 32220 cgtaatttat tagacatgtg taaaaatcaa gtcaatcaca cgtttggatc gaatcaaatt 32280 tctttacgcg tttataaacc tagaggtgaa gaagcgacat taaacaattt aaaaatatac 32340 attcaattta tgaataacgt agtagacgtt gaagaatctt tagctagcat tattaacaac 32400 gttaacgcaa cttcttacga aatcattaat gatcctaaac aatattctta ttacgctaac 32460 gaatttttaa ctttacttgc ttcacctcaa agcgtaggca gtttaagaat ggttgcagat 32520 tattatcaaa ttaaacatct tacttttgct aaattgtgga attatttaaa aaaatataac 32580 atacctttta gaaacgaaaa caaattgaca gcagaaagcg catatcaaga aaccttttca 32640 agtcgtcctt ayactgtaat gacttcagcc ttttttgaaa gcgattcwtt agtagacaaa 32700 agtttttcgg gcaatattgc ttttgatctt ttatctgaaa caaaaaaagt tgtaaactcg 32760 tggttaatac aatacacaga acaatttaaa ggttacaact ttaacatttt aaccgattac 32820 atcaaatcta acagcgttta catgcttgac aaaaatattt ataatttaat gattaacaat 32880 tacaacgaaa acgcaatgca aagtatatca atcggtgaac acatgttata tcttaccttt 32940 aaaacgtttt ttgcaaatcc accttcttta ccatatagag atgtagacga cgtagtacaa 33000 ttacaagaca acgtagtaac cgcttgcata caagcgaaca tgcaaacatg caagattttt 33060 tagaacaaat taaaagcaat tcgaatcaat tttataatac agaaatcaat attaaacaaa 33120 aaacaatcat cgaaatgatg atgataatgg gttctttata cagttctcaa ctctttcgat 33180 tggatactct tttatcggca tataaaaaat gttttgaaat tcaatctaat aattctaacg 33240 aaattaaaaa ttttgcatta attttaaaat cagtcacttc attaaagaca aataatttgt 33300 ttgtgcatcc aaaagttgat atcaaccttt ctcctgtaaa gaaatcattt tttttcaaaa 33360 aagtctgtrt tacctactcc cgaatcttta gttgatacac cgttgtctca atctacttct 33420 aaagataaac cttggttaga atctactcca aaagaagact cttggttata taaagaagaa 33480 aaacctgtcg gtgctaccgc tttacctacg gaagaaaaac ctgtcggtgc taccgcttta 33540 cctactgaag ctacaccaat tactagtaaa gatataaaac ctaaacgttt gttagaaaaa 33600 aaatatccga caagaaaaaa agctaaagaa atttaaaaaa aaatggaaga cgcattaaga 33660 gataaaatag atcgtcttat cgaaattcaa cttgacgatt tatctagtcg attatttggt 33720 ttgaccaact tgtcaaaaat taaaaaaata atgtcacaat ttaacacttt gaaaaaaaat 33780 gtcagcgatt taaaatttta tgaaaaactt aaaacaattt tagaaaattc agtttttgct 33840 aaatgggacg gtatgattac tgcttggtcg catcaaacac ttctttttaa aaaaaccgct 33900 aacgcaaacg ttaaaaaaca agcgccgaga aaaccatacg tttacgaaac gcatcggttt 33960 tcttacgaaa aacctcatca atattttcaa gccgatttgg cagacatgga gagttttaat 34020 tttaatcctg gttatcgtcc cgattattgt ttagtcgtag ttgacggtgt ttcgggtaaa 34080 gtgtatttga gagtaatgaa aaaaaagact gcagaaaaaa cggtcagcag ttttagacac 34140 atttttaaag acattaaacg agacgacgac aaaagtttta tatatttgca aacagacgaa 34200 ggtacagaat tttttaattc aaaaatgaaa gaatggtgcg aatccgaaaa cgtacatcac 34260 tttcattcta aaaattacgg aaaagcctac ttagctgaat cggtcattgg acgtttaaaa 34320 ctgttatatc aaaaattaag agataaaggt caattaaaaa aatataattg gagttattac 34380 attgaagaaa tggaacaaaa actaaacgar aaagaaaaaa catcgagcgg tataagtccc 34440 gaagaattgg atgccgatga caataaaggt aaagcgctgc gtatgattga tcaatacgta 34500 agcaacaaaa aacgtaaaca tgactttaca tccaacatgt tgcatcgcgt agaaaaagaa 34560 cacaaaaatc aaaaattaaa aatattaaaa cctttatctg ttggaacaga atgttattta 34620 agaaaataca aaaaagatgc taaagatact tttagcaaat cgagtacacg aaaccgtccg 34680 tattgggaca cgacaaaacg tgtcatcatc atgaaagtca gcaaacgcac aaacccttca 34740 gtgcaaggtt atttaccggt tcatatttac aaagtaggac aagttgacga tttaaataca 34800 acgtataaag tcaagagaga agatttgtta cctattaacg aagaagataa aaacatttat 34860 tttaaaaact ttaacgaata cgtagacact ttttataaac ctttaaagaa atctttgcaa 34920 aaaaaaatta ttgtaaaaaa aatgaataat ttagtatttg atgatgtttt agatgtaaat 34980 aaatatgatt gtagtacgta tttttttagc gaaattaaaa gtaaagagca agaaaatgtt 35040 cattataacg ttttgaacga ttttagttta cccttatcta ctaacgatct agataaacct 35100 ttttttaaaa acaatactat tatgcattta tcatctatgg acgttccaac taatttaaca 35160 cgcatagacg aaaacatttc gttaacagaa aacagctacg gaacaagaat ggaaacaaaa 35220 ccttctataa aacaaatgat gacagatatt caaaacgaca tgatatctaa tccsaatcga 35280 aaaattcctt tttatcttgt aactaaaaaa attattttgt ataccgattt tgctrttttg 35340 ccaggagaca cgtttcaaaa ttatgaccta ttaagagttg aacatatgga cgcagttcat 35400 ttggaatata tgtgttcaat tgttggttat ggtaaactat ttggtgtaac gtctccgagt 35460 ccgtatcaag acgatttctt aatcagacga ttagaacttt ttaatacgag tttaaacaga 35520 agattgactt ttgcagattt tttttcaaat tatcaaaaca aaagtaataa tattacaaat 35580 caaaaatatt tcaatttaaa atatttattt gacaccaaca cacatttacg ttatgttact 35640 aataacgaaa atactatcat ttcaaaatta agtatagaac caaacggaaa tgttaaacaa 35700 tatatacaaa attattctta cattaatgca ccgaattata attttgtatg gtatttaaaa 35760 aaaaaaacta acgaaaactt aaacaaccct tctaatcttt atcctataag gtctagcatc 35820 tctactaaat caaacgaacg aggattatct tattttggtt tttttttcaa accgaacaaa 35880 tgcatcacaa tacgtagtac aagattttac atcattaata caatatgatt atttatcttc 35940 taaatcaagg tatgataact tgtacaactt tgcattatct actcaaacaa gcaaaaattt 36000 tatgatttac gtaaaacatc cacaagtagc accaggagat atagctaatc ttactcccga 36060 aagagmtgtt attttaaata atacagaaga aacttttgtt cgtcctttta ttcatatcat 36120 ggtaacacct aaatacggag gagctcttgt tcaaaattca caaagcgctc cgcttgttcc 36180 agacacacgc gatgctaata ttttaagtgt cgatttagat gtttatgtta ataacttatt 36240 acctagctca gaaagtgtaa aaacatctaa aatgaaacac attgttcctt ggcaaattgc 36300 ctttttttta ttaaaaataa tgactaacac aaacacaata aaatctgagg gtaacatata 36360 caatatagaa actgatctta acgaaattcc aatccaaatt aacgctttat ggaaaatttg 36420 gaacgatcaa gttaacgtta aagacgctaa caacaactat acagaagaaa aaatgtcatt 36480 gactagtttt atagacattt taaaaaacgc ttttggtact attrgtacaa aacttctcta 36540 catctctcca tcggctatag gtatagaacc tacagaattg tttgcttgct tttatcaaat 36600 cttttttcca aaaacaagtc aaagaaaata ttatcgattt actttacaaa acataccgct 36660 agcttcttta cctatacctt tttttaacgt cgaccaaatc atatggcctg aacgtgaatt 36720 aaaaatgatg gatattttta cagaaagtat aaattacaac ttgtcaaaca atttacttaa 36780 tttaaagtta aataatctta cttttaacga atttatttat gaaggtacta atatctactc 36840 ttatcaaaaa ttcctcaatc aatacaattt ttcatccatt cctaacgact attataaaaa 36900 ttattatact ttttggtgtt taccttcatt tgaacaaaat acatacgaaa tcaatttaaa 36960 atcaccatta caaatttcta aagctataat cgcttcaaca aatacatata acaattattt 37020 tgaaatgaaa ggaaaccaaa tttcaccgta tcatgctcaa tttggttcac tcaacacttt 37080 tactacatct ccttctaatt tacccgaaga catttataaa ataataacgg cagattggtc 37140 attgattgct aaatacaatt cagaaacatt tatcgaagtt ttgatgtact gcgaacaaac 37200 taaatctatt ttttctccaa ctttaaaagt acctcttgtt actacggcag cttttgaaag 37260 agctactaca cctagttata aaacgtggtt aaacggatca aaatctatat tgcaaggtaa 37320 caacattaac gaattggtat tttattgcac ttcgctcaac ggacaactta tttatcaaaa 37380 aaaaagtgac ggtactatca taaattcttg gtctaatagt ttaccattaa aaaatattat 37440 tttaaaaatg tatcaaccca ttggaaacaa tcctgtacct tatacattgc tattacaaag 37500 agatcaatac agtttgacaa acgaaataaa tccgtttaat atagaaatag atcaaaatat 37560 ggtttatacc atggctcaag attttaaaat tcaaaaaaga aaactcgctt ttttaagatg 37620 tactttgatg caaataagta actttaataa ttctgtatct aattttccta ctactaccga 37680 caaccttata tttattaaat gcgatcaagc taccaattgt ggaatgtaca tgaacagaag 37740 atacgaaacg ggtattgttt cttattttaa tttgtcagat tataacaatg ctactcaagt 37800 aacagtcaat atcgatttta ctaaaaatcg caaaacagaa atacaaggac aattaacaac 37860 gtttgatatt gtwgacgtag aagattggaa aaaaaccaaa atggwctttt ttkaacacta 37920 ataaagaacc gctcaaattt aacaatttaa gtactactgt tttttttaat cctactttca 37980 aaatacaaat ggtaccattt tataattagt tttaaaaaaa atgttaaaag gaatggatcg 38040 atttgattgt aattgtcttt gttgtaataa aacaaagttt tacaagataa aaaaacaaat 38100 ggaaaaaatg aatgaggatg ttcaatcaca aatacaatcg atgcaatacg aaacgttcaa 38160 gttaagagaa acgattaaaa atcaaaaaga agaaatacaa aggatttcta atgaactgct 38220 ttaaaaaaac aaacaactag aattaacgtt atgaaaataa agaacttgac cttttacatt 38280 tgtataaaat aaagtgtttt aatttgatta ctataatttt tgttataaaa agatgtcttt 38340 ttttatttta aaaaaaatta ttttttaaaa aaaaatgtct ctttttcgag caagaggaag 38400 atatgaaaaa aatttaataa acgagaaaat acctgaaact ttaataaatc cacaaaatag 38460 tgaaatmaat gaaattcaaa acaaatttta taatcttcaa aatcaamtgt ctcatttttc 38520 aactaaagca gacgttcttc gcatcaaatc agaattggat gaaaaatgcg aacaattaaa 38580 aaaattaaaa caacaacaac aagaaaatat acaaaacact tatgaagaaa aacctcaacg 38640 taaaaaaaat gttattaaaa aaaacgaaat tgaaaaacca aaacgtaaaa aagtaaaata 38700 ttcagaagaa gaaagtgaaa gttctgaaga agaatatgtt gaaaaaaaac catcaaaaac 38760 taaagttgcg ttaaattcac acgtatgtgc tcaatgtgtt caagaagtta aagctgtaga 38820 cgctattcac ggtttgtgca gaaactgtat tatacaacaa agagctattt tatccagtcc 38880 ttttatgcaa aataacattg aaaaaaaaag caaaaaagaa aaaataaata aaaaaagttt 38940 aattgaactt ggttacacgc gtgctttaaa agacgttaaa aacaaaaaaa taccatttaa 39000 aaaaaatact tctgactcgg aagatgaata aatagaatta aaaaaatatt ataacaaaaa 39060 attttattta ttttattttt caaagaaaaa aaaatattat atcaagatgg gttctttagc 39120 aaaatatatg agagcgggtg acggtaaccg ttatacgcct aaccataacg gtcccgttac 39180 cattgatgac gttttacaaa aagcagaaga cgaaaatcgt cttttgcaag aagcgacaag 39240 cggtaacctt actttagaaa tgaaagcagc attaaacgaa aaatatcctg gttttatgac 39300 tgctgtagct caagacgaag tcattaaaac agatattcaa aaatttacac ctgatttaac 39360 ttcagaatta aaaagtttaa ttacgtttac tatgaaattc aacgaatgga caataccgga 39420 agattatatt tatgaatttg atgttcattt gtttaaaaac ataaacacaa aaacaagatt 39480 taatacaaca gcaaatcacg cagatcttgc attattagaa actattttgt ttggtmgaaa 39540 attttgtttt tcatatatac aaaaatataa gagtatatcc aaattatcaa aaaaataact 39600 ctatgcctgc taataacgtt ataaataatc gagcgtacca acgattcaaa acaatgtttg 39660 ttaaaaaaag ttatatgaaa gcatatagag agcacatgct taatcataat ttagcattta 39720 cagaaaaaac aattaacaga ccaacttggg atgaaacaag aaatcttatg aaatctaacg 39780 atsctaatgc ggtcataaga gcattaaatg gtaacgtaat acctgcacct gaatatcaaa 39840 catttttaca agatacttat aatgctttga tgaacggaga aaaaggtttt cgttttagag 39900 tacccttatt tttgttatgt gaattttttc aaattaaaag ttattacgga aaaggtaaag 39960 aattaaaaat tgaatttact gttgaacaag atactaatca acttttagaa tatataagac 40020 ctataactga tgctcaaata gaatctttag ctaatgttgg attagaaatt agaaatcctg 40080 taatttgggt caatacttac cgaatcaaag ccagtttacc tttagaaaaa tctttataca 40140 taaacagttt accagaccaa ccttatacta tagctcgttt tgctcattac gaacaagtta 40200 taactaacgt agaaggtaca gataccgtta ccataagctt aggaaatata aaaacagctg 40260 acatggtacc tcattgtgtt ttgttttctg ttcaatcaaa aaaagaaagc gatcatctta 40320 ccaaatcttc tatgacacct gttgaaatag cttgcactat gattaaaaaa attacttttc 40380 ataattttag aagaactttt gtcaattctt caggtgacac ttatgtaatt gatttagaaa 40440 acaaaccaga agatcaacaa ttatgttatc gtagttatgt ttcttggtgt aaaggaaata 40500 ctccttctac tcttagtatt aataattttg cagattttta taccaatgac gatgacagtt 40560 ttattagaaa tcctaatgat tatttcagta aaacgcaaaa tttaactgaa cctttagcaa 40620 ttgacaccac tagcgattat aattatgtta acgacaaacc tccggcaacc attcaaggaa 40680 acaattcctt acaaattacc gttgattttg ttaccaattt taaaggtatc ataactctgt 40740 accttcaaca tgcagcacaa ctgattcaaa ctggaacagg agaagacgaa aattacacat 40800 attatccttt taaatttaaa tcatcataac accttttaca acatttttgt tttagttaaa 40860 ttaaccatgt tttaattgta tttttttatg cacgcacgac tatttaattt gaaaaaaata 40920 taaaaaaaaa catttatatt tgttctttct caaaaytaat tgtatggcgc gaagacggta 40980 tcgtaaaaga agaggtcgca aacatcgagg taaaggaaaa attataacta gagttaaaaa 41040 tcttgccaga cgtttactta gatcagacgt gggaggttta gcgttagatt atttgcaaaa 41100 acaacgtaat gtgtagattg tgtagataga aaaaaaagaa ttcataaaag acgaggaaaa 41160 ggaagagttt atttacctcc gttaaaagga gccggtagaa ggaaacgaat acgcggtaaa 41220 tttcttccga tattaggcgc tttattgggt ggtagtttac tatcttctat aatacgttaa 41280 aaaaaacact tgtacaaaaa aaaattattt ttaaaatgcg taaaagacta agaggaaaaa 41340 gacataaaag acgtttaaaa agacgttaca aaaaacgaat gcgtggaaaa tttttaggag 41400 caataggtgc aattgcttca agacttttac caagactttt acctatagta ggaagagccg 41460 ctttgggtgg tgcagtttcg ggaggaatat cttatggaat caatcaaatt ggaaaataaa 41520 aaaaaaacat ttctttgtat cattttaatt tattaaatat tgtattacaa ccgtatacat 41580 gctcgtgttt taaattaaat ttttcgtaac ccttataata attaataaaa ttttccaagc 41640 gttttaataa tacwactaaa cgtttgttat gtatatcaaa tttccatatg ataaacttcc 41700 aatttatagg agaataacaa taatataaaa tttttttcgt atactattat aaattcataa 41760 agttctttta aatttgtcca tcatttcttc aattgattta tattttctta gtattattta 41820 atacttctct acaaatcttc tccaattcgt aaaggtgcat actcatttat tttgattata 41880 agcacacaaa caatataatg aagtattcca aaaataaaaa ataatatcat taataatatc 41940 agcaacggaa acaccaaaac cattaacaaa aatgcaatct ttccagaaac gagtaacatc 42000 aattttgcta tctttcccaa ttttaaataa ttatatawtt ttaatgcatt aagtttaaca 42060 aagtttccaa cacrttgaaa aaacttgcat atttgtatac cacttttttt aataaaattt 42120 aaaacttcca tttaaaaatg taaaaaagaa cttctattat tgtaaaaaca aatataacag 42180 ctttaacgaa cttcccaact atgtatttga ttaccatgtg aatcgtacat atacggattt 42240 ggttttactt cttctggctt gtacggtttt acaactgcaa caggttttgg tttatcgaac 42300 tgttgtaatg gtttgttctt tgattccatc tgatttataa aattaaaaaa catttgcttt 42360 tttatatata tttttttata acgctagatt aactttaaac aaaaaaaaat taatcttttt 42420 tttttaaaaa aaatatcgga ttataacaaa aattatgacg ttatcggatt atgacgttag 42480 ttatgacgaa ggtcgggatg gaaggttggg tgggaggaaa gtgaataaat cctatatggg 42540 tagatcatca 42550 // ID Jockey-5_CQ repbase; DNA; INV; 4434 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4434 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 116-116 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 339..1526 FT /product="Jockey-5_CQ_1p" FT /translation="MDTEGSTSEGGDGSTKVIEVNIPTSNQFDGLEDEQGD FT DPISLPPNPTPPPVLRIGEPAKGKKVRVPPISVVGKSTRQLREFLGQSRIQ FT QTAFNMKATKTGVQLICSGEDTFRSALAALRGANIQYHTYTPAAEQPMKVV FT LSGLPVYDEAELETELAVLGVHVQELKLFSRKVAGLEESALYLLHFAKGTV FT KLSDLQKVKAVFNIVVRWRYYERKPTDAVQCHRCQRFGHGMRNCNLAALCV FT KCGEKHLSAECRLPNKADLARVDKNVTREAIKCANCSGQHTANYRGCPTRK FT NYLAKLAEKKAQLRNAKPPILTPPRNTVPQNGSTGNDSPVGTSTMTFAEAL FT SQGSSNNTNSDLFTMSEFLALAREVFARLKSCKSRLDQLEALVELTAKYIY FT SV" FT CDS 1456..4188 FT /product="Jockey-5_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="KAANRDSTNSKLSWSLQPSISTVSNAVDILRVANWNG FT RSVHGKKLEFFDFLERHDVDVGIVTETWLHEEQAFYHPKFRCVRFDRDSAD FT AERGGGVLIAVRKGLTHTELDFSTKVIETVGISVSTADGNVHIIAAYFPGA FT RRRSAWTQFRRDISTITRGAEPFFVVGDFNARHRSWNCARANXAGTVXFQE FT AASRNFFVHFPDAFTFHPXGRGRPSTLDLALSNNLLDMTKPVALNELSSDH FT RPVLFDVSLSAPIEQSSPKFRCYARADWPRFQREINAKLDLLNPEVTDLNT FT EADVDAAVEFLTKTLLEPKLFQFQKFNSVATXLXQSRGHPQADRAPQHAKA FT SMVQETGPDLPGHRVVTKPXNSEECNQANFNKFKDTLRTLHDDRDTLWRIT FT KALRKTTKYSPPLRXGTNIIASSSEKAXLLAASFAXAHTNQMPDDPVTVAE FT VNNSIDXIDRTPLADNHSWLVRPKEVAQLIRRLKAKKTPGQDQLRNIVLKR FT LPRKAHIFIAKIFSACVRLGYFPAAWKHAVVIGIPKPNKDATNPSNYRPIS FT LLPTLSKLLERVILVRIDQHLENTRVIPDAQFGFKRGHSTNHQLVRLVKEV FT RANFARRKSAGMVLLDVEKAYDSVWQEAILHKMYLANFPLYILKIVRSFLQ FT NRSYHVAVNGHVSDRHEVLFGVPQGAVLSPTLYNIFTADLAMINGVSYFLF FT ADDTGFLASDSDPAIVVTKLQAAQNAIERYQKKWKMKTNAEKSQAIFFTRK FT RSPRYLPRREVSVWGSHVPWSEDVNYLGAALDKKLTLAKHIKNATTKCGKI FT TRMLYPLVKRRSYLDRNSKLLLYKTVLKPAMTYAYPAWHDCAASHRKKLQV FT KQNRLLKMVLNLDPFHPTDDVHRMAKMELIDDWLLRVLPKFWMGCATSANP FT LLQRMSP" XX SQ Sequence 4434 BP; 1076 A; 1322 C; 1197 G; 821 T; 18 other; gtatccaaat ccgtgacgcg agttgacacc ctggcttggt agtgtaaaca aaccaacgtt 60 ttcgtggtgc atattttttc gattttttcg ctgttttacg gttcggttac ctgcgaatcg 120 gcatcctgtc ggaccaggtg aactccgcgg acacgggagt gcaaaatgcg gccgaagaaa 180 cggcgcaaac ggtacaagtg aacagtgcac aaaaacgcaa gcttgacgac gaccctaacc 240 tcgaagagga aacaccgacg aagaacccgt acaacctgag gtcgacgacg gcggcagcgg 300 cagcagcagc agcggcggca gcagcagcgg cacttgcgat ggacaccgaa ggcagcacca 360 gcgagggtgg tgacggttca acgaaggtga ttgaagtcaa cataccgacc agcaaccagt 420 tcgacgggct ggaggatgag cagggtgacg acccaatttc gttgccgcca aatccaacac 480 cacccccggt tttgcgaatc ggggaaccwg caaaaggaaa gaaggtgcgt gtacctccta 540 tctcggtggt cggaaaatcg acccggcagc ttcgcgaatt tctggggcaa agccgaatcc 600 agcaaacagc gttcaacatg aaggcgacga agactggcgt acagctgatc tgctctggtg 660 aagacacctt ccgcagtgca ctcgcggccc ttcggggtgc caacatccag taccatacgt 720 acacccccgc tgctgaacag ccgatgaaag tagtcctttc gggactgcca gtgtacgacg 780 aagctgaact ggaaactgag ctggctgttc tcggcgtcca cgtccaggag cttaaactgt 840 tctcgcggaa agtggccggg ctggaggaga gtgcgctgta ccttctccac tttgccaaag 900 ggacagtgaa gttgtccgat ctgcaaaagg tgaaggcggt gttcaacatc gtcgttcgtt 960 ggcgttacta cgagcggaaa ccgacagatg cggttcaatg tcaccgttgc caacgcttcg 1020 ggcatggaat gcgcaactgc aatcttgcgg cgctgtgtgt gaaatgtggc gagaagcacc 1080 tctcggcaga gtgcaggctg ccgaacaaag cggacctggc gagggtggac aagaacgtga 1140 cgcgagaagc gataaagtgt gcgaactgca gcggccagca caccgcgaac taccgtggct 1200 gtccaactcg gaaaaactac ttggccaaac tggccgagaa gaaagcacaa ctcaggaacg 1260 ccaaaccccc aatcctgacc cctccccgca acaccgtgcc gcagaacggt agcaccggca 1320 acgactcacc ggtcggtaca tcgaccatga cattcgcgga agcgctgtcc caagggagca 1380 gcaacaacac caacagcgac ctgttcacaa tgtctgagtt cctcgccctt gcgagagaag 1440 tgttcgcccg gctgaaaagc tgcaaatcgc gactcgacca actcgaagct ctcgtggagc 1500 ttacagccaa gtatatctac agtgtctaac gctgtagata tcttgcgcgt ggctaactgg 1560 aacggacgat ccgttcacgg aaagaagcta gagtttttcg acttcctgga acggcacgac 1620 gtggacgtgg gaatcgtgac cgaaacctgg ctgcacgagg aacaagcctt ctaccatcca 1680 aagttccgtt gtgttcgatt cgaccgcgac tcagccgatg ccgaaagagg cggcggcgtc 1740 ctgatagcgg tccgcaaggg cctaacgcac acggagctcg acttttcgac caaggtcatc 1800 gaaactgttg gcatttccgt cagcacggcg gacggaaacg tacacatcat cgctgcctac 1860 ttccccgggg caagacggcg ctcagcatgg acacagttcc gccgcgacat ttccaccatc 1920 acgcgggggg ccgagccttt cttcgtggtc ggtgatttca acgctcgaca tcgttcctgg 1980 aactgcgcwc gggcgaacam agcggggaca gtcmtgttcc aggaggcagc cagccgcaac 2040 tttttcgtcc acttccctga cgccttcacs ttccacccgk cgggccgcgg tcggccctcg 2100 acgttggact tggcgctctc caacaacctg ctggacatga cgaaaccggt cgccttgaac 2160 gagctctcgt cagaccaccg gccwgtgctg ttcgacgtca gcctgtcagc tccgattgag 2220 cagtcatccc ccaagtttcg ctgctacgct cgtgccgact ggccgcgctt ccaaagggag 2280 ataaacgcga agctggactt gctgaacccc gaagtcaccg acctcaacac ggaagccgat 2340 gttgacgcmg cggtagagtt cctcaccaag accctgctgg agccgaagct gtttcagttc 2400 cagaagttca acagcgtagc taccamactg ckacaatccc gaggacaccc gcaggctgat 2460 cgcgctccgc aacacgcgaa ggcgtcaatg gtacaggaga cgggacccga tctaccagga 2520 catcgtgtcg tcactaaacc gkcgaattcg gaggagtgca accaggcaaa cttcaacaag 2580 ttcaaggaca cgctacggac gctgcacgac gatcgggaca cgctttggcg aatcaccaag 2640 gccctgagga agacgaccaa gtacagccct cccctgcgcm agggaacaaa tatcatcgcm 2700 tcctcatccg agaaggccma actactggcg gccagcttcg ctmgcgctca caccaaccag 2760 atgcctgatg atccagtgac tgtcgcagag gtgaacaact ccatcgatkc catcgatcgg 2820 accccgctgg cggacaacca ttcgtggttg gttcgtccca aagaggtggc gcagctcatc 2880 cgaagactga aggcgaagaa aacaccgggc caggatcaac tgagaaacat cgtgctgaag 2940 cggcttcccc ggaaggcgca catcttcatc gcmaagatct tctccgcctg tgtccgactc 3000 ggatacttcc cagccgcgtg gaagcatgcg gtggtgatag gcattccgaa gccaaacaag 3060 gacgccacta acccctcgaa ctatcggccg atcagcctgc tgccaaccct cagcaaactc 3120 ctggagcggg taattctggt gcgcatcgac caacacttgg agaacacgcg ggttattccg 3180 gacgcccagt tcgggttcaa gcgagggcac tccaccaacc atcagctcgt tcggctcgtk 3240 aaggaggtgc gcgcgaactt cgcgcggcgc aagtcagccg gcatggtgct cttggacgtc 3300 gagaaagcgt acgactcggt gtggcaagaa gcaatcctgc acaagatgta cctcgccaac 3360 tttccgctgt acatcctgaa gatcgtccgc tcgttcctgc aaaaccgatc gtaccacgtt 3420 gcggtaaacg gacacgtgtc ggaccggcac gaagttctct tcggcgtccc gcaaggcgcc 3480 gtccttagtc ccacgctcta caacatattc acggcggact tggcaatgat caatggcgtc 3540 agctacttcc tcttcgccga cgacacgggg ttcctcgcgt cggactccga cccggccatc 3600 gtcgtgacca agctgcaagc cgcccaaaac gccatcgaac gctaccagaa gaagtggaag 3660 atgaagacaa atgccgagaa gtcgcaagcg atcttcttca cacgcaagcg aagcccacgc 3720 taccttcccc gcagggaggt ttcggtgtgg ggtagccatg tgccatggtc ggaggacgtc 3780 aattacctgg gggccgccct ggacaagaag ctgacgctcg cgaagcacat caagaacgcc 3840 acgaccaagt gcggcaagat caccaggatg ctctacccgc tcgtgaaacg gcgttcctac 3900 ttggaccgca actcgaagct gctgctctac aagacggtgc tcaagccggc gatgacgtac 3960 gcatacccgg cgtggcacga ctgcgccgct tcacaccgga agaaactgca agtgaagcag 4020 aaccggctgc tgaagatggt gctaaacctg gatcccttcc acccgactga cgacgtccac 4080 aggatggcga agatggagct gatcgacgat tggctcctgc gagtgcttcc gaagttctgg 4140 atggggtgcg ccacctcagc caaccccctg ctgcagcgga tgtccccttg agctgtgata 4200 ttagaataag aaatgcctcc tttcctctaa ctatccccca tctattagca atttcgaagg 4260 ttatttttgc aattttttcc ctcccctgcc caattcgttc tcccgtgtac cagtaaactc 4320 taacaccgtt tgaactagtc agctgttgaa aggtacccca tatctcagct caggataact 4380 actgcaccga tgtttttgcc aaagcaaccc aataaatgaa atgaaatgaa atga 4434 // ID Homo6 repbase; DNA; INV; 3944 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo6 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo6. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-3944 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1362..3167 FT /product="Homo6_1p" FT /translation="MPPQKRNFNINKYFDASDDDYVKCKKCERLLKKDRVF FT NLKKHLLLHEIQIDDKGSEISDSVATSKNLKHQLVKVKINRKTLIRSYIGL FT VTEESVPFHVLNSINMRNIIDPICQGLEAKHGINFKLNGSKCKSVLKHAAA FT NVRREIKEESKNRLLSLKIDSATRLGRNIFGISAQFISKNEIKSQILGMVE FT LKGADSSKSKNLAKEIISVLEKYEISLGQIVSITSDNGANMLKTSKILSQI FT LYTSDEDLDETFIEQKNIEYIRQIHNFEKNENMFVTDIQICRCAAHTAQLC FT ALDVTKHAVIKKCLFQCRNLTEYIRKSSNGYRDLFELKNFKLPQLDCATRW FT GSTFNMVKKLHNAKSILDIIESVENQTEEENFEANDALWDFMESYITSMAP FT LQKAIVQFQAEQLHYGNFYAVWLTCKLATNQILSKNSSNNDSLISIIASNL FT LQSMEARTKTLVSNVGFDACLYLDPRFHNIIDNSQKERAIDFLNTLWEKIK FT IFGSNPSMTYSNNSLDTDAGHTKDNNFELLDDFLSASMVVPNESTNVHAKI FT ETLKLPVMKANTDVLCFWKYKKSTDPELYALSKVCFAIPPTQVSISVLINT FT KIQS" XX SQ Sequence 3944 BP; 1290 A; 665 C; 743 G; 1246 T; 0 other; gctctcgcta ccgctctcat ttttgacgag agccgtagca gagttgtagc ctaacaataa 60 aaagtttgtg ctctgctatg ctctcagttt cgtctcgtca aaaactacag cagtagcgag 120 tagcacaaaa acaactgatg tgctacgctc tgccgtagcg gtagctgaaa acagatagcg 180 aaagcgatag ctgaaaacag cgaacgagat gaaatcatag cgagatgaaa acggagcgag 240 atgaaaacag agcgagatga aaacagagcg atatgaaaac agagcgagat gagagtagga 300 caaacagcag atttttatac cctgaaccca ttttcaatgg gaaatcaggg tataatggtt 360 ttgttgaaat gtatgtaaca gggagaagga ggcgtggcag accccacaaa gtatatatat 420 tcttgatcag ggtgacgagc tgagttgatt tggccctgtc cgtctgtctg tccgtctgtc 480 cgtctgtccg tatgtatgaa cgcgtcgatt tcagaagata taagagctag agacttcaaa 540 ttttttttaa aatgtccgta attacagcgc gcatatcaag tttatcatta aaaatcgata 600 actcatctcg atcttttttt atcgataaaa aaccttatcg aaaaccagtt ataaaattat 660 atcttctaca cctttattgc tagagtctcc aaatttttaa tgtatgctta tatgcaattt 720 atacaatttc ggtttttgtt aaaatttacc ccccacctct aaattcaccc cctcaccctc 780 acttttgtaa aaagcgtatt tcaaatacta gagacttgtg ctttggcata ttcatgttat 840 aatcataatc taaaagtttc atgaagatcg tacgtccata aaaaaagtta ttaaacaatt 900 aagttctcat agtaagcgca cactgcagcc catgcgtgtt tatgcgtggt tgcttgtatg 960 tgtatgtgtg tgtgtatgtg tgtgtgtgta tgcgtttgtg tctgtatgcg tgtgtgtgtg 1020 tatgtgtgtc tgtatgtgtg tgtgtgtatg tatacctgtg tgtgcgtacg tgcgtgtgtg 1080 tgtgtgtgtg tgcgtgtttc ttgtgtaagc tgaaagtcgc gaaaaagtat tgaaatatct 1140 attgctccaa gggttcaggg gatctcgtag tcgagcagtc tcgactagag ctttcttact 1200 tgtttttgtt tgcttttact gcaatttttg gtcggttttc gaaatgcacc gttacttttt 1260 tcattttgct ttgtggtaga gagaaaattt gacacttttt cattttgctt tgtggtagag 1320 agaaaatttg gcactttgaa gttttaattc aatgtgcaaa catgccaccg caaaaaagga 1380 attttaatat taataaatat ttcgatgcaa gtgatgacga ttacgttaaa tgcaaaaaat 1440 gcgaaagact tttaaaaaag gatcgtgttt ttaatttaaa aaagcattta ttgttgcacg 1500 aaatacaaat cgatgataaa ggaagtgaga ttagtgattc tgttgccacg tctaaaaatt 1560 taaaacatca actagtaaaa gtgaaaatta acagaaaaac attaattcgg tcatatattg 1620 gccttgtaac tgaagaaagc gtgccattcc atgtgcttaa ttcaattaat atgcgcaata 1680 taattgaccc catatgtcaa gggcttgaag caaaacatgg aataaatttt aaattgaatg 1740 gaagcaaatg caagtctgtt cttaaacacg ctgcagcgaa tgttcgacgc gaaataaaag 1800 aagagagtaa aaatcgccta ttatctttaa aaattgatag tgcaacgcgg cttgggcgaa 1860 atatttttgg catcagtgca caatttataa gcaagaatga aataaaatcg caaattcttg 1920 gcatggtgga acttaaagga gctgactcca gcaaatctaa aaatctagct aaggaaatta 1980 tttctgtttt agaaaagtac gaaatatctc tgggccaaat cgtatcaata acatcggata 2040 atggggccaa tatgttaaag acttcaaaaa ttttgtcaca aattttgtac acaagcgatg 2100 aagatttgga cgaaactttt attgagcaaa agaatattga gtacataaga caaattcata 2160 attttgaaaa aaatgaaaac atgtttgtga ctgatataca aatatgccga tgtgcagccc 2220 acactgcaca gctttgtgca cttgatgtta ccaaacacgc agttataaag aagtgtttat 2280 ttcaatgccg caaccttaca gaatacatcc gaaagtcttc aaacggctac cgtgatttat 2340 ttgaacttaa aaattttaaa ttacctcaat tggattgtgc tacaagatgg ggatcaacct 2400 tcaacatggt taaaaaactt cacaacgcaa aaagtatatt ggatataatc gaatctgtgg 2460 aaaatcagac agaagaagaa aattttgaag caaatgacgc tttatgggat tttatggagt 2520 catatatcac tagtatggct ccactacaaa aggcaatagt tcagttccaa gcagagcagt 2580 tgcattatgg aaatttttat gctgtttggc taacttgcaa actagccact aatcagattt 2640 tatcgaaaaa ctcttcaaat aatgattcac taatttctat aattgcaagc aatcttcttc 2700 aaagtatgga agcacgaaca aaaacattag tgtcaaacgt gggcttcgac gcttgccttt 2760 atttggaccc cagattccat aatattatag ataattccca aaaagaaaga gcaatcgatt 2820 ttttaaacac attgtgggaa aaaattaaaa ttttcggttc caatcccagt atgacatact 2880 caaataattc attggataca gacgctggac acaccaaaga caataatttc gagctgttag 2940 acgatttttt aagtgccagt atggttgttc caaatgagtc aactaatgtc catgcaaaga 3000 ttgaaacact aaagttacca gtgatgaagg cgaatacaga tgttttatgt ttctggaaat 3060 acaaaaagtc aaccgatccc gaactttacg ccctaagcaa agtatgtttt gctataccgc 3120 caactcaagt aagtatttcg gttttaataa atacaaaaat acaaagctaa aaacaatata 3180 tacatttctt tacaattgca ggttacaata gagagagcgt tttcctcgtt gaaactggtg 3240 ctgtccgata atcgaaatcg attaagtcat gagactttgg agaacatact tttagttagg 3300 cttaattcga atcatttaga tagtgctata gagaatttaa gtttatttga aaatgaagat 3360 tagtatatgt ttatttactt tagtttctaa tttcgtcact gaaataggtt ttttttttaa 3420 ttttttattt ttttttttaa tgtctttatt tgaaaatgaa tataaataaa gattttgagt 3480 ataatttcta atttagtaac tgaaaaaatg ttttttatta tttgatttct gtaactacac 3540 aatttatccc tacgttaatt tgacatcaat caaaaattga agtaaaaata aacacaaaac 3600 aaaatgcggc tgctcaaatc gctcccattt gccagatctg acggcaaaaa atctaaaatg 3660 acagccctgc tcatatcgct ctcgcacatt tgttttactc tcatctcgct ctgttttcat 3720 ctcgctatga tttcatctcg ttcgctgttt tcagctatcg ctttcgctat ctgttttcag 3780 ctaccgctac ggcagagcgt agcacatcag ttgtttttgt gctactcgct actgctgtag 3840 tttttgacga gacgaaactg agagcatagc agagcacaaa ctttttattg ttaggctaca 3900 actctgctac ggctctcgtc aaaaatgaga gcggtagcga gagc 3944 // ID CR1-110_AAe repbase; DNA; INV; 4532 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-110_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4532 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1198-1198 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 100..1374 FT /product="CR1-110_AAe_1p" FT /translation="MNCEICALDASADSVLWTCAGCPRKFHAACIGVTVQR FT SSLRRKDRKLIDFTSYVLPCCESCQELVQAKLDIKKLNEEHKLLTEQLHAN FT TEVLHRFNLNNEKPSIINEAFEGLEILMSAIKNELAIINKSSSLAGSVATI FT KNHITTVLDIVAQKNNDYLTNTLTSFKSDVSTELRNINDDLCQINQLQLDM FT AATTSASMNPNLFVDIVDELKAWSANILSTKSIEPLTHEAYPSLEVEMEKS FT EDTSGWRLLGDKKIWKADWTSYDARKMHRERQQKLAEKAKQKRKRRNRLPQ FT NTQQPARSKYMNKSHPRPTTVNGRNYNEPSYNRCSIIHQRNTSRNCYNRSN FT TNAAGDIDFRTNTPLLDRELLAAAKERFSRPPPTSYRPTIAFQRGEVLNPY FT PSDRQPNTPHRTNAAPARLNGRCSCQCFCQN" FT CDS 1287..4457 FT /product="CR1-110_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RQTTEYATPNKCRTSTFEWKVLLPVFLPELTHFTEEG FT NISILTRNECIANNLGYDEGSNDCGNHSSCDRNCINLNDEHLNESDNARHD FT VDLPKASEALVYCQNFNRMRGSSKIKQIHKNILASSFSIILGTETSWDESI FT KSEEVFGCDYNVFRDDRDFLMSQRKSGGGVLIAVSSKFNSEIICSSKFKEF FT EHVWVKVNIENEKHVFASVYFPPDHANKHAYDSFFKNAEDIISGLPPEVKV FT HIYGDFNQRNADFIPDFENEGIMLPVVGENETLQLIFDKIAILGLNQINHV FT KNRQNCYLDFLLTNIQEDFYVTESIAPLWKNEAFHTALEYSLFIHNNCRPT FT VCEYEDIYEYDNANYENIKNRINLIDWQSLIREENNVEKAVDIFYSLLMEI FT IREEVPLKRKKLFGNSKNPIWFNRNIRNLKNRKQKAHKIYKKHGSDDNMQK FT YLDLCEQLNIAINSALEEYNRKIESDIKSCPRSFFNYTKTKLKSNNLPSKM FT QLDDKDANNPEEICHLFSTFFQEIYTTFSEEDRDQNYFSFFPAPSNDISVH FT QISVQDILSGLKGLDATKSAGPDEIPPVFMKKLAVELTSPLFWLFNMSLKS FT GKFPKIWKKSFLVPIYKSGKKTDIRNYRGIAIISCIPKLFESIVNRKLFGQ FT LRNRITNAQHGFFKGRSTSTNLLEFTNYTLNAMDNGNYVEALYTDFSKAFD FT RIDISMLLFKLEKIGFDKPSLNWIQSYLTNRQQIVRYNGKKSNPIQVTSGV FT PQGSHLGPLLFILYVNDISYILKHINILIYADDMKLFMEIKYANDIDIYLS FT EISIFDEWCSKSLLQLNVKKCNLITFSRKRNTEETTVVLGNQIVEKCDRVR FT DLGVILDSKLTFIDHYNTIIHKATNMLGFIKRFGSNFNDPYTIKTLYVAYV FT RSILEYCSIVWSPFMKTHEERLESVQKQFLLYALRKLGWTSFPLPSYESRC FT MLIDIQTLKKRREFAMVSFVNDIVSHRIDSPNLLNSLNFYTPSRQLRNRNL FT FFINHRRTNYAKFSPLNRMMSLYNQHCETIDLTMSRSNLRIYFNSVRYSSI FT " XX SQ Sequence 4532 BP; 1587 A; 818 C; 813 G; 1314 T; 0 other; ttgcatttta atcgtcaatc aaagtagacg tgttttttat tcgctgaaac ccgcgcgcga 60 ctcgattttg ttttaattat tctatttttt gcgtacgtca tgaactgcga aatttgtgct 120 ttggacgctt ctgccgactc ggtattgtgg acatgtgcgg ggtgtccacg taaattccac 180 gccgcgtgta tcggtgttac ggtgcagcgg agttcattga ggaggaaaga tagaaaattg 240 attgacttca cctcatatgt tttgccttgc tgtgagtctt gccaagaact cgtacaagca 300 aaattggaca taaaaaagct gaatgaagag cataaacttc taacagagca gctccatgct 360 aatacagaag tgttgcatcg cttcaattta aataatgaaa aaccaagcat aatcaacgaa 420 gcttttgaag gacttgagat tctgatgtct gcaataaaaa atgagctggc tattattaac 480 aagtcaagta gcttggctgg aagcgttgca actataaaaa atcacataac gacggttctt 540 gacatagttg cccagaaaaa caatgattat ttgacaaata cgttaacatc atttaagtcc 600 gatgtatcca ctgagctgcg gaacataaat gacgatctat gtcaaataaa tcagctgcaa 660 ttggacatgg ccgccacaac gtctgcaagt atgaacccaa atttatttgt ggatattgtc 720 gacgagctga aagcatggtc agcaaacata ttgtctacaa aaagcattga accgttgaca 780 catgaggcgt atcctagttt agaagttgaa atggaaaagt ccgaagatac ttcaggatgg 840 cgtttattag gtgacaaaaa aatatggaag gccgattgga cttcatatga tgcgcgcaaa 900 atgcatcgcg agcggcagca aaaactggct gaaaaggcca agcaaaaacg taaaaggcga 960 aacagattgc cacaaaacac acaacaaccc gcaaggagta aatatatgaa taaaagtcac 1020 ccgcgtccaa ccaccgttaa tggaagaaac tacaatgagc cgagctacaa cagatgttcc 1080 atcatccatc agagaaatac cagtaggaac tgttacaacc gtagcaatac taatgccgca 1140 ggagatattg acttcagaac aaatactcct cttctagaca gagaattact agcagcagcg 1200 aaggaaagat tttcgagacc acctcccact tcgtatcgtc caaccatagc attccagaga 1260 ggagaagtgc tgaacccgta ccctagcgac agacaaccga atacgccaca ccgaacaaat 1320 gccgcaccag cacgtttgaa tggaaggtgc tcctgccagt gtttttgcca gaattgacgc 1380 acttcacaga ggaaggtaat attagtatct taacacgcaa tgaatgtatt gctaataatt 1440 taggctatga cgaaggctca aatgactgcg gcaaccattc gagttgtgat aggaattgta 1500 taaatttaaa tgatgaacac ttaaatgaat ctgataatgc aaggcatgat gtagaccttc 1560 caaaagcttc cgaggctctt gtttattgtc agaatttcaa tcgtatgaga ggatcttcaa 1620 aaatcaagca aatccataaa aacatattag cgtcatcttt ttcgatcata ttaggcacag 1680 aaacaagttg ggacgaatct ataaaaagcg aagaagtttt cggatgtgat tataacgttt 1740 ttagagacga tcgcgatttt cttatgtcac aaagaaaatc aggaggaggg gtcctcattg 1800 cagtctcttc aaaatttaat tctgaaatta tttgttcttc gaaattcaaa gagtttgaac 1860 atgtttgggt gaaagtaaat atagaaaacg aaaaacatgt atttgcatca gtgtactttc 1920 cacctgatca tgccaacaaa catgcttacg actccttttt caaaaatgct gaagatatca 1980 tatctggtct tccccctgag gtgaaagtac acatatatgg tgacttcaac caacgcaatg 2040 ctgacttcat tcctgatttt gaaaatgagg gcatcatgct tccagtggtc ggggaaaatg 2100 aaacgttaca attaattttc gacaaaattg caattttggg cttgaaccaa attaatcatg 2160 taaaaaatcg gcaaaattgt taccttgact ttctattgac aaacatccaa gaagatttct 2220 acgttactga aagtattgca cccttgtgga aaaatgaagc atttcataca gcactggaat 2280 actcattatt tattcacaat aactgcagac ccacagtgtg cgaatatgag gatatctatg 2340 aatacgataa tgcaaattat gaaaacataa aaaatagaat aaacttaatt gactggcaat 2400 cactgatcag agaagaaaat aatgtcgaaa aagctgttga tatattttac agtttactaa 2460 tggaaattat tcgtgaagag gtacctctga aaaggaaaaa actttttggt aattcaaaaa 2520 atcccatctg gttcaataga aatatcagga atttaaaaaa tcgaaagcaa aaggctcaca 2580 aaatttataa aaaacatgga agtgatgata atatgcagaa atacctggac ttatgtgaac 2640 agttgaacat tgctataaat tctgcacttg aagagtataa tagaaaaatt gagagcgata 2700 taaagtcatg cccaagaagt ttctttaatt atactaaaac aaaattaaaa tcaaacaatc 2760 taccatcaaa aatgcaactt gacgataaag acgctaacaa cccggaagaa atttgccacc 2820 tcttttcaac gtttttccaa gaaatctata cgacattttc agaagaggat cgggaccaaa 2880 attatttttc attcttccct gccccctcaa atgacattag tgtgcaccag ataagtgtac 2940 aagacatctt atcaggcttg aaaggattgg atgccacaaa aagtgcagga cctgatgaaa 3000 tccctccagt atttatgaag aagcttgctg ttgaattaac atctcctttg ttttggctgt 3060 tcaatatgtc actaaaatcc ggcaaatttc ctaaaatctg gaagaaatca ttcttagtac 3120 ctatttacaa atcagggaag aaaactgata ttcgtaatta tcgtggaatt gctataattt 3180 cctgtattcc aaagctcttt gaatccattg ttaatagaaa attgtttggg caattaagga 3240 acagaattac aaatgcacaa catggatttt ttaaaggccg ttccacttct acaaatttac 3300 tagaattcac caattatact ttgaatgcta tggacaatgg taactatgta gaggctctgt 3360 atactgattt cagcaaagca tttgaccgca ttgatatctc tatgttactc ttcaagttag 3420 aaaaaatagg atttgataaa ccatctttaa actggataca atcatattta acaaatcggc 3480 aacaaatagt gagatataat ggtaaaaaat cgaatcctat ccaagttaca tcaggagttc 3540 ctcaaggctc acatctagga cctcttcttt tcatattata tgtaaatgat atttcctaca 3600 ttcttaagca tataaatatt cttatctatg ccgacgatat gaagttattc atggaaatca 3660 agtatgctaa cgacattgac atctatctga gtgaaatatc tatttttgat gaatggtgta 3720 gtaaaagcct acttcagtta aatgttaaaa aatgtaattt aattactttt agcagaaagc 3780 gaaacacaga agaaacaact gtagttttag gaaatcaaat cgttgaaaaa tgcgataggg 3840 tcagagattt aggcgtaatt ttagactcaa aactcacttt tatcgatcat tacaacacca 3900 ttatacataa agcaacaaat atgcttggtt ttatcaaacg ctttggttct aactttaatg 3960 atccatacac tattaaaact ttgtacgtcg cttacgtgag gtctatatta gagtattgta 4020 gtattgtatg gtctcctttt atgaagacgc atgaagaacg tcttgaatca gttcaaaagc 4080 aatttctgtt atatgctctt cgtaaattgg gctggacatc cttccctctt ccatcttatg 4140 aatcaagatg catgctcatt gatattcaga cattgaaaaa gcgccgcgaa ttcgctatgg 4200 tttcattcgt gaacgatatt gtttcgcatc gtattgattc tcctaaccta ttaaacagtt 4260 taaattttta tactccttcc cggcaattga ggaatcgtaa cttgtttttt attaatcacc 4320 gtcgtactaa ttatgcaaaa ttttctcctc taaaccgaat gatgagttta tataatcaac 4380 attgcgaaac aatagactta acaatgtcgc gtagtaacct aagaatatac tttaactctg 4440 taagatatag cagtatatag atttaagtgt taacatgtag tctactttga ttgacgataa 4500 taaataaata aataaataaa taaataaata aa 4532 // ID P-2_AP repbase; DNA; INV; 4014 BP. XX AC Contig16746; XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 4) XX DE P-like DNA transposon. XX KW P; DNA transposon; Transposable Element; P-2_AP. XX NM P-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4014 RA Jurka J.; RT "P-like DNA transposons from pea aphid."; RL Repbase Reports 9(8), 1798-1798 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(1370..1516,1520..1843,1847..2959) FT /product="P-2_AP_1p" FT /translation="MSIRKHIIWNDKNNKFLGYCDFGNNLDIQSNETSATE FT VLVFIAISINGKKLPIGHFFQNKISAISQTELVKTALTLTHIAGLKVWGVV FT CDGAYTNVSTMKHLGCVVDGGYEELRCWFSHPVNNQKVYYIPDACHNLKLA FT RNILGNCKFIKSNKGNVKAHILYLHSVQNDITFKFSNKISAAHIHYYNNKM FT KVKYAAQTLSPSTADALEYLNKMNVTGFADVEATFEYCRVIDRVFDFLNSK FT SSFSKGLKFPIFRNNISFLKTIIIPLIKYIYSLKFNNTPLHKSKKKKTFII FT GFAIAVISVFSIAETIFLEHYSSLNMNYILTYKFSQDHIEIFFAQIRQRYR FT SNNNPNVVQFKTALKQILFKNYIKCKSNGNCNISDDDISGGIFEFKWYRRQ FT KNIDEITCSEELDEDICNRLVLLNTINTSLAEAKNNIFYYILGYIIRGIVN FT NLSCNSCITCLFQKISDHNYSHSSVSQFLNLKNKGGLISASEDAFKIIVET FT EKLFLYYTHNLKRLHFPNLNIIILRQIINKFS" XX SQ Sequence 4014 BP; 1491 A; 545 C; 511 G; 1467 T; 0 other; caactagata gccgactcga tcttaatatt gaagcaaatc cgccatatgc aaagctggaa 60 acgtttacat tatacacctt atcataatct tatcataatt cataataacc acaaacggca 120 caaacatttc caaatttcat attatgcttt tactcgttgt ctcgttttcg tgaaaccgtt 180 attttcagta cgcattaatt ttttatgtta tgactttatg agttataaat attttaataa 240 actttaatac ctacttttta tatttattca aatatggtag catcgtgttc tgcacatggc 300 tgtgtaaaca gacaaataaa aggcattaaa aatcattttt ttacgtaagt acttcttata 360 ttaaaacaaa ttacgttgtt ccaactatta atcacagaca actgctcttt tacagatttc 420 cattacgaaa tcctgagcgc aatgctaagt ggataaaagc tgttggacga aaaggcttta 480 ttccaacaaa aaatgcacga ttttgtagta atcatttttt gaaaagtgat tttaaaactc 540 cagttggtgg tacctataaa ttgttattgt gtgatgatgc tgtaccatca atattcaaca 600 tgactgttat tcaaacacca acaacaatat atattactca gaaatttgtg aaccaacttc 660 aagtcaatta actactacag agttatctac atcagttcaa catgaaccat tatgtcttac 720 tccaaaaaga catttgtgtt ttaaaagtgc tgaagaagaa ttttcattaa tctccactcc 780 ttctaaatgt aaaattactg tatcacctta tcgaaaaaca actccttcaa aattagtaca 840 taaaaataaa ataataaagc aagctaacac aattaaatta taaaagcaac aaatcagacg 900 taaagatatt aaaattaata ctttagcagg tcttttaaag ttactaaaag acaaaaaagt 960 attaaacatg gaatctgaag aacttatgtt agacaaattt gacggactct ctcaagaatt 1020 attttccaac attcataaaa atcaaaatat cgacccaaat ggtagacgtt acagtaagac 1080 tataaaagaa tttttgctta cattaaatta ctattctcct aaagcttatg agtttagtta 1140 gttcttatta gttcttttcc taatgcattt actacagtat tttctatata atactgataa 1200 ttaagatatt tattatattt acaggtcaat attaatactt cctcatgcaa aatctataag 1260 aaactggaca tctacattaa atggcgaacc tggatttttg caagaagtac tggattctct 1320 acaaacattg aatgaaaatg ataagcattg tactattgca tttgatacca tgtctatcag 1380 aaaacatatc atttggaacg ataaaaataa caagtttcta ggatattgtg acttcggaaa 1440 caatttagat attcaaagta atgaaacttc agctacagaa gtgttagtgt ttatagctat 1500 tagtattaat ggcaaatgaa aattacctat tggtcatttt tttcaaaata aaatatctgc 1560 tattagtcag actgaattag ttaaaacagc cctaactcta acacatattg ccggtcttaa 1620 agtttggggt gtagtttgtg atggagcata taccaatgta tcaacaatga aacatttagg 1680 atgtgtggtg gatggtggtt acgaagaatt aagatgctgg ttttctcacc cagtaaataa 1740 tcaaaaagta tactatatac cagatgcctg tcataattta aaattagcga ggaatatatt 1800 aggaaattgt aaattcatca aatcgaataa aggaaatgtg aaataggcac atattttata 1860 tttacatagt gtacaaaatg acataacttt taaatttagt aataaaatta gtgcagctca 1920 tattcattat tataataata agatgaaggt aaaatacgct gctcaaacgt taagtccttc 1980 tacagctgat gctttagaat atttaaataa aatgaatgta actggttttg ctgatgttga 2040 agcaactttt gagtactgtc gagttattga cagagttttt gattttctta attcgaaaag 2100 ttcattttca aaaggtttaa aatttccaat ttttagaaat aatattagct ttcttaaaac 2160 tattataatc cccttaatta aatatattta ctctttaaaa tttaataata ctcctttaca 2220 taaatcaaaa aaaaaaaaaa cttttataat tggatttgct attgctgtta tatcagtttt 2280 cagtatagca gaaacaatat ttttagaaca ttattcatca ttaaatatga attatatttt 2340 aacatataaa ttttctcagg accatattga gatattcttt gcccaaatta gacaacgcta 2400 tagatcaaat aacaacccta atgtggtaca atttaagaca gctttaaaac agatattatt 2460 taaaaattat atcaagtgta aatctaatgg aaactgtaac atatctgatg atgatatttc 2520 aggagggatt tttgagttta aatggtacag aaggcaaaaa aatattgatg agataacgtg 2580 tagtgaagaa ttggatgagg acatttgtaa cagattagta ctactaaata ctataaatac 2640 gtcattagca gaagccaaaa ataatatttt ttattatata ctaggatata tcatcagggg 2700 aattgtaaat aatttgagtt gtaattcatg tattacatgt ttgtttcaaa aaatatctga 2760 ccataattat agtcattcat ctgtttctca atttctaaat ttaaaaaaca aagggggttt 2820 aatatcagcg tctgaagatg catttaaaat tattgtagaa actgaaaaat tatttttata 2880 ttatactcac aatttaaaaa gattacattt tcccaactta aatattataa tactacgtca 2940 aatcattaat aaattttctt gagattgcaa catttttcag gacataaatt gtgaaaatat 3000 ttccatatta gatagaccac acaaaattgt acttattact ttaattacta acaaattttt 3060 aagcatccgt ctcaaatcgt atggaaaaat gttttcttct aatatcttaa atccacatag 3120 aaagcgtcat acactaacta aacaaatgtt attttcacat caataattaa attatttatt 3180 attctatgcc ttatagttaa atgttgtata ctattctttt aattatgttt atattttgta 3240 tctttttgtt aaattagaag ttaataatgt taatttatat ttttattatt attcatgttg 3300 tacattaata cctataaata atgatgtttt cctatataat gaaataaata ttttttgtaa 3360 agtttaatag tacaaatata tataatatgt ctattgtaca agttataata ctacaatttt 3420 tgtattgaat taaataaaaa ctttgtattt gatatttgtt tttaagtatt cactttttgt 3480 tacttttctc atctcaagta ttagaaaaaa atttaaaatg ttttgttaaa cagcattatc 3540 gtttatgttt atacccaatg tcacacatga aatttttttt actgggatat ttataatagt 3600 aattacattt gaaagtcaat gttttaaata tttaattcaa ttgatgtcag ctactgatat 3660 gtattgaatt tttaaaaaac aactcatttt tttatacaaa tttttgtatt ttttttagac 3720 caaggctcat taaagttgta tgtttgatta atgattaggg cacgggttca gttgtaattt 3780 ccagaaatca atcaacatca catagttttt ggttaaaaaa taataattat attttcaaat 3840 ttactacatg ccaaaacgca aattcattat tttaatattt caaaatacaa attaagttaa 3900 cacgttatca ataaacgtta ttattacaat ttatataaac attgccagct ttgcatatgg 3960 tggcgttatt catgccctcc tgcagtgttg tataggaagt cggctatcta gttg 4014 // ID Gypsy-8_AA-LTR repbase; DNA; INV; 1583 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_AA_; KW Gypsy-8_AA-I; Gypsy-8_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1583 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 986-986 (2011). XX DR [2] (Consensus) XX SQ Sequence 1583 BP; 465 A; 305 C; 339 G; 474 T; 0 other; tgtaacctta aatatttatc cgtttccgtg atttatgttt tgtgtgtacc caaagctttt 60 catagtgaca ataagccacc tatatctgaa gtcatcaacc tctctgaaat atttcgatat 120 gagaagcttt agcaatatcg ggtttcgagc aatgtgctat tgatgcatac atttaggcga 180 attcaccaaa ccattttgaa aaatgaaagg tgtgacgttt ttcaatacag gtgaatcccc 240 attgcaatta aatgactaat ggaaaattac tgcctctctc tctacaaaac tgagtttaaa 300 aagaacacaa aagcacaata gaagatagtt attgcgcctt ggattttgag ctcaagccaa 360 ttcgactcgc gattaaacgc ccatttctgg tggagtgttc aagtcgttaa gacgtgtatt 420 tttggaaaag cttagtgaac tacaagttga ttgtgaacaa tagttagact aaattcgaag 480 taagtgcgcg atacagtgaa tttagtgagt tattaggaac gtttgaggaa gtttgaaatt 540 tgagaaagtt tagtgaatac aaatcaggta agctggtgag ccaattgacc gaagccatgt 600 gctaataatt atgaccgcgt tgaacaggcc tagcccatcc aacgtgggcg gtttatctgg 660 gattcttcct agtttcccct agtgaagccg agaccggaac cagcgttcaa atagtcgagc 720 atccgttttc ccgaatcaac cacgtgctcg acgacagtgg tgagtacggt gagccaagga 780 actcgaactc gtagcgccaa gacgtgtggt agcgagctcc gcaaacccag ctcgaatctc 840 caccaagcgt gaggacgcat tcacctcact attacccacg cacgagtagg cccatgtgac 900 accgttagca aacgcctggt tcatcggatt ggatcctggc ttcgaactgc tcgacatcgt 960 tcgacgtggg agcagcgccg cagcgaatat taaggtcaga cctttttaat taaatacgat 1020 tgcgcaacgt atgaagaagt ttttgaaagc gaatatcgat agctcgaaat gtgacgtcac 1080 acggtccgta gccagtttta tttgattcaa aatcccaata atttgaaaat tgaaatgaat 1140 cgaaatacaa attgtaaagc tcattaaatt tactatttag aaacaatatt ttttcggtcc 1200 gtagaatttt taataaagtt taataattac ttcgagtttg gtaatccact ctatcagacc 1260 gtgttcgatt acaatttctt acgaatttgt tcgagctcta attcatcgta agactgtcgc 1320 tgagtattgt agagagttta atagttcaac ataattttaa aaattccctc ctattattat 1380 tattatttgt tttttagaat tttggtttgg ttaatttgcg tcgcgtcgtt tgggtcaaag 1440 ttgttgataa gaagttccta ctgaactgtt tggcctgccc tgagcaggag ttctcatacc 1500 gggttatgaa cagaggtaac ggtccttcaa tctgaaggtg gcgctaaagc cgttagcttg 1560 gtggaacgga acggattgtt aca 1583 // ID Gypsy-20_CQ-I repbase; DNA; INV; 4326 BP. XX AC AAWU01028573; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_CQ_; KW Gypsy-20_CQ-LTR; Gypsy-20_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4326 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 419-419 (2011). XX DR Genome; AAWU01028573; Positions 3847 8172. XX CC Positions [3232-3711] - Integrase core CC 'TCCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 136..4260 FT /product="Gypsy-20_CQ-I_1p" FT /translation="MSNIPVPSPLQLGEEQEESFRTFKAHWGYYAIATDIA FT KKAAETQVAILMSVLGPKAILLLHDLGLTEEEKKSTAAILQKLELKLTPQR FT EKRTERTEFRNMKQLPEESYDDFLKRLRNKVKHCAFGAAEQKEELKEQFVK FT GIRNSELRKQLLRDEALSLEQMVDKANAEKKIDELVEAYDQLHCDPGEGGS FT KSALKVTTTKRLTKNCFFCGSLHGRKKEECPAWGKKCRKCGRMNHFESVCK FT SGPKFKKAPKLKKRQRRGTVRKVEEETTSSESDDSDIEESNFVDIFSVEKK FT SKSQMKIVVSLEVGPGYATTLKCQADTGAMVNIISLRDLHKCVKKPSMAPT FT KVKLRCFGGQVIVPVGKVELPVRLKQKREVLSFVVVKSKQKPLLSASACIT FT LGVIKVEHVGSVEVMLNSCEAIVKEFDDVFDGDGCFDGEFKIEVDESVRPV FT QQKARRIPVAYMGELKKTISDLEVRGIIEPVNKHAEWISNLVLVKRGSKLR FT LCLDPSELNSAIKRTRHQIPTVEEMLPDLQNAKVFSVLDAKNGFWHLKLDD FT SSSELTTFWTPFGTYRWMRMPFGISAAPEMFQKEQQQIICGLKGTRCIADD FT ILIYGVGDTMTEALEDHNRNLKAALVRFRERGLKINRSKMKLAMTEVPFFG FT HILTNTGVKPDPQKVRAVAEIESPKTKKELHTFLGLATYLGKFLPSLSEIC FT APLRNLVKQDVDFVWNEQAESSFEQLKQLAVTAPVLRYFDRREKLTIQCDA FT SKAGVGCVLLQNGQPVVYGSRTLTKAESNYACIERECLAIVFACKRFEQYV FT VGMPGVVVETDHKPLVEIFKKPIHTAPIRLQRMMLALKRYVVTVTYRKGSE FT MHVADLLSRTAKRASTEGVDNEFEIYAIKSTESLLEYFAEINLAECLNLSD FT RRFQEIAAETRKDPVLQKLMKVLTEGWPERKEDIDDELLVYRTMKDELTVQ FT QGVVLKADRVLIPKTLRQTFVKRLHRAHQGVEYTLRAARESMFWPGMSDQV FT TNAVQSCEACMEFSSSQCAPPMSTHEIPKYPFQRVHLDLCEMSMEGCKTTM FT LVTADSYSDFVEVDILKSTSTATIVECCKRNFARHGVPEVVVSDNGPQFDN FT ASFRKFAQDWEFVHSTSSPYHQSGNGKAESAVKQVKRLFKKTTRAGEDFWQ FT ALLQQRNTPNKIGSSPTQRLLGRSTRNSIPVVTSKLKHIFQPTEADKIVEH FT RKKVKKCYDRSTKPLPELQVGKEVVFQRRPDTDKRWEKAVVLEKHPDKSVQ FT LRAADGAVLRRSAVHVKPFGRAVSGECRLEHRRETSPEQPNITERDIESGG FT VLERRSLGQQKQREGNSQPVGTELDRGSPTGQESMASTSQRVADTQRPKRE FT IKRPKRLEDYVV" XX SQ Sequence 4326 BP; 1263 A; 922 C; 1279 G; 862 T; 0 other; tggtgtcaga agtgacttaa atttcgcgaa aaaaaacgaa tttcttcgtc gcgaaaacgc 60 gtcgcgcgat aaaacaacaa gaaatcgaat cgaatcccag tgaaagttgg tgcgaaacgg 120 ccgaggttca gcaagatgag caacattccg gtcccgtcgc cgctgcagct cggagaggag 180 caggaggagt ccttcaggac gtttaaagcg cattggggat attatgcgat tgccactgat 240 attgccaaga aggccgcgga gacacaggtt gccatcctga tgtccgttct gggcccaaaa 300 gcgattttgc tgctccatga cctgggcctg acagaggagg aaaagaagtc gacggccgcc 360 atcttgcaaa agttggagct gaagttgaca ccgcagcgag agaagcgaac ggagcgcacc 420 gagtttcgta acatgaagca gctgcccgaa gaaagttacg acgatttctt gaaacggttg 480 agaaacaaag tgaagcactg cgcgttcgga gcagcagagc agaaagaaga gttgaaggaa 540 caattcgtga agggaatccg taacagtgag ctccggaagc agctgctgag agatgaagct 600 ctctctctgg aacaaatggt ggacaaggcc aacgcagaga agaagatcga cgagttagtg 660 gaggcctacg accagctcca ctgcgacccg ggagagggag gctcgaagtc tgcgctgaaa 720 gtgacgacaa caaaacggtt gacgaagaac tgcttcttct gtggaagtct gcacggcagg 780 aagaaggaag aatgtccagc ctggggcaag aagtgcagaa agtgtggaag aatgaaccat 840 tttgagagtg tgtgtaaatc tggcccgaag ttcaagaagg cacccaagct gaagaaacgg 900 caaagaagag gcactgtccg aaaggtggaa gaagaaacaa cttccagtga gagcgacgat 960 tcggacattg aagaatcgaa ttttgtggac atcttcagtg tggagaagaa atcgaaaagt 1020 cagatgaaga tcgtcgtctc gctggaagtt gggccgggtt acgcaacgac gctcaagtgt 1080 caggcggaca cgggggctat ggtgaacatc attagcttga gagacctcca caagtgcgtg 1140 aaaaagccgt cgatggcacc gacgaaagtg aagctgcgct gtttcggggg gcaagtgatc 1200 gtgcctgttg gcaaagtgga acttccagtg cggttgaagc aaaaacgaga agtgcttagc 1260 tttgtggtgg tgaaatcgaa gcagaagcct ctgctgtcgg cgtcggcctg cataaccttg 1320 ggagtgatca aagtggagca tgttggctca gttgaagtga tgctgaactc gtgcgaggcg 1380 atcgtcaagg agtttgatga cgtgtttgac ggagatggct gcttcgacgg tgagtttaaa 1440 atagaagtgg atgaaagcgt tcgcccagtg cagcaaaagg cgaggcgcat tccggtggcg 1500 tacatgggag agctgaaaaa gactataagt gacctagaag tgcgcggcat tattgaaccg 1560 gtgaacaaac atgcggagtg gatcagcaac ctagtgttag tgaagcgcgg cagtaagctg 1620 cgattatgtc tggatccttc ggaactgaac tcggcgatca agcgtaccag acaccaaatc 1680 ccaacggtcg aagaaatgtt gccagatctg cagaacgcaa aagttttctc tgtcctggat 1740 gcgaaaaacg ggttttggca tttgaagtta gatgacagca gctcggaact gaccacgttc 1800 tggacacctt ttggaaccta cagatggatg cggatgccat tcggcatttc agcagccccc 1860 gaaatgttcc aaaaggaaca acagcagatc atttgtggct tgaaagggac gcgctgcatc 1920 gctgacgata tactgatata cggcgtaggc gacaccatga cagaggcatt agaggatcac 1980 aacaggaatc tgaaagctgc attggttcgg ttccgagaaa gaggattgaa gatcaacaga 2040 tccaagatga agctagcgat gaccgaagtg ccgttttttg gacacattct gacgaacaca 2100 ggtgtgaaac cggatccgca gaaagtcaga gcagttgcgg aaatagagtc accgaaaacc 2160 aagaaggagc tccacacgtt cttgggattg gccacgtatc tgggaaagtt tctaccgtcg 2220 ctgtcagaaa tttgtgctcc actgagaaac ttggtcaagc aggatgtaga ttttgtgtgg 2280 aacgagcagg cggagagctc attcgaacag ctgaaacagt tggcagttac ggcacctgtt 2340 ctgcgctact tcgaccggag ggaaaagctg accatccagt gtgatgcgag caaggcaggg 2400 gtcggttgtg ttctactaca aaacggacag ccagtagttt acggatcaag gacgctgaca 2460 aaggccgaaa gcaattatgc gtgtattgag cgagaatgtc tggcgatagt gtttgcatgc 2520 aaacggttcg agcaatacgt ggtgggaatg ccaggtgtgg tcgtggagac ggatcacaag 2580 ccgctcgtcg agatattcaa gaagcccata cacacagccc ccatcagact gcagcgaatg 2640 atgttggcgt tgaaacggta tgttgtaact gttacgtacc ggaaaggttc cgaaatgcac 2700 gtggcagatc tgttgtccag aacggcgaaa cgagcatcaa ctgaaggagt agacaacgag 2760 tttgagatct atgccatcaa aagcacggag agcctgcttg aatacttcgc ggagatcaac 2820 ctcgcggagt gcctcaacct gtcagatcgt cgttttcaag aaattgctgc ggagacaaga 2880 aaggatccgg ttctacagaa gctgatgaag gtgctgacgg aaggttggcc cgaacggaaa 2940 gaggacatcg acgacgagct gcttgtgtat cgcacaatga aggatgaact gacggtccag 3000 caaggtgtgg tactaaaggc tgatcgggta ctcatcccaa agactttacg ccagacgttc 3060 gtgaaacgtc tacatagagc tcatcaagga gtagaataca cgctcagagc cgcccgagag 3120 tcaatgtttt ggccaggaat gagtgatcaa gtgacgaatg ctgttcaaag ttgcgaagcc 3180 tgcatggagt tcagttcaag ccaatgtgca cccccaatga gcacacatga gataccaaag 3240 tacccattcc agcgtgtgca tttggacctg tgcgaaatga gcatggaagg ttgcaaaact 3300 acaatgctgg tgacggctga cagttattct gacttcgtgg aggtcgacat actgaagagc 3360 acgagcacag caaccatagt agagtgctgt aagcggaact tcgcgagaca tggcgtcccg 3420 gaagttgttg tcagcgataa cgggccgcag tttgacaacg ccagtttcag gaaatttgca 3480 caggattggg agtttgtaca cagcacttct tcaccgtatc accagagcgg aaacgggaaa 3540 gcagaatctg ctgtgaagca ggtcaagcgg ttgttcaaga agactacgcg ggcaggagaa 3600 gacttctggc aagcgttgtt acagcaaagg aatacaccaa acaaaattgg aagcagccct 3660 acgcagcgcc ttcttggaag gagcactcgg aacagcattc cggtggtgac aagtaagcta 3720 aaacacattt tccaaccaac agaagcggac aaaattgtgg aacatcgcaa gaaagtgaag 3780 aagtgctatg acagatcaac gaagccgctc ccggaacttc aagttggaaa ggaagtagtg 3840 tttcaacgac ggcccgatac agacaagcgg tgggagaaag cggtggtgtt ggagaagcat 3900 ccggacaagt cggtgcaact gagagctgca gatggagctg tgctacggag aagcgctgtt 3960 cacgtgaagc ctttcgggag agctgtgagc ggtgaatgta gattggagca ccggcgggaa 4020 acaagtcccg agcaacccaa catcacggaa cgtgacatag agtctggtgg agtattggag 4080 aggcgctcac tcggacagca gaagcaacgt gagggcaaca gccagcctgt tggcaccgag 4140 ctcgatcggg gatcgccaac cggacaggaa agtatggctt cgacgtcaca aagagtagcg 4200 gatacccaac gtccgaagag ggagatcaag agacccaaga ggctcgagga ttacgtagtt 4260 taagtttttt tttgtttgcg acacacccct ctagtttgaa ttcaacattt tttatgaaag 4320 gggaga 4326 // ID I-68_AAe repbase; DNA; INV; 6040 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-68_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6040 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1339-1339 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 359..1663 FT /product="I-68_AAe_1p" FT /translation="MAGANHAPSRGDPGDPGGSLRGKRQPDWMLSEDEMGQ FT TMVLLLRRKPNEANTQNSQQSPLPNPFIIYASIELAVGVQDAAKIGMTKEG FT RGTRYILRTNLKSVYRKLTNMTQLTDGTVVEVISHPTMNKVQGVVYESDST FT DVDEVALLNYLTPQGVQAVRRITKKVNGSVRNTPLVVLSFQGTLLPEFIYF FT GGLRVPVRTYYPAPTVCFQCGTYGHPRKFCQQPRICLQCSQAPHTSEGEQC FT SNAPYCMHCTGTHSPTSRTCPKYKDESNVIRIRIDRNVSTAEARKIYASET FT QKETIASEVQKRLREEESNKDKIIAELRAEVETLKRKLEAIINGQRLKSRD FT SSQIEKKSVFLQXNSSCSNQNTQRMSRKDKTFVSPPSLHSDIRRTADHAMD FT EAIRTRSRSRKHLMEISPTDVTHKSGKRASLIADSGGTSEK" FT CDS 1663..5784 FT /product="I-68_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTKMFPPKTHHKMTDPAMHNDSISDDLRHATDAISER FT PCDPSDYDLDTNSHDSIIPSRKILNSPSQPVDTTSRFGGGECVVPPPLLIK FT GASLGGEVIQDAILLPVDHQTPSSAASSNNSTLPKIPSDSSAYDQLYNFYP FT QPSTSNNFVTQNIDSPSARRTTSTLTRDVPSLLDEPLAAVDVARPSSRASI FT PVLSLARHESHDTSHHVSETPPVRESSPTSNCSSTSETEVNNKIIMLQWNI FT RGLWKNHPELSHIVNQNAPICIGLQEMMTKKVSTALSSRYDWELSSRHYSQ FT GGGAAGLGVLKEFPHQFYHFVTCIPVCAARIQSPLNISIVSIYVPPNTCDQ FT DLITTLDSIIENVDPPFVIGGDFNAAHEAWGSMKTTKRGCLLLEWFVEHQL FT IVINNGEPTFISSAHGTTSAIDLTVVSCSLAAKLRWSIYKDTFGSDHFPIR FT VTSDQNTPSQRSRKRWIYKDADWISFENVILESLSSHSDIDVEQLRNLIIG FT AAEKSIPXSSGVSKGRSEIWWTNEVKEKVKARRKALRKLKRLNPSDPLHDD FT AKKAFQQARAIARKTIESAKRDSWNNFCQMFNPNTSSDVLWSNFNRLMGKR FT RNGQRGLQIDNSFTQDTSVITEHFANYFYQASSGNSSTDQESGETGWDQNI FT QVSENTRLDTDFTIYELLRAIDVTNGHSTGEDQVGYLMIKHLPYAGKLAML FT KSFNKVWEKGEFPNSWKQGLVIPIPKSNRQAQHAEGYRPITLLSCIGKIYE FT RMVNRRLMTFLEENNLLNEHQHAFRAGRGTSSYFADLGEILAEIEQSNAHA FT EFALLDINKAYDQVWRTHIMRQIDSLNTGKRIRSCISNFLLDRRFRVSYGG FT SLSKDRIQENGVPQGSVLAVTLFLIAINPVFDVIPKNIRVLVYADDIVLVA FT ISKHLPKVRNRLIEAVDAVNTWAKSVKFRLSAPKSCILHMCQRSRHRWRCA FT RNVVKIDNEIVPEVKAARFLGIWINRKGSFLVHGAKIKEALFSRINCLKAL FT AFKADRNILWRIGNAVCVSKLTYGIELFGLNLLNTFQSTYNEILRITSGAL FT RSSPTLSLAVEAGQLPLHLRLVDTLARRYIRLAEKPFFNHQHLRKCAIEAF FT ADETGHDIPEIAKLLWIGKIPWDMHQVKIDWSVRSVFKKGNNKNIARACVK FT ELQNKKYEEYCQMYTDGSKSGTEVGLGVVCQELQIEAGLLAQCSVYSAEAA FT ALAVAVQTADNRATVIFSDSASCMDAIKKGDSKHPFIQSILKEASLKDITF FT CWIPGHSDIPGNEAADQAAERGRTAPPLHNVAPASDVIKWINKRLHDKFQR FT MWNDHRATFLQKIKPTVGKWIDHQDRQEQRALTRCRIGHTRLTKKHLFDRN FT QSATCEVCQGS" XX SQ Sequence 6040 BP; 1870 A; 1458 C; 1290 G; 1419 T; 3 other; acttgagttc gggacacagt cgtcattacc acgcgtccgt gttactctaa attgcggtga 60 tmtttcccag tgaaataagc cgaatcaatc tccggaaatt tgctaacata gcaagttagt 120 agtgctgatt tagccgggag tacaagaaat ccagtgaagt tattaaaata gctgaaattg 180 ccgccgatat atctgcttac ggctgctaaa gataacaaca agtgttatag catcaccgct 240 ccattgtttt gtcgtaccag ccagtgctac ctatagcaat agcgatagcc cgattaagtg 300 gagagactcg acagcagcca ataaagtgtt ttgattgtcg ttactacatc ccggcctgat 360 ggccggggca aatcatgccc cgtctagggg tgatcctggc gatccaggtg ggtcgttgag 420 gggaaaacgc caaccagact ggatgctcag tgaggacgaa atggggcaaa cgatggtgtt 480 gcttctacgc agaaaaccaa acgaagcaaa tacgcaaaat agccaacaat cgccactacc 540 aaaccctttc atcatttacg catccattga actagcggtc ggagtgcaag atgcagcgaa 600 gataggaatg accaaggaag gccgcggaac taggtacata cttcgtacca atttgaaatc 660 agtctatcgt aagctgacta acatgaccca gctcactgac ggaaccgtcg tcgaagttat 720 ttcccatcct acgatgaaca aggtgcaggg tgtcgtttac gaatctgatt ctactgatgt 780 cgacgaggta gcacttctca actatctgac gccgcaagga gtacaagctg tacgcagaat 840 cacgaaaaaa gtcaacgggt cggttcgaaa cactcctcta gtcgtattgt cattccaagg 900 tactctcctg ccggaattca tctactttgg cggactccgc gtccccgtac gcacctacta 960 tcccgctccg actgtttgtt tccagtgtgg cacctatgga catcctcgca aattctgtca 1020 acagcctaga atttgcctgc aatgctccca agcaccacac acatcagaag gggagcaatg 1080 cagcaacgct ccgtattgta tgcattgtac aggcactcac tcaccaacat cacgtacgtg 1140 tccaaaatac aaggatgaat ctaacgtgat ccgaattcga atcgatagga atgtgtcaac 1200 tgctgaagct agaaaaatct acgctagtga aacacagaag gaaacgatcg ccagcgaggt 1260 gcaaaaacgg ctaagagaag aggaatcgaa caaagataaa attattgcgg aactgcgggc 1320 tgaagtcgaa acattaaagc gaaaacttga agccatcatc aacggtcaac gtctaaaatc 1380 acgcgacagc tcccaaatag aaaaaaaatc ggtgttcctc caascaaact catcatgttc 1440 caaccagaac actcagcgaa tgtcacgtaa agacaaaacg ttcgtatctc ctccgtctct 1500 ccacagcgat attagaagga ccgcggatca tgcaatggac gaagcgatac gaacacgcag 1560 taggagcaga aaacacctaa tggaaatatc accaactgat gtcactcata aatcaggcaa 1620 aagggcatcc ttgatcgccg actctggagg cacctctgaa aaatgaccaa gatgttccct 1680 ccaaaaacac atcacaaaat gactgaccca gctatgcata atgactcgat atcggatgac 1740 ctacgacatg caaccgacgc tatctctgaa cgaccctgcg acccttcgga ctacgatttg 1800 gatactaact cacacgactc cataattccg tctagaaaaa tactcaactc tccatctcaa 1860 cctgtggata caacatcgcg gtttggaggt ggggaatgtg tagtaccccc tccactcctg 1920 ataaagggag cgagcctggg tggagaggtg atccaggacg ccatccttct acctgttgac 1980 caccaaaccc ccagctcggc cgccagctcg aacaactcaa cgctgccgaa aataccgtcc 2040 gattccagcg catacgatca gctctacaac ttctaccccc agccatctac gtccaacaat 2100 ttcgtcaccc aaaacattga ttcgccaagt gcgaggcgca ccacgtcgac cctgaccaga 2160 gatgttccga gcttgctcga tgagcctctg gcggcagtcg acgtggctcg cccttcatct 2220 cgggcaagta ttcctgtatt atcattagct agacatgaat cccacgatac ttctcaccat 2280 gtttcagaaa ccccgcctgt tcgcgaaagc tcccctacaa gcaactgctc ttcgacttcc 2340 gaaactgaag tgaacaacaa gatcatcatg ctccaatgga acatcagagg cctttggaaa 2400 aatcacccag aactctctca tatcgtcaac caaaatgcgc caatctgcat cggccttcag 2460 gaaatgatga caaagaaggt tagcacagca ctcagttccc gctacgactg ggaattatcc 2520 agccgtcact attctcaagg tggtggagca gctggattgg gggttctcaa agaattccct 2580 caccaattct atcactttgt tacatgcatc cctgtatgtg ccgcccgaat acaatccccg 2640 ctgaacatat cgattgtcag tatctatgtg cctccaaaca catgtgatca agatcttatt 2700 acgacgcttg acagcataat tgaaaacgta gatccacctt tcgtcatcgg cggggacttt 2760 aacgccgccc acgaagcttg gggtagtatg aaaactacta aacgaggatg tcttctactc 2820 gaatggttcg tcgaacatca acttatcgta ataaataatg gagaaccaac tttcatcagc 2880 tctgctcatg gaacaacttc ggcaatcgat ctaactgtcg tgtcatgtag ccttgcagca 2940 aaacttcgat ggtcaatcta caaggatacc ttcggaagcg accacttccc catcagagtc 3000 acctcggatc aaaatacacc gtctcaacgt tcacgtaaac ggtggattta taaagatgct 3060 gactggatct cttttgaaaa tgttatcctt gaatcgcttt cgtcgcattc agatatcgat 3120 gtcgagcagc taagaaatct catcatcgga gcggcagaaa agtccatccc amaatcttcc 3180 ggagtatcaa aaggaagatc tgaaatttgg tggacaaacg aagtcaaaga aaaagttaag 3240 gcacggcgta aggctcttcg caaactgaaa cgtctgaatc cttcagaccc tcttcacgat 3300 gacgctaaaa aagctttcca gcaagctcgg gcgattgcaa ggaaaacaat cgaatctgcc 3360 aaacgtgatt cgtggaataa tttttgccag atgttcaatc ctaacacttc atccgatgtg 3420 ctttggagta attttaaccg attgatggga aaaagacgaa acggacagcg aggactccaa 3480 atcgacaact cgtttaccca agataccagt gtaatcacag agcacttcgc aaattatttt 3540 taccaagcat catcaggaaa ctctagtaca gatcaggaat ctggagaaac aggatgggat 3600 caaaatattc aggtttctga aaacactcga ctcgatacag atttcacaat ctacgagttg 3660 ttacgagcca ttgatgtcac taatggacac tctaccggcg aagatcaagt cggatatcta 3720 atgatcaaac atctgccata tgcagggaag ctcgcaatgc tgaagagttt caataaagtt 3780 tgggagaagg gcgaatttcc gaacagctgg aaacaaggac tagttatacc aatcccaaaa 3840 tccaatagac aggctcagca tgcggaaggg taccggccaa ttacgcttct gagttgtatc 3900 gggaaaatat acgagcgtat ggttaatcgt cgtctaatga ccttcctcga agaaaacaac 3960 ttactcaatg aacatcaaca tgcatttcga gccggacgtg ggacatcatc ctacttcgct 4020 gatttgggcg aaattctagc agaaatcgaa caatccaacg cgcacgcaga gtttgcattg 4080 ctcgatatca ataaggcata tgatcaagtt tggcgaactc atataatgag acaaatcgat 4140 agtttgaaca ctggaaaacg cattcggagc tgcatcagca attttctctt ggatcgacgt 4200 ttcagggttt cctacggtgg ctctttgtcg aaagatcgga tccaagaaaa tggcgtcccc 4260 caaggatctg tgttagctgt caccttgttt ctgatagcta tcaacccagt atttgacgtc 4320 attccgaaaa atatacgagt gctagtatat gcggacgaca tagtccttgt cgctatttct 4380 aagcaccttc caaaagtgag aaacagacta attgaagcag tggatgctgt caatacgtgg 4440 gcaaaaagcg tcaaattccg cctgtctgcc ccaaaatcct gtatccttca tatgtgtcaa 4500 agatcaaggc acaggtggcg ctgtgctcga aatgttgtga agatcgataa cgaaatcgtt 4560 cctgaagtca aagccgcacg atttctcggg atatggatca acaggaaagg aagttttctt 4620 gtgcacgggg caaaaattaa agaagctctc ttcagccgaa tcaactgtct gaaagctttg 4680 gcttttaaag cggatcgaaa cattctatgg agaattggga atgccgtttg cgtttccaaa 4740 cttacgtatg gtattgaact gttcggcctc aacctcctca acacgttcca atcaacctac 4800 aatgaaattc ttcgaataac atcgggagca ctaagatcgt cgcccacgct tagtttagct 4860 gtcgaagccg gccagctacc attgcatctc aggctagtgg atactctcgc cagaagatat 4920 attcgattag cagaaaaacc atttttcaac catcagcatc ttcgtaaatg tgcaattgaa 4980 gcattcgccg atgagacagg acacgacatt ccagagatcg caaagctcct gtggataggg 5040 aaaatcccct gggacatgca tcaagtcaaa attgactggt ccgtgagatc agtcttcaag 5100 aaagggaata acaagaatat tgccagagcc tgtgttaaag aacttcagaa caaaaagtat 5160 gaggaatatt gtcaaatgta cactgatgga tcgaagtcgg gtactgaagt agggctaggc 5220 gtagtttgtc aagaactcca aatagaagcg ggcctccttg ctcaatgcag tgtttattct 5280 gctgaagctg ctgcactcgc tgtggctgtc caaactgcgg ataacagggc cactgtgatc 5340 ttcagtgact cagcgagctg tatggatgca ataaagaaag gagattcgaa acacccgttc 5400 atccaatcaa tcttaaaaga agcgagcctt aaggatatca ccttctgctg gattccaggt 5460 cactcggaca tacccggaaa cgaagcagct gaccaagcag ctgaaagggg cagaaccgcc 5520 cctccactac ataatgttgc acctgctagt gatgttatca aatggatcaa caaacgatta 5580 cacgataaat ttcaacgcat gtggaacgat catcgagcaa cttttcttca aaagatcaaa 5640 ccaactgttg gaaagtggat agatcatcaa gaccgccaag agcaaagagc tctaactcga 5700 tgcagaattg ggcacaccag gctcactaaa aagcatttat tcgaccgaaa tcaatcagca 5760 acatgtgagg tttgccaagg gagctgaccg ttgaacacat aatcgttact tgcagaaaat 5820 acgatgacat tcgcaacaaa actggcatta atagcaatat tcaaatagct ttgggtaaca 5880 acaaaacaga agaagctaaa ttattgaaat tcctgaagaa aagtaattta tttgaaatgc 5940 tgtaaactct gtacggaatt gtagtattga ttttttttct aagaggcgaa tgaacccttc 6000 tggtttaaaa cctctctaat aaacgcaaaa aaaaaaaaaa 6040 // ID Gypsy-137_AA-I repbase; DNA; INV; 5500 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-137_AA_; KW Gypsy-137_AA-LTR; Gypsy-137_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5500 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1007-1007 (2011). XX DR [2] (Consensus) XX CC Positions [4178-4648] - Integrase core CC 'GTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1052..3907,3911..4852) FT /product="Gypsy-137_AA-I_1p" FT /translation="MSLTTTTEPYVPGTIPFSQYLEQLEFLFEHNNYSADK FT YKISFLAVCGTEVYNQVKLLFPGQNVRDLTYKQITDELKKRYDKKDNDVIH FT TYKFWTRRQGQHEKSEDFVLSVKQLAELCGFGDFKDRAIRDALVIGTYDRQ FT LQKRLFDEDDLTAAKAEKLIVNQELSTDRTRFVNRDDDKRVSVVARLGRRL FT DRGPSRQSFRSRSRSFDRNRSFYNRSRSGSRRNEIHDPDKVFTCSFCHKTG FT HTKKFCFRLKRKSPKKTPRKSKPFVKFIGSPKPLPSQTSGLFKRLKKDLAS FT DSEDEDLPCMMINARSKVNEPCYVEALVQKTRLTMEIDCGSAESVISEALF FT LRNFNKCSIENSRQRLYVIDGNRLSILGKARVSVRLNGITEELYLIVLQGD FT KDFVPLMGRSWLDIFYSGWRDTFSRPVVPNRRVNAMTDVDARDEAVEEIKS FT KFPELFDKKLLNPIVGFEGDLILKDDTPVFKKAYEVPLRLRDKVVEHLDAL FT EKDGVITPIEASEWASPVVVVIKKDKGIRLVIDCKVSINKLIIPNKYPLPL FT PQDLFAALSGSKVFCSLDLAGAYSQLLLSKRSKKFMVINTIKGLYVYNRLP FT QGASSSAAIFQKVMDQVLKGLENVFCYLDDVLIAGKDFDDCRKKLFLVLER FT LARVNIKVNFKKCKFFVDNLPYLGHVLTDAGLLPCPDKVQTIREARAPQNV FT TELKAFLGLITYYSKFIPNLSSRIRVLYGLLKKNVRFDWNADCDLVFNQCK FT QFLLKPNLLEYFDPDKPVVVVTDACNYGLGGVIAHVVDGEEKPISFTSFSL FT NDAQKKYPILRLEALAVVSTVKKFHKFLYGKSFTIFTDHKPLIGIFGKEGR FT NSLSVTRLQRYVIDLSIYDYDIVYRPSAKMGNADFCSRFPISQEVPKELDR FT DYIKSLNFTGDFPLNYKAIAAETEKDEFLLRIMEYLRKGWPDRLEKRFGVY FT SHHQGLEIVDGCLLFKDRVVIPDCMKREVLKLLHRNHSGMHKIKQLARRTV FT YWFGMNGDIEQFVRSCRVCQETTALSKRAPYTPWIPTKKPFSRIHADFFFF FT EKKVFLVIVDSYTKWVEVEQMKTGTDSKKVIKVSLGVFPRFGLPDVLVTDG FT GPPFNSEHFVNFFKEQGVVVMKSPAYHPESNGQAERMVRLVKDVLKKFLLD FT PETRKLDMEEQISYFLMNYRNICLASDGAFPSERLLSYRPKTTLDLINPKH FT SFKYNLTEPDDGDQRACSSKTDKTHDLLTNLRNGDLVFYKNPETKLTLDNG FT YLQNI" XX SQ Sequence 5500 BP; 1501 A; 1047 C; 1293 G; 1658 T; 1 other; gtggcgacga ggtgaatttt gttttccgac actccgtgca tttcacgacc gctggcgctt 60 gtgcttttgg cgtggatttt tcggcgtaag ttggattttg gtgaaaaatc taaaaccgtg 120 aaaaccaaaa ggttttcttt tctattattc gccaattgtt attgcgtgca aaagaagttc 180 ttaaaatctc gcagttttta gtgccggtaa aggtttcagt gaagtgattt ttcttccagt 240 gttgtggttt agcgtggcgt gaaatatcgc ttttaatcgt gaaattgccg ctatccattt 300 tttattttca acggaagaag cgtacttttg ttataagtac cattatttca ataagcaatt 360 gaaataagag gctttttatt gtcctccatt atcagctctc tttgctttgt taaattccta 420 agaccatttt cttttcatcc acagcacaat aaaagaacgc cgtttgcaat cagtgtactg 480 gtcaactgtg gcagtgcggc ggtaaaattg tctgtcagtc aacgaagagg aaaaagccga 540 aaaacgtcag attgttttcg atttgtgtgc gagtgaagtc ggtgcgagaa aactgcatca 600 aaactgcata ttttttgttg ttgctgctgg acgttgaccc tgtggttgct gctgttggtg 660 cattttctgg tggacgtgag gtgcaggaag agttcatcga cctcttggcc ggttgatgac 720 ctaccattgg cgtattggag gaatcatttt gaggagctgt tgttgctgtt ggcttttgac 780 ccgagccttt gtgaagttgc tgctgttgct gtttggcttt tgacccgtgt gccattgaag 840 agtttgctgt tgttgctggg ctgttgtttt cgggccgttt ttattgcagt gttttgccgt 900 ttttgcattg cggccatagc aggtatgtca ttttgtttgt ttttattatc gttcattttt 960 tctgtgaatg ctaaagtgat ttttgatcga cgaagacatt tcggggaaac taattagtgc 1020 tgcttgattt tctttgcttt ttgtctgtag gatgtctttg actacgacta ctgagcccta 1080 tgtgccaggt acgatcccat tttcccagta tttagagcag cttgagtttt tatttgagca 1140 caataactac tcagctgata aatataaaat ttcctttttg gccgtttgtg gcaccgaggt 1200 ttataaccaa gtgaagcttt tatttccggg tcagaatgtt agagatttga cgtataagca 1260 gattactgac gagcttaaga agaggtacga taaaaaagat aacgacgtga ttcatacgta 1320 caaattttgg acccgtagac agggacagca tgagaaatct gaagatttcg tgctctccgt 1380 aaagcaatta gctgaattat gtggcttcgg ggattttaag gatcgtgcca tccgtgatgc 1440 acttgtaata ggcacttatg atcggcagtt acagaagcgg ttgtttgacg aggacgatct 1500 aactgctgcc aaggcggaaa aattgatcgt taaccaagaa ctgtctactg acagaactcg 1560 tttcgtgaac cgcgatgatg ataagagggt cagtgttgta gccaggttgg gtaggcgttt 1620 ggatcgtggt ccttccaggc aatccttccg tagcaggagt aggagttttg acagaaaccg 1680 ttctttttat aaccgaagca ggagcggtag taggagaaat gagattcatg atccggacaa 1740 agtttttacg tgttcctttt gtcacaaaac aggacacact aagaaattct gctttaggtt 1800 gaagaggaag agtccaaaga agactcctag gaagagtaaa ccttttgtaa agtttattgg 1860 ttctcctaaa cctctgcctt cccaaacttc ggggcttttt aaacggttga agaaggattt 1920 ggcttccgat tccgaagatg aagatttacc atgcatgatg ataaatgcca gaagtaaagt 1980 gaatgaacca tgttatgtcg aggcattagt tcagaaaacg cggttgacta tggagatcga 2040 ctgtggatcc gctgaaagcg tgatatccga ggctcttttc ttgcgaaatt tcaacaaatg 2100 ttccatcgag aatagtcgcc agagactata cgttatcgat ggtaacagac tcagtattct 2160 gggaaaagcg agggtctcgg tacggcttaa cggtatcacg gaggagcttt atctgattgt 2220 gctgcaggga gacaaggact ttgtgccgct aatgggtcgc agctggttgg atatcttcta 2280 cagtggatgg agagacacct tttctcgacc ggtggtgcca aatcgacgcg tgaatgccat 2340 gacggatgtt gatgcgaggg acgaagcagt cgaagaaatt aaaagtaagt tcccagaact 2400 ttttgacaag aaactattga atccgatcgt tggttttgag ggcgatttga tactcaaaga 2460 tgatacccct gtttttaaga aggcgtacga ggtaccttta aggttgagag ataaggttgt 2520 cgagcatctt gacgctttag agaaggatgg tgtaattacg ccgatagagg ctagcgaatg 2580 ggcttcgccg gtggtagtgg tgatcaaaaa agataaagga ataagacttg tgatagactg 2640 taaagtctcg atcaataaat taataatccc taataaatac ccactaccat taccacaaga 2700 tttgtttgcc gcactttctg gctctaaagt tttctgctct ttggatcttg ctggagcata 2760 ttcccagctt ttgctttcaa agcgatccaa aaagttcatg gttatcaata ccattaaagg 2820 cctctacgtt tataaccggt tgccgcaagg cgcatcgtcg agtgccgcaa tattccagaa 2880 agtgatggat caggtgttga aagggttaga aaatgtattc tgttatttgg acgacgttct 2940 tattgccggt aaggattttg atgactgcag aaagaaactt tttttggtcc tagagaggct 3000 tgctagggtc aatataaaag tgaactttaa gaaatgcaag ttttttgtcg ataatttgcc 3060 ctacctgggt catgttttga ccgatgcagg tcttcttcca tgcccggata aagtacagac 3120 tattcgcgaa gctcgcgctc cgcaaaatgt tactgaactt aaggcttttt taggtttaat 3180 tacctattat tcgaaattca ttccaaattt gtcctcccgg attagagttc tttatggact 3240 tttgaaaaag aacgttcggt tcgattggaa tgctgattgc gacttggttt ttaaccaatg 3300 caaacagttt ttattgaaac caaatcttct agagtacttt gaccctgaca aacctgtggt 3360 tgtggtaaca gacgcttgta attatggcct cgggggagta attgcccacg tagttgatgg 3420 agaagagaaa cctatctctt tcacttcatt ttctctaaat gacgcccaga aaaagtaccc 3480 cattcttcgt ctggaggcgc tggctgtcgt tagtacggtg aaaaagtttc acaaattcct 3540 gtatgggaag agttttacaa tcttcaccga ccacaagcca ctgattggaa tctttggaaa 3600 agaaggaagg aattctcttt cagtcacacg acttcagcgc tatgtaatag atctttcaat 3660 ctatgactat gatatagtct acagaccgtc agcaaaaatg ggaaacgcgg acttttgttc 3720 ccgatttcct atttctcagg aagttcctaa ggagctagac cgagattaca tcaaaagcct 3780 gaactttact ggagactttc cgttaaacta taaggcgata gccgctgaaa cggaaaaaga 3840 cgagtttctt ttgaggatta tggagtattt acggaaaggt tggccggacc gcttggagaa 3900 acgattccas ggcgtttact cgcatcacca gggccttgaa atagtggacg gctgcttgct 3960 cttcaaagat agagtcgtca ttccggattg catgaagaga gaggttttga agttgctcca 4020 tagaaatcac tccgggatgc ataagattaa gcaactcgcc agacggaccg tgtattggtt 4080 cggaatgaac ggagatattg agcagtttgt ccgatcttgc cgagtatgcc aggaaactac 4140 tgctttgtct aaacgtgctc cgtacacacc atggattcct acgaagaaac ctttcagccg 4200 catacatgct gattttttct tttttgagaa aaaggtgttc ttggtaattg tagatagcta 4260 caccaaatgg gtcgaggttg agcagatgaa aaccggaact gacagtaaaa aggtgatcaa 4320 agtttccctc ggcgtttttc ctaggtttgg tttaccggac gtcctagtaa ctgacggagg 4380 tccgccgttt aattctgagc acttcgtcaa cttttttaag gaacagggtg tagttgtcat 4440 gaaaagtcca gcttatcatc cggaaagcaa cggccaggca gagcgaatgg tgcgcttagt 4500 taaggatgtc ctgaaaaagt tcctgctcga cccagaaacc aggaaattgg atatggaaga 4560 acaaatttca tattttctta tgaattacag gaacatttgc ttagcttctg acggtgcctt 4620 tccctctgaa cgccttctct cctacagacc aaaaacaacg ctggatttaa ttaacccgaa 4680 gcacagtttt aagtacaatt tgacagaacc ggatgatggt gaccagcgtg catgttcctc 4740 taaaactgac aaaactcatg acctgttaac taacttgcgt aatggagatt tagtatttta 4800 caaaaaccca gaaacaaaac tgacattaga caatggttac ctgcaaaata tctaatgcac 4860 atctctccca acgttttaca gatttcactg agcggccgga tcgtttcggc gcataaacgc 4920 cagctaaaga tccctcaaga ctctcgccgt ccacatgccc gtcgttttgt tttgaatgga 4980 gagagtccga cacagccctc aacagagcca agcaacaaca ttgagtcacg caacagcaac 5040 aaacgaaaaa gagatgagga agacttgtct gaggattccg attcagactt ttacggcttc 5100 gctgccgact cctttttatt cgaggagttt tcccggttgg atcggcaatc ctatcagaat 5160 gttgaatcgt tggaccctgc aacggcctct aatcagcagg ctatcgaagc tttccctgag 5220 gatgttcgac ccacttcaag tgctacagtg aataggcata gtaagagaaa gaacaagaag 5280 cgaagattac aagattttgt ttattactga gttcattcag ttttcgattg cttaagcatt 5340 aatttttaaa aatcgtttgt tgacattttg agcattaaat ttcaatttgt gaatttgatc 5400 ttgcattctg aaagttgtaa aatagcaagt aaggttgaaa atgaatttga ttatgcgttt 5460 ataagactgt agcatgtttc tactttcagg gggaaggagc 5500 // ID Gypsy-23_CQ-I repbase; DNA; INV; 4313 BP. XX AC AAWU01011157; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_CQ_; KW Gypsy-23_CQ-LTR; Gypsy-23_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 425-425 (2011). XX DR GenBank; AAWU01011157; Positions 21236 16924. XX CC Positions [3414-3893] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1476..4301 FT /product="Gypsy-23_CQ-I_1p" FT /translation="MKPGIKTIIEENKIKIFRTPIFKTEKPGNEDSKIKKP FT LDPLILENIGNVRDDQLAKFQNVYQLNYSEFKRPLTQEIKYEMKITLKTGT FT PFHYKPRRLSHSEKEKVNTKIDELLKSGIIKESNSPFASPIVLIPKKDGDI FT RMCVDYRKLNKDTLRDNYPLPLIEDLLELLRNKKLFSILDLKSGFHQIKIE FT PESTKYTSFVTPNGQYEYKCVPFGLCNAPAVFQRYVNKIFKPLIDAGKLCV FT YIDDILIFSETYEDHLTTLEKVFQILSKNLLELNISKCKFFQNEIDYLGYT FT INSKGRRPNKSHIEAVSNFPEPKNIKDVQRFLGLTSYFRKFIQGFATIARP FT LYDLLKKNSQFKFDDVEKEAFEKLKNKLITAPILAIYNPKAETQLHCDASS FT YGFGAILLQKQNDNNFHPISFFSKKTDEFEKKLHSFELETLAVVYAVKRFH FT VYLSGIEFKILTDCNALAQTLEKKEINPKIGRWALFLDKYHKTIDYRSGHR FT MQHVDALSRQNMSEGEDQNKINQMICVLNIQDIEQNIILAQEQDEKIKNIK FT RHLEINTFPGFELQNGILFRKHEQIRLLVIPKAMIDNILRICHDQNGHIGV FT EKTMIEIKKHYWFSNMKKIVKKYILNCLSCIFYSPLDGKKEGFLKSIDKGN FT EPFNTIHMDHYGPIKLGSSSNYKYILVIIDGFTKFVKFYATKTTNTDEVIK FT CLQLYMNYYSKPMRIITDRGTCFTSRKFENFLECHCVEHIKTASYTPEANG FT QAERVNRTLTPMLAKLIHDSNSKWDTLLTKIEYIYNNTFNRSIKNFPSMLL FT FGKKQANMSLNQNNIETFINNYQEIFSKENLETIREKANRNIENLQNYSKS FT SVDKKRKNITMYKEGDFIVLKKIATHKLAEKFQGPYVIKKVLPNDRFLITD FT IEGFQVSSLPFESVCSPNNMKKWLSSDYCVDEDIAVSG" XX SQ Sequence 4313 BP; 1613 A; 613 C; 815 G; 1272 T; 0 other; gacgtcttta aaagttcaga agtgggatag aagtcatgtt cctgcgaagc tataagccat 60 cccaagaaga acagtcagag gatatcggag aagaatttcc tcctctgact aatcaagatt 120 gcggagatca agctgaagtt ggttctggcc ggtcaagaag ttgtgaagaa gtttttgttg 180 gacaagctac tggttcgtct cgaagtcgga ccgagcaagt tttgtcaaga agcaaagaaa 240 cggttacgtc tggccaaggt gcaagcgcgc aaggtcaagg catgtctgga agtcaagtta 300 cgctttcttc tggtcaaggt gcagctagaa gccctggaat ttctgtctct ggcgctgtaa 360 ttcctgtccc tggcgctcat tttcctgttc cagacgcgag aaactgtaca gtgaagctgg 420 atgaaatttc aagcattctc ccgtgtttct ccgggaaccc tggggaaaat gtggaccact 480 tcatcaacag catcaagtac gctaaacgga tatttgcagt taacgaagag gtcatgaagt 540 tggtgatttt gaagcagctt aaagacagtg cgcaaaagtg gttcatgacg caagagttgc 600 tgttaaagcc atttgaagaa gttttgaaaa ttttgaaaat cacttttaca tcggtcgaaa 660 gcaagttcgt ggtgaggaag aaaatggaaa atcgaagatg gatgcccaat gaaaaatttc 720 tggactaagc caatgacaaa tgcctgcttg ctgcatcgct caaccttgaa gaaatcgagc 780 tcgttgagta tattattgaa ggaattcaag atgaatcact tcaaaatcaa gcaaggatca 840 accgttttca aactgttgct ggattggtga gagcgtttgg tttggtgcag ttgaagccgg 900 tgttcaagcc caagatttgc tacaactgta atcaaccagg acacgttgca gtcaattgtg 960 gatttttcgc tggtaatcaa aggggacaac aaaaaggggg ggactttgga ccgaagcaat 1020 ttggaggatc gtcaaatcgc gtgcctagag gacaacctgg aggagcgttt ggaggatcgt 1080 cagcggtaac aacgtctgga ggaccgtcaa cccgaggacc agcctggatt gcagcagtgg 1140 aggatgagaa tgatagggct gacttggtag gacagacgga aggatctgat ggactggtag 1200 atataaaatt caaaaattca ttaaaaactt taaaagcttt gtttgataca ggaagtccgg 1260 ttagtttagt acgacattca aaagttttag attttgaaat tttgaggttt gaacaaaaca 1320 aaatttttaa aggattagga ggaaacaaac taaatatttt aggcaaaatt atttcagatg 1380 tatcttttca acatttttgt ttcagaactg aatttttggt tgttgctgac acggatttgt 1440 cttcattcga tgcaataatt ggaagagata taattatgaa gcctggaata aaaacgatta 1500 ttgaagaaaa taagatcaaa atttttagaa cacccatttt caaaactgag aaacctggaa 1560 acgaagattc aaaaatcaaa aaacctttgg atccattaat tttggaaaat ataggtaatg 1620 taagggatga tcaattggca aaatttcaaa atgtgtatca attaaattat tcagaattta 1680 aaagaccatt aactcaggaa attaaatacg aaatgaaaat aacactaaaa actggaacac 1740 catttcatta caaaccgaga agactttcac actcggaaaa agaaaaagtc aacacaaaaa 1800 tagatgaact tttgaaatca ggcataatta aagaaagtaa ttctcctttt gcgagtccaa 1860 tagttctaat tccaaaaaaa gatggtgata taagaatgtg tgtagattac agaaaattaa 1920 acaaagacac tttgagagat aattatcctt taccattgat tgaagatttg ctggaattgt 1980 tgaggaacaa gaaattattt tcaattttgg atttgaaatc tggatttcat caaattaaaa 2040 ttgaacctga aagtacaaaa tatacatcat ttgtaactcc aaatgggcag tatgaatata 2100 aatgtgttcc atttggatta tgtaatgctc cagcagtttt tcaaagatat gttaacaaaa 2160 ttttcaaacc tcttatagat gcaggaaaat tatgtgttta tattgatgat attttaatat 2220 tttcagaaac atatgaagat catttaacaa cattagaaaa agttttccaa attttaagca 2280 aaaatttatt agaattaaat atatccaaat gtaaattttt ccaaaacgaa attgattatt 2340 taggatatac aataaactca aaaggtagaa ggccaaataa atcgcacatt gaagccgttt 2400 ccaattttcc agaaccaaaa aatataaaag atgttcaaag gtttttagga ttaacaagct 2460 attttagaaa atttattcaa ggatttgcaa cgattgcaag accactttac gatttattaa 2520 agaaaaattc acaatttaaa tttgatgatg tcgaaaaaga agcatttgaa aaattgaaaa 2580 ataaactgat aactgcacca attttagcta tttacaaccc aaaagcagaa acacagcttc 2640 attgtgatgc aagttcttat ggttttgggg caattttatt gcaaaagcaa aatgataata 2700 atttccatcc tataagtttt ttcagcaaaa aaactgatga atttgagaaa aaattgcaca 2760 gttttgaatt agaaacatta gctgtagttt atgctgtcaa gagatttcat gtttatttat 2820 ctggcattga atttaaaatt ttaactgatt gtaatgcttt ggctcaaaca ttagagaaaa 2880 aagaaattaa tccgaaaatt ggaaggtggg cattattttt agacaaatat cataaaacaa 2940 ttgattatag aagtggacat agaatgcagc atgtagatgc tttgagcagg caaaatatga 3000 gtgaaggtga agatcaaaat aaaatcaatc agatgatttg tgtgttaaat attcaagata 3060 tagaacaaaa tattattttg gcacaagaac aagatgaaaa aattaaaaat atcaagagac 3120 atttggaaat caatacattt ccaggttttg agttacaaaa tggaatttta tttagaaaac 3180 atgaacaaat tcgtttgctt gttataccaa aagctatgat tgataatatt ttaagaattt 3240 gtcatgatca aaatgggcat atcggtgtag aaaaaacaat gattgaaatt aagaaacatt 3300 attggtttag taatatgaag aaaattgtga aaaagtatat attgaattgt ttatcttgta 3360 ttttttacag ccctcttgat gggaaaaaag aaggatttct taaaagtatt gacaaaggta 3420 atgaaccttt caatacaatt cacatggacc attatggacc aataaaatta ggctcgtcta 3480 gtaattataa atatatttta gtaatcattg atggttttac aaaatttgtt aagttttatg 3540 caactaaaac gactaatact gatgaagtta ttaaatgttt acagttatac atgaattact 3600 atagcaaacc aatgagaatt ataactgata gaggaacctg ttttacctcg agaaagtttg 3660 aaaatttttt agagtgtcat tgtgttgaac atattaaaac tgcatcatac actccagaag 3720 caaacggaca agcagagcga gtaaatagaa ctttaactcc aatgcttgca aaactcatac 3780 atgattctaa ttcaaaatgg gatactttgc ttactaaaat tgaatacatt tataacaaca 3840 cattcaatag aagtatcaaa aattttccaa gtatgcttct ctttgggaaa aaacaagcga 3900 acatgagttt aaatcaaaat aatattgaaa cgtttataaa taattatcaa gaaatctttt 3960 ccaaagaaaa tttagaaacc attcgagaaa aggcaaatag aaatatagaa aatcttcaaa 4020 attacagcaa atcttcagta gacaaaaagc gaaaaaacat aacgatgtac aaagaaggag 4080 atttcattgt tttgaagaaa attgcaacac ataaattagc agaaaaattt cagggtccat 4140 atgttataaa aaaagttttg ccaaatgatc gtttccttat aacagatatt gaagggttcc 4200 aagtttccag tcttccattt gaaagtgtgt gttctccaaa taatatgaaa aaatggcttt 4260 cttcagatta ttgcgtcgat gaggacatcg ccgtgtcagg atgaccgaaa tgt 4313 // ID Chapaev-N4_AAe repbase; DNA; INV; 2032 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 23-DEC-2010 (Rel. 16.03, Last updated, Version -1) XX DE A non-autonomous Chapaev DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; nonautonomous; KW Chapaev-N4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2032 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 834-834 (2011). XX DR [2] (Consensus) XX CC >95% identical to consensus. 3-5 bp TSDs. TIRs are ~140 bp CC long. XX SQ Sequence 2032 BP; 674 A; 356 C; 344 G; 657 T; 1 other; cacggtgtct ttgtggagca actttattac aaaaaaatag gtaagtctga aatttcaaac 60 taaaaacaat tagactttkc tctttcattt gcgaccaaaa tcaaaaaaat cggtcggggg 120 gtccagaaca ttttttttta ttttgtgtaa agtgtttatg gatatgtgtt gttgtaccga 180 cgcacattgc gtggaacgta tggaggtacg ggacaaactg caagatcaaa ccatttgaat 240 accttggcat ggaaattaaa gttgttcttg gtcaaaagag ctgtaataag cataaatgac 300 atatgatata cgcataatcg caaaactgtc tcaagcatca ttttaattat tttccacgaa 360 aacccttgaa aatcgtacct aaattggtca taatgatgta agaagttatt aggaaatatt 420 tctcgcataa cttatgatct cgtcctaagg tgaagatgca tcgaagtcaa acctcgaatt 480 ttcaagagca caaatctaga gaaccagaca accttttacg tcgaacattt tattgattgg 540 tcgccaccag cgagtgacca gtgagttacc ggttgtctgc ttctctagac ttgtgctttt 600 gaaaattcga ggtttggctt ctatgtatct tcaccctaaa aaccattggt aactgagtgt 660 gtagctcgtt gttattaagc agataaacgg accaaaataa aaactgtcac cgctgggaga 720 cagaggagga tcttctcttc atcgtagcat tgtgaaataa agtgtaaatc cattcgtttg 780 ctcagtaata acaagctaca cacttggtca cctatggctt ttaggacgag atcatacatt 840 atgcgaattt tctcttttta ctcgtagata caaaagtgaa ataccggaat cttttcaaat 900 gtattacata cttgcacata aattacaaat cttgataatt tatttttttt attggaaata 960 gttgctttat ttgttattct aataggtgga tacttataga tggataacag ttttataaaa 1020 tgaccggtcg catttgaaaa aaagcgctga aatctcgact aatatccatt ctccagtatt 1080 cctagtatac ttaaatattt acttacagat atacatacgt acattcattt atatatacat 1140 ccaaacatat gtacattagg gcgattcaaa tttgaaaatt gtttgaaaat ccaatctccc 1200 atatgctcct tagcatcctt accataaaaa tagtgttctg tgaaattttc agctttctag 1260 gtggtgattt aaaggtggcc caaagacaat gtaggtctat atggaaatta ctatggagaa 1320 tttttgagaa atgttccaaa cacgctatta ctgtaatgta agaaaaaact tatcatccca 1380 tattgaaaac tcattcttca aaccttaatg aaggatgttg ctgaagaaac aaatcccttt 1440 gagctaattt tgtttgggat ttttgtacat gtttgcaggg ctataatccc atattaagct 1500 aaataatagc aaacaaactc atgaatatct cttctgctat ccgtcggatt gaatttctct 1560 cttgagcaaa tttctttgtt atggttagaa gaatgagttt ttacaatggg ataataagtt 1620 tgtacttaca ttacagtaca agcgtgtttg gaacatatct caaaatttct tcatagtaat 1680 ttccatataa acctacattg tctttgggcc acctttaaat caccacctag aaagctgaaa 1740 atttcacaga acactattta tatagtaagg atgacaagga acatatggga gattggattc 1800 tcaaacattt ttaaattttg aaccacccca atgtacataa acacacggat acgcaaatac 1860 aaacatatgt gcaagtctct cgaaatcaaa caaaaaaaaa ttactctaga ccccccgacc 1920 gattattttg gtttttgcag caaatgaacg cgagagacct agactttatt gtggcgaagt 1980 ttcagactta cctatttttt tgtaataaag ttgttctacg aaacacaccg tg 2032 // ID Penelope-7_HM repbase; DNA; INV; 3340 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3340 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2097-2097 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(229..882,792..2816) FT /product="Penelope-7_HM_1p" FT /translation="MKFKNQHQRYFIKSLEKHEYLYYKHSRCTNLLNIYHD FT CLQQEPLFIPKKFRSDNFHFMSYDEKNAIKKFELQRLKSECEILTIRKDNF FT LSSLKNLEDEVTSFINNHASSEEVKEKLVFRYEECKKEDIDRIDKKWAKKL FT ASTKTAYEKDRNNQNSKNLQNNEQEDNHQNHNIDCSTSSSSTNINNNHNQT FT SDIITSKNWSSPIQKKYNLRSSTYQLNTSNIRHYNFKKLVKPNPKKIQPKV FT INLSTKHLTTSQISLLTKGPKFCPTTKGNILHIDSDIKQFTRKLILKEKFH FT GTENTDISIIKKKSNYNIKTTLPYLDEIISKIEQIKPIKMQSVDNITQQER FT IALKDLKEMKNIVIKKADKGNTFIVMDSTYYNEKLVHQDHLLSANTYERTK FT PESDKIVFRDLIKMIGKYNFLTKNEINYITNFKWKSSSFYITPKIHKCQEI FT IDKVNASNSLYIEMKPPNDLKGRPIIAGINSPTSHLSQFLHIILSPIVTKQ FT KTFIKDDWDFLRKIPRALNSDSCILTCDIVNLYTSIPHNLGIEALKYWIEK FT HKSLIDKRFTPEFIIESTYFILKNNNFLFNDQMFRQIIGTAMGTNFAPDYA FT CLTIGYLEETRLFPKILPKYFSSKDVNDIQEFYYRYIDDGFLIWPKHLDIV FT KFKNSLTEIDNSISFTTDMGKEFIIDGNTYNIINFLDISIILENNRKIKTD FT IYYKETNTHDYLNYNSHHPYHIKKNIPYGLAKRIIVFTSDYNKEKQRLAEL FT KTWLKECEYPNNIIEKAFHNAKLQGPAPQNQSKEIFPLVTTFYSNLNCQPF FT ITETNLLLNLSSNERVKEVFSNVQPVIAYKQPPNLLRTLTSSKIQSTTKEV FT GLFKCNDNRCKICKFYLQEVKSVYDIKWYYMEHKKPYYL*" XX SQ Sequence 3340 BP; 1393 A; 522 C; 413 G; 1009 T; 3 other; aaacatattt caagttatat tgttaaaatg tcagctgawa aacaattttc agattattca 60 gataagcttt atgaagttat ttttaataaa ctgaaagaca ctttacattt aattattagt 120 gatataaagc cattaattag agaaactata gaatcaacac ttaaagaatg tttatctata 180 taccagcaac ataacactac cttacatgac aatgtagaaa aggaatacat gaagtttaaa 240 aatcaacatc aaaggtattt tataaaatcg ttagaaaaac atgaatatct ctactacaaa 300 catagtagat gcacaaacct tcttaatatt tatcatgact gcttgcaaca agaacctttg 360 tttattccta aaaagtttag aagtgacaac ttccatttta tgtcatatga cgaaaaaaat 420 gcgataaaaa aatttgagtt acagcgcctt aaatctgaat gtgagattct tacaataaga 480 aaagataatt ttctatcgtc attaaaaaac cttgaggatg aagtaactag ttttattaat 540 aaccatgcgt ctagcgaaga agttaaagaa aaattagtat ttcgatacga ggaatgtaaa 600 aaagaggata tcgatcgcat tgacaaaaaa tgggccaaaa aattagcttc tactaaaaca 660 gcgtatgaaa aagacagaaa taatcaaaat tcaaaaaatt tgcaaaataa cgaacaagag 720 gataaccacc aaaaccataa tatcgattgc tcaacttcct ctagctcaac caatataaat 780 aataatcata atcaaacatc agacattata acttcaaaaa actggtcaag cccaatccaa 840 aaaaaataca acctaaggtc atcaacttat caactaaaca cctgacaaca tctcaaatat 900 cacttcttac aaaaggtcca aagttttgcc ctactactaa aggaaatatt ttacacattg 960 attctgatat aaaacaattc acaaggaaac taatacttaa agaaaaattt catggaacgg 1020 agaatacaga cataagcata attaaaaaaa agtcaaatta taatatcaaa acaactctac 1080 catatcttga tgaaatcatt tcaaaaattg aacaaattaa acctatcaaa atgcagtcag 1140 ttgataacat tactcaacaa gaaaggatag cattaaaaga tytaaaagaa atgaaaaata 1200 tagttattaa aaaagcagac aarggaaaca cttttattgt aatggattca acttattata 1260 atgaaaaact agttcatcaa gaccacctgc tctccgcaaa tacttatgaa aggacaaaac 1320 ctgaatcaga taaaatagtt tttagagatc taataaaaat gataggaaaa tataattttc 1380 taacaaaaaa tgaaataaat tacatcacaa attttaaatg gaaatctagt agtttctata 1440 taactccgaa aattcataaa tgtcaagaaa ttatagacaa agtaaacgca tctaattcac 1500 tttatatcga aatgaaacca ccaaatgatc tcaaaggtag accaattatt gctggaatta 1560 actcaccaac atcacattta agccagttct tacatataat tttgtctcca attgtaacaa 1620 aacaaaaaac tttcattaaa gatgattggg atttcttaag aaagatacca agagcgttaa 1680 attcggatag ttgtattttg acttgtgaca ttgttaatct atatactagt attccacaca 1740 atctaggtat agaagcatta aagtactgga ttgaaaaaca taaaagttta atagataaaa 1800 gatttactcc tgaatttatc attgaatcaa catactttat tttaaaaaac aacaactttc 1860 tattcaacga ccaaatgttt cggcaaatta ttggaacagc catgggtacg aatttcgctc 1920 cagattatgc atgcctaaca attggatatt tagaagaaac aagacttttt cctaagattt 1980 tacctaaata cttttccagt aaagatgtaa atgacataca ggaattttat tatagatata 2040 ttgatgatgg tttcttaatc tggccaaaac atttagatat cgttaagttt aaaaactctt 2100 tgaccgaaat agataattca atcagtttta caacagacat gggcaaagaa tttataatag 2160 atggaaatac ttataatatt attaatttct tagacatatc tatcattctt gaaaacaaca 2220 gaaaaataaa aacggatatt tattataaag aaaccaatac tcacgactac ttaaactaca 2280 acagccatca tccttatcat ataaaaaaga acatccctta tggccttgct aaaagaataa 2340 ttgttttcac atctgattac aataaagaaa aacaacgttt agcagaacta aaaacgtggc 2400 taaaagaatg cgaatatcct aataatatta ttgaaaaagc atttcacaac gcaaaactcc 2460 aaggaccagc tccccaaaat caatcaaaag aaatttttcc tttagttaca actttttaca 2520 gcaatctgaa ctgccaacct tttattactg aaacaaatct tttgcttaat ctatcttcaa 2580 atgaaagagt gaaagaagtt ttttctaatg tccaaccagt aattgcgtat aaacaaccac 2640 ccaatctact gagaacactt acttcttcaa aaattcaatc aaccactaaa gaagtaggat 2700 tatttaaatg caatgataat cgttgcaaaa tatgtaaatt ctaccttcaa gaagttaaat 2760 cagtttacga catcaaatgg tactatatgg aacataaaaa gccatattac ttgtaatagt 2820 aagaatgtaa tatattattt aaaatgtttg tcatgtaaca agaaagctac atacaccgga 2880 aaaacaaata acctacgact tcgcactaat gggcacattt ctagttgtag aactgggcaa 2940 tcaacagata ggtttgataa ccatgtatac aattgtcaac gtgcgcataa ctccgttata 3000 gaaccttttt tccagctttt tgtgtttgtg gagctaaaaa atgaaaaaaa cctaatttct 3060 tatgagaagc attttcacaa ccaaaaacat gatactttta attgtgtgta cattacaata 3120 tagattacaa tagttgctaa cagcaacatt aaataattat catttcctgt ttttttttgt 3180 aaaaaaaaaa ttatattttg ttgttataat aacaaaaaca gcactactga tgattttttg 3240 aaaggcctct agtgctaaaa tttataatta gagtaatttt tgtaaaatta aaagaagaaa 3300 atatttcgtt tattgtttat tttgttgact aattttttta 3340 // ID DNA-4_AAe repbase; DNA; INV; 3055 BP. XX AC . XX DT 11-APR-2011 (Rel. 16.04, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3055 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1258-1258 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. TIRs are ~130 bp long. It includes CC a partial sequence of RTE non-LTR retrotransposon (849-1695). CC Likely CC a composite non-autonomous transposon. XX SQ Sequence 3055 BP; 970 A; 557 C; 619 G; 909 T; 0 other; caaaggagat tgggcaaaaa tgtatggagt ttggtcccgg aatgtacacg gatgacttat 60 atatcatttg aaagatctaa atgagacaaa aaaaagttga tatggggcca aaaatattcg 120 tcgtgggaga aaaaagttat gtgtgctgga atggtgtaag aagaggtatg ttttggacag 180 ttcctattga ataatattta cttgcgcaaa tgttttgaat tcgtgcctat gtatgtgctt 240 aggtatgttt ttatcttaat tgtaaatatt ttaaaaatgt tgtgatttga cacagatttc 300 agaacgtttt gagttgctga aatgtgaata cttaaaacat acggtgataa aaaaatataa 360 agatataggt atgagcgttt cacaaacacg tcactttagg taacagttgg acacaaatat 420 caaatcgtat tatcctattt cagtgttcta gcaatatgtt tgtataaacc ggcggtttac 480 tcaggaagga ttttccagac tatatccgtt ttatcattac agtggttgcg ttgttgtatt 540 ctgtactaca ctggaagaaa tacctcgctt gtcgtacaca gcttcagcgt gctggaaatc 600 cgcgcgtcag ttacctcact aatatctttg atatcaataa cattatacta gcatgctatt 660 gtttctcaga gaaggagatc atttgaacta gtagtaactg tagtgtttga ctgtttgctg 720 tgaacaaact gccaataatt cgagcagaac catttcggtt tgctatcaat gtcagctcca 780 ttggcgacgc ctgagcacat gtggctccac aaatgcttcg ctaagaacgc taaatactca 840 tgaggatgtc aataatgtga atgtttagaa gtaatcatac catcttccag agctgcgcga 900 cacacttgag ctgggaacag ctttcatcgt gatgaacgac atgcaaaaac atgagatcgg 960 atggtggtcg atcaacaaaa gagtgtgcat gttgaggatc aacggtcgat caacggatga 1020 caaagacgca gctggaacgc aagtaggacc gccacccatg tttccacgtt aagatcattg 1080 taggagatta gaacgccaag ataaggaatt gagatggacg attgggaaga tcagtgctgt 1140 cggactggca aacatgaatg tccaatgctt gactgttttg ccgcctctaa gaatatggtc 1200 attatggtct agcacctact tccaagataa ccttccgtac caatggtgat cgctagttat 1260 ccacagcaaa tgaaatctcg aatcggtcag gtattgcttg atagattgca catctccgac 1320 ataatcaacg tcaaaatcta tcgtggtgct aacattcact ctacccacta cctgataatt 1380 ataaaaatgc gcttcaaact tcctgtcgtt aacaatgaac ggtaccgacg cccgcctcag 1440 catgaccttc agcgtctgaa ttacacggat gtcgccacag catcgttcta catctctagc 1500 aagcgtaacc ggaataggtt ttgcttgctg aaacccttct tggctattgt actacagtaa 1560 aaatagccat caactcacag gtggagtcat tgtgcgttga acgtagtcta tggtacgatt 1620 ggtttgctga agagtttaga attattctag aggaaaaaaa cgtgaagcac catacacaga 1680 agcggaagca gcagacttgt gttgatgaca caggaccgga ggttcaaggc aaaggagagc 1740 atttcaacgt tcgaatgttg gacaaaagca gccaaccaga tcccactttg agtgaagtta 1800 aggatgacgt tcaacagctc aagaataaca cagcaattgt tgaggatggt atatgagtgg 1860 aactcatcaa gatgagcccg gaaaagttga tcgcttgtct gcatcggctg atagtaatta 1920 ctattcttaa gaacctggca gttataacat ctgcaagaag agtagagcca aggggttaaa 1980 tactctgcct acagaaaacg ttagtgctta tctatctgtc ttttcttcta tctgtaaaac 2040 ttgcataaaa attgtcaagg gcttgtacac aagttactgg agggtaacag acggagagag 2100 ataggagata atataagata tgcactcatt agaaaaaaat aagttcattg gactccacgg 2160 cattcgtata ccaattcgta aaacgataat gattgacatt ttaacttaag actggttgaa 2220 taatgtagga gctcaggtga tactaaacat ccacgtatgc ggtatacagc gcgagatggt 2280 ttatactagc gagctagatt gccattgatt taaacacatg aaatcctaaa actacgcata 2340 tgttactgcg tagcatttag caatgagcat ttcaactcac atttaacgtt ttttgatgct 2400 gatgatgttg ttcattgagg ttgagtgggt tgagctgatg gttttattga tctcttcgta 2460 aaattcgtcc tggagaaata tttacttcag caaaaacaat acagtcagtt cttttataag 2520 acaccacaca cttttctttc accactgttt cccaattact tttcatcgaa tgaggctgac 2580 tttttttaga atacaagtcg taatattacc ttattacgaa caaaaaatat cgtataggtc 2640 ggttaaaaaa ccgttaggaa ttgtgattag tataatgtgg atagcagggt ctggcagaaa 2700 actttactga ataaacatat gtattttaaa atgattttta atgttattac ggtctaacat 2760 gtaatgtttt tcatctttga atcaaagtcc taattcactt ctttaaaaca tagaacatct 2820 aatcatcata taggtgcaga aaaatatatg ttttcatttt taatcgataa aaaatatagc 2880 aaaaatataa taaaatacat agaaaatcca tctcaaacca tgcggaataa cttttttctc 2940 ccacgacgga tatttttgac cccataccaa ctttttcttg tctcattcag atctttcaaa 3000 tgacatatga gtcatctgtg tacattccgg gcccagattt gcccaatctc ctttg 3055 // ID hAT-8_AP repbase; DNA; INV; 2875 BP. XX AC Contig1007; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-8_AP. XX NM hAT-8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2875 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(8), 1792-1792 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(250..2025,2278..2613) FT /product="hAT-8_AP_1p" FT /translation="MSAKLRSTKNGIFLVGDVQHQINGSKLPSNGQVLAVL FT FYNIREVKLTINESANLAVRECVIFWEKARIPTKSLPNCVKKLVDLHQVWQ FT GLQKNAKKTQDVFKQRQQAFTDSLNNLFDIAHADALQLIKIEEDKIFLKHQ FT REPGRVGYLGGVDKKLSDKEKRAQLRATEEENRRMKYLSASNSLVSNEPVE FT DDLFSISDENIDSPDNQQTTIENTQPETGAILRVPKKNFITSKLVAALDRC FT QLSMRDSVFILQATIEALGYNTDEYPISKSSIQRIRTVKRKERAEAIKIDF FT KNEVPDVVTVHWDGKLLPALNARNCKEERLPIVISFKNKEQLIAVPKLDGS FT TGREQAQAVWNAIIDWDLEEKVQILCCDTTASNTGRLNGACILLEQKLDRE FT LLIFACRHHIYELVLKSVFEIKISQVTTSPDIPLFKKFRENWKNVDPNMIQ FT GYREKVEKFFTTSEIEALLTFYHAELTKKIVRDDYRELIELSVIFLNGDPD FT RKLKIRPPGAMHQARWMARAIYSLKICLLSSQSKLTSKDKSSLLDVCIFIV FT TCYVKPWLQCIIPIKSPNQDMCFLKSLKTYENVEKKHFKSGSTQIIFFIFS FT EKNIYDFISAKSKYLFTRLKIDDNFLLESPSSWQTNTSFLKAKNAVSTLAV FT VNDTAERGVKLMQDFHGLITVEEEQKQFLLRCVQEHRSIFPDCNKKTLKRK FT YVQ*" XX SQ Sequence 2875 BP; 1008 A; 481 C; 525 G; 861 T; 0 other; gggtgtccca gaaaattcaa tgtaattttt ttcatatgca taccctctta atttttttgt 60 attgtatcaa aacactcata aatgaaattt catgaacttt tgaccacgat aagccgtgcc 120 gactcgagta gtaacctgtc gttgcctgtc attacatgca ttatagacgt gcgcgcttat 180 gaacgcagtt atgtgcgtat aatttccggt tgttcatgtt agtaagcata ttgtttttgg 240 cgaatcaaca tgtcagccaa attacgcagt actaaaaacg gtatattttt agtaggagac 300 gtacaacacc aaataaacgg gtcaaaactt ccctctaatg gacaagtact tgcagtattg 360 ttttataata tccgtgaagt aaaattaact attaatgaaa gcgcgaatct cgctgttcgt 420 gagtgtgtta tattttggga aaaggcgcga atcccaacca aatcgttacc taattgtgta 480 aaaaaacttg ttgacttaca tcaagtttgg caaggcttac aaaaaaatgc aaaaaaaaca 540 caagacgttt tcaaacagcg tcaacaagca tttacagaca gtttaaataa tttgtttgat 600 attgcacacg ctgatgcctt acaattaatc aaaatagagg aagataaaat atttttaaag 660 catcagagag agcctggtcg agttggttat ttgggtggag tggataagaa actgtcagac 720 aaagaaaaaa gggcgcaact aagagcaacc gaagaagaaa atcggcgaat gaaatatctt 780 tctgcttcaa attctctggt gtcgaatgaa cctgtagaag acgatttatt ttccatttct 840 gatgaaaata tagattcacc agataatcag caaactacta tcgagaacac acaacctgag 900 actggagcga ttctaagagt accaaaaaag aattttatta cttcaaaatt agtagctgca 960 ttagacaggt gtcagttgag tatgcgagac tctgtgttta ttcttcaagc tactattgag 1020 gcgcttgggt ataatacaga tgaatatccg ataagtaaat cttccatcca acgaatccgt 1080 acggtaaaac gaaaagaacg agcggaagct ataaagattg attttaaaaa cgaagtacca 1140 gacgttgtca ctgttcattg ggacggaaaa ttgttgcctg cactaaatgc tcgaaattgc 1200 aaagaagaac gtcttcctat agttatttca tttaaaaata aagaacaact tattgctgtc 1260 ccaaaattgg acggttctac aggaagagaa caggcacagg ctgtttggaa tgcaatcata 1320 gactgggatc tcgaagaaaa agtacagatt ctttgttgtg atacgacagc ttcaaataca 1380 ggtcgtctca atggcgcttg tattctccta gagcaaaaac tagacaggga attgcttatt 1440 tttgcttgtc gccatcacat atatgagctg gtcctaaaat ccgtttttga aataaaaata 1500 agtcaagtga ccacaagtcc tgatattccg ctcttcaaga agttcagaga aaattggaaa 1560 aacgtcgatc ccaatatgat acaaggatat agagaaaaag tggaaaaatt ttttacaact 1620 tctgaaatag aagcactgct tacattttat catgctgaat tgacaaaaaa aattgtcaga 1680 gatgactatc gagagttgat agaactttca gttatatttt taaatggaga tcccgatcgg 1740 aaattaaaaa tcaggcctcc gggtgccatg caccaagctc gatggatggc gagagcaatc 1800 tattccctaa aaatatgttt gctaagttct cagtcaaaat taaccagtaa ggacaagtca 1860 tctcttctag atgtctgcat atttattgta acttgctatg tcaagccttg gcttcaatgt 1920 attataccaa taaaatcacc gaatcaagat atgtgttttc taaagtcttt aaaaacatac 1980 gagaacgttg aaaaaaaaca tttcaaaagc ggctctacac aaatttagcc aacatttatg 2040 gtatttgact gaagaagcag ctatcctggc actatttgac gatgaggtaa acgaagcaac 2100 aaaaataaag attgtggaaa atttaacaaa tgaaaatata ttgaccactg ggaaacgata 2160 catcccatca aaggacgaac tttgcggttc attgtacggt tagtatgaca tataaatatt 2220 aatataataa tatgttagtt gtttttatta tagatatctt cagttatttc taactaaata 2280 ttttttatct tttcagaaaa aaatatttat gactttatat cggccaagag caagtatttg 2340 ttcactcgtc tcaaaattga tgataatttt ctcctcgagt ctccttcatc gtggcaaact 2400 aatacttcat ttctcaaagc caaaaatgct gtgtcaaccc ttgcagttgt taacgatact 2460 gccgaaaggg gtgttaagct gatgcaagac tttcatggtt taattacagt tgaagaagaa 2520 caaaaacaat ttttattacg ttgtgtacaa gagcacagaa gcatttttcc cgactgtaat 2580 aaaaaaactt tgaaaagaaa atatgttcaa taattaattt taaaattata ataaacgtaa 2640 aatgtcaacc aattggtgtt ttatttttta aaacgtattc taattttaag gttttccata 2700 gtgctgcagc ctaaattata tttgaaagtt tgacgttaag tcggcacggg ttctggtgat 2760 cggaagtcga taatatttca taaatgagtt attaggacca aatgaaaaaa aattaggggg 2820 tatgcgtacc taatttatca aaaaaaatat atccccctta tgtcatggga caccc 2875 // ID Merlin5_SM repbase; DNA; INV; 1054 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; Merlin5_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1054 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1895-1895 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 85..984 FT /product="Merlin5_SM_1p" FT /translation="MEITTFLYKSLNDNWIVNFLIENSILFVPVCYKCNRY FT MQKSKQVATVYRCFNTRDLIRCNTSKSILNGTIFWHAKLSLHEVFYTLNEW FT RKDVSAVFVALDIKKNCSTITRWYKKFNDLVFHYINLNESNMIGGPGTTVE FT IDECLLVKRKYRRGRSLRNQKWVIGGVVRGNVNEYFVEQVEHRSRRFLFDV FT IRRRVRPGTTIVTDEWRGYRGLVSILRNFDYTHQTVNHSMFYLDPVTGANT FT QTVEGFWSIMKRYLRKKGTNIGNINEILKLFKVSVFKKTMRSDWFNQCIRI FT LQEYTLFK" XX SQ Sequence 1054 BP; 378 A; 129 C; 201 G; 346 T; 0 other; ggttaaaatg caataaacta cgcgcaatta aaaattgata agaaattgat taataaataa 60 tagtttgaat ttattttttg gcatatggaa ataacaacat ttctttataa atctctaaat 120 gacaattgga ttgtcaattt tttgattgaa aattcaattt tatttgttcc agtatgctat 180 aagtgtaaca gatacatgca gaagtcaaaa caagttgcca cagtttatcg ttgtttcaat 240 acgcgcgatc ttattcgatg caatacttcg aaatcaatat taaatggaac aatattttgg 300 catgcaaaat tgagcttaca tgaagttttt tatacgttaa acgaatggag aaaagatgtg 360 tccgcagttt ttgttgcatt agatataaag aaaaactgta gtacaattac acgttggtac 420 aaaaaattta acgatttagt ttttcattat attaatttaa acgaaagtaa catgatcggt 480 ggaccaggca caacagttga gatcgatgag tgcttgctag tgaaaagaaa atacaggaga 540 ggcagaagtt taagaaacca aaaatgggtt ataggaggtg ttgtgagagg aaatgtaaat 600 gaatattttg ttgagcaagt cgagcatagg tcaaggaggt tcttgttcga cgttataagg 660 cgcagagtac gaccaggaac taccattgtg acggacgaat ggcgtgggta tagaggtttg 720 gtttccattt taagaaattt tgactatacc catcaaacag ttaaccatag tatgttttat 780 ctagatcctg ttactggagc taatacccag acagttgaag gcttttggag tattatgaaa 840 agatatttga gaaaaaaagg aacaaatatc gggaacatca atgaaatatt aaagttgttt 900 aaagtttctg tctttaaaaa gaccatgcga agtgattggt ttaatcaatg tataagaata 960 cttcaagaat atacgctttt taaataaatg cttatcaatt tttaatcaat tttttattaa 1020 tttttaattg cgcgtagttt attgcatttt aacc 1054 // ID Gypsy-40_DPu-LTR repbase; DNA; INV; 385 BP. XX AC ACJG01004461; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_DPu_; KW Gypsy-40_DPu-I; Gypsy-40_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-385 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004461; Positions 11261 10877. XX SQ Sequence 385 BP; 97 A; 107 C; 71 G; 110 T; 0 other; tgtaatgcgc agccgagact gaccagaaga gtgcatatcg tctactgtat gtaccgtatg 60 taaacccgcc cccttcatgc tcctcttcta ttaatagcca tgatcatttt gtcccgccat 120 tttggaatca ctgcaatggc cgacacgttc aactattcaa atatggccgc gcgggaatca 180 gtatataaac tcccgtcgca ccctccaaga gttagactag tcccctcact cctcaactgc 240 ttgtggctcc agttggaagt cgatctctat tatactcctt attgtattct gtgcagtgtt 300 aaataataaa cttagttcgg cccaataatc cgagttattc ttcgaagtaa gcgcgccttt 360 ccaagcccct gaaaagcgcc ttaca 385 // ID L2_Ele3 repbase; DNA; INV; 4926 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 19-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L2 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L2; Non-LTR Retrotransposon; Transposable Element; L2_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4926 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4926 RA Kojima K.K. and Jurka J.; RT "L2 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 664..1893 FT /product="L2_Ele3_1p" FT /translation="MHVRLAVVILCDRIGAFCAVGYAEAFSIPDALALMRR FT ISKLGPPRRTPVVLRVMSNQFNPSVMDREALIIKTMRDLLLRVDSMDLRIG FT QFGENLKTLNSVLSLTTSARRDSDVSTRVHRFLSGTTVHDHTDFYNSINRL FT NLDSSLHSNAANNTIVFPEEDQGAMLDDSVSNAPVEPRGVRPKTYASVVSK FT TTANVTNSASXVTTANIFATTAVTSVSPATTANAISTTTTIANVMPPTTAT FT SATAYINATLNPAKATGAPTDTIPAFPIMRTTLPAGDSHPTSHASVSNGVP FT QTNCRLKVVSKHRQTRNRESTEEQLKSFYVTPFTIEQTEDDIIEYLRETIN FT TNESIVKCVKLVPRNKNINELSFISFKISVSEDLVSLIGDSFYWPEGVEIR FT EFQSKNEMRSNQMLSM" FT CDS 1815..4814 FT /product="L2_Ele3_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="FLLARRCRNTRISVKKRDAIKSNVVDVIHVSDFNANV FT PLNSNFTLNSVLPHPAQTKEVIYEKNINMHYFNYLQTKTFLEKFSRIFLKI FT GVHNARSLSNNIDNYRTFLEKSEINILAIVESWLKPSITNKSVELNGYKIV FT RSDRCNSKKSRGGGVAFYLKSNTKFSILRKSGKDSDVDYLFIKLNLANLVC FT GVVYKPPDVNISKLDDIFNIVSEISSTEPNILVMGDFNINILNYNSPQTRR FT LIDQLSSLSLKLIDTWPTCHKSGCQPSVIDLLMGNCTGNICNAYQSSLGGI FT SDHDFICVDYRFKISKSKPEVYWARDYHKINNETFLSDLRDCSFDRLYYCS FT NVNEKLNSFNEIIFSVLDKHAPLKKKTARDPVSPWIDNHVRRAFKNRSEAY FT ECWKRDKSNRGKWNNFKCLRNLANREIKRKKHEYFTSKLNIDLPAKQLWNN FT IKRLGLKQVNTKAGGGDVTAASLSTYFISHFVPETFVENPGGIDAQSRFSF FT RGVTSNEILTEFTLASCDSVGNDLIPLNVLKMSLSVTLPYITDIINCCITT FT SEFPEIWKVAKVVPIGKTDNPTVESEYRPISILCALSKIFESIVSKQLNEY FT LVSNKLLSPLQSGYRKSCSTITALIKVENDVREALDKKMITVMALLDFSKA FT FDTISHHLLCNKLKFRFKMDSLSVNLIYSYLSGRSQYVDFNNTMSDKISVP FT CGVPQGSILGPLLFSMYINDLPLVLSFCKFHLYADDCQLYISESIXNLNDS FT VQKINNDIQNILNWCVHNGLILNAKKTQTIIFSNKRMKVINPPKVKVDNDF FT IEYSDVVKNLGILMDCKLTWNDQVNAVCCKVYKALHSLVALRKCTPQHTRF FT QLARALIVPLFDYGDILFSLVSNNNFRKLNHAFNTITRYVFNLRKFDHISR FT YVNLLLGRSFNNHLSLRVCTQTFKILNNPPSYLENFFSYTRSPRTPLLVVP FT RSASGNLKLSFRHRAISAWNKLPRNCRSARNYSSFKNNIELFYNS" XX SQ Sequence 4926 BP; 1520 A; 969 C; 880 G; 1555 T; 2 other; accaacacag tcaactcagt tataatcacg cttctgatat ctccgaacta attatcgata 60 tttcgtctgt tgccggaccg cctgaaacat tttgtttttg gaatttgcca aagtattact 120 cgcctatcaa aaagcaactg aaggtcaaac aaatgctatc aattccttag tgtaaaagtg 180 cttcaaagat tgtgttttca ttctgggtag ccactctgga accgctgaaa agtaccgaaa 240 attgaacaac ttgtgttagt ggcgctgagg tgatataatt gtacattcac gacgctcgta 300 tctgcaatca cagttgacta gtactgatat tgctaaattt gttctgcttg tgcatttgca 360 cgcttcatac aagcggtgct gttgtgttgg atagaggagc gtgtcttggt gcgttcggtg 420 gtctcaaaag ccgaaacatc aattgatgtg catgatctgg ttcgacaccg gctttagcga 480 acagtactat acagtggaac aataacagct gatttcggga aagcaacaca ttgcttggcg 540 cagcttaata gtgtatgtac ttatatcatc gtagcaatac tactcttgtt tcttcttttg 600 gttgggaaat tcaaatcccg tgtggacttt acttagggtg gggcatactt ttttgaatca 660 tggatgcatg taaggcttgc ggtagtaatc ttgtgcgatc ggataggagc gttttgtgca 720 gtgggttatg cggaagcgtt ttccatcccg gatgcgttgg ccttaatgcg acgaatttca 780 aagcttggac cgccacgtcg gactcctgtg gttttgcgag tcatgtcgaa tcaattcaac 840 cctagcgtga tggatcgtga agcgctcatt atcaagacca tgcgagacct tctactgcgc 900 gtggattcga tggacttgag aattggacaa tttggcgaaa atctgaaaac actcaatagt 960 gtactttcat tgacaacatc agcgcgacga gactcggacg tttcaacaag ggtacatcgt 1020 ttcctctctg gtacaactgt tcacgatcat actgactttt acaacagtat taatcgtctg 1080 aacctcgact catccctgca ctctaatgct gccaacaaca caatagtatt tcctgaggaa 1140 gatcaaggtg ctatgttgga tgattcagtg tcaaatgctc ctgttgaacc aagaggtgtt 1200 cgtcctaaaa cgtatgcatc tgtcgtttcc aagacgactg cgaacgttac caactctgcc 1260 tctcntgtta ccaccgccaa catcttcgcc actaccgccg tcacctctgt ctctcctgct 1320 accacagcta acgccatctc aactaccact accattgcca acgttatgcc accaaccacc 1380 gccacctctg ccaccgccta catcaatgct accttaaacc ctgccaaagc tactggtgcc 1440 ccaaccgata ctattcctgc attccctatc atgcgaacca cacttcctgc tggggatagt 1500 catccaacgt cccatgcttc ggtttctaac ggtgtaccgc aaacgaactg ccgtttgaaa 1560 gttgtcagca aacatcgtca aactcggaat cgtgaatcta ctgaggaaca acttaaatca 1620 ttttacgtca ctccatttac cattgagcaa actgaagacg acatcattga atatctacgg 1680 gaaaccatta acacaaacga gtccattgtg aaatgcgtta aacttgttcc gcgtaataag 1740 aacattaacg aattgtcttt catatcattt aaaattagtg tttcagaaga tttagtttct 1800 ttaattggtg atagttttta ttggccagaa ggtgtagaaa tacgcgaatt tcagtcaaaa 1860 aacgagatgc gatcaaatca aatgttgtcg atgtaataca cgtaagtgac tttaatgcta 1920 atgtaccttt gaattctaat tttacactga atagcgtcct cccacatcct gcacaaacta 1980 aggaagttat ttacgagaaa aacattaata tgcattactt taactattta caaactaaaa 2040 ctttccttga gaaattttct cgaatttttt taaaaattgg tgttcataat gccagaagtc 2100 tgtcaaataa tattgacaat tatcgaacgt ttctagaaaa atctgaaatt aacattttag 2160 caattgtaga atcttggctg aaaccatcca taacaaacaa atcagtcgaa ctgaatgggt 2220 acaaaatagt aagatctgat cgctgcaaca gtaagaaatc acgtggtgga ggagtagcct 2280 tttatttgaa atcaaatacc aaattctcta ttttacgcaa atctggaaaa gacagtgacg 2340 tcgactatct gtttattaaa ctgaatctcg ctaatttagt gtgtggtgtt gtctacaaac 2400 ctcccgatgt taacatatcc aagttggatg atatatttaa tattgtttct gaaatttctt 2460 ccactgaacc aaatatatta gtgatgggtg acttcaatat caacattttg aactataatt 2520 ctccacaaac tcgtagacta attgatcaat tgagttccct atctctaaaa ttaattgata 2580 cttggccaac ttgtcataaa tctggttgtc aaccatcagt tattgattta ctaatgggta 2640 actgtactgg taatatatgc aatgcttatc aatcgtctct gggtggtatt agtgaccatg 2700 attttatctg tgtagactat agatttaaaa tttcaaagtc aaagcctgaa gtgtactggg 2760 ctagagatta tcataaaata aataatgaga cttttctatc tgatttacgt gattgcagtt 2820 tcgacagatt atactactgt tcaaatgtga atgagaaact taattctttt aacgaaatta 2880 ttttttctgt tcttgataag cacgctcctc tgaagaaaaa aacggcaaga gatcccgtta 2940 gcccttggat cgacaatcat gtccgtagag cttttaaaaa tcgcagtgaa gcatatgaat 3000 gctggaaacg tgataaaagt aatcgtggaa aatggaataa cttcaaatgt ctacgtaatt 3060 tagccaatag agaaatcaaa cgaaagaaac atgaatattt tacatcaaaa ctgaacattg 3120 atttacctgc aaaacagtta tggaacaata ttaaacgttt aggccttaaa caagtcaata 3180 ctaaagcggg tggcggtgat gtcactgctg catcgcttag tacttatttc atttcacatt 3240 ttgttccaga aacttttgtt gaaaaccctg gtggtatcga tgctcaatca agattttcat 3300 ttcgcggagt aacaagtaat gaaattctca ctgaattcac tttagcgtcg tgtgactctg 3360 ttggcaatga tctcattcct ctgaatgttt tgaaaatgtc gttatcggta actttaccgt 3420 acatcactga tattatcaac tgttgcataa caacctctga gttccctgag atttggaaag 3480 tagcaaaagt tgtacctatt ggcaaaactg ataatccgac tgttgaaagt gaatatcgtc 3540 ctattagcat tttgtgtgcg ctctcaaaga tatttgagtc tatagtctct aagcaactta 3600 atgaatactt agtttctaac aaattactat ctccccttca atcaggatac cgaaaatctt 3660 gcagcacaat tacagccctg atcaaggttg agaatgatgt gagagaagcg ctagataaaa 3720 agatgataac tgttatggcc cttttagatt tcagtaaggc atttgataca attagccatc 3780 atctgctttg caataaatta aagtttagat ttaaaatgga ttcattatca gtaaatttaa 3840 tttattctta tttatcagga cgaagccaat atgttgattt taacaacacg atgtcggaca 3900 aaatatcggt tccttgtggt gtaccccagg gatcgatact tgggcctctt ctgttctcta 3960 tgtacattaa tgatttacct cttgttttgt ccttctgtaa gtttcatctt tatgctgatg 4020 attgccaact atatatttct gaatctataw ataatttaaa tgatagtgtc caaaaaataa 4080 acaatgatat tcaaaacatc ttgaactggt gtgtgcataa tggattaata ctgaatgcga 4140 aaaaaactca aacgatcatc tttagcaaca aacgtatgaa agtaataaat cctccaaaag 4200 taaaagtaga taatgatttc attgaatact ctgatgttgt taaaaactta ggaatactca 4260 tggactgcaa attgacttgg aacgatcaag tcaacgctgt ttgttgcaaa gtgtataaag 4320 ctctccattc actcgttgct ctcagaaagt gtactcctca acatactcga tttcaactag 4380 cacgtgctct tattgtaccc ctgtttgatt atggggacat actgttttct ctagtttcta 4440 acaacaactt tcgtaaactc aatcatgctt ttaacactat cacgcgatat gtttttaatc 4500 ttcgaaaatt cgatcacatc tctagatatg tcaatctact gcttggtcgt agtttcaaca 4560 atcatctaag tttaagggtt tgtacacaaa ctttcaaaat tttaaataat cctccatcat 4620 accttgaaaa tttcttctcg tataccagat caccaagaac accattatta gttgtaccca 4680 gaagtgcctc tggcaattta aaactttcgt tccggcaccg agcaattagt gcttggaata 4740 aactacctag aaactgcagg agtgcgagga attattcttc attcaaaaat aatattgagc 4800 tgttttataa tagttagtct agtacattag cattattgtc gcatagtagc atatactttg 4860 aaaaccatta gaaggttatt tgtgttatta ttgttatgat gtaaataaat aaataaataa 4920 aataaa 4926 // ID MuDR2x_AP repbase; DNA; INV; 1903 BP. XX AC Contig14546; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR2x_AP. XX NM MuDR2x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1903 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1351-1351 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 411..1220 FT /product="MuDR2x_AP_1p" FT /translation="MWFMDRTFKCSSIFAQLYIIRAEFEGNVRTCVYAFLP FT DKLENTYHEMLLNLVIASVRNATPLKPERIIIDFELGVINVINRVFNDVQI FT QGCFFHLCQSVWRQIQQLGLAKLYKENERISKECRVMTSLAFLPIDDILVE FT ISHMYTTFSQELLPLISYFDRTYISGNNNNARYPPSIWNVRRQTMSNSHRT FT NNFSEGWNSSFNKLVGTTNPSFWTVLRAIQLDECGNRLHKRQKSYTAGNNK FT LLHLCNKHQKKEITNMEFLNSIYYTCSF*" XX SQ Sequence 1903 BP; 651 A; 249 C; 272 G; 731 T; 0 other; ggtcactttg tacggtccat tgtttaaaat gcataacgcg gacactttgt actgtcgggt 60 aaaaaggtag attacagaac gcggacactt tgtacggtcg ggtaaaaagg tacattgcag 120 aacgcggaca ctttgtacta ttatatacat aaagtatata tattataaat ataagttata 180 tacctaagtt agcaagtaaa taatactata tattatatat gacaaatttt actaaaaatg 240 ttataaatac attttatata atctggataa tctcatgggt acatatacat cgataatagt 300 ttatcattat atatatttca actgattttt taatttatga caatggaaat gagcatagaa 360 ttttaatttt cgctacaaat ggaggtcttt cttttttatc ttcatcaaat atgtggttta 420 tggataggac atttaaatgt tccagtatat ttgctcaatt atacattata cgagcagaat 480 ttgaaggaaa tgttaggacg tgtgtttatg cttttcttcc agataaatta gaaaatacgt 540 atcatgaaat gttgttaaat ctcgttatag catctgtacg aaacgcaact ccccttaagc 600 ctgagcgaat tattatagat tttgagttag gtgtaataaa cgtaataaat cgagttttta 660 atgacgtcca gattcaaggt tgcttttttc atttatgcca gtctgtatgg cgacaaattc 720 agcaacttgg attagcaaaa ttgtataaag aaaatgaacg gatttccaaa gaatgtagag 780 tgatgacttc attggcgttt ttacctattg acgacatttt ggtcgaaatt agtcatatgt 840 atacaacatt ttctcaagaa ttactacctc ttatttcata ctttgataga acgtatatta 900 gtggtaataa caataatgca cggtaccctc cttctatttg gaatgtaagg cgtcaaacta 960 tgtctaattc tcatcgaact aataatttta gtgaagggtg gaatagttcg ttcaataaat 1020 tagtaggaac aacaaatcca agtttttgga cggttttgcg agcaatacaa ttggatgaat 1080 gtggcaatag acttcataag agacaaaaat cctatactgc tggcaataat aaactattac 1140 atttatgtaa caaacatcaa aaaaaggaaa taacaaatat ggaattttta aattccatat 1200 attatacatg tagtttttaa agaatttttt aaatattaaa gaatggaaat tttaaatttt 1260 atagatcgtt cattttgacg aattattcta aaattgattt attaaaccct taatattaaa 1320 taattatgta tgtgcacttt ttatctaagc ttgataccat tttgacatat tttcatatta 1380 ttatactata caatattatg tagtttttta aaatttttgt caatattata ctttacttgt 1440 cattttgaga tatttccata ttataggtac tatgcatgta gtttttaatg atttttttca 1500 ttattatatt ttatttgtta ttatgacata tttatattgt acacattttt aatttgcttt 1560 aaaactgata tctaatatta taatacaatt aagatttata ataatattat tgattgtttg 1620 attattttga tcattattaa tatacgatga aacaatttac ttttaatttt taatttttac 1680 attataattt ataatattat ataatatact tactacttag gtatataact tatatttata 1740 atatatatac tttatgtata taatagtaca aagtgtccgc gttctgtaat gtaccttttt 1800 acccgaccgt acaaagtgtc cgcgttctgc aatgtacctt tttactcgac cgtacaaagt 1860 gtccgcgttt cgcactttta atgctcgacc gtacaaagtg acc 1903 // ID BEL-649_AA-LTR repbase; DNA; INV; 651 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-649_AA_; KW Pao_Bel_Ele107; BEL-649_AA-I; BEL-649_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-651 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 651 BP; 241 A; 118 C; 102 G; 190 T; 0 other; tgtcgctcct aacaacgtta ttgatcatgc ccctctgtgc gataaggttg cgcgcccatg 60 gacttctgtc atcttgccca tgtattggac tagaataaag cgatcatgga aaacaaatct 120 gaaatagtga aaacgaaagt cattggagac acatgcagat ttgcttaaag ctagttattt 180 acagttttac ttaatctatt atcattaaaa ttacctaaaa ctacatgctt atatctatga 240 aggtagtaca atgaatttta cgaatataaa cggatttacg aatataaatt agagtgtttt 300 ttccctagat atctcgataa agttcatgcg acttatgcta gagttgatgc taaccttaaa 360 ttattagatc taaaattatc accttcaggt aactaagata aaatacatat gaattagaca 420 aaatacttac taaatgacat ataggcagac cttatagagc taatcaggga cattggacat 480 aggacggaca aagaaacgaa gattgactaa acgtgagtac tattttcaac catacaaagc 540 aattactcaa ctaataaaat aattacatgc aggatattaa aatatctcca aatacaagcg 600 gatattataa atctcggaag acctctcttt cttcgccacc gaaacccaac a 651 // ID Gypsy-127_AA-LTR repbase; DNA; INV; 233 BP. XX AC AAGE02017298; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-127_AA_; KW Gypsy-127_AA-I; Gypsy-127_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017298; Positions 116892 116660. XX SQ Sequence 233 BP; 61 A; 63 C; 47 G; 62 T; 0 other; tgtgtggtac acgattagac actcggaaag tatataccat agtgtagcgc atgctgaaca 60 tataatagca accctatgta ttggcactca ccgtgagcac ccccaataca ggctcttgct 120 tgcaaccgcc attacatcgt cctctttcgg ctttctacca gcaagcaggt aagacgtgca 180 gaatcgtgct aagctacgtg gtcccttttc ccgcaatgat ttagcatttc aca 233 // ID Gyp2_Cis_I repbase; DNA; INV; 3940 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gyp2_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3940 RA Smit A.F.; RT "Gyp2_Cis_I - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000114, Ci000868, Ci000363, Ci000114 0.5% div ORF from bp 645 CC to 3827. XX SQ Sequence 3940 BP; 1007 A; 1053 C; 894 G; 986 T; 0 other; ttggtgaccc tcgacgtttc agaaacgatt attttctgaa tgaattatat tatgagccta 60 gaacaggaaa acgccgtagc ccttaaactg cccacgttct ggacaggcca gccacgagta 120 tggtttcaac aaacggaggc gcaatttgct ctaaagaaca tctcggcaga tcagacaaaa 180 tattttcatg tcgtagcggc gttggaccaa agtacggcat gccgcctgct tgatttactt 240 gaaaatccgc ccgaaaaaga aaagtatcag gcaatcaaag agcgattgat cagtgttttc 300 tgcctgacgc agtcacagag agcaaagcga ctgctccaca tgccgccttt gggtgaccga 360 atgccatcct cactcatgga tgaaatgctg gccctgctgg gtaaggagaa ggaatgcttc 420 cttttccggg agttattctt agaacgtatg cctgccaaca taagaacggt cctagcttct 480 caggattttg tcgataatcg caatttagca aggcaggctg acgaattatt attggcaggc 540 cgagaggacg tggacatctt catttcgaaa gtacaaagcc gtaggtttcg cccacaacca 600 gccagagaaa tcgacactga cgccatctgc ttctttcacc gtagtttcgg aaacaaagcg 660 cgtcaatgtc gccctccttg tgcgtttcag ggaaactgca gagccgaccg tccatagctg 720 ctgcgacggc cggcacagaa cgcagccttt tctacgtttg ggatagattg tcaggtcgca 780 atttgctggt tgatacggga gccgaaatca gcgttcttcc ccctacagga cttgaacgtc 840 gcaccaacaa acccggaaaa aacctcctcg cggctaatgg cagcaatatt cgtacatacg 900 gttcacgatt ggaatcattg gatctacccc ccggtaaatt caattggaaa tttgttttag 960 ctgacgtatc tcgaccgcta cttggtgccg atttcttaag ggcaaatgcc ttgttagtgg 1020 atatcaagcg ccgacgccta atcaatacag aaacgtttgc cagcacccca atcggcacga 1080 cgtgccaatc agcaccacgt ctcaacgctg tttcatcggg caaatctgtg tatgcccagt 1140 tgctggctga attccccgca gttacaacgc ctcatttctc ccaatcgaca ttacccaagc 1200 atggagttga gcaccacatt attaccagcg gtccgcctgt acattcacgt gctcggcgac 1260 tgcaccccga gaagctcgcc gtggccaagg acgagtttcg taacatgtta aatatgggaa 1320 tcattcgaag gtcgtccagt ccatgggcat cgccactgca catggttcct aaatcttccg 1380 gtgagtggag accatgcgga gactacaggc ggcttaatgc agccacgcaa ccggatcgct 1440 atgccattcc ccacattcaa gatttttcgg cccgtttggc tgggtcttgc attttctcta 1500 aaattgattt gattcgggga taccaccagg ttcccattgc gccagaggat attccgaaga 1560 ccagtgtgat tacgcccttc gggtgttatg aatttgtacg aatgcctttc ggattaaaaa 1620 acgcggcaca gacatttcag cgcctgatgg acaccgtctg ccaatcgtta gattttgtat 1680 atgtttacct tgacgatatt ctggtagcaa gcaagtcaac ctccgaacac ctagaccatt 1740 tacgacaact ttttcagcag cttgcagacc acgatctggt tgttaatgta ggcaagtgtc 1800 aattcggagt ctcacaaatt gattttctgg gacaccgagt gaaccagcaa ggtgctcgac 1860 cgctgccaga gaaagtggca gccattcagt tgtttccccg cccaacgacg atcaaagagc 1920 ttcaacaatt cgcaggtatg attaatttct accaccgctt tataccttgt gcagcacgta 1980 tcatgtctcc tatttacgag tcattgtctg gaaaacctaa aaaactggtt tggaacgagg 2040 cattggtttc agctttcgaa gaggcgaaaa ccgctttagc gaacgccacg ctcctgcacc 2100 atccattgca tcaggcaccc actggtctaa ccaccgatgc ctcgcagcac gccatcggcg 2160 ctgttctgca acaatttatg gacggcgcat ggcgacccct ggcgtttttt agcaaacgct 2220 tacgtccccc ggaacttaaa tacagtgcgt ttgacaggga attactcgcc ttgcacctag 2280 cgatacgaca ctttcgctat tttcttgaag gtcgcagttt cacggcttac accgaccaca 2340 aaccacttac atttgccttt ttcaagacat caagtccatg gtcagcgaga caacaacgtc 2400 aattagctgc catctcagag tacacaacgg acgtacgtca catcgatggc aaacgcaatc 2460 aagctgctga cgccctctct cgcatttcga ttaatgccac gcaattgcag cttgacatgg 2520 attacgcatc gctcgcggct gcccagctgg acccggaggt ccaagcctat cgcacagcgc 2580 tcacgaacct gaaacttaaa gacatccaac ttgattcgtc tggaactacg attctctgcg 2640 atgtatcgac aggaacgcca cgtcctgtga tcccgaagtc ctggcagaag aaaatattcg 2700 atattgttca caatctgtca cacccgtcta tccgggcaac acgtaagctc atttcttcta 2760 aatttgtctg gcacggttta catcgacagg tcggtttgtg ggcaaaacaa tgtgttagct 2820 gccagaaagc caaaatccac cagcatgtgc gcgctccgct ggagcaatac ccactgtgcc 2880 aatcacgttt tagccacgtg aacattgata tcgttggccc cttggcaccc tcacaaggca 2940 atatatattt gcttacaatg gtcgatcgtt ttacaagatg gccggaagct attccgatga 3000 cagacgcgac tacaaccacg tgtgccaggg ctttcgcctt taactggatt tctcgtttcg 3060 gggtaccgtc caacatttca tcggaccgag gtccacaatt tacctcagca ctgtggtcga 3120 cttttaccag gttgcttggc attcaactcc atcgcacgac ctcatttcac gcgcaaggga 3180 acgggcttgt tgaacgattt catcgtcaca tgaagagtgc tctcatggca cggcttacag 3240 gaccaaactg ggtggacgaa cttccctggg tcctgctggg catccgcacc gctcctaaag 3300 aagacttgaa agcttcttca gcggaattag tatatggagc gaatttagta gttcccgggg 3360 atttttttgg tgtaagccaa gctgatccgc taccttctca aatactacca gcagtaaaag 3420 agacggttgg tcgactacta ccaacaccta tgtcacgcca tggtctagtt cccacaagag 3480 taccaccagt tttggacaca tgtctgtacg ttttcttacg aaacgacaaa caccgtccac 3540 ctttgacgcc accgtacaac ggtccgtacc gtgtcctcag ccgcagccac aagactttta 3600 ctctggagat aggagaacgt catgaagtgg tcagcataga tcgtgtgaaa ccagcatacc 3660 tggacatgac agaaccggtc acagtggcac aaccgccacg acgaggaaga ccacgcaaag 3720 cagacaatga taatttgcca ccgcaaaata cacctcaacg cgagtccccg gctatgtgta 3780 cccgctcggg gcgtttagtt aaacccccta ttcgttttat tgattaacta tgttgctcgc 3840 cagatgtttt tatgtttaaa cctcatttga ttttactgtg ttgctcgcat ggtttaaagt 3900 agtgctgatg ctcattatgt ccgcttctgg aggggggtta 3940 // ID CR1-34_BF repbase; DNA; INV; 3567 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-34_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-34_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3567 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3567 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1605-1605 (2009). XX DR [2] (Consensus) XX SQ Sequence 3567 BP; 1071 A; 844 C; 841 G; 811 T; 0 other; atcgagcggg aaaagaggcg aaagaacctc gtagttttca atgttccaga ggatttaaag 60 gactcgaccg gcagcacgct gagtgataca aaggtcttca cagacatcac taaagaagaa 120 ttcaacctcg tccccaaggt acaagcagcc tacagactgg gaaggaggaa ccctgagaag 180 ccaagaccgc tcctgatcaa aatagacaat gacgacactt tgagcagggc cttgatcctc 240 aggagagcaa aagacttgag aaacagcact cggtggtgta gagtgtacat tgtgcccgat 300 atgacgccaa aagagaggga agtggataaa aagttgcgac aggagctcaa atctcgacga 360 gaagcaggag aagcaaacct ggtcatcaga cgcggacgaa tagtgtgcac acagacaact 420 ggggaatcac agccgcggga cggcaacaac gaccaatccc aggcctgact cgacaaccac 480 aaccatctat catgtagcaa caggaaggag gagagtaaac tgaacattat gtacacaaac 540 attgatcaac ttccaaacaa aagggacgac ctgacgttag ccatatcaac aaaaaagcct 600 gatgtgataa ttctaacgga agcgatcccc aaggctcaga gattacctat tggagcagct 660 agaatcacca tccccgggtt cacagctcac atgaactttg accctgacca gggaaatctt 720 ggaacactgg gaaagcgagg gattgtgatc tacacagcag actacattga tgcaaccgaa 780 gtctcctaca gtgacattga gttccaggag cacctgtggg tgcaagtacg tttgagagga 840 cgcgataaac tattaatagg aggcatctac agaagccagt cttctgaccc aaagtcgtca 900 actgaacact tggcacaact tctacaagta gtcacgggga ctagtcctac gcacctcgta 960 atcgctggtg atttcaacta tcctgaaatc gactgggaac aagcaacctc atgtgcaaat 1020 gacgctcacc catcccacct ttttcttcgt tgtgtgcata aaaaccttct gtaccaacat 1080 gtcttcaagc ccactagata cagacccggt gagttaccga acattctcga tctagttctg 1140 acaaatgaag atggaatggt ggaaaacatt gagtacctac ctggtattgg acatagtgac 1200 catataaggc tgcagtttga aaccagacta tactctgaga gaaaggacaa cccaacgcca 1260 agactcagcc tataccgtgg tgactacgtc aatatggctg cagacttatc tggtattgac 1320 tgggacacca ggctcatgga caagtccttc gacgaggcgt acagtgattt cattagcatt 1380 ctacaaagca gtattacttc acatatccct ctgacaagca agaagtcatg taagaagaac 1440 ttgtactcca ccccagcagt caagagtctc agcaagaaaa agcggagtag ttgggaccgt 1500 tacactgtat ctggagacga ggtggattac gcccgatacg ctgctctgcg gaatgacctc 1560 agaaagctca ccaggaagct ccgatatgag tttgagtcta accttgttcg agacgtaaaa 1620 cggaacccta agtctttttg gaaatacgtt aactctcgtc tcaagaccaa atctgtaatc 1680 ggagacctga agatggcgga tggtaccatg acgactagca gcaaggagaa ggcagacacc 1740 ttgaaccagt tcttctgcag tgtgtttacg agagaagaaa ctggccgcat gcccagaatg 1800 gaagctaagt actacggacc caacctggag gacatcacaa ttactcctga aggtattctg 1860 aaaaaactga aaaagctcaa gaaggataag tcccccggtc cagatggtct gcatcccaag 1920 atactagcag aggtcgcgga agccgtagcc agaccgcttg ctattctctt ccgaaagtcg 1980 ctggacgaag gccggctggc agaggaatgg agacttgccc acataactgc catacacaag 2040 aagggaccga agaacgaacc tgggaacttc cgccctatta gtcttacgtc tgtcattgtg 2100 aaactgttcg agtctgttat tagggacgct cttgcagacc acatgatggt gaacgaactc 2160 ttctgtgacc agcagcatgg ctttgtgccc ggtagatcgt gcataacaca gctaatcacc 2220 acaatggacc tttggactca agcggtggag gcaagtgaac ccctggatgc catctaccta 2280 gattttcaga aggcctttga tagagtaccc cacgcacgct tgatgcacaa actcagccag 2340 tacggtatta gcggaaaact gcatggttgg attcaggcct ttttaacaga cagaagacag 2400 cgtgtttgcg ttgatggtga attgtcagaa tgggccatgg tttcaagtgg aattccccaa 2460 ggatctgtgc ttggaccaat tctctttgtt attttcatta acgacatgcc aagcactatc 2520 gaaagtgcct gtcgcctctt tgcagacgac accaaggtgt tccgtcgagt gagttctcca 2580 gaggaggttg ccaccttaca agcagacatc gacaatttgg cagattggtc taaggactgg 2640 cagctcagct tcaacatcac gaaatgtaaa cgcatgcata tagggtatgg taacccatgc 2700 caacaatacg agatgcaagg cctcaccttg gaagaaacca ctgaggagcg ggatctgggt 2760 gtaattgtcg accaaaaact gaaattccat gcgcattgta ccaccaccgc cggcaaggca 2820 ttccgaaccc tggggctaat caggaagagt tttcaacggc tcgatgagac aactgtgccc 2880 atattgtaca agaccatggt tcgccctata ctagagtatg gaaacgtcat ttggggtcct 2940 cacttcaaag gtgaccagca actgctcgat agagtacaac ataaggcgac gcgacttata 3000 cctggttttg gtgaacttac ctacgaggat agactgcgcc gcctcaggct ccccaccctt 3060 gaacaccgcc gtaagagagg cgacatgata cagctattta agatcgtcaa ggggtttgac 3120 cgcattgacc ctaaccgcct tttcaaattc aatgttgatg gacgtacgag aggacactcc 3180 ttcaagattg tgaaacccct agccaaaaag tcagctcggt ccaatttctt cagcgttagg 3240 gtcatcaatg catggaacga tcttcccgcg gatgtagtgg ctgctgactc tgtaaacatt 3300 ttcaaatcaa agctggacag ttacagtaat tactgggggg gcattgagta cacgcctagc 3360 ccaacatgaa ctctatattg ttattacagt ctcttttcac aaactctaga ttctagcttt 3420 taattgttaa ttgttgaata aaatggacac tagtttttaa ttgttaattg ttaaataaaa 3480 tggaaactag tttttaattg caattgttag aagcggattt cacaggcaaa ctgcctccta 3540 tccgatgatg tgatatgata tgatatg 3567 // ID CR1-36_BF repbase; DNA; INV; 3406 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-36_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-36_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3406 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3406 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1607-1607 (2009). XX DR [2] (Consensus) XX SQ Sequence 3406 BP; 984 A; 805 C; 706 G; 911 T; 0 other; aggggagggg aggtttcggc cggccaggca ccctacgggg atgataccct agcttgcatc 60 cgtaggggct gtgaaggggg gtaaccctgt ttcagcccta ggagcaagtc ggcttcggcc 120 cctggtcttt ttatttggct tgggttgact gcccaagtca tcttatctca tattgtcaca 180 ttacaattac gaaaagcctc gaaatggact gatctaatgg cgatgaccct tgcaagaatt 240 gacaaagatt atggaattgt gatccaattt atgctaccct tttttgtgct gttccttatt 300 tgtggtgata ttcacccgaa cccaggtcct cagcaaaatg agctcattgt tcgattcaca 360 aacattagag gtttacgcac taatctgaca taccttgaac acaatctagt tgcatcgcat 420 cctcatatat tttgtgtatc agaaaccttc ttaagcagta aggtctatga tgagctattg 480 gtgatacctg gttactctgt cttcagaagg gatagaccca atgactctgg ttggggtggg 540 ttagtagtgt actgcagtga agctcttact gttgcaagaa tgcccgactt tgagaatcac 600 tactatgaat atatatgcct tcaagtgtcc ttgcctggcc gtaagatttt catgttttgt 660 gtctaccggc ccccatcaga tgatgattct ttatttgatg tgttatctga gaatattgac 720 aatatccaag agctccaccc aaaatcagag attattgtcc ttggggactt aaactctcac 780 cacgaagact ggttaggcag cgacaagact gacatgcatg gagaatgtgc atatgaattt 840 gctattctga acaatctcca acaactcgtc gatgaaccta ctaggcttgg tactgacggc 900 aagtgctcca aactggacct ttttcttact acctctcctg ataaccatgc tgttaccgtc 960 agcgcgcctc ttgggtcctc tgaccactgc gtggtaactt ctgtggcaaa ctacaaactt 1020 ccaacccaga gcaaacctgg gagacgtaaa acttggtact atgacaaagc tgattgggac 1080 ggaatgcgct cctatctggc tgacacaaac tggtctactc tcctaaaagg caaaagtgct 1140 gatgcagcat ggacctgtgt gagagatgtg attagtgata ctatggacat gttcattcct 1200 tccaaaatga ccaaacattg cttttctgac aaaccatggt ttaatgaaga ttgcaaacta 1260 gctgttaaac agaaacaaaa tgcattcaaa agatggcaaa acaaccgaac aatggacttc 1320 tggactttgt atattaagtc aaaggcagag tgccaacgag ttgtgcgtcg tgcccgtgcg 1380 cagttttcac tgcacttaca acagaacctc aagcatgcag acaataagca atggtggaag 1440 cttgtcaaat cagtaactga cagcaataaa acgtcttctg tccctccatt agtcagtata 1500 ggcaaaactt acagtaaggc gaaagataaa gctgagctgc tgaacaagac cttcgctgca 1560 aatgctcatc ttaacgacgc agggaaaagt cccccagtcc tgccacccaa aacagatgaa 1620 accatctcat ctctgaaatt ctggcccaaa acagttcttc ggaaactacg taacttagac 1680 accaccaagg ccaatgggcc agatggaata tcggcaaagg tactcaagaa gtgcggcccg 1740 gaacttgccc ctgttctatc caaactattc gaaatctctc ttgactcaca gacagttccc 1800 tccgactgga aagctgccca ggtgatagct gttcctaaaa agggaaacaa gaaagatcct 1860 tccaactata ggcccatatc cttattgccg ataatatcca aagttatgga aagtattatt 1920 tgcgaccaca tccgtaaaca tctcgataga caccagcttc tttgtgacag ccagtatgga 1980 tttcgagaga agagatcaac aattgacatg ctttcatata ttactcagtg gtggaacaat 2040 gctcttgaca ctcagaaaga aattcgagtc attgccctgg atattaagaa agcgtttgat 2100 cgtgtttggc accgcggtct actgtccaag ctgatctcct ttggtatcca cggggatctg 2160 tacggttgga tctcatcctt cctagccgac agatgccaat cagtagtact ggatggtttt 2220 acctcctcat caattccaat aagtgccgga gtcccccagg ggagtgtctt gggacccctt 2280 ctgttcctga tctttataga tgatcttgaa caacacttag tcaatgacct ccacctattc 2340 gcagatgatt caacactaca tgtggtgatt aaaagccctg gcctaagaaa cacttgtgct 2400 ttgagcttac aacaggatct tgactcgata gaaaagtggg cctccgactg gtgcataaca 2460 ttcaatgcag gtaaaactga ggagatgatc ataagtagga aacgggacca gactcatcct 2520 ccgctgtttt tcatgaatga agaactgaag cctactcaga gcataacttt gcttggtgtc 2580 actatcacca atactctaac ctggacccca ttcatcaaaa gccttgccac aaaaaccgca 2640 agaaggctgt ttatcctagg acgcaccagg gaccttctcc ctctccaagc ccggataaca 2700 gtatacaagg catatatccg ccctctaatg gaatatgcgt cccccatatg gagtggcgct 2760 ggcacaacag ctttaaagat gctagataga cttcagagca aagctcttcg tcaacttcag 2820 attagggtcg acccacagaa tgccggcatc ttccccttgg accacaggag gaaagtagcc 2880 agtctctgca ccttctaccg gcacatcttc ctacaacctt cccaggaact ttcaggaatc 2940 atgccgacac gcgcaaccgg gacacgagta accagatcat ccacaagagc tcacccttac 3000 cttgttaaag taccgagatc aaatacccaa cttcacctat catcttatgt accacgaact 3060 agtagactct ggaattcact tccagcatca gtgtttccag caagacccga catgaacatt 3120 ttcaagacag ctgttaacaa atttcttttg aactataatt ctcagagcta gcttcattgc 3180 atatgtatat aattgtatat ttgagttgat gtagacgctt ttgggatgta catatgagct 3240 aatgtaaacg tttttggatc acaataatgt gcttaatgtt cgaggctact gtaatgtaaa 3300 tgtgacgtat gactgcttta ctttgtatcg tatgggccgt atatcatcag catgatgtaa 3360 atattcaggt ccttttgtac tgattaaaaa aaaaaaaaaa aaaaaa 3406 // ID BEL-1_BMa-LTR repbase; DNA; INV; 417 BP. XX AC AAQA01001578; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Brugia malayi genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_BMa_; KW BEL-1_BMa-I; BEL-1_BMa-LTR. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-417 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Brugia malayi genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AAQA01001578; Positions 1196 780. XX SQ Sequence 417 BP; 112 A; 80 C; 57 G; 168 T; 0 other; tgttagcaaa cccttcgtta gcaaaaccct ttattagcag tctttgtttc atatcattta 60 aatgtcctcc tctgccgaat atctggtcag ttcagttctg tatataaact aatcctgtta 120 acaatcttct ttgttcatcc atgagaaatc ctcccatcta atctccattc gttaaatcag 180 tcttatatat aacccaacca atccttctat aaccatcatt tgctttatgt tttgggtggt 240 attgggatgc gtctaatact ttataatttt gtgttatttc actattttta tattattgca 300 ttttttatag tttacgcgtt tgttaaacgg tgtagcatcc gtacaatatg tataaagcgc 360 tattattgct tcatgttaaa taaaggttta attacccaag tttgaggttt cacgaca 417 // ID CR1-1_IS repbase; DNA; INV; 3022 BP. XX AC . XX DT 20-JAN-2010 (Rel. 15.03, Created) DT 20-JAN-2010 (Rel. 15.03, Last updated, Version 1) XX DE CR1-1_IS autonomous non-LTR retrotransposon from deer tick - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_IS. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-3022 RA Kapitonov V.V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from animals."; RL Repbase Reports 10(3), 529-529 (2010). XX DR [1] (Consensus) XX CC The consensus sequence is less than 1% divergent from several CC copies of CR1-1_IS. XX FH Key Location/Qualifiers FT CDS 101..2911 FT /product="CR1-1_IS_1p" FT /note="APE endonuclease and RT domains." FT /translation="MECTLYYQNVRGLRTKSSEFLSGVIGNSYTIISLTET FT WLEDTIPSSQYFPPCYSVFRKDRDYAATRQRYGGGVLTAVDTSLCARRRYD FT LELYSECVWIEVTCRDGLNFLIGNYYFPPHLSSDVFARHFRDLEGKLDVTK FT FRVHIYGDFNLPGIDWLANYNLITSDVTSLKASSLFDFIYFNGFEQRNTIQ FT NSAGNVLDLVLVSTPINYISPITDPLTRVDHFHPPFAVSFCFPLRKQALCS FT YEYLLYSSGDYLNMYKYFQCYDWKPILNNKCADSAAEALTEVVSSAINDFV FT PKRVSRLNKYPAWFSKELRSCLRRKLHYHRLFKRSGLDKWYAKFSECRALA FT KWLFKKDESAHLHGLESDMYAKPQNFWKYVPTKKKRNSSVAPMRNSSGYTC FT DSIEIANMFAEHFKSCFSSASVPDNNATCHGTSDFLSFIPIDEPEIRESIR FT NMKSKLSAGSDGIPSFIVKGCAELLTPVLLHIFNLSIHSGIFPSIWKRSVV FT VPIHKAGDPSTVNNYRPVSLLCSFSKVFEMIMHKRLYRHFRHKMNPEQHGF FT LKGKSVETNLCSFLDYAVPLVCNREQVDVIYFDMTKAFDKVNHSTLLTKLD FT SYGLCLTYCQWFKSYLTCRTNYVRVSGSMSDYFSSPSGVPQGSNLGPLLFL FT MFVNDISLCAKNSKVLLYADDIKLFRRVQSHSDCVLLQEDALQISQWCAKN FT YLPLNEQKTRTMSLTRKRDSLAFSYTIGGSVITRVSVIRDLGIFIDSSLNF FT NHHVLTICNASFRNLGAISRFTRKFSSPVCLVRLFCCLVRTKLEYASSVWN FT CINLTASCCIERVQKRFIGIIYDRYFDRTCYYSYEQLLKKLGLCTLLDRRR FT LRDLMFLHKVVNGAVDSPLLLSTVCFHAPLRSNRFQSPFYPSALNVASPLT FT RMQLEFNKLDQSAVDIFSDLIVFRHRLESIICT" XX SQ Sequence 3022 BP; 797 A; 629 C; 635 G; 961 T; 0 other; catttggccg gcagggtgcg tttttcgccc ttttttcggc attcttaaag aactctcgga 60 cacggacacc tcggtcgatg taaacgcata atcaggtacc atggagtgca cgctttatta 120 ccagaatgtt cgtggtctta ggacgaagtc atccgagttc ttgtccggag ttataggtaa 180 ttcgtacact atcatctccc ttactgaaac gtggctcgag gacactatcc ccagctctca 240 gtattttcct ccgtgttatt cagtatttcg gaaagaccga gattatgccg ccacgcgaca 300 gcgatatggg ggaggtgttt taacggccgt tgacacttct ctgtgtgcaa ggagaaggta 360 cgatcttgaa ttgtactcgg aatgtgtatg gatcgaggtg acgtgccgtg atgggcttaa 420 ttttttaata ggtaactact actttccgcc acaccttagc tcggacgttt ttgcacgtca 480 ttttagggat ttagaaggga agctagacgt gacaaagttt cgtgtgcata tatatgggga 540 ttttaatcta cctggaattg actggctggc caattataac ttgattacga gcgatgtcac 600 cagcctgaaa gcttcgagcc tgtttgactt tatttatttt aatgggtttg aacaacgtaa 660 caccatccag aattcagcgg gaaatgtgct tgaccttgta ttagtttcta ctccaatcaa 720 ctatatttcg ccaatcactg accccttaac acgtgttgac catttccatc cgcctttcgc 780 tgtgtcattt tgttttccgc tgcgtaaaca ggccctatgc agctatgagt acttactgta 840 cagcagtggc gattacttga atatgtacaa gtactttcag tgctatgact ggaaacctat 900 cctgaataac aagtgcgctg attcagctgc tgaggcactt acagaagtgg tctcgagcgc 960 gataaatgac ttcgtgccta aacgtgtttc gagactaaac aagtaccctg cgtggttctc 1020 aaaggaatta agaagttgct tgagacgtaa gcttcattat caccgtcttt ttaaaaggtc 1080 gggattggac aagtggtacg caaaatttag tgagtgtcgt gctttggcaa aatggctttt 1140 taaaaaagac gaaagtgcac atttgcatgg gcttgagtct gacatgtacg ccaagcctca 1200 aaacttttgg aaatatgtac cgacaaaaaa gaagcggaat agtagcgtgg caccgatgcg 1260 aaatagttct ggctatacgt gtgattcaat tgaaattgcc aacatgtttg ctgagcattt 1320 taaatcatgt ttttcatcag catctgtccc tgacaataac gcgacgtgcc atggcacatc 1380 tgattttctg tcttttatcc caattgacga acctgaaatc agggaaagca ttcgtaatat 1440 gaaatcgaag ctgtctgctg gttcagatgg aattccaagt tttattgtga aagggtgtgc 1500 cgaactgcta actcctgtct tattacacat atttaattta tcgatacatt caggaatatt 1560 tccttctatc tggaaacgct ctgttgttgt tccgattcat aaagcagggg accctagtac 1620 tgtaaacaac tatcgtcccg tttcattgtt gtgcagtttt tccaaagttt ttgaaatgat 1680 aatgcataaa cgtctgtaca ggcactttag acataaaatg aacccagaac aacatggatt 1740 tctgaagggt aagtcggttg aaaccaatct atgctctttt ttagattatg ccgttccatt 1800 ggtttgcaac cgcgagcaag tcgacgtaat atactttgat atgacgaagg cattcgacaa 1860 ggttaatcac agcacacttc tgacgaaact agatagctat ggtttgtgct tgacttactg 1920 ccaatggttc aaaagttatc ttacatgtcg tacaaattat gttcgtgtgt caggatccat 1980 gtccgattat ttttcatcgc cttctggagt accccaagga agcaacctgg gtccactatt 2040 atttttgatg tttgtgaatg acatttcgct ctgtgcgaaa aattcaaaag tgctcttgta 2100 tgcggatgac atcaaactgt ttcgacgtgt acaatctcac tccgattgcg tacttcttca 2160 ggaggacgca ttgcaaattt cacagtggtg tgcaaagaat tatcttccac tgaacgaaca 2220 aaaaacaaga acaatgtctc tcactagaaa gcgtgattcg cttgcatttt cctacacaat 2280 aggtggcagc gtcatcaccc gagtctccgt cattcgtgat cttggaattt ttattgattc 2340 atcgttaaat tttaatcatc acgttttaac tatctgcaac gcctctttta gaaaccttgg 2400 ggcgatatct cgtttcacga gaaagttttc ttcgcctgtt tgtcttgtgc gtttattttg 2460 ttgcctagtg cgtacgaagc tggagtatgc ttcttctgta tggaactgca ttaacctcac 2520 tgcatcatgc tgtattgagc gtgttcaaaa acgcttcata ggcataattt acgacagata 2580 tttcgaccgt acttgctact acagctacga acaattactt aaaaaacttg ggttgtgtac 2640 gttactcgat agaagacgat tacgtgatct aatgttttta cataaggttg taaacggtgc 2700 tgttgactca ccacttttgt taagcactgt ctgttttcac gctcccttgc ggtccaatcg 2760 atttcaaagt cctttttacc cgtctgcttt gaatgtcgcg tcaccactaa caagaatgca 2820 acttgaattt aacaaactgg atcaatctgc tgtagatatc tttagcgacc tgattgtttt 2880 tcgacatagg cttgagtcaa ttatttgtac atagtcacga ccttgttttg ttgcctgcat 2940 ttatccttga tgacgagcgg tgtaccacta tttaggcctt gtgctgtttt tggacaccca 3000 acaaaaataa agattattat ta 3022 // ID CR1-17_BF repbase; DNA; INV; 3782 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-17_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-17_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3782 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3782 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1588-1588 (2009). XX DR [2] (Consensus) XX SQ Sequence 3782 BP; 1043 A; 829 C; 671 G; 1239 T; 0 other; ataatagatt tctgtgtatt tatatgtcac ttttcatttc gtgtagaata atgagtttgt 60 tactacttaa gcgtgagtcc tcctccctac tgtcccagat gatcagtttg ccgggtgatc 120 tgcagcatag tggggactga actaaaccaa tatagggcag cgataggaac tttctataat 180 atacgatatg gccttagatt gttgtcaagt aaattttgct atataggtat aaacacaatc 240 agcttcatat tgtcaatatt ctactccaat tttctgctct tgttgattgt attatccaat 300 gacatacacc cgaatcctgg gcctatccag cctacgggta cctctaagtg cttaaatatt 360 tttcatgcta atgttaatag tttaattgct ggtaccaagt tagatgaact ggcaacgatc 420 gctaatcgtt tccaaattga aattattgct ataactgaaa cgtggctcag tggtgccatt 480 ccatctgaag atcttgtcat tgacggttac cagctaccat tgcgccgtga tagaaatcga 540 cacggagggg gtgtactggt ttacctttcg acctggattc ctttcaaaag gcgggtcgat 600 ctcgagcccc agttatttga atctatttgg attgaactca gagttaaatc attcaaagtt 660 ctcttctctg tttactatcg ccctccgggt caggacgcta ccacagttaa tgagtttatg 720 gattccctct caaactcggt acagctggcc agagagtcac accctgacgc aattgtcata 780 gttggggatt ttaatgctaa acaccagcag tggtggcacc tcgaccaaac tacatctgtt 840 ggtgccaaat tatttcagtc tacgcagctt ctaaatctta cccaaataat caacgagcca 900 acctgtgact tatctcagaa tccctctctt attgatttga tatttactga tgcatttaat 960 tacgtagata agacatcaat attggcacct ttatcagggt gccaccacag ccctacgatc 1020 tgttctatgt tattctctgt gtattttccc aaaccttatt cacgtatcgt ttgggactac 1080 aggaacatta actttgataa gctatccgaa ttctgcacgc atcctacatg gtgcgacatt 1140 ttcttgtgtt catctgttga cgaatctgcg gtaaaactaa gccaactcat tcttgaggca 1200 aaagacctgt gtgttccaag caaatccata ttagttagac ccaaggacaa accttggata 1260 actcgtgatt tacgtacgct aatgagattg cgtgacaaac tccacaaagc tgccaaactt 1320 tctaaatcct ccactaattg ggctgcttac cgcagagttc gcaatcgact ttccaatgct 1380 atttcccata gtaaatctac atattataag cgccttatta actcattaga cgatccgtca 1440 actttaggta agaagtggtg gcatattgtt aagtattttt acaagccaaa agttacgtcc 1500 actatccctc ctcttaaaga aggtagttcc tttattctag actctacaga aaaggcagag 1560 cttcttaaca aatattttgc gtcacagtct actgttgatg attcaaatgc ttctctgcca 1620 ccattttatt accttacgga tcaacgtttg tccgaaatca ctacttcagc tgaggaagtt 1680 tgtctttttg taggtaacat agacattcat aaggcccatg gtcaagacaa cattgataat 1740 cggttcctca aagttattac gcccctaatt gctgataaaa tagcttatgt attcaatctg 1800 tctttgtgtc acggaacttt ccctgaaatt tggaaatatg ctaatgttat tcctatcttt 1860 aaaaaggggg acccccagga caaatgcaac tatcgcccgg tttctttgtt gtcttccttg 1920 tcaaaggtcc ttgagaaaat agtttacaaa catttgtaca accaccttac acagaacagc 1980 cttctttatc gattacaatc aggttttata cgtggtgatt ccactgtacg gcaattggta 2040 tactttactc atgtaattct tgaagcattg gactcgggga gggaggttcg ttctgttttc 2100 ttggatttct caaaggcctt cgacaaagta tggcatacag gcctagtcta taagttacaa 2160 aaaaatggta tcgaaggccc ccttataaat tggttatata gttacctcca aaatcgattt 2220 caacgtgttg ttatcgacgg ccagtcgtct aactggtgcg gcatttctgc cggcgtcccc 2280 cagggttctg ttctgggccc tttacttttt ttagtttaca ttaacgatat tgttgaagga 2340 cttgcctccc agcctatgct atttgctgat gatagctctc tgctccaagt tattgataac 2400 ccaattaacg acgcactttc gcttaattca gatttggaaa agattcattt atggtccacc 2460 aattggttaa tggaactaaa ccctgctaaa actgaagaaa tgtgctttac ctccaaaaga 2520 ctttatccgg tccatccccc tctttttctt aataaaacca tgattaagtc agtttctagt 2580 cacaagcata ttggggttat ttttagctcc aatatgtctt ggcatcaaca cattgaaagc 2640 attgtcacca aagcctctaa aagtgtccaa ttatttagtg ttttaaaatt caaattatcc 2700 cgtcaggttc tagaaaaaat atacaagtcc tttatacgtt ctttgttaga atatgctgac 2760 attgtttggc acgggtgcac aattgaggaa tcaaatctcg tcgaaagaat tcagtatcac 2820 tgttccctca ttgtaaccgg tgcaataaaa gggtcatcat attctgccac tcgtaaagaa 2880 ctcggctggg agtcattgtc cgacaggcgt catattcacc gcctattact atttcacaat 2940 attgttaacg gactgactcg tgaatacctt accactttac ttccttatgc aattacggtg 3000 atatgtaatt atgatttgag aaacagcaga aacttgagac ctattaaatt ttcgactgac 3060 cgctttggaa agtcctttac tccatattcc atcactgctt ggaacaactt ggacttaact 3120 cttcgtacac tcgatttctc gaaatttcga gaacatcttt ttaaaacgat acgccctccc 3180 aaattcagcc attttagctt cgggtctcga cactcttgca ttctcatttc acgacttcgt 3240 atgggcacct gcagcttgaa ttatagtctt ttcacgcggg gtcttgtgtc aagcccagcc 3300 tgcagctgcg ggtgtccata cgaaacgatt tcacactacc ttttgcactg ccctacctat 3360 aattaccagc gctcagttct catcggcaaa ctctctgggt tacttgtaca tatacttgat 3420 ttccacagtc ttgcaaataa tgccaaagaa atgttgatct tacaaggtag ttctttctgc 3480 tccttcaatg tcaattgcag aattatagaa gcaacacatg tgtacattgc aacaacaaat 3540 cgtttttaga ttcaagtata attctttgtt ttagttttct actccagtaa gggagatcga 3600 ctcacgcaca ctgttcatac atatgtaatt atctatttat tattgtcctg tataatttat 3660 tattttcaat tcttgtataa gtggcggcgt taatataagt tgaatgtaac ttgagtccgc 3720 cgtcactttg tcctcgtccg tttatttgta tttttacacg tatttcaata aagaaaaaaa 3780 aa 3782 // ID Copia-30_CQ-LTR repbase; DNA; INV; 373 BP. XX AC AAWU01004727; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-30_CQ_; KW Copia-30_CQ-I; Copia-30_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-373 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 370-370 (2011). XX DR GenBank; AAWU01004727; Positions 23667 23295. XX SQ Sequence 373 BP; 136 A; 59 C; 67 G; 111 T; 0 other; tgttgacata tacaaacttt tcaacattgg aaccacccca gctagtaata gaattccgag 60 aaccactcta ggttactcgc gagcttagta gacacagtat actgaattta tagtaagcta 120 atagagcttt atggcaaact acaattaagt agtatttgaa gaagtggtgt aagtgacaca 180 tacatgtaaa tagtaatatt aagtaaaatg tgaattatgg tatgtaacga taacggaaca 240 tgtaaaagat tgttgatagg attaggatta atttaataaa actatcttct gaatgcactt 300 gaagcaagac ttgaattctt attctacgca taagatcaca acctggtttt ccggtagcac 360 gtaaccttca aca 373 // ID ENS1_Cis repbase; DNA; INV; 6104 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE EnSpm DNA transposon from Ciona savignyi. XX KW EnSpm; DNA transposon; Transposable Element; ENS1_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6104 RA Smit A.F.; RT "ENS1_Cis - EnSpm DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC ORF1 from pos 470-2911 and 3951-4532. AA 47-404 of the first CC ORFbis up to 20% identical (35% similar) to Tnp2 and other CC En/Spm transposases in plants. The 15 bp TIRs and CACTGb&CAGTG CC termini, also support classification as an En/Spm DNA transposon CC (a.k.a as CACTA transposons for their termini). The elements CC appear to duplicate TA rather than TAA. The few copies in the CC Ciona genome are the first En/Spm members observed outside CC plants. XX SQ Sequence 6104 BP; 1907 A; 1035 C; 1092 G; 2069 T; 1 other; cactgtgaaa aatttccgta gtatttacta taatatttaa cgtatatata gtacgggcgc 60 tacgtaacca tgattttacg taagcgatct agtaaatacc atgcgttctg gaattctagc 120 caatcagcga tcgcgatctc gctgtaaagt taaaatcact gtaccgcatc gcttggcgcg 180 ctttcgtctt caaaggaatt aataggatca aaggcaaaga cagaagttta gttttgtatt 240 tggtttgtaa tcgttatttg tcaggctaat gtacgatgta ttgtgaatat attgtaaact 300 tttggttgtt atcaatagtc tactgtggag ctgaggttat tatggcacat gtctggcagt 360 tattaagtac taggctactt tttgtattga ttattattta aaaggcctgt atgtatatta 420 tacattaata taacttaaaa aaaacgcagg tctgcatgat tggataaata tgtttaagtg 480 ttctttggaa ttttgtaaac agagatttga tacagttgca gacttactac agcattaccg 540 agagcatata cgctttggaa gtgaaataac atgccctatt gagaactgca acaaaaaata 600 cagagtatta tcttcatttt catcacactt atcaagatgt cattcagcta aacctcaaga 660 acaagataat gctgcgagta ccagtgacag tatgaatgtt ccattaacca attcagagaa 720 tacgttttct tatttagcaa catctaaagc tgtggatgat gaatttttgg aaccggtacg 780 tttaagtcat gatattgcat ctatcgacac tcttacacga aaatatgcct tgttttattt 840 tatgcttcag cataaatact tcattcccgt tcgcactgtc aatcagattg ttgaacagat 900 taaagacttg gttttgaaat actcaagcca tgtgactaac aggctcaagt caattttgga 960 aaactatgat attgatgatg aattagtgac atcaattctg aatgaggtta tgaatcaaga 1020 taaattccag gaaattcata aaacggggcc tctaagatct atattttcaa gagaaaaata 1080 ttttaaagcc aattttattt acattgcccc aatcgcaatg agattaaatc cagaagttgt 1140 tcaagacaaa gagacgtttc agtatgtacc gattttaaaa accattcaag ctttattcac 1200 aaataattta gtaaaacacc aatttctaaa cccgctttgc ccggatggaa gctttcgtga 1260 ttttgatgat ggtttggttg ttaaaaacaa ccatttgttt tcaggcagta ataagacact 1320 taaaataata ctgtaccaag atgcttttga agtagtaaat cccctgggtt ctgctaaaca 1380 aaaacacaaa ttactggggg tatattacac tttggcaaat ttttaccctt ggaataggtc 1440 taaaattgat gcgtttcaac ttgtcatttt atgcaaagat cagcttgttt ccgcaaacaa 1500 atgtgaagat ttgtttcggc ccttaattaa tgatcttaaa attttggaaa aagaaggcat 1560 tgatcttcca gggtttaatg aaaaagttcg tggtagtatt gtttgtataa ctggagacaa 1620 tctaggaagt cattggattg gtggcttcac aacaaacttc agcacctcat cctacatttg 1680 caggttttgt tccattaccc gcacagaatt taattgttac tgtttgtgtg aaaaacccat 1740 gcgcaacact gattcctata attgttccgt gcaaaaagtt catgatgctg gtaactccgt 1800 gcacatcgaa ggtgtgaaaa gtgacagtat ttttaatgag ttgcagtatt tccatgtttg 1860 tgctccagga ttgccccctt gtttggcgca tgacttgttt gaaggagtta tacagtacga 1920 cataatgctt gccgtgaact attttataca gcaaaagtgg acatccttga gtgttttaaa 1980 taaaaggata gtgcaatttc catataggtt tcatgataga ctttgcaagc ccttaactct 2040 gtctgctaca tcaaaaaagc tctctggaaa tgcaagtcaa aactggggtt ttcttcgatt 2100 attacctctg cttatgtttg gttgtgtgca gaatcctgaa gaccctgtat ggcaacttgt 2160 tttatgttta cgatctttaa cagaaataat tgtttcaaaa tttcttacaa aaggcacagt 2220 tattcgtctt tctttattaa ttaaagagta cttgtacatg agacaaatgc tgtttagtac 2280 tgttcctctg cgtccaaagc atcattattt gatgcattat ccatggtcaa ttatgcagtt 2340 tggcccatca gtgtgcactt ctacactaag atttgaaagc aaacattcct actttaaacg 2400 ggttattcgg aattctaaaa actttattaa tgtgtcctac actctggcgc acaatcacca 2460 actacttcaa gcttgtttac ttgacagcag tgttttttcg actgaacttc aagttgtatc 2520 ttatggcaat ggccaagtgt ctagtagaat tcagaattcc attatccagt gtggctataa 2580 tctattaaaa tgtgtttttg ttacaaaagc atgctttcgt ggaacggaat attctacagg 2640 catgtgtgtt atattatcga aagagaatcg tgatttacat ttaggacaca ttgtcagcat 2700 gtttatgttt gataattcaa ttgtgctagt tgtagaacgt tgttttgttg aattctttcc 2760 tgatttcgga atttacaaaa tggttccata cccttacagt acgttatttt catgtatttc 2820 tattgaaaat ctattttcca cttggccttt attttgctac accattagcg ataataagtt 2880 atttatttta ccacaatgcc ttaacgtgta aacggtgtaa ttttaaattt acaggtgcat 2940 cttaataata ctgtaacgtt acaaaacntt aatggatcgt gaaatactga gaattttgcc 3000 agagttggag gaaaatcctt taaaatacag gtcctgcatt tctcgcctga ctacattggg 3060 agtacaaaat gtgaaggatt ttgaacacgt ggagttagat gagctattgg gcgaatttac 3120 tattgtgcaa gctagaaagc tcattaaagc ctccaaacct ataggtccaa ctggtatgga 3180 tcttattgtt ttgttatata aaaattcaca ttaaagatat atagaatttt ttgtagtcgt 3240 aaattaaagg atttttaaat atcaattaac agtttatcga aatttaattt tttcctggtc 3300 tgtggacgtt gcttaatgct tacagttttt ttctattcag aactttcaag tggaaacagc 3360 acaagtccgg tttggagttg tagcagtagc agcagctccg ttaatagctc tgaagcgtct 3420 tgggtgttcg attttccagt tccatggagc aaatgctcca gtgctatgct ggaactactg 3480 ggcaaaggtg agccacccaa tcccatgcaa agaaggcaac ttgttcatga aacaattgat 3540 gaagttctta aggtaacgtt atatttatta ttatataact tttttagttt tccattgtct 3600 ttaaccctag taattattgt gacttgtggt cctcctaact gccaagtatg atgccttaat 3660 tccataatat gcagcgcctt attcaggcaa gttttgacat ctgactagcg gtttgggtca 3720 gccaaaattg acacatcaaa atgtaaaaat ttacacaaaa atgaacagtt ttatctgttc 3780 tacattatgt ttacacgata tcaaactaat gcataagtgt gggttattaa gcttttatca 3840 cgcaacattt ttaaatatag ttggcctaaa gaagaatcgc ataaaaacgc aaatgcctgc 3900 tgagaatttg gtactgacac ccatattata cgtttaaatt acgctataac atgatgaaaa 3960 gaatattaat ttataggttt cttttttctt tattgcagat tacaagtaaa cctggccgca 4020 agaatattga gcgaattgca ttttctattg tggcaaagta ccccgactct ttcagagatg 4080 caatagacgg cgtttctatt ggaaatggtc atggctcact gacaaatgag ctgtttaacc 4140 gatgggaaaa tacgatgagg aaaaagcgca agcttgatag ttcagatact tcttcgcctt 4200 cagaaccaac caagaagcct tatggctgtg tgaactggca accacccgct caggacgcag 4260 ctgataacaa caatatgaaa acatggctgt gtgaggaatt taaaaaggca aggagagaca 4320 ccccacatgt tatcacactg atggagaaaa cctatgcagc acaaagggaa ttcatcaaca 4380 caaatgtttc cgtgcaaacc gttctcagtc actggccatt ccttagtgac tctatgtgct 4440 tgtttcaaca ttctggcata cttttaaaga aagatgaaat gcaaaatttg ccagatacaa 4500 tgaccagaag aagctttcaa atatacaagt aagtttcagt tttaaacttg tcttaaaaac 4560 aataaaatat ttatattttg ttaacttatt tggcagagtt ttaaaaacaa gcaagaagcc 4620 atgtgtcgtt ggtgccatta ccagcataaa aaagcaatgc aaaacgttgg ggaatagtgt 4680 cgcaaaatca gtggaaataa tatgtttgct tgcagcacag cttgatgaaa atccagattt 4740 tattgccagg accataaagg taagcctttc attttttcta tttttaaatt ataatttagt 4800 atttgctttc actttcttaa tttatgtcgc aggatgattg cagttttgat gaacttctaa 4860 atgaacttcc agcctctcct gtgcttgtag ccatcggtaa gttggacatg ttttcttgct 4920 tagctttggc gctacggctt ttgtttgcac tgataattcc aatttaggtt taccatattt 4980 aaaatcgtgt gttttaattt gttctacagg tgtaactttt gcaactagcc agtccttcaa 5040 gttggtggta gaaaatcagt tagtaatggc atttgatgat ttctttgttg gtcttgcaag 5100 cctgttttgc tactactatg tttttaatat ggagtatcca ttaaaagccg tggcaacttt 5160 ggagtttatt caaaggtaag aaatagaact atacaatttt gatacttgtg cgctggctgg 5220 tgaagctaca tggttgtttt atattgtgct gattttagat tttttgtggg cattaatgca 5280 aaaggatcaa aatgcacttc atcgaaacat cacaagtacg ctcatccaaa ggtactgagc 5340 ctggtgaaga aaatgaacgc tgcaaaaaaa ttctttaaag aaaccgaata attacatgga 5400 aactgaagca tctgattaaa atcaaacttt ttaccaccac ccactaccat ataacacaga 5460 cactttgtgt tacttaattt ctgcttatta ggttttatac tttattagtt cactttgtgg 5520 gaatttatgg tattacattt aaaatagtaa aataatcaaa cggtacacac tttttaacaa 5580 ctatttctct gtaagaacat tgttttgtat tgtaatttgt aaataagttc tattgtttcg 5640 ttatcacttc attgttattg gaaagtataa tgcctcaaca attcactgtt ttctagcttt 5700 taccgaatat agtcgcttgc cagtaaaata tcaccccagt gtatttgcag ttccgttttc 5760 aagtgttcac tctttctcct aaactgttct ttcgtaataa aacagttttc tacaataaac 5820 ataattttat cctaaaatca gagaaaatcg acgtacattt ttaatcaaaa tcgccgtaga 5880 ttttgaatcg aaatatgcgt aaaattttaa tcgaaatttc tgtaaatttt gaatcgaaaa 5940 ttccgtaaac ttttaatcga aatttccgta aatttgtaat cgaaagcgcc gtaaagttta 6000 tcgtaacata aacgtaaatt tttaatcggt aactatcgta aattttacgg gtataaaaag 6060 tagtcaactg ccagtgactt ttacagtaaa aatttctaac agtg 6104 // ID EnSpm-5_HM repbase; DNA; INV; 8221 BP. XX AC . XX DT 29-DEC-2008 (Rel. 13.12, Created) DT 29-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-8221 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1906-1906 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 287..2269 FT /product="EnSpm-5_HM_1p" FT /translation="MSSYLKRWRNQNKEINKSLMQELNIEDTARNFDSENN FT VIVNDNVENDNNELHFTGDDNNDNIIYHYASDISACNEDVNRNAPLLGRVP FT SSNEKCQLSSNDESDVLSRDLAAWATRNRCTRKTVNELLSILNKNGVSSLP FT HDCKTLLKTPRNIKTKVLCGGEYIYFGIKKGIKQVLKDSLKVIERIDMLIN FT IDGLPLCKSSSIQFWPILAKFSESQPFIVCLFCGKKKPDNLFDYLRDFLTE FT FRLLQVSGIEFNGKILPVSIKGFVCDAPARSFLKNIKGHGGYHSCERCLIE FT GEYDGKVIFTEVNCQLRSDEQFKEYIYLNHSLGTNNVCCDPHQLGRTPLVD FT YGILCVKEFAIDYMHCVCLGVMKRLLKFWLEGPRICRLSCQQKKQISDKLA FT FYHGKMPSEFARQPRSLDEVKRFKATEFRQFLLYTGPIILKDILPKDFYQN FT FLCLSIAVSLMSNSNVVKRQHYLDYAKKLMSCFVLNLQRLYGSQFYSYNVH FT ALLHLHEDVSYHNCSLHDLSSFPFENFLQIVKKFVKTANKPLVQLAKRLSE FT LESSNVSYGSKTITDKIIPNDRDGWFILKSNEIVRIIEIVSDTEFDCKVIP FT FHAIDNFFTIPCNSSDFGIYKISKFNKMKRKFVILSDICYKLVCLEQDNDF FT ILIEVNHEIIY*" XX SQ Sequence 8221 BP; 2889 A; 963 C; 1168 G; 3186 T; 15 other; cccagtgggc accgacgtca ttctcacgtc ggtgcgatgt tggttttacg ttggcgacat 60 cgccaaccta aaaccaacat cgcaccaacg tgagaatgac gtcgaaaatt aacatcgagt 120 tgcgacgttg aattaacatc aaagcgacgt aagacttgcg tttcgaatat taatctgtaa 180 atgacgtcac accaagttgt cattctttga cattcgctaa agattgcaaa atttgtctaa 240 gttgcgttta agcttatgaa attttgttta atatccatta ctaaaaatga gctcttattt 300 aaaacgttgg agaaatcaaa acaaagaaat taacaagtct ttgatgcagg agctcaatat 360 tgaagataca gctaggaatt ttgattcaga aaacaatgtt attgttaatg ataatgttga 420 aaatgacaac aacgaattgc attttacagg cgatgataat aatgataata ttatctatca 480 ttatgcatca gatatatctg catgtaatga agatgtgaat agaaacgcgc cattgcttgg 540 tagagttcca tcaagcaatg aaaaatgtca attaagttcg aatgatgaat ctgatgtgct 600 atctagagat ctagctgcat gggctacacg caatagatgc acaaggaaga ccgtcaatga 660 acttttaagt attttaaata aaaatggagt atcttctctc ccacatgatt gtaaaacact 720 tctgaaaacc cctcgaaata taaaaacaaa agtactgtgt ggtggagaat atatttactt 780 tggtattaaa aaaggaatta aacaagtttt gaaggattct ctaaaagtta ttgagagaat 840 tgatatgcta ataaatattg atggattgcc tttatgtaag agtagcagca tacaattttg 900 gccaatttta gcaaagtttt ctgagtcaca accatttatt gtatgtctat tttgtggtaa 960 aaagaagcca gataatttat ttgactattt gcgtgatttt ttaactgaat tcagattgct 1020 tcaagttagt ggaattgaat ttaatggaaa aattctgcct gtttcaatta agggatttgt 1080 ttgtgatgca cctgctcgtt catttttaaa gaatattaaa ggtcatggag ggtatcattc 1140 atgtgaacga tgtttaattg aaggagaata tgatggaaaa gttattttta cagaagtaaa 1200 ctgtcaattg cgaagtgatg aacaatttaa agaatatatt tatttaaatc attcattagg 1260 aacaaataat gtttgctgtg accctcatca gttaggaagg actcccttgg ttgattatgg 1320 aattttgtgt gtaaaagagt ttgctattga ttatatgcat tgtgtttgtc ttggtgttat 1380 gaagcggcta ttaaaatttt ggctcgaagg cccacgtata tgtcgcttat catgtcagca 1440 aaaaaaacaa atttctgata aactggcttt ttatcatggc aaaatgccca gtgaatttgc 1500 acgacaacca cgttcacttg atgaagttaa gcgctttaaa gcaaccgagt ttcgacaatt 1560 cttgctctac actggaccaa taattttaaa agatattttg ccaaaagatt tctaccagaa 1620 cttcctgtgt ttaagtattg cagtatcgtt aatgagcaac tcaaatgtag tgaaacgaca 1680 acattacttg gattatgcaa aaaaattaat gtcttgtttt gtattaaatc ttcaaaggct 1740 ttatgggtca cagttttatt cttataatgt tcatgctttg cttcatttgc atgaggatgt 1800 ttcttatcac aattgcagcc ttcatgatct ttcaagtttt ccttttgaga attttcttca 1860 aattgttaaa aagtttgtaa aaactgcaaa taaacctctt gttcaattag caaaacgttt 1920 aagtgagtta gaaagtagta atgtttctta tggctctaag actatcactg acaaaattat 1980 tccaaatgat cgtgatggct ggtttatttt gaaatcaaat gaaattgttc gtatcataga 2040 aatagtttca gatactgagt ttgattgcaa agttattcct ttccatgcta ttgataattt 2100 ttttaccatc ccatgtaact cgtcagattt tggaatatac aaaatctcta agttcaataa 2160 aatgaaaaga aaatttgtaa ttttatctga tatttgctac aaactagttt gtttagaaca 2220 agataatgat tttatactaa ttgaagtaaa tcatgaaata atatattaat atttttaaat 2280 ttatgttttt ttgttcgagc gtaaatttta ttatctattc aaggatttat acttgttaat 2340 gtttcaatat gaaagcaata ttttttcatt taaattttga tgcttatttc aatttattat 2400 atattaataa gttgtttttt tttttgtagt ttttatcaac ttgttaagtt tctaggattt 2460 aaatgtttaa atatgaaatg ttaatatatt taaagtcact taaatttgtt tggatatttt 2520 ttttctttta attttttgat gtttttttca acttattggt gtgtttcaga agtgctattg 2580 gattaaatgc atgtttacta aaattttcta taaacatgca ttttaataca attttttctg 2640 tattttttct gtgaaatttc tgaaattttt tttctgaaaa aaaaaagttc aaatttgaaa 2700 tttccaaaaa ttattatctc taacatcaca ttagatttta tattacattt attttacatc 2760 acattttaaa tgtgtacttc cttcttgtct ggtaaatgtt tagaataatg ccataattaa 2820 atgtctagtt tttaactatt acattttaat tacaacattt cttttagttg caacgttatt 2880 ctagcatggc aataattgtc tatttgactt ccttgtgttt cactataaaa aaagtcattt 2940 ttcaaatata agccataagt atagtaaaag gtaatatttt ttgcttttta ttttgtttgt 3000 cttttttaaa atcattttca ttattatttt ttatagatta ttatttttct tatatatgca 3060 ttttattttg cttaaatatt tttcaagttt tctacaaaat ttttcaaagt tattttttta 3120 aattatgaca taataatata tagttagtaa taatgatttt tttttaactt ttcacactaa 3180 tgagatcctt gttatagttg ataacatcac aaaaaaatga ataaaacatt agaagagcca 3240 tattgttggg ctatttggaa tgaaaatgaa gatgaagata cctccggtgt tattccatcc 3300 aattggttag ttcctgraag ccatatattt tggccaccta ataattattt tcaagtcaaa 3360 catgcatttc atgaaagaag agatcctgat cattcatgga aaaaatttac actaattaag 3420 gttaaggtct ttggtgagtg ttgtaaaata tttcaagaca acttgtatat taacttacta 3480 atacacaaac aagttaagtt gacatataca aatttcctat atatttcaac tttatatgtt 3540 tacatatata caacttgtct atctaaatct atttgtctaa tatctaaaaa tctatttgtt 3600 tgctaatgtg tatatatata tatatatata tatatatata tatatatata tatatatata 3660 tatatatata tatatatata tttatataaa tatatatata tatatattta tatwtatata 3720 tatrtatata tatatatata taattaaaaa taattttaat tatttatttt wttttatttw 3780 ttttgtataa tttagtgacw tatgattawg aawtaaaatt tgtattgctg aaswcasaag 3840 ataactttag atttgtaata ttttgttatt ttgctatagg aaaatatttc tagcacattg 3900 ggtaataaca aattaatttt aacagtaatc tatagtgtaa agtatattct atatatacta 3960 ttttgttagt atttactata atttgtagat gacagtcaaa atactctatc atttccttct 4020 ccaccatcta gaattcaagg taatgtaaat tttttatctg tttaatttat atataaatta 4080 aataatcatt tataatcatt ggagagaagg tatagttatt tgtttatatt taatccagac 4140 acacttgctt gggttagcta gttttaattt ttctatttaa aaataacttt aaaaaattta 4200 aatgaattta aycmaacaag tggaagtgga acccaacgtc aacctgtttt atggaacagg 4260 agttttttat catgacaacg ttactagaaa gattactgtt tacattacat tacagtttat 4320 tctattctta tccacattta agtgagtata tatatcatca ttacatattt ttgcatttag 4380 agaaacaaat atcgagtgtg aaaaagaatg aaatatcacc aagtgtgatt gaaaaaaaag 4440 tatagctata agttgtattt ctacttagtt ttatttttat taagtaggat gatgatgatg 4500 atgatgatga tgataatgat gatgatgatg atgatgttga tgatatgatg atgatgatga 4560 tgatgatgat gatgatgatg atgatgatga tgatgatgat gatgatgatg atgatgatga 4620 tgatgatgat gatgatgatg atgatgatga tgatgatgat taatatataa cttttttcct 4680 agttggcaaa atcaaaaaga gccttttttt ccagcaatga aactgaaaaa caagtaacga 4740 ttatttaaag ttaattattt tgtataatta tatttggtgt ggttacgacc taaaaattct 4800 actgtcaact aaatcgacta atatatttta aaaagtctac tgaaattagt taacacaatt 4860 tttaaattga tataagttga tttagtctaa atatttcaaa aaaaaaattt atttaattta 4920 aattattgca aaataaattg taataaaaat aatcaagtcg tttttaactt ttaattatat 4980 atatttaaac aatttattaa ttatatatat attttttcat gtttacaatc attaattgta 5040 aaaggtatgt aaattcaatt taacaaaatg aacaacgtat gtgaatgaag atcttgaaga 5100 tcttgttaaa attttcaata gtttaaataa atgtattttt tattgatttt tcgaataaca 5160 acttttataa aaaaatattg tattttaatg ggtacccagg tacccgcctt ttcacaattt 5220 ttcacagctt tacaattaac aaagcaaaga gttttattat taaagaacag ttatatatta 5280 agttaaatca aatataaata tgtatttata tagttgatga agttaaatca aataaaaaaa 5340 tcgaaataag aatgttagga taattacccg actaaatcga tttttagtca aatagtaata 5400 aatctggagt tgatttactc gactaatttt gacttttgac tgattgactt atagttgatt 5460 tagtcgaagg caataacaac actagttgtg ctgttaagta gtttttttat tttattcttt 5520 aaaatttcta atgttatgaa cataatcaat ataatgttat gttatgaatt tcttttattt 5580 ttcttattaa ttacttttat tattaatgct tttgactaga aaatatgcaa actgaagaaa 5640 agcaatatat ggtcagatga tgaagaggta aaaatcagtc atctgaattt aaactatgca 5700 caatactaaa taaatattat ttcttttaca aaaagaactt tttgtttaaa atactattat 5760 tgttttattt tattaataga ttttttttca ggttcatttt gaggagacct ctggaagttc 5820 acaagtccaa aactattctg aggtactgtg catatgcact aaattatgca tataaaaata 5880 tatatgtata tatgcatata aaatccagat taagttctta tttattaatc taagcttttt 5940 ttgacttttg catcttggtg atgttataat tgataatgga tacctttttt tttatcattt 6000 ttttatactt gaatttattc aattgttatt taatatttgt ttgattaatt aagttgttaa 6060 ttaattttaa tttaataaaa atcaataaag gtgttgattt cattaattta gaaagaacaa 6120 ttaactaata gtcggtatat aagaaaagat gcagctatgt tgccaattaa agggtttcct 6180 ttaactgagt caggtatttt ttgttatggg ttaaaagaat atttttgttt aaaatatttt 6240 tctggtttaa aaaagtaatc aaaaatagtc tatggttttt tggatatgag ttttctttat 6300 tacataaaat gtaataaaga aaattctatt tcattttgaa cttcttttta aattaaaatt 6360 attacacatt aaaaataata aattatttat ctcaaagttg tgattttata ttttcagaat 6420 ttcaagcaaa agtaatgaaa tacttgatca aacaagatca aaggcttgaa agtattgaga 6480 aaattttact taatttaaca tcttcaacag ttggagtaac taaagaagta tccccagtat 6540 gtgatttgac tgagttacac aatttagaga agaaaataga agatgaagaa gaatttaata 6600 aactagttaa taaattactt tttgaattta tgatattgat aattgtcata aagatccgga 6660 gaaaatcaat acttaacaaa atgtctgtat tttaatagtg ccttcgggac tttaccgtaa 6720 ttaacatacc tactaacatg aaaattgatt ttaattttaa atgttataaa ataaaatgtt 6780 tatgttgtga ttaagtttta tttcaatttt ttaattttat caggttttat cactctctgc 6840 tgctgggggt aaagatgtta aagatgttgt gaagaatatt atggaaaggt ttgttgtwac 6900 agtgctatgg actcacaacc tctttatgtt actgatatta aaatattaat agaaatatgt 6960 tgttttagtg tggctacgta tgatgtgttg gccaagttta attttcgggg aactaatcgc 7020 ctcgaaaaaa ataaagagcc aagaccaagt aaagaagctt ttaaaagtct taatatatgc 7080 aaagttgtgt gctgtacgtt ttataaatta ttttagtttt gaaaagcttt tttatattgt 7140 aaactaagtt cttatttgtt ttaacccctt ttagtatgct ataagaactt tcattaattc 7200 atatttaatt tttcttttgt tattttctat aaataagtag taaaatatta aagtttttaa 7260 tagttatata aacaacaaaa tttaaaattt aaatttatta aagttattgt ctaaaacgtt 7320 tatctttaat gtcattcctt attttctccc aacattttca aatggagtta aaacaaataa 7380 gctctctggt cgcgatatgt taacattatt attttggtat gcagtttgat tattgtttaa 7440 gtcagcgttt acaattggta taaattaaaa ataaaataac tgtttcaagc attttttttt 7500 tttacttcar gtgcagcaat gaaagataga aagtttttta aagaagacgt gataaagtgt 7560 gtaaaagaaa ttttacgtta tgctccagac aaggccacag gtggaaggcg gttttctgta 7620 taaaacttta aaattataaa agcataaaat gtaaattact tgttaactta aaaattatat 7680 tatatattat aataccaaaa ctatattcta actttttatt ttatttgaat ttagactcca 7740 gctggagtat aaactaaaaa gttaaaaatc gttactaaaa cgttagaaat cgttactaaa 7800 acgttaatat gcataaaaca taagtttgga gtttgaaaag aaagttaaaa tttagtagaa 7860 ttttactttt aacttccaaa actcgaaact taatataaaa catcattgat tataaatggt 7920 tcgctaacat cgctttttta cgtcgcgcta atatcgtgtt ttaacatcaa tccaactttc 7980 catttccaac atcgatgcga cgtcgcatac aacatcgcag ctacatcgaa tgtttacatt 8040 gctattacat cgcaattcga cgtcaaaatt acatcggtta acaacatcgc ctaaacatca 8100 aaatcttacg tcgttataac atcgcctatt aacgtcgatc caactttcca tttccaacgt 8160 taatgcgacg tcgcgttcga cgtcgcttaa acgttaatcc aacgtcgctg tgcccactgg 8220 g 8221 // ID hAT-59_HM repbase; DNA; INV; 3261 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-59_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3261 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2047-2047 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 473..2788 FT /product="hAT-59_HM_1p" FT /translation="MNSQLSFPSFSKKIETRTTTDIYLVGSVSPNISGMKL FT PTCRQVLAVFFHLHKIEKQTVRQSARCVIREVFHLWNMARIPTTIERNAIE FT KLENIFYKWAKIKKNMKRASATQTANENVFCEEIDRIFDIAHADAMSLITI FT EEDRLFLIDQRGERKGYMAGVDMALAKTEERVQKRKELEESRRFQEKKRKK FT EAENTSHIDSMHLSDKSTSGTETSDEEFMKPKKSNQNVVKKRSRSQNIISP FT ELASALDRTNVTDRQATYLLAAAAKGLGHDITDFAISRDTIRRARITNRNK FT IATTIKFAFNSSNVPLTVHFDGKILPNITGRDSVDRLAVVVTGYNVDQLLG FT IPKLTRGTGKAISEAVITTLLDWDIQDRVKAISFDTTAANTGTEAGACTLI FT EKKLNRELLHLACRHHMYEIVLSDVFKHSLGQSSGPEIGLFKRFRDKWSLI FT DKNNYETALNDQTCSDILASHTILKQDALFYLKTILESGKQYRDDYKELLE FT LTLLFLGDVPPRGAHVMAPGAIHRGRWMAKILYSMKIWMFKRQFKLTAKEE FT KSLRGISLFASLVYTKFWISCTNPTLAPSNDLTFLKTVFQHKAINNGIFTA FT AATAFSRHLWYLSEEMVALAFFDKEISKETKAKMVLALQNAGEDEPQKRIK FT IDLTSIENLQIESFVTENTLNFFNILELPTTFLTTVDPDKWTENEEFQQAE FT KIVKSLRVVNDTAERGVKLIQDFNCSITKNEEQKQFLLQVVSEHRSKFPLP FT QKSLLIDGTKSSKNDGNKNQ*" XX SQ Sequence 3261 BP; 1155 A; 510 C; 553 G; 1043 T; 0 other; tagggtggtg cacaaatata ttgacaattt caaaagtttc aaaaatcaat gtggtacttt 60 ttcctattgt tccttttatc attatattaa ttcctgcaaa atttaagcaa atttgaatga 120 cttttagagg tcgctcatgg ccatgtttcc tcttttaatt gccattattt tcctattcta 180 acgttactgt gaagttttta atatgtacat ttgtttattt attttattat gacataactc 240 ttaatttgat ttaatttaat taattatgat aattatttta atcgatttta aatatttttt 300 aaatatattt ttctaattta atagtgtgca actatctttg acaatatacc gcaaagcatt 360 ttaatttttg ttcatatgag tgctgatttg aggctttaaa aaaataaatt cattttgtga 420 taacaattgt tttactcagg ttatattttg attttcagaa atatcctgta atatgaattc 480 tcagttatct ttcccttcgt tttcgaaaaa aattgaaact cgaacaacta ctgacattta 540 tcttgttggg tctgtttctc caaatatttc tgggatgaag cttccaacat gtcgtcaagt 600 tcttgcagta ttttttcatc ttcacaaaat agaaaaacaa accgttagac aaagtgcaag 660 gtgtgtcatt cgagaagtat ttcatttgtg gaatatggca agaattccaa caactattga 720 acgtaatgca attgaaaaac ttgaaaatat tttttacaaa tgggctaaaa taaagaaaaa 780 catgaaacga gcttcggcaa cacaaacagc aaatgaaaat gttttttgtg aagaaattga 840 tcgcattttt gatatagcac atgctgatgc catgagtttg attacaattg aagaagacag 900 actatttcta atagatcagc gaggtgaacg taagggatac atggccggag ttgacatggc 960 attggcaaaa acagaagaac gtgtccagaa aagaaaagaa ttagaagagt ccagacgttt 1020 tcaggaaaaa aagcgaaaaa aagaagctga aaatacaagt cacattgatt caatgcacct 1080 atctgacaag tctacatctg gcactgaaac ctcagatgaa gaatttatga aaccaaagaa 1140 atctaatcaa aatgttgtaa aaaagcgatc gcgatctcaa aatatcattt cacctgagtt 1200 agcatctgct cttgaccgaa ccaacgtcac ggatcgccaa gcgacttatt tactggcagc 1260 tgcagcaaaa ggtttaggtc atgacattac agattttgct attagtcgcg atacgatccg 1320 gagagctaga attactaatc gaaacaaaat tgctacaacc ataaagtttg cattcaactc 1380 gtcaaatgtt ccattgactg ttcattttga tggcaaaata ttgccaaata ttactggaag 1440 agattcagtt gatcgattag cagttgttgt tacaggttac aacgttgacc agcttcttgg 1500 cattccaaag ttaacaaggg gtactggtaa agcaataagt gaagctgtta taacaacttt 1560 actagattgg gacattcaag accgagttaa agcaattagt tttgatacaa cggctgccaa 1620 cactggtacc gaagctggag cgtgtacttt aattgaaaaa aagcttaata gagaactatt 1680 acacttggca tgccgtcatc atatgtatga aatcgtcttg agtgatgttt ttaagcattc 1740 tcttgggcaa tccagtggtc ctgaaatagg attatttaaa cgttttcgtg acaagtggtc 1800 attaattgac aaaaataatt atgaaacagc gttaaatgat caaacatgtt cggacatttt 1860 ggcttctcac accatactca aacaggatgc actattttat ttgaaaacaa ttcttgaatc 1920 tggtaaacaa tatcgagatg actacaagga attgttggaa ctaaccttgt tgtttcttgg 1980 tgatgttcct cctcgtgggg ctcatgtaat ggctcccgga gccatacaca gaggaaggtg 2040 gatggccaaa attttatatt caatgaaaat atggatgttc aaaaggcagt tcaaattaac 2100 ggcaaaagaa gaaaaatcac ttcgaggtat tagcttgttt gctagtttgg tgtacaccaa 2160 attttggata tcatgtacca atccaacact tgctccgtcg aatgatctca ctttcttgaa 2220 gactgttttt caacataaag ccattaacaa tggaattttt acagctgcag caacagcatt 2280 ttcgaggcat ttatggtatc taagtgaaga aatggtagca ttggcttttt ttgataagga 2340 gatcagcaag gaaacaaaag caaaaatggt tttggctcta caaaatgctg gagaggatga 2400 accacagaaa agaataaaaa tcgatttaac ttcaattgaa aatctgcaaa tagaaagttt 2460 tgtaacagag aataccttaa attttttcaa cattttggaa ttaccaacta cttttttgac 2520 aactgttgat cctgacaagt ggaccgaaaa tgaagagttt caacaggcag aaaaaattgt 2580 aaaaagtcta cgagttgtta atgatacggc tgaaagagga gtaaaactaa ttcaggattt 2640 taattgttca attaccaaaa atgaagaaca aaagcaattc ttgttgcagg tagtctctga 2700 acacaggtca aagtttcctc tgccacagaa atctttgttg atcgatggaa ccaagtcatc 2760 aaaaaacgat ggaaacaaaa accaatgaat tttttttcat agattacaat aattttttgc 2820 aaagaagctg aaatataaat gaaacctaaa actctaacaa tttttaagat tttttggaac 2880 tattattgtt ttttaaattt gtctttacta tagtgaacac aaatacgtta ttataaacca 2940 gaagtttatt actttattct tcaattattt tatttgtaaa caataatata tataattcag 3000 tttaaaaaat actgcatggt tttggattaa tatacagtaa gagttaaaat gcaatgttgt 3060 atcataattt attcaaaatt tgggattgca cactaaaaga gaactcaaac ccaaaaagta 3120 aaaaacttga tgttgatgag caacctttaa aaattattca atcatgccca aacttcacga 3180 ttgttattcc tacacataat gtcacataaa gaaatggtac catcaaagaa attaaaaaaa 3240 aatttttttt gcaccaccct a 3261 // ID DNA8-67_AP repbase; DNA; INV; 382 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-67_AP. XX NM DNA8-67_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-382 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2002-2002 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 382 BP; 151 A; 45 C; 42 G; 143 T; 1 other; tagtgatggg caacaatatc gataattcga tacgatatcg atatcgtgac aaattttcga 60 tatcgatatt atatcctata tcgggaatta ttgtaacaag ttttttttct tatcttcaat 120 tagcaatgta aagcaaattt ttgaattaat aaataataat tattaattca atatttatat 180 ttatttattt atattaatta ttaattttaa taagtaataa ttaataacta ataactaata 240 agtactaagt aggtaatatt taaaatantt taaaaatatt ttcgatacga tatcgatatc 300 gtacaactta cgatatcgcg atacaatatc ggaaaaatcg gaacgatatc gatacacaaa 360 attatatcgt tgcccatcac ta 382 // ID DNA8-64_AP repbase; DNA; INV; 622 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-64_AP. XX NM DNA8-64_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-622 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1999-1999 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 622 BP; 162 A; 84 C; 86 G; 290 T; 0 other; tagggctcgg aagttgatgc attttgcatt tttttttaga actttttaga cgatgctctc 60 ctggccacaa atgtgtttca ttagcatgtg aatctgaata actaaatttt attttgcatt 120 tttttgcatt ttttggtgtt gattactttt taagtcattt tttgacattt ttaggtcatt 180 ttcccacatt tttagtcatt tttactgatt tcacgctagt tcacttctta tcgtttcttg 240 tcgaccatta tgccgataac ttaaaacttg gaaattacca cggactaagt tagtacctat 300 ccgattttat ctcgccgatt acatttgtcg ttgtcgtgtt gtagtgaata ttattatacc 360 tacattttat tattttattt cattttttaa ggacattttt gtcatatttt gagcaattcg 420 aagctttgat aaatttatat agaattttat agtaaaatag agtaaaatat ttttaaaaaa 480 atgtattttt attagtttaa aactaatatt tgcaattttt taggtcattt tttaaggaca 540 ttttttgtca tttttatgtc atatttgcat tttttttagg tcatttttta ttgtttttaa 600 ggtcatcaac ttccgagccc ta 622 // ID Harbinger-N12_BF repbase; DNA; INV; 837 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-N12_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-N12_BF; Harbinger-3_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-837 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-837 RA Kapitonov V. and Jurka J.; RT "Harbinger-N12_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 804-804 (2008). XX DR [2] (Consensus) XX CC It contains 31-bp TIRs and is flanked by TWA TSDs. It is a CC non-autonomous derivate of Harbinger-3_BF. XX SQ Sequence 837 BP; 198 A; 237 C; 215 G; 187 T; 0 other; ggcgcgatct acacacagtt tttcccgata tcggcgagcc tactacctct cgccgacagc 60 tttttttcag cgtctagatg caactactat atcgccatta ctatatcggg aaaatttctc 120 ccgaatggtc cggaccattc gtaagccatc taacgatacc ttaccgatac cttagcaggg 180 gtccccgaca tgttccgcgc gcgtgtagat gctccgcgaa ttcgcgaagc tcattagcat 240 ataactacag ttaacgcggg ctcattagca tataacgcgg gaacagcggc atattagcgc 300 gggagagcaa gatggcggag ggaaaagctg ggcgctcccg ttgcacatgg agccatgcag 360 aaaccgcctt tcttataagt gtatggtcgt cggaggagat cccaaaagga cgcgggattc 420 cttggtacac cccggtgtac aggctctcca ccctccgcca caatcagggc agcgagaaag 480 aacgctgcgt tcactctagc cggccgctgt cgtctctccc tgtatctcaa tcgcctcctg 540 cgccctaccc ttgcatggcg tctcatgttt aagatcacgc gcatgagata cagaaagaac 600 attacctgat tgataacgag tggaaaagct accggcattc ccgccatttt gaaatccgcg 660 ggatgtgtca ccagaggtca aaaggggtca cgaatgttcc gcgccgcggg tgttccgcgc 720 tacgagcgga tatcccgcgc taccatttgc aaaatgtgtt tagatgcgcc ccgatatcgg 780 ccccgccgat atcggcggat gcggctatat cgggaaaaaa tgtgtctaga tcgggcc 837 // ID I_Ele13 repbase; DNA; INV; 6839 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele13. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6839 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6839 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >96% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 302..1999 FT /product="I_Ele13_1p" FT /translation="MMPGSGSAPHRGDPGGPGGGGNSPRNMFQGTYAGQRL FT PTYMDKEGIAGSLQVLKMQAVGENTSLPNDPFLLRLSVEKCVGGPIDGAFK FT ENRGISYALKVRSNVQFQKLLKMSRLIDGTEVMITEHPQLNTTKCVVSNYD FT SVGLSDAYLKEQLASQGVRDIRRIQKRNSAGVPENTPTMILSIVGTVIPEH FT IDFGWTRCKTRSFYPSPMLCYHCWEYGHTRKRCQETHQICGTCSQVHPEDN FT ILNGSNSRNAEDDEFSNKKPHCTNPPFCKNCKSNGHAVSSRKCPVYSKETA FT IQHIRIDMGYSYPQARREYEAQQGATGNSGSYARIANLSKDKEIADMSDVV FT KKLQDDSKRKDEKIAELEKIVQNRSVGKRLDQVQKNGTIEDLTRRVIQLTE FT AVEQLQKTVQEKDREISRLRQYETLYTHMEPSQSITTVSQSLVPTPDCASI FT PATPAAPYDLRDPRIRSRCSKWIAENNSSENGPAQNTKNNRRKQKHRKQYE FT SNKDRGFSSEESMKSFHSKNSMQTTTSVGTSASNPTKRNHPTTDSDSDSNG FT RTSKSKRSGVDGTDTIEIE" FT CDS 2065..6567 FT /product="I_Ele13_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="TIIFSTQKKPVKKKNRELTEPTSPLSTTNTTRTRRAA FT VKAPDSRGPVSADITPQPELADNLRHPRVILAYGTHKGIQTRSPFSPDTSQ FT DRKAPGGVTGRGTHKSSNTCTPEDSEGTQFNDGHQNRHFHVYSPLHNVEVK FT PVLRRYDLRTRIKPPRNRLDTPSSSKVFITSRLPPLGPEVCNADLYDQLHR FT VRFGASASIKAGPSNLSCVSASRKKFKNVNPLDPPSDSTGTSNTATVSQSR FT FTCSGSVSRSAIVSEKSDCSSPPPENSEPERASGVTSSDCNHLRKTSNIGN FT SEQHTWILQWNMNGFYNNLPDLQKIVVDKDPSIIALQEIHKATTESMDNTL FT GRKYKWITKINRNIYHSVGIGIAAELPFSNITLDTNLPIIAVRLSWPFPVT FT VVSAYIPNQNIPDLENQVKQLIGKLPEPIILLGDSNAHHQAWGSRHTDARG FT AILLEIAGQNGLTILNDGSPTFMRGQHESAIDISLASMTIMNRISWSAECD FT PMGSDHVPITILLNTTPPNTSRRPRWIYDKADWSGFQSSINSSFKNSEPKS FT ITEFTEAIHEAAKTNIPKTSPNPGRRALRWWSEDIKKAVKARRKALRAAKR FT LPAEHPDKEKAVLKLRTARNECRQIVREAKERSWAEFLDSINDSQSSAELW FT GKVNAIQGKKTKNGIALKIKNQTVRDPAIIADALADHFFNLSSLQRYPEAF FT LKRHSGAKTAIADFVIPSGTDESYNQTFSIQELQFVLQKCCGKSSGPDEIG FT YHMIKNLPMHCKAILLEMINKEWTSRTFPESWRHSLIVPIPKSTGPSSDPR FT NYRPIALTSCMSKIMERMVNRRLMEYLDINGKLDYRQHAFRSGLGTGTYFA FT TLGQTLDDAISCGEHIEMASLDLAKAYNRAWTPSVLEKLAQWGVTGNLLAF FT LKNFLTNRTFQVIIGNHRSKPVAEETGVPQGSVIAVTLFLVAMNGVFDVLP FT KGIFIYVYADDILLVVTGKHPKAIRRKLQAAVNAVVKWTDLVGFDISPEKC FT VRMHICAQKHRPPQKPITVKGKPIPTKKFAKILGVTFDRNLSFRPHFEAIK FT KSCANRMNLLKILSNKRTRSDRRSRLRIADAIICSRLLYGVDITCRASDEL FT VSILSPTYHNAIRTISGLLPSTPAKAACVEAGVMPFIYKVATTIGTKAVSF FT LERTKGDGSPAFLATEADRILNSVAGVTLPSVAELHRIGPRSWRASEPLVD FT NFIKNRLKKGSNPTIAKSLFNERITTKYANTEIMYTDGSKLSNKVGIGIAG FT KHIEDHFSLPDSVSVFSAEAAAIMQAVSHRSTKPKLIVTDSASCILSIRSQ FT TSRHPWIQVTQSLLETRGTPKVSFMWVPGHCDIPGNESADRLADLGRTSRR FT LTSEIPGADVKSWLKTAVKHAWSKDWWSDRTLFIRKIKGITTPWKDRIDRK FT EQIILSRLRTGHTKVSHNMGGGRNFRIICDNCGTANTVEHMLCVCPALESY FT RTQYNLGGIDTILSNDLAMETALIHFLKDTRLFTAI" XX SQ Sequence 6839 BP; 2178 A; 1648 C; 1473 G; 1540 T; 0 other; cagttgccag cttgctttga tagcagtcgg tcgcattttt caatcgctcc gtatgagtag 60 gtagtaggta gtgcgcagcc tattgtgttc acttagctac acattttcga gtgcggttct 120 gctcggtcaa cgagctcggt tagagtgcaa gtgtccttga attcaatact cgttcgaaca 180 aaagtgcagc aagtgtagtg cgaatagtgc gaaaagaata accataataa aaagaacgac 240 ccgtgttcca acgccagttt ttttggaaag cattacttag acaccatccc actccgccgg 300 tatgatgccg gggagtgggt ccgctcccca cagaggggac ccagggggcc ctggtggagg 360 agggaatagt ccgaggaaca tgtttcaagg aacttatgcc ggacaacgct tgcccaccta 420 catggacaag gagggtatcg caggttcact ccaagtcctc aaaatgcagg cagtagggga 480 aaatacctcc ctccctaatg accctttcct attacggctc tcggttgaaa aatgtgtcgg 540 aggacccatc gatggggcct ttaaggagaa tcgcggcatt tcttacgctt tgaaggtgcg 600 cagcaacgtg cagttccaga aactgctaaa aatgagtcgg cttatcgacg gaaccgaagt 660 gatgattacc gaacatccgc aactcaacac cacaaaatgt gttgttagta attatgattc 720 ggtcggactt tcggatgctt acctgaagga acagctagca agtcaagggg tacgagacat 780 ccgtagaata caaaaaagga attcggccgg agtgccagaa aacacaccca ccatgatttt 840 gtccattgtc ggtacggtca tccctgagca cattgatttt ggttggaccc ggtgcaaaac 900 caggagtttc tacccctcac ctatgctctg ctaccactgt tgggagtatg gacacacaag 960 aaagcgatgc caggaaaccc accaaatttg tggcacatgc tcacaagtgc atcctgagga 1020 caacatcctt aatggatcca acagtcgaaa cgccgaagat gatgaattct ccaacaagaa 1080 accacattgt acgaaccccc cattctgtaa gaactgcaaa tctaacggac atgccgtttc 1140 aagtcgaaag tgtcctgttt acagcaaaga gactgccatt cagcatatcc gaatcgatat 1200 gggctactct tacccccagg ccagaaggga gtacgaggcc caacaaggtg ccaccggaaa 1260 cagtggatca tacgctagaa ttgccaatct gagcaaggat aaagagattg ctgatatgtc 1320 cgacgttgtt aagaaacttc aggatgactc gaaacggaaa gacgaaaaga tcgcggaact 1380 ggaaaagatt gttcaaaacc gtagtgtcgg caagaggttg gaccaggttc aaaagaacgg 1440 tacaatagag gatttaactc gaagggttat tcagctaacc gaagcagtgg aacagctaca 1500 aaaaactgtt caggaaaaag accgtgaaat ttcacgcctg cgccagtacg aaacgctcta 1560 cacccatatg gaaccaagtc agtctataac aacggtcagt cagtctttag tacccactcc 1620 tgattgcgct tcaataccgg caacaccagc ggcaccctac gatctcaggg atcctagaat 1680 caggtcgcga tgctcaaaat ggatcgctga aaataacagc agcgaaaatg gcccagcgca 1740 aaacaccaag aacaaccgta ggaaacagaa acaccgaaag caatacgaat caaacaaaga 1800 cagaggattc tcttctgaag aaagtatgaa gagcttccac tccaagaact caatgcaaac 1860 tacgacctca gtaggaacaa gcgcctctaa cccaaccaaa agaaatcacc caaccacaga 1920 ctccgacagc gactccaacg gccgtacttc gaaatctaag cgaagcggcg ttgatggtac 1980 ggatacaatc gaaatcgaat aaacatcgct atccgttcag tcctctcagc tgaagctaag 2040 cacccgtaaa ctccttacta ctaaactatt atcttcagta cccaaaagaa gcccgtgaag 2100 aagaagaatc gagagttgac tgaaccaaca tctcccctat caactaccaa cacaacaaga 2160 acaagaagag cagcagtgaa agctccggat agtcggggcc ccgtcagtgc ggacattaca 2220 ccccaaccgg aactggcgga caacctccgg catccgagag tgattcttgc atatgggacg 2280 cacaagggaa tacaaacccg ttccccattt tccccagata cttctcagga cagaaaagca 2340 cctggcggag taactggaag agggacgcac aaaagtagca atacttgtac ccctgaggac 2400 agcgagggaa cccaattcaa cgatggacac caaaatagac atttccacgt ctattctcca 2460 ttgcacaacg tggaagtcaa gccagttctg agaagatacg atctccgaac acgcatcaaa 2520 ccaccgcgaa acaggttaga tactccttcc tcttcaaaag tgtttataac atctcgatta 2580 ccaccacttg gcccggaggt ttgcaacgcc gatctctatg atcaacttca tcgtgtccgc 2640 tttggagctt cagcttcgat aaaagcggga ccatcaaacc tatcttgcgt gtcagcttct 2700 cgaaaaaaat ttaaaaatgt taatcccttg gatcctcctt ccgattccac cggcacatct 2760 aataccgcta ccgtaagtca atctaggttt acctgcagtg ggtctgtttc aagatctgct 2820 atcgtctccg aaaagtcaga ttgttcatct ccaccgccag aaaacagcga accggaacgc 2880 gcttccggtg taacttcatc agattgtaat cacttgagga aaacaagtaa catcggcaac 2940 agtgagcagc acacatggat ccttcagtgg aatatgaatg ggttttataa caatctgcct 3000 gatctacaaa agattgttgt tgacaaggat ccctctatca tcgcattaca ggaaattcat 3060 aaggcaacaa cagagagcat ggataataca cttgggagaa agtacaaatg gattactaaa 3120 ataaaccgaa atatctatca ttccgtaggc attggcattg cagccgaact tccattttca 3180 aatataacct tggacacaaa ccttccaatt atagcggtac gattgtcttg gcctttcccg 3240 gtcacagtag tttcagcgta catcccaaat caaaatatac ctgatcttga aaatcaagtt 3300 aagcaactaa taggaaagtt gccagaacca ataatcctac taggagatag caatgcacac 3360 caccaggcgt ggggaagccg tcacaccgat gctcgtggtg ctatattgct cgaaatagct 3420 ggccaaaatg gactgaccat actcaacgat gggtcaccta cgtttatgag aggccaacat 3480 gaatcagcaa tagatatctc tcttgcctcc atgaccataa tgaatcggat aagctggtcg 3540 gcagaatgtg atccgatggg aagtgaccac gtcccaataa ccattctcct taacacgact 3600 cccccaaata catctcgacg accacgatgg atctatgaca aagctgactg gtcaggattc 3660 caatcatcaa tcaacagttc attcaaaaat tccgaaccaa aatcgataac ggaattcact 3720 gaagcaattc atgaggccgc aaaaactaac ataccgaaga ccagccctaa ccctgggcgc 3780 cgggcccttc gttggtggtc cgaggacatc aagaaggctg ttaaagcacg acgcaaggca 3840 ctcagagccg ccaaacgact tccagcagaa caccccgaca aggaaaaagc agttttgaag 3900 cttcgcaccg ctcgaaacga atgtcgacaa atcgtacggg aagccaaaga acgctcttgg 3960 gcagaatttc tagacagcat aaacgattcc caatcttccg ccgagctttg gggaaaagta 4020 aacgctatcc agggtaaaaa gacaaagaat ggtatagctc tcaaaataaa aaaccaaacc 4080 gttagagatc cagccataat agctgacgca ctagctgatc atttcttcaa tttatcatca 4140 cttcaacggt acccagaagc atttctaaag cgccactcag gagcaaagac agccattgca 4200 gattttgtta ttccctctgg taccgacgaa agctacaacc aaacgttctc tatacaagaa 4260 cttcaatttg ttctacaaaa atgctgtgga aaatcttcag ggccggatga aattggctac 4320 cacatgatca aaaaccttcc aatgcactgc aaagcaatct tactcgagat gatcaacaag 4380 gaatggacat ccagaacttt tccagaaagc tggcgacata gcctaatagt gccaatccca 4440 aagagtactg gaccatctag cgatccacga aattatcgac caattgctct aacaagctgc 4500 atgtccaaaa tcatggagcg gatggttaat aggagactga tggagtattt agatataaat 4560 gggaaactag actaccgaca gcatgcattc cgatctggcc ttggcactgg aacatacttt 4620 gcgactctag gtcaaaccct cgatgatgcc atttcgtgcg gtgagcacat cgaaatggct 4680 tctcttgact tggccaaagc ttataacaga gcatggactc ctagtgtgct tgagaaattg 4740 gcacagtggg gagtaacggg aaatctgctg gccttcctaa aaaacttctt gacaaataga 4800 acatttcagg tgattatagg aaatcatcgc tccaaaccgg ttgcagaaga aacgggggtt 4860 ccacagggat cagtgatagc agtcacttta ttcctagtgg ctatgaatgg ggtgttcgac 4920 gttctgccta agggtatatt tatttatgtg tatgctgacg acatacttct ggtcgtcacc 4980 ggaaagcacc caaaggctat tagaagaaag ctgcaagccg cagtcaacgc agtggtgaaa 5040 tggactgatc ttgttggatt tgatatctct ccagagaaat gtgttagaat gcatatatgt 5100 gcccaaaaac atcgaccacc ccaaaaacct atcactgtta aaggaaaacc aatacccaca 5160 aagaagtttg ctaagatact tggtgtcaca tttgaccgaa atttatcctt tcgaccccac 5220 tttgaagcaa tcaaaaaaag ctgtgcaaat cgtatgaacc tgttgaagat cctatcaaat 5280 aaacgaacga gaagcgatag aagatcacga ctgagaattg cagatgcgat catctgtagc 5340 cgtcttctat acggggttga tatcacttgt agagcttctg atgagcttgt aagcatccta 5400 agtcccacgt accataacgc aataagaacc atctccggct tattaccatc gacacctgct 5460 aaagctgctt gcgttgaagc cggagttatg ccgtttatct ataaggtagc aactacaatc 5520 ggtaccaaag ccgttagttt tcttgagcgc accaaaggtg acggatcgcc ggctttcctt 5580 gcgacagaag ccgaccgaat cctaaattca gtggctggag tcacgctccc ctcggtggct 5640 gagctccacc gtatcggacc taggagctgg cgagccagcg agccattggt agacaatttc 5700 atcaaaaata gacttaagaa gggatctaat ccgaccatag ccaaatcact attcaatgag 5760 cgtatcacaa caaaatacgc caacacagaa attatgtaca cggacggatc caaactatcc 5820 aataaagtcg gaatcggcat tgcgggaaag cacatagagg atcattttag cctacctgac 5880 tcggtgtcgg ttttctcggc tgaagcggcc gcaataatgc aagctgtatc tcatcgatca 5940 acgaaaccca aactaattgt tacggactcc gccagttgta tcctatccat tagatcccaa 6000 acatcaagac atccatggat tcaagtcaca caaagtttgc tagaaactag aggtacccca 6060 aaagtaagtt ttatgtgggt tcctggccac tgtgacatcc caggaaatga gtctgcagat 6120 cgactggcag accttggtcg aacaagccgc cgactcactt cagagattcc cggagctgat 6180 gtgaaatcct ggctaaagac agcagtgaag catgcatggt ctaaggactg gtggagtgac 6240 cgaacattat tcattcggaa aataaaagga atcacaaccc cctggaaaga cagaatcgac 6300 cgaaaggagc aaataatcct gtctcgtctt cgaacgggac atacgaaggt ctcccataat 6360 atgggaggag gtcgaaattt ccgtatcata tgtgataatt gtgggacagc caacacagtg 6420 gaacatatgt tgtgtgtttg tccggcatta gaatcatacc gaacgcaata caatcttgga 6480 ggcatcgaca caatactgag caacgatttg gcgatggaaa cagccttgat acactttttg 6540 aaggatacaa ggcttttcac tgcaatctaa ctcaataatg cagatagaaa aacggacttt 6600 gatgaagact acgagactac aacagacaac gagcacctga tatttacata aatactaaac 6660 tctgataaat taagtatgag agtagtcttc agactatttt ttttttcttt tcaaaaaact 6720 ctgtaaaaaa ctctgtttgc aacttttgtg atggatcctt aagacccatc tataatcaga 6780 gacgaaccgg ccttgggccg aaagtctctt taataaagaa ataaaaaaaa aaaaaaaaa 6839 // ID Gypsy-58_AA-I repbase; DNA; INV; 4563 BP. XX AC supercont1.29; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-58_AA_; KW Gypsy-58_AA-LTR; Gypsy-58_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4563 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.29; Positions 1491546 1486984. XX CC 'AGTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1246..4431 FT /product="Gypsy-58_AA-I_1p" FT /translation="MAEVDLKFFNKWKPVLCDVDTGANTSLIGYNSLIELS FT GDGDPSLLPSSFRLQSFGGNPIKVLGQVKVPCRRLGKNFRLVLQVVDVDHR FT PLLSARASRELGLVKFCEAVSFTDANEPPSTSLSTEELLNIYRVEAQKSIE FT EHPNLFTGYGKFPGTVTLEIDSSIPSCIQPPRRVPIAMRGKLKQELEALER FT DGIIVKEPNNTDWVSNLVIVQRSGAGSGIRICIDPVHLNKALKRPHLQFTT FT LDEILPELGKAKVFSTVDTKKGFWHVVLDKPSSKLTTFWTPFGRYRWTRLP FT FGIASAPEIFQLKLQQLIQGLEGVECLADDLLIYGVGDTLEEALINHNACL FT RKLLQRLDKHNVKLNHSKLKLCQTSVKFYGHVLTDHGLQPDEEKISAIRNF FT ATPCNRKEVHRFVGMVNYLSRYIPNLSANMTNLRKLISETVAWRWTVSEEK FT EFQTVKSLVSDISTLKYYDVSKPLVMECDASCSGLGVAIFQEDGVVGYASR FT TLTATEKNYAQIEKELLAIVFGCIRFDQLIVGNPKVTVRTDHKPLITIFSK FT PLLSAPRRLQHMLLSLQRYNLSIEFVTGKDNVVADALSRAPMRSSDPVEKF FT EKLNVYRVFEEVSEMKLTNFLSIADARLTEIMNETAQDRTMQTIISYIQEG FT WPKTVDRVPDNAKIFFNYRNELSTQEGLLFRNDRLVIPYVLRRKLVDCCHI FT SHNGVEATLKLARANLFWPGMSAQIKDVVKECKICAKFSASQPNPPMQSHK FT IPVYPFQMISMDVFFADYQGSKRKFLITVDHYSDFFEVNILKDLTPESVIA FT ACKINFARHGIPQLLLTDNGTNFVNQKMAKFAGEWNFEQITSSPYHQQANG FT KSEAAVKIAKRLMKKSEESGSDFWYALLHWRNIPNKIGSSPAARLFSRSTR FT CGVPTSAVNLQPRIVENVPSAIEDNRKKIKFHYDRKTRSLPELETGSPVYV FT QIHPETTKLWTPGIVSNRLNERSYLVNVDGTEYRRNVIHLKPRKEPESEDF FT PPSTIASENQPQQTTTRPTEETECQITPHCDGKGGEASCETPTWKNKTTDS FT SVLHR" XX SQ Sequence 4563 BP; 1363 A; 962 C; 1101 G; 1137 T; 0 other; tggtgtcaga agccatcgtg agcttcagat cggttttacg gcaaacgtag tgaaaatcgt 60 gagaaaatcg tacgaaaaca actgttcgga aagtcgcgtc ggatcggcgg ccattttgtg 120 tcgattggtg aaaaagcgtg gtgatcctca tcaaaatgga tgccaaccaa ttcaagatgt 180 tcatggaaca tcaaactaac attattcgtc aaatgttaca agggatacag agttccgtat 240 caggagctgt gcagcagcag caaatacagc aaccgagtgc ctctgccgta caggtgccgc 300 agccgtcgcc attgtctctg gatggcgata tggaggaaaa tttcgaattt ttcgagaaga 360 gctggaatga ctactgcaga gctatcggaa tggatcgatg gcctcaggcg gaaaatccac 420 agaaagtcag cttcttactg actttgatag gagaagcagc gagaaaaaaa tatttcaatt 480 tcgagcttac cgaagcggag aaggcaaatc cgcaagcagc tcttgctgcc atcaaatcca 540 aagtagtgac cacgcgaaac gtgatagtgg atcgtttgga cttcttttcg gcaatccaaa 600 cgtctcgtga gccaatcgat gattttgtta cgcgtctcaa ggtcatggca aaacctgcga 660 aactagaaaa tctggaaacc gagctgattg cgtacaaggt ggttaccgca aacaagtggc 720 agcatttgcg aaccaagatg ttaacaatga ccgacatcac gttgtcaaag gctgtggatc 780 tgtgccgtgc cgaggatttt tttttttttt taaatctttt tttgagtgta ttttaacgca 840 cgagggctag ttctacacta ccgccgtgcc gaggagataa aagccaggcg gtcacaggaa 900 cttggtacat cagcatctgg atcggaagtg aataaagtga gtaagaagaa gataagtgtt 960 aaatcccagc ggtgcaaatt ttgtggtgat taccatgagt ttgttaaagg agagtgtcct 1020 gcctttggca agaagtgtca tcgttgcaaa ggcaagaatc attttgaaaa agtttgccga 1080 gctggtggga aatcgaaggg ccgcaaatct ggccggcgaa tcaaggaagt gaaagaagaa 1140 agttctacgg aagaagaatc gtctgatacc gaatcatcat caagtgaaag tgaagaatat 1200 gaaatcggta aaatttatga ctattcgagt cacggtggaa acgttatggc cgaagtggac 1260 ctaaaatttt ttaacaagtg gaagcctgtt ctatgcgatg ttgacactgg cgcgaataca 1320 agtttgatcg ggtacaatag tctgattgag ctcagtggcg atggtgatcc atctctgctt 1380 ccatcgtctt tccgattgca aagcttcgga gggaatccca taaaagtgtt gggtcaagta 1440 aaagtgccat gccgacgatt ggggaaaaac ttccgcctag tgttgcaggt ggttgacgtt 1500 gatcatcgtc cattgttatc cgccagagcg tctcgtgaac tgggtttagt gaagttttgt 1560 gaagctgtga gtttcaccga tgcgaatgaa ccaccgtcca cgtcgttgtc gacagaagag 1620 ctattgaaca tctatcgagt tgaagcccag aagagcatcg aagaacatcc gaatctgttc 1680 acgggttatg gcaaatttcc gggcacagta acactggaaa tcgacagcag tataccatca 1740 tgtatccaac caccgagaag agtacccata gccatgcgag gtaagctaaa acaagaattg 1800 gaagcactgg aaagggacgg aatcattgtt aaagaaccca ataacaccga ttgggtaagt 1860 aatttggtca tcgtacagcg ttctggtgct gggtctggta tccgcatttg tatagatcca 1920 gtacatctca ataaagcatt gaaacgaccg catctccaat tcacgactct ggacgagata 1980 cttccggagt tgggcaaggc caaagttttt tcgacagtag ataccaaaaa agggttctgg 2040 cacgtcgtcc tggacaaacc aagcagtaaa ttgaccactt tctggacgcc atttggaaga 2100 taccgatgga cgcgtctgcc gttcggtatc gcttcagctc ccgaaatctt ccagttgaag 2160 ttacaacagc tgatccaagg tttggaaggc gtggagtgtt tggccgacga tttgctcatt 2220 tacggtgtgg gagacactct tgaagaggct ctaatcaatc acaacgcttg tcttcggaaa 2280 ctccttcaac gtttggataa acacaatgtc aagttaaatc actccaagtt gaaactttgt 2340 caaacttccg ttaaatttta tggtcatgtg ttgactgatc atggcctgca accagatgag 2400 gaaaagattt ctgccatcag gaatttcgct actccgtgca acagaaagga ggttcatcgt 2460 tttgtaggga tggtcaatta cctgagccgc tacattccca acctgagtgc caatatgacc 2520 aatctacgta agctcatttc tgaaacagtt gcatggcgat ggacggtctc tgaagagaaa 2580 gagtttcaaa ccgtgaaatc gttagtgtcc gacatcagca cgttgaagta ttatgatgtg 2640 tctaagccgt tggtgatgga gtgcgatgca agttgttccg gcttgggcgt tgcaattttt 2700 caagaagatg gtgttgtcgg atacgcgtca cgaacattga cggccaccga aaaaaactat 2760 gcccaaatag agaaggagct cctcgcaatt gtatttggct gtatccgatt cgaccagttg 2820 attgttggaa atccgaaagt cacagtaagg acggatcaca aaccgttgat caccattttc 2880 agcaagccac tgctgtcggc tccacggcgc cttcagcata tgctcctaag tttgcagcgc 2940 tacaatttgt ccattgagtt cgttaccggt aaggataacg tggtcgcgga tgcattgtcg 3000 cgtgctccga tgagaagcag tgaccctgtg gaaaagttcg aaaaactgaa tgtttatcga 3060 gtattcgaag aagtatcaga aatgaaactc accaattttc tgagtattgc tgatgctagg 3120 cttacggaga tcatgaacga aacagcgcaa gatcgaacaa tgcaaaccat cattagttat 3180 attcaagagg gctggcccaa aacagtcgat cgggtaccgg ataatgcgaa aattttcttc 3240 aattaccgaa atgagctctc aacgcaagaa ggactgctgt ttagaaatga ccgtcttgtg 3300 attccatacg tgttacgaag aaagctggtc gattgttgtc acatcagtca taatggtgta 3360 gaagcaacac tgaagctggc aagagctaac ttattctggc cgggaatgag tgcacaaatc 3420 aaggacgtcg tcaaggagtg taaaatttgc gccaaattct ctgcatcaca accaaatcct 3480 cccatgcaga gtcataagat tccagtgtac cctttccaga tgatttcgat ggatgtattt 3540 tttgccgatt atcaaggatc aaagagaaaa ttccttatta cagttgatca ttactcggac 3600 ttcttcgaag tcaacatact gaaggatctg acgccggagt ctgtgattgc tgcttgtaaa 3660 attaatttcg ctcgccatgg aattccacaa cttctgctga cggataacgg tactaatttc 3720 gtgaaccaaa agatggctaa gtttgccggc gaatggaact ttgagcagat tacgtcgtca 3780 ccatatcacc aacaggctaa cgggaagtcg gaagcggcgg tgaagatcgc caaaagactg 3840 atgaagaagt ctgaggagag cggttccgat ttttggtatg ctctattgca ctggcgcaat 3900 ataccaaaca aaattggatc aagcccagcc gctcgcttgt tctcaagatc aacccgttgt 3960 ggtgtgccca ctagcgctgt aaatctgcaa cccagaatag ttgagaacgt cccatctgca 4020 atcgaagata accgtaagaa aattaagttt cactacgaca ggaaaactag atcactacct 4080 gaactagaaa ccggatcgcc tgtgtacgtt caaattcatc cggaaacaac gaagctttgg 4140 acacccggaa tcgtttccaa taggctgaac gaaaggtcat atttggtcaa cgttgatgga 4200 accgagtacc gacgaaatgt aattcacttg aagccacgta aggaacccga atcagaggat 4260 tttccgccct cgacgatagc aagcgaaaat caaccgcaac aaacaacaac acgtccaacg 4320 gaggaaaccg aatgtcaaat cacaccccac tgcgatggaa aaggaggaga agcatcttgc 4380 gagacaccta catggaaaaa caaaactacc gattcatcgg ttctacatcg gtaaatagtg 4440 aaccggttca gtttcacttt cacacgaaaa gcaatcgtcc taaacgagag cacaaactac 4500 ccgacaagct gaaggatttt tatcttaagt gagaatattt tatttttttt aaagtgagga 4560 gga 4563 // ID Gypsy-6_OD-I repbase; DNA; INV; 6224 BP. XX AC CABV01000158; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_OD_; KW Gypsy-6_OD-LTR; Gypsy-6_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-6224 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000158; Positions 58172 51949. XX CC Positions [4570-5046] - Integrase core CC 'AGGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 570..1838 FT /product="Gypsy-6_OD-I_3p" FT /translation="MAYVLDLPDNRVKRCSCRTTLNDYLIDLAEYSMSSGT FT EDTDLQTRIKTRFGLLRECIAKLQSSGSGNNLPVIDRARTLEIQNCLAHEA FT VHDFTTGDDAELFISAIRNAHADQVLTLPKAQQSAGDEILCRAAKKCLSTD FT VKSAIQRLNIETVSTSQLIAVIEENFASKLSIFQELRKPHDVQYDSAKGLG FT PYFNELQEASRHSFRQVERLFRKEAAKDAGKEAGATVKPEPVPEVQEVNSP FT PTQPPVLPAPTEIVKASELNDLYAATLGYLTVCRVYPKIAIQMTGDIDECR FT TAQHVFQRAKSLLDRLPKSITSEESAALAGRTYDKKQAPKSDSANMEKSIK FT SLADQISKLVNVVAPSKDSAKDSSKDSNKDSKDNNGKRGKGGNKGNRGRSN FT NSKAGAAATSANDHSQSQNAADALFRTED" FT CDS 3847..5967 FT /product="Gypsy-6_OD-I_1p" FT /translation="MDELSAYRFVVEYVPGEQNVWADMFSRPFGLKKAAAT FT GTDEPAGKFVEFSDSLKAYIPSWCTKATVSEPTGPSPKLLTEGIARALVCS FT TAGDFQLSQTADELLQLASSQRADSSLGKLIEALEKFKRDPKSIKLDKNDT FT HYGIYLRNWSYFRICQRTNVLLRVFEGKRKQVLPTSLAIEYIRRAHDDSAH FT PGAPRVELALSSYWWPDMSEDILNYVKSCSICAKKKGNQGQPAHPEIGHVP FT RGIKPFETIMIDFTVFPTCSGKRFCCTILDSFTRYFVARACPGERAIDAAR FT TIVEEIVLRHQVVPKIVSSDRGTAFVSATMKELYSQLGISAQYHCAYRPES FT TGNLERAHRTLKDMIFMLTHERNMTWFEALPFCVTFMNSHRNSSTKQVPHT FT LISGRRPRLNLPETLHDPDIRADTPESYGKIVQKRIKMLSKLAKVANLSAD FT LELERRSKRHAPARPLKKGDLVHLYRPVTKAAKDSGCNWIGPYRVAKSSER FT VVKICDSTGFADWVHRSHLRYLPERKDHLKTADKALEYEILFGDDQLASPQ FT NRPVDASGSYSEPLAALPRAGRPTANKPDKARDNSRKRKRLRGKTQPAPKR FT SNSASAPALPSNSTQAAPAVNSSSTTSPQESTSLSTTPSSTRNSSVTGKNV FT ELPDETTSASTTADSLRDSLNSTVSTFFDQFTKRSSGRVTKKPDRLTLDPK FT RKNYAKK" XX SQ Sequence 6224 BP; 1478 A; 1938 C; 1423 G; 1385 T; 0 other; tctggtgact agagctgtgt ttcttttcct cgccagactc caccgaacct acagttaagc 60 cctcaaagag ctgtagtagt cgttctccta gaaagacttt aaacccccga tcacgatttt 120 ggtcattccc ggccactttc gtcgagcaac tccgcgcctg ctacctactc ggatttccgg 180 ccgtcttcgt tgatttcagc acccttttcg cgaaaaaagt cgcgataagc gccgctcaaa 240 ctacaaaatt tgagcgctcc agtcaggtac gttttcatct tcgtaaacat tgactgcgct 300 gttttctacg ctcacctgcc gctttccgtg catcctgacg catttttcgc cttcaactca 360 cttctgcccg tgtttaagcc tactcgtgac caaagccgcg aaagcgccac tctcgttttc 420 aagaaacttg agtgctcact caaggtaacc gacatcttca aggtcgatta ctgctttttc 480 acgcgccgca tacgtgtcct gatcgcgatc agcccaacgg tacgcgccgt ttcgctcacc 540 accgagttca ccaagtttga acaaccgtca tggcctacgt tttggatctc cccgacaacc 600 gcgttaagcg ttgctcatgc agaaccacgt tgaacgacta cctgattgac ctcgccgagt 660 actcgatgtc ttcaggcact gaagataccg acttgcaaac ccggatcaaa accagattcg 720 ggcttcttcg cgagtgcatc gccaagctcc agtcctctgg cagcggcaac aatctccccg 780 tcatcgaccg tgctcgcacc ctcgaaatcc agaactgtct cgcccacgaa gccgtccacg 840 acttcaccac cggcgacgac gccgagctgt tcatttctgc aattagaaat gcgcacgctg 900 accaagttct cacactccct aaggcccagc aatccgccgg agacgaaatt ctttgccgcg 960 cagctaagaa gtgcctctcc actgacgtca aatccgcgat tcagcgcttg aacatcgaga 1020 ctgtctcgac atctcaactc atcgctgtca tcgaggagaa cttcgcaagt aagctcagca 1080 ttttccaaga gctgcgtaag cctcacgacg tccagtacga ctctgcaaaa ggcctcgggc 1140 cctatttcaa cgagctccag gaagcttcgc gccacagttt caggcaagtc gagcgccttt 1200 tccgcaagga agccgcgaaa gatgccggta aagaagctgg cgcaaccgtc aagcctgagc 1260 ccgttccaga agtccaagaa gtgaactctc caccgaccca gccacccgtc cttcctgctc 1320 ctactgaaat cgtcaaagct tccgagctga acgatctcta cgctgccacg ctcggatact 1380 taacagtgtg ccgagtttac ccgaaaatcg ccatccagat gactggagat atcgatgaat 1440 gtcggactgc gcagcatgtc ttccagcgag caaaatcgct tctcgatcgt cttccaaaga 1500 gcattacctc tgaggaatcc gctgctctcg ccggcagaac ttacgacaag aagcaagccc 1560 ccaagtctga ctctgcaaac atggagaaat ccatcaaatc gcttgccgac caaatcagca 1620 agctggttaa cgtcgtcgcg ccaagcaagg actccgccaa ggactccagc aaggactcca 1680 acaaggactc caaggacaac aacggcaagc gaggcaaggg gggcaacaaa ggcaacagag 1740 gcaggtccaa caactcgaaa gctggcgctg cggcaacttc cgccaatgac cactcgcaaa 1800 gccaaaacgc cgccgacgcc ctttttcgaa ccgaggatta gacaagcgca agtctgatcc 1860 tcgcttatct tccgcgcaac tcaccttctc ctgttaccgc cctccgtggt gcgccgactt 1920 caccgtcaac ctcgccggac acgtcataac cgtcaaggac ggtatcttcg acaccggcgc 1980 tgatgacgtc gttttaccta aacgcgtcat tcccgcgcac cttctcgagc gtctgcagcc 2040 ctcaaatttc accattactg gtgtaaactc ggagtgcaag gttctcggtg agttctgctc 2100 tcaactcgat ttcagcggaa tttattttcc ctggaatccg cattctagtc accgacgggt 2160 ccgaagctcc accgctaatc ggccgtaccg tcacagatca cgctactaac gtctccttcg 2220 ggcgaaacgg aactcaagtt atattccgcc gccgagacgc gcctggagct gatgtctact 2280 cacagaccct ggacatcaag cagtacgatc gtgatccctg gaatgccgcg caacccgctt 2340 ctgccacctc gaaaaccgcc gcaacgcgtg cttttgccga cgtgaccaga accaccaacc 2400 tctgaagctc cttccgtgca gcttcaaccg cctccaggcc ccaacgcgac tacgtctgag 2460 cttctcaagc ttcctgaaaa gcgagaagga gctcaagctc cccgaaaatc acccgaacaa 2520 gcgcgagctt catgaactcg cacagctttg tgtcaggtac gctgacgttt tcgggagctc 2580 taacgacgaa ttgggtcaat tctacaagcc agtacgaata cccacaaatg gccaatctgc 2640 tcgacggaaa cagcaccaga ttccggccaa ttttcgagag tccgtcgact ccgaagtcaa 2700 gaaaatgctc gattcccgcg taatcgagat ttgcgacgac ccgcgcggct ttaatagccc 2760 actttttgtc gttccgaaga aggacaattc cgcaagggtc gtcgcaaact tcaagggtac 2820 gttgaatcgc gtacttacag atccggaccc cttccacgcg ccaaatttgc gcgaactctt 2880 cgacgagatg cgtcccggca acagctactt agctagcttg gacctgaaat ccggatactg 2940 gcaagtcgag attgatcccc tcgaccgcca caaaacagcc ttttcatggg gaggccggtg 3000 ttttcaatac cgccgtttgc cgttcggctt agcgaccgct ggcaatattt tttcacgctg 3060 cgtgcacgaa gcgcttgaat ctcttcccga cttaaaaggc gtatacgtct acattgacga 3120 cgtcatgctg gctctaccta ccttcactga ctacctggac aagctcgccg ccatcttcgg 3180 cgccgctaga aaattcggac ttcgctttca tcccgcgaag tgctgcctac ttgcccccca 3240 ggtcaagttt cttgggcgaa ttgtaagccc taaggggatg tctgttgacc ccgactacgt 3300 cactggtata gatgcctttg tcccgccgac atcgcgttct gaactgcgca cacttctcgg 3360 tcgcctgacc tggatccgcg agttcatgtg cactcgattg cacgagcgca tcgacatgac 3420 gtgcttctct cagctggttt ttcagctaaa ccagctcaac aaggaaggca ccttcgcctg 3480 gactggcaaa gcgcaagaag ctttcaaccg gtgtgaaaac gcggttacgt tcaagcccgg 3540 tgatctcctt ccccgaccca gcgctcgact ttattctcgt cactgacgcc tcactcgttg 3600 ccgaaggcgc cgttctgatg cagcttcaaa atggccgcga gaaaatagta ggagtcgctt 3660 cgcgcacctt taccagcatc gagcagcgct ggtcggcaac tgagccgcga ggcgcacgga 3720 gtgcttttcg gcatccgccg ctttcagtat ttcctgcgcg ggaagccgtt tgtggttaaa 3780 accgaccatc aagccttgtg ctatatagac acggtcgacc acaagaacgc taaactcgca 3840 agatggatgg acgagctcag cgcctacaga tttgtagtcg agtacgtgcc aggcgagcaa 3900 aacgtctggg ctgacatgtt ctcgcgcccc ttcggactaa aaaaagccgc agctactggc 3960 actgatgagc cagccggcaa attcgtcgaa ttcagcgact ccctcaaggc ctatatacca 4020 agctggtgta caaaggctac tgtttctgag ccgactggac cttctccgaa gctccttacc 4080 gaaggaatcg cacgcgcact cgtctgctcc actgccggcg atttccaact ttcccagacc 4140 gccgatgaac tccttcagct cgcgagctca cagcgcgctg attctagctt gggcaagctg 4200 atcgaagccc ttgaaaagtt caagcgtgat cctaagtcga taaagctcga caaaaatgac 4260 acgcactacg gcatctacct tcgtaactgg tcttatttcc gcatctgcca gcggacaaat 4320 gtacttttgc gcgtcttcga aggcaagcga aaacaagttt tgccgacgtc gcttgcgatc 4380 gaatacattc gcagagcaca cgacgattcc gcccatcccg gcgcaccaag agttgaactt 4440 gctcttagca gctattggtg gcccgacatg tccgaagata tcctcaacta cgtcaagagc 4500 tgctccattt gtgccaaaaa aaaaggcaat caaggtcagc cagctcatcc tgagatcgga 4560 cacgtccccc gcggcatcaa gccgttcgag acaatcatga tcgacttcac cgtattcccg 4620 acttgctctg gaaagcgctt ttgctgcacg attttggact cgttcacgcg ctacttcgtc 4680 gcccgcgcct gcccaggtga acgggctatc gatgctgccc gcacgattgt cgaagaaatt 4740 gttctccgcc accaagtcgt gcccaaaatc gtatcgtccg atcgaggcac cgcgttcgtg 4800 agcgcaacga tgaaagagct gtactcgcag ctcggcattt cagcccagta ccactgcgct 4860 taccggcctg aatcgactgg caatctcgaa agagcgcaca gaaccctaaa ggacatgatt 4920 ttcatgctca cgcacgagcg caacatgacc tggtttgaag ctctcccctt ctgcgtgacc 4980 tttatgaact ctcacaggaa ctcgagcaca aagcaagtgc cgcatactct tatttctgga 5040 cggcggccca ggctcaacct tcctgaaact ctccacgacc cggatatccg tgccgacacg 5100 cctgaatcct acggcaagat tgtgcaaaag cgcatcaaaa tgctctccaa gctcgccaaa 5160 gtcgccaact tatcggccga cctggaactc gaaaggcgtt cgaagcggca cgcaccggct 5220 cgtccactca aaaaaggcga cctggttcac ctgtaccgtc ctgtgaccaa agcggccaag 5280 gattccggct gtaactggat cggcccttac agagtcgcga aatcctctga acgcgtcgtc 5340 aaaatttgcg actcgactgg atttgcagac tgggtccatc gcagccattt gcggtacctc 5400 cctgagcgca aggatcatct aaaaaccgcc gataaggcgc tagaatacga gatcctattc 5460 ggcgatgatc agctcgcctc gcctcagaac cgacctgtgg acgcatccgg ttcatattct 5520 gaaccactag cagctttgcc tagagctggg aggccgacgg ctaataagcc cgataaagct 5580 cgagacaatt ctcgaaagag aaaaaggctg cgtggcaaaa ctcagccagc tccgaagcgt 5640 tcaaattctg cttcagctcc cgctttgcca tcgaattcta ctcaagccgc ccccgctgtg 5700 aattcatctt ctacaacttc accccaggaa tctacttcgc tgtcaacaac tccctcatca 5760 acgcgcaact cgtctgtgac cgggaaaaat gttgaattac ctgatgagac gacttcggct 5820 tcgacaactg cagactctct acgtgatagc ctcaactcga ccgtgtctac cttctttgat 5880 cagtttacaa aacgatcttc tggccgtgta acaaaaaagc ctgatcgctt gactcttgat 5940 ccgaagcgta aaaattacgc gaaaaagtag attttcttcc tgcgtgaact cattttccct 6000 tcttgggaaa agccgcgctc tgattggccg ctccgttctg gaagtggttc gatctcccgc 6060 gcgagcattt cttccggccg cgcactttga cctgcgacgc gcaattccga cgcgcgctac 6120 gtctccgccg ctggactact gtcacttcta ctcgatttcc taccttctgt tcgattggag 6180 attcaagaga acgtctctgg ttccgactct agaatctgag gggc 6224 // ID hAT-47_HM repbase; DNA; INV; 5029 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-47_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5029 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2035-2035 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1809..4820 FT /product="hAT-47_HM_1p" FT /translation="MSKSGAQKRKEKLQTEIEVSAKKCKLITSFLSVQPQC FT QGNIESNVQITSEIIKARGGKDLESNLQSTSGIIQANGSRDIESNLKSTSA FT IILARGGGDIESHSQSTSAIIQALGSRDLESNLQNTSAIIPARGGGDIESN FT SQSTSAIIQALGSRDLESNLQSTSGVIQAKGSRDIESNLKSTSAIIQARGG FT GDIESNSQSTSAFEYPPRLQLKCFFSKHPIQPHEHLPFDNKIYHRQTDLKD FT SIPRNWLSFSQNSQSLYCSFCVAFESPQSNTVSSFVSGFKDFRRVSQRVAE FT HESSSTHRRHVLDYIRIMNNQDFDRFYSTAIKKYNDEVLIRKQILSRVIDI FT IKHLCKQTLPFRGHRNEGAYSLDNDEENHGNFLALVQLLAKYDPLLASHLS FT MCQEKSTKRIEKLKRQGKAGSKGRGALVTFLSKTTISKLLNIMKNMMQEQI FT AMEVSKAKVYSVQVDSTQDISAIDQFSIVVRYVLNATVHERLLSIVPSNDG FT TGQGLFDLLNLTLQRLGFNLKYCLSDSTDGASSYHGQYNGLQQKITEAADH FT HVHIWCYAHVLNLVVKEATSCCIQAVSFFTLLQNVSTFIKVSYKRMATWIK FT LVETQIGMDKMKRLKLIGETRWSGKSNAATAIFGTFAEPSSTVFVNLILCL FT SCISESENFDTKTRHEAKTLLMPFLKFDTILTAFTYLRVFEKVGPLSIYLQ FT TRGLNVLVAFKMVEKAVIELKSQSRLFDDVHNNALQFVRMANASIIQRTET FT ILEVKIELPAKRKRKTPTMLDEHTVDERDESKALEHYKIHTYNVVMDQVVQ FT SLESRFTSHRQLYMDMACFDTSNFGDLAISGIPENSLHSIIKFLPDADVDK FT IKAELLSFVTNYEILKLSLPLTTTLRENAYQKDDIEEPLCYQHTNGACGSC FT PSCVLRILAAYRLNDKTYDNLYHIYKIICTISVTQAECERSFSKLKLIKTR FT LRNSMSNDHLESYMLMSVEKNLLDSLEYDTIIQRYASSSSELSKLFKE*" XX SQ Sequence 5029 BP; 1738 A; 787 C; 868 G; 1636 T; 0 other; cagtggcgga ctggcccacc ggccatttgg ccggtgggcc ccttgaaata tatataacgg 60 tgggccccta atatatatat atatatatat atatatatat atatatatat atatatatat 120 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 180 atatatatat atatatatat atatatatat atatatatat atatatatat atatataatt 240 attacaatta attataaagt taaactattt ttaatttctt gtatatacta ttcaactcta 300 ttaagtgcga aattctctaa agaaatagtt ttactgtaaa actaatttca tgtaaaactc 360 tcctttgaca atcaataatg atcgtaaaac tttacaatcg ttattgattg tcgaagaaga 420 gttttacata taattgtcat aagggaattg ctaaggtttt gcaattgtta atgattttca 480 gatacaaata taatcaatga ttttcctaag ggaattgact ttgtaactat tattgcaatt 540 gcggtgcttg aattaattaa aaaatgaatt aaatattcat ttcaaaatta aaaaaataat 600 agttcaacac aaaaacacac ttgatcaacc agtttactga agcgatgtta gtcttctttt 660 gaggttggta ttttagtttg tataaaattt attatctttt tttaatctaa aacttttagt 720 taacttccgc gctcgaaata attatttaaa ggtgatcatt gatcatacta atcggctgaa 780 atgataagat ttgaaaggta gaggattaaa taaatttccg gatgatcaag ttttctgtaa 840 ctttttataa gcattactgt ttatgtaaca cataaataac actttttgtg ttagtcagct 900 taaagtacct cagcgtagtt taactatgct ggtttccgat ttaagtttta ctaatttgcc 960 ctattagggc aaattagtaa aacttaaatc ggaaatttta gcaaaatcct aaactttgac 1020 aatgtataac ttaggatgag ataaaataca tgaaaaggta ttttttaata cagctacttc 1080 tgtttgtgtt gaaagaaaaa aatatgtata tatctttttt gcttaaacag aattttagat 1140 ttttattaag tctatatttg aaattcttta ggatttcgtt ctgtcgcttc aaattgttaa 1200 aaaattataa acaaaattta aaaattcatt aatttaaaca tgattataga tcaagagagt 1260 aagaactgtt ataaaatgtc taatccaaat tatttcttaa aaacatgttc ttttttcatt 1320 aataatcgta atgttcttca cctatcaagt taaagtacta aaaaaaagga taagttggta 1380 gttgctactc ttttccttct atttccttca aggaatacaa tagaaaattc atctataatg 1440 aattttctaa tgtttaaatg caatagttta aataattcca gcagtaagca aagtcataat 1500 ataaaagaag aatacaagaa tttcacagat ctggaaatta tagctatagt taagattcta 1560 aaatataccg gaattagtgc cgtggctaag gagtcattgc ccccctttct ttaccaaaaa 1620 tctgtcatta gtggcctgtc tttatttgag ttttaatatg tttgatttgc aatttttgcc 1680 gttcaaacaa atttgatcgg acaaaattta tttggggaga tctaaatatc attttttcta 1740 aaaggagacg atctttattt ttgctttatt ttttaaatgt taaaatctta actcaggtca 1800 ctatcatcat gtcaaaatct ggagcacaaa agcgcaaaga aaaactacag acagagatag 1860 aagtttctgc aaaaaagtgc aagttaatca catcattttt gtcagtacaa ccacaatgtc 1920 aaggcaatat tgagtcgaat gtacaaatta catcagagat aattaaagca cgaggtggta 1980 aagatcttga gtcaaattta cagagtacat caggaattat acaagcaaat ggtagtcgag 2040 atattgagtc gaatttgaag agtacatcag caataatact agcgcgaggt ggtggagata 2100 ttgaatcaca ttcgcagagt acatctgcaa taatacaagc actaggtagt cgagatcttg 2160 agtcaaattt acagaataca tcagcaataa taccagcacg aggtggtgga gatattgagt 2220 cgaattcaca gagtacatca gcaataatac aagcactagg tagtcgagat cttgagtcaa 2280 atttacagag tacatcagga gttatacaag caaaaggtag tcgagatatt gagtcgaatt 2340 tgaagagtac atcagcaata atacaagctc gaggtggtgg agatattgag tcgaattcgc 2400 agagtacatc agcatttgaa tatccaccac gtttgcagtt aaaatgtttt tttagcaaac 2460 atccaattca accacatgaa catttgccgt ttgataacaa gatctaccat agacaaacgg 2520 acttgaagga tagcataccg agaaattggt tgtcatttag ccaaaattct cagtctctat 2580 attgctcatt ttgtgttgca tttgaatcac cacagtctaa caccgtttca tcatttgtgt 2640 ctggcttcaa agattttaga cgagttagcc aacgggttgc agaacacgaa tcaagcagca 2700 ctcaccgcag gcatgttctt gattacattc gtataatgaa taatcaagac tttgacagat 2760 tttatagtac agcaataaag aaatataatg atgaagtact gatacggaaa caaatacttt 2820 ctcgagttat tgatatcatt aaacatttat gtaagcaaac attgccattt cgaggtcata 2880 ggaatgaagg tgcttattcg ttggacaatg atgaagaaaa tcatggaaat tttttggctt 2940 tggtacagct cttggccaaa tatgacccac ttttagcttc tcatctctcc atgtgtcaag 3000 agaaatcaac caaaagaata gaaaaactga agcgtcaagg taaagccggc agtaaaggtc 3060 gcggtgctct ggttacgttt ttaagtaaga ctactatatc aaagcttctc aatattatga 3120 aaaatatgat gcaggaacag attgcaatgg aagtatccaa agcaaaagtg tactccgtac 3180 aagttgattc tacacaagat atatctgcaa ttgaccaatt cagcattgtg gttcgctatg 3240 ttttaaatgc tactgtgcat gaacggctgc tttctattgt accaagcaat gacggtactg 3300 gccaaggctt gttcgatctc ttaaatttga cgctacagcg gcttggcttt aacttgaaat 3360 actgcctatc ggacagcact gatggtgctt caagttatca tggccagtac aatggactgc 3420 aacaaaagat tacagaagct gcagaccacc atgtacatat ttggtgttat gctcatgttc 3480 tcaacttagt tgtgaaggag gcaaccagtt gctgcattca ggcagtttca ttttttaccc 3540 ttttgcaaaa tgtttcaaca ttcataaaag tgtcatataa gcgaatggca acatggatta 3600 aacttgttga aacacaaatt ggaatggata agatgaaaag attgaaattg attggtgaaa 3660 caaggtggtc gggtaaatca aatgctgcaa cagcgatatt tggaacattt gctgagccat 3720 cttcaacagt gtttgtcaat ttaattttat gtctgtcatg tataagcgaa tcagaaaatt 3780 ttgatacaaa gacacgtcat gaagctaaga cattgctcat gccattcctg aaatttgaca 3840 ccattttgac tgcctttacc tacctgcgag tttttgaaaa agtgggacca ctttcaatat 3900 acctgcaaac tcgtggatta aatgtgttgg ttgcattcaa gatggtggag aaggccgtaa 3960 ttgaattgaa aagtcaatca agactttttg atgatgttca taataacgct cttcaatttg 4020 tacggatggc aaatgcaagc atcattcaga gaactgaaac catattagag gtaaagatag 4080 aactaccagc taaacggaag cggaagacac caacgatgtt agatgaacat acagtagacg 4140 aacgtgacga atctaaagct ttggaacact acaaaattca cacttacaat gtggtgatgg 4200 atcaggtagt ccaaagtctt gagtcacgtt tcacttcaca tcgacaatta tatatggaca 4260 tggcatgttt tgatacatca aactttggtg atcttgccat ttcgggaatt cctgaaaaca 4320 gtttacacag cattatcaaa tttttaccag atgccgacgt tgataaaata aaggccgaat 4380 tgttatcatt tgtcactaac tatgagattt tgaaattgtc cttgccctta acaacaacat 4440 tgagagagaa tgcatatcaa aaggatgaca tagaagaacc gctctgttat cagcatacaa 4500 atggagcatg cggatcgtgt ccatcttgtg tgttacgaat tcttgctgca tatagactca 4560 acgacaaaac ctatgacaat ttgtatcata tttataaaat aatatgtact atttctgtaa 4620 cacaagccga atgtgaaaga tcgttttcta agttaaagtt aatcaagaca agattgcgta 4680 attctatgtc aaatgaccac ttagagtcgt atatgttgat gtctgtagaa aaaaatctac 4740 ttgatagcct tgagtatgat acgatcattc aaagatatgc atcttcgtcc tctgaattga 4800 gcaaactgtt taaagaatag ttattgctat actttgcact gtgcaaattg cacatcactt 4860 ctactttttt aattcaatat aattttatta atctgtgatt taattgctcc tcttaagact 4920 tatttttaga gagttgcgga ccggacagtc tttgcgatat atgtatggcc gggagggccc 4980 tttaattatt ttcatggccg gtaatatata tacagccagt ccgccactg 5029 // ID Transib-16_HM repbase; DNA; INV; 3828 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-16_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3828 RA Bao W. and Jurka J.; RT "Transib transposons from the hydra genome."; RL Repbase Reports 8(12), 2105-2105 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1014..1313,1313..3103) FT /product="Transib-16_HM_1p" FT /translation="MKRIDLLQKIKGLGFNKVNLLESLFKEVLNVYNLHED FT DFNKSELTNLKNKLSVFKAKVGNKLEKHNRNYDRLILKEGVWLQVKKQXYF FT LINFFYALILFNLSFYLQEEFFPDLVKRIKSKEGPGRKNKVWENLGERSKR FT AKIAHLAENNHEALALASLKSAATESNINKNNFIFIVKKALDPDKANEIRK FT NMSMHEPVKMKVEDALSLKIQCDLSDEQYQIIRNSSLLQNANIYPSLHKIL FT TEKKKCYPDNMIFSETSAVCSLQSLFNHTLCRVLSLNSCNKIFSELSVKDI FT NQHGIMHFKGGFDGASSQSLYKQKYLDTTLDEVIKNEESIFQTSIVPLKLT FT VNDTVIWYNKKPSSTHFCRPVCLQYKKETNVLIKEEEERLRIEIQKLEPFV FT VSLESKPTESSMMTAIIKYNLDLTMFDGKVINALTSTTSSQSCNVCSAKPT FT EMNDIKLIRSKPVNKEALFFGLSPLHCWIRCFEYILHLGYKLKIRSFYAKT FT SEQKESVKERKAFIQKKFREELSLIVDMPKQGFGNTNDGNTARRAFEESIT FT FSEITGVDVDIISRLETILKAVCSGYDLDPSSFQSYCNDTTDKILSEYNWY FT VIPPSVHKLLEHGLQIANALELPIGVYSEESLEALNKEIRNTRLNHSCKIS FT RINVMKNQFNYLMIRSDPVISSIQFTNHRCENGKLLPIEVLSLLKQS*" XX SQ Sequence 3828 BP; 1428 A; 467 C; 586 G; 1344 T; 3 other; cgcacagtgg ttcaagatcc ttaaaaagtg gcaaaaaagg aaaaaaattt aagtctttag 60 tttgtaaaca caaataactt ttaattatgg taagaccaga tgcataacaa tttaaaactt 120 tttttgaagc tattgcaagg ttgtgtatta atttgaagca ctttactaat gcttttttta 180 tgacggaaaa aaaacggaaa tgttcgataa aatagtcagt tttagawatt tattttatgg 240 agacatagta gttaaagtgt tagtattttt tattaattta cttaaattgt gtcaattcac 300 aatgtaaatg aaagattttt ttttaagwtt attaggttta agccattact cagcatgaag 360 aaaaatatta gtaaaaaatt ttttttttat tagtaacaca ttatttatgg tacaagaaca 420 aaaatgttcc acataactcc acgcttatgt tagtgttttt gagtactgtc agcaatatac 480 tatttaatgg aataaaattt acctgctcca atttgctcca atctaaaata gagcttttat 540 gatttacttg ctaaatatta taatcagcga gtatatgata aaaggttgga gcaaattgaa 600 gaaagtaaat tttatttaat tattttaaat ataagctctt ttaaaaaaag agtatcagcg 660 ttacctaata tttttagttt gtatatatat atatatatat atatatatat atatatatgt 720 acaaatattt taaattgtat tctacaaaat agagtgctca atgttcttaa aaaaacatag 780 ctataataaa ttagtagaaa atcacttatt aaattttttt tcattttgat gagtctttat 840 tgatgaaaca cagtgtaaaa tataaaaaaa tttgattagt aattttttac taatttatat 900 atacatatgg attttcaaaa tacttgtagc tattaagtta taatgtaaat gacaataaga 960 ttgttaaact attgataact ttttgtgtgt tatgtaatag ataatatata aaaatgaaaa 1020 gaattgatct tcttcaaaaa attaaaggac taggtttcaa taaagttaat ttgctggaat 1080 ctttgtttaa agaagtttta aatgtttata atcttcatga agatgacttc aataaatcag 1140 agcttaccaa cttaaagaat aagttgtctg tttttaaagc aaaagtggga aataagttag 1200 aaaagcataa tagaaactat gacagattga ttttaaaaga gggtgtttgg ctacaggtaa 1260 agaaacaawa atatttctta ataaattttt tttatgcact cattttattt aattaagttt 1320 ctatttacag gaggagtttt ttccagatct tgttaaaaga attaaatcta aagaaggtcc 1380 aggtagaaaa aataaagttt gggaaaatct tggtgaaaga agcaaaagag ctaaaattgc 1440 ccatttggct gaaaacaatc acgaagcatt ggctttagct tctttaaaaa gtgctgcaac 1500 tgaatccaac attaataaaa acaattttat ctttatagta aagaaggctt tggacccaga 1560 taaagcaaat gaaattagaa aaaacatgtc tatgcatgaa cctgttaaaa tgaaagttga 1620 agatgctctc agcctaaaaa ttcaatgtga tctgtccgat gaacaatatc agataatcag 1680 aaatagctcc ctattacaaa atgcaaacat ctatccgtct cttcataaaa ttttaactga 1740 aaaaaagaaa tgctatcctg ataatatgat tttttcagaa acatctgcag tttgctccct 1800 tcaatctctt tttaatcaca cactgtgtag agtcttgagt ctaaatagtt gcaacaaaat 1860 attttctgaa ttatcggtta aagatatcaa tcaacatgga attatgcatt tcaaaggtgg 1920 ttttgatgga gcgtctagcc agagtctgta taaacaaaaa tatttagata caactctaga 1980 tgaagttatt aaaaatgaag agagtatttt ccaaacatct attgtgcctc ttaaactgac 2040 tgttaatgat acagtaatct ggtacaacaa aaaaccatct agtacacatt tctgcagacc 2100 agtatgtttg cagtacaaga aggaaacaaa tgtgctaatt aaagaagaag aagaacgatt 2160 acggattgaa attcagaaac ttgaaccatt tgtagtaagt ttagaatcaa aaccgactga 2220 gtcatctatg atgacagcaa ttataaagta taatttggat ttgactatgt ttgatggtaa 2280 ggttattaat gcactgacaa gtactacctc ttctcaaagc tgtaatgtat gttcagcaaa 2340 gccaacagag atgaatgaca ttaaattgat tagatctaaa cctgtcaata aagaagcttt 2400 attttttggc ctttcacctc ttcactgttg gataagatgc tttgagtaca ttcttcacct 2460 tggatataaa ttgaaaatta ggtcatttta tgctaaaaca tctgagcaga aagaatcagt 2520 taaagaaaga aaagcattta ttcaaaagaa gtttagggaa gaactaagtc ttattgttga 2580 catgccaaaa caaggttttg gaaacactaa tgatggtaat actgccagaa gagcctttga 2640 ggaatcgatt accttctctg agataactgg tgtagatgtt gacatcattt ctagacttga 2700 aacaatttta aaagcagttt gttcaggata tgatttagat ccaagctctt ttcaaagtta 2760 ttgcaatgat acaacagata aaattctttc tgaatataat tggtatgtta ttcctccaag 2820 cgttcacaaa cttttagaac atggtttaca gattgccaat gcattagaac ttcccattgg 2880 agtttattca gaagaatcac tagaggctct taacaaagag attagaaata caagattaaa 2940 ccattcatgc aaaatttcaa ggataaatgt tatgaaaaac caatttaact atcttatgat 3000 aaggagtgat ccagttatat cgagcattca attcactaac catagatgtg aaaatgggaa 3060 attgctccca atagaagttc tttctcttct taaacagtcg tgattttttt gagaatttta 3120 ttttttatag ataaatttgc atcaaaaatt aaaatttaaa tttagaatag ttataatagt 3180 aattttcaaa gtatattcta attctcagag tataactatt atatgctctt tgttgatgtc 3240 tgaatttttt tttaattatc attctttcaa tcattgtttt tatcaaaaga ggagggtaat 3300 aattgatgaa ataaatttga aataattaaa aataaattat tttatgaact aaaatttgaa 3360 atagttataa tagtaatgct caatgaattt tattgtattc tttttgttga ggaatgaatt 3420 tttttattta ttttttttaa agttctttca atcattgttt ttaacaaaag gggaggggcc 3480 cggtaataat tatgacaaat ttggatggta gtgtaccgag tgatcacgcc tgattacaag 3540 ttggagagag ttaaaaaaag atttaaaaat aaattatgta tttatgcaaa ctattacttg 3600 attcgagaag ttaaattaaa aaaaaaaatc tgcttgcgcg tatcaaattt cttacgtacg 3660 tattttttag atacgcgcac gttttgctgg ttgttgcgcg taaaaataat ttaaaaatac 3720 gtaattgatc ttttgactaa aaaaataata actatttctt taatacatat taacatgcga 3780 tggttatata atttttgccg ttttttcttc aatttgagtc actgtgcg 3828 // ID TE-X-3_NVi repbase; DNA; INV; 1018 BP. XX AC . XX DT 11-MAY-2009 (Rel. 14.06, Created) DT 11-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE nonautonomous transposable element from Nasonia vitripennis. XX KW Transposable Element; Nonautonomous; TE-X-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1018 RA Bao W. and Jurka J.; RT "Transposable elements from Nasonia vitripennis."; RL Repbase Reports 9(6), 1163-1163 (2009). XX DR [1] (Consensus) XX CC This element lack obvious TIRs.The TSD is undefined, although it CC shows a polyT/A preference. XX SQ Sequence 1018 BP; 312 A; 211 C; 177 G; 317 T; 1 other; tgaagtgata acttcttatg ggacggtgaa cctacaatgt ttgtaacttc tggaatgcgt 60 tagatcgaat gcgttaaatg gaatgcgtta cagtgatatc tataggtata ctggtatagg 120 gctcttagct ctcgacccgg caggccccct tctccgtgcg caaagctaaa tatagattta 180 ctctcttagg accttataca ctctctccgg gcgcagggat aagtatacat atagctatac 240 tgttttcgga ccccctccct ctccgagcaa tacctataga tttactctct tatgcatact 300 ataccgcatt actttgctta cgctttcctt cactgtcttt ctcattctct ctgtactcct 360 cgttggtata tcgccttcta cagacagtga cactgtgtat ttgttatcca gtgtcatacg 420 aattgttaaa tgcataattg tgaaaataat tgagtgagct ctatcggtac ttcgaggagg 480 gtagcttacc gatcggcaca gctgattcgg acacacgcca atttcggtta tcgatacaga 540 aggattgcca tacgagatcg tgtttgtacg gaataaattt tatataatca caaccaatat 600 taacgtagta gatgggcttg cgaacggtgc agttggaaaa ttagtttaca ttgaaactaa 660 cgatcaaaat gaaattacac gtgtatggtt gcactttcaa gattcaccaa aaactggaca 720 aaaaataaga ctaaaagctg ctgctcacat aactaaatat aatatccatc ctcaagcggt 780 gccaatagac cgaagaactg caactgttta cttaaataat aataaaacca ttattgcaaa 840 aagaaatcat ttcccgttag tatcagcatg ctactttgaa aaaaatatwc ttctttattt 900 gtataatcct ttccaaaaat tatattcatt tgaataataa ataagttcaa ttttaaatac 960 tttctactta aatcaagtta agttatcact tctgccgggc gtcccgagac gcacacat 1018 // ID Mariner-29_SM repbase; DNA; INV; 2492 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-29_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2492 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1878-1878 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 466..2187 FT /product="Mariner-29_SM_1p" FT /translation="MNRKRSSDENGKKSTKRNKKVISLDVKHDILQRFQAG FT EKAVFIGRALGLAPTTVRTICNRDAKQIKTLYEAVTDSQLKSTVINRNALI FT VKMESLLLIWIEHQTQKSMPLSKMVIQAKAKSIYDDLQEKLPISNKAIYEQ FT FNASGGWFERFKFRGNLHNICLKGEAASADLAAANDFNKILKAVIQEGGYS FT SKQVFNIDETGLFWKRLPKRTYISKTESSAPGFKVSKDRLTLLLGGNANGD FT FKLKPLLIYRAENPRALKGVDKRNLPVHWRSNKKAWVTGSLFVDWLSKCAI FT PEFKNYCERENLTFKILVLLDNAPGHPTYINDLSDNVKFLFMPRNTTSLIQ FT PMDQGVISNFKSYYIRWTFKQLLHAADADEETSMAVFWKNYNILKAVKNIG FT DSWAEVKSSCMNGVWRKIWPECTATFTEVDDVPLVRTEIVNLAQEAHFEGL FT DEADFDEILNVEEDFDVSTADLIEMAQNIYENDDNSVEVPEVKQLTTKGMG FT KAIKLISEALEIFVAEDPDADQSAKVTQAVEASISCYKKIYLDKKQKAVQI FT TLNNFIIKQPKEKQTPNPEPTYELESE" XX SQ Sequence 2492 BP; 902 A; 357 C; 427 G; 806 T; 0 other; cactagagcc tcgagttgcg ttgttccaac ttgcgtcctt tcgagttgcg ccacaaaaat 60 tttgataacc taattttgag ttgcgttgct aattttcgaa atgcgtcgaa aaaaagtttt 120 aaaataaaat ttaatcaaaa aatagcataa cgtctatatt caaaaattaa aatttaaacc 180 gaaactagaa cgccatgatt tatttttaaa tgcatctgca aataaaataa ttttttattc 240 agtatttatt agtataaaag tttagttctt agttgtttgt ataacttata acgtttaagt 300 gtttcttggt tttttaataa taattttttt aaattttaaa aaatcaatta caatatatat 360 acatttctca aaggtattac attttgttta ttgtttttta ttattttttg ggtttataat 420 tgattttttg aaatgttttt tattgcccta ttgtagaata gggttatgaa tcgtaaacga 480 tcgtcagatg aaaatggtaa aaaatctacg aaaagaaata aaaaagtaat atctttggac 540 gtaaaacatg atattttaca acgtttccaa gcgggagaaa aagcggtttt cattggtaga 600 gctctcgggc tagctccaac aacagtaaga actatctgca atcgcgatgc aaaacaaatc 660 aaaacattat atgaagcagt aacagattcc caattaaaaa gtactgttat aaatcgaaat 720 gcattaatag ttaaaatgga atcactactt ttaatttgga ttgaacatca aactcaaaaa 780 agtatgccgc tgagcaaaat ggttattcaa gccaaagcta aatctattta cgatgatctg 840 caagaaaaac ttccaatatc gaataaagca atttatgaac aatttaacgc tagtggtgga 900 tggtttgagc gttttaaatt tcgcggaaat ctacacaaca tttgtttaaa aggtgaggca 960 gctagtgcag atctcgcagc agcaaatgat tttaataaaa tattgaaggc tgttatacag 1020 gaaggtggct acagttctaa acaggtgttt aatattgatg aaacaggttt gttttggaaa 1080 cgtctgccga aaagaaccta tatttcaaaa acagaatctt ctgcacctgg ctttaaagta 1140 agtaaagatc gcctaacttt acttcttgga ggcaatgcca atggtgactt caaactaaaa 1200 ccattgttaa tttatcgagc tgagaatcca agagctctaa aaggtgttga taaaagaaat 1260 cttcctgtgc attggagatc taataaaaaa gcttgggtca ctggttcttt gtttgttgat 1320 tggttgagca aatgcgccat tcctgagttc aaaaactatt gcgaacggga aaatttgact 1380 tttaaaatat tggtactact ggataatgca ccaggacatc caacatatat aaatgatcta 1440 tcagataatg taaaattttt atttatgcct cgaaacacaa cttcacttat tcaaccaatg 1500 gatcaaggag taatctcaaa ttttaagtca tattatattc gctggacatt taaacagcta 1560 ctgcatgctg ccgatgcgga tgaagaaaca tctatggctg tattttggaa aaactacaat 1620 atattgaaag ctgttaaaaa tattggtgat tcttgggcag aagtaaaatc ctcttgtatg 1680 aatggtgtat ggcgaaaaat atggccagaa tgtacagcaa cttttacgga agttgacgat 1740 gtgccattag ttcggacaga aatagttaac ttagcccaag aagctcattt cgaaggattg 1800 gacgaggccg atttcgatga aattttaaat gtagaagagg attttgatgt cagcaccgca 1860 gatttaattg aaatggctca aaatatttat gagaatgacg acaattccgt agaagtacca 1920 gaagtgaaac aattaacaac aaagggtatg gggaaagcaa ttaaattaat atctgaagca 1980 ttagaaatat ttgttgcaga agatccagat gctgatcaaa gtgcaaaggt tactcaagct 2040 gtagaggcct ccatcagctg ttataaaaaa atttacttgg ataaaaaaca aaaggctgtt 2100 caaataactc tgaataattt tataataaaa caaccaaaag agaagcaaac gccaaatcct 2160 gaaccaacgt atgaattaga atctgaataa attctataaa tttatgttgt ttcattattt 2220 tacattattt taaaattcta ttaatgtttg cattagtttt tggtttaata aaacttatta 2280 aataatagtt aatctattca tacctatttt ttatttaaat atgtagcaat ataactaatt 2340 tttttttaaa aaacatggat ttcagatttt atcggaacct attatataaa aacaggtaca 2400 aatgtataca aaattaaaat tcgaattgcg ctgtttcgag atgcgtcgcg atttattgta 2460 acgcattaac gacgcaactc gaggctctag tg 2492 // ID hAT-68_HM repbase; DNA; INV; 5873 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-68_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5873 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 408-408 (2009). XX DR [1] (Consensus) XX CC TSD is 6-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2782..4323 FT /product="hAT-68_HM_1p" FT /translation="MNNLGGMGANAIKNAIDSIFNETGNIPLTAAAYKSKL FT VSATADGASVNFGIYNGVLTQLKNDRXWLIKIHCVNHRLELAIKGAVKDIS FT QFKECERFYISIFSLFRNSGKLKSAVKKAAEXLNITYYTLPKISGTRFISH FT RRRGFTKLLHNWPSLIVGFENALADRDTKADMRAKISGISKRLHEYRLLCI FT VCSYLDILEKLSPLSLVFEKKMLMVNELKPAVDVTKASLDKLGNEGIDDII FT DSYLLKFRINEKDGTTNLVSSYFKEGHELKKSNPEFVQIELGNMANLNLEC FT LSSAIKLRKSAIDIILPLINDRFSSLLNPIFESMDWLDPQLWTADSMYGDV FT SISLLLKELFYPLEAAGIDFKMVLSEWKAAKLVINTQYTIALTPLEMWQKF FT FLHHRIKFPNFCLLVELMMCISGSNSAVERIFSILTVILTDRRLKMNHSTM FT EDSIIIAGNDQNFTYQEREDILSRAGDIYLSKRRVYRLDSANGSNVTDYSS FT DESSXSNDLSSTSESDE*" XX SQ Sequence 5873 BP; 1985 A; 794 C; 975 G; 2076 T; 43 other; caggcttggt cgggaagcgg gagcgatcgc actcgaataa tgaccacgga aattatattt 60 agttgcgaaa aactggccag gttttttagt cgttttgaga acattttaaa aacaaaaaaa 120 aaagcgcaga aawgattatt ggaccgatga gagacaatta caaaaaaact aagaaaagaa 180 yattagtttt aacgcgaaaa aacttatagc aaaataaatt aygtttagaa caaatatttt 240 cgaagcatcg ctcttttata acttttttat aaaattaagc aacatttttt atataaagta 300 acatcttacg atttcttaat aaggaaacaa aatgttcttt tatttatcaa agagtatatt 360 ataattttta tttataaact catgtcttat ttccagtgct ggattagtaa gggcccgcat 420 ttttaggggc caygaaattt tccagaaaac ttatttttat gtcattttta cattgtggca 480 cataaaaaag accttcacaa taaaaatagy cctaagcccy gaaaagtcca acaagattct 540 aataaaacta acgcttattt tagttttatc tatgctaaat tattttataa ataaaaaagt 600 taaatttttc aatacctgtt ttatgcttcg cgtagtttaa aaagtttaac atatttgaaa 660 caattcttct tagtaagatr aacaaataaa ataattaaat catagagtta ctttagaaat 720 taaactaaat agttaaaaac catctaaaaa ggttaaaaat attttatgaa gaggaagcgg 780 aactgcttcg taataccgca gatgatgttt agtgagtggg ctttttaatt ttaattagta 840 tttaaaacag aaaaaagtaa ctttaagttt aagtataaca agtttgtaag aatggcaact 900 gaagcaaaat taattagagt tagtaaacca atatctaaga gcttaaaaac ttttctttct 960 tgggggaaag aatcagttat cgggtacacg gaagaaaacg gattagttgt taaaatatgg 1020 tgtaagattt gtgctaggca caaaatagct atattaaaag atactttagt taaaggagcg 1080 ttgaattatt cgtttattga gggaacttgt ttggtgacaa aacatcaggt aaaattaaaa 1140 tgatttactt tttatatttt tcagtccaga aataaattta tcataatttt tatgaattta 1200 gttattttac tatgtttcaa tcataatttt tatgaattta gttattttac tatgtttcat 1260 taaaagtatt atgtataaag tattatataa tgtgccgtga atttatntta ttagttatta 1320 cattgcgaag agaaccgact ttttattgaa ttatttaatt ttaatcgtga atttttttga 1380 taataatgaa ggttgatcgc catcttaaag gacaatgtca taggttggcg gtacaaattg 1440 aaaatgggta ctctggaagt agtcgagtca ccaatctcga tatagaaaac tcggtaccac 1500 ataacagccg cgatgcgtta tcaaaaatgt taagaactgc ctatgaaatg gctttaaaac 1560 cgagcatgcc acatagccat ttcgagacgc tgataaaatg tcaaagaatt aatggtgtac 1620 ttcttttaga aggaaaatgc aacaataaag caggttattg atttttttat aatctttaca 1680 accgtttttt ttctaaatga ttttgtattt ttttacaaca ttttaattaa cttgcattct 1740 tgtaggtttt ttatttatat attattttgg tgtccgcgca aaacaaacac gaggtcagca 1800 tactaataaa agaaatcgtt taaaaaagta actaaaacaa acaacgctaa acaggcttaa 1860 gtttagggat cgtccctaaa ttatgcaacg caaaacttat ttaaaaaata taagttagag 1920 atcattttgc tctttggcgc ggcgattttt gcgggacttt aaaggttgtt tatatactta 1980 atttaattaa tgactatgcg agggaaaaat ttcaattctt tgttttttgt cttaagttta 2040 attagtataa tgactatgca aaatttaatt ctttgttttt tgtctataag tttaattttg 2100 cgttacgtaa tttatggaga aatcgttaag aagagtaatt ataacaaaca acaaactttt 2160 agagctttag tttagcgtga ataaatagar tcattgttgg ataacaaaaa aaaagtataa 2220 aaaaactctt taaaaacaac attttaaaaa ttgmctaaaa agtaaaaaaa atatataaaa 2280 aactatttaa aaacaacatt ttaaaaatgg actaaaaagt atttcacgag acycaaggac 2340 tttgtgtaca tgcacaacct gaaatatcca ttaaagtcaa tgactgaaaa tgttgggcaa 2400 ctaagtacaa cttgttacta ttggtgccca atattttcag tcattgactk taatggatat 2460 ttcagtttgt gtatgtacac atagttattg tgtcctgtta aatagtttat tttttacttt 2520 ttggtccatt tttaaaatgt tgtttttaaa aagttttttt tatatatttt tttttaattt 2580 ctttttttag cacgtgaata tatctcctgc attgccagtg ctgtaaaaga aaaagttact 2640 aaaatcatta atgataaaag ttttttctca attctctccg atggytcaca ggctagaaaa 2700 acaaaggatg agaaagagtt gattcttgta cgcgttgaac gcgaggggac ccctgcttac 2760 tttgtagtat ctttacttga tatgaacaat ttgggtggta tgggtgctaa tgccattaaa 2820 aatgctattg acagcatttt taatgaaaca ggtaatattc ctttaactgc ggcagcttac 2880 aaatccaaac ttgtaagcgc gacggccgay ggagcaagcg tyaattttgg aatttataat 2940 ggagtgttaa cgcaattaaa aaacgataga aygtggctga ttaaaattca ctgcgtaaac 3000 catcgtttag aattagctat aaaaggtgct gtaaaagata tatcgcagtt taaagagtgt 3060 gaacgytttt atatttctat ttttagtttg tttcgcaatt ctggcaaatt aaaaagcgcg 3120 gttaaaaaag ctgctgarry tttgaatatt acatactaca ccttgcctaa aatatcggga 3180 acgcgcttta taagtcayag aagacgcggt tttacaaaac tattacataa ctggccttcg 3240 cttattgtgg gctttgaaaa cgctcttgcg gatcgcgaca ccaaagcaga tatgcgtgct 3300 aagatttctg ggatatccaa acggttgcat gaatacagac tattatgtat agtttgtagt 3360 tatttggata ttttagaaaa attatctccg ttatctttgg ttttcgaaaa aaaaatgtta 3420 atggtaaatg aattaaaacc agcagtggat gtaactaagg ctagcttaga caaattaggc 3480 aatgaaggta tcgatgatat tattgattca tatttactaa agttcagaat caatgaaaaa 3540 gatggtacaa ctaatcttgt ttcttcatat ttcaaagaag gccacgaatt gaaaaaatcg 3600 aatccagagt tcgttcagat tgagctcggc aacatggcta atcttaactt ggaatgtctt 3660 agttctgcca ttaaattaag aaaatcagca attgacatca ttctgccatt gataaatgat 3720 agatttagtt cgctgttaaa tccaatattt gaatcaatgg attggttaga tccgcaatta 3780 tggacagcag acagcatgta tggtgatgtc agcatatctt tattactaaa rgaactcttt 3840 taccctctag aggctgcagg aatagatttt aaaatggttc tttcggaatg gaaagctgca 3900 aaactagtaa taaatactca gtacaccatt gcactcacac cactggaaat gtggcagaaa 3960 tttttccttc atcatagaat caaatttcca aacttttgtc ttctagttga acttatgatg 4020 tgtatttctg gctctaattc cgctgtcgaa cgcattttta gtattttgac tgtaatactg 4080 actgatcggc gtttaaagat gaaccactca acaatggaag actctataat aatagcaggg 4140 aatgaccaga attttactta ccaagaacga gaagatattt tgagtcgtgc tggggatatc 4200 tatttaagta agcgaagagt ttatcgcctt gattcagcta atggtagcaa tgtcactgat 4260 tatagtagcg acgaaagcag ctwcagtaac gatttaagtt ctacatctga atcggacgaa 4320 taaacatttt attttacatt tctttactcg taagtcactt tatgtatgta attatgtttg 4380 tgttttgttg tgtatgcttt tgtttcgttc atgtatgttt gtgttctgtc tcracctctg 4440 gcgcggtgct gatccactcg taagtcactt tatgtatgta actatgtttg tgttttgttg 4500 tgtatactty tgtttcgttc atgtatgttt ttgttctgtc tcgacctctg gcgcggtgct 4560 gatccactcg taagtcactt tatgtatgta attatgtttg tgttttrttg tgtatattty 4620 tgtttygttc rtgtatgttt ttgttctgtc tcgacctctg gtgttgtgaa acaatattgt 4680 ctaatgttag atatagctcc ttaacatgaa gttttgttga gctcactyct tatgggatgc 4740 gtcattgctg tttgcccact acgcttctgy agtgcatgaa caaaacagtt ttttagtttt 4800 tattgctwag atttagtccc ataaaatcat tacattactg aagagtccta gaggacgaaa 4860 caatattgtc tgytactctc gagtccttaa tgcaaggaag ggggggttaa gggttgatgg 4920 gaattttatt cttaaataaa cagctgcttc actayctttt attaagtact cgagaataca 4980 taaatayttw tatatatata tatatatata twtwtatata tatataaata aatatatata 5040 tatatatata tgtgtrtata tatgtgtgtc ttgtcgtgta tatgtgtctt gtcgtgtata 5100 tgttcatccg cgagttactt tgtatgcatg tttgtgttta gtaaccagaa actttttgta 5160 tgtttatttc ataattagtt ttatcaactt tgatgattat tactgaaaat gcaatttttg 5220 ttgaaagtct ttatgcttac aaaataaata atacattaga atataaacag aaacacaaaa 5280 ttacataaat aaatacataa tttttaaata aaaaataaga atatactaat aaaatctata 5340 atttataaaa aamttttaaa attttgacaa tttaaaaaat atatattata aaacatttat 5400 ttttgtaaac aaataataat agctaaaaag atttacggct ttgcagatgt ggctctaaga 5460 atgagccata gtattaaaat aataattcgt ttgataatat attcttaaaa caatttatgc 5520 tagaaatctt aaaacttaac aaatcttgaa tgatacaatt ttaaaagaat attctagaac 5580 tttatttcgc ttartcagtg tattttttga accaatagaa ttagtttgtt cgtaaatttg 5640 gcgggttttt tagtgttagc tttttattct tcggcactaa gatgttttga gaacattcaa 5700 gtttatgtat gtttccatgt atgtccgtaa aacaaaayat ttttgtttta cggaaacaca 5760 taaacttgaa tgttctcaaa acgactaaat aacgtggcca gwttttcgca actaaatata 5820 atttccgtgg tcagtattcg agagcgatcg ctcccgcttc ccgaccaagc ctg 5873 // ID Mariner-1_TCa repbase; DNA; INV; 1781 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 22-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1781 RA Jurka J.; RT "Mariner/Tc elements from insects."; RL Repbase Reports 9(3), 674-674 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 417..1379 FT /product="Mariner-1_TCa_1p" FT /translation="MFSASEYCDMYLVYGASEGNALDAVRQYGIRYPQRRQ FT PSINIFRRLDQRLRENRTPIPEYKGIHPGRPRVRRTVETEEAIIDAVAENP FT SISCRQVGRLLNIPFKTVNRVTRDEMLHPYHYTKVQSLLPADYERRMAFAQ FT WILDKDVDFLRKILWTDESLFTNNGIFNRHNHHYYAINNPFLNFETGRQHR FT FSLNVWAGIVGNQLIGPHILPDRLGGDKFLNFLQFDLENHLLDDVPLAVRR FT EMWLQMDGAGPHYAVPVRRWLNENYENRWLGRGSATPWPARSPDLNPVDFF FT MGPYQRHCIYSTSAGKSSHFARKSFCSL*" XX SQ Sequence 1781 BP; 562 A; 300 C; 342 G; 576 T; 1 other; tcgggtgtgt caaaaaggtc ggataaactt tttagcacat gttaagtact gttcatagac 60 cccaaaaaat atttaaaaaa ttggtattat ttttttaatg gtatatggcc tttttaacag 120 ttacaraaaa acaacttaaa ctaaaaattc ctttgttaga tcacttacgt atggtattag 180 ttatatggtt tacattgtca taaaagcatt ttattgaaaa tttgtgttct atctaaagga 240 atagtttggt tttaatcaag ttaaagttag tggacaaaag tgcttcaaat gtgcagtagc 300 atatgacaac attgtaatat ttctatggaa acaactttga aggtattttt tggcaaagtg 360 tcaaacaaat gtgtcaaact ttttgatgat tttaaatcat ttattgtgaa acaataatgt 420 tttcggcaag tgaatactgc gacatgtact tggtttatgg tgcaagtgaa ggaaatgctt 480 tagatgcggt gagacaatat ggcatccgat atccccagcg acgacaacca agcattaata 540 tttttcgacg actcgatcag agacttcgag aaaatcgcac tccgattccg gaatataaag 600 gaatacatcc tggaaggcct cgcgtaagaa gaactgttga gacagaggaa gcaattattg 660 atgcggttgc cgaaaatcca agcatcagtt gtcgtcaagt tggtagactg ttaaacattc 720 cgtttaaaac agttaatagg gtgacgcgag atgaaatgct acatccatat cattatacaa 780 aagtccagtc tttattgcca gcagattacg aacggagaat ggcttttgct caatggatac 840 ttgataaaga tgtggacttt ttaagaaaaa ttttatggac ggatgaatct ttatttacca 900 acaacggaat ctttaatagg cacaatcacc attattacgc aataaataat ccatttttga 960 attttgaaac gggacgtcag cacagatttt cgttgaatgt atgggctgga atagtaggaa 1020 accaattaat aggtcctcat attttgccag atcggctagg aggtgataag tttttaaatt 1080 ttttgcaatt tgatttagaa aatcatttat tggatgatgt gcctctggca gtcagaagag 1140 agatgtggct acaaatggac ggtgctggtc ctcattacgc tgtaccagtt cgaagatggc 1200 ttaacgaaaa ttacgaaaat aggtggttag gccgcggcag tgctactcct tggccggcta 1260 ggtctccaga cttaaatcct gttgattttt ttatggggcc atatcaaaga cattgtattt 1320 actccaccag tgccggaaaa tctagccact ttgcaagaaa gagtttttgt agcctttaac 1380 tcagttactc cggacatgtt agaacgtacc acaaattctg ttcaaagacg tgccgaagcc 1440 tgcttattag caaatggcag aacatttcaa catttattat aaaatacatt gtttgttttt 1500 aattttactg tttattaaat tgctttgttc ctaaattagt tttccctaat tgttgtaaaa 1560 atatctatct attccggttc ccaaggaaac atttgtacct gatgctaatt tcgactatcg 1620 ctattgaggt aacatagcac aatgttgccg ctttggtcat tactttttaa agaattactg 1680 gctacacgat gtctgcacaa aaaaaatcct tcaaatttgt gatctctgaa ctactctcca 1740 gctatgctta aaagtttatc cgaccttttt gacacacccg a 1781 // ID CR1-4_BF repbase; DNA; INV; 4637 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-4_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4637 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4637 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1575-1575 (2009). XX DR [2] (Consensus) XX SQ Sequence 4637 BP; 1274 A; 1062 C; 952 G; 1349 T; 0 other; ggccaaaggt cacgaaaaca tggacggtcg ttaattttcg tcgctctatc gtccgcctat 60 ctttctttgg aattcgtaac tgcctagctc tgaaatttag ctaaatcaca ggccaaccta 120 ctgcccaagc ctctgagtca ggacttctgt tgatttcgag gttggttcct gcgttttttt 180 ccattgtttg agctcgtctc tggcggctgg caccgaggcc acgtttatcc gtcatggcgt 240 ccacccagag gaacacacgt gcgtcgtctc aaacccaagc taagtctgtg gagcttcggg 300 agaagctatc tgctgataac aagttactcc tggatgccat tgaagccacg cttctacttc 360 ccctgaagtc tgagatacag gagctgaagt ctcaacttgc tggcgttgct gacgagcaga 420 gggaaatcaa atcaagtgtt gcacaggtga agtctcaaat caacaccaca gatgcagctg 480 ttcgcgagac ctctgccaga ttaaatgatc ttgagcagta ttctagaaga aactgcctcc 540 tcttgcatgg ggctgtttca cctacctccg acaccgtcca gttcatcctc gacaccgcca 600 agaacaccct gggcatcgac ttgtcggagg gtgatattga tcgggctcac ccgctaccca 660 gacgcaaagg ttctgatggt caacgcccac ctcccattgt gatcaagttt gtctcgtaca 720 ccgccaggca caagtttttc tccgctaagt ccaaactcaa aggatctaag ctctttgtca 780 ctgagcagct taccaaggaa aatgctctac tgcttcgtgc aaccagagag aagatgggga 840 acggctggtc catggacgga cgcatctact gtctgcatga cagtgtcaag aagagggtcg 900 ccaaccgcac ggacttggac aacatggaac tgtaacgtta ctcactgtaa gttctgtatc 960 tctgctgagc taatggaacc tttacttatg gcataggcgg acgagagagc ccatgtccaa 1020 ctttgactgc attctgttgc cagtcacttg taaactcact ctagtatgag tataatcttg 1080 ttattagttt tgtatttatg taacaatttg tgtcaccttg ttaaacatgt ttgtcgctgt 1140 tctgtacagc atagttggtg ttgtcattat caattagcga ggaattcctc ctcacacgtg 1200 cgtgccaggc gggagagcat tgttccttta gtactccaat ccctccccac ataacatgaa 1260 gtctctctcc gtctgccacg ccaacgtgag aagtctacct gctaatgatt ttgttaaact 1320 ccatgaactc gaatctgttg ccctagatgt cggattcgac gtgatagcat taacagaaac 1380 ttggtgtgat gaaactattc ccgattcaga tattagcctg tcgtcatatc aatccccgtt 1440 acgacgggac cgcaacagac atgggggtgg agttgcgctt tacgcatcca ataatctacc 1500 gtgtaaaaga aggtctgacc ttgaaacaga tggttctgaa acaatctggt gcgaggtgaa 1560 gctaaagaac tggactgttt acatttgtac ctgttataga ccacctggcc agaatgtgga 1620 tgagcgcacg gcattcatag accacctgca gcgggcggta gataccattg tggacaatgg 1680 gccttctcag caaaatgcaa tagtcttact tggtgatttc aatggtaatt cagataatgc 1740 agctggtatg ataaacagat ttgtgtctga caataatttt tctcagttaa taacggaacc 1800 tactcgtagt accgacacct cagcatctgt gcttgatctg attattactg actcaccagg 1860 tatgtttgtg gaccatggta ttcttcctcc ccttgctaat tgtgaccatg ctgttactta 1920 tggaacaatg tctgttaagg tccctaaagt caacgcttat aaacgaacaa tgtatgactt 1980 tacgaacgtc gattatgaag aacttaattc taagattctt gaagtagact ggaattccgt 2040 gatatcggat cacattccag acgtgaatgc ctcagctgag gcttttatgt caactctcac 2100 agatgtcata aagacggtta ttccaagtaa agtagtcact atccgcccta gagataagcc 2160 ctggatgaca tgtgagatca ggcgcttact caggagaaag aggcggctct acggacggta 2220 caaaaaaaca aataactcta gtttattttg tgcatttaaa cgagttagaa ataattgtat 2280 agatttaatt agaaaagcga agagaaatca tcggcatagc ctctgtcttg accttgccaa 2340 tccaacttct aaccccaaga aatggtggtc agtttccaag gaaatctttc ttggcaaggc 2400 ggaccgggtt ctgccctctt tacttgagaa tggaaatgtt atgacagacg acaaggccaa 2460 ggccgatatt cttaacaact actttgcatc tcaaacgtac ctgaataccg atggcgcctc 2520 tttcccaaac tttgtaccca gaacagaact agctctcaat tcaattaatt gttgccccgc 2580 aaatgtctac agtgtactgt caaacctcaa tataaacaag gccacagggc ctgatggact 2640 aagtaatagg cttctgaagt gtatagcacc ttcaatagca caaccactgt cgcatttgtt 2700 taatgtttca ttagaatacg gctgttttcc gtcaatttgg aaattttcaa atgttgtccc 2760 tgtgtataaa aaaggtgata agcaggccaa agaaaactat aggcctattt cactcctgtg 2820 ctgtgtgtcc aaagtgtttg aacggatagt ttatgcccca ttatctgcgt acttacatgg 2880 caacaacctg ctaaccgatc gaaattctgg atttaccccc ggagactcaa cagttaattc 2940 attgacacac cttgttcata agctgcacaa agcagtagac caaggttttg aagtacgcac 3000 cgttttcttg gacatatcaa aagcttttga taaggtatgg cacgatggtc tccttttcaa 3060 attaagccaa ctagggatat ctggttcact tcttcaatgg atacaatctt atctcacaaa 3120 tcggaagcag agagttgtca ttaatggggc atgttccgac tggctaccga ttgacgcagg 3180 tgtcccacag gggtcgcttt tgggcccact ccttttccta gtttttatta acgatctagt 3240 ggacaacttg caaactgatg ccaatctttt tgctgatgac acgtctcttg tggaagtagt 3300 tttggatccg cttgtctcag caaatagatt gaaccatgat cttagaatgg tgggtgaatg 3360 gtcggaccag tggctggtaa cttttaatcc ccttaaaact gaactgatga cattctccat 3420 gaaaagggtc aaagtctatc acccaccatt agtgtttaaa ggagttgtgc ttaaggaagt 3480 tgactctcac aagcatttgg gcttaaactt gaaacgtgac ttgtcctggt catatcacgt 3540 gtctacattg attaccaagg caatgaagcg aatcaatctt ctgaagagga tatcatattt 3600 tgttcctcgg ggaacattag aaactctgta tgtcactatg attcgtccat tacttgaata 3660 tgctgacatt gtttggtgtg gccttctcgg caaagattgt gatcttcttg agtctgtcca 3720 gatggccgcg gcacgagttt gtaccggagc catgagaggc actaaacatg agagcttgtt 3780 agccgaggtt gcatgggaca cactagcgga cagaaggaaa aagcaccagt tgattcagtt 3840 ttggaaaatt atgaatggtc tcgcaccact gtacttacat aatattatcc ctccacgtat 3900 ctgtgatacg gtcccttaca accttcgcgc aaaccaaaac cttacttcca ttgcctgtaa 3960 aactgaaaag tataaaagat ccttcctacc ttcaactgtt cacttgtgga atgaactgcc 4020 tacgaatgct aaagccgtcc tttcgatcaa cactttcaaa aaagagctgg actcactatt 4080 caacaagccg acaaaatact cccatttctc ttatggccaa ggtcgttccg cagttcatct 4140 tacaaggctc cgtctaaact tcagtcagtt aaactaccat ctatttcaac accattgtat 4200 tgattatcct acatgtaaat gcggctatca ttgtgaaaca accgaacatt atctactaca 4260 ctgcacactc tatactagcc aaagaagttc tttgttatcg gctgtctcaa atattatagt 4320 agacctaaag cacctcaacg acactacttt atgtacttta ctgttatcag gatcaacatc 4380 tctctcacat caacagaact gtaacatttt tgatcttgtc attacctata tccattgtac 4440 aggcaggttc tcctaattga ttcagcacca acttgtcttg tttaaattta tttttctgtt 4500 atccattttg tacttatgta tgtaatgtcg tttactattc ttgttttatg cgtatgtttc 4560 ttgagtagaa ggcttaatat aagcaatttg tgctttccgc ctcatactcg caaaatatat 4620 gtgaataaat aaataaa 4637 // ID BEL-151_AA-LTR repbase; DNA; INV; 459 BP. XX AC supercont1.247; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-151_AA_; KW BEL-151_AA-I; BEL-151_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-459 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.247; Positions 1324018 1324476. XX SQ Sequence 459 BP; 149 A; 87 C; 83 G; 140 T; 0 other; tgccaccact ggcagcactg atttgaattg aatggatacg cgatacccct cgacgagact 60 atgcacgcaa cgggtgatat gaaccgatgc gtaagaaacg tcagggagac caacgaacgt 120 caaaacccac atacatttcg caagtcatag tagatgctaa gaataatgtg aattctttct 180 tcctaattat tgaaaattac tagttatatc aaggtattta gattgtgcca gttattccct 240 gaatttgtta cggtattttg cgctataagt atatcatact acgtacgata gacagtaaaa 300 tcctgaaact aagaaaaatt tgttctagta taggttatta agagcactac gtctcagata 360 ttgtcctcag atccgatttc tgaaataccc attgtgcttt ggttgtaagt tacttctgat 420 ttaacaattt gatctaagca aactgaatta aatactcca 459 // ID Mariner-22_SM repbase; DNA; INV; 1670 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-22_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1670 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1871-1871 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 445..1545 FT /product="Mariner-22_SM_1p" FT /translation="MKFANVSSRIKGNSGRKKLCRQELQNKILALPKCDRR FT NTRTINKKTAISLGTLCNLTKEKVIKRHSNSLRPMLTPQNKKGRLEFALSH FT IDQTTNDFKSMENIVHVDEKWFYLSRDKMNFYLLPEEEVPHRQVQSKRFIS FT KVMFLAALAKPRWDYHRKTKFDGKLGLWPIVEYEPAKRNSKNRPKGTFVTK FT PIEINREIYTKLIIEKVIPAIKTKWPQYQKSMEIKIQQDNARPHAKVDDPE FT IIKTGHTDGWNINLICQPPNSPDFNVLDLGYFNSIQSIQYQSSPKNIDDLI FT AVVQRAWEDHAVEKVENIFLTLQKAFESSMLAGGGNNYQLAHVGKEALRRK FT NQLPQTLTCNLEAIETARNVLRDM" XX SQ Sequence 1670 BP; 626 A; 278 C; 299 G; 467 T; 0 other; ctacctccgt tcaaaaatga ttgtctcatt tgacatttgt tgccaactta acaatataag 60 atgacaatat gtcatttgtg tatgaacaat agaatcataa aataatttgg taagtttttg 120 gtaagtaaac gttcagttta taatagtttc ataaataact agcaatttct gaaatttgta 180 atttttggct gtgtaccaaa ataacccgct gctattttct ttgcagtcca aataacctta 240 aatcatggaa atccaacgtt ctggagtaaa gtacaaaaat ttaagtgacg aggagagaaa 300 tgccatcctg caaacacttc tggtgcgatc taacaataag aaattgccaa aaggaacaat 360 taaagatgtt gctggagaat ttaaaagtcc ccgaaaacaa tttcaagaat ttgggcgaga 420 gccggccaat gtacactcca gggcatgaaa tttgccaacg ttagttccag aattaaagga 480 aattctggtc gcaagaagtt atgtcgacaa gaacttcaaa acaaaatttt agcacttcca 540 aaatgcgaca gaaggaacac gaggacaata aataaaaaaa ccgcaatttc cttgggtaca 600 ctgtgtaatt taacaaagga aaaggtaatc aagagacaca gcaactctct gaggccaatg 660 ttaacaccgc aaaacaaaaa agggcgtttg gagtttgcct tatcacatat tgaccagacc 720 acaaatgatt tcaaatcgat ggagaatatc gtacatgtgg acgaaaaatg gttctatctt 780 tcaagagata aaatgaattt ctatttactc cctgaagaag aagtacccca tcgtcaagta 840 caaagtaaac gattcatatc taaagtaatg tttctcgctg ctttggctaa accccggtgg 900 gactatcacc gaaaaactaa atttgatgga aaacttgggc tttggccgat tgttgaatat 960 gaacctgcga aacgaaattc taaaaatcga ccaaaaggga cattcgtgac aaaaccaatc 1020 gaaataaatc gagaaattta tacaaagctt attatagaaa aagttattcc tgcaattaaa 1080 acgaaatggc cgcaatatca gaaatctatg gaaattaaaa tacaacagga taatgctcgt 1140 cctcatgcta aagttgatga tcctgaaatt attaaaactg gacatacgga tggatggaat 1200 attaacttaa tttgtcagcc tcctaacagt ccagatttca atgtacttga tttaggttac 1260 ttcaattcca ttcagtctat tcaatatcaa agttctccga aaaatattga cgatcttatt 1320 gccgttgttc aacgtgcctg ggaggatcat gctgtcgaaa aagtagaaaa tattttcttg 1380 actcttcaaa aagcgtttga aagttcaatg ttagctggag ggggcaataa ttatcaacta 1440 gcacatgttg ggaaagaagc attgcgaaga aagaatcaac ttccacaaac attaacatgc 1500 aatttggaag caattgaaac tgctcgcaat gtattacgag atatgtaaaa taatataagt 1560 agtttgtaat aaaatattta tttcataaat aaaaaagttg tctaaagcga aattgaagtt 1620 aatatattgt atataaaact aacgagacaa tcatttttga acggaggtag 1670 // ID Copia-123_AA-LTR repbase; DNA; INV; 268 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-123_AA_; KW Ty1_copia_Ele112; Copia-123_AA-I; Copia-123_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-268 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 268 BP; 74 A; 66 C; 52 G; 76 T; 0 other; tgaggccgac aacaattgct agatcatcgg ccaaactgaa cgaaccccca atcggccgaa 60 ccccttcacg cgtgctgaca gatggaggta gcaaaacgga atggaaaata tttcggcgcg 120 caatcgcacc gactgctgcg atcattcttt ttcatctttt ttattatcta gttacctctc 180 gtcgagtaaa gacacgttgc gttattaaat aagtatttta tgttcaaatc taattccggc 240 gttattattc cattgagaac tctgccca 268 // ID Tx1-7_BF repbase; DNA; INV; 5187 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-7_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-7_BF; KW Tx1-7_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5187 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5187 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 844-844 (2009). XX DR [2] (Consensus) XX CC Some copies are inserted at the same site in copies of REPD-1_BF CC (pos. 380 in the consensus). XX FH Key Location/Qualifiers FT CDS 85..1386 FT /product="Tx1-7_BF_1p" FT /note="ORF1." FT /translation="MRRDNSICVTYTAQKDKLSEQETWALLDSVGIPTSSL FT QTVQALNATSFDLTFRTLHDRKLYGGKLSALQYLDVRVYGSAETVVTATRI FT PDEFDDNHVRYWLSRYGQVLASRMTTYRDRPTVRNGNRQYKVILKPGANIP FT SSWRLADGRLAFFRYVGQQRTCLKCYQEGHEAAQCTVIFCNKCQLIGHTSA FT ECSNQVVCNQCGKEGHTARRCPTSYANKLLPDNKWTSGPAAEATVASAKTP FT GQSEQSQVNETVSSGKSQSEQTNETDSSDSGSEESGEDDSAEEFASSDELT FT EKERELERELLLEATQPSPSSAERIEPNGEQETTHPRNKKRKKRKGKAKNG FT RKQNNEVEEPQTPVKNVVAKKKAPHTGDPSSDAMEVDQQTSKRPAPSDEEV FT SPKRKSEPSTVTKGPHSPGYDLPLSGELAIDEFTSSGESL" FT CDS 1389..4892 FT /product="Tx1-7_BF_2p" FT /note="endonuclease and RT." FT /translation="MMPQNMHRLNAFFMLTMVVIATLNVCGMRNPNKRKQV FT YDFCRQHKFDIVCLQESHITHVDETTKWEKEWGGQALWSLGTPNSRGVGIL FT LSPSLDLKITKKFCDDEGRVVSAVLADGTRSLRVCNVYAPNDAAERSTFFR FT GLGNFLSNSESTVLCGDFNCIESPSLDKRGGNPSRSSGGSKILKNLCASHN FT LHDSWRQLNPDGRVYTWRQPDNAVACRLDRLYSSNNLPVSGHFFRPAYFTD FT HDCVGIDVLPTAKKKSSMWKCNVSILKNKEVQSDFTASYKDWQTLKLGYPS FT LRSWWDDVKDRTRSLFIKHSCKIAKDKREARYQLNNTLGDLTNQLNAGAVS FT PDVLRQYKATKDALSELATSEAQGAIVRSRIKQFEEGEKCTSFFLKQAASN FT GKKKRICAIRDSMGQVVHEDDEIANVFYEFYAELFAEEPVDPVASDQLCDN FT LENTLTPEEAQALEAPLSTDEMLNSVRSMQNNKCPGSDGLPKEFYTTFWHL FT IADDLLKVFQEALDAGELSSSQKFGVLTLLPKKGDELDPKNRRPISLLNTD FT YKILTKVLNNRLQSVIASVLHPDQTCGVPGRSIQCNLRLTRDIVVYANNNN FT IECAIVSLDQAKAFDRVNISFLQQTLARMGFGPSFRKWVSILYNDISSSVL FT VNGSLSSPFALTRGVRQGCPLSPLLYAISLEPFAATVRKDSEVHGVKLPGG FT YEAKMSLYADDNSAFLTSDDSIVRLFDLVGLYNRGSGSMLNLDKCEGLWLG FT KWRNRPDTPVQIKWTSGSIRLLGGVFGNIDMPLTNWKEGKRKFVETLQGWG FT ARSLSFTGKVTVANSLATAKLWYLASVFTPPDSVMKEINTALFAFFWDKKR FT EMISRNTIYLPPEKGGWGLIDIAKKAKCLYSRPFSDIMSQEDLPWVQLARY FT WLGIFIRRFDPSTWSNNMPHSPNAPPHYKALQSLVSELVEASPSKKWDPGC FT TTRSLYDSLLKKDAHIPSVCLNKRSVPWPLTFQSVQHPLLENRLKDTAWLV FT AHHALKTNALLCSNWGYIRSGTCPRQNCKGHETVEHAMWYCSEVLLTWTWV FT EGFLRRWVMSDFRINKQMVIYGILPPSWKKCKIDAITYTLAVARKRIWSSR FT CEALFDKTFSTHVEMVPYIKECLRSRIKLDFVRLNETTFMSLWCSVQWARV FT DGGKLRLRF" XX SQ Sequence 5187 BP; 1397 A; 1305 C; 1267 G; 1218 T; 0 other; ggctctaaaa ggggtttctg gagaccctag aaggaacctt ccatatactc cacagagaaa 60 tagactgatt tcggactagg caaaatgagg agagacaact caatttgcgt tacgtatacc 120 gctcagaaag acaagttatc tgagcaggag acctgggcac tcctggactc tgtggggata 180 cctacatcca gtctccagac agtgcaagcg ctgaacgcta cgtcttttga tttgaccttc 240 cgtacgcttc acgatagaaa gttgtatggg ggtaaactct cggcactaca gtacttagat 300 gtccgtgtgt acggaagtgc tgagacagtc gtcacagcaa cccgtatacc ggacgagttt 360 gatgacaatc acgtacggta ctggctatcc cgttatggac aggtcctggc ttcccgtatg 420 accacctaca gggacagacc tactgtccgt aatgggaaca ggcagtacaa ggttatactc 480 aaaccagggg caaacatccc gtcctcttgg aggttagcag acggtcggtt agcctttttc 540 cggtatgtcg gacaacaaag aacatgtctg aaatgttacc aggaagggca cgaggcagca 600 caatgtactg tcatcttctg taacaaatgc caactgattg ggcatacctc tgcggaatgc 660 agcaatcaag tagtctgtaa tcagtgcggc aaggagggac acaccgcacg tcggtgccct 720 acttcatatg ccaacaaact gctgcctgac aataagtgga ccagcggtcc ggctgctgag 780 gcaaccgtgg ccagtgctaa gacccctggt cagagtgagc agtctcaagt aaacgagact 840 gtctccagtg gtaagagtca gagtgagcaa acaaacgaga ctgactccag cgactccggg 900 tcggaggaga gcggggaaga tgattccgcc gaggaatttg cttcctctga cgaactgacg 960 gagaaagaaa gggagctaga aagggagctg cttcttgaag caacccaacc atctccgagc 1020 tctgcggagc gcattgaacc aaacggggag caagaaacaa cccatcctcg caacaagaaa 1080 cgcaagaagc gcaagggcaa ggccaaaaat ggccgcaaac agaacaacga ggtcgaagag 1140 cctcagacac cagtcaaaaa tgtggtggcc aagaagaagg ccccacatac tggagacccg 1200 tcatctgacg cgatggaagt cgaccagcag acctctaaaa ggcctgcgcc ttccgacgaa 1260 gaagtcagcc caaaaagaaa gagtgagccg tcgacagtaa cgaagggccc tcatagtccc 1320 ggttacgacc ttcccttatc aggagagcta gccatcgacg agttcacgtc atctggagaa 1380 tctctataat gatgccccaa aatatgcata ggctcaatgc cttctttatg ttaacgatgg 1440 tggttatagc aaccttaaac gtttgtggga tgcgcaatcc caacaaacgc aaacaggtat 1500 acgacttttg cagacaacac aaatttgata ttgtttgcct gcaagagagt cacattacac 1560 atgtggatga aactactaag tgggaaaagg aatggggagg gcaggcccta tggagccttg 1620 gtacgcccaa ttctagaggt gtaggcatcc tgctttcccc atcccttgac ttgaaaataa 1680 ctaagaaatt ctgcgatgat gagggtcgcg tagtttccgc agtcctggct gacggcacgc 1740 gctctcttcg agtgtgcaac gtatacgcac cgaatgatgc agctgaacga tccacatttt 1800 tccgtggtct aggcaatttc ttatccaact ctgagtccac ggtcctttgt ggggatttca 1860 actgcattga gtctcccagc ctagataaaa gaggggggaa tccaagtcgt agttctggag 1920 gttcgaaaat tctaaagaac ctctgtgcta gtcacaacct acacgactcg tggaggcagc 1980 tgaatccgga tgggcgcgta tatacgtggc gtcaacctga taacgctgta gcatgccgac 2040 tggatagact ttactcatcc aacaatctgc ccgtctctgg ccattttttc cggcctgcat 2100 acttcacaga ccacgattgt gtgggaatcg atgtgcttcc aacagctaaa aagaagtctt 2160 ccatgtggaa gtgcaatgtt tccatcctaa agaacaaaga agtccagtcg gacttcactg 2220 cgtcctacaa ggactggcag actttgaagt tgggttaccc ttccctgaga tcttggtggg 2280 atgacgtcaa agaccgaacc cgctccttgt ttattaagca ctcatgtaag atagccaaag 2340 acaaaagaga agctcgttac caactgaaca acactttagg ggatctgacc aaccagctta 2400 atgctggtgc ggtctcacct gatgtcctga gacaatataa ggccacgaaa gacgcactgt 2460 cagagttggc aacctctgaa gctcagggcg ccatcgtcag gtcccgaatc aaacagtttg 2520 aggaggggga aaagtgcacg tccttctttc taaagcaggc cgcttccaac ggcaagaaga 2580 aaaggatctg tgccatacga gactcaatgg gacaagttgt ccatgaggat gacgaaatcg 2640 caaatgtttt ctacgaattc tacgcagagc tgtttgctga ggaaccagtg gacccggttg 2700 cctctgacca gctttgtgac aaccttgaaa acacattgac cccagaggag gcccaagcat 2760 tggaggcccc tctctcaacc gacgaaatgt taaactcagt ccgcagcatg caaaataaca 2820 aatgcccggg cagtgatggt ctacccaaag agttctacac gaccttctgg catctgattg 2880 ccgacgacct cctgaaggtg tttcaagaag ctcttgatgc tggagaactt tcctccagcc 2940 agaagtttgg ggttttgaca ctgttgccaa agaagggaga cgaattggac cccaagaaca 3000 ggagaccaat ctccctccta aacaccgact acaaaatttt gactaaagtt cttaataacc 3060 gccttcagtc tgtaatagcg tcggtgttgc acccggacca aacctgtggt gtaccaggaa 3120 ggtcgatcca gtgcaacctc cgactcacac gagacatagt tgtttacgca aataacaata 3180 acatcgaatg tgctattgtc tcccttgatc aagcaaaagc ctttgaccgt gtcaacatct 3240 cctttctaca acagactctc gcacgaatgg gatttggacc ttcctttcgg aagtgggttt 3300 caatacttta caacgacatc tccagttccg ttttggtgaa tggctccctg tcgtctccct 3360 tcgccctaac aaggggagtg cggcagggat gcccactttc gccactgttg tatgcaatct 3420 ctcttgagcc gtttgcggcg acagtccgca aagactctga ggtgcatgga gtaaaactcc 3480 caggcggcta cgaagccaaa atgtctctgt atgcagatga taactctgcc ttcctgacat 3540 cagacgactc tatcgtcagg ctgtttgacc ttgtcgggct gtacaacaga ggctcaggct 3600 caatgttgaa cctagacaag tgtgaggggt tatggctggg gaaatggaga aatagaccag 3660 acacccctgt acaaataaaa tggacatctg ggtccatccg tctgcttggt ggggtctttg 3720 gcaacattga catgccccta accaactgga aggaaggtaa acgcaaattt gttgaaacct 3780 tacaaggatg gggagccaga tccctctcct ttacaggcaa agtgacagtg gcaaactctc 3840 tagccaccgc caaactgtgg tacctggcgt cggtgttcac ccctcctgat tcggtcatga 3900 aggaaatcaa taccgcactg ttcgcctttt tctgggacaa aaagcgtgag atgatcagcc 3960 gaaataccat ctaccttcct cctgagaagg gcggctgggg cctaatcgac atagcaaaga 4020 aggcaaaatg tctgtacagt cggcctttct cggacatcat gtcccaggaa gacttacctt 4080 gggtgcagct cgccaggtac tggctcggta ttttcattag acgtttcgac ccgtcaacat 4140 ggtccaataa catgccccac tccccgaatg cccctcccca ctacaaagct ctacagtccc 4200 tggtatctga actcgtagag gcatccccct ctaaaaagtg ggaccccggg tgcactacgc 4260 ggtctctata cgacagcctt ctaaagaagg atgctcacat tccaagcgtc tgtcttaaca 4320 agcgttctgt tccctggcca ttaaccttcc aatctgttca acacccactc ctggagaaca 4380 gactgaagga cactgcatgg ctggttgcac accacgccct caaaacgaat gccctgctct 4440 gctcgaactg gggttacatt cgctcgggaa cttgtccaag gcagaattgt aaaggccacg 4500 aaactgtaga gcacgccatg tggtactgca gcgaagtgct gctaacatgg acttgggtgg 4560 agggttttct caggaggtgg gtcatgtcag acttccgaat aaataaacag atggttatct 4620 atggcatcct gccaccctca tggaaaaaat gcaagataga tgctatcacc tacaccctgg 4680 ccgtcgcgag gaaacgcata tggtcctcac gttgtgaagc actctttgac aagacctttt 4740 cgactcatgt cgagatggtc ccatacatca aggagtgcct ccgctcaaga ataaagcttg 4800 actttgtacg tctgaacgaa acaacgttta tgtctctgtg gtgctcagtg cagtgggcga 4860 gggtcgacgg gggaaaactg cgactacgct tttaatcccc tgcttccccc ctgtagtcgt 4920 atcagtctgg tatgaactgt ctgtcttgta cgtttcgtat gtctttttta tttggatgtg 4980 tcaagttggc aggagtgggt tggtgggagc tcggcggaca agtaacctgt tgtgggtagc 5040 ttgacacatc cctttttctt tccttcttct cttgtatgtt ctttgaaatg atgtaaaaaa 5100 aaaaatgaaa acagtgaaaa aacgtcttgt ctctcttgtc tagttttctt tcttttttga 5160 ttttgttata cccccattaa aaacggg 5187 // ID CR1-56_AAe repbase; DNA; INV; 5007 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-56_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5007 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1143-1143 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 376..1218 FT /product="CR1-56_AAe_1p" FT /translation="MNSNCKQCSEPIKTIEFIQCSGFCHQSAHLKCIGLKR FT ANMDLVREHKNILWFCDRCIDNLEYLKQNPLKTKQDVVEAVSDVFRESLNV FT LKDEILETKELAKSLTDKKLSNDFSGQVRARTSWPSIKRSRDTATPKARPD FT SRLVGGTKSIEKGSLTVETVAKPPEKFWLYLSRIARHVTEDDVTELVRNCL FT QTQQPIEVRKLVRKDADLNQFAFISFKIGIEKELKETALDPSVWPKGIFFR FT EFENQQTERDFWGPPKIPRLDIRTPSATLSTPVTQITAMQ" FT CDS 1107..4874 FT /product="CR1-56_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="KPANRTGFLGTSKNSSIGHPDTFRYFVHSSHSDNCNA FT IIYPPPSTSEAAYLGTTLVQYGSNFQAYGTDHQSRRIVESHLEASDPLNSV FT LPFADIDHHSRLDPAVEVCDEVFQIACEGKYYLESNDIHILRDKSYLDSSS FT TLPCCNHGVSSGPANHSSPERKPSTSTMEAPKPPSTVVPILPAHNSRSGPV FT FGLDAGVFQTTNEGKYIKSPTTSLPESIPIFXAAPAISGPTRPISDDNRLE FT NAAMCIDRADSSFPEADAPVPENRTMDIVVPPTPADTRPSESVIAPFANIT FT TSTCSADGMVNIYYQNIGGLNSSLEDYLMASSDCCYEILVLTETWLSANTI FT SSQVLDQQYEVFRCDRGPHNSSKARGGGVLLAVRRGIKAHLIEDDAWNGVE FT QLWLAIELADRNLYTCVIYVPPDRTRDDSLLDAHMSSVQSIVDMASPKDDI FT LIIGDFNLPGICWKSSSNGFLYPDVGHSTLNVASNRFLDGYSSATLRQINH FT IVNENGRSLDLCFVSSRDVAPSIALAPAPLVKFVPHHPPLLVSINSKIIED FT CKEIPSSVFYNFKNADYDSIIRVLQTIDWANILDEHDVNSAALTFSHILSY FT VIDRHVPKKTVSVSNAAWQTSELRDLKRQKKAALRRYSKHKSQEQRDHYVR FT LNSLYKSASRDCFRRYQTGIQRRLKNNPKYFWNYVKKQRNEYGLPSSMKWN FT DTVTSDPQAICDMFASKFSSTFSNEALSDQQITSASANTQLTGLVMPDVTV FT SNESIVEASSKLKASTCLGPDGIPSLFLKNCIDCLLVPLRTIFNKSLSSGV FT FPACWKNAYMFPVHKKGDRKNIENYRGISSLCAVSKLFELVIVDRLFFHCK FT QFISPNQHGFFSGRSTASNLLCLTTYVAESMRLRSQTDVLYLDLTAAFDKI FT NHEIAVAKLERFGVHGALLSWFKSYLNGRRSQVVIGESKSAYFPIPSGIPQ FT GSHLGPLIFLLYFNDVNTVLDGPKLSFADDFKLYFKINSDADAWFLQQQLD FT AFAQWCHSNRMTVNPEKCSIITFSRLKNPKLYNYTFSGVILERVDRIKDLG FT VILDCQLTFRHHLSYMLHKASASLGFIFRVTKTFTDIHCLKALYCALVRSI FT LEYCSTIWSPYYQNGAERIESVQRRFVRFALRRLPWRDPFHLPSYESRCRL FT IGLDTLRTRRDVAKALLVADVLQGRVDCSEILQQVNLYVRPRALRITSMLR FT PQTQRTNYGVNCAITGLQRTFNRVDAEFDFNVSRDVLRGRFFTVFR" XX SQ Sequence 5007 BP; 1372 A; 1148 C; 1047 G; 1438 T; 2 other; cacttctggc aaccactgac gtgtacgcaa ctttgttgat atttcgtcgt cgttttattt 60 tcgctattta tgtgtttatt tgattcgtta attcatgtcg ttattattgt tgtgaactgc 120 tttctcgttg ctatttgatt tcaagaccgg ctttcttcgt tcgacgtgtc gtagtgagaa 180 aaaacatatc taattttcgg tccattagga gtgtttttgt tgtgcgctgc cgttacgatc 240 tggacgtacc tctctggctg tattcgtatt ctcttccgaa gcgacatctg gtggaaacca 300 acttcaatct gcaccaatac taattcacgg caaagctctt cttgtgcaac tttccatctg 360 ttggccggta tcgaaatgaa ttcaaattgt aagcagtgct ctgagccgat caaaaccatt 420 gagttcatac agtgcagtgg tttttgtcat caatcagcgc acttgaaatg tatcggcctg 480 aaaagagcta atatggacct cgtacgagag cataagaata tcttatggtt ctgcgaccgt 540 tgcatcgata atcttgaata tttgaagcaa aatccactga aaacgaagca ggatgtcgta 600 gaagcagttt cagatgtatt tcgcgaatct ctgaatgtat tgaaagacga gattctggaa 660 accaaagagc tcgcaaaatc tctgaccgat aagaagcttt ctaacgattt ttctggtcag 720 gttcgagctc gtacttcatg gccgagtatt aaacgatccc gcgacacagc tactcctaaa 780 gctcgccctg attccagact tgttggtggc accaaatcca ttgaaaaggg gagcttgact 840 gtggagaccg ttgccaaacc accagaaaaa ttctggttat atttatccag aatcgctcgt 900 cacgtcacgg aggatgacgt taccgaacta gtcagaaatt gtctacaaac tcaacaacct 960 atcgaagtgc gaaagcttgt tcgcaaggac gcagatttga accagttcgc attcatctca 1020 tttaaaattg ggattgagaa ggagctgaag gaaactgccc tcgacccttc tgtctggccc 1080 aaagggatct tcttcaggga gtttgaaaac cagcaaacag aacgggattt ttggggacct 1140 ccaaaaattc ctcgattgga catccggaca ccttccgcta ctttgtccac tccagtcact 1200 cagataactg caatgcaata atctacccac caccatcaac atcagaagca gcttatctcg 1260 gaactacctt ggttcagtac ggcagcaatt tccaagcgta tggaacggat catcaatcga 1320 gacgcatcgt agaaagccat ttggaagcct ctgatcccct caactcagtc ctgccatttg 1380 ctgacatcga tcatcacagt cgtctcgatc ctgcagttga ggtgtgtgac gaggtcttcc 1440 agatagcttg cgaaggcaag tattatttgg aatcgaatga catacatatt ctgcgtgaca 1500 aatcatacct tgacagctca tctactctac cctgttgtaa tcacggagtt tcttcgggac 1560 cagctaatca ttcatcaccg gaacgcaagc cttcaacaag taccatggaa gccccaaagc 1620 cacccagcac agtcgtgcca atcctgccag cgcacaacag tcgttccgga cctgtgtttg 1680 ggttggacgc aggggtcttc caaaccacca atgaaggcaa gtatattaaa tctccgacca 1740 cgtcactgcc tgaaagcatt ccaattttca gkgctgctcc agctatatcc ggtccaacac 1800 gaccgatttc ggatgacaat cgtttggaga atgcagctat gtgtatcgat agagcagatt 1860 cctcgttccc tgaagcagat gcacctgttc ctgagaaccg caccatggat attgtcgtgc 1920 ctcccacgcc tgccgatact cgtccgtcag aatcagttat agcccctttt gctaatatca 1980 caacctcgac atgttcagcc gatggtatgg taaacattta ctatcagaat attggcggtt 2040 taaattcgtc tctggaggat tatttgatgg ccagttcaga ctgctgttat gaaattcttg 2100 tattaactga aacttggctc tctgcaaata cgatttcaag tcaagtactg gaccagcagt 2160 acgaagtatt ccgttgtgat cgtggtccac ataacagcag caaagcgcgc ggtggcggtg 2220 tgctcttagc tgtacgacgg ggtatcaaag ctcatttaat cgaagatgac gcgtggaatg 2280 gcgtggagca gctgtggtta gcgatcgaac tagctgaccg aaacctctat acgtgtgtca 2340 tctatgttcc ccctgataga acacgtgatg attcattgct cgacgcacac atgagttcag 2400 tacaatctat tgtagacatg gcttcgccca aggacgatat cttgattatt ggtgatttta 2460 acttgcctgg aatatgctgg aaatcttcma gtaatggatt cctgtacccg gacgttggtc 2520 attcgactct taatgttgct tcgaatagat ttcttgacgg ttacagttct gctacgctac 2580 ggcagattaa ccatatcgtg aacgaaaacg gcagaagtct cgatttgtgc tttgttagct 2640 ctcgcgatgt ggctccatca atagctttag ccccagcacc gctggtgaaa tttgttccac 2700 accaccctcc gctgcttgtt agtataaata gtaaaattat cgaagattgc aaagagatcc 2760 cttcttcagt cttttacaac ttcaaaaatg cagattatga cagcatcatt cgcgttcttc 2820 agacgattga ttgggcaaac atccttgatg agcatgatgt caatagtgcg gcacttacat 2880 tctcccatat tctttcatat gtgattgatc gacatgtacc caagaaaact gtatcagtgt 2940 ccaatgcagc atggcagaca tctgagctac gagatctaaa aagacagaag aaagctgctt 3000 tgagacggta ttctaaacat aaaagtcagg agcaaagaga ccattacgtc agactcaata 3060 gtctatataa aagtgcaagt agagactgtt ttcgtcggta tcagaccggc atacaacgac 3120 gacttaaaaa taatccaaaa tatttttgga attatgttaa aaaacagcga aacgagtacg 3180 gattgccttc ttcgatgaaa tggaacgata cagtaacctc tgatccacaa gccatctgcg 3240 atatgttcgc ttctaaattc tccagcactt tctctaacga agctctatcg gaccagcaaa 3300 taacatcagc atcagctaat actcagttga ccggtttagt aatgcccgat gtgactgtca 3360 gtaacgaatc cattgtagaa gcatcaagta agcttaaagc ttcaacgtgt ctaggacctg 3420 atggaattcc ctccctgttt ttgaagaact gcattgactg cttactcgtt cccctgcgaa 3480 caattttcaa caaatctcta tcatctggag tgtttcctgc gtgctggaaa aatgcgtata 3540 tgtttccagt tcacaaaaag ggagacagga agaatattga aaattatcgt ggaatatcat 3600 cactgtgtgc tgtttccaaa ctcttcgagc tggtcatagt agaccgacta ttctttcact 3660 gcaagcaatt catcagccct aatcagcatg gattcttttc cggacgatca accgcctcta 3720 atctattatg tctcaccaca tacgtggctg aaagcatgcg cctaagatca caaacagatg 3780 ttctatattt ggacctaact gcagcattcg acaaaatcaa tcatgaaatc gcggtagcga 3840 agctcgaaag gtttggtgtt cacggtgcac ttttaagctg gtttaagtct tatcttaatg 3900 ggcgtcgttc acaagtcgtc attggtgagt ctaaatcagc atattttccg attccatccg 3960 gtattccaca aggcagtcat cttggaccgt tgatattcct gttgtatttc aatgacgtca 4020 acaccgttct tgatggacca aaactctctt tcgctgatga cttcaagctg tattttaaaa 4080 tcaattctga cgccgatgcc tggtttttgc aacagcagtt agacgctttc gctcaatggt 4140 gtcacagtaa tcgtatgacc gtaaacccag agaaatgctc gattatcacc ttttcccgtt 4200 tgaaaaaccc gaaattgtac aactacacct tctctggcgt cattctggaa agagttgatc 4260 gaatcaagga tcttggagtg atacttgact gtcagctgac ttttagacat cacttgtcgt 4320 acatgttgca caaagcttca gcatcccttg gatttatttt tcgagtgacc aagacgttca 4380 ctgacattca ctgtttgaag gcgctctact gtgcgttggt tcgttcaatt ttggagtatt 4440 gctcaactat atggagtcca tactatcaaa atggagctga aaggatagag tctgtccaaa 4500 gacgtttcgt tcgattcgct cttcgccggt tgccgtggcg cgacccattt cacctcccga 4560 gctatgaaag ccgatgtcgc ctaattggct tagacactct cagaacacgc agagatgttg 4620 ccaaggccct tctggttgct gacgttttgc agggccgagt cgattgctcg gaaatcctac 4680 agcaggttaa tttatacgta cgaccaagag ctttgcgtat cacctcaatg cttcggccgc 4740 aaacccaacg caccaattac ggagttaact gcgccataac cggtctacaa aggacgttca 4800 acagagtaga tgcagaattc gattttaacg tgtcacgtga cgtgcttcgt ggtagatttt 4860 ttactgtttt cagatagttt ttaagtgttg tgacttttaa atattttttg tttgtagtct 4920 gtgtattata accaattatt aataaccatc attggggctc aatagcctgt tgatgaagaa 4980 gaaataaata aataaaataa ataaata 5007 // ID Gypsy-13-LTR_HM repbase; DNA; INV; 151 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-13-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-151 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 399-399 (2009). XX DR [1] (Consensus) XX CC The termini of this element are 5'-TG, TA-3'. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 151 BP; 45 A; 12 C; 19 G; 75 T; 0 other; tgtagaaata gtgacattag tcatttcttg tttttatatt gtttttatat tgttttatat 60 tgttgttgtt tacacttatt gttatttcat ttagaaacta acattgttta agatataatt 120 ataattgaga acaacgacta acttttgttt a 151 // ID Proto1-4_NG repbase; DNA; INV; 5248 BP. XX AC . XX DT 21-MAY-2009 (Rel. 14.06, Created) DT 21-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Proto1-4_NG is a non-LTR retrotranspsoson from the Naegleria DE gruberi amoeboflagellate genome - a partial consensus sequence. XX KW Proto1; Non-LTR Retrotransposon; Transposable Element; KW Proto1-4_NG. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RP 1-5248 RA Kapitonov V.V. and Jurka J.; RT "Proto1 non-LTR retrotransposons from the Naegleria gruberi RT amoeboflagellate genome."; RL Repbase Reports 9(6), 1147-1147 (2009). XX DR [1] (Consensus) XX CC Proto1-4_NG is a very young familiy of non-LTR retrotransposons CC that belongs to the Proto1 clade of non-LTR retrotransposons. CC This clade includes also the Proto1-1_NG, Proto1-2_NG, CC Proto1-3_NG and Proto1-5_NG families from the the Naegleria CC gruberi amoeboflagellate genome. The Proto1 elements code for two CC ORFs. The ORF2-encoded proteins are composed of the apurinic CC endonuclease, reverse transcriptase and ribonuclease H domains. CC It is likely that the Proto1 clade is a sister clade of the L1 CC clade. Proto1 retrotransposons are characterized by 15-18 bp long CC target site duplications and by a weak target site preference: CC 5'-CATTTTTTTNNNNNNNN-retrotransposon-ATTTTTTTNNNNNNNN-3'. XX FH Key Location/Qualifiers FT CDS 1459..5205 FT /product="Proto1-4_NG_2p" FT /note="contains the APE endonuclease, reverse FT transcriptase and ribonuclease H domains." FT /translation="NIMPMINPKCNFRVGCFNVNGLIKTETKEILTHAIKG FT ICHQHNIQIISITETHINKLEDQQRAFESGRGHFFFNSLMGTRKACKGTAL FT LHFINNRRKRLLCNTNLIPGLLQLTKFETRGKPINIFTVYISVRNDEESES FT YDDKILSTMIEYISAHQGEYFIISGDPNINTINPTSMRDRMWCEFFDTFNF FT YESIAPDEYTYIRKTRDGTTKTHPDHIFVSNNIIIRNKKILPSITKNDHQP FT FYIDIELETNDTWRPYISPTKIQKFIESINREVPDSFNELDNKIKSFIIEN FT LRKNDRIRIKTNESELRQEIKSKEEELNDLINSENNDTDQIERVRSEIKDL FT YHRCAEEIKRKISESFNRYDSSLIYQFDRHIKEKSINWKQVDFEEEELIDY FT FEQKFQSKTDSIQFNCAPSTISKGPSKIKITMNSVKKAIQRMKSNTPGLDF FT ISFNIIKKLSDDNLKLLVEKFNECLQTRNIPLEWKQGWVKLVPKREVKKLN FT DIRPITILPIYYRLLFNVIAFKLRRWASRNINIRQQAFISHRSTMNHGIVL FT SALAHKANREKDPLLIVNLDIEGAYDSVELKVIELALHHCKFPNELIEFIL FT NSYKDHLLSLEIGNSLSRPIHKTRGIPQGCPLAPLVYDCITQLIIDKCIDK FT WDIPVKPKKLKADDIGILCFADDLNLTATKNCNYNSRLKDLNRWLGKLSLK FT INPSKSVATQKPNPTKKIKKMKPKINDTKIPLEENVRVLGHYPWDDKMVKE FT DIMNKIAKFQLSLKYMPIMRMEIPNIGKVIHAKCISLFTHLSRCNIIPAAL FT GDKIDTAIRKCIRRRAHLDLRSHTHWFHLPIEYGGLGIPNCNDFIQRININ FT SILYLTNSRNPLVREAIKDGIKHAGKKKRISNIFDSILQTMKRYNIELKEC FT NDIDNDINLDQMEGHSFNIHTDGSKIENQVGFGINIYNSENRSFKYSFKIH FT EEYSNNVAEIMAILTAIKMLPNHSNANVHTDSQVAIEVLKKSYRGDFQGFQ FT EEFFGTIKDKHLNIHLVKVKGHEDPENIIVDKLAKEGAQGQNMIDITKLLS FT QNQKILINTNKKTIIFDYRKYIKDIQRKEYLDIVEDKSDLPFNQSQSLTNI FT NLKYMDQRFDTETKLSIWRNMTDSHIRQFRQVYCDNCECEVDLEHFIYYCE FT ETECFRRYFLTKFTNLTGASCKLRPHDTFWDRHNSILNLNHRGFIKALNII FT LVKSYNPKIIDNWHEIQSLLSLLVGKAHRKYYSCSLNSR" FT CDS join(14..436,444..1475) FT /product="Proto1-4_NG_1p" FT /note="Proto1-specific protein of unknown FT function." FT /translation="MSLHLQPVSLVDLIKRQTVMINTYKSIIENMLPMIKP FT PNETKNNSEVIKPTYVEVAKAPASEEKIPKIKVVAKNISESNLNENKRNGN FT LKQKENLKMSHKRQKTIDYSTSISIYCQTNITETIAEFFLSKVNIYVIIIT FT SLEFIFQLAINNRDNFIKAMKILTNQEVIKEAPAKYFASPMNFNATEHAVK FT YYTKNQKEFMEKYRLIINSNDIDIDQINVSLNYDSNKSISQYKLRVSLLTN FT TKINYLIQNKFETYRKIEEIINESSLDVSIVMSKNFTEEQVSESIKTIEQR FT SNITIKKENIKISKLLNNNYIINIRLQSDDEVYRFTKVVFPKRDPIGIMHF FT IQFVSTAEYLTKKYKVNKQPDSTSINNDSKLDRIMERLDLLEMNYTKLQEQ FT FNEINVVVKNTEKEFKNLMDKIPIWLQNQPNNIIDNQELELIGNHIDNMLE FT KMVEDEEIRDVRMAPNPQVRRKHDDIDDEDEPQLKYHAND" XX SQ Sequence 5248 BP; 2133 A; 748 C; 789 G; 1578 T; 0 other; gagttcaaaa aaaatgagtc tacacctcca acctgtttcg cttgtcgacc taataaagcg 60 gcagactgta atgatcaata cgtataagtc tattattgaa aacatgcttc cgatgataaa 120 gccccccaat gagactaaaa ataactcgga agtaattaaa cctacctatg tggaagttgc 180 caaagctcca gcaagtgaag agaaaattcc gaaaattaag gtggttgcta aaaatatttc 240 cgaatctaat ttaaacgaaa acaaacggaa tggaaatctt aaacaaaaag aaaatcttaa 300 aatgtcacac aagagacaaa agactattga ttatagcact agcatttcta tctattgtca 360 aaccaacatt actgagacta ttgctgaatt tttcctcagt aaggtaaaca tctatgtgat 420 tattattaca tccctgtaaa taagaattta tatttcaatt agctattaat aatagagata 480 actttatcaa agctatgaaa atacttacta atcaagaagt tattaaggaa gcacctgcta 540 aatactttgc ttcccctatg aattttaatg ctactgaaca tgctgtaaaa tactatacca 600 agaatcaaaa agaatttatg gaaaaatata gattaattat taattctaat gatattgata 660 ttgatcaaat taatgttagt ttaaactatg actcaaacaa atctatttca caatacaagt 720 tgagagtttc tctcctaaca aacactaaaa ttaattattt aattcaaaat aagtttgaaa 780 cctatcgtaa aattgaagaa attattaatg aaagctcctt agatgtatct attgttatga 840 gtaagaactt cacagaggaa caagtgtctg aatcaattaa aacaattgaa caaagatcta 900 atattactat taaaaaagaa aatattaaaa ttagtaaact attaaataat aattatatta 960 ttaatattag attacaatct gacgatgaag tttatagatt tactaaagtg gtttttccaa 1020 agagagatcc tattggtatc atgcacttta ttcaatttgt atccactgct gaatatctta 1080 caaagaaata taaagtgaac aaacaacctg atagtacctc aattaataat gacagcaaat 1140 tagatagaat tatggaaaga cttgatttat tggaaatgaa ttatactaaa ttacaagaac 1200 aattcaatga aattaatgta gttgttaaaa acacagaaaa agaattcaaa aatttaatgg 1260 ataaaattcc aatatggtta caaaaccaac caaataatat tattgataac caagaattag 1320 aactgattgg aaatcatatt gataacatgt tagagaagat ggtagaagat gaagaaatta 1380 gagatgttag aatggctcca aatcctcaag taagaagaaa gcatgatgat atagatgatg 1440 aggatgaacc acaactaaaa tatcatgcca atgattaatc caaaatgcaa tttccgagtg 1500 ggatgtttta atgtgaatgg tcttattaaa acagaaacta aagagatcct aactcatgct 1560 attaagggaa tttgtcacca acataatatt caaattataa gcattaccga aactcacatt 1620 aacaaattag aagatcaaca aagggctttc gaatcaggaa gaggacattt cttttttaat 1680 tcattaatgg gaaccagaaa agcttgtaaa ggcactgcat tgctacattt tattaacaac 1740 cgaaggaaaa gattgttatg taataccaat cttattccag gattacttca attaacaaag 1800 tttgaaacaa gaggaaaacc aattaatatc tttacagtct atataagtgt tagaaatgat 1860 gaggagagtg aatcatatga tgataaaata ctaagtacca tgatagagta tataagtgca 1920 catcaaggag aatatttcat tatatcggga gatccgaata taaataccat taatcccact 1980 tcaatgagag atagaatgtg gtgtgaattc tttgatactt tcaattttta tgaatctata 2040 gctcctgatg aatacaccta cattagaaaa acaagagatg gtacaactaa aactcaccct 2100 gatcatatat tcgtttcaaa taatattatt attagaaata agaagatttt accatcaata 2160 actaagaatg atcatcaacc attctatatt gatattgaat tagagactaa tgatacatgg 2220 agaccatata ttagtccaac aaagattcaa aaatttattg aatctattaa cagagaagtt 2280 cctgatagct ttaatgaatt agataacaaa attaaatcct ttattattga aaatttaaga 2340 aagaatgata gaattcgaat taaaactaat gaatcagaat taagacaaga gattaagtcg 2400 aaagaagagg aattgaatga tctcattaat tcagagaaca atgatacaga ccaaatagaa 2460 agagttagat ctgaaattaa agacctttat catagatgtg cagaggaaat taagagaaag 2520 atttctgaat catttaatcg atacgattca tcattaatat atcaatttga tagacatatt 2580 aaggagaagt caattaactg gaaacaagtt gattttgaag aagaggaatt gattgactac 2640 tttgagcaaa agttccaatc taaaaccgat tcaattcaat ttaattgtgc cccatctact 2700 atttcaaagg gcccatcaaa aatcaaaata actatgaatt ctgttaagaa agctattcaa 2760 agaatgaaat ctaacacacc tggactagat tttatatcat ttaatattat taagaaacta 2820 tcagatgata atttaaagct gttagtagaa aaatttaatg aatgtctcca aacaagaaat 2880 attcccttag aatggaaaca agggtgggta aaattagtac ctaagcgaga agttaagaaa 2940 ttgaatgata ttcgcccaat aaccatctta ccaatttatt accgattatt attcaatgtt 3000 attgctttta aattaagaag atgggctagt aggaacataa atattagaca acaagcattt 3060 atctcacaca gaagtacaat gaatcatgga atagtattat cagctttagc acataaagcc 3120 aatagagaaa aagaccctct attaattgta aacttggata tagaaggtgc ttatgattct 3180 gttgaactga aggtaattga attagctcta catcattgta aattccctaa tgaattaatt 3240 gaatttatcc tcaactcgta caaagaccat ttattaagct tagagatagg aaactcatta 3300 tctagaccca ttcacaaaac caggggcatc ccacaaggtt gtccattagc tcccctggtt 3360 tacgattgta ttactcaatt aattattgat aaatgtatag acaagtggga tattcccgta 3420 aaacctaaaa aactcaaagc agatgatatt ggaattctat gttttgctga tgacctaaat 3480 ttaacagcga ctaaaaattg caattataat tcaagattaa aagacctaaa tagatggctt 3540 gggaaattat cactaaagat caatccatcc aaatcagttg caacacagaa accgaaccct 3600 actaagaaaa ttaagaaaat gaaaccaaaa attaatgata ccaaaatacc acttgaagaa 3660 aatgtacgag ttcttggtca ctatccatgg gatgataaaa tggtaaaaga agacattatg 3720 aataagattg ctaaatttca actatcactg aaatatatgc ctattatgag aatggaaatt 3780 ccaaatattg gaaaagtgat acatgctaaa tgtatcagtt tattcacaca tttatctaga 3840 tgtaatatca taccagcagc tttaggtgat aaaatagata ctgctattag gaaatgtatt 3900 agaagaagag ctcatttaga tttaagaagt cacactcatt ggttccattt acctattgaa 3960 tatggaggat tgggtattcc taactgtaat gatttcattc aaagaattaa tattaactct 4020 attttatatt taacaaatag tagaaaccct ctagtaagag aggcaattaa agatgggatc 4080 aaacatgctg gtaaaaagaa gagaattagc aatatctttg atagtattct acaaactatg 4140 aaacgatata atatcgaatt gaaggaatgt aatgatattg acaatgatat taatttggat 4200 caaatggagg gtcattcatt taatattcac acagatggat caaaaattga aaaccaagtg 4260 ggatttggaa ttaatatata taattcagaa aatagaagtt tcaaatattc cttcaagatt 4320 catgaagaat attccaataa cgtggctgaa atcatggcta tattaacagc aatcaaaatg 4380 ctgccaaatc attccaatgc taatgtacat actgatagtc aagttgctat tgaagtttta 4440 aaaaagagtt acagaggaga ctttcaagga tttcaagaag aattctttgg taccattaag 4500 gataaacatt taaatatcca tctagttaaa gtgaaagggc atgaagatcc agagaatatt 4560 atagttgata aattagcaaa agaaggtgca caaggacaaa atatgattga tattaccaaa 4620 ctattatccc aaaaccaaaa gattttaatt aatactaata agaaaacaat catctttgac 4680 tatagaaaat acatcaaaga tatccagaga aaagaatatt tagatattgt tgaggataaa 4740 tctgacttac cttttaacca aagtcagtcc ctcactaata tcaatttgaa atatatggat 4800 caaagatttg acactgaaac caaattgtca atttggagaa acatgaccga ctcacatatt 4860 agacaatttc gtcaagttta ttgtgataac tgtgaatgtg aagtggattt ggaacatttc 4920 atatactact gtgaagaaac cgaatgtttt agaagatatt tcttaactaa attcacgaat 4980 cttacaggag catcctgtaa gcttcgtccc catgacactt tttgggatag acacaattca 5040 attctcaatt taaatcatag aggctttatt aaggctttaa atattatttt agttaaatcc 5100 tataatccta aaatcattga taattggcat gaaattcaaa gtctattatc actacttgtt 5160 ggtaaagcac acagaaaata ttactcatgt tctcttaatt caagataata cctcatgatc 5220 ctattgggat cagctacact gaaataaa 5248 // ID P-3_HM repbase; DNA; INV; 3071 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3071 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 349-349 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 133..2649 FT /product="P-3_HM_1p" FT /translation="MVNKCGVVNCKGNYNKLNKCRIFKLPKDEIEKQKWIN FT VLPLRKNFIIEPSKFFICEKHWPENTEMIKLPGGYTRPACAPCIFNVPFSC FT LPTPKPVPRQSKPEDMQLNYFIKKDRINSFSDFYPDNELHKKYKNIVISRQ FT DDKFICIFMSKDFKESFVTIIVYDKCTLCSPLTFSAFKHGISVPLANILNP FT NNGLIFYSQFFEAINLVHNYNLHVNDVIEKMTNTLSENEFELNDDKKQKKL FT NFITRQLKLLSKKSFSIADYCFAIESFYNCNYDQLRDFLVLPSKRKLQSII FT SATNIDHVLQKTFENVKNNQQKNVILIVDEVKIRPTVAYSCGVLNGMARND FT PESKATSMLCVMMKCLHGGPSLMISVTPVHKLTSLYQFIVVKEAAIIVEKH FT GAIVLGSVTDNHKINQQYCKIFNRITDYQAIHPLDNQRVWFLLFDTVHLLK FT CIRNNWISEKCQKLSLDNKTIASFSDVKSLYQYEKENILKTVPLSYAAVYP FT SRLQLQNVQHVLRVFNEKVVASLKLKGAYETSSFIQQILDWWNIVNVSAKG FT QDVRLRDHNRSVQDKNSNNLQLFLDIFKCMESGHGPKRVQCVTHDTKKALV FT QTTEGLIALCKHLFSVGFDYVLLRELQSDRIEGEFSVYRQSTGANAFMAAG FT DVFSAFKKRLARFAASFLESVEVGQPNKKGHICEGPISIEDATSIETCIYD FT VSLSETEESSAAYVAGWLEKKCEDELVYPDDEPELTNEAKDFIEEVSRGYL FT TVPHECTYQLVRIGLCFVKKTKHRACCRKRLINILSIMDTYYDFGLSSKSL FT FRRLANVLLHGIQNLDKDQAKNACLYQTSIKKARLAE" XX SQ Sequence 3071 BP; 1029 A; 451 C; 529 G; 1062 T; 0 other; atcaaggact actatacgga aggacgcgca aagtcgattt cctgactccg ctagcgaagg 60 ccattatttt tatcaaagtt gaagtgttct ttaacgttcc tgtaacattt tttttaaatt 120 ttaaccataa tcatggtgaa caaatgtggc gttgtaaatt gtaaagggaa ttataataaa 180 cttaataagt gtcgtatttt taaattaccg aaagatgaaa ttgagaaaca gaaatggatt 240 aatgttcttc ctttacgtaa aaattttatt attgagccat ctaaattttt tatatgtgaa 300 aaacattggc ctgaaaatac agaaatgatt aagctgcctg gtggctatac tcgaccagca 360 tgtgcacctt gtatttttaa tgttcctttt tcttgtttac caacgccaaa gcctgtgcct 420 cgccaatcta aacctgagga catgcaattg aattatttca tcaaaaaaga tagaataaat 480 tcgttttctg atttttatcc tgacaatgag ttgcacaaaa agtataaaaa tatagtaata 540 tctcgacaag acgataaatt catatgcatt tttatgtcaa aagatttcaa agagagtttt 600 gtgacgatta ttgtttatga taagtgtacc ttgtgttcac cgctaacctt ttcagctttt 660 aaacatggaa tttccgttcc attagctaat atattgaatc caaacaatgg cttaatcttc 720 tattcacaat tttttgaagc aatcaattta gttcataatt ataatctgca tgttaatgat 780 gttattgaaa agatgacaaa tactctttca gaaaatgaat ttgaattaaa tgatgataag 840 aaacaaaaaa aattgaattt tattacaaga caactcaaac tcttgtctaa aaaaagtttc 900 tctatagctg attattgctt tgcaatcgag tctttttata actgcaacta cgatcaactt 960 cgtgattttt tagtccttcc aagtaaacga aagcttcagt ctataatttc tgccactaac 1020 attgaccatg ttttacaaaa aacatttgaa aacgtaaaga ataatcaaca gaaaaatgtt 1080 attttgatag ttgacgaggt aaaaattagg ccaacagttg cttattcgtg tggagtttta 1140 aatgggatgg caagaaatga tccagaatct aaagcaacct ctatgctatg tgtcatgatg 1200 aaatgcttac atggtggacc tagtttaatg atttctgtta cccctgttca caaactaaca 1260 tcattgtatc agttcattgt tgtgaaagaa gctgctatta tagttgaaaa acatggtgca 1320 atagttttag gatctgtaac tgataatcat aaaatcaatc agcaatactg caaaattttt 1380 aacagaataa ctgactatca ggccattcat ccattagaca atcagcgtgt ttggtttctt 1440 ttatttgaca cagttcatct acttaaatgt attcgcaata actggatttc tgaaaagtgt 1500 caaaagctat ctctcgataa taaaaccata gcttcatttt cagatgtgaa aagtctatat 1560 caatacgaga aagagaacat tttaaaaaca gttcctttat cctatgctgc tgtctatcct 1620 tcgagacttc aactacagaa tgtgcagcat gttttaagag tgtttaatga gaaagtagta 1680 gcttccctta aactaaaagg agcttatgaa acatcctctt ttatccagca gattttagat 1740 tggtggaata ttgttaatgt ttctgcaaaa gggcaggatg ttaggttgag agatcacaac 1800 cgatctgtgc aagataaaaa ttcaaataat cttcaattgt ttttagacat atttaaatgt 1860 atggaatcag gacatgggcc taaacgagtt cagtgcgtta cccatgatac caaaaaagcc 1920 cttgtgcaaa ctactgaagg attgatagct ctctgcaaac atttgttttc agttggtttt 1980 gattatgttt tgcttcgtga gctacaaagt gatagaatag aaggagaatt ttctgtttat 2040 agacaatcaa caggggcaaa tgcttttatg gcagctggtg atgttttctc tgcttttaaa 2100 aagcgtttag ctagatttgc tgcatcattt ttagaatcag tggaagtagg acaacccaat 2160 aaaaaaggtc acatatgtga gggtccaatt agtattgaag atgcaacctc aattgaaaca 2220 tgcatttatg atgtgagtct ttctgaaact gaagagagct ctgctgccta tgtagcagga 2280 tggcttgaga aaaaatgtga agatgaacta gtttatccag atgatgaacc tgaactaaca 2340 aatgaagcta aagactttat tgaagaagtt tcaaggggct atttgacagt tccacatgag 2400 tgtacttatc agctagttag aattggttta tgctttgtga aaaaaacaaa gcacagagca 2460 tgttgtagaa aaagacttat aaacattcta tctataatgg acacatatta tgattttggt 2520 ttatcatcaa agagtctttt taggcgtttg gcaaatgttc ttctgcatgg aattcagaat 2580 ttggataaag accaagcaaa aaatgcatgt ctttatcaga catcaattaa gaaggcacgc 2640 cttgctgagt agtgaatctt tctttttaga ttttttactt gaaaaatatt gtattgttta 2700 tttctttacg aagttatatt aagttcatta tgtttaaaaa aaaaaaaagt tttaacattg 2760 tttaaaagtc tctagagaat atgatgttta tatgtttatt tctttataaa taaattgtgt 2820 caataatatg ttctttgttt ttttatatga ggttttttat atcgatcttt taagcttttt 2880 aaaataagtt ttgattttgt tattaacaat aacgtaaaat tattttaatc gtatttcaga 2940 attttgattt tataaaacgt tgtttaagcc aattttagtt atttaggagt taagtgagaa 3000 atattgactg gccttcgcta gcggagtcag gaaatcgact ttgcgcgtcc ttccgtatag 3060 tagtccttga t 3071 // ID CR1-6_BF repbase; DNA; INV; 4788 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-6_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-6_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4788 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4788 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1577-1577 (2009). XX DR [2] (Consensus) XX SQ Sequence 4788 BP; 1381 A; 1187 C; 926 G; 1294 T; 0 other; caagatggcg cccgtagaca cacgtatagt ctaagggctc tctgctacac tgcggatatc 60 taacaaattt cctcattctg agcctccatt tcaccaccat gacgaccaca ccgtccacaa 120 gggaagttaa gagagtctca gaccttgttg actctcagct gagactcagt tttgacaagt 180 tgaaagaaga gttcctgggc gctgtgaagc aagacatttt agcggccata aaggctgaag 240 tcagacacga actggaccag tttaagtcgt cattccaagc tgaggttgac gagctgaagc 300 acacaattgt cgagaaagac gaagaactga ctacactaaa ggtacgccta gccgaagccg 360 aaagtcatgc cgatagaaat gaacaatata gtcggaagca ttgtctgctc gtcttcggaa 420 tcccccctcc cccatcacct gaagagagga aatcagagga ctgcgcatca acattcatgg 480 acttcaccag gaaacatctc gacgtcagcg tccaacgatc agacatcgac atagcacacc 540 gtgttggccg gcctcgtgac ggagctcaca cagtcatcgt caagttcacc aactatgcaa 600 agcgtcagga agtctacaag gccaagaaac gactaaaggg gaaaacaaac agcacgggca 660 aatcctacat catcagagaa aacttgacca aacgcaacct tgaactgtat aattctgcaa 720 aacgcctgcc ctgtgtcagc tccgcatgga cccaagatgg caagattttc gtcaaggtca 780 actgcgaagg cggcgatcca gagatacacc gaatcaccgg ccccaccgac atccaccagc 840 tcgaactgat ggctaacaag taaaacttga agttattatg tttattcacc aagctattca 900 catatgatct actttcacca tattgctttt catgtcgtcc gatccgtcga acatgattaa 960 aggcaatcag atcacctgta aggggctcag cttatccttg cactctatga agctcaacgc 1020 agtgacattc tgtttatgtt gatcatatgt ttgatgtttc atttaacgtt atgatgtttc 1080 tgggttattt gaagggcatg ttcgcccctt cttttctttt tgtagtcatt attgcccagt 1140 tatgttctta ttgcttctgt tattttgatt gtgtttttgt attgtccggc tggcgcgcca 1200 atcaatcaat acttagttac tgttactcac gaatgcatga cagagagaca agtagcatca 1260 aactcagtac aagactcagt ataactaatg cctgtaaata atcagaaaca cttttctctc 1320 tgtcacgcaa acgtgaggag tcttcccgca catgatttta ccaaactcca cgaacttgag 1380 accattgctc tagaggaggg gtacgatatc attgctctct ccgagacatg gtgtgactcc 1440 accatcccag acgtagatat gtacctcccc tcttatcaag tccccatgcg tcgagatcgc 1500 aaccgccatg gtggcggagt tgcactgtat atatccaact ctgtaccatg taaaaggagg 1560 agcgatctag aaagtgatac gtctgaagca atctggtgtg aggtggatat cggtatgaaa 1620 ctatttgttt gttgttgtta tagaccgccc ggccagagtg cggaggaacg ctccacattc 1680 tttgagcacc tgcaacaatc aattgacgta attattgaca gcggcccatc tgacaaaact 1740 actattgtct tgcttggtga tttgaatgca aattctgaca atgccgcttc cacgattgaa 1800 atgtttacat cagaaaacaa cttcgaccag ctgatttcag agcccactcg cattaccgac 1860 acatcgtcat ccttgctcga tgtcctcatt acagattcac caagtctctt tacggagtgt 1920 ggtacaaacc cgcctttgtc aaattgtgac cactcacttg tatatggcaa aatgcatctt 1980 aaaatccccc gtgttaaaac ctacaaacgt actgtatacg agtttcataa ggcagattat 2040 gaaggattga actctgagct cttacttata gattggggtt ccattgtctc ggagaacatt 2100 ccggacataa acgccttaac cgaaacattc atgacaaccc tttctacaac gatcgcgcgg 2160 tttgtaccca gcaaacaggt cacaatccga ccaagagacc aaccttggat gacgtgtgaa 2220 attagacaaa tgctaaggaa gaagaaacga atgtaccgta aatacaagaa aactaacagg 2280 ccagagatat acaacacgtt caagcatatc cgtaatagtt gtacaaagtt aatcagaaag 2340 gcgaagcaca accacagact cggcctttgt ctcgacctgg caaatccatc ctcgaaccca 2400 aaaaagtggt ggtccgtctc taaagaaata ttcatgggta agatagatcg aacattacca 2460 cccttagtcg aagatgaatt tactattacg gacgacaaga ccaaggccaa tgtctttaat 2520 tgttacttcg cttctcaatc aaatcttgac acaacaggcg cctctcttcc cttgtttaca 2580 actagaactg acacggtcat cgactgcatt atctgctccc cctctgaagt ttatgaaatt 2640 ttgtgtaacc ttgatgttaa caaggccaca ggtcctgatg gtcttggtaa caggctttta 2700 aaaaacgtcg ccccatcatt ggctaatcca ctatcaaaat tattcaatat ctccatacag 2760 cacgggtgct tcccggacat ttggaaatgc tcgaacgtag tgcctgtaca taagaaaggc 2820 gacaagcagt caaaggaaaa ctacagaccc atatccctcc tatgttgtgt atccaaagtt 2880 ttggaacgag tagttcacag ctcactttcc tcgtacttcc accaaaacaa gcttcttact 2940 gatcgtaacg ccgggttcgc ccttggagat tctaccgtca actccttaac ataccttgtt 3000 cacaaaattc atcaagcact tgaccaagga cttgaaaccc gcgcagtctt ccttgatatt 3060 tccaaagcct ttgataaggt ttggcacacc ggcttactct ttaaactacg acaacttgga 3120 gtcactgggt ctctgttggc atggatagag tcgtacttaa ataaccgaag gcaaagagtc 3180 gtcatcaatg gagtgtgctc tgaatggcta cctatcaatg caggtgttcc tcagggatcc 3240 ctccttggcc ccttactttt tcttgtattt ataaatgaca ttgttgatga cttattaaca 3300 gactcccgcc tttttgcgga tgatactgca ctactagaga taattactga ccctgttgcc 3360 tcagcggaca ggttaaaccg tgatctcacc tcagtttcgg aatgggctgc tcagtggctt 3420 gttacattca atccccaaaa aacagtattg atgacgttct ccttgaagaa aaacaaagtt 3480 cgacatcccc ccttatactt taaaggtgtt cagctcactg aagtcgaatc ccacaaacac 3540 cttggcctac atctcactag agacctatca tggtcacttc acgtgtcaat cctggtgacg 3600 aaagcaatga aacgcataaa tctactgaaa agaatttctt gctttgtccc acggaaaaca 3660 ctcgagactc tgtacaaagc catgatacgt cccttgattg agtatgccaa cgtcgcctgg 3720 tgcggtattc ttaagaaaga ctctgactca ctcgaatcag tacaggtgtc cgcggctaag 3780 gtctgcacgg gtgcactaag ggggacgaat cacagccgca tcttaacgga ggtcggttgg 3840 gacacacttg ccaccagacg ggaaaagcat gtacttattc acttctataa aatggccagt 3900 ggctctgcac caaattatct ccatgaactg gtacctccac ttatccgtga ttccacccct 3960 tatggccttc gagatgacca gaactacctc cctattgccc taagaaccga aaagtacaaa 4020 agatccttct tacctgccgc agttttctcc tggagtcaac ttcctttgtc cgccaaaaca 4080 tcaccatcta ttagtatttt taaaagagaa ctagagtctt tctttgacag cccgcccaga 4140 cacacccact actcccatgg ccacggacgc cctgcagttc acttgactag actcagactc 4200 gacttcagcc agcttaactc acacctattc aaacaccatt gtataactag ccctacttgt 4260 gaatgtggga attcaaacga gacaactgag cactacttgt tgcactgcga actatacaat 4320 agcgaaagga ttcatatgtt gcactccata tcccgtttaa acctagaaga attactctcg 4380 cctgaccccc ttaactcatc tgacaatact ttatgcaccc ttctcactat gggcagccct 4440 ttccttccac actcgataaa ttgtaacatc tttgactatg ttttcaattt catcaaaatg 4500 accggaagat ttacttaatc tgatttgaaa ttgattggta atgccagcct tgtatgcgct 4560 ttagcccgtt tttgttgtta gtatttatgt tattcttatg ttgcaaaatg acgtttatgt 4620 tcatgtcatc ttaagttata cgtttcgtta taatctcaat tttgttactt ttgtgtgtca 4680 ttgtatgtgc attgttagag cagaagacct ccataagctc tttgtgagct tcttgtccaa 4740 tgctcgaaaa acgaatgcga ataaaataaa aataaaataa aataaata 4788 // ID CR1_Ele28 repbase; DNA; INV; 4941 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele28. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4941 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4941 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 24 CC sequences with >94% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 352..1176 FT /product="CR1_Ele28_1p" FT /translation="MAKSCGKCAEAITGLDLVICRGYCRSAFHMNCTTVTR FT ALQSYFTSHKKNLFWMCDKCAELFENSHFRAISVPQNDDWPLSSLTTAITE FT LRTEIQQMAAKTTSSLTPVARSRWPVVGQRMQFKRKREEQNESRVTEACKT FT GAKQSVGNVVAVPTCVEQQDTKFWLYISRIRPDVSSDAVMAMVKANLEIEA FT DPTVVKLVPKGKDISSLTFVSFKIGIDADLKSKALDPATWPEGIMFREFED FT FGAQRSQLPSKVPRFLTPSNRDVLTPDTPVMNLT" FT CDS 1694..4867 FT /product="CR1_Ele28_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MHQLNSLPAQMWSPGCTSPASPKEASNPLITVEPLLP FT AFSSHPGPVFEHGEEVFRILNAGKYSPMLSNFSAVTSAASSVSSTPQLDHN FT AQPPTTCDGTSSAPPGKHLFSLYYQNVRGLRTKISKLRLLLSSCDYDVLVF FT TETWLRPDIVSAEISPDYVFYRCDRSSSTSHYSRGGGVLIAVKCRFKCELV FT PLSSCEHLEQVTIRVKMSHRSLYVVAVYLPPNSPTDVYSAHASAVEHIANI FT SSKKDIILSLGDFNLPNLRWQLDDAMNGYIPSNISSEPEQSLVEAMFAIGL FT QQVNNLVNINDRLLDLAFVSLPEYLDLIPPPIPLLTIDNHHMPFVLIFDEC FT EELISEPDDFIENSNFNFNACDYDQLNEALAAIDWNAVLQCGSVDEMLISF FT YDKLYGIFNEHVPRKRRKFTSTFSKPWWSAELRNLRNILRKARKRFFLTKS FT EGERENLHDVEIAYKELLLSSHENYVSTIQTNVKQNPSLFWDFVKKQKGTT FT RITSNVHLNGSHASTNKEAADLFASFFESVFSRAAPVPRRDCFAHIPTIDI FT SLPVIQFSCNEVLEAINELDVSKGQGTDAVPPLLLRNCSTTLADPITKIFN FT RSLSEKTFPALWKQACIVPIHKSGNCSSVSNYRGVSILCCMSKIFEKMVHS FT ILYNVAVPIISNTQHGFMRRRSTTSNLMSYVATLSRELEHRRQVDSVYIDF FT AKAFDTVPHVTIVNKMKHIGFPDWIIDWLRSYLSDRSAYVVINSARSRSFA FT ITSGVPQGSVLGPLLFNIFVNDLSLLLSSFVLSFADDMKMYRTIFSQLDCV FT ALQEDVNTLLIWCGDNGMRVNGTKCKVISFSRCSDKTVFEYSIDSVLLQRV FT TSICDLGVIIDEKLKFNEHIGVTAAKAFSALGFIRRHAADFTDIYALKILY FT CSLVRSILEYAAPVWCPYYSTLVLKIERVQRRFIRFALRNLPWNDPVHLPP FT YPDRCRLIGLETLSQRRVSMQCVFIFDIIQGNIDSPVLLEQIPLYVPPRQL FT RYSSMLAIPHHRTNYGHFNALDFCLREFNRFSDSFDFNVSKNVFVNRIRDR FT I" XX SQ Sequence 4941 BP; 1311 A; 1211 C; 1033 G; 1385 T; 1 other; agtcgttaat tcatagactg tttactttcg gactctgttt tgcactattc gacccgtatt 60 agtggagttt tgtgatattc tgccctgaaa gtgctgtgtt ttgtgttatt aaaaagtgtt 120 tatcgcgatt cagtaaagag cwcttccatc aacccacgcc catttttctg cgaacagcga 180 catcttgcgg tgatcactac aaattgattt ttgcacgcaa gcttgcacgg cattccgtag 240 ccgttgtgga tattcaacaa ccatcacacg caaaacgctt gtctgcttct ccaacaccgt 300 acgtgcataa ttaaaggcgt ttgctcacat cccatacgcg tggacgcaga aatggcaaaa 360 tcatgcggca aatgtgctga agcaattact ggacttgatc tggttatctg tcgtggttac 420 tgcagatctg cattccacat gaattgcaca actgtgactc gcgctttgca gtcgtatttc 480 acgtcgcaca aaaagaattt attctggatg tgtgataaat gtgcggaatt gttcgaaaac 540 tcacattttc gagcaatctc tgtaccacaa aacgatgatt ggccactatc gtcgctgaca 600 accgctatta ccgagctccg aacggaaatt caacagatgg ccgctaaaac tacatcttct 660 ttgaccccag tcgcacgcag cagatggcct gttgtaggtc aacgcatgca gtttaagcgc 720 aaacgcgagg agcagaatga atctcgtgta actgaggctt gtaaaactgg cgctaaacaa 780 tctgtcggta acgtagtcgc cgttcctacc tgcgtcgaac aacaggatac caaattttgg 840 ctttacatat ctcggattcg accggatgtt tcttccgatg ctgtaatggc aatggtaaaa 900 gccaatttgg aaatagaagc agaccctact gttgtcaaat tagttccaaa gggaaaggat 960 atcagctcgt tgacattcgt gtcatttaaa attggaattg acgcggatct gaaatccaaa 1020 gccttagacc cagctacttg gcccgaaggt attatgtttc gggaatttga agattttggc 1080 gctcaacgat ctcaactacc ctcaaaagtt ccacgttttt taactccaag caaccgcgat 1140 gttcttactc cggacactcc tgtgatgaac ttgacttaaa ctgtgcaata aatcgtcatc 1200 aaatcatcca ggaactggga cgcaataaaa ttcgctccat ggaagcccca aatccacccg 1260 ccacagtcgc gcctttgctg tcaacgttca tcagtcgtcc tggtcctgtg tgtgggattg 1320 gaggaggggt cttccaacat cctacgaacg gcaagtacaa caatagaata tacagaacag 1380 ttactgattc gtttcacggt tccagcgatt catttgatct ttgcacgtcg tttgagttac 1440 catcgttcat cgcgataccg ggatgcacgc ctgcaagcaa taaggaagcc tccaaccccc 1500 tctcacagtc gagcccatcc cgccagcgac ccgcagtcat cccggtcctg tgtttgagat 1560 cgggaaggag gtcttccaaa ttaacatcgc aggcaagtac acgcaagcat cgaacaattt 1620 ccttccattc acgtcgtctc tcgcctctag tgaagtatgc cagcgcagcc cattggatgt 1680 aaattcaccg gacatgcatc aattgaattc actgccagca cagatgtggt caccgggatg 1740 cacgtcgcct gctagcccaa aggaagcctc caatcctctc atcacagtcg agcccctcct 1800 gccagcgttc agcagtcatc ccggtcctgt gtttgagcat ggagaggagg tcttccgaat 1860 cttgaatgca ggcaagtaca gtccaatgtt gagcaacttc tctgcagtaa catccgctgc 1920 ttctagtgtt tcatcgacgc cgcaattaga ccataacgca caacctccga cgacctgtga 1980 tggcacatcc tctgcaccac ccggaaaaca cttgttctca ctctactatc agaacgtgag 2040 aggcctacgt accaaaatat caaaacttcg gttgttattg tccagttgcg actacgacgt 2100 gttagtcttc actgagactt ggcttcgacc tgacatcgtc agtgcagaaa tttcgccgga 2160 ttacgttttt tatcgatgcg accgtagcag ctctactagt cactattcaa gaggaggtgg 2220 ggttttgatt gccgttaagt gccgcttcaa atgcgagtta gttccgctgt cgagttgtga 2280 acatcttgag caagtgacta tccgagtgaa gatgtcacac cggtcactgt atgtcgtggc 2340 tgtatatctt cccccgaatt ctcccaccga tgtttactct gctcatgcaa gtgcggttga 2400 acacattgcg aacatctcgt ccaaaaagga tatcattctt tcgcttggtg acttcaacct 2460 tcccaatctg cgttggcaat tggatgacgc tatgaacggg tacattccat caaacatttc 2520 ctctgaacct gaacagtccc ttgtggaggc catgtttgca atcggtcttc aacaagtcaa 2580 caatctcgta aacatcaacg acaggttgct agacttggca tttgttagtc tgccggaata 2640 tcttgacctc attcctccac ctatcccact tttaaccatt gacaaccatc atatgccctt 2700 tgttttaatc tttgacgaat gtgaggagtt gatatctgaa cctgacgact tcatcgaaaa 2760 ttccaatttc aattttaatg cttgtgatta cgaccagctg aacgaagcac ttgctgccat 2820 tgactggaat gctgttctac agtgcggcag tgtggatgaa atgctaatca gcttctacga 2880 caaactgtat gggattttca acgagcacgt tccccgaaaa aggcgaaaat tcacttcaac 2940 attcagcaag ccctggtgga gtgcagaact gaggaatctg cgaaacatct taagaaaggc 3000 acgtaaacgg tttttcttga cgaaatccga aggtgaacgc gaaaacctgc atgatgtgga 3060 gattgcatac aaagaactct tactatcttc tcacgaaaac tatgtcagca cgatccagac 3120 taacgtgaaa caaaaccctt cgttattctg ggactttgtg aaaaaacaaa aaggcactac 3180 tcgcatcacc agcaatgtcc atcttaatgg ttcccacgct agtacaaaca aagaagccgc 3240 tgatttgttc gcgtcattct ttgaaagtgt attcagcaga gcagcacctg ttccccgtcg 3300 tgattgtttc gctcacatcc caaccatcga catctcgctt cctgttatcc agttttcctg 3360 caacgaagta ttagaagcta tcaatgaact cgatgtttcg aaaggccagg gcacagacgc 3420 cgttccccct ctcctgttaa ggaattgctc tactacgtta gctgacccta tcaccaaaat 3480 cttcaatcgt tctctcagtg agaagacatt tccagcattg tggaaacagg cttgcattgt 3540 ccctattcac aaatctggga attgcagttc tgtctctaac tatcgtggcg tttctattct 3600 gtgctgcatg agcaaaatat ttgaaaaaat ggtgcatagc atactgtaca atgtagcagt 3660 tccgatcatc tccaacactc aacatggctt catgaggcgc aggtccacaa catcaaatct 3720 catgtcctac gttgccacac tatctcgtga attggagcat agacgtcaag ttgattccgt 3780 ttatatcgat ttcgcaaagg cttttgacac cgttccacat gttacaatcg tcaataaaat 3840 gaagcatatc ggatttccgg attggatcat agattggcta cgttcgtatt tatcagaccg 3900 atccgcatac gtggtgatca actctgcgag atctcgatcg tttgccatca catcgggcgt 3960 acctcagggt agcgttcttg gaccgttgct attcaacata tttgtaaacg acttaagtct 4020 tctactgtcg tcgttcgttc tatcatttgc tgacgatatg aagatgtacc gtacaatatt 4080 ttcgcagctg gactgtgtag cacttcaaga ggatgttaat acgctgctga tatggtgcgg 4140 tgataacgga atgcgcgtca acggtaccaa gtgtaaggtt atttccttca gtagatgcag 4200 tgacaaaacc gtttttgaat actcaatcga ttcggttctt ctgcaaaggg tgacgtctat 4260 ctgcgacttg ggcgttatca tcgacgaaaa gctaaaattc aacgaacaca taggggttac 4320 cgcggccaaa gccttttccg ctcttggatt tattcgtcgg catgctgcag atttcacaga 4380 catttatgca ctgaaaattc tctactgctc actagtacgt agtatacttg aatacgctgc 4440 tcccgtatgg tgtccttatt attctacgct ggttctcaaa attgaacgtg tacaaaggag 4500 attcattagg tttgcactac gcaatcttcc gtggaatgat cccgtccatc ttccgcctta 4560 tccagaccga tgccgcttga tcggtttgga gactctatca caacgacgtg tttcgatgca 4620 gtgtgtgttc attttcgaca tcattcaggg caacatcgac agccctgtgc tccttgaaca 4680 aatccctttg tacgtgccac cccgtcagct tcgttactcc tcaatgctag caataccaca 4740 ccatcggacg aattatggcc attttaatgc acttgacttt tgcctacgag aattcaacag 4800 attcagcgat tcgttcgatt tcaatgtgtc aaaaaatgtt tttgttaata ggataagaga 4860 tagaatttaa gtaaagtaca taatattaag ttttcagtct gtgtgacgtg tagtcaaaga 4920 cggtgaataa ataaataaat a 4941 // ID BEL-4_DWil-LTR repbase; DNA; INV; 438 BP. XX AC scaffold_180701; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_DWil_; KW BEL-4_DWil-I; BEL-4_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-438 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180701; Positions 1220484 1220921. XX SQ Sequence 438 BP; 128 A; 94 C; 97 G; 119 T; 0 other; tgtttaagct gcaactgaag ccagactctc tttgctaaaa agagaaaacg aacgaagaga 60 gagagagaga gaccaagaac agcgcaagtt actttgaagt gacaaacttg gagatccgtt 120 acttcaattc ttttctttct tatttcttct atggaaattt ccagaacttt gaattttgta 180 aaagacagcg tggtttgctg tacttgaatt actcttgaaa ttaatctcgg tcgatgcggt 240 tgcaccttct gtgcccgccg acaattgtac tatagtttta ggttaagaaa ataaagcaac 300 cgattgaagt caacctggtc atctcatttc ttgccaatcg cgagtgcctc aaagccgcct 360 cggtggaaat catccgaagg ccgcatcata taaagcgagc agtttggggc tccgcaaggg 420 cgcagccgtt agccaaca 438 // ID hATm-6_HM repbase; DNA; INV; 4151 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4151 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 210-210 (2008). XX DR [1] (Consensus) XX CC This family is also quite diverged from other hAT families. Its CC closest known relative is hAT-10_SM. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 4151 BP; 1518 A; 607 C; 662 G; 1364 T; 0 other; gggtgtttca tattttatca atttttgaaa tcgaactcgc gaatcgattt ataaattgac 60 tatttgtata aaaataagtt tgagcaaaaa aaaaaaaaaa aattattttt tagggtcccc 120 gccagttcga ttttgggcca aaattgtcgg gtgttttaaa cgcctggcgg ggaccttaaa 180 atttatctaa attttttttt ttttaaatgt tatatgttgg attgaatgtt tatcacataa 240 aaggagcatg tcgaaaaaat taagaaatta gtctacagaa tattttgaaa ttggtgaaaa 300 atccaaattt tctttcacaa aaacctattt tattactcgg tggcgtataa aaaacttttt 360 tttttcctaa taaactgttt ccaaacacgg aaaaaactat aatttgaata gtttttttat 420 tagtgttaaa ttaagtctaa aaaaaaataa taattggatc catggagtaa catgccccgg 480 acactaacct taatggcccc tacttaataa acattagagc gcaataaatt taaaatccgg 540 aaatttgatc ggaaaatatt tttagaaaat gtatatacca aggtacttaa ataataattt 600 aaaatttaac aactactttc caatgcgaag acaaagtatt taaactaaat ttgcgttact 660 aataataagt gctctaatat ttttttaaat aatattcaaa atggctaaaa ttacaagact 720 aaaagctaga gcttatttat ttgaagaaga aagcctgata gaaaacttaa ctcagataac 780 ttttgcaaca aataaagaat taattcttaa ctttcagttt aaaagatcta gtaacaagtt 840 aactccatcc aaaaagttta ttggttgtac tgatgggccc gataaattcg caaagtgctc 900 cggacttgtc aattgccctg acaactgtat tattttttgc agtaaaaaaa ccatggttag 960 acggaggtta taaagatgga ttattaacgg ataaatctat aaggtaagtt ttataatttc 1020 aatttataat aaaatagtta ttataaattt gtataatcat ttgaacttaa taagctagta 1080 gatgattaag gtttttttgt agtaccacta caactaattt tttttagttg ttattacgtg 1140 tggattatag cttaataata caaaatataa ggtgcatctt aaagttaata tttctatttt 1200 atttaatgta ttatcgtaat tttttcagga ataatattac tcgtttaatc gatagttggg 1260 agacattaaa gaaaagtaag aataaggact ctaaagcagc agagaaatct agggaagaat 1320 tcaaaaagaa gggggagcag gtattttgga ttggtaaaga taatattaag gaaatcttaa 1380 aaaaaacgag ccttgataaa ctttcatatt atgctgatat caaattttta aaagaccaaa 1440 agtccagtcg tctaatgact ttggggagca aagacatgag atataaaacg gtaaataaag 1500 ataagaagcc tttaaaaagt tgtaatatta aacagttacc ccaaatagac gaatcaacta 1560 cagatctaga gcttttctca gatatcagtg acgtgagtac agaatcagat atttcttttg 1620 atttatcgca acccggccca agtaacgcaa aacaagagtt aaggaaaggt tttgttaaag 1680 atattgccat gacatcagtc tcaaaaaata tttcatcaag agatctagtg catgtatgta 1740 cggatttaat cgtaagttca ggcggaaatg tggctgattt ttcagtgtct cattcaacta 1800 tttggcgagc tcaaaaaaaa tctattcggg aaaatgctga acaatataag aaaaatgtaa 1860 aaattgctac cgctaaagct acttttccca taatagcaca ttttgatgga aaaattatcg 1920 aagacatcac agaaggtata aaatcaaaaa gagatcgatt tgcggtctcg gtaaatattg 1980 atggtaaaat gaagctcctt ggcattccag caattgatcg tggtactgga caagctcaat 2040 acgatgcttt agttaaaata ctggatgagt atggaatatg tgatgacgtg aaaggtttat 2100 gttttgatac aactgctaca aatacaggaa gactttctgg tacaaatgtc aggtttagtc 2160 ataaacaaaa ctcaattcta ctagagctgg cttgcaggag gcatgtttat gagttacatc 2220 taaaacactt ctgtgaaaga cagtgcagtg gaaaaactaa atctccagaa aacctaatgt 2280 ttaaacggtt ccaattaaac tggaatgaca ttaaaagtag cattgactct tcaaagtttg 2340 taaaatatga tacccaatct attgctacca ctttcttaga aatccataga ctcgatgctg 2400 taaaatattg cgagattgct ctaaaaaaaa acacttttcc cagaggagac tacaaggagt 2460 tattgaaact gactctaatg tacttatgcc ctgaaagaga ttttcaaatt caagctccag 2520 ggtgcgtgtc tcatgcacgt tttatgtcca aagctattta ttatctcaag attcagatat 2580 tgagtttgca actttcttat gaattgacag ataatcaaaa gaacgaagtg cagtctacgg 2640 ctgagtttat ctcaatcttt tataccgtct ggtttttaaa aacctcttta ccttacgctg 2700 cgccttacca agatataaaa gcctattggc aaatgacaaa atatagaggt tacgtagaac 2760 aatatgttca gaattctgaa acaattttaa atggaattga tgccacaatg gtttcaatgg 2820 aatcacattt atggtacctt gatgaaactt taattccgct ggctcttcta gaccaaggta 2880 tttctgttgc agagagagaa gatgttgcaa aaatgctctt ttcgaaacca gttcctgaat 2940 tttttcggca ttccgagaaa ttaaatttgt taaaaactct taactttaac ctagaaaaac 3000 caccaagtat tgcccaacta gtgggagaaa actcttggtt catatttagt ttgttaaacc 3060 ttactaagct aaatgataaa ctttggttaa acagcccggc accactatgg gagtatattg 3120 aacagtttaa gattttttcc cagtttgttt ccaaccttgc agtggttaac gacgtttctg 3180 aaagatctat taaagttgta tcagactttg taaataacgt tcacaatgaa gacgatcgac 3240 aggagttact gctagctatt caccaaagga gagaaaacct taacaaggca aagactaaag 3300 aagatttgca attagcatat caagcaattg caaaatagat aatatagttt ttaaatgatg 3360 gcttttattt aatgatgtaa tattgatttg taaatatgta ttttttattt aattatactt 3420 aaacatatca ttcttttttt tttttgattt tttacaaaac caaaaaaaac tttgcaacaa 3480 taccaagaaa atcatttaaa gttgaaaagc taccatatac tttctaatct tgaggcaata 3540 aaactaataa caacaatacc acttttaacc atgttgcagt aagttcatta agtctttttt 3600 attataaaaa taactaaaat aaacaattgt tattaggcta ttaaggttag cgtaagggtt 3660 aggttacctc gaaggataaa agaaattttt ttttagacct atatatacct catttgacac 3720 tattattaaa aaaatctagg ttattgattt ttccgcgtct ggaaacagtt tattagaaaa 3780 tcaaaaatgt tttttatacg cgattgagtg ataaaataga tttttaagaa ggaaaatttg 3840 gatttttcac caatttcaga agattctgta gactaatttc ttaatttttt caacatgctc 3900 catttatgtg attaatattc aatccaacat ataacattta aaagaataaa aaaaaatttt 3960 cgataaattt caaggtcccc gccaggcgtt taaaacaccc atcatttttg gcccaaaatc 4020 gaactggcgg ggaccctaaa atttaaattt ttttattttt ttttttgctc aaacttattt 4080 tcatacaaat ggtcaattta taaatcgatt cgcgagttcg atttcaaaaa ttgataaaat 4140 atgaaacacc c 4151 // ID BEL-124_AA-LTR repbase; DNA; INV; 827 BP. XX AC supercont1.251; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-124_AA_; KW BEL-124_AA-I; BEL-124_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-827 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.251; Positions 1410906 1410080. XX SQ Sequence 827 BP; 262 A; 167 C; 155 G; 243 T; 0 other; tgttattcga caagaataac gcaccttcct ttggtcattt tgacctgcta tccccaatga 60 tcaacggtga gtgcgatagc agtgcactcg attgagatca attttctgca tgcaaatgag 120 tgttggcggg tagatcaaaa cagaaaacga aatcagcata gttcaatagt gctacatatt 180 tacctcctag tttgatagta gcgttttgaa gtgagtacat ttaattgcaa gtacattgaa 240 ttagtgatag ctttgtttcg agtgtcggta agctggacct gtatataata gagcgattgt 300 agaactaaaa ccccgtcttc aattaggctc ccatcacctc caccacctac agtgtagcag 360 catacgtgct aagccattac tccatagtga aaacggtgag aaagtggtag aatctattcc 420 agagtaaatt ttaaacaaat tacatgctat ccagatccat ctgcttgctc ctgtgaacat 480 agaagcctac cctacatagt ttgtgcccac tgcgaatatt actgaggata ctaaacgtaa 540 gttgaactct atctactgta atacatatac acatagaaac atagacgtat catgctgcaa 600 tcagtgtcag gggttcgcaa gaccacactc acagaccact ttatacagta ttgaattgca 660 tgaaattaga ttacgagtcg attaagaatc tgtgaattaa ataagtcctc aatgtccttt 720 ataggaaaat tatattctcc catacatcca aggattacgt gcgtttgaat tcatttccgt 780 gacggagaat cataattttg gaatttccgt tgaaccaaag ttcaaca 827 // ID BEL-613_AA-LTR repbase; DNA; INV; 490 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-613_AA_; KW Pao_Bel_Ele41; BEL-613_AA-I; BEL-613_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-490 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 490 BP; 180 A; 83 C; 95 G; 131 T; 1 other; tgtgacgacg agtacccccc ggtagacacc cgtcacaaag aatcacaggg atcttacggg 60 gcacgcgaaa gggtagaaac ttcacctttg acaatcgtcg gagatgtggc agacgtcaac 120 gaaggatgat aagcagaawt gatagtcgga gagtttgcgt aaacaaaact tgaattcgtt 180 attgtgcatt ataaaattac tattttctct tatatactaa tttgaacaat tgatccaagg 240 tagggataac tattgaatga acttaaataa tgtgaactta aaacctactt atgtatacaa 300 ttagatctaa aacctaccct acgagtacgg attgaaaact agcctaaagt aagcgagaag 360 aaaactgaaa aattgtaagt agagtgtaaa attatatgta acagaattta agaagcaatt 420 aaatatttca gctaaagctg ttcccaacta cttcaaccgt ttcggaattg ctgttagaat 480 cagtttaaca 490 // ID Gypsy-6-LTR_HM repbase; DNA; INV; 599 BP. XX AC . XX DT 25-DEC-2008 (Rel. 13.12, Created) DT 25-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-599 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1979-1979 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 599 BP; 177 A; 90 C; 121 G; 208 T; 3 other; tgttacgaag gtgggttact caagcggaac gcttgagtaa ttcaagttta ttgaaccata 60 tagacgtgta gcatagacgc tagcgtacca aaacaatttg tttatgaacc gaaacgcgtc 120 tagcgtttaa acaaaacgta taaaaacgtc agttaacaaa cgttcgtagt tttcttgatc 180 sagttcttca tccagttctt gatccagttc ttgatccagt tcttsatcca gttcttkatc 240 cagttcttca tccagttctt gatccagttc ttgatccggt tcttgattgg gttcttgatt 300 tgtgttcttg atttgtggtt cttagttctg agttcgttgt ccgaggtaag tggttgaaag 360 agtcgaagaa gcgattgaag ttgttgagga gccgaagata ataaaaagat accagaagat 420 atagtaaaac aagttgtatt atatttatac ggttttatat aatatatcgg gacaattgaa 480 actagtattt atatttgtta taagttgata cgaagtgttg tttggattat cagtaactcg 540 taacatttat ttatgcgtat tatctaaagc gtagtaacta aaaggaccat acgttaaca 599 // ID Gypsy-1_TCa-LTR repbase; DNA; INV; 155 BP. XX AC chrUn_220; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_TCa_; KW Gypsy-1_TCa-I; Gypsy-1_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-155 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_220; Positions 384 538. XX SQ Sequence 155 BP; 55 A; 27 C; 21 G; 52 T; 0 other; tgtgcaaaat atccccttag cgcttaatgc ttagtaaaca ttttttatat tataaatata 60 tgctaaattg tagaggggag acactctatg tctcaccgcg cttgtaatta cttcaaataa 120 aatttttaaa aaagccaact gcttattcag aaaca 155 // ID BEL-1_BM-I repbase; DNA; INV; 5595 BP. XX AC nscaf2210; XX DT 19-MAR-2010 (Rel. 15.04, Created) DT 19-MAR-2010 (Rel. 15.04, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_BM_; KW BEL-1_BM-LTR; BEL-1_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5595 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(4), 581-581 (2010). XX DR Genome; nscaf2210; Positions 4189182 4194776. XX CC Positions [4452-5063] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 135..5402 FT /product="BEL-1_BM-I_1p" FT /translation="MATTSKSQSLFDRNVRKMRSKINAYYSRVQSIYDSIT FT KINESNVDVFMSNASTLDMMRTEFIKSLDDLNDLLASNEDATEPNYESMVA FT FDDLFSRIKYKYSELMSLRHNTQNTANTVVGENKIETYRQCPKLPRLELCP FT FNGELQNWPIFYETFKSIIHENQSLSDSEKIQYLIGKLTGNAQLICQGFMP FT TAENYSVLWQTLVNKYNDKRVLAASYFDTLLNLPVANDADPISLENYIDKY FT SSLVSSLKQLKLECLEDFIFVHLGCKKISLELIKSFEMNYSDSDKLPTCTQ FT FINYIRNQSHILQRAYNISNNNSIRMNKYNSQQSNSNKPRNIRNNNVKTFV FT VHEQKRCIFCGVAEYHQLKNCSKFKSITPKERFSFIKTNRFCVNCLSTSHT FT VRNCTSNEKCSSCSRSHHSLLHFITKGDSGEVPKSVDIATHTDGDQDNAMP FT DSLCACALEQQCFKTVCNQQPTKTILLGTAICNIMDNSGTFLSTARILVDS FT ASQRDLITLEFCKRLNLPIYPSETKQVCGVGDVKNSIEGYCCLTLCSCTEE FT SVRITIQPLVINKIIGELPTSKIDTSSLNYLKHIKLADDRFSQPRGIDLIV FT GSDVFSRILRPHIISRAPGEPVAIETSLGYIIVGQAPVLAPSIDNSYYTHC FT TFGEDNLDNENNACIGNFLKLEDIPDVKPYTEQEEECESLYRQTTTRDSLG FT RYIVSLPFKDTPSKLGDSLQASMRRYLSLERKLLLQPEMRQEYDKVIKDYL FT EKGYLNPCSRNINESSLQYVIPHHGIVRKDKSTTKLRVVLDGSMKSSSGLA FT LNDILQVGPNLQNDLFRIILNFRLFNVAISADIRQMYLRILVKDEDRKFLR FT MLYRFDPSEEIKLYEFTRVPFGLCCSPFLAIRTVRQLVTDEGSQFPLAAPV FT ADSDVFTDDLATSCANEETAVELSNQLIKMFHAGGFDLVKFSSNSPEVLSK FT IPSSHREFEVIEFSPDSYLKILGLNWLPVEDVFTFTVNLRNRECTKRNILS FT VIARIWDLMGFVAPVTLWAKLIIKSLWANNIDWDETPRPEIVAAWHRFVSE FT LPILENVRIPRHTGIVMECIVSLLGFADASEKAYGGVVYLHVYFPQTNKFT FT ITLVCAKSRVAPLRCVSLARLELCGNLILAKLMRAIIDSYSHRCKISNVFA FT FTDSTVALAWIHSSPARWHTFVANRVTKIQDEIHPRNFYFVSGKENPADCL FT SRGLTPSQLMEHPLWFSGPRFAHLPISDWPVKNFDATSMTDIPEMKPAIML FT FTTDDARNDNILYTLACRISSWPKLVRIVLYVLRFIQLVPSKFDISHLKVA FT ETNIIKAVQRVHFDKELKNMKSGKILPSAFQRLRAFVHEGIFRVGGRLENA FT HLEFDSQHPILLPARDHVVNILIDYYHKKYLHTGPQLLMSLIRQNYWILSA FT RNIIRKRVQQCNICYRANPRNQCPLMADLPACRVQETKAFYHTGIDYAGPL FT RIIPYRRRGVRSQKAYICLFICLVTKAVHLELSTDLSTAAFLNAFKRFIAR FT RGPIKVVYTDCGTNFLGTHSYLKEMYKLISSEQYADKFANELRDNNIDWKF FT NPPSAPHFGGIWEGNIRSVKTHLNKIVGNQLLTFEEMLTTLAQIEALLNSR FT PLSVLSSDPCDPQALTPAHFINLTPLKSLPSETITEHSNLIQRKRIVDSLV FT QSFWKRWKLEYLNTLQVRNKWKKDSTPIKTGTVVLLQSENSAPLNWPLGII FT EETFPGRDGVVRVVNVRTKTGTYRRPVVKVYPLPTQ" XX SQ Sequence 5595 BP; 1776 A; 1025 C; 1071 G; 1723 T; 0 other; tatttgatca cttcgagccg gatatttaac aataatttaa ccgttgcaca tagattgagc 60 ggctaccccc aacttgaaaa aactattaat aggtgaacat tttatttaaa tttctttgtt 120 actacttagt taaaatggct actacttcta aatctcaatc actttttgat cgtaatgtac 180 gtaaaatgcg tagtaaaatt aatgcatatt actctcgagt acagtccatt tatgacagta 240 tcactaaaat aaatgagagc aatgtggatg tatttatgtc gaatgctagt actttagata 300 tgatgcgtac cgaattcata aagtcactgg acgatcttaa tgatttatta gcatcaaatg 360 aagacgccac agaacccaat tacgagtcaa tggtggcgtt tgatgaccta ttttctcgca 420 taaaatataa atattcggaa ctaatgtcgt tgcgtcacaa tacgcaaaat acggcgaaca 480 ctgttgttgg tgagaataaa attgagacct accgacagtg tccgaaatta ccgcgattag 540 aattatgtcc gtttaatggt gaacttcaaa attggcctat attttatgaa acatttaaat 600 ccataatcca tgaaaatcaa agtttatctg atagtgaaaa aattcagtat ttaatcggta 660 aactgactgg taatgctcaa ttaatttgtc aagggtttat gcccactgct gaaaattatt 720 cagttttatg gcaaacatta gtgaataaat ataatgataa gcgagtcttg gcggcgtcat 780 attttgatac gttattgaat cttccagtag ctaacgatgc cgacccgatt agcttagaaa 840 attacataga caaatattct tcgttagtaa gctcgttaaa acaattaaaa ctagaatgtc 900 ttgaagattt tatttttgtt catttaggtt gtaagaagat tagtttagaa ttaattaaat 960 cattcgaaat gaactactct gattctgata aactacccac ttgtactcaa tttataaatt 1020 atattcgtaa tcagtcccat attttacaac gtgcttataa tatatcgaat aacaatagta 1080 ttcgaatgaa taaatataat tctcaacaaa gtaattcaaa caaacctcgt aatataagaa 1140 ataataatgt gaaaacattt gttgtacacg aacagaagag atgcatattc tgtggtgtcg 1200 cggagtacca tcagttaaaa aattgttcta agtttaaatc aattacaccg aaggaacggt 1260 tctctttcat taaaaccaac cgattttgtg tcaattgctt aagtaccagt cacacagttc 1320 gaaattgtac gtcaaatgaa aaatgtagtt catgttcacg tagtcatcat tcgctgttac 1380 attttattac aaagggtgat tcgggtgaag taccgaaatc agtagacatt gccacccaca 1440 ctgacggaga tcaagacaac gctatgccag attcactgtg cgcatgcgca ttggaacagc 1500 agtgttttaa gacggtatgt aatcaacagc ccactaaaac tatattgctt ggtacagcga 1560 tatgcaatat tatggacaat tcaggcacgt tcttgagtac tgcccgaata ttggtagata 1620 gtgcatcgca acgagatctg attactcttg aattttgtaa gagacttaat ttaccaattt 1680 atccttccga aacaaagcaa gtgtgcggag tgggtgacgt gaaaaattcc attgaaggat 1740 attgttgtct aacactttgc tcttgtactg aagaatcagt acgtattaca atacaacctc 1800 tggtaatcaa taaaattata ggtgaacttc ctacctcaaa gatcgatact tcaagtctta 1860 attatctaaa acacattaag ctagctgatg acagattctc acagcctcgt ggtatagatt 1920 tgattgtggg ttccgatgta ttctctcgaa ttttacgacc tcatataatt tctagggccc 1980 ccggtgaacc tgtagcaatt gagacttcat taggttacat aatagtcggc caggcaccgg 2040 tattagctcc ctcgatagat aattcatact atacacattg cacctttggt gaggacaatt 2100 tggataatga gaataatgct tgtattggta attttttgaa attagaggat attccagacg 2160 tcaagccata cacggaacag gaagaggagt gtgaaagcct ttatagacaa acgactactc 2220 gtgatagctt aggtcgttac attgtctcgt taccattcaa agatactcct agtaagctgg 2280 gtgactcatt gcaggcatca atgcgtagat atctttcatt ggaacgtaag ctgctattgc 2340 aacccgaaat gagacaagaa tacgataaag tcattaagga ctatttagaa aagggatatt 2400 taaacccatg ttctagaaat ataaatgagt cctcactcca gtatgttatt ccccatcacg 2460 gcatcgttcg aaaggataaa agtacaacaa aattacgggt agtattggat ggcagtatga 2520 aatcatcatc gggactggca ctgaatgata tattacaagt cggaccgaat ttgcaaaacg 2580 acctttttag aataatttta aatttcagac tttttaatgt agcgattagt gccgacatta 2640 ggcaaatgta cttgaggatt ttagtaaagg acgaagatag gaaattcctg agaatgttgt 2700 atcgctttga tccaagcgag gagataaaac tatacgaatt cacacgtgtg ccattcggtt 2760 tgtgttgtag tccgttcctt gcgatcagga ctgtgcgcca gttagtgacg gacgagggat 2820 cacaatttcc tctcgcggct ccagtagctg acagtgacgt tttcactgac gatttggcta 2880 cttcctgtgc gaatgaagaa acagcggtcg aactttctaa tcaacttatt aaaatgtttc 2940 atgctggtgg ttttgatttg gtaaagtttt caagtaactc accagaggtt ctttctaaga 3000 taccctcttc acatagagaa ttcgaagtta tagaatttag tccagatagc tatttaaaaa 3060 tactgggact aaactggctt ccagtcgaag acgtattcac attcacagta aatttacgga 3120 accgcgaatg tactaaaaga aatattttat cagttatagc aagaatttgg gacttaatgg 3180 gattcgtagc tcctgttaca ctatgggcca agctcatcat taagtcacta tgggctaata 3240 atatagattg ggatgaaact cctcgccctg aaattgtagc agcttggcat cgattcgtgt 3300 ctgagttgcc aatattggaa aatgttagaa taccacgaca tactggcatt gttatggagt 3360 gcatagttag cttattggga tttgctgatg catctgaaaa ggcctatgga ggtgtcgttt 3420 atttacatgt ttattttcca caaacaaata agttcactat tacgcttgtt tgtgccaaat 3480 ccagagtagc gccgttacga tgcgtgtcat tggcaagact agaattatgc ggtaacttga 3540 ttctcgcaaa gttgatgaga gcaattattg acagctattc tcatcgctgt aaaataagta 3600 atgtatttgc ctttacggat agcaccgtgg ctttggcatg gattcactca tcaccggcta 3660 gatggcatac gtttgtagcc aaccgagtaa caaagataca ggatgaaata caccccagaa 3720 acttttattt tgtatcgggt aaagagaatc cagcagactg tctttcacgc ggcttaacac 3780 catcacaatt gatggagcat cctctttggt tcagtggacc ccgatttgct catctaccca 3840 tttcagattg gccagttaag aattttgatg ccacatcgat gactgatatt ccagaaatga 3900 aaccagctat catgctattc acaactgatg atgcacggaa tgacaatata ttatacacgc 3960 ttgcatgtcg tatttcatca tggcccaagc tggtccgtat tgtactttat gttctccggt 4020 ttattcaatt ggtaccgagt aaattcgata tttcccattt aaaggtggcc gaaacgaata 4080 tcatcaaggc tgtacagcga gttcactttg ataaagaact taagaatatg aagtctggca 4140 aaatattacc gtctgcattt caacggctca gagcctttgt tcacgaaggt atctttcgtg 4200 ttggcgggcg attggagaat gctcacttgg aatttgatag tcagcaccct attttacttc 4260 ccgctcgtga ccatgtagtg aacattttga tagactatta tcacaaaaag tatttgcaca 4320 ccggaccgca gctcttaatg tctctgatac gccaaaacta ctggattctc tctgctcgga 4380 atataataag aaagcgagta cagcagtgta atatatgtta tagagcaaat ccccgaaacc 4440 aatgtccatt gatggctgat ttacctgctt gtagagttca agaaacaaag gccttttatc 4500 acactggtat agattacgct ggcccattac gtattattcc ctacaggcga cgtggtgttc 4560 gtagtcaaaa ggcgtatatt tgtctgttca tttgtctagt taccaaggca gttcacctag 4620 aactctcaac ggacttgagc acagcagctt tcctgaatgc attcaaacgc ttcattgcgc 4680 ggcgtggtcc aatcaaagtg gtctataccg actgtggaac aaactttttg ggaactcact 4740 catatctcaa ggaaatgtat aaattgattt cctctgagca gtatgcagat aaatttgcaa 4800 atgaattgcg tgataataat atagattgga aattcaaccc accatcagcc cctcattttg 4860 gcggtatatg ggagggtaat attcggagcg ttaaaactca tcttaataaa atagttggta 4920 atcagcttct cacttttgaa gaaatgttga ctacacttgc tcaaattgaa gcattattaa 4980 attcgcgacc tctatcagtt ttgagttcag atccatgtga ccctcaagct cttactcctg 5040 cacattttat aaacttaaca ccattaaaat cactaccttc agaaacgata acggaacatt 5100 ctaatctcat ccaaaggaaa cgtattgtag acagtttggt ccagtctttt tggaaaagat 5160 ggaaacttga atatcttaat acattacaag ttcgtaataa atggaaaaaa gatagcactc 5220 ctattaaaac tggcacagtt gtactgcttc aatctgaaaa tagtgctcct cttaattggc 5280 cattgggtat aatagaggaa acttttccgg gtcgtgatgg agttgtaagg gtggttaacg 5340 ttagaacgaa aactggtact tatcgtagac ctgttgttaa ggtttatcct ttaccaacac 5400 agtagtggtt cgatataatt ctacatacat acagtccaca aaatctaatg aaatataatt 5460 aagtagtaat ttaatagtct aattgaataa taaaagtaat aagtctaaga taataagata 5520 atttataatg taggtttcat atattattat agtgtttaac tttgaaagag ctttactctc 5580 tcaaactggg gggca 5595 // ID Zator-1_HRo repbase; DNA; INV; 3655 BP. XX AC . XX DT 29-MAY-2010 (Rel. 15.08, Created) DT 29-MAY-2010 (Rel. 15.08, Last updated, Version 2) XX DE Zator-type DNA transposon: consensus sequence. XX KW Zator; DNA transposon; Transposable Element; Zator-1_HRo. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3655 RA Jurka J.; RT "DNA transposons from the Californian leech genome."; RL Repbase Reports 10(8), 1187-1187 (2010). XX DR [1] (Consensus) XX CC ~99% identical to consensus CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX FH Key Location/Qualifiers FT CDS 1112..2734 FT /product="Zator-1_HRo_1p" FT /translation="MDKDDMFKKLFNAFIKANPSGQKQKIQHDCVSFWNSI FT KTAPDFVRLYNAKKAELEGTTRKRLISSFFLSKNENIPSNLKSNDSSCGII FT NSCSSTFSAVPSTASCSKLTVSNDKCPAQKQLTEELNIINADLVGLMARDN FT MRILTAKQKEEMDTKKARKRQIEKDIIKKKANQERQKKFRIDKKEILQKVI FT EENPELAKKLKTKESPGRPSIVSEQPDLLKSILDIAEFGCSADERRRMETL FT RCVKTLDELHKELLNMGFKISRTAVYYHLLPKNYNTIDGRRHVSTVPVKLI FT RATNDLHRQHPDTKFATDTVHHLCELASFFGPANTTVISQDDKCRVPIGIT FT AAKMQAPMLMHMEYRVRLPDHDWVIGDRHKLIPSVYAGLVISEKGYGAKEA FT VTYSGPTFISIRSGKHSSSNAMSHARDFERLMKLPEFDTITKTMDQNAKPI FT FIFFVDGGPDENPRYPHTIQSAIRQFKKYNLDAIFIATNAPGRSAFNQVER FT RMAPLSRELSGVILPHDFYGSHLDINGKTIDEKLEILNFEHAGET" XX SQ Sequence 3655 BP; 1275 A; 577 C; 678 G; 1125 T; 0 other; aggggccgtt cgaatattac gtaacgctcc gggggggagg gggggtctaa gattttgtta 60 cggagtgtta cggggggagg gggggttcaa aaattgttac gtaacacaga aaaaaatgtt 120 gttttttata tttttcatta ttaataaatt taatttttca ttctaaaaca caaatgttag 180 tgtagttttt atagaatcta tgttaataaa tggttaatga tcatatttct ttaaaattgt 240 cgttttcatc catgaaactt agcagttatg aacattctga taaaaaaaca cgacgaatac 300 aatgcatttt cgtgaccgcg tggtacatac ggaaaaaaat caaaaaaaat tctgcttcta 360 ctctaaattc aaaaatattt ttacataatg ttaacaaatg gttaattatc atatttcttt 420 aaaattgtca tttttactca tgttttacta agtttttaaa aaaatataaa atacaccact 480 cgtagcgaat tgcctgctgc aaatgtgact cacaacaaca tagaaaaaat atttttataa 540 taatacacat ttcttttatc gcttataaca accgaaaatt atatagatag gcgcattttc 600 gaaaaataaa caaaaataac aacgacgaaa acgcgaattt gcgatagtaa tgttaaatca 660 ctttcaggaa aacgcggtga agttgagcgc gcgcgctatc gcatttttta acgttgtgcg 720 acctgctatt atagattaaa taatgttgga ttatataaaa gttagtttaa ttaatatatg 780 gatgaataaa tgatgattga atgaaagaat ataggaagaa aaatttttct tgtgacaagc 840 taaaaatcaa atttgatcca acgagagaag agttataaaa ggtttaaaaa gttaacttta 900 gattgcatag tgtagtttaa cggtgaccga aattttcaag tttgtgtttc tcttgctgac 960 atttctttaa atttttaaag ttcaagtcgt atttctagat ttagagtgag tacaatttct 1020 ttaattatgt tttaattatc tgaaattgtt attttggaaa ttttattcta catgataatt 1080 ttaattttac tttttaactg accattttag catggataaa gatgacatgt ttaaaaagtt 1140 atttaacgca tttataaaag ctaatccaag cggtcaaaaa caaaaaattc agcatgattg 1200 tgttagtttc tggaattcaa tcaagactgc acctgatttt gtccgcttgt acaacgcaaa 1260 gaaagctgaa ctagagggta ctactaggaa gcgattaatt tcgtcttttt tcttgtcaaa 1320 aaacgagaac attccttcca atttaaagtc aaatgacagc tcttgtggaa taataaattc 1380 ttgttcttca actttttctg ccgttccatc aactgcaagc tgttcaaagt taactgtttc 1440 gaatgacaaa tgtccagctc agaagcaact cacggaagaa ctgaatatta ttaacgcaga 1500 tctagtaggc ttaatggcac gcgacaatat gcgcatactc acagctaagc agaaagaaga 1560 aatggacact aagaaggcgc gtaaacgaca aattgaaaag gatataatta aaaaaaaagc 1620 taatcaagag cgtcagaaga aatttcgtat tgacaagaaa gaaatactgc aaaaagttat 1680 tgaagaaaat cctgaacttg ctaagaaatt aaaaactaaa gaaagtcctg ggcgtccgtc 1740 aatagtgagt gaacagcccg atttgcttaa atcgatatta gacattgctg aatttggctg 1800 ctcggctgac gaacgaagac gaatggaaac tttgagatgc gttaaaactt tagatgaact 1860 acataaggaa ctattgaata tggggttcaa aatatcacga acagctgtgt actatcatct 1920 ccttccaaaa aattataata caattgatgg cagaagacat gtcagcacag ttccagtgaa 1980 attgattcgt gctacaaatg atctccatcg acagcatcca gatactaaat ttgctactga 2040 taccgttcat catctttgcg agcttgcttc attttttggt ccggcaaata cgacagtcat 2100 tagtcaagat gacaaatgtc gagtacctat tggcatcaca gcagccaaaa tgcaagcacc 2160 tatgcttatg catatggaat atcgtgtaag acttccagat cacgattggg ttatcggcga 2220 tcggcataag ttaataccat ctgtatacgc cggccttgta atatctgaaa aaggttatgg 2280 tgcaaaagaa gctgtaacat actctggtcc aactttcatc tcaattcgat caggaaaaca 2340 ttcttcttca aatgcaatga gtcatgcaag agacttcgag agattgatga aacttcctga 2400 atttgatact ataacgaaaa caatggatca aaatgcaaag ccaatattca tcttttttgt 2460 tgatggagga ccagatgaga atcctagata tccacacact attcaatccg ccatcagaca 2520 gtttaaaaag tacaatttag atgcaatatt tattgctaca aatgcgccag gtcgcagcgc 2580 atttaatcag gttgaacgtc gaatggcacc tttaagcaga gaattaagtg gggtaatttt 2640 gccgcatgat ttctacggaa gccaccttga tatcaatggt aaaacaattg atgagaagct 2700 agaaatttta aactttgaac atgcaggtga aacgtagcaa ccatctccgg taatgttgta 2760 attgacaacc atccagtcat tgcagaatac gtttgcccag aaagatccga aatcgattta 2820 atagatacag catgcattga tgaaaaatgg aaaatgaatc atattcgttc ttcacaatac 2880 tttttgcaaa ttttgaagtg ccaggatcga acatgctgta cagagccacg aagcccattt 2940 ttcaactttt ttcctcaacg ttttttgcca gcacctttgc cgctagtata cagtccttct 3000 atttcaatcg caaagtcaac tgaagacgta ggtaaattta tgagcttatt tcaaaatgaa 3060 gcatttgcta accgtacact cccttatgat tttcactgcc catccgttag tagtacagta 3120 ccaacgagag tttgtataga ttgtggattg tattgtgctt cactggaaat gttgaaatct 3180 caccgaaagt caatccacaa tcgtattgtc aataccaaga agaagccgaa gaaaatcata 3240 aaaaagcgga atgaagagtt gttggtacag ttggatgatg atgttgaatg gatgaatagg 3300 gaaaatgttg aatgtattga agaaacagaa gaagatgaga atcctgtaaa taatgaagta 3360 ccaataatat ctattcaaac tcatttggaa agtatatggg aagaagattg ttgatatttt 3420 ttaacaattt tgattttgat tttagaaaca acgctatata aatacatttg atgtatgatt 3480 tattgaaaat aatagaaatt tgaaataatt ttgtttttta ttaccgaaaa attaaaaata 3540 ttttttgtta cgtaacgggg ggggggtaag tcgtttgtta cggaagtgtt acgaaggggg 3600 gggagggggt caaaaaagtc gttttttcgt gttacgtaat actcgaacgg cccct 3655 // ID Shinagawa-2_AAe repbase; DNA; INV; 1857 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1857 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 839-839 (2011). XX DR [2] (Consensus) XX CC >97% identical to consensus. 8-bp TSDs. TIRs are ~130 bp long CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. XX SQ Sequence 1857 BP; 512 A; 398 C; 338 G; 608 T; 1 other; ggctccatcc cattaccccg aacgccacta ccccgaacgc cactaccccg aatgccacta 60 ccccgaatgg gtcactaccc cgaaagccat taccccgaat gggtcattac cccgaacgcc 120 actaccccga atgagccatt accccgaata gtatgagata cagtgaaata tattgttttg 180 cgggcaaaaa tgcgtctcta atgaaaactg gtaattaatg gcgcgtaaaa ttcgtcggta 240 aaatgttcat catatatttt cccttctttt ttatgtcagc tcttctttcg aaatatattg 300 ttggcaactc ttctcattct ttcggagacg gtaccttttt atcagcccag tgggcacttt 360 cgcagttcaa gggttcattt acaaatttca taacgccaaa aatttctatt tttgacacct 420 acccacccgt tcgtaccact ttttgtatgg acaacatttt ttgtcccctt cagcgttacg 480 aaatttgtga atgggctttt gataaaataa gccgtttatt gactgagacc tctccttgtc 540 aacttaccat tattgtacat gttctattgt tcctgggcat agaacgtcaa ggaaatgctc 600 aatgcataag atccaccact ggaggaattt gaacccataa cccttaatat ggtcttgctg 660 aacagctgcg tgaacacaga tatttgaact tcttggtggt gtccatattt caattttgtt 720 ttaaagcaaa tattttcctt tcttgtgtat atatgttgtg tcatagcatg atatgacata 780 tcatattgtt acatatcacg ttgttacagt gatcattcac gtcatagctg caaacatttc 840 accagaaatt tccccttctt tcttaaatag gctgttcttt caagttttcg ttggacaagc 900 tagtcattaa tatttccata tagaacagct ttttttgaag gaatatatcc tgaaggcatt 960 cactcaagtg aatcctccta taattctaat cagtaatttt ccctttcttt catcttcttg 1020 tattgaaaca gacaggtctc ttatgaattt gctcaccgct ttgctgtcat catcatttct 1080 tcattgtgct gaatcggaaa aaaattgtga ttatctcatc gaaattcaaa ttaatgatcc 1140 cacaaaccaa cgaaagtttt tactgtagcg gcaatcaaag gccgccgttt ttctaacatg 1200 tacaagtgta ttaacgtttc cttgctttag tggaacggtt gtctatcagg ttggttttta 1260 agcgggttcc atagacgaac acttgcagcg attatctcag ggtgatcttc cattaccacc 1320 gtttacgaaa caaawtcttt aaggagatct taatgctagt agcccgacac ctacaatgag 1380 taaaggaagt gggttccgac gtccaataag aaactttact acatcctaat tttgtgcata 1440 ttatagctat cctttgcata ataaattcac ccttcttttc gaaataggct gttcttcaga 1500 gcattgcaat gtggctgtgt gaatctcacc aaaattagag gacaaatcga aataaaactt 1560 agtattcaaa aattattagc taaaaaacta gctgctttta tatcttctaa tgtttgaaat 1620 ataaattgtt gagccactcg ttaaatactc ataaggaatc gtacaattat ctccccactc 1680 tctcactaga ataagttcca aattttatgc attcggggta atggcgttcg gggtagtgac 1740 ccattcgggg tagtggcgtt cggggtaatg acccattcgg ggtaatggct ttcggggtaa 1800 tggcgttcgg ggtagtggca ttcggggtag tggcgttcgg ggtagtgtca tagaatc 1857 // ID BEL-62_AA-LTR repbase; DNA; INV; 640 BP. XX AC supercont1.350; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-62_AA_; KW BEL-62_AA-I; BEL-62_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-640 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.350; Positions 65970 66609. XX SQ Sequence 640 BP; 219 A; 96 C; 129 G; 196 T; 0 other; tgttacgtcg accagcaagc ccctcgtacg gacatccatt tgtgaatagg tcgaaatgac 60 agcacctatc tatgacagcg agaaatgttt atgcgtgttg gaagaacgat ggcaggtttg 120 tcatatgatg atgattggca aaaattgaag tagatcgatg gctattgttg gattaaaagt 180 ataaagttta cggatttcag aattaaaggt ttagctaatt gagtgaagta attgaattcg 240 agttggtaag caagtcattt attcgtgatt atcgttgata attataaaat tgaattgttt 300 aggtgtttat gcatccacat ttccacccac gttgaactag gagaattaga aaagttctgg 360 aaagaaaatt aaatttataa gtaagtttgt tattgaattt ctctaagatg cagtgtacga 420 aattaataat taaccatagg ccaacccatt gacagctgga ttgtcagata acaggagagg 480 aagataccta atctgtatct gtggtagaac accaaaattg tgagtccaca taaaatcatt 540 tataccttta actctaatca caaatacaaa tttcagctta aagcgcttca ctacaaacaa 600 cagttgtgtt ttgctaaaaa gattggtgat cgtccgaaca 640 // ID AlKe6_AL repbase; DNA; INV; 175 BP. XX AC Y11730; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Acricotopus lucens DNA for tandem repeat sequence AlKe6. XX KW AlKe6_AL; tandem repeat. XX OS Acricotopus lucens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Acricotopus. XX RN [1] RA Staiber W., Wech I. and Preiss A.; RT "Isolation and chromosomal localization of a germ line-specific RT highly repetitive DNA family in Acricotopus lucidus (Diptera, RT Chironomidae)."; RL Chromosoma 106(5), 267-275 (1997). XX DR Genbank; Y11730; Positions 1 175. XX SQ Sequence 175 BP; 68 A; 29 C; 22 G; 56 T; 0 other; aacataatgt gaaaatacac aaaaaaaagt tgattttttc tataatttga actttacaac 60 aaataagtac aaactaaaat tcatcaaatc atatatcaat cgatgcgtct tggtaccagc 120 aacaatttca taccattttg agcgctctag gacgtttgtg atagaattta ttgcc 175 // ID hAT-N16_AP repbase; DNA; INV; 615 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N16_AP. XX NM hAT-N16_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-615 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2116-2116 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 615 BP; 253 A; 76 C; 92 G; 194 T; 0 other; cagtgtttgg aaggaacgag ttccaaaagg aacggattcc tggaacgaat tcctttttaa 60 agaacggcac gatgaacgaa ttccttttta taaaaaaaga acgagaaaat gaactagttc 120 ttttttttaa ggaaaaataa caaaattgtt cgttcctttc tagttccaat aatttacaca 180 gataagtatg taaaaatcga aattaagata tattttttta atgtgtataa tgtatattgc 240 taattagtga tcgttatcga gtatcgacgt taagacaatc gacatacgtt agccaaaaat 300 ttaattttga agaaaatctc aaaatatata ataatatatt gcataaacat ataaaatatg 360 tacctacttg tagttatata ttataaataa aaattaaaga agtatgcata atacgaaaaa 420 aaaaacatta atgattaaat aagaattttt attttatttg gaactaaaaa aggaactcgt 480 tcattttcca aaggaacgac aaaggaacga attctttttt tttttaagga acaaggaacg 540 gaacgaattc ctttttttaa aaaaagaacg aggaacggaa cgaattcctt tttataagga 600 actcgccaaa cactg 615 // ID BEL-215_AA-LTR repbase; DNA; INV; 407 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-215_AA_; KW BEL-215_AA-I; BEL-215_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-407 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 888-888 (2011). XX DR [1] (Consensus) XX SQ Sequence 407 BP; 135 A; 66 C; 97 G; 109 T; 0 other; tgttagcaaa actgtaatgt gtagcgaaac cataatttac aaatagacag agtttttgca 60 tgccgtcatt gaacatcaat aggaagtttt cgttttgacg caccgcagga taataggctt 120 tatggcaaag gagctttttt gtgtagaatg tttgcgcgcc tagatcgaaa gtttgttcgc 180 tcgtgtggag aggaaaaggt acagtccagt aaaaaagaag aggaaagtta tcgaaagaca 240 tgaatgaaca ggcaacagtc cattgataga ttttgaagag tagtgttctc gcggaaatta 300 cagtgaaata aagtttgaag ttattcggaa aattaatccc atgcgttgtc agtgaaattt 360 agaccaatcc ggagaaaacc ccaatcccct gagctgtcgt cggaaca 407 // ID I_Ele15 repbase; DNA; INV; 7196 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele15. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7196 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7196 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 6 CC sequences with >97% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 5..604 FT /product="I_Ele15_1p" FT /translation="MKAFLAQQQQMFSNAMKHLYTQNLKMQEEFESMKKSM FT STISPPNAKIVDLKQCILSATQPAEKSLNSNPSATTNQDGAVISILDLSSS FT TMSETDPIEPEEKTSALDSTADISDEEYSDPSPVVSPNTIPAKSFDTPRPK FT TIISNPNDSSHKTPTNSKPTKAANKRTATELSPLENARGNDPLPKHKRQLP FT ATGRNRSNKQ" FT CDS 679..5886 FT /product="I_Ele15_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANSRGAFGLEAIPQPAPSNRPRHLSSISDEENSATN FT SRGADRPEVRTQPEQSNRPRRSSDENINYASFSPARAPGSLGPSSADDFST FT PELVDNPWRWGKSGRGTDKGINTHSPTDHVGVKLNRFSGRTRGEILVPRRT FT SRESSQQTKPSTPQPVLPGQPETSVSGGKTRSNRSSSTQTKPSFLPAGCIK FT PHRSEALAKSRSNKIKITLLSSNQQTHAHGGHPTTTSFEDPITMGKSDSHF FT LSSHQIQTATCQPALPGHPELLVSGGKPHPRSPTPDNNEPSPLLAGRVKCP FT LDGALAKNRPLSNQQPPTHGKSPTPTAPDHPITMGNMAPYTAPSSPQSSSP FT LWESSTSHANASTGTFSKDPVSPTDEGPRSTVEPNRISNIRKLPSDCLTNS FT EKYDSYIASSLGPGALLAPCCSTNRASLAVDLFGHPMHSNLGGKSQQLAFS FT HEIACDVDINSNSNLETTSSWGDSRPAAVWGLERLNRFQHSAGEIGSETGN FT NPNADGRCISAPSPPSRLSETHSSTHSHPNVNRNARLILQWNINGFLNNLS FT NLEMLVNQHEPLVIALQETHKASVDIMNSRLGKKYSWHAIRGSNFYHSVAL FT GTINEIPYEILQLDTDLPIAAIKISWPFPMSIVSFYLPNGKLTNLKNRLID FT TFDKIPDPKIILGDGNGHHTTWGSRVNNARGSAIFETATETGLCILNDGCK FT TFIRGRVETSIDISLASHNIIDRLLWSVDNDLSGSDHFPIWIQLNSSPPKT FT SRRPRWLYDQANWKIFQDRMEEILESSPPSSIDHFSNSIIEAATAAIPRTS FT PTPGRRALCWWSETTKKAVKARRKALRAARRLPADHPEKIAAMEKYRDANN FT KCKFIIREAKEKSWTSFLDEINDNQSSAELWSRVNAIQGKRQTRGLVLKSG FT NQTTRDPILVADALADYFGDLSSFHRYPLDFQKKHQPANAAFAMTQVPGFP FT VQNFNKEFSMDELLFALSKSKGKSAGPDQIGYPMLKKLPPIGKRKALDLIN FT SEWLAGTLPDSWKCSLVVPIPKACGTNYDPSDFRPIALTSCLAKMMERMAN FT RRLVEYLEKNDLLDNRQHAFRAGFGTGTYFAALGDILNGALERNEHVEMAS FT LDLAKAYNRAWTPGILKCLMDWGVTGNLASFVKNFLTNRRFRVSIGNHQSK FT EIPEETGVPQGSVIAVTLFLVGMNGVFQSLPKGVFVLVYADDILLLVTGVH FT PKMVRRKLQAAVNAVAKWAIKVGFEISATKCARLHICNSKHQPPKKPITVN FT GTAIPNKKTVKIVGVTLDRHLSFKAHFDNVRSACKSRINLLRLISGKRKRS FT DRTSRIRVANAIVNSRLLYGIEITSQRFDELISNLGGTYNNAIRIISGHLP FT STPADSACAEAGVLPFRYKLANSIGSRAICFLERTKNDGSEACIEIQANQI FT LRSVAGIELPSVSERHRNGARSWSARKIKVDNYLKQKLKKGGNPRLAQALF FT LDRIRTKYRIYNVRYTDGSKALGKVGIGWCGIGFRESRSLPSQCSVFSAEA FT AAIFQAILQPSDLIGPVLIATDSASTIAAILSDTNKHPWIQEIQDVLDDEE FT NITLMWVPGHSGIAGNEEADRCANTGRDSERLTDIVPSADLKLWLREKVND FT AWRRKWQSERNLFTRKIKGDVRKWEDRPDRREQVILSRLRTGHTRVSHSMS FT GGPNFRRICETCQVHNSVEHFLYTCPTLELLRKQYDMGSIRTSLQDDRSSE FT AVLFRFLKDAKLFDEI" XX SQ Sequence 7196 BP; 2072 A; 1995 C; 1601 G; 1528 T; 0 other; atatatgaaa gcctttctcg cacagcaaca acagatgttc tccaacgcga tgaagcacct 60 gtacacgcaa aacctcaaga tgcaggaaga gttcgaatcg atgaaaaaat cgatgtcaac 120 gatctcacca ccaaatgcaa agatcgtcga cttaaaacag tgcattctct ctgccaccca 180 gccggcagag aaatcgttaa acagcaaccc atccgctaca acaaaccaag acggtgccgt 240 catcagcatc ttggacttat cgtcctcaac gatgtcggag acggatccga tcgagccaga 300 ggaaaaaact tccgccttgg attcaactgc cgatataagt gatgaggaat atagcgatcc 360 atctccagtc gtctcaccga atactattcc agccaaatct ttcgacacac ccaggccgaa 420 aactatcatc tccaacccga atgactcttc tcacaaaaca ccaacgaact ctaaacccac 480 taaagcagcg aacaaaagaa cagcaaccga actttccccc ttggaaaatg ctaggggaaa 540 cgatcccttg ccaaaacata aaagacaact acccgccacc ggtcgtaacc ggtcaaataa 600 acaatagctt tctctcccag caaaaactcc tccgtcaaca aacctaccga ccctacaaca 660 accaactgtc cgcttttgat ggcaaatagt cggggcgcct ttggactgga agctataccc 720 caaccagcac catcgaatcg accccggcat ttgtccagca tttcggatga agaaaatagt 780 gccacgaaca gtcggggcgc cgacagaccg gaagtcagga cccaaccgga acagtcgaat 840 cgaccccggc gttcgtcgga cgaaaacata aattatgcaa gcttttcccc agcgagagcc 900 cctggtagcc ttggtccctc cagtgcggac gacttctcca caccggaact ggtggataac 960 ccttggcgct gggggaagtc tggaagaggg acggataagg gaataaacac ccattcccct 1020 acggaccacg tgggagtcaa actcaatcgt ttttcgggac gcacaagggg agaaatcctc 1080 gtcccccgaa ggacttcgag ggagtcaagc caacaaacga aaccatcaac accccagccg 1140 gtgcttcctg ggcagcccga gacttccgtt tcgggtggta agacccgttc caatcgctct 1200 tcatcaaccc aaaccaagcc atctttcctc ccggcgggtt gcattaagcc ccaccgcagc 1260 gaagctctag ctaaatctcg ttccaacaaa atcaaaataa cactcctctc gtcgaaccaa 1320 caaacccatg ctcatggggg acatcctaca acaacctctt ttgaagatcc aatcacgatg 1380 ggtaagtcag acagccattt tttatcaagc catcaaattc aaaccgcaac atgccaaccg 1440 gcgcttcctg ggcaccccga actcctggtc tcgggtggta agccccaccc cagaagtcca 1500 acaccagaca acaatgagcc atctcctctt ctggcgggtc gtgttaagtg ccccctcgac 1560 ggagctctag ctaaaaatcg tcccctctcg aatcagcaac cccctaccca tgggaaatct 1620 cctacaccaa ccgctccaga ccatccaatt acgatgggta acatggcacc ctataccgca 1680 ccaagctcgc cacagtcttc ctcacctctg tgggaatcat ccacttcaca cgccaacgca 1740 tctacgggca ccttctccaa ggatccagtg tcgcctactg atgaaggacc tagatcgact 1800 gtcgaaccaa accgaataag caacatcaga aaactcccct cagactgcct caccaacagt 1860 gaaaaatacg atagttatat cgcttcttcg ctgggccccg gcgctttatt ggctccttgt 1920 tgttctacca acagagcaag tttggctgtc gacctctttg ggcaccctat gcactccaac 1980 ctaggcggta agtcccaaca acttgccttt tcccatgaaa ttgcctgcga cgttgatatt 2040 aattccaact ccaatctgga gaccacgagt agttggggcg acagccgacc agcagctgta 2100 tggggactgg aacggttgaa ccgttttcaa cactcggcgg gcgaaatagg atccgagaca 2160 ggtaacaacc caaacgcaga tggaagatgc atctcagcac cttcccctcc ttcaagactc 2220 tccgagaccc actctagtac gcactcgcac cctaatgtca atcgaaacgc tcgactgatt 2280 ctccagtgga acataaacgg ttttctcaac aatttgtcga acctggaaat gttggtaaat 2340 caacacgagc ccctggtcat cgctttgcag gagacccata aagcctctgt agacattatg 2400 aacagcagat tgggaaaaaa gtactcctgg cacgcaattc gagggtccaa tttctaccac 2460 tcggttgcgt tagggactat caacgaaatt ccatacgaga tcttacaact ggacaccgat 2520 ctacccattg cagctatcaa aatctcgtgg cctttcccga tgtcaatcgt tagcttctat 2580 cttcccaatg ggaaattaac gaatctaaag aaccgactga tcgatacctt cgataagata 2640 cctgacccaa agatcatcct tggagacggc aacggtcacc acactacctg gggcagtcgc 2700 gttaacaatg ccagaggttc cgcgattttt gaaaccgcaa ccgaaacagg attatgcatc 2760 ctgaatgatg gctgtaaaac cttcatccgt ggtcgcgtag aaacatctat tgacatctcg 2820 ctagcgtccc acaacatcat cgatcggttg ctttggtccg tggacaacga tctgtcgggg 2880 agcgatcact tcccaatatg gatccaattg aactcatcac cgccaaaaac ctcccgtcga 2940 cctcgctggc tttatgacca agccaactgg aaaatcttcc aagacagaat ggaggaaatt 3000 ctggagtctt ctcctccgtc ttctatcgat catttctcga acagcattat cgaggccgct 3060 acggcggcca ttccgaggac tagtcccaca cccgggcgaa gggccctttg ttggtggtct 3120 gagaccacaa agaaagcggt caaagcaaga cgcaaagcac tccgcgcagc cagacgacta 3180 ccggccgacc acccggagaa aattgccgcg atggaaaaat accgagacgc gaacaacaaa 3240 tgcaagttta tcatccggga agccaaggaa aaatcgtgga cgagtttcct tgacgaaatc 3300 aacgataacc agtcctccgc ggaattatgg agcagggtga acgccattca gggtaagcgc 3360 caaaccagag gtctagtcct caaatccggt aaccagacga cccgggatcc aatccttgtc 3420 gctgatgccc tggccgacta ctttggagac ctttcgtcct ttcatcgata cccgctcgac 3480 tttcaaaaaa aacaccaacc cgccaatgcc gcctttgcta tgacacaagt gccaggtttc 3540 cccgttcaga actttaacaa agaattttcc atggacgaac tcctctttgc cctgagcaag 3600 tcaaaaggaa aatccgccgg tccggaccag attggctatc cgatgttgaa aaaattaccc 3660 ccgataggaa aaagaaaggc actagaccta atcaacagcg aatggttagc cgggacccta 3720 cccgacagct ggaagtgcag cctggtagtg ccgattccga aggcttgtgg tacaaactac 3780 gaccccagcg attttcgccc aatagcctta acaagctgct tggcaaaaat gatggagcgc 3840 atggcaaatc gcagactggt ggaatacctc gaaaaaaacg atcttctgga caaccgtcaa 3900 cacgctttcc gggcaggttt cggcaccgga acctatttcg cggcccttgg ggacattctc 3960 aatggagcat tagaaaggaa tgaacacgtt gaaatggcat ccctggactt ggcgaaggcc 4020 tacaaccgag cctggacccc ggggatatta aagtgcctaa tggactgggg tgttactggg 4080 aatcttgcca gcttcgttaa aaactttcta accaaccgtc gtttccgggt tagcatcggg 4140 aatcatcaat ccaaagaaat ccccgaagaa accggtgttc cacagggatc ggttatcgcc 4200 gttacgctgt ttctggtggg aatgaatggt gtcttccaat ccttgccgaa aggagtattc 4260 gtgctggtct acgccgacga cattttgctg ctagtgacag gagtacaccc gaagatggtc 4320 aggcgaaaat tgcaggcagc cgtaaatgca gtcgccaaat gggcaatcaa agtgggcttt 4380 gaaatctcag caaccaaatg tgcaagactc cacatctgca attcgaaaca tcaaccacca 4440 aaaaagccaa tcacggtcaa cggtactgcc attccaaaca agaaaaccgt taagatcgtt 4500 ggcgtcaccc tcgatcgaca cctctccttc aaagcccact tcgataatgt ccgatcggca 4560 tgcaagtccc gaatcaactt gctgaggctg atatccggca aaaggaaaag aagcgaccga 4620 acgtccagaa tccgagttgc caacgcaatc gttaacagcc gattgcttta cgggatcgaa 4680 ataacgagtc aacgcttcga cgaactgatc agcaacctag ggggaaccta caacaacgcg 4740 atccggataa tctcgggaca tctgccttct acgccagccg actcggcctg tgccgaagcc 4800 ggagtcttgc ctttccgata caaattggct aacagtattg gcagtagggc catctgtttc 4860 ttggaaagaa ccaagaacga cggatcggaa gcctgtatcg aaatacaggc caaccagatc 4920 cttaggtccg tggccggcat agagctccct tcggtatctg agcgacaccg taacggagcc 4980 agaagctggt ctgccaggaa aataaaggtg gacaattatc tgaagcaaaa actcaagaaa 5040 ggtggcaacc caaggctcgc acaagccctt ttcctggacc ggatcagaac aaagtacaga 5100 atatacaacg ttcggtacac ggacgggtcg aaggctctcg ggaaagtggg gataggctgg 5160 tgcggcatcg gctttcggga aagcaggagt cttcctagcc aatgctcggt cttctcggca 5220 gaagctgctg ccatttttca agcaatcttg caaccgtcgg accttatcgg gccggtactc 5280 atcgccaccg actcagctag cacaatcgca gcaatattat cggacactaa caagcaccct 5340 tggattcagg agatacagga tgtgcttgat gacgaagaaa acatcacgct gatgtgggta 5400 cccggtcaca gcggaattgc gggcaacgag gaagcggata ggtgtgccaa tacaggcaga 5460 gacagcgagc gtctgaccga catcgtcccg agcgccgatc tcaaattgtg gctacgggaa 5520 aaagttaacg acgcatggag gagaaagtgg cagagcgaaa ggaacctttt caccaggaag 5580 attaaagggg acgttagaaa gtgggaagat cgtccagaca gaagggaaca agttatcctc 5640 tcccgacttc gaacggggca tacacgagtc tctcactcga tgagtggcgg accgaatttc 5700 cgaagaatct gcgagacttg ccaggttcac aactccgttg aacacttcct gtacacttgt 5760 cctactctgg aactcctgag aaagcagtac gatatgggaa gcattcgcac gagcctacag 5820 gacgatagat ccagcgaggc tgtgcttttc cgtttcctca aagatgccaa acttttcgac 5880 gaaatctaac agattttttt tcatccagcc aacagaactc tagagttatt ggataaccca 5940 agcgtaactg ggctccccaa gcacttaaac ttgggtggta agtcccacgt ctacgccagt 6000 gaaaacgttg acctaccatt gaggcaataa aataccgtta ctgggcaccc tgagcaccca 6060 aaactcaggt ggtaagtccc acgtctgcgg ctgtgttttt taaatgaagg aaactgactc 6120 ccctgggcac cctaatcact tggtagataa gacccacgcc cgcatctaga aacaacgcaa 6180 ttaaaaagca acaaaatatc gtaactgggc actctgagca ctcaaaacat agatggtaag 6240 tcccacgtct gcgactttat caacaaactg aatgacccca ctcgtacaac aagaaagaaa 6300 accggagcct gaaaagaaat aaaaatcatt aaattgggca taattaagct actttatacc 6360 cgactcaccc caagctgtca gtcgatcccc cctccgacac gatacgtttt tggaggagag 6420 aaacagcttc ccacataagg catatcgttt gcctgaatca gggatgctga caggatcaac 6480 catctctccg aatccacact gctacccccg cggctgccgg tcgctcatcg gttcaatagc 6540 tgattggtgg tcttctcccg cacccgtcgt agccggattt gccgcatact ccaaccttgg 6600 gaaaaaggag tctcaaactt tccttcctca cacgacggca catcgaggat tttccgctga 6660 aaggctggtg ggtgctcctt ctagcacccg ttgtactcct ttacgcggct catcacagag 6720 acgaagggga aaaatccttt tttcccgtat acggtgttcc cacgacgaaa tcgggttcca 6780 caagtccatc acggggctca ccagtcctcg gcggtttttg gacagaccca gaattggccg 6840 aaacaagaga aaaattacgg acattacaac acaagaagag gaagcgcgtg aatgtcaaac 6900 gttgggacat cacaaaacgg aacgcaagtt ccatgcaaga tgcaaaaagc ctcctcatta 6960 gctaggaaaa tatgttgttt ctttgtttgt tcaattccaa tcttaacttg tagtattgtt 7020 cctttcattt tacttgtttg tttgaatgtg ttaactgtaa ctagtaacgt taaatatgtg 7080 aaaactttct attttttccg tctaggctag cccatctggc tagtatttta gtcgttaaaa 7140 gtggtgaact cgccaagggc gaaaagccac tctaataaag ataaataata ataata 7196 // ID Gypsy-31-I_NVi repbase; DNA; INV; 12808 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-31-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-12808 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 998-998 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(3123..5720,5545..8928) FT /product="Gypsy-31-I_NVi_2p" FT /translation="PKFKLEARSPRSSSAERCKEAREKREENTMGGQHENS FT AAIRFLTAGRQPPIVKIESPQLRARTGKFYADSGADISVIKVGELAPGYPY FT DARKIIKVLGVTPGTSFTLGQAVIQLQGLQCDIHVVPNEFPISESGIIGWD FT IINAHKGCVDAANKSLRLGEVILPFEADEQITIPPRVRMVISARVRNNNAK FT IGWVPLTDLHPNLLFGNFVAENNNGRVYAECINISESEITIPSPVVELLDC FT ETIADNTLYRADEDGSAENIANFTASLRRMFSCEKQQEKYKEVSALNNKLM FT ADDKARRERVEKILQLADTEGCNAEEIELIRAVVNDYAGVFGLEGEPLPAT FT HLLQHKIILKSDKVIRNARFRFPPALKEHMLRELAKLREQNIVVPSNSNYS FT SSLWIVPKKPDAQGNKKFRLVTDFRALNEETEGSCHPLPFTSDILEHLAAA FT NYITVMDLKQGYHQIEMHPESAHLTAFYAPDGNYGNQLLQFNRMAMGLKEA FT TITFTRAMSLAMAGLQGEEVEIYLDDLMVFSETLDQHMVRLRRVLKRLLEA FT NLTVEPRKCQFLKKEAHVLGHIVGGGSIKTDPAKTRAMAKYPVPTDPKKLK FT QALGLFSYYRRFIKNFSGIARPLFKLLQKEAEYIWGPEQTVAFNTLREMMS FT KEPVLKAPDLNQPFIVTTDASDWALGAILSQGKLGADQPCAYASRCLKGSE FT LKYPTYDKELLAIVFAKDQFRYYLFGRKFTICTDHEALKHFHTTKKPDLRF FT NRLKAALIGYDFDIIYRPGKKNANADALSRNPVISEGEDNPELPRVELYAL FT ADKQIKENPDEDANAPPGRIFRTRAVKGAAGEKARQSSSSPDDFERSRENK FT EKAGRERANFQQRKKIRTKMQTPHRAEFFAHAQSKGRQERKHDNRHRAPTI FT SSEAGKIKKKRDVNALIFSKDDTLAVRNEFNEFYLCRALHDVHHGDELIQI FT RWFTEAKNQPNVYALDYRDAIRFETVLTSTTLKRTTGNRYVLSKDEKARIF FT KILHEAIELYPAPVISQVGAETALNAASDSDESMITVKTAVSDSIRDVILP FT PGAWNYVRNVSPASATRGAKSSKSRKTPKVYSPNSSSSSESEGLIPPSGKI FT TSQEPLQTFKFPIERVNPRNILLPHSASSVSSMSANKSPAPKVPSLSHETS FT PKENHSRPNSKPMTPDVENRANDDPIPLPSPSWPWSEQIQKKRLAVDIKAK FT EFGSKGGLQINVWERKSTDGSERGNAQQIIATESNRHAPAENAHVDVRNAQ FT IRHPSPQAERIQQDAQKLDSRSGKVILDSRKPPPAANATEPEPKETSREQT FT XHIAPMDTESEMHALADPENVGPAEDFSGIPRNYDRRPEPSDENSSISSGN FT VLVQTYIRKKRCAEPYTKLRRSSNIPSFPPGNCLFFSLIKIAKLDISATEL FT RQQLRVSSMLQHCGEPGETEKILASDTDYGTVDCAFLFAHEYGINICIHYD FT LPGDTRIFCHIIVNSRDKYIHLNLTGQHFTPYLRVKVPKPAARAAARPRNR FT SPSSDTDQSARKSPILGRIKKRTRKTAHTTIGLKAKGHIAQNSDAVSEAPA FT PIAASSGSNVHPMQVATSDDHETSKSPSIIVNPAESSGPEGSGTPQFTKFQ FT RSYNAHPPSANISPQIPPDTSNSTVASSAGENIAAAEVHSGSTAHQPTRTR FT PRAEADALTDESGVSTDNERIAPTGDTTGAMIEAARKDEKLKRYLVAGARP FT PREKPPWAIPEGQEPPPFVHLQSYNEHPFRFRENLVYLMSADNFISTEIQE FT ALIERGYLDAEALVSKRFNVGEINITEFKEIQLIGVYVKRHIDDRPLKADL FT QKCLKTLRSVVLSRKINSFAIIRDLAILTIDEWTKFIDLFDAIFMNKHVTA FT ILYKNNLPVPPVSERIKILKEYHEAAMGAHRGISKTYNKVASDFYWRHMRP FT DVKQFVARCAVCQSNKLVRIKTRLPMLISNTPSLPFAQIALDFYGPLEPTE FT RGNRYILSVQDMLTKYVI*" FT CDS join(8805..9614,9694..10854) FT /product="Gypsy-31-I_NVi_3p" FT /translation="QYTIVTFRANCVRLLRATGANGTWQQIHSFSPRYANK FT VRNLIPTKHASANEVARALTEKVICIFGPPAAIVTDQGTHFQNKTLEYLAE FT IFGIHKFCTTAYHPQANGSIERMHHTLTEYLRKYVRKTDCWDDWTAICQHA FT YNCTEHESTRYTPHELLFGIKPRTPSSFPRKNDDISYNQYIEEMTTNLTAL FT QSTVAMNLVQSKYRSKYYYDRKLNTKHFREGETVFLLKEPKESKFAVEYQG FT PFEIIKISRKTNNVTLQNDETTKIVHVNKCLYLNIYFANIRVPRSLARRKN FT KLSLLTRACPYRLAVGAQPVGTFELNEESTNGFIHRFQENLGLIIEKIAPL FT ATSSTDWKIIQKTDLRAYFKAGEILMKHATQIIHACGPKCEESNILEEIKA FT AKRQAERVIDLILVHGGGEDLHRQRRSLLPFIGTVHKFLFGTLNENDEREI FT KAAIQAIDGDTRLTAALLAKQTEIVNRTLFDLDQKTARLQAHVVELANRTA FT ANNNDIATNSAMSSLKTTLLQFKLDTEVLTDAILFAAKGLVHPRILPPNTI FT NRAAKTVESAISNARFPLPEGSFSALPIMKISKLSILYAEGYLIYQIAIPL FT LDTQKFIQSITPTGDATNSKQLGGSSIYLARTSVLRNERIKPYLYAATSGK FT NRKTQEN*" FT CDS join(95..619,672..1841,1576..2292,2358..2981) FT /product="Gypsy-31-I_NVi_1p" FT /translation="FLKKLEESSTWHQDSHRRIQQRKVKENREFSRQNPRG FT RRGSRQPSELKKITQSIEKAVGRLNKEGLDCKASLKKSTKSIWAHKVDLRP FT SKSIWTNEAHLGQRTSSNTRRRGQTTDEPSSSAAVATTRQPEPAPASTRST FT VVGADQPPTRASNQPAPAASPATSMDRVIAVALSFLNTHTYFRERQNSYSV FT KSAHHKSKQKKNYIMAPPKIISAARSALRLGARSKDRTSGAESDPQDTRKQ FT LSRANRIFKIHQKMPPNYQRGVTRTLSSAVLESSDSENEIHRTRMTRNWST FT AQKQLLENSSTVPRRTEQSNQTDEESRELIDIRANSPALSYAGLPQGPQST FT QIDMKNLESRLEELRLDQSVQVDNPWGPIGQPLETPERLAINARCQQNIAN FT LCGMKNIVWGNNDLITDGPKPQDYGLPTATDSTQFVGLPPVLPIDNLGQKE FT TDGGMVRAEPAIVNAAGTSELRLSREQLQRIIKESIRQYSEEEGSVLGGKA FT SRRYRVREGNTKQRGRTYAEKNTETRRKDTRQARKVARTKNDKAAIGEARD FT RERYGCTRQSEKNQRRRALKSRSGSIVKKRAQYSVVRQADDIGSAKATPNN FT AEERTRKKTPRRDEKTPDRRERSRERRTIRRQSERRETGKDTVARDSQRKI FT RDDARSQNISREIRRNKYEPHSIRRSPYREAAANSDRTRKMHSSPSRGFMR FT YNGHATPRRAHNAYYADTESDSEDDFETETNDELETRRDGARTYDDYSSET FT HDSEYESSPGTRPDRAIYELVTSATDITFDEKCEYRHCGANSLPDTHGKAR FT RRTQPQSYRTPKGRRGGDRRTRTCLVRSKGPKSLGAAITVAIGYEGKQSAR FT SAIPSETSAARASAQVRLVTAEEASPAADRDARPEKSNAQSDRQTEKTYCT FT YCKMXGHPLDECRTIIRHAAEKIIQNPIANRGRNFRYDDNNGNGNNNGTNP FT ENRNASNGSASKLNGDQNYNNGGDSNNGGYRGRNPNFNRNSRFNNYQNNGS FT QLRPK*" XX SQ Sequence 12808 BP; 4252 A; 3170 C; 2934 G; 2446 T; 6 other; tctggcatcc ctacgtgggg ctcagcagca gccaagacag cagctgagtt tacggcaact 60 ttccaccgca gccgtctgaa aattactcag ataatttcta aagaaattgg aggaaagcag 120 cacgtggcac caagatagcc accgcagaat acaacagcga aaagtcaagg agaacagaga 180 attcagccga caaaatccac gaggtcgccg aggctctcga caaccgtcgg agctaaaaaa 240 aataacgcaa agcatcgaaa aagcagtcgg ccgattaaac aaggaaggcc tcgactgcaa 300 agcaagtctc aagaaatcca caaagtccat ctgggcccac aaagtcgatc ttcggccctc 360 aaagtccatc tggaccaacg aagcccatct gggccaacgc acaagcagca acacccgtcg 420 gcgcggacaa accaccgacg agccaagcag ctcagcagcc gtcgctacaa ccagacaacc 480 ggagccagca ccagcctcca cgcgcagtac agttgtcggc gcggaccagc cgccgacacg 540 cgcgagtaac cagccagccc cagcagcgtc accagcaacc agcatggaca gggtcatcgc 600 agtcgcgctc agctttctgt aagtccctct tattaaatat ccacgcagct tggcttccgg 660 gaacgaattg aaatacccat acatattttc gcgagcgcca aaatagctac agtgtaaagt 720 ccgcgcacca taaatcaaag caaaaaaaaa attatattat ggcacccccg aaaataatta 780 gcgcggcaag gagcgcgctc cgacttggcg cacgcagcaa ggacagaaca tcaggtgcag 840 agagcgatcc gcaagatacg cgaaaacaat taagccgggc aaatcggata tttaaaatac 900 accagaaaat gccgccgaat tatcaaagag gagtaacgcg tacgttaagc tccgctgtat 960 tagagtcaag cgattcagaa aacgagatac accgcacaag aatgacgcga aattggagca 1020 cggctcaaaa acagctactc gaaaatagca gcacggttcc gcgaagaacc gaacaatcca 1080 atcagacgga tgaagaaagc cgcgaattaa tcgacataag ggcaaacagc ccggcattat 1140 catatgcggg attaccgcag ggccctcaat ctactcagat cgacatgaaa aatcttgaga 1200 gtagattgga ggaattgcgc ttagatcagt ccgtgcaagt agataacccg tggggaccaa 1260 tcggacagcc actagaaacc cccgaacgcc tggcgataaa tgctagatgt cagcaaaata 1320 tcgcgaattt gtgcggcatg aaaaacatag tgtggggcaa taacgatcta atcactgacg 1380 gtcccaagcc tcaggactac ggtttaccca cggccaccga tagtacgcaa ttcgtgggtc 1440 tcccacccgt gttaccgata gacaacttag ggcaaaagga aacagacggg ggaatggtac 1500 gcgcggaacc agctattgta aacgcggcgg gcacgtcgga gcttcgcttg tccagggaac 1560 aactccaaag aataattaaa gagtcgatcc ggcagtatag tgaagaagag ggctcagtac 1620 tcggtggtaa ggcaagccga cgatatcggg tccgcgaagg caacaccaaa caacgcggaa 1680 gaacgtacgc ggaaaaaaac accgagacgc gacgaaaaga cacccgacag gcgagaaagg 1740 tcgcgcgaac gaagaacgat aaggcggcaa tcggagaggc gcgagacagg gaaagatacg 1800 gttgcacgcg acagtcagag aaaaatcaga gacgacgcgc gtagtcagaa tatctcgcga 1860 gaaattcgca gaaacaagta cgagccccat agcatccgac gctcaccgta ccgcgaagcg 1920 gccgcaaatt cggacagaac ccggaaaatg cacagctcgc ccagccgggg atttatgcga 1980 tataacggcc acgcaacacc gagacgcgca cataatgcgt attacgcgga tacggaatcg 2040 gacagcgaag acgattttga gacagaaaca aacgacgagc tggagacaag acgagacgga 2100 gcgcgcacat acgatgacta ctcgagcgaa acacacgaca gcgaatatga gagcagccca 2160 ggcacccgac cggatcgcgc gatatacgaa ttagtcacaa gcgcaacaga cataacattc 2220 gacgaaaagt gcgaatatag gcattgcgga gcaaattcac tcccagatac gcacggcaag 2280 gcaagacgaa ggtgaaacgg caggcgacta cggtctgcgc gtcagcaaat tatataaccg 2340 gttgaaaact attctaaact cagccccaga gttatcgaac gccgaaaggg aggcgcggcg 2400 gcgacaggcg gacgaggacg tgcttggttc gaagtaaagg gccaaaatcc ctgggtgccg 2460 ctatcacggt agcgatcggg tatgaaggga agcaaagcgc gagatcagcc atacccagcg 2520 aaaccagcgc tgcaagggca tcagcacagg tacgtctagt gaccgctgaa gaagctagcc 2580 ccgccgcgga tagagacgca agaccggaaa aatccaacgc gcagtcggac cgccaaaccg 2640 aaaaaacata ttgcacgtac tgcaagatgm caggccaccc acttgatgaa tgccgcacaa 2700 taataagaca cgccgcggaa aaaataattc aaaatccgat tgcaaataga ggcagaaatt 2760 tccgctatga tgataataat ggaaacggca acaataatgg cacaaatcca gaaaatcgca 2820 acgcgagcaa cggctccgcg agtaaactaa acggcgacca gaattataac aacgggggcg 2880 atagcaacaa cggcggatac cgcggcagaa atccgaactt taaccgtaat tcccgattta 2940 acaactacca aaataacggc tcgcaattac ggccaaaata accgaaataa ttacggcaac 3000 aatcgctata gcaatcacta taatcaccgt aactataatc gcaatgataa ctacaatgat 3060 aatcacaata ataaccgtga taatagccgc gacaacgttc aaacaaataa cgaaactaat 3120 aaccaaaatt taaactagaa gcacgctcgc cgcgttcctc aagcgccgag cgttgcaaag 3180 aagcacggga aaagagagaa gaaaayacta tgggcggaca gcacgaaaac tcagccgcga 3240 ttagattttt gacagctggt agacaaccgc cgattgtaaa aatagagagc ccccagttga 3300 gggctagaac gggcaaattt tacgcrgact cgggagcaga catttcagtt ataaaagtag 3360 gggaactagc ccctggatat ccgtacgacg cgcgtaagat aataaaagta ctcggagtta 3420 cccccggtac atcgtttaca ctagggcaag cggtcataca gctacaaggg ctgcaatgtg 3480 acattcacgt agtgccgaat gaattcccga taagcgaatc aggaatcatc ggctgggata 3540 taataaacgc acataagggt tgcgtagatg ccgctaataa gagcctaagg cttggcgaag 3600 taattctgcc gttcgaggca gacgaacaaa ttacaattcc ccctagagtc aggatggtta 3660 taagcgcgcg cgtacgaaat aataatgcaa aaatcgggtg ggttccacta acggacctac 3720 accctaacct actattcggc aatttcgtag ctgaaaataa taatggccga gtctacgcgg 3780 aatgcataaa cataagcgag tccgaaataa caatacctag ccccgttgta gaattacttg 3840 attgcgaaac aatcgcggat aatacgttgt accgagccga tgaggacggc tcagctgaaa 3900 atatcgcgaa ttttacagcg agtttgcggc gcatgttcag ctgcgaaaag cagcaagaaa 3960 aatataaaga agtaagtgca ttaaataata aactaatggc agacgacaaa gccagacgcg 4020 agagagtcga aaaaatattg cagttagcag acacggaagg ctgtaacgcg gaagaaatcg 4080 aattaattcg ggcggttgtc aatgattacg ccggagtttt cggcctcgaa ggtgagccac 4140 tcccggcaac gcacttactg caacataaaa tcatattgaa atctgataaa gtcataagaa 4200 acgcgcgatt cagattccca ccggctttaa aggagcacat gctccgagag ctagccaagc 4260 tccgcgagca gaacattgta gtaccgtcaa actcgaatta ttcatcatca ctttggatag 4320 tcccaaaaaa acccgatgcg caaggcaaca aaaagtttcg cctggtgaca gattttcgtg 4380 ccctcaacga ggaaacggaa ggaagttgcc acccgttacc gtttactagc gacatacttg 4440 aacatctcgc ggccgcgaat tacattacgg tgatggacct taaacaaggg taccatcaaa 4500 tcgagatgca cccagaatcc gcgcatctca cagcatttta cgcgccggac ggtaattacg 4560 gaaatcaact tttgcaattt aatcgtatgg cgatgggcct taaagaagct acaattactt 4620 ttacgcgagc catgtcccta gctatggccg gtctgcaggg ggaggaagtc gaaatctacc 4680 tcgacgacct catggtattt agcgaaacgt tagaccaaca tatggtccgc ttgcgacgcg 4740 tgttaaaaag actgctcgaa gcgaatttaa cagttgaacc gaggaaatgc cagttcctga 4800 aaaaggaggc gcacgttctc ggacatatcg tcggaggcgg tagcattaaa actgacccgg 4860 ccaaaactag ggcaatggca aaatacccag tgccaaccga cccgaaaaaa ctaaaacaag 4920 cactcggcct attcagttat tatcggcgat tcataaaaaa tttctcgggg atagcgcgac 4980 cgctatttaa actcctgcaa aaagaggcag agtatatatg gggcccggag caaacagtag 5040 cgtttaatac gttgagagaa atgatgtcaa aagaacccgt gctaaaagcc ccggatttaa 5100 accaaccatt catcgtgaca accgatgcga gtgactgggc gctgggcgcg attttaagtc 5160 aagggaagct gggagcggat cagccctgtg cgtacgcttc gcgctgcctg aaaggcagcg 5220 aattaaaata tccgacgtac gacaaggaac tcctcgcgat agtatttgcc aaagatcaat 5280 tccgctacta cttattcgga cgtaaattca caatatgtac agaccacgag gcgcttaagc 5340 attttcatac aaccaagaaa ccggacctaa gatttaaccg attaaaagcg gcgctgatcg 5400 ggtacgattt cgatattata tatcgcccgg gcaagaaaaa cgcgaacgcc gacgcgctat 5460 cacgaaatcc ggtgataagc gaaggtgagg ataatccaga gctgccgcga gtcgaactat 5520 acgcgctagc agacaagcaa ataaaagaaa atccggacga agatgcaaac gccccaccgg 5580 gcagaatttt tcgcacacgc gcagtcaaag gggcggcagg agagaaagca cgacaatcgt 5640 catcgagccc cgacgatttc gagcgaagcc gggaaaataa agaaaaagcg ggacgtgaac 5700 gcgctaattt tcagcaaaga tgatacgctc gcagtgagaa acgaattcaa cgagttttac 5760 ttgtgccgcg cgctgcacga cgtacaccac ggtgacgagc taattcaaat acgttggttc 5820 acagaagcga aaaatcaacc gaacgtgtac gcgctagact accgcgacgc gataagattc 5880 gagacggtgt taactagtac aacgctaaaa agaacaacgg gaaacagata cgtactaagc 5940 aaagacgaga aggcgcgtat atttaaaata ctacacgagg caattgaact ctacccagcg 6000 ccagtgatct cgcaagtggg agccgaaacg gccctcaacg cggcgtcaga ttccgacgag 6060 tcaatgataa cggtaaaaac tgcagtatca gacagcatcc gcgacgtaat tttgccgccc 6120 ggggcgtgga attacgttag aaatgtaagc ccagcaagcg ctacgcgtgg cgcaaaaagc 6180 agtaaaagcc gaaaaacccc gaaagtgtac agcccgaaca gctcgtccag cagcgaatca 6240 gaaggcctaa tacctccgtc gggaaaaata acgagccagg agcccttgca aacgttcaaa 6300 ttcccgatag aaagagttaa cccgcgcaac atactcttgc cacatagcgc tagctccgta 6360 agtagcatga gcgcaaacaa atccccagct cccaaggtac cctcgctgtc gcacgaaacg 6420 agcccaaaag aaaatcattc acggccaaac agcaagccca tgacaccaga cgtggaaaat 6480 cgagcgaacg atgacccaat accgctgccc agtccatcgt ggccgtggtc cgagcaaatc 6540 caaaagaaga ggctggcagt cgacataaaa gcaaaagaat tcggtagtaa aggggggcta 6600 caaattaatg tatgggaaag aaaatcgact gacggatcgg aaagaggaaa cgcgcagcaa 6660 ataatcgcca cagaatcgaa tcggcacgcg ccggccgaaa acgcgcacgt agacgtaaga 6720 aacgcgcaaa ttcgccaccc gagcccgcag gcagaaagaa ttcagcaaga cgcacagaaa 6780 ttagactcga gaagcggaaa agtaatacta gactcacgga aaccaccgcc agccgcgaac 6840 gcgacagagc ccgagccgaa agaaacctcg cgcgagcaaa ctyagcacat cgcgccaatg 6900 gacaccgagt cagaaatgca cgcgctcgcg gatcccgaaa acgtcggccc agccgaggat 6960 tttagcggca tcccaaggaa ttatgatcga aggccagagc catccgatga gaactcgtca 7020 atcagctcgg gcaatgtatt agtgcaaacg tatataagga aaaaacgctg tgccgaaccg 7080 tatacaaagc taagacgttc aagcaatatt cctagtttcc cgccagggaa ctgcctcttt 7140 ttctcactaa tcaaaatagc aaaacttgac atttcggcca ccgagctgcg acagcaattg 7200 cgagtttcgt caatgctgca gcactgcggc gagccaggcg aaacagagaa aatcctagct 7260 tcagatacgg actacggcac ggttgattgc gcgtttctat tcgcgcacga atacgggata 7320 aacatctgca tacactatga tttgccaggc gacacccgaa ttttctgcca cataatagtg 7380 aatagccgcg ataaatatat tcatttaaat ctaaccggcc agcatttcac gccgtacttg 7440 cgcgtaaaag tacccaaacc cgcggcaagg gcagcagcta gaccgcgaaa ccgctcccca 7500 tcaagcgaca ccgatcaaag cgcgcgcaaa tcaccgatcc tcggcaggat aaagaaaaga 7560 acgcggaaaa ctgcgcacac gacgattgga cttaaagcga agggtcatat cgcgcagaat 7620 tcagacgccg taagcgaagc gccagcgcca atcgcagcca gcagcggttc aaacgtccac 7680 ccaatgcaag tagcgacgag cgacgatcac gaaacctcga aaagcccttc aataatcgtg 7740 aatccggcgg agtcctcggg ccctgaagga agcggaaccc cgcaatttac gaaattccaa 7800 cgctcataca acgcccaccc tccaagcgcg aatataagtc cacaaatacc acccgatacg 7860 agtaacagca ccgtagcatc ctctgcggga gaaaatatag ccgcagccga agttcatagc 7920 gggtcaacag cacatcagcc aacgcgaact cgaccgcgag ccgaggccga tgctctgaca 7980 gacgagagcg gcgtcagcac tgataacgag cgcatcgctc ccacaggcga caccacgggt 8040 gcaatgatcg aggcagcgcg aaaagacgaa aaattaaagc gatacctggt agcgggagcc 8100 aggccgccgc gcgaaaaacc accctgggcc atcccagaag gccaggagcc ccctcccttc 8160 gtacacctcc aatcatacaa cgaacaccct tttcgcttca gagaaaatct cgtatattta 8220 atgtcagcgg ataactttat atcaactgaa attcaggaag ccttgatcga acgcggctac 8280 cttgacgccg aggccctggt tagcaagcgc tttaatgtag gagaaataaa cattacggaa 8340 tttaaagaga tccagctgat aggggtttac gttaaaaggc acatcgatga ccgcccattg 8400 aaagcagact tacagaaatg tttaaaaaca ttgagaagcg tggtactctc gcgcaaaatt 8460 aacagcttcg caataattcg agatctagcc atattgacga tagacgagtg gaccaaattc 8520 atcgatctgt tcgacgccat attcatgaac aagcatgtaa cagcaatcct atataaaaat 8580 aacttaccgg tacccccggt aagcgaacgc attaaaattc tcaaagagta ccacgaagca 8640 gcaatgggag ctcatcgtgg catatccaaa acttataaca aggtagccag cgatttttat 8700 tggagacaca tgcgcccaga cgtgaaacaa tttgtcgcgc ggtgcgcagt gtgccagagc 8760 aacaagctag tacggataaa aacgcgacta cctatgttga ttagcaatac accatcgtta 8820 cctttcgcgc aaattgcgtt agacttttac gggccactgg agccaacgga acgtggcaac 8880 agatacattc tttcagtcca agatatgcta acaaagtacg taatttaatc ccgacaaaac 8940 acgcgagcgc gaacgaagta gcacgtgcgc taaccgagaa agtaatctgc atttttggtc 9000 cacccgcagc tatagttaca gatcagggaa cacatttcca gaataaaacc ttagaatacc 9060 tcgcggaaat tttcggcata cataaatttt gcacaacagc ttaccaccct caagcgaacg 9120 gatccatcga acgcatgcac catacgctga ccgagtattt gcgaaaatac gtgagaaaaa 9180 ccgattgctg ggatgactgg accgcgatat gccagcacgc gtataattgt acggaacacg 9240 aaagcacacg ttacacgccg cacgagctgc tgttcggaat aaaaccacga actccgtcga 9300 gctttccgcg aaaaaacgac gatatctcgt ataatcaata tatcgaggag atgactacga 9360 atttaacagc gctacaatcg accgtggcga tgaacctcgt acaatcgaaa tatcggtcga 9420 aatattatta tgatagaaaa cttaacacca agcattttcg cgagggggaa acggtattcc 9480 tactgaaaga accaaaagag agcaaatttg cagtagaata ccaggggccg ttcgaaatta 9540 ttaaaatcag caggaaaacg aataatgtaa cattgcaaaa cgacgagaca acgaaaattg 9600 ttcacgtcaa caaataaaaa gaatagcgaa ctagctagac tcgggaggat agtcgttttt 9660 ttttcactgt ttttacgtgc cgcgcaaaaa tagtgtctat acctcaacat atacttcgca 9720 aatattcggg tcccgagaag ccttgcgcgc cgcaagaata aattatcctt gttaacgcga 9780 gcatgccctt acagattagc agtgggagca caacccgtgg gcaccttcga gttgaatgag 9840 gaaagcacca acggatttat ccaccgattt caagaaaact taggcttaat tatagaaaaa 9900 atagcaccgc ttgcgacgtc cagcaccgac tggaaaataa tccagaaaac ggacttgcgc 9960 gcatacttca aggcaggaga gatactaatg aagcacgcga cacaaattat acacgcgtgc 10020 ggcccgaaat gtgaagaaag taatatcctc gaagaaataa aagcggctaa gcgtcaggcc 10080 gaaagggtga ttgatttaat tttagtgcat ggcggcggag aagacctaca caggcagcgc 10140 cgctcgcttt tgccatttat aggcaccgtg cataaatttt tattcgggac tttaaacgag 10200 aacgacgagc gcgaaatcaa agccgcaatt caagccatag acggcgacac acgcctcaca 10260 gcagctctcc tagcaaagca gaccgaaata gtcaaccgca cgctattcga cttggatcaa 10320 aaaaccgcgc gtttgcaagc ccacgttgta gaattagcca atcgaactgc ggcaaataat 10380 aacgacatag caacgaacag cgcgatgagc agcctaaaaa ccaccctact acaatttaag 10440 ctagacaccg aggtcctcac tgacgcgatc ctgttcgcgg ccaagggatt agtccatccg 10500 cgaatcctac ctcctaatac aatcaatcgc gcagcgaaaa cagtcgaaag cgcaatctca 10560 aacgcaagat tcccactacc agagggcagc ttttcggcgc tcccaattat gaaaatttca 10620 aaattatcaa tattgtatgc cgagggttac ctaatttacc aaattgcgat ccctcttctc 10680 gatacacaaa aatttataca aagcatcacc cctaccggcg atgcaacgaa ttctaaacag 10740 ctcggaggta gcagcatata tttggccaga acatcagtac ttcgcaatga gcgaatcaaa 10800 ccgtacctat atgccgctac ctcaggaaaa aatagaaaaa ctcaggaaaa ttagtaactt 10860 gctcatagcc gaaacctgta agggaggtgc acgctaacgc ggcgtgcgag attataatag 10920 ctgccgggcg cgcattgaac aaccctgggc actgcgacgt cagaattcgt cagctcaagg 10980 acacattctg gctacggctg cacaaagcta atacatgggt attctccacg tattcagccg 11040 aaatatatac atccaatgtc ttaagagccg agcagattac ggcaaccata aacggtgcag 11100 ggctgttatg cacggcctga attctcggcg cacatggcaa acgcgtaatt aacagcgtcg 11160 cgatccctcg aagcgcgcat gaaagattca tctttcaata ccgtgcactt agatctttca 11220 gcaatcgttg ccgaattaaa ccaatcaaaa aatactttat ccgaattcga atcagcaatt 11280 aaaacggaag cggaaattcg ggccgtgaat ctacacggta taaaattaga agcactagaa 11340 tcggggatag gccttcgcga cattgccgca aaagcccgcg agcaagcgca aagcaaggca 11400 acagcgcgca aggtgcaagc ccttgacgag aacacaacca ttttcggata ctccgcgtcg 11460 gcaatcttaa taggagtaat tgcgctaggc ggcgcctgct ggtacgtaaa gagcagacgc 11520 gccagtaacc ccatagaaaa attcatccgg cagcaagaaa cgagaatgat ggtggaccaa 11580 gtgcgccagc tatcgcgcag tcgtaacgcg atcaacattc aataaaaaga aaaaataaaa 11640 atcaatctcg ctatcgattc gctacaacag attgaagccc tcgcacgctc aaaaatatca 11700 gagtcaaaag ccaagctgca tggaaagagg gacaaaatgc gagctctcta ggagcaagct 11760 tactagtaga ataaccccta gtagaaaatt aacagttgaa attcattaaa aagttaagtt 11820 taattggcca gttttatcaa atttgaatgt aaaaaaatta attaaaatat tgctaatatt 11880 ttagtgatgc gtatctataa taccacatca gtattattat taagttaaac attgtttttc 11940 cctttttttg gcattaaaac gtatatatgt cttattacga atatttggat tttttacgat 12000 tgtaacgtat ttttctacaa tttttggatg aacagtcatt tgacaaattt caagcaaaag 12060 tttcatttta tcatgattaa tgcgtcatat ctatgcattt ataacggtta aacgcttatt 12120 taggtaaatc gttaaccaca caagtgtgtt aaaacttagc ttattaaccg ttaacatttc 12180 taccagggac ctttattata tatttgtaca cataatatta atcaaatggg aatatttaga 12240 tataagttgc gcttaggcca taccagccaa gccgcaggta gcgcaagtag cgcccataat 12300 cattartaac gccaagcgcg atataattaa tttttctttt cagctatact tattacaaca 12360 caatgattat aatataaaat attgttatac caattttcgt caatgaaatt aataattatg 12420 tatatctaat tttaagaaat atttgtatgt aggagctctg ccccgccccc aaaaaattgt 12480 acagcagcag cggctgctaa tcacaacatc cgcgacataa aatacacaca caccgaaatt 12540 cacacttgca ctaaaaaaca tatataatag cacaataata aatcggaaat tggagaagaa 12600 tccaatgcgc aaacctctct cgtgtggggt cacccccact ggcatccggg acaaaagaaa 12660 tttaaaagag aaaaaaaatt tttttaaatt kaatcacaca cactcactct cacaaacaca 12720 aactcacaca agcgctccca ggtgcggcaa tcggtaactg tttttttttt ctgggcgacg 12780 aggacgtcag cccgcgcgac ctcggact 12808 // ID Gypsy-1_Cfl-LTR repbase; DNA; INV; 329 BP. XX AC AEAB01004336; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_Cfl_; KW Gypsy-1_Cfl-I; Gypsy-1_Cfl-LTR. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-329 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01004336; Positions 330 2. XX SQ Sequence 329 BP; 96 A; 91 C; 67 G; 75 T; 0 other; tggcggacac tcacgtgtgc ccgaaaaaga aaagtcataa aagtaaaagt tacagcacgt 60 aaatctcgag tttacaaggg cgaacccgaa cgtgttgaaa acaataagat tttcacacat 120 atgccgggca ttcctagcct ttccgtagac gacagattct caccctttcc gcctaagcat 180 tcggctgcct acatgcgacg cgaccggacg ggcggcaccc gcgatgagta atcccccgcg 240 attaggaata agcactcgtt tgtatgtaaa ataaactgta catatcctct gtagtgattc 300 ttactccatt tccacgcacc accacctca 329 // ID CR1-19_CQ repbase; DNA; INV; 4411 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-19_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4411 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 23-23 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 2..4366 FT /product="CR1-19_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="IXRTQRRXFLGXFKCCREHSGKSATPPVPTLNDADDA FT EFAARVCTAIDSKDTGACFGVDWRSGRFTHGSSDRITCGTIVPTIPASFRS FT SSQHLPVRVLQSSPGRAVESLMRAPRLPDATTSSPRQRLRTPASRSHQSRP FT GAGVGVGGGVSQPAPFGKYPSRSQHFLPDVRPTSSSCASTSATQTAIVSPP FT SFRAIQFSPGRAVESPMGAPRLPDATTSSPRQRLRTPASRSHQSRPGAGVG FT VGGGVSQPAPFGKYPSRSQHFLPDVRPTSSSCASTSATQTAIVSPPSFRAI FT QFSPGRAVESPMGAPRLPDATTSSPRQRLRTPASRSHQSRPGAGVGVGGGV FT SQPAPFGKYPSRSQHFLPDVRPTSSSCASTSATQTAIVSPPSFRAIQFSPG FT RAVESPMGAPRLPDATTSSPRQRLRTPASRSHQSRPGAGVGVGGGVSQPAP FT FGKYPPRSQHFLPDVRPTSSSCAGPSATQIANASTLSATVPSQLANDVQPQ FT RPGSVDRRDSSLLTIYYQNVRGLRTKTNQLHLSLSSCDYDVIAFSETWLNA FT KIVDSELSSDYTFYRADRSRETSVLERGGGVLIGVKKRFHPEPIHLRGGEK FT LEQIAVRIPLPGKTLFVCCVYIPPNSDATIYTQHSSCVQQLCDLAQGPDEV FT IVVGDYNVPNLTWHFDEDINGYLPSGQSTEQEVALIEDVLGTGLQQMFDLP FT NANGRLLDLALVSDSSRFQLIEPPRAILKVDAHHRPFILVLETQSTGTAAE FT QSGTYYDFANCDTMALNNSIASLDWANLLLVGSVDDIVERFYDKLNEIIRE FT VVPLKHRRPHPNSHQPWWNRETRNLRNRLRKARKRFLKHPTEMLGRELADR FT ELHYEMSLEAAFRGYISGIESNLKREPKSFWAFVKKRKQDKGIPQDMVYRD FT SSASTPDDSVQLFADFFKTVYSANQPTTSNESLDKILPHELHMPLVCLTEE FT DVRKVLASTDASKGPGPDCIPPSIVKQCAATLANPLAIIFNRSLESGVFPA FT KWKVASITPIHKSGNVHNVENYRSISILSCLPKVFEKIVHEFLSRAVQPII FT SACQHGFMAKRSTTTNLMTFVTDVLRSMESRCQVDAVYVDFAKAFDRVPHQ FT LAVEKLRRLGLPEWITTWLHSYLTSRSAYVKLGGSKSGLFMIPSGVPQGSH FT LGPLIFLLFIDDLCHWIKNDKVLYADDLKFYRTITSPLDCLALQSDIDSLM FT RWCDQNGMEANAKKCQVITFDRKLRPITHAYMMGISLLDRVCSVKDLGVTL FT DRKMTFNEHITSVTAKAFATIGFIRRNTTAFKDIHALKALYSALVRSQLEY FT AAQVWAPYHNVHIARLERVQRAFVRLALRTLPWRNPPEQTSIEDRRRLLGL FT DTLLKRRQRMQQLFIFDLLTKRLDCQKLSQLLIVYVPPRNLRRNPSMFRLS FT VHHTVYGHNQPFDVCCRLFNVVSDRFVLNMTKDSFKKLISC" XX SQ Sequence 4411 BP; 1023 A; 1265 C; 1077 G; 1043 T; 3 other; aatcwgaaga acgcaacgca gaawtttttt gggagmcttc aagtgctgtc gtgaacactc 60 aggaaaatcc gccactcccc ccgtcccaac ccttaacgac gccgacgacg ccgagtttgc 120 cgccagagtc tgcaccgcaa tcgactccaa agataccgga gcatgttttg gagttgactg 180 gaggtctggg cggttcaccc atggatcatc agacagaata acgtgcggca caatcgtgcc 240 aacaatccca gcatcttttc gctcttcatc ccagcatcta ccagtacgag tcctccagtc 300 ttcgccagga cgcgccgtag aaagccttat gagagcccct cgcctgcccg acgccaccac 360 gtcttcgcca cgccaacgct tgagaacccc tgcaagtcgc agccatcaga gccgtcctgg 420 tgctggtgtc ggtgtcgggg gaggggtctc tcaaccggcg cccttcggca agtacccttc 480 tcgatcgcaa catttcctgc ctgatgtccg tccaacttct agctcgtgcg ccagtacgtc 540 ggcgactcaa actgcaatcg tctcaccgcc gtcatttcga gccatccagt tttcgccagg 600 acgcgccgta gaaagcccta tgggagcccc tcgcctgccc gacgccacca cgtcttcacc 660 acgccaacgc ttgagaaccc ctgcaagtcg cagccatcag agccgtcctg gtgctggtgt 720 cggtgtcggg ggaggggtct ctcaaccggc gcccttcggc aagtaccctt ctcgatcgca 780 acatttcctg cctgatgtcc gtccaacttc tagctcgtgc gccagtacgt cggcgactca 840 aactgcaatc gtctcaccgc cgtcatttcg agccatccag ttttcgccag gacgcgccgt 900 agaaagccct atgggagccc ctcgcctgcc cgacgccacc acgtcttcac cacgccaacg 960 cttgagaacc cctgcaagtc gcagccatca gagccgtcct ggtgctggtg tcggtgtcgg 1020 gggaggggtc tctcaaccgg cgcccttcgg caagtaccct tctcgatcgc aacatttcct 1080 gcctgatgtc cgtccaactt ctagctcgtg cgccagtacg tcggcgactc aaactgcaat 1140 cgtctcaccg ccgtcatttc gagccatcca gttttcgcca ggacgcgccg tagaaagccc 1200 tatgggagcc cctcgcctgc ccgacgccac cacgtcttcg ccacgccaac gcttgagaac 1260 ccctgcaagt cgcagccatc agagccgtcc tggtgctggt gtcggtgtcg ggggaggggt 1320 ctctcaaccg gcgcccttcg gcaagtaccc tcctcgatcg caacatttcc tgcctgatgt 1380 ccgtccaact tctagctcgt gtgccggtcc gtcggcgact caaatcgcaa acgcctcgac 1440 gctgtcagcc accgtcccta gtcaactcgc taatgatgtt caaccccaac gccccggctc 1500 cgtggaccgt cgtgactcct cactgctaac catctactac cagaacgtta gaggactaag 1560 aacgaaaacc aatcaactgc acctgtcgct gagctcgtgt gattacgacg tgatcgcatt 1620 ttctgaaacg tggctcaatg cgaagattgt ggactcagag ctttcttcgg attatacctt 1680 ttatcgtgcc gatcgtagtc gtgaaacaag cgtcttggag agaggtggcg gcgtcttgat 1740 cggtgtcaaa aaacgttttc acccggaacc cattcatttg cggggcggtg agaagcttga 1800 gcagatcgca gtgcgtatcc ccttgcctgg aaagacgtta ttcgtctgct gtgtttacat 1860 tcctcccaac tcagatgcaa ctatctacac tcagcactca tcgtgtgttc aacaactgtg 1920 cgatctggcg caaggtcctg atgaagtcat cgtcgttggc gactacaacg ttccgaatct 1980 cacttggcat ttcgacgaag acatcaacgg gtacctgccc agcggccaat ctactgaaca 2040 ggaagttgct ctgattgagg acgttcttgg tactggcctc cagcaaatgt ttgatcttcc 2100 gaacgcaaat ggacgattac tcgatctggc cctcgtaagc gactcgtctc gattccaact 2160 cattgaacca cctcgggcca ttctcaaggt tgacgcacat cacaggccgt ttatcctagt 2220 tctcgaaacg caatcaacgg ggaccgctgc cgaacaatcc ggaacgtact acgactttgc 2280 taactgcgac accatggcgt taaacaactc tattgcaagc ttggactggg ctaatctgct 2340 actcgttgga tctgttgatg acatcgttga gcgattctac gacaagctaa atgaaattat 2400 ccgggaggtt gtacctctaa agcatcgacg tccgcaccca aacagccacc agccttggtg 2460 gaatcgagag acccgtaatc ttcgcaacag gctgcgaaaa gcaaggaaac gtttccttaa 2520 acatccgacg gaaatgctgg gcagggagtt ggcggatcga gagttgcact acgagatgag 2580 cttagaagct gccttccgag gatacatcag tggcatcgaa tcgaacctga agcgtgagcc 2640 gaaatccttc tgggcattcg tgaagaagcg gaagcaagac aaagggattc cgcaggacat 2700 ggtgtatcgt gactcaagcg cctcaacgcc agacgattct gtacaactgt ttgcagattt 2760 tttcaaaacc gtttacagcg ccaatcaacc tacaaccagc aacgagtccc tggacaagat 2820 cctgccgcac gagttacaca tgcccttggt gtgtctcacg gaggaagacg tgcgcaaggt 2880 cttggcctct accgacgcct ccaaaggacc tggcccggac tgtatccctc cgtccattgt 2940 aaagcagtgc gcagctacac tagcaaaccc gctcgccatt atcttcaacc gctccctcga 3000 atcgggtgtg ttccctgcta aatggaaggt agcttcgatc acgccaattc acaagtccgg 3060 gaacgtgcac aacgtggaaa attatcgttc gatctccatt ttgagctgcc tacccaaagt 3120 gttcgagaaa attgtgcatg agtttttatc ccgtgccgtg caaccaataa tatctgcctg 3180 tcagcatggt ttcatggcca agagatctac gacaacgaac ctgatgacct ttgtcaccga 3240 cgtcttaagg agtatggaga gtcgctgcca ggtggatgct gtctatgtgg attttgcgaa 3300 ggcgttcgat agagtgccgc accagcttgc cgttgaaaaa ctgcgaaggc ttggactccc 3360 tgaatggatc acgacttggc tgcactcgta cctaacttct cgttcagcct acgttaaact 3420 cggtggctcg aagtctgggt tgttcatgat cccgtctggt gtcccgcagg gtagtcatct 3480 gggcccacta atttttcttc tgtttatcga cgatctctgc cactggatca agaacgacaa 3540 ggtcttgtac gctgatgact tgaagttcta tcgcacaatt acttcaccgc tcgattgcct 3600 ggctctccaa agtgacattg attcgctcat gaggtggtgc gatcaaaacg ggatggaggc 3660 caacgcaaaa aagtgtcaag taattacatt cgaccgtaag ctgcgcccaa tcacgcacgc 3720 atacatgatg ggaatatcct tgctagaccg ggtgtgttct gttaaggatc ttggcgttac 3780 tcttgacaga aagatgactt tcaacgagca tatcacttcc gtcacggcta aggcgtttgc 3840 tacaattggt ttcatccgaa gaaatacgac tgccttcaaa gacatccacg ctctaaaggc 3900 tctgtacagt gccctagtca gaagccagtt ggagtatgct gcccaggtgt gggcaccata 3960 tcacaatgtt cacatcgcca ggttggagag ggtgcaacgc gcgtttgtgc gactagcatt 4020 acgcacgctg ccgtggagga atccgccgga gcagacatca atcgaggatc gccgtcgtct 4080 gttggggctg gacacgctgt tgaagaggag acagaggatg cagcagctgt tcatcttcga 4140 tcttttgacg aaaaggctgg actgtcaaaa attgtctcaa cttctgatcg tttatgttcc 4200 accacgtaat cttcgtcgaa atccgtctat gtttcgtcta tctgtgcacc atactgtgta 4260 tggacacaac caaccatttg atgtatgctg tcgtttgttt aatgtagttt ctgaccgttt 4320 tgttttaaat atgacgaaag atagttttaa gaaattaatt agctgttagg aaacagtctg 4380 ttcaagacga agataaataa ataaataaat a 4411 // ID TABOR_DA-LTR repbase; DNA; INV; 451 BP. XX AC . XX DT 13-OCT-2005 (Rel. 10.1, Created) DT 05-MAR-2011 (Rel. 10.1, Last updated, Version 2) XX DE TABOR_DA, an endogenous retroviral element from Drosophila DE ananassae - LTR sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; integrase; TABOR_DA-I; TABOR_DA-LTR. XX NM TABOR_DA-LTR. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-451 RA Gentles A., Kapitonov V.V. and Jurka J.; RT "TABOR_DA, an endogenous retrovirus from Drosophila ananassae."; RL Repbase Reports 5(10), 340-340 (2005). XX DR [1] (Consensus) XX CC 451 bp LTRs corresponding to internal region deposited as CC TABOR_DA-I. XX SQ Sequence 451 BP; 152 A; 104 C; 80 G; 115 T; 0 other; tgtagtatgc acatatattg aatacccact gtaccgaaga tacttaaagg gtacacactg 60 taacactttg ttgcagtgtt tacatatttc aaaatcccgg tcataactcc taagtggccg 120 aaatatgatc cgatcgttat ggattagccc acacacggta aatgctagcg tcggcataat 180 tgcaacgaac ggacagcata tacatacctc tcgcgcttac acttgagagc gcatagcaat 240 atacataagt atatagatac gtagccgtct ctctgccgct gcctgcgagc agcgcagcta 300 tgagacttag ccttagttag acttaaagaa tattgtaaaa gagcagagac atttcgatcg 360 ttggagcggc accattgctg ctccaataaa taaactccaa taaataaacc ccaaacatat 420 ttcaatacaa aacaaaacca aacttatttc a 451 // ID Gypsy-252_AA-LTR repbase; DNA; INV; 130 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-252_AA_; KW Gypsy-252_AA-I; Gypsy-252_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-130 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1106-1106 (2011). XX DR [1] (Consensus) XX SQ Sequence 130 BP; 37 A; 27 C; 19 G; 47 T; 0 other; tgtatatgtg attgtagcct ttacttccca tttgtattct gtatacccga tgtctataat 60 aaactagtct atactgtgca agcaaactgt ctacacgttt tctttagacg atccgtaata 120 gatcacaaca 130 // ID BEL-42_CQ-LTR repbase; DNA; INV; 468 BP. XX AC AAWU01000931; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-42_CQ_; KW BEL-42_CQ-I; BEL-42_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-468 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 238-238 (2011). XX DR GenBank; AAWU01000931; Positions 26398 25931. XX SQ Sequence 468 BP; 127 A; 111 C; 133 G; 97 T; 0 other; tgttcgggcc gaaagctgtc agcccgaaga aaattttaca actgagcagt gtccaatccc 60 acggctactg cgatcgacga caaagctggc aaccgcgcgc cgttgacgac ggtgcgagaa 120 cgagagagaa ggagagaacg acgcaacgaa cgcactcgcg tgtggagagt tttttggcgc 180 gcaacgaacg agttagcgcg atcgtgcgag aagctgagac aagttcgccg caagcagccg 240 aagtgatcga acggaagtcg ccgccgcgga tcgcgagttt tctcggaaga aaatctttag 300 ttttaaaatt agttgctaaa tagatgtagt caagagaaac tcgagtgttt tagtggcttg 360 cttgcgataa tccgatcctc ccagtgaaat tcccggttcc cccagtccgc cggaagaagt 420 gttgtgtttg ctcgagaaat tacagtccgc tcgaatcgaa actggaca 468 // ID Copia-19_DPu-I repbase; DNA; INV; 4391 BP. XX AC scaffold_98; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_DPu_; KW Copia-19_DPu-LTR; Copia-19_DPu-I. XX NM Copia-19_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4391 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 701-701 (2010). XX DR Genome; scaffold_98; Positions 367467 371857. XX CC Positions [1669-2193] - Integrase core CC 'AAATG' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 388..4305 FT /product="Copia-19_DPu-I_1p" FT /translation="MWLRISAQHLKNAADNQHALEQRFFEYRFQPEHDVMS FT HITEIETIANQLRDINAPVTESQLMTKIICTLPPSYRGFMSAWDSVPVKDR FT TIDTLTSRLLKEESMTKQWSRGKPDSQDAAFFAHNFPSFGDNSRQTPTPRG FT RGRGRFGRGSSRGDQANKRQHPYRFCTYHKCNIPGHTIEVCRKRLRDEKES FT RNNHSQAAPAVASDANKESTKEAVATDEDSAYMSATCFLSRRAEDWFADSG FT ATGHMTDQRSFFTSFTPVTLGTWTVKGIGSSRLYVCGHGSINFLTTVNGIK FT RTLTVDNVLYVPGLGTSLLSIAAVTDVGLSVHFIETQVSFSKDKTIVMMGE FT RIGKSLYHLAIIPLLCDANQPKFSACFAVPSASSMAIWHQRLAHTSYKTII FT KMASNNVVDGLTIPADKVVPDHPCVGCVAGKMHRSPFPIGRTRANQVGQLI FT HADMCGPMHVTTPSGARFFVIFTDDFSGWRQVYFMKQKAEVADFFKEYVSL FT LRSETNNLVHTLRADNGGEFISTSFKTWLSEKGIRFESSAPYTPEQNGVAE FT RANRTIFEAGRSLIHARHLPLELWGEAIACAVYVLNRVSNSISPVTPHQKW FT YGLKPNVSHLRIFGSIAYIHVPKADRRKLDAKSQKCFFVGYSLTQKAYRFW FT DPIGRRIKISRDVIFDEQLNDISTLELTFPTPKDNNLPQLFFRQPLPLTTP FT EPQSIPTTLQENFLVPQVDGEIETAATDTQPEPPHETIDQTNPSGVDPPAN FT TQQTSPVPTRVSPYPLRIRQPRRQWQSLKSVTFQDEIYEPDSYVEAMQTAE FT AESWKVATKDEYDSLISNKTWSISQLPPGRTAIMTRWVFTFKPGVRGTAPR FT FKARLVAKGFSQRSGIDYGETFSPVVKYDTLRVILSFVAANNLEMSQLDIK FT TAFLYGELDEEIYLQQPEGFVVAGKEKLVCRLHKCLYGLKQASRVWNATFD FT TFLQKFGLRPSDADPCLYLRHRQEEFVSVAIWVDDGLVCSNNESLVQEIIS FT YLSQHFEMRHGPANHFVGLSISRNRSDHTLFVSQPDYIRKILRRFHMQDCL FT PKDLPANPGSRLQKKTGESPSNQFPYREAVGSLLYLGLASRPDISFAVGQV FT AQFCEDPGADHCSAVRRILAYLKGTMNHGIRYGSTKSGLVGYCDSDYAGDI FT DSRRSTSGFLFLLNGGPVAWSSRRQSCVALSTTEAEFVAASEATREGIWLG FT RLMKDIKPGRKGPIDIMCDNKSAIDLIRNPVHHQRSKHIDVRFYFVRERQE FT AGEIDVQQISTIYQLADPLTKPLANPRFASLRESIGIVPVPTDLI" XX SQ Sequence 4391 BP; 1236 A; 1157 C; 957 G; 1041 T; 0 other; ggttatgggc ccagtacctt caaccttttg aaaaacaaga tgactaccat gcgagatgtt 60 agcaatattg ctaaattcaa cggtcagaat cttccaacat ggaaattagg atgttggatc 120 ctattccagc aacacaatct ggtgaagttg gttatcggag aagaaacact tcctgttgag 180 gtacattata taatcatata gtacgcaatc caacagatga tcattttgtt ttaatttatg 240 tagactaaaa atgctgatgg gatagtcaca aacgctgctg ccatagccac atggcatgag 300 aaagaccttc ttgctcgcag ctacttcatt gcaacaattg aaattcccca gcaacgtact 360 ttagtcaact gtactacagc acatgaaatg tggctgcgca tttcagccca acacctcaag 420 aatgctgctg ataatcaaca tgcactcgaa caaaggttct ttgaatatcg ctttcaacca 480 gagcatgatg tcatgtctca tatcaccgaa attgagacca tcgctaacca gcttcgagac 540 atcaacgccc ctgtcaccga aagccaactt atgacgaaga tcatctgcac tcttcccccc 600 agctaccgtg ggttcatgtc agcttgggac agtgttcctg taaaagacag gacaatcgac 660 accctgacgt cccgcctact aaaagaagag agcatgacaa agcaatggag cagagggaag 720 ccagactctc aagatgctgc cttcttcgct cacaatttcc cgtccttcgg agacaactct 780 cgtcaaacac ccacaccacg tggtcgagga cgtggaagat tcggtcgagg ctcatcaaga 840 ggtgaccaag ccaacaagag acaacatccg tacaggtttt gcacctacca caagtgtaat 900 atccctggcc acaccattga agtctgtcgc aaaagattga gggacgagaa agaaagtaga 960 aataatcatt cacaagcagc accagcagtc gcctctgacg cgaacaagga atcaacaaag 1020 gaggccgttg caactgatga agactcagcc tacatgtcag ctacctgttt tctctcacga 1080 cgcgccgagg attggtttgc ggactccggc gctaccggac acatgacgga ccaacgttcg 1140 ttcttcacgt catttacccc agtaacactt ggcacctgga ccgtaaaagg gatcggttca 1200 tctcggctct acgtgtgtgg acatggatct atcaattttc tcaccacggt caacggaatc 1260 aaacggaccc ttacagtaga taacgtgctc tatgttcctg gcctagggac tagtctactg 1320 tctatagctg cagtaacgga tgttggcctg tcagtccact tcattgagac tcaggtctcc 1380 ttctcaaaag ataaaacaat tgtaatgatg ggtgaacgca tcggtaaaag cctctaccat 1440 ctagcaatta ttccattact ttgcgatgcc aatcaaccta aattctcagc atgttttgca 1500 gtaccatcag catcctccat ggccatctgg catcaacgac tagcgcacac aagttataaa 1560 acaatcatca agatggcctc caacaatgta gtcgatggac tgactatacc agctgacaaa 1620 gtcgtccctg atcatccttg tgttggttgt gtagcaggga agatgcatcg ttcgccattt 1680 ccgatcggga gaacaagagc caatcaagtt gggcagttga tccatgcgga tatgtgtggc 1740 ccaatgcatg tcactacacc gagcggagcg agattctttg taatattcac agacgacttc 1800 agcgggtggc gtcaggtcta ctttatgaaa cagaaggccg aggtggcgga tttcttcaag 1860 gagtacgtca gcctattacg cagcgaaaca aacaatctcg tccacaccct cagagcagac 1920 aacggcggcg aattcatcag cacctcattc aaaacttggc tgtcagaaaa aggaattcgc 1980 ttcgagtcat ccgcaccata cactcctgaa cagaatgggg tagcagaaag agctaatcgt 2040 accatcttcg aagcaggacg cagccttata catgccagac atctgccact tgaactatgg 2100 ggagaggcga tcgcctgcgc cgtttatgtc ctgaatcgcg taagcaacag catctcgcca 2160 gttacaccac accaaaagtg gtacggcctt aaaccaaacg tctctcatct gagaattttt 2220 ggctccatcg cgtacatcca cgtacctaaa gccgaccgac ggaagctgga tgccaagagc 2280 caaaaatgtt tttttgttgg gtattccctc acccagaagg cttatcgctt ttgggacccg 2340 ataggaagac ggatcaaaat tagcagagat gttatcttcg atgagcagct caacgacatc 2400 tcaacacttg agctaacttt tcctactccg aaagataata atcttcccca attgtttttc 2460 agacaacccc tgccattgac tacgccagaa ccacagtcaa taccaactac acttcaggaa 2520 aatttcctgg tgccccaggt tgatggggag attgagactg cagctactga tactcaacct 2580 gagccacccc acgagacaat cgatcaaacc aatccttcag gagttgatcc accagcaaac 2640 actcaacaaa cctccccagt tccaacgcgc gtttcaccgt acccattacg gatacgccaa 2700 cctagacgtc aatggcagtc actcaagtcc gtcacattcc aagatgaaat ctacgagccg 2760 gacagctacg ttgaagccat gcagacggcc gaagcagaat cgtggaaggt ggccacgaag 2820 gacgagtacg actcattaat ttcaaacaaa acttggtcaa tttctcagct acctccagga 2880 cgaacggcca tcatgacacg atgggtcttc acattcaagc ctggtgtccg tggcacagct 2940 cctcgcttca aagcccgact ggttgctaag ggcttctcgc aacgctcagg gattgattac 3000 ggagagactt tctctccagt cgttaaatac gacacactcc gagttatact gtcctttgtt 3060 gctgccaaca accttgagat gtcacaactg gacatcaaaa cggccttcct ttatggagaa 3120 ctggatgaag aaatttactt acagcagccg gaagggttcg tagttgccgg aaaagaaaaa 3180 ctggtttgcc gccttcacaa gtgtctgtac gggctcaagc aagcctcaag ggtttggaat 3240 gccacttttg acacttttct acaaaaattt ggtctacgcc cgagtgatgc ggatccttgc 3300 ctctacctac gtcatcgcca agaagaattc gtgtcggttg ccatatgggt agacgatggc 3360 ctcgtctgca gtaacaacga gtcattggtc caggaaatca tcagctactt gagccaacac 3420 tttgaaatgc gccacggtcc tgccaatcat ttcgtcggcc tttccatctc aaggaatcgc 3480 tccgatcata cgctgttcgt ctcacaacct gactacattc gtaaaatatt acggcgattc 3540 cacatgcaag attgcctccc aaaggatctg ccggccaacc ctggaagtcg cctccaaaag 3600 aaaactggag aaagcccgtc taatcaattc ccatacagag aagcagtggg gagtttgtta 3660 tatttaggtc tagcctctcg acctgacata tcatttgctg ttgggcaggt cgcccagttc 3720 tgtgaagacc caggagccga tcattgttca gcagtccgtc gcatccttgc ctaccttaaa 3780 gggacgatga accacggaat ccgctacggc tcaacaaaaa gtgggcttgt aggatactgt 3840 gactctgact acgccggcga tatagactct cggagatcta cttcaggatt cctcttcctt 3900 ctcaacggtg gaccagttgc ctggagcagc cgcaggcagt cgtgtgttgc cctgtcaacg 3960 actgaggccg aatttgtcgc tgcgtccgaa gcaacaagag aaggaatatg gcttgggcga 4020 ctcatgaaag acatcaaacc cggacggaaa ggcccaatcg acataatgtg tgacaacaag 4080 tccgccattg atctcatcag gaacccggta catcatcaaa gaagcaaaca tatagatgtt 4140 cgcttttact tcgttcgtga acgacaggaa gctggggaga tcgacgtcca gcaaatctcc 4200 actatttacc aattggcaga ccctctcact aaacccctgg caaaccctcg ctttgcctca 4260 ttgagagagt caatcggtat tgtacccgtt ccaactgact taatctaatt ttcttttctt 4320 tgaatgcaga gggagtccaa tacactgcca tgtatttccc ctctcatcaa gtttacatgt 4380 tcgagaggga g 4391 // ID SAU3A_TR repbase; DNA; INV; 388 BP. XX AC AF442688; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Trichobilharzia regenti Sau3A repeated sequence. XX KW SAU3A_TR; Repetitive DNA; tandem repeat. XX OS Trichobilharzia regenti OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; OC Trichobilharzia. XX RN [1] RA Hertel J., Hamburger J., Haberl B. and Haas W.; RT "Detection of bird schistosomes in lakes by PCR and RT filter-hybridization."; RL Exp. Parasitol 101(1), 57-63 (2002). XX DR Genbank; AF442688; Positions 1 388. XX SQ Sequence 388 BP; 103 A; 85 C; 97 G; 103 T; 0 other; gatcaacccg taaaacataa ctaatcctta caaccaaaac tatagaaagt taatctgatg 60 ctcgctagtg acttgctaca ggttggagtt tcgtgagttc tagtgagaag tcgtgaccag 120 tgaggtccat tagatttcac gtgtgtggat ggatgtccca ccgccgccaa tggttgggtc 180 gcgctaaatc acgaattggt agaagttaga attcacgagc cattgaacgg gttccagtgg 240 ctgaatggta tacgagcttg ccttgtaacc ggaaggcctg ggttcgaaac ccagtgggtg 300 cgtactgccg atgattccca aactagatga aacagcagct gtccagtact ccctggtttt 360 caatggttct ctaatgatat caatccgt 388 // ID CR1-25_HM repbase; DNA; INV; 5077 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-25_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5077 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1853-1853 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1161..4517 FT /product="CR1-25_HM_1p" FT /translation="MSTHCKDCFKKINKNHHYLQCYHCLNLFHKKCCRLND FT IEFRILKNSKTSWFCISCREKIFPFSTLSDKELHLQTINDKVIHINEFENN FT ISSFPNTIQXNLYNELNDFISHQLNNNNDNDVNLEEYPINCKYMDIESFNS FT SNIDKNNFSLFHLNIASLVKNFDELNSLLSLINHKFSIIGITETKLKHDIL FT PTIPLNLTGYVYEHTPTKSYFGGALLYISNTLIYKLRYDLNMLKPKLLESV FT FVEIIIPKKSNIIIGCIYRHPSMDISEFIKSYLIPLFEKLSMEKNKTTFLI FT GDFNVDLLLANSDNSVSEFLDTIALNNFLPFISLPTRITNHSKTLIDNIFS FT NATSTNIFAGNFTSTVSDHLPQFLIHPLYRTKINTRKSCIFRRNLKQINCV FT ELCSDFSKISWINVINVNEGKTNKSFEEFFSSFNSLLDKHAPLKKVSNRCL FT SRGLKPWITRGILTSIKMRNKIFKKFLKAKNPNNKLNLEKSYKAYRNLLVT FT LTRRSKKNHYSKFFSDNAKNLKSTWNGIKNLLNISSRVNSSPSCLSSTNSM FT IXDPIKISESFNSFFVSVPEDLQNKIHSSNTDFNKYLKNPNLNSMFIKPTD FT KNEVLSIIHSLNDSKASGPNSIPIPIFKALANDISPVLSELFNLSFTTGVF FT PDILKTASVIPIHKKDSKLECNNYRPISLLSNVSKLLEKLMYSRIYNFLNN FT SACFHIRQFGFRSNHSTSHALISITEMIRGAIDSGSFACGVFIDLQKAFDT FT VDHNILIKKLNYYGIRGIANHWFSSYITDRTQFVSINGFESTHKVIKYGVP FT QGSVLGPLLFLLYINDLSCSLKNSMIHHYADDTNLLYINKSLTTLCKKVNQ FT DLHRLCNWLNSNRISLNVNKTEYIIFHSPQKTIDHIKIKINGKKLFPSKYI FT KYLGIYLDENLSWYNQLYHLKKKLTRANSMLSLIRYYVPIQTLRMIYFSLF FT SSHLCYCCIVWGQKRNFLLNNIASLQRASIRIMTFSPMRSDIEKKFSELKI FT LKLHNLVKFYNCLFVFDFLQNKLPSPFSNFFIKTNEVHNHHTRNTENSKLY FT YSFFXTIKYGKFSIKQQCISQWNQCIPALXKKSKKKYPLVTNFLLLNRIQF FT KNILLECISL*" XX SQ Sequence 5077 BP; 1819 A; 745 C; 529 G; 1971 T; 13 other; tcagtggtag gtcttcgcaa atggcggtca cgttttttta acggtaaaag agaaaaagaa 60 aaaagaatag aaaagataaa tataaaaatt aaagataaaa ccaaatcata ataacaagta 120 aataattttt tgttttatat gcataaataa tttgatagcg gtcttaagat cgtctttaaa 180 caaacgaaat tgaaaagtca aaacaaaagg aaaggaagaa aaaaaaaaaa aaaagaaaag 240 aaaaagaact tcgaattaaa aaaaaaaaaa ggaacactta aaaaaaaaaa acttaaaaaa 300 aaacaaaaaa aattaaaaaa aaaaaaaaga acaaaaagga agtaaacaaa taaaataaaa 360 attaaaaagt aaagagaaaa aaaaaaggaa aaaaaaaaaa aaaaagaaga agaattaaat 420 aaactatgaa agacgaaaaa acaaagaaga aaaagtatta gtagtgccag aagattgtag 480 tgaaacttta raagaattta gtttgtttat cttaaaatac gcttttaatt taatgtaatt 540 attcttattt ttgaatatta ttattttttt ttttctagct tttcttgctt gttttcatta 600 attttttctt tttttcttat actgtttatt attatatata cttattattg tttttattat 660 tgttattata atagttatta ttgctactag tattattact attattattt ttatatatct 720 atatagtttg ttagcttagt tactatagtt ttacttatta acttataata ctaagcataa 780 acttaatart ttattattat attttccact tattataact ataatatttt acttataatc 840 atttataatt tattattatt ttaatataaa ttgttttatt ttatacttac aacttatcat 900 ttaaccaatt ataacttttt tttaatatat atttacgtta ttcttctttt actwtattaa 960 tagtattttt tcattattta ttttaactgt taaataaata ttatttttgt tttactattt 1020 tatcaattga attatattta tattgttaat aaatactatt attattttta ttatttttgg 1080 ttatattgct gttcatatta ctctttgttt ccttatttca gttcattcaa tcattttatt 1140 atataaactg taccacttaa atgtctactc actgcaaaga ttgcttcaag aaaattaata 1200 aaaaccatca ctaccttcaa tgttatcatt gcctaaacct ttttcataaa aaatgttgtc 1260 gtctgaatga tattgaattt agaatattga aaaactctaa aacatcttgg ttctgtattt 1320 catgtaggga aaaaattttt ccattcagta ctttatcaga taaggagctg cacctgcaaa 1380 caataaatga taaagtgatt catattaatg aattcgaaaa caacatatca tctttcccaa 1440 acaccataca gyctaatctt tataatgaac ttaatgattt tatttcccat caacttaata 1500 acaacaatga caatgacgta aacctcgaag aataccctat taattgtaaa tatatggaca 1560 tagaaagttt taactcttct aacatcgaca aaaataattt ttctcttttt catttaaata 1620 ttgcttctct tgttaaaaat tttgacgaat taaactcact tttatctctt attaatcata 1680 aattcagtat tattggcata actgaaacta aactaaaaca tgacatttta ccaactatcc 1740 cattaaatct taccggttat gtttatgagc acactccaac aaaatcttat tttggaggag 1800 ctttacttta tatatcgaat acattaattt acaaacttcg atatgatctt aatatgctta 1860 aaccaaarct tttagaatct gtgtttgtgg aaatcataat tcctaaaaaa tcaaacatca 1920 ttattggctg tatttatcgc catcctagta tggatatttc agaatttatt aaatcatatt 1980 taattccact ttttgaaaag ctatctatgg aaaaaaacaa aactactttt ttaataggcg 2040 attttaatgt cgatctttta ttagcaaact cagataattc tgtctctgaa tttttagaca 2100 ctattgcttt aaataacttt cttcccttta tttcattgcc aacaagaatc acaaatcact 2160 ccaaaactct aattgataac atattttcta atgcaacttc tacaaatatt tttgccggta 2220 attttacttc tacggtatca gatcatttgc cccaattttt aatacatcct ctttatagaa 2280 ccaaaattaa tactcgtaaa agctgcattt ttcgtcgaaa tytaaaacaa ataaattgcg 2340 tcgagttatg tagtgatttc tcaaaaatta gctggattaa tgtaatcaat gtcaacgaag 2400 gtaagactaa taagtcattt gaagagtttt tctcaagttt caattctctt ttagataaac 2460 atgcaccatt gaaaaaagtc agcaatcgyt gcttatcaag gggtctcaaa ccttggataa 2520 cacgaggaat cttgacttct attaaaatgc gaaataaaat atttaaaaaa tttcttaaag 2580 caaagaatcc taataacaaa ctcaatctag aaaaatctta taaagcatat agaaacttac 2640 ttgttactct cacccgtcgt agtaaaaaga accattactc taagttcttc tctgacaatg 2700 ccaaaaattt gaaatctaca tggaatggta ttaagaacct tttaaacatt agttcgagag 2760 ttaactcatc tccttcatgc ttatcttcta ccaactccat gatayacgac cccattaaaa 2820 tttcggaatc ttttaattca ttctttgttt cagttccgga agatttacag aataagatcc 2880 actcatccaa tactgacttt aataaatatc ttaaaaatcc taatcttaat tcaatgttca 2940 taaaacctac agacaaaaac gaagtattat cgattattca ttctcttaat gacagcaagg 3000 cttctggacc taacagcatt ccaataccta tttttaaagc tcttgcaaat gatatctctc 3060 ctgtactttc tgaactattt aatctgtcat tcacaaccgg tgtattccct gatattttaa 3120 aaactgcttc tgttatccca attcataaaa aagactcaaa acttgaatgt aataactaca 3180 gaccaatttc attattatca aatgtcagca aacttttaga aaaacttatg tattctcgta 3240 tttacaactt tttaaataat tctgcttgtt ttcatattcg acaatttggg tttcgctcaa 3300 atcactccac ttctcacgct cttattagca ttactgaaat gattcgtggt gcaattgaca 3360 gtggttcttt tgcttgtggt gtattcatag atcttcaaaa agcttttgat acggttgatc 3420 acaacattct aataaaaaag ttaaactact acggaatacg tggtattgct aatcattggt 3480 tttcttcata tattacggat cgcactcagt ttgtttcaat taatggattt gaatccacac 3540 ataaagttat taaatatggc gttccacaag gttccgttct tggaccgttg ttgttcttat 3600 tatatataaa tgatttatct tgttcgttaa aaaattctat gatacatcat tatgctgacg 3660 ataccaatct tttatatatc aacaaatccc tcacgactct ttgtaaaaaa gttaatcaag 3720 atcttcatcg cttatgtaac tggttaaata gtaaccgtat ttccttgaat gtcaataaaa 3780 ctgaatacat tatttttcat tcccctcaaa aaactattga tcatattaaa attaaaatta 3840 acggtaaaaa actttttcca tcaaaatata taaagtatct tggaatatat cttgatgaaa 3900 acctatcttg gtataatcaa ctgtatcatc taaaaaaaaa acttacacga gctaacagta 3960 tgctttctct aattcgttac tatgttccta tccaaactct tcgaatgatt tatttttctt 4020 tattctcttc tcacctttgc tattgttgta ttgtttgggg tcaaaaaaga aactttttac 4080 ttaataacat tgctagttta caacgcgcat ctattagaat aatgacgttt tccccaatga 4140 gatctgatat cgaaaaaaaa ttttctgaac taaaaatcct aaaactccat aatcttgtaa 4200 agttttacaa ttgtcttttt gtgtttgact ttctccaaaa taaactacca agcccttttt 4260 caaatttttt tattaaaaca aacgaagttc ataatcatca caccaggaat acagaaaata 4320 gtaagctata ctattctttt ttcarcacaa tcaaatatgg taagttttcc ataaaacaac 4380 aatgtatttc tcaatggaac cagtgtattc ctgcattaay aaaaaagtct aagaaaaaat 4440 atccacttgt tacgaatttt ttactgctca acagaattca atttaaaaat atattgttgg 4500 agtgtatttc tttataattt ctctctgtac trtagttgtc ttcaatatat aatctcttta 4560 acgaacaagt ggtttgattt ttattttatt tattatttac cwatcttata tttatttatt 4620 attgtgttat taatgttaca ttttatgtta ttattatatc taattatcat ctatgatttg 4680 tacatacaat ttatattgta ttataatatt cttaatatta ttggtacttt attrttattg 4740 ttattatttt gtattattac tattattatt acatatatca ttattatttg tttaacaatc 4800 ttttcttctt ttctctctta tttatatata cagatgttga gtttttcctt tttttttttt 4860 tttttttttt tttttttttt tgttttgttt actttagttt cctttttttt tttttttctt 4920 tctatttatt tattttttcg tgaaattttt atgtttttta ggtgtcactc cattgactag 4980 tttgtaacta tatgagtgac acctaaccaa tttatattaa tttacgatga aatttgtaaa 5040 ttttattgat atggttgaat aaaaaaaaaa aaaaaaa 5077 // ID BEL-68_AA-I repbase; DNA; INV; 2604 BP. XX AC supercont1.93; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-68_AA_; KW BEL-68_AA-LTR; BEL-68_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2604 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.93; Positions 113214 110611. XX CC 'GTTTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 15..2381 FT /product="BEL-68_AA-I_1p" FT /translation="MMSTPSQDTERTVYSCAGCHHPDDDEKDMVFCDHCQQ FT WYHLRCVGLSANEKDVSWSCEECLRRRADKEESPGLVLELGNLEREMEKEI FT RALETQKILHRKRLEHKNKLFLLRQQVENEKRDMELAYEMEQMKQLVADEE FT AHQKKREEALKQLQEKFEKVKRDSEQLELVARTIDKEAFGLRSGRRIEGRK FT FEQIEPGTKKPDGRKQKSCHSGRKTIDGESRRVVSGKKRSQRNSYRQKKQP FT YMGSEDSDSTNASNEGKYEENSDDSEDESSSNAERFLQDESSSLESRKSPK FT TTMRNPTKAQLSARQFLSKKLPTFSGRLEDWPMFISAYETSNTACGFSNVE FT NLARLQECLKGQALETVRSRLLLPSAVPQIIETLRMLYGRPEQLMRMLLSK FT VRKAPSPRADKLSSYIQYGVVVQELTDHLEATGLTAHLVNPMLIQELTEKL FT PANLQLEWVRYRRKAQSVTLRTLSKFLSRLVNDASEITSYCEAQGEATLSE FT EEEEDFRKYERKEQEGFLHTHIAEDNRTKEQQSYRKKTACRICGRYDHRVR FT NCETFKKLNLTERWEMVRKWRLCYSCLNEHGSVPCKLNFQCNIERCGEGHN FT TLLHIEQTGFMDCNESAPHSTSVVFRTMPVRLYNGKRTVDTIAFLDEGSSH FT TLVEKSLADELQAKGTPQQLRVTWTAGVTRLEKDSRNVELWISARGSSKRV FT HIKAAHTVHSLKLPKETVEIRKVINEYSHLRELQEAEYEQGVPQILIGLKD FT VHLCAPLESKIGKPDEPIAVRSKLGWSIYGPVSEVIKE" XX SQ Sequence 2604 BP; 844 A; 492 C; 693 G; 575 T; 0 other; ttcttgaaaa tgttatgatg tcgacaccga gtcaagatac tgaaaggacg gtttacagct 60 gcgcgggatg tcaccatccg gatgatgatg aaaaagatat ggtattttgt gaccattgcc 120 aacagtggta ccatttgcga tgtgttggtt tgtcagccaa tgagaaggat gtttcctggt 180 cctgtgaaga gtgtctgcgc cgtcgtgccg acaaagagga gtcacctggt ctagtattag 240 agttgggaaa ccttgagcga gagatggaga aggagataag ggcactagaa actcaaaaaa 300 tcttacacag gaaaaggtta gagcacaaaa ataagctgtt tttgttgcgg cagcaggtcg 360 agaatgaaaa acgtgatatg gagttagcgt acgaaatgga gcaaatgaag cagctcgttg 420 ccgacgaaga agcgcatcag aaaaaacgcg aagaagcatt gaagcagctt caggagaaat 480 ttgaaaaggt taaacgtgat tcggaacagt tggagctagt agcaaggact atagataaag 540 aagcattcgg attgagaagt ggaaggagaa tagaaggcag gaaattcgag caaatcgaac 600 cggggacgaa aaagccggat ggaagaaagc aaaaatcatg ccattcaggc cggaaaacaa 660 ttgatggaga atctcgtaga gtcgtcagcg gcaaaaaacg atcgcaaaga aattcgtatc 720 gacagaagaa acagccgtac atgggttccg aagacagcga ttcaacaaat gcatcaaacg 780 aaggcaaata tgaggaaaac tccgatgatt cggaagacga aagttcgtca aatgcagaaa 840 gatttttgca agacgaaagc tcgtcactag agagccgaaa gagtcccaaa accaccatgc 900 gtaatccaac aaaagctcaa ctgtctgcac ggcagtttct ctcaaaaaag cttccgactt 960 tttctggacg acttgaagat tggcctatgt ttataagtgc ctacgagacg tccaacacag 1020 cttgtgggtt ttcaaacgtc gagaatctcg ctcgacttca ggagtgcttg aagggccaag 1080 cactggaaac agtgagaagc agactacttc ttccgagtgc agtgccgcaa atcatcgaaa 1140 ctttacggat gctgtacggt aggccggaac agttgatgcg catgctgtta tcaaaagtaa 1200 ggaaggcacc atctccgaga gcggacaagt tatcatcgta catccagtac ggtgtagttg 1260 ttcaagaact tacggatcat ttggaggcaa ccggtttgac agcgcatctg gtgaatccga 1320 tgctgattca agaactgacg gaaaagctgc cagccaactt gcagctagag tgggtccggt 1380 atcgtagaaa ggctcaaagc gtaactctcc gaactctgtc caaatttcta tccagactcg 1440 ttaacgatgc cagcgaaatt acttcatact gtgaggctca aggtgaagca acacttagcg 1500 aagaagaaga agaagatttc cgtaaatatg agagaaaaga gcaagaagga ttcctgcaca 1560 cgcacatagc tgaagacaat agaacgaagg aacaacagtc atacaggaag aagacagctt 1620 gcaggatttg tggccgctat gatcatcgtg ttcggaattg cgaaaccttt aaaaaactca 1680 atttgacgga aaggtgggaa atggttcgaa agtggcggct ttgctattcg tgcctgaatg 1740 aacatggaag tgtcccatgc aagttgaact ttcaatgtaa tatagaacga tgtggtgaag 1800 gccacaatac actgttgcat atcgagcaga caggtttcat ggattgcaac gagagtgccc 1860 ctcacagcac gtcggtagtt tttcgcacga tgccagtcag attgtacaac gggaaacgca 1920 ccgtagatac tattgctttt ctggacgaag ggtcgtcgca taccctggtg gaaaagtcat 1980 tagcagacga gttacaggcc aaaggcactc cgcaacagct acgggtaact tggactgctg 2040 gcgtgaccag gttggagaaa gattcgagaa atgtggagtt gtggatttcg gcaagagggt 2100 cgtcgaagcg tgttcatatt aaggccgctc atactgtaca tagtttaaag ctaccaaaag 2160 aaactgtaga aatacgtaag gtaatcaatg agtattctca tctgcgcgaa ctgcaagaag 2220 cagaatacga gcagggtgtt cctcaaatac tgataggact gaaggacgtt cacctttgtg 2280 cgccactcga atcaaaaatc ggtaaaccag atgaaccgat cgccgtgaga tcaaagctag 2340 ggtggtcgat ttatgggcct gttagcgagg taatcaagga gtgaatggat attatcattg 2400 tgaaaatatt gtgaagtaat cacgtgttca atgcaaaaaa cgaacacagt ggaaactggt 2460 taaggcatgt gaacttaaaa tctgtaaaga agaaggcaag tacaaagcgg ataggtctgg 2520 tgtagaagtt ggtggtgctt gcgttcaata attgtaactc tggcgacagg gatgaatccc 2580 agctagaatt acggggaggg ggta 2604 // ID Gypsy-601_AA-I repbase; DNA; INV; 5291 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-601_AA_; KW Gypsy-601_AA-LTR; Ty3_gypsy_Ele156; Gypsy-601_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5291 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1335-1844] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 624..2210 FT /product="Gypsy-601_AA-I_2p" FT /translation="MKEPSSRLTKFRLALEEYDFSITYIPGKDNVLADALS FT RMTSDNLKEINKKVDQDVLVTTRSKSAKENSDGDPNNEMAGQHSLSGLSIE FT VIFNENKVIPEVRFDAPKHQIKINPAKTLIQLRRIVVMLKEMCKINKINEL FT VIKNTDPGKQFYNMIHTNNLNNGMPPLRIMDQKIKHIESKMEKDLIINDFH FT LLPTAGHAGIAKTLKNIQRRYFWSTMKKDISNFIKSCESCQKNKHIKPKNV FT PQIVTTTANSAFSKIYLDLVGPLVPSYEGHSYILTTQCELTKFITATPILN FT KSTKIVAKAFVENVILNYGVPDEIATDRGTEFMSELFTKICELLKISKLNS FT TAYHHQSIGALENSHKSLGNFLRIYAAGNPGEWSSWTKFYQFAYNTTTHLE FT TEKTPFELVFGKHCKLPSSLSDPPPILTPIYNLDDYSKLLKIKIQATQQAA FT HKSLTEAKTRRTLKLNTNTKELHYVTGQLLLIKNETGNKFSTVYEGPYPVV FT EDRGPNVAIRIKDKVDLIHKSRTKPFITNPSIL" XX SQ Sequence 5291 BP; 2003 A; 932 C; 1038 G; 1318 T; 0 other; ataagaacct tattgctatt tttgaaagac ttaggggagt taatcttaaa ttaaatcccg 60 ctaagtgcaa ttttttgcag caagagttgc tttatttggg acatttcata tcggaagaag 120 gagttcgacc tgatccttcg aagatagaaa gtataaaaaa ttggcctgta cctaaaacag 180 ccgatgaagt taaacgtttt gttgctttcg ccaactacta tcgaaagcac ataaaagatt 240 tttcaaggat ttgtattcca ttgaatcagt tgacaaggaa aggtgtagta tttgaatgga 300 taccagaatg tcaacaaagc ttcgaaatat tgaaaactaa ttttgttaac ccaccaattt 360 tggattaccc ggattttgaa aattctttta aattgcaaac tgatgcgtca gggtatgctt 420 tgggagcagt gctgtctaac cgaaaaaacg gaagaccagt agcatatgca tccagagcac 480 tgaacaaatc agaattaaat tatggaacta ttgaaaagga actattggct gtagtttggg 540 cgatcaagca tttcagacca tatctgtatg gaagaaagtt tgatttggaa acagaccaca 600 gaccattagt ctatttgttt tcaatgaagg aaccttctag taggctaact aaattcaggt 660 tagcactgga agagtatgat tttagtataa cgtatattcc aggaaaagac aacgtattgg 720 cagacgcctt atctaggatg acaagcgaca atttaaaaga aataaacaag aaagttgacc 780 aagatgttct agtgaccacg cgaagtaaat cagcaaaaga gaattctgat ggggatccaa 840 ataatgaaat ggctggccag cattcattat caggattgtc aatcgaagta atctttaatg 900 aaaacaaagt aatacctgaa gtaagatttg atgctccaaa acatcaaatt aaaattaatc 960 cagctaagac tctaatccag ctacggcgga tagtagtaat gctgaaagaa atgtgcaaaa 1020 taaacaaaat aaacgagctc gtaataaaga atactgatcc aggaaaacaa ttttacaaca 1080 tgatacatac caacaactta aataatggca tgccaccatt gagaataatg gatcagaaga 1140 taaagcacat tgaaagtaaa atggagaaag atctcataat aaatgatttt catttactgc 1200 ctacggcagg ccatgcaggt atcgcgaaaa cactaaagaa catacaaagg cgatattttt 1260 ggtcgactat gaaaaaggat atttccaatt tcataaagtc ttgcgaatct tgccagaaaa 1320 ataaacacat aaaacccaaa aatgtaccac agatcgtaac aacaacggca aatagtgcat 1380 tcagcaaaat ctatttagat ttggttggac cacttgtgcc tagttatgaa ggtcattcat 1440 acatactaac aacacagtgt gaattgacaa aattcatcac tgcgacacca atattaaaca 1500 aatcaaccaa aattgttgct aaagcttttg tagagaatgt tatattaaat tatggagttc 1560 ccgatgaaat tgctactgat cgaggaacag aatttatgtc agagttattt acaaaaattt 1620 gtgaactttt aaaaatttct aaattaaatt caactgctta tcaccatcag tcgataggcg 1680 cattggaaaa ctcgcataag agtttaggaa attttttgag aatctatgca gcaggaaatc 1740 ctggagaatg gtcatcatgg accaagtttt atcaatttgc gtataatacc acaactcatc 1800 tggagactga aaaaactcca tttgagttag tgtttggaaa acattgtaaa ctgccatcaa 1860 gtttaagcga tcctcctccg attctgacac ccatttacaa tttggacgat tattcaaaat 1920 tactaaaaat aaaaattcaa gcaacacaac aagcagcaca caaatcatta actgaagcaa 1980 aaacaagaag aactctaaaa ttaaatacaa acacaaaaga attacactat gtcactggac 2040 agttattgtt aataaaaaat gaaacaggga acaagtttag tacagtatat gagggaccgt 2100 accctgtagt agaagacaga ggaccgaatg tagcgattcg aataaaagat aaggtagatt 2160 taattcataa aagtaggaca aaacctttta taacaaatcc atcaatttta taaccaatat 2220 aatacagtaa aatattttta ggacgtgtgg cggatgagct cgctaccact tacgttttta 2280 ataataccaa acaaatatca ttccatttca cataaatgag tatctctaat gttagttctg 2340 tagagtcttg agaagactga gctggagata tgagatatca aaggcgatct acaatggatc 2400 acattagtta aacacattta catttcaaat tattagaatt aaaaataaac atgtatttca 2460 aactaaatgt tttacgtttc aaccattata aactaaatgt tttaaatttt caactataag 2520 ataaaaattt aaaaataaat aaatagggcg tgtggcggat acacccctga ctacgcctgt 2580 ggaacgcgat cggtagaagt agaagtcgaa cgacagaatg ttatgctgga taatcataac 2640 attcagccgt tgtagaacga tgaactacca aatgcttaca cagcagaaag tatgtgtcac 2700 gccactcagt aggtaggcta gcgtaaggaa atgatttttg aataaagcag atatatacga 2760 accaccgact ggacaacatc attctatcca aactaatcca gtcaatgcat ggcgatccta 2820 agccagtatt aagaccataa attagtgttt agcactaagt gaaagtattc tacactcaag 2880 tgtaatgaaa agttttgaaa aaagaagtgt ttgaggccga tatcaccgta acagttgtgt 2940 accagaatga acattacgcg tgacattaat tgcaaattaa atcgcatggg aaaaacttac 3000 tgaaacgatg ggttggtttt cggctgacga aatcgtcgcc ccaaccatca ccgcgtcagc 3060 accgaatggt caccacgtag cacaaaccat tgcactttgc gtaatggctg gcgtagccgt 3120 gggctacata ttggcaaaga gtattatcag gtgccaccgc cagcaaactg agcgggttgc 3180 cgaacgtacg gcacggctgg cgacgctccc agcttaataa gtgaaaacga gaaaccaacg 3240 taacaagtga aacatgaaaa agactagttt tctggtgctc cgttaatttt atttttcgtg 3300 tcaaatgtca aaaaactaaa attttaagag atttttacga ttacgcggta cacagtgact 3360 cgttgcatcg agaaacagta aagtgctcgt cagagaaaaa caatcggctt gataaaccaa 3420 tcgaaaagaa gaagttctcg tttcggacta accccgcaaa gagatgaaca atggcagaga 3480 aacggacgct gcacgaggaa actgatattg attagctacg aggaagtcca ggccaaacag 3540 caaataggtg aggccgtatt tggagttgca gcagcagcat tggcgaaaat cgagccacaa 3600 agcacccgga cgggatggcg cgagacgctg ttgtttgaat atcgagaact caaccagaca 3660 atggaaaatt attaacaaca acagagcagg gggagtaacc aaataaactg gtaaatttaa 3720 aaaaaaaaaa catatttaca caaaaaaaaa aatttatttt aggtaacaag gataacgaaa 3780 tcaaaacaac caaaaacatg aaaaactaag gaaaaagaag aagattaacg ataaacaaaa 3840 gagagaaaaa cagaaatgtg tgagtaaacc aaaatttttt aaaaaattga tatcttttga 3900 atggatagat tttttataat ccaagatgca attttaagat tgcatactag cttgaaaaaa 3960 tgtaatttcg gactagagaa aaaaaaatag caagattaaa gaattactaa aactcaaaga 4020 tagattgaga atcatagtaa aagataaaaa agtaagagta aatctatcaa aacaatatag 4080 agatctgaga gtgatcataa atgattgtat caaattatta aaaagttcaa acaaaaagga 4140 cctatccatt caaaagatta ttagtgaaga atcagactca gaggaggacc actcggaatc 4200 atcagacggg gaagaaacct tacccaagat ggctccaaaa ctagatttgg gcactgccct 4260 aaagctggta gataaattca acggggaagc tgaaaagctt tccagttttt ttgaaactct 4320 ggacctcctt aaagattata acgatggagt tccagaaatc gaaattttga aatttgtaaa 4380 aactcgattg actggtcccg cgcacggtgt aataaacacg gcggtgacca tggccgaagc 4440 caaaaggctt ctaaaaatta aattttctgt aaaatttagt ccgcaggcca ttgagtcgga 4500 aatggccacg atcaaacaaa acaagaaaac tttaacggac tacgggaagg agattggtga 4560 gctagcagca aagcttgcag ctgctcacgt ctccaacgga acattcgcag atgaggcggc 4620 ggctgaggcc atcgtacaac ccatcgcgat ccgaaacttc atgcagggtt tgaaggactc 4680 taggacgcaa tttttcctca aggcaaggaa tccaggcaca cttaccagag cgataagtga 4740 tgctctggag gtgcacacca atgaggagga aacggctatg tggatgcatg ccggcccctc 4800 agggttttat agaggaaact atcggggaag aagttctaat tttcggacaa gaagaggaag 4860 cagagcaaga ggtttcggtt atggcagagg ccgaggtaga ggcttccacc aaaaccagca 4920 acctcccgga aatagacaac aacccaacaa taataacaac tatccacaac aaaaccgtgg 4980 taacaacaac agaggtagac acggacatgc caatgtagct gtccaagaac ctcaacagcg 5040 accaaatcaa caacaggaag aagatgttgc gaatattatc gagctttttc gtgagtaatg 5100 tagggaaaag gctaccatcg ataacattca acatcaaaaa tgaacaggta ccatttattt 5160 tagacagtgg agcaagttgt tctattattt cagcacatct tgttccccgt gaaacagcaa 5220 ttgacaaaaa tgacaaaatt aaaataagag gaataaatgg ttcaactttg tcgtcaggaa 5280 gtattaatat t 5291 // ID Gypsy-28_CQ-LTR repbase; DNA; INV; 151 BP. XX AC AAWU01011844; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_CQ_; KW Gypsy-28_CQ-I; Gypsy-28_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-151 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 436-436 (2011). XX DR GenBank; AAWU01011844; Positions 13064 12914. XX SQ Sequence 151 BP; 43 A; 44 C; 18 G; 46 T; 0 other; tgttacatta attattactc gggttaccac acacgctaaa ctgtattccc acctacactg 60 aacactcttg attccctcct aaagtcattc tattaaagct cttgtatcga acagacgctc 120 aagcctttta ctctggacca catccgtaac a 151 // ID BEL-30_AA-LTR repbase; DNA; INV; 262 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-30_AA_; KW BEL-30_AA-I; BEL-30_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-262 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 601173 601434. XX SQ Sequence 262 BP; 60 A; 64 C; 53 G; 85 T; 0 other; tgtttccgtg tatgttccac ccagccctgt cgacctcacc tgcgtcgacg acacctgtca 60 gttttgcgcg ccgacttcga ttggaaactt tttcctgaga aaatgtgctt ttgtcaccat 120 tttccataac tgtaccaagt accgaagaga aaggacgcaa taaaagttga gtttgagtaa 180 ttctttattt tcggttcaat tttacgttta ttccggttcg aattgcccgt gttttcgtcg 240 gtccagtcca aggtccaaaa ca 262 // ID Gypsy-92_AA-I repbase; DNA; INV; 4284 BP. XX AC supercont1.321; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-92_AA_; KW Gypsy-92_AA-LTR; Gypsy-92_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4284 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.321; Positions 724345 720062. XX CC Positions [3232-3690] - Integrase core CC 'CAGTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 19..4254 FT /product="Gypsy-92_AA-I_1p" FT /translation="MAENGNENGRRNQEQVRENPPPIGAAAVAGAAAGVAA FT GAPLAMQSNFTIEPYDRHRMKWSRWVERLEGALLLFRVPDDLKRPMLLHYM FT GGENYDIVSDKMAPDKPQEKTYDEIVQLLETHFNPRPLEILENFRFKCRRQ FT GDEKVDESIDDYLIALRKLAITCNFGNYLNTALRNQFVFGLKDRGIQARLL FT EVHDLTLDRARDLAVSMEMSAKGGQEIQSRAKADVNLVEHPVNKSGKPKGK FT KPGNAAASQSHHSDSKQGSKKGACYRCGSTNHFANKCVHVKTVCNFCRMTG FT HIEKVCMKKKSACSTKSKSDFHQVEESVAECKQKKKEDVYMDEICGLYDAE FT SRVDKFWVAISVNDSKVRFEVDSGAPVAIMSVTDYEKLLPAVKLDPPDLSL FT VSYCGNSIKLRGMCTVSVQYAGKVHRLLLYVADNRKHPLLGRSWMKVLRLD FT VNKFYEEVHSVAYNGVKCTADESVKKLIARYGSVCADSMGKIKGLTAKLRL FT KPNAHPAYIKARPVPFSLRSAVEQEIEKLVKEGVLEKVNHSDWATPMVPVM FT KLNNKVRLCGDYKITVNPNLVVDEHPLPTIEELFANVAGGEKFSKIDLSQA FT YLQLEVDPDQREILTLSTHLGLYRPTRLMYGVSSAPAIWQRLMEEVLNGIP FT GVTVFLDDIRVTGPNDEVHLQRLEEVLNRLSNYGMRINLDKCVFFADKIEY FT CGYVVDRHGIHKVRKKIEAVQNMPTPENKEQVRSFVGLVNYYGRFLPNLST FT MIYPLNRLLRNDVPFEWSKSCEEAFRKVKQEMQSDSFLVHYNPELPLVLAT FT DASPYGVGAVLSHILPDGSERPIQYASQTLNETQRKYKQVDREAYAIVFGI FT RRFHQYLYGRKFLLYTDNEPVKQIFSETKGLPTMSALRMQHYATFLQSFDY FT TIKFRPTKQHYNADAFSRLPIATKRPDNVVEEVDMLETSIIETMPVTVDDL FT AKRTAADGSVKVLLQGLRNGKTVEAKDRFGIDQNEFSLQQGCIMRGIRVYI FT PPDLRMKVLNELHSTHFGSTRLKTLARGYVWWERIDRDIEDLVKNCASCQV FT TRPNPAKAPLHCWEPATQPFERVHVDFAGPFMGKYFIVFVDAYTKWPEVKI FT LRDITTATTINACREFFATYGIPCVLVSDRGVQFTSGEFQRFLQLNGIFHK FT MGAPYHPATNGQVERFIQTFKNKMKAMKCDKSRMHVELCNILLTYRKTIHP FT TTGKSPSMMLFNRQIRSRLDLMLPGPTFSEKVDPKVRTIPEGGRVAARDFL FT DQEKWKYGRVIEKLGKLHYMVQLDDNRVWKRHIDQLREVGPHLPASRNLRE FT MPVLPNLPQQPAATSNATRGNAVPIYQEASTSTEPVAIPASSRVEPASVSV FT SGPRPTLSVPSTVPSASKPTTEREPSEGSYQQTPRRSTRMVKAPKRLDL" XX SQ Sequence 4284 BP; 1166 A; 934 C; 1172 G; 1012 T; 0 other; attaatctgt cgacgaggat ggcagaaaac gggaacgaaa acggccggag gaatcaggaa 60 caagttcgag aaaatcctcc tccgatcggt gctgctgctg tcgccggtgc tgctgctggg 120 gttgctgctg gcgctccgct agcaatgcaa tcgaacttca ccatcgagcc atacgatcgt 180 catcggatga agtggtctcg gtgggtcgag cgtttggagg gagcattgtt gctgtttcga 240 gttccagacg atttgaagcg tccgatgctg ctccattaca tgggtggaga gaactacgac 300 atcgtttcgg acaagatggc tcctgacaaa ccgcaggaga aaacgtacga tgagatcgta 360 cagctgctgg aaactcattt caaccctcgc ccgctcgaaa ttctggagaa ttttcgcttc 420 aagtgtagga ggcaaggcga cgaaaaagtc gatgagtcaa tcgacgacta cttgattgca 480 ctgcggaaat tggcgataac gtgtaatttc ggcaattacc tcaacaccgc tctgagaaac 540 cagttcgtct tcggcctgaa ggatcgcgga attcaggcaa ggctcttgga agtgcacgac 600 ctcacgttag atagggctcg cgatcttgcg gtatcgatgg agatgtccgc aaaagggggc 660 caagagattc aatctcgtgc gaaagctgat gtgaatttgg tcgagcatcc ggtaaacaag 720 agtggtaaac cgaagggtaa gaagccgggg aacgctgctg ctagtcagtc gcaccacagt 780 gattcgaagc aaggttcgaa aaagggtgca tgctaccgct gtggcagtac gaatcatttt 840 gcgaacaagt gcgttcatgt gaaaactgtt tgcaatttct gccgcatgac gggccacatt 900 gagaaagtgt gtatgaagaa aaagagtgct tgcagcacga agagcaaaag tgatttccat 960 caagtggaag aaagtgttgc agaatgtaaa cagaagaaga aagaagacgt ttacatggat 1020 gaaatttgtg gcttgtacga cgcggagagc cgagtggaca aattttgggt cgctatctcg 1080 gtgaatgatt ccaaagtgcg attcgaggtg gacagtggtg ctcccgtcgc gattatgagc 1140 gtgacggact acgaaaagct gctgccggct gtgaaattgg atccaccgga tttatcactt 1200 gtgagctact gtggaaattc gatcaaattg cgcggtatgt gtacggtctc ggtgcagtat 1260 gcaggcaaag ttcatcgatt gttgttgtac gtggcggaca accgaaagca tccgttacta 1320 ggaagaagct ggatgaaagt tttgcgactt gacgtgaaca agttttacga agaagtacat 1380 tcggttgcgt acaatggtgt aaagtgcacc gccgatgaat cggtgaaaaa gttgattgcg 1440 cggtacggta gtgtttgtgc tgattcgatg gggaagataa aagggcttac ggcaaagttg 1500 cggttgaagc cgaacgccca tccggcgtac attaaagcta ggccagtgcc gttttctttg 1560 agaagtgctg tcgagcaaga aattgagaag ctcgtgaaag aaggagtgct cgaaaaagtg 1620 aaccatagtg attgggcgac tcctatggtc cccgtgatga aactgaacaa caaagtgcga 1680 ttgtgtgggg attacaaaat aacggttaac cctaatctag tagtggatga gcatcctctc 1740 cccacaatcg aggagctgtt cgcgaatgtc gccggtggag agaaattctc gaagatagat 1800 ctgtcccagg cgtatttgca acttgaggtg gatccggatc agcgtgaaat tctgacgctg 1860 agcacacacc ttgggctgta ccggccgacg cggcttatgt acggcgtgag ctccgcacct 1920 gcaatatggc aacgattaat ggaagaagtg ctaaacggaa taccgggagt gacagtgttt 1980 ttggacgata tacgtgtgac gggtccgaat gacgaagtac atttgcaacg acttgaagaa 2040 gtgcttaatc gtttgagcaa ctacggaatg cgtataaact tggacaagtg cgtgttcttt 2100 gcggacaaaa tagagtattg tggttacgtt gtcgaccgtc acggtataca caaagtgcga 2160 aagaagatcg aagcggtgca gaatatgccc actcccgaaa acaaagaaca ggttcgttcg 2220 ttcgtcggtt tagtgaacta ctatggacgt tttcttccta atctgagcac gatgatctat 2280 ccgctgaatc ggctgttgcg gaacgacgtg ccgttcgaat ggtcgaagtc gtgcgaagaa 2340 gcttttagga aggtgaaaca ggagatgcag tcagacagtt ttctggtgca ctacaacccc 2400 gagttgccac tggtgttggc gactgatgcg tcaccgtacg gagtcggtgc ggtgttaagt 2460 cacatcttgc cggacggatc ggagcgtccg atacagtatg cgtcgcaaac cttgaacgag 2520 acacaacgta agtacaaaca agttgatcgt gaagcctatg cgattgtttt cggcattcga 2580 cggttccatc agtatttgta tggacgcaaa ttccttctct acacggacaa cgagccggtt 2640 aagcaaattt tctccgagac gaagggatta ccgactatgt cagccttgag aatgcagcac 2700 tacgctactt tcctccaatc tttcgactat accatcaaat ttcgccccac taagcaacat 2760 tacaatgcgg acgcgttctc acggttgccg attgctacga agcggccgga taacgtcgta 2820 gaagaagtcg atatgttgga aacgagtatt atcgagacga tgcctgtcac cgttgatgac 2880 ttggccaaga gaaccgcggc agatggttcg gtcaaagtgc tgcttcaggg tctgcgaaac 2940 ggaaagacag tggaggcgaa ggatcgattc ggtatcgatc agaacgaatt ctcattgcag 3000 caagggtgca ttatgcgcgg tattcgggtt tacattcctc cagatttacg gatgaaggta 3060 ctcaacgaat tgcattccac tcactttggc agtacccgcc taaagacact tgccagaggg 3120 tacgtttggt gggaacggat tgacagggat attgaagatc tggtgaagaa ttgtgcttcc 3180 tgtcaggtta cacgtccgaa tcctgcaaaa gcgccattgc actgctggga acctgctact 3240 caaccgtttg aacgagtaca tgttgatttc gctgggccgt tcatgggcaa atattttatc 3300 gtttttgtcg acgcctacac aaaatggcca gaggtgaaga ttctacggga cataactact 3360 gctacgacta tcaatgcatg tagagagttc tttgcgactt acggaatacc ttgcgttctg 3420 gtcagcgatc gtggggtgca gttcacctcc ggtgagtttc agcgatttct gcagttgaat 3480 ggaattttcc acaaaatggg tgcgccgtac catccggcga ccaacgggca ggtcgaaagg 3540 ttcattcaga ccttcaaaaa caaaatgaag gccatgaaat gtgacaaatc aagaatgcac 3600 gtggaacttt gcaacattct cttgacttac cggaagacaa tccacccgac aaccggaaag 3660 tctccttcca tgatgctctt taatcgacaa atccgatccc gacttgattt gatgctgcct 3720 ggtccgacat tctcggagaa ggttgatcca aaagtccgaa ctattcccga gggaggaaga 3780 gtcgctgcgc gagatttctt ggatcaggaa aaatggaaat atggtcgtgt gatagagaag 3840 cttggaaagt tgcactacat ggtccagttg gacgacaatc gtgtatggaa gcgtcatatc 3900 gaccaactac gagaagtggg accgcactta ccagctagtc gaaacctacg agagatgcca 3960 gtattaccga accttccaca acaacccgca gctacttcga acgctacaag aggcaatgct 4020 gtgccgattt atcaagaagc ctctacatct acggaaccag ttgctattcc agcaagttca 4080 agagttgaac cagcctcggt gtcggtttct ggaccacgtc cgacattgag cgttccgtcc 4140 acggttccat ctgcgagtaa gcctacaacg gagcgtgaac caagcgaagg gtcgtatcag 4200 caaacgccaa gaagatcaac ccgtatggtg aaagctccta agagattgga tttgtaacgt 4260 tgtaaaccat aaggggggaa gagc 4284 // ID Transib-N4_AAe repbase; DNA; INV; 1625 BP. XX AC . XX DT 11-OCT-2010 (Rel. 15.1, Created) DT 11-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous Transib DNA transposon family from Aedes DE aegypti. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW otherMITEs_Ele32; Transib-N4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1625 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1625 RA Kojima K.K. and Jurka J.; RT "Transib-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [1] Named as otherMITEs_Ele32. CC [2] Consensus update and characterization as a non-autonomous CC Transib. ~98% identical to consensus. This consensus is ~98% CC identical to the original sequence in [1]. 5-bp TSDs; usually CC CANTG. TIRs are ~720 bp long. XX SQ Sequence 1625 BP; 565 A; 246 C; 258 G; 556 T; 0 other; cacactgggc caggagcaga atttagccag acaaaacctc tagcgcttta ggagtgcatt 60 ttttcgagtt ggtgtcaaag gagacttgtc ttgaatttgt ttgttcttca atttgatgat 120 aaaagttagt tggaaatttc gccgcatagg tggcgctgcg atgcaaactt ttttgttttg 180 cgtcctagag ctttcgcgtc ttcggcaatg ttttagaacg tgtaaaaata cgacaagttg 240 tcgaagacac caaagttcta ggacttcaaa taacaaagtt atggtaaaaa atgtgtaaat 300 cacttaaatt ttacgttttt tctacttttt acctcaaaat cgtaaaatat tttattttaa 360 caaccgtgcg tcgtttaatt tcgatgacat gcatgtactg tatgaaaaaa aatcttatta 420 ttatataaat atttgaattt ttgaacaaat tattgtaaaa acatcataat ttttaacata 480 actttactca aaaagttgca agtttttgca aaaacttatc aatgtaccga aatacgctat 540 atttcatcta ccaaatgtca aagtttgcca gatgtatatt ttgatatatt tgagatatct 600 taattcaaaa tagccatctc ttcatataaa tagaatggac ttattattaa ttttatcatt 660 attctacaag atttttgaca cattgcgtca gttaagattt gttaatcaaa tacttttatt 720 ttgatactag tgatttgtaa attgtgattt atgtcagaga gaactgtctc gacgcggtca 780 taacaggatg taaaaaatga attatagcag gccatagcga agttctaatt agacagttgc 840 tggagatgac ataagagcca cacaacaata tgaaaatatc tagatactgt aacacctgac 900 taataaaaag tatttgatta ataaaacttg actgacgcaa tgtgtctaac aatcttgtag 960 tacaatgata aaattgataa taagtctatt ctatttatat gaaaacggtg gttattttga 1020 attcagatat ctcaaatata ccaaaatata catccggcaa actttgacat ttggtagatg 1080 aaatatagcg tatttgggat cattgataag tttttgcaaa aatttgcaac tttttgagtt 1140 aagttatgtt aaaaattatc atatttttac aataatttgt tcaaaaattc aaatatttat 1200 ataaaaataa gatttttttt tcataaagta tatgcatgtt atcgaaatta agcgacgcat 1260 ggttgttaaa atgaaacatt ttacgatttt gacgtaaaaa gtagaaaaaa cgtaaaattt 1320 aagtgattta cacatttttt accataactt tgttatttga agtcctagaa ctttggtgtc 1380 ttcgacaact tgtcgtattt ttacacgttc taaaacattg ccgaagacgc gaaagctcta 1440 ggacgcaaaa caaaaaagtt tgcatcgcag cgccacctat gcggcgaaat ttccaactaa 1500 cttttatcat caaattgaag aacaaacaaa ttcaagataa gtctcctttg acaccaaccc 1560 gaaaaaatgc actcctaaag cgctagaggt tttgtctggc taaattctgc tcctggccca 1620 gtgtg 1625 // ID hAT-N5_BF repbase; DNA; INV; 365 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N5_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N5_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-365 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-365 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 915-915 (2008). XX DR [2] (Consensus) XX SQ Sequence 365 BP; 105 A; 100 C; 69 G; 91 T; 0 other; caggctttta gccaggcatg cgtcaaggcg tcatctggac gcacttttct taaaataaca 60 agaaattgga cgcactagac gcccttacac tggggagcag atcctctgac aagcatgaaa 120 ttctgattga ccagggatgc cactaggtca tttgtggtca cagtcttcta ctgtgacctc 180 tctgacagta attttgcccc tcaggatacc ttagaatgca ccattttaac ttaaaattct 240 caaaaactct gcaccatgga aggtggatgc ccccggaccc ccctacaata gtcacgccta 300 tatcacttac cgcgactcac caagaaattt ggacccccta taataaaatc ctagctaaaa 360 gcctg 365 // ID Gypsy-254_AA-I repbase; DNA; INV; 4811 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-254_AA_; KW Gypsy-254_AA-LTR; Gypsy-254_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4811 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1109-1109 (2011). XX DR [1] (Consensus) XX CC Positions [3838-4314] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 703..2562 FT /product="Gypsy-254_AA-I_1p" FT /translation="MEMNGRPIPQFCCEQIERTKLAREWKSWKTSLEFYFE FT AHEIHDQKMKRAKMLYLGGTQLQRVFTNLPDTDKVPLVAWEKRWYDTAIEK FT LDEFFQPVRQDTLERHRLREMKQKHDERFAQFILRLRQQVADCGFEKHPPE FT IAKVLTEITLIDVIVEGCSSSELRRRILKEDQTLEQIEALGAMLEGVEEQV FT KGFSADQKMDDKVFKVTDNRPWNARPIRQDRGACFNCGNMGHWSKSAQCPA FT TNQICRNCKQKGHFKSVCRAKKRMPPKVFNDHPDKRVRLIQTNENTVPDST FT NDVQEKQYYAFYSGNQSNMIDCKIGGVKWEILIDSGADCNLISSRAWAMLK FT DAKIQVHSSTKGCVRTLRAYGSQNPLSVLGSFVADIEIGQKRTTAEFFVVD FT GGQQCLLGDTTSKELGVLKVGLNIQNVNEHPKPFAKISGIQIHIHMDPEVK FT PVFQPLRKVPIPLEAAVNRKLDELLARDIIEVKVGPTTWVSPLVVVGKSNG FT DPRLCLDLRRVNEAVLRERYPMPVVDEYLARLGKNMIRSKLDIREAFLQVE FT LAPESRDCTTFITSKGLYRFKRLPFGLVTAPEAFQRTMDEILTGCEGTHWY FT LDDVVIEGATIEEHDHRLQKVA" FT CDS 2779..4788 FT /product="Gypsy-254_AA-I_2p" FT /translation="MQTKSDAIISFRRPTNETEVKSFLGLANYMNKFIPNL FT ATIDEPLRQLTQKGVRFLWTERHTNAFEAIKQAMSQAVRLGYFDVNDRTLV FT LADASPVGLGAILAQTDENNETRIVSYASKSLTDTEARYCQTEKEALSLVW FT AVERFQVYLIGRAFDLVTDCKALHFLFAPRSRPCARIERWVLRIQAFDYNV FT IHIPGEKNAADALSRLATMRSRPFDASEELIIREVASLAAGAVALSWVEIQ FT NAYKEDEELKEVINFIEADRALELPLSYRLVANELCVVDGVLMRIDRVVIP FT SKLRNTVLRLAHDGHPGSNMMKSFLRSSVWWPKMDRNVEEFVRECRGCNLV FT SAPEAPEPLVRREMPNGPWQDVAVDFLGPLPDGQHLLVVVDYYSRYVEIRE FT MASTSATATVHELSAIFALFGLPITLRADNGPQFSRECEEFSSFCQENGVR FT LINTVPYWPQHNGEVERQNRSILKRLRIAQQLGKDWKMELTKYLLVYHSTN FT HPTTGKSPAELMFGRRIRTKLPQVPLFKTDDEEVRDKDREQKQKGKEYADV FT KRKARFSEISVGDCVLMKRMKKSNKLDSEYLNEEFIVTRKVGMDCTIKSKL FT SGREFRRSVAHLKRLEPTNDGNDKRSDSTSTSYSDTSPLPKADKPSMSNVG FT AEPFRKRICREPRRFQDYVSH" XX SQ Sequence 4811 BP; 1471 A; 898 C; 1205 G; 1237 T; 0 other; attggcgaga ggattggcgg aaaaaaaaac taattggagg taggaaacat aatcttattc 60 taatttcttt tttagcttac tcggacaaaa taactattaa aaaggtgact gatgttaatt 120 ttgaagaaaa ttaaaacgtg attcagttgt ggaacatgaa aaagaaaaaa aaaagttctc 180 tgacgtgaag agtggtgaaa cgattttgag gttaaaaaaa aaccgcaacc gggaggttgg 240 aatttagaaa acagtccaca accgggaggt taagttccac aatcgggaga ttgaaatccg 300 cgaccgggag gttggttcga ttagattgat ttatataagc tgaagtccac aaccgggagg 360 ttgagatcca caatcgggag attgaaatcc gcgaccggga ggttggtatt ttgatctgat 420 agcaaacgta aacaaaagca atggtttgaa actgcatccg ggaggatgag tttttcaatg 480 tatacagcgc atgctgaggt ggaataaggg tgtgtatcat tcggtgaagt ttccacgcag 540 tgagattttt tttacttttc gaaaaaaaaa gccaatctgg ctaaccggat tgagtttata 600 catggtttaa ttggaagcaa gcgagttctg aaaagagttg ttgaaaagga aaaaaattct 660 gcatacacac ttatatgtgt actattttct gttctgttac agatggaaat gaacggacga 720 ccgattccgc agttttgttg cgaacagata gagcgcacga aattggcacg ggaatggaag 780 tcgtggaaaa cctcgctcga attttatttc gaggcgcatg aaatccatga ccagaagatg 840 aagagagcga agatgttata tcttggaggt actcagctgc agcgtgtttt cacaaaccta 900 ccggacacag ataaagttcc attggtggca tgggagaaaa gatggtacga cacagcaatc 960 gaaaagttgg acgaattctt tcaacctgtt cgccaggata cgcttgaacg acatcggtta 1020 agagagatga agcagaaaca cgatgaacgg tttgcgcaat ttattttgag gttgaggcaa 1080 caggttgcag attgcggttt tgagaagcat ccaccggaaa ttgctaaggt gttgacagaa 1140 ataacactaa tcgatgtgat cgtcgaaggt tgctcctcat ccgaactgcg acgtcgaatc 1200 ctgaaggaag atcaaacttt ggaacaaatt gaggctcttg gtgccatgct tgaaggggtt 1260 gaagagcaag tcaaaggatt cagtgctgat caaaagatgg atgacaaggt attcaaagtg 1320 acggacaaca gaccttggaa tgcacgccct attcgtcaag atagaggagc ttgtttcaat 1380 tgtgggaaca tgggacattg gtcaaagtcg gcacaatgtc cagctacgaa ccagatatgc 1440 cgaaattgca aacaaaaagg ccacttcaaa tcagtttgtc gggctaaaaa gcgtatgcca 1500 ccaaaagtat tcaacgacca cccagataag cgagttcgtt taattcagac gaacgagaat 1560 actgtacccg attcaaccaa cgatgtacag gagaaacagt attatgcgtt ctattcagga 1620 aatcagtcca acatgattga ctgcaaaatc ggaggcgtaa aatgggaaat tctcattgat 1680 tctggggccg actgcaacct gatatcgtcg cgtgcctggg caatgttgaa agatgcaaag 1740 atccaagtac actcatcgac gaaaggctgt gttagaacat tgcgggctta cggaagtcag 1800 aaccctttga gcgtattggg aagtttcgtt gccgatatcg agattggtca gaagcgcact 1860 accgcggaat ttttcgtggt agatggaggt caacaatgcc tgttaggaga caccacttct 1920 aaagaacttg gtgtgcttaa agttggactg aatattcaaa atgtgaatga gcaccctaaa 1980 ccgtttgcga aaatctctgg tattcaaatc catattcata tggatccaga agtgaagccg 2040 gttttccaac cactacggaa ggtgcctatt cctttagaag cagctgtgaa ccggaaattg 2100 gacgaacttc ttgcgcgaga tatcatcgaa gttaaggtcg gaccaactac atgggtatca 2160 ccacttgtgg tggtcggtaa atctaacggc gatccgagac tgtgcctgga ccttcgccgt 2220 gtcaatgaag cagtccttcg agaacgttac ccgatgcccg tggtagatga atatcttgct 2280 cgcttgggaa aaaatatgat tcggagcaag ttggacatca gagaagcatt cctacaggtg 2340 gaacttgcgc cggaatcaag ggattgcaca acgtttataa caagtaaagg actttatcga 2400 ttcaaacgcc ttccgttcgg ccttgtcaca gcacctgaag cgttccagag gacgatggac 2460 gaaatactga ccggctgcga aggtactcac tggtacttgg acgacgtggt tatcgaggga 2520 gctactatcg aggagcacga tcatcgattg caaaaggtag catgactcga ttgatttgaa 2580 gcttgttttt tttgttatgc agacaataaa tcaaattgaa ataaatgttt ttttttattt 2640 catctaaaaa aacaatcatc ataggtgcta aaacgattcg aagaccgtgg agtggagcta 2700 aattggcaga aatgtgtatt caaagcgaca gagcttgatt tcttgggaca taacatcacg 2760 tcggatggta tatttccaat gcaaacaaaa tcggatgcca taatatcgtt ccgtagaccc 2820 acgaatgaaa cggaagtgaa gagtttctta ggcttggcga attatatgaa taagttcata 2880 cctaatttag ccacgatcga cgaaccatta agacaattaa cacagaaggg tgtgcgattc 2940 ctgtggactg agcgacacac caatgcattc gaggcgataa aacaagcgat gagtcaggca 3000 gttagactgg gatatttcga tgttaatgac cgtacccttg tattagcaga cgcgagtcca 3060 gtcggactgg gtgctattct agcgcaaacg gatgagaata atgaaacacg tattgtaagc 3120 tacgcttcga aatctctcac ggatacagaa gctaggtatt gtcaaacgga gaaggaggcg 3180 ctttctctag tttgggccgt ggaacgtttt caagtgtact tgattggtag ggcatttgat 3240 ctcgttacgg attgcaaggc cttacacttt ttgtttgccc ctcgttcaag gccttgcgca 3300 cgcattgaac gttgggtatt acgtattcaa gcattcgatt ataacgtaat ccacattcct 3360 ggggagaaga atgcagcaga tgccctatct cggttggcaa cgatgagatc acgacctttc 3420 gacgcctctg aagagttgat tatccgagaa gttgcttctt tagctgctgg agccgtagca 3480 ttgtcgtggg tagaaatcca gaacgcgtac aaagaggatg aagaattgaa ggaggtcatc 3540 aactttatag aagcggatcg cgcactcgag ttgcctttat cttatcgttt agtagctaat 3600 gagttatgcg tagttgatgg agttctgatg cgaatagaca gagttgtgat tccgagcaaa 3660 ctgcgaaata cagtacttcg cttggcacac gatgggcacc cgggcagtaa tatgatgaaa 3720 tcttttttgc gatctagtgt atggtggccc aaaatggacc gtaacgtaga agagttcgtt 3780 agagaatgtc gtgggtgtaa tttggtttca gctcctgagg ctccagagcc acttgtgaga 3840 cgcgaaatgc caaacggtcc ctggcaagac gtggctgtgg attttttagg tcctctacca 3900 gatggacaac atttgctggt tgtcgttgac tactacagtc gttatgtaga gattcgtgaa 3960 atggcatcta cgagcgcaac tgctaccgta catgagttga gtgctatctt tgcgctattc 4020 ggcctaccta ttacgttgag agcagacaac ggaccacagt ttagcagaga atgtgaggaa 4080 tttagttctt tttgtcaaga aaatggcgtt agattaatca acacggttcc atattggccg 4140 cagcacaacg gcgaggtcga aagacaaaat aggtcgatcc taaagcgtct taggattgct 4200 caacaactgg ggaaggattg gaagatggaa ctcacaaaat acttgttggt gtaccattct 4260 accaaccatc caacgacagg aaaatcacca gcagaattaa tgtttggccg acgtattcgt 4320 accaaattac cacaggtccc attatttaag actgatgatg aagaggttcg cgataaagat 4380 cgagagcaaa agcaaaaagg gaaggaatat gcagatgtga aacggaaagc tcgttttagt 4440 gaaatatcgg tgggagattg tgtgcttatg aaacgtatga agaagtctaa caaattagac 4500 tcggagtatt taaatgaaga atttattgta acacgaaaag tgggaatgga ctgtacgatc 4560 aagtcaaagt tgtcgggaag agagtttcgt cgaagtgttg ctcatttgaa acgattggag 4620 ccaacaaatg atggcaacga taaacgcagc gatagtactt ctacctccta ctcagataca 4680 tctccattgc cgaaggctga taaaccctct atgtctaatg tgggtgccga accgtttcga 4740 aagagaatct gccgagaacc tcgtcgtttt caggattacg tatcgcatta aatattaaag 4800 ttaaggtgga t 4811 // ID BEL-78_CQ-LTR repbase; DNA; INV; 369 BP. XX AC AAWU01022194; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-78_CQ_; KW BEL-78_CQ-I; BEL-78_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-369 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 298-298 (2011). XX DR GenBank; AAWU01022194; Positions 13999 14367. XX SQ Sequence 369 BP; 114 A; 72 C; 100 G; 83 T; 0 other; tgtcggcgat ggacatctgc ctccgaaaaa tatttgacgc aaaagtaaaa atcgaacgaa 60 tcggccgcac gggttgctgc gatcaaacgc tgcctgcttt gacagctagc aacggtcgtt 120 gcggtgcggt ttggcgcgca acgaagcgtg cgcaaaaagg gaaggagaga gagagaaagg 180 gaaaattttc actcccgttt gagacgtgat cgagtaaagt caagttttgg tgtaaatacg 240 atcggaataa agttttttga agtagagaag agattttcca agtgttttga tcacacgccg 300 aataaaactg gtaagaacat cccatacagt ccacctgcag aatcgaagtt ggggaatttg 360 gcctcaaca 369 // ID DNA-5_AAe repbase; DNA; INV; 2824 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2824 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1259-1259 (2011). XX DR [2] (Consensus) XX CC >91% identical to consensus. Possibly 4-bp TSDs. TIRs are ~500 CC bp long and .terminal 150 bp are similar to those of CC DNA5-1_AAe. XX SQ Sequence 2824 BP; 895 A; 488 C; 545 G; 895 T; 1 other; cacggtgctc ccctgaccgt tattatcaga aaaaaatctc catcaaatct gtgctcacca 60 gtcgttttct atagctaagc tgcatgtctg ggcaaaattt caaaaaaatc gtagggcccg 120 ttttgaagtt acgccctttt gattgtataa gtccacaaat tcaaaggaaa tctgagatat 180 ttaaacgggt tttcattggc cgtgattatc agaagaaata taccatcaaa atttggttca 240 aaagtaggtg tttttagtta tttgtaacct ctgggcggag ttttgaatga gaagatcgtc 300 gaaattttga gttacgccct ttttgaagtg taaccattaa aatcaccaat ttcaaataga 360 ttaattaggc attctaatac cggtaagcat gttcaaatgt tttcaatgac acattgcact 420 tacatgccgt caaagtcttt cacaattgac tgatttttca taggctcaag cgatataagg 480 cattacggag ccaatttcat catgtaaccg aattagtttt tcaacaatat tctggatagt 540 ttggtgaact caggtgcccg attcaaattt ttacatgaat caaatgccac tattggctat 600 tgtaggtgtt aaatggcttc gaatcaccat tagagaaatt gttcgtcagt ctgtcggttt 660 ggtgaacaat gaactcacga ttcagattat ttaacatgca attcatgttg tatatcgaag 720 cgtaagttga cgtcaaagaa tactttcgga cttgagtttt cggttataag tagcagggta 780 tggaataaaa tgaaattaac gacgtagaaa gcagcgatac aaaatggaaa aaaatgcaaa 840 gtatagaaca tccgttttta ttttgtttaa tggaatttaa aatcgtagca cttacagagc 900 ttaattttaa atgtttgtct ttgtcacctt ggaaaagttt gtttagcaaa gtatgattta 960 ttttcgtttg tcttcactaa ttatagcagc tggtgagtct taacttcaca ggcacagttt 1020 gcgatcgaaa aaaatgaaag tttacagata aatcttcagt ttgtgtagga ttcaaaccat 1080 gttatggtag gtagaccagc aggtaaccaa cagggccaca gaacaattta cgattttccg 1140 gtaacaaaaa ctggaagtta gtcttcttta aattataccc ctctgaattc cattaccccg 1200 aatatcatga acctgaagac cataacctca gttccagttc ctagaatggc attacttacg 1260 aattccatta tcacggggca atgtcattca aggtaattat atattcggga caatagcatt 1320 cagcgtaatg gtgcgttcat ggtgatgaca ttcggggaaa tgaaattcgg ggttatgggg 1380 tagaatcttc tcaagagagt ttgatttgat attgattttt aaggaagaca tgtgaaatta 1440 agctctgcaa gtgttatgat ttaaaatcaa tgaatccaaa caatacgaat atttcttact 1500 tttcttaagc gtatttcaag tattgaggct tctcttcacg agttggcttg agaacgggtc 1560 gatcgattta tacaattctt tcactattgt attcatccag ggctcttacg tgtttgtgtg 1620 atcgaaaaga ccgggaaatt gtagatttgt gaaaaatcta aatgtgtaat gtatgataca 1680 aatgatcaca aaagtattga cagtggtcta gagtgccata gtttggccaa tagtgcatta 1740 ctcagcaatt gaagtcaagt ctgaacgatt acctgcggaa cagtttttgt ctcaagcatc 1800 agaaccgstt actaattatg ggccataccg tttcagaaaa tcatagcacg cctagtttcg 1860 agagtgcttt aaaatgctga attcatttat cagccaattg catgcaagtt tgccagcttg 1920 taaggcaaaa tgctaatttg aaatttattg tacacataca gtacttatcg acaagtttat 1980 gactcaagtt attttggtgt tattgtctgc cataaaacga aagctttgta tcggactgca 2040 aacgatctat tctaaatgtt atgcaatttc aacagattta tatcatgaaa gttgagtcct 2100 agtatgcttg cacagaagtt tgataaactc acaaacatca attaaccagt tttgtcaact 2160 ttgcctgtag cttcgtgcgg gcaggacccc aaactattag gcacatatcc tgcaaatgca 2220 ttactcagca attgaggttt aagttctggg gcagtacaaa gttagccgga acagccgggg 2280 gcagctagtt gggttataac aatattttat gaataataat aacaatacaa aattggctcc 2340 gtaatgcctt acagcgcttg agcctatgga aaatcagtca attgtgaaat actttggcgg 2400 catgtaagtg caatgtgtca ttgaaaacat ttgaacatgc ttaacggtat tagaatgcct 2460 aattaatcta tttgaaatta gtgttttcaa tggttacacc tcaaaaaggg cgtaactcaa 2520 aatttcgtcg atcttctcat tcaaaactcc acccagaggt tacaaataac taaaaaaacc 2580 aactttgaat caaattttga tggtatattt cttctgataa tcacggccaa tgaaaacccg 2640 ttaaatatct cagatttcct ttgaatttgt ggacttatac aatcaaaagg gcgtaacttc 2700 aaaacgggcc ctacgatttt tttgaaattt tgcccagaca tgcagcttag ctatagaaaa 2760 cgactggtga gcacagattt gatggagatt tttttctgat aataacggtc aggggagcac 2820 cgtg 2824 // ID SATREP_CG repbase; DNA; INV; 166 BP. XX AC U20340; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Crassostrea gigas satellite DNA sequence. XX KW SAT; Satellite; Simple Repeat; SATREP_CG; satellite repeat. XX OS Crassostrea gigas OC Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia; Ostreoida; OC Ostreoidea; Ostreidae; Crassostrea. XX RN [1] RA Clabby C., Goswami U., Flavin F., Wilkins P.N., Houghton A.J. RA and Powell R.; RT "Cloning, characterization and chromosomal location of a RT satellite DNA from the Pacific oyster, Crassostrea gigas."; RL Gene 168(2), 205-209 (1996). XX DR Genbank; U20340; Positions 1 166. XX SQ Sequence 166 BP; 43 A; 40 C; 27 G; 56 T; 0 other; ccccaccctg cccccggggg tcatgatttt cacaactttg aatttacact acctgaggat 60 gcttccacac aagtttcagc tttcctgact gattagtttc tgagaagaag atttttaaag 120 atttactcta tatattccta tgtaaaattt cgaccccata ttgtgg 166 // ID hAT-N3_AP repbase; DNA; INV; 799 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N3_AP. XX NM hAT-N3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-799 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2102-2102 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 799 BP; 269 A; 124 C; 159 G; 247 T; 0 other; cagtgttggg aaggaacgcg ttccttttat aggaacgagg ctcggaacgc gttctattta 60 aaatatagga acgaggaacg ggaaagagtt cctattttta aggaacttga acgaaaattt 120 ccgttcctca aataatgaaa aaaaaaaaaa aatgaaagtt gtttttaata ataatatttt 180 ctagaatata tttagtgttg gtattttggt attacttcat caatcattag taaatagtaa 240 aacgaacaac ctaggtatcg ttggtatatt ttaatttagt tcgagattag catacaaggt 300 ataatcatag atattataag aatggatagg acatcgagct tatcaactga gaataaaggt 360 atacaatttt tgaggttata agagcgccaa agattagtaa cctaacctca aaaagcactc 420 ttatagtcta tctctctcgt agatattgaa ctttctctct tcttatatct gatatcatac 480 gttttatcga ctatcgtctt gatatgagac gtgagtcgtt ttaatttaaa cgatatcgaa 540 tgcgcggccc tgacgagaat cggtcggcgg actgattgtt ctctcgtggg acagagtata 600 aaacaaatca tttttgttca aatcggtgct ccttaggaag gagttagtta accgataatt 660 tctgacttaa aaattgttcg caaaaagaca ccattgacga cgtaaaaaat gtgtaaagga 720 acgggaacgc gttctttttt cgcgacagga acgaggaacg ggaacgagtt cctttttaaa 780 aggaactttc ccaacactg 799 // ID DNA-TTAA-3_CQ repbase; DNA; INV; 1394 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TTAA-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1394 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 67-67 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. ~660 bp TIRs. TSDs are TTAA. XX SQ Sequence 1394 BP; 440 A; 253 C; 257 G; 442 T; 2 other; gagcgagtcc acgagcaggg catacccctt tgtatgggac stcgcwtgga cctcaccaat 60 ctgcctgaaa ttttcagggg ttgtttgtac atataaaact agcatctggc caaaatatga 120 gcactctagg tcaacgggaa gtggggcaaa tcgggacaca aagtttgaag gttcaaaaac 180 gtcaaaaatc ttaaaaaggc tataacttag gcaaaattca atttaatttc aaaattcaaa 240 atgcatctga aagggcttaa aaaatgcaac aaaatgcagg gtagagcatc ccaattggtt 300 aattctaaag ggagttattg gcattttagt gaaaaaatag cataattttc aaactcaaat 360 aaaaaagtgt tccatccaga tatcaactcg gttcgacctg cagcttgtag gggacatctg 420 ggactaccat ctgagactga gaacgctttg ggtaaggcag tttaacatat tgaatagaca 480 cttttacttt tagtgaattt tttggttgta aatttttgct cgggggaccc cttagatccc 540 attttctggt gataatttta tcatattcgt gttcctgaga caatttcaca ttagaaacat 600 gcataaaaat gtttattttc atccatttta accctttaaa aaatgaaagt taaaaaaaat 660 tgagaagctg cttttttctg gtgtcatcta gtatccttgg aaacgaactt gattttcaat 720 aaatgcgaat cgaagatttt ttttaacttt cattttttaa agggttaaaa tggatgaaaa 780 caaacatttt tatgcatgtt tctaatgtga aattgtctca ggaacacgaa tatgataaaa 840 ttatcaccag aaaatgggat ctaaggggtc ccccgagcaa aaatttacaa ccaaaaaatt 900 cactaaaagt aaaagtgtct atttaatatg ttaaactgcc ttacccaaag cgttctcagt 960 ctcagatggt agtcccagat gtcccctaca agctgcaggt cgaaccgagt tgatatctgg 1020 atggaacact tttttatttg agtttgaaaa ttatgctatt ttttcactaa aatgccaata 1080 actcccttta gaattaacca attgggatgc tctaccctgc attttgttgc attttcgaag 1140 ccctttagat gcattttgaa ttttgaaatt aaattgaatt ttgcctaagt tatagccttt 1200 ttaagatttt tgacgttttt gaaccttcaa actttgtgtc ccgatttgcc ccacttcccg 1260 ttgacctaga gtgctcatat tttggccaga tgctagtttt atatgtacaa acaacccctg 1320 aaaatttcag gcagattggt gaggtccaaa cgacgtccca tacaaagggg tatgccctgt 1380 tcgtggactc gctc 1394 // ID BEL-39_CQ-I repbase; DNA; INV; 4033 BP. XX AC AAWU01047176; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-39_CQ_; KW BEL-39_CQ-LTR; BEL-39_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4033 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 231-231 (2011). XX DR Genome; AAWU01047176; Positions 4833 8865. XX CC 'ATCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1066..3711 FT /product="BEL-39_CQ-I_1p" FT /translation="MDFKFAENKKGSCRQCDSSDDVEEFMVSCDSCDRRFH FT IKCVGLARTPRSKDHWKCDKCLAIQEMLDKQDALIKSLEEKCMKYFGTQQV FT AQNSETNQNLQVKALEAIIDKINSINISQTSSNSLVMRQSLMDLPEFDGSY FT KIWPRFKQAFYETTRQGNFSDFENINRLNNCLKGEALSRVNSLLINSSNLN FT EIMIKLEDQFGSVERVYSGLLNDVLALRNPKFENPQSMIDFISGIGDLVIN FT MECLNHPEYLNDQRLVRDLANKLPASLHQKWLMSLNEEKMLATIKNPYTAP FT TFKTLYEWLKPQEKLASMLLAERGANKESSVETIDRVNYHGIHQLKCYLCD FT QNHKLTECDEFKKISLVERRKFVAEKKLCFSCCRANHLAKHCRFAEPCNTE FT GCKSKHNRLLHLKIPEKKITVASKPTKDDLKKVNCHVRDSHQVYYPIIPVT FT LMNGENSFETFAFFDSGSSVSLIDKNVVNKLKVTGRDKPLTLAWTNGEIKE FT NNRSKSVQLNVKGPNGKLFQLNDLRTIKGLSLPHQSVDIARLKARFSYLEN FT SKLESYEKAVPTILLGLPHAFLFKGNQELSGRFNEPIARKTKLGWVLFGSS FT QIEDNKKKHVFIIREVQKEDFSKNLVSKDTNENCPKGGTIKVDQSSERKVL FT TTEIDNKLNICNVQTKTFDAISSQACVQVTKNKEVIIFIEDSSNLVEQNWN FT NQRNDYLVYFYNSKLATKVARYRVVMCRKANKEFGNSNSNLKLMKRNFPLK FT EMELYNTGNILCKFRKKTNCKLKNKVTANSDEINNLTKRQNLSHAMDVYDR FT QSISFKSLMKNLWKSKYSWDKCIPEIIKLIPTNGFNQMIVNVNSNVNLIKI FT VSIDNLEQQAHDLETTNEVQMCKLIKQRV" XX SQ Sequence 4033 BP; 1499 A; 610 C; 819 G; 1105 T; 0 other; tggtggctcc agagaggaag ctggagagtg tctgttgaaa tcagacactt tggatccccg 60 gacttccaac tagttgaaat ctagttgtga ttgtgagtat aagcgtgtcc gacgctatag 120 agcggactag tgaaaagtgt tcttgagcag tcttaggctc agagaaaatt gtggacgaag 180 tgtcgtcgaa gtgattcctg aactgtgtta gagttcagat gaacttagtg tcgagaaagc 240 tcgtcagtga aatcctgtgg acttattcgc agaagaaatc ttgagcaaga actcagatgt 300 tttgagaaga tgtaagatct ctcgtctggc agtaaacgtg gaaaaaaaaa aaaaaattta 360 attcgtacgt tgatcactgt tcaaattgtg tgtgagcctg caaactaaat cccagaagaa 420 acctgtatac gtgagcctgg aaattttatc cagaagaaaa ttttgtgtga gcctggacgt 480 tacaaccaga agaaatttgt gaagtaagcc tggtccttga aaaccagaag aacagtttta 540 tcggaaatca gtgttaccga agttatccgt gttatattag cctaggtgaa gacctagaag 600 aactaaaaat tggcagaaga tttggtggat aacggatctg ccagtggaag tcgcacacaa 660 acgtggactt agcccgacgc gggaattgtt cgtttaacgt gaacaaagtt tttgaatcct 720 ggaagaattt ctagatgaac atttggacgc agtaaattgg cgtcaaaagt tagttcgtgt 780 gtgcacagca aaaggtgttg tgagtgatcc tgaaattcaa agatccagat gaaatgttaa 840 aaggaagaag cgtctccggt gcttaaatcg gaaacagtgc aattaaaaaa aaaaaagtat 900 cgtgcaataa gaagcagctt aatagagtgg atcatcgagc gttgaaaagg aatcaacgac 960 gagtacatca acaagcgacg tgtccaacca gcaacatccc atcgagcatc aagaaagagc 1020 agcaacacat cgatcatctg aaattgtaag gtattcgaag gcaaaatgga tttcaagttt 1080 gctgaaaaca agaagggtag ttgccgtcag tgtgatagtt cagatgatgt cgaggaattt 1140 atggtttctt gcgacagttg tgatcgtaga tttcacataa agtgtgtagg attagctcgt 1200 actcctaggt caaaggacca ttggaagtgt gataaatgtt tagctattca ggaaatgtta 1260 gataaacaag atgctcttat taaatctctt gaagagaagt gtatgaaata ttttggaaca 1320 caacaagttg cacaaaattc agaaacaaat caaaatttgc aagttaaagc acttgaagcg 1380 attattgaca aaatcaatag cattaacatc tctcaaactt cttcaaacag tttagtgatg 1440 cggcaatctc taatggattt gccagaattt gacgggtctt acaaaatctg gcccaggttc 1500 aaacaagcct tttacgagac tactcgtcaa ggtaattttt ctgattttga aaatattaat 1560 cgattaaata attgtctcaa aggtgaggca ttaagcagag ttaattcgtt actcataaat 1620 tcaagtaatc tgaacgaaat aatgattaaa ttagaagacc agtttggcag tgtcgaacga 1680 gtctactctg ggttgttaaa cgatgttttg gcattgcgta acccaaagtt cgagaatcct 1740 cagtccatga tagattttat atctggaatt ggagatttag taatcaatat ggagtgcttg 1800 aatcaccctg aatacttgaa tgaccagagg ctagttcgtg atcttgcgaa caagctacca 1860 gcttcactac accaaaaatg gctgatgagt ttgaatgagg agaaaatgtt ggcgactata 1920 aaaaacccat ataccgctcc tacatttaaa acactttatg agtggttaaa accacaagaa 1980 aaattagcta gtatgctgtt agccgaaaga ggagctaaca aagaaagttc tgttgaaaca 2040 atagatagag ttaattatca tggaatacat cagttgaagt gttatttgtg tgatcaaaat 2100 cacaagttaa ctgaatgcga tgaattcaag aaaatatctc ttgttgaaag gagaaaattt 2160 gtagcagaaa agaaattatg cttttcatgt tgtcgcgcaa atcatttggc aaaacattgt 2220 agatttgcag aaccttgcaa caccgaaggt tgcaaaagta aacataatcg attacttcat 2280 ttaaaaattc ctgaaaagaa aataaccgta gcttcaaaac ctacaaaaga tgatttaaaa 2340 aaggtgaatt gtcacgtaag ggatagccat caagtttact atccaataat accagtaact 2400 ttgatgaatg gtgagaatag tttcgagaca tttgcattct ttgattcagg ctcctcggtc 2460 agtctcattg ataaaaatgt tgttaataag ctcaaagtta caggaagaga taaaccatta 2520 acattggctt ggaccaatgg tgaaattaaa gaaaataaca gaagtaaatc tgttcaattg 2580 aatgtcaaag gtcccaatgg aaaattgttc caattaaatg acctgcgaac cattaaaggt 2640 ttatcgctcc cacatcaatc tgtggacatt gcacgtctca aggcaagatt ttcttatctt 2700 gagaatagta agttggaatc ttatgaaaaa gcagtgccaa caatattact tggattacca 2760 catgcatttt tgttcaaggg taatcaagaa ctatcaggaa ggttcaatga accaattgca 2820 agaaaaacca aattaggttg ggtcttattt ggaagtagtc agattgaaga taataaaaag 2880 aaacatgtgt tcataattcg agaagtccaa aaagaagact tcagtaagaa tttggtatcc 2940 aaagatacaa atgaaaattg tcctaaaggt ggaacaataa aagttgatca aagctctgaa 3000 agaaaagttt tgacaacaga aattgataac aaattaaata tttgtaatgt gcaaacaaaa 3060 acatttgacg caatcagctc acaagcttgc gtacaagtta cgaaaaataa agaagtcata 3120 atattcattg aagatagttc taacttagtg gaacagaatt ggaataatca acgtaatgat 3180 tatttagttt acttctacaa ttcgaaactt gcaacaaagg ttgcgagata cagagttgtt 3240 atgtgtcgca aagcaaacaa agaatttgga aattcaaatt ccaatttaaa actaatgaag 3300 cgaaatttcc cattgaagga aatggaattg tacaacacag gtaatatact ctgcaagttt 3360 aggaagaaaa ctaattgcaa gttgaaaaat aaagtaaccg caaatagcga cgaaatcaat 3420 aatttaacaa agagacaaaa cttatctcat gctatggacg tttatgatcg tcaatcgatt 3480 tcattcaaaa gtttgatgaa aaacttatgg aaatcgaaat atagttggga taaatgcatt 3540 ccagaaataa taaaattgat accgacaaac ggtttcaacc aaatgatcgt aaatgtaaat 3600 tcaaatgtga atttaattaa aattgtttca attgataatt tagagcaaca ggctcatgat 3660 ttggaaacaa caaatgaagt gcaaatgtgc aagttaatca aacaaagggt ttgactagaa 3720 gaatgaatga aacaaactta gacgcaatta gaagactaat tggcaagctg aaactgaaat 3780 tttgaagtaa ctttgtatta ggatcatatg agcatgtcga caaattaata cgaagcattg 3840 atgttttcaa tggagaacaa ttacataaat agtaacacat tattagaaac gaacaagcat 3900 atgatacaga taaagatcat ttttatgaaa ataaagtaag gcataaatta caaaaacaaa 3960 ataagactca gaaatttact agcgtagaat cgagtttgtt tggtaaatct aggacgattt 4020 acgggtcccg gta 4033 // ID BEL-3_DWil-I repbase; DNA; INV; 5602 BP. XX AC scaffold_177548; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_DWil_; KW BEL-3_DWil-LTR; BEL-3_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5602 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_177548; Positions 6549 948. XX CC Positions [4658-5233] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2092..4200 FT /product="BEL-3_DWil-I_1p" FT /translation="MADPDFRTNASIDILLGNDCIWLVLTGEKLCDGQGHP FT IAVNTIFGWVITSVYTSRLQTSSTSLLTTVNIDGLLQRFWELEEVPSHTTV FT NEEDEKVETHFRRTHNRNSNGRYIVDLPFKEQNPQFADTFQGARSRFLAVE FT RLLMKDSSLKLKYTQFMREYIQLGHMKEVAADEDTNVCKKSFYLPHHPVMG FT AKLRVVFDGSFQDRDGRSLNDALHIGPCIQRNLFNVCLRFRMHRFVFSADI FT VKMFRQILVAPEHQNFQRILWRDDPEEPLKHYKLTTVTYGTACAPFLAVRV FT LEQLTTDHEYEFPRAGRILLDDFYVDDVLTGAMNEQELLDIKEKLILLMSR FT AQLELSKWVSNSKRIASNEGNKIDFSKMAAKVLGLHWRPGEDVLTYKVSLS FT EHMTCTKRQVLSDTARIFDPLGILAPVVIKFKILLQELWLLNLDWDSELPT FT KQAHLWQTFRTDIFTLRNLKIPRFVDNQPDDIELHGFSDASLKAYSAVIYN FT RVAHLDGSIGVALIAAKTRVLPLKQKSLPRLELCGALLLARLFQAVRSGLR FT QKDVKIHAWTDSTIVVSWLSHQPCKLKTFVANRTAEILETIPRSAWRHVST FT KENPADCASRGMLASDLINFDLWWKGPSWLHDCTLYSETMNRPPTCCSILK FT PAAELEFKTNILTTISKDIEDSPFEVLLGRTSSWIRLVRVVAYVFRFVHRI FT KQV" FT CDS 4166..5260 FT /product="BEL-3_DWil-I_2p" FT /translation="MYSASFIESSKSKYSGPLKFDEVTRARKFCVRQAQNQ FT FNDDRQLLLRKQSLERRSILNKLSPFVDEDGIIRVGGRLERSQLPADAKHP FT VVLPKQHRLVELLLKHEHKVNLHPGVSAMFVIIRQKYWIIGARNLIRRITH FT ECLNCFRQRHHTANQFMANLPAVRVRQAFPFEHTGCDYAGPILLKVHKGRN FT PRKEKGYICLFVCMVTSALHLELATDLSTDTFLAALKRFMARRGKCSQMYS FT DNGRNFVGAKRALEMVQLLASREHNDVVSKALADQGINWNFIPPYSPHWGG FT KWESAVRSVKLHLRRVIGTNVLTFEQMHTLLSQIESVVNSRPLCTGSDTEI FT AYLSPAQFLIGRTYTAVPEDSY" XX SQ Sequence 5602 BP; 1615 A; 1252 C; 1320 G; 1415 T; 0 other; tcataatttg gtcctgcgag ccggatcgtt gggcattgtt tgtttgttac tcattgctta 60 ataatttgtt gcaacgtggt tgcaattaat tgttaaccta aaagttagta gtcgatagtt 120 ctggcctact atcgaataga caatcgcgtg cttattggaa aaagcgcgga caaacggtgg 180 cgccactcgc ctatcggtat ccgttatcga tataattgac gagatcgtga gggaacacat 240 ctctgggaga aacacgcgtc ttgaaatttt ttcttgacat ctcatcatgg aggaaatgaa 300 aatattcata aaggccagag gaagattgaa gacaaccatt acccgcatat atggatatgc 360 agagaatcct gcacaggatg ccgacgtctt cgcaattgac accaaattgg aaactttgtc 420 caatgcgtgg aacgagttcg tgaagtctgg cgacgaattg gccaaatacg acaaactaga 480 gggttatgtg gaccctaccg aagacttcgg gatatacgag aagaaatatg aaatagcaaa 540 tgcccgcctt aaggctttac gagctgcctt ggctccgtca ggcaagttgg aagaaatcgg 600 cgcggctggt aacgatggtt cgctcttcca aggaattctg caacaaatgc agaatcaaca 660 gcaacaacta caactccaga tgcagggtca aatggaacac cagcagcgtc aaatggaact 720 ccagatgcag agtcaaatgg aacaccagca acggcaaatg gagcagcagc aactattggt 780 tgagagattg ctctcgccac agcaatcttc gtcttcgtca ggctcacaat cggttgcaac 840 accggcaaca agagattgtg agttatcgaa aatcaacatc aagctcttcg gtggagatta 900 caaggaatgg catgccttta aagacctgtt tgagagtacc attcacggaa aggcaacttt 960 gacgaatatc caaaagtttt atcacctgaa gtgatgcgtc ataggagaag cagcgatact 1020 catccagcat ttgcccgtag tggatgcaaa ttacgacact gcatgaaaat cattgagtga 1080 tcgatatgaa aagccccgtt acttggtgaa tttactgata gacacattca ccgcttgcca 1140 aaagctgttg ggcaaaactc atcatcgcta cgcgctctga catgcggagc cagtgatgtt 1200 attcgggctt tagacgcttc tggacaaacg ggccgtgact gctggctcat atacttgatc 1260 attaacaagg ttgatgaggc tacccgacgt gaatggatcg acaaaagcca agatatcgaa 1320 aatcctacca ttgaagagtt gctcaagttt ttggactcac gatgcgatcg ccttgagctc 1380 agtcaagtta tcgatcatgt agcgagcaaa ccgagggatg aaaagggaaa aaggcaagca 1440 ctcaccatgc ttgccattac aggtaattgt atgaaatgtc atagtactga gcacatactg 1500 tcggcttgtc ctcaattcca gagcttgaat gtcaagcaac gttacacctt tgcgaaggag 1560 aaacatatgt gttttaattg cttgagaact ggacatggag taagcgcttg tggctcgaag 1620 tcttcttgca agtatggcaa gcgtcgccat cattctatgc tgcattctga aggccattct 1680 aggagtctga ggagtcccca gcggcatcag cgatagcaat gatgaatgtt accagaccca 1740 gtaccccaaa tggccaacga agaagcactc tacctactgc attggtacac gtccgaaatt 1800 ctcaaggagc cttgctgacg tgccgtattt tattggatag cgcctcagac ttgtcattca 1860 tttcggaacg gtgcgtacag acgttggggc ttgcacgttc gccatttcga gttgccacat 1920 cgggcatctc agacgtcaag gcaggaataa ccagaggttt ctgctctctc cagatgatgt 1980 ctcgtgtgtc gagtcattgc atcgatatta aagctcacat tttaggaaag atcaccattg 2040 gcacgtgaaa acattgatgc ttcatcactg tcagtattta atggatatta aatggctgac 2100 cctgacttca gaactaacgc ctcaatcgac attttgctgg gcaatgattg catttggtta 2160 gttctcaccg gagagaagtt atgcgatggt caagggcacc ctatcgcagt taacaccata 2220 tttggatggg taatcacatc ggtatacact tcaagattgc aaacatcatc tacgtcattg 2280 ctgactacgg ttaatataga tggactactg caaagatttt gggaactaga ggaagttcca 2340 tcacacacaa ctgtaaacga agaggatgaa aaggtcgaaa cgcacttccg gcgtacccac 2400 aaccgcaact cgaatggaag atacatagtc gatttgccct ttaaggaaca aaacccacaa 2460 tttgcggata catttcaagg agcgagatcg cgctttctcg cagtcgagcg tcttctaatg 2520 aaggactcca gcttgaagtt aaaatatact cagtttatga gagagtatat ccagctgggt 2580 cacatgaagg aagtagctgc agacgaagac acaaacgtat gtaagaagtc attctatttg 2640 ccacaccatc cagtcatggg cgccaagctg cgggtggtgt tcgatggatc atttcaagat 2700 cgggatggac gatctcttaa cgacgcactt cacattggcc catgtataca acgcaacctc 2760 ttcaacgttt gtctgcgttt tcgaatgcac agattcgtat tttctgcgga catcgtcaaa 2820 atgttcagac agattttagt ggcccctgaa catcagaatt ttcaacgaat cctttggaga 2880 gatgaccctg aggaaccatt gaagcattac aagttgacca cagtcaccta tggcacggca 2940 tgtgcaccgt ttctggcagt tagagtgctg gaacaactga caactgatca cgaatacgag 3000 tttccaaggg cgggtaggat acttttggat gatttctacg tcgatgacgt actgacagga 3060 gccatgaacg aacaagaact gctcgacatt aaggaaaagc tgatactcct tatgtcacgg 3120 gcacaattgg agctaagcaa atgggtctcg aatagtaaac gaattgccag caacgaggga 3180 aataaaatcg atttctcaaa gatggcagca aaggtgcttg ggctgcattg gcgtcctgga 3240 gaagacgtcc taacatataa ggtttcatta tcggaacata tgacttgtac gaagcgacaa 3300 gtgctatctg acacggcacg catctttgac ccactcggca ttttggcgcc agtggtgatc 3360 aagtttaaaa ttctgctcca agaattatgg ctgcttaatc tggattggga ctcggaactg 3420 ccaacgaagc aagcgcactt atggcaaacg ttcagaacgg acattttcac gttaaggaac 3480 ttgaagatac cacggtttgt agacaatcaa cccgatgaca tcgaactgca tggattttca 3540 gatgcttcgt taaaggctta ctctgctgtc atttacaacc gtgtggcaca cctagatgga 3600 agcatcggag tggcactcat tgctgctaaa acaagagttt taccattaaa gcagaagtcg 3660 ttgccacgat tggagctttg cggagctcta ctattggctc gtttattcca agcggtaagg 3720 tctggtctgc gccagaagga tgttaagata cacgcatgga ccgattctac aatcgttgtg 3780 tcatggcttt cgcatcaacc ctgtaaactt aaaacatttg ttgccaaccg aacggcagaa 3840 atactagaga ccataccacg tagcgcttgg agacatgtca gcactaaaga gaatcctgca 3900 gattgtgcat cacgaggcat gctagcgagt gacttaatta actttgatct atggtggaag 3960 ggaccatctt ggctgcacga ctgtacacta tactcagaga caatgaatag accaccaact 4020 tgctgctcaa ttctcaagcc agcagcagaa cttgagttca aaaccaatat cttgacaacc 4080 atttccaagg atattgagga ttccccattt gaggtactac tcggaaggac atcatcttgg 4140 atcagactgg ttcgagtggt cgcatatgta ttccgcttcg ttcatcgaat caagcaagtc 4200 taaatattca ggacctctaa agttcgatga agttaccaga gcaaggaagt tttgcgtacg 4260 acaggcacaa aatcaattca atgacgatag acagctgctt ttgaggaagc aatctttgga 4320 gcgacgatca attttaaaca aactatctcc cttcgttgac gaagatggaa ttatacgtgt 4380 gggtggacgt ttggaaaggt cgcaattgcc tgcagatgcc aagcatccag tagttttgcc 4440 gaaacagcat cgccttgtgg aattgctgct gaaacatgag cataaggtca atctgcatcc 4500 aggagtttca gcaatgtttg taataattcg ccagaaatat tggatcattg gcgcacgaaa 4560 tcttattcga cgtatcacac atgagtgttt aaactgtttt agacagcgcc atcacacagc 4620 aaatcagttt atggccaatt taccggctgt tcgagtgcga caagcctttc catttgagca 4680 tactggatgc gattatgctg gacctatctt gttgaaggtt cacaagggac gaaaccccag 4740 gaaggaaaag ggctacattt gcctgtttgt atgtatggtt acttcagctc ttcatctgga 4800 gctggccact gatctcagca ccgacacatt cttggcggca ttaaaacgtt ttatggctcg 4860 tcgtgggaaa tgctcccaaa tgtatagcga caacggacgc aactttgtcg gtgcaaaacg 4920 tgcactagaa atggtacagc ttctcgcttc acgcgagcac aatgatgtcg tttcaaaagc 4980 ccttgcggat caaggcatca actggaattt tattcctcca tattcaccac actggggtgg 5040 taagtgggag tcggcagtac gctcggttaa gctgcatttg cgacgtgtaa taggaacgaa 5100 cgtcctcacc tttgagcaga tgcacacact gctttcgcaa atagaatctg tcgttaattc 5160 gcgaccattg tgcacaggat cggatactga aattgcctac ctgtcaccgg cacagttctt 5220 aattggcaga acatatacag cagttcctga agacagctat tagcagattc cgaccaatcg 5280 attgagctac tggcagcatg ttcaagcaat gcttcaagga ttttggaaac gttggcatca 5340 agaatacttg acatcgctac aacagcgccc gaaatggacc accaagttac caaacattgc 5400 gatcggtaat cttgtactga tcaaggactc taacgcgcct ccatcggcat ggctcttagg 5460 acgggtaact cagctgttca caggagctga tggcctcgtc tgagcggttc aggttctcac 5520 caaatcagga caagtcactc gccctatcac aaagattgca gccctaccgg gttgagaaac 5580 ggagtttcag gggggggcgg ga 5602 // ID SACI-7 repbase; DNA; INV; 4519 BP. XX AC BN000785; XX DT 02-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni Gypsy-type Saci-7 LTR retrotransposon (EST). XX KW Gypsy; LTR Retrotransposon; Transposable Element; Boudicca; KW Saci-3; SACI-7. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4519 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000785; Positions 1 4519. XX FH Key Location/Qualifiers FT CDS 2..697 FT /product="SACI-7_1p" FT /translation="ATKYAHIVGALPIDVATEVGDLIDNVPETDPYDKIKA FT TVIHRTSQSDEKRLQQLLTACELGDKRPSQLLRHMRQLAGSYKLDEALLKQ FT MWLQRLPYNVRQILSISGASVSLDDLADMADKMIEIYPDSHGVSAIQSSNA FT ENMSDALNIQQQITLLTQQLATLQATVATIHSRPPRSTSRRRSVSRHRLRS FT PKRAAGICWYHSNYGEKARCCTKPCNFKTNNPIYQGNEVARQ" FT CDS 1018..3792 FT /product="SACI-7_2p" FT /translation="MTNKRLIDVNTRLQVNVALSNDNEIQGVRIMKPVNNI FT FNDILTEYQCITKSDYQKESNPQLVQHHITTTGPPTRARARRLPPNKLQFA FT KREFEHMMQLGIIRQSNSPWASPLHMVPKKDQDWRPCGRYRRLNNQTIPDR FT YPIPHLHDFSLNLHGKQIFSKLDLVRAYHQIPMAPEDIEKTAVITPFGLFE FT FLRMPFGLKNAAQTFQRFMDEVTRGLDFVFVYIDDVLIASSSTEEHIQHLH FT TLFERFKSYGVVINPSKCIFDASSLEFLGHHIDSQGIKPLEDKVKAITSYP FT EPTSVKSLRRFLGTCNFYRRFLPNCADVLQPLTDLLKNDKSGTKKEKNQIF FT KLPTDAKVAFEKAKSMIANATMLQHLNTDPTTLLIQCTDASQKAVGAVLQQ FT RVNNTITPIAFFSKRLSPAQERYSTFGRELLAMYLAVKHFNFLLQGCDFII FT MTDHKPLCHSFSTSYDKNSPREARQLDYISQFTTDIRFIKGHTNIVADALS FT RRDINTMVLNHDISLETLAKLQADDAELKACKEKSSLDLKPVPIPLSDAFI FT MCDTSTNNNRPFVPHACRRKIFQHLHGLSHPGIRATTKLITERFVWPKINS FT DVKRWTRNCLQCQRSKTQKHTYSPIGKFPIPDKRFQHVHLDLVGPLPPSNS FT FTHILTAIDRFTRWAIACPIRDTSAETVASVFLDRWITNYGVPSIITTDRG FT PQFQSVLFQEFTKLLGVNHIKTTAYHPAANGMVERFHRQLKSSLMAQADSS FT KWSDALPLVLLGIRSTIKEDIGCTAAELVYGTTLTLPGQLVNYEPTTHGDS FT THFASRLLQMMQNIRAIPPREYNNPVHLDKHLETCKFVFVRVDAVKKPLTP FT PYEGPYKVIERTCKYFIINKNGNNETISIDRINQLSTNPLKPLMTIRRTNN FT HSHQLLRNKRKRKNSALHPLV" XX SQ Sequence 4519 BP; 1446 A; 1081 C; 790 G; 1202 T; 0 other; ggcgactaaa tatgcccaca tcgtcggtgc actacctata gatgtagcaa cggaagtcgg 60 tgacctgata gataacgtac cagaaacgga cccctatgac aaaataaaag ccacagtaat 120 ccaccgtact tctcagtctg atgaaaaacg cttacagcaa ctactcaccg catgcgagtt 180 aggggacaaa agaccctcac agttgcttag acatatgaga caactagcag gctcatacaa 240 gttagacgaa gcattactta agcagatgtg gttacaaagg ttaccatata acgtcagaca 300 aatcctcagc atatcaggag cctctgtcag tcttgatgat ttagcagaca tggctgataa 360 gatgatagaa atatatcctg atagtcacgg cgttagcgcg atacaatcct caaatgcaga 420 aaacatgagc gacgcactca acatccaaca acaaataaca ctgttaacac agcagctagc 480 aacactgcag gcgactgttg ccaccatcca ttctcgacct cccagatcta cgtccagaag 540 gagatcagta tccagacatc gtcttaggtc acctaagcgg gcagccggaa tttgttggta 600 tcattcgaat tatggtgaga aagcacgttg ttgcacaaaa ccgtgcaact ttaagacaaa 660 caacccaatc tatcagggaa acgaagtggc cagacagtaa cggcggcagc tgctactggc 720 caacatgtta gccgcctatt tcacttcagg gatcgcattt ctggctcgga ctttttattc 780 gatactggag cagaaatcag catcatccca ctccatctct cccgcagaca acagacaaca 840 agcactaaac tgtccctaat agcggctaac gaatcagtta tcaaaactta cggagaacaa 900 tctcttatat tagacctcgg gctccgcaga agattcacat gggtattcat agttgcccaa 960 gtcaaacgac ccatcctcgg ggctgatttt cttagtgcgt acaacctgtt agtagatatg 1020 accaacaaga gactaatcga cgtaaacaca cgtttacagg taaatgttgc gctctccaat 1080 gacaacgaga tccagggtgt tcgaataatg aaacctgtga acaacatctt caacgacatt 1140 ctaacagaat accagtgtat tacgaaatca gattaccaaa aggagagtaa cccacagctg 1200 gtgcaacacc atatcactac caccggacca cccacccgcg ccagggcgag aagactccct 1260 ccaaacaagt tgcagttcgc taagagagaa ttcgaacata tgatgcaact tggaataatc 1320 agacaatcta acagtccttg ggcctcgccg ctgcatatgg tccccaagaa agatcaggac 1380 tggcgcccat gtgggagata tcgcagattg aataaccaga ccatcccaga caggtatccg 1440 atccctcatc tacatgattt ttcgttaaac ctgcatggaa aacagatttt ttcaaagtta 1500 gatctcgttc gagcatacca tcaaataccg atggcacctg aggatatcga aaaaacagcc 1560 gttatcacac cttttggctt gtttgaattc ttgaggatgc ctttcggact taaaaacgcc 1620 gctcaaacct ttcagagatt tatggacgaa gtaactagag gattagattt tgtgtttgtc 1680 tatatcgacg atgttttgat cgctagctct tccactgaag aacatattca acacttacac 1740 acgctttttg aacgtttcaa aagttatggt gttgtaatca acccttcaaa gtgtatattt 1800 gacgcatcat cactggaatt cctgggacat cacattgatt ctcaaggaat taaaccgctt 1860 gaagataaag tcaaagccat caccagctat ccagaaccaa cctcagtaaa atcactgcgt 1920 cgtttccttg gaacatgtaa tttttatcga cgatttctac caaattgtgc cgatgtacta 1980 cagccattga cagatttact gaaaaacgat aaatcaggta ccaagaaaga aaagaaccaa 2040 atattcaagc tacctaccga cgccaaggta gcatttgaga aagctaaatc catgattgct 2100 aatgctacca tgctccaaca cctgaatact gaccccacaa cactgttgat ccaatgcacg 2160 gatgcttcac aaaaagccgt cggggcagta ttacaacagc gggttaacaa cacgattacc 2220 cccattgcct ttttctccaa gagattatcg ccagcacaag aacgttatag tacttttgga 2280 cgcgaactct tagctatgta tttagcagta aaacatttca actttttgct acagggttgt 2340 gatttcatca ttatgacgga ccacaaaccc ctttgccatt cttttagcac atcgtatgac 2400 aaaaattccc cacgagaagc cagacaactt gattatatct cccaatttac cacagacatc 2460 aggtttatca aaggtcacac aaacattgtt gcagatgcac tatctaggag ggatatcaat 2520 acgatggtac tcaatcatga catcagcctt gaaaccttgg ctaagttaca agcagacgat 2580 gcagaattaa aagcctgcaa agaaaagtct agcttagatc tcaaaccagt accaattcct 2640 ctttcagatg catttatcat gtgcgacact tcaacaaaca acaatcgtcc gtttgtccca 2700 cacgcttgca gacgaaaaat atttcagcac ttgcacggtc tttcacatcc aggcatcaga 2760 gcaacaacaa agctcattac cgaacgtttt gtgtggccta aaatcaactc ggatgtcaaa 2820 cgttggacac gtaactgtct ccagtgccaa cgcagtaaga cgcaaaaaca cacctacagt 2880 ccaattggga agttccctat ccctgacaaa cgcttccagc acgtgcattt agatttagtt 2940 gggcctctgc caccatcaaa ctctttcaca catatcctca ccgcaataga taggttcaca 3000 agatgggcca tagcatgccc cattagagac acatcagcag aaacagtcgc ttcagtcttc 3060 ctcgaccgct ggataactaa ctatggagta ccatcaataa tcacaacaga ccgcggtccc 3120 caattccaat ctgtcctttt ccaggagttc actaaacttc taggggtcaa ccatattaaa 3180 accacagctt accacccagc agctaatggc atggtggaaa gattccatcg ccaattgaaa 3240 agctcactga tggcgcaagc tgactcatca aagtggagcg atgccttacc gcttgtactc 3300 ctcggtattc gttcaaccat aaaggaagac attggatgta cagccgccga gttagtctat 3360 ggtacaactc taacgttacc cggccaacta gtcaactatg aacctacaac gcacggggat 3420 tctactcact tcgcaagtcg cctattacaa atgatgcaga acattagagc aataccacct 3480 cgagaataca acaacccagt ccatctcgat aaacatttag aaacgtgcaa atttgttttt 3540 gttcgagtag acgcggtgaa aaagccactg acaccaccat atgaaggtcc gtataaagta 3600 atagaacgca cttgcaaata tttcattatc aacaaaaacg ggaacaatga aacaatttcc 3660 atagaccgaa taaaccagct ttctacgaac cccctcaagc cactgatgac aatacggcga 3720 acaaacaatc actcccatca attgttaaga aacaagagga agaggaaaaa ctcagcacta 3780 caccctcttg tttaacacga tctgggagac ttattaaaaa acccaaacgt tatgtacatt 3840 ttatggacta acacaatcaa tcagttgtat gcattacttt gtgaagttac gcaatcttct 3900 aataaaagat atttattttc cctacatgta catttaataa ctacatttat ttgttatcac 3960 tggttttgcc aatttttttt taataagaaa aaattgttat agtaataaac gagtgaatgc 4020 taactaacta tttattaaat tcatccattt ttttattttt tattttatga ctctattaac 4080 agttttgcac acttttatta ttattagcaa accaaagctt tttaatgctc gtttttcagt 4140 tacattttgc gtagcatagc ctatattatt aacgtgtcat ttatccctat ctccaccttt 4200 tttttccgtt attacagtaa atactatttt atgtacattt atatacaata tttctcaatt 4260 tctgttcttc atattgtaaa cagtgttacc tccggacact ctggggggag ctatgtagag 4320 atagaacctt aaaacctatt ttgtaaataa tttttcatag ttttcgcatg tattttcata 4380 ttctgtacat taagcattaa acaattgtac ttgttgttta agtgtatcac tttgaacact 4440 tgtaacttga cttccttccg ttttgagtac aacatatcgt ggcactgtta ctgctcctca 4500 ataaatacac tctaattct 4519 // ID Gypsy-146_AA-I repbase; DNA; INV; 4178 BP. XX AC supercont1.6; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-146_AA_; KW Gypsy-146_AA-LTR; Gypsy-146_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4178 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.6; Positions 2262804 2258627. XX CC Positions [3343-3807] - Integrase core CC 'GAATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 97..3864 FT /product="Gypsy-146_AA-I_1p" FT /translation="MAETNGGSASGGQPQPTNMTEAVVHILTNQQALMRQL FT AQQLTATQIAVNNLSRDESVLDSLSSNMAEFTYDKENGHTFDAWFSRYVDL FT FDKDAGKLDDAAKVRLLLRKLSPPDHERYNSFILPKLAREFTFEETVSKLK FT SLFGATVSTFRRRYNCLQTNKDDGDDYLAYSCKVNKACVDFKLSELTEEQF FT KCLIFVCGLKSKQESEIRMRLINKLNESADLTLQQVVEQCNTLVNLKQDTV FT LVEGCSSSVNAIAYPKRTKPKRLASADKSHEQPKTPCWSCGGMHFSKDCRF FT KDHRCNECGKYGHREGYCACFSSKSSNTAASTEKKKKKMRNQPLARTVTIR FT SVGQRRKFVDLQLNNVPLRLQLDTGSDISIISHRSWVKIGRPSAKPAVFCA FT RTASGEPLQLIAELECNITLNGITRGGKCFIASPNVHLDILGIDWMDLFGL FT WNHPIASFCNQITSQPTHQMVALQAQYPEVFTSEMGLCNKTPVKLVLKGNP FT KPVFRPKRPVAYSMEQAVEDELLRLQGLGVLKKVDFSDWAAPIVAVRKPNG FT TVRICADFSTGLNSVLEANHYPLPLPEDIFAKMANCRIFSHIDLSDAYLQV FT EVDPACQPLLTINTHKGLFQFTRLSPGIKSAPGAFQQLMDAMLAGLEYTVG FT YLDDILVGGRNEEEHQRNLQLILQRLKEYGFTVRIEKCSFGMQQVKHLGQI FT LDGNGIRPDPEKTTAIVNMPPPHDVSSLRSYLGAVNHYGKYVKDMRNLRHP FT MDQLLKAGTKFEWTPACQASFNRFREILQSPLLLTHYNPKMPIIVSADASA FT FGLGARIAHQFPDGTVKAVYHISRSLTAAESNYSQVEKEGLALVFAVTRFH FT RMLFGRKFTLETDHKPLLAIFGSKKGIPTYTANRLQRWALTLLLYDFEIRY FT ISTESFGHADVLSRLMDRRMRPDEEIVIANLEMEYSIKSVINESLEVFPLS FT FKTVQAETKADETIQQVIRFVNTSWPSKKTDLSDPSVQQFYLRRDSLTIVA FT DCLMYGERLVVPPKFRKRVLQQLHKGHPGVERMRSIARQYVYWPHIDADIA FT NLVQTCTACASVAKTDRKTSLESWPAPEKPWQRVHLDYAGPQDGWYYLILV FT DCFSKWPEVVRTKEITTAATIRMLRSIFARFGIPETLVTDNGTQFTSGAFE FT SYCEKNAIVHLKTAPFHPQSNGLAERFVDTFKRSLKKITAGGESLDEAIDT FT FLQCYRSTPCRSAPAGKSPAEILVGRPVRTSLAHLDYLDYFEANIN" XX SQ Sequence 4178 BP; 1126 A; 1072 C; 1017 G; 963 T; 0 other; gtggcgacga ggtacagaag ttttcgtgcg aaaaagttgc gcgttcgtcg aaaaataccg 60 catttttgtg gatcggaaga ccggggataa agtgcaatgg cggagactaa cggaggatca 120 gcgtcaggag ggcagccgca gccaacaaac atgacagagg ctgtggtaca cattctgact 180 aatcagcaag ccctgatgag gcaacttgcg cagcagctga ccgccacaca gattgccgtc 240 aataacttgt cccgcgacga atcggtgctg gactctctgt caagcaacat ggcggagttc 300 acctacgata aggagaatgg gcacacattc gacgcttggt tttcccgtta cgttgacctc 360 ttcgacaaag atgccggcaa attggatgac gcagcaaaag tgcgacttct tctgcgaaag 420 ttgagcccac ccgaccatga gcgctacaac agcttcatcc ttccgaagct cgctcgggag 480 tttaccttcg aagagacggt ctctaaactt aaatcgctgt ttggtgccac agtttccaca 540 ttccgacggc gatataactg cttgcagacc aataaggacg acggagatga ttacctggct 600 tactcctgta aggtaaacaa ggcgtgcgtc gacttcaagc tatccgaatt gacggaagag 660 caatttaagt gcctcatttt cgtttgtggc cttaaatcaa agcaagagtc tgaaattcgg 720 atgcgcctaa tcaacaagct gaacgaatcg gcggacttaa ccctgcagca agtggtggag 780 caatgcaaca ccctagtcaa cctgaagcag gatacggtgt tggttgaagg ttgttcgtca 840 tcggtgaatg cgatcgctta ccccaagcgg acgaaaccga agcggctggc gtcggcagat 900 aagagccatg agcaacctaa gaccccctgc tggtcatgcg ggggcatgca tttcagcaag 960 gactgccgat ttaaggacca tcggtgcaac gaatgcggaa agtatgggca tcgtgaaggc 1020 tactgtgcat gcttttcgtc caaatcaagc aatactgcag ccagtacgga gaaaaagaag 1080 aagaagatgc ggaatcaacc attggcgagg acggtaacaa tccgaagcgt cggccagcgt 1140 aggaaattcg tcgatctcca actcaacaat gtccctcttc gtctccagtt ggacacaggg 1200 tccgacatct ccatcatttc gcatcggtca tgggtcaaaa ttggtcggcc gagtgcaaaa 1260 ccagcagtat tttgtgccag aacagcgtcc ggagaaccac ttcagctaat tgcggaactg 1320 gaatgcaata ttaccctcaa cggcatcaca cgcgggggta agtgttttat tgcatctcct 1380 aacgtccatc tcgacattct aggcatagac tggatggact tatttggttt gtggaatcat 1440 ccgatcgctt cgttctgcaa ccagataacg tcgcagccaa cacatcagat ggttgctctt 1500 caagcgcagt atcctgaggt gtttaccagt gagatgggtc tatgcaacaa gactcccgtc 1560 aaactggtac tcaaaggcaa ccccaaaccg gtgtttagac caaaacgtcc ggtagcatat 1620 tccatggagc aagcagttga ggacgaactg ctgcgactac aaggcttggg cgttttaaaa 1680 aaggtggatt tcagcgattg ggcggcaccc attgtcgcag tacggaagcc aaacggaacc 1740 gttcgcatct gtgcggattt ttcaacaggc cttaacagcg tgctggaggc aaatcactac 1800 ccattgcctt taccagagga tatatttgca aagatggcca actgtcggat ttttagtcat 1860 attgacttgt ccgatgcgta tctgcaggta gaagtggacc ctgcctgcca accactcttg 1920 accatcaaca cgcataaggg cctcttccag tttacacggt tgtcaccagg tatcaaatcc 1980 gcccctggag ccttccaaca gctgatggac gcaatgttgg ctggcctcga gtataccgtt 2040 ggttatctgg atgacatcct ggtaggtggt cgcaacgagg aagagcacca gcgcaacttg 2100 caactcattc tccagcggct caaagagtac gggttcaccg tgcgcatcga aaaatgtagt 2160 tttggcatgc agcaagtcaa gcacctgggg cagattttgg atggtaacgg aattaggcca 2220 gatcctgaga agacgacagc aatcgttaat atgccaccac cgcacgacgt ttcttcgctt 2280 cgttcttatt tgggagccgt aaatcattat ggcaagtacg tcaaggatat gcgcaactta 2340 cgacacccca tggatcagct gctaaaagca ggaacaaaat tcgagtggac gcccgcttgc 2400 caagcatcat tcaatcgatt tcgagaaatc ctgcaatcgc cgttactcct tacgcactat 2460 aaccccaaga tgcctataat tgtatccgcc gatgcatctg catttgggct gggagctcga 2520 attgcacacc agtttcctga cgggacggtc aaggcagtgt atcacatttc ccgtagcctg 2580 acagcagctg aaagtaacta cagtcaggtt gaaaaagaag gcttagcact cgtttttgcc 2640 gttacccgct ttcatcgaat gctgttcggt cgaaaattca ccctcgagac ggatcacaaa 2700 ccgctactcg ctattttcgg atccaaaaag gggattccga cgtacacggc taatcgtctc 2760 caaagatggg cattgaccct actcctctac gatttcgaaa tccgctacat ctcaaccgag 2820 agctttgggc acgctgatgt tttgtcacga ttgatggatc gccgcatgcg cccagatgag 2880 gaaatcgtca ttgccaatct cgaaatggag tactcgatca aaagcgttat caacgagtcg 2940 ttagaagtat ttccgctatc gttcaagaca gttcaagcgg aaactaaagc cgatgagact 3000 atacagcaag tcattcggtt cgtcaacaca agttggcctt cgaaaaagac ggatctatct 3060 gatccttcag tgcaacagtt ttatttgcgc cgtgattctc tcaccattgt agctgattgc 3120 ctaatgtatg gagaacgact ggttgttcca ccaaagttcc gtaaacgggt gctccagcag 3180 ttacacaaag gtcacccggg tgtggaacgc atgcggtcaa tcgcccgaca gtatgtctac 3240 tggccacaca tcgacgcaga tattgccaat cttgtccaga cctgcaccgc atgtgcctcc 3300 gtagcaaaaa ctgatcgtaa gacatcactt gaatcgtggc ctgcacctga gaagccgtgg 3360 cagcgtgtgc atctagatta tgcaggtcca caggatggat ggtactactt aatcctggtg 3420 gactgcttca gcaagtggcc cgaggtggta cgaaccaagg aaatcaccac cgcagcaacc 3480 atacgcatgc ttcgcagtat ttttgccaga ttcgggattc ctgaaacact ggtaacagat 3540 aatggcacac aatttaccag cggtgccttt gaatcgtact gcgaaaagaa tgctatagtc 3600 cacctgaaaa ctgctccttt ccacccgcag tcgaatggcc tcgcagagag gttcgtggac 3660 acctttaaaa ggtcgctcaa gaaaatcacc gccggagggg aatcattaga tgaagccatc 3720 gacacttttc tccaatgtta ccggtcaaca ccatgccgca gcgcaccagc tggaaaatct 3780 cccgcagaaa tcttggttgg cagaccagtt cgaacatcac ttgcgcattt ggattatttg 3840 gattatttcg aagcaaacat taattagatc tcactgcaac cagcttcgaa ctcgccacca 3900 gcccgatgat caccaagaac aagcgcagtc accagatgac cttcaagttc cccttagtat 3960 tttgctggac aattgcggtc tcaatacgac atcggcttcg gaggttgcag ttggccctgc 4020 tctgcaacca ggacttcaac agcagcctac tcaacgtgaa aaccaacgcc gtcccacagg 4080 caaccaaatg caacaaccga ttcctactcg ccagtctacg agggttcgta gaccgcctgt 4140 gagatatgaa ccttatcaac tttattaaaa gggggagg 4178 // ID EhRLE3 repbase; DNA; INV; 2759 BP. XX AC AB097129; XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Entamoeba histolytica retrotransposon EhRLE3, complete sequence. XX KW R4; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; EHRLE3. XX NM EHRLE3. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RA Kojima K.K. and Fujiwara H.; RT "Cross-Genome Screening of Novel Sequence-Specific Non-LTR RT Retrotransposons: Various Multicopy RNA Genes and Microsatellites RT Are Selected as Targets."; RL Mol. Biol. Evol 21(2), 207-217 (2004). XX DR Genbank; AB097129; Positions 1 2759. XX SQ Sequence 2759 BP; 1283 A; 272 C; 383 G; 821 T; 0 other; aatatgatat atatcatatt aaaataaaaa caattgaaca aataaactat tataaatacc 60 tccgaaagat gaaaagaata gaaatagaaa caagaaagaa taaattgaaa aagaataaga 120 tgcaaatata aaatgatgaa atatggaaaa atgcaacatc taaataaatc taaagaagat 180 gaatatccag ataatatctc cactattaat ttctggaaag gaatctatga aactaacact 240 aatatagatt tcctgaatat taatttatat gaaatattaa atgaaaatag aattaataat 300 agaatgtcaa taatcactcc caatgaacta gactgtgata ctaaattcgt tgataaattt 360 gaaattaaat tgaaaatgat aaaatacaag ggattttgga ttaaatattt agaacaacca 420 aagaaatatt tattaaaatt atttaatgaa tggttaaaca atcctaatga aatacttctc 480 cattttattc aaggaaggac aatattaata tataagaaag gagataaatt agacacacag 540 aattacagac caataacttg tttaaattgt ctgttaaaag tgtatacgtc aatattaaaa 600 atgaaaatag aaaatcaatt gatgttaaat ccaattgaaa aacaattatc attaaatcaa 660 attggatgta agaaatatac atatgcatca aaagaaggat tattatacaa cactataatc 720 aatcaattat tattaaagac taaatggaag tgggtagaaa catattatga cgtctccaaa 780 gcatatgata gtattaatca tcaatggatt aaacaatgtt taatttattt taatatccta 840 ctagtagtaa ttaatagtat attatacata ttaaataata catatcttaa tttatattat 900 aatcaagaaa gtgataatat gattaatgta gaaagaggaa taatacaagg tgacagccta 960 tctccacttt tatttatatt atctattgat gtattatcta aacaattaga taaacaaata 1020 agcaaattaa atattaaaat gaatggagaa gaaaaacaag tccagttaaa ccacatattg 1080 tatatggacg atttaaagat aatgactaat agcctcgatg aaatggaaaa agcccacaaa 1140 ttaactaaag aaatatttaa tgcaatagga ttaaaaatta atttagagaa gagtgggata 1200 atgactaata ttagtggtaa aataaatggt gaattagatg aactaccaag agtaactaat 1260 gataatcctt ataaatatct tggtattgaa ataggagata aaataaacat caataaatat 1320 tgtacaagaa tattaaatga tgtatcttct attttatcat cactaaacac tatgaaatat 1380 agcagcctca atacaataag aaagattaat agtgatataa tctcaaaatt aagatatgga 1440 ttttgtattg tcccatggaa actaggagaa ttggagaaga tagataaaca aataagaaaa 1500 tcattaatac aactccaatt atacagtaga aatattccca aaagtagact atttgttaaa 1560 aagaatgaat tgggattagg attaatgtca ccaagagatg aatgtggtaa agaattatta 1620 agaatatatt taaaatataa gtggagatca agcacagaaa taggagaaat gatagaagca 1680 ataaaagaaa gtccaaatgg aataattaaa agaatgaaga aagcatttgg taagaatatt 1740 aattttaatg aattaatgaa tgtaatagaa ttaaatgaaa gaaagcacaa tgtaaaggaa 1800 atatttgaat ggataaataa tgaattaaat gatagatatt taaatgaatg gaaagaaaag 1860 aaatgtggtg aatatataag atgtgcaaat ggcactttta atgataagaa attaacagta 1920 gccacttgga ggtcattaga tataaaaaga aatgaattcc tccaaatagt aaagatgcaa 1980 gaaggtgtaa ctatgacagg aaatattaaa tcaaaaatac tccaaaatga tgctttaaaa 2040 tattgtaaac attgccctaa cacaatagca tctattagtc atatattgtt aggatgtcct 2100 gtaatgaaaa agaatcaaat atcaaaacat gattatgttt gtaaacaaat atttaaacat 2160 atattaatta tccattttaa tgaatttgat gcaattgatt ttgataatcc tccaaaatgt 2220 ataaataatg ataaaatggt tataacttac aataaagatt taatagttag tgaccattct 2280 ttccatgcta gaagaccaga tatatattat caaaatatga aagaaaagaa aggatatatc 2340 atagatgttg ctatatgtca agataataat ttagaattaa attatattca caaaataaat 2400 aaatacaaag aattacaaga gaaaataaga aataatagag aattaatata tgtagaaata 2460 atacctgtta tattatcaat aaatggttta atacataaag aatcagtaag aagagtaaaa 2520 tcattgaaat taaaataaga cttttccaaa gtcctacgaa caataataat aaaaaatatg 2580 aaagacttaa tgttctatac aggaaattct ggcagcacat tataagaaga acaatttaat 2640 gcgagcatgt tagaattgaa ctctattaca ccaatagagc atattgaaga aattgaaata 2700 atcacttctc agtatgaaga aaatccaatt gaggatatat ctaatatttg acctttttt 2759 // ID CR1-32_HM repbase; DNA; INV; 4349 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-32_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4349 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1860-1860 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(145..855,859..1731,1691..2173,2131..2844, FT 2744..4000) FT /product="CR1-32_HM_1p" FT /translation="MSKNFTITQIKELLDIHENTIIKLLNDKIERLENKFN FT IIVEENKYLKSEVNELKKSVEFINEKYENIKNEAANLKKTNDINNQGNEKK FT KLEDDVIINKLAELEDRSRRNNLRINGIEDCDNESWEESEKKVQEFIKSKL FT GIKYNVDIIERAHRVGRKESDKPRNRPIIIKFLNYKDKSTILEKYTNLKLW FT NQKIYINEDFSERTIEVRKKLFAEAKELRSKGKFAKVIYNKLYTRDIMNYL FT YIHILYYLLIFNMDSNNSNDFEKIFNFFQTNVLNFNEKSDPDLNYFNEINT FT SEFECKYFYPNEVKYFLKIDNINSYLNVVHINIRSLKKNFENFLNVICETD FT NSFNLICVTETWSSNEDFENNSNLQLPGFNAITLERNINKRGGGVLFYVRN FT SLLYNVRRDMSVSDENKEILTIEIINKKSKNLLISCCYRPPTGRTEDLSNF FT LQKKILEKTNLENKKNYLIGDFNLDCFDYYENQNTRKFYNNLFETGTIPII FT NRPTRITNYSSTLIDNILTTDFFNISLKKRNNPLTFLIYRLKKGIIKTDVS FT DHFPIFFSVNLDFKKEMQKSVTIKKRLFNNINLNSFKEHLLLINWEHLNFN FT NNINTIYNTFFKTFYKVYDTNFPMCETVVKTKDISCPWITKGLKKSSKIKQ FT KLYIKYLKSKSVKHKNNYITYKNLXEKLRKNAKKKNTTPTKTSQKCKKKKY FT YSYLINKHKNNSKQIWKIMKEITGKLKTNTNSLPHEIKIDNKLINDSNYIA FT AEFNKFFTTIGTKLANSIPYTDDKSAEFQTSTSSSFHFTELTYDEFEIAFS FT SLKRNKTTGYDDINVNIIIDFCDVIKHIIFKIFRASISQGMFPDCLKIAKV FT FPVFKTGDHTNINNYRPISVLSVFSKILEKIMFNRIYDHLINNKILYENQF FT GFKKKQFNRTRYSPTYARHLRIFLIIKSYTKTNLVLKKNNSTEHAILQLTR FT DISESFKKSHFTLGVFIDLSKAFDTVNHQILIKKLLSYGIKDNTLNWLKSY FT LTNRKQYVYNKESSSQVLMNVTCGVPQGSILGPLLFIIYLNDLHKTSSLTK FT IIFADDTNLFLSHENINILFNNMTNELKKVSDWFKINRLSLNTAKTKWTLF FT YPLYKKHRIPHLLPELYIDNVMIKREKVTKFLGIYIDENLTWKHHINIISS FT RVSKCIGILYKARNILNKRQLTQLYYSFVHCHINYANIIWGSTQKTKLKTL FT FNHQKHAARAINFKNRLTHSAPLLKDMKALNVYQLNIFNILCFMFKCKENL FT SPQVFQYLYSLKPKNKYNLRIDNNLNAPFCKTKYDEFCMSYRGPLLWNKIV FT LPNFDFSIKLNYSSFKTKIKEIIFSIENIFIYF*" XX SQ Sequence 4349 BP; 1787 A; 617 C; 508 G; 1432 T; 5 other; tatatataaa atatttattg atatnatgtc aygttcttat ttatatattt tttcttcaat 60 gcgattttga agcgaacgga cgcgtttttt agaacttttt agaaacttcg taagtttatt 120 tatttacata atattctata taaaatgtca aaaaatttta ccataaccca aattaaagaa 180 ctattggata tacatgaaaa tacaataatt aaacttttaa atgacaaaat cgaaagatta 240 gaaaacaaat ttaacatcat tgtagaagaa aataaatatt taaaaagtga agttaatgaa 300 ctaaaaaaat ctgtggaatt tataaatgaa aaatacgaaa acataaaaaa cgaagcagcc 360 aatttaaaaa aaacaaacga cataaataat caaggcaatg aaaagaagaa attagaagat 420 gatgttatta ttaataaact ggctgaatta gaagatcgca gtcgaagaaa caatcttcga 480 atcaacggta tcgaagactg tgataatgaa tcatgggaag aaagcgagaa aaaagtccaa 540 gaatttatta aatctaagct tggcattaag tataatgtag atataataga gagagcccac 600 cgagtaggaa gaaaagaatc agataaaccc agaaacaggc ctataataat caaattctta 660 aattacaaag acaaaagcac gatattggaa aagtacacca atctgaaact ttggaaccaa 720 aaaatctaca taaacgaaga ttttagcgaa cggacgatag aagtcaggaa aaagcttttc 780 gccgaagcaa aagagttacg ttctaaaggt aaatttgcta aagttattta taacaaactt 840 tatacacgtg atatttaaat gaactattta tatattcata tattatatta tttattaatc 900 tttaatatgg attcaaataa ttctaatgat tttgaaaaaa tatttaattt ttttcaaaca 960 aacgttttaa attttaacga aaagtctgat cctgatttaa actattttaa cgaaattaat 1020 acatcagaat ttgaatgcaa atatttttat ccaaatgaag ttaaatattt tctaaaaatt 1080 gataacatta atagttattt gaacgttgtc catattaata taaggagttt aaaaaagaat 1140 tttgaaaatt ttttaaatgt catttgcgaa actgataatt cttttaattt aatttgcgta 1200 actgaaacgt ggtcttctaa cgaggatttt gagaataatt caaacctcca actcccaggt 1260 tttaatgcaa ttaccttaga gcgaaatata aataagcgtg gaggaggagt acttttttat 1320 gttagaaata gtcttttgta taatgttaga cgcgatatga gtgtttctga tgaaaataaa 1380 gagattttaa ctattgaaat tataaacaaa aaatcaaaaa acctgctaat aagctgttgt 1440 tataggcctc caactggaag aactgaagac ctcagcaatt ttttacaaaa gaaaattcta 1500 gaaaaaacta atcttgaaaa taaaaaaaat tacttgattg gcgattttaa tttagattgt 1560 tttgattatt acgaaaatca aaatactaga aaattttaca acaatttatt tgaaacagga 1620 acaataccaa taataaatcg accaacaaga attacaaatt attcatccac tttaattgat 1680 aatatcttaa ccactgactt ttttaatata tcgcttaaaa aaaggaataa ttaaaacaga 1740 tgtatctgat cattttccta tatttttttc agtaaatcta gactttaaaa aagaaatgca 1800 aaaaagtgtc accataaaaa aaaggttgtt caacaacatt aatttaaact ccttcaaaga 1860 acatttgttg ctaataaatt gggaacattt aaactttaat aataatataa acacaattta 1920 caacactttt tttaaaacat tttataaagt ctatgataca aattttccta tgtgtgaaac 1980 agttgtaaaa acaaaagaca tatcatgtcc ctggattaca aaaggtctta aaaaatcctc 2040 gaaaataaaa caaaagctat atattaaata tttaaaatca aaatcagtaa aacataaaaa 2100 taactatatc acctataaaa acttatwtga aaaacttcgc aaaaatgcaa aaaaaaaaaa 2160 tactactcct acttgatcaa taaacataaa aataattcga aacaaatctg gaaaataatg 2220 aaagaaatta caggaaaact taaaacaaat acaaattctc tgccacatga aataaaaatt 2280 gacaacaaac taattaacga ctcaaactat attgcagctg aatttaacaa attttttaca 2340 acaattggaa ctaaactagc aaacagtatt ccgtataccg acgataaatc agctgaattc 2400 caaacatcaa cttcatcaag ttttcacttt actgagctaa cttatgatga atttgaaatt 2460 gcctttagtt cattaaaaag aaataaaaca actgggtatg atgatattaa tgtgaatatt 2520 ataatagact tttgtgatgt tataaaacac attattttta aaatctttag agcatccatc 2580 agtcaaggaa tgtttccaga ctgtctaaaa atagctaaag tatttcctgt cttcaagaca 2640 ggagatcata caaacataaa taattatcgc cctatttcag ttctttcagt cttttcaaaa 2700 atactagaaa aaataatgtt caatcgaatt tacgaccatc taattaataa taaaatctta 2760 tacgaaaacc aatttggttt taaaaaaaaa caattcaaca gaacacgcta ttctccaact 2820 tacgcgcgac atctcagaat cttttaaaaa atctcatttc acattaggag tatttattga 2880 cttatctaaa gcttttgata ccgtcaacca tcaaattcta ataaaaaaat tattatcata 2940 tggaataaaa gataacaccc tcaattggtt aaaaagctat cttacaaatc gtaagcaata 3000 tgtttataat aaagaatcct cgtctcaagt gttaatgaat gtaacgtgtg gtgttccaca 3060 aggctccata cttggacctc ttctctttat aatatatcta aatgatcttc ataaaacttc 3120 aagcctaaca aaaataatct ttgcagacga cacaaattta tttctatctc atgaaaatat 3180 taatattctt ttcaacaaca tgacaaatga attaaaaaaa gtttctgatt ggttcaaaat 3240 aaataggtta tctctcaaca cagctaaaac taaatggaca cttttttatc cactctataa 3300 aaaacataga ataccacatt tgttaccyga actatatatt gacaatgtaa tgataaaaag 3360 agaaaaagta acaaagtttc taggaatcta tattgatgaa aatttaactt ggaaacatca 3420 cattaacatt atctcttcta gagtttccaa atgcatcggt attctttata aagctagaaa 3480 catactaaat aaacggcagc tgactcaact gtactattca tttgttcact gtcatataaa 3540 ctatgcaaat attatatggg gaagcactca aaagacaaaa cttaaaactc tttttaatca 3600 tcagaaacac gcagctcgcg caattaattt taaaaatcgt cttactcact ctgcgccact 3660 tcttaaagat atgaaagctt tgaatgttta tcaacttaat atttttaata ttttatgctt 3720 tatgttcaag tgtaaagaaa atttatctcc acaggtattt caatatcttt attctttaaa 3780 accgaaaaac aaatataatt tacgaattga caacaatctt aatgctcctt tttgcaaaac 3840 taaatacgac gaattttgta tgtcctatcg tggacctctt ctttggaaca aaatagtttt 3900 acctaatttt gacttttcaa tcaaactaaa ctactcctca ttcaaaacca aaattaaaga 3960 aattatcttt tcaattgaaa acatctttat ctatttttga gtttttatca atttattgct 4020 gatatattat actttattgt attatgtttt actatttacg aatatttcca tgagttcttt 4080 gcaaacttat gtatttaatg gtttgtattt tattactttt tattactgtg ttttgttaaa 4140 cttcatatta tytttgttat tttacgaatt tttataactt cattataata ttacgtattt 4200 atattatatt atactcgata ctaaaaagca ttgtaaaggg cttcatgata agatcctcat 4260 gatcttctag aagtcctgcc gctaaatttg taatatgtta aaatttgtat tattaaaaaa 4320 attatttggc aaaaaaaaaa aaaaaaaaa 4349 // ID Jockey_Ele3 repbase; DNA; INV; 4423 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Jockey clade non-LTR retrotransposon family from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4423 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4423 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 25 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 233..1510 FT /product="Jockey_Ele3_1p" FT /translation="MAEYVNKTLTHPTSASKSPTRVRSLDRISGQSMAYAN FT TDACTSELDFTEGDGDTVASPFISPNQFEPLSDDDEGELPSASAQSKKPSK FT VRLPPIFIMDSTISDIFKLLKDCSTPQSEDFLLKRNKSSVQLLTKSKDVFD FT KTISTLKSKNVPFFTHGTSDNVPAKFVLSGLPLAELSKLKDELTRVNINPS FT DVKVLSTTKTSQDTYALYVLYFPRGSVKIQDLRKTKALFNVAVSWRFYEKR FT ADDAAQCYRCQRFGHGSTNCNLAPKCVKCGGKHLTDGCTLPKKANLSSKQN FT DRSLLKCANCGANHTANFRGCPTRKAYLENLEKRKTKPVQRPPRMTNDDFP FT SLVGQGTALHRTSPNAKGRATYAQISANTPLATSANHCNENLFTISEFLCL FT ARDMFARLSGCRTKEQQFFALSELMIKYLYHG" FT CDS 1506..4178 FT /product="Jockey_Ele3_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MGDLTKLNIVNWNSRSILPKRIEFFDFLNQHQIDVAT FT VTETWLSPKNSFHHPEYHCIRSDRRSNNNERGGGVLIAIKKGIKFSQLDLT FT TGAIETVGIEIQNAAQPIHIIAAYFPGGHRGTSWNTFRRDIDNLVRRPEPF FT FVAGDFNARHRQWNCLRANKAGNILASRASSSDFFIHAPSAFTYNPQNGRR FT PSTLDLVLSNNLVNMSTVSVVNDLSSDHLPGRFDIDLIAPVITVTSTSRCY FT ARANWQLYKRLVNEKIDLTAGVINNLNCSAAIDNSILLLTTTLKEAEAIAV FT PNVPTKMYKDVKVSNSTRQLIQLRNVRRRQWIRTRDPLMLMIVESLNNRIR FT EECMSERNRQFSRTISNLENGAKDVWKISKALRKTVKYSPPLHQIDTNTLI FT SSPSEKANLLAECFASTHRNPLPSDDQTAIAVEESCTLIANTATAVNDVPM FT IRPREIEKIIKSMKQKKAPGKDKIRNALLKNLPRKGLVMLTKIINACLRNS FT YFPKVWKHAVVTAIPKPGKDITLPANYRPISLLPSMSKILERVILSRIEGH FT LEGHNIIPAQQFGFKRGHSTSHQLVRLTQFIKTAFTSGKSAGMVLLDVEKA FT YDSVWQDAILHKMRTANFPLYILKLLQSFLKERSFQVIVKGELSHVQQIPY FT GVPQGAVLSPILYNIFTSDMVMVNGVEYFLFADDTGFVAVDKKAEVVIEKL FT QTAQKAIEDYQRQWRIRINPSKTQAIFFTRRRNQRFLPQTQLTAMHHTVPW FT SDEAKYLGLVMDKKLKFDKHIASILGKCDKLIKMLYPLICRRSRLNSVNKI FT LIYKMIFRPIITYGFPAWHSCAQSRRMKLQVKQNKILKMMLDLPFNFSSDE FT LQEVSHTEHLGSWTTKLLQKFWTGCTISENDLIVNLVP" XX SQ Sequence 4423 BP; 1367 A; 1035 C; 888 G; 1133 T; 0 other; cagtttgaga accactggtt cagaagtaaa caagcctccg cgtactctga atttttggac 60 cgtttttttc cgtcgcgttc gcaagtaaaa ttcgtgcgtc tgcatcctgg ggaaacccaa 120 acccggtagc agtcaggaga agcaggatgc ggccaccagg acgtctcaaa ggatcataaa 180 cagaaatgct actaaaactg tcaatactct aacctcaatc cccaaagaca gtatggcaga 240 gtatgtgaac aaaacactga cccatcctac cagtgcttcg aaaagtccaa cccgtgttcg 300 atctctggac agaatatcgg gtcaatcgat ggcatacgcc aacactgatg cctgtacaag 360 tgaattggac tttacggaag gcgatggtga taccgtcgca tcaccattca tatctccaaa 420 ccaatttgaa ccactgagtg acgatgatga gggtgaactt ccgagtgcaa gtgcacaatc 480 gaagaagcct tccaaagtta gattgccgcc gattttcatc atggattcca caatcagtga 540 tatattcaag ctgctgaagg actgttccac accacagagt gaggatttcc tgttaaagcg 600 caacaaatct tcggtgcaac tcttaaccaa atcaaaggat gtgttcgaca aaaccatatc 660 gacattaaag agtaagaacg ttccgttctt tacgcacggt acgtcagata acgtcccagc 720 taaattcgtt ttgtcaggat tgccactcgc cgaactatcg aaactgaaag acgaactaac 780 gagagttaac attaacccaa gtgacgtcaa agttctgtcg accacgaaga catctcaaga 840 tacgtatgca ctatacgtac tctattttcc tcgcggttcc gtaaagatac aggatctgag 900 gaaaacgaaa gcactgttca atgtagctgt atcgtggaga ttttatgaga aacgtgcaga 960 cgacgctgcg caatgctatc gctgtcagcg ttttggacat ggctctacaa attgcaacct 1020 tgcacccaaa tgtgtgaaat gtggtggaaa acatctcacc gatggatgca cactccccaa 1080 gaaagccaat ctcagcagca aacaaaacga taggtccctg ttaaaatgtg cgaattgtgg 1140 tgcaaaccac actgccaatt ttcgtggttg tccaactagg aaagcctatc tggaaaacct 1200 tgagaagaga aagacaaagc ctgtacaacg cccgcctcgt atgacgaacg atgatttccc 1260 atccctagtc ggccaaggta cagccctcca tagaacttct cctaatgcaa aaggaagagc 1320 tacttatgcg caaatctctg ccaacacacc tctcgcaact tctgcaaacc attgcaatga 1380 aaatctattc acgatttctg aatttttgtg tctcgccaga gacatgtttg cccggctcag 1440 tggttgccga actaaggaac agcaattctt cgctctgtct gagctgatga ttaagtactt 1500 ataccatggg tgatttaacc aagcttaata ttgtcaactg gaacagtagg tcaatcctac 1560 caaagagaat tgaatttttc gattttctaa atcagcacca aattgatgtt gctaccgtaa 1620 cagaaacttg gctatcacct aaaaattcgt ttcaccatcc tgaatatcac tgcattcgta 1680 gtgacaggag atcaaataac aacgagcgag gtggtggtgt ccttattgct atcaaaaaag 1740 gcataaagtt ttcacaattg gacctcacta caggagcaat agaaaccgtc gggatagaaa 1800 tccaaaatgc agcgcaaccg atccacatta ttgccgcata ttttccaggt ggacacagag 1860 gtacaagttg gaacacgttt agaagggaca ttgacaactt agtgagacgc cccgagcctt 1920 tcttcgttgc gggcgacttt aatgcacgtc atcggcaatg gaactgtttg agagcgaaca 1980 aagctggaaa cattttggcc tcccgcgcat cttcctctga tttcttcatt catgccccaa 2040 gcgcgttcac atataatcct caaaacggtc gcaggccgtc aacgctagat ctggttcttt 2100 ccaataactt ggttaatatg tcaacggtgt ctgttgttaa cgatttatca tcggatcacc 2160 tgccgggtcg tttcgacata gatctaatcg caccagtaat cactgttact tccacatcac 2220 gatgttacgc acgagctaac tggcaactgt acaagcgatt ggtgaatgag aaaatcgatc 2280 taacggccgg tgtgataaac aacctcaatt gctccgcagc gatagacaac tccatcctac 2340 ttctgaccac aactctcaag gaagcagaag caatagcagt tccaaatgtg ccaacgaaga 2400 tgtataagga tgtcaaggtc tcaaattcta ctcgccagtt aattcagcta agaaacgtcc 2460 gtcgtcgaca atggattcgc actcgtgatc ctctaatgtt gatgatagtt gaatcactca 2520 ataaccggat tagggaggag tgcatgtcgg aaagaaatcg tcaattctct aggactatct 2580 ccaatctgga aaacggtgca aaagatgtct ggaaaatcag caaagctctt cgtaaaactg 2640 ttaaatatag tccaccactg catcaaattg acaccaacac tctaatatca tctccttctg 2700 aaaaggcaaa cctcctcgca gaatgctttg ctagtactca tcgtaaccct ttaccaagcg 2760 acgatcaaac agccattgct gttgaggaat cgtgtactct catcgctaat acggcaaccg 2820 cggtaaacga tgttccaatg attcgtccca gagagatcga aaagatcatt aaatctatga 2880 aacagaagaa agcacccgga aaggacaaaa tccggaacgc acttcttaag aatcttcctc 2940 gaaaaggatt ggttatgctc accaaaatca tcaacgcatg cttgaggaac tcctactttc 3000 caaaagtttg gaaacacgca gttgtcactg caattcctaa gccaggtaag gacataactc 3060 ttcctgcgaa ctatcgtcct atcagtttgc tgcccagcat gagcaaaata cttgagcgag 3120 tcattttgtc tcgaatcgaa ggacacctcg aaggtcacaa catcattcct gcgcaacagt 3180 ttgggttcaa acgtggtcac tccacaagtc atcagctagt tcgtctcacc cagttcatta 3240 aaactgcatt caccagtgga aagtcagctg gcatggtact tttggatgtg gaaaaagctt 3300 acgattccgt ttggcaagat gcaattctgc acaaaatgag gactgccaac tttccactgt 3360 acatcctgaa gcttctgcaa tctttcctta aagagcgtag cttccaagtc atcgtcaagg 3420 gagagctgtc tcacgttcaa cagataccct atggtgttcc acaaggtgct gtgctgagcc 3480 ctattttgta caatatcttc acgtcagaca tggtgatggt gaacggagtc gaatatttcc 3540 tattcgctga cgacactggc tttgtggctg tcgacaaaaa agcagaggtg gtcatcgaaa 3600 agctacagac agcccaaaaa gctattgagg attaccaacg gcaatggaga attaggataa 3660 acccgtcgaa aactcaagct atcttcttca ctcgacgacg taatcaacgt ttcctaccac 3720 aaacgcaatt aacagcaatg caccatactg ttccttggtc ggatgaggcc aaataccttg 3780 gcctagtgat ggataaaaag cttaagtttg ataagcatat tgcaagcata ttggggaaat 3840 gtgacaaact cattaaaatg ctttatcctc taatttgtcg tcgatcacga ctcaacagcg 3900 tgaataaaat actaatatat aaaatgatat tccggcctat catcacatac ggctttccag 3960 cttggcacag ctgcgctcag tctcgtcgca tgaaacttca agtcaagcag aacaagatct 4020 tgaaaatgat gctggattta ccattcaact tttcatctga tgagctacaa gaagtatcac 4080 acactgaaca cttgggttca tggaccacaa agctgctcca aaaattctgg actggatgca 4140 caatttcgga aaacgatttg atcgtcaatt tggtgccata atttgtctgt gatatagttt 4200 taagtaaata atccttcctc tttctgtccc atctctaaaa gtaatgattg caggtttttt 4260 gctagtattt tcctgcagag ctgctcaaac gatattgtat aaattacgtc aaatgacaaa 4320 ctgaattatg aaatagttgt aaggaaaacc aaaactctat tgttaatcta atgatacact 4380 ttgtaaactg acaaaacaac aaatatacat gaattgaatt gaa 4423 // ID Gypsy-26_DWil-LTR repbase; DNA; INV; 172 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_DWil_; KW Gypsy-26_DWil-I; Gypsy-26_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-172 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 903412 903583. XX SQ Sequence 172 BP; 67 A; 25 C; 41 G; 39 T; 0 other; tgtggcaacc ctgccagacc gataacggat atcgaagaga tatcggacgt aaggtggcaa 60 cgctcgggga ttttggagag tgacagcgaa caagggtaga cgtacaaaga actaaagaat 120 aaattaatat ttaaataaag tataattaaa ctataaatta cgagttgtta ca 172 // ID L2B-4_CQ repbase; DNA; INV; 4705 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4705 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 145-145 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 278..1603 FT /product="L2B-4_CQ_1p" FT /translation="MSTSVQCSVCSAAIKEGNEKIYCFGPCGKIMHPKCCG FT EINMDGIKAMQTNRGLKYFCHDCRNSQCDYNNVLGICNSILTKVNETEKMM FT KKEITEQIREYTKTANEANEKGFVLLKDYIMSENKKLEDKLSAKTNTSGRN FT RKEETSLSGPSGNTRSQSKQKTAKAGETGSVTQNYAEVLINGRKESEIDQN FT REIPKEKETVIIIKPKEGDQSAKKTKQQLKEKLNRTENHILNVREGRNGTV FT ILGVLDNVESVAKSVQEKCGELFNVTVPKPKKPRLKIVKVAEELNEKELRE FT ALIEQNDVSDDIELRLVTSFKNGLDEYCVWTYIIEVDSSSHEFLLELEKVN FT IGWERCKIVECFGIIRCFKCCCYGHKSNECKSDREVCSICAENHRTNECKS FT EIECCANCSTMNRMRGLKLSTDHVAFSRECPVYKRQVARKRHQIDYTR" FT CDS 1607..4432 FT /product="L2B-4_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="QSTKNKNNKQKWSVISLNCGGLSTNFEEFKLLVQTYK FT PRLVLVTETHITEQLGDEHLGIDGYNHVSCFSDSRHTGGVMIYSQSNIEYN FT VLCKSQIGRNWFLSIDVTRGMKIGRYATLYHSPNEPHATFLKILEENWLEQ FT FVDLSKFIVVVGDFNIDWSRETSDASNLKQICDSLGLKQNVQDYTRITRLT FT KTMIDLVFSNCDIGVRTVSEMKISDHETLCWYNVNECTVASKSMKKITCWK FT NYSKSALVNLLRQNRNLTFNFENIHENAINLISNLKKCVQTLITEKLVSVD FT KCCKWFSVELLQMKQRRDQAYLKHVTEGGANSWAIYKAFRNEYSSTLRRKR FT KEYIQKSIEENKNNSKELWKILKKMIKPESAPANTICFDNVNEKNRVEIPR FT KFNEYFVKSVEEISNSIEEVNAPPELECQFSATNMSFFETITLEKLKSVIW FT SLESKSGSDNISASVLRDAFEVIGQDLLNVINQSLISGVVPDSWKESVVIP FT IEKVQGTMNASEFRPINMLPLYEKVLELTVKEQLLNYLNVNKLLAAEQSGY FT RKNHSCETALNLVISKWKKQLESRKKIFAVFLDLKRAFETISREKLIAVLK FT QYGITGNVINWFSSYLVNRSQRTKFNSVISVSIETLVGVPQGSVLGPLLFI FT LYINDMKRVLKHCDINLFADDTVLFIASDNAVAAIEKLNFDLENLSRWLKF FT KKLKLNVSKTKYMIIANNKASLPNLIVSINGERLECVTEIKYLGVIIDDKL FT DFKKHIDYSIKKIAKKFGVLCRLRNDLTQWSLIYMYKALISPHFDFCPSII FT FLANEQQMNRLQKIQNKIMRLILKCNRRTPRQSMLNALQWLSVRQRVTFLT FT VVLIYKIANGMAPEYLQKILIRGSDIHQHNTRMASEIRAASFLLTSTQNSL FT FYKGIKLYNTLPRDFKNEKNFAVFKRKTLGFVKENVGI" XX SQ Sequence 4705 BP; 1772 A; 664 C; 956 G; 1313 T; 0 other; agtagtacag tagtacagtg tagacgcact tttattgtgt aaacaaaaat tgtctaaaaa 60 ttgatgtaaa aggtggcttt caagcttagt ttttggactc aggtgtgaat cgtgattgat 120 tctcgcgacc ggtgtagtta gtggttcgcg aaacgcatga ttaatcggaa aaaagtgata 180 agaaaaattg cgaaacaaac ggagaaacaa gaagccaggc aaagtgtttg gtgtgcttgt 240 aatagtgtgg ccgaagggcc agctataaat ctgcaccatg tctacgagcg tgcagtgtag 300 tgtttgttcc gcagctatca aagaagggaa cgaaaaaatt tattgttttg gaccgtgtgg 360 aaagataatg cacccaaaat gctgtggtga aataaatatg gatggaatca aggcaatgca 420 aacaaacaga ggcctgaaat atttttgtca tgattgcaga aactctcaat gtgattacaa 480 taatgtgctg ggaatttgta atagtatttt gacaaaggtc aacgaaactg aaaagatgat 540 gaaaaaagaa attacggaac aaataaggga atatactaaa acagcaaacg aggcgaacga 600 aaaaggattt gtattgctaa aagattatat tatgagtgag aataaaaaac tggaagataa 660 attgtctgcc aaaaccaaca ctagtgggag aaataggaaa gaagaaacat cgctctctgg 720 cccaagtgga aacactcgct ctcaaagtaa acaaaaaacg gcaaaggctg gtgagacggg 780 tagcgtgacg cagaactatg ctgaggttct aataaacggt agaaaggaaa gtgaaattga 840 tcaaaatagg gaaatcccta aagaaaaaga gactgtgatt ataattaaac ctaaggaagg 900 ggaccaatcg gcaaaaaaga ccaagcaaca gttgaaagaa aaattaaacc gcactgaaaa 960 ccacattcta aatgtgagag agggcagaaa tggaactgtt atactgggtg tcttggataa 1020 cgtggaaagt gtagctaaat cagtccagga aaaatgtgga gaactcttca atgtgaccgt 1080 tccaaaacca aaaaaaccaa gactgaaaat agtaaaagtg gctgaagagc ttaatgaaaa 1140 agagttaaga gaggccctca ttgaacaaaa cgacgtatct gatgatatag aactgcgatt 1200 agtgacaagt tttaagaatg gtcttgatga atactgtgtg tggacctaca tcattgaagt 1260 tgattcttct tctcatgaat ttttgcttga attagagaag gtgaacatag gatgggaaag 1320 atgtaaaata gtggaatgct ttggcattat tcgatgtttt aagtgttgtt gttatggcca 1380 taaaagtaat gaatgtaaga gcgataggga ggtttgctcc atttgcgcag aaaaccacag 1440 aacaaatgaa tgtaaatctg aaatagaatg ttgtgctaat tgttcgacaa tgaatagaat 1500 gagaggttta aaattaagca cagatcatgt tgcatttagt cgtgaatgtc ctgtctataa 1560 aagacaagta gcccgcaaac gccatcaaat tgattacact cgatagcaat caacgaaaaa 1620 caagaacaat aaacaaaaat ggtcagtgat tagcttaaac tgtggcggac tgtctacaaa 1680 ctttgaagaa tttaaattat tagtccaaac atataaacca agattagtgt tagtaacaga 1740 aactcatata acagagcagt tgggagatga acatttggga atcgatgggt acaaccatgt 1800 gagttgtttt tcagactcaa gacacacagg aggtgtaatg atttattcac aatcaaacat 1860 tgaatacaat gtgctttgca aaagccaaat cggacgaaat tggttcttgt caatcgatgt 1920 aactaggggt atgaagattg gaagatatgc aacgttgtat cattccccaa atgagccgca 1980 tgcaacattt ttaaaaattt tagaagaaaa ttggcttgag caattcgtag acttgtcaaa 2040 atttatagtg gtggttggtg acttcaatat tgactggtca agagaaacaa gtgatgcatc 2100 taatttgaaa caaatctgtg attcattagg tttaaaacaa aatgtccaag attatacacg 2160 cataacgagg ttgaccaaaa caatgattga tctagtcttt tctaactgtg atataggtgt 2220 gcgtactgtg agtgaaatga aaatatctga ccatgaaact ttgtgttggt ataatgtgaa 2280 tgagtgtact gttgctagta aatcaatgaa aaagatcact tgttggaaaa actattctaa 2340 atctgctcta gtaaatcttt tgagacaaaa cagaaattta acatttaatt ttgagaatat 2400 tcatgaaaat gcaatcaact taataagtaa tttaaaaaaa tgtgttcaaa cgttaataac 2460 tgaaaaatta gtctctgttg ataaatgctg caagtggttc tcagtggagc tattacaaat 2520 gaagcaaagg cgggaccagg cctatttgaa acatgtaaca gaaggaggag ctaattcatg 2580 ggcaatttat aaagcattca gaaatgaata ctctagcact ttaagaagaa aacggaaaga 2640 gtatattcaa aagtcaatcg aagaaaataa aaataacagt aaagagttat ggaaaatatt 2700 aaagaaaatg ataaaacctg aaagtgcacc ggctaatact atctgctttg ataatgtcaa 2760 tgaaaaaaat agagttgaaa ttcctagaaa atttaatgag tatttcgtaa agagtgtgga 2820 agaaataagc aattctattg aggaagtgaa tgcaccacct gagcttgagt gtcaattttc 2880 tgcaacaaat atgtcgtttt ttgaaacaat aacattagaa aaactaaaat ctgtgatatg 2940 gtctttagaa tcaaagtctg gatctgacaa tatcagtgct tctgttttgc gcgacgcttt 3000 cgaagttata ggacaagatc tattgaatgt gattaatcag tcattaatat ctggagtggt 3060 gcctgacagt tggaaagagt cagtggtcat tcccatagaa aaagtccagg gaacaatgaa 3120 tgcatcagag tttcgaccaa taaacatgct tcctctctat gaaaaagttt tagaactgac 3180 agttaaagag caacttctaa attatttaaa tgttaacaaa ctcctagcgg cagagcaatc 3240 tgggtaccgt aaaaaccatt cgtgtgagac agctcttaac ttggtgattt caaaatggaa 3300 aaaacaattg gaaagcagga aaaaaatatt cgcggttttt ctagatttaa agagagcttt 3360 tgagactatt tcaagagaaa aattgattgc agtccttaaa caatacggaa ttacaggaaa 3420 tgtgattaat tggttttctt cttacctagt caatcgaagt caacgaacta agtttaactc 3480 tgtgatatca gtttccatag aaacgttagt tggagtacct cagggaagtg ttttaggtcc 3540 ccttttattt attttatata taaacgacat gaagagagtt cttaaacatt gtgatattaa 3600 cttgtttgcg gatgatactg tgctctttat tgcctcagac aatgcagttg cggctattga 3660 aaaattaaat tttgatttgg aaaacctctc tcgttggcta aaattcaaaa aactcaaact 3720 aaatgtatcg aaaacaaagt atatgataat tgctaataac aaagcaagtc tgccaaattt 3780 gatagtcagt attaatggcg aaagattaga atgcgttacc gaaattaaat atttgggagt 3840 aatcattgac gataaactgg actttaaaaa acacatagat tatagtatta aaaagattgc 3900 aaaaaaattt ggtgtactgt gtcgtttacg caatgattta acccagtgga gtttaattta 3960 tatgtacaaa gcacttattt caccacactt tgatttctgc ccatccataa tatttctagc 4020 taacgaacag caaatgaata ggttacaaaa aatacaaaat aaaatcatgc gtttaatttt 4080 aaagtgtaat aggcgaactc caaggcaatc tatgttaaat gcactgcaat ggttatcggt 4140 gaggcagcgt gtaacatttc taacagttgt tttaatttat aaaatagcaa acggaatggc 4200 acctgaatat cttcaaaaaa tattgattcg tggttcagac attcatcagc acaacactcg 4260 tatggcttca gaaataagag cagcatcgtt tcttttaaca agcacacaaa attcattatt 4320 ttataaaggt ataaaactat acaatacatt gcctagagac tttaaaaatg aaaagaactt 4380 tgcggtgttc aaaaggaaga cattaggttt tgtaaaagaa aatgttggta tatgatttga 4440 gtactagttg agtgtagatt aagtgacttt ttttttaaat gtattaattg tgtgaatatc 4500 tttgtatgat gataaaatta gcaaaaaaaa atagataaat aaaataaggt tgatggacgc 4560 gcatgggctt aaaaaaaata aataataata gcaacataac aatgcaaaca agaagtgaat 4620 taactaaaca aaatatctaa caagataaat cagacactct gctattccca tgggcggggg 4680 taagtggagg gccatcatca tcatc 4705 // ID BEL-63_CQ-I repbase; DNA; INV; 6238 BP. XX AC AAWU01017436; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-63_CQ_; KW BEL-63_CQ-LTR; BEL-63_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6238 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 279-279 (2011). XX DR GenBank; AAWU01017436; Positions 53196 46959. XX CC Positions [4803-5387] - Integrase core CC 'GTCTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 435..5741 FT /product="BEL-63_CQ-I_1p" FT /translation="MPPKEATKVLAERLAKRKGILALRDNIEKFVSKFNGE FT LDMCQVSVRLDSLDQLKNDFLDVQNNIEKLDEPNNLDANIAQRVDFEQRYC FT DIKGFLLSKRPVDLNQTLEMSTMSHSHSHNFHLRLPKIELPQFDGDYSRWL FT SFRDTFVSMIHANSDIPTVAKLQYLLQSLGPTAKKPYESVDIKADNYFTTW FT SAILKRYDDKRHLKRQLFRGLYDLPAVPEECASAIHNLADDFQRHVKALEK FT LDELVQHWDTPLVNMLSYKLDQATLRAWEEKNSEKADVKYDDMIEFLYQRV FT RVLEASGSESKQSASAKVAGSSSKQSRPRFVANAASSSSSNGSCLLGCSEN FT HHLRRCPVFLGKDVQQRRDLVAQKRLCWNCLSAGHPARKCGSKFSCQTCKQ FT KHHSLVHVSSPTRVSSTPAVTSVTIDQPSNTASANAAPVGSSSTTPQVSMA FT VQTACNTVLLETVVVNVVDDHGKKHKARALLDSASMSNFMSGQLAKNLYNR FT PIKVDVAVAGIGVSTQRVRSAITAAIESRNTTFSTKLEFLILRQPSAELPT FT VPIDVSTWKLPDIVLADQRFNVPEPIDLVIGSESFWELHTGRKISLGAGLP FT WVVETHFGWAVSGTASAETTCIPRICQLSAVNDRLEAAIQRFWEVETIPEG FT PAHSVDENRCEEFFAKTTTRDATGRYVVRLPLTDNPEIVLGDSKTIAERRF FT LGLERRLDRDPATKEAYHKFMHEYLSLGHMELVDGPVDYRKPHCYLPHHPV FT FKQSSTTTKVRVVFDASCKTSSGYSVNEKMLVGPVVQPDLLSTLLKFRFQA FT VALVADVEKMYRQVLVHPDDRPLQRIVWRFNPTDPISTYELRTVTYGTASA FT PFLATRTLAQIAQDNKELHPDAAEAVARDFYVDDLISGAPDVETAIKLRRE FT ISTMLAAAGFPLKKWASNAPEVLRDIPTEDLALLPVYDLQDGQTVTTLGIV FT WDPQTDTLCFRVQLPPPASVLSKRKVMSYIAQIWDPFGLVGPTTLKAKLFM FT QRLWALKHKGEACAWDTPLPLKIQLEWKEFHMLLHMLSQVKVPRFVSTPGA FT TIQLHFFSDASEKAYGACCYVRAAAANGVTTRLLSSKSKVASLSSHDSINK FT LELNGGRLSTQLYKKVKKALNLSDAVPVFFWTDSLTVLHWLNSLPSRWKTY FT VANRVSQIQLCTSRHVWGHVPGVDNPADYISRGLSPLELLDCLCWWFGPLW FT LKQNPDYWPKTVLTAADASAVTKEERKVPIVAMTTVEADFSNRVFSLTSSF FT PKLRRDIAFCQRFLSKLRERSVQRRDAPEQYAPLTVGKVTTPLSVTPLSTH FT ELQHAEFSLVRLAQQEHYAEEISDLSGGERVSKSSPLKWLNPFVDENRILR FT VGGRLRNAKLTHESKHPIVLSARHPLTALLASYFHEKLLHAGPALMLATLR FT QKYWILGGRNLVKSVYHRCHTCFRSKPTLVIQSIADLPASRVAPTRPFSVC FT GVDYCGPFYIKSPVRNRAPTKVYVAIFVCFSTKAVHIELVSDLSTPAFIAA FT LRRLVARRGQVVELNSDNATAFKGAANALHRVYQMLKVEQVDRNEIFTWCA FT ENGIRWKFIPPRAPHFGGLWEAAVKSAKTHLLKTMGTTSANYEDMLTLLAQ FT IEMCLNSRPLTPMPDDPADAKVLTPGHFLVGSNLQAVPEMDLKEIPDNRLN FT NWELTQKRVQQIWARWYPEYLQQLQSRATKGCNPPVAVEVGRVVIIKEDNV FT PPASWPLGKIVKLHPGKDGIVRVVTLKTAAAKEVVRAVARIALLPTPSQ" XX SQ Sequence 6238 BP; 1406 A; 1769 C; 1711 G; 1352 T; 0 other; ttagtggtcc ttcgagccgg atcgaaggcc atacggaccg ggaagtgcgc gcattgggtg 60 attacgggcc acgcggccgt ccgagctact gtcccgcgga tcgggatcgg gatcggcgcc 120 atcgcgttgg gaggttgtac caagacgggt acacctcgtt cgtcgtgtgt gtgggacata 180 ccgtacgcgg tatcgtgcgc gcgcgcgcgt ggaattacgg tcacgcgtgc cagtgtcgtt 240 gtggtagaaa taccaggtcg cgcgcgagtg ttacaggtcg ttctgctgtt gctgagcagt 300 atctgtggtg gtgctgctgc tgttgaagag gtgcttgacc tgcgctgcgg gtgattgacc 360 tgtaagcgtc gtttgctgga acgagatcgt cgcgggagaa gacatctttg ctggtagctg 420 ttcaagccgt cgagatgccg cctaaggaag cgaccaaagt tctggccgag cggttggcca 480 agcgcaaggg gattctcgcg ctgcgtgaca acatcgagaa gtttgtgagc aagttcaacg 540 gcgagctgga catgtgccaa gtgtcagtgc gactcgattc cctggatcag ctgaagaacg 600 acttcctgga cgtgcagaac aacatcgaga agttggatga gccgaacaat ctggacgcga 660 acattgctca gcgggtcgac ttcgagcagc gctactgtga catcaaggga tttttgctct 720 ccaagcgacc tgtcgatctc aaccaaacat tggagatgtc tacgatgtct cactcgcact 780 cgcacaattt ccacctgcgt ctcccgaaga tcgaattgcc gcagttcgac ggagactact 840 cacgctggtt gtccttccgg gacacttttg tgtcgatgat ccatgccaac tcggacatac 900 caactgtagc caagctgcag tacctgctgc agtcgttggg accgacagcc aagaagcctt 960 acgaatccgt cgatatcaag gctgacaact acttcacgac gtggtcagcg atactcaagc 1020 ggtacgacga caagcgtcac ctgaagcgac agctgttcag aggtttgtac gatttgccag 1080 cagtgccaga ggagtgtgct tctgcgatcc acaacttggc tgatgacttc caacgccacg 1140 tgaaggcgct cgagaagttg gacgaacttg tgcagcattg ggacacgccg ctggtaaaca 1200 tgctgtcgta caagttggat caagcgactt tgcgagcgtg ggaggagaag aatagcgaga 1260 aggccgacgt taagtacgac gacatgatcg agttcttgta ccaacgggta cgcgttttgg 1320 aagctagcgg atcggagtcc aagcagtcgg catcggcaaa ggtggccggt tcctcatcga 1380 aacaatctcg gccgaggttc gttgccaacg ctgcctcttc ctccagctcg aacggttcgt 1440 gcttgcttgg gtgttcggag aaccatcatc tacgcagatg tccagtgttt cttgggaagg 1500 atgtacagca acgccgagac ctggttgcgc agaaacggct gtgctggaac tgtctctcgg 1560 ctggccaccc ggccaggaag tgtgggtcca agttctcgtg ccaaacttgt aagcagaaac 1620 accattcact tgtgcacgtc tcgtctccga cgagggtttc atcgactcct gctgtgactt 1680 ccgtcacaat tgatcaaccg tccaacaccg cgtcggcgaa cgcggccccg gtaggctcaa 1740 gttctaccac tccacaagtc agcatggcag tgcaaactgc atgcaacacg gttctgctgg 1800 agacggttgt ggtgaatgtg gttgacgatc acggcaagaa gcacaaggcg cgagcgttgc 1860 tggactcagc ttcgatgtca aacttcatgt cgggtcagct ggcaaagaac ctctacaatc 1920 ggccgatcaa ggtggacgtc gcagtagctg ggattggagt ttcaacacaa cgagttcgca 1980 gcgcaatcac agccgcgatc gagtccagga acacaacgtt ttctacgaag ctggagttcc 2040 tgatcctgag gcagccttct gcagagctgc caaccgtgcc gatagacgtg tcgacatgga 2100 aacttccgga tattgtgctg gcagatcaac gcttcaacgt acctgaaccg atcgacctgg 2160 tgatcggcag tgaatcgttc tgggaactgc acactgggcg gaagatctcg cttggagcag 2220 gtcttccatg ggtagttgaa acgcactttg gttgggcggt atctggcacc gcatccgctg 2280 aaactacctg catcccacgc atctgtcagc tttcagctgt gaacgatcgt ctggaggccg 2340 caatccagag attctgggaa gttgagacga tcccggaagg ccccgctcac tctgtcgacg 2400 agaatcgctg cgaggagttc ttcgccaaga cgaccacccg tgacgctaca ggaaggtacg 2460 tggttcgtct tcccctgacc gataaccctg aaatcgtcct gggggattcc aagaccatcg 2520 cagagcgccg ctttctcggc ctggaacgtc gattggatcg agatcctgcg acgaaggagg 2580 cctaccacaa gtttatgcac gagtatctgt cgctgggcca catggagttg gtggacggac 2640 ctgtggacta ccggaagcca cactgctact tgccgcacca ccctgtcttc aaacaatcaa 2700 gcacaacgac gaaggtcagg gtcgtcttcg acgcgtcgtg caagacgtcg tccggctact 2760 cggtgaatga aaagatgctc gttggaccgg tcgttcaacc ggacctgctg tcgacgctgc 2820 tgaagttccg gtttcaagcg gtcgcactgg tggcggacgt cgaaaagatg taccgtcaag 2880 tgctcgtgca tcctgacgat cggcctctgc agcggattgt ttggcgattc aacccgacgg 2940 atcccatctc gacctacgaa ttgcggacgg tcacctacgg cacggcaagc gctccctttc 3000 tggcaaccag aacgctcgct cagatcgcgc aagacaacaa agaactccac ccggacgctg 3060 ctgaagctgt tgcacgagat ttctacgtcg acgacttgat ctctggtgcg ccggacgtgg 3120 aaacggcgat caaactgcgc cgcgagatat ctacgatgct tgctgctgct ggttttccgc 3180 tcaagaagtg ggcgtctaac gcccctgaag tgctacgaga cattcccacc gaggatcttg 3240 cacttctgcc ggtttacgac ctgcaagatg gtcaaactgt aaccacactc ggcatcgttt 3300 gggacccaca gacagacacc ctctgtttcc gggtccaact accaccccct gcgtcggtgc 3360 tctcaaagcg caaagtgatg tcttacatcg cacagatttg ggacccgttt ggactggttg 3420 gtccaacaac attgaaggcg aagctgttta tgcagcgctt gtgggcgctc aaacacaaag 3480 gagaagcctg cgcgtgggac accccgctcc cactcaagat ccagctggaa tggaaggagt 3540 tccacatgct gctgcacatg ctcagccagg tcaaggttcc gcggttcgtc tctacgcctg 3600 gcgccactat ccagttgcat tttttctccg acgcgtcgga aaaggcttac ggcgcctgct 3660 gttacgtccg agctgcggcc gccaacggag ttaccactcg tctgctgtcc tcgaagtcca 3720 aggtcgcgtc gctgtcgtcc catgactcga tcaacaaatt ggagctaaat ggtggccggc 3780 tctctacgca gctgtacaag aaagtcaaga aagccctgaa cctgtctgat gccgttccag 3840 tcttcttctg gacagattcg cttacggttc ttcactggtt gaactctctg cccagccgtt 3900 ggaagacgta cgtcgcgaat agggtgtccc aaatccagct gtgcactagc cgccatgtct 3960 ggggacacgt acccggcgtc gacaatcccg ccgactacat ctctcgagga ctgagcccgc 4020 tggaactgct cgattgtctg tgttggtggt ttgggcccct ttggctgaag caaaatccgg 4080 actactggcc aaaaactgtg ttgacagccg cagacgcttc agctgtaacc aaggaagaac 4140 gcaaggtgcc gatcgttgcg atgaccaccg tcgaggctga tttcagcaac cgtgttttca 4200 gcttgacctc gagctttcct aagctccgtc gcgacatcgc attctgccag cgctttcttt 4260 ccaagcttcg tgaacgctcg gtgcaacggc gcgatgcacc tgaacagtac gccccgctta 4320 ctgttggaaa agtaacaacc cctctatcgg ttacgccgct ctctacgcac gaactccaac 4380 acgctgaatt ctcgctggtg cgcctagcgc agcaagaaca ctacgctgag gagatttctg 4440 acctctctgg aggtgaacgg gtgtccaagt cgtcgccact caagtggttg aacccgtttg 4500 ttgacgaaaa ccgtatcctg cgagtcggtg gccggcttcg gaacgccaaa ctgactcacg 4560 aaagcaaaca cccgattgtt ctctctgcgc gacacccgct aacggctctg ttggctagtt 4620 acttccacga gaaactactg cacgcaggtc ccgccctcat gctggccact ctccgccaga 4680 agtactggat cttgggcgga cgaaacctgg tgaaatctgt ctaccatcgc tgccacacct 4740 gtttccgcag taaacccaca ctggtaatcc agagcatagc agatttgcca gcatcgaggg 4800 tcgcaccaac tcgtccgttc tctgtctgcg gcgttgacta ctgcgggccg ttttatatca 4860 agtcaccagt acgcaaccgc gcccccacca aagtctacgt ggccattttc gtttgctttt 4920 cgaccaaggc ggtccacatc gagctggtga gcgatttatc cacaccagca ttcatcgcag 4980 ctctacgtcg actggtcgcc cgcagaggac aggttgtcga gttgaattct gacaacgcga 5040 ccgccttcaa gggtgctgcg aatgcactgc accgtgtgta ccagatgctg aaggtcgaac 5100 aagtcgatcg gaatgagatt ttcacctggt gcgcagagaa cgggattcgc tggaaattca 5160 tcccaccacg cgcgccgcac ttcggcggac tgtgggaggc ggcggtcaaa tccgccaaaa 5220 cacacctact caagacgatg ggcaccacta gtgccaacta cgaggacatg ctcacactgc 5280 tggcgcagat agagatgtgc ctcaactctc gcccactcac gcccatgccg gacgacccgg 5340 cagacgctaa agtcctgacc ccaggacatt ttctcgtcgg gagcaacctc caagccgttc 5400 cggaaatgga cctgaaggag attccagaca accgcctcaa caactgggag ctgacgcaga 5460 aacgagtcca gcaaatctgg gcacgctggt accccgagta tctgcagcag ctgcagtcac 5520 gtgcgacgaa aggttgcaac ccacctgtcg ccgtcgaggt cggcagagtc gtcatcatca 5580 aggaagacaa cgttccgcct gcaagctggc ccctcggaaa aatcgtgaag ctgcacccgg 5640 gaaaagacgg catcgttcgg gtcgtgaccc tgaagacagc agcggccaag gaagtcgtac 5700 gtgctgtcgc ccgcatcgct ctcctgccaa caccaagcca atgatcatca tcaaagtcca 5760 ggtagagcaa cccattgtgg gagatcttgc tcgatgcgca cctgcgcata gtccaggtaa 5820 ttatgtttcc aataatccac atcaacccgt tcgatctacc tggtctttct ttttccggtt 5880 tggcgaggat gagcagtcga gacagagcga gacaactcgc accaaatcca ggtagagcaa 5940 cccacttggg agaacttgct cgatgcgcac ctgcgcagtg tacaggtaat tgcggttcca 6000 ataacccaca ctcgcaagtt tgatctacct ggtctttctg tctttctcgg tggcgaggac 6060 aaacgctgag gctgaacgag ccttgctgca cttcacctgc atcggagcat cccacacctg 6120 tttgctgcac caacgaaaca gtgcaatttt gaattaggga aaatgctaga gtagggaagt 6180 gtgattagaa attaagtagt tttagttgaa atgctgccac atttcaaggt ggccggaa 6238 // ID Gypsy-198_AA-LTR repbase; DNA; INV; 309 BP. XX AC supercont1.72; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-198_AA_; KW Gypsy-198_AA-I; Gypsy-198_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-309 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.72; Positions 711554 711862. XX SQ Sequence 309 BP; 107 A; 44 C; 79 G; 79 T; 0 other; tgtagggaat aaagggatat tagtgaaagg aattaagggt ttgagagggt taataaagaa 60 aaggctcatg atggttccga gagggttaag aatcgattaa atcgacttac ccttccgcgt 120 gactgtaaaa gggaagggag agaaaatagt ataagtgaga gcaaccagtc agtcttgtga 180 ttgcttttgg tagagtaaga tcggtcctcg ctttcgaaaa atgttggtga gttcggacgt 240 gcaccaaacg taactttctt gacaaaatgc tctacagtac ttaagtaata taaaccgaaa 300 aacactaca 309 // ID Gypsy-114_AA-LTR repbase; DNA; INV; 217 BP. XX AC AAGE02027028; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-114_AA_; KW Gypsy-114_AA-I; Gypsy-114_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027028; Positions 68322 68538. XX SQ Sequence 217 BP; 76 A; 28 C; 62 G; 51 T; 0 other; tgtagtggcg cacccctcgt gcaacgagcg taatgtagag ttcatatgta gagatagaga 60 aaatggagag tagaaaggaa aagagaggag ggaagaggat tttgactgtg actatcaggg 120 tgaacggttg taagcgggaa aagtaagttg tgaaattata tatttaaaac gcgaataaaa 180 cgagtggatt tgctgctatt catccgaaac ctctaca 217 // ID Ronin2_Cis_LTR repbase; DNA; INV; 245 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Ronin2_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-245 RA Smit A.F.; RT "Ronin2_Cis_LTR - Gypsy LTR Retrotransposon from Ciona RT savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000191; 2% diverged copies. XX SQ Sequence 245 BP; 58 A; 45 C; 41 G; 100 T; 1 other; tgacgtgaat tgtaataatt aacattgccg ctgcgcgctc acgttaatta cacgccttgc 60 tttttwttcc tttgcgttgt gctgtgcgac ttctaacctg cagtcagctc gattgtgtta 120 gcatggtgac ttatttacgt tcgcacttat ttgtacttat ttctttattg cagatattat 180 gtttcgttat aatttcgttt aagcttataa taaagttaat gatttacaac tacgaaatca 240 tttca 245 // ID Penelope-2_AAe repbase; DNA; INV; 3832 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Penelope-like element family from Aedes aegypti. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3832 RA Kojima K.K. and Jurka J.; RT "Penelope-like elements from the yellow fever mosquito."; RL Repbase Reports 11(4), 1436-1436 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. Sequences 1-1479 and 2354-3832 are CC terminal inverted repeats. XX FH Key Location/Qualifiers FT CDS join(1397..1813,1767..3560) FT /product="Penelope-2_AAe_1p" FT /note="reverse transcriptase." FT /translation="NFPHPAWCALLFLPTSGVSIRVRSRPVQYVSNKRKHT FT FECYRNRQQHKLEKLKTKQLTQNTPCDNWVLNCTDETIPDFVTRSLQLGSG FT YNNPNPHSAPYVRVLSEIESAIQRNPSADVIRHDVIKCHHQPHPLHQTTVP FT SNAIINHIHYTKQPFHDKHENIRKEIQKSKKYLRDRDDLVVTKADKGKTVV FT VMKRDEYEEKMQALVNDSETYEPLASDPTKKTLKKINTLIDHWHENGFIEY FT SERTKLKVFNCNPPRVYGLPKTHKDGRPLRIINSAIGTATYKMAKFLSKIL FT NHVTGKTEHHIVNSFQFAEEMREQQITNQDVLFSLDVVSLFTNVPVDFALE FT SIRLRWEEIEEHTKLDEESFTEMVKIVLDSTYFQYKGKFYKQKFGIPMGSP FT ISPVVANIVLERIEKDALEKLRTRGIVPRFFKRYVDDCLLCARKEEVEAIL FT NVFNGFHQRLQFTVELEVEGKLKFLDMILRREDNTITTEWFPKDADGRYLD FT FTSVSPFIHKNNTIIALVDRALKLTHAKYRVNTLKTVKQILTGNNYPCFLV FT EKVIRERLHKMYNTLQDKENATNNFITIPYIEGLGEKLSRYLRQHDFKVAY FT KPVDKIKDVLFTKTKDKISKDKNINVVYEIPCGACEKSYVGETSQFLCKRL FT SQHKYNVKTKNISGTGLSQHTIEEGHIFDFDKTKILDKVTNSYTRLITETF FT HIKIRGDDNVVNKQKDSANFKTAYNSIITKLKSITNT" FT CDS join(1789..1445,1565..273) FT /product="Penelope-2_AAe_2p" FT /note="reverse transcriptase." FT /translation="MWLMMAFDHIVTDDVSGWIALYGRFYLREHSHIRSRM FT GIGIVVSTSKLQRSCYEIRYSFVCAIKHPIVARCILGELFCFQLLQFVLLT FT VSVAFKRVFSLIGNVLHRSTSHSNRYAAVLFSASPVCVVDGFGSIQTCVFA FT YWKRTAPVDFALESIRLRWEEIEEHTKLDEESFTEMVKIVLDSTYFQYKGK FT FYKQKFGIPMGSPISPVVANIVLERIEKDALEKLRTRGIVPRFFKRYVDDC FT LLCARKEEVEAILNVFNGFHQRLQFTVELEVEGKLKFLDMILRREDNTITT FT EWFPKDADGRYLDFTSVSPFIHKNNTIIALVDRALKLTHAKYRVNTLKTVK FT QILTGNNYPCFLVEKVIRERLHKMYNTLQDKENATNNFITIPYIEGLGEKL FT SRYLRQHDFKVAYKPVDKIKDVLFTKTKDKISKDKNINVVYEIPCGACEKS FT YVGETSQFLCKRLSQHKYNVKTKNISGTGLSQHTIEEGHIFDFDKTKILDK FT VTNSYTRLITETFHIKIRGDDNVVNKQKDSANFKTAYNSIITKLKSITNT" XX SQ Sequence 3832 BP; 1226 A; 755 C; 727 G; 1124 T; 0 other; atgtgtcagt gaatcatcga tttgttttcg gcgagaatta acgtacaaat aaattgccag 60 attgtccgga cgtttcggtc gttttactgg cctacgacca tcttctgcgg actctaaaat 120 atatttttta acatttagaa cacatcaaaa acataaaatt aacaccttac aatcttctag 180 ccgtttgttt acagttggac ggccataact tttacaaaca tcataataca taaatttcta 240 catacaacat acatacaaat atcttctggt catgtgttgg taatagattt aagcttagtt 300 ataatagagt tgtacgccgt cttaaaattg gcagagtctt tctgtttgtt tacaacatta 360 tcatcgcctc gtatcttaat gtgaaaagtt tctgttatta atctagtgta gctatttgta 420 actttgtcaa gtatttttgt cttgtcaaag tcgaaaatat gtccttcttc tatagtgtgt 480 tgtgacaatc ctgtaccaga tatgtttttt gtttttacat tatacttatg ttgacttaga 540 cgtttacaca gaaattgact tgtttcacca acgtacgatt tttcacatgc accacaagga 600 atctcatata ctacatttat gtttttgtct ttagatattt tgtctttagt ttttgtaaat 660 agtacatctt taattttgtc aactggttta taagccacct taaagtcgtg ttgtcttaga 720 tatctagata gcttttcacc tagtccttca atatatggta tcgttataaa attgttcgta 780 gcgttttcct tgtcttgtag tgtattgtac attttgtgta gtctttccct aataactttt 840 tctacaagga aacacgggta gttgtttcct gtgagaattt gtttaaccgt tttcaacgtg 900 tttacccgat acttagcatg agttaacttt aaggctctat ctactagtgc tattattgta 960 ttgtttttgt gtatgaaagg actaacagag gtaaaatcta aatatctacc gtctgcatct 1020 ttcgggaacc actccgttgt gatcgtgttg tcttcgcgcc gcagaatcat gtccagaaat 1080 ttcagttttc cttctacttc taattcaaca gtgaactgca gccgttgatg gaagccattg 1140 aatacgttca aaattgcctc cacttcctcc tttcgcgcac acaacaaaca atcatccacg 1200 taccgcttga agaaacgagg tacaattccc cgagtacgca gcttctccaa cgcatctttc 1260 tctattcgct ctagtacgat gtttgctacc accggtgata ttggcgaacc catcgggatg 1320 ccaaatttct gcttataaaa tttccccttg tactggaaat aggtagagtc cagtacaatc 1380 tttaccatct cggtgaaact ttcctcatcc agcttggtgt gctcttctat ttcttcccac 1440 ctcaggcgta tcgattcgag tgcgaagtcg accggtgcag tacgtttcca ataagcgaaa 1500 acacacgttt gaatgctacc gaaaccgtca acaacacaaa ctggagaagc tgaaaacaaa 1560 acagctcacc caaaatacac cgtgcgacaa ttgggtgctt aattgcacag acgaaactat 1620 acctgatttc gtaacacgat cgctgcagct tggaagtgga tacaacaatc ccaatcccca 1680 ttcggctccg tatgtgagag tgctctcgga gatagaatct gccatacaac gcaatccatc 1740 cgctgacgtc atccgtcacg atgtgatcaa atgccatcat caaccacatc cactacacca 1800 aacaaccgtt ccatgataaa catgagaata tacggaaaga gatacaaaag tccaaaaagt 1860 atctcaggga ccgagacgat ctcgtcgtca ccaaagcgga taaggggaaa acggtcgtag 1920 tgatgaaacg tgatgagtat gaggaaaaaa tgcaagcttt ggtcaacgac agcgaaacgt 1980 atgagcctct tgcaagcgat cctaccaaga agacgttgaa gaagatcaac acgctcattg 2040 atcattggca tgagaatggt ttcattgagt attctgagag aaccaagttg aaagttttca 2100 actgtaatcc accccgtgtt tacggtctgc ccaaaaccca caaagacggc agaccactga 2160 gaattataaa ttccgcaatc gggacagcga cctataagat ggctaagttt ttgtccaaaa 2220 tcctcaacca tgtcactgga aaaactgaac atcatatagt gaacagtttt cagtttgctg 2280 aagagatgcg tgaacagcaa ataaccaacc aggatgttct gttctctcta gacgtggttt 2340 ctcttttcac caatgtaccg gtcgacttcg cactcgaatc gatacgcctg aggtgggaag 2400 aaatagaaga gcacaccaag ctggatgagg aaagtttcac cgagatggta aagattgtac 2460 tggactctac ctatttccag tacaagggga aattttataa gcagaaattt ggcatcccga 2520 tgggttcgcc aatatcaccg gtggtagcaa acatcgtact agagcgaata gagaaagatg 2580 cgttggagaa gctgcgtact cggggaattg tacctcgttt cttcaagcgg tacgtggatg 2640 attgtttgtt gtgtgcgcga aaggaggaag tggaggcaat tttgaacgta ttcaatggct 2700 tccatcaacg gctgcagttc actgttgaat tagaagtaga aggaaaactg aaatttctgg 2760 acatgattct gcggcgcgaa gacaacacga tcacaacgga gtggttcccg aaagatgcag 2820 acggtagata tttagatttt acctctgtta gtcctttcat acacaaaaac aatacaataa 2880 tagcactagt agatagagcc ttaaagttaa ctcatgctaa gtatcgggta aacacgttga 2940 aaacggttaa acaaattctc acaggaaaca actacccgtg tttccttgta gaaaaagtta 3000 ttagggaaag actacacaaa atgtacaata cactacaaga caaggaaaac gctacgaaca 3060 attttataac gataccatat attgaaggac taggtgaaaa gctatctaga tatctaagac 3120 aacacgactt taaggtggct tataaaccag ttgacaaaat taaagatgta ctatttacaa 3180 aaactaaaga caaaatatct aaagacaaaa acataaatgt agtatatgag attccttgtg 3240 gtgcatgtga aaaatcgtac gttggtgaaa caagtcaatt tctgtgtaaa cgtctaagtc 3300 aacataagta taatgtaaaa acaaaaaaca tatctggtac aggattgtca caacacacta 3360 tagaagaagg acatattttc gactttgaca agacaaaaat acttgacaaa gttacaaata 3420 gctacactag attaataaca gaaacttttc acattaagat acgaggcgat gataatgttg 3480 taaacaaaca gaaagactct gccaatttta agacggcgta caactctatt ataactaagc 3540 ttaaatctat taccaacaca tgaccagaag atatttgtat gtatgttgta tgtagaaatt 3600 tatgtattat gatgtttgta aaagttatgg ccgtccaact gtaaacaaac ggctagaaga 3660 ttgtaaggtg ttaattttat gtttttgatg tgttctaaat gttaaaaaat atattttaga 3720 gtccgcagaa gatggtcgta ggccagtaaa acgaccgaaa cgtccggaca atctggcaat 3780 ttatttgtac gttaattctc gccgaaaaca aatcgatgat tcactgacac at 3832 // ID Copia-107_AA-I repbase; DNA; INV; 4892 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-107_AA_; KW Copia-107_AA-LTR; Ty1_copia_Ele166; Copia-107_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4892 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1603-2100] - Integrase core CC 'TTTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 553..2169 FT /product="Copia-107_AA-I_1p" FT /translation="MDVLFGQLQSAGLVLDEKLQIAMVLRSMPESYHFLAS FT TLEARPDQDVTMGLVRSKLLDEYRKRAERKGPPTGEEQVLKAEANKQKVCF FT FCNKPGHYKRDCRKFLALKEKRYDGESAGSKNAARKKKQAKQAKETTCDAV FT SFSARELGRSVAGKVKASSWTIDSGASCHMTSCEEFFGKLEESPVTTVVMA FT DGNRAKSSGVGCGTIRTVDGRGKPMNVQLNNVLFVPDLDGGLLSVSKIVEN FT GFDVKFGQNGAEIVDSTGKIVALGEMDGGLYILKEAGHKALAAGQCHTVNC FT QHTWHRRLGHRDSTVLDRIRKEDLVAGVKVIDCGSRAVCECCLEGKMARQP FT FPHQAERRTTRPLQLVHTDLCGPMQNVTPGGNKYFMAIIDDFSRYIVLYLL FT KDKSEATACIKNYVRFTENQFGRKPALIRSDRGGEFVNKELEQFYRSEGIQ FT SQLTAGYSPQQNGVAERRNRYLKEMSVCMLLDAGLEKCFWGEAIATAAFIQ FT NRLPSRSVSKTPYELWTGRKPDLGNLKVFGCEAYVHVPDVKRS" XX SQ Sequence 4892 BP; 1321 A; 1047 C; 1370 G; 1149 T; 5 other; tttccttccg ataggttatg ggcccaggcg cgaatcggta aaattcggtc ggtgcggagt 60 ggaaaatagg actttttctc gaattgcgtc gcggcgtgat tttcggccgg tgttcggcgt 120 gcgcagtagc tgactcgttc gacgtttgct ctttttgagg tggtggagcc aatcaccgac 180 agtggaatgg cggatcccgg aagattttcg atcgccaagc tgggaaacaa caactatgca 240 gcgtggaaat tccagatgca gatgttcctt gttcgcgagg aactctggaa tgttgtgtcc 300 gaagctacac cagcagcccc aatcccggac gcatggcaca aggcggacaa gaaggccctg 360 gcaactattg cgttgagcat tgaacagagc caatatcccc tcataaaaga ttgcagtaca 420 gctaaggata tgtgggaggc actaaagcag taccatgaga aaactactgc tgcgtcacag 480 ctgtcgctcc tcattcggct gtgcgmtgca aaggtakgtg aagaaggaga tgttgaaaag 540 cacctacttg acatggatgt gcttttcgga caactgcaga gtgcagggct agtgctggac 600 gagaaattgc aaattgcaat ggttctccgt agtatgccag agtcctatca ttttctggcg 660 tcgactctgg aagcacgtcc ggatcaggac gtaactatgg ggctagttcg atcgaagcta 720 ctcgacgagt atcgcaagcg agctgagcgt aaaggaccgc cgacgggaga ggagcaggtg 780 ctgaaggcag aagcaaataa gcagaaggtg tgcttcttct gcaataagcc aggtcactac 840 aagagggact gccggaagtt tctggcactg aaggaaaagc gctacgacgg tgaatctgcc 900 gggagcaaaa atgctgcaag gaagaagaaa caagccaagc aggccaagga aacaacttgt 960 gatgctgtaa gtttttcggc tcgcgaattg ggccgatcgg ttgctggcaa agtgaaggct 1020 tcatcgtgga cgattgatag cggcgcatcc tgccacatga ccagctgtga agaatttttc 1080 ggtaagctgg aagaatcgcc ggtgactact gtggtgatgg ctgatggtaa tcgagccaaa 1140 tctagtggcg tcggatgtgg aacgattaga accgttgatg gaagaggtaa gcccatgaat 1200 gtccagctca acaacgtatt gtttgtaccc gacttggacg gcggattgct atcggttagc 1260 aagatagtcg aaaacggttt tgacgtcaag ttcggccaaa atggcgctga aattgtggat 1320 tctactggca agattgttgc acttggagag atggacggtg gattgtacat tctgaaggag 1380 gctggtcaca aagcacttgc tgcgggtcag tgccataccg tcaattgcca acacacctgg 1440 catcgtcggt tggggcaccg tgactccacc gtcctggaca gaatccgaaa ggaagattta 1500 gtcgctggtg tgaaggtcat cgattgcgga tcaagggctg tatgtgaatg ctgtctagag 1560 gggaagatgg cacgtcagcc gtttccccat caagcggaaa gacggactac gcggccattg 1620 cagctggttc atacagactt gtgtggaccg atgcagaatg tcaccccggg cggcaataag 1680 tacttcatgg cgattatcga cgatttcagt aggtacatcg tcttgtatct cctgaaagat 1740 aaatccgagg caacagcgtg catcaagaat tacgttaggt ttactgagaa ccagttcggc 1800 aggaaaccag cactcattcg ttctgaccgt ggcggcgaat tcgtgaataa ggaactggag 1860 cagttctacc gttcagaggg aatccagtcg cagctgaccg ccgggtattc tcctcagcag 1920 aacggagtgg cggagcggcg caatcgatat ttgaaagaaa tgtcggtttg catgctactc 1980 gacgctggat tggagaagtg cttctggggc gaggcaatag caacggcagc cttcattcaa 2040 aatcgcctcc cgtcgcgatc cgtaagtaaa accccttatg aattgtggac aggacgtaaa 2100 ccggatcttg gtaaccttaa ggttttcggc tgtgaggctt acgtccacgt gccggacgta 2160 aagcgcagta wgctcgacag taaagctaaa aaaactaccg aaaagctaaa cattacatat 2220 aatttgcaat tggattagat ggacaaattg atgtgaagat ttgcgaaaaa gttacacgtc 2280 ttctcagtga ggatcgaact cacgacttct cgatctctag ttggggcgcg ttaaccacta 2340 cgccatgaga gggactcatg aacgcagaag ttaacctgaa ttcgatttca gctcaataat 2400 cacgtggtcc tctttcgcaa agtgcacctc tttcggaaga attagatgcc catcccatgt 2460 tccgtaaccg gtcgtttgtg aacgtcgcga cccattggaa gcccacacaa tagaaacctc 2520 agccagcgca gttgctggct agtgtagtgt gctattccta tacctaaaaa acaatgcgct 2580 ctcgggctag gcattggata tatatagaaa gcgttgtgtt tggatgggca tctaattctt 2640 ccgaaagagg tgcactttgc gaaagaggac cacgtgatta ttgagctgaa atcgaattca 2700 ggttaacttc tgcgttcatg agtcctctca tggcgtagtg gttaacgcgc cccaactaga 2760 gatcggggag tcgtgagttc gattctcact gagaagacgt gtaacttttt cgcaaatctt 2820 cacatcaatt tgtccatcta atccaattgc aaattatatg taatgtatta gcttttcggt 2880 agttgttaaa cttccactcg gctggttggc cgtaaaccac gattcataat taaaacaagt 2940 actaaaaaac tagtttttgt tggctatgac tgcaaggcga aggcgtatcg ctttctcgat 3000 aaggcaaaaa gaaggatcac cataagccgg gatgctcggt tcctagaagt gggttcagag 3060 atgcaggagg agaaagtttc ggtcgttcca gatcctgaaa ccaagctgga ggctgacatc 3120 gagaagcaac aggaagtagt acagctggag tcggctccgg agaatgagtc agatagtgaa 3180 gaggattgga acagcattgc ggacgaatct gaatttgagg ggttctattc ggatattgac 3240 aacgaatttg aaggcgagga tctgaatcca gctggaagct caagtgtaag agagcaacgt 3300 gtgcgacaac ggcggcgccc aagaaggttc gacgattttg tggtaggtgc tgctaaagtg 3360 gtggaccatg ttccgtcaac gtacatggaa gcagtgaatt gccaggataa agaaaggtgg 3420 atagaagcaa tgaatgcgga atacaattct catcagacac gaggtacgtg gactcttgtg 3480 ccaccaccac tgaatcatcc aatcatcgga tcaaaatggg tgtttcaacg aaagaaagac 3540 acgtcaggcc aaacggtgcg ctacaaagca cgatttgtag cccaaggcta ttcccagcaa 3600 tatggagtgg atttcagcga cgttttcgct cctgtcgcga tgcagtcgac gcttagggta 3660 ctgttggcca tcgccggaca acggaaactc gaggtaagac acgctgacgt taaaagtgcc 3720 tacttaaacg gaaagctgga ggaagacgtt tatgtacgac agcctattgg attcgaggaa 3780 ccgggaaagg aggaacatgt ttgcaagctc cataagagtt tatggcttga aacaaagcgc 3840 gagagtatgg aatactacag tggctgtgaa gctgggtttc catcaatcgt tgtctgaccc 3900 atgtctctca tgaaacgatg gcgagtggtg aatggattta cttgctaatc tatgttgatg 3960 atatgatcct cgtttgcaaa gaagcagagc agattacgat tgtcggaagg agttacagcg 4020 tgagtcgccc tacccagcgc gccttggggg cgaggcggtg gctcgagcgg aggcgccgtt 4080 tcggcttgga tggagcaaag aagtcaagca taccgttgga tgttggatat ttcaagcaga 4140 ttgaaagtga cgtgctacca gataacaagc agttccacag tttggttggc gcgttgctgt 4200 tcattgcttc caatacgcga ccggawatcg cagccgctgt ttcaattttg agtcggagga 4260 ccagtgctcc aacgcagcgt gactggacag agctcaagcg agwtgttcgg tacttgatcg 4320 gcacagagga ctacgagctg aagctaggcg ttcaacaaac cggtgatatg gtccttgccg 4380 gctacagtga tgctgattgg gccggtgatc ggtccgacag aaaatcaacg agcagttttt 4440 gtgtttttct tcggtggagc tcctgtagtg tgggccagca ggaaacaagg atgcgttaca 4500 acatcaacaa cggaagctga atacttggct ctatcggatg cagcacaaga ggtgatatgg 4560 ctacgtcgtt tgctgggtga gctgggcgag cagcaacggc ggccgactat aatcaacgaa 4620 gacaatcgta gttgtataga cttcgtggcc ttagatcgtc tgaacaaacg aagcaaacat 4680 atagacacca agtatcatca tgccaaggat ttatgcgcca agggagtaat acagctacgc 4740 tactgttcga ccagtgagat gatcgccgac atctttacga agcctttggg gccgaataag 4800 atcaaatatt tcacgaaggc tctaggactt gtcggactta actgatgccg gaaatgcagt 4860 tcgtcagcga ggaggagtgt aggcggaatt tc 4892 // ID DNA8-91B_AP repbase; DNA; INV; 864 BP. XX AC . XX DT 28-AUG-2009 (Rel. 14.09, Created) DT 28-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-91B_AP. XX NM DNA8-91B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-864 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2027-2027 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 864 BP; 290 A; 106 C; 117 G; 347 T; 4 other; cagtcttgtt aagtattcga atactttttt tgtatttcaa atacttttca aatacttttt 60 gaccaatgta ttttaaatac ttaaaaaata ctttttattt gtatttttat tctagaatac 120 taaatacttt ttttcattct cgtggtctac tccaaattcc aattaccaat atttcggaat 180 gtactatttt aaattttgtt tctaaaatat tattttaxaa tggttctaac ttctgataat 240 ttataaaata aggtttcgcc aaaatacaag cagataacag aatgactgaa cactgaaacc 300 cccgtcggct cctcactccc atattatacc atggcataca agtacgagac actgaccacc 360 actggttgga ttggcattta tcttgaatct ttaaatagtc tttcttctta tttaatgggt 420 ggtactgtgg tagtagttta taatttgaat ataagatttt ggtattaata ttattgaact 480 tttttttttt gtcatattta ttttaatttg tacaatctgt ataaaatatt tttacaataa 540 gcacaataga tggagggata cataagaggg tttggttggc ttgattaaag aaactattca 600 ataatagtag tacttcatgg cgtgggtagt axtggttagt cgacttgtta ttgaatttat 660 tgaatatttx aatataaatt atttattaat tgaactaaaa aaaaagtatt tgtgxaagta 720 ttttgaatac ttttaaaaag tatttgtatt tatatttaaa tacttttaaa aggaagtatt 780 cgaatacgta tttcaaatac aattgacacc aagtatttaa atacgtattc tgaatacatt 840 tgaaaagtat tcttaacaag actg 864 // ID ISL2EU-7_HMa repbase; DNA; INV; 3521 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE A family of autonomous ISL2EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-7_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3521 RA Jurka J. and Kojima K.K.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 791-791 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. Nucleotides 46-80 and 3487-3521 are CC complementary. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 81..1736 FT /product="ISL2EU-7_HMa_1p" FT /note="THAP DNA-binding domain." FT /translation="MSKNLINFNSDVQLCENSEIELCENSEIELYENSEDD FT SEITEGELTFNGEVEPCENSEQNSISLREVELTFNSEVDFTSKSGGCTCCV FT PNCFNNSKRNKNLSFYVIPKEKVLRKLWLAKISRKDFTPSSSHRVCSAHFQ FT GNKKTYMNNVPTIIPKTVKLTAHVPRKTKNSLGLLHKTTQIPYSEELSTPV FT LSYKEKLKQENKILKDQIEDIIKEKQALVNTQKEAICKLNDKILLSQFTVE FT RFKHNKEHFKFYTGFENFELFKVVMKFLEPEIYSLNYWGSMSTIADNLSET FT SSSKTRGRSRILNVEEEFFMVLIRLRCAFPIEDLAIRFNISSSTTSRILIT FT WYDFLHIKFRSIPIWPTKKLVNETMPSCFKDVYPNTRVIIDCTEIFTVMPT FT SYRTQSAMFSKYKHHHTAKGLIGIAPSGAITFVSDLYAGRSSDKQITNHCG FT ILKLLEKGDSLMADRGFDIVNDLPKGISLNIPPFLEGDFQLTLEKELETRR FT IASVRIHVERAIARIKNYKILQNTFPLTMAADMNKIWVIVCYLVTFLPPLI FT KTDSK" FT CDS 3451..1790 FT /product="ISL2EU-7_HMa_2p" FT /note="YqaJ superfamily exonuclease." FT /translation="MMSSFNANLENKIDASSFQNSCQASKAVYRELKHDEL FT ISVDEEESINKSISTWSRDISLLPNISHNFIKKYLVNDTIYIDNCKKGANK FT HQTLGYRLFKENYVKNVCVKANVTASINLFIVKSNVAASMKRKQYEVFIHL FT CQSTGEILYAKCHCKAGAGGCCKHVAASLYQLVDYKELDIKVVPETETCTD FT VLQIWHVPGESSNSEAILFSNLNFEKADAFKDKNSTRKRPLVSGTRRYHSL FT PNCYETSTLQLQTLCEGLENLCQGTYISSIIRDNNFQACSFFNSSLSTFID FT NDIAEPFSKKDDRLIIITLFDNLQNMIDLSCLSTEQKAFVELHLIVDIEEA FT KNIEVNTVKQSKSILWYQERCKRLTASNFGTVMNRWKKIYPTSILKKCLQT FT KNIKVGESCMWGKVNEDIAIKLYESITKSSVTRCGFFINPKCPWLGCSPDG FT VVSAEKVLEVKCPSSKKHLTIEEACQDKNFYLKFNDGAPKLKINHPYYYQC FT QGIMALTETKTIDFIVYTEKSLHIETIEYDAELWNTIIFPELTDFYFKFMS FT TEIFKVK" XX SQ Sequence 3521 BP; 1259 A; 558 C; 525 G; 1179 T; 0 other; taggtgatta atggaccttt cccataaatc ggcaaaaacg caatgcattg tggtagttgt 60 attcatttaa atcaaaacaa atgtcgaaaa acctaataaa ttttaacagt gatgttcaat 120 tatgtgaaaa cagtgagatt gaactttgtg aaaacagtga aattgaactt tatgaaaaca 180 gtgaagatga tagtgaaatt actgaaggtg aactcacttt taacggagaa gttgaacctt 240 gtgaaaacag tgaacaaaat tcgatctcac tgagggaagt tgaactcact tttaacagtg 300 aagttgactt cacttcaaaa agtggtggat gtacttgttg tgtaccaaac tgttttaata 360 attctaagcg taataaaaac ttatcatttt atgttatacc caaggaaaaa gtgttaagaa 420 agttatggtt agccaaaatt agcagaaaag acttcactcc ttcttcttca cacagagttt 480 gttcagcaca ctttcaaggg aacaaaaaaa cttatatgaa taatgttcca acaattattc 540 caaaaacagt taaactaact gctcatgtac caaggaaaac caaaaatagc cttggtttat 600 tacataaaac aacacaaata ccttatagtg aagaactgtc gacacctgtt ttaagctaca 660 aagaaaaatt aaaacaagaa aataaaattc ttaaagatca aattgaggac attataaaag 720 aaaaacaagc gttagtaaac actcaaaaag aagctatatg taagctcaat gacaaaatac 780 ttctatcaca atttacagtt gaaaggttta aacataacaa agaacatttc aaattttaca 840 caggttttga aaattttgaa ctatttaaag tagtcatgaa gtttttagaa ccagaaatat 900 attcactaaa ttattggggt tcaatgtcta ctattgctga caatttatct gaaacttctt 960 catctaaaac aagaggaaga tcacgtatat taaatgttga agaggaattt tttatggtgc 1020 taattcgact acgttgcgcc tttcctatag aagatttagc aatacggttt aatatttcat 1080 cgagcactac cagtaggata ttaattactt ggtatgattt tttgcacata aaatttagat 1140 caattccaat atggccaaca aaaaaactag ttaatgaaac aatgcctagt tgctttaaag 1200 atgtataccc taatactcgt gtaattatag actgtacaga aatatttact gtgatgccaa 1260 ctagctatcg cactcaatca gcaatgttct caaaatataa acatcatcac acagcaaagg 1320 gtttgattgg tattgcaccg agtggtgcca ttacatttgt ttctgattta tatgcaggaa 1380 gatcaagcga taaacagata acaaatcact gtggcatact aaaattatta gaaaaaggtg 1440 atagcctgat ggcagataga ggtttcgata tcgtaaatga cttaccaaaa ggaataagtc 1500 tcaacatacc accatttctt gaaggcgatt ttcaacttac tttggaaaaa gaattagaaa 1560 caagaagaat cgcatctgtg cgaattcatg tggaacgtgc aattgcaaga atcaaaaact 1620 acaaaatatt acaaaacaca tttccactta caatggctgc tgacatgaac aaaatatggg 1680 tcatagtttg ttatttagtt acttttctac cacctctaat aaaaactgat agcaaataat 1740 gtaaccagac aaaaaaagaa acattttata aatattgtgt aaatgtttat tttactttaa 1800 atatttctgt tgacataaat ttgaaataaa aatctgttaa ctctggaaaa ataattgtat 1860 tccataattc cgcatcatac tctatcgttt cgatgtgcaa tgacttttct gtgtatacaa 1920 taaaatcaat tgttttagtt tctgttagtg ccataatgcc ttgacactga tagtaataag 1980 gatggtttat ttttaatttt ggagcaccat cgttaaattt caaataaaag tttttatctt 2040 gacacgcttc ttcaattgtt aaatgttttt ttgaagaagg acatttcact tcaagtactt 2100 tttcagcgct aactactcca tcagggctac aacccaacca agggcatttt gggtttataa 2160 aaaaaccaca tcgagtaaca cttgacttcg ttatgctttc atatagtttt attgctatat 2220 cttcattaac ctttccccac atacagcttt ctcctacttt gatattttta gtttgtaaac 2280 atttttttaa aattgatgtt gggtaaattt ttttccacct attcattaca gtaccaaaat 2340 ttgaagctgt caatcgtttg catctttctt gataccataa tatactttta gattgtttaa 2400 ctgtattgac ttcaatattt ttagcttcct caatgtcaac aattaaatgc aattctacaa 2460 atgctttttg ttcagtagac aaacaactta aatcaatcat attttgtaaa ttgtcaaata 2520 gagtaataat aatcaatctg tcatcttttt tactaaaagg ttctgcaata tcattgtcta 2580 taaatgttga cagagatgaa ttaaaaaagg aacatgcctg aaagttatta tcacgaatta 2640 ttgaagatat atatgtcccc tggcagagat tttctaaacc ttcacaaaga gtttgaagtt 2700 gtaaagttga agtttcgtaa caatttggta atgaatggta tctacgtgtt cctgaaacta 2760 aaggtctttt tcgagtacta tttttgtctt taaaagcatc agctttttca aaatttaagt 2820 ttgaaaacaa aattgcttca gagttggatg attcacctgg tacatgccat atctgtaaaa 2880 catcagtaca tgtttcggtt tctgggacca ctttaatgtc taactcttta tagtcaacta 2940 gttgatataa tgatgcagca acatgcttac aacagccccc cgctccagct ttacaatggc 3000 atttagcata aagtatttca ccagtagatt ggcacaagtg tataaaaact tcgtattgtt 3060 tacgtttcat agaagcagca acattagact ttacaataaa caaatttatg gatgctgtta 3120 catttgcttt aacacataca tttttgacat agttttcttt aaataaacga tagccaagag 3180 tttggtgttt attggctcct tttttacaat tatcaatata tatagtatca ttaacaagat 3240 atttcttgat aaaattgtga gatatatttg gtaaaagtga aatgtctctt gaccaagttg 3300 atatactctt gtttatagat tcttcttcgt caacagaaat aagttcatca tgcttaagtt 3360 cccgatacac agctttagat gcctggcaag agttctgaaa agaactagca tcaattttat 3420 tttctaaatt agcattaaaa gaactcatca ttataataat atttaattcc taatacttta 3480 atcggattgt tttgatttaa atgaatacaa ctaccacaat g 3521 // ID Gypsy-24_IS-I repbase; DNA; INV; 4393 BP. XX AC ABJB010964315; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_IS_; KW Gypsy-24_IS-LTR; Gypsy-24_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4393 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010964315; Positions 4315 8707. XX CC Positions [3502-3840] - Integrase core CC 'CTACT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..1848 FT /product="Gypsy-24_IS-I_1p" FT /translation="MAEGEPRIQGVRVGASGLHPPAPFTFTSPAEWPTWIA FT TYQDCAFAAGLDTASDEVQVRTLLYCMGPQARSVLSSLGAPTPETLSFAVV FT KEKLAGHFVHPVNEIYESRRFHRRTQEQGESVDDFFTSLRNLVKLCGYNSA FT DVEDRLVRDKFVVGLRDDKLCDQLCRSTRLSAEEALCQARVHEDAEKERLS FT RSSVARGNTAPENLNVDTARRKKIATSQSARYHGKSEQSASSTYEYFGRHP FT HLRKDCPAWKSSCHFCKKQDHFAEVCKAKKRKELLGSVELHAVSTRHARYT FT EVRVNGHLSKFKIDSGAEVTVVSPSFPGLPIVLDEADGEVSGPGNLRLNVL FT GTFKATLEWRGKFTVQRLYVVQNQHTSLLGLPAIEALGVIQFLDSTEYVPS FT TQAKIFEGLGELQEEYHIRLQPNVSPFSLSVPRRIPIPLLGVVKKELDDME FT AQGVIRKIDTPTQWCSGLVVVPKPSGGYRLCVDLTKLNQVIQRERYVLPTV FT EYILGQLGEAQIFSKLDARSSFHQVKLSRDSEELTTFITPFGRYCYKRLPF FT GIASAPEYFQRLMSRLLEGLSGVVNMIDDILIFGRDCREHDQRLADVLSRL FT EQAGVTLN" XX SQ Sequence 4393 BP; 1019 A; 1174 C; 1196 G; 1004 T; 0 other; tggtgtcaga agctggttaa cgccctcact atggcggaag gtgaaccccg tatccaaggg 60 gttcgcgtcg gagcgtcggg acttcatcct ccggctccat tcactttcac cagcccggcg 120 gaatggccga catggatcgc gacttatcag gactgcgcct tcgcagccgg tctcgatact 180 gccagcgacg aagtgcaggt acgcacgctc ttgtactgca tgggaccgca agctcgaagc 240 gtcctgtcat cgttgggagc gccaactcca gagaccctgt cgttcgcggt cgtcaaagag 300 aagttggccg ggcacttcgt gcatcccgtc aacgaaatat acgagagtcg ccggttccac 360 cgtcggacgc aagagcaagg cgagtctgtc gacgacttct tcacgtcgct tcgtaacctg 420 gtcaagctct gcggttacaa cagtgcggac gtcgaagacc gcctggtacg tgacaaattc 480 gtcgtggggt tgcgggatga caagctctgt gaccagttgt gtcgttctac gaggctctct 540 gcggaagaag cgctctgtca ggcccgtgtt catgaggacg ccgaaaaaga gcggctgtca 600 cgctccagcg ttgcccgcgg gaataccgcc cccgaaaacc tgaacgtcga caccgcgaga 660 cgcaagaaga tagcgactag tcaatcggca cgctatcacg gcaaatctga gcagtccgcc 720 agttcaactt atgagtattt cggacgtcac cctcacctgc ggaaagattg cccagcgtgg 780 aaatcgtctt gccatttctg caagaagcag gaccattttg cagaagtttg caaagctaag 840 aagagaaaag aactcttggg ctccgtggag cttcacgcag tcagcacacg gcatgcgaga 900 tacaccgagg tgcgcgtgaa tggtcatctt tccaagttca aaattgactc gggtgcggag 960 gtcaccgttg tttcaccctc ttttccgggg cttcctattg tcctggacga agccgacggc 1020 gaggtgtcgg gtcccggcaa cctacgtctg aacgttctgg ggactttcaa ggctaccttg 1080 gagtggcgtg gcaagttcac ggtccagcgc ctttacgtcg tgcaaaatca gcacacgtct 1140 cttctcggac tgcctgccat tgaggctctt ggcgttattc agttcctgga ctcgacggaa 1200 tacgttccat caactcaagc caagattttt gaaggcttgg gggaactcca agaggagtat 1260 cacatccgtc tccagccgaa tgttagtcca ttttctctga gcgttccaag acggattcca 1320 atccctctgc taggtgtcgt gaaaaaggaa ctggatgaca tggaggccca aggtgtcatt 1380 cgaaaaatcg acacgccaac tcagtggtgt tccggtctag tcgttgttcc caaaccatca 1440 ggcgggtaca gactgtgcgt ggacttgaca aagctgaatc aagtgattca gcgagagcgt 1500 tacgtcttgc ccactgtcga atacatactg ggtcagttgg gtgaagctca gatattttcc 1560 aagctcgatg ctaggtccag cttccatcag gtgaaactta gtcgtgacag tgaagagctg 1620 acgacgttca tcactccctt cggccggtat tgctacaagc gtcttccatt tggcattgca 1680 tcggcaccgg agtactttca acgcctgatg tcaagacttc ttgaaggtct ctcgggtgtc 1740 gtcaacatga ttgacgacat tctaatcttt gggagggatt gtcgcgaaca cgatcaacgt 1800 ctggccgatg tcttgagtcg cctggaacaa gctggggtta cactgaacta gaaaaagtgt 1860 aaattccgcg tcacctctgt caagttccta ggcgttgtcg tggatgccga aggaatttcg 1920 ccagatccag acaagatcaa agccatcaag aattgtatgc caccagaaga tatcagcggt 1980 gtacgccggt tgcttgggat ggctaatcac atcggacggt tcattccaca ccttcctgac 2040 gttaccacac caattcgctc tctgttaaac aagaacagcg tctggatgtg gggtcctagc 2100 ctagaggagg ctttcgcccg ccttaagtca ctgcttagct ctgaaacatg catggcaaag 2160 taccacccgc gcttatctac cgtagtatct gcagatgcca gttcttatgg tttgggagcc 2220 gtgctgcttc aagagcaacc ttcggggact cggtgggctg tcgcctttgc ttccagatcc 2280 ctcactccca cggagggtcg ctacagtcaa acggagaagg agtctctagc cgcaacgtgt 2340 gcgtctcagc cgaagtcctt cttgctgagg ctcttgcgat gatactcgtt gaccctcaac 2400 aggcatagac tgctgtgaac ttgccgacgg ttctttttct gcttggcggt ttgagaaggg 2460 gacaagatgc tgacggtttc gctgcagagt gcccctgggc gtttccacca catatgaccg 2520 tggtcgctga gcggggctga gtatacggcc tcggcaattg acgtccttaa cccacacgtc 2580 ctcccccaac ttcaagggag tcagaggact tgaactatgc cggcgattgt agttctttga 2640 ctgctgtaac ttgtttgcgg catttctcgc cttaaagttc tgatggctgg gcaatgctgg 2700 ctcaagtcgt tcgggtaaac caggaagtcg agtttggagt tttctcccca ttaacaacta 2760 tgcaggagac acgccattga caccaggtgt gtctcggtac gctaggaggg ccaggtacgg 2820 atcagaactt tttgcgaaca agtccttcac ggtacgcacc gtgaagtcga ggtcgagaca 2880 gatcaccttc ctcttgtcgc ccttctcggg ggaaaggacc tggagctttt gccaccgcgg 2940 gtgcaacgca tgcgaatcaa gctgatccgt tatcagtaca acatgaagta tgtaccggga 3000 aagctcttgg cgactgccga cacactctca cgagcacctt gtgacgcgcc agcttcgaaa 3060 gaagtaggga gtgtggaact cttcattggc gaagtcttca aagacctgcc gccagcggtg 3120 gcgagtcgac tggaagacgt ccgccagcac caaacgcaag atggggagtg ttccatggta 3180 gtaacttact gtggaagagg ttggcccagc aaaaacaaag tgccagctca cattgctccg 3240 tattagaagg agagagaacg attgagcgtc tacaaaggcg tactacttct agatcgccgc 3300 atcgtcattc cgtccgctct ccgacagagt atcctggcct tactgcacga agggcaccag 3360 ggcgttcgcc gctgtcaaga acgcgccaga gaaagcgttt ggtggccgaa ctgcaattta 3420 cacatcgaac atcttgtagc cgagtgtgcc aaatgcgccg aaactcgtgt ggtacactct 3480 gagccgatga tgcagactcc aacgccagaa agaccgtggc agcgcctcgg gatagacctt 3540 ttccagctaa aaggacgtga ttacgttctg attgtagact actactcaag gtttcctgag 3600 gtcatccctc tcggatctac atctgcgcaa gcagtcatct cggctgtcaa gagttgcatg 3660 gctcgctatg ggatcccgga cgtcgtgcgg tcagataacg gaccgcagtt tgcatctcac 3720 gagtttgcaa actttgctca ggcgtacgga tttcgccatg agactagcag cccgcggtac 3780 ccccacagta acggcgaagt agagcgtatg gtgcgtaccg tgaaggactt gttcgcaaaa 3840 agttctgatc cgtacctggc cctcctagcg taccgagaca cacctggtgt caatggcgtg 3900 tctcctgcac agttgttaat ggggagaaaa ctccgaactc gacttcctgg tttacccgaa 3960 cgacttgagc cagcattgcc cagccatcag aactttaagg cgagaaatgc cgcaaacaag 4020 ttacagcagg caaagaacta caatggccgg tatagttcaa gtcctctgac tcccttgaag 4080 ttgggggagg acgtgtgggt taaggacgtc aattgccgag gccgtatact cagccccgct 4140 cagcgaccat ggtcatatgt ggtggaaacg cccaggggca ctctgcagcg aaaccgtcag 4200 catcttgtcc ccttctcaaa cggccaagca gaagaagaac cgtcggctag ttcacagcag 4260 tctatgcctg ttgagggtca acgagtatca tcgcaagagc ctcagcaaga aggacttcgg 4320 ctgagacgca cacgttgcgg tcgtgagatt cgaccgccaa gccgattgga cttatagatc 4380 ttaagagggg aga 4393 // ID P-2N1_TV repbase; DNA; INV; 4605 BP. XX AC . XX DT 26-OCT-2009 (Rel. 14.1, Created) DT 26-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Nonautonomous P DNA transposon from Trichomonas vaginalis - a DE consensus. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-2_TV; KW P-1N1_TV; P-2N1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-4605 RA Kapitonov V.V. and Jurka J.; RT "First examples of protozoan P DNA transposons."; RL Repbase Reports 9(10), 2163-2163 (2009). XX DR [1] (Consensus) XX CC This is a young family of nonautonomous P DNA transposons CC identified in the Trichomonas vaginalis genome. The consensus was CC derived from multiple alignment of 10 copies of P-2N1_TV, which CC are less than ~1% divergent from each other. P-2N1_TV is a CC nonautonomous DNA transposon transposed by the transposase CC encoded by the autonomous P-2_TV. This nonautonomous transposon CC is characterized by 8-bp TSDs and imperfect 22-bp TIRs (2 CC mismatches). XX SQ Sequence 4605 BP; 1655 A; 606 C; 700 G; 1643 T; 1 other; aaaggtagtc tcacgaaaat acgttatttt cccgatgcct tggaagcagc tttgcaggtt 60 acccgtaagg taaccagtag aagtttctga gcaaggatga agaagattaa agagcctttt 120 tattgcaaac gctatatttg taagtgtctt tgtagaactt ctcataaatt caaagatttc 180 caaaaacgaa tttttcgatt ttgatataat aaatgatatt taaacaaatt atagatgtgt 240 ttgcaattaa aaaaatccga gttaactaca atatcttcaa tttttgtcgt tcactttttc 300 atgtcatact gatagcaaga atactcaccc aatttggtta cacttcatga tagtttgaag 360 taaaaaaatg ccacagtttt ttttatttga tatcgttatc taaattttgt acagaaatga 420 aataaattat aatatgaaac gtgtgttatt atgcttgttg ggaaacaatc caatacttat 480 gtctgaatga aatgattatt tctacgagat ttttcatatt cagctaataa tgctatcaaa 540 atcaaataac aataaaaagt tattttgtac ataatgtaac ttaatcttta atactatata 600 gaacttcgaa atacgactaa acggtgcgag ttacgcgatt aaataccttt ttttcgatcg 660 aaacttgaaa gtgaggataa ctcaatgggt aggggtatgg gttcgacttt ttctattggg 720 caactatttt tttcttttca ttgtttcagc atttttttat gcgattatgc ataattattt 780 gatttcctat atattcaata ataaatttgt tccgaactct taacaaataa ataaattttt 840 agaatattta aatttatagc tccctacatt aatataaagt gttcgaatag ttcatatata 900 gtttctttaa aataatttgt ttcattttaa attggtttgg cacaatgctt gacataacat 960 gcacaaagct atgccatttt aaaatatgta tatcaatcaa gccgattaag tccatcctaa 1020 acgaaaatca tcgatactca tgaatttatg atagtcgttc ttgatataat tgcgcgtttt 1080 aaaaaatgct tacttttgaa cacagaactt atctatccat atttatcaat gaaagtttat 1140 aagatgcatt aatgacaaaa ccttacttta tcaattataa tcttatcgta aatttcaaac 1200 tacaatactg tggggcaaat attaaataac cactatgttg cggattttat gtaagtctgc 1260 aaactgtaaa atgtaattaa acacaaataa aaaaaaatgt aaattttaat cgagaaattt 1320 agcaatcttg aaagccttga tcggcatctt ttacaagatg tcagtaatgc atattcaata 1380 actatgaaaa tctagccaag catggatttt tttcgaaaaa aaattgtatt aaatttcatt 1440 tttcattctt tatacaagtt ttaacctttg gtgcaattat atcctatcgc aaatgcatta 1500 ttgacgttat tacttggata cttcttcata acgctaaatc ttcaaattag ttatgtgttt 1560 gttactaata gaagttggaa aggtattaaa aaaggaaatg ttaataaata tacatgttta 1620 attaaaaaac aaacaagaat aaattaaata attttatttc ttaggaaagt ctaagttttg 1680 taatagataa attgcttttt gtgtctcttc ttcatcaatt tctggatgtt gattctcagg 1740 tggcggacta attaaaacat catgggcaat atcaaaccat tgttttagtt cgttggcaga 1800 aggaaaatga tcttcttgta ttggcgttat taatgattga aaatcatcag ttgattctgc 1860 attttgggaa ttttccaaat tttcatttgg aaaatttgaa ttttgagata ttgtttaagc 1920 actcagatct catttttaaa atatctgcac caaattctta catctgataa ttttcatgtg 1980 taaccaatgt ttcccaatcc atttgccata ccctactaat cgcaaattca ttccctttaa 2040 ttttgtatta aatttaaatt ccaatcttaa tttctacatt atttatgcta accctttaaa 2100 ttttgttctt atttattcac cctcagatac aaatttattt agaatgtatt cctctgaggt 2160 ctatatttgt attcgaaaat aaaatcttta aaactctcta agttatcatt ttctcttcag 2220 atatctaata tttttctaat gacataacta ctttggacgc atatccaaat tccaaaaaaa 2280 atatggaaaa aataaaaatg aaattgaaaa atacaatttt tgatatgtaa aaagagattt 2340 tgaaaaaaat tgggaaaaag agtatctaat aaaattgttt gagatctatg tagacatagt 2400 ttttacatta aaaaatcaat aatattgaaa ttgatctgaa tatatctttt tttttaagtt 2460 tgaatgcatt aatctcaaag catttgataa aatactggtt ttatttcaaa attctaaata 2520 atttgtaatt agatctttat gatgtaattt aattttaaac aagtacagat aaagagaatc 2580 ataataaagg tgatctatat tgcttaagta aaattgagta taatagatcg atcttagcaa 2640 caagatctga ggaagtagat ttgagagtcg agagtttaga tcttagcaac aagatctgag 2700 gaaatagatt tgagagtcga gagtttggat cttagcaaca agatctgagg aagtagattt 2760 gagagtcgag agtttggatc ttagcaacaa gatctgagga agtagatttg agagtcgaga 2820 gtttagatct tagcaacaag atctgaggaa gtagatttga gagtcgagag tttagatctt 2880 agcaacaaga tctgaggaag tagatttgag agtcgagagt ttggatctta gcaacaagat 2940 ctgaggaagt agatttgaga gtcgagagtt tggatcttag caactagatc tgagagtttt 3000 atcatttccg gcatattggg ccgaaaagga gtgatccgcg taagtatgaa aaaaagtgaa 3060 ttttgttgta tggtgttttt tattatatcc gtttctctca caattttcaa taaatattat 3120 tgttcaaacg gaaatataaa caaacaattc aaaaataaaa atgtaaatgt aaattgaaaa 3180 tgtattataa acatgctaaa aagcgtatat tttaaactta aaggaaaaat gaaattcgta 3240 aaaatgagca tgatgtttga taagggwgtt ttaatttttt ttcaaaaaaa ttaaggaact 3300 taaattggtt tcgaggcgta gatagggata aaaagatgct ttaagaagat ctcagtagaa 3360 atatggttta taagaggata taggtcagga tataagatta tgaaaattga agaagtgaat 3420 tatcaagatg tatttatgag taaactatat tgttttaagc tatctaaatc tgttttgata 3480 taaattaaag cgtatactta tgcaaaatgc gaaatttaaa ttaatacttt ttttttgcta 3540 tttggacaca aagctaacaa tccatccaat aaatgacgta tacctagcga gatgaagcat 3600 tttaacatga aagacaagaa aatctgatta cttgtgttga aagtttctgt atttcgtctt 3660 cttccaaaca cagcgttcag atgcagaaga aaatgcagcc gtaatgacgc aaaaagacac 3720 taaaataaat tggagaattt ctgtaaagtt tttactgagc tttgttattg aacccgaatt 3780 tgttgctgag tgacaggata ttaaatctac gacattcatg agtaattgtt atcaatttgt 3840 caataaatta tgttgaagat ggtttaaatg tcttcgaagg catgaaatta gatcctaaat 3900 ctgtttgagt taataaaaga gaacaaatgt tgaaagagtt gtcatattgc ctacaaaatt 3960 ttcaaacatt tgcattgaga caatgtgaaa aaatgatgtc caaaatacct ttactaaaat 4020 atttttaatt tgaatcataa caaggaaaaa caataatggc agaataagaa aaaaaaacaa 4080 agatgttatg atttgaacag ttgcctttta gattcaaaaa gagcatagtg ttcccaacca 4140 ctacactacc ttcctttctg ataaaaagct gaataattag tatttaatga attctccaac 4200 ttggggattt gcggcttttc caaaatctgt tatgatagtt ctatatagca attgatatat 4260 aatatgtttt ttataatcaa ataatttttt agtccagata aattttttaa aatttcaaag 4320 tctttccatt ctactaacgc tcttcacatc catattcttt accatatatg taatataacg 4380 ccttatttta attcccatgt ttacgattta tttgtcgaaa aactgtcttg cccattaaca 4440 tctcgcgaaa attttcgact ttttgaatgg ccgttttcaa gatctcaaaa actgtgggat 4500 attttttaca aagctaatcg acatttattt accacatata tgtactatac aatgcacctg 4560 gtttcatcga atttttaccg tcagttttct cgtgagacta ccttt 4605 // ID EnSpm-13_HM repbase; DNA; INV; 5909 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5909 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 384-384 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1697..3208 FT /product="EnSpm-13_HM_1p" FT /translation="MNEGEYKYLGIKEGLETILKHSAYNCDEIKLLFNVDG FT LPIFKSSGYQLWPITSQFSVFQPFTVALYGGQKKPNPIEFLTNFAHELKEL FT HNKTVTLCGKSYYVSIFAIPCDSPARSLLKGIVQHSGYHACERCDEAGVSV FT KGRIVYGTNNSNLILKRTDYGLRSGQYAQLDNENRXHQHIVTPLYEIQQLD FT LVQQFPLDYMHLVCLGVMRRILLYLKGGYPQIFSGRLASADLIKISHRLSE FT LKGKLPSEFARQPRELTEVNRWKATEFRTFLLYTGIVVLKDVLEPKRYKHF FT LSLALAIRMLCEEETILRCSYLNSARQLLNYFVTNAHEHYGDTFTVYNIHN FT LKHLTDDVEFFQSSLDVFSCFQFENYLKTLKGLVRGKQNPLIEIVKRLHEL FT KDTYSEKTITITKFNGPTNSWFITEDCICFVNNILPDGRLVCKMYKKSELD FT NFFKDFVSSKTLQIFLIRKDTLWSDQITCANIRWRKCVCLAHGNDQVIIPL FT LNCVKLG*" XX SQ Sequence 5909 BP; 2097 A; 776 C; 905 G; 2123 T; 8 other; cccagtgggc atttgacgtc ctaggacgtc caaagacgaa caaatctggt ctaaatttgc 60 gggacctaat tagacgtcct tggaggtcca aattagtcaa acttcgtctt aatttttaat 120 ttggacgtct actgacgtcc aaagtattaa catgcttgcg ctaactattt cgccgttata 180 taacaattat actcaaaaac acaattacta cggcagttaa tattttaact gccattgtaa 240 ttgtgttttt gagtataatt gttatataac agcgtgctag ttgcaacatt gcttgttaca 300 gttacttact ctgtcttcct acattacacg tcgcttgacg ttgatcattc tgtcttcctg 360 caaaattaaa taatggaacg caaaacgtat acgaaaagat ttcacggaga ttcaagtgaa 420 aaatgtcaaa ggtacaagta aattttgagt aaaacaaagc tgaacttagc attatttatt 480 ataaatgtta gttgaaaaac tatataagta ttcattttgt aagtatataa agttttttaa 540 ctgacattta tttcagcttt gttttatagt ctatagttat agccctatat tagtataata 600 aatagtatat ttctagtagt atctctagat tgaaacatct acagatatat taatcgatta 660 gattaaaata tctatactaa aactataaga tatagttttc ctataaatat tttgatctag 720 tagactaaaa tatctttaga ttawctacat gttttaaaga cagcttccat gtttttttcc 780 tcttttaaaa ctttggagtg cttattaaaa gtatatatta gtattttctg gtaatattaa 840 ctmtaaatat tatatataaa ttaactataa atattatata taaattaact ataaatatta 900 tatataaata tatatatata ttatatataa attaatatta actataaata ttaaatataa 960 aaaaggagtc ctagaggtaa gattcaaaag atatatctaa tttaaatgat gtatatttaa 1020 ttcaaaatgt gttttaaatt tcaactaatt tcaacttcat cttttcttaa attttcattt 1080 caactgtata ataataaaat taaattaaaa attaatatta catatatgaa aaatttttta 1140 ataataataa aattggttac aaattaagta aaataaattt actaccttat ctctgtttaa 1200 aycagtttta ttatgtgtat attttacaca attttattat ttgcaaagtg tttttttagg 1260 aaacgtataa ctaactggcg taataaccgt agtcatgtgt ttgcaatttc acaatgtatt 1320 gattccccaa caaatgattc aaattttagt gaaaactttg caataaaaac ttttaattgt 1380 ggttgtttgt tgcaatcaga aagtgagatt gaatcagatt gtgatcatca gcccaaaata 1440 aatagtgttt cgcaaagttt tgaaagtttt agttatcctt cagataatga aatttgtgag 1500 ttaacggata atagtcaatc attattaaaa gacattcgta tttgggtatt aaaacatcaa 1560 actacacgag aatgtgttaa tgacttataa acaataataa ttttgttatt aacaaatttt 1620 aaggaaccat gggcaccaac ttccaaaaga ttcaaggaca gttttaggaa cattgaacaa 1680 agttgataca accgaaatga atgaagggga atataaatac cttggcataa aggaaggcct 1740 tgaaacaatt ttaaagcaca gtgcatataa ctgtgatgaa attaagctac tctttaatgt 1800 agatggtctc cctattttta aatcatcagg ataccaatta tggcctatta catctcaatt 1860 ttctgttttt caacctttta ctgttgcctt atacggtggt cagaaaaaac caaatcctat 1920 tgagtttttg accaattttg cacatgagct gaaagagctg cataataaaa cagttacttt 1980 atgtggtaaa tcctattatg tctcaatctt tgcaattcca tgtgattcgc ctgctcgcag 2040 tctgctgaag ggaattgtac aacatagtgg ttaccatgcc tgtgagagat gtgatgaggc 2100 tggtgtgtct gttaaaggtc gtatagttta tggtacaaat aatagtaatt tgattttaaa 2160 aagaactgat tatgggttac gttcaggtca gtatgctcaa ctggataatg aaaacagaty 2220 tcatcagcat atagtaactc cattgtatga aattcaacaa ctagatcttg ttcaacagtt 2280 tccattagac tatatgcatc tagtttgttt aggagtaatg cgtcgtattt tattgtattt 2340 aaaaggagga tatccacaga tattttcagg aagattggct tctgcagatc tgataaagat 2400 atctcatcgt ttaagtgaac taaaaggaaa attaccttca gagtttgcga gacaaccacg 2460 tgaactaact gaagttaatc ggtggaaggc tactgaattt agaacttttc ttctatacac 2520 aggcattgtt gtactaaagg atgttctgga accaaaaaga tacaaacatt ttttgagttt 2580 agcccttgca atacgcatgc tatgtgaaga ggaaactatc ttacgatgca gttatctaaa 2640 ttcagcaagg caacttttaa attattttgt tactaatgcg catgagcatt atggtgatac 2700 tttcactgta tacaatatac ataacctcaa gcatcttact gatgatgttg aattttttca 2760 atcttctctt gatgtttttt catgttttca atttgaaaat tatttgaaaa ccttaaaagg 2820 attggtacgt ggtaaacaga atccgttaat agaaatagtt aaaaggcttc atgaattaaa 2880 agatacttac tctgagaaaa caattactat taccaagttt aatggaccta ctaactcttg 2940 gtttattact gaagattgta tatgctttgt gaataacatt ttgcctgatg gtagattagt 3000 atgcaagatg tacaaaaaga gtgaactaga taattttttt aaagattttg tttcttctaa 3060 gactcttcaa atatttctga taagaaaaga tacactttgg tcagatcaaa taacttgtgc 3120 aaatataaga tggagaaaat gtgtttgttt agcacatgga aatgatcaag ttatcatacc 3180 attactaaat tgtgtaaaat tgggatgaaa aaaaaaactt aataacttta tttatattaa 3240 acttttattt atattaaaat ttcctatttc tgaaactttt ttatgaataa atttatttaa 3300 atagattaaa gatttaaaga atagatttat ttaaatatac tgactttggt tttattatta 3360 gtctgtttaa aaaatttaga aaatctgaat ttaaatatac aattatatgt aaacaaagtt 3420 aaatggttta tgtttatagt tttaaaaaaa gtaaaaatat gtttaagact aatgtggtac 3480 gtacttttga attgttcatt aagtcctttt tttattctca ggctttttag tactgttgtt 3540 ataactgaaa aaggtctacc aaccaaaaca gtggttcctt ctcattgggt tagtttagat 3600 aaatgtattg tttatttccc acccaaagga tacaaaactc ttggtactta tatttctgag 3660 tgggctgttc cagaaccagg ctggaaagag tatgagttta ttgaatttat tttggaagct 3720 ggaacattgg agacatgcaa ccagatgctt caatttcaaa cagaagatga gtcagaaaat 3780 cacggtttat attttttatt atttatgtaa tttctcttga taaaaaaatt attatgagtg 3840 tataaataat ttttttgtat taaccgtaag gaaaatcaaa agaagttcga gcacccactc 3900 cagagctaga cttcattttt aattctaatg aacctgtgaa cagtggattt tctcaaaagc 3960 tgagtgatag atattctaat acaggtgaca aagttttatg taaaggtatt ccagtaaaac 4020 tatttgtttg attggtattt atttcaactt ttttattttt ttgcattaga aaaactgaaa 4080 gcagcacaag ctccaagtct gttgccagat tttccacctg attttgttaa caaaaatgtt 4140 ggaaaacctt tcgggaaatc aaaagaagtt cgagcaccta gcccagagct agacttaatt 4200 tttaattcta atgaacctat gagcagtgga tttactcata agctgagcaa taaatgttca 4260 aatacaggtg aaaaagttat atttaaatat atttcaaaaa aattatatat tttttaaaaa 4320 actatatttt caaaaaaaaa aaaagtcatt taactatatt ttcaacttga ttttcaaaca 4380 ggtaatttaa aaaggaaaca ttcaaatata ggcgatggtg gagtttcttt taaggaaatg 4440 tcaaacaagc gtgagtactt ttgtatttac ttgaatatat ttatcaatgt tgtttttcat 4500 ttatgaaaac tstttttttt ttgttttttt tttttgtatt gtattgataa ttacattttt 4560 aattcatttt agtttcacaa ttgagaattt attgttatat aattaaatgc aattaaatct 4620 aatttaaaag ttgcaattaa taataaatat tttgtcaata tttaaaaatc acaaataaaa 4680 tatttaattt aacttcagaa tttcaatatg caattttcat ggaactaaca agtcaattta 4740 atcaaattaa ggagaatcaa aagaaaatcc ttgatcgtct tgagctactt gaaaataatg 4800 atgcgggacc cagtgggttt gcaactgccg atattttaac agaaccaata gatacaatgg 4860 aaaggtttga tgaggaagaa aatgtattaa actccagtaa agcagctagg tttcgaaagg 4920 tagtttgtca tayttttttc accttataga aaagtaaatt taatcaacag ttttaaatgt 4980 taytttttaa aacacaactc aaaactaaaa atgctaattt tttatttttt ttattttttt 5040 attttttttt taattaggag ttkacaattt tatttacaat gttatttttc cttaaataga 5100 cacaagattg agatgagttt gcatcggatt tttttttaac gtttttattg tctctagatt 5160 acacaaataa agggcgttgg tggttcaacg ccacgtcgac ttgttaaaaa tgtgttagac 5220 agcataatga cacaatcttt acaaagctgc ttcagtaaag atggaataaa aggaaaacaa 5280 aaatttgtgg ccacaacttt atacaaatgt ttaaaaagta agttagttga catacctaat 5340 tacattgtaa ttaacttgag gtttttatta agtttgttat gaagaacaaa aaagtcataa 5400 taacatttaa atataaatga aaaatagttc taaattttgc ttgttttaga agcgctcatc 5460 tcagaaaaag atggttttga cgtgggcaat attgaggcac ttgttggcga cttgttaaaa 5520 cgtgcaatgc cgaaattgca aaaagaagga attaaataat ataaaaattc ttcagttttt 5580 tttaattata gcatgtattt taacatttaa ataattaaat aaatttttaa cttgaaaaaa 5640 gaaattattg ctagttttcc gtgccatgcc taaatacgac gaccataggt ttaataggat 5700 atcaattttg gacaaatata gacgtccgct aggcgtctta attcgtccaa aaaatgtcta 5760 aaaaagacgt cattggacgt ccgttatgga cggacaaaag gtgtttcatt tggacgtcta 5820 ccggacgtcc attagacgtc caaaatggac gtcctaggtc caatctggac gtccgctgga 5880 cgtcgttgga cgtccaaatg cccactggg 5909 // ID LEAPFROG1_EI repbase; DNA; INV; 1958 BP. XX AC LEAPFROG1_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE LeapFrog-Ei1 (LEAPFROG1_EI), a new member of the piggyBac DNA DE transposon superfamily from the single-celled eukaryotic DE reptilian parasite Entamoeba invadens. XX KW piggyBac; DNA transposon; Transposable Element; KW Interspersed repeat; Leapfrog-Ei1; LEAPFROG1_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-1958 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; LEAPFROG1_EI; Positions 1 1958. XX CC leapFrog-Ei1 is a member of the piggyBac superfamily. The TIRs CC are 10-bp long and are flanked by TTAA TSD. The element contains CC a large ORF, which can potentially encode a 574-aa protein 43% CC similar to the piggyBac transposase from Trichoplusia ni There CC are several elements closely related to leapFrog-Ei1 in the E. CC invadens and E. moshkovskii genomes including deletion CC derivatives with perfect TIRs. There are also several divergent CC lineages of leapFrog in these genomes. XX FH Key Location/Qualifiers FT CDS 107..1844 FT /product="LEAPFROG1_EI_ORF" FT /translation="MEKEEMEESERCDSDNVLSDENYEENYQEEVVDKHEN FT CVNPNCQLCWKNVKFTEDDVDSINFIHDKSGNMKSTTMINTELDCFMSLFN FT DECFELITENTNLYANQKKTRKWYPTNVNEMKAFISVAIKQGVNNDNKKPI FT QELYSKEALPFYLAIFPLYRLKSLLYNIHFDNKLNRYQRQCTSFRKEEAIA FT DLYDKLNINFQNVYSPGKNVVIDEMMCPFHGKCPFKVYMPAKPERNGLKTY FT AVVDCESLYVSKIKLYCGGEFNPKNNKFINTISYTQRKTTVTHVKKQNLLE FT LNSISKKPRKEKSKGNPIKNTVIFLMNHLFGQKHVLCCDNFFGTIDLIKEL FT VDKQCDYIGTVRQIRFQVCKKFTEENKLKTILYKHINLGWSMLGYKETETK FT QRVCIISTENSKTKLDEEDGTPETVVKYNKLKGGVDMVDKICSSYSSRRTS FT RRWSVTYFYFLLDICVNNAFVMWVKSTGNNNRRTFIHNLSFQLSSFFISNI FT LIQPRLPQMERIKAMKNMGFDISTEHRQNKYIKTIGGKKSGYCHDCKQRKT FT TACCDICSATLFTWSCGDCNRVLCGECFKKIAX" XX SQ Sequence 1958 BP; 767 A; 280 C; 335 G; 576 T; 0 other; ccttaggaag agaaatctcg atattttata tcaagattga atatttattt ttaaatcgct 60 tttttatata ttaaaaaaat aaaaataaaa aacaacaaaa aacaaaatgg aaaaagaaga 120 aatggaggag tcagaaagat gtgatagtga caatgtttta agcgacgaaa attacgaaga 180 aaattatcaa gaagaagttg tcgataaaca tgaaaattgt gttaatccaa actgtcaatt 240 atgttggaaa aatgtcaagt ttacagaaga cgatgttgat tctataaatt ttatacatga 300 taaaagtggc aacatgaaat caacaacaat gataaacacc gaacttgact gctttatgtc 360 tttgtttaat gatgaatgct tcgagttaat cactgaaaac acaaatttgt atgctaatca 420 gaaaaaaaca agaaaatggt accccacaaa tgtcaacgaa atgaaagctt ttattagtgt 480 ggcaataaaa cagggtgtaa ataatgataa taaaaaacca attcaagaat tgtattcaaa 540 agaagcttta cccttttacc tcgccatttt tccgttatat cgtctaaaat ctcttttata 600 taatattcat tttgacaaca aacttaatag gtatcaaaga caatgtactt cttttcgaaa 660 agaagaagct atagccgact tatacgacaa attaaatata aattttcaga atgtgtattc 720 tccaggtaaa aatgttgtaa tagatgaaat gatgtgccct tttcacggaa aatgtccttt 780 caaagtatac atgccagcaa agccagaaag gaatggattg aaaacttatg ccgttgtgga 840 ttgtgaatcg ctgtatgttt ctaagataaa gttgtattgt ggaggagaat tcaatccaaa 900 aaataacaaa ttcattaaca caatttctta cacacaaagg aagactactg ttacccacgt 960 caaaaaacag aatttattgg aactaaactc tatatcaaaa aaaccaagaa aagaaaaatc 1020 caaaggaaac ccgataaaga acacagtgat atttctaatg aatcacttgt ttggacaaaa 1080 acatgtgctc tgttgtgaca acttttttgg aactattgat ctaatcaaag aactggtgga 1140 caaacagtgt gattacattg gcacagttag acaaataagg tttcaagtgt gtaaaaaatt 1200 cacagaagaa aacaaattaa aaactatttt atacaaacat attaacttgg ggtggtcgat 1260 gttgggatac aaggagactg aaacgaaaca aagagtgtgt ataatatcaa cagagaattc 1320 aaaaacaaaa cttgatgaag aagatggaac accagagact gttgtaaaat acaacaaatt 1380 gaaaggtgga gttgatatgg ttgacaaaat atgttcctct tattcatctc gtagaacttc 1440 gagaagatgg agtgttactt acttttattt tcttttggat atatgtgtca acaatgcctt 1500 tgtgatgtgg gtgaaatcaa cgggtaataa caataggagg actttcatac acaaccttag 1560 ttttcaatta tcctcgtttt ttatctcaaa catattgata caaccaagac ttccccaaat 1620 ggagcgaata aaagcaatga aaaatatggg ttttgacatt tctacggaac atcgacaaaa 1680 caaatacata aaaaccattg gcgggaaaaa atctggctat tgtcacgatt gcaaacaaag 1740 aaaaacaact gcctgttgtg acatttgtag tgcgacctta ttcacttggt catgtggtga 1800 ttgcaatcgg gttttatgtg gtgaatgctt taaaaagata gcttgaattt attgttattc 1860 attttgtttt tttattttac aaaaggatct ggatagaaaa tatctagttt tccctttcca 1920 gcaaaaaaaa attggcaaaa acgagtgact tcctaagg 1958 // ID P-35_HM repbase; DNA; INV; 5488 BP. XX AC . XX DT 02-FEB-2009 (Rel. 14.02, Created) DT 02-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-35_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5488 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 9(2), 447-447 (2009). XX DR [1] (Consensus) XX CC The ends of this element are not clear. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(169..1068,1013..1597,1564..1980) FT /product="P-35_HM_1p" FT /translation="MCRTCGQXIKKCAGYVEPKSVHFYKEQLLKLYNILXD FT NXQTDIHPIYLCHSCHMRCFHIKNKCKSHFDTKDPFKFFPRSDNCILCFSD FT KKSAISELENYCDESSSHQIFCHLSVTAXNSVLKQINVAYDKKWNLKINEK FT XVSKSHCSLLNNLPKXLVVEXSNLIFSLITQSNVCNANCEYKDLVNNDLSS FT FVSEKNFKIETPYMESVTDCTVFRHKLCPILIPLTISKCEFCLELRKYLNL FT KQFRLKEKQNKIALDSGNNXMFSNKKNDRYMSKEELLFKIKILENDKKTNS FT NQIFAPPVRSWKMIKKQIATKSLHLQCRINQLIKKEGLIVDGETHKVMKEC FT LIDKNNNPFSESSPQYLLWXQQKIQVSRKDKRSMXWXPVLXRWCLSIYLKS FT PETYKHLTKSEFMYLPCENTXLKYIFXDPGCGFNADVIXRLXLXADIKNLK FT EHEKXXSXVFDEMKIKSGLVFSXTXGKIVGLTSLNDALXEFXXKLKEIKAX FT VXXKIEGNKSINSPTLAKYVIVFMVRGIFSSLCYPFGHFASSGFNSDQIFN FT CAWEATXILECIGFKVRSMVADGASPNRKFFRIHLAENYENVKDTTVHSTW FT NIWCSSIILITTLQCKNVAFSSXKKMLLIXKLKYF*" XX SQ Sequence 5488 BP; 2006 A; 647 C; 796 G; 1971 T; 68 other; aaaaacaata atgatcttaa acttttaatg tatttraaat ttatamataa cttgttaatt 60 ttaaataatt gtatagaaaa yawtattayt attawkatta ttaacaataa taatakaata 120 attttttata yagagtaaay ggaatmcyac aaaaaaaagt tkttaaaaat gtgtagaaca 180 tgtggtcaar taatcaaaaa atgtgctgga tatgttgaac caaaatctgt tcatttttac 240 aaggaacaac ttttaaaact atacaatatt ttagyggata atraccaaac tgatattcat 300 ccaatatatt tgtgtcacag ttgtcatatg agatgctttc atataaaaaa taaatgcaaa 360 agccactttg atactaaaga tcctttcaaa ttttttcctc gttcagataa ttgcatttta 420 tgttttagtg acaaaaaatc agcgataagt gaattggaaa attattgcga tgaatcttct 480 tcgcatcaaa tattttgtca tctttcagta actgctgrta attcagtcct yaaacaaata 540 aatgtagctt atgataaaaa atggaatctg aaaataaatg aaaaatwtgt ttcaaaatct 600 cattgcagtc ttctaaacaa tcttcccaag yttttggtag tagaaasttc taatctaatc 660 ttcagcctca taactcaatc caatgtctgc aatgcaaact gtgaatataa agatttggta 720 aayaatgacc tttcttcttt cgtttctgaa aaaaacttta aaatagagac tccatatatg 780 gaatcagtta ctgattgcac agtttttagg cacaagttgt gtcctatttt aattccatta 840 actatatcaa aatgtgagtt ttgtttggaa ttaagaaaat atttaaattt aaaacaattt 900 cgtctaaaag aaaagcagaa caaaattgct ttggatagtg gaaataatty aatgttttca 960 aataagaaaa acgaccgata tatgtccaaa gaagaactac tgtttaaaat aaagatcttg 1020 gaaaatgata aaaaaacaaa tagcaaccaa atctttgcac ctccagtgta gaattaatca 1080 actgataaaa aaagaaggac taattgttga tggcgagaca cacaaagtta tgaaagaatg 1140 cttgatagac aaaaataaca accctttcag tgaaagttcc ccccaatatt tattgtggga 1200 mcaacaaaag attcaggttt cacgtaaaga taagaggtct atgraatggm acccagttct 1260 taytagatgg tgtttgagca tttacctaaa aagtcctgag acatacaaac atttaactaa 1320 gtcwgagttt atgtaccttc catgtgaaaa cactckttta aaatayatat ttayagatcc 1380 aggatgyggc tttaatgcag atgttatytm tcgattartt ttrwawgctg atattaaaaa 1440 tttaaaagaa catgaaaaaa wtktttcttw agtatttgac gagatgaaaa tyaagagtgg 1500 tcttgtattc agtrmaacaa mtggtaaaat agttggtctg acctcattga atgatgctct 1560 tgawgagttt amarcaaaat tgaaggaaat aaaagcataa atagtccaac tttggcaaag 1620 tatgttattg trtttatggt tagaggaata ttttcatctc tatgytatcc atttggacat 1680 tttgcgtcta gtggatttaa tagtgatcaa atttttaatt gcgcttggga agcaacccrc 1740 attcttgaat gtattgggtt taaagttcgc tctatggtag cagatggtgc ttcrccaaat 1800 cgtaagttct ttcgaattca tcttgctgag aattaygaaa acgtcaagga tactactgtg 1860 cattcgacat ggaatatttg gtgttcaagt ataattttaa ttactacttt acaatgtaaa 1920 aatgttgctt ttagttcgra aaaaaaaatg ytgttaattw aaaaaytgaa atatttttaa 1980 caaatcacaa aaawtttgag taatattktt tagctcgcaa attttttgtg atgttcctca 2040 tttgataaaa actacactaa gtcagagttt atgtaccttc catgtgaaaa cactcgttta 2100 aaatatatat ttatagatcc aggatgcggc tttaatgcag atgttattta tcgattagtt 2160 ttataagctg atattaaaaa tttaaaagaa catgaaaaaa ttttttctta agtatttgac 2220 gagatgaaaa ttaagagtgg tcttgtattc agtgcaacaa atggtaaaat agttggtctg 2280 acctcattga atgatgctct tgaagagttt acagcaaaat tgaaggaaat aaaagcataa 2340 atagtccaac tttggcaaag tatgttattg tatttatggt tagaggaata ttttcatctc 2400 tatgttatcc atttggacat tttgcgtcta gtggatttaa tagtgatcaa atttttaatt 2460 gcgcttggga agcaacccac attcttgaat gtattgggtt taaagttcgc tctatggtag 2520 cagatggtgc ttcaccaaat cgtaagttct ttcgaattca tcttgctgag aattacgaaa 2580 acgtcaagga tactactgtg cattcgacat ggaatatttg gtgttcaagt ataattttaa 2640 ttactacttt acaatgtaaa aatgttgctt ttattcggaa aaaaaaatgc tgttaattta 2700 aaaattgaaa tatttttaac aaatcacaaa aaatttgagt aatattgttt agctcgcaaa 2760 ctttttgtga tgttcctcat ttgataaaaa ctactcgcaa taatttggag aaatctcatt 2820 ggaaccaaaa ttttagaaat ctaatggtaa gtttgatttt tagtgctaac agtggtcata 2880 atagttttta ttgtgatcct cattagagcg gttctagaga cacctttttt ttaaataggt 2940 gtcctccaag ccctaattgt gttctttatt ataaaataac actggattta aatttttttt 3000 aataaaaaat atttttaagg gacttctcaa gaccttaaaa tatcagactg catgcaacct 3060 ggtactcaaa agaagcgaaa tttaataaca atttaaaaaa cagcagtttt tataaatatt 3120 attattattt cattccaata aattgttgtt atttagctaa aaataaaaag tcaatataca 3180 caaatttttt tatattaatt tttttattta ttattttttt ataaaaaata aatttaaagg 3240 gttttgagca tactctttaa aatttttttt tgttttttgt tttaaattaa atccagtgtt 3300 attttattat aaagagcaca attaggggtt gatggacata aagaatggtg tttctagaac 3360 aaccctaatc ctcattaaat aattgcaaac tatttttaat tttgaaaagt taaatatcta 3420 ataataaaga tttctcgaga tgctcaagat cgatggtaat tctaataatt atttaagtat 3480 tcgaaaatat gatttataag tataaaatat ataattgttt tagattgagg atcagtttat 3540 aacatggcct caaattatga atgtctatga atgggatctg arcatgtttc ataatgctgt 3600 tggtctacgw atgggacaca aactcagaga tgagcwtata gagttaactc cacaatctag 3660 aatgctgatc aatttagctg cacaggttag aatgcataaa gagataaatt ttaaaatttg 3720 tataatactt tataatattt aaatgttata tttatactta aggtgttaaa tcaaactgtt 3780 gtacatatgt tggaggagca gggtaaacct gaaacaagat gtttacaaaa atttataaat 3840 ttaattgata ccttctttga ttgycttaat gtttcaaggc aatttaataa aactagaaaa 3900 ccggctttag atgtttacaa aactcatttg gataaaagat ttgaggtaca atatttcaat 3960 attgaagatt ttttatttaa attgtttact gttttatgca aaatggcgtc tttagttaaa 4020 tgcacttcat taaaatacgt tattttagat ttacttatat gtttattata ttttttattt 4080 taatattaat attactttta acctctacta taatatatat ttaatattat acatattcta 4140 accttaatac acataattaa cagatataaa tttawatgta tatayatata tatttattaa 4200 atattataca tattttaayc ttaatactag ttaatacttt tcagatattc ataaagtgtt 4260 ttttttttca aatattatta atatttttat ttttttggga caattttttg gcaataataa 4320 aaataattta tattaatatt atgttattaa ttattcaggg ttgccactct caatagcttg 4380 tcagtactta taataagaaa tttgcaaggt gtaggggttg ttgtaaaact tttattttgt 4440 cattaaaatt gtgtttaaat aaggaaaata atattcttgt gaattaacta gaaagagcat 4500 ggaaaattct cggaaaagtc aggaaattta aaaagctaaa ctcaagggaa aaggtttttt 4560 ctcagggaaa actaggggac aagtttttaa aattataaac cctgtttatt attcatatta 4620 ttattattat tattattatt attattatta ttattattat tattattatt attattatta 4680 ttataattat taatgttatt aataaatgaa tatattattt ctttaaatag tggttgaatc 4740 aaacgtttat taaattttta aatgaatggg aacaagccag tcaaaatgtt aaaaatttat 4800 cgataaaaga aaaagccaaa ctttgcttga gtaaacaaac cctagaaggg ttcagaatta 4860 caggtaaaaa ttgattaata tgaacaaaac ataaacttgt acttctatta cttgattgat 4920 aaagttgatt tcattaaatt taaatttttg gaatagtgca ctcatttact gaccttggtt 4980 caactctttt acaagaagaa ggtgtagagt acttactttc agaaaagttc agtcaagatc 5040 ctgtagagga atacttttcc aagcagcraa gaagaggggg aggaaacgaa aatccttgtt 5100 tagaagaatt taatcgtaac tttctaggtt taaatattgc yggcgacaat ctcattcgag 5160 ccttaaatgg aaattataga ggaagatttc aggaagatct aaaaattgat gttactgaca 5220 ccatgcatct gccgaagaaa aagccgaaaa aatattgatt ttatttattt aaaaacagtt 5280 acatgttatt tcatttcttt ttttttattt taaaacaatt tcgtctaaaa gaaaagcaga 5340 acaaaattgc tttgggtagt ggaaaaagca atgttttcaa taagaaaaac gacgaatatg 5400 tccaaagaag aactactgtt taaaataaag atcttgaaaa tgacaaaaaa ttttttttta 5460 taattccatt cttgtttttc cactctat 5488 // ID BEL-93_AA-I repbase; DNA; INV; 6436 BP. XX AC supercont1.273; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-93_AA_; KW BEL-93_AA-LTR; BEL-93_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6436 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.273; Positions 126123 132558. XX CC 'ATGTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 998..6310 FT /product="BEL-93_AA-I_1p" FT /translation="MAEREVLIGRRDTLLAALGRAEAFIENFDAGRDQGQV FT PLRLSHVDNIWANLETVQGQLEDTERTVEGRALHADIRAGYETQLFSIKAD FT LLSKMPPTSINARNPCSAQANSALSGIKLPTISLPEFDGDYQHWLTFHDTF FT VALIHSNADLPDIQKFHYLRAAVKGEAAQSIESIAISSANYSLAWDALKSR FT YSNEYLMKKRHLQALFDVQRMKRESAATLQSLVDEFERHVKILRQLGEPTD FT SWSSVLEHLLCTRLHDDTLKAWEDHASTVKDPNYTCLIEFLQRRIRVLESI FT SVNHHVPSGSTQPTASSSKKQQPIIRLSSYSSTASTAKRCPACNQPHPLAR FT CFKFQRLPLSERQQIVNAKRLCTNCLKVDHSNRNCPSEMNCRHCNRRHHSL FT LHSAAPDASRRSSSDSATVNSPTSATPVVQSFSPVAQRSEQSLVTATDIVP FT AVQTSVPIHHSRDNVFLLTVIVNIVDAFGQEHPARALLDSASQPNLITERL FT ASVLRLRRSNVNITVQGAGKLSKPVRQSVYTEIRSLNQQFSCGVNFLVMDK FT VTADLPSQHISTAGWNIPKELVLADPSFNKSQPIDMVLGAKHFYSFFPSAA FT RLQLAVNLPVLVDSVFGWIVAGSSTQARPDPTSDKATSNSVTISMVSLEES FT IERFWKSEELILNDGYSVEERHCETIYQSTVERNEEGRYVVRMPRQPDFDL FT KLGASKSNALRRFELLEKRLDRNLKLKEDYHAFMQEYLDLGHMRLVGVDEA FT EPPVAYYLPHHPVFKDSSTTTKVRVVFDASAKTSTGYSLNEALSVGPVVQD FT DLLDIMLRFRTFKVAVVGDIAKMYRQVLLHPDDRSLVRIFFRFSQSSPVQI FT YELLTVTYGLAPSSYLATRTLQQLADDEGHAYPLGGPALRKSFYVDDFIGG FT AQSVEEAIRLRTEMSELLTKGGFELRKWTSNELEVLKGLKDDQIGTQSSLE FT FSPNETVKALGICWEPESDTLRFDSNIHLDIEPPTKRSILSAISRLFDPLG FT LIAPVVVKSKILMQELWLLSCGWDDPVSAQVQQKWENIQHDLPNISSYRVN FT RYALLPNSAVQLHIFSDASESAYGACIYARCEDMKGQVRICLLASKSRVAP FT LKRVSLPRLELCAAVLGAHLYDRIKKAISIEVFATFFWTDSAITLQWIRSP FT PNTWKTYVANRVSEIQHFTHGCAWNHVSGSENPADLVSRGMTASDFIESKL FT WSSGPDWLTHQENEWPTSLPPNAPEADLEVRTISAAVTATSSINSLFLRWS FT SYTRLLHVVAYCQRFISNCRVKARTQSNELQAKIVSLHPRQLAEAKAILTR FT LAQSDSFPAEIKAIQEGQVVGKRSPLRRLSPFIDSERVVRVGGRLSLSQLP FT YQTKHPALLPKSHPFTRLVAEHYHKKLLHGGGRALLTAMREEFWPIDGRRL FT VRSVVRACFRCCRLNPVPVQQQIGQLPASRVTPSRPFSIVGVDYAGPFYLK FT AIHKRASPTKAYLCLFVCFATKAVHLELVSELSTSAFLATLRRFISRRGRP FT SDIHSDNGKNFEGAKNDIAELFALLSNNHAEIVSSCGTEGITWHLTPPKAP FT HFGGLWESAIKVAKKHLYRQIGSSRLSYEDMSTVLAQIEAVMNSRPLLPSS FT EDPNDLAALTPGHFLIGTSLLALPDPDLRSIPTSQLDHYWKLQAHIARFWQ FT HWQTEYIQELQKDTRSYMRNDNFLLGQLVIVVDEQQPTIRWPLARIVAVHP FT GADNITRVVSLRTVKGVIKRPIAKVCLLPCASSVPVDVTVSNDDLQVPAIA FT " XX SQ Sequence 6436 BP; 1529 A; 1682 C; 1516 G; 1709 T; 0 other; taaaattggt gccgtgacca ggatctcggt cgaagaacgt tccagaaagc tcctgcgcgt 60 cccgccattg cactatctgg aacaatcacc gctgtggtcg cgccattata gtcgccatcc 120 ctggggcaga ttaggagtcg ctgtgcgagg tttctgctga ttgaatataa ttcaaggctc 180 gatttatcgg gcgcaaggta attgcctgtc cgattttctt cgtgggtttc gcaggtgccc 240 tttagatcgt tgtttctttc attgcatgca ctgttcatcc ttcctcgttt ttcggagaca 300 tcggcctcta cgagtggact acgccgcccg gacaaattgc gccattgctg ccggtgatcc 360 tgcgacactt ttcgccatca cagtccagtt cttcggcgcc atatcttctg gaaagtgacg 420 ccaagccagc ccattgtgat accttctgta ttgataagct tctagaactg gtgattgttt 480 cgtcactggc cattgtggcg acaatacgac tgcttctttc gacgattgtt cctgctgcta 540 cgccatccta cttgcacctg ctcgttcagt gaggcctctg aaggcattat tctggatttt 600 ttctggatat ttgcgacgcc gtgtcgaatt aaacaggaat ttgtggattt ttgctacact 660 ttgtcatcgc tcgtgccgct ttgggaacgt tttcgacacc gtggaccaac ggttacccat 720 caacggcatt ttcgtcgatt ttgagtgttt gctgctgaaa tatcgacgtc ctgcaacctt 780 ttcggcgatt tggatcacga attgcacgga cggtgctcga atttggtcca gcctacggta 840 ttttgaagat tcaaggcacg ttttgccttg gaataggtga gtacctttcc aagtggcttt 900 attttctatc gtccggtcta ccgtggtttc ttttttggaa tagttttctg gaagtttctg 960 gattcgacga ccacgtcctg accaacgagt gccaaatatg gcggaacgag aagtgctcat 1020 cggcaggaga gatacgctgc ttgctgcatt gggtcgggca gaggcattca tcgaaaactt 1080 cgacgctgga agggatcaag gccaggtacc gctaaggttg agccacgtgg acaacatctg 1140 ggccaacttg gaaacggtgc agggtcagct tgaggacact gagaggacgg tagaaggtag 1200 ggcattgcat gctgatatcc gtgctggtta tgagacccag cttttttcaa taaaagcaga 1260 cttactttct aaaatgcctc ccacttcaat taacgctcgc aatccttgtt ccgctcaagc 1320 taactctgct ctctctggga tcaaacttcc gactatttca cttccggaat ttgatggcga 1380 ctaccagcat tggctcacct tccacgatac ctttgtcgcg ttgattcatt caaacgccga 1440 tcttcccgac attcaaaaat tccactactt gagagcagct gtcaaggggg aagctgctca 1500 atccattgag tcgattgcta tcagttccgc taactatagt ttggcttggg atgcgctcaa 1560 gagtcgttac tccaatgagt accttatgaa gaaacgccat ctccaggcgc tcttcgacgt 1620 tcaacgtatg aaaagggagt ctgctgctac ccttcaaagt ttggttgacg agtttgaacg 1680 acacgtaaaa atcctacggc aactgggaga acctaccgat tcgtggagca gcgttctcga 1740 gcatcttctt tgtacgagac tacatgatga cacgcttaag gcatgggagg accatgcgtc 1800 gacggtgaag gatccgaact acacctgcct cattgagttc cttcaacgga gaattcgggt 1860 tttggagtcc atttcagtga atcaccatgt gccatcggga tccactcagc ctactgcgtc 1920 gtcatcgaag aagcaacagc cgatcattcg tctttcgtcg tactcgtcca ctgctagcac 1980 agcgaaacga tgtcctgctt gcaatcaacc gcatccgttg gccagatgtt tcaaattcca 2040 acgtcttccg ctttctgaac gccaacagat tgtcaacgcg aagcgtctgt gcactaattg 2100 cttgaaagtc gatcattcca atcgcaactg tccatcggaa atgaattgta ggcattgtaa 2160 tcggcgtcat catagtcttc ttcactccgc tgccccagat gccagccgca ggtcctcaag 2220 tgatagcgca actgtaaatt ctccgacatc agcgacgcca gtggttcaaa gtttttcgcc 2280 agtagctcaa aggtccgaac agtccttggt gaccgctaca gatatcgtcc cagcggttca 2340 aactagcgtt cctattcacc actcaaggga taatgtgttt ctactcaccg tcattgtgaa 2400 cattgtcgat gcctttggcc aggaacatcc agcgcgagcg cttttggaca gcgcgtctca 2460 gccgaaccta ataaccgagc gactcgctag tgttcttcgt ctgaggcgta gtaacgtcaa 2520 tataacggtc caaggagcag gtaaactatc caaaccagtg cggcagtcgg tctacaccga 2580 gattcggtcc ctgaatcagc agttttcatg cggcgtcaac tttctggtga tggacaaggt 2640 tacagcggat ttaccgtcgc aacatatttc tacggcaggg tggaatattc cgaaagagct 2700 tgttttggcc gatccgtcgt tcaacaaatc gcagccaatt gacatggtac tcggagcgaa 2760 gcatttttat tctttcttcc ccagtgctgc tcgcttgcag ttggccgtca atctcccagt 2820 gctcgtagac agtgtcttcg gttggatagt agctggttct tctacgcaag cccgtccaga 2880 tcctacttcg gacaaagcca cgtctaattc tgtgaccatt tcgatggtgt cccttgaaga 2940 gagcatcgag cgtttctgga aatcagaaga attgatcctc aacgatggct actcggtgga 3000 agagcggcac tgcgaaacaa tatatcagtc cacggtcgag cgcaatgaag aaggtcgcta 3060 tgtagttcgc atgcctcgtc aacctgattt cgatctcaag cttggagcat cgaagtcgaa 3120 cgcccttcgg cgttttgagc tcttggagaa gcggctcgat agaaacttga agctcaagga 3180 agattaccac gcattcatgc aggagtattt ggaccttggt cacatgcgcc tcgttggagt 3240 agacgaagca gagcctcctg tggcttacta ccttccacac catcctgtat tcaaggattc 3300 gagtacgacc accaaggttc gagtcgtctt tgatgcttca gcaaaaactt ccacaggcta 3360 ctccctgaac gaagcacttt ccgtgggccc ggtggttcag gacgatctat tagacatcat 3420 gcttcgtttt cgaaccttca aggtggccgt ggttggcgat attgccaaaa tgtacaggca 3480 agttctactg catcccgacg acagatcgct ggttcgcata ttctttcgtt tttctcaaag 3540 ctcccctgtg caaatttacg aactgctcac tgtaacatat ggtttggcac cgtcatctta 3600 tctcgcaaca cgtaccctac agcagttggc agatgatgaa ggccacgcgt accctctcgg 3660 tggcccagcg ttgcgaaaga gcttttacgt cgacgatttt atcggaggag cacaatcagt 3720 tgaagaggcc attcgtttgc gaacggagat gagtgaactg ctgaccaagg gtggattcga 3780 attacgaaaa tggacttcga atgagctcga agtactcaag ggtttgaagg atgatcaaat 3840 cggcacgcag tcgtcattag agttcagccc caatgaaacc gtgaaagctc tagggatatg 3900 ctgggaacct gagagcgaca ctttgcgttt cgactccaac atccatttgg acattgaacc 3960 tcccacaaag cgttccattc tttctgctat ttcgcgacta tttgacccac ttggcctcat 4020 cgcaccggtc gtcgtcaaat ccaagatttt gatgcaggaa ctctggctat tatcttgtgg 4080 atgggacgat cccgtttcag cacaagtcca gcaaaagtgg gagaatattc agcacgatct 4140 tccgaatatt tcatcgtacc gtgtcaaccg ctacgctctt ttaccaaatt ccgctgtgca 4200 attgcacatt ttctccgacg cctcggagtc tgcgtacggt gcatgcatct acgcccgctg 4260 tgaggacatg aaaggacaag ttcggatttg cctgctagca tctaagtccc gagtagcacc 4320 tctcaaacga gtgagccttc cacgactgga actctgcgcc gctgtactag gtgctcatct 4380 ttatgatcgg atcaagaaag ccatcagcat cgaggttttc gccacgtttt tctggacgga 4440 ttcagctatt acgttgcagt ggattcggtc acctccaaat acatggaaga cctacgttgc 4500 aaatcgagtt tccgaaatcc aacacttcac tcatggctgt gcttggaacc acgtcagtgg 4560 cagcgaaaac cccgcggacc tggtgtctcg cggaatgact gcatcggact tcattgaaag 4620 caaactttgg agtagtggac ccgattggtt aacacaccag gaaaatgaat ggccgacatc 4680 tttacctcca aatgcacccg aagcggatct cgaggttcga acgatttctg cggcagtaac 4740 cgcaacatca tcaatcaatt cgttgtttct ccgatggtcc tcctacactc gccttcttca 4800 cgttgtcgcc tattgccaac ggttcatcag taattgtcgt gtcaaagctc gcactcaatc 4860 aaacgaactc caagctaaaa tcgttagtct tcatccccga caattagcgg aagcaaaggc 4920 gatcctcact cgcctagcgc aaagcgactc tttccccgcc gagatcaaag ccatacaaga 4980 aggccaagta gttggaaagc gttctcccct tcgaaggttg agccccttca tcgactcaga 5040 gcgagtagta agagtgggag gccgtctaag tctgtctcaa ttgccctacc aaactaaaca 5100 tcctgccctt cttcccaaat cgcatccttt cacgcgccta gtagctgagc attatcacaa 5160 aaagctgctc cacggcggcg ggcgcgcttt actaacagct atgcgtgaag aattttggcc 5220 aatagatgga cgcagattgg ttcgaagcgt tgtgcgtgca tgcttccgat gttgtcgtct 5280 taaccctgtg cccgttcaac aacaaattgg tcaattacct gcgtcccgag tgacgcccag 5340 tcgtccattc agtatagtgg gagttgatta tgccggaccg ttttacctta aggctatcca 5400 taagcgtgct tctcccacca aagcgtattt atgcctcttc gtatgtttcg ctacgaaggc 5460 ggtgcattta gagctcgtca gtgaactgtc cacatccgct ttcttggcta cgcttcgcag 5520 atttatctcc cgccgtggcc gcccgtcgga cattcattcg gacaacggca aaaatttcga 5580 gggggctaaa aatgacatcg ccgaattgtt tgccttgctt tccaacaacc atgccgaaat 5640 cgtgtcgtcc tgtggcactg aaggtataac atggcacctt acaccgccaa aagcccctca 5700 cttcggcgga ttatgggaat cggccatcaa agtagcgaag aaacatttgt accggcagat 5760 cggctcgtcg cgcctttcct atgaggatat gagcaccgtc ctggcacaaa tcgaggccgt 5820 aatgaattct cgcccactgc ttccatcgag cgaggatccc aacgatttag ctgccttgac 5880 tccagggcac ttccttattg gtacatcgct tctcgccctg cccgaccccg acctacgcag 5940 cattccgacc agccagttag accactactg gaagcttcaa gcgcacatcg cgaggttttg 6000 gcaacattgg caaacggaat acatccagga actccaaaag gataccagga gttacatgag 6060 gaacgacaac tttctgctgg gacagctcgt catcgtcgtc gacgagcaac agccaacaat 6120 ccgttggcct ttggctcgta tcgttgcagt tcatcctgga gctgacaata tcactcgagt 6180 tgtatctctg cgaaccgtca agggagtaat caaacgacca attgccaaag tttgcctgct 6240 tccgtgtgct tcatcggtgc ctgttgacgt cactgtatcc aatgatgatc tccaagttcc 6300 tgctatagca taagcgatgc aatgaagata actatttctc ttacaattaa gaaaaagttt 6360 gttagtagta taatttgatg tcgattagtt tgcttttgtt tgtttacatt ttgaatgtca 6420 ttcaaggcgg cgggta 6436 // ID DNA8-14_CQ repbase; DNA; INV; 1513 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-14_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1513 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 91-91 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% identity. CC 8-bp TSD. 376-bp TIRs. XX SQ Sequence 1513 BP; 434 A; 324 C; 320 G; 435 T; 0 other; taggatgtaa caaaaccaca cttttttgcg ggcatttcag gggatgatcc cctaggtgta 60 ctgagtcaga atcccaaata tgagcccgat tggttgcgac aggacctggc gctccggctt 120 caaagtttaa atgggattta acccgtaaaa tcggtgcgtt ttggtttttg ccatttttga 180 gtccttcaac gatttttgga tttttcaaaa acctcatgag cttatagttt gtgttccagg 240 gtacaacttt gccgaagact gcgaagagat ttgaccgctc tgaaaaatgg tacagaattt 300 acaaaatctt ttctcttgtt tttctgataa cgtaaaacat gttctggcaa cactcgcttt 360 tgaggcgcgc tttcggcaaa ctatgcaacg catgctacgg tgcgtcgcga cgcgtcggca 420 tgtatttgtc caagcgtcgt gttgttgttt ttgttgttgt gattgttgag aacgaaagtt 480 cttatggaac tgcaagaagc tctccggttg actctcaggg ttcagttgga tgcccaggca 540 gaaccgattc taaaccttgc tcgagtgcca actaacgcgg ctgtcttgaa agcgctttat 600 tacgcacgga tgttgtgtgc attttgaata ttccgtattc cggaagatat ttcgagagac 660 acgcatttta gtgagagttt ccactcgctt tgtaatcaat cgaagagtta tggcaatttg 720 gcaataaaat cggtcataaa caaattaatg tattgcattg acaacataag taaatactat 780 tttcatacag cagagttctc tggctagata ttttgcgaat tcattagaca ccaattgcac 840 tatgttcgag cataggttcg ctaacatgtc ccccagcacc acaactggtc gcgacgaaaa 900 agatccacac gggttaatga caggaccagg tgggcttaga actaaatcct gcgagtcact 960 gccaaataga atacaggaat ctacgacaga ttaaaagagc agcaccggtg atcgaatcct 1020 ccggcaccat cgaaaacata agcacaacat ccaaataaag cattttcttt ccatgtgtca 1080 aactttgacc gttgtacgct ggatgttttc tcgctgctat agttacgacg cctggctctg 1140 gatgcgcgcg ttcaaagaaa gtgttgccaa tactaatcaa acaaattaat ggcaaaaagt 1200 ttttttattt tcaaatcatt gtacaagcag tactaattat caaatctctt cacagtcttc 1260 ggcaaagttg tgccctgaaa ctcaaactat aagctcatga ggtttttgaa aaatccaaaa 1320 atcgttgaag gactcaaaaa tggcaaaaac caaaacgcac cgattttacg ggttaaatcc 1380 catttaaact ttaaagccgg agcgccaggt cctgtcgcaa ccaatcgggc tcatatttgg 1440 gattctgact cagtacacct aggggatcat cccctgaaat gcccgcaaaa aagtgtggtt 1500 ttgttacatc cta 1513 // ID Sola2-1_HRo repbase; DNA; INV; 2730 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Sola2-type DNA transposon: consensus sequence. XX KW Sola; DNA transposon; Transposable Element; Sola2-1_HRo. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-2730 RA Jurka J.; RT "DNA transposons from the Californian leech genome."; RL Repbase Reports 11(5), 1469-1469 (2011). XX DR [1] (Consensus) XX CC >98% identical to consensus CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX FH Key Location/Qualifiers FT CDS join(749..979,1020..2096) FT /product="Sola2-1_HRo_1p" FT /translation="MLHRCEECPGKLSVENNLRAIFEEAVISHDDNRSSLI FT SVQDTVDSFIERFSQKLDELSSHHYISKEQLKYLSDSKKIKLFGDNTGCNP FT ESTLEQHASYSSPNCSLLQKTYLFIYENLSVKSVCMISDCLDHNVSAVYAF FT QKHLINYVKQNLKNVTLIKYFSDGAVSQYKNFSNLIHHKEDFGLKAEWHFF FT ATFHGKSPCDGIGGTTKRLVARASLQASTKDQILNARDFYTYADAKINGIK FT FFWVDKKEIKDLLGMLEKRFRSADKIKGCRSHHSFIPDKNDQLLMKRLSSD FT LFGYNFNKNESEDTNDDYEPGRFVALVYNKKWYLGNIIERDDKNDDLKVNC FT LKKFKENIFYWPKNVDICYVPFKHILILIPDDNISYVGKIHITLPETVFKH FT ISTLYNNFKLKKKKKLTLEKNLFCLFLKLFYFSKKANANFIF" XX SQ Sequence 2730 BP; 1003 A; 361 C; 410 G; 956 T; 0 other; gggtcattct ctgtcaactt gaaccatggt ccacgcccga ccatctccga tttttttaaa 60 tttgaacagt aagtacctca atgtatttca tcgacaaaat aaaattttta taaaaaaatt 120 tcaattattt ttcaagttat gaagattttt gtgacagcat cctcaaattt ttgagcgatt 180 tttttcaaaa cttgtaccta cgtagaacgc tatatctctt gaactagcca acatatcttg 240 atgaaatttt ttgtgcatat cagtgaaaca atgtaataac aaaaaaaata aatgttattg 300 tctcgaatca aaggatattt tgaacaaaaa atctagaaaa aaatgaattt tttgaaaatt 360 tcaccgattt caaaacagcg tagctcaaaa tctatcgcaa aataaataaa tattttttgt 420 tattttagaa gtaccttata gtcacaaata aatcacaaaa tttcgttaaa atctgtattg 480 aattttttga gctatcggag ctcaaaatgt caaaaaaatg aaaaacttct tttgtttggc 540 gccattttta actatgaaga accaaacgtt ttttaatttt taaaatttgt ttatatagtg 600 cttttatagc ttaacaacat atcaaaaaat aaaattggcg tttgctttct ttgaaaaata 660 aaacaacttc aaaaacaggc aaaacaaatt tttttcaagt gtcaatattt tttatttttt 720 tcaatttaaa atttaaaatt tagaatgtat gttacacaga tgtgaggaat gcccaggaaa 780 actgagtgtc gaaaataact taagagcgat tttcgaagaa gctgtaattt cccatgatga 840 taatcgcagt tcattgattt ccgtacagga tacagttgac agtttcattg aaagattttc 900 gcaaaaactt gacgagttgt catcgcatca ttacataagc aaagaacaat tgaaatatct 960 tagcgattct aaaaaaattt aaaaagttcg gagtgtattg ctttaatgga ttttgctgaa 1020 aattattcgg tgataataca ggatgcaatc cagagtcaac attggagcaa cacgcaagtt 1080 actcttcacc caattgttct ttattacaga aaacttattt atttatttat gaaaacttat 1140 ctgtcaaaag cgtttgcatg attagcgact gtctcgatca caatgtttcc gctgtttatg 1200 cttttcaaaa acatttaatt aactatgtaa aacaaaacct gaaaaacgtt actttaatta 1260 aatatttttc tgatggtgcg gtaagtcaat acaaaaattt ttctaattta attcatcaca 1320 aagaggattt tggattaaaa gcagaatggc acttttttgc aacttttcat ggtaaaagtc 1380 cttgcgatgg aattggaggg acaacaaaaa gattggttgc aagagcaagt ttgcaggcat 1440 caacaaaaga tcagatattg aacgccagag atttttacac atacgctgat gcgaaaataa 1500 atggtattaa atttttttgg gtagacaaaa aagaaattaa agatctgcta ggaatgcttg 1560 aaaaaagatt taggtcagca gataaaataa agggttgtcg aagtcatcac tcatttattc 1620 cagacaaaaa tgatcaactt ttgatgaaac gtttgtcttc agatttattt ggatacaatt 1680 ttaataaaaa tgaaagtgaa gacaccaatg atgattatga gccgggtcga tttgtagctt 1740 tggtctataa taagaaatgg tatttaggaa acattataga aagagatgat aaaaatgatg 1800 atttaaaagt taactgtctg aagaaattta aagaaaatat tttttattgg cctaaaaatg 1860 tagatatttg ctatgtacca tttaaacata ttttgattct tatacctgat gataacattt 1920 cttatgttgg aaaaattcac ataactcttc ctgaaactgt ttttaaacac attagcacgt 1980 tgtataataa ttttaaattg aaaaaaaaaa aaaaattgac acttgaaaaa aatttgtttt 2040 gcctgttttt gaagttgttt tatttttcaa agaaagcaaa cgccaatttt attttttgat 2100 atgttgttaa gctataaaag cactatataa acaaatttta aaaattaaaa aacgtttggt 2160 tcttcatagt taaaaatggc gccaaacaaa agaagttttt catttttttg acattttgag 2220 ctccgatagc tcaaaaaatt caatacagat tttaacgaaa ttttgtgatt tatttgtgac 2280 tataaggtac ttctaaaata acaaaaaata tttatttatt ttgcgataga ttttgagcta 2340 cgctgttttg aaatcggtga aattttcaaa aaattcattt ttttctagat tttttgttca 2400 aaatatcctt tgattcgaga caataacatt tatttttttt gttattacat tgtttcactg 2460 atatgcacaa aaaatttcat caagatatgt tggctagttc aagagatata gcgttctacg 2520 taggtacaag ttttgaaaaa aatcgctcaa aaatttgagg atgctgtcac aaaaatcttc 2580 ataacttgaa aaataattga aattttttta taaaaatttt attttgtcga tgaaatacat 2640 tgaggtactt actgttcaaa ttttaaaaaa atcggagatg gtcgggcgtg gacttttttt 2700 taaatcgttc aagttgacgg agaatgaccc 2730 // ID Polinton-2_NVi repbase; DNA; INV; 8996 BP. XX AC . XX DT 14-APR-2009 (Rel. 14.04, Created) DT 14-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE autonomous Polinton DNA transposon - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8996 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 792-792 (2009). XX DR [1] (Consensus) XX CC The consensus is incomplete at both ends. XX FH Key Location/Qualifiers FT CDS 964..1929 FT /product="Polinton-2_NVi_1p" FT /translation="MEHAKKMVLVPHENVERLQSVLANGVRDNNVGSILKT FT VQTPGNVMTRLDAEMSNILNSSTCKSEREKWSQYRSVLQRYLQFKDVDSYA FT KVMKKEEKDKEQRQMKKKNDNDYDKGADDNYNDDDNDIQRVEEEDRIDASI FT IESVPPKYRRKAGHVLRKLRVSGNITWDANGGVTIGGVRIHHANMIDLIND FT TMRNRKCPPPLGHTQFAVALRESFIPREFIGNKRVWKNMMAGTSPGNSTPV FT PPGGASGVIPRRVLTPEGINRLFRTPSPRRAESASRANSSAIDSSEADKSS FT SGTVAARVLKRKKTVAGQASAKKSSSIHF*" FT CDS 2522..3622 FT /product="Polinton-2_NVi_2p" FT /translation="XTFITHLHTQQRTPVLKNFFKLHGQEKKWKKGEKEKF FT HRESVVEWLESQDAYNLHRPVRRRFARRSYNVRNLDDVWEADLMDLRSLKT FT YNDGYSYILTVIDVVSKFSWLEXIKEKTSRNVAEAFSRVLSRSNGRQPICL FT QTDKGKEFIGRETQQVLRDNDIVHRVARSPDTKAAIAERLIRTVKERIWRY FT FTHKNTRRYIDVLQNIIAAYNHSKHSATKMTPASVTVYNVAKARENLQRRY FT DKNEHKVMRPKFKVGELVRVSRARSVFDKGYERGWTLELFKVARISLTRQP FT PVYHLQDLSGEDIDGCFYAEELSRVRKDLEGASFEIESILRSRGKGRSKEY FT FVSWKGYPEKFNSWVKASQLSQI*" FT CDS 7764..8531 FT /product="Polinton-2_NVi_4p" FT /translation="MAYQEENKLRPVQLLSTLYTLDKSYTTEIIVGLEYSN FT LQDEYYRPAVRLLGDDSKGITFTHXQWLQFTKCFNDFFKYLCDHSSSSSDS FT MTGQQICGTGWTAKFTHSQRDRAIEIVDSSPPKTFAIVSKRNHSTIILKKA FT KFLRLYEYIVKCIDARLEYLKFVAGSVSCVARETVEYIKSVNPDNIPVRQF FT GIYTITLVKAIINLIDDKWFDSVTTTVQTRAMEMKDQIPLSRNDIVDIFYQ FT LVSHHLDRIVDILX*" FT CDS join(5744..6421,6411..7199,7066..7605) FT /product="Polinton-2_NVi_3p" FT /translation="MDPAGSFSVHIEEANLLVRRVKISPSILLAHAQSLSR FT ATAKYPLTRVEVKAVTMHSGVHGETLDNIILGQLPKRIILGFVNNKAFNGD FT RLLNPFNFEHFNINFLCLYVDGVQVPSKPLQPDFTTRNLYVDAYHTLFSGT FT EIHFLNEGNQVTRENYPHGYCLFAFDLTPDLSANDCSHWNLIKHGSVRLEV FT RFSNALTETVNCILYAEYDNILEIDASRQVIIDFSSISAVKNSLLLTDVAT FT LYCTYKCCACDFYSQSLPLIFDNPALSYTLHSIMDVIIDIQFFKDAKHRNV FT PKEVAVVAVDGNFSSHWILTPAIALDHLSEDIRNENDWLTQYHHGLDYFDG FT EVSFKSVKKILRELSKSVQKIYVRGNDKWLLLRKIIPRRIFNLEYDQECPS FT FAKLSSDVYCMHHAVKTCHSNYRCALNNAYRLKSWLSVRASKKAEIRNTLS FT DLCNGQSSDTKESIENSLAYCRCISSRSDTEGVDETDSVCCKYGAIYAMDS FT LQILKNLSKIRSHTVGVFPADQIPKVWTKPTAFVVNTDDHTRSGMHWVAVY FT ADKSCNAFYFDSFGLPPFIPDHINRLRKNCKHFRWNNIRLQSDSSDVCGQY FT CIMFLHYMSNGLGFKKFLENFSENLQKNDDVVRRYVCCKQRKKSDDDDFIG FT NGSCIVRCLQSCSSKMSLL*" XX SQ Sequence 8996 BP; 2823 A; 1783 C; 2059 G; 2327 T; 4 other; agtagtttct caaagatctc gatcgtctgt gcgatacaac aatcgagcgt gtgattatct 60 actacgcaga gtggcagcca agctacaagg agctacagag caaaaaagta gaatttcgcg 120 agggtttgcc acagactagt gactgggctg ctgatcctag acctaaactt gttatcattg 180 atgatttgat gcgtgaatcg tcatccagtg gtagtatagt cgatatattc tctaaagcgt 240 cccatcatca ctcgctctcg gtgatattca tcacgcagaa cgtcttccat cagggtaaag 300 gacaacgcga catatctctg aacgctcaat acattgtgat ttttaggaat ttgcgcgaca 360 gatctcagat actacattta tcgcgtcagc tgtgtccgga ggatccacga ttcttgcaag 420 aagcctactg ggacgccacc tcgagaccct acggttatct tctgctggat ttgaagcaga 480 atacaccgga caactgcaga tttcgcacct gtatatttcc ggacgacgag ttgcactacg 540 tctatgtgtc acgcaataaa aagataaaag gtggtgatac attgaacctt gtaccaatcg 600 cctccctgtg atgggttcgc ctaagaagcg acatcccttt actcgcaagg acgctgtcat 660 tctgaaggcg ctttttcatt taaataataa tcagcgaaag gctctcttgc agacggctga 720 ttcaaaactg gtgcgtcata tttgtgagtg tgccctgaat gtgctaattg gtaacgttcc 780 actggagaag tctcacaagt cacgtttacg caggcacgca aaaactttgc gaaaactagc 840 cgaaccttgt gtgagtctgt caaaaaagaa gaaaattatt gtacagcgtg gtggtttctt 900 gcctgcatta ttagcaccga tcatcggcac gctgttggct agtatcatta gtaagtagac 960 aatatggaac acgccaaaaa gatggttctt gtacctcacg agaacgttga acgtctccag 1020 agtgtgttag caaatggtgt ccgcgataat aatgttgggt ctatactaaa gacagttcag 1080 acaccgggta acgtaatgac caggttggat gctgaaatga gcaatatact gaattcgtct 1140 acttgcaaaa gtgagcgaga aaaatggagt caatatcgat cagtactaca acgctatctt 1200 caattcaaag atgttgatag ctatgctaag gtcatgaaaa aggaggagaa agataaagag 1260 cagcgacaga tgaagaaaaa gaatgacaac gactacgaca aaggcgctga tgacaattac 1320 aatgacgatg acaatgatat acagcgcgtc gaagaagagg atcggatcga tgcgagtatc 1380 atagagagtg tacctccaaa gtacagacgt aaagccggcc acgtattgcg taaattacga 1440 gttagtggta atatcacttg ggatgcaaac ggtggtgtga caataggtgg tgtgcgtata 1500 caccacgcca acatgattga tctgataaat gatactatga gaaatagaaa atgtccacca 1560 ccacttggac atacgcaatt cgccgttgct ctccgtgaat catttatacc gcgcgaattc 1620 atcggtaata agcgcgtttg gaagaatatg atggctggta catcacctgg gaattctaca 1680 ccagtaccac caggtggtgc tagcggtgtt attccacgtc gggttttgac accagagggc 1740 atcaatcgtc tatttagaac accatcacct cgtcgtgctg agagtgctag tcgcgcaaat 1800 agcagtgcta ttgacagtag cgaagcagac aaaagcagta gtggtacggt agcagcgcga 1860 gtactcaagc gaaaaaaaac agtcgccggt caagcgagtg cgaaaaaaag tagcagcatc 1920 cacttctgaa gttacaacat ggcgacgttt gagcgggaga ttcagcccga gcaagagacc 1980 aaaatcacag aagacgaaga acaagagcaa gaacgagcaa gtgtaaaaaa tagtgcacta 2040 gatgaattga tcgatctact gccgtttgaa atgcacgtgc ccggatacag attctgtggt 2100 ccaggtacaa aactagcgga acgtattgaa cgtggtgaag tcggcatcaa tccgctggat 2160 gaggcatgtc gtcagcatga tttagtctac ggggataagc aaggcaatcg acgtaaggct 2220 gatcgcgtgc tagctgaata cgcattttcg cgaatgctcg cggatgaaac agccagagac 2280 gagcgaacac tggcaatgat gactgcctgc tgtatggtga gcaagataac gttcgaaaaa 2340 ttttttcacg aataaaaaga gcgctgaagc agaaaaagaa aaaggacaag aaaataaaga 2400 agaaatctat tagcaaagtt gacgcatcgg tgaaaacagt gaaagagcat aaaaaaaatt 2460 gaaaaaagaa aaaaagatgc acaatgagta gtgggatgag ggggtaaaaa gaaaaatttg 2520 agsaacattt attacgcacc tccacaccca gcagcgtacg ccggtgttga aaaacttttt 2580 caagctacac ggacaagaaa aaaagtggaa aaaaggtgaa aaagaaaaat ttcatcgtga 2640 aagtgttgtg gagtggctcg agagtcaaga tgcgtataat ctacatcgac ccgttagacg 2700 acgatttgca cgacgtagtt acaatgtgcg aaatttagat gatgtgtggg aggctgatct 2760 gatggatttg agatctctaa aaacttacaa tgacggctac agttacattc tgacggtgat 2820 cgacgtcgtc agtaaattta gttggctgga gycgatcaag gaaaaaactt cgagaaacgt 2880 cgctgaggct ttctcacgtg tgttgtcgag aagcaatggg cgtcaaccaa tatgcctcca 2940 gacagataag ggcaaagaat tcattggtcg tgagacacag caagttttgc gcgacaacga 3000 cattgttcac agagtggcac gcagcccgga tacaaaggca gccattgcag agagactgat 3060 acgaactgtc aaggagcgaa tttggcgata ttttacccac aaaaatacac ggagatacat 3120 cgatgtgttg cagaacataa tagcggcgta caaccacagt aagcactcgg cgacaaagat 3180 gacaccggca tctgtgactg tgtacaacgt tgccaaagca cgagagaatt tacagcgtcg 3240 ttacgataag aatgagcata aggtgatgag acctaaattc aaagtcggtg agctggttcg 3300 cgtcagtcgg gcaagatctg tctttgacaa aggttacgag agaggctgga cactggagct 3360 atttaaagtc gcacggattt ctctgacacg gcagccacct gtctatcatt tgcaagatct 3420 gtcgggtgaa gacatagacg gttgttttta cgctgaggaa ttgagcagag ttcgaaagga 3480 tttggaaggt gcatcttttg aaatagaaag tattttgcgt agcagaggta agggtcgttc 3540 gaaggagtat ttcgtcagct ggaaaggtta tccagaaaag ttcaattcct gggtcaaggc 3600 tagccaactt tcgcaaatat gaaagatcaa ttttacatta ttttgcctag caacagcagc 3660 atgaattatt tctcggaaaa cactacaacg cattatgtta cacagttgcc acaacagatc 3720 aaattacaag gttcatggtt ggtcgcgctc actcagaagt tcaaatacct ctgacattcc 3780 aacacgtgcc atctgaaaaa gaggagagaa aaatttccct aaaacgaata ccacccacaa 3840 gacttgagac aaatgagacg agagaggaaa attttacaat cactgaatca ttgattcgtc 3900 cgggaaatta tacaaatatt ctcactctag ttgaggaaat aaataattta gaatgcatcc 3960 gtggtcattt gagggtgaca gttgaacgtg gcggttatgt aacgatcagc cgagtctgcg 4020 caaagagtag ctgctcacag ttcagtcacg agctctattt atcggaaaaa gtaaaaaaaa 4080 ttcttggatt tgaaaaagga gagacaagga atttgtttta ttggaaatca atacactata 4140 gagtcggata gacctgctgg actttccaat gggctaccat ccatgttcat gatctacagc 4200 gacatctgtg agccatatgt gacaggtgat gttcaatcgc gtctactacg agctgtatcg 4260 atgaataccg ataattacga atatggtact actcgaatca aaagtttctc cccaccaatg 4320 tatataccac tgttattcaa cgcatttcag acaattgaaa tagatataag agaccaatgt 4380 ggccagtcga taccattcga ttacgggaca gtgactgtaa cacttcattt caagaaagtc 4440 gatgactagt cttcatgaat cgctacgaag attatttcaa ctgccaattt ggcaatggct 4500 accagtacgg tgggggaggg ggacggaatc cttacaccgg tggagttggc cacatttaca 4560 tcggatctcc ctaccaaaga ggtcacggcg gtatcggctc ctttctagct ggcatatttc 4620 gccgcgtcct tccactcatc agtcgcgggg ccaaagctgt gggtaaagag gccgtacgta 4680 cgggattcaa catcatatcc gacgttgcat cacacaacac accggtgaag gagtctttcc 4740 gcaatcgtgt gagagaatct ggtgagattc tgaagagaaa agctgaggag aaactggaca 4800 agttgatgga ggggtcagga tataaaatgt cacgctacgg aaatcctgct cagttgcagt 4860 tgctcgctgg tcctgtcaca gcgcgtagga gaaaaagagg tgttggcgga agaaagaaac 4920 gcaagtcgag gagtggtggt ggtgtaaaga agagaagtgc aacaagaaag attaagagac 4980 agaagaagaa gaagaaaaca tctcgaaata aaaaaccttc gtctagaagg agaaaaagaa 5040 gcagaacgac gagagatatt tttgacagtt aaagatggcg ttcttacata cccattcgtg 5100 cgaatgttta aagtcggagc tgctattgtt tgacattcca ccgacccaaa caacaatcga 5160 aggctcgcat tgggttcaat ataagcctat atcgtctttg accgatgatt caccgataga 5220 atttgtgata cccgggaaca gcgatgaata tctcgatctg gctcatacaa tgcttagtct 5280 tcgtgtgagt ataaaatcta gtacaagcga agaagatgtg gctgaggctg atcgagcggc 5340 ttatagactt ttaacagccc gggtgggacc agtcaacaat tttatgcatt cactattcaa 5400 tcaagtggat gtatttttca atcagaaacc cgtgtcacca cctaccaatg cttatgccta 5460 cagagcatac atagagacgc tgctaaatta cggacctgcc gccaaaacct ctcacctgtc 5520 gactgtgatt gtccttgtgg tgtgacgata cggcatggaa aaatgaacaa cacagaaaac 5580 ctgaacgagg ggtttgtgga aaggcgaaag ttacttgctg caaataaacc tgttgaccta 5640 gttggacacc tccatacaga tgtgtttaat caagagaagc tactgctcaa cggcgtcgaa 5700 gtaagggtgc gtttgagtga aatcgcgcga taacttttgc ctgatggacc ctgcgggctc 5760 tttttccgta cacattgagg aggccaatct tctagtgaga cgtgtaaaaa tcagtcccag 5820 tatattgctt gcacatgctc aatcgctctc acgagcaacg gctaaatatc cattgacacg 5880 ggtggaagtc aaggcagtca ctatgcacag tggtgtacac ggtgaaacat tggacaatat 5940 catattgggt cagctgccga agagaatcat cctgggtttc gtcaataaca aagcttttaa 6000 tggtgatcgg ctgctcaatc ccttcaattt tgaacatttt aatataaact ttctatgttt 6060 gtacgttgat ggggtgcagg tgccctcaaa accactgcaa ccggacttta cgacgagaaa 6120 tctatatgtg gatgcgtatc acacactgtt ctccggtacg gaaatacact tcctcaacga 6180 gggtaatcaa gtgacgcgcg agaactatcc ccacggctac tgtctatttg cttttgattt 6240 gactcccgat ttatcggcga atgactgcag ccactggaat ttgattaaac acggtagtgt 6300 tagactagaa gtacgtttta gcaatgcact aacagaaacg gtcaattgca ttctgtacgc 6360 agagtatgat aatattttgg aaattgatgc ctcgcgtcaa gtgatcatag atttcagcag 6420 ttaaaaacag tttacttctc acagatgtcg ctactctcta ctgtacatat aaatgctgtg 6480 catgcgactt ctacagtcag tctcttcctc taattttcga caatccagca ctgtcatata 6540 cgcttcactc aatcatggat gtgatcattg acatacagtt cttcaaggat gccaaacatc 6600 ggaatgtgcc aaaagaggtt gccgtggtgg ccgttgatgg gaactttagt agtcattgga 6660 tactgacacc tgcgatagcg cttgatcatc tgagtgaaga tatacgaaac gagaacgatt 6720 ggctgaccca atatcatcac ggtctggact actttgacgg cgaggtgtca tttaagtcgg 6780 tgaaaaaaat tttaagagaa ctttcaaaaa gtgtgcaaaa aatatatgtt agaggtaatg 6840 ataaatggct attgttacgc aaaataatac ctcgccgaat ttttaatctt gagtacgatc 6900 aggaatgtcc gtctttcgct aaattatcgt ctgatgtata ctgtatgcat cacgctgtta 6960 aaacttgtca tagtaattat cgatgcgctt taaataacgc ttaccggtta aaatcgtggc 7020 tgtctgtgcg tgcttccaaa aaagccgaaa ttcggaacac tttaagcgat ctatgcaatg 7080 gacagtcttc agatactaaa gaatctatcg aaaattcgct cgcatactgt cggtgtattt 7140 ccagccgatc agataccgaa ggtgtggacg aaaccgacag cgtttgttgt aaatacggat 7200 gatcatacga gatcgggtat gcattgggtc gccgtatatg cggacaaatc ttgtaacgct 7260 ttttactttg atagttttgg attaccccca tttatacccg atcatattaa tcgattgaga 7320 aaaaattgta aacattttcg ctggaataat attcgactac agagcgattc ttccgatgtt 7380 tgcgggcaat attgcatcat gtttttgcat tacatgagta atggtctggg cttcaagaaa 7440 tttctcgaaa atttttcaga aaatctgcaa aaaaatgatg acgtggtaag acgatatgtc 7500 tgctgtaaac aaaggaaaaa aagcgatgac gatgatttca ttggaaatgg cagttgcatt 7560 gtcaggtgct tacaaagctg tagttctaag atgtcgttat tgtaatccga ggagataatg 7620 ataccgtatc tctctcacac gtggctgttc ccactaccac cacaatcctt atccccacca 7680 taccttccca ccttcagact ttgtgctaag agtcagagtt agtcgttcga aggtggcggt 7740 ggtgatcgtg agagactcgc acaatggcgt accaggagga aaacaaactt cgtccagttc 7800 agctcctgag cactctgtac actttggaca agtcgtacac aacggaaatc attgtcggcc 7860 tggaatacag caacttgcaa gacgagtact acagaccagc tgtgcgacta ttaggcgatg 7920 actcgaaagg aattactttt acccacratc agtggttgca attcacaaag tgtttcaacg 7980 actttttcaa gtatctttgc gaccacagta gcagcagcag cgattctatg acaggtcaac 8040 aaatttgcgg tacaggttgg acagcaaaat tcactcacag tcagagagac agggctatcg 8100 aaattgtgga tagtagccca cccaagacat ttgcaatagt gagcaaacgg aatcattcaa 8160 ctattattct aaaaaaagct aagtttctac gattgtatga gtatattgtg aaatgcattg 8220 atgcgcgtct agaatatctt aagtttgtag cgggtagtgt aagttgtgtt gctagagaga 8280 ctgtagaata cattaagagt gttaatccag acaatatacc agtaagacaa tttggtattt 8340 atacgataac tcttgtaaaa gcaattatca atcttattga tgataaatgg ttcgatagtg 8400 tcacaacaac tgtgcagacg cgtgcgatgg aaatgaaaga tcaaattccg ctcagtcgta 8460 atgatattgt cgatatattt taccaacttg tatctcatca tcttgataga attgtcgata 8520 tattawaatg aaataggtga ttaattatgt ctgttttctt ttattaattt tactactatc 8580 cttcttgtaa atattatctt cgaaagcaag aaaaaaagtt accaatgtac atcacaattt 8640 gtataattat ctcattaaat atatattgaa taaataatct atcgttacaa aatataaaat 8700 acttatcttg tattatttat tgctatcctt tcttacccat tcccatagta gaaaattatc 8760 ttaaaaaata tcaagttata ttacattatt cataaaaaca ttcttaaaaa aaaataataa 8820 gtaaatcaat agagagataa attggggaga gggaacagga agggggaaag aagcgccttc 8880 ccccactatc gcatgacgtc actcctcgct gtggcgccag acatctatcc caccaccacc 8940 tctaggcaga taattggggt attgtttgtc tcgtccatag cgaggggtat actact 8996 // ID Gypsy-247_AA-I repbase; DNA; INV; 5934 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-247_AA_; KW Gypsy-247_AA-LTR; Gypsy-247_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5934 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1095-1095 (2011). XX DR [1] (Consensus) XX CC Positions [4801-5289] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 648..1904 FT /product="Gypsy-247_AA-I_2p" FT /translation="MKTHENRIIIDEGINHLNNILRNLNKAPNRKYRKITL FT EQKLINAKLTYGKVTDALAIIETEIKESELFFLTNAIRQVYSDVHILILSK FT LERAAIHKISLFTLSYAILFINKLLNKIKSKMAKVNIKTGATLINMYDGNP FT KNLDAFLDSVALFVDIVNNENAAASQAVKDAAKATTLRFIKTRLTETARQS FT IPENANLDQLIDALKTNCSSKTTAENVLAKLKNTKQTSSTEDFCTEVEKLT FT QELKSIYIRNRIPGDVALQMATKCGVDALIAGTKNSETKTILRAGTFTRLN FT DAIQKLHENDAVQKAQGPEDKHKNANAMQAQVFMANRNQYNRGRGRSGSFN FT YQRGNRNPHNRLGRNWQNSNQSSTIITNLVFKTREVNGIHEDMEIQEDITV FT KEDDQLFISCNLDRKRRINIANSRL" FT CDS 2503..5604 FT /product="Gypsy-247_AA-I_1p" FT /translation="MVVYSQEVEPGIFCGNTIISAKEPIFKFINTTNDEVY FT IAYANFRPKMEPLRNFHICKTQNSSKPNSERRAKILKGVNIQQIPTYAKND FT FEKLVTDYEDIFCLPDEKLTFNNFYTQNINLNNNVPVYIPNYKTIYSQGEE FT IENQVQKMLQEDIIEPSVSSYNSPILLVPKKSEDNDKKWRLVIDFRQLNKR FT ILPDKFPLPRIDSILDQLGRAKYFSTLDLMSGFHQIPLEENSKKFTAFSTN FT SGHYQFKRLPFGLNISPNSFQRMMTIAMAGLTPERAFVYIDDIVVIGCSLK FT HHLQNLRTVFERLRKYELKLNISKCKFFRTEVTYLGHKLTDKGILPDEAKY FT ATIKNYPVPKNVDDVRRFVAFCNYYRKFVENFSQIANPLNKLLKKNTKFIW FT TSECQHAFNLLRHNLMSPRILQYPDFSKSFTLTTDASDIGCGAVLSQTTET FT GEQPIAFASKTFIPAEKNKPTILKELLAIHWAINYFKPYLYGKRFTVRTDH FT RPLVYLFGMKNPTSKLTRIRIELEEFDFDVVYIKGKENVAADALSRIVTTS FT EELRQSNILLVNTRSMTRKKAQLKKIQEKIQEEIKEIDHLSVYETERPTET FT RKMMKLSTAVENNTMKFMIWKKDNKTLFAQVHQDVRNGSHALEHALSEIEK FT IAKGFKIQKLAMSADDYNFTLVPMNLFKIIAHKTLKQLQIIIYKEPLFVNS FT KDDILTILNNHHNTPLGGHVGQHRLYLRLREKYKWRDMKNSIAHFVKACEL FT CRRNKIIKHTKEPMRITTTPSKAFEIISIDTVGPLPLTIKNNRYCVTIQCD FT LTKYIVVIPIQNKEANTIAKALVENFILTYGNFLEMRSDQGTEYNNEVLNQ FT ISKLLQIKQTFSTPYHPQSIGSLERNHRCLNEYLRSFTNEHQSDWDEWTKF FT YEFSYNTTPHTDHNFTPFELVFGKKATLPHDTIQNNNEPIYNYDSYKSELK FT FKLKKSHEIAKQILLEQKNRRKEQFDNNINPISVNINDKIYLKNENRRKLD FT SFYLGPYKITKIDEPNCEIEHIYFQVKL" XX SQ Sequence 5934 BP; 2270 A; 1085 C; 998 G; 1580 T; 1 other; attaagaaaa tttggtgaag aaagtgatca acaaaatatg agaaaaaaaa tgtagaattg 60 gtggacaaat aattggattc gtgttttaaa agtgttctgg aaaaktaaaa tgggtaatga 120 agtcacaaca ctagccccac caagggtgcc ggaagaaatt atcaaaattg tgaatactaa 180 cgtacagcaa acagctcact tagaaagatc agctgacgca acgacgaaac tggtgtatat 240 tggaactgca gttttagtgc ttgtgtttct atacgttatt tatcatatca tcatcaagct 300 cgaacgactg cgaacccagg aagctataaa gcgagcagtt tcgcttgcct ccgtaacagc 360 aatcaagtaa gagcaatgac atttcaagtg cgacgcagtg acagtgaaat caacatgaga 420 atcaacggtc catccttgct gctgttttgt gcgatgttaa tggtgttgct gctaaaatac 480 atgtacaact ggctgtgcga ggaactgcga gactggaagg aaaatataca gatggcgaga 540 aacgcaatac ggtgacgtaa acggcattga agtgagttcg ttccatacgg aatttatttt 600 tttttcaata catttttata ttatattttg attctaattt atttgatatg aaaacacacg 660 aaaatagaat aatcatcgat gaaggtatca atcatttgaa taatatcctg agaaatctta 720 acaaagctcc gaacagaaaa taccgaaaaa ttactttaga gcaaaaatta attaacgcaa 780 aattaactta tggtaaagtt acggacgcat tagctataat tgaaaccgag atcaaagaat 840 cagaattgtt ttttttaaca aatgctatta gacaagttta tagcgacgta cacatattaa 900 tattatctaa acttgaaagg gctgcaattc ataaaataag cctgttcaca ttatcatacg 960 caattctctt tattaataaa ttgttaaaca agataaaatc aaaaatggca aaagtaaaca 1020 ttaaaactgg tgccacacta atcaatatgt atgacggtaa cccgaagaat cttgacgcat 1080 ttctagattc cgttgcacta ttcgtcgata tagttaataa cgaaaacgcg gctgcttcac 1140 aggctgtcaa agacgcagca aaagccacca ctcttagatt tatcaaaacg cgacttacag 1200 aaaccgctag acaatcaata cctgaaaacg caaatcttga tcagttaatt gatgcattga 1260 agacaaattg ctcatcaaaa accaccgctg agaatgtttt agcgaaactt aaaaatacaa 1320 aacaaacgtc ttccacagag gatttctgta ctgaagtgga aaagctcaca caagagctga 1380 agtccatcta catacgaaac agaataccgg gtgatgtagc gttacaaatg gccaccaagt 1440 gcggggtcga cgcattaatt gctgggacga aaaattcaga aactaaaaca atactgcgcg 1500 ctggaacatt taccaggctg aacgatgcta tccaaaaact acacgagaat gacgcggtac 1560 aaaaagctca gggcccagaa gataaacata aaaacgcaaa tgccatgcaa gctcaggttt 1620 tcatggccaa caggaatcaa tacaatagag gccgtggacg aagtggtagt tttaattacc 1680 aacgagggaa tcgcaaccct cataacaggt tgggacgaaa ttggcaaaat tccaatcaat 1740 cctctaccat aataaccaac ctcgttttca aaaccagaga agtgaatgga atacacgagg 1800 acatggaaat ccaagaggac attacagtca aagaggacga ccaactattt atttcatgca 1860 acctggatcg caagcgacga atcaacatag ccaacagcag gctctagcac taccaatgca 1920 acaacatcag caaaatcaac aaattccgat ggcgaatagc cagaacttta caggctccca 1980 acaaataatg ccaaatccaa attttttagg cgcacagttt ggacagcatg cacgataaat 2040 gcatcagttt ccaactatgt aaacctgaca ctagatttgt ctgatacccg ttgtactttc 2100 ttagtggaca ctggtgctga tatatctatc ataaaagcaa acatagtaaa acctacacaa 2160 atatattatc ccgacgagaa atgctttatt tctggaatcg gacataatgg tatttcgtca 2220 cttggtagca cttatgctaa tataatagta gatggcacat cggtgaacca aaaatttcaa 2280 atagtggaaa acgattttcc aataccaacc gatggcatta ttggaagaga ctttttaagt 2340 gtgaaccaat gtaaaataga ttatgagcct tggttgttgt cgtttaaagt aaagcagcaa 2400 gaaatttcaa tcccaattga agacaatttt cagagaaaac tttttctacc tccccgccat 2460 gaagtcactc gatacatacc aggattagag ttacaagaag atatggtggt gtattcacaa 2520 gaagttgaac ctggaatttt ttgtggtaac acaataatat cggccaaaga gcccatattc 2580 aaatttataa atacaacaaa tgacgaagtt tatatcgctt atgccaattt cagaccaaaa 2640 atggaaccgc ttagaaactt tcatatctgc aaaactcaaa attctagcaa acctaactcg 2700 gagcgacgtg caaagatttt gaaaggagtc aatatacaac aaattcccac ttatgcgaaa 2760 aatgatttcg aaaaactagt tactgattat gaagacattt tttgtttacc tgatgagaag 2820 ttgacattta acaattttta cacacagaac attaacttaa acaataatgt tcctgtgtac 2880 attcccaact acaaaacaat ttattcgcaa ggagaagaaa ttgaaaacca agtgcaaaag 2940 atgctacaag aagacattat agagccttca gtttcatctt ataattcacc aatactacta 3000 gtcccaaaga aatcagaaga taacgacaaa aaatggcgtt tggtaataga ctttcgtcag 3060 ctaaacaaaa gaatattacc agataaattc cccttgccaa gaattgatag cattttggat 3120 caacttggta gagcgaaata ttttagtaca ctggatttga tgtcaggttt ccatcaaatc 3180 cccttagaag aaaactcaaa aaagttcaca gcattctcaa caaattctgg tcattaccaa 3240 ttcaaacgac taccattcgg attgaacatt agtccgaata gttttcagcg aatgatgacc 3300 atcgctatgg ctggattaac cccagaacgt gcatttgtct atattgacga tattgtggtt 3360 attggatgtt ctttaaaaca tcatttgcaa aatttaagaa ctgttttcga aagattaaga 3420 aaatacgaat taaaattgaa catttcaaaa tgcaaatttt ttagaacgga ggtcacatat 3480 ttgggccata agttaacgga taaaggaata ttacctgacg aagcaaagta tgcaacaatc 3540 aagaattatc ctgttcctaa aaatgtcgac gatgtgcgaa ggttcgttgc tttctgcaat 3600 tactatcgca aatttgtcga aaatttttca caaatcgcca atcctttgaa taaacttctt 3660 aaaaagaata ctaaatttat ttggacatcg gaatgccaac atgcattcaa cctcttacga 3720 cataatttga tgtctcctag gattctacaa tatccagatt tttcaaaatc atttacatta 3780 acgaccgacg catctgatat tggatgtgga gccgtcttgt cacaaaccac agaaacagga 3840 gagcaaccaa tagcttttgc aagcaaaacc tttattccgg cagagaaaaa taaaccgact 3900 atactcaaag aattgttagc tatacactgg gcgataaatt actttaaacc gtacctgtat 3960 ggcaaacgat tcactgtaag aaccgatcat agaccactag tgtatctttt tgggatgaaa 4020 aatcctacat caaagttaac cagaatccga atcgaattag aagaatttga cttcgatgtt 4080 gtatatataa agggtaagga aaacgtagca gctgatgcgc tttcgcgaat agttactacg 4140 tctgaagaat taagacaatc aaatatatta ttagtaaata ctaggtcaat gacccggaaa 4200 aaggcgcaat taaagaaaat acaagaaaaa atccaagaag aaataaaaga gattgatcac 4260 ctctcagttt atgaaaccga acgtcctacg gaaacgagaa aaatgatgaa actgtcgaca 4320 gccgtagaga acaacacaat gaaatttatg atatggaaaa aagataataa aacacttttt 4380 gcacaagtgc atcaagatgt tcgaaatgga agtcatgcat tagagcatgc tctttcagaa 4440 attgaaaaaa ttgccaaagg atttaaaata caaaaattag caatgtctgc agacgattat 4500 aatttcacat tggtaccaat gaacttattt aaaataatag cgcataaaac tctaaaacaa 4560 ttacaaatta taatttataa agagcccctc tttgtcaata gtaaagatga tattttaact 4620 atattaaata accaccacaa cacaccactc ggaggtcatg ttggtcaaca tcgcttgtat 4680 ctcagacttc gcgagaaata taagtggaga gatatgaaaa attctattgc ccatttcgtt 4740 aaggcctgcg aattgtgcag aagaaataag attatcaaac atactaaaga accaatgaga 4800 attacaacaa ccccttcaaa agctttcgaa attatttcca tagatacggt aggaccacta 4860 ccgttaacca taaaaaacaa tcgctactgt gtaactatcc agtgcgattt gactaaatac 4920 attgttgtta ttccgataca aaacaaagag gctaatacca tagctaaagc tttggttgaa 4980 aattttattc taacttacgg taattttctc gaaatgcgtt ccgatcaagg tactgaatat 5040 aataatgaag tcctgaacca aattagcaaa ttacttcaaa ttaaacaaac cttttcaacg 5100 ccatatcatc ctcaatcaat cggttcattg gagaggaatc acagatgcct aaatgagtat 5160 ttgcgatcat tcaccaatga gcaccaatcc gattgggatg aatggacaaa attttacgaa 5220 ttttcataca atacaactcc tcatacagac cataacttta ccccattcga attagttttt 5280 ggaaaaaaag caactctccc tcatgatact attcaaaaca ataatgaacc tatatacaat 5340 tacgactcat acaaaagtga attaaaattt aaattaaaga aatctcatga gatagcaaaa 5400 caaattttat tagaacaaaa aaaccgaaga aaggaacaat tcgataacaa tatcaatcct 5460 atttctgtta acattaacga caaaatttat ttgaaaaacg aaaacagaag aaaattggat 5520 tcattttatc ttggaccata caaaattaca aaaatagatg agccaaattg tgaaatagag 5580 catatctact ttcaggtaaa attgtaacag tacacaaaaa cagaataatt aaagcataac 5640 tctaagaata acttgaacta gagtaccacc gagcctaacc tctcaaaaaa ataataaact 5700 caaaaaataa gtaaaagatg aggtaggtga aagttcccag ctatatgctg aggagtgcaa 5760 aattcggtta gctacatcta ggttaattta tattttatga actaaggaca aaaaaaaaaa 5820 aacgataact ctatcaaaca actatgtaag agataactgg agacacacat tcaaattttt 5880 cgaatctttt agaataattt cattacatta cattattctc gtaaaggggg atgg 5934 // ID BEL-20_AA-LTR repbase; DNA; INV; 200 BP. XX AC supercont1.208; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-20_AA_; KW BEL-20_AA-I; BEL-20_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-200 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.208; Positions 1526784 1526585. XX SQ Sequence 200 BP; 64 A; 38 C; 33 G; 65 T; 0 other; tgaaagagta aagtttgaaa tacagtccac atttctttcc gtgtaagttt ctcttttgca 60 ttacctgaaa aataaacaaa ctttgtatct ttgttcagtt ataaaaaaac gctgtaccta 120 ttaaacgcgt tcgtgttttg tcttttagta aaaccgaaga agtcgctgtc cagaccacgc 180 aaaagttgat tcccggaaca 200 // ID Gypsy-54_CQ-LTR repbase; DNA; INV; 274 BP. XX AC AAWU01036387; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-54_CQ_; KW Gypsy-54_CQ-I; Gypsy-54_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 488-488 (2011). XX DR Genome; AAWU01036387; Positions 5173 5446. XX SQ Sequence 274 BP; 62 A; 67 C; 64 G; 81 T; 0 other; tgttgggaca acctaatcag cgatggtccc caccagtccg cacgccgctg gactcgaact 60 tggcgtctgg aacgcccagg tcacgacgtc accacagcgc atacgtagag gaggttccgg 120 ttgtcaaatg gccattccgt gtttaattac ttacaagata acacgttgtt ttatttgttg 180 gacataaagt cttattttgt ccttgctttt aatttgttta atgttaattg tctctcacgt 240 acgacatcgg ccaccgggcg tacgtggtga taca 274 // ID BEL-2_SI-LTR repbase; DNA; INV; 426 BP. XX AC AEAQ01001442; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_SI_; KW BEL-2_SI-I; BEL-2_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-426 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01001442; Positions 2182 1757. XX SQ Sequence 426 BP; 108 A; 106 C; 105 G; 107 T; 0 other; tgtgcgagat cgcacccaac tatgttcgac agccgaattg ctcgttctcg cgccgcgacc 60 aagccagcag cggcgtgaca acgcatgaac ggcagcggga aagagcgtga tttccgatgt 120 ttgcgcgagc ttctagaaga attcgggatc gccgttgctg actaagcata agcaagagaa 180 gtcgaaaaac agccttcacg acgacggatt cctctaaaag cattcctcgc gcacattata 240 aataggcggc ttaatgccat taggggcact tggttcgaga ctgttctcgc gttcggtgcg 300 tcgccgacgt tttgtgaagg cctcaataaa atattaaatt ttgtcaagtc attagtcaag 360 cattagtcaa ttcgccttat cccattactc cgcgtgctgt tttgcagtgc cgaaatattt 420 agctca 426 // ID PiggyBac-6_HM repbase; DNA; INV; 2384 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE PiggyBac-type family: consensus. XX KW piggyBac; DNA transposon; Transposable Element; PiggyBac-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2384 RA Bao W. and Jurka J.; RT "PiggyBac families from Hydra magnipapillata."; RL Repbase Reports 9(2), 455-455 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 471..2045 FT /product="PiggyBac-6_HM_1p" FT /translation="MADDNIELNDSFERLLLQSDDEHDSDLMETIQQDESD FT ESDSADSESTDGGSDDGSEEWSKNVQKPLRWDFDDGTAGMNMDMVMDCKEP FT VDYYNLFMNNELIDFIVEETNRYGPTKDSNYLPTNNMEMRKFMAIILQMGY FT VVLPKLDDYWSSDPAIGGKAICGGVMTRKRFYSLMRSLHFADNDNNNGSKL FT YKIEKFVNLFIARCQHLLVPGRDICIDESMIPFRGRLRFRMYIPSKRHKYG FT IKIFKLCSNNGYTYNLSIYAGKEDQPRYGSVAERVVLKLMENLLGHGRILY FT TDNWYTSMNLAKSLLENKTNLVGTARKNRIGFPKHVTSLKLKKGDYVAEQN FT GDGIMILKWKDKREVIMLSSIHDGSITERGKPAVIVDYNKGKSFNDLSDQL FT GSYCPYVRKTMKWYLRIFFHIITQVSIVNALHLYKLHSGKNIKIVQFKRDI FT VSSILQLEKPTSSGRKHQLEKEPGEGTIRKRRCTECYNKLSKEFGATIARN FT RASQIKTRCDNCLKPFCLACFQKCHTKC*" XX SQ Sequence 2384 BP; 836 A; 341 C; 409 G; 798 T; 0 other; cacgttgacc gccttgcctt tttttgaaaa aatcacattg ccaaataatc tttttttttt 60 tgaatatttc gtgtatacat atgtatacac aaataagtca aagaggagtt tttttttttt 120 aataattttt agtttttgtt tgaacattcg aacaataatt ttctcaaaac ctaaactatt 180 taacttttaa atatttaaaa atatattttt ccaaaatgtt ccagaacctt ccagcacgcc 240 gataaacggg tatataaacc tagcgttttc tgctaaacac cagttcagca attgtatgga 300 ctttaaatat tgttgaataa ataattgtga tcgactgaaa gaaaacttga ctgtcgaaac 360 ttaacagttt tcaacttttt tcctcaaata cttgattgat attactaatt ttttgataca 420 atcgtaaatt tttattattt tctagctatt aatttttctt atttattaga atggctgatg 480 ataatattga attaaacgat agctttgaaa ggttgctgtt acaaagtgat gacgaacatg 540 attctgattt aatggaaaca attcaacagg acgaatctga tgaatccgat agtgctgata 600 gtgaatcaac tgatggtggt agtgatgatg gttcagaaga atggtcgaaa aatgtgcaga 660 aacctttaag atgggatttt gatgacggaa ctgcgggcat gaatatggat atggtaatgg 720 attgcaaaga acctgtcgat tactataact tatttatgaa caacgaactt atcgatttta 780 ttgtggaaga aactaatcgt tatggaccaa caaaggattc aaactatttg ccgacaaata 840 atatggaaat gcgaaaattt atggctatca ttttgcaaat gggatatgtt gttctgccta 900 aactcgatga ttattggtcc tctgaccctg ctattggcgg taaagcaata tgtggaggtg 960 taatgacaag gaaaagattt tattctttga tgagatccct ccacttcgct gacaatgaca 1020 acaataatgg atcaaaatta tataaaatag aaaaatttgt caacttgttc attgccagat 1080 gtcaacactt actggttcca ggcagagata tatgcattga cgaaagcatg ataccctttc 1140 gcggcagact tcgatttcgt atgtatattc ctagtaaaag gcataaatat ggcatcaaaa 1200 tctttaaatt gtgtagcaac aacggataca cttataattt atcaatttat gccggtaaag 1260 aggaccagcc aagatatggt tctgtggcag aacgtgtagt tttaaaacta atggaaaacc 1320 ttcttggaca tggcagaatt ctgtatacag ataactggta tacaagtatg aatttggcga 1380 aatctttatt ggaaaacaag acaaacttag ttgggacggc cagaaaaaat agaattggat 1440 ttccaaaaca cgtaacttct ctgaaattga agaaaggtga ttatgttgca gaacaaaatg 1500 gtgatggtat tatgatttta aaatggaaag ataaaagaga agttataatg ctttcatcta 1560 ttcacgatgg tagcattacg gagagaggaa aacctgctgt tatcgtagat tacaataaag 1620 ggaaatcgtt taacgatcta tctgaccaat tgggctctta ttgtccatac gtacgtaaaa 1680 caatgaaatg gtacctcaga atcttttttc atataattac tcaagtttcc attgttaatg 1740 ctctgcatct ttataaatta catagcggaa aaaatatcaa aattgtacaa ttcaaaagag 1800 atattgtctc atcaattttg caactggaaa agccaacatc ttctggacga aaacatcaac 1860 ttgaaaaaga accaggagaa ggaactattc gaaaacgacg ttgtactgaa tgctacaaca 1920 aattatcaaa agaatttggt gccacaattg ccaggaatcg tgctagtcaa attaagacaa 1980 gatgtgataa ttgtttaaaa cctttttgtt tagcttgttt tcaaaaatgt catactaaat 2040 gttgatttta ttgattatgt tgattttatt tttttgttat aacggttttt tttaattcta 2100 atgttatttt ctgttttatt tttttcaaat aaagttatct tttaccacag tgttttttat 2160 aatttgaacg aagttttttt tcttaaaacg ctatctattt aaacgccatt aaattttaag 2220 aattgataaa tataaagaaa caattggaaa caagtaaaaa ttagaatact taaaattaac 2280 aaaaataaat ttactatcaa aatcgtgtat ccatatggat acacgacttg aagaaacgag 2340 agtttacgcg tgtatacata tggatacacg atgcggtcaa cgtg 2384 // ID Gypsy-209_AA-I repbase; DNA; INV; 4244 BP. XX AC AAGE02026967; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-209_AA_; KW Gypsy-209_AA-LTR; Gypsy-209_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4244 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026967; Positions 2075 6318. XX CC Positions [3225-3683] - Integrase core CC 'CAGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 16..2904 FT /product="Gypsy-209_AA-I_1p" FT /translation="MVDNARAAGGDQPPIAAAGGGGGGGNLPLNAAAVGGG FT NLPPSFHIDAYDRKKTRWSRWVERLETAFTIYGVVDDNLRRNLLLHLMGPE FT TYETLCDKIAPDNPRTRTFQVIVDTLEQYFNPRPLEISENFRFKCRRQGDK FT DALSPDESVEEYLVALRRIAVTCNFGGYLDTALRNQLVFGLKRNDIRGRLL FT ERRNLTLQDALDVAVSMELSQKGGAEIAGSLSKSEVNAVQHRRGKVYDKKV FT GGKSPSAGKSTGEKYCFRCGDKSHLAKACKHQNTVCSYCNIKGHLERVCMR FT KSSANKGHSGKSEAKYSTVKAIDQRSSSTDDIDVGEVMYGEICALSSSSAA FT KLWLTLVLNGVAVRFEIDTGSPVSIISAQFFQRYFKECHLRKCSLNLVSYC FT DTNIRVLGVLDVIVDCCGVRANLPLYVVDSSKHPLLGREWLKVLNVNWNTI FT LKKPAVVNVIDRAIGSCDTAEHLEQIFAKYAKVFDDSMGKISTVQAKLSLK FT PNTQAVFLKARKVPFNLRAAVDKELDKLESEGVLTKVSHSSCATPIVPVKK FT ADNRVRICGDYKQTVNPHLKVDQHPLPTIDELFASLAGGQKFTKIDLVQAY FT LQMEVAEEDRDMLTLNTHRGLYRPTRLMYGVSSAPAIWQRQIETILQGIEG FT VSVFLDDIKITGPTDAIHLHRLEEVMRRLNYYGIRVNRKKCEFFTDQIEYC FT GYQIDREGIHKVQQKVDAIVSMPRPRNKEEVRSLVGLVNYYGRFYENLSTV FT LYPLNNILKNEVPFVWTKECEKSFRTVKERMQKDNCLVHYSPDLPLILATD FT ASPYGVGAVLSHVFPDGTERPIQFASQTLNRTQQRYMQVDKEAYAVVFGVR FT KFFQYLYGRKFVLITDNQAISKIFSENKGLPVMSALRMQHYATYLQSFDYE FT IRFRKSADHANADALSRLPMQATDPENVIEECDVVEISQIDTLPLTAAELS FT EAQLRISL" FT CDS 2910..4193 FT /product="Gypsy-209_AA-I_2p" FT /translation="MLIQGLKHGQNVDGKHRFGIDQSEFSFQKGCLLRGIR FT VYIPAVFRKRVLAELHSTHFGTSRTKSLARGYCWWNGMDKEIEELIANCAD FT CQSVRPMPAKVQPHHWEPATKPFQRVHADFAGPYMDCYFFILIDAYTKWPE FT IKVCRSITAENTEAMCREIFSTFGIPSVFVSDHEVQFTATSFQSFLKQNGI FT VHKMGAPYHPSTNGQAERYVQTFKQKLKALKCPRSQLNLELSRLLLTYRKT FT IHPATGQSPSIMMFGRQIRSRLDLMLPRGNEEAPPTTTVRELENGVRVRVR FT DFLTNNKWQFGKVVSKLGSLRYAVKLDDGRIWERHIDHIVKVGVGLLSSGS FT NDSPAREPLRDEPNCSTSTVPSTISPSSQEQETAVADPVAIVPPPDDPVSR FT VPEMSHSSVSPKSETKPPLRRSTRTIKAPQKLNL" XX SQ Sequence 4244 BP; 1140 A; 958 C; 1107 G; 1039 T; 0 other; gtttggcgaa cgaggatggt cgacaatgcg cgtgctgctg gtggtgatca gccaccaatt 60 gctgctgctg gtggtggtgg tggtggcgga aacctgccgc taaatgctgc tgctgtggga 120 ggaggaaatc tgccaccgag cttccacatc gatgcgtatg accggaagaa aacacgctgg 180 tcacgatggg tggaaagact tgagactgct ttcacaattt acggcgtggt agacgacaat 240 ctgcgacgaa acctgcttct gcatttgatg ggaccggaaa catatgaaac gttgtgcgac 300 aaaattgctc ctgataatcc tcgcacgaga accttccaag tgattgtgga tacactcgag 360 caatatttca acccgagacc tttggaaatt agcgaaaatt tccgtttcaa gtgtcgtcgt 420 cagggggaca aagacgccct ttccccggac gaatccgttg aagaatacct tgttgccctt 480 cgacgcattg ctgttacctg caactttggt ggctatctgg acacagcgct aaggaaccag 540 ctagtgttcg gcttgaagcg gaatgatatt cgaggacgcc ttctggaacg caggaacctc 600 actctgcagg atgcgttgga cgtagcggtc agcatggagc tgtcacagaa aggaggcgcc 660 gagattgctg gttcactttc gaagtcggag gtaaatgctg ttcaacaccg acgaggtaag 720 gtttacgata agaaagtggg aggaaaatct ccgtcagctg gcaaatccac gggagagaag 780 tactgctttc ggtgcgggga caaatcgcac cttgctaaag catgcaaaca ccagaatact 840 gtgtgctcct actgcaacat taaggggcac ctcgagcggg tgtgtatgcg aaagtcttca 900 gcgaacaaag gtcattctgg aaaatctgaa gcaaaatata gcacggtaaa agccatcgat 960 cagcgaagta gctctacgga tgatattgat gtgggtgagg ttatgtacgg cgagatttgt 1020 gcgttgagta gttcgagtgc ggcaaagctg tggttgacgc tggtactaaa cggtgttgca 1080 gttcggtttg agatcgacac tggcagtccc gtaagcatca tcagcgcaca gtttttccaa 1140 cgatacttca aggaatgcca cctccggaag tgttcactga atctcgtcag ctactgtgat 1200 accaacattc gtgttttggg agttcttgac gtgatcgtcg actgctgtgg agtacgagcc 1260 aacttgccgt tgtacgtggt ggattcgtcg aagcatccgc tgctcggaag agagtggctc 1320 aaagtgctga acgtcaactg gaacacaatt ttaaagaagc cggctgtggt taacgttatc 1380 gatcgggcga tcggttcttg cgatactgct gagcacttgg agcagatttt cgcaaaatat 1440 gcgaaggttt tcgatgattc tatggggaaa atatcaactg tgcaagcgaa gctgagctta 1500 aagccgaata cgcaggccgt attcctgaaa gccagaaaag ttccgttcaa cttacgtgcc 1560 gctgtggaca aggaactgga taagctggaa tccgaaggag tactaacgaa agtgagtcac 1620 agtagctgtg cgaccccgat tgtgccagtg aaaaaagctg ataaccgtgt acgaatttgt 1680 ggggattata agcagacagt gaatcctcac ctgaaagttg atcagcatcc cttaccaacg 1740 atagacgagt tgttcgcatc acttgcgggc ggtcagaagt tcaccaaaat tgacctcgtg 1800 caggcgtatc tccagatgga agttgctgaa gaggatcgcg acatgctcac gctaaacacg 1860 catcgtgggc tgtaccggcc tacccgtttg atgtatggtg tgtcatctgc tccggcaatt 1920 tggcaacgcc aaatcgaaac gattttgcaa ggtattgaag gtgtgagtgt tttcctcgat 1980 gacatcaaaa taacggggcc tacagacgca atccacttgc acagactgga agaagttatg 2040 cgcaggttga attactacgg catacgtgta aaccggaaga aatgcgagtt cttcaccgac 2100 cagattgagt actgcggata tcaaatagat cgtgaaggaa tacacaaggt ccaacagaaa 2160 gttgatgcga ttgtcagcat gccacgtccg agaaacaagg aggaagttcg ttcgttggtt 2220 ggactcgtca actactacgg ccgtttctac gaaaacctta gtactgtgct gtatccgttg 2280 aataacattc tgaagaatga ggtgccattc gtctggacca aagaatgcga gaagtcgttt 2340 agaaccgtga aggagcgaat gcagaaggac aactgtctag tacactattc cccagatctg 2400 ccacttattt tggccaccga tgcgtctccc tacggtgtgg gtgctgtgtt gagccacgtt 2460 ttcccagatg gaactgagcg tcccatacag tttgcttcac agacattgaa ccgcacccag 2520 caacggtata tgcaggttga taaggaggct tacgctgttg tgttcggtgt gaggaagttc 2580 ttccaatatc tctatggtcg gaaattcgtg ctgataaccg ataaccaggc gatttcgaag 2640 attttcagcg agaataaggg cctgccagtg atgtctgcac ttcgtatgca gcattatgct 2700 acatatctac aatccttcga ctacgaaata cgttttcgta agtcggcgga ccatgctaat 2760 gctgatgccc tttctcgatt gccgatgcaa gctacggatc cggaaaacgt gattgaggag 2820 tgcgacgtcg tagaaatcag ccaaattgat actttgccat tgaccgctgc tgaactttcg 2880 gaagcgcagc tgcggataag tctgtgagca tgttgattca agggctcaaa catggacaga 2940 acgttgacgg gaagcataga tttggaatcg accaatctga gttctcgttt caaaaaggat 3000 gtctacttcg tggtattcgt gtctacattc cggcagtatt tcgcaagcgg gtgctagcgg 3060 agctgcattc cacacatttt ggaacatccc gaaccaaatc actcgcaagg ggctactgtt 3120 ggtggaacgg tatggataag gagatagaag agttgatcgc aaactgtgcg gattgccagt 3180 cggttcggcc aatgccagct aaagtccaac cacatcattg ggaacctgct accaagccat 3240 ttcaacgcgt ccatgcagac tttgctggac cgtatatgga ctgctacttc ttcatcctaa 3300 tcgacgccta tactaaatgg ccggaaatca aggtttgtcg ttctattaca gcagaaaata 3360 cggaagctat gtgtcgtgaa atattcagca cattcggtat tccgtcagtg ttcgttagtg 3420 accatgaagt gcagtttacg gcaacatcgt tccagtcatt tctgaaacaa aatggcatcg 3480 tacacaagat gggagctccg taccatccct ccacaaatgg acaagcggag cgttatgttc 3540 aaacattcaa gcaaaagctg aaagccttaa aatgtccgcg gtcgcagttg aaccttgaat 3600 tgtcgagact ccttttgacg tacaggaaga ccatacatcc agcgacaggc cagtctccat 3660 ccatcatgat gtttggtcga caaatacgtt cacgtctgga cctgatgtta cccagaggga 3720 acgaggaagc acctcctact acgacggttc gagagttgga aaacggtgtc cgtgttcgcg 3780 taagagattt cctcaccaat aataaatggc aattcggaaa ggtagtttct aaacttggaa 3840 gccttcgcta cgcggtgaag ctcgacgatg gaaggatctg ggaaagacac atcgatcaca 3900 tcgtgaaggt cggcgtagga ttgttgtcgt caggaagcaa tgattctcca gcacgcgagc 3960 ctcttcggga tgaacctaat tgttcaacat caacggtgcc gtccaccatt tctccatctt 4020 cgcaagaaca ggagacagca gttgctgatc cggtggccat tgtcccgcca cccgatgatc 4080 cagtgagtag agttccagag atgtcacaca gcagtgtatc accaaaaagt gaaacgaagc 4140 cgcctttgcg gcgttctaca agaacgatta aagctcccca aaaacttaat ctttaatatt 4200 ttttgcactg tatccatttt tgcatttttc acaaagggga gaga 4244 // ID Copia-105_AA-I repbase; DNA; INV; 4071 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-105_AA_; KW Copia-105_AA-LTR; Ty1_copia_Ele38; Copia-105_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4071 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1506-2045] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 69..4061 FT /product="Copia-105_AA-I_1p" FT /translation="MEEEKFDRVLLPLFDGSNFAAWKFRMLILLEEHELEE FT CIHTYAAEVEELVVQEGDTNEVKAAKAKKLEKRMKKDRRCKSLLVSRIHDT FT QLEYIQGKQFPKDIWDALHRVFERRSIASRMHLKREMLQLRFEGGNLQQHF FT LRFDKLVREYRSTGAVLEDLDVVCHLLLTLGPSYSGVVTALETMPEENLSL FT EFVKCRLLDEETKRRGIESSSSSGGDAAFSGAKQVKKKKLICFHCKKEGHK FT QIDCSDRKQQGNQKNHRKSKANVAEADAGKGICFVGVSGGMDLPEDQRTRW FT YIDSGATDHLVRDKVLFSELHRLKKPVEIAVAKDGETIVAEFAGTVKIISV FT VNGKPIDCTISDALYVPKLRCNLFSVMKVEKAGMRVVFECGKAKVYNGSEI FT VASASRREKLYELDFYSTRQSAGDALLSCGRIRKSSELWHRRFGHLNENSL FT KQLMQSGMVSGMNMSSVDNSDKDMIVCESCVVGKQTRKPFSSSVAKRASRV FT LELVHTDVCGPVTPVGLSGVQYFVTFTDDWSHFTVVYLMQSKDQVVEYFEE FT YEAWATAKFGQKISRLRCDNGGEYKNKRFLKFCRSKGIQIEWTVPYTPQQN FT GLSERLNRTLVEKARAMLKDSGVDKRFWGQAVLTAAFLTNRSPTNALSTKQ FT TPFERWESRKPNVSNVRVFGCKVFVHVPDELRRKLDSKAWSGIFLGYSHNG FT YRVWNPMKKQIVVARDVDFVEDSVSLVKRNLPVDQVVHVPAEEEDDANSEL FT EVPEEDPSDGEFESFVEEEPAEEEEEIASGGRPQRNRSAPAWHQDFEMEYA FT SFALSAVNYVDEIPGTIAELQKRDDWKEWKAAIEDEMSSLKRNNTWTLVKK FT PEGRSVVSCKWVFRIKRGEAGRPEKYKARLVARGFSQKKGFDYSETYSPVA FT KMDTLRAVLALANQSGMHIHQMDVKTAFLNGELSEEIFMAQPDGFEQGRGL FT VCRLNRALYGLKQASRAWNDKFHRFVEKLGFTRSDNDQCLYTLGSGKEQVI FT PVLYVDDVLIASSSLKILQEVKRKLSERFEMTDSGEVRQFLGMNIERDFEE FT GVLRISQRNYLDGLLRRFDMSECKPRSTPIENRLKLSKADEKQRTDKPYRE FT LIGCLMYASLTTRPDLSAAINFFSQFQACPSDEHWNYLKQILRYVKGTLDV FT GLVYRKREGAALLEVFTDADWANSNVDRRSVSGYVCKIYGCTVSWTTRKQQ FT TVALSSTEAELAALCLAICHVVWMRRLVCDLGRKYGQAISVFEDNQSTIRI FT AEDSKDHSRLKHVDTKFHFVRDLVQRGVVAIKYVRSSDQEADIMTKGLPVA FT AFRGLCAKLGLERREDYRG" XX SQ Sequence 4071 BP; 1063 A; 854 C; 1249 G; 905 T; 0 other; ggttgtgggc cccggtcgcg ctttgatcga aaagtagtcg ggagtgtcgt ttttttgcgt 60 ttttcggtat ggaggaggaa aagttcgatc gtgtgttgct gccgctgttc gacggctcga 120 actttgctgc ctggaagttt cggatgctga tactcttgga agagcacgag ctcgaagagt 180 gcatccacac gtatgcagcg gaggtggagg agctggtcgt ccaggagggt gacacgaacg 240 aagtgaaagc cgcaaaagcc aagaagctgg agaagcgtat gaagaaggac cgccggtgta 300 agtccctgct ggtgtcgaga atccacgata cgcagctcga atatattcag ggcaagcagt 360 ttcccaaaga tatctgggat gccctgcatc gtgttttcga gcgtcgcagc atcgcaagcc 420 ggatgcattt gaagcgggag atgctgcagc tgcgattcga aggaggaaat ttacagcaac 480 attttcttcg cttcgataag ctggtacgtg agtatcgatc gactggcgct gttttggagg 540 atttagacgt ggtttgccat ctcctactaa cgcttggccc ttcgtattcc ggggtagtta 600 cggcactcga gacgatgcca gaagaaaatc tatcgctcga gtttgttaaa tgtcgactac 660 tggatgaaga aaccaagcgc cgtgggattg aatcgtcatc ctcgtcgggt ggagatgctg 720 ctttttccgg agcgaagcag gtgaagaaga agaagctgat ttgttttcac tgcaagaagg 780 aagggcataa gcaaattgat tgctccgata ggaaacagca ggggaaccag aagaatcatc 840 gaaaatcgaa agctaacgta gctgaggcgg atgctggaaa aggaatctgt ttcgtcggtg 900 ttagtggtgg catggatctg ccggaagacc agcgtacccg atggtacatc gattctggtg 960 caacggacca tctagtgcgg gacaaagtgt tgttcagtga gcttcatcga ctgaagaaac 1020 cggttgagat cgcggtggca aaagacggtg aaacgatagt ggctgaattt gccggcaccg 1080 tgaagattat atcggtagtg aacggtaaac cgatcgattg cacgatttca gacgctttgt 1140 atgtgcctaa actgcgttgc aacttgtttt ctgtgatgaa ggtggagaaa gcgggaatgc 1200 gtgtcgtttt cgagtgcgga aaagcgaagg tgtataacgg ctccgaaatc gtcgcgagtg 1260 cgtctcggcg tgaaaaacta tatgaactcg atttctattc gaccaggcaa agtgcgggtg 1320 atgccttgtt gtcgtgcggt cgaattcgta agagttccga gctgtggcat cgtcggttcg 1380 gccacctgaa cgaaaacagt ttgaagcaac tgatgcagag cggaatggtg tcgggaatga 1440 acatgagttc cgttgataac agtgataagg acatgattgt gtgcgaatcg tgcgtcgttg 1500 ggaaacagac gaggaaaccg ttctcttcgt cggtcgcgaa gcgcgcgtcg cgtgtgctcg 1560 agcttgttca cacggacgtg tgtggccctg taacgccggt tgggctgtcg ggtgtgcagt 1620 atttcgtgac attcaccgac gactggagcc atttcacagt cgtgtacctg atgcagtcga 1680 aagaccaggt ggtagagtac tttgaggagt acgaggcctg ggccacagcg aaattcggac 1740 agaagatttc ccgcctccga tgcgacaacg gaggggaata taagaacaag cgattcctga 1800 aattctgtag aagcaaaggc attcaaatcg aatggacggt cccttatacg ccccaacaga 1860 atgggctgag cgaacgactg aatcggacgc tagtcgaaaa ggctagagcg atgttgaagg 1920 attctggagt tgacaaacgg ttctgggggc aggctgtgct gactgcagcc tttcttacca 1980 accgaagtcc aacgaacgca ctgagtacga agcaaacacc atttgagcgg tgggagtccc 2040 ggaagccaaa cgtatctaac gtgagagtgt ttggttgtaa agttttcgtg cacgtcccgg 2100 atgaactgcg acggaagctg gactcgaaag cctggagcgg aatcttcctg ggatacagtc 2160 acaacggata cagagtatgg aatccaatga agaagcagat tgtcgtcgcg cgtgatgttg 2220 acttcgtgga agacagtgtt tcgttggtga agagaaacct acctgttgat caagtggttc 2280 atgttcctgc agaagaagaa gatgatgcta atagtgaact ggaggtaccg gaagaagacc 2340 cgtcagacgg cgagtttgaa agcttcgtag aggaagaacc agcagaagaa gaagaagaaa 2400 tcgcctccgg cggaaggcca cagcggaaca ggtcggcgcc agcatggcat caagatttcg 2460 agatggaata cgcaagtttc gccttgagtg cagtaaacta cgtggatgag attcccggaa 2520 cgatcgctga gctccagaag cgcgacgact ggaaggagtg gaaggcagcg attgaagacg 2580 aaatgagttc tctgaagcgg aacaatactt ggacacttgt gaagaaaccg gagggacgtt 2640 cggtggtgtc gtgcaagtgg gttttccgga ttaagcgtgg cgaagcaggc agaccggaga 2700 aatacaaggc caggttggta gcgagaggct tcagccagaa gaagggtttc gattactccg 2760 aaacatactc tcctgtagcg aagatggaca ccttgcgggc agtgttggcg ttggcgaacc 2820 agagcggtat gcacatccac cagatggacg ttaaaaccgc gttcctcaac ggtgaactgt 2880 cagaggagat ctttatggct caaccagacg ggtttgagca ggggagagga ctagtatgcc 2940 gcctgaatcg agccctctat gggctgaagc aagcttcgcg agcatggaac gacaaattcc 3000 atcggttcgt cgagaagttg ggcttcacac gtagtgacaa tgaccagtgc ctctacacgc 3060 ttggatccgg caaggaacag gtgatcccgg tgctgtacgt cgacgacgtc ttgatcgcaa 3120 gttcgtcgct gaagattctc caagaagtca agcgaaaact gtcggagcgg ttcgagatga 3180 cagactccgg cgaggtaaga cagttcttgg gcatgaacat cgagcgtgat ttcgaagaag 3240 gcgtgttgag aattagccaa cgaaactacc tggacggatt gctcagacgt ttcgatatga 3300 gtgaatgcaa gccgaggtcg acgccgatcg agaatcggct gaagcttagc aaagcggacg 3360 aaaagcagag aacggataag ccgtaccgtg aactcatcgg ctgtcttatg tacgcgtcgc 3420 tgactacaag gccggacctg tcggcagcta tcaacttttt cagccagttt caagcgtgcc 3480 ctagcgacga gcactggaac tatttgaagc agatactgag gtatgtaaaa gggacgctag 3540 atgttgggct ggtgtatcgg aagcgagagg gtgctgcttt gctggaagtg tttaccgatg 3600 cagattgggc gaatagcaac gtggacaggc gatctgtaag cggatatgtg tgcaagatat 3660 atgggtgtac tgtaagttgg acgacccgga agcaacaaac cgttgcttta tcgtctaccg 3720 aagcggagct agctgcgctc tgtttggcca tctgtcacgt cgtctggatg agaaggctgg 3780 tgtgcgatct gggaagaaaa tatggacaag cgatatcggt gtttgaggac aatcagtcca 3840 caatccgtat agctgaggac tccaaggatc atagccggct taaacacgtg gacacgaagt 3900 tccactttgt acgtgacctt gtgcagcgtg gagttgttgc catcaagtac gtgagatcat 3960 cggatcaaga agcggacatc atgaccaaag gtctaccagt ggccgctttc agagggcttt 4020 gtgccaagtt aggactcgag cgtagagaag actaccgtgg ttaagcaggg g 4071 // ID CR1-81_HM repbase; DNA; INV; 4033 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-81_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4033 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 368-368 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(138..812,816..1928,1805..2350,2202..3740) FT /product="CR1-81_HM_1p" FT /translation="MVNVQSTEFNVFKAEQEKLIKALNRTIEELKNLNLLL FT SERVKMLENAESVKSSPLLSWASVVSGKTPVKKTVEQLNLINVVTAEKKER FT EKRENNVIIFGINPSKAIDKSSAIKEDKNSIQEVLNSINLKVEVKQIIKLK FT SKIDNQPPFVIVLNNKTDRNSILKAAKLLKNSSEFKNVYINPDLTEAERYK FT AKLLRDECKKNNLENSESLLYYYGIRNDRVVKIAKQLLFNNRKLSHMSHCA FT TELANSSRISCEYLISKSSKPDKIICTKKCRNKPINLNLINGNSNNGCNSN FT NGCSSNPYSKNSVYLLNARSLANKLHEFFMFTEANKPAIICVCETFFTSKI FT SDSIVCPKDYSIYRKDRIKIGGGVAIYCRNDIKSEQVQITQTDTDIDIVCI FT DLIFSSHNLRLITCYRPPFYSIADVAYIELMTAFIYNLCHSVSQFVIVGDF FT NFPKMDWLNYVSPNEKCHILFLNLVNNLGLHQFVQEPTRENNILDLVFSNN FT ISFLSLISVECPFSTSDHNTVHFSININDLDQSNKNIESFYDFVNADIDGF FT QSYLLNINWDFEFSFLFLQLKIIGTFSPIIFAMVFINLFQLERVILIKKLG FT LRVFLFVFTVEDYWNVFSNHFCNGIHQFVPIRKSNLNKKIKHYPSFIGKML FT NRKAFLWKRWSNTKNITHKKAYNKYASKCKKSLLTYQMRVELKLVKENNIS FT KFFNYVNRKLNNSKSIFPLKDKNNNNKITCSNEEIADVFNKHFGSVFTNDN FT NYSPFINSYVDDSIGIEDCTISTRSSVLVVTKKSLTCLTNTLAVSLQMITI FT TLHLLIVMLMIQLVLKTVLFPPEAVYKSLKEIKASTSYGPDGIPNILLKKL FT AHLLCIPLSFIFDASFKSSSLPKQWRQAFVSPVFKKGATADPNNYRPISLT FT CTCCRVMERMINVYVIKYLGHHNLISLNQHGFLNKRSTSTNLLESANDWNT FT ALDNHQITDVVYIDFQKAFDSVSHTKLLVKLASYNIKGNLLNWISAFLTDR FT YQQVKVGNSISNSIRIVSGVPQGSVLGPTLFLLYVNDLTAIXNNLDCCVKL FT YADDIKLYSSYCDMDLSQDLVVALNRLLQWAETWQLTIACSKCFALRIAPP FT SLCKGIKVNYKLGEYSLAWSSNPKDLGVTMDCNLNYKKHIFNVIHQANSRA FT YLILKCFKTRDPCILVRAFTTYIRPILEYCSPVWSPHQIGLIKSVESVQQK FT FTKKIRSLRNLSYHNRLQLLGIESLEVRRLKNDLVTCFKIFRKLVCTDQSM FT FFAIDNNSHTRGHNYKISKQSCRLDIKKI*" XX SQ Sequence 4033 BP; 1442 A; 657 C; 606 G; 1326 T; 2 other; ttttttggac ttttatagcg atcggacatg ttataagagc ttttcattaa aaaaaaaatt 60 tacggaaaaa ttcaaataaa gttaactata ctttgtaact ttttgtgaaa aaagtatatt 120 tgcatagcgt accaactatg gtgaatgttc agtcaacaga atttaatgta ttcaaagcag 180 agcaagaaaa actaataaaa gcattaaata ggaccatcga ggaactaaaa aatttaaact 240 tactgttatc agaaagagta aaaatgctcg aaaatgcgga atctgtcaaa tcatcacctt 300 tattatcatg ggcaagtgtt gtaagtggaa aaaccccagt gaaaaaaacg gtagagcaac 360 taaacctaat aaatgttgtt acagccgaaa aaaaggaaag agaaaaacgt gaaaataatg 420 taataatatt tggcattaat ccatcgaaag caatcgataa atcaagtgca attaaggagg 480 ataaaaattc aatccaagaa gttttgaatt caattaatct gaaagtcgaa gttaagcaga 540 ttataaagct caaatcgaaa atagataacc aaccaccttt tgttattgtt ttaaataata 600 agaccgatcg caattcaatt ttaaaagctg caaaattatt gaaaaattcg agtgagttta 660 aaaacgttta tattaaccct gatttaaccg aggcagaaag atataaagca aagttgttac 720 gagatgaatg taaaaaaaat aatttagaaa attcagaatc tttactatat tattatggta 780 ttagaaacga cagagtcgtt aaaatcgcaa aatagcaatt actttttaac aacaggaaat 840 taagtcatat gagtcattgt gcaacagaac ttgcaaattc gtcacgtatt agttgtgaat 900 atcttatatc taaatcgagt aaaccagaca aaataatttg tacaaaaaaa tgtcgcaata 960 aaccaatcaa tttaaattta ataaacggca atagcaataa tggctgcaat agtaataatg 1020 ggtgctcttc gaatccttat tcaaaaaatt ctgtatatct tttaaatgct agaagccttg 1080 caaataagct gcacgaattt tttatgttta cggaggcaaa taaacctgcc ataatttgcg 1140 tttgcgaaac atttttcacc agtaaaatat ctgactctat tgtttgtcca aaagattact 1200 caatctatcg taaggatcgc attaaaattg gagggggagt agctatttat tgtagaaatg 1260 acattaaatc ggaacaagta caaataactc aaacggacac ggatattgac atcgtctgta 1320 tagacttaat ttttagttcg cataatctcc gtttaatcac atgctatcgc ccaccttttt 1380 actccattgc cgatgtggct tatatcgaac ttatgactgc atttatatat aatctatgcc 1440 actctgtatc tcaatttgtt attgtcggtg acttcaattt tcctaaaatg gactggctta 1500 attacgtatc accaaacgaa aagtgccata tccttttttt aaatttagtc aacaatctcg 1560 gtttgcacca atttgtacag gaacctactc gtgaaaacaa tattcttgat cttgtatttt 1620 caaataatat ctcatttctt agtcttattt cagttgaatg tcctttcagc accagtgacc 1680 ataacactgt ccacttctct attaatataa atgatctcga tcagtcaaat aaaaatattg 1740 aatccttcta cgacttcgtt aatgccgaca ttgatggttt tcagtcttat ttattaaata 1800 ttaattggga cttcgagttt tcctttttgt ttttacagtt gaagattatt ggaacgtttt 1860 ctccaatcat ttttgcaatg gtattcatca atttgttcca attagaaaga gtaatcttaa 1920 taaaaaaata aaacactacc cgagttttat cgggaaaatg ttaaatcgta aagctttctt 1980 atggaaaaga tggtcaaata ctaaaaacat tacacacaaa aaagcatata acaaatacgc 2040 tagtaaatgt aaaaaatcat tacttactta ccaaatgaga gtcgaactta aacttgtaaa 2100 ggaaaacaat atcagcaaat tttttaatta cgttaataga aaattaaata actctaaaag 2160 tatttttcct ctaaaagaca aaaacaataa taacaagata acttgtagta acgaagaaat 2220 cgctgacgtg tttaacaaac actttggcag tgtctttaca aatgataaca attactctcc 2280 atttattaat agttatgttg atgattcaat tggtattgaa gactgtacta tttccaccag 2340 aagcagtgta taaatctttg aaggaaatta aagccagcac atcgtacggt cctgatggca 2400 tacctaacat acttctgaaa aaattagctc atctattgtg tattccactc tcatttatat 2460 ttgatgccag ttttaagtca agttctttac ctaaacaatg gcgccaagct tttgtatctc 2520 cagtttttaa aaaaggagct actgcagatc caaataatta tagacccatc tcactgacat 2580 gcacttgctg tcgagtcatg gaaagaatga tcaatgtgta tgtcataaag tatttaggtc 2640 atcataattt aatatctctt aatcaacatg gtttcctcaa caaacgttca accagtacaa 2700 atctcttaga atctgctaat gattggaaca ccgcactcga taaccatcaa ataacggatg 2760 ttgtctatat tgatttccaa aaagcgtttg actctgtctc gcataccaaa ctattagtaa 2820 aacttgcatc ctataacata aaaggtaatc ttttaaattg gatttccgct tttctcaccg 2880 atagatatca acaagtcaag gtcggtaatt ctatatcaaa ctctatacgc attgttagtg 2940 gagttccaca gggtagtgtg ttaggtccca cattgttttt attatatgtt aatgatttaa 3000 ccgctatarc taataatctg gactgttgtg taaaacttta ygcagatgat ataaagttat 3060 atagttcgta ctgtgatatg gatctaagcc aagatttagt tgttgcttta aatagacttt 3120 tacaatgggc agaaacttgg caactaacaa ttgcttgtag taagtgtttt gcacttagaa 3180 tagcacctcc ctcattatgc aaaggtataa aggttaacta taaattaggt gaatattctt 3240 tggcttggtc ttcaaaccca aaagatctag gagtaacaat ggactgtaac cttaattata 3300 aaaaacatat ttttaacgtc atccatcaag ctaactctag agcatactta attttaaaat 3360 gttttaaaac acgtgaccct tgtatcctag ttagagcttt cactacctat atacgaccaa 3420 ttttagaata ttgttctcca gtatggtcac cacaccaaat tggattgatt aaatcagttg 3480 aaagtgtcca acaaaaattt acaaaaaaaa taaggagcct cagaaatctt tcctatcata 3540 atagattaca gttgcttggt atcgaatcac tcgaagttcg gcgccttaaa aatgatttag 3600 taacttgctt taaaattttt cgtaaacttg tctgtactga tcaatctatg ttctttgcga 3660 ttgacaataa cagccacacc cggggtcata attataaaat aagcaaacaa tcttgtcgct 3720 tggatattaa gaaaatatag cttttcttac cgagttgttg acatgtggaa caatatgacc 3780 tcggaggctg taaatgcaaa taacatttta atgttcaaaa ataaaattaa atctttcgac 3840 tttaacaaat atctattgta aaaggaacaa ctatttgttt ctttatagtt ttgttatttt 3900 attttttttt tgtagcgata tttttaaatt tctttgggca cgttgttagt gtccatcttt 3960 gtatgggcct tcgtgtcctt tagaacatgt atttatactt tgttctaata aatatcaatc 4020 aatcaatcat cat 4033 // ID Sola2-2_DPu repbase; DNA; INV; 3883 BP. XX AC ACJG01001611; XX DT 17-FEB-2011 (Rel. 16.02, Created) DT 17-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE Sola2-type DNA transposon from Daphnia. XX KW Sola; DNA transposon; Transposable Element; Sola2-2_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Direct Submission to Repbase Update (09-FEB-2011). XX RN [2] RP 1-3883 RA Jurka J.; RT "DNA transposons from Daphnia."; RL Direct Submission to RU (24-MAY-2010). XX DR EMBL/GenBank/DDBJ; ACJG01001611; Positions 16507 20389. XX FH Key Location/Qualifiers FT CDS join(1384..1557,1707..3050) FT /product="Sola2-2_DPu_1p" FT /translation="MDDEYSRVMPGRKDVKSVKKPGERRVRLQKRLLLMNI FT DELYSHYKDNAVKTLLMKPCGGQISLRYRQSGLHDVKVCFLSPKIRLLEHL FT RSFTNERETIKFQQWESTDRTALAVSELPTDDFLRKLVDMISDLTRHHFVA FT KKQGKYCRDLKETLKPNECLLQGDYSQKYSMTVQEATQGMFFNAPRQATLH FT PFLAYLNINGKIVPHSMCVFSDCLNHDAVSVNAFLKPVLQHIKIISPSIDT FT IKYFSDGAVGQYKNCKNFVNLLRHEEDFNIKYAEWNFFSNSHGKGPCDGIG FT GTIKRLAYRHSLQGGDIQTPFALFLWAEQQIENLKMFYVSSEDVNGNQKRL FT EGRLCEARTLTGTQSFHRFVPSLTSKYHMDAYDISENENFKTFQILPIPDP FT CTSVEFNSISVGKHVACLYSEDGRWYLAIVLEKNKEEQEFLLQFYKPPGES FT ASIRGFKSTTNSKDQAWIPCVDVLKNIDSLEKTSRSGRTFKLSPTEYAQVS FT KLFTKKMQDS" XX SQ Sequence 3883 BP; 1284 A; 698 C; 732 G; 1169 T; 0 other; gggtaatccg cgccaactcc acaaatcggt ggcgtgatga aacctccgat tttcttcaaa 60 tttgcacagt aggtcaaaaa ctactgtgtt attacaaccc ccaaaggatt taaaaaaata 120 tgtagccgtt aaaaagttac gggtgtttta gttttgtatt tttgacactt tcccgtgggc 180 cgctttgcaa taaccatttt tttttaaatt ttcattctgc ctcgtcccct cattaattaa 240 acttatggca atacaactat tcgatagaaa atttgattct cttttgaatg ctttttgaat 300 ttttaaaata ttttcaaaaa tagaggagtt atgtgaaaaa aagagttcaa aagtttgccc 360 actttaaaaa ttaattgtaa ttaaagctgc taaccgattt ttataatttt tttaaaataa 420 caaagcgttt ttagaactgt tagctgattt tttttgaaaa ttgtgttttt tgtcgttctt 480 ctgtaataat tcatgaaacg caaaccgttt tccggctttt caccggtttc cccacacctt 540 gcccttgaaa gggggtggag ccttattctc ttcgggtttg gcccggttcc atttgcggcc 600 attgccgcaa atctgattta acagaattat aactgttttc ctcggtgtac taacacaaaa 660 gaagtaaatc taataaatga gctttattca tttgggttta ctacgaatta tggaataagc 720 gaaatatgtt atttaaacaa agaaacgtca attttatgga ctaattaatt gcgaccaagg 780 acatggttgg aagctccgcg taatcgttac tcggaagggt cagaaccctt ctgttttttc 840 atcagatcac tttaaagttt cagtcgcttt aaatattttg tgccgatcag tgagctgctt 900 acgagaacta ttatagcaaa taatggctgc cgtaagcaac aacataaaag agtgttcagt 960 gggatagaag cttagagacg cgttgtgttt cagtgatata tatgttacta aagacagttc 1020 actcatcacg cttgacgata gtgacgtata aaaaacaatc agatatcgag taaatatgag 1080 tagtgttcca actatatgtt tgcatcatct ttacctcaag aaatgcaaaa ctgttaatga 1140 atgctcttaa aattcgctgt gaggaattac aattaaagaa aagagcggcc gatattttgt 1200 cgttgctgac gttagcacca gcttcttgga cactcaaaca gacagcaaaa tttgtctgtg 1260 tcggaagacg ctgtgcgtcg ttcaagaata ttaaaagaag aaaaaggagt actatcaaca 1320 ccggatccca agaaagcaag aaaaattggt gacagtgaaa ataatataat tcaaaacttt 1380 tacatggacg atgaatacag tcgagtgatg ccaggaagaa aagatgttaa atcagtgaag 1440 aaaccggggg aaagaagagt ccgcctacaa aaacgacttt tgctgatgaa catcgacgag 1500 ctttattctc actataaaga caacgcagta aaaactttgc tcatgaagcc ttgtggatga 1560 acaaaatttt ttcagttacg gccgcagcac gtcatcgaag ttggttccgc ttgaacccac 1620 tctgttcgtg tctgtgaaaa acaccaaaac gtgaaaatca tgatcgactc tctgtgcaag 1680 aacatgccga tagcgcatct gtttaaggac aaattagttt gcgatatcga caatcaggat 1740 tgcatgatgt aaaggtgtgc ttcttgtccc ccaaaatccg tctgcttgaa catttgcgat 1800 cttttacaaa tgaaagggaa actataaaat ttcaacagtg ggaaagtact gatcgaactg 1860 cgttagcagt ctcagagctt ccaacggacg atttcttaag aaaattagtg gacatgatta 1920 gtgatcttac aagacatcat tttgttgcga aaaaacaagg gaaatattgt cgtgatttga 1980 aagaaacatt gaaaccaaat gagtgtttat tacagggaga ttattcccag aaatactcta 2040 tgactgtgca agaagcaaca caaggaatgt ttttcaatgc acctcgtcaa gctactctac 2100 atcccttttt ggcttatctc aacatcaacg gaaaaattgt gcctcattcc atgtgcgtat 2160 tcagtgattg tttaaatcac gatgctgttt ccgtcaatgc atttttgaaa cccgttttgc 2220 aacacatcaa gattatttcc ccttcaattg ataccataaa atatttctca gatggagcag 2280 tggggcagta taagaactgt aaaaatttcg taaacctttt acgtcacgag gaagatttca 2340 acatcaagta cgctgaatgg aatttctttt ccaactcgca cggcaagggg ccatgtgacg 2400 gaattggggg cacgataaaa cggttggcgt atcgacacag cttgcaaggt ggagatatcc 2460 aaacaccatt tgctttattt ctgtgggctg agcaacaaat tgaaaatctg aaaatgtttt 2520 atgtctcatc agaagatgtt aacggaaatc agaagcgttt ggaaggaaga ctatgcgagg 2580 cgcgaacttt aactgggact caatcgtttc atcggtttgt ccctagtctg actagcaaat 2640 atcacatgga cgcttatgat atttcagaaa atgaaaactt taaaactttt caaattcttc 2700 caatacctga tccttgtaca agtgttgaat ttaacagcat ttcagttgga aaacatgttg 2760 cgtgtttgta ttctgaagat ggcaggtggt atttggccat agtcttggaa aaaaataaag 2820 aagagcaaga atttctcttg cagttttata aacctccggg agaaagtgca tctatacgtg 2880 gattcaaatc gacaactaat tctaaagacc aggcttggat tccctgtgtc gatgttttaa 2940 aaaacattga ttccctcgaa aagacttcac gaagtggacg aacatttaaa ttatcaccca 3000 ctgagtacgc gcaagtttcc aaactgttta ccaagaagat gcaagattct taaaaaaaca 3060 atttacgttt cattccttaa actctcgtct tgtatgaaaa aacacagcca ccgaatgtta 3120 tttttttaat aaactgataa atacctaact aaatatattt ttcgcttatt ccataactcg 3180 tactacaccc atatgaacaa agctcattta ttagatttac ttcttatgtg ttaatacacc 3240 gaggaaaaca gttataatat tgttaaatca aatttaaggc aatggccgca aatggaaccg 3300 ggccaaaccc gaagagaatg aggctccacc ccctttcaag ggcaaggtgt ggggaaaccg 3360 gtgaaaagcc ggaaaacggt ttgcgtttga tgaattatta cagaagaaca acaaaaaaca 3420 caattttcaa aaaaaatcag ctaaaagttc taaaaacgct ttgttatttt aaaaaaatta 3480 taaaaatcgg ttagcagctt taattacaat tattttttaa agtgggcaaa cttttgaact 3540 ctttttttca cataacttct ctatttttga aaatatttta aaaattcaaa aagcattcaa 3600 aagagaatca aattttctat cgaatagttg tattgccata tgtttaatta atgaggggac 3660 gaggcagaat gaaaatttaa aaaaaaatgg ttattgcaaa gcggcccacg ggaaagtgtc 3720 aaaaatcaaa actaaaacac ccgtaacttt ttaacagcta catatttttt aaaatccttt 3780 gggggttgta attgcacagt agttcttgac ctactgtgca aatttgaaga aaatcggagg 3840 tttcatcgcg ccaccgattt gtggagttgg cgcggattga ccc 3883 // ID TTAA13_AP repbase; DNA; INV; 574 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 0) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA13_AP. XX NM TTAA13_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-574 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2079-2079 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 574 BP; 202 A; 98 C; 104 G; 170 T; 0 other; ggggattgaa ggcggtgatt taataagggg tcgcgtatag gcaattcgat gtcttatact 60 cgttacgcag tgtggttgtg tgcgaagtaa atgatcatta gtgtgcgtag atacgttcgg 120 aatccgtcgt gtgccactct cgcacgcaca cgcacatgtc aataaatcga gaaaaacgaa 180 atatattctt ctatgcatca cctataaaaa ctaccaaaag cggtagtaaa ttaaacatac 240 ttcctgaagt ttgattgtaa tttcgaaaaa aagggttgaa ttcgtaagct cctagactat 300 gctttgtagg aaccactttt ttgatataaa acccaaaaag cccaaaataa atacaaatgt 360 aatgcaatta tttcatgaga ttttttaaaa aaatatccaa attttatggt cctttaaaat 420 tgaaaattga aaatcgactt cctagaaagc ctagttattt tgccaaagaa ttcatacata 480 tattaaaaat taaaatcgga cattcggaag tatgtgtaat ttaccaccgc ttattggtct 540 aaaacgtacg aaaaatacgt gacatgcgat cccc 574 // ID Perere_INT repbase; DNA; INV; 4875 BP. XX AC BK004067; XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 28-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Schistosoma mansoni Perere non-LTR retrotransposon mRNA, complete DE sequence. XX KW Non-LTR Retrotransposon; Transposable Element; KW LTR retrotransposon; Perere; Perere_INT; Schistosoma mansoni. XX NM Perere_INT. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Perere_INT."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX DR Genbank; BK004067; Positions 1 4875. XX FH Key Location/Qualifiers FT CDS 822..4505 FT /product="DAA04497.1" FT /note="pol polyprotein; GI:44829167" FT /translation="MNENNIVEVTTNVIAPEAEPTGTANLSHVSQCTDIPM FT QCASLPLILIVALDNQDISDVDSNVILSSVISEAKDRTPDSIVKAHKNTAK FT PKKNIPIKKKVNKKPSGSRPNTKNISTALPNKSLISETMLNPTHRKGGYKD FT TFRSNVTRDGHYNHHVSRPTIRQTGMNQRAAWFPPNVINNKPARCNSYHHS FT CSGEDGILGYAPSTQPQHTGTCLNCPPKQVQPYQNKDFFRPPEIPCSTSTT FT IRSCYSPCDPENTLNHSPGIARTHTLDKDALGFFTLHTPFLNLLLINARSL FT LNKISALRTLAFLAKPSFILITETWCYPAVADSELNIQNYRLYRCDRETKR FT GGGCLIFALDTLTTNKVEDSILNSLPESIWISINTLNHSLLLGCIYRAPDS FT TDNLNDRIINAFIHASTLNFSARIITGDFNYPEINWSTGSCQSSNDEFLSI FT LNLYCWSQWVRTPTRGDNTLDLIFGRDTIPLSVQVYNEFESSDHRMVACAL FT PIYPSYNRPIQRTCNYRDYKHADWDLLRSLIKLSDWDEFFSCNSLTDAINI FT FYLIVNSCLDSCAPIKIYRISKHHELYIPAKYRNKLRRLKKRYFKSNDFTA FT VTQITIIFNQIKEKHRLKAINEELLALRTSSKVQNLIHIYNKRAKSTQNVD FT IPCILHNNSFIYDPKTIADLFSGNFANSKESLNDFVSRVICLTGNSIKSIS FT FTCLKISKVTNKLKVSKGHGADGISSFLYKYGGPDIQLLLLKLFTLSMESG FT SYPDRWKTAYIIPRYKSGDKTDMNNYRPINITPVISRIMEKIISDELPNYL FT LTEQFIDDSQHGFLKNRSCMTCHFDFFNLVYSLRSQGYLVLVLYLDISKAF FT DMVNHQLLIGKLASYGVENPLLAWFDSFLSDRHQIVKINSSLSNSVPVRSG FT VIQGCVLGPLLFLVFINDICECFSVGKSLLFADDLKVVYSFSPHELSNIRN FT CISTELNKVAQWCSEWQLELNTAKCGCLCFGDTSLNLNLTINGEMLSRLHT FT VVDLGLRYSDDLSFTEQIMKQTSKSQRLIGFITRNLHNSESRILMYKVCVR FT PLLEYCAFLLISTRIKDKLRLESVQRRFIFRTLGTDSVLTYNSRCSKLGLD FT PLWMRRLKLNLIFFFKILNKLSFTSSQEIQYAKAPHYDIRNSLSLAKQTYS FT RSSLYMNYFTCEFSMLWNNLPQTIRMLNSLPLFVRSINAFCSSENALNALA FT HASVSYSTSEIIGTLNV*" XX SQ Sequence 4875 BP; 1539 A; 1024 C; 814 G; 1498 T; 0 other; gcggtacgtt tggagtttgc gttacgatca cacggtattt attacaaagt ccttatatat 60 cgttaaacca cagtttaaaa atcaataata acattcagtt aacccttgaa gaaaagtggt 120 ttgactgaca caaaaacgag cttgaaaaag tcctatttct tcagatatta acaactctgg 180 tcttacctat accccatgct ctgcaactcc tgttgcacca gataatagtg ggagtggact 240 tattccacac atgttgctgg atgactgcat gtctaactta ataatgagtt ctaactcacg 300 aacagatggt atcgatctta ttaagagcga attagcctct gtctttaagc aattaagtgg 360 agtcacttgc gggctcttta ttaaagcatg atgatcttaa atttgaacta aagacactgt 420 caccccttcg actactgttg tccaaggata tagttacttc tgaattcgaa gacaggatca 480 aagtagaatc cgttattgac ttgatagcta gtgaagtcac caagcgaata atctctcgaa 540 ataatgtggt tatatataat atacctgaca aagttgccat taaaactgta agaaactcga 600 tactaaaggc agttaatctc caggacaatc catgccaatg tatccgactt aacaaaaagc 660 atcaaaaata ctcgtgccct atcgtgttta gatttgattc ccatttatta gcggaacgtt 720 tgaaagaatc tgaacagcta gtctgtgcgc atacgaagtt taaaaacgct cgtatagtct 780 cagacaaaac caccaatcaa agactaacac aaaagcgtac catgaacgaa aataacattg 840 ttgaggtcac aacaaatgtg atcgcgccag aagctgagcc cactggtaca gctaatctat 900 ctcatgttag tcagtgtact gatataccaa tgcagtgtgc tagtttgcct ctcatactca 960 tagtggccct agataaccag gacatatctg atgtagactc taatgtcatt ctttccagtg 1020 taatatcgga agccaaggat cgaacacctg attctattgt gaaggctcat aaaaacactg 1080 ctaaacctaa aaagaatata ccaattaaga agaaagtaaa taaaaaacct tctggttcta 1140 gaccgaatac caaaaatatt tcaacggcct taccaaacaa aagtttaatt tctgaaacca 1200 tgcttaaccc tacccaccgt aagggaggtt ataaggacac atttcgttca aacgttacac 1260 gagacggcca ctacaatcat catgtaagca ggcctactat tagacagacg ggaatgaacc 1320 aacgtgccgc atggtttccc ccaaatgtta taaataataa acctgcaaga tgtaactctt 1380 atcatcactc ttgcagtggg gaagatggta tactaggtta tgcaccatca acgcaacccc 1440 aacatacagg cacctgtctt aattgtccac caaaacaagt acaaccttac caaaataagg 1500 atttttttcg gcctccagag atcccttgtt ccactagcac tacaattcgc tcatgctata 1560 gcccatgcga tccggaaaac acattaaacc atagtccagg catagctaga actcataccc 1620 tcgataaaga tgctctcggt ttttttacac tacacactcc ttttttaaac ttattactca 1680 ttaacgcacg ttcacttctg aacaaaatct cagccttgag aaccttagcc tttctagcca 1740 agccttcttt tatacttatc actgaaacgt ggtgttatcc agcagtggcc gattccgaat 1800 taaatatcca aaactatcga ctctatcgtt gtgacagaga aactaagcga ggaggcggtt 1860 gtcttatatt tgctttggat accttaacaa ctaataaagt tgaagatagt atcttgaata 1920 gtttaccaga atcgatctgg atatcaatca ataccctaaa ccatagtcta ctcctaggtt 1980 gtatatatag agctcctgat agtactgata atttgaatga tcgtattatc aatgcattta 2040 tacacgcatc tactctaaac ttcagcgcta ggattatcac tggtgatttc aattatcctg 2100 agattaactg gagtaccggt agctgtcagt ctagcaatga tgaattttta tcaatcctta 2160 atctgtactg ctggtcacag tgggttcgta ccccaacaag gggtgataat acacttgatc 2220 taatatttgg tagagatacc attcccctat ctgtacaagt atacaacgag tttgaaagca 2280 gtgatcatag gatggtagct tgtgctctcc ccatctatcc ctcctacaac cgaccaattc 2340 aaagaacctg caattataga gactataagc atgcagactg ggacctcttg cgctcactga 2400 tcaaactctc agactgggat gaattcttct catgtaatag tctgacagat gccatcaaca 2460 tattctattt gattgttaac tcttgtctag actcctgtgc accaattaag atttacagga 2520 ttagtaaaca tcacgaatta tatataccag ctaaatatcg taataaacta agacgcctaa 2580 agaaacgtta ttttaaatct aacgacttca cggcagtcac acaaataaca ataattttta 2640 accaaattaa agagaaacat aggttaaaag ccatcaatga agagctatta gcactacgta 2700 ctagctcaaa agtgcaaaac ctaatccaca tttacaataa acgtgctaaa tcaactcaaa 2760 atgttgatat accatgcatc ctgcataata atagttttat atatgaccca aaaactatag 2820 cagacctttt cagtggtaat tttgcaaata gcaaagaatc cttaaatgac tttgtttcta 2880 gagttatatg cttgactggt aactcaatta aatctatctc cttcacatgt ctgaaaatta 2940 gtaaggttac aaataagctc aaggtttcga aaggtcacgg tgcagacggg atttcatcat 3000 ttctctacaa atatggtggt ccagatatcc aacttcttct ccttaagctg tttactctct 3060 ctatggaatc gggctcttat cctgaccgtt ggaaaaccgc gtacatcata ccacgttaca 3120 aatccggaga taagactgac atgaacaact atcggccaat aaatattact ccagttatct 3180 ctaggattat ggaaaaaatt attagtgacg aactacccaa ctatttattg actgaacaat 3240 ttattgatga ttcgcaacat ggatttctta aaaatcgatc ctgtatgaca tgtcattttg 3300 acttctttaa tttagtctat tcgcttcgta gccaaggata tctagtatta gtgctatacc 3360 tggacatttc taaggccttc gacatggtca accaccaact tctcataggt aaactcgcat 3420 cttatggggt cgaaaacccg ttactagcct ggtttgattc cttcctcagc gatcgacatc 3480 aaatagttaa aatcaactct tcattgtcga actcagtccc tgttagaagt ggggtaattc 3540 agggttgtgt cttaggtcct ttgctctttt tagttttcat taatgatatt tgtgagtgtt 3600 ttagtgtggg taagtccctt ttgtttgcag acgatcttaa ggtggtgtac tcattttctc 3660 cacatgagct gagcaatatt cgaaattgta tcagcacgga gcttaacaag gtagcacagt 3720 ggtgctcgga atggcaactg gagctcaata cagctaaatg tggttgttta tgcttcggtg 3780 atacatcact caacctcaat cttaccataa atggggaaat gttatctagg ttacacacag 3840 tagtagatct aggacttagg tactccgacg acttgtcgtt tactgaacag ataatgaaac 3900 aaacgtccaa gtctcaacgc cttataggtt tcataactcg aaaccttcat aacagcgaat 3960 ctcgtattct aatgtacaaa gtctgtgttc gaccactcct cgaatactgc gcgtttctcc 4020 ttattagcac gcgcataaag gataaattaa gactggaatc agtacagaga cgatttatat 4080 ttcgtactct tggaactgat agtgtcttga catataattc gaggtgtagt aaactagggc 4140 ttgacccttt atggatgagg agactcaaac ttaaccttat cttctttttc aaaatactta 4200 acaaactctc cttcacatcc agtcaggaga ttcaatatgc taaagcccca cattacgaca 4260 ttcgtaactc cttgtcttta gcgaaacaaa catattctag atcttctctc tacatgaatt 4320 actttacctg tgagttttct atgctctgga ataatttacc ccaaactatc cgtatgttaa 4380 actctctccc attgtttgtt cgctcaatta atgctttttg ctcctctgaa aatgcattaa 4440 atgctctagc gcatgcaagt gtatcttact ccacaagtga gattatagga actttaaatg 4500 tttaacctct tacttgctat cattgttttg ttgttccgtg ctctttgctt taatcttact 4560 ttctctagtt gtagataatt aataataact tgctctggca ctggaaataa ccttacagaa 4620 caatgttgat caatcatcag gctatccact taataaaaaa tgacaagttc cctggtgttg 4680 cattggcttg ccgtgacata tgccatattt aagctaatta tcaattacta aaccagcaaa 4740 cataaaactt tcttatattt catgtttgtt ctctacatta aatgttgatt tttataacaa 4800 tctgatcatg cctgtaaact ggtgtgaatt ctgtaaatat tattttcaga atatattatt 4860 attattatta tttag 4875 // ID Gypsy-1_DGri-LTR repbase; DNA; INV; 1503 BP. XX AC scaffold_15203; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DGri_; KW Gypsy-1_DGri-I; Gypsy-1_DGri-LTR. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-1503 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_15203; Positions 1876307 1877809. XX SQ Sequence 1503 BP; 458 A; 326 C; 293 G; 426 T; 0 other; tgtgacggcg cgaatattca gcttgcaatt ttcaaaacac tgcacaaaac atgcacatta 60 gacttaagca gctggtctag atcagctggt gcagcttatg gtgcgttaga gagctttcga 120 acgcgaactt tcgctcttaa tctttctaac gcgaactttt gctcttgaac tgtcaagttg 180 gctgcgcctg cgtttctgaa tgcatttcat tcttccgcat gctatcgatg caagaagcaa 240 cagcagatcg gtgccaagaa aagttttctg tttgttttac gtgacaattt gcacgtggtc 300 aaataaatac aatcggcaaa catccgaagc tgggataata tttaaagtgc gatattacgg 360 ataccagaac gctgaagacc ggttataccc gacagcagtc gaatttaata aaatttcgac 420 tcattaccgg gattttggca gttcacgtgt ttcgagtgtt gggtggacct gcaagcacac 480 cacatccaat cattgcacca caccacaata caccattgac cacggaactg gccacccaag 540 ccccttcgtc tattcttata cctcattatt ttgttgaaaa ataaaatcaa gccgacttca 600 gcagagaagc gttgccagcg cctattcagg gctgctcaac gcctctcaat acaacaacaa 660 taagaagaac aactacaaca acacaggaac aacaaccccg gtgtcttgaa gtcaccgcat 720 agcaggcata gagtataaaa aaaaaaggaa aaaaatgaat tttagttcta agtaatatta 780 tactccatac ttataaattt gtaaaatatt ttctagtgag cgctttgtat ccattttgca 840 cctttgactc taattatcac cgcctacaga tcgtctacgt cgacaggcct gctgacatcg 900 atggatcgct aaggatgata tggtagtaag tacggcgtag ccggacgtga tggcatcaga 960 gatgcaatat tggtaactaa agcgcccata gaattttgtt ggtattgtat tgatattcct 1020 aatagtataa aattaataag agtataatag tatcagagcc acaatttata atgaagttac 1080 aatgtcctta ttgaattcca ctatcatgaa tagtccttgt atttaactta attcaattat 1140 agaaatataa acatgggata atttgttatt attaaaggca ccttaggtct gtagcaccgc 1200 aagcttcata gggtaggatc caaaattgca atagattcct acaaggttgg gtgggtaaag 1260 aaggagaact ctaacaactt atctccttct ttagataggc tgtataagca acatcagtcc 1320 gttccatttg tatggctcgc attccttatc ccgttcttcc cgttcttccc tgtctacgaa 1380 attattactg tagcaccgtt tgtttaggcg gcacaatacc cacgcccaat ttccctgagc 1440 aaggccgccc caaatagtag acagggtgtg caagccgaga cgacattgtt agtgttcgtt 1500 aca 1503 // ID SMAR12 repbase; DNA; INV; 1792 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR12. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1792 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1070-1070 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(284..610,614..1564) FT /product="SMAR12_1p" FT /translation="MAPVKRHAYEADFKLNAISHAVQHGNRAAAREFNINE FT SMVRKWRKQEDALRQVKKTKLSFRGNKARWPQLEDKIEQWVIEQRTAGRSV FT STVSIRLKATVIARDMEINDFGGPSWCFRFMKRRNLSIRTRTTISQQLPKD FT YEEKLAIFRTYCKNKINEKKIRPEHITNMDEVPLTFDIPVNRTVEKTGTST FT VSIRTTGNEKSSFTVVLGCQANGQKLPPMVIFKRKTLPKEKFPVGVILKAN FT PKGWMDEEKMSEWLSEVYVRRPDGFFHKSPSLLICDSMRAHLTDTVKAQVK FT KTNSVLAIIPGGLTKELQPLDIGVNRSFKVKLRAACEHWMTEGEHTFTKTG FT RQRRASYATICQWIVDAWAKVSVSSVIRAFRKAGIITEQLSNSNETDSDND FT DKKDPGMLDAEIAQLLNSDTEDEEFDGFLVEE" XX SQ Sequence 1792 BP; 545 A; 380 C; 424 G; 443 T; 0 other; taccgtattt ttcggaccat aaggcgcact taaaatcctt tcattttctc aaaaatcgtc 60 agtgcgcctt ataatgcggt gcgcctaatg tatggttgtg ctttctggct ttctgaccta 120 gaaccgattt tatgtggtac acggcgctca aaaatctgtc aacatgtttt agcatggcct 180 tggtaagcta caaagctgca ctgattggat tgagcattac agccatcgta gtcagcaggt 240 gtgtacattc aaacattaat tttctatacc cgctgcccca aaaatggctc ctgttaagag 300 acatgcttat gaagcggatt tcaaactcaa cgctatcagt cacgcagtac aacatggaaa 360 tagagcagct gcgagagaat tcaacattaa tgaatcaatg gtacggaagt ggaggaagca 420 agaagatgcc ctgcgccagg taaagaagac caaactgagt ttccgaggga acaaagcgag 480 atggccacag ttggaggaca aaattgaaca gtgggttatt gaacagagaa ccgcaggtag 540 aagcgtctct acggtctcta ttcgactcaa agcaacagtg atagcacgcg acatggagat 600 caacgatttt tgaggaggcc cctcttggtg cttccgtttt atgaaaaggc gtaatctctc 660 catccgcaca agaactacta tctcacagca actgccaaag gattacgaag aaaagctggc 720 cattttccgc acctactgca agaacaagat caatgaaaag aagatccggc cagaacacat 780 caccaacatg gacgaggtcc cccttacctt tgatatcccc gtaaaccgta ctgttgagaa 840 aacggggacc agtacggtat ctatacgcac tacaggaaac gagaagtcat ccttcactgt 900 agttctcggt tgccaggcta atggccaaaa actaccaccc atggtaattt tcaagaggaa 960 gactttgcca aaagaaaagt ttccggttgg cgtcatttta aaggctaacc caaagggctg 1020 gatggacgag gagaagatga gtgagtggct gagcgaggtt tacgtcagga gaccggatgg 1080 ctttttccac aaatctccgt ccctattgat ctgtgactcc atgcgcgccc atctgaccga 1140 tactgtcaaa gcccaagtga agaaaactaa ttctgtgctt gccatcattc cgggtggatt 1200 aaccaaagaa ctccagccgc tagatattgg tgtcaacagg tcgtttaaag ttaaattgcg 1260 agctgcgtgt gagcattgga tgacagaagg tgaacacaca ttcaccaaga caggaaggca 1320 acgccgggca agttacgcca ctatctgcca gtggatcgtg gatgcctggg cgaaggtatc 1380 agtctccagt gtcatccgag ctttcaggaa ggccggaatc atcactgaac agctaagtaa 1440 cagcaacgag actgactctg ataatgatga taagaaggat ccgggcatgc ttgatgccga 1500 aatcgcccaa ctgttaaact cagacactga agatgaagaa tttgatggat ttttggtaga 1560 ggaatagaat aaaccaaaaa agtcagtgta ttgttttgta ttttatgtgt gtaacactga 1620 acaatgttga gttattgtga aagagttgaa tagagttgaa taaagtttga cttatcagag 1680 ttttgtgtca tttaatgtgt gcgccttatg tatgaaaata aaaccgttag cagtcgatca 1740 atgatattgc gccttataac acggtgcgcc ttatggtccg aaaaatacgg ta 1792 // ID Gypsy-2_PPP-LTR repbase; DNA; INV; 393 BP. XX AC ADBJ01000051; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PPP_; KW Gypsy-2_PPP-I; Gypsy-2_PPP-LTR. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-393 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2161-2161 (2010). XX DR GenBank; ADBJ01000051; Positions 314130 314522. XX SQ Sequence 393 BP; 195 A; 56 C; 24 G; 118 T; 0 other; tgttatccaa taatattaga taatatagat aattaacaca acattgtcta ttcaactacc 60 aatatatcaa tatacaatta acccaataat taatattaca attcagataa ttatattcaa 120 catataatcc agcatacaaa tcatccagat gattttatta tattcaacga atcaactata 180 ctacaattca gaataataat gtaaaatacc aacaaacaac agatataaat atcccagata 240 gagatacaga aaggaaaaga aatatagaca gaagatctaa cattaaatat ataaataaaa 300 ctaaatctat aaacaaaact aaatacttta tttaattaat tatatattat tattaactac 360 tatggacata aataacaata ataataacaa tca 393 // ID BEL-16_AA-I repbase; DNA; INV; 5636 BP. XX AC AAGE02020450; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_AA_; KW BEL-16_AA-LTR; BEL-16_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5636 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020450; Positions 64085 58450. XX CC Positions [4638-5219] - Integrase core CC 'GCTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 294..5600 FT /product="BEL-16_AA-I_1p" FT /translation="MTLPRTPEAAKKVKMEEIKVLVHQRGQVKGSVTAIVK FT ALEKAEDDPSQVSLPILRVYSKKLESFYNTYVSLHKEILACSPSGKLDEQD FT VKLSEFEDLHTDALIRVEILIEALTKPLSATVPTLSQQNSPQVIVQQQPLR FT APIPTFDGRYENWPRFKMMFQDIVDKCSDSDAIKLHHLDKALVGSAAGIID FT AKTLADNNYHHAWEILTERFENKRVIIDTHIGGLLSLKRMNKESYCELREL FT LDTCTRHVEGLKYMGQEIDDTSGLIITKILTSCLDSVTRKYWERSLTHGEL FT PDLDETLKFLKDQCRVLERCETDQSAVAKSTVGKSSQPASKSVNVKVHAST FT SGETNESCQFCCGNHFNYQCSDFRKLAVPDRIAKVKESRVCFNCLRRGHRS FT LNCQSKSNCSKCHKRHHSLLHEEKQKLDSSEHADSKSQEQNVPIQAAASPP FT PQPAVPVSSVSTNVSSASCSAFLPNVLLLTAVVNLMDKNGRAVRCRAFLDC FT GAQTNLLSAAMFQKLGIDSTPVNADIVGVSGARSKSTRLVEVNVRSVNTEY FT SSFLKCLVTSKITNALPCKAVNVSEWKIPSGFPLADPKFGTPAEIDLLIGV FT TEFFRLLKPGHLVIGEGLPELRETELGWVVAGEIQDESSAIINPQQVSSVT FT IETLNDAIKRFWEIEEVNTQSTSTTEEQECEELFRKSYQRDSAGRFVVKLP FT FRSNLHLLEDNRSLALRRFLFLERRLRKQPNLHVQYSSFIDEYEHLGHCKE FT IRESEDRPNMIKYYMPHHAVLRPDSSSTKLRVVFDASAKSSPSAMSLNDVL FT QVGATVQSDLFSILLRFRMHRFVFTADITKMYRQVRVHPDQTSVQRIFWRT FT NPAERLRILELQTVTYGTAAAPFLATRSLVQLCEDEGSDFPLAARIILEDC FT YVDDIISGADTIEEAVDCCRQLQVLLGKGGFPVHKWCANNELTLGDIPKSK FT REKLLQLDQLSANEVIKTLGLIWDPSSDEFLYAVDQSPAEPTLVTKRQVFS FT KCAKLFDPVGFLSPIVVLAKQLMQRTWSAKIDWDSPLEGDLLADWQRFSEC FT LAGVNEIKVPRPVLCFPYIALELHGFADASGLAYGACIYIRCVNAENECNV FT RLLCSKSKIAPLQDVTIARKELCAALLLSRLVRKVLPILRVAVSAVHLWSD FT SQIVLAWLRKPPNMLQPFVRNRVVEIVNENQHTQWSYVRSKENPADVVSRG FT QSPNLLKNNVMWWNGPPFLHLPNYEPAVIEEINDSELPEIKPSALVVVDAL FT NVDDLSLFKRFSSFRKLQRVLAYVQRFIRNCREKVVENRIKEFSPTISELR FT TALHTIVILIQHEALFEEVQRVEHGEPCKVVGMLGPFLQDGALRVGGRLQN FT SRLPVQMKHQYILPKHYITDLIIRAYHEENMHVGPSGLLSTLRQRFWLLGS FT RSAVRKITRHCVRCFRSKPKGVKQYMGNLPPARVTCAAPFEVTGVDYAGPF FT LVKQGTRKPIVIKAYISVFVCLVTKAIHLELVSDMSAVAFVAALQRFVSRR FT GVPREIHSDNGSNFRGAKADLHNLFLLFNDQVAVKDIGSFCQSKEIEWNFI FT PPEAPEFGGLWEAAVKSTKHHLKRILGETPLTFEEFGTLLAQVEAILNSRP FT LFSLSDDPSDGEVITPSHFLIGRPMNAIPEPSYESVGVNRLSRLQHLQLMR FT EHFWKSWVRDYLVSLQPRGKNYFRVQNVVPGQVVLLEDKNLPPQQWKMARI FT VTVYPGSDNLVRAVDVRVGESVFRRPINKLSLLPIEDNRSSDQDFATRE" XX SQ Sequence 5636 BP; 1494 A; 1219 C; 1340 G; 1583 T; 0 other; tagtggtccg tccgaaccgg atgaagaaaa ccgaacaaga actacagtcc aagtgaacta 60 agtgaaaatc gcagtgcagc accaaaaact gtggactagt gaactgtact tttgctgcaa 120 ctgtgattct gaaaatgata aagtggaaaa tgtaaaagaa cagaaaaaca gtgagcagaa 180 attattgaag tgagtgaatt tcgaaaagaa aactggtggc gtccatccat tcgagcagtg 240 tcatcatcgt gagtttgtgt tgattgctct tccgtagcag cgtttcctgt ccgatgacac 300 taccacgtac accagaagca gccaagaaag tgaaaatgga ggagattaag gtgctcgtcc 360 atcagcgcgg ccaagtgaaa ggcagtgtaa ccgccatcgt gaaagcgctg gaaaaggcgg 420 aagatgatcc gagccaggtg agtcttccga ttctgcgagt gtattctaaa aagctcgaaa 480 gcttttacaa cacgtatgtt agtttgcaca aggaaatttt agcttgcagt ccatccggaa 540 agcttgatga gcaggatgtg aaactttccg aatttgaaga tctccacacc gacgctttga 600 ttcgtgttga gattctgatt gaagcattaa caaaacctct ttccgctact gtaccaactc 660 tgtcgcaaca aaactctcct caggttattg ttcagcagca gccgcttcga gctcccattc 720 caacctttga tgggcgctac gaaaattggc cgcgcttcaa aatgatgttc caagacatcg 780 ttgacaagtg ttccgattcc gatgcaatca aattgcatca ccttgataaa gcgctagttg 840 gttcagcagc aggtattatt gacgcgaaga ctctagcgga caataattac caccatgctt 900 gggaaatcct tactgagcgg tttgagaata aacgggtaat cattgatacg catattgggg 960 gccttttgtc gctgaaacgt atgaacaagg agtcttattg tgagttacgt gagttgttgg 1020 atacgtgcac gcgacacgtc gagggtctaa agtacatggg tcaagaaatc gatgatacat 1080 ctgggttgat tattacaaaa attcttactt cttgtctgga ttcagttact cggaagtatt 1140 gggaacgatc tctcactcac ggagagttgc ccgacttgga tgaaactttg aaatttctca 1200 aggatcagtg tcgcgttctg gagcgatgtg aaactgatca atcagccgtc gcgaagtcta 1260 ctgtcggtaa atcctctcag cctgcgtcga aatctgttaa tgttaaagta catgcgtcta 1320 cttctggaga aaccaatgaa tcgtgtcagt tctgctgtgg taatcatttt aattaccaat 1380 gttccgattt tcgcaaattg gcggtgccag accgaattgc aaaggtgaag gaatctcgag 1440 tgtgtttcaa ttgccttcgc cgtgggcatc ggtcgttaaa ctgtcagtct aagagtaatt 1500 gctccaagtg tcacaaacgt catcattcgc tgctccacga agaaaagcaa aaactcgatt 1560 cttctgagca cgcggattct aagtcgcaag aacagaatgt tccaatccaa gccgcggcat 1620 ctcctccgcc acaaccggca gtcccagtga gttctgtgtc aacgaatgtt tcttcagctt 1680 cttgttcagc ctttcttccg aatgtgttgc ttctgactgc cgttgtgaac ctcatggata 1740 aaaatggcag agccgttcgt tgtcgcgcat ttttggattg cggcgcacag acaaacctct 1800 tgtctgccgc gatgttccag aaattgggaa tcgatagtac gcctgtcaat gctgatattg 1860 tcggcgtaag tggcgcgcgt agtaagtcca ctcgattggt agaggttaat gtccgttcag 1920 tgaataccga atacagttcg tttttgaagt gtctggtcac gtcaaaaatt acgaatgctc 1980 taccgtgcaa agccgtgaat gtttctgaat ggaaaattcc ttcaggattt ccgttggctg 2040 atccaaagtt tgggactcca gccgaaatag acctgttgat aggtgttact gagttcttcc 2100 gtcttctgaa accaggtcat ttggtaattg gtgagggtct gcccgaattg cgtgaaactg 2160 agttgggctg ggtcgtcgct ggagagatcc aagacgaatc ttcagcaatc ataaacccgc 2220 aacaagtgag ttccgtaaca atagagacac tcaacgatgc tattaaacgg ttttgggaaa 2280 ttgaagaggt gaacacccaa tccacttcta ctacggaaga acaagaatgt gaggaactct 2340 tccgtaagtc ttatcagagg gattccgctg gtagatttgt agtgaagctt ccatttcgaa 2400 gcaacttaca cctgctcgaa gataataggt cgttggctct tcgtcgtttc cttttccttg 2460 aaagacgcct acgtaaacag ccgaacctgc atgttcaata ctcatccttc attgatgaat 2520 atgagcactt gggacattgc aaagaaattc gtgagtccga agatcgtcct aacatgataa 2580 aatattacat gccgcatcat gccgtgctcc gtccggacag ttcgagcacc aaattaaggg 2640 ttgtcttcga tgcatcagca aagtcaagcc cttcagcaat gtcactcaac gatgtgcttc 2700 aagttggtgc gactgttcag agcgatttat tcagtatctt gcttagattt cgaatgcatc 2760 gttttgtctt cactgctgac atcacaaaga tgtacagaca ggttcgtgtc catccggatc 2820 agacttcagt ccagcgtatt ttttggcgta ccaatcctgc agagaggctt agaattctcg 2880 agctgcaaac agttacatat ggtacggctg cagctccctt tctggccact cgttcattag 2940 tccagttgtg tgaggacgaa gggtctgatt ttccactagc agctcgcatc atccttgagg 3000 attgttatgt agacgacatt atttccggtg ctgacacaat cgaagaagct gttgattgct 3060 gtcgacagct tcaagttctt ctcggaaagg gtggttttcc cgtacacaag tggtgtgcta 3120 ataatgagtt gaccttaggt gatattccca agtctaagcg agaaaaattg ttgcaactcg 3180 accagctatc tgccaatgag gtcataaaaa ctctcggttt aatatgggat ccatcaagcg 3240 atgaatttct ctacgcagtt gatcagtccc ctgctgaacc tacattggtg accaaaagac 3300 aggtattttc aaaatgtgca aagctgttcg atcctgtagg ttttctgtct cctatcgttg 3360 ttttggcaaa acaattgatg cagcgtacat ggtcggctaa aattgattgg gactctccac 3420 tcgaaggaga tttactggct gactggcaac gttttagtga atgtttggca ggtgtgaatg 3480 aaataaaagt cccaaggccg gtcttgtgtt tcccatacat cgccttagag ctacacggct 3540 tcgcggatgc ctctgggcta gcttacgggg catgtattta cattcgctgt gtaaatgctg 3600 agaatgaatg caatgtgcga ttgctatgca gcaaaagtaa aattgctcct ctacaagatg 3660 tgacgattgc acggaaggaa ctttgcgctg cactattact ttcacgcttg gttcgcaagg 3720 tattacctat tctccgtgtt gctgtttcag ctgttcactt atggtcagat agtcaaatag 3780 tgctggcgtg gttgcgtaag ccacccaaca tgcttcagcc atttgtgcga aatcgagtcg 3840 ttgagattgt caacgaaaac cagcataccc agtggagcta tgttcgctct aaagaaaacc 3900 cagctgatgt ggtttcccgt ggtcaatctc caaatcttct aaaaaacaac gtgatgtggt 3960 ggaatggtcc accgttcctt cacttaccga actacgaacc agccgttatc gaagaaatta 4020 atgactccga attgccagaa attaaacctt ctgcattagt tgtggttgat gctcttaatg 4080 ttgatgatct ttcgcttttc aagcgcttca gttcgtttag aaagctccag cgtgttctgg 4140 cttatgtcca acggttcatt aggaattgtc gcgagaaggt cgttgaaaac cgtatcaagg 4200 aattttcacc tacaatatct gagctcagaa ccgcactaca tacgattgtg attctgattc 4260 aacacgaggc tttgttcgaa gaagttcaac gtgtggagca tggtgaacca tgcaaggtag 4320 ttggtatgct tggtccgttt ctccaagatg gtgcactgag agtaggaggc aggcttcaaa 4380 actctcgatt gccggtgcag atgaaacacc agtacattct tccaaaacat tacattactg 4440 atctcattat tcgtgcatat catgaggaaa atatgcacgt agggccatca ggtctgttgt 4500 ccaccttacg tcaacgtttt tggctattgg gatccagatc tgctgttcgt aaaatcacaa 4560 gacattgcgt ccgctgtttc cgctcgaagc ccaagggtgt taaacaatac atgggcaatt 4620 tgccaccagc ccgagtaact tgtgcagctc cttttgaggt caccggtgta gattatgctg 4680 gtcctttttt ggtgaaacaa ggtacaagaa aacctattgt tataaaggca tacatatctg 4740 tttttgtatg tttagtaact aaagcaatac atctcgaact tgtttcggac atgtctgcag 4800 ttgctttcgt agctgcttta caaaggttcg tcagtcgaag gggggttcct cgtgaaatac 4860 attccgacaa tggatcgaat tttcggggtg ccaaggcaga cttgcataat ttgtttttgc 4920 tgtttaatga tcaagtcgct gttaaggata ttggttcctt ttgtcaatcg aaggagattg 4980 aatggaactt tattccgcct gaggctccag agtttggggg cctctgggag gcggcagtaa 5040 agagtacgaa acaccacctg aaacgtattt tgggggaaac tccattaact ttcgaagagt 5100 tcggtacgtt gttggcgcaa gtggaggcca ttcttaattc acgtccactg ttctccctct 5160 ccgatgaccc ttcagatggt gaagtgataa caccatccca cttcctgatt gggcggccga 5220 tgaatgccat tccagaacct tcttatgaat cggttggggt gaatcgttta agccgtttgc 5280 agcatttgca actaatgagg gaacactttt ggaaatcgtg ggtgcgggat tatttggtca 5340 gcctacaacc gagaggcaaa aactatttcc gggttcaaaa tgttgttccg gggcaagttg 5400 ttctactgga agacaagaat ctacctccgc aacaatggaa gatggccaga attgtcacag 5460 tgtaccctgg aagtgacaat cttgtgagag cagtagatgt tcgcgtgggt gaatcagtgt 5520 ttcgacggcc tattaacaaa ctttctctac tacctataga agacaatcgg tcatccgacc 5580 aggactttgc taccagagag tagtagtagc actctgctcc gtgcgcggcg ggagta 5636 // ID hAT-3_HM repbase; DNA; INV; 3391 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3391 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1992-1992 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 756..3008 FT /product="hAT-3_HM_1p" FT /translation="MAVTTRSSAEVWLIGKASKQLSSSRLATNGDVLRCLL FT FHHLEEHLTIKDSIHHTIQELIVMWNKARIPTQRIDSGERKLQKLYDSYLL FT LKKNRTTDLESSRIKEQMFKDTLYELFDLATKTAMETITIAEDRQFLAMQR FT EDVTSCSMAGIDKNLLAKEARKRARDQAYDARLSKQQHSSGHLDPVSANIS FT SSEDSQDSDDDPSFEILSSASQSVFSTPKPKRLKSIVSSPEVAGALDRVNL FT PDRKAMFVVASVAKALGHPLADLTLSRSTIRRSRMSTRKHVTQNDKDNFSI FT EFPLLLHWDGKLLPDITGSKETVDRIAVIVTGNGLEKLLAVPKIGRGTGEE FT QAAACLKILDDWKIRDKLQGLVFDTTSSNTGIHKGACVLIEKAVGRDLVNI FT GCRHHVLEVILSNVFTALFGGTGGPEVGLFKRFQKKWPYIQQARFSPAKDE FT LFIGDMEILRKEMVAFYTRAIDEKQPREDYLELLRLCLVFLGGGSVGAEIK FT FRAPGAMHHARWMSKAIYALKMVLFQDQLTLTVREKKGLVELALFVALIYG FT RFWHEAPLAANAPFNDAQMLKQFQKYPNRTIADAAFTAFSRHLWFFSEHLI FT GLAFFDSRVDLDVKRAMVVNLQLPKTTLALKRVDSKNTDFNRLETFVTMRT FT YSLFELLSSTGREEARNFISKDPKYWEDDESYQKLAERVQRMKVVNDSAER FT GIALIGQYNESITKDEEQKQFLLRFVQRHRQLYPTSSKAAMLAIDDEIVD* FT " XX SQ Sequence 3391 BP; 1096 A; 613 C; 658 G; 1024 T; 0 other; gggtggtcca gatttgtgtg gtgaaaattc aaagtgtgtc tagattctct ccacaagcct 60 aactttgttc cactatgctt agggaataac catacaaaaa attggaccaa tctgaacagt 120 tttaggggtc gctatgattt caacattttt ttgacggtac tcgattttta cgaatttttg 180 tgtaagtttt gcataattga acatctatct caacattgtg gtaaataaat taatgcttgc 240 tgaaaagcag atgctatata aattcattta tatatctaat atcttttagt tgtctattgt 300 cacatgattt ggcggccatc ttgtaataag cacatgtctg cagtatttca ataaaaattt 360 tataaggatt atttttttat ttctggtaac aaatattctt atacttaagt attatatttt 420 atatatatat ttattatata gttaatataa tgaattacaa ataaattatt attaaattaa 480 atagttttca aaaagaatat gctataaaaa ttcatttata tatctaatat cttttatcaa 540 ttgtcacatg attaagtgac catcttgtat taagcacatg tctgcaatat ttcaaaacaa 600 attatgtttt tcagtaatgt tttttcattt ctgttaataa tataaagtac taaatattaa 660 ttatataaaa ttaaatagtt gaagaacaaa gaatcatttc aacttaaact tctgcaggta 720 attaataaat aataaaaata tatattacac cacttatggc agttactaca agaagctctg 780 cagaggtatg gctcattggg aaagcatcta agcagctctc ctcgtcaaga ttggcaacaa 840 atggagatgt cctccggtgt ctcctctttc atcatctgga agagcatctc acaataaagg 900 acagcattca ccatacgatt caagagctga ttgttatgtg gaataaggca agaatcccaa 960 cacaaagaat tgattctggt gaaagaaaac ttcagaaatt atacgattcc tatctgctcc 1020 taaaaaagaa tcgaactaca gacctggaaa gcagtagaat taaagaacag atgttcaaag 1080 acacacttta cgagcttttt gacttggcaa ctaaaaccgc aatggagaca ataaccattg 1140 ctgaagatag gcaatttctt gcaatgcagc gagaagatgt taccagctgc agcatggctg 1200 gaattgataa gaacttactg gcaaaggaag cccggaagag agctcgagat caagcatatg 1260 atgcaaggtt atcaaaacag caacactcca gtggtcatct cgatcctgtc tcagccaaca 1320 tcagcagctc agaagacagt caagacagcg atgacgaccc aagttttgag attctgtctt 1380 ctgcatcgca atctgttttt tctacaccaa agccaaagcg tctgaagagt attgtttcca 1440 gcccggaagt tgctggagct ttggacagag ttaaccttcc agatcgcaaa gctatgttcg 1500 ttgtcgcttc agtggctaag gcactgggac atcctcttgc agacttaact ctgtctcgaa 1560 gtacaatcag gagatctcga atgtctaccc gtaaacacgt gactcagaat gacaaagaca 1620 acttttccat tgaatttcct ttgcttctac actgggatgg gaaactgctt ccagatatca 1680 ccggttcaaa agaaaccgtg gaccgaattg cagtgattgt aactggaaat ggtttggaga 1740 aattgctagc agttccaaaa attggaagag gaacaggaga ggaacaagca gcagcttgct 1800 taaagatctt ggatgattgg aaaattcgag acaaacttca gggtttagtc tttgacacaa 1860 cctcgtccaa cacaggtatc cacaaaggag cttgtgttct catcgagaaa gcagttggtc 1920 gtgatcttgt aaatattggt tgccgccacc atgttctgga ggtcattctg agcaatgtct 1980 ttacagctct ctttggtgga acagggggac ctgaggttgg cttgttcaaa cggtttcaga 2040 agaagtggcc atatattcag caagcgagat tctctccagc taaagatgag ctcttcatcg 2100 gagatatgga aatccttcga aaggaaatgg tagcattcta caccagagcc atcgacgaaa 2160 aacaacctag ggaagattat cttgaactac ttcggttatg cctggtcttt ctagggggag 2220 gctctgttgg tgctgagata aaatttcgag ctccaggagc gatgcatcat gctcgatgga 2280 tgtcaaaggc catctacgct ctgaaaatgg tactatttca agatcagctc actctgactg 2340 ttcgagagaa gaaaggtcta gtagagcttg cattgtttgt ggcattaatt tacggacgtt 2400 tttggcacga agctccactg gctgcaaatg ctccgtttaa tgatgctcaa atgctgaaac 2460 agttccaaaa gtatccaaac cgcacaatcg ccgatgcagc ttttactgca ttttctcgac 2520 acctttggtt cttctcggaa catcttatag gtctggcatt ttttgatagt cgcgttgatc 2580 tggatgtgaa gagggccatg gtggttaacc tccaactgcc caagacaact ctagcactga 2640 agagagtaga ttccaagaat accgacttca acagactaga gacattcgtg actatgagaa 2700 cctatagttt attcgagttg ttgagcagta cagggagaga ggaggccaga aacttcattt 2760 caaaagatcc taaatattgg gaggacgatg aatcatatca gaagttggca gagagagtcc 2820 aaagaatgaa agttgttaat gacagcgcag aacgtggaat tgcattaatt ggacagtaca 2880 acgaatcgat tacaaaggat gaagaacaaa agcagttcct acttcgtttt gttcaacgtc 2940 accgccagct ataccctact tcatccaagg cagccatgtt ggcgatagat gacgaaatcg 3000 ttgattaata aacatacatt tttcaacata ttcacaatat tcttgatatt acagtttttt 3060 gtttaaaaat tacgaaagtt acgaaacgtt gatatacttt aataaaaatt gatcaattga 3120 tgatctatat tcattagata ctctacaggc attctattta cctgatttaa attgttagtg 3180 gcattaattt ttaaattcga tattatattt atagtgaaaa cttaaatttt ttgatttttt 3240 tatcattgtg gtcaaaaagt caaaatttat agcaacccct aaatattatc cgatttgctt 3300 caaatttggc tacaatattg caaatattaa taggaacaaa ggcagccttg tggagaaatg 3360 aatttcaata atttataatt ttggaccacc c 3391 // ID Gypsy-4_DWil-LTR repbase; DNA; INV; 278 BP. XX AC scaffold_181026; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_DWil_; KW Gypsy-4_DWil-I; Gypsy-4_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-278 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181026; Positions 153207 152930. XX SQ Sequence 278 BP; 86 A; 32 C; 69 G; 91 T; 0 other; tgtaggattg cgtgagaacc cctaatggta tcagaatgtc gataacgata ttagtttgat 60 agatatcgat agatggtcaa tcggggaggg ggagagatgc ttgagttgct ggacgaagag 120 gttgtgttgg tctcgttaaa ttaaaagttt tcaataagat ggtttctttt tcttttgtta 180 ataattaaat aaaaatgata agctaagata catgcataaa agcaaagtgg tgtgaagagt 240 atattttatt ctggtccagt ggactcagcc tttctaca 278 // ID Sat250_Cis repbase; DNA; INV; 367 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat250_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-367 RA Smit A.F.; RT "Sat250_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000250; 367 bp unit. XX SQ Sequence 367 BP; 110 A; 50 C; 78 G; 124 T; 5 other; attgtgtttg tagacccact acaagtcaag ttatgtagtt ttatactcgc gggggagagt 60 acacaaggtt cgcgaaacaa tgggttggcg atttctttcg aaaaccgtca aaaggagggt 120 gtgcgtaaac ggctctattg cgaaaaataa aggagattaa gacttagntt ttggatatgt 180 tatagctgga taaattatct ttaattcaat ataaaaanta attatatttt cctncattta 240 agtagttact tacacgatcg cattttggtc atattgagag gtgtattact nagcgnatga 300 ttaatttaaa ggtttacttt gtagatggtc gtatagcagg attcttcttc tttccaatga 360 tgccaag 367 // ID Sola3-2_CB repbase; DNA; INV; 6848 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Caenorhabditis brenneri. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-2_CB. XX OS Caenorhabditis brenneri OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-6848 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(2106..2414,2429..3127,3269..3598,3620..4087, FT 4000..4332,4295..4798,4674..5006,5050..5703) FT /product="Sola3-2_CB_1p" FT /translation="NNELNLFQAVCPEHKKYAKAALSQRGVEDKGYVLSPP FT PEENGLMSDSDDVDNESHHQGRMHYEDPDYRARTTPLSNERYTVSEQIRDA FT FYHFAETAGQKRVSCILEFRSFQYISFKQFSDLQPDTKRKKSLILRHLLKV FT MTHIMAPDSPEELRSMAIGEDAKSIWGVNSDQELSRVLEYSSKLYYQTESR FT TERLHLLSYFSPLLSLSHIQQYIPGLSKSMYYHSKKLTQKEIVKVSGRRER FT YDPIKVQLFVEFITRFVIFLKYVNRRLFFSNVVTSSHTFGVKKGTLSDGSI FT IELPNTIRKQGATEIIRMYKMHLIVSFNSMSLQTSLFTGTKHGGKFQAQEK FT INEILNNWYQNGKIPSAYRDQLIRGIEESVNYFRTDFRIHLSQESTTADHC FT MQHALSDPVKPDFQVQCDHDHSIKCDRCEMVKNIEGELLDYSDSLVKNAEN FT SGYSRFFLDDQSAVGTYQEQYGTIKNCLKKMFEFKRHVLRSKYTDQHRAHI FT LSNLGEDEAMITLDFAQKLLPMKFYETQLDYFGKRGMSYHIAHVMANVSET FT LVSHSFVHILKDGAQVRRFSLHFLVDAVFRTTKLSQQYSHTSCLNLEVWES FT KRFICSFQDNQIVTAILTHILSELGSLGIKAVYLRSDNAGIMFVSRLGIYF FT FAGCYHSARTLFSLSTISEKTGVEVSRYSFSEPQNGKSSCDRVASQIKRQV FT SKIILFSHNTIKKAKSFFFHIIRSKNKVREYVDLKNDVTTPEEFFQAVTAR FT PLRGVSYYLADLNDRDSSKRKPATITGMSGLYDFEIDRYYLTARRYFGIGD FT GIKVKVDSKQANEATLAIIDQGGNTARKVILRASVYIELVSPDQHNKGEIE FT HCTKTSRSPILLETWTRCCQKILLYFSLNSFLQTNTIKEKSSIARRPHGLP FT SFWRLGHAVVKRFFSTSHYDENVALQDVHQEELHDHVLRIFRCPVAGCSAR FT FLYFGSLEHHLDRGKHNSASERRTLRDEALHNFRLHVLEDLIVPASKRIAA FT VTNALQKLRLHKNDQLLNLPIGWAIRTRAKQGRFSETVHTWLKDIYDSGRK FT GRKKTGKEVELMMKTAVNEDGNVLFSVEERLNWRQIQSVFSQLTRKDDDKE FT ANRAKRNAEKVETETEETSNEYSFQFEESYQDEAGPNLPDILWMKIKTDRS FT IFLGDDEEDEEDEEDESDVVIIPEADSSRRRSHSPEKSRKFKRIRDELFL* FT " XX SQ Sequence 6848 BP; 2218 A; 1237 C; 1349 G; 2044 T; 0 other; gaggctctat ccgtaccagt gggaatgtgc caggcacagt tgcaaaaagt taatccgttc 60 cagctgagtt ttgaaccatt gaaacatgaa tggaagtgat ttccaaaaag tttgctaaaa 120 tttggatttt tcgattttgg ctgaaattat agtcaaaaat cgacttttca ggtcgaaaat 180 tctaaaaatt ttcgtttttg agatttcgtt ctgaaaagtt taggggttca tttggatgca 240 taggggcgca ttttgaacaa atttgggctt cttgaaatat cggaaagtgg gaaaaacacc 300 tctgagacaa cggttttttg agtttttctc gaaaactttg aatacctggg gtaacttttg 360 acaatttttt cagattcccc atgcaatttg gagcaaaaaa gtctcttgga ccaaaaaaag 420 ttagacggac taaattcgat ttttttcgat tttttcgaaa atcaccaaag ggtcccccct 480 tatgaaattt tttttttcga aaaaaaatca aaattttttt ttgcctcaat taattgagat 540 tacattttcc tcatcgtaaa taacacgctc agaatttttc agttgatgtg atggcgtttc 600 agccaagttt taccttgaaa cgtctgcata tatcataaaa ctcgattttt ctcaaaaaat 660 ttccacagaa gacacatatt ccaacaagat tttcgaattc cccatgcaat ttggaacaaa 720 aaagtctctt ggaccaaaaa aagttagacg gactaaattg ggtttttttt cggttttttc 780 aaaaaaaaaa catataaagg acatgaaaga cctgaaaata tttcgaaaac atttttttct 840 ctttttcatc agctttcgga tttctgatca aatcataact tcacattgtt tgcaacatat 900 gttcagattc aatattctga aatacgatct ttaaccagtc aaacagcttg cgcgttttct 960 caagcattac gagcgcgtat ctctagatga ccaggatgtc ggagtttttt tttttttgtt 1020 gcgcgttgat tgaaaagtga attgacacga ttacctcaat tcataagttt ttttttcagt 1080 acgccccagt ttataaagaa acggaacgaa aattactgat ccggtggcca aacattcatt 1140 gaatatattg tttattattt tcaatttcta tttcactgtt tttcgaacat gtcgtttcga 1200 acaattctgt ttttttattt attattttac acgattgtaa tatctgaaac ggtaggataa 1260 tgttctcttt ctgtttctaa atgagctttc aggcgaaaga ccctcagata aacaacatag 1320 tgagaaatcc gagtgagccg agcacttcta atgctgatga accgttttat gtttgtgaca 1380 accgaactcc taaaaaatca tctgtaagag aatcgattag ttattctgaa aaagaaggag 1440 gaggtgatga cgccgaagac gaatgctatt ttaacagaaa gcggagatcc cctctagaaa 1500 ctcaagaaag acctaaacga tttgacgttt cgtgttcttg ttctggacca atgaccaggt 1560 aaatttgaca tgaattctat tcgaactgtt ttttagtttg aaagttgtga tggaatcgtc 1620 cgaagactac gaaaatttat atgaagctgc taatccactt tctttaattt tgtgtcgtgg 1680 cgcaggaaaa tggtcagttc aatatcagaa ttttcattac tttttatttt tcacagtgat 1740 cttaatttcg aagatccaaa cgttgtgctg gaatggtcca aaaagtcaat ctgtgataag 1800 cacaaaactg agctattgga tgattgggcc aattataagt acaatcatat ttttcggaga 1860 attgagcgtt cggttagtac tataatattc agatctcgtt caatcataaa ggcgtatatt 1920 gtttattgaa gagtaaacgg gtcgcttgtt cagtcgcggg aaccctcggt tttaaccatg 1980 aaaatggtaa acagcctttc attcgaggtc gtaataagta cgaactgaat caaagtgaag 2040 ctaatgccat actcaaaaac aatcatgtac ttcttcaccc aggaattcgt aagtttttat 2100 gttgaaacaa tgaactcaat ctttttcaag ctgtatgccc ggaacacaaa aaatatgcta 2160 aagctgctct ttctcaacga ggtgttgagg acaaaggata tgttttgagt ccaccaccgg 2220 aagaaaatgg gttgatgtcg gactctgatg atgtggataa cgaaagccac catcaaggga 2280 gaatgcatta tgaggatcct gattatcgtg cacggactac accattatcc aatgaaagat 2340 acaccgtttc ggagcagatt cgggacgcgt tctatcattt cgcagaaact gcaggtcaaa 2400 aaagagttag ttgctgactt tgcagtaaat attggaattt cgttcatttc agtatattag 2460 tttcaaacaa ttctctgatt tacaaccaga caccaaaaga aaaaaaagtc tgatactacg 2520 acatcttctg aaagtgatga cccatataat ggctccagac tcccctgaag agttgagaag 2580 tatggcaata ggagaagatg caaaaagtat ttggggcgtg aattctgatc aagaattgtc 2640 tcgtgttttg gagtattcat ccaagcttta ctatcaaaca gagagcagaa ccgaacgact 2700 tcatcttttg tcctactttt cgccgttatt gtctctaagt catattcaac agtacatacc 2760 agggctttcc aaatccatgt actatcactc taaaaaatta acacaaaaag aaattgtgaa 2820 agtttctggt agaagagaaa ggtatgatcc tatcaaagtg caactttttg tggagtttat 2880 aacaaggttt gtaatcttcc taaaatatgt gaaccggagg ttgtttttca gtaatgttgt 2940 tacaagttca catacatttg gtgtaaaaaa agggacatta agtgatggtt ctattattga 3000 actcccaaac actatcagga aacaaggagc aacggaaata atacgaatgt acaaaatgca 3060 tttgatagta agtttcaact caatgagttt acaaacctca ctattcacag gaacaaaaca 3120 tggaggataa gaaaatgagt gactctacat attttcgaat tttgagcgct tgtccagcta 3180 caaagcaatc gggagctgtt tgtgttgatt atttcatggc ggacatgttg gaagtacgtt 3240 tcaataagaa caatacactt taatgtaaaa gtttcaggca caagaaaaaa ttaatgaaat 3300 actgaataac tggtatcaga acggtaaaat cccaagtgcc tatcgtgatc agcttattcg 3360 aggaattgag gaatcggtga attattttcg cactgacttt agaatacact tgtcacaaga 3420 gtcaacgact gccgatcact gcatgcaaca cgccttatct gatcctgtca aaccggattt 3480 tcaagttcaa tgtgaccatg atcattcgat caagtgtgat cggtgtgaaa tggtgaagaa 3540 catagaagga gaacttttag actattcgga ttctttggtg aaaaacgccg aaaatagtta 3600 gttagatgaa agttgataag ggtattcacg gtttttttta gatgaccagt ctgcagttgg 3660 aacatatcag gagcaatatg gcacaatcaa aaactgtttg aagaaaatgt ttgagttcaa 3720 gcgtcacgtg ctccgttcaa aatacaccga tcaacacagg gcacatattc tctcaaatct 3780 tggagaagac gaagcaatga taactctgga cttcgcacag aaactgcttc ccatgaaatt 3840 ctatgaaact cagcttgatt actttggcaa acgtggcatg agctaccata tagctcacgt 3900 tatggcgaat gtttcggaaa ctcttgtcag tcactctttt gtccacattt tgaaggacgg 3960 tgcacaagta cgccgatttt ctcttcattt tttagttgat gcagttttca ggacaaccaa 4020 attgtcacag caatactcac acacatcctg tctgaacttg gaagtttggg aatcaaagcg 4080 gtttatttga gatcggacaa tgccggtatt atgtttgttt caaggcttgg catttacttt 4140 tttgcaggtt gttaccattc ggcaaggaca ctgttcagtc tcagtacaat ttccgaaaaa 4200 acgggagttg aagtttcccg ctactctttt tctgagccac aaaatggtaa gagtagctgc 4260 gatcgcgtcg ccagtcaaat caaaagacag gtgagcaaaa tcattctttt ttcacataat 4320 acgatcaaaa aataaggtcc gtgaatacgt ggatctcaaa aatgatgtca caacaccgga 4380 ggaatttttc caagcagtta ctgccagacc actaagagga gtgagttatt atctcgccga 4440 tctcaacgac agagacagca gcaagaggaa accggccacc ataactggaa tgtcaggact 4500 ttatgatttt gaaatagata gatactattt gaccgcacgt cggtattttg gaattggaga 4560 tggaatcaaa gtcaaggtcg attcaaaaca ggccaacgaa gcaacattgg caatcatcga 4620 ccaaggcggg aatactgcta gaaaggtgat tttgagagca tcggtataca tagaactcgt 4680 ttctccagac caacacaata aaggagaaat cgagcattgc acgaagacct cacggtctcc 4740 catccttctg gagacttgga cacgctgttg tcaaaagatt cttctctact tctcattatg 4800 acgagaacgt agctttacaa gatgtacatc aagaagaact acatgatcac gtactccgca 4860 tcttccgttg ccctgtggct ggatgctcgg ctcgattcct gtattttggc agtctggagc 4920 accatttgga tcggggaaag cacaactctg cttccgaaag aagaactttg cgagatgaag 4980 ctttgcacaa tttccgcctg catgtatgat caatgttctt ttttccgaaa aaaaaaacat 5040 tttttttagt tggaagacct gattgttccg gcttcaaaga gaatagcagc agttacaaac 5100 gctctgcaaa aacttcgtct tcacaagaac gatcagctgc tgaatttgcc aattggttgg 5160 gctattcgta ccagggcgaa acaaggacga ttttcggaaa ccgtacacac atggttgaag 5220 gatatatatg acagcggacg gaaaggaaga aagaagactg gaaaggaagt ggaattgatg 5280 atgaaaacag cagtcaatga agacggaaat gtcctgtttt cagttgagga acggctgaat 5340 tggaggcaaa ttcagtctgt gttttcacaa ctgacacgga aagatgacga caaagaagca 5400 aatcgggcaa aaaggaacgc tgagaaagtg gaaaccgaaa ctgaagaaac aagtaatgag 5460 tactcgtttc aatttgagga gtcataccaa gatgaggctg gcccaaatct tcctgatatt 5520 ttgtggatga aaataaagac cgatcggagt atatttcttg gggacgacga agaagacgaa 5580 gaagacgaag aagatgaatc agacgtggtc attataccgg aggcagattc gtctcgacga 5640 cgttctcatt ctccggaaaa gtctcgtaaa ttcaagcgca taagagacga acttttttta 5700 tgataaaaca ataaatgaat gaatttttga attactagct ggttttatat ttgacacacg 5760 ttttctcaat tgaaagtttc cgattatctc atcttatctt atcaaagcaa cccttatcaa 5820 cgcgcaacaa aaaaaaactc cgacatcctg gtcatctaga gatacgcgct cgtaatgctt 5880 gagaaaacgc gcaagctgtt tgactggtta aagatcgtat ttcagaatat tgaatctgaa 5940 catatgttgc aaacaatgtg aagttatgat ttgatcagaa atccgaaagc tgatgaaaaa 6000 gagaaaaaaa tgttttcgaa atattttcag gtctttcatg tcctttatat gttttttttg 6060 gaaaaaccga aaaaaaccca atttagtccg tctaactttt tttggtccaa gagacttttt 6120 tgttccaaat tgcatgggga attcgaaaat cttgttggaa tatgtgtctt ctgtggaaat 6180 tttttgagaa aaatcgagtt ttatgatata tgcagacgtt tcaaggtaaa acttggctga 6240 aacgccatca catcaactga aaaattctga gcgtgttatt tacgatgagg aaaatgtaat 6300 ctcaattaat tgaggcaaaa aaaaattttg attttttttc gaaaaaaaaa atttcataag 6360 gggggaccct ttggtgattt tcgaaaaaat cgaaaaaaat cggatttagt ccgtctaact 6420 ttttttggtc caaaagactt ttttgctcca aattgcatgg ggaatctgaa aaaattgtca 6480 aaagttaccc caggtattca aagttttcga gaaaaactca aaaaaccgtt gtctcagagg 6540 tgtttttccc actttccgat atttcaagaa gcccaaattt gttcaaaatg cgcccctatg 6600 catccaaatg aacccctaaa cttttcagaa cgaaatctca aaaacgaaaa tttttagaat 6660 tttcgacctg aaaagtcgat ttttgactat aattccagcc aaaatcgaaa aatcagattt 6720 ttgataaact ttttggaaat cacttccaag tatgtttcca tggttcaaaa ctctcctgga 6780 acggattaac tttttgcaac tgtgcctggc ctcaacagaa aaattccact ttttacagaa 6840 agtgcctc 6848 // ID Kolobok-1_CS repbase; DNA; INV; 4853 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-1_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4853 BP; 1605 A; 862 C; 891 G; 1495 T; 0 other; aggggaactg aagtaaattt taagaatatc ttaattaatg gaaaaaaaca cgctgatttc 60 aaaaatcaaa tcagtttctt ccaatattgt tttgttaaaa agttatgagc aaatatgcat 120 ttgattttat gttaggggat ttcccgtatt acgtcattga cgttgttgag atttcgtgcc 180 tatatgcatt gtatgcaatc atgtgccagt ttagaggtga attagtaaca gaagcgcctg 240 caggagcgtt gtattttcgt tttttttatg aagacttagc ataacacttg tcatatagcc 300 taactgaacc aatgtacgcg cgtgacagct taccttaagc gcggtcattc aggcctacac 360 ggtcctggac ctaagctgaa tggtgagata tacaggtctc gtatttaaat gactttagaa 420 taataagtct tagcaattta tatcataaat cataaaagtg aagacagaac aagaaaacta 480 gaacagaatg cattaaactg taactgccag caatttatag aaataaacca gccagcaacc 540 aaaggctaga aactatgtgg tggggacagc ctatttcttt catggtttac cttggctgtt 600 tgcataatgt tttgctgttg tgtatgtggt gcatttaatg tctgctgtta ttaaaactga 660 gcattagcct actggttgag aataccactt attgtataaa ccagttaatc cacagtagcc 720 gatgtgtagt ccgacaataa gtacacagca agcactcaga tacttaagta aagaatatat 780 ataataaacc tctacttaac tacccaaata agttaataag aatataccac ttttccaatt 840 cagttctttt cattagatat tgtaaaatgc ctaaagttta caaagtaaaa gaagccactg 900 agaaggaagg aaaagtgatt ggagcatatt gttttgctcc gggatgtaaa aattcctatt 960 acagcacacg aaattctaat ccacctgtac atttccatcg atttcctcac aacgacaaaa 1020 agcggatgaa gcaatggata caacaatgca agcgttctaa actacctgga aaaagaaccg 1080 cagtgtgtag tgcacatttt tgtgacaaag attatcatag caaaattgta tttgactcca 1140 atggagtcgc caagacgatg aaaacaaacc gcttggtgaa aaatgcagtt ccgacaattt 1200 ttaatttttc tgattatgat tttaaacata gtgatcaacc cagtacatca agcggaattt 1260 catacaggac acaacggtat gaaaagagag tttgtcaaat ggaggtatgt gctatttaca 1320 atttttgtca tgcagccaca attgtgccgt ttttcaaaaa ttcattttaa tacagggtaa 1380 ggaaaaatac cttgccatag agtctttgtt gagaccaact aatctaccta cacaaagtga 1440 tacatctcaa tctaaaccag ctgattctga agtatgttca gtcttatagt gtttttgaat 1500 tacggtattc ttttaataca gtgttgaaat tgatatttaa ataattcatt tgtaggaagc 1560 tgaagaatct gaaatggaag aaaccagctc tgtagaaata agtaccacag gatctctggt 1620 gtcatttagt gatgaaattg catcttcagt ggggagtagc tctcatactg agttgaatgt 1680 gagcaaaaaa attgaatatg taccgtcatc acagtcttca tcatcaaagt caagcgaatc 1740 atccaacagt gatgatttca atgctattga aggcgttaaa ttgattgtct atttacactg 1800 cctgttacaa cttttcaaaa cctgccatgt acctggttgc cataaggggt tgcacagtaa 1860 acctaatgta tcttttaaag gttttgccgt tatcattaca acagaatgtg tttatggtca 1920 cacgtttgtt tggagatcac agccgtttat taatggatac acatgcaatt acctaatccc 1980 ttccacattg tttgttattg gaaaaagtta tttatctttt ctgaatgtgt gtaatttgtt 2040 gaacattaaa gcggtagttg aaaggcattg ctactacatc caaaacaact tcataattcc 2100 tgtcgttgaa atgttatgga aaacttataa caaagctgta ttggaggagt tgtctggtca 2160 atcaaagaag ataattgtat ctggagatgg acgctgtgat agccctggtc actgtgcaac 2220 tttgggatca tatacagtgt tggataatga aactaacctt atacttgcac aagaaacagt 2280 acatgtgtca gaggtagcta atagttactg gctggagata gaaggtctaa aacgttgttt 2340 ggctcattta aaggtaagat ctgtcctaat attcaagttt tgcatagttc tttatagaag 2400 actaatagtg tcatttttaa aggtaaatca tgtcactgtt gatattctgt cgacggatat 2460 gcatccgggt gtacaaaaaa tgattcgaac tgacttcaag agcatttgtc atcagtatga 2520 tttgtggcac attgcaaaaa acttaaaaaa acggcttgct gcttcaaaaa acacatatgt 2580 taacatgtgg attagtgcaa taattaatca tttgtggtat tcagtggcta catgtaacaa 2640 aaatcctgtt ttactaaaag aaaaatggac ctctttactc tatcatatcc aaaaccaaca 2700 cagttggatt tccaatgaat attttcacga atgtgaccac aaaccataca ccaaggaaga 2760 agaggaagcc aggcaatggt tgcaaccaac aagtgatgct tttgctataa tacagaaagc 2820 agtatcccga acgacactgt tgaaagcatt ggaaaaggta cggctagact gatttaactt 2880 gcattgttac cgcaggaaat gttcagaagt aactaataga cattttcagg tcactgaagg 2940 cattcataca ggggaactgg aaagcatcca ttcactatat acaaaatact gcccgaaacg 3000 aattaagttt acaaaaatgg gccttcaagc aagactgcgt gtggcagccc tggatcacaa 3060 ccatgctgtt gaccgcaagc aagcagaaaa taagggtgga aaactgcggt tcaaaatcca 3120 gtactccaaa gcatccagaa gatttgtggc aaaaccaatt aaagttgcca aatcgtacaa 3180 cttcagaaag gaactcataa cagcaatttt tagtaaaatc accaaaggtt tgttttaact 3240 tttagtttat ttgaagcaaa atcataaatt aattcaagtt gaaataaagt acagatacag 3300 ttattatttt tcaacagaat catctatgtc agagaacatt agagaagcgg agatgctggt 3360 cactcttagc gaacatgcag gagttgaaaa accaccagca gaagaagtta ttgcttctat 3420 gcgtagccgt ttttccagat gatccaaatc ttcagcccta tgcaaaacag tgttcatgga 3480 tgtaaaattg tttattatta gctctttgtc accagtgtta gcattcaatg tttcttttct 3540 gtgttatttc aatacatgag tgtgaatctg catattgcac tcacctgtaa gtcactatgt 3600 gcgaagccag tgtatacacc actagtctct ggaaatgctt cacgaatttt ttgaatcaca 3660 catgctggaa taacgaagcg catatttctt ccaagcttcc cccgtgccca tagggtaaat 3720 tgacggtagg atgcaagtct gtacgttctg agcacaaaag tttgcacaaa tacatgtcat 3780 ttctatgata tttaatatac catattacac ttcacttacc gattttctat gggttgagtt 3840 aaagaattag cccgagctaa gcacatggtc aaaagtgcaa cttccagaac ttctttatcc 3900 aaacaaataa ttaaaaaatg agggttttta gtaatgcaag tccagtctgg tgttattttg 3960 tgcagaaatg catgtacatt catgcagcat aaacattctt gcggtgtgtc cataagaata 4020 cagtttttgc aacggcacca agtcatgtca ataggtaatg tttctaaata aacattttct 4080 tctatggggc ttgcatcttc agagatactt ccagacgttc tcattggttc aaacgcatat 4140 ggctgaattt gtgttgtttt attgtgtaca ataaaacttc tttcatccat gccagaatgc 4200 tgaaaactgt catccatttt aatattacaa cattaactta aatttactct acatatgaag 4260 aaaacaaaag taatttgtta aaactcctat gtatatgtac caaatttctc agtttgtacc 4320 ctaagggttg ataataacat tgctagtcta cgttacagat tatggctata tattgaaaag 4380 caacaatcat gaaagaaata ggctgttccc accgcatagt ctgatctttg attgctggct 4440 ggtttatttt tataaattgc tggcagttac agtttactgc ctcctgttct agttttgtag 4500 gcctgaatga ccgcgcttaa ggtaagctgt cacgcgcgta cattggttca gttaggctat 4560 atgacaagtg ttatgctaag tctacataaa aaaacaaaaa tacaacgctc ctgcaggcgc 4620 ttctgttact aattcacctc taaactggca cataattgca tacaatgcat ataggcacga 4680 aatctcaaca acgtcaatga cgtaatacgg gaaatcccct aacataaaat caaatgcata 4740 tttgctcata actttttaac aaaacaatat tggaagaaac tgatttgatt tttgaaatca 4800 gcgtgttttt ttccattaat taagatattc ttaaaattta cttcagttcc cct 4853 // ID Mariner-1_AP repbase; DNA; INV; 2330 BP. XX AC Contig65588; XX DT 07-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 15.12, Last updated, Version 2) XX DE Mariner-type transposable element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_AP. XX NM Mariner-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2330 RA Jurka J.; RT "Mariner families from Acyrthosiphon pisum."; RL Repbase Reports 8(3), 340-340 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 504..1805 FT /product="Mariner-1_AP_1p" FT /translation="MIVNLFKSKMIQQPTLKVKEVAMIISKELGIGKNTIQ FT STIAEYKNKKTVSSPNKSKIRATYKQKVDDFERDAIRRKVHEFWFRKQLPT FT LDKILTAVNEDPDLNTYKRSTLHLLIHDLNFVYVKRGRNSALIERDDIVLW FT RTKYIEDIRKYRAQRRTIYYLDETWVNAGDCNDRIWQDNTVTSHRDAFLSG FT LSTGAPNPTAKGKRLIVVHIGSNEGFVDGGLLVFESKKGSSDYHEEMNGDV FT FFDWLKGVIPLLKDNSVIVMDNAPYHSVKVEKCPTLGWKKAEIESWLEEKG FT EPFQRPINKVGLMEIVKRIKPQFNKYVVDEYVKTKNMTVLRTPPYHCELNP FT IELAWSSVKRYVKSNNTTFKLPDVQKHLIDGVNQCTPEMWENFVKHVIQIE FT DRFWNVDMTVDDVMDDDNLHVMTITGDTSDSDDLGCVALE" XX SQ Sequence 2330 BP; 758 A; 407 C; 481 G; 684 T; 0 other; tatactctgt ccagcgagag aacgggccgc gcgccgacca aaatcggccc gttttcgcgc 60 atccccgagc gtacccagcc cagtgtgctc acaacgacgg cccaccgacc gatgtcgcgc 120 tccgacgacc gacgtcgctt tccgacgacc ggcggcacgt gtccgacgat tccagttgat 180 gctcgtagta cattcctgat tccagcgcga agtgtgttta ctatttatta tacgtaactc 240 aagtgaaaat tatatctttt tttcgtacga ataatataat tttaaaatga gtgatgagcg 300 accagtttct agtacaccat taaaggtgaa ccctagagga aaagtaagtg tggataaatt 360 ttaattgttt ttaatttgga aaaccggtac taggcccctt aatatattac attttgtaat 420 tttttaaaga atattatttg atttgtaatc aatatatttt ttttgtcttc atttttcagt 480 tcgtcggaag tcgtcaaaaa ctgatgattg taaatttgtt taaatcaaaa atgatacaac 540 agccaacgtt aaaggttaaa gaagtagcga tgataatttc aaaagaattg ggtattggta 600 aaaatacaat ccagtcaaca attgctgaat acaaaaacaa aaaaactgtg agttcgccaa 660 ataaatcaaa aattcgagcg acatataaac aaaaagttga tgattttgaa cgtgacgcga 720 tacggaggaa ggtgcacgag ttttggttcc gaaaacagtt acctacactg gataaaattt 780 taacagccgt taacgaagac ccagacttaa acacatacaa aaggtctact ctacatttac 840 taatacatga cttaaatttt gtatacgtca aacgtggccg taacagcgct ctcatagaga 900 gagatgatat tgtgttatgg cgcacaaaat atattgaaga tatacgtaaa taccgggcac 960 aaagaagaac gatatactac ctggacgaga cctgggtcaa cgcaggtgac tgtaacgaca 1020 gaatatggca ggataatact gttacgtccc acagagatgc ctttttgagc ggtctttcta 1080 ctggcgcacc gaatcctaca gcaaaaggga aacgacttat tgttgttcat attggatcaa 1140 acgagggttt tgtcgacggt ggcttattgg tattcgaatc caaaaaaggt tcttcagatt 1200 atcacgaaga aatgaatggt gatgtttttt tcgactggtt gaaaggcgtc attccattac 1260 ttaaagacaa ttctgtcata gttatggaca atgcgcctta tcactcagtt aaagtagaaa 1320 aatgcccaac attaggatgg aaaaaggcgg aaattgaaag ttggcttgaa gaaaagggtg 1380 aaccattcca aaggccgatc aacaaggtag gactaatgga aatcgtaaag cgtattaaac 1440 cgcagttcaa caaatacgtt gttgacgaat acgtcaaaac taaaaatatg acggtattac 1500 ggacgccacc gtatcattgt gagttgaacc caattgaact tgcgtggtcg tccgtaaaaa 1560 ggtatgtgaa gtcaaacaat accacattca aacttccgga cgtacaaaaa catttgattg 1620 atggtgttaa tcaatgcact cccgaaatgt gggagaattt tgtcaaacac gtcatccaaa 1680 tagaagacag attttggaat gtggacatga ccgtcgatga tgtgatggat gacgacaatt 1740 tacatgtaat gacgatcact ggggacacgt ccgattcaga tgaccttgga tgtgtagcat 1800 tagaataaat atattgttta tacaaatgtt taattgtcat gaatttttat tatgtatatg 1860 tattttgtat tttttttttg ttttttttta attgttatga caatggttat taacctactt 1920 ttaatgtatt ttgtattatg tttattgcat atgaaattaa ttgttaaagt tgactataaa 1980 aaaaaatatt aataaaatat tttacgttta ttttatgaca aaggtattat aataacgtgt 2040 ttttactaca tttatttgtc gaaaaatgta gaaaaaacta agtgttcata ttaaaaaagt 2100 tgtattatta aaattacgtt gcatacaata attaatgcgg taagccggta aaatattgca 2160 cgtgttatac gggcggcaat agtcaacagg tggttgattc ccgatcatcg gcgacggttc 2220 gatcgaccgt cttcggcacg ccgagagagg gggaaatggg gcctcggagc tcgctcgctc 2280 gtcagtctgc atgcgcagaa ccggcccgtt ctctcgctgg acagagtata 2330 // ID Shinagawa-3_AAe repbase; DNA; INV; 1882 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 16-FEB-2011 (Rel. 16.03, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1882 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 840-840 (2011). XX DR [2] (Consensus) XX CC >90% identical to consensus. 8-bp TSDs. TIRs are ~130 bp long CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. The insertion of DNA-TA-8_AAe-like CC transposon around 272 and that of DNA-TA-4_AAe-like transposon CC around 1627 are eliminated from the consensus. XX SQ Sequence 1882 BP; 638 A; 297 C; 316 G; 629 T; 2 other; gattgtatga catttgccag aaaaccgttt gccagaatca atttgccaga atgatttttg 60 ccagaaaacc attccccaga atgtaccatt cgccagaaag ccattcccca gaatggacca 120 tttgccagaa aaccattccc cagaatcatt ttttgtaata aattttcaac tttgatatga 180 aatctattga tcgaaaaatt tatggaaaac ttaaggctac cgaagtatta aaaattgtta 240 tttgagtatt ttgattatat gtacttacat tgatttgtac ttatagcacg aaacagtgta 300 taaccaactt tcagataatt taatccagaa gagtggacta ctaacactgc atagctactg 360 cgaaaactga attggacttt agcatcccta ctacagcaac ataacgttcg cgtcgtggat 420 gttgcgccaa atcatatttc aatcttgtaa tataaaaaca agtataaaac aacgtgattc 480 gtcgaactat gttgtactaa gtaaaatttt gttacaatct ttcgcataca attttatgca 540 attttcacaa cgtgaaagat atatggggga actgtcacca gatttccatt ttttgctgca 600 taactgacct gagaaaaaaa agctaaaatg ttaggcaaac atacttttag ttgaatggat 660 atttgactga agagtgtatc tttgagaaat atcgcaaaac aatcaaaaga accgcctttc 720 attgaaataa gggaaaattt tatttgaaat atcatcagct tggcgccaat aattatttgt 780 ataactcaaa tgaagaagac aaaattcagc taaacgtcct ttcggtcata tatctgaatg 840 aattattcaa aaaaaaaaaa aataataacg ttaaactttt gttatctcga aaataatttt 900 aaaccacgta tgttttagtt ataagatagt tttatatcat ttgcgacagg ctgccsaaaa 960 attcttcaat cttcagtatt tgccctgatt gtaagtttca actatctggt accggatata 1020 ttcagaaatc ctttggagga ctgccacgtg ggcatgattc gttgtgttca gccaattgag 1080 attttgtttt tttttgtatt cagtcaattg ctgtaggtaa cgagattttt gtttttggaa 1140 tcactcgctt gaagattaca aatcatatgw atgtcatggt taaaaaatta aacatggctt 1200 ctttatttgg agacaggatt gatttcaatt ttacgatttt tgcatttcat tattttgctt 1260 caatccacat tgcagaattt ttattcccac ctgtgtttta tttaatattt tcaatcactg 1320 tatttattga aaactgagat gacagaatat tgttgatata attttcaaca agttttcagt 1380 acaattaaga agaacagcca atgatataaa gaagggtgaa tctcctatga agcatataag 1440 tccttgtgta aattagttct aaagaacagc ttaatactca aagaagggaa aatcttcaat 1500 agaatgttga aaatagaatt gtctttaatg ggtgttatag aatagtctga aagatgatca 1560 ttacagcatt gaaaaatagt aattttgcga aaattatgaa aagctcgaaa gcctatgata 1620 ttttccccta taatttgcta aacattttat cacataaaaa ggtgaaaatc ttacgaaaca 1680 cttacttaca cgcctaagta ttgctatgcg aagcttagta atgaagaaat attattttct 1740 ggggaatggt ctttctgggg aatggctttc tggggaatgg tccattctgg caaatggttt 1800 tctggcaaat ggtccattct ggcaaattga ttctggcaaa cggctttctg gcaaatggtt 1860 ttctggcaaa tgtcatacaa cc 1882 // ID Gypsy-36_OD-LTR repbase; DNA; INV; 186 BP. XX AC CABV01002103; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_OD_; KW Gypsy-36_OD-I; Gypsy-36_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002103; Positions 3126 3311. XX SQ Sequence 186 BP; 62 A; 51 C; 37 G; 36 T; 0 other; tggcgctccc aagcccgtgg cgctcaatag gccacgtcat aaaagcgcga gcctgcgcag 60 aagacggcag agctgagaca ctctcgaatg atgacaccca aaagacaact ttaccctttt 120 tgtactcaat aatcatacaa taaacaatat acaactggaa caccattgag gattacttag 180 ccagca 186 // ID Gypsy-64_CQ-I repbase; DNA; INV; 5595 BP. XX AC AAWU01018564; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-64_CQ_; KW Gypsy-64_CQ-LTR; Gypsy-64_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5595 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 507-507 (2011). XX DR GenBank; AAWU01018564; Positions 10443 4849. XX CC Positions [2827-3363] - Reverse transcriptase CC Positions [4531-4998] - Integrase core CC 'ATCCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 793..1950 FT /product="Gypsy-64_CQ-I_1p" FT /translation="MNQMSASSISVPECKASVEGEDISRLDYEYWEDLLVN FT SLTLAGITDEAMKMVVLKVKAGRKLLEVLKGTESTNEAPDESTHPFANAMF FT RLKKHFESISDVMIHRRKLATMVQTETEGDKAFIQRVATEAQQCGYSEDQR FT FDEILNTVAVRASHHKVRAEAVKWMNRRAKNPKQEGVTLQEFIEKIKEIET FT IRLNEDYVTKRQAELNAAPVNAIRAVSRPSPEIAGPSRYGDDRTVAGTSRS FT SAYKMRSAYGKSAYQKPNNQVYGRMPYQKPYSQGNGRKQFQPSNNRGGRNL FT QILNEPDALDPDRCPRCNSISHSEQNCYASRLTCYTCGRVGHMFRACPFSF FT APGNTRTSPDQNRRAEPVLALMDAKPEEKEPEGEKPDKSVSEV" FT CDS 2200..5493 FT /product="Gypsy-64_CQ-I_2p" FT /translation="MQKQDAEPTSGVALAVEVELFNDGIINAKVSGMVCQF FT LIDSGSQVNTITHTTFEQLRANASYNEGLHNVRHGTDRQLKAYAKSDGIHV FT ICTFEAFLEISDDRPILLEKFYVVKEQRSLLSRATSTRYSVLQLGLKVPVN FT TGWTADDSWMNIGFINLVSEVSVQFPKFNIPPIKIPYDKTMPACRNIFTNI FT PAAMKQAVQRRLEEQVSAGIIERVTDAMDAAFCSSMLVVPKGNDDFRLVID FT LRGPNRYILRSPYSMPTLEKILVQLEGAQWFSTIDLQSAFFHIELEEDSRH FT LTNFMTEFGMYRYIRLPFGLSNAPDVFKEVMERKVLTDCEGVLNYLDDILV FT FGRSKEEHDKNLAAVMDRLKDHNVRINTSKCVFGSKEVKFLGFELTDKGWR FT IEHGKLSAIKNFRRPSSCAEVKSFLGLITFVDKFLVHRATKTEFLRALANA FT DTFYWTEAEETEFESLKNHTVHAIKVLGYYSCSDFTELYVDASAVGLGAVL FT VQHDSDGTPRIIACASKSLTNAEANYPQSHREALAVVWGVERFSFYLIGRP FT FVIRSDASSNEFIFHTNYRIGKRAFSRAEGYALRLLPYKFRIERVKGDENV FT ADALSRLIHVTQPAVPFEEDCSNHVLCSLDAGNNNKRIINFHNSCFMTRYF FT DIGCMEITMSEIESSSERDQEMELLRAALDSGSWPAELRGYEAQRNQIHKL FT GALLCKDDRIILPKDLRQRAMESAHGGHIGVVAMKRVLRDFFWWPQMSKQT FT EKFVKSCETCVLLSKRNPPVPLASRELPEGPWEILQIDFLSVPGFGSGEFL FT MVIDTYSRFLSVTEMQRTNADATNTALIQIFKRWGLPRIIQSDNGPPFQGS FT VFCEFWEAKNVKVRKSIPLSPQSNGMIERYNQPVIKTLSAAKLEGRSWRLA FT LETFVHHHNTLVPHSRLNVTPFELLVGWKFRGTFPSLWSATKDLDRIDVRE FT ADAEQKLTSSKLADRVRRAMPSDVRVGDSVFLKQQKKSKSDATFGSEKYTV FT VAQDGSKVVICSGNGVQYTRSVNDLKKVPVAQDQANGSLVNSSPVVDHDSD FT DEQIDVATRDENSGSRIGLRNRGLVRKPSRYNDDYVYQIFE" XX SQ Sequence 5595 BP; 1656 A; 1245 C; 1421 G; 1273 T; 0 other; atggcgcagg agtagcctcc aagtaagtat agtgtggtaa aaaaaaaaat gaagaaaagt 60 gatagcgatt ttaatagccg gggcgatatc agtgattgtg gaaaaaaaat aaattaaaat 120 gtgttaaatc tacacgtgtg attcccagca aaaaagaatg aaaagctgga tgaaaatgaa 180 attcccagca aaaaaggatg aaaagctgga tggaaataag gctttcaacc aaagaccagc 240 aaaaaaagaa taaaaagctg gcaaaagtaa aacgaggaaa tttcccagta aaaaagtatg 300 aaaagctgga tgataatgga ttcctaaagc ctaacacgtg gaaaaatgga cagtaaaagc 360 cggcttagtt gacgcgtaaa aaaaaggaaa aaaaaggaaa atctaaagta aatgaatgat 420 tttgcaacta ttgagaaccg ctgatcgcca atcactgtta tcgccttaaa taattcctat 480 gtcaaaaaaa aaggattgta actaaaacgg tacctatcgt tgaaactgaa ttgcaggaag 540 aaaagcagca aaattagtgc tttcgaaaag gagttgaagg ctgcaaactc gagcaacgct 600 gagctcgaaa aggaactgga aagggcccaa gaaaccattc gcgcgctcca gtccagcctg 660 tcctcggccc ggaacgaatc ggaagaggtg aacagcgagg agttcgacat acaacccgga 720 cgatgcagca caaaacgctt gtccaggatc aacaacagtc tgccgtgcga tacttcccag 780 tttgttactt cgatgaacca aatgtcagcc tcatcgatca gcgtgccgga gtgcaaagct 840 agtgtggaag gtgaggacat ttcacggcta gattatgagt actgggaaga tctgctggtc 900 aattcgctca cgctcgctgg catcacggac gaagcaatga aaatggtggt gctcaaggtg 960 aaggccggcc ggaaacttct ggaggtcctt aaagggaccg aatcgacaaa cgaggcgccg 1020 gatgaaagca cgcatccctt tgcgaatgct atgtttcgcc tcaagaagca cttcgagtcc 1080 atttcggacg tgatgattca tcgcagaaaa ctcgctacaa tggtccaaac cgagacggaa 1140 ggtgataaag ctttcatcca acgagtcgcc acggaggcgc agcagtgcgg atattccgag 1200 gaccagcgat tcgacgagat tttgaacacg gtggccgttc gagcatcgca tcacaaggtg 1260 cgggcagaag cagttaagtg gatgaaccgt agagcaaaga acccgaagca ggaaggagtg 1320 actctccagg agttcatcga gaagatcaag gaaatcgaga caatccgtct caatgaagac 1380 tacgtgacca agaggcaagc ggaacttaat gcagcaccag tgaatgcgat tcgtgcagtc 1440 tcacgcccat caccggagat cgcgggacca tcgcgatatg gtgacgatcg aactgtcgcc 1500 ggcacgtctc gttccagcgc atacaagatg cgttcggcat acggcaagtc ggcgtaccaa 1560 aagccgaaca atcaagtcta tggaaggatg ccgtaccaaa aaccgtacag ccaaggcaac 1620 gggaggaagc agttccagcc gtccaacaat cgcggtggca ggaaccttca gatactgaac 1680 gaaccggacg cgctggatcc cgatcgttgt ccgcgctgca acagcatatc tcattcggag 1740 caaaactgct atgcctctcg cctgacctgt tacacatgcg gccgcgtggg ccacatgttc 1800 agagcgtgtc cgttttcatt tgcaccggga aacacacgca cttcgccaga ccagaaccgg 1860 cgcgctgaac cggtgctggc gctaatggac gctaaaccgg aggagaagga gcctgagggt 1920 gaaaagcctg acaagtctgt aagtgaagtt tgatgtttac ggtgagtctg agagtggatt 1980 taatcgaaca atgttgaaat aatcttgtta tgttcgaatg tggatttatt gttattgaat 2040 taattctcaa aataaagaaa atgaatcagt ttgaagtaac gaggttttat ttactttcaa 2100 catctatgca ttcactcacc gaatcagact cattgataac aatgctagct gatgattgaa 2160 ataatttacg atatattcta aacttgatag gtcactggaa tgcagaaaca agacgctgag 2220 cctacctctg gcgttgcatt agctgtagag gttgaacttt tcaatgacgg aatcatcaat 2280 gccaaagtct ccggaatggt ttgtcagttt ctgattgatt cgggttcaca agtgaacacc 2340 ataactcaca cgacatttga acaacttcgc gccaacgcat cgtataacga aggattacac 2400 aacgtccggc atggcaccga tcgtcaactg aaggcgtacg ctaagtctga cggtattcac 2460 gtgatatgca ctttcgaagc tttcctggaa atttctgacg accggccaat actgctcgag 2520 aaattttatg tggtcaagga acagcgatcg cttctcagca gagcaacgtc gacccgttac 2580 agtgtgctcc aactcggctt gaaagttccc gtgaatactg gctggacggc ggacgacagc 2640 tggatgaata tcggattcat caacctagtt tcggaagtgt cagtacaatt cccgaaattt 2700 aacatccctc cgatcaagat tccctatgat aaaacgatgc cggcttgtcg taatatattt 2760 acgaacatac ccgccgctat gaagcaagca gttcaacgga gactcgaaga gcaagtatca 2820 gcgggaatca ttgaacgagt gactgacgcg atggacgcgg cattctgctc gtctatgctc 2880 gttgtaccga aaggtaatga cgatttccga ctcgtgatcg acctgcgggg tccaaatcgt 2940 tacatccttc gcagcccgta ctccatgccg accctggaaa agattctggt tcaactggaa 3000 ggcgctcaat ggttcagcac aatagatcta caaagtgcgt tctttcacat tgaattggaa 3060 gaagattctc gacacctgac caacttcatg accgagttcg gtatgtaccg atacatacgc 3120 cttccgtttg gattgagcaa cgcacctgac gtgtttaaag aggttatgga gcggaaagta 3180 ctgaccgatt gcgaaggagt tctgaactat ttggacgaca tactggtgtt cggcaggtca 3240 aaggaagagc acgacaagaa tttagctgcc gtaatggaca ggctgaaaga ccacaacgtg 3300 cggattaata cgtccaagtg tgttttcggg agcaaggaag ttaaatttct gggatttgaa 3360 ctgacggaca aaggatggag aatcgaacac gggaagctca gtgcgatcaa gaatttccga 3420 cgtccatcat catgcgctga agtaaagagc tttcttggtc taatcacgtt cgtggacaaa 3480 ttccttgttc accgagccac gaaaacggaa ttcttgagag ctttggccaa tgcggacaca 3540 ttctactgga cggaagcgga agaaaccgag ttcgagtcgt tgaagaacca cacggtccat 3600 gccatcaagg ttctcggata ttatagctgc tcagatttta ccgagctgta cgttgatgca 3660 tctgccgtcg gactgggcgc agtgctggtt cagcatgact ctgatggaac tccaaggatc 3720 atcgcatgcg cgtccaaatc ccttacaaat gctgaagcaa attaccccca gtcacacaga 3780 gaggcattgg ccgttgtatg gggagtcgaa cggttttcgt tctacctcat cggaagaccc 3840 ttcgtgatac gttcggacgc ctcttcaaat gagttcattt tccacaccaa ctaccgcatc 3900 gggaaacgag ccttcagccg agccgaagga tacgcattgc gactcttacc ctacaagttc 3960 cggattgaac gagtgaaggg cgatgaaaat gtagcggatg ctctctccag actgattcac 4020 gttacgcaac cggctgtacc atttgaagag gactgttcaa accacgttct ttgctcgtta 4080 gacgcaggta ataacaataa acgcataatt aacttccata atagttgctt tatgacacgt 4140 tactttgata taggttgcat ggagattact atgagtgaaa tcgaatctag ctctgaacgc 4200 gatcaagaaa tggagttgct acgtgccgct ttggatagtg gatcgtggcc agcggaacta 4260 cgcggttacg aagcccaacg gaaccagatc cacaagcttg gtgctttgct gtgcaaggac 4320 gaccggatta tcctcccgaa ggacctgaga cagcgagcta tggaatcggc acatggtgga 4380 catatcggag tggtggcgat gaaacgtgtg ctgcgagatt tcttttggtg gccgcagatg 4440 agcaagcaaa ctgaaaagtt tgtcaagtcc tgtgagacct gcgtcctgtt atccaaacga 4500 aaccccccag tgcccttggc atctcgtgaa cttcctgaag gaccgtggga aatattgcag 4560 atagatttcc tatctgtgcc aggattcggc tcgggtgagt tcctaatggt aattgacaca 4620 tattcacgat tcctgtcggt aacggagatg cagcgcacaa atgcagatgc aaccaacacg 4680 gccctgattc aaatatttaa acgctgggga ttgcctcgaa tcatccagag cgacaacggc 4740 cccccttttc agggttcagt gttttgcgaa ttctgggaag ccaaaaacgt taaagtgcgt 4800 aagtcgatac cgctgagccc acaatcgaat gggatgatcg agagatacaa ccagcccgtg 4860 atcaaaacgt tgtcggctgc aaagttggaa ggacgaagct ggcgattggc cctggaaaca 4920 tttgtgcacc atcacaacac gttggtgccg cactccaggc ttaatgtgac accttttgag 4980 ttgctcgtcg gttggaaatt ccgcggtacg ttcccgagtc tatggagtgc gacaaaagac 5040 ctcgatagaa tcgacgtacg agaagctgat gcagagcaaa agctgaccag ttccaaactg 5100 gcagatcggg tacgtcgtgc catgccttcg gatgttcgag ttggcgatag cgttttcctg 5160 aagcagcaga aaaaatctaa atcggatgcc acgtttggtt ctgaaaagta tacagtggtg 5220 gctcaggatg ggtcaaaggt ggtgatttgc agcggaaacg gtgttcagta taccaggagt 5280 gtcaacgact tgaagaaggt tccggtagct caagaccaag ccaacggctc gttggttaat 5340 tctagtccgg ttgttgatca tgattcagac gacgagcaga tcgatgtagc gacacgagac 5400 gaaaattccg gaagccgcat aggtttgcgc aatagagggt tggtgaggaa accatcgagg 5460 tataacgacg attacgttta tcaaatcttc gaataactgc gattattaca catacgtgaa 5520 atgaagtcca ataaatcgta ctaaacatta aacagggttg gtcttatttc ttgaaagagt 5580 acaggaggag aatat 5595 // ID Mariner-3N1_BF repbase; DNA; INV; 963 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-3N1_BF DNA transposon DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner/Pogo; non-autonomous; Pogo-2N1_BF; Mariner-3N1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-963 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-963 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-963 RA Kapitonov V. and Jurka J.; RT "A family of Mariner-3N1_BF non-autonomous DNA transposons from RT the amphioxus genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC It is similar to Mariner-3_BF, and forms a palindrome. XX SQ Sequence 963 BP; 293 A; 188 C; 187 G; 294 T; 1 other; cgaggggtga tcaataagtt ctcggcctca cccagaaata ataagcacaa ctcaaatttt 60 gaggcacaac ttcatagtat gaagcctttt atcaatatcc tccaaatttc aaatcattgc 120 agtcattact ttccaggcat tcgttttttg taaatcagaa ggtaagtggc gagataaaag 180 aagggtgaat tgtgatcaga catgtgtggg tcgagttccc catgccgtaa attaacaaaa 240 gacattgtca aaatgtttga taatgattct cttcctattt caagcaaaaa gaattgggtg 300 tctgacttcc agcagcgcaa gttgacccgc acacctcagg tygcaggtca attgtccacc 360 tttttttttc cttctccctt acctaacgtt actgggtgtc gcctgcaaag taatgatcac 420 aataatttga attttggtac acatttatat aagactttat acttgacatc tatgcttcgg 480 aatctgagtt gtgcttatta tttctcggta aagccgagga ctaattgata accccgcgta 540 gatgtcaagt ataaagtctt atataaatgt gtaccaaaat tcaaattatt gtgatcatta 600 ctttgcaggc gacacccagt aacgttaggt aagggagaac gaaaaaaagt ggacaattga 660 cctgcaacct gaggtgtgcg ggtcaaatag gaagagaatc attatcaaac attttgacaa 720 tgtcttttgt taatttacgg catggggaac tcgacccaca catgtctgat cacaattcac 780 ccatcttttt ctcgccactt accttctaat ttagaaaaaa cgaatgcctg gaaagtaatg 840 actccaatga tttgaaattt ggaggatatt gataaaaggc ttcatactat gaagttgtgc 900 ctcaaaatct gagttgtgct tgttatttct tggtgaggcc gagaacttat tgatcacccc 960 tcg 963 // ID Hovi2 repbase; DNA; INV; 2970 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Hovi2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hovi2. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-2970 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1496..2434 FT /product="Hovi2_1p" FT /translation="MEEVFKEVNELKQMLQTCISLLKHCEMSGKLTNTKSL FT PSSTQSNTILCCIKSVEDNWHLVSEILVDKSETPILIADIKAVVHILLSFE FT QSVQAMAEETMPTLHIIIPHLHKLKKLCSNKPEDIAIATILKSALRSHLDS FT IETVHITKYHKIALFLFPPTNKLLLFEESDKRATIEECKRMMQQFYVETDN FT TRKRIKLDMEYCGDGLFTDFLEPMSTDTKLDIINNEINEYLSRKMSLHEMS FT NVLDWWNANRVLFPLLYKVSCKILGTPASIAPSQRALLQARTLLSEQPNGM FT QCAEDMINEIVFLGNNFKYDC" XX SQ Sequence 2970 BP; 949 A; 601 C; 586 G; 834 T; 0 other; ggaatcgata cattactcag atttaagtac tcgtatcgca ttaatcggct atcgaccgac 60 caccacaacc gcgaaacata ccaagcattt acaggtattt acagtttgtg tctaaataaa 120 agacacttat gttgtgattt attattactt cgagttaaat acatttgcgc tttgcataaa 180 tgcagttacc ccattttcat cgagcaacaa gcacaacaga aacgcatacc gagtatttca 240 aagcaccaaa ccatttgcat tgaattttgt gagtattcga atgtctcatt tggtggtgga 300 aaactctcgg cccactcgtc gccagcattc gtactattct ataaaagagg ccaaacattt 360 tgaacgaatg caagtgaaat ggatctgcta gagattaaaa gcgaaatgag ctatcaagag 420 caggtatata tttatttatt ttatcaacac acattcaatt gattatttat ttgatttaca 480 gaattcggaa tatgaatgca cttcccctct ggacggaact acaagttcca ataattatgg 540 ccatatttca aatttcagga ggaaaatagt aaacggccaa attaaactta gggaaaaacg 600 cggtcgaagt aaggtttgga atatttttgc cctaatgatt gatgaaaatg gcgacaaaat 660 tgaaaatgtt gtggcctgcc gaaactgttt caacgtatac aaatactcgg gctgtacctc 720 gaatctggtc aagcacaaat gctacatcaa ctccgagcca aaagagaatg gtgtccgcag 780 ctaccaagat gtggaagtag acagtgaaac cctaggcaga ctaaacgatg cggccgccca 840 gtggctggtg acaaattgtc gctcctggac catactcgag gactcgggcc tgaagaatat 900 tgcacgcatc tttgtggcaa tcggtgccaa tttcggtgaa aatgtcaacc tagacgctct 960 aatgccaagc ccaacaacca tagcatgcaa tatatccgac ctgtacgaat cccagcgcca 1020 aactattgct gcggagctgc acaaggctaa aggcagcgga tacagcataa cagtgactat 1080 ctggaaggac agctacctca aaaaacctta tgcagccgta accgtgcact atatccagaa 1140 atctgctatg atcagtcgct tgctcgccgt gtccccattg aacagggatt caaggacaag 1200 taatgccaga aactccttaa ataatctacc cagaaaagtt acccaaacca gtttccgcaa 1260 gactaaacta tctaaaatcc tttaaggaga atccttttat ctgaatacgt tttaatattt 1320 ctctgccttt gcaggtaacc agatcagaga acacattaaa aactgcctgc gtgcatttga 1380 aagcgacttg gacgtggagg atccaataat tgttacggac tgcgaagaag ccgctacgcc 1440 tgcctcggat tgtcagcatc atgtgcactg catccgcagt ttgttaaaca atgtcatgga 1500 ggaggttttc aaggaggtta acgagttgaa gcaaatgcta caaacatgca ttagcttgtt 1560 gaagcactgc gaaatgtccg gcaaattaac taatacgaaa tcattgccca gttctacaca 1620 aagtaataca attttgtgct gcatcaagtc tgtggaggac aactggcatt tggtgtccga 1680 aattttggtt gataagtcag aaacgccgat cctcatcgct gatattaaag ctgttgtcca 1740 tattttactc agctttgaac aatctgttca ggccatggca gaagagacca tgcccacgct 1800 gcacatcata atcccccatc tgcataagct gaaaaagctc tgttccaata agcccgaaga 1860 cattgccata gccacgattt tgaagagtgc attgaggagc cacttggatt caattgaaac 1920 agttcacata acgaaatatc ataaaatagc cctctttctt tttccgccca caaataaact 1980 gcttctattt gaggaatccg ataaaagggc cacaattgaa gaatgcaagc gcatgatgca 2040 gcagttttat gttgaaaccg ataatacgcg aaaaagaatc aaattagata tggaatactg 2100 tggagatggc ctttttacgg actttctgga gcccatgtcc actgatacga agcttgatat 2160 tataaataat gaaataaacg agtatctgag cagaaagatg tcgctccacg aaatgtcgaa 2220 tgtcttggat tggtggaatg caaatagagt tctgtttccc ctgctctaca aagtcagctg 2280 caaaatactg ggtacacctg ccagcattgc cccttcccag cgggctcttc tgcaagctcg 2340 aactctgctc agcgagcagc cgaatggaat gcaatgtgcc gaagacatga taaacgaaat 2400 agtattttta ggcaataatt ttaagtatga ttgttaaatt cagatattgt gatttcctta 2460 gcattaacac atttcatagt tcctataatg aactatagca atttcaaatt tttaaagtaa 2520 aatcaaatgg ttgtgcctac tctttgtttg cagaggccat tatagttcta tgacatttcc 2580 gtcacaatag agtgcatata ttttattata agtattaaca gcccagtcga ttttggatca 2640 gaaagctctt caaagatctt tatgaaattt atttgagagt gttgttgagt gaagaaataa 2700 ggtcaatggt tgtattttaa aaattggcag aaaacgattg tatataggtt tattttatta 2760 aatttcacaa ctgcttcaca caatattaaa gtttctgagt gatataactt aactaagtgt 2820 ttaaatagtt tttgaattaa atgtgaggcc cggcaattta atggtgctgc caccttatag 2880 aaaaagtggc cagatgggta ctttgcttca atgtttaatt cttatcgata ctatcggtat 2940 tctcttcaaa atatcaggcg tataaatgta 2970 // ID Gypsy-41_AA-LTR repbase; DNA; INV; 207 BP. XX AC supercont1.338; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_AA_; KW Gypsy-41_AA-I; Gypsy-41_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.338; Positions 310299 310505. XX SQ Sequence 207 BP; 65 A; 35 C; 52 G; 55 T; 0 other; tgtagggttc aaccctgggt ttgattaatg attagattat aatacgatca gagtaggcat 60 agactgggcg aaaaaggggt ttcagttaga aacagtgtac gacagagctt ggcaggaaga 120 ataaaaacgt agtgcgtgga acttgttgaa tagtgaattc agtggctgtc ctgcatcctc 180 caaatctccg aatacccata ttctaca 207 // ID L1-N2_CQ repbase; DNA; INV; 1389 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A HAL1-like non-LTR retrotransposon family from Culex DE quinquefasciatus - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; nonautonomous; KW L1-N2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1389 RA Kojima K.K. and Jurka J.; RT "HAL1-like non-autonomous non-LTR retrotransposons from the RT southern house mosquito."; RL Repbase Reports 11(1), 101-101 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >98% CC identity. CC This family encodes a protein similar to ORF1ps of L1 in CC mosquitoes. Thus it is likely a HAL1-type element. XX FH Key Location/Qualifiers FT CDS 146..1204 FT /product="L1-N2_CQ_1p" FT /translation="MTTLRPRENTFKVDLSVFPKRPSFEEIHSFVHDTLGL FT RIDQVKRLQMNHVQNAAHVKCDTLKTAQDAVDQHNERHEIEVNKVKYKVRL FT QMDDSSVEVKIHDLSENVRNEDIVAFLRQYGDVHCIKDLVWGDSFAFKGVS FT SGIRVAKMTLKKDIKSFVTIQLEQTLITYRGQPRTCRHCGRTSHPGMTCTD FT NKKLEGQKNALDERLKSAQSTSSSYAIVVDKGTDKPNDVPTPSAPQTPLCP FT ISSRVTDEVEQFFTLAQTESSSPTLVEDVQMEAEDDPIVKDQCTQQTPEDG FT QPEDNVTDSVSMFKVPALPLSTTTVQSGISETDESSSDGSGFQGVKPKKPR FT GRPKKLKSKT" XX SQ Sequence 1389 BP; 415 A; 336 C; 309 G; 329 T; 0 other; tagttgggcc gaagcttttg cccagaccag acgtagtcaa ctcgagctct gtgacaagtg 60 tttttcccca gctttgtgcc tttgtgcaca ttgaagaatc gttccctatt tcattctcgc 120 agccagcaaa gtatttgttg cgacaatgac caccttgcga ccacgtgaaa acaccttcaa 180 ggtggatctt agtgtgttcc ccaagcgtcc aagttttgag gaaatccatt cttttgtcca 240 cgacacattg ggactacgaa ttgatcaagt gaaacgcctg cagatgaacc acgtgcaaaa 300 cgcggcacac gtgaaatgtg acacgctcaa gactgcccag gacgctgttg accagcataa 360 cgaacgccac gaaattgagg tgaacaaggt gaagtacaag gtacggctcc agatggatga 420 ttcatctgtt gaggttaaga ttcatgacct atcggagaat gtgcgcaacg aggacatcgt 480 tgcattcctg aggcaatacg gagatgtgca ttgtatcaag gatttagtgt ggggggacag 540 ctttgccttc aaaggtgtct cctctggcat acgtgttgca aagatgacgc tcaagaaaga 600 catcaaatcg tttgtgacta tccagctaga gcaaactctc atcacatacc gaggtcaacc 660 ccggacttgc cgccattgcg gccgtacttc gcaccctgga atgacgtgca cagacaataa 720 gaagctcgag ggacaaaaga acgctctgga tgagcgtctt aagtctgcgc agtctacttc 780 gtccagctac gcgattgtgg tagacaaagg aaccgacaaa ccaaacgatg tacctacacc 840 aagtgcgcca cagacgccgc tctgtccaat ttcgagcaga gtaactgatg aagtcgagca 900 gttctttaca ctcgcacaaa ccgaatcctc ctcaccaaca ctggtagaag acgttcaaat 960 ggaagccgaa gatgatccga tcgtgaaaga ccagtgtacc cagcagaccc cggaagatgg 1020 acagcccgaa gacaacgtta ctgattcggt gtcgatgttt aaagtaccag cccttcccct 1080 ctccaccacc accgtgcaaa gtggtatctc cgagaccgac gaatcgtcct ctgacggctc 1140 ggggtttcag ggagtaaaac cgaagaagcc acgtggtcga ccgaagaaac taaagtcaaa 1200 gacataaact tctcttctaa aaacaactca gttcaaaatt ttaaccaatt ttgagaacga 1260 tgattcaaaa tcttgtaaat tgttttctag aataaattgt gcttcaacat tagatacttt 1320 cctatcatgt taaataatct atatgaaact gtataattat gtgctaaata aacgatttta 1380 caaaaaaaa 1389 // ID CR1-93_AAe repbase; DNA; INV; 5188 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-93_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5188 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1181-1181 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 281..1696 FT /product="CR1-93_AAe_1p" FT /translation="MTSVLCDVCVQSIVAETDRVYCFGGCNKILHVRCSDL FT TRAGANALKDNVGLKYMCFDCRKAQICLNKLQQKCSELVEKFDALNCKFND FT LTSNVESGIKTQLQAFENSISRIESTIHHQLLNSNSFLSAANINCISDEDQ FT EKESYAAVCRNNINTKRKKSTSEKPTDSNSAQTQHEIGGTLRSGKRRLPVP FT RIVVQSDAASCASTSHSNQPISPLVKTKRIERFEKTVLLRPKCSQQYEVTK FT SDVCDKLDPIAFAVKEVHFRDSGEVTIQCDSKEHASKLVSAATDALSDKYT FT VEVMKPLNPRVKLIGLTSKFDESVLIQKIKTQNNLPSTFKAWLIRIAEPQN FT KNLNKFNVVLELDAPSFDAIMKLQRVYIGWERCRVVENINVKRCYNCSEYG FT HLASSCEKPVCCPRCAECHEISECVSDFVKCVNCDKMNSLRTAKSDHLADI FT HHSSWSQKCPIYIKRFNKARQNIDYSI" FT CDS 1700..5113 FT /product="CR1-93_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSTLPSPLAVRTADRVHQSSVDDPVLSGKYKIGICPA FT EVGTTATLCHRKKSILQNLTLVQSNTSSSDVAQLSTDQPSPLSLCAPEQIN FT DRVFDQSSVDSPVLSGNYRTGICLAEARKTATLCHSENPILQTSLPVQTYT FT LDDAQPPTDKTALQSRSPIGNDTEGKFTTNNESHVEGTHGSLNIFYQNVRG FT LRTKVDEFFLAATDSDHDVIVITETWLNEEFISQLLFGDRFTVYRKDRNAE FT LTGKKRGGGVLIAVSNHLKSSSVALTMHMELEQLWVRVTTASSNAYIGVIY FT ISPEQAMDSSNIEKHIECVSHVTDRSGPRDSILLLGDFNQPGLVWKTDFND FT YFYPDPSESTFSRSSSALIDGMSLLNMKQLNPVVNHMNRKLDLVYCNEQSA FT RQCNVTEAVEPLVEIDAFHPALLVSFKCPRIVSFVESAVVPQLNFRKTDFT FT ALNHALLETDWTALHAADNVNEAVRILTNTLTELFRVYVPSHRAARSPPWS FT NSGLRILKRERARYLRLYSANRNPATKRQFQMASNRYKNYNRLLYSLYVQR FT KQNELKNNPKKFWSFVNEKRKENGLPAIMSYGTDSGASPAEICRLFACHFS FT SVFKPGVTSSQHITAALRNVPEGILNMRNITFSEEDIGSAINKLKSSYQPG FT PDGIPPIVFKMCGTALAGPLTAICNLSMAQSAFPDEWKESTLFPVYKKGDR FT GNVSNYRGITSLCTGSKLLEILVSDMLFHEIKNHISCDQHGFYAGRSTVTN FT LTEFTSFCIQNMESGAQIDTIYTDLKAAFDRVDHSLLLAKIQRMGAASNFV FT NWLHSYLTNRRLSVKIGTSQSSYFSNSSGVPQGSNLGPLLFSMFFNDVCAV FT LPRGCRLMYADDLKIYLVVRSSENCQELQNYVDSFSDWCEVNELTISTNKC FT SIISFTRRKNPIIWNYNIRGEQIDRVAVIKDLGVHLDTKLSFREHYSITVA FT KANRNLGFIFRISSEFRDPHCLRALYYSLVRSVLEYASVVWSPYTGIWTSR FT IEAVQSRFIRYALRFLPWRDPATLPPYQNRCRLLGMDTLQRRRQLERVLFV FT QKLLIGSIDAPNILSQININVVSRSLRHTDFLRLDLRRTDYGQNEPIRVMC FT SLFNRVYNLFDFSISTHTFRYRLLVSNAFV" XX SQ Sequence 5188 BP; 1498 A; 1109 C; 1105 G; 1476 T; 0 other; tgctttggta gcgtcggcga ttcgtgtgta tcgtaaagtt acggaatgtg aacgcacttt 60 taatcatgtt atttttgtcg tataaactgt aaagtgatat ctgtgtattc agtgtttgac 120 attgtctcta tcgagatatt aatcgaaacc gataaaccta agaagatacg attttatcca 180 aaaatataat ttcttgtcga cggagattgc aaagcaaaca acagacagtc tattgaattc 240 atcatccagc ccgctacagg gcgaccgaag accgttcgag atgacgtcgg tgctgtgtga 300 tgtgtgtgta cagagcattg ttgccgagac cgatcgtgtt tactgctttg gtggatgcaa 360 caaaattctg catgtacgat gttctgatct gactagagct ggtgccaatg cactgaaaga 420 caacgtaggt ctaaagtaca tgtgctttga ctgcaggaaa gcacaaatat gcctcaacaa 480 actgcagcaa aaatgttcgg aattggttga aaaattcgac gcactgaatt gtaagttcaa 540 cgatttgacg tcgaatgttg aaagtggaat caaaacgcaa cttcaggcat ttgaaaatag 600 catatccaga atcgagtcaa caattcatca tcagctgcta aacagcaaca gttttctgag 660 cgccgctaat attaactgta ttagcgatga ggatcaagaa aaagagtctt atgctgccgt 720 atgtcgtaac aatatcaata cgaaaaggaa aaagtccaca tcagagaaac ctactgatag 780 taactcagct caaactcagc atgagattgg cggaacactg cgatctggta aacgcaggtt 840 gcctgtacca agaattgttg tccaaagtga tgcagcttct tgtgcctcta caagccattc 900 aaatcagcca atatcaccac tggttaaaac aaaacgcatt gagagatttg aaaaaactgt 960 tctgttgcgt cccaagtgta gtcagcaata tgaggtcact aaatcggacg tgtgtgataa 1020 actagatccc attgcttttg cggttaagga agtgcatttc cgagattccg gtgaggttac 1080 cattcaatgt gactcgaaag aacatgcatc gaaacttgtt tctgcagcca ctgatgcact 1140 gtcggataaa tacacagttg aagttatgaa accactgaac ccaagagtga agctgattgg 1200 tctaactagc aaattcgacg aaagtgttct aatccagaaa atcaaaactc agaacaatct 1260 tccatctacg ttcaaagctt ggttaatacg cattgcagaa ccacaaaata agaatttaaa 1320 taaattcaat gttgttcttg aacttgacgc tccatctttc gatgcaatta tgaaactcca 1380 acgtgtctat atcggatggg aaagatgtcg tgtagttgag aatatcaacg ttaaacgctg 1440 ctacaactgt tctgaatatg gccatttggc atcttcgtgt gaaaagcctg tctgctgtcc 1500 aaggtgcgcg gaatgtcatg aaataagcga gtgtgtctca gattttgtga aatgcgtcaa 1560 ctgcgacaag atgaatagcc tgcgaactgc gaagtcggat catctggctg atatccatca 1620 ttcctcctgg agtcagaaat gccctattta tatcaagcgt ttcaacaagg cacgtcaaaa 1680 tatagactac tccatctagc aatcaacatt accttccccg ctagcagtgc ggactgcgga 1740 ccgtgttcac caatcttctg tggatgatcc tgttctttca ggtaaatata agattggtat 1800 atgtcccgcc gaagtgggga cgaccgctac actttgtcat aggaaaaaat cgatcttgca 1860 gaacttgact ctagtgcaga gtaatacgtc atcttccgat gtggctcaac tttcgactga 1920 tcaaccatcg ccactctcat tgtgtgctcc tgagcaaatc aacgatcgtg ttttcgacca 1980 gtcttcggtg gattctcctg ttctctcagg taattatagg actggtatat gtctcgccga 2040 agcgaggaag actgctacac tgtgtcatag cgagaacccc atcttacaga cttcactgcc 2100 agtgcaaact tatacgctcg atgatgctca accacccaca gacaaaactg cgttgcaatc 2160 caggagcccg attggaaacg atacagaggg aaagtttacg acgaataatg agtcccacgt 2220 tgagggtact cacggcagtc tcaacatctt ctaccagaac gtgagaggtt tgcgcacgaa 2280 ggtggatgaa ttctttcttg ctgcaacgga ttcggatcac gacgtgattg ttatcaccga 2340 aacctggttg aacgaagagt tcatctcaca acttttgttt ggcgacagat tcactgtata 2400 ccgaaaggat cgtaacgctg agctaaccgg caaaaaacgt ggtggtggtg ttcttatcgc 2460 cgtttcgaac catctgaagt ctagctctgt agctctcacc atgcatatgg aactggaaca 2520 gctatgggtt cgcgttacta ctgctagcag taatgcttac attggtgtaa tatacatatc 2580 ccctgaacaa gcaatggatt ccagcaacat tgagaaacac atagaatgtg tcagtcatgt 2640 taccgatcgt tcggggccac gtgactccat tctgttattg ggtgatttta accagcctgg 2700 tctggtttgg aaaactgatt tcaacgatta tttttacccc gacccaagtg aatcaacttt 2760 cagtagatct agctcggcac tcattgatgg tatgtcgcta ctcaatatga agcaactaaa 2820 tccagttgtg aatcacatga atcgtaaact ggaccttgtt tactgtaacg aacaatccgc 2880 taggcaatgc aatgttacag aagcagttga gccgctggtt gagattgatg cattccaccc 2940 ggcattattg gtttccttta aatgtccgag aatcgtgtct ttcgtagaat ctgctgtagt 3000 acctcagctc aattttcgca aaactgattt tactgcgcta aatcatgctc ttttggaaac 3060 cgattggaca gcactgcacg ctgcagacaa tgtcaacgaa gctgttcgaa ttctgactaa 3120 tactctcact gagttgttcc gtgtgtatgt cccttcacat cgagccgctc gtagccctcc 3180 atggtcgaat tctgggctaa gaattctgaa aagagagcgg gccaggtatc tgcgtctcta 3240 ctctgcaaat cgaaatccag ccaccaaacg acagtttcag atggccagca acagatataa 3300 gaattacaat cgcttgcttt attcgctgta tgtacagaga aagcaaaatg aacttaaaaa 3360 taatcctaaa aagttttggt cgttcgtaaa tgaaaaacgt aaagagaatg gacttccggc 3420 cattatgtcc tacggtactg actcaggagc ttctccggct gagatatgca gactctttgc 3480 atgccatttt tcaagcgttt tcaaaccagg cgtgacatcg tcgcaacaca ttactgccgc 3540 tttgcggaat gttccggagg gtatactcaa tatgcgaaac attacgtttt ctgaagaaga 3600 tatcggtagc gccattaata aacttaagtc ttcgtatcag ccaggcccgg atggtatccc 3660 gccaattgtg ttcaaaatgt gtggaactgc acttgctggt ccattgacag caatttgtaa 3720 tttatcaatg gcacaatcag ctttcccaga tgaatggaaa gaatcgactc tgtttcctgt 3780 gtacaaaaag ggggaccgtg gaaacgtatc taattaccgt ggcattactt cattatgcac 3840 aggctctaaa ttgctagaaa tccttgtgag cgatatgcta ttccacgaaa ttaaaaatca 3900 catctcgtgt gatcagcatg gattttatgc tggcaggtca acggtcacaa acctaactga 3960 atttacttca ttttgcatac aaaacatgga gagtggagct cagattgata ccatctacac 4020 agacttgaaa gctgcatttg accgtgtaga ccactcgttg ctgctcgcaa agatacaacg 4080 gatgggtgcc gcatcaaact tcgtcaactg gctgcattca tatttaacga accgtcgtct 4140 gtctgttaaa ataggaacat ctcaatccag ctatttttcc aacagttctg gtgttcccca 4200 ggggagcaac ctcggtccac tgctattttc tatgtttttc aacgacgttt gtgcagttct 4260 accgcgtgga tgccggctca tgtatgccga tgatttgaag atatatctcg tcgttagatc 4320 atctgaaaac tgccaggagc ttcaaaatta cgttgatagt ttttcagatt ggtgtgaagt 4380 caacgaactt acaatcagca cgaataagtg ttcgataata tctttcactc gcagaaaaaa 4440 tccaattatc tggaactaca atattcgcgg agagcaaata gatcgagtgg ctgttatcaa 4500 ggaccttggt gtacatttgg acacgaaact gtcttttaga gagcattact ccatcacagt 4560 tgccaaggca aaccgaaatc tgggttttat attcagaatc tccagtgaat tcagagaccc 4620 acactgtctc cgtgcactgt attattctct agtgcgctct gtgctcgagt acgcttctgt 4680 ggtttggagc ccgtacaccg gtatttggac atctaggata gaagccgttc agtctcgatt 4740 catccgttat gcgttgaggt ttcttccatg gcgtgatccc gctactcttc cgccgtatca 4800 gaaccgatgt cgactccttg gaatggacac attacagagg cggcgacaac ttgagagggt 4860 attgtttgtg caaaaactgc taataggtag tattgatgca cctaatattc tttcgcaaat 4920 caatatcaat gtagtttctc gtagcttaag acacactgac tttcttagat tagatttaag 4980 acgtactgat tatggtcaaa atgagccaat tcgtgtaatg tgttctttgt tcaatcgtgt 5040 gtacaatctg tttgacttct ctattagtac tcatactttt agatatcgac tacttgtttc 5100 caatgcgttt gtttagttta agttttattc atgtagacaa tgatgttaga tgaatgtagc 5160 aaataaataa ataaataaat aaataaat 5188 // ID Gypsy-35_OD-I repbase; DNA; INV; 10137 BP. XX AC CABV01003631; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_OD_; KW Gypsy-35_OD-LTR; Gypsy-35_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-10137 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003631; Positions 13646 23782. XX CC Positions [2843-3439] - Reverse transcriptase CC Positions [4601-5089] - Integrase core CC 'ATAAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 55..1743 FT /product="Gypsy-35_OD-I_1p" FT /translation="MNNVNQFLNKSLTHTCAITTAIRQQLLFLCLFYQDDS FT TRSNYERTQTFPTKEFLLERARDLNYNFISFKFATYAQRYFFLPIELKEST FT YRNEVVAFGEILKKFKTQEEHRSKRSIVDVEQLLQFSPTLLTPTINTGSQQ FT IRSQVIAHNTPSNTFLEIKIEMDENDPRESRERRSSLSSVLAYTESENDTI FT YDAAAEELIQNLSTVLNSSMKDDEEMQREEGQQDKPRPLAGRRIFKEEQKT FT VRINAGTAATDRPDDAKPWKNHRIPRYEESGLSVRDWATKVIFMIDLGRTV FT KMTDNEKIQLILENIPSKSFGAIIDAFQSEAKQDFNSLIDVITEELQIDEG FT EAGMTLSTLRFVEDKDKNMKKFFEKIKKLVKIKYPQLDNEGVLTTAMEHFE FT RLLPNYVTNSESWGLDTYDHTDPGKRIALANRIFNLHRTRKSINALIDRNN FT SRKSNKNDKKQSYCKFCKINGHDRSECRKCPTCSFCQIKGHTEDECFKKKK FT AGQTKSNNNYSGKTQKGQTFGNRGASGKQNKTGECFKCGSKDHWANKCPKR FT INTVNVQESHPFIQME" FT CDS 1781..5626 FT /product="Gypsy-35_OD-I_3p" FT /translation="MLPVITCTIVQYEEEVELTHELTNTLIDTGAEASLLS FT RDNLPSNFKIEDSPTTKLANAVESNFAESSSCVRCNIRMPNSDIEITNVAL FT LILNDTSSMAYSAIIGMDVLRHIQLRIGKNDNIVQLNKIFRLNKNMNEVQP FT LDFSIQPTRNCLINTKDIYLYPGEKLRVSMVKIWPSDNDISSTEIRTVPKL FT EDNKIMVEQTFFKKNWNHVNISNNSEKDVFITKNSMIASFIQLSNDQKLLN FT TLITVNDLTAAEKIVHETDLKKWIQRRNELTAKTCIDENITKKVSEAPLEH FT RATLRKIIEKYKWSLARSPSDAGMSQKYLADMQLNGDKATFTPPYPVKTDL FT IPKIDEKMNELEASGIVEECCSAYNSPVLFILKPNGSLRTVNNYSSGDNSI FT NSRLVMPRYPTINVRVLLQMVANHITSLKNKFPKEKIVFGNFDLANAFYCL FT ALRDTSRSYTAFMYSRRQLRYARMAQGLSSSPSIFAAFASKCFRSISSLEK FT GYYVQNYQDDLILISTMDRITMAIDAVFERVKENNLIVRIEKCSFYKPSIN FT FLGYTVSEFGIQVPERRVKTLLDMKFPETFRKAQQYQGAFNYYLRQTPDLS FT ALMSPLSKEIAKGKKYVLSDEIISNLKVLREKIRDGIGTAHLEYNSKNPDR FT KIFIAADTSLYCTGGVIGNVTVTNGNLDDIKIAAYCSRVLTEQESLLASRA FT RELIGIQATIRAFKDMIPSFEEVYIFTDHKSLQNIVHSPGLRTSGSTRCRS FT AFADILEIPLSRIFFVPSTSDIIKCVDSLSRINIAETEISLETFNPKVYRK FT EEKLEINTYKLRKRVPKVDYYTVIREQLASEKFSKIHEKLDNDRMATINNE FT IYVKRNNALFKKTKNNTELLIIPKNLGRNILEVLHEQSAHLGAKNLIRIIK FT NEEVWIESKTKTAHDVCRGCILCKLIYKSTEKKPEDMKIRPAFVPYTMVFT FT DIIEIRTDSSVSFNILTFQDQFSRKVTYRIVNNKEARTVSEKLAELIAEVG FT GQGQMSLCSDNGGEFTGDSTENMLNSLNVHHCYISPTNSRANLCERFHKEL FT RRILKTTTFNAKNAKHKIDLSVSIYNNRPHQALGYLTPNQALSNIDPPKYF FT CVSSPKETQEEPAYEDHLTDIRHMQGTIAKRHLETFLTHEPVEQDKYSLND FT IVVLRESTIVGHHRINEGPFIIIKVKPNNNVELQSLLTGKRLRRNIRFLSK FT IHMNDEDKEKFITNNSIVFNRKTLEIGNQNVESAKSILDLTFGTPHSDQNQ FT QKKEEHTTIIEEKRYNLRKRK" FT CDS 5629..8538 FT /product="Gypsy-35_OD-I_2p" FT /translation="MQFALLLLSRCLLATTQNASVPLENGVLMYRNGENAF FT FTENESRQNLTFSFPAPNIENKLVTTCSRSNTNITEMTREYANLLIEDWLD FT IRIDHPFYERNYTVHGDKIILHQNAEIARILLIRIQQPNYKNLEWSGAAQS FT TFHFTRCERNRNKHVILVQAETNFNKILFRVPSSSQLLGIIWKLKSHSYTP FT EVTTSEKSKIIQFETDLGRVSKIEIYFTGETQEIEHGVYASISCLSLEIEE FT ILVVKEEKPSEFAKFVQQDENLYSKDEETNVANPSIMGSFSSYGTTLFLEE FT TQEKALMAANTIWKEVYGNREFKMKSSNSTVHDYIQTVLQHKNTRHKIEMP FT DDFCDNDNGLQLQILIETSLPIIDVGIRIDNSEPLEENKDLAIKQLIDIYD FT IIPYVIGLRATDIQSLLRGEIPSDTYRVKCQNSLSDYIEFFCDVKNEIFSN FT GAERFFFMVDIDLSEWRCNKLKRFLSQKGTTTFDVKYIKRPLGRKRRQVAI FT PLVISGITSYVTHVIDKHELNEQKEQLEKQIQFSNVETAKLADLTNKDLSN FT LEINFDNLKRESEKQVNKICQEVGQVTDRVFVNQIKTEIESYKSTILKFLV FT ETNSRARHSSFMTSAIHICRSINRNLDEAACYSYMNLQRMEVTGLTPSIDT FT YGRTTIKVKVTYSIANLRSIGPITRTLAIGVPIGRKGKTYFYEFTKIPRKI FT TAISNEPIVVDKCTESPVDGSTFCNFENIYNKEFDGNCVKSVVSRNSLACP FT KEILLSSEQCITRVTKDYILVSSFVDFYSIEQEGYIGSRKTLSRGPNVTLL FT KRPEDDTMYGCGTKTSLIKGRTSSEPEMISLENGKSSEKYIHDWLSTGEFG FT HIASYTNITALHNKEDYKHISELLNAEKVPAALAPMYERLKPYFPTIVICS FT TIIMIVGLLICFIRRGYITSSLLRKLWKKRKIQKRRKAEKIAMEQIAEDTK FT TRMKRKRRDFHQFNL" XX SQ Sequence 10137 BP; 3739 A; 2184 C; 1826 G; 2388 T; 0 other; actggtgact gattaaaaaa gaagaattca cgccaaggag agggttactt ttatatgaat 60 aatgttaatc aatttctaaa caagtcatta acacacactt gcgctattac aacagcaata 120 agacagcaac tcctctttct ttgcctattc tatcaggacg atagtacaag aagcaactac 180 gagcgaacac aaacgtttcc aacaaaagaa tttctcctag aaagagcgag agatctcaac 240 tacaatttta tttctttcaa attcgcaaca tacgctcaac gttacttctt tcttccaatt 300 gaattgaaag aaagtacata cagaaacgaa gtcgtcgcgt ttggcgagat cttgaagaaa 360 tttaaaacac aagaggagca cagaagcaaa agaagcatcg tcgacgtaga acaactactg 420 caatttagtc caaccttgct cacacctaca atcaacacag gatcgcaaca aatccgttca 480 caagttatag cgcacaacac tccaagcaac acctttcttg aaatcaaaat cgaaatggac 540 gaaaacgacc caagagaaag tcgagaaaga cgatcaagcc tttcttcagt attagcatac 600 acagaaagtg aaaacgatac aatttacgac gccgcggcgg aagaacttat ccaaaacctt 660 tcaacggtct taaattcttc catgaaagat gacgaagaaa tgcaacgaga agaaggacaa 720 caggacaaac cacggccatt ggcaggaaga agaatattca aagaagaaca gaagacagtt 780 cgtataaacg ctggaacagc agcgacagat cgacctgacg acgcaaagcc atggaaaaat 840 cacagaattc cacgatacga agaatctgga ttatcagtta gagactgggc aacaaaggtc 900 atattcatga ttgacctcgg tagaacagtt aaaatgacag ataacgagaa aatacagttg 960 atcctcgaaa atattccgtc aaaaagcttt ggtgcaataa tcgacgcttt ccagtcagaa 1020 gcgaaacaag atttcaactc gcttatcgac gtcatcacag aagaactaca aattgacgaa 1080 ggagaagccg gcatgacatt gtccactcta cgatttgttg aagataagga caaaaacatg 1140 aaaaagtttt tcgaaaaaat caagaaactc gtcaaaatca aatacccaca attggataac 1200 gaaggagtcc tcactaccgc catggagcat ttcgagcgac ttttgccaaa ttacgtcaca 1260 aactcggaaa gctggggact cgacacctac gatcatacag atccgggaaa acgcattgcg 1320 ctcgcgaacc ggatcttcaa cctgcatcgc acccggaaaa gtataaacgc gcttatagac 1380 agaaacaaca gcagaaagtc aaacaaaaat gacaaaaagc aaagctactg caaattttgt 1440 aagattaatg gacacgatcg atcagaatgt cgaaaatgcc caacttgcag cttttgtcaa 1500 atcaaaggcc acaccgaaga tgagtgcttc aagaaaaaga aagcaggcca gacaaaaagc 1560 aacaataact attctggaaa aactcagaaa ggacagacgt ttggcaaccg aggtgcgtct 1620 ggtaaacaaa acaaaacagg agagtgcttc aagtgtggct cgaaagatca ctgggcaaat 1680 aaatgtccga aaagaatcaa caccgtcaat gtgcaggagt cgcacccatt tatacaaatg 1740 gagtaacctt caaaagacaa ttgttttaag cccgctgtaa atgcttcctg ttatcacctg 1800 tactatagtc caatatgaag aagaggtcga attaacacat gagctcacaa atacgttaat 1860 agacaccgga gccgaggcat cactattatc acgcgataat ttgccctcaa atttcaaaat 1920 agaagactcg ccaacaacaa agctagctaa tgctgtcgaa tcaaattttg ctgaatcatc 1980 atcctgcgtt cgatgcaata tccgtatgcc aaacagcgac attgaaatta caaatgtcgc 2040 acttctcata ctcaacgaca catcttcaat ggcatactca gctatcatcg gaatggatgt 2100 cctacgacat atacaactac gcataggcaa gaatgacaat attgttcaat tgaacaagat 2160 tttccgatta aacaagaata tgaatgaagt gcaacctttg gatttcagca ttcaaccaac 2220 aagaaattgt cttattaaca caaaagacat ttatctatat ccgggtgaaa aacttcgggt 2280 ctctatggtg aaaatatggc cttcagataa cgatatttct tcgacagaaa ttcgaacagt 2340 gccaaaactt gaagacaaca agatcatggt tgaacaaaca ttctttaaga aaaattggaa 2400 tcacgtaaat atatcgaata attccgaaaa agatgttttt atcaccaaga acagcatgat 2460 cgcgtctttt atccaacttt caaatgatca aaagctcctt aacacgctga ttacggtaaa 2520 cgacctgaca gccgccgaga aaatcgtaca cgaaacagat ttgaaaaaat ggattcaaag 2580 aagaaacgaa cttacggcaa aaacgtgcat cgatgagaac ataacgaaga aagtttctga 2640 agcaccactt gaacatagag cgactttacg aaaaatcatc gagaaataca aatggtcctt 2700 ggcaagaagt ccgtcggacg ctggcatgag ccaaaaatac ctggcggata tgcagctgaa 2760 cggcgacaaa gcgacattta cgccacctta tcccgttaaa acagacctca tcccgaaaat 2820 tgatgaaaaa atgaacgaac ttgaagctag tggaattgta gaggaatgct gtagcgcgta 2880 caactcccca gtgctattca tactcaaacc aaatgggtcg ctacggactg tcaacaacta 2940 cagctcaggc gacaattcca taaactcaag actagtaatg ccaagatatc caacgatcaa 3000 tgtgcgtgta ctgcttcaaa tggtagcaaa tcacataaca tcgctcaaaa acaaatttcc 3060 gaaagaaaag atcgtctttg gaaatttcga cttagctaat gcattctact gtctggccct 3120 gcgagacaca agccgatcgt acacagcctt catgtactct agaagacaac tgcgatatgc 3180 acggatggcg caaggcctca gttcaagccc aagcattttc gcggccttcg cgagtaaatg 3240 ctttagaagc atctcttctt tggaaaaagg ttactacgtg caaaattacc aagacgacct 3300 gatcctcata tcaacaatgg atcgtattac aatggcgatt gatgcagtct tcgaaagagt 3360 gaaagaaaac aatcttatcg tcagaatcga aaaatgttca ttctacaagc cttcaattaa 3420 ctttcttgga tacacagtct ccgaatttgg aatccaagtg ccagagagaa gagtaaaaac 3480 cctacttgat atgaaatttc cggagacttt caggaaagct caacaatatc aaggtgcatt 3540 caattactac cttcgtcaaa caccagactt gtcagcttta atgtctccac tttcaaaaga 3600 aatcgctaaa ggcaaaaaat acgtactgtc agacgaaatt ataagtaatc tgaaagtact 3660 acgtgaaaaa atacgagatg gaatcggaac agctcatctc gaatacaatt cgaaaaatcc 3720 tgatcgaaaa attttcatcg cagccgacac gagtctttac tgcactggtg gagttatcgg 3780 aaatgtcaca gtaaccaacg gaaacctaga tgacatcaaa atcgccgcat actgctctag 3840 agttctcacg gaacaagaat ctctactagc aagcagagca agagagctaa tcggaataca 3900 agcaacaatc agagccttta aagacatgat cccatcgttc gaagaagtct atatattcac 3960 agaccacaag agcctacaga acattgttca cagtccagga ctcagaactt caggatctac 4020 aagatgcagg tcagcttttg cggacatact ggaaatccct ctatcaagaa tcttcttcgt 4080 cccatcaacg tcggatatta ttaaatgcgt cgactcactg tcgagaatta acatcgctga 4140 aacagaaatc agtttagaaa ccttcaatcc aaaagtttat agaaaagaag aaaaactcga 4200 aattaatacc tataaactca gaaaacgcgt accaaaagtc gactactaca cggtaattcg 4260 agaacagctt gcaagcgaaa aattctcaaa aatccacgaa aaactggata acgatagaat 4320 ggccactatc aacaacgaga tatatgtgaa aagaaacaat gcgcttttca aaaagacgaa 4380 gaacaacaca gaacttttaa tcatcccgaa aaatctcgga agaaacattc tcgaagtcct 4440 tcatgaacaa agtgcacacc tcggtgcaaa aaacctcatc cgaatcatta aaaatgaaga 4500 ggtatggatt gaatcaaaga cgaaaactgc gcacgacgtc tgtagaggat gcatactctg 4560 caagctcatt tacaaaagca cagaaaaaaa acccgaggac atgaaaatca gaccggcttt 4620 cgtgccgtac actatggttt tcaccgacat cattgaaata cggacagatt caagcgttag 4680 cttcaacatc cttacgtttc aagaccaatt ctcacgtaag gtcacttaca gaattgtcaa 4740 caacaaggaa gcaagaacgg tttctgaaaa gctcgcggaa ctcatcgccg aagttggtgg 4800 tcaaggacaa atgtcacttt gtagcgataa tggcggagaa ttcacaggag attctacaga 4860 aaatatgctc aattccttga acgttcacca ttgttacatc agcccaacaa acagccgggc 4920 aaatctctgt gagcgctttc acaaggagtt gagaagaata ctaaaaacaa caacattcaa 4980 cgcgaaaaat gctaaacaca aaatcgatct tagcgtatcc atatacaaca atcgcccaca 5040 tcaagcgctt ggatacctaa caccaaacca agctctgagc aacattgatc cgccgaaata 5100 cttctgcgtt tcctcaccta aagaaacaca agaagaacca gcatacgaag atcaccttac 5160 ggacataaga catatgcaag gtacaatagc aaagcgacat ctggaaacat ttctgacaca 5220 tgagccggtt gaacaagaca agtattctct caatgacatt gttgttctga gagaatccac 5280 aattgttggt catcaccgaa ttaatgaagg acccttcatc atcataaaag tgaaaccaaa 5340 caacaacgtg gaacttcaaa gtctgcttac aggcaaacga ctcagaagaa atatacgatt 5400 tctctccaaa attcatatga atgatgagga caaagagaag tttattacta ataattcaat 5460 cgttttcaat cgtaaaacac ttgaaatcgg taatcagaat gttgaatccg caaaaagtat 5520 actagatctt acttttggta cccctcactc agatcagaac caacagaaaa aggaggaaca 5580 cacaacgatc atagaagaaa agagatacaa tctcagaaaa cgcaaataat gcaatttgca 5640 ctgcttctat tgtcacgctg cttgctcgcg actacgcaaa atgcttccgt cccgcttgag 5700 aatggagttc tgatgtatcg aaacggtgaa aacgcatttt tcaccgaaaa cgagtcacgt 5760 caaaacctca cgttctcctt tcctgcacca aatatagaaa acaaactcgt tacgacatgc 5820 tcacgatcaa acacaaacat aacggaaatg actcgagaat acgccaactt actaatcgaa 5880 gattggctgg atataaggat cgatcatccg ttttacgaac gcaactacac agtgcatgga 5940 gataagatca tcctccatca gaatgctgaa attgcaagaa tcctactcat ccgaatacaa 6000 caacctaact acaaaaatct ggaatggtct ggcgccgctc aaagcacttt ccatttcact 6060 cgctgcgaga gaaataggaa taagcacgtg atactggtcc aggcagaaac taacttcaat 6120 aagattcttt ttcgagtacc ctcttcaagt cagctgctcg gaataatatg gaaacttaaa 6180 agccactcct acaccccaga agtaacgaca tctgaaaaat ctaagattat ccagttcgaa 6240 actgaccttg gccgagtgtc gaagattgaa atttacttca ctggagaaac ccaagaaatc 6300 gaacatggag tgtatgcatc aatctcatgt ctcagtctgg aaattgaaga aattttagta 6360 gttaaggaag aaaagccatc agaattcgca aaatttgttc aacaagatga aaatctttac 6420 agcaaagacg aagaaacaaa tgtagctaat ccgagtataa tgggaagctt cagcagctac 6480 ggaacaacgc tttttcttga agagacacaa gaaaaagcat taatggcagc aaataccatc 6540 tggaaagaag tctacggcaa cagagaattc aaaatgaaat catccaattc aacagtacac 6600 gattacattc agacagttct acagcacaag aacacccgac acaaaattga aatgccggat 6660 gacttttgcg acaacgacaa cggccttcaa ctgcaaattc ttatcgaaac aagtcttccg 6720 ataatcgacg ttggaataag aatagacaac tcagaacctc tagaggaaaa caaagatctg 6780 gccattaaac aactcataga catatatgat ataataccat acgtcatcgg acttcgagcg 6840 acggatattc aaagcctact tcgaggagaa attccatctg acacatacag agtaaaatgt 6900 caaaattctc taagcgacta tattgaattc ttctgtgacg tcaaaaatga gattttttca 6960 aacggcgcag aaagattttt cttcatggtt gacattgatc tttcagagtg gcgatgcaat 7020 aaacttaaaa gattcctctc tcaaaaagga acaacgactt ttgatgttaa atacattaaa 7080 cggcctttgg gaagaaagcg aagacaggta gcaataccat tagtcatttc tggaatcacg 7140 tcctacgtca ctcacgttat cgacaaacac gaattaaacg aacaaaaaga acaactcgaa 7200 aagcaaattc agttttcaaa tgtagagact gcaaaactgg ctgatcttac aaacaaagac 7260 ttatctaacc ttgagataaa ttttgacaat ctgaaaagag agtcagaaaa acaagtcaac 7320 aaaatttgcc aagaagttgg tcaagttacc gatagagtct tcgtaaatca aatcaagaca 7380 gaaatcgaaa gttacaagtc tacaattctg aaatttcttg tggaaacaaa cagccgcgct 7440 cgccacagta gctttatgac ttctgcaatt cacatttgcc gatctattaa cagaaatttg 7500 gacgaggcag catgttacag ctacatgaat ctgcaacgaa tggaagttac cggtcttacg 7560 ccttcaatcg acacttatgg cagaactaca atcaaagtta aggtgacata ttcaattgca 7620 aacttgagaa gcattggacc aattacaaga acactcgcaa ttggagtacc tataggtaga 7680 aaagggaaaa cttatttcta cgagttcacc aaaataccaa gaaaaataac agcaatttct 7740 aatgagccaa tagttgtgga taaatgcact gaaagccctg tggacggatc cacattctgc 7800 aacttcgaaa atatctacaa caaagagttt gacggtaatt gcgtaaaatc tgtcgtgtca 7860 agaaacagct tggcatgccc gaaagaaata ttgctctcgt ctgaacaatg tattactaga 7920 gtcacaaaag attacattct tgtttccagt ttcgttgact tttactcaat tgaacaagaa 7980 ggctacatag gttcacgtaa aactctatca agaggaccaa atgtgactct actcaaaaga 8040 ccagaagatg atacaatgta tggatgcggt acgaaaacat cacttatcaa aggaagaaca 8100 tcttctgaac ccgaaatgat ctctctggaa aatggaaagt caagcgaaaa atacattcac 8160 gattggctgt ccactggaga attcggccac atcgcgagct acacgaatat aactgccttg 8220 cacaataaag aagactacaa acacatatcc gaacttttga atgcagaaaa agttccagca 8280 gctttagcgc caatgtacga acgactgaag ccttactttc caacaattgt catatgttca 8340 acgattataa tgatcgttgg gctcctaatc tgttttataa gaagaggata cataacgtca 8400 agcttactca gaaaactatg gaagaagaga aagatacaga aacgacgaaa agctgaaaaa 8460 atcgcgatgg aacaaattgc agaagataca aaaacgagaa tgaaacggaa aagaagagac 8520 tttcatcagt ttaatctcta aacaacaata gcccgaagct caaattctat ctctcatctt 8580 caatcaactg atcatgcgac tcgactctat taccgctcag ctttcttaac tcctttgcgc 8640 cactcgacca attttttgga tgtaccaaaa attgtcctgt ttcccaaaag agttaaaatt 8700 tttatcttca taaaatttac taactaaatt tttatttgta aaaaaaatga aatttattca 8760 ctattctaaa aaataagaca aaaaaaaaag aaaaggaaaa agataaaaat tacagattta 8820 ctggacagaa ctggatcacg attccactct ccgcaagtct gaacaaaaaa attaataaaa 8880 caaaaaaaaa tgaagtacga aatcacctcc actctcggat tcaggagtaa caacatcttc 8940 aacatcatcg ttcctcggag gaagacaatg aatacggtct tcattagccg gatcctgtaa 9000 atacgcggac gtcaacccag aaggtccagc tgcttccgaa agtgaatatt cgctcgagcg 9060 atgaacttga tacgaatatt tcccggcaag tctctccaga taagaattca gtgaacgacg 9120 gaaatgttct ggtgaaaccg taattttccg aagcttcttg tcggaaatgg gaatgttatt 9180 ctccaggaca atatgagcac tccggcaaaa agaatcgtaa tcaagcaatg gatatcgcgc 9240 aagctgccct ttaaggcgaa gaatccgcat tcgtttgctc cggttgtttc ccttcgaaag 9300 ccaaatcggc gtttcatact tcgccaaact agcaagtcgg tgcggcaccg gattttggag 9360 catttccaaa aaggacaggg aatctggtga acgatcccac cacgaagggt tcaacgtcct 9420 ccaaatgtca agacgctcga taaagaatct tcgctcatat acagcctcgt cattttttcc 9480 agaatttcaa ccactacaaa tttttcggta cacaagctgg aatcagctat gacgggagag 9540 cagactacac aaatacgaag tactgtacaa gcgtcctgct tcttcaattc tgcgagatca 9600 agtctctggc gcaatggaac atgaccggac ttcggacgac cgcagtcggc gaacatatcc 9660 taaatttcta cttcacaaaa aaaaatcatt tgtatttata acccctttta accgtattta 9720 aaaaccattt ttacactttt aaccgtattt taaaaccatt tttacatttt tgcccgatag 9780 cccgaaaaca accataaaaa taataatagt acaaatttca tgcaaactaa ctaaaatctt 9840 ttatttcaaa tctgcacgac cgtcgagttt acagttcttc gtaaacatat gacatgaagg 9900 cttggtgaaa taaaagataa ataaaaatca tgcaaattct aaatctaaat tcaaaaaact 9960 aaatttttgg ggggaggaaa aggggacaaa aatcggacaa aaatgtatcc tgttatcaac 10020 ttaccttttc caagactttt ctcttcatcc ttacaggtca caagctcttt ttaaaaaagt 10080 acttttcgaa aaaaaaaatc agccgtttat aatcaacctt tgtaaacggg ggctgga 10137 // ID Crack-21_AAe repbase; DNA; INV; 5478 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-21_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5478 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1237-1237 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 1134..2186 FT /product="Crack-21_AAe_1p" FT /translation="MEGVDSICVECNKVEKDAGKLITCMYCFSEAHYKCRN FT LVGNAARRIKDRMYFCSHNCSSIYQRITEMQNQKSQIVETLAAELKGAVSN FT AVSLEMKSVRGEVHQITTAIEKSQQFLSDKFDAIVSDFQELRRENENLKQE FT IVKLKQTHQHMSKIVHKLEHDVDKTNRKENCNNAIVLGVPFSPDENTLGIV FT QKIIDCYGCTESNAIVSANRLGGKTKARNFLIPIRVVFQDRDSKEAVFSKK FT KNFGKLNSTSVDANYVINGKPTTITMRDELTPLSLELLREMREHQDRLNVK FT YVWSSRDGNVLVKKNEHSKPELIKTRDDMVDLINRYTNKSPVRTTPSPKRK FT CSNVNSNV" FT CDS 2243..4819 FT /product="Crack-21_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDGLINVYHDCTDDFNLNYSLSVDNYLRIAQWNIRGM FT NDMQKFDNISLFLDSIKIPIDVLVVGETWLKSDNCALYNIPGFKSVFSCRT FT ASAGGLAVYVQVGLVFNVLKNIETDGFHLIHIEINKKGFRSEVVALYRPPS FT FDFNRFHDEIENILSVQNPLPRFFVGDLNIPINLLNNNIVVRYKSLLESYN FT YACSNTLVTRPISNNILDHFVCRNDDLGSIRNDTIYTNISDHLQVVSSIKI FT KGSKECTMLTKKFIDKPKTQELFKNFLEHFNLCGDANNSILTITTTYKNIL FT AQCTKTKSVKVNIKSQHCPWINYYLWQWLKLKQKYLKKVKNNPHNNHLKNL FT LQHVSKRTDDAKRRCKSDYYKQLLENTCHAKLWKNLNEVMGRSKPKTRLEL FT KVNGCSISDSSEVCEAFNDFFSKIGKNLADKIPKSNSNPLSNINRLEPSIF FT LNPANEIEVFGIINELDANKSSGPDNLPTSVVKANINIFSNIFAQLFNQIL FT EQGIYPECLKVAKVTPVFKSGEASDPSNFRPISTLSVISKIFEKMLVNRLV FT KFMDKYNILYKFQYGFRKGSSTSTAIVELVDFLIDKIDNKCVIGGLFIDLK FT KAFDTLNHNILLQKLEYYGIRGVANNIIKSYLSDRHQFVAIEDFRSALKTI FT NIGVPQGSNIGPLLFLLYINDLGRLPLKGIPRLFADDTAIFYPSLDPSAII FT PSMNSDLLILMRFFDSNLLSLNLMKTKYMLFHSPRKKVNFDINLRIGQTEI FT ERVYNFKYLGLVLDPSLSWGIHIDRIQRKVSSLCGLIYRVKPFVPRYALLK FT FYFGCVHSHLQYLIIVWGHACQSKLKKTASFAKSMLQNHLFLTTTISYHSN FT LYKFTT" XX SQ Sequence 5478 BP; 1777 A; 934 C; 1016 G; 1750 T; 1 other; ctggcaacac tggcgtgtag ttgatacgtg tgctaacgat acgcaatttt aattagttct 60 aacagagtgt aatttgatag tttaaatcct tttcwtctca catttgcatc gtgagtattg 120 gcatacatac tattctgaaa tcagttactt aaatattgtg ctaaaatagt cataacagat 180 aaaacttaaa gtgcatgata ctcggttatg aacagagata gtcattggtt tgtgctgtgg 240 tgatgtattg agttggtatt gttgactgct gctgttgttg ttgctggatg gatagtcgta 300 tgccttgcat tgggttaatg aattaaactg aaatttgtgt atgcgtgcat atattattca 360 gattttgcac acaaagtggt ttctgcttct gccttttgag tgaaatacat gactctgcca 420 gatagattta gcgaaaattg agcgaacggg cgtcttgcac aatctagccg ttggttcgcg 480 ctaggtgcat tggtgctgta gtagtttgat tagtagcacg ctttgcggtt ctactggtaa 540 tgaacatgct atagtaaccc tgagtcagtt agtggatttg gcgaaattcg ggcgaacggg 600 cgtcttgcac aatctagccg ttggttcgcg ctaggtgcac tggtgctgta gtagtgtgaa 660 tagtagcaca ccgagtgtat ctgttcaatt cttgtgaaac gaaaacggtt gcgtacaagt 720 cttgtacgca agtctgagta gttcttacag ggtcgacaga ttttgcaaaa tttggatgat 780 agagccgaaa tttgaattat agagcttata tgataaatcc gttgaggagc aaataggtgt 840 attattaact gtccggtaat tcagacgtag catgttagca gcaatatatt tatgtgtata 900 aatgttcgta ccatatgact tttctgttcg tattgctggg tctaattgag ctgccattct 960 ttcatgtttt tggttcaata ttgtagtgta ctgatttttt tgatttttgc tattgagttt 1020 attcaaagtt ggtgtgtgta acggttattt tccttttaca tttatttatt ttagttttcg 1080 ttggaatttg attctggatc tgtcttagca tagttgttct catattcggt ataatggagg 1140 gcgtagacag tatttgcgtc gaatgcaaca aagtagagaa ggatgctggc aagctaatta 1200 cttgcatgta ctgttttagc gaagcgcatt ataaatgccg taaccttgtc ggtaacgcag 1260 cacgacgtat caaggatagg atgtattttt gttcgcataa ttgttctagc atttatcagc 1320 gcatcactga aatgcagaac caaaaatcac aaattgttga aacactagca gctgaactca 1380 agggagcggt ttcaaatgct gtttctctgg aaatgaaaag cgttagaggt gaggtgcatc 1440 aaatcaccac cgcgatagaa aagtcacagc agttcctttc tgacaaattt gatgcaattg 1500 tgtcagattt ccaagaatta aggagggaaa atgagaattt gaagcaagaa attgttaagc 1560 ttaagcaaac acaccagcac atgtcaaaaa tagtccataa gctcgaacac gatgttgata 1620 aaactaaccg taaggaaaat tgtaacaatg ctattgttct aggggtaccc ttctctccag 1680 acgaaaatac attaggaatt gttcaaaaaa ttatcgactg ctacggttgc actgaaagca 1740 acgcaattgt gtcagcaaac agacttggcg gcaaaaccaa agcaagaaat ttcttgattc 1800 ccattcgagt agttttccag gatagagatt caaaagaagc cgttttttcg aaaaagaaaa 1860 atttcggaaa acttaattct acttcagttg atgcaaacta cgttatcaat ggaaaaccga 1920 ctacaattac aatgcgggat gaactaactc cgctgtcgtt agaattgctg agagaaatgc 1980 gggagcatca agatagatta aatgtcaagt atgtctggtc aagtagagat ggtaatgtcc 2040 tagtcaagaa aaatgaacat tctaaaccgg aattaatcaa aacacgcgat gatatggttg 2100 atcttattaa tcgttacaca aataaatcac cggttagaac aaccccttcg ccaaaaagga 2160 aatgttcaaa cgtcaacagt aatgtgtaat tctttaatgt aattttaaat gttgtgtcta 2220 cttataataa tttattttca aaatggatgg tttaataaat gtttatcatg attgtactga 2280 tgattttaac ttgaattatt ctctaagtgt tgataattat cttcgaatcg ctcaatggaa 2340 catacgaggc atgaatgata tgcaaaagtt cgataatatt tctctattct tagatagcat 2400 caaaatccca attgatgtgt tagttgttgg tgaaacgtgg ctcaaatcag ataattgtgc 2460 gctatataac ataccaggat ttaaatctgt tttttcttgt cgtactgcat cggcgggcgg 2520 ccttgctgta tatgtacagg taggattagt gttcaacgtt ttgaaaaaca ttgaaactga 2580 tggatttcat ttgatccata ttgaaattaa taaaaaaggg ttccgtagtg aagtggttgc 2640 attatacaga ccaccatctt ttgacttcaa tcgattccat gatgaaatag aaaatatatt 2700 atcggttcaa aatccacttc ctcgtttttt cgtcggtgac ttgaacattc ccattaattt 2760 attgaacaac aacattgttg ttcgatacaa aagccttctc gagtcctata attacgcttg 2820 ctcgaataca ctcgtgactc gaccaattag caataatatt ttggaccact ttgtatgcag 2880 gaacgatgac ttaggaagta taagaaatga taccatctat acgaatataa gtgatcattt 2940 acaggtagtt tcttcaataa aaatcaaggg ttctaaagaa tgcactatgc ttacaaagaa 3000 attcattgat aaaccaaaaa cgcaagagct gtttaaaaac ttcttagaac attttaatct 3060 atgcggtgac gcgaacaatt caattttaac cataaccact acctacaaaa acatattggc 3120 tcaatgtacg aaaaccaaaa gcgtaaaagt aaacatcaaa tctcaacact gtccttggat 3180 taactactat ttatggcaat ggttgaaact taaacaaaaa tatctcaaaa aagtaaaaaa 3240 caacccccac aataaccatt tgaaaaatct gctgcaacat gtgtccaaaa gaactgatga 3300 tgctaaaagg aggtgtaaga gcgattatta caaacaactg ttggagaaca cttgtcatgc 3360 aaagctttgg aaaaatttga acgaggtcat gggacgttct aaaccaaaga ctcgcttgga 3420 actaaaagta aatggttgtt caatatctga tagttcagaa gtttgtgaag cattcaatga 3480 tttcttttca aaaataggga aaaatcttgc cgacaaaata ccgaaaagta attccaatcc 3540 tttaagtaac attaaccgat tagaaccctc aatttttcta aaccctgcga atgaaatcga 3600 agtattcggg attataaatg aacttgatgc aaataaaagt agcggtcctg ataatctccc 3660 aactagcgtc gtcaaggcta atatcaacat cttttcaaat atttttgctc aactttttaa 3720 ccaaatatta gagcagggaa tatacccgga atgtcttaaa gttgcaaaag taactcctgt 3780 ctttaaatcc ggtgaagcgt ctgaccctag caattttcgg cccatttcaa ctttatctgt 3840 tatcagtaaa atttttgaaa aaatgctggt aaacaggtta gtaaaattca tggataagta 3900 caacatatta tacaaatttc aatatggctt cagaaaaggt tccagtacct ctacagcgat 3960 agttgaactt gtcgattttt taattgacaa aattgataac aagtgtgtta ttggtggtct 4020 ttttatagat cttaaaaagg catttgacac cctaaatcac aatattcttc tgcaaaaatt 4080 agaatactat ggaatacgag gtgtagcaaa taatatcatt aagagctatt tgagtgacag 4140 acatcaattt gtagcaatcg aggactttcg aagtgctttg aagacgatca atattggcgt 4200 tccccaaggc agtaatatag gccctctttt gttcctttta tatataaacg accttggaag 4260 actaccttta aagggaattc cgagactttt cgcggatgat acagcaattt tctatcctag 4320 tctggatcct tctgctatca ttccgagcat gaacagtgat ctactcattc ttatgcggtt 4380 ttttgactcc aatttattgt cactaaattt aatgaaaaca aaatatatgc tgtttcattc 4440 cccaaggaaa aaagttaatt ttgatataaa tcttcgtata ggacagaccg aaattgaaag 4500 ggtgtacaat tttaaatatc ttggcttagt attagatccg tcactttcat ggggtatcca 4560 tattgatcga attcaaagaa aggtctcatc tctttgtgga ctaatatatc gagtgaaacc 4620 atttgttcct cgttatgcac ttcttaaatt ttattttgga tgcgtacatt ctcatctcca 4680 atacctcatc atcgtttggg gtcatgcctg ccagtctaaa ttgaaaaaaa ctgcaagttt 4740 tgcaaaatcg atgcttcaaa atcatttatt ccttaccacg actatatcct accattcaaa 4800 tctatacaag tttaccacat agcgctttac ctctacgtgg cctttgcgac ttacaatctt 4860 gtttgtttgt ttatgatatt ataaaaaatc ctaacatgca tactaacttg gtattacctg 4920 ctagttccca cagctacaat acaagacatg caagtaactt gatcagatct agagccttaa 4980 cctgtctcgg tcaaacgcgg atatcattca acggaccttc tatatacaac aaaataccaa 5040 cgcgattgaa agcgatcaac aatagaattt tattcaaaac gagtttaaaa cagtactaca 5100 gatccaaaat caacacattc ctgaattaat ttcgtatcaa ctgccagttt tctgatattt 5160 tcatttaatt tatagcactc tgtttaatgt tctatgtatg ttttatttaa tagtgtataa 5220 ttatatattt agattaaact aatttagttt ctactctggg atcccttaaa aggaacaaag 5280 ttccactggg catccctaga tatatttagt attttttgta gtattcatca ctttcgttca 5340 cacaccaatg taataatgta aaagtaagtg taattattgt aaatttagat gagtccacta 5400 ccagggggct cgatgcagag ctttttggtg tgggggttag tggtgggcta aaaaaaaaaa 5460 aaaaaaaaaa aaaaaaaa 5478 // ID hAT-N6_AP repbase; DNA; INV; 588 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N6_AP. XX NM hAT-N6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-588 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2106-2106 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 588 BP; 180 A; 83 C; 93 G; 232 T; 0 other; tagggcagag gacttgtagc atttgcatat tttttttgca cagccacaaa aatagcagag 60 tgcgaaacaa atacgtttca tgcttccaca aagctagttt tgaaatttta ttgtgcatat 120 aaatgcatat tttacgtttt tttggtttgt agggcatatt ttacactttt atacgctatt 180 ttatattttt tgagacatat tttacttgaa tcgatttttt ccacagcgac atttttattt 240 caaacaatac agtcaaaatg tttggataaa aatttattta tttttaattt ttgtcgttta 300 atatggtttt tgactgtgct cgggcacacg tttttcttag aagcgataac gattttaacg 360 taaactatac ctacgacatc accaccacag atacgtgtgt acaccatgcg ggcttttatt 420 cgattaggat aaataaataa taatgttaat gtattatgta tttattattt aaataaaaag 480 gctaaaaaaa attttaagtg catattttaa ttgcatatta tggggttttt tagtgcataa 540 gtgcttgcat atttagagct tttttagagc tacaagtcct ctgcccta 588 // ID BEL-3_DGri-LTR repbase; DNA; INV; 226 BP. XX AC scaffold_14822; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Drosophila grimshawi genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_DGri_; KW BEL-3_DGri-I; BEL-3_DGri-LTR. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Drosophila grimshawi genome."; RL Direct Submission to RU (06-MAR-2011). XX DR Genome; scaffold_14822; Positions 773707 773932. XX SQ Sequence 226 BP; 81 A; 38 C; 35 G; 72 T; 0 other; tgctttgcat aactctggtg gatcttattc acttatgtaa acactacttg taattttagt 60 aattatcgat agttcactat cgaattagcc caaattgtaa taataaagag aacaagcaat 120 aatcgcacct ttgtgtcaga aaataaagtt gttaattaaa atagtttaaa taccgcccag 180 acttgaagtc tcatggttgc ataaaaaacg ctttgtgcaa cgaaca 226 // ID BEL-641_AA-I repbase; DNA; INV; 6635 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-641_AA_; KW BEL-641_AA-LTR; Pao_Bel_Ele8; BEL-641_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6635 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5609-6118] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1270..2427 FT /product="BEL-641_AA-I_1p" FT /translation="MTEKKIREKIKKRERIVASLKRHAQFLGSFNPDIHTG FT EVQSRLDKIEAKFEEFEEVQEEISELDVEGIYEEDCTVAYEEFEKLYYRLR FT AALLAKLPTEGTAADLNNTLARNGPPLGAHTGVRLPQISLPEFDGDYKGWL FT SFKSTYVSLIHDSGELSDVQKFHYLKSALKGEAAKLIESLTLTNDNYSIAW FT NTITKRYSNEYLLKKRHLQALMEYPKVEKESSTAIHALVDEFEQRLKILKQ FT LGEKTDDWGAMVVHWMCSKLDANTLQLWEDHAASTKDPTFTILVNFLEKRT FT RVLEAVSSNVELKSSMQKVEVKRQKVVVHAATDGEKSGSACCCCGESHFLG FT RCSRFSKMTLKEKLQFVNSKRLCSNCLKSGHWVRDCTSKFSCP" FT CDS 4427..6118 FT /product="BEL-641_AA-I_4p" FT /translation="MYKISELRISRFAFASKWVSVQFHCFADASTLAYGAC FT LYVRTTDAVGNVRIELLSSKSRVAPLKRLTLPRLELCAAKEAALLHSKVTK FT SLSIENVQSFFWSDSTIVLHWLRSPPNTWQTFVANRVSTIQITTHAHFWRH FT VAGKENPADLVSRGMAVDDFLNSQLWKEGPPWLRDTEDAWPYSDEYCTPCE FT EHLEIRKNVHTVRVAEPPNELFGLRSSLPSLLRVVAYCRRFAHNSRYPNDR FT VASVELTADDIKSAKMALTRMAQAERFPEELKDLQRQQHVDNKSSLKRLCP FT FLDEDGVIRVGGRLRLSNESYTVKHPAVLPNHHPFTDLVIQFVHSQNFHSG FT PQLTLADIRQEFWPIHGKRVVNAVLRKCVRCFRTNPTPIQQPMGQLPVGRV FT RPGRPFLITGVDYCGPFFLKSSRRNTAPIKVYIAVFVCFSTKAMHLEMSSD FT LSTASFLSVLRRFIGYRGIPAEIHSDNAKNFSGARNELKALYDLLNDPLSF FT SVICKELSQQGIKWHFIPPRAPNFGGLWEAAVRSVKTALKKEVGLKQLSYD FT QFTTLLVQITATLNSRP" FT CDS join(2573..3358,3362..4399) FT /product="BEL-641_AA-I_2p" FT /translation="MESEDESDQGAVGSYNVGAKSGRISNVLLSTVVLVIR FT DQHGGKQMARALLDNGSQANIMSERLCQMLSLKRRTINVPISGVGEAETRA FT RFLVNTTVSSRVQDFSVGMEFLVLQKVTSELPSGHIPVGHWKIPTDIQLAD FT PNFNVSNRIDLLIGAEHFYRFLFERDTKRITLGPGLPMLINTVFGWIVTGK FT VSEVKSKAVSCCVASAPDSLEAQLHKFWEIESNEDRPAWSKEEQDSENHFL FT RTFSRTEEGRYVVRLPKHVNFQMLGDSRAMALSRFKKLERQLGWNTDKRLQ FT YNAFMQEYLDLGHMRQVSEEELLEEAKSPRKGYYLPHHAVLKESSTTTKVR FT VVFDGSASTDSGYSLNDALLKGPVIQDELLSLLLRFRKHEVALVGDVEKMY FT RQVQVHSEDTVLQRIFFRFTPEEPVQEFELTTVTYGLKPSSFLAIRALHQL FT VTDEGAEYPDATDAIVNDFYVDDYIGGASSVDEAIQLQNNLDTLMKKGGFA FT LRKWCSNRPEVLKGIPADQLGTNLSISFDINPEEKVKTLGITWEPGTDQLR FT FFYDTTDSEQAWTKRNILSSIAKLFDPLGLISPIIVTAKMLMQELALLNIE FT WDTPVPVDIE" XX SQ Sequence 6635 BP; 1779 A; 1493 C; 1629 G; 1723 T; 11 other; ttttatggtg ccgtgaccag gatttgtggt tggagaaagc ctttattcgc caagtgtatt 60 gttccgtgaa ataatacccg tccgggggcc atctttggga caacggatct tctgttaccg 120 tactcagcaa cgggcggaga atttgctctc ccaagtacta catccactgc actcacatcc 180 acccatcttg cttcatacgt tggacgcctt ccgcactgtg tcacgcgaca attactgcag 240 ttgtgagaat aaatacaagg cctatttcgc ataggccaca ggtgagtggc tttccatctc 300 tatctcgctg gcttgcgggt gcttttgata gtttttcttg gttccatgta cactggtttc 360 ttctggtgct tgtaacgtcg cgcgactcgg ctttgcaacg tcggataagc tgtttcggta 420 tcgcatacac tggacgactt gtgatatcga ttcattcaac cccttgacgg attggtagta 480 ccttcgttcg tcggttcgct tctggagagc aatttggaaa atttaggaat tcaaggccta 540 ttgtcgacag gcctcaggtg agtgcctgtc caataactaa acgcatattt ggtggtgcta 600 taagatacct tctttaacct ttttggtgtt cgtgatcgtt ggctacgatc ggaggagtat 660 caccgacgga acgttcgaca ggcggactag tacttctggg gagcccaaca actgggaggc 720 gtgacgtcac actattacgg gacgagccag cttcaacgag acgatttacc tagaggattt 780 gatcatcaac aaacaaggtc agttgtatcg aaccatctat ttcttgggga attgaataca 840 ctgaaggcga atttcattca acatacaagg cctgttgtag acaggtacaa ggttagtacc 900 tgtccaactg ctattacttc atacagtgct attttggatt ttttctcctt ttttggttgc 960 tgggtttcag cttcttgtca aaaggagttt tgctgcaaca cggtctgcat tgctggttgc 1020 actcttcaag atttttcaac aagcctcaat tcgaagtacg acaacgatca ttccaagttg 1080 cgccatcata cgggggccga cgaaagctgc attagtttga atacaaggcc tatttgagac 1140 aggcctcagg tgagtacctg tccagcagcc taattggagt tttgcagtgc ttcttggctc 1200 gtttccattc tcatttccgg tcgtcatcgt tcgtacttcg atttgctgtg ctggtagtgg 1260 ttgtgggaaa tgactgagaa gaagattaga gaaaaaatca agaagcggga gcgcattgtt 1320 gcgtcattaa aacgacatgc gcaatttctg ggtagtttca acccagatat tcatacgggg 1380 gaagttcagt ctcgattgga caaaattgag gcaaaatttg aggaatttga ggaggttcag 1440 gaggaaattt cggagttgga cgtggaaggc atctatgagg aagactgcac cgtagcctat 1500 gaggagttcg agaaattata ctatcgtcta cgagcggctt tgctggcgaa actaccaacg 1560 gaggggactg ctgctgattt gaacaacacg ttagcgcgga atggtccacc tttgggtgcg 1620 cacactggag ttcgcctacc tcaaatttct ttaccggagt tcgacggcga ctacaaagga 1680 tggctatcgt tcaaatcaac atacgtgtcg ctgattcatg actccgggga gcttagcgat 1740 gttcaaaagt ttcattattt gaaatcagct ctaaaggggg aagcggccaa acttatcgaa 1800 tcgcttacgc tcaccaatga caactattcg atcgcttgga acacgatcac gaagcggtat 1860 tccaacgaat acctgctgaa gaagaggcat ctacaggcgc tgatggagta cccgaaagtg 1920 gagaaggagt cgtctacagc aatccatgct ctggtggatg aattcgagca gcgtttaaaa 1980 attttgaagc agttaggaga gaagactgac gattggggtg ctatggtcgt ccattggatg 2040 tgctccaagt tggatgcaaa tacactgcaa ctttgggaag accacgcagc gtctacgaag 2100 gacccgacat tcacaatttt ggtgaatttt ctcgagaagc ggactagggt attggaggcg 2160 gtttcatcga acgtcgaatt gaaaagcagt atgcagaagg tggaagtcaa gagacagaag 2220 gtagtcgttc atgccgctac tgatggcgaa aaaagtggtt cggcttgctg ttgttgcgga 2280 gagtcgcatt tcttgggacg atgcagcagg ttttcgaaaa tgacgctgaa ggagaaattg 2340 cagttcgtca acagtaaacg cctttgcagt aattgtctga agtctggcca ctgggtacgc 2400 gactgcacat caaagtttag ctgtccgtga ctgtgggaaa aagcataata cgctgattca 2460 tccagggttt ccgttgagca gcagcggtgc tgtacacaac gatcatactg taggcaaacc 2520 ggagaagaca cggaacgaaa aaccggtagc aaccaacgta gctactaacg agatggaatc 2580 tgaagacgaa agcgatcaag gagcagttgg atcatacaac gtaggggcca agagtggcag 2640 aatttcaaat gttctattat ctacggtcgt actagtcatt cgagaccagc atggaggaaa 2700 acagatggct cgagcattgc tagacaacgg atcgcaagct aacatcatga gtgagcgatt 2760 atgtcagatg ctgagcctca aacggaggac gataaacgtg ccaataagcg gtgttggtga 2820 agcggaaaca cgagctagat tcctggtaaa cactactgtt agctctcggg tccaagactt 2880 ttccgtcgga atggaatttc tagttcttca gaaggtaacg tcggaattgc catcagggca 2940 catacctgtg ggacactgga aaattccaac tgacattcaa ttggccgacc cgaacttcaa 3000 cgttagcaac cgaatcgatc ttctgattgg agcagagcat ttctaccggt tcttgtttga 3060 aagagatacg aaaaggatca cgctaggtcc ggggctgccg atgctgatca atacagtatt 3120 cggttggatt gttacgggga aagtttccga agtaaagagc aaagcagtta gctgttgcgt 3180 agcgtctgct ccagatagtt tggaagccca attacacaaa ttttgggaaa tcgagagcaa 3240 tgaggatcgg cctgcttggt cgaaagagga acaagatagc gaaaatcatt ttctacggac 3300 gtttagtcgc acggaggagg gtcggtacgt cgtacgttta ccgaaacacg tgaatttcga 3360 wcagatgctt ggagactcac gcgcgatggc tctatcgcga ttcaaaaaat tggaacggca 3420 actaggatgg aatacggata agcgccttca gtacaatgca tttatgcaag aatatctgga 3480 tctgggacac atgaggcagg tcagcgaaga ggagcttcta gaagaggcaa aaagccccag 3540 aaagggctac tacttacccc atcacgctgt gctgaaggaa tccagcacta caacaaaggt 3600 tcgtgttgtt ttcgacgggt cagccagcac ggatagtggc tactcgttga acgacgctct 3660 tctaaaaggt ccagtcatcc aagatgagct ccttagcctg ctgcttcggt tccgaaagca 3720 cgaagttgcg ctagtcgggg atgttgaaaa gatgtaccgg caggtacagg tacactccga 3780 ggacaccgtt ttgcagcgca ttttctttcg gtttactcca gaagaacccg tacaggaatt 3840 tgagttaacg acggtcacat atggattaaa accttcatca ttcttagcga ttcgtgctct 3900 gcatcaactg gtgacagatg aaggagctga atacccggac gctactgatg caatagtgaa 3960 cgatttttac gtagatgatt acatcggcgg agcatccagc gttgatgagg ccatccagct 4020 acagaacaat ctcgatacgc tcatgaagaa gggtggattt gctttacgca agtggtgttc 4080 caatcggcca gaagttttga agggtatacc agccgatcaa cttggcacta atttatctat 4140 ttcctttgat ataaatccgg aggaaaaggt gaaaaccttg ggaattacct gggaacccgg 4200 gacggatcag ttgcgattct tctacgatac macagatagc gagcaagctt ggacgaaaag 4260 aaacatcctg tcgtcgatag cgaaactttt cgatccktta ggattgatat cgccaataat 4320 agttacggcc aaaatgctga tgcaagagct tgctttgctt aatatagaat gggatacgcc 4380 tgttcccgtc gacatcgaam akaagtggaa agmsttccac tcmcaaatgt acaaaatttc 4440 mgaattgcga atcagccgat tcgcttttgc ttccaaatgg gttagcgttc agttccactg 4500 cttcgctgac gcttccacct tagcttatgg tgcgtgctta tatgtacgga cgacagatgc 4560 agttggaaat gtccggatag agctgctgtc ttcgaaatcg cgcgttgcac cgctcaagag 4620 attgacgcta ccgcggctcg aactatgtgc agccaaggaa gctgcgcttc tacactcgaa 4680 agtgactaaa tctctctcca ttgaaaatgt gcaatctttt ttctggtctg acagtacgat 4740 cgtactacac tggctacgat ctccgcctaa tacttggcaa acctttgtcg ccaatagagt 4800 ctcgaccatt caaatcacta ctcacgctca tttctggcgt cacgtagcag gaaaggaaaa 4860 tcctgctgat ttagtatcgc gaggaatggc agtcgatgat tttctgaata gtcagctgtg 4920 gaaagaagga cctccatggc tgcgtgacac cgaggatgca tggccctatt ccgatgaata 4980 ctgtactccc tgcgaagaac acttggaaat tcgtaagaac gttcacacgg tcagagtcgc 5040 cgaacctccc aacgagctat tcggcctgcg ttcatcattg ccttctttac ttcgtgtcgt 5100 cgcttactgc cgtcgttttg cmcacaacag tagatacccg aacgacagag tggcttctgt 5160 tgaattgact gctgatgaca taaaatcggc taagatggca ttaacccgca tggcacaggc 5220 tgagcgattt ccggaagagc tcaaagattt acagcgacaa cagcatgttg acaataaatc 5280 gagcttaaaa agactctgtc cattccttga tgaagatgga gtcattagag ttggaggtcg 5340 tctgcgtctg tcaaacgaaa gctacacggt aaaacatccc gctgttttgc caaatcatca 5400 cccattcacg gatctcgtga tacagttcgt ccattcgcag aactttcaca gcggtccaca 5460 attaacattg gctgatataa ggcaggaatt ttggcccata cacggcaaac gcgttgtcaa 5520 tgctgtgctt cgtaagtgtg ttcgatgttt tcgaaccaac ccaacgccta ttcaacaacc 5580 catgggacaa ctaccggttg gtcgagttcg cccaggacga cccttcttga tcactggggt 5640 tgactattgc gggcctttct ttttgaagtc ttcacgccga aataccgctc ctattaaggt 5700 gtatatagca gtttttgtct gcttctcaac gaaggctatg cacctagaga tgtccagcga 5760 tctatctact gcaagcttcc tctccgtttt gcgacgattc atcggctacc gaggaattcc 5820 tgctgaaata cattcggata atgctaaaaa cttctctggg gcccgtaatg aattgaaggc 5880 attatacgac ctgctcaatg atccactcag cttctccgtt atttgcaagg agctctctca 5940 gcaaggtatt aaatggcact ttattccgcc gcgtgctccc aactttggag gattatggga 6000 agctgccgta cgctctgtaa aaactgcgct taagaaagaa gtcggcttga aacagctgag 6060 ctacgatcaa tttaccacgc ttctggtaca aatcaccgct acgctgaact cacgacctta 6120 gtctacttta tcggacgacc cacggaaccc tcagcgccaa cttccgcgca tttgtgatag 6180 gttcggccaa gaaagccctc cctgagccca atattatttc tattcccaca aacctccttg 6240 acataccgca aatacaacaa atttctaaca tttttagtaa cgatggatgg camtagatcc 6300 cgcacgcagc tacaaataca acgaataacc tgccttcgtc cccattcaca ttagcagtat 6360 cgctgtgttg ccaaagacaa attaaatcag ttttcttggt aattaatgaa aattatcggc 6420 ttacatccag gctcggatgg gatcgtacga gttgcgacgg tgaagacggc gactggagta 6480 tacaagcggg cagtcaaccg tatttgtcca ctgcctaccg acgatgtggg tgtacgtact 6540 acgaggtctt cagcacattt gcggattcgt aaagataaga tttaaaatcg aattgatcat 6600 tacatttgtt gaagctagtt caagggggcc ggtaa 6635 // ID Gypsy-2_AA-I repbase; DNA; INV; 4208 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_AA_; KW Gypsy-2_AA-LTR; Gypsy-2_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4208 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 973-973 (2011). XX DR [2] (Consensus) XX CC Positions [3170-3628] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS join(20..2374,2378..4159) FT /product="Gypsy-2_AA-I_1p" FT /translation="MANPGEIQAGGDGAAAGVPAVLPPPTFAIESFDKRKL FT KWMRWVERLETAFLIYGVADVQMKTHFLLHYMGSETYDVICDKVAPDSPRT FT KTYQEIVTTLDDFFSPQPLEISENFRFKCRRQGDKDAASADETVDEYLVAL FT RRIAVTCNFGAYLETALRNQLVFGIKRNDIRSRLLERRELTLKDARDIAVS FT MELSRKGGAAIEGSSTRQEVHAVQHPAGKKEKQNAKNKSAGKSANDGSCYR FT CGEKSHFANACKHKQTVCSYCKLTGHLAKVCLKKSAAAKSDSRSTNKSHSV FT QTNYVDQSGDRQRPERVEVREVCTVESSSRCKKLLMDVYVNGKCVRFEVDT FT GSPVSIISAEDQNRFFPGARLCKSDTDLVSYCNTSIDVLGVFNASVEYNGK FT TMKLPLYVVNSGKHPLLGREWLSEMSVDWSCVFQGPIAVSAIAATASPCRD FT VALRALLEKFPKVFDASIGRISNVQANLPLKKDARPVFLKARKIPFNMLKT FT VEDELEKLVAEGVLTKVNSSNWATPIVPVKKSQNRVRICGDYKQTVNPNLI FT VDKHPLPTVDELFASLAGGKKFSKIDLVQAYLQLEVAPEHREILTLSTHRG FT LYRPNRLMYGVASAPAIWQRQMEAILQGIEGVSVFLDDIKITGENDEVHLR FT RLEEVLRRLNECGIRVNKDKCEFFVDNIEYCGYLIDKDGIHKIRKKVQAIQ FT DMPRPKNVDEVRSFVGFINYYGRFFQNLSTVLYPLNNLLKNDVPFRWTKQC FT EDSFRKVKEQMQSDNCLVHYSPELPLLLATDASPGVGAVLSHVYPDGTERP FT IQFASQTLSRVQQKYMQVDKEAYAVIFGVKKFFQYLYGRKFTLLTDNQAIS FT KIFGEHKGLPVMSALRMQHYATFLQSFDYQIRFRKSSNHSNADAMSRFPLE FT KTDPENEIEESDVVELSQIDTLPLTAAELGQVTAEDQTVQKLLQGIKHGQL FT VDAKDRFGVEQHEFSLQKGCLLRGIRVYVPAALRKRVLEELHSAHFGTTRT FT KSLARGYCWWPGLDRDIEEMVSNCADCQSVRAEPTKMDLHCWETPSAPFQR FT VHVDFAGPFMDTYFFIYVDAYSKWPEVQICRSTTAECTVSMCREIFSKFGI FT PSILVSDHGVQFTSEVFQRFLKMNGVVHKMGAPYHPATNGQAERYVQTIKQ FT KLKALKCSKSQLNLELCNILLTYRKMIHPSTGKSPSMLVFGRQIRSRLDLL FT LPKNETSSKADIIVRQFLDGDRVRVRDFLSKDKWKFGRIAEKLGKLRYAVR FT LDDGRIWERHIDHIVGVGADLQDVSVDTPQEEEFIERYEPNLTASSAVDSA FT ADAVVPALPQAETTNATPVISCPNAATAPALAPTPERFASTEAGRSESTVP FT LRRSSRTVKPPQRLNL" XX SQ Sequence 4208 BP; 1087 A; 981 C; 1153 G; 983 T; 4 other; atagttctgg cgacggagga tggcgaatcc cggtgaaatt caagctggcg gagacggtgc 60 tgctgcggga gttccggctg ttttgccccc accaacgttt gcgatagaat cgttcgacaa 120 acgtaagtta aaatggatgc ggtgggttga gcgtctcgag accgctttcc tcatctacgg 180 agttgcggac gtgcagatga aaacgcattt tcttctgcat tacatggggt ccgaaaccta 240 cgatgtcatt tgcgacaagg ttgccccgga ttccccccga acgaagacgt accaggaaat 300 cgttacgacg ttggatgact tcttcagccc gcagccgctg gaaatcagcg aaaattttcg 360 cttcaagtgc cgccgccagg gcgacaaaga tgccgcatct gccgacgaaa cggtggatga 420 atacctggtg gcccttcgga ggattgcggt tacgtgcaat ttcggtgcgt accttgagac 480 ggcgttacgc aaccagcttg tgtttggcat caaacgaaac gacattcgga gccggctgct 540 ggagaggcgg gagctaacgc tgaaggatgc tcgcgatatt gccgtgagca tggagctttc 600 acgcaaagga ggagctgcga ttgaaggtag ttcaaccagg caggaggtac atgcagtgca 660 acacccagca ggtaaaaaag aaaagcaaaa tgcgaaaaac aaaagtgcgg gtaaaagtgc 720 gaatgacgga agttgctatc gttgcggtga aaaatcgcac ttcgcgaatg cgtgcaaaca 780 taaacaaacg gtttgctcgt attgtaaatt aaccgggcat ctcgcgaaag tttgcctaaa 840 gaaatctgcc gcggcgaaat ccgattcacg gtcgacgaac aaatctcatt cggttcaaac 900 gaattacgtg gaccaatctg gcgatcgtca acgccccgaa cgtgtggaag tgcgagaagt 960 ttgtacggtc gaatcgtcgt cgcgttgcaa aaagctgcta atggatgttt acgtgaatgg 1020 maagtgcgtt cgattcgagg ttgatactgg ttcgccggtc agtatcatca gtgcagagga 1080 ccagaacagg ttcttcccgg gtgcgcggtt gtgcaaaagc gatacggatt tggttagcta 1140 ctgcaatacc agcatcgatg tgcttggtgt cttcaacgcg agtgttgagt acaatggaaa 1200 aacgatgaaa ttgccgctgt acgtggtgaa ttccgggaag catccgctgc tcggccgcga 1260 gtggctgagc gagatgtcgg tggactggag ctgcgtgttt cagggaccga ttgctgttag 1320 tgcgattgct gctactgctt ccccgtgccg cgacgttgct ttgcgggcgt tgttggagaa 1380 attcccaaag gtgttcgatg cctcaatcgg aaggatttcc aacgttcaag ctaacctacc 1440 actgaagaag gacgcacgac cagtgttcct gaaggcacgg aaaataccat tcaacatgct 1500 taagacagtt gaggatgagc tggagaaact ggttgcagaa ggcgtactca cgaaagttaa 1560 ctccagtaac tgggcaacgc cgattgttcc cgtcaagaag tcgcagaatc gcgtgaggat 1620 ttgcggagac tacaagcaaa ccgtgaatcc gaatctcata gtggacaagc atcccctgcc 1680 tacggtggac gagctgttcg cttcgctagc tggaggaaaa aagttcagca agattgatct 1740 cgtccaagcg tacctgcaac tcgaagtggc accggaacat cgggaaatac tcacgctgtc 1800 tacacatcgt ggtctttatc gtccgaacag gctcatgtat ggtgttgcgt cggctccggc 1860 catctggcag cgccaaatgg aagccatact acaaggaata gaaggcgtta gtgttttctt 1920 ggacgatatt aagataacag gtgagaacga tgaagttcat ttgcgtcggc tggaggaagt 1980 acttcggcgg ctgaacgagt gcggaatacg agtcaacaag gacaagtgcg agttcttcgt 2040 ggacaacatt gagtattgcg gatacttgat tgataaggac ggcatccaca agatccggaa 2100 gaaggtgcaa gctatccagg atatgcccag accgaaaaac gtggacgaag ttcggtcgtt 2160 cgtcggattt attaattatt acggacggtt cttccagaac ctgagcacgg tgctgtatcc 2220 tctgaacaac ttgctcaaga atgacgtacc gttcagatgg accaagcaat gcgaggattc 2280 cttcaggaaa gttaaggaac agatgcaatc cgacaactgt cttgtccatt attcaccgga 2340 gttgccacta ctgctagcta cggacgcttc gccctwcgga gtaggggcgg tactaagcca 2400 cgtgtatccc gatgggactg aacgtcctat tcagttcgcg tcccaaaccc tcagtcgtgt 2460 gcagcaaaaa tacatgcagg tggacaagga ggcgtacgcc gtcatatttg gagtgaagaa 2520 attcttccaa tacctttacg gccggaagtt caccctgctt accgacaatc aggctatctc 2580 gaagatcttc ggggagcata aagggttgcc ggtcatgtct gcattaagga tgcagcatta 2640 tgctaccttt ttgcagagtt tcgactatca aatccggttc cgtaagtcgt ccaatcactc 2700 aaatgctgat gctatgtcca gatttccgtt ggagaaaacc gatcccgaga acgagataga 2760 agaatccgat gttgtggagt tgagtcaaat cgacacacta ccattgactg ctgctgagct 2820 aggccaagtt ackgcggagg atcaaacggt gcagaagctt ctccaaggaa tcaaacacgg 2880 acaactggtg gatgcgaagg atcgcttcgg agtagaacag cacgagttct ccctgcaaaa 2940 aggatgcttg cttcggggaa ttcgagtkta cgtgcctgct gctctccgga aacgtgttct 3000 ggaggaatta cattctgctc acttcgggac aactcgtact aagtcgttgg caaggggata 3060 ttgctggtgg cctggattgg accgggatat agaggagatg gtatcaaact gcgccgactg 3120 tcagtcggta cgtgccgagc caacgaagat ggaccttcat tgctgggaaa ctccgagtgc 3180 tcctttccaa agggtccatg tcgactttgc tggtcccttt atggatacct acttcttcat 3240 ctacgttgac gcttacagca agtggccgga ggttcagatc tgcagatcca ccacggctga 3300 gtgcaccgtt agcatgtgtc gcgaaatctt cagtaagttt ggcattccgt caattctggt 3360 gagcgaccat ggcgtccaat ttacatcaga ggtattccag cgattcctga agatgaacgg 3420 tgtagtacac aagatgggtg caccgtatca cccggcaacg aacggacaag ctgagaggta 3480 cgttcaaacc atcaaacaga agttgaaggc gttgaagtgc tccaagtctc agctgaacct 3540 ggaattgtgt aacatcctac taacttaccg caaaatgatt catccgtcaa ctggtaaatc 3600 accctcaatg ctcgtgttcg gtcgacagat tcgatcaagg ctagatctgc tacttccgaa 3660 aaacgaaact tcaagcaagg cggatatcat cgtgcgtcag ttcctggatg gagaccgagt 3720 acgtgtgaga gatttcctgt ccaaggataa gtggaagttt ggacggattg ccgagaagct 3780 cggaaaactg cgatatgctg tacgtctcga cgatggacga atctgggagc gtcacatcga 3840 ccacatagtc ggtgtgggcg ctgatctaca agacgtttcg gtggacactc cacaggaaga 3900 ggagttcatt gaacgatacg agccgaactt gactgcttct tctgcggtgg attcagcagc 3960 tgatgcggta gtgcctgcgc ttccacaagc agaaactacc aatgcgacac cggtgatttc 4020 ctgccctaat gctgctacgg cgccggcgct tgctcctact ccagagcgct ttgcgtctac 4080 cgaggctggt cgatcggagt ctactgtacc tctacgacga tccagtcgta ctgtgaaacc 4140 tccccaaaga ttgaatttat gattttgttg aagtattaac tgtattttct tttgacaaag 4200 gggagaag 4208 // ID Tx1-6_CQ repbase; DNA; INV; 4095 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4095 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 638-638 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 2..670 FT /product="Tx1-6_CQ_1p" FT /translation="YYDGLKNRCFHCKQEGHVKSSCPKLVTVSPGSGXPRS FT YSSVAARGTPVAPPVLEVPAFKPQMQLLNSKRNQSVPTPAPPQVTVPVTEA FT AISQREKVPADGAALPAVTPATSGSSVAGSSKTPAEILKVLQSIAKETEEP FT MDTSLDKVAPKRPADPPSDGEGGVSEEGGSEEGGTAEGGDEGLGGSGVDAD FT GFKKQGGKRSNKAKKSKKVIKTIETRADAKASK" FT CDS 761..4006 FT /product="Tx1-6_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MVFTRSICTININSISCIAKKLLLRDFIFNTDIDILL FT LQEVAFESFGFLHSHTAIVNFNNDSIGTAILVRKGIEFSNVLMGSCGRILS FT VSIDNFNIINIYAASGTNRKKERDILFSSTIIPHLSSVKNNIIAGDFNCIL FT LSSDSNSTVPNFCKGLESLVKSVCLFDVEKEKNKSVQFTFIRGISMSRLDR FT FYAPKDFLSKVLSVSTAAVPFSDHHAVLIKYQVPDGIPLKLFGRGFWKINP FT SLIDNSDIHNKLITVLNDLETREMFSRDLNKWWNQTVKVKIKQTYKSESFH FT LNQQIHREKNFYYTCLKEIIIEQSNGRHVYDEMIFIKSKLLQIEHDRLKNF FT SLKIPPTSLASEETLSLFQISSSIKKNYSDNLLHLRTTKGITSDYKELKPC FT IENYFTKIFANESSNNSNNNFPVLNNINKSLNLQDSNMLIRPISKDELNTI FT IKQSAKKKSPGPDGLTYEFYEHFFDVLSDHLLKLFNSYLRDGVCPEPSFSE FT GIIALVPKNGDQTDISNKRPISMLNCDYKIFSKILANRLHAVLPNLIGPGQ FT AACNPEKSCIYNLRLIRNICLRAEQTKRLKGLILSIDLEKAFDNVSHDFLW FT AVLKKYGFPNEFITCIQNLYKHASSKILFNGFLTNTILILNSVRQGCPLSM FT ALFALYIEPLIRLIYDNVQGCFIANTFIKVIAYADDLNIFVLNDHEFDTVL FT ELINYFSIFSKIKLNARKSHFMRLNNCRSGPHMLEEKMSLKILGIAFYQDF FT SRTVDSNYNSIIPKFKNVLAQHSKRKLNLVQKCIVLNSYALSKIWYIAQIF FT PPSNKHLADIRKSCGWFIWXNFFKVDRNQLYLPLEKGGLGLADPEVKAQSL FT FVKNIIFAFNDDKDSFMLAQTRNKSLTRNARAWLDLASELSHFSYLDSCKS FT FYNFLLQKRNTIPKVERENPEFSWEIIWENISKXXLSSYAKEALYMVYNNL FT VPNRSKMFRLKVRGVVDNLCEICNNVDSTEHRIKNCKNTRPVWEWVEEIIS FT KRLKLVVEDPEEIMQMSIVTSMKRKACLWLVAEVICFNLKNTNNATVKDFQ FT HHIRKIRWNFREVFKKHFGNLLNIC" XX SQ Sequence 4095 BP; 1404 A; 690 C; 772 G; 1223 T; 6 other; ttattacgat ggcttgaaaa atcggtgctt ccattgcaaa caagaagggc acgtgaagtc 60 gagctgtcca aaactggtaa cagtctcgcc gggaagcggc gktccacgtt cttacagttc 120 tgtggccgcc cgtgggactc ccgttgcgcc accggtactg gaggttccag catttaaacc 180 acagatgcaa cttctaaaca gcaaacgcaa tcagtctgta ccgacaccag cacctccgca 240 agtcaccgtt ccggttaccg aagcagcgat ctcgcagcga gaaaaggttc cagctgacgg 300 cgcagcacta ccagcagtaa caccagcaac aagtggcagt tctgtggcgg gttccagcaa 360 gaccccggcg gagattctga aggttctgca gagcatagct aaagaaacgg aggagcccat 420 ggacaccagc ctcgacaagg tggcaccaaa gaggccagca gatccgccat cggatggtga 480 agggggggtt tcggaagagg gggggtctga agagggggga actgcagagg ggggtgatga 540 gggtttgggg gggtctgggg tagacgctga cggattcaaa aaacaaggag gtaagcgatc 600 aaacaaagca aaaaaatcga aaaaagtgat caaaacgatc gaaacaagag cagatgctaa 660 ggccagcaag tgagcgtaac ataacctcac ttgcgttccc gtagaacacg tggtgctgac 720 agtcagttaa agttttttgt gcgtattggt ttgccggaag atggtcttta ctcgtagtat 780 ttgtacaatc aacatcaatt ctattagttg tattgcaaag aaactattac ttagagattt 840 tatttttaac acagacattg acatactttt gctacaagaa gtagcctttg aaagttttgg 900 attcttacat tctcacacag ctatagttaa ttttaataac gatagtattg gtacagctat 960 tcttgtgcgc aaaggtattg agtttagtaa cgtgttgatg ggatcgtgcg gacgtatact 1020 atctgtgtct atcgataact ttaacattat taatatttac gctgcttcag gtactaatcg 1080 caaaaaagaa agggatattc ttttttcttc aacgataata ccacatttat catctgtaaa 1140 aaataacatt attgctggag attttaactg cattttactt tcttctgatt caaacagtac 1200 agttccaaat ttttgcaaag gacttgaatc tttggttaaa tccgtttgtt tgttcgacgt 1260 ggaaaaggaa aaaaataaat cagttcaatt tacattcatt cgtggaatat ctatgtcaag 1320 actcgatcgc ttttatgctc caaaagactt tttaagcaag gttctgtctg tgtcaaccgc 1380 agcagttcct ttctctgatc accatgccgt tcttattaaa tatcaggttc ctgatggaat 1440 tccattgaaa ttatttggaa gaggattttg gaaaatcaat ccgagtttga ttgataattc 1500 agacattcat aataaattga taactgtttt aaatgactta gaaacccgtg aaatgttttc 1560 gcgagatttg aacaaatggt ggaatcaaac agtgaaagtc aaaattaagc aaacttataa 1620 atctgaaagt tttcatttga atcagcaaat ccatcgtgag aaaaattttt actatacgtg 1680 tttaaaagag attattatag agcaaagtaa tgggcgacac gtttatgatg aaatgatttt 1740 tattaaatcc aaacttttac aaattgagca tgatcgttta aaaaactttt cgttgaaaat 1800 acctccaaca tctctagctt ctgaagaaac tctctcgctt tttcaaatat catccagtat 1860 taaaaaaaat tactctgata acttactaca tcttcgtaca actaaaggta ttacttcaga 1920 ttacaaagaa ctaaaacctt gtattgaaaa ttattttaca aaaatttttg caaatgaatc 1980 ttccaataat tcgaataaca attttccagt attaaataat atcaataaat cattaaactt 2040 gcaagactcc aacatgttaa taagaccaat tagtaaagat gaattaaata ctattattaa 2100 acaatcagcc aaaaaaaaga gtcccggtcc tgatggactc acatacgagt tctacgaaca 2160 ttttttcgat gttttgagtg atcatttact taagctattt aatagttatt tgagggatgg 2220 agtatgcccg gaaccaagtt tttcagaagg aattatagcc ttagttccaa aaaatggaga 2280 ccaaactgac atttcaaata aaagaccaat tagtatgctt aactgtgatt ataaaatttt 2340 cagtaaaatt ctagcaaaca gactacacgc agttttacca aatttaattg gacctggcca 2400 ggcagcctgt aatccagaga aatcatgtat atataatctt cgtttaattc gcaatatctg 2460 tctccgtgct gagcaaacaa aaaggttaaa aggtcttata ttaagcattg atttagaaaa 2520 agcatttgat aatgttagcc atgactttct ttgggctgtt ctcaaaaagt acggttttcc 2580 aaatgaattc attacatgca tacaaaatct ttacaaacac gcttcatcga aaattctttt 2640 taatggattt ttaacaaata cgattttaat attaaattct gttcgacaag gttgcccttt 2700 gagcatggct ctattcgcgc tgtacattga accactgatt cgtttgatat atgacaatgt 2760 tcaaggttgt ttcatagcaa acactttcat aaaagtgatt gcttacgcag acgatctgaa 2820 catttttgtt ttaaacgatc acgagtttga tactgtgctg gagctcatca actattttag 2880 tattttttct aaaataaaat tgaatgccag gaaatctcat tttatgagat taaataactg 2940 tcgttccggc cctcacatgt tagaagaaaa aatgagctta aaaattctgg gaattgcttt 3000 ttatcaggac ttttctagaa cagtagatag taattataat tcaatcatac caaaatttaa 3060 aaatgtatta gcacaacatt caaagcgaaa attaaatcta gttcaaaaat gcatagtgtt 3120 aaactcctac gctttgtcca aaatctggta cattgcccag atatttccac cctcgaacaa 3180 acatttggca gacattagaa aaagttgtgg ttggtttata tggamcaact ttttcaaagt 3240 agatagaaat cagctctatc ttccattaga aaaaggtggg cttggattag cagatccaga 3300 agtaaaagcg caatcccttt ttgttaaaaa tataattttt gcatttaatg atgataaaga 3360 tagctttatg ttagcacaaa cccgtaataa atctcttact agaaacgcac gagcatggct 3420 ggatctcgcg tcagagttgt ctcattttag ttatttggat agctgtaaat cattttataa 3480 ctttctttta caaaaaagga atactattcc taaagttgaa cgtgagaatc cagaattttc 3540 atgggaaata atttgggaaa acattagcaa aaawwtttta agttcctatg caaaagaggc 3600 gttatacatg gtgtataata atttagtccc aaatagatca aaaatgtttc gtttaaaagt 3660 tagaggagtt gtagataacc tttgtgagat ttgtaataat gttgacagta cagaacacag 3720 gataaaaaat tgtaaaaata ccaggccagt ttgggaatgg gtggaagaga ttattagcaa 3780 gagattgaag ttagtagtag aagatccaga agaaatcatg caaatgagta tagtaacatc 3840 aatgaaaaga aaagcatgtt tatggctagt agcagaagta atttgtttca atcttaaaaa 3900 taccaataac gcaacagtaa aggactttca gcaccacatt agaaaaatca ggtggaactt 3960 tcgagaagtt ttcaagaaac attttgggaa tctgcttaac atttgctaag atcgctattt 4020 tttgttgtaa aatatctccg taaaataaag acaatcatgt gaaaaaaaaa aagggggggg 4080 awaaaaaaaa waaaa 4095 // ID CR1-69_AAe repbase; DNA; INV; 3988 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-69_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3988 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1157-1157 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >91% CC identity. XX FH Key Location/Qualifiers FT CDS 126..773 FT /product="CR1-69_AAe_1p" FT /translation="MCPACRKLMXGTRFRGAINSTNDLLQAVMSQQNQMLD FT DLRXEIKRNTERINEIVGKQQQQDLPNRSPWPAIQPRSSKRPRIQIDPPQI FT DEDAPRARESKFWLFLSRFSPHATVEEISNLVQRNLDMNEPVEVVKLVRRG FT VEINQLSFVSFKIGMGMKWKEKAMQPENWQXGIYFREFVGVERGPAIFRVX FT RQRNQSNDFFSTPLAPPSRAAVAMQQ" FT CDS 919..3909 FT /product="CR1-69_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPVNEKXAVNIYYQNMNSIRGSDRQKSLYLSSXELDY FT DACAFSETYLDSSVQDXLLFNSCFSVYRCDRNRNNSEKHSGGGVLIAVHRR FT LACREFARAENNEAICVRIDAMNGNIFLCCGYIPPTSNVHRYQSFAQFVES FT IIETVGPHDVVLVFGDFNLPNLKWNQRFQEGNNCFLLPDRITTQSEMAIVD FT GFLSAGLMQVCDLPNQNNNYLDLIFTTDTESCQAVVTDPIIKHEVHHYAMQ FT LTIATDTLRELEAGMPKTHLNLELMNVASVKEAIRDVSWSNVFISQSEYER FT NSHTHDQQAASMLEFLNSLDVLNCELEFSAIVSNIIDFNVLSFYFVLFSIF FT SRFAPFSRSRSIRESRYPEWFSPILIQVLKDKRKALKKKRRNPSPENIASY FT KELLATFKVLHREAYQVYISKIQNGIKANPKSFWKFVNGRRKNAGVPQVMK FT YGNRSSSNGQDTAQLFAEYFKSVYRDDIPNDYNPPINAHGDVMELSQDEVN FT QGLADLDSSKTAGPDKLPAKILKEFKDELCEPLSILFNSSLSSGYFPHLWK FT ISHITPVHKKGSRSDTENYRGIAIQSATPKLFERLVYCHLYERVSERVSAA FT QHGFLKGKSVVTNLCEFSSMVTDFVSKGYQVDCVYTDMSKAFDVVGIDSIM FT RAVTNFGISGMLFEWIRTYLQGRIQFVRVLNNTSESFSVHSGVPQGSHLGP FT LLFVMVMNDLPSFMTKAIVLIYADDVKIFLPVKIIDDCLLLQHDLRNFERF FT VDVNRLSINPSKCSVITYARRVQPIIFDYSLGHTILRRVETVRDLGVTMDQ FT CLTFNDHINTVVKESLQLFALTRRFGRDFDDPHAILTIYTNLVRTKLDFAS FT VVWRPQYDIHVQRLEAIQRKFVKFALRNLGWTREQMPDYDELCRLVDIDSI FT SNRHQIADIVFFCNILSGRIRSTLLYDRLCFNTSPVSLRRRRVFDPPLRSR FT NYTRHEPTARFMNDFNRLQDVISINMTSDEIRSRLRIFLRDY" XX SQ Sequence 3988 BP; 1176 A; 847 C; 882 G; 1072 T; 11 other; caattaccga tgaaagtgat gtagtgacct gtcaagggtt ttgcaaggct acgttccacc 60 tgaaatgctc acacctcagt gcgagtgcat gggaggaggt cagatcaaac tcgctgatct 120 attggatgtg tccagcttgc agaaagttga tgwgwggcac gcgttttcgt ggcgctatta 180 actctacaaa tgatctacta caagctgtga tgagtcaaca aaatcagatg ttggatgatt 240 tgcggkgtga gataaagagg aataccgaaa gaatcaatga aatagtaggg aaacagcaac 300 agcaggatct tcctaatcgc tcaccttggc ctgccatcca acccaggtca tcaaaacgcc 360 cacgcatcca aatcgatccg ccacagatcg atgaagatgc acctcgggca agagagtcca 420 agttttggtt gttcttgtcg aggttttccc cacacgccac tgtggaggaa atctctaatc 480 tggttcaacg gaatctggat atgaatgagc ctgtsgaggt agtgaagttg gtccgtaggg 540 gggttgagat caatcagctc tccttcgtat cgtttaaaat tggcatggga atgaagtgga 600 aagaaaaagc aatgcagcca gaaaactggc aamagggaat ttatttccga gaatttgtkg 660 gtgttgaacg aggacctgct atttttcgag ttgakcgtca gcgaaatcaa tcaaacgact 720 ttttctcaac accacttgca ccccccagcc gagcggcggt cgcaatgcag caataataac 780 gatgatgacg gactctctga aacaagatgc gaaacgaatt ttacacacac gagatgttat 840 aatgctgaag gactacctac cctcacggat gaaacaggaa cacaaacgta ccacacgtcg 900 caatcatcga atttagagat gcctgtaaat gaaaaascag ccgtcaacat ttactatcaa 960 aatatgaaca gcattagagg gagcgatcgg cagaaaagcc tgtatttgtc atcgttsgaa 1020 ttggactacg atgcgtgtgc cttcagtgaa acttacctgg actcttctgt gcaagatakt 1080 ttgctgttca attcatgttt ctccgtctac cgatgcgata ggaacagaaa caatagtgag 1140 aaacattccg gtggaggagt gctgatagca gttcatcgtc gtctcgcttg ccgtgagttt 1200 gcacgcgctg aaaataatga agcgatatgc gtwagaatag acgcaatgaa tggaaatatt 1260 ttcctgtgct gtggatatat tccacctacc tcaaatgttc accgttatca atcgttcgcc 1320 caattcgttg agtccatcat tgagaccgtt ggcccgcacg acgtcgtgct tgtatttgga 1380 gatttcaacc taccgaattt gaagtggaat cagcgttttc aagaaggaaa taattgtttt 1440 ctcttacccg atcggattac tacgcaaagt gaaatggcca ttgttgatgg atttctgtcg 1500 gctggactta tgcaagtgtg tgatttgccg aaccaaaaca ataactattt ggatctcata 1560 ttcaccactg acaccgagtc ctgtcaagct gttgttaccg acccgataat caagcacgaa 1620 gttcaccact acgcgatgca actgacgata gctacagata ctctacgcga attggaagcc 1680 ggtatgccta aaactcattt gaatttagag ttgatgaatg tcgcgtctgt taaagaagcg 1740 ataagagatg tttcatggag caatgttttc atttcacaat cagagtatga acgtaattcg 1800 cacacgcatg atcagcaagc cgcgtcgatg ctcgaattcc ttaattcact cgacgttttg 1860 aattgcgaac ttgaattttc cgcaatagtt agcaacataa tagacttcaa cgttttgtcg 1920 ttttacttcg tgttgttcag catcttttca cggtttgctc cattctccag gagtcgatcg 1980 attcgggaaa gccgatatcc tgaatggttc tctccgatcc tcattcaagt ccttaaagat 2040 aagcggaagg cacttaagaa aaagcgtcgt aatccatccc cggagaacat tgcctcgtat 2100 aaagaacttc tggcaacatt caaggtgctg cacagagaag cgtatcaagt ttacatctct 2160 aagattcaaa atgggatcaa agctaatcct aaatcattct ggaaattcgt aaatggtagg 2220 cgtaagaacg caggagttcc acaggtcatg aagtatggta acagatcctc ctcaaatgga 2280 caagacacag ctcaactttt cgccgaatac ttcaagagtg tttatcgcga cgatattcca 2340 aacgactata atcctccaat caacgcgcat ggtgatgtaa tggaattatc tcaagatgaa 2400 gtgaatcaag gtttggcaga tctagattca agcaaaacag caggtccgga caaattgcca 2460 gcaaagattc tcaaagaatt caaggacgaa ctttgtgaac cactatccat tctgttcaac 2520 tcatcgctct catccggcta ttttccacat ctctggaaaa tctctcacat aactccggtt 2580 cataaaaagg gctcgcgttc agacactgag aactaccggg gaatagcaat tcagtccgcg 2640 acaccaaaat tgttcgaacg attagtgtac tgccatctct acgaacgtgt cagtgaacgt 2700 gtttctgctg cacagcatgg ctttctcaaa ggtaagtcag ttgtgacgaa cctgtgtgag 2760 ttcagttcca tggtcaccga ctttgtctcc aagggatacc aagttgactg tgtatataca 2820 gatatgagca aggcttttga cgtcgtggga atagacagta ttatgcgcgc tgtgaccaac 2880 ttcgggatca gcggcatgct gttcgaatgg ataaggacat acttacaagg cagaatacag 2940 ttcgttagag tgctcaataa tacgtctgaa tcgtttagtg tgcattcggg agttccacaa 3000 ggtagccatc ttggtccact gcttttcgtt atggttatga atgatctacc ctcgttcatg 3060 acaaaggcaa ttgtgctgat ctatgctgac gatgtaaaaa tattcctgcc agtgaagatc 3120 atcgatgatt gtcttctgct tcaacatgac ctacgcaact ttgaaagatt cgtcgacgtg 3180 aataggctga gcattaatcc aagcaaatgt tctgtaataa catacgcaag aagagttcaa 3240 ccaattatat ttgactacag cttgggacac acgatcttgc ggcgcgtaga aactgttcga 3300 gatctaggtg tgacgatgga tcagtgctta acgtttaacg accacatcaa cacagttgtt 3360 aaggaatcac ttcaactatt cgcgctgacc cgacgttttg gccgagattt cgatgatcct 3420 catgctattt taacaatcta caccaaccta gttaggacta agcttgattt tgcgagcgtt 3480 gtgtggcgtc cccaatatga tatacatgtg caaagacttg aggcaatcca gagaaagttc 3540 gtgaaatttg cgctgagaaa cctaggatgg actagagagc aaatgcctga ttacgatgaa 3600 ctttgccgcc ttgtagatat cgacagcata agcaacagac atcagatcgc agacattgta 3660 tttttctgca atattttgag tggacgcata aggagtacac ttttgtacga tcggctttgt 3720 tttaacacca gtccagtttc acttcgccgc agaagggtgt ttgatccccc attgaggtcg 3780 agaaactaca cccggcatga gcctacagct agatttatga atgacttcaa cagactacag 3840 gatgtaattt ctattaatat gacaagtgat gaaattcgta gtagattaag gatattttta 3900 agagactact aagaaatttg tttgttaatt ttaagtaagg gtaacgggct tacagcctat 3960 agtccaaata caaatacaaa tacaaata 3988 // ID Gypsy-263_AA-LTR repbase; DNA; INV; 194 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-263_AA_; KW Gypsy-263_AA-I; Gypsy-263_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-194 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 194 BP; 62 A; 36 C; 43 G; 51 T; 2 other; tgtaatgatg taggtgcccg tccatcagaa acagagtagc acccagctac ggtagtgtga 60 tgaggactcc ctcggtcata agagaagaac aatatatcag caacggcgtg cagactgcta 120 gcttgtaatt cagtgaataa taaakgattc aagttgtaat amgtattgat taattcttat 180 atcgagttcc caca 194 // ID hAT-1N1_BF repbase; DNA; INV; 1200 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-1N1_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-1N1_BF; hAT-1_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1200 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1200 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 918-918 (2008). XX DR [2] (Consensus) XX CC This non-autonomous transposon shares the same 104-bp 5' terminus CC with the autonomous hAT-1_BF. XX SQ Sequence 1200 BP; 347 A; 258 C; 265 G; 330 T; 0 other; cagggcttta gccagcgtcc gtttttccgt caattgacgg aaatttgctg cgtgtgacgg 60 aaaaaaattt caaccaatcc gtcaaccttg acggaaaaaa aatcgttgat tgaagcaagg 120 aggccaagtt gaaaagaggg aagctacggc ttggggacgc ccggacagcc cattgtcgga 180 aagccacaca ctgtttgttt tgctattgtc atgattgttg acgattgtgt gtcggcaccc 240 tttcgcatac gataatctca ttaaaaaccc gtttttcaag gtctcactcc aagaacagtg 300 cattctacga actttgtttg gaatcttgac actcatcaaa acctaacttg tacgctagcg 360 tgccggggct ggcggccgta caaattattt tcgctcgccc agccgcgccg cttgcgacaa 420 cgcaaggtgc cagtcgcagc taaaagaacc ggcacatcat gctatccaca aatagtttga 480 tcaaaaacat aatctttagg atctaaaatg ttgcacggtt ctcttcagcg ggaattttgt 540 ttttgttttg acgagaaaac gctaattttg cagcgacaca gatgagatac ccacgccacg 600 ttgttgttgg cacacttccg ggttaattga acgccaaaat tcccccaaat gcaccataaa 660 tccgtatatg aatgcatgtt ttctcaaaag agtctttgag taattgtgtt cttaattaga 720 aaattgtcgt gtaatagaaa aatatagcgt tctcctgcct ttacaaatgt aacattttca 780 aacaaataaa atatttgaac gctggtgcgc gcggtgcatc gtgggtcggt atccatagct 840 ttaccaatca gagcgcggga tctgaggaga cgcccctcca gatgttacga tcaaccaatc 900 aacagtgcgt attgagaata tgtaatgtta gatttgcata ccccgcgagt tgattttagc 960 gccgtgtttt tgcggggaaa acgcaaagaa acgatgtttt actgttcaca ttattacaaa 1020 acgtcagcgt ggtgtcatat tttctcccaa tgacaacatt tatatcggac gaggtagaca 1080 tttttgcagt caatgacgga aatttttacg cgatgacgga aaaaaaaaga ttcttgacag 1140 gaattttccg tcaatgggaa ttggctgacg gaaaaaaatc tcgagctggc taaagccctg 1200 // ID MAR_MJ repbase; DNA; INV; 432 BP. XX AC AJ251416; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Meloidogyne javanica partial mariner-like element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MAR_MJ; KW partial mariner-like element. XX OS Meloidogyne javanica OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne; OC Meloidogyne incognita group. XX RN [1] RA Leroy H., Leroy F., Auge-Gouillou C., Castagnone-Sereno P., RA Vanlerberghe-Masutti F., Bigot Y. and Abad P.; RT "Identification of mariner-like elements from the root-knot RT nematode Meloidogyne spp."; RL Mol. Biochem. Parasitol 107(2), 181-190 (2000). XX DR Genbank; AJ251416; Positions 1 432. XX SQ Sequence 432 BP; 124 A; 106 C; 90 G; 112 T; 0 other; tgggtgccgc acgagttgac aatacaacag aaaaatcaac gtttactcgc tgcacaacaa 60 ctgctccaac ataatcagaa agaaaatttt ttgagctttt gacttgtgat gaaaagtggg 120 gtttcgtaca aaaatcctgt aaaaaaacag tggctcaccc ccggccaacc atcggtttcg 180 acccctaaac ctgactggcg ccagaggcgt gtccttcttt ctgtttggtg atggcgtggt 240 ggtatgatcc attgggaatt agttccaaat ggctaaacaa ctaatgccaa atattattgt 300 acccaactgg atcgagttaa acaaaaaatt cggtcccctg gtctcgctgg acattttcgc 360 gcgggggtta tctttcagca ggataatcct agaaactcta attcatccac catacacgcc 420 cgatctagcc cc 432 // ID L1-2_HM repbase; DNA; INV; 5768 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5768 RA Bao W. and Jurka J.; RT "L1-like retrotransposon from Hydra magnipapillata."; RL Repbase Reports 8(12), 2071-2071 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 100..1272 FT /product="L1-2_HM_1p" FT /translation="MASYAKAASVVSSVGERIVEFKVFMDMIHANNQHYVN FT DRAKEIIEKINEKVGRNKIESLTYHRDGRWIAVFNSIDDAIAMTRNEVAIT FT NCKSPVFFRRREQYGFLITIKCDPTIQDDELTEKLLPFIESVISIQHTTYS FT FDHTVKDGRRLFRVKLNCMVKELPHHIEINGAKILLNFAGKEFLCRLCGNT FT HVPRERCVSSVLNSNDKKLNNMEKNSVAEKNQKIDSEKKKDTFQFSVPASL FT TSNIFLRGEVNDCIYKKNNIPTFNFLDKQSLELPNEVKNKIKNTQKEYQSA FT FDEARKLEKQKQEKHHPVETLVIDEDRFTEVKSKTKKKKXKAKKKEDGEIS FT SGSDFDADFDVSMNENINERKRIHPQTPIKEKPTTKERRNEKPKWQT*" FT CDS join(1514..1618,1612..3561,3464..5356) FT /product="L1-2_HM_2p" FT /translation="MQQLKFYTSNVNGLSNGSKRKTVINNLKNRPHDVFCF FT LIQETHLDNDTINLLKKDWDGDVFFSPGAVYTRGTAILLKQKCEKPIFLGN FT DSKGNFNYVALKINDIKLLIINVYAPSGGGNSQKLRITFFNELINEFSNKD FT FNDFQIVMGGDFNVTLSEKDRYPAMFRQECPSIKNLKTLLNLLEIEDVWRV FT FNPLQTEYTFAATNGVSHSRLDRFYISKKXRQNIKTFFEPTILTDHYNAQV FT VVLNINKNNIXSETWILNNKHLENEEFKTLIEASWNEWAHQKNNYENIKDW FT WEAGKEXIRXISKKFSKKLKQKQNKKEFKLLKLLKNASQKMHVNNNMKKLY FT NDVKXKIEXIEKEKAEGAAIRARIQWRLEGEKCSKFFFGLEKQKLAKQYVT FT EISGSDGEVKTEQKEILAEFEKFYKNLYKNKVNDKTSQEFLFSNAKVQQVT FT EVENKLLEKEISEFEIKQALGQMKNNKSPGSDGLTIEFYKKFFEIIKKDLT FT DALNNVLFTGEMTNSQKEAIITCIFKKGNKNDIGNWRPISLTNVDYKILTK FT CLANRLKNILPNIIHSNQTACVKNRTINYNLSYTRDIISIAKTSDLDACIL FT SIDQVKAFDRVDRNFLLKSLKHFGFGRGFIFFIETIYKKIFAKIKINGCLS FT EKINQYFERSKTRLSFVNDIIYHSSRNLFNFCKTMRINILRGVRQGCPLSM FT ILYIIQAEIFSTFVRQCDKIKGIIVEGKETKIQQYADDTNFYLTGENSILE FT IGKALKINKKATGAKINIAKCQGIWLGKNIFEDKQKFLNFNWDENSFKSLG FT VVFSNNGRFSYETHWKERFDKMESSLRKWQKWKLSLKGKKTIINSLIIPQI FT LHIAFVVPLPNKNYLDKMEKKINNFFWGSRVCRLNKTKTQQKILKGGAGVN FT NLTNKLKAIQLQWISKXHSENISGPWKDAMINILNSYRDANQNIHVFKTNP FT TQIKQLPPFYAQLITNWHELKEKHKLPVTEIEEILNEPIFYNKMIQKSPKK FT FLIPDKSHIENDIVLLADIAKTFQPGFLNHKSVNXSEKNLSEIIKALPPEW FT RRKILTQSQXYDFKKAPQYIAKKKSKYELKNLTVLTLYKLLNKPIKKLEYK FT YNNWKSMFDSMDNLNDNDWKHFFVNISKRNTNNKASEIRFKAAHFILPTND FT KMFEIKVRKDQLCPQCKTKVETTAHMIYECEKVQPLIFYLLDILDQLYPLE FT YPHINXLKFILFGYEKRAKQRVFGNILLDVLLTEIFYNRMKSHFDNKVFTN FT KYLLKKLXAKIEKVLKNELLIIFKQNKQEIHKEEIKNIFCNGSLRLVADIC FT NYF*" XX SQ Sequence 5768 BP; 2380 A; 803 C; 879 G; 1677 T; 29 other; acggaaaaaa acacatctaa acttctttta agtatttgtg tacatactga aactttttag 60 tcgcgtgttt ttttttctta tctctgaaaa aatttcaaaa tggcgtctta cgctaaagca 120 gcttcagtag tcagttcggt tggtgaacga atcgtcgagt ttaaggtgtt tatggatatg 180 attcatgcga acaatcaaca ctatgtgaat gatcgtgcca aggaaattat tgaaaaaata 240 aatgaaaaag ttggtcggaa caaaatagaa agtttaactt accatcgcga tgggcgatgg 300 atagctgtkt tcaactctat tgatgatgca atagcgatga ctcgaaacga agtcgcaata 360 acaaattgta agtctcctgt tttttttaga agaagagagc aatatggttt tcttatcact 420 attaaatgtg atccaaccat ccaagatgac gaactaacag araaactttt accttttatt 480 gaaagtgtta tttccattca acacacgacg tacagtttcg accacactgt aaaagatggt 540 cgacgtttat ttcgtgttaa actgaattgt atggtaaagg aacttccgca tcatatcgaa 600 ataaatgggg cgaaaatcct tttraacttt gctgggaaag aatttttatg tagactatgt 660 ggaaacactc acgtaccaag ggaacgatgt gtttcctcag tattaaattc aaatgataaa 720 aaactcaata atatggaaaa aaactccgtt gccgaaaaaa atcaaaaaat cgactcggaa 780 aagaaaaaag atacatttca atttagtgtt ccagcctccc tgacttcaaa catctttctc 840 agaggtgagg tcaatgattg tatatataaa aaaaataata tacctacttt taatttttta 900 gacaaacaga gcctagaatt accaaacgag gttaaaaaca aaataaaaaa cacacaaaag 960 gaatatcagt cagcttttga cgaagcccga aaactcgaaa aacaaaagca agaaaaacat 1020 cacccagtgg aaacactggt tattgatgaa gatcgtttta ctgaagttaa atcaaaaaca 1080 aagaagaaaa aarataaagc gaagaagaaa gaagatggcg aaatatcaag tggaagcgat 1140 ttcgatgcag atttcgatgt ttcgatgaac gagaacataa acgagcgcaa aagaatccat 1200 cctcaaaccc cgataaaaga aaaacctact acgaaagaaa gacgcaatga gaaacccaaa 1260 tggcagacat aaataaaaga ctttttaaga gggacttttg gtgttggtat gaattagttt 1320 tttatatttt ttattctttt gcttatttat gtctaatttt atttatatgt crttgggttt 1380 cttcctcaaa agctttttcc tgatctagag atagatgcaa aatttaggtg gaaaaagtta 1440 aaaattgcga tatttttttt taaaaaaaat aaaaaaaatc aaaaaaaaaa aaawaaaaaa 1500 aaaaaaaaaa aaaatgcagc aattaaaatt ttacacctca aacgttaacg gtttgagtaa 1560 cggaagcaar cgtaaaacag ttatcaataa tttaaaaaat agaccacatg atgttttttg 1620 atacaggaaa cccatttaga caatgataca attaatttat tgaaaaaaga ttgggatggt 1680 gacgtttttt tctcacctgg agcagtttat acamgaggaa ctgcaatttt attaaaacaa 1740 aaatgtgaaa aaccaatttt cttaggtaac gactcaaaag gaaactttaa ctatgtggct 1800 ttaaaaatta atgatataaa attattgata attaacgttt acgcaccttc cgggggcgga 1860 aattctcaaa aattaagaat tacttttttt aatgaattaa ttaatgaatt cagtaataaa 1920 gattttaatg attttcaaat tgttatgggt ggtgatttta atgtaacatt atcagaaaaa 1980 gatagatatc ctgccatgtt tagacaagaa tgtccgtcaa taaaaaattt aaaaacttta 2040 ttaaacctct tagaaataga ggatgtttgg agagttttta atccccttca aactgaatac 2100 acttttgctg ccactaacgg tgtgtctcat tcgagactag accgcttcta tattagcaaa 2160 aaakctaggc aaaatattaa aacttttttt gagccaacta ttttaactga tcattataac 2220 gctcaagtag tcgttttaaa tataaacaaa aataatatta ratcagaaac atggatttta 2280 aacaataaac atttagaaaa tgaagaattt aaaactctta tagaagcgag ctggaacgaa 2340 tgggcacacc aaaaaaataa ttatgaaaat attaaagact ggtgggaagc cggcaaagaa 2400 aamataagas tcatttcgaa aaaatttagc aaaaaactaa aacaaaaaca aaacaaaaaa 2460 gaatttaaac ttttaaaatt attgaaaaac gcttcgcaaa aaatgcatgt aaataataat 2520 atgaaaaaat tatataatga tgtaaaaatk aaaattgaac awatagaaaa agaaaaagca 2580 gaaggrgcgg cgatwagagc gagaattcaa tggagattag aaggcgaaaa atgttcaaaa 2640 ttttttttcg gactagaaaa acaaaaactt gcaaaacagt atgtcactga aataagtggg 2700 agtgacgggg aagtcaaaac agaacaaaaa gaaattctag ctgaatttga aaaattttat 2760 aaaaatttgt ataaaaataa agtaaacgac aaaacctctc aagagttttt gttttcaaat 2820 gctaaagttc agcaagtgac tgaggttgaa aataaattac tagaaaaaga aatatctgaa 2880 tttgaaataa aacaagcttt aggacaaatg aaaaayaaca aatctcccgg ctcagacggg 2940 ctcactatag aattctataa aaaatttttt gaaataatta agaaagattt aactgatgct 3000 ttaaataatg ttttgtttac aggagaaatg acaaactcac aaaaagaagc cataataaca 3060 tgcattttta aaaaaggcaa caaaaatgat ataggaaatt ggagaccaat atcacttaca 3120 aatgtagatt acaaaattct aacaaaatgt ttagcaaata gactaaaaaa tatattgcca 3180 aacataatac acagcaacca aacagcatgt gtcaaaaata gaactataaa ctataatctt 3240 agttacacca gagacattat aagcattgct aaaacaagcg atttagacgc ttgcatactt 3300 tctatagatc aggtaaaagc ttttgacagg gtagatagaa attttttgct taaaagtttg 3360 aaacatttcg gatttggaag aggttttata ttttttatcg aaacaatcta taagaaaatt 3420 tttgcaaaaa ttaaaattaa cggatgttta tccgaaaaaa taaatcaata ttttgagagg 3480 agtaagacaa ggttgtcctt tgtcaatgat attatatatc attcaagcag aaatcttttc 3540 aacttttgta agacaatgcg ataaaatcaa agggattatt gttgaaggaa aagaaaccaa 3600 aatacagcaa tacgctgatg atacyaattt ttatctaaca ggagaaaact cgattctaga 3660 aataggtaaa gctttgaaaa taaataaaaa agccaccgga gcnaaaataa acatagccaa 3720 atgccaaggc atatggctag gtaaaaacat tttygaagat aaacaaaaat ttttaaattt 3780 taattgggat gaaaactctt tcaaaagttt aggagtagtt ttctckaata acggaagatt 3840 ttcgtatgaa acccactgga aagagcgttt cgataaaatg gaaagctctc tccgaaagtg 3900 gcaaaaatgg aaactttctt taaaaggaaa aaaaacaata ataaactctt tgataatccc 3960 gcaaatatta cacatagcct ttgtagtccc cctacccaac aaaaattatt tagacaaaat 4020 ggaaaaaaag ataaacaatt ttttttgggg tagtagagtc tgccgactca acaaaacaaa 4080 aacgcaacaa aaaatattaa aaggtggagc tggtgtaaac aacctaacaa ataaacttaa 4140 agctatccag ttgcaatgga ttagtaaart acacagcgaa aatataagcg ggccatggaa 4200 ggacgcaatg ataaacattt taaactccta tagagacgcc aaccaaaata ttcacgtttt 4260 taaaactaac cctacccaaa ttaagcaact cccacccttc tatgctcagc taataactaa 4320 ttggcatgaa ttaaaagaaa aacataaatt accagtaaca gaaattgaag aaattttaaa 4380 cgaacctatt ttttataaca aaatgattca aaagtcgcca aaaaagtttt tgattccaga 4440 taagagtcac atagaaaacg atattgtgct tttagctgac atagcaaaaa cttttcaacc 4500 aggcttttta aatcataaaa gcgtgaatat kagcgaaaaa aacttaagcg aaattatcaa 4560 agcccttccg ccagaatgga ggcgaaaaat tttaacccaa tctcaaaawt acgattttaa 4620 aaaagcccca caatatatag ctaaaaagaa atcaaaatat gaattaaaaa acctaactgt 4680 tttaacatta tacaaattat taaacaaacc catcaaaaaa cttgaatata aatataataa 4740 ttggaagtcc atgtttgata gcatggacaa cytaaacgat aatgattgga aacatttttt 4800 tgtcaacatt tctaaaagaa acactaataa taaagcgagc gaaattcgat tcaaagcagc 4860 gcattttata ttaccaacaa acgataagat gtttgaaata aaagtaagaa aagatcagct 4920 atgccctcaa tgcaaaacaa aagttgaaac gactgctcac atgatctatg aatgtgaaaa 4980 agtgcaacct ttaatttttt atcttttaga cattctagat caactctacc ctttagaata 5040 ccctcatatt aacartttga aatttatttt attcggctat gaaaaaagag ccaagcaaag 5100 agtttttggr aatattttat tggatgtttt rctaacggaa attttttata atagaatgaa 5160 gagccacttt gataataagg tttttacaaa caaatatctt ttaaaaaaat tgaamgcaaa 5220 aattgaaaaa gttttaaaaa acgagttact aataatattt aaacaaaata aacaagaaat 5280 acacaaggag gaaataaaaa atattttttg caacggaagc ttgagattgg ttgctgatat 5340 atgcaactat ttttgaataa cattttagac ttttttagaa actttttata taaaatactt 5400 ttaatttttt tttaactaat tgtatatatt agatgaatgt aattctttca tatatgaaat 5460 gataacagag aattaatgat tagctaaata tttgataaaa actaaagaaa taactttttt 5520 cagataagaa aaacacgtga ctaaacagtt tcaatttttc cttttcaaaa caatttttct 5580 ttttattctc tttcgatttt attttttgtg ttttttatat ttggtttttt atggtgattt 5640 tattgtaatg ggtctaagac atctttgtaa caggggggta tatatttacg aaatcttgta 5700 acttttattt ttttttacgg tgtcagtatg gcaattagcc ggatgacaat aaactacaat 5760 acataaaa 5768 // ID BEL-594_AA-LTR repbase; DNA; INV; 516 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-594_AA_; KW Pao_Bel_Ele204; BEL-594_AA-I; BEL-594_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-516 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 516 BP; 195 A; 97 C; 86 G; 138 T; 0 other; tgttggcaac acggatcagc gcgttcagca ccgccctcat agagatacca cagggaagca 60 gaccgagtga catacgaatg atgttgctct gatagggagc ttaggtagga cagctgtcat 120 taaaaagtaa gaagtcatgc ttgcatgata gctggtagag tgcatcaatt tgttactact 180 taaactactt acgaaaatct ataatttgtt attctaaatt aatcctaact ttgttaaatt 240 acatgaattg tgagtatgaa atgaattcct taaactaaat atattctaaa cacaatttat 300 aatacattta gcctaaacat aaattgcaac aaaccctact gttacaaacg acggacaaac 360 taaagttgag gacaaatcaa tgtaagttaa cctataaaaa gaaatgtaca acatatgaac 420 taaactaatg atacaattac agcttaaagc atactcaacc taaaactacg agtttgctct 480 gaagacgtcc gaaactggtc cgctcacgta acaaca 516 // ID Gypsy-9_DWil-I repbase; DNA; INV; 4258 BP. XX AC scaffold_180702; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_DWil_; KW Gypsy-9_DWil-LTR; Gypsy-9_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4258 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180702; Positions 1663566 1667823. XX CC Positions [3240-3710] - Integrase core CC 'AATAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..4226 FT /product="Gypsy-9_DWil-I_1p" FT /translation="MPIESAEFQELLSQQQQFTTNILEQQRQWMREILGNN FT SSSSASNSSHAGELAMCPPFHKFDKDNQNWESYLEQLSQHFNAYSVSADEK FT QKAYFLSWCGVSLYELLKNLFGSSNLNTHTYKELTDKLTVHFTTKRHIVAA FT RYEFFKKEMAHGQTHKEWVADLRGIARECQFVCSAQGCLSNYVDEMIRDQI FT IVHTPYDAVRTAALQKHQPTLEDVLMVAETYETTTKTVATLKDKRESQVPL FT NAIHTFKKYQNKPNKEKQCKNSWKSCSGCGTSHTRERCKYRDAVCYKCGRK FT GHISVVCMSKQDKSNSQTQKREEANLMEEVNTITGQVKNENRKKLIDVKIL FT NQVITFQMDSGSTASVINFETYLLLNKPKLQECSRQFFAYTHIPIPILGEL FT HTVAKCGEKVENIVIIVTSKRDEANLFGTDLFKKFGFDIVQICKIYEQENI FT RINELCCKYKSVFEPAMGTIKDFKASVFLKSTATPKFCKSRQIPFAQLEHF FT KAEANRLTEANIWKPIKFSNWASPIVLAPKPGGAIRICGDFKQAVNAQIDI FT EQYPLPTKESLFHIIRHGKQFSKIDLKDAYLQMELDEATKQFMVVNTPLGL FT FQYQRLPYGIASAPAIFQRYLEQLLKGIEGCGNYLDDIIISAPTSEEHLER FT IEQVLNILQENGIKCKKEKCFFFKDEIEYLGRRISAAGILPDTSGLEAVRL FT LKPPSNLKQLEAFMGKVNYYCNFIPNFSQLAAPLNQLRRKNVKFYFGPNQL FT QAFSALKSHIINATQLAHFDENLPIVLAADASSFGLGVALSHIQHDGQERP FT IVFASKTLDKHQEKYSQIEKEGLSIIFGVKRFHQYLYGRKFILLTDHKPLV FT SIFNPGKHLPLMTSNRLQRWGIILMAYNFDIQYRATSAHGNADALSRLPIG FT TDIEFDKEEEACNVIREINPPVNAETILKHFNNDKILKQVLHYVSVGWPEK FT LESGNEDLLPYFNRKFALTVNDHLLCLQSDANRMVIPTSLRPKVLKLLHEG FT HWGIVRMKQIARKHVWWPAIDEDIKKLAQSCNICKSNNPALPRQYQSWPTA FT TTAWERIHIDFAGPIFDAMWLICVDAYSQFPFTVQMTSVTTANTISALSSI FT FAIEGYPKTMVSDNGPQLTSEAFKEFCILHGIKHITTAPFHPASNGLAERF FT VQSFKISVKKNIQEGLPVRTAVTKYLASYRFTPNAQGKSPAELIHGRSVRT FT VLSQLFEKPVETKQELTKYSTNQKVFSRNFSRGEKWIEGTIDRPIGRMLYV FT VRTSNGFIKRHFNQIKPRMSDNTSEHNPTKYWFVPEIPQPAPCSTPELQED FT QPETHHDLPNTIQETVPDSGENAATSNETPNRIQSSRRRSGIPVRQSTRAR FT QEVNRFKPHDFRKSKN" XX SQ Sequence 4258 BP; 1452 A; 896 C; 826 G; 1084 T; 0 other; ttaatggcga cgaggcaaaa aacaaattaa aacgtttagc tacaataatt gtgtaaaatc 60 tagttggtca gtcattgatt ctcatcaaga tgccaataga aagcgcggaa tttcaagaat 120 tgctttccca gcaacagcaa ttcacaacga acattctaga acaacaacga caatggatga 180 gagaaattct tggaaataat tcttcatcta gtgctagtaa ttcaagccac gcgggtgaat 240 tagctatgtg tcctccattt cacaaatttg acaaagacaa ccaaaattgg gagtcttatc 300 tagaacagtt atcgcaacat tttaatgcat attcggtcag tgcagacgaa aaacaaaaag 360 catattttct ttcttggtgt ggcgtcagtt tgtacgagct tcttaaaaat ctcttcggga 420 gtagcaactt aaacacacat acatataaag agctcacaga caaacttact gtacatttca 480 caactaagcg tcatattgta gcagcgcgct acgaattttt taagaaagag atggcacatg 540 gtcaaacaca caaagagtgg gtagccgatt tgcgcgggat cgctcgtgaa tgtcaatttg 600 tgtgttctgc acaaggttgt ttgtccaact acgtagatga aatgatccgg gatcagatca 660 tcgtacatac cccgtatgat gcagtacgga cagctgctct tcaaaagcac cagccgactc 720 ttgaagatgt gctcatggtt gcggagacct atgagacaac gacaaaaacc gttgcaaccc 780 ttaaagacaa gagagagagt caagttccac tcaacgcgat ccatacattc aaaaagtatc 840 agaataaacc gaacaaagag aaacagtgca aaaactcgtg gaaatcgtgt tctgggtgtg 900 gaacgtctca cactagagag agatgcaagt atcgtgacgc cgtatgttac aaatgtggta 960 gaaaaggcca tatatcagtc gtgtgtatgt caaagcaaga caaaagtaat tcgcagacac 1020 aaaaaagaga agaagcaaat ttaatggaag aagtaaacac tataacgggt caagtcaaaa 1080 acgaaaatcg taaaaagcta attgatgtca aaatattgaa tcaagttatt acttttcaaa 1140 tggactcagg gtcaacagct tcagtgataa attttgaaac atatttatta ctgaataaac 1200 caaaattgca agaatgttcc agacaatttt tcgcatatac acacattcca ataccaatac 1260 taggcgaatt gcacactgtc gcaaagtgtg gagaaaaagt cgaaaatatt gtaatcattg 1320 ttactagtaa aagagacgaa gccaatcttt tcggaacaga tttatttaaa aaatttggat 1380 tcgacattgt ccaaatatgt aagatttatg agcaagaaaa tattcgaatt aacgagcttt 1440 gctgcaaata taagtcagtg ttcgagcctg ccatgggtac cataaaagat ttcaaagcca 1500 gcgtattttt aaaatctaca gcaacaccca aattttgcaa aagccgtcaa attccatttg 1560 ctcaattgga acatttcaaa gccgaagcaa atcgtctcac ggaagccaat atctggaaac 1620 cgatcaaatt tagcaactgg gcatccccca ttgtgttagc acctaagcca ggtggagcca 1680 ttcggatttg tggtgacttt aagcaagcag ttaacgcaca aatcgacata gagcagtatc 1740 cattaccaac aaaagagtcc ctattccaca taattcgtca tggtaagcaa ttcagcaaaa 1800 tagacctaaa agacgcctat cttcagatgg aactagatga ggcaacaaag caattcatgg 1860 tcgtcaacac gccattgggc ctatttcaat atcagcgcct tccgtatgga attgccagcg 1920 caccggcaat cttccagaga tatcttgagc aacttctaaa gggaatagaa ggttgtggca 1980 actacctaga tgacatcatc atctcagctc caacaagcga ggagcatctg gagcgaatcg 2040 agcaagtttt aaacatactt caagaaaacg gaatcaaatg caaaaaggaa aagtgcttct 2100 ttttcaaaga cgaaatcgaa tacttgggaa gacggattag tgctgctgga attttgccag 2160 acacatccgg attagaagct gtacgtctat taaaaccgcc ctcaaattta aagcaattgg 2220 aagcatttat gggtaaagta aattactatt gcaactttat accaaacttt tcacagttgg 2280 cagcaccttt aaaccaactt cgtagaaaaa atgtaaaatt ttattttggt ccgaatcaac 2340 tacaagcctt ttcagctctg aagtctcaca ttattaacgc aacgcaactg gcgcattttg 2400 atgaaaattt accgatcgtg ctcgcagcag acgcatcatc atttggtctc ggagtagccc 2460 tttctcatat tcaacacgat ggtcaagaga gacccattgt ttttgcttcg aagacactcg 2520 acaaacatca agagaaatac agtcaaatag aaaaagaggg attgtcaatc attttcggag 2580 ttaaacgatt tcaccaatat ctgtatggga ggaagttcat actgttaacc gatcacaagc 2640 cgcttgtatc gatcttcaat ccagggaaac atcttccatt gatgacttca aacagactac 2700 aacgttgggg tatcattcta atggcgtata attttgatat ccagtatcgg gcgacatctg 2760 cacatggcaa tgccgatgct ctttcccgac taccaattgg aactgatata gaattcgata 2820 aagaggagga agcttgcaac gtaataaggg aaatcaatcc accggttaat gcagaaacca 2880 ttctcaagca tttcaataac gataaaatcc tcaaacaagt cctgcactat gtttctgtag 2940 gatggccaga aaagttagaa agtggcaacg aagatttatt gccatacttt aatcgtaagt 3000 ttgctttaac agttaatgat cacttactat gtttacagtc agatgcaaat cgtatggtga 3060 ttccaaccag tctacgtcca aaagtcctta aactattaca tgagggacat tggggcattg 3120 ttcgcatgaa gcaaattgca aggaaacatg tttggtggcc tgccattgat gaagacataa 3180 aaaagctagc tcagtcatgc aacatctgca agagcaacaa tccagcgcta ccacgacaat 3240 accaaagttg gcctacagca accacagctt gggagcgaat ccacatcgac ttcgctggcc 3300 caatattcga tgccatgtgg ttaatttgtg tagacgcata ctcgcagttc ccgtttactg 3360 ttcaaatgac atcggtaaca accgccaaca caatctcagc tttgtcttca atttttgcaa 3420 tcgaaggata tcctaagact atggtcagcg ataatggacc gcagcttacg tctgaagcat 3480 tcaaagagtt ttgcatactc catggaatca agcatataac aacagcacca tttcacccag 3540 cctctaatgg attggctgag cgatttgtac aatcttttaa aatttctgtg aagaaaaata 3600 ttcaggaagg tttgcctgta cgaacagcag taacgaaata tcttgcttct tacagattca 3660 ccccaaatgc acaaggtaaa tctcctgctg agttaattca cggtcgttca gttcgcaccg 3720 tattaagtca acttttcgaa aagcctgtcg aaactaagca agagctgaca aaatacagca 3780 caaatcaaaa ggtattttcc agaaatttct cgaggggaga aaagtggatc gaaggcacta 3840 tcgatcgacc aattggacga atgctgtacg tcgtacgtac atcaaatggc tttatcaagc 3900 gccatttcaa ccaaatcaaa ccacgcatgt cagacaacac gtccgagcac aacccgacta 3960 agtattggtt cgttcctgaa ataccacaac cagctccttg ttcaacacca gaattacaag 4020 aagaccagcc agaaacgcac cacgatcttc cgaataccat tcaagaaact gttcctgatt 4080 ctggtgaaaa tgcagcaacg tcgaatgaaa ctccaaatcg aatccaatct tcgcgacgac 4140 gttctggtat acctgttcga caaagtacaa gagctcgtca ggaagtcaat cggttcaagc 4200 cacacgattt ccgaaaatct aagaattaat tgccgtttat aatttaaagg gggagatg 4258 // ID hAT-62_HM repbase; DNA; INV; 1943 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-62_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-1943 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2050-2050 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(115..192,182..556,560..751,702..971,980..1627) FT /product="hAT-62_HM_1p" FT /translation="MKNKMKRIKISTNGKRERGIILKQRFNRGFKIFPFET FT LILNSEKRNIDIKLCRGQAYDGAYVMFGIKSGLQTRIKALSPNAIYVHCCA FT HVLNLVTIEAMSVNSDVQMFLGTAEKLYTYLTSSLPRLHILQEHQKRLYES FT TVDTLKRLSDTRASRKHAVDADVESFSAILATLEDINLGESKEHTGNVRAE FT AIGLSVLIKKTFICFSNVIFAKVVNKKHSFVFLMLFLPKLLINLFVLSNYL FT QREDIDVLFAKQLIDVTKKKFVDMRNDEAFEVLHLAVKSFISEKCAELDVE FT TEFKENSSFFKFSHHAKKKRMTGELWRDERVDDPTTRLKCETYFNVLDTII FT TQINKRFHDFSNTVTHFDCLDPSKLSEENINSFKNLSCQIYCNDVNTEEAV FT VEYETFNDVYASISPSLTTDIQLKDVLQFLIEKQIAPGLPNLSILYKIYLT FT LPVTSANAERSFSKLKITKNYFRSTMTNERLSGSALISIERELAENIDFDL FT SINRFASMKSRRICVNYLKSEFFF*" XX SQ Sequence 1943 BP; 702 A; 325 C; 323 G; 593 T; 0 other; cagtggcgga tttagtatag ggcgtggggg gggggggagg agccgccccc tagaacgatc 60 aaacggcgcc ccctaaaaca ttcaaaaaaa aaaaattata aattcaaatt accaatgaaa 120 aacaaaatga aaagaataaa aataagtaca aatggaaaga gagagagagg aattatttta 180 aaacagaggt tttaaaatat ttcctttcga aacattgata ttaaactctg aaaagagaaa 240 cattgatatt aaactctgca gaggacaagc atatgatggt gcttatgtaa tgtttggaat 300 taaaagtggt cttcaaactc gaataaaagc gctttcacca aatgccatct atgttcattg 360 ttgtgctcat gttttaaatt tggtgaccat cgaagcaatg tctgtaaaca gcgatgtcca 420 gatgttttta ggcacagctg aaaaattgta tacgtacctt acgtctagtt taccacgact 480 tcatattctc caagagcacc agaaacgtct atatgaatca acagtagaca ctctgaaacg 540 actttctgat acaagatgag caagtaggaa gcatgcagtt gatgctgatg ttgaatcttt 600 ttctgctatt ctagccactc tagaagatat aaatctagga gaatctaaag aacatacagg 660 aaatgttaga gcagaagcta ttggactatc agttttaata aaaaaaacat tcatttgttt 720 ttctaatgtt atttttgcca aagttgttaa ttaatctttt tgtgttatct aactatttgc 780 aacgagaaga tatagatgtt ttatttgcaa aacagttgat tgatgttaca aaaaaaaagt 840 ttgtagatat gagaaacgat gaagcttttg aagttctaca tttagcagtc aaatcattta 900 ttagtgaaaa atgtgccgaa ctagatgtag aaactgaatt taaagaaaac tcgtctttct 960 ttaaattcag ttagcctaac atcatgccaa aaagaaacgc atgacaggag agctgtggag 1020 agatgaaaga gttgatgatc caactacacg cctaaaatgt gaaacctact tcaatgtttt 1080 ggatacaatt ataacccaaa taaataaacg atttcacgac tttagtaaca cagttacaca 1140 ttttgattgc cttgatccgt caaaattatc tgaagaaaat ataaattcat ttaaaaactt 1200 atcttgtcaa atttattgta acgacgtaaa tactgaagaa gcagtcgtag agtatgagac 1260 tttcaatgat gtgtatgcat cgatatcccc atcactgaca actgacatac aattgaaaga 1320 tgtacttcag ttcttgattg agaaacagat agcacccggt ctccctaact tgtctatact 1380 ttacaaaatc tacttgactc ttcctgttac atctgctaat gctgaaagaa gtttcagtaa 1440 actaaagatt acaaaaaatt attttagatc aacaatgaca aacgaacgtc tttctggatc 1500 agcattaatt tctattgaac gtgagcttgc tgaaaacatt gactttgact tgtcgataaa 1560 tcgttttgct tcaatgaaat ctcgcagaat ttgtgtaaac tacctgaaaa gtgaattttt 1620 tttctaaatg ctaacttttg taatatgtct agttaaatac atttttaaat gaccgaacta 1680 tttcagtaat attaaatgtt tcttatttta taaaagataa ttgttaacta aaataaacaa 1740 aggaaactca aaaaaaatta ggttatttag tgtcttatat gtatgtaata tgtataataa 1800 aagtcacata aaaacactta taaagtgttt tagtcatgaa aataatttct ggggtttaaa 1860 ataccacacc caaccccggt cgccccctat tcattgactt gccccccccc ccccccgcga 1920 cactaaacct caatccgcca ctg 1943 // ID DNA7-2_CQ repbase; DNA; INV; 418 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA7-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-418 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 77-77 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >91% CC identity. 7-bp TSDs. 22-bp TIRs. XX SQ Sequence 418 BP; 123 A; 93 C; 97 G; 104 T; 1 other; cactgacgaa ctgacgtgcc accccataca tatttttgag aaaattctga ccgcaaattc 60 kccagcgaac accaactcca tctaccttca ggttcgctga actaattctc tctcccacac 120 gtgttttcga tattgcgact atgcttcaat ttgtaatctg cgtcgtttca aaaaagtatg 180 cgctcagaac gattttggag cgcgctgttc gtttcggcga aagcgtgtgt cagtttatgc 240 tgacttcaac aatacattgc atgtaaacat tggcgcgaag cgagagaaga gagctagtga 300 agagagggaa gagagaaaaa tcggttgaca gattgttcgc tagaagcaaa aacgccaaac 360 aagaagaaga caactgcggt caaaagtggt ttgctcgtgg cacgtcagtt cgtcagtg 418 // ID Gypsy-35_NVi-I repbase; DNA; INV; 5070 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Gypsy LTR-retrotransposon from Nasonia vitripennis, interanl DE region, consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5070 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1388-1388 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1680..4679 FT /product="Gypsy-35_NVi-I_2p" FT /translation="NWVNHVANLDVFVGNIIDDCILGSDFLDATGITDLIT FT KIILNTPSSIINKQISRIVDSSKVVPGSLQEIFDKNSENLDQDQRREFADL FT LHEFEEIFQDDDVTGKCNLVEHRIQLLHNQPIKQPIRRPPIHQIKELDECI FT AEMHSQGVIEESDSSYCSPVVPKKKKDGKIRFCINFQKINAITIKDSYPLP FT RVDDTLDKLAGYSWFCTLDLKSGYWQIKIREEDRKITAFSIGKGLWQFKVM FT PFGLCNAPATFQRLMEKLLRDYLSKFCLVYLDDVMVLGKSFNEMISNLKKV FT FVRFQEAGLRLNKKKCVFFSRQVEYLGYIVSEKGKSADPKKIQSVVNWPIP FT HNKKAVLSFVNFCGYYRRLIKGFADIAKPLYKLTEKSTKFLWSEDCQKAFS FT KLKEVLTKPPILAFPSLSSEFILDTDASDHGIGSVLSQKQDGIERVIGYYS FT RVLSKPERNYCVTRRELLAIIESLKHFHHYLIGKPFIIRTDHSSLTWLLSF FT KELDGQLARWLGKLDQYDYKIIHRKGNLHGNADGLSRRPCEDFSCKYCSRV FT EEKFNEETICRIIFEENTSEDWKKSQAEDEAISFICQTKELQKKPAWQEIS FT SMDEFTKIYCSQWDSLVLINGILYREWVSTNLDHKIRQLVVPIDKIKDILE FT EAHDSPSGGHFGVNKTLDKIRKRFYWATCKRDVENWCKSCEVCIAKKGPPD FT KGHGEMKIYNSGVPFERLQMDILGPFPLSSLGNKYLLVVTDCFSKWVEAFP FT LSNMRTKTIAEIFVDQIVCRYGVPLEVNTDQGTNFDSKLIKNLSELLGIKK FT TRTSPLHPQSNGQVERQHQTLLHYLTKYIHDNQKDWDRWVSLFLLAYRSSK FT HATTGATPAEMYTGFDLRLPLDLIRGCLPEEVEIKGDGDYVHRLRRKLSEI FT HKMARDRINLQSEKTKSWYDKRALNVEYSQGQKVWFFNPRRTKGKAPKLQS FT NWEGPWEVVRKINDVVYCIRRSPRHKSKIVNINRLGTYTERDTH*" FT CDS join(314..1342,1346..1702) FT /product="Gypsy-35_NVi-I_1p" FT /translation="RDLEVRLSQRSVKEVNLSEELRRRDQDFLSEKRLREN FT YEEKIKHLVEDIERLQRSNEALLAERSEWTRKQQDLERRLTELCRSNISNT FT NINTNSAIATGTFDRLRSNNVHPLDNTQSTQSISLGYNLKPDTYDGTTPLR FT EFLNQFQLIARANNWDSARIMVALAASLRGKARSVLDNYKDTATIYFDKLI FT SSLELRFGDAHFAPTYYFQFQNRRQQRGEDFPTLAADLMRLAGLIYPDCPF FT KTQDKIACAQFVIAIFDNSIRKILQLERINSLQIALARAMEVKVIEDQDKR FT LQHLAQRNNYTSSSFPKSVSSSSQPSGRQNFQDFVKEIECWNCGKKGHFKV FT LSDFVNTSGKLGYVKLSGASLTELDKTPTNSHSYRIDCHVGRINRNSNKFY FT FDGLINDKACIFKVDSGSDVTIINPKFVESKDMRIPINFTTLKYPTGEKVP FT VEFLISASLKLGESCS*" XX SQ Sequence 5070 BP; 1668 A; 850 C; 1030 G; 1519 T; 3 other; ttttgggggc tcgtccggga tactagtgtt tttgagtgat tattttatgc atcttcgatc 60 tggcaagatg cttccaagaa caccacacaa gccggatgca ccagcacaat ccatcgatga 120 ttcacgagaa gaagtggatt tattgaatct tgaacagacg aatctggagg acacaaatgt 180 tttcagacac acacaatctt cacaaacccc aatcctcaat acacacactg atgttccaaa 240 tccaacactg gtggctgcta ttaaggaaat aaacaacttg aagatacaac tagaaaacga 300 acatgctcaa tgaagagatt tggaagtccg tttatctcaa cgttctgtga aagaagtaaa 360 cctttccgaa gagcttcgac gacgggatca ggattttctg tccgagaaac ggcttcgcga 420 gaactatgag gagaaaataa agcacctggt tgaagacatc gagagactgc aacggtcaaa 480 cgaggcatta ctcgcagaac ggtcagagtg gacaaggaaa caacaggatc tcgaacgtcg 540 attaactgag ctatgtcgtt caaatatttc taacacaaac ataaacacaa attctgctat 600 agctactggc acattcgata ggttaaggtc taacaatgta cacccactgg acaatacaca 660 atcaacacaa agtattagtt tgggttataa tctcaaaccg gatacwtatg atggtacaac 720 accactgcgt gagtttttga atcaatttca attaattgct agagcaaata attgggatag 780 tgctaggata atggttgctt tagctgcttc tttacgggga aaagctagat ctgttcttga 840 taactataaa gatactgcta caatatattt tgataagtta atatcaagtt tagaattacg 900 ttttggtgat gcacattttg caccaackta ttatttccaa tttcagaaca ggcgtcaaca 960 acggggtgaa gattttccaa cattagctgc tgatcttatg agattagctg gtttgattta 1020 tcctgattgt ccttttaaaa ctcaagataa aatcgcctgt gctcaatttg taattgctat 1080 tttcgacaat tcaattcgga aaattcttca attagaaaga attaattctc ttcaaattgc 1140 tctggcaaga gctatggaag ttaaagttat cgaagatcag gacaagcgac ttcaacatct 1200 agctcagaga aataattata cctcttcatc atttccaaag agtgttagtt cgtcatctca 1260 gccgtcaggc agacagaatt ttcaagattt tgtaaaagaa atagagtgtt ggaattgtgg 1320 taaaaaagga catttcaagg tctaattgtc cgactttgtc aacacatcag ggaaactagg 1380 ttatgttaaa ctttctggag cgagtttaac tgaacttgac aagactccaa caaactccca 1440 ctcttatcgg attgattgcc atgttggaag aattaatagg aactctaata aattttattt 1500 tgatggttta attaatgaca aagcttgtat ttttaaagta gattctggtt ctgatgtgac 1560 tattataaat ccaaaatttg ttgagtcaaa agatatgaga attcctatta attttacaac 1620 acttaagtat ccaactggag aaaaggtacc agttgaattt ttgatttcag cgtcgttaaa 1680 attgggtgaa tcatgtagct aatttagatg tttttgtcgg aaatattatt gatgattgta 1740 tacttggaag tgattttctt gatgcaactg gtattacaga tttaattact aaaattattt 1800 taaatactcc aagttcaatc attaataaac aaatttcaag aatagtcgat tctagtaagg 1860 ttgtaccagg ctcattacaa gaaatttttg acaagaactc agagaacttg gatcaagatc 1920 aacgtcggga gtttgcggat cttctacacg aatttgaaga aatttttcaa gatgatgacg 1980 ttactggaaa atgtaatttg gttgaacaca gaattcaatt actccataat caaccaatta 2040 aacaacctat tcgaagaccc cctattcatc aaattaagga attagatgaa tgtattgctg 2100 agatgcattc tcaaggagtc attgaagaat cagatagttc gtattgctca cctgtagttc 2160 ccaagaagaa gaaggatgga aaaattcgtt tttgtattaa cttccaaaaa attaatgcaa 2220 ttacgattaa agattcgtac cctttaccaa gagttgatga tactcttgat aaacttgcag 2280 gatattcgtg gttttgtaca ttggacttga agagtggata ctggcaaatc aagattcggg 2340 aagaggatag gaaaatcacg gctttctcaa ttggaaaagg tttatggcaa tttaaggtta 2400 tgccattcgg attatgcaat gcaccggcga cttttcaacg tttgatggaa aagttactca 2460 gagattatct ctcaaaattc tgtttagttt atttagatga tgtcatggta ttaggaaagt 2520 catttaatga gatgatatcc aatttaaaaa aggtatttgt tcgttttcaa gaagcaggtt 2580 tacgtctaaa taagaagaaa tgtgttttct tcagcaggca agtcgaatat ttaggataca 2640 ttgtttctga aaaaggaaaa tcagctgatc caaagaaaat tcaatctgtt gtaaattggc 2700 caattcctca taacaaaaag gctgtgttga gttttgtgaa tttctgtggt tattataggc 2760 gtctcattaa aggatttgct gatatagcaa aacctttgta taaattaact gaaaaatcga 2820 ctaagttctt atggtcagaa gattgtcaaa aagccttttc aaaattaaaa gaggttctta 2880 caaaacctcc aattttggca tttccttctc tttcttctga atttatttta gatactgatg 2940 cttctgatca tggaattggg tcggttcttt ctcagaagca agatggaatc gaacgtgtca 3000 taggttatta tagtagagtc ctttctaaac cggagagaaa ttattgcgtc actcgacgtg 3060 aactcctcgc tattatagaa tcgttaaaac attttcatca ttaccttatc ggtaaacctt 3120 tcatcattag gactgatcat tcctctttga cttggttatt atcgtttaaa gaactagatg 3180 gacaattagc tcgttggtta gggaaattag atcagtatga ctacaaaatt attcacagga 3240 aaggtaattt acatggaaat gctgatggac tttcaagacg cccttgtgaa gatttttcat 3300 gtaaatattg ttccagagta gaagaaaaat tcaatgaaga gactatttgc cgtattattt 3360 ttgaagaaaa tacatcggaa gattggaaga agagtcaagc agaagatgag gctatttctt 3420 ttatttgtca aacgaaggaa ttacagaaaa aaccagcatg gcaagaaatt tcttcgatgg 3480 atgaattcac taaaatctat tgttcacaat gggattcgtt agtgttaatt aatggaattt 3540 tgtatagaga atgggtttca acaaatttag atcataaaat tcgtcaattg gttgttccta 3600 tagataaaat caaagatatt ttagaagaag cacatgattc accatctgga ggacatttcg 3660 gagtaaacaa aactttagat aaaattagaa aaagatttta ttgggctact tgcaaaagag 3720 atgtagaaaa ttggtgtaaa tcctgtgaag tttgtattgc aaagaaaggt cctccagata 3780 aaggccatgg tgagatgaag atttacaatt caggagttcc attcgaacga cttcaaatgg 3840 atattttagg tccctttcca ctttcttctt taggcaataa atacctctta gttgtgacag 3900 actgtttctc gaagtgggtt gaagcttttc ctctctctaa tatgagaaca aaaaccattg 3960 ctgagatttt cgttgatcag attgtctgta ggtatggggt acctttagaa gtaaatactg 4020 atcaaggrac aaatttcgat tcgaaattga ttaaaaattt gtctgagttg ttaggaatta 4080 aaaagacaag aacatcacca ttacacccac aatcaaacgg acaagtagaa agacaacacc 4140 aaacactact tcactatctc acaaaataca tacacgacaa tcagaaagac tgggatcgct 4200 gggtttcact atttctgtta gcttacaggt cctccaaaca tgcgacgacc ggagctactc 4260 cagctgagat gtacaccggt tttgatctga ggcttccttt ggacttaatt cgagggtgtc 4320 taccagagga ggtcgagatt aaaggagacg gagactacgt acacaggctt cggaggaaac 4380 tcagcgagat ccataaaatg gcacgagacc ggattaactt acaatcggaa aagacgaagt 4440 cttggtacga caaacgagct ctcaacgttg aatattccca aggacaaaaa gtttggttct 4500 tcaaccctcg aagaactaaa ggaaaagcac caaaactaca gtcgaattgg gaaggacctt 4560 gggaggttgt gagaaaaatc aatgacgtcg tctattgtat tcgtcgttca ccaagacata 4620 aatccaagat agtcaacatc aatcgtcttg ggacttacac tgagagagat acccattaat 4680 gaagactgct ctgctcacgg ttcttttgag ctgtggtttt tcccgtgggc gtgaaggtaa 4740 ataccggagc agggacttga ggtgaggttc cacggcatgt caacttcccc cctcttgtct 4800 gaagaccgat ttcgaccgac cctatgatga ggaaataaga aggatgattg aagtagattg 4860 ttcaagagac gggtccgtct tagtcctgtt aagtgatgat tggaatggcg tctgtaagca 4920 ttgctcggta acttgatgcc taggatgggt caaggtagag gatgttggtg tcagttcctg 4980 taagtcatca ttggattgta agaggatcaa cccgcttgaa ggatggatgg ttatgattgt 5040 cgggacgaca atctgaaaag gagagggcaa 5070 // ID hAT-27_SM repbase; DNA; INV; 3314 BP. XX AC . XX DT 13-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-27_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3314 RA Bao W. and Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 76-76 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 602..3097 FT /product="hAT-27_SM_1p" FT /translation="MKRNYASGAFKRKKKAEREEEIKKIPKLDCFFTKDSA FT TESEFVAQEDRHQSDQIEVSSSTSNQPIFTGSSNQVFNDFENNLELVDNIN FT NENLVESLIESSPNANRENDVGLWGELSSEDTLYWIEKGPESCQHSTENFH FT SSKQLYNDNTVRYCSKTLFFDEKTNGEKYTREWLVYSPKIGNVFCFVCKLL FT TASNFNLATNGLRDWKNAGSSIKSHQNSSEHRNALVTYLTRKSNYSVSDQL FT QKEIQQERIYWRKVLERVVAVICTIVERNLPFRGSNEIFGMEGSGNFIGLL FT ELIAKFDPFLAEHIRKFGNPGSGKTSYLSKTIFEELLDLMAKTVLKSISDD FT IKQSKYFGMSVDSTPDISHKDQLCMIIRYVDQINFKPIERFLNFIEIENHT FT GEYLADISLEFFEKDIGLNFQDCRSQSYDNAMNMAGKYKGMRAKVLEKNDK FT AIFLPCSAHSLNLVGNSGADCCIESINFFGLIQEVYNFFSSSTERWNKLVE FT FSNRTVKSLSKTRWSARSESVKVIHENYENVMEALNAIIEDANFYGNTRNE FT ANNLLNKMEEFEFALLVIFWDQVLERMNAVSKNLQSPKVTLDVCSSLYASL FT ASYITNLKNSFDEIKIEAKKLLPNTDYTVKRKRFKKKFPDEDQTTTEPEIS FT AKENFRNNVFMKILENIENNLIQRSDCYLEISKVFGFLTNIELSQEDLKQH FT VNNVVEKYPDDIDDSLFFELLQFHEYIRNDWPNDHPNHLSFYEIIKEKNLE FT IAFPNLETILRIFLCLMIANCTGERSFSKLKLIKNFLRSTMSQKILNNLAL FT LSCNIDKLKAIDFDSLINQYALEKFRKKVL" XX SQ Sequence 3314 BP; 1227 A; 454 C; 571 G; 1062 T; 0 other; caggcccgga tttacctata ggcagactag gcactgccta ggggccccgc agttttgggg 60 gcccaaaatc gaaatcatct aaggggccac aaaaaaaaat atttttttta atggattttt 120 tatatactaa aatataaaaa aattaaaata cagtataaaa tacatgtatt atataaagaa 180 ttaaaaaaac attgtacgta tgtttttatt taaggaaatt ttctaaaata ctaacaattt 240 ggggccccaa taattttaaa agttgttata gttgaaaaaa tattttttct ttatttccgc 300 aacctccaag ggggttatca ttagtggagt ttgtcggttg aaatttgcga aggcgggaat 360 cgaattgtta ttgtgtagaa acatattttt ttggagtgtt ggtaaaaatt ttatacacaa 420 aagaaaaaaa ttatttcaaa cgaaatcggc ttaaaaataa aaaatttaaa ataaagcaat 480 tttaggacga agtgaggctt ttttttaatt gattaattgt taagatctag gtatagtata 540 aaaatattaa aattatttaa tcaatttaat taaaaaatac tataattaaa tttaagatag 600 aatgaaacga aactacgcaa gcggagcgtt taaaaggaag aaaaaagcag agagagaaga 660 agaaattaag aaaattccta agctagattg ttttttcact aaagattctg caaccgaaag 720 tgaatttgta gcacaagaag atcggcacca aagcgaccaa atagaggtct cttccagtac 780 ttcaaatcaa cctattttca ctggaagttc aaatcaagtt tttaatgatt tcgaaaataa 840 tttggagctg gtagataaca ttaataatga aaatttagtg gaatccctga ttgaatcatc 900 tccaaatgca aatcgtgaga atgacgtagg attatgggga gaattgtctt ctgaagacac 960 tttgtactgg attgagaagg gccctgaaag ttgccaacat tcaacagaaa actttcattc 1020 ttccaaacag ctctataatg acaatactgt tcgatattgc tctaagactt tattctttga 1080 tgaaaaaaca aatggagaaa agtacactcg cgaatggctt gtatattcgc ctaaaattgg 1140 aaatgttttt tgttttgttt gcaaattact cactgcttca aatttcaatt tagcaaccaa 1200 tggcctgcga gattggaaaa atgctggttc ctcaatcaaa agccatcaaa actcttcaga 1260 acatcgtaat gctttagtta catatttaac tagaaaaagt aactattctg tgtcggatca 1320 attgcaaaaa gaaatacaac aagaaaggat ttactggaga aaagtcttag aacgagttgt 1380 agctgttatt tgcaccattg ttgaacgaaa tttaccattt cgaggatcaa atgaaatttt 1440 tggaatggaa ggaagtggca atttcattgg actcttggaa cttattgcaa aatttgatcc 1500 atttttagca gagcacatta gaaagtttgg taatccagga tctggaaaaa cttcttatct 1560 ctcaaaaaca atttttgagg aattgctcga tctgatggca aaaacagtat tgaagtcgat 1620 ttctgatgac atcaaacaat caaaatattt tggaatgtct gtggattcca cccctgatat 1680 atcacacaaa gatcaattgt gcatgattat cagatacgtt gatcaaatca attttaaacc 1740 aatcgagcga ttcttaaact ttattgagat agagaatcac actggagaat atttagctga 1800 catttcattg gaattttttg aaaaggatat tgggttaaat tttcaagatt gccgatcaca 1860 atcgtatgac aacgctatga atatggccgg aaaatacaaa ggaatgaggg caaaagttct 1920 tgaaaaaaat gataaagcaa tttttttacc gtgcagtgct cattctttga acttggttgg 1980 aaattcaggt gcggactgtt gcatagaatc aatcaatttt ttcggattaa ttcaggaagt 2040 ttacaatttt ttttcatcgt ctactgagag atggaacaag ttagttgaat tttcaaatag 2100 aacggtaaaa tcgttatcaa agactagatg gagtgctcgt tctgagtcag ttaaggttat 2160 tcacgaaaac tatgaaaatg taatggaagc actaaatgcc atcattgaag atgcaaattt 2220 ttatggaaac actagaaatg aagcaaataa tttattgaat aaaatggagg agtttgaatt 2280 cgcacttctc gttatatttt gggatcaggt tctagagaga atgaatgcag tttcaaaaaa 2340 tttgcaaagt cctaaagtca cattagacgt ttgttcttcg ttatatgcat ctctagcttc 2400 atatattaca aatttgaaaa atagctttga tgaaattaaa attgaagcta aaaaattgtt 2460 accaaacact gactacactg tcaagagaaa gagatttaaa aagaaatttc cagatgaaga 2520 tcaaactacc acagaaccag aaattagtgc taaagagaac tttcgaaaca acgtcttcat 2580 gaaaatttta gaaaacatcg aaaataattt gattcagcgc tctgattgtt acttggaaat 2640 ttctaaagtg tttggttttt tgacaaatat tgaattgtcg caagaagatt tgaaacaaca 2700 tgtgaataac gttgttgaaa aatatccaga tgatatagac gattcacttt ttttcgagtt 2760 actacaattt cacgaatata tcaggaacga ttggccaaat gatcatccaa atcatttgag 2820 tttctacgag ataattaaag aaaaaaattt ggaaattgct tttccaaatt tggaaacaat 2880 tttaagaatt tttttgtgtc taatgattgc taattgcact ggcgaaagaa gtttctcaaa 2940 attgaagctg attaaaaatt ttttgaggtc aacgatgagt caaaaaattt tgaataattt 3000 ggccttatta tcttgtaaca ttgataaatt gaaagctatt gattttgatt cattgattaa 3060 tcaatatgct ctagaaaaat ttagaaagaa agttttgtaa aattatattt ttgacatgac 3120 atatgtatct tatttttaag cggaattatt taataaatat atttcaaaat ataaataggt 3180 agtaaataat tatttcttaa atacttttag caaataaaat aatctacata atttccttga 3240 aatatactaa ttaattttga gggccccaaa aagctccttt gcctagggcc aacaaatgct 3300 taaatccggc cctg 3314 // ID Gypsy-23_CQ-LTR repbase; DNA; INV; 121 BP. XX AC AAWU01011157; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_CQ_; KW Gypsy-23_CQ-I; Gypsy-23_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-121 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 426-426 (2011). XX DR GenBank; AAWU01011157; Positions 16923 16803. XX SQ Sequence 121 BP; 41 A; 24 C; 19 G; 37 T; 0 other; tgtaatatac tagcttagct aagatttaag tagtccatga cttagcaagc tatgactaag 60 ctattgaggt cccaacattg taactttagg atcattctca ataaagcctt caacctgaac 120 a 121 // ID MuDR-5_TV repbase; DNA; INV; 2531 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE MuDR DNA transposon from Trichomonas vaginalis. XX KW MuDR; DNA transposon; Transposable Element; MuDR-5_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-2531 RA Kapitonov V.V. and Jurka J.; RT "MuDR DNA transposons from protozoans."; RL Repbase Reports 8(12), 1815-1815 (2008). XX DR [1] (Consensus) XX CC The MuDR-5_TV consensus sequence was derived from multiple CC alignment of 19 copies <1% divergent from it. MuDR-5_TV copies CC are usually flanked by 10-bp TSDs (several copies are flanked by CC 9-bp TSDs). MuDR-5_TV contains imperfect 38-bp TIRs (6 CC mismatches) and codes for a 480-aa MuDR transposase. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 877..2316 FT /product="MuDR-5_TVp" FT /note="MuDR transposase." FT /translation="MQIQDHELTYIELLKYGVQQEIQAGCSYSYKTRKERN FT QIQFNCKWPNCPCRFLVDTLPDDKYHIRKCINTHNHSAPFPNKSQQITSYF FT YRSYLTRYFESNQDRVYAQLRCFKDLNIPFDSGIIPEMFQQSTDALKKFSK FT RLHAINTRFSADDVTSLAIFVDMVKSHSPDDLIEFIHEPDRMIFYYAPFEA FT NAFSHQIQTYHIDSTYKLLRSRIPFYAVTGKFAESIVFPFLYFFVWPDTSE FT NIQVCLTAYFGSINREPSYFSMDCAPQITNAVETAIPLCQIIWCGVHVLRA FT VMRKAEKFQDRSNFETFYNLMKLLVFGSEEEEIDPDEVYNNLEEILNEEPA FT AREYFDRQWRHHLDRWMLRYRNEGDGTNNISESHFKVLKHQYFPERRNLRL FT DELVIELYSSVVPSFLIKLQIKGLDSERSVKVIKKVTRDFEEMTQFKKVEC FT ISMLEAVQKGLKNDTLDPNIVYSTLKILLQRKHLI" XX SQ Sequence 2531 BP; 884 A; 432 C; 396 G; 817 T; 2 other; gagaaaggga cagactagcc cattgtcact ttttaattta gcaaatgaac gcttatggag 60 tggagttttt gggtaaatct gatttgctta taaatgaaat aaaatcaaaa gaaaccatat 120 tggcattata ttatgcaaaa ataaatratc ttacttagta cttttaaagt taatttcaga 180 aaaaatttga cacaatggta tccggcacat acaagtggtg tacaaccaaa atataagttg 240 gtttaaacat atattgctat atctcatgat cttacttagt acttttaaag ttaatttcag 300 aaaaaatttg acacaatggt atccggcaca tacaagtggt gtacaaccaa aatataagtt 360 ggtttaaaca tatattgcta tatctcatga tcttacttag tacttttaaa gttaatttca 420 gaaaaaattt gacacaatgg tatccggcac atacaagtgg tgtacaacca aaatataagt 480 tggtttaaac atatattgct atatctcatg atcttactta gtacttttaa agttaatttc 540 agaaaaaatt tgacacaatg gtatccggca catacaagtg gtgtacaacc aaaatataag 600 ttggtttaaa catatattgc tatatctcat gatcttactt agtactttta aagttaattt 660 cagaaaaaat ttgacacaat ggtatccggc acatacaagt ggtgtacaac caaaatttca 720 tatccctagt tttacattta tgtttagttt tatttcacct aattttcaaa gatcataatt 780 gaattagatg gccaagctat tcaaaacacg accaagctta tcaaaataag accaatatat 840 tcaaaattct ttattctttc aaatcatttc attttaatgc agatccaaga tcacgaactc 900 acatatatcg agttgcttaa gtatggagtg caacaagaaa tccaagcagg ttgtagttat 960 tcatataaaa ctaggaaaga aagaaatcaa atccaattta attgtaagtg gccgaattgt 1020 ccatgtcgct tccttgttga tacacttccc gatgacaagt accacatacg aaaatgtatt 1080 aacacccaca atcattctgc cccttttcca aacaaatcac aacaaattac atcatatttt 1140 tatagatctt atttaacaag atattttgaa tcaaatcaag accgtgttta cgcacaatta 1200 aggtgtttca aagatcttaa tatcccattt gatagtggga tcattccgga aatgtttcaa 1260 caatctacag atgctttaaa aaagttttct aaacgccttc atgcaatcaa tacccgtttc 1320 tcagcggatg atgttacatc tcttgcaatt tttgtcgaca tggttaagag tcattctccr 1380 gatgatctca tcgagtttat tcacgagcca gatcgtatga ttttctatta cgccccattc 1440 gaagcaaatg ctttttcaca tcaaatacaa acttatcaca ttgattccac ttataagctc 1500 ttgcgctctc gcataccatt ttatgccgtc actggcaaat tcgcggaatc aattgttttt 1560 ccgtttttgt attttttcgt ttggcccgac acaagtgaaa atatacaagt ttgtctcaca 1620 gcatattttg gctccatcaa tcgcgaacca tcttattttt caatggactg cgcacctcaa 1680 atcacaaatg ctgttgaaac agccatcccc ttatgccaga tcatttggtg tggcgttcat 1740 gtgttgcgcg ctgtcatgag aaaggctgaa aagtttcaag atcgttccaa cttcgaaact 1800 ttctataact taatgaagtt attggtcttt ggttctgaag aggaagaaat tgatcctgat 1860 gaagtttaca acaatttaga agaaatttta aatgaggaac ctgctgcccg tgaatacttt 1920 gaccgacagt ggcgccatca tcttgacagg tggatgcttc gctatagaaa tgaaggagat 1980 ggaacaaaca acatctcgga atcacatttc aaggttttga agcaccagta cttcccagaa 2040 aggcgaaatc ttcgactcga tgaacttgtc atcgagttat attccagtgt tgttccttca 2100 ttcctgatta agttgcaaat taaaggtctg gattcagaga gaagtgttaa agttatcaaa 2160 aaagtcacaa gagatttcga agaaatgaca caatttaaaa aagtcgaatg catcagtatg 2220 cttgaagctg ttcagaaagg tctgaaaaat gatacacttg atcccaacat tgtttactca 2280 acattaaaaa tattattgca aagaaaacat ttgatttaat aaacatttaa tttattataa 2340 ttaatttttt attacagtat agtaaaaaat gctataaata aattattata ttttattaaa 2400 atgtgacttt taatagtaag tgaataaaat aaatcaggaa ttgggaaaga aataaggtta 2460 attaaaaata atgaaaatta caagtttaag gttaatgaaa aagtgacaat gggcgagttt 2520 gtcccccact c 2531 // ID PFRP1 repbase; DNA; INV; 589 BP. XX AC M19409; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.falciparum 5.8 kb repeat DNA. XX KW PFRP1; Repetitive sequence. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RP 1-589 RA Suplick K., Akella R., Saul J.A. and Vaidya A.; RT "Molecular cloning and partial sequence of a 5.8 kilobase pair RT repetitive DNA from Plasmodium falciparum."; RL Mol. Biochem. Parasitol 30(3), 289-290 (1988). XX DR GenBank; M19409; Positions 1 589. XX SQ Sequence 589 BP; 243 A; 75 C; 83 G; 188 T; 0 other; taagtatatt atacaaataa tactagagat ttcaaaactc attccttttt ctataaatac 60 ttgtaaacat agtcatacat gatgcactag ctaatataaa tgtaattgtt aagattaaca 120 ttcttgatga agtaatgata ataccttcat tacttaatgg atatggtgat aaactaaaat 180 gtaatatacc ccaaaatatg taaagaataa taaagcttca tgaatattat gatagataac 240 ataccagaag ttaaagatga aacatacaga ataaaaactt tctcgaatag aatatacaaa 300 tattaatagg attatagggt taaatgtaaa taatatccct acagaaaagt attttaaaga 360 tgtaccatat aatgatgtta atgcaggata tgaaactaga tgtgctttta tatttgataa 420 attactaaat aaaataaatt tataagaacc gtgagataat gtgccgtaaa catataacgg 480 taagaaggtt cgccggggat aacaggttat agtatatata gagctctaat ctttatatac 540 tattggcacc tccatgtcgt ctcatcgcag ccttgcaata aataatatc 589 // ID CR1-46_AAe repbase; DNA; INV; 4766 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-46_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4766 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1133-1133 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 37..687 FT /product="CR1-46_AAe_1p" FT /translation="MMNNVDTKYSALPDAVQSMQSKIENLQSAVEALTAKV FT EDKPCTPTPFVTPNLWPTRNRINTPFGSTKRRRINDEALTNVPLVTSNFGT FT KSTGVIKTVQLNQRDDDNLLWIYLSAFHPNTSEGQIASLVSECLELLNIPK FT VVKLVPKGKDPNSLQFVSFKVGVASQLKEKALACDTWPENIRFREFEDLRS FT KNGPKIVSLLPTGPPSVMNTTESSSIA" FT CDS 732..4727 FT /product="CR1-46_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAPSDPVTVEPFPAIINPHQLTQDKPTTSRHSRPGP FT VYGGLEGVFQPSLSGKYFRHINNFFPDGASIFSSSQAGFDNKRTQSTGQLS FT PKLAIGESATSQPGRTPRRTLEAPSDPVTVEPTSSTTINLASCRRSRPGPV FT YGGLEGVFQPALPGKYTHNSSRPLLEDYPNCSPSVDTCQYVEQVPSVSHSN FT AATINSSPLHSRPASNQPGRTPRSSMEVPSDPVTVEPFCARGGARGPGRPR FT GAAGGPGRAPRRPGRGAPPRRRGAAPRGAAPRAAPPVTSIKQVSSRRSRPG FT PVHGGLGGIFQPASPGKYACNISRPVPEDHSDFSSTINDINDRTSSANMPV FT FEDVSRRSLDSAVGRNYGCSNRLASNLHVYYQNVGGMNSTINDYLLASTDN FT CYDIIALTETWLNERTSSSQVFDSNYAVFRCDRCSSNSRKSAGGGVILAVR FT QQLKPRMVPNDSWTSVEQVWVEIPLSDRSLFFCVVYLPPDRVRDASLIAVH FT TESIAIIRAKLKPVDEIVIIGDFNLPGIKWLPTRFGFMHPDADRSTLPTST FT TSLLDSYSTNLLQQINPVENSNGRILDLCFVSNVDYAPVISEAVAPLVKLV FT NHHPPLHIVLENRSNDCFRQTVAAVYYDFKNADFDAISDVLSNIDWEDVLG FT DDEIDTAVQYFSNIMNYVIDRHVPKRVNSKHTQTPWQTNELRSIKTAKRAA FT LRYYTKHRTESSRHYYVQLNNEYKRMNQSCYQSYMKNMQLKLKANPKSFWK FT FVNDQRKESGLPTTMMYEGNWSSDPKIVCDFFADKFGGVFSEEGITPQQIE FT AAVEDVSYLGESITHIDVDRSMIMTATSKLKASCSTGPDGIPPILLKKCAS FT SLVVPLQRIFSLSLATGKFPDSWKSAFLFPVHKKGDKMHIQNYRGISALSS FT VAKLFELVLLEPMFSHCRHYIVEEQHGFMPKRSTSTNLLSFTSYITEGFNN FT AKQTDAIYLDLTAAFDSINHEIALAKLEKLGFGGRILGWLRSYLIGRQLEV FT RIGDVTSHKFAAPSGVPQGSHLGPMIFLLYFNDVNKKMKVPRLSFADDMKI FT YFQVSSIADAHILQQDLIHFADWCSTNRMRVNPSKCAVISFCRKKEPLLFE FT YNIFGTSIERTSCIKDLGVQLDSKLSFQQHISYMVGKASRSLGFLFRVGKE FT FTDIYCLKNLYCSLVRSTLEYCSPVWNPYYNNAVERIEAVQRRFIRFALRR FT LPWSNPFSLPSYESRCQLIQLETLTIRRNVARALFVSDVFTSRIECQVLLN FT SIQLQVQPRTLRNRSFLQIPLRRTNYGSYGAIVGLLRLFNRVADYFDFHLS FT REAIKRNFKRVFCRYSIIRA" XX SQ Sequence 4766 BP; 1257 A; 1237 C; 1034 G; 1238 T; 0 other; tgttcagaac tttttgcaaa tacagttttc cgcgaaatga tgaacaatgt cgacaccaag 60 tattctgctc tgccagatgc agtccaatcg atgcaatcga aaattgaaaa tttgcaatcg 120 gcagtggagg ctttaacagc aaaagtggaa gacaagcctt gcactcccac tcccttcgtt 180 acgcccaacc tatggcccac caggaatagg atcaacactc cattcggctc aactaaacgt 240 cgcagaataa acgatgaagc tctgacgaac gtgccattag ttacaagcaa ctttggtact 300 aaatcaactg gtgtcatcaa aactgtacaa ctcaatcagc gcgacgacga caatctactt 360 tggatttacc tctctgcctt ccacccaaac acttcagagg gtcagattgc ttcactagtt 420 agtgaatgct tggaattgct caacatacct aaggttgtta aattagtccc caaaggtaaa 480 gacccgaatt cgctgcaatt tgtgtctttc aaagttggtg tcgctagtca gttgaaagaa 540 aaagctctcg cctgcgacac ttggcccgaa aatattcgtt tccgggaatt tgaagatctt 600 cgatcaaaaa acggacccaa aattgtgagc ttgctgccga cgggaccgcc atctgtaatg 660 aacaccacgg aatcgagttc aattgcctga atgttccatc tagtctacca ggacgcacgt 720 tagaaagccc tatggaagct ccctctgacc ccgtaacagt cgagccgttt cctgccatca 780 tcaatccaca tcaactcacc caagacaaac caactaccag tcgtcacagt cgtcctggcc 840 ctgtttacgg aggtttggag ggggtcttcc aaccttcact ttcaggcaag tatttccgac 900 acatcaacaa tttcttccct gatggtgctt cgattttcag ttcctctcaa gctggttttg 960 acaacaaacg aactcaatct actggtcagc ttagtccaaa actcgccatt ggtgaatccg 1020 ctacaagtca accaggacgc acgccaagac gcacattgga agccccttct gaccccgtga 1080 cagtcgagcc tacttcttca accaccatca atctcgcttc ctgccgtcgc agtcgtcctg 1140 gtcctgtcta cggaggcttg gaaggggtct tccagcctgc acttccaggc aagtatacgc 1200 ataactccag tcgacctctc cttgaagatt atccaaattg cagtccttcc gtcgacactt 1260 gccaatacgt cgaacaagtg ccatccgtaa gtcactcaaa cgcagctaca ataaattcga 1320 gcccacttca ttcgagacca gcatcaaacc agccaggacg cacgcccaga agctctatgg 1380 aagtcccttc cgaccccgtg acagtcgagc cattctgcgc ccggggcggc gcgcgcgggc 1440 ccggccggcc ccgcggcgcg gcgggcggcc ccgggcgggc cccgcggcgc cccggccggg 1500 gcgccccgcc gcgccggcgg ggggccgcgc cccgcggcgc cgccccgcgc gccgcgccgc 1560 ccgtcaccag catcaaacaa gtttccagcc gtcgcagtcg tcctggtcct gttcacggag 1620 gtttgggagg gatcttccaa cctgcctctc caggcaagta tgcgtgcaat atcagtcgac 1680 ctgtccctga agaccattcc gatttcagct cgacaatcaa tgacatcaac gatcgtacgt 1740 cttctgcaaa catgccagtc ttcgaagatg tttctcgccg tagtctcgat tcggctgtgg 1800 gcaggaatta tggttgcagt aatcgactcg cttcaaacct acatgtgtac taccagaatg 1860 ttggcggcat gaatagtacc ataaacgatt acctgctggc cagtactgac aactgctacg 1920 acatcattgc tctcactgaa acgtggttga atgaacggac atcatcatcg caagttttcg 1980 atagtaatta tgccgttttc cgttgcgatc gttgttcatc aaatagtcgc aaatcagctg 2040 gaggtggggt tatccttgca gtacgccaac agctgaaacc acgcatggtc ccaaacgact 2100 cttggacaag cgtcgaacaa gtgtgggtcg aaataccact gtccgaccga tcacttttct 2160 tttgcgtggt ttatctgcca cccgaccgtg tccgagatgc atccttaata gcagtccaca 2220 ctgaatcgat tgcaatcatt cgtgcaaagc tgaaacccgt ggatgagatc gtgattattg 2280 gcgacttcaa tctccctgga ataaaatggt tacccacccg ttttggtttc atgcacccgg 2340 acgccgatcg ttctacgctt ccaaccagta ctactagttt gctcgattcg tatagtacca 2400 atttgctgca gcaaatcaac cccgttgaga acagtaatgg aagaattttg gatctctgct 2460 tcgtaagcaa cgtcgactat gctcctgtaa tctccgaggc tgttgctccg ctcgtcaaac 2520 tcgttaatca ccaccctcca ctgcacattg tcttagaaaa ccggtctaac gattgcttca 2580 gacaaacggt agctgcagta tactacgatt ttaaaaatgc tgactttgac gcaatttctg 2640 acgtcctatc gaatatcgat tgggaagacg ttttaggtga tgacgaaatt gatactgcag 2700 tgcaatattt ctccaatatt atgaactacg tgattgatag acatgtgcct aaacgagtta 2760 attccaagca cacacaaact ccttggcaaa caaacgagct acggtcaatc aaaactgcta 2820 aaagggcggc attgcgatac tatacaaagc accggactga atcgtcaaga cattattacg 2880 ttcaactgaa taacgagtat aaacgaatga accaatcgtg ttaccaatcg tacatgaaaa 2940 acatgcagct caagctgaaa gcgaacccga aatctttctg gaagttcgta aacgaccaaa 3000 gaaaggaatc tggtctaccc acgacgatga tgtatgaggg aaattggagc tccgacccta 3060 aaatcgtttg cgactttttt gccgataagt ttggtggcgt tttttcggaa gaagggatta 3120 cgcctcaaca aattgaagca gctgtggaag atgtctccta tctcggcgaa tcaatcactc 3180 atatcgatgt tgatcgatca atgattatga cggctaccag taaacttaaa gcttcttgct 3240 ccacgggccc cgatggaatc cctcctatct tgctgaaaaa gtgtgcttcg agtttggttg 3300 ttcctcttca gcgcatattt agtttgtcgc tagccactgg caaattccca gactcatgga 3360 agtctgcatt cttgtttcct gttcacaaga aaggtgacaa aatgcacatc caaaactacc 3420 gtggaatatc cgctctgagt tcagttgcta aactctttga gcttgtgcta ctggaaccta 3480 tgtttagcca ctgccgacat tacattgttg aagagcaaca cggcttcatg cctaagcgat 3540 ctaccagtac aaatctgctt tcttttactt catacattac cgaaggcttt aataacgcaa 3600 agcaaactga tgcaatttat ttggacctca ctgctgcctt cgacagcatt aaccatgaaa 3660 ttgcgttggc taaattagaa aagcttggtt tcggtggcag aattcttggc tggttgcgct 3720 cctaccttat tggccgccag ctcgaagtga gaattggaga cgtcacctca cacaaatttg 3780 ctgctccgtc aggcgttcct caaggcagcc acctcggtcc aatgatattt cttctttact 3840 tcaacgacgt caataaaaaa atgaaagttc ctcgtttgtc gttcgccgac gatatgaaaa 3900 tttattttca agtaagttcg atcgctgatg ctcacattct ccagcaggat ttgatacact 3960 ttgcggattg gtgcagtaca aaccggatga gagtcaaccc aagcaagtgt gccgtcatat 4020 ctttttgccg gaagaaggaa cctttgttgt tcgaatacaa tattttcggc acttctatcg 4080 aaagaaccag ctgcataaag gatctagggg tgcagcttga tagcaaatta tctttccagc 4140 agcacatatc atacatggtc gggaaagcct ccagaagcct cggatttctg tttcgtgtcg 4200 gcaaggagtt tactgacatc tactgcctga agaatttata ctgctcgctg gtgcgttcca 4260 ccctagaata ttgttcgccg gtttggaatc cttattacaa taatgcagtg gaaaggattg 4320 aagctgtaca gcgaaggttc atccgtttcg ccctgcgacg ccttccatgg agcaacccat 4380 tcagtttacc cagctacgag agccgctgtc agctcataca gttggaaacg ctgacaatac 4440 gtcgtaatgt tgctcgtgcg ttattcgttt cggatgtctt cacttccagg atcgagtgcc 4500 aggtattgct caacagtatc cagctgcagg tgcaaccaag aaccctacga aaccggagtt 4560 ttctacaaat tccccttcgg cgtacgaatt acggatctta tggggccatt gttggtcttc 4620 taaggctttt taatagagtt gcagattatt tcgatttcca tctgtctaga gaagcgatca 4680 agcgaaattt taaaagagtt ttttgtagat atagtatcat tagggcttag tattagcctg 4740 ttgatggaac aaataaataa ataaat 4766 // ID BEL-46_CQ-I repbase; DNA; INV; 3054 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-46_CQ_; KW BEL-46_CQ-LTR; BEL-46_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3054 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 245-245 (2011). XX DR [2] (Consensus) XX CC 'CTTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 352..3054 FT /product="BEL-46_CQ-I_1p" FT /translation="MWRSFLKLPTEPESEEEKKPVVEVLVPSPVKPLTPKK FT LWEKTKKAEMADGEVKVLSKKRSQVKGKLTRILHAVQPSTQPGALPGVQPR FT EADVSPPQLRVHQRNVEKYYGEFCEIHDKILDVVEEDEGAKQQDAKWLEFE FT QLYNRTLVALEMLLTAHSGPAVSAQAVLPASQAAQQVIVHQQALRAPLPTF FT DGRYENWAKFKAMFQDLMRSSSDCDAVKLYHLDKALVGEAEGKIDLRTIQD FT NNYQGAWKELEEQYENTRLIIDLHIQGILQLKKMDKRSSKELRDVVERCSR FT HVEGLRFHKQELLGVSELIVVNILASALDRETRELWEATIEKGELPTYQAT FT VDFLRKRCHILERCELSDLEASAVTPTVPKMPSSTKPSVNVFAAATTTSEV FT VCEFCSGNHPNYRCSAFCSMSIPERLEKVREARACYNCLRIGHLVKKCPSV FT WTCKCGERHHNLLHMNLPSEQAPPKFEAVPQSSPTVVQAPSNMNSADSPEQ FT TTSCCNSGLLQTSRPVLLQTAIVNVVDKHGRLHPCRTLLDSGSQAHILSAA FT MARKLGLPLTKCNVTVIGANAVKTPARKSVSLNFSSRYVDFRDTISCLISE FT KPTGIIPSERINTSGWKIPNRLQLADPQFFESNDIDLVMASNYMWDLLRSD FT KVKLRNGTVTLRETDLGWIVTGTYDPCDKVSHSILHANVMRLDTLEEIVES FT TLLSAENEKVDSQVQTTHRCNENDRCVLQIPSPIPDTRIPPRCVVDPDAIR FT YKDTPSARIVLPFDGPAPIPEQKLRAAAYRPARPPILDSDGHWRAGDSIQK FT PSPPAKTKRQRNLSHRPVPPRQITSILGQLPVKPFKRHKVTDVRPGVGNVP FT RQALKLAIVVKSFSESEQPVQDRRSDEASWLDEDSRQEGFSASQRAGE" XX SQ Sequence 3054 BP; 803 A; 812 C; 856 G; 583 T; 0 other; ttggtcactt cgtaccggac attcggccct cggaaactcg gatccgcgag tgaaaaccaa 60 gagtgtttgt ggattccgga aaatgtgaac ggaacaagtc ggcaagaatc aaacccgagt 120 gtggatccgg aagtgagaag ttcaaaaaac cttccggaag gcgactccgt ggcctggaac 180 ggcggaaaca gtggcggcaa gagcccgcaa ataaaaagtg agaaaagtgc cagatagttc 240 cgtggtgaga ctgcagtgaa aaagcgcagc gacgccattt tgtgatagtg gttcccaaga 300 gtggaacaat ctgtaggaaa caggccgcac aagagttgcc ggtctttttc tatgtggcgg 360 tcgttcctga agctgcccac ggaaccagaa agcgaagaag agaagaaacc cgtcgttgaa 420 gtgctagtgc cgtccccagt gaaaccgctg acgccgaaga agttgtggga gaaaacgaag 480 aaagcagaga tggcggacgg agaggtcaag gtgctttcca agaagcgaag ccaagttaaa 540 gggaaactca cgcgaatcct gcatgcagtg cagccgagca cgcagcccgg tgccttgccg 600 ggtgtccaac cacgtgaggc cgacgtgtca ccgccgcagc tacgagttca ccagcgcaac 660 gtagagaagt attatggtga gttttgcgag atccatgata aaattctgga cgtggtggag 720 gaagacgagg gagccaagca gcaagatgct aagtggttgg agtttgagca actctacaac 780 cgaacgctgg tcgcgttgga aatgctgctg actgcgcata gcggaccggc tgtttcggcc 840 caggctgtgc ttccggctag tcaagccgcc cagcaagtta tcgtccacca gcaagcgttg 900 cgtgccccac tcccgacgtt cgacggacgt tacgagaact gggcaaagtt caaggccatg 960 ttccaggact tgatgcgcag ttcgtcagat tgtgatgccg tcaagctgta ccacctggac 1020 aaggcgctcg taggcgaagc ggaagggaag atcgaccttc gaaccatcca ggacaacaac 1080 taccagggag cgtggaagga actggaggag cagtacgaga acacgcggct aatcatcgat 1140 ctacacatcc aaggcatttt gcagctgaag aagatggaca agcggtcatc gaaagagttg 1200 cgagatgtcg tcgagcggtg ctccaggcac gtcgagggac tacgcttcca caagcaagag 1260 ctgttgggag tgtcggagct catcgtagtc aatatcctcg catcggcact ggaccgcgag 1320 acgagagagc tctgggaagc gacgatcgag aaaggtgagt taccgacgta ccaagcgacc 1380 gtcgacttct tgaggaagcg atgccacatt ctcgagcgat gcgagctctc cgacctagag 1440 gcatctgctg tcacgccaac agttcccaag atgccgtctt caacgaaacc ctcggtgaat 1500 gtgttcgctg cagccacgac cacaagcgag gtggtgtgcg agttctgcag tggcaaccat 1560 ccgaactaca ggtgtagtgc cttctgcagc atgtctatcc cggaaaggct ggagaaggtg 1620 agagaggcac gagcctgcta caactgcctg agaattggcc acctggtcaa gaagtgtccc 1680 tccgtatgga cctgtaagtg tggagagcga catcataatt tgcttcacat gaacctaccg 1740 agtgaacaag cgccgccaaa gttcgaagct gttccccaat cttcacccac cgttgtgcaa 1800 gcgcccagca acatgaactc agccgacagc cccgagcaga cgacatcgtg ctgtaacagc 1860 gggctgctgc aaacctcccg acctgtgttg ttgcagacag caatcgtcaa cgtagtggac 1920 aagcatggtc gattacatcc gtgtcgcaca ctcctggact ccggatcgca agcccacatc 1980 ctgtcagcgg cgatggcacg taagctcgga cttccgctga caaagtgcaa cgtaacggtg 2040 attggagcga atgcagtaaa aacaccagcg aggaaaagtg tcagcctgaa cttttcttct 2100 aggtacgtcg acttccgtga taccatttcc tgtctcatct cggagaagcc aacggggatc 2160 attccgtccg aaaggattaa cacgtccgga tggaaaattc ccaacagact gcagcttgca 2220 gatcctcagt tcttcgaatc caacgacatc gacctggtga tggcgtccaa ctacatgtgg 2280 gatttgctgc ggtcggacaa agtgaagctg cgcaacggca ccgtaacgct tcgagagaca 2340 gatctgggtt ggattgtcac gggtacctac gatccgtgcg acaaggtgag tcactctatt 2400 cttcatgcaa atgttatgcg cctggatacc ctggaagaga tcgtagagtc aacactgctc 2460 tccgccgaga acgaaaaggt cgatagtcaa gtacagacaa cccaccgctg taacgagaac 2520 gatcgctgcg ttcttcaaat tccgagtccg atcccggata cccgaattcc tccgcgctgt 2580 gttgttgatc cagacgctat tcgctacaag gacacgcctt ccgcaagaat cgtgctgccg 2640 ttcgacggtc ctgcaccgat ccccgagcag aaattgagag cagccgcgta ccggccggct 2700 cgcccaccga tactcgacag cgatggccac tggagagccg gtgatagcat ccaaaaacca 2760 agcccaccag cgaaaaccaa acgccaacga aacctgtcgc atcgtccagt tcctccgcga 2820 cagataacgt cgattctcgg tcaactacct gtaaaacctt tcaagcggca caaggtcaca 2880 gacgtacggc ctggcgttgg caacgtacca cggcaagcct tgaagttggc aattgtcgtc 2940 aaatccttct cggaatcgga gcaaccagtc caggatcgtc ggtcagatga agcttcatgg 3000 cttgacgaag attcaagaca ggaaggattt tcagcttcgc agcgggcggg agaa 3054 // ID BEL-4_CQ-I repbase; DNA; INV; 5735 BP. XX AC AAWU01030743; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_CQ_; KW BEL-4_CQ-LTR; BEL-4_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5735 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 161-161 (2011). XX DR Genome; AAWU01030743; Positions 19958 14224. XX CC Positions [4788-5345] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 39..5735 FT /product="BEL-4_CQ-I_1p" FT /translation="MAPKPKPDVLADCVVCGKPKGDDAETVECEACRLWAH FT LACAGETGKVENYFCPNCSQSLQVPKTRKNAKKPKSDAGTTSGAADLPKDV FT LEQLELERVEREKKMAQEVMIRMKRIEMEKSFAEQELRIEREMQEREFAAA FT NEVRALKLKMEQEFLDKQKAAEEEFLAKQKELKKLVKKSKQKLEKAAVDPP FT AGKGEGAGAGSTDTPKGVLEKPKLPVPKLSDSDHDSSGDESDEKEDPTTPV FT SGSREVSDGPGKPVTKLTKEQLAARQAMSRHLPKFSGEPEVWPLFISSFKY FT TTEACGFTNVDNLKRLVDCLEGPALTLVQGRLVLPDAVPDVIEDLRHMFGR FT PEKLLRALLQKVRNAPAPTSDNLDTFIHFGITIKQVCDHLEVSGMKDHLNN FT PMLVQELVERLPTSHQLDWVRYRRGKVNTPLRILSDFLSELVEDVSEVAEF FT AALSLQEPTKAQGGKGKRNEFVHLHEATREKAAGGRTSMACYVCKQTNHLI FT RNCSEFRGMRVPERMKTVERLKLCTVCLSSHGNSACHSKMRCQIGNCRGQH FT HPLLHREEAARQFQRVDCNTHKVNGGVIFRTIPITMYAGSKAFNTLAFLDE FT GSSSTLMEEYVANKLGVQGKLEPLVVSWTAEIDRHENQSRRVSLTLAARGS FT REMFQLENVRTVSKLQLPAVAGVQVAEQVKEKPTILIGLDNIHLFAPLESR FT VGQQGEPIAVRSKLGWTVYGPDTPKPLVHTFLNLHVVKPASNQELHDMMRD FT QYVLDEAGVASFAVPESAEEQRAKKILEKTTKRTGERFETGLLWREDERRF FT PDSYPMATRRLQALERKLDKNPTLKQNVSQQIAEYQSKGYAHKATAEELKT FT PSNAVWYLPLNVVLNPRKPDKTRLIWDAAASVRGVSLNSQLLKGPDMLVPL FT PRVVCRFRERPIALGGDIQEMYHQIRIREEDKQAQRFLFRENPADAPQVYV FT MDVATFGSACSPSSAQFVKNLNADQYAEEYPEAAAAIKERHYVDDYYDSVD FT TVEEAIRRANEVKFIHSKGGFNIRNWVSNSRKVLDAMGEQSVKTSVHFNQE FT KSAFYERVLGIVWEPDQDVFCFAVASKPEFRDVLAGERRPTKRIVLSVVMA FT QFDPAGFLGPITILGKILVQDLWRTGCEWDEMIDELAFQRWLRWINILADA FT KHFKLPRCYFGNARWDEIEEIQLHIFADAGDKAYGCVAYFRAIVRGEVVVA FT LVMSRSKVAPLKMLSIPRLELESCVLAARMSQAIHDNHSFPISKTFFWTDS FT AVVLSWIRSDQRRYKQFVGFRIGAILSMTSLMDWNFVPTQHNIADILTKWG FT KDPNVRPDSPWVCCPRFIHDQQVNWPKKSLPPPNTTEELRVHLLLHDVKVV FT DTIVEVNHFSRWTMLVRTMACAHRFVSNLKRKTKGLPMETLRATKAQAKML FT RPTTVPSVRVALQQDEYKQAEVYLLKMVQAESFIDEVKVLTRNKDRPADQW FT LALEKSSPLYKLTPLVDEDGLLRMEGRTENAEFLPFDLRFPVILPKDHHVT FT RMIVQHYHQWFGHGYRETIKNELRQRFFILGVNTLVRQVAAACVWCKVHRN FT QPLVPRMAALPVQRITPNLRPFSYVGVDYLGPLDVSVGRRSEKRWVALFTC FT FVTRAVHLEVAYGLTAQTCLMAIRRFICRRGPPVEFFSDNGTNFRGASKEL FT VETVRSIGNDCAEVLTTSRTKWTFNPPAAPHMGGVWERLVRSVKEALEALD FT DGRRLTDEILVTCLAEAEDMINSRPLTYVAQESSQAEALTPNHFIRGVSPN FT EPNTVPPSPHPAEALRDAYKRSQQLADVMWQRWIKEYVPSVNQRTKWFGES FT RPLKAGDLVYIVDGKNRKSWIRGVVEEPIVSGDGRIRQAWVRTTSGLVKRA FT TARLAVLEIEGKPASEASEPGLRAGG" XX SQ Sequence 5735 BP; 1314 A; 1457 C; 1898 G; 1066 T; 0 other; tcatcaaaga atttgtttcc ggcgtggaac ggttcaggat ggcccccaaa cccaagcctg 60 atgttctagc cgactgcgta gtctgcggca aaccgaaggg tgacgacgct gagacggtgg 120 agtgcgaagc ttgccggctt tgggcacatt tggcgtgcgc tggagaaacg ggaaaagttg 180 agaactactt ttgcccgaac tgttctcagt cgctgcaagt gccaaagacc cggaagaacg 240 ccaagaagcc gaaaagcgat gcgggcacta cttccggcgc tgctgacctg ccaaaggacg 300 tgctggaaca gttggagttg gaacgagttg agcgggagaa gaagatggcc caggaggtga 360 tgatccgtat gaagcggatc gagatggaga agagttttgc cgagcaggag ctccggatcg 420 agagagagat gcaggagagg gagtttgcgg cggcgaacga ggtgcgcgcg ctgaagctca 480 agatggagca ggagttttta gataagcaga aggctgcgga ggaagagttc ctcgcgaagc 540 agaaggagct caagaagctc gtcaagaaga gcaagcagaa gcttgagaag gccgcggtgg 600 atccaccggc tggcaagggc gagggagcgg gcgcgggctc gacggacact ccgaaggggg 660 ttctagaaaa gccgaagtta cccgtcccca aactcagcga ttccgatcac gacagcagtg 720 gtgatgagag cgatgagaag gaggacccga cgacaccggt ttctggatcg cgcgaggtct 780 ctgacgggcc ggggaaaccg gtcacgaagt tgacgaagga gcagttggcg gctagacaag 840 caatgtcgcg acatctgccg aagttctccg gggagccgga ggtgtggcct ctcttcataa 900 gcagcttcaa gtacaccact gaagcctgtg gattcacgaa cgtcgacaac ttgaagaggt 960 tggtggactg tctcgaaggg ccagcgctga cgttggtgca gggtcggctc gtgctgccgg 1020 atgcggtgcc ggatgtcatc gaagacttgc gacacatgtt tggccggccg gagaagctgc 1080 ttcgggcgct tttgcagaag gttcggaacg cgccagctcc cacgagcgac aatctcgaca 1140 cgttcatcca tttcgggatc accatcaagc aagtgtgtga tcacctggaa gtttcgggca 1200 tgaaggacca tctgaacaac ccgatgctgg tccaggagct ggttgaaagg ttgccgacaa 1260 gtcatcagct cgactgggtg cgctacaggc ggggaaaggt caacactccg ctgagaatac 1320 tgtccgattt cttgtcggaa ctggtcgaag atgtgtcgga ggtggccgag tttgcggcgc 1380 tctcgcttca ggagccgacg aaagcacaag gcgggaaggg caagcggaac gagttcgtgc 1440 acctgcacga agcgacgaga gagaaagcgg cgggtggcag aacgagcatg gcttgctacg 1500 tttgtaaaca aaccaaccac ttgatacgga actgtagcga gttcagagga atgcgggtgc 1560 ctgagcgcat gaagaccgtg gagcgcttga agctttgcac ggtttgcttg agcagccatg 1620 gcaacagcgc gtgccattcg aagatgcgct gccagatcgg aaattgtcga ggacaacatc 1680 atccgctact tcaccgcgag gaagctgcga ggcagttcca acgggttgac tgcaacacgc 1740 acaaggtcaa cggcggggtg atcttccgga caattccaat caccatgtac gcggggagca 1800 aggcgttcaa cactctggcg ttcctggacg aaggatcgtc gagtacactc atggaggagt 1860 acgtggccaa caagctggga gtgcaaggaa agctggagcc gctggtcgtg tcctggactg 1920 ccgagatcga tcggcacgag aatcaatctc gtcgggtgag cctgacgctg gcggccagag 1980 gttcgagaga gatgttccaa ttggagaacg tgcggacggt ttcgaagctg cagttgccgg 2040 cagttgcggg cgtgcaggtt gcagagcagg tcaaagagaa accgaccatc ttgatcgggc 2100 ttgacaacat ccatctcttc gcgccgctgg aatcaagagt cggtcaacaa ggcgaaccga 2160 ttgctgtccg ttcaaagctc ggctggactg tgtacggacc tgatacaccc aagcccttgg 2220 tgcacacgtt cctgaatctg catgtcgtca agccggccag caaccaagaa ctccacgaca 2280 tgatgaggga ccagtacgtc ctggacgaag ctggtgttgc gtcgtttgcg gtgccggagt 2340 ctgccgaaga acaacgagcg aagaagatct tggagaagac gacgaagcgc accggagagc 2400 ggttcgaaac cggtctgctg tggcgcgagg acgagcggcg attcccggac agctacccga 2460 tggcaacgcg gcggctgcag gcgttggaga ggaagctgga caagaacccg acgttgaagc 2520 agaacgtctc tcaacagatc gcggagtacc agagcaaggg gtacgcgcac aaagcaacgg 2580 ccgaggagct gaagacgccc agcaacgccg tttggtatct gccactgaac gtcgtcctga 2640 accctcggaa gccggacaag acaagactga tttgggacgc ggcggcgtcg gtgcggggtg 2700 tctcgctgaa ctcacagctg ctcaagggac cggacatgtt ggtgccgctg ccacgagttg 2760 tgtgtcgttt ccgggagaga ccgatcgcgc tgggtgggga catacaagaa atgtaccatc 2820 agatcaggat cagagaggag gacaagcaag ctcaacgctt cctgttccgc gagaaccctg 2880 ccgacgctcc gcaggtgtac gtcatggacg ttgcaacttt cgggtcggct tgctccccaa 2940 gttcggctca gttcgtgaag aacctgaacg cggaccagta tgcggaggag tacccggagg 3000 cagctgcggc gattaaggag cgacactacg tggacgatta ctacgactcg gtggacacgg 3060 tcgaggaagc gattcggcgg gccaacgagg tgaagttcat ccactcgaaa ggaggtttca 3120 acatccggaa ctgggtgtcg aactcgagga aggtgctgga cgcgatggga gaacaaagcg 3180 tgaagacatc cgtgcacttc aaccaggaaa agtcggcttt ctacgagcgt gtactcggca 3240 tagtctggga accggatcag gacgtgttct gtttcgcggt ggcctcgaag ccggagttcc 3300 gcgacgttct ggcgggagag cgacgtccaa ccaagcggat cgtgctgagt gtggtcatgg 3360 cgcaattcga tccagcggga ttcctaggac cgatcaccat tctcgggaag atcctggtgc 3420 aagatctgtg gcgcacgggg tgcgagtggg acgaaatgat cgacgagctg gctttccaga 3480 ggtggctgcg ttggatcaac attttggcgg atgcgaagca cttcaaacta ccaaggtgct 3540 acttcgggaa cgcgcggtgg gacgagattg aggagattca gctgcacatc ttcgctgatg 3600 cgggtgacaa ggcatacgga tgcgtggcgt acttccgagc gattgtccgg ggcgaagtgg 3660 ttgtggcgct ggtgatgagt cggtccaagg tggcgccgct gaagatgcta tccatcccgc 3720 gtctggagct ggagtcgtgt gtacttgcgg cgcgcatgtc ccaggcgatt cacgacaacc 3780 acagtttccc tatcagcaag acctttttct ggactgattc ggcggtcgtc ctgtcgtgga 3840 tccgctccga tcagcgtcgc tacaaacagt tcgtcgggtt tcggatcggt gccattttga 3900 gcatgacgtc gctgatggat tggaacttcg taccgacaca acacaacatc gccgacatcc 3960 tgacgaagtg gggaaaggat ccgaacgtcc ggcctgacag tccgtgggtc tgctgcccga 4020 ggttcatcca cgatcagcaa gtaaactggc cgaagaaaag tttgccccca ccgaacacca 4080 cggaggaact tcgcgtgcat ctccttctac acgacgtgaa ggtggtggac accatcgtgg 4140 aggtgaacca cttctcaagg tggacgatgt tggtgaggac catggcgtgc gctcaccggt 4200 tcgtgtcgaa cctcaagcgg aaaacgaaag ggctgccgat ggagacgctg cgtgcgacga 4260 aggcgcaggc gaagatgctg agaccgacga cggtaccgtc ggttcgagtt gcgttgcagc 4320 aagacgagta caagcaggcg gaggtgtacc tgctgaagat ggtgcaggcg gagagcttca 4380 tcgacgaggt gaaagtgctg actaggaaca aggatcgtcc ggcggatcag tggctcgccc 4440 tcgaaaagtc aagtccgctg tacaagctga ctccactggt cgacgaagac ggcttgctgc 4500 ggatggaagg gagaaccgag aacgcggagt ttctgccttt cgatctgaga ttcccagtga 4560 ttctaccgaa ggatcaccac gtcacccgga tgattgtcca gcactaccat caatggttcg 4620 ggcacggata tcgcgagacg atcaagaacg aactgcgcca gcgtttcttc attctggggg 4680 tcaacacact ggttcgacaa gtggcggcgg cgtgcgtctg gtgcaaggtg caccgcaatc 4740 agccgttggt gccaaggatg gcggcgcttc ctgtgcaacg catcacaccg aatcttcgcc 4800 cgttcagcta cgttggggtt gattatctgg gtccgctcga cgtgtcagtt ggacgccggt 4860 ccgagaagcg ttgggtggcg ctgtttacat gcttcgtcac gcgtgcagtg catctggagg 4920 tggcgtatgg gctgacggca caaacgtgtc tgatggcgat tcgtcgtttc atctgtcgtc 4980 gaggccctcc ggtcgaattc ttctccgaca acggaacgaa ctttcggggc gcgagcaagg 5040 agctggtgga gacggtgcgg agcatcggga acgactgcgc cgaggtgctg acaacgtcgc 5100 gaacgaagtg gaccttcaac ccacccgctg caccccacat ggggggtgtt tgggagcgtt 5160 tggtccgctc tgtgaaggag gcgctggagg cgttggacga cgggcggcgg ctgacggacg 5220 agattctggt gacctgtctg gcggaagcgg aggacatgat caactcacgt ccgctgacgt 5280 acgtggctca ggaatcatcg caagcggagg ccctcacacc aaatcacttc atccgaggag 5340 tttcaccgaa cgaacccaac acggtcccgc catcccctca ccctgcagaa gctctacgtg 5400 acgcctacaa gcggtcgcaa cagttggcag atgtgatgtg gcagcgctgg atcaaggagt 5460 acgtgccgtc ggtgaatcaa cgcacgaagt ggttcggtga gtcgaggccg ctgaaggcgg 5520 gcgacctggt ctacatcgtc gacgggaaga acaggaagtc ctggatccgc ggagtggtcg 5580 aagagccgat cgtgtccggc gacgggagga ttcgccaagc atgggtgcga accaccagcg 5640 ggctggtgaa gcgcgcgaca gcgaggctgg ccgtgctgga gatcgaaggt aaacctgctt 5700 cagaagcttc ggaaccgggt ttacgggccg gggga 5735 // ID Sal1_HR repbase; DNA; INV; 302 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Haliotis rufescens Sal1 satellite repeat a consensus. XX KW SAT; Satellite; Simple Repeat; Sal1_HR; satellite repeat. XX OS Haliotis rufescens OC Eukaryota; Metazoa; Mollusca; Gastropoda; Vetigastropoda; OC Haliotoidea; Haliotidae; Haliotis. XX RN [1] RA Muchmore E.M., Moy W.G., Swanson J.W. and Vacquier D.V.; RT "Direct sequencing of genomic DNA for characterization of a RT satellite DNA in five species of eastern Pacific abalone."; RL Mol. Marine Biol. Biotechnol 7(1), 1-6 (1998). XX RN [2] RA Gentles A. and Jurka J.; RT "Haliotis rufescens Sal1 satellite repeat - a consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 302 BP; 74 A; 62 C; 76 G; 90 T; 0 other; gtcgactcct caggtcattt gggacatact gaactgaaca tttttcactt tggagccaat 60 gttgagatcg tctttgatcg cccatagctc cactcccagg ggtggagcta tggggggtca 120 aggacggtga aaacgtgacc tccaggtcaa ggtctacccg catgcaaagt ttcaaatcgc 180 tgcgagaatt tttaattttt ggtgtcattg gagtactagg tgttagactt atagtctttt 240 atacagcatt tttgaccaaa aaggtcatat catggtagga gtctcagact tggctggtcg 300 ac 302 // ID Tx1-15_CQ repbase; DNA; INV; 2098 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-15_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2098 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 647-647 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >96% CC identity. CC 3'-truncated. XX FH Key Location/Qualifiers FT CDS 152..1402 FT /product="Tx1-15_CQ_1p" FT /translation="MEPKRENTIKLRFGAGARNPNNGEVFKFFTKQQWTSD FT VLSAMYRDDFCVFIRFKSDELMQGALVQLGNRVNFEYDDGTTVWVNVTAAN FT GTFKYVRIFGLPPEVDDRQISAAMCKFGTIQLMVRERFPVETGFPIWNGVR FT GLHMEVTAAIPAQVNIQHVKARIYYDGLQNKCFACGALDHLKANCPNRKPV FT NGRLTVLTKPSEGSFASIVTNGTAALPLPTPTMVVLNKPGGSQEAGTKGNG FT SDLADQQPAGQESAVQQPAAQVVPDQESLEDRPANGSSTDGASGRSDADVG FT ELGGTESTGKQSNEQSALLLNPLFSDEIREVADSEMVIINPGDNERSDETM FT ENAEAWSEQKGKGKKKGKRGRPKKSVPDTSGSDTRGGKQFIVPATLKDLLL FT SQERRSRSRSRAVSEGGRRRGKK" FT CDS 1569..2096 FT /product="Tx1-15_CQ_2p" FT /note="apurinic-like endonuclease." FT /translation="MNFVYTIATINLNSSNCKANKGLLKDFIMNHDIDVAF FT LQEVSYEDFSFVYTHNALVNISSDKKGTAILIRKTLEFSDYILDLSGRITS FT VIVNNVNMINIYAHSGNNMRKERDHLFTEALTIHLNKPKCAFTLIGGDFNC FT VLDAKDTKGTQNNFSSGLKNLVDLLSLQDIAKTRKAN" XX SQ Sequence 2098 BP; 606 A; 447 C; 573 G; 472 T; 0 other; cagttcacag ccagagttca agtgacaaag acgtgttttt caagtcgctc tcgcatacta 60 cattgttgtg aatagttggt gtcggacatt gtccggctgg cattgtttgc cagcaaattt 120 gcctttactc ggaacattcg cagcagccga aatggaaccc aagagggaga acaccatcaa 180 gctccggttc ggagctggtg caaggaaccc gaacaatgga gaagtcttca aattcttcac 240 gaagcagcag tggacgagcg acgtgctcag cgctatgtac cgggatgact tttgtgtctt 300 catccggttc aaatcggacg agctaatgca gggtgccttg gtgcagcttg ggaacagggt 360 caactttgag tatgacgacg gtacgacggt gtgggtcaat gttacggccg caaacggcac 420 gtttaagtac gtccggatct ttgggttgcc gcccgaggtg gacgacaggc agatctctgc 480 ggccatgtgc aagttcggca ccattcagct gatggtgcgg gaaaggtttc cagtggagac 540 cggctttccg atctggaacg gggttcgcgg actccacatg gaagttacgg cagcgattcc 600 agcacaagtg aacatccaac acgtgaaagc gaggatttac tatgatggct tgcaaaacaa 660 atgctttgcc tgtggagcac tcgaccatct gaaggccaac tgcccaaacc ggaaacctgt 720 aaacggcaga ctcactgtgc tgaccaagcc gagcgaggga tcgtttgcca gcattgtcac 780 gaacgggacg gcggctcttc cactaccaac gccaactatg gtggtgttga acaagccggg 840 aggaagccag gaggcgggaa cgaaggggaa cggaagcgac cttgcggatc aacagcccgc 900 tggtcaggag tcggcagtcc aacagcctgc agctcaagtt gttcctgatc aggagagctt 960 ggaagatagg ccggcgaatg gaagttctac ggacggggcc tcgggtcgat ctgatgcgga 1020 tgttggggag ttgggcggga cggagtctac gggtaagcaa tccaacgaac agagcgcatt 1080 gctgctcaat ccgttgttct cggatgagat acgggaagtg gcagactcag aaatggttat 1140 catcaatccg ggtgataacg aacgaagcga cgaaacaatg gagaatgctg aggcttggtc 1200 ggaacaaaag gggaagggaa agaagaaggg aaagcgaggt cgcccaaaaa agagcgtacc 1260 agatacctcg ggatctgaca cgagaggggg taaacagttt attgttccgg ctactctcaa 1320 agatctgctg ctctcgcagg agagacgatc acgatcgcga agtcgggccg tgtctgaggg 1380 gggccgtagg aggggtaaga agtgagggag gggagggggg ggggaaggga atatctctat 1440 cacacgcctt ctaacaccaa cacaagccat aagatttact aacacccaac tagcaatact 1500 aacccaaact tttcactaac acaacaatat gtcgaaacta acactagatt actaacacta 1560 gtttaattat gaattttgta tatacgatcg caaccatcaa tttgaacagc agcaactgta 1620 aagctaacaa aggtttgtta aaagatttta taatgaatca cgacatcgat gtagcgtttc 1680 tgcaggaggt aagctatgaa gatttttctt ttgtttatac tcacaatgca ctagttaata 1740 tcagttctga taaaaaaggt acagctatct taattagaaa gacgctggag ttctcagatt 1800 acatactcga tttgtccgga agaataacat cagtcattgt taacaatgta aatatgatca 1860 atatctatgc tcattctgga aacaacatgc gtaaggaacg ggaccattta ttcacggaag 1920 ctttgacgat tcatttaaac aaaccaaaat gcgcatttac gcttatcggt ggggatttca 1980 attgtgtttt ggatgctaag gacacgaaag ggacgcaaaa taatttttcg agtggattaa 2040 aaaatttagt ggatcttttg agcctacagg acattgccaa gacgcggaaa gcaaatca 2098 // ID Gypsy7-LTR_Dmoj repbase; DNA; INV; 148 BP. XX AC scaffold_6500; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7_Dmoj; KW Gypsy7-I_Dmoj; Gypsy7-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-148 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1065-1065 (2009). XX DR Genome; scaffold_6500; Positions 24725545 24725692. XX SQ Sequence 148 BP; 48 A; 29 C; 31 G; 40 T; 0 other; tgtgttggaa ttgtgtggtg aaatgcccaa cactgaatac tcatacactc acacagcctc 60 acacagtgca taatagacaa ttagaagttg acgcaacatt gttaatgtgt tggaattgtg 120 tggtgaaatg cccaacactg aatactca 148 // ID Gypsy4-LTR_Dmoj repbase; DNA; INV; 672 BP. XX AC scaffold_6541; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_Dmoj; KW Gypsy4-I_Dmoj; Gypsy4-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-672 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1047-1047 (2009). XX DR Genome; scaffold_6541; Positions 1442532 1443203. XX SQ Sequence 672 BP; 175 A; 162 C; 171 G; 164 T; 0 other; tgtaaggcgg ctcactgaac cgcctcacat aattttttgt tatttttaaa taggccaagg 60 gacgcgcgca agtcgcggcg atcgggaatg caactctgtc cttcgcggcg aaagttcgaa 120 gggccgacgc cgtggtgagt tgtattgccg acgccgaggc ggatcgcggc ggaggcttgg 180 aggcggccga cagagagcga aatgattatt agttttcgga cagcgaagaa tagcgggtga 240 gcgcaagtga cttattagag tttaagtaac atgtaacgaa ttaaaagccg aaagtagaaa 300 tagaagggga tgtagaaata agagtggaat gttttaaata aattaaaaga gaaattgaag 360 tgaaattcgg cgtgttgttt attttgcaag tttaacacta acaacagtag tgattcccta 420 ttcctccccc gctcccctct gttccattcg tttcaacctg tcgcggcaac gcccgactgt 480 aaaaaggagt tggctataaa gaacgcggcc gcgcccgttc accgagtgga caagccccgg 540 ctgtcgtcaa cttgtgtttc ccgtattttc ccccctcccc tctcctcact ctagtcccgg 600 caaaacacaa atttcttgtg atcaaccacc ctgggcagac ttacgctgcc ccggccatag 660 tggtacttta ca 672 // ID SIRE repbase; DNA; INV; 316 BP. XX AC . XX DT 29-JUL-1999 (Rel. 4.06, Created) DT 29-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE SIRE repetitive element (a consensus). XX KW SIRE; Repetitive element. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Puerta C., Martin J., Alonso C. and Lopez C.M.; RT "Isolation and characterization of the gene encoding histone H2A RT from Trypanosoma cruzi."; RL Mol. Biochem. Parasitol 64(1), 1-10 (1994). XX RN [2] RP 1-316 RA Jurka J.; RT "SIRE."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [2] (Consensus) XX CC Putative retroelement. XX SQ Sequence 316 BP; 90 A; 63 C; 62 G; 99 T; 2 other; aagaaaacat tgaagcaatt aactaacgaa cctttttycc taacttgttt ggtttccata 60 gataatttca ggatccggcc agctgcccgg aagattattc taggagcttg tgaaaagaat 120 gatcgcggga gagctggcta acttaattaa tgtatgtgta tatcctgata aatgaatgca 180 ttctttatga tacttttcta ccgtatgaat cttttgggaa gaacgcgact ttgtaggggc 240 agggarccga tagaggccag ataatatttt atttttattt tgccatccca cccaccccct 300 tttgattccc accaca 316 // ID DNA4-4_AP repbase; DNA; INV; 239 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-4_AP. XX NM DNA4-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-239 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1741-1741 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 239 BP; 77 A; 50 C; 46 G; 66 T; 0 other; cgcccgccgc agcgaagaag gtcgtatctc tgttttcaca attttttttt tcaagttgtc 60 ataactgttt ttgtgtagtt cccgcgaaac tgatagtagc ataaaacaat caaagacgtt 120 ctatacaata aaatgcagtt actaccgtct tcgcgggaac tacacaaaaa cagttatgac 180 aacttgaaaa aaaaaaattt gtgaaaacag agatacgacc ttcttcgctg cggcgggcg 239 // ID Mariner-2_BM repbase; DNA; INV; 1278 BP. XX AC . XX DT 25-APR-2010 (Rel. 15.07, Created) DT 25-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-2_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1278 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 937-937 (2010). XX DR [1] (Consensus) XX CC ~96% identical to consensus. XX FH Key Location/Qualifiers FT CDS 116..1192 FT /product="Mariner-2_BM_1p" FT /translation="MERFTGADRAFCVREFYKNDNSATVARRKFREHKGLH FT NFDDTPTLQTIKNWVAKFEETGSTLDKPRLGRPRTSRTEQNIDTVTQSIRE FT NPTQSTRKRARALNVSRTSLQRILKKDLHMHPYKIQLVQELKETDGIQRQN FT YANEMLNRFTSFNNIMFSDEAHFHLNGHVSKQNCRYWSPINPKLKHQKPLH FT SPKVTVWAAMSAHGILGPYFFEDGRGRAVTVTSERYVAMIEEFFIPELQNF FT SGFNARTWFQQDGATSHTSNTAMPVIRQLFPGKVISKRGDISWPPRSPDLT FT PMEWGYLKAKVYDTNPRSIEALKENIRREMTSISAVTCRAVIDNFRRRLQE FT CRDRNGLHLGDVIFKK" XX SQ Sequence 1278 BP; 379 A; 276 C; 297 G; 326 T; 0 other; cagggtggcc cataagtcct ttgaaaagta aaatttgaat aaaacgaaca gggcgcgcgt 60 gagcggtgca ggggaagcgg ggggagtaac ttcggtttct gggaatttag tcgcgatgga 120 gcgttttacc ggagcggacc gtgcgttttg tgtgcgcgag ttttataaaa acgacaattc 180 ggcaaccgtt gcgcggcgta aatttcgaga acacaaaggt ttacacaact ttgacgacac 240 tccaactctt caaacgatta agaactgggt tgctaagttc gaggagaccg gttcgacgtt 300 agataaaccg cggttgggtc gcccaaggac gtcacgtacg gaacaaaaca tagacacagt 360 gacacagtct attcgcgaaa atccgacaca gtcaactcgt aagcgcgcca gagcattaaa 420 cgtatccagg acatcgttac aacggatttt gaagaaagat ttacatatgc acccttacaa 480 aattcagttg gttcaagaat taaaggaaac tgacggtatt caaagacaaa attatgcgaa 540 tgaaatgttg aatcggttta cttccttcaa taacataatg ttctctgacg aggctcattt 600 ccatcttaac gggcatgtta gtaaacaaaa ttgccgctat tggagcccca tcaatccaaa 660 acttaagcat cagaagccct tacatagtcc gaaagtgact gtttgggccg ctatgtcagc 720 ccatggtatc ctagggcctt acttctttga ggatggcaga gggcgcgcag ttactgttac 780 gtcggagcga tatgtggcta tgatcgaaga gtttttcata ccagaattgc aaaacttttc 840 aggctttaat gccaggacgt ggtttcaaca ggatggggcc acttctcaca cctctaacac 900 tgctatgccc gttatccgtc aactttttcc tggcaaagtt atctctaaga gaggagacat 960 ttcctggcct ccacgtagtc cagatctgac cccgatggag tggggctacc ttaaggctaa 1020 agtatacgac acaaacccgc ggtcaattga ggccctaaaa gaaaacatcc gcagagaaat 1080 gacaagcatt tcagcagtga cgtgccgcgc agtcatcgac aattttagac gtcgtttgca 1140 agagtgccgt gatcgcaacg gattacattt aggagatgtt attttcaaaa aataattccc 1200 gttttttaag ctttttatgt aaataaaaat ttaattcata atgtaattag catatcaaag 1260 gacttatggg ccaccctg 1278 // ID Copia-5_SI-I repbase; DNA; INV; 4218 BP. XX AC AEAQ01008405; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_SI_; KW Copia-5_SI-LTR; Copia-5_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01008405; Positions 4822 605. XX CC Positions [1622-2068] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 2596..4218 FT /product="Copia-5_SI-I_1p" FT /translation="MQGPDADEWLEAIVSEVKSLLKNDTWSVVKRPKDREV FT IGCRFVLRNKHNSDGTLERRKARLVAKGYAQRPGIDFCDTFAPVARMSSIR FT AVSALAAQFGLTLYHFDVTTAYLNGELEEDVFMKIPENIEDSLEEIVRSGR FT KDCKIQKKAKKMLESLANEDNALLLRKSLYGLRQAGRRWHVKLCKILKEFG FT FSQSISDPCVFYLGKGEDILLAVVYVDDIIAVSKTEQAVANLFQHLSTQLD FT IKNLGPVKHCLGIEFSQTKEIITLNQRGYVNDILERFGMSGSNPVGTPVEL FT GTKLRRNEEIAPECKTLPYRELVGALMYLATCTRPDIAHVVSYLSKFNDCF FT STDHWLAAKRVLRYLKGTCDVGLSFRRSTDPLIGFADADWANCVDDRKSYT FT GYAFILGGCPISWESRKQKTTALSSMEAEYMALSEATKEAVHLQRFFGELG FT FPYPKIKLFSDNCGAIKLAENPVFHNRSKHIDVRHHFIRGMLENGVIEVDY FT RSTEDMAADVLTKGLSGPKHRRCLELMGISTGEKYSRDHSRLEGK" FT CDS join(494..1618,1622..2632) FT /product="Copia-5_SI-I_2p" FT /translation="MKDGNDVREHLRKFFDTVDKLSEMEIEVNPDLLTIML FT LYSLPPSFENFRCAIESRDELPTPEALRVKIVEEGDARKSDTRASVPNALI FT AKGPRGRRPKPKSKQSDARETKDAEPFKYKCHRCRKVGHKAADCREALDSA FT TAQNTDVSLYARSEKALIGNAFGNRRTDWCLDSGCSTHLCKDRRDFVEISD FT EGTGRLNLANSASTEIKARGKVSIATDVNGCPRNVDIHDVSLVPDLRTNLL FT LVGKIADRGYTITFNKYIGKVIDENKRTVLIADRIDELYYLRNETPTCNIA FT TKDGNERESFERWHRRMGHLNARDLAVSVRNRKIHGINLIAPAEKLDCEIC FT ILGKMTRAPFPKNTDRNSKLLEIIHSDVCGPMVESNGNARYFITFIDDYSK FT WCEIRLLKGKDEVFQAFKEYKASAEKRTGRCIKYLQSDNGKEFRNERFDAF FT LKEHGIGRRLTVTNTPEQNGVAERRNRTLVEMARCLLIQSGLPSSFWGEAV FT NTANYVRNRCPSSSLNGETAYQKWTGNAPNVRHLREFGCNAYTLVRSPNKG FT KFADRSKKGILVGYSDESKAYRVWIPDEKRVDITRDVKFLEDDKISPTENF FT EDFTAHEEDHAESPDTSEILVPLNATEAEVGLENEPANARNEDEGDEHRDN FT ADDADPPRRGPGRPRIIRTGLRGRPSSITKPLSSMQIRNSLTQRRSQSTKR FT CKGLMQTNGSKL" XX SQ Sequence 4218 BP; 1240 A; 975 C; 1153 G; 850 T; 0 other; ccttctgatt atttccgaag accttaacct gccgtaatag gttatgggcc cagagtaccg 60 cgaaataagc gcgtgaaaga aacacgagtg ccagcacgag aaacgagacg agaacgacac 120 gaggcattcc aagaaaatgg cgtctcacca taccgtgcgg attgagcccc tgaacaaaga 180 caacttcgac acttggaaga tccagatgga agcgctgctc ataaaaaacg actcttgggg 240 atacgtgagt ggacatcacg taaagcccga ggtcattgcc acgaacgcgg aatcggtagc 300 ggccggcgct gcatgggacg cagcggatcg caaggcgaga tcggacatca tcctggggat 360 ctgtccgtcc gagctgaaac aaatcaaggg atgcgagaca tcgaacgtca tgtggcgaaa 420 gctgcagacg atctaccagt cgacgtgaca ggcgcggaag gcgaccctac tgaagcagtt 480 gactctacat cgaatgaaag atggaaacga cgtccgggag cacctgagaa aattcttcga 540 taccgtagat aagttaagcg agatggaaat agaggtaaat ccggacttat tgacaattat 600 gctactgtat agcctaccgc caagcttcga aaactttaga tgcgcaatcg agtcccggga 660 cgaattgccg acccccgaag cattacgtgt gaaaatcgtt gaagaaggtg acgcgcgaaa 720 gagtgacacg cgagcatcag tgccgaatgc cctgatagcg aaggggcctc gcggtagaag 780 accgaagccg aaatcgaagc aaagtgacgc gcgcgaaaca aaagacgctg aaccgttcaa 840 gtataagtgt catcgttgtc gtaaggtcgg gcacaaggcg gccgattgtc gggaggcgct 900 cgacagcgcc accgcccaga atacggacgt gagcctgtat gcgcgttctg aaaaagcgct 960 tatcgggaac gcgttcggaa atcggcgcac ggattggtgc ctggatagcg ggtgctcaac 1020 tcacttatgc aaggacaggc gcgatttcgt agagatatcg gacgagggga cgggaaggtt 1080 gaatctcgcg aacagtgcat cgaccgagat aaaagcgaga gggaaagtgt cgattgcgac 1140 tgacgtaaac ggttgcccta gaaacgttga tatacacgac gtgtcgttag taccggattt 1200 gcgaacgaac cttcttttgg tcggaaaaat cgccgacaga ggatacacga taaccttcaa 1260 taagtatatc ggaaaagtaa tagacgagaa taagcgcact gtgttaattg ccgatcgcat 1320 cgacgagctt tattatttgc gtaacgaaac gccgacatgt aatatcgcga cgaaagacgg 1380 aaacgaacgc gaatcgttcg aaaggtggca tcgcagaatg ggccacttga atgcgcgcga 1440 tttagcggta agtgttagaa accgaaagat acacggcata aacctaatag ctcccgccga 1500 aaaattagat tgcgaaatat gtattctcgg aaagatgacg cgcgcccctt tcccgaaaaa 1560 taccgataga aactcgaaac tattagagat tatacattcc gacgtatgcg gaccgatgtg 1620 agtagaatct aacggtaacg cgagatactt tataacgttt atcgacgatt actcgaagtg 1680 gtgtgagatt cgactgctaa aggggaagga tgaagttttc caggccttca aggagtacaa 1740 ggcctctgcg gagaaacgaa ccggcagatg tataaagtac cttcagtccg acaacggaaa 1800 ggagttcaga aacgagaggt tcgacgcttt cctgaaggag cacggaatcg gacgacggct 1860 aaccgtgacc aacactccag agcaaaacgg cgtggccgag aggcggaatc ggacgctcgt 1920 ggaaatggcc aggtgtctcc taatccagtc gggactgcca tcatcatttt ggggagaggc 1980 agtgaacacg gccaattatg tcagaaacag gtgcccgtcg agcagtttga acggcgagac 2040 cgcttatcaa aagtggactg gaaatgcgcc caacgttcgt caccttaggg aattcggctg 2100 caacgcatac accctggtcc gtagcccgaa taaggggaag tttgccgatc gatcgaagaa 2160 gggaatcctt gtcggatact ctgacgagtc caaggcctat cgagtctgga tccccgatga 2220 aaaaagggtt gacataactc gcgacgtgaa attcttggag gacgacaaga tctcacccac 2280 cgagaacttc gaggatttca ccgcgcatga ggaagatcat gctgagagcc ctgacaccag 2340 tgagattctg gtccccttaa atgcgacgga agcggaagtc ggattagaga acgaaccggc 2400 aaacgcacga aacgaggacg agggagacga gcatagagac aatgcagatg acgcggatcc 2460 gccaagaaga ggtccaggaa gaccgaggat aataaggact ggactcaggg gaaggccgag 2520 cagtatcacg aagccgctct cgtcgatgca aatacggaat tcgcttactc agcggagatc 2580 ccaatcaacg aagcgatgca agggcctgat gcagacgaat ggctcgaagc tatagtatcg 2640 gaagtcaagt cgttactcaa aaacgataca tggtctgtcg tcaagcgtcc taaggatcgc 2700 gaggttatcg gctgccgttt tgttctccgg aataaacaca acagcgacgg aacgttagag 2760 cgaagaaaag ctcgactcgt cgcgaaggga tacgcgcaac gtcctggaat cgacttctgc 2820 gacacattcg cgccagttgc tagaatgagt tcgatacgag ctgtatcagc gttggcagct 2880 cagttcggct taacgctata tcatttcgac gttactacag catatttaaa cggcgaattg 2940 gaagaagacg tcttcatgaa gattcccgag aacatcgaag acagtctgga agaaatcgtt 3000 cgctccggca gaaaagactg caagattcaa aagaaagcga aaaagatgct cgaaagtttg 3060 gcgaacgagg ataatgcgct tcttctacgc aagtccctct acggcttgcg tcaagcggga 3120 aggcgatggc acgtgaagtt atgcaagatc ctcaaagaat ttggattctc tcaatccatt 3180 tcggatccgt gtgtgttcta tcttggaaaa ggggaagata tcctcttggc tgtcgtgtac 3240 gtggacgaca tcattgcggt ctccaagaca gaacaagcgg tagccaactt gttccagcac 3300 ttatcaacgc agctcgacat caagaatcta ggacctgtaa agcattgcct aggaatagag 3360 ttttcgcaaa ccaaagaaat tatcaccctg aatcaaaggg gatatgtaaa cgatattctc 3420 gaacggttcg gaatgtccgg atcgaacccc gtcggaacgc cagttgagct cggaaccaaa 3480 ttgcggcgaa acgaggaaat cgcccccgaa tgcaaaactt tgccctatag agagctcgtc 3540 ggagctctga tgtacttagc tacatgtacg aggccggaca tcgctcacgt cgtcagctac 3600 ttgagcaagt ttaatgattg tttcagcacc gatcactggt tggcggccaa gcgtgtatta 3660 aggtacttga aaggtacctg cgatgtcggt ctttccttca gaagatcaac agacccactg 3720 attggatttg cggatgccga ttgggcaaat tgtgtagacg acagaaagtc ttacactggc 3780 tacgcattca tattaggtgg atgtcccatc tcatgggaat cgcgtaagca aaagactact 3840 gccctctcat cgatggaagc cgagtacatg gccttgagcg aagccacaaa agaggccgtg 3900 catctacaaa gattcttcgg ggaattgggt tttccgtacc caaaaatcaa attgtttagc 3960 gacaattgtg gtgctatcaa gctggccgag aatccggtat tccacaatcg ctcgaagcac 4020 atcgatgtaa gacaccactt catacgagga atgctggaaa acggcgtcat cgaggttgat 4080 tacagatcaa cagaagatat ggcggcggac gttctcacga agggtctgtc gggccccaag 4140 catcgaagat gcttagaact catgggcatt tcaaccggag agaagtattc gcgggatcac 4200 tcacgtctcg aggggaag 4218 // ID LTRTED repbase; DNA; INV; 273 BP. XX AC M32662; XX DT 06-FEB-1997 (Rel. 2.01, Created) DT 06-FEB-1997 (Rel. 2.01, Last updated, Version 1) XX DE Long terminal repeat of retrotransposon TED inserted in DE Autographa californica nuclear polyhedrosis virus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; LTRTED; KW Gypsy group; retrotransposon TED. XX OS Autographa californica MNPV OC Viruses; dsDNA viruses, no RNA stage; Baculoviridae; OC Alphabaculovirus. XX RN [1] RP 1-273 RA Friesen D.P. and Nissen S.M.; RT "Gene organization and transcription of TED, a lepidopteran RT retrotransposon integrated within the baculovirus genome."; RL Mol. Cell. Biol 10(6), 3067-3077 (1990). XX DR GenBank; M32662; Positions 1 273. XX SQ Sequence 273 BP; 77 A; 45 C; 47 G; 104 T; 0 other; tgttaggtat ggagccttaa tggatatcat cgacgctgca tttcctgtta ttgtccgcca 60 gctgcataga aactgtctga atgacgtaaa tcgtcatgaa ccgctgatgt agcgaaattt 120 gtaattagtt attaactcaa aattgtatgc attcctattt ctaatatcga gtaggtctca 180 cgcattaatt attgtaattc ttataagtaa taaattagca tttaaaatca tttttggttt 240 tttttctatc tgccgtctgc agtatacgta att 273 // ID Gypsy5-I_Dya repbase; DNA; INV; 5365 BP. XX AC chrU; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dya; KW Gypsy5-LTR_Dya; Gypsy5-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5365 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1052-1052 (2009). XX DR Genome; chrU; Positions 1912815 1918179. XX CC Positions [4459-4935] - Integrase core CC 'ATTA' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 464..2002 FT /product="Gypsy5-I_Dya_3p" FT /translation="MAAKHSAGISSEVSQGVENSCAVCKEILAPTVLSATT FT SCRHKFHKKCIINHLKSTQTCPVCKADCNESNLSHSNNTLFAELDAATAGT FT SGVSECQDNNDIFPLRRNTRSGRIRGTPSRRGVSTRNTYRGNLDRILTEPP FT TNSSMALSTHPELMNFIQFTVSQQQAQLMDNLTSYLGKTIETSIKSHLDNF FT QLPQPTYTNSTPDNIQPNTDLSRNVSNQNRYSHNLDNNVETRGVNRSNASS FT YVSPEKVSGIIQKWGLKFDGSKTSIRIDEFIYRICSLTKQNLGNDFGLLCD FT HLYLLFAGKASDWYWKFHRNSPNFDWNTFCGEFRKRFEDIETDYDIWEGIR FT KRKQGENETFEDFQETMERLIDRLQNGVSEAQLVELLVRNSKPSLHHELLH FT LKIPNRAQLRYEVRRHEQFYNNLRTFKPKPYRSCVAELTTEEEETGLVRVD FT EEVNQIQRRNTPTCWNCDKQGHRFDDCMAKRIIFCYGCGLKEKYKPTCPNC FT NPGNRQKDVSLSNKMHP" FT CDS join(2830..4356,4360..5340) FT /product="Gypsy5-I_Dya_1p" FT /translation="MFAEVDNMLALDVIEESTSAWSSNCVLVKKGDKNRLC FT LDSREVNKVTQRDAYPLPHIDGILSRLPPAKFISGIDMKHAYWQIPLDEKS FT KQYTAFTIPNRPLYQYKMMPFGLCNAAQTLCRLMDRVIPAHLRNRVFVYLD FT DLLVLSEDFESHMLLLQEIALCLRKANLTINVSKSKFCMKEIIYLGFIIGN FT GQIRTNPDKVKAIVEFPQPTSIKQLRRFLGLAGWYRRFVENFATISFPLTE FT LLKKKNTLEWNGEAQTAFENLKSCLTASPILITPDFSKQFILLCDASSFGV FT GCVLAQERDGVELPISYMSEKLSKAQRNYSVSELECLAVIKGIKKFRAYIE FT GQDFLVITDHAALQWLMRQKDLSGRLARWSMKLRAFTFTIKHRSGSQNVVA FT DTLSRQNDPAIDEFCQNGPIVDVDSPHFKSTEYLDLIKTISNNQSRFPDLQ FT IRDGYIYKRTTFVSDDIKQENPWKLWIPSALVPDILFQAHDAPSSSHCGMS FT KTVEKLKRYLFWPMVTQVRNYISNCVICRTTKSPNTVLKPPMGKPLTSDRP FT FQKLYIDLLGPYPRTKKGHIGLLIVVDHNTRFHFLLPLKKFTAPKICDYLK FT GAIFYTFGVPEVILSDNGSQFKSSYFQAFLTSFGVQQKLTAIYSPQANASE FT RLNRSVLAAVRAYIGKDHTNWDDNLDAISGSLRASLHRSTGSSPYFLTFGQ FT NMILNGKDYSLLRTLNLLADDTRLSKPEALELQRRQAKLNLNRAHEENANR FT YNLRTRQIEFKNGDFVYARNFTQSNAVKKFSSKLAPVFIPAKVYKKRSSYY FT YELEDERGKKLGVFHLKDIQARKQGVSSDKASLK" XX SQ Sequence 5365 BP; 1824 A; 1023 C; 1047 G; 1471 T; 0 other; catttttgcg cccaacgtgg ggcccgacaa gtccttaact tgttatagca cactaagctc 60 taaagtaatc catccacagt taaaacacaa tataactaat ttcaataaat taacatattg 120 gattgagtgc atcgatttgg aaggaatagt aaaatttagg aacatgcgat ttaatttgaa 180 agagaatgga tcgaccaaat gcaactgttt cgaatccaca gaagctcatc accatacgtt 240 tcttcactca actttattta caatttgatt ttttggaata cattatactt acacttcaat 300 aagtccgtat attgtccgat tttcaagtaa acaaaagtaa acatgaggca atagaccagc 360 tacaatccac taacaaatag agatacaact aagattctga tagaagtgat cgacataaag 420 aaagacaaac caaataatta caacatatct taataacaag aatatggcag ccaagcatag 480 tgcaggtatt tcatctgaag tttcacaagg cgttgaaaat agctgtgccg tatgcaaaga 540 aatcttagca ccaaccgttt tgtcagcaac aacatcgtgc agacataaat ttcataaaaa 600 gtgcattata aatcacttaa aaagtacaca aacatgtcca gtttgcaaag cagattgcaa 660 cgaatcgaat ttatctcact ctaataacac tctgttcgct gaattagatg ctgccacagc 720 tggtacatct ggcgtaagcg aatgtcagga taataatgat attttcccat taaggagaaa 780 taccagatca ggcaggatcc gaggtacacc ttctagacga ggagtgtcta caagaaatac 840 ttacagaggg aatttggata ggatcttaac agaacctcca acgaattcta gtatggctct 900 tagtactcac ccagaattga tgaactttat ccagtttact gttagccaac aacaggctca 960 acttatggat aaccttacct cttatttagg gaagactata gagacatcta ttaaaagtca 1020 tttggacaac tttcaattac ctcagcccac atatacaaat tccacacctg acaatatcca 1080 accaaataca gatttgagca ggaacgtttc caatcaaaat cgatattctc ataatttaga 1140 caataatgta gaaacaagag gcgtaaacag gagcaatgca agctcatatg ttagcccaga 1200 aaaagtatca ggaattattc aaaaatgggg attaaaattc gatggtagca aaacaagcat 1260 taggatagat gaattcattt acaggatatg ctcacttacc aaacaaaatc taggtaatga 1320 ttttggactt ctatgtgacc atttgtacct tctgtttgca ggaaaagcaa gcgattggta 1380 ttggaagttt catcgaaaca gtccaaattt cgattggaac acattttgtg gcgaatttag 1440 gaagagattc gaagatatag agacagatta tgatatttgg gagggaatcc gaaaacgcaa 1500 gcaaggagaa aatgaaacat ttgaagactt ccaggaaacc atggaacggt tgattgacag 1560 actgcaaaat ggagtttccg aagcacaatt agttgagtta ttggttagaa attcgaagcc 1620 tagcttacat catgagttgc tacatcttaa aatcccaaac agagcacaac tccgatacga 1680 agtgcgcagg catgaacagt tctataacaa tctcaggacg tttaagccaa aaccctatcg 1740 ctcctgcgtg gcagaattaa ccaccgaaga agaggagaca ggtttagtaa gagtagatga 1800 agaagtaaac caaattcaac gtcgtaatac cccaacttgt tggaattgtg acaaacaagg 1860 ccacagattt gatgattgta tggctaaaag aattatattt tgttacggtt gcggacttaa 1920 agaaaaatac aagccgactt gtccaaactg taatccggga aaccgacaaa aggatgtttc 1980 tctgagcaat aagatgcatc cgtagattca gaagttgcta agttaaagtc aatcacagaa 2040 acccgtgaat ttcaggagat agaaccttta ggtcagcctg atattcatta ttcctttata 2100 cccttacaca taagaattca aaattacaat aataagcgta gagaaatatt tggtaaagta 2160 gatagtatag ggactcattt aaaacccaga cgatctacta gccgattgag agaattttgg 2220 aagaaggttc gtcgtgaccg acagaaactt atttccgcta tagtacttag ctctgaccaa 2280 agactctata cggatattca tatagagggt caagattaca aggctttatt agattctgga 2340 gcaacaataa gttgtgtagg cggattagct gctcagaact tccttaaaca caacaacgta 2400 aaaaaatgtt ctggcgaaat tcgcgctgcg aacggaacaa aaagcaaagt agtttcaaaa 2460 cttacaactt ccattaagta tggagatacg atagacaatc ttgaattatt tataatacca 2520 gaacttcaac aggatgtata tttaggcatt gatttttggc aaaagtttgg attattaaat 2580 aaagttacga gcaatccgaa cattgctgag ttagacattg gtactttgga tgatgatgaa 2640 gattctccaa aatatcatga gctgaacgaa ggagaaagac ttcggctgga gacagtaata 2700 aaaacatttc catcctttgc ttccgaaggt ttaggtcgta ctaccttgtc cacacatagc 2760 attgatatag gaaatgcgag tcctgttaaa caacggcatt ggccagtttc acctgccgtt 2820 gaaaagctga tgttcgctga agtggataac atgctagcct tagatgttat cgaagagtct 2880 actagcgctt ggagcagtaa ttgcgtgtta gtaaagaaag gagacaagaa ccgtctatgt 2940 cttgattcta gagaggtaaa caaggttact cagcgggacg catatcctct accacatata 3000 gacggcatct taagccggtt accaccagct aaatttataa gtggtatcga catgaagcat 3060 gcatactggc agataccgct cgacgaaaag tctaagcaat acactgcgtt tacaataccg 3120 aatagaccgc tatatcagta taaaatgatg ccctttggtc tctgtaacgc agcccaaact 3180 ctttgtcgtc ttatggatcg agtcattcca gctcatctta ggaatagagt atttgtttat 3240 ctggacgact tattagtttt gtccgaggat tttgaatcac atatgctttt attacaggag 3300 atagctttat gcttacgtaa agcaaatctc actatcaatg tcagcaaatc taaattttgt 3360 atgaaggaga tcatatatct cggttttata attgggaatg gacaaataag aaccaaccca 3420 gataaggtta aagcgatagt agaatttcca caaccaactt cgattaagca gttgcgaaga 3480 tttctaggat tagcgggatg gtaccgacgg tttgtagaaa actttgccac tattagtttt 3540 cctttgacgg aacttcttaa aaagaagaat acgcttgaat ggaacggaga ggctcaaacc 3600 gcttttgaaa acttgaaatc atgtcttact gcctcgccaa ttctcataac accagatttt 3660 tccaagcaat tcattttgtt gtgtgacgcc agctcttttg gtgttggttg cgtactggct 3720 caggagcgag atggagtaga acttccgatt tcgtatatgt ctgaaaaatt gtccaaagcg 3780 caacgtaatt attcagtttc cgaattagaa tgtcttgccg ttataaaagg aattaaaaaa 3840 ttccgcgcgt atattgaggg ccaggatttc ttagtcataa cagaccatgc agcccttcaa 3900 tggctgatgc ggcagaagga tttatcagga aggttagctc gatggtcaat gaagttgaga 3960 gcgtttacgt tcactataaa acatcgtagc ggttctcaga acgtcgttgc agatacctta 4020 tccagacaga atgacccagc catcgatgaa ttttgtcaaa atggacccat agttgacgta 4080 gattctccac atttcaaatc cacggaatac ttggacttaa taaagacaat atctaacaat 4140 caatcaagat ttcccgatct acaaatcagg gatggatata tttacaaaag aactactttt 4200 gtatcagacg atataaagca agaaaatcca tggaaattat ggataccttc ggctttagta 4260 cctgatatat tgtttcaggc tcatgatgca ccaagttcat ctcattgtgg catgagtaaa 4320 actgtagaaa aactaaaaag atatttgttt tggccttgaa tggttacgca agtacgcaac 4380 tatatatcca attgtgtaat ttgtcgtacc acgaaaagtc caaacacggt tctaaaaccg 4440 ccaatgggta aaccgttaac ctcagacagg ccattccaaa aattatatat tgacctctta 4500 ggcccatacc ctcgcacaaa aaaaggtcac ataggtttgc tgattgtcgt tgaccataac 4560 acccgtttcc actttttgtt acctcttaag aagttcaccg ctcctaaaat ctgtgattac 4620 ttaaaggggg ctatattcta tacgtttggc gttccagaag taatactttc tgataatggc 4680 agtcagttta agtcaagcta tttccaagct tttcttacta gttttggcgt acagcaaaaa 4740 cttacagcca tatattctcc acaggctaat gcgagtgaaa gacttaacag atcggtgcta 4800 gccgctgtaa gagcttatat agggaaagat cacacaaatt gggatgataa cttagacgct 4860 attagcggat ctttaagagc gtcgttacat agaagcacag gttcgtcacc ttacttcctg 4920 acatttggac aaaatatgat ccttaacggc aaggactata gtctcctgcg tactctaaat 4980 ctgttagcgg acgacactcg gttatctaaa ccggaggctt tagaactgca gagacggcaa 5040 gctaaactga acctaaaccg agcccatgaa gaaaacgcga atagatacaa tctacgcact 5100 cggcagatag aatttaaaaa cggcgatttc gtttacgcta gaaactttac acaaagtaac 5160 gccgtaaaga aattcagtag taaattggca cccgtattta tcccagctaa agtttataag 5220 aaacgtagct cctactacta cgaactagag gatgagagag gaaagaaatt aggagtattc 5280 catttgaagg atatacaagc tagaaaacaa ggagtttctt cagataaggc ttccttgaaa 5340 tagtctcgtt acccttatct gtggg 5365 // ID Copia-137_AA-LTR repbase; DNA; INV; 153 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-137_AA_; KW Ty1_copia_Ele138; Copia-137_AA-I; Copia-137_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-153 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 153 BP; 44 A; 36 C; 26 G; 47 T; 0 other; tgaaacgaag taacctacga aagctgtcgc tatgcatagc aacccacttg cttagtagcg 60 acccctagtt gtttagtgtg tataaacgtg ttaataaatt ttcattagtt cttctgagca 120 caaccagtac acacgcgttt ccatttactc tca 153 // ID BEL-87_AA-LTR repbase; DNA; INV; 703 BP. XX AC supercont1.41; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-87_AA_; KW BEL-87_AA-I; BEL-87_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-703 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.41; Positions 2612482 2613184. XX SQ Sequence 703 BP; 273 A; 122 C; 115 G; 193 T; 0 other; tgtcacgatg agaccccgtt ctacagtcga tgtagaaaac gatcatcgcg aagaccgcac 60 aacgacacta caaacgatca caacgatatc caaaaaatcg ggtgtaaaca aaaaggaaaa 120 ggattgaatt gctctcgcat gtagaaagta caattgaaat cgtggcatac aattagttac 180 taaatattct aaaaaactaa gtgaatttat atttgaactt aaaagttgga tttctctctg 240 ggcgtaaaag ttgttgacta taaataatta gttaattgcc ttataaatta taaattatac 300 gtgcacaggt aacatatcat gcggttgaaa atgaattagt ttaattaatt ggatcttaaa 360 ctaggtgtac gtgtagttaa acctaaagcc caccatttgt catctaataa cttcgaagac 420 gaataagtct agttgaaact aatcgtgagt acaatcgttt agcaattaaa accataatat 480 aaccttaaaa ctactctata catacagcac gatctttgaa gtgacggtaa aacaagatcg 540 gaaacagtga ttcgacatta caaaggttca ccaaaaaatg taagaccagg catatataga 600 gaacacgatc ataatactaa ataaatttca ttttagctta aagcttacca aacaacaata 660 aaccctggtt tgcttctgag actttggaga aatcatccgc aca 703 // ID DNA-14_AAe repbase; DNA; INV; 429 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-14_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-429 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1269-1269 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. Present in >1500 copies in the CC genome. 2 bp TSD. XX SQ Sequence 429 BP; 147 A; 70 C; 69 G; 140 T; 3 other; cccgagcaaa gttgggtaac aaaatgataa caagttgtgt tatccgataa caagcatttg 60 ataacatagt ttgttatcat tttggttgaa taagcaggca acataacaaa cgtgataaca 120 aaagtttata ctaaagatat catatagata nctcattgtg ttatcatttg aaacacattg 180 ataacaagtt ttgttatcca gcgaatttcc ctcattgata actcattatg ttatcantta 240 gttataaaga tcgaattatg ataacaaaca tttgacgtca ttcacccgtc ggttgcatcc 300 tcgatctgac agctcaaaat tcgttgataa caagagtntt cgtattgata acaaaaaata 360 acaaaatgaa atcatgctgt acatcttgtt atcactatgt tatgcggttg ttattcgact 420 ttgctcggg 429 // ID CR1-2_NVi repbase; DNA; INV; 5153 BP. XX AC . XX DT 14-APR-2009 (Rel. 14.04, Created) DT 14-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5153 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(4), 749-749 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 666..2003 FT /product="CR1-2_NVi_1p" FT /translation="MAPGRNKPPDRCRSNFACCKHKPATTAVCVICDEAYH FT HSCLKQKNENYKFVGNNLIVCPEHSYQNVTSKIDEEVLSESASLLIAQIKL FT KKSEDVRNDLLAETSDKTQDLQEAESIEESDIELLRTENMLLKRLVSELTE FT KNDLLKEKLQNIKSGTDVKLRTYSDTIKYSRATQKKVPKISVKKSNNSNID FT VMKSVIECLTAEKNIQTKNIYVNKNDEVIISCLNDNSAELTETVLKEKLED FT ECNITKNELNNPKIKIVGIDNYMNMQTKEIEKDINERNFSNFEKNGEVLHM FT YKNKHNNLSTVLMEVPAEIYKHIRENSNKIFVGYQHCKVYDLVNICPCFKC FT GRFGHNAKKCRNDTLCLKCSETHKTSDCDKDSAKCINCHYSNTRYKSKLDT FT KHFTYDSSQCSILKKKLIDISTPSIIPSNPPYRQLNQSIATWNQQTGSKYL FT *" FT CDS join(1970..2197,2194..4311,4265..5074) FT /product="CR1-2_NVi_2p" FT /translation="MESANRLEVPVDVNNAADKEIQQLENNTPRNKSKKEP FT QHNSRPHTRRDQQVLLNQPKLHSTASTPVNKSDKNKIKKKIGNMDTFDYLD FT KFQNKSMQKERTCSNIENFNKVINNKKDIILHVNIRSINANFDKLKILIES FT LAIKPAIVICSEVFEQVNYNLYQLNGSDYDYIIYYNDSRINRNDGVIVFVN FT NNLVQHTEVVEAGRIKIINTRITLNDKSSLEISAMYRSHGIPKTEFILDLN FT KYLTKKRNVKNHLVIGDFNIDLLDCDHYSQDFLCNFLDKGYIPGFVGITKP FT PIGRAKGSCIDNIFIKSHNINTATYKIRHDITDHYPIIIAVNKLKEVTKKP FT YVFLNYKKLTNYAIARNWNEIMSMDDPNLAVEKLLNEIKLCVELAKTKKKA FT KKHTGRKNWITDAIVRSCETKEFLYNLWQLDVQNDQLKTQYKTYAKILDKV FT ITDAKIKYESNLIQNNSNNPKKLWEIINMKIGKVKKCNDAINYIKINNQKI FT TDKFQIAQNMNTFFCDIGRNLSANIVKPVNTELKLPAMNPISIFLKPTSTA FT EIISIINALKLKNGGVDKINAKTLKLLCKHIAGPLTHIFNKCIEKSIWPDS FT LKRAEVVPVYKSGEKHKITNYRPISLISNVAKIFERIIYNRIHDFVEQSNI FT ISKQQFGFMRKIGTKDALKYITNALYNNVDKSKPTIITFLDLAKAFDTVDH FT SILLAKLYCIGIRGQALDLLSSYLSDRYQVVKIDGIQSDSSVVNTGVPQGT FT ILGPLLFILYINDVLREIPPPPPRGYNFVCRGRSPPPPPEAIISYADDTAI FT IATGKNWIEAQDTMNNFLSVISEWLALNKLSLNVGKTVCMTFGSSIGSVPA FT QVNIKILDKNITRVEHFKYLGIVFDSNLRWENHMKHIIGKTKYLVYIFYKI FT SKTMPTETLRMIYYAFFHSIISYGIIAWGGAYRNSRDQVQKLQNKILKIIN FT KNKFPSHDNPLNIVQLFALESLKLHYYDLKEKYLASDSITRNRSIIIPKTN FT KRISNKNSYMKAINIYNELPNQLKVLNIEKFSQKRKIIEWIKSNC*" XX SQ Sequence 5153 BP; 1987 A; 864 C; 897 G; 1404 T; 1 other; gacgtcacgc ttggtaagtc tagcagcatc atcaagtcgc tctgcttatt tagcattatt 60 ttaatgtgtt ttctcactcg aatgagtgat ttttcacagc gggaaggatc ctctatctgg 120 ctcgcacgct agcgctggaa aaccacgact ttttcgtgcg tttttatgag cctagaggag 180 ttcttttcgc aagctggaga caacataacc tcatctgtca accactatac aattgctgct 240 ggcgaacatc ctcatagctg tttcatcatg aggacatagc atcggctctc aacatcgcta 300 aagaggaaaa gaaaaatctg gttttttgga acatcagccc caaattcaac cgaatggagg 360 tcttctacag agatttttag aggttagata caaatagctc tatctctctc taaagagcgc 420 gagatcagtc gtcaccgttg ctgcgcgtct gattccaata ttcgccacca gggtgcaaca 480 ccatcgacac tggacaaacg ttggcacctc tgactgatca gccatggtgt ttttggcgcg 540 ttattcaaaa atctgctaga gcgggagttt tgaaaatttt gaaaatttaa aaattttttt 600 tctttttatc aaaaattttg ataacttttc tgagccaaaa ttagtacaat tcctcacatg 660 ccaagatggc gccgggacgg aacaagcctc ctgacaggtg tcgcagtaat tttgcatgtt 720 gtaagcataa accagcaaca acagcagtgt gcgttatttg tgatgaggct tatcaccata 780 gttgcctcaa gcaaaaaaat gaaaactata aattcgttgg caataatctc atagtttgcc 840 ccgaacatag ctatcaaaac gtaacctcaa aaattgatga ggaagtttta agcgaatctg 900 cctcactttt aatagcgcag attaagctaa aaaaatctga agatgtgaga aatgacctat 960 tggctgaaac tagtgacaaa actcaagact tacaagaggc tgaatccatc gaagaaagtg 1020 atattgaact tttaagaact gaaaatatgc tgttgaaaag gttagtgagc gaactaactg 1080 agaaaaacga ccttcttaaa gaaaaattgc aaaacatcaa gagtgggact gacgttaagt 1140 tacgcacgta ctcagatacc attaagtact caagagcaac tcagaaaaaa gttcctaaaa 1200 tatctgtcaa gaaaagcaat aacagcaaca tcgatgtcat gaaatcggta attgaatgtc 1260 tcacagccga aaaaaacatt caaacaaaaa acatatacgt taacaaaaat gatgaagtta 1320 ttataagctg cctgaatgat aacagtgctg aattaacgga gacagtactg aaagaaaaac 1380 tagaggacga atgtaacata acaaagaatg aattaaataa tccgaagatc aaaatagtgg 1440 gtatcgacaa ttatatgaac atgcaaacca aggaaatcga gaaggacatt aatgaaagaa 1500 atttcagtaa ctttgaaaag aacggagaag tcctgcatat gtacaagaac aagcacaaca 1560 acttaagcac tgttctgatg gaagtcccag ctgagatcta caagcacata agagaaaata 1620 gcaacaaaat ttttgttggg tatcaacatt gtaaagtcta tgacttagta aacatctgcc 1680 catgtttcaa gtgcggtaga ttcggacaca atgctaagaa atgcagaaat gataccttgt 1740 gtttaaaatg ttctgaaacc cataaaacga gtgattgtga taaggacagt gcaaagtgca 1800 taaactgtca ctatagtaat acgagatata aatcaaagct agacacaaaa catttcacct 1860 atgactccag ccagtgcagc atcttgaaaa aaaaattgat agatatatcg actccgtcga 1920 ttatcccatc aaacccacct taccggcaac tgaatcaatc tatcgcaaca tggaatcagc 1980 aaacaggctc gaagtacctg tagatgtcaa taatgctgcg gataaggaaa ttcagcaact 2040 tgaaaataac actccacgca ataaatcgaa aaaagaacct caacataact caagaccaca 2100 tacaagacgt gatcaacaag tactattgaa ccaaccaaag ctgcactcca ctgcatcaac 2160 accagtcaac aaaagcgata aaaacaagat taaaaaatag gtaacatgga tacttttgat 2220 tatttagata agtttcaaaa taaatcaatg caaaaagaaa ggacatgcag taatattgaa 2280 aattttaata aagttataaa caacaaaaag gatattatcc tgcatgtaaa tattcgaagt 2340 ataaatgcaa attttgataa actaaaaatt ttaatagaga gtttagctat caaaccagcg 2400 attgtgatat gttccgaagt ttttgagcaa gttaattata atttatacca actgaatggc 2460 tcagattatg attatataat ttactataac gacagtagga taaataggaa tgatggcgtc 2520 attgtatttg tcaataacaa tctagtccaa cataccgaag ttgttgaagc aggtagaatt 2580 aaaataataa atactaggat aaccttaaac gataaaagta gtcttgaaat atcggctatg 2640 tacagatcgc atgggatacc aaagactgag tttatactag accttaataa gtacttaact 2700 aaaaagagaa atgtaaaaaa tcacttagtg ataggtgatt ttaatattga cctgctagac 2760 tgtgatcact atagtcaaga tttcttatgc aattttcttg ataagggata tattcctggt 2820 tttgtaggta ttacaaagcc acctattggt agagccaagg gatcatgtat cgacaatatt 2880 ttcattaagt ctcataacat taacacagcc acatataaga ttaggcatga tataacggac 2940 cattacccga tcataattgc tgtaaataag ctgaaggaag taactaaaaa accatatgtt 3000 ttcttaaatt acaaaaaatt aacaaattat gcaattgcta gaaactggaa tgaaattatg 3060 tcgatggatg atccaaattt ggctgtagaa aaattgttga acgagatcaa gctatgcgta 3120 gaattggcaa aaacaaagaa aaaggctaaa aaacacactg gtagaaaaaa ctggataact 3180 gatgcaatag tgaggtcgtg tgaaactaaa gagtttttat ataatttatg gcaactagat 3240 gtgcagaacg atcaactaaa aacacaatat aaaacttatg caaaaatttt agataaggta 3300 attaccgacg ctaaaataaa atatgaatca aatcttattc aaaataattc caataaccca 3360 aaaaaacttt gggaaataat aaatatgaaa atcgggaaag ttaaaaaatg caatgatgcc 3420 ataaattaca taaaaataaa taatcaaaaa attactgata aattccagat agcgcaaaat 3480 atgaatactt tcttttgtga tataggtaga aatctaagtg ctaatatcgt taaaccagta 3540 aatactgaac ttaagctgcc tgccatgaat cctatctcta tttttttaaa gccaacaagt 3600 acagcagaaa taataagtat aatcaatgca ctgaaattga aaaatggagg agttgataaa 3660 ataaatgcaa agaccttgaa actgctatgt aaacacattg caggtcctct aacccatatt 3720 ttcaataagt gtattgagaa atcgatatgg ccagattctt taaaaagggc cgaagtagtg 3780 ccagtctata aatcaggaga gaaacataaa atcacgaatt atcgacccat ctctctcata 3840 tccaatgttg caaagatttt tgagcgaatt atatataata gaattcatga ttttgtagaa 3900 cagagcaata taatctcgaa gcaacaattt ggtttcatga ggaagattgg tacgaaagat 3960 gctctaaagt acataactaa tgcactatat aataatgttg ataagagtaa gcctacgatt 4020 ataactttcc ttgacctggc gaaagctttc gatactgttg accatagcat actactggca 4080 aaactttatt gcattggaat taggggccaa gcgttggatc tcctttccag ctatttgagt 4140 gacagatacc aagtagtcaa aatagacggt attcaaagcg atagctctgt ggttaacaca 4200 ggcgttccac aaggaacgat cttgggaccg ctacttttta tcctgtacat aaatgatgtc 4260 ttgagggaga tccccccccc ccccccccga ggctataatt tcgtatgcag atgatactgc 4320 tattatagcc actggtaaaa attggataga agcccaagat accatgaaca attttctaag 4380 tgtgatatcg gagtggctgg cattaaataa gttgtcatta aatgtaggaa agactgtttg 4440 tatgactttt gggagcagta tcggaagtgt accagctcaa gtaaacatca aaattctcga 4500 caaaaatata actagggtag aacattttaa atatctagga attgtttttg atagcaacct 4560 aaggtgggaa aatcacatga aacatataat cgggaagaca aaatacctag tatatatttt 4620 ttacaaaatt tcaaaaacca tgcctacaga aactctcaga atgatctatt atgctttttt 4680 ccatagtatc attagctatg gtatcattgc ttggggtgga gcatatagaa atagtcgaga 4740 ccaagttcaa aagctacaaa ataaaatttt aaaaatcata aataaaaata agtttccatc 4800 acacgacaat cctctaaata tagtacaact ttttgctctt gaatctttga agctgcatta 4860 ttatgatcta aaagaaaaat acctagcatc agatagtatt actagaaata gaagtatcat 4920 tatacctaag actaataaga gaattagcaa taaaaatagc tatatgaaag caataaatat 4980 ttataatgag cttcctaayc aactgaaagt attaaatata gaaaaattct ctcaaaagag 5040 aaaaattata gaatggatta aatccaattg ttgaaaaaat atttttatat attaataata 5100 agaatagtaa taaataagaa aataaaaaaa atggtaactt gttgttaaaa aca 5153 // ID Chapaev-4_HM repbase; DNA; INV; 5336 BP. XX AC . XX DT 27-FEB-2008 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5336 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 30-30 (2008). XX DR [1] (Consensus) XX CC Chapaev-4_HM is a very young family of autonomous Chapaev DNA CC transposons that were active in the hydra genome less than a few CC million years ago (they are <0.5% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of 8 copies; it codes for a 975-aa Chapaev CC transposase (ten exons). Chapaev-4_HM is characterized by 4-bp CC target site duplications, 10-bp terminal inverted repeats, and CC 29-bp subterminal inverted repeats (separated by a 12-bp and 1-bp CC regions from the 5' and 3' TIRs, respectively). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(462..1334,1473..1765,1982..2194,2308..2547, FT 2639..2715,2868..3073,3163..3492,3656..3853, FT 3995..4187,4365..4666) FT /product="Chapaev-4_HMp" FT /note="Transposase." FT /translation="MTEKHQQKLLRVCRICGNLTGKDSLTTASRTERIFNV FT FKINTKEDCVEVHPPQMCLKCYSTMKNIETRGNKLSKIKPKIWLKCPTKEC FT MCIYGKVGRKPSPNVGRPGVFKRWTQLNINLFLESLPLPLNKKEISEFNIQ FT LNPHITLCICKVCGRIMHQPVMIKDCQHSFCSQCIISIIKGKLENEARCPV FT CLTCIMINSLCSSVHVLEMIEHLYIACKICEKNFIKDKYENHECKKDHLLN FT NSSTIVDDLFKVDKSSHIPRNIEDAVLHVIKQKMENSKTTTIEFLSGGPRP FT LCLTVMPKAYKESASCSGHTLRKRHKHLVDQANHQVGNSKDSLVVQTSVML FT KSFDHNQKNKVLESAKIGVVEFSAEELVAFKANVGVPWNKLKTMTRWLNSR FT NIKTASNFKQRVVANTWAGNDLVVLNGDFTFQNENNKSVFEIKHAPWAYID FT NLPTNIKNLLDILESFKLIKHDGIKNDEIHIKIGGDHGGGSFKMSYQIVNQ FT DQPNSKSNSIVFSTFEARDYRTNLKVGLSRYTLQVDEIQKMIWQNHNVRVF FT IFGDYDFLCSVYGITGASGRHCCLFCDITKESIQLAPISRENGVSQRSLKS FT LNIDFNRFQSSGGNLKKAKFFNNVINETLFHIPLDQVAVPALHISLGTYLK FT FFNMFEDECHLLDIKLAGELALKNNTIGKDDFDKYINMHIQSNLIKSVIED FT CDNKITLLQDTISLKVLCEPDRTEEIRKVYAPRILYYDLKKTDKMKELNSL FT NEANILDKISGPCIQQLDEILKANQVQRQAYHGKCFVGNHVHTMLKVFLQM FT FYIKYPNLPQPLLDLCNSIPKLISNLGFIDTDVHLLSVNVSQKFKQLFQYY FT SKCHNLMNSSQSFQENDVQELESAICNLMEFYRSTWPNASVTPKMHLLETH FT VLQFIQKWGLGIGVYGEQGGESLHAEFNNLNRLFCHMKGCSRLKSMVKEHY FT VRNHPKAKAMKPEIKKKNFF" XX SQ Sequence 5336 BP; 1988 A; 639 C; 794 G; 1915 T; 0 other; cacggtgatt cagtacttat gcaagcttga gattttatta tacgcatgcg tctgataaaa 60 atattcaata ttaaaatttg tatttatcgt gttaaaaaca aataaatacc ttacaacatt 120 gtttccattt taaatccaat ttttatgagg tttttaataa ttagacaaat gtaatagtat 180 tttatatact tgagctgttt tgtgcaaaaa atctgagtga aatttatgtt tgaacattta 240 ctacaatagt taaaacagaa aagagtaata ttgtttaaaa taacacttta tatacatttc 300 aattatggac gaagcaaaat ttttatgaca tgtggattag tttttaataa aagtattatt 360 cattgatatt caaaaaatat tgacttctat aacctgtcat ttttttataa tatgttattt 420 cattatttag ctaaaaactt ttcaaaaatt ttatttagaa gatgactgaa aaacatcaac 480 aaaagttatt aagagtttgc agaatttgtg gaaatttaac aggaaaagac tctctaacaa 540 ctgccagtag gactgaaaga atatttaatg tttttaaaat aaatactaaa gaagattgtg 600 tagaggttca tccaccacaa atgtgcctaa aatgttattc aactatgaag aatattgaaa 660 cacgaggaaa taaactatct aagattaaac caaaaatttg gttaaaatgt ccaacaaagg 720 agtgtatgtg tatttatggt aaggttggtc gaaaaccatc tccaaatgtt ggacggccag 780 gagtttttaa aagatggaca caactaaata taaacttgtt tcttgaatcc ttgcccttac 840 ctttaaacaa aaaagaaatt tctgaattca atattcagct aaacccacat ataacattat 900 gtatttgtaa agtttgtggg aggataatgc atcaaccagt catgatcaaa gattgtcagc 960 actcgttttg tagtcaatgt attatatcaa ttattaaagg taagctagaa aatgaagcaa 1020 gatgtcctgt ttgtttaacc tgcattatga ttaatagcct ttgtagttct gttcatgtat 1080 tggagatgat tgaacatttg tatattgcat gcaagatttg cgaaaaaaac tttattaaag 1140 acaagtatga aaatcatgag tgcaaaaaag atcatttatt aaataattcc agcacaatag 1200 ttgatgattt atttaaagtt gataagtcaa gtcatattcc acgtaatatt gaagatgcag 1260 tgttgcatgt aattaaacaa aaaatggaga attcaaaaac aactacaatt gaatttttat 1320 ctggtggccc aagagtaaga gtaataatat ttgtaaagta cattaatata aaacttaaac 1380 tataaaattt cttgtctgga attttgaata ttcataatga taaagattgc tgatattagg 1440 tttatgtatt gatatatatt tttattttaa agcccctttg cttaactgtt atgcccaaag 1500 cctacaagga aagtgcatct tgcagtggtc ataccttgag aaaaagacac aaacacctag 1560 tggaccaagc taatcatcaa gttggaaatt caaaagattc actagtagtg caaacatcag 1620 ttatgcttaa atcatttgat cacaatcaaa aaaataaagt tttagaaagt gctaagattg 1680 gagttgttga gttttctgct gaagaattgg ttgcttttaa agcaaatgta ggggtaccgt 1740 ggaataaatt aaaaactatg actaggtaaa attatttttt taaatattta ttggttagtt 1800 tatttattag tgttaattat ttagagatta atgttaattg atttattatt taagtgaatg 1860 taggtgtata tttaaaaatg catgctttta ttaatttaca taataatgca tatataagta 1920 agcagtatga aaaataattt agtgaaaaaa atgaataagt gttatttact aaatattgta 1980 gatggttaaa ctcaagaaat attaagactg cttcaaactt taaacaaaga gtagttgcaa 2040 atacctgggc aggaaatgat cttgttgtat taaatggaga ttttacattt caaaatgaaa 2100 ataataagtc agtttttgaa atcaaacatg ctccctgggc atacatagac aatctaccca 2160 ctaatattaa aaacctgtta gatattcttg aaaggtatgg atatttgtac aaatttgtta 2220 cttaaaaatc taaatataaa ctattgtaaa aaactattta agctgtattt ttttatgtta 2280 tatttttatt ttatttaaat ttattagttt taagttaata aaacatgatg gaataaaaaa 2340 tgatgaaata cacataaaaa ttggaggaga ccatggaggt ggttcattta aaatgagtta 2400 ccaaattgta aaccaagacc aaccaaactc aaaatcaaat agtattgtat tcagtacatt 2460 tgaagcaaga gactaccgaa ctaatcttaa agtagggtta tctcgatata cattacaagt 2520 tgatgagata caaaaaatga tatggcagta agttactatt tctaaagaaa taagcattga 2580 taatttaaat aaaattcaaa tattttgtgt taatttaata tattttaatt gtttttagaa 2640 atcataatgt tcgtgttttt atatttggag attatgattt cctttgcagt gtatatggaa 2700 tcacaggagc gagtggtagg tttgctgtac ttaaagaaat tttatgttat ttgatgataa 2760 ttttttttta gttatcataa gtaaaaactt ttgtaagatt gaaaaattaa agttttaaag 2820 atataattgt ttaactccat tcaatactgt tacttatggt attataggtc gacattgctg 2880 tcttttttgt gatataacca aagaaagtat ccagttggct ccaattagta gagagaatgg 2940 tgtttcacaa cgatctctaa aatctttaaa cattgatttt aatcgattcc aaagtagtgg 3000 aggaaatttg aaaaaggcta agtttttcaa taatgtgatt aatgaaactt tgtttcacat 3060 tcctctagac caggtaattt atttttagca ctgtatttta gttttatttt gaggatttgt 3120 atggctttaa gtgtttttaa tagtgtttaa ttttattttt aggtagcagt acctgcttta 3180 catatatctt taggaaccta tttgaagttt tttaacatgt ttgaagatga atgtcacttg 3240 ttagatatca aacttgctgg agagcttgca ctaaaaaata atacaattgg taaagatgat 3300 tttgacaaat acattaatat gcatattcaa tcaaatctta taaaaagtgt tattgaggac 3360 tgtgataata aaattacact tttacaagac acaatttcat taaaagtttt atgtgaacca 3420 gacagaacag aagaaataag gaaagtatat gcaccaagaa tactttacta tgatttaaaa 3480 aaaactgata aggtatttat tatttacaaa aaaaatatga atttaattta gactttagtt 3540 agaattatat ttaacataat tttagaaagc caaaaatata tcaatttcaa aaatttacaa 3600 tacaattacc tacagtttct gtagcatata tttaacaaga aaaatatttt tctagatgaa 3660 agaactaaat tcattaaatg aagcaaacat tcttgataaa atttcaggtc cctgcattca 3720 gcaacttgat gaaattttaa aggctaatca ggtacaaaga caggcttatc atgggaaatg 3780 ctttgttgga aaccatgttc acacaatgct taaggttttt ttacagatgt tttatattaa 3840 atatcctaac ctggtgggtt ggttatatcg atcagataaa atcttttttt ttttttaatt 3900 ttttgttcag actgctttga acacatcggc cacccctgcg ttttgttaat aaaaatattt 3960 caaaattatt taattttact tatatttaat ttagcctcaa cctttgctag atttgtgcaa 4020 ctctattcca aaactgattt caaatcttgg ttttattgat acagacgttc atttactctc 4080 agttaatgtg agccaaaaat ttaaacagtt gtttcaatat tattccaaat gtcataactt 4140 gatgaacagc tcacaatcat ttcaagaaaa tgatgttcag gaacttggta ggtatttaaa 4200 atcgttttac gcaaactgta gttgaagtta gcatattctt taatgccata atagttattt 4260 tgaaaaacat taattggttg ctttttttag ttagaatttg aggaatttat ttaaagaaat 4320 tcaattaggc ttttttttta aacatgtaaa ctatttaatt ttagaatcag caatctgtaa 4380 tcttatggaa ttttatagat caacttggcc aaatgcatct gtaactccaa agatgcattt 4440 gcttgaaaca catgttcttc aatttattca aaaatgggga ttaggcattg gagtttatgg 4500 agaacaaggc ggagaaagtt tgcatgctga atttaacaat cttaatcgtt tgttttgtca 4560 tatgaaaggt tgtagccgct tgaaaagcat ggtgaaagaa cattatgtaa gaaaccatcc 4620 aaaagctaaa gcaatgaagc cagaaattaa aaagaaaaat tttttttgat ttttatgtgt 4680 gtatataata tatatatata tacacataca tatacacaca ttcaactttg ttctgagtat 4740 cagagtgttc tggtaaacac attataatat tcttgattta aaacgtgcac gcattgtaag 4800 tttaaggtta acttgaactt tatgtaaggc tttcagagta gaaaattctt ccattaaaaa 4860 gctaaaaaaa ctacgcaaaa aatgttacca gcctgccgtt aaatgcagtt aacttttttt 4920 aaaaaaatga caggttaaaa cgaaaggcat agaattttta ggcatgattt atagaattta 4980 aaacaattga actttttagt caaaaaattt tcaaacaaaa gatatccctc tagttagcta 5040 acagatattt tatttttttt atctaaaaaa ccttacattt tgctcattaa aattttcttt 5100 gtttacaaag attcttattt atttttatga tttacttaaa accttctaga atgtaagact 5160 ttaaaaatat atagttcagg tttttacacg aggaataagt aattgtttaa taaaatacta 5220 aatcaacgaa ttttagttat ttagataaaa actacttcgc catttttttc cgtaattaaa 5280 aaaaaaaaac aaatgtgcgc atgctcacaa taaaatatca ggcttaaatc accgtg 5336 // ID Copia-6_DWil-LTR repbase; DNA; INV; 196 BP. XX AC scaffold_181155; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_DWil_; KW Copia-6_DWil-I; Copia-6_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-196 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181155; Positions 2209726 2209921. XX SQ Sequence 196 BP; 58 A; 55 C; 29 G; 54 T; 0 other; tgcgaataca tccggcagcc tacttggtca cccgcccatc tggcaaccct atacgcgtca 60 agacgcactc tcgtatatgg caccgtctgt gccggtcgtg tttctcatag ttctaccaca 120 aataatatat tcactactaa tatatatata tccatcaata actctgtaaa cccatgaagt 180 tgaataaatc tcaaca 196 // ID MERLIN3_SM repbase; DNA; INV; 1076 BP. XX AC . XX DT 08-FEB-2008 (Rel. 13.02, Created) DT 04-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; MERLIN3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1076 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(2), 155-155 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 69..863 FT /product="MERLIN3_SM_1p" FT /translation="MNIETINNLADNTAVIQFLRNLRLLRSVIYCQGCENI FT LREVKYKRNNDGVALRCYKVACVKNKTYVSIRRNSFFDICKLELKLIMKVV FT YCWFTDVNQCKVARDYGVHRNVVCIIYTKLREAAKLYMERTEYKLGGRGII FT CQIDESMFRYKQKYHVGRVPQDHRWVFGIVDISSTPSKYYVELVANRSAQT FT LLPIIHQRINEESIIWSDELRSYRNIANERFTHQTVNHRLHFVDPETGVHT FT QNIESLWSKLKKDSKRRMVIALQV" XX SQ Sequence 1076 BP; 399 A; 156 C; 196 G; 325 T; 0 other; ggagcttgta cattttcccc caaaatgtac accccttata aattttcatt ttaaccccat 60 tacagaatat gaacattgag acaataaata atcttgcgga caacacagcg gttatacaat 120 tccttagaaa cttgaggttg ttaaggagtg taatatactg tcaaggttgt gagaatatat 180 tacgtgaagt taaatacaag agaaataacg atggtgtagc attaagatgc tataaagttg 240 cttgtgtgaa aaacaaaaca tatgtatcta ttagaagaaa cagcttcttt gatatatgta 300 aacttgagtt aaaacttata atgaaagtag tatattgttg gtttacagat gttaatcaat 360 gtaaagtcgc aagggattac ggtgttcaca gaaatgttgt ttgcattata tataccaagc 420 ttcgagaagc tgcgaagtta tacatggaaa ggactgaata taaattgggt ggacggggta 480 taatatgcca gattgacgaa tccatgttcc gatataagca aaaatatcac gtggggagag 540 taccacaaga ccataggtgg gtctttggta tcgtggatat atctagtaca ccctctaaat 600 attatgtaga attggttgca aatcggtccg cacaaacatt attgcctata atacatcaac 660 gtataaatga agaaagtata atatggtcag atgagttgag atcgtacaga aatattgcaa 720 atgaaaggtt tactcatcaa acagttaatc atcgattaca ttttgtagat ccagaaacag 780 gtgtccatac tcagaatatc gaatccttgt ggtcaaaatt gaaaaaagac tcaaaacgca 840 gaatggtaat agctttacaa gtttaaaatt aaacttaagt gaatggatgt ggaaagacaa 900 tatagcctgt aagatttatt aaatttataa catatgtatc cgacaaatta ttacgctttt 960 tatacccacc tttatcatta cagtagagtt ttttactatt tagacaaata aattaaaaaa 1020 acattgattt tcaaaaatct gtaccacttt tcgaaaggga aaaatgtaca agctcc 1076 // ID NAVIMAR1 repbase; DNA; INV; 1223 BP. XX AC . XX DT 06-NOV-2007 (Rel. 12.11, Created) DT 06-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE Mariner-type DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; NAVIMAR1. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1223 RA Jurka J.; RT "Mariner DNA transposon from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1170-1170 (2007). XX DR [1] (Consensus) XX CC Highest identity with consensus: 92-93%. XX FH Key Location/Qualifiers FT CDS join(187..888,870..1214) FT /product="NAVIMAR1_1p" FT /translation="MEIPEGHFRHILLFYFRKGKTAAQAHRKLCSVYGDEC FT LSERQCQNWFARFRSGNFDLKDEPRPGRPTVEKVDEILEKIEIDRHISSRD FT IAMELNIDHKTVLNHLHKTGYKKKLDTWVPHELTSKNLMDRVSICESLLKR FT NEIEPFLKQIITGDEKWITYDNNVRKRSWLKPGEASQTVAKPVLTRRKVLL FT SVWWDWKGIVHHELLEPGQTINSTVYCQQLMRLQQAIEKKTGQNKKNRPEL FT INRKGVVFHHDNARPHISLMTRQKLREFGWEVLMHPPYSPDLAPSDYHLFR FT SLQNSLNGVKLDSKEACENHLVKFFAQKPQKFYSDGIMVLPEKWQKVIDQN FT GTYIID" XX SQ Sequence 1223 BP; 383 A; 233 C; 271 G; 336 T; 0 other; tattgggttg gggaaaaaat aatttcggtt cttcctaata gatggcgtta aagttgtata 60 tctatagtaa tgcttattca atgaagccac actatagcat gtttagaagt ttaaagtcta 120 tagaatattc tagagtacaa agtagcgtca ttcgtttact gattcaaaaa ttattgcgct 180 ttgacaatgg agattccaga aggccacttt cgtcacattt tgcttttcta cttccgaaaa 240 gggaaaactg cagcgcaggc tcatcgaaaa ttgtgcagtg tttatggtga tgagtgctta 300 agtgaacgcc agtgtcagaa ttggtttgct cgatttcgtt ccggaaactt cgatcttaaa 360 gatgaacctc gccctggtcg gccaaccgtt gaaaaagtcg atgaaattct tgaaaaaatc 420 gagatagacc gacatatttc atctcgcgac atcgctatgg aactaaacat cgaccataaa 480 acagttttga accatttaca taagactgga tacaaaaaga agctcgatac ttgggttcca 540 cacgaattaa cgtcaaaaaa tttaatggac cgagtttcca tttgtgaatc cctgctgaaa 600 cgaaacgaaa tcgagccatt tttgaaacaa ataattacgg gcgatgaaaa atggatcacg 660 tacgacaata atgtgcgaaa gagatcgtgg ttgaagcccg gagaagcttc acaaacagtt 720 gcgaagcccg tattgacgcg aaggaaggtg ttgctgagtg tttggtggga ttggaagggt 780 attgttcatc atgagctgct cgaacctgga caaaccatta attcgactgt ctactgtcaa 840 caactaatgc gattgcagca ggcgattgaa aaaaaaaccg gccagaattg atcaatagga 900 aaggcgttgt cttccatcac gacaacgcta gaccgcacat atctttaatg actcgtcaaa 960 aattgagaga gtttggctgg gaagttttga tgcatccacc gtatagcccg gaccttgccc 1020 cgtcggacta tcatttgttc cggtctctgc aaaactctct taatggtgta aagctggatt 1080 caaaagaagc ttgtgaaaat cacttggtta agtttttcgc ccagaaacct cagaagttct 1140 acagtgacgg aattatggtt ttgccagaaa agtggcaaaa ggtcatcgat cagaacggca 1200 catatatcat tgattagtgt tca 1223 // ID L2B-6_CQ repbase; DNA; INV; 2611 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2611 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 147-147 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 7..2547 FT /product="L2B-6_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MLAIEIHHQQFNGIFGGVYHSPSSSDRDFLDSFENWL FT QQVMADDKTNVISGDFNICWNNEGYSRDLKNIADAVGLEQKVLEHTRISVR FT SRTTIDLVFANVEDCTVKVLADWRITDHETLGINVEGVCKINMPEKKTEYT FT CWRKYSKERLQNTLLCDETLWRNNLPVDEKAQQFSRALATAVGQLTETRMR FT RSADTKRWFTPHLQAVKLKRDEAYKIFKTTNXAEHWERYKQFRNEYVRELQ FT KAKNSSVQQEIQSCSGDSKKLWRCLKDLIKPGGTPEKEIIFDDKTEPCSDE FT ETANRLNHYFIKSVKEIHDSIPTVNQPQQDPLPTVRTLDISEFSQFQPISM FT EKLRKIVAAMKNCAGVENISKRVLEDAMDVVGDKLLDIINSSLNLGVFPQE FT WKKSVVVPIPKVPNSTRAEDRRPINMLPIYEKVLETVVREQLVEYVDRAGL FT LIEEQSGFRKQHSCESALNLLMLKWKQSIENNKFILTVFVDLKRAFETIDR FT RKLLEVLRRNRIKGDVWRWFKSYLEYRTQITRYNNSVSEAEAVELGVPQGS FT VLGPLLFILYINDLTKALRRACVNLFADDTVIYVAGDKLDECYEIMNNELA FT VFADWLKWKKLKLNVSKTKYMVVTTRRSESNCTISVDGELVERVNAIKYLG FT VMLDEKLSFAEHVDYTIRKAARKLGVLCRINRYLSFDNKIMVYKTLIAPHF FT DYCASILFLATNQQRKRMQIVQSKAMRMILRCDRLTPRILMLDSLQWMSIR FT QRVEYSTLVFIFKVVNGLAPQYLTKTVQCGRDVHRHFTRQAGDIRLLNFKK FT SCTQNSLFYRGYNLFNMLPESTRATTNLREFKKLCKPFVRQRPLE" XX SQ Sequence 2611 BP; 825 A; 492 C; 678 G; 615 T; 1 other; aattggatgc tggcgataga gattcaccat caacaattca acggtatttt tggaggagtg 60 tatcattcac ctagcagcag cgacagggat ttcttggaca gctttgaaaa ctggcttcaa 120 caagttatgg cggatgacaa aacgaatgtg ataagtggtg atttcaatat ctgctggaat 180 aacgaaggct actcaaggga cctgaaaaac attgcggatg cagttggact agagcaaaaa 240 gtattggaac acactcggat cagtgtcaga agccgcacta ctattgactt ggtgtttgca 300 aacgttgaag actgtactgt aaaagtacta gcagattgga gaattacaga tcacgagaca 360 ttgggaatca acgtggaagg agtttgtaaa ataaacatgc cggagaagaa aacggagtac 420 acatgctgga gaaagtattc taaggaacgg ctgcaaaaca ctttattgtg tgatgagacc 480 ctgtggagaa acaatttgcc ggtggatgaa aaagcacagc agtttagcag agctctagca 540 acagcagtag gacaacttac ggaaacacgt atgcgacgct cagcagacac caaacggtgg 600 tttactccac acttacaagc ggtgaaacta aagcgagacg aagcatacaa aattttcaag 660 acaacgaacw gcgcggaaca ctgggaacgg tacaaacaat tccggaatga gtacgtgcga 720 gagctgcaaa aagcgaaaaa tagttcggtt cagcaagaga ttcagagctg cagtggagat 780 tcaaagaagt tatggagatg tttaaaagat ttgataaaac ctggaggaac gcccgaaaag 840 gaaattatct tcgacgacaa aactgagccg tgcagcgatg aagagacggc gaacagactg 900 aaccattact tcatcaagag tgtgaaggag atacacgatt ccatcccgac tgtgaatcaa 960 cctcagcaag acccgttacc aactgtgcgt acgctggaca tctcggaatt cagccagttc 1020 caacccattt ccatggaaaa gcttcgaaag attgtagcag cgatgaaaaa ctgtgcagga 1080 gtcgagaaca tctcgaaacg tgtgttggag gacgctatgg atgttgtagg agataagctg 1140 ctcgatatta tcaacagttc attgaacctg ggtgttttcc ctcaagagtg gaagaaatct 1200 gtggtggtgc cgattccgaa ggtaccgaac tcgacgcgtg cagaagaccg gagaccaata 1260 aacatgctgc caatctacga gaaggttttg gagacggttg taagggagca gctggtggag 1320 tacgttgatc gggccggatt acttatcgag gaacaatcag gattcagaaa gcagcactcg 1380 tgtgaatcag cccttaacct gttgatgctg aagtggaagc agtctatcga gaacaataag 1440 tttatcctga cggtgtttgt tgacttgaaa cgtgcttttg aaactattga tcggcggaag 1500 ttgttggaag tgttgcggcg aaacaggatt aaaggggacg tttggaggtg gtttaaaagc 1560 tatctggagt acagaaccca aattacacgg tataacaact cagtatcaga agcagaagca 1620 gtagagttgg gtgttccaca agggagcgtg ctgggaccac tactgtttat cctatacata 1680 aacgacctga cgaaagcact taggcgggca tgtgttaacc tgtttgctga cgacaccgtg 1740 atatatgttg ctggagacaa actagacgag tgctacgaga tcatgaataa cgagctggca 1800 gtgttcgcgg attggctgaa atggaagaag ctgaaactga acgtcagcaa aacaaaatac 1860 atggtggtga caacaaggag gagtgaaagt aactgtacaa tatctgtcga cggtgaattg 1920 gtcgagcggg ttaatgccat caagtacctt ggtgtaatgt tggacgaaaa gctctctttt 1980 gcggaacacg tggactacac tattcggaaa gctgctcgga agttgggtgt cctctgcaga 2040 atcaatcgct acctatcgtt tgacaacaaa attatggtct acaaaacgct gattgcacca 2100 cattttgact actgcgcttc aattctgttt cttgccacga accagcaacg gaagaggatg 2160 cagatcgtac agagtaaagc tatgaggatg atacttagat gtgacagatt gacaccaaga 2220 attttgatgc ttgactctct gcagtggatg tcgattaggc aacgtgtgga gtatagtaca 2280 ttagtgttta tttttaaagt tgttaacgga ttggcaccac aatacttgac aaaaactgta 2340 cagtgtggaa gagacgttca tcgacatttt actaggcaag caggagatat cagattgctg 2400 aattttaaga aatcgtgcac acagaactcg ctattctaca gaggatacaa cttgttcaac 2460 atgctaccgg agtcaacgag agcgacaact aatcttcgtg agttcaagaa actgtgtaaa 2520 ccttttgtaa ggcagagacc tttggaatga agtggcatcc cataggtgtg gctgtgaagg 2580 agagcatgtc atgacggtcg gcttctctac a 2611 // ID Copia2-I_Dpse repbase; DNA; INV; 4185 BP. XX AC Unknown_group_59; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 16-JUN-2009 (Rel. 14.05, Last updated, Version 2) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2_Dpse; KW Copia2-LTR_Dpse; Copia2-I_Dpse. XX NM Copia2-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1023-1023 (2009). XX DR Genome; Unknown_group_59; Positions 81873 77689. XX CC Positions [1927-2460] - Integrase core CC LTRs are 97% similar to each other. The original virus is a CC duplicate. XX FH Key Location/Qualifiers FT CDS 547..4185 FT /product="Copia2-I_Dpse_1p" FT /translation="MSGNMNFNIEKLDENNYSTWEVLMRSVLIQNDLWPIV FT SGKIELSESATSEQRAAFELKDQKALASILLCVKSTQLNNIKNCVSSSQAW FT QKLNKIHRPSGPARQVLLIKQLLTLKMSDGISAQEYLRKFQVVHEALAEVN FT IKVDEKILSVVLLNGLPKSYESFVVAIETRDDLPSLSMLKIKISEEANRQE FT DSISKEITSSAELHVSYDNNNFKKNFKKGKKCFKCGKEGHFKSECRAKSKQ FT EGANNKGANSKSGERLGTMLCVTEYDKVEPEKYVPILSLAKPNDQMLNAWC FT IDSGATAHLCCDKERFVSFTEDNQKVFLAGGKSLIAEGHGTVQVKFGDRSV FT TMENVILVPDLQYNFISVTKLLKHGKKVVFEKNQAMIFENGKIVIKAHLEN FT GLFMVQDNQEKCMNIVNVEADAFKWHNRFGHVHFDALKEMVNKEMVIGLKG FT IYKPKQVCATCAKSKICVQPFQSSTNRSKGVLDLIHTDVCGPMQKQSMGGN FT RYFATFIDDCSRYVSVSFLKTKDQVFEAFKEFKSQVECQTGRKVKVLRSDN FT GREYISNVFDNYLNQNGIKRNLTVPYTPQQNGVAERANRTLVEMARSMLLH FT AGLSECYWAEAIRAATYLRNRVASSCLSGVTPFEIWTGRKPVVSHLRVFGS FT VAIALDKTKKSKFAQKGLEYIIVDYSDTAKAYRLVSKETRKLVESRDVIFV FT EPEGGKLDASCSNDIASIEVQQNQNAEPEMDSEQFKDPENEPENQEEIRSS FT NDILVIPEPIRGPGRPTIVRTGKPGRPRKQYHMLNAILAADVQVPQNYAQA FT ERSTEGEIWKASMKDEYESLVKNGTWSLTKLPSGKKSIGCRWVYAVKSNPD FT GSVNRLKSRLVAKGCSQRYGIDYKETFAPVVRHATIRLIIALAVEHGLHLH FT QMDVVTAYLNGELEDEVFMREPEGFISEVHPNHVLRLHKSIYGLKQSGRVW FT NSTLDATLRRIGFIPTVNEPCLYRKPGKNMCLIAVYVDDIILACKDLDDIS FT DIKIKIAAEFDVVDIGPMQYFLGIEVKREGSTGAISISQRKYIRELLEQYN FT LKDCRQTSTSLEAGYQVTCNKEDCRRVDITSFQKLIGELSYLAVMTRPDIA FT HSVNKLAQRNKDPHSEHEAAAKHILRYLAKTIDWEICYTKQGKAIECFVDA FT DWASDMTNRKSFTGYVCMLAGGAVSWESRKQSTVALSSTEAEYIALSSAAK FT EAI" XX SQ Sequence 4185 BP; 1445 A; 724 C; 1020 G; 996 T; 0 other; ggttatgggc ccagacacca attcacccag aaacagtata aaacagaaac agtgagaaac 60 aataaacatt gtttggtgct tagtgaaaag cagtgctaaa aaaaaatggc atagactcat 120 tggggttatg tgtgtacata gaaaggcaaa agaacaagtt gaacaaattc aacggagatt 180 cgcattgtgt tttttgatta gaattttgca gcgaatcgga ccgttgccgt ggtttaaagt 240 ccaaaataaa gcagtcccaa cggatcaaga aacacaaaag cgaacggaga ctcaagtgtg 300 tataccaaaa agtgcgtatg ccaagccgca caaaacggcg agtgtgtata cccaaaagtg 360 tgtataccaa ggcatagaaa atcgcgagtg tgtgtgtgta aatagcaact cggggggata 420 cttaaaagaa aacgaaaaaa aaaaacaaaa gaaaggcacg gacaacagta ttgtcgtcgt 480 agttgctttg tgcatgagac gatagacagt tacaaatttt gcactgtggg acaataagta 540 taaaaaatga gtggaaatat gaatttcaat attgagaaac tggacgaaaa caattatagc 600 acatgggaag ttttgatgag aagtgttctc atacaaaatg atttgtggcc aatcgtcagt 660 ggtaaaattg agcttagtga aagtgcaaca agtgagcaac gtgcagcttt tgaactaaag 720 gatcaaaagg cgttggcgtc aattttactt tgtgttaaat cgacccaact aaacaatatt 780 aaaaattgtg tatcgtcgtc acaggcatgg caaaagctaa ataaaattca caggccaagc 840 ggaccagcaa ggcaagtgct tttaattaaa caattgttga cgttaaaaat gtcggatggt 900 ataagtgcac aggaatattt gcggaaattc caggtggtac acgaggcttt agcagaggtc 960 aacataaagg ttgatgaaaa gatcttatca gttgtgttgc taaatggttt gcctaaatcg 1020 tatgaaagtt ttgttgttgc gattgagacg cgagatgatt tgccgagctt gagcatgcta 1080 aaaataaaaa tttctgaaga agcgaatagg caggaagata gcattagcaa ggaaattaca 1140 agtagcgcgg aactacacgt cagttatgac aacaacaatt ttaagaaaaa ctttaagaag 1200 ggcaaaaagt gttttaagtg cggaaaggaa ggccatttta aatcggaatg ccgtgctaaa 1260 agcaagcagg aaggagcaaa caacaagggc gcaaacagta aaagcgggga aagattgggt 1320 acaatgcttt gcgtaaccga atatgacaaa gtggaaccag aaaaatatgt accaatccta 1380 agtttagcta agccgaatga ccagatgtta aatgcatggt gcatagacag tggcgcaacg 1440 gcgcatctat gttgtgacaa ggaacgcttt gtgtcgttca cggaagataa tcaaaaagta 1500 tttttagccg gcggtaaatc acttattgct gaagggcatg gaactgttca agtaaagttc 1560 ggtgatagat ccgttacgat ggaaaatgtt atcttggtac cagatttgca gtacaatttt 1620 atatcggtca caaaactatt aaaacatgga aaaaaagttg tgtttgagaa aaatcaagcg 1680 atgatttttg aaaacggaaa aatagttata aaggcacatc tggaaaatgg attgtttatg 1740 gttcaggaca atcaggaaaa gtgtatgaac attgtaaatg ttgaggcaga tgcattcaaa 1800 tggcacaata ggttcggaca tgtacatttt gacgcattaa aggaaatggt taacaaggag 1860 atggtaatag gtctcaaggg catatacaag ccaaaacaag tgtgtgcaac atgcgcaaaa 1920 agtaaaattt gcgtgcaacc atttcaatcg agcacaaaca ggtcaaaggg cgttctagac 1980 ttaatacata ctgatgtatg cggtccaatg caaaagcagt caatgggagg caatcgatat 2040 ttcgccacat ttattgatga ttgttcaaga tatgtatcag taagtttcct aaaaaccaaa 2100 gatcaggttt tcgaggcttt taaagaattt aaaagtcaag ttgagtgcca aactgggaga 2160 aaagtcaagg ttttacgaag cgacaatggc cgagaataca tatcaaacgt ttttgacaat 2220 tacctaaacc aaaatggaat aaagcgaaac ttaacagtgc catatacacc tcaacaaaac 2280 ggagttgcag aaagggccaa cagaacattg gtagaaatgg caagaagtat gctgctgcat 2340 gctgggttga gtgaatgcta ttgggcagaa gcgattcgag ctgccacgta tttgcgaaat 2400 agagtggcaa gcagctgctt gagcggggtg acaccttttg aaatatggac aggtcgaaag 2460 ccagtggtat cgcatttaag agtttttggc tcagtagcga tagctctaga taaaacaaag 2520 aaatcaaagt tcgcacagaa aggcttagag tatataattg tggactactc ggatacggcc 2580 aaagcatacc gtttggtcag taaagaaacc aggaaactcg tagagagtag ggatgtcatt 2640 tttgtagaac ctgaaggggg caaactagat gcatcatgtt ccaatgacat agcatcaatt 2700 gaggttcaac aaaaccagaa cgctgaaccc gaaatggata gcgagcaatt taaagatcca 2760 gaaaatgagc cagaaaatca ggaggagatt aggtctagta atgacattct tgtaattcca 2820 gaaccaataa gaggaccagg acgcccaacc atagtacgga cgggcaaacc aggacgacca 2880 aggaagcagt atcatatgct caatgcgata ctggcagcag acgtacaagt tccacaaaac 2940 tacgcacaag ctgaacggtc tactgaaggc gagatatgga aagcatcaat gaaggacgag 3000 tacgaatctc tcgtgaagaa cggcacgtgg tcgcttacta aattgcctag cgggaaaaag 3060 tccattggat gcagatgggt ttacgcagtt aagtccaatc cggatggaag cgtgaacagg 3120 ctcaaatcac gtctggtggc gaaaggttgt agccaacgct acgggatcga ctacaaggaa 3180 acatttgcac cagttgtacg ccacgcaact atcaggttga taattgcact tgcagtcgaa 3240 catggtctcc acctacatca aatggatgtt gtcacggcgt atctcaacgg cgaattggaa 3300 gatgaggttt ttatgagaga acctgagggc ttcatcagcg aagttcaccc aaatcacgtt 3360 ttgaggcttc acaaaagtat atatggccta aaacaatcag gaagagtgtg gaactctacg 3420 ttggatgcaa cgctgagacg cattggtttt ataccgactg tcaacgaacc ctgtctgtat 3480 cgtaagccag gtaaaaatat gtgtctaata gctgtctacg tagatgacat tattttggcg 3540 tgtaaggacc tagatgatat atctgatatc aagataaaga tcgcagcaga atttgatgtg 3600 gttgacatcg ggcctatgca atacttttta ggtattgagg tgaaacgtga aggcagtacg 3660 ggtgcgattt caatcagcca acggaagtat attagagagt tgctcgaaca gtacaatttg 3720 aaagattgtc gacagacgtc cacgtctctt gaagctggtt accaggtgac atgcaacaaa 3780 gaggactgcc gcagggtcga catcacatcg ttccaaaagc taattggaga gttatcttat 3840 cttgctgtta tgactcgtcc ggacattgcg cattcggtga acaaattagc acaacgtaac 3900 aaggatcctc attccgagca tgaagcggca gccaagcaca tccttcgata tttggcaaaa 3960 actatcgact gggagatatg ctacacaaag caaggaaagg caattgagtg ttttgttgat 4020 gcagattggg caagtgatat gacgaaccga aagtcattca ctggctatgt atgcatgttg 4080 gcaggaggag cggtttcttg ggagtcaaga aagcaatcga cggttgccct cagttccacg 4140 gaggcagagt acatagcctt atcatctgca gcaaaagagg cgata 4185 // ID TDD3 repbase; DNA; INV; 5218 BP. XX AC AF002669; XX DT 16-AUG-1999 (Rel. 4.07, Created) DT 16-AUG-2009 (Rel. 14.09, Last updated, Version 3) XX DE non-LTR retrotransposon Tdd-3, complete sequence. XX KW L1; Non-LTR Retrotransposon; Transposable Element; ORF1; KW reverse transcriptase; endonuclease; ORF2; TDD3. XX NM TDD3. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-5218 RA Winckler T., Tschepke C., de Hostos L.E., Jendretzke A. RA and Dingermann T.; RT "Tdd-3, a tRNA gene-associated poly(A) retrotransposon from RT Dictyostelium discoideum."; RL Mol. Gen. Genet 257(6), 655-661 (1998). XX RN [2] RP 1-5218 RA Winckler T. and Dingermann T.; RT "TDD3."; RL Direct Submission to Genbank (06-MAY-1997)Pharmaz. Biologie, RL Universitaet Frankfurt, Marie-Curie-Strasse 9, Frankfurt 60439, RL Germany. XX DR GenBank; AF002669; Positions 1 5218. XX CC Comparison of several genomic Tdd-3 copies revealed that element CC insertion is orientation specific and occurs about 100 bp CC downstream of tRNA genes in the D.discoideum genome [1]. Tdd-3 CC encodes two overlapping open reading frames (ORFs) flanked by CC non-redundant, untranslated regions. The deduced amino acid CC sequence of ORF2 (1334-5116) is homologous to CC apurinic/apyrimidinic endonucleases and reverse transcriptases CC (RTs) encoded by the class of poly(A) non-LTR retrotransposons. CC Update: Another copy of the same sequence called TRE3A (98% CC identical to TDD3), was removed based on priority (TDD3 was CC published earlier than TRE3A). XX FH Key Location/Qualifiers FT CDS 165..1415 FT /product="TDD3_1p" FT /translation="MNSYADAAKMANPSTHMDEASENFIKEFSTKTAEIKE FT TYPKLSIGLKDISKYCETILNMTALSTYEKNNGFYPITLEFPKMDLKGLEE FT MRKTLSNYDIPFDKIYIGKGTRKMIHLMIKNEQSLLNIIRDRSKLGTFITS FT SRDFHVLTGRVRIPREKAGIVDALIKTAFAEEIPYAIYNTIYSTDHLIVFG FT LIECKIGDQMKSGQHITEIANFTIRTATKTYSSQKKNKKVETNNTEEPLED FT EMTDPPNTQNRIAIKSFQPKPQQQTNNPANNQFEKQSTQEKKAQAKKSFQP FT FPPKNPAESQFVKPTTTTTTTSNAIQKDNANKTSNNIHSSSENREKTLNIP FT PSHKRKNEEIHSNNEVTQLSPTINRSNPYIPSTPKTPVKKESKGIIESFIE FT SIYPSPSKPQLYDSQEEIDIIS*" FT CDS 1940..5116 FT /product="TDD3_2p" FT /translation="MATVIEAIRISNNLMDMDQLNKRPTFSRTIHNTNNNL FT TRILERRLDRIYLNNSLINYSQLYLRNLIPPKINDIPLSDHNFLSTTFTLH FT NIQTNMRRWRLKKSSILSSIMLKNIDFLLNGYSRELSSNHNSISFSQLNSL FT LNKIKQLYTEFQKQNDYNNKANIKNLISLLETEFKDQAFATLSAINESKKR FT EEQLKQELNNYCEETSLKYISARIKKRHNDFTINAVKDTQGRTINKQELIE FT EEYVKYYSNLYDYKEDDPPSHYEILENWTVTRDSTWDNLENEFTSQEILEV FT IKQLNPHKSPGPDGIPNLFYITHKEKLAPILASAFNDTLRNPHLISKNYKE FT GLIITIPKKGDPELIKNRRPITLANCIYKIHSKLINNRIIPILTKVINHNQ FT KGFVPGRFILHNIISINELINYCNDKRINGIITLYISKKLLTRSHTVQSQI FT TTTHQHSNQYINLIMNLLTKSEARIEINGRTTIPFEIKRGVKQGDPLSPTL FT FVLVIEALARKILQDDRITGLPLNNSNHREKFQSFADDSASMVPDSQQLEL FT VLQHFNSFCKATPQNKHRQIFINSYRQPRQYQPKNSNIYKSRKILGILLHR FT QRDNKKNARNIKHNKIILSTMENHFLNNQNQNQPSSKAFALSKLTYYSYVE FT NFKEEELNQINKLVEWFLSAPNNKNASGFQVINLMRTKRARYPLKVGGWNI FT WNIELRQLAQKLWIINQFILALETKQTNSSHHKSWEYQIQKNKFTSRYLKE FT NLDEWNKIRIKKAIFNNNLTSIKNENGQPLSLAEWYTTIQDQNPTIPKTEF FT QSSLNLRGYSYNQLFNNILKIKDPKTRDTMFRFHARCLPINYLHNKQCPLC FT KEDMSKDPYGHLFFSCKKTKQFIKINKLKNFIYITTGKGKNWHHTRIRQNN FT NFINRITPKLSPIAKKDQEVNADYANHRFHKEYFEWNYKAIDYDLTRSFAY FT RNLMALILHNIWIWICNQIYTQDPLTDESLSYSSLLKKWHKLATLEYIKKA FT KDLKNLSAKDHNNFKDPKILLTSTIKLRKTTANYYCIPESSLPNIISFDQF FT I*" XX SQ Sequence 5218 BP; 2177 A; 1268 C; 667 G; 1106 T; 0 other; ccgtaccgcg atcaagagga tacaagatac acgtgaaaag taattcatcc ttttctatct 60 ttaatctcgg tcaattttaa ccaatctttc aaaaaaaatc accaatccac cacgatttac 120 agatctaaca taccaaatcg ccgattcaaa ggaatcccat cgagatgaat tcctacgcag 180 acgcggctaa aatggccaac ccctccaccc atatggacga ggcatcggag aacttcatta 240 aggaattctc cacaaaaaca gcagagataa aggagaccta ccccaaacta tcaataggcc 300 tcaaagacat ctccaaatac tgcgagacca tattgaacat gactgcgctc tcaacctatg 360 agaagaacaa cggattctac ccaattacac tcgaattccc caaaatggat ctcaaggggt 420 tagaagaaat gagaaaaaca ctctcaaact acgatatccc attcgacaag atatacatag 480 gcaaaggaac tagaaaaatg atccacctga tgattaaaaa cgaacaatct ctcctaaata 540 tcattagaga tagaagcaaa ttgggaacct tcatcacctc cagtagagac ttccacgtcc 600 tcacaggaag agtaagaatt cccagagaaa aggcaggaat agtggacgcc ctcatcaaaa 660 ctgcctttgc agaagaaata ccctatgcaa tatacaatac tatatactcc accgaccacc 720 tcatagtatt cggtctaata gaatgcaaaa taggcgatca aatgaaatct ggccaacaca 780 ttacagaaat cgcaaacttt accatcagaa cggccaccaa aacctactcc tcacaaaaga 840 aaaacaagaa agtagagaca aacaacacgg aggaaccatt agaagatgag atgacagatc 900 ctcctaacac ccaaaacagg atagcaatca aatcctttca acccaaacca caacaacaaa 960 caaacaaccc agcaaacaat cagtttgaaa aacaatcaac acaagagaaa aaagcccaag 1020 caaaaaagtc atttcaacct tttcccccca aaaacccagc agaatctcaa tttgtcaaac 1080 ccaccacaac cactaccaca acatccaacg ccatccaaaa agacaatgcc aacaaaacga 1140 gcaataacat ccattcatcc tcggaaaaca gagagaagac actaaacatc cctccatcac 1200 ataaacgcaa gaacgaggaa atccactcaa acaatgaagt cacccaactc tcaccaacaa 1260 taaacagaag caacccttac atcccatcca cccctaagac acctgtcaaa aaagaaagta 1320 aaggaatcat tgagagcttc atagagagta tctacccttc accatccaaa ccgcagctat 1380 acgactccca agaggagatc gatatcatct cataatagta gtaatcaaaa tcaaccaatg 1440 gaatgtcaga ggttggggca ctgacacctc cttcaaaaac aaaacagact ttatcaaatc 1500 ccaatccccc ccaaactcat tggcactaac catagtcaat gaactcaacg ccgacgcaac 1560 acaagcacac cagctattcc ctggctccat cattagcgca accaacagag gcaacggaat 1620 aggaatacta aaccacaaca accaaaacat aaaactctca ccaatattca taatagaagg 1680 tagactaata atatcagaca ttctaataaa agataccaca acgagaatct tggccatata 1740 cgccccggcc caacctgata aaagaaaaac actagcttca acactaaaca aacacttcaa 1800 caaccaatac cacaacctaa cgtctaaccc taataaaaac atcgacatca tagcaggaga 1860 cttcaactgt ttagacttca atgacaatca cacatcaaat gatgaccaag gcaacttgac 1920 aacacaatcc ccagatgaga tggccacagt aatcgaagcc atcagaattt caaataacct 1980 aatggatatg gaccaactaa ataaaagacc caccttctca agaacgatac acaacacaaa 2040 caataacctc acaagaatct tggaaaggag actagacaga atatacctta acaatagctt 2100 aatcaactac agccaattat acctaaggaa cctaatcccc ccaaaaatta acgacatacc 2160 tctatcagat cacaacttcc tatcaaccac cttcactcta cacaacatac agacgaacat 2220 gcgcagatgg agattaaaaa aatcctcaat cctctcaagc ataatgctta agaatataga 2280 cttcctactt aacggttatt caagggagct gtcatccaac cataactcta tttccttctc 2340 tcaactcaac agcttactga acaaaataaa acaactatac accgaatttc aaaaacaaaa 2400 cgactacaac aataaggcaa acatcaaaaa tctaatctcc ctactagaga cagaatttaa 2460 agaccaagcc tttgcaaccc tttcagcaat aaatgaatct aaaaaaagag aagaacaact 2520 taaacaagaa ttgaacaact attgcgaaga aacctcactc aagtacatct ccgcgagaat 2580 caagaaaaga cacaatgatt tcaccatcaa cgcagtaaaa gatacacaag gtagaacaat 2640 caacaaacag gaattgatcg aagaagaata cgtaaaatat tactcaaatc tttatgacta 2700 caaagaagat gacccaccat ctcattatga aatcttggaa aattggacag tgaccaggga 2760 ctcaacatgg gacaaccttg aaaatgaatt cacatcacaa gagatcctag aagtaataaa 2820 acaattgaac ccacacaaat ctccaggccc agatggaatt ccaaacttat tctacataac 2880 acacaaagaa aaactagctc caatactggc ctcagcattc aacgacactc taagaaatcc 2940 tcatctaatt agcaaaaatt acaaagaagg cctcattatc acgataccta aaaagggaga 3000 tcccgaacta atcaaaaaca gaagaccaat tacactggct aactgtatat ataaaatcca 3060 ctcaaaacta ataaacaata gaataatccc aatactaacg aaagtgatca accacaatca 3120 aaaaggtttc gtaccaggca gattcattct gcataatatc atatcaatta acgaattaat 3180 caactattgt aatgataaaa gaattaacgg aataattaca ctatatattt cgaaaaagct 3240 tttgactcga tctcacacgg ttcaatctca gatcactaca acacatcaac attccaacca 3300 atatatcaac ctgataatga atctactcac caaatctgag gcaagaattg aaattaatgg 3360 tagaactact ataccctttg agatcaaaag aggagttaaa caaggagatc cactatcacc 3420 caccctgttc gtccttgtaa tagaggcttt agctagaaaa attttacaag atgaccgaat 3480 tactggcctt cctctaaaca acagcaacca cagagagaaa ttccaaagct ttgcagacga 3540 ttcggcttca atggtcccag actctcaaca acttgaatta gtactacaac acttcaattc 3600 attttgcaaa gccactcctc aaaacaaaca tcgacaaatc ttcatcaatt cttataggca 3660 acccagacaa taccaaccaa agaattccaa tatctacaaa tccagaaaga tacttgggat 3720 acttcttcac aggcaaaggg ataacaagaa aaatgccaga aatattaaac acaataagat 3780 catccttagt actatggaaa accactttct caacaatcaa aaccaaaacc aaccttcctc 3840 aaaagcattc gcactctcta aattaaccta ctattcatat gtagagaact ttaaggaaga 3900 ggaattaaac caaatcaata aactagtcga atggttttta tcggcaccaa acaacaaaaa 3960 tgctagtgga tttcaagtta ttaatttaat gagaaccaaa agagcaagat acccactaaa 4020 agtagggggc tggaacattt ggaatataga attgagacaa ctggcgcaga aactttggat 4080 cataaaccag ttcatcctag cactagaaac caaacaaaca aactcatctc atcacaaaag 4140 ctgggaatat cagatccaaa aaaacaaatt cacatccaga tacctaaaag aaaacttaga 4200 cgaatggaac aaaataagaa taaaaaaagc aatcttcaac aataatctca catccataaa 4260 aaacgaaaat ggacaaccac tttcactggc agaatggtat accacaattc aagaccaaaa 4320 tccaaccatc cctaaaacgg aattccaatc atctctaaac ttaagaggct acagctataa 4380 tcaactcttc aacaacatac tgaagataaa agaccctaaa acaagagaca caatgtttag 4440 attccatgcc agatgccttc caataaacta ccttcacaac aaacaatgcc cactctgcaa 4500 agaagatatg agcaaagacc catacggtca cctgttcttc tcatgcaaga aaacaaaaca 4560 attcataaaa ataaataaac tcaaaaattt catctatatt accacaggca aaggtaaaaa 4620 ctggcaccat acaagaatta ggcaaaataa caatttcatt aatagaataa ctccaaaatt 4680 atcaccgata gccaaaaaag atcaagaagt gaatgcagat tacgccaacc acagattcca 4740 caaagaatac tttgaatgga actataaagc aatagactat gatctaacta gatcattcgc 4800 atatagaaac ctaatggccc ttatcctcca caacatttgg atttggattt gcaatcaaat 4860 atacactcaa gatcctttaa cagatgaatc actatcatac agctcccttc taaagaaatg 4920 gcacaaattg gccactctag aatacatcaa aaaagcaaaa gatctaaaaa acttatcagc 4980 aaaagaccat aataacttca aagatcctaa gatccttctc acctcaacga tcaagctaag 5040 aaaaacaaca gctaattact attgtattcc agaatcatct ctcccaaata ttatatcttt 5100 tgatcaattc atatgattaa aataaccctt cagcttaaat gataagcctt taaataaata 5160 ttaaataaaa ctcgtattaa cacagatgga catatatcaa tcttgttaat ccaatatt 5218 // ID Vingi-1_BF repbase; DNA; INV; 3146 BP. XX AC . XX DT 28-JUL-2009 (Rel. 14.07, Created) DT 17-AUG-2010 (Rel. 15.09, Last updated, Version 2) XX DE Amphioxus Vingi-1_BF autonomous non-LTR Retrotransposon - DE consensus. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; INGI; KW horizontal transfer; I group; Ingi-1_BF; Vingi-1_BF. XX NM Ingi-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3146 RA Kapitonov V.V. and Jurka J.; RT "New families of I non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1533-1533 (2009). XX RN [2] RP 1-3146 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX CC Originally classified as Ingi [1] and re-classified as Vingi [2]. CC [1] Ingi-1_BF is a consensus sequence of the young Ingi-1_BF CC family of non-LTR retrotransposons that belongs likely to the CC Ingi clade from the I group (based on the RT domain phylogeny). CC However, Ingi-1_BF, analogously to other Ingi-like elements from CC lancelet, sea urchin, sea slug and middle-African hedgehog, CC including Ingi-2_BF, I-1_AC, I-2_AC, I-1_AAl, Jockey1_SP and CC Jockey2_SP, does not code for the ribonuclease H domain. All CC known Ingi-like non-LTR retrotransposons contain only one ORF. A CC 2800-bp 3' terminal portion of Ingi-1_BF is 71% identical to the CC Jockey2_SP from the sea urchin genome. Most likely, such high CC sequence identity is due to a horizontal transfer event. CC [2] All related non-LTR retrotransposons described above are CC re-classified as Vingi. XX FH Key Location/Qualifiers FT CDS 1..3123 FT /product="Vingi-1_BF_1p" FT /note="APE and RT domains." FT /translation="MYGGHKRSVLVAVPVRLVLRILLPHHYNCQSYNHNFA FT STIMATNDRPVDEAQRHVHPHGSFSGPALCIISFNTEGLTGAKQDLLAELC FT VKNNCDILCLQETHRGPTRTRPRINGMTLIVERPHDQYGSAIFVRHGLNVD FT KFTLTDRNNIEVLSTDLMGVNLSSVYKPPAETFNLPSVVTDGLLNIVIGDF FT NSHSTTWGYRQTNEDGDLVENWADTNRMSLIHDPKLPASFNSSRWKRGYNP FT DLVFASETIASQCEKEVLQPVPHSQHRPIALRVNAVIVPRTVPFRRRFNLK FT KANWEKFAEDLDLYIQDLPASTEHYDTFVKMIKKSSRQNIPRGCRTKYIPG FT LNEDSKALYEEYVNCFESDPFSAETLECGELLTRTITESRQRRWREFVEST FT DLTHSSRKAWKNIRILGNDFTRAQPVPQVTADQVAHQLLVNSHGNPNHHPP FT RAKLPQVDAASSEGTSPFTRPFSLDELCNAIKDMKNNKAAGLDDILCEQIK FT HFGPLALQWLLNMFNHSLSTNRIPKIWRKSRVIALLKPGKDPSIPKNYRPI FT SLLCHTYKLFERLLLNRIAPFVDELLIPEQAGFRPGKSCTGQLLNLTQYIE FT DGYEKGLITGTVFVDLSAAYDTVNHRILTKKLFEITKDVRLTELIQNLLSN FT RRFFVDLNGNRSRWRRQKNGLPQGSVLAPLLFNIYTNDQPGHPDTRRFLFA FT DDLSIGAQGRTFKEVENTLTDALVGLTPYYEANHLRANPDKTQVSVFHLKN FT READRQLKVCWQGKWLTSTNKPVYLGTTLDRTLSYKTHILNTRMKVDARNN FT ILKKLTNTKWGADASTVRGTALGLCFSTAEYAAPVWCRSAHTAKLDTALNS FT ACRAVSGCLRATRVDDLYLLCGIAPPHIRRAVAAQREKLKQETDPHHVLHL FT HTPVPKRLKSRRSFMHSVEALSSSAESQRMAMWSHHLQTAPHRLNLAPKES FT LPPGAEEAWPTWSCLNRLRTGTGRCKTLMAKWGLSPDGQTACDCGQEQTMK FT HLLVCPLLSEPCTKEDLEALTPRGRECVEYWSGTV" XX SQ Sequence 3146 BP; 902 A; 900 C; 727 G; 617 T; 0 other; atgtatgggg gccataaaag gagcgttctc gtcgcggtcc ctgtcaggct cgtcctgcga 60 atccttctac cacatcacta caactgccaa agctacaacc acaacttcgc ctcgaccatc 120 atggcgacga acgaccgacc ggttgatgag gcacagcgtc acgtgcatcc ccacgggtcc 180 ttctcaggac cggcactgtg catcatctca ttcaacacgg aaggtcttac tggagccaag 240 caagatctgc tggccgaact ctgtgtaaag aacaattgtg atatcctgtg tctccaggag 300 acacaccgtg ggcccacaag aactcgccca cgcatcaacg gcatgaccct catagtggag 360 agaccacatg accagtacgg tagcgccatt tttgtacgac atgggctaaa tgttgataag 420 ttcactctaa ctgataggaa caacattgaa gtcctgtcca ctgacctcat gggcgtcaac 480 ctaagctcgg tctacaaacc gccagcggag acattcaacc tcccaagcgt agtcaccgac 540 ggactcctta acatagtaat cggagacttt aacagccata gcaccacctg gggctaccgg 600 caaacaaacg aggatggtga cctcgtcgaa aactgggcag acacgaatcg gatgtccctc 660 atacatgacc caaaactccc agcctctttc aacagctcca gatggaaacg tggctacaac 720 cctgacctgg tgtttgcctc cgaaaccatt gctagtcaat gcgagaagga agtgttgcaa 780 ccagttccac actcccagca tcgcccaatt gcgctcagag tgaacgcagt tatcgtacca 840 cgaacagtcc ccttccgacg gaggttcaac ctgaagaagg ctaactggga aaagttcgct 900 gaagacctgg atctatacat ccaggaccta cccgcatcga ctgaacacta tgacaccttt 960 gtcaaaatga taaagaaatc ctcacggcaa aacatccccc gaggatgtcg gacaaagtac 1020 atcccaggcc tgaatgagga ttccaaggca ctgtacgaag aatacgtcaa ctgctttgag 1080 tccgacccct tcagcgcaga aacactcgag tgcggagagc tactcacacg cacaatcacg 1140 gaaagtcggc agaggcgatg gcgagagttt gttgaatcaa ctgacctgac tcatagcagc 1200 cggaaggcat ggaagaacat tcgaatcctc gggaatgact ttactagagc acaaccagtg 1260 ccacaagtca ctgcagacca agttgcccac caactccttg tgaacagtca tggcaacccc 1320 aaccaccacc ccccaagagc caaacttcca caagtagatg ccgcaagctc tgaaggaacc 1380 tcacctttca caaggccctt cagcttggac gagctatgca atgccatcaa ggacatgaaa 1440 aacaataaag cagctggact tgatgacatt ttatgcgagc agatcaaaca ctttggtcct 1500 cttgcacttc agtggctgct caacatgttc aaccatagtc tgagcaccaa caggatcccc 1560 aagatttgga ggaaatctag agtgatcgcc ttactaaaac ctggcaaaga cccctccata 1620 cccaagaact acaggccgat ttccttgctg tgccatacat acaagctgtt tgaaagactg 1680 ctactcaacc gcatcgcacc gtttgtggat gagctcctta tccctgagca ggcagggttc 1740 agaccaggaa agtcatgtac tggccaactc ctgaacctga cacagtacat cgaggatggc 1800 tacgagaaag gcttgatcac gggaactgtc tttgtggacc tctccgcagc atatgacacc 1860 gtgaaccaca ggatccttac aaaaaagctc tttgagataa cgaaggatgt aagactcact 1920 gaactgatcc agaacctact ttcaaacaga cggttctttg tagacctgaa tggcaaccgc 1980 agtagatggc gaaggcagaa aaacggtctc ccccaaggca gcgtccttgc tccgctgctc 2040 ttcaacatct acactaatga ccaaccagga catcccgata ccaggagatt cctgtttgct 2100 gacgacctca gcattggtgc acagggaagg actttcaagg aagtagagaa caccctcact 2160 gatgcccttg taggcctcac cccttactac gaagcaaacc acctgcgagc taacccagac 2220 aaaacacagg ttagtgtctt ccatctgaaa aaccgggaag cagaccgcca actgaaggta 2280 tgctggcagg gcaagtggct gactagcacc aacaagccag tctacctcgg caccacctta 2340 gacaggaccc tgtcatacaa aacccacatc ctgaacacca gaatgaaagt ggatgccagg 2400 aataacatcc ttaagaagct tacaaacact aaatggggcg cagatgccag taccgtgcga 2460 ggtacagctc ttggactgtg tttttctact gcagagtatg cggctcccgt gtggtgcagg 2520 tcagcgcaca ccgcaaagct ggacacggcc ttgaactcag cttgtagagc cgtctccgga 2580 tgcctacgtg ccaccagagt tgatgaccta tacctgctgt gtggtattgc tcccccacac 2640 atcagaagag ccgtggcagc acagcgagaa aaactgaaac aagagactga ccctcaccat 2700 gtactccacc tgcacactcc tgtgccaaag agacttaagt caagacgtag tttcatgcac 2760 tcagttgaag cactcagcag cagtgcagag tctcagagga tggccatgtg gtcccaccac 2820 ctccagactg caccacacag acttaacctc gccccaaagg aatctcttcc tccaggagca 2880 gaggaggcat ggccaacctg gtcatgtctc aaccgcctta gaacaggcac aggaagatgc 2940 aagaccctca tggcaaagtg gggtctcagc ccagatggac agactgcttg tgactgtgga 3000 caggagcaga ctatgaaaca tcttcttgtc tgccccctcc tgtccgaacc atgcaccaaa 3060 gaagacctgg aagctttgac ccccagagga agggagtgtg tggagtattg gagcgggaca 3120 gtgtagtgat gacacgacga agaaga 3146 // ID Penelope-1_NVi repbase; DNA; INV; 2022 BP. XX AC . XX DT 01-JUL-2009 (Rel. 14.07, Created) DT 01-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Penelope-type element: consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2022 RA Bao W. and Jurka J.; RT "Penelope-type element from Nasonia parasitic wasp."; RL Repbase Reports 9(7), 1550-1550 (2009). XX DR [1] (Consensus) XX CC The both ends may be incomplete. XX FH Key Location/Qualifiers FT CDS join(54..1466,1450..2022) FT /product="Penelope-1_NVi_1p" FT /translation="NSQKSCADNKKWVINLLNTELPSEVIDVVSMGHNFNK FT NIKLNKNDIITTIKNFECSTYALDAEARMLLRDILIKNIETACNNVLPYNR FT FNKDFCQKVALTKEFCFNNPSLFFTRADKGSITVCLDYEDYKEKMFNLLSD FT TKTYKTVKRNPLVSLQKNVYNILSDLNDRDSLDLKYDKFKLTQTNTVLPRA FT YGLPKIHKKDVPLRPIISTVNSPTHFLANIIDKTLRGFLKNPASHIDNSFD FT LIKKLTNIIIPQGYILVSLDVISLFTNIPLELVLCSLDKRFQKIQSHSKIP FT FECIIDIVKFLFDNTYFLFDNIIYKQIFGTPMGSPISPLFADLVMEDLETT FT VLMELKNNHDCIPLFYFRYVDDTILCVRENELDLILNKFNSYNRYLQFTHE FT LQNNNMIPFLDITLTIKNNSIITNWYQKPTNTNRVLNFNSNHTIQLKRNII FT YNLVDRALLLSHKQFHEENLVMVKKILWLKKSENSYPDKFIDSCIRYRVRY FT HTFGNSSIRNKVPTVRNVMSIPFHEKLFFTCKSSFKKYGFTIVPRMSNKLS FT DVIKLGKDKLDKWCETNVVYEISCNDCNATYVGQTKRNLKRRIDEHKKTSL FT DKILPIPTHINNFSHSFDFENVKILDKEPNNFKRLISESIFININNNNINK FT QEDFNMISKQYNK" XX SQ Sequence 2022 BP; 778 A; 286 C; 280 G; 678 T; 0 other; ttccaattag tgaattctct cgacaagaaa tataataatc taattgatag taaaattccc 60 aaaaatcgtg tgctgacaac aaaaaatggg ttataaatct gttgaatact gaacttcctt 120 ctgaagttat tgatgttgta tctatgggtc acaatttcaa caaaaatatt aaactcaata 180 aaaatgatat tattacaact atcaaaaact ttgaatgtag tacgtatgcg ctagatgctg 240 aagccagaat gttacttaga gatatactta ttaaaaatat tgaaacagcg tgcaataatg 300 ttttaccata taatagattc aataaggatt tttgccaaaa agtagcacta actaaagaat 360 tttgttttaa taatccgtcc ttatttttta ctcgagcgga taaaggtagt ataacggtgt 420 gtctggatta cgaagattat aaagaaaaaa tgttcaactt gctgagtgat acaaaaacgt 480 acaaaactgt aaaaaggaat cctcttgtat ctctgcaaaa aaacgtatat aatattttgt 540 ccgatttaaa cgatagagac agcttagatt taaaatatga taaatttaaa ctaacacaaa 600 caaatactgt tctacccaga gcatatggac tgccaaagat tcacaaaaaa gatgttcctc 660 tcagacctat tatttcaaca gttaatagtc ctacacactt cttagctaac atcattgata 720 aaactcttag aggtttttta aaaaatccgg cttcacacat tgacaacagc tttgatctaa 780 taaaaaaatt aactaatata attattcctc aaggatatat cctagtttcc ctcgatgtga 840 tttctttatt tacaaatata cctttagagt tagttttatg cagtttagat aaacgctttc 900 aaaaaattca aagtcattca aaaattccgt ttgagtgcat aattgatata gtgaaattct 960 tgtttgataa cacatatttt ttgtttgaca atataattta taaacagatc tttggcacac 1020 ccatgggctc ccctatctca ccgttgtttg cagatttagt tatggaggat ttagaaacta 1080 ctgtactcat ggaactaaaa aataatcatg actgcatacc tttgttttat tttagatatg 1140 ttgacgatac aattttatgc gtgagagaga atgagttaga tctaatctta aacaaattta 1200 atagttataa tagatactta caatttacac atgagttaca aaataataac atgattcctt 1260 ttttagatat cacattaaca ataaaaaaca atagtattat tacaaattgg tatcaaaaac 1320 ctacaaacac taatagagtt ttaaatttta attcaaatca tacaatccaa ttaaaacgta 1380 acattattta caatctagta gatagagctc ttcttctctc tcataaacaa ttccatgaag 1440 aaaatttagt tatggttaaa aaaatctgaa aatagttatc cggataaatt tattgatagt 1500 tgtattcgtt atcgtgtgag atatcacact tttggcaatt caagtattag aaataaggta 1560 cctactgtta gaaatgtcat gtcgattccg tttcatgaaa aactattttt tacatgtaag 1620 tcttccttta aaaaatatgg ctttacaatt gtacctcgta tgtcgaacaa attaagtgat 1680 gttattaagt taggaaagga taagctggat aaatggtgtg agactaatgt tgtgtatgag 1740 atttcgtgca atgattgtaa tgctacttat gtaggccaga ctaagagaaa tttaaaaagg 1800 agaatagatg agcataaaaa aacgtcttta gataaaatat tacccatccc cacgcatata 1860 aataattttt cacatagttt tgattttgaa aatgtcaaaa ttttagataa agaaccaaat 1920 aattttaaaa gactgatttc agaaagtatt tttattaata ttaataacaa taatatcaat 1980 aaacaagaag attttaacat gattagtaaa caatataaca aa 2022 // ID BEL-60_AA-LTR repbase; DNA; INV; 529 BP. XX AC supercont1.98; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-60_AA_; KW BEL-60_AA-I; BEL-60_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-529 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.98; Positions 1791719 1792247. XX SQ Sequence 529 BP; 183 A; 106 C; 90 G; 150 T; 0 other; tgtgccgaca agacccctcg tttcggcaac gctaaacgtc acaccactac aacgaccaat 60 gacagtacgg agatagagaa gaatgacgtt acgggagaaa tgtcaggcaa ctaggcttag 120 agatatctaa agactgcatc gtcatctagc aatatttgca tatgaaattt aattcaacag 180 tgaaattctt taattaaagt gtatttctga aatccattac gacaaattga atttagttgg 240 tactaattac ctatttagac cgtaagtgaa cttgaatagt atgtaattca ataattataa 300 tgcttctatg ttcgattaga gctgccacac agtaattcac gtcacgtaga ggatcagcga 360 cccaaacttg ctaaaataac ataacggtca ccaaatttgt aagcctgcga tagttctatg 420 aatatctctc atcagcaatt tattaataaa aatctatatt ttagcttaaa gctaacatca 480 cactgaaacc ggagtttgct ctcaagagtt ggtgtaccct aaccccaca 529 // ID BEL-32_AA-LTR repbase; DNA; INV; 290 BP. XX AC supercont1.240; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-32_AA_; KW BEL-32_AA-I; BEL-32_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-290 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.240; Positions 238053 237764. XX SQ Sequence 290 BP; 91 A; 62 C; 49 G; 88 T; 0 other; tgattcttta tatgatccct tgaccttatt ggccaaaccc aaaaaactat gtgcaagaaa 60 acaacgaatg aagagaaaat aatcaagaga atcgagcaga gagaaacacg tcgccttccg 120 ctatgatcaa cctcatgtaa tttcataagt aaaaatatac gaaacgaaaa gcacgccatt 180 tttccaaact tgctttattc cgagaattat tcagtttttc gctataattt cgtcggtttt 240 cattcgtcgt tggcttcgtc cgcttttgcg tgttgctccg tcgattaaca 290 // ID BEL-121_AA-LTR repbase; DNA; INV; 460 BP. XX AC supercont1.19; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-121_AA_; KW BEL-121_AA-I; BEL-121_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-460 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.19; Positions 1554530 1554071. XX SQ Sequence 460 BP; 175 A; 93 C; 72 G; 120 T; 0 other; tgttcaggac ggcgcccgat cgtccgaata atttctcacc gtgtacctgc tgccaactgt 60 gagaaactaa gaaaaaactc catcgatgac ttcgcagccg atcgtttgaa aaacacaaaa 120 cagtaagaaa cagatagcct actagacaca cacgagaaac acagtttgag acggtcagtg 180 aataacagtg aaataattta cagattgttg gacaggattc acacaaaaaa gggactaaaa 240 atgtaagcta aacattaaat ttgttattct acttatatct aaatcatgca aactatgaac 300 taaattgtaa aatttcctat tcatgcaaaa ttgaacttat gaactatttt tatgcttaaa 360 actaatgaaa ttaataaatt ttgcagctaa aagcaactcc acaacccaaa atacgagttc 420 gctctttgga ttgtccgaaa atccatcacc tgtcgcaaca 460 // ID BEL-90_AA-I repbase; DNA; INV; 5968 BP. XX AC supercont1.287; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-90_AA_; KW BEL-90_AA-LTR; BEL-90_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5968 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.287; Positions 40162 46129. XX CC Positions [5017-5574] - Integrase core CC 'ATATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(22..1965,1969..5967) FT /product="BEL-90_AA-I_1p" FT /translation="MKPAEDGHNLEKTGYDCAHCEKPNHADDNMVLCEKCQ FT KWFHFMCAGVTSDIEKLPWSCDDCRKASKTADNTSIQGESGTILEVSDSTD FT QRANNDTKTSTADEDDEEERKHQKELLLMKQKFERQFEREKEKMMMQIRLE FT REMLERKKAVEAELHKMRSELYEEFHGSLDPLDQEKGAVGGNILPSQEHIE FT ADLEKLWRDKLKFQNDQLRPVADPRGAFPKFSTPKDVPTQLGTINNHQESS FT ESGPKEVTPLRNPQVPASKVNKPSLPIPSFRQPEVDSPNEVLMHQESQPVT FT NANPIPTVLNDRRHQVLTVYQEQELTRAQIAARKGPFAKLPVFTGRPEEWP FT LFISSFNNGNAACNWTDLENLGRLQESIKGPALEAVRSRLLLPESVPRVIE FT TLRHLYGRPEQLLHSLMQKARRADPPRTDRLTTFISFGMIVQQLCDHLVAS FT GMVDHLVNPMLITELVEKLPPSTKMEWVRHKRQQPAVDLSTFSDFLSEIVS FT EATEATLYTNPRIDGRPNREGKEKRSRIKETEGFLNAHVGVEQSSIPSINQ FT HNRRPCRGCNSMEHRTRVCEDFRKLVWGDKVKIVEKWKICKMCLNEHGESR FT CRFKGHCNVGECKERHHSLLHPPNSIAPLPTNCHVHNSEQHPVIFRMVPIL FT HHEGRSCDVIAFLDEGSSYSLMEGSVADQLKLKGAWEPILVKWTAGMSRLE FT RESRRVDVSISSSGSNERFLLRNVHTVQELQLPEQKIRFAEVAARFKHLCG FT LPVADCLGGSPKILIGLKHLHVYAPLESRIGNPGEPIAVRTRLGWTIYGPQ FT GNGNVATGFAGHHTTGELSNIDLQELLRKHFTLEDSGLVVNVLPESNEDRR FT ARMLLEQTTTRVGEHFETGLLWRNDQPQLPDNYQMALKRMKSLEYKLAKNP FT ELAQNVEKQIEDYQRKGYAHIATNEELMEGEAGKVWYLPLNVVLNVKKPGK FT VRLVWDAAAAVQGRSLNSELLKGPDLLSSLPSVMCPFRERPIAFGGDIAEM FT YHQVRIRECDKSAQRFLYRPNASGPPTIYVMDVATFGSTSSPCSAQYIKNL FT NAQEYAKEFPDAVEAIVNKHYVDDYLDSTFTVREAIERANQVAFIHSKAGF FT NIRNWVSNSTEFLQHFDGQPDNRLIHFNCDKSSYTERILGMSWNTSKDVFV FT FAAVLRDDLQSYLRGEKLPTKRTLLSIVMSFFDPLGLYTLFTVFGKMIIQD FT LWRNGCSWDETIDADSAKKWSRWIALVPQVQAMEIPRCYFSKVKPSDYQNL FT ELHVFADASEEAYGCVAYFRIFVNGQPRVALVSAKSKVAPLQYTSIPRMEL FT LAAVLGARLATAVKANHSEKVTKVIFHIDSATVLSWIHSDHRKYKQFVAYR FT IGEILSLTNPHEWIWVPTKNNIADVLTKWGKNGPPLDSNGEWVRGPEILYK FT ANEEWFERELPAPGVREELRAHHLFHEIVFSQNLVDTTRFSRYAILVRSLA FT CVFRFISNCRKRVQGQPIETFQANPKLGKSVIRDVFSRKVPLKREEYRLAE FT NYLWRTAQQEGFADERKTLLKNKELPQSKWHFIERSSPLYKLAPFLDEEGV FT IRMEGRSAYAEFISFEQRFPIVLPKGHDITSKLLLHYHEKFGHANRETVVN FT ELRQRFYVPHVRVAILRVMKDCSRCKIQRCRPEVPRMAPLPVQRLTPMLRP FT FSYVGVDYFGPVVVTVGRRSEKRWICLFTCLVTRAIHMEVAHSLNSQSCVM FT AIRRFICRRGAPLEIFSDNGTNFLAASKELSQKVRYIELECADIFTDARTR FT WNFNPPAAPHMGGIWERLVRSAKEALKALHDGGKLTDEILLTVLTEAEDMI FT NSRPLTYVPQESADVEALTPNHFLRGLPSGEQDQVRTLTNSAEALRDNYKQ FT SQKLADVLWQRWLTEYIPTINLRSKWCKEQDPVNEGELVYIVDGNNRRTWI FT RGIVVKVIRGIDGRIRRALVKTSKGVYRRAVAKLAVMELRSKSDPISDSGP FT ELREGE" XX SQ Sequence 5968 BP; 1734 A; 1321 C; 1496 G; 1417 T; 0 other; aaattctcaa aaattattga tatgaagcct gcagaagacg gacataacct tgaaaagacc 60 ggctatgatt gtgcgcactg cgaaaagccg aatcatgcag atgataatat ggtgctttgc 120 gagaaatgcc aaaaatggtt ccattttatg tgtgctggag taacgtcgga catcgagaag 180 ctcccttggt cgtgtgatga ttgccgaaaa gcgagcaaaa cagcggacaa tacctccatt 240 caaggcgaaa gtggtacaat attggaagtt tccgattcga cggaccaacg agcaaataac 300 gacacgaaga cgtcgacggc cgacgaggat gacgaagaag aacggaagca ccagaaggag 360 ctattgctga tgaaacaaaa atttgagcgg caattcgaaa gggaaaagga aaaaatgatg 420 atgcagatcc gtctggagag agaaatgctg gaaagaaaga aagctgtaga agcggaatta 480 cacaagatgc ggagcgagct gtacgaagaa tttcatggca gtctcgaccc cttggaccag 540 gagaagggcg ccgtaggtgg taacatctta ccaagtcagg agcacattga agcggatttg 600 gaaaagttgt ggagggataa gctgaaattc caaaatgatc agctcaggcc tgttgctgat 660 ccccggggag cttttcccaa attctccacc cccaaagacg tgccaacgca actgggtacc 720 atcaacaatc atcaagagtc gtcagaaagt ggtccaaagg aagtaacgcc gcttcgaaac 780 ccgcaagttc cagcaagtaa ggtaaacaaa ccgtctttac cgattccttc ctttcgtcaa 840 ccggaggttg attcaccaaa cgaagtactg atgcatcaag aaagccagcc cgttacgaat 900 gccaatccga taccaacagt cttgaatgat cgtcgacatc aggtactaac cgtatatcaa 960 gagcaagagt taacaagagc ccaaatcgca gctcgaaaag gaccctttgc taagctgcca 1020 gtgttcacag gccggccaga agagtggcca ctgtttatca gcagcttcaa caacgggaac 1080 gcggcttgca actggactga cctggagaat cttggaaggc ttcaagaaag catcaaaggg 1140 cctgcactgg aagctgttag aagcagatta ttgctgccgg aatcagttcc aagggtgatc 1200 gaaacgctac ggcacctcta tggcagaccg gaacaactgc ttcattcatt gatgcaaaag 1260 gcgaggagag cagatccacc tcgtactgat cgtctcacca cctttatcag ttttggaatg 1320 attgttcaac aactatgtga ccacctggtg gcatcaggaa tggtcgacca tctcgtaaac 1380 cctatgctca ttacagagct tgtagagaag ttgccaccta gcacgaaaat ggaatgggta 1440 agacacaagc gtcaacagcc agcagtggac ctcagcacgt tttcggattt cctttccgaa 1500 attgtttcgg aagcaacgga ggccacactc tacaccaatc ctcgaatcga cgggcgaccc 1560 aatcgggaag gcaaggagaa aaggtccagg atcaaggaaa ctgaaggatt tctaaatgca 1620 catgtaggag tggagcagtc atcgattcca tcgataaacc agcacaatcg caggccgtgt 1680 cgtggatgca atagcatgga acaccggacc cgtgtctgcg aggattttcg taagttggtc 1740 tggggcgata aagtaaaaat tgtagagaag tggaaaattt gcaaaatgtg tttaaacgaa 1800 catggcgagt cacgctgccg atttaaggga cactgtaacg tcggcgaatg caaggaaagg 1860 caccattccc ttctgcaccc accaaactca attgcaccac tacctacaaa ctgccacgta 1920 cacaattcgg aacaacatcc tgttattttt aggatggttc cgatttaact acaccacgaa 1980 ggacgaagct gtgacgtcat tgcgttttta gatgaaggtt cttcttattc ccttatggag 2040 ggtagcgttg ccgatcaatt aaagctaaaa ggcgcatggg aaccgatact tgtgaagtgg 2100 acagcaggta tgagtagact agaacgagaa tctaggcgtg ttgatgtctc tatttcatca 2160 agtggatcga acgaaaggtt tctcctgcga aacgttcaca ccgtccaaga gctccaatta 2220 ccagagcaaa agatccggtt cgccgaagtg gcggcccgtt tcaagcacct ttgtgggtta 2280 ccagttgccg attgcctagg tggttctccg aaaattctta tcggcctaaa acacctacac 2340 gtgtacgcgc ctctggagtc gcgtatagga aacccaggag aaccaattgc agtccgtacg 2400 aggcttggct ggacaatata cggccctcaa ggaaacggta atgtggcaac ggggttcgct 2460 gggcatcata cgactggtga attgtcaaat atcgacctcc aggaacttct tcgaaaacac 2520 ttcacgttgg aggattctgg cttagttgtt aacgttctac cagagtccaa cgaagatcgt 2580 cgagctagaa tgttgttaga acaaacgacc acacgagtcg gtgagcactt cgaaactggt 2640 cttttatgga gaaacgacca acctcaacta ccggataact accagatggc actcaagcga 2700 atgaagagtc tggaatacaa gttagcaaaa aacccagaac tggcgcaaaa tgtggagaag 2760 cagattgaag actatcaacg aaaaggatat gcccatatcg ctaccaacga agaattgatg 2820 gaaggtgaag caggcaaggt ttggtatctt cctttaaacg ttgtgctgaa cgtgaaaaag 2880 ccgggaaagg tccggcttgt ctgggatgct gcggcggcag tacaagggcg gtcgttaaat 2940 tctgagctgc taaaaggtcc cgatctttta tctagccttc catctgtcat gtgccccttt 3000 cgcgagcggc caatcgcctt tggtggtgat attgctgaaa tgtaccacca ggtacgcatt 3060 cgtgaatgtg acaagtcagc ccagagattt ttgtatcggc cgaatgcttc gggaccccca 3120 accatttatg ttatggatgt cgccactttt ggctcaacaa gttcgccttg ttctgctcaa 3180 tatattaaaa atttaaatgc ccaggagtat gctaaagagt tccctgatgc ggtcgaggct 3240 attgtcaaca aacactatgt cgacgactat ttggactcaa cgtttacagt tcgcgaggct 3300 atcgaacggg caaatcaagt cgctttcatc cattcaaagg ccggattcaa tattcgaaat 3360 tgggtttcga acagtacgga atttctccaa catttcgatg gacaaccgga taatcgattg 3420 atccacttca actgcgacaa gtccagctat acggaaagga tcttaggtat gtcgtggaat 3480 acttcgaaag atgtcttcgt gtttgctgcc gttctgcgtg atgacttgca atcataccta 3540 aggggagaga agctgccaac caaaagaaca ttgttaagta tcgtcatgag cttttttgat 3600 ccgctgggtc tgtatacgtt gtttactgtc tttggaaaaa tgatcatcca agacctttgg 3660 aggaatggct gctcatggga cgagacaata gacgctgatt cggcgaaaaa atggtccaga 3720 tggatagcgt tggtgcctca agtacaagct atggagatcc cgcgctgcta cttctcaaaa 3780 gtgaaaccgt ctgactatca gaaccttgaa ctgcacgtat ttgcagacgc gagtgaggag 3840 gcctatggat gcgtcgctta ttttcgaatt ttcgtgaacg gacaaccgag agtagcgtta 3900 gtgtcggcga aatccaaagt ggcgccacta cagtacacat ctatccccag aatggaactc 3960 ctcgctgcgg tactcggtgc tagattagct actgctgtga aggccaacca ctcggagaag 4020 gtgacgaagg tcattttcca tattgactcc gcaactgtgc tttcctggat ccattcagac 4080 cacaggaaat ataagcagtt tgttgcttat cgcatagggg aaattctgag ccttactaac 4140 ccacatgaat ggatttgggt ccccacaaaa aacaatatag ctgatgtgtt gacaaagtgg 4200 gggaagaatg gcccaccgct agactctaat ggagaatggg tgcgtggtcc tgaaattctt 4260 tacaaagcga atgaagagtg gttcgaacga gaattaccag ctccgggcgt gagggaagaa 4320 ttgcgagctc atcacttgtt ccacgaaatt gtgttttcac agaacctggt agatactact 4380 cgattctcgc gctatgctat tctggttcga agccttgctt gcgtatttcg tttcatttcg 4440 aactgccgga aaagggttca agggcagccg atcgaaacct ttcaagctaa tccaaaattg 4500 ggaaagtcgg ttatacgtga tgtattttcc agaaaagttc ctcttaaacg tgaagaatac 4560 cgactagctg agaactactt gtggagaaca gcgcagcaag aaggtttcgc tgatgaaagg 4620 aagacgctgt tgaagaacaa agaattgcca caatctaagt ggcatttcat cgaacgttct 4680 agtccgcttt ataaattggc tcctttcctt gatgaagaag gcgtgattcg tatggaaggt 4740 cgctcggctt atgctgaatt catttctttc gaacagcggt tcccaatcgt gctgccaaag 4800 ggtcacgaca ttacctcaaa actgcttcta cactaccacg aaaaatttgg ccatgcaaac 4860 cgtgagaccg tggtgaatga actgcgtcaa cggttctatg tcccacacgt tcgggtggcc 4920 attttgcgag tgatgaaaga ttgttcgcgg tgtaagatcc aaagatgtcg tccagaagtt 4980 cctcggatgg caccactccc agtgcagcgc ctcactccta tgttacgtcc tttcagctat 5040 gtaggggttg attatttcgg ccctgttgtc gttacggtag gacgacggtc cgaaaaaagg 5100 tggatctgcc tgttcacatg ccttgtaacc agagccatcc acatggaagt ggcccacagc 5160 ctaaatagcc aatcgtgtgt tatggctatt aggcggttta tttgtagaag aggtgcccct 5220 cttgaaatat tttcagataa tggcacaaat ttcttggctg cgagtaaaga gttgagtcaa 5280 aaggttcggt atattgaatt agaatgtgct gatatcttta ccgatgctag aactcgatgg 5340 aattttaatc caccagcagc gccccatatg ggtggaatat gggagaggct agtaaggtcg 5400 gcaaaggaag cactgaaagc tctacacgat ggtgggaaat taacagacga aatcctgctt 5460 accgttttaa ccgaagctga agatatgatt aattcacgcc ctctcacgta cgtaccacaa 5520 gaatcagccg acgtcgaagc tcttacaccg aaccattttc tccgtggact gccttcgggg 5580 gagcaagacc aagttagaac tctaacgaat tcggctgaag ctttacggga taactacaag 5640 cagtctcaaa agttggcaga cgtcctgtgg caaaggtggt tgacagagta tataccaacg 5700 attaaccttc gttccaaatg gtgtaaagaa caagatccgg ttaacgaagg agaactcgtg 5760 tatatagtcg acggaaacaa ccgaaggaca tggatcaggg gcatcgtggt gaaggtcatc 5820 agaggaatcg atggaagaat acgtcgagca ttggtgaaaa catccaaagg agtgtatcga 5880 cgtgcagtcg ccaagcttgc ggtgatggag cttaggagta aatctgatcc aatatctgat 5940 tctggaccag agttacggga gggggaat 5968 // ID Gypsy-3_DWil-I repbase; DNA; INV; 3613 BP. XX AC scaffold_180632; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_DWil_; KW Gypsy-3_DWil-LTR; Gypsy-3_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3613 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_180632; Positions 9503 13115. XX CC Positions [2271-2699] - Reverse transcriptase CC Positions [2715-3191] - Integrase core CC 'CTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 375..3599 FT /product="Gypsy-3_DWil-I_1p" FT /translation="MSKLEDFVNVQEALKTEKKSNITLFHKFIYEEEGDRA FT NRQRLRKFEGFDFDCNSDLYKSKVSYVEKNLTNNDLVIICNVLGLSHDKSN FT LTSHLFRNLQAVNLLADLDTENDQNEDDDDECERDGDVGNMQDEDEMLSVV FT SDETNFRTAATNENMRNKTNRANSTVVANNIDTQGAVNVNMPPPRFALSFR FT DVEDSLRPFHGDENLSVEVWIEDFEDMADLMQWDSLQKFVFAKRAIKGLAK FT MFILSERGIKNWSTLKESLLNEFKTVINSAELHKQLSERKIKKGESVQKFL FT LKMKEMASRGNIENSALMQYVIDGINDLSMNKAILYNANNLKEFKEKIKCY FT EKMREKSSNQKSESKNVQIKKESVMKYENKESGIKCYNCGQKGHISENCDN FT KSKGRKCFGCNNFGHIAKDCPNKKPENSTDVRNTNKISADHMFMCKEVSIN FT NEKCIALLDTGSKYNIVTETIYNRLKKPKLAKVDKVLYGFGDKKRVVPIGS FT FTGRLCVSQEQFDLVFLVVSSEFLDHEVILGEDFCKQAEIKINRDGVQIGK FT YNDETELSSVMRIKVENDEIDIDSNASKSTKCEVNKLIKNYKPEKLKSTNV FT EMRIVLKDNSPIYSRPRRFAFAERCIIDDQVDQWLKDEIIEQSESEFSSPV FT VLVKKRDGTPRLCIDYRRINKVIVKDHFPLPLIEDPLDRLQGATVFSTIDL FT KNGFFHVDVEKESRRFTSFVTHNGQYQFLKVPFGLTNSPGVFQRHVNAIFR FT DLTRAGTAIPYVDDIIILGKKEGELHPLPKEELPFQTFHIDFLGPLESTHK FT QYKHILAVIDGFTKFCWLYPTKTTSTKEVITRLQQQSLIFGNPVQIISDRG FT SAFTSDEFKEYCKAENIELHAVTTGLPQANGQVERLNAVIISVLSKISIED FT PSKWYKFVGKVQQTINSTYSRSTHSTPFELLIGTKMHTKDDLKLKEIIHDE FT MIQIFNDNRDDLRNIAKQQILRMQEENKKTYNLRRRPASIYKVGDLVAIKR FT TQLGGGLKLKPKYLGPYRITMVKARDTYDVVKDTLFNDGPRRTTTCAAYMK FT PWVPCDIDDQDAFEANAY" XX SQ Sequence 3613 BP; 1343 A; 594 C; 771 G; 905 T; 0 other; aatatggggg ctcaacctac aaaataggcg tgactcaacc tacaaagaag gcatggctca 60 acttataaag aagtaaaaaa gtaagatcag tgagtgagaa agagacaaac atacacaaaa 120 cgacgcaaca gtggaatcaa acgacgacaa aaaaaaaaaa aaaaaagcgc gaaaacgaat 180 aagaattagc atcaaacgac gaaaaattgg cggctaacga cgaaaaagcg gcaaagaaac 240 ggtgaaaaag ttttcaacga acgacgcgaa actgtttgaa taccacagtt atctggtgac 300 gtggcaaacg acgtatatat acatatacat atttgtatgt acaatattag tgtgtgtgcg 360 taagagaaag aactatgagc aaacttgaag actttgtgaa tgtgcaagag gcactgaaaa 420 ctgaaaagaa atctaatatt acattgtttc acaagtttat atacgaagaa gaaggcgatc 480 gtgcaaatcg ccaaagacta agaaagtttg aagggtttga ttttgattgc aacagtgatt 540 tatataaaag taaagtgtct tatgtggaaa agaatttaac aaataacgat ttagtgatta 600 tttgtaacgt tttgggttta tcgcatgata aatcgaatct aacttcccat ttatttcgca 660 atttgcaagc tgtaaatttg ctagctgacc ttgacacaga gaatgatcaa aacgaagatg 720 acgacgatga gtgcgaaaga gatggagatg taggcaatat gcaagacgaa gacgaaatgt 780 tgtcagtagt gagcgacgaa acgaatttta gaacagctgc aacaaacgaa aatatgcgaa 840 acaaaacgaa tagggcaaac tctacagtcg tggccaacaa tattgatacg caaggtgcag 900 tgaatgtcaa catgccacca ccgaggttcg ctttgagttt tcgggacgtt gaagactctt 960 tacgaccttt ccatggcgat gaaaatttat cagttgaagt atggattgag gatttcgaag 1020 acatggctga cttgatgcaa tgggatagtt tacaaaaatt tgtttttgct aaaagagcta 1080 ttaaaggact tgctaaaatg tttattctga gtgaacgcgg aattaaaaat tggtcgacat 1140 tgaaagaatc acttttaaat gaatttaaaa cagtaataaa tagcgcggag ttacataagc 1200 aattgtcaga acggaaaata aagaaaggtg aaagtgttca aaaattttta ttgaaaatga 1260 aagagatggc atcacggggt aatatagaga acagtgctct tatgcagtat gtaatcgacg 1320 gaattaatga cttgagtatg aacaaagcaa tattgtacaa tgctaataac ctaaaagagt 1380 ttaaagagaa aataaaatgt tatgaaaaaa tgcgggaaaa gtcaagtaat caaaaaagtg 1440 agtctaagaa cgttcaaatc aaaaaagaaa gtgtaatgaa atacgaaaat aaagaaagtg 1500 gtataaaatg ctataactgt gggcaaaaag gacatatatc tgagaattgt gacaataaga 1560 gtaaaggaag aaagtgtttt ggttgcaata attttggtca catagcaaaa gactgtccaa 1620 ataaaaagcc tgaaaatagc acagatgttc gtaatactaa taaaatatca gctgatcata 1680 tgtttatgtg taaagaagta tcaatcaaca atgaaaagtg tatagccttg cttgatactg 1740 gtagcaaata taatatcgta acggaaacaa tttataatcg tttgaaaaaa cctaagctcg 1800 ctaaagttga taaagttcta tacggtttcg gagataaaaa gagagtggta cctatcggtt 1860 cgtttacggg aaggctttgc gtaagccaag aacagtttga tttggttttt ttagtggtat 1920 catctgaatt cctagatcat gaagtgattt tgggagaaga cttttgtaag caagccgaaa 1980 taaaaatcaa tcgcgatggc gtacaaatcg ggaaatataa cgatgaaacg gaattaagtt 2040 cggtaatgag aataaaggtc gaaaacgacg aaattgacat tgactcaaat gcatcgaaaa 2100 gtacaaagtg cgaagtgaac aaacttatca aaaactataa gccggaaaag ttaaaatcga 2160 caaatgttga aatgcgtatt gttttaaaag acaatagtcc aatttattcg agaccacgca 2220 gattcgcttt tgctgaacgg tgcataatcg atgaccaagt cgatcagtgg ttgaaagacg 2280 aaataataga acaatccgag tctgaattca gcagtccggt cgtattagta aaaaaacgcg 2340 atggcactcc acgtctgtgc attgactaca gaagaataaa caaggtaata gtaaaagacc 2400 attttccatt acccctgatt gaagatccat tggatcggct acaaggagca acggttttca 2460 gcacaatcga tttaaagaat ggatttttcc atgtagatgt tgaaaaagag agtcggagat 2520 tcacctcgtt cgttacacac aacggacagt accagttttt aaaagttcct tttggattga 2580 cgaactcccc tggggtattt cagagacacg ttaatgccat ttttcgtgat ttaacccgtg 2640 ctggtacggc tatcccatac gtcgacgaca ttataatcct tggtaaaaaa gaaggagaac 2700 tgcatccact gccaaaagaa gaactgcctt ttcagacgtt ccacatcgat tttcttggac 2760 ccttagagtc aactcacaag cagtataagc acattcttgc agttatcgat ggctttacaa 2820 aattttgctg gttgtacccg acgaaaacga catctacgaa ggaagtaatc acaagactac 2880 aacaacaaag tttaattttt ggaaatccag ttcaaattat ttctgacaga ggttctgcgt 2940 ttacttctga cgagtttaaa gagtattgca aggcagaaaa tatcgaactt cacgcagtta 3000 cgactggtct tccacaagcg aacggacaag tcgaacgcct taacgctgtt attatatccg 3060 tgttgtcaaa gatttcgata gaagacccta gcaaatggta taagtttgtc ggcaaagtac 3120 aacaaacgat taactcaacg tacagcagaa gcacacattc aacgccgttc gaactactga 3180 ttggtacaaa gatgcacaca aaagacgacc ttaagctgaa agaaataata cacgatgaga 3240 tgattcagat ttttaacgac aacagagatg atctacggaa tatagcgaag cagcagatcc 3300 ttcgtatgca agaagaaaat aagaagacct acaacttacg tcgcagacct gcttccatat 3360 acaaagtcgg tgatctggta gctattaaaa gaacccaact tggaggaggt ttgaagctga 3420 agcccaaata tttaggtccg taccgcatta ccatggtaaa ggccagagat acatacgacg 3480 tggtcaaaga tacattgttc aacgatggtc ctaggcgaac tacgacttgc gctgcctata 3540 tgaagccgtg ggttccctgt gacatagacg accaggatgc attcgaggcg aatgcatact 3600 aggatggccg agt 3613 // ID BEL-94_AA-LTR repbase; DNA; INV; 658 BP. XX AC supercont1.289; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-94_AA_; KW BEL-94_AA-I; BEL-94_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-658 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.289; Positions 918339 918996. XX SQ Sequence 658 BP; 205 A; 141 C; 142 G; 170 T; 0 other; tgtcaacgca gaagacggcc agctggcaac cctgccacca accgattcaa aagcgtcgct 60 ttccgtcacc gaagcgttgc gtcaaaacca atgacagctg tcacacgtag tggatgagct 120 gtcaacgcac cgtttccggt catcaattgc cacatttctt gattagaaag aagtgcaaaa 180 ccatacaaag tgcattagaa caattcagtt gttaaatcag aatttgctaa agtgaataaa 240 ataagttttc ggtgtttagt tgcttagtct ggtaggagtt tcccagttta caaagttgtg 300 cggttagcga tttaaaactc agtttgtccg gtttcagagt cgcagttgag tgtccgtttg 360 aagcacgtgc cgcagtcagg gttcccttcg gttagccgaa agtcctgtaa gaaattagaa 420 atttagaaaa gtatttagaa aactaataca accgcggcaa atttcgcagg tagtcggtcc 480 tcggaggagg ttagatttac cagaagggcg tgcaaaaatt gataagttca ccgaattgta 540 agtagaatta aaacaatgaa aacacatact taccactaat catgttacag ttgaagcaac 600 aatatacttt cgctatcaaa accggtacga ctttggctca cgccctacac ccgcaaca 658 // ID Gypsy8-I_AP repbase; DNA; INV; 4162 BP. XX AC Contig23376; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8AP; KW Gypsy8-I_AP; Gypsy8-LTR_AP. XX NM Gypsy8-I_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4162 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 451-451 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Positions [1462-1965] - Reverse transcriptase CC Positions [3089-3622] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 88..2853 FT /product="Gypsy8-I_AP_1p" FT /translation="MDSKIPKEMNFNGNIEANWKNWKQRLSLYLLASNKNT FT CSDETKTAILLTLLGEEGINIYNTFSGKKIHDDKNVPIFQKVISAFDEYCL FT GKKNIIFERFNFLKYKRQHGQSLENFVTQLKLLAASCEYGELTDSIIRDQI FT IINTSDVVLQEQLINKSDLSLEKTVEIIKQSENVKKQIEIINKEEKVDPEY FT SHVDSIKGKNYTSNKNKVAQTNFKCSRCGIQHAPRSCPAFGKQCRKCAKPN FT HFANVCRSNKMVQEVADAGAEEEYAANNSLFGVSEINVPQAHSIQQPWFEE FT IKVENMFIRFKLDTGSQVNLLPLNIYKTLNVNKCWNRTTIKLEAYGGYKFM FT PLGSIILKCVVNNIIAWVTFLIINNSDIPILSLEACEKFNLIKRHEQCISM FT ITQCNNQKDKFLINNSSIFEGIGTFPGEHTIKINPNSQGSIKPARRLPQTL FT YKDVQIELNKLLKHKIISKVEEPKEWASNLVIIRKPDKTLRLCIDPSELNK FT SLKRENYLIPTFEEIRSKLINKKIFTVLDIKKGFWHVKLDSKSSDLCTFST FT PFGYFKFNRLSFGIATAPEIFIKLNQKYFGDIDNDNIIIYFDDILIATTDE FT TTHDEVLKKLVDRAKLLNIKFNKDKLQFKKTEIKYVGHIFNEQDVSPDPDK FT IKAIVSLKEPTSRVELQRLIGMFNYLREFIPNMSKIISPLRELLKKDIIWV FT WESRHSLALKELKNLVTTAPILTHFRPDKEITIQCDASKDGLGCCLLQDKK FT PIAFASRSMSETEIAYAQIEKEFLSLIFACRKFHYYIFGRTINALTDHKPL FT VSIMQKDIIKIPSNRLQKMRLKLLEYDIRLKYLPGKKMHIADLLSRDYMRE FT NSTEEFDTSGTVHCINRFNNNNVCNIKSESELDPVLNKIIEYYFQGWPNRK FT QIDKAVQLYYNLKKRNNY" FT CDS 2996..3970 FT /product="Gypsy8-I_AP_2p" FT /translation="MSNEIEQFINNCMTCNKFQNSKKKSPLIPHEVSNYPY FT EKVGADILSFEGMDYLVIVDFFSKWFDLIKLKFKTANEIIKKSKQIFSTHG FT IPKTFIADNMPFNSSEFIAFSKTWNFTIVTSSPHHPQSNGLAERTVQTSKK FT LLKKAREEGCDIESMLLEYRCTPIISLQASPAQLLFSRILRTKLPIANKLL FT EPKLQNNIQTKIKVYQQKYKSNYDKTTNKNETQFKPNTNILIQNNKVWLPG FT KVIAKANTPRSYFVKNNKGTIIRRNSKHLKNIKNNNVTEGSDEENIGVGTK FT AENKPNKSVLGKKVLTRLIRVKKLPNRLKDCVL" XX SQ Sequence 4162 BP; 1714 A; 646 C; 658 G; 1144 T; 0 other; gaattatata acatggcact gtggctaatc ttgtgtgtca agtacggagt cgattgttat 60 aatttgtaaa tacacaacaa actcaaaatg gattcaaaaa taccaaagga aatgaatttc 120 aacggcaaca ttgaagccaa ctggaaaaac tggaagcaac gactgtcgct atacctactc 180 gcaagtaata aaaacacgtg ttcagacgaa acaaaaaccg cgatattact cacgttactt 240 ggagaagaag gtatcaatat ttacaacaca tttagtggta agaaaatcca tgatgataag 300 aatgttccaa tttttcaaaa agtaatttct gcattcgatg agtactgcct gggaaaaaaa 360 aatattatat ttgagagatt taattttctg aaatataaaa gacaacacgg acagtctcta 420 gaaaacttcg taacacaatt aaaattatta gcagcatcgt gtgagtatgg ggaactcacc 480 gactcaatca taagagacca aattattatt aatacctcag atgttgttct tcaagagcag 540 ttaatcaata agtcagacct atcattagaa aaaactgtag aaatcattaa acaatcagaa 600 aatgtgaaaa aacaaattga aataataaac aaagaggaaa aagtagatcc agaatattcc 660 catgtagatt ccataaaagg aaaaaactat acttcaaata aaaataaagt agcacaaaca 720 aatttcaaat gcagcagatg tggcatacaa catgcaccaa gaagctgccc ggcctttggg 780 aaacagtgca ggaagtgcgc taaacccaac catttcgcaa atgtatgccg ctcgaacaag 840 atggttcaag aagtagcaga tgcaggagca gaggaagagt atgcagcgaa caacagcttg 900 tttggggtca gtgaaatcaa cgtaccccag gcacatagta tccaacaacc atggttcgag 960 gagatcaagg tagaaaatat gtttattaga tttaagttag acaccggctc tcaagttaac 1020 ttattacctc tgaatatata taaaacactt aatgtaaata aatgttggaa cagaacaact 1080 atcaaacttg aggcttatgg tggatataaa ttcatgccat taggctcaat aatattaaag 1140 tgtgtagtta ataatataat agcatgggtg acctttttaa taataaataa tagtgatata 1200 ccaatactga gtctagaggc atgtgaaaaa tttaatttaa tcaaaagaca cgaacagtgt 1260 atcagcatga tcacacagtg caataaccaa aaagacaagt ttttaataaa taatagtagt 1320 atttttgaag ggattggaac attcccgggt gaacatacaa ttaaaataaa tcccaatagt 1380 caagggtcta tcaaacctgc gaggagatta cctcaaacac tctataaaga tgtacagata 1440 gaacttaata aattactaaa acataaaatt atatctaaag ttgaggaacc taaagaatgg 1500 gctagcaatc ttgtgataat aaggaaacca gataaaacac tgagactttg tatagatcca 1560 tcagaactca ataagtcatt aaaacgagaa aactacctga tcccaacctt tgaagaaatt 1620 cgctctaaac taataaataa aaaaatattc accgtcttag acattaaaaa aggattttgg 1680 catgtcaaac ttgatagtaa atcatcagac ctatgtacat tcagtacacc gtttggatat 1740 ttcaaattta atagactttc tttcgggatc gcaacagctc ccgaaatatt tataaagtta 1800 aatcaaaaat attttggtga tatagataac gataatatta taatttactt tgacgatatt 1860 ttaattgcca ctacagatga aacaacacat gatgaagtat taaaaaaact agtagatagg 1920 gctaaattac taaatattaa atttaacaag gataaattac aattcaaaaa aacagaaata 1980 aaatatgtgg gccacatttt caatgaacaa gatgtctctc ctgacccaga caaaataaaa 2040 gccatagtga gtctgaagga acctacatcg agagtagaat tacaaagatt aataggcatg 2100 ttcaactacc ttagagaatt cataccaaat atgtccaaaa ttatcagtcc actcagagag 2160 ttacttaaga aagatataat atgggtatgg gaaagcagac attccttagc tctaaaagaa 2220 ttaaagaatc ttgtaacaac agctccaata ctaacacact ttagaccaga taaagaaata 2280 accattcaat gtgatgcgtc aaaggacggt ctgggttgct gtcttctcca ggacaaaaaa 2340 cccatagctt ttgcttcacg cagcatgtca gaaacagaaa tagcgtatgc acaaatcgaa 2400 aaggaatttc taagtttaat attcgcatgt aggaaatttc attactatat tttcggtcgc 2460 acaataaatg cactaactga tcataagcct ttagtgtcaa tcatgcaaaa ggatataata 2520 aaaataccat caaatcgtct acagaaaatg agactaaaac ttttagagta tgatattagg 2580 ttaaaatatt tgccaggaaa aaaaatgcat atagctgatt tattatcgag agattacatg 2640 cgagagaact ctacagagga atttgatact agtggtactg tgcattgtat aaatagattt 2700 aacaacaata atgtttgcaa tattaaatca gaatctgagt tggatccagt tcttaataaa 2760 ataatcgaat actatttcca aggttggcca aataggaaac aaattgataa agcagtgcaa 2820 ctatattata acttaaaaaa acgaaataac tattgaaaat ggaattatat atgttgcaga 2880 taaaatagtg atccctacca agttaaggcc ccttatactt aaactcctac atgaaagtca 2940 tttaggcatt aataaaacaa aagtaaaagc taaacaaatt atatattggc cagggatgtc 3000 taatgaaatt gaacaattta taaataattg catgacttgc aataaatttc aaaattctaa 3060 aaaaaaaagc cctctaatac ctcatgaagt ttcgaattac ccgtatgaaa aggtaggagc 3120 agatattcta tcttttgagg gtatggatta tttggtcata gttgactttt tctcgaaatg 3180 gtttgattta attaaattaa aatttaagac tgctaatgaa ataataaaaa aaagtaaaca 3240 aatatttagt acacatggta taccaaaaac cttcatagca gacaacatgc cgtttaattc 3300 aagtgaattt atagcttttt ctaaaacttg gaattttaca attgttactt ctagccccca 3360 ccatcctcaa agtaatggcc tagctgaaag gacagttcaa actagtaaaa aattacttaa 3420 aaaagcccga gaagaaggat gtgatataga atcaatgctt ctagaatata ggtgcacccc 3480 aataatcagt ctgcaagcat cgccagcaca actattattc agtaggattt taagaactaa 3540 attaccaata gctaacaaat tattagaacc caagctccaa aataatatac aaactaaaat 3600 aaaagtatac cagcaaaaat ataaatccaa ttatgacaaa acaactaata aaaatgaaac 3660 acaattcaaa cctaatacaa atatattaat tcaaaataat aaagtctggt taccagggaa 3720 agtcatagct aaagcaaata cacccagatc atattttgta aaaaacaata aaggcacaat 3780 tataagaaga aacagtaaac acttaaaaaa tattaaaaat aataatgtta cagaaggcag 3840 cgatgaggaa aacatagggg tcggaacaaa agcagaaaat aagcctaaca aaagtgtgct 3900 ggggaaaaag gtactaacta gactcataag ggtaaaaaaa ctacctaaca gacttaaaga 3960 ctgtgtattg taatgtaaat tagcataaag ccactgtaat cgatacatat tttgtactta 4020 tattaattat tataatgttg tattactatc ttcattatca aattgtatta ttattattaa 4080 ttttgtattg ttattaattt tgtattatga tactctataa ggtttaatat tgtattcaat 4140 tatattttat gaaagagaga ga 4162 // ID Gypsy-99_CQ-LTR repbase; DNA; INV; 226 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-99_CQ_; KW Gypsy-99_CQ-I; Gypsy-99_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-226 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 578-578 (2011). XX DR [2] (Consensus) XX SQ Sequence 226 BP; 67 A; 43 C; 34 G; 78 T; 4 other; tgttaagtat ccctcgttta gstggtctag atcakwgtaa ttgtatcttg aatctcttta 60 attatattgc aagtgtgttg catttctagt acctaagcta tttcctgtas gcaccctata 120 taaaccattg ttacactatt gtattcctct ttcccgatcc agcagttgaa gaataaagtt 180 caaaggtaaa agtgaaacta gtcttttaat ccgcaccaaa aagaca 226 // ID Gypsy-1_OD-LTR repbase; DNA; INV; 313 BP. XX AC CABV01000585; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_OD_; KW Gypsy-1_OD-I; Gypsy-1_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000585; Positions 25918 26230. XX SQ Sequence 313 BP; 92 A; 60 C; 58 G; 103 T; 0 other; tgatcggtaa gatcttcatt tatgaattac tcattataat ggacctcctc gctgattggc 60 ggacgcgcta ggcgcacgcc aagctctcca ctttcacttc actttcacct tcgacctgtc 120 gattattttt acgatcaatg tattcactgc cgattactgt acttatactg taattatgaa 180 tatatgagag agcttaactt ccattgtcgt cattattgag taatgatcaa agcaagagcg 240 agatagtttt agtagtagaa gatttgtact aagtattaaa agaagcatag tctctgtgat 300 acaattagga aca 313 // ID BEL-31_CQ-LTR repbase; DNA; INV; 625 BP. XX AC AAWU01011368; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-31_CQ_; KW BEL-31_CQ-I; BEL-31_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-625 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 216-216 (2011). XX DR GenBank; AAWU01011368; Positions 1685 2309. XX SQ Sequence 625 BP; 201 A; 123 C; 153 G; 148 T; 0 other; tgttgcgacc gccaagatgg gaacactgcc atgcggtgtg acggcgacgg tgcaatcttc 60 cgacaccgtc ttggctgtca acacacgcga aaaggaggag caagcaaaaa caaagtgcgt 120 gtgtgacacc cagcttcaag acacacgttg caagtcacca tttgcgaaga gtaagggaat 180 ttagtgcggg atattttctt tcggttagtt gcgcgaaaaa aagtatattt cgttgtgaaa 240 gtagtttgaa tttattgaaa actccaattt acacacctga tatatcaagc aatcgtccaa 300 tcgttgctac cgcagctaga cgctaacttg atttctgctg ctaaaccttt acgcacctgg 360 atctccacta gaatcgcacc gtcacttggt acagctttgt accggatcga tcaagcgaag 420 agtgaaagga aggaacgaag gacaggaaga ggaagaggag tcgggaagga aaggagtcaa 480 tcgggaaagg aagaccaatt aagtaggttt tggatagtta aaatgtaaac aaaattctaa 540 ataaaaatac ttaattctag ttttgaagct gctcgaagag ctgctgaaca aaaagttggg 600 gttttattta ccgaccaccc gaaca 625 // ID REP_SO repbase; DNA; INV; 988 BP. XX AC D38565; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Fern sawfly repetitive DNA sequences. XX KW SAT; Satellite; Simple Repeat; REP_SO; pSOL family; KW satellite family; tandemly repetitive DNA sequence. XX OS Strongylogaster osmundae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Tenthredinoidea; OC Tenthredinidae; Selandriinae; Strongylogaster. XX RN [1] RA Sonoda S., Yamada T., Naito T. and Nakasuji F.; RT "Characterization of a family of tandemly repetitive DNA RT sequences from the fern sawfly, Strongylogaster osmundae RT (Hymenoptera: Tenthredinidae)."; RL Jpn. J. Genet 70(2), 167-177 (1995). XX DR Genbank; D38565; Positions 1 988. XX SQ Sequence 988 BP; 183 A; 211 C; 121 G; 473 T; 0 other; aattcggata ttgttgtcga attggaatct tcgtcttaaa acaaaaaatg aataatcatt 60 tctcccgacg agaaaaaatc atcactgcac ttttctcctt ttttgttttc attttctatc 120 ttcctcatgt tttttctcct gtttcgcact ctgttccatt ccgttctttt ttccgaattt 180 ttttcaaaat tttttttctt cgtctttttt tttctcgttt tcgttccttt ttcaagtttt 240 tttcttgttt tttttccccg ttttttttct aagtcttttt tctccttttt gcaaaatttt 300 tctttgttct ttttctccct tccgtttttg atttttttca aagttttttt ttactcaaat 360 tttttttttc gcccaatatt tcattgtttt ttcttccaat ttttttttta cttttttttg 420 ctttgttttc ttttttctct ttgttttttc tttctggttc tgtctaagtt ttttttttcc 480 ttctttttca ttttttccgg tttttttttc gttttttttc cgaatttttt ttcggttttt 540 ttcaaatttt tttccgtttt ttttgctttt ttttcctctc cgattccttc tgctctcttc 600 cgcttccgga atgcaccctg aatgcaccca aaatgcacct gaaatgcact cgaaatgcac 660 ccaaaacgca ccttaaatgc actcaatttc ccgacgtttt tcctgactac gaatcgattc 720 cgatgccact cgggaccttg gatattgtcg ttgtgtcgat attctcgttc caaaccgaaa 780 gaaatgtcaa tttcacgttc ggcgaaaatc ataattttgg acgtattctc tgacccttga 840 tcgatttaaa tgcaattcgg catcccagat atcgttgtag gcatagaatt tctgtttcag 900 accagagaaa aatgaatttt ttttgtctga ctaaattcgt catttttgca gcacgtcctg 960 atttcgcatc aatgcaacgc gatcttcg 988 // ID Gypsy-594_AA-LTR repbase; DNA; INV; 121 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-594_AA_; KW Ty3_gypsy_Ele66; Gypsy-594_AA-I; Gypsy-594_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-121 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 121 BP; 30 A; 26 C; 23 G; 42 T; 0 other; tgcgatcggc ggctcgcgtg catacacgtt ttaagtcatt cgttctgcct tttatttaat 60 aaagttatgt tttaaatcgt aattacgagt cgcgtcacta gacatatctc gtatcgcaac 120 a 121 // ID Gypsy-48_CQ-LTR repbase; DNA; INV; 171 BP. XX AC AAWU01034932; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-48_CQ_; KW Gypsy-48_CQ-I; Gypsy-48_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-171 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 476-476 (2011). XX DR Genome; AAWU01034932; Positions 8595 8765. XX SQ Sequence 171 BP; 45 A; 37 C; 44 G; 45 T; 0 other; tgtggtggat gcctctataa gtaagtacat aaatattata cgagttccag tctatgtgtg 60 gaggattatc tacagcgcgg tgggttagca tccacgatca gcaatcgacg cagttgtgtt 120 acgaagctca agcttaccgg tcacgttggt acgatacacg cggtcaccac a 171 // ID RTE-18_BF repbase; DNA; INV; 2445 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTE-18_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-18_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2445 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2445 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1716-1716 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..2430 FT /product="RTE-18_BF_1p" FT /translation="LTQTMQRVHKKDITILQGDFNAKIGTDAHKDWSGTVG FT KHGLGTTNDSGLRLLEFARYHSLAIANTFPRHKTSRKATWHSPDGKTHNMI FT DFILIGKRHLSSLNLAQTRTFLAAFQEEIGSRFAPLLADNNADDKLATEAE FT TNLKAAAENLLGKARRAKHPWISAEVLQKCDDRREKKKVRFKTRQKNSEYQ FT KANLEVKRAIREAKEEWIRQECSKIEEGIQNNNTKQAYQTLKTLTQKTVNR FT NTCIEDDEGNLLTTKPDITRRWKEYCYELYNYQLTAEQDVLEELKAYTCQL FT QEDDDPDILEAEVLSAIKTLKLGKSPGIDNIPAELLKAGGDPVVKMYTRIC FT NYVYKTGNWPEAWTTSIVIPLPKKGNLKNCQNYRTISLISHPSKILLKVIL FT KRLQPQAEQILAEEQAGFRKGRSCAEQIFNLRMICEKYRELGKPVYHTFVD FT YKKCFDRIWQNGLWAVMRRFNISRGIINSIEALYKASQSTVMIGNEFSECF FT PTSVGVRQGCLLSPTLCNIFLENIMREALPPLESPVKLAGCIINHLQFADD FT VDLIDGSKQDQQTHFSCLDSTSRRYGMEVSLEKTKCLVTGPSDTVQVTVRG FT TDLEQQVSEFTYLGSLQTEDCSSVREIKVRIAKATSVLSRLKHIWNSHNIS FT PPTKIQLLRSLVLSIFLYGAESWTLNAEIVKRINAFEMNCYRRLLRVHWST FT HTSNREVMQRVKSLVGPRPSFLSLVKKKKLQWFGHATRAKGTLTHTILQGR FT AEGARPRGRPRRTWTSDLKEWSGQTVHHLATLAEDRQRWKTYVDGCAAPTA FT GEAMGPVR" XX SQ Sequence 2445 BP; 811 A; 558 C; 584 G; 492 T; 0 other; ttaacccaaa caatgcaaag agttcataag aaggatataa ctatattaca gggagatttc 60 aatgccaaaa tagggacaga tgcccataag gactggtcag ggacagtggg caagcatgga 120 ctgggcacaa caaatgacag tggcctgcgt ctactagagt ttgctaggta ccacagccta 180 gctatagcaa ataccttccc acgacacaaa accagcagga aggcaacttg gcactcacca 240 gacggcaaga cccataacat gatagacttc attctcatag ggaaaagaca cttgagtagc 300 ttaaacttag cacaaacccg caccttcctt gctgcattcc aagaagagat tggaagcaga 360 tttgcacctt tacttgctga caacaatgca gatgacaaac ttgcaacaga agcagaaaca 420 aacctgaaag ctgcggcaga aaacttacta gggaaagcac ggcgagcaaa acacccatgg 480 atatcagctg aagtccttca aaagtgcgac gacagaaggg aaaagaaaaa ggttagattc 540 aaaacacgac aaaagaacag tgaatatcag aaagcaaacc tggaagttaa acgggccatt 600 agagaggcaa aggaagaatg gatacgccag gagtgcagta agatagaaga gggtatccag 660 aacaacaaca ctaagcaagc gtaccagact ttgaaaacac ttactcagaa gacggtaaat 720 agaaacacat gcattgagga cgatgaaggt aacttgctaa ctacaaagcc agacataaca 780 agacgctgga aagagtactg ttatgagttg tacaactacc agctgacagc agaacaagat 840 gtgctggaag aactgaaagc atacacatgt caactacaag aagacgatga ccctgacatc 900 cttgaagcag aagtgctctc agccatcaaa accttgaaac taggaaagtc accgggcatt 960 gacaatatac ctgcagagtt actgaaagct gggggtgacc cagtggtcaa gatgtacaca 1020 agaatttgta actatgtata taaaacaggg aactggccag aggcatggac aacctctatt 1080 gtcataccac ttccaaagaa aggaaaccta aagaactgcc aaaactatcg cactatcagc 1140 ctaatttctc acccaagcaa aatccttcta aaggtcatcc taaagcggct tcaaccacaa 1200 gctgaacaga ttctcgcaga agaacaggcc ggatttagga aaggcaggtc ctgcgctgag 1260 cagatcttca acttacgtat gatctgtgag aagtacaggg aacttgggaa accagtgtac 1320 cacacatttg ttgactacaa gaagtgcttt gataggattt ggcagaacgg actgtgggca 1380 gtaatgcgtc gcttcaacat aagtcggggc atcatcaact ccatagaagc actctacaaa 1440 gcgtcacaaa gcacagtcat gataggcaat gagttcagtg agtgcttccc tacgtcagta 1500 ggagtgcggc aggggtgcct gctgtcacca acactctgca acatcttctt ggaaaacatc 1560 atgagagaag ccctgccacc actagagtct ccggtcaaac ttgcaggttg tatcatcaac 1620 cacctacaat ttgctgacga tgtcgacctg attgacggct ctaaacagga ccaacaaact 1680 cactttagct gcctggactc cactagccgg agatacggta tggaagtcag cctggagaag 1740 accaagtgcc ttgtgactgg cccgtcagac acagtacagg ttacagtgag aggcactgac 1800 ctagaacagc aggttagcga atttacatac ctgggatccc ttcagacaga ggactgcagt 1860 tctgtgagag agataaaggt caggatagcc aaggcaacct ccgtactgtc gagacttaag 1920 cacatctgga atagtcacaa catctccccg ccaacaaaaa tccaacttct acgttccctt 1980 gtgttatcaa tcttcctgta tggcgctgag tcttggactc taaatgcaga gattgtgaaa 2040 cggataaatg cattcgagat gaactgctac cgaagactac ttcgtgtcca ctggtcaact 2100 cacacttcta acagagaagt catgcagcgt gttaagtccc tagtaggccc gcgtccaagt 2160 ttcctgtcac tagtgaaaaa gaagaaacta cagtggtttg gccatgctac aagagcaaag 2220 ggtactttga cccacactat actccaagga agggcggagg gtgcaagacc taggggacgt 2280 ccacgacgga cttggactag tgatttgaaa gaatggtcgg gacaaacagt ccaccatctg 2340 gcaaccttgg ctgaggacag acaaaggtgg aaaacttatg ttgatggatg tgctgcccct 2400 acggccggtg aggctatggg accggtgagg tgaggtgagg tgagg 2445 // ID Gypsy-591_AA-I repbase; DNA; INV; 7343 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-591_AA_; KW Gypsy-591_AA-LTR; Ty3_gypsy_Ele115; Gypsy-591_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7343 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2821-3324] - Reverse transcriptase CC Positions [4393-4869] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 155..1252 FT /product="Gypsy-591_AA-I_2p" FT /translation="MDPASLSEEEICYELALRHVNNLGALPRRARAVRLRA FT LMQEDEIRGMIYDNSSHVMDAGDNISQCQTRVRELLPEVEAAMKRGDLPAV FT RLLRSRLIHYRNRLDIIDPPEVFVDTHATLSLLVQFSLEDVDDALGRTKKS FT ARTVNSESVNQSSTGAVPKRPNQTSISAEADLRGFDAEGNPSFRSSSSNHA FT IIEQAGQPQRQSTSPAIEALENRSQRMNSTQHDNERGAESLEALSINRGRG FT RGALRPRVATSNRFTMEKDFGRHEPRISSIPPPPYEPPRVEAGEADAREDY FT YQLRADLLRQLNQRPQEYFPRPLLEERRTLKAIHNWPFRYKGEKDGSSLNT FT FLQRVEIFAASEGVPEDVLLKNV" FT CDS 2770..5241 FT /product="Gypsy-591_AA-I_1p" FT /translation="MSKYVLDEVNKEIDRMLELDVIEEALYSPWNNPLVAV FT KKKTGQYRVCLDARHLNSVMVNEGYPIPQISAIMNNLSGCNYISAIDLKDA FT FWQLPLEEKSRPVTAFTVPQRGHFQFKVVPFGLCTASQALARLMTHLFADL FT EPHVFHYLDDIIICSRSIDEHIDMLEKVAKRLRDAGLTISAEKSKFCLEEM FT KYLGYVLNQSGWNVDQEKIDCIVRFPAPQCRKEVQRFLGMCNWYRRFIADF FT SKIAVPLTELTKTKTKFRWTKDAEDAFLKLKSALVSAPVLAMPDHTKPFAI FT ACDASDVAIGAVLTQETDGEEHPIAYFSQKLSSSERKYSVTERECLAVIRA FT IEKFRGYVEGTKFTVHCDHSALSYLRSMKNPTALMSRWILRLNAFDFDIKY FT RKGEINIVPDALSRIVSTLVFSADAAVDQWYQKLMERVQEAGDKFPDFRVV FT NQELYKNCSSKNEEGALTHKWKKVVPLNQRAGVIAKFHDSASGAHLGFQKT FT WQKLQNHFYWPKMIDDVARHVRACETCKASKAPNRTMMPNMGGPKPARVPW FT ELVSVDFVGPFTRSRAGNTVMLVVVDWVTKYVVVHPMRSADSVKMVDFLEQ FT HVFLRYSRPRIILSDNGKQFLSAAFKALISKHNIIHMKTAFYCPMVNNAER FT VNRVLITCIRALLDEDHRGWDENLQAIVAAINGAKHEATGVSPHVANFGRE FT LILHTDLYTQQELNTPDDPKVAQEVRLSAIRRIHEFIIQRIKNNHEKTKQR FT YNMRTRDIVFKVGDLVWRRTFTQSSKVDHINRKLDSKFIPATVIKVLGSNL FT YMLEDVRDGKRGQYHAKDIKPD" XX SQ Sequence 7343 BP; 2109 A; 1787 C; 1900 G; 1533 T; 14 other; ttggcgccca acaaaacgaa ccattttagt ttgtggttta tagttttaga atttgtgttg 60 aatttgtggc acaattctgt tatttagtaa cacatttgta gaacctacca cttattagtt 120 cttgtctagt tggctaatta agaccatttg gatcatggat ccagcaagcc tgtctgaaga 180 agagatctgt tacgaactgg cgttgcgcca cgttaacaat ctcggagcac ttcctcgcag 240 ggcaagagcc gttcgcttgc gtgctctgat gcaggaagat gagattagag gcatgattta 300 tgacaattcc tcccatgtta tggatgcggg ggacaacatc agtcaatgtc aaactcgtgt 360 gcgcgaatta ctaccagaag tggaagcagc aatgaaacgc ggggatctgc cggccgtgcg 420 tctgctacga tcgcgattaa ttcactatcg gaataggttg gatatcatcg atccgccgga 480 ggttttcgtc gacacgcacg ctacattatc actactagtg cagttttcgt tagaggatgt 540 ggacgatgcg ctgggaagaa cgaagaaaag tgccagaaca gtgaacagtg aaagtgtgaa 600 ccaaagcagc accggagcag ttccaaaaag gcccaaccag acttctatat ccgccgaagc 660 agatcttcga gggttcgacg cagagggaaa cccgtctttt cggtcaagtt catcgaacca 720 tgcaatcatc gaacaagctg gacagccgca acgacagtct acatctccag cgattgaagc 780 gttggagaat cgstcgcagc ggatgaactc aactcaacac gacaacgaam ggggagcaga 840 atccttggaa gccttgtcta tcaaccgagg aagaggtcgg ggagcgttaa ggccgcgggt 900 ggcaaccagc aacagattca ccatggagaa agatttcggc cggcatgaac cgaggatwag 960 cagcatacca cctccgccgt acgaaccgcc tagagtagaa gcaggtgaag ctgacgccag 1020 agaagattac taccaactcc gagccgatct gctacgacag ctgaaccaga gaccgcaaga 1080 atacttccca aggccgttgt tggaagaaag aagaacactg aaggcaatcc acaattggcc 1140 gttcaggtat aaaggcgaga aagacggttc gtcgctgaat acgtttctcc aacgcgtaga 1200 aatattcgca gcatctgagg gagttcctga ggacgtattg ttgaagaacg ttwagcacct 1260 gctaatggac gacgcgctcg actggtacag caacgtttac gtgactggcg agttggtatc 1320 ctgggatgat ttcaagcgcc taatcaggca cgaattcctt ccagcgagct acgcctatat 1380 cctgagagca gaggcctacc atcgtctgca gggggaggac gagccgttca gcaagttcta 1440 ccaggacatt accacactat tccagtacgt agaccctccg atgaccgatc cagagaagct 1500 gttcatcatc aaaaagaata tgaactccac ctacgctccc atcgcagcat cgcaccattc 1560 cacacgcttg ctggtaaagg cgtgcaagga actagatgaa cttcgaaaac tgcaacaaca 1620 ccaacgaaga atctcgttac catatggagc acttatcgag ccakcactag ctacaccgma 1680 ctcttcattg cgttcgtcaa agcttcagca gccgcttcaa agatttggaa aggtccatgc 1740 gttggaatct gagaagttgc aaagctatgg agacaatcat tacattccgg aaaccgagca 1800 gtcaacgcaa gaggaagaga aggtggacca gcgaatggaa gccwtactcc agcaagtaaa 1860 cgcgctcaag ctaaggttcg atcgacgaga agcaggagga mgacagtcgt ccaatcaatt 1920 tccccaacaa ccagtggcca atcccagcaa ccaacaagcc gcggatgcat ccaatgcacg 1980 ggcacctcca gcggtgatga tgtgctggaa ctgtgatgaa gaggggcacc gcttcatgga 2040 ctgtgccaaa ccacaggcgg tgctgttctg ctatcgctgt ggtcagaagg gattctcgct 2100 gcgaagttgc ccgacatgtc gtcaacgctc gggaaacgcg caagcgggga accagtaacc 2160 gagggtatgg aatcctcgct tcaccataac gatccctcag taatacccga cttagtccaa 2220 atcagttcac tgattatcaa tcccgacaat gacaatcgtc cgcatgcagt ggtggaagta 2280 ttaggaaagc aagtcaccgg cctcctcgac agcggwgcca actgctccat tttaggaggc 2340 gacaacgtga agatggcaca agaactgggc ttgcagaaga tagcgctagt tggaggcata 2400 agaacagcag atggcacgga gcatcgcatc caatcataca ctcgcttgcc catagcctac 2460 aacaacaaga gtgaagtcgt gacgatgttg ctactcccca cgctgccgac atgtgtgatc 2520 ttcggaatga acttctggaa cgcattcagc atcaaaccgg tctgctgcac gataagcctg 2580 gagcctggca tggaaggcaa tmctcagcaa gtgtcgatga agtcgctgtc cgaagaagaa 2640 aagcggagac tggaagaagc cataagccac ttcccaaaag cagaacccgg maaattggga 2700 agaackgacc kwtacgtcca tcgcattgac gtcggagaag cgaagcctcg gaagcaacgg 2760 tactatccca tgtcgaagta cgttctagac gaagtgaata aagagatcga ccggatgctg 2820 gagctcgacg tgatagagga agcgctgtat tcaccatgga ataatccgct ggttgcagta 2880 aagaagaaga ccggccagta cagagtctgc ctggatgccc gccacctaaa ctcggtgatg 2940 gtgaacgagg ggtatcccat cccacaaata tcggctatta tgaataacct cagcgggtgc 3000 aactacatat cagcgatcga tctgaaggat gcattctggc aactgccttt agaagagaaa 3060 tctcgtccgg tgacagcgtt cacggtacca caaagagggc actttcaatt taaagtggta 3120 ccattcgggt tatgcacagc gagtcaggcc ttggcgcggc tgatgaccca tcttttcgca 3180 gatctggaac ctcatgtgtt ccactacctc gatgatatca tcatctgttc aagatcgatc 3240 gacgaacaca ttgacatgct tgaaaaggtg gccaagcggc tgagggatgc cggtctgacc 3300 atatccgcag aaaaatccaa gttttgcttg gaggaaatga aatatctagg ctatgtgcta 3360 aatcagagcg gatggaatgt agatcaggag aagatcgact gcatcgtgcg tttcccagct 3420 cctcaatgcc gaaaagaggt gcaacgattc ctggggatgt gcaattggta tcgcaggttc 3480 atagcggact tctccaaaat agcagtaccg ctgacggagc tgaccaaaac gaaaacgaaa 3540 ttccggtgga caaaggatgc agaagatgcg ttcctgaaat taaagtcagc actcgtatca 3600 gcgcctgttc tagcaatgcc agatcacacc aagccgtttg cgattgcctg cgatgcgagc 3660 gacgtagcca ttggggcggt gttaacgcaa gaaacagacg gagaagagca cccgattgct 3720 tatttctctc agaagctgtc gtcgtcggag cggaaatact cggtaaccga acgagaatgc 3780 ctagccgtga taagagcaat agagaaattc cgagggtacg tagagggtac aaaatttacc 3840 gttcactgcg accactcagc attgagctat ttgagatcga tgaaaaaccc tacagcgctt 3900 atgagccggt ggatactgcg actaaacgca ttcgatttcg atataaagta ccgaaagggc 3960 gaaatcaaca tagttccgga cgcgctttcc agaattgtta gcacgctggt gttttccgcc 4020 gacgccgccg tcgatcaatg gtatcagaag ctgatggaaa gagtgcagga agccggggac 4080 aaatttccgg atttccgtgt ggtgaaccaa gagctttaca agaactgcag cagcaaaaat 4140 gaagaaggag ccttgacaca caagtggaag aaggtcgtgc cgctcaacca gcgagctgga 4200 gtgattgcga agttccatga ttcagcctca ggagctcacc ttggcttcca gaaaacatgg 4260 cagaagctgc agaaccactt ctattggccg aaaatgattg atgacgtagc acggcacgtt 4320 agggcgtgcg aaacatgcaa agcaagcaag gcgccgaaca gaacaatgat gccgaacatg 4380 ggtggcccta aacctgctcg cgtgccatgg gagttggttt cggtagactt cgtaggccca 4440 ttcacccgtt ctcgcgccgg aaacacagtt atgcttgtcg tggtagactg ggtgaccaag 4500 tacgtcgtcg ttcatcccat gcgttcagca gattcggtga aaatggtaga cttcctagaa 4560 cagcacgtct tcttgcggta ctctcgacct cgtatcatcc tctcggacaa tggaaaacaa 4620 ttcctatcag cagcattcaa ggctttaatc tccaagcata acatcatcca catgaagact 4680 gcgttctact gtccgatggt gaacaatgcg gaaagggtga acagagtcct gatcacttgc 4740 atacgagccc tgttagacga agaccaccgt ggatgggacg aaaacctaca agccatcgta 4800 gcagctatca acggagcaaa acatgaggcc accggtgtca gtccgcacgt tgcaaacttc 4860 ggtagagagc taatactgca cactgacctc tatactcagc aggagttgaa cacgcccgat 4920 gatccgaagg tagcgcaaga agtacgtctc tctgcgatcc gccgtatcca cgaattcatt 4980 atccagcgca tcaagaacaa ccacgaaaag acaaagcagc ggtacaatat gagaacaagg 5040 gacatcgtat tcaaggtcgg agatctcgta tggcgaagaa cgttcacgca gtcatccaag 5100 gtagatcaca ttaatcgcaa actcgactcc aagttcattc cggctacagt gattaaggtc 5160 ttggggagca acttgtacat gctggaagac gtcagggatg gcaagcgagg acagtaccat 5220 gccaaggaca taaaaccgga ctagtgaagc cttttcatca agctatgtaa gcagcaatag 5280 ctgccaacca cagattatcc acggagaatc agcaaaacac gcaaagcggt catgatttct 5340 gtgtgcagaa cgattagcag acaacgccta tgggtagcag aaggaacgca gtgaagaatc 5400 accttaacgc gctccggaat cacggaggac ctagacggcg ctgctccgca taagtttcat 5460 ctctctgcac cagaatcaaa gccatcgttc agcggcattt tcagccgaag aagttcgaag 5520 ctgagctgaa tccagcattg tccgtcaggt gtgatgagca acaacattct tttagactac 5580 ctctccacct tgaccagagt ccagctatga aagcaaacgt ttttgcagac aactgccttc 5640 gccgagaagg aaacaacaca cgcattgcgg tcatagaaca gaagtcccca gcggtttaag 5700 gagataattc agcggtacac caacactttc atcgcgccag aaagtgtacc gcgcgccgtt 5760 cgcgagtcgg gagaaatcgc accttgcatt tctggaaacg ggaagcaggg atctctgcag 5820 tacaccagct ggcttctgga taggaggagt ggagagccgg agatcgacga ttgagttcgg 5880 ttcgacgcgg attcgaagct tacggccact atcaggcgca gcggaatcgc acctggactg 5940 tactgagaga ggaggagtac gtagcggagc ctacgacagg cttgggagat tggaaccccg 6000 aaagcaacca agttagtagg gaggctccaa ataccagaaa cagtaatgac gcaacagaag 6060 gaaaagccca cctaaggccc gggagattgg gaccctgaaa actagtcagt tagtagggag 6120 gctccgaata ccagaagcag cgacaacacg gtagaagcac aaagcctacg gaaggcccgg 6180 gagattggga ccctgaaagc cagcttgcaa gtaaggaggc tccaaatacc aagaaaagga 6240 gagataacca ggacgcaacg ccacagaagt tcagagccta tggaaggctc gggggattgg 6300 aaccctgaaa gtcagaggat gaccccaaat accagcaaca gtgtcggtgc aaagaaagtc 6360 ggagggcaca gccggcgagg gagattggga ctccaaagaa ctagtaaggt ggctccaaat 6420 gccggacaag ctatgcagac agcgagcaag gagttgttga cgacgagcat acgcttcatc 6480 aaaacacgtt ctgcggcaga aactagagga cccagatcac gaaccgaaac ctcgccgttc 6540 cttgcatacc agtggaccag cacaccagta tcatctgtcg atcacggtct gaacctcaaa 6600 tactacgagc catcaagcaa ccatcagtgg agcccattgc agtcgaccac aagtcgtcgg 6660 acctgaagcg cgcacacgtc gccgatcaac cgatcggaca catcgcagtc ggaggagtaa 6720 atccaaccga gtgtcattga ctgatttcca ggcacaggtc ttgaacgaag tggtagtcgg 6780 ttgcatgggg gtcactacga aatcttcctc taaagtaacg cgaccggggg gcgcggtaaa 6840 gcctcctaac tgggctcgta gtgatttagg tcgaggaaat gcgacctttt ctcaatcgta 6900 actgacaata tccactatca tcgttttcat attagccaac attaaatgta catttcgtta 6960 catgttttga tctctagttt tattgtagct cttaaaacta gcgtagatcg tactgagtgt 7020 ttgccacagg aacggcatag cacagaagat gggacaagaa ctagcgcaga gacatttgct 7080 cattagtgag cgttcagatc acctttggca ggattcgctg tctttacgtt tgtttcgatt 7140 atgtttgagc ttttggatct ggtgagcatt aaggctcgcg tatgttagtt ttgctgtagt 7200 tatcaacatc ttaaggtctg cttaagtgat gactttcaga tctgctccag tctctctgcg 7260 acttcttgta gaagtgagtc tgagagtttt ttgtgagctg cctgtcagcg ctcacaaaaa 7320 actctgcgcc tgtggtaagg ttg 7343 // ID Gypsy-8_AA-I repbase; DNA; INV; 7352 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_AA_; KW Gypsy-8_AA-LTR; Gypsy-8_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7352 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 985-985 (2011). XX DR [2] (Consensus) XX CC Positions [5238-5720] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 478..2814 FT /product="Gypsy-8_AA-I_1p" FT /translation="MNYYHINEQCLEEDEINYELRIRGYSTDGSLETRRRT FT LRRLLRESEGMEVKQTWYSIEDEYVGIPMKLQQIELAIKNRVGAGCLSRLV FT HLHKRVRRYRVISFDQREKQRMLLNAVSQMALNYFGVNLDNLGQAVPLMTL FT VTGPEETNVVSSVPESRDDLAEGKRLGVFPKQQSLNEDLILFSPMPMDGSR FT VDPRRHTFPVAGPQAFRGQAIQPILVPQPQAFDPQVEDDVGGDCESPHFKS FT SIQVQLNRRDPAKSATPIPKDENAFEVNPFERPNVVTDMGQASLRQNPPLS FT NISPVLQDCFSVDGSRDSSSLSPQEQTGKKISLNEYVHVSEIETYIKSCIS FT QILTSDVVDNLAGRLKTVGLKEQNGLEIPYAWSGRPREAESPITHSFAQPK FT PSIPIKNVHGFPSYTGQKVGNPSTEWRPPIPNVSSSSMVPPLYPTTTPEPF FT PNPYRRNDLLNQQVHLPPNSSTFLGPLGTGNPLSNNFVPNHYLRQRLPHQT FT CNIIEKWPKFSGDSNAVPVVDFLRQIEILSRSYQVTQEELRMHAHMLFKGD FT AYVWFTAYDDKLDSWPTLTAMLKMRYDNPNRDRAIREDMRNRKQKPNELFS FT AYLTDIEAMSQRLVRKMSAEEKFDLVVENMKMSYRRRLALQPIQSLDHLAQ FT LCYQFDSLENNMFLTKPAMKPAVLNQILAEEELEDYEEVSEEEDATVMAVR FT PKMSRKDANKSSGSGITSESQEAGQPLCWNCKNFGHMWRECSQRKTLFCHI FT CGHNNTTAYQCPQKHNLKPREPADSKNE" FT CDS 2871..6095 FT /product="Gypsy-8_AA-I_2p" FT /translation="MSNFKYFSQNYNINTCFRRCPHLSVKILDEEIQGLAD FT TGAGVSIISSVDLIKKLGLPINKCNVKIKTADSTEYSCLGYVNVPFTYNSK FT TRVIPTIIVPEVSKILILGVDFLEAFNFRLMIPQEHSQNDSEGVESVEECS FT LFEIMWVENYFGDDDRTICFQIEPETEVVPIPTEMDESLDMPTIEVLKNPY FT ENPADIDTEHQLSDVEKQELFEAVHKLPATTDGKLGRTHLLEHTIELLPGS FT TPRKMATYRWSPVVEKVIDDEVDRMLRLGVIEECTGPVDFLNPILPVKKSN FT GKWRICLDSRRLNSCTKRDDFPFPNMLGILQRIQRSTYFSVIDLSESYYQV FT GLEKASTEKTAFRTNKGLYKFVVMPFGLTNAPATMARLMRKVLGSDLEPFV FT YVYLDDIIITSRSFEHHCQLIRIVAQRLKQAGLTINIQKSKFCQRQIRYLG FT YVLSEDGLSMDASKIQPIVDYAQPKTVKDIRRLLGLAGFYQKFIRNYSEIT FT TPITNLLKKGRKQFIWTEEAEVAFRKLKEALISAPILANPDFDVPFIIETD FT SSDIAIGAVLVQIQRGERRTIAYFSKKLSSTQRRYSATERECLAVLLSIEN FT FKHFVEGSKFIIQTDAMSLTFLRTMSIESKSPRIARWALKLAKYDLTLQYK FT KGTENIPADCLSRGVQALDVLGLDPYVDGLKSQIERFPDKFKDFKIVEGKV FT YKFVTNSTTPEDPVFRWKYVVPLCERQKVVRDIHGEAHLGYFKTLCKVKER FT FYWPRMSSFIKRFCHTCEVCRESKTPNLNVRPPCGKPKECSRPWEVISLDF FT LGPYPRSKHGNVWVLVVSDFFSKFVLTQCMRNATAPAVCLFLETMVFTLFG FT APSVLISDNAQVFKSSSFLKLLEKYQVTHWLLPVYHPAPNPTERVNRVIVT FT AIRCALNKQLSHKNWDESIPNIAMAIRTSVHESTGFSPYFINFGRHMVSNG FT SEYDHLRKLGPDKEPDPTKMSEEMKKLYEVVRLNLHRAYERYSRPYNLRSN FT ARHQFNKGDMVYRKNMHLSDKSKDYVGKFGNKFSKARVKERIGTNTYVLED FT LLGQRIPGTYHGSFLQKA" XX SQ Sequence 7352 BP; 2196 A; 1453 C; 1616 G; 2087 T; 0 other; atttggcgcc caacgtgggg cccgaaaaac agttagtaga attttcaatt ggattttgtc 60 cgaaaaactt ttaatagaat ttcaaattgg agttggttga agcacgattc atacttgatt 120 gtgaatacta attgggtcta gtttaggttt aattttattg gtcttataag tttagtttgc 180 attgagtttt ctttagttct tagtttaggt attggtgtcg gaatctaggt ttggtattgt 240 gcttacagct ttccatttga tagttttatt ttcataagcc ttctagtata tctcgttaca 300 ggaattctta agtttaattg gttattacat ctccttgaat ttcatttact ataatttcat 360 ttgaattttc tctcgtacga atcggtagtg aattagaagc tagtgtgtat ttggtgtaga 420 gtacgtgttt atttctgtca tacgtgagcg aactttgaat ttgaatttga acttaaaatg 480 aattactatc atattaacga acagtgtcta gaagaagatg agattaacta cgagttacgt 540 attcggggct attcgactga tggttcgttg gaaacccgac gtcgaacatt gcgtcgtttg 600 ttgcgagaaa gcgaaggcat ggaggtgaag caaacttggt actccattga ggacgagtat 660 gtgggcattc cgatgaagtt acagcaaatt gagctggcga ttaagaatcg agttggtgca 720 ggctgtttgt cgcgattggt acatctgcac aaacgagttc gtcgatatag ggtcatcagt 780 ttcgaccaac gagagaagca gaggatgcta ttgaacgcgg tgtctcagat ggccttgaac 840 tacttcggtg tcaacctgga caatcttggg caggcagtac ctctgatgac tctcgttacc 900 ggccctgagg agacgaacgt ggtttcttcg gtgcctgaaa gtcgggatga cttggctgag 960 ggaaagagac ttggtgtctt tccaaagcag cagtcgttga atgaggatct gatcctattc 1020 tcacccatgc cgatggacgg ttcgagagtg gatccgagac ggcacacgtt tccggtggcg 1080 ggtccgcagg ctttcagggg tcaagcgatt caacctattt tagttccaca gccacaggcg 1140 ttcgatccac aggttgaaga tgacgtggga ggtgattgtg aatcaccgca tttcaaaagt 1200 tcgattcagg tgcagttgaa tcgtcgagat ccagctaagt cggccactcc gattccgaag 1260 gatgagaatg cgtttgaagt caatcccttc gagcgaccaa acgttgtaac tgatatgggt 1320 caagccagtc ttcgtcagaa tccgccgctt tccaacattt ctcccgtgtt gcaagactgc 1380 ttttcggtgg atgggtcacg agatagttca tctttgagtc ctcaagaaca aactggaaag 1440 aaaatatcgt taaatgagta tgtccatgtc tcggagatag aaacgtatat caaaagttgt 1500 attagccaaa ttctgacgag tgatgtagtt gacaatttgg caggtcgatt gaaaaccgtt 1560 ggtttgaagg aacaaaatgg gttagagatt ccgtacgctt ggtctggaag gccgagggaa 1620 gcagaatctc ctataacaca ttccttcgct caacccaaac catcaatccc aatcaagaac 1680 gtgcatggat ttccatcata tactggtcaa aaggttggga atccgtctac ggaatggaga 1740 cctccaattc caaatgtctc ttcaagttct atggtacctc cgttgtatcc gacaactact 1800 cctgagccgt ttccgaaccc ttatcgtcga aatgaccttc taaatcaaca agtacatcta 1860 ccgccgaatt cctcgacttt tttaggaccg ctaggaacag gaaatccttt gagtaacaac 1920 ttcgttccaa atcattattt acgtcagcgt ttaccgcatc agacgtgcaa tataattgaa 1980 aaatggccga aattctccgg ggatagcaat gccgtgcccg tagtagattt tctgcggcag 2040 atagagatcc ttagccggtc ttatcaagtg actcaagagg aattacgtat gcatgctcac 2100 atgttgttca aaggggatgc atatgtatgg tttacggcgt acgatgacaa gttagattcg 2160 tggcccacac tgaccgcgat gttgaaaatg cggtatgata atcccaacag ggatagagcg 2220 attcgagaag atatgagaaa caggaagcag aaacccaacg aactttttag cgcctattta 2280 accgacatcg aggccatgtc gcaaaggttg gttcgtaaaa tgtctgccga agaaaaattt 2340 gatttggtag tggaaaatat gaagatgtcg tatagacgga gattagcttt acagccaata 2400 cagtctcttg accatctagc tcaactgtgt tatcagtttg attcgttgga aaataacatg 2460 tttcttacca aacccgcaat gaaacctgcg gtacttaatc aaatcttggc tgaagaggaa 2520 cttgaagatt atgaggaagt atcagaggaa gaagatgcaa cagtgatggc tgttcgacct 2580 aaaatgtctc gtaaggatgc aaataaaagc tcgggatccg gcataacctc agaaagtcaa 2640 gaagctggcc aacctttatg ttggaattgc aaaaactttg gtcatatgtg gagagagtgt 2700 agccaacgga aaactctttt ctgtcacatt tgtggtcaca ataatacaac ggcctatcaa 2760 tgccctcaga aacataatct caaaccacgt gaaccggcag attcaaaaaa cgagtaaaaa 2820 cggtagtttc ggggaaccaa attaccgaaa aagatcagga tagtgttccc atgtccaatt 2880 ttaagtactt ttcgcaaaac tataacatta atacttgttt tcgtagatgt cctcatttat 2940 cagtgaagat tcttgatgaa gaaattcaag gtttagccga tactggcgcc ggagtttcga 3000 taatcagctc tgtagacctg attaagaaac tggggttacc cataaacaaa tgcaacgtta 3060 aaatcaagac ggcagatagt accgaatatt cttgcctagg ttatgtgaac gtaccattca 3120 catacaattc aaaaacccga gtaatcccta caatcattgt accagaagtt tcaaaaatac 3180 ttattctcgg ggtcgacttt ttggaagcgt tcaattttcg tttaatgatc cctcaagaac 3240 atagtcaaaa tgacagtgaa ggtgttgaat ctgttgaaga gtgttcttta tttgagatta 3300 tgtgggtcga aaattatttt ggagacgacg accgtacaat ttgttttcaa attgaaccag 3360 aaactgaagt tgtacctata cctacggaaa tggacgaaag tttggacatg ccgactattg 3420 aagtcttgaa aaacccgtat gaaaatcccg cagatataga tactgaacat cagctttcag 3480 acgtcgaaaa acaggaactg tttgaggcag ttcataagct tccagcaacc actgatggca 3540 aattaggacg cactcatctt ttagaacata cgatagaact tcttcctgga agtacgccta 3600 ggaaaatggc aacctacaga tggtcgccag tcgttgaaaa agtgattgac gacgaggttg 3660 atcgtatgct acggcttggt gtaattgagg aatgcacagg acccgtagac ttcctgaatc 3720 ctatactgcc cgtaaaaaaa tccaatggca aatggagaat atgccttgat tcacgtaggt 3780 taaattcatg caccaaacgt gatgattttc ctttcccgaa tatgctcgga atactccaaa 3840 ggattcagcg ttccacttat ttctctgtaa tagacctttc ggagtcttat taccaagttg 3900 ggttagagaa agcctcgact gaaaaaacgg cctttcgcac aaacaaagga ttgtataaat 3960 tcgtggtgat gccgttcggg ttaacaaacg ctccagcgac gatggcacgg ctcatgagga 4020 aagttttggg aagtgatctt gaaccgttcg tgtatgtata tctggacgat atcattatta 4080 cgtccagatc atttgagcat cactgtcaac tgattcggat agtcgctcag cgccttaaac 4140 aagcaggact cactattaat atccagaaat cgaagttttg tcaacgacaa attcgatatc 4200 taggatatgt tttgtctgag gatggacttt cgatggatgc gtccaaaatc caacccatcg 4260 tagattatgc gcaaccaaaa acggtcaagg atataagacg cctcttaggg cttgctggtt 4320 tctaccagaa attcattcga aattactccg aaataacaac accgatcact aacttgctga 4380 aaaaaggacg caaacagttt atatggacgg aggaagccga agtggctttt cgaaagttaa 4440 aagaagcctt aatatccgcg ccaattctgg ctaatcccga ttttgacgta ccctttatca 4500 tcgagacgga tagttcagac atagcaattg gggccgtctt ggtacaaatt cagcgaggag 4560 aacgacgaac gatagcctat ttttcaaaaa agctatcaag cactcagcgc cgttacagcg 4620 caacggaacg agagtgcctg gctgtattac tgagcataga aaacttcaaa cattttgttg 4680 aaggctcaaa gttcataata cagaccgacg cgatgagctt aacgttctta cgcacaatgt 4740 ccattgaatc gaagtcccct cgtattgccc gatgggcatt aaaactcgct aagtatgacc 4800 taacgttaca gtacaaaaaa ggtaccgaaa acatacccgc agattgtctt tcgcgaggtg 4860 ttcaagcact agatgttctt ggattggatc cttatgtcga tggcctcaaa tctcaaattg 4920 agcgatttcc agataaattc aaagacttca agatcgttga agggaaggta tacaagtttg 4980 ttaccaatag tacgactcct gaagatccag tgtttcggtg gaaatatgtt gtgccattgt 5040 gtgaacgtca aaaggtcgtt cgagatattc atggagaagc acatttggga tacttcaaaa 5100 cactttgtaa agtcaaagaa cggttctact ggcctagaat gtccagcttt atcaagagat 5160 tttgtcatac ctgtgaagtt tgtcgagagt caaaaacgcc aaacctgaat gtacgaccac 5220 cgtgtggaaa gcctaaggaa tgttctcgac catgggaagt aatttcgcta gactttcttg 5280 gtccttatcc aaggtcaaag catggaaatg tatgggtttt ggttgtgagc gatttcttct 5340 caaagtttgt tcttactcag tgcatgagaa atgcaaccgc tcccgcagtt tgcttatttc 5400 tcgaaactat ggtgtttacg ttgttcgggg caccctctgt actaatatcc gacaatgccc 5460 aggtgttcaa gtctagctcc tttctcaagc ttcttgaaaa ataccaagtc actcattggc 5520 ttctcccggt ataccaccca gctccgaacc caacagaacg cgtaaatcgc gtaattgtca 5580 ccgcaattag atgtgctttg aataagcaac tctctcataa aaactgggat gagtctattc 5640 cgaacatcgc aatggcaatt agaacgagtg tgcacgagag tactggattt agtccatact 5700 tcattaattt tggtcggcac atggtgagca atggtagcga atatgatcac cttaggaaac 5760 taggtccgga taaggagccc gaccctacca aaatgagcga ggaaatgaag aaactctacg 5820 aggttgtccg acttaatctt catcgtgcat acgaacgata ctcacgtcca tacaacctcc 5880 gttctaatgc gcgacatcag ttcaataaag gagatatggt ttatcgtaaa aatatgcacc 5940 tgtccgataa gtctaaggac tacgtaggaa aattcggaaa caaattttcc aaagctcggg 6000 taaaagaaag aatcgggacc aacacttacg ttttagaaga tttattaggg caacggatac 6060 caggaacgta tcatggctct ttccttcaga aagcataagc cttacattat taaaatccag 6120 ctatgactgc tctagtaccg gctagtgcat acaccaaaaa cgttcatcaa aatacgcttc 6180 atagcggtgt tacggctcat cgtcacgtac tgagatgtcc ttttggtttc ctcgtacgtt 6240 ggccactctt agctgacatg atcagcatga aagaatcagc tatgactatg ggacgtgggt 6300 cccatataca caaagtgcct acagcagtag aaaatactct tcggaggtgt ttccagcagt 6360 ataagcaggt aaaaatagat ctgctcgaag acacagctat gtacggtgct cgattcatgc 6420 ccagaccaca ctcggaaaac actcattgag taatccttcg cagcgaaatg aaagttagtc 6480 gagtgtctaa tcacagcctc gaaacgccta cgacctccta actttcagag tagcattgct 6540 atccgtagaa gatatgttga atgtttgtgt cctcctaaaa tagtatatca acaattagtt 6600 agttgttcgt ccactacgcg agagaatttg tcgtaccatt catatcatca aatcattcga 6660 actagcttca ttctcaatcc ggttattttt ctcaattaaa gctttcaact acagcaccac 6720 agtttattcc acttcaatcg cacagttaca atagtttccc aatagttttt tcgccttaat 6780 ttccgtttta acttaaatat attgcaactc acttttaata atcgctacac ctctgacagt 6840 tctctttgca ttagctgttt atgtatgtgt agggtcaaat cagtcggtta gagtgtaatg 6900 gtttgaaacg ctatgttgac aggaatgagt aaaaatcggg cgcgtatggt tttgagtgat 6960 taggttgagc agaatgagtg aaattggtag gataggtttt aaggtttagt tgtataaggc 7020 aatttatatt ttcagacaat ttctgaagta acttaggatt atcggttatt aagtttccaa 7080 acatttttgg agtaaattgg gacttctaag gaagttaatt tggtttagat gtaagcacaa 7140 tttccaaaca tttttggagt aaaaatatcg gttattgtaa ataattttat ctaggaaaat 7200 cggaatgagt cagttagaat gagtgagttg attagaaggg agtaggagat aagaatacaa 7260 gaaagcaagt cttcatagtt gttgagagcc cgctgttcga atttgttgac ccatcaagtt 7320 tcaataaata taacaaatgt gtttgtgcaa at 7352 // ID Gypsy-87_CQ-LTR repbase; DNA; INV; 353 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-87_CQ_; KW Gypsy-87_CQ-I; Gypsy-87_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-353 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 554-554 (2011). XX DR [2] (Consensus) XX SQ Sequence 353 BP; 98 A; 80 C; 65 G; 110 T; 0 other; tgttctactt cttgggcgac cttcttgtgg cccacattct gtgacacttt gaagtcagca 60 gttgctttcc gtgttgtgtt taacactgac cctctttgat gggtaccaga tgttgacaaa 120 catgcaccaa gcactttgtt atattcaagc gaccaccgaa agagtactcg atggtaaata 180 accaaaaagt aggtgtggct tattagccat cttgttctag gaattcacca cagccaacaa 240 ggcttaagtt taagttagca ataagtttga agttaaaata aagatcattc ctagttccac 300 cgtcaaccgt gttagatgtc tcatttgtat caccttaccc tccttacata aca 353 // ID BEL-185_AA-LTR repbase; DNA; INV; 411 BP. XX AC supercont1.118; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-185_AA_; KW BEL-185_AA-I; BEL-185_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-411 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.118; Positions 1583094 1583504. XX SQ Sequence 411 BP; 154 A; 70 C; 82 G; 105 T; 0 other; tgttgcgaag caatttcgtg acataatagg aacaaagctt cctgtcaaac ctttttgtaa 60 taagaggaag aacaagttag agaaacgact aaaagtcact taaagtcgac agtgaagaaa 120 atatcatagg gaaaagaagt ctagtaaagt tgaattgaaa tcatattatc taccttaaac 180 gaatttaaac actacaagat ccagcaaagc tgaagccatt cgatgtttgt cgaaagataa 240 gggaatagaa tagaaggtcg cactgagaaa ttgtaaggaa gtgagtttgg aattgcacag 300 aattggaact aataaatctt gttttagctt tagctcatcc actaaacgaa acacctttga 360 tttgctgaaa gaagctcagc ctccgactct acgccctatc gttcggcaac a 411 // ID TGRP1 repbase; DNA; INV; 1032 BP. XX AC M57919; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE T.gondii repeat region. XX KW SAT; Satellite; Simple Repeat; Repetitive sequence; TGRP1. XX NM TGRP1. XX OS Toxoplasma gondii OC Eukaryota; Alveolata; Apicomplexa; Coccidia; Eucoccidiorida; OC Eimeriorina; Sarcocystidae; Toxoplasma. XX RN [1] RP 1-1032 RA Cristina N., Liaud M.F., Santoro F., Oury B. RA and Ambroise-Thomas P.; RT "A family of repeated DNA sequences in Toxoplasma gondii: RT cloning, sequence analysis, and use in strain characterization."; RL Exp Parasitol 73(1), 73-81 (1991). XX DR GenBank; M57919; Positions 1 1032. XX SQ Sequence 1032 BP; 247 A; 371 C; 227 G; 187 T; 0 other; gtcgacattc gtgccacagc gacgtcggac ctctggttgt cacctgcggc ggaacctaga 60 catcgctaca ccgcccacac agcgcttccg acgcgagccg acaaaccctc aacacgtaac 120 ctccatcacg tgtagtctct gcatggcctg acgtccgagt cgaaatacga gatcggcact 180 ctcattctcg ctcactgtcg cttcctgtca acgagcagcc acgccggaaa accacctgcc 240 aatccacaca gcgaagactt ctcacccggc cctctatcca ccacccccca cgacctcatc 300 ccaccccgca acctgtggtg tcgccaaacg caacacgtcg gcagctcgtg ccaaagtgcc 360 gaagacctct tggaggtggc cgctccgcaa actaaacatg gacatgccgc cggacagtgc 420 ctccgctgcc acctaacaac acctcctcgc ggtcactcca tcagcagtcg cctcggtgca 480 tcgtggtgcc gcagttgcct cgagacacag aaccacgccg cgagtccgtt tctcttctta 540 cctccaggtt ctgaacagaa cgcccagcaa tctcgcagac agactgcaca acgacacaca 600 accaataact tctcacccat ccgcgcaggt acgcgacgaa actcatacgg gaaaaaggca 660 ggaccgtgtg gggctgccac acgccacaag cggcgtattc cgaaacgagt gaactcggat 720 cgttcgcttc cgcaactgga ttggcccacg catcccacgc cggacactca ctctctgcag 780 gattccgaca acacgtcatc gaggtgcggg aacaagttgt cctttctcca cggggtcgta 840 ttcgcctctg aagataaaag atgctacatc gctctccctt atactccctt cctgtacacc 900 gttgccacgg cgtgaaagca ctggacattg acacagcctg cagcaggcta tacagcgata 960 atatcttaca acgcacccca gactgttaca acctcctgac atctgtggcg ccaccaaacg 1020 caacacgtcg ac 1032 // ID MERLIN1_SM repbase; DNA; INV; 1115 BP. XX AC . XX DT 04-OCT-2007 (Rel. 12.1, Created) DT 07-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; MERLIN1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1115 RA Jurka J.; RT "Merlin1_SM: Merlin-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 7(10), 1087-1087 (2007). XX DR [1] (Consensus) XX CC The youngest copies are up to 99% identical with consensus. 8 bp CC TSD. XX FH Key Location/Qualifiers FT CDS 71..988 FT /product="MERLIN1_SM_1p" FT /translation="MDYQPNKKSMINLNLLSLYDIVRTKEGAIRFLQDRGI FT LNRRKACLNNHEMILSITDERERWRCTKKTCRNEVSLKSGSWLENINLSYT FT EVVLFIYSWSFELTSIKFCQRELSISSNATIIDWNNYLREVCASSLLRNPL FT IIGGPGLHVEIDESLFTRRKNNVGRVYPQQWVFGGICRETGDCFLYVVPDR FT SANTLMPIIQQSIRPGSIIISDEWRAYSQIPNLGFGYVHETVNHSVNFINP FT ITGANTQMIEGCWSLAKQRNKRHFGTHRGMLDSYLCEYMWRRRLNGANPFE FT KILHDIREYWPPAQ" XX SQ Sequence 1115 BP; 365 A; 183 C; 220 G; 347 T; 0 other; tggcgcttgt ttagaattgc cccgcatttt ttaaataaat tttattatag ggcatttaat 60 acttttaaag atggattatc aacctaacaa aaaatcaatg attaatttga acttattgtc 120 actttatgac attgttcgaa ccaaggaagg tgccattaga tttttgcaag atagaggtat 180 attaaatcga cgcaaagcat gcctaaacaa ccacgaaatg attttatcga tcacagatga 240 aagagaaaga tggagatgta cgaaaaaaac ctgtagaaac gaggtttctt tgaagtctgg 300 atcttggctg gaaaacatta acctctctta taccgaagtt gttttattta tatattcttg 360 gagttttgaa ttaacttcga ttaaattttg ccaaagggag ttaagtattt catcaaacgc 420 aaccatcatt gactggaaca attatctccg agaggtttgt gccagtagtc tgttacgaaa 480 tcccttgatc atcggtggcc ctggtttaca tgtagagatt gatgaatctc tatttaccag 540 gagaaaaaac aacgttggac gtgtttatcc acaacagtgg gtgttcggcg gcatttgtag 600 ggaaactggg gattgctttc tttatgtcgt tcctgatcgc tcggcaaaca ccttaatgcc 660 catcatacag caatccataa gaccagggtc aattataata tctgatgaat ggcgggcata 720 tagtcaaatt ccaaacctgg gatttggtta cgtgcatgaa acagtcaacc attcagtaaa 780 tttcattaat cccattaccg gtgcaaacac acaaatgatt gaaggatgtt ggagtttagc 840 gaaacaaaga aataagcgtc attttgggac gcatcgtggc atgttagata gttatttatg 900 tgagtatatg tggcgacgaa gattaaatgg tgcaaatcca tttgaaaaga ttttacatga 960 tatccgtgaa tactggccac ctgcccaatg atcgagttag ataactgttt ttacagtcat 1020 gttagctatc tttaataata aaagcttgta ttaaaagttt attatttaaa ataaaattta 1080 tttaaaaaat gcggggcaat tctaaacaag cgcca 1115 // ID BEL-71_CQ-I repbase; DNA; INV; 8013 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-71_CQ_; KW BEL-71_CQ-LTR; BEL-71_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8013 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX CC Positions [5075-5626] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 6180..7967 FT /product="BEL-71_CQ-I_1p" FT /translation="MCSFLGHINCFEGSGLIAYDCGSPEVNLTSYSLLDVA FT SCLPPSNNLTTHEVQIQVLQRNVKSDIKVYQCKVIIKRKIRHCGMHSHTSD FT YEGSYKYIVKEFTSEECRTSQQLGIVRLTYEHQINEIKRNETTRGETLIEG FT TVRNSDCKGETYKTPEFTWFDALVYYEYEITLRDYTAVVDYENDVILLRNG FT LTCTYSHGKCLDSEDGYITWDVAFNNKCEETEFEVIYEGTVNKTKNENTKT FT HREGFNVVYSTVSDTHLFSIRTREQTKVCGHPGYLTDHPRIIVVEVIGFNS FT PFTKKSSPGRNFDLFTYFNSKITLVENYVGQSMSQLYNTVMTEMCKVDKNL FT METKLTLARLNPTEFVTSIIKRSGFTAVVAGEVLHILECKPVYVTPKSDEI FT CYQEIPVIYNNQTMYIAPVTRVLQNRGTQIDCTPLLPAKFQIGGRWYTTDS FT RFRETTAPMKLTTDLLTTWSYTPLPNLMQSGVYDAESVVKMRNLIYEQGDR FT RVASNVMHKILIGQQPNFQGFNFEALNVVEKAQNIIYSIFESSYIIGFCRI FT SSFFSGIIMMSYVCILIFNPMTWIMIYRKIKCNKTREQARININLETHL" XX SQ Sequence 8013 BP; 2683 A; 1459 C; 1653 G; 1870 T; 348 other; gagcgtaagt tccgcagtta taagaacgtt cccaccgacg tttttacatg gtggctctgc 60 agagaggatc aggagaagtg cttgtcggaa tcaggcactg aggaccccga gcttccggtc 120 agtcgaaatc tgatcgtgat cgtaagtata agcgtgtccg acgctataga gcggacaagt 180 gatcaagtgt tcttgagcag tgtcaggctc agatagtgcg gacgaattat cgtcaagagt 240 gtaaaagcgt ctccgacgct ctagagcggt caagtgacca agtgtgagca gtgtcaggct 300 cagatagtgt agacgagtta tcgtcaatag tgaataatcc tggacaatag tccagaagaa 360 caccgtgaaa tcctgggaaa gaacccagaa gaagaagaaa gtaagaagaa ccttttaata 420 ggtgattcaa cattaaactt tacttcaagt aaagggaaga ctggatgaaa atccagagat 480 ccgcttggtt gcagaaaatc accaaaaata gtaaaaaaaa aaaacctttt aatagatagg 540 tgatttcaag aagtgaagtg agtgattaaa ggaagactag aaacacatat ataaataaaa 600 atatatatat aataaaggaa taaaaactgt aagattgcaa agcagattta aatcggagas 660 agtgcaataa aaaaaaaaaa caataagaag cagcttaaca sagtggatcg tcgagcgtcg 720 aaaaggaatc aacgacgagt gcatcaacaa gcgacgtgtc aaccagcaac acccatcgag 780 catcaagaaa gatcggcgam gcatcgatca tctgaatcgt aaggtattca gagaagatgg 840 attacaagtt cacwgcgaac aaaaacggta gttgccggmg gtgcgacaat ccagatgatg 900 tcsagcaatt catggtatct tgcgacgatt gtgatcgttg gttccacctg caatgtgtag 960 gacttgcccg tactsctaag taggaaggam cactggmast gcgtcaaatg tgaagctatt 1020 tgccaagact ttacaaagtt agttgaacaa aaggctctat caaatctmtt ggagagaagt 1080 attgaagatg tttcaacwma accagcagca gaaaattcag agacaaatca agacttgcaa 1140 gttcttgcaa ggatagtcga taaaattaaa aggactgtca gctcaacaaa catcttcaga 1200 tcattttagt gattaggcaa actttaatgg acttgccaga attcgacggg tcttacaaaa 1260 cctggccaag gttcaaacaa accttttatg aaaccaccaa gcawggtaat ttttctgatt 1320 tagaaaatat taatcggcta gaaaaatgtt tgaaaggwga agcactaagc agagtcaact 1380 ccctactaat ggattcaagt aacgtgagcg aaattatgaa cattttagaa gaccgatttg 1440 ggagtgtcga acgagtctac aacgggttgw tgaacgatgt tttagcattg cgtaacccga 1500 gttttgagaa tcctaaatcc atgattgatt ttatatctgg aattggagat ttggttatta 1560 atatggagtg cttaatcatg aagaatactt gaacgaccat agactagttc ggatctcgcg 1620 aacaagctac catctggtct acatcaaaaa tggttgagga atttgaacga agaaaaaacg 1680 ctgtcggcga ctataaaccc atatatcgct ccaacgttaa aagaccttta cgagttgtta 1740 aaaccagaag aaaaattagc tattgcgctg ttagccgaaa gagggcacta agaaagttca 1800 agttgaagaa ctagatcaga gcgaatttca cacgcgggat acaagcaaat tagctgtaca 1860 agtgtaatgt gtgaamaaat cacaagctga ctgaatgcga tgaattcaag aaaatgtctc 1920 cagaagaaag gagaaaattt gtagcggaaa agaaattatg cttttcttgt tkacgcgcaa 1980 atcatktggc aaaacattgt agatttgcaa gaacttgcaa catagaaggt tgcaaaagta 2040 aacataatcg ctgcttcatg gaagcaaacg gaaaccggag cagaaaacct acaaaagaac 2100 ataaggtgaa ttgtcacgtt aagagataca acaaggttta ctatcaaatt gttccagtaa 2160 ctttgatgaa tggtgacaat aaattcgaaa catttgcatt ctttgattcc ggctcatcag 2220 tcagtctcat taataaaaat gttgcgatca gctccaagtt aaaggaacag ataaaccaat 2280 gacattggcw tggacmaatg gwgawactca agaagattcc aaaagtatgt ctgttcaatt 2340 gaatatcaaa tctcccaacg gaaaaasgtt caacttgaaw gacctgcgca ccgttaaaga 2400 tttatcgctt ccaacgcaat ctgtagacgt taaccgtctt aaaaaamagt tttcttatct 2460 taagaatagt aagttggaat cttatgaaaa agcagtgcca acaatactac tgggtttacc 2520 acatgcatac ttgttcaaag gtaatgaaga aatatcagga aggttcaatg aaccaattgc 2580 aagaatgacc aaattaggtg ggtcttattt ggaagtagta agcwtgacgt tcataacaaa 2640 aaggawcatt tgttcataat tcaagaagtc aaaaaagaag acagtcaata aagaaatggt 2700 gtccaaattc ttctctgttg aatctttcgg agtaaaagtt cccsaaaacg wkcttatttc 2760 taaawcmgat gaacgcgcmc tmmagataat gggcgawacm wtagwaccaa cmgaaagkgg 2820 wtatgaaatt ggtctgctmt ggaagggcga mgwkgttgaw ctwccwaaca gctwcaaaac 2880 mgcwctaagt mggttcctsm tkctggaaaa gaagwkmkcc aaggatmcwk cwktsaagga 2940 wtggtatmac gacaaaatmg acgaatactt gsgwaagggc tacmtgmgga agttgagccc 3000 gsaagaakcm ktgackgaam ctccgagaac mttttacctg ccccacttca taackgwaaa 3060 taagaacaaa sagccmccaa aaccaaggtt ggtcttcgat gcagcggcaa aaattaacgg 3120 tgtttcatta aattcgaagc tgttgtctgg acccgatacg accgaatcat ctctcggagt 3180 taaaatgcgc ttcagggaag gacctatcgc cgtatccggg gatataagga gatgtttcac 3240 caggtgggaa tgagtaaaga ggatagagac tctcaaagat tgtttctcgg aaagcaatcc 3300 tgaagaacgc ccttgacata tatgaaatgc aagtaatgac atttggagcc acttgctcac 3360 cagcctgtgc ccaatacgtt aaaaacgcta acgcaaaaca atttttgaat acttgtccag 3420 aggcagttga agctatgatg aaaaatcatt atgtggatga ttatctagac agcttcagcg 3480 atttgaacaa ggcaacaaaa attgtsctag atgtgataaa aatccacgac aacgcgcact 3540 ttaagatgag aaattttatc tcaaactcaa aagcattgct ggcaaattac tacccggtga 3600 tagagtatct gcattgaacg aaatcaaatt agacatggac gagaacgttt ttgaaaaagt 3660 acttggcatg tactggaata cggataaaga caattataaa tttaaaataa aagccaaaat 3720 tgaatccgag atacaagatc ctacaaaaag acaattatta tcgtttgtta tgagcattta 3780 cgatccctta ggtttgattg cacacataac aatacactca aagatcatta tgcaagacct 3840 ttggcgtgtt ggaatagagt gggatgacaa aatcccagat tctgtaaaag aagattggtc 3900 tacgttggat gaaamkatta tcwwcactgg acamcctwag gattcccaga tgttactcat 3960 atgcaaagaa cgtwacsaaa mgwgaacttc acgtwttttg cgacgcgtcm kcaagmgcgt 4020 acgckgccgt cgtmtacatm agaaccatac mwgmtgaggg mattgacktg acgatwktag 4080 cwggwaaatc aaaggttgcc ccwacmaaaa ttwtgtcaat mcckcgtttg gagctacaag 4140 ctgcgstwct mggaacwaga ctagcaaaca ckgtkcwgaa ggaamtamgg ttgkmgattg 4200 acamwgtagc tcactggtcc gactcaaaaa cwgttctgag ctggataaaa tctgaastma 4260 ggaaatataa gcaatttgtg cagtacmgag twagtgagat mctwgacasc acagtggagt 4320 ctcaatggmg atacattgac wcggccwmma accccgcaga cgaagsaact aaaacgatmm 4380 cawcmmawtc aaaatgggtw kmcggwccms mkttcttgaw aacwkacgat aacctcwgtg 4440 ggawttggag awaaaggkwc mscaaactgt tgatctaagc catgaggagc tkcgwcctat 4500 kttcacmatm kcakckgtag akaaaccwaa cttkgawtgg wtgaacatwg awtggtgctc 4560 kgattggamc cgcttgaaga ggtcggtgtg tatmgtmwtg aagtacwtwg astggctmaa 4620 acgtaaakcw aagmagacaa catttgacaa waakatwaca gcctctgaca tggaaaaggc 4680 ggagacwats ctwatcmaga aggcacaatg ggagtcgttt mscgatgaag tgttggactt 4740 gmmaaccaag aawasmatcg awgaakmgag caawatwmga mcactwwcac cgwtcctakm 4800 kgaagatggm gcamtggwwt cwgamactcg statgwmaat gcwgaatgck taccwtatgg 4860 mgckmgkcaw ccatwcatwc tsccwgaagm wcatcatgtw tcasawctcg twttgaagaa 4920 wcawcacgag wtmtttwamc ataagaagas mgawgcwgca atmgccgcag tkmgacwgaa 4980 ataccatatm attcasatwa amgcagswat gmawagagtk gkcgwtmgat gccaatwttg 5040 caaaaacwmk mgsgckaagg cttwmgctcc gmgaatggcg ccwwtgcctg aatgccgwmc 5100 kcagccgtwc gtaaaaccmt ttackcatac aggwgtwgat tattttggcc cwtacaacgt 5160 mtcaatmggw agaaggwccg aaaagmgwtg gggwgtcgtt ttcacmtgta tgacsacwmg 5220 agcagtttat ctggaastkg ctcacgatct aagcgcwgac gcsttcwtsg tstgcttgaa 5280 mtccgttcaa agcagacggg ggaagatwaa gttcttgtac agcgacaatg gcacmaactt 5340 tgtmggtgca gataacgagw tgaagaagat ccgccatmgg ctgkcgtcwg acggkatcgw 5400 wtggmmcttc awcccwccmk cwtccccwca tttcggwgga gcwtgggaac gwatggtwmg 5460 agaagttaag wccctgcttc cwagcgawac catgcctgaa cacacgcttm gakcwctmct 5520 mactgaaatt gaattcatca taaactgcag gccwctkact tctatttcat tggatgctac 5580 tgatgatgaa ccgttaacac cgaatcattt tctgatcggg tgcgcaggcg gatctgagcc 5640 atctttaaac gatgcttcta aagctgaagc tacaaggcag cagtggaaaa gagttcaact 5700 gttggcaaaa aattattggg acagatggct gcatgaatat ctccccacac aagctaagag 5760 atcaaaatgg acagaagcca ttagaaacgt aaaagttggc gatatagttc ttatcgcaga 5820 tgataatgaa aaagcaaaat ggcagaaagg aattatcgaa aaagtacacc catcaaagga 5880 tggatgcgta agatcagctg acgtaaagac taccactggc gtatacacca gaccagtagt 5940 caagttagct gtattggatg taggccttcc agaaggtcaa aaacatcaac tttcagacat 6000 taacgatact aataagcgta tcactcgaag ccaaacaaag attcaaaaca gaaactttgt 6060 gggactaata aattcgattg aacatgatga tgacgattat gaaccaaata aaactaaact 6120 aatcattgga aaaaagcaaa cctttcacaa ggcagaatct gctgctaaca ttatacttaa 6180 tgtgtagctt tctgggacat ataaattgct ttgaagggtc tggacttata gcgtatgact 6240 gtggaagccc cgaagtaaat ctaacgagct attccttatt ggacgttgct tcatgcctcc 6300 caccttcgaa caatttaaca acgcatgaag ttcagattca agttcttcaa agaaacgtta 6360 agagtgacat aaaagtttac caatgcaaag ttatcataaa aagaaagata aggcattgcg 6420 gaatgcactc tcatacatct gattacgaag gaagctataa atacatcgta aaagaattca 6480 cctctgaaga atgtcgaaca tcacaacaat taggaattgt aaggctaaca tatgagcatc 6540 aaataaacga aataaaacgg aatgaaacta ccagaggtga aactctgata gaaggaactg 6600 tgagaaacag tgactgcaaa ggtgaaactt acaagacacc agaattcaca tggtttgacg 6660 cgttagtgta ttacgaatac gaaatcactc tacgcgacta caccgccgtc gttgattacg 6720 aaaacgatgt cattctgctt agaaacggct taacttgcac atactctcat ggaaaatgcc 6780 ttgattccga agacgggtat ataacatggg atgtcgcatt taataataag tgcgaagaaa 6840 ctgaattcga ggtgatctac gaaggtacag ttaacaaaac aaaaaatgag aacacgaaaa 6900 cacaccgcga aggtttcaac gtagtttaca gcaccgtttc tgatacgcac ttgttctcta 6960 tccgtaccag agaacaaaca aaggtgtgtg gtcatcctgg ctatctaacc gatcaccctc 7020 gcataattgt agttgaagtt atcggattca actctccgtt caccaaaaaa tcttcaccag 7080 gaagaaattt tgatttgttc acgtatttca actccaaaat aacgctcgtt gaaaattacg 7140 ttgggcagag catgtctcaa ctttacaaca cggttatgac cgaaatgtgt aaggtagaca 7200 aaaatctaat ggaaacaaaa ctgacattgg ccagacttaa ccctactgaa tttgtcacca 7260 gtatcatcaa acgatctggt tttaccgcag tagtagcggg agaagtccta catattctgg 7320 aatgtaagcc agtatacgta actcccaaat ctgacgaaat atgttaccag gaaatcccag 7380 taatatacaa caatcaaact atgtatattg ctccggtaac tagagtactc caaaacagag 7440 gaacgcaaat tgattgtact ccgttgctgc cagcaaaatt tcaaataggt ggaagatggt 7500 acactactga tagtagattc agagaaacaa ctgcccctat gaaactaaca accgacctat 7560 taacaacttg gtcgtacacc ccgttaccaa acctcatgca gagtggagta tacgatgcag 7620 agagcgtagt gaagatgaga aatctaattt atgagcaagg cgatagacgc gttgcgtcaa 7680 atgtaatgca caaaatcctt atcggacagc aaccaaactt tcaaggattc aattttgaag 7740 cactaaatgt agttgaaaag gcccagaaca ttatatactc gatttttgaa tcatcatata 7800 ttataggttt ttgtcgcatt tcatcattct tttctggaat aatcatgatg agttatgtat 7860 gtatactaat atttaatccc atgacatgga taatgatata caggaagatt aaatgtaaca 7920 aaacaagaga acaagcacgg atcaacataa atttagagac gcatctgtaa tattttattt 7980 tgtaaaccat aggatgattt acgggtcccg gaa 8013 // ID BEL-193_AA-LTR repbase; DNA; INV; 332 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-193_AA_; KW BEL-193_AA-I; BEL-193_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-332 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 878-878 (2011). XX DR [2] (Consensus) XX SQ Sequence 332 BP; 96 A; 58 C; 79 G; 98 T; 1 other; tgttagggag attattwtaa attaactata tttttggcct ttactgtaga acagaatatg 60 gaatttatgt ggtagataaa ctctcaatga gtgaccgtgt gcgaagagat aggctctcgc 120 agcatccctg ataaggcaga gataggagaa agctcagttg cttaccattc gccgccggta 180 tcggtttata atatacagtc cacttcgagg gtttagtttc aattaaacaa agttttttag 240 tgatcgtatc accgggtttc gaatgctaaa tacggccgaa caggtgtatt cgcgggtgcg 300 cacattagtg caaaatcggt agtcatcgaa ca 332 // ID CR1-13_HM repbase; DNA; INV; 5960 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5960 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1841-1841 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 803..5761 FT /product="CR1-13_HM_1p" FT /translation="MPLKKKKKKKSMAGTSDFESIKFNFSEINNMKIKPDI FT NNLHDTLMDCSYHYPSELEGFLFKNVCDKSFRILHVNIRSLNNNFEKLLSL FT LEETKXCFNIICLTETWISNDLNNSSNFYIPHFKLISLHRQVNKRGGGVLI FT YINENIVYYVRNDLSVSDGDKEILTIEIINNKSKNILLSCCYRPPDGVSEN FT LSIFFEQSIFKKGIKEKKKNFIIGDLNMNCFLYSKDNKVKNFYDSFFETGA FT IPLINRPTRVTKNSASLIDNIITTDISDNDIQTGILKSDISDHFPLFLSLK FT SNSEKITKSNKIKIRTFNETNIKQFKSQLSLLHWKHINFNDNADKIYDKFF FT ETFYSVYDANFPIIVKTLNPKNINNPWVTRGFKKSSKIKQKLYIKYLKTKS FT SANEKIYKEYKYLFEKVRKNLKKNYYSRLIDKFKNNSKRTWQIMNEISGRQ FT KKCSGSLPQTIVVDNKHICESRAIAHEFNKFFVDIGPKLAKKIPYTNATFR FT DFLVQMDNCTSSNELSSELSLQEFEKAFKTLKKNKSTGADDINGNIVIECF FT EYLKDILFKVYGASIRQGVFPEQLKIAKVTPILKEGDQTIISNYRPISVLS FT TFSKILERIMYNRLYNYLHSNNFLYNYQFGFKKNNSTEHAVIQFVREISNS FT FENCKYTLGVFIDLSKAFDTVDHEILLQKLKYYRINCKVIKWFKSYLSNRK FT QFVFSNNDHPNKFLNISCGVPQGSILGPLLFLIYINDLNKASNLMSIMFAD FT DTNLFFSNNDICKLFYTMNNELKSISKWFKCNKLTLNIKKTNWVLFHPVSK FT KAYLPQSLPKIFIDDVEIKRHNVTKFLGVFLDENITWKKHIDYIGTKISKN FT IGILYKTRTYLCKKSLTQLYYSLIHSYLNYANVAWGSTEKSKLKCLYRRQK FT HAIRLINFADRYTHSKPFFIEMKVLNIYELNVFNVLCFMYMWKNDIIFNDN FT ADKIYDIFETFYSVYDANFPIIVKTLNPKNINNPWVTRGFKKSSKIKQKLY FT IKYLKTKSSANEKIYKEYKYLFEKVRKNLKKNYYSRLIDKFKNNSKRTWQI FT MNEISGRQKKCSGSLPQTIVVDNKHICESRAIAHEFNKFFVDIGPKLAKKI FT PYTNATFKDFLVQMDNCTSSNELSSELXFHEFEKAFKTLKKNKSTGADDIN FT GNIVIECFEYLKDILFKVYGASIRQGVFPEQLKIAKVTPILKEGDQTIISN FT YRPISVLSTFSKILERIMYNRLYNYLHSNNFLYNYQFGFKKNNSTEHAXIQ FT FVREISNSFENCKYTLGVFVDLSKAFDTVDHEILLQKLKYYRINRKVIKWF FT KSYLSNRKQFVFGNNDHPNKFLNISCGVPQGSILGPLLFLIYINDLNKASN FT LMSIMFADDANLFFSNNDICKLFYTMNNELKSISKWFKCNKLTLNIKKTNW FT VLFHPVSKKAYLPQXLPKIFIDDVEIKRHXVTKFLGVFLDENITWKXHIDY FT IGAKISKNIGILYKTRTYLCKKSLTQLYYSLIHSYLNYANIAWGSTEKSKL FT KCLYRRQKHAIRLINFADRYTHSKPFFIEMKVLNIYELNVFNVSCFMYMWK FT NDISLPIFKDLFCSKPINKYTLRNNNFIHEPFCRTNFNQFCIAYRAPYLWN FT KIVLPNFNMHFTFPIFKYKLKNLILFMDDVVRFF*" XX SQ Sequence 5960 BP; 2379 A; 871 C; 775 G; 1922 T; 13 other; agcgtgttaa attattatac aaatatggaa atttcaatta aaaacataga aaagttaatt 60 accaacaaat tagaagaaca aaaaaaatct attctgaaag aaacagagaa gttattaaag 120 gaacaagaaa agagttttgc ttcgattatg agtgctaact taaaaatatt atcggagaga 180 attgaaaaac tagaagttta tgtaaacaat aacaaatgta acgtgacgaa tattcaaaaa 240 gatgtaagtg atattaaaac aactctaaat ttccaagaaa caaacttaat ggaaaaggtt 300 gtacaaatta aacagtgtca tgacaaggat ttaaatattc tacacaaaaa aacaatcgat 360 ttagaaaata gatctcgtcg aaataatcta cgaatagatg gagttgaaga aaaacctaac 420 gaaacttgga gcgactgtga aaatacagta aaaaatatct tcaaaaaaca actgaaaatt 480 gatggagaag tgattgttga aagagctcat cgtgttggca aatcaaggga tagtaaacta 540 ccaagaacaa tagttttgaa gcttttaaat tacaaagaca aaaacaaaat tttaaacgcc 600 gtgaaaaact tgcgtggaac aggtgtatat attaatgaag attttgcaaa agaaacgatt 660 gagattcgta aaaagctatg ggaagaagta aaaagattac gcagtgaagg caagtatgct 720 attataaaat atgatagaat tttttacagg gaatttagaa gcccaaacat taagttttaa 780 aaagaaactt ttctaaaaat aaatgcccct taaaaaaaaa aaaaaaaaaa aaagcatggc 840 tgggactagt gattttgaat ccataaaatt caacttttct gaaataaata atatgaaaat 900 aaaacccgat ataaataatt tacacgacac gcttatggat tgctcatatc actatccgag 960 cgaattagaa gggtttcttt ttaaaaatgt ttgcgataaa agttttagaa ttctccacgt 1020 aaacatacga agccttaata ataattttga aaaacttctc agtttgttag aggaaacaaa 1080 aractgtttt aatataattt gtttgactga aacgtggatt tcgaatgatt taaataattc 1140 ttcaaacttt tatattcctc atttcaaatt aatttcactc cataggcaag taaataaacg 1200 cggcggagga gttcttatat atataaacga aaacattgtt tattatgtta ggaatgattt 1260 aagtgtatct gatggcgata aagaaatttt aactattgaa attataaaca ataaatcaaa 1320 aaacatattg ttaagctgtt gttatcgacc acctgacggc gtgagtgaga acttgagcat 1380 tttttttgaa caaagtattt ttaaaaaagg tattaaagaa aaaaaaaaaa attttattat 1440 tggggaccta aatatgaatt gttttctata cagtaaagat aataaagtca aaaactttta 1500 cgattctttt ttcgaaactg gagcaatccc gttgattaat cgtccaacaa gagtaacaaa 1560 aaactcagcg tctttgattg ataatataat tacaacagat atatccgata atgatatcca 1620 aaccggtatc ctaaaatcgg atatatctga ccattttcca ttatttttgt cgcttaaatc 1680 aaactctgaa aaaataacca aatcaaataa aattaaaata cgcactttta acgaaaccaa 1740 tataaaacaa tttaaaagcc aattatcact cctacactgg aagcatatta actttaatga 1800 taatgcagac aaaatttacg ataaattttt tgaaacattt tattccgtgt atgacgctaa 1860 tttccctatc attgtaaaaa ctctaaatcc aaaaaatata aacaacccat gggtgaccag 1920 gggatttaaa aaatcatcaa aaataaaaca gaagctttat ataaaatatc ttaaaacaaa 1980 atcatctgca aatgaaaaaa tatataaaga atacaaatac ctatttgaaa aagttcgtaa 2040 aaacttgaaa aaaaattact actcaagact catagataaa tttaaaaata actcaaaacg 2100 cacttggcaa ataatgaatg aaattagtgg tagacaaaaa aaatgctcag gttccctccc 2160 ccaaacgatt gtagttgata acaaacatat atgcgaatca agagctatag ctcatgaatt 2220 taataaattc ttcgttgaca tcggtcccaa actagcaaaa aaaattcctt atacaaatgc 2280 tacatttaga gattttttag tccaaatgga taactgcact agttccaacg aattatcttc 2340 tgaactatct ttacaagagt ttgaaaaagc tttcaaaact cttaaaaaaa ataaatcaac 2400 cggagcagat gatataaacg gtaatatagt catagaatgt tttgaatacc taaaagatat 2460 cctatttaaa gtatatggag catctatacg ccaaggagtt tttccagaac aactaaaaat 2520 tgctaaagtt actccaattt taaaagaagg tgatcaaaca attatcagta attatcgccc 2580 tatctctgtc ctctccacat tttcaaaaat actagaacgt attatgtaca atagattata 2640 caattatctt cattctaata attttttata caactatcaa ttcggcttta aaaaaaataa 2700 ttccacagaa catgcagtta tccaatttgt tcgtgaaatc tctaattctt ttgaaaattg 2760 taaatataca ttaggtgttt tcatcgacct ttcgaaagca ttcgatactg tcgatcacga 2820 aattttacta caaaagctaa aatattatag aataaattgc aaagttataa aatggttcaa 2880 aagttatcta tccaaccgta aacaatttgt tttcagcaat aatgatcatc caaataaatt 2940 tttaaatatt tcatgtggcg ttccacaagg ttccattttg ggaccacttt tgttcttgat 3000 ttatataaat gatctaaata aagcctcaaa tttaatgagt atcatgtttg ctgatgatac 3060 taacttattc ttttccaata atgacatttg caaactcttt tatactatga ataatgaact 3120 taaaagtata tccaaatggt ttaaatgcaa caaactaact cttaatatta aaaaaacaaa 3180 ttgggttctt ttccacccag tctctaaaaa agcttattta ccccaaagtt tgcctaaaat 3240 ttttattgat gatgttgaaa taaaaagaca taatgtaaca aaatttctag gtgtttttct 3300 tgatgaaaac ataacatgga aaaagcatat agactatatt ggtacaaaaa tttctaaaaa 3360 tattggtatt ctatataaaa ccagaacata tttatgtaaa aaaagtctaa ctcaacttta 3420 ctactcatta attcatagtt atttaaatta tgctaatgtt gcatggggaa gcactgaaaa 3480 aagtaagtta aaatgccttt atcgccgtca gaaacatgcg atccgtttaa taaattttgc 3540 ggatcgatac actcactcca aacccttttt tattgaaatg aaagttctca atatttatga 3600 gcttaatgtt tttaatgttt tgtgctttat gtatatgtgg aaaaatgaca ttatctttaa 3660 tgataatgca gacaaaattt acgatatttt tgaaacattt tattccgtgt atgacgctaa 3720 tttccctatc attgtaaaaa ctctaaatcc aaaaaatata aacaacccat gggtgaccag 3780 gggatttaaa aaatcatcca aaataaaaca gaagctttat ataaaatatc ttaaaacaaa 3840 atcatctgca aatgaaaaaa tatataaaga atacaaatac ctatttgaaa aagttcgtaa 3900 aaacttgaaa aaaaattact actcaagact catagataaa tttaaaaata actcaaaacg 3960 cacttggcaa ataatgaatg aaattagtgg tagacaaaaa aaatgctcag gttccctccc 4020 ccaaacgatt gtagttgata acaaacatat atgcgaatca agagctatag ctcatgaatt 4080 taataaattc ttcgttgaca tcggtcccaa actagcaaaa aaaattcctt atacaaatgc 4140 tacatttaaa gattttttag tccaaatgga taactgcact agttccaacg aattatcttc 4200 tgaactawct tttcatgaat ttgaaaaagc tttcaaaact cttaaaaaaa ataaatcaac 4260 cggagcagat gatataaacg gtaatatagt catagaatgt tttgaatacc taaaagatat 4320 cctatttaaa gtatatggag catctatacg ccaaggagtt tttccagaac aactaaaaat 4380 tgctaaagtt actccaattt taaaagaagg tgatcaaaca attatcagta attatcgccc 4440 tatctctgtc ctctccacat tttcaaaaat actagaacgt attatgtaca atagattata 4500 caattatctt cattctaata attttttata caactatcaa ttyggtttta aaaaaaataa 4560 ttccacagaa catgcartta tccaatttgt acgtgaaatc tctaattctt ttgaaaattg 4620 taaatataca ttaggtgttt tcgtcgacct ttcgaargca ttcgatactg tcgatcatga 4680 aattttacta caaaarctaa aataytatag aataaatcgc aaagttataa aatggttcaa 4740 aagttattta tccaaccgta aacaatttgt tttcggcaat aatgatcatc caaataaatt 4800 tctaaatatt tcatgtggcg ttccmcaagg ttccattttg ggaccacttt tgttcttgat 4860 ttatataaat gatctaaata aagcctcaaa tttaatgagt atcatgtttg ctgatgacgc 4920 taacttattc ttttccaata atgacatttg caaactcttt tatactatga ataatgaact 4980 taaaagtata tccaaatggt ttaaatgcaa caaactaact cttaatatta aaaaaacaaa 5040 ttgggttctt ttccacccag tctctaaaaa agcttattta ccccaaartt tgcctaaaat 5100 ttttattgac gatgttgaaa taaaaagaca twatgtaaca aaatttytag gtgtttttct 5160 tgatgaaaac ataacatgga aaamgcatat agactatatt ggcgcaaaaa tttctaaaaa 5220 tattggaatt ctatataaaa ccagaacata tttatgtaaa aaaagtctaa ctcaacttta 5280 ctactcatta attcatagyt atttaaatta tgctaatatt gcatggggaa gcactgaaaa 5340 aagtaagtta aaatgccttt atcgccgtca gaaacatgcg atccgtttaa taaattttgc 5400 ggatcgatac actcattcca aacccttttt tattgaaatg aaagttctca atatttatga 5460 gcttaatgtt tttaatgttt cgtgctttat gtatatgtgg aaaaatgaca tatctttacc 5520 tattttcaaa gatctctttt gttcgaagcc aataaataaa tatacactta gaaacaataa 5580 ttttatacat gaaccttttt gtcgaacaaa ttttaatcag ttttgtattg catatcgtgc 5640 accttatctt tggaataaaa ttgttttgcc caattttaac atgcatttta cttttcccat 5700 ttttaaatat aaactgaaaa atttaatttt atttatggat gacgtagtca gattttttta 5760 actttatatt ttctttagaa ctttttaatt gtttattacg ttttatttgt tgaatatgtt 5820 ttgatgtagt ttatatacac acgtttgtaa aaggttctga tgataagatc agtacgatct 5880 tctttcagaa accttgtttg tgtttgtgaa agtaatttat ttactacgac aacaaatgta 5940 aacttaaaaa aaaaaaaaaa 5960 // ID RTEX-2_BF repbase; DNA; INV; 6811 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-2_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-3_BF; KW RTEX-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6811 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6811 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Direct Submission to Repbase Update (30-JUN-2009). XX DR [2] (Consensus) XX CC The complete RTEX-2_BF consensus sequence contains two ORFs. The CC RTEX-2_BF ORF1 protein contains the DnaJ (1-70 aa) and esterase CC domains (487-672 aa). The 3' terminus is composed of the (CATT)n CC microsatellite. The DnaJ domain is conserved in the ORF1 proteins CC encoded by several families of RTEX retrotransposons in the B. CC floridae genome. Therefore, DnaJ is necessary for proliferation CC of the RTEX retrotransposons. XX FH Key Location/Qualifiers FT CDS 234..2909 FT /product="RTEX-2_BF_1p" FT /note="DnaJ and esterase domains." FT /translation="MADNSEELYRTLGVASNASKEDITEAYKKEVANYENA FT SKLGRKEPLFKEADTRYREVSKAFVVLADAARRQQYDAGLPASNSTKTVKI FT SKERTDYQVQINKQSVTIYLPNHLTNPWLKTCEEHYRTLATVGDKEENGHQ FT IKTSFVDTRSGQPVGSVSIKFFETTQKLLIQGSAYLLWFAEDYPELKKTVT FT SLSPNNSNGNAHIQASMPDPDANSTPASPASVAVNLPPCPTCQEPVESDIC FT ACNTVPKALLDVNSNNVNGTESQTGPTGDHDEFNMANTSVTPDYSIVQESM FT TKLETCLANCITDKKLFEDSVMHKLSAIEARVKSCERPHTCDGFTTEEKER FT LSSEISRLDKEKRELELKVKSLEQRHERLSTAVRKAEATNLKRQTSTVETQ FT TSESNSDRAERLLHEATISVHNRFQVLDSNDGISHGSVKHSNNNESSCLPR FT RAHSQPNQSPRETRRTHCESNQPPCVTRDRKSSQHDTMANNQHIKARSNQQ FT NNDAQLDLLILGDSNTRPLKTDILYPNKRVTKELTFNLREAIDYIQTSTLP FT DPKTILFHVGTNDVRDARDPTTVSEGFRELVQITHDKYPQSHIVLSPILPR FT DDPNLQAIGDNVNAFLKVVADETSYVHIIDNSNFSYSGTTSIKKSLYNSDG FT YHLNRNGTRVLAANIKRTVNSLIGLGQYLSRGSQPVACDQFATSPKRTTPN FT RSYRDAVLGAPAGGPPEPSTRSCPPPHTQPARKQEHSSAPSQDRRSSNHTS FT SDDPVNGSSDRCPAPDRRPPPFGLHPPVGPWGPPPAGPWGPPPAGPWGPPP FT AGPWGPPPVGPWGPPPVGPWRAPPHLPPPSHQPGPWGMSPFFPSPSHLLGK FT SGRFPDSYPCSAQNSKQQFQPPAWLWQPGTPWRGPPSMW" FT CDS 3155..6682 FT /product="RTEX-2_BF_2p" FT /note="AP endonuclease and RT domains." FT /translation="MKVFPGGNKSFCCWNVHGLGSKLDDLDFLSHVNNFDF FT ISLIETWSSDKTKINIPNYSYFHLHREKNKRARRYSGGVIFFFKDQYKHYV FT KQLPSKSTDVLWVRVDRKLLGLSKDLFICSVYISPSTSKTHKFAENHVIEI FT LEEEISKYNTEGFILLGGDFNSRLGNLNDFVEEDTFHENIPNNPSNHNQDR FT QHMDKAPPNKFGRFLTDLCIQANLRVLNGRVAGDLQGNFTCHQPNGSSTVD FT YMIASQPLFEHISLFRVNPLTIFSDHNMLSVIIRTAPFIHNNKTKNNNPNT FT KPLPKRFRWNENSAQKFLESLSQPHIIDKLNAMSVKETKNAKADIENYVSD FT LNSVLRDVGHKSLFVRVNKKRSQKARIKKWYDKSCTEMKRELNNLASLLEK FT NPSNSYIRGKFFRRKKEYRKLLRKKKRDFQQQIMDQLSSLREKDPNGFWKL FT INKLKPDKTQQNSQISNEMWLEHFSAVDKRSTDAINNSETELSTPVNTQIH FT PQSPLTPVENSTLDSPITAEELNTAISNLKNNKSSSSDMVTNEMLKCGKQI FT LQNPLLHLFNSCLQNGYFPDDWSLSHIVPIHKSGDTSLPDNYRGISIISCL FT GKLFTSILNNRLVNYAEQYSLFKPQQAGFRKNFRTSDNLFVLTTLVSKYLS FT KNAKIYACFVDFSKAFDSVWRAGLFFKLNNLGIGGKFLKTLKNMYSKTTNC FT IKTNGGLTAPFLTNCGVRQGCNLSPTLFNLFISDITSEFDDICCNSPSLHN FT RKVPCLLYADDLVLFSESQQGLQSCLVRLERYCKHWNLKVNLKKTKIIVFT FT KGGRLPKNCFFLFNGNVIEIVMSYCYLGTVVSTAGTFKANNKHLRNKGLKA FT LFSIRQSLDKADCPLAVKNKLFDSCIKPILLYGSEIWGTLKSPKNCPIESV FT HLKHCKYSLNIPKTASNLAAQAELGRYPIHLEASLNTIKYFIRLSKNVPAD FT SLQADALLCQMDLEASGAKCWASDVRNTLEKCGYAFVWQNAQSTISITPQI FT ISSIYQRLKDIYFQTFLYEIHNDKRVATAKNKLRTYRLLKQYYKEEEYLKI FT ANVRHRSTITKLRISCHKLRIELGRHNHTPLEQRICQFCSLGKVEDEVHFT FT MDCPLYNSDRNILFDYVLSKYPHFRHLNSKEKFTFLTAFDKPTLTNHTASY FT IYAITKKREEVECHRID" XX SQ Sequence 6811 BP; 2196 A; 1614 C; 1297 G; 1704 T; 0 other; aagatggcgg cagagacagg acgcgcggtt acgcgctcct gctgaaaaat tgctttttgt 60 gtcctttcct agctgtgtat gcagtagtta attcaaaagt gtttgcaaaa gttcttactt 120 tttttatccc ttgccttgag ggtaattttt agatccctat tgtctagttt agtcgagttt 180 tagacaaatt cagacccaag cggaccgacc caagcggacc caagccggac gacatggcgg 240 acaacagcga ggaactctac cgcacactag gagtcgcgtc caacgccagt aaagaagata 300 tcaccgaggc ttacaagaag gaggtcgcta actacgaaaa cgcctccaaa cttggcagaa 360 aagagcctct cttcaaggaa gcggacacca gatatcggga ggtaagcaag gcttttgtcg 420 ttttggccga cgccgctaga agacaacaat atgacgctgg cctgcctgct tcgaattcta 480 caaagaccgt aaaaatcagc aaagaaagaa ctgactacca agtacagatt aataagcaaa 540 gcgtcaccat ctacctccct aatcacctta ccaatccgtg gctgaaaaca tgcgaagaac 600 actacaggac ccttgccaca gtcggagata aggaagaaaa tggccaccag atcaaaacat 660 catttgtaga cactaggtca ggccaacccg tgggatcagt cagtatcaaa ttcttcgaaa 720 ctactcagaa actactgatt caaggctccg cgtatctact gtggtttgcc gaagactatc 780 ccgagctgaa gaagacagta actagtctct ctcctaacaa ctccaatggc aatgctcaca 840 tacaagcgtc aatgccggac cccgacgcga actctacacc ggcgagccct gcatcagtcg 900 ctgtaaacct cccaccctgc cctacgtgtc aggagccagt agaatcggac atttgtgcgt 960 gtaacacggt gcctaaagct cttctggatg tcaacagcaa caatgtcaat ggcacggaaa 1020 gtcaaactgg ccctacaggt gaccatgatg aattcaacat ggcgaacacg agcgtcaccc 1080 ctgattactc tattgttcaa gagagtatga ctaagctcga aacgtgtctc gcaaactgta 1140 tcactgacaa gaaattgttc gaagactctg taatgcataa gctttctgct atcgaagcca 1200 gagtaaaatc atgtgagcgg ccccatacat gtgatggctt tactacggaa gagaaagaaa 1260 gactttcgag tgaaatttca cgcctggaca aagaaaagcg tgagcttgaa ctcaaggtca 1320 aatcattaga acagagacat gagaggctat ctacagcagt gcgcaaagct gaagcgacca 1380 acttgaaacg ccagacgagc actgttgaaa ctcaaacctc agagtcaaac tcagatcgcg 1440 ctgaaagact attacatgaa gcaactattt ctgttcataa taggtttcaa gttctcgact 1500 ctaacgatgg catttcacat ggaagcgtaa aacacagcaa taacaacgag tcttcgtgtt 1560 tgccccgccg tgcccattct caaccaaacc agtctccgcg tgaaacccgc cgtacccatt 1620 gtgaatcaaa ccagcctcct tgtgtgacaa gagacagaaa gtcttcacag catgacacaa 1680 tggccaacaa tcaacacatc aaagccaggt ccaatcagca aaacaatgac gcccagcttg 1740 atctcttaat cctgggtgat tccaacacaa ggccactaaa aactgacatt ctgtacccca 1800 acaaaagagt tacaaaggag ctaaccttca acctaagaga agccatcgac tacattcaaa 1860 cttcaaccct cccggacccc aaaaccatac tcttccatgt ggggacgaat gatgtacgag 1920 acgcccgtga tccaacaact gtatctgaag ggttccgtga acttgttcag atcacacatg 1980 acaagtaccc acagtctcac attgttctgt cccccattct cccaagggac gaccccaacc 2040 tccaagccat aggtgacaac gtgaacgcat tcctaaaagt ggtcgcagat gagacgagct 2100 atgtacatat cattgacaac tcgaacttct cctactcagg aactactagt atcaagaaat 2160 ccctgtacaa ttctgacggg taccacttga acagaaatgg tactcgtgtt cttgcggcca 2220 acatcaagag aaccgtcaac tctctcatag gtctgggcca gtacctgagt agagggagcc 2280 aacctgtagc ctgtgatcaa tttgcaacga gtcctaagcg aaccacgccg aatcgttcat 2340 acagggacgc tgtgctcggt gcaccggcag gtgggccccc agaaccttct acacgttcgt 2400 gcccaccccc acacacccag ccagcacgta aacaggaaca ctcttcagct ccaagccagg 2460 atcgccggtc atccaaccac acctcttcgg atgatccggt caatggatca tctgaccgct 2520 gccctgcacc cgaccgtcga ccgcctcctt tcggactcca tccgcccgtg ggaccgtggg 2580 gacctccgcc tgcgggaccg tggggacctc cgcccgcggg accatgggga cctccgcccg 2640 cgggaccgtg gggacctccg cccgtgggac catggggacc tccgccggtg gggccgtgga 2700 gagcaccgcc tcacctaccg cctcccagcc atcaacccgg accatggggg atgtctccgt 2760 tcttcccctc gcccagccac ctcttaggaa aaagtggacg ctttcccgac agctaccctt 2820 gttcggccca gaactcaaaa caacagttcc agcccccagc ctggttgtgg caacctggca 2880 cgccgtggag agggcctcct tccatgtggt agaggtagat atatatgcga ttagatagta 2940 ttctgtaata cattcgtata atgtatattt ttcactttgt acacataata ttacttttag 3000 cgatttagta ttaattagta actgctgagt aagaaatagg atttaaaaat aatgatgaca 3060 ataatgcata tctatagttg ttaactgttg tagattcggc taggataagg acacaatatt 3120 ttaatttttt acatttcttt gaaccgtctc tgatatgaag gtatttccag gaggaaataa 3180 gagtttctgt tgttggaacg tacacggttt aggatcaaaa ttagatgatt tagatttctt 3240 gtcccatgta aataatttcg actttatctc gcttattgag acgtggagct ctgacaaaac 3300 aaaaatcaac atccctaact acagttactt tcacttacat cgggaaaaga acaaacgagc 3360 aagacggtat tctggaggtg taatattttt ctttaaagac caatacaaac attatgttaa 3420 acagctacca tcaaaatcga cagatgtttt atgggtgcga gtagatcgaa agctactggg 3480 gttatccaaa gatctgttta tctgttccgt ttacatcagt ccaagtacat ctaaaactca 3540 caaatttgcc gaaaaccacg taatcgaaat tctagaagag gagatttcaa aatataatac 3600 cgagggcttt atccttctag gtggtgactt taactcccga cttggaaatt taaatgactt 3660 tgtagaagaa gacacctttc atgaaaacat accaaacaac ccctcaaatc ataatcagga 3720 cagacaacac atggataaag ctcctcctaa taaattcggc agatttctta ccgacttgtg 3780 tattcaagcg aaccttagag tccttaacgg tagagtagcc ggtgacttgc agggcaattt 3840 tacttgccac caacctaacg gaagtagtac cgtagactat atgattgcca gtcagccatt 3900 atttgaacac atatctctat ttcgcgtaaa ccccctaacc atattttccg accataacat 3960 gctatcagta ataattcgca cagcgccatt tatccacaac aacaagacaa aaaataataa 4020 tccaaacaca aaaccactcc cgaaacgatt ccgttggaac gaaaactcag cacaaaaatt 4080 tcttgaatcc cttagccaac ctcacataat agacaagtta aacgctatgt cagtaaaaga 4140 aactaaaaac gcaaaggcag atattgaaaa ctacgtttcc gacttgaact cagtgctgag 4200 ggacgtcggt cacaaatcac tctttgtaag agtcaacaaa aagcgttcac aaaaagcccg 4260 aattaaaaaa tggtacgata aaagctgtac tgaaatgaag cgcgaattga ataacttggc 4320 ctcgttactt gagaaaaatc cgtcaaactc atatattaga ggaaagttct tcagaagaaa 4380 gaaggagtat agaaaactat taagaaagaa aaagagagac tttcaacagc aaatcatgga 4440 ccaactttct tcccttaggg aaaaagaccc aaatggcttt tggaaactaa ttaacaaact 4500 taaaccagat aaaacgcaac aaaattccca aatatcaaat gagatgtggt tggaacattt 4560 tagtgcagtt gacaaacgct caactgacgc tattaacaat tctgaaactg aactctccac 4620 ccctgtaaac acccaaatac atccccaatc tcctctcaca cctgtagaaa actccacact 4680 agactccccg ataacagctg aagaattaaa cacggcaatc tcaaacctca agaacaataa 4740 atctagtagc agcgacatgg taacaaatga aatgttgaaa tgcggtaaac aaatcctaca 4800 gaacccatta ctacacctat ttaactcctg cttacagaac ggttattttc ccgacgattg 4860 gtccctcagc catatcgtac ccatacataa atctggcgat acctcactcc ctgataacta 4920 ccggggaata tcaatcatta gttgcttagg aaaattattc acctctattt taaacaatcg 4980 tcttgtcaat tatgcagaac agtacagcct tttcaaacct caacaggccg gttttaggaa 5040 aaattttaga acatctgata atttgtttgt gttaaccact ttagtcagca aatacctaag 5100 caaaaatgca aaaatttatg catgttttgt agatttcagc aaagcttttg actccgtttg 5160 gcgagcaggt ctatttttca aacttaacaa ccttggaata ggaggaaaat tcttaaagac 5220 attaaaaaac atgtactcaa aaactacaaa ttgcattaaa accaacggcg gccttacagc 5280 gcctttttta actaactgtg gggtccgaca aggctgtaat ttaagtccaa ctttgtttaa 5340 tttatttatt agcgatataa cctctgagtt tgatgatatt tgttgcaact ctcctagttt 5400 acacaatagg aaagtaccgt gtcttttgta tgcagatgat cttgtattgt tttccgagtc 5460 tcaacaaggc ttgcaatcat gtctagttag attagaaagg tattgtaaac attggaacct 5520 gaaggtgaat ttaaaaaaga ctaagattat tgtcttcacc aagggaggta gattaccaaa 5580 gaattgtttc tttttgttta atggtaatgt aattgaaatt gttatgtcat attgttattt 5640 aggtacagtg gtaagtacag ctggcacctt caaggccaac aacaaacatc taagaaacaa 5700 gggcctaaaa gctttgttta gtatcaggca atcacttgac aaagccgatt gcccactagc 5760 tgttaaaaac aagcttttcg actcttgtat taaacctata ttactttatg ggtccgaaat 5820 atggggaact ctcaaaagtc ccaaaaattg tcccattgaa tctgtacacc taaaacattg 5880 taaatattca ctcaatatcc ccaaaacagc tagtaatctc gcagcacagg cagaactagg 5940 gaggtacccc attcatctgg aggcctccct aaataccatc aagtatttca ttcgtcttag 6000 caagaatgtg ccagctgaca gccttcaagc agatgctctc ttgtgccaaa tggatttaga 6060 agcatcaggt gctaaatgtt gggcctctga tgtacgcaac actctagaga aatgtggcta 6120 tgctttcgtc tggcaaaacg cccagtcaac aatctcaatt acacctcaaa ttatttcatc 6180 aatctatcaa cgtttaaaag atatttattt ccagaccttt ctatatgaaa tccacaacga 6240 taaaagagtt gcgactgcaa aaaacaaact acgaacatac agactattaa aacaatatta 6300 taaagaagaa gaatatctga agattgccaa tgtacgccac agatctacca tcacaaaact 6360 aagaatcagt tgccacaaat tacgaatcga attaggaaga cataaccaca ctcctctaga 6420 gcaaagaata tgccaatttt gtagtttagg gaaagtagaa gatgaagtac attttactat 6480 ggattgccct ttgtataata gcgataggaa cattttattt gactatgtgt taagtaaata 6540 tccccacttt agacacttga acagtaaaga gaaatttaca ttcctgacag cttttgataa 6600 accaactcta accaaccata cagctagcta catttatgca attaccaaga agcgagagga 6660 agtcgaatgt catagaatag attagacgca catctgcata tatagatgtg tatgattgtt 6720 tccattgtta catatatgtg attcttaccg tgtgacctgt acttagccca gtggggcaaa 6780 aatgtgcaat aaaggtcttc attcattcat t 6811 // ID DNA8-1_AP repbase; DNA; INV; 847 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-1_AP. XX NM DNA8-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-847 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1743-1743 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. Includes an insert of a different transposon similar to CC DNA4-3_AP. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 847 BP; 270 A; 109 C; 142 G; 326 T; 0 other; cataggcgtg cgcacgggta gggctgattg ggctttagcc ctacctgaga tttttttata 60 cgtgggtacc ttatgaagac atgattaaaa gtaggtatag cccttttagt ttttaaacgt 120 ttgtgtttac ccgagagact atacagggcg atttatttaa aattgaacac tcattatttc 180 aaaaagtgta attgtttttg aaaatatttt tttacacagg tacctagttt cgagtcgttt 240 acaaaacaac tttgttaata aaaaaatttt attttttatt ttttttaaat ttttttttga 300 caacagagtt ttaatttcat attccaaagt agaatatttt tctaagtatt tcgatacata 360 aaaatcgaat ttagtgcgag tagtctatgc gttataagta tttaaagttt agatgagcgg 420 agttgagtgg tacgggcttg ccccgcgaaa tgtatgtcca ctactccgct catctaaatt 480 ttaatattta tatctcataa actaatagcc ctaaagtcga tttaccatga ataatttata 540 ataataaata aatacctatt aatgttaatt gacaattttt atatattata tattatattt 600 taaacgtttt tgaataaatt gttttattag attatatttg gttaaaattt aaatcaatat 660 tgatgaattt tgtatgtgtg aaataatgaa aactgattgt ttacattttc cagtaaccaa 720 caaaagggtc acttaaaagt ttgtagtgaa aatgtctata atttttttct tgtgtgacca 780 tgactagggg gtgggtgatc agattgaatt tagccatacc tgactcagag gtctgcgcac 840 gcctatg 847 // ID Copia-10_DPu-I repbase; DNA; INV; 5336 BP. XX AC scaffold_260; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_DPu_; KW Copia-10_DPu-LTR; Copia-10_DPu-I. XX NM Copia-10_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5336 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 683-683 (2010). XX DR Genome; scaffold_260; Positions 78678 73343. XX CC Positions [2596-3093] - Integrase core CC 'ATTTC' target site duplication CC LTRs are 92% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 2191..3387 FT /product="Copia-10_DPu-I_1p" FT /translation="MRGVLLVNGLGANLYSIGTATATGIEVLFTNDTVSFT FT RNGIALIEGRRAGKTLYHLNIQARNHTPKITTALRATKLLPLSIWHRRFGH FT VNNRTLLKMASLGCTNGLALFNDDISTFCEGCVMGKMQRLPFKLGHEKATV FT PGQRIHSDICGPIQVTTPSGNRYFATFKDDYSNYCTTNLLKKKSEVTTALQ FT NFVAKVKMQIHKDVKCIRSDNGGEYLNGELQKWLNDEGISHERSVPYTPQQ FT NGVAERTNRTLMEAARSMLYEKKVPLELWGEAVMCATHIQNRKISGTNDVT FT PFELWTGSRPDVSYLRVFGSPAFVHIPDETRRKLDPKAVECLLVGYCEHSK FT AFRMWNPVTRKIIISRDVIFREDATYESGPINQTDYDSLFPLDEVTVVNIS FT FWPIL" FT CDS 3536..5326 FT /product="Copia-10_DPu-I_2p" FT /translation="MEETMQEPEQIEVDDDPGPNQETDGREANIEDPFFGF FT HEERRRSTRIAKMKELKRTHECLAGESWKEITASQKVPKSYQEAMISEDAE FT SWEPAIQEEFLSLMENKTWELTPLPEGRETIENKWVFDIKPGYKDTPPRFK FT ARLVAKGFTQKYGVDYEETFAPVLKHSALRTVFGLIASLDLETVLLDVKTA FT FLYGELDEEIYMSQPEGFVVPGREMEVCKLLKSLYGLKQAPRAWNNRFNDF FT LIKFGLVRSDADPCIYYHHQGEEFTILCLFVDDGLICSNQAGSLSRIIKHL FT EEQFQIRTMDAERFLGVEISRNRKKKELTAAQPLFTLALLKKFRMENCHPK FT PVPADPHTHLSTEMSPKNEKEKGDMEKIPYREAVGSLLYLAMTTRPDIAFA FT VGQVSQFCQNPGNGHWNAVKRIFAYLAGTPYHGLHFKSSNHPAMKGYTDAD FT FARDQDTRRSTTGYIFLYHGGPVVWASRRQKCISLSTTEAEYVAGCETSKE FT AVWIDRLLKEIGQDQTLPIPLLCDNQSAIRLAKNPEFHQRTKHIQIKYHFI FT REQLKNGIIDLQYVSTEDQLADILTKPLETNRFQKLREKMGIIDVRGSSA" XX SQ Sequence 5336 BP; 1773 A; 1102 C; 1204 G; 1257 T; 0 other; ggttatgggc ccagaaaaac gtaaaactaa tctatcattt cttttcatcc agcctttttc 60 gttatggcga acaatactgc taaagatgtt ggacatatct ctaagtttga tggatcaaat 120 tttccctcat ggaaatatgg tgtctggatg cttcttgaga aaaaccgatt gatctcagta 180 gtggatggga gtgaaacaca acccgagcag gtataatata gagagtttat acctaaaagt 240 tttctctttt atagttgcat atgttgtaaa gaagagagcc agttaagcac taaagttttt 300 ttatatacac actcgtggat gccaagtgta cacagtggag tacagtggtt gtggtgaatc 360 ctctttcgta accactaaac tagagtcttg aacctctatg tattccactt atatcagtaa 420 gtgagacgta acagactgtg agaatgtcac tgaaatttac tgagagaaaa aaaaaatctt 480 gaatgtgaat gcatgccact tatattacta agtgagattt aacagactat gacaatagtt 540 gttataccac ctaaattatt gaatgaggat aaaataagat tctttaatgt atatacgttt 600 gccacttata ttactgagtg agagttaaaa gtgttaagag aataccactt aatggttaag 660 tgagagtaaa aacaagattc ttaatttggc acgtaaacaa ttcatgtgtc aaacaaccga 720 ggaaaagaaa accacttata taagtgagat ttgatacatt gtttgtataa agcctgtccg 780 ttaatagaag taaaataatg gagtaattga tcagtgtcat ggaagagcct ataaaatgtt 840 gggaaatatc tcatctacta tgctatcata cgattgtggt cagaatattg agctgatatt 900 ttttttgaat attttttgtc tagaatattg tagaaggagt cgttcggaat gccaatctaa 960 ttgatcaatg gaaacaaaag gatgtcgagg ctagaacgtt catatactcg acgatgaaac 1020 ccgaaaaaca gactacgctc caaggctgtg caacggcatt tgaaatgtgg agcaggattc 1080 tgacggaata tgcacaagtc tcagctgaaa gcgaaccatt gctgtggggt caattctaca 1140 gctataagtt tcaaccaggt acaaaacagt tagtgtagaa atcaattacc tggtacaaaa 1200 gaattttgtt gttgtcgttt tcagaccaaa caatcatgaa ctttattgct ggtatcgaac 1260 aaatcgctgc acaactgaga gatattggag cagctgtaga cgaaacgcaa attatagcaa 1320 aaatcctggt gtcactccca tcgagcctac aatactttct gccagcctgg gatagcacgt 1380 cacaagaatt gaaaactctt tcattcttga ctaaacgact ggttaaagag gaaatctcac 1440 aagctagaaa ccaggaaaag aaagtagagc ctacggatag tgccatgaaa tctgattcaa 1500 tggacagaac acagaaagct gaaggatcta cacaccatgc atacccggct ggaagctatc 1560 cgggtctcag aggtggccaa cgtggtagag gagcttatcg aggaagagga ggatatcatc 1620 aaagaggagg ttatcaccaa agaggaggat atcatccgta ctcctttggt aaccgagaac 1680 agaattcgac cagtttcagg ggaggacatg agcagccccc taccgaatgt ttccactgtg 1740 gagagcctgg gcatatcaag aggaaatgtc ggctctggtt aaaggtatta gagaaccaaa 1800 agaaacggaa ccaagctgat aaccaacaat cgtatagcta caaatcaagc accagcttct 1860 caactcgaaa gtaatagact acccaattgt aaattaactc acaagagaac tacaattttt 1920 tttcttttta aagggcaact gactggtttg ctgacagtgg ggctacccag catatgaccg 1980 atcaaaggaa ccttcttatt aactttgtac cagccgaacc agggcaatgg aatgtgtcag 2040 gaattggaga tactacattg cccgtgatag gacagggaga tgtacaaata acgtcggttg 2100 tgaacggaaa acacatggaa ggtactaaaa ttgacaatct gaaaactgaa caagccacgt 2160 tatgactaaa tcattttttg tgtaggattg atgagaggag ttctcttggt gaatggccta 2220 ggggctaact tatactcaat tggaacggca acggctactg gaatagaagt tctattcacc 2280 aacgatactg tctccttcac caggaatgga atagccctaa tagaaggaag acgggccgga 2340 aaaacgctgt accacctcaa cattcaggcc aggaaccaca cgcccaagat cacaacagcc 2400 ctcagagcaa caaaactctt accactttct atctggcaca gaaggtttgg ccatgtcaac 2460 aacagaaccc tattgaagat ggcatctttg ggctgcacca atggactcgc cctatttaat 2520 gacgatatct ccaccttctg tgaaggatgc gtcatgggaa aaatgcagcg gctcccattt 2580 aagcttggac atgaaaaagc tacagtgcca ggccaacgga ttcactcgga catttgtggt 2640 cccatccaag taaccactcc cagcggcaat cgctactttg caacctttaa agatgactac 2700 agcaactact gcacaactaa tctactaaag aagaagtcgg aggtgactac tgccctgcaa 2760 aacttcgtcg caaaggtgaa gatgcaaatt cacaaggacg ttaaatgcat ccgatcagac 2820 aacggaggag aatacctaaa tggtgaactg caaaaatggt tgaacgatga aggaataagc 2880 catgaaagaa gcgtgcctta cactccccaa caaaatgggg tcgcagaaag gacaaaccgg 2940 acccttatgg aagcagccag gagtatgctg tatgaaaaga aggtcccact agaactatgg 3000 ggagaagcag taatgtgtgc aacacacatc caaaacagga aaatctccgg gactaacgat 3060 gtgaccccat ttgaactctg gactggatcc agaccagatg tatcttacct cagagtattt 3120 ggatcgcctg cgtttgtcca tataccggat gaaacaagac gtaaattgga cccaaaagcc 3180 gtggaatgcc tgctagtggg atattgtgaa cactctaaag cctttcgcat gtggaaccca 3240 gtcacacgaa aaattattat cagccgtgat gttatcttca gagaagatgc cacatacgaa 3300 agtgggccga taaatcaaac tgattatgat tcattattcc ccttggatga agtaactgtg 3360 gtaaatattt ctttttggcc tattttatga acatggtgac ttcatgcact tttattttta 3420 ggaaccagaa gttgcacgag aaaaatcaaa agtcgatgtt aatgatcaaa aatcagcgaa 3480 gccacaacca actgtgatcg agaaaatagt cgaacaacaa actgaaaatt cgggaatgga 3540 ggaaacgatg caagaacctg aacaaattga agtagatgac gatccaggac cgaaccaaga 3600 gactgacggg cgagaggcta atattgaaga tccgtttttt ggattccatg aagaaaggcg 3660 gcgatcaact agaatagcca aaatgaagga actcaaacgg acccacgaat gcttggctgg 3720 cgaaagttgg aaggagataa cggcgtcaca aaaagtccca aaaagctacc aagaagcaat 3780 gatatctgaa gacgccgaaa gctgggagcc ggccatacaa gaagaattct tatcactgat 3840 ggaaaataag acgtgggaat taaccccact gccagaagga cgtgaaacca tcgagaacaa 3900 atgggttttt gatataaaac caggctacaa agacacgccg ccacgcttta aagctagact 3960 agtggccaaa gggtttaccc aaaaatatgg cgtagactat gaagagacgt tcgctcctgt 4020 tctaaaacac tccgcccttc gaactgtatt tggtcttata gcctccctgg accttgaaac 4080 tgttctgctg gatgtcaaga ctgcgttcct gtatggggaa ctagatgagg aaatttacat 4140 gtcacaaccg gaaggatttg tggtgcctgg ccgtgagatg gaagtatgca aactactcaa 4200 gagtctatat ggcctgaaac aggccccacg ggcctggaat aatcgattta acgatttttt 4260 gatcaagttt ggcttagtca ggagcgatgc agacccctgc atttactacc atcatcaagg 4320 ggaggaattt acgatcctgt gtctatttgt tgatgatggc ctaatttgca gtaaccaagc 4380 agggagtctc agtcgaatca tcaaacacct ggaagagcaa ttccagatcc gcaccatgga 4440 tgcagagcgg tttcttggag tcgaaattag ccgaaacagg aagaagaagg aactcacagc 4500 agcacaaccg ctgttcaccc tggcactact gaagaaattc aggatggaga actgccatcc 4560 aaaaccagta cccgcagacc cgcatacgca cctaagcacc gaaatgtcgc cgaaaaatga 4620 aaaggaaaaa ggagacatgg agaagattcc gtatcgggaa gctgtcgggt ccctcttata 4680 tctggccatg actacccgcc cagatatagc gtttgcagtg gggcaggtat cacagttctg 4740 ccagaaccca ggcaacgggc attggaatgc cgtaaaaaga atctttgctt atcttgcggg 4800 aacaccgtac catggccttc acttcaaaag tagcaaccat ccggcgatga aaggctacac 4860 tgacgccgac ttcgcaagag accaagacac caggcggtcg acaactggat atatcttcct 4920 ctaccatgga ggacctgtgg tgtgggccag ccgaagacag aaatgcatct cgttatcgac 4980 cacagaagcg gaatatgtag caggatgtga gacatcgaag gaagctgtgt ggattgaccg 5040 actgttaaag gagattggac aagaccaaac gctgccaata cctctactct gtgataatca 5100 gagtgcaatc cgcctagcca aaaacccgga gttccatcaa agaacaaagc atatacaaat 5160 caaataccac ttcattaggg aacaactcaa gaatggaata atcgacttgc agtacgtttc 5220 aacagaagac caacttgcag acatcctgac aaaacccctc gagactaacc gattccaaaa 5280 actgcgtgag aaaatgggaa taattgatgt cagaggatct tcagcttgag gggagg 5336 // ID MCMAR1 repbase; DNA; INV; 2004 BP. XX AC AJ437557; XX DT 10-SEP-2005 (Rel. 10.09, Created) DT 10-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Meloidogyne chitwoodi transposon Mcmar1-1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MCMAR1. XX OS Meloidogyne chitwoodi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-2004 RA Leroy H., Castagnone-Sereno P., Renault S., Augé-Gouillou C., RA Bigot Y. and Abad P.; RT "Characterization of Mcmar1, a mariner-like element with large RT inverted terminal repeats (ITRs) from the phytoparasitic nematode RT Meloidogyne chitwoodi."; RL Gene 304, 35-41 (2003). XX DR EMBL/GenBank/DDBJ; AJ437557; Positions 1 2004. XX FH Key Location/Qualifiers FT CDS 593..1612 FT /product="MCMAR1_1p" FT /translation="MDEKKRIRERLLHEFQLGHTAAEAARNIKKALGDNAL FT DESTARRWFTKFRTGDFSTDDGFRSGRPSTFETEPLRAAINENPATSTRKL FT AEELGSSKDTVWRNMKEMELSYRSGRTVPHDLNEQKRQKRVEICRTLLQRQ FT QTSPFLDQILTCDESWILYDNRASEKQWLAVGQDANATPKQLHPKKQLLSV FT WWCVHGIVYWELLPLNRTITSEVYCEQLHRVQQQLRRPPYTVWARKGILFQ FT QDGARPHVSAVTRKKIEDLGWDILEHSPYSPDLAPSDYYLFSPLKDFLRGK FT QFSNEEEICTALKNFFDSKGPEWYRKGIEKLPNLWERCIQCNGNYFYE" XX SQ Sequence 2004 BP; 650 A; 336 C; 372 G; 646 T; 0 other; tactagggtg tctcataagt tctgcacgtt ggtacgccat tttgcgtaca gtagtccaat 60 ttttataagt gatataccaa attaaagctc taactgtgca ctacaagata accgaagcat 120 ttttcccaaa attcatatct gccagagata tagcggatat ccgggatata ccaaacttcc 180 tttatccgga tatatgtcaa aaaaattttt ttttattttt ctaaatttat attaaaattc 240 tatggatttt tctgaataga atgaaaaaaa aatacacaaa aaattatcat tttgagctga 300 gttatgattt ttttaaaaat tcagtttttt ttctctaaaa aataatagat taacaatcgg 360 ttggtcagtt tagtgacttt tttgatcatc tgctcgcgtt ttttttattt gtgcgggttt 420 tttgtttttc aaaacttttt ttgttcattt ttgatcggct gcgcagcttt tttgatcgtc 480 tgcacgactt ttttcgtgtc tgagttgggg tttttctgtc tggacggctt ctttaaaagt 540 gtctggcttc aaatcaaatt catcagtatt ttcttcaaat cctttcaaat aaatggatga 600 aaaaaaaaga ataagggaaa gattgttgca tgaatttcaa cttggacata cagcagcaga 660 agcagcaaga aacatcaaaa aagcattagg tgacaacgct ctcgatgaaa gcacggctag 720 aagatggttt acgaaattca gaactggcga cttcagtaca gatgatggat ttcgttctgg 780 aaggccttca acgtttgaaa ctgaaccctt acgtgccgca attaatgaga atcccgcaac 840 cagcactaga aaattggcag aagaacttgg atcatcaaaa gatactgtat ggcggaatat 900 gaaggagatg gaattaagct accgatctgg ccgcactgtt ccgcacgatc ttaatgaaca 960 aaagagacaa aaacgtgtag aaatttgccg tacattgctt caacgacaac aaacctcgcc 1020 ttttcttgat caaattctga cttgtgatga aagttggatt ctgtatgata atcgcgcatc 1080 ggaaaagcaa tggcttgcag ttgggcaaga tgctaatgca actccaaagc agcttcatcc 1140 aaagaaacag ctattgagtg tttggtggtg cgttcatgga attgtctact gggagcttct 1200 tcctttaaat cgcactataa catcagaggt ttactgtgag caactacatc gtgtacaaca 1260 acaactacgt cgtcctccat atacggtttg ggcgagaaag ggcatactat tccaacaaga 1320 cggagctcgt ccacatgtgt ctgctgtaac acgaaagaag atagaagatc ttggatggga 1380 tattcttgaa catagtcctt actctccaga tctagcaccg tcagactatt atttgtttag 1440 tcctctgaaa gattttctac gtggaaaaca attttcaaat gaggaagaaa tttgcacagc 1500 actgaagaat ttttttgact caaaagggcc tgaatggtat cgcaaaggga ttgaaaagct 1560 tcctaacctt tgggaacgat gcattcaatg taatggaaat tatttctatg aataaagatg 1620 tactgtaaac atttttatta ataaattatt gttaatctat tattttttag agaaaaaaaa 1680 ctgaattttt aaaaaaatca taactcagct caaaatgata attttttgtg tatttttttt 1740 tcattctatt cagaaaaatc catagaattt taatataaat ttagaaaaat aaaaaaaaat 1800 ttttttgaca tatatccgga taaaggaagt ttggtatatc ccggatatcc gctatatctc 1860 tggcagatat gaattttggg aaaaatgctt cggttatctt gtagtgcaca gttagagctt 1920 taatttggta tatcacttat aaaaattgga ctactgtacg caaaatggcg taccaacgtg 1980 cagaacttat gagacaccct agta 2004 // ID BEL-163_AA-LTR repbase; DNA; INV; 425 BP. XX AC AAGE02018617; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-163_AA_; KW BEL-163_AA-I; BEL-163_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-425 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018617; Positions 23718 23294. XX SQ Sequence 425 BP; 122 A; 130 C; 90 G; 83 T; 0 other; tgttccgtct cccgccgaaa acacgtccga acatgttaag cgtcctgcca aacaagccaa 60 cgaaaactcc ccaccaatca gcgacgagca gcgatcggtg acatgtataa aagagcggtc 120 gctgccatgc cctgattatt caagtcccat ccgcgcgcga gtgcacatcg aagcagcagc 180 agcagtgaaa gcaaactttg taaaaaagag gaaagcagag aaataaacag tgttttttag 240 tgtagccgtc ggagttcttc attccagcca ccattccctc tgaggccagg agcagcgtca 300 aaccccagct ctttttagag ctccaaagag ttgagaaaat tttcacaatc gacgcttgcc 360 tcctccgctc aattccaacc tcaagtacag tccaccgccg ccaaccgttc gccccagcgc 420 gaaca 425 // ID Copia17-NVi_I repbase; DNA; INV; 7603 BP. XX AC AAZX01010587; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia17-NV; KW Copia17-NVi_LTR; internal portion; Copia17-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7603 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1155-1155 (2007). XX DR Genome; AAZX01010587; Positions 5697 13299. XX CC Positions [4942-5349] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 4942..7455 FT /product="Copia17-NV_I_1p" FT /translation="MIIRERLTYPIRQKTEVSKCLVEFIRSVRNVLGSDEK FT FCYLRCDRGTEFTGNAMIDVLDKFGAELQLACPDTPEDNGVAERFNRTIQS FT KIRAWMFDSGLPSTMWDLAVKASVYVYNRTPHKSYEMKIPINLLNSKIDSN FT LRQIKRFGCVAYVRATRNEGTTFSVKGIRSFIVGYIPTGYILYTPEEKRLY FT ESRHVKCIEEVVYKDYFDKRESDEELFKIDIEDEVDCQQVEQERKSQENMS FT QVDVIDVNKLVSSEEVESNDKVKETKRKRGRPKKLTKQSVMFCSEANENTE FT QTETRDFALHALLAKIQGDPQTYREAVSSKDSEHWFKSIQVELDLIKDKEV FT YEIIERPKGRIRMGKPNVIDSRWVFKRKIDKQEKTKFKSRLVIRGFKDKNA FT YELSETYAPVSRMSLIRSFLVIANKYDLKLKQLDVETAFLYGDLSEEILME FT IPEGIMVSKEYRKKFVWKLKKALYGLKISPKKWNNKFTAVMNELRFISSDM FT DPCLFFRQSNYGVMIVVLYVDDILLAGNNMTELTEFSSKLSRKFKIKDLGK FT PKEFLGIKIERCEEKKIIKLNQTKFIDNMLKRFGFDKIRSVNTPIASTQIT FT NKDRKTRETEYIVSEAVNNRLYREAVGSLLYLAGVTRPDISFAVNVLSRHQ FT VAPTKYEWKMVQRVFQYLIGTRHYSLIYSGKENYMKAYSDASLSDCGGSLT FT TCGYAIQLYGDTVDWKTHKQQSLALSTCQAEYVAMSEACQAILALHNSVCF FT ILRNDMYPIDLFCDNMAAQACANTDGGNRLRHMVERKEHYIKECVKRGYVK FT ITWVKSADQLADIFTKALSFPVHEKLSLLLLNMIYN" XX SQ Sequence 7603 BP; 2805 A; 1022 C; 1901 G; 1875 T; 0 other; ggttatggtc ccagcccgca tccacgacgc gaataaacaa ttttaatagt gtgaaagtgt 60 tcgtgagtgc gtactttaag aggaagagag aaaacgcagc ttcgagcggc attgaggaag 120 gtcgtgagat tccttaggag actacggcgc ttaagtcaat aatggggaca ggtaggtgat 180 aattaatgtg agtgtacata caaaagacgc gagtaaaatc ttaggaaagt tagagacgcg 240 gaagtgacgt ttacataatg cacgaaaaga cgcggagata aagaaaacta aaatcccgga 300 atcgagagta aagtaaatct gacgtgtaaa attaatgacg tgtgtgaaat agtgtaaaat 360 aaattcgttt atcaggaaac gctgaacaaa agacatttga cgacgtgagg gttaaacgga 420 aagttaggaa aatttataaa aaaaaaaaga cgcaagctga aatcgtgatt aaacgtgcat 480 gcgcttcttg gcacaaagca agtatgctct gaattccaag aaccggtcaa atcggcaaga 540 tggctgacga gtggcgcaat tttgagcggg aaatcgtgcg cgagaaagaa aggaagattt 600 tctgatgaag atgttcgccg tgaagaagca gagtagcgca aggaaggggg cgtggcttag 660 ctgcgcgcgc taagagcggc aaggatggga tgcatgagag agagagagag agagagagag 720 aaagagattt tgtaccggca gagttgaagt gcgtgtgtgt gtgagagaga gagagagaga 780 aagaacggta ctttaccggc agagttgaag cgcgtgcatg cgagagagag ggaaaaagcg 840 gtacagcgcg gggacgcgag agcgtatgcc atgcacgaga aagagaaaga gaaagagaga 900 gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga 960 gagcatacat gagtgaacat gagagtatgc atgagcgtga gactgtaaat ttgaaaaaca 1020 gccgatcgtc ttttacgcta agcgaagtgc tgcggttgcc acttgactcg acgccgacga 1080 atgtccgcag gttctagtgc cgtgggcacg tgctcacgcg cgtatggccg ataggcgcta 1140 ttgccaggcg tagtgcgtga tgattgtctg cggccgtagt atacaataag ctatctgtgg 1200 aattaatgct atgttgaatt tacagagact gaggagggaa tcgaagagat agtgagaaga 1260 gtatgtaagg cgatgaatgc tttacatata ttggctgagg agagagtgta cctaggaaaa 1320 ggaaaagatg gaaaactggt attttccacg aagaattcgc tgttcaaaat aggtaaggaa 1380 gtgtactgat ggattaaaag taaagaagtg tatttagtga tgatttaaga gggcgtgtcc 1440 tagtaaggag atgagaaagt gcgcactgac agaggacggg aaaccgaggt tactaggttg 1500 agggaattta ttcagatggg ataattctta acttactatt aaaattatgc atgcatggga 1560 tgtgctaaag agtagagtaa gttaggatcc tcgtaaaatc agattaggat atgaagtatg 1620 ggacagtaat ttattaagaa acgcatgagt tatagaaagt aaaataaaat aagaattgaa 1680 ttctaataaa taaataaaat gaaatgtatg acatgtatga agtgtgattg aacatgagat 1740 attttaatga tgaatatatg actagataag atgtgatgca tgcaaagatg agatatatga 1800 gatggtatat gaaatgaaca acgaattccg ggatgagtta tgttatgtga tgctatatga 1860 tgagttgaaa tgatcagaat aaatgaggtg cttatcatat tcaaattaaa ctaatgctta 1920 tttgtgctat tttaggtcaa ttcttgtcag aagatgactg tcagttcctg aggaaaatat 1980 ggcctccagg ggagaaagtc atgcttgtgt cggagcccag aatatccgaa agaatggtga 2040 cagaggagag gctgaaggcg gctatgcagg aaatagaaaa gaccgtagaa acgaaggtga 2100 agactacact ggagtggatc actgaacaac agaaggaatt ctttgatatg caagcaaaac 2160 tcaacaaaga gttaatcgaa agcacgcttt caaaaatgac agaaggcata cagaagcaga 2220 aagagatcga agctccaaag caagacaagg ttgagaatga tacaaaagtc agatcgtggc 2280 taaggtgggc aacgactaaa accaccccag ggaccaaagc tgctccaaaa ataaaaaaca 2340 tagaagacgt ggatattgaa ctgaagatca ggcagggtgt aaaggaatac gaagaaagga 2400 acagactagg agcggatcta aggctaaagc ccaccattga cgaaagtcta ctggaagaag 2460 aaaactttgt agttccaatc cagaagaccg ggaccgagga agagggaatt gggagaaagc 2520 atctctgctc tccgacacct tcattgttta ggacgacaga gaaatcagga tgaggaattg 2580 gaagaaggag aagtcaatga agctcaagac tgggagggta aacaaatcat cccaggtaag 2640 attgtaatga caacagggaa atcacccaaa actcagatgg atctgatcga ttgacatgag 2700 gccgtcacca gaagggcaaa caagaccaca agtaaatatg atatgaatca tataggaata 2760 tctgatggag taaatggggt aagcaataat aatctaaaca tagttaaaat tatagaagat 2820 aaaattaaag aatcgttagc taaggataat aagggaggaa atggcggaat aaacccaact 2880 aatgacagta acaagaataa aatcataatt gaggtttcta gcaaagagag taatgtgatg 2940 cgaagttata agttaactac tgggatgaag tttgagattt ttgaagatta tttgatgtct 3000 gaggttaaaa ccaaaaggtt agactatgtg tttgataaaa agaaatgtga ggatgtgaat 3060 gaacctaaat taattgagaa taagcataga gttcgagata tcataatcaa tcatattgac 3120 atagaatatt acactaaaat gattgatatt aaagaaacta ttgaaatggt ggaaaaacta 3180 aaggaattaa aaaggtttga aactagaatc atgcggttaa ctgcaagaaa cgatttaaac 3240 aacttaagat atgatgtaaa taagcataaa gcttctgaat tttgggattt gtttgaggaa 3300 aaagtgaaag tatatgaaaa tacgcctaat gctgataaat taccggaaaa aagataaaaa 3360 gaatctgttt atacaagcaa tcaaagatgc ggtgcccaac gtaactgtga cggattgtct 3420 acttgaggaa tcgactgggc gagaaatgac gtacagtcaa ttaaaaaaat ttattttgca 3480 ggaggaagct ttgagaccgc agaggcaacc acctaagtcc gcgttcaacg ccggatttcg 3540 aggaggcaag agaggagagc aaattcagag aagagggaag gatgatgtca ggtattattc 3600 gtgcggcaag aatggacaca tcagcagaga ttgtccgacc ccaggagagg tggtatgtta 3660 caattgtaac aaggttggaa atcacatctc tagccagcta gcaaaagggc gaaattggaa 3720 cctgctcaca accaaggacg tggaagacca gactggacgg gccaaggagg ccacaaccaa 3780 ggatccacga gaggtggtgg cgcaggttcg attaaccgag gacgtggagg aaacagagga 3840 cgaggaggca gaagtgctag aggcagaggt acctcagccc atttcggcta catggaaaac 3900 ccagagcagg atcagtatta ccaaggtaaa ttaaattgta atttaattga tagaaatttg 3960 aagtataata aaattactaa atttatagcg gattctggtg caacagagca tctgagtaaa 4020 tcgaaattaa tttttgagag attgagtgaa aatgaagtga atgaaattaa ttgtgcgaat 4080 aagaataatt atctgagtac tcagggaaaa ggttatgtta gaattaggac agaaaatggt 4140 aaggagttaa taatgtcaga tgtattgtat tctaatgagt tgtcggaaaa cttattgtca 4200 ctaaagaaat ttgttgatca aggactcgaa gtctatttaa acaataaagt aatcaatata 4260 tttgatccga aatcccaaga gaatttatga ccggaaagta tgaaaatcca ttttggctaa 4320 ttaaattaaa gattgcaccc aaaggcaaga acgagattga agtgtctaaa gcttttgctt 4380 taataagctc attagaacag aatcatccat acaataccag gagtaaggga aatgaattaa 4440 gtcaggtaga agagagggtt gaatctaaaa atgaggagaa tatggaaaaa ttagagacat 4500 atgaaaacac agaaaattca agtctaggag atagaatatc tgatttatta catactgcta 4560 cagatagaaa ggttcatgta atggaggatt gcaatggctc taatgataat cttgagattg 4620 attataaaag tagatcatgt gaatcaatta ataaaattga caaagccatg atttggcatt 4680 taagactagg acatgtatcg gctatgtaca tcagaaaact ggcagaacag tttcctgaaa 4740 tacttgatat aaaaggagta aatgtagaag aaagtgttat taaatgtgaa gtgtgtctaa 4800 taactaaaag ttgcaaactt cctttcaata aaattaggac tagagcatcg aagccattgc 4860 aaattgtcca tgcagacacg atgggaccta ttagtccagt ttcatttcca aaaggctata 4920 agtttgttgt ggttttcata gatgattatt cgagaacgac taacatatcc tattagacaa 4980 aaaactgagg ttagcaaatg tctagtagaa tttattagga gtgtcagaaa cgtgttaggt 5040 tcagatgaga agttttgcta tttgaggtgt gatagaggga cggaattcac aggaaatgca 5100 atgatagatg ttctggataa gtttggagct gaactgcaat tggcatgccc agacacacca 5160 gaagataatg gtgtggctga gcgtttcaat aggactattc agagcaagat aagagcgtgg 5220 atgtttgact caggtttacc ttctacgatg tgggatttgg cagtgaaagc gagcgtatat 5280 gtgtataatc gaacgccgca taagtcttat gaaatgaaaa ttccgataaa tttgttaaac 5340 tcgaaaattg atagcaattt aagacagata aaaaggtttg gatgtgtagc ttatgtaagg 5400 gcaactagaa atgagggcac tacgtttagc gtcaaaggaa tcaggagttt tatagtgggt 5460 tatattccaa cagggtatat tttatataca cctgaagaga aaaggttata tgagagtaga 5520 catgttaaat gcatagagga ggtagtgtac aaagattatt ttgataaaag agagtctgac 5580 gaagagttat tcaaaataga tatagaggat gaggttgatt gtcaacaagt agagcaggag 5640 aggaaaagtc aagaaaatat gagtcaggtt gatgttatag atgtaaacaa actagttagc 5700 tcagaagaag ttgagtccaa tgataaagtg aaagaaacta aaagaaagag aggtagaccg 5760 aaaaaattga ccaagcagtc agtgatgttt tgttcagagg cgaatgaaaa tactgaacaa 5820 acagaaacaa gggattttgc acttcatgca ctattagcta aaatccaggg tgatccacag 5880 acatataggg aagcagtaag ctccaaagat agtgaacact ggttcaaatc tatacaggta 5940 gaactagatt taattaaaga taaggaagtc tatgaaataa tagaaagacc gaaaggaaga 6000 atacgtatgg gaaagcccaa tgtaattgat tcacgatggg tcttcaaaag gaaaattgat 6060 aagcaagaaa aaactaaatt caagagtagg ttagtcatta ggggattcaa ggataagaat 6120 gcgtatgagt tgagtgaaac gtatgctccg gtatctagga tgtccttgat tagatctttt 6180 cttgtgattg caaataaata tgatttgaaa ttgaagcagc tagatgtaga aactgcattt 6240 ttatatggag atttgtctga ggaaatactc atggaaatac cagaagggat tatggttagc 6300 aaagaataca gaaaaaagtt tgtttggaaa ttgaaaaagg ctttgtatgg actcaaaata 6360 agtccaaaga aatggaataa taaatttaca gctgtcatga atgaattacg ttttatttct 6420 agtgacatgg acccttgttt atttttcaga caatctaatt acggggtgat gattgtagtt 6480 ttatatgtag acgatatttt gttggcaggc aacaatatga cagaactgac tgagtttagc 6540 tcaaagttaa gtaggaagtt taaaattaag gacttgggta aaccaaaaga gtttttgggg 6600 attaaaattg agagatgtga agaaaagaaa ataattaaat taaatcaaac gaagtttata 6660 gataatatgt taaaaagatt tggctttgat aaaattagat ctgtcaatac tccgatagca 6720 tcaacacaaa tcacgaataa agataggaaa actcgagaga ctgagtatat tgtatctgag 6780 gctgttaata acaggttgta cagagaagct gtaggttctt tgttgtattt ggctggtgtg 6840 acaaggccag acatatcgtt tgctgtgaat gtcttgagca gacatcaggt agctccaaca 6900 aagtatgaat ggaaaatggt acaaagagtg ttccaatatt tgattggaac acgccattac 6960 tcattgatat attcaggaaa agaaaattac atgaaagctt attcagatgc gagcttgtcc 7020 gattgtggag gttctctgac gacttgtggc tatgccatac aattgtacgg agacacagtg 7080 gactggaaga cacacaaaca gcaaagcttg gccttgtcga catgtcaggc tgagtatgta 7140 gccatgagcg aagcatgcca agctatttta gctctgcata attcagtttg ttttatacta 7200 aggaacgaca tgtatccaat agatctattt tgtgataata tggcagcaca agcatgcgct 7260 aacacagatg gtggtaatag gctgaggcat atggttgaaa ggaaagagca ttatattaaa 7320 gagtgtgtga agcgaggtta cgttaagata acatgggtta aatctgctga tcagttagca 7380 gatatattta ccaaagccct atcctttcct gtgcatgaaa agttatcctt attactacta 7440 aatatgattt ataactaata tatgcctcat gttacagaag aaccctggga gccacagact 7500 ggagatgagg aggaagaata agtctacgcc agaggcactt ggtccagagt tccaaataat 7560 agtcttatca agacataagg acataacgcg tatggagaga gag 7603 // ID L2B-3B_AAe repbase; DNA; INV; 4205 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-3B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4205 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1407-1407 (2011). XX DR [2] (Consensus) XX CC ~87% identical to consensus. The consensus is ~79% identical to CC L2B-3_AAe. XX FH Key Location/Qualifiers FT CDS 124..1317 FT /product="L2B-3B_AAe_1p" FT /translation="MVGKCDNILVAINEIKNRLEKIEAKFERNGCDEAVKQ FT CEQNVKLVVEESAKLHGEQLKNLETQIANSACSPAMGKGPXDGDSYPSAGX FT FVEVVRRKKRVXEXDSVLRSGRVRNKSVATPKSNENRNMSARAQQVSNXKV FT SEVNENGSNSNKKFGCTVRVKPXATQSNHQTKKEVRSIVNPAQVGIKSVRN FT GLNGSIIVECDNENEAEGLAKIIEDKLSEGYSADIEQPKRPRIKILGVGGN FT YNSNELINILRDQNDIEDVQYLNVLKCVPSKKNPENKFSLICEIDAITFER FT VXRKGKLNIDFERCRVLESIDLFRCFKCCGYGHKSSECKNNLHCAKCAERH FT DVKDCSSDQEFCVNCIHSNRERKTQFDVNHSSWSVDCPIYLRKITISRSYI FT NYDA" FT CDS 1321..4122 FT /product="L2B-3B_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSKRATDVVLLNIAGITSHFAELEMLVSRKKPKIXML FT TETHLTSDIGISEYSIRNYKMLCCFSVSRHTGGVIMYIHESVKYHVVDNST FT CGLNWFVAIKVVKGLKAGVYGLLYHSPSGNEQEFLVHLEQNWLEKVXDDKG FT MNLIAGDFNINWKNSSDSRNLRSVMECFDLDQKVKDTTRCTIRSQTIIDLV FT FCNDDQLNVTVDHDNKISDHETIXIQFNECSQPVENHITIKCWKKYSKSAL FT IXLLRNXMPNXSERQLDEKADVLRTVLKENINKLVVVRNIKCSSRKKWYTL FT ELKTLQEARDVAYKKASASWDEADWQRYKVLRNEYTYSIRTAKAEFTQRKI FT EQNRGNSKQLWKTLKSLWKSKEKPASRISFNGVDVDDDQDICEKFNSYFVD FT SVQQINESIDDVYDCFGNDEGHVTNSWNVFHRVSYXTLIKAIAKIGSSSGI FT DNVNLQVLKDSLEVTGEYLLGIINDSLEQGKFPSSWKQSTVVPIPKVSGTT FT KSEEYRPINMLPIYEKVLEIIVKDQLLEYLNEHKIIINEQSGFRQNHSCES FT ALNLLLYKWKRMIEEKKTIVVLFLDLKRAFETISRPEMXKTLSKYGMGGNV FT LKWFESYLXDRTQVCQYGNRISSPKSVPLGVPQGSVLGPILFILYINDMKK FT AIXHCDINLFADDTVIFIAEKDEKIAIRKIRGDIKSLNKWLKIKKLKLNVQ FT KTKSMVISNRKQLNYSELKIRIEGDEVERVDVFKYLGVLIDQKLTFKAHID FT NVVKKVAKKYGXLIRLNSQLTFWSKIFLYKTLVAPHIDYCSSVLFLASETH FT LNRLQRLQNKXMRXILNCDRYTPILNMLEALQWLSVKERIIFNVLTIIFKL FT TNELLPEYLTNIIIRGRNIHNHRTRRSDDLRVVPFTMTSTQKSIYYNGIRI FT FNELPVEVRNARCVSEFKRNCATWIKNKYR" XX SQ Sequence 4205 BP; 1513 A; 594 C; 909 G; 1154 T; 35 other; ggtcaggtgc tacatgcaaa atgtgccgac ttaactagtg cgggggaaac ggctttgcgc 60 gagaatttat caatcaagta cttgtgccat gattgtagga aaaagcaagt gagtcttaat 120 gacatggtgg gcaaatgcga taatattctc gttgcgataa atgagattaa aaatcgtttg 180 gagaaaattg aagcaaagtt tgaacgaaac ggttgtgatg aagcagtgaa acaatgtgaa 240 caaaatgtaa agttagtggt ggaagagtcc gcaaagttgc atggtgagca attgaagaac 300 ttggagacac aaattgcgaa ttcagcatgc agtcctgcta tgggcaaagg cccaatsgac 360 ggcgatagtt atccatctgc tggcascttt gttgaagttg tgaggagaaa gaagcgtgtg 420 cawgaaamtg attccgtttt gcgctctgga cgtgttagaa ataagagtgt tgctactcca 480 aaatcaaatg agaacagaaa tatgagtgcg agggcacagc aagttagtaa tgsaaaagtg 540 agtgaagtga atgagaatgg aagtaattcc aataagaaat tcggctgtac agtgcgggtc 600 aaaccaawtg cgacgcagtc caatcatcaa acgaaaaaag aagtgagaag tatagtcaat 660 ccagcccaag ttggaataaa aagtgtacga aatggtttga atgggtcaat aattgttgaa 720 tgtgataatg aaaatgaagc cgaagggctt gcaaaaatca tcgaagataa attaagcgaa 780 ggctactctg cagatattga acaaccaaag agaccaagaa taaaaatact tggagtggga 840 ggcaattaca attcaaatga attgattaac attttgcggg atcagaatga cattgaagat 900 gtccagtacc taaatgtgtt gaaatgtgtt ccatcgaaga aaaacccaga gaataagttc 960 tcactaatat gcgaaattga cgcgatcaca tttgagaggg tgatkcgwaa aggtaagctc 1020 aatattgatt ttgaaagatg ccgtgttttg gaaagtattg atttgtttcg atgtttcaaa 1080 tgttgcggat atggacataa gtcaagcgaa tgtaagaata acctccattg tgctaagtgt 1140 gctgagaggc atgatgttaa ggattgttca tctgatcaag aattttgcgt taattgtatc 1200 cattccaaca gagaaagaaa aacccaattt gatgtcaatc actcatcstg gagtgtggat 1260 tgtccaattt atttgagaaa aataacaatt tcaaggagct acataaatta tgatgcatag 1320 caatcaaaaa gagcgacgga tgtcgttctt ttaaatattg ctggcattac gtcgcatttc 1380 gctgaattgg aaatgttggt gagtagaaaa aaacccaaaa ttmttatgtt gacggaaact 1440 catttgactt cagatattgg aataagtgag tacagcataa gaaattacaa aatgttatgt 1500 tgcttctctg tatccaggca cactggtggt gtgataatgt atatccatga atcngtgaaa 1560 tatcatgttg ttgacaattc aacttgtgga ctaaattggt tcgttgccat aaaagtagtc 1620 aaagggctta aagctggagt gtacggcttg ttgtatcatt caccaagtgg aaatgaacaa 1680 gagtttttgg tacatttgga acaaaactgg cttgagaaag tasttgatga caaaggaatg 1740 aaccttattg ctggtgattt caacatcaat tggaaaaaca gcagcgatag tagaaacctt 1800 cgcagtgtaa tggagtgttt tgatctagac caaaaagtaa aagacacgac aagatgtact 1860 ataagatcac agactattat agacttagtt ttctgtaacg acgatcagct gaatgttact 1920 gttgatcacg acaataagat ttcagatcat gaaacaatwg maattcagtt caacgaatgt 1980 tcmcaaccag tagagaacca catcactatt aaatgttgga agaagtattc aaaaagtgca 2040 cttattkcac tgctgagaaa twgtatgcca aatmacagcg aaagacaatt agacgaaaaa 2100 gctgatgttt tgagaactgt gttgaaggag aacataaaca aactkgtagt tgtaagaaac 2160 ataaaatgta gtagcaggaa aaaatggtac acattagaat tgaaaaccct acaagaagca 2220 agagatgtcg cgtacaagaa agctagtgca agttgggatg aagctgattg gcaamgatac 2280 aaagttttga gaaatgaata tacatattct attagaacag caaaagcaga attcacgcag 2340 agaaaaattg agcaaaatag aggaaacagc aaacaactct ggaaaacatt gaagtcactg 2400 tggaagagta aagagaaacc ggcaagtaga attagcttca atggagttga tgttgatgac 2460 gatcaagata tttgtgaaaa gtttaacagc tatttcgttg acagcgttca acaaattaat 2520 gagagcatag atgatgttta cgactgcttt ggaaacgatg aaggtcatgt cacaaacagc 2580 tggaacgttt ttcatcgagt ctcgtatgam acattgataa aagctattgc caaaattggt 2640 agttcatcgg gcatcgataa tgtaaattta caggtcctta aagattcttt agaagtcacc 2700 ggagagtatt tacttggaat aattaatgat tctcttgagc agggcaaatt ccctagtagt 2760 tggaagcaat cmacggtggt tccgatacca aaagtatccg gwacaacgaa atctgaggag 2820 tataggccga ttaacatgtt acctatttac gagaaagtgt tagagatcat tgtcaaagac 2880 caattgctgg agtacttgaa cgagcataaa attataataa atgaacaatc aggatttagg 2940 caaaaccatt catgtgagtc tgcnttaaac ttattgttgt acaaatggaa gcgaatgatc 3000 gaagaaaaga aaactattgt tgttctgttc ttagacctta agcgtgcatt tgaaactata 3060 tcgcggccgg aaatgwtaaa gactttgagt aaatatggta tgggaggaaa tgtcctcaaa 3120 tggtttgagt cktatttakc tgaccgcaca caagtgtgtc aatacggaaa tcgtatttct 3180 tcgccaaaat cagtgccgct tggagttcca cagggtagcg ttttaggacc gattctattt 3240 attttatata tcaatgacat gaagaaagcc attmagcatt gtgatattaa tttattcgca 3300 gacgatactg tcatttttat tgcagagaaa gatgaaaaga tagcaattag gaaaattaga 3360 ggtgatataa aatcgttgaa caagtggttg aaaataaaaa agctcaaatt gaacgtccaa 3420 aaaactaaat cgatggtaat aagtaacagg aagcaattaa attattcaga attaaaaatt 3480 cgtatcgaag gagatgaagt tgagagagtt gatgttttta aataccttgg ggtcctaatt 3540 gaccaaaaat tgacattcaa agcgcatata gataacgtag taaaaaaagt agcaaaaaag 3600 tatggtwtgc tgatccgttt gaacagtcaa ctsacgtttt ggagcaaaat atttttgtat 3660 aaaacmttag tggccccaca tattgactat tgctcttcag tgctattctt ggcaagtgaa 3720 acgcacctaa atagattgca aagactacaa aataaawtaa tgagatwcat tttgaactgt 3780 gacaggtata caccaatatt aaacatgtta gaggcgcttc aatggctttc tgtgaaagag 3840 cgtatwattt tcaatgtgct gactattatt ttcaaactga ccaatgaact cttgccggaa 3900 tacttgacga atattatcat acgaggacga aatattcata accatagaac tagacgaagt 3960 gatgatttac gtgttgtgcc gtttactatg acaagtactc aaaaatcgat ttattataat 4020 ggaataagaa tttttaatga attaccagtt gaagttagaa atgcaagatg cgtctcagaa 4080 tttaaaagaa attgtgcaac gtggattaaa aataagtata gataakgaaa atcgtgaatt 4140 tgtatgtatg attactgtat attattatca aaagataaat aaatggatta ttattattat 4200 tatta 4205 // ID DNA8-37_AP repbase; DNA; INV; 404 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-37_AP. XX NM DNA8-37_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-404 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1967-1967 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 404 BP; 114 A; 98 C; 76 G; 116 T; 0 other; cataggcgtg cgcagccctc ttgttcaggg tatgcagaag cagtggcgga tccaggagga 60 gggggggcaa aagggacatt ctccccccaa tcgccttagt ttcccattgt ttacagtgtt 120 tagccaattt tgtacttttt cctccccccc caaattaagc actggattac cacaggcacc 180 ccctgaaagt tattttaaga atgtaaaatt ctgataaaat atatacacaa tacacaaata 240 gtttcgagtt taaacataat aatatgtaca ctcctgggcc gcactataat tagtaattat 300 ttttcttgcg cacaaaaatt ttttttctaa aaacgatttt tgccgctcaa agagcttcag 360 ggtatgcagc tgcataccct gcataccccc tgcgcacgcc tatg 404 // ID Gypsy2-LTR_Dmoj repbase; DNA; INV; 274 BP. XX AC scaffold_6680; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_Dmoj; KW Gypsy2-I_Dmoj; Gypsy2-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1035-1035 (2009). XX DR Genome; scaffold_6680; Positions 4261564 4261837. XX SQ Sequence 274 BP; 98 A; 40 C; 57 G; 79 T; 0 other; tgtagcaggc tactagccta tgtatactgt agctggttac tagcctatgt tatattagaa 60 ttaagttaac agcattatgg ttaaccatcg atagatacac atctatcgat agttaaccat 120 atcggtggct gaggaggctg tcgatatgca acatgccatt ttgagtaata gaaagaggag 180 gatcgataca ttttaagctt gaatgtgaac aatagcagac gtgtgtatat caagaaatta 240 tgaaataaag aaattaaata aaaggcctac taca 274 // ID Gypsy-37_CQ-LTR repbase; DNA; INV; 319 BP. XX AC AAWU01034069; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_CQ_; KW Gypsy-37_CQ-I; Gypsy-37_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-319 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 454-454 (2011). XX DR Genome; AAWU01034069; Positions 16022 15704. XX SQ Sequence 319 BP; 98 A; 74 C; 72 G; 75 T; 0 other; tggagtagtt accaaaccat tcacgcggtt tcagctagaa accgcgaaca tcgaataatc 60 acaacgtcag catgcgccgt gtgcagcggc acatcatatt acactgtaac cagaggacga 120 cgacgacgtc tggagtgacg acgtctggcg aggattccag gaaagtggcg caacaacata 180 gtttacgtag tgtctttcag attcaataaa gagagggaaa ttaattattc ccggtcagtc 240 tgtgtagagc catcaaacca tcaatttaat ttttttaaaa agtaaccttc gaaagcccgg 300 cggctaactc acgtttcca 319 // ID Copia-2_DPu-LTR repbase; DNA; INV; 288 BP. XX AC scaffold_41; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_DPu_; KW Copia-2_DPu-LTR; Copia-2_DPu-I. XX NM Copia-2_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-288 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 668-668 (2010). XX DR Genome; scaffold_41; Positions 662468 662755. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 288 BP; 74 A; 59 C; 65 G; 90 T; 0 other; tgttgaagta acattcgaca acctgatgcg gagtggttgc gtaaaccgag gttgctaggc 60 aaccatttgg ctgacacagt ttttcgtctg cacccagcaa ccggtagcgt ggttcttcct 120 ggtcgacgac gtgacgcagg tctcacgtag caaattgtag ttccttctaa caatttgtat 180 ttcattatat cgtgtcaatt aattttgtga tccactgtgt gggtaagtta aacttgtact 240 acctgtggta aatctagatc atgcaaataa aactgatgtg tctcaaca 288 // ID BEL-1-LTR_HM repbase; DNA; INV; 306 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-306 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2074-2074 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 306 BP; 122 A; 33 C; 49 G; 102 T; 0 other; tgattggaag aaaattgaat tagcataaaa gaaaaacaac attccgatac cacgatcgat 60 ataaccagta aatagcgcca atatggctat aacgaactaa ataacaatta tgtggcgact 120 atttgaacaa tagtgttttt ttgttattga acgaaatttt tgtataaata gagttttgtt 180 gttgttgttt ttcagataca aaaatataaa gaatataagt aatatagtta atgtaaaaac 240 ggttaagatc ttaaaaatat aagttataat acatattcgg tattatttgt gatcgccatc 300 ggatca 306 // ID Gypsy-13_CQ-LTR repbase; DNA; INV; 1491 BP. XX AC AAWU01032992; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_CQ_; KW Gypsy-13_CQ-I; Gypsy-13_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1491 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 406-406 (2011). XX DR Genome; AAWU01032992; Positions 1502 12. XX SQ Sequence 1491 BP; 373 A; 359 C; 362 G; 397 T; 0 other; tgtaacctgc ctgcggcagt ttaccatcgc aacagtagaa gtctgcgaat aataagaaga 60 gtgccctaag ccatcccaat cagggcaaaa tttgatcttt tgtcttctag aatttttata 120 aaaccactac acttacaaca caaacaagaa gccaagaacc atcatcatca tcgattacgg 180 cttcccagca gccgatcggt gcggattacg cgctgactct cgcgagtgcg tgatctccca 240 gctaaccgta cacacataag gcacgtatgc actagatatt gggcaatttc aaaaccgaag 300 gccgatcgtt gaatgattac cctagcacaa aacacaacat gcacacccac cgattatgtt 360 tatgctcatc atcacatcaa cataccggac cgacatcaac catgccagca tcaaactaat 420 ttcccgttgg ggaaatcaag tcgagcagcc accctggcga cgccccggta ctcactctcc 480 aatctcttat cggccgacgc caacgagagc cgtgagtgat aaagcgaggc gcgatcggtc 540 aggcaatata gccgatcgtc gcccgcggat cgtcatcagt cactggttta tcgtaaagga 600 gaccgcgtcg atcgtcaacc cgatatattt tccgccagac gatcgacctg aagaagaagt 660 ttagttcaag gaatttctgt ggtgaaaacc gtctcaccag tgtaattttt tgcggattgt 720 gtgatgtgag cctaatttta atcgtaatct tgattgtttc gcctaggagg aagaacaagg 780 cgtggaataa taaccgcact ggagtgcgag gtgccgcgac accgtaaatt ggtcgtggtg 840 gagaggtaga gtggcgaccc tccctcgtgt gcttaattgg tgttaatttg agtctttttt 900 gtactttttg tgtgctagtg aggtggcgtc tacgttcagc atggatagca tggccatgct 960 ggagtagtta ggagtacttg agatcggtaa ggagaagatc taaccgccat tctaaaataa 1020 gtcccaaaaa atgtgcttcg tccctgaaat gtgtgctaat ttctattgtc tgccattgtg 1080 caggccttca tttcctgttt cgcaagctaa attccatcat ggctggagcc gtggtgtgga 1140 gcggtgcgct gctgtgcatt cctgccgact ggacggcggg gcaaatgcac cggatggccg 1200 gcgttcttgc cggtgaccaa gctgggttct gcccgggcag caacgttgtt ccggatcgtc 1260 gagggataca cgggggtcac acacacacac acaggtttac acccgcaact ctagttttaa 1320 ggacgccaac taggtttagt tcgccaatat agatgtaagt acgctaaaat cccctctttt 1380 atttctgtgt agcttgaatt tatccttgtc ttggagggct ggttgattat ctgggattct 1440 gttgaattcg acccattttt attgtttgct gtgttctatt gcatctgtgc a 1491 // ID CR1-58_HM repbase; DNA; INV; 4671 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-58_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4671 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1886-1886 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 138..1541 FT /product="CR1-58_HM_1p" FT /translation="MAPRTVSEIFVYLNSDLNTFECNFLKNLEEICSHPKD FT IIIHNLNSMKENVIRSLRERLFFEVLDSFSSEQFREINVTLDSENPMEKNL FT RKRYKTTKCLEDIFIFSITLCENQLHKEIXKTLISCKNIETAHLDDKFLTS FT SLKDLLQISKSIQNENIEIKVELGLLKEKISIQNELIQDLIVSNDNLLFLK FT NQPRQNINSSLKHDKNTCSKSIAEDTQNVNNSCGKLFKSIPKVSNANTENL FT KSDYYAQCENKLFNYHTLNENKNSNVTANAAKDNFQQPCGKTNASIYTESQ FT PHNIKTYAKVAAQKQQTSDKTNQFKEKPSNIDDNPNSFNLVGKNNKPVRQT FT NILPLEKKHKLSEPIFGTKTPGNETIAGSRIIRRFEVFIGGVSNRINEEHM FT KLYMETELGVTPISITLNRENEFNRSYRVSVSNSEKDKIFNPSLWDNNIIV FT KPFRKKRIYSNVDQNLVSNGGQ*" FT CDS 1558..4650 FT /product="CR1-58_HM_2p" FT /translation="MDPSTIREXNKSSTTFDICTFNCQGLKSNIDFTKSLI FT QSYDITFLCEHWISKLEYSVIRDISNNTHSIYFHQSNKHEKGRPFGGNAFF FT IRKNKFQNIRVIYEDDHILAINFEKNDLNIIVIGLYLSSSRNSLSSLEEYK FT GQLDTVKGLIDSYEGNGNIILAGDFQSFPHQMYDLFERSNSKRNSYSVHLS FT EFLKTNQLELVDVTKGSGPNYTYRHQTLPNSSYIDHFAVSKYTSFINLKCI FT VHPECSNNLSDHLPLSISVEVKYTECNTIELAEDTNIPSYAWNDSNFINSY FT NVHLTNRFNNLNFTNDYEYDLLQIYKKITDSAQIAFKETLKNKKQCLYSKS FT WWTPELSRSKTILSIHFKKWRDSGFSKELNSVSFNRYQMARKNFRKAVKSA FT QNNKICAQYIKIEKLKNTKPKNFWNAFRCLNKDVNSRLFTINNKKDKESIT FT SEFANHFERVLNTPKITHRKENHRRIPPCTKHDDVTITEENIIGAISNLKL FT KKSPDSFGISAEHLKYAKCDTLTKWLIKFFNYSINYGSTPISMSTSTIVPL FT VKSYKKSLDSPSNYRGISIIPIFTKVLEYLILIICPDIRDTHSLQFGFNAN FT SSTLHAEFLISETIKHYNSNHSPVYLCSLDAEKAFDSCNWNILFEKLYFDK FT KLPLCIVNTISSLYNNSSATVSYQGCKSNSFPLNQGVRQGSILSPHLYNIY FT TESLLETIKNQGIVGTSINGNFTGVVAYADDIILLSSTLSGLKKLIKTCNI FT YSNLNCIKLNAEKTEFLISGKAQIQTNTITISGNKKNLQNKLRHLGFTWDT FT KHSVFASLGYSNVDEKISNFRAVVQTLIQSGIRFAHPSSISYIYKSLAVPT FT LTYGMELCENNPGLMKKLDIIGRSALKSFLNVSKHSKNYMNSLFKIQEISK FT SVQQNKLNLFLRLLNNETTADIIFSQLQHPLFQHSFVSDIQELCHLQILNF FT QNLIQNKKKVKIPLSKSKIPSEVERNLKYAIECWNAKEQRGVFKQILEENV FT FKRKSSEITFLFFSFYLVNWVLNK*" XX SQ Sequence 4671 BP; 1816 A; 790 C; 660 G; 1403 T; 2 other; tattgtttga gtaagacgtg tttaaagaga agaaaaaaat taataaaaat atttatcaaa 60 aactatttat gaatatttat tataatgaag attaatgtta aagtggaact aattatctaa 120 taacacttca atacgacatg gcaccacgca ccgttagtga gatatttgta tatctgaact 180 cagatttgaa cacattcgaa tgcaactttt tgaagaattt agaagaaata tgttcgcacc 240 caaaagatat tataatacac aatttaaact caatgaaaga aaatgtaata agaagtctcc 300 gtgaacgttt gttttttgaa gttttggata gcttcagtag tgaacaattc agagagatta 360 acgtaacgtt agactcagaa aatccaatgg aaaaaaatct tcgtaaacgg tacaaaacaa 420 ccaaatgttt agaagatata tttatcttct cgataactct atgcgaaaat cagttgcaca 480 aagaaattkt aaagacatta atctcttgca agaatataga aactgcgcac ttagacgata 540 aatttctgac ttcatcatta aaagatttat tacaaatttc aaaatcaatc caaaacgaaa 600 atatagaaat aaaagtggaa ttaggtttgt taaaagaaaa aataagtata caaaatgagc 660 ttattcaaga cctaatagta tctaacgata accttttatt tcttaaaaat caaccgaggc 720 aaaacataaa tagtagttta aaacacgata aaaacacgtg ctcaaaatct atcgctgaag 780 atacacaaaa cgttaataac tcttgcggta agttatttaa aagtatcccc aaagtatcaa 840 atgcaaatac ggaaaaccta aaatccgatt attatgccca atgcgaaaat aagcttttta 900 attaccacac actaaacgaa aataaaaatt ctaatgtcac cgcaaatgct gcaaaagata 960 attttcaaca accttgtggc aaaacgaatg caagcattta tacagaaagc caaccgcata 1020 atatcaaaac atatgcaaag gttgcggctc aaaaacaaca gacctcagac aaaactaacc 1080 aatttaaaga aaaaccttca aatattgatg ataatccgaa tagttttaat ttagttggaa 1140 aaaataataa acctgtccgc caaacaaata ttctaccctt agaaaaaaaa cataagttat 1200 cggagcctat atttggaacg aaaactccgg gaaatgaaac tattgctggt agtcgaataa 1260 ttcgtagatt tgaagttttt ataggtggcg ttagcaatcg aatcaacgaa gaacatatga 1320 aattgtatat ggaaactgaa ttaggcgtta cgcctatatc aattacttta aacagagaaa 1380 acgaattcaa cagatcgtat agagtctccg taagtaattc agaaaaagat aaaattttca 1440 acccttcatt atgggataat aatatcatcg ttaaaccatt ccgtaaaaag cgcatatatt 1500 ctaacgtgga tcaaaatctg gtatccaacg ggggccaata acaaatatta attgaaaatg 1560 gatccaagca ccataagaga araaaacaaa tcttcaacaa ctttcgatat atgtactttc 1620 aactgtcagg gacttaagtc aaacattgat tttacaaagt cacttataca atcctatgat 1680 ataacttttt tatgtgaaca ttggatatct aaacttgaat actccgtaat aagggatata 1740 tctaataaca cacactccat ttatttccat cagtctaata aacatgaaaa aggccgaccg 1800 tttggcggaa atgcgttttt tatacgaaaa aataaattcc aaaatataag agtaatttac 1860 gaggacgacc atattttagc aataaatttc gaaaaaaacg atcttaatat tatagttatt 1920 gggttatacc tctcctcatc gcggaatagc ctctcgtctc tggaagaata taaaggtcaa 1980 ctagatactg ttaaaggtct aattgatagt tacgaaggta acggcaatat aatattagca 2040 ggtgacttcc aatcctttcc acaccaaatg tatgatttat ttgaaagatc aaattctaaa 2100 agaaatagtt attctgtgca cttatcggaa tttttaaaaa caaatcaatt agaactagta 2160 gatgtaacta aaggttctgg ccctaattat acatatcgac accagacact accgaattcg 2220 tcatatattg atcacttcgc cgtatcaaaa tatacaagtt ttataaatct taaatgcatt 2280 gttcatcctg aatgctcaaa taatttgagc gaccacctac cactatcaat atcagttgaa 2340 gttaagtaca ccgaatgcaa taccattgaa ctagcagaag atactaatat tcctagctac 2400 gcttggaacg attctaactt tataaattct tacaacgttc atttaaccaa tcgctttaac 2460 aatcttaatt ttactaacga ttatgaatac gaccttctcc aaatctataa aaaaattact 2520 gacagcgctc aaatagcttt taaggagact ttaaaaaata aaaagcaatg tttgtattca 2580 aaatcttggt ggacacctga attaagtcgg tctaaaacta ttctttcaat tcattttaaa 2640 aaatggagag attctggttt ttctaaagag ttaaactcgg tatcatttaa ccgttaccaa 2700 atggctcgta aaaactttcg taaagccgtc aagtcggctc aaaataataa aatttgcgca 2760 caatatatta aaattgaaaa gttaaaaaat actaaaccaa aaaacttttg gaatgctttt 2820 agatgtttaa acaaagacgt aaattctaga ttatttacca taaataataa aaaagacaaa 2880 gaatctatta cttcagaatt cgcaaaccac ttcgaaagag tactgaatac tccaaaaata 2940 acacatcgta aagaaaatca tcggagaatt cccccgtgta caaaacatga tgatgtaaca 3000 ataactgaag aaaatataat tggggctatt tcaaatttaa aattaaaaaa atctcctgac 3060 tctttcggta tctcagcaga acatttgaaa tacgcaaagt gcgatacttt aacaaaatgg 3120 ctcatcaaat tttttaatta ctccattaat tatggatcga ctccaatatc aatgtcgacg 3180 tcaactattg taccccttgt taaatcgtac aaaaagtctt tagatagccc gagcaactat 3240 cgtggtatta gtattatacc aatttttaca aaggttctag aatacctgat cttaattata 3300 tgtccggata ttagagatac tcactcccta caatttggat tcaatgctaa ctcttcaacg 3360 ctacatgctg aatttctaat aagcgaaaca attaaacatt acaacagcaa ccattcccct 3420 gtgtatctat gttctttaga tgcagaaaaa gcttttgata gctgtaactg gaatatccta 3480 tttgaaaaat tatatttcga taaaaaatta ccactatgta tagttaacac tatctcctcg 3540 ttgtataaca atagtagcgc aacagtctca tatcaaggtt gtaaatcaaa ctcttttcct 3600 ctaaaccaag gagtaagaca aggatcaatt ctttcgcctc acctctataa tatttacact 3660 gaaagtcttc tcgaaactat taaaaatcag ggtattgttg gcacatctat aaacggtaac 3720 tttactggtg tggtagccta tgcggatgat ataatacttt taagctctac cttatctggt 3780 ctcaaaaagc ttataaaaac atgcaatatt tacagtaatc taaattgtat taaactgaat 3840 gccgaaaaga ctgagttctt gatctcaggt aaagcccaaa tacaaactaa tacaataaca 3900 atttctggta ataaaaaaaa tcttcagaat aaacttagac atctgggatt cacatgggat 3960 actaaacatt cagtttttgc ttcactcggt tattcaaatg ttgatgaaaa aatatcaaac 4020 tttagagctg ttgttcaaac tcttattcaa tctggaatcc ggtttgccca ccccagttct 4080 atttcctata tatataaatc attggcagtc cccaccttaa cttatggtat ggagctatgt 4140 gaaaataatc ctggtctgat gaaaaagcta gatataattg gaaggagtgc gctaaaatcg 4200 tttttaaatg tttcaaaaca tagtaaaaac tatatgaatt cgttatttaa aatccaggaa 4260 atttcaaaaa gtgttcaaca aaataagtta aacttgttcc ttcgacttct taataacgaa 4320 acaactgcag acattatctt ttcacaactg caacacccgt tatttcaaca ttcgtttgtt 4380 agtgatattc aggaactatg ccatcttcag atattaaatt ttcaaaactt aattcaaaat 4440 aaaaagaagg taaaaatacc actttctaaa agtaaaattc cttccgaagt tgaacgaaac 4500 ttgaagtatg caatagagtg ctggaatgcg aaagagcaaa gaggagtttt caagcaaatt 4560 ctggaagaaa acgtttttaa gcgaaagtca tccgaaataa ctttcttatt tttttctttt 4620 taccttgtaa attgggtgtt aaataaataa aataataata ataataatta t 4671 // ID RTE-5_PPac repbase; DNA; INV; 2904 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 2) XX DE A family of RTE non-LTR retrotransposons: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-5_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2904 RA Jurka J.; RT "RTE non-LTR retrotransposons from nematodes."; RL Repbase Reports 10(7), 1064-1064 (2010). XX DR [1] (Consensus) XX CC ~98% identical to consensus. CC This sequence was derived from sequence data generated by Genome CC Sequencing Center at Washington University School of Medicine in CC St. Louis. XX FH Key Location/Qualifiers FT CDS 114..2864 FT /product="RTE-5_PPac_1p" FT /translation="MPPSMGRHFYIGTINARTLGPKDKQTEMELALDKIKW FT DVIAVQEARIVGCASFNLTSSGTVVYHSGGPTASHGVAFLLRPHLARGAVF FT RGLSPRLATLHLPDQRLFLVNAYAPTSSYDDDAYDAFIDQVETALRSAPRG FT TMPVLVGDFNCRVAREPGNERFVGNSASQSPNSRGRTFTEALVRNKLRAWN FT TFPKRRHGRTWTWRSNDGVTYHQIDFLAAPPSARVVNCGVVGRFEFNSDHR FT LVRMCLSLSGKVRQKRCREKLDFDRASFTVNASLLASLPLASPTSATDAYC FT NIKAFTDAAAANCWRKRHTPPWISRATRNLLALRHQLQANSQGPVAYAVAC FT KSARMSLAEDIRKRKEAQARQAALMGRSIVKEILKLQSTKKRLLVPDPASG FT ALSQSATKAAVKDFYEDLYSPAVQIPLAVPPHSLDPFPPFLPDEARHAMSL FT LKCGHSPGSDGILPEMLYHSRDHLAHSIAHLLNRLVAGDTVPCELSEAVVS FT LLFKKGDPTNIANFRPISLLTVTLKVTTRCILKRFEAVLEETESATQTGFR FT RGFSTLDNLHAIKQVAERTSEYGIPIYLAFVDFKKAFDCVEWSACWNSLWK FT YGAHPTLIHLLRRIYESSTTLIRVNEELVPVTVKRGVRQGDTLSPRLFNVA FT LRSAMDTIDWEEDGIRIDGRNLSHLEYADDVALVAKTRPELERMLRKLMDA FT CRRVGLEVNATKTHLLTSCKTTRAPITIQNLTFNFVDSTTYLGGRISLPLD FT HTDEIEHRIRLGWLAWSKLSHLLSSRLLPMKTRRRLFESCITSTVLYGSEV FT WALRSSDKERLSITQRKMERKMLGVALRDRWRNERVREITKLRDWNREALR FT RKARWALKVRSMQMEQWTRATTFWTPYNRKRPPGKPRARWRDDLDRAIGNW FT WNTPHEDFAPILI" XX SQ Sequence 2904 BP; 673 A; 899 C; 740 G; 592 T; 0 other; cattctgggt tctcaagtga gtggcgagcg gttgggtgcg attttcgtcc cgctgtctcc 60 gattccctca gcccggattc cctctcccag cagcgttggc acgcgggagt tgtatgccac 120 catcgatggg acgccacttc tacatcggca ccatcaatgc ccggacgctc ggccccaagg 180 acaagcagac cgagatggag ctcgccctcg acaagatcaa gtgggacgtg attgcagtgc 240 aggaagccag gattgtgggt tgcgcctcat tcaatcttac atcctcaggt actgtagtct 300 accactcggg cggcccaact gcgtcccatg gcgtggcatt tctcctccgc ccgcatctcg 360 cgcgaggagc cgtgttccga ggcctctctc cccgtctggc cactctgcac cttcccgacc 420 aacggctctt cctggtcaac gcgtatgccc ccacctcctc ctacgacgac gacgcatacg 480 atgccttcat agaccaggtt gagaccgccc tgcggagtgc gccgagaggc accatgccag 540 ttctggtagg ggacttcaac tgtagagtag caagggaacc tggcaatgaa agatttgtcg 600 gtaattccgc ctctcagtcc cctaactctc gtgggcggac tttcacggaa gccctcgtga 660 ggaacaagct gcgcgcatgg aacactttcc ccaagagaag acacggccgt acctggacct 720 ggagatcaaa cgacggcgtc acttaccacc agatcgactt tctcgctgcc cctccatcag 780 cacgagttgt caactgtggt gtcgtgggtc gcttcgagtt caactccgac caccgcctcg 840 tccgtatgtg cctgtccctc tctggcaagg tgaggcagaa aagatgcagg gagaagttag 900 atttcgatcg ggcttctttt actgtcaacg catcccttct cgcgtcactg ccccttgcca 960 gccctacctc cgctaccgac gcctactgca acatcaaggc cttcacggat gctgcggctg 1020 ccaactgctg gagaaagcgc cacaccccac cttggatctc gcgtgcaacc cggaacctcc 1080 ttgcgctacg tcatcaattg caagccaatt ctcaaggacc cgtcgcctac gcagtcgcat 1140 gcaaatccgc ccggatgagc ttagctgagg acatcaggaa gagaaaggaa gcgcaggcca 1200 gacaagcagc gctgatggga agaagcatcg tgaaggagat tctgaaactg cagtccacca 1260 agaaacggct cctcgtccct gatcctgcct cgggagcact ctctcagtcc gcaacaaagg 1320 cagcagtcaa ggatttctac gaggacctct actcgccagc agttcaaatc cctctcgcag 1380 ttcctcctca ctcgctcgac cccttccctc ccttcctccc cgacgaagca cgacacgcaa 1440 tgtccctcct caagtgtggc cattcccccg gatcggatgg cattctaccc gagatgctct 1500 accactcgag agaccatcta gcccattcca tcgcccattt gctgaatcga ctagttgccg 1560 gagataccgt accctgcgag ctgtcggaag ccgtcgtctc cctactgttc aagaaaggag 1620 acccaacgaa cattgccaac ttccgcccca tctccctact taccgtaacc ctcaaggtta 1680 cgacgaggtg cattctgaaa agattcgagg cagtgcttga agagaccgaa tcggcaaccc 1740 aaaccggatt ccggagaggg tttagcacgc tcgacaacct gcatgccatc aagcaggtcg 1800 cagaaagaac ctcggagtac ggtattccca tttacctggc ctttgtcgac ttcaagaagg 1860 catttgactg cgtcgaatgg agtgcatgct ggaactccct ctggaagtat ggagcccatc 1920 ccaccctcat ccacctgctc cgtcggatct acgagtcttc caccacactc atcagagtga 1980 acgaggaact cgtccctgtg acagtgaaga gaggcgtccg tcaaggagac acgctttctc 2040 cccgcctctt caatgttgca cttcgatctg ccatggacac catcgattgg gaggaggacg 2100 ggattcgaat cgacggaagg aatctcagcc atctcgagta cgctgacgac gtggctctcg 2160 tcgcgaagac ccgtccggaa ctcgagcgca tgctacggaa gctaatggat gcctgcagaa 2220 gagttggtct cgaagtgaat gcaaccaaga cccacctact cacgtcctgc aaaacaacac 2280 gtgccccaat tacaatccag aatctgacat tcaactttgt cgactcgacc acgtacctcg 2340 gaggaagaat ctctcttccc ttggaccaca ccgacgagat tgagcatcgg attcggctcg 2400 gttggcttgc atggagcaag ttatcgcatc tcctttcatc ccgccttctt cccatgaaga 2460 ccaggagaag actcttcgag agctgcatca cctctaccgt actctacggg agtgaagtgt 2520 gggcactcag atcgagcgat aaggagcgac tgagcatcac ccagaggaag atggagcgaa 2580 agatgctggg agtcgcgctg agagaccgct ggaggaacga gcgcgttcgg gagatcacta 2640 agctgcgcga ctggaaccga gaagcactga gacggaaagc gcgatgggcc ctcaaggtca 2700 ggagtatgca aatggagcaa tggacccgcg cgactacatt ctggacgcct tacaaccgga 2760 agcgcccgcc aggcaagcct agagcacgct ggagggacga cttggaccga gctattggga 2820 actggtggaa tacaccccat gaagattttg cccccatcct catctagcaa taattgcctg 2880 aaatgataaa ttgattgatt gatt 2904 // ID BEL-40_CQ-LTR repbase; DNA; INV; 424 BP. XX AC AAWU01013031; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-40_CQ_; KW BEL-40_CQ-I; BEL-40_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-424 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 234-234 (2011). XX DR GenBank; AAWU01013031; Positions 10298 10721. XX SQ Sequence 424 BP; 118 A; 122 C; 95 G; 89 T; 0 other; tgttggatct acgaagtgtc agttggactg gcaacacttc acaaccacga ctactacgat 60 ttttcaccac aagatagcaa ccctgatcgc ctgccgcaac agcttcctgc gatcaagcac 120 aaccctgcgc aacgctacgg ccgcacaacg gcgcgacgaa ccacgacgac gacgaaatac 180 gacgacgaag acgacgacga agaggacgag tgaaaaaacc gccattttgc gggtacagtc 240 ttttttcacc ctcggacaag tgaacacgta gttattaaac ttagtcttaa taaagctagt 300 ttgtagtgct aatccgagtg cgtttgcttt tttaccacac ccacggccag ttcgaacaat 360 caaagtacag tccacccgtg agtattccgc cgttttccgt ggggccaacg tgccggcccg 420 aaca 424 // ID AVPB1 repbase; DNA; INV; 1822 BP. XX AC AY179351; XX DT 18-AUG-2005 (Rel. 10.08, Created) DT 18-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE PiggyBac-like DNA transposon from Adineta vaga. XX KW piggyBac; DNA transposon; Transposable Element; KW Interspersed repeat; AVPB1. XX OS Adineta vaga OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Adinetida; Adinetidae; OC Adineta. XX RN [1] RP 1-1822 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR Genbank; AY179351; Positions 1 1822. XX FH Key Location/Qualifiers FT CDS join(170..679,683..1672) FT /product="AVPB1_1p" FT /translation="MTWFSIPSHASKTTFSNTATENSGLTEFSENITSVED FT AFLCFISEEKLNKIMVYSNIEGNHNTASNGERKPITLIELKAFIGLLLLGG FT LMRKSKKNIKSLWNRSPLESPIFRATMSRNRFETIISSIQFDNKTAREERK FT RTDKFAVSREIWTDFSRKFKEMLQPWIAWYNRTAFGISREVSISSIYPSKP FT DKYAIKFWFCVDVNSYYIFDAFPYIERQPNEHRQRFVGPNVVLELMKPMYG FT SNRNVTIDNFFTSIHLAKELHSGKLTLVGTLRKNKPEIPIEFQSNKNRDVG FT SSIFGFSDNLTLVSYVPKKNKAVILLSSMHHDSKVDIGTGKPNIVLDYNKS FT KGAVDTIDEMCHKYSVKRGTRRWPLCVFYGMIDAAAINAMSLWKKKNPNWN FT ANKKYKRRLFLEELGTLLTSYLLDFRIKNSSTLHKDIQNALVRFGYPRIET FT ELETFVTDSARSKRKRCSLCDYSSDRKVSNTCYKCSEPICKQHSMKRVFRI FT NCSK" XX SQ Sequence 1822 BP; 637 A; 317 C; 340 G; 528 T; 0 other; taatgaagac tgttctgatg tggacagcga aaatggttcg gaaacacatg atgctcaaag 60 tgaaacatcg gaatatgaga gtgaaattga tctcatagaa tttgatcgcc tgcaaatcga 120 tacgaatata gattttatcg aggaaactcc ctttagttcc agatccagga tgacctggtt 180 ctcaattcca tcacatgcaa gcaaaacaac gttttctaat actgccactg agaattctgg 240 acttacagaa tttagtgaaa acataacttc tgtcgaggac gcctttctgt gtttcatatc 300 agaggaaaag ctgaacaaga taatggttta ttcaaatatc gaaggaaatc ataacacagc 360 ttctaatggt gaacggaaac caataacatt aattgaactc aaagcgttta ttgggctttt 420 attattgggt ggattaatga gaaaatcaaa gaaaaatatt aaatctttgt ggaatagaag 480 cccactggaa tctccgatat tcagagctac catgtcaaga aatcgctttg aaacaataat 540 ttcatcaatt cagtttgaca ataaaacagc acgagaagag aggaaacgaa cggacaaatt 600 cgctgtatct cgtgaaattt ggacagattt ttcgagaaaa tttaaagaaa tgctacaacc 660 ctggatcgca tggtacaatt gacgaacggc ttttgggatt tcgagggaag tgtccatttc 720 gtcaatatat ccatctaaac cggataaata tgcgattaaa ttctggttct gtgtcgatgt 780 caattcttac tatatattcg atgcatttcc ttatatcgaa cggcaaccta atgaacatcg 840 gcaacgattt gttggtccta atgtcgtttt agagctaatg aaaccaatgt acggctcgaa 900 tagaaatgtt acgatcgaca atttctttac cagtattcat ttagcgaaag aattacactc 960 agggaaactt actttggtag gaaccttgag aaaaaataag cccgagattc ctatcgaatt 1020 tcagtcaaat aaaaatcgcg acgttggttc atcgatattc ggtttcagcg acaatctaac 1080 actggtgtct tacgttccaa agaaaaacaa agctgtcata ttactctctt ctatgcatca 1140 tgatagcaaa gtagatatcg gaactggaaa accgaatatt gttctggact ataacaaaag 1200 caaaggggct gttgacacta ttgacgaaat gtgtcacaag tactctgtaa aaagaggtac 1260 aagacgatgg ccactatgtg ttttttatgg aatgattgat gctgctgcta tcaatgcaat 1320 gagcttatgg aaaaagaaaa atccaaattg gaatgcgaat aaaaaataca agcgtcgtct 1380 atttctagaa gagttaggaa cacttctgac atcctatctt ttagattttc gcataaaaaa 1440 ctcatcaaca ttacacaaag atattcaaaa tgcgctagtt cgatttggtt atccgcgaat 1500 agaaacagaa ttagaaacat ttgtcacaga ttctgcacgc tccaagcgaa aacgatgttc 1560 actttgcgat tattcaagtg acagaaaagt ttctaataca tgctacaaat gttcagagcc 1620 tatatgcaaa cagcatagta tgaaacgcgt ttttcgtatt aattgttcta aataaacaaa 1680 accaattagt ttagtcattt ttgattgact aatacttaga aacaactcag ttttaatcag 1740 atgcttgttg aaagagacaa aaatatcctt ctcaaaatgc attctatttt tatcgctaaa 1800 aataggcttg ggagtactag gg 1822 // ID Gypsy-16-I_NVi repbase; DNA; INV; 8209 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-16-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-8209 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 769-769 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(1206..2474,2419..3039) FT /product="Gypsy-16-I_NVi_1p" FT /translation="MAKKVEKVVENWCLGIDEEEMDDALESMQLKPDGKRL FT DRVARLNRWLLGEYTESDFRESLDASTVMARVKREELRKSFIDATAEQVST FT RSIYTGDYDAEQPHPLKLLQQEIEANEVVVTPAIEMNYGQPAQGLGQPTGP FT NLQLLNAMTMRDAFAEMPVTEESEQAQAPVSTPEDQARLTAQEAQGRDQNR FT IVTIEVPTGNLVTTVTTTTTTTTTTTTTTTPTTTASEGKGEYVLQNGKFRS FT VKVTAPRAVMGAAAVTGGLDTRSQQSAATTSNRPLNRGYGEDERCSAQYAQ FT SHEVSAQNSVAQNDLLSQSIAENTATMRALMCKMEAMMSQWPTNIASSTRV FT EPPASESNPQNSRRAGHVNFQDTYSINHMNHSSFREDAGAVAGGASNSWIP FT ANPLYSQPPVVARGGGGCECDERAYASAELLGGGGDANATNAHTQALSDSS FT NYDGSRRQNNRPIGPAVNKWGIKFTGDSSQSVENFLQRMEGRRMLDDLSDA FT QMLASLQEVLTDNAYEWLQNRLSEQGEWSSWDDFCECIKRWYGQTQGYQQK FT LLSEVISRTQGPGERVRVYVNNLQGIMRRMVPRPDASQQLDRIYHNMLPSL FT KRGLQRKDFTTVEQLLEAAVEVEAALACEAW*" FT CDS join(3508..4725,4649..7120,7069..8052) FT /product="Gypsy-16-I_NVi_2p" FT /translation="MVALLSSESRQPTISPNEDESRWWVQVGIGGFRVRAL FT YDPGAAMTVMGSLGLQIASACGRPLRQRDGRQAQCANGQTSPIVGFVDLPF FT DVAGVQRDVTVAIMPEATNECLVGTNFGRLFGAVHDPNENVLYLKKERKIV FT QLQVATIATAGAIDIAAVGLAYVSDDEHAQIRRMLDSILPKSNRTIGRTQL FT IEHGIDVGSARPIKQKYYPVSPKIEEEMYQQVRELLAAGIIEPSDSAWSSP FT VVMVRKANGQYRFCVDFRKVNALSKADAYPLPNMDDILRKLQKAKYISTLD FT LSSAYHQIPLKPEARPLTAFTVPGLGLFQFTRMPFGLAYAGATFQRMIDEV FT ISPELQPYAYSYLDDIIIATETFEEHLEYLEKVLQRVNAAWIDDQPGQKCL FT LPRRGEVPRRPGTRRGLTINRDKSVFCREEVKYLGVLVNHDGFRPDPEKIA FT PIVAYPAPKNLKQLRRFLGMASWYRKFLQEFATVAEPLHRLTKKGRKYEWG FT TEQQDAFQQVKALIASAPILHRPDFESQFVLQTDASDTGLGAVLFQVINGE FT ERVLEFASRTMSKAERNYSVTERECLAVIWAIRKFRPYIEGYQFKVVTDHS FT SLRWLCNLHNPTGRLARWALEMQGHVYTVEHRKGLLNAVPDALSRMYEEDA FT VPVEAVGWASETQDEWYCEKVREIQLHPSKDPLYKMYGGQLYYFCPDEKLP FT ASMNDDTAWKIVVPKEKRAEILRECHDEPSAGHQGREKTCARVAMYYYWPG FT MYTDVAEYVRRCFVCQQCKVEQRSPSGLMGKRVITRPWQIIAGDVMGPKPK FT TARGNEYILIFEDLFTRWIECIPIKRANAKTILQHLRERVFLRYGAPEVFL FT SDNGTEFKNKAIDQYLLSDNGTEFKNKAIDQYLREQGVRHELTPPYHPQAN FT PVERVNRTINNMVRALIEENHNTWDERLSDIAFGYNTVPHSSTGVSPAILM FT YGTQPRKPSAARREQDAAAEEELQRRAIDAWHDRMWDLSQLREQASQRTQD FT AQDRQKRYYDAKRKPASYKVGDLVMRRNHVLSSGAEQRSAKLAPPYIGPYK FT IVEIRGTNTYRLVDEMGEQVDLAPAAQLKPFYSSATDEESQKSEEDGDAQP FT PRDASVDAGTSEVPSRSERDSVCEQSDIASEASEVADRPKRXKKTRVVVIN FT PATKKRGRPGKKVYTARRVRPTPETPRVEVDTEKRRPGRPKGSRNRGHATT FT TTNPSTSPRKTRAQRAADLSPVDVTAKNARTKSCGPKLEIFICLGMSPEEL FT QAAFERVMAALTREDERPHVVIREVGEGEDPWSAFPPDDEVEAYLAPEEED FT EEEGALVIRRPEPPALVAGEGETTATGLRRPRAPQPASSSVGRPISARLER FT EPVHRETNTAETRAPPAGSKATKEPSSEEAAEAAWNQHRAKLVEALEASLD FT TRRLEGEAYRLYTQSVRKHGDAFRRCAGKWARLLEEGRAAFYAAEDARRKV FT AKREDRERQLAKREVARIEAAFAEAKARVRQFQAPGRTSASEAEKRPRAKS FT AARTPTSGASDKTSQKSCYVCGHPNVSRSNKCPDVRLHKAK*" XX SQ Sequence 8209 BP; 2222 A; 2036 C; 2418 G; 1528 T; 5 other; ttggcgccca gttcgtgggt tacttaggaa ggaacgcgca aatattgtaa ttttcgcgat 60 tttcggcgcg agcgagaaac cgttgcgaat ttcagaaggt cgcgcgagcg acgtaaacac 120 gagcatcccc gtgttgttta cgtcgcgctc aaatcaggcg gatttgagcg gcgttcgaat 180 ttattcgtgc ggcgtatacg aataattcga acgcaattca cacgagaagt ctgtgaataa 240 taatcggaga tttttgttgc caggccgaga acaaatatct accgatatta tcacacgacg 300 tagaattaaa cgactcggag cagcgaaact tgaagctgac actagtatcg catgctctcc 360 gagtcgaata attttctatt tgttgtcggg agaattttaa agcactctcg ttaggataga 420 gagtagcata attatttttt cgtgacgtct cgtgtgaaag cagagcgtat atgcgaaatt 480 gtaagttgtt taattattaa caattaaaca acaaaaggca gcaaagcggc aacacaaacg 540 cgtgtagccg caaagtatag tccacgcacg ctatagtgga gcgaagcgcg cagcggcgcg 600 agtgtgtgag agagagtgcg agcgcttgcg gcgctacata gaggcaattc tacgagaacg 660 cgacgggcgc gaggtatata gtcacttcca cgggaagacg cggaacagta tacgcccagt 720 tttcagccaa gaagacctca agatttgtca agaagagaac acactcgact cgcagatcga 780 gatatacgcg taattccgcg aaaaatattt tcgtcttttt tcggattttt ttttcgggtg 840 tatgagtgcg tatatgcctg tagcacaaac tgtgcgttgc aagtgagagc acagcggtgc 900 tacgagtgcg cgcgccagaa gcaattcgtg cttatctggc gtaattaaaa ttttttttta 960 cggatttcgc tccgattata cgaattcgat ttttcgtttt attcgatttc gcgtaataat 1020 ctacgtcaca gcgacgcgct taggtgtgca agtgtgtgag tagaaaggtt gtatttttgt 1080 agcttacgtt gtagcaacaa gaatacaatc taggggctat ctggagtgag tgcgtaggat 1140 agggagctgc cacgtcgttg ttgcgcaacc gcgaaacgtc gtttagacgc agaagacaca 1200 ccaacatggc gaagaaagta gagaaggtcg tcgagaattg gtgcttgggc atcgacgaag 1260 aagagatgga cgatgccctc gagagtatgc agttgaagcc agacggcaag cgattagatc 1320 gcgttgctag gctaaacaga tggctcctcg gagagtatac cgagtcagat tttagagaga 1380 gcctggacgc ctcgacagta atggcgcgag tgaagaggga agaactgcgt aagtcattca 1440 tcgacgcaac ggctgaacaa gtatcgacac gctcgattta tacgggcgat tatgacgccg 1500 aacagccgca ccctctgaag ctgttacaac aagagatcga ggccaacgaa gtcgtggtga 1560 cgccggcgat cgagatgaac tacggtcaac cggcgcaagg attaggccaa ccgacagggc 1620 ctaatctaca actgctaaac gcgatgacaa tgagagacgc gttcgcagaa atgccagtca 1680 ctgaggagag tgaacaggca caagcgccag tgtcgacgcc ggaagaccaa gccaggttaa 1740 cggcgcagga ggcgcaaggt cgagaccaga atagaatcgt cacaatagag gtgccgacag 1800 ggaatctggt cacgacagta accacgacaa cgacgacaac gacgacaacg acgacaacga 1860 caacgccgac aactacggcg agcgaaggca aaggcgaata cgtcttgcaa aacggaaagt 1920 tccgcagtgt gaaggtcact gcaccacgcg cagtcatggg cgcagcggca gtgactggcg 1980 gattagacac cagatctcag caaagcgctg caacaacgag caatcgaccg ctcaaccgag 2040 ggtacggtga agacgagcgt tgcagcgcac aatacgccca gagtcacgaa gtatcagcgc 2100 aaaattcggt agctcagaat gacctgctaa gtcagtcgat agctgagaat accgcaacga 2160 tgagggcgct gatgtgcaag atggaggcca tgatgtcaca atggccgact aacatagcgt 2220 ccagtacaag ggtggagcca ccagcaagcg agagtaatcc acagaactca cgcagagccg 2280 gtcatgtgaa tttccaggat acgtatagta tcaatcacat gaatcacagc agcttccgcg 2340 aagacgcggg cgcggtggct ggtggagctt ccaactcttg gatcccggcg aatcctctct 2400 attcgcaacc cccagtagtt gctagggggg gggggggatg cgaatgcgac gaacgcgcat 2460 acgcaagcgc tgagtgacag tagcaactac gatgggtcgc gaagacagaa caacagaccc 2520 atcggtcctg cggtgaacaa atgggggatt aagttcaccg gggactcgtc acagagcgtc 2580 gagaacttcc tgcaacgcat ggaaggacgc agaatgctgg acgacctgtc cgatgcgcag 2640 atgttagcgt cattacaaga agtattgacg gataatgcgt acgagtggtt acagaaccgg 2700 ctctccgaac agggagaatg gtctagctgg gacgatttct gcgagtgcat caagcgatgg 2760 tacgggcaga cgcaaggcta ccagcagaaa ttactgagcg aagtaatttc gagaacgcag 2820 gggcctggcg aacgcgtgcg ggtgtacgta aataacctgc aaggtataat gcggagaatg 2880 gtaccccgac cagacgctag ccagcaactg gatcggatat accataatat gctgcctagt 2940 ctcaaacgag gactccagcg gaaagatttc acaaccgtgg agcagttact cgaggcagcr 3000 gtcgaagttg aggccgctct agcatgcgaa gcctggtaaa acgccgctcg cagccgtagg 3060 attggcagag cagaccgaca cactgcccat aatagtcgac gccataggca agctaytcga 3120 ctcgaagctt acggcaatag cgcaaccggc gaagcacgcc aacgctggag gaggcarcaa 3180 gaagtcagcc cctcagcgag gacgctccgg acctaggaaa gagagaggca agtctcctag 3240 tccgaagcgc aaaagccaga agtctggcga gggttccggt caatcggaca aaagagagca 3300 gaggaagcca atagagtgct tcaactgcca cggaaaaggc cacattgccc gagaatgtcc 3360 tttgggtaag ggaaacggca aaaaggagga gtaggcgagg ttactgctcc tcttgaaccg 3420 tcgcgcgagg agtctccccg agaggagagg cacgtgacgg agggcgtagc cgatacctcg 3480 cgcgaaacct caggcgattt tgatcgcatg gtcgcactgt tgtccagcga atctcgccag 3540 ccgacgatat cgccaaatga agacgagtcg cgctggtggg tacaagtggg catcggaggg 3600 ttcagagtaa gggctctcta cgacccaggc gccgctatga cagtgatggg gtcgttagga 3660 cttcagatcg ctagcgcctg cggtcgacct ttgcgacaac gcgatggccg tcaagcacag 3720 tgtgcaaatg gtcagacgtc gccgatcgta ggatttgtgg acttgccatt cgacgtcgcg 3780 ggtgtacaac gcgacgtaac ggtggcgatc atgcccgaag cgaccaacga atgtctagtt 3840 gggacgaatt tcggacgcct gttcggcgca gtccacgatc caaatgaaaa cgtcctgtat 3900 ctgaagaagg agcgaaagat tgtccaactt caggtggcta cgatagctac agcaggggcg 3960 atagatatcg ccgcggttgg actcgcgtac gtcagtgacg acgagcacgc gcagattcgc 4020 agaatgctgg acagcatcct gcctaaatcc aaccgtacga tcggacggac gcagttgatt 4080 gagcacggga tagacgttgg atcggccagg ccgataaaac agaagtatta tcccgtgtcg 4140 ccaaaaatag aggaggagat gtatcaacaa gtacgagagt tgcttgctgc gggaatcatc 4200 gaaccctctg acagtgcttg gtcaagcccg gtagtgatgg tgcgcaaagc gaacgggcaa 4260 tatcgttttt gcgtcgactt ccggaaagtt aatgcgctgt caaaggctga cgcatatcca 4320 ttacctaaca tggatgatat ccttcgtaaa ctgcagaaag cgaagtacat atcgacgttg 4380 gaccttagca gcgcgtacca tcaaataccg ttgaaaccgg aggcgagacc tctgacggcc 4440 tttacagttc ctgggctagg tttattccaa ttcacgcgta tgccgtttgg tttagcctac 4500 gctggagcga ccttccagcg aatgatcgac gaagtgatca gcccagaact ccagccatac 4560 gcgtactcgt atctcgatga tatcatcatc gcgacagaga cgttcgaaga gcatctcgag 4620 tacttagaga aagtgctcca gcgagtgaac gcggcgtgga ttgacgatca accgggacaa 4680 aagtgtcttt tgccgagaag aggtgaagta cctcggcgtc ctggttaatc acgatggttt 4740 tcggccagat ccagagaaga tagcgcccat tgtggcgtat ccagcgccga aaaacctgaa 4800 acagctgaga cgtttcttag gaatggcgtc gtggtatcgc aaattcttgc aagaattcgc 4860 gaccgtcgct gaaccgctcc atcgcctgac gaagaaaggc agaaaatacg agtgggggac 4920 ggaacaacaa gacgctttcc aacaagtgaa agcgctgata gcgtctgcac cgattcttca 4980 tcgcccagat ttcgaatcac agttcgtgtt gcagacggac gccagtgaca ccggtctcgg 5040 cgcggtactt ttccaagtaa tcaacggaga agagcgggtt ttggaattcg ctagccgcac 5100 aatgtcgaaa gccgagagaa actacagtgt cacagagaga gaatgtctcg ctgtcatctg 5160 ggctatacgg aagttcagac cttatattga gggatatcaa tttaaggtcg tcacggacca 5220 cagtagtttg cggtggctat gcaacttgca taatcccacc ggacgcctag caagatgggc 5280 actagaaatg cagggccacg tctacactgt ggagcataga aaaggcctgc taaatgccgt 5340 accagacgca ttgtcgcgta tgtacgaaga agacgcagta ccagtcgagg cagtcggatg 5400 ggcaagcgaa acccaggacg aatggtactg cgaaaaggtc cgtgagatac agctgcatcc 5460 gagcaaagat ccgctgtata agatgtacgg cgggcagctg tactacttct gcccagatga 5520 gaaattgcca gcgtctatga atgacgatac ggcatggaaa atcgtcgtgc caaaagaaaa 5580 gcgagcagag atcttgcgag agtgtcacga cgagccatca gccggtcacc aggggcgtga 5640 gaaaacttgt gcgcgcgtcg caatgtacta ttactggccc ggtatgtaca ccgacgtcgc 5700 tgagtatgtt cgacgatgct tcgtttgtca gcaatgcaaa gtcgagcagc gctcgccaag 5760 cggcctgatg ggaaaacgag tgattactcg gccatggcag atcatagcag gagatgtcat 5820 ggggccgaaa ccaaagaccg ctcgaggaaa cgaatatatt ctcatattcg aagacttgtt 5880 cacccgctgg atagagtgca ttccgattaa gcgggcgaac gcgaagacaa tcctgcagca 5940 tctccgagag cgggtcttct tgcggtacgg tgcgccagag gtattcctgt cggacaatgg 6000 cacagagttc aaaaataaag ccattgatca gtacctcctg tcggacaatg gcaccgagtt 6060 caaaaataag gccattgatc aatacctcag agaacaaggc gtgagacacg agttgacgcc 6120 accttatcac ccccaggcga acccagtaga gagggtaaat cgcaccatta ataacatggt 6180 gcgcgcgctg atcgaggaga accataatac atgggacgag cgattgagcg acatcgcatt 6240 cgggtataat accgtacctc attcctcgac aggcgtgagt ccagccatat tgatgtatgg 6300 gactcagccg aggaaacctt ctgcagcaag gcgcgagcag gacgccgccg cagaagaaga 6360 gttgcagcga cgcgcgatcg atgcatggca tgatcgcatg tgggatttat cccagctgcg 6420 cgagcaagcg tcgcagagaa cgcaggacgc gcaagatcgc cagaagcgct actatgacgc 6480 caagcgtaaa ccggcgtcat acaaagtcgg agacttagtg atgaggcgaa atcacgtgtt 6540 gtcttctgga gcagagcaac gctccgcgaa gttagcgcca ccatatattg ggccgtataa 6600 aatagtagaa atacgcggca ccaatactta tcgacttgtc gatgagatgg gcgagcaagt 6660 agatcttgca ccagctgcgc agctgaagcc cttctattcg tcagcgacgg acgaagagtc 6720 gcagaagagc gaagaagatg gcgacgctca acctccgagg gacgcgagcg tagacgcggg 6780 tactagcgaa gtaccgtcac gaagcgagcg agactctgtc tgcgaacaga gcgatattgc 6840 gtccgaagcg agcgaagtcg ccgaccgtcc gaagcgarcg aaaaagacgc gcgtggtcgt 6900 aataaaccct gcgaccaaga agagaggccg tccagggaaa aaggtgtaca cggctaggcg 6960 tgtgagacct acgccagaga ctcctagagt ggaggtcgac accgagaaga ggaggcctgg 7020 tcgtccgaag ggttcgcgca accgcgggca tgcaacgaca actactaacc cgtcgacgtc 7080 accgcgaaaa acgcgcgcac aaagagctgc ggacctaagc tagaaatttt catatgctta 7140 ggtatgagtc ccgaggagct gcaagcagcc ttcgagcggg tgatggccgc gttaaccaga 7200 gaggacgagc ggcctcacgt ggtcatccgg gaggtaggcg aaggagaaga cccctggagc 7260 gccttccctc ccgatgacga ggtggaggcg tacctcgccc ccgaagaaga agacgaagaa 7320 gagggggcgt tggtgatacg ccgcccagaa ccacctgcgc tggttgctgg cgagggcgaa 7380 acgaccgcga cgggtctgcg ccgcccgcgt gcgccacaac cagcgtcgtc cagtgtcggg 7440 cgacccattt cggctcgctt ggagcgcgag cccgtccacc gcgaaaccaa caccgcggag 7500 acgcgagcgc cgccggctgg ctcgaaggca acgaaggagc cttcgagcga agaggcagcc 7560 gaagcggcgt ggaaccaaca ccgcgcyaag ctggtggagg cgctggaagc gtcgctggat 7620 acgcgccggc tggagggcga ggcgtaccgg ctgtatacgc agtcggtgcg caaacacggc 7680 gacgccttcc gccggtgcgc cggcaaatgg gcgcgcctgc tcgaggaggg gcgagcggcg 7740 ttctacgccg cagaagacgc gcgacgtaaa gtcgcgaagc gcgaggaccg cgagcgccaa 7800 ctagccaagc gagaggtcgc tcggatagag gcggccttcg ccgaagcgaa agcccgcgtg 7860 cgccagttcc aggcgccggg acggactagc gcgtcggagg cagaaaagcg tccgcgcgcg 7920 aagagcgcgg cgcgcacacc aacatctggg gcgtcggaca agacgtccca gaaatcctgc 7980 tacgtctgcg ggcatcccaa tgtgtcgcgc agcaacaagt gcccagatgt tcggctacac 8040 aaggctaaat aaagcccagt aacggtagtc accgtaataa tgtgtttctt ttatcgccct 8100 agcatcccca gattccttgt cgcggacgcg atatacgacg cttcggttcg tcgcggacga 8160 aatggataga atcgccatct gatcgataga tcgattctag ggggagggt 8209 // ID REP-4_CQ repbase; DNA; INV; 1710 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A repeat family from Culex quinquefasciatus - consensus. XX KW Repetitive element; nonautonomous; REP-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1710 RA Kojima K.K. and Jurka J.; RT "Repeats from the southern house mosquito."; RL Repbase Reports 11(1), 607-607 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. No TIRs. XX SQ Sequence 1710 BP; 458 A; 344 C; 309 G; 599 T; 0 other; tttttgcctt tcttactaaa gaaaggtata ggttttactt taagccagga cgtcatttcc 60 atcttcgtaa atatgtcgat tcagcatgaa ttttaatcgg aaaaatcgtc aaaaaatcac 120 actgcatcga ttttcgacgc tctatcgatt caccttgaca gaactcgtct atctccaacc 180 ctcgaatttt gagaaaataa attccgtgga ggtcgagtga tgttcgtatg tggtaaaatt 240 ttgctaaatt ttgtgtcgcg ccgttctcag ctcccatata tccgatttcg attcttctaa 300 atgcagatga aagctagtgt gctgaacctg ggcgagttgc cgcgcaaatt tcgaaatatc 360 catctgtttt cggatgcggc cagacttttc aacaaaatac acagttttca aatgaaaata 420 tggtcaattt ttttcatttt catatctttt atttatctca aaaattgcat ttttcgagct 480 ctacaacctc ccaaattttc atccagatcc ataatctggt tctggagtta gagccgtttg 540 attaacctac caaaagaaaa aaaatcccta aaaaaaggta aaagcccacc tggtgacctt 600 cgccaacatt tttcatttct gattttattc aaaatttctt gctacattca tagtacagac 660 catgagtgag catctgaaac aaattcgaca tcgatcggca atcccgatca atttttagaa 720 cgatttactt tttgcctttc ttactaaaga aaggtatagg ttttacttta agccaggacg 780 tcatttccat cttcgtaaat atgtcgattc agcatgaatt ttaatcggaa aaatcgtcaa 840 aaaatcacac tgcatcgatt ttcgacgctt cgtcgccaag tcgacaagtt gtcaagttga 900 gcttcgaata ctggcgcgag aatgctggcg catatatttg ctggctcgac ttatactcgt 960 ttgtagtagc ttgtctaccg atgtttccta caacatgtaa gatatcgtag cttttcggtt 1020 tcttttgccc tgtttgtttt tgcataactg tccaatgttt atttaaaaac ttttgtgaca 1080 taggacattc atccggctac aatctatttg ttttccgtgg gctccgaaaa tcgcctaaaa 1140 gtagactcaa ttttttcgtg atttcgtaag tttatttaac agggttcatt ggattccaat 1200 aggagctttc taagcttttc atcgtttata tcgttttcaa agttttcaat gctggcgacg 1260 ccaatggctt tctacctaag gtactcgtga ttgagtgtgt agtggtgcgg ttgtaaaaat 1320 agctatctct gactctgtca cattcgtaga agctgaatgg ctatcttccc tgcaccgtgc 1380 ggcacacgcg gttcacttat acctcgggtt ttctttgcgc ctaccgcggt gtgcgttgtc 1440 actggcgcat gtgaagcttg ctatgctcgc gtttgtcata aacgcttgtt tgattaagaa 1500 tatgtacgat taaaaatatt cttttggtga aaatggcgtt ttttgtttct tttggaaatc 1560 atatcattta taatactttg ctttcatcat aagactttct tcttcttcta aacaaagtca 1620 atgttatgaa ttgaaagaag ttctaccatc gttatattct ttagtaagaa aggctctatc 1680 tcaccccagg tgggattaaa tcgggttttt 1710 // ID DNA8-2_AP repbase; DNA; INV; 236 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-2_AP. XX NM DNA8-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-236 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1744-1744 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 236 BP; 77 A; 59 C; 46 G; 54 T; 0 other; cagtggcgta gcgaacttta aatcatatgg aagcaaacaa attatttatg acctacccca 60 cccaactcca actgatgtat acgatactca tatgctcaaa tagggggttg gggtagcacc 120 caaaataaag ttcaaagact gaccgtgtgt atgggcttta tgaggttatg acccaccacc 180 accccccaga aaccagaagc aattgcttct gaaaaacatg attcgctacg ccactg 236 // ID L2-3_NVi repbase; DNA; INV; 4612 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-3_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4612 RA Bao W. and Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(4), 753-753 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(943..1740,1744..2469,2537..3655,3481..3810, FT 3779..4429) FT /product="L2-3_NVi_1p" FT /translation="MEALKTQNIVSERISKIEQRLGPLEKRLKALDELPAL FT KTRIHNAESTITELQAQIQDLSSRSPTMQQDNGSTVPNTAEICSLRSELAE FT VKRRQEQTSNCVVVVTGLHYTRETSLHLLAFSVVNALDPTVLRRDVASVRT FT MGRLDATNSSARGDGRLPPLAVTLSSSALARSIVIAKARKRKLHTSELDAT FT LLEEAKALSPDHQGLININELLPSDVHKLRTRARLEAKKRQGCRTFVREGR FT LYTCAATMTASVLRSSTPTPSWRLFPGFRPLPTSSNTEATXQYTHQTLSSS FT AQERSKVKSEYLFCEVSAKGVSPIFVGVVYRPPHAPFFQGSNFIDQLTTHM FT HNYSTKVIMGDFNSDQLSSSEDAKFIKAFIDENSLSSVPYGATHHKQGSDT FT WLDLCLIDEQDRLLSYWKTNSPFINGHDLITATLDVQIPRYVPNTYSYRNF FT KGISAEKLRDFLSACDWSSLTSSSLDECISTLNANLTNAINHLAPLRTVTP FT RRQRHPWFTTVSHVXXXXRILVALRDLVSERDRLYRRFRDSRLDSDLRIYR FT LARDNAHKQVEEARLNYYYSRLSTLTDVAEIWRELEKLGISATKAPSPSRF FT TTDELNKHFSSISNDPLAPAVEDYLLTLESLDLPEHFEFSTITESDVLAAV FT SHFDTQARGSDGIPQVVISKALPVLAPLLSHIFNLSLSEPCFPSAWKLSLV FT RALNKVSSPTALTDYRPISLLCFLSKALEWLVHRQVSEYLESRLLLDNFQT FT GFRTGHSTHVRVGINKKKVTLLLLFDFSKAFDTVCHVRLLRKLSTFGFSKQ FT VIRWFASYLTGREQAVVGDNSERFSPRRPAGSVWSLLLHCTSTTSVSALIP FT MFPILSMRMTCKFTVNATLRSSILYQTRRAFLSSASRRVRLVLAVALYIND FT IGFCLDSDVSHLIYADDLQIYSQCHLEELDSLSNKMSANAERIMGWAAQNR FT LKLNVNKTKAIVLGSPYYINALPKVTLINMGVPGQLHLLIWGCQVSFESSV FT RNLGLVLDSKLTWKEHVTQVCKRAHSLMYRLYFFRKSTNLRLRKHLVQALL FT FPIIDYCSLVYCDLTQELDLKLQRLVNTGIRYIYGVRRDEXISPYRRELQW FT LTTAGRRKYFTACFLRKMFNSVVPSYVLAFFDFRVTLRPVRGEVTPLEIPA FT FATETLRNSFHISASYLWNNLPSHIRNTSSTVTFRKLAKDHFFQLENT*" XX SQ Sequence 4612 BP; 1195 A; 1284 C; 977 G; 1128 T; 28 other; tggtgagtac agttgacagc tctgctgatc catcattttt ctgtactgct tttctgctgc 60 ttcttgcctt cgggccctct tctgttgtag atagttgcac gctccttttt gcaaccttca 120 cgtcgataca taaatcgccg aaggctgcaa tttttctctg tcatcttttc tactgcaaac 180 tttctacacg gctcctaacc tacaaatctg cttggcttgg cttggcttgt ttcctcactt 240 gcacggcgtc tctttctctc cgcaatcaac atcgtggcga catctgctgc atcgtgggag 300 aactttcaag gcgacttcac actgatcgag gtcggtactt tcacgctctt tccactaacg 360 ctccatctag ccgtaagagt cacaacacct actctttcac tccgtcgata acttatcatc 420 ctcctacaaa tatctgcttt accgactgct agtaagttat cgacgaaaaa tgagtgcaaa 480 cggcgagaag agtgtggaag tgccttttca gccatgcagc agctgtaaac aacctataca 540 gcacaaagac cccaagaagt gtgaattttg tggtctgcac taccacgtca agtgccaaaa 600 caccgttaac atcaaaacta tcgtggtgaa cgacaggcag attatagcct gtccctcatg 660 tgcagacaac agcaacgcca agggtgccaa actgaggact aaatccacct ctgccacaac 720 atcagcaacg ggagtaactg caacaacagc agcaggcaag aatactccca aacaacagca 780 gccggtgaat aagaagacga ggtcagcacc accatcagcc aacacgtcga gaaatccctc 840 accaacccgc tcggccggag taacaacacc tacgccgtcc gttgaggccg ctttgaggga 900 cattcgcgga gcactgctct cgacgacata cgcaatgctc aaatggaggc cctcaagacc 960 cagaacatcg tgtcggagag gatttctaaa attgagcagc gtctgggccc gctggagaag 1020 cgactcaagg ccctcgatga gctgcctgca ctcaaaactc gcatacacaa tgctgagtcc 1080 accatcaccg agctgcaggc acaaatccaa gatctgtcat cgaggagccc gacaatgcag 1140 caggacaacg gcagcactgt gcccaacact gcggagattt gcagcctacg cagtgaattg 1200 gccgaggtca agaggcgcca ggagcagacc tcgaactgtg tggtcgttgt cacgggtttg 1260 cattataccc gtgaaacctc gctgcatctc ctcgctttct cagttgtaaa cgcgcttgac 1320 cccacggtcc ttagaagaga cgtagcatcc gtcaggacca tggggaggct tgatgctaca 1380 aacagctccg ccaggggtga cggcagactg ccaccactag ccgttaccct atcttcaagt 1440 gcgcttgcac gctcgatcgt catcgccaaa gcccggaaac gcaagctaca caccagcgaa 1500 ttggacgcta ccttgctgga ggaggctaaa gctctgagtc ctgaccatca agggctcata 1560 aacatcaacg agctgctccc ctcagacgtc cataagctgc gtacaagggc taggctggag 1620 gccaagaaga ggcagggctg ccgaacgttc gtcagagaag ggagactcta tacatgcgct 1680 gcaacgatga cagcgagcgt gctacgatca tcaacaccga cgccgagctg gagacttttt 1740 tagcccggtt tccgcccgct gccaacatcg tccaacactg aggctacarc acaatacaca 1800 caccaaaccc tctcttcatc agctcaggaa cgttctaagg tcaagtctga atatctcttc 1860 tgtgaggtat cggcaaaggg agtctctcct atcttcgtgg gggttgtgta tcgtccacct 1920 catgctccat tcttccaagg ctctaacttc atagaccaac taacaaccca catgcacaac 1980 tattccacga aggtcataat gggagacttc aactctgacc aactttcttc atctgaagat 2040 gccaagttca tcaaggcctt cattgatgaa aactcccttt catctgttcc ctatggtgcc 2100 acgcaccaca aacagggttc tgatacctgg cttgacttgt gtctaatcga cgagcaggat 2160 cgcctgctgt catactggaa gacaaactca cctttcatca acggacatga ccttatcacg 2220 gccactctcg acgtacagat tccacgatac gtacctaaca catactctta cagaaacttt 2280 aaaggaatca gcgccgagaa gctaagggac tttcttagcg catgtgactg gtcatccctc 2340 acctcttcat cactcgacga atgcatatcy acacttaacg ctaacctcac taacgccatc 2400 aatcatctcg ccccattgcg gactgtgaca ccaagaagac agcgtcaccc gtggttcacc 2460 acggtttctt gagacagacm trcaggatgc aggacrgccy tyatataacg aaatcttcaa 2520 aayttcgsgt aattagcacg tywgmascwg aawacgratc ctggtagctc ttcgtgacct 2580 tgtatcygag agggatagac tttacaggcg tttcagggac tccaggctcg attcggatct 2640 ccgcatttat agactagcta gagacaatgc tcacaaacaa gtcgaggaag ccaggctgaa 2700 ttattactat tcacgcctgt cwaccttgac tgatgttgcc gagatctgga gagagctgga 2760 gaaacttgga atttctgcca ccaaggcccc ttcaccatct cgatttacca cagatgaact 2820 caacaagcat ttcagctcga tctccaayga tccgttrgct cctgctgttg aggattatct 2880 tctcaccctg gaaagtcttg acctcccgga acatttcgag ttcagcacta taacggaatc 2940 ggatgtgttg gctgcagtat cgcacttcga cactcaggcc aggggaagcg acggmatccc 3000 wcaggttgty atctcaaaag cattgccagt tctcgctccc ttactaagtc atattttcaa 3060 cctgtctctg agcgaaccct gtttcccatc tgcctggaaa ttgtcacttg tgcgcgcact 3120 caacaaagtc agttcaccaa cagccctgac tgactaccgt ccgatttcac ttctctgctt 3180 tctctccaag gccctggagt ggctggtgca caggcaagtc tcagaatacc ttgaatcaag 3240 gctcttacta gacaactttc aaacaggctt ccgcactggc cacagcactc atgtcagggt 3300 cgggataaac aaaaagaaag tgacacttct ccttctcttt gactttagca aggcgtttga 3360 cactgtrtgt cacgtcaggc tcctgagaaa gctatccacc ttcggcttct caaagcaggt 3420 catccgctgg tttgcctcct atctcactgg gagagagcag gccgtcgttg gtgacaatag 3480 cgagcgtttc tctcctcggc gtcccgcagg gtccgtttgg tccttgctgt tgcattgtac 3540 atcaacgaca tcggtttctg ccttgattcc gatgtttccc atcttatcta tgcggatgac 3600 ttgcaaattt acagtcaatg ccaccttgag gagctcgatt ctttatcaaa caagatgagt 3660 gctaatgccg agaggataat gggctgggct gcacaaaaca ggctaaaact caatgttaat 3720 aaaaccaaag caattgtcct gggctcyccc tactacataa atgcactacc taaagtaaca 3780 cttattaata tgggggtgcc aggtcagctt tgaatcctct gtgcgtaatc tgggattggt 3840 gcttgactct aaactcacgt ggaaggagca cgttacacaa gtgtgtaagc gtgctcactc 3900 gctaatgtac aggctctact ttttcagaaa gagtaccaat ctcaggttgc gcaagcatct 3960 tgtgcaagcg ctcctatttc cgattatcga ctactgttct cttgtatact gtgacctgac 4020 gcaggaactc gacttaaaac tacagaggct cgtgaataca ggaattcggt acatctatgg 4080 tgtaaggaga gatgagcrca tctccccgta caggcgggag ctgcagtggc taaccactgc 4140 cggacgcagg aagtacttca cagcttgttt cctacgtaaa atgtttaact cggtcgtacc 4200 atcatacgta ctggcctttt ttgacttccg cgtcacactc cggcctgtga ggggcgaggt 4260 gacacctctg gaaattcctg ctttcgcgac ggagacgctg aggaactcgt ttcatatcag 4320 cgcttcatac ttatggaata acctgccatc acacatcaga aacacttcat ctaccgtcac 4380 tttcagaaaa cttgctaagg atcacttttt tcaactcgaa aacacataac tcgtacacac 4440 tccacgcaca cgcacacacc tactttctcg ctcattccac cttcactatc gccayactyt 4500 cacactatac atactctaaa catacctcaa tctattgtta tttcttttct tactttgtaa 4560 tgctgtaaac ttacaatcgt acaatataat gtacgcaaaa taaaactaat ct 4612 // ID BEL-206_AA-I repbase; DNA; INV; 5746 BP. XX AC AAGE02024196; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-206_AA_; KW BEL-206_AA-LTR; BEL-206_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5746 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024196; Positions 134128 128383. XX CC Positions [4594-5166] - Integrase core CC 'ATGAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 873..1877 FT /product="BEL-206_AA-I_3p" FT /translation="MVAWNLLLDHYQNPVRLKQSYIDSLFEFAPIKRESAG FT ELHSLVEKFEANVKILKQLGERTEFWDLILIRMLSIRLDAVTRRDWEEYCA FT TQQAVTFQDLTTFIQRRVTVLQTIGKTNEVSTAIPAKKPGSRPIVSHGATQ FT VNSRKCLVCSENHPLYQCPKFSKMSIEEKEKEIRRHQLCKNCLRKGHQARD FT CSSSSTCRRCRGHHHTQICTQPETESFKPKSSEASPSPDNALQPSANHGQP FT SPSISASIEISSHTSVGAKPRRVLLATAVVLVIDDTGREHPARALLDSGSE FT CSFVTESFSQLIKARRRRVHLPISGIGQSTSTQHAFYYCPLKD" FT CDS join(1849..2772,2776..5592) FT /product="BEL-206_AA-I_1p" FT /translation="MLFTTVRSRTSDYSAPVELLVMPKVTIDLPASSFDIS FT TWKIPSEVQLADPAFYKMSPIDMVLGAELFFELFIMTGRIELGKNLPILVN FT SVFGWVVTGRSTPSLISSPIAANLALVTDLHQLVEKFWSIEEDTTNSCPSV FT EEAACEEHFQKTVHRNEEGRYIVRLPVRENILNDLDNNRRTATRRFRLLEA FT RLARDQNLKAQYSLFMDEYLSLGHMERVQDYQQSPKRVYHLPHHAVIREDS FT TTTKVRVVFDASCRTANGPSLNDALMVGPILQQDLRSILMRSRMHQVMIIA FT DIKQMYRQILVDPRDTLQRIVWRSSMDAPLDTYKLKTVTYGTASAPFLATR FT VLKQLADDECAEFPEATKVLTNDFYVDDLISGADSIDEAVNLRKQLELLLS FT KGGFQLRKWASNEPDAVADVSADNLAMQPSVDLDRDQCIKTLGLHWEPQTD FT RLRYKVQLPETLNGETLTKRLALSNIARLFDPLGLVGPVVTTAKLFMQSLW FT LLQDNGKPWSWDKELPQSLQDSWQSYQNQLPLLNELRIDRLIICPSPTSVQ FT LHIFSDSSEKAYGACAYLRSTDSNGLIKNALLTSKSKVAPLKQQSIPRLEL FT CGALLAAELYKKISNSLPTSMQTFFWVDSTTVLNWLNRVSKIQLATKDCTW FT NHIAGKENPADILSRGATPESLLNSTLWWRGPEWLQLESSEWPIEQHNDSQ FT TSTTLREARKAPAPALNVHREPSFIDGLVAKFSNYQHMIRVTAYCKRFLRN FT CRKRPRNLFVGNGAFLSSAEIRASEFTLIGLIQQQAFPVEWAQLSKKRPLA FT TKSKLRWFNPFMSEEGVIRIGGRLTNAQQSFDSKHQILLPAHHSASRLLVK FT QVHERNLHAHPQLLLTLLRNRYWIIGARSLARNVVHNCVACFRAYPKRVEQ FT FMADLPSSRVTAVRPFAISGVDYWGPLLLRPATRRSAARKAYVAVFVCFCT FT KAVHMELVVDLTTAKFMQAFRRFTSRRGFCSQIYSDNGRNFVGASNELRRL FT LKSNEFRQAFAQECSNNAIEWHFNPPKASHFGGLWESAIASAQKHLIRVLG FT PHKLDYDDMETLLIQIECCLNSRPIIPISDDPTDIQPLTPGHFLIGSPLKA FT VPDVDVSAIPFNRLHRWQQTQKIFQDVWKRWSAEYLSSLQPRTKWCKAPVA FT IETGRLVILLDENVPPMHWPTARITDVHPGPDGVTRVVTVRTSNGQYTRPV FT SKICLLPLSSTTNSPTQEGDPDKSPSDISTTCNLREN" XX SQ Sequence 5746 BP; 1606 A; 1495 C; 1235 G; 1410 T; 0 other; ttggtccttc gaaccgcatg gccgtcgaac tccggtcatt cgtcaaccct cgttaagcca 60 tcatcgccta actacgcatc taagtcgaac cacgcaccac gatctggtga gaactgccac 120 ttgtctcaac gataaccgac tctggaaccc gtattgttcg gtgcaacatt attccgtctc 180 tccatctggc tgtcatttga attatacaag gcactcgtat tgcctttaac aggtaattat 240 aattccattc attgttatcc attcctctcg ggctacgcgg tcttgcgctt tccagatcgc 300 tgtcgcaata cgtcaccatg tcatcaacgg atcgacgcat caaatctctc aagacgaggc 360 agaagagtct gctcacgtca ttccaactcg tcatgaagtt cgtgaacgga tagagcgaag 420 aaaccgactg ccatgaagtt cctgtgagat tggaacacct tgttacgatc tggaatgatt 480 tcaatgccgt ccaaggagag cttgaaacac tggatgacac caacgtggat acacatttgc 540 aagatcgtat cgctttcgaa tcctcttact tcaaggctaa agggttcctt ctatcggtta 600 ataaaacacc atccacgcca gccactccgt cgacttctca atcacaacca aactcttttt 660 ccgccacgtc ccacgtacgg ctaccggaca tcaagctacc tgggtttgat ggtactctcg 720 accgatggtt aaacttccat gacctctaca tctctttggt ccactcatcg agcgagcttt 780 caaacataca aaaattttac tatcttcgct cttcactcac cggcgaagct ctaaagctgg 840 tgcaaacgat cgcaatctct gccaacaact acatggtggc ctggaacctc cttctggacc 900 actaccaaaa tcctgtacgg ctaaaacaat catacatcga ttcgctcttc gagtttgcgc 960 cgattaaacg agaatcagct ggtgagcttc attcattagt cgaaaagttt gaagcaaatg 1020 taaaaatact caagcagctc ggcgaaagga cagagttttg ggacctcatc ctcattcgaa 1080 tgctgagtat tcgtctggac gccgttacac gaagagattg ggaagaatac tgcgccactc 1140 agcaagctgt cacctttcag gaccttacca cgttcataca gcgtcgcgtt accgtcctgc 1200 aaaccatcgg taagacaaat gaagtgtcta cagccattcc tgcaaagaaa ccgggttcac 1260 ggccaattgt tagccacggt gccacgcaag tcaactccag gaagtgtcta gtgtgttcgg 1320 aaaatcatcc attgtatcaa tgtcctaagt tttccaagat gagtattgaa gagaaggaaa 1380 aggagatccg ccgacatcag ctgtgcaaaa attgcctgcg aaagggccat caggcaaggg 1440 attgctcatc atcgagtacg tgtcggaggt gcagagggca tcatcacacg caaatctgta 1500 cccaacccga aactgagtcg ttcaaaccca agtcttcaga agcatcaccg tcaccggaca 1560 acgctcttca gccatcggcg aaccacggtc aaccttcgcc gtcgatctca gcatctatcg 1620 aaatctcgag tcatacttct gtaggagcaa aaccaagaag agtacttcta gcaacagccg 1680 ttgttctcgt aatcgatgat actggacggg aacatccagc tagggcgcta ctcgactccg 1740 gtagcgaatg ttcttttgta acggaatcat tctctcaact gatcaaggca cgacgcagaa 1800 gggttcatct acccatctcc ggtattggac aatcaacaag cacgcagcat gctttttact 1860 actgtccgct caaggactag tgattattcc gccccggtag aactcctcgt catgcccaag 1920 gtaaccattg acctgccagc atcatctttc gacatctcga cgtggaagat accatccgag 1980 gttcagttag ccgaccctgc attctacaaa atgagcccta ttgatatggt cctaggcgca 2040 gagctctttt tcgagctttt tatcatgaca ggcagaatcg agctcggtaa aaatttgcca 2100 atcctcgtga attctgtttt tggatgggtg gtcacaggaa gaagcacacc cagtcttatt 2160 tcctccccaa tagcagcaaa cttggcgtta gtcacagatc tgcatcagct ggtagagaag 2220 ttttggtcga ttgaggaaga caccacaaat tcatgtccat cagttgaaga agcagcatgc 2280 gaggaacatt tccaaaaaac ggtccatcgc aacgaagaag ggcgatacat cgttcgacta 2340 ccggtgaggg aaaatattct caatgatcta gacaacaatc gacgcacagc cactcgacgt 2400 ttccgtctgc tagaagctag acttgctaga gatcagaacc tgaaagctca atattcgttg 2460 tttatggacg agtacctttc acttggtcac atggagcgtg tccaggatta ccaacaatca 2520 cctaaaaggg tttaccatct tcctcaccac gcagtaatcc gcgaagatag caccacaact 2580 aaagtgcgtg ttgtgtttga tgcctcctgt aggactgcga atggaccatc cttgaacgac 2640 gccctaatgg ttgggccaat actacaacag gatcttcgtt cgattctcat gagatcacga 2700 atgcatcagg taatgattat tgcggacatt aagcaaatgt acagacaaat tttagtcgac 2760 ccccgggaca cttaactgca gaggattgtt tggagatcat caatggatgc ccctttggat 2820 acgtacaaac taaaaaccgt aacgtacggt acagccagcg ccccatttct ggccacacgg 2880 gtgttaaaac agctggcaga cgatgaatgt gcagaattcc cagaagcaac caaggttctg 2940 accaacgatt tctatgtcga tgatttaatt tcgggagccg actcaataga tgaagctgtc 3000 aaccttcgca agcaactaga attattatta agtaaaggag gttttcaatt acgcaaatgg 3060 gcgtcaaatg aacccgacgc agttgcggat gtctctgcag ataatctcgc tatgcagcca 3120 tctgtagatc ttgatcgaga ccagtgtata aaaacgcttg gtttacactg ggaaccacaa 3180 acggatcgct tacgctacaa ggttcaactt cccgaaacac tgaatgggga aacgcttact 3240 aagagactcg ccctctccaa catcgcccgt ctttttgatc cactgggttt ggtaggccca 3300 gttgtgacca cggccaaact gtttatgcaa tcactatggt tacttcaaga caacgggaag 3360 ccttggagtt gggacaaaga actcccacag tcacttcaag acagctggca atcctatcaa 3420 aatcaacttc ctcttctcaa cgagctgcga atcgatcgtc taattatttg tccatcaccc 3480 acgtccgtgc aattgcacat tttctctgac tcttctgaaa aggcgtacgg tgcgtgcgct 3540 tacctgcgat ctacagactc gaacggattg ataaagaacg cactactcac ctctaaatcg 3600 aaggttgccc cgctcaaaca gcagagcatc ccacgacttg agctctgtgg ggctcttctc 3660 gctgcagaac tctataagaa aataagcaat tctttgccaa cttccatgca aacattcttc 3720 tgggtggact ccaccaccgt attaaactgg ctgaacagag tatctaaaat acagttggcc 3780 acaaaggatt gcacatggaa tcacatcgca ggaaaggaaa atccggcaga tattctttca 3840 cgtggagcaa ctccagaatc gctacttaac agcactcttt ggtggagggg gcctgagtgg 3900 ctgcagttgg aatcaagcga atggccaatc gaacaacaca acgattccca gacttctact 3960 actcttcgag aagctcgaaa ggcacccgct ccagctctaa acgttcatcg agagccatct 4020 ttcattgatg gtttagttgc aaaattttca aattatcagc acatgatacg cgtcaccgcc 4080 tactgcaaac gttttcttcg aaactgccgt aaaagaccac gaaacctgtt tgtcggaaat 4140 ggagcctttc tctcttctgc ggaaattaga gcttcggaat tcaccctcat aggactcata 4200 caacaacaag catttccagt cgaatgggca caactgagca aaaaacgacc tcttgctact 4260 aaatcaaaac ttcgttggtt caaccctttt atgtccgagg aaggagtaat ccgcattgga 4320 ggacgactga ctaacgccca gcagtcattt gacagcaagc accaaatttt gctaccagct 4380 catcattctg cctcgcgcct actggtcaaa caggttcacg aacgaaattt gcacgcacac 4440 cctcagcttc tactcacgct tcttcgcaac cgatactgga taataggggc caggagtctg 4500 gccaggaatg tggtccacaa ttgtgttgct tgcttcagag catacccgaa aagggtggaa 4560 caattcatgg ccgatctacc ttcatctcgt gtgactgccg tgcgcccatt tgcaatatca 4620 ggagtggact attggggacc attgcttctg cgaccagcaa ccagacgatc agcagcacga 4680 aaggcttacg ttgcggtttt tgtatgtttt tgtacaaaag ccgtacatat ggagttggtc 4740 gtagacttaa caacagcaaa gttcatgcag gcctttcgcc gattcacttc acgtcgagga 4800 ttttgctccc aaatctacag tgataatggg agaaattttg ttggtgcctc taacgagctg 4860 cgaagattac taaaaagcaa cgaattcagg caggcattcg ctcaagaatg ctccaacaac 4920 gccatagaat ggcacttcaa tcctcccaag gcctcccatt tcgggggact atgggagtct 4980 gccattgcat cagctcaaaa acatctgatc cgtgtcttgg ggccgcataa gctagattac 5040 gacgatatgg agacgctact cattcaaata gaatgctgcc ttaattcgag acctattatc 5100 ccgattagtg atgatccaac agatatccaa ccgctcacac cggggcactt ccttattgga 5160 tcacccttga aagcagttcc agacgtggac gtgtcggcaa taccattcaa caggctccat 5220 cgttggcaac aaactcaaaa aatctttcag gacgtgtgga aacgatggag tgccgagtat 5280 ttatcatcac tgcagccccg cactaaatgg tgcaaagctc ctgttgccat cgaaacgggt 5340 agattggtga ttctactcga cgaaaacgtt ccgccaatgc attggcctac agctcgaatc 5400 actgatgttc accctggacc agacggagta accagagtgg tgacggttcg aacttccaac 5460 gggcagtaca cacgacccgt atcaaaaatt tgcctgctgc cattgtcttc aaccacgaac 5520 tctccaacgc aggaagggga tccagataaa tctccatcgg acatcagtac aacttgcaac 5580 ctgcgggaaa actaaacaga aatattacaa ggcataataa tgcctggttc agttgctttt 5640 cttcgagttc ttcgtgacat ttccatctgc caatgatcgt taagccgact gcgatgcgga 5700 cggacaacct gtcaattcag aaaggagatt tctgaagggg ccagca 5746 // ID YOYOLTR repbase; DNA; INV; 316 BP. XX AC U60529; XX DT 11-JAN-1999 (Rel. 4, Created) DT 11-JAN-1999 (Rel. 4, Last updated, Version 1) XX DE LTR from YOYO retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; LTR; YOYO; YOYOI; YOYOLTR; retrotransposon. XX OS Ceratitis capitata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Tephritoidea; Tephritidae; Ceratitis; Ceratitis. XX RN [1] RP 1-316 RA Zhou Q. and Haymer S.D.; RT "Gypsy-like retrotransposon in the Medfly."; RL Unpublished. XX RN [2] RP 1-316 RA Zhou Q. and Haymer S.D.; RT "YOYOLTR."; RL Direct Submission to Genbank (11-JUN-1996)Genetics & Molecular RL Biology, University of Hawaii, 1960 East West Rd, Honolulu, HI RL 96822, USA. XX DR GenBank; U60529; Positions 1 316. XX SQ Sequence 316 BP; 113 A; 72 C; 51 G; 80 T; 0 other; agttaacgct gctaacattt aaaaactcca aacatatgtt tcccacggta cggaaacgca 60 cccacagcaa caatacaggt aaaatgctga tgccatgtct gccgcgccgg atcaacgtat 120 catatagccc gtacacgcgc aaccatttcg ccaacaccgt aaactgtcac ctgtgcccgc 180 aacgtgatac agatgcttaa tgttaaggaa gtaaaaattg taattcagtc ctaagttaga 240 attcaataaa gtaacatgta ctgaaagcac agaaaaaatt attttttttc tataacttaa 300 cagtaattaa gttaac 316 // ID I_Ele22 repbase; DNA; INV; 7434 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele22. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7434 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7434 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 6 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 3602..7171 FT /product="I_Ele22_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MLVRDRQPAIIALQEIHRTTPTMMNNTLGKKYLWYTK FT IAANLYHSVAIGVSTDLSAELIDVNSDLPITAIRLSWPIPISVVSLYLPNG FT KIPDLESRLISAFNQIPEPKMILGDLNSYHRAWGSRSNNARGSIIVSIASQ FT CDLTILNDGSGTFFRGQVESAIDVSLVSAAITNRFLWSAESEYRGSDHVPI FT LLTLENSFAPETTRRPRWKYEKADWSEFQMTLDNELDISPSASLSDFSALI FT NKVASATIPKTSSKPGRRALHWWNENTQKAVKARRKALRAAKRARKQFPDD FT NPERAIILANYRDARNTCRQTIREAKEKSWSEFLDTINEDQSSSELWRRVN FT SFQGKRRIKGMALKIDGTTTRNPHVIADALADYFHSISSHQRYPEKFIKRH FT SIANEAVHRFLVPPDQGQAFNIPFTMKEMEYALQKAKGKSAGPDDIGYPML FT KRLSPRGKITFLQMINNEWTAHTLPESWKHSFVIPIPKSSGPINDVGSYRP FT IALTSCVAKIMERMVNRRLIEFLEENRKLDHRQHAFRSSYGTGTYLASLGQ FT ILDDAARNGDHVEIASLDLAKAYNRAWAPGILSKLVSWGISGNLLAFVKNF FT LDGRTFQVLLGNHKSKTVTEETGVPQGSVIAVTLFLVAMSGVFLVLPKGVF FT ILVYADDILLIVTGKHPKSIRRKLQAAVSAVAKWSQEVGFDISAEKCARLH FT VCESKHIPPRKPLTVNGVPMPTRKYTKILGVTIDRHLKFHCHFNKIKEACK FT NRVSMIRSISGKRTRSDRQTRLRVADAVVCSRLFYGVELTCRSFEYMVQQL FT SSTYNNTIRALSGLLPSTPASAICVEAGVLPFRYRAAMSICCRAVSYLERT FT EDDGQVCFLAGQANHALTAVAGSVLPPVAGLHRVGPRSWRAREPLIDNAIK FT FHFRKGSNPTQVLANFRHRIQNKYRYADIRYTDGSKLAGRVGFGIYGTNLE FT QSHRLPNQCSVFSAEAAAILTAILEPSENHILIVTDSASSLQALKSSNNKH FT PFIQAIQAELDNERTSTTFMWVPGHCGIPGNERADAAASIGRQSRIFCNEI FT PGDDIRKWIKNTLWDAWASEWHQERSLFIRRIKNEVVPWNDLPNWKEQKVL FT SRLRTGHTRASHNMGDSRNFRKICETCNTQNTVQHFISDCPTLEYLRMQHD FT ITSISRALQNDAVCERTLLNFLKEARLFNEI" FT CDS join(412..2115,2112..3566) FT /product="I_Ele22_1p" FT /translation="MSLANGYPLLLGDPGGPGGRGGGPSNYINGDYTGARL FT PHYMDNDGTAGQLQYLKMQAVSGQIPQDPFLLRLSVEKCIGGQIDGAYKEQ FT QGLSYVLKVRSQTQFNRLLKMNKLNDGTAISITEHPQLNQTKCVVSNADCT FT KLDDEYLKQQLAAQGVKDIRRIKRRKPDGTSENTPTIVLTLSGTVIPPHID FT FGWTRCKTRNFYPTPMLCYRCWEYGHTGKRCTHPTRVCGRCSKVHEEENQK FT LNEQSNIRSMEQDESLSTPTSSMSERIPCTEAAYCKLCKTSDHSVSSRKCP FT VYLKEVAIQHIRVDRGISYSQATREYEARMGASRSNSTYTGVVNASKDSEI FT EDLKAVVEQLRTDAKAKDARIAEMELSPQNRGVHDRMQAVREHGTIEDLIR FT QVSELTATIEKLQQDLRRKDQFILDLMARQRTPEAESCVEIICETPEQQRN FT ALSDSSAEIPPTASFTDPIVNKKVAKWVNAIGTHKDTGTNPKKIAKDTKNK FT KKLNNDGNSTDGSMESIISGHSVCTMDSSSTHPSNSGKRNHEESEASNNSS FT TSSPNAKRHSKVRKAKATNKNKLTSRYAQSFQYDLQYNTVILISFTFTPNE FT PPNNDQDSRKSTEQTLQSPQSLPSNTPKHKSSQEASSSRGPVSAEATPQPE FT LADNPRRSSDLSVSGTHKGSTTRSPPCPDYGRDRQGPARAEAMVVPGLAEP FT PCHPDHAGRGTHKGTKTCSPLDDEGCKQTGLSPYAPVFVPRARLKSHTFSS FT KGGSQSIPPKHDQESQKTNRKSRTKTCNKPVHRRSTGHSASTTTDLFPPLG FT PEVCNSDLYDNLHKTRYGESAIIVPGTAASSYRNAAPGPAPRAAGPARRAR FT RAAGPGAGGRAAARPPGRGPPRAGAGGRRAPGPGGRPPPAPPGAAARGPPR FT GGAGAPPAPGRAPARARPRARPAAPPGAAARRAPPPRPRPGAPARPPARGG FT AAETSLGTLGLKTQAVSPTLPNGLQPPTLQTCFYRPSGYLYLWRVVRHHLP FT LVELYLVRPTIPTQHELTAASVRNLPLVQLLTAHHTVAGPFLPHSGI" XX SQ Sequence 7434 BP; 2094 A; 2000 C; 1740 G; 1598 T; 2 other; cagtttttgc tcgggctatt accgcgtgat caagtgcaca tatcgtgtcg cgatttccaa 60 cagtccgaac ggtcgaaata tccaaagata atcatcaata gttggtgatt catatagctt 120 atcttcgaat ctaacggtgt gagttggaaa acactaaaaa cactgtctgt gaggtgtgaa 180 aaaagacatt gtagctgcct gtatatgcca gcagcagaca atttgttgtg acacaccacc 240 aaagcgcgcc gcgtgtgaag taatagagat aaagacaacc cgatcaaaaa tagcaaagtg 300 atcgaagtga aaaaatcatt caagtttttg ttcgaaaaga aacaagtgta cagtggtgta 360 atatctgagt gcggtgttga ttgttaggag tttcccaacc gtctcgccgg catgtcgctg 420 gcgaacggct acccactcct cttaggggac ccagggggcc ctggaggtag aggaggtggc 480 ccatcgaact acatcaacgg cgactatact ggcgctcgtc tcccacatta catggacaat 540 gatggcacag ctggacagct acaatacctg aaaatgcagg ctgtatctgg acaaataccg 600 caagatcctt tcttgttacg attgtccgtt gaaaaatgca ttggtggaca gattgatggt 660 gcatacaaag aacaacaagg actctcctat gttctgaaag ttcgcagcca gacacaattc 720 aaccggctgc tgaaaatgaa caaactgaat gacggaacgg ctatcagtat cactgaacac 780 ccgcagctca accagacgaa atgcgtggtt tcaaatgctg actgtacgaa actggatgac 840 gagtacctca aacaacagct tgctgcacaa ggcgttaagg acattcgtcg gattaagcgt 900 cgcaaaccgg atggcactag tgaaaacaca ccaacaatcg ttctcaccct tagtggaacc 960 gttatacctc cgcatatcga ttttggatgg acgcgctgca aaacgcgtaa cttctaccca 1020 acaccgatgt tgtgttaccg ttgctgggag tatgggcaca cgggtaagcg ttgcacccat 1080 cccacccggg tctgtggacg atgcagcaag gttcatgagg aggagaatca gaagctgaat 1140 gaacaatcca atataagatc gatggagcaa gatgaaagtc tgtcaacacc taccagcagc 1200 atgtctgagc gcataccgtg taccgaagct gcttattgta agttgtgtaa aacaagtgac 1260 catagcgttt ccagccggaa atgtccagtt tacctcaaag aagtggcaat tcaacacatc 1320 cgtgttgata ggggtatatc gtattcccaa gccacacggg aatacgaggc ccgaatggga 1380 gcgagtagga gcaacagcac ctacacaggt gtcgtgaatg ccagcaagga cagcgaaatc 1440 gaagatctca aagcagtagt cgaacaatta cgaactgatg cgaaggcgaa agatgcgaga 1500 atcgcggaaa tggagctttc accgcaaaac cgcggcgtcc acgatcgcat gcaggcagtt 1560 cgtgaacacg gtacgataga agatctaatt cgacaagttt ctgaactgac tgctaccatc 1620 gaaaagctac agcaagactt gcgaagaaaa gatcagttca ttttggatct tatggccagg 1680 caaagaaccc cagaagcaga atcctgcgtt gaaattatct gcgaaacccc agaacaacag 1740 aggaacgctc tgtcggactc ctccgctgaa ataccgccca ccgcctcttt taccgaccca 1800 atcgtgaata aaaaggttgc caagtgggtc aacgccattg ggacccataa ggataccgga 1860 accaacccaa agaagatcgc gaaagatacg aaaaacaaga aaaagctgaa caacgacggt 1920 aattcgacag acggcagtat ggaatccatc atctccgggc actcagtgtg tacgatggac 1980 tcgtccagca cgcatccgtc gaattctgga aaacggaacc acgaagaatc cgaagcaagc 2040 aacaactcgt caacaagctc accgaatgcc aagcgacata gcaaagtccg caaggcaaaa 2100 gcaactaata aaaattaact tcgcgatacg ctcagtcctt ccagtacgac ctgcagtata 2160 atactgttat tctcatctca ttcacgttca cacctaacga accacccaac aacgatcaag 2220 atagcaggaa gtcgactgag caaacattac aatcacccca atcactacca tcaaacacac 2280 caaaacacaa atcatcacag gaggcctcga gtagccgggg ccccgtcagt gcggaagcta 2340 caccccaacc ggaactggcg gacaaccccc gacgctcaag tgacctttct gttagtggga 2400 cgcacaaggg ctctacaacc cgttccccac catgcccaga ctatggtcgg gacaggcaag 2460 gccccgccag ggcggaagct atggtcgtac cgggtctggc ggaaccccct tgtcatcctg 2520 atcatgctgg aagagggacg cacaaaggta ccaaaacctg ttcccctttg gacgacgagg 2580 gatgcaagca aaccggattg tcaccttatg ctcctgtttt tgttccccga gcaaggctaa 2640 aaagtcacac cttctcctca aagggtggtt cgcaatcgat ccccccaaag cacgaccagg 2700 aaagccagaa aacaaatagg aaaagtcgta ctaaaacctg taacaagcct gtgcatcgcc 2760 gatcgacagg acattccgca tcgacgacaa ccgacctctt cccccctcta ggaccggagg 2820 tttgtaactc tgatctttac gacaatctac ataaaactcg ttacggagaa tcggccatca 2880 ttgtacctgg tacagctgct tcctcgtacc gaaatgcggc cccgggcccc gccccgcgcg 2940 ccgccggccc ggcccggcgc gcgcgccggg cggccggccc cggcgcgggg gggcgcgcgg 3000 ccgcgcggcc cccggggcgg ggcccgcccc gcgcgggggc gggggggcgg cgggcccccg 3060 gcccgggcgg gcggcccccg ccggccccgc cgggcgccgc cgcccggggg cccccccgcg 3120 gcggcgccgg ggccccgccg gcccccgggc gggcgccggc ccgggcgcgc ccgcgcgccc 3180 gccccgcggc cccccccggc gccgcggccc gccgggcgcc cccgccccgc ccgcgccccg 3240 gcgcccccgc ccggcccccc gcgcgcggcg gcgccgcaga aacctctctg ggcaccctgg 3300 gcctcaaaac ccaggcggta agccccaccc tgccaaacgg actacaacca ccaactcttc 3360 aaacttgctt ttaccggccg tcgggatatc tctacctttg gcgggtggtt cgccaccacc 3420 ttccactggt agagctctat cttgttcggc ccacaatacc aacacaacac gaattaacgg 3480 cagcatcagt caggaatcta ccgttagtgc aacttctaac agcacatcat acggtggccg 3540 ggccattttt gccacacagt ggaatataaa cggatttttt cgtaatctcc cggatctcga 3600 aatgttggta cgcgatcggc aacccgcgat aattgcgctc caagaaatcc accgaacaac 3660 acctactatg atgaacaata cattgggaaa aaaatatcta tggtacacca aaattgctgc 3720 caacctgtat cattcggtag ccatcggagt atctaccgat ctctcggctg agctgattga 3780 tgtgaactcg gacctgccga taactgctat tcgtctgtca tggccaatcc ccatctctgt 3840 agtatctctt tacttgccta acggaaagat acccgatttg gaaagtcgac ttataagcgc 3900 cttcaatcaa attccggagc caaagatgat actgggtgac ttgaacagct atcatcgggc 3960 atggggtagc cgctccaata atgcacgtgg ttcaattatc gtcagcattg ccagccagtg 4020 cgatttaaca atccttaatg atggctctgg tactttcttc cgcggacaag tcgaatccgc 4080 aattgacgtc tcacttgtgt cagctgccat cacgaaccgc ttcctctggt ccgcagaatc 4140 tgaatatcga gggagtgatc atgtacctat tcttctcact ctagaaaata gctttgcccc 4200 cgaaacaaca cgacgtcctc gatggaaata cgagaaagct gattggtcag aatttcaaat 4260 gacgcttgac aatgaactag atatatcccc ttcagcttcc ctctcagact tttctgcttt 4320 gattaataaa gttgcctctg ctactattcc aaaaaccagc tcaaaacctg gacgacgcgc 4380 tctccactgg tggaacgaaa atactcaaaa ggctgttaag gctcgcagaa aagctctccg 4440 ggccgctaaa agagccagaa aacagttccc cgatgacaac ccagaaaggg ccattattct 4500 tgcaaactac cgagacgccc gcaatacatg ccgtcagacg atcagggagg cgaaagaaaa 4560 gtcctggtcc gaatttctgg acaccataaa cgaggaccaa tcgtcatctg aactatggcg 4620 acgagtaaac agcttccaag gcaagagacg aatcaaagga atggcactca agattgatgg 4680 tactacaaca agaaaccccc acgtaatagc agacgcactg gcggactatt tccatagtat 4740 ctcttcacac caacgctacc ccgagaaatt catcaaacgc cattcaattg ctaatgaagc 4800 ggtgcatcga ttcctggtcc cacctgacca aggacaagct ttcaacattc catttaccat 4860 gaaagaaatg gagtatgccc tacagaaagc aaaagggaaa tcagccggcc cagatgacat 4920 tggatatccg atgttgaaac gcctctctcc aaggggtaag ataacttttc ttcaaatgat 4980 caataacgaa tggacggctc acacccttcc cgaaagctgg aaacacagtt tcgtcatacc 5040 cattccaaaa tcatccggtc cgataaacga tgtcggcagc tatcggccga tcgccctaac 5100 cagttgcgtc gcaaaaatta tggaaaggat ggtgaaccgc agactaattg agttcctgga 5160 ggaaaaccga aaactggacc atcgacaaca cgctttccga tccagttacg gtaccggaac 5220 gtatttagct tctctcggac aaatcttgga tgatgctgca agaaatggag atcacgtcga 5280 gatagcatcc ttggatctag ccaaggctta taatcgagcc tgggcccccg gaatcctaag 5340 taagctggtc agctggggta tctctggaaa tctgcttgcc tttgttaaaa acttcttgga 5400 tgggcgcact ttccaagttc ttctagggaa ccacaagtcc aaaacggtca ctgaggaaac 5460 aggcgtcccg caagggtctg tgattgcggt taccctattt ctggtggcca tgagtggagt 5520 ttttcttgtg cttcccaaag gagtgttcat cttggtttat gctgacgata ttcttcttat 5580 tgtcaccggg aagcatccta agagtatcag gcgcaaacta caggctgcgg tttccgccgt 5640 tgccaaatgg tcacaggaag tcggtttcga tatatcggca gaaaaatgcg caagacttca 5700 tgtatgtgaa tcgaagcata ttccgccacg caagccgctg acagtaaatg gcgttcctat 5760 gccaacgaga aaatatacca aaatcctcgg cgtcaccatc gaccggcatt tgaaatttca 5820 ttgtcatttc aacaaaatta aggaggcctg taaaaatcga gtgagtatga ttagaagcat 5880 ctctggaaaa cgtactagga gcgataggca aactcggcta cgcgtggccg acgctgttgt 5940 atgcagtcgg ctgttttacg gtgttgagct tacctgccga tctttcgaat acatggtgca 6000 gcaactttcc tcaacctata ataacacaat tcgtgcactg tcgggccttc taccttcgac 6060 cccggcttca gcaatctgtg tggaagcagg cgttcttcca ttccgataca gagcggcgat 6120 gtccatatgt tgtcgagctg taagctacct agagcgaact gaggatgatg ggcaggtttg 6180 cttcctcgct gggcaagcga atcacgccct taccgctgtg gccggatccg tgcttccccc 6240 ggtggccggg ctccaccgtg ttgggcctcg aagctggaga gccagagaac cacttatcga 6300 caacgctatc aaattccatt tccggaaggg ttcaaatcct actcaggttc tagctaactt 6360 tcgccataga attcaaaata aataccgata cgctgatatc cgctatacgg acggctctaa 6420 actagcaggt cgggtgggtt tcggaatcta cggtacaaat ttagaacaat cacaccgttt 6480 accgaatcaa tgctcagttt tctccgctga agcagccgcg atcctcacag caattttgga 6540 acctagtgag aaccacattc ttatagtcac cgattctgcc agttcgctac aagcactgaa 6600 atccagcaac aacaaacatc ctttcattca ggctatccag gccgagctag acaacgagag 6660 aacatcaaca accttcatgt gggtccctgg tcactgcggt ataccgggca atgagcgtgc 6720 agatgctgct gctagtatag gtcgacagag caggatattt tgtaacgaaa ttccgggaga 6780 tgatatcaga aaatggatta aaaacacgtt atgggatgca tgggcaagcg aatggcatca 6840 agaaagatca ctattcataa gacgaatcaa gaacgaggtt gtaccttgga atgaccttcc 6900 aaactggaaa gaacagaaag tattatctcg ccttcgaaca ggccatacac gggcatctca 6960 taatatggga gacagccgaa attttcggaa gatctgtgaa acgtgcaata cgcaaaacac 7020 agtacaacat ttcatcagcg attgtcctac acttgaatac cttcggatgc aacatgatat 7080 cacatctatc agtcgtgctc tccaaaacga tgcagtatgc gaaagaactc ttctaaactt 7140 cctcaaggaa gctagacttt tcaacgaaat ttagcaaatg ctcaacgctt gaaccttcgt 7200 cgatacggaa gtacttggaa gatcatacaa cgcaagatag aatcggccta gaaaggacac 7260 cactacatca gattgattaa cagtatagtk ttaaagagct tcagttttgt tcatatttma 7320 aactctgtat tgctagaaga tacccttcag gtacttcttg atttcctatt ttctaaagag 7380 atgagccagc ctcgggctgc aaatctctta aataaagata tctatctatc tatc 7434 // ID I_Ele17 repbase; DNA; INV; 6431 BP. XX AC . XX DT 08-OCT-2010 (Rel. 15.1, Created) DT 08-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele17. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6431 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6431 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (26-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 341..1816 FT /product="I_Ele17_1p" FT /translation="MALAPDEFPSFGDPGGGSSNFFDGDFAGARLPNYMDP FT EGTYGAIQVLRMEATSGRLPNDPFLLRSSVEKCVGAKIDGAFPEGKDGITY FT ALKIRSKNQISKLLRLTSLSDGTGIKIIEHPTLNVCRCVVNCQSVAGLDDK FT TIEEGLADQGVRNIRRITRRNGDKVENTSTIVLTISGTVIPPNVDFGWIRC FT KTRPYYPAPMLCYNCWNFGHTSKRCQQTRPTCGTCSKDHVIDKETRCTAEV FT FCKRCDRHSHSLSSRKCPVYCKENDIQRIRVDLGISYPQARRRYEQAHGQS FT SFSNVTTAGKDQQIAELSSKVDQLQQEMEKKDRRIVTLESSLSTDNSSMIA FT DLLQKVDLLTSEMKRKDDRIQALESDLQNNSRMGLVRKHGTIEELVSKVTH FT LEEQLSHKEREVNVLRTIFNRKNQLADSKDALSSSEIPSTQDRIQTAEKTK FT QQLSQKKEKKQKKKEWQPTLMYESDTSPIPPRQKNFGENTKERSLYD" FT CDS 1930..6171 FT /product="I_Ele17_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="NTLQPTSAPHTTCSIIEYSKSLIPIIDCNSVHNSALN FT TREEPTGTISTLNTTKPEQRCIEVNKADNRKCSDMPTCSPVIFQDSRGPVG FT AEATPPPEPADNLRHPGSSGRGTHKGRNAYPPVDNVGVKTTSTSTSFVNGT FT HKGIRTRPPLKDNVGAAVTQDNPVSSPSSETPSAKTISSSSTTSSPRRNRT FT TTSTNSGSPNSMATINATKRTSFVLQWNMNGFHNNISDLECLIQSNPPVIL FT AIQEVHRTTVDKMNRSLGGRYRWTYKRNSNVYHSVAIGVLESIQFSTIQLD FT TILPIVAVTLNDPFPVTVVSVYLPCGKIPNLKNAFSQVLESIQGPKLILGD FT INGHHAAWGSPRSDKRGETLLELTEAMDLVILNDGSITFTKGQHESAVDVS FT LASPSIVSRLLWSIGDDPLGSDHHPITISLCQLPPETTRRPRWIYDQADWT FT SFQTAIDEYLCISEHNNLQDFIQAIYQAATTCIPRTSAKPGRKALPWWSPE FT TKQATKARRKALRAAKRLPVDHPDKESAISLYKIKRNECRQVIRDAKDQSW FT ENFLDGINSNQSSSEIWGKVNALSGKRKTRGMAIRHEGVITRDPGIIADAL FT GYYFASLSAFDKYSDNFIRQNQATVDDLDRIQIPDDTAGLPINEPFRLIEL FT EMALAKCKGKSAGADEIGYPMLKNLTPRGKVTLLRLLNKEWSGNTLPQEWK FT SSLVIPLPKNGSSTSAPEDFRPISLTCCISKVMERMVNRRLTHFLESNELL FT DHRQHAFRSGHGTGTYFATLAQVLSDAKTKDQHVELAALDLAKAYNRAWTP FT GVIQQLNKWGLSGHILHFLRNFLQNRTFQVCIGNHRSKHFSEQTGVPQGSV FT IAVTIFLVAMNDIFTSLPKEIYIFVYADDIMLIAVGSTEKALRRKLQASVN FT AVAKWATKSGFVISTDKSAIMHLCKASHRPIRSIVSANGLPIPLKKSVRVL FT GINIDRHLTFHDHFKKVKQSCKTRLNLLRILSKRHKMANRDVRLRVASAIL FT DSRLLYGIELTCLSTDALTTTLAPVYHQSIRIISGLLPSTPADAACVESGR FT LPFERLVTETVCKRAVSFLEKTTPCSGGVLLLDEANRLLQVHANKVLPPVA FT ETHWNGARSWNSPRLKVEMSIKTRFRAGANSQAVTTSVAELLGSKYQNYTH FT RYTDGSKVQDKVGMGITDHENSRFYRLPDPCSIFSAEAAAVLIACTIPSPN FT PIAILTDSASVVTALSSDVARHPWIQAIQMQAPPETVFIWIPGHCGIRGNV FT EADRLAASGRSATFYTRKVPGHDIRRWISSSLRSSWAVSWHNMRTPFLRKI FT KNDISRWEDPKKHRDQRILSRLRTGHTKVSHNMGSDGPFRKICSACNVPNT FT VEHFLINCPQYLAARASHGIAESIRSALNNDSDSIIRLMNYLKHIDLYHQI FT " XX SQ Sequence 6431 BP; 1955 A; 1622 C; 1376 G; 1478 T; 0 other; gagacgaaca gacgtttttg caacacgcgg gattgtttac atccaaataa caatagtttt 60 cgagtttaat caagtttttc attatcgcgg cctatttcgg agccaatttc acgaccatcc 120 ggataccgaa ctaaaagttt tttcccgtgg tcagacaacc gcgtctttcg gacgcgacac 180 gagcgatttt caagggacat cgcccacaaa atctgacagt gttttcgtag tgagtgcgcg 240 ataccgtata gtggagctac actgatctgt tgcctgcggt gccaacaagg agctcagagc 300 tgaggctgtt ttcataaaaa caacccccgg gtggccaacc atggcgttgg caccggatga 360 atttccctcc ttcggggacc ctggaggagg ttcatccaat tttttcgatg gcgactttgc 420 tggagcacga ctgccgaatt acatggatcc cgaaggcaca tatggagcaa tccaggttct 480 gcgcatggaa gcaactagcg gaagattacc gaatgaccct ttcctattac gctcatcggt 540 ggagaaatgt gtgggtgcta aaattgacgg tgctttcccg gaagggaaag atggcataac 600 ctacgctttg aaaattcgta gcaaaaatca gatctcaaag ctcctgagac tgacttcgct 660 atctgacgga accggaatca aaattattga acacccgacc ctcaatgtct gccgttgcgt 720 cgtaaattgc cagtcggttg ccggccttga cgataaaacg attgaagaag gcctagctga 780 ccagggagtt cgcaacattc gccgtatcac cagacgaaac ggagacaaag tcgaaaacac 840 gtcaacaatt gttcttacga tcagtggaac ggtgatcccc ccaaacgtgg atttcggttg 900 gatacggtgc aaaacccgac cgtactatcc agcaccaatg ttgtgctata attgctggaa 960 tttcgggcac accagtaagc gttgccaaca aacacgtcca acttgtggaa catgctctaa 1020 ggaccacgtc atcgataaag aaacccgctg cacagctgaa gttttttgca aacgttgtga 1080 caggcacagc cactccttgt caagccgtaa atgcccggta tactgcaaag aaaatgatat 1140 ccaaaggatc cgagttgatc ttggtatatc atacccgcaa gcgaggagac gatacgaaca 1200 ggcacacggg caaagtagtt tctcaaatgt aactacagcc ggaaaggatc agcaaatcgc 1260 tgagttatcc tcaaaggtgg accagctcca gcaggagatg gaaaagaagg atagaagaat 1320 cgtgactctc gaatctagcc taagcaccga caacagtagc atgattgcgg atctactgca 1380 aaaggtagac ttgctgacca gcgaaatgaa gagaaaagat gatcgcatcc aggcacttga 1440 atcggatctg cagaacaatt cgcgcatggg attggttcga aaacacggca ccattgagga 1500 actagtttct aaggttaccc atttggaaga gcagctcagc cacaaagagc gagaagtgaa 1560 cgtgctccga accattttca atcggaaaaa tcaactggct gattccaagg atgctctttc 1620 atctagcgaa ataccgtcaa cacaagatcg aatccagact gcagaaaaaa cgaagcaaca 1680 actctcccaa aaaaaagaaa agaagcagaa gaaaaaagaa tggcagccga ccctgatgta 1740 tgaaagcgac accagcccaa taccacctcg tcaaaaaaat ttcggagaga acaccaaaga 1800 gagatcactc tacgactgat tccgatgact ccttcgatcg gccaaaaaca aaaataacta 1860 cccctgatag cgatttctat gccaacgtct ccccattggg tggtgacgag ggcatgtcag 1920 aatgattaaa acacgcttca accaaccagt gctcctcata caacatgcag tataattgag 1980 tacagcaagt ctctaatccc aatcatcgat tgtaattctg tccataatag cgcgctaaac 2040 acaagagagg agcctactgg taccatttca acgctaaata ccaccaaacc cgaacaaaga 2100 tgtatcgaag tcaacaaggc cgataaccgt aaatgcagcg acatgccaac gtgttccccg 2160 gttatctttc aggacagtcg aggccccgtc ggtgcggaag ccacaccacc accggaaccg 2220 gcggacaacc ttcgtcatcc tggaagttcc ggaagaggga cgcacaaggg cagaaatgcc 2280 tatccccctg tggacaacgt gggagtcaag actacatcaa cctcaacatc ttttgtcaac 2340 gggacgcaca agggtattcg tactcgtccc ccgttaaagg acaacgttgg agcagcagta 2400 acccaagata accctgtgtc ctccccatct tccgaaactc cgtctgccaa aacgattagc 2460 tcatcatcaa ctacatcatc accaagaagg aacagaacga caacttcgac aaatagtgga 2520 tctccaaaca gcatggccac catcaacgct accaaacgga cctctttcgt tttacaatgg 2580 aatatgaacg gattccataa taacatcagc gacctagaat gccttataca aagcaaccca 2640 ccagttatcc tggctatcca agaagttcac cgtaccaccg tggataaaat gaaccgatcg 2700 cttggcggtc gctaccgttg gacatacaaa aggaattcaa atgtttacca ctcagttgcc 2760 attggagtac tagaatccat tcagttctct acaatacaat tagatacaat ccttcccatc 2820 gttgccgtta cgcttaatga tccgtttcca gtaacggtgg ttagtgttta tctgccatgc 2880 ggcaaaatac caaatttgaa aaatgccttt tcacaagttt tggaatctat acaaggccca 2940 aaactgattc tcggcgatat caacggtcat catgctgcat ggggaagccc tagatcagac 3000 aagcgagggg aaaccttact tgagctcaca gaagctatgg acctggtaat actcaacgat 3060 ggctccatca cgttcactaa gggtcaacac gagtccgctg ttgatgtatc gctagcaagc 3120 cctagcatag tcagtcgatt actctggagt attggcgacg acccgctcgg cagcgatcac 3180 catccaatca ctatctcact ttgccaatta cctccggaga caacgcgccg ccctcgatgg 3240 atatacgatc aagcggattg gacctccttc caaacagcaa tcgacgaata tctctgcatc 3300 tccgaacata acaatcttca agacttcata caagctattt atcaggctgc cacaacatgt 3360 attccaagga ctagcgcaaa acccggccgc aaagctttgc cctggtggtc cccggagaca 3420 aaacaagcaa caaaagcaag gagaaaggcg ttacgagctg cgaagcgtct gcctgtagat 3480 caccctgaca aagagtcagc catttcgtta tataaaatca aacggaatga atgtcgacag 3540 gttattcgag acgcgaaaga tcagagttgg gaaaatttct tagatggaat aaattccaat 3600 caatcttcat ctgagatatg gggaaaagtt aatgctctca gtggaaagcg caaaacccgc 3660 ggcatggcta tccgccatga gggggttatt accagagatc ctggtatcat cgctgatgca 3720 ctaggctatt attttgcatc cctgtcagca ttcgacaaat acagcgacaa cttcatccgc 3780 caaaatcaag ccaccgtaga cgatctcgat agaatacaaa ttcctgatga cactgcaggc 3840 ctacccatta acgaaccttt ccgtctaatt gagctcgaaa tggctttagc caaatgtaaa 3900 ggtaaatccg caggtgcgga cgagataggc taccccatgc taaaaaacct tacacctaga 3960 ggaaaagtta ccctccttcg actgctaaat aaggagtggt caggaaacac cctaccacaa 4020 gaatggaaga gtagcttggt gatccccctt ccgaaaaatg gatcttctac ctctgcacct 4080 gaagattttc gtccgatctc cttaacatgt tgtatcagca aggtaatgga gagaatggta 4140 aaccggcgac taacccactt cctagaatcc aatgaactcc tcgatcacag gcagcacgcc 4200 tttcgttccg gccacggcac aggtacgtac tttgctactc tggcgcaagt tctaagcgac 4260 gccaaaacta aagatcagca cgtcgaactc gccgcattag atttggcgaa agcatacaac 4320 cgcgcctgga cacctggagt cattcaacag ctcaacaaat ggggtctatc aggtcacatt 4380 ttgcatttcc tgcgaaattt tctccagaat agaacttttc aagtttgtat tggaaatcat 4440 cgttccaaac acttttcgga acaaactggc gtgccacaag gctcagttat agccgtcacc 4500 atcttcttgg tggcgatgaa tgatatattc acgagcctcc caaaggaaat ttacattttc 4560 gtctatgcgg acgacattat gctcatcgct gtaggatcaa ctgaaaaggc cctccgccga 4620 aaactacagg catccgtgaa cgcagtggcc aaatgggcaa ctaagtcggg ctttgtaata 4680 tcgaccgata aaagtgccat tatgcatctg tgtaaagcgt ctcaccgtcc aatccgttca 4740 atagtttcag caaatggcct gccaatccct ttgaaaaaat cagtgagggt tctaggaatt 4800 aacatagatc gccatctgac tttccacgat cactttaaga aagtgaaaca gagctgtaaa 4860 actcggttga acctcctacg aattttatcc aagcgccaca aaatggctaa tcgtgatgta 4920 cgtcttcggg tagccagtgc aattctagac agcagactcc tctacggaat tgaactaaca 4980 tgtttatcaa cagatgcact aacaaccacg cttgcccctg tgtatcacca atcaatccgt 5040 atcatttccg ggttgttgcc gtctacacct gctgatgccg cgtgcgtaga aagcggcagg 5100 cttcctttcg aacgcttagt tacggagacg gtatgcaaac gggcagtaag ttttctagaa 5160 aagacgacac catgcagtgg tggagttctt ctccttgatg aggcgaaccg actcctccaa 5220 gtacacgcca acaaagtgct ccctccagtg gcggaaactc attggaatgg agccaggagt 5280 tggaattcgc cacgtctcaa ggtggaaatg tcaataaaaa ctcgttttag agctggtgca 5340 aactcccagg cggtgacgac tagtgtagct gagcttcttg gtagtaaata tcaaaactat 5400 acacaccgct acacggacgg ttccaaagta caggacaaag taggcatggg tattaccgac 5460 catgagaaca gccgtttcta tcgtcttcct gatccatgtt ctattttctc cgcggaagcc 5520 gcggctgtcc taatcgcttg cactatccca tcaccaaatc caattgccat actgactgat 5580 tcagccagtg tcgtcacagc cttatcatcc gatgtagcac gacatccttg gattcaggca 5640 atccaaatgc aagcacctcc tgaaacggta tttatatgga ttcctggcca ctgtggaata 5700 cgtggaaatg tggaagcaga ccggcttgca gcatctggta gaagcgctac gttttacaca 5760 agaaaagttc ctggacatga tataagacga tggatctcct catctttaag aagttcttgg 5820 gcagtgagtt ggcacaatat gcgaacacct ttcctgcgaa aaattaaaaa cgatatatca 5880 cgctgggaag acccaaaaaa gcatcgcgac cagaggatcc tctctagatt gagaactggg 5940 cacaccaaag tttcgcataa catggggtct gacggaccct ttagaaaaat ctgttctgca 6000 tgtaatgttc caaacactgt agaacacttt ctcataaact gcccgcaata cctagcggca 6060 cgagcatcac atggcattgc tgaaagcatt cgctccgcgc tcaataacga ctccgatagc 6120 ataattcggc tgatgaatta cctaaaacac atcgatctgt accaccaaat ctgataatca 6180 tctctctcca atcaactgga cctcgcatcg aaacctgaca acgagaaagg aaaattgtat 6240 ttcaacatca cactagaaca tttaaattaa tcaactttta aagctttgtc ttcaaattga 6300 aaccaactat gtaatactaa tgtgatttat agttagccca ggggagccac tcactatgac 6360 gctctctcgt acggagatga accagccata gggctgaaaa tctccttaat aaagataata 6420 ataataataa t 6431 // ID Copia-16_DPu-I repbase; DNA; INV; 5077 BP. XX AC scaffold_31; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_DPu_; KW Copia-16_DPu-LTR; Copia-16_DPu-I. XX NM Copia-16_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5077 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 695-695 (2010). XX DR Genome; scaffold_31; Positions 930762 935838. XX CC Positions [2131-2649] - Integrase core CC 'AAGAT' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 667..3513 FT /product="Copia-16_DPu-I_1p" FT /translation="MQNIFGAIKEEQSRILMTCNSAREMWIKLESEYEEAA FT ADSIPLLWTKFYGCTFRQGQSVSSFLTELEQIAFRLKSLNIAIDDEQIMAK FT VLMSLPAEFRVFGFAWESTPVAEKTLKRLTARLITLDKSMRNDEEKRSTPD FT AAFLSKNKNGAEPHDEESRECALPAQFDRGKRGHSSHQQSGATKECWECKS FT TTHVRAQCRQYKRRREKEEDEADRKRKRFDRNDRRDSNRSRDQRDKDRHRR FT DDSRDEKDYQKERKGYSYTSSTDRKVKKPSTWYADSGATQHMTDNRALLTN FT FVPTGPEKWSVSGIGESSLTVAGQGDVILTATVNGEHLHGKMRGVLYVPGL FT GINLYSIGTATDAGLKVVFDDDTVSFSQDDVVIMEGKREGKKTLYQLNIQA FT KENHHTVERALSAAQVASLSLWHQRFGHLNHKTVLRMATLGSVAGLALFND FT KLHLSTHCRGCLLGKMSRTSFLSTRTRGTHVGDIVHSDVCGPIQICTPSGA FT RFFFESLCNFITTNFAHGFSVLLLRFFVTFKDDFSSFCEIQLLKRKSEVPE FT AFMKFNAKMKAETGRETKILRSDGGGEYCSKDFENWLAKAGITHQVTPPYT FT PQLNGVAERTNRTVIESARSQMYGKKVPLELWGLAVLCAAYVQNRVISSIG FT KVTPFELWHGKEPDVSHLRTFGSPAFTHIPDEKRRKLDPKATEGIMVGYGD FT SSKVYKIWDPVTRKIITSRDVVFEENLNHENSSELQELDYYSLLPLSPDEQ FT MPESLLDEQIPTDQPDVNMGADDGASQPNDPVIDGLGEAESNETNGETEDE FT FFHGFEPVADPVGHLRRSLRTARPSEKYLLYRGYHATAPGHRTSTDQPEKK FT IYGKLNITLSAALSSPQGPLWKTAVEDELNSLHKNETWELVPLPPGRTPIK FT SKWIFELKPGYDGVAERYKARLVALGCSQRAGLDYNETFGLLFTN" FT CDS 3685..4866 FT /product="Copia-16_DPu-I_2p" FT /translation="MCQPEGYAAYGREADVCRLIKSIYGLKQAPRVWNMEL FT NDAIVTYGLTRSQHDQCLYFRLQGEEWMAALFFVDDAIVCGTNNKALDDFV FT TYLKKMFELRTLPVGRFLGLTIKRDRSTRTLSLSQPDFVDDLVARFKMESC FT HPNVIPAEPGLHLSSAMAPKTKEEEEKMKEVPYQSAVGALLYLSTTTRPDI FT AYAVSKVARFNQNPGPQHWIAVKRIIRYLAGTKDYGIIFSSTKEQGALGFT FT DADYGGDHDDRKSTSGCIFLLNGGPISWFSRKQECTATSTTEAEFVAGSEA FT AKEGTWIKSLLEEIGQGKPGPITLFCDNQGAIKVAYNQEMHRKMKHIAIRY FT WYIREAQTNGVINTSYVSTQEQLADTFTKPLPAPRFKYLRDKIGVRDVGAL FT " XX SQ Sequence 5077 BP; 1458 A; 1109 C; 1231 G; 1279 T; 0 other; ggttatgggc ccagctatac gcgtagctaa tttttcgctc gtgttttttt tttagttcgg 60 aaaagtgaat tgaactgatc ttcggcgatg gaaaaggatg gacgccgcca tattggatgg 120 tgccttgatg gcagaaacta cgaatcatgg aagttcggga tgaaattggc actacaaagt 180 gatgagctct ggggaattgt tgacgggact gaacaaaaac ctgatgtggt aagaaacagt 240 tattattctt tgttgtgttt ttttttgcta gtctttgtgt tgatggaaat tggtggccag 300 cgaggccatg aggctatgtt gtcagtattt ttttttcctt tgctatggtc tgagtgaaca 360 tctacaggac agcttagtgt ttttccatat cgtgatgata tgagagggat tccacattat 420 gataatgtga gagggattcc gcattgagag aatgtgaaga accgagtaga aagctggtgc 480 ttttccacat tgtgacaatg tgagagggat tccgcattat gacaatgtga ggccgagcat 540 ggacatcaaa caaaaaagct aaccagcctt cttctttttc tttccccact tcagagacgc 600 aatgatgccc aagccattgt gaatgcagct gatatcacca attgggagaa gaaaaacacc 660 agggcaatgc aaaacatatt cggagccatt aaagaagaac aatcacgaat cctgatgacc 720 tgcaactcag ccagagagat gtggatcaaa ctggaatctg agtatgaaga agctgccgct 780 gatagcatac cattattgtg gaccaagttt tatgggtgca cctttaggca aggccagagt 840 gtctcaagtt ttttgacaga actagagcaa attgcatttc gtctgaagag cctcaacata 900 gccattgatg acgaacaaat aatggcaaaa gtcctcatgt cactacctgc agaattccgt 960 gtctttggct ttgcttggga aagtacaccc gtcgctgaga agactctaaa aaggctcaca 1020 gcccgtctca ttacactcga caagagcatg agaaatgatg aagaaaaacg atcaactcct 1080 gatgcagcat ttctcagtaa gaacaaaaac ggcgctgaac cacatgatga agagtcaaga 1140 gaatgtgctc tacctgccca gttcgaccgt ggaaaacgag gtcattcttc tcatcaacag 1200 tctggcgcaa cgaaggaatg ctgggagtgc aaatcgacga cgcatgtcag agctcaatgt 1260 cgtcagtaca aacgccggcg tgaaaaagag gaagacgaag cagaccgaaa aagaaaacgt 1320 tttgatcgga acgacagaag agatagtaat cgtagcagag accagaggga caaagacaga 1380 catcgccgtg acgacagcag agatgaaaag gactaccaaa aagaacgaaa ggggtacagc 1440 tatacatcct cgacggatcg caaagtcaaa aagccgtcaa catggtacgc tgactcaggt 1500 gctactcagc acatgacgga caatagggca cttctaacca actttgtacc cactggacca 1560 gagaaatggt ctgtctctgg tataggagaa tcaagtctca ctgtagcagg gcaaggagat 1620 gttattctta ctgctacagt gaatggtgaa catctgcacg gaaagatgag aggcgtgctc 1680 tacgtccctg gattaggtat caatctttac tcgattggta cggcgactga tgcaggcctc 1740 aaagtagtct ttgacgacga cactgtttct ttctctcaag acgacgtcgt catcatggaa 1800 gggaagcgag aaggaaagaa aactttatat caactcaaca tccaagcaaa agagaatcac 1860 catactgttg aaagggcact ctctgctgct caagttgcct ctctctcctt atggcatcaa 1920 cgttttggcc acctcaacca caagactgtc ctaaggatgg caacattggg gagtgtcgct 1980 ggactcgccc tcttcaatga caagcttcat ctctctaccc actgccgagg atgtctccta 2040 gggaaaatgt cacgtacatc tttcttgtct acccgcacca gaggaacaca cgttggagat 2100 atagtccact ctgatgtgtg tggccccatc caaatatgca caccaagcgg agcaaggttt 2160 ttttttgaat cattgtgtaa ttttatcaca acaaattttg ctcatggttt ttctgttttg 2220 cttttaagat tttttgtaac atttaaggat gatttctcga gcttttgtga aatccaactg 2280 ctgaaacgaa aatctgaggt ccctgaagca ttcatgaagt tcaatgccaa gatgaaggct 2340 gagactggac gagaaacaaa gatccttcgc tctgatggag gaggtgaata ttgcagcaag 2400 gattttgaga attggctggc caaagctgga atcactcatc aagtcactcc gccttatacc 2460 cctcaactga acggagttgc cgaaagaaca aataggacgg tgatcgagtc agcacggagc 2520 cagatgtacg ggaagaaagt ccccctggaa ctctggggat tggcggtcct atgtgcagct 2580 tatgttcaga acagggtgat ttctagtatt gggaaagtaa caccttttga actctggcat 2640 ggaaaggaac cggacgtgtc acacttgagg acatttggtt caccagcttt tacgcatata 2700 ccggatgaaa aacgaaggaa acttgacccg aaggctacgg aaggaatcat ggtgggatat 2760 ggcgattcat ccaaagtgta caagatatgg gatccggtga ccagaaaaat catcaccagc 2820 agagatgttg tatttgaaga aaacctcaac catgaaaact cttctgagct acaagagctg 2880 gactactact cacttcttcc attgtcacct gacgaacaga tgcctgagtc tctgctggat 2940 gaacagattc ctacggatca acctgatgtg aacatggggg ccgatgatgg agcttcccaa 3000 ccaaatgatc ccgtcataga tggactaggg gaggctgaat caaacgaaac gaatggagaa 3060 acggaggatg aattttttca tggttttgag ccagttgctg atccagtagg tcacctccgc 3120 cggtccctga gaactgcaag gccatcagaa aaatatcttc tttaccgtgg atatcatgct 3180 acagcacctg gacatcgaac atcgactgat cagcccgaaa agaaaattta tggcaaacta 3240 aacatcactc tgagtgctgc cctctcatca ccacaaggac ctctctggaa gactgccgtc 3300 gaagacgaac ttaactctct tcataagaat gagacgtggg aactcgtccc actgccacca 3360 ggacgcacac ctatcaaaag caagtggatt tttgagctca agcccggcta tgatggagtt 3420 gctgagcgct acaaggcccg gcttgtggcc cttggctgct ctcagcgggc tggactggac 3480 tataatgaaa cattcggttt gctattcact aattaatttt aatttgtaat ttgtctcata 3540 ttcgtttgtt attcattagc tccggttgtc agactatcca ctcttcggat tatcctagct 3600 ctagttgccg tgtgtgatct tagcgtcatc caactagatg tgaagacggc gttcctctat 3660 ggacgtcttg aggaggaagt ctacatgtgc cagccggaag gctatgcggc ctacggccgt 3720 gaggctgatg tctgccggtt gataaagagt atctacggac tgaaacaggc cccccgagtg 3780 tggaatatgg agctaaatga cgctatagtt acatatggcc tcactcgctc tcagcatgat 3840 cagtgtcttt acttccgtct tcaaggggag gaatggatgg cggccctgtt cttcgttgat 3900 gacgctattg tatgtggaac caataacaag gcactggatg actttgtgac atatctcaag 3960 aaaatgttcg agctgaggac tcttcctgta ggccgattcc tgggtctcac aatcaaacga 4020 gacagaagca cacgcacgct gagtctctcc cagcccgatt ttgtggacga tcttgtggca 4080 agatttaaaa tggaaagctg tcacccaaat gtgattcctg ctgagccggg tctacatctc 4140 agttctgcta tggcccctaa aaccaaagag gaagaagaaa aaatgaagga agttccgtat 4200 caaagcgcag ttggagctct tctgtacctc tcaaccacga cacggcctga tattgcttat 4260 gctgttagca aagtagcacg gttcaatcag aacccagggc ctcaacactg gattgctgtc 4320 aaacgcatta ttcgttatct tgccggaacc aaggattatg ggataatctt ctcttcaaca 4380 aaagaacaag gagcgctggg attcactgat gctgactatg gcggagatca tgacgacaga 4440 aagtcaacat caggatgcat ctttctccta aatggtgggc ctatctcatg gttcagccgc 4500 aagcaggaat gcacagccac ctctacaacc gaggcggaat ttgtggccgg aagtgaagcg 4560 gccaaagagg gaacttggat caagtcactg ctggaggaga tcgggcaagg aaaacctggt 4620 cccatcacat tattctgtga taatcaaggg gcaattaaag tagcatacaa ccaggaaatg 4680 catcggaaaa tgaaacatat tgccatccga tattggtaca tcagagaagc ccaaactaac 4740 ggagtaataa acaccagcta tgtgagcacc caggaacaac tagcagacac cttcacgaag 4800 cccctgcctg ctccacgttt caaatacctt cgtgacaaga ttggtgttcg tgatgtaggc 4860 gctctgtaaa ggcggacaca gggcgagcct atggatttcg tatttagttt tttttgtgtt 4920 gttgtcttct ttgattatgt atgatgtaag cttatggtcg tcaggattgt cggacataag 4980 gatttccgac acaaggatcg gtatcctttc cagcataatt tgaattgatg tcattgtatt 5040 gtcttcatgt cttgtgtttc ctcagtttga ggggagg 5077 // ID Copia-8_SI-LTR repbase; DNA; INV; 257 BP. XX AC AEAQ01014066; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_SI_; KW Copia-8_SI-I; Copia-8_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-257 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01014066; Positions 225 481. XX SQ Sequence 257 BP; 64 A; 57 C; 61 G; 75 T; 0 other; tgaaggaaaa actcctcatt tgtaaactcg cgcctgagcg ccaggctcta cgagcgctgc 60 ctgacttatg ggcgctgcaa gcgcgctgtc gattgactaa taaaaactaa agccgataga 120 gggtcttacg accagcgtag atcggggtgt acgctctctc tcgtactgcg tgcgttttca 180 tatagatgtg tgtagcttgt gttactctaa ataaacttct caagtttttt attgagcagt 240 tggtcataat cttaaca 257 // ID Gypsy-103_AA-I repbase; DNA; INV; 5445 BP. XX AC supercont1.322; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-103_AA_; KW Gypsy-103_AA-LTR; Gypsy-103_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5445 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.322; Positions 905659 911103. XX CC 'CTATG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1726..3018 FT /product="Gypsy-103_AA-I_1p" FT /translation="MIVDQIVEKCTSSKLRQKLLKRDMLLDEVEALGTSLE FT ESDRQLREFNRPPSEAINRLTYRGQTSSALNKPEIFKKPYQWSSNRNLYSR FT GGRNFARSEPVCFACGKKGHVKGSDVCPAIKAQCLKCRGYGHFARQCLKRP FT NIDQRGPVPVKRVRAVQEINEQNKEEDGYIFYAMGRNTFLFKIGDVEVPMV FT IDSGAAANIISMSIWEEMKRLGAQVSNMTTTIKKDFTSYASSKPMEIVGRF FT EASIQGGGKHTEATFYVAKEGQQCLLGDETAKELSVLKVGFDIGNVEEQQE FT FTKFKGVIVEIPIDESIQPVQQPYRRVPFALEEKVEGKLRMMLAKGIIERV FT RGPSRWVSPMVPVLKDSGEIRLCIDMRRANQAILRVTHPLPTVEELLGSVN FT GATVFSKLDVKDAYHQLEISEKSRTITTFITKTGLFR" FT CDS 3173..5434 FT /product="Gypsy-103_AA-I_2p" FT /translation="MFGVSCAPEIFQKTMESILAGLKGIVVYLDDVMVYGA FT NRAEHDERLTLLQNRLKEYDILLNDQKCIYGVDQLEFLGHVLVQNGIRPTE FT HRLKAIKNFREPRNVSELRSFLGLITYVGRFIPNLASKTGLLRKLLRKGSC FT FQWEEIHQKAFNEIKEAGCNVGYLGFFSREDKTKLIADASPEGVGAVLLQE FT NKEGCTRIIAFASKALSDLEKKYFQTEREALALVWAVEKFQLYLLGTKFQL FT ITDCKALKFLFSPRSKPCPRIERWVLRLQSYDYEISHEPGATNLADALSRL FT SVSEGKPFDDHTEMYIHHLVEAAIPDAVTLQNVLEETQNDPILQELMCALE FT TNLWTEDINMYKPFKSELHVAEGIVMRGDRLIIPEKLRKQVLDCAHDGHPG FT MSSMKRRLRQKTWWPKMDDQCEKYVKQCNPCTLVSSLGPPEPMLRTKMPDR FT VWTDVAVDFLGPLPSGHNLLVIIDYYSRFVEVVVMKKITAELTIEALFETF FT TRLGVPEVLRSDNGPQFISESFKSFCKEFGITQQKTTPYWPQANGEVEKMN FT SSILKRLRISQETNQEGWKWDLRNFLLMYNSTPHSTTGIAPSALMFGRVLR FT DKLPSVKQSTGQEAESIKDRDWERKLKQAKITDNRRKATLSNLNVGDIVVA FT KRMSKDHKLSSNFDPEQFEIISRTGTNVELQSLSTQKLYHRNVSHLKPITK FT SGDIVENHQTVEKELGKSDKTCPAQQELARRPQRMSRAPEYLRDYVAAIEA FT NIK" XX SQ Sequence 5445 BP; 1739 A; 919 C; 1325 G; 1462 T; 0 other; tttggcgacg acggtgaaat atgtaagtaa gaatttggaa tattgtgaaa ctaatgagtg 60 atatgaaatc aaaatttcac atgaattcgg cacatcattc atacggggaa aatggcggaa 120 gctgaagacg cagaaaatat taacagctag tgtggtatgg tagtggtagt gagcggtaaa 180 aaaaaaatag ctgttgggag acggagagag aagagaatgg ttctctttat gtggggttcc 240 tttttttttt ttttttttcg ttcgttggcc tgaaatgagt gtgtgaaagt tgtgtggtta 300 cctgctggtg cggcagaaaa aaaagcaaaa ttgagtgcaa gaaatctacc ttgtgtatgt 360 atgaatcagt tcagagaagt atgatgatat gaatgtgaat atgatgaggg gatcgcaatt 420 atgcttttga tgttttctca acccgcagag gaaaaattga acaggtgcac ttgtgtaagt 480 gacaaaacaa aatcatcatt ataagtatta tacataggta tacatctaca tatagatggt 540 aaatacttgg gaatgatgct ggatttttgg atttggattt agcatagaaa ggtaagtcaa 600 tattttagtt tcaaagctgc attgaattgt ttttgctttc caatggtctg aaatacacta 660 tgttttataa atatattgac aatcgggtat gataactaat ttcggtgaaa attgaagaag 720 cggaatcgca gagaacggta tcgttcatga tgaaattaga tgacggtatc gtcaaagata 780 aatgactaat atgtcatagc atgacgggat cgtcatccaa taataaagaa gtagagcatg 840 cgggatcgca agtgaagtat actagttcag aacagtagtt acagcgtttt gtgtggaaga 900 gtttggaagc aagcggagtc gcgatgatta tcggtgattc ttaataatat tagcctcatg 960 aagaacaagc ggagtcgcta ttgaaatgaa tataaaagcg ggttcgctgt ttaatagaaa 1020 gtaagcggag tcgctatgaa aatttagatg aaagcgggtt cgcagtttaa taaaggataa 1080 gcgggttcgc tttattatgg tggtagaggg tttattgtaa aaaaaatgaa ctggcagaaa 1140 cattggtact gtgttttttt tatagatgtt tcatcaatag taattatatt taatattatt 1200 acatgattac aatgatgtgc cgaatagtga aatgttggtt atatattcaa cacagaagta 1260 tcctgttttt tttataatgg gggtatattt tgttattttg attacgattg caacagggct 1320 ggtaacgact gtcgagtaag agtgttcctg gcgtctgtgg agtcttcgca gctgccagta 1380 gcgtggagca agtggaaaag agatttagaa tcatattttg atgcagagaa tattgtttcc 1440 cagtacgaga gaagagcaaa gttattgtat ctgggaggac cggacttgag ggatattttt 1500 gataatctcc ctgaaattga taatgtccca catgttttgg tggacccgcc atattatgat 1560 gcggcgattt ccaaattgga cgctcatttt gagccatttc gtcgtcgatc ttatgagcga 1620 catcaatttc gacaaattgc tcaaaaaccg tcagaaaggt tctcagattt cgtgctgcga 1680 ttgcgtacac aagtgaaacg ctgtgaatac agccagcctg atgaaatgat agttgatcag 1740 attgttgaaa aatgtacctc gagcaaactg cgacagaagc ttctcaaacg agatatgtta 1800 ctcgatgagg ttgaagcgct tggtacgagt cttgaagaaa gcgaccgcca gttgagggaa 1860 ttcaacagac ctcccagtga agctatcaac agattgactt atcggggtca aacatcatca 1920 gcactcaaca agccggagat tttcaagaag ccgtatcaat ggtcgagtaa ccgtaatctg 1980 tactctcgtg gtggtagaaa ttttgctcga tcggagccgg tctgtttcgc ttgtgggaaa 2040 aaaggacacg tgaagggatc agatgtgtgt ccggcaatca aagcgcaatg tttgaaatgc 2100 cgaggatacg gccattttgc acgtcagtgt cttaaaagac caaacattga tcaacgagga 2160 cccgtaccag taaaacgtgt tagagcggtt caagagataa atgaacaaaa caaggaggaa 2220 gatggataca ttttttacgc tatgggaaga aacacgtttt tgttcaagat cggagatgta 2280 gaggtaccga tggttataga ttcgggggca gcagctaata tcattagcat gtcaatatgg 2340 gaggaaatga agcgattggg tgcccaagtt tcgaatatga ccaccacgat aaagaaagat 2400 ttcacaagct acgcttcaag caagccgatg gaaatagttg gccgttttga agcttccatc 2460 caaggaggag gaaaacacac tgaagcgacc ttctatgtag ccaaagaagg gcaacaatgc 2520 ttgttgggag atgaaacggc caaagaattg agcgtcctta aagtgggttt tgacatcggt 2580 aatgttgagg aacaacagga attcaccaag ttcaaggggg ttattgttga aatacccatt 2640 gacgagagca tccaaccagt gcaacaacca taccggcgcg taccgtttgc tttggaagag 2700 aaggtggaag gaaagctgag aatgatgttg gctaagggaa tcattgaacg cgtacgagga 2760 ccttcccgtt gggtatcacc gatggtaccg gtgctcaagg attccggcga gatcagatta 2820 tgtatcgaca tgcgtcgagc gaaccaggcg attttaaggg taacgcatcc gcttccaacg 2880 gtcgaggaac tattaggttc agtcaatgga gccacggttt tttccaaatt ggatgttaag 2940 gacgcttatc accaattgga aatttcggaa aaatcgcgaa ccattacaac tttcatcacc 3000 aaaaccggat tgttcaggta atttttcgtt ttggtgaaac tgttcgccat ttttcggtaa 3060 attaaataaa taacatgaca gtgtgaattc ggaataaact taagtaatat tttaatacat 3120 caaaatatct tctttttgta aattttcatc attaaacaga tttaagaggc tcatgtttgg 3180 ggttagttgc gcacccgaaa ttttccagaa gaccatggaa tcgatactgg cgggtctgaa 3240 aggaatagtg gtgtatcttg acgacgtcat ggtttatgga gcaaacagag cagaacatga 3300 cgaaagactg actttactac aaaatcgtct caaagaatat gacattcttc ttaatgacca 3360 aaaatgtatt tatggagtag atcagttaga gtttttagga catgtattgg tccaaaacgg 3420 aatacgccct acagaacatc ggttgaaagc gattaaaaat ttcagagagc ctaggaatgt 3480 atctgaatta agaagttttt taggcttaat cacttatgta ggtcggttca tcccaaacct 3540 tgcctccaaa acgggtttac tcaggaaact tttaaggaaa ggaagttgtt tccagtggga 3600 agaaattcac caaaaagcat tcaacgaaat taaggaagct ggttgcaatg taggatacct 3660 tggatttttc agtagagagg acaaaacaaa acttattgcc gatgctagcc ctgaaggtgt 3720 gggagctgtt ttattacaag agaataagga agggtgtact cgtattattg cttttgcgag 3780 taaagctctc tcagatctgg agaagaaata ttttcagacg gaacgtgaag cgctggcatt 3840 agtctgggct gtcgagaagt tccagctata tctccttggt accaaatttc aactgatcac 3900 ggattgcaag gctcttaagt tcctgttcag tccaaggtcc aaaccatgtc caaggattga 3960 acgatgggtt ttgcgccttc aatcatacga ttatgaaata agtcatgagc cgggtgctac 4020 aaatctagct gatgcccttt cacgcttatc ggtaagtgaa gggaaaccat ttgatgacca 4080 tacagaaatg tatattcacc atctagtgga agctgctatc ccagatgctg tgaccttgca 4140 gaacgttttg gaagagacac aaaatgatcc gattctccaa gaactaatgt gcgctttgga 4200 gacgaattta tggacagagg atataaacat gtataagccg tttaaatccg aattacacgt 4260 agccgaagga atcgttatga gaggcgatcg gctgataatc cctgaaaaac ttcgaaaaca 4320 ggttctagat tgtgcacacg atggacatcc aggtatgagc tctatgaagc gaaggctgag 4380 acagaagact tggtggccaa aaatggacga ccaatgcgaa aaatatgtga agcaatgtaa 4440 tccatgtact ctggtatcat cacttgggcc accagaacct atgttgagaa ctaaaatgcc 4500 ggatagagtt tggacagatg tagcagtaga cttccttggg ccacttccta gtggacataa 4560 tctgttggtc attattgatt actatagccg atttgtggaa gtagtggtta tgaagaagat 4620 tactgcagaa ttgacgattg aagccttgtt tgaaacattc acccgtttag gggtaccgga 4680 agtactacga tcggataatg ggccacaatt tattagtgag agtttcaaat cattttgcaa 4740 agaatttggc atcacacaac aaaagacaac gccttattgg ccacaggcca acggggaagt 4800 tgagaaaatg aacagctcaa tattgaaacg tctacgtata agccaagaaa ctaaccagga 4860 agggtggaag tgggatctca gaaacttctt actgatgtat aactccacgc cacacagcac 4920 tactggtata gcaccatcag ccctcatgtt tggaagggtg ttgagggata aattaccatc 4980 ggtgaagcag tcgaccggtc aggaagctga aagcattaag gacagggatt gggagagaaa 5040 actgaaacag gcgaagataa cggataaccg caggaaggct actctcagca acctgaatgt 5100 aggagatatt gttgtggcta aacggatgtc caaagatcat aaactttcaa gtaatttcga 5160 cccggagcaa tttgagatta ttagccgtac aggaacgaac gtagaactac agtcgttaag 5220 cacgcaaaaa ctataccatc gtaacgtatc gcacctgaaa ccaataacaa aatcaggtga 5280 catagtggaa aaccatcaaa cagtcgagaa ggagttggga aaatcagata agacttgtcc 5340 ggctcaacaa gaattggcgc gacgaccaca acggatgtca agggctccag aataccttcg 5400 ggattatgtt gcagctattg aggctaacat taagtaatgg gggag 5445 // ID RTEX-9_BF repbase; DNA; INV; 1954 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-9_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-4_BF; KW RTEX-9_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1954 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1954 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1725-1725 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The 3' terminus is composed CC of the (ACATT)n microsatellite. XX FH Key Location/Qualifiers FT CDS 1..1812 FT /product="RTEX-9_BF_2p" FT /note="RT." FT /translation="LSHIVPIHKSGDTSLPDNYRGISIISCLGKLFCSILN FT NRLSNYAEENSLFKPQQAGFRKNFRTTDNLFTLNTLVSKYISRNSRLYACF FT VDFSKAFDSVWREGLLLKLKKLGIGGKFLLTIKDMYSKTTNCIKTNSGLTD FT TFVTHCGVRQGCNLSPTLFNLFISDIASEFDHSCCNPPVLHCMNVPCLLYA FT DDLVIFSESQRGLQSSLSRLEEYCKSWRLKVNLKKTKIVVFTKGGRLPRDC FT SFSFNNNVVEIVSSYCYLGIIVNSAGTFKANHKYLYNKGLRALFGVRQSLE FT KTDAPLSVKNKLFDSCVKPILLYGSEIWGSFKSSKSCPIESVHLKFCKQSL FT NVPKTASNLATRAELGRYPLHLEASLNAIKFFTRVCLNVPADSLQADALLC FT QLELDSLGSKCWASGVRKTLEECGYAFVWHSPHSIVSKVPSIISAIQQRLK FT DIYFQTFLCEIHNDNKGKCAKNKLRSYRLFKSTYNEEEYLKIPNERQRSII FT TKLRISCHKLRVESGRHNRTPLEQRVCQFCNLNKIEDESHFLLDCSLYSND FT RSILFSHIIGKFPRFEYMSSVEKFIFLMSFDQTTLYKHISSYIFNITKKRE FT EAEQMY" XX SQ Sequence 1954 BP; 628 A; 386 C; 341 G; 599 T; 0 other; ctcagccata ttgtacctat ccacaaatcc ggagatactt cactcccaga caactataga 60 ggaatctcta ttataagttg tctaggaaaa ttgttctgtt caatcttaaa caatagactg 120 tctaattacg cagaagaaaa cagccttttc aaaccacaac aggccggttt caggaagaat 180 tttagaacaa ctgataatct attcactttg aacacgctcg tcagcaagta tatcagtaga 240 aattcacgtc tgtacgcatg ttttgtagat tttagtaaag cttttgattc tgtttggcga 300 gaaggtcttt tacttaaatt gaagaaactt gggatagggg gcaaatttct cctgaccata 360 aaagatatgt actctaaaac tacaaattgc ataaaaacaa acagtggtct aaccgatact 420 tttgtaactc actgtggggt tcgacaaggc tgtaacctga gtccaacatt gttcaattta 480 tttattagtg acatagcctc agaatttgat cattcctgtt gtaatcctcc agttctacac 540 tgtatgaatg taccttgtct attatatgct gatgatttag taatattctc agagtcacaa 600 cgtggcttac aatcatcttt gtctagatta gaagagtact gtaaatcttg gaggttgaaa 660 gtcaacctaa aaaagacaaa aatcgtagtt ttcaccaagg gtggtcgttt accaagagac 720 tgctccttct cgtttaataa taatgtagta gaaattgtat cttcatattg ttatttaggc 780 ataattgtca actcagctgg cacattcaaa gccaatcaca aatacctata caacaaggga 840 ctaagagctt tgtttggagt tagacaatcc ctggagaaaa ctgatgcccc gttgtctgtt 900 aaaaacaagt tattcgattc ctgtgtcaaa cctatcttac tttatggatc ggagatttgg 960 ggctctttca aaagttcaaa atcttgccca atcgaatccg tacatctaaa attctgtaaa 1020 caatccctca atgtccccaa aacagctagc aatcttgcta cacgggctga attagggagg 1080 taccccttac acttggaggc ctccctaaac gccattaagt ttttcactag agtctgcttg 1140 aatgtgccgg ctgatagtct ccaggcagac gcacttttat gtcaattaga attagactct 1200 ttaggtagta aatgttgggc ttccggtgtg cggaaaactt tagaagaatg cggttatgcc 1260 tttgtttggc attctccaca ctcaattgtt tcaaaagtgc cttcaatcat ctctgctatt 1320 caacaacgtt taaaggacat ctactttcaa acatttctat gcgaaattca taatgacaac 1380 aagggcaaat gcgcaaagaa caaattacgc tcatacagac tgttcaaatc cacctataac 1440 gaagaagagt atcttaagat tccgaacgaa cgacaaagat ctattatcac caaactacga 1500 atcagctgcc acaaactccg tgttgaatca gggaggcaca accgcacacc tctggagcaa 1560 agagtctgtc agttctgtaa cctgaacaag atcgaagatg aatctcactt tctattagat 1620 tgtagtttat atagtaacga tagaagtatt ctttttagcc atattatagg aaagttccct 1680 cgctttgaat atatgtctag tgttgaaaaa ttcatttttc ttatgagctt tgatcaaacg 1740 acattataca aacacatctc aagctacata tttaacataa ctaagaaacg agaagaagcc 1800 gaacaaatgt attagaatag ttttagaatt tatgttttag attagtctta gaaaccatga 1860 tgtttactgc attactaccc tttgtatcta tatgcctgta cttagcccga tagggcatga 1920 atgtgcaata aaggctatta cattacatta catt 1954 // ID BEL-183_AA-I repbase; DNA; INV; 6072 BP. XX AC AAGE02026169; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-183_AA_; KW BEL-183_AA-LTR; BEL-183_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6072 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026169; Positions 13943 7872. XX CC Positions [5121-5681] - Integrase core CC 'AAACC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(30..4577,4581..6038) FT /product="BEL-183_AA-I_1p" FT /translation="MSSREAYGSGKSVKVSKKGKSSKSKVDEVVDGGNVSI FT VVTGVQGDTDADRTLAGRTCKSCRGPDTDEMVQCDKCDKWHHFGCVGVTEE FT VADHSWSCSKCVSAKWAQRSGSTSSKILQPSGGARSSMEAQNPRTSCGLNE FT TDQRSSDPMKSTYKGQHRPGDNDAKTSVALSDLSSSSSRRSSRTLLKLQML FT KLEEEQKLAKEFLERKYALLQEAASDISSKTSSKASSKTNSSMSRVRDWVH FT GDNTHRREAGMLAWPDVFEPQRHSTQNPTTLACVNNLGAQRNQQYPFRHDQ FT LTIGRTFSMRAMSSGATESADGMVRGFQETSRQQLPVPSAVGGPYVTENGP FT TRPSDYRREPRLGVEYDEAHPLTHKQLAARQAISKELPTFSGKPEEWPIFL FT SSFTNTTAMCGFTDAENAVRLQKSLTGKAYDAVKSSLMHPSNVRSVLATLR FT MRFGQPEAIVHSLIAKITALPPLKEDKLEMIMDFAVEVQNFCAIVDACELE FT EHMYNVSLLHQLVSRLPPSIKLDWARYRQALPRVNLATFGNWVYSLAEAAS FT TVTIPNIPETKFDRTEMRQAKKSTGFINAHLEPNEAEPKPLVENAGKSSTE FT CLVCKSGCEAIRKCKQFLELSRDSRWAVVRDFNLCRRCLGKHVGGCKAQAC FT GKNGCTRMHHELLHNDSHIRKDAASMIDKTQQDKPGTSSTQHECNTHRSNV FT NTSLFRYLPVTLQGKNGCIQTFAFLDEGSKLTLMDQDLADELQLEGVGSPL FT YLRWTGGTERCEDDSRIITVSIAGSFNGAKMFKLDGVRTVKELQLPRQSLD FT AEHMQKQYPYLRGLPIESYKHARPQILIGLKHAHVSLVLQCREGKLEQPIA FT IKTRLGWTVCGGSDGDNSPNMVHYSFHIGSRDDRSDEDLHQAMKDYFAIDS FT LGVVKPSETLLSSEDQRSCRMLETLTTFKGDRYETGLLWRYDDIRLPNSRS FT MALRRYHLLEKRMAKDPDLAEALNQRIAEYITKGYIRQLTKDEETQYVSRS FT WYLPVFPVFNPNKPGKLRIVWDAAATIFGVSLNSALLKGPDQLCSLFSILL FT QFREHSIGLTGDIREMFHQIKIREQDQPCQRFFWKDESGETAVFEMCVMTF FT GACCSPSSAQFVKNLNAERFVGQYPQAVEAIIKRHYVDDMLASVKTEEEAI FT RLANEVKHIHSQGGFEIRNWVSNSQTVLRSLGENDAKTKSLDLSAEVATEK FT VLGLWWCTETDTFTYKVCWSRYDESLLEGRRRPSKREVLRVLMSIFDPLGL FT IAHFLIYLKVLLQEIWRSGAQWDDEINDVLFLKWQIWLRVLPEVENVQVSR FT CYYLKDSPDNEIELHTFVDASENGYAAVSYLRTSNDGVECTLITAKTRVAP FT LKFQSIPRLELQAAVLGARLARTIAESLSIRTTRRCFWTDSRDVLCWINSD FT HRRYTQFVAHRVSELLDTTEANEWRWVPTKENVADDATKWQGRPDLTTDSR FT WFNGPQFLRTSESNWPQQAPIHTSTQEEIRPHLVAHISTASPVLNVRDYSK FT QRLLNVVSFLFLFSANCRRKAQNQPNRTGSLAMEEIRAAENYLIQLAQRDN FT YAEEIASLSQGKPIPKNSPLYKSTPFIDKSGIIRMRGRTANCPFITYEVKN FT PIILPRDHHVTTLIITHYHTKYHHLNQETVINELRQKFTISRLRVCCAKIR FT RDCQRCKNDRAVPNTPVMADLPDARLAAYARPFTHVGIDYFGPMEVAVGRR FT VEKRWGMLATCMTTRAVHIEVAHSLSTDSCIIAIRNMIARRGAPRHIYCDR FT GTNFVGTSNELGRVMHELDNESIMSEFVDSGTSWSFNPPSAPHMGGSWERL FT IRSVKCSLKALNLPRRPSDEVLHNALLEIENTINSRPLTHVPIEDNAAPAL FT TPNHILLGSSNGVKPLTILNDSATNVRQCWRFSQIIANQFWKRWISEYLPD FT ITKRTKWYNSNTLSIEIGDVVVIVDPKLPRNCWPKGRVIATRPGRDGEVRS FT ATVRTAAGVYERPVVKLAVLEVRREAE" XX SQ Sequence 6072 BP; 1743 A; 1402 C; 1503 G; 1424 T; 0 other; tcttataata ttcgtttaca tcttcggaaa tgtcaagtcg tgaagcgtac ggttctggta 60 agagtgtgaa ggtgtcgaag aaaggaaaga gctcgaaatc gaaagtggat gaagtggtcg 120 atggtggtaa tgtgtcgata gtggtcacgg gagttcaagg agacacggat gccgatcgaa 180 cattggccgg ccggacttgt aaatcgtgca gaggtccaga caccgatgaa atggttcagt 240 gcgataagtg cgacaaatgg catcactttg gctgcgttgg agtcacggag gaggtagcgg 300 atcatagctg gagttgctca aagtgcgtat cggcaaaatg ggctcagcgt tcaggatcta 360 cttccagtaa aatacttcaa ccgagtggtg gtgcaaggag cagcatggaa gcacaaaacc 420 cacggacgtc ttgtggacta aacgaaaccg atcaacgaag ttctgatcca atgaagtcta 480 catataaggg gcaacatcga cctggagata atgatgctaa gaccagtgtg gcattaagcg 540 atttgtcgtc ctcatcgtcc cgcagatcgt caagaacact cctcaagtta caaatgctga 600 agctggagga ggaacaaaaa ttggcgaagg agtttttgga gcggaagtat gcccttctac 660 aggaggcagc aagcgatatc agttccaaga caagttctaa agcaagttcg aagactaatt 720 ctagcatgag ccgagtacgc gattgggtac atggtgacaa cacccatcga cgagaggctg 780 gtatgttagc ttggccggat gtcttcgaac cccaacggca ttctacgcag aatcctacaa 840 cgttagcgtg tgtgaataac ttgggagcac agagaaatca acagtatcca tttaggcatg 900 accaattgac tattgggcga acgttttcaa tgcgagcaat gtcatccggt gcaacagaat 960 cggccgatgg aatggtgcga ggctttcagg aaacttcacg tcaacagctt cctgttccaa 1020 gcgctgtagg aggtccatac gttaccgaga atgggccaac gcgaccatcg gattacagaa 1080 gagagccacg acttggggta gaatacgatg aagcgcaccc tcttactcat aagcagctag 1140 cagctcggca agccatatca aaggagctgc caaccttctc aggaaaacct gaagagtggc 1200 caatcttctt atcgtccttt acgaacacaa cagcaatgtg tggatttacg gatgccgaga 1260 acgcagtccg acttcaaaaa agtttgacag gaaaggcata cgatgcagtg aaaagtagtt 1320 taatgcatcc gtccaacgtg agaagtgtac tcgctacact ccgtatgaga tttggacagc 1380 cggaagcaat tgttcactcg ctgatagcga aaattaccgc attgccgccg ttgaaagaag 1440 ataagctaga aatgataatg gatttcgcag tggaggttca gaacttttgc gctatagtgg 1500 acgcgtgcga gctagaggag catatgtata acgtatcgct attgcatcaa ctcgtgagca 1560 gactgccacc gtctatcaag ttggactggg ccagatatcg acaagcactt ccgagggtta 1620 acctagccac ttttggaaac tgggtttatt ctctggcaga agcagctagt acagtgacca 1680 ttccaaatat cccagaaacc aagttcgatc ggactgagat gcgacaagcc aaaaaaagta 1740 ctggatttat taacgcacat ttggagccga atgaagcgga accgaagcca ttggtggaga 1800 atgctgggaa aagctcgact gagtgtctag tttgtaagtc cggttgtgaa gcaattagga 1860 aatgcaagca atttttagag ctgtctagag attctcgatg ggctgttgtc cgagatttca 1920 acctatgtcg gcgctgtctt gggaaacacg tcggtggatg taaagctcaa gcttgcggaa 1980 agaatggttg tacgagaatg catcacgaac tgctccataa cgatagtcat atcaggaaag 2040 atgcagcttc aatgatcgac aaaactcagc aagataaacc gggaacgagc tcaactcagc 2100 acgaatgcaa cactcaccgc tccaatgtca acacttccct attccgatac ctgcctgtaa 2160 cactccaagg aaaaaacggg tgcattcaga cgttcgcctt cctggatgag ggttctaaat 2220 taacactcat ggaccaagac ctcgcggatg agcttcaatt ggaaggggtt ggtagtccac 2280 tataccttcg atggactggc ggtaccgagc gttgtgaaga tgattcacgt attatcacag 2340 tttcgatagc aggatccttc aacggtgcaa aaatgttcaa attggacggc gtgcgaacgg 2400 tgaaagaact tcagctgcca cgacaatctc tggatgccga acatatgcag aaacagtatc 2460 cttacttgcg aggcctaccc atcgaatcat acaaacacgc acgtccacaa attttgattg 2520 ggttgaagca cgcccatgta agtcttgtac tgcagtgtcg ggaaggaaag ttagaacaac 2580 ccatagctat aaagacacga ctaggctgga ctgtatgtgg tggaagcgac ggcgacaact 2640 cgccgaacat ggtacactac tcgttccaca taggctcacg agacgatcga tcggatgaag 2700 acctgcatca ggccatgaaa gattattttg ccatcgacag tttgggtgta gtgaagccta 2760 gtgaaactct cctttcttct gaagaccaac gaagttgtag aatgctggaa acgctaacca 2820 ctttcaaggg cgatcgttac gaaacgggct tactttggcg ctacgatgac atccgtttac 2880 caaatagtcg atcgatggcc cttcgacggt atcacctctt ggaaaagcgt atggccaagg 2940 atccagattt ggccgaagca ttgaatcaga ggatagcgga atacattacg aaaggataca 3000 ttcgtcaact gacaaaggat gaagaaaccc aatacgtttc acgttcctgg tatttaccgg 3060 tattcccggt gttcaatccg aacaagccgg ggaaactacg catcgtgtgg gatgcagctg 3120 ccacgatctt cggtgtttca ctgaactccg ctcttttgaa aggccccgat cagttatgct 3180 cactcttctc catactcttg cagttccgtg agcattccat tggattaacc ggcgatatac 3240 gtgagatgtt tcaccagatt aaaattcggg aacaagatca gccatgtcaa cgattttttt 3300 ggaaggatga atccggagaa acagccgttt ttgaaatgtg tgtcatgacg tttggtgcgt 3360 gctgttcgcc cagcagcgca cagttcgtta aaaatttaaa tgcagagcga tttgttgggc 3420 aatatccaca agcagttgaa gctatcatta agcgacacta cgtagatgat atgctagcca 3480 gcgttaaaac ggaagaggag gcaatcagat tagcgaatga agtcaagcat atccactccc 3540 aaggcggctt tgagattcgc aactgggtga gcaactccca aacagtttta cgatcactgg 3600 gagaaaacga tgcgaagacg aaaagccttg acctgtcagc cgaggtagca acagagaagg 3660 ttttggggtt gtggtggtgt acggaaaccg atacgttcac gtataaagta tgctggtcac 3720 gttatgatga atctctccta gaaggacgtc gacgcccctc gaaacgagaa gtactgaggg 3780 tacttatgtc tatttttgat cccctcggtt tgattgcaca ctttttgata tatctgaagg 3840 tcctattgca agaaatctgg cgctctggag ctcaatggga tgatgaaatc aatgacgtcc 3900 tatttttaaa gtggcaaatc tggctgcggg tcctacctga agtggaaaac gttcaagtat 3960 ctcgatgcta ttaccttaaa gactcccccg ataatgaaat cgagttgcat accttcgtcg 4020 atgctagcga gaatggatac gctgctgtat cttatttgag gacatcaaac gatggagtag 4080 aatgtacgct tatcacagcc aaaactagag tcgctccttt gaaattccaa tctattccta 4140 ggcttgaact gcaggccgca gttctcggag caaggttagc acgcacaatc gctgaatccc 4200 tgtctatacg aacgactcga cgctgtttct ggaccgactc ccgagatgta ttgtgctgga 4260 ttaactcgga tcaccgtcgg tatacccagt tcgtggcaca ccgagttagt gagctccttg 4320 acaccacgga ggcaaacgag tggcgttggg tgccgacaaa ggagaatgta gcagatgatg 4380 ccacaaaatg gcaagggcga ccagacctca ccactgatag cagatggttc aatggaccgc 4440 aattccttag gacaagtgaa tccaattggc cacaacaagc tccgatacac acgtctacac 4500 aagaggaaat ccgtccgcat ctagttgcac acatttcgac tgcatctcct gtattgaacg 4560 taagagatta ctcaaaatag caacggcttc tcaacgtagt gagttttctt tttctttttt 4620 cggccaactg tcgacgtaaa gctcagaatc agcctaaccg tactgggtca ttggccatgg 4680 aagaaatacg tgcggccgaa aattatctta ttcaattagc tcaacgagac aactacgcag 4740 aagaaattgc tagcctaagc cagggtaagc caattccaaa aaatagcccg ctctacaagt 4800 caactccgtt cattgacaaa agcggaatca ttcgcatgcg aggacgtaca gccaactgtc 4860 ccttcattac ctacgaagta aagaacccta tcattttacc ccgtgatcac cacgtaacca 4920 cacttataat tacccactat cataccaaat accaccatct caaccaagaa accgtgatta 4980 acgaactccg acagaagttc accatttcgc gacttcgagt ctgttgcgcg aaaattcgaa 5040 gagactgcca acgctgcaaa aatgatcgag ctgtaccaaa cactccagtc atggccgatc 5100 tacccgatgc tagattggct gcctatgctc gacctttcac ccacgtaggg attgactact 5160 ttgggccaat ggaagtcgcc gtcggtcgaa gagtggaaaa acgttgggga atgctagcta 5220 cttgcatgac aacacgagcc gtgcatattg aagtagcgca ttcgctcagc acggattcgt 5280 gtataatagc catccggaat atgattgctc gtcgtggcgc gcctcgtcat atatactgtg 5340 accgcgggac caactttgtg gggacatcta atgaacttgg gcgagtgatg cacgaactcg 5400 acaacgaatc catcatgagc gagttcgttg attctgggac ctcctggtcg ttcaacccac 5460 catcggctcc acacatgggt ggaagttggg agcggctgat tcgtagtgtg aaatgcagtc 5520 taaaggccct gaatctaccc cgacgtccct ccgatgaagt gttacataat gctttgctcg 5580 aaattgaaaa cacgatcaac tcgcgaccat taacacacgt tccgatcgaa gacaacgccg 5640 ctccagctct cacacccaat cacattcttc tggggagctc aaacggcgtg aaaccactta 5700 ctatactcaa cgatagcgct accaacgttc gtcagtgctg gcgtttttca cagataattg 5760 ccaaccagtt ttggaaacgc tggatatctg agtacctgcc ggatataaca aagaggacaa 5820 aatggtataa ctccaacaca ttatcgattg aaatcggtga cgtggtcgtc atcgtggacc 5880 cgaaactccc tcgaaactgt tggcctaaag gaagggtgat tgcgacgcgt ccaggtcgag 5940 atggagaagt tagatcggca acagtaagga ccgctgctgg cgtctatgag cgacctgtag 6000 tcaagctagc tgtattggag gtaagacgcg aagccgagta gtcgaccatg tgatcgacgt 6060 acctgggggg ag 6072 // ID DNA8-109_AP repbase; DNA; INV; 674 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-109_AP. XX NM DNA8-109_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-674 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2047-2047 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. It contains a piggyBac-like insertions (pos. 123-679) CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 674 BP; 239 A; 81 C; 85 G; 268 T; 1 other; tagggaccgg atttatatgc aaatgcatat acgtattgga ataagctttt tgaaatctga 60 tgagaattgt ctccgattta gatcattctt attaaaataa cacgaactgt attttgcata 120 ttttgcatat ttcaacgttt ttacctatta aatacatatt taatgcatat ttgtaggttt 180 atagtgcata ttattattta tagcacatat atttgttttt cgtcgtattt tcacgttcac 240 agccacataa tatcgataat attacgacca gtcattattg ccaaaaaaag atgatataca 300 taaatataat atacatatat atgtatatat tattataata ttaatataaa taaattcgtt 360 gttcctataa tagagttcaa gtgttcgacg gaagccgatc agggctaata ttttatagat 420 tgaagataaa taacgttaaa tggtgaaraa cttagtgaag atgacaaaaa tatacttttt 480 tatactatta aaacccttat atacctcttt tattacttaa ttaattaata cttatttttt 540 acataatttt aaataaaatg ttttcgattc aattttaagt gcatatttaa aaataaatgc 600 atattttgta gcatattaga gcatttttag aagcatattt tgaggtcttt ttgtgcatat 660 aaatccggtc tcta 674 // ID RTE-3_BM repbase; DNA; INV; 1495 BP. XX AC . XX DT 29-APR-2010 (Rel. 15.07, Created) DT 29-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-3_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1495 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1055-1055 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. XX FH Key Location/Qualifiers FT CDS 3..1457 FT /product="RTE-3_BM_1p" FT /translation="MYEVREAVRSMKSGKSEGPDGIPVEVWKILGEDGYKW FT LTLFFNKLLQEEVIPTEWSTSTLVPIYKNKGDVQDCGSYRGIKLMSHSMKV FT WEKVIEKRLRDESEITQNQFGFMPGRGTTDAIFALRQVCEKHRDVHRNLHM FT VFVDLEKAYDRVPRAVLWWALNEKGIPGKYVRLICAMYSRARTYVRTAAGN FT SDEFNVAVGLHQGSALSPYLFLLVMDALTSEIQEEPPWCMLFADDIVLVGE FT NGLEVQNILEKWRCKLESVGLKISRSKTEHLFCDFGGLSNFTPISLDGAPL FT PVCQDFRYLGSVIQSDGELDRTVRHRIDAGWMKWRQVTGTICDSHIPLPLK FT GKIYKTLIRPAVLYGSACWTTKVADERRLHAAEMRMLRWMCGVTRMDRIRN FT EYVRGSLKVAPVTEKLRSARLGWYGHVMRRNENEVVKRVLTMNVEGFRGRG FT RPKKKWMDCVKDDMGRRGVSEEMVYDRRVWKEKTCCADPR" XX SQ Sequence 1495 BP; 448 A; 256 C; 456 G; 335 T; 0 other; gtatgtatga agttagagaa gcagtgagaa gtatgaaaag cggaaaatcg gaaggaccag 60 acggtatacc agtggaagta tggaagatac tgggagaaga cggatataag tggctgactt 120 tattcttcaa taagctgctg caagaagaag tgatccctac ggaatggagc accagtacgc 180 tggtgcccat atacaaaaat aaaggggatg tgcaagactg tggcagctat cggggaataa 240 agcttatgtc acatagtatg aaagtttggg agaaagtgat agagaagcga ttgcgagatg 300 aaagtgagat cacccaaaac cagttcgggt tcatgcctgg tcgcgggaca acggacgcca 360 tatttgcact ccgccaagtg tgcgaaaaac atcgcgacgt gcacaggaat ctgcatatgg 420 tgttcgttga tctggaaaaa gcgtacgacc gagtacctag agcagttttg tggtgggcat 480 tgaatgagaa aggtatacct ggtaagtatg tgaggttaat ctgtgccatg tacagtcgag 540 ccaggacgta tgtacgaacc gccgcgggga attccgacga gttcaatgtg gcagtgggct 600 tacaccaagg gtctgcgcta agtccctatc tcttcctgct tgtgatggat gccttgacgt 660 cggagataca ggaagagccc ccttggtgta tgctgtttgc cgatgacata gtgctcgtcg 720 gagaaaacgg gctcgaggtc caaaacatac tggagaaatg gcgatgcaag ttggagagtg 780 ttggcctaaa aatcagcaga tcgaaaaccg aacatctgtt ctgtgatttt ggcggtctct 840 ccaattttac acccatttcc ctcgacggag cacctttgcc agtatgtcaa gacttccgat 900 acctagggtc tgttatccaa agcgacggtg aactggatcg tactgtgagg cacagaattg 960 acgcaggatg gatgaaatgg cggcaggtca cgggcaccat atgtgactcg cacatccctc 1020 ttcccctaaa agggaagata tataagacct taataaggcc tgccgtcttg tatggatcag 1080 cttgttggac aacgaaagtg gcggatgaaa ggcgattgca tgcagcagag atgcgaatgt 1140 tgcgatggat gtgtggagta acgagaatgg atagaatacg gaatgaatat gttagaggaa 1200 gtctgaaagt ggcacctgtg acagagaagc tgaggagtgc gcgtttggga tggtatggac 1260 atgtgatgag acgaaatgaa aatgaggttg ttaagagagt gttaactatg aatgtggaag 1320 gatttagagg aagaggtaga cctaagaaga aatggatgga ttgcgtgaaa gacgatatgg 1380 gtaggagggg agtgagcgaa gaaatggtat atgatagaag agtatggaag gagaaaacat 1440 gttgcgccga ccccaggtga ctgggagaag ggcaggataa tgatgatgat gatga 1495 // ID AlKe1_AL repbase; DNA; INV; 183 BP. XX AC Y11729; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Acricotopus lucens DNA for tandem repeat sequence AlKe1. XX KW AlKe1_AL; tandem repeat. XX OS Acricotopus lucens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Acricotopus. XX RN [1] RA Staiber W., Wech I. and Preiss A.; RT "Isolation and chromosomal localization of a germ line-specific RT highly repetitive DNA family in Acricotopus lucidus (Diptera, RT Chironomidae)."; RL Chromosoma 106(5), 267-275 (1997). XX DR Genbank; Y11729; Positions 1 183. XX SQ Sequence 183 BP; 59 A; 30 C; 33 G; 61 T; 0 other; aacataatgt gaaaaataca caaaatggca ctttttggaa aaattctcat tgtgacccct 60 atgaagatat atcagattgg tgtttgatat actcagttat atatcaatcg atgcgtcttg 120 gcaccagcaa caatttgata ccattttgag cgctctagga cgtttgtgat agaatttatt 180 gtc 183 // ID CR1-84_HM repbase; DNA; INV; 4327 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-84_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4327 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 371-371 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 259..1173 FT /product="CR1-84_HM_1p" FT /translation="MSKPSSDADRNGAHNTCVNLDVWLKELKTVVLDLKKE FT INEIKKENLAKDEKITKLEGRILVLELETKKTPETTNNANAAPTENVWFKK FT MSAQWLKPGSTSNTALVQAITNNNKLRENKAKNLVIAGLPISKGPSSADKI FT ADDTKIVKDICVAIDNPNAGIKSVRRMFRANNSTSITQSPPLVVVEFDTID FT IRNEILTSSRKLYGKQEFKGIYIRPDRTPAEQTEFAALLKERYEANNDLKK FT HGNLDNPFRFVIRNDRVRCVDVTETQDVNGTNKFIFAGESAARAARRAKFV FT IRNKSSERQHHN*" FT CDS 1022..4177 FT /product="CR1-84_HM_2p" FT /translation="MTGSVASMSQRPRMLMEPTSSYLLEKVQRERQDVLNL FT SYEISHLSDNIIIKLDKNVIFNSPNNYNNTISSESGKQFINFLYTNPTSFT FT QNKLHELELMLSYTNPSDVIMLTETWFTNSSIVNITNFTIFRKDRDSKGGG FT VCIYVSNHLKSLEITDNQLCRENIEQIWCNIKIASESILVGCIYRPNDDAT FT TINEILKSIKRAKELRNKGLFTSIVIAGDFNLAGIEWDSDNTPRTSTQYEQ FT TFIETLHDNNLTQLVNFPTFIKNHDESAKTTLDYIIMNQPERLDQLLDFPP FT LGATCRGRGHLVLTWRYLLNKSTTNNYDNNLRKKYAFDKGNYELIKQYFNK FT IDWESKFTNSSIDEDYEYFVKEYILACNKFIPIRNNKNNTNNLEPWINKTA FT KIALKIKHKLYYRAKNNNWRRTQKTKSKYNRAAKQVKKEIFKSTCAFERHL FT AHESKKNPKLIYAYINKRQQVKRAIRSLYDKIGNITNDQINIANTINNQFQ FT SVFVKEPDDEPVFPIITEHKCSLLTILQDINEQTIKIRLENVDTSKSIGGD FT NIHPFVLKHASVAIAIPLSLIFKKSLATQTVPKNWTEANVTPVFKQGNRLL FT AENYRPISLTSIVCKILEKIVREALTAHLELHGLLSNSQHGFTTKKSCVSN FT LLETLNTITKALAQGHNVDVIYLDYAKAFDTVPHKRLLAKLLSYGIENDLV FT KWIRAFLSNRRQRVVLGETVSDWCNVTSGVPQGSVLGPQLFNIYINDLPRL FT LDNTSKLYADDTKIISIVDTHQQSLNLQNDLNKVQNWSKTWLMKLNTKKCK FT VMHFGKFNNNFKYQMQDTSSNNNMHTLETTTEERDLGVILSSDLKWKPHID FT KITNKANSVLKMLKNTFTCREKELWKKLYTSLVRPHLEYASTAWNPSLASD FT IDKIEKVQKRATKWSQPNTAYANRLANLGLTSLAKRRERGDCIETFKITNK FT IINIEDAFFNMKQSTNLRTFNNHQLQKDIYSSRQANDYSKAVKFRDNFLSN FT RVTQVWNYLPNSVIEASGATPKAAINKFKNNYDKFQKDKINSIKQ*" XX SQ Sequence 4327 BP; 1679 A; 761 C; 704 G; 1183 T; 0 other; atgcacggaa aaaaacacat actgaaactt tttagtccct gtgtaataag tgcaaaaagt 60 aaatattata aaagtttcag acaggaggta atccactgtt atttagggtg gtagccctcc 120 tatccagata tatattggtt gcttactgtt tcagacagga ggtaatccac tgttatttag 180 ggtggtagcc ctcctatcca gatagaagaa cgaagtattt cgatttgaga tatataccaa 240 ataaaatatt gattgataat gtcgaaacca tcaagcgatg cagataggaa tggtgctcat 300 aacacttgtg taaacctaga cgtgtggctg aaggagttga agacagtagt tttagatttg 360 aagaaggaaa ttaacgagat aaaaaaggag aatctagcaa aagatgaaaa gataacaaaa 420 ctcgagggaa gaatattggt actggaatta gaaacgaaaa aaactccaga gacaacaaat 480 aacgctaatg ccgctcctac tgaaaatgta tggttcaaaa agatgtctgc tcaatggtta 540 aaaccaggta gtacctcgaa tacggcctta gttcaagcaa tcaccaacaa caataagcta 600 cgtgagaata aagcaaaaaa tttagtcatc gcagggttac cgatctccaa aggaccaagt 660 tcagctgata aaatagccga cgatacaaaa attgtaaaag atatttgtgt cgcaattgat 720 aacccaaacg ctggcataaa atcggttcgt cgtatgttta gagcaaacaa ctctacttca 780 attacacaat cacctcctct agtagttgtt gagtttgaca caattgatat aagaaatgaa 840 attttaacat cttcaagaaa attatatggt aagcaagagt ttaaaggtat ttacattaga 900 ccggaccgca cgccagctga gcaaactgaa tttgcggctt tactaaaaga aagatatgaa 960 gcaaataacg atttgaagaa acacggtaat ctggacaatc cctttcggtt tgtcatccgc 1020 aatgacaggg tccgttgcgt cgatgtcaca gagacccagg atgttaatgg aaccaacaag 1080 ttcatatttg ctggagaaag tgcagcgaga gcggcaagac gtgctaaatt tgtcatacga 1140 aataagtcat ctgagcgaca acatcataat taaattagac aaaaatgtca tattcaactc 1200 accaaataat tacaacaata ccatatcttc agaatcaggt aaacaattta tcaacttttt 1260 atacactaac ccaacatctt ttacgcaaaa caaactacat gaacttgaat tgatgttatc 1320 ttacacaaat ccgtcggacg tcataatgct aactgaaaca tggttcacta attcatcaat 1380 tgtaaacata acaaatttca ctatttttag aaaagatcgt gattctaaag gtggtggtgt 1440 atgtatttat gtttcgaatc atcttaagtc attagaaata actgacaatc agctatgtag 1500 agagaatatt gaacagattt ggtgcaatat aaaaattgct tctgaatcga ttttagtggg 1560 atgcatttac agaccaaatg acgacgcaac aacaattaat gaaatactga aatcaattaa 1620 acgagcaaaa gaactaagaa ataaaggcct atttactagc atagtaattg cgggagattt 1680 taatttagcc ggcattgaat gggatagcga caatacaccc agaacttcca cacaatatga 1740 acaaacattt attgaaacgt tacacgacaa taatttaact caattagtca attttccaac 1800 atttattaaa aatcatgatg aatcagctaa aacaactttg gactacataa taatgaacca 1860 acctgaacgg cttgatcaac tactagactt tcctcccctc ggtgctactt gcagaggtag 1920 aggtcatcta gttcttacct ggcgctattt attaaataaa tcaacaacca ataattacga 1980 caataattta agaaaaaaat atgcattcga caaaggtaat tatgaattaa ttaaacaata 2040 ctttaataaa atagattggg agtcaaaatt taccaactca agtattgatg aagattatga 2100 atactttgtt aaagaatata tattagcttg taataaattc ataccaataa gaaataataa 2160 aaacaacaca aataatctcg aaccatggat aaacaaaaca gcaaaaattg ctcttaaaat 2220 caagcataaa ctctactacc gtgcaaaaaa taataattgg cgtaggacac aaaaaacgaa 2280 atcaaaatac aatagggcag ctaaacaggt aaaaaaagaa atatttaaat caacttgtgc 2340 ctttgaaaga cacttagctc atgagtcaaa aaagaatcct aaattaatct atgcatatat 2400 caataaacgt caacaagtaa aacgtgcaat tagatcacta tatgataaga taggtaacat 2460 cacaaacgat cagataaata ttgcgaatac gattaacaat caattccaat cagtttttgt 2520 caaagaacca gacgatgagc cagtttttcc aataatcacg gaacataaat gttctctgtt 2580 aactatacta caagatatca acgaacaaac tatcaaaatc agactcgaaa acgttgacac 2640 atccaaatca atcggtggtg ataacattca cccatttgtt ctcaaacatg catcagtggc 2700 tatcgcgata ccactaagcc ttatatttaa aaagagcctt gcaacacaaa cggtaccaaa 2760 aaactggaca gaagcaaatg ttactcctgt atttaaacaa ggtaaccgtt tattagctga 2820 aaactatcgt ccaatatctt taacatctat agtatgcaag atcttagaaa aaatagtaag 2880 agaagcttta actgcacact tagagcttca tggtctactt tcaaattctc aacatggatt 2940 taccactaag aaatcctgtg tatcaaactt gttagaaact cttaatacta taactaaggc 3000 cctagctcag ggtcacaatg tggacgtaat ctacttggac tacgcaaaag cgtttgatac 3060 tgtgccacat aaaagactgt tagcgaaact attatcttat ggaatcgaaa acgatcttgt 3120 taaatggata agggcctttc tctctaatcg tcgtcagaga gtagtactag gggaaacggt 3180 ctctgactgg tgtaatgtta caagcggagt accacagggc tcagttttag ggcctcaact 3240 attcaacatc tacattaacg acttaccacg tctattagac aacactagta aactatacgc 3300 ggacgacact aaaataattt caattgtaga cacgcatcag cagtcgttaa acttacaaaa 3360 cgatctcaat aaagttcaaa attggtccaa gacttggtta atgaagctaa atactaagaa 3420 atgcaaagtt atgcatttcg gtaagtttaa taacaatttt aaatatcaga tgcaagatac 3480 gtcatcaaat aataatatgc acactcttga aactaccaca gaagagagag atttaggtgt 3540 aatactaagc tcggatttaa aatggaagcc acacattgat aaaataacga ataaagcaaa 3600 cagcgtatta aaaatgctaa aaaatacttt cacttgtcgt gaaaaagaac tttggaaaaa 3660 actttacacc agtcttgtta gaccacatct agaatacgct tctactgcct ggaatccctc 3720 tttagccagt gacatcgata aaattgaaaa agttcaaaaa agagctacga aatggagcca 3780 acctaatact gcctacgcaa accgactagc taacttaggc ttaacgtcat tagctaaaag 3840 aagagaaaga ggcgactgta tcgaaacttt taaaataact aataagataa taaacatcga 3900 agacgcattt ttcaatatga aacaatcaac taatttgaga acatttaata atcatcaact 3960 ccagaaagac atatactcta gcagacaagc aaatgattat agtaaagctg tgaagtttag 4020 agataacttt ttatcaaaca gggtcacaca agtatggaat tatttgccta attcggtcat 4080 agaagcctcg ggagctacac ctaaggcggc aatcaataaa ttcaaaaaca attatgataa 4140 atttcaaaaa gataaaatta actcaattaa acaataacat tttattatta ttagattttt 4200 aatataaatt attaacgtag ctgctataac tctgcatttt ctagtgcatt gttgggtact 4260 tgtgccccta gtattgattt actgcagtat tattattatt attattatta ttattattat 4320 tattatt 4327 // ID Gypsy-243_AA-LTR repbase; DNA; INV; 215 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-243_AA_; KW Gypsy-243_AA-I; Gypsy-243_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-215 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1088-1088 (2011). XX DR [1] (Consensus) XX SQ Sequence 215 BP; 60 A; 28 C; 44 G; 83 T; 0 other; tgtaacgttt tgcgtttttc cgttaaaaac acgataaaaa agtttgtgaa cttttgacgt 60 ttttatcttt ttaagcaaaa tcatctgaat ttgaatttga ttgagctcag tgttacgtat 120 attgtcgata aggtttgtgt ctagactaag tgtactaaat tgacgtcacc cagtatggga 180 tatgtgtagt gatgtgcaaa cctttgtagt gttca 215 // ID Gypsy-66_AA-I repbase; DNA; INV; 4643 BP. XX AC supercont1.238; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-66_AA_; KW Gypsy-66_AA-LTR; Gypsy-66_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4643 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.238; Positions 1612705 1608063. XX CC Positions [3693-4154] - Integrase core CC 'ATACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1586..3082 FT /product="Gypsy-66_AA-I_1p" FT /translation="MLIDSGCQKNIIDDTTWRKLMQQGISVSNFRDSDETF FT RPYGMHSNPLVVCGMFESTISIMQNSRETQKDATFYVIEGGTQPLLGRTTA FT VEMGVLVLGLPSTQPVEVNQLWKGERHQFPKIPGIKLKISLDDKVVPVIQH FT ARRPPIALLNRVEEKLTSLEKMDIIEPVSEFSPWVSPLVVIVKDNGDLRLC FT VDMRQANRAIQRECYMMPTFEDFLPQLNEARFFTRLDIKEAFHQLELDESS FT RHITTFITHKGLFRYKRLMFGMSCAPELFQRVLEQILACCENVVNYLDDIL FT IFGSTLEEHNAALERVLSTLKEKNVLLNHKKCQFAVTEVEFVGHHLSSEGI FT RPLENKLSTLQSFRPPENVEELRSFLGLANFVGRFLPDLGTVTAPMRGLLR FT SNTRFEWGQHHQESFEALKSLISNVKTLSYFDNSLRTRLIVDASPFALGCV FT LVQFKSDHDDSEPRIISYASKCLSDSEKEVLSNRKRSPRYSVGCRAIFGIP FT DRT" FT CDS 2973..4607 FT /product="Gypsy-66_AA-I_2p" FT /translation="MPVSVCQIQKKRYCQTEKEALAIVWAVERFSVYLIGR FT DFELETDHKALETIFSPTSKPCARIERWVLRLQSFTFNVKYRKGSTNIADP FT FSRLVQNAVNEEFDGDNNFLVLNIVSSAAIDTNEIEDVSQNDHEFGLLRDC FT LQTDVWKHSEVKPYEVFKHELGFVGDVLVRGTKLVVPKQLRKRMMDLAHEG FT HPGETVMKRRLRERVWWPGIDKEVVQYVQRCEGCRLVSLPERPEPMQRRQL FT PSNPWIDVAIDYLGPMPSGEYLLVIIDYYSRYKEVEIMTRITAKDTIFRLD FT RIFTRLGYPVTITLDNARQFASDEFDQYCKARGITLNFTTPYWPQENGLVE FT RQNRSLLKRLQISHGLKRDWKADLQDYLMMYYTSPHSITGKTPTEMMCGRT FT IRSKIPSISDLQHCPRFSDVADRDRILKEKGRETADNARHSKQSNLETGDS FT VLMKNVLPGNKLTSNFAANEYTVVDKKGSRVTVEDSTSGKTYQRNSSHLKK FT VIKATEVPKEQQLTENYVRRHMDTMLPQTEQKQLRFYPGLPDTFRDFTM" XX SQ Sequence 4643 BP; 1432 A; 943 C; 1126 G; 1142 T; 0 other; aatctggcga cgagaataaa gtggaagcag ctaataaagg taagttgaaa aaacatgaaa 60 acataagtgt ttttctcgaa atataaccat tataaatgca acgcgttttt cctctgcatg 120 aaaacaaaat ggcgtaagaa tgtgaggtta acattgagct ctttttgtgt ttgttctcaa 180 ggaagagaag tatccggcca tcggcggaag ggtcttagag accgagttga gtgcaagcat 240 ctggccatcg gcagaagggt cctggagacc gagattaaca agagattccg gccactggcg 300 gaagggtccc ggagaccgac aaaaacaaga aaatccggcc actggcggaa gggtcccgga 360 gaccgacatt gacaagagaa tccggccact ggcggaaggg tcctggagac cgtgctcaat 420 tcaagtatct ggccattgac ggaaggatcg taactaaata tccggccttt ggcggaaggg 480 tctaggaaac cgcagcagca ttgcttgagg tctacgaaac cggtcagaag agaagcaaaa 540 gttagcgaca cacagcgagc ctacgaggcc gaacgcaatc agagatggat aagtggaata 600 ttcctcagtt caatttcaag gcgttgccaa agagtcagat tcgcgatgct tggaagcgtt 660 acagaaagaa tttcgagtat gtacagctgg caaatcggga agcgaataga gtcagactga 720 aacatatatt tttggctctt gctgggccgg aagtccagga ggtatatgag tcaattccgg 780 gagcagacgt ggaaccagcg gctggtattg acccattcca gactgcattg accaagttgg 840 atgagttttt tgctgcaaaa tatcatgacg catttgaacg aaacttgttc tggacgctaa 900 agcgtgaatc tgatgagcag ctggaaaagt ttctttttcg tgtgatggag acagccaaaa 960 gctgtaattt tggtgcaaca acccaagaaa gtcgagagat cggtattatc gacaagatga 1020 tcttattttc accgccagaa ctgaaagaga agctgcttgc tgaagaacgg ctcacgttgg 1080 aaagtctaac caagattgtg caatcgtatg gatcgttgaa gtatcaggta agccagtttt 1140 cggctgctgg gcttcataca agagatgtgc caagaccgga aacgagtcag gaagcgaaca 1200 ccggttggtg aatgttatcg ctgcggacga accggtcact atggctacga cgcgaactgt 1260 ccagccaaga accaaacatg taataaatgt gggaaaaaag gccattttgc gaaacgttgc 1320 aaaactccta cttcaactcg acagttcaag cggaaagctg aattttcaaa tgaaggaaga 1380 agttcgaaaa gggtaattat tttaaatttt caaaccttaa atttgattcc taatcttttg 1440 aacctgggtt taggtcagga ccatcaacca tgttgatgct ccgagcgcta aagaagatac 1500 ggctaagatt tctgaaagct tcatctacaa tataggaaat ggcgaagagt ttctctggat 1560 caggttaggc ggaattccag tccagatgct gattgattcc ggatgtcaaa agaacattat 1620 tgacgacacg acctggcgaa agctgatgca acaaggaata tctgtcagca acttccgtga 1680 ttccgatgaa acattccgtc cttacggtat gcattcaaac cctttagtgg tttgtggcat 1740 gttcgaaagc accatctcga ttatgcaaaa ttcccgtgaa acacaaaagg atgctacttt 1800 ctacgtaatc gagggaggaa cccaaccgtt actgggaagg actacagcag ttgaaatggg 1860 agtattggtc cttggccttc ctagtacaca accggtggaa gtaaatcaac tgtggaaagg 1920 agagagacac cagtttccga aaattccagg tataaaattg aaaatttctc tcgatgataa 1980 ggttgtacca gtcatacagc atgcgcgtcg gccgccaata gctttgttga accgagttga 2040 agaaaagttg acaagtctcg aaaaaatgga catcatcgaa ccggtgtctg aattcagtcc 2100 atgggtatcg ccactagtcg ttatagttaa agataatggc gatttacgat tgtgcgttga 2160 tatgcgtcaa gccaatcgag cgatccaacg agaatgctat atgatgccaa catttgaaga 2220 ttttcttcca cagttgaacg aagcgcgatt ttttacccgc ctagacatta aagaagcatt 2280 tcaccagttg gagcttgacg aatcgtcgcg gcatataaca acgttcatta cccataaagg 2340 gttgtttcgg tacaaacgtt taatgttcgg gatgtcttgc gctcccgagt tattccaacg 2400 agtactggaa cagatactag cttgttgcga aaacgttgtc aactatctcg atgacatcct 2460 tatttttggg agcactttag aagaacacaa tgcagcattg gaacgggtgt tgtcgacctt 2520 gaaagagaaa aacgtactgt taaatcacaa gaagtgtcag tttgcggtta cagaagtgga 2580 attcgttggg catcatctgt cgtccgaagg tattcggcca ctcgaaaata aattgagcac 2640 gttgcaatct tttcgaccac cagaaaatgt cgaagaactt aggagtttcc tcggactagc 2700 caattttgtg ggacgtttct taccagactt aggtacagtg acagctccta tgagaggcct 2760 acttcgttct aatacaaggt ttgaatgggg acaacaccat caggagtcgt tcgaagcatt 2820 gaagagtctc atctccaacg tcaaaacgtt aagctacttc gacaactctc tgaggacacg 2880 attgatcgta gatgcctctc catttgcgct tgggtgtgta ctagtccaat tcaaaagcga 2940 tcatgatgac tctgaacctc gaataatctc ctatgccagt aagtgtttgt cagattcaga 3000 aaaagaggta ttgtcaaacc gaaaaagaag ccctcgctat agtgtgggct gtagagcgat 3060 tttcggtata cctgatagga cgtgatttcg aactggagac cgaccataag gcacttgaaa 3120 ccatattttc cccaacgtcg aagccatgcg cccgtataga aaggtgggtt cttcgtcttc 3180 aatcgttcac cttcaatgtg aagtacagga aaggctccac aaacatcgca gatccattca 3240 gccgattagt acagaacgcc gtaaacgaag aatttgatgg tgataacaac tttctcgtac 3300 tcaatatcgt gagttcagcc gctattgata cgaatgagat tgaggacgtc tcacaaaacg 3360 atcacgaatt tggattgcta cgtgattgtt tgcagacaga tgtatggaaa cattcggagg 3420 taaaaccgta cgaggtgttc aaacatgagt tagggttcgt aggagatgtt ctcgtcaggg 3480 gtaccaaact cgtagtgcct aagcaattac gaaagagaat gatggatcta gcccacgaag 3540 gacaccctgg tgaaacagtg atgaaaagga gattgaggga acgtgtttgg tggccaggga 3600 tcgataagga ggttgtccaa tatgtccaaa ggtgtgaagg gtgtcgacta gttagtttgc 3660 ctgaaagacc agaacctatg cagcgacggc aattgccatc gaatccatgg atcgacgtgg 3720 caattgatta ccttggaccg atgccttcag gggaatatct gcttgtcatc atagattatt 3780 acagtagata taaagaagtc gagataatga cacgcatcac agcaaaggat actatattca 3840 gacttgatcg aattttcact cgtttaggct accctgtcac aataacactc gataacgcgc 3900 ggcaattcgc ttcggatgaa tttgatcaat actgcaaagc acgtggaatc acacttaatt 3960 tcacgacacc atattggccg caggaaaacg gattggtcga aagacaaaat cgttccttgc 4020 tgaagaggct acaaattagt catggattga aaagagattg gaaagccgat ctacaggatt 4080 atctaatgat gtattacaca tctccccact ccattacggg caaaactcct acggaaatga 4140 tgtgtgggcg taccattcga tcaaaaattc catctatttc agatcttcaa cattgtccac 4200 ggttctctga tgtagccgac agagatcgca ttctcaaaga gaaaggaagg gaaacggcgg 4260 ataatgcacg acactccaag caatccaatt tagagacagg ggattcagtc ttgatgaaaa 4320 atgtattgcc aggtaataag ctcacttcta actttgctgc caacgaatat accgtagtgg 4380 ataagaaagg ttcccgtgtg acagtggaag atagcacatc aggaaaaacg tatcaacgga 4440 attcgtcgca ccttaagaag gtgataaaag caactgaggt cccgaaagag caacagctga 4500 ccgaaaatta tgtgcgtcgt catatggata cgatgttgcc gcagacagaa cagaaacagc 4560 ttcgtttcta tccaggacta ccagatacct tcagagattt tactatgtaa gtttggacta 4620 ccgatctata agagaaaagg gga 4643 // ID Polinton-1_TC repbase; DNA; INV; 13486 BP. XX AC . XX DT 14-MAR-2006 (Rel. 11.02, Created) DT 14-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Maverick; Tlr; Polinton-1_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-13486 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This transposon is characterized by ~110-bp terminal inverted CC repeats and 6-bp target site duplications. The consensus sequence CC was built based on multiple alignment of several copies that are CC >90% identical to each other. It encodes a family B DNA CC polymerase (POLB-1_TC), retroviral integrase (INT-1_TC), ATPase CC (ATP-1_TC), cysteine protease (PRO-1_TC) and additional four CC unclassified proteins (PX-1_TC, PW-1_TC, PY-1_TC, and PZ-1_TC), CC conserved in Polintons from different species. XX FH Key Location/Qualifiers FT CDS 248..904 FT /product="ATP-1_TCp" FT /translation="MDLRFKSPFTAIVAGPTACGKTHFIIRFIKHLSLICD FT TEFERVLWFYDEWQPLYQNNDINNNLQRIEFRQGVPDINEFDGVKATLIIL FT DDLMREANGSVVDIFTKGSHHRNLSVFNITQNLFYQGKGQRDISLNANYIV FT YFKNPRDKAQINFLARQIFPENIKFVQESYKDATMRPHGYLLIDLKQNTPD FT DFRIRTNILPDEEPCYVYIPKKNYKYKSK" FT CDS 1036..1806 FT /product="PZ-1_TCp" FT /translation="MEHARKMIMIDPSELERLHNRKDTQPNTLNELDHEMK FT RIIDMKNIDDNEKWTLYNQVLQKYLKIISRSREPVCLPIIYPKKRNDQIKQ FT SERISLNILGSLPRTLQVKANTLLNIITHKGVARNETGNVIFDGDIIPGSN FT VFDLITDVMKSGGGSSKLSTEPVGWKKFAEILYKINVPRTLIVNPKRLKYI FT NEQSSERKELKEKVKVKQSSLSKFITEKEEEEEEEEEEENQITKKSLRTLS FT RKPRRLRVPWSPYKSN" FT CDS 2928..3800 FT /product="PX-1_TCp" FT /translation="MNYFPENTTTRFCTQLPKTVQLDGEWCVGLSEIQYPC FT SFLTINNGENIIYCSFQFNVSAIIQKGEKAIDIMFKDPRCFTWIDEQIRSR FT KYKSSKEFMYDFKMRKIKFESDVIITKYKTRIEAGNYETIEAVLDALNSLR FT PLVQDNVEFRLNDGSKKITVTSSSKYLTSISFSPMLSLQLGFEPDTNVLNK FT TSTHPANIILGLPSQLFVYSDIIEPQLLGEIMAKVIRIVVIDSKYYIYGTH FT HTQLFSHPHYVPVLKREFENIEIDIRSSTGEKVPFQFETLCVKLHFKKIA" FT CDS 3803..4234 FT /product="PW-1_TCp" FT /translation="MHSCHYHNYYVNQAGSGGVGAIYKGSVYQKGHGFGSF FT LRGLFRTVVPLLKSGIKTIGKETLRTGSNFLGDVVNEVPVKEAFKTRVNES FT VQNLKHKAQDKLDNLVGSGVIKRRRIQSKNHSRFSAQRVKNCNRKSKSVPY FT KDIFG" FT CDS 4239..5549 FT /product="PY-1_TCp" FT /translation="MSFLHPCSCECAKSELDLFALPPTQTSIESGQWVHYK FT TVSSISENSPLEFVVPGGEDYIDLSQTLLSLCIKISKEDGSNYVAEDNIAP FT VNNILHSMFSQVDVYLNQKLISPPNNTYPYKCYLETLLNYDSGAKNSHLTC FT GLWYTDTAGKMNVIGNENIGYAERFKHTSLSKEIDLIGHLNCDIFNQEKFL FT INGVEIRLKFARSRDSFALMSSNNLNGKIQITDATLMVRRNRINPSVLLAH FT AKALELSTAKYPITRTELKVLTIPQGVQGKSLDNVYLGQLPKRCAVCFVTN FT KAFNGDYTMNPFNFENFGLNYLSLYVDGNQIPSKPLQPVFVGGNRKFVSMY FT HTLFSGTGIHYLNTGNGISRDNYADGYSIAVFDLTPDLSSHNGFSWNLIKN FT GSLRIEVGFTRALTETVNCLVYGEFDSVLEIDKKRNVIVDYSN" FT CDS 1810..2877 FT /product="INT-1_TCp" FT /translation="TKLNEIKERYFIPRDPLSFSSADKLVKFTNIPKKEII FT KWLEGVESYTLHKQVRRRFPRNSYHVTNIGDLFQADLVDMRNIAKHNRGVN FT YLLTVIDVFSKFAWVKPLYTKTGKEVAAAFEEIFDERVPVNLQTDKGKAFL FT AKNVQKLFKQNDINYYVTNNPDVKAAVVERFNKTLKTKMYKYFTHANSMHY FT IDVLSDFIHSYNNSYHRTIKMTPNEVNPDNVLQVYNNIMSARKKDIKRIVP FT ENVYKVGDHVRITKYKHVFEKGYMSNWSGEIFKITDVIMRQPVVYKISDLM FT GEAIEGVFYHPELQRVTYNPESRFHIEKVLKKRYKRGKCEFLVQWKNYPSK FT FNSWIPCSSLSAI" FT CDS 6102..6551 FT /product="PRO-1_TCp" FT /translation="MNTLQLKRCLEMISRKKKHHVLVSAYNTLPVLITRPS FT MVIINRDPDYMPGSHWMAVYIDKFGFAYFFDSFGNQPPRNIQKFLKRNALI FT WSYNSSQIQDISSTVCGKYCTLFLLNYVLGNSVDDFLKLFNNNFYNNDKLC FT NKMFTEYFMNK" FT CDS 8242..11856 FT /product="POLB-1_TCp" FT /translation="MNNESFPQKPLGISFRKFSEISSDVILNVIGAVLQSN FT STFFSNDQLEIRVDRVRLPTGRGFSALDGKCVSFSEFAVAKNSIYVIENIN FT NRCLAYALVVGKEYCDNNFNKNSLKKFSGLKGKIILDKIALKLCQDANVDL FT DSNSGNYTHIQNFQNFLYDYVIVVYNSRDGRSVYFEGAKSHEKKKINLILE FT NNHYNVILSLTGAFSTSYYCDLCHIRYSRPDRHKNCPYICPCCHSQPPCKT FT DSALIKCNDCQRNFRGDICFSNHIRSNVCKKIMKCSKCDKHIWKNKIGKKD FT EHICGVSYCNVCQQNKPIGHLCYMQSNKINKNKFDADNKLALFIFYDFETT FT QDKKFDDKNSTVHEVNLCVMQQACSNCSKFEQLDENETCDNCGLRQNIFFD FT KPVASLLEYIAEKSKIFNVYAIAHNMKGFDGCFILQYVFQNVSRWKPEVIR FT TGTKLITIRCGKSITFLDSLNFIQLPLSKFPEAFKFEESKGFFPHFFNTND FT NFNYIGPIPDKHYYGCDTMKTKERADFLTWHREQLANNYVFNLKDEILKYC FT ILDVNILRKGCLKFRECFLKINNNVDPFLESLTIASACNLVFRRNFLKSET FT IGIIPKNGYRCSDNQSKIAITWLSWLMQTENINISHAGNGREVRLNGSLLV FT DGFCQETNTVFEFYGCWWHGCEKCFFNQTSNLNNKHDALFLRRENTFAIEE FT KIRKLGCNIRTIWECTFRNDIKDNPDLNRFVNEHNLESFTPLNPRDCFFGG FT RTNVCKLYYKCKDNEKIKYYDVCSLYPFINKYGKYPIGHPKKIYVGNDCLN FT IDITTFEGVIKCIVMPPQHLYHPVLPYRYKGKLTFPLCKTCVEISNQYDCG FT HKPEERQFTGTYVADELRKAVELGYVITLLLEAWEYNVVQYNKETNTKGLF FT TDYVNRFLKMKQEYSGWLSWCDSEDKKQLYLTQYFEKEGIVLDAENISENS FT GMRFISKIMLNSFWGKFGQRENPQKTEIIDDARTLFDLLTNHSRITHNLTI FT VNNDVLLANWDSVCEDIIPLKTVNVIIAAYTTAGARLELYKYLEKLDRRVL FT YFDTDSVIFTQKEDEWQPEIGNFLGDLTD*IGTNKNGVDSYISEFVSGGPK FT NYAYRYWVTKERLFKTVCKVKGITLHYKNEKLLN*EKIKEFILNPELGSEL FT ILYDRVIARTQRYEVISKEAKKTYRVNISKRRRLNDELYDTLPFGY" XX SQ Sequence 13486 BP; 4765 A; 2091 C; 2295 G; 4335 T; 0 other; agtagtaaat gagtgtaatt agtgatgttt taatgacttt ttaattactc ccttaatcac 60 cagcgcttaa actttaatga ttttttttaa ttacccctta atcaccagcg ctttcacttt 120 aatgacatgg gaaaaacaca cacacaaata ttgttctata aatagtacac cattcagcaa 180 aagttttcat ttttgttttg gtgttaaata atgtaagacg ttagatattt gaataaaagc 240 aagaaggatg gacttacgat tcaaatcgcc atttacagct attgttgctg gtccaactgc 300 atgtggaaaa acgcacttta taatacgatt tataaaacat ttgagtttga tctgtgatac 360 ggagtttgag cgagtgcttt ggttttatga cgaatggcag ccgttgtatc aaaataatga 420 catcaataat aatctacaaa gaattgaatt tcgtcaaggc gttcctgata taaatgagtt 480 tgatggtgta aaagcgacat taataatttt ggatgatctg atgagagagg ctaatggtag 540 tgttgtggat atttttacaa aaggtagtca ccatcgaaac ctcagtgttt tcaacattac 600 gcaaaatttg ttttaccaag ggaaaggtca gagagatatt tcgttgaatg caaattatat 660 cgtttatttt aaaaatcctc gtgataaagc tcaaattaat tttttggcta ggcagatttt 720 tccggaaaat ataaagtttg tgcaagaatc ttacaaggat gccaccatga gacctcacgg 780 ttatttgttg attgatttaa aacaaaatac tccagacgat ttccgtattc gtacaaatat 840 tctaccagat gaagaacctt gttatgtgta catccctaag aaaaattata aatataaaag 900 caagtaactt attcgtcagt agtatatgag atgtcgcgga agttagaaag aaatgccgag 960 ttactaaaat ttttgcataa aacaacacca gtatatcgga gtgttgggtc atttatttaa 1020 taaataataa tcgtaatgga acatgcaaga aaaatgataa tgattgatcc atcagagctg 1080 gaacgtttac ataaccgcaa agacactcaa ccaaacacct taaatgaatt agatcatgaa 1140 atgaaaagaa tcatagatat gaaaaacatt gatgataatg aaaaatggac tttgtataat 1200 caagtgcttc aaaaatattt aaaaattata agccgttcgc gggaacccgt ttgtctgccg 1260 ataatttatc ctaaaaagag aaacgatcaa ataaaacaat ctgaacgaat tagtctaaac 1320 attttgggtt ccttaccaag aacattacaa gttaaagcga atactttact aaacattata 1380 actcacaagg gtgttgcaag gaatgaaacc ggtaacgtta ttttcgatgg agacataatt 1440 cctggttcca atgtatttga tctaatcact gatgttatga aaagtggtgg tggtagtagt 1500 aagctgagta ccgaacctgt cggatggaaa aaatttgcag aaattttgta taaaataaac 1560 gttccacgta cacttatagt taatccaaaa cgtttgaaat atataaatga gcaaagttct 1620 gaaaggaaag agttaaaaga gaaagtaaaa gtaaaacagt cgtcgttgtc aaagttcata 1680 acagaaaaag aagaagaaga agaagaagaa gaagaagagg aaaatcaaat aacgaagaaa 1740 tcattgcgaa cattatcaag aaaaccaaga cgcttgagag ttccatggtc gccgtacaaa 1800 agcaactaga cgaaactgaa tgaaataaag gaaagatatt ttatacctcg tgaccctctt 1860 agtttttcga gtgctgataa attagttaag tttacaaata tacccaagaa agaaattatt 1920 aaatggttag aaggtgtgga atcttataca ttacataagc aagttcgtcg tcgatttcct 1980 agaaatagtt atcacgtaac aaacataggc gatctatttc aagctgattt agtagatatg 2040 agaaacatcg caaaacataa tcgaggtgta aattaccttt taactgtcat tgatgttttt 2100 tccaagtttg catgggtcaa acccttatac acaaagactg gtaaggaagt ggctgcagct 2160 tttgaagaaa tatttgatga gcgtgttccg gttaatcttc aaactgataa gggaaaagca 2220 tttttagcga aaaatgtcca aaaattgttt aaacaaaatg atataaatta ttatgtaaca 2280 aataatcctg atgtaaaagc tgctgtggta gaaaggttta ataaaacatt aaaaacgaaa 2340 atgtataaat atttcacaca cgcaaatagt atgcattata tagacgtgtt atctgatttt 2400 atacactctt acaataattc ttaccatcgt actattaaaa tgactcctaa tgaagtaaat 2460 cctgataacg ttttacaagt atacaataat atcatgagtg caagaaagaa ggatattaaa 2520 agaattgtac ccgaaaatgt gtacaaagtt ggagatcacg tcagaatcac taagtacaaa 2580 catgtatttg aaaaaggtta catgagtaat tggagtggtg aaatattcaa aataactgac 2640 gttataatga gacagcccgt tgtgtacaaa atttccgact tgatgggtga agcaatagag 2700 ggcgtcttct accacccgga attacagcgt gtaacgtaca accccgaaag tagattccat 2760 attgaaaaag tgttgaaaaa gcgatataaa agagggaaat gtgaatttct agtacagtgg 2820 aaaaattatc cttcaaagtt taactcgtgg attccgtgca gttctctaag tgcaatatga 2880 ataattaata caatgatttt tatttgacat tagtatcaaa tagttcaatg aattactttc 2940 cggaaaacac gacaacgcgt ttttgcacac agcttccaaa aactgttcaa ttggatggtg 3000 aatggtgtgt tggactttcg gaaatacagt atccatgttc attcttaact atcaacaatg 3060 gtgaaaatat catttactgc tcttttcaat ttaacgtgtc ggcgattata caaaaaggtg 3120 aaaaagccat tgacataatg tttaaagatc cacgttgttt tacatggatc gatgaacaga 3180 ttcgcagtag aaaatacaag tcttcaaaag aattcatgta cgattttaaa atgcgaaaga 3240 ttaagtttga aagcgatgta attatcacga aatataagac cagaattgaa gctggtaact 3300 acgaaacaat cgaagctgtt ttggatgctt taaattcgtt aagaccttta gtacaagata 3360 atgttgaatt tagattaaat gacggttcaa aaaaaataac tgtaacaagt agtagcaaat 3420 atctgactag catatcattt tctccgatgt taagtctcca actgggtttt gaaccagata 3480 caaatgtact gaacaaaact tcaacacatc ccgcaaacat tatattgggt ttaccttcgc 3540 aactttttgt gtacagtgat ataatcgaac ctcaactgtt gggtgaaata atggctaaag 3600 tgattcgtat tgttgttatc gacagcaaat attacattta cggtacacat catacacaat 3660 tattttcaca tccacactac gtgccggtgt taaaaagaga atttgaaaac atcgaaattg 3720 atataagaag ttcaacaggt gaaaaagtac catttcagtt tgaaactttg tgtgtaaaac 3780 tacattttaa aaaaattgca taatgcattc ctgtcattat cacaattatt atgttaatca 3840 agctggttcg ggcggtgttg gagcaattta taaaggctct gtttatcaaa aaggacacgg 3900 ttttggtagt tttttaaggg gcttatttcg gacagttgta ccgttgctga aaagtggtat 3960 taaaacaatt ggtaaagaaa cactacgtac tggttcgaat tttcttggtg atgttgttaa 4020 cgaagtgcct gtaaaagagg catttaaaac gcgtgtaaat gaatcagttc aaaatttgaa 4080 acataaagct caagataaac tagataattt agttgggtca ggagttataa aaagaagacg 4140 aattcaaagt aaaaatcata gtcggttctc tgctcagcgt gtaaagaact gtaatcgtaa 4200 aagtaaaagt gttccatata aagatatttt cggttaaaat gtcatttctg catccctgta 4260 gttgtgagtg cgcaaaaagc gagctcgatc tttttgcatt accgccaact caaacaagca 4320 ttgaaagtgg ccaatgggtt cattataaaa ccgtatctag tatatctgaa aatagtcccc 4380 tcgaatttgt tgtacctggt ggtgaagact acattgattt atcgcaaact ctactttcac 4440 tttgtattaa aatatccaaa gaagatggta gtaattatgt tgctgaagat aacattgctc 4500 cagtaaataa tattcttcat tcgatgttta gtcaagttga tgtttatctc aatcaaaaac 4560 tcatatcacc accaaacaac acatatccat acaagtgtta tttggaaaca cttttaaatt 4620 acgattctgg agcaaaaaat tcacacttaa catgtggatt atggtatact gatacagctg 4680 gtaaaatgaa tgttattgga aatgagaata tcggttacgc tgaaaggttt aaacatacat 4740 ctctaagcaa ggagatcgat ctcattggtc acttaaactg tgacatattt aaccaagaaa 4800 aatttttaat caacggcgtt gaaataagat taaaatttgc acgttctcgt gatagttttg 4860 cattaatgtc atcaaataat cttaatggaa aaattcaaat taccgatgca actctgatgg 4920 ttagaagaaa taggataaat ccaagcgtat tacttgcaca tgcaaaggct ttagagcttt 4980 caactgcaaa atatccaatt acacgaacgg aattaaaagt tttaactata ccacaaggtg 5040 tacaaggaaa gtcattggat aatgtgtatt tgggtcagct tccaaaaaga tgtgcagtgt 5100 gctttgtgac aaataaggca tttaacggtg attacacaat gaaccctttc aattttgaaa 5160 actttggatt gaattattta tctctgtacg ttgatggtaa tcaaattcca agtaaacctt 5220 tacaacccgt ctttgttggt gggaatagga aattcgtttc aatgtaccac acgttgtttt 5280 ctggtactgg tatacactat ctaaataccg gaaatggaat ttcgcgcgat aattacgccg 5340 atggatattc gatagctgtt ttcgatttga caccagacct atcgtcgcat aatggttttt 5400 cgtggaattt gataaaaaac ggaagtttac gtattgaagt aggatttact agagctctta 5460 cagaaactgt caattgttta gtgtatggtg aatttgatag tgtattagaa attgataaga 5520 agcgtaatgt aattgtagat tatagtaatt aatactcatg tatttagtat aaataaataa 5580 aaatgtttta aagtaatatt tgtcagtttc atttaacatt tcatttgata tggttgtatt 5640 tgtggatttt caagggttta attacggtaa ttgttccgat actctcacga taaaagaatt 5700 agctgtattt aatacgaata aaagtgagcc gaagtccttt ttgttcaaac cgcccgaaga 5760 tctttcaact ttaccacatc gctatcagaa gcaaaccgat tggttaacgc ataacttcca 5820 cggactttca tggaattgtg gtatttacga gtacgaaaat ttgcctgaaa tattgcatga 5880 aacaacgaaa aatgcaaaat gtatttacgt gaaaggaaca gaaaagagac gcctattact 5940 gaaacgttta ccaaaaactc aaattgtaaa cattgaagat ttagagtgtc catcgcttcg 6000 ttacttacga gaacattttg acacgacttg ttgtgtgaac cacataatta aaaacgctgt 6060 atgtgctttt gaaaacgtac aaaacttggc tacttggtac aatgaatact ttacaactca 6120 aacgttgtct ggagatgatt tctcgaaaga aaaaacatca tgttcttgtt tctgcgtata 6180 acactttacc tgtcctaatc actagaccct ctatggtgat tattaacaga gatccagatt 6240 acatgcctgg tagtcattgg atggctgtat acattgataa atttggtttc gcatactttt 6300 ttgatagttt tggaaatcaa ccgccccgaa atattcaaaa gtttcttaaa aggaatgcat 6360 tgatatggtc ttacaatagt tctcaaattc aagatatttc ttcaactgtc tgcggcaaat 6420 actgtacttt atttctttta aactatgtat tgggcaattc tgttgatgat tttctaaagt 6480 tatttaacaa caatttttat aataatgaca aactatgtaa taaaatgttt accgaatatt 6540 tcatgaataa ataaagtctt tgcaagatcc atgttttttt ttctttcttt cctcatatct 6600 cacaattcac tcaccagcac ggcgtgggcg gcgcgggtga cggcgcgggc gcgggtgcgg 6660 ggcggcgcgg aacgaaacac gggcgcgaat ccggacgcgg gagcagacgg cgcgtagccg 6720 agcacgggcg gcgacaagac ataatgagag tggttggcga catcaaaaat ctcacaaagt 6780 atatctttac cttaccattc aataggctca gactaatccg gtggggtccg tctgcagact 6840 atcaccgtag gagttctaga attatgtccc attttgtagg tctatgttta ggaaacgaaa 6900 gaattaagga ttactttgat cagcatttgg aaaccgctcc cgctgatgag acgagacaca 6960 tactttcaaa acaaatggta atatacataa tgtattctag gaagattttc tgttactcat 7020 gtctcgtctt tacaaatatt tttgccctaa gggtaatttt cacctctccc acttaaggta 7080 gttcgattta tcagtcagca tataaagggt gtagaaggaa taaaattttt acttgtcagt 7140 ttcttcttaa gtgcgagtac tacacctaat aaatcatgta taattttgat aactttttga 7200 ataatcaaag tgttagaaac ttattgtttc tttatgttgg tggttttttt aattatgatt 7260 taaatttatt tcatcatgtg ttcaatcata taaattatac aattatgttt tcgggaattt 7320 gtttaacaag tgaagaaatt cttatttaca ataagatgtc tcaacatatt aataatacca 7380 aaaaaagtaa gtgtattatt tttgtaaaat tttatcaata gtctaacttt tttttttcgt 7440 ttaattagaa gactctttat gcaataattt gttcaatcag tatgtggaca attttgatat 7500 aaacttattt tctacttctc aaatatcgaa atctaatggt ggtactacta ctactagttc 7560 tactactaat aaccaattaa accaagttaa gagcgcaaaa ggactaccaa acgagatttt 7620 tataaccaaa aaaggcattt tcgtatataa aaataataat aacagtggaa attttaaaaa 7680 cactacccca actaataccg aattgtgcgg tgagtgcatg aacacagcgg gtccttcgac 7740 ttatacacca gtctgtaagt ttctcttatt tgattttttt ttccataata caatttttta 7800 cattattagg tactcacaag aaatccgttg ttcaacatca gacaggttat ggaaattttg 7860 atacaataaa ttttagtaaa ggtaatatat tttagaactt tgttctgtaa gttataataa 7920 cctttttttt attatttaga tcaaagtaga caaaaaccta tcaccagtga ctcatataag 7980 catattataa ttggaggttt cacggaaaga tttattaaaa aatttaatct caactcaaaa 8040 tgtgtgacat ttactctaaa ggaattaact gaagaatatt tctcaaaccc tttgcaatgg 8100 ttgaaagtaa tactttttct atttgtttat ttttccgtag aaaattaata cttttttttt 8160 ttatatatat agaaagcaat tgatgaacta ctaaattacc tgggcaacga ttttcaacct 8220 gaagatatta tcggcataac catgaataat gaaagttttc cacaaaaacc attgggaata 8280 tcatttagaa aattttccga aatatcatct gacgtaatct taaacgtcat aggagcagta 8340 cttcaaagca acagcacttt tttctccaac gatcagctgg aaattcgagt agacagagtg 8400 cgtttaccga caggccgtgg cttttcagcg ttggatggaa aatgtgtaag tttctcagaa 8460 tttgctgtag caaaaaatag tatatatgtc attgaaaata taaataatcg atgtttagcg 8520 tacgcattag tagttggtaa agagtattgt gataacaatt ttaacaaaaa ctctttaaaa 8580 aaattctcgg gtttaaaagg taaaattatt ttagataaaa tagctttgaa attgtgtcag 8640 gatgctaatg ttgatttaga ctctaatagt ggtaattata cacatataca aaattttcaa 8700 aattttttat atgactatgt tattgtagtt tataatagcc gagacggtcg ttcagtgtat 8760 tttgaggggg cgaaatccca tgaaaaaaag aaaattaatt taattttgga aaataaccat 8820 tataatgtca tattaagttt aacaggagca ttttcaacat catattactg cgatttatgt 8880 cacattcgct attccagacc agatagacat aagaattgtc cctatatttg cccgtgctgt 8940 cattcacaac ctccatgtaa aaccgatagt gctttaataa aatgcaatga ttgtcaacgt 9000 aattttcgag gcgatatttg ctttagtaac cacattagaa gcaatgtttg taaaaaaatt 9060 atgaaatgtt ctaaatgtga taaacatatt tggaaaaata aaattggaaa aaaagatgaa 9120 catatttgtg gtgtaagtta ttgtaatgtt tgtcagcaaa ataaaccaat aggacattta 9180 tgctatatgc aatcaaacaa aataaataaa aacaaatttg atgccgacaa taaattagct 9240 ttatttattt tctatgattt tgaaaccaca caagataaaa agtttgatga taaaaacagc 9300 acagttcatg aagtcaattt gtgtgtaatg cagcaagcct gtagcaactg ttcaaaattt 9360 gaacaattag atgaaaacga aacatgtgat aactgtggat taagacaaaa tattttcttt 9420 gacaaacctg ttgcatcact tttggaatac attgctgaaa aatcaaaaat atttaacgta 9480 tatgcaatag ctcataatat gaaagggttt gatggctgtt tcatattaca atatgttttt 9540 caaaatgttt cacgatggaa accagaagtg attcggaccg ggactaaact tattacaata 9600 agatgtggta aatcaattac atttttagat agtcttaatt ttatacaatt accattatct 9660 aaatttccgg aagcttttaa atttgaagaa agcaaaggct tttttccgca tttttttaac 9720 actaacgaca atttcaatta tattggcccc attcctgaca aacattacta tggctgcgac 9780 accatgaaaa ccaaagaacg agccgatttt ttaacatggc atagagaaca acttgcaaat 9840 aattatgttt ttaatttaaa agacgaaatt ttaaaatatt gtattttaga tgtaaatatt 9900 ctccgtaaag ggtgtttaaa atttagagag tgttttttaa aaattaataa taatgttgat 9960 ccttttttgg aatctttaac tattgcttca gcttgcaact tggtatttcg ccgaaatttt 10020 ctcaagtctg aaactattgg tattatacct aaaaatggtt accgttgtag cgacaatcag 10080 tcgaaaatag ctattacttg gttatcatgg ttaatgcaaa ccgaaaatat taatataagt 10140 catgctggta atggtcgaga ggtccggttg aatgggtcat tgttagttga cggattttgt 10200 caagaaacta atacggtgtt tgagttttac gggtgttggt ggcatggttg tgaaaaatgt 10260 tttttcaacc aaacatctaa tttaaacaat aaacatgatg cactttttct aagacgcgaa 10320 aacaccttcg cgatagaaga aaaaattcgt aaattaggat gcaatataag aactatttgg 10380 gagtgtactt tccgtaatga tataaaagat aatccggatt taaatcgttt tgtgaatgaa 10440 cacaatttag agtcgtttac tccgttgaac cctagagatt gtttttttgg aggtcgcaca 10500 aatgtgtgca agttatatta caaatgcaag gataacgaaa aaatcaagta ttacgatgtt 10560 tgttccttgt atccgtttat aaacaaatac ggaaagtacc caataggtca ccccaaaaaa 10620 atctatgttg ggaatgattg tttgaatatt gatataacaa cctttgaagg agttattaaa 10680 tgtatagtaa tgcctccaca acatttatac catccagttt taccttaccg ttacaaaggt 10740 aaattaacct ttccattatg caaaacatgc gtcgaaattt ctaatcaata cgattgtggt 10800 cataaaccag aagaacgtca attcaccggt acttatgttg ctgacgaatt aagaaaagct 10860 gtcgaactcg gttatgttat aacgctatta ttagaagcat gggaatacaa tgttgtgcaa 10920 tataacaaag agacaaacac caagggcctt tttacagatt atgtaaatag atttttaaaa 10980 atgaaacaag aatattctgg ctggctgtca tggtgcgatt cagaagataa aaaacaatta 11040 tatctaacac aatatttcga aaaagaaggg atcgtgttag acgctgaaaa tatcagcgaa 11100 aatagcggaa tgcggttcat ctctaaaata atgttgaata gtttttgggg aaaattcggg 11160 caaagagaaa accctcaaaa aacagaaatt attgatgatg ctcgaacact ttttgattta 11220 ttaactaacc actcgcgaat tacgcataat ttaacaattg taaataatga tgttttatta 11280 gccaattggg acagtgtctg tgaagatata atccctttaa aaacagtcaa tgtaattatt 11340 gccgcttata ctacagccgg tgcgcgatta gaactttaca aatacctgga gaaattagat 11400 cgacgtgttc tgtacttcga tacagacagc gtcattttca cgcaaaaaga ggatgaatgg 11460 cagccagaaa ttgggaactt tcttggtgat ttaactgatt aaatcggtac aaacaaaaac 11520 ggtgtagaca gctatatctc cgaatttgtg agcgggggac cgaaaaacta cgcatacaga 11580 tattgggtta caaaagaacg gttatttaaa actgtttgta aagtaaaagg tataacctta 11640 cattataaga atgaaaaatt actcaattaa gaaaaaatta aagaatttat cctcaatcca 11700 gagttagggt ctgaattaat attatatgat cgagttatcg cccgaacaca gcgttatgag 11760 gttatatcca aagaggcaaa gaaaacttac cgcgtcaaca tttctaagcg tcgtcgtctg 11820 aacgatgaat tatatgacac tttgcctttt gggtacaaat cataatgcac tcttttctgt 11880 ttgtttctgt ttgtaaatta ttattgtaat atttgtaaat aaatgctgtt taattgttat 11940 tattattagt attataatga tgattattat tattattatt aatattgtca tgtttacata 12000 atatattatt ttgtaataaa aaaaataaca cagtcaaaca caagttttat ttatttattc 12060 atttatttaa attaaaacca tttacactaa taaagatgga gcattataat cattaaactc 12120 tctatagtat gctagcctct ttcgtacatg atattttata gaatgaggta acttaaagaa 12180 aatttcgtac aaaacagaac ttgattcgaa acgttcaaca ataagtcttt tgacagtatt 12240 ttcaaattct ttctcttcac ctccaccatt ttgcaatata tctaatgcga cagtgtcaat 12300 gaagtgctcg taattattat aatatgattg cttgatgatg cgctgatcaa gtaaattgaa 12360 aaaaaaatca ctaacgctta tcacattttg cactgtttca acacttaaat aacttggctt 12420 atacacgtct tcaaattttg gaataaagcc aaggattttg ttcttgcaca ataccactct 12480 gtaattttta agttcacaca taggaatttc tttatcgaca tcggttataa cgtatgcata 12540 atcacgcatc ttctttacaa attcctcaaa ttcacttttt ataaacgatg tagcagccgt 12600 cagttgagga ttgctaattt taataactgg aagataatct tcaaatggtt ttacaccaac 12660 acttgaaaat ttgcttcctg attttttgtt atcggtatca cagtctcatc caaaagtcga 12720 tatttttcat tttttatttt actcaattct tctgccttca tattctcgtc catttcatcc 12780 gcaaacttgc tcatagcctt ctccaattcc acatcttcag gtaactctat caaaagctca 12840 gaagaagaag aaggtgaatc ttgagacaaa accattactt gcgaacaggg aatttcacat 12900 ttatactcgt catccaactt cggattttta aacgcatttg gatattcagt ttcttgcgac 12960 ataaactcca cttcaatatt cgacttccca cgctttcgaa cttgacggag cattcttctt 13020 tgattcattt ttctatatat ttaaaaaaaa agtgctatta tacacattaa tattttaaat 13080 aaaaataatg tcaacttaca atttatcaac ctgggtaata cggacagatt tttttccccc 13140 tccaaaaata tcaacaactt cttcacagga caaaaacacg aacactcaca aacacaagac 13200 ctggtggaaa gcttcgtcaa agtggtaaca cacaactaaa ttaacaattt tcaaatttgt 13260 ttttctacta tatactcaca cacactaaac taatttacag cgctggtaat tagactacac 13320 taaaatccaa ttaaaatatc tttaaaatca ctcaaaagta caacgagtaa ttaaaaagcg 13380 ctggtgatta agaggtcatt aacatgtcat taaagtgaaa gcgctggtga ttaagagggt 13440 aattaaaaag tcattaaaac atcactaatt acactcattt actact 13486 // ID Outcast-1_BF repbase; DNA; INV; 5917 BP. XX AC . XX DT 21-JUL-2009 (Rel. 14.07, Created) DT 21-JUL-2009 (Rel. 14.07, Last updated, Version 3) XX DE Amphioxus Outcast-1_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; I; KW I-1x_BF; I-1_BF; Outcast-1_BF. XX NM L1A_Mim; LTR6_MD; LTR86_MD. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5917 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5917 RA Kapitonov V.V. and Jurka J.; RT "Young families of I non-LTR retrotransposons from the amphioxus RT genome."; RL Repbase Reports 9(5), 1139-1139 (2009). XX RN [3] RP 1-5917 RA Kapitonov V.V. and Jurka J.; RT "Outcast non-LTR retrotransposons in the amphioxus genome."; RL Direct Submission to Repbase Update (21-JUL-2009). XX DR [2] (Consensus) XX CC Outcast-1_BF is a consensus sequence of the young Outcast-1_BF CC family of non-LTR retrotransposons that belong to the Outcaste CC clade (I group) [3]. Originally, this family was reported as CC I-1_BF, a member of the I clade [1-2]. The Outcast-1_BF consensus CC sequence has two ORFs. ORF1 codes for a protein that contains the CC PHD and Zinc knucle domains at its N and C termini. ORF2 codes CC for a proteins composed of the apurinic endonuclease, reverse CC transcriptase and ribonuclease H. XX FH Key Location/Qualifiers FT CDS 479..1741 FT /product="Outcast-1_BF_1p" FT /note="ORF1 protein." FT /translation="MYTRAQKKKQENEAKPSIDPTNLANSDQERHFTQRKP FT GDHRTLRDTSKKQTVNQDQTADKHKPTQRSNTVLRKSTTLTKKVEKLPPVE FT EVETDEEDCVCGCDHPEGGRWICCDSCDKWWHNTCAKLSYKVCDFLTKNKE FT PYHCALCITQELNDNHSRPIEIPRKEENTQDTAKDSHIQDLSNHIVLIDGI FT PQPEHYKNSGKIIAEIARNKPHCAHNIDLAYLLPRGGIAVHCKNSKATEDL FT LQPWQDGAFNASGDQLSSHKANQSYGKRAILKNITPEATEEEIEQTILKQT FT DIQVKAHRYHYQDTGKPLRVVRVDATLGQLNDLFLKTLTIRGDTITVEPYR FT SKKTTPIRCYNCHKLGHIARLCKEEPSCVRCGGPKAHPNPCKPKCVNCSGT FT HSANDPRCREFQTIKQKLEERHLRHHR" FT CDS 1752..5756 FT /product="Outcast-1_BF_2p" FT /translation="MKKYIFLATLASLILATHDYTSEATSSTSLPTSSREK FT TSTNDNMKIAQVNIRSINTSSSLVEHMCKKQNIDVLCLSEVWNKTDRPQCL FT KTWNWIFKSRPTKRGGGVAIATKEHVKMMEIGLDDTTHADIAAVNIYSDTA FT NFTLISAYVPPDNKEGMDELTHAVKSLLSTKRPLILCGDLNARHPAWGDEK FT DNQLGHQLNNFLLTNNLHPINDFRPTRENSVIDLTITNSQAMKLVERWSVQ FT PEVQLKTDHHLITMQVGTKTPVEKVERYDLRNVDWEKWGETTHEQFQIWLE FT DNDKKTVPADIKVSDMYDSFKNTLNKCTEGIINKKTITTHSKGHWTPALEE FT QMKVTKTALRKFRRRRDTHNLNKYLKEKETLAQMDEDAQSKHWEAQLKSMD FT PKKPQSFWKTIKTHLNNNAKSVVQPLNLPNGDVATSDQEIAQVATDTFAPS FT EVEMNADLSHWKHNVSHAVETSIKHQTEAVKEDDDQEDPLNYDLELEEVEA FT AIGKMNCNSAPSPIEGILPVMIKKGGKALTEALHFLLQCIWRHGELPTQMK FT QDTKVLIRKPGKENYNLIKSYRPITLSSVIGKIMERVINNRLTWWAEVNNH FT FSHTQEAYRRHRTASHGVLRLTQDITEGWRQNKTTIAVFADYEACFDRVWQ FT EGLLYKLITRGIKGRMLCYLSSFLKDREASYRINSITTPPQQSKVGIPQGA FT VLSTTLCNLYTADAYTDTNLDNFQYADDGAGWSSGEDIQSVKEEVEAGISK FT VILNWCPLWNMKIQQSKTKAMIFPPPQADDIEIEDLTVDNKSIEVVDNFKL FT VGVTLDQQLTYDKHITTTKAKAFRALKAVSKVTQAKKNPSQQAHILLYTTL FT VRPILDYASECTIAAKDRLEKAYSPIQRQALLTATGCLERTSTEALESIAA FT IPPIDIHLTCRQAQSYLRMKSKHTGNPIFDAITNMNSHKSDIQIGTPLHLL FT QTRLNEMKGEMEESMVDKEPYYDSRLPPFTMGSITGSFTATTITTGGKDKA FT REEILTILQDITGNNVPTTVIFTDGSALGNPGPTGCAAVVYEKWGSSEPYA FT VRKPVASKSNNYEGELEGLHLAIDTITRRPTTSNKILLLCDCKSAIETVTG FT VQQVEAYNSLINTLRSKLSTLKSKGYTIQITWCPGHMGIPGNEIADKEAKL FT AAEEARNCQNYTSWTKQQAMKHIESQAHERWNRRTKLNTRSEHMQKIATNM FT KKKGQTLGRRSTQVIINQLVSGHTRLNSYQNWMNPEISPSCTNCGAVETTN FT HFLYHCPRYEKERQHMLLEVENIYETYNTAEDSRDTDITTLAGMREDLSEG FT INKQMYMAFSQYIESTHRFDVLS" XX SQ Sequence 5917 BP; 2098 A; 1561 C; 1211 G; 1045 T; 2 other; gagtgtgcgg acgtgttttg gagagctctc cgtttctaaa gccttttacc acgaaaagaa 60 ggctttttca acggatttct ctcccgctaa acaagtgtgc tcacaacatc gacacattag 120 agctacagtg gggtctttga cgaaagagaa gaggaccttt tttacaatct acagtggaat 180 cttcctgacg ccggaccaag tgggggaggg cgccatcttc gaaactaccc aagccaaccc 240 aaccaaggta ccaaaccaaa gcccattttc ctctctgccg agcttgacga actagccaca 300 gaacccttgg tgctacagta ggacctgacc acagtaaggt tattatttta gtcaccaata 360 gctacatttt tgttgaaacc gagtcaaaat ggggagggcg ccattttgaa attaccatca 420 caacaaaccc aagccaaccc aggccaggca gacgccatct tggttttcct ggtctggtat 480 gtatacaaga gctcagaaaa agaaacaaga aaacgaagca aagccaagca tagaccctac 540 aaaccttgca aacagtgacc aagaaagaca ctttactcag agaaagccag gagaccatag 600 gacactcagg gatacgagta agaagcaaac agttaaccaa gatcagactg ctgacaaaca 660 taagccaact caaagaagca acacagtact acgcaaatca acaacattaa ctaagaaagt 720 agaaaaacta ccaccagtag aagaagttga gacggacgag gaagactgtg tatgtggatg 780 tgatcaccct gaaggagggc gttggatctg ttgtgactct tgtgacaaat ggtggcataa 840 tacatgtgcc aagttgtcct acaaggtatg tgatttcctt acaaaaaaca aagaaccata 900 ccactgtgcg ctctgtatca cacaagaact gaatgacaat cattcccgac ccattgaaat 960 cccacgcaaa gaagagaata ctcaagacac agctaaagat agccatatcc aagatctctc 1020 taatcacatt gtgctgattg atgggatacc ccagcctgag cattacaaaa acagcgggaa 1080 gatcattgca gagattgcca gaaacaaacc acactgtgcc cacaacatcg acctcgcata 1140 cctacttcct agagggggta tagctgttca ctgcaaaaac tctaaagcca cagaagacct 1200 actgcaaccc tggcaagacg gagcctttaa cgccagtggt gatcaactgt ctagccataa 1260 agcaaaccaa tcatacggaa aaagggcaat cctcaagaac attactcccg aagcaacaga 1320 agaagaaatt gagcaaacca tactcaaaca aaccgatatt caagtcaagg ctcacagata 1380 ccactatcag gatactggca aaccactaag ggttgtaaga gttgatgcta cacttgggca 1440 acttaatgat ctcttcctaa aaaccctgac aataagaggg gataccatca cagttgaacc 1500 ttacagaagt aagaagacca ccccgatcag gtgctacaac tgtcacaagc ttggacacat 1560 agccagactg tgcaaagaag aacccagctg tgtcaggtgc ggagggccca aagcacaccc 1620 caatccttgc aaaccaaagt gtgttaactg cagcggcaca cactctgcca acgatccaag 1680 gtgccgcgaa ttccagacaa tcaagcagaa actggaagaa agacacctca ggcatcatcg 1740 gtaaccaagg catgaagaaa tacattttcc tagcaacgct tgcatccctg attcttgcta 1800 cacacgacta cacatcggaa gcaacatcat caacaagtct acccaccagc tcacgagaaa 1860 agacttctac aaacgacaac atgaaaatcg cccaagtcaa catccgctcg atcaacactt 1920 catcttcact cgttgaacac atgtgcaaaa aacagaacat agacgtactc tgcctctcag 1980 aggtatggaa caaaacggac agaccccaat gcctgaagac ctggaactgg atcttcaaga 2040 gcagaccaac aaaaagagga ggaggggtag ccattgctac caaggaacat gtcaaaatga 2100 tggagatagg cttggacgac accacacatg cagacattgc agctgtgaac atatactcag 2160 acacagctaa cttcacactc atctcagctt atgtcccacc cgataacaaa gagggaatgg 2220 acgaactaac acatgcagta aaatccctcc tatccaccaa gagaccacta atcctatgtg 2280 gcgacctcaa tgcccggcac ccagcatggg gggatgaaaa agacaaccaa cttgggcacc 2340 aactcaataa cttccttcta accaacaacc tacatcccat caacgacttc agaccaacaa 2400 gagagaacag cgtcatagat ctcaccatca ccaactcaca agcaatgaaa cttgtcgaaa 2460 gatggagtgt gcaaccagaa gtccaactga aaacagacca tcacctcatc actatgcaag 2520 tcggaactaa gaccccagtt gaaaaggttg agcgatacga cctcagaaat gtagattggg 2580 agaaatgggg ggagaccaca cacgagcaat tccagatttg gctagaagac aatgacaaga 2640 agacagtgcc tgcggacatc aaggtatctg acatgtacga cagtttcaag aacactctca 2700 acaaatgcac agaagggatc atcaacaaga agaccataac cacacacagt aaaggacatt 2760 ggacacctgc acttgaggaa caaatgaagg tcacaaaaac tgcattgaga aaattccgca 2820 gaagacgaga tacgcacaac ctcaacaagt acctgaagga aaaagaaacc cttgcacaga 2880 tggatgaaga cgcacaaagc aaacactggg aggcacaact aaagagcatg gaccccaaaa 2940 aacctcaatc tttctggaaa accatcaaaa cacacctgaa caataacgca aaatctgtcg 3000 tacagccact gaatctaccc aatggagatg ttgccacatc tgaccaagag atcgcacagg 3060 ttgctactga caccttcgcc ccgtctgaag tagaaatgaa tgcagatctc tcacactgga 3120 aacacaacgt ctcacatgca gtggaaacat ccatcaaaca ccagacagaa gcagtcaaag 3180 aagatgacga tcaagaagac cccctgaact atgacctaga actggaagag gtagaagcag 3240 caattgggaa gatgaattgt aactcagctc caagccccat cgaaggcatc ctccccgtca 3300 tgattaagaa aggaggaaag gcactcacag aagcactaca tttcttgcta caatgcatat 3360 ggaggcatgg cgaactacct acacaaatga agcaagacac caaagtcctt atacgcaagc 3420 ccggcaaaga gaactacaac ctgatcaagt cctaccgtcc catcacccta tccagtgtaa 3480 taggaaaaat catggaaaga gtaatcaata accgactgac atggtgggct gaagtcaaca 3540 atcacttctc ccacacccaa gaagcctaca gaagacacag gactgcttca catggtgtac 3600 tcagactcac tcaggatatc acagaaggct ggaggcaaaa caaaacaaca atagctgtct 3660 ttgcygacta cgaagcttgc ttcgatcggg tctggcagga gggcctgctg tacaagctca 3720 tcaccagagg catcaaagga aggatgctgt gttatctaag ctccttcctc aaagacagag 3780 aagccagcta caggatcaac tcgataacaa cacccccaca acagagtaaa gtcggcatcc 3840 cccaaggggc ggtactatct acaacactct gcaacctata cacagctgac gcatacacag 3900 acaccaacct tgataacttc cagtacgcag atgacggagc aggctggtcg tctggggaag 3960 atatccaatc agtaaaagaa gaggtagagg ctggcatctc caaagtcatc ctcaactggt 4020 gcccactctg gaacatgaag atccaacaga gcaaaactaa agccatgata tttccccctc 4080 cacaagcaga tgacatagag atagaggact tgacagttga caacaagagc atcgaagtgg 4140 ttgacaactt caagcttgta ggagtcacac tagaccaaca actcacctat gacaaacaca 4200 tcacgacgac aaaggcgaag gctttcagag ccctgaaagc tgtaagcaag gtcacacaag 4260 ctaagaagaa tccctctcaa caagcccaca tcctgctgta cacaactctg gtcagaccca 4320 tcctagacta tgcttctgaa tgcaccatcg ctgccaaaga cagactggag aaggcttact 4380 ctcctataca gagacaggca ctgctcactg caactggatg cctcgagaga acaagcaccg 4440 aggcccttga atcgatagcc gccatcccac ctattgacat ccatctcact tgcagacagg 4500 cccagagcta tctaagaatg aagtccaaac acacgggaaa ccccatcttc gacgccatca 4560 caaacatgaa ctcacacaag tcagacatcc agatcggaac tccgctccat ctgctacaga 4620 ccagactcaa cgaaatgaag ggcgagatgg aagaaagcat ggttgacaaa gagccatact 4680 acgactcaag acttcctccc ttcaccatgg gcagcatcac tgggtccttt acagctacaa 4740 ccatcaccac cgggggaaaa gacaaggcac gagaggaaat cctgaccata cttcaagaca 4800 tcacaggcaa caacgtaccc accacagtca tcttcacrga cggctcagcc cttggaaacc 4860 ctgggcccac tggttgtgct gcagttgtgt atgaaaagtg gggaagctct gagccatacg 4920 ctgtacgcaa accggtagcc agcaaatcta ataactacga aggggaactt gagggactac 4980 acctagccat agacaccatc accagaagac cgaccacatc aaacaagata ctcctcctct 5040 gcgactgcaa atctgcgatt gaaacagtca caggggtgca acaagtggaa gcctacaaca 5100 gcctgataaa cacactcaga agcaaactca gcacactcaa atctaaaggc tacaccatac 5160 agatcacctg gtgcccagga cacatgggaa ttcctggaaa tgaaattgca gacaaagaag 5220 ctaagctagc agcagaggaa gcaagaaact gccagaacta cacatcctgg acaaaacagc 5280 aagccatgaa acacattgag tcacaagcac atgagagatg gaacagaagg accaagctga 5340 acacaagaag tgaacacatg cagaaaatag ccaccaacat gaagaagaag ggacaaacac 5400 ttggaagaag aagcacacaa gtcatcatca accagctagt gagcggccac accagactca 5460 actcctatca aaactggatg aaccctgaaa tctcacctag ctgcacaaac tgcggtgctg 5520 tggagaccac caaccacttc ctgtaccact gccctcgcta tgaaaaggaa agacaacaca 5580 tgctgctaga agtggagaac atatatgaga catacaacac agcagaggac agcagagaca 5640 cggacatcac cacgctggct ggaatgagag aagacctttc agaaggaatc aacaaacaga 5700 tgtacatggc cttctcccag tacattgaga gtacacaccg tttcgacgta ctctcctaga 5760 aactcctcca gcaactctca agcacaagga accaagagtt aaataccaag agagactatt 5820 gactcaaaag cgaaccatgt gttactcagc accaagagac cacgtctagc gaaataatag 5880 acgttaaacg aggacaacaa caacaacaac aacaaca 5917 // ID Gypsy-79_AA-I repbase; DNA; INV; 4325 BP. XX AC supercont1.242; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-79_AA_; KW Gypsy-79_AA-LTR; Gypsy-79_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4325 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.242; Positions 946357 942033. XX CC 'TTAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2594..4324 FT /product="Gypsy-79_AA-I_1p" FT /translation="MASFEILREKLIAEPVLKIYSPFARTELHTDASALEF FT GGVLMQEQSDGVLHPVMYYSKRTDSHESKLHFFELETLAVVNSIKRFHIYL FT QGIKFTIVTDCIALKETLNKKDINAKIARWEIFLSEYDYDIIHRSSSNMKH FT VDALSRLPIICALGNENNSFERALLASQMLDSKIEEVRAQLENREDKNYEL FT RDGVVYRKSKDNKIMFYVPQNMISRILKMYHDDFGHFGVEKVYELINKSYW FT FPNMKEQIKKYINNCLKCIVYSKKNFKKDGPLNIVEKGNRPFHTIHIDHYG FT PITLRNCNYRYIFVVVDAFTKYLKLYPCKTTNTSEVIKHLKHYFSAYSIPD FT QIISDRGSCFTSNSFREFYVECEIKHILIATACPRANGQVERFNRVLTPVI FT AKLVESYPTRGFGPILLDAEHFINNTFCRSIGNTPSQVLFGINQKKSTISI FT IERSINENKDIERDLIKVREKAIAINRAIQIYNKKKYDSECKINTKYQEGD FT LVYIPHHVIPGSSSKLQAQFKGPYIVRKTLPNNRYVVSDVNGIQLTRVPFT FT GVFDPVNMKLWDMKNELNAHRDDVNVRMAE" XX SQ Sequence 4325 BP; 1580 A; 604 C; 844 G; 1297 T; 0 other; attcagaagt gggatgatga tcacgcgcga tttttcaaga cgaggagctg atgatgagaa 60 tttttcaacc gtcaaaggcg tttcgaatga aaaccaaatg aaattgtcaa acccaagaaa 120 tatagacggc gatactgcag aagatattga gccgaaaccg attcaacggt tgttggtgaa 180 cgtggagttg ttacgaatgt tgtgaacaag aatggagttg gaagtaattc aagatcgatg 240 atcggcagta cgggtacgaa cagcgttcaa ctatttttac atcctgaaga tgttcgtagt 300 ttgattcctg aattcacccc ggaaaagatc actgtacaga aatggctaag aaaaatcgaa 360 aatttgaaga acgtgtatag ctgggaagaa cgagtggcgc ttcattatgc aacaatatga 420 ctgggtacgg ttcctcggat ttggtacgaa ggtgttgaac aagctgtcga cgactggaat 480 gattttaaac gaagaatagt acaggcgttt ccatcaagag tcgatgaagt ggatatccat 540 agtactttga taagaagggt gaagcaaccg gacgaaacat acgagaagtt tatttatgac 600 gttgtggcta tagcatggtg gacctaactg atgcagttat tatcaagtat attatcaatg 660 gcattccgaa ctcaaatttg aggctggcgt tgagtacagc tggacataaa tctgttaata 720 cattgttgga agcaatttta cggtacgaaa atcaaacaaa tatgcaatca gcggaaggtg 780 tcaaattttc aaaacatttt caagatcatg tgcgaatgaa taataaaaat cgttgctgct 840 acaactgcgg agaacgtgga catttggcaa gaacgtgctt ttggaaaaat taacaacgat 900 ttcaaggaag tggcagtaat cttcgacaac gttatgatgg agatgcaaga tacagagctt 960 cttactcaag caacgatcag caaattaata gtttaggagg agcaggttct cacgacgatc 1020 atggtggaga aatatttaga aaccaacagc ataggagcga tggagaaaac caacgaaaac 1080 aagagcgtta cacgatgaag gaacctaatt tgatttcggc cataggagaa accgatgcag 1140 accaccgagg aatgaatact gatgattttg agcgttctcg aggtagtatt gagtttagag 1200 gtaggaatat tctagaaaaa tcatttttca atattttagt tagattttca agtttttata 1260 aagaatgttt aactttattt gacacaggga gccccatttg tttgataaaa agtagcgttg 1320 ttccaagtaa ttctaaaatt tttaaatatt ctgaacaaac atactccggc ctaaataatt 1380 caaagattga tattttaggg acatttgata cagaaattgt tattaataat tatatttaca 1440 aattgcagct taaaattgta aaagactcta ccatggaacc gatgtgcatt ttggggtcgg 1500 acttcgtaaa acggaataac ttgttggctg aatatgacgg taacgcgata gtttttggta 1560 agaaaaatgg agaagatagg aaacttttga acgtggttac aatagaagat gatattcgtc 1620 cggatattta tgatgtggag gtaagctcag gcactttctt gaatactgat ttggatcaat 1680 tagatatatg agatacaagt attcctttta tataagttac agatgtcaga tctcttttta 1740 acaacgtgta taaaaacgga ttaagaccag ttaaacctaa aaactcgtat actatgaaag 1800 ttgaattgca aaataacaaa aattttaatt gtccaccacg tcggttgtcc tattctcaga 1860 agttagaagt agaatcgcaa ataaataaac ttttacaaga aggagtaata caagaaagta 1920 attcacctta cgctagtaga atagtattgg taaaaaagaa agataattcg tggagaatgt 1980 gtgtggatta tagagaatta aataaaatta cagtaaaaga taggtatcca attccacaca 2040 ttgaagacca tttagatagc ctgagaagta agaaatattt tagtacttta gatctcaaaa 2100 gcgggtatca tcatcttttg attgatgaaa gctctcaaaa gtatacagca tttgtaagtc 2160 atatggatca gtttgaatat aagagagtac ctttcggcct ttgcaacgca ccttcagcat 2220 ttatgagatt tattaataca atttttaaag atttaattaa aaaaggcgtg ataagaattt 2280 atatcgacga cattattatt gcaaaatctt agtggaaaac catttacaat tacaattaac 2340 taaatgtaaa tttttgaaat ctaaagtaga gtatctcgga tacaatatta attcggaagg 2400 aataagtcca agtgagaaac atgttgaatg cataaaaaac tttccagttc ctaaaaatgt 2460 gcatgatgtt cataaattta ttggattagc aagttacttt ataaaattta tcccaatttt 2520 ttcagttatt gcaaaaccac tttatgattt agtaaaaaag gataaaaagg atttcaggtt 2580 tgcaaaagag gaaatggctt ccttcgaaat tcttcgagaa aaacttattg cagaaccagt 2640 attaaaaatt tattctcctt ttgctaggac tgaactgcat acagatgcaa gtgctcttga 2700 attcggtggt gttttaatgc aggaacaatc agatggagtt ttacatccag taatgtatta 2760 tagcaaaaga acagattcac atgaatcaaa gctacatttt tttgaacttg aaacattggc 2820 agtagtaaac agtataaaga gatttcacat ttatttacag ggaattaaat ttaccatagt 2880 tactgattgt attgcattga aagaaacact aaataaaaag gacattaatg cgaagattgc 2940 tcgatgggaa atttttcttt ctgaatacga ctatgatata attcaccgta gttcttcaaa 3000 tatgaagcat gtagatgcct tatcacgttt gccaattatt tgtgcattag ggaatgaaaa 3060 taatagtttt gaaagagcat tattggctag ccagatgttg gattctaaaa ttgaagaagt 3120 tagagctcag ttagaaaatc gcgaagataa aaattatgag ttacgagatg gagtcgtgta 3180 taggaaaagc aaagataata aaataatgtt ctatgttcca caaaatatga taagtcgaat 3240 tttgaaaatg tatcacgatg atttcggcca ttttggtgta gaaaaagttt atgaacttat 3300 taacaaatca tactggtttc caaacatgaa agagcaaatt aagaaataca taaataattg 3360 tttaaaatgt atcgtatatt ctaagaaaaa ttttaagaag gatggccctt tgaatatagt 3420 agagaaagga aatagacctt ttcatacgat acacattgac cattatggac caataacgct 3480 aagaaattgt aattatagat atatttttgt ggtagtagac gcctttacaa aatatttaaa 3540 actttatcct tgtaaaacaa caaatacgag tgaagtaatt aagcatctta aacattattt 3600 tagtgcatat tcaatacctg atcaaattat ttctgatcgt ggaagttgtt tcacttcgaa 3660 tagttttaga gaattttatg tcgaatgtga aataaaacat attttaatag ctactgcatg 3720 tccaagagca aatggtcagg tagaacggtt taacagagtt ttaacaccag taatagctaa 3780 acttgttgaa agttatccta caagaggttt tggtccaatt ttattagatg ccgaacattt 3840 tattaataat actttctgtc ggtcaatagg taatacccct tcgcaagtac tatttggaat 3900 aaaccaaaag aaatcaacta ttagtattat cgagagaagt attaatgaaa ataaagacat 3960 tgaaagagat ttgattaaag ttagagaaaa agcaattgcc ataaatagag caattcaaat 4020 ttacaacaaa aagaaatacg actccgaatg taagattaac acaaaatatc aagaaggtga 4080 cctagtttat ataccacatc atgttatacc tgggagttca tctaagttac aagcacaatt 4140 caaaggtcct tatattgtca gaaaaacgct tccaaataat cgttatgttg ttagtgatgt 4200 taatggtatt cagcttacac gtgtaccctt taccggagtt tttgatcctg tgaatatgaa 4260 gttatgggat atgaaaaacg aactaaatgc acatcgggac gatgtgaatg tcaggatggc 4320 cgagt 4325 // ID DNA8-58_AP repbase; DNA; INV; 393 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-58_AP. XX NM DNA8-58_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-393 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1992-1992 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 393 BP; 118 A; 71 C; 85 G; 119 T; 0 other; cagaggtatt caaactgtgc gtcgcggctc ccaggggcgt cgcgagagtt atcgaaggga 60 gccacgtcat ataattgaaa accaataaat attttaaaat atttttacga aaaacgtttt 120 ttattatcat tgaggcacac cggttgggaa acgttggatt gtttacagtc tacactctac 180 tataatcatt cgctaagtat agctttaatt gatcgcgtaa gtcgcaatag agacagcgat 240 accatcaact gccatcgtta aatcatcaat gcgtatgtcg tgtttggcag tattagtcgc 300 gagttaaatt atcagtcatt tttaatttgg gagccgtcaa cattttgtga agactcaaag 360 gagccttggg atgaaaaagt ttgaatacct ctg 393 // ID I-67_AAe repbase; DNA; INV; 6069 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-67_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6069 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1338-1338 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >93% CC identity. XX FH Key Location/Qualifiers FT CDS 329..1693 FT /product="I-67_AAe_1p" FT /translation="MAEASQGPHLGGLFDNPPGSTIGTLVPSWLLSNDEIG FT YQIVLVMQVRRELDSGEKEINHRLPSPIVIGLSVEQVIGAEKAREIVASKE FT GRGTKYLLRTNSRLTAEKLLTIKNLVDGTLVEVVPHSTLNSVEGVVYEPDS FT INDDEEKILDYLKCQGVTKVRRIRKRVNGTLRNTPLLVLSLKGAVLPQFIY FT FGLLRVPMRTYYPSPLLCYNCGIYGHPKKACKEASICLHCSQEVHISDGEQ FT CQNPAFCLHCKANHPIHSRECPKYQQEAKIIKHKTDNRISFGEARRELRDR FT CGDTYASALQQRLHQVESEKDTIIANLRKELEAVKAELKTLKETSLSNHPN FT GEIEIQNVVQSSTPTVKGVDMATKDTNSQMPSSSDSGTMTGTRTSVNATTG FT RKSRKDKLSMSPPKESSHSTNNRLLHNMELRNRSRSDKRINTSPPDRNREK FT RRHLDYNDGR" FT CDS 1683..6005 FT /product="I-67_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MMAASNSPSYAPENTDPAMNPDPDMDNVLAKLQTEQD FT DPRIQMDYDYELLLNNTQREKIIANVKRRFEGGECVVPPSLPSTEACLGKE FT AIRDVSLQPVDCRSPSSVARIHHQSCRPPLPKETLWDSEAYDRLCHFRALP FT SNSNAEGYFIIDRTTGIFNHDSPSARPATSTLTRDVPVLSEEPLAAVGVAP FT SSSPASIQTFISQNQQSNSIGLSNNLGLQAATIHQDEDSKNDEDSLSNSTF FT GDASRKMFFLQWNIRGLWSNHPELCHLISQNRPIAIGLQEMMSKDPGNTLQ FT TQYTWSLANRHYSQGAGAAGLGILNEVPHHFMVVDTPIPVSAARLHSPFNT FT TVVSVYAPPNCPDVDIVETLEHIKNNLEPPFLIGGDFNAAHEAWGSTKSTR FT KGMLLLTWFVDNDLIVLNNGEPTFISSSHGSTSCIDLTVASKSLARNLYWS FT ISHDTHGSDHFPIRVHSEESITCTKLRKRWLYKNANWPHYEENVENFLRNE FT PNLDLSIEEVTEMILSSAKSSIPRTTGQPPKRAELWWTDSVKLAVKARRKA FT LRKLKRFNETDPMRDKARVEFQKARSIARKTVKLAKKEAWETFCSSFNPLT FT PADIIWSNFNKLCGKRRSTFKGLTIDDKYEQDPAIIAEHFADYFEKTSTGD FT SHEEVLESESRRVPLVSPENSPLDSDFSYQELLRAIDGTRGFSTGNDNVGY FT PMIRHLPISGKVAMLRSFNRVWAEGKFPSLWREGIVVPIPKPGVQPNSVEN FT YRPITLLSCIGKIYERMVNHRLTTFLEKSKFLHPGQHAFRAGRGTSSYFAE FT LNEIIQEAKRTETHLEFALLDIRKAYDQAWRPHILNQLDRVHVGNRMRACI FT EEYLNDRRFQVQYSGSFSSKKTLKSGVPQGSVLAVTLFLLAMNTVFEVVPK FT DVRILVYADDILLVAASKQLSTVRRKLLTAVAAVNDWAKKVKFRLSASKSC FT VLHVCQRKRHRKARVRSAIVIDGEAIPEVKTARVLGVWINRKGNFSTHSAK FT TKEALHSRINFIRALAPKANRSTLWKICNAVCLSKMLYGIELFGTELTSSL FT QPIFNQLLRTSSGALKSSPTYCLAVEAGELPLELRVIEILIRKYCRLVEKN FT DNGYPHLRNAVDTALRETTGGVIPDITKLQRVGRRPWNCPQVKVDWSIKNI FT FKKGNNTSIARIIVKEHLEKKYSNHVKIFTDGSKTQNDVGIGVVGPKLLIE FT RSLHPQCSVYSAEAAALVAAARYASRSPTVILTDSASCTAAIDKGESNHPF FT VQEFENIALQKNITVCWIPGHSGITGNEEADRAAERGRSARIFNRTVPSSD FT VKTWIKTLVYESFQNTWDSHRTTFLKSCKPSIDKWIDRSDRREQRVLTRLR FT IGHTLMTKKHLFDKQNPPICEVCNDEITVAHILLNCRKYDHIRRKIGLSNN FT IQIVLNNNKKNEIKLLKFVKECKLDQI" XX SQ Sequence 6069 BP; 1900 A; 1441 C; 1298 G; 1429 T; 1 other; cagtgaatta ttgcacagcg ataacaacgg tttatttacg tccgcgtgcc ggtagtttac 60 ttcgactatt cgatagttat aaagcgtatt aaattcctta ttacatcgga atactagcgt 120 tagtatagtg tgtagcagct acatgcttta gttgctcgat taacggtgat tcatactagt 180 tttcaagtga aaactttttg ctaacaagca aagcgagccg tcatcacaag cgtcgcaacg 240 aactagcacg gatcccgttc gatttggtgt atttggcttg cacaccactg ccgagcattg 300 agtagcagtt tcaaataccg cttcggccat ggccgaggct agtcaagggc cccatttggg 360 ggggctcttt gataaccctc caggaagtac aataggcaca ctagttccta gttggctact 420 gagcaatgac gagataggat accagatcgt actcgtaatg caagtccgcc gtgaactaga 480 ttcaggagaa aaggaaatca accaccgtct tcccagcccg attgtgattg gattgtcagt 540 agagcaagtg attggtgctg aaaaagctag agaaattgtc gcttctaaag aaggtagagg 600 tacgaaatac cttcttcgca cgaactctcg cctaactgcc gaaaagcttc taaccatcaa 660 aaatctggta gacgggacac ttgtggaagt cgttccccac tcaaccctca attcagtcga 720 aggtgtggtc tacgagccgg attcaataaa cgatgacgaa gaaaagattc tagactactt 780 gaagtgtcag ggggtaacta aggttcgacg cattcggaaa cgggtcaatg gaactcttcg 840 taacactcca ttgcttgtat tgtcactgaa aggcgctgta ctcccccagt tcatatactt 900 cggattacta cgagttccaa tgcgtacgta ttatccatcg cctcttctgt gttataattg 960 tggaatctat ggacacccta aaaaggcctg taaagaagca tcaatctgcc tacactgttc 1020 tcaagaagta catatttccg atggggaaca gtgtcaaaac cctgcgtttt gtttgcattg 1080 taaggcaaat catccaatcc actcgcgcga atgtccgaag tatcaacaag aagcgaaaat 1140 catcaagcac aaaaccgaca acagaatctc tttcggtgaa gcccgccgtg agctgcgcga 1200 tagatgtggc gatacgtacg ccagtgcatt acaacaacgg ttgcatcaag ttgaatccga 1260 aaaggacacg attattgcca accttagaaa agaactcgaa gccgtcaaag cagaattaaa 1320 aactctgaaa gaaacctcac tgtcgaacca ccctaacggc gaaattgaga ttcaaaatgt 1380 ggtgcagagc agcaccccga cggtaaaggg agttgatatg gctacaaaag acactaacag 1440 ccaaatgccg tcgtcatccg attctggtac catgactgga actcggacaa gcgtaaatgc 1500 aactaccgga aggaaatctc ggaaagacaa actatcaatg tcacctccaa aagaatcttc 1560 gcacagcacg aacaatagat tgcttcacaa tatggaactg cgaaaccgaa gccgaagtga 1620 caagcgtatc aatacctccc ctccggatcg gaaccgtgaa aaacgaaggc atctagatta 1680 caatgatggc cgctagtaac tctccctcgt acgcccctga aaataccgac cctgcaatga 1740 atccggaccc tgatatggac aacgtactag caaaactcca aactgagcaa gacgaccctc 1800 gcatacaaat ggactacgat tatgaacttc tcctcaacaa cacgcaacga gaaaaaataa 1860 tagcaaatgt aaaacggcga tttgagggag gsgaatgtgt agtacccccc tcacttccaa 1920 gcacggaagc gtgtctaggc aaggaggcaa tccgggacgt ctctctccaa cccgttgact 1980 gccgatctcc tagctcggtc gcaagaatac atcatcaatc atgtcgacca ccacttccga 2040 aagaaacact ttgggattct gaagcatacg accggctgtg ccattttcga gcgctgccct 2100 ctaatagcaa cgcagaagga tacttcatca ttgatcgcac caccggtatc ttcaaccacg 2160 attcgccaag cgcgaggcct gccacgtcga ccctgaccag ggatgttccg gttttgtccg 2220 aagagcctct ggcggcagtc ggcgtggccc cttcatcatc tccggcaagt attcaaactt 2280 ttatatctca aaaccagcag tcaaactcca ttggtctttc caacaaccta gggctacaag 2340 cggctacaat tcatcaagac gaagattcaa aaaacgatga ggactctctc agcaactcga 2400 ccttcggtga cgcaagcaga aagatgtttt ttctccaatg gaatattcgc ggcctttggt 2460 ccaatcatcc cgaattgtgc cacctaatct cgcagaatcg tccaatcgca attggattac 2520 aagagatgat gtctaaagat cctggcaaca ccctccagac acagtacaca tggagccttg 2580 ctaatcgtca ttacagtcaa ggcgctggtg cagctggttt aggaatcttg aatgaggttc 2640 cacatcactt catggtggta gacaccccaa tcccggtctc cgccgcacga ctgcatagtc 2700 cattcaacac taccgtggta tccgtatacg ctcctccaaa ttgcccggat gtggacatcg 2760 tagaaacttt agaacacatc aaaaacaatc tagaaccgcc atttctcatt ggtggcgatt 2820 tcaacgcagc ccatgaagca tggggcagca ccaaatcaac gcgaaaaggt atgctccttc 2880 ttacgtggtt cgtagacaat gacctgatag tactcaataa tggagagcca acgtttatca 2940 gctcatccca tggcagcaca tcttgcatag atctaaccgt agcctctaaa agtctagcga 3000 gaaatctata ttggtcgata tctcatgaca cacacggtag tgatcatttc ccgatccgtg 3060 tccattctga agaaagcata acctgtacaa aactccggaa acgttggcta tacaaaaacg 3120 ccaactggcc acattacgaa gaaaacgttg aaaattttct gcgtaacgag cccaacttgg 3180 atctttcaat tgaagaagta actgaaatga tccttagctc cgctaaatca agcattccac 3240 gaacaacagg ccaaccaccc aaaagagcag agctatggtg gacagattca gtgaaacttg 3300 ctgtaaaagc acggcgaaaa gcactaagaa aactgaaacg ctttaatgaa actgatccaa 3360 tgagagacaa agcacgagta gaatttcaaa aagctcgtag tattgctagg aaaacggtga 3420 aactggcaaa gaaagaggca tgggaaactt tctgcagctc ctttaatcca ctcaccccgg 3480 ctgacattat ttggagtaat tttaacaagc tatgtggaaa aagaagatca acgtttaaag 3540 gactcacgat tgatgacaag tacgagcagg atccagcaat cattgccgaa cactttgctg 3600 attacttcga aaaaacttca acgggcgata gtcacgagga agtactggaa tcagaaagca 3660 ggcgtgtacc gttggtttcc ccagagaata gtccattgga cagtgacttc agttaccaag 3720 agcttctacg agcaatcgat gggaccagag gattttcaac aggtaacgat aacgttggtt 3780 atcccatgat tcgccatcta cccatctccg gaaaagtggc aatgctacga agcttcaacc 3840 gcgtctgggc tgaaggcaag tttccttctc tatggagaga agggattgtt gtaccaatcc 3900 ccaaacctgg tgttcaacca aattctgtag agaattatcg tccaatcact ctactgagtt 3960 gcattggaaa gatatacgag cgcatggtga atcatcgtct gacaactttc ctcgaaaaaa 4020 gcaagtttct ccatccggga cagcatgcgt tccgagccgg cagaggaacc tcgtcatatt 4080 ttgctgagtt gaacgaaatc atccaagagg ccaaacgaac agaaactcat ctcgaattcg 4140 ctctattgga catccggaag gcatatgacc aagcctggcg accacatata ctcaaccaac 4200 tcgatcgtgt tcacgtcggt aatcggatga gagcttgcat tgaagaatac ctaaatgata 4260 ggcgcttcca agttcagtac agcggatctt tttcatccaa aaaaacattg aaaagcggag 4320 tgccgcaagg atcagtccta gcagtgactt tgtttctcct ggccatgaac acggtgtttg 4380 aagtagttcc caaagacgtt cggatcttgg tgtacgccga tgacatttta ttggtagcag 4440 catccaagca actgtccacc gtacgacgaa aacttctgac tgcagttgct gcagttaacg 4500 actgggctaa aaaagtcaaa ttccgtctat cagcttctaa gtcttgcgtc ctacacgtat 4560 gtcagagaaa aagacacaga aaagcgcggg taagatcagc aattgtgatc gatggcgaag 4620 caatcccaga agtaaaaact gcgcgcgttc tcggggtatg gataaaccgg aaaggtaatt 4680 tttccactca tagtgccaag acaaaagaag cacttcacag tcggatcaac ttcatcagag 4740 ctttggcacc gaaagcaaat cgatcaacct tgtggaaaat ctgcaatgca gtttgtttgt 4800 caaaaatgtt gtacggtatc gaacttttcg gaacagaact aaccagttct ctgcagccta 4860 tcttcaacca actgttaaga acatcatcgg gggcactcaa atcttctccg acctattgtc 4920 ttgctgttga agctggggaa ctgccactag agctacgggt catcgaaata ctaatccgta 4980 agtactgccg actggtggaa aaaaacgata acggctatcc tcatctccgc aatgcggttg 5040 atactgcgct ccgcgaaact acaggaggtg ttattcctga tatcactaag cttcaacgtg 5100 ttggccgccg tccatggaac tgccctcagg taaaagtaga ctggtccatt aaaaacatct 5160 tcaagaaagg caacaatacc agtatagcaa ggatcatcgt taaggagcat cttgaaaaga 5220 aatactccaa tcatgtgaaa atttttactg atggatcgaa aactcaaaat gatgtcggta 5280 tcggagtggt tggacctaaa ctattaatcg aaagaagttt gcacccacaa tgttctgttt 5340 actctgcaga agccgccgct ctggtagcag ctgcacgata tgcatcaaga tctccaacag 5400 ttattctaac tgattccgct agttgtaccg ctgctatcga taaaggcgaa tccaaccacc 5460 cttttgtgca agaatttgaa aatatcgctc ttcagaaaaa catcaccgtt tgctggatcc 5520 cgggccactc cggaatcaca ggaaatgaag aagcagaccg agcagctgaa cgagggcgat 5580 ctgctaggat attcaacagg accgttccct catccgatgt caaaacatgg atcaaaactt 5640 tggtgtacga gagctttcag aacacgtggg attctcacag aacgaccttc ctgaagagct 5700 gcaaaccttc gatagacaaa tggatagaca gatcggacag acgcgaacaa cgtgttttaa 5760 ctcggcttcg aattggtcat actttgatga cgaaaaaaca tctttttgat aagcaaaatc 5820 ctcctatttg tgaagtatgt aacgatgaga ttacagtagc acacattctt cttaattgta 5880 gaaaatacga tcacatcaga agaaaaatag gactaagtaa taatattcaa atcgtgctaa 5940 acaataacaa gaaaaacgaa atcaaactat tgaaatttgt taaggaatgt aaactagatc 6000 aaatctaaat taaaacagag gtgaatgaac ctatgcaggt ttaaaacctc ttaaataaaa 6060 aaaaaaaaa 6069 // ID SMAR27 repbase; DNA; INV; 1357 BP. XX AC . XX DT 10-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR27. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1357 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1085-1085 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 203..1213 FT /product="SMAR26_1p" FT /translation="MNPTTENVNFFVATMKRNNCKATEIHNLLANAWGPEN FT ICSVRQVQSIAKKFSSGEREDFRRKDGSGRPREARTEDNINAIANLVANDS FT SVSITVLADATELSWSSVRRILIEDLKKQSVCARWVPHTLNDNQLQQRVDG FT ASLLLQDLHGAVVVIDEKWLYAKPMPPKEMNRCWVDAGGERPQQPRRIIAD FT KKFHIIVAMNFRGEHYFEIMPPGTTVNAHRYTEFLQRMMAIRRQGTLTIMH FT DNARPHTAQMTEAFLQQKGIRRIPQPPYSPDMNLMDRFIFRNMEFARRLQT FT FQNDDDVKLFLTTFLAAQKRSSLNHELLNLRQDLQLIIDAGGMYL" XX SQ Sequence 1357 BP; 405 A; 293 C; 273 G; 386 T; 0 other; gggtgatcca aaaaaaactt cgttttttat agtatagtat tcgaccaaat atatattatg 60 ttattagtgt gcatactcat agtacgtatt attcaatact ttttacagtt atttctgatt 120 attattgcat ccatatgacg tggtattacg tttttcgtat atatgtgctt tatttcatta 180 ttcttaggat gcgtgtttga ttatgaatcc tactacagag aacgtgaatt ttttcgttgc 240 aacgatgaaa aggaacaact gtaaagccac tgaaattcac aatctcctcg ctaatgcgtg 300 gggccctgag aacatttgct cagtacgaca agttcaaagc attgcaaaga aattctcctc 360 aggtgaaaga gaagattttc gacgaaaaga tggatctgga cgaccaagag aggctcgaac 420 tgaagataac atcaacgcca tcgcgaattt ggttgccaat gactcctctg tgagcatcac 480 agtactcgca gatgctactg aactgtcttg gtcatcagtg cggagaattc ttatcgaaga 540 cctcaagaaa caaagcgttt gtgctcgctg ggtccctcac accctcaacg acaatcaact 600 gcagcagcga gtagatggtg catcactact tctgcaagat ctacatggcg ctgtggtagt 660 catcgatgaa aagtggctct atgcaaaacc gatgccgccg aaagaaatga accgctgttg 720 ggtggacgct ggtggcgaac gtccacaaca acctcgccga atcatcgccg acaaaaaatt 780 ccatatcatc gtggcgatga attttcgagg agagcattac ttcgaaataa tgccgcctgg 840 tacaactgta aatgctcatc gctacacaga atttctccaa cggatgatgg caatacgccg 900 ccaaggtacg ctaacaatca tgcatgacaa tgctcgtcct cacacggcgc agatgacaga 960 agcatttctg cagcagaagg gaatacggcg catcccacaa ccgccgtatt ctcctgatat 1020 gaatctgatg gacagattta ttttccgcaa catggaattc gcacgacgat tgcagacatt 1080 ccagaatgac gacgatgtga aactcttcct gaccaccttt ctcgctgctc aaaagagatc 1140 ctctctcaat cacgagctgc taaacttgcg acaggatctc caattaataa ttgatgctgg 1200 tggaatgtat ttgtgatgca tttctccatt acatttatta gttgttcatt ttgcatgtca 1260 tatatagaaa ttatatcgtt actgtgtaaa acctgcatga atactgttat tataaatata 1320 ttatattata aaaaacgaag ttttttttgg atcaccc 1357 // ID Mariner-17_SM repbase; DNA; INV; 1660 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-17_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1660 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1866-1866 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 223..1509 FT /product="Mariner-17_SM_1p" FT /translation="MKKTSYTISFKKEVIQYMEEGNSCYKAKRHFSERDGY FT DYSQSMFQQWFLNREQIKICSDSMKRARGGGRKAILGSLEDMLLDEIIELR FT LMKIKVTRSFISDRATQLARENNIELKATGTWVTNFMTRNGLSLRRTTNFT FT VLSDDELIMRGVNYILFLRNTLPTVDLSKTLLMDETAIYFEDSRTQTVDLK FT GRKHVVMKSTGFSSMRITALISVWGNGRKAIPLVINKGKTGNSIIRVVNGI FT YNASQSKAWVNQDLIIAWIDMMFPRFDVSPGKCIIWDSCRAHISEKVKNHC FT RIRNIKMIVIPGGLTPYVQAGDIGIFKELKDKISVIINTWKNSDAVEYTRG FT GNPKAPNNMVVSNWVRDAWNSVSNTNILNSIKSAGFSDDINAWHISKHDIY FT GEMFKNTYTLRMFNSSLHRQENIETEPDIFDLADE" XX SQ Sequence 1660 BP; 598 A; 240 C; 298 G; 524 T; 0 other; acggatatta ccgaataata tgactacttc aaatagtatg accaattttt ttatggtcat 60 actatttaat aatatgacta taattacata tacatttgaa gttaataaat tgatatttaa 120 tctacaacca gcaattttta tgtattatta tgttaattcg aaatatctgc ataattaaca 180 tttcaaaaaa ttttaatatt taaaagaaaa caaacacttt ccatgaaaaa aacatcgtat 240 acaatttcat ttaaaaaaga agttatccag tatatggagg agggaaattc atgctataaa 300 gccaaaagac atttctctga aagagatgga tatgactata gtcaatccat gtttcaacaa 360 tggtttttaa acagagaaca aataaaaatt tgttctgatt cgatgaaacg agctcgtggt 420 ggtggaagaa aggcaatact cggatctctt gaagatatgc ttttggatga gattattgaa 480 ttgagattaa tgaagatcaa agtaacgcgt tcatttattt cagatagagc aacacaactt 540 gctcgggaaa ataacattga gttgaaagca acaggtacgt gggtcacaaa ctttatgacc 600 agaaatggac tgtccttgcg aaggaccaca aattttacag tcctgtccga tgatgaatta 660 ataatgcgag gagtcaatta tattttgttt ttaagaaata ctcttccaac tgttgacctt 720 tcaaaaactc tgttaatgga cgaaactgcc atatattttg aagatagtag aactcaaaca 780 gttgatttaa aaggaagaaa gcatgtggtg atgaaatcca ctgggttttc ttcaatgcga 840 attacagcgc tgatttctgt ttggggaaat ggacgaaagg caatcccctt agtcatcaac 900 aaagggaaaa ctggaaattc tattattcgt gtggttaatg gcatttacaa tgcttctcag 960 tcaaaagctt gggtaaatca agatcttatt attgcatgga tcgatatgat gtttccacga 1020 ttcgatgtca gtccaggaaa gtgcattatc tgggactctt gccgcgcaca tatttcagaa 1080 aaggttaaaa atcattgtag aatccgaaat atcaaaatga tagtgatccc tggtggatta 1140 acgccatatg ttcaggcggg agatattgga atatttaagg aattgaaaga caaaatttct 1200 gtaataataa atacatggaa gaattcggat gctgttgaat acacaagagg gggaaatcca 1260 aaggctccaa acaatatggt tgttagtaat tgggtgagag atgcttggaa tagcgtttca 1320 aatacaaaca ttcttaactc aattaaatca gccggatttt cagatgatat taatgcatgg 1380 catatttcta aacacgatat ttatggtgag atgttcaaaa atacatatac gctccgaatg 1440 tttaattctt ctttgcatcg tcaagaaaac atcgaaacag agcctgatat ttttgattta 1500 gctgatgaat aataataata aatggtattg atataatttt gttttttgat atctaactaa 1560 tatctcaaat aatatgacta ctcgaatagt atgaccaaaa cctgacattt tttagtcata 1620 ctattaaata atatgagtca tactattcgg taatatccgt 1660 // ID Gypsy-154_AA-LTR repbase; DNA; INV; 214 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-154_AA_; KW Gypsy-154_AA-I; Gypsy-154_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-214 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1034-1034 (2011). XX DR [2] (Consensus) XX SQ Sequence 214 BP; 67 A; 17 C; 73 G; 57 T; 0 other; tgtagagtat ggtaaactgt tgagttggaa ataagagtgt aactggagag atatgtagaa 60 aagcggggta gagagatagg agagtcggaa ccggagggga cttggtgttg tgtattgtaa 120 gcggacggtt gtggaaaata aacgagtaga aatctaaagt gttttgtgaa ttggtttgga 180 gtgttggcgt taatccgaaa ggtaagactc taca 214 // ID BEL-12_AA-I repbase; DNA; INV; 5648 BP. XX AC supercont1.141; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_AA_; KW BEL-12_AA-LTR; BEL-12_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5648 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.141; Positions 1029555 1035202. XX CC Positions [4698-5255] - Integrase core CC 'GAAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3363..5615 FT /product="BEL-12_AA-I_4p" FT /translation="MLLSRPRKDLDTGAMEVENRVGSADTNTLEEYMDALD FT ESVQKFGRNTCPRCYFPQHAHNEMVDLQLHIFVDASEEAYSCVAYFRAEFV FT GGVEIALIGGKAKVAPLKALSIPRLELMAAVIGVRLLKTIRTAHSLSINKI FT VLWSDSKTVLSWINSDHRRYRQFVACRIGEILSKSDAAQWRWVPTKENPAD FT MATKWGKGPNFSPDSHWFKGPKFLRSPESYWPTETKPTDEMTTEEMRVCLV FT HTENLKITVIDWNRFSSWTRLLRSIAFVVRFCQNLRRKVKKKPLTDGILTQ FT KELEQSELTIFRMLQNEEYPDETAILLEGRKKNSLVRFERTSKIRKLSPFM FT DDAGLIRSDSRISVACFVSYDTRFPIILPKESVITRLLLEWYHRRFLHANK FT ETIVNEVRQRFHIPAMRALVRKIAKMCQLCKIRKAVPEVPRMAPLPAARLK FT PFERPFSYVGIDYFGPISVRVNRSTMKRWIALFTCLTTRAVHLEVAHSLST FT ESCKQAIRRFIGRRGAPVEIRSDRGTNFIGASNDLQKEMMDVDRQLAETFT FT NTQTRWVFNPPAAPHMGGAWERLVRSVKVSMAAIKTTEIPKEEALVTFIVE FT AESVVNSRPLTFMPLETEQQEALTPNHFLLMSSTGVVQQPKSSMDPKVVCR FT GDWELCRSMVDQFWRRWIREFLPTISRRTKWFTDVKPIEVGDLVIVVEEKI FT RNGWIRGRIVKLKEGLDGRVRAAVVQTATGLMERPVSKIARLDIDGDKAAS FT " FT CDS join(44..1621,1625..2704) FT /product="BEL-12_AA-I_2p" FT /translation="MSARNRKVGTGKSAQWQACVDVDTERMVCCCHCKSWW FT HFECVGVNDSIATSDRTFTCPKCQRPPLQIPGSTKTSSVGKSNSKPGSLIS FT GRTRSSVRARRAQLELEKLEAQKALALKRLELEGKKLQLEAEVLEESFRLR FT DEIEREGGSHKASVFSQQSSRSKVEEWQKRQEEILCSTIVTPAQITSVDSL FT QKGTMANVEVSQGGETQPAKHLLDRAFQGISLEDSLSDGPLGGIIGRTSDV FT VAQSTTISKLGLPPASYKVPGVSSNNQVSLGFVYSQSALNRPSNEFNIPVS FT TNPSFPPVQSQITGYAEHPLASNTRVFPVVVSSKSDRRIDECSRSLEVETN FT QAVGENPIQPREEQPSHGRREHSTRCPDPQMLSNPGMNVVDYAAPSLYQQG FT GGPSPQQIAARQVISKELPIFTGDPEDWPLFISSFTNTTRACGYSEAENLA FT RLQRCLKGNALEAVRSRLLLPVAVPHVIATLETLYGRPELLIHSLLQKIRG FT VPVPKQDRLDTLIGYGMAVQNLSDHLEAGGLAHLNNPMLLFELVDKLPASM FT KLDWSLYKQRCVEVNIRTFSQYMATLVRAATDVTLHYNPKQQVHQQQQVQR FT GTKSGKDRDFCGAHSTEEALKTPTREDTLTKGTEAKVERMIPACLLCKDPD FT HRVKNCSVFAKKTLDERWKLTGLLGLCRICLGKHGKRTCRIQKKCEVDGFA FT LKDVRTVHKLDLPRQTLRYEQLAQSFPYLNGLPVKGYENALPQILIGNDNA FT HVTATLKMREGNAGEPIAAKTRLGWTVYGLQNESIDSAHSFHICECKDGPA FT LHDLVKNFFCLENLGVEAVSFPESDDVQRAKQILQTTTKRVGQRFETGLLW FT KFDSFEFPNSYPMAVRRKFVFFVMRQLKWMAFR" XX SQ Sequence 5648 BP; 1616 A; 1201 C; 1507 G; 1324 T; 0 other; aaatctaaaa gattgtggtc cgctttattg agcttcattt ccaatgagtg ccagaaacag 60 aaaagtagga accggcaaga gcgcgcagtg gcaggcgtgt gtagatgtcg ataccgagag 120 gatggtgtgc tgctgccatt gtaagtcgtg gtggcacttc gagtgcgttg gagtgaacga 180 cagtatcgct acatcggatc gaacattcac ctgtccgaag tgtcagaggc caccattaca 240 aataccaggg agtacaaaaa cgtcgagcgt tggtaagtcc aatagtaagc cgggtagtct 300 aatatccgga aggaccagat ccagcgttag agctagacgg gctcagctgg aattggagaa 360 actggaggcg caaaaagctc tagccctaaa acgtttagag ttggagggaa agaagttgca 420 gttggaggca gaagtcctgg aagagtcttt ccggctgcgt gatgaaattg agagagaggg 480 tggtagccat aaagcgagcg tgttttcaca gcagagctca cgtagcaagg tagaagaatg 540 gcagaaaagg caggaggaga ttctgtgttc gacaatcgtg acccccgcgc aaattacctc 600 ggtagatagt cttcagaagg gaaccatggc gaatgtagaa gttagtcagg gcggcgaaac 660 gcagccggct aaacatctcc tagatagggc atttcaaggt atttccctgg aagacagctt 720 aagcgatggt cctttaggcg gtataatcgg tagaacgtca gatgtcgtag cacagtcaac 780 tactataagt aagttaggac ttccaccagc tagctataag gtaccaggcg tgagcagtaa 840 caatcaagtt tcgctcggat ttgtgtattc acagtctgct cttaatcgac cctctaatga 900 attcaatatt ccggtgtcaa caaatccttc atttcccccc gttcaatccc aaataaccgg 960 gtatgctgag catccgcttg cgagcaatac tcgagtcttt cctgtggtgg tgagttcaaa 1020 aagtgataga agaatcgatg agtgtagcag atcgctagaa gtggaaacaa atcaagcagt 1080 aggtgaaaat ccaatccagc ctcgagagga acaaccgtcg catgggcgca gagaacactc 1140 cacccgatgt ccagatcctc aaatgctgtc gaatccaggg atgaatgtag tagactatgc 1200 tgccccgagc ctttatcagc aaggcggggg accaagccca cagcagattg ccgcccggca 1260 agtaatttca aaggaattgc ccattttcac cggagaccca gaagactggc cgctattcat 1320 cagctcgttt acgaacacaa cacgagcatg tgggtattca gaagcagaga atttggccag 1380 gcttcaacga tgtttaaagg ggaatgcact agaggcagtg cgtagtcgat tgcttcttcc 1440 agtagcggtt cctcatgtta tagctacact tgagacgctg tatgggagac cggagctgtt 1500 gatacactcg ctactgcaga aaattcgtgg agttccagtg ccgaagcaag atcgtcttga 1560 tactttgatc gggtatggca tggcggtaca gaatcttagc gatcatctag aagctggagg 1620 ataattggct catctcaaca atccaatgct gctttttgag ttggtggata agttgccagc 1680 tagcatgaag ttagactggt ccctgtacaa gcaacggtgt gtggaagtga acattagaac 1740 gttttcacag tacatggcga ccttggtacg agcagcaacg gatgtaaccc ttcactacaa 1800 tccgaagcag caagtacatc aacagcaaca ggttcagcga ggaacaaaaa gcggtaaaga 1860 cagggatttt tgtggagctc actctacaga ggaggcactg aaaacaccaa cacgagagga 1920 cactctaacc aaggggaccg aagccaaagt tgagcgaatg attccagctt gtctgctctg 1980 caaggatcca gatcatcgag tgaagaactg ttctgtgttc gccaaaaaga cattggacga 2040 gcgttggaag ctgacaggac tactagggtt gtgtcgtatc tgcttaggca aacacggaaa 2100 acgtacttgt aggatccaga aaaagtgtga ggtcgacgga ttcgctttga aagatgtgcg 2160 cactgtgcat aaattggatt tgccaagaca aacacttcga tacgagcagt tggcgcagtc 2220 attcccgtat ttgaacggtc taccagttaa gggatacgaa aacgcgcttc cccagattct 2280 catcggcaac gataatgctc atgtcacagc gacgttgaaa atgcgtgagg gaaatgctgg 2340 agaacctatc gctgccaaaa cacgactcgg ctggacagtg tacggattgc agaacgaaag 2400 catcgatagt gctcacagtt tccacatatg cgagtgcaag gatggaccag cgctgcacga 2460 tctagtaaag aattttttct gcctagaaaa cttgggtgtc gaagccgtat cgtttccgga 2520 gtcagacgat gtgcagcgag ctaagcaaat acttcaaacc accacgaagc gtgtaggtca 2580 gcggtttgag accggacttt tgtggaagtt cgattccttt gagtttccga atagttatcc 2640 gatggcagtc cgacgaaaat tcgtattttt tgtgatgcgg cagctaaagt ggatggcgtt 2700 tcgatgaata caatgctgct gaagggacca gatctcttga acactctgtt aggggtttta 2760 tttggatttc gggagaagcg gatagcgatt tgtgccgatc tcatggagat gtttcatcaa 2820 attcaaattc gacctgcgga tcgacatgcc cagcgcctgt tatggcggga gaacccctca 2880 caggaaccgg atgtctatct gatggatgtc gcaacgttcg gagccacctg ttccccgtgt 2940 tctgcccagt tcgttaagaa caaaaatgcc gaagaacatg catctgaata tccagaagcc 3000 gcggaagcta ttgttcggaa gcattatgtt gacgattatc ttgacagcgc agatacggtt 3060 gaggaagcag tgaaaatagc atcggaagtt agacacgttc actcgctagg agggttccat 3120 ttacagaact ggttatcgaa ttcaaaggaa gtcctggcac gagtcgggga ggaccaactg 3180 gtatacgaga agagtttaca gttagacaaa agcgactcga cagagagaat ccttggtatg 3240 ttctggaaac cagaagaaga cgtgttcgtc ttttcgtcga cgctggtgct tgatacggat 3300 catcctacaa agcgacaagc tttacgagtt gtcatgagcc ccttcgatcc agctggcacg 3360 ctatgcttct ttctcgtcca cggaaagatc ttgatacagg agctatggag gtcgaaaatc 3420 gagtgggatc agcagatacc aatacacttg aagagtatat ggacgcgttg gacgaatctg 3480 ttcaaaagtt tggacgaaat acgtgtcccc gttgctattt cccgcaacat gcacacaacg 3540 aaatggttga tctacagttg catatatttg tcgacgcaag cgaagaagcc tactcatgtg 3600 tagcatactt tcgtgcggag ttcgttggtg gagtagagat agctttgata ggaggaaaag 3660 cgaaagtggc tccattaaaa gcgctgtcca ttccccgatt agagctgatg gcggctgtga 3720 taggagttcg attgctaaaa accattcgca ccgcacactc attgagtatt aacaagattg 3780 tactgtggag cgactctaaa acagttcttt cgtggatcaa ttcggaccat cgaaggtatc 3840 ggcagttcgt cgcatgtcgt attggagaga ttttgtctaa gtcggacgcc gcgcagtggc 3900 gatgggtacc aacaaaggaa aacccggctg atatggcgac aaagtgggga aaggggccca 3960 acttttcacc cgacagccat tggtttaagg ggccgaaatt cctgcgatct ccagaaagct 4020 actggccaac tgaaacgaaa ccaacagatg agatgacgac cgaggaaatg cgagtttgtt 4080 tagtacacac agaaaacctc aaaataactg tgattgactg gaaccgtttt tcgagttgga 4140 cacgattgct gcggtcgata gcgtttgttg ttcgattctg ccagaatctg cgaagaaagg 4200 taaagaagaa gccgttaacc gacggaatac tgacacaaaa ggaattggaa cagtccgagc 4260 tgacgatttt tcgaatgttg caaaacgaag agtatcctga cgaaacagcg atccttttgg 4320 aagggcgaaa gaagaattcc ctcgtgcgtt ttgagcgtac cagtaaaatt cgtaagctgt 4380 cacctttcat ggacgatgct ggattgatca gatcggattc cagaatttct gtcgcatgtt 4440 ttgtgtcgta cgataccaga ttccctatca ttcttccaaa ggagagtgtg attacccgat 4500 tacttctgga atggtaccac cgaaggtttc ttcatgcgaa caaagagact atagtcaatg 4560 aagtgaggca gcggttccat ataccagcca tgagggcatt agtgcgcaaa atagcaaaga 4620 tgtgtcagct gtgtaaaata aggaaggcgg tcccagaagt tcctcgaatg gcgcccctcc 4680 ctgcagcaag attaaaacca ttcgagcgcc cgttctcata cgtggggatt gactattttg 4740 gacccatttc agttcgggtt aaccgcagta cgatgaaacg atggatcgca ctattcacct 4800 gccttactac tcgtgcagta catctggagg ttgcgcactc attatccaca gaatcttgca 4860 agcaagctat ccgtagattt atcggacgca gaggcgctcc agtggaaatt cgcagcgata 4920 gagggacgaa ctttattggg gcaagcaacg atttgcaaaa agagatgatg gacgttgacc 4980 gacaacttgc cgaaacattc actaatacgc aaacgcgatg ggtgttcaac cctccagctg 5040 cgcctcacat gggaggcgca tgggaacgtt tggttaggtc cgtcaaagta tctatggcgg 5100 ctataaaaac gacagaaata ccgaaggaag aagcgttagt gacatttata gtagaagcgg 5160 aaagcgtggt taattcaagg cctctaacat tcatgccttt ggagacggag cagcaggaag 5220 cgctgactcc aaaccacttc cttctgatga gctccacggg cgtggtgcaa caaccaaaat 5280 cgtctatgga tccaaaagta gtgtgtagag gagactggga actgtgcagg tctatggtag 5340 atcagttttg gagaaggtgg atacgagagt ttcttccaac catatcccgt cgaactaaat 5400 ggttcacgga tgtgaagcca attgaagttg gagacttggt gatcgtagta gaggaaaaaa 5460 tacggaacgg atggattcga ggaagaattg taaagctcaa agaaggacta gatggaagag 5520 tacgtgctgc tgtggttcaa actgcaaccg gacttatgga gcgaccggtg agcaagatag 5580 cgcgattgga tattgatggc gataaagctg cttcgtagat gttcaaccag ccttacgggt 5640 cggggaag 5648 // ID Gypsy8-SM_I repbase; DNA; INV; 13366 BP. XX AC Contig1154; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy8-SM_I; KW Interspersed repeat; LG_I; internal portion. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-13366 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-13366 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 762-762 (2007). XX DR Genome; Contig1154; Positions 34464 21099. XX CC Positions [9074-9385] - Integrase core CC 'TAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3943..5520 FT /product="Gypsy8-SM_I_1p" FT /translation="MAENRFWAIKGTVKQTFVLDENISNNGGNHGANPQNM FT GSFANMMMMMLVMMGILRQVNSMTAYDCSNLSLGKIYSLVDTVDCREAYRD FT KVTKGEDVEYHVYQETDFYRTRVKECRLQKATFEYYCGRHSYSVFLEAKLI FT PRSVPIQVDDCINAFRTSTLKIDNKVTISAEVGKKLGITITRGGTIEADGS FT CVGRTKTRGGDVVNSIVDVEDYSIELREYEGTFEMASGKMMTHPECNVKLG FT YCKTGAATLVYSTMREFCELGYLKKTPFTKLNGSRFTSHKYQDGVWREKKM FT QQINREDISEKSTPLVLVSKDPNDAIRLVRKGQKIKCSTTVYETNYKGIYL FT SHKYIPAAENMHTSDVKIIKYVNNKIDFLYHDLQRQMQEMYQEVVLNDCRL FT NREIIRNRLAISVANPDLIMPMLMEHGTFGRIMGEAIYSYQCKIVEVEIAK FT YDKCTLELPVLYEGKLRFMAPVTHRILKEGEEPQKSTCNPAMSPLYKINNH FT WITLPDRKPPSKPVEILQAIKLTSKWNSDH" FT CDS join(7423..9384,9388..10686) FT /product="Gypsy8-SM_I_2p" FT /translation="MQKYNIIRSSKSPYSSPLVVVAKRDGSVRLCNDYRKL FT NEQTVRDNHPLPLLDYIYDKMKNSRLFTVMDAQKGFYQIKMSENSIPMTGF FT STADDHFEYLVMPFGLKGAPPTFQRAMNIMLIDAHHAMVYIDDIIIFSEDL FT DKHLEHLEDIFKRLIMANLKVKPSKCKWAEQDVIYVGHVISHYCVKPDPAN FT TEKVRNFPQATNVKQIKGFLGLTGYYRKFIRNFSSIAFPLTVIEKKGVKFA FT WGDEQQQTFDNLKDALVKPPILRFPDFKRQFMVMTDASTRGIGAVLGQQDE FT IADEYAIAYASRGLKPHEKSYAVIELEALALVFAVEKFSHYLWGRKVIFYT FT DHRPLQWLLKHRDEASKLVRWALRIQPYNIEIVYRKGKANGNADALSRMEQ FT EPEKENLKEGRLEGTIFVLTREANTLSELIEAQKCDDEINMIRQAIETKAV FT PNEFQKYWEKNKERFILEKGLIKYVEIKDNVILVPSKYREPLMLQYHDGAL FT GRHLSVRHTLSRLKAKYFWPDMKKDVKQWCQTCKICVTRRNTGKKPHVPLK FT PMPVPSTPMEMTAMDVLGSFKESAWGNKFILVFCDYLTKWPEAFPISNHKA FT DTIARIFVEQIVFRYGVLSKLLTDRGKDFMSNLLKSINDYFGILKLNTSPY FT HPQTDLVERFNGTLANMLASYVNSGQTDWDVHVPSCLFAYRNSVHSVTGET FT PFYLMYLRHSKMPVNLIFYPKATQNLDETNYLFEMRAKMQEAWAKACLNIK FT YNQETMKEYYDRKAKDHGFQVGDFVLLESHSNKKGLSPKLMRNFIGPYQVL FT KTTETNVVLQLVANKKAEPIMVHVNRCKKCAPPTKQKPDKVEYPKLAIKEK FT QMETAKIEYPKIVIKEKKPDKGILREQSRRYPLRSRQLPKTVTFATIISCI FT MTLTYAFGDFLPVAKQMSNIRYQRTDRSFRIFLKHYFQMKYQFHFSTEESE FT MTFDLDDGKRERVIDNLSPETEYSLEVHSYIQNMRRQWEIDYKVAWYIKTA FT NFTEKVKISYDGQTALFAHNFGSKNHRLGVLIVIENGESITRMVLNQQYQG FT RKWVIRPARTRSKYHILRIRLKAWLAYNIS" XX SQ Sequence 13366 BP; 5049 A; 2166 C; 2795 G; 3356 T; 0 other; ttatctttca aacattgtaa ttatatacat atatatctgt aaattttgaa ttaccgccaa 60 ataaatttta taagtaatta actgtctttg tcttttttat ttgcatagga ccacaactga 120 cctttttata ttttattatt ttgatcagta tattcattta ttgttattga tttggagtca 180 gataacgatc agtgttaaga tctcagtccc ccatcatcac acacttattt tttgatctta 240 acattttggt gagcccgacg tgatctaaaa tcgggggata aaggcggttt tgaataatta 300 agacagatat tgaatttttt tttaaatcat ttagctaaga aatcgtagat tgcaaaattt 360 tagagagggt attagaaata gacaaataat gtctgatgca cagaatttat taaataataa 420 tttaccaaat attaataata acttacccaa tattaataat ttaccgaatg ttaataattt 480 accagaacaa ccaccagtgg cggattttat gagataccca actataccag taaaaccata 540 taatggaaat ccaggtgaat ttcctgagtt tatgtccgaa tattttcaaa tggcgagagc 600 attaggcctt cctgaaccgg ttatgatttt aaggttgccc ttatatctaa aaggtatggc 660 gcgtgaaaaa tataatgatg tcattttagc aaaccaccca gtagcgtgga atgagcttat 720 agtagcatta actaatagaa ttttaccggg ggatgccacg agaatattac gacaacaatt 780 ttataataga cgacagaatt cgggtgagca tgttggggaa tttgcgtatc aaatcatgat 840 gctagctgaa cgagttttcg gagatagagc ccgttggaat gaggctaccc ttgcattagt 900 taaagaccag ttttgggcag gtctatcatc ggcaatgaga aatgcgttga atttgcacat 960 aagggataat tttgaggaat tgactagaaa agcagtttta atagaaagta gtatagagaa 1020 agtaaataat tttgcggtta gtacgaataa atctgatatc gagtgtttta attgtcataa 1080 aagagggcat ttatctaaag attgtattag taagaggagt aattcaccag gggataaatt 1140 atgtaccaga tgtgggagaa ataatcatct cgctattgat tgcaaatgtt cggacttaag 1200 ctgcgacaaa tgtaaacgaa cgggacatgt agcgtcagtt tgtaaaacta aattcggggg 1260 aaattcacag ggtaataatt ttaataataa taataataat aataataata ataataataa 1320 taataataat aataataata ataataataa taataataat aataataata ataataataa 1380 taataataat ataataataa taataataat aataataata gaaatcaaaa tcgaggtttt 1440 caaagtaata ataataataa taataattat cgaaataata atgtacagag agatgaggta 1500 aattcaacag gtaaagcaaa gtgttataaa tgtggcaaat ttggtcatat gatgcttctt 1560 gtccacagaa tcagatgagt aatgtcaata acaatggtaa taataataat aataataata 1620 atcggccaaa aacaactacg ttcgcagtta attcggatca aagattaact tctctacaac 1680 aacttgaatt agagattcaa gaggaaagaa acagattagc tcggctaacg acaaatgata 1740 atattgatga taacgaggtt tatatgcttt ccagccagag aaagcatcaa agagtggacc 1800 atgagtctaa taccgtggat ccaaaaaatg atttcaaggt tagaaccagt gagttcgaaa 1860 ccaacggtaa tggtttagcg gtttcaccaa gtaaaaatcg attggctatg gacaatgagc 1920 cagaggccag aacaaatggt ttagcggttt caccaggtaa aaattgatgg gctatggaca 1980 atgagccaga gatctacaat gaggttttag cggtttcacc caagaaaaat cgatgtaata 2040 tggtcagtga accagaagaa gcaaaagaca cgttatccac aagaggtgtt gatggaaaag 2100 ctgaaatacc attcggtaga cgagaatttg ggagatctct aagaaaggaa actatagaga 2160 aatgcgcagc attagtaaaa tatcgcgatc agtgggaaag tgaagtagta gagaaattaa 2220 aagaaattga taaagaggaa atcgtggaag ctgaaattga taaaattcta gtaataatag 2280 ccactctaga aagggatgaa gactattatt atctgataaa aaacctaatg aggaaaatta 2340 tagcaagccc gatatatgct aatgaaacag gtgaggcagt agtattcaca gtatacctgg 2400 gaaaatgctg cttaggtttg aaataaaccc taaggaaatt gttggagaac tcgtaaaact 2460 attaaaagaa gcgaggccca gttgtaacta cgtgctgtac aaatagggag ttgaaatgct 2520 aaagggagat acgctattga gttatggaat attgccaaac tacaataata cggtaaacgt 2580 ggtttacccg actgaaaaag tgaaagtgat agacacagat aatgtgttcg tgataactga 2640 aatggaaatt gaacagttaa atcaggctaa tgcaatccca caaaggttgg gatttgagct 2700 aatgcagacc cctagagatg gggaagccat taaaggaata atggattgga tgttagaatt 2760 acgcatccca gtaaatactt tgagagacat aatgccattg gtgctgcctt ttttaaaagt 2820 tacttcacca tttcaagcac aagcaaagct gctgtgtatc aaaaatggac atgctattcg 2880 ccacgaagca agcagacaag gtccattatg gttagaaatc agaatacccg ggagagggtt 2940 attgaagcaa caatgttcct gttttcacaa ggtgagaaaa ctaagggaga gaatttcgga 3000 gatattgcaa ggattatttc taatagtgat acatggatgt caggtagacg atgatgagtt 3060 cctggtagat gtggggtgtt tcccaagaga ggaaactcta gtgttaattg tgccaattga 3120 acttccaagg gagccatggt caataatgag attcccagac gcccaagaag tggaaagtga 3180 accagtacag gtggaattgc caaagggtga attcgcacca ctacctccat atccggtgga 3240 tttggatgag gaaattgaga gatttttaga aaactatgat caacccacag agggagaaaa 3300 tagagaagcg ataccgagag agtattcgga cagcagataa tagaggacga gagagataca 3360 attatccagt tcgatccaga agaaacaaac gtgggtgcaa gcacgtcatg agaatcagat 3420 ccaccagcca gaccaggaat aatccaacca gagattaaag ttaaaaaaga tggaaggggc 3480 agaccaaaga aaagtcaaat gaaaccaccc ccaaatgaac aattaaaccc ggaaacaata 3540 cagaacctaa ctcgtcaagc tttggcggga atagcaaaac aaattgatga ggatagtgaa 3600 aattatccca catttaaatc aatgatgtgg agaagattgc aacaaatcat gaggctttta 3660 aaacaagact ggcagatccc acaagatcca ggcgtaaatg tgaaaaagtt aacagtagaa 3720 atcagaaaag ctatgagaaa aattcctgaa gctatgagag gagcagatat actagatatg 3780 gtcgaacgat ttatcgaaga aggcctgaca agaggcaaaa tgataatgat cctggcaatg 3840 aggttcaagg atgtgtttag agtaatacaa aaagatcagt tacgtataaa tccagaatta 3900 gatagaatgg ctaaagttat gtcagataac ttcgagtaat ggatggcaga aaatcggttt 3960 tgggcaatca aaggaacagt aaaacagacg tttgtgttag atgaaaacat ctcaaacaat 4020 ggcggaaatc atggagcaaa tccacaaaat atgggaagct tcgcgaatat gatgatgatg 4080 atgttggtaa tgatgggaat attacgacaa gtgaattcaa tgacagcata cgactgcagt 4140 aatctgtcat taggcaaaat atattcattg gtagatacag tggattgcag ggaagcatat 4200 cgagataaag taaccaaggg agaggatgtg gaataccatg tttatcaaga aacagatttt 4260 taccgaacaa gagtgaaaga gtgcagattg cagaaggcaa cattcgagta ttactgtggt 4320 agacacagtt actcggtatt tctagaagca aaattgatac ccagatcagt accaatacag 4380 gtggatgact gtataaatgc atttcgtacc agtactctta aaatagacaa taaagtaacg 4440 atttcagcgg aagtcggaaa gaaactcggg ataactatca ccaggggagg tacaatagaa 4500 gcagatggat catgtgtggg cagaactaaa acaagaggcg gagatgtggt aaacagtata 4560 gtagacgtcg aggactattc aatagaatta cgagagtacg aaggaacgtt cgagatggca 4620 tcaggaaaaa tgatgaccca tccggaatgc aacgtaaagt tgggatattg taaaacggga 4680 gctgctacac tcgtgtattc aaccatgaga gaattctgtg aactgggata tctaaagaaa 4740 acgccattta caaaattaaa tggatcaaga tttacaagtc ataaatacca agatggagta 4800 tggagagaaa agaaaatgca gcaaatcaat agagaggata tctctgaaaa aagcacacca 4860 ttggtactag tgtctaaaga tcctaatgat gccatacgtt tggtcagaaa aggtcaaaaa 4920 attaaatgta gcacaacagt ttatgaaacc aattacaagg gtatatattt atctcataaa 4980 tatataccag cagcagaaaa tatgcatacg tcagatgtga aaattataaa atatgtgaac 5040 aacaaaattg atttcttgta tcatgatttg caaaggcaaa tgcaagaaat gtaccaagag 5100 gtagtattga acgattgtcg gctgaacaga gaaataatac gaaatagatt agcaatttct 5160 gtggcaaatc cagatctaat aatgccaatg ttgatggaac atggaacatt cggtaggatc 5220 atgggtgagg ctatctacag ttatcaatgc aagatagttg aagtagaaat tgccaagtat 5280 gacaaatgca ctctggaact cccagtgcta tacgaaggta agctacgttt catggcacct 5340 gtgacgcata gaatcttaaa agaaggggaa gaaccccaga aatcaacgtg taatccggca 5400 atgtcaccgc tttacaagat aaacaatcat tggataacat taccagatcg taaaccacca 5460 tcaaagccag tagaaatact tcaagctatc aagttgacat caaagtggaa ttcagaccac 5520 taaacgaatt gtcaaaggga ggggtataca ctccagaaga catagaggaa gccaggagag 5580 ctattgcttt tccacaagtg cgtgatcgag gggttacaga aataataacc aggatctaca 5640 agcactatga tggaaaaccg gattataatt tattattcgg accagatcat tacagacaag 5700 cagcatcaaa tgtaataagg aaaatctggg gaagatttac cagttttggc atctttgtag 5760 caggatgttc aggaatatat gcaattttta tgataataaa aatcatcgca tcacaactat 5820 tatccacgta tagtatcttc aagaaaggag gatggtcttg gaagctatta ttgggaatgt 5880 gcccattatg tgcagatttc gcagtattcc gacatcataa caaggaaatt aaaatgatga 5940 agcagagaaa cttaataaag cacgaaatag tgagagagtg ggatgaggat tatagtcatc 6000 gagacgatga caattctgac gagggaggta acatagagga caacacagcc cgagatcaca 6060 actcatcaaa accagacaac aaaaaccgaa ttttcaaaat tttgaaagcg gtaaatttct 6120 acagtaccct tatcagcgga gaaagaggag aaagcaaaac tttgccaaaa agtttagacc 6180 cccgtatgag gccaacaaac ggatcatacc cggggaagtc gagcaagaga ttctatatcg 6240 ctatgatgat agaacagatt tggaaaaaac tccattttca ccatagccaa aacgagaaag 6300 acaatacgat acaatgccca caatgggaat gaaaactatg gaaaaagtac cagtataccc 6360 gtttagcact ttggagaaat tgaaacaaag ttatgacgaa caaagtcagc cagaagcaga 6420 aagaaccgaa ctggaagcca agagattcag agaaaacgtt ttgggaactt catccatatg 6480 tatggtagct acaggaaatt tggcaaataa accaacaatc aaaatgaaag tcaatagaca 6540 tacagtccat gcactgattg actctggatc agacttgaca atgatagagg caaacttgtt 6600 aaatgaaact gagctgcaag caattgttcc caccttgaca acagcaacag gagataacgg 6660 tcaagcaatc aatgtcattg gaatactaga ttgtgaattg agactttgtg accaaaagtt 6720 caggcatgac ataaaagtca ccagagaatg catggcggaa tgcattgtgg gtatggatat 6780 actcaaaaag ttgaaagaca ttgtatttga ttgtgcaaca ggcaagttag taaaattaac 6840 gggaaggaaa aggaaatcat ataatataga tgtagttttc ctggaccggg tggtcactat 6900 accaccgcgg acagaaacag tttgttttgt ggcaactcat caagaaatgg aaggagatgt 6960 tattttcgaa ccaaaagcgg aatttgcgag taaatataat tctcccatta cacaaaatct 7020 agcaacgata tgggaaggga gaatcccaat cagaataacg aatatggata ataaaacgct 7080 gagattgtat ccagatatac gagtggggaa aattgtgaaa ctaaaagacc accaagagaa 7140 agttgcacaa gtaatgtcag ttaatgaaga ggagaaagat ccagatatcg agtactatga 7200 tgctccaaat tgtgatagta aagccctcac aagaagggaa aatacaaggt aaaccaatta 7260 ctgcgaaaat atgaccaagt tttcgcaaaa catgaatatg acttagacca agtaaatatt 7320 ttagagcatg aaattaattt aactgaaact aaacctatta aaatcagacc gtatagaatc 7380 ccacaaagct tacaggaacc agtaaataaa caaattaaat taatgcaaaa gtataatata 7440 attagatcaa gcaaatctcc ttattcttct ccgcttgtgg ttgtagctaa aagggacggg 7500 tcggtaagat tatgtaatga ttatcgtaaa ttaaatgagc aaactgtacg tgataatcac 7560 ccgctgccat tgctagacta tatatatgat aaaatgaaaa attcgagact tttcacggtg 7620 atggatgcac aaaaaggttt ttaccaaatt aaaatgagtg aaaattcgat accgatgaca 7680 ggcttttcta cagccgatga tcacttcgag tatttagtaa tgcctttcgg attaaaagga 7740 gcacctccaa cttttcagag ggcgatgaat ataatgttga ttgacgcaca tcatgcgatg 7800 gtgtatattg atgatattat catattttcg gaagatttag ataaacatct agaacatcta 7860 gaggacattt ttaaaagatt aataatggct aatctcaaag tgaaacccag taaatgtaaa 7920 tgggcagagc aagacgtgat ttacgtggga catgtgatat ctcattattg tgtaaaacca 7980 gacccagcaa atacagagaa agtgagaaac tttccccagg caacaaatgt caaacaaatc 8040 aaaggatttt taggtttgac ggggtattat cggaaattca ttagaaattt ttcctcaatt 8100 gcatttccat tgactgtcat agaaaagaaa ggagtaaaat ttgcttgggg agatgagcaa 8160 caacaaacct ttgacaatct aaaagatgcg ttggtaaaac cacccatttt gagattccca 8220 gatttcaaga gacaatttat ggtgatgacg gatgccagta ctagaggaat cggtgcagta 8280 ttgggacaac aagatgagat agcagatgaa tatgctattg catatgcaag ccgaggatta 8340 aaaccacacg aaaagagcta tgcagttatt gaattagaag cgttagctct ggtttttgct 8400 gtggaaaaat ttagtcatta cctgtggggt agaaaagtaa ttttctacac tgatcataga 8460 ccattacaat ggttattgaa acacagagat gaagcatcaa agttagtaag atgggctttg 8520 agaatacagc catacaatat tgaaattgtt tatcgaaaag ggaaggcgaa tggtaatgcc 8580 gatgccctat cccggatgga acaagaacca gaaaaagaaa acttaaaaga aggtagattg 8640 gaaggaacaa tctttgtgtt aactagagaa gcgaacacat tatcagagtt aatagaagca 8700 caaaagtgtg atgatgaaat aaatatgatt agacaagcca tagagacaaa agcagtacca 8760 aatgaatttc aaaagtactg ggaaaagaat aaagagaggt ttattttaga aaaaggtctc 8820 attaaatacg tagaaattaa agacaatgtg atattggtac caagtaagta ccgggaacct 8880 ttaatgttac aatatcatga tggagctttg ggaaggcatc tctcagtaag acatacattg 8940 tccagactga aagcgaaata tttttggcca gatatgaaaa aagatgttaa acaatggtgc 9000 caaacatgca agatatgtgt taccaggagg aacactggaa aaaagccaca tgtaccatta 9060 aaaccaatgc ctgtaccatc caccccgatg gagatgacag ccatggacgt attgggttcg 9120 ttcaaagaat cagcatgggg aaataaattt atattggtct tttgtgatta tctcacaaaa 9180 tggccggaag ctttcccgat ttcaaatcat aaggcagata ctatagcaag aatatttgtt 9240 gagcaaatag tattcagata tggagtgctt tcaaagttat taacggatag gggaaaggac 9300 ttcatgagca atctcctcaa aagtatcaat gactatttcg gtatattgaa attaaacaca 9360 tctccatacc acccccagac agactgattg gtggaaaggt tcaacggaac cttggccaac 9420 atgttggcat catatgtaaa ttccggacaa acagattggg atgtgcatgt accatcgtgt 9480 ctgtttgcat atcggaattc ggtacattcg gtaacagggg aaacaccctt ctacctcatg 9540 tatttgagac acagtaaaat gcctgtgaat ttgattttct atccaaaggc aacgcaaaat 9600 ttagatgaaa ccaattatct attcgaaatg agagcaaaga tgcaggaagc ttgggcaaag 9660 gcctgtctca atataaagta taatcaagaa acaatgaaag aatattatga tcggaaagcg 9720 aaagatcatg gatttcaggt cggggatttt gtcttgttgg aaagccattc aaataagaag 9780 ggtctttcgc caaaactgat gagaaatttc ataggaccat accaagtact taaaacaacc 9840 gaaacaaatg tggttttgca actagtagct aacaagaaag ccgaacctat tatggtacac 9900 gttaatcggt gcaaaaagtg tgccccgcca acaaagcaaa aaccggacaa agttgaatat 9960 ccaaaattag caatcaaaga gaagcagatg gaaacagcga aaattgagta tccaaaaata 10020 gtgatcaaag agaaaaaacc agataaagga attctgagag aacaatccag gagatatcca 10080 ttgagatcga gacaattacc aaaaacagtg acgtttgcga caataataag ttgcataatg 10140 acgttaacat atgcgtttgg ggatttccta cctgtcgcca agcagatgtc gaatattaga 10200 taccagcgta cagatagatc atttagaata tttttgaagc attacttcca aatgaagtat 10260 caatttcact tctcgacaga ggaaagtgaa atgacgtttg atctagatga tggaaaaaga 10320 gaaagagtaa ttgacaactt atcaccagaa acggagtata gtttggaagt tcattcatat 10380 atacaaaata tgagaaggca atgggaaatt gattataaag tcgcttggta tatcaaaaca 10440 gcaaacttta cggaaaaggt aaagatttca tatgatgggc aaacagcttt atttgcacat 10500 aactttggga gcaagaacca cagactaggg gtattgatag tgatagaaaa cggagaatcc 10560 ataaccagga tggtgttgaa tcaacaatat caaggccgaa aatgggttat tagaccagct 10620 cgaaccagaa gcaaatatca catacttcga attcgactta aagcgtggtt ggcatacaat 10680 atatcgtgag atactagaac tattacctcc aaagagggag aacgataatg taaaaaatcc 10740 gagagcaaga ttagtgaata tcactaaatt taatggaata aatttattaa atatatcaag 10800 tctgaaattg gcagcggaga atttaaccat ccaaaaagta aatttgacga tccctacaaa 10860 aaaggtaact ataaatttaa caaccccaaa agtaaaacag ttaagagcgc taaaactgaa 10920 gtataggggg gacaaagtct cactatttcg taagtagtgt aagtaaattt tatcataaca 10980 tccgtgcatg tcgtttttcc tggcagttta tcacgatatg tgatagaaat agttaatcag 11040 tttatattgt agatagcaga aacccaaatt tacaaatcga caatgaagca cccaattcaa 11100 acctacgaca aaaccgtgag ctcaatcgaa gcctatacgt cctacaaaga agaattcgtc 11160 taccggttca tgacaactac catcacctcg ctaactaacc ccttaattta taatcattgg 11220 tattacccaa gacgagtgga catacctaac gattttaaat tcccaagagg cccaagaagg 11280 caaatgatga tcacagtagt gaagagcagc aaaaccctaa aattagcaga tatctttaga 11340 gaatttaaag attgccccag aaccaatcat atttggctcc acgccattaa caaaccgttg 11400 gtagtattac taagtggaat gccaaaccct atagagttgg acagaaatag cactttcgag 11460 ttatcatatt ctgatggagg agaaaagaat ttgcaagtaa agacaacaga gccaaacttg 11520 attgtttgtc tagagatatt aaattaccaa aaaccatacc gaatagtaac ggatttggag 11580 cctccctggt gttataacac aactgaagcg acaatccata atggggagtc cttcttgagg 11640 aagcgcttaa ccaaagaaga agaaaaccta tccactattt tcgtggcggc tggatatgcc 11700 ttatcccgag taggcatgga tcctgttctg ttgtagatat ctatctcaga tttctcgggc 11760 caacaactat tgaatacaat agtcacccta gagaacctgt tagacactat cagacccacc 11820 aatatcattt aaaagaaccc gatgttagag gaaatttgga tgatgaagaa tgctgtcaca 11880 tgattagaaa aatttgtgac aaaaagatag tagtcggatt aatggttaaa aagttactag 11940 cggtagccca tctaccggca cagcgtttat tgggtattcg agatctggcc tcaatcaaaa 12000 ttttagaaga tcgacgggta cctaagatgc aaggatacta tcacttgcct ttcctttcaa 12060 aatatttcct aaaataggct ccggagagcc ggcaggatct agtagtagac aatgaggtcc 12120 gaactattag aaaaatatat atgagcattc agaatgaatg actggatcat ataacaatcc 12180 cacaaaatac ccattcagaa ggtgaacagc gggttgtcat aaaacgaaaa ttacctatga 12240 tatcaaaaga cgcaaaagtt cagaaagaag acgaattgtc cgcgccacct ggcgttacaa 12300 tctgcgacga agatgaagtc atggagaaac catcaaatac tgagatgcga aaaccccaaa 12360 tacaaatagc ccaagtaaca aaaagtgctg ccgatcagcc agaaaggaga atcccacaaa 12420 atgcgaatgc tgcctatcag caagaaaagg tcatgagacc atccataagt aacccgaatt 12480 accctgaagc agccacaagt ttgaaaaagg tggcaatccc tccgacggga cagaaagcaa 12540 agacagaaag tggaggtatg aaaagcgtag ttaaacgatt attcaaaaag aaggaaccaa 12600 ttaaacagaa tcaaccacaa gaaaaacctt cacagtcaag ttcggcaata cctaaaaaga 12660 tattcaatga accagatgac gatatagaag aagtatcata tgtaccccca aacgctgaat 12720 tacaaaagca actaaaacta aaaacggagc ctgaagtggt cgagaagttg agaataaatc 12780 gagagaattt gattctgaag ccaatttcgg tagacacaga tgggtatgat ccgatgtggt 12840 accgaaggcc aaacagccac tggcgtccaa gagttaccaa atatgatggg aaagaaatta 12900 cccgcatggc ggtaaattct aaccccaaat accaagaaat tgatgtggac ggagtcccat 12960 acttggtatc atcagtaata ctagtaagct ccttagacag ggaggcacta gccctatctt 13020 ataaaccgga gacgatggcg aagagactcg caaaatcgtt acgagagcac aaagcgcgta 13080 accgatataa ttctacagac ccccatcaat atcgcgtgcc tgaggaaccc gaagaaaact 13140 aagaaaatcc aatgccagga gaaccagtgt gaaacagaca tttatatttg tgattcttga 13200 catacatgat tttttatttt agtagttttg tgatcatgtt ttggtgttat taattattat 13260 tgattttttt tggtgtatat attttgtaag gcaaatggct cacagtagga gtaaatatat 13320 attttttttc ggggaagaaa atcttttagg gggtaagatt tgttat 13366 // ID CR1-9_NVi repbase; DNA; INV; 4291 BP. XX AC . XX DT 12-MAY-2009 (Rel. 14.05, Created) DT 12-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-9_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4291 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(5), 936-936 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(6..680,677..1201) FT /product="CR1-9_NVi_1p" FT /translation="MESSSAGSVTSQTAIFYKCHPKVEVKTVICIICEEAF FT HTSDFAKIDGAVKISGVLGLCPEHNQVDLTSKVNTNVLSNEAKIIIAQIKL FT AHNLRKSDQLSVSEADSDDDDENSVKIDSNLKCENALLKELNKELKEKNKL FT LRELLDKQKSEINSSKNKTYAQVISSAMPNNKPKRIPKIIVKNKDTKEFSI FT DKINDVVAHYLIKDKSIQAKKLIKKXSEIIVDCRCMLTEESASKAYNVLKK FT KLDESCDVIKEKIENXKVKVVGINNFETLDNKKIEDDINERNFSKFNKKCT FT VLHSYNNSKTHLQSVILEIPAELYQHVRENKNRIFVGYQNCKVHDYCNIKP FT CFNCGRYGHNGFKCTNNHTCLKCAGNHKTTDCTGNKLNCPNCIFSNNKYKG FT TLRIN*" FT CDS join(1201..2442,2403..4094) FT /product="CR1-9_NVi_2p" FT /translation="MAQEFNFEDIYNIERDAENIRSLNKSISKKKNLILCV FT NIRSLNANYEKLESFIESLVVKPIIIICTETWILQYPQYYQLQGYKSYYND FT SKINRADGVMLYIKKNIQEITKIEVIDRLSVVSSDTFLESGEAIRISAMYR FT CHDISKSEFTNSVRKFLSKQVNIKNQCIFGDFNIDIRDINYDKIGLAEKTI FT AQEFLNNFFENEYIPFFRGITRPSQNSENGTCIDNCFAKVRNIELESFKLN FT IPFNDHYPLFISINKFKIQKDYNQSACINYGKLINIAKRVEWDSLLQINDP FT NQAINELISMIQSCVEKATDRKYSNKNKKDSVPKKKWITKAILISCKTKEM FT LYNIWKKNPTNMKLKMDYKNYEKILSKVIKDAKYKYENNAIRKCSGNSRQL FT WSIINDKLGKKRXKEDHLIVVRKKKKQGGSLDSIIVNDQKIEDKTTIANTM FT NEYYCNVGIXLSNQIXKMDLEQLKLPVRNNYSIFINPTNMYEIQNIIIAMK FT KKAGGVDNISAITIKTLSKHILKPLEYIFNLSIQQSIWPNALKQADIVPIY FT KSGDNSCISNYRPISLTSNIAKIFEKIIYNRLYNFIMKHKIISDKQFGFIR FT KRGTKDALNCLXNIIYRNLDKSKPIITAFLDLAKAFDTVDHSILLDKLERY FT GVRGEALKLLISYLSDRKQCVKISNCKSEYKEITIGVPQGTILGPLFFILY FT VNDLLIDMQNETILSYADDTVIISCDNSWTAAQERLNEYLRKVAIWLNLNK FT LSLNVNKTVYIAYGNYCDSVPSTLNIKIGDNVINRVDSYRYLGLIIDYNMK FT WDKHINYIIKSTRYLIFIFAKLKKFMDSKTLMLLYYAFFQSITNYGIIAWG FT GAYNNYLNLIQGIQKKILRIINKNCYITQNQPLPIRQMFELECIVYHYNEL FT RDRYIRSTNKTRNKNLPLPKIDKTVSKKSSYYVAVSVFNTLPNDLKDLSIS FT KVSIKRKLKMFIGKNYFGCYIF*" XX SQ Sequence 4291 BP; 1777 A; 538 C; 724 G; 1245 T; 7 other; ccaagatgga gtcatcgagc gcaggctcgg tgacttcaca gacagcaatt ttctataaat 60 gtcatccaaa agtggaagta aaaacggtta tttgtataat atgtgaggag gcttttcata 120 ctagtgattt tgctaaaata gatggggcag taaaaattag tggagttcta ggattatgtc 180 cggaacataa ccaagtagac ctaacctcaa aagtgaatac gaatgtgctg agtaatgaag 240 ctaaaataat cattgctcag attaaattag cacacaatct gagaaaaagt gatcaattaa 300 gtgtgtctga agctgatagt gatgatgatg atgagaatag tgtaaaaata gacagtaatt 360 tgaagtgtga aaatgcccta ttaaaagaat taaataaaga actgaaagaa aaaaataagt 420 tattgagaga attacttgat aagcagaaaa gtgaaataaa tagctcaaaa aacaaaacct 480 atgctcaagt aatttcgagt gccatgccta ataacaagcc aaaacgaatc ccaaaaatta 540 tagttaaaaa taaagatact aaagaatttt ctattgacaa gataaatgat gtggttgcac 600 actatctcat taaagataag agtatacaag caaaaaaact aatcaagaaa argagtgaaa 660 taatagtaga ttgtagatgt taacagaaga aagtgcaagc aaagcatata atgtattaaa 720 gaaaaagctt gatgaaagtt gtgatgttat aaaagaaaaa atagagaaty caaaggtaaa 780 agtagttgga ataaataatt ttgaaaccct agacaataaa aaaattgaag atgacataaa 840 tgaaaggaat tttagtaaat ttaacaagaa gtgtacagta ttgcactcat ataataactc 900 taaaactcat ctacagtcag ttatattaga aatccctgct gaattgtatc aacatgtaag 960 agagaataaa aacaggatct tcgtgggtta tcaaaactgt aaggtgcatg actattgcaa 1020 cattaaacca tgttttaatt gtggtagata tgggcataat ggctttaagt gcacaaataa 1080 tcacacatgt ttaaaatgtg caggtaatca taagactaca gattgtacgg gtaataaatt 1140 aaattgtcct aattgtatct tcagcaataa taagtataaa gggaccttaa ggattaacta 1200 atggcccaag aattcaactt tgaagatata tacaacatag aacgtgatgc agagaacatc 1260 agatcactca acaagagtat ctcgaagaaa aagaacttaa ttttgtgtgt aaatataaga 1320 agtctaaatg cgaattacga aaaattagaa tcatttattg aaagcttggt agtaaaacca 1380 attattataa tatgtacgga aacatggatc ttacaatatc ctcaatacta tcaattacaa 1440 ggctataaaa gctattacaa tgatagtaaa attaataggg ctgatggagt catgttatat 1500 ataaaaaaga atatacagga aataactaaa atagaagtaa ttgatagatt gagtgttgta 1560 agttctgata cttttctaga gtcaggtgag gcaatcagga tctctgctat gtacaggtgc 1620 catgatatat cgaaatctga atttactaat tccgtaagaa aatttctgtc taaacaggtt 1680 aatataaaaa atcaatgtat ttttggggat ttcaatattg atataaggga tataaattat 1740 gacaaaattg gtttagctga aaagacaatt gctcaagaat ttctgaataa tttttttgaa 1800 aatgaatata ttccattttt tagaggaatc acaagaccat ctcagaacag tgagaacggt 1860 acgtgtatag acaattgttt cgccaaagta agaaatattg agttggagtc ttttaagtta 1920 aatattccgt tcaatgatca ttacccactc ttcataagta ttaacaaatt caaaatacaa 1980 aaagattaca atcaatcagc ctgcattaac tatggcaagt taataaatat tgcaaaaaga 2040 gttgagtggg actcattgtt acaaataaac gatcctaatc aagctataaa cgaattaata 2100 agtatgatcc aaagctgtgt tgaaaaagca actgatagaa aatattcaaa taaaaacaaa 2160 aaagattcag ttccaaagaa aaaatggatt accaaagcaa tattaatatc ctgtaaaact 2220 aaagagatgt tatataacat atggaaaaag aatccaacaa atatgaaact gaaaatggac 2280 tataaaaatt atgaaaaaat tttgagtaag gtaattaaag atgctaagta taaatacgaa 2340 aataatgcta ttagaaaatg tagtggtaat tcaagacagc tttggagcat aataaatgat 2400 aagttaggaa aaaaaagaar caaggaggat cacttgatag tataatagtt aatgatcaga 2460 aaattgagga taagactact attgcgaaca ctatgaatga atattactgt aatgtaggaa 2520 tayatttaag caatcaaatt waaaaaatgg acttagagca attgaagctg ccagttagaa 2580 acaactactc aatttttatt aatccaacta atatgtatga aattcaaaat ataattattg 2640 caatgaaaaa gaaggctggt ggggtagata atattagtgc aataactatc aaaactttat 2700 ctaaacatat attaaaacca ttagagtata tatttaatct aagcatacag caatcaatct 2760 ggcctaatgc actgaaacaa gcggatattg tacctatata taaatcggga gacaatagtt 2820 gtattagtaa ttatagacca atatcgctaa cttctaacat tgccaaaata tttgaaaaaa 2880 tcatatacaa tagactttat aattttatca tgaaacacaa aataatatct gataaacaat 2940 ttggctttat aagaaagaga ggaacgaaag atgctctyaa ttgtctawca aatattatat 3000 atagaaacct agataaaagt aaaccgatta taacagcgtt cttagacctg gctaaagcct 3060 tcgatactgt agatcacagt atactgctgg acaaattaga aagatacggt gtaagaggag 3120 aagcattaaa attactcatc agctatctat ctgataggaa acaatgtgta aaaataagta 3180 actgcaagag tgagtacaaa gaaattacga taggggttcc acaaggaacc atacttggcc 3240 ctttattttt tatattatac gttaatgacc tattgataga tatgcaaaat gaaactatat 3300 tatcttacgc agatgacaca gttatcattt catgtgataa ctcgtggaca gctgcacaag 3360 aaagattaaa tgaatatctt cgtaaggtgg caatatggct aaacctaaat aaattatcgc 3420 tcaatgtaaa caaaaccgta tatattgcat atggcaatta ctgtgatagt gtgcctagca 3480 ccttaaacat caaaattggt gataatgtaa taaatagagt agacagttat agatatttag 3540 gattaataat agactataac atgaagtggg ataagcacat aaattatatt ataaaatcaa 3600 caagatattt aatctttatt tttgcgaagc taaagaaatt tatggatagt aaaacgctaa 3660 tgttactata ttatgccttc tttcaaagta ttacaaacta tggtataatt gcatggggtg 3720 gggcttataa caattattta aatttaattc aaggtataca gaagaaaata cttcgaatta 3780 taaataaaaa ctgttatata acacagaacc aacctctacc tataaggcaa atgttcgaac 3840 tagagtgtat agtatatcat tataacgagt taagagacag atatataaga agtacaaata 3900 agacacgaaa caagaatcta ccattgccta aaattgataa aacagtaagc aaaaaaagca 3960 gctattacgt ggctgtgagt gtatttaaca ctctaccgaa tgatctcaaa gacctatcaa 4020 taagtaaagt ctctattaaa agaaaactaa agatgtttat aggaaagaat tattttggat 4080 gttatatatt ttaatattgt taactttttt atggaaaggt ataagggcat gatagcgttg 4140 attgcgttat cttggctcta tattgtagtt ttaagttttt tagtattaat gtttatgtat 4200 gccatcccta tgtacaggca actgtgtttg cctctatagg atgcctttca aaggcatatg 4260 tatatggtat ttaaataaat aaataaaaaa a 4291 // ID BEL-1_DPer-I repbase; DNA; INV; 7788 BP. XX AC super_2; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_DPer_; KW BEL-1_DPer-LTR; BEL-1_DPer-I. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-7788 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_2; Positions 2541564 2549351. XX CC 'CCAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1265..2542 FT /product="BEL-1_DPer-I_1p" FT /translation="MPTGDKKSRKKFKRVSSLSLTHPPSSNGSIATPTSSR FT PSPTERGPTTSRAHAPRAFEALLLPSVVPRTVPSTMAQSAASSARSKFALA FT ASRLTRFDQKLSAPDASTPTRSLSQVHRDQLRSLWAKVDAEFEACVEAITE FT VDPEGVADVQGMYDDCYAVYAQCLAELNDQLEPPTVPHTPQLAVQVPTSGG FT CRLPPCDTEVFSGDYQQWPTFRDLFTAIYIENPRLAPVEKLFHLNKKTSGE FT AHDIVAQAPLTNEGFASAWSALRERFQNKRLILKAQLKILFSLPQIRTESA FT AALKELQRAVHKCLTTLNHSEVSTDSIFADGVLVYLISAKLPKTTLELWEQ FT SVTHKSEIPTWHAMDKFLAERYLSLEVTEDGHPGMETQAHSRASLPMSERS FT KGNPNPFRSPREPCCKAGSLFRNNCGSEAESV" FT CDS 2598..6707 FT /product="BEL-1_DPer-I_2p" FT /translation="MRPDARSSYIKQKMLCLNCFARGHQIRECQSAHNCNT FT CGDRHHTLLHRGTPVSSQSVPNSVPLPTPAEAQPSVQNYFASGKRAVLLGT FT AIINICHLGTNFRARALIDPGSEATFITERLFQIIKLPFRLAQAQVSGLNQ FT TVSAQSKKLCHFSIRSPTKPGLQLDATAYVLPELSGSLPSHPIPQHSLRDL FT PNLPWADPTFYESSQIDVLIGADILPSILLSGSQTNICGSLLGQETIFGWV FT LTGPVPNTVQGRVASFSTQISTELETPLDKLLTKFWEVEDIPTKIESESDS FT YCERNFLRTTSRTPSGKYVVTLPFRDPDHPGSDLGYSRAAALAQFLRNENR FT LKRNGPLQEQYDTVIQEYLELEHMTEVPPTHGSSTYYLPHHAVFKPESTTT FT KVRVVFNASSPSTNGVSLNDLLHAGPVLQSDLTLQILKWRYFRYVFNADIT FT KMYRQIWVDPKHTPFQRILFRNKEGLIRDYELKTVTFGVNCAPFLAIRVLQ FT QLASDVQSRFPKASRIIRSFMYVDDVLAGADSTDEARLTIRELQAALSSAG FT FPLRKWTSNHKAILAGIPSAHRLHTDFLEMEEESTAKTLGIRWKATSDEFF FT FVPPELAPESSYTKRAVLSQIARLFDPAGWLAPFIVRSKIFMQEIWLQDLG FT WDDELPSEMRQRWQSFLRSYSALDQIHIPRWVGSRPAVKVEHHGFCDASER FT AYGAAIYVRIEVDRLVEVQLLTAKTRVAPVKTVSLPRLELCGAVLLSEMAA FT AILPNMPTASTSCYCWTDSTIVLAWLAKPACHWTTFVANRVTRISQATDIE FT KWCHVPSEQNPADLASRGVPLQELVENQLWWHGPTWLQKGRDQWPAPVNNS FT PVMTLEQRTVKAHFALNPAEDFLERFSNLERALRVRAYILRFTKRCRKLAT FT AQKGHLTSGEITEAEKTLILETQRREYPEEYRCLSGKRPTPRSSSILNMNP FT FLDRHGLIRACGRVAGSEVLRYDERHPIILPYNCQLSRLLAQFTHRITLHG FT GNQLMVRLIRSKYWIPKVQRLMKGVVNSCKVCVIHKRRLQTQMMGDLPTER FT SSFSRPFTHTGVDYAGPFEIRNYTGRACLITKGYVCVFVCFSTKAIHLEPT FT SDLTTEKFLAAFARFVARRGCPQRIHSDNGKTFVGAATLLSSDFLDAFKDS FT VTDAYSHQRVSWRFIPPGAPHMGGLWEAGVKSFKTLFYKATSTRRYTFEEL FT STLLAKIEACLNSRPLSPMSDDPTDLLALTPGHFLIGGPLMSTAEPEIKGN FT LNSIINRWQHLKALNQQFCQRWKEEYLKELHKRTKWQTPTPNLQVGDMVVI FT KEDNLPSNEWRLGRITSVYPGADDRVRVVDILTARGTLKRPIVKVVLLPVE FT PRSSIQQ" FT CDS 6798..7742 FT /product="BEL-1_DPer-I_3p" FT /translation="MAPRIRASRTTESRRDRGINSYRCRVCRGIHPLRTCH FT HFLRLSPEKRLRAVLINKYCGNCLAHQHSGQDCRSGGLCRVCGKNHHTLLH FT IPSSRRPVRSASGASSPSTAPHVKRRPRRPARSASAASSPSTSAHADRRPR FT LPVSRPVAGPPAPSVDSLLQHRSLILLPTALVVLDTGSKNFETAALIDPCT FT PVSCIDDSLASAFKLPTTCVGDEKVCTAVIRAKMGDFQLETMLKVEPRVRI FT RTPIRQLSDSVRARFGDLRLADEQFHRPATISLILGADVYPKVIQPGFLAR FT EDGLPVAQSTVFGWIVSGACTQP" XX SQ Sequence 7788 BP; 1837 A; 2301 C; 1872 G; 1778 T; 0 other; ttggtccttc gagccggatc ttcgccttcc ccataaggag aagctcaccc cagccagcat 60 cggcggatcg ctccaagtgt aagataagta caacccttcg attgtgccca acaccagtgc 120 agtgattttg tgcgagtttc ccatcccagt ccgagtgccc gcggcataca tatgtacata 180 tgtatatgcc tttctcgctc tagctccaaa ttaaaccata cttcgttctg ccaacggttg 240 tggcctccta tttctccaaa ggtttaattt gatccgagct agctcctgcg ttcctcctcc 300 gatccagtgc ccacattaat atatacatag ctgtctgtaa atcatattcc taccgctccc 360 agctaccagc tgagctttcg ctgctgacaa ctgcacatgc acaccaacat acatacatat 420 gtacatatgt atgtgtaagc aacgcaatcc aagcggctgg caacaaacaa aaagacatat 480 tcttgcaatt aatttggtgt tcattccgtg catgctcaac gggttacagt tctttcgttt 540 ggtttaaaga atatagaatc acaaatcaca accaatcttt atttaactca ttatattgcc 600 aataaaaata aaacacgact ccatacatat catcttacat atgcgcacgc attttcaact 660 aacaactctt gcacatctgt atatacatac atatattgta ttacctcata aaacggttac 720 tccgtttttt ttttacccgt tgtttcaatc tcgtcaaaag ggagggtcgt taatccaagc 780 cacggtacct agcacttaca cccatacaca cacacacata caagcaacag cagcgacgca 840 gtctgttcat ttttcctttc gctttcgctt ctccaccgtt tgcttgcgtt gttgttggtg 900 ggagaatatt tttcggtgcc gcgcttgacc gtaacggaac gccaagtgac gcgctaccct 960 gcttttcaag ggtatattgt tttgaacact agttttgtaa ccgactctat atagggggac 1020 cttgtgcatc aatatcattg ctcatccatt agtcgctctt ggttggacat ttttttttgt 1080 atttgatcgt agtattttat atttatattc cggccacgga gcccgacttc tactatataa 1140 atactcgcga taaattccgc caaggccaca tatccacggt tcgtcgttca aaacttccac 1200 ttctccattc cgctcacctg ctttgcgact ctgatacgcg ttatcgcaaa tccatcagcc 1260 aaccatgccc actggtgata agaaatctcg caagaagttt aagcgggtga gttcgctgag 1320 tttaacccat ccacccagct ccaacggctc aatcgctacg cccacatctt cccgtccgtc 1380 cccaactgag cgaggtccga ccacgtccag ggctcacgca ccacgtgcct ttgaggcctt 1440 gttgctcccc tcggtcgtcc ctcgcacggt tccgagcact atggcccaat cggccgccag 1500 ttccgcacgg tctaagtttg ccttggctgc aagcagactg acacgcttcg accagaagct 1560 cagcgctcca gatgctagca caccaacccg ctcgctctcc caagtccatc gagatcaact 1620 ccgaagcctg tgggcgaagg tggacgcgga gttcgaggcc tgtgtggagg cgataacgga 1680 agtagatcca gagggggtag ctgacgtcca ggggatgtac gatgactgct acgcggtcta 1740 cgcgcaatgc cttgccgaac tgaacgatca gctcgaaccc ccgactgtcc ctcacacgcc 1800 tcaactggcc gttcaagtgc cgacgtccgg cggttgccgt cttcctccat gtgacactga 1860 agttttcagt ggggactatc agcagtggcc cacgttccga gacctgttca ccgccatata 1920 tatagagaac cctaggttgg ctccagtaga aaaactgttc cacctcaaca aaaaaacaag 1980 cggtgaggcc cacgacatag tggcacaggc tccactcacg aacgaagggt tcgcgtccgc 2040 atggagcgcc ctccgtgagc gtttccagaa caagaggctg atactcaaag cccagctgaa 2100 gatccttttc agtctgcccc aaatccgcac cgagtcagct gcagcgctaa aagagcttca 2160 gagggccgtc cacaagtgcc ttacgacact caaccactcc gaggtctcca cagacagcat 2220 tttcgctgac ggagtgttag tgtacctcat atccgcgaag ttgccgaaaa cgactttgga 2280 gctttgggag cagtccgtga cccacaagtc cgaaattccc acctggcacg ctatggacaa 2340 gttcctggcg gagcggtacc tgtctctcga ggtgaccgag gacggccatc caggcatgga 2400 gactcaggct cactccaggg cgagtctacc catgtcagag aggtcaaagg gcaatcccaa 2460 ccctttccgt tccccaagag agccctgttg caaggcgggt tcactctttc ggaacaactg 2520 tggatccgaa gcggagagcg tgtgatcttt gctccaaaga gaaccatccg atccgggtgt 2580 gccctcaatt ccttcagatg agacctgatg ctcgctccag ctatattaag caaaaaatgc 2640 tttgcttaaa ctgtttcgca agagggcacc agattcgtga atgccaaagc gcccacaact 2700 gcaacacgtg tggagaccgg catcatacgc tgctgcaccg tggcactcca gtttccagcc 2760 aatccgtgcc caattccgta ccacttccga ctcctgccga agctcagcct agcgttcaga 2820 actatttcgc ctccggcaag agggcggtac tcctaggcac ggctattatt aatatttgcc 2880 acttggggac aaattttcgc gcccgcgctc taatcgaccc gggctccgag gcgacgttca 2940 ttacggaacg tctcttccag atcattaagt tgccattccg actcgcccag gcacaggtct 3000 cgggcctcaa tcagacagta tccgctcagt ccaagaaact ctgccatttt tccatccgtt 3060 ccccgactaa gccgggctta caattggacg ccacagccta tgtcctgccg gaactatcag 3120 gcagccttcc ctcccatccg atcccgcaac actcgttgcg agacttgcca aacctgccat 3180 gggcagaccc gacattttac gaaagctcac aaatagatgt cctgattggt gccgacatcc 3240 ttccatccat cctcttgagc ggctcacaga cgaacatctg tggatctctc ctcggacagg 3300 aaaccatttt cgggtgggtt cttacaggcc cagtgccaaa cacggtacaa ggccgggtcg 3360 catccttctc cacgcaaatt tccaccgagc tagagactcc gttggacaaa ctcctcacaa 3420 agttttggga ggtggaggac attcctacta aaatagaaag cgaatcggat tcgtactgcg 3480 aaagaaattt cctccgaact acgtcaagaa cgccaagcgg gaaatacgtc gtcacgttac 3540 cgtttcgcga cccagatcat cccggatcag atctggggta ttcaagggca gccgcgctgg 3600 cccagttcct aagaaacgaa aatcgcttga aaagaaacgg tcccctacaa gagcagtacg 3660 acaccgtaat ccaagaatac ctagagcttg aacatatgac agaagttcct ccgactcatg 3720 ggtcctcaac ctactacctt ccgcaccacg ctgtcttcaa gcctgagagc actaccacga 3780 aggtccgcgt ggtatttaac gcatccagcc cgtcgaccaa cggtgtgagt ttgaatgatc 3840 ttctgcacgc cggcccggtc ctccaatctg acttgaccct ccagatcctg aagtggcgct 3900 atttccggta cgttttcaac gcggacatca ccaagatgta tcgccaaatt tgggttgacc 3960 cgaaacacac ccccttccag cgaatattat ttcgaaacaa ggagggactc atccgcgact 4020 acgaacttaa gaccgtaacc ttcggagtca attgcgctcc ttttcttgcc atccgagtgt 4080 tgcaacagct agcaagcgac gtccagtcca gatttccgaa agcaagtcgc attatccgat 4140 cattcatgta tgtcgacgat gtcctagccg gagcggattc taccgatgag gctcgactaa 4200 cgatccgcga gttgcaggcc gcacttagct ctgcaggttt tccgctgcgg aagtggactt 4260 caaaccacaa agctattctg gcaggaattc cgagtgctca tcgtctccac acagattttc 4320 tggagatgga ggaagagagc acggctaaga ctctcgggat tcgctggaaa gcgacttcag 4380 acgagttttt ttttgtccct cccgagttgg cgcctgagtc gtcatacacc aagcgagcag 4440 ttctgtccca gatcgcaagg ctgttcgacc ccgcggggtg gttggcccca ttcattgttc 4500 ggtccaagat cttcatgcaa gaaatctggt tgcaggacct gggatgggac gatgagcttc 4560 caagcgagat gcgccagcgc tggcaaagtt tcctgcggag ctactccgct ctcgaccaaa 4620 tccatattcc aagatgggtc ggctcccggc cagcagtaaa agtcgaacat catgggttct 4680 gcgatgcatc cgagagggct tacggtgccg ccatctacgt ccgcattgag gttgaccgtt 4740 tggtcgaggt gcagcttctc acagcgaaaa cgcgagtcgc acccgtgaaa accgtgtcgc 4800 tcccccggtt agagctttgc ggggcagtgc ttttgtccga aatggcggcg gctatcctgc 4860 cgaatatgcc aacagcaagt acgagctgct attgctggac cgattccacc atagtcctcg 4920 cctggctggc aaagcccgcg tgtcactgga ccacgttcgt ggccaaccga gtgactagga 4980 tatctcaagc gaccgatatt gagaagtggt gccacgttcc atccgaacag aaccccgcgg 5040 acttagccag cagaggcgtg ccgttacagg agttggtgga gaaccaactc tggtggcacg 5100 gaccaacttg gctgcagaag ggccgagatc aatggccggc accagttaat aactcccccg 5160 ttatgacctt agagcagcga actgtaaaag cccatttcgc actcaaccca gccgaagact 5220 tcctcgaacg attctccaat ctggagagag ctctacgagt ccgtgcatat atcctgcgct 5280 tcaccaagcg ctgccggaag ttagccaccg cgcagaaggg ccatcttacg agcggcgaaa 5340 ttaccgaagc tgaaaaaact cttatcctag agacgcagcg tagagaatac cccgaggagt 5400 accgctgcct aagcggtaag cggccaacgc caaggtcaag ttctatcctg aacatgaacc 5460 ctttcctaga ccgccatggg ttgatcagag catgcggccg tgtcgcaggc tctgaagtgc 5520 taagatacga cgaacgacat ccgatcattc tcccatataa ttgtcaattg tctcgccttc 5580 tcgcgcaatt tacacaccgg ataactcttc atggcggcaa ccaattgatg gtgcgcctca 5640 ttcgatcaaa gtattggatt ccaaaggttc aaaggcttat gaagggggtt gtgaactcct 5700 gcaaggtctg cgttattcac aaaagaaggt tgcaaaccca aatgatggga gatctcccaa 5760 ccgagcggtc gtccttttcc agaccgttta cgcatacagg ggttgattac gcgggtcctt 5820 tcgaaatacg gaactataca gggagagcat gtctgatcac gaaggggtat gtgtgcgtat 5880 tcgtgtgttt ttccacaaag gcgatacatt tagaacccac gtccgacctc accaccgaaa 5940 aatttctcgc cgccttcgct cgtttcgtcg caaggcgcgg atgtccacag cgtatccatt 6000 cagataatgg aaaaaccttc gttggtgcag caaccctgct ttccagtgac ttccttgacg 6060 cgtttaagga ctcggtgact gatgcgtata gccatcagcg ggtttcgtgg cgcttcatcc 6120 caccaggagc tccacatatg ggaggcctat gggaagccgg agtgaagagt ttcaaaactc 6180 tcttctacaa ggccacgtcc actcggaggt atacgtttga agagctttct acgctgctcg 6240 cgaagattga agcgtgcctc aattccaggc cgctctcacc gatgtccgat gatccgacgg 6300 atctgctagc cctcactccc gggcacttcc ttattggggg gcccttaatg agtacggccg 6360 agcccgaaat aaaggggaac cttaattcga tcatcaatcg gtggcagcat ttgaaggccc 6420 tcaaccaaca gttctgtcaa aggtggaagg aggaatacct caaggagctc cacaaacgga 6480 ctaagtggca gacgccgacg ccaaatctgc aggttggcga tatggtagtc atcaaggagg 6540 ataacctgcc gtccaacgaa tggcggctcg gaaggataac ttccgtgtat cccggtgccg 6600 acgacagggt ccgtgtggtg gatatcctta ctgcccgcgg taccctcaaa agaccgattg 6660 tcaaggtcgt tctcctgcca gtagaacccc gtagttccat ccaacaataa cgtgaatgtg 6720 cacttgtccc attcacagct ccgtactaat acttcttcga cttctctatt ccatcctagt 6780 tccatcctca tccagacatg gccccacgga tccgcgccag ccgcaccacg gagagccggc 6840 gtgatcgagg gatcaactcg taccgttgcc gggtctgccg aggaattcat ccgctacgga 6900 cctgtcacca tttccttcgc ctgagccctg agaagagact ccgagctgtc ctcattaaca 6960 aatattgcgg gaactgcctg gcccaccaac actccggcca ggattgtcgc agtggcgggc 7020 tgtgccgggt gtgtggaaaa aaccaccaca ctctactcca cataccctcg tcgcgtcgtc 7080 cagtccgctc ggcctcagga gcatcgtcgc catctacagc gccccacgtc aaacgccgac 7140 ctcgtcgtcc agcccgctca gcctcggctg catcgtcgcc atccacctcg gcccacgccg 7200 accgccgccc gcgtctcccc gtgagccgcc ccgtggcagg gcccccggca ccatccgtcg 7260 actcacttct acaacatcga agcctcatcc tactcccgac ggcgctggtg gtcttggata 7320 cgggttccaa aaacttcgag acggcggcgc tcatcgaccc atgcacgccg gtgagctgta 7380 ttgacgactc cttggccagc gccttcaagt tgcccaccac ttgtgtgggg gacgagaaag 7440 tgtgtacggc ggtgatccgc gccaaaatgg gcgacttcca gttggagacg atgctcaagg 7500 tcgagccccg tgtgcgcatc cgcacgccta tccgacagct gagcgattcc gtgcgggcgc 7560 gtttcggtga cctgaggctt gccgatgaac aattccatcg accggcgaca atctcgctca 7620 tcttgggagc agatgtctac ccgaaggtga tccagcccgg gttccttgcg cgtgaggacg 7680 gcctgccggt ggcccagagc accgtttttg gatggatcgt gtcgggggcg tgcacccagc 7740 cataggccac cccattatac tttgcaatcc tgcaaggggg ggggagga 7788 // ID hAT-4_BF repbase; DNA; INV; 4425 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-4_BF autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4425 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4425 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 925-925 (2008). XX DR [2] (Consensus) XX CC The transposase contains a zinc finger (pos. 49-125). XX FH Key Location/Qualifiers FT CDS 1371..3599 FT /product="hAT-4_BFp" FT /note="transposase." FT /translation="MYLRSETSNVFCDNVKALTTAQKYNFLCHHSKPPEDK FT TFPTQYLGGCNRSFRHKWLGKHKWMVYSEKVDGIFCMPCVLFAADPSKGQL FT VTKPFRVWNKKSEKVKTHETNCDYHDEAMEKATLLKQAVERPHTTIAAQVD FT ARKAVNIQNNREVIKSIARAVLFCGRQCIALRGDKEDLNSPGNPGNFLALL FT RLMAVTDGVLQKHLQAPAMRNATCMSPQTQNELIEVMGKHMILKGILEEIN FT AAPFYSIMADEVTSHNTELLAICARFVDSNNNIREDFLKFIQVDRITGRSI FT AEAILQFLRENGIPTRNMRGQGYDGASNMSSKSAGVQARIKEVAPQATYIH FT CNGHCLNLVISKSCALPQIRNVIDRLQDCCRYFLHSPKRSGALEKVINHNI FT GDETKRKPLLDLCKTRWAERHSAYQHFYQAYVFIVETLEMIAYSRHLDKYG FT DTYADWDTGNRSDAHQILKSITSYEFIVVFLVVYQYLSHLAGITVKLQGRA FT VDIVEAHEMVTEIQEIYKKEREEVEKGFGHIFEQSQRMAEKVGSAPEMPRI FT AVRQQHRANAMAASPREYYQRNVVIPFLDHITTSLSDRFAASAKIATSLIG FT LVPTIVCSRDVRLEDAVAQYEADLPSPELFPMELNRWKNHFMPDPPELRPA FT SPAEAIKRCDTTMFPNIGVLLKIACTLPVTSCECERSFSALRRLNNYMRAS FT MGKIRLSNLALLHIHYDTDIDLDKVVDCFAQLHPRRLELDNLL" XX SQ Sequence 4425 BP; 1185 A; 1038 C; 1092 G; 1059 T; 51 other; cacgcccgta gccagggggg gttcgggggg ttcggacgaa cccccccatt tgatcgaagg 60 tccgcttttt caagtttttt tttttttttt ttttgactgg tttgtatctc gagcgggact 120 ggattcctat aaaacttcac tgtctttggt cacttgaata aaggtgaacg ctgtggtttt 180 caattagctt ttcccgagtc cgcattatac tgataaaggc ccgggggcgg agcgcggcgc 240 gtgtcaaaaa caaactgtca ccgcttgaaa ctgttggtcc gctgtgagac acgccccggc 300 acgcgacacg agtgtcgcct gactgttgct atggtaaagt actaatggcg cccgaaagtc 360 tcacttcgct agctacaatc ggcggcctgt gaacgaatcc ggcacacgtc ccctggtgtg 420 atctaaagct tgccgctgga cacgattacg tcaggatatt gtcccccgtt tctaagagaa 480 aggatattta aattacatgt acaagagagc cgcaaagcta tcaattttcg tgtcggtggt 540 cggatacaaa ttgaacgggg ctacctcact aaatcacaaa aataatcgca gcacttatag 600 ttatagttcg aaacttgtat tagcgctagt caatgtgcta cttgcagttc atcaaatcct 660 gtgggtgacc gcgtaatggg cccttctaag gcaaacatac gtaccgtaaa gtcttgtcac 720 ggtgatgtgc caaacggtaa gaaatgaaca gttaattgaa tttaggttat gacgggatgt 780 tgtcaataat gtcaacaaat tataccgcct ctgttgacag cacggggccg ccgcgcgagt 840 tctttgttct tatgtggagg cgccaatcgg cggttagatc agaggcactt gtcggcttac 900 ttgcaaaacg tcagagtcac tgtcattcag gacagtgtac acgtgcgaca cgtgtcgaga 960 agtgtcactg ctagcaaacg tttctttttt cttttatctg tgatcaaact gtgatcgaag 1020 tcagtccgtg aacggtgtac cttctgtatt atgtattgat aaaatattgt ggcatcctca 1080 tggggcggaa agagcagaag gcgcgagaga agaaacgcgc ggcagctacg tgccacaagc 1140 tggacagctt ccttccgaag aaaacgaaga cggcacagct tttggatgag gcctcggaaa 1200 gtcggtcatc aactttggac gctcagccgt caacgtcaga cgctcagccc tcgacatcgg 1260 acgctcaaat cgacaatgta ccactcactg taagtactaa tccggtaagt ggcgaagtga 1320 gctgttatca ggacgaaggc ggcggcagca acattgtcga tattggagag atgtacctcc 1380 gttcggagac cagtaatgtg ttttgcgaca acgtaaaggc tctaacaaca gcccagaaat 1440 acaacttcct ctgccatcac agcaaaccgc cagaggataa aacatttccg actcagtatc 1500 ttggcggctg taatcgcagt ttccgacaca aatggctcgg aaaacacaaa tggatggtat 1560 atagcgagaa ggtggacggc atcttttgca tgccgtgtgt gctttttgcc gccgatccgt 1620 caaaggggca gttggtgacc aagccattca gagtgtggaa caagaaaagt gagaaagtca 1680 agactcacga gaccaactgt gactaccacg atgaagcaat ggaaaaggct accttactca 1740 agcaagcagt ggaacgaccc cacaccacaa ttgccgccca agttgatgca agaaaggcag 1800 ttaacatcca aaacaaccga gaggttatta agtctatcgc cagagcagtg ctgttctgtg 1860 gcagacaatg catcgcacta cgaggcgaca aggaagacct gaactctccc ggcaacccgg 1920 gaaacttcct ggcgttgcta aggttgatgg cagtgacaga tggtgtgctg caaaagcact 1980 tgcaagcacc tgccatgaga aatgccacat gcatgtcccc acagacgcag aacgagttga 2040 ttgaggttat ggggaaacac atgattttga agggaatctt agaagagatc aacgcggccc 2100 ccttctacag tatcatggcg gatgaagtta cctcgcacaa cactgagctt ctcgcgattt 2160 gtgccagatt tgtcgacagc aacaataaca ttagggaaga ctttctcaag ttcatacaag 2220 tagatcggat cacagggaga agtattgcag aggctatcct gcaattcttg agggagaatg 2280 gtatccctac ccgtaacatg cgagggcagg gatatgacgg tgccagcaac atgtcgtcaa 2340 aatccgcagg tgtgcaggca cgtataaagg aggtggctcc tcaggcaaca tatatccact 2400 gtaacgggca ctgtctcaac ctggtcatca gcaaatcgtg tgccttgccg cagatacgca 2460 acgtgatcga ccgcctgcag gattgctgtc gatatttcct gcacagcccg aagaggagcg 2520 gtgcgcttga gaaggtgatc aatcacaaca tcggagacga gaccaagagg aagcctctcc 2580 tggacctctg caaaacccgc tgggcagagc gtcacagtgc gtatcagcat ttctaccaag 2640 catacgtgtt cattgtagaa actctagaaa tgatcgcata cagccgccac ctcgacaagt 2700 acggagacac ttacgccgac tgggacaccg gtaaccgaag tgatgctcat cagattctaa 2760 agagcatcac ttcctacgag ttcatcgtag tcttcctggt cgtgtaccag tatctctccc 2820 atctagcggg gatcaccgta aaactgcagg gaagggcagt ggatatcgtg gaggcccacg 2880 agatggtaac ggagatccag gaaatctaca aaaaggagcg ggaagaagtt gagaaagggt 2940 tcggccacat cttcgaacaa agccagcgaa tggccgagaa ggttggcagt gctcctgaga 3000 tgccccgtat cgccgttcgg caacagcacc gagccaatgc catggcagcc agtcctcggg 3060 agtactacca gagaaatgta gtcatcccct ttctcgacca catcacaact tccctgagtg 3120 accggtttgc agcgtctgcc aagatcgcga catcgctgat cgggctcgtc ccgaccatag 3180 tgtgctcgag agacgtccgt ctcgaggacg cagttgccca gtacgaggcc gatctgccat 3240 cgccggaatt gtttcctatg gaactcaacc ggtggaaaaa ccacttcatg cctgatccac 3300 cagagttgcg gccggcgtct cccgcagagg caatcaagcg ctgtgacacc acgatgttcc 3360 cgaatatcgg cgttctgctg aaaatcgcgt gcactctccc agtaacatcg tgcgagtgtg 3420 agaggagttt cagcgcccta cgaaggctaa acaactacat gcgcgcttct atggggaaga 3480 ttcggctgtc caacctggca cttctacaca tccactacga cactgacata gacctagaca 3540 aggtagtcga ttgctttgct caactccacc cccgcagact agaactggac aacctgttat 3600 aagaccgcta agaccaaaga caattaggtt aagaataaac ggttatgagt ttaagaacaa 3660 atgtttatga gttagactgt taagcatgtt agatgttagg ttaagattat caaagttggt 3720 aagactgtta aggctgttta ttatgtgctg ggctgttagg ttgtaatcag aacataatta 3780 tataatgctt tatatttaag actggttatg tgtgtggacc atttcatgac ggatatactc 3840 aatatttatt tttcttcttt ttccttaaga taaaagaccg ctaagacaat tatgttaata 3900 tttcgttaag aataagttaa gagttagact gttactgtcg gatgttaggt tacgactacc 3960 aagttgtgat tgtgaaggct gttaggtgta aggttaagaa tattaagtta cgattgctaa 4020 gactgttagt ttagaactgt tcaacatgat tatatatgct ttatataaag gctgattatg 4080 tggatcgagg ccatttcatg acagatatat atactcagca cttgtttttc ttccaataaa 4140 tctgactcag actctaaaac ggtcatgtgg gtgtctctga ttattctcat aaaaccctct 4200 ccaaaacgcg ggaaatgccg tttcagaagg ttccagtttc aaaattttcc gggggagatg 4260 cccccggacc cccctaggag ccttgcgcct tcggcgcaag ctctcgcgcc ttcggcgcga 4320 gccctcgcgc cttcggcgct cgagatgnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4380 nnnnnnnnnn nnnnnnncnc atcaaaaatc ctggctacgg gcatg 4425 // ID Gypsy-57_CQ-I repbase; DNA; INV; 8597 BP. XX AC AAWU01017621; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-57_CQ_; KW Gypsy-57_CQ-LTR; Gypsy-57_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-8597 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 493-493 (2011). XX DR GenBank; AAWU01017621; Positions 18928 10332. XX CC Positions [4504-4977] - Integrase core CC 'ACGC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 694..5331 FT /product="Gypsy-57_CQ-I_1p" FT /translation="MANLETKDLIQLIPEFDGSLDSLDQFLMLIDYYADQI FT PEGEDQRKFLNIVFMKLKFKAAARINRIYASTWEETKANLIREFGIKTTFG FT SIIEQIETLKQGRDESFKSYAGRVLDIKYEIIKIDQSYNENSFTAYSLKSH FT IIVGIINNEIKEIATNNRHMKLENLLEFLENERIDRESLRTLEQRLLASTE FT NSNHNKFRNFRYNKNTNFYQRNNYQANKFYECYKTNTAKSTLKRTYTNSRR FT NKNQIPNNFKLQFNEYIPPHETLKINFNNDSIFYANIATLARPTKFHKFII FT DSASCNNFVRWDVVSNMRLTNIDFNDRISIRGVHGIAEDTVGSVNMILLIG FT NSKYEEKFYILKHFADDAILGAHFLKKYTLYISQSFDYIVLRTPDTKNMQY FT HNHAMTNLFRNNDIENINKVNMAGIEANSYDNFNTNYDENNRNNYDNDNGC FT NGRIQPYIEEYYDDQIEIDDDNQDYPKEMNLENDRDYDVIENMKHEKLTGN FT DRMRKIYELVKTDHLKGDINQEINNILSDFNEIFYLEGDELTYTNLTMHDI FT ETTTEIPINKRQYRFPEATKKHVEEQVEEMLRLGIIRPSKSPWNAPVLCIP FT KKDLDAEGNKRYRIVVDFRDLNTITKPFVYPIPLISEILDSIGEARYFSTI FT DLKSGFYQIPINPKDAAKTAFSTFLGHYEFLRMPMGLKNSPSTFQKFTFTL FT IYEIQPVNAFLYLDDIIVFGRTIEEHNENLYKVLNALHEHNLKVEPSKCKI FT LRKEVRYLGHIISEEGIRPTSENIDTIKNMKRPQTIKNVRSFLGTVNFYGK FT FIPNIADKRKPLNALLKKNVKFIWTEECENAFEELKNYLISEPVLVRPNYN FT DKFVLTTDASDYAIGAVLTNEKSNDHPIAYASRALIGAERNYFTIEKELLA FT IVWAVDHFKHFIYNQDFIVYTDHRPLVALWHLKETSPTLTKLRLKIQGIGC FT EIRYKQGKENVVADFLSRLNNDEVHADDKQASKLVAITTRQQAKQNRQNIS FT DNQNSTNTPNRQNSSRNNNNWNNNMNGLQQIDINLDNDDDSNDKTFTYKDF FT QDTDIDFNVLKFSKNIIPFEQAEATFMILNSVTVHKELSKYIDLPHGIKDY FT TKENIFVFPQKKIWGLILNGTYRSAVKNEEFFDGLLKCFDDYPDFAKDASV FT IQIISHRIFKTELELNLLRFFAMKLSKSFTLYATENERIYVKPKDRDQVLK FT DFHDAPLGGHVGGKRMIKRMSPLFTWENMRRDVLNYVKQCDSCQKNKIWPA FT NKMPMKITTTSYEPFDKIYMDVVMLPISNNGNNCGLVIQDDLTRFLIVAPM FT ENQESTTVARTFVENFVCKFGAPKEVVTDRGTNFVSKLLQHTCKILHIKKI FT VTSAYHPQANLVERSNRELKVYLRNFIGKDPQCWDELIPYFMFEYNTTENS FT STGYSPYELLYGRKATIPSTIYKINDSDLNYDDYICSMKDIFKDAHETARN FT NLILSKEKRKEIYDKKTNDWVPMWGDRVLVQMVQTGIGQKLQNKWRGPYDI FT VKFNSDQTTTIKNGNKFEDVHNNRLRKYND" XX SQ Sequence 8597 BP; 3494 A; 1183 C; 1492 G; 2428 T; 0 other; ggtggtgact agcggaaagc aacgctgtgt gtgtttggac acgccattat gattggacaa 60 aaccgagagt tgaaaatgtg actatgtgtg aaagttaatg aacaaaatga tttcgcgaag 120 taacaaacag ttaaaatagt gtaaaaaaag taaaaaaacg ttaaattgaa gaaaatgtta 180 tgtgaggagt tctaagatgg aagcaaatgt gcagcagtac gagatggaat caaaacgggc 240 tcgatcaatg gcagaagcat cgaaaatttc aaaaccggcc aacgatggac aacactaggg 300 attcaacaat gacgagatca tctggatcaa ctggagacga aaatcagcca catcggacac 360 ccaagaagac gaccatggaa atctggcagt ttagggtgag taaacaaaag ttataccgaa 420 atttgttgta aatagttaaa aacacgctta aaagggtgta cagcaaccaa ggaagttgtt 480 gtcttcttat gccaaaattg actaacttac ggacccaatc ctgcacaaat ctgattgtaa 540 aaacaggcaa attttgttgt aaatcactta gtagacagta ctttagggga tgaataaaac 600 attatatcga tagttcccgt agtttcgaaa atactaaaat atctgtaaca aaaatcttgg 660 tgcaaaaact cttgttgatg tgcaccgtta aagatggcga atttggagac gaaggattta 720 atccagctta ttccagaatt tgatgggtca ttagactcac tggaccaatt tttaatgtta 780 atagattatt atgctgatca aattcctgaa ggcgaagatc agagaaaatt tttaaatatt 840 gtttttatga aactaaaatt taaagcagca gcacgtatta accgaattta tgcaagtact 900 tgggaagaaa ccaaggcaaa tttaattaga gagtttggta tcaaaactac ttttggtagt 960 ataattgaac aaattgaaac tctgaaacaa ggacgggatg aatcatttaa atcatatgca 1020 ggtagagttc ttgacataaa atacgaaatc ataaaaattg atcaaagtta caacgaaaat 1080 tcatttacag cttatagctt aaaaagccac attatagttg gaattataaa taatgaaatc 1140 aaagaaattg ccacgaataa ccgtcatatg aagttggaaa atttgctaga attcttagaa 1200 aatgaacgaa ttgacagaga gtctttaaga actctcgaac aaaggttact tgctagtact 1260 gaaaattcaa accataacaa attcagaaat ttcaggtaca ataaaaatac aaatttttat 1320 caaagaaata attatcaagc taacaaattt tatgaatgtt ataaaacgaa tactgcaaag 1380 agcactttaa aacgaactta tacaaatagt cgacgcaata aaaatcaaat tccaaataat 1440 tttaaactgc aatttaacga atacattcca ccacatgaaa cattaaaaat caatttcaat 1500 aatgattcaa tattttatgc aaatattgct accttggcaa ggccaacaaa atttcacaaa 1560 tttataattg actccgcatc atgtaacaat tttgtaagat gggatgtggt tagtaatatg 1620 cgtttaacta acattgattt caacgataga atttccatta gaggggttca tggaatagca 1680 gaagatacag taggatctgt aaatatgatt ttattgatag gaaattcaaa atacgaggaa 1740 aagttctaca ttttgaagca tttcgctgat gatgcaattc ttggagcaca ttttctaaag 1800 aaatacacct tatatattag tcaaagtttc gactacatag ttttaagaac gcctgataca 1860 aaaaatatgc aatatcataa tcatgcaatg accaatttat ttagaaacaa tgacattgaa 1920 aatataaaca aagtaaatat ggcaggtatt gaagcaaata gttatgacaa ttttaataca 1980 aattacgatg aaaataatag aaataattat gataatgaca atggctgcaa tggcagaatc 2040 caaccgtaca ttgaagagta ttatgacgat caaattgaaa ttgatgatga taaccaagat 2100 tacccaaaag agatgaattt agaaaacgat cgggattatg atgtgattga aaacatgaag 2160 catgagaaac ttacaggcaa cgacagaatg agaaaaattt atgaattggt taaaacagac 2220 cacttgaaag gtgatatcaa tcaagaaatt aataatattt tgagtgattt taatgaaata 2280 ttttatttag agggggacga gttgacttat accaatttaa ccatgcacga tatagagaca 2340 acaactgaaa ttccaattaa taaaagacaa tatagatttc ctgaagctac caagaaacac 2400 gtagaagaac aagtggaaga aatgctaaga ttagggataa tacgcccaag taaaagtccg 2460 tggaatgcac cggtattatg catcccaaag aaagacttag acgccgaagg caacaaacgc 2520 tacagaattg tagtggattt tcgagaccta aatacaatta ctaaaccatt tgtgtatcct 2580 ataccgctta ttagcgaaat tttggacagt attggagaag ctcgatattt ctcaaccatt 2640 gacttgaaat caggatttta tcaaatccca attaacccga aggatgctgc gaaaaccgct 2700 ttttcaacat ttctaggaca ttatgaattt ttaagaatgc caatgggact gaaaaatagt 2760 ccatcaacat ttcaaaaatt tacattcaca cttatttatg aaatacaacc agttaatgcg 2820 tttttatact tggatgatat tattgtattc ggtagaacta tcgaagaaca caacgaaaat 2880 ttatataaag ttttgaatgc attacatgaa cataatttaa aagtcgaacc atcaaaatgt 2940 aaaattttac gcaaagaagt cagatatttg ggtcatatta ttagtgaaga agggattcga 3000 ccaactagtg aaaatataga tacaatcaaa aacatgaaaa ggccgcagac gattaaaaat 3060 gtgagatcat ttttgggaac tgttaatttt tatggaaaat ttattccaaa cattgcagat 3120 aaacgtaagc cacttaatgc attattaaag aaaaatgtga aatttatttg gacagaagaa 3180 tgcgaaaatg cttttgaaga attaaaaaac tatttaattt cagaaccagt tttagttaga 3240 ccgaactaca atgacaaatt tgttttgaca actgatgcta gcgattatgc tatcggagca 3300 gttcttacta atgagaagtc aaatgatcac cccatcgctt acgcaagtcg tgcgcttatc 3360 ggggctgaaa gaaattattt caccattgag aaggaactac tagctattgt ttgggcagta 3420 gaccacttca aacattttat ttataaccaa gattttatcg tttacacaga tcatagacca 3480 ttggtagctc tatggcattt gaaggaaaca tcaccaactt tgacaaagct tcgtttaaaa 3540 atccaaggca ttggatgtga aattcgttac aagcaaggaa aggaaaatgt ggttgcagat 3600 tttctctcac gtttaaataa tgatgaagtt catgcagatg ataaacaagc atcaaaatta 3660 gtagcaatta caactcgtca acaagcaaaa caaaatagac aaaatatttc agataatcaa 3720 aactcaacaa atactccaaa cagacagaat tcatctagaa acaataataa ttggaataat 3780 aatatgaatg gacttcaaca aattgacata aatttagata atgacgatga tagcaacgat 3840 aaaacattta cctacaagga tttccaagat acagatatcg attttaacgt tttaaaattt 3900 tctaaaaata ttataccatt tgaacaagcc gaagctacat ttatgatatt aaacagtgtt 3960 acggtacaca aagaattgag taaatacatt gaccttccac atggtattaa ggattacacc 4020 aaagaaaata tatttgtatt cccgcaaaag aaaatttggg gtttaatttt gaatggaaca 4080 tatcgttcag cagttaagaa tgaagaattt ttcgatggtt tattaaaatg ctttgatgat 4140 taccctgatt ttgctaaaga tgcatcagtt atacaaatta tatctcatag aatatttaaa 4200 acagagttgg aactaaactt attacgattt tttgcaatga aactttcaaa gtcatttaca 4260 ctatatgcaa cagaaaatga acgaatttat gtaaagccaa aagacagaga tcaagtgctt 4320 aaagattttc acgacgcacc gttgggtggc catgtcggag gtaaacgaat gattaaaaga 4380 atgagtcctc tatttacatg ggaaaatatg cgaagagatg ttttgaatta tgtaaagcaa 4440 tgtgattctt gtcaaaagaa taaaatatgg ccagcaaata agatgccaat gaaaattaca 4500 acaacatcgt atgaaccctt tgataaaatt tatatggatg tggttatgtt accaatttct 4560 aataatggaa acaattgtgg acttgtgata caagatgatt taacaagatt tttgattgta 4620 gctcccatgg aaaaccaaga aagtactact gttgctagaa cttttgtcga aaactttgtt 4680 tgtaaatttg gtgctcctaa agaagttgta accgatcgag gtacaaactt tgtaagcaaa 4740 ttattgcaac acacatgcaa aatattacac attaaaaaga tcgttacaag tgcatatcat 4800 cctcaagcta atttagtaga gagatcaaat agagagttga aagtttattt acgaaatttt 4860 attggcaaag atccacaatg ttgggatgaa ttaataccat attttatgtt tgaatacaat 4920 actacagaaa attcatcaac tggatattcc ccctatgaac ttttgtatgg aagaaaagct 4980 actataccaa gcacaattta taaaatcaat gattcagatt taaattatga cgactatatt 5040 tgttcaatga aagatatatt caaggatgct catgaaactg ctagaaacaa cttaatattg 5100 tcaaaagaaa agagaaagga aatttatgac aaaaagacaa atgattgggt accaatgtgg 5160 ggagatagag ttttggtaca aatggtacaa acaggaattg gtcaaaagtt acaaaataag 5220 tggcgaggcc cttacgatat cgttaaattt aacagcgatc aaacgaccac tattaaaaat 5280 ggcaacaaat ttgaagatgt acataataat agacttagaa aatacaatga ttaatattag 5340 ataaagtttt atatcaaaat ataccagtat gattttttta acaatatata ttgaaaatac 5400 cttaaaatta aatgctccaa tacaagataa tatattttat aatgataatt acagtacaat 5460 caaatggaat atagataaaa attacgatga atgaaatgaa tgacatagta tcataaacga 5520 ttaatatcag aagactacaa tgaatgaaaa taataatttt aaaccatggg attatttatg 5580 tatattaaaa aaaaaaataa ttatatatat aaaaaaatat ggaattgata tgcttgatga 5640 attagaaaaa aacgattgtt atcaagagaa ttaaatgaaa aaaaaatgtt acaaatacca 5700 atttttttat agaaaaataa acatggcaaa ataacatatg attgatttaa tataattaaa 5760 tgattgatga attaatgttg agttaaacaa aatatgaatg attattatca aaaaatgcac 5820 aaacaagatg aatgttatgg attggggatg aaagataatt tttacaaatg attgaattta 5880 tagggaacat attaagaaaa ataccagtac tgagtttata catttttttt gaaaaaaacg 5940 tgaagcgatt atgatcaata taaacataat aatgaagcag tgatgaaaca cgttgaaaca 6000 attaagaagc tgttgggttg caattttggg aatatattta attgagctaa aatgggttaa 6060 cagggtactg gaaaaacaat gacgcaattt ttaactacaa ataaaaacaa atcaaacgga 6120 ctgaacaata acgaaatttc aacgacaggc aaatgaagat ggtgtgatta ttaaatgttt 6180 caaaacaaca tgtatgcaag taagaatgtg tgtgagaata attggacaag ggactgcagc 6240 ccagaatgat tgttgaatgt aaacatgtgt tttataattt acttaacaat ggattaagaa 6300 tatattttca gacttttata cgctaatacg aaggatttac aggaaaccat tggaagacga 6360 tcaagaagac aacaatatca gaaagaacgc ttgcaaggac gaatatgaac cacagctcac 6420 ggaaaggtaa catttgtgag tatatcgtgg ggaaactaaa aagatcgaaa aggacggtaa 6480 aagttccgat ctagggatac ataaacagat aaagcaacgc caaagggatt aaaaagtaga 6540 caattgtact ctatggggag agttttatgt cgaatatagc ttcaaaacag ttgtaatcca 6600 tggggggatt caatatgtta aaaacatttc aaaacatttg cactttacgg ggatagtcaa 6660 atgaaaatta aagggaatgg ataaataacg cttcaaggga aaatcaaggg ataaaataat 6720 gcttcaaggg aaaaacacct ttaaacagat aaaatgcaga gaaaaaaaat cagaacgata 6780 tcattaaaat gataattgaa ccaactaatt atcttgcaaa ttcagctatt ttattgcgaa 6840 ttgctcaaat aaacaaattt attgaatggg tttgacaaat ttgaaaacac aaagtgacag 6900 acaagtttca taaaaagttg atataacaga aaggatatct taacatacgc agttttgcaa 6960 tgacaagatg ataggatgta tagatcaatg aaaaacttta tagagtaatt ttttttattc 7020 aagtataaaa ataatagtca ataagaactg atttttttta gtattgaaaa taatagaact 7080 gatttcatat aaaaatatca acaagattta taagattgta agtaaatttg attttcaatt 7140 ttgcacagaa attaaatgat aggtacctga aaattataaa atattgttca tgcgaagaaa 7200 acaattaaga catttttgtg ggagatacaa aagtagaaac tgaagtaaaa tatgacagag 7260 acgggagacg aaacacaagg atgatcgacg cgttcgtatg ctgagaggtt ctacgcgttc 7320 gtatgcagaa gtttgtgatg gcaagatggg gagtggcaaa gtcatgcggg aggttggaca 7380 ggtaacggtg ttgatgttgt tgtaaaaggt catatcaatg aggatcaagt cgaattaaaa 7440 cgcaaattta gtaacgaaac attcacgtaa aacaaattac taaatcgaaa tgtggaatta 7500 agcgaagtaa agatggttga aatgcgaaac atcaattatg gaacaaaaat gcaagttgag 7560 atcatactgc acgtctgaaa ctattgaaaa ggagaagatc atgataaaca aatggtaaca 7620 tcatccgatg tcattaatct acaaaagcca aagttgcaaa ttacacgaaa agacaatttt 7680 ggactaacaa catgtcctaa acatttgata aaagaattac atcaatcgat gacaataatt 7740 tgagattgtc aaagatgcaa agcaactgga aaagtcaatt ttggacaaaa atactggatt 7800 tacgaacaat tctgcaatac tcggcaagta agaagtcaat gcactctcaa aattgaccac 7860 agtcaactca cagctgggtt atgaagcata attcatgatg gaaacgacaa ttgcactaat 7920 caacgaaatt caacacctta gcctcgaaaa ctgatcaatt gaccaagtca ctgtggaatg 7980 aatagcctca cgatagagac gatttttgaa ggaagacgac aacgataaca accagccgat 8040 caacacagtt ctacaacaga ttttgcttac agtgagaaaa atgcaatgat tttttttagg 8100 agagataatg aaaattaatt tcaaatgcat gtttatttag atcagtttca atgaaaatat 8160 ttttacaaga gaaattcaaa acagtaaatc aagcaattta acaggacaga agatcaaaag 8220 agtaatgaaa ataattttac agtagaaaaa atcaaaagag caattataat tataaaacaa 8280 tttacaagaa agcacaagaa gaacaaattg taaaccgcac gatcactaca cccgacaata 8340 ttctttccat agatgatcat ttctttcaga cgatcatcta gctgataagt gtgggggaga 8400 ttaaccgaac catttttttt aaatcaatat tttatagcca gccatcaaag tatcttttta 8460 tgaatttttg caaagtattt tttacacaat ttaaattatt gactttggga atattcttcc 8520 aaccaacaaa taactttcga tagatgttct taatgctttt caagaataca aacatctagc 8580 tgataagtgt gggggaa 8597 // ID Copia-21_DPu-LTR repbase; DNA; INV; 571 BP. XX AC scaffold_57; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_DPu_; KW Copia-21_DPu-LTR; Copia-21_DPu-I. XX NM Copia-21_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-571 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 706-706 (2010). XX DR Genome; scaffold_57; Positions 325203 325773. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 571 BP; 148 A; 116 C; 109 G; 198 T; 0 other; tgttggaatg cgtaagcaaa cacacaatag gtggcagcac aaacgggagt ggtggcagca 60 caagcgggag ttctctcggc cttctacctt gactgtgcag tgctctcctt tgtctctttc 120 tctttctttc tgaccacgct aagagaaagg ttatctcgac tacgctctga ttttcccttt 180 cgatattttg tgtattcatc gaaagtgttt tcttatccaa gtttacctgg taattatctt 240 gtgtgtaaca aattgttcat tctgtgtatg tgacgaaatc tcatgcttga agtgcagaac 300 gttatcgttg agtgcacatg aagcttatca ttctacctat aactatgtca tcacttgtga 360 gatgatgtgc agaacgttat cgttgagtgc acacatctac tatcatttat gttttatgta 420 accaatcaat ggtatgatta ctgtacgtga tgttcaccaa cacattcgtc ttgaattacc 480 ctgatcattc agaagctcca ttcaatggaa tgtttatcct gttatgtaaa gaaatacaga 540 agcctagtta aatccttgtc tatttcttac a 571 // ID Gypsy4-LTR_AP repbase; DNA; INV; 220 BP. XX AC Contig7751; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4AP; KW Gypsy4-I_AP; Gypsy4-LTR_AP. XX NM Gypsy4-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-220 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 444-444 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 220 BP; 53 A; 27 C; 48 G; 92 T; 0 other; tgttatctat agttagtaga tggcagcact gggagcgagg ctactctctt cgctactaac 60 agtttagtag tcagtctgtc tcgaaagcct tgcgagtgcg ttcggtgtgt tttcaaatgt 120 gtttgttttt ttaattttgt atgttttatt gaaatgaaat ataaggaagt gttttatata 180 tgtattgagt tttgagtatc catttatttt gtctataaca 220 // ID SAT-5_AAe repbase; DNA; INV; 147 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Satellite-type sequence: consensus. XX KW SAT; Satellite; Simple Repeat; SAT-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-147 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1455-1455 (2011). XX DR [1] (Consensus) XX CC 147-bp unit. XX SQ Sequence 147 BP; 48 A; 30 C; 43 G; 26 T; 0 other; ggtagaagac atctctgaag tcaaggcaga agagaaacct gttgtcgaga aggcagtttc 60 gccggttgaa gccaaggttg tcgaagctga gaaaccagca tcgccagtcg ctgaagccga 120 gccaatcgaa gatgaaattg tcgaaaa 147 // ID Mariner-16_SM repbase; DNA; INV; 2045 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-16_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2045 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1865-1865 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 582..1586 FT /product="Mariner-16_SM_1p" FT /translation="MSSNKTLSTRKNYTIKQKVDILNEIKPGETGKSCRAI FT AKLHGINESVLRGWKNKKEDLLKAVSDVNICTRIVRRLAGGGRQMEYEEIE FT KKVLQWVLLRNEKGIRVKEKYIQLKALSVRNDIIAHGGNEELNKFQASAGW FT VDRFKKRNNLTSRRFTTAKKIPDNVDEICRSFIHNLHEPLRKIFDDSTFNF FT EEWESSYGQILKQNNNFFLDNNSEWHWPLDIRTSFFHCIWKASNETQSDFE FT SWLPEFTRKVLEFIDKHAVCKELLDDSERNCLQNGEYTRTNIEIFSVSIIN FT NWKIQVTTLDEEACEVSSFIYGSNSTVKNIHIIKHKDFFGLQL" XX SQ Sequence 2045 BP; 753 A; 273 C; 323 G; 696 T; 0 other; ctacagtctt ccaccgaata tagtacgcgc gcgtactata ttcgaatttt ataatgtaac 60 ttctaactaa agtaaaattt gcaggtactt tttctataaa ttttaacaaa caaattctta 120 taaaaatttt ttcttattct caagtaaccc ttgctttttt gtttttttaa cgttaaaaca 180 tttttatgtt tctatattat atacatacat ataatattat tagaaaatag aaatcaaatg 240 acttgtgttt atagtgtaca cgcaatttta atttctcaat caaaattatt atgtagaata 300 atcatactca ataaaagtat atatgtattg taaactattc aaaatagaat ttttgattca 360 atttaaaaac ttcccttgta aattcaaaaa tacaaatcta agtaggtttt gcttaatttt 420 cgcgaaaata taaaatatgt aattcttttg tttacgttta gtttcattag aaatctggtg 480 acacatatat ctaaatattt gttctaatta aatcaaattt tgtttaaaat aagatttctt 540 taaataagta cataacttat aatttattaa aacaccatac gatgagcagc aataaaacac 600 tttcaaccag aaaaaattat acaatcaagc aaaaggttga tatattaaat gaaattaaac 660 caggtgaaac aggaaagagt tgtcgtgcaa ttgctaaact acatggaatt aatgaatcgg 720 tattacgtgg ttggaaaaac aaaaaagagg atcttcttaa ggcggtaagt gatgtaaata 780 tttgtacccg tatagtcaga agattagctg gtggtggaag gcaaatggaa tatgaagaaa 840 tcgaaaaaaa agttctacag tgggttttgt tacgaaatga gaaaggaatc cgtgtaaaag 900 aaaaatatat tcaattaaaa gctttgtcag ttcgaaatga tattatagca catggcggta 960 atgaagagtt gaacaaattt caagcatcag ccggatgggt tgaccgcttc aaaaagcgaa 1020 acaatttgac ttcacgacga tttaccactg caaagaagat tccggacaat gtagatgaaa 1080 tttgtagaag ttttattcat aacttgcatg agccattaag aaaaattttc gatgattcaa 1140 catttaattt tgaagagtgg gaatcaagtt atggacagat tttgaaacaa aacaataatt 1200 ttttcttgga taataattct gaatggcact ggcctcttga catccgcacg agtttttttc 1260 attgtatttg gaaagcaagt aatgaaactc aaagtgattt cgaaagttgg cttcctgaat 1320 ttactcgtaa agttttagaa tttattgaca aacatgccgt ctgcaaagaa ttgctggatg 1380 attctgaaag aaactgtctt caaaatggtg aatatactag aaccaatatt gaaatttttt 1440 cagtttctat tataaataat tggaaaattc aagtcacaac tttagatgag gaagcatgtg 1500 aagtgtcgag ctttatttac gggtctaact caacagtaaa aaatattcat attattaagc 1560 acaaggactt tttcggctta caattataaa ttttaattac ttttttttta atttatgaat 1620 tttaattcaa tttaatttaa ttttatttta ttcgagcagc ggcacccata ccaataccat 1680 tgaaggtaat tggaatgcaa tcaaacaaca aactacaccg cgccacagaa cacggaaaaa 1740 cataggctta tacttgctta gatttatgtt aaagcgcaat tatggcaaaa acatgttttt 1800 aaatattatt acaattctgc tcgaaggtca aaatttttaa taaaaggggt tgtttttttt 1860 aatttttcta tttaaattgt tttgtggggg tggaagcgta cattagccga atttcgaata 1920 tagtacgcgc gtgttatatt tgagtgcgca tttactttga aaaaaactaa aatatttcaa 1980 atataaaata atttcagaaa actcgaatat aacgcgcgcg tactatattc ggtggaagac 2040 tgtag 2045 // ID Gypsy-93_CQ-I repbase; DNA; INV; 3988 BP. XX AC AAWU01007335; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-93_CQ_; KW Gypsy-93_CQ-LTR; Gypsy-93_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3988 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 565-565 (2011). XX DR GenBank; AAWU01007335; Positions 1042 5029. XX CC Positions [3106-3618] - Integrase core CC 'AGTC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1411..3972 FT /product="Gypsy-93_CQ-I_1p" FT /translation="MQRAVKERDVKHHILTKGPPVASKARRLAPDKLDAAK FT KEFQLMSELGMCRPSSSPWASPLHCVPKKNGQLRFVGDYRALNKVTVPDRY FT PVPHIHDLLNAFQGKSIFTTIDLERAYHQIPIDDEDVPKTAVITPFGLFEF FT TTMQNGLCNAGQTFQRYMHRIFGDLGFVITFIDDICIASSSPQEHREHVKI FT VFQRLRENGLVVNLSKCKFAQQQVEFLGYLIDKDGILPLPDRVQAVRQYEL FT PTTVKQLRRFLALLNVYKHFIPQATDQQAELRALIPGNKKNDTRKVTWTDK FT AEQAFEQCKKSLSDAALLYYVDPNKPLGLMIDASNSAAGAVLQQFVGGGWK FT PLGFYSEKFSPAQQKYSTFGRELTAMKMAVRYFRHLLEGRTFTIFTDHNPL FT THALTSNSPARLPHEDRHLQYISQFTQDIRHISGKDNVVADALSRVETIST FT PTVINYADVAADQVGDSELQGLLQSPSLKFEQRPFTANGDRLYCDISVEGK FT VRPYIPLPHRRRVLETMHGLSHSGVRATRRLVTDRFVWTSMNKDVADFVKC FT CIHCQQSKIQRHTTAPVCRFDLPKSRFQHVHVDLVGPLPPSNGYRYLLTMV FT DRYTRWPEAVPITDMTAETVARAFNSTWIARFGVPEKVTTDQGRQFESELF FT RELNHLLGSEHLRTTAYHPQANGLVERFHRTLKAALMCTDPKRWADRLPLV FT LLGLRTALKEDLNCSVAELVYGQQLRVPGEFFDAPKTELDRSDYAKEMHRV FT FDELKPKDPQHHAKPKVFVQRDLKDCRFVFVRIDMVKKPLQRPYEGPFKVL FT HRGDKCFDLLIKGKSQRVTIDRIKPAFIVEEDVEELKKTVVTPSGHRIRFL FT V" XX SQ Sequence 3988 BP; 977 A; 1113 C; 1055 G; 843 T; 0 other; ttggtgaccc gcgacgctga aaagaagtgg aaaatcgacc ccgacgaccg gtctccgcga 60 aacttccagc gaaatcggat ttttttttct cgccgctcgc aaccgacgcc atcttggggc 120 cctgactacg ccaccgctga atcagcagac caagatggaa gcgcaaacac agacaaacga 180 gacaaacaat cctgccacct cggctgtcac cgcggcagtt gcagttaagc ttccagaatt 240 ttggaagaac gacccaagga tgtggtttgc tcaagcagaa gcacagtttg ctctcgcggg 300 tgtggtacag gacgaaacaa aatactacca catcatcagc aagctcgacc agactgtcat 360 tatccaagtg gccgacatcg tgacagaacc cccaaaggaa aacaaatacc cagcagtgaa 420 gggacggctg atttcgcgct acgaagtttc ggcccaagga aagttggagc agctgctgaa 480 ctcttgcgac cttggggaca tgcggccatc tcatctcttg gcccgtatgc tggaactcgc 540 agctggtttg aacgtgaacg aaagtgtgct gcgggtgctg tttatccagc gaatgccaga 600 acgcgtcaag acaatcctgt cgatttgcga tggaaacacg ctgcagcaac tggcaaacat 660 ggcagataaa atcaccgatt tttcaccgtc tgtcgtagcc gccacatctt ccgcagctgt 720 acccgggctt agcgatctcc aggaccaaat cgctcagctt actgctgagg tacgtcggat 780 gaagacttct gacggacgca gccgatcttc ttcccgaagc cgtcagtctg gtgccccggc 840 ggacagcgtt tgctggtacc acaggaagta cggacgtaac gcacagcagt gccgtgaacc 900 gtgcgtcttc aaggactcaa aaaactaggt tgcagttcac ctgaaacggc gaaggtgaac 960 ggagcaacag aaagtcgccg cttactgctc aacgaccgct cgtccagttg ccgctatctc 1020 atcgacacgg gatcggatgt ttcgatagta cctgcgacaa agaaggaccg cctcaaggga 1080 ccgtcgtcgt tccgtttgca cgctgcgaat gggacggtga taaaaaccta cgactcccgt 1140 ttcatcacga cagacttggg gctccgacgg cagtttcgtt ggaacttcat cgtagcggat 1200 gtcagtgtcg ccatcatcgg agccgatttc ctcgcttttt tcggattgtt ggtggatctc 1260 aagaacagcc ggctcatcga cgggaaaacg aacctacagt gtgtgggcgg cctctccgca 1320 gctgaaattc acacggtgac aacagttgat tccagccatc cgttcaagga cttgctgctg 1380 gagtaccgtg aaatcacgct gccgtctacc atgcaacgcg ccgtgaagga aagggacgtg 1440 aagcaccata tcctgaccaa aggcccaccc gtcgcttcga aggctcggcg gctggctcca 1500 gataaactcg acgcagccaa aaaggagttc cagctgatgt cggagttggg tatgtgtcga 1560 ccatcctcga gtccctgggc cagcccgtta cactgcgtac ccaagaagaa cgggcagttg 1620 cgcttcgtag gtgattaccg tgccctcaac aaggtaacag tccccgaccg gtacccggtc 1680 ccccatatcc atgatctgct aaacgctttc cagggcaaaa gcattttcac cacgattgac 1740 cttgagcgag cttaccacca gatcccgatc gacgacgagg acgtcccgaa gacggcggtg 1800 atcactccgt tcggcctgtt tgaattcacc acgatgcaga acggcctgtg caacgcgggt 1860 caaacgtttc aacggtacat gcacaggatc ttcggagacc ttgggttcgt gatcacattc 1920 atcgacgaca tctgcatcgc gtcgtcgagt cctcaagaac accgagagca cgtcaaaatc 1980 gtgttccagc gcttacggga gaatggactg gtggttaatc tctccaaatg caagttcgct 2040 caacagcagg tcgaattctt ggggtacctc atcgataagg atggaatctt gccgcttccg 2100 gaccgggtgc aggctgtgcg ccagtacgag ctgcccacga ccgttaaaca actccggcgg 2160 tttttggcat tgctcaatgt ctacaagcat ttcatccctc aagcaaccga ccagcaagct 2220 gaactccgag cgttgatccc agggaacaaa aagaacgaca ctagaaaagt gacctggacg 2280 gacaaggcag agcaggcatt cgagcagtgc aaaaagtcgc tttctgatgc agcgttgctg 2340 tactacgtag atcccaacaa gcctctcgga ctgatgatcg acgcttcgaa ctccgctgct 2400 ggggcggtgc tacagcagtt cgtcggcggt gggtggaaac ctctcggctt ctattcggag 2460 aaattttcgc ctgcacagca gaaatattcc acctttggcc gagaactcac ggcgatgaag 2520 atggctgttc gatactttcg acatttattg gagggtagga cgttcacgat ctttaccgat 2580 cacaaccctc tgacgcacgc gctgacgtca aactctcccg ctcgattgcc gcacgaagac 2640 cgacacttgc agtatatttc gcagttcacg caggatattc gtcacatcag cggcaaagac 2700 aacgtggtgg ctgacgcact ttcgcgagtg gaaacaattt caactcctac ggtcatcaac 2760 tacgcagatg tcgcagctga tcaagttggt gacagcgaat tgcaaggcct gctgcaatct 2820 ccgtccttga agttcgaaca acgtccgttc actgcgaatg gtgatcgatt gtactgtgac 2880 atctcggtgg aaggtaaggt ccgtccgtac attccgctgc cgcaccgtcg acgtgttctg 2940 gaaaccatgc acggcttgtc acactccggc gtccgagcca ctcggcgctt ggtaacggac 3000 cgtttcgttt ggacttcgat gaacaaagac gtcgccgatt ttgtgaagtg ctgcatccac 3060 tgccaacagt cgaagatcca acgtcacacc acggcgccgg tctgcaggtt tgacctaccc 3120 aagagccgat tccaacacgt ccacgtggat ctagttggac ctctgccgcc gtctaacggg 3180 tatcggtatc tgctgacgat ggtagaccgc tacacacgct ggcccgaagc tgtaccaatt 3240 accgacatga cagctgagac agtagcgcga gccttcaact cgacgtggat cgctaggttt 3300 ggagtgccgg agaaggtaac gacggatcaa ggacgacagt tcgaatctga actgtttcgc 3360 gagctgaacc atctccttgg ttcggagcat cttcgtacca ctgcttacca ccctcaagct 3420 aacgggttgg tagaacgctt ccacagaacg ctcaaggccg cgttaatgtg tacagacccc 3480 aagcgctggg ctgaccgact accgttggtc ttgcttggtc tgcgaacggc cctcaaggag 3540 gatctcaact gttctgttgc ggagctagtc tacggccaac agctacgagt tccaggagag 3600 ttcttcgatg ctcccaagac ggagctggac cgcagtgact acgcgaagga gatgcatcgc 3660 gtgttcgacg agctcaagcc gaaggaccca cagcaccacg ccaaaccgaa agttttcgtt 3720 cagcgcgatc tgaaggactg tcggttcgtg tttgtacgaa ttgacatggt caagaaaccg 3780 cttcaaaggc catacgaggg accgttcaaa gtactccatc gaggtgacaa gtgcttcgac 3840 ttgctgatca aaggaaagag tcagcgggta actatcgatc ggatcaagcc agcattcatc 3900 gttgaggagg acgtcgagga gctcaagaag acagtagtta ccccatctgg ccaccgcatt 3960 agattcctgg tgtaactgga gggggcac 3988 // ID CR1-25_BF repbase; DNA; INV; 3050 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-25_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-25_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3050 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3050 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1596-1596 (2009). XX DR [2] (Consensus) XX SQ Sequence 3050 BP; 1070 A; 632 C; 692 G; 656 T; 0 other; ctctccacca gaggaagaga ggtaaaggaa gaaaatggga aaagcaaatg caagattggg 60 gagagaaagg gaaaaatgaa ggcaagaaat acaaatgaaa tgaaacttga tgtcatgtac 120 accaacattg attgcatcac gaacaaaaaa gcagaattcc tcgccatcat tgatgagact 180 cacccagaca ttattgttac cacggaaacg aaaccaaaga atcctagatt ccacatgaca 240 aaaactgatt tacacgtaga tggatacaac ttatttacta acgttgaaga agaaggggta 300 agaggcattg cagtgcacac aaagaatgat ctgaatgtaa tggaacctaa tacagcgttg 360 ctacaagact ccgcctcaga agtacaaccc attgaaatta agttgaggaa caatgacaaa 420 ctcctactta ttgcagtgta caggagtcca aactctacac ctgagaacaa tgaaaaaatc 480 aatacactga tacgcaatat aaataccctg ggatactcgc atattctcat tgttggggac 540 tttaaccacc cagaaataca atggagtgaa ggggcaggaa acacccagtc taacaaaaac 600 gaggcattta agttccttga agctacgaac gacgcatact tgtaccagca catttgtgaa 660 gcaacgagac atcaacttag acaagaaagt aatgtactgg acttattatt gactaatgag 720 ggtgacatga taagtgacct aatcatgaga ccccccatcg gcaagagcga ccatgtggta 780 ttaaatttta agatcaactg ttatgcaaac agaacacaat acaaaaatga gacatgccag 840 tacaacaagg gggactatga taaaatgagg gaggaactgt cactggactg gcaggaaatt 900 ctaggcaagt tgaatgttga agattgctgg gaggtcttct cgggcaaagt agctgaaagt 960 gcgaacagga atgtacccaa agtctcctca agaaaaaaga agaggaaact atcttggaga 1020 gacagggaag ttaataaaaa aatgaacaaa aagcagaaac tttggaagaa atactgtgaa 1080 tctaggacaa aggatgacta cattaaatac acgagggcga gaaaccaggt cagatgggcc 1140 acaagaaaag cggtcaaaac atatgaaaag gagaaagcga gaaacatcaa aggcaatgcc 1200 aaaatcttct ggaaatatgt aaactcaaaa tccaaagtgc gccaaggcat tccagacttg 1260 gaagatggat cctcagtggc acaatctgat acagaaaaag ctgagttact caacaagttt 1320 tttgtaagca cctttactaa ggaagaccta caacacatcc caataccaac tgaaagacac 1380 tacaatgaag agataactga cattgacatt tgtttcgagg aggtccaaca aagattgaaa 1440 aacctcaacc caaacaaggc aatggggccc gacaacgtac accctagagt gttaaaggaa 1500 ttggcggaca ccctggcggt tccactgcag attctctacg tcaagactgt acaagaaggc 1560 aaactccccg atgcctggaa aaccgctaat gtcacgccaa tttacaaaaa gggttgtaag 1620 aaatcacccg gtaactaccg cccggttagc ctgacctcag tagtaggaaa aattctggaa 1680 gggttgatta gagacgccat tgtcaagcat atgaaggtaa acaatctctt tacaccacac 1740 caacatggct tcttgccagg aagatctact accactcaga tgttggagtg cttagacgag 1800 tggacagagt ggcttgaccg aggaacaccc gttgacgcgg tctatctcga cttccgcaag 1860 gcttttgatt cagtaccaat taagagacta ttagccaaaa tacaaagcta cgggatcacg 1920 ggaaaccttt taaattggat tgaatctttt ttgtcaggac ggagacagag agtctgtgtc 1980 aatggggaaa aatcagaatg ggccgaagta acaagcggag ttccccaggg gagcgtcctg 2040 ggtccggtgc tcttcactat ttttgtcaac gacatgccgg aaattgtgca gagtaagctg 2100 aagctctttg cagacgacac taagttatat agatcagtag tacaaaggga agaatgcaac 2160 aaattgcaga gggaccttca agtacttcag gactgggcaa taaaatggca gttgagcttt 2220 catcccatga aatgtacagt aataagactg ggaaaaggac accctgacta cacatactca 2280 atgttggata atgagactcg taccctactg gaatttacac aacaggaaaa ggacttgggt 2340 attactgttg acaaggagct gagtttcagc aaacatatat ccaacatttg taataaggca 2400 aatcagatcg caggtctcat atggaggact tttgcctatg tagataagga ggtgttccta 2460 ctcctatata agtcgctcat tcgtccccag ctggagtatg gggctcctgc atggtccccc 2520 tacacatgga agctggccct ggatctagaa agggtacaga agagagcaac gaagagagtt 2580 cccggcctta gaagtctacc ctatgaagag agactgaagg ctctaaactt acctacgctg 2640 gtctatagga gacttagggg ggacctgata aacacctaca agttcctaca tgggatatac 2700 gacacttcat gtccgtttga gttaaatacg agcacaagaa caaggggtca ctgcctccgg 2760 attaagagac aagcgtcgaa aagtaacaga agatcacact tcttctgcat cagagttgtc 2820 tcatggtgga acaatctacc agagccggtg gtgacatccc caagtgtaaa ctgctttaag 2880 gaaagactag acaaccacat gaagaaacac agcgtgtatt acaactttag agccctggac 2940 gacccgcaat tacctgggat gtcagtgact tagaaaggaa agagcggcct aaactggaac 3000 tgcagttcct acctgagcct gaagaactct actctactct actctactct 3050 // ID hAT-11_SM repbase; DNA; INV; 2463 BP. XX AC . XX DT 15-JAN-2008 (Rel. 13.01, Created) DT 15-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-11_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2463 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(1), 12-12 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 258..2159 FT /product="hAT-11_SM_1p" FT /translation="MDKWLNLNMNGNDADDNATNSSQAGQSIGKEASNVRK FT RKYDESFLQFGFTFQNWKGNEQPLCLICNELLASESMKPSKLKRHLESKHV FT SYVNKPKEYFERLCESLNKEKKTFEKFINVNEKYLLASYEVSYCIAKNKKP FT FTIGEDLVLPAAIKMVEILHGRKYGDDIRKIPLSNDTVSNRISDINKDQLI FT QLITRIKESPKFSIQLDETTDITKLAQLLVYVRYVYKESVGEELLFCRPME FT DHTTGKDIYCKVDEFLKAEGLEWKNCCGICTDGARAMTGKNIGFKSFFQAA FT HYDHITFTHCLIHREALAAKKLAPELNDVLQDAVKIINFIKSHALNSRLFS FT NLCKDTDSNYTTLLLHAEVRWLSRGQSLRRLLILKDDIKIFLTERKCELAA FT FFQNDLWLSKLCYLSDIFAKLNDLNLSLQGKNCDIFTSNDKIESFIKKINI FT WKSRVEKNSFEMFFSVDNFVIEKIHCKTFIAKTIVDHLKALEIQFRTYFIL FT NIDFQNIVWIQNPFWIGLSEINHLPLKAQEEFAELSSDSNLKLQFQKKTLT FT EFWIGTRTEFPTIADMALNVLLPFNTTYLCEVTFSALTHIKSQYRSALKML FT KRSYVQQFQTFHQDLICYAIKNRHIHLIKILTYTFI" XX SQ Sequence 2463 BP; 891 A; 350 C; 401 G; 821 T; 0 other; cagggcttct taaaccatgg gtcgcgaccc catttggggt cgcgtagcaa aattctgggg 60 tcgcgagaga taaaaataca atttaacaga aaatgttttt taacttgtaa aattattgtt 120 ataaagataa aataacaatt atcgtagata aatatatcat aaaatcttct agttcggtaa 180 ttaataaatc aattattatc tgcaatatgt tatttttata tatttattgt attttaggat 240 caaaataatt tttgcccatg gataaatggc tgaatttaaa tatgaatggt aatgacgcag 300 atgataatgc tacaaactct tctcaggcag gacaaagtat tggaaaagag gcatctaatg 360 tgcgaaaaag aaaatatgat gagtcttttc ttcaatttgg ctttacattt caaaattgga 420 agggtaatga acaacctctt tgtttgatct gcaatgaatt attggcttcg gagagtatga 480 agccatcaaa attaaaaaga catcttgaat caaagcatgt ttcgtatgtc aataaaccta 540 aagaatattt tgaaagactg tgtgaatctt taaataaaga aaagaaaacc tttgaaaaat 600 ttataaacgt taatgaaaaa tatttactag catcatatga agtttcatac tgtattgcta 660 aaaataaaaa gccgttcacc attggagaag atcttgtgtt acctgctgct attaaaatgg 720 tagaaatact acatggaaga aaatatggcg atgatattcg aaaaattcct ttgtcgaacg 780 acactgtctc gaatagaatt tctgatatta acaaagatca attaatacaa cttattacaa 840 gaattaaaga aagcccaaaa ttttcaattc agttggacga aacaactgat attactaaat 900 tggctcagtt gttagtatat gtcaggtacg tttataaaga aagcgtgggt gaagaattgt 960 tattttgtcg tcctatggaa gaccatacta caggaaaaga catatattgt aaagttgacg 1020 aatttttaaa agcagaaggt ttagaatgga aaaattgctg tggaatatgc acagatggtg 1080 caagagcgat gacgggcaaa aatattggtt ttaaatcatt ttttcaagct gctcattatg 1140 atcatataac ttttactcac tgccttattc accgagaggc tcttgcagca aaaaaattag 1200 caccagaatt gaacgatgtg cttcaggatg ctgttaagat cataaatttt attaagagcc 1260 acgcccttaa cagtcgctta ttttcaaatc tctgtaagga tacggattcc aactacacaa 1320 ctttgttatt acatgcagaa gtaagatggt tgtcaagagg tcaaagttta agaagattat 1380 tgatattaaa ggacgatatc aaaatatttt taaccgaacg aaaatgtgaa cttgctgctt 1440 tttttcaaaa tgacttatgg ctatccaagc tgtgttattt gtcagatatt tttgcgaagt 1500 taaacgatct taacttgtct cttcaaggaa aaaattgcga tatatttact tcaaatgata 1560 aaattgaaag ttttattaaa aagatcaaca tttggaaaag tagggtcgaa aaaaattcgt 1620 tcgaaatgtt tttcagtgtc gacaattttg taatcgagaa aattcattgt aaaactttta 1680 ttgcaaaaac tattgtagat cacttaaaag cgctagaaat acagttccgg acatatttta 1740 tattaaatat tgatttccaa aatatagttt ggattcaaaa tccgttttgg attggcttaa 1800 gtgagattaa tcacctgcca cttaaagctc aagaggaatt tgctgagctt tcgagtgact 1860 caaacttaaa attacaattt caaaaaaaga cgttgactga attctggatc ggaactagaa 1920 ctgaatttcc cacaatcgct gatatggcgt taaatgtact tctgccattc aacaccacat 1980 atttatgtga agttactttc tcagctttaa cacatattaa atcccaatat cgttcagcgt 2040 taaaaatgtt gaagaggtct tacgtccagc agtttcaaac attccaccaa gatttgattt 2100 gttatgcaat aaaaaacagg cacatccatc tcattaaaat tttaacttat acttttattt 2160 aatacttacc acgcaactac ttgaatatat tcaagagaat taagacggtc agaatcctct 2220 tttatttttg aaactataat aaatacaatt ttataatttt ttgtgcaatt tttaatttta 2280 gatttttgta tgtttcaaaa caaattctaa acttatatat tttcatatgt atatgtatat 2340 tgttacctaa taaataaatc atcaaaaaat attaatttat ttttctttta tatttgtatg 2400 gggtcgtcaa gaaactcgca atcataaatg gggttgtgaa ttacaaaagt ttaagaagct 2460 ctg 2463 // ID BEL-597_AA-LTR repbase; DNA; INV; 538 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-597_AA_; KW Pao_Bel_Ele56x; BEL-597_AA-I; BEL-597_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-538 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 538 BP; 204 A; 82 C; 106 G; 126 T; 20 other; tgttagtaga tcattaattc aaacctatga gtttaaatmc twaagtgaac gtaaacttaa 60 cattagcatg caatacgtac caccctagct caactgttat aaacccaata cgagtacact 120 aaacattacg aatttacgaa ctacacacaa catacataca cagaaaacga agttgwaaca 180 agattcggtw aaatgtacaa aagcaaatct gaattagtat ccaatcgaww gccaagaaat 240 awaaasgttg maakttatcg cgcgtgaagt ttattwawaa taaagwgtwa ttagktggaa 300 atawaaggtg tttgtgatat cggaatwatc agtggagttc ttggagtgaa ggaaaggact 360 gtctaacgaa aattggaagg attagtgacg ggatcaagga gtgataktag wggaaggatc 420 maaaggattg aagttattgg attgggaaag aaaaacgaac gattttaaca caacgtaggg 480 atctgtggaa ttcggtactg aagctaaaac tcaacccacg aacctacaat ccccaaca 538 // ID DNA-2-1_HM repbase; DNA; INV; 5400 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE DNA transposon from Hydra magnipapillata - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5400 RA Bao W. and Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 373-373 (2009). XX DR [1] (Consensus) XX CC TSD is 2-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 5400 BP; 1939 A; 754 C; 752 G; 1954 T; 1 other; cccaagtagc aaacaatatt ggcgtaatat tggcacgata ttgggtatct tggctaatct 60 tggccaagat atccgatatt gcgccaatat tgtcaagcaa tattacgcca atattgtctt 120 accaatattg gcaaacaata ttgcgccaat attatcccat tatcgtttta ctgatattgt 180 tatgccaata ttggcgaaat attgtttgcc aatattggct ttatattaaa ttttacatat 240 tggtacgcat gcgtaacatt aaggaatatt cataatgtcc tttagttttg ttaaatgtgt 300 ttaaatgtaa ataatagcat ctttttacat aaacaggcat ccttatgtta ttaatttttt 360 tacaaaaaca aataagagtt caaaataaaa attttaattt atgaaaagag aagaagtagg 420 aaaagtattt ttaattcaac atttacagag tgaatcagga gaagaagtgt tggtaattta 480 ttttgaaaca gatagtatca cgagaagaag tggtaagagt aataaaattc ttttattcaa 540 cattacgaat tgaattacga gaagaagtgg ttgtaattaa tttaaaacat attgcatcac 600 gagaagaata ggaaaaatat tgctatactt aagtatcaag attaaaatag taagtttact 660 cttaaatgaa aacactttga gaagatttca tctataaaat ataagtttta tagtctattt 720 attttgttaa agatattcgt ttaagaaata aatacgtcat tattattgtt tctaactgtt 780 ataagatttt aatttacgaa aaaggtacmc aataacatca ataatataac tgcttgacac 840 ataataataa atttattaca agataataat tttgcaataa aaaagttttt tttataatgt 900 cttttatttt atatagcatt ttattttatg caatagttta atgcaatgtt ataggtaata 960 aacaaatcaa taagttaaac tagatctgct acatcacgtt atgttaagag tagttaaaca 1020 agattaaagt aaaattcaca caataagtaa tgctgaaaat agtaaaaatt atataaatgc 1080 gactttgaaa atgcaataaa ctttagtgag gattgatgtt ggacattagc cttggagaat 1140 aaaactgata tagaaaattt atatcagata tattgaaatc tttcaacaca tttcacaact 1200 cctacttcta aaattaaaag cttatagaat aaatgactaa ctctacaatt ggattttaat 1260 gttacctaag agatcggcat tagagtatca ttgttggtgg tacatactca gaatggataa 1320 tagtaaatag tggggtcccc cattatttac ttttatgata tataataaag atttttctga 1380 tttggtttta gatgttgtat ctgtatttac agatgatact aagttataca gtttgctttt 1440 tgatagctca ccatgcaata acctgcaata aagtatggac attatagcaa actgggcaaa 1500 actatgaaaa ctaagtttta atgtaaaaaa atgtaaagtt atacccttag gaccaacaaa 1560 tacaaaattt tcccttagga ccaacaatgc ccttaggacc aacaaataca aaattcattt 1620 atatactcgt tgataaaaat ctcaatactt caattaatat taaaaactct ctcgtcaata 1680 aagggatctt cattgacagt aatttgactt ttattttgta agtactaaga ataacaccgc 1740 caataaagtt gtaggaatga tcaaaagatc atttgtcaat ctcaatatct tgacatttcg 1800 tactttgttt attttccttg ttcgtcctca ccttgattct tgcagctctg tctactatcc 1860 tagactttaa aaagatcaac aattaattta aaatgttcta tgctagtcta gtaaatttgt 1920 ttttggccag actggactca cttacaacca aagactactg agactaaatt taactagcat 1980 gagctatcgt ttcttatgta atgacatggc agaaacttac aaatgcattc acaatcttta 2040 taattgtagt tgcaaccctt ttgtttcctc ataaaatacc attcagtatt ttatctatca 2100 acacctgtgt cagaatgatg tcatctgcac agccattata tttattattg taattacaga 2160 ccattaagta gttaatctga gttgtaatag catgcgtgtt aatatggcca cttcttaata 2220 ttaatatatg attatatatt tagttagtat ttattcctcc attatgacta actccaactt 2280 aacaacatgc gatcatgctt ataaattgaa aaaaatgttt tgtaagacag atgtccggaa 2340 aaaacttttc taatatggaa ttcacaaatt tgtagaatag tctaccatct gaaatagtgt 2400 tcaacatctg tcacgacatt aaaaaaacat ctagatgcat cttgggttca tcaaatgtac 2460 aattgcatat aaacaattag accgacacaa ttagatacaa tagaacaaaa ggttgatagg 2520 cttttttata tacctaatta ttctatttgt aaaattataa ccttagaaaa taataataat 2580 aacaacatta ataataataa caatatgtta tatcaaaact cttataatta aacaccgaat 2640 aattaaaatt aattaaacac tgaataagtt ttatgtttat ttttttagtg ttgtatatta 2700 aaaacacttt ttttcagtgt tccataactg agaaaaatat tctatgcctg gctaactcat 2760 tcaattgagc atgagatctt aatctcatgc taatttgaat ctcgtaatct tggggtaaat 2820 atttacttag agcctcaagt tgtttatctt ccttaatgtc atatttaaat aagaaaatat 2880 tatcatagaa acctttccaa accgcgcaaa ctttaaaatt cacttttttt tactaaccaa 2940 taattaaact ctacttgcaa taaaaagcga ttacattaaa cattaagaaa tgaatctcaa 3000 taatttttag actaatattt tcagttaaaa aaaaaaaaaa aatctcaaaa aagaaagtta 3060 atctgaagtt tttttatatc ataaatttat ttgtgattat tttttagctg atgtaccgtt 3120 aaactgaagg tggtaataaa gtttacaaaa tttgctctgg ctgaattttt caatgaaaaa 3180 acaatatgtg tgccatgaat gtaatctgtt tttcatggcg tatcaatgaa agtttggtgt 3240 attggccaaa tcacttgcaa gcatcttgcc tttaaactgc catacgtaaa gtcacgcttt 3300 taaaagtaac tgaaagcagc tgaaaataag aattttttgg taattgtctt atttacaata 3360 ttgtttttgt tttatttaca gtattgtctt tgttttttat ctaaacagct atgtttgact 3420 caaatatagt aaatgttgat tgtttagtat ggtttcatct cttctttata cttttttttg 3480 acagttatga gactgcttta gaacggaatt atctgtgcct ccagcttaaa aggatgcagg 3540 ttacaaaaaa agtgacactg atgagattgt ctctcagttt ctcaaagatt gaagaaatct 3600 ttagtgtttt agggttaagt gattccatca gagctttgct tgactatctc tttcgattta 3660 ttgaatttat ttgccggtta tagatgaaaa aagatcttct aaacctagaa gaaaaattat 3720 ggaacaaaga caccttatga atcaatagta tatatcagtg aattgatatt agttaatagt 3780 ttttggaaac agtaccatgt actattaaag tttctaattt ttaaaatata tgatacattt 3840 atatgaaaca tacaaacaaa tgatacaata tgatacgtga ttgactataa ttatgttgta 3900 tatcagaatg ttttatataa atgcttatta cattttattg tgatttttat cttttatgcc 3960 cagttagcaa tgtattttgt gtagcaatga ctactaatct cagatatcta tgacatctgt 4020 agggtgtatt caccgctaca gatgtcataa tctatttata accaattgct tttactcttc 4080 cataatgtca tattttaata agaatcacat tatggaaaac cgtctaatac atacacatga 4140 ggaaccatta tacttttagt acaaccaatt atattacatt tgttttctta ttttagatga 4200 actttcttac ttcggttgga ggagatgaga aacaacttac agcggcaatc ttaatacagc 4260 taatggccaa agaagtatgt ttttcatact cagtgtatgg aatgaaacaa aaacgtcctg 4320 caaggtcgtt ataggtaaag cacaatattt gtattctaca tcccatttcc ttttaataaa 4380 taaatatttt aataaaagct aattacttta caatatttat acgtgaaaag ttatatattt 4440 caaaattata ttgtgatgat atttaaaaaa cattttgtag agttgcgttt agtcgtttat 4500 cgttgtattc gttcatttcg aaatcgctac tgacaaaagt attaaagaaa aaattggcag 4560 ctttctggcg acggccatta atagagatta cggaagaaaa atagaaccgt aactgagcaa 4620 cgcccaatat tgagcaatga aataagtatc gccaatataa tgaaagtcga tgcccactat 4680 ctcatagtga gagtataata cagtagttta tatatatgta cgtgtattta aataacattg 4740 attttatttt gttatatttt ggttttggaa aactctatac ctcttttttt catttgtatc 4800 tgtttcatat aacttcaaat aaatttaatg atttcgaatt tttttttctt aaagcgtgtt 4860 ctatgattat tgtcatctcc tagactataa cctgtttaac tcaaaagaat aatgcagcct 4920 agtactatag tatcacttaa acaagaaaac tgttagtaat tttaattatt acaaatttgt 4980 agaaggcaat attttttatt attttttaat aataaaaatt taatcgaaaa tactcaatat 5040 attgccttaa tttagttata tatattatat atatcaggat aatattgtaa actatattag 5100 gccaatattg gccagtataa tatccccatt attggcgcga tattatttac aatattggcg 5160 caatattata aactatatcg ggccaatgtt gcagtgtgaa cattttgcca atattgactt 5220 tacagtattg gtgcaatatt gtatcgccaa tattgacttt acaatattgg cgcaatattg 5280 tagcgccaat attggcaaaa tgttcacatt gcaacattgg cccgatatag tttacaatat 5340 tgcgccaata ttgtaaacaa tatcgcccca atattgcgcc aatattgttt gctactaggg 5400 // ID Gypsy-19_IS-LTR repbase; DNA; INV; 230 BP. XX AC ABJB010862880; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_IS_; KW Gypsy-19_IS-I; Gypsy-19_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-230 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010862880; Positions 476 705. XX SQ Sequence 230 BP; 74 A; 55 C; 68 G; 33 T; 0 other; tgtagcggag acgctgagtg cggggtgaac ggaacgcgac ggaacagccc gcaccacgag 60 cgttcaaaac taaacaaaaa acataaaaga gaagcaaccg gcacgcgacg aagggaacgc 120 tcgtcgtgct cgtgcgtgtg cgctcggcag ttggaagcca accactgaag atgtcggacc 180 tctcagtgcg gtgattaaac ccgttaaaaa ggaaagcaga aattgtaaca 230 // ID Mbcv_I repbase; DNA; INV; 5016 BP. XX AC . XX DT 05-JUN-2008 (Rel. 13.09, Created) DT 05-JUN-2008 (Rel. 13.09, Last updated, Version 1) XX DE LTR retrotransposon from Monosiga brevicollis: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; chromovirus; KW Mbcv; Mbcv_LTR; Mbcv_I. XX OS Monosiga brevicollis OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. XX RN [1] RP 1-5016 RA Carr M., Baldauf S.L., Leadbeater B.S.C. and Nelson M.; RT "Three Families of LTR Retrotransposons are Present in the Genome RT of the Choanoflagellate Monosiga brevicollis."; RL Repbase Reports 8(9), 1179-1179 (2008). XX DR [1] (Consensus) XX CC Chromovirus identified in the genome of the choanoflagellate CC Monosiga brevicollis. Not annotated in the JGI release Version CC 1.0. CC Positions [3736-4215] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 62..1051 FT /product="Mbcv_I_1p" FT /note="Homology to known Gag proteins." FT /translation="MSSRTRSRTGPPGQGTHELNLESDTEPDTDPVAEPVA FT EHVMEPMTTMAPLASHVIQGFRANLQKKSQRLQTFGFDPYFSAYIAVTMHM FT NHQLKANLAATWSSHWDHVMVPFLEGALHRNLASPDTTPASLEDARAHLHK FT QFGFHLGVAALARMLDASTSNFTGLAEFDEWVQAVDCMFQHEPLEYTMLTK FT LQMLCKLQRQCTCSRLVERLATAAHQATSGPATTLTRIVKEALDGYVPTRD FT TLFSVPSNRTAHKGPTIHSAAGPKAANKQAASSPSRPPLRIPAQLRNSEEF FT TKLNKSDKQARHTFARKHHLCYHCFADDHQSKACPKN*" FT CDS 1039..4845 FT /product="Mbcv_I_2p" FT /note="Homology to known pol proteins." FT /translation="PKKLGPGVGEPASSTSPDPNPEGVPSQAARTSATDSL FT PPREEPRPVVINSVSVFSPRQDPGPLMCVGAMGSTAVTFLFDTGCDRSLLS FT AQLASSLSLPTRNGPPFDVIFGNGTRIPVETYTEVSLQLGPLTVTRSFPVV FT ALFDAADVLLGRDFLSEHDIVISARQRTATFPGAVQLPFIESPTPIGLRSL FT LRQKSAAAYLVWVSTTPDNSVRCFSATLGRSSSTSNDRDALIDDIRNEYSD FT LFDDGPTQAPAPAMRGVESAEHKIILTDNAQPVKQRDYRRGEAELAALKQL FT LTDMVRDKVIQPSTSPWSSPVLLVKKSNGSWRFCVDYRALNKVTVKDAYPL FT PRIEDCLSRLRDATCFTSLDLRSGYHQIPMAADSVPMTAFTTRYGSYEFRV FT MPFGLCNAPATFQRTMNQLLGDFLDDFVIVYLDDILIYSPTWADHERHVRR FT VLDRLRDAQFKCNIAKCSFFQDQVDFVGHTVSRNNIAMQTSKLAALRDWPL FT PATTTQLQRFLGFANYYRRFIASFSKLAKPLTDRAKNSLKRVKLAWTPQMK FT SAFEDLKKALLDGPVLMIPDTSGQYPFSLQTDASDECIGAVLSQQGRIVDC FT FSQKLHDAELNYPVRDKELLAIVRALKRYAHLIGNQTVDVYTDHRSLQYLE FT TTKLQGPVPQRRRLDRWWMDTLQHVNMRVIYLPGQHNVAADALSRMHVKDE FT DVTTFTAAEPVLGEDEDGTAIRDGGVAFVAAAANQANETTSAPGSTTVLQL FT SSDLYPQLVEGYRTDPDWQAVYAGHADPSSKPPRRGRLGIRFRRTELDAAT FT GLLYYALQPSSKAINNSADRRRLVVPQGPVRDLLLAEHHSTPLAGHLGYER FT QVATMRDKYWWRGLSTDLKDFCARCPKCQLRKDPTTARNGPLQPLDPPSTP FT FTHVTMDFITGLPKCEGCDAILVVVDRLTRMVMTRATKKAINALDTARLIL FT EMMLPMGGMPLSIVSDRDTRFVANVFQKMCKAFGTNLDFSTAHHPQTDGLT FT ERYNRVLIECLRTGAETDRDNWVQMLPMATYAINSTTHKVTRLTPLFAATG FT RHPVTPSSLLVQPKAAAFGDNDADSELRHLQSIWQYVADVTALGQEEAAER FT YDRKVNMVEFKPGDWVVVSIKAVRSPTDRATHSSKLGARCIGPFKVLERVS FT ANAYRLELPNDVRAHPVINIEHLRRYQLPANAAEDFRPAPVSRDSYGGNYL FT VETLLDKRTVRPKGRAGRPRIEYLVKWMGYSKEEATWEPRASLHDFYIKEF FT EAARPRT*" XX SQ Sequence 5016 BP; 1025 A; 1612 C; 1298 G; 1081 T; 0 other; tggtagcgat tgaaaacttc gatcgaccga ccgcccttct tcaatctttg cgctggatag 60 catgtcctcg cgcacccgtt cgcgaactgg gccccctggc caaggaaccc acgagcttaa 120 cttggaatcg gacacggaac ctgacacgga ccctgtcgcg gagcctgtcg cggaacatgt 180 catggagcca atgaccacaa tggctccgct cgcctcgcac gtgatccagg ggttccgagc 240 aaacctccag aagaagtcgc aacgcctaca gacttttggc tttgatcctt acttctcggc 300 ctacattgct gtgaccatgc acatgaacca tcagctcaag gcgaacctgg cagcaacctg 360 gtcttcccat tgggaccatg tcatggtacc tttcctcgaa ggtgccctac accgcaacct 420 tgcaagtccg gacacaaccc cagcatcctt ggaagacgcc cgagcgcacc tgcacaaaca 480 gtttggtttt cacttgggcg tcgccgccct ggcccgaatg ttggacgctt cgacgagcaa 540 ttttactgga ctcgctgagt tcgatgagtg ggtgcaggcg gttgattgca tgtttcagca 600 cgaaccgctt gaatacacca tgttgaccaa gctccagatg ctgtgcaagc tccagcgcca 660 gtgcacttgc agccgactgg tggagcgtct cgccactgcg gcacaccagg ctacatctgg 720 tccagcgacc acgttgacta gaatcgtgaa agaagctctc gatggttacg tcccgactcg 780 tgatactctt ttcagcgtgc caagcaaccg gacggcgcac aaggggccga cgatccactc 840 ggccgccgga cccaaggcag ctaacaagca ggccgccagc tctcccagcc gtccaccgct 900 ccgcattcca gctcagctcc gcaattccga agagttcacc aagctcaaca aaagcgacaa 960 acaagctcgc cacacctttg ctcgaaagca tcatctctgc taccactgct tcgccgatga 1020 ccaccagagc aaggcgtgcc caaaaaacta gggcccgggg tcggggaacc tgcatcctct 1080 acttccccgg accccaaccc agagggcgtg ccatcgcagg cagcacgtac ctcagccaca 1140 gattcactcc cacctcgtga ggagccccga cctgtggtta tcaattcggt ttctgttttt 1200 tcaccgcgcc aggaccctgg ccccttgatg tgcgtgggtg ctatggggtc aacggcggtg 1260 acgttcctct ttgacactgg ctgtgaccgc tctttgctat cggcacagct ggcatcctcc 1320 ttatcccttc caacgcgcaa cggaccacct tttgatgtca tcttcggcaa cggcacacgt 1380 atacctgtcg agacttacac cgaggtgtcc ttacagcttg gtcccctgac cgttacccgt 1440 tcttttccgg tggttgctct cttcgacgcc gccgacgttc ttctgggccg cgatttcctc 1500 agcgagcacg acattgttat ttctgctcgc caacggaccg cgacctttcc aggcgctgta 1560 cagctgcctt tcatcgagtc gccaacacct attggcctcc gttcactcct tcggcagaag 1620 tctgctgccg cctatctcgt ctgggtctct acgacgccgg acaactctgt gcgctgtttc 1680 tcagccaccc taggacggtc ttcgtccacc tccaacgacc gcgatgctct gatcgacgac 1740 attcgaaacg agtactctga cctcttcgac gacggaccga ctcaggcgcc ggcgccggct 1800 atgcgaggcg tagaatctgc tgaacacaaa attatcttga cggacaatgc gcaacccgtc 1860 aagcagcgag actaccgccg aggcgaagcc gaattggccg cactgaagca gctcctgacc 1920 gatatggttc gagacaaggt catccagcct tcgaccagcc cttggtcatc ccctgtgcta 1980 ctcgtaaaaa agtccaacgg ctcctggcga ttttgtgtcg actaccgtgc cctcaacaag 2040 gttaccgtca aagacgcata tccactgcca cggattgagg actgcttaag ccgtctccgc 2100 gacgcaacct gttttacttc gctggacctc cgttccggct accaccagat acccatggcc 2160 gccgacagcg tgcccatgac tgctttcacg acgcgctatg gctcctacga gtttcgcgtt 2220 atgccgtttg gcctgtgcaa tgctcctgcg acgttccagc ggaccatgaa tcagctcctc 2280 ggcgactttc ttgacgactt tgtcattgtc tacctcgacg acatcttgat ctacagtcca 2340 acgtgggctg accatgaacg tcacgttcga cgtgttctcg accgacttcg cgatgcgcag 2400 ttcaagtgca acatcgccaa gtgctccttc ttccaggacc aggtcgactt tgtgggccac 2460 accgtgagcc gcaacaacat tgctatgcag acctcgaagc tggcagccct ccgggactgg 2520 ccacttcctg ctaccaccac tcaacttcaa cgtttcctag gcttcgccaa ctactaccgc 2580 cgtttcattg cttcgttttc caagctcgcc aagccgctga ctgatcgtgc gaagaacagt 2640 ctcaagcgcg tcaagctggc gtggacccct cagatgaagt cagcctttga ggacctcaaa 2700 aaggctcttc tcgatggccc cgtgctcatg attccagaca cttcgggcca gtatcccttt 2760 tccctgcaga cggacgcctc tgacgagtgc atcggcgccg ttctttctca acaaggccgc 2820 atagtggact gcttctcgca gaagctccac gacgccgagt tgaactaccc tgtccgtgac 2880 aaagagttgc tggccattgt gcgcgcgctt aagcgctacg cccacttgat tggcaaccag 2940 actgtcgacg tgtacactga ccaccgctct ctccagtatc tcgagaccac caagcttcag 3000 ggcccagtac cccagcgtag gcgccttgat cgttggtgga tggatacgct tcagcacgtt 3060 aatatgcgtg tgatttacct accaggccag cacaacgtgg ccgctgacgc cctctcacga 3120 atgcacgtga aggacgagga cgttacaacg tttactgccg ccgagccagt cctaggtgag 3180 gacgaggatg gcacggcgat ccgtgatggc ggcgttgcct ttgtggcagc ggccgcgaac 3240 caggccaacg agaccacctc agcgccaggc agcacaaccg tgctccaact ttccagcgac 3300 ctctaccccc agctggtgga gggttaccga acagatcctg actggcaggc cgtgtacgcc 3360 ggccatgccg accccagcag caagccacct aggcgtgggc gccttggtat tcgattccgg 3420 cggaccgagc ttgatgctgc aacgggtctc ctctactacg ctctccagcc ttcgagcaag 3480 gctatcaaca attcggcaga tcgacgtcgg ctagtggtgc cccagggccc tgttcgtgac 3540 ctgttgctcg ccgaacacca ttcgacgcca ctggcaggcc atctgggcta cgagcgccag 3600 gtggcgacca tgcgagataa gtactggtgg cgaggcctga gcacagacct caaggacttc 3660 tgcgcgcgct gtcccaagtg ccaactgcgg aaggacccca ccacggcacg caatggtcca 3720 ctccagcctc tcgacccccc gagcacgccc ttcacacacg tgactatgga ttttatcacg 3780 gggctgccca agtgtgaggg ctgcgacgcc atcctggtgg ttgtggaccg cctgacacgc 3840 atggttatga ccagagccac gaagaaagcc atcaacgctc tcgatacagc tcgcctgatt 3900 ttggagatga tgttgccgat gggcggcatg cctttgagca tcgtctctga ccgtgatacc 3960 cgttttgttg ccaacgtctt tcagaagatg tgcaaagcat ttggaaccaa ccttgacttt 4020 tcgacggctc accatccaca gacagatgga ctgactgagc gatacaacag agtgctgatc 4080 gagtgcctcc gcaccggcgc ggagaccgac cgcgacaact gggtccagat gcttcccatg 4140 gccacctatg ccatcaacag caccacgcac aaggtcactc gcctgacgcc actgtttgcg 4200 gcgaccgggc gacatccagt gacaccgtca tccctattag ttcagcctaa ggcggcagcc 4260 tttggcgaca acgatgcaga ctcagaactt cggcacctcc agagcatttg gcagtacgtt 4320 gcagatgtga cggccctcgg ccaggaagag gcagctgaac gttatgaccg gaaggtcaac 4380 atggtggagt tcaagcccgg cgactgggtg gtggtctcca tcaaggcggt gcgttcgccc 4440 acggatcggg ccacgcactc ctcgaagttg ggcgcccgtt gcattgggcc attcaaggtg 4500 ctcgagcgcg tgagcgcgaa tgcatatcga cttgagctgc cgaacgacgt ccgcgcccat 4560 ccagttatca acatcgagca tctacgccgt taccagcttc ctgcaaatgc agcagaggac 4620 ttccgtcctg ctccagtttc tcgtgactcg tatggcggga actatctagt cgagactttg 4680 ctcgacaagc gcactgtgcg gcccaagggc cgtgcgggcc gtcctcgtat cgagtatctc 4740 gtcaagtgga tgggttactc gaaggaggag gcgacctggg aacctagggc cagccttcac 4800 gacttttaca tcaaggagtt cgaagctgcc cgccctcgta cttgagtttc acacggtcaa 4860 gtgtaccgca ttttgcgtgc ttgcatgaat caggaaactt gcattttttt tctctcatcc 4920 ttgtttccct ttttgttctc agcgtgcctt cgccatttct tccgtttgtc tttgaaccat 4980 cctcgccgga tcggggctcc ggctggcgag gggggg 5016 // ID BEL-82_CQ-LTR repbase; DNA; INV; 212 BP. XX AC AAWU01005025; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-82_CQ_; KW BEL-82_CQ-I; BEL-82_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 306-306 (2011). XX DR GenBank; AAWU01005025; Positions 601 390. XX SQ Sequence 212 BP; 65 A; 53 C; 32 G; 62 T; 0 other; tgttcgatga gatcgaacgc acactattac aaaaccaaaa ctgcgctacc tttgaattat 60 attctttgct tttctttttg cgcacaccta gtttttagaa taaaccgctc attgtctagc 120 tcgtaaccgt acgattcgca cgcgtctttc attccggccg aaatcgaagg aaaaatacga 180 ccttaaaata cagtccacta gaacaatcta ca 212 // ID Gypsy-3_RP-I repbase; DNA; INV; 4108 BP. XX AC ACPB02031233; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_RP_; KW Gypsy-3_RP-LTR; Gypsy-3_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4108 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02031233; Positions 15721 11614. XX CC Positions [3181-3648] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1639..3648 FT /product="Gypsy-3_RP-I_1p" FT /translation="MVEQGICRPSKSQWASPLHVVDKKDGGVRPCGDYRQL FT NARTVPDRYGVPNIMDFRNNLYSKTIFSRLDLVRAYYHIPIREDDIHKTAI FT ITPFGLYEFTRMCFGLRNAAQTFQRFMDNIFRDLNFCFVLLDDILVASTSE FT DEHRQHLEEVFRRLDSNGLTLNPSKCEFGKEEIDYLGYHITPKGIKPHVDK FT VQAIMDYPLPKDVRALRRFLGMLNFYRPCIPKAALLQKPLNVYLLGSKKND FT KTPVSWNEESIAAFERCKAGLQDAALLAHPKPEAELSLACDASNHGMGAVL FT QQKDSGSWIPLGFFSKMFSEAQQKYSAYDRELLAIYTAVRHFRPVIEGRPL FT IICTDHKPLIYAMYQKPSSASPIRIRWLTFISQFTTDIKHVPGEANQVADA FT LSRVEEIHLTGDLDKLADLQRTDATITDLEKQPNLSLQWVNLPGCTKTVLC FT ETSNGKYRIYLPESLRRDIFDSLHGCSHPGVRATRKLIGKKYFWPNINADV FT SNWARCCIACQRSKVTRHTITKPAEFVQTGRFEQIHIDIIGPFPPSAGKVY FT CVTIIDRFTRWPEVVPVENITAEVVALAFYRGWISRFGVPKILTTDQGRQF FT ESELFQHLTSFLGTKKIRTTPYHPQSNGCVERWHRALKSALVAHLDTSNWT FT ELLPTVLWAYVPQSGMIQILA" XX SQ Sequence 4108 BP; 1312 A; 846 C; 862 G; 1088 T; 0 other; gtggtgaacc cgacgtgatt ttgtgatctc ataatttgga ctttttaaat ctaattaata 60 taattcttaa attgttttta atctaatacc aaattagttt tcgttgtatt ttatgttatt 120 tgttcataat aagttaaggt tatttctaag aattttttta ttgttaagga tattttgatt 180 gtttcttttt gtttgtgttt gtgttctttt tgttagtttt tttttaaaaa aaaatcagct 240 aatcatgaag gaagaagcaa aagaagaaat agttgctttg acctccagta ataagatacc 300 tcccttttgg aaaccagacc cagaactatg gttctgtcag atggaggcag tgttttcgcg 360 ttgtgggatc acaaatagtt taacaaaatt tcagactgtt attcctcaat ttgagtttga 420 tgtgcttcaa caggtagcag atatagtcaa acacccaagt aatgcccctt acgaggattt 480 aaaaactcga ttattaatac atacgccgaa tctgaacata aaaggattca acaatttatt 540 agagggcaag caattaggag acgaaaaacc gtcacaactc ttgcgacaaa tgaagcaact 600 ggcaggagac acggtagcaa cggatgttgt taaaactcta tggctccgtt ctctcccgaa 660 taccacacag gcaatcctct tatctacggg acacactgag gtcgacaagt tagcgagtgt 720 agcagacaaa atccatgaaa tcgaccgccc tgatagcgta tgtaccgttt ctacaaacag 780 tggactggag cataaaatcg agaaactcac tgagcaaata gccgcactag ttgcagcaaa 840 atcaacctgg gatcgaagtc gaagccccag cgctcatcgt tctcggagta aatctagata 900 caaagcagag cccaataatc caaattggct ttgttactat cattaccgat tccgagacaa 960 ggcaaaaaaa tgtcaacaac cctgtgcttg gctgaagaac acagaaaatg caagttctag 1020 ctccaaccag ggaaacagct aagcgggccg gaagtggcgg tagctggcac cgctaattcc 1080 catcgccgcc tttttttgag agaccaacgt actgggcaga catttctcat agacacaggt 1140 gctgccatat ccgtgttgcc cccatctaca aaaggtaaat ttaaggctac agaatatacg 1200 ttatacgcag caaatggaag tcccattaaa acttttggag aaaaagagtt aaaattggac 1260 ttaggactac ggcgagactt caagtggaat tttgtgattg cagacatacc caaacctatt 1320 attggtgccg acctgctgca tcacttcaat ctgttggtgg acctaaaagg taagcaactt 1380 atagatcaag ttacaaagtt atcaacaaaa gggaaagttt caaacgagcc tcccactaca 1440 atatctgcag taccagagag tcatcctttt gcagacctct tacgaaaata tccagatatc 1500 accaggccta atactattaa ggaaaatgta cgacatgata cagtacacta catagaaacg 1560 acaggaccac cagttaatgc aaaatccagg aggttacatc ctgaacgata cagagtggtt 1620 aaagatgagt ttcgtagaat ggtggagcaa ggtatttgta gaccatctaa aagtcaatgg 1680 gccagtccct tacatgttgt ggataaaaaa gatggaggcg tgagaccttg tggcgattac 1740 cgccaattaa acgctcgtac ggtaccagac cgttacggag ttccaaacat aatggacttt 1800 agaaataatt tatacagtaa aactattttt tcgagattag atctggtaag agcatattat 1860 cacattccta ttagggaaga cgacattcat aaaacggcta tcataacgcc atttggcctt 1920 tatgaattca ctaggatgtg ttttgggctg agaaatgccg cccaaacctt ccaaaggttt 1980 atggacaata tattccgaga tttgaatttt tgttttgtac tactagatga cattctagta 2040 gcatcgacaa gcgaagatga acacagacag cacttggaag aagtatttag acgtctcgac 2100 agtaatggat tgacattaaa cccttccaaa tgtgaattcg gtaaagagga aatagattat 2160 ttaggatacc acatcacccc aaaaggcatc aaacctcacg tagataaggt tcaggcaatt 2220 atggattacc ccttgccaaa agatgtaagg gcacttcgcc gtttcttggg gatgttaaat 2280 ttttataggc catgtatccc taaggctgca ttgcttcaaa aacctcttaa tgtctatctc 2340 ctagggagca agaaaaatga caagacacct gtctcatgga acgaagagag catagctgca 2400 tttgaacgat gtaaagctgg cctacaagat gctgcactgt tagcgcatcc gaaaccggaa 2460 gccgagcttt cattggcctg tgatgcctct aatcacggta tgggagctgt gctacaacaa 2520 aaagattccg gaagctggat accattagga tttttttcta aaatgtttag tgaagcgcag 2580 cagaaatact cagcatacga tagagagcta ctggccatct atactgcagt acgacacttc 2640 cgaccagtta tcgaaggtag gccgctaatt atttgtactg accacaagcc actaatttat 2700 gcaatgtatc agaaaccctc ctcagcttcc ccaatcagga ttagatggct aacatttata 2760 agtcagttca ctactgatat aaagcatgtt ccaggtgaag ccaatcaggt ggcagacgcc 2820 ttatcacggg tagaagaaat acaccttaca ggcgatttgg ataagttagc cgatttgcaa 2880 agaacagatg ctacaataac ggacctggaa aagcagccta atctcagttt acaatgggta 2940 aacttgccag gttgcactaa gacagtactt tgtgaaacat caaatggtaa atataggata 3000 tatctaccag aaagcttaag gagagatatt ttcgactctc ttcacggctg tagtcaccct 3060 ggagtgcgtg ctacacggaa gttaattgga aaaaaatatt tctggccgaa cataaacgca 3120 gacgtttcga attgggcaag atgttgcatc gcttgtcaaa gatcgaaagt tacacggcat 3180 accattacaa aaccggcaga atttgttcaa acagggcgtt ttgagcaaat ccatattgac 3240 atcattggac cctttccacc atcagctggt aaagtttatt gcgtaaccat aattgatcgt 3300 tttacccgct ggcccgaagt ggtaccagtg gaaaatataa ccgcagaagt ggttgcttta 3360 gcattttacc gaggatggat tagtagattt ggtgtaccca aaatactgac tacagaccaa 3420 ggaagacaat ttgagagtga attgtttcaa cacctcacca gctttctagg tacaaaaaaa 3480 ataagaacaa caccttatca tccgcagtca aacggatgtg tagaaaggtg gcatcgggcg 3540 ctgaaatctg ctcttgtggc tcatttagat acaagcaatt ggacagaatt actacctacc 3600 gtactatggg cttacgtacc acaatcaggg atgatacaga ttttagcgta gcagaagccg 3660 tatatggaca aactctacgg ttgccaggag aatttttatc accaagttcc agtgctaatg 3720 atttaattgg gacagtacaa aaattaaaag aggtgatgtc cgaccttcgc cctgcatcat 3780 ttagaaaaag tcgcagacac atatttgtcc ataagactct cgacaagtgc actcacatgt 3840 tcttgagaga ggacaaggtt agaaaaccac tgacaccgcc ttactctgga ccattccccg 3900 tgctcagcag gacagacaag atttgcacac tacagctgcc acagcgacag atcactgtgt 3960 cgctagaccg cgtcaagcct gcctttttgt tagtggatga gccgaattca ccagcactac 4020 ttccgaccaa cgaaccatct gttagtagat atgggcgccg gataaagccg aatgtaagat 4080 ttctggacgc agttgttggg gggagtac 4108 // ID Gypsy-31_OD-LTR repbase; DNA; INV; 1433 BP. XX AC CABV01003571; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_OD_; KW Gypsy-31_OD-I; Gypsy-31_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-1433 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003571; Positions 4636 6068. XX SQ Sequence 1433 BP; 454 A; 266 C; 221 G; 492 T; 0 other; tgatacaata aggagaaaat actcgacaaa aaaaattttt tattttaatt atttctttaa 60 ttcacggccc aagatcgacg tcaatttgga ttaaaattta taaaaattca ttgtcacaat 120 tgttaaaatt tacaaaagat ggtcggaaaa agcgagacat ttccgactca aacccgcttt 180 ttcatcaatt gcatgttcat ttatcatttc gaccaattga cctcaaaaaa ccgaaattta 240 aacgttataa atagtcgctt ttggtttttg acatcttctg gttatgttct ctggattcaa 300 atatgctttc atcaaaatca actggttcgg gatttaatta taagggcgct tctagcagtc 360 catgtgcgac tgtaataacg aagcattatc tttaattata tgtacgcata taagaaacac 420 ttttaattat tccgaattcg ttcaatttca tcgaaaatca tcatttccgc acaatctcga 480 aaaacgtaaa ttctcttcct tgatctcatt tttaactttt gttctatttt tatattcaca 540 atggactaca aacgataaca aacgacaacg acgaaagtcg taattcagca tgctgattcc 600 tttttgagtg ttgtcggcga gtagctagac cgtgcgaaat ccgactagaa cagctcaaag 660 gtgaatgttc ctggtaactc tatcccgtct gtttagttaa gcctgacaac cttttcttct 720 tcaacaaacg gcgagcacga taacagatta gaacgattac gacctggact ggactgacaa 780 cgaatttgga catggacttg gacttgcctt tgaattggat tggtcacata gtgagatttt 840 caatttggag cgcatttatt tatctgtttt attatttttg aaatttattt tatccgttat 900 ttattgtgtt ttattcaatc gccaatttta tatcttttca agttttaatt tactaattaa 960 tcctgaccaa attaattgga acaaaaatta aaattattta tccgtcattt atgttttcat 1020 tattttataa aaaaatacta aaaaatattt aaaattcaac cattttgcga aattgaaata 1080 tttaaaaaat cgaattattc ataaagccgc aggcccagcc agcgcattgt tctcttttga 1140 aaagcgcgca gcgcatcccg cttctcgaat cttctcgaac ttgctcaaat ggcgcgcagc 1200 gcaaaccgtt ttcttccttg ttttcaaagt tcaaatcaga tttaatctgt aatgcataaa 1260 tacgaccgcc ttctccgaaa tttcattcct ttcacaaata aaaccaccgt tgaattttat 1320 tatcgtttcg ttacttgctt ggcaactcta gagctagaaa aatcgagggc aaattgcatc 1380 tgaaataatt ggaaggtctt gtcggtagct gaaagagaga attgacggta tca 1433 // ID BEL-36_AA-LTR repbase; DNA; INV; 513 BP. XX AC supercont1.382; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-36_AA_; KW BEL-36_AA-I; BEL-36_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-513 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.382; Positions 633318 632806. XX SQ Sequence 513 BP; 173 A; 93 C; 91 G; 156 T; 0 other; tgttgccgcg ccgcttcaga atatagtgca accgggaact ccctcggtaa aacctactga 60 tagacagaag aatatgtcaa tttctgacaa gggttcacga cgtcaaatga gttggcgaaa 120 ttgaacaaca ttggttgaag caatttgata cgcaaaatta ttgaaagcat ttgtgaacag 180 taaattatag tgaattcgaa actatattat cgtgttgagc ttaattgaac tcctctagtt 240 cttcacctct taggtaatag tatgtgaatt atataataga actgatcata aacttaattg 300 tatattataa cctaacagcc acccatatac gctctcattg agtgcacagc aaattgtacc 360 tttagcatat ttcctattga ggcactgaaa tgtaagtcca actgaattgt catttaaata 420 attgaactaa ttcaataaaa tttgcagctt tgaagctgta gaaagcagct atcaaaaatc 480 ggtgtttata tgttgcccta gtcagccgca aca 513 // ID Gypsy-85_AA-I repbase; DNA; INV; 6852 BP. XX AC supercont1.248; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-85_AA_; KW Gypsy-85_AA-LTR; Gypsy-85_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6852 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.248; Positions 1437207 1430356. XX CC Positions [5211-5687] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1731..2879 FT /product="Gypsy-85_AA-I_3p" FT /translation="MRSLSTKSNKKRHRKVSFSSISTSSSTTSSSQTTSSS FT EVSSEDTSESSSSTSRSAERRRERHKKERKARQSLRRIPVSEWKLKYDGRD FT QGRKLAEFLKEVKMRCKSEDVSDKELFRSAIHLFTGRAKDWFMEAYENHDF FT HSWSGLKRELKREFLPPDLDFQIEIQATGRRQARGERFVDYMHDMQKLFQS FT MTKPISERRKFEIIWRNMRFDYKNALTGAGVKSLSKLKKYGRIIDENNWNM FT FQKPSESSNRPKTHQLNEISASNNSKSKPTFSQPENTRTFFKSKPKSKTDN FT EHSKQEEQKNEKGERNERKENPIEGSSKGTLKALAEKYVRPPIGTCYNCRK FT HGHHYGDCSEKRQKFCRLCGFLDVITPECPFCQKNEQNSA" FT CDS 2813..4765 FT /product="Gypsy-85_AA-I_2p" FT /translation="MWFLGRHYAGMPFLPKKRAEFSLRRQAEASTLNPSKD FT DRLYKDLLQNGYKPFSDDDYVPEVEVNELFVKVDGDNRPFAKVHVMGREMI FT GLLDSGAQRSILGIGCKKLVKSLNFKIFPTDVSLKTVSGTPVEVEGYVHLP FT VTFNDETKIIKALITPNLPRRLILGYDFWRVFNLHPTVQFEHCELREEMER FT EGFGNEETREEEGGNEQELLTEEQKLKLEQVKQLFKFAIEGEVLGVTPLIS FT HKIELKEEFEKAPLVRINPYPTSPAMQQKINHELDNMLRQKVIEPSKSDWA FT LSTVPVLKPSGDVRLCLDARRLNDRTRRDAYPLPHQDRILSRLGSSRYLTT FT IDLTKAFLQIPLDPSSRKYTAFSVLGRGLFQFTRLPFGLVNSPATLARLMD FT EVLGYGELEPSVFVYLDDIVVVSSTFEAHIQSLTEVARRLRLANLSINLDK FT SKFCLRELPYLGYIISSDGLRPNPDRVSAIINYERPTSLRALRRFLGMSNY FT YRRFIPRFSEISAPLTNLLRKNPKTIVWNTTAEKAFLELKENLIAAPVLAN FT PNFQLPFQVQTDASDSAIAAILTQQHESGEKVVAYFSQKLSPAQQAYAASE FT KEGLAVLSAINKFRPYIESTRFVVITDASALTHEWEVAHVVTPQPMEHRAA FT GF" FT CDS 4704..6080 FT /product="Gypsy-85_AA-I_4p" FT /translation="MNGKWRTSSRLSRWSIELQGFDFEIRHRRGKDNVIPD FT ALSRSIEIATLSEGDAWYSSLYDKVRSAPDENVDFKIEEGKLYKFVPSKTE FT VLDFRFEWKLCVPEKSRDEILRKEHDEAFHIGYEKLMDKIRTRYFWPKMAT FT TIRRYVERRRTCKECKPTTISQHPVMGNPRLARKPFQMLAIDFIQSLPRSK FT AGHTHLLVLLDVFSKWTVLVPVRKIATDLIIKIIEEQWFRRFSVPEILISD FT NATSFLSNAFKDFLSRYQVKHWANSRHHSQANPVERLNRSINACIRTYVKT FT DQRLWDTRISEVEHTLNNTSHSSTGLTPYMIVFGHEIVSEGTEHLRDPDTS FT DVSESERAERKLKVDDQIQRIVRQNLAKDHDKSTRAYNLRFRKPAPVYQVG FT QNVYKRNFSQSVAGEAYNAKLGPAYTPCTVVSRRGTSSYELIDNQGKNLGI FT FSSADLKPGVPDEE" XX SQ Sequence 6852 BP; 2198 A; 1311 C; 1582 G; 1761 T; 0 other; attggcgacc aactaaaatc gaagctaatc aagagttcaa aaaggtgcta cattgttcta 60 tcaagtgtgc atgataggtt agctaggtta cggcaagtga taaggtttga gtgtttcaat 120 ggtaatgctt tagcttttgg ggatcgtttc ggagttcatt aagccatctt actagtgcgc 180 gtacactcac aaaagttgat attctttgtt ttttttttta ccaaccagcg ggagtgattt 240 tgcgtttaat tgaatgatag ttatggttat tgggtcttga ttcttctaaa tgatctgagc 300 aacgaatgtt tcggatcaaa acgactgaaa ttgtttggtt tgatgaatga ttttgtcgac 360 gctctcaatt ttgagtgatt aattagcaat tttcaaataa cttcttcgtg atagatcatt 420 taggtatttt ttcaattgct tcttcgtgat agcacaatta gttttattca atagcttctt 480 cgcgatagct caactaggtt ttttttttca attgcttctt cgtgatagct gtattaggta 540 gtttttcaaa tagtatcttc gtgattactc aattaggtat tttattgtaa ttaatcggta 600 tttgatatgg atttacaaaa gctttataag aggatggacg tgtctcatct ttccatggat 660 gaggttgaac gtgaattaca tgtgagaaac atggtattcg gaccggatga acatgaaagc 720 attaaacgga gaaagttgaa agatagaatg aagtacgaac gcgaaaagaa tatttttgtt 780 gccgctccta tttggagaac ggtacctgaa gagattgaga tagtgagggc taagcttttg 840 gtcatcggag gattgttgga caatccaaaa accgatgctc gccagagaga aaagctgtgt 900 actcgactgg tgcattatag agtgcgaata tacatgatat ataaatcgcc tggatcggat 960 aaactcacga aagagataac cgatcttggt aaactagcta gtcagatttt tcgaaaacat 1020 tttcccgaga tggaatctag ctctgaagcc caacccgaac ccaaacgttt agaatcagaa 1080 attgatcaag ttctagagga agtcagaagc gaaatagaag ttttaaatgt aacagctaca 1140 ggtaatgact tggaagagcc ggtcgtggaa gagggagcaa gtgggggtgt aaagaagaag 1200 acaattgagt ctaaaaagca ggagctagag gcatcaatga ttcgatgtga tgaaattttg 1260 ggggtgttaa caggatatga ggaaggaaag caggaaagct taaaagatgt aatccaacat 1320 ttcaaaaatt ttgttttaca caccacgaag caacagaagg aaatgagaga aagagaaatc 1380 gctttagaag agaaacgtat gaaagaagct gaggagaata tagagcggaa aaaaagatta 1440 gaaaagcttc ttatcaaact aaatgatcag ataaagttta atcaggagag aattgataga 1500 gagtcagaaa taaaccctga gaaggtagat acgggaaagg ggtcatttga accggatggc 1560 aaaaatgacc aagtattaaa atcaagtaaa gaagaagata taacgatcgc tgaacagcaa 1620 tcgtttgaag agagtagttc tgaaagtccg cttgaggtgt ttcggaaaca taaagcagta 1680 gaaagcaagg aaaggagaaa aagaaggagg ggaaaaagga gaaaagtaac atgagatcat 1740 tatccacgaa gtctaacaaa aagcgccatc gaaaggtgag cttttcctca atttcgacaa 1800 gcagtagtac aactagtagc tcacaaacta ctagtagctc agaagtttca agcgaggata 1860 cttcagaaag ctcgtcgagt actagtagat cagcagaacg tagaagggaa aggcataaga 1920 aggaacgaaa agctaggcag agtttgaggc gaattccagt ttccgaatgg aaactaaaat 1980 atgacggaag ggatcaggga cgaaagttgg cagaatttct gaaagaagtt aagatgcggt 2040 gtaaatcgga agacgtatcc gacaaagaac tttttcgtag cgcaatccac ttatttacag 2100 gacgagctaa ggactggttc atggaagcat acgaaaatca tgacttccat agctggtcag 2160 ggttgaaaag ggagctcaaa cgagaattcc taccgcccga tttagatttt caaatcgaaa 2220 tccaagccac tggtcgtcga caggctcgcg gtgaaagatt cgtcgattat atgcatgaca 2280 tgcagaagct ttttcagtcg atgacaaagc cgatttcaga gcgccgcaaa tttgaaatca 2340 tctggcgaaa tatgagattc gactataaaa atgctttaac tggagcgggg gtgaaatccc 2400 tatccaaatt aaagaaatat gggcgaatca tagatgagaa caattggaat atgttccaga 2460 aaccgagcga aagctcaaat cgtcccaaaa ctcaccagtt aaatgagatc tcagcctcaa 2520 acaactcgaa atctaagcca acttttagtc aacccgagaa tacgcgaact ttttttaaga 2580 gtaagccaaa aagtaagacc gataatgagc atagcaagca agaggagcaa aagaatgaga 2640 aaggggagag gaatgagagg aaggaaaatc cgattgaggg gtcgtctaag ggaacgttga 2700 aagctttagc tgaaaaatat gtgcgcccac cgataggcac ttgttacaac tgcaggaaac 2760 atgggcacca ttacggcgat tgctcagaaa agagacaaaa attttgtagg ctatgtggtt 2820 tcttggacgt cattacgccg gaatgccctt tctgccaaaa aaacgagcag aattcagctt 2880 gagaaggcaa gctgaagctt cgactctaaa tccttcaaaa gacgatcggc tatacaaaga 2940 cctattgcaa aacggttata aaccattttc ggatgacgat tatgtccccg aagtggaagt 3000 taacgagctt tttgttaaag tagatggaga caaccgtccg tttgcaaaag ttcatgtgat 3060 gggaagagaa atgatcgggt tactggatag tggtgctcag cgctctatac taggaatagg 3120 gtgcaagaaa ctggtgaaat cgctgaactt caaaatcttt cctacggatg tatctttgaa 3180 aactgtttct ggaactccag tagaagtaga aggctacgta cacctaccag tgactttcaa 3240 tgatgagaca aaaatcatta aggctctgat cactccaaat ctcccacgta gattaatact 3300 cgggtatgat ttttggagag tattcaatct tcaccccact gtgcagttcg aacactgcga 3360 gttgagggaa gaaatggaaa gagaaggatt tgggaatgag gaaactagag aagaggaagg 3420 aggaaatgag caggaacttc taacagaaga acaaaaatta aaacttgagc aagtaaaaca 3480 actcttcaaa tttgcaattg aaggagaagt tctcggagta actccactca tctctcacaa 3540 aatagaactc aaagaagagt tcgaaaaagc gccactggtg agaatcaacc cctatcccac 3600 gtcccccgct atgcaacaga agattaacca cgaattagat aatatgctca ggcaaaaagt 3660 tatagaacca agcaagagcg attgggctct tagtacggtt cctgtgctta aaccttcggg 3720 cgatgttcgc ctgtgtctgg acgcacgacg gttaaatgat cggactagga gggacgctta 3780 tcctctacca caccaagatc gaatattgag taggctaggg tcaagtcgat atttaacaac 3840 aatcgattta accaaagctt ttttgcagat cccgcttgac cctagctcac gaaaatatac 3900 ggcgttctcg gtgttgggca ggggattgtt ccagttcacc cgattaccct ttggcctagt 3960 caacagtccc gccacgctgg caagactaat ggacgaagtt ttagggtacg gtgaactgga 4020 accgagtgtg ttcgtttacc tcgacgacat cgtcgtggta agcagcacat tcgaagcaca 4080 catccaatct ctgaccgagg tggcccgtcg attacgattg gccaacctct ccatcaacct 4140 cgacaaatca aaattttgtc tgagagagtt accctacctc gggtacatta tctcatccga 4200 cggtttgcga ccaaaccccg atcgcgtatc ggcaataatt aattatgagc gcccaacatc 4260 acttcgtgct ttacgtcgat ttttgggaat gtcgaattat tatcgacgct ttattccccg 4320 attcagcgag atttcggcac ctcttaccaa tctcctgaga aaaaatccta aaacgatcgt 4380 atggaatacg accgcagaga aagctttcct tgaactaaag gagaatttaa tcgcagcacc 4440 agtgctcgca aatccaaact tccagctgcc cttccaagtt cagaccgatg cgagtgacag 4500 tgctattgct gcgatcctca cacaacagca cgagagtggc gagaaagtgg tggcatactt 4560 ctcgcaaaaa ctctctccgg cacagcaagc atatgcagcc tccgagaagg aaggtctagc 4620 ggtgctatct gcaatcaaca aattccgacc gtatattgaa agcactcgat ttgtcgttat 4680 taccgacgct tccgcactca cccatgaatg ggaagtggcg cacgtcgtca cgcctcagcc 4740 gatggagcat cgagctgcag ggttttgatt ttgaaattcg gcatcgaaga gggaaggaca 4800 atgttatccc ggacgctctc tcgcgttcga tagagatcgc cacgctgagc gaaggggacg 4860 catggtactc ctctctatat gacaaggttc gctctgcgcc ggatgagaac gttgatttta 4920 agattgaaga ggggaagttg tacaaatttg taccgagcaa gaccgaagtc ctcgattttc 4980 gtttcgaatg gaagctttgt gttccggaga aatcacgaga tgagattctc cgaaaagagc 5040 acgatgaagc gttccatatt ggttacgaaa aactgatgga caaaattcgc actagatatt 5100 tctggccgaa gatggccacc acgattcgca gatacgttga gcgacgccga acttgtaaag 5160 agtgcaaacc cactacaatc tctcagcatc cagtcatggg aaacccacga ctagctagaa 5220 aaccttttca aatgttggcg attgatttta ttcaatcatt gccgcgatct aaagctggcc 5280 atacccacct tctggtgttg ctagatgtct tttcgaagtg gactgttttg gtccctgtga 5340 gaaaaattgc gacagatctt attattaaga ttatcgagga acagtggttc cgccgcttct 5400 ctgttcctga aatcctgata agcgataacg caacaagctt tctcagtaac gcttttaagg 5460 acttcctgtc tcgctatcag gtgaagcact gggccaactc ccggcatcac agccaggcca 5520 acccggtcga acgtttgaac cggagtataa acgcttgtat ccgaacatac gttaaaaccg 5580 accagcgttt gtgggatacg agaatctccg aggtagaaca cacacttaac aatacgtcgc 5640 attcgtctac gggtctcacc ccgtatatga ttgtgttcgg acatgagatc gtttccgagg 5700 gtacggaaca tcttcgtgat cccgatactt ccgatgtgtc tgaatctgaa agagcagaga 5760 ggaaattaaa agttgatgat caaatccaga gaatagtacg ccaaaactta gccaaagatc 5820 atgataagag taccagagct tataatcttc ggttccggaa acctgctcca gtatatcagg 5880 tgggacagaa tgtttataag cgaaactttt cgcagtcggt ggctggagaa gcctataatg 5940 caaaattagg tccagcttat actccatgca ctgttgtgtc tcgccgaggc acaagctcat 6000 atgagcttat tgacaatcaa ggtaagaatc ttgggatatt ttcgtcggcg gatttgaaac 6060 ccggtgtccc cgatgaagag taatgtttgt ccaattccac tgtcaattta gagtctatac 6120 gtccaaaaat tccaatgttg tgtccagaat tgtgtagagc cttcgtaagt caaagtacgt 6180 aaataagatg atcgtcggtt attgtgagtt gtacagcaaa tattgtgtaa gtgaaatagc 6240 gaatctgttg gtttagaatg tctggccgtt cagaaaacaa tttatgtatt gctgtcccgt 6300 ttcgtctaaa atagaataat agtccatcat tattggttga agcagaagcg acagggaaat 6360 ttaaccgatc aaggattaaa acaataacaa atgaaaaatt tctagatcgc gatgtgcagt 6420 agctatggat ccgatggggc agatcaagtt atgataaaac actcctagtc aattacctca 6480 atttatcaca ccacacctaa attatggttc tcagttcggc aagaatatca ttcgtaaatc 6540 aaaccatcag gagatttcgt tttggtagaa agaaaaaaaa ttacctcttc ctcgaaacaa 6600 gcaaatgttt cggccattgg gagatgaata gattgactaa ccgttccgcg aatatgaaat 6660 tgcattaaca aagtcccttt ttttaaataa ggcaactgtt tgttgaaaag atgcgattta 6720 tatcattttt tgggaatttt cgagagcatt aaagttgcga ctcaaccctg agatgactat 6780 aaatgaaccc aacaagtgaa aaattaatcc aaaactttgt tttgattaat tttgaggggg 6840 gagatgaggt ag 6852 // ID Gypsy-192_AA-I repbase; DNA; INV; 5745 BP. XX AC supercont1.65; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-192_AA_; KW Gypsy-192_AA-LTR; Gypsy-192_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5745 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.65; Positions 2820290 2826034. XX CC 'GGACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1644..3332,3336..5729) FT /product="Gypsy-192_AA-I_1p" FT /translation="MESARRVRSFEILADRSRLPVEWQKWKRELERYFDAC FT GISSQWEKRSQMLHLAGPDLQEIFDHLPGVEEVPLVVRDPPYYDVAVRKLD FT EHFEPMRRRNYERHLFRQIEQKPDERFADFVLRLRIQAKRCEFDRFDKRET FT EDRLIEQIVETCRSKDLRRHILAKDMTLDEIVTLGTTLADVQQQMNELDRS FT HGEIKHQLDVVNRVIRRPQLSKPKFQSEFRSTNSTKWTNRSCFACGRKGHL FT KGDNICKAKNAKCLKCGELGHFMNRCLKRTAGSDNATSRTKRIRLIEEVEP FT ENQKEEAIFYAMGKNTFDFVIGGVKIAMVIDSGADANIIEEATWEQAKAAG FT IKTTGISSVVDRKLMAYATKLPMQINCMFWAEIKAGDNKTLTKFYVVQNGQ FT QNLLGHATAKELKVLKVGFDVASVTERPAIFPKIKGIVVEIPIDRKVQPVR FT QSYRRAPFALEEKIHDKLEYLLDRDIIEKVNEPSAWVSPVVPILKESGEIR FT LCVDIRRANQAVLRESHPLPVIEELLAGVDGAVKFSKLDVKDAYHQLEISE FT DSRVITTFITKYGLFRSFVISKTLLTLWKINDHSNLWNFVFKLRYKRLMFG FT ISCAPEAFQKVMDTLVAGLEGVIVYLDDAMVWGSTQAQHDFRLKCLLERFK FT EYNVLLNEDKCLYNVDELEFLGHYLSAAGVKPTESRVRAVEQFRRPGNTAE FT LRSFLGLITYVGRFIPHLASKTDSLRALLKKENKFQWTSAQQQAFEDIKHA FT VSNISHLGFFNPKNKTVLIADASPYGLGAVLMQEDANKQGRIIAYASKSLS FT DLERKYFQTEREALALVWAVDRFKLYLQGIKFDLVTDCKPLQFLFSPRSKP FT CARLERWIMRLQSYSYRIVYQPGPMNVADALSRLPVAQEVVGTFDPENENF FT VRMLAITSAPVAITLQEIQEESSRDEQIQDVIKALEHGVWTENAKPFKAYE FT TELCASSEVLLRGERIIIPEKMRRRTLELAHEGHPGMVVMKRRLRQKVWWP FT GLDAEVEKFVKTCRDCTLVSNAFAPEPLIRTAMPEKPWVHIAVDFMGPLPS FT GHNLLVIVDYYSRFVEVIVMKEISAKSTILALHETFCRYGIPVTMKSDNGP FT QFVSEAVQEFCREYGIEHRKTTPYWPQANGEVERANRALKKRLQISQTSKS FT DWRWDLRMYLLMYNSTPHSTTGVAPSALMFGRVLRDKLPGFPSAGMKSIEE FT VLDRDRQKKLKEAEYANIKRKAKPNPLREGEIVVAKRICKENKLASNFNPE FT ELIVVGRSGPDVTLKSKETGKIFHRNVSHLKPIVPRNQGTIDNAVPHKVNK FT NETEARQEQEPKHDNAVQNQVSKSMDRPRRETKRPDYLDDYLVRAVQDR" XX SQ Sequence 5745 BP; 1847 A; 1010 C; 1473 G; 1415 T; 0 other; gtttttgtcg acgaggtgaa aagtttgata aagtgattca gtaccgaaat tctttaacca 60 ttccaaaaaa gtggaaaaaa gtaaaaaaag aaggtgagtg ttattaaatc aaaagaaaag 120 atcttgaatt cctcgtgaag ataagtgtac cacgatgagg aaagtgagat gatggtggat 180 gcattcgagc catgaaatta ttgaaaaaaa atatgaagcg gaagaaaata atctatctac 240 agaaatagct aattactgtg aatgtgaaaa cgttcgaaat tggatttaga attgtctgaa 300 cacagtggcg tgaaaaaaaa tatggcgtca cgcacgagag atttaagctg agcttgccgt 360 gagtatgtat gagtgagagg tgaattattg aaatgaatat ccgacgcggc tcggtttctg 420 tgtgaaggcg aaagagcagt gcgacgttag acttcgttat atactcattg gattggtggc 480 tatgggcttc gcagttatgt tttattgaga aaattcgatg gtgggcgaaa aaaaagagtt 540 gaaccattcg gttggcaaag gggtgtctat ggatgttcgt aacagagtga aaaaaaaaaa 600 attgaaaggc ttaaaaggaa atatcatgtg cgatgtgatt ttggattggc aactgggtgc 660 cgatagcgtt tcacaactat gtgtgagata gagaaaagca gcagaatgct agagcaagag 720 gtgttatgta cattgatact ctggttggca actgggtgcc gatagagttt cacaactgtg 780 tgtgatagag agaaagcagc agagtgctag agaaaaaggc gttatgtaca tggatacttc 840 ggttcggcaa ctgggtgccg attgtggctc acaactgtgt gtgatagaga gaaaagcaac 900 agagtgctag aggaagaggt atcatgtacg ttaatatttc gaatggcaac tgtgtgccga 960 taggctgtcg caactgtgtg cgtagaagat aaagcaactg ggtgctattg gaaataacgc 1020 aggagctaaa aacgtggtgg gtctgaggac tctcagcttt ctgcgtatag gtttctacat 1080 ggacacgtga tggttacgtc acattaacgg tgagctgcag ttatgttata gttgttactc 1140 agaggtttgg tgcacataga actataccag tgacgaagca ggggagtaag tgagagttgt 1200 aaaaaagggt acgcattagt gtggaaacgt attggtttat gagcattagt aagtacaatg 1260 tttacaaagg tatgaaaggc aaaagtaaat aagaaaaatg tcgttatcat ggtggatttt 1320 aatggatttc gagaaattga tgtggaaaat gagataatac tggttaatga tcgattatca 1380 cggaggtgtt cagactctca gtttggcaga atcttaaggt aggtaataat aacaaaagaa 1440 aaatactgga aattatgttg agtagttcaa aacatgcatg tgaactattc ggggtcaata 1500 agtatgtatt tgggaatgtg tattatggga aggaatagaa ataataataa taataaaatg 1560 catccattca tccatccacg agatcttcat tattttaaat agtagaaacg aaaattaaaa 1620 aaagaaaatt gaaattcttt aggatggaat cagcaagacg cgtgcgatcg tttgagatac 1680 tagcagatag atctcggcta ccggtagaat ggcagaaatg gaaacgcgag ttggaacgtt 1740 actttgatgc ttgcggaata tcatctcagt gggagaaaag atcccaaatg ttacatttgg 1800 ctggcccaga tcttcaagag attttcgacc atttgccggg ggttgaagaa gttccactcg 1860 tagtacgtga tcctccttac tacgatgttg cggttcgaaa actcgatgaa catttcgagc 1920 ccatgcgtcg tcgaaattat gagcgacact tgttccgtca aattgaacag aaacctgacg 1980 aaagatttgc tgattttgtt ttgaggctca gaatacaggc aaaacgttgc gaattcgatc 2040 gttttgataa gcgagaaaca gaagacagat tgattgaaca aattgtggag acttgcagat 2100 caaaagatct acgccgacac attctggcaa aagacatgac tctggacgaa atagtaacac 2160 tggggaccac tttggccgat gtccaacagc aaatgaatga actggatcgt tcacatggcg 2220 aaataaaaca tcagttggat gttgtgaatc gggtgattcg gcgtccgcag ttgtcaaagc 2280 ccaaatttca gtccgagttc cgatctacaa attcgacgaa atggaccaac agatcttgct 2340 ttgcttgtgg aaggaagggt catttgaaag gtgataacat ctgcaaagcc aagaatgcta 2400 agtgcctcaa atgtggcgaa ctggggcact ttatgaaccg ttgtttgaaa cgcaccgccg 2460 gaagcgataa cgcaacttcg agaacaaagc ggattagact tattgaagag gttgagccag 2520 agaaccagaa agaagaagcc attttctacg ctatggggaa aaataccttc gatttcgtta 2580 taggaggagt caaaattgcg atggtcattg actcaggtgc cgacgccaat ataattgaag 2640 aggcaacctg ggagcaggca aaagcggcgg gcataaaaac aacaggaata tcgtcggttg 2700 tagatcggaa gctaatggcg tatgctacca aactgcccat gcaaataaat tgtatgttct 2760 gggcagagat taaggctggc gacaacaaga ctttgacgaa attctatgtg gttcaaaatg 2820 gtcaacaaaa cttactggga catgcaacag ccaaggaact gaaagtttta aaagtgggat 2880 tcgatgtagc atcagtgaca gagaggccgg caatctttcc gaagatcaaa ggaatagtgg 2940 tagaaattcc gatcgaccga aaggttcagc cagttcgcca atcgtatcgc cgtgcaccat 3000 tcgctcttga ggagaaaata cacgacaagc tagaatactt gctagacagg gacattatcg 3060 agaaagtaaa cgagccttct gcgtgggtct ccccggtagt accgattctt aaggagtctg 3120 gggagataag gctttgcgta gatatacgtc gtgccaacca agcggtactc cgggaatcac 3180 acccacttcc ggttattgag gagctattgg ctggtgtgga tggcgcagtt aagttttcaa 3240 aactcgatgt aaaggacgca taccaccaac ttgaaatatc agaggattct cgcgtgatca 3300 caacattcat aacgaaatat ggattgttca ggtaaagctt cgtaatttct aaaacccttt 3360 taaccttatg gaaaataaac gatcactcta acttgtggaa ttttgttttc aaactcagat 3420 ataaacggct gatgttcggc atcagttgtg cgccagaggc cttccagaaa gttatggata 3480 cactagtagc ggggttagaa ggggttatag tgtacctcga tgacgcaatg gtttggggaa 3540 gtactcaagc acagcacgat tttcgtctta agtgtctctt agaacgtttc aaagaatata 3600 acgtgctgct aaatgaggac aagtgcctgt ataacgtgga tgagttggaa tttctaggtc 3660 actacttgtc ggctgcagga gtaaaaccaa cagaaagtcg ggttagggca gtagaacaat 3720 tccgccggcc tggaaatacc gctgagctga gaagtttcct gggactcatt acctatgtag 3780 ggagattcat acctcatcta gcatcaaaaa cagattcgtt gcgggcgttg ctaaagaagg 3840 aaaataaatt tcagtggact tcggctcaac aacaggcatt tgaagacatt aagcatgccg 3900 tgtcaaacat cagccatttg ggatttttca atcctaaaaa caaaactgtc ttgatagcgg 3960 atgcaagtcc atatggtcta ggagcagttt tgatgcaaga ggatgcaaac aaacaaggac 4020 gaatcattgc ttatgccagc aagtcgttat cagaccttga acggaaatat ttccagaccg 4080 aaagggaggc tctagcgttg gtttgggcgg ttgatcgttt taagctgtat ctgcaaggta 4140 tcaaatttga tttagtcacc gattgcaaac cgctacaatt cctgttcagt cccagatcga 4200 aaccgtgcgc cagactagaa cggtggataa tgcgtttgca atcatattca tataggattg 4260 tttatcaacc aggaccgatg aatgtagctg acgcactatc gaggctgccg gtagcgcaag 4320 aggtagtagg aacctttgat cctgaaaatg aaaatttcgt tcgaatgcta gctattacgt 4380 cagcaccggt ggctatcacg ctacaggaaa ttcaggaaga atcaagcaga gatgagcaga 4440 tccaggatgt tataaaagca ctagagcacg gagtatggac agaaaacgcc aaacctttca 4500 aggcatatga aaccgaacta tgtgcttcat cagaagtact gttacgagga gagaggataa 4560 tcattccgga gaagatgcgc cgtagaacgc tggagctcgc acatgaaggg catccgggca 4620 tggtggtcat gaagagacgt ctacggcaaa aggtctggtg gccaggtttg gatgctgaag 4680 tcgaaaaatt tgtcaaaact tgcagagatt gtactttggt atcgaatgcc tttgcgccgg 4740 aaccattaat tcgtacagcg atgccggaaa agccctgggt gcatatagct gtagatttta 4800 tgggtccatt gccttcaggc cataacttgt tggttatagt ggattattat agccgttttg 4860 ttgaggtcat tgtaatgaaa gagatttctg caaaaagtac tattctagca ttacatgaga 4920 cgttctgccg atatggaatt ccggtgacga tgaaatctga caatgggcct cagttcgtaa 4980 gcgaagcagt acaagagttc tgtagggaat atggcatcga acatcggaaa acaactcctt 5040 attggccaca ggccaatgga gaggtggaaa gagcaaacag agcgctcaag aaacgtctgc 5100 agataagcca aacatcaaag tctgactgga ggtgggattt acgaatgtat ttgttaatgt 5160 ataactcaac cccacattcg actactggag tagccccgtc agcgcttatg ttcggaagag 5220 tccttagaga caagctgccg ggatttccat cagcgggcat gaagtcgata gaagaagttc 5280 ttgacagaga tcgccagaag aaattgaaag aagcagagta cgcgaacatc aaacgtaaag 5340 ctaagccgaa tcctctgagg gaaggagaga tagtcgtggc aaagagaata tgtaaggaaa 5400 ataagttggc gtcaaacttc aacccggaag aacttattgt ggttggaaga tcaggaccag 5460 acgtcacact gaagtcgaag gaaacgggaa aaatcttcca tagaaatgtc agccacttga 5520 agccgatagt tccgcgtaat caaggcacca tagataacgc agtgccgcac aaggtcaaca 5580 aaaatgaaac agaagcgcga caagagcagg agccgaagca tgataacgca gtccaaaatc 5640 aagtatcaaa aagtatggat cgccctcgcc gagagacgaa gagaccggac tacctggatg 5700 actatcttgt gcgtgcagta caagaccgtt gaggaagggt ggaga 5745 // ID Gypsy18-I_Dpse repbase; DNA; INV; 7727 BP. XX AC Unknown_group_563; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy18_Dpse; KW Gypsy18-LTR_Dpse; Gypsy18-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-7727 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1108-1108 (2009). XX DR Genome; Unknown_group_563; Positions 99039 106765. XX CC Positions [3296-3838] - Reverse transcriptase CC Positions [4853-5329] - Integrase core CC LTRs are 79% similar to each other. XX FH Key Location/Qualifiers FT CDS 2156..5395 FT /product="Gypsy18-I_Dpse_2p" FT /translation="MASQSQLDEESIIEYIIEGIPDSKQNKSCLYQANNFK FT DFREQIKIYEKISANQGSTLKANPKFENSSVKIKDERRCFKCGGSNHIAKY FT CKESAVKCFNCNQLGHKANQCEAKGSRMATEKKISNVQQFSQTSFNRAFKN FT VEIGNEDISALFDSGSDICTMSEKVYRKIFPVALKNDIKELIGIGGKKIYT FT LGSFNVVTKLDDVPLEMCFHVVRDDDTLYEGVIGSDVLDFVSASIGKKGVF FT FHSINDIRQGEVEVPPEKEYVSSSFTKGSRNLEFIKYSDTQVSRPPDSGSA FT IDELEKTFHQILSSNIIEEFSRGIDLSHLNREVRSIIAELVENYKPLKPEN FT SPVEMKIILSDEVPVHQRPRRLPYVDNEKVDKQVRDWLKEGIIRHSVSEYS FT SPVVLVAKKDGSKRLCGDYKKLNEKIIRDNFPMALIDDFLLKLQDGRVFTT FT LDLCNGFFHVPVHEDSRKYTSFVTQSGQYEFNYVPFGMTNSPPVFMRYIYA FT VFRPLIDDGILILYMDDIIIPSKDVQEGLEKLKKVLSIAEISGLRIKWEKS FT QVLQRKVNFLGYIIENSTIRPSQDKTSAVENFPIPRDRKQVQRYLGLTSYF FT RRFVKDFAVVARPLTNLMNKDVSFKMGEEELASFNQLKVYLSCSPVLKLFN FT QKGKTEVHCDASMYGYGAILLQIDSEDQCFHPVEYMSRKTTPAEEKYPSYE FT LEVLAIVQALKKWRVYLLGRKIKIITDCNAFALTMRKQDVPVKVSRWAIFL FT QDFDYEIEHRSGTKMKHVDALSRVHCLLLEDSLRHRIKQAQRQDIWISAIC FT KVLESGSYDDFYLKYDILHKDPVKELIVIPSGMESEIIMIAHRQGHFGVKR FT TIDLVQREYFIPELLGKVDRIVKSCVECIVSDAKCGKKEGYLNVIDKSDEP FT LQTYHIDHIGPMELTKKQYNHVLVVVDAFSKFFWLYPTKSTGADDVVDRLQ FT KQGEVFGNPKRVITDRGTAFTANVFEEYCKEQDIQHLLIATGVPRGNGQVE FT RINKVVTTMLTKLCAEDPKAWYKHVGRVQSFINSTPPRSTKISPFKILTGI FT KMRTKYDSDLEKLVEEEYILELQ" XX SQ Sequence 7727 BP; 2389 A; 1477 C; 1841 G; 2020 T; 0 other; ttaaagggct tccgggggcg ggaaacctat atccccttcc agacaattta tctctgacga 60 cgatatacgt ctctccgctg gcggaaaact taactccggc ggcgggatat ttatatcccc 120 tgctggaaaa cttatgtcta ctgacgaaat agttctctcc gctggcagaa aacttaaatc 180 cggcggcggg atatttatat cccctgccgg acaacttatc tctgctaacg aaatacgtct 240 ctccgctggc ggaaaactta tttccggtgg cgggaaatct atatccgctg gcagaaagtt 300 tatttccggt ggcgggaaat ttataccccc tgccggaaaa cttttctccg ctcacagaca 360 tcttatctcc gctggcagaa catttacctc cggtgctggg aaagttatat cccctgccgg 420 acaacttatc tctgctgacg aaatacgtct ctccgctgga ggaaaactta ttgccggtgg 480 cgggaaatct atgtagggtt ggcagccctt taatgagtat cgatctatcg ctagggatag 540 tcaggggtag aggaagatat atcgatagta agtaagagtt cgactgcggg agcgagtaga 600 cgtacagaga agaacgcgtt aagaaaaaga aacttattgt aaccatttat taaaataaag 660 acaatacaac gatgtgtggc ttatatttga tggagtttct ggagcggatt cttactcact 720 tgtagggagc ccaccaagga tccctacaat ttgggggctc gtccgggata gaaccacaag 780 tgaatgtgta ttcgtgtaca agttgccaat accatcaagt tttgcgcaac attgtgaaat 840 tacgacgaac gaagcaacga agatcagacg aaagcagatg ccgagaagcg gctgtgagaa 900 acgaaaataa agaggaaaga ggacgcaaaa cgactccatt cgttttcgtc aaggttcaga 960 gagcagaggt agctaacgac gaagagctac aaagctacaa aacgacgaag agctacagag 1020 ctacaaaacg acgaggcttc aaaacgacga ggctacaaaa cgacgcagac ccgaaaacga 1080 cgagtaaaga ggacgcaaaa cgacactatt cgttttcgtc aaagttcaga gagcagaggt 1140 agctaacggc gaagagctac agagctacaa aacgacgcag acccgaaaac gacgaggaaa 1200 gaggacgcaa aacgacacta ttcgttttcg tcagaagagc acggcaaacg ggaaagagtg 1260 aaaagccaac gacgaaagta cggagcaaag aggactgaag agacgttcag attgcgaacg 1320 tgcagaaaag aggaaagcag aaacgacgaa acaacaacag tggtacttcg cacggccacg 1380 tgcggcttgt ataccctgcg aacgacaacg acgtgaagaa gacagagaag aagaacgcat 1440 cgtttatctg cgtgtattgg aaaagacgat tctcacacct gatgtctgta tacgcaagta 1500 ttgaatatac agaatagatt tagggtgttg caaattttaa gaagcggatt taacgtaaac 1560 tatattttag ctaattgtaa tatgcaacgc gaagaaatct tagacctaac cgtagcagag 1620 ttgaaggaaa agttgaaaga gctacatctg cagcattcag gtgttaaatc agttttacgg 1680 gagagactat attcgtattt tagactagat caagacgaaa acgaagaatc tgtatacgaa 1740 gaaacgatca atataactgg ggacgaacag acagaagacg cgccaatttt aaattcggta 1800 ggagaagcag cagttggtag ttcagtaaac atggcgttca cattaagaga catcgaagaa 1860 tccctttcag catttacggg caatggttct ccggatgttg ataactggct acgtgatttt 1920 gaacttaatg ccttaacagt gaattggaat gatttgcaaa agtttatata tggcaggaca 1980 ttagtgaaag gtgcagcgaa attatttttg agcagtcaat caggcattaa tgattggaat 2040 tctctgaagg aagcactaaa aagtgaattt agtgagaagt taacggcaaa gcaagttcat 2100 aagctattag aaaatagaaa gaagacccat aaagagactt taattgaata tttttatggc 2160 aagtcaatcc cagttagatg aagaaagtat tattgaatat attattgagg gcattccaga 2220 ctccaagcaa aacaaaagct gtctgtacca agcaaacaat tttaaagatt ttcgtgagca 2280 aattaagatt tatgaaaaga tcagtgcaaa tcaaggaagt acgctaaagg caaacccaaa 2340 gtttgagaat agttcggtaa aaatcaagga tgaaaggcgt tgtttcaaat gtggaggtag 2400 taatcatatt gcaaagtatt gcaaggaaag tgccgtcaaa tgttttaatt gcaatcaact 2460 aggtcacaag gcaaatcaat gcgaggcaaa aggttcacga atggcaacag agaagaaaat 2520 ttcaaatgtc cagcaatttt ctcagaccag ttttaatcgg gcttttaaga atgttgaaat 2580 cggtaatgaa gatatcagtg cattgtttga ttctggcagc gatatttgta ccatgagcga 2640 gaaagtctac agaaagattt tcccagttgc gttgaaaaac gatataaaag aattgatcgg 2700 aataggaggc aagaaaatat atacgttggg ttcatttaat gttgttacta agttggatga 2760 cgttcctttg gaaatgtgtt tccatgtagt ccgtgacgat gataccctgt acgaaggtgt 2820 aattggaagt gatgttcttg attttgtcag tgcaagcata gggaagaaag gagttttctt 2880 tcattcaata aatgacatcc gtcaagggga ggttgaagtt ccaccggaaa aggagtatgt 2940 tagcagtagt ttcactaaag gaagccgcaa tctagagttc attaaatata gtgatactca 3000 agtgagcagg cctccggatt cagggagcgc aattgatgaa ttggaaaaga cgttccatca 3060 gatactgtcg agcaatatta ttgaagagtt ttcacgagga atagacctgt cgcatttgaa 3120 ccgtgaggtg aggtcaataa tagcagaatt ggtagaaaat tataagccct taaaaccaga 3180 aaacagccct gttgaaatga agataatatt gtcagatgaa gtaccggtcc atcaaagacc 3240 tagacgccta ccttacgtgg acaatgagaa agtcgataag caggtcaggg attggttaaa 3300 ggagggaata attcgccata gtgtttctga atactcgtcc cctgtagtgc ttgtcgccaa 3360 gaaggatggc tcaaaaagac tttgtggtga ttataaaaaa cttaatgaaa agataattcg 3420 tgacaatttt ccaatggcgt taatcgacga ttttttactc aagcttcagg atggacgtgt 3480 tttcactacg ttggacttgt gtaatgggtt ttttcacgtg ccagtacatg aggattctcg 3540 taaatatacc tcttttgtca cacagagtgg acagtatgaa tttaattatg taccatttgg 3600 gatgactaac tcccctccag tatttatgag gtacatatat gctgttttca gaccattgat 3660 agatgacggt attttgattt tgtatatgga cgatattatt attccatcaa aggatgtgca 3720 agaaggactt gaaaaattaa agaaggtgtt gagtatcgca gagatttcag gattacgcat 3780 aaaatgggag aaatcgcaag tgttacagcg caaggttaat tttttgggtt acataataga 3840 gaattcaacg ataagaccat ctcaagataa aacttcggca gttgaaaatt ttccaattcc 3900 gagagacagg aaacaagttc aacgctattt gggcctaact tcctatttcc gtagatttgt 3960 taaggatttt gctgtcgtgg ccagaccact gacaaacttg atgaataagg atgtttcttt 4020 caaaatgggt gaagaagagt tggcatcatt caatcagttg aaagtttatt tatcctgttc 4080 tcctgtttta aagttgttta atcagaaggg caaaactgaa gttcactgcg atgcaagtat 4140 gtatgggtat ggagctatac ttttgcaaat cgactcagag gatcaatgtt tccacccggt 4200 agaatatatg agtcgtaaga caaccccagc tgaagaaaag tatccttcat acgagcttga 4260 agtactcgct atagtgcagg ctttaaagaa gtggagagtg tatttgcttg ggagaaagat 4320 aaaaattatt actgactgta atgcttttgc gttaacgatg aggaaacaag atgttccagt 4380 taaggtatct cgttgggcca tttttcttca ggacttcgat tatgaaattg agcatcgttc 4440 agggacaaag atgaaacacg tcgatgcatt gagtagagtg cattgtttgt tgttggaaga 4500 ttcactgaga catcgaataa agcaggctca acgtcaggat atttggatat cagcaatttg 4560 taaggtgcta gagtcgggtt cttacgacga cttttatttg aagtatgaca ttttgcacaa 4620 ggatccagtt aaagagttga tcgtaatacc atctgggatg gaaagcgaaa tcataatgat 4680 tgctcatcgt cagggtcatt ttggtgtcaa aagaactatc gatttagtgc agcgcgaata 4740 ctttattcca gagttgttag gaaaagtgga taggattgtt aagtcgtgtg tggagtgcat 4800 tgtgagcgac gcaaaatgtg gaaagaaaga agggtatttg aatgtaattg acaaatctga 4860 tgagcctttg caaacgtatc atattgacca tattggacca atggagttga ctaagaaaca 4920 gtataaccac gtcctagtag ttgtcgacgc tttttctaaa tttttctggc tttatccaac 4980 aaagagtaca ggcgctgatg atgtggttga taggttgcag aaacagggcg aagtttttgg 5040 aaacccgaag cgtgtgatca cggatagagg cactgcattt actgctaatg ttttcgagga 5100 atattgtaag gaacaggata tccaacattt gctgatagcg acaggagtgc cgcgtggtaa 5160 tggacaggta gagcgaatta ataaggtcgt tactactatg ttaacaaagc tttgcgcaga 5220 agatccgaaa gcctggtaca agcatgttgg tagggtacaa tcgtttataa actccactcc 5280 tccgagaagt accaagatat ccccatttaa gattttgaca ggcataaaaa tgaggacaaa 5340 gtatgacagt gacttagaaa agcttgtaga ggaagagtat attttagagc tgcagtagga 5400 gaaagaagaa attcgcaaga tagccagaga taacatagtg cagatacaaa atgaaaattg 5460 caagtcgttc aacaagtccc agaaaccaga aaaagagtac caagttaatg atatggtagc 5520 tattaagcgt acccaatttg gaaccggatt gaagctaaag ggaagattct tgggtcctta 5580 tattgtgacg cgtaaactaa gacatggacg gtacattgtt gagaaagtgg gtgatggtga 5640 agggcctttg aagaccaata cggtggcaga gtatatgaag atgtggtctc aatcattcgg 5700 gtcgaatgat gagtcaggac ggccgaatgt agggttggca gccctttaat gagtatcgat 5760 ctatcgctag ggatagtcaa gggtagagga agatatatcg atagtaagta aaagttcgac 5820 tgcagaagcg aggagacgtg ccaaaggaac tatgctaaga aacctatttg taaccattta 5880 ttaaaataaa tataatacaa cgacgtgtgg cgtatatttg atggagtttc tggagcggat 5940 tcttactcac ttgtagggag cccaccaagg atccctacat atatatcccc agacggacaa 6000 tttatcccag cagacgaatt acgtttctcc gctggcagaa aacttatttc cggtggcggg 6060 aaatttatat tcgctggcgg aaaacttatc tctgctaacg aaatacgtct ctccgctggc 6120 ggaaaactta tttccggtgg cgggaaattt atatcccctg ccggaaaact tatctctgct 6180 gacgaaatac gtctctccgc tggcggaaag ttagttccgg tgctggcaaa tctatatccc 6240 ctgccggata acctatctct gctgtcgaaa tatgtctctc cggcgaacaa cttatcgccg 6300 ctggcagaca atttatcccc gcagacgaat tacgtttctc cgctggcaga aaacttattt 6360 ccggtggcgg gaaatttata ttcgctgacg gaaaacttat ctctgctaac gaaatacgtc 6420 tctccgctgg cagaaaactt atctccggcg gccggatatt tatatcccct gccggacaat 6480 ttatctctgc tgacgaaata cgtcactccg ctggcggaaa acttatttcc ggtggcggga 6540 aatctatatc cgctgacgga caacttatct ccgctggctg acaatttatc tccggcggcg 6600 ggaaatttat atcccctgcc ggacagctta tctgtgctgg cgaaagacgt cactccgctg 6660 gcggaaaact tatttccggt ggcgggaaat ttatatccgc tggcggaaaa cttatctctg 6720 ctaacgaaat acgtcgctcc gctggcggaa agcttatttc cggtggcggg aaatttatat 6780 ccgctggtgg aaaacttatc tccgctgaca gacatcttat ctccgatggc ggacaattta 6840 tctccgcaga cgaattacgt ttctccgctg gcggaaaact catttccggt ggcgggaaat 6900 ttatatcccc tgccggataa cttatctctg ctgacgaaat acgtctctcc gctggcggaa 6960 agttatttcc ggtggtggga aatctatatc ccctgccgga caacttatct ctgctgtcga 7020 aatacgtctc tccgctggcg gaaaacttat ttccggtggc gggaaatttt tatcccctgc 7080 cggaaaactt atctctgcta acgaaatacg tctctccgct ggcggaaaac ttatttccgg 7140 tggcgagaaa tctatatccg ctgccggaca acttacctcc gctggcagac aatttatctc 7200 cgctggcgga aaatttatat cccgagacgg acaatttatt cccgcagtcg aattacgttt 7260 ctccgctggc ggacaactta tatccgctgg cagacaattt atatccgctg acgaactacg 7320 attctctgct ggcagaaaca cgtctctccg gcggcggaaa aatttatttc cgggggcgga 7380 caatctatat cccctgccag acgaattacg tttctccgcc ggcagaaaac tgatctccgc 7440 tggcagaaca tttacctccg gcggcgggaa atttatatcc cctgccggac aacttatctc 7500 tgctgacgaa acacgtctct ccgctggcgg aaaacttatt tccggtggcg ggaaatatat 7560 atccggtggc ggacaactta tctctgctga cgaaacacgt ctctccgctg gcggaaaact 7620 tatttccggt ggcgggaaat ttatatcccc agacggacaa tttatccccg catacgaatt 7680 acgtctctcc gctggcggaa aatttatttc tggtggcggg aaatata 7727 // ID Gypsy-601_AA-LTR repbase; DNA; INV; 1141 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-601_AA_; KW Ty3_gypsy_Ele156; Gypsy-601_AA-I; Gypsy-601_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1141 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 1141 BP; 431 A; 201 C; 204 G; 305 T; 0 other; tgtctttgct atggacaaac agtttttgga atcaaggctt atgttgtcaa gggattaccc 60 cctagcatac ctgccttgct tggaaaaaat ttcttgtaca agtatgaaac aatattaaac 120 ttcaaggaac aaacactaat tgcatacgta aacaattgca aaattaaaat accattcaac 180 acaccaaata tgtcatttat atgtgtaccc gctagaacgg agatcacagc atttgtagag 240 gtaaatgtaa acgaacctca ggtgttatta agtgaggaag tcaaaccata cgtttttata 300 gctaattcaa ttgtaaaacc taacaatggg aagatcccag taaaaatttt aaatgtatca 360 agaaaaacag ttcttttgca tgaactgcaa ccaaaaatgg aaaagcttga tcaatataat 420 ataattcaac ttgatgatgt caaaactgac tcgaatagag ctgaaaaatt gttaaaggaa 480 ttaaaaatta atcatttatc tcgtgaggaa aaagccacga ttcgaaaaat ttgtctgaag 540 tacaacgata tattttgtct atcagacgat aagctatcgg taaccaaaat tttatcacca 600 tcgattgccg ttaagcaaaa tacgcaacca gtatacacaa aaccatatcg gttgccacaa 660 tcacaaaaag aagaaattgc caaacaaatt aaaaatatga ctgacagtgg tataatagag 720 gaagcccgtt ccgaatggaa tagcccggtg ctcctcgtac ccaaaaagtc atgcgatgac 780 aaaaaaaaat ggaggatggt aattgattac cgaaaggtaa acaattccct tcaagatgat 840 aggtttgaat taggaaatat cgaagacatt atagattctc tggcaggggc caagtacttc 900 acgcacttgg atctctcgca gggatactat caatgtgaga tagatcctaa aagtagacct 960 ataacagctt tttcaacagc cactggtcaa tttcaaatga ccaggctgcc aatgggatta 1020 aaaattagtc cctcaacttt ttcaagatta atgacagtcg cgatgtcggg actgaatatg 1080 gaaaaatgtt tagtttatct agacgatata atagtttttg gcaagacact agaagaacac 1140 a 1141 // ID BEL-50_AA-I repbase; DNA; INV; 5156 BP. XX AC supercont1.300; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-50_AA_; KW BEL-50_AA-LTR; BEL-50_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5156 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.300; Positions 1072095 1077250. XX CC Positions [4212-4790] - Integrase core CC 'ACAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 27..5129 FT /product="BEL-50_AA-I_1p" FT /translation="MEPAELERLRAKREVMFAKIKWEFSVANALHVRNPSF FT GEVCERRDKLTELATNFDAVQTTIEENTTNLMDVTSVFNYRTQFEEVYFKV FT KDIYTEFLDANQDRVSWHSAGSETRNDLRDAIKALLETQQAMLDRRDLPPQ FT QPASGGQVSNHEPQVKLPQLNIPVFRGERKAWNSFKDLFVSTIHTREDLKA FT SVKMQYLLSYLDGEAKRMVSSFPITDANYNEAWETLVTHYDKKKYTVFALV FT REFVDQPSVTNAAGLKKLVATSDDVIRQLKALGNEYETRDPWLIHLLLEKV FT DRETRGLWAQKIIDEENPMFADFIEFLQKRCDALETCSAFSRKQGETAKKE FT LPKFSASEKKIQAFHSSATQVSCAKCSKDHPTYHCDQFKAMDVSSRRELAL FT TSKLCFNCLRSSHTAKKCQSKSVCRTPQCNQRHHTLLCQKEVKQQDNKQEV FT HERSEPELYVPSISTNVAQAVAEKQGPFAFALLPTVVANIRGGDGKLHEVR FT ILIDCGSQASLITSACVKRLGLRRRNASLEVTGVSGEAVGTTAGIVTLVIS FT SRFDEETKLTTAAYVLGKLTATLPCQRFDVASMPYLKHLPLADPHFNQPGS FT VDVILGSDVFLSILEAGQVNDTNGIPVAQRSIFGWMVAGRISKQWCTHTHH FT SAIDLRQDFDIDRTLRLFWEDQEVRQAKQWTREEQRVVDHFNSTHTRADDG FT RFIVRLPMDNSKQQLGESLTAAVKRLRAMERKFETDENFKQRYIDFMQEYQ FT DLGHMDLIPEAEVQVEWTRSYYLPHHGVVKEDSSTTKLRVVFDASCATTSG FT ASLNDLLLDAPNINSDIFDIMMDFRFYEVVFTADAVKMYRQILMHPADRDY FT LRVVWRSSPDKPIQHFRLRTVTYGLKNSGFLAMAALKKAADDYEGIYPEAA FT QRIKKSTYVDDLTSGARTTEEAICLIKEINEIVESAGFTLRKWSSNSTAVL FT ESLPQTTTASQQIQFPDERDTVKALGIHWVPDKDVFTIKLERIHITRWAPN FT YNNQIELHGFSDASEEAYAAVVYLRSVDYDGKIHVTLLAAKTKVAPVRQVS FT LPRLELNAAELLAKLMKQIAESLKRFQIEQYAWTDSTIVLQWLSGHPRKWN FT TYVANRTSSILEILPRKHWAHVSSRENPADCASRGISPTELVGHHLWWSGP FT PWLVEDSATWDRTSPSDELDEITLEVRKRFQSLNVSISNAETTYVIEKHIL FT DSRSTIGAACRQLACVKRFIYNMRSKSSSLYEKRSGAILPSELNEARLLLI FT RLAQHEHYEEEAKSLAKGNEVHPKSKISSLYPFLDGSGTIRVGGRLQQSSF FT PFEVKHPAILPKNHRVSRLLVEELHLQNCHAGPTLLTATINQKYWIQGCQH FT LIKQVIQSCVKCCRQKAKTAQQLMGSLPAARVTACRPFSHVGVDYAGPILV FT RCSNTRGERCSKGYIVVFVCLSSKAVHLEVAGDLSADTFLGAFKRMIARRG FT YCNELWSDNGTNLVGANRQLTEIYEATRSHSKKTEPFFSNLGIRWRFIPPS FT SPHQGGIWEAAVKSAKELLRPVVGNEKLTFEKLSTVLCQIEACLNSRPLYP FT ISTSPDSYEALTPGHFLVGQPLNLLPEPDIGHLKANQLDNWEKVQRLTSEF FT WSRWRNEYIATLQPRGKWRNRQDNIKPHQLVLVKNDNTPPTAWELARVVEV FT HPDKQGLVRTVTLRRGKSEYQRPVQKLCPLPD" XX SQ Sequence 5156 BP; 1395 A; 1245 C; 1364 G; 1152 T; 0 other; tagttttggt ccttcggaac cgcataatgg agccagccga gctggaaaga ttacgtgcca 60 agcgagaggt tatgtttgcg aaaatcaagt gggagttttc cgtcgcgaac gcactccacg 120 tccggaatcc atcgttcgga gaagtgtgtg aacgccggga caaactcacg gaacttgcaa 180 caaactttga tgccgtgcaa actacgattg aggagaatac gacgaatctg atggatgtca 240 cgtcggtgtt taattaccgc acacaattcg aagaagttta tttcaaagtg aaagacattt 300 acactgagtt cctggacgca aatcaggacc gcgtttcgtg gcacagtgct ggaagcgaga 360 cgcggaacga tttacgggat gcaatcaagg cccttttaga aacgcagcag gcgatgctcg 420 accgccggga tctaccaccg cagcagccag catcgggtgg tcaggtatcg aatcatgagc 480 cacaagtcaa gcttccgcaa ctcaacattc ccgtcttccg aggtgagcga aaagcgtgga 540 attcgttcaa agatttgttt gtcagcacga tccacaccag ggaggatctg aaggcttccg 600 tcaaaatgca gtatttgttg tcctacttgg acggagaagc aaagcgaatg gtaagctcgt 660 tccccattac cgatgcaaat tacaacgagg catgggaaac acttgtgaca cactacgaca 720 aaaagaaata cacagtgttt gccctagtgc gggagttcgt tgaccagcca tcagtcacta 780 acgcggctgg attgaagaaa ctggtggcca cttctgatga cgtcatccga caactaaaag 840 ctttgggcaa cgagtacgag acacgggatc cttggctcat ccatttgctg ctcgagaaag 900 tggatcggga gacccgcgga ctgtgggcgc agaagataat cgacgaggaa aatcccatgt 960 tcgcagattt catcgagttc ctgcagaaac gctgtgacgc tttggaaaca tgttcggcgt 1020 tttcgaggaa gcaaggagaa actgcgaaaa aggagcttcc gaagttcagt gcaagcgaga 1080 agaagattca agcgttccat tccagcgcga cgcaggtgtc gtgtgcgaag tgcagtaagg 1140 accatccaac gtaccattgc gatcaattca aggccatgga tgtatcgtca cgtagggaat 1200 tagcactgac gtcaaaactg tgcttcaatt gcctgcgttc atcacataca gcgaaaaagt 1260 gtcagtcgaa atcggtgtgc cgtaccccac agtgcaacca gcggcaccac acactacttt 1320 gccaaaagga agtcaagcag caagacaaca agcaagaagt tcatgaacgc tcagaaccgg 1380 aattgtatgt tccatcaatc tccaccaatg tggcgcaggc ggttgcggaa aagcaaggac 1440 cgtttgcatt tgcattgctg cctacggtgg tggcgaacat tcggggaggt gacgggaaac 1500 tacatgaggt gagaattttg atcgactgtg gttcccaagc atctctcatc acgtcagcgt 1560 gtgtcaagcg tcttgggttg aggcgccgta atgcatcttt ggaagtcacc ggtgtcagcg 1620 gggaagcagt cggtaccaca gccggcattg tcacgctggt gatttcgtcg cgtttcgatg 1680 aggaaaccaa acttactaca gctgcctatg tgctaggaaa gctcacggca accctgccgt 1740 gccagcgttt cgacgtggca agcatgccgt acttgaagca tctaccactc gcagaccccc 1800 acttcaacca accgggttca gtggacgtaa tactgggatc ggatgtcttc ctgtccatac 1860 tggaggctgg tcaagtcaac gatacgaatg gtattcctgt agcacaacgt tctatctttg 1920 ggtggatggt tgcaggaaga atctccaagc aatggtgtac gcatacacat cactcggcga 1980 ttgatcttcg tcaggatttc gacatcgatc gcaccctgcg attattctgg gaggaccagg 2040 aggtacgtca agcgaagcag tggacacgcg aagagcaaag ggtcgttgat catttcaact 2100 cgactcacac acgtgcggat gatggtcgct ttatcgtccg attgccgatg gacaattcca 2160 agcagcaatt gggggagtcg ctaaccgcag cggtcaagcg gctacgagcc atggaacgga 2220 aattcgaaac cgacgagaac ttcaagcaaa gatatattga cttcatgcag gagtaccaag 2280 atttgggcca catggatctg attccggaag cagaggtgca agtcgagtgg actagatcct 2340 actacctgcc gcatcatggg gtcgtgaaag aagacagcag tacgaccaaa cttagagtgg 2400 tattcgacgc ttcgtgtgcc accacttctg gggcgtcatt aaacgatttg ctgcttgatg 2460 ctccgaatat caactccgac attttcgaca ttatgatgga cttcaggttc tacgaggtgg 2520 tatttactgc ggatgcggta aagatgtatc ggcagatact gatgcatcct gccgatcgag 2580 actatttacg ggtagtatgg agaagctcgc cggacaagcc gattcaacac tttcggcttc 2640 gtacggtgac atatgggctc aagaattctg gattccttgc catggcggca ctcaagaagg 2700 cagcggatga ctacgaaggc atatatcctg aagcagcaca gcgaatcaag aaaagtacct 2760 acgtggacga ccttacatca ggagcgagga caaccgaaga agctatttgt ttaatcaagg 2820 agatcaacga gattgtggag agcgcaggat tcacgcttcg caaatggagc tccaattcga 2880 cagcggttct cgaatcgcta cctcaaacca cgacagcatc acaacaaata caatttccgg 2940 atgagcgtga tacggtgaaa gctctgggga ttcactgggt tccggataag gatgtcttca 3000 cgatcaagct ggagaggatc cacattacac gatgggcacc caactacaat aatcaaatcg 3060 aactgcacgg tttctctgac gcttctgagg aagcctacgc ggcagtagtg tacctccgat 3120 ccgtggacta cgatggaaag attcacgtta cattgttggc agcaaagacg aaagtcgctc 3180 ctgtgcgaca ggtatctctc ccacggctcg aactcaatgc ggcagaacta ctagcgaagc 3240 tgatgaagca aattgctgaa tctctcaagc gcttccagat agaacagtat gcttggacgg 3300 attcgacgat agtgcttcag tggctctcag gacaccctcg taaatggaac acttacgtgg 3360 ccaataggac atcatcgata ctggagattt tgccacgtaa acattgggcc cacgtttcat 3420 cgagagaaaa tcccgccgac tgtgcttcac gtgggatttc tccaacagag ctcgtcggtc 3480 atcacctgtg gtggtcaggt ccaccatggt tggtggaaga ttctgctacc tgggatcgca 3540 ccagcccttc ggacgaattg gatgaaatta cattggaggt acgtaaacgt tttcaatcac 3600 tgaatgtctc tattagcaac gcagaaacaa cgtacgtgat cgagaagcac atactcgaca 3660 gccggtcgac catcggcgct gcatgccgac agttggcgtg tgtgaagcga ttcatctaca 3720 acatgagatc gaagtcatcg tcactctatg aaaaacgttc tggtgctatc ctgccctccg 3780 aattgaatga agcaagattg ctgctcatca gattagctca acatgaacac tacgaagaag 3840 aagcaaaatc gttggcgaag ggaaacgaag ttcatccgaa gtcaaagatc agcagcttgt 3900 acccgttctt ggacggcagc ggtacaatta gagtcggtgg acgtttgcag cagtcatcgt 3960 ttcctttcga ggtgaagcat ccggcgatac tcccgaagaa tcatcgcgta tcgagattgc 4020 tggttgaaga actacacttg cagaactgcc atgcgggtcc aacactcctg acagcaacaa 4080 tcaaccaaaa gtattggatt caaggctgcc agcatctcat caagcaggtc atccaaagtt 4140 gtgtcaaatg ctgccgacag aaagccaaga cagcacagca gttgatgggc agcctgccgg 4200 cggcgcgagt gaccgcatgt cgacccttca gtcatgttgg agttgactat gccggtccga 4260 ttcttgtgcg gtgcagtaat actcgtgggg aacgttgctc gaagggatat attgttgtat 4320 ttgtttgcct ctcgagcaag gctgttcatc tggaggtagc aggtgatctt tcagcggata 4380 cgttcttggg agcgttcaaa cgaatgattg ctcgccgagg atactgtaac gagttgtggt 4440 cagataatgg gacgaaccta gtcggtgcca accggcagct gacggaaatc tacgaagcta 4500 cgcggtccca tagcaagaaa accgaaccct ttttcagcaa tcttggcatt cgctggagat 4560 tcattccgcc atccagtccg catcagggtg gcatttggga agcagctgtg aagagtgcaa 4620 aggagctttt gagaccagtc gttgggaacg aaaagcttac gttcgagaag ttgtcaactg 4680 tactgtgtca aatcgaagca tgcctgaatt cgaggccttt gtaccccatt tccacatcac 4740 cggacagcta tgaagccctc acgccggggc atttcctggt agggcaacct ttgaacctat 4800 tgcctgagcc ggacatcggt catctgaaag ccaaccaact ggataactgg gagaaagtgc 4860 agagacttac atccgagttt tggagtcgtt ggcgcaatga gtatattgcc acactgcagc 4920 cacgtggaaa gtggagaaat cgccaagaca acatcaagcc tcatcagctc gtattggtca 4980 aaaatgataa tacgccgccg accgcctggg aattggcccg ggtagttgaa gttcatcctg 5040 acaagcaagg cttggtacgc accgtaacac tccgccgagg aaaatcggaa taccaacgtc 5100 cggtacagaa actgtgtccc cttccggatt aaggcttcgc ctcaaggcgg ggagga 5156 // ID Gypsy-607_AA-LTR repbase; DNA; INV; 423 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-607_AA_; KW Gypsy-607_AA-I; Ty3_gypsy_Ele57; Gypsy-607_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-423 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 423 BP; 158 A; 99 C; 60 G; 106 T; 0 other; tgtagtatac caagtagagt aataaatact gtttagcatc aataaatact gattagtttt 60 agcaagcact aattacctcg tttaggtcag taatttccaa gccatcagca acaaaccatt 120 tttttgcaaa ctaccaaaaa ccccgaccgc cataaaccgc ttgtttatga cagcctcaag 180 aaaaaaccca aaccatccat ccaatgccaa gagctacccg ctcttgagca ccgttattta 240 taaacacaaa caatacacaa taaacagtta cctgaacaaa tgtaatctag tatttaagac 300 cttaggtaaa taaacgaaga cacagtctag tgacctgaac ccttggagtg tggacacgct 360 gtccaacgta tttaaatacg agtatcacaa tcctcgacaa tcagaaaaat caattgtgta 420 cca 423 // ID hAT-82_HMa repbase; DNA; INV; 3525 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-82_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3525 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 795-795 (2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 572..3355 FT /product="hAT-82_HMa_1p" FT /translation="MNEPKKRKGGAQKLREKNQKLMKLSANKCLKISDFIP FT NRNPDISKADEKVTTKSKNDVQIQINSNNNTTSTTGTESEDITATNEVLNS FT DDVAVENLSECTQNEIQVQIDSNISIAETVSHHPACDSDMVVDQGVIIVEQ FT NIDWFSRPPSTLLNKFFEIHPIQTTRDPIIVKSFVRKNGTNRKWLSYSEGN FT DALYCSICLAFLPTVVMSSFVSGMTDRRHIHQRVDEHEKSASHNECTENYF FT QHVSRSNVSNLLFSTQISVRREQVQKRRLVLLRVIEIIKVIGKRGLSYRGH FT EFEAAYTLSDNSIDHGNFLEMVLLVSKYDTCLKEHLNSCIEQSEKRKNTGS FT TGRGSMVTFLSKTTITKLIQITSRSIQKIIAKEINSAGMFSVQIDTTEDIT FT TKNQCSIVVRYVTDQIHERLLTMIACDSSTGEYLSSLVSETLKALNIDILK FT CVGNSTDGAANMNGEYNGFAAWMFKENPNQIHVWCYAHVLNLVISDTCSVV FT LSSISLFNLLNDVAVFIRGSYTRMNIFETENQNDIHNRRISTIGETRWWSK FT DAALTKIFGTFANPHSSFYVELVVALSHIEDNLSFKSEVRVKAKGFKEGLL FT KYDTILTAQIFLRVFQLTTPLSKYLQTVNMDILVAYNMVTQTIDTLKKYLE FT EFDIVLEATNNFVCKVNEKLEQHELEAENTLPEKRIRKKKVLSGETRPSEF FT EIHSDVLNSYKVKVHDVIFKSIIESMSKRFKKSRELYADFAILSPSNFEQL FT KAHSVPTSALNKLSEHLLPFDEEATAENLRTELTSFASNWNNIKESLLSTY FT DIMDINKESVEELEYNCRKSCTACKNCTVCCYLVLIQYNLISEAYAKIGLA FT YKYTLSLSFTQVACERSFSILKFIKNRLRSSMAQENLQSFMLMNTEKEILS FT NLDPETIIDELAETSDLFKNLLVV" XX SQ Sequence 3525 BP; 1333 A; 530 C; 596 G; 1066 T; 0 other; cagaggcgga ctggccatat gggcgtaccg ggacaatgcc cggtgggccg gtacaaaagt 60 gggccgatag gctggtaaaa aaattaggat gattaatcaa aaaattcaaa tgaaactaat 120 ataatacata tttgaactta atgtatgttc tttacaaaga attatttgta aaattagtga 180 cgtacgtatt ttttaatcct tcattaagtg taatgcataa tacatacatg catacataca 240 tacatatata acatatatac atgcatacat atggtacaaa aatgcaaaaa ccaaaaaaaa 300 tgaaaatgca aataaaaata ctaatcaaat tcaaagacca ttggcaatag gaaaacacga 360 aatctaatcc aaaaaaaaaa attaacaaat cactatagag tcttcaataa acccaaacgc 420 tgtcatagat accaatttcg aattatgcta accaatcaaa attgtaagaa gatgaaaacc 480 ccgcgcaatc tcaatttatc ttgaaaatca ataatataag cataggaaaa taatatcgtt 540 taaaaactat atataataat agatttttaa aatgaatgaa cccaaaaaaa gaaaaggagg 600 agctcaaaaa ttacgagaga aaaaccaaaa attgatgaaa ttatcagcta acaaatgtct 660 taaaataagc gatttcattc ccaacagaaa tcctgatata agtaaagctg atgaaaaagt 720 gacaacaaaa agtaaaaatg atgtacagat ccaaattaat tccaataata acacaacgtc 780 aacgactgga accgaatctg aagatattac agcaactaat gaagtactaa atagtgacga 840 tgtagcagtt gaaaatttgt cagaatgtac acaaaatgag attcaagttc agatcgattc 900 aaacattagc atcgcagaaa ccgtcagcca tcatcctgct tgtgatagtg atatggttgt 960 tgatcaaggt gtaattattg tagaacaaaa tattgactgg ttttctcgtc caccatcaac 1020 acttctcaat aaattttttg aaatccatcc aattcaaact accagagatc ctattattgt 1080 gaaatctttt gtacggaaaa atggaacgaa tcgtaaatgg ttatcttact ctgaaggcaa 1140 cgatgcatta tattgttcaa tatgtttagc atttttacca actgttgtaa tgagttcttt 1200 tgttagtgga atgacagata gaagacacat tcatcaaaga gttgatgaac atgaaaaatc 1260 agcatcgcat aacgaatgca cagaaaacta tttccaacat gtatctagat ccaacgtctc 1320 aaatttactg ttttctactc aaatttctgt acgtcgagag caagttcaaa agcgacgtct 1380 tgtgctcctg cgggtgatag aaattattaa agtaattgga aaacgtggtc ttagttacag 1440 gggacatgag tttgaagctg cttatacatt gtcagataat tcaattgatc atggaaattt 1500 cttagaaatg gtgttacttg tcagtaaata tgatacttgt ttaaaggaac atttaaatag 1560 ttgcattgaa cagagtgaaa aacggaagaa cacaggaagt actggaagag gttccatggt 1620 tacttttctt tcaaaaacca caataacaaa actcattcaa attacgagtc gttccataca 1680 gaaaatcatt gcaaaagaaa ttaactcagc tggtatgttt tcagttcaaa ttgatacaac 1740 agaagatata acaactaaaa atcaatgctc aatagttgtg aggtatgtaa ctgatcagat 1800 tcacgaaaga ttattaacta tgattgcgtg cgactcttca actggcgagt atctgtcgag 1860 tttagtatct gaaactttaa aggcattgaa cattgatatt ttaaagtgtg ttggtaattc 1920 aacagatgga gctgcaaaca tgaatggtga gtataacggt ttcgcagcat ggatgttcaa 1980 agaaaatcca aaccaaatac atgtatggtg ttatgctcat gttttaaatt tggttatttc 2040 agatacttgt agtgtcgttt tatccagtat ttccttgttt aatttgttga atgacgtagc 2100 agtgtttatt cgtggatcat atacaagaat gaatattttt gaaacagaaa atcagaatga 2160 tatacacaat cgcagaattt caactattgg tgaaacacgt tggtggtcaa aggatgcagc 2220 gttaacaaaa atatttggta cctttgccaa tccgcattct agtttctacg ttgaacttgt 2280 agttgcattg tctcatattg aagacaattt aagtttcaaa tctgaagttc gagtaaaagc 2340 aaaaggcttc aaagaaggac tccttaagta tgacacaatt ttaactgcac aaattttttt 2400 acgggttttt cagttgacga caccgctctc aaaatattta caaacagtaa atatggacat 2460 tttggttgca tacaacatgg ttacgcaaac gatagacaca ttgaaaaagt atttagagga 2520 atttgatata gtattggaag ctactaataa ttttgtctgt aaagttaatg aaaaacttga 2580 acaacatgaa ttagaagcag agaatactct accagaaaaa agaattcgaa agaaaaaagt 2640 attatcaggc gaaactcgtc caagtgaatt tgaaattcat tctgatgtac ttaactctta 2700 taaagttaaa gttcatgatg taatatttaa aagcataata gaaagcatga gcaaacgttt 2760 taaaaaaagt cgagaacttt atgctgattt tgctattcta agtccttcaa attttgaaca 2820 gttgaaagct catagtgtac caacatctgc tttaaataaa ttgagtgaac atctattgcc 2880 gtttgatgaa gaagctacag cagaaaattt acgtactgag cttactagtt ttgcttcaaa 2940 ctggaataat atcaaagagt cacttttgag tacatatgac ataatggata taaacaaaga 3000 aagtgttgag gaacttgaat ataattgtcg aaagtcatgt actgcctgta aaaactgcac 3060 agtttgttgc taccttgttt tgattcaata taatttaata agtgaagcat acgcaaaaat 3120 tgggttagct tataaatata ctttgtcact ttcatttacg caagttgcat gtgaaagaag 3180 tttttcaatt ttaaagttca tcaaaaatcg tttgcgtagt agtatggctc aagaaaattt 3240 acaatcattc atgctcatga atactgaaaa ggaaattcta agcaatcttg atcccgaaac 3300 aataattgac gagctagctg aaacaagtga tctgtttaaa aatcttttag ttgtttgaaa 3360 attcttaatt agttgaatat ttaaaataaa catatttgtt cttaaaaaat ttgtttaaaa 3420 atttcatagt taaattataa aaaaaatagt taattaattt tcctctaatg tgggccggtc 3480 caatatgaaa tgcccgggcc gatttttagt cccagtccgc cactg 3525 // ID BEL-135_AA-LTR repbase; DNA; INV; 556 BP. XX AC supercont1.4; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-135_AA_; KW BEL-135_AA-I; BEL-135_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-556 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.4; Positions 3158251 3158806. XX SQ Sequence 556 BP; 204 A; 94 C; 89 G; 169 T; 0 other; tgttgcggcg ccgcctggca acgctgcaca actgctggaa tctgaaccat tatctactca 60 caccaagcaa tatgataatg tcaaaccggc gatcgccttt gacgacggag aagtagatga 120 caacaaaccg tgataaaaag tctaactctg aaacgtggag atctaaagct atttaaatta 180 aagtatatta taaatatctg aaagctgaat tcattaaaat agttaatgct agtgatttat 240 ttcctttctt acgctactgt gaattaatta atgctaaagg tgaatcgtaa gaccttaatt 300 aaagtttgta accctaaata acctaaaatt gatccctaac aggtcaatat aatcgtattg 360 ccggaataat tgaaattgat actcacctaa aaaagaattg tcataaattt gtaagtttaa 420 taggagaaat ttatcgagct aagaatgagt aaatgagctt atataatttc agtatacgtt 480 acatcctctg ataccgttac cgtttataat taccttaaat ataacaagct tacactaaat 540 tgtaagtatg attaca 556 // ID Sola1-1_ACas repbase; DNA; INV; 2944 BP. XX AC . XX DT 28-MAR-2011 (Rel. 16.03, Created) DT 28-MAR-2011 (Rel. 16.03, Last updated, Version 1) XX DE Sola1-type DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola1-1_ACas. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-2944 RA Jurka J.; RT "DNA transposons from the Acanthamoeba castellanii genome."; RL Direct Submission to RU (23-MAR-2011). XX DR [1] (Consensus) XX CC >94% identical to consensus. XX FH Key Location/Qualifiers FT CDS join(787..963,1195..2874) FT /product="Sola1-1_ACas_1p" FT /translation="MYALVITLVPGCWCTFARALRELARSSVADMAEKLGR FT PEAAWREDIVRKCVLNVGGTTKFVDPYPDAHRXXXVXXXKTRCASGFXICP FT VFSSTPLLLVVSSLVWSSKKHKCSPIQKNHIYPLCGRWESEVSHQMFKYLS FT SVXQSFSTSMSHSLPTNTEVLASTLLSPIFGQSLRATAHTPRKRVLRTAEE FT RIPKKRRLLVSNDHINTVLASFCCRKKCLDNFSFADILNITQPFADLNERD FT RTTXIINIMSASCVSTLPECHFKLRLNGSPVCLNAFQKVPTLFIAALHLLS FT HLTLLLYKVHGFSRKKWRSAKQIFISGGRVPIHNTTGQQRPSNATSNTHLW FT LTTFFDVIGDRLPDATIHLPISLTWREVHSMCANAHPSTVPVLTYSAMLKH FT IASHFAYVKLPKNSRLGKCSACIDFAQQRLRAKSPLEAAQFASARTHHLAL FT SSAERLSYKDRCHQATSRPSLYMSLIIDYSNPLPLPTHTPVPKAWMHFGNR FT FTMVLGGLIDHSHGKHLFLHPQPFWPKDANLVISTLFHHIRTRXLTNTASN FT TRPSILYLQADNCAAENKNVFMLAFLSLLVSLDMFQEVYLSFLLVGHTHED FT IDQLFSTAQTKFHTSSIHTPR" XX SQ Sequence 2944 BP; 626 A; 910 C; 581 G; 807 T; 20 other; gagtcgacaa ataaaaaacc cgatgcgcat cagggaaaag agtcgacaaa taaaaaaccc 60 gatgcgcatc gggaaacaga gtcgacaaat aaaaaaaccc gatgcgcatc gggaaaagag 120 tccacaaata aaaaaaccga tgcgcatcgg tttggaactt gtcagtgtgg tgaacagtaa 180 aaacacacaa agtgtacata cctagctcca ggtcgggcag ggcaagtagt ctggcggcct 240 ccttgcattg ctcttcagtg ttgccttcag agtactggcg caccaccggc tgcttgtgag 300 gcttggtctt gatctcctcc gcctccatct ccttgtccac gtcaatcacc nctgggcgtg 360 cgcgtttcct ctccttgcgc tcttcctcta ccagcttgcg cttcctcggc ggcgcttggc 420 gctcctcctc ctccagctcc tcctcctcct ccagctcctc ctccncctcc aactcctcct 480 cccgctcctc ctcccacttc ctctccctcc tcctcctctn tctcctcctc cactatctcc 540 tcctccnnct cctcctcctt ctgctcctcc ccctgctcct ccccctgctc ctcctcctcc 600 tcctcctgct cctcctcctc ctcttcctcc ttntcctcca tctgctcctc ctcctcctcc 660 tcttccacgt ccattggctc tttcgctccc tcctctgctg tcgttggcgt acactgcgtt 720 agcagtttga gagactgctg ctttgtgcag ccagtgtctt ggccccgcgc ctgcagcaac 780 ccgtcgatgt acgccttggt gataaccttg gtgccgggtt gctggtgcac cttcgccagg 840 gcattacggg agcttgctag aagctcggtg gccgacatgg ccgagaagtt gggaaggcct 900 gaggcagctt ggcgagaaga cattgtaagg aagtgtgtgt tgaacgtggg tggtacaaca 960 aaatgatcaa aacaggtttt tcgacgtttt ataggtagtg aaggtctaca aaaaagctgt 1020 ccgcattaaa ttctaattgg agggactgga aacaggtggc gaaatttcat gtggtctgga 1080 gagctttgag atcttcccga tgcgcatcgg gttttttatt tgtcgactcg ctacccgatg 1140 cgcatcgggt tttttatttg tcgactcgct tcccgatgtg catcgggttt ttaatttgtc 1200 gacccgtatc ccgatgcgca tcgggnnnan gntgtcgncc ngtnnaaaac ccgatgcgcg 1260 tcgggttttt naatttgtcc agtattttcc tcaactccac tgttattagt tgtgagctca 1320 ttggtctgga gttccaagaa acacaaatgt tcgccaatcc agaaaaacca catctatccg 1380 ttatgtggcc gttgggaatc ggaggtatca caccaaatgt ttaaatacct ctccagtgtc 1440 atncagtcgt tctccacctc aatgagccac tctctcccca ccaacacnga ggtcctcgcc 1500 tctaccctac tatccccaat cttcggccag tccttacgag ccacagctca tactccaagg 1560 aagagagttc tccgaactgc tgaagagcgg atccccaaaa agcgtcgcct gcttgtcagc 1620 aatgatcaca tcaatactgt tcttgcatca ttctgttgtc gcaagaaatg ccttgataac 1680 ttctcctttg ctgacattct caacatcaca caaccctttg cagatctcaa tgaacgcgat 1740 cgcaccacct ncatcatcaa catcatgtca gccagctgtg tctcaacact acctgagtgc 1800 cacttcaagt tgcgactcaa tggttctcca gtgtgtttga atgctttcca aaaggtgcca 1860 acccttttta ttgcggcctt acatttgtta tcccacctca cattgctgct ctacaaggtg 1920 catggtttca gcaggaagaa gtggcgttct gctaaacaga ttttcatttc tggtgggcgt 1980 gtcccaatac acaacaccac tggacagcaa cgtccctcaa atgccacctc aaatacccac 2040 ttatggctaa ccactttctt tgatgttatt ggagatcgcc ttcctgatgc caccatccac 2100 ttacctattt ctttgacttg gcgagaagtg cactctatgt gtgccaatgc tcatccctca 2160 actgttcctg tgctcaccta ttctgctatg ctcaaacaca ttgcttccca ctttgcctat 2220 gtgaagttgc ctaagaactc acgccttggc aagtgctctg cttgcattga ctttgcacag 2280 caacgcttga gagccaaaag ccctcttgag gcagcacagt ttgcaagtgc acggactcac 2340 caccttgcac tttcttctgc agaacgcctc agttacaagg accgatgcca ccaagcaaca 2400 agtcgaccaa gtctttacat gagcctgatc attgactact ccaacccact ccctctcccc 2460 acccacactc ctgtgccaaa ggcctggatg cactttggca accggttcac catggttctt 2520 ggtggtctca ttgaccacag ccatggcaaa cacctctttc tacaccctca gcccttttgg 2580 ccaaaagatg ctaaccttgt catctccaca ctctttcacc acatccgaac tcgcnttttg 2640 actaacaccg cctccaacac tcgcccttct atcctctacc tgcaggctga caattgtgct 2700 gctgagaaca agaatgtctt catgcttgcc ttcctcagtc tcctggtcag cctggatatg 2760 ttccaagagg tctacctgag ctttctgttg gttgggcata cacatgaaga catcgaccag 2820 ctcttctcca ctgctcagac aaagttccat acctcttcca ttcacacccc aaggtaatgg 2880 tcatcactct ttttttgtgt tgattgtttt cccgatgcgc atcgggtttt ttatttgtcg 2940 actc 2944 // ID Crack-18_BF repbase; DNA; INV; 2725 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-18_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-18_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2725 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2725 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 823-823 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..2541 FT /product="Crack-18_BF_2p" FT /translation="MMLDRATCEQQETYVLGDFNIDWIQNSHASKRLKEMT FT ESYSMQQLVGVPTRAVNRGQTVTTSCIDLVFTNQPDKCDNLRLQTIGFSDH FT DAVIFTRKSKIHIKGPRTVHKRSYKLFKEEDFLSDIREAPWHLVYNFDDVN FT EATDMFTDILQGICDNHAPIRKFTVRSNAAPWLTPDIQEVMKLRDEAKRGA FT KATGLESDWAVFRKLRNRVVSLCRKAKTRFYRETFESCSGNARQTWNTINS FT LLGRKHTVSPTSIVADGKLLTKPMDIADHFAGYYEDKVANLRSCMDRTDAH FT GSPENSQSSNSHVLSGQPDNLNVLPGLLSDNSHVLPGLSSDNSHALPGLST FT SDSSHVLPSQSSFTFDPVTDKDVHKILLQLQDCKAPGQDGIDNKLLRISSD FT TIAGPMAYIINLSFTTRTFPTRWKHAKIIPIPKDNRQSLSGPNSRPVSLLP FT ACGKICERIAAEQIASYLLRSNCQSEAQHAYRKLHSTETALLKMTDDWLDN FT MDNGLMTAVVLLDYSAAFDLVDHALLLKKLTTYGFTEDALSWTESYLTNRK FT WQVYTNGAYSKQRTLHCGVPQGSCLGPQLYNIYVNDMPEVVQNGLLDQYAD FT DSTIHTRGHTLDDIRTETKIDLERVATWSDENMLKLNNGKTKVMLVGTAQK FT TRKAPPLELELRGQRLEQCPTVKLLGLHLDQNLTWDSHVTQIVKKCNRSLA FT QVGRVKDLLPRQQRIAVVNALIIPHIDYCCSVWGNTTQNNIRRLQVVQNRA FT ARLALGCDRYTHVDSMLKTLGWLTVRDRIQQKSLATFERIMLTKQPKALHD FT KITFQSQIHNYRTRSMNTIRLPKPKSNSQKRRFLYRMSQLWNSN*" XX SQ Sequence 2725 BP; 887 A; 610 C; 607 G; 621 T; 0 other; atgatgttag atagagccac atgtgaacaa caggaaactt acgttttggg tgactttaat 60 attgactgga ttcaaaactc acatgcgtca aagagactga aagaaatgac ggaaagctac 120 agtatgcaac aactagttgg tgttcccaca agagctgtaa acagaggcca gacagtgaca 180 acatcttgca ttgatcttgt tttcactaac cagccggaca aatgtgacaa tttgagacta 240 caaaccatcg gtttttctga ccacgatgct gtgattttca cgcgcaagtc caaaatccac 300 ataaagggac cgagaacagt acacaagagg tcatacaaat tgttcaagga ggaggatttt 360 ctgtctgaca tacgtgaggc gccgtggcac ctggtctaca actttgatga cgtcaatgag 420 gcaacggaca tgttcacaga catcttacaa ggcatatgtg ataaccacgc ccctatcagg 480 aaatttacag tcagatccaa cgcagcccct tggctaaccc cggacataca ggaagtgatg 540 aagctaaggg acgaggcaaa gagaggagcg aaagcaactg gtcttgaatc tgactgggca 600 gttttccgaa agctgaggaa cagggttgtg tcattgtgcc gcaaagcaaa aacacgtttt 660 tacagagaaa ccttcgaatc gtgctcggga aatgcgagac aaacatggaa taccatcaac 720 tctcttttag ggagaaaaca cacagtgagt ccaactagca ttgtggcaga tgggaagctc 780 ctgaccaaac ccatggatat agcagaccac tttgctggct actatgagga taaggtcgcc 840 aaccttcgtt cttgcatgga ccgcacggat gcccatggat cacctgaaaa ctcacaatcg 900 tcaaactcac atgtgttgtc tggtcaacca gataacttga atgtgctgcc tggactatta 960 tctgataact cacatgtgtt acctggactt tcatctgata actcacatgc gttgcctgga 1020 ctatcaacat ctgatagctc acatgtgttg cctagtcagt catctttcac atttgatcct 1080 gtaaccgaca aggacgtgca caagatcctc ttacaacttc aggactgcaa agcaccaggt 1140 caagatggaa tcgacaataa actgttaagg attagttcag ataccatagc tggaccaatg 1200 gcatacatta ttaacctttc atttactaca agaacctttc ccacgagatg gaaacatgct 1260 aaaatcatcc caattcctaa ggataatagg cagtcactat ctggaccaaa ctcgcgacct 1320 gtgagtctac ttccggcttg tgggaagata tgtgagagaa tcgctgctga gcagattgca 1380 tcctacctac ttcggtcaaa ctgtcaatct gaggcacaac atgcctatag aaaactgcac 1440 tcaactgaaa cggctttact gaagatgaca gacgactggc tggacaacat ggataatggg 1500 ctcatgacag cggtagtcct actagactac agcgcagcat ttgatctagt agaccacgct 1560 cttcttctaa agaagctgac gacgtacggg ttcactgaag atgcgttgtc ctggactgaa 1620 tcgtacctga cgaaccgcaa atggcaagta tatacaaatg gggcctacag caagcagaga 1680 accttgcact gtggggtacc acagggcagc tgcctaggac cacagctgta taacatatat 1740 gtaaatgata tgccggaggt ggtacagaat ggattattag accaatatgc ggatgatagc 1800 acgattcaca cgcgaggaca cacattggat gatataagaa cagaaaccaa aatagaccta 1860 gaaagggtgg caacatggtc agatgaaaac atgctaaaac tgaataatgg aaaaaccaaa 1920 gtgatgcttg tgggaacagc acagaaaacc aggaaagcgc cacctttgga gctggaactt 1980 agggggcaac gtttggaaca atgccccaca gtaaagctac tcggactgca ccttgaccaa 2040 aacctcacat gggacagcca cgtaactcaa atagttaaga agtgcaacag aagcttggca 2100 caggtgggga gagttaagga cttattacca cgtcagcaga gaatagccgt agtcaacgcc 2160 ttgataattc cacatataga ctactgctgc tccgtttggg ggaacacaac acagaacaac 2220 ataagaagac tacaggtggt tcagaacagg gccgcaagac ttgctcttgg ttgcgatcgt 2280 tacacacatg tggactctat gctgaagacc ttaggctggc tgactgtaag ggacaggatt 2340 caacagaaaa gccttgccac tttcgaaaga ataatgctca ccaaacaacc aaaagctctg 2400 cacgacaaaa tcacctttca gtctcagatt cacaactacc gaacacgctc aatgaacacg 2460 atcagactac caaaaccaaa atccaactct cagaaaagac gtttcctgta cagaatgtca 2520 cagttatgga acagcaacta gcaagtattg gactgcaaaa ttggactgta aatatcgacg 2580 atgagaaatt gtgttacaac tgctatgtat attactttgt atgtttttaa ctgtatgtaa 2640 tgtgtaaata tgtatgtgac catgaccaca ggaagaatag cctctgagga ggctaaatgt 2700 ggatcataat aaactcaaac tcaaa 2725 // ID AvPB2 repbase; DNA; INV; 2333 BP. XX AC . XX DT 17-AUG-2005 (Rel. 10.08, Created) DT 18-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE PiggyBac-like DNA transposon from Adineta vaga - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; AvPB2. XX OS Adineta vaga OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Adinetida; Adinetidae; OC Adineta. XX RN [1] RP 1-2333 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(458..1756,1756..2181) FT /product="AvPB2_1p" FT /translation="MNXRIEEDSESDDVNDEQSGSEIENDESQTTDADDEI FT DLEDLKVSGDSTRKSSYTSKSGMIWSSISSTSTKTKXFSGNIEKSGPTKLT FT ENIASIEDAFICLMSEKILQKILIYSNMEYERNTKLDEKEEITMLELKASI FT GLLLLAGLLGRSKSDLHSLWKTSPLESPIFKATISKSRFDKIIACLRFDDK FT STREERKKADKFAAIREIWLDFQDKLKTCYTPGLNITIDEQLLGFRGKCPF FT RQFIPTKPDKYGLKFWLCVDAESYYVLNAFPYIGRQPGQEKQAHVGESVVL FT ELLKPFYGSNRNVTMDNFFTSVPLAKNLQTKNLTLIGTLRKNKPEIPIEFL FT SSKIREIGSSLFGFEDNLTLVSFVPKKNKAVLLLSSXHHDNHVDNKTGKPV FT IILDYNKTKGAVDTVDQMCHKYTVINIFNFLSFNFVICRVKRGTKRWPLCI FT FYGMIDMAALNAFILWKSKNPVWNENKRYQRRLFLEELGLSLVTPLLDFRS FT KTSNFLHKDIQNALLIVGHPVSKRDSQKSDEDSAQSKRKRCSICETSKDRK FT TSNKCYNCSAFVCSEHCVKQIFCINCSK" XX SQ Sequence 2333 BP; 835 A; 364 C; 388 G; 732 T; 14 other; ccctagtact cccaacatgg gtcatttttg acccagacat aaaaatttag aaatgcagta 60 taatattttg atagtttcgg gtttaaagga ayatctatca tctattatga acatagarct 120 gttgcatagt ctgaaatraa actyattttc cctagttcat ctcatttcaa gtcttcaata 180 caatatagta caatttcatc agtattgaac ataatactag gtacaagagc aaattaaatt 240 tttggataaa atctaagact catacaycat ttctactgta tatktatccc caatctgctt 300 agagatattt attgattgga ctttggacta tccagttatg atgatatatg attcaccyga 360 ttagattatg aatcgaaaat atccatctat catctagtay ctttccatat ttttattttc 420 cagattcaga tttcttttcg tattataaaa aacgatcatg aacartcgaa ttgaagaaga 480 tagtgaatct gacgatgtta acgacgaaca aagtggttca gagatagaaa atgatgaaag 540 tcaaacaaca gacgctgatg atgaaatcga tttagaagat cttaaagtat caggcgattc 600 tacacgaaaa agctcataca cttcgaagtc cgggatgatc tggtcgtcga tttcatctac 660 ctcaacgaaa acaaaamcat tcagtggtaa tatcgagaaa tctggaccaa ccaaacttac 720 tgaaaatatc gcatctatcg aagacgcatt catttgtctt atgtcggaaa aaatcttgca 780 gaagattttg atttattcaa atatggaata tgaacgaaac accaaattag atgagaaaga 840 agaaataaca atgctggaat tgaaagcttc tattgggcta ttattgctcg ctggtttatt 900 aggaagatca aagtcagatt tacattcctt atggaaaaca agtcctttgg aatctccaat 960 atttaaagct actatttcaa aaagccgctt tgacaaaatt attgcgtgcc ttagatttga 1020 tgataaaagt acaagagaag agagaaaaaa agcagataaa tttgcagcaa ttcgtgaaat 1080 ctggttagat tttcaagata aattaaaaac atgctataca ccgggactta acattaccat 1140 cgatgaacaa ttactaggat ttcggggaaa atgtccattt cgtcaattta ttcctacgaa 1200 acctgacaaa tatggattaa agttctggct atgtgttgat gcrgagtcgt attatgtgtt 1260 aaatgcattt ccttacattg gacgacagcc cggtcaagaa aaacaagcac acgtaggtga 1320 aagtgtggtt ttagaaytgc tcaagccatt ttatggttca aacagaaatg ttacaatgga 1380 taatttcttt acaagtgttc cattggcaaa aaatcttcaa acaaagaatc ttacgcttat 1440 aggaacrtta cgaaaaaata aaccagaaat acctatagaa tttctatcta gtaaaattcg 1500 tgaaattggt tcttcacttt ttggttttga agataattta actttagttt ccttcgtgcc 1560 gaaaaaaaat aaagctgtct tattactttc ttcaaamcat cacgataatc atgtggacaa 1620 taagactgga aaaccagtta ttattttgga ttataataaa accaagggag ctgttgatac 1680 tgtcgatcaa atgtgccaca aatatacagt aataaatatt tttaattttt tatcttttaa 1740 ttttgtcatt tgtaggtaaa aagaggtaca aagcgatggc cgctttgtat tttctacggc 1800 atgatagata tggcggctct aaatgctttt atcttatgga aatcgaaaaa tccggtttgg 1860 aatgaaaata aaagatatca acgacgatta tttttagaag aactcggact tagtctggta 1920 acacctttat tggattttcg ttcaaaaacc tcaaattttc tacacaaaga tatacaaaat 1980 gctttgctta ttgttggtca tcctgtatcg aaaagagatt cacaaaaatc agatgaagat 2040 tctgcacaaa gcaaacgaaa acgatgttca atatgtgaaa catctaaaga cagaaaaact 2100 tcaaataaat gttataattg ctctgcattt gtttgtagtg aacattgtgt aaagcaaatt 2160 ttctgtatta attgttctaa ataaagaact aacatactgt cagtattttt ctatcaagct 2220 cgatataaga gtagcaccct tttgacaaga tagtaacctt aaaatcgaga aaatcatttt 2280 acagaagcgt aaattgctat tttagtgcgt ataagagcac gggagtacta ggg 2333 // ID I-13_AAe repbase; DNA; INV; 5497 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-13_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5497 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1368-1368 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 201..1658 FT /product="I-13_AAe_1p" FT /translation="MVGNEGSSSQMPPIIEEMSSDHPSVADPQLIGTDPQP FT TASRHRCYPASAKGPYEVFLRQKDKPLNVLLISAELYSNFKTVKEVRKIGF FT SKVRVVLTNREEANNVALNERLSRLYRVYVPCNLVEIDGVIQQADMDLSYL FT KDEGVGKFKDPRIPSVPVLECNQLAIASDDENGKSFKPSYSIRVTFEGTIL FT PDFLEIRNVLIPVRIYSPKPMLCNKCKRYGHTEPLCANKARCGKCGKSHHD FT ATCPIQENICLHCKKSHESHRDCSAYKKRAQFATARIWQKSKLSYAETVGN FT LPNSDFCHENPYASLSELSDIDENESVCSRSESLKRKKLTRAKKSKRTKSS FT DHQPDSILSSQNVMNVPTSKSSNHQPCVAPLASRQQHQSVSSHPPFLDFPS FT TSRLSQPSRKPPEFQYKESDFPNLPGVASSTENSSTAVGLTVTQIVKLFSE FT IFQLTPAWTDILLKLTPILNILLKKLISSWPLLGLLVSIDG" FT CDS 1654..5325 FT /product="I-13_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANSLLNQKFSVFQWNCRSILPKIANLRCFLLKLNVD FT CFALSETFLTPDLNFYLQNYNILRNDRDIRAGGVMLGIKNCYSFRRIDLPN FT VNPIEVVAAEIEIDNMVFSIASLYIPPSARLIPSVLRTLVSKLSHPVFLLG FT DFNSHGCAWGEDRDDSRANMIYDILDEFRLSVLNTGKVTRIACPPSCNSRV FT DLSLCSSNLSLSCVWDVVDDPAGSDHLPIHISFCTNSKRELVVPYDLTKHI FT DWSKYSSTILTEIENVSVQEPDLHGFLTNLILMAAENSQTKPIRSSLSKTK FT APTIWWDQECTDIVKQRSDAFKMFRKFGLMQDFLNYKKLEAKSKMIFKNKK FT RNFWKQFVESLTKDTAMPILWQTARRLRNFNPSPSDIREFDSSWIDRFAEN FT VSPCFVSTQPPLFIENDTVSMDLVEDFSFTDLENALVSCNNSSPGMDGIKF FT SLLKNLPTPAKELLLRSYNWIFKNKSIPVAWRNIKVVAILKAGKDPGISQS FT YRPIGLLSCTRKILEKMILVRLEVWAEKNQLLSPTQYGFRHAKSTRDCQAL FT LATDIQIAFRSKQALTAVFLDIKGAYDSVLIDILCHKLCKLGVPVQLADFL FT FNLLSLKTMHFYQNFQVKHVRNSCLGLTQGSSLSPLLYNLYTSDIDSSIES FT GCVLVQYADDACLWATAKDVKFSEGTLQTSLNNLAIWAQNLGLEFSASKTE FT IVVFSNKRTLSPLQLHLYDLRITQSLCYKYLGLWFDSKCTWRIHVQYVVRK FT CQRRINFLRTIAGTWWGAHPSDMIKLYKTTILSVIEYGSFIFHMTAKSHFI FT KLERIQFKSLRICLGLLNSTHTQSVEVIAGVLPLNIRFLELNCRYLANCHI FT TSSRVMIQLDELFRIHPQSNFLRSYRECLVLPTSPSSDNFYDYEVPARPFE FT IDVNTSLVQNLANLPKPLLPCQAKQAYLENLSNINAEEMFFTDGSLGVDNC FT GFGVYNCNFEASFQLEFPCSIYKAEMLAVKFTVDYIKSLPPKRYLICSDSL FT SSIESLKNIRVSTKTSLIALYIKETVAWLAKRQYTITFMWVPSHCGIEGNE FT FADALAKQGARNGDVYQHCPEARDFYAIIRQTALSSWQDSWSNSELGRYCY FT SILPKVSLSSWFSTLQCNRLFVKNMSRLISNHYTLKASLFRVNIVNSNLCD FT CSLSYEDIDHIIWQCEKFEETRPSLLQQLERENRAPQPIRDVLGNMDILYM FT QIVNNFIKKSKIQL" XX SQ Sequence 5497 BP; 1649 A; 1185 C; 1054 G; 1609 T; 0 other; cagtcgctct tcaactctcg atcgaatcgg ttgtgtttag cctttattct gctttcgaac 60 tccgtgcggt gttattgtgt gaaggtcaat tcgagggact catttcaaag ctaagtatct 120 attgtgcttt ctattgtcgt gggaacgcat tgtttcgcaa cccattactg gttgcacgag 180 attggtctcg gtagcaaaat atggttggca atgaggggtc ttcctctcag atgccaccaa 240 tcatcgaaga aatgtcttcc gatcatccat cagtagccga tcctcaactg attggcacag 300 accctcagcc tactgcttcc cgccatcgtt gctatccggc atcagcaaaa ggtccatatg 360 aagtgttcct ccggcaaaaa gataaaccgc ttaacgtgtt gttgatttct gcagagcttt 420 acagtaattt caagactgta aaggaagtcc gaaagattgg tttcagcaag gtccgcgttg 480 tgctgaccaa tcgtgaagaa gcaaataatg ttgctcttaa tgaacgtttg tctcgcctct 540 atcgagtata tgttccatgc aatttagtgg agatcgatgg cgtgattcag caggctgata 600 tggatctgtc gtaccttaag gatgaaggtg ttggcaaatt caaagaccct cgtatccctt 660 cggttccagt tttggaatgc aatcagctag ccattgcatc tgatgatgaa aatggaaaat 720 cgttcaagcc ctcctattct atccgggtga cgttcgaagg aacaatcttg ccggactttc 780 tcgaaatcag aaatgtcttg atccccgttc gaatctattc tccgaagcct atgctttgca 840 acaaatgcaa acggtatggc cacacggaac cactctgtgc gaacaaagca cgttgtggga 900 aatgtggaaa atctcatcac gatgccacat gtccaatcca agaaaacatc tgcctccatt 960 gtaaaaagag ccatgaatcg caccgcgatt gctctgctta caaaaaacgt gcccaatttg 1020 ccacagcacg aatctggcaa aaatccaaac tcagttatgc tgaaacagtc ggtaatttgc 1080 ctaactccga cttctgtcat gaaaacccgt atgcgtcgct cagtgaactg tcagacattg 1140 atgaaaatga gtcagtatgt agccgttcgg aatccttgaa acgcaaaaag ttaacccgag 1200 caaaaaaatc aaagagaacc aaatcctccg atcatcagcc agattccatt ctgtcgtctc 1260 aaaatgtaat gaatgttcca actagcaaat cttccaatca tcaaccatgt gttgcaccgt 1320 tggcttcgcg acaacaacat caatccgtta gctcgcaccc accgtttctt gattttccat 1380 ccacctctcg tctgtcacaa ccctcccgga aacctcctga gtttcaatac aaagaatccg 1440 attttccaaa cctccctgga gtggccagct ccacagaaaa ctcctctaca gctgtaggtt 1500 taacagttac ccagatagtt aagctatttt cggagatatt ccagctaact ccggcttgga 1560 cagacatcct cctgaaactg acacccatat tgaacatcct tctgaaaaaa ctgatttcat 1620 cgtggccact ccttggactg ttagtatcta tcgatggcta actcacttct aaaccagaag 1680 ttttccgtat tccagtggaa ttgcagaagt attctgccaa aaatagcaaa tctaaggtgc 1740 ttcttactta agctaaatgt tgattgtttc gcgctctcgg aaacatttct tactccggat 1800 ttgaattttt atctacagaa ctacaatatt cttcggaatg atcgagatat ccgagcaggg 1860 ggtgttatgt tgggtataaa gaactgttat tcatttcgtc gtatagatct accaaatgtt 1920 aatccaatcg aagtggttgc ggctgaaatt gagattgaca atatggtgtt ttcaatcgct 1980 tccctgtata ttcctccatc agctagattg atacctagtg tcttaagaac gttagtcagc 2040 aaactttctc atcctgtatt tctactcgga gattttaact cccatggttg tgcgtggggg 2100 gaggatagag atgacagtag agctaatatg atctacgaca tattagacga atttcgccta 2160 tcagtgttaa acacagggaa agtaactcgg atagcctgtc ctccatcttg taacagtaga 2220 gtcgaccttt cactgtgctc atcgaattta tctttaagtt gtgtctggga tgttgttgat 2280 gaccctgctg gtagcgacca tctgccgatc cacatatcat tttgtacaaa ctctaaaagg 2340 gaactagtgg ttccctatga cctgacaaaa catattgatt ggtccaaata ctcctcgaca 2400 attttaacag aaattgaaaa cgtttcagta caggaaccgg atttgcacgg attcctaaca 2460 aacttgattt tgatggcagc cgaaaactcg caaactaagc caatccgttc gtcactttcc 2520 aaaactaagg ctcccacaat ttggtgggac caagagtgta ctgatattgt caagcagagg 2580 tcagatgcct tcaaaatgtt ccgaaagttt ggattgatgc aagatttctt gaattataaa 2640 aagttagagg ccaaaagtaa aatgattttc aaaaataaga aacgtaattt ttggaagcag 2700 ttcgtagaat ctctgaccaa agataccgcc atgcctattt tatggcaaac agcgcgtagg 2760 ctgaggaact ttaatccaag tccttcggat attcgagaat ttgattcgag ttggatcgat 2820 cggtttgctg aaaacgttag cccttgtttt gtgtctactc aaccgccgtt attcattgaa 2880 aatgatacgg tttcaatgga tctagtagag gattttagct tcacagattt agaaaacgct 2940 ctcgtatcgt gtaataattc ttcacctggc atggatggaa tcaaattctc tctattgaaa 3000 aatctaccta ctcctgctaa agagctcctc cttcgttcat acaactggat cttcaaaaac 3060 aaatccattc cagtagcctg gaggaatatt aaggttgttg ctatactcaa agctggtaaa 3120 gaccccggta tatcacagtc gtatcgtcca ataggtttgc tatcctgcac tcgtaaaatt 3180 ctcgaaaaaa tgatcctcgt tagactagaa gtttgggctg aaaaaaatca actactttct 3240 ccgacgcaat atggttttcg ccacgcaaaa agtacaagag attgccaagc attattagct 3300 actgatattc aaatcgcatt tcgatccaaa caagcattga ctgcagtttt ccttgatata 3360 aaaggggcgt acgattcagt gctaattgac attctatgcc acaagctatg caagcttgga 3420 gttccagttc agttagctga ttttttgttc aacctcttgt cgctgaaaac tatgcatttc 3480 tatcagaact ttcaagttaa acatgtccgc aactcatgtc taggattaac acaaggatct 3540 tctctaagcc ctctgcttta caatctctat actagtgata tagatagcag cattgaaagc 3600 ggatgtgttt tagtacaata cgctgatgat gcatgtttat gggctactgc taaagatgtt 3660 aaattttcgg aaggaactct tcagacctct ttaaataacc tcgcaatatg ggcacaaaac 3720 ctcggattgg aattctctgc ttcaaaaacg gaaatcgtgg tattctcgaa taaacgaaca 3780 ttgtctccac tacagttaca tctgtatgac cttcgaataa cccaaagcct ctgttataaa 3840 tatctcggtt tatggttcga ttcaaaatgt acttggagaa ttcatgtcca atatgttgtt 3900 cgaaaatgtc aacgaagaat aaactttcta cgtaccatag cagggacatg gtggggagct 3960 cacccgtcgg atatgataaa attgtacaaa acgactatcc tatcagtaat tgaatacggc 4020 tctttcattt ttcacatgac tgctaaatcc catttcatca aactcgaacg gattcagttc 4080 aagtccctga ggatatgctt gggtttgctg aattctactc atactcagtc ggttgaagtg 4140 atagctggtg tccttccgct gaacattcgg tttctagaat tgaattgcag gtatttagcc 4200 aattgtcaca taacatcttc tagagtgatg atccaattag atgaactgtt cagaatacac 4260 ccacagtcaa atttccttcg atcgtaccga gaatgtcttg tgttgccaac ttcccctagc 4320 tcggacaatt tttatgatta tgaagtgcca gcacgcccat ttgaaatcga cgtcaatacg 4380 tcgttagtcc aaaatctggc gaatttgccg aaaccgttac taccatgtca agctaaacaa 4440 gcataccttg agaatttatc aaacattaac gcagaggaga tgtttttcac ggatggctct 4500 ttaggagttg ataactgtgg attcggagtc tacaattgta acttcgaagc aagcttccag 4560 ttagagtttc cgtgttcaat ctataaagcg gaaatgttag ccgtaaaatt tacagtagat 4620 tatatcaaat ctcttccacc caaaagatac ctcatttgct ccgatagctt aagctcaata 4680 gaaagcctaa aaaacattcg agtttctacg aaaacaagtc taattgcttt gtacattaag 4740 gaaaccgtgg catggcttgc caaacgccaa tatacgatca catttatgtg ggttccgtcg 4800 cattgcggga ttgagggcaa tgaatttgct gatgcccttg caaagcaagg agcccgaaat 4860 ggtgatgtgt atcaacattg cccagaagct cgcgattttt atgcaatcat aagacagaca 4920 gctttgtcgt cttggcaaga tagttggtct aatagtgaac taggtagata ctgttacagt 4980 attctaccca aagttagtct ttcttcttgg ttttcaacat tacagtgcaa tagactgttc 5040 gtaaaaaata tgtctcgcct aatatcgaac cattatactt taaaagcatc tcttttcaga 5100 gtaaacatag tcaactccaa cctttgcgat tgttccctga gctacgaaga cattgatcac 5160 attatttggc aatgtgagaa attcgaagag acaagaccat ccctgttaca acaattggaa 5220 cgagaaaatc gagctccgca acccatcaga gatgttttgg gaaatatgga tatcctctat 5280 atgcaaatcg ttaataactt catcaaaaag tcaaaaattc aactgtgaac tgttcttata 5340 ataataatac accatttttg ttcattttgt gaaaaaaaaa ctaacaaaaa tactatgtgt 5400 aaatagattg taaacaacta acatgaaaaa tcggctccgt taaggtcttc caatgagcct 5460 aataaactat ttagtaaata aaaaaaaaaa aaaaaaa 5497 // ID BEL-637_AA-LTR repbase; DNA; INV; 549 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-637_AA_; KW Pao_Bel_Ele193; BEL-637_AA-I; BEL-637_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-549 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 549 BP; 195 A; 108 C; 88 G; 158 T; 0 other; tgtaacgaca agacccatca ctctggcgac actgccaccg aacaacagat ccacttttca 60 tctgtacacg attgacggtt gatgactagc agaatcaccg tagagataca aaagagaagt 120 gtatcgagca agaatatcgt gcgcaattga agtgagcaaa attatttgat aacgtaaagt 180 gaatttgata acctaaatag aattcaataa ttatatatct actatttaca gttcttaaac 240 ctatcttacg gtaaaactta aacctagctt gcattattca aacaaattgt ttccatataa 300 acgaagtgag tacgatgaac tacatgcaaa cttgtttaga agctaagata ttgtatttac 360 agaagtcaac tgaaattagt ctgtggacgt tagatacgac ccttctaaat ttcgaacagt 420 acaccaaata tgtaagcctt cttgagttat atgcaatccc tatgactaaa tacaatttat 480 ttttagctta aagcgcacaa taccacaacc tcgtgtttgc tactgagact tggtgaatct 540 aaccccaca 549 // ID OK repbase; DNA; INV; 465 BP. XX AC D32083; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE SINE element. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW OK; Repetitive element. XX OS Octopus vulgaris OC Eukaryota; Metazoa; Mollusca; Cephalopoda; Coleoidea; OC Neocoleoidea; Octopodiformes; Octopoda; Incirrata; Octopodidae; OC Octopus. XX RN [1] RP 1-465 RA Ohshima K. and Okada N.; RT "Generality of the tRNA origin of short interspersed repetitive RT elements (SINEs). Characterization of three different RT tRNA-derived retroposons in the octopus."; RL J. Mol. Biol 243(1), 25-37 (1994). XX DR GenBank; D32083; Positions 267 731. XX SQ Sequence 465 BP; 120 A; 97 C; 112 G; 136 T; 0 other; atatatatat aggcgcagga gtggctgtgt ggtaagtagc ttgcttacca accacatggt 60 tccgggttca gtcccactgc gtgcaacttt gggcaagtgc cttctactat agcctcaggc 120 tgaccaaagc cttgtgagtg gatttaatca tcatcatcat cgtcgtttaa cgtccgttct 180 ccatgctagc atgggttgct agacggaaac tgaaagaagc ctgtcgtgta tatgtatata 240 tatatatatg tgtgtgtgtt tgtgtgtctg tgtttgtccc ccccaaaatc gcttgacaac 300 cgatgctggt gtgttcatgt cccccgtaac ttagcggttc agcaaaaggg accgatagaa 360 taagtactag gcttacaaag aataagtcct ggtgtcgatt tgttcgacta atggcggtgc 420 tccagtatgg ccacagtcaa atgactgaaa caagtaaaaa tatat 465 // ID BEL-79_AA-LTR repbase; DNA; INV; 359 BP. XX AC supercont1.172; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-79_AA_; KW BEL-79_AA-I; BEL-79_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-359 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.172; Positions 666217 665859. XX SQ Sequence 359 BP; 96 A; 90 C; 86 G; 87 T; 0 other; tgtacgcgat tagcgtgccc agcggcaagg agttttctga tcgtagccac ctatgtagct 60 acgactgggc aagttaacaa ccacgttagc aacagccacg cgacagatgg gctccgaagt 120 ttctggatag aagccggagt actataaaag gagggcaccc agctcgagcg agtcagtctc 180 aatccggaag ccaacaagtg aacatttaag caactaattg taaataaagt ttagtattag 240 cagcagtata gtgtattgtg tgttcctgta ttccctgccc tggaattggt cctagttttc 300 cctttccacc agttcatcgc tccgcctaca aacgttgggt tcacggccga gcccaaaca 359 // ID BflSINE1 repbase; DNA; INV; 383 BP. XX AC . XX DT 07-JUL-2006 (Rel. 11.06, Created) DT 07-JUL-2006 (Rel. 11.06, Last updated, Version 1) XX DE Amphioxus DeuSINE element - consensus. XX KW SINE3/5S; SINE; Non-LTR Retrotransposon; Transposable Element; KW SINE3; Interspersed repeat; DeuSINE; conserved; BflSINE1; CNE. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-383 RA Nishihara H., Smit A.F. and Okada N.; RT "Functional noncoding sequences derived from SINEs in the RT mammalian genome."; RL Genome Res 16(7), 864-874 (2006). XX DR [1] (Consensus) XX SQ Sequence 383 BP; 97 A; 102 C; 101 G; 83 T; 0 other; tgtgagaccg catggcgcag cggcagcgtg ttcggctcgg gaccgagagg tcccgagtcc 60 gaatcctgcc gtgtcaccga tcttgtgccc ttgggaaagg cactttacac gactttcctc 120 actttactca ggtgaaaatg agtacctagc ttcggctagg gccgtccctc ggataggacg 180 ttaaatggag gtcccgtgtc tggggagagc cataccccag gcacgttaaa gaacccacca 240 cacgtgcgat ggtgcggtgt gagatggtgc aaatccttcc gtctagaatt ggtgattctc 300 tacaaatcac ccggactcca ggaagaatag tgacatatca gtcactaatg gagatctgta 360 taaccaaacc aaaccaaacc taa 383 // ID BEL-609_AA-I repbase; DNA; INV; 6412 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-609_AA_; KW BEL-609_AA-LTR; Pao_Bel_Ele71; BEL-609_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6412 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1964-2533] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1010..2995 FT /product="BEL-609_AA-I_2p" FT /translation="MKTVVSSHTLPIGRKFLWSDSNTVLAWLRADQRRFHQ FT FVAFRIGEILESTDITDWRKVPSKWNVSDEATKWGSGPNASESSRWLRGPE FT FLYRDESEWPTQEEECSTNEEVRATVAVHRVNVSGTVQFERFSRWERLVRA FT VGYVWRFQSNCRKKKDQLPYPETVHLTSKELQLAEQTVLKLVQFEAFPQEY FT ATLKSMQEEPSHPPKQVEKTSPIYKMNPCIKDDGLIRKDGRIGAAAIVSDD FT AKFPVILPKEHYVTGLIINEYHRRFAHANNETIVNELRQRFHIPKLRVAVK FT KVAGNCGYCKVRRATPQPPRMGPLPEARLTPYIRAFTFTGLDYFGPVNVRI FT GRKNVKRWAALFTCLTTRAVHLEVAFTLSTESCKLAVRRFIARRGAPLEIY FT SDQGLNFQGARKELQEAVRKMNQELASTFTNAATEWKLNPPYAPHMGGIWE FT RLVRSVKNALAAMETYRNPDEETFLTALAEAESIVNTRPLTYLPLDSAESE FT ALTPNHFLLHSSNGVCQPAVRPVDAKSALLANWGHVQVMLDRFWTRWVKEY FT LPVITRQAKWFGETKPVRVGDLVIVVDEARRNSWERGVILKVHPGKDGRIR FT SADVRTKGGVLLRPVTKLAVLDVLDQPGMAEGRTSSNTGRGMCATVQDPSL FT SATAIGSTDPTIN" FT CDS 3478..6411 FT /product="BEL-609_AA-I_1p" FT /translation="MSSGDPVADSREQRDADQQSQCIICNQPNHADRKMVQ FT CDGCDHWFHFRCVGVNDSIEDPERSFRCAACTVPDPPGSTVSTSASVREAR FT IQLEVQRLADEKRLQEKMHAEREKQERDLLEMTLRLERERRDKAMKEKFAL FT EKDYLQRKYDLLHTQLDDDGDAVSVRSRYQPRSERVVDWITSNPAVTMSAN FT TGTIPTGLTSQLEPITHSMQQTSASAITNANRVHPAPGFISKVPPTAPSTT FT TSDSTPLPCYSNASITQAAMQNPVASVQQNFPSTSIDTSNMFVISPTIVST FT ASLSLPPLPTLPASSVPPYRHSFVPASFSSQPPQLQAIPSVPSSSLYRIGT FT SHTAVASCLPSGGIPSVWQQSTYVSMPPPTHHSTPTTVASERRPVGEQHLY FT NTQPVRSQGDQHNSASVPVSRISMHPGLGENASSVGQYAQPTAVVSNTYGL FT HGMQSSYQNYLPNYWFPGANNAPLQILPQQENIASVARPALHLGGNTSQSG FT SQGIQINQNPVLSSQFVPGGPQRASLHGVGSNVPMLQEPNISGFPSAPLDY FT APNTQQLAARHVVSRELPIFDGNPAEWPMFWSSYTTSTQMCGYTGAENLLR FT LQRCLKGEARKAVNSFLLHPANVDDIMSTLRTLYGRPEAIINTLLNDVRST FT PTPKPEKLETLVNFGLVVRNLCAHLVSAGQEMHLANPILLQELVDKLPANV FT KLGWAMHKQSVAVADLRTFAEYMNVIINAASSVSSGITEVTRAERQKGKAF FT VNSVRLENNSKSDQRTEKSSSSGKEFVESKASTAPKVRPCAVCQVHGHKPK FT DCSVFKTKSLGERWKAVQEAHLCKRCLYPHGKWPCKAPVCGSDGCQENHHK FT FLHPGNPQAGNNQGNSAPASTSGVVTVHQHLHQRILFRILPVCLHANGKSV FT NTFAFLDGGSDSTLLESSIARKLGVTGAVSPLCMQWTNGVRRTEEDSRRVQ FT LEISGTSGKRFTISGVHTVSSLD" XX SQ Sequence 6412 BP; 1762 A; 1514 C; 1645 G; 1490 T; 1 other; ctgttaaagg agtttcgctc aattccgttc tactaaaagg acctgatctg ctgcagtcac 60 taccgatggt actttgtcgc tttcggcaga gagaggtggc cattaacgct gacatcaagg 120 agatgttcca ccaagttctc attagaccgg aagatcgaca ggctcagcga tttctgtggc 180 gaaataatcc atcgctaccg gtggaggtgt tcgtgatgga cgtcgccata tttggatcta 240 catgttcacc aagttctgcg cagtttgcga aaaatcgaaa cgcccaagaa tttgccaggg 300 agtttccgaa agcctcaacc gcagttctgg aaaaccacta cgtcgacgac tacctggata 360 gcgttgatac agtcgacgaa gcggtggatc ttgcaacgga agtgaagtca ctacacgaac 420 gcgctgggtt tgaactgcgg cactggttgt ccaacaaaac tgatgtcttg gaacagatcg 480 gcgaagaatc ctctgtcaca gcgaaaaact tcgtgatggg gaaggacagc ggcgctgaac 540 gtgtgctggg gatgatatgg ttgccaaagg aggacgtttt tactttcgca atacagtttc 600 gagaagatct gaagcgattg ctcgacggaa atgcgttccc cacgaagaga gagctcctaa 660 gcctggtcat gagcatcttt gatccgctcg ggttggtggc aaacttcgta gttcacggca 720 aaatcctcgt acaagaggtg tggaggtcct agcactgcac agcttagctc aagtacaaat 780 cgatcggtgt tatttcccgg gttacgacgc agagacgctg aacagtctgg aattgcacgt 840 tttcgtggat gccagcttga atgcctacgc tgcggcggcc tacttcagag ttgtccagcg 900 tgacacggta cgctgcagtc tcgtttcatc aaaaacaaag gtagcgcctc tgaagcagct 960 ctccgtaccc aggttggagt tacaagccgc cgtccttgga actaggttga tgaaaacggt 1020 tgtttcaagt cacacccttc ccatcggtag gaaatttcta tggagcgatt cgaacaccgt 1080 gttagcctgg ctcagagctg atcagcgaag atttcaccaa tttgtggctt ttaggatcgg 1140 agaaatcctt gagtccaccg acattaccga ctggaggaag gttccatcaa aatggaacgt 1200 atcagacgag gccactaaat ggggcagtgg tccgaatgct tcggagagtt caagatggct 1260 tcgtgggccg gaatttctct accgagatga atctgaatgg cctacgcagg aagaagaatg 1320 cagtactaat gaagaggtgc gggccacagt ggccgttcat cgtgtcaacg tcagtggaac 1380 tgtacagttt gagcgatttt ccagatggga acggcttgtg cgagcagtcg gatatgtgtg 1440 gcggttccag agtaactgcc gaaagaagaa ggaccagttg ccgtatccgg aaacagtaca 1500 tctcaccagc aaggagttgc agttggcaga acaaactgtg ttgaagctgg ttcagtttga 1560 agcatttcca caagagtatg ctacgttgaa gtcgatgcaa gaagaacctt cacatccgcc 1620 gaaacaagtt gagaaaacca gcccaatcta caagatgaat ccttgcataa aagacgatgg 1680 actgattcgc aaagatggcc gaattggtgc agctgctatc gtttcggatg atgcaaagtt 1740 tccagtgata cttccgaagg agcactacgt caccggactg atcatcaatg aataccatcg 1800 gagatttgcg cacgctaaca acgagacgat tgtaaatgag ctcagacaac ggtttcacat 1860 accgaagctc agagttgctg taaagaaagt agcgggaaac tgcggctact gtaaggtgcg 1920 tagagcgact cctcaaccac ctcgcatggg cccacttcct gaagcgcgct tgactccata 1980 catcagggca ttcacgttca ccggtctaga ctactttgga cccgtaaatg tgcggattgg 2040 gcgcaaaaat gtgaaacgct gggcggcatt attcacgtgc ctcactacta gggcggtgca 2100 cctagaggtg gcctttactc tgagtacgga gtcgtgtaaa ctagctgtgc gacgattcat 2160 cgcccgccgt ggtgcgcctc tggagatata ctccgatcaa ggacttaact tccaaggggc 2220 gcggaaggag ttacaggaag ccgtccggaa gatgaaccag gagctggcat caaccttcac 2280 aaacgcagcg acagagtgga agctgaatcc gccgtatgcc ccccatatgg gcggtatatg 2340 ggaaagacta gtgcggtctg tcaaaaatgc tttggccgcc atggagacct acagaaaccc 2400 agacgaagag accttcttaa ccgcattagc agaagctgaa tccatcgtga atacacgtcc 2460 actcacctac ctaccactgg actcagctga gagcgaagca ctcacgccta accatttttt 2520 gctacatagc tcgaatggtg tgtgccagcc agcagtaagg cctgtagatg ccaagtcggc 2580 acttcttgcc aactggggtc acgtgcaagt gatgttggac cgtttctgga ccagatgggt 2640 taaagaatac ctaccggtca tcactcggca ggctaagtgg ttcggagaaa cgaaaccagt 2700 acgagttgga gatttggtta ttgtggtgga cgaggcaaga cgaaatagtt gggagcgtgg 2760 cgttatcctg aaagtgcatc ctggaaagga tggccgtata cgcagcgcag atgttaggac 2820 caaaggaggt gtgcttcttc ggccggtaac caaactggcg gtcttggacg tgctcgatca 2880 acctggtatg gctgaaggca gaacttcaag caatacgggg cggggtatgt gcgcaacggt 2940 acaagacccc tcgttatcgg ctaccgctat cggttcaacc gatccgacaa tcaattgaca 3000 actgcagcac gaaacgtcat gggggatgat tggggaaaaa ttgagataac ctattgttaa 3060 aaacgaatag cagacatcta cacaaagact ctctaagcta gattgaatct cataaatttg 3120 ttattctaaa attgtttatt ctaaagttgc taaaatagaa tttgttccta aaaataagta 3180 agtattgctt aatctattaa aatgaatttg attctaattg taaaacttaa tctatgaagg 3240 gacatacaca acaactacac atttgttact agtagacgga cagttaacgt agaagacaac 3300 aaatgtaagt aaaataaatt aaaacatgcg acatataact aaataaaact aaattacagc 3360 ttaaagccta ctcaagcaaa aaaacgggtt tgctattaag aagttgacac gtgaccgtct 3420 tcgtaacaaa atctttaaga tttgtttggt atcgttatcg gcgtggatta cataaccatg 3480 tctagtggtg atcccgttgc cgattcacgc gagcagcgtg atgcagatca gcaaagtcaa 3540 tgcattatct gcaatcaacc gaaccatgcc gatcgtaaaa tggtccagtg tgatggatgc 3600 gatcactggt tccacttcag atgcgtcggt gtgaacgaca gcatcgaaga tccggaaaga 3660 agcttcagat gtgcggcwtg tactgttccc gatcctccgg gttcaacggt atcgacatca 3720 gcgagtgtcc gtgaagcccg gatccagcta gaagtacagc gactggcgga cgaaaaaagg 3780 cttcaggaaa agatgcacgc ggaaagggag aagcaggagc gagatctgct agagatgact 3840 ttgcgtctcg aacgagagcg gagggacaag gcgatgaagg aaaagttcgc cctggagaaa 3900 gattatctcc agaggaaata tgacctccta catacgcagt tggatgacga cggggacgca 3960 gttagtgttc gtagccgtta ccaaccaaga tcagaaaggg ttgtggattg gattaccagt 4020 aatccggctg tgaccatgag tgccaacact gggactattc ccactggact cacatcgcaa 4080 ctggaaccaa tcacacactc tatgcagcaa acatcggcat cagcgatcac taatgcgaat 4140 cgtgttcacc cagctcccgg tttcatatcc aaagtaccac cgaccgcgcc gagtacgact 4200 acgagtgatt cgactcccct gccatgctat tcgaacgcat cgatcacgca ggcagcaatg 4260 caaaacccag ttgcgagtgt gcaacaaaat ttcccatcca catcaatcga caccagcaat 4320 atgtttgtga tatcaccaac tattgtgtct acggcatcgc tctcgttgcc gccactgccc 4380 acgttacccg ctagctctgt acccccctac aggcactctt tcgtaccggc gtcgttctca 4440 tctcaaccac cacaactgca agcaattcca tctgtgccat catcatcgtt gtatcggata 4500 ggtacatcac acactgctgt ggcgtcgtgt ttaccatcag gaggaattcc atcagtgtgg 4560 caacaatcga cgtacgttag tatgccacca ccaacccacc attcaacacc aacgactgtg 4620 gcttcggaga gacgtccagt aggagagcaa catctgtaca atacacagcc agttcgttct 4680 cagggagatc agcacaactc tgcatcagtc ccggtcagtc gaatcagtat gcatccaggt 4740 cttggtgaga atgctagctc tgtaggtcag tatgcacaac ctacggcagt ggtaagtaat 4800 acctatgggt tacatggcat gcaaagttcg taccaaaatt atttaccaaa ttattggttc 4860 ccgggagcaa acaatgctcc attgcaaata ttgccacagc aagaaaatat agcaagcgta 4920 gctaggccag cattacattt gggggggaac acaagtcagt caggctctca gggtatacaa 4980 ataaatcaaa acccagtcct ttcgtctcaa tttgttcctg ggggtcccca aagggcttca 5040 ctgcacggtg tggggtcaaa tgttccgatg ttacaagagc cgaatattag tggttttcct 5100 tcggctccac tggattacgc gccaaacact cagcagctcg cagctcgtca tgtggtgtca 5160 agagagcttc caatttttga tgggaatcct gcggagtggc caatgttctg gagtagctac 5220 actacttcaa cgcagatgtg tggttacacg ggagctgaga atcttcttcg tctgcagcgc 5280 tgcttgaagg gcgaagcgag aaaagcggtg aatagtttcc tgttgcaccc ggcaaacgtg 5340 gacgacatta tgtcaacctt gcgcacgctg tatgggcgcc cggaggccat cataaacacc 5400 ttgctaaacg atgtcagaag cactccaacg cccaaaccgg aaaaattgga aaccctggtg 5460 aactttggat tggtggtgcg caacctttgt gcacatttag tgtccgcagg ccaagagatg 5520 cacttggcca acccgatcct tcttcaggag ctggtggata agctgccggc caatgtgaaa 5580 ctgggatggg ctatgcacaa acagtcagta gcagtggcag atcttcgaac gtttgctgag 5640 tatatgaatg tgatcatcaa cgcagcgagc agtgtatcta gcggtattac cgaggttaca 5700 agagcagagc ggcagaaggg taaagccttc gtgaactccg tccgtttgga gaacaacagt 5760 aagagcgatc agcgaacgga gaaaagcagc agtagcggaa aagagttcgt ggaatcgaag 5820 gcatctacag ctcccaaggt gcggccgtgt gcagtctgcc aagttcatgg ccataagccg 5880 aaggattgct ctgttttcaa gacgaaatct ctcggtgaac gctggaaagc agtacaagaa 5940 gcacatttgt gtaagcgctg tttatatcct cacggcaagt ggccgtgtaa ggcacctgtt 6000 tgtggttccg atggttgcca ggagaatcat cacaagtttc tccaccctgg taatccccaa 6060 gctggaaaca accaaggaaa ttctgcacca gcttcgactt ctggtgtagt tactgttcat 6120 cagcatctgc atcagagaat tctgtttcgg atcttgccag tttgccttca cgctaacggc 6180 aaatcagtga atacgttcgc ctttttggat ggaggctctg actcaacgct gttagaaagt 6240 tctatcgcca ggaaacttgg ggtcaccgga gcagtctcac ctctctgtat gcagtggacg 6300 aatggtgtta gacggacaga agaagattcg aggcgtgtac agctagagat ttccgggacc 6360 agcggtaagc gtttcacgat ttctggagtg cataccgtgt ccagcttgga tt 6412 // ID piggyBac-1_AAe repbase; DNA; INV; 3322 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A piggyBac DNA transposon family from Aedes aegypti. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3322 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1314-1314 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 29 sequences with >92% CC identity. TTAA TSDs. XX FH Key Location/Qualifiers FT CDS 1243..3048 FT /product="piggyBac-1_AAe_1p" FT /note="transposase." FT /translation="MIHFTDTDDLDLLALTFEGDVSEVEGIDVSDDEDEIE FT AFHANTESPKCVKPQKDTPVKSGTKVRRNQNKTSAKQAASNQQGKKKGVST FT RSKSKKKGTPPKWHEDESVCFDCIEPVVKSSKPPTKPMTPYQYFGLFIDSE FT IISDICEQTNLYSSQQDETPIDVTSGDIEQHIGQLLLMGVTKVPSYRLHWG FT STTRYAPIADVMPRNKFETIKRCLHFNDNTKIKQRDEEGYDKLFKVRPFID FT ALRRNFLKIEPTPNQSIDEIMIPSKAASPLRQYNKNKPHRFGIKVEGRASS FT DGILHDFSIYGGKTNNEPSGAWGISGDVVIKLIDTLPQNVPYRIFADNWFS FT SYALVKEMKSRGLEYTGTVRQNRIPGFEMKKNLKAEGRGSFACNVSNDNIV FT VVTWMDNKPINLISSCFGVQPIDEVKRWSVSDKTYKSIPRPLVVREYNCYM FT GGIDLNDFLVALYRTQPGTKKYYMRIFYHLLDVSVVNAWLLYRRHMKQTGQ FT EHMTLLNFRIDVANSLIQYGKKIVRRRGRPTNQPIQKRQREVASVGPSLEA FT RYDGLEHWPVEDRRERCVLCKDRGSFSSFKCTKCDVCLCIKKGKNCFTAFH FT LPPD" XX SQ Sequence 3322 BP; 1119 A; 628 C; 673 G; 902 T; 0 other; ccctttcctt cccatggtag cacaggtgat ccaccacttt gcataggtcg tatacaaact 60 ggacaagtta gatcataccc attttttcac cacatgttca catatgtttc atgaactact 120 gtacaaagtt tcactagaaa tagacacttg gatttcaatt taaagcataa cgactgaagt 180 aaagtcggca ttttttcaaa tttaaataaa gtggattttc aaacatccat attcagtaat 240 atgaacataa ttttcaccaa acaaaaataa gaacaagctc gatttagtta tttttccaag 300 attcaataaa gtttttataa tattctttac acaaaactat tgattttgag gaaaacgatg 360 tggatcactg atgatccatt gggaacaaaa cgagattttc atcgattcta ttctaaccta 420 tctttctgaa aacgaaactg tcatcacatt tgtttacgtc tcagaaagtt gttctaagtt 480 cctcgtttcg aacgtgtttt aagtgaaaag tgatttttaa agaaacaata attgtttatg 540 gacgctaaaa tggaaacgaa tgaaccagaa aaggctacac tcacggatga agaggatttc 600 ttcggatttg aagcagatga tgaaatattt ccgggtatca cagacgagga ggatgcttct 660 ggactgaaat tgaaacgccg ccgcaagagt aaattatctt ccgacaccaa tcttcctaca 720 aaatggaccg aaaggatgcc ctgtggaaca atccaccgta gatcccgaga aggtgaagaa 780 agaaagaatg tcccgttcga agatacttcg tacgaacacg aacaagtacc cagtgatgat 840 ccgattccgc tggttgctgg gagcgatggt agtatttgag tttttaaaat ttttatcaat 900 cgctaataat acaggtattt gtagattgca acaataagcc aataacattc gagggagata 960 cttccgtgct tgaagaactt agcgaaccgg agaaaattat tgaaaccgga acgagccgac 1020 ttcgcacatc tcttgaaact gccgatggac acaaaggtac cactgagcaa tcaacgattg 1080 aaaccgataa cttcgatgac gatagtagtt ctgaagattt gttcgaagat cgtgatgaaa 1140 catatgattt ttctaaatta aaaccgaaaa ctacaacaga gctggatgag tgggatggta 1200 gtatatcaaa aattgattta ttacgaatct ataatataaa taatgattca tttcacagat 1260 actgatgatc ttgatttgct tgcattgacc tttgagggag acgtatctga agttgaagga 1320 atcgatgtat cggacgatga agatgaaatt gaggctttcc atgcaaacac ggaatcgcca 1380 aagtgtgtga aaccacagaa ggatacacca gtcaaatccg ggacaaaagt tcgtcgtaac 1440 caaaataaaa catcagcaaa gcaagctgca tccaaccagc aaggaaagaa aaagggtgtt 1500 tctactagat cgaaatccaa aaagaagggc actccaccta agtggcatga ggatgaaagt 1560 gtttgtttcg attgcattga accggttgta aaatcctcta aaccaccgac caagccaatg 1620 actccttacc agtacttcgg gttgttcata gattctgaaa ttatttctga tatctgtgaa 1680 caaacaaatc tttattcaag tcagcaggat gaaaccccaa tcgatgtaac ttcaggagat 1740 atcgagcagc atattggaca gctactgctt atgggtgtaa cgaaggtgcc atcgtatcga 1800 ttgcattggg gttctactac cagatatgcc ccgattgctg atgtaatgcc acggaacaag 1860 tttgagacca tcaaacggtg cttgcatttc aacgacaaca caaaaatcaa gcagcgagat 1920 gaagaaggat acgataagtt gttcaaggta cgcccgttca tcgatgctct ccggcgtaat 1980 tttttgaaaa ttgaacctac acctaatcaa tcgatagacg agatcatgat accttcaaaa 2040 gcagcatctc ctctacgaca atataataaa aacaaacctc accgattcgg tataaaagta 2100 gaaggacgag caagttccga tggaattttg cacgattttt ctatttacgg cggcaaaaca 2160 aacaacgaac cctcaggcgc atggggcata agtggcgatg ttgtaatcaa gctgattgac 2220 acactaccac agaacgttcc gtatcgaatt tttgcggata actggttttc gtcgtatgca 2280 ttagtaaaag agatgaaatc tcgtggactg gagtacacag gaaccgtaag acaaaacagg 2340 attccaggat tcgaaatgaa aaagaatttg aaagctgaag gcagaggctc atttgcatgc 2400 aatgtgtcca atgacaacat tgtcgttgta acgtggatgg acaataagcc gattaatctt 2460 atatcatcct gcttcggagt gcagccgata gatgaagtaa agcgttggtc tgtatcggat 2520 aagacttata aaagtatccc acgacctcta gttgttcggg aatacaactg ctacatggga 2580 ggcatcgatc tgaatgattt cttggttgct ctatacagaa cacaaccagg caccaagaaa 2640 tactacatga gaatttttta tcatttactg gatgtatctg ttgtgaatgc ctggttgttg 2700 tataggcgcc acatgaaaca aacgggacag gaacatatga ctcttctcaa tttccgaata 2760 gatgttgcca acagtctgat tcaatatgga aagaaaattg taagacgccg aggacgacca 2820 acaaaccaac ctatacaaaa acgtcagcga gaagtagcat ctgttgggcc gtctctagaa 2880 gcacggtatg atggactaga gcattggccc gttgaagacc gtcgagaacg ttgcgttctg 2940 tgcaaagatc gaggatcatt cagttcattt aaatgcacta agtgtgatgt ttgcttgtgc 3000 atcaagaaag gcaaaaactg cttcacagcc tttcatttgc caccagatta aatcgacgag 3060 aggcgagaag agcacaactt acatataaaa agtatgattt tgcgtaaaaa tgaaaatata 3120 aacagaaaca aaattgattt ctctttttta ctctagttgc agttatcttc ccaatggatc 3180 acaggtgatc catcataaaa attatctatt ttagtttcat tcgaatattt gagtaaaact 3240 ttacatcttt tgttctggta atttatccag atatcaaatt ttctattttt taactacaaa 3300 aaacgtcgtg ggaaagaaag gg 3322 // ID Gypsy7-NVi_LTR repbase; DNA; INV; 410 BP. XX AC AAZX01000892; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy7-NVi; KW Gypsy7-NVi_I; Gypsy7-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-410 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1138-1138 (2007). XX DR Genome; AAZX01000892; Positions 1497 1088. XX SQ Sequence 410 BP; 115 A; 102 C; 103 G; 90 T; 0 other; tgacaacttt cccgattcgc atttgaaata ataatttcaa atgtaggatc gagaaagttc 60 tacacgtata aagggtacag cgagactcat ggccacaaaa gtatcaagtg gctaagaatc 120 tcgctaacct aaaagacttt ccccggccaa cggctagagg cgcacgtgcg gagctacgtt 180 gcagtttcgg cgcggcaact gattggccag gcccccgtaa ggtctaaatg agtggcaaca 240 agcgtcgaca ctgtaacgcg gctaagggaa gtaagctggc gacgtcgtgg gaggtccaga 300 acctctttcg agcgctctcg aaccacgctg ctctctcact ccgtgatcaa gcaacgaagc 360 cggcggaccc attagaaaga tttcttttag agcttacatt ttgtagaaca 410 // ID Gypsy-194_AA-I repbase; DNA; INV; 4507 BP. XX AC supercont1.67; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-194_AA_; KW Gypsy-194_AA-LTR; Gypsy-194_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4507 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.67; Positions 1682760 1678254. XX CC 'TATAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 423..4433 FT /product="Gypsy-194_AA-I_1p" FT /translation="MFNVPEERHEDYLLHYMGATTYDLLCDHISPIEPETK FT SYAEIVTTLEAFFDQEPLEMVEVWKFRNRTQKEGETMKEFITELQRDAKFC FT KFGDYLKKELRNQLVFGMRSKRIRSRLIEEKELTFEKALEIALSMEASGEG FT AEIFEKRTQDVNYTERRQVKVTHTAKRKCYRCGSEAHLASTCRHKETVCNF FT CKKLGHLQRVCLKFQSLGEQSKQGKESKLMRKKMQNNLIDEGDNSDDVREI FT EESDVDEILTLEICKVDDRSKSLSKVILPLEVNKRVIKFEVDCGSPVSLIG FT LNDKEVYLGELPLFKTDVELMSYCGNKIAVCGYVKAEVQLSGDSKSLRLFV FT VNTERHPLLGREWMRKLNVDWNEIIRNPVPCVGAISVRNSGKPLSGAVRDL FT IEVFPRVFDSSIGTIRGVRAALHFKPNSKPVFLKARTIPFAIRDTVEREIT FT SMAESGILKKVERSAWATPVVPVMKSADRVRLCGDYKITVNKCLVVDEHPL FT PTIDELFSNMAGGQKFTKLDLAQAYLQMEVREEDREALTLNTHLGLYQPTR FT LMYGISSAPAIFQREISQLLGDIPGISVFLDDIKVTGPDNETHLQRLKMVL FT QRLDEHGMRLNVAKCEFFADRIEYCGFVIDREGIHKMKTKVDAIQQMPIPC FT NREQVRAFVGLINYYGRFMKNLSTQIYPINNLLKDNVPFRWDDECNKAFLW FT VKKEMQSDRVLVHYDPKLPLVLATDASLYGVGAVLSHLYPDGTERPILYAS FT QTLNRSQQNYIQVDKEAYAIIFGVKKFYQYLYGRKFILVTDNKAVTQILAP FT HKGLPTLSAVRMQHYAVFLESFNYEIRFRSSKENANADAMSRLPIQDKVNQ FT HQIEEADLVEVKQIETLPVTAEVLAEGTKEDSNVKMLMQGLKVGRSVEGCD FT RFGINQSEFSVQKGCLMRGIRVYIPPKLRRRVLEELHTGHFGVSRMKSLAR FT SYCWWETVDSDIEELSRDCGDCARVRKNPVKVAPHCWEKASEPFQRIHIDF FT AGPFLGLYFFVIVDSFTKWPEVKIVPDMTTDTTIDRLREYFVTYGVPSIIV FT SDRGVQFTSDQFQTFLKRNNITHKMGAPYHPATNGQAERFVQTFKDKLKAL FT KCERKDVQFELYKILMAYRRTVHPSTDKSPSMLVFGRQMKSRLDLLVPTSV FT PKDPSPGNGACVRSFKINERVAARDFLGSSKWQFGKAAERIGKLHYMIELD FT DGRIWKRHVDQMRVGPVERNPNIESSTECTDRLRRSHLVYYSNQIPSVENS FT DTVIPQQTTVVQTPNKTPTQAVDESAVSSRPVVMKIPSESRVSAPIIEIPI FT QCNIETPRRSTRMRNPPKRLDDLCD" XX SQ Sequence 4507 BP; 1333 A; 908 C; 1149 G; 1117 T; 0 other; gtttggcgac gataggagaa aaaacctacg caggaacaga ggaatcgacg acattgcaag 60 tttccgctct cggaggtttc atcaagagga cgaattccag attgggaata tcagcgggaa 120 gttggaaggt accaaccttg ctggcgttac gaaagggggc ctttccggac gaagctggtg 180 tgagtacgag cgatcggctg ttactgtatt agtgctacac tgctgctgct gctgatgatt 240 gaatgctttg gttgctgcga acgacgggtg tgttgatgta aacaatgatg gaagacgata 300 gagaagcggc tgtggctgtt ccggctacag caccagtcgc gcctaatttc tccatggaac 360 cgttcgacaa agaaaaaggc aaatgagcga gatgggtgaa acgtttggaa ggaggatttc 420 gtatgttcaa tgtgccggag gaacgccacg aagactactt gctacattac atgggagcta 480 cgacatacga cctactttgt gatcatatca gcccgatcga accggagacg aagagttacg 540 ctgaaattgt tactacgttg gaagcattct tcgatcagga accacttgag atggtagagg 600 tatggaagtt tcgcaatcga acgcaaaagg aaggagaaac catgaaggag tttataacgg 660 aattgcaacg cgatgcgaag ttttgtaaat ttggagacta tctgaagaag gaattgcgga 720 accagctggt cttcggtatg cgatcgaaga gaattcggtc acgactcatc gaggagaaag 780 agctgacgtt tgagaaggcg ttggagattg cgctctcgat ggaagcatcg ggagagggcg 840 cggaaatttt cgagaagcga acacaagatg taaactacac tgagaggcga caagtgaagg 900 taacacacac tgcaaaaagg aaatgctacc gatgtggaag tgaggcccac ttggcgagta 960 cttgtcgtca taaagaaacc gtttgcaatt tctgcaaaaa gttaggacat ctccagcggg 1020 tgtgcctcaa attccaatcg ctgggagaac aatcaaaaca gggcaaagaa agcaaactta 1080 tgagaaagaa aatgcaaaat aatttgatag atgaggggga caacagtgac gatgtaaggg 1140 aaattgagga gtcagatgtt gatgagatcc ttactcttga gatttgcaaa gttgatgaca 1200 ggtcaaagtc tttgtcgaaa gttatcttac cgttagaggt aaataaaagg gtcatcaaat 1260 ttgaggtcga ctgtgggtct ccggtttctc tcatcggcct gaacgacaag gaagtttacc 1320 tgggcgagtt accattgttc aagactgacg tagagttgat gagctactgt ggcaacaaaa 1380 ttgcagtttg tgggtatgtg aaagcggaag tgcagctgag cggtgactcg aaatcgttgc 1440 gattgtttgt ggtcaacaca gaacgtcatc ctttgttagg tcgtgagtgg atgcggaagt 1500 tgaacgtcga ttggaatgag atcattcgaa atccagtgcc ttgtgtagga gctatttcgg 1560 ttcgcaactc tggtaaacca ttatctggtg ctgtgcggga tcttatcgaa gtttttcctc 1620 gggtgtttga ctcgtcgatt ggtactattc gaggcgttag ggctgcattg cacttcaagc 1680 caaacagcaa accagtcttc ttgaaagctc gtaccattcc cttcgccatt cgtgacactg 1740 tggagcgtga aataactagc atggcagaaa gcggaatact caagaaagtg gaacgtagtg 1800 cctgggccac tccggttgtt ccagtcatga aatcggcgga tcgtgtacgt ttatgcggag 1860 actacaagat aacggtcaac aaatgtctcg tggttgacga acaccctttg ccgaccattg 1920 atgagctgtt ttcgaacatg gctggcggac agaaatttac aaagttggat ttggcgcagg 1980 cctatctgca aatggaagtt cgagaggagg atcgtgaagc cttaaccttg aatacgcatt 2040 tgggtctcta tcaaccgaca cgtttaatgt acggcatttc ttccgcacca gccattttcc 2100 agcgtgagat ttctcagctg ctaggagata tccccggtat ttcagtattc ctagatgata 2160 ttaaggtcac aggtccagac aatgaaacac acctgcaacg attgaaaatg gttctgcaac 2220 ggttggacga acatggtatg cggttgaacg tagcaaaatg tgaattcttt gcggaccgca 2280 tcgaatactg tgggtttgtg attgaccgag aaggaataca caagatgaaa accaaagtag 2340 acgctattca gcagatgccg ataccatgca atcgtgaaca agttagagct tttgtcggtt 2400 taataaacta ttatggtcga ttcatgaaga atttaagcac tcaaatttac cccatcaaca 2460 acctgctgaa ggataacgta ccgttccgtt gggatgatga atgcaataaa gcgtttttgt 2520 gggtcaaaaa ggagatgcag tcggatagag ttttggtaca ttacgaccct aaacttccat 2580 tggtgttggc aactgatgca tcactttatg gtgttggtgc cgttttaagc catttgtatc 2640 cggatggtac cgaaagacca attctctatg catctcaaac cctgaacaga tcgcaacaaa 2700 actacataca agtcgataaa gaggcgtatg cgatcatatt tggggtcaaa aagttttatc 2760 agtatctcta cggccgcaaa ttcatcctcg taactgataa caaagccgtt actcaaatct 2820 tagcacccca caaaggactt ccaacgctaa gtgccgtgcg aatgcagcac tatgcagtct 2880 tcttggagtc gtttaattat gagattcgat tccgttcttc gaaagaaaat gcgaatgcgg 2940 acgcaatgtc cagattgccg atccaagata aggtaaacca gcatcaaatt gaggaagcgg 3000 acctggtcga agtgaaacag attgaaactt tacctgtaac agcagaagtt ttggcggaag 3060 gcactaagga ggactctaac gtaaaaatgt tgatgcaagg cttgaaagtg ggacgaagcg 3120 ttgaaggctg cgacagattt ggtatcaatc aaagtgagtt ctcggtccag aaaggttgtc 3180 tgatgagagg tatacgagtc tacattcctc ctaagctacg acggcgagta ctagaagagt 3240 tgcacacggg acacttcggc gtatccagaa tgaagtcttt ggcgcgatcg tactgctggt 3300 gggagactgt cgacagtgac atagaggaat tatcaagaga ctgtggtgat tgtgccagag 3360 ttcggaaaaa tcctgtaaag gttgctccac attgctggga aaaagcttca gaaccattcc 3420 aacgaatcca tattgatttt gcggggccgt ttctaggact gtactttttc gtgatcgtgg 3480 attcatttac aaaatggcca gaagtcaaga tcgttcccga catgacaacg gataccacaa 3540 ttgaccgtct gcgtgaatac ttcgtcacat acggagttcc atcgattatc gtcagtgatc 3600 gtggggttca gttcacctca gatcaattcc agacgtttct gaagcgtaac aatatcacgc 3660 acaaaatggg agccccgtat cacccagcta caaacggtca agccgaacgg tttgttcaaa 3720 cttttaagga caaacttaag gcattaaagt gtgaacgaaa ggatgtgcaa ttcgagctgt 3780 acaaaattct gatggcgtac cgtagaaccg tacatccttc aacggataaa tcaccatcta 3840 tgctagtatt tgggaggcaa atgaaatcca gactcgatct gttggttcct accagcgttc 3900 ctaaggatcc atctcccggg aatggagcgt gtgtgcgaag cttcaagatc aatgaacgag 3960 ttgcagcacg ggactttttg ggatcctcta aatggcaatt cggtaaagca gctgaacgaa 4020 ttggaaaact ccattatatg atcgagttgg acgatggcag aatctggaaa cggcatgtgg 4080 atcagatgag agtgggccct gtggaacgta accccaatat cgagtccagc acggaatgta 4140 ccgatagatt acgacgtagc catttggttt actactctaa tcagatacct tccgtcgaaa 4200 attccgacac tgtcatacct caacaaacaa cagtagttca aactccaaat aaaacaccaa 4260 cacaagctgt cgatgagtct gctgtatctt caaggccagt tgtcatgaag attccgagcg 4320 aaagtcgggt aagcgcacct ataatcgaaa taccaataca gtgtaacatc gaaacacctc 4380 gtagatctac aagaatgaga aacccaccta aacggcttga cgacctatgt gactaacaat 4440 actaacctta tatatatgtt gaattcgatg tatgttaggg cacagcatct gttgagaggg 4500 ggaaagc 4507 // ID Gypsy-52_AA-I repbase; DNA; INV; 7823 BP. XX AC AAGE02020210; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-52_AA_; KW Gypsy-52_AA-LTR; Gypsy-52_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7823 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020210; Positions 30316 38138. XX CC Positions [4737-5216] - Integrase core CC 'ACAA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 367..2340 FT /product="Gypsy-52_AA-I_1p" FT /translation="MASKFPSASHLLNDEVDYELKLRNYGEECGKNLESKQ FT RTLRKLLHTDRAEDRDYRSMYSIDQEFDLISSRVNSIAASLAQAYDTKLIS FT RLKHYHLRTQRSNAKTNEAKMMKDSLVKQISELIAAYKPKSPLIPVGGQEE FT SSQDEEETRNRHLKKTVDGQGLDLQSAIETDSNNSHVNLAHKVQNLETQLN FT QMMSLLQQMLAKQEQSDQKQKQPEATEYANRTGTIPKNSRSQGFLPIQTGS FT ALGNGIANVNTGHNFRMNSAGRLRDEAEASHVLDNGQLAQGGHVGSFLQQQ FT DATQQLPDPVGLFQREDQLNNPSVNPPFRFQRGQLAGNTGPGPSGRFSGVA FT QQPDWYNGMRDENYRNSPLRSPSINESLQRENRMQYDRRIEKWNIFFAGTP FT RSPTLEDFIYKVKVLASMNGIPLDSLISHIHLLLRDEASNWFFTYYEANWN FT WYDFETKIRYRFGNPNQDQGNRQQIYERKQLKGETFIAFVTEVERLNKLLT FT NPLSPQRKFEIIWENMQQQYRSKLACFQVNNLDQLIQLNYRIDASNPSLHP FT VGPRHVVNNIEVDSDGESADEEEINELNKRYQRGQGGSRQQQKFGSRSEEA FT TRSPLCWNCRRNGHFWRECKEAKTTFCYVCGNPGTISTICNSHPKRDSSRG FT VTAPQNSGN" FT CDS 2616..5600 FT /product="Gypsy-52_AA-I_2p" FT /translation="MQYSCTGYVNVPVKFKGITKVIALVVVPEISRELILG FT INFWKAFNIKPMIQNGSNFEEIALIQTANTANTQAETFHFFLHPIETLPTL FT GKLDPDESLDIPGLELPGPSQATPESIETEHELTPDERALLTEVIREFPCT FT AENKLGRTTLIQHEIILREESKPKRQPLYRCSPSIQAEMDKEIERYKKLDA FT IEECSSEWANPLVPVRKSNGKIRVCLDSRRINAITKKDSYPMRDMKGIFHR FT LESAKYFSVIDLKDAYFQIPLKKECRDYTAFRTSKGLYRFKVCPFGLTNAP FT FTMCRLMDKVVGFDLEPHVFVYLDDIVVATRTLSEHFRLLRIVANRLKQAN FT LTISLDKSRFCRKQVNYLGYLLTNEGIAIDNSRIEPILNYARPKCVKDVRR FT LLGLGGFYQRFIQNYSKIVASISDLLRKGQKKFIWTEKAEESFQELKAALV FT SAPILANPDFRLPFVIESDASDNAVGAALVQHIEGEPRIIAYFSKKLSSTQ FT KKYASVEKECLGVLLAIEHFRHFVEGSRFKVVTDARSLLWLFTIGVDSGNS FT KLLRWALKIQSYDIELEYRKGKQNITADCLSRSIETLPLMSLDPEYQDLVE FT QIMKSPQDYRDFKVVDGQVYKFVKSSEGLEDTRFCWKRYPKKLDRESIVRE FT IHDRAHLGFTKTLAAVRERYFWPLMSSQIKRFCQQCVTCQTSKATNVNTTA FT PLKMQRKIAEFPWQFITMDYVGPLPASGKARNTCLLVITDVFSKFVLIQPF FT RQATADSLVPFVENMVFQLFGVPEVILTDNGTQFVSKSFEELLAKYHVTHW FT KTPSYHPQINDSERVNRVLTTAIRATIKRDHKEWANNIQAIANAIRNSVHE FT ATRYTPYFVMFGRNMVSDGREYRHMRDTSEETGSIESNERTKLYSEIRENL FT KKAFEKHSKYYNLRSNADCPKYVVGEKVLKRNTELSDKGKGYCAKLAPKYV FT PAVIKRVIGDHCYELEDDKGKRIGVFNCKFLKKFSLPTSN" XX SQ Sequence 7823 BP; 2468 A; 1483 C; 1733 G; 2139 T; 0 other; taattggcgc ccaacgtaaa atcaaacttt atgagtatta tgggaatgtc agtagtgtca 60 gtgttatgta atgaatgttt agagtccaat gtaatgtaga gtcacagtgc taggttatga 120 aaatttagtg tgagttaagt tattcggata tttgtgagtt gaggctattt tgtggaaatt 180 tatttaggaa tcttgcgtaa tacatattgc tattaagttc atgtttgatt atttgtttct 240 tcatctattt attactttgt ttgtcttaat ttatttattt cattattttt gtatagttat 300 tacatttggt ttgtgatttg caaaatttct tgaatattct acgtttaatt taacaataat 360 tacaaaatgg ctagtaaatt tccttccgct tcacatttgt tgaacgatga agttgactac 420 gaattaaaat tgcgaaatta cggtgaagaa tgtggtaaaa acttggaatc caagcaaaga 480 actttgagaa aactgttaca tacggatagg gcagaggata gagattatcg ttcaatgtat 540 tccatcgatc aggaatttga tctgatttcc tccagggtaa attccattgc tgcttcatta 600 gcgcaggctt atgacacgaa gcttatttct cgcctcaaac actatcattt gagaacacag 660 cgtagcaatg ctaaaacaaa cgaggcaaaa atgatgaaag attctttagt aaagcaaata 720 tctgaattga ttgcggctta caagcctaag agtcctttga ttccagtagg tggacaagag 780 gaatcctcgc aagatgaaga ggaaactcga aataggcatt tgaaaaaaac tgtagacggt 840 caagggttag acttacaatc ggcgatagaa actgattcaa ataactctca tgtaaatttg 900 gcgcataagg ttcagaattt ggaaactcag ctgaatcaaa tgatgtcatt gttacaacaa 960 atgttggcca aacaagagca gagcgatcag aagcaaaagc aacctgaagc aacagaatac 1020 gcgaatagaa ccgggacaat tcctaagaat tcaagaagtc aaggttttct accaatacaa 1080 actgggtctg cattggggaa tggaatagcg aacgtaaata ctgggcacaa tttcaggatg 1140 aattcagcgg gaaggctacg cgacgaagct gaagcaagtc acgtactaga taatgggcaa 1200 ttagcgcagg gaggtcacgt tggtagtttc ttacaacagc aggatgctac acagcaattg 1260 cctgatcccg taggtctatt ccagagagag gatcagctca acaacccttc agtgaatcca 1320 cctttcagat tccaaagagg tcaattggct gggaacaccg gtccgggacc gtcaggacgt 1380 ttttcaggag tagcacagca gcctgactgg tacaacggta tgagggatga aaactatcgg 1440 aacagtcctt tgagaagtcc gtctatcaac gagagtcttc aaagagagaa taggatgcag 1500 tatgacaggc gaatcgaaaa atggaacatt ttctttgcgg gtacgccgag atcaccaact 1560 ttggaagact ttatttataa agtcaaagta ttggcgagca tgaacggaat tccattagac 1620 agtttgataa gccacattca tttgttgcta agagatgaag cttcaaactg gtttttcaca 1680 tattacgagg caaactggaa ttggtacgat tttgagacga aaatcagata caggtttgga 1740 aaccccaacc aagaccaagg gaaccgtcag cagatttatg aaagaaagca actaaaaggg 1800 gaaaccttta tagctttcgt cacagaagtc gaaagactga ataagctgtt gacgaacccg 1860 ctatctcctc aacgaaaatt tgaaatcata tgggagaata tgcaacagca ataccgatca 1920 aagttagcgt gtttccaagt aaataaccta gaccagctga tacaactcaa ctatcgtatc 1980 gatgccagca acccaagtct acacccggtc ggaccgagac atgtggtcaa taacatcgaa 2040 gtggattctg atggcgagtc agcggatgaa gaggagatca acgaacttaa caagaggtat 2100 caacgaggtc aaggtggttc gagacagcaa cagaaattcg gatcaagatc tgaggaagcc 2160 actagatcac cactttgttg gaattgtcgg agaaacgggc atttctggag ggagtgtaag 2220 gaggccaaaa caacgttttg ctacgtttgc ggtaacccag ggacaatatc gacaatctgt 2280 aatagccacc cgaaacggga ttcttctcgg ggggttacag ctccacagaa ttcgggaaac 2340 tgaattcgga gtgcgtagac gggaacctta gcattcctct taaatcctct gttcccattt 2400 cttccaaacc atacaacgat cctttcaaaa gtcttctcga agttaacata cgaacaaacc 2460 aatgtccaca ggtgagagta gacatatttg gtgtagagtt tgaggcactg cttgactctg 2520 gtgccggtat tagcgtagcg aattcgacgg atttggttga tcgacatgga cttaaacttt 2580 taccatcgcc gatcaaaatt tgcacagcgg ataaaatgca gtactcttgt actggctatg 2640 taaatgttcc agtaaaattt aaggggatta ctaaagtgat agctctagtg gtagttccag 2700 agatatcaag agagcttata ttgggaataa acttctggaa ggcgttcaat atcaagccca 2760 tgattcagaa tggttcaaac ttcgaagaaa ttgcgttgat tcaaacggcg aatacagcaa 2820 acacacaagc agagactttt cacttctttt tgcacccgat cgagacacta ccgacactcg 2880 gaaaacttga tccggatgag tcgttggata ttcccggact ggagcttcca ggaccatcac 2940 aagcgacacc ggagtcgatc gaaacggagc acgagttaac accggatgaa agggcgttgt 3000 taacagaagt aatacgcgag ttcccttgta cagctgaaaa caaactaggt cgaactacgt 3060 tgatacaaca cgaaattata ctgcgtgaag agtcgaaacc aaagaggcaa ccgttgtatc 3120 gttgctctcc ttccatccaa gctgagatgg ataaggagat tgagcgctac aaaaagttag 3180 atgcgattga agaatgttcg agtgagtggg cgaatccttt ggtccctgtt cggaaatcga 3240 atgggaagat cagagtgtgc ctagactcga gacgcatcaa tgccatcacc aagaaggatt 3300 cgtatccaat gagagatatg aagggtattt ttcatcgact cgaaagcgca aaatatttct 3360 cggtgatcga tttgaaggat gcttattttc agatcccgct taagaaagaa tgtcgggact 3420 acaccgcctt tagaacctca aaagggttgt ataggttcaa agtatgtccg tttggtctca 3480 caaatgcacc ctttaccatg tgccgcctaa tggataaagt tgttggcttt gacttagaac 3540 ctcacgtttt cgtgtatcta gatgacattg tggtggctac gagaacattg tcggagcact 3600 ttcgacttct gcgaattgtg gcaaaccgtc taaagcaggc aaatctcaca atatctttag 3660 ataagtctcg attctgtaga aagcaggtaa actatctggg ttacctgctg actaacgaag 3720 gaatcgcgat tgataactcg aggattgaac caattttaaa ctacgctagg ccaaaatgcg 3780 ttaaggatgt acgccgttta ctgggattgg gtggatttta ccaacggttt atccagaatt 3840 atagcaaaat cgttgcctca atatccgact tactacggaa aggacagaag aaattcatat 3900 ggacggagaa agcggaagaa tcgtttcagg agctgaaagc ggcattggta tctgcaccaa 3960 ttctagccaa tccggacttc cggttaccat tcgtgataga gtccgatgcg tcagacaacg 4020 ctgttggggc tgcattggta caacacattg aaggtgaacc acgcattatt gcttatttta 4080 gcaaaaaatt aagcagtact cagaagaaat acgccagtgt cgagaaagag tgcttaggag 4140 tgctcttagc aatcgagcat tttagacatt tcgttgaagg aagcaggttc aaggttgtaa 4200 cggacgcaag gagccttcta tggcttttta caataggagt cgactcagga aactcaaagc 4260 ttttgcggtg ggctctcaaa atccagtcgt atgatatcga gttggagtac aggaagggaa 4320 agcaaaacat tacagcagac tgcctctcgc gctcaattga gaccttgccg ttgatgtcgc 4380 tcgatccaga atatcaggat ttagtagagc agattatgaa aagcccccag gactacagag 4440 atttcaaggt agttgatggg caagtttata aatttgtgaa aagttctgaa ggcttggagg 4500 acactcgatt ttgctggaag cgttacccta aaaagctcga tcgcgagtcg attgtacggg 4560 agattcacga tagagcccac ctgggattca ctaaaacatt agctgccgtt agggaaagat 4620 atttttggcc attgatgagt tcccaaatca aaagattttg ccaacaatgt gtaacttgcc 4680 aaacgagtaa ggctacaaat gtcaacacaa ctgctccact gaagatgcaa cggaaaattg 4740 ctgaatttcc gtggcaattt ataaccatgg actatgtggg tcccttacca gcttcgggga 4800 aggccagaaa tacatgtttg ctagtaatta cggacgtatt tagcaaattt gtgttaatac 4860 aaccatttag gcaagcgact gcggattccc tggtgccatt cgtagaaaat atggtattcc 4920 aactgtttgg tgtaccagag gtaatactta ctgacaatgg tacacaattt gtgtcaaagt 4980 cttttgaaga gcttctggct aaatatcatg ttacccattg gaagacccct agttatcacc 5040 ctcaaattaa tgattcagaa cgtgttaaca gagtcttaac aactgccata cgggcaacta 5100 ttaaaaggga ccataaagaa tgggcgaata acattcaggc tatcgctaat gctataagaa 5160 actcggttca tgaggccacg agatatactc catattttgt gatgtttgga cgcaacatgg 5220 tctctgacgg tagggagtat aggcacatga gagacacgtc cgaggaaaca ggtagcatag 5280 agtccaacga aagaacgaaa ctttacagtg aaatccgtga aaatcttaag aaagcttttg 5340 agaaacactc aaagtactac aatttaagat cgaacgcaga ctgtcctaaa tacgtagtag 5400 gagagaaggt tttgaaacga aacaccgaac tgtcagataa aggcaaaggg tactgtgcca 5460 aacttgcacc aaaatatgtc cctgccgtga tcaaacgtgt gattggagat cattgttatg 5520 agctagagga cgacaaagga aaacgaattg gagttttcaa ctgtaaattc ctgaaaaagt 5580 tttctttgcc aacctctaat taatgttaac tcgatttcaa gctatgcatc ctttctataa 5640 ggtaacaaaa cgtctaatga cgtcccaaaa tacattaatg gcacttcatt cacttgatca 5700 atgattcatt cgaatgaact gattcactag ctatgcaccc atttgctcaa tgggaacaat 5760 gcgccttgaa ggttgcctta aaatcaacga tgttgcattc agatcgaatt gtttcgttat 5820 caggtagagt tttcctggga acaagctatg aatactcagt gagtggacca agcaactatg 5880 ttgcacaaaa acatgaccgg ccattcataa gggtctcctc ggtttccgag ttgagcagtc 5940 ccacaacaac acgattcact actggtgaca acccttatga atgtgcatgg aatcagtgtc 6000 tctcacttca gaaacaatga tcaaattaag caaacaaaat ctccatcaca gctatgaacc 6060 tcaatacgag caaccattca caagaaaacc tgacgaggaa acgtttctag caaggctagt 6120 gacgaaggag tatggaagag gtaactctac ctagcctgaa tttcctcgaa gaagatcaac 6180 acgacagcta tgaacctcaa tttgagcaac tattgacaag aaatactgac gaggaaacgt 6240 ctctagcaag actaccgacg aaggtgtacg gaagaaacaa tcttacctag cctgaatttc 6300 ctctaatatt agtaaattag caacctaatc atagcatcta gattagtacc taaagtagcg 6360 ttagtttcct tagcacatta cgatatttac gattcagtcg atgatttcgt ttgtcttgta 6420 tcgtcaatgt aaataggtcc atagtttata catagtcgtg ttctcatcca ttgataatgt 6480 ccgtacaaat tttccaatac gttgatgtcg tagtcttcag tgtcaatgac gtcacacgtt 6540 cattcagtct aagtttcatt gattggtcat catatgctca aaggttgaag aggttcatca 6600 tcatctctga cagtatcgtg atatttcgtt cgtcctatcc tcaagttgtt aatagtagtc 6660 caataagaat agcagtaaga tctgaatctt gtcccaatag tttttgcaag ccgcatgctg 6720 tgttccaaat gaagaataga ttcccacgtc gcagcacctg tagaagaaac aacaaagcat 6780 tcaatttcag gcaggaaagc attaattgaa tactcccccc gtaaattttc tccttgataa 6840 agatgttctg catcgaagtc ccatccagca acaaaatctc aggaatccgt cccgtccagc 6900 aacaaaaaaa cagctattcc acactctgtc tgactcctag tgaggtgtca tggttctgat 6960 tggaatctag ttccggtggt ttgacagttc gatggctgcg tacgtgttag gtagattctg 7020 tacttgtggt gtgaagttaa atgtgaagtg ttgtataaca gttgaggggt gttaagttta 7080 gtttgctttt ttgagtttcg ggtgaccaga acaagtttct ggtgtggaaa ctcaactaag 7140 ttttaggttt aagttacgtt ttgtgagaga aatggacaaa ttgcctggag tataccttcg 7200 gggtgagcca aatctctgca aaatagatat ttgcggtaaa ttcggaggat atttatgtgc 7260 tcaacgtaat gattgaggtt ataatagaat ccgaataagc ctgtattttg ggattttaag 7320 aatttttatg taaagtattt ttccagattc gactgaatct gtcaaaagta cgttttggat 7380 taagttggtt tgagttttca gatttgtgaa gacagaatac acctaacacg ttggcgactt 7440 cgaactggaa aacatttttt gtatttattc atattttagc atttattaga gccttgtaca 7500 taataattca cattccgaat ttaagtagtt atagtagaat cagttggcag atttgctaga 7560 ggatataatg attcgctgga gaccggagtc acgcgacgct gatggtcagt agcagatacg 7620 gtccgtgata gatctgtagt ctgaaaacag ggaaggagtc aactttcagt tgatgatgaa 7680 accatatgac cccgaattta ttgaactcta agcaaagtac gtcgaacgat tcgacttttg 7740 taaatattaa tcgtaagtta aacccttacg aaaatttagt tgaatgaatt caactaaatt 7800 ttcgtaacct tagcatggag tga 7823 // ID Kolobok-18_HM repbase; DNA; INV; 2579 BP. XX AC . XX DT 10-SEP-2009 (Rel. 14.09, Created) DT 10-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2579 RA Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1929-1929 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 625..2169 FT /product="Kolobok-18_HM_1p" FT /translation="MKISGYRIIDCSILSDIFSALACPQCGECSSLSSSEK FT MQDKKGLASKLIIKCYNCEYENEFYTSKQCNRAYDINCRTVYAMRSLGQGH FT SGIEKFTALMNMPQPMTQNNYDKLAVKIGAVTQKVAEDTMFDAVAELRKHA FT ANDDVFDIGVSCDGTWQKRGFSSLNGVFSALSMDTGKVLDVEPMCRFCKGC FT FLKKDLLKTNPTAYAQWKNSHICKNNYKGSASGMESTGAKRVFLRSIDKYK FT LRYVNYLGDGDSKSFINVIDTYPGIQVNKLECVGHYQKRVGSRLRNLKKRE FT KGLGGRGCLTDAVIDRLQNFFGVAIRQNTGNLAGMKASVLATLFHVASSKA FT NNLHFPHCPTGINSWCKFNFDKANNTNTYKPGPGLPMNIIFKIRPIFEDLS FT KDSELKKCLHGKTQNANESFNSMIWDRLPKTRYVSFDNLKFGVYDAVANFN FT IGMKASVLVFEQLNMLPGAFMVKGSNKINNKRIKQSNYKMNEKNKLRRQLN FT RTDNNLEKEGITYEPGGF*" XX SQ Sequence 2579 BP; 905 A; 390 C; 457 G; 827 T; 0 other; gggggaaacg ccctctaatt ttacccaaaa tttgtaaaaa atatttttac ttatttatgc 60 atttaaatgt atgcagaatt aaaaattatt tgtttatata ctgattttta tcatttaggc 120 cttcaaattt tgtattttct taaggtaact caaaatttat accctaacaa cgcccatagc 180 aacggtgtag caccttaata aaaatcctaa taactaatgt atatttttaa atagctgcag 240 taaaatgctt ggtttggctt ctttactcat tcagctagcg aaaatagtct ggttcatcaa 300 gttaaattac agttttctgt tattctgaag catttgaagt tgtcaacttt cttaatattt 360 tttaaaaaat gagtaaagcc tgtagaataa aatcaagaac aataattgag aagaggtttc 420 ttagggaaga aatttaactg tctcgacaaa gaaaatattg aaataaggtt gaatgataac 480 cctgctgtaa caattttgac tgattcaaat actacaactg tcaataataa ctctgtaaat 540 aacgaatctg agttaaattt attttctaca tctgtatctg cgaaaaaggt aaagccaatt 600 gaatgttcca gagcttttac aagaatgaaa atatctggat ataggattat agattgctct 660 atattatctg acatatttag tgctttagcg tgtcctcaat gtggtgaatg ttcatccttg 720 tcatcgtccg aaaaaatgca agacaaaaaa ggacttgcct caaagttaat tatcaagtgc 780 tataactgcg aatatgaaaa tgagttttat acttcaaaac aatgtaacag agcatatgat 840 attaattgta gaacagtata cgcaatgcgt tcacttggac agggacattc tggtatagaa 900 aaattcaccg ctttaatgaa tatgccacaa ccaatgacac aaaataatta tgataaatta 960 gctgtgaaaa taggtgccgt cacgcaaaaa gttgctgaag ataccatgtt tgatgcagta 1020 gcagaattgc gaaaacatgc tgctaatgat gacgtttttg atattggtgt gtcatgtgat 1080 ggtacttggc agaaaagagg tttttcatca ttaaacggtg tcttttctgc attatcaatg 1140 gacactggaa aagttttaga tgtggaacca atgtgcaggt tctgcaaagg ttgctttttg 1200 aagaaagatc ttttaaaaac aaaccctaca gcctatgccc aatggaaaaa ctctcacatt 1260 tgtaaaaata attacaaagg atctgcaagc ggtatggagt ccactggtgc aaaacgtgta 1320 tttttaagat ctatagataa gtacaaactg cgatatgtaa attatttagg agatggagac 1380 agtaaaagtt tcattaatgt catagacact tatcctggca ttcaagttaa taagttagaa 1440 tgcgtgggac actatcaaaa acgtgttggt tcacgactac gaaatttaaa gaagagagaa 1500 aaaggtctcg gtggacgtgg ttgtcttaca gatgcagtta ttgatcgtct gcaaaatttc 1560 tttggcgttg ctataaggca aaatactgga aacttagcag gtatgaaggc aagcgtattg 1620 gctacattat ttcatgttgc atcttccaaa gcaaataact tgcactttcc tcactgtcct 1680 accggcatta atagttggtg caagttcaac tttgataaag caaataatac caacacttac 1740 aaaccaggtc caggtcttcc aatgaacata atttttaaaa ttagacccat ttttgaggat 1800 cttagcaaag attcagaact aaaaaagtgt ttacatggga aaacccaaaa cgctaatgaa 1860 tccttcaaca gtatgatttg ggaccgatta ccaaagacca ggtatgtgtc atttgataat 1920 ttgaaatttg gtgtgtatga tgcggtagca aatttcaata ttggtatgaa agcttcagtg 1980 ttagtttttg aacaattaaa catgttacca ggagctttta tggttaaagg ttccaataaa 2040 ataaacaata aacgaataaa acaatctaat tataaaatga atgaaaagaa taagttgaga 2100 cgacagctaa atagaactga caataatttg gaaaaagaag gtataacata tgaaccagga 2160 ggattttaaa tacatttaaa gaagttaatg attgacatgt tttggtgtta tttaattaaa 2220 aaaatgttaa tttgcttttt tctcagtatg aagtatttaa gacgccggcc aacattgctc 2280 gccaaccatt caagagccct tcataaaatt ttcagtgaat gtttattaaa tggtgtagag 2340 tgttttaagt tgaacagaca cctgctgttt atgctgtttt aatactatat tattgaatat 2400 tgaactttga cttcaaaatg cctgaattta tatttgtttc tgttcaactt aaaacactct 2460 tctactctat gtaaacacca tataaaaatt ttaactaggg ttgttttgtt ccctgcccac 2520 aatattggcc ggcgtgaaac ctgtaaaaat tgatttaaaa attagagggg gtttccccc 2579 // ID Gypsy-24_DYa-I repbase; DNA; INV; 5304 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_DYa_; KW Gypsy-24_DYa-LTR; Gypsy-24_DYa-I. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5304 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 1710631 1705328. XX CC Positions [4388-4849] - Integrase core CC 'CTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1757..4849 FT /product="Gypsy-24_DYa-I_1p" FT /translation="MLRLRQQMQRCAFGSSKAEIENICLKDKIIDVWAPLD FT LKKRLLGKEYSLEEVIEACQVEEQINKESKEMTSRPIAEPICKITHRGFNQ FT SGECPRCGKFGHTNNDPSCPARNVTCNKCSKSGHFARKCRTNLNGRFLKSQ FT RNNLKWPCTYIRSVEEEATSSKLKKGEEHCFKVSSEDEEEFIRCRVGGREI FT SLIIDSVCKFNLISHADWSLLVKRKATVFNVRTHTEKQFRAYASNKLLRVM FT YIFEAPISVEPGTEEIASFYVIENGEQSLLGRDTAIELKVLRLGRNINRIE FT EMEPFPKWKNIEVSLSIDYDVKPVQQPVRRIPAALEDKVMERLDEALSRDI FT IEPVTGPSAWISPIVLAFKENGDIRLCVDMRLANKAILGENYPLPTFDCFM FT TRLKEAKFFARLDLKDAYHQLALDESSREITTFITPRGLFRYKRLMFGVNS FT APEIFQRLLEQMLSAVPNAMNFIDDVIIFGATDLEIDSAVKEVCQIFQENN FT VLLNKEKCIWKTSKLKFLGHILSDKGIEVYPEKIDVIRSFRDPKNKEETRS FT FLGLVTYVGKFTPDLASHTDPLRKLLKTESKFTWGEAEKNAFSNLKNLLYK FT VPKLSYFNPRHRTRVIADASPVALGAVLLQSKVDNEPLIITFASKVLSEVE FT KRYSQTEKESLALVWAVEKFYYYLAGLHFELVTDHKPLEAIFKPTSKPPAR FT IERWLLRLQAYTFTVIYKSGRDNISDALSRLCQLPAVEPIDHKTEYSILRV FT AQSSIPRSMTIHEIAESSKRDEEIMDAISCLENDSWKPDSSVGFYPFRYEL FT SAVGSLLLRGNRLVVPTSLRVKILELAHEGHPGETAMKRRLRSKVWWPQID FT RDAEKFVKACRDCCLVSQDIRPPPFPTGPWIWVASDLLGPLPNNEYVLVFI FT DYFSRYMEHKFLKTISSTTLVEAMKEIFCRLGYPEHLRTDNGRQYVSEEFS FT NYCKACGIKQVRTPPNWPQANGEVENMNRALVKRLKIAYANGKNYKEELQK FT FVLMYNVTPHGTTSAHKADVQQSHP" XX SQ Sequence 5304 BP; 1800 A; 1103 C; 1209 G; 1192 T; 0 other; ttggcgacga gtggaaaaat cggtaaatga taatgataat aactaaaact ttggtaaatt 60 cgatgcatta ggaaaaaggc gcgaaattaa ggtaaacgag tggtgtgata taggctgtgc 120 gtgagtgtgt gtgtgttgga acgtgcctac cggcatatac atacataaaa atgtggaaaa 180 agaggagaga aggaataagc tgcgtgagtg cagattgcac gttgccaaca taagatgacg 240 ttaagcacaa agcggaatga gtaattttat atgcacaatt tttcgctcac tctcatccaa 300 cacacatgca ctccgacaga taggctgagt gaagagatta ctcttcgcaa gcttaagaga 360 aagtgagtta gagagacact ctctagcaac caggatataa gaaaaataac caataagaat 420 aatactcttt acaagcttaa aaggtgagct ggagagaaat ttctctagca accaaagagt 480 tgcaaatgat aatgataata actaaaactt tggtaaattc gatgaattag gaaaaaggcg 540 cgaaattaag gtaaacgagt ggcgtgatat aggctgtgcg tgattgtgtg tgtgttggaa 600 cgtgcctacc ggcatataca tacataaaaa tgtggaaaaa gaggagagaa ggaataagct 660 gcgtgagtgc agattgcacg ttgccaacat aagatgacgt taagcacaaa gcggaatgag 720 taattttata tgcacaattt ttcgctcact ctcatccaac acacatgcac tccgacagat 780 aggctaagtg aagagattac tcttcgcaag cttaagagaa agtgagttag agagacactc 840 tctagcaacc aggatataag aaaaataacc aataagaata atactcttta caagcttaaa 900 aggtgagctg gagagaaatt tctctagcaa ccaaagagtt gcaagggccg agtacgcatt 960 ataagctcta agagaggagc cgagtagtgg tacaagccaa aacatatttt gaaaaaaaag 1020 aaaagaaaca gaacacgaaa tatataaata aataaatatt ttaataatta aaaaaaaatt 1080 taatttaaaa aaaaataaaa acggaggaaa aaggaaaaat attacaagct tttaagaaag 1140 agctgggtaa tcggtttgag ttgcaaacaa aaaggttaaa ggtcgggtgg agcagtattc 1200 tctccacaag ccaaaagaat tcataaaatt taaaaggaaa aacggaaaca acaaaacccc 1260 aaatgtaaat gggaggtatt attgaactga ggctgagtaa taattacaag ctcctaagag 1320 agccgggtat taattacaaa tcaagtcgta aatctaataa tgtctttcgt cgtaaactcc 1380 aattctaggt cccgtattga aaatgaccga cagtgtcatt aaaccgtttc tgtgcaaggt 1440 gatcgacaag gcaatcctcc gaaatgaatg ggagaagtag ttgagggcat ttacaatata 1500 cctcgaggcc gagggtattg atacctagaa gcagaagagg agtaagctac tgcttttggg 1560 aggagtccag ctacagtcag tagtatactc tctacctggt gcgctggtgg agcccagcga 1620 cggaaataaa gaagacatct ataaaatcct catcgaccat ttaaataaac acttctcacc 1680 gaagcagaac tcaacgtttg aaagacatct attcagaggc ctgacaccgt tggacacaga 1740 aagctttggt gatttcatgc tgcgactccg ccaacaaatg caacgatgtg cattcggatc 1800 ctcaaaagcg gagattgaga atatatgtct aaaggataaa atcatagatg tgtgggcacc 1860 tctagacctc aagaaaagac ttctgggaaa ggagtattca ctagaggaag tgatcgaagc 1920 atgtcaggtt gaggagcaga tcaacaagga atcaaaagaa atgacatcaa gaccaattgc 1980 ggaacccatc tgtaaaatta cacatcgtgg ttttaatcag agtggggagt gccccagatg 2040 tggaaaattc ggacacacaa acaacgaccc atcatgtccg gcaagaaatg tgacttgcaa 2100 taaatgctcg aagtctggac actttgccag gaagtgccga actaacttga acggccgatt 2160 ccttaagtcg caaaggaata acctgaaatg gccttgcacg tatattcgct ctgtcgaaga 2220 ggaggccacc agctccaaac tcaaaaaggg cgaggaacat tgctttaaag tgtcaagcga 2280 agatgaagag gagttcatac gatgccgagt gggaggccgt gaaatttctt tgattattga 2340 ctcggtgtgc aagtttaacc ttatcagcca cgcggactgg tctctgctag tgaagagaaa 2400 agccacagtc tttaacgtaa gaactcatac agaaaagcag ttccgagcct atgcctctaa 2460 taaattacta cgagtaatgt atattttcga ggccccaatt tccgttgagc cgggtaccga 2520 agagattgcc tcattttacg tgattgaaaa tggagaacaa tctctgctgg gccgagacac 2580 ggctatagaa ttaaaagtcc tccgactggg aagaaacatt aaccgtattg aggaaatgga 2640 gccttttcca aaatggaaaa acattgaagt aagcctttcc atagattatg atgtgaaacc 2700 cgttcaacaa ccagtgagac gaatcccggc agcgctcgag gacaaagtta tggaaagact 2760 ggatgaagcc ctgagtcgag atataattga accagtcacg ggcccgagtg cttggatatc 2820 tcccatcgtg cttgcgttta aggagaacgg ggacatccgc ttatgtgtcg acatgcgact 2880 ggcaaacaaa gcaatcctag gggaaaacta cccattacca actttcgact gctttatgac 2940 cagactcaaa gaggctaaat tcttcgcacg cctagacctc aaggacgcct atcaccaact 3000 ggcccttgat gaatcgagcc gggaaataac aacgttcata actccaaggg gacttttccg 3060 ctataagcgt ttgatgtttg gagtaaattc tgctcccgaa atttttcaac gccttctcga 3120 gcagatgcta tctgcggtac ctaacgccat gaacttcatc gatgatgtca tcatctttgg 3180 cgcaactgac ctcgaaatcg acagcgccgt aaaagaagta tgccagatct tccaggaaaa 3240 taatgtccta ctgaataaag aaaagtgcat ctggaagaca agcaaactaa aattcctcgg 3300 acatatttta tccgacaagg gaatagaagt ctaccctgag aaaattgacg tcatccgatc 3360 tttcagagac ccaaagaaca aagaagaaac gcggagcttc ctcggtttgg tgacctacgt 3420 cgggaagttc actccggacc ttgcaagtca tacagatcct ctaagaaaac tgctgaaaac 3480 tgaaagcaaa ttcacctggg gcgaagccga gaaaaatgct ttcagcaacc taaagaatct 3540 cctatacaag gtacccaaat tatcatattt caatccaagg catcgaacac gggtgattgc 3600 ggacgccagc ccggttgccc ttggggctgt gttattgcaa tcaaaagtcg acaacgaacc 3660 tttaataatt acctttgcta gcaaggtcct gtcggaggtg gagaaacgct actcccaaac 3720 tgaaaaagag agcctagcgc tggtttgggc agtggagaaa ttttactact atctggcagg 3780 cctgcacttc gaactcgtaa ctgatcacaa gcccttggaa gccattttca aaccaacatc 3840 caagccccca gctcgcattg aaaggtggct gctgcggctt caggcatata cgttcaccgt 3900 tatctataaa tccggaagag acaacatatc agatgcgttg tctcgactct gtcaactacc 3960 ggcggtagaa ccaatcgacc acaagacgga atacagcatc cttcgcgtcg cacagagctc 4020 tatcccaaga tctatgacaa ttcatgaaat agcagagtct tccaagaggg atgaagagat 4080 tatggatgca atcagttgcc tggagaacga ctcctggaag cccgatagtt cagtgggctt 4140 ctacccgttt cggtatgagc tttcagcggt tggctcgctc cttttgaggg gaaaccgcct 4200 agtcgtgcca acatctctga gagtaaaaat actggagtta gcccacgaag gacatcccgg 4260 cgagacagca atgaaacgcc gcctgcgatc caaagtatgg tggccacaaa tagacagaga 4320 cgcagaaaag tttgtcaaag cttgcagaga ttgttgcctg gtatcacaag acatcaggcc 4380 accccctttt cccactggcc cttggatttg ggtagcgtcg gatttactag gtccgcttcc 4440 caacaatgaa tatgttctag tcttcattga ctatttctca cgatatatgg agcataagtt 4500 tcttaaaacc atatcttcaa cgaccctggt agaggcaatg aaagagatct tctgtagact 4560 gggatatcct gagcacctac gaacggataa tggacgccaa tatgttagcg aggagttttc 4620 taactactgc aaagcgtgcg gcattaaaca agttcgaacc ccaccgaatt ggccgcaagc 4680 aaacggggaa gttgaaaaca tgaacagagc ccttgtaaaa cgcttaaaaa tagcatacgc 4740 aaatgggaaa aactacaaag aggaactcca aaaattcgtt ctaatgtaca acgtaacccc 4800 acatggaaca acatcagccc acaaagctga tgttcaacag agtcatccgt gacaaaatac 4860 cgggcattgg ggatatctgc gagaatacct tagactctgg agagagagat aaagatatta 4920 ttgaaaaaaa taagaaaagc aggcagccga caaaagaaga ggggcaaagg aagtcgatat 4980 agaagtgggc gacaaggtgc ttttgaaaaa cgtagtgttc ccaaacaaac ttcgacaaga 5040 cggaatttac ggttttagaa cgacacaata atattgtggt gatcgaaggg ggtggtagaa 5100 aactaacaag aaatgcctca catcttaaga aagttccagc tagccaaacg tatccagcat 5160 cctgcccaac gccgcaacca gatgacagct taccggaatc aacacctact tcggagaacg 5220 cagtactcca gcccacagaa ggaagccccg ggatccagcc acccctgccg ccattaaagc 5280 taaaactcga taataaagga ggga 5304 // ID Gypsy-17_OD-I repbase; DNA; INV; 5219 BP. XX AC CABV01000966; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_OD_; KW Gypsy-17_OD-LTR; Gypsy-17_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5219 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000966; Positions 6754 1536. XX CC Positions [2276-2638] - Reverse transcriptase CC Positions [4066-4548] - Integrase core CC 'TCCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 300..1355 FT /product="Gypsy-17_OD-I_3p" FT /translation="MSVATLDWLKSYVKELDEEINAAPDKNESSVKEKIAT FT RKKLKEKLVSAIDSIKVEKEMSRDQATALKSAENALIDVNYSGHDFEETSN FT FTARVNQVYDIYIRNDSDLEESFAAKVKQRFKPIIYSRLKDASVSTKTWKE FT LQVWLNKEYDSGLSTTQLLTRAMETPFDSDAGWKKYSQSISDRMEAAKHSI FT LSQIRKTKMEKYKTTKLEDKKNEPVAEDIFSFVCASIVAGRLKVEKPEIHS FT MMANEWAKISDGATVATKIEFLQAQTNKEGGSVFFAQKSNKNGNGNKTWTN FT GGRGKSRKPCWNFAKGKCKRGDDCGFQHVKAENQESKVHVAKDPDLPDEPV FT STLFHSVLM" FT CDS 1730..2761 FT /product="Gypsy-17_OD-I_2p" FT /translation="MHGSGMELLLQSGKMVFFPWTSESSILVTKTTPQTAM FT SCTPISDQDTKCKSISELESAKSSDKQQNLSTKQKIEFLMNEKQIALKTEV FT FNGKYLDQLVDLLYSKRNVFKGTNDEIGLFSEQVRIPTVPGLTKAARARPI FT PKHLQPQVEVEIQKMLSSGIIEKCPDGKGFHSPLHIVPKKGNKIRICSDFK FT STLNQCLQESTEIWSLPNIDCLFADVEDGNNIFSSLDINSAYWNLEIAPMD FT RHKNNFLYKDQLYQYVRLPFGMKFSGDAFCKSIAKLLSTVKHQRNFKSYVD FT DILVHSKDIKTHLEVIKEILEACEIFNAKLGGAKSLSLSVILAQNQQNLWD FT V" FT CDS 2752..5064 FT /product="Gypsy-17_OD-I_1p" FT /translation="MGRLISSDGISIPEENMKALQELPPPTNRKELLSVLG FT SFVWWKNWVSCNIGDRIAVNSFSAVVKEMSVLNKANKPFAWNDDADAAFKN FT AKRILASNKVFSWPNFDYPFVVVTDASLIAIGGALIQEIGGKQKLISVYSK FT TLSPCEARWSATEREAFSLMMTIEKFNYYLQGKSFLVLTDHKALQCLDKKI FT VANDKICRWQERLSKYTFTVQYIRGAENTLADMLSRPWHKVREKSDKKPSD FT ELAGQFYNPVGEKDLLIYIPSWCSDNKLDRKMLIERVDTASSLFTLKSVAT FT GVWCPNVPILELRVIENAQSEDRVIGIVKSLVQNKVEASKWKVPDDVYGIQ FT RYKRLARNFGIHAETGLLTISWGERSCIVLPKSLIPRYLEAAHSKAHTGAD FT RTRDLLKWSWWLDQMKDIQEFVASCSTCLKFKGQDMQKGHPDRQVLFRPKR FT QWEILYVDFIQLPRSASGKSYALTVMDGYSRHLSVYPTARCRAVDCARALM FT KHVLCFDFPTYLSSDQGSHFKNELVSELCKLLGVTQNIHVAYRPESTGCLE FT RAHRVLKNALYGMSLERNMCWELVLPSVVSTMNKCKNVATKCSPFEVIYGR FT KPSFDGIQMHKNPAADSPQAYVKEVAAVLARTRKFVDLAQEESDIATKIDG FT KSKIKQIVIEIGDRILLKRSLSVETKINKNPYTGPFEVINTNGVIVRIDMD FT GKLTWVHRHHCVLQKQRALELDPDFVDLLYDDEPPQKAPDPVAEVPPERAT FT PPARRYPTRERKPPDRYQPN" XX SQ Sequence 5219 BP; 1667 A; 1115 C; 1153 G; 1284 T; 0 other; aattggtacc agaaagaacg aacttgacgc tctataaaga ccccgtaagt ttcatcccta 60 acctggccat ttgggaattg gtaatttggg aagttattaa attggtggaa ttaaatctgc 120 cgattttcga attgctgtgc accatcggaa ttgctcttta ccgcgcgaaa tacaaacctg 180 acgagacaaa aaccctcaaa gaggtcgctc agcattctga tttaaatcaa aataaatctg 240 agtgcgcgta aaaggtaaaa cgagcgagtg agtcacctaa ctgctatttt ttccgcgcga 300 tgtcagtcgc aactctggat tggctgaaaa gttacgtcaa agaactcgac gaggaaatta 360 atgcagctcc agataaaaat gaatcaagcg tgaaagagaa gatagcgaca aggaaaaaat 420 tgaaagagaa gttagtgtcc gcgattgact cgatcaaagt ggagaaggaa atgtccagag 480 accaggctac cgccctcaag tcggcggaga atgccctcat cgacgtaaat tattccggcc 540 acgatttcga ggaaactagc aactttactg ctcgagtcaa ccaagtatac gatatttaca 600 tcagaaatga ctcggatttg gaagaatctt tcgctgcgaa ggtcaagcag cgcttcaagc 660 caatcattta ttcccggctc aaagacgcct cagtatcgac aaaaacttgg aaagagcttc 720 aagtttggtt gaacaaggaa tacgacagtg gattgagcac tacacaactt ttgactagag 780 ccatggaaac tcctttcgac tcagatgctg gatggaagaa atacagtcag tcaatttctg 840 atcggatgga agccgccaaa cacagcatct tgagccaaat tcgaaaaacg aagatggaaa 900 agtacaaaac cacaaagctc gaggacaaga agaacgaacc agttgcagaa gatattttca 960 gcttcgtctg tgccagcatt gtagctggcc gattgaaagt cgagaagcca gagatacata 1020 gcatgatggc taatgaatgg gcaaaaatct ccgacggagc cacagtcgcc acgaaaatag 1080 aatttctcca ggcgcagaca aataaggagg gcggatcagt atttttcgct cagaaaagta 1140 acaaaaatgg gaatggaaat aaaacttgga caaacggcgg tcgagggaaa tcacgcaaac 1200 catgctggaa ctttgcaaaa ggtaaatgta aaagaggaga tgattgcgga tttcaacacg 1260 tcaaagctga gaatcaagaa tcgaaagtac acgtcgccaa agatcctgat cttccagatg 1320 agccagtttc gacgcttttt cattcggtgc tgatgtaaag tgcgcttcat cagcatgtct 1380 cacatgcttc acaaaatgcc gcacgtattt tcaaactaaa attacagtca cgattaatgg 1440 atttaacctt gatatcaccg ctctcacaga tactggcgca gataagaata tcttgccctt 1500 ggaactcatc ccaacttctc ttcacaaatt gattacaccc aattcgactg ttttctctgg 1560 ttgcggaaaa actcaggcga ttggaacttt gtatggttat gtacaggcgc acaaaaataa 1620 ttataaattt cacgacgtaa agttctatat tgtgaaagaa tctcttcctc caatcttagg 1680 aaaagctttt attctggaac atcgttcagt caagtctggg aatttccgaa tgcatggatc 1740 tggaatggaa ttattacttc aatctggtaa aatggttttc ttcccctgga catcggaaag 1800 ttcaatactt gtaacaaaaa cgacgccaca aacagccatg agctgtacgc caatcagcga 1860 ccaggatacg aagtgtaagt caatcagcga gctggaatca gcaaaatcgt ccgacaagca 1920 gcaaaatctc tcgacaaagc agaaaataga gttcttgatg aatgagaaac aaatcgcact 1980 caaaactgaa gttttcaatg gtaaatattt ggaccaattg gtggatttgc tatattctaa 2040 aagaaatgtg ttcaagggaa caaatgacga aataggcctc ttctcagagc aagtgagaat 2100 tccaacagtc cctggactca caaaagcagc cagagcacgt cctattccga aacacctgca 2160 acctcaagtc gaagtggaga ttcaaaaaat gctcagttct ggaattatcg aaaaatgccc 2220 tgatgggaag ggatttcata gccctttaca tatcgttcct aaaaaaggaa acaaaattcg 2280 aatttgttcc gattttaagt caactcttaa ccaatgtctt caagaaagca cggaaatatg 2340 gtccctgcct aacatcgact gtttattcgc cgatgtcgaa gatggtaaca atattttcag 2400 ctcattggat ataaattccg catattggaa cttggagata gcgcccatgg acagacataa 2460 gaataacttt ctttacaaag accagcttta tcagtatgtt cgattgccat ttggcatgaa 2520 attctccgga gacgccttct gtaaaagtat tgcaaagcta ctgtcgactg tcaagcatca 2580 gcgcaacttc aagagttatg tggacgatat tttagtacac agtaaggata taaaaacgca 2640 tcttgaagtt atcaaagaaa tactggaagc gtgcgaaatt ttcaacgcca agttaggagg 2700 cgctaagtca cttagcctaa gtgtcatttt ggcacaaaat caacaaaatt tatgggacgt 2760 ttgatttcaa gtgatggaat ttcgatcccc gaagaaaaca tgaaagccct gcaagagctg 2820 cctccgccta cgaatagaaa agaattgctc tctgtactgg gaagtttcgt ttggtggaaa 2880 aattgggtca gctgtaatat cggggaccga atcgcggtca attccttctc ggcagtggta 2940 aaagaaatgt ctgtcctcaa caaagcgaac aagccgtttg cttggaacga tgacgccgac 3000 gcagctttca aaaacgctaa gaggatctta gcgtcaaata aggtatttag ttggccgaat 3060 ttcgattatc ctttcgttgt cgtcaccgac gccagcttaa ttgcaattgg cggcgcatta 3120 attcaagaaa ttggtggtaa gcaaaaactt atatccgttt actcgaagac tttgtcacca 3180 tgtgaagcga gatggtcagc cactgaaaga gaagctttca gtttaatgat gactatagaa 3240 aaattcaatt attatttaca agggaagagc ttcctggttc tgactgatca caaggctctt 3300 caatgtctag ataagaaaat agtggcaaat gacaaaatat gccgttggca agaaagattg 3360 agtaagtata ctttcacggt acagtatatt cgcggagcgg aaaacacact cgcggacatg 3420 ctgtcacgcc cctggcacaa agtgcgagaa aagtccgata aaaagccgag cgatgaatta 3480 gctgggcagt tttacaaccc agtcggagag aaggatttgt taatctacat cccttcttgg 3540 tgctcagata acaaacttga tcggaaaatg ctgatcgaga gagtggatac cgcatcaagt 3600 ctatttactc taaaatcggt cgctaccggc gtttggtgcc caaatgtccc gattctggaa 3660 ttgcgagtta tcgaaaacgc ccaaagcgaa gatcgagtaa tcggtatagt aaaatcttta 3720 gttcaaaaca aggtggaagc gtcgaaatgg aaagtccctg acgacgttta tgggattcag 3780 cgctacaaac gactcgcgag aaatttcggt attcacgctg agactggcct ccttactatc 3840 agctgggggg agaggtcgtg catcgtgctt ccaaaatctc taattccccg ttatttggaa 3900 gcagcacact caaaagcaca tactggagct gatcgtacta gagacctgct caagtggtcg 3960 tggtggctcg atcaaatgaa ggatattcaa gaattcgtcg cctcatgttc gacttgccta 4020 aaattcaaag ggcaggacat gcagaaagga catccagatc gacaagtact gttccgcccg 4080 aaaaggcaat gggaaattct gtatgtggac tttattcagc taccgcggtc ggcgtctggt 4140 aagtcatacg cgcttactgt aatggatggt tattctcgac atttgtccgt ttatccaaca 4200 gcaaggtgca gagccgtcga ttgtgcgaga gctctgatga agcacgtgct ttgctttgat 4260 tttccaactt acctctcatc tgaccagggg tcacatttta aaaacgaact ggttagtgaa 4320 ctctgtaagt tgctaggtgt cactcaaaat atacacgtcg cgtaccgtcc ggaatcgacg 4380 ggatgcctcg aacgagccca tcgagtctta aagaatgcac tttatggaat gtccttggag 4440 cgaaacatgt gctgggaact tgtacttccg tcagtcgtca gcacaatgaa caagtgcaaa 4500 aatgtcgcta caaaatgcag cccattcgaa gttatctatg ggcgcaagcc ctccttcgat 4560 ggaattcaga tgcacaaaaa ccctgcagca gactcgccac aagcatacgt caaagaagtt 4620 gcagctgttc tcgcccggac cagaaagttc gttgacctcg cgcaagagga aagcgatatc 4680 gccactaaaa ttgacggaaa gtctaaaata aaacaaatag ttatagaaat aggcgatcga 4740 attttactca aaagaagtct ctctgtagaa accaagatta acaaaaatcc ctacactgga 4800 ccgttcgaag tcatcaacac gaatggagtt atagtcagaa tcgatatgga tggaaaatta 4860 acgtgggtgc acaggcacca ctgtgttcta caaaaacagc gcgccctgga acttgatccc 4920 gacttcgttg accttcttta tgatgacgag ccgccgcaga aagcacctga tcccgtcgcc 4980 gaggtaccgc cggaaagagc cacaccgcct gcgcgcagat atccaacgcg tgaacgaaag 5040 ccacccgatc gttatcaacc aaactaattc aggataaaat cacgtataaa attatttatc 5100 gtaacaatgt gatcataaag acaacaatga agattattca agtggatggt ctttatcttc 5160 ttattttctt ctcgactagc gaatccgact tcaacctttc cgtcgcgaat ctgggggga 5219 // ID Gypsy-35_DPu-LTR repbase; DNA; INV; 187 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_DP_; KW Gypsy-35_DPu-I; Gypsy-35_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-187 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 187 BP; 39 A; 57 C; 27 G; 64 T; 0 other; tgtgatgacg tctcatgtat cgtcttacac gtcctgttct cctatggctc tccggagcgt 60 atgttatccc gctaggcgca tatactattc tccagtccac tcttagcttc tctattgtac 120 tggtcatccc tcgaccaact cccttacttc cgtaaataca tttattcaat cagtcaacat 180 ccttaca 187 // ID CR1-15_BF repbase; DNA; INV; 3443 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-15_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-15_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3443 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3443 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1586-1586 (2009). XX DR [2] (Consensus) XX SQ Sequence 3443 BP; 998 A; 901 C; 728 G; 816 T; 0 other; acccaggttc ccaacaatat atttccgtct atcagcttgc tgaatgagaa ggatacaaac 60 ctactttcct tggtttcctt catccttaat cccaagcaaa actgggctct ttttaggaaa 120 cccaaaacta ctgtattttg tactattctt ctaatcttac tgagtggtga tgtacaaccc 180 aaccctggcc cacgtgcccc gaagtacccc tgtggcatgt cgcaaagctg ttaggtggga 240 aaaggtcgat gggcgcaggg cagtctgctg tgacacttgt gatacctggt atcacacaga 300 ctgtatgggc atgaccaccc ctgtttatga tgctatcaac caccctagtg tatcatggat 360 ttgttgtagt tgtggtcttc caaactttag ctctagcttg ttcagttcta ttgtctttga 420 aacccccaac tgcttctctt ttccaacacc taactccagt tcctgcacta gtccaggtag 480 ttctgtgggc tctcccgtag ccacctctac tccaactacc caagtcccaa accccccacc 540 aagggaccga acacagagca aacctcttag agctgcagtc attaacttcc agtcaattcg 600 gaacaagacc tctgaattcc atgttttctg tgacacagtc caaccggaca ttattatcgg 660 aactgaatcc tggctagatc cctctgtagg caacagtgaa atcttcccac cttcatacac 720 cgcatttagg aaggacaggg agggacgagg aggaggagtc ttcatagcca ctagatcgga 780 catcatagca acgcaccatc cagagcttga ttcgaccaac gagctaatct ggatacagat 840 aaatatgtca cattgtaaga ccatgtatgt tggagcatac tataggcctc ctgatgctgg 900 tttggatgat tacctagccc tagaacagtc gctaacaaac atacgacaga aacccggtaa 960 ccctcagatt tggttagcag gtgattttaa cttaccagct gcaacttggg gtttggattc 1020 taccagtaat tccacaagtg cagcctccag cctcactggt aagtatatca acctggctaa 1080 cgactgcgga cttgaacaag tcgtcagcga accgactcgt cgtatcggcc aagctgcaaa 1140 tgtcctggat ttattcttca catccaatcc tacgttagtt gaaaaatgta agatcgtacc 1200 gggactcggt gatcacgaca taccactgat tgacgtacat gcaaagccac aaaaaaccaa 1260 gaagaaagac aggctggtgt acctctggag gaaggggaat atggacgaac tccaaaagga 1320 tatggaagag tacagcaaca acttcttaag acaggcccca aagagaacaa ctacagagaa 1380 ttgggaggac tttaaaacaa ctctattgtc tgcagccaac aaacacatcc cacagaagag 1440 ggcaagaaat caacatagcc agccatggat tactccacag atcaggagag ccatgcggaa 1500 aagagacaaa ttgtacacta aagccaggaa aacgaacaca gaagaagcgt ggtccaagtt 1560 taaagcctgt agaaaaggtg taaaacaaag agtaaggaag gcccacagtg tgtataaagc 1620 agatttcctg gaggaaaaca tccaggagaa ccccaaggcc ttctggaatt acattaagtc 1680 tctaaagcaa gactctacca acataacgtc tatcaggcac cagggtatac taacctctga 1740 ccctaaggag aaggcagaag ctttagggga ccagttcagc tcggtcttta cgaaggagga 1800 cctggatacc agaccaacac ttgggaatcc tgtgactccc cccatatcgc aacttcacat 1860 ctcagtggaa ggggtagcta aacaactttc agatcttaat cccaacaaag ctacaggccc 1920 tgatggtctt catcctagac tcctgaagtc agtgtccaca cagattgcac ctgtattaca 1980 gtccattttt acacagtccc ttgctacagg cgacgtacca gaagactgga ggtccgccaa 2040 catctcaccc atcttcaaga agggagacaa gtccttacca tcaaactata gacctgtctc 2100 actcacatca gtctgtagca aagttttgga gcatattatt catagtcata tgatgaagca 2160 ccttgataaa tacaatatat tgtcccctgc ccaacatggc tttaggaaag gcctctcttg 2220 tgaaagccag ctattggtca cgactcacga cctgactagt gcccttgacc aaggaaaaca 2280 ggtcgacgcc atagtgctcg atttcagcaa ggcttttgat atggttcccc acaatcgcct 2340 tctgagtaag ctccagcact acggcatctc tggtaatctt ttgtcttggc ttcaagcctt 2400 ccttacacaa aggacccagg cagttgtcct tgatggggag tcctcgaaac ccacaaaggt 2460 tttgtcggga gtgccccaag gaactgttct gggacccctc ttgttcctac tttacataaa 2520 tgatttacct gatattgttg ggtccaacgt gcgtttgttt gcagacgatt gcctgcttta 2580 ccgtattatt gaacacccaa gtgacgtaca aggcttacaa gctgaccttg acgcactgac 2640 ccattggcaa aaccaatggc agatgtcctt taatccatct aagtgccaca cacttcatat 2700 aactcacaaa cgtaaaccaa ttatttctca atatgttctt tgttcagaaa accttacaag 2760 tgtcaaaacc cacccatatc ttggtgtcca actatctcat gacatgaagt gggacacaca 2820 tataaagtac gcaactggta aagcgaatcg aatactggcg gtcattcgcc gcaacttaca 2880 gcactgccca ccacgggtca aatcaacatg ctataaggcg ttggtgcgcc cgcacttgga 2940 gtacagcgca gcagtgtggg acccctatac tatcagtggg gcccaggcac tagaaagagt 3000 gcagcgtaga gcggcaagag tgaccgtcaa cgattatcgc agaaccagta gtgtatctga 3060 tatgctcaca agcctacaat ggccactcct cgctgacaga aggagagacg cacgtctaat 3120 cactttctac aaaattgtga atggcatcat aaacatatcc ccaacccagt atcttaaacc 3180 tgcccaacgc agaaccagag gaagtcacat gtttaagtat cagttaacca cagctaaaag 3240 agactgtttt aagttttctt ttttccctcg gacggttgtc gagtggaaca gactgccggg 3300 gcacactgcc caggccccgt cagtcgaggc ctttaaggcc ggtttggctg ccttgcccta 3360 gtccgcaagc cacccccccc ccccccctca gcgacgcccc agcaggggtt tttagagggt 3420 acacaagacg aagacgaaga cga 3443 // ID LINE-1_AA repbase; DNA; INV; 4709 BP. XX AC M95171; XX DT 26-APR-2005 (Rel. 10.04, Created) DT 02-MAY-2005 (Rel. 10.04, Last updated, Version 1) XX DE A. aegypti LINE-like retrotransposon, partial sequence. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; Juan-A; KW LINE-1_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4709 RA Mouches C., Bensaadi N. and Salvado J.C.; RT "Characterization of a LINE retroposon dispersed in the genome of RT three non-sibling Aedes mosquito species."; RL Gene 120(2), 183-190 (1992). XX DR Genbank; M95171; Positions 1 4709. XX SQ Sequence 4709 BP; 1544 A; 884 C; 823 G; 1458 T; 0 other; ccagtgcgca tcgtaccttc gtgctgtgcg tacacgaatt ctctctgctc ttggaagttt 60 tcttaaaaac tacctaaagc aataaaacag tacttttcag tgctacaaaa acagtacttt 120 tcagtgctaa aattaaaaac ggtacttttc agtgctacta aaacagtact tttcagtact 180 attttttcta ctattgatcc ctttacgatc cttgtttgga cccgtgcctt cgatttttcg 240 ttggacccgt tggcgaaagc tagcggtggt aatccttctt ggacaccgtc ttgggaaaaa 300 acctttcgaa ggtcacgtct tctttcgttt attaattaaa catggtatca acaactaaca 360 aaaggaaggg tgaatctctg aattcactac ttccttccaa aaaagtgggt tttaaaactg 420 tcactacacg tggcaagaat ggaagaaagg acgtttctcc ggaatgcgaa ctttcttcca 480 agggtgaaat gaataattgt atcgaaatga gcaatcagtt cgatgctcta gacaaatttt 540 ccgaacacca aatcgaagca gcctctagct caggctcttt gattcaagtg aggaagcaaa 600 gagtgccgcc tatcgtggtc agttgttccg aatttggggg atttaggcag gagatcttaa 660 actccattag gggaatcaag gtttccttcc aaatcgcaaa gaaaggagac tgtcgcgttt 720 tgccggaaac tcttaaagat cgtgagcttc ttctcaaaca tcttgaagag aagaagcaca 780 atttttttac ttatgacgac aaaactgaac gtttgttcaa agttgtcttg aaaggtctct 840 caagtgacta taaatcacct gaagagatca aaaatggaat aattgattta cttggatttt 900 ccccagtcca agtaatcatt atgaaaaaga gaacccaatc tggcattgtt cggaaagggc 960 tttctcaaga attttattta gttcacttta acaaaaaaga actaaataat attaaagctt 1020 tagaaaaagc aaaacttttg tttgatgtcc gtgtgacatg ggaacatttc cagaaacctg 1080 gaggaaatta ccagaacccc actcagtgcc gtcggtgcca aaagtggggt catggtacaa 1140 aaaattgtcg catggatgct aaatgcatga tttgcggagg ttcttctcac accaaagacg 1200 tctgtccagt gaaggaagat accaccaaat tcatatgttg taattgcggg gctaaccata 1260 agtccaattt ttggaattgt ccttcacgca aaaaagtcat tgaggctcgt gccaggcaga 1320 tgaaagataa tatccgttac gataacggtc gtttccggaa tttgcctggt agagtatcga 1380 acaatgctca tttttcagtt aacgatcgct tgatcatgaa tcatacccat caggaagatc 1440 ataatcatgc tcattcacaa actaatttta ttccgtcggg tagccgttcg aatctttcta 1500 cttcgaatgt atctacccac ggtaaatcct ttgccgatat cgtagcagga aattcgaact 1560 cctcccctgt tcgatccatg ggtacccatt ctacttgttt caaatcaaat ggaaaaaacc 1620 ctaccgccac aggtaactcc gcttcttcgt ctgccgaaaa ttccaatggg aaatcacatg 1680 acatgtctgc ctctgatttt aattttctaa ctgaacaatt gaatctaatg attgatgcaa 1740 tgttcaaagc caccactatg actgaagcag tccaagtagg tgtaaaattt acaaatcaaa 1800 ttgttattgg attacgtttt tctaatggat ccaaataata atttaaatat tttaaattga 1860 atgctcgttc tctgaatggt aaagaggacg agctgtttaa ttttcttacg gttaataacg 1920 tgcatatagc agttattacc gaaacgtatt taaaacctgg atctaaactc aaaagagatc 1980 ctaacttttt tgtttatcgt aatgatcgac ttgatggggc atgtggggga gttgcaatca 2040 tcattcatag gcgtataaaa catcaactgt tttcatcatt tgaaactaaa gtttttgaaa 2100 ctttaggtgt ttctgttgaa acacagtttg gtaaatatac tttcatagct gcctatttgc 2160 cttttcaatg ctctgggcag caagttaatt tgctccaaac tgacttgcgt aaattgactc 2220 gcaataagtc aaaatttttt gtcattggtg actttaatgc caaacatcgg tcatggaata 2280 attctcaaag taattccaac ggcagaattt tatttgatga gtgctcttca ggatatttct 2340 caattcaata ccctgatagc cccacatgtt tttcctcttc tagaaatcca tctacgattg 2400 atttggtctt aaccgactct agtcatcttt gtagccaact gattactcat gctgattttg 2460 attctgatca tgtccctgtt acatttcaaa tatcccaaga agcgattctc aatcctatca 2520 gctccacttt caattattta cgagccgact ggaatatata taaaacgtat gttgactcca 2580 atcttgatgt taacatttct ttagaaacta aacttgatat tgacaatgct cttgaaactt 2640 taacaaattc cattgttgaa gcccggagca ttgcaattcc aaaatgtgaa gtaaaatttg 2700 aatccgtgat tatagacgat gatcttaaac tcttgatccg tcttaaaaac gtgaggagaa 2760 ggcaatttca agccactcgc gatcctgcta tgaaaattat atggcaggat ttgcagaaag 2820 aaatcaagaa acgttttgct caattaagaa acaaaaattt tgaaaataaa atttctcaat 2880 tggaccctgg ctctaagccc ttttggaaat tatcgaaaat cttgaaaaaa cctcagaagc 2940 caataccggc attgaaagag gaaaacaaat tattactaac taattgcgaa aaagctcaaa 3000 aacttgctat gcagtttgaa agtgcgcaca attttaattt aggacttact agtccaattg 3060 aaaatcaagt tactcaggag ttcgaaaata ttctaaatca agagaacgtt ttcgaaaatg 3120 cctgggagac ctatttggaa gaagtgagaa ctattattaa aaaattcaaa aacatgaaag 3180 ctcctggcga tgatggaatt ttctacatcc tcatcaagaa acttccagaa agtagcttat 3240 catttttagt tgatatattt aacaaatgtt ttcaattagc atattttcct gacaaatgga 3300 aaaatgctaa ggttgttcca attttaaaac cagacaaaaa tcctgcagaa gcttctagct 3360 atcgtccaat cagtttgctt tcctccatca gtaaactttt tgaaaaggtt attttgaaca 3420 gaatgatggc ccacatcaac gaaaattcaa tttttgccaa tgaacagttc ggattccgcc 3480 atggacattc gaccactcat caacttttac gtgtaacaaa tttgacccgt gccaacaaat 3540 ctgaaggcta ttctactggt cttgctcttc tagacataga aaaagcattc gacagtgttt 3600 ggcatgaagg tttgattgta aaattaaaaa acttcaattt tccaacatac attgttagaa 3660 taattcaaag ttatctgtca aatcgtacac ttcaggttaa ttatcagaac tccagatctg 3720 aaagacttcc tgtaagagct ggtgttcctc aaggcagcat tttgggacca atattataca 3780 atattttcac atctgactta cctgagttac ctcagggatg tcaaaaatct ttgtttgcgg 3840 atgacacagg cctctccgcc aaaggacgaa gcctgcgtgt catctgtagt cgattgcaaa 3900 aaagtttgga tattttttct tcatacttgc aaaaatggga gatttctcct aatgcttcca 3960 aaactcaact aataatattc ccacataaac caaaagctct ttatttgaaa ccttcaagta 4020 gacatgttgt cacgatgaga ggggttccaa taaattggtc agatgaagtt aagtatctag 4080 ggctcatgct agataagaat ttaactttca aaaatcacat tgagggcatt caagccaaat 4140 gtaataaata tataaaatgt ctttatcccc ttattaatag aaaatcaaaa ctttgtctta 4200 agaacaagct gttgatattc aaacaaattt tcaggccagc catgttgtat gctgtaccaa 4260 tatggactag ctgttgtaat accaggaaga aagctctgca gagaattcaa aataaaattt 4320 tgaaaatgat tctgaggctt cctccctggt atagtaccaa tgagttacat agaatatcaa 4380 tgttgaaaca ttggaacaaa tgtcaaatac aatcattaat aatttcaggc aaaaatcgtt 4440 acaatcttct attgccacga ttaatgcgtt atatgtttag gttaagttag gttaagtata 4500 ttaaaaacgt tttttttctc ttataagcag gtgaaatcaa ctcacctgta aaaaaactga 4560 actgctacgg caaatgaaat gtaatatgtt gttaacaaaa tgttaattta atcttaaatt 4620 tgttttacca aattaggatg atagtgttgt caaataacac agaacaccta gatataagaa 4680 atgaatgtaa tgttttttaa agaaataaa 4709 // ID CR1_Ele11 repbase; DNA; INV; 4952 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele11. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4952 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4952 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 14..811 FT /product="CR1_Ele11_1p" FT /translation="MAFCDRMIHLRCSPTKLNKPFVNIIQSCPNLLWICNE FT CVKLMKCARFKSTVSSFGNALELITEKQEIAHAELKREIAKQGQQIAXLSK FT GIALSTPTHSKSGLTPRQPPLKRRRDESLLPNKPLVGGTKVATDNNVTTEV FT LTVPEPAELFWLXLSRIHPSVKPEAIEKLVKDCVQCEDSVKVVPLIKKDTD FT ISRMSFISFKIGMDPKLRETALSAETWPKGILFREFESGNSKNMWFPRLNT FT PTVTISPAPGPSQYSTPTTGVEPMC" FT CDS 652..4782 FT /product="CR1_Ele11_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RRNLAKRYSVSRIRKRKLKKHVVSPPEHADRDNFSCT FT WTFAVLNSYDRSRTNVLAGXEITDTRSSLGQSRYISNFLSPERNLATXPEE FT APILLTAVEPLLPATISRSGPACESGEGVFQTPSNGKYKHNMNFASPESIL FT VSSQPSNTSPLFRVATFPTSSSPGRTITASLEEAPYPPNTVEHFLPAACSR FT PGPVFGEGDGVFQIPATGKYIQLTTFTPPKSILVCSHCVPHRSTPTPPFSI FT SSPGRTLAEISEETPDPPNTVEPFLPATCSRPGPVLGVGNGVFRTPNIGKY FT DQIKNSASPDSFSVSSQPERSFPWNGVTSFTQQSLHRSVQLLSSNGDTSSP FT GRKITVGTKEAPDPPNTVEPLQPATCSRPGPVLGVGDGVFRSPNVGKYHPQ FT SNASAPDSNPISSAAAAQQTASDVIIYYQNVGGMNGLIDDYHLAFLDHGYD FT VIVLTETWLDSRTVSSYIFGEGYEVFRCDRNTNNSCKTVGGGVLVGVHTRL FT KASLIENESWISVEQVWVAIQLSDRKLYLCVVYIPPDRTRDMDYINTHCQS FT IAVVSEYATPSDEMIVLGDFNLPDIAWTPTHSGFLCPDYQRTSLHVGAVNL FT LDCYSTATLAQINHVMNENGRYLDLCFVSGRDTAPFISTAPAPLVKVVSHH FT PPLVVAIETNVVRDFISSPLVVAYDFRKADIHSIAELLSETDWDCILDITD FT VNNAALTFSHVLAYVIERHVPKKVIRTDSRVPWMTNDLLLLKRKKKAALRR FT FTKFRTWSSKCYYVRLNYEYKRLSRFCFHRHQSKIQRRLKLHPKSFWNYVN FT EQRKEVGLPSSMTFNGTTGSNLEEICNLFSNKFASVFVNEQLSVDSINSAA FT NNVPLANQTLSSIHLTQETISKAASQLKTSFKPGPDGVPAAFVKRMIGSLL FT EPLLKVFQLSLTSGTFPTCWKTAEMFPVYKKGSKRDVNNYRGITSLNAVAK FT LFELVVMDSLSAHCRQYLSADQHGFTIGRSTTTNLLCLTSYITESMNVRAQ FT TDVIYTDLSAAFDKLNHEIAIAKLDRLGVGGSLLRWFRSYLTDRQLVVALG FT EYRSPCFYASSGIPQGSHLGPLIFLLYFNDVHFVVEGRRLTFADDLKIFLR FT ICSIEDCRYLQDQINVFAGWCDLNRLVVNPAKCSVITFSRKKQPIIFEYSI FT FDTPIERVQCVKDLGVLLDSQLTFSHHIAYIVDKASRTLGFVFRTAKNFTD FT IYCLKALYCSLVRATLEYGSAVWSPNYNNGCERIESVQRRFLRFALRRLPW FT RDPLRLPSYESRCQLIDLQPLRIRRDVCRALTVVDTLNGRIDCEAILQQMH FT LNVQPRPLRNSSMMRLPFRRTNYGLSNAVHGLQRVFNRVASIFDFHLTRNM FT LRRKFSSFFAGRRD" XX SQ Sequence 4952 BP; 1278 A; 1225 C; 1082 G; 1362 T; 5 other; atttatctcg tgtatggcgt tctgtgacag aatgattcat cttagatgtt cacctacgaa 60 gcttaacaag ccgtttgtga acattattca atcgtgtccg aatttgcttt ggatttgcaa 120 cgagtgtgtg aagcttatga aatgtgcccg tttcaaatca actgtgtcgt cgtttggcaa 180 cgcgcttgag ctgataaccg aaaagcaaga aatcgctcac gctgagttga aaagagaaat 240 agcgaagcaa ggacagcaga ttgctcwact gtcgaaagga atcgcattgt caactccaac 300 ccactcgaaa tctggactta ctccccgcca accacctttg aaacgacgcc gtgatgaaag 360 cttgcttccc aataaaccac ttgttggcgg caccaaagtg gccaccgata ataacgtcac 420 wactgaggtg cttactgttc cagaacctgc tgagttgttt tggctgtamc tctctcgcat 480 tcacccaagt gtcaaacctg aggcaatcga aaagctagtc aaagattgtg tacagtgtga 540 agactcggtt aaagttgttc cattgataaa aaaggacacc gacataagcc gaatgagctt 600 catctccttc aaaattggta tggatccgaa actacgtgaa accgctctta gcgcagaaac 660 ttggccaaaa ggtattctgt ttcgagaatt cgaaagcgga aactcaaaaa acatgtggtt 720 tccccgcctg aacacgccga ccgtgacaat ttctcctgca cctggacctt cgcagtactc 780 aactcctacg acaggagtag aaccaatgtg ctagccggga gwgagattac tgacactcga 840 tcatctctag gccagtctcg atacatctct aattttctgt caccggaacg caatcttgcc 900 acckgccctg aggaagcccc tatcctactc accgcagtcg agcccctcct gccagcgacc 960 atcagccgtt ccggtcctgc gtgtgagtct ggagaagggg tcttccaaac tcccagtaat 1020 ggcaagtaca aacataacat gaattttgca tcgcctgaaa gcattttggt ttccagtcag 1080 ccttcaaata catcaccttt gtttcgagtc gcaacgtttc caaccagttc atccccggga 1140 cgcacaatta ccgccagcct cgaggaagcc ccatatcctc ccaacacagt cgagcatttc 1200 ctgccagcgg cctgcagtcg tcccggtcct gtgtttgggg aaggagacgg ggtcttccag 1260 atcccagcaa ccggcaagta tattcagcta acaacgttta cgccgcctaa aagcattttg 1320 gtttgcagtc attgcgttcc acatcgttca actccaacgc cacctttctc catttcatca 1380 ccgggacgca cacttgccga aatctctgag gaaacccctg atcctcccaa cacagtcgag 1440 ccattcctgc cagcgacctg cagtcgtccc ggtcctgtgc ttggtgtggg aaacggggtc 1500 ttccgaactc ctaatatcgg caagtatgat caaataaaga attctgcatc gcctgatagt 1560 ttttcggttt ccagccaacc tgaacgctct tttccatgga atggagttac ttctttcacg 1620 caacagtcgc ttcaccgttc cgttcaactg ctttcctcta atggagatac ttcatcaccg 1680 ggacgcaaaa tcaccgtcgg cactaaggaa gcccctgatc ctcccaacac agtcgagcct 1740 ctccagccag cgacctgcag ccgtcccggt cctgtgcttg gggtgggtga cggggtcttc 1800 cgaagtccga atgtcggcaa gtaccatccg caatccaatg cttccgcacc tgatagtaat 1860 cccatttcca gtgctgccgc tgctcaacaa actgcatccg acgtcataat ttactatcaa 1920 aatgttggag gaatgaacgg tctgatagac gactatcatt tagccttttt ggaccacggc 1980 tatgatgtca tagtcttgac cgaaacgtgg cttgattctc gaacagtttc gagctacatt 2040 tttggagaag gatacgaggt attccgctgc gatagaaata cgaataacag ctgcaaaacg 2100 gttggaggtg gcgtgcttgt gggagtccat acccggctta aagcaagtct aatcgaaaac 2160 gaatcctgga tttctgtgga acaagtttgg gtggctattc agcttagcga ccgcaaactc 2220 tatctatgcg tggtatacat ccctccggac cggactcgtg atatggatta catcaacacg 2280 cactgtcaat caatcgccgt cgtctcggaa tatgccaccc ctagtgacga aatgatcgtt 2340 cttggtgatt ttaacttgcc agacatcgca tggactccaa ctcacagtgg ttttctgtgt 2400 cccgattatc aacgcacttc gcttcacgtt ggtgctgtca atcttcttga ttgttacagt 2460 actgctactc tggctcagat caaccacgtg atgaacgaaa atggccgcta cctggatctc 2520 tgtttcgtga gtggtcgtga tactgctcct ttcatatcga cggcccctgc acctttggtc 2580 aaggttgttt ctcatcatcc gccgctagta gtcgccatcg aaaccaatgt tgttcgtgat 2640 ttcatctcta gtccattggt tgttgcatat gactttcgca aagctgatat ccatagtata 2700 gctgaattgt tgtccgagac agattgggat tgcattctag atataacgga tgtcaataat 2760 gcagcactaa ccttctctca cgtcctggca tatgtcattg agaggcacgt tcctaaaaaa 2820 gtcattcgga ccgattcacg tgttccgtgg atgaccaacg atcttctttt gctgaagagg 2880 aagaagaaag cagcgctcag acggtttacc aaatttcgca catggtcatc gaaatgctac 2940 tacgttagac tcaactatga atacaagcgt ttgagtcgtt tctgttttca tcgtcatcag 3000 agtaaaatac aacgccgact aaaattgcat cccaaatcct tctggaatta tgtgaacgaa 3060 cagcggaagg aagttggttt gccatcatcc atgacattca acggcactac tgggtctaac 3120 ctagaggaaa tctgcaatct gttctcaaac aagtttgcaa gtgtgtttgt aaacgaacaa 3180 ctttccgtcg attccatcaa ttccgcggcc aacaatgtgc ctttggctaa tcaaacgctt 3240 agcagcatac accttactca agaaacgatc tcgaaagcgg catcacagct caaaacttcg 3300 ttcaagccgg ggccagacgg tgttccagct gcttttgtga aaagaatgat cggcagcttg 3360 ttggaaccgc ttctcaaggt atttcagctt tcgctaacca gcggtacatt tccgacttgc 3420 tggaaaacag ccgaaatgtt tcccgtgtat aagaaaggaa gcaagcgtga cgttaacaac 3480 tatcgtggga ttacatcgtt gaatgctgta gctaagctgt tcgagctggt ggtcatggat 3540 tcgctaagcg cacactgcag acagtatttg agtgctgatc aacatggatt cactatcggc 3600 cgttctacta ctaccaacct gctatgcctt acgtcgtaca tcactgagag tatgaatgtt 3660 cgggcccaga cggacgtgat ttacaccgat ctatctgcgg cattcgataa gctaaaccac 3720 gaaattgcaa tcgccaaact cgataggctt ggagttggag gtagcctttt gagatggttc 3780 cgctcttacc tcaccgatcg tcaattagtg gtagctttag gagaatatcg atcaccctgc 3840 ttttacgcgt catcaggcat accacaaggc agtcatctgg gtccgttgat tttccttttg 3900 tattttaacg acgtacactt tgttgtggaa gggcgtcgac tgacgttcgc agacgaccta 3960 aaaatttttc tgcgaatatg ctcaattgaa gattgccgtt atctccagga ccagataaac 4020 gtcttcgctg gatggtgtga tctcaaccga ctagtcgtca atccagcaaa gtgctccgta 4080 atcacttttt cacggaagaa acagccgata atcttcgagt atagcatttt tgatacgcca 4140 atcgaacgag tgcagtgcgt taaggatttg ggtgttctgt tggattcgca actgacattc 4200 tcccatcata tcgcttatat tgtcgataaa gcatcaagaa cgcttggctt cgtctttaga 4260 actgcaaaga actttaccga catctattgc ctgaaagcgt tatactgttc cctcgttcga 4320 gcaacattgg aatatgggtc tgcggtctgg agtccgaact acaacaacgg atgcgagcga 4380 atcgaatcag ttcaacggag atttcttcgt tttgcacttc ggaggctacc ttggagagac 4440 cctttacgcc ttccgagtta cgagagccgc tgccaactga tcgacttgca acctttgcga 4500 attcgaagag acgtttgtag agctttaacc gtcgtcgata ctctaaacgg aagaatcgat 4560 tgcgaagcta tcctgcaaca aatgcatctg aacgtgcaac ctcgtccact gcggaatagt 4620 tcgatgatga ggcttccatt tcgtcgcacc aattatggat tgagcaatgc tgttcatggt 4680 ctgcaacgtg tttttaatcg tgtggcgtcg atcttcgatt tccatctgac acgaaatatg 4740 cttcgtcgta aattttctag ctttttcgct ggccggaggg actgaagaca tattttaagt 4800 ttgaaatgtt ttacgatctt gactttttta tattcaatgt tttgtcatgt ttgttttgtt 4860 atataatatg tatttttgtt tttattttta atatcattgg gactatacat agtctgttga 4920 tgtaaaccta aataaataaa taaataaata aa 4952 // ID Neptune3_Ren repbase; DNA; INV; 3664 BP. XX AC . XX DT 20-DEC-2006 (Rel. 11.12, Created) DT 20-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Neptune3_Ren is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like elements; reverse transcriptase; KW GIY-YIG endonuclease; Neptune3_Ren. XX OS Reniera OC Eukaryota; Metazoa; Porifera; Demospongiae; Ceractinomorpha; OC Haplosclerida; Chalinidae. XX RN [1] RP 1-3664 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Neptune3_Ren is a Penelope-like element (PLE) from the sequenced CC genome of the sponge Reniera sp. JGI-2005. It belongs to the CC Neptune group of PLEs. Its short ORF1 has a coiled-coil region, CC and ORF2 contains regions homologous to reverse transcriptases CC and to GIY-YIG endonucleases. The element appears to be low-copy CC and probably inactive, although intact copies may exist. It is CC related to Neptune2_Ren (~60% nucleotide sequence identity) CC Consensus sequence was assembled from trace archives. XX FH Key Location/Qualifiers FT CDS 664..1050 FT /product="Neptune3_Ren_1p" FT /translation="MKYKPRKMVVISHPSQDELNEAFMQDFKDIFLKHLEA FT AIDSNKTTLEIKKSRLQSLSNEQGTNHHQPIQQDRANISVSEATRPKPKSR FT ETTRAPQKRQLSSSSQQASKRQKTIKDFLVTGPKKPPDST*" FT CDS 879..3410 FT /product="Neptune3_Ren_2p" FT /translation="SKYISLGSNKTKAKVTRDNQGTSKKATLKLKSTSIKK FT TENNQRFFSDRPKETTGQYIDSNTIMHAQHSSIHNFSNFHLTTNHINILNR FT GLSFSPTQSFNPQDHVSFLTQYDLLSISLRNNALTSNSNNTVTISDVETEE FT EFLYRPMKFINKEKETVKIRENRFITDIPDLESFIEGTKILIDKLLKREYR FT TRKKSNLSNNERKALKNLKTTEITIKPADKSLGVVILNSSDYVHQCLEHLA FT TDTYARVNSFPSEDIKRKVKNIIISFKKDLEPHKRLYNYLFPNSSHRTPRF FT YGLPKVHKSLNERGIPPVRPIVSHTNSILSKTAGFIDHILQPVAQFFPDYI FT KNSTELITELENLTLKQDVILVTMDVINLYPSIPQKECIEIVHNEMVTHSE FT LVTSNPNLITQLLQLNMTNNFFEFADITFLQRKGTAMGAAFSPTVANIFMS FT TLLRKFLSSTSDRPLFMRRYIDDIFMVWPRNKDLDKFLSELNNFHPQIKFT FT TTQSDTAVNFLDITIYKNSNFTATGKLSTSTYEKENNLFQYLHFESNHSIS FT IYKGIIIGEAMRYVRTNSTEEKYKEQLSKFIQRLKERKYPESFIYKTLRNV FT SYKRRTLYLRKVTKAKPQIMNRPKMKCIPPPRYLQLKKIIINEFHRYNLSR FT YCNQPLFITLKNTTLKDLLVNSKHKPTEEDGRTIVAKTKEIKNHQSHNLNV FT IRKAEVKQPSKCNHNRCATCQHFNPNINFNSTSTRQTYRIRHSFTCSSSNI FT IYLITCEKCRKQYVGQTTKTLRERIYHHRSSIKTGQRRYISKHFNLEGHKI FT QHLKVQVIDTNENTETLNKLERYWIQKLDTMQPRGLNVTTD*" XX SQ Sequence 3664 BP; 1401 A; 814 C; 552 G; 897 T; 0 other; tgcatttgga agggttagct agcgctggcc cgttagacca attagaaagt ctcagtccct 60 ttctcctatt gattgaatct gactgtggat aaaaccagat atcacagagt cacaccaact 120 ttatctatct ggaagtccgt tggtcgctca gtccggcttg ctgggcactc ctggccactc 180 ctaccgttca tgccatctag ccattaaaca aaaacacact cacacactca acaacaacat 240 ccttcattag tgacacacaa gtaaatacat gtaagaccca aaaacaataa ggttaagcca 300 acaagttagg atacgccaac ctataatgtt ggaccacctt tagcatcagg ttaagtcaaa 360 tgtattacag gaacaacaac ctggcacatt tggaccacct acatttttaa taggcataaa 420 ctgtatacat atagtgtact acatccatgc atgcacaagc atatatttat taattgaaac 480 agcataggta catgtatgtg tgtaaacata cttaaaacaa aaaatgtagc atacaggcac 540 acaaaagcat cataaaatga tccatggtta tatgtacata tagatactct tgaaaaagaa 600 gttacagtgt tagaagcaac gattaatcat caacagaaaa tacttcatta taaaaccatc 660 cccatgaaat ataaaccaag gaagatggtc gttatcagtc atccctctca agacgagtta 720 aatgaggcct tcatgcaaga ctttaaagat atctttctaa aacacttaga agcagctatc 780 gacagtaaca agacaacact agagattaaa aaatctaggc ttcagtcatt gtccaacgaa 840 caagggacaa atcatcatca gcccatacaa caagatagag caaatatatc agtctcggaa 900 gcaacaagac caaagccaaa gtcacgagag acaaccaggg cacctcaaaa aaggcaactc 960 tcaagctcaa gtcaacaagc atcaaaaaga cagaaaacaa tcaaagattt tttagtgaca 1020 ggcccaaaga aaccaccgga cagtacatag acagtaacac catcatgcat gcacaacact 1080 catcaatcca taattttagt aactttcatt taaccaccaa tcacattaac atactgaata 1140 gaggtctttc attttctcca acacagtcat ttaatcccca agaccatgta tcttttttaa 1200 cacagtacga tctgttaagt atatcactca gaaacaatgc actcaccagt aacagtaaca 1260 atacggtaac aattagtgac gtcgaaacag aagaagagtt cctttacaga cccatgaaat 1320 tcatcaataa ggaaaaagaa acagttaaaa ttcgagaaaa tcgttttatt acagacattc 1380 cagatctcga aagtttcatt gaaggtacaa aaattctgat agacaaatta ctaaagagag 1440 aatatcgcac aagaaagaag agcaatttat ctaataatga aagaaaagct ctaaaaaacc 1500 tcaaaacaac agagataaca ataaaaccag cagataaaag tttaggggta gtaatactaa 1560 actcaagtga ctatgtacac cagtgtttgg aacacctggc tactgacaca tacgcaaggg 1620 ttaattcttt cccatcagaa gacattaaaa gaaaggtcaa aaacatcata atcagtttca 1680 agaaagacct ggaaccacat aaacgactat acaattatct attcccaaac agcagtcaca 1740 ggactcccag attctacgga ctaccgaaag ttcacaaaag cctgaacgaa agaggaatac 1800 caccagtaag accgatagtt agccatacaa actcaattct ttcgaagaca gcaggcttca 1860 tcgatcacat actacagcct gtggcacaat tctttccaga ctacatcaaa aactcaactg 1920 agttaattac agaactagaa aatttaacac taaaacaaga tgtcattcta gtaacgatgg 1980 acgttataaa tttgtatcca tctatccctc agaaggaatg catagaaata gtgcataatg 2040 agatggtcac tcactcagag ctagtcacaa gtaatccaaa cttgatcact cagcttttac 2100 aactgaatat gactaacaac ttttttgaat ttgcagacat aaccttctta caaagaaagg 2160 gcacagctat gggagcagca ttttctccca ccgtcgctaa tatctttatg tcaacactac 2220 ttagaaagtt cttatcttct acctcagata gacctttatt catgcgaaga tacatagatg 2280 acatatttat ggtgtggccc aggaacaaag atctagataa attcctgtct gagctcaata 2340 acttccaccc acaaattaaa tttacaacaa cacaatcaga cacagcagta aattttcttg 2400 acattacgat ttacaaaaat tcaaacttca cagcaacagg aaagctcagt acctcaacat 2460 acgaaaaaga aaataaccta tttcagtatc tgcattttga atctaaccac tcaatatcga 2520 tatataaagg cataattatt ggagaagcca tgaggtatgt gcgtacaaac tcgaccgaag 2580 aaaagtataa ggaacaactt agcaaattca tacaacgact aaaagaacgt aagtacccag 2640 agagttttat ttataaaaca ttaaggaatg tcagctacaa aagaagaact ctgtatttac 2700 gtaaagttac aaaagcaaag ccacaaataa tgaatcgacc aaagatgaaa tgtattcctc 2760 ccccacgtta tcttcagcta aagaaaatca tcattaatga atttcaccgt tataatttat 2820 ccaggtactg taatcaaccg ttattcatta ctctcaagaa caccacatta aaggatctac 2880 tagtaaacag caagcacaag ccaactgagg aagatgggag gaccattgtg gccaaaacaa 2940 aggagataaa aaaccatcaa tctcacaacc tcaatgtcat tagaaaagcg gaagtcaagc 3000 aaccaagcaa gtgtaaccac aatcgttgtg caacctgtca acacttcaac ccaaacatca 3060 actttaacag cacgtcaaca agacaaacat acagaatacg ccactccttc acatgtagct 3120 caagcaatat catatactta atcacctgtg agaaatgtag aaaacagtac gtaggacaga 3180 caactaagac actaagggaa cgtatctacc atcatagatc aagtatcaaa acaggacaaa 3240 ggagatacat aagcaaacac ttcaatctgg aaggccacaa aatacaacat ctcaaagtac 3300 aggtcataga cacaaatgaa aatacagaaa ccctaaacaa attagaaaga tattggatac 3360 agaagctgga tacaatgcaa ccaagaggtt tgaatgtcac tacagactaa catcactcaa 3420 taaaattatt aaaatattca ttgattcata atcatcacaa ttgttttttt aattacctgt 3480 tgattcatgc taacaacatt tttacctaat ttaacgcaac cacagaatct tggacagacg 3540 gggtccccat gggggtgggg actccgtccg tccctccccc catggggttt ttttctctct 3600 ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct 3660 ctct 3664 // ID Gypsy-216_AA-LTR repbase; DNA; INV; 189 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-216_AA_; KW Gypsy-216_AA-I; Gypsy-216_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-189 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1036-1036 (2011). XX DR [1] (Consensus) XX SQ Sequence 189 BP; 54 A; 39 C; 38 G; 58 T; 0 other; tgtaagcata agacgagaaa ttgtaccacc ctattgctga tgagtcatgg atcaccctat 60 tttgtgttag tttctccctc catgattgca tacgtgtcag ttgaattaga acgtaaagag 120 aaaaaggtcg tacggaataa accatcgttg ttcagtaacc tgtgtttttc ctccggatca 180 ttgaccaca 189 // ID R1-1_AP repbase; DNA; INV; 5868 BP. XX AC Contig61185; XX DT 19-AUG-2009 (Rel. 14.08, Created) DT 19-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE Non-LTR retrotransposon. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-1_AP. XX NM R1-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-5868 RA Jurka J.; RT "Non-LTR retrotransposons from pea aphid."; RL Repbase Reports 9(8), 1793-1793 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC May be 5'-truncated. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 782..2170 FT /product="R1-1_AP_1p" FT /translation="MSDNNEMEEEAEPSTSQAGSLQRSPPSLGGSPSHKKT FT KAGPAPAPKARVAIKWIRHTLDLASTRKSAMAVETQRNLFEKLEELDTAVT FT DLVIENLQLRSQVEEARRSAEICVGAAASQFGTELRLREAAHEQTLEAVVA FT RYAEKEAIRTHEAATKAREGEAAIAENIADRHEPDDQTFAQVTKRRQRQPR FT KEQPPVDRTAARSKSRNARRARLLDESRKEEHQPAFVIQPSDTAKASETSI FT GIWKAVMSKRIVPRCHTITTKQGKVIIKPQNKETADVLKSLARESNVLQEE FT ALMWPRFIIRGAPTSMPEDTLQEAILDQNPELGIESGCIDRVFKPVFKTGP FT RDRDDTNWVVEVNPMYYQNVRKIEHLYIGFMRCRVAGYDEVTQCHLCLRYG FT HPAAKCNETQQTCAHCGRKGHVAGECPAAEGEPSCSNCHGKHNAREKSCSA FT RTNHLVGRSRRTDYGKTQ" FT CDS 2170..5184 FT /product="R1-1_AP_2p" FT /translation="MSFPPLKIVQLNMGRAAAVSDQLLSYCQDTGVDIALV FT QEPYTNRGRLTGFEVAPIRCYLSMGTRRRGGPRHIDHGAAIIVLNPNLVVA FT SRDSGRVKNFVSIDLDCGAEGQWTMISGYFKYRVPTEIHVSALESLTEQNG FT NSLLIGLDANAFSTRWFSRINDRRGEVLSQFIDAQRLQVENKRSVHTTFRG FT PRGKTNIDVTLSSGQISTRVRDWTVLPGITSSDHQLIRYEVDAQPRRFVQP FT PPRYNVNRALVEQFKMEFHLRSEGRQCHQDIDSMATAIVEDITAAADLYIP FT KSSHRRKVKPPWWTDELLAARRNLRRAARRVDDTVTRAEYNVLRNSYTSLL FT RKCKIGSWRRFCTTEGKLPWGKLYRWMRQGVKPQSVPVLMMRPDGTQCQTL FT DESLTSLLNTLIPNDPDQEDPAPVESNEENWTETTEVELRTFAWSVAPNRA FT PGADCISGKMVRLLWQNLHPRLRSLANSCMRRAKFPKDWKAAVVVPILKGV FT GRDIRNPKSYRPVSLLPVLGKIVEKIINVSLRTQIEPRLTGKQYGFTPGRS FT TLDAIRNLLTWSRLNEEKYVLTVFLDITGAFDNLKWSALLEDLATLGVSEH FT TRSLIMDYLSGRTATLKIGGVSKTVRVTKGCPQGSILGPILWNVTMEALLR FT AEQPQYVNMQAYADDVAISVAGPTRASIIRRTEQALQPVLQWARSRGLSFS FT AQKSTAMITKGSLVHGFTLAFGDERIVTVPHTKYLGVTIDSDWKWDVHIER FT LTEDHDDIFSRLRGTMGAGWGIKRENLMTLYRGVFLPRVAYGVSLWVHAVG FT SEGNKKRLHRLQRRVLLGLTCAYRTTSTDALQVLAGVLPLDLELKWIAIKE FT DAKGLPEGVRRGTIHLAYEDIMDEWQSRWSNSHKGRWTHQCFPDVRARQTS FT PLAMGHEVAQFLTGHGNFRAKLAYFGRQPSPVCLCGAEDEFVDHVIFRCER FT HSAHRAHLELEVHRAGHLWPCEMETLVSSKRLYTALVRFAKTAAYYEQLD" XX SQ Sequence 5868 BP; 1492 A; 1533 C; 1716 G; 1127 T; 0 other; agtgtccact ttactgtgga gtccgaaaga cgtgcgttgc tacggttgat cgtgctccga 60 ggggcgccct cttgcggttt ttctgtgcag acgctgtaag tccctgattc ttcaggaatt 120 ttttccgttt ttggtgcgta ccgggttaat acaagtgcgc gaacagaccg gtgtcgattt 180 tgagtcggtc tgtccaaaac cctctctgtg gggaccgttt ttgtgtctgc ccataaggct 240 aacacggccg ccctgtggta tcagctggta tttgccgagc gatttttgtt ttttcgtcag 300 acacgagacc aggcgtcaag agcctgacga cgtgagttcg gtcattatat aacgtaatat 360 tacgtaatat cacttaaacg tgacccccca cctagtggct ggtataggta gtataccacc 420 ccctcccgaa aagtttaatt ggctataact tccaagggaa aaggggtaca gaccccggac 480 ccagggggaa ctcaattttt gagcacacca cttgttttcc cgtatttggg aacaaagccc 540 ccaccccccc ccctgatttg agttaagtat aactacgtgg gaaatcgccg caagagccta 600 ggggcaacgg ttatcggtta ccctaggtca gggccgcaat ccgacccaat agggaacggg 660 gtaccccccc accccccatt ttggtttttc gttttttggg cagaaatcgt ttagggaggc 720 tatctaaacc cataccccgc aggggaaact cggggtttcg aattcccccc ccaccagagg 780 gatgtcggac aataacgaaa tggaagagga ggcggaacct tccacgtccc aggcaggtag 840 tctgcagagg tctccccctt cattaggtgg atctcctagc cataaaaaga cgaaggctgg 900 acctgccccg gcgccgaagg ccagagtggc cattaagtgg atcagacaca ctttggactt 960 agcgtcaacg agaaagtccg ctatggcggt agagacgcag aggaaccttt ttgaaaaact 1020 ggaggagctg gatacggcgg tcaccgacct tgtgatcgaa aacttgcaac tcagaagtca 1080 ggttgaggag gcgaggaggt cagcggagat ttgcgtgggg gctgccgcat cacaatttgg 1140 caccgagctt cggctgaggg aggcagctca tgagcagacc ttggaggctg tggtggccag 1200 atacgccgaa aaggaggcca tcagaacgca cgaagccgcg accaaggcca gggagggtga 1260 ggctgccatc gctgagaaca ttgcggaccg gcatgagcct gatgatcaga cattcgccca 1320 ggtcaccaaa aggagacaga ggcaaccccg aaaggaacag ccaccagttg atcgcacagc 1380 ggctaggtcg aaatcgagaa acgctagaag agccagactc ctggatgaga gcagaaagga 1440 ggagcatcaa ccggctttcg tcatccagcc gtcagatacc gccaaggcca gtgaaactag 1500 cataggcata tggaaggcgg taatgtcaaa gagaatagtg ccgagatgcc acacgattac 1560 tacgaaacaa gggaaagtga taattaaacc acaaaacaag gagacagcgg atgtcctgaa 1620 gtctctcgcc cgggaatcca atgttctcca ggaagaagca ctgatgtggc ccagatttat 1680 catcaggggc gccccaacat ccatgccgga agacaccttg caggaggcca ttctggacca 1740 gaatccggaa ctcggaatcg agtcagggtg tatcgacaga gtgttcaaac cggtattcaa 1800 aacaggtccg cgtgacagag acgatacaaa ctgggttgtg gaggtaaatc ccatgtacta 1860 ccaaaacgtc aggaaaattg agcaccttta cattggtttc atgaggtgta gggttgcggg 1920 gtacgacgag gtcacacagt gtcacttgtg tctcagatat ggtcacccgg cagcgaaatg 1980 taatgaaaca caacaaacct gcgcacactg tggtcgaaag gggcacgtgg ctggggagtg 2040 cccggcagcc gaaggagagc cgtcctgttc taattgtcat ggcaagcata acgccaggga 2100 gaaatcctgc tctgcgagaa cgaaccacct ggtgggaaga tctaggagga cggactacgg 2160 gaagacgcaa tgagcttccc acctctcaag attgtgcagc taaatatggg ccgagctgcg 2220 gcagtcagcg accagctgct gtcctactgt caggacacgg gcgtggacat tgccctcgta 2280 caggagccgt acaccaacag aggcaggctc acaggtttcg aggtggctcc catcaggtgt 2340 tacctctcca tgggcacgcg ccgcaggggc ggtccacgac acatcgacca cggggctgcc 2400 atcatagtgc tcaacccgaa cctggtggtc gcgtcccgcg attctggccg cgtgaaaaac 2460 ttcgtcagta tcgacctcga ctgtggtgcc gaaggtcagt ggacgatgat cagtggttac 2520 tttaagtacc gtgtgcccac tgagatccat gtcagcgccc tggaaagcct cacggaacaa 2580 aacgggaaca gtttgctaat aggcctggac gcaaacgcat tctctaccag gtggttcagc 2640 cgaatcaatg atagacgtgg cgaagtgctc tcccagttca ttgacgccca acgccttcag 2700 gtagaaaaca agagaagcgt acacaccacc ttcagaggtc cccgcggcaa aaccaacatc 2760 gacgtaacgt tatctagcgg tcagatctcg accagggtac gcgattggac tgtcctgcca 2820 ggcatcacgt ccagtgatca ccagctgatc aggtacgagg tggacgctca acccagaagg 2880 ttcgtacagc ccccgcccag atacaacgtc aatagggcac tagtagagca gtttaagatg 2940 gagtttcacc tgcgatccga gggcagacag tgtcaccaag acattgacag catggcaacc 3000 gccatagtcg aggacatcac ggcagcggct gatttgtaca ttccaaagtc gagtcataga 3060 aggaaggtca aacccccgtg gtggacggat gaactcctgg cagccaggcg caacctacgc 3120 agagcggcaa ggcgtgttga cgacactgtc acccgagcag agtataatgt cctaaggaac 3180 tcctacacct cgctcctaag aaagtgcaaa atcgggtctt ggcgcagatt ctgtacgaca 3240 gaaggaaaac tcccatgggg aaaactgtac agatggatga ggcaaggtgt taagccacag 3300 tcggtaccag ttcttatgat gcgcccggat ggaacacagt gccagacgct ggatgagtca 3360 ctgacctcct tactgaacac gctgattcca aacgacccag accaagagga cccggcaccc 3420 gtagaatcaa atgaggaaaa ctggacagaa acgacggagg tagagctgag aacctttgcg 3480 tggtccgtcg ctccaaacag agcaccaggc gcggactgta tcagtgggaa gatggttcgc 3540 ttgctgtggc aaaacctgca cccgaggctg cggagcctgg ctaacagctg catgaggagg 3600 gcgaagttcc ccaaagattg gaaagcggca gtggttgtcc caatactcaa gggcgtgggt 3660 agggacatca ggaacccgaa gtcttaccgc ccggtcagcc tgctgccagt tcttggcaag 3720 atagtcgaaa aaatcataaa tgtgagcctc aggacccaga tcgagccgag actcacgggg 3780 aagcaatacg gcttcacccc aggcagatcc acgctggatg caatccggaa tctgctgacc 3840 tggtccagat taaacgagga gaaatacgtt ttgacggtgt tcttggacat aacaggggcg 3900 tttgacaacc ttaaatggtc agcgctcctg gaggacctcg cgaccctggg agtcagcgaa 3960 cacaccaggt cactgatcat ggactacctg tccggcagga ccgcgacgtt aaaaataggc 4020 ggggtgtcaa agacggtaag ggtcaccaag ggctgcccgc agggttctat ccttggccca 4080 atactttgga atgtcacaat ggaggcgctc ctcagagcgg agcaacctca gtacgtaaac 4140 atgcaggctt acgcggacga tgtcgcgatc agtgtagcag ggcctacaag agccagtata 4200 atccgacgca ccgaacaggc actccaacct gttctccagt gggcgcgctc aaggggactg 4260 agtttctcgg ctcagaaatc aacggcaatg attaccaagg gttccctggt tcacgggttc 4320 accctggcct ttggtgacga gcggatagtg acggttcccc acacgaagta cttgggagtc 4380 acaattgaca gtgactggaa gtgggatgtt cacatagaaa gattgaccga ggaccatgat 4440 gacatcttct cgagattacg cggcacaatg ggtgcaggat ggggcatcaa aagagaaaat 4500 ttgatgaccc tgtacagagg agtttttctc ccacgagtgg cgtacggtgt cagtctttgg 4560 gtccatgcag tgggttctga aggcaataaa aagaggttgc acaggctaca acgcagggtt 4620 ctgttgggac ttacctgcgc gtacagaact acttccaccg acgcactgca ggtcctagcc 4680 ggggtcttgc ctctagacct cgagctcaaa tggatagcga tcaaagagga cgcaaaaggt 4740 ctaccagagg gagttagacg gggaaccata cacttagcct acgaggacat catggacgaa 4800 tggcagagta gatggtccaa ttcacacaaa ggtagatgga cacaccaatg cttcccggat 4860 gtccgtgctc gacaaaccag cccgttggca atgggacatg aggtagccca attcctcacc 4920 gggcacggga acttcagggc caaattggcg tactttggca ggcagccatc cccggtgtgc 4980 ctttgcggag cggaggacga attcgtggac cacgtaatat tcagatgtga acgacacagc 5040 gcacacagag cgcatctgga actggaggtc cacagagccg gacacctctg gccatgcgaa 5100 atggagaccc tggtttcttc aaagagactg tacactgcac tggtcaggtt cgcaaaaact 5160 gcagcgtatt atgagcagct tgactgagac tgcgaaacag gacgggcaag acactctgcg 5220 agccagagtg ccaaaccccg catatgttct gaacgagtgt caaggagtga gaaggcagaa 5280 gcggttcata tgcggggccg tcctgagggg acactcgtgc gtcggggacg cgggcggcgc 5340 tgacgcggac aacagtggtt cgcggacaac cggaggcgcc gcggtgaccg aaagcttcgg 5400 cgagtaggtc acacctacat cgggccctgt gtctggggcc atggaagaat acagacacag 5460 tctaccccgc ttgtcgtacg aggcgactaa aggggtgggg ttgcgaccta cgcatcgcgg 5520 gcgagcgccc gcggtcaagg cccaccaaga gcctgctgag ccagcgaatc ttgggagacc 5580 gggcgtgccc tgtcggacat cccaccggtg atggcctacc taacgggggc gagaccggtg 5640 ccacgggaag ggggggagcc ccactgtccc gagttatgcc cgggccagtg cccctctacg 5700 ctgggcgggc gtcggcatgc cgtcgggcgc ccagtgccgg tttggtttcg tggtggctgt 5760 agcgggagct accgtatggt tcaagccaag gctggcaggc gaagtctcca gtgaatggct 5820 gtaccgccac gggggtgcga atccctttgg cgttgcagtt tctcctgg 5868 // ID BAMHI_AL repbase; DNA; INV; 677 BP. XX AC M35399; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE A.lumbricoides BamHI repetitive DNA. XX KW ALBAMH1; BAMHI_AL; BamHI repetitive sequence. XX OS Ascaris lumbricoides OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-677 RA Warren T. and Pasternak J.J.; RT "A related moderately repetitive DNA family in the nematodes RT Ascaris lumbricoides and Panagrellus silusiae."; RL Nucleic Acids Res 16(22), 10833-10847 (1988). XX DR GenBank; M35399; Positions 1 677. XX SQ Sequence 677 BP; 186 A; 158 C; 161 G; 172 T; 0 other; ggatccgagt aagtgtgcaa aaacagcatt atttatgtaa acgaagctca attacatttc 60 taagtgcaat tacggctgta tcacgggttg gcaactccat attccacgga aatccaccca 120 ttcaacgggt gcaattcccg tgagtatcgt aaaataggag agtgaaagct cagaatgcgg 180 ctagaatgtg tcatcttgtt gccaaatcgg agatatgtat cgtgtgaatt gacatgtatc 240 atgccaaggt aggtcggaaa ggccaaagaa aagcggaaac cagacggtcg gaaagtacag 300 aactcgattc ttgcgattgt gcatcttcga gttctggtaa gtgtaaatgc gagtccggtg 360 tctgatcgga tctgatcggc cagtgccgag gcttacacgt gactatcaca tagtctcact 420 ctttcactct tcccttttcg cgatttccga ttcagtgcta acaactcgac gtagacaccc 480 cactctttct cctgcgcatt cctatgccgg tcaccgattg ggtcgcaaaa tgccaaagga 540 cagggcatgt aagcccgcat cttaattgtt aagattcacc gatgaatcgt caaaaatttt 600 gcaaaagcta gtggaaaacg gggttttgag gcccgttcca ccggcaaacc gtcatcgtgc 660 gccgatcaga tggatcc 677 // ID L1-42_AAe repbase; DNA; INV; 4662 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-42_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4662 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1395-1395 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 154..1179 FT /product="L1-42_AAe_1p" FT /translation="MPRRENTFRVDYSQFPKQLSHDELHKFIGKELGLTRE FT HVLLLQPSRRLGCTFVEVNSLELAESIVQQHDNKHEFVYDGKAYKLRIMME FT DGAVEVRLFDLSNDVSNEHIAEFLSDFGDVLNIRDLLWDERSPFGGVKTGV FT RIARMVVKKNIPSLVTIRGEDTAVGYKGQRQTCLHCLEFVHVGIPCVQNKK FT LLVQKLTADQSYANVAKQPALLKKPAQPNTLLSNRSQSQTQPNLSRQPSSK FT PNEARAPKQVNSMAPPATPSLHKSPATQPIHQLTPISGPIGVISANRKTDG FT NETDNSQTSYASTSSIRTRSRRSPPGKKMRHSNSNSNSEQERGSGDDMHL" FT CDS 1249..4584 FT /product="L1-42_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLSVRLFAPSIWISSFCRRWKMISLPCPASTLSAMSI FT MRGEELRSPSKSTSDSPTSKKAWTEDWLPYECTIQHCVTCMPRLDLFSAPS FT GNVSTTTLSPARAAGGRGGAGPGRRRRPGRAPAPRAPAGRAPPRAPRRPPP FT PPAGGRRAPPRPRGRRRPAARGGAPGRGRAGRGYYLRHRTDHIILGGDFNC FT VIRLCDATGSNSSPALQSAIQQLRLLDVWQQLRSRESGYTYITHNSSSRLD FT RFYVSANLREHLRATSVHVCCFSDHKAVTARLCLPLPDRAPGRGYWSLRPH FT LLTTENIAELQISWQYWTRQRRYYPSWIEWWLSYAKPKFKSFFRWKSKLAF FT DAFHREQQRLYERLSQAYDGYHNNRNMITNINRIKAELLSHQRRFSEMFIR FT INETLLAGEPLSIYQIGERTRRKTTIDQLCDAHGAAIDDSDAIRNHMVEYF FT SNLYARENVDEEDEGRMFQCERIIPERDAVNEACMNDITTADILSAIRTSA FT SKKSPGPDGLPKEFYLRTFDVIHRELNLVLNEALRSNFPTQFVDGVIVLVK FT KRCAGNTARSYRPISLINFDYKILARILKRRIENVLRSHNILTAAQKCSNP FT EKNIFQATLAIKDRIAQLKANGRTAKLIGFDLDHAFDRVDHRFLFNTMHAL FT GFDTSMVNLLSRIAASASSRLLINGFLSSPFPIERSVRQGDPLSMHLFVLY FT LHPLLRRLEEACGGELIVAYADDISAIVTSVDQINAMRDLFRRFGRLSGAR FT LNEAKTTAINVGLINDPLNVPWLCTENTIKILGIVFTNSVVEMVHLNWDML FT VTNFSRQVWLHSQRGLSLHQKVTLLNTFVSSKMWYIAAHLAPTSAHVAKVT FT ATMRRFIFRGAPATVPMQQLARSKESGGLTLHLPIMKCPSLLINRHLHEIE FT SLPYYNSLLNQAIPLNANCLSDLPCLKSILKHFTNFPFQIRQRPSADLIHR FT YFIERTDRPKVETQNVNSNWLRIWRNISSRDLSPAYRSVLYMWVNQKFRHR FT QLFFKMRRSDGEQCTHCGARVETIQHKFFACPRVNDAFSLLQRKLTQATGG FT RRFDSDDLLRPSLERIGTRNKSIVLKLLVNYISFIDKCNARLDIEELNFTL FT DVGT" XX SQ Sequence 4662 BP; 1260 A; 1236 C; 1061 G; 1105 T; 0 other; agttagagct caacttccga gccgatcagt cgatgttttc ggacgagcga aattcggaag 60 tcgttcacgg actattctgt acgttattta tcgattgtat cgcacgtcgt tttcaatttc 120 acggttcaag tcgccgcgtc gccgtgtttc gcgatgccgc gccgcgaaaa cacatttcgc 180 gtcgattatt ctcagttccc aaagcaactt tcccacgatg aactccataa attcatcggg 240 aaagagctcg gtcttacgcg agaacatgtg cttttgctcc agccgagcag acgattaggg 300 tgtacgttcg tggaggtcaa cagcctcgaa ctcgccgagt ccatcgttca acaacacgat 360 aataaacacg aattcgtcta cgacggaaag gcgtataagt tacgcataat gatggaagac 420 ggagctgttg aagtgcgcct atttgatctg tcaaacgatg tctccaacga acacatcgcc 480 gaatttctct cggattttgg cgacgttctc aacattcgcg atctcctttg ggacgaacga 540 tccccattcg gtggcgtcaa aactggcgtt cgtattgccc gaatggtagt gaagaagaac 600 attccttctc tcgtgacaat ccgcggagaa gacacggcgg taggttacaa ggggcagcgg 660 cagacgtgtt tacactgcct agaatttgta cacgttggga tcccctgtgt gcaaaataaa 720 aaactgctgg tccaaaaact cacagcagac caatcatacg ctaacgtagc gaaacagcca 780 gctctgttga aaaaaccggc tcagccgaac accctactgt cgaacagatc gcagtcccaa 840 acacaaccaa acttgtcccg gcaaccgagt tccaaaccga acgaggcccg agcgcctaaa 900 caggtgaata gcatggctcc tccagcaaca ccctctcttc acaaatctcc cgccactcag 960 cctatccatc aactcacacc gatttcggga ccaatcggcg tcatctctgc aaatcgcaag 1020 accgatggta acgaaacaga caactcgcaa acttcgtacg cttcaaccag cagcatacgg 1080 acccgtagta gacggtctcc tcctggaaaa aaaatgcgac atagcaacag caacagcaac 1140 agcgaacaag aacgaggctc cggggacgac atgcatttgt aatggagcac accagcctca 1200 acatcggtac aatcaacatc aacacaatca caaactcaac caagctaaat gctctccgta 1260 cgtttattcg caccgtcgat ttggatatcg tctttctgca ggaggtggaa aatgatcagc 1320 ttgccctgcc cggcttcaac gttgtctgca atgtcgatca tgcgaggaga ggaactgcga 1380 tcgccctcaa agagcacatc agattctccc acatcgaaaa aagcctggac ggaagattgg 1440 ttgccctacg agtgcacaat acaacattgt gtaacgtgta tgccccgtct ggatctgttc 1500 tccgcaccga gcgggaacgt ttctacaaca acactatcgc ccgcgcgcgc ggccggcggc 1560 cgggggggcg cgggccccgg ccgccgccgc cggcccggcc gggcgcccgc cccccgcgcg 1620 cccgccgggc gcgcgccgcc gcgcgccccc cgccggcccc cccccccgcc ggccgggggc 1680 cggcgggcgc ccccgcggcc gcgcgggcgg cgccgccccg cggcgcgcgg cggcgccccc 1740 gggcgggggc gggccggccg cggttattat ctgcgtcatc gtaccgatca catcatcctt 1800 ggtggtgatt tcaattgtgt aattcgtctg tgtgatgcga ctggatccaa ctcaagcccc 1860 gcactccagt cggccataca gcagcttcgg ctgctcgatg tgtggcaaca gctccgatct 1920 cgtgagtcgg gatacacgta cataacacac aactcttcgt ccaggctcga tcgtttttat 1980 gtgagcgcta acctccgaga gcatctgaga gcaacgtctg ttcacgtctg ctgcttctca 2040 gaccacaaag ctgttacggc gagactatgc cttcctcttc ccgaccgcgc acccggtcgt 2100 ggatattggt ctcttcgtcc acacctcctc accaccgaga acattgcaga gctccaaatc 2160 agctggcagt actggactcg ccaacgccgt tattacccat catggattga atggtggctg 2220 tcttacgcta agccgaaatt caaatctttt tttcggtgga aatccaaatt agcgtttgat 2280 gcttttcatc gtgagcagca gcgtctttat gaacggctaa gccaagcgta cgatggctat 2340 cataacaacc gtaacatgat aacaaacatc aaccgaataa aagccgaact actttctcac 2400 cagcgtcgat tctccgagat gttcatacga ataaatgaaa cgctgttagc cggagagcca 2460 ctgtccattt accagatagg agaacgaacg agacgtaaaa ccaccatcga tcagctgtgc 2520 gacgcacacg gtgcggctat cgacgattcc gatgcaatcc gcaaccacat ggtagagtat 2580 ttttcaaatc tctatgcacg ggagaatgtg gatgaggaag acgaaggaag aatgttccaa 2640 tgcgagagaa taattcccga gcgagatgct gtaaatgaag cctgtatgaa cgacatcaca 2700 actgccgata ttttatcggc cataagaacg agcgcgtcga aaaagtcgcc cggacccgac 2760 ggattgccga aagaatttta cctacgaaca ttcgatgtga tccaccgtga actgaacctg 2820 gttttgaatg aagctctcag gtcaaacttt cctacacaat tcgtcgatgg cgtgattgtg 2880 ctggtgaaaa agcgatgtgc gggaaatacg gctcgatcat atcggccgat cagcttgata 2940 aattttgatt ataaaatttt agccagaata ctgaagcggc gcatcgaaaa tgtgctccgt 3000 tctcacaaca tattaactgc tgcgcaaaag tgttcgaacc ccgagaaaaa tatttttcaa 3060 gctactctgg caatcaaaga caggattgct caactgaaag caaacggtcg aactgccaag 3120 ctgatcggtt ttgatttaga ccatgcattt gatcgagtag accaccgctt cttgtttaat 3180 acaatgcatg ccctgggttt tgacacatcg atggtgaatc tgttgtctcg tatagcagca 3240 tccgcatcgt ctcgcttact aatcaacggc tttttatcat caccctttcc aatcgagcga 3300 tcggtcagac aaggtgatcc gctatctatg cacctttttg tgctatatct tcatcctctc 3360 ctgcgacgtt tggaagaagc gtgtggtgga gaattgattg ttgcgtacgc tgatgatatc 3420 agcgcaatag ttaccagtgt ggatcagata aatgcgatgc gagatttgtt cagacgtttt 3480 ggacgtttat ctggtgctcg tttgaacgag gctaaaacga ctgcaatcaa tgttgggtta 3540 attaatgacc cattaaacgt gccgtggttg tgtacagaaa atacgattaa aatacttggt 3600 atcgtattca ctaactcagt ggtggagatg gttcatctaa actgggatat gctcgtcacg 3660 aacttttctc ggcaagtttg gctacactca caacgcggtt tgtcccttca ccaaaaggta 3720 actctactga acacgttcgt atcatccaag atgtggtata ttgccgcaca cctagcgcca 3780 acatcagcac acgtagcaaa ggtcacggca acaatgcggc ggtttatatt tcgtggagcg 3840 ccggcaactg taccaatgca gcagcttgcg cgtagtaaag aaagtggcgg tcttacgcta 3900 cacttgccta taatgaagtg cccatcgtta ttaattaacc gtcatcttca cgagattgaa 3960 tccctacctt actacaactc cttacttaac caagcaattc ctctcaatgc aaattgtctc 4020 tcagatcttc cctgcctaaa atcaatcctc aaacatttca ccaactttcc cttccaaata 4080 cgccaacgcc cttccgccga tctcattcat cgatacttca ttgaacgcac ggacagacca 4140 aaagtggaga cacaaaatgt gaactcaaat tggcttcgaa tatggagaaa catatcgtca 4200 cgggatcttt cgcctgctta ccggagtgta ctctacatgt gggtgaatca aaagtttcgg 4260 caccgtcaac ttttcttcaa aatgaggcgg tcagatgggg aacaatgcac acactgcgga 4320 gcacgcgttg aaaccataca gcacaaattt tttgcttgcc cgagagtcaa tgacgcgttt 4380 tcacttctac aacgaaagct gacacaagct actggcggta ggcgtttcga cagtgatgat 4440 cttctgcggc cctctttgga aagaatcgga acacgaaaca aatcaatagt tttgaagctt 4500 ttagttaatt atatttcctt tattgataaa tgcaatgcaa ggttagatat agaagaatta 4560 aatttcactc ttgatgttgg aacttgaata attatgtaaa catttttaaa tccaattcga 4620 caaataaacg tacttataaa aaaaaaaaaa aaaaaaaaaa aa 4662 // ID Helitron-5_NVi repbase; DNA; INV; 5915 BP. XX AC . XX DT 16-APR-2009 (Rel. 14.04, Created) DT 16-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Helitron DNA transposon - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-5_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5915 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 766-766 (2009). XX DR [1] (Consensus) XX CC The consensus may be incomplete at both ends. XX FH Key Location/Qualifiers FT CDS 1357..5610 FT /product="Helitron-5_NVi_1p" FT /translation="MPRRRVVRTQEEEEIFQENRRERRAESQRRRRQAERN FT AIIDESRNEPVLRDHLGPMNISCTHCRAQHFIGEKVSNKGFSFNDCCSHGA FT VHLEPTPVFPTVLNDLFNGSHAKSNDFFQHIRVYNNSLSFASFNANLVNFA FT SRRSGPYCFKIQGQIYYQINTALYPEDNDSPSYGQLFIIDPLEAIYLRMEQ FT NSDLQYETLEILDKVIRENNIFAKSYEMMKQEINNQQTLINSDEPVPELQL FT LFALKPGTDARRYNFQRTNEVAAVFSTTADGDIPESYVTIRNKNTKTLQYV FT STMDPNVEPWIYPLFYPYGNQGWHQNLQCINRNNAGGNNRRVTRLAYTRYK FT IAIRPDEFNPFILGRRLFQQYVVDAYVKIEKDRIMYCKNHQKEIKADTYQG FT LHDYMQNSANDINGQVGKTIILPSTFIGSPRHMQQCYQDAMALINRKGKPD FT IFLTMTCNPKWPEIVENLLPHQQASDRPDIVARVFHLKKERLLDFIIKKNF FT FGEVAAYVYVIEFQKRGLPHMHLLITLAYGSKITTSEIVDKFISAEIPDAQ FT NPTLQEIVLKNMIHGPCGSWCQVDGKCSKKFPKHFVNETNMDENGYPNYRR FT RNTGILHKQTNGNEVDNRWVVPYCPILLEKFNCHINVEVVTSIRSVKYLYK FT YIYKGHDAATVTIGESNEGTLINHDEIRDFLEARYVGPVEAFYRILSKILQ FT DKSHAIIRLPLHLPYQHTVTINGKGNITQENLSSSSSMLLDYFKLNLENPE FT ARQYFYADIPSKFVHKKSKIDGLSVSSWQVRKQRFNCIGRMYSISPTQLEL FT FHLRLLLLHVKGATSFDYLKTVDGILHPTFTAACLALGLIEDDEEWIRAME FT EATVWMMPAQLRRLFVRILIHCQPVHPEELWDKFKDAMSEDFSRTNELTIS FT HQKAYAHINNLLNMEGRXLSDFPTMEQTVELNISYDETDEATLPQMAELGQ FT QQYENLNNEQKEIVDTVLHAVDLNDHDRNNCIYIDGPGGSGKTFIYTTLYN FT LLSSKNIKVCTMAFTGIAATLLPHGKTVHKTFGLPVPMYHDSSSSIKAQSK FT EGLFLKNADVFIWDEAPMAPRYALEIANRTLQHIMNNNLPFGGKIIILGGD FT FRQLLPIKIRGTRCETLNLCIKYSELWKYFKKFTLTTNMRVRPNEINFAKY FT LLEVGNGTINDLHDNIDXPEHCILDANNNIAYYSFGKLIEEKRFEDMSKCA FT ILSARNIDVDEINKQVTNLLDVTSEHVYTAIDSTENCNNGELDEVLLPEYL FT NSLNPTSLPPYELRLRKHCIVMLIRNISINEGLCNGTRLRIIDFSNHLLKC FT IILTGDKAGRIVFINRITLYCENDYPFTFKRKQFPIKIAFAMTINKSQGQT FT FHKITIDLRANVFNHGQLYVAMSRVRSWDSVKIYLGRQRQGVQVKNYVYKE FT LYL*" XX SQ Sequence 5915 BP; 2188 A; 887 C; 889 G; 1949 T; 2 other; caagtaatca tattatatcc aaaatattgt tttgactcat ttaaaaaata tcaatgctag 60 agggttagaa ttttgcatag taatttgttt tcacttacta catcaataaa tcaagttatc 120 attattgtaa gtgatataat tttgattagt ttatcagtac ctgtatgaca attttgacgt 180 taaagccatt gagttcaatg tgtcagtatg aaccaagaca attaaataga aactgccgtc 240 gtagcctaat tctacacaat tttgcaagtt tctgagcaaa gacacagatg tctaacttta 300 atagagctga tgtaatacca ttcaagctcg aatattgact atgactggat ctattcactc 360 ataatagtca atattcacac tttcagagat tctctggcta tcttggacac acaaatatta 420 tcatctgcat tatataatgt ttcgcaaaag aaaaatcaca tcagaaatca gaaaagatac 480 atcagaatag taaacttcac agagccctga gtgtgcggct attaatttac atacatgtag 540 gatcaggttg ccaattttaa cacatacttc tttaaaatac aacaaaacgt gtgactttat 600 gtagaatcta actaattcta atttttatta gattaattta aagctaagaa tacatatatt 660 ttgtttcaag atttgaaaat attaaagtat aacagaaaaa ttaataatgt atgcgattca 720 caattatgta acctcggtat ttatattttt tattgaacat tgacaattat aactattttt 780 aatttttgtt tgtttcaggt tttctaggat acatttgaag ctgttacaac acttacttct 840 ttgtttacta ttatttaatt aaaaaaaatg taaacatttt ttgtatcatt aattttttat 900 taaagataaa gacaaatcat ttataataat attgtgtcaa aatttattta cgtctaacct 960 attacaataa cttaagacct cctattacac cttcacatct tcaacacttg aacttaaaaa 1020 aatttaattt gaaaacttga acttgaaaat ttgaacttca taacttgaac ttgaaaactt 1080 aaacttgaca acttgaattt gtaaactgaa acttaaaaca ttacaattta ctaaaactga 1140 aaataaactt tgatcaaaaa ctaatcttca aattgcaaat aactttcaac tgaaaatatt 1200 acttacgaac taaaattatt aataaaattt ataaatagaa tcttattttt caaattatat 1260 ttcattacct ttatacttat tttgtatttc ttattttagc aactttgtat taatttaact 1320 aaatttctat caaatattaa taattaattt acaacaatgc ctcgccgaag agtagttcgt 1380 acacaagaag aagaagaaat atttcaagaa aatagacgag aaagaagagc tgaatcccaa 1440 agacgtagac gacaggctga acgtaatgct attattgatg aatccagaaa tgaacctgta 1500 ctccgggatc atttaggacc aatgaatatt tcatgcacac attgtagagc acagcatttt 1560 attggagaaa aagtatcaaa taaaggattt tcttttaatg attgctgtag tcatggggca 1620 gttcatttag aacctacacc tgtatttcct acagttctca atgatttgtt taatggatct 1680 catgctaaat cgaatgactt ttttcaacat atacgtgtct acaataattc attgtccttt 1740 gcttctttca atgctaactt agttaatttt gcttcaagac gttcaggacc atattgtttc 1800 aaaattcaag gacaaatata ttatcagatt aacactgccc tatatccaga agacaatgac 1860 agtccatctt atggacaact tttcattatt gatccactag aagctatata tttaagaatg 1920 gaacaaaatt ctgatcttca gtatgaaaca ttagaaattt tagataaagt aataagagaa 1980 aataatatct ttgctaaatc atacgaaatg atgaaacaag aaataaataa tcaacaaacc 2040 ttaataaact ctgatgaacc agtaccagaa ttacaattat tatttgcatt aaaaccagga 2100 actgatgcac gaagatacaa ttttcagcga accaatgaag tagctgctgt tttttctaca 2160 acagctgatg gcgacattcc agagtcttac gtaactattc gaaacaaaaa tacaaaaact 2220 ttacaatatg taagtacaat ggatccaaat gtagaaccat ggatttatcc attattttat 2280 ccttacggca atcaaggatg gcatcaaaat ttacagtgta taaatagaaa taacgctggt 2340 ggtaataatc gacgtgtgac ccgtttagct tacacaagat ataaaatagc tattagacca 2400 gatgaattta atccttttat tttgggacgt cgattatttc aacagtatgt tgtagatgct 2460 tatgtcaaaa ttgaaaaaga tagaattatg tattgcaaaa atcatcaaaa agaaattaag 2520 gcagatacat atcaaggatt acatgattac atgcaaaatt cggctaatga tattaatgga 2580 caagttggaa aaacaataat tcttccatca acattcattg gttcacctcg ccacatgcaa 2640 caatgttatc aagatgcaat ggccttaatt aatcggaaag gtaaacctga tatctttctt 2700 acaatgacat gtaatcccaa atggccagag atagtagaaa atcttttacc tcatcaacag 2760 gcttctgata gaccagatat tgtagcccga gtatttcatt taaagaaaga acgtttatta 2820 gattttatta tcaaaaagaa tttttttggc gaagtcgcag catatgtata tgtaattgaa 2880 tttcaaaaga gaggtctacc ccacatgcac ttattgataa ctttagctta tggttctaaa 2940 ataaccactt cagagattgt tgataaattt atttctgctg aaattcctga tgcacaaaat 3000 ccaacattgc aagaaattgt tttgaaaaat atgatacatg gtccatgcgg tagttggtgt 3060 caagtagatg gaaagtgttc caaaaaattt cctaaacatt ttgtaaacga aacaaacatg 3120 gatgaaaatg gttatcctaa ttatcgcagg agaaatactg gaattttaca taagcagact 3180 aatggtaatg aagtcgataa tagatgggtc gtaccatatt gtcctatttt attagaaaaa 3240 tttaactgtc atattaatgt agaagtagtt acaagtatta gatccgttaa atatttatat 3300 aaatatattt ataaaggtca tgacgctgct actgtaacaa ttggtgaatc taatgaagga 3360 acattaatta accacgatga aattagagat tttctagaag ctcgatacgt tggacctgtt 3420 gaagcatttt accgcatact tagtaaaata ttacaagata aaagtcatgc tataataaga 3480 ttgccattac atttaccgta tcaacatact gttacgataa atggcaaagg aaatattaca 3540 caagaaaatt taagttcttc aagtagcatg ttactagatt attttaaatt aaatttagaa 3600 aatccagaag caagacagta tttttatgca gatattccta gcaaatttgt tcataaaaaa 3660 tctaaaattg atggtttatc ggtttcaagt tggcaagtcc gtaaacaacg ttttaattgt 3720 attggacgta tgtattcaat tagtccaact caattagaac tatttcattt gcgtctattg 3780 ttattgcatg ttaaaggtgc tacaagtttt gattatttga aaactgttga tggtatccta 3840 caccctacat ttacagcggc atgtttagct ttgggtctta tagaagacga tgaagaatgg 3900 attagagcca tggaggaagc aaccgtgtgg atgatgcctg cgcaacttcg acgattgttt 3960 gttcgtattt tgatccattg ccaacctgtt caccctgaag aattatggga taaattcaaa 4020 gatgcaatgt ctgaagattt ttcacgaaca aatgaattaa ctataagtca ccaaaaagct 4080 tatgctcata ttaataactt attaaatatg gaaggtagar ctttaagtga ttttcctacg 4140 atggagcaaa cagtcgaact aaatatatca tatgatgaaa ctgatgaagc aactttgcca 4200 caaatggcag aacttggtca acagcagtat gagaacttaa ataatgaaca aaaggaaatc 4260 gtagatacag ttcttcatgc agttgatctt aatgatcatg atagaaataa ttgcatatat 4320 atagatggtc caggaggatc tggaaaaact tttatttaca ctactctgta caatttatta 4380 tcatcaaaaa atattaaagt atgtacaatg gcatttactg gtattgctgc aacattactc 4440 ccgcatggaa agacagtaca caaaacgttt ggcttgccag ttcctatgta tcacgattca 4500 tcatcaagta ttaaagcgca atcaaaagaa gggctatttt taaaaaatgc cgatgtgttt 4560 atctgggatg aagctccaat ggctccaaga tatgctttag aaattgcaaa tcgaacattg 4620 caacatatca tgaacaataa tttgccattt ggaggaaaaa ttattatact aggaggagat 4680 tttcgacaac ttcttccaat taaaattaga ggaactagat gtgaaactct gaatctttgc 4740 attaaataca gcgaattatg gaaatatttt aaaaagttta ctcttaccac aaatatgaga 4800 gtaaggccaa atgaaattaa ttttgctaaa tatttattag aagttggtaa cggaacaata 4860 aatgatttac atgacaacat tgatsttcct gaacattgca ttcttgatgc aaataataac 4920 attgcttact attcattcgg taaattgatt gaagaaaaac gatttgaaga tatgagtaaa 4980 tgtgctatac tgtcagctcg aaacattgat gttgatgaaa tcaacaaaca agtcacaaat 5040 ctattggatg taacgagtga acatgtatat acggctatcg acagtacaga aaattgtaat 5100 aacggtgaat tagatgaagt gcttttgcca gaatacttga attctctaaa tccaacaagt 5160 ctacctcctt atgagttgcg cttaagaaaa cattgtattg taatgcttat tagaaatata 5220 agtattaatg aaggtctatg taacggtaca cgattaagaa taatagattt ttctaatcat 5280 ttattgaaat gtataatttt aactggcgac aaagctggac gtatcgtttt cataaaccga 5340 attacattgt actgtgaaaa tgactatcct tttacattta aaagaaaaca atttcctata 5400 aaaatagctt ttgcaatgac catcaataaa tcacaaggcc aaacatttca taaaattact 5460 atagacctgc gagcaaatgt ttttaatcac ggtcagcttt atgtagctat gtccagagtt 5520 agatcttggg attctgttaa aatatatctt ggaagacaaa gacaaggagt acaagtaaaa 5580 aattacgtat ataaagaatt atatctataa cttcctttaa gacaaatttt tttgtcatta 5640 caaaaatttt aataaatatg cttacaattt gttttacttt ttcaatgata ggcattaatg 5700 agtattttat tgcataatta tttatacaga gttacaactt caagatttaa tacgtgaaat 5760 aattaaatat aattatatct attgtcttta ttattttaaa ttatctacgc actgaaaatg 5820 ttctgttttc atgattgata tatgtttata tccaaaactt aaacatgccg cgcgcaacgc 5880 gcgcgctaaa ttcctagtga atcataaaaa ctgag 5915 // ID Gypsy-2_OD-LTR repbase; DNA; INV; 189 BP. XX AC CABV01000624; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_OD_; KW Gypsy-2_OD-I; Gypsy-2_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000624; Positions 55453 55641. XX SQ Sequence 189 BP; 61 A; 57 C; 37 G; 34 T; 0 other; tgcagcgaag gatgctgagc ttgcaatcaa caacgacccg cagcctggca acgaccggcg 60 ccgcgctgct cctaattaaa cagtaactgt acaaaacccc ccttctcact gcaacaatct 120 cagaacacgc ataagtcccg aacagggcaa cataaaaaga gtctttattt aaacgctagc 180 atagcgtca 189 // ID Gypsy-69_CQ-LTR repbase; DNA; INV; 330 BP. XX AC AAWU01038780; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-69_CQ_; KW Gypsy-69_CQ-I; Gypsy-69_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-330 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 518-518 (2011). XX DR Genome; AAWU01038780; Positions 715 1044. XX SQ Sequence 330 BP; 90 A; 90 C; 74 G; 76 T; 0 other; tggacctgcg atatttgagc aactggaaga gctcaatcgt cagcagcagc agctagcaac 60 agcgttggag actcttcgag aaggctaggt cagcacagtg tcacccgaca cttcctgttg 120 atcgacgacg gcatgagcaa cggcatcgga caccaccagc aactcagcac ccagccagca 180 ttcctgcctg cgtagggact ttagggacta gcgcaggcct ctgtataatt tcattttgct 240 tctaataaat cactcagttt cattacctcg acgtgagcgg accttcagtc cgggattcaa 300 aattttttaa aaaaacattc cgaactacca 330 // ID DNA8-43_AP repbase; DNA; INV; 869 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-43_AP. XX NM DNA8-43_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-869 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1973-1973 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 869 BP; 298 A; 100 C; 90 G; 381 T; 0 other; tagggctggg atttgtatgt aaatacatat ttttagtaaa acatgataaa ttaattattg 60 caaatgataa attcgtttca cgtgatttgc aataaaataa gacatatttt attttacata 120 tttttacata cttcgtaaaa ttttcgcttt tttttacata tttcgtaaat tttgacaatt 180 tcggtggctt agtaatattt tacatttata tttcgataat ggtactcggc aaattgtaag 240 tgtataatgt atataggaaa taagtaggtt aaaataaata ttaataactt gattttagtt 300 tatcttatct gtttctgttc gataaccata ccttaatgat cttagattat actttatgtt 360 caacaattat ttgacaaaat atacttacga atttacaaca gtacctatca gactatcagt 420 tcttgtgttc attcatttga ttcaaaatgg tcctatttgg gtttctattg atgaatctac 480 tgactaagtc agtgttaaga ccaaatcgcc gtacatttaa ttttgaaaat ttaaaacaat 540 atatggtttg tcattgtttc actttaaatt gattaaacat caattgtatt tcaatttaac 600 aacatatttt tttttactat tgcatatttt ttaatttttt accacaaaat attatatttt 660 ttaaattaaa ttaattagac ataaaaaaca ataaattgta cctgttattc aaaaattatt 720 tagaattgta ttttaattta aaactttaac atatttttta aatttttctt agtacatatt 780 ttttatttta tactacatat tttggcaatt ttaattacat atatatgtac atatttttgg 840 ttttttatta catataaatc ccagcccta 869 // ID hAT-76_HM repbase; DNA; INV; 4970 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-76_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4970 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 416-416 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2349..4199 FT /product="hAT-76_HM_1p" FT /translation="MNIFKVRCAKISGASLIRRKENGSTGREIIKCCADVV FT REKCAEILKLCNFFSILSDGSQARKTGKEKVLVLVRTGRNGIPIYMVTELL FT DMSLFGGSDSNSLTTGINSVFESDKSLFRLSKEHYTNKVICATADGASVNF FT GAHIGVLSQMQQNRPWLIKIHCINHRIELAIKDAFNEIPDFAQIDEFYLAN FT YYLLRNSGKLKAEVEAASKCLGITHYNLPKITGTRFVGHRRKGVFNMLETW FT PAFITAYQNYASEQTNKNTVSKVKGLLRSFKCHSFLYKVGLYLDCLELIIP FT ASKIFETNELLPHEIPATINRTILEIEDKILSIGKDDEFLDSYINRYMVTN FT NSEVEGSFAKAGDKRKHADNRTYVNVMLEMNSFVQNESIEQIRQIKRTILS FT KLSLLLSSRFKDYSRELILYRSMKFIDPIHWIKEDSDSGVQEINYISDHFE FT TPLKNAGYQRDVALREWKKVKLLVRTEFECHPVRSLWEIIFKKFIIEFPNV FT CCLVSLVMCISGSNSQVERTFSTVTNILTDKRLSLSHEALADCVVIHGNHS FT LWTKEDYNDLIERSLAKYMKKRRKCIIVDDEKTQLSLEDYDSLIETRSDED FT IDSEDDILTDNLRELYIK*" XX SQ Sequence 4970 BP; 1900 A; 657 C; 748 G; 1657 T; 8 other; caggccttgc gataaagcgt cgggcgatgt gaattttacg tcgtccaagr gcgattttgc 60 gtgtagctag ggctatttgc gtcgagcaaa aaattttttg tttgagtata tttaaagata 120 tttttgagcg gaattttaat gaagtaattt acgattattt taacatttaa taaagtattt 180 taaaaaccgc acgctgttga tataaaagtt tgaattacag aaaaattaaa tatgctcatt 240 atttacaagt agaaaaaatt cagattaaaa aaaaagttta aatcttttta agctaacatt 300 ttaataaaat tattcagcaa tagcaaagtt gtcgataaaa atgaacaaaa atactttctt 360 ttagttttag atttttaact taaataatta ttttacgggg gtttgttagg ttttaaaatt 420 attccataga tttaaaaata tttattaact gttagacgta tttaaagtat ttaagagtaa 480 tttcgaggaa atattttaaa aattggtaag cgcaaataaa aaaaaaataa aaggattaat 540 attaaaacgt attctacttg attcaataag aaagtacagc tattaaaatt tttgtaaaaa 600 gtaactcgat aaaaattact taaataataa caatcgcttt tttgttaaaa gttaccgaaa 660 ctcaacaaag cgtgctcaag aacagctgct gaaaaataga acgtcaatag tttaataagt 720 ttatataacc tggttttgtt acccggtttc taccgatgta aagcgaattt ccacaaacac 780 cctgaaaaag gctcaataaa cgtaaaaaat aaaataaaaa ttaatttgca acctgtaatc 840 ataagccatt tcataaaact taaaatataa aatatatcta aatgaggaca tatataaaac 900 gtgcattaat tttttttctt aaaaaaaaaa aaaaagttat tttactacta ctgttagtgc 960 aagtagattt tatgttatat tttttagtag aatttcattg tataaaatat cttgcattta 1020 tatacataaa aaaaacaaat ctttgctaga gtctcttttc tttaaataat ttatttcatt 1080 acttcaaata ttacacatat taacaagaat tataaaggat ccagacaaga aattgtttaa 1140 agcaaatttt tgaatcaaaa cgtttagtta aaaatccact acagcgcatt aggcaaataa 1200 attgaaaaac gyaacaattt ttttttataa accaagttag ccgttatttt gaaaaaagtg 1260 tcaagcatga cttccactta taacatttct gtaatatcac aatggaggtt ttcttaggta 1320 cgcgcacgct gtggaaccca tgaaataaat ttttattacc tttttgtttt gtaataatat 1380 ttggttaatg caaccaaaaa taattagaaa acaaaaatta accttaaatt atcaataacc 1440 actttactaa tgactattta tttttctata taaatggaga caaactttaa aaaatataca 1500 aggtataaaa aactcttcct attcttaaca aattatttaa agttaaaata tggatagcat 1560 tttgaacaac aaaagagtca gcttgaaaac ttttgaaagt tgggaggttt cggatctttt 1620 tcacatcgaa actatcaaag aaaatgagaa gacttatgtt gttaaaatcg tatgtaaatt 1680 gtgtgctaaa catagaaata ctattttatc aaaaaccatt ggtgcagtaa aagcttctac 1740 tttacggtat atcaatggca caagttgtat taaaaagaat ggtgtttcaa gacatatatc 1800 aggaacggct caccagtatg caacgcaatt agaaaataaa ggatgcaagg aaaacaaagg 1860 agcaaaaagt ataaaggtta aaaatgtttt atttagaaaa tttgttccaa tttaaataca 1920 attgcagttt atatttaatg cacattaaag cataaaatct taaaaacaag aaataagtaa 1980 ttttattttt taaattaaaa tggataaaat cactttttat tataaaatac atatacaatt 2040 aacttcatta acatctatct ctaaatttat attatttcta taaatgacct caggatgatt 2100 ccaaaaaatc gtcaaaaagt atactttaca accaattgac gttaaaagaa acacgcatag 2160 aaaattccta tcgttacctg gtaaaaattg cttatgaaat ggcgtgccgt ccaacaatac 2220 ctcatactca tttttctgtt cttgtaaggt ttaaacttgc ttttaaatgt gtgaaataaa 2280 attataaaac ttaatttttt gtytttggct acgtgtaatg actttttttt ataatttttt 2340 actgtttaat gaacatcttt aaggtgaggt gtgcaaaaat aagtggcgca agcttgataa 2400 gacgcaaaga aaatggwtcc acaggcagag aaataattaa atgttgtgca gacgtcgtcc 2460 gtgaaaaatg tgctgaaatt ctcaagttgt gcaacttttt ttctattctt agcgatggta 2520 gtcaggctag aaaaacggga aaggaaaaag tactagtttt ggttcgaaca ggccgcaatg 2580 gaatacctat ttacatggtc acggagttgt tggatatgtc cctattcggt ggtagtgatt 2640 caaattcatt aactactggt ataaacagtg tatttgaatc agacaaatca ttatttagat 2700 taagtaaaga acattatacc aacaaagtta tatgtgcgac tgcagatggt gcaagcgtaa 2760 actttggtgc acatattggt gtactgtcac agatgcaaca aaatcgtccc tggttaatta 2820 aaatccattg tattaatcat agaattgaat tggcaataaa agatgcattt aatgaaatac 2880 ctgattttgc acagattgac gaattttatc tagctaatta ttacctatta agaaattcag 2940 gtaaattaaa agcggaagtt gaagcagcat caaaatgcct tggtataacg cattacaatt 3000 taccgaaaat aacaggaacg cgatttgttg gacataggag gaagggagtt tttaatatgt 3060 tagaaacatg gccagctttt attacggctt accaaaatta tgcaagcgaa cagacaaata 3120 aaaacactgt atctaaagta aaaggcttgt taaggtcatt taagtgtcac tcgttccttt 3180 ataaagtagg tttatatctt gattgtttgg aactaattat tcctgcatct aaaatttttg 3240 aaactaatga gctattacct catgaaattc ccgcaacaat caatcgtact atattagaga 3300 ttgaagacaa aattttatca attggaaaag acgacgagtt tttggacagt tatataaacc 3360 gctacatggt caccaacaat agtgaagtcg aaggttcatt tgctaaagct ggggataaac 3420 gaaaacatgc agacaatcgt acatatgtta acgtaatgct ggaaatgaat agttttgttc 3480 aaaacgaatc aattgagcaa attcgtcaaa ttaaaagaac aattctttca aaactttcat 3540 tgttactatc aagtagattt aaagactact ctagagagct tatactttac agaagtatga 3600 agtttattga cccaattcat tggataaagg aagattcaga ctccggagtg caagaaataa 3660 actatataag cgatcatttt gagacaccat taaaaaacgc tggttatcaa agagatgttg 3720 ctcttaggga gtggaaaaaa gtaaagctgc tagttcgaac tgagtttgaa tgtcatccag 3780 ttcgtagttt gtgggaaatt attttcaaaa aatttattat tgaatttcca aatgtatgtt 3840 gcctggtatc gctagtcatg tgcatatctg gctcaaactc gcaagtagaa aggacattta 3900 gcactgtcac aaatatttta actgataaac gactctcctt aagtcacgaa gcgcttgcgg 3960 attgtgttgt tattcatgga aaccatagtc tttggactaa agaagattac aatgatttaa 4020 ttgagcgttc tctcgcaaag tacatgaaaa agcgtagaaa atgcattatt gtcgatgatg 4080 aaaaaactca actaagctta gaagactatg attcgctaat cgaaacaaga agcgatgaag 4140 acattgatag cgaagatgat attttgaccg acaatcttcg agaactatat ataaaataaa 4200 aaatttagta taaaagttta tttataaaat aaaattagtt ctcatattta accacgtaat 4260 gttttctttt ttttatttta gcgcactttt tattaaatat tgttcaattt gcaaattttt 4320 tcgctgaaat atgtctttta tacctataac atttttattc taacattttg ttttatttta 4380 ctagaaataa aattccttgt aaacattaaa aatctattca aaacgttcgt taaagttttt 4440 tagtaatacg agtgattgtg cttatcaacc gctttatgaa aataaataaa aataaataaa 4500 taaaaaaatr ttatataaaa tcgctgataa aaatattagc tattgcacag tttttaatcg 4560 tacaatatta aaatttccag tttattacty gctgactttt taaaaagaaa aaatatttat 4620 ttgaaaaact ttaaattaca tatttgagaa aaaaaaaatc aaatattagc atattttata 4680 atctttttta aatgtttagt tataatgttt tttattcgtt tagtatagac aaaacgtgtc 4740 aaacgtcgag aaaataaaca aatgttttta cattttaaaa aacgaaacat tccaaaacat 4800 ttttaaataa taacatttct acattaatga tttaaagtag ttttgcggta attaagtaaa 4860 ttggtgcagt taatgtcgat aaatatgaaa agcaaataaa aagaatttaa tgacgtcaaa 4920 aatcacgtcg tgcttatgcg tcgcgcagtt yttattcatc rcaaggcctg 4970 // ID Gypsy-69_AA-LTR repbase; DNA; INV; 157 BP. XX AC supercont1.165; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-69_AA_; KW Gypsy-69_AA-I; Gypsy-69_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-157 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.165; Positions 1823005 1823161. XX SQ Sequence 157 BP; 56 A; 38 C; 17 G; 46 T; 0 other; tgttatactg tcttgaattc gaacactcct agattttata atatgtacca gccctattga 60 taaaatagct cccataccta ttacacgtac aacaacataa acggaaataa agcaatcatt 120 gttcaaccac gaacccactc atgtcatttt gctaaca 157 // ID DNA8-88_AP repbase; DNA; INV; 620 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-88_AP. XX NM DNA8-88_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-620 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2024-2024 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 620 BP; 254 A; 97 C; 71 G; 198 T; 0 other; gggaacggat ttaaatgctc taaaaaacaa gaaaaatgcc ttaaaaaaat acgatttaac 60 gtcaaaaaat gcactaaaaa aatgccctaa aaaactttaa aaatgtattt atattcaata 120 aaaacacaat tttgaattta aaacttataa aaaaaacaca ttattatgaa aaatctgata 180 aaaaatattt aaatacaata aaaccttaat taattaactc tcaattatct tctgtccatc 240 ttctttttac cctaaaaaat taataggtat acaatttaga taggtatatt cgttatactt 300 ttaccttttg tttcatagca ctgtataatc aggtgctgct taatattctc aaacagaaat 360 cttcgacggt tatcggccag tgttgttttg tatctcgaga aacttcgttc cacgtcaact 420 gacgtcatag gtgcaaactt catacgaaaa aatgttgaaa aaatgactta aaaaccataa 480 aaatgcacta aaaatgtcaa aaaatgcaaa ataaaagtta cattttttaa ttccttgatg 540 tatgaaatga actttttgct tggaaaccaa tgttttcgga cattcagaaa aaaatgcaat 600 ttgcattcaa atccgttccc 620 // ID Gypsy-18_HM-LTR repbase; DNA; INV; 388 BP. XX AC 1101284919122; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Hydra magnipapillata genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_HM_; KW Gypsy-18_HM-I; Gypsy-18_HM-LTR. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-388 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Hydra magnipapillata genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; 1101284919122; Positions 7668 8055. XX SQ Sequence 388 BP; 152 A; 47 C; 78 G; 111 T; 0 other; tgttacaaag attactaacc attcaagcgg aacgcttggg ttaggtgaac cgtgaagatg 60 atttaacata aacaataaaa ataaaacaat tcgttcaaga accaaaacgc gtctagcgtc 120 tgtggaaatt aaataaaaac gcgaagaaga cgagatatga tgtagaaaga cgaaaaaaag 180 agagaagaac gagaagatag ttggttgaag acaataagtc cagtgagcag tttaatagaa 240 gttactggtc gtttataagt tggacaatac ttgggtgttg gcaactaata ctactcgctt 300 agctttattg tattaattat tttgtaatta ttttaatttt aaataaacaa ttgagtaacg 360 ttgttattgc gtaatatata acgtaaca 388 // ID Sat64_Cis repbase; DNA; INV; 64 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat64_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-64 RA Smit A.F.; RT "Sat64_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000064 16 bp unit. XX SQ Sequence 64 BP; 24 A; 24 C; 4 G; 12 T; 0 other; gtttcaaccc acacaagttt caacccacac aagtttcaac ccacacaagt ttcaacccac 60 acaa 64 // ID Gypsy-5_IS-LTR repbase; DNA; INV; 103 BP. XX AC ABJB010064655; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_IS_; KW Gypsy-5_IS-I; Gypsy-5_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-103 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010064655; Positions 2221 2323. XX SQ Sequence 103 BP; 41 A; 15 C; 35 G; 12 T; 0 other; tgtaacagcg aagaagaaag gaaagaagaa gaggtgctcc ggacagggat cgaggacgag 60 aacgttggag aatgttggag aacccaggaa ataaaccgtg aca 103 // ID Gypsy-14_DPu-I repbase; DNA; INV; 12966 BP. XX AC scaffold_264; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_DPu_; KW Gypsy-14_DPu-LTR; Gypsy-14_DPu-I. XX NM Gypsy-14_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-12966 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 743-743 (2010). XX DR Genome; scaffold_264; Positions 40391 53356. XX CC Positions [4702-5121] - Reverse transcriptase CC Positions [6256-6741] - Integrase core CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 23..3418 FT /product="Gypsy-14_DPu-I_2p" FT /translation="MLMNKSISLETVDLNKSVPIRRNSESDILRKGKFPPV FT TVRKSKLSLLRKVIPSRYSTRSTVDLKNTKLLDIGIPNSGPVVKKPSRWER FT TKAAIYNSPSLLAKTINFFQRTPASPESSSDAGSETNLPSDPFVDLPTDLD FT PLILPEPIAQSTVSDIFTRAAAFLTPVQSDVDSEDSDPELGKEVEAVFHQP FT DESAKRSEDQLSTESGVRGGRSVRPRHSTRIQTVSDGSFSNDFVGVEDCSI FT RLPEYGKGDTASGIQIVNCPDKASTCIPIFKDSDIPYTTQPCPSSNRNDIG FT DNNASVLRQESVDTRYAELARTFSGEFEYSRRQRPSGTHYDHSLPGRSESF FT RPISEQPETGISLREHSRQIFSRRNSARTPSPTVERAVRESISPARKTIYS FT SATENDCRHQRFIDPKSSLSGSLFSHPPSSEVQRAEPRHSDAPIGSDSSCF FT YRFTTYCSQTLGDGNTLAFTGNTVTVPAVSTATTTTPITANPSSFFTASAT FT HPTTTTSTVPTFSTPAPDHIHAASVANSDSVPRHSGSSISKYVHFLPADDI FT PRLSPDIPSNSHIDREIELRASRDRRDPSDRTNIDPSQPLGRIRRKGKELE FT GVQEKVKRPEAPSKQDHPVAPPRHISPLRPSQPRRSPPLAPSIGDLVDLWD FT EIPAYGWDNTPPEQTNSGNTPRTTARVDKDCSKPKTPPSHARTTCSSPNPG FT SRSVAMDSPINLSTIANVMALGDLNDLQKGNRALTMLLQLPNFSGGPTSIR FT FDRWIKLFDNIVAMSNWTDDETVNMLITKLTGPAHEMLQNILDSVTKDYQE FT IKKLLHERFHGNENQDYFQAQLEEVERQPGENIIAYGFRLKNIFEHGYPKN FT KLPSKAEEATRLQMLRQKFLSGLDLKLKNKVRYKEFKDYEDLVRETDKYNR FT RLEAEKEESSKKEFVNAITTAKSSSDTQLIWTAIEKQNEVINAITTGSRIV FT PQINESVEPPAVGLQDDITQKVAIALSNLLQTAQLPPIRGNPRPQVAPQIG FT QSTNFQPRFHQQRPQGPPYQQRQAYPSYRTQYGQPTQPRHILPQQANYPAD FT RPPRSSITCYMCGNKGHYRSECGQRPVGYAQVPAATDERQRMLTCYTCGVV FT GHRSNTCPNKGANIPAPALRQGNA" FT CDS 3451..7185 FT /product="Gypsy-14_DPu-I_1p" FT /translation="MDASEILKPQQVAYVTQLNNSTAPRVMIRINGNETEA FT LFDTGAARSLLHEKVYRSLPPNMQLASAETSVALFDVQNRRLSNLGKVTLR FT ITYGEKILEQEFIVTNGITEMCIFGIDAILKHEFVLCGKSKAIFIAGQEGT FT TQPSYQGDKEMTVLKREAIPPFTARFVETTISGAGYSLLAGFPFVFSPARV FT SVDSVSNNRRQGNLPDSVTVNRRQENLPDELNNRRQGNLPDELINRGQGNL FT PEELIVEESYNVTRDDGIYHVLVENASEREVVLQQGCILGTIEMGCRIVCA FT VGEEIAQDLDTGKGDCLKTLDEASIELALASIDERFRPQMKQLLTRHANMF FT VKGDRLGCTSVIKHHIDTQGRGPIRLRPYRTARKHEEELQRQLQSLLDQGI FT IEYSTSPWAAPVVLVLKKDGTLRLCIDFRRLNDITLKDSFPLPRIDDTLDK FT LSGAKYFTTLDLESGYWQIELDDESKEKTAFIVENNLYQFRRMAMGLCNAP FT ATFQRTMNFVLRDVLGKKALVYLDDVIIYSKTIEDHLRDIEEVFTLIEKAQ FT LKLKLKKCQFFRTEVNYLGHIITSEGIAPDPDKIEKIKNYPVPTAVDDVRS FT FLGLAGYYRRFIPNFGKVAKPLTSKTQKAAIKEAFIWKEEDQVAFDYLRTC FT LTTKPILAYPDFDQPFLLFTDACNYGIGAVLSQIQQGKEVVISYFSRQLHK FT SEMNYPTIEKEALAVVEAVKHFKYYLLDRHFTVLSDHAPLQWLKTFKDTNG FT RLGRWAVELGSLNYTIQYKPGKIHQNADCLSRLKIANVHARDELKEICADQ FT AQDPLCIHIRNFLDEGSLTEIHTKEYPIWAKEIELYEVIEGVLYRKEIPTR FT KRKQNRILTQVVVPLTLRSVVMKHLHDDPMSGHLAYYRTYMRVRNNYYWPT FT MREDIKEYCRVCQRCLENTKATFRTFLHPLELAKAPFDVVGMDFMGPFKPP FT SSQGNKYIMVVTDYFSKYVEVRALPDQTAITTADAFLDMVVRRHGTPKAIV FT SDRGVNFTSKIFRHLCKTLGIQQKFSTSYHPATNGETERFNRTLSMLLRKE FT LKDTNHEDWEDMLGDVCFAYHSSVHSSTQETPFFLLYGRDPNIPIHNLLDA FT IPQSNKSASDFVSLRMESLRIAFQRTKEENAKAREQQREQYNKRAKALCYR FT VGDRVLLDVKVRSKTENKKFISKYRGPFRVSKVYDNGTVDITNNSFTTKRV FT HVNRLRPLYDTMVWRDEFCPDLEDPIIFAGSTSLN" FT CDS 11007..12926 FT /product="Gypsy-14_DPu-I_4p" FT /translation="MSPKPNPFFLFYGAQRLTSINSFRCRQKNKMTIYILL FT LLSVYAVAFETTICNCKDPIRQEHLRIDDDKCQPIRRPTQRNADYAVWTDR FT KDGLKVRGYVCSRWEKINHISTNLFLQQIVVPDKRAVDTTEAECKIMAQTK FT RCDEMPMKFADGKWLYVPEPSESGAWMRTISTQVLNCMVEETIILQEENDL FT IDTPLGRANISDGVHTHNHMTLIWDVADADRIAHTPRLLTQGKATITQTST FT ATTFKLEDDSNQLAYHITHTDRCIMVNCESTNNTFAVAGDKHLFIVILKLG FT SEGNKTSDLILDRTASKPTKEELEYIRIRDDYYSYPEDKRMALLKLLQSHV FT QYLKDKTLDHENEILRAAHDMNCQITRIKHSLAVAAAQYDGWLAASILGLP FT TCVTLQAKGMTVLMKECRAEKITFTTETTTCGPQPRYENSTISQNGWEITT FT YQPCYWSDHYVNFNGKHYVYRNDTWKLAEATVIPMEQDWPVSFRYIDDNSY FT KYQPSTNPGYKAFITSPMNIMADMTAAMAEQSIHSGAAQEITPLTPVQTII FT MTAAEKTHVSGQMSWWETFKLILFITSLVFVFAFLIAFLRYFGIFALIFAM FT CCKPRPRLRSSRVHRRRREEVLDSSPPIALRDLLPPLNNQPI" XX SQ Sequence 12966 BP; 4149 A; 2940 C; 2564 G; 3313 T; 0 other; gcggcgtacg aatgcactca caatgttaat gaataagagt atcagtcttg aaacagttga 60 tttaaacaaa tctgtgccta ttcggcgtaa ttctgaatct gatattttaa gaaaaggaaa 120 gtttccccca gtgactgttc gtaaatccaa attatctctt ttgcgtaaag tgattccgag 180 tcgttattct acacgttcaa ccgttgactt gaagaataca aagttgcttg atattgggat 240 tcccaattca ggccctgtcg ttaaaaaacc ttccaggtgg gaacgaacaa aagcagctat 300 ttataacagc ccatctttgc tggcgaaaac tattaatttt tttcaacgga ctccggcatc 360 acccgagtcc tcatctgacg ctggttctga aaccaatttg cctagcgacc cgtttgttga 420 tttacccact gatcttgacc cattgatact acccgagccc attgcacagt cgactgtttc 480 ggatattttt actcgcgcgg ctgcattttt aacgcctgtt cagtccgacg ttgatagtga 540 agactctgat ccagaacttg gaaaagaagt cgaagcagtt tttcatcaac cagatgaatc 600 ggcaaagcgc tcagaggatc aattatctac tgagtcaggc gtgcgaggag ggcggagtgt 660 ccgacctcga cattccacgc gaattcaaac cgtctccgac ggaagctttt caaacgactt 720 tgtgggagtt gaggactgca gtatccgcct accagaatac gggaaagggg acacagcatc 780 cggaatacaa atcgttaatt gcccagacaa agctagtact tgcatcccca tttttaaaga 840 ctcagatata ccctacacca cacaaccgtg tccgtctagc aacagaaacg atattggcga 900 taacaacgcg tccgtcttac gacaagagtc agttgacact cgctacgcag aacttgctcg 960 aacgttttct ggcgaatttg aatattcacg tcggcaacga ccttcaggca cacattacga 1020 tcatagccta cctggaagaa gcgaatcgtt tcgtccaatt agtgaacaac cggaaaccgg 1080 aatttcactt cgtgagcatt cacgacaaat tttctcgaga cgaaactctg cacggactcc 1140 ttcgccaact gtcgaacgag ctgtcagaga gtctatatct ccagcacgga aaacaattta 1200 ttcgtcggct accgaaaacg attgtagaca tcaacgattt attgacccta aaagcagtct 1260 cagtggcagt ttattctcgc accctccgtc tagtgaggta caaagagcag aacccagaca 1320 ttccgacgcc cctattggca gcgactcatc ttgcttctat agatttacga cgtattgcag 1380 ccagactctc ggggacggta acacccttgc tttcaccggt aacaccgtca ccgtacccgc 1440 cgtctccacc gccaccacca caacccctat cacagccaac ccctcttcct ttttcacagc 1500 ctccgccacc catccaacca ccaccacctc caccgtcccc accttctcca ccccagcgcc 1560 cgatcacatt cacgcggcct cagttgccaa ttccgatagc gtcccaagac actccggttc 1620 cagtatcagt aaatacgtcc acttcttgcc cgccgacgat ataccgcgat tatcgccgga 1680 tatccccagc aatagccata tcgaccgaga aatcgagctc agagcaagcc gcgatcgacg 1740 cgacccgtcc gatagaacaa acatcgaccc atcacaaccg ctgggacgaa tacgtagaaa 1800 gggaaaggag ttggagggag tacaggaaaa ggttaaacgg ccagaggcgc ccagtaaaca 1860 ggaccaccca gttgcacctc cgcgccacat cagccccctt cgtccgtcgc agccaaggag 1920 atcccctccc cttgccccct caatcggcga cctcgtcgat ctctgggacg aaattccggc 1980 atacggttgg gacaatacac ctccagagca gacaaattca ggaaacacgc cacgtacaac 2040 agcacgagtt gataaggatt gcagtaagcc taagacgcct ccaagtcacg ctagaacaac 2100 ttgctcaagc cctaacccag gatcaagatc agtagccatg gactctccaa ttaatttatc 2160 aaccatcgct aacgttatgg cgttgggtga tttaaatgat ttacaaaaag gtaatcgtgc 2220 actcacgatg cttttacagc tacccaattt tagtggagga ccaacgtcta ttcgttttga 2280 tagatggata aaattatttg acaatatcgt agccatgtca aattggacag acgacgaaac 2340 ggttaatatg ttaataacta agctaacggg gccagctcat gagatgttgc aaaatatttt 2400 agacagcgtt acaaaagatt atcaagaaat taaaaaatta ttacacgagc gattccacgg 2460 gaatgaaaat caagattatt ttcaagcgca attagaagaa gttgaacgtc aaccaggtga 2520 aaatatcatt gcgtacggat ttcgattaaa aaatattttt gagcatgggt atccgaaaaa 2580 taaacttccg tcaaaagcgg aagaagctac tcgtctgcaa atgttaaggc aaaagtttct 2640 gtcaggactt gatttaaagc ttaaaaataa agtacgttat aaagaattta aagactatga 2700 agacctcgta cgcgaaaccg ataaatacaa tcgtcggtta gaagcggaaa aagaggaaag 2760 tagcaagaaa gaatttgtta atgcgatcac aacggcgaaa tcatcgtccg atacacagct 2820 tatttggaca gcgattgaaa agcaaaatga ggttataaat gccatcacga cgggctcacg 2880 gatagttcca caaataaatg aaagcgttga gcccccagca gtaggcctac aagacgatat 2940 cacccaaaaa gttgccatcg ctttgtcaaa tcttttacag actgcacagc tccctccaat 3000 taggggaaac ccaaggccac aggtagcacc tcagataggg caatcgacta actttcagcc 3060 tcgttttcat cagcaacggc cgcaaggacc tccttatcaa cagcgacaag cttatccatc 3120 gtatagaact cagtatgggc agccaactca accgcgtcac attcttccgc aacaagctaa 3180 ttaccctgca gataggcccc cacgttcatc aataacttgt tacatgtgcg gcaacaaggg 3240 acattatcgg agtgaatgcg ggcaaagacc agtaggctac gcacaagtgc ctgctgctac 3300 ggatgaaaga caaagaatgt tgacttgtta tacctgcgga gtagtaggtc atcgttccaa 3360 tacctgtcca aataaagggg ctaatatacc tgcccctgca ttacggcagg gaaacgctta 3420 gacaccaact gcggctttgg gtcagttggt atggatgcct cggaaatttt aaaaccccag 3480 caggtagctt atgtaactca attgaacaat agcacggccc ctcgagtaat gatacgaatt 3540 aatggaaacg aaacagaagc actgtttgat acgggtgctg cgagaagtct actacacgag 3600 aaagtgtata ggtcactacc accgaatatg cagctggcca gcgcagagac ctcagtagcg 3660 ttatttgatg ttcagaatcg gcggttaagt aatttaggta aagtcacgtt gcggattaca 3720 tatggtgaaa aaattttgga acaagaattt attgttacga atggtattac ggaaatgtgc 3780 atttttggga tagacgctat cttgaagcac gaatttgttt tatgcggtaa atcaaaagca 3840 atttttattg ctggtcaaga agggactacc cagccatctt atcagggaga caaagaaatg 3900 acggttttga aaagggaggc tattccacct ttcacggctc gttttgttga aactaccata 3960 agtggagccg ggtattccct tcttgcagga tttccgtttg tttttagtcc agcgagagta 4020 tcggtggact ccgtatcgaa taaccggaga cagggaaatc ttccggactc ggtgacggtt 4080 aaccggagac aggaaaatct tccggacgaa ttgaataacc ggagacaggg aaatctgccg 4140 gacgaattaa ttaaccgggg acagggaaat cttccggagg aattaattgt tgaagaaagt 4200 tataatgtaa cgcgtgacga cggaatatat catgtactcg tggaaaatgc tagcgaaagg 4260 gaggtcgtat tacaacaagg ttgtatctta ggtacaatcg agatgggttg ccggatagtt 4320 tgtgcggtcg gggaagaaat agcccaggat cttgacaccg gtaaaggaga ctgtcttaag 4380 accctagacg aggcttcaat cgaactcgct ctcgcatcaa ttgacgaacg ttttcgtcca 4440 caaatgaaac agctacttac taggcatgcc aacatgtttg taaaaggcga ccgtttaggt 4500 tgcacgagtg ttattaaaca ccatattgac actcaaggaa gaggcccaat ccgactacga 4560 ccctatcgca cggccaggaa gcatgaggag gagttacaac ggcaacttca atcgctattg 4620 gaccagggta ttatcgaata ctctacttcc ccttgggctg ccccagtcgt tttagtattg 4680 aaaaaagacg gaacactgcg actctgtatt gattttagaa gattaaatga cattacgtta 4740 aaagattcat ttccgttacc gcggattgat gacacactgg ataaattaag cggcgcaaaa 4800 tatttcacta cgctggacct tgaatcgggg tattggcaaa tcgagttaga tgatgaatca 4860 aaagaaaaaa ctgcttttat agtagagaat aatttatacc aatttagacg aatggctatg 4920 ggactttgca acgccccggc cacttttcaa cgaactatga attttgtcct gagagacgtc 4980 ttaggaaaga aggctcttgt ctatttggat gacgtaatca tctattcaaa aacaatcgaa 5040 gaccatttga gagatattga agaagtattt acattaattg agaaagcgca actcaaacta 5100 aagttaaaga aatgtcagtt ctttcgaacg gaagtgaatt atttaggcca cattattaca 5160 tcagaaggaa tagcccccga tcccgataaa attgaaaaga ttaaaaatta tcctgtccca 5220 acagcagttg acgatgttag atctttttta ggcctagcag gatactaccg gaggttcatt 5280 ccaaattttg gaaaagtagc caagcccctg acgagtaaga cgcaaaaggc ggcaataaaa 5340 gaagctttta tatggaagga agaagatcaa gtcgcgtttg actatcttcg aacatgttta 5400 acaacaaaac cgattttagc ctatccggat tttgaccagc cttttctttt gttcacggac 5460 gcctgcaatt atggcattgg agctgtgtta tcgcaaattc agcagggaaa agaagttgta 5520 atatcttatt tcagtagaca gctacacaaa tcggagatga actaccctac gatagaaaaa 5580 gaagctttgg cagtagtgga ggccgtaaag cattttaagt actacctgct tgaccgtcat 5640 ttcacagtat tgagtgacca tgccccttta cagtggctca agacgtttaa agatacgaat 5700 ggtagactcg gaagatgggc agtagaattg ggtagtctga attatactat tcaatataaa 5760 ccaggtaaaa ttcatcagaa cgcggactgt ctttctcgtc ttaaaatagc aaatgtacac 5820 gcgcgcgatg aattaaaaga aatatgcgct gatcaagcac aagatccgct gtgcattcac 5880 attcgtaatt ttttggacga aggatcgctg actgaaatac acacaaaaga atatccgata 5940 tgggcgaaag aaatcgaact atatgaagtt atagaaggag tgttatatag aaaagaaata 6000 ccgacaagaa aacgaaaaca aaaccgaatt ttaactcagg tagtcgtacc cctcaccctc 6060 agatctgtag tcatgaaaca tctacatgac gacccaatgt ctggccatct tgcctactat 6120 aggacgtata tgagagtaag aaataactac tactggccga ctatgaggga ggatattaaa 6180 gaatactgta gggtatgtca aaggtgtttg gaaaatacaa aagcaacttt tagaactttc 6240 ttacaccccc tggagttagc taaagcacct ttcgatgtag taggaatgga ttttatgggc 6300 ccgttcaagc ctccgtccag ccaaggtaat aaatacatca tggtagtgac ggattatttt 6360 tcgaagtatg tagaagttag ggctctacca gatcaaacgg ctataacaac ggccgatgca 6420 tttctagaca tggtcgtacg acgacacggc acaccgaaag caatcgtatc agacagagga 6480 gtcaatttca catcgaaaat ttttcggcat ttatgcaaaa cattaggcat tcaacaaaaa 6540 ttttccactt cttaccatcc ggctacaaat ggagaaacag aacgtttcaa ccgcaccctg 6600 tccatgctgt tgagaaagga attaaaagat actaatcatg aagactggga agatatgtta 6660 ggcgacgttt gtttcgcata tcattcatca gttcattcgt ctacgcaaga gacgccattc 6720 tttttattat acggtcgcga tcccaatatc cccatccata atttattaga cgctatccct 6780 caatctaaca agtctgcttc ggattttgtc agtcttcgaa tggagtctct aaggatcgct 6840 tttcaacgta cgaaagaaga gaatgcaaaa gctagagagc agcagcggga acaatataac 6900 aaacgagcaa aggcattatg ctacagggtg ggcgacagag ttctacttga tgtgaaagta 6960 cgatcgaaaa cagaaaataa gaaatttatt tctaaatatc gtggtccatt ccgtgtttcc 7020 aaagtttatg acaatggaac ggtggacatt acaaataact cttttactac taaaagagta 7080 catgtcaacc gtctccggcc actttacgac accatggtct ggcgagacga attttgtcca 7140 gatcttgagg atccaataat atttgccgga tccaccagtc tcaactgatg tggaagatca 7200 ggcaacagag gagatcggag aagaacatgt gagcaatcca acggcagaac aggaggatgc 7260 cgacacccgt attggtgaca gaaacatgga gcaacttgcc ccttttcatg ggtgggaaat 7320 aagcgaaagc gaccacccaa cggttgaaac aatacctgac acggtagcga atgaagaaat 7380 tcccccgcaa aaccgaatgg aaaggagtag aaggtctata aaggctccaa cgaggcttat 7440 agaggaaaaa gaataagctg ttccttaaat tttttttccc tctcctcctg ccagcctcaa 7500 ctaggaacga catttttttt ttaggaccat atctgatctt tctcttacac aacaaaaccg 7560 tgtttaaaaa aaaaaaaaaa atgaaaaaaa tctcagcaat taaaaacaac ataaaaatag 7620 taaaatcgaa caaaaaaatc agagaaagtc agacaaatag aaaaagagaa aaagaaaaac 7680 acaagaacac ccaggcaaaa atttaaaaat agggggatga gcatcaaatt tagactatgg 7740 aatcaatttg gggggtccac acgccaccat cttcttctaa gcgatccgtg ttagccaaaa 7800 aggttaaaag gacgcccgtt tttagctgac ggtccaatct ttttactaac gtccagcagg 7860 acactgtaaa cggacagttt taattctgtg tcttaattga atttttcgta ccgcaaaaat 7920 gtctgcttcg aaaccaaaac gcgaaagtgt cgatcgccgt ccgcgagcgg cacactcgtt 7980 ttacggcatt tgggaagaag gaatgtgcca aatgcgtgaa tgtgatcagc aaatggacac 8040 accaattccc cccagagttt tgtttacgct acaagagatg agggaaaaag tgcttaactc 8100 ctcacatttt atgtggtcga aagaacgttt ttcagtgtcg cctccgacgt atcacaagga 8160 cttatccagc agtggcacgg gttcttttga ttcgtgtgtg ccgcagtatg acagcacaga 8220 agaagaagaa acgagtgaca ctgatagcac gagcgacgcg tctctttact gggactcaaa 8280 aaacttggga gacggacgag acgtaaacga cgacaaccca agaagtcgag gaagcagcag 8340 gtcttccgga agaaccacag ctaagctgcc gtccgatcag gcctcgttcc tgttggtacc 8400 tcatctgaac tgtgtcaaaa cggagccgtg tgaacttccg gacgtcgttc tccactcgtt 8460 ccggccggcc tcaaacgtcc catctagcta ctcccagcac agcgaattcg accttctttc 8520 caatcgttgt atgttggttt tgctctgcat ttggggcatg tttatgactt tcatgtgggg 8580 gtatgtagct tatggattag tccaaatatt tcaaatgaaa cctgaagtaa taggcttccc 8640 cattgattct ttttccccgc aagcccaaca atccaacaac aattcactca accaattttt 8700 gcctcaataa tgatgtgggt aaaacttcca atcattatga tacccgggtc ttttcttttt 8760 caacacacat ttttttcgtc ttttattttt atttctcgtc ttcaattttt ttctattttc 8820 atatggcgtt ccttactatc tttcacgccg gtttattaat cgcctcctta ccttattttt 8880 tgtcttttac attttttacc tcttatatta ttttctcgtc tttttatttt ttctcgtctt 8940 ttattttttc ttttatgata aatggtgtcc tactttcctt ctatttttga ataattttat 9000 ttttcaacta ttttatcctc tttcgtctcg aactctttta aaaattttag ttagcctcat 9060 agtccagatt tgcattttct tctttatttt ccatcgtaat tgatacctgt ttttatctcg 9120 tatttatgta atacgtgttg ttgacaagta tcttgttatt ttacgtgtct aacgccatca 9180 tatcgttgtt tcttttccat tacttctttt cctctatagt ttttcttaaa aatttctgtc 9240 actatttgtt ttcccttttc tttctttcgc tttttagtag ttttatttaa gaaaaacaaa 9300 caataaaaag gttattccca tgtcgtgttg ttttcttttc cataattgtg tcaaactttt 9360 ctttggctga atacggaata taaaatgcga cgggaatgac taaataaaaa catgaaataa 9420 aaagagtaaa attagcacaa ttcaatcgaa gtgcataaaa ctaaatccgg gcgggcagta 9480 agtaaaataa aaaacaccga tcaatagcta aacacgaata aaaataacgt cagaactaac 9540 acacggatca attcgataaa aaaccctatt ttgttatcag tcgttttccg ttaggcatag 9600 gcctacagtt aaattattct ctaaacttga aataaaaaca agagatgtgg taaaattctt 9660 cataaaaatc ttcaaattat cttacaaaag catttctgat tagaaaagta aaagtaaaat 9720 attcttcaac atcattttta caacctcaat acaatctaag aaagtaaata aacaaactga 9780 gtcaatgacc ccgaaggcta tgtgcatacc ttaacgaaag tgaaagcgtc gctcaaaccg 9840 aatttttaca aagaagtgaa cgcatacgcc atctagcgct aaaaataaga aacacctcga 9900 acatatagca gttattgtca aattttctct ccttccgtaa atgaccagca ccaaagtaga 9960 aataaactaa acctttctca tataaaattc aacatattaa aagtaaaagt taacctactt 10020 ttatcataaa accctccatt ttcttcaagt aaagtaaaaa aaaaaaaaaa aaaaaaaaaa 10080 acctccattc taccaattcc cccgcaaccc aagcatattt acatttttct atcattcatt 10140 tgttctcccc cgctctaaac taaaattatt acttttcccc gtcattcaac atacgttcac 10200 tttccgccat acaatttaac cgtttatttt tcccacctat tttgctaatc gtaataaatt 10260 tttcccctaa aaagttccag tagaaacaaa caacagaaac aaaaactttc acgcaacatt 10320 gtattttcaa aagagcacaa attcatgaag tacatcgaca cattacaaaa acaaatctca 10380 atcgccgagg tgaagacggc ctatatctgg aaactccagc tcctcttcgg caaagattat 10440 gccctcgggg aagaaaacat caaagcccga ctcgcagaac cataaaaaac gggccgacgg 10500 aatatcgaaa tgcttccgat acctagcata accaggatat cgcctaaaaa taacaaaaac 10560 ggaacaaata aaacaaatga atacgttaat aaacaataaa aatacagcta catcaacacg 10620 acacgaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaga aaacaaaaac ttacgcgaat 10680 agacgctcgc gcgcgtccgg tggaacaaat tctcgacgga attcctgcaa ggttaaaaac 10740 tgccagctcg ctaacggctg ctgcagtaac caatgaaatc gaagccgatc agtagcacac 10800 aaatgaacgt cgtctgggat atagaaatat cgcagcgcga gttcgagagc gtgttgttgt 10860 gaaggctcca tttcgaatat taaaaccaaa ctaaaaactg cagccaccat ttcgacacaa 10920 aatataaaaa tattgacaac acggtggagt tgactactct aattccaccc agacggcgcc 10980 attcaaccga tgtgtcgcca ctttcaatgt cacccaaacc aaatcctttt tttttgtttt 11040 atggtgccca gcgacttacc tctatcaatt ctttccgttg cagacaaaag aataagatga 11100 ctatctatat cctactactg ctgagcgtct acgcggtcgc cttcgagact acaatttgta 11160 actgcaagga tccgatacga caagaacatc tacgcatcga tgacgacaaa tgtcagccaa 11220 tccgccgccc gactcaacgc aacgcagact acgctgtttg gaccgacaga aaagatggat 11280 tgaaagtgcg cggatatgta tgcagccgat gggaaaaaat taatcacatc tctacaaatc 11340 tttttcttca acaaatcgta gtgcccgaca aacgtgcagt cgacacaacg gaggccgaat 11400 gtaagatcat ggcacaaacc aaacgttgtg acgaaatgcc gatgaaattt gccgacggaa 11460 aatggcttta cgtgccagaa ccatcggaaa gtggtgcgtg gatgcgcact atatcgactc 11520 aagtattaaa ctgcatggtg gaagaaacta taattctaca agaagaaaac gatttgatcg 11580 acacaccatt gggacgagct aacatctcag atggcgtaca cacgcataac cacatgactt 11640 taatctggga tgtggccgat gcagatcgca tcgcacacac accgcggctt ctcacacaag 11700 gcaaggcaac aataactcaa acatctacag caacaacttt caaattggaa gacgactcca 11760 atcaactagc atatcacatc acacatactg accgttgtat aatggtaaac tgcgaatcta 11820 ctaacaatac tttcgcagta gctggtgaca aacatttgtt catcgtcatt ctcaaattgg 11880 gaagtgaggg caataaaacg tctgacctta ttctcgatcg tacggcgagc aaacctacaa 11940 aagaagagct tgagtacatc agaatacggg acgactatta tagctatcct gaagataaaa 12000 gaatggctct tctcaaatta ttgcaatctc atgtgcaata tttgaaggac aaaactcttg 12060 atcacgagaa tgaaattctc cgcgctgctc acgacatgaa ttgtcaaata acaaggataa 12120 aacattcgtt ggcagtggcg gcagcgcaat acgacggctg gttagcagcc agtatattgg 12180 gattgcctac ctgcgtcacc ctgcaggcaa aagggatgac cgttttaatg aaagaatgcc 12240 gcgccgaaaa aatcactttc actacggaga ctactacctg cggccctcaa cccaggtatg 12300 aaaattctac aatatcacaa aacggatggg agattacaac ctaccaacct tgttactggt 12360 ccgaccatta cgtcaacttc aacggtaaac attacgtgta ccgcaacgac acctggaagc 12420 tggcagaagc taccgttatt ccaatggagc aagattggcc ggtctcattt cgttacatcg 12480 acgacaacag ctacaaatat caaccatcga caaaccccgg atataaagct ttcatcacca 12540 gcccaatgaa catcatggcg gacatgacag cagctatggc ggagcaatcc attcactcgg 12600 gtgcagcaca agaaattact ccactcacgc ccgtacaaac aatcatcatg acggcggccg 12660 aaaaaactca cgtatctgga caaatgagtt ggtgggaaac gttcaaactg attttattca 12720 tcaccagttt ggtattcgtt ttcgcctttc tcatagcctt tttacgatac ttcggaattt 12780 tcgcgcttat cttcgccatg tgctgcaagc ctcgcccgcg acttcgttca tcccgtgttc 12840 atcgccgacg ccgtgaagag gttctggatt cctcgccacc aatagcactt cgagatcttc 12900 ttcctccgct gaacaaccaa ccaatttagt agattacggg acgtaatcca aggggccccg 12960 gagcca 12966 // ID Gypsy-597_AA-LTR repbase; DNA; INV; 563 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-597_AA_; KW Ty3_gypsy_Ele58; Gypsy-597_AA-I; Gypsy-597_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-563 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 563 BP; 172 A; 119 C; 87 G; 185 T; 0 other; tgtagcatgt ttattttaac gcatttcaaa aagatcataa tttaaggatc atgacagctt 60 acattgctca acttctagga aaacattccc acacgcaagt tagctgtact tgcatcttca 120 aagcatttgg tacatgtcat ttttcaacta atcgactagg tttcattttt cacaacccat 180 cgattatcca cataccacac aacttgagca acttattttg ttctcatgac gaagcgcatt 240 catgctgaga agcttcatat ttcaattccc ataaagacaa tttctacttt tgcctttgca 300 ctgatttgac ccgacagtgc aacggcccag cgaacacaca tgtaaacgtt tacccatccg 360 tttggtaaac gccttctttt aaatttgtat tctatgattg gttaattgat attggttgtt 420 tcagccttct cgtaatatag tggagtacta gattaagctc gaggttagct gtaagattga 480 cttcaaaata aaagacagaa cataatcaac aaccatagag tcaagttgac taattccgat 540 atgtccttta tggtatcata cca 563 // ID Gypsy-204_AA-LTR repbase; DNA; INV; 205 BP. XX AC AAGE02024741; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-204_AA_; KW Gypsy-204_AA-I; Gypsy-204_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024741; Positions 4724 4928. XX SQ Sequence 205 BP; 63 A; 50 C; 43 G; 49 T; 0 other; tgtggcggat gcacccctga ccgccgttgg agtgcgataa actaccaaat gcatacacag 60 cagaaagtgt gtatcgcgcc atacgataga gttaagaaaa acaccgattg aatacaatct 120 taagttggcc accaaacgag tcggtcttgt ttaattatat cacatcccac caatgctgtc 180 ttgagaccaa tgtgcgacca tttca 205 // ID Gypsy-17_DWil-I repbase; DNA; INV; 4717 BP. XX AC scaffold_180762; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_DWil_; KW Gypsy-17_DWil-LTR; Gypsy-17_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4717 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180762; Positions 44875 49591. XX CC Positions [2185-2742] - Reverse transcriptase CC Positions [3754-4233] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 115..4680 FT /product="Gypsy-17_DWil-I_1p" FT /translation="MDSINPDDFSKSQLQIWCKNLGLPDTGTKNVLAERLN FT QLSPVARGQHPKDGAAPNGGLSKRADPPEKDCLEHLKSILYSSDDGAGQTQ FT NFEADLFQGQQQQEVTANEQNKAENGQNEAENDQNDAESETQTENESITNE FT PQEISKATNEKSNEKSNNGERNATNEKSNEKSNNGERNGTNEKSNEESQKE FT ENIVKYTLEKKLELLCKEIELVRRENEFLRLAGNPESGHSSSWLATKTGDQ FT VLAKDVKDFVLDFDGNGNVYSWISSMLHLERSFNLNENNLRLLVFGKLKGK FT ALDWIHSNPECLCQPVKTIFDAMGKVFGSKKTNVIIRREFEARKWCVGEPF FT LNYFNDKVGLAGGLRLPEDEMVEYIVDGIPDRHLREMACMQRFGSKDDVLA FT AFVNIVLQREKPVSLPNVSAKKKPTFRCFNCNCLGHLAADCNKPIRATGAC FT FGCGEMGHRVADCSQNKKKPAAISVSLDNIRIVKVHIGNDVFSLGCLIDSG FT SAVSLIKKEFVSDEIKLEPGPEHTYFGLNKSKLEVFGSILCYVEILNKNVY FT VRLLIVSNESMGSPILLGRDFMKICKAKIVLQLDDDLCKESMAESIKLSHD FT LSKEDNFENQMLNIDCDVVNSKYSVGKSLSFNETTMFENIFSKFYLERNRP FT NEPKVKCEMKLTINDISPFSCNPRRLSYSEKAELQKIIDEYLSKEYIRPSE FT SEYVSPIVLVKKTTGELRMCVDFRKLNKATSKDSYPIPLIDDLLDRLAGKT FT LFSKLDLKNGFFHVYMHEDSVKFTSFTTPLGQFEFLRMPFGLRNAPSTFQR FT FINRIFADMIRDEKVIIYLDDIMVATKDFDTHLEILKEVFDRLVDNKLELR FT LDKCEFLQTSVKYLGYSITGKGIKADDKGIQAVKDFPVPTKVSEVQSFLGL FT CSYFRRFIKDFATIAKPLYDLVRKDKKFMFGNIELSTFENLKSKLTEAPIL FT ALYSPNDETELHCDASSIGFGSILLQRKSDGKWHPVFYFSKRTTDVESRYH FT SFELETLAIIYSLKRFRVYLQGKHFKIATDCNSLTLTLSKREVNPRIARWA FT LELENYDYVLEHRPATRMQHVDALSRNTNIMVVETNSFEDNLVICQNQDDE FT ILRLKSMLERSEHSSYEMRNGLIYKKLQNGCLAFYVPKNMESHVLFKYHNE FT LGHTGIDKVVEVITKSYWFPKLRKKVEQHVNNCLKCIVFSSKNGKQEGHIH FT NIPKPDVPFEVVHIDHFGPIDVKGSTKKHVLVVVDACTKFVRLYLAKTTKS FT KEAIEALKEYFRSYSRPKCIVSDRGTCFTSNEFSSFLESCNIQHVKIATGS FT PQANGQVERINRSLGPMLAKLLASEENVSFGQAIEIVEYTMNNTIHRSLGQ FT HPSVVLFGVGQKGRQVDELKDYLLEDRSVERNLESVRAKAKEKEKKLQEYN FT ECLANRKRRGPKVFKPGDYVVVKNFDTHKGVSKKLVPRFKGPFKIVKCLEN FT DRYVISDVDGFQQSKIPYTGIWSITNMKPWFNWSVNVNENSIKEKVSMLSK FT GRHKED" XX SQ Sequence 4717 BP; 1657 A; 769 C; 1036 G; 1255 T; 0 other; gagatagacg tgttttactc gctataattt cagaagtggt catagttaac ccagtgaaaa 60 aacagtgaaa tagctataat ttagcaaaga aaaaatatat attaaaaaac aataatggat 120 tcaataaatc ctgacgattt ttcgaagtcc cagttacaaa tctggtgcaa aaacctaggt 180 ttaccagata ctggcactaa aaatgtgcta gccgaacgtt taaaccaact tagccctgta 240 gcgagagggc aacatccaaa agatggagct gctcctaacg gaggcctaag taaacgcgcg 300 gatccccctg aaaaagattg cttggagcac ctaaagagca ttttgtattc atcggacgat 360 ggcgccggtc agacgcaaaa tttcgaggca gacttgttcc aaggtcaaca gcagcaagaa 420 gtaacggcaa acgaacagaa caaagcagaa aatggacaga acgaagcaga aaatgatcaa 480 aacgatgcag aaagcgagac gcagacagaa aacgagagca ttacgaatga accgcaagaa 540 atatcaaaag caacaaatga gaagtcaaat gaaaagagca acaacggaga gcgcaacgca 600 acaaatgaga agtcaaatga aaagagtaac aacggagagc gcaatggaac aaatgagaag 660 tcgaatgaag agagccaaaa agaagaaaac atagtaaagt acactttgga aaaaaagctg 720 gaactacttt gcaaagaaat cgaactggtt cgtagagaga acgagttttt gcgccttgct 780 ggcaaccctg agtctggcca ctcaagcagc tggctggcga cgaagacagg tgaccaggtc 840 ttggccaaag atgtgaaaga ttttgtactc gattttgacg gtaatggaaa tgtttatagt 900 tggatatctt ccatgcttca tttggaaaga tcattcaact tgaacgaaaa taatttacgt 960 ttgcttgtct tcggtaaact aaagggcaaa gctttggatt ggatacactc aaaccctgag 1020 tgtttgtgtc agcctgtgaa aaccatattc gatgccatgg gaaaagtctt cggatcaaag 1080 aagaccaatg taataattcg aagagagttt gaggcacgca agtggtgcgt cggcgaaccg 1140 ttcctgaatt actttaacga caaagttggc ttggctggcg gcttgcgttt gccggaggac 1200 gagatggtcg agtatattgt tgacggaata ccagaccgtc acttgcgaga gatggcttgt 1260 atgcagcgtt ttgggtctaa ggatgacgtg ttggcagcat ttgtcaacat agtgctgcag 1320 cgggaaaaac ctgtatctct gccaaatgtt tccgcaaaga agaagcccac attcaggtgc 1380 ttcaactgta actgcctggg tcacctagca gccgactgta acaaaccaat aagagcaact 1440 ggtgcttgct ttggatgtgg tgaaatgggt caccgagtgg cagattgcag tcaaaataag 1500 aagaagcccg cagcaatatc agtaagtttg gataatataa gaattgtaaa agttcacata 1560 ggtaatgatg tattttccct aggatgcctc atagactcgg gaagcgcagt tagcttaatt 1620 aaaaaggagt tcgtttcaga tgaaattaaa ttggaacccg gtcctgaaca tacatatttt 1680 ggtttaaata aaagtaaact tgaagtcttt ggaagtattt tatgttacgt tgaaatatta 1740 aataaaaatg tttacgttcg actactgatt gtttccaacg agtctatggg aagtccaata 1800 ctgttaggta gagattttat gaaaatatgt aaagctaaaa ttgtactgca gcttgacgat 1860 gatctgtgta aagagtcaat ggccgaaagt attaagttaa gccatgactt gagcaaagaa 1920 gataattttg aaaaccaaat gctaaatatt gattgtgacg tggtaaattc aaaatattct 1980 gttggcaaaa gtttaagttt caatgaaacg acaatgtttg aaaatatttt ttctaaattt 2040 tatctggaaa gaaatcgacc aaatgaacca aaggttaaat gcgaaatgaa actcacaatc 2100 aatgacataa gtccgtttag ctgtaatcca agacgacttt cttatagcga aaaggctgaa 2160 ttgcaaaaaa taattgacga atacttaagc aaagaatata ttaggccgag tgagtcagaa 2220 tatgtttccc caatcgtgct cgtaaaaaag actacaggtg aacttagaat gtgcgttgat 2280 tttagaaaat taaacaaggc gacttcaaaa gatagttatc ctattccact tattgatgat 2340 ctgttagata gacttgcagg aaaaacctta ttttcgaaac tcgacttaaa gaatgggttt 2400 ttccatgtat acatgcatga ggattcagtg aagtttacgt cattcacgac accgcttggt 2460 cagtttgaat ttcttcgtat gccatttggc ctcagaaatg cgccgtctac atttcaaagg 2520 tttataaaca ggatatttgc cgatatgatc agggatgaaa aagttattat atatttagac 2580 gatattatgg tagcaacaaa agattttgat acgcatctag aaattttaaa agaagtattt 2640 gaccgattgg ttgacaataa attggagttg agattagata agtgcgaatt tctgcagacg 2700 agtgttaagt acctagggta ttcgattaca ggcaaaggca tcaaagctga cgataagggt 2760 atacaagcag taaaggattt tccagttccg actaaagtca gcgaggttca aagtttttta 2820 ggactatgtt cctattttcg taggtttatt aaagattttg ctactattgc caagccatta 2880 tatgacctag ttagaaaaga caaaaagttc atgtttggga acatcgaact tagtacgttt 2940 gaaaatctaa agtctaagtt aacagaggcc ccaatacttg cgctttatag cccaaatgat 3000 gaaaccgaac tgcattgtga cgcaagttct attggtttcg gatcgatttt gctacaaagg 3060 aaaagcgatg gtaaatggca tccagttttc tatttttcta aaagaactac cgatgtggaa 3120 tctagatatc atagctttga gttagaaaca ttggcgataa tttattcgtt aaaacgattt 3180 cgtgtatatt tgcagggaaa gcatttcaag atcgcgacag attgtaactc gcttacattg 3240 acacttagca aaagagaagt taaccccaga atagcccgtt gggctctaga gttagaaaac 3300 tacgattatg tgctagagca taggccggca acgagaatgc aacatgtaga tgctttaagt 3360 agaaatacta acattatggt tgtggaaact aactcatttg aagataactt agtaatatgt 3420 cagaaccaag atgatgaaat tttaagatta aagtcaatgt tagaaaggtc tgaacacagt 3480 tcgtatgaaa tgcgaaacgg tttaatttat aaaaaactac aaaatggttg ccttgctttt 3540 tacgtaccga aaaatatgga gtcacatgtc ctgtttaaat atcataatga gttaggccac 3600 acaggcatag acaaagttgt agaagttatc acaaagtcgt attggtttcc aaagttacgg 3660 aaaaaagtag aacaacatgt aaataattgc ctaaagtgta tagtcttttc gtcaaaaaat 3720 ggaaaacaag agggtcatat ccacaatata cctaaacctg atgtgccatt cgaagtggtt 3780 catatcgatc acttcggacc aatcgatgtc aaaggaagca caaagaaaca cgtgttagtg 3840 gtggttgacg cgtgtacgaa atttgtcaga ctttacttag cgaaaaccac aaaatctaag 3900 gaagcaatag aagcacttaa agagtatttt agatcctata gtaggccaaa gtgcatagtg 3960 tcggatcgtg gaacgtgttt tacgtctaac gaattttcta gctttttaga gtcttgtaat 4020 atacaacatg tcaaaattgc aactggttct ccgcaagcta atggccaggt tgaaagaata 4080 aatagaagtc taggccctat gttggcaaaa ttattagcct cagaagaaaa tgttagtttt 4140 ggtcaggcca tagagatagt agagtatacc atgaataaca ctattcatag gtcattggga 4200 cagcatccaa gtgtcgtact atttggcgtc ggccaaaaag gacgacaagt agatgaatta 4260 aaagactact tgttagaaga caggtctgtg gaaagaaatt tggagtcggt acgagcgaaa 4320 gctaaagaaa aagaaaagaa acttcaagag tataatgaat gtctagcgaa tcgaaaaaga 4380 cgagggccaa aagtttttaa gccaggcgac tatgtggtag ttaaaaattt tgacactcac 4440 aaaggggtgt ccaaaaaatt agtgccgaga ttcaaagggc cttttaaaat cgttaagtgc 4500 ctcgaaaatg atcgatacgt tataagtgac gtcgacggat ttcagcagtc gaagattcct 4560 tatactggaa tttggtctat cacaaatatg aagccatggt tcaattggtc agtaaatgta 4620 aatgaaaatt ctattaaaga aaaagtcagt atgttaagca aaggtcgtca taaagaagac 4680 taggatcagg tgatccttaa tgtcagaatg gccgaat 4717 // ID Gypsy-164_AA-LTR repbase; DNA; INV; 277 BP. XX AC AAGE02017464; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-164_AA_; KW Gypsy-164_AA-I; Gypsy-164_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-277 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017464; Positions 449 173. XX SQ Sequence 277 BP; 84 A; 64 C; 57 G; 72 T; 0 other; tgtggtgcct accagcctta ctgtttcttt tagtcgcaag atcagcataa atacgcacag 60 tccaaagtat caacgatcgg acgattgtaa ttatgctgac ggcagcgaaa agagtcacgt 120 agctgccaac cacactcgag taagttaaat aaacatattt ttcaattcgt tcattcttaa 180 cgagactgta ctagcgagtg gttcttattc tgagatatca ccatgccagc gggagtagtg 240 aagagtcccc acacaaagtg cagtgccaat cacatca 277 // ID Vingi-2_HR repbase; DNA; INV; 3200 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-2_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3200 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 20..3178 FT /product="Vingi-2_HR_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="MRHSQTSSTSKTARIISHPAYQNTTGFVAHSPGQLTF FT LQWKCNGIKNKKDELQQHLETHKIHIAALQETKLTSRSKEPEFDGYSLYRK FT DRGLGQGGGLVMLIHNNIPYSIKQLPDSNPIESQAIFINTNNKDLTIINTY FT IPPQSVCPSHFTASISDLLSNPNTILMGDLNAHDSLWHSNIQDARGEALAV FT EIDDSDCGSLNLDSPTRLPNNSQPTSPDVSIASDSLISSIDWTVLTTLSSD FT HLPIHITLHADLEPTKTPKTTIVNFNKANWARYTEETETAFAKSVSPNDPS FT AAEKLFRHILSSASARNIPSGRRILSRPGLSREITDLMDQRDSLRSLDPTS FT DNINSLSTVIQQKINNLKLTRWKEYLSTFNKSTNTNRLWSTIKALNGKPKQ FT YPNNAITFNNKPITNNKQMANLFNKTFTSFKTHISDKLNRFTTRTIKNLPL FT DDHQIFTPDQVSTAIKQSKPSKASGPDNLTIIHLKHLGPLGISFLTNIFNS FT SLITCKIPDIYKKAKIIPLLKPGKPDSDSKSYRPISLLCPAVKILEKCLLP FT ILSKHLTLADHQHGFRPNYSTTSALTTITNDIANGLNHKRPAHRTVLAALD FT LSKAFDTVSHRILLSDLLTTTLPRSVIRWMSNYLHGRTASTTFRQQHSKQR FT IIHTGVPQGSVLSPSLFNFYVSKTPAPPPDIKIISYADDFSIYTTGPDAAI FT LTRRLNSYLTELHDFFHNRNLEISTSKSTITLFTTYTSQYHYHPQVKVNNQ FT TLPLEHNPKILGLTFDTMLSFSQHHKMSASKASKRNSILKALSGTTYGQDK FT ETLMQTYKAIGRSVIEYACPAWSPLTSDTSYHRLQRSQNAALRVITGCHGI FT ASEQHLHSECQMLSVKQHTDLLSSQFLLQCHNTSHPCHHITREPRPDRNIK FT HTLFSKYVQTVQTVLPTDTDSRALKAASSALHCSAAKLASETYHAAVLLNG FT WPPPTPKIDKAEESLPRALRRKLAQLRSGHCISLNSYRSKIDSRVSNICPI FT CQLLPDDVPHLFSCPKNPTHFTIIDLWRNPVAVANWLRPRLDPSNDV" XX SQ Sequence 3200 BP; 993 A; 977 C; 506 G; 724 T; 0 other; cagctcggcg gcctctgcta tgagacactc tcaaaccagt tcaacctcta aaactgcgag 60 gataatatct catcccgcct atcaaaacac cactgggttc gtggcccatt cccccggtca 120 actgacattc ctccaatgga agtgcaatgg tatcaaaaac aagaaggacg aactccaaca 180 gcaccttgaa acacacaaaa tacacattgc agcactccaa gaaacaaaac tcacctcccg 240 ttccaaagag ccagaattcg atggctactc tctataccgc aaagaccgtg gtcttggaca 300 gggcggtggt ctggtcatgt taatacacaa taacatccct tactcaatca aacaactacc 360 tgactcaaat ccaatagaat cccaagccat attcatcaat accaataaca aagatctcac 420 catcattaat acttacatac ctcctcaatc ggtctgcccg tcacacttca cagcctctat 480 ctccgacctt ctatccaatc cgaatactat tttaatgggt gacctgaatg ctcacgactc 540 gctgtggcac tccaacattc aggacgcgcg aggtgaagca ctggcagtcg agatcgatga 600 ctctgattgc ggttcgttaa atcttgattc tccgacccgt ctccccaaca acagccaacc 660 aacttctcct gatgtctcga tagcatcaga ttccctgata tcttctatag actggaccgt 720 acttaccacc ttgtcttctg accatctccc tattcacatt actctgcacg ctgatctgga 780 acccacgaaa actccaaaaa caacaatcgt gaatttcaat aaggccaact gggccagata 840 cactgaagaa accgaaactg ccttcgcaaa atctgtatct cccaatgacc cttcggcagc 900 tgagaaattg tttcgtcaca tcttgtcgtc cgccagcgct cgcaacattc cttcaggaag 960 gagaattctc tcacgaccag ggctgtccag agaaatcacg gacctcatgg atcagcgaga 1020 cagtctcaga tccttggacc ctacttcaga caacatcaac tctctctcaa ctgtcattca 1080 acaaaaaatc aataatctaa agctcacccg ctggaaggaa tacctgagca cattcaataa 1140 atccaccaac acaaaccgct tatggtccac catcaaggct ctcaacggta agcctaaaca 1200 atacccaaat aatgccatca ctttcaataa caaaccaata acaaataata aacagatggc 1260 caacctattc aataaaacct ttacctcatt caaaacacac atctccgaca aattaaatag 1320 attcacaacc aggaccatca agaacctccc tcttgatgac catcaaatct ttaccccaga 1380 tcaagtatca acggccatca aacaatccaa gccctcgaaa gcctccggcc ccgacaacct 1440 caccatcata catctcaaac atttaggtcc tctaggtatc tccttcctca ccaacatctt 1500 caatagctcc ctgattacat gtaaaattcc cgacatctat aaaaaagcga aaatcatccc 1560 tctgttaaaa cctggaaaac cagatagcga ctcaaaatca tacaggccaa tatcgttact 1620 ctgcccagcc gtcaagatcc ttgaaaaatg ccttctccca atcctatcaa agcatctaac 1680 tttggcggat catcaacacg gtttccgtcc aaactactca acaacctctg ctctcaccac 1740 gataacaaac gatatcgcaa atggccttaa tcacaaaaga ccagctcatc gcaccgttct 1800 tgccgcccta gacctctcta aggccttcga cacggtcagc caccgtattc tgctctcgga 1860 tcttttaacc acaacactcc caagatctgt tatacgctgg atgtcaaatt acctgcacgg 1920 acgtaccgca tcaaccacat ttcgacaaca gcactccaaa caacgaatca tccacacagg 1980 agtgcctcaa ggctcagtgt tgtccccctc acttttcaac ttctatgtct caaaaacacc 2040 tgcaccaccc ccggacatca aaatcatctc atatgcggat gacttctcta tctacactac 2100 aggtccagat gcagctattt tgacccggcg gcttaacagt tatctgacgg agttacatga 2160 ctttttccac aaccgtaatt tggagatctc aacatccaaa tccacaataa cactctttac 2220 cacttacaca tcgcaatacc attatcaccc gcaagtcaaa gtcaataacc aaactcttcc 2280 tctagagcat aacccaaaaa ttctaggcct tacctttgac acgatgctct ccttctcgca 2340 gcatcacaag atgtcagctt ctaaggcctc taagcgaaac tcaattttaa aagctctcag 2400 tggaacaacc tatggccaag acaaggaaac actaatgcag acatacaagg cgattggccg 2460 atcagttatt gagtatgcct gcccggcctg gtctccactg acttccgaca caagctacca 2520 cagactacaa cgctcacaaa acgctgcttt acgggtaatc accggatgcc acggaatagc 2580 atcggaacaa catctccact ctgaatgcca gatgctatca gtaaagcagc atactgacct 2640 actatcatct caattccttt tacaatgcca taataccagt cacccttgcc accacatcac 2700 aagagaacca cgacccgaca gaaatataaa acacacatta ttcagcaaat atgtacagac 2760 agtgcaaacc gttctgccca cagataccga cagcagagcg ttaaaggcag cttcatctgc 2820 cttgcattgc agtgctgcga aactcgcctc ggagacatat catgccgctg tgctgctcaa 2880 cggatggcca cctccaacac ccaaaataga caaggcagag gagagtttgc ccagagcgtt 2940 gagaagaaaa ctggcccagc tccgctctgg tcactgcatc tccctcaaca gctacagatc 3000 gaaaatagac agtagggtca gcaacatctg tccaatatgc caattacttc ctgatgatgt 3060 tccccatctg ttcagctgcc caaagaaccc aactcatttc acgataatag atctctggcg 3120 taacccggtg gccgtggcta actggctacg gcctcgtctg gatccctcca atgatgtctg 3180 acgcagggca acaacaacaa 3200 // ID Gypsy-17_CQ-I repbase; DNA; INV; 4144 BP. XX AC AAWU01030073; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_CQ_; KW Gypsy-17_CQ-LTR; Gypsy-17_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 413-413 (2011). XX DR Genome; AAWU01030073; Positions 7242 3099. XX CC Positions [3089-3547] - Integrase core CC 'CCAGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 90..1253 FT /product="Gypsy-17_CQ-I_2p" FT /translation="MQSAFTVEPFDRHKMKWSRWVERLEGVFLLFGIQDGA FT KRPMLLHYMGAETYDAISDKLAPVKPQTKTYQEIVELLEEHFNPEPLEILE FT NFRFKSRKQCDERAEESVSDYLTALRKLAITCNFGNYLDTALRNQFVFGIK FT DRGIQSRLLEVRGLTLARARDIAVSMEASAKGGQEIHARQSRPEMNLVEHP FT PTKPAKGKAYEKSAGKKSEEKKTCYRCGSAEHFANECKHAHTVCNFCKVKG FT HLQSVCQKKSAGQKGRRSDAHQVEESAGADRKCREKVLVEEICGVSDAADF FT NNKVDKFWLELSVNSAKVKFEIDSGSPVTVMSVWDQEKYFPTVELSSPDTG FT FVSYSGSQSNCAGCVPSGWTTVVKATNCCCMLRRTKSTLCWVEVG" FT CDS 1157..4111 FT /product="Gypsy-17_CQ-I_1p" FT /translation="MCAVRVDYGGESHELLLYVAKNEKHPLLGRSWMKVLK FT IDVVEFYENVHTVVENSAAAVQTLIERYGSVCEKSMGKIEGLTAKLQLKPD FT ARPVYLRARPVPFSIKEAVEKEIKGLVESGVLVKVNHSAWATPVVPVLKLN FT NRVRLCGDYKITVNPNVVVDDHPLPTVEELFANVAGGDMFTKIDLSQAYLQ FT LEVDPEDQEILTLSTHLGLFRPTRLMYGVSPAPAIWQRLMEEVVNGIPGVT FT VFLDDIRITGPNDRVHLERLEEVLKRLSQYNMRINLEKCQFFADRIEYCGY FT LIDRDGIHKVRKKIDAVQDMPVPENKDQVRSFVGLVNYYGRFFPHLSTTIY FT PLNCLLRNDVPFRWTKECDEAFKKVKDEMQSDRFLVHYSTELPLVLATDAS FT PYGVGAVLSHIMPDGTERPIMYASKTLNETQRRYKQIDREAYAIWLVTDNE FT PVKQIFSETKGLPAMSALQMQHYAAYLASFRYEIKFRPTREHYNADAFSRL FT PISAQTPEYVVEEVDLLEVNMIETLPLTVGDLAKSTAADPTVKVLLQGLKN FT GKTVEARDRFGIDQNEFALQQGCIMRGIRVYVPPELRQNVLAELHSTHFGS FT TRLKSLARGYVWWERIDKDIEELIRNCAACQVTRADPAKVPLHCWETPKGP FT FDRVHVDFAGPFMGTYFIVFVDAYSKWPEVKILRDISTSTTIHACREFFAA FT YGIPAVLVSDHGVQFTSAEFQHFLKQNGIFHKMGAPYHPATNGEAERFIQT FT FKGKMKALNCDKSRMHAELCNILLSYRKTIHPATGKSPSMMLFNRQIRSRL FT DLMVPNAAVVEEKPEVAVRSLPEGARVAARDFLSKDKWKYGHVAEKLGKLH FT YNIKLDDGRLWKRHIDQLREVGANLPSGRPEIPAECFEPDIVPVAPRTNPV FT QHTPQAAAAVPTPIAEPQSSRSAAATPKPPEPPAVPKPKSQGPLAVPKLKP FT TNAAGRSEANNQQSPRRSTRVIKPPRRLDL" XX SQ Sequence 4144 BP; 1031 A; 1068 C; 1261 G; 784 T; 0 other; gttttggcga cgatggcgga cgaaaaccta cctccgggta acgatcaagg tgatggccag 60 gccggggcca agctccggtg ccaattgcga tgcaatcggc gttcacagtc gagcctttcg 120 accgtcacaa aatgaaatgg tctcgctggg tggaacgctt ggagggagta tttcttttgt 180 tcggcatcca agatggcgcg aagcggccga tgctgctgca ctacatgggt gcggaaacgt 240 acgacgccat ttcggataaa cttgcgccgg tgaagccgca gacgaagacc taccaggaga 300 tcgtggaact gctggaggaa cacttcaacc cggagcccct tgaaatcctc gagaattttc 360 gcttcaagtc tcggaagcag tgcgacgaac gtgcggagga atcagtttct gattacctca 420 cggcgttgag gaagttggcc ataacctgca attttggcaa ctacctggac acagcgctac 480 ggaaccagtt cgtctttggg atcaaggacc ggggaatcca gtcgcggctg ctggaagtgc 540 gcggcctgac tctggcccga gctcgtgata ttgccgtgtc aatggaggcg tcggcaaagg 600 gtggacagga gatccacgcg cgtcagagtc ggccggagat gaatctggtg gagcatccac 660 cgacaaaacc ggcgaaggga aaggcgtacg aaaagagtgc gggcaagaag agcgaggaga 720 agaaaacctg ctaccgctgc ggaagtgcgg agcatttcgc aaacgagtgc aagcacgccc 780 acaccgtctg caacttctgc aaggtcaagg gtcacctgca gagtgtgtgc cagaagaaaa 840 gtgctggcca gaagggccga agaagtgatg ctcaccaggt ggaggaaagt gcgggcgcag 900 atagaaagtg cagagaaaaa gtactagtgg aggagatctg tggagtttcc gacgctgccg 960 atttcaacaa caaagtggac aaattctggc tggaactgtc cgtgaacagt gcgaaagtga 1020 agtttgaaat cgacagcggc tctccggtta cggttatgag tgtgtgggac caagaaaagt 1080 actttcccac ggtggaattg agttccccgg acacgggctt cgtgagttac agtggatcgc 1140 aatcgaattg tgcgggatgt gtgccgtccg ggtggactac ggtggtgaaa gccacgaact 1200 gctgctgtat gttgcgaaga acgaaaagca ccctctgctg ggtcgaagtt ggatgaaagt 1260 gctaaaaatc gacgtggtcg agttctacga gaacgtccac acagtggttg aaaacagcgc 1320 cgccgcggtg cagacgctca ttgagcgcta cggaagtgtc tgtgagaagt cgatggggaa 1380 gatcgaaggc ctcacagcga agttgcagct gaaaccggat gcgcgtccgg tgtacctgcg 1440 tgcgcggccg gtccctttct cgatcaagga ggcggtcgaa aaggagatca agggcctggt 1500 ggaaagcggt gtgctggtga aggtgaacca cagtgcgtgg gccacgccgg ttgtcccagt 1560 gctgaagttg aacaaccgag tgcgactttg cggggactac aagatcactg tcaacccgaa 1620 cgtggtggtc gatgaccatc ccctgccaac agtggaagag ctgttcgcaa acgtggccgg 1680 cggggacatg ttcacgaaaa ttgatctctc gcaagcctac ctgcagctgg aggtcgaccc 1740 ggaggatcag gagattctca cgctgagcac gcatctgggg ctgtttaggc cgacaaggct 1800 gatgtacggc gtcagcccgg cgccggcgat ctggcaacgc ttgatggagg aggtggtgaa 1860 cgggattcca ggtgtgaccg tcttcctaga cgacatccgc attacgggac cgaacgatcg 1920 ggtccacctt gagcgcctcg aagaggtgct gaaacgactg agccagtaca acatgcggat 1980 caacctggag aagtgccagt tcttcgccga ccgaatcgag tactgtggct acttgattga 2040 ccgcgacgga atccacaagg tgcggaagaa gatcgatgca gtgcaagaca tgcccgtgcc 2100 ggagaacaag gatcaagtcc gttcgttcgt tggattggtc aattattacg gtcggttttt 2160 cccacacctg agcacgacca tctacccgtt gaactgtctg ctgcggaacg acgtgccgtt 2220 ccgctggacg aaggagtgcg acgaagcgtt caagaaagtg aaggacgaga tgcagtcaga 2280 ccgcttcttg gtgcactaca gtacggagtt gccactggtc ctggcaacag acgcttcgcc 2340 gtacggggtc ggtgcagtac taagccacat catgccagac ggtacagaac gcccaatcat 2400 gtacgcgtcg aagacgctga acgagacgca gcgcaggtac aagcagatcg atcgtgaggc 2460 gtacgcgatc tggcttgtca cggacaacga gccagtgaag cagattttct cggagaccaa 2520 gggtctgccg gcgatgtcag cgttacagat gcagcactac gctgcgtacc ttgcttcgtt 2580 ccggtacgag atcaagtttc gtccgacgag ggagcactac aacgctgacg ccttttctcg 2640 attgccgatc agcgcacaaa cacccgagta cgtcgtcgag gaagttgact tgctggaggt 2700 caacatgatc gagacgctgc cgctgacggt cggagatttg gcgaagtcga ctgctgcaga 2760 tccgacagtg aaggtgctgc tgcaaggtct caagaacggg aaaacagtcg aagcacgtga 2820 tcgttttggt atcgaccaga acgagtttgc gctacaacaa gggtgcatca tgcgcggaat 2880 ccgggtgtac gtcccaccag agctgcggca gaacgtgctc gcggagctgc actctacaca 2940 ctttgggagc acacgcttga agtcgctcgc cagaggatac gtgtggtggg agcggatcga 3000 caaggacatc gaggagctga tccggaactg cgcggcctgt caggtgacac gggccgatcc 3060 tgcgaaagtg cctctacact gctgggaaac gccgaaggga ccattcgaca gagtgcacgt 3120 ggatttcgct ggaccattta tgggtacgta cttcatcgta tttgttgacg cgtactccaa 3180 gtggccagaa gtgaagatct tgcgagacat ctctacgagt acgaccatcc acgcctgtcg 3240 tgaattcttt gctgcctacg ggatacctgc cgtgctcgtg agcgaccatg gagtgcaatt 3300 cacgtcggcc gagtttcagc acttcctgaa acaaaatgga atcttccaca agatgggcgc 3360 accgtaccat ccggcaacca acggtgaggc agagagattc atccaaacgt tcaagggcaa 3420 gatgaaggct ctcaactgcg acaagtcgag aatgcacgcg gaactgtgca acatcctact 3480 cagttaccgc aagaccatac atcctgccac cggaaaatcg ccttcgatga tgctgttcaa 3540 caggcagatc cggtctcgac tggacctgat ggtcccgaac gctgcagtgg tggaagaaaa 3600 gcccgaagta gccgttcgat cgcttcccga gggggcgaga gttgctgctc gtgacttcct 3660 gagcaaggac aagtggaagt acggtcacgt tgctgaaaag ctgggcaaac ttcactacaa 3720 catcaagctg gacgacggtc gactctggaa gcgccacatc gaccagctac gagaagtcgg 3780 agcgaacctg ccgtctggac gtcctgagat accagctgag tgtttcgagc cagacatcgt 3840 gccagttgct ccacgaacca atccagtaca acacacgcca caagctgctg ccgctgtacc 3900 tacaccgatt gctgaaccgc aatcatcgcg gagtgctgcg gctactccca aaccgccgga 3960 accacctgcg gtaccgaagc cgaaatcgca aggaccactt gcggtgccga agctgaaacc 4020 gacgaacgct gctggtagat ccgaggcaaa caaccagcaa tcaccgcgac gatcgacaag 4080 agtcatcaag ccgcctcgga gactggacct gtagagagaa gtacaaaccc aaggggggaa 4140 gagt 4144 // ID Copia-38_DPu-LTR repbase; DNA; INV; 1741 BP. XX AC ACJG01004237; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-38_DPu_; KW Copia-38_DPu-I; Copia-38_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-1741 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004237; Positions 10054 11794. XX SQ Sequence 1741 BP; 399 A; 381 C; 510 G; 451 T; 0 other; tgttggaaaa aggacaagcg gttgctgaaa ccctttttgt tgccaaaccc aacagcagca 60 gacgactctc atctctacca ccagttgtaa agattgagtg tcatgctcac tgttcccttt 120 ttctctaagt gtttcacttc tgaatttttt cttcaacaca tcaggttcgt cttatttaac 180 gtgtctttta cgtgttaata aattttctct aacatgtaag atatgtttaa ataaaacttt 240 gtggtgaaaa gagtaagtgt actcatgtta aactgtcatt ctgtctaata atttcaacag 300 gtggccttga ttccgttgtg gtgagtgtag aggtggctga ttataccatc cgccacaccg 360 gtcgtagggg tgccgttcgc atctgcacga gttctagcac aggtccagct acacagaggg 420 cacttgctgg gtcgaggagg gtagagtagg atgactttgt cgcccagaag ttgagtttcg 480 ccagcagggc tagttgagtc aagttgatcg ggtggggcca atccaggaca cgctttttcc 540 tgatgaaaga aggcagcctg aacggtcttg gagatgttgc agcagtagct gcattcaacc 600 agaacgttaa ctgatttacc gtggaaagtc cggaagtgtc gaatggcgtc cgagatgcag 660 tttcggccag accatcgacg ggaggttcca caggtggagc agaagacttt tgctggaagg 720 ggatgggcta gttgtatgga gccgtctgtg atgcagatgc tcatcccagg gtcccgatct 780 ggtggccgac gagaaggccg actggagtga ttacgacggg ccggtctcgt tggtgtagat 840 gaggtgcctg gtgatgccgt cgagggtctc gtcaaacgtg cagaagtaac tggtgagagg 900 gcacgggtag tgatggtgga cggtgacggt gcccaggatc gatcacggct actgccagtt 960 ggagagcgag ttggagattc cacgtagtta ggtggtaggc gagcgttaac agcagcagtg 1020 gttgaggtta ggctttgacc tgggcgcttt gcgcggttgg tgaccgaggg gtagcagcca 1080 ctggagcatt acttctttct gttgccaacg gaggtgaagg tgaccgggca ggctgggttg 1140 ccgcaagcct ttgtcgtgga ggccatggct gtggtggccg ggtgtagtag ccgggtgtag 1200 tagctgggtg tagtagccgg gtgtggcagc tgggtgtagt agccgggtgt agtcgccggt 1260 gtggtaatca ctcaaagaaa atattcagag acggagacgg acggtgtatt gagtctaaat 1320 cgagggtgga cttctcactc taccgttcac tcgtatcgtg aaagcctctc acccgttcga 1380 taatagctcg gactggtcga gaaggatcga tgagaagctc gtcaactgaa cgtgccctac 1440 cccgcttgac ggtaggttat ggttcagttc agtccgtcgc ttctctagcc gttctctcca 1500 aagtagaaat gaataaagaa tttcccttat attcaaatct ttttccagtg ccagtttcac 1560 actcaccttc gtctttcacc tcacttttcc gggaacggac cgtagaatga aatgagaaat 1620 cttctttctc cttcgagcca gaaccgaaca cgtcgagagt gacccaattt atcctctgtt 1680 ccctcaagtc agaactgaac gagccgagta tgaccgagac caggacgaca cgagacgggc 1740 a 1741 // ID piggyBac-N3_BF repbase; DNA; INV; 931 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-N3_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSDs; piggyBac-N3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-931 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-931 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-931 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-N3_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX SQ Sequence 931 BP; 289 A; 176 C; 167 G; 299 T; 0 other; ccctatccag actgggcttt ttgggccttc cctggcctgg gggggggggg gggggctcat 60 tcgacccccc cccttcgtat tttcaaaacg gcttactgta cgatcaccaa atttgcaggg 120 ggggttgtgc tcatcgagtt ttataattcc tgaaattttt atatcattat gacgtaatat 180 gacgtaatta tgacgtcatt aattgatatt tttggctaaa acggggaatt cccataatac 240 ctaaatatct cactcaaaaa gttaccaata aaggtttctt attaatattt aagtgttata 300 agacttctgt tgtcaaaagc cgacgcaacg acgtcaaaat gacgtcatag aatgacgtca 360 tctgtagttt agctatccgc catcttggat tatcaacctt aaatttttca agatacattt 420 tttttcaaca tataacgcta aaaccccatg gaaatggatt aaaaacgttc acatgaatgt 480 tttctgtaag aaaataccac gtagtgatga aatctgagtg aaaataagct tgttatactt 540 taaatcatga tattgtcggc catcttggat tttgaccaat tacgtcatct cattagcata 600 atttatgaat atttaattat aaatgtttaa ctaagatgtg ttagatatta aagatatatg 660 aaaagaagtt tagtttagca agaaaatctt aaaatcccat tgtttaaacc aaaattagtg 720 atttttgtaa aatttgcctt tcagaaaccc gttgccatgg caacacgaaa aatcataaac 780 ttacccaatt ttttttaaaa ttgttgccaa caatatttta ggaaaactca ccaagtttgg 840 ttgttctagc acaagccgtt tgggagctat aggaggtcaa agttggcgca ggcactttta 900 gccccccccc ccccccccag tctggatagg g 931 // ID hAT-3_AP repbase; DNA; INV; 4178 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-3_AP. XX NM hAT-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4178 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1365-1365 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1785..2417 FT /product="hAT-3_AP_1p" FT /translation="VCVRYAVLQEVNIVSKSXQGPNTNLDTYATLLDALIK FT FMDEYRETGFNQAKXEAKELVQSLEIETEFKVPRYRKKKLFEYYEGNTNYT FT TDPENDFRRSYFLELMDNAIQSIKKRFDQTSYYYNNNFGFLYKIGKLRTST FT QEDIRKNCXDLAMFLKVGDEKDICXCDLFDELMICRSXVDFSGVTKGELKG FT LQTPPPPPEMLRKLKNDFNGL" XX SQ Sequence 4178 BP; 1460 A; 620 C; 704 G; 1335 T; 59 other; ccagacggag cagcttttgc cgcgcatgag tatgaataca taaatacatt ttccgcagct 60 ttttttataa tataaaatct cgatccgtgg gatcgccccc aagggttact ttttatgtaa 120 tatcaggtgg ggctatagcc cccgcccaaa aaaaaaaaaa ttagccccag ttgcgccact 180 gtacaattcc ctacattaca aacaactgaa tcaggtaaag aaaaagaata cttagggatt 240 tctgaagtac aaatattttc ttcttctgat cttaaataca tagatgaacc gttacccgaa 300 aacttaatga taacagctct aacacaatat caattacaat gaagaatatt caagtacaat 360 cccctacatt acaaataact gaatcagata aagaaaaaga atacttaggg atttctgaag 420 tacaaacatt ctcttcttct gatacaaaga aactaatatg ttggatacaa atagtttgtg 480 tcaaganaaa aaacttagtg tgatagaccc cagcaacttg gactgataat atgacgcaac 540 atataagana cgaaatcgtt aaattcggtc cataccaagt gaagaatttt gattttccat 600 actcggttga ttgacggcgc aaaaagaaaa ttctcnaann attattacta caagaaaatt 660 gncaaatggt gaaattgtag anagacattg gctngtntat tcaaaatcga gtgatcgggt 720 attttgtttt tgttgtaaac ttttttcttc ttccgganca cagcaatctt agccattgtc 780 aactataggt accaacgatt ggaaacgttt atctgaaaat ttaaaatctc acgaacgttc 840 tagcctacat attgacaatc tgtacagata tggacggaac taaaantgag aatttcggaa 900 aattccacta tngataaagc acatcaancg gttatggaaa aagaaaaacg acattggaaa 960 aacgtgatga tccgcatcat agccgctata cagtatttta cacttacgnc gaataaaaaa 1020 taaagaaact cgtgtncatt atcttggnca cganatncaa aacgaactaa ttnanttaat 1080 ggcaacaaaa gtaaaaacaa atatattacg aaaaataaaa tcggcnaaat attattccgt 1140 nattatggac tgcacnccag atatcagcca acgtgaacag ttatcnntga ttattagaat 1200 tgttgacatg aattttgaaa atgaagncac taatccggaa attaaagaat actttatnga 1260 ttttgttaat ataanttcta gtactggntt gattatcaga agtgctttta nanaaattag 1320 gtgagtatca aatacccgtg ggaaactgtc gaggtcaagg ctatgacaat ggcgcnaata 1380 tggcgggtca atataaaggn gtncaggcta gattactaaa tcaaaatcca agggcatttt 1440 tatgccatgc gctgcncata gnttaaattt agtatttggn gacgctgcca aaagttcaac 1500 cagagctata catttttttg gnacngttca aagaatntac gctttatttt ctgcatctac 1560 aggaagntgg ganattttna aaaaacattg tcaccgttgg actgtnaaaa agtggtcaga 1620 aacncgatgg gagagccggc atgacagcgt aaaggcagtn cgatttcaan ttaaagaaat 1680 natggaagct ttggacgaag tttcagaaac cacaaatgat tctttaatta aaagcgaaac 1740 gcattcgtta gcnnntgaaa tnagtggnta tgaatttctt ataagtttgt gttcggtacg 1800 ctgtactgca ggaagtaaat attgtnagta aaagtntaca aggtccaaat accaatttag 1860 atacatatgc aacattactc gatgctctta ttaagtttat ggacgagtac agagaaacgg 1920 gatttaatca agctaaantt gaagcaaaag agcttgttca gtctttggag attgaaacag 1980 aatttaaagt tccgcgatat cgaaaaaaaa aattgtttga atactatgaa ggtaatacaa 2040 attatacaac agatccggag aatgatttta gacgttcata cttcttagaa ttaatggata 2100 atgcaattca gtctataaaa aaaagatttg accaaacatc atattattac aataacaatt 2160 ttggattttt atataaaatt ggaaaacttc gaacgtctac gcaagaggat attagaaaga 2220 attgtangga tcttgcaatg tttttaaaag taggcgatga aaaagatatt tgtgantgtg 2280 atttatttga cgaactcatg atttgccgaa gtnttgtgga tttcagtggc gtgaccaagg 2340 gggagctgaa ggggctgcag accccccccc ccccccccga aatgttaaga aaattaaaaa 2400 atgattttaa tggtctatga tagtcactca aaaagtctta tattgtattt aaaatattta 2460 aaaatgtatt acttatacag tggcgttccc aggaatttta aatgggggaa caccattttt 2520 tcttgaagta taaaattata tcgtaactgt gagtctacgc acattgaaaa catacaagct 2580 gtgagattga atttaggtct gagcggagat tatgtgtttt atttttaatt tttttgtcag 2640 caattacttt ttgaagcagt aaaaatgctt caatctcaat atcactattt tttgtagaca 2700 tttgacattt tcaaattttt ttttttagtg tcgattaaaa attatggact tgtaaaaaca 2760 tggataattt aatgcaagat accacataag ttgtgtctaa ctttttttct tattgaataa 2820 tgttactttc gaatcgtaaa tcgtaatccg ttgacagtat aaactaattt atgcactagg 2880 aaattaattt tataggaata aatattataa ttttggaaac ataataatat aaaaggtata 2940 ttgttggagt attggacttg gacatacata aaatactgta attatatact gaacaagatt 3000 attttcgaaa tcatatttgt taagatttta ttttcataga aaatattttg ttgggaatat 3060 attagatata ttaattacct attgggttac gatataattc tttattttat tataatttct 3120 aaggtccaat agccaatgct ttatacctaa tatattatat tgctatagga atataaatta 3180 ttatgtggtg tgtgcgaaag agaaaataga taatattacc taaatagaca ataccccctg 3240 ccaccactgg cctaaaatta aaagtaaaaa tatcacacaa tatacaaatc tagataaatg 3300 aatggtgcct tttcttagat atacaaatca atattttttt ttatcaatat taataatcaa 3360 aatcaaaata tattatatta tatacaattt actagtacaa atattacaaa tataacatta 3420 aataataaaa attaaaacaa tcagatattt taacctaact aatattattt ttagtttagt 3480 aagattgttc acggactaaa acaccggtac caccaataaa ctcgcgaaag aaacatcgca 3540 ataccgttat gctttgctaa atactgtata gcggctatga tgcggatcat cacgtttttc 3600 caatgttgtt tttctttttc catagccgat tgatgtgctt tatctatagt ggaattttcc 3660 agtccaagtt gctggggtct atcacactaa gttttttatc ttgacacaaa ctatttgtat 3720 ccaacatatt attttctttt tagcagaaga agagaatgtt tgtacttcag aaatccctaa 3780 gtattctttt tctttatctg attcagttat ttgtaatgta ggggattgta cttgaatatt 3840 cgtcattgta attaatattg tgttagggct gttatcatta aggttttcgg gtaacggttc 3900 atctatgtat ttaagatcag aagaagaaaa tatttttact tcagaaatcc ctaagttttc 3960 tttttcttta cctgattcag ttatttgtaa tgtagggaat tgtacagtgg cgcaactagg 4020 gctaaaattt ttgggggggg ctgtagcccc acctgatatt acataaaaag taacccgtgg 4080 gggcgatcct acggatcgag attttatatt ataaaaaaag ctgcggaaaa tgtatttatg 4140 tattcatact catgcgcggc aaaagttgct ccgtctgg 4178 // ID R2_FA repbase; DNA; INV; 3513 BP. XX AC AF015819; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Forficula auricularia retrotransposon R2, complete sequence. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_FA. XX OS Forficula auricularia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Dermaptera; Forficulina; Forficuloidea; OC Forficulidae; Forficula. XX RN [1] RP 1-3513 RA Burke D.W., Malik S.H., Lathe C.W. and Eickbush H.T.; RT "Are retrotransposons long-term hitchhikers?."; RL Nature 392(6672), 141-142 (1998). XX RN [2] RP 1-3513 RA Burke D.W., Malik S.H., Jones P.J. and Eickbush H.T.; RT "The domain structure and retrotransposition mechanism of R2 RT elements are conserved throughout arthropods."; RL Mol. Biol. Evol 16(4), 502-511 (1999). XX RN [3] RP 1-3513 RA Burke D.W. and Eickbush H.T.; RT "R2_FA."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX RN [4] RP 1-3513 RA Burke D.W. and Eickbush H.T.; RT "R2_FA."; RL Direct Submission to Genbank (09-SEP-1998)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015819; Positions 1 3513. XX SQ Sequence 3513 BP; 1084 A; 813 C; 761 G; 855 T; 0 other; tagtccaaaa gccttgtcat cttaggatga cgcgcaggtt tttgagatgt atatttttat 60 agtagtagta aaatacgact tgttaaattc ttaagataaa gagtcatagt agcaagtgaa 120 gaccttttcg tggttcctgt cataagtacc ctacggggaa accaacttca cccctgcgaa 180 catggagctt agaggttacc agcctcctcg ctcgtcaatt agcttggtcc cagatcaata 240 aaccagcctc ggataaggac ggattcaccg aatatcgtta gaccctctgg ttcaacgacg 300 accatgcaaa gatgcaccac gtcagacttg actttctccc aatgtagatt tcctcattgt 360 aaattcagaa gaccatctct tactggtgtc agggttcatg aacaaagatc tcacaaggcc 420 ttttttgaca gactgcaggc tgaagtaatt cgaaatcaga cgtcaaaaaa gaaaccgaga 480 tggaccgaag aagagaagaa cctgttagca cttgcgcagg cgaatctcat aatcgaagaa 540 gaaactaata ttatcgatga cttggtctcc aagttcacat atagaaccaa agatgccctt 600 aagagccaaa tcaggaaacc cgaacacaaa acccgagtca gtgagttcac tttggcaatt 660 caagcccata tagacaatat tatgctcccc gctcccgtcc ccatagctac tgatccgttg 720 caagtctctt gcaacttcaa ggacagagca aaagactaca tcgacacatt agagcccata 780 acttcaacta aattctattt agacgagtta gaggctctgt gtgacaatat ttgtatatgg 840 ccgactaggc tacttatcac aacagtggag tcttacatac gaaaattgtt taaaacaaca 900 agtacgttga agccagcaca gaagttacat aatccttcaa acaggaatct tccaaagcgc 960 cagctaagac gtatggagta tggtaagacg cagaagctgt ggaaaaagaa cccgtgtcgt 1020 gccataaaaa caattattga cgataaggac tgtaagtcgc cacccgaaag ggaggcgatg 1080 acgcaatact ggaagacaac gttttccagt aaaaagcgga catgtcccca gtatgagcct 1140 cgcgaaagta ccaaaaccca gttgtgggaa ccggttacaa tcgaggagct tcattgctgc 1200 caccttgaaa tgacaaccag cccagggcct gacgggatca ctgttagaca gctttacctc 1260 gttccggagc aattgcttgt cagaatatta aatctcctaa tggcctgtgg caaaatgcct 1320 gatagcttct tggaatccaa aaccactctg attccaaaga agcccaactc tactgaaccc 1380 ggcgattttc gcccaatcac agttcagtca gtgttggtca ggcaactgaa taaaatccta 1440 gccgccagag tagcgcaaca catcccacta gacgaacgtc aacggggatt tagaccagta 1500 gacggagtcg ctcacaacat cttcgaatta gacatgatac ttaggtgtca tcgatctgaa 1560 ttccgcgatc tccgacttgc ctccctcgac atagcgaagg cattcgactc cattacacat 1620 aacaccattg aggataccat ggaagtaagg ggtttcccta aacccatgat taactacata 1680 atggcttgtt atcgccgcag taaaacgaga ttcaccttta atggatggat ctcagatact 1740 gtgaaaccca cttgtggggt taaacagggt gaccccttgt caccaatact gtttaatcta 1800 gtaatggaca gaatgatccg gaagctaccg aaggaggtag gagtcaatgt aggcagtaag 1860 cactataatg gtcttacatt tgccgacgac ctactgctct tcgcgactac tcctgaaggc 1920 ttgcaatcct caattgacat tgtccacctc ttcctcctgg aatgtggact attgatcaac 1980 aagcaaaaat cctttgtcct tacggtcaaa gcatacccaa agctcaagaa gacggcggtc 2040 atcgtcacag agaagtatat gttagaccgt catattcttc ccgctattga tagagagaag 2100 ctattccact atcttggcgt tccattcaca gctgaaggaa gatgtagaga cgataccata 2160 gcccacctca aacgaaaaat cgatgtgctt acaaaagcac cgcttaagcc ccagcagaga 2220 ctattcgcac ttagagttgt catcctccct agctgttatc acatcctaac gctaggagga 2280 tcgaacctaa gtctgctaaa gaaaatcgat ttgatggtaa gagctgcggg taggaaatgg 2340 tgctgcctgc caaaggacac gcccaacgca tattttcatg cgtcaagccg cgatggagga 2400 ctaggtctgc cctccatgag atggcttatt cctctgcaca ggtatctaag acttttacga 2460 tatgagggaa gaaacccaga agacactaat gtgtacttga cgactgaaat caatagagca 2520 aaaattagac tctccgacaa tggctctaac attgattgtc aagccaagct ttggcaattc 2580 tgggcggacc gattgtataa gtccgtggac ggctctgccc taatcgaatc cagtaaagta 2640 ccacaacaac atcggtgggc tacaggtggt agcagatttt tgacgggacg ggattttatc 2700 aactcaatta aattgagaat aaataccctg ccgacactat ctaggacgct acggggtcgt 2760 gaaggcaacc gaatgtgcag aggtgggtgc tacaatgtag agacgttaca tcatgtccta 2820 caggtatgtc accgaacgaa cggaacacga gttaaacgac ataacgccat caggcagtac 2880 attgcgcggg gtgctgctgt caaatttgac acagtcgaga gagagccccg tataaagtct 2940 gcgtctggtg cagttaacat cccggaccta gtggcttgta acagcgatga agtggttgtg 3000 atagataccc aaatcgtctg ggaccaggcc aacctggatg aagctcacca agccaaagcg 3060 gagaagtatg ctcacttaag tgacatactt aagcataaat atagtagaga tagggtaaag 3120 tttacctctg taacgctctc cttcagaggc ttatggagta aacaatcgct gaaggaactg 3180 actgacctgg gaattgtcaa ttccaaagac attcaaatca tttctaccag agctataata 3240 ggaggtattg catcatttag gatgtttaat tcaacaacct cagtaaattc agtaaattcc 3300 tttctagaaa tcgcattggg ataggatgat agcgcacctg gtcatcgtct ctctcagctg 3360 ctcacttgct gttctaagtg ataataccgt tgttttttta gtgggtattc ttttacgctt 3420 tcgtaggagc gagtcccaca ctcttggagc aatccggggt agtgcctaaa cgcatttctt 3480 caacgtaaaa aaaaaaaaaa aaaaaaaaaa aaa 3513 // ID TWIN2_CP repbase; DNA; INV; 230 BP. XX AC AF282725; XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Culex pipiens Twin-Cp2 SINE retroposon. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; TWIN2_CP; KW putative SINE element. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-230 RA Feschotte C., Fourrier N., Desmons I. and Mouches C.; RT "Birth of a retroposon: the Twin SINE family from the vector RT mosquito Culex pipiens may have originated from a dimeric tRNA RT precursor."; RL Mol. Biol. Evol 18(1), 74-84 (2001). XX DR Genbank; AF282725; Positions 368 597. XX SQ Sequence 230 BP; 56 A; 46 C; 70 G; 58 T; 0 other; gccgagcttc cgtggccgtg aggttatggg tttcgccttg taagcggaag gtgatgggtt 60 cgattcctgt ctggctcggc aaagtcagat cccttcaaag agtaaatctg ctcactggga 120 atactgaccg gtaggggatg ggtttcgact aacggcgtgc tggatttcca atccagaggt 180 cgtgagttcg attctcgtac cgggatgacg aggtttaaaa aaaaaaaaaa 230 // ID Kiri-11_AAe repbase; DNA; INV; 4282 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-11_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4282 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 706-706 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 17 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 313..1116 FT /product="Kiri-11_AAe_1p" FT /translation="MSNLQRTPPRTVNNAPNKRLKRDEEGDTDITLNSAVN FT MLMSQFRETREMIDDFRKDINNKIDAVKTELEGKLNLVTQEIGSFKLECAA FT KFGSNDVALSTMHERVDQLSHTIGNLQNRSELIISGIPFSSEEKLSEFFVA FT MCRQLGFDGDAYPAVDVRRMKARATLKDGDECLIAIQFALRNSRDDFYNAY FT LRTRDLKLRHLGLESDRRIYVNENLTVTARKVKVAALRLKKAGKLSSVYTK FT QGVVHVKSTAESHPVVVNSEKDLVLFS" FT CDS 1275..4121 FT /product="Kiri-11_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSRTVDQTSTNIPGAVLNAVLLSGKFNICHGNAQSLC FT ARKTSKLDELRHVLSKSKVDVACFTESWLTSRITDRSVAIPGYRSVRNDRV FT YKRGGGIIVYYKEHLSCLKVMSTEVSTESNDKTECLALEFCLCGEKFVVVT FT VYNPPENDCSCFISDMMVEFTSRYNGIFMVGDFNTNLLRPCGRTSRLLSVL FT NGFSMKSVGDEPTYFCDESCSQLDLLLTSDSEKILRFGQVSFPALSHHDLI FT YGSMDYDTVDAPTFSTYHDYVHFDAQRLQDAVLSTPWADFYSINDPDELLQ FT FFNDHMKIIHDTCIPCRTISHRKKFNPWFTPAIRRAMLERDLSYKDWLKAP FT SQLKAQKRLQYKTLRNRVNIMITKAKEHHMERFLDCRLPSKTLWKRVKDLG FT IGKDNETKPCEFDPDEVNSVFMTNYINDIGNRTSPIHVPASPYNFSFKRVQ FT VWEIVNAIWDIKSNATGLDELPIKFLKLILPLIAHHIAYMYNKFIELSFFP FT SSWKHAKVLPLRKKPHLNAISNLRPISILCALSKAFEKLLKKQMTSYIESN FT GYLSDSQAGFRKMQGVKTAVLRVHDDLSALMDKRGTSILLLLDFSKAFDTV FT SHKKLCSKLVRQFNFSTPAVGLIESYLTNRSQTVCCKGRYSESKTVMSGVP FT QGSVLGPLLFSCYINDLPTVLKHCSMQLYADDVQLYIGRIGPCARELTAMM FT NADLERIMEWCNRNGLRVNQSKSVALLIRNRRRRVESAEPLPEMMMDGLPI FT KWANSVNNLGYVFQNDLQWDGLALQQCGKIYAGLRSLRYCAEQTPVTTRLK FT LFKSLILPHFIFGDSFLVNPGAETMNRLRVALNCCVRFVYGLSRFAHVSHL FT QKNLLGCPFQNFYAYRSCMFLRKLIKIKNPPALFQKLVPFRGQRSQNIIIP FT ANRTLTYASSMFVRGIVNWNSLPHSIKRETSEVAFKEKCLNFWSR" XX SQ Sequence 4282 BP; 1295 A; 768 C; 924 G; 1295 T; 0 other; ttatgtgtgt acaagataca atgtgccacg aatagaaggt gctgcggcaa attttaaatg 60 tgaagtgcag ctgttagtaa attccctgga aatctcgcca gcgcaatcca gagaacccgt 120 ggtttttttc gtgcgtgtta tcgagctaaa tcttgataat ctaaggtgtc tttttgcaat 180 cgaaagtttc gactcagttc aacattgaaa ggccattgtt tcactgtgga agtatccatc 240 taccaccaac tgttgctcgc ctctgttgtc actacgcaga caactaaaac tggtctaaag 300 gtgtttgcaa caatgtcgaa tctgcagaga actccgccgc gaacagtaaa caatgcgccg 360 aacaaacgct tgaagagaga tgaagaagga gatacagaca tcaccctgaa tagtgcggtg 420 aacatgctca tgagtcaatt ccgtgaaacc agagagatga tcgacgattt tcgcaaggac 480 ataaacaaca aaattgatgc tgtgaagaca gaactagagg gaaaattgaa cctagtgacg 540 caggaaatcg gctcatttaa attggaatgt gcggcaaagt tcggcagcaa cgacgtcgct 600 ttaagtacga tgcacgagag ggtagatcaa ttatctcaca ctattggaaa cctgcaaaac 660 cgaagtgaac tgattatcag tggtattcct ttttcgagcg aggagaagtt atctgagttt 720 ttcgttgcaa tgtgtagaca gctcggtttt gacggtgatg cgtatcctgc agtcgacgtt 780 aggcgtatga aggccagagc tactctgaag gatggtgatg aatgcttgat tgcaatacag 840 tttgctctca gaaattctcg ggatgacttt tacaatgctt atttgcgaac ccgtgatttg 900 aagttgagac atctaggatt ggaatcggat cgtcgtattt acgttaatga gaatttgact 960 gttactgcgc gaaaagttaa ggtagcggca ctacgactta aaaaagcagg caaactgtct 1020 tcagtttata ctaagcaggg cgttgttcat gttaaatcaa cagcagagag tcatcccgtt 1080 gtagttaatt ccgaaaaaga tttagtttta ttttcgtagt tatttgtttt tatctctttt 1140 tgtcgatctg cattttagtc atgtttaaaa tttgttattt atgttagtgt tattgtattt 1200 tcgtgaagat tgtttttact gttggaagca gaaaagtttt atagattgac aactaattac 1260 aagttttatt aataatgtca cgtacagtag atcaaacttc aacaaacatt ccaggagcag 1320 tgttaaatgc ggttttgctt tctgggaagt ttaatatatg ccacggtaat gcacaaagcc 1380 tgtgtgcaag aaagacgagc aagttagatg agcttcggca cgttttatca aagtctaagg 1440 tggatgtagc gtgctttaca gagtcttggc tgacgtcgag gatcactgat cgcagtgtag 1500 caatcccagg ctatcgaagt gtgcgtaatg atagagttta taagcgtgga ggaggaatca 1560 tagtttatta taaagaacac ctttcttgtc tcaaagtaat gagtacggaa gtttccacag 1620 aatcgaatga taagacagaa tgtctggcac tggagttttg tttatgtggt gaaaagtttg 1680 tcgtagtaac tgtttataat cctcctgaga acgactgttc ctgcttcata tctgatatga 1740 tggtagaatt tacttctcgt tacaatggta ttttcatggt cggcgatttt aatactaacc 1800 ttttgcggcc ctgtggtaga acatctcgtt tgttgtccgt tctaaacggt ttctcaatga 1860 aatccgttgg tgatgaacca acgtattttt gtgacgagag ttgttctcag ctggatttat 1920 tgctcactag cgacagtgag aaaattttgc gttttggaca agtcagtttt ccagcactgt 1980 cgcatcatga tttgatttat ggctccatgg attatgatac ggttgacgct ccaacattca 2040 gcacctatca cgattatgta cattttgatg ctcagcgatt acaagatgct gttttgtcta 2100 ccccgtgggc cgatttctat tcaattaatg atccagacga attgctccag tttttcaatg 2160 atcatatgaa aataatccat gatacttgta ttccatgtcg gacaataagc caccgtaaga 2220 aattcaaccc ttggtttact ccagctattc gtcgagcaat gttggagaga gatttgtcgt 2280 ataaagactg gttgaaagcg ccgtcacaac ttaaagcaca aaaacgacta cagtataaaa 2340 ctttgcgtaa tagggtaaac ataatgatca ctaaagctaa agagcatcac atggaaagat 2400 ttttggactg tcgtttacct tctaaaactt tatggaagcg agtgaaagat ctaggaattg 2460 gaaaagataa tgaaacaaaa ccatgtgagt tcgatccaga tgaagtcaac agcgttttta 2520 tgacaaacta tatcaatgat attggcaatc gtacatcacc aatacatgtt ccagcttcac 2580 catataactt ttcatttaaa cgagttcagg tttgggaaat tgtaaatgca atatgggaca 2640 taaaatcaaa tgctactggg ctcgacgaat tgccgataaa gttcttgaaa ctaattttac 2700 cgttaatagc tcatcacatt gcatatatgt acaacaagtt tattgaactc tccttttttc 2760 cgtcttcgtg gaaacatgcg aaagtattac cgttacgtaa aaagccacat ttgaatgcaa 2820 tctcaaattt aagaccgatt agtattttgt gtgctctttc aaaagctttt gaaaaactgt 2880 tgaaaaaaca aatgacatct tatattgaga gcaacggtta tttatcggat agtcaagctg 2940 gattcagaaa aatgcaaggt gtaaaaactg ctgttttgcg cgttcatgat gaccttagcg 3000 ctttgatgga caaacgaggg acaagtatat tgcttctttt ggatttttcg aaagcatttg 3060 atactgtgtc acataaaaag ctatgttcca agctggtgag acaattcaat ttttcaactc 3120 ctgcggttgg actaattgag tcgtatctca cgaatagaag ccagacagtt tgctgtaaag 3180 gccgatactc tgaaagcaaa actgtaatgt caggagtccc acagggttcg gtactaggac 3240 cattgttatt cagttgttac atcaatgacc ttcccactgt gctgaaacac tgttccatgc 3300 aattatacgc agatgatgtg cagctgtaca ttggtcgaat aggaccttgt gctcgtgaac 3360 tgactgcaat gatgaacgct gatctagaga ggataatgga atggtgtaac cggaacggat 3420 tgagagtaaa ccaatcgaaa agcgtagctt tgttgatccg aaaccgaaga agacgcgtgg 3480 aatcagctga gcctttacca gaaatgatga tggatggact accgataaaa tgggcaaact 3540 ctgtgaacaa ccttggctat gttttccaga atgatctaca gtgggacggc ttggctttgc 3600 aacagtgtgg aaaaatctac gctggactta gatcgttacg gtactgtgcc gaacaaacac 3660 cggttacaac aagattaaag ttatttaagt cgctcatttt gccccacttt atttttggtg 3720 attcgtttct agtaaatcct ggtgcagaaa ccatgaatcg gctccgcgta gcacttaatt 3780 gttgtgtgcg ctttgtgtac ggcctgagtc gttttgcaca tgtgagccat ctccaaaaga 3840 acctccttgg ctgtccattt cagaactttt acgcatatcg atcctgtatg tttctgagaa 3900 aactgataaa gattaaaaac ccaccagccc tgtttcaaaa acttgtgcct tttcgtgggc 3960 agcgctcgca aaatataata atcccagcaa atcgcacgtt aacatatgct agctcaatgt 4020 ttgttagagg aatagtaaat tggaacagtt tgccccattc tattaaacgg gaaacatcag 4080 aagtagcttt caaggaaaaa tgtttgaatt tttggagtag atagtattat ataagacttt 4140 attgtacact atgtaactaa ttataatctg tgaaattatt atttatgtta aatctgcaat 4200 gtgtcgccga atgtagtaat ttaaaaggtt aaaccttacg ctactggaaa taaataaatg 4260 aatgaatgaa tgaatgaatg aa 4282 // ID DNA8-100_AP repbase; DNA; INV; 660 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-100_AP. XX NM DNA8-100_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-660 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2038-2038 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 660 BP; 202 A; 129 C; 116 G; 213 T; 0 other; cagtgttcgg aacgttcact aaaaaaatga actcgttcac gttcacgttc agtccttaaa 60 aaaggaactc gttcacgttc acgttcgtgt ttttttcaaa cgaacgcgtt cacgttcacg 120 ttctttcaaa aaatgaacgc gttcattggg tcgttcattt attttttata catttttttt 180 atgaatcatt tacctgctgt aaaggctcag tcaaacagag ggtgactttt tgtccggcaa 240 tcaaaaaatg aatccggact ttttgtaccc ggacttttcg tccgccatgg tttatgggta 300 cgcggaccgc ggcgggcaag cggtctttgt attattttac agtgcccgcg cgggcagccc 360 gctttctgtt tgaccgagcc tttaatataa aagcctataa acatatacaa ctcaagccaa 420 aaataaaaat actatttaga cttatatata atataattaa ttaatatatt ttataaatat 480 atttattaaa gggtaaataa attgaacgtc ttaaaaatcg atcaagttcg ttaaaaatat 540 aaacgtcgtt cacgttcacg ttctatttta ataaaacgag tgacgttcac gtatcgttca 600 ctaaaaatga acgtgaacgc gtgaacgtgc gttcatgaac gacgttcttt ccaaacactg 660 // ID Copia-103_AA-I repbase; DNA; INV; 5686 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-103_AA_; KW Copia-103_AA-LTR; Ty1_copia_Ele83; Copia-103_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5686 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2959-3462] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 406..5448 FT /product="Copia-103_AA-I_1p" FT /translation="MENRVVKRFVAPAFKGDMKSYPFWKKRMEIFLRQEDL FT FYTLENSPEEENYFNPVANATPEQETERKQKLEKRLKDDGAAVNEFFTALD FT DEALIHVMECTNAKEITKRLDEVYLPKGPVAMLGLRSRLYLLKNENFSSLQ FT QLFTSHEEIIIQLNNMGEHISQEEQLNTLLVAIPDKFNYLLGALSVLRKED FT LTQMSLQQVKRIFLDEDQKNADSRGNKMKPEAAFVGNNPRQKGKPGNRNKK FT NYEKHITCFGCGKIGHRKQNCIEEKHRDQASLVEEVVEDVALLAASAIAPV FT TNDKQALPTASPTPKNVALLTTPEAPVVNESAHMQETPSPIRSSTSGFKLS FT DMVPLVVTFDCPRILSPIPSTPKVSPRVVKIASPIVSPRVQNEQLSSLKVT FT VNSSTSSSAGARRIVAVASPDASPEVQDEPVLPASINSSTSSLAGVQQAKG FT SAVTDNDSAELEAPKSTVPHTQTVHRQNGSFMVRIPLDRIELCAIGKIRHD FT RGDPFCPKLKEAVTPPAPGPGAGRGRGRAPRRPPRGAPRPAPRRPGAPPPA FT PRAPARPRPRAAPPARRPPPGAPARGPAGRPPPRRGRAAPRGAPGRRGAAA FT RRRPGAGGRPGPGARARAARAPAARRAPRARPAPPARGGNQYIRNMTMNDM FT PVKVPGVLPMYLDSGASRHMVREKRCFDELWDTKPVIVQTAEERTQLRGSR FT EGNIRVKSKVGNKVFISTMYNVLFVPNLSCNLLSVSQLESNGKSVLFQNGK FT VKILDKEGNIIAEGVKENGLYCLKFFLINSEENAHVSKEEQYELWHQRFGH FT LGKDNMFKLIKNDMVQGLDIKNSSISTELCQPCLAGKQTRNPFNKNKESRS FT RRPLELVHSDVWGPSPEPNHDGSRYFVSFIDDFTHFTRTYLISHKSDVFQK FT FQEFEALATAHFSLKITKLRTDNGGEYYSNEFIKFCKSKGIELTPTVPYTP FT QQNGVSERMNRTLMDKARSMMHDKDVPESLWGEALFVSTYLTNRSPTIALE FT ACKTPYELWFGDKPDVSKLKVFGCVAYSLVNQVHRSKLDKRNRILGMVGYA FT HNGYRLWDKEERKIVISRDVVFDENKSYFQIQSESSRSKETSPVLVEFSDE FT PVELTQPDEDVERDVESDRSSDSDEDFQEALDPLDEEFEILNRRSGRLRRP FT PNWLSDYETGLIAGNGSANIPQNIRELMNRDDWNEWKKAIAEEMKALHDNE FT TWTLIQSVPDGRKAINSMWIFTIKNDTDSPRYKARLVAKGCAQSAGIDYTE FT TFAPVAKMTTIRTLLSIAVQYDLQIHQMDVKTAFLNGNLKEEIYMKLPPGE FT DGIVRTCQLKKCLYGLKQASRSWNERFDQIIISMGFTRLRSDSCVYILKDK FT NIYLVLYVDDMLIFGKDAKCIKWIKESLSKHFQMKDIGGVKNFLGLEIFRE FT TGVLEVSQQSYVERILQNFGMENCKPVSTPISTSGKWKRDEGILTTKPYKA FT LLGCLQYLALMSRPDLCFSVNFFSQFQSSPSDSHWSGLQRILRYLQGTKTY FT RLVYHRSTNEAPLTGYADADFANGLDDRRSTSGIMYTVFGNTVSWKTKKQT FT LVVLSSTEAEFVALCEASKEGMWLSNLLHEIGIDVLPFRIHEDNIPCISIA FT EEPRHHQRTKHIDIKYAFVRELIKEKKIELNYIPTDFQLADIFTKALGEAK FT FSKFVSALNLKK" XX SQ Sequence 5686 BP; 1756 A; 1245 C; 1337 G; 1348 T; 0 other; ggttatgggc ccagaaatcc gctcttaatc aaatcgatta aacagcgaca aatatatatt 60 atcattgctc aggcgaagaa tttcgtggag ctttcggata attatatata cggaaaaacg 120 tgtgttcaga ccaagctttt cgtggaacaa catttgaaaa attgcaacaa aagttcagac 180 gaagattttc gtagaacttt cagttgtgtg aaacgtgtgt tcagacgaag cttttcgtgg 240 aacatttgaa aagtttacaa caaaagttca gacgaaagat tttcatagaa ctctcagttg 300 tgacatttgc gaacacttga agaaagttcg ggcgaagttt ttcgtagaac tgtcaacaaa 360 cattgactac atttgtaagg ttctggttca ttgacattta ttaacatgga aaatagggtt 420 gtcaaacgat ttgtagcacc tgccttcaaa ggtgacatga aatcgtatcc gttctggaag 480 aaacggatgg aaatattttt gagacaggag gatttgttct atacactcga aaatagccca 540 gaagaagaaa attacttcaa tcctgtcgca aatgctacac ccgaacaaga aacagaaaga 600 aaacaaaagc tggaaaaacg attgaaagat gatggtgcag ctgtcaacga gtttttcaca 660 gctctagatg atgaagcttt gatacatgtc atggaatgta caaatgcaaa ggaaattacg 720 aagaggctag atgaggtcta tcttccaaaa ggtccagtcg ccatgttagg gttgaggtcc 780 cggctttact tgctcaagaa tgagaatttc agttctcttc agcaattatt cacttcacat 840 gaagagatta tcatccagct caacaacatg ggggaacata tttcgcaaga agaacagctg 900 aataccttgc ttgtagcaat tccagataaa ttcaattatc tgctaggagc actttcagtt 960 ctaaggaaag aggacctaac gcaaatgtct cttcaacagg tgaaacgcat ctttctggac 1020 gaagaccaga agaatgcgga cagtcgtggt aacaaaatga aaccagaagc ggcgttcgtt 1080 ggtaacaatc ccaggcaaaa gggcaaacct ggaaatagga ataaaaagaa ctacgagaaa 1140 catatcactt gctttggttg cggaaaaatc gggcaccgaa aacaaaactg cattgaagag 1200 aaacatcgtg atcaagcatc tctagtggag gaagtcgtcg aagatgttgc gcttctggcg 1260 gcatctgcaa ttgctccagt cacaaatgat aaacaagcat tgccaacggc ttccccgaca 1320 ccgaaaaatg tggcactttt gacgacacca gaagcaccag tggtcaatga atctgcacac 1380 atgcaagaaa ctccatcacc tattcgatct tcgacatctg gattcaaatt gtcggatatg 1440 gtgcctttgg tagtgacgtt tgattgcccc agaattttgt ccccaatacc atcgacacct 1500 aaggtctcgc cgagagtggt caagattgcc tctcctattg tcagtccaag agtccagaat 1560 gaacaattgt cgtcactgaa agtcacagtt aactcatcaa cttcatcgtc tgccggagcc 1620 agaaggattg tagcggttgc ctctcctgat gccagtccag aagttcagga tgaaccagta 1680 ttgccagcct caattaattc gtcaacttca tcattggctg gagttcaaca ggccaaagga 1740 tctgcagtaa ctgacaatga ctcagctgaa cttgaagcac caaaatcaac agtaccgcac 1800 acccaaactg ttcaccgtca gaatgggtca ttcatggtcc ggatcccgct ggatcgtatt 1860 gagttgtgcg caatcgggaa aatacggcat gatcgaggag acccgttttg tccaaagtta 1920 aaggaagcag tgaccccccc ggcgcccggc cccggcgcgg ggcgcggccg cggccgcgcc 1980 ccccggcgcc cgccgcgggg cgccccgcgc cccgcgccgc ggcggcccgg ggcgcccccg 2040 ccggcgccgc gcgcgcccgc ccggccgcgg ccccgcgccg ccccgcccgc gcgccggccc 2100 ccgccgggcg ccccggcccg cggcccggcg ggccgcccgc ccccccggcg ggggcgcgcc 2160 gcgccccgcg gggcgcccgg gcgccgcggc gcggccgcgc gccggcgccc gggcgccggg 2220 gggcggcccg ggcccggggc gcgggcgcgc gcggcgcggg cgccggcggc gcgccgcgcg 2280 ccgcgcgcgc gccccgcgcc gccggcgcgc gggggcaatc agtacatacg aaacatgaca 2340 atgaacgata tgcccgtcaa ggttccagga gtgctaccca tgtacctgga ctcaggagca 2400 agtcgacaca tggtaaggga aaagaggtgc ttcgatgagt tatgggacac aaaaccggtg 2460 attgtccaaa cagctgaaga aaggacacag ctgaggggat caagggaagg gaacattcgg 2520 gttaagtcaa aagtcggaaa taaagtattc atttccacaa tgtacaacgt actttttgtt 2580 cccaatttat cctgtaacct tctttcggtg tctcagttag agtcaaacgg caagtctgtt 2640 ttgtttcaga atggcaaagt taaaattctt gataaggaag gaaacattat tgcagaaggt 2700 gttaaagaaa atgggctata ctgcctaaaa ttcttcctca tcaattccga agaaaatgct 2760 catgtgagta aggaagaaca gtatgaactt tggcaccaac gatttggaca tttgggaaaa 2820 gacaatatgt tcaaactgat taaaaatgac atggttcaag gtttggatat caaaaactct 2880 tccatttcaa cagagttatg tcaaccatgt ttggcaggaa aacagacaag gaaccctttc 2940 aataaaaaca aagaaagtcg atcccgtaga ccactggaac tcgtacattc tgatgtctgg 3000 ggtccaagcc cagaaccaaa ccatgatgga tccaggtatt tcgtcagctt cattgatgat 3060 ttcacacatt tcaccagaac ttacctgatt tcacataaga gtgatgtctt ccaaaagttt 3120 caagaatttg aagcccttgc gacagctcat ttttccctaa aaatcactaa actcagaacc 3180 gataatggag gagagtacta ttcgaatgag tttatcaaat tctgcaagtc aaaaggaatt 3240 gaattgaccc ctaccgttcc gtacactcca cagcagaatg gagtaagtga acgaatgaat 3300 aggactctca tggacaaagc tagatctatg atgcatgaca aggacgtacc tgaatctcta 3360 tggggtgaag ctctatttgt ttctacatac ctcacgaaca gaagcccaac aattgctctg 3420 gaagcctgca aaactccata tgagttatgg tttggtgata agccagatgt cagcaaacta 3480 aaagtgtttg gatgtgttgc ttactccctg gtaaatcaag tacacagaag caagcttgat 3540 aaacggaacc gtatactggg tatggttggc tatgcacata acggatacag actttgggac 3600 aaagaggaaa gaaagattgt aatatctcga gatgtggtat ttgatgagaa caagtcatat 3660 tttcagatcc aatccgaatc atcacgcagt aaagaaacgt ctccggtctt ggttgaattc 3720 tcagatgaac cagttgaact tacacaacct gatgaggatg ttgaaagaga tgttgaatcc 3780 gatcgatcaa gtgattcaga tgaagatttt caggaagcac ttgatccttt ggatgaggaa 3840 tttgaaatcc ttaatcgcag aagcggtaga ctacgaagac ctccaaactg gctgagcgac 3900 tatgaaacag gcttaatagc aggaaacgga tcagccaata ttccacagaa catacgagaa 3960 ttgatgaata gagatgattg gaatgaatgg aagaaagcta tagctgaaga aatgaaggca 4020 ttacatgaca acgaaacatg gacattaatc caatcagtac ccgatggaag gaaagctata 4080 aactccatgt ggatcttcac aatcaaaaac gacacagatt cacctcgcta caaagcaaga 4140 ttagttgcaa agggatgtgc tcaaagtgca ggaatagact atactgaaac atttgcgcca 4200 gtagctaaaa tgactacaat acgcactcta ctgtctatcg cagttcagta tgatctacaa 4260 atccaccaga tggatgtgaa aactgcattt ctaaacggga acttgaaaga agaaatttac 4320 atgaaattac ctccaggaga agatggaata gttcgtactt gtcaactcaa gaaatgcttg 4380 tacggactta aacaagctag cagaagttgg aacgaaaggt tcgatcagat cataattagc 4440 atgggcttta cacgactacg atcagattct tgtgtgtata tcctgaaaga taagaatatc 4500 tacctcgttt tatatgtgga cgacatgcta atctttggaa aagatgcaaa gtgcattaaa 4560 tggataaaag aatcattatc taaacatttc caaatgaaag acatcggcgg cgtgaaaaat 4620 tttcttggtc tggagatctt cagagagact ggagttctgg aagtttctca gcagtcctat 4680 gttgaaagaa tcctccaaaa ctttggtatg gaaaactgta aacctgtaag tacacctatc 4740 tcaacctcag gtaaatggaa aagggatgaa ggtattttga caactaagcc ttataaagct 4800 ctactcggtt gcttgcaata tttagcattg atgagtagac ccgatctctg tttttccgta 4860 aattttttta gtcaattcca gagttcccca tcagattccc actggtctgg acttcaacgc 4920 atactacgat atctacaagg gaccaagacg tatcgtttgg tataccatcg tagtacgaac 4980 gaagcaccat tgacaggata cgcggatgct gattttgcga atggccttga cgacagacga 5040 tcgacctcgg ggatcatgta tacggttttc ggaaacactg tatcgtggaa gacgaagaag 5100 cagacattag tggtcctttc atctactgaa gccgaattcg tggccttgtg cgaagcgtca 5160 aaggaaggaa tgtggctttc gaatctgcta cacgaaattg gaattgacgt attgcccttt 5220 cggatacatg aagacaacat tccttgcatc agtatcgcgg aggagccacg ccaccatcag 5280 cgaaccaaac acatcgacat taagtatgca tttgtgagag agctgatcaa agaaaagaag 5340 attgagttga attacattcc gacagatttt cagctagcag acatatttac taaggcatta 5400 ggagaagcaa aattcagtaa atttgtctca gcgttaaatt tgaagaaatg aagggaagtg 5460 aaaatttaaa ttcttattgt gcatttgctt catattttta actatcggtt caggcctctg 5520 aaccaaactc agatcagaga gaagtagaag agagaaatag tcgttaggag aaacgaataa 5580 gcggccacac atcgctagcg ttatactgcc gttctacgca tatttgtccc gcggtccgta 5640 aggtttgcat agaacatggg acaagtaggc gtagaagggc agtaca 5686 // ID Mariner-5_SM repbase; DNA; INV; 2771 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA transposon from Schmidtea mediterranea. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-5_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2771 RA Jurka J., Bao W. and Tempel S.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 149-149 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 419..2053 FT /product="Mariner-5_SM_1p" FT /translation="MEKQVQKRMSFSTKEKLLALDQLKNGKQQTQLARDLG FT ISESTLRGWKTNEVKIRSLPNILEDELGLQRKRVKSASDNVLDSALYQWFV FT QARSEGVPISGPILKAQAEKFNRRINGEDSKFKASNGWLDRFKKRHAISQV FT LVSGEIRSADKEAANSYPFWLKKFLEEGCYTADQVYNCDETGLCYKMLPNR FT TLATKTDNHKREGFKKRKDRVTLLFCVNKSGSHKLRPLMIGKSKAPRCFHH FT VNMKSLPFQYINSNNAWMNTTVFYDWFHKTFVPDVRAHLRKLKLEKKALLL FT LDNCPAHPPAENLVSSDGKIKVSYLPKNTTTEIQPLDQGVISVFKQNYRRE FT MIKGMVLDNNGSVDQSLALLTLKEVCHLGGKAWEAITENCIERCWMKGLGA FT AFMPSIQNSSDDELGFTGFTEEDVRLAEDVLKIYEPSHHDILNWYSVDEIC FT PIYEHISDEEIVINVGMKNEAAQSVPEKTEDSDNDDDCSTIPPPKVSEAVV FT HLEASLRWLETQDVDSLKVLQLRNILEFAKNKQSAAKKQTKLDVFFNKL" XX SQ Sequence 2771 BP; 986 A; 450 C; 479 G; 856 T; 0 other; tacagtggag ccccgctata acgcgccccg ccataacgcg aattcggatt taacgcggtc 60 gtagcatggc acccaaaatt tttatattca tttacattta gaaaaaaatt tgataaaatt 120 aaaagaacca aaacatgaaa ttaaattcat ttttaccata aaaaactgtt cttgaaaaaa 180 ataacaaaac ttaataaata tagcgtgata aaattgtcaa aatgtatgcc gtgatcaaac 240 ttgtagttaa aagcaaataa atttaatata tatttttttt gaaataaaaa tgattgtaat 300 tgtagattat ttttaatacc tttattatta aaaatgattc gtaatattta agtctttatt 360 aatttaaatc ctttataagt gtatattgtt tgtttatatt aaaaaataat tatatgttat 420 ggaaaagcaa gttcaaaaac gaatgtcgtt ttctaccaaa gagaaactgc ttgccctaga 480 tcaactaaag aatggcaaac aacaaaccca gcttgcgaga gatttgggaa tcagcgagtc 540 cactttgaga ggttggaaaa caaacgaagt aaagatccgt agtctaccca acattctaga 600 agatgaactt ggtttacagc gtaaaagagt taagtcagca agtgacaatg tgttagattc 660 agctttatat caatggtttg tacaagctcg gtctgaaggt gttccaattt caggcccaat 720 tctaaaagca caagcagaaa aatttaatcg ccgtataaat ggagaagatt ccaaatttaa 780 ggcgagtaat ggctggctag acagattcaa gaaaaggcac gctatcagcc aagttcttgt 840 ttcaggagaa attcgctccg ctgacaaaga ggctgccaat tcctacccat tttggcttaa 900 aaagtttctg gaagagggtt gttacacagc cgaccaagtt tacaattgtg atgaaacggg 960 tttatgctat aaaatgttac ccaatcgcac cctcgcaact aagactgaca atcataaacg 1020 agaaggtttt aagaagagga aagacagggt gactctctta ttttgtgtta acaaatctgg 1080 tagccacaaa ttaaggcctt tgatgattgg aaaatcgaaa gcacccagat gtttccatca 1140 tgtaaatatg aagtcacttc catttcagta tattaacagt aataatgctt ggatgaatac 1200 gaccgtattt tatgactggt ttcacaaaac attcgttcca gacgttcggg ctcatttaag 1260 aaaactcaag ctagagaaaa aagcactcct ccttctggat aattgccctg ctcatccccc 1320 agcggaaaat cttgtcagta gtgatggcaa gataaaagtt tcgtacctcc cgaaaaacac 1380 aacaacagaa atccagccac ttgaccaagg cgtaatttcc gtatttaaac aaaattatcg 1440 ccgtgaaatg ataaagggaa tggtattaga taataacggt tccgtcgatc aatcattggc 1500 attacttact ttaaaagaag tatgccacct tggaggtaaa gcctgggaag ctattacaga 1560 aaattgtatt gaacgatgct ggatgaaggg acttggggca gcttttatgc catcgataca 1620 aaatagtagt gacgatgagt taggttttac aggattcacg gaagaagacg tgcgtttagc 1680 agaagacgta ctcaaaatat acgaacccag tcatcatgat attcttaatt ggtattcagt 1740 agatgaaatc tgccctatat atgagcatat atcagatgag gaaattgtta taaatgtcgg 1800 catgaagaat gaagctgcac aatcagttcc ggaaaaaaca gaagatagtg ataacgatga 1860 cgattgctct acaattcctc ccccaaaagt ttctgaagca gtagtacatc tagaagctag 1920 tttaaggtgg ctggaaacac aagatgttga tagtttaaaa gtacttcaat taagaaacat 1980 tcttgaattt gcgaaaaaca aacagtctgc ggcaaagaag caaacgaaat tggatgtatt 2040 tttcaataaa ttataaaatt taattataaa ataacttact ttcattttgc gcaacgatta 2100 atttataatt tttcgcatgt ttttttttat ttttacaaat ttattaattg tttttgcgcg 2160 ctgaaatcat cccttgattg ctaattattc ttgcagttgt cgcataaaat aaaattttag 2220 cgttatgaaa tgagtgttta aatatacata ataataaact tttcgggcgg catgcaatga 2280 atataatata tcaacgccgt aaaaggtttt ataattcggt ttatgttatg ttttacggag 2340 cgaattcaga tataaaaatt attaaatgtt ttaaatattt aaaatcttta taaaataaaa 2400 tctttatttg acaaaacgaa accaatacgg cacagtaacc ttctatgttg atacactttt 2460 tataatttta tgatattttg gtatttttgg tcaatccgta ttataacaca aacaaaaaaa 2520 ttaataaatg ttataatgca cggtaataac ttttatatat tggcatcgat gctttaaatt 2580 tatctaaatt ttaatcaaat ttttataatc atctgatgat aagccgactt ttgattttta 2640 catgtgtaca cgtatatctt gcccgaaaaa aaagttcaat attgtaaaac cccgcattta 2700 cgcgacccca cttttaacgc gacccgattt aatggaccct gattttcgca ttataacggg 2760 gtttcactgt a 2771 // ID Chap3b_Cis repbase; DNA; INV; 577 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; Chap3b_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-577 RA Smit A.F.; RT "Chap3b_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000049 Strong bias for NTCTAGAN target site duplications. CC Extended terminal similarities with Chap1b. XX SQ Sequence 577 BP; 172 A; 108 C; 126 G; 170 T; 1 other; caggggtcgg caacctacgg cccgcgggcc gcatccggcc cgccaagcaa aatcatccgg 60 cccgagacat ctnactgaag tattatagaa tacggcccac tgtgtcatct gcaatcttag 120 attttgatga attttaacaa tctatgcggc aataataaaa aatgcgcctg accgtgtcta 180 ttgagaatgt gtgcgtgcag acaaagacaa tgctttgaga gaaagcaata agtgtgtgtg 240 tttgttggca tgcgcattag tgagagtaga agcaatccaa ttgtaatatt atgacattat 300 gacgtcatga cacatattat gacatgtaac attatgacac aacagacatc gtttttgcta 360 tttcatatgt gtttgcatcg tgttctttac taactaaagt cctgtacgat agataatact 420 aggtagtaaa catatatttt tctaattgat tttgcaatct tgaaatgtta tcaaaacgta 480 gaaaagttga tgtggagttg agaaatttat tacggcccgc cgagatcatg ttaagtttgg 540 ttttggcccg cgactatgaa aaggttgccg acccctg 577 // ID Chapaev-2_ACa repbase; DNA; INV; 4029 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 30-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon - a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-2_ACa. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4029 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 777-777 (2007). XX DR [1] (Consensus) XX CC Chapaev-2_ACa is a young family of DNA transposons. The genome CC contains several copies of Chapaev-2_ACa that are less then 2% CC divergent from the consensus sequence. Chapaev-2_ACa belongs to CC the Chapaev superfamily. Hallmarks of the Chapaev transposons are CC 4-bp target-site duplications, terminal inverted repeats with the CC conserved '5-CAC and GTG-3' termini, and the Chapaev transposase. CC The Chapaev transposase is characterized by the conserved CC D-x(60-80)-D-x(220-290)-E catalytic triad. Chapaev transposons CC populate genomes of different animals, including sea urchin CC Strongylocentrotus purpuratus, amphioxus Branchiostoma floridae, CC starlet sea anemone Nematostella vectensis, sea hare mollusc CC Aplysia californica, mosquitoes Aedes aegypti and Culex pipiens, CC and nematode Caenorhabditis elegans. The N-terminal portion of CC Chapaev transposase in Chapaev-1_ACa, Chapaev-2_ACa, CC Chapaev-3_ACa, Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, CC Chapaev-2_NV, Chapaev-3_NV, and Chapaev-1_SP is similar to the CC N-terminal portion of RAG1 (100-370 aa in the human RAG1). It CC includes a novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS 605..3442 FT /product="Chapaev-2_ACap" FT /translation="MECHDEASHVRYIHELCRTCGGRSLSGKQKLQKRKAY FT KCIDFAHDILLVFGVTIAVDERSKHSLTICYACTNKIKRSRTHSFEGTIIN FT ARKAAETSSNLWTAFRSDIRASDCSICTHYHRTSMGSKNTPIELVTTKTSE FT ATKTTHSPHRSTFEDSANTTSNSTAEPSTHTEDNTSHEERQDEDIASLDTS FT AASSTTGHSEFELQLPPIATSTPSKPTQLMVDSNTSSLPIATSTPSKPRQL FT MVDSSTSPAFKHDKLFSQSLEISPTRPLTKEEEKLNTHLVRRKLNNSSTNT FT IQCKTRGQPIVLTKLIKARKETSTAKSSTKRQRSHLLSSVRTHIAGSSQES FT ITTQHAAELKHTKVATRKSICEKAGINLKPHMSNQSVMAMKEATDLTWSQL FT RQQRRFLKQAGLSLPNEKEQRRAMEGLFADDIVTQVENFVDTTGQIHSTLL FT GRVANITDFIVRLLETHKQQGTLTWHNSTIPTDEIWIKFGGDHGKNSMKFT FT MQIANTRKPNSKHNTFVVAMANIKDTYENLKICMSILQPQLDELSKLEWDG FT KKIVLFLFGDYDFLVKLYGLSGAQGTYPCLWCHVPKSYIKTGQRRDSLPRT FT LATLKRDHRRFKKYGKGQKKNVQRFHNSLHPPMIEVEPSEAAPPYLHILLG FT IVLKHHRLLENAAHEIDMQLCLQSKNDCKDASASLRAHGNKWKRAEELKKE FT IELLEGSIVFSNTPDQRENFEKELSEKQDELSDLDFEPLSPLSGPICSHLD FT TILDKHNITPQAYHSRSFIGNHCHKYITAKVYRDLTLFIVKKTQECTHRTD FT ILDTAFILRDTFNELNNAFRDVHLFISHTNPIPYNKIKQIQSCVAKYMSLY FT RKNFNNKVIPKQHILEKHCVPWIQKNGFGMAFHGEQGGELIHSSVAKLHRR FT ATSIKNKERQLKVIMRSQHLQSSPEQLAFAPPVKRRKTTPDS" XX SQ Sequence 4029 BP; 1291 A; 905 C; 792 G; 1041 T; 0 other; cacggctgtc taaatcgccc caaaaccgcc agtttagtta ccgggcaagg gtttcccgcc 60 gaccaccccg agacggtaat aaatcagctg tcagccgtgc aaggtaccgt gacgggagcg 120 gcactcccgg cgtgcctcgg ccgtgcccag gaaacttgat tccagtcatt ttcgattaat 180 ttgatcccca atatctcccg atctaaagat gcgatgtctc cacgtggggt ctcattttaa 240 agggctactt caacccgaca agacgatata attttctctt gtgtgcgttc actagtgtta 300 atcctgcttg ctgagaaatg cactctatcc tttattactt ttaaagcaac ttgcatctat 360 ttatttttat ttaaagtcat ttttctcaat aattgagccc acaattaaat tttttctcct 420 tgattactct tgagaaaata gtactgagtt acatgaggaa gtgagaacat tgtacgtatt 480 ctgtgtgatt catagtgacc gattttgtgt agaccttgga gtgtgaaagt accccatcct 540 cgttcgcctt gtttggattt cttctgcttt tttgctgtgg agtattggaa cattaatatt 600 cctaatggag tgccatgatg aagcgtccca tgttcggtat attcatgaac tttgtagaac 660 atgtggtggt agaagtctat caggaaaaca gaaattgcaa aagagaaaag cgtacaaatg 720 cattgacttt gcacacgata tcttgttggt atttggggtt acgatagcag ttgatgaaag 780 gagtaaacat tctttgacga tatgttatgc ctgtaccaac aagataaagc gcagtaggac 840 ccattcgttt gagggaacta taataaacgc aagaaaagca gctgaaacct cctcaaattt 900 gtggacagct tttcggtctg atatacgcgc tagtgactgt tcgatatgca cacactatca 960 tcgcacatca atggggagca agaacactcc aatagaatta gtaacaacga aaacatcgga 1020 agctacaaaa acaacacatt caccacatag atctacgttt gaagactcgg caaacacaac 1080 aagtaactca actgcagagc catctacaca cacagaggac aatactagtc atgaagaaag 1140 acaggacgaa gacatcgcat ctctcgatac tagcgctgca tcatcaacca caggccacag 1200 tgagtttgag cttcaactcc cccctattgc tacatccacg ccatccaaac ctacacagct 1260 aatggtagat agcaacactt caagtctccc tattgctaca tccacgccat ccaaacctag 1320 acagctaatg gtagatagca gcacttctcc agcatttaaa cacgacaaac tcttctctca 1380 gtctctagag atctccccaa cgcgaccact aacaaaagaa gaagaaaaac ttaatactca 1440 tttagttagg agaaaattaa acaattcaag taccaatacg attcagtgca aaacaagagg 1500 acaacccata gtactcacaa aacttataaa ggccagaaaa gaaacatcaa cagctaagtc 1560 atccactaaa cgtcaacgct cacatttgct ttcatctgtc aggacacata tcgcaggctc 1620 atcacaggag agcatcacaa cacaacacgc agcagaatta aagcacacaa aagtagctac 1680 tcgaaaaagt atttgtgaaa aggcaggcat caacttaaaa ccacacatga gtaaccaatc 1740 tgtgatggcg atgaaggaag caacagacct tacatggagc cagctcaggc aacaaagacg 1800 ttttctcaaa caagctggtc tctcgttgcc gaatgagaaa gaacaaagaa gagctatgga 1860 aggtctgttt gctgacgata ttgtcacaca ggtggaaaac tttgttgaca ccactgggca 1920 gattcacagc acactacttg ggagggttgc caatataaca gactttatag ttaggcttct 1980 agaaacacac aaacagcaag gtacacttac atggcataac tccactatcc caacagatga 2040 aatatggatt aaattcgggg gtgaccacgg caagaatagt atgaaattta caatgcaaat 2100 tgcaaacact aggaaaccaa attcaaaaca caacacattc gtagttgcca tggccaacat 2160 aaaagacaca tacgaaaatt taaagatctg catgtccatt ttacaacctc aacttgatga 2220 actttccaaa ctagaatggg acggtaagaa aatagttctc tttttgtttg gtgattatga 2280 ctttcttgtg aaattatatg gactgtctgg tgctcagggt acctatccat gtttgtggtg 2340 ccacgtgcca aagtcctaca ttaagacagg gcagagaaga gattctctac cccgtacttt 2400 agccacttta aaacgggacc acagacgttt caagaagtat ggcaaaggcc aaaagaaaaa 2460 tgtgcagcgg tttcacaata gtctccatcc tccaatgatt gaggtggagc caagtgaagc 2520 tgcgcctcca tatctacaca ttcttttggg aattgtgtta aagcatcatc gcctacttga 2580 gaacgcagcg catgaaatag atatgcagct gtgtttgcaa tccaaaaatg attgcaagga 2640 cgcgtctgca tcgctgcgtg ctcatggcaa caagtggaag agggctgagg agctgaaaaa 2700 agaaattgaa ctcctcgagg gctccattgt tttttcgaat actccagacc aaagagaaaa 2760 ttttgaaaaa gaattatctg aaaagcaaga tgagttatct gaccttgact ttgaacctct 2820 cagccctctg tccggaccca tctgttcaca tcttgacact attttagaca aacataacat 2880 cacaccgcag gcataccaca gccgttcttt catcggcaac cactgtcaca aatacatcac 2940 cgctaaagtg tacagagacc taacattatt catagtcaag aaaactcagg agtgcacaca 3000 cagaacagat attctagaca cagcatttat tttgagggac acattcaatg agctgaacaa 3060 tgcctttagg gatgttcacc tctttatatc tcataccaac ccgataccat acaacaaaat 3120 caaacaaatt cagtcatgtg ttgcaaaata tatgtcattg tatagaaaga atttcaacaa 3180 taaagtcata cccaaacaac atattctaga gaaacactgc gtcccatgga tacagaaaaa 3240 tggttttggt atggcattcc atggtgagca agggggagag ctcattcact cttctgtggc 3300 caaactacac agacgagcaa catccatcaa aaataaagaa cggcagctga aggtgatcat 3360 gagatcacag caccttcaga gctccccaga acagttggca tttgcaccac cagtcaaaag 3420 gaggaaaacg acgccggact catgaatttc cccagtttca ggtgtcggcc tatatatata 3480 tgtacatgtg taaatgagta ataagtgcac tgtgatgata tttttcaagg atcctaattt 3540 tctgtgattc tgcattcaag cttcaatatc tttgctttga cggagcgtac aatgatgcca 3600 ccacgtaagt gtttacttta tcctattttc atcatatttc cttaattttg tcaaaatcaa 3660 aagcaaagat catgttcaat ttttacttaa aacatcagac ccctttacaa ctttgcataa 3720 ttactataca gttttatggc atccgtatgt ctgcaacagt acccagtaca gtttcacgat 3780 cgcttctacg tgcccacact atattgatcg ttcagtgtat gtgattttct gtgagatcgc 3840 tagaatagtt tttgtttact gcctcagttt atatgcttaa aaaacatcgt gtgataagaa 3900 aacaaccgta actcggcatc ggcagtgaac cggcggcgct gacccggtga ccggggtcac 3960 agagtatcgg gttacgggga gtgccgcggg ctggcggttt gactaagttg gcggtcggta 4020 acagccgtg 4029 // ID Copia-48_AA-LTR repbase; DNA; INV; 201 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-48_AA_; KW Copia-48_AA-I; Copia-48_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-201 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 969-969 (2011). XX DR [2] (Consensus) XX SQ Sequence 201 BP; 71 A; 38 C; 37 G; 55 T; 0 other; tgaaggcata aaatacgttc agtgtggcaa cacgtcatcg cagtgtgaca accatgcagg 60 atggcaactg tcaagaacaa aaaacatcag tgaatatatc gcgccaaaat aactctccaa 120 ttcataatgt tcgagtgtag agaataaaac tttttcatta gttgattgac tgtacaacag 180 tacacatcgt gttttatttc a 201 // ID Gypsy-34_AA-I repbase; DNA; INV; 5976 BP. XX AC AAGE02026431; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_AA_; KW Gypsy-34_AA-LTR; Gypsy-34_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5976 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026431; Positions 173201 179176. XX CC Positions [2856-3392] - Reverse transcriptase CC Positions [4781-5248] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2280..4109 FT /product="Gypsy-34_AA-I_2p" FT /translation="MRKSGEDGIIFAKVAGLPCSFLIDSGAQVNTFTEELF FT RKLISDPRYSSEVFNVQNQSDRSLKGYASKGDIDVVATFHATLFISNDRPT FT LLEKFYVVKEIRALLSRSTASRYSVLMLGLQVPIQQRDVEDNRSLYAGEIA FT AITTDVIFPKFNIPPVKIHYDKSKPPCRNIFFNIPVSVKPIVEERIHGLVR FT ANIIEPVEEGMDTSFCSSMLVVPKGKEDIRLVIDLRGPNRYIYRTPFAMPT FT LEKILAELNGAAWFSTIDLSNAFFHIELDEESRHLTNFCTEFGMFRCVRMP FT FGLCNAPDVFQETLQRKILGGCKGCRNFQDDILVYGSTKSEHDENLAVVLS FT RLSSHNVKLNDDKCVFGSQRVNFLGFDLTPERWLVEDEKLSAIQQFRTPSS FT CSEVKSFLGLITFVDKFILHRATKTENLRLLASSDSFYWNDSEEEEFSYLK FT NEALNTIRRLGYYSPLDPIELFVDASPTGLGAVLAQYNSDGKPRIIACASK FT VLSPSEQRYPHTQKEALAVVWGVERFSYYLLARPFVIRTDAEANQYIFNSD FT HRLGKRAVTRAEGWALRLQPYDFSIQRVPGSENVADVLSRLIPESQKAESF FT EENEEKHYLYHR" FT CDS 4409..5782 FT /product="Gypsy-34_AA-I_1p" FT /translation="MEITWSDIELASENDEELRLVREAMKRKGWPTELRAY FT EAQRKKLHHLGSLIFANERVVLPRDLREKALSSAHGGHVGEMAMKRIMRQF FT FWWPKMSKEVSKYVKNCETCSLLAKRNPPVPLVSREFPEGPWEILQIDFLS FT VPNFGTGEFLIVVDTYSRYLAIVEMRSIDADSTNNALNEVFRTWGLPIILQ FT SDNGPPFQSASFVKFWEEKGIKVRKSIPLSPQSNGAVERQNQGIIKVLAAS FT RLDGSNWRQALQQYVHKHNTLVPHSRLGVTPFELMVGWRHRGTFPSLWSTS FT RNEKIDLEDIRERDAETKLSSKKYADRVRGAQESDIQVGDVVLLAQAKKSK FT TDPVFSSERFKVVARDGAKVVVVNQTGVQYARNIQEIKKAPGFVIQPTQPV FT PAGKAADDEAEIVEPVTAEETAASPSSENADLVMQPDGHRSLRHRKLIARP FT RRFDDNFLYYVFQ" XX SQ Sequence 5976 BP; 1744 A; 1173 C; 1478 G; 1581 T; 0 other; tggcgcagtc ggcaggtatg tattgatcaa atttaaacaa aataactttt atttggggcg 60 agtaggactc cataaaaaaa tgaataaata cagagcgagt aggtctctgt gaaaaatggt 120 gcgagtaggt ctccatttcc gaccaaatga agtccagagc gagtaggtct ctgcgtaaac 180 tggagcgagt aggcctccat gaaaataata atatatacag agcgagcagg tctctgtgag 240 aaatggtgcg agtaggtccc catttccgaa aaaaatgaat gaatacagag cgagtaggta 300 tctattcgaa atggtgcgac taggtctcca tttccgtcaa aaaaaaaaaa gaaaaagcag 360 agcgaatctg cgtttgttag ttaattcaat gagttgatat tcgagaagtg acatgaaatg 420 gatcaagtaa tcaaagagac ttcaaatgtt tatgaagata gagcggacag gactctacaa 480 gtatgttttt tttcgttttg gttacaggga aatatgtgcg agctccaaag gaaatggatg 540 tatcggattc aagcgggttg tctaaagagc ttgaagaaga aagagcgatg agtacgcgtt 600 tggcgagtga gctggaaaaa tcacgcaagg aaaatgagga gttgaaagcg attatgtacg 660 ccaggagcaa tgtccgcgat ccattgagcc caagcgtgga tttcgaagct ggaaacggat 720 tttgcagtac cagacgaatg gctcccaatc ccattttgac gaatagtatt gaagaatcca 780 gattcctgtc ttcaatgaat catttatcat tggcatcaat caacgttcct gagtgtacag 840 cgcctgaggg ggatgatatt catcgccaaa ttttcgagca gtggaaggat ttgttactgg 900 attcgatgaa gctagctgga atcgatgatg aagcgacgat gtttacagta ttcaaggtta 960 aggccggatc gcgcctgctg ggaatttttc gaaatacgaa atcatctact gatgcaccgg 1020 atccagagga gcggcctttt tctaacgcta tgcatcgctt gcggtcttat tttggctcgg 1080 gttcagatgt catgctgatg agacgcaggc ttgctcttac cgttcaaaaa tccgatgaaa 1140 ctgacctcag tttcataaca agagttggat ccattgctcg attgtgcgaa ttcagcgagg 1200 atagagtttg aagaaattgt tgcgacggtt gccgagcacg ccaggagccg ggatgtgcga 1260 acaaccgcac ttaaattgct cagccgtaaa ggcagcttca cggacttggt tgataaagtt 1320 cgtgaattag aagcgattag actgaacgaa gagtacgtga accagaagct tgcgatgcag 1380 caaccggcgt tgattgctcc agtatcatcc tctttccact ccgttggaaa tcgtcagcag 1440 cgacaatacg tcggttttcg tggaaacaca acccagcgag gatttccatt ccgtcgtggc 1500 ggcgggcgtg taccaagagg taaagagccg tacactggtg gagcgttccg tagctcagat 1560 ggcaacaggt gttggcgctg ttacagcatt tttcattcgg cggacgactg caaagccaag 1620 gataagtggt gcaatcattg tggagctgtg ggacactttc ttcgagcttg cccaaaactc 1680 gtttccggtt catttcgtgg ccaaatgaaa cgcccgcaag cagtggacca ttcggaagta 1740 ccaccgtaca aggttgcttt tgtcgataaa gcagaagagc ctaaggcaga agaagaaacg 1800 gtaagtgtca gtcagtcaga tgagtaattt gtttatgatt tgtatgcgta gagaatatgc 1860 gatactattg aagaaaggaa cagttgaagc tatatagcta taatgtaatt tgaagtttcc 1920 aaataaaatg tccaatgagt aagaattaca gaaagacatc tggtatttta ttacacagga 1980 agcaaacgcg agggcaacca aagttgattc tcgccaactc ctctcacgta gtcctgattc 2040 gatattggtg ggccaagcag aaacggtttc aatgcactca gaattgtcct cagctatatc 2100 ctttgttgag tcttgccaag cagtagtgag gagtgatgaa gtcattgaag ctggaagtat 2160 aaatattgta agtacatttt agtcgaaggt tgcttatgag taataattga tacatatata 2220 ttataattca tcagcttgat cgttctacaa caaaggtgtt tggcgatgcg tcatacccca 2280 tgcggaaatc tggagaggac ggcataattt tcgctaaggt agcaggattg ccgtgttctt 2340 tcttaattga ctcgggagcg caagttaaca cttttaccga ggaattgttc cgtaagttaa 2400 tctcggatcc caggtacagc agtgaggtat tcaacgtgca gaaccaatca gatagatcgc 2460 taaagggtta cgcttcaaag ggtgatattg atgtggttgc tacttttcat gcgactctgt 2520 tcatctcgaa cgacagacca acgctgctcg aaaaatttta cgttgttaaa gaaattcggg 2580 ctcttctaag cagatcgaca gcgtctaggt atagtgtgtt gatgttggga ttgcaggttc 2640 caatacaaca acgtgatgta gaagacaatc gatcgctcta tgctggagag attgcagcta 2700 ttacaactga tgttatattt cccaaattca acatccctcc ggtgaaaatt cactatgaca 2760 agtcaaagcc gccctgcagg aatatttttt tcaatatccc tgtatctgtt aaaccaatag 2820 tagaagaaag aattcacgga ctggttagag cgaacattat tgaaccagtg gaggagggaa 2880 tggatacatc attctgctct tctatgcttg ttgtgccgaa gggcaaggaa gatattcggc 2940 ttgtcataga tctgaggggc ccgaatcgct acatctatcg aacaccgttt gcgatgccca 3000 cgttggaaaa gatattggca gaattaaatg gtgcagcatg gttttccaca attgatttgt 3060 caaatgcgtt cttccacatt gagttggatg aagaatcacg gcatttaaca aatttctgca 3120 ctgaattcgg catgttccgg tgcgtacgaa tgccatttgg attgtgtaac gcgccagacg 3180 tgtttcaaga gacgctccag aggaaaattc ttggaggctg taaaggctgc agaaactttc 3240 aagacgacat actggtgtac gggtcgacta aatctgaaca cgatgaaaat cttgctgtgg 3300 tcctctctcg attgtctagc cataacgtta agcttaacga cgacaaatgc gtctttggca 3360 gccagcgcgt aaactttctt ggcttcgatc taacgcctga aagatggttg gtggaggatg 3420 aaaagttatc agctatacag caattcagga cacctagcag ctgttccgaa gtaaaaagct 3480 ttcttggatt gataacattc gtcgacaagt ttattctgca ccgggcaact aaaactgaga 3540 atttgcgact attggcatcg tccgatagtt tctactggaa tgatagtgaa gaagaagagt 3600 tctcatacct aaaaaacgag gcgttgaata ctatcaggcg cttgggttat tacagcccgt 3660 tggatcctat cgagctcttt gtcgatgcct ctccgacggg gcttggtgca gtactagctc 3720 aatacaatag tgatgggaag ccccggatta tcgcgtgtgc atccaaagtg ctttctccct 3780 cggaacaacg atatccacac actcagaagg aggccctcgc ggttgtctgg ggtgtggaac 3840 gtttcagtta ctatctgttg gctaggccgt tcgttatacg aacagatgca gaagcgaacc 3900 aatacatctt taatagcgat caccgtttgg gaaagagagc cgttactcga gctgaagggt 3960 gggccttaag gctacaacca tatgattttt caatccaacg tgtgccaggc agcgaaaatg 4020 tagcggacgt gctttcacgg ctgatacctg aaagtcaaaa agcagaatct tttgaagaga 4080 atgaagagaa gcattacctg tatcacagat aacagatgtt gaagtccgat tgtaaaactg 4140 tgtaagacaa tcaacggtca ttttgaatgg caacacaagt agcaacattg tggggcgctg 4200 tcatcgctca gttcccatga cgatgtcagt ggtgaatatg aaatatttca agatacttat 4260 attcaccact gattgacgca attcgttgtg tggcggcgct agtgattgcc ggcagtttca 4320 atttgaatat ttacacaggt tttcactaaa tttttgttct agcataaaca tctgttatct 4380 gtgcctgtat gcactagata ctggttgcat ggaaataaca tggagcgata tcgagttggc 4440 atctgaaaat gatgaagaat tgcgtttggt gcgagaagca atgaagagga aaggatggcc 4500 gaccgagttg agggcatatg aagctcaaag gaaaaagctt caccatctag gatcgttaat 4560 ctttgccaat gaacgtgtgg ttctaccacg agatcttcgt gaaaaagcac tttcttccgc 4620 ccatgggggt catgtgggtg agatggccat gaagcgtata atgcggcaat ttttctggtg 4680 gccaaaaatg tcgaaggaag tttcaaaata cgtaaagaat tgtgaaacat gttcactttt 4740 ggctaagcgc aatcccccgg ttcctttggt atctagagag tttcctgaag gtccttggga 4800 aattcttcaa attgattttt tgtcggtacc caacttcgga actggtgaat ttttgatcgt 4860 agttgatacc tattcacgat accttgcgat tgttgaaatg cgatccatag atgcagatag 4920 taccaacaac gcactaaatg aagttttccg cacctggggg ttaccaataa tactacagag 4980 cgataatgga ccacccttcc aaagcgcttc gtttgtaaaa ttttgggaag aaaaaggcat 5040 taaggtaagg aagtccatac ccctgagccc acagtccaac ggggctgtcg aaaggcaaaa 5100 ccagggaatt atcaaagtgc tggcagcttc aagactggac ggatcgaatt ggagacaggc 5160 acttcaacaa tatgtgcata aacacaatac tctggtaccg cattccagac tcggcgtgac 5220 tccctttgaa ttgatggtag ggtggcgtca tcgtggcaca tttcctagcc tatggagtac 5280 ttcccgcaat gaaaagattg atcttgagga tattcgagag cgtgacgcag agacgaaatt 5340 atctagcaaa aagtacgcag acagagtacg tggtgcacag gaatcagata tccaagttgg 5400 tgatgtggtt ttgcttgccc aagcaaaaaa atcaaagacc gatccggtat tctcttctga 5460 acgcttcaag gtggtagcca gggatggagc aaaagttgta gtggtaaatc aaacgggcgt 5520 tcaatatgcc agaaacattc aggagattaa gaaagctcct ggatttgtaa tccagccaac 5580 tcagcctgtg cctgccggaa aagcggcgga tgatgaagcg gaaattgttg agccggttac 5640 tgcagaggaa actgctgctt ctccttcgtc agaaaacgcc gatctagtga tgcaacctga 5700 tggacatcgg tcactccgtc acagaaaatt gattgctcgg ccaagaagat ttgacgataa 5760 tttcctttac tatgttttcc aatagtgtct gtaatgcttg gcggcgtact ttactctggt 5820 gacactgcgg gaggttgacc aaagtttttg ttccttacta tggaaaccat tagagcaaag 5880 aggtacttcc gacgtacttt gatttgaact gcctcatttg cgccctcaaa gttacctgaa 5940 aacgaaacat tcaaacagaa tagagaaggg aaaaga 5976 // ID CR1-48_BF repbase; DNA; INV; 2053 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-48_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-48_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2053 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2053 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1619-1619 (2009). XX DR [2] (Consensus) XX SQ Sequence 2053 BP; 592 A; 417 C; 357 G; 687 T; 0 other; atcgtactga ctaatcctaa aagattttgg tccttcttcc actctaagac aaagtctaaa 60 agtctaccag acaaaatgaa acttcaggat gagtatgctg agactgctca ggagaaggca 120 gaattgttca atagcttctt ttactctgtt tttaactgta ccgatgggga aaccccgttg 180 cccgacttga ttgatacttc tagtgtgcca aaaatctcat cagttacatt ttctgtcagc 240 accattgaag ctattctcaa aaacctagaa gttacaaagg ccacagggcc ggacgacctc 300 ccagcgatgg tattaaataa gtgtgccgct caattagcac catctttgga aaaactgttt 360 caactatgtg tgagcaatgc aaaaattccg agtatgtgga aaaaagctaa tattactcct 420 gtttttaaga agggggataa gagtgccgta tctaattaca gacctgtgtc tttattatgt 480 atcacaagca aagttttcga aagatgtatt ttcaactctc tttatccatt attgtcaaac 540 actctttatc cccttcaaca tggtttcgtc aagggtaaat ccactaccac ccaactttta 600 gaagtttatg atgaaatagg tgaagtactg gataaatctg ggcaagttga cgtaatattc 660 ttggattttt gcaaagcatt tgatagtgtc tctcataaac tccttgtaca caaattgaaa 720 tcttttgggt tctccggaca actacttgct ttatttaacg actatctttc agaacgtttt 780 caaagaactg tcatagaagg cgaaatgtct gattacttac ctgtgttgtc aggtgtcccc 840 cagggatcca tattagggcc attcttgttt ctattattcg ttaatgattt acctgactgt 900 gttgaatttg gtaaaatggc catgtttgca gatgacgcta aatgttttag aagaattact 960 tcgatttttg attgttttaa atttcagtcc gacttagatc gtcttgtggc ctgggggaaa 1020 acctgggaga tggcatttca tccatctaag tgttcagttg ttagtatgac acgtaaaact 1080 tcacctgtag aataccctta tagcatgtca gattctgtac tcagtcgaca agatagttat 1140 aatgacttag gagttcatgt acaacataac ttatcctgga attctcatgt acttaagaag 1200 atatcaacat gtaactcaag acttgcaatg atcaagagat cagtaggctt taatgcccca 1260 tttactgtta aactcaattt gtatcgatca ctggtgatac cacatttaga ctactgttct 1320 caggtatggg caccgcatac aagattgtat ctgcgcaaag tagaaggtgt tcaaagacgt 1380 gctaccaagt acatctgtaa tgattacgaa cttaactata atgacagatt aacccatacc 1440 aaccttctac cattgtgtta cagacgagaa ctttttgata tcttatttct tttcaaatgt 1500 ttccgtaacg tgtactgtac caacatttct catgtcctca atgtatccag acctactaga 1560 tcacttcgct ctacagacca atttcaactt gtaccacggc catgtaaaac tgaatcattc 1620 tcattctctt actgtagtcg tattgccgtc atctggaaca atttaccact agatattcgc 1680 caacatatgt actcaaacct cactctacag tccgttaaaa aactcttgat ttcatattat 1740 agatcccgat ttagtgacac atttactgtt gataacttat gtacttggac gaccacttgc 1800 cgttgctcat cttgcatagt gacatagctt aaccgtttac cctgttttcc taatcatttg 1860 accttattac tctctcattt ttttaatgta tttcttttag tatttgatgt cagacttgtc 1920 tttcatttat tgtttttatc tcaactttgt atattttctg ggggagtcgg cctcgtagag 1980 gaactagatt cctgttgccg gctcccctca gattgtacgc aatctgtgaa gtggaataaa 2040 taaataaata aat 2053 // ID Proto2-1_BF repbase; DNA; INV; 4430 BP. XX AC . XX DT 10-JUL-2009 (Rel. 14.07, Created) DT 10-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus Proto2-1_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4430 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1554-1554 (2009). XX DR [1] (Consensus) XX CC Proto2-1_BF is a very young family of non-LTR retrotransposons. CC It belongs to a novel clade of metazoan non-LTR retrotransposons CC called Proto2. This clade includes families of non-LTR CC retrotransposons present in the hydra (from Proto2-1_HM to CC Proto2-5_HM), annelid (from Proto2-1_CS1 to Proto2-8_CS1), and CC hemichordate (Proto2-1_SK) genomes. A model Proto2 non-LTR CC retrotransposon is 4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in elements CC from all species mentioned above. ORF2 codes for a protein CC composed from the AP endonuclease and reverse transcriptase CC domains. It appears that the Proto2 clade is a clade ancestral to CC the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 248..1498 FT /product="Proto2-1_BF_1p" FT /note="ORF1." FT /translation="MPRCEDCREAGAQPCQGDLVLCQRCSDKRFPPSGSGS FT EAVCKDVIINDLLCFTVNKMDTLPFDIITKLCTETYMDEEIESAKRLVFDT FT CKPDERYIKRKGGDKSTANMEDIQRVLHSTAPPSLPTFVSATLHLPAVSLE FT HVDISVCMQELQIMRQEMKLIRDCSIDSVRVQTELVALRKEIWELKNRPSS FT APPAPSSSAPPAPSSSAPPASSTSAPPVVSTSAPPAASTLAWPKESYADAV FT SDTCPSETIPKSLHAGLSGRQVERAPTTKSSQASRRRGYSSDSRALARPQS FT TSAGGHSDLDSEGFRLVQRKKKRSRAVVGTAVSSSLSAVKSRPAEIFVTRL FT EPDTLTQDVERYLTDNLSQKCAVSCSKLTTKYDGYSSFRVSVDYNALSEVL FT SPSFWPCGILVRRFHSRRRSSTS" FT CDS 1504..4329 FT /product="Proto2-1_BF_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MEQISKLNVVSYNCRSVKNSLQTVKDLCVTADVILLQ FT EHWLLPDDLSYLSNIDGNFLAHGTSPIDMSEGLIHGRPYGGVAVLWRKSIG FT AFVNIDDLNDDRILALRINSDGKELLIICVYMPTESHNHTEIFMEKLGKIE FT SFIDESGIPNVLIMGDFNTRPDGTFAPLLYNFCLDGNLHISDIEMLPEDTF FT SYLSDAHRSTSWLDHCVCSDSVHQCISEMCVLYDFVSSDHFPLRATLDWST FT VPVFTNTADDVDSPASRFNWERASNDVLLQYHTCAENLLSEVMLPIEAIRC FT ADPNCRDEHHRESLDSFSYEIINKLQEASKTSLPGRKRTVSHLIPGWNEVV FT KEYHSAARDAFTLWRENGRPRHGAIWELLRKTRAKFKYSLRQCRRFEAQHR FT ANALATCLHKGDSRGFWSALRSQNGSKSSIPCTIDGHSGASDIANMWKNHY FT SALLNSVNSTAHKSHVNSTIQSGVSDNPGMSVTVDEICDAINELSNGKACD FT ANCISAEHLKYAGPRLHALLSLLFSAVLRHGYVPPRLLNTVLIPIVKKKNG FT NVTDKDNYRPIAVSSPICKVLERLLLRRLDEFLHSPDNQFGFKKQHGTDLC FT VFALKEVIRYYQKRGSPVFVAFLDASKAFDRVNHWLLFKKMIDRNVPLYLI FT RLLVAWYNVLTFRVRWGNVLSDNFTATNGVPQGSLISPFLFNVYVSDLSVE FT LNCSGVGCYIGNVIVNHLLYADDICIICPSVKGLQKLMSLCERFGSENHIV FT FSKSKSQCVYFPCGRLKLHGPPPSVTLYGTELQYVTSVNYLGNQITSDLSD FT DESIRYHVRGFYTKANSLIRKFRFASPGVKRVLFDAYCSSIYCAQLWCQYN FT LGTIRRLRVAYNNALRIVLGYRTRDSASQMFVTHGIDTFVARRRKLTHAFV FT QRIATSTNNIISRLYHSDAFFHSKFWTHYVNLVFKSPG" XX SQ Sequence 4430 BP; 1115 A; 1042 C; 1020 G; 1253 T; 0 other; tggttcgatc caagatggcg gcgacttttt gatagctgcg tggctctgtt cctcaaaatc 60 agaatttgtg gcactttaac gacccatcga cctgctatcg actcaccaga tgaagaccaa 120 acatcctggg ttactcttaa agtgacaatt cgagacaatt ctgacgatcg gcttcacatt 180 gaaaggtttt ctgcaccaag ggagggcact gtatggtaag cccgcggcac ccctccccac 240 tcccaccatg cctagatgtg aagactgccg tgaggcagga gcccagccct gtcaggggga 300 tcttgttctc tgccagcgct gttcagacaa gcgtttcccc cctagtgggt ctggatccga 360 agcggtatgt aaagatgtta tcatcaacga tctgttatgt tttactgtta acaagatgga 420 cactcttccg ttcgacatca ttacgaagtt gtgcaccgaa acgtacatgg atgaggaaat 480 agagtcggct aagagacttg tatttgatac atgtaagccg gacgaacgct acataaagcg 540 aaagggagga gacaagtcta ccgccaacat ggaggatatt cagcgagttt tgcattctac 600 tgctcctccg tcgttaccta ctttcgtgtc cgcaactctc cacctccccg ctgtgtcact 660 ggaacatgta gacatttctg tctgcatgca agagcttcag atcatgaggc aggagatgaa 720 actgattagg gattgctcca tagattcggt tagagtgcag actgagcttg tggcactcag 780 gaaagagatc tgggaattga aaaacaggcc aagttctgct ccgccggcac cgtccagttc 840 tgctccgccg gcaccgtcca gttctgctcc gccggcatcg tcgacttctg ctccgccggt 900 agtgtcgact tctgctccgc cggcagcgtc gactctcgcc tggccgaagg aatcctacgc 960 cgatgccgta tcggacacct gcccgtcgga gactatccct aagtccctac acgcgggctt 1020 atctggtcgg caggtggaga gggcgcctac cacaaagtca tcgcaggcat cccgacgtcg 1080 tggctactcg agtgactcta gggctctagc ccgaccgcag tccacctcag ctggtggcca 1140 cagtgacctt gactctgagg gcttccgcct ggtgcagaga aagaagaaga ggtccagggc 1200 agtggtgggt acggcggtgt cgtccagtct gtctgcggtt aagtcaagac cagcggagat 1260 ctttgtgacg cgccttgaac cggacacact gactcaggat gttgaaaggt acttgactga 1320 taacttgtcg cagaaatgcg cagtgtcctg ttctaaacta accacaaaat acgatgggta 1380 ttcttcgttc cgtgtctctg ttgactacaa tgctctgtcg gaagtcctgt ccccatcttt 1440 ctggccatgt gggatcctag tgcgccggtt tcacagtagg cgacgctctt ccacttctta 1500 actatggaac agataagtaa actgaatgtt gtatcataca actgtcgatc tgtcaaaaac 1560 tctttacaaa ctgtgaagga cctatgcgtt accgccgatg ttattttgct acaagaacac 1620 tggcttttgc cagacgattt atcatatcta tctaacatag atggaaactt ccttgcccat 1680 ggtacttccc ctatagatat gagtgaaggt ctgattcacg gccgacccta cggcggcgtg 1740 gctgtccttt ggcggaaatc aattggtgca tttgtcaata tcgacgacct caatgatgat 1800 cgcattttag ctttaagaat taattctgat ggcaaagagc ttctgattat ctgtgtgtac 1860 atgccaacgg aatctcacaa tcacaccgag atttttatgg agaaacttgg taaaatagaa 1920 tccttcatag atgagtctgg catacctaat gtactaatta tgggtgactt caatacacgg 1980 ccagacggaa cgtttgctcc tctattgtat aatttttgtc ttgatggcaa ccttcatatt 2040 agtgacatag agatgcttcc tgaggacacc tttagttacc tcagcgacgc gcatagatct 2100 acatcgtggt tagaccattg tgtttgctcc gattctgtgc accagtgtat ttctgaaatg 2160 tgtgtacttt atgattttgt gtcttccgac catttccccc ttcgagctac tttggactgg 2220 tccacagtcc ctgtgtttac aaacacagct gacgatgtgg actccccagc cagtagattc 2280 aattgggaac gtgcgtctaa cgatgttctg ttgcagtatc acacctgtgc tgaaaacctt 2340 ttgtctgaag ttatgcttcc aattgaagca atcagatgtg ctgacccgaa ctgcagggac 2400 gaacatcaca gggagtctct tgacagtttt tcatatgaga taattaacaa actacaggaa 2460 gcttcaaaaa cttctcttcc tggtagaaaa cgcactgtat cccacttgat tcctggttgg 2520 aacgaagtag ttaaggaata tcatagtgct gctcgagatg ctttcactct gtggcgggaa 2580 aatggtcggc ctagacacgg agccatctgg gagcttttac gaaaaacaag ggcgaaattt 2640 aaatactccc ttaggcagtg ccgccgtttc gaggctcaac accgggccaa tgcgttggca 2700 acgtgtttac ataaggggga tagtaggggc ttctggtcgg cccttcgctc ccagaatggc 2760 tccaaatcta gcataccttg tacaattgat ggccattcgg gtgcatccga tattgctaac 2820 atgtggaaga accactatag cgcactgtta aactccgtca attctaccgc acataagtca 2880 catgtaaaca gcactattca atcaggtgtc tcggacaatc ctggaatgtc cgttactgtt 2940 gatgaaatat gtgacgccat taatgagctg tccaatggta aagcatgtga tgctaattgt 3000 atttctgcgg agcacctgaa gtacgcaggt ccccggctgc atgcgctatt gtcattgttg 3060 ttttccgctg tgctcagaca tggttatgtt ccacctaggt tacttaacac tgttctcatt 3120 ccgatcgtta agaagaaaaa tggcaatgtg actgacaaag acaactatcg cccgatagcg 3180 gtctccagtc ctatatgcaa agttcttgaa agactgcttt taaggcgcct agatgaattt 3240 ttacactctc cggataacca gtttggtttt aaaaaacagc atggcacaga cctttgtgta 3300 tttgccttga aggaggtaat ccgttactac cagaaacggg gctctcctgt atttgtggcc 3360 ttcctagacg cctctaaggc cttcgatcgt gtaaatcact ggctgttatt taagaaaatg 3420 atcgacagaa atgtaccact ttacttgata cgtttacttg tagcctggta taatgtttta 3480 acctttagag tacgttgggg caatgtcctt tctgacaact ttactgccac taacggagtg 3540 ccacaaggta gcctgatatc tccttttttg ttcaacgtct atgtctctga cctgagtgtt 3600 gagctgaatt gctctggtgt tggttgctac atcggaaatg tcattgtgaa ccatctgtta 3660 tacgctgatg acatatgtat tatctgccca tctgttaaag gtctgcaaaa gctaatgtct 3720 ttatgtgaac gttttggctc tgaaaaccat atagtgttca gcaaaagcaa gagccaatgt 3780 gtatattttc catgtggtag actcaaattg cacggaccgc ctccttcagt caccctttat 3840 ggcacggaat tgcagtacgt tacatctgtg aactacctgg gtaaccagat tacctctgac 3900 ctgtctgatg atgagtcaat tcgctatcat gtcagagggt tctataccaa agccaattcc 3960 ttgatacgaa aattccgatt cgcctctcct ggagtgaagc gcgtcctttt tgatgcatat 4020 tgttcctcga tatattgcgc tcagctatgg tgtcagtata acttgggtac gattcgcagg 4080 ctacgcgtag cctataacaa tgctcttagg attgtgttag gttatcgaac gagagatagt 4140 gccagccaaa tgtttgtgac acatgggatt gatacttttg tagctagaag gagaaaacta 4200 acgcatgcat ttgtacaacg catcgccaca tctacaaaca acattatctc ccggttatac 4260 cactcagatg ccttcttcca tagcaaattc tggacacact atgtgaatct tgttttcaaa 4320 agcccaggct aatttaactt ttttattgta cataaagtgt atgtgtattg ttgtattgta 4380 tgtttttatg tgatatgggc caagagcctg caataaagtt gaaattgaaa 4430 // ID Gypsy-159_AA-LTR repbase; DNA; INV; 1071 BP. XX AC AAGE02017724; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-159_AA_; KW Gypsy-159_AA-I; Gypsy-159_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1071 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017724; Positions 4764 3694. XX SQ Sequence 1071 BP; 329 A; 248 C; 235 G; 259 T; 0 other; tgttaccgtt tgtaacaata agcctgttag tttgtaagtt tttgtatttc ctattattat 60 tatttatatg tctccgtgaa attttaaatt taaatcctat tgcatgccat aaaaacaccc 120 ctacacctga tataaattta gtacactaca ctacaataaa aactagctat aagtaaatgt 180 ataacagtta gagttcagtg gaggaaaagt aagagaaaaa gactacccaa cgatagaggg 240 attaggcatc ggagaaggat agattcggtc tatttcctat aaggatccaa ttcgaagtgg 300 acgttagaag ttaaaagaaa gttaaaagga agttaagaga gttaagaaag agaagtgagg 360 agcgaagaag cgaatcaggt atgcctgtga ctgcagaaaa attgttagac cctaaaaggt 420 tatattcgtg aatttaggac ccgttgaggt taattctgga aactccttcc ggattgagtc 480 ccttagagcc cgctagggca tttgaatcca ggtagaacga agcctgctcc aaggtagaac 540 gaagcctgct ccaagaccaa gtgaccgagt acgtagaatc ccgtggccga cgacccggtt 600 tcgaaggacg ctaacctcca ggacgggtcg ttcttaggag tcgaggccaa cgagcgtcca 660 ttcccagcac cccgttcagc ttttccacgc tgtcagctcc gtcgaacctg ctccgtagta 720 taccgtgcca ccatccctgc agagatccct gaatataccg gccggttagg taattagaca 780 tagcgacact acactgcaca cccacccaca cagatatgta cattgagggt aatatacagg 840 ctacattata aatcaccaaa gctttcattt acacccatcc caatatccga ggtctgatta 900 gcctaataga gtactccagc cgaccttgag accagagagg tgctccagat cgcgctagcc 960 aagaaaagag gtccttgacg cacccaacct caaaaggcag gccgttgctt agccaagctt 1020 gccggggtct ttgtcctggt aacccacgtg ttgcccaacc tagcagtaac a 1071 // ID BEL-83_AA-LTR repbase; DNA; INV; 302 BP. XX AC supercont1.342; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-83_AA_; KW BEL-83_AA-I; BEL-83_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-302 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.342; Positions 586682 586983. XX SQ Sequence 302 BP; 88 A; 69 C; 47 G; 98 T; 0 other; tgttgagacg aagtttagcg aagttaccct aattgatcgc ggtggctttc atccaaagga 60 ttcctttcaa ttcccaaaaa tctctctctc tctccaatac cttacaatgt gacaatcgtc 120 gcatctattg aattgtatac cgtaattgtc tccatatgaa atgatattta tataaaagga 180 agaaatgaaa tacatcaagt tagttgaaat tgtaagccta aacccgtctc gcgtatcgtt 240 atttccttcc gttcacattt ccgtaactca tcggcgcgct tttttcgtac ctcagaaaac 300 ca 302 // ID PNL_SM repbase; DNA; INV; 2616 BP. XX AC . XX DT 28-JUN-2007 (Rel. 12.06, Created) DT 03-JUL-2007 (Rel. 12.06, Last updated, Version 1) XX DE Penelope-type retrotransposon from Schmidtea mediterranea. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; PNL_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2616 RA Jurka J.; RT "PNL_SM: Penelope-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 7(6), 364-364 (2007). XX DR [1] (Consensus) XX CC Present in several thousand copies in the genome. XX FH Key Location/Qualifiers FT CDS 400..1923 FT /product="PNL_SM_1p" FT /translation="MNLKKRLAKSFVTKNLDLPNFYHLIKINKDVPSIKNK FT PIVSKTNGSTTKTSWLLSKILKSLLNNVLYHLQSSDCLLARLNITGKTTHS FT KFKYPFSLDIIALYTSINQSEAIECVRGRLENNLLIKEMNTKNIINLLTVI FT LENIYFTFDQKVYKQIKGLPIDSSISGLLAIIFMDYLEQQTLRTIQSIGFY FT ARYLDDIFITTKSKKEAEQIFHIFNEAHRDIKFEIEYQDKNESLNRLDIRI FT KIDGNTPTYEIYKKSTHKYIFMHQKSALPYLQKVNVIKNEKQRIANRYTEG FT KKNYKKQLKFDKALRTNGYHNEDIERMNSKPKIPINRTQKKKKYFYLNIPF FT ISDTLDFKIRKELNHTSLPIRISHKSWTLKQALANNTEKEKICTMKNSKTP FT KKICFRKNVVYCIICNACKEIYIGNTFRNLHQRIKERLDPAGKSSHLQKCQ FT PKKEIKSISTKILARDNIQKNFRFKKAIIILDKHPSINTKEEISEIASLIC FT CKLLTLILLLKD" XX SQ Sequence 2616 BP; 1127 A; 421 C; 357 G; 709 T; 2 other; tcccgcaatc aaatgatcaa agccaaattg ataagctccc aatatataca aacaccgata 60 aactatacct tcccaatccc gatccagaat taaattcaac actttcaaag gtttgtcgaa 120 agattatgga tgttgttgaa aaagaaggga aaaggaaacc atataaatca aatctgtcga 180 acaacgaaca aggcgtattg aaagcactta aagctaaacc atatgtatac ttaccaagtg 240 acaaaggagg tgaattttgc gttattgaga aaagccgtta ctgtgaagct gcctataaac 300 atctcagcga tagtaacaca tacaaaaaga tcagtcgcat gtcaccaaac actatcgaaa 360 tgaagatcaa caaagtatgg aaagacatcg ccgcaaaaaa tgaatctcaa aaaacgacta 420 gcgaaaagct ttgttacaaa aaacttagac cttccaaatt tctaccattt aataaaaatt 480 aacaaagatg tgccttccat taaaaataag ccaattgtat caaaaacaaa tggatcgaca 540 accaagacaa gttggctcct ttcaaaaatc ctaaaatcat tactaaacaa tgtactgtat 600 catctccaga gtagcgactg cctactggca agattaaaca taactggcaa aaccacccat 660 agtaaattca aatacccctt cagtcttgat atcatagcat tatacacatc tatcaaccaa 720 agtgaggcta tcgaatgcgt gagaggaaga ttggaaaaca atttgttaat aaaggaaatg 780 aataccaaga atatcataaa ccttcttact gttattctcg aaaatatcta tttcaccttt 840 gatcaaaaag tatataaaca aattaaaggt ctcccaatag attctagcat atcaggcttg 900 ctagcaatca tatttatgga ttacctcgaa caacagacac tgaggactat acaatccatt 960 ggtttctacg caagatatct agatgatata tttatcacga ccaagagcaa gaaagaagcg 1020 gaacaaatat ttcatatatt caatgaagct catcgggata taaagtttga aatagaatat 1080 caggacaaaa atgaatccct taatcggcta gatataagaa tcaaaataga tggaaacaca 1140 ccaacctacg aaatctacaa gaaatcaact cacaaatata tatttatgca ccaaaaatca 1200 gcacttccat acttacagaa agtgaatgtt ataaaaaacg agaagcagag gatagccaat 1260 cgatatactg aaggaaaaaa gaactataaa aaacaattaa aattcgataa agccctacgc 1320 acaaacggat accacaacga ggacatcgaa agaatgaatt caaaaccaaa aatacctata 1380 aacagaacgc aaaaaaagaa gaaatacttt tatctaaaca ttccatttat cagtgatacg 1440 ctcgacttca agatccggaa agagctcaat cacactagcc ttccaattcg tatatcacac 1500 aagtcttgga cactaaagca agcactggca aacaacacag aaaaagaaaa gatttgcaca 1560 atgaaaaata gcaagacacc aaagaaaata tgctttagaa aaaacgttgt ctactgcata 1620 atctgcaatg catgcaaaga aatatacatt ggaaatacat tcaggaatct acaccaaaga 1680 attaaagaac gcttagaccc tgccggaaaa tcctcgcacc tccaaaaatg ccaaccaaag 1740 aaagaaataa aaagtatttc aaccaaaatc ttggccagag acaatatcca aaaaaatttc 1800 agattcaaaa aggcaattat tatcttagat aaacatccat caataaacac caaagaagaa 1860 atcagtgaaa tcgccagttt gatttgttgt aaactcttaa ctctgatact tttacttaaa 1920 gattaaaaca attttatagc aaaaaaggaa aactttcagg aatgaaaatc agagtagaat 1980 tgtatttagg tcttgaagtt gtttgtggag ggttgtttga aggctttttt ggttgcgttt 2040 ttaagaggaa agcttttcat atacatcttt cataagactt taaaaactca ggaactattt 2100 attaatttgc tctgaatttt ataaaaaaat ttctttattt ttaattttat taagaaattt 2160 gaaggacaaa aaaaaatgac attagaagca gttatttaac ataagatata gaaaacattt 2220 taaaatttta aagttttata taaaaaaatt aaagtttata aagattaaat taataaatgt 2280 taaattaaac aaaaaagaga gaaaaaaaat ttattttaaa aattttttaa attaaatatt 2340 ttataaaaga tatttaaatt taaaatttta ggtttaaaat attttaaaat taaatttaaa 2400 ccaatcaaaa agtaaatttt aatgaaaatc agaaggtatt ttggtatttt tctcattgta 2460 tatagtagtt ccaaaagaaa ctgaagagga ttatttaata atcgaaactt ttttcaacaa 2520 tataattatt acttcgcagc aagggcttat tttgttaaat aataataata atattaaywt 2580 aaataataat aataataata ataataataa taataa 2616 // ID Gypsy-28-LTR_NVi repbase; DNA; INV; 155 BP. XX AC . XX DT 11-MAY-2009 (Rel. 14.05, Created) DT 11-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-28-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-155 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 993-993 (2009). XX DR [1] (Consensus) XX SQ Sequence 155 BP; 50 A; 24 C; 13 G; 68 T; 0 other; tgtagtatat ttcaattaac gttcttttta agttctattg tttttcatgc aacttcatat 60 tattgattgt aaataataac tgtcattcaa aataaaccta tctttgagta aacatctttg 120 cactttctct aagtttactc tatacatata caata 155 // ID SIRE3_TC repbase; DNA; INV; 923 BP. XX AC AF227616; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Trypanosoma cruzi clone SIRE repeat region. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE element; KW SIRE3_TC. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Vazquez M., Ben-Dov C., Lorenzi H., Moore T., Schijman A. RA and Levin J.M.; RT "The short interspersed repetitive element of Trypanosoma cruzi, RT SIRE, is part of VIPER, an unusual retroelement related to long RT terminal repeat retrotransposons."; RL Proc. Natl. Acad. Sci. U.S.A 97(5), 2128-2133 (2000). XX DR Genbank; AF227616; Positions 1 923. XX SQ Sequence 923 BP; 233 A; 122 C; 252 G; 247 T; 69 other; nttggggggg aagggaaggg ggcccnaaaa agnaaaccnn cttttccccg gggggtgggc 60 ggatttaatt aangaggtgg gaaaganaag ttttcccgaa tnggaanggg ggcagtgngg 120 gcaaangcaa taaaagggng nnngttnaat tnattaggna cccccaggnn ttaaaaattt 180 aaggtttccg ggttnnaaag ttggggggga attgggggcg ggnaacaatt tttaanacng 240 ggaaacnggt tntgnaanag gattaaggaa ttttaataag aanttcaatt ttggggaatt 300 tggccnttgg ggggccaana attttggncc cgnnggnggg aaggnggggn ggccttttna 360 aggccagggn cagggagggg gcgcaanatg tttagatgcn acntgggtgg ccgttttttg 420 ttgagcccac ggggaccnaa aangtagccc ntatagaata ggatnggttg tngtttantc 480 gttttttaag gcntttttcc caattnttgt taggggcttg tgaagagaat gattgcggga 540 gagctggtta anntaanggg tatatcctga tagatcgagt aacatttntt tatggaantt 600 ttctaccgta tgaatttttg ggaagnaaag gactttaagg gttggggaac cgatagaggc 660 cagataattt tttattttta ttttgccatt ccacccagcc cctcgattcc caccttgcgg 720 cggggtcttg tggttggagg accccaaagt ctgccanttc gtaagtaata atatttcaga 780 tntgagnaca aaagaccacg ngngtagtcn acagaatata tatatatata tatatattat 840 aagnnaaaca tgaggcaatt aactgcctct ggggtacctn tttttcctta tcttgtttga 900 tttnnataga taattttagg nga 923 // ID Dpalli7 repbase; DNA; INV; 291 BP. XX AC GU229939; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mellifera subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dpalli7. XX OS Drosophila pallidipennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; pallidipenis group. XX RN [1] RP 1-291 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229939; Positions 1 291. XX CC Clone Dpalli7. XX SQ Sequence 291 BP; 73 A; 68 C; 75 G; 75 T; 0 other; cggcccaaac cattgccaag cccagattga cggccaggaa agttttgctg cgtgtttggt 60 gggattggta ggcaatcatc cactatgcgc tcaactatgg ccaaactttt aattcggtta 120 tgtactgtga gcaactcgac cgtttaaagc aggctattga ccagaagcgg tcagaattgg 180 tcgttaggaa tggtgttgag ttcaattagg acaacgctcg gcccacacat ctttgatgat 240 tcgccagaag atttgggagc tcgaatgcga tgtcctatcg cacccaccgt a 291 // ID Gypsy-10_DPu-LTR repbase; DNA; INV; 372 BP. XX AC scaffold_141; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_DPu_; KW Gypsy-10_DPu-LTR; Gypsy-10_DPu-I. XX NM Gypsy-10_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 736-736 (2010). XX DR Genome; scaffold_141; Positions 230242 229871. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 372 BP; 77 A; 85 C; 95 G; 115 T; 0 other; tgtcacgagg aactagggct tacggccggt tcttcgtaca ggacacccct cattggattt 60 cactggtaac tggtgcacga ggaactaggg ctcacggccg gctcctcgta tacccttcat 120 tggatttcct ccccagctag aaaggttatc agacggttag gcctaaacat acatgtgtat 180 ggcctggccg tcagggaacc atgtaattct agcagcgttc tatgtgggtt atgattatct 240 ttgtgggttt gggttgtggg tcgttttcct ttaaatagac ggggaaagtg agtcttcgct 300 ctctcttcct tcagacgtgt tccgttgtag ttcttacttc aaatacacgc ttgttaaagt 360 gacgttgtca ca 372 // ID hAT-11_HM repbase; DNA; INV; 3937 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3937 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2000-2000 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 606..2600 FT /product="hAT-11_HM_1p" FT /translation="MSIVNTFSFIFQTNIIMQSFIWSLYKINDASDVYANC FT NICKKMIKRGTMEAGQKCFSTTPLHNHIKINHPKEYHIERNKNEQKRKAQA FT VNKITPEFFSTSTTVKKETVLETSKQTSIDEYLKSQKVWSINDSRSQTIHR FT KIGLMMALDNQPFTLVEDTGFNNLMNHLEPRYCMPSRKFFSKTIIPQLYME FT LNNKISDKLIKTDYISFSSDIWTCPISHESFISLSSHSIDKDFNRFDVVLH FT ASHFPESHTGINISKKLESMWDSWKIQAERRHILVRDGASNMISGSNLAEI FT PAIHCTIHLLQLVVSDSISENIVIDVLSKCRRLVTHFNHSSLACNNFKQIQ FT LQQNLDPLCLVQDVPTRWNSTYLMLDRLNKLKIPVQLYLAERSDLMPFSTL FT EWTLILNIIKLLKPFFQLTQEMSSEITTLSSVIPNMCSLKKFMSKCQVDLS FT IQETKNRLTTSLKNRFFSEINDAKSLNILKNRYYVIATSIDPRYKFSFFED FT DIKKQAKYWLINEILSFEKTLVCSEEVDFSVNEEPSQLNFSINTEKEDTLE FT ACFKEIINATSEKTLPFKKLKLEKKSNEILIRKEIKKFYSEPLVDRKLCPI FT KTWMQEERFPLLKTIASKHLCTPASSIFSERLFSEYGNIYEKKRSRLLPKT FT GEMLLFIHHNGRKIE*" XX SQ Sequence 3937 BP; 1471 A; 553 C; 588 G; 1325 T; 0 other; tagaggtgta ccggatatag cattttacgt atccggccgg atacggatat gccggattta 60 aaatttcata tccggccgga tatgccggat ataccggata tattgaaaat taggatagga 120 tttagttttg atgtttgttt ttagtcaaat tgggaaagtg caaaatagtg tgtaaatcca 180 caaaaaaaaa aaaaaaaatt gtattacatt ttgcatgtct gtctgcataa ttggattatc 240 tgtttttatt gtagtttgaa aatttattaa caattaatta cgtagcaata actttaaaat 300 taggttgcta tctaaatttc caaactatgt attattactt taattttttt gttatgtttt 360 tttattgttt tctttacaat gtaacaatgc atttggaatc taacaatgtt tacattgaaa 420 gttaacaatg caatattaat ttcaatgtag tatttataca ctgtatagat atcaaaatta 480 ccatctgtat aatacagata aatatcttgg tatcaaatgt atgtgtataa ttattataca 540 catacatttg ataccaagat atttattaaa tatttaacta ttatactaaa atagttaaat 600 atttaatgtc aattgtgaac acgtttagtt ttatatttca gacaaacata ataatgcaga 660 gctttatctg gtcattgtat aaaataaatg atgcaagtga tgtgtatgca aactgtaata 720 tctgtaaaaa aatgattaaa cgtggaacta tggaggctgg gcaaaaatgt tttagtacaa 780 cacctctgca taaccacatc aaaataaatc atccaaaaga atatcatata gaacgaaata 840 aaaatgaaca gaaaagaaaa gctcaggctg tcaataaaat cacaccagaa tttttttcaa 900 ctagtactac agtaaaaaaa gagactgttt tggaaacttc taaacaaact tcaatagatg 960 aatatttaaa gtcacaaaag gtatggagca ttaatgactc taggtcgcaa actattcatc 1020 gtaaaattgg gttgatgatg gccttagaca accagccatt tactttagtt gaagatacag 1080 gttttaataa cttaatgaac catttagaac ccagatattg catgccaagt agaaaatttt 1140 tttctaaaac aataatacct caactttaca tggagttaaa taataaaatt tctgataaat 1200 taataaaaac agattacata agcttttctt cagatatatg gacatgccca atttctcatg 1260 aatcatttat ttcattgtct agccactcca tcgataaaga ttttaacaga tttgatgtgg 1320 tattacatgc ttctcatttt cctgaaagcc atactggaat aaacatatcc aaaaagttag 1380 aaagtatgtg ggatagctgg aaaattcaag ctgaaagacg acatattctt gtaagagatg 1440 gtgcaagcaa catgataagt gggagtaatc tagctgaaat tcccgcaatc cattgtacta 1500 ttcatttact tcaattagtt gtttcagatt caataagtga aaatattgta attgatgttc 1560 tttcaaaatg tcgtcgactt gtaactcatt ttaaccattc atctctagca tgtaacaact 1620 ttaaacaaat tcagctgcaa caaaatctcg accctctttg tcttgttcaa gatgtcccaa 1680 caagatggaa cagcacttat cttatgcttg atagattaaa caagttaaaa attccagttc 1740 aactatactt agcagaacgt tctgatttaa tgcccttttc tacattagaa tggacattaa 1800 tattaaacat tattaaactg ttaaagcctt ttttccaatt aactcaggaa atgagttcag 1860 aaataacaac tttgtcttct gtaattccaa atatgtgctc attgaagaaa tttatgtcaa 1920 aatgtcaagt tgatttaagt attcaagaaa caaagaatcg gcttacaact tctttaaaaa 1980 atcgattttt ttctgaaata aatgatgcga agtctttaaa cattttgaaa aataggtact 2040 atgtcatagc tacttcaatt gatccaagat ataagttttc tttctttgag gatgatatca 2100 aaaaacaggc taagtactgg ttaataaatg aaattttatc ctttgaaaaa actcttgttt 2160 gttctgaaga agtagatttt agtgtaaatg aagaaccatc acaattaaat ttttcaatta 2220 atactgaaaa agaagacact cttgaagcat gtttcaaaga aattataaat gcaacatctg 2280 aaaaaacatt accttttaaa aaattaaagt tagagaaaaa atcaaatgag attttgatta 2340 gaaaagaaat aaaaaagttt tattctgaac cacttgtaga cagaaaactc tgtccaataa 2400 agacctggat gcaagaagaa cgattccctc ttttaaaaac aattgcttca aaacatctat 2460 gcactcctgc atcatccata ttttctgaaa gattgttctc agaatatggt aatatctatg 2520 agaaaaagag atctaggtta ttgcctaaaa caggagaaat gcttttgttt attcaccata 2580 atgggaggaa aatagaatga ctctttgcat ttactcattt gttgaaattt tatttgttat 2640 gttctttgat agtattatta actttctaaa acattttgtt tcattgaatt taataatttg 2700 atttaagagg cattcataaa gtatgtatgc aggtacggga ggcgacatta tctaaaaatg 2760 tttaaatatt atctaaaagt gtacaagtgt gtataagggg tagggacaag agggcaaata 2820 tgcaaagtaa gggcaaggtc caggcagcaa atatgtacat ggttgggcca tctttttcat 2880 gaattttact ccaaaatttt tttcaaaatt tttattttaa acaaagacaa tttattgagt 2940 gaaaaaatct ataacaaaat atcaaaaaag aaaaaaaaaa aaaagtttat gatgaaattt 3000 caaaaaaata taataaaagc aaaaaaaaaa tttaaacatt aaaaaaaaaa aatttttttg 3060 attattttga tagaaaattt ttttgcaaaa caaattcttt aaattctcaa tttcgatcat 3120 tttaaatctt aaaaaaattg ttggaaagtt atgactgtgt ttttacccct ctccaaattg 3180 aaggggtaat aaattaaaca tggtcataac tttggaacat aaattaattg atgaaattta 3240 aaacctatcg atgaatttag ctagagtatt agatagtgta atgaaataac taaatgattt 3300 cccttgcatt tgggataggt ggtcgagaaa ataacctcta aggttatttt tgttcccaag 3360 ctctaattac catgattttt tgatgcaaaa cactttcatt ctatggaagc ataaaaaact 3420 taagtttaga gtttgcacaa agcttgataa tgtcatatct gagactttta aaaacatcat 3480 gttagagccc atttagcagc gtgtcggtaa accattttga tttagagttc tcaaaattaa 3540 atttctactt aaaagataaa ttaccctccc ctaaaaaaat gtatttatta aataattctt 3600 aatagttttt caaacctctt tgtgctctgg gaagtttaaa ttgaagatag gcttgtgacc 3660 attaattgat cataagtgat cgtaagattg tgatcattaa gtgatacgta attacaatag 3720 cgatcattaa aacatttaac gttcgctgtt cggaatcaaa tttttagttg aagacttgtt 3780 caaacgccct tcataataac accagaactt aacagcataa agtattatcc ggtatatccg 3840 gccggatata gtatatatcc ggccggatac cggatagtaa attatccggt atttttaacc 3900 ggatatccgg ccggatatcg tatccggtac acctcta 3937 // ID Gypsy-171_AA-LTR repbase; DNA; INV; 226 BP. XX AC supercont1.268; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-171_AA_; KW Gypsy-171_AA-I; Gypsy-171_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.268; Positions 1206201 1206426. XX SQ Sequence 226 BP; 58 A; 57 C; 58 G; 53 T; 0 other; tgtgggagat gggaaatcta gtcccatgaa tttagtgact ccaagcatac acacgttacg 60 gtaacgctat gcgaacacta gcaagtgata ttggcaacac ggtcactgct cgccgagaat 120 cgacggcatc gcagcagtcg tttgctcggc ctctttctca accggaacgt cgaccgagta 180 agtcgcattg gggcgattct tccgcagtgg aataacgatt ttcaca 226 // ID Copia19-NVi_LTR repbase; DNA; INV; 345 BP. XX AC AAZX01010888; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia19-NV; KW Copia19-NVi_I; Copia19-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-345 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1146-1146 (2007). XX DR Genome; AAZX01010888; Positions 5362 5706. XX SQ Sequence 345 BP; 93 A; 72 C; 75 G; 105 T; 0 other; tgttgagaat attcgcgtgt tttcatacgt atgtaaatag ggacacagtc accgcgtgtg 60 tacacactct atggtctcac gcatcgagcg tatttgtgtg agagtgcttc atagcgttac 120 ttatagcgat tttcatcgcg tcagtcctga tcgagtcgtg gtaccgatca gaagcctaaa 180 tcaacttatc ctcagtcgta gcattggttg tcttacataa tcttgtatag tccagggaac 240 gggagcagtg tcgatagatg ttatacaatt attcgcgact tattcgtaag aaaactaagt 300 gtccacaaag aacatctcca ctgtgtaatt aattcccaag gtcca 345 // ID Penelope-3_CQ repbase; DNA; INV; 4117 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Penelope-like element family from Culex quinquefasciatus - DE consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4117 RA Kojima K.K. and Jurka J.; RT "Penelope-like elements from the southern house mosquito."; RL Repbase Reports 11(1), 602-602 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 22 sequences with >89% CC identity. Sequeces 23-1685 and 2435-4117 are terminal inverted CC repeats seen in several copies. XX FH Key Location/Qualifiers FT CDS 1676..3529 FT /product="Penelope-3_CQ_1p" FT /note="reverse transcriptase and GIY-YIG FT endonuclease." FT /translation="ALVAKATREFLRTRKELVVLKADEGNKTVIMNGNDYK FT SKMLALLSDAKTYATANRDPTSSIQSKNDEIVKRLKRLDAINDRLENELMS FT RKALCPRIYGQPKAHKAGLPLRPVVPNMTAPTYALSKFVGQILQSSLQSTY FT NVRDSYSFCDYINGITLPADYVLVSLDVVSLFTCIPFELVRRDVTFNWSDI FT QKHTDINLEIMLEIIEFIMKSCYFTFEGKFYSQIFGTAMGNPLSSPLADLV FT MENLMHAVVKKLDFQPPVLKKYVDDLIVALPLNKLKHVQDSFNSYSQHIQF FT TYELEENRRLPYLDMVLVRTEAQQIRTEWYRKPISSGRFLDFFSCHTTSQK FT VNTAMNFIQRVDKLSTNLNTREKMKIVDRELEINHYPKPLRRRLINCQLCN FT RSSNGNQQQQQQTQQAEPSHQQTNICYRSIPNIPTLSQTIAKTLKQDYPEV FT RLAFRNTNTVGSSFFSNMKDKVPTENHTNVIYAIPCNECESSYIGMTTTQL FT KTRMSSHRSDIKRLDELLQAGHTTDDYKIAELRQKTALLEHSIAKQHSFDT FT KKVRILDQHNRGTALPILEMCHITNNNHTVNKRTDTEGLSIIYAGLLHTLK FT TKNYKHRKPNTVTPGETEQE" FT CDS 1763..600 FT /product="Penelope-3_CQ_2p" FT /note="reverse transcriptase and GIY-YIG FT endonuclease." FT /translation="SRFYCPRQLLVRQALCAFAETLSWLSPPVLKKYVDDL FT IVALPLNKLKHVQDSFNSYSQRIRFTYELEENRRLPYLDMVLVRTEAQQIR FT TEWYRKPISSGRFLDFFSCHTTSQKVNTAMNFIQRVDKLSTNLNTREKMKI FT VDRELEINHYPKPLRRRLINCQLCNRSSNGNQQQQQQTQQAEPSHQQTNIC FT YRSIPNIPTLSQTIAKTLKQDYPEVRLAFRNTDTVGSSFFSNMKDKVPTEN FT HTNVIYAIPCNECESSYIGMTTTQLKTRMSSHRSDIKRLDEPLQAGHTTDD FT YKIAELRQKTALLEHSIAKQHSFDTKKVRILDQHNRGTALPILEMCHITNN FT NHTVNKRTDTEGLSIIYAGLLHTLKTKNYKHRKPNTVTPGETEQE" XX SQ Sequence 4117 BP; 1161 A; 955 C; 900 G; 1101 T; 0 other; caatttttac aagaaattta cgagtagtta tgctctcgac tgttactcta cgattttgga 60 tttggttttt cggctttcag tcatttagca gtgaaaacaa tgtattgttt caatttccga 120 cgtttcggca aatttattgc ctttttcaag gattctaaaa aatatttttt tttgttttta 180 actattagtg cttgttttag gtgggaaaat actgttggtt acttacaaaa acggtttgtt 240 ttcttgttat ttttggggtt ttgggtaaat tgataactgt cttttgttgt tgttcctcag 300 tcttagttgg ctgtaatttt aagggtttgc acaaaggggg gggggggggt gttaaaattg 360 agcaacgttt caagtggttc taagggcact tactgcatag cagcacatat tatgctggta 420 gctgctggtt tcggtttcga aaaaactttt tgtgttgcac tatccacgaa acggtcaatt 480 tcaatcgaac ggagcttaat ggcggtttgt tcaccaaaaa gcgtcaaagg catggctaga 540 atttgggtca ttttgtgtac taaatgtggg tcaaatgtgg gagaagggag ggaatactat 600 tcttgttctg tttcacccgg tgttactgtg ttgggttttc tgtgtttata gtttttagtt 660 ttaagtgtgt gtagtaatcc tgcgtagata atactcaggc cctctgtatc tgtacgtttg 720 tttacggtat ggttattgtt cgtgatatgg cacatctcta ggattggtag agctgtacca 780 cggttgtgtt ggtctaagat tcttactttt ttcgtgtcga aagaatgttg tttcgcgatg 840 ctgtgctcga gcaaagctgt tttctgcctc agctcggcta tcttgtagtc gtcagtggtg 900 tgtcccgcct gtagcggctc gtccaaccgt tttatgtcgc ttctgtggct cgacatcctc 960 gttttcagct gcgttgttgt catcccgatg tacgaactct cacattcgtt gcatgggata 1020 gcatagatga cgttggtgtg gttctccgtg ggcaccttgt ccttcatgtt gctgaagaag 1080 cttgagccca cggtgtcggt gttgcggaaa gctaaccgta cctccgggta gtcctgcttg 1140 agtgttttgg ctatagtttg tgacaaggtt gggatgtttg gtatggatcg gtagcagatg 1200 tttgtctgct ggtggctcgg ttcggcttgt tgcgtttgtt gttgttgttg ttggttgccg 1260 ttgctggaac ggttacatag ctggcagttg atgagtcgac gccgaagggg ttttgggtaa 1320 tgattgattt ctagctccct gtccacgatc ttcatttttt cccgagtgtt taagttggta 1380 gacagtttgt ctacacgttg tatgaagttc atagctgtat tgaccttctg gctcgtggtg 1440 tggcaggaga agaagtccag aaaacgaccg ctcgaaattg gctttcggta ccattccgtc 1500 cggatctgtt gggcttcggt gcggaccaat accatgtcca gatacggcag tctcctgttt 1560 tcttctagct cgtaggtgaa tcggatgcgt tggctgtagc tattgaacga gtcttggacg 1620 tgtttcaact tgttcagggg aagcgctacg atcaggtcgt cgacgtactt cttgagcact 1680 ggtggcgaaa gccacgagag agtttctgcg aacgcgcaaa gagcttgtcg tactaaaagc 1740 tgacgagggc aataaaaccg tgattatgaa cggcaacgac tacaagtcta agatgctagc 1800 tctactctcg gacgccaaga catacgccac ggccaatcgc gacccaacat caagtatcca 1860 gagcaagaac gacgagattg ttaagcgttt gaaacgacta gatgccataa acgatcggtt 1920 agagaatgag ctaatgtcca gaaaagcttt gtgcccaaga atctacgggc aacccaaagc 1980 tcacaaagct ggattaccac tacgaccagt agttccaaac atgacagcac ccacctacgc 2040 tctctcaaag ttcgttggac agatactgca gagctcactg caaagcacct acaacgtaag 2100 ggactcctac agcttctgtg actacatcaa cggcatcacc ctcccagctg actatgtgtt 2160 ggtatccctg gacgttgttt ctcttttcac gtgcattccc ttcgaacttg ttcgacgcga 2220 tgtcacgttc aactggagcg acatacagaa acacaccgac ataaacctgg aaatcatgtt 2280 ggaaatcatc gagttcatca tgaaatcatg ctacttcacg ttcgagggaa agttctattc 2340 acaaattttt ggaactgcca tgggtaaccc cctgtcgtcc ccactagccg atctcgtcat 2400 ggagaatctg atgcacgccg tcgtcaaaaa gctcgatttt caaccaccag tgctcaagaa 2460 gtacgtcgac gacctgatcg tagcgcttcc cctgaacaag ttgaaacacg tccaagactc 2520 attcaatagc tacagccaac acatccaatt cacctacgag ctagaagaaa acaggagact 2580 gccgtatctg gacatggtat tggtccgcac cgaagcccaa cagatccgga cggaatggta 2640 ccgaaagcca atttcgagtg gtcgttttct ggacttcttc tcctgccaca ccacgagcca 2700 gaaggtcaat acagctatga acttcataca acgtgtagac aaactgtcta ccaacttaaa 2760 cactcgggaa aaaatgaaga tcgtggacag ggagctagaa atcaatcatt acccaaaacc 2820 ccttcggcgt cgactcatca actgccagct atgtaaccgt tccagcaacg gcaaccaaca 2880 acaacaacaa caaacgcaac aagccgaacc gagccaccag cagacaaaca tctgctaccg 2940 atccatacca aacatcccaa ccttgtcaca aactatagcc aaaacactca agcaggacta 3000 cccggaggta cggttagctt tccgcaacac caacactgtg ggctcaagct tcttcagcaa 3060 catgaaggac aaggtgccca cggagaacca caccaacgtc atctatgcta tcccatgcaa 3120 cgaatgtgag agttcgtaca tcgggatgac aacaacgcag ctgaaaacga ggatgtcgag 3180 ccacagaagc gacataaaac ggttggacga gctgctacag gcgggacaca ccactgacga 3240 ctacaagata gccgagctga ggcagaaaac agctttgctc gagcacagca tcgcaaaaca 3300 acattctttc gacacgaaaa aagtaagaat cttagaccaa cacaaccgtg gtacagctct 3360 accaatccta gagatgtgcc atatcacgaa caataaccat accgtaaaca aacgtacaga 3420 tacagagggc ctgagtatta tctacgcagg attactacac acacttaaaa ctaaaaacta 3480 taaacacaga aaacccaaca cagtaacacc gggtgaaaca gaacaagaat agtattccct 3540 cccttctccc acatttgacc cacatttagt acacaaaatg acccaaattc tagccatgcc 3600 tttgacgctt tttggtgaac aaaccgccat taagctccgt tcgattgaaa ttgaccgttt 3660 cgtggatagt gcaacacaaa aagttttttc gaaaccgaaa ccagcagcta ccagcataat 3720 atgtgctgct atgcagtaag tgcccttaga accacttgaa acattgctca attttaacac 3780 cccccccccc cccccaaccc cccttttgtg tcaaaccctt aaaattacag ccaactaaga 3840 ctgaggaaca acaacaaaag acagttatca atttacccaa aaccccaaaa ataacaagaa 3900 aacaaaccgt ttttgtaagt aaccaacagt attttcccac ctaaaacaag cactaatagt 3960 taaaaacaaa aaaaaatatt ttttagaatc cttgaaaaag gcaataaatt tgccgaaacg 4020 tcggaaattg aaacaataca ttgttttcac tgctaaatga ctgaaagccg aaaaccaaat 4080 ccaaaattgt agagtaacag tcgagagcat aactact 4117 // ID Gypsy-52_CQ-I repbase; DNA; INV; 2141 BP. XX AC AAWU01016480; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-52_CQ_; KW Gypsy-52_CQ-LTR; Gypsy-52_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2141 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 483-483 (2011). XX DR GenBank; AAWU01016480; Positions 3854 1714. XX CC Positions [1552-2031] - Integrase core CC 'GGGGA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 241..2061 FT /product="Gypsy-52_CQ-I_1p" FT /translation="MENYHNHHSLPPFDVSDTTSIGSRWAKWKRSLDLFLE FT VNCVALGQRKRSYLLHFAGPEVQDIFYNIEGHDAPAPAGTDVYKEAIRLLD FT AHFAPMSSIPYERVIFRRMEQQDGETVEKFLHRLRDQGRLCEYGNALEMRI FT TEQVYDNCSSNALREAILKKKLMTVQDIAVEARILETVERTRGEIRKPTEE FT RKQEEQETEAAVNFVRKNKDDVCFRCGYTGHFASDRKCPARKQQCDRCQMV FT GHFRAMCKTKLVNKPKRSGRTSEIRQVRTRQTSSGTESDSSSDDDVQHVYA FT SSSSSGTGKVRCDLGGVRLDWFVDSGSQYNIISRTTWKKIKQQGAVFSRCE FT TLDKTLRVYGGGKLKVHKVIEANLSTQQQTVTHKIHVVDKQHGVNLLSRGT FT SVELGLLTIYVESSSGSRVDKDEQGKEVNEDRHERAEEGFGVARTGLPYDG FT ESAAGPWTRLDVGLIGPFGNGEHVLVLVDKDSRFVVAERMTRVIPKQIIRT FT LKQIFMRMGLPLVLFMEEKDEIIKDAIQACCDIYGISLMQGTSHSSEAEEA FT KQRIRSFYDNLEEGLELDVEVAMQEFLYNYSLAPHQATGKPPALLMFRRNL FT RNLTAHAKP" XX SQ Sequence 2141 BP; 638 A; 440 C; 619 G; 444 T; 0 other; ttttggcgac gaaagtggga ttaaatgatt aaattcgatt tggaaaaagt ttaaatacga 60 aaagtgtccg cgaaaaacaa agccgcgcgt gggagtgagt tgtaaaagac gaaataaacg 120 gaaagccagc agttcgaagg tcgccgaagg aaacataacc tcgaagggac tacgccccga 180 atgaaaaagt gaaatcgcct cagttgcgaa aacaaaaagt gcaaataagg aagtacaaga 240 atggagaatt accacaacca ccacagtttg ccgccgtttg acgtatccga cacaacttca 300 attggaagtc ggtgggcaaa atggaagcgg tcgctggacc tgtttttgga ggtcaactgt 360 gtggcgcttg gccagaggaa gcggtcatat ctgctgcact tcgctggtcc tgaagtacag 420 gacatcttct acaacatcga aggacatgat gctccagcgc ctgcagggac ggacgtgtac 480 aaagaggcca ttcgtttgct ggacgctcac tttgcgccga tgtcgagcat tccgtacgaa 540 agagtcatct tccgaagaat ggagcaacag gacggagaga cggtggaaaa gttccttcac 600 cgcttaaggg atcaaggccg actgtgcgag tacgggaacg cgttggaaat gcgtattacc 660 gagcaggttt acgacaactg cagttcgaac gcactgagag aagctattct gaagaagaag 720 ctcatgactg tccaagacat cgcggtagag gcacggattt tggaaacggt ggaacgaaca 780 cgaggagaaa ttcggaaacc aacggaagaa cggaagcagg aggaacaaga aacagaagcc 840 gcagtgaatt ttgtccgcaa gaacaaagat gacgtttgtt ttcggtgtgg ttacactggc 900 cactttgcca gtgacaggaa gtgtccagcg aggaagcagc aatgtgacag atgccagatg 960 gtaggtcact ttagagcgat gtgcaagacc aagttggtaa acaaaccgaa gcgttccgga 1020 aggacaagtg aaatccgtca agtgagaaca cgacaaacga gctcgggaac ggaatctgac 1080 agttcttcgg atgatgacgt gcagcatgtc tatgcttcga gttcgagcag cggaacggga 1140 aaagtgcggt gcgatctcgg tggcgtcagg ctggattggt ttgtggactc aggttcacag 1200 tacaacatca ttagtcggac gacgtggaag aagattaaac aacaaggagc tgtcttcagc 1260 cggtgtgaga cgttggacaa aacgctgcgg gtgtacggag gtggaaagct gaaggtgcat 1320 aaagtgattg aagcaaactt gtctacgcaa cagcaaacag tgacgcacaa gatccatgtg 1380 gtggacaagc agcacggtgt taatctgctg agtcgaggta cttctgtcga acttggcctc 1440 ctgacgatct acgttgaatc atccagtgga agtagggtgg acaaagatga gcaaggcaaa 1500 gaggtgaacg aggaccgaca cgagagagct gaggaaggat tcggcgtcgc ccgtacaggg 1560 ctgccatatg atggtgaaag tgccgctggt ccatggactc gtcttgacgt cggcttgatt 1620 ggaccattcg gaaacggcga acatgttctt gttctagtcg ataaggatag tcgctttgtg 1680 gtcgcagaga gaatgacacg agtgatcccg aagcagatta tacgaacttt gaagcagatt 1740 tttatgcgaa tgggacttcc gcttgttttg ttcatggagg agaaagacga gattatcaaa 1800 gatgcaattc aagcctgctg cgacatatac ggaataagcc tcatgcaagg aacttcgcac 1860 tcatccgaag ccgaggaggc gaagcaacgc atacgttcgt tctacgacaa tctagaggag 1920 ggactggaac tagatgtgga ggtggcaatg caggagtttc tctataatta ttcactggcg 1980 ccgcatcagg caaccggaaa gcctccagcc ttgctgatgt tccgtcgtaa tttgaggaat 2040 ctgactgcac acgcgaaacc gtaggagcac gaataacagg gcgaacgagt tcttacctaa 2100 atccgaattg ttctacactt aatttgcgaa aaggggagat a 2141 // ID Gypsy-18_CQ-LTR repbase; DNA; INV; 174 BP. XX AC AAWU01022777; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_CQ_; KW Gypsy-18_CQ-I; Gypsy-18_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-174 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 416-416 (2011). XX DR Genome; AAWU01022777; Positions 31717 31890. XX SQ Sequence 174 BP; 61 A; 34 C; 34 G; 45 T; 0 other; tgagaggtaa gattaagaga ggatttctag aagagaaacc tcactctcct aagctttgtt 60 catactccaa agttaccaat aaaccagtct aagcttgaac tcgaaacaag aacgacgtgt 120 tatttaatat ccgaaagagt cttacaacca cgtgtctaga ttgaggtcgt agca 174 // ID Gypsy-24_DPu-LTR repbase; DNA; INV; 167 BP. XX AC scaffold_38; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_DP_; KW Gypsy-24_DPu-I; Gypsy-24_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-167 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_38; Positions 564062 563896. XX SQ Sequence 167 BP; 35 A; 57 C; 30 G; 45 T; 0 other; tgtgatgata ccaccatctg gcaacacggc acccgtcggc tccatatata ggagaggact 60 ccctccccgc tatcctcagt ctctgtctgc tctactcaga gcgttggtcg tcccaatata 120 ctgaatcgag ccgcaatctc tctctctctc tattcgactt catcaca 167 // ID Gypsy-18-LTR_NVi repbase; DNA; INV; 847 BP. XX AC . XX DT 15-APR-2009 (Rel. 14.04, Created) DT 15-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-18-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-847 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 774-774 (2009). XX DR [1] (Consensus) XX SQ Sequence 847 BP; 263 A; 179 C; 164 G; 241 T; 0 other; tgttaagtga cgaagttcct ggataaatta tttagaaaca ttatataaaa attacacaaa 60 gccagtaaaa aagaagatcc agaaacttcg acgcttaact aagcctcgaa ggaatgtacc 120 cgaaaccaaa acaaacttct cctccctcgg ctgaagcgcg cgtgcacagc cgacgctgca 180 gtttagccgt tgaccatagt attaggccca taaggcctaa tgagtaagac aaagagcgcg 240 gctaggctga agtaagaaga gaggagaagt aagtaccccg cgtgggagaa agcttcgact 300 ctctctctca agcatcacct cccccgaaac gccactccag ggtgtgcgct cctagagaaa 360 ggacgctaat tttaagagtt tagattttga gtaatttgcc tttgagaaat tcaataagtt 420 ttgtttcatt tgagtgtctg atttaatttg tgttttgata agttgataat aaacatcact 480 gcaatccatg tgttgagttt tggaaaataa gtcattcaga atacaaccaa ttttgagcca 540 tcctccgttt tgctaaaaca tcttgccaaa aaggagatct aaaccccgca atcttttccg 600 agcagtcatc ccagtttttc tgagcatcct ttcgggttaa attatttctc aaattccaga 660 agtccacaga acaaagtaag tagcatttaa ttcaatattt ctgaactttg acatttcggt 720 taaatcaaac agttctgcct cgacaaatta gtgttctcac tgatctcccc cgtatgggag 780 agtagctaag cgcccacgtt ttaataatat tttattgtta gctggtggaa tcggttctat 840 cctaaca 847 // ID Gypsy-2_DPu-LTR repbase; DNA; INV; 117 BP. XX AC scaffold_44; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DPu_; KW Gypsy-2_DPu-LTR; Gypsy-2_DPu-I. XX NM Gypsy-2_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-117 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 720-720 (2010). XX DR Genome; scaffold_44; Positions 589744 589860. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 117 BP; 25 A; 42 C; 21 G; 29 T; 0 other; tgtgatgacc tggcaatgtc gccccccccc cccccgaact cagctaggcc agttggtccc 60 agtacacgga cgagaataca tctgttactc tcatccgact tatttcactt cattcca 117 // ID Gypsy4-SM_LTR repbase; DNA; INV; 811 BP. XX AC Contig1646; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4-SM_LTR; KW Interspersed repeat; LG_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-811 RA Xu Z. and Wang H.; RT "LTR_FINDER: an efficient tool for the prediction of full-length RT LTR retrotransposons."; RL Nucleic Acids Res 35(Web Server issue), W265-W268 (2007). XX RN [2] RP 1-811 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 752-752 (2007). XX DR Genome; Contig1646; Positions 59781 58971. XX SQ Sequence 811 BP; 338 A; 95 C; 166 G; 212 T; 0 other; tgttggggta ttacgaaaaa agcacagtaa aatcaataaa acaccaaaat gataaaagtg 60 caacattaat taaaacattt acaaaaattc gaaagtagta ttgatcggga gaaattggta 120 tggttgaacg ccaaattgag agaagagtcc agaaatcata ataaaaattt agtgatggat 180 ccctagaaaa tattataaga caaaaaaaga aattaattaa aaaaataatt actaatggga 240 aacccgaggc aatgggaaaa cccggcgtgt agacggaaag atataaaaaa gtaaattggc 300 ctaaacaata ttaaatgggc gacactgatc tatggagagg taattatttc ctaatttgta 360 atttagtaat ggagcaaacg ttgttgacaa atgtcggcta aggtgatctg tgggaagtaa 420 ttatttttaa tagtttaata atgtagcaaa tgttgttgac aatcgtctga aaacagtctg 480 agcaggagta taaaaggaaa aggaaaatca agaaatggcc agtggtcgaa gcaaacgaga 540 aagaatctaa actcaagtaa agtcaaagtt atctatcaga aaggttgatt aatctattgg 600 ataacctacc gtaagataat caaatttata cggtggttga agggtgcaaa taaataagta 660 aaattgtaat tacaaacatt gaaataaatc ggattgcgtt ttaactgtga taaagtgcaa 720 ataaaattat tctgtgttgg gtttgcttat ttgattcgaa aaggacacga ggagtcagag 780 ggaaccgact ttcaataata ataacccaac a 811 // ID Gypsy24-LTR_Dpse repbase; DNA; INV; 1263 BP. XX AC Unknown_group_154; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy24_Dpse; KW Gypsy24-I_Dpse; Gypsy24-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1263 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1127-1127 (2009). XX DR Genome; Unknown_group_154; Positions 14600 13338. XX SQ Sequence 1263 BP; 321 A; 255 C; 356 G; 331 T; 0 other; tgttgtgttg aatgtgtttg aagtttgagt gtcttaggtg tcgatccaat gtacatgtat 60 gtgaatattt atttatttat ttatttattc aaatatttaa ttgcatattg atcgttaacg 120 gagctagttg gcgcaatatt tatatccgtc tttatttatt tatttatcgg agtatttatt 180 ggtccgaagc gtggtgttat ttattcttac tagttatttc tttgggtatt tatttcttta 240 tttatatata ttatcttagt ttaggagtat cttttttttt tttgtaatat tggtgttgct 300 cggtgcagat gccacaagaa gggggtagtc gacagtatcc tacgcgatca cgcgtaaaca 360 aaacagtgga ccagatagga gaaaggtcag gggcgatgtt tgatcaagct cgggaggaac 420 ttggagctgt gggtgggttg ccaagaacac cggtccaggc caggtcagaa gaatggccaa 480 tgtccgatcc tacacccacg cttgcttccg aactggcatc agcagtcaac gtatcgttgg 540 cccaggctca tgcgacacac agggcagcga acactgactc gattagtcgg gtgttgcagg 600 aagagttgcg taaggggttt atcgagatga tgcaccaact caacgaggta ctccagccgg 660 ttagacagct gcagcagcag aaggaaccgc agcgggagga ggttcaccac gagcaagagt 720 atcgaggagc aatcccgaaa aggaagccgt ctagcacgac tgatgcgccg attccgccac 780 caaaaccatt tttgtctcgc tcggggcgag gtggtgttgg tccggaaccg tcttcaggag 840 taggacagac cgcatccagt gagcagcatc aacggggtgg gcatcaagcg cgaaattggc 900 gaaacccaat agcacacaga cactcgggtc agggtcgtgg cgagggtagg agaggtggtg 960 gaattcctcg agaagatccg catccaaatc ccccgggtaa tagaccttgg ccacggtggg 1020 gccgagtcga aaaatgggac gtgacgttcg acggagacag caacaaaatg acagtcgaag 1080 attttgtgtt ccgaattgaa ttcctacaag cccaatgccg ctgtccgtgg gatgaagttc 1140 taaggggatt tcatcacctg gttacaggaa acgcgcgaga atggtactgg cagtacatcc 1200 gtgatcatgg cggtggtgtt tggcaggaac taaggggcga cttaattgct cgtttccgag 1260 gca 1263 // ID Gypsy14-NVi_LTR repbase; DNA; INV; 968 BP. XX AC NW_001820566; XX DT 20-DEC-2007 (Rel. 12.12, Created) DT 20-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14-NV; KW Gypsy14-NVi_I; Gypsy14-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-968 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 7(12), 1207-1207 (2007). XX DR Genome; NW_001820566; Positions 130032 130999. XX SQ Sequence 968 BP; 322 A; 229 C; 201 G; 216 T; 0 other; tgctcactaa atacatagtc cttactccaa ctcggcacgc caacgcgaaa gaagtagcgc 60 gagtattgac tgaaaaggta atatgcgtgt ttgggccgcc agccgccatc gtaactgatc 120 agggaaccca ttttcaaaat aaagtattgg aaaaacttgc tgcaattttc ggtatagaga 180 aattcagcac gacggcatac catccgcagt caaatgggtc catagagcgg atgcaccaca 240 ctctaaccga atacttgcgc aaatacgtaa agaaaatcaa caagtgggat gaatggactg 300 cattgtgcca gcacgcatac aactcaaccg agcatgaaag tactcgctac tcaccgcacg 360 agttgctgtt tggatttaag ccgcgtactc catcgagctt tcctcgtgct gcaaacgtat 420 tgtcctacaa cgaatatata gataatatga cgtcgaattt gaccttactc caaacaacgg 480 cggcgatgaa cctcgttcag tcaaagtaca ggtcaaaaca ttattacgac ggaaagctaa 540 acatgaaaca tttccgggag ggtgagttgg tattcttgtt aaaagaaccc agaaagggta 600 aatttgccac cgaataccaa ggccccttcg aggtaatcaa aatcaaccga gcaacgaaca 660 acgtaaaaat ccaaaacggt gaaatagtaa aaaccgtaca cattaataaa attcaccggc 720 cgagtgaact tgcaactagg tcggaaagct tcgacgaaca gcccgaggaa tagtcgtttt 780 tttaccgttt ttttttgcgc ggcaccgggc gtgcgcggta ctacgcaaaa attatttaaa 840 cacctaaaca aaaaaaaaac ggtgaagcca tcaaaacagc ccagtgagcc aagcggtaca 900 tttatttttc caactaaaat cggtcaaaaa catcgttcgt ttttcagagc gatcagtacc 960 cgcgccca 968 // ID CR1-122_AAe repbase; DNA; INV; 4736 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-122_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4736 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1210-1210 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 5 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 361..1224 FT /product="CR1-122_AAe_1p" FT /translation="MPMVCTSCSDEIKADHIKITCQGFCKAAFHGNCSGLS FT VHAIDEAAKHSQLFYLCNPCTKLMSDLHLRSTIRSAYEAGQEKVLSAHNEI FT VENLKQEIMTELKKEIRTSFATLANSSSRTPISSKRIASNAIPSRRLFSKS FT QTHVFQKQPMDCGTAESISPSLGIGVVSMPASQPKFWLYLSRISREVTTEQ FT VCEMAKKRLGSEDITTIRLVATGRDINTLSFISFKIGIDCNLKAKALCSST FT WPKGILFREFKDNKSNANFWKPQQTPNVSPTATTSSSTSSSVEAMAE" FT CDS 1110..4631 FT /product="CR1-122_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="VQGQQIKRKFLETPANTERLTYGDNVQLNILICGGDG FT RIDAVSPRRTALSSLETSSQFDTVPPFAASAHLSRRGPDVKSVEEVLQPAI FT SGKYFHQLELSASDITPAFCQKPSGSVCLTDFVSSSSRDQTSPKLGCTLYG FT TMEAPKPPSSVEPVEKFNLHQHSCLQSHPGPESRVGNGVFQTAPIGKYPTL FT NNCASPHDILIRSPIPMILEPANSDVLMYYQNVGGMNSTLVDYKLACSDQC FT YDIYAFTETWLNENTPSNLLFDDTYSVYRQDRSPSNSNKRSGGGVLLAIRS FT CFKSRQMFIPEINVIEQLWVSVSLSEAKTFICVVYFPPDRVNDVNLIEKHV FT ESINWIIGQMSPNDNIIIVGDFNLRFVNWNWNAQGFYYPDVAHSSGTVSSG FT CLLDGYSTACLRQLNGVKNDNGRTLDLCFVSDEMCAHCNITRAPSMLVKVC FT RHHPPLQLSIKTNTKFAFEDAEVSTLFYDFGSADFDSMNSFLMNVHWAEVL FT RDCDADRSASILSNILLYAIDQYVPKKLNRVIAHPAWSNSHLKRLKREKRK FT ALRKFSKHKTPSSRARYLRANQHYKRLNGKLFLAHQERLQRRLKSNPKSFW FT RYVSDQRKETGLPRSMTNGMLEASSTEEIAELFLVQFSNVFSNPSTSSRDI FT MDAAASVPVYSAVGPHHLITDEMVLSAAKRLKKSTGCGPDGIPSVVLKQCI FT NAISSPLTLIFNSSLETGLFPSCWKESFVFPVHKKGCKRNVSNYRGIASLC FT ATSKLFELIVLDFLSSKCSHYISSDQHGFVAKRSTNTNLVTYTSFITRQME FT NGLQVDAIYTDLSAAFDKMDHEIALAKFDRLGISGSLLTWLRSYLTGRTMC FT VKIGDHRSASFRVTSGVPQGSHLGPFLFLLYMNDINFILKCQKKSYADDLK FT LYCVIXKPSDALFLQQQLEVFANWCEKNRMVLNASKCSVISFGRKRSMYLH FT EYTLXGAAVKRENAVKDLGVILDSKLTYKEHVSYIVSKASSQLGFLFRFGK FT HFQDIYCLKALYCALVRSTLEYSAVVWAPFYQNGMRRIEGVQRKFVRYALR FT NLPWNDRNNLPSYVDRCKLIHLDLLSVRRDVAKACFVSDLLNSNVDCPELL FT SQLNVNIRRRALRSNDFLQIDRSRTNYGMNDPVKSMCRVFVACFESFEFGL FT SRNRLRQNFLSVLSV" XX SQ Sequence 4736 BP; 1308 A; 1083 C; 964 G; 1377 T; 4 other; tttggcaaca ctgttgttgt tatttagctg taatttttgt cactcctaaa ttactgtttt 60 tatccgtgta tccgtgacaa ttattcgttg tattaattgt tatctgtgta ttgacactcg 120 tattgctcag tgcaactgaa attcaataac attagactct acaacttgtt caattcgtgc 180 atttatttct tgttgagttg tgataaaatt tgtttcgtcg catcatcaac cgcatttgcg 240 tacaagaacc agtccagcac aacaatatct tccctgcaag ttttggacct cgacgctgca 300 tacctgcagg cgcttcatct ttcctggatc acatcgtgtt gatacttcgt cacaagcaaa 360 atgcctatgg tctgcactag ttgctccgat gaaatcaaag ctgatcatat taagatcact 420 tgccaaggat tttgtaaggc agcgttccat ggaaactgta gtggtctatc tgtacatgct 480 attgatgaag ctgccaaaca cagccaacta ttctatctct gcaacccttg taccaaattg 540 atgagtgatt tgcatcttcg tagcacgatt cgcagtgcgt acgaggccgg ccaagaaaag 600 gtgttaagtg cacataatga gatcgtcgaa aatttgaagc aggaaattat gactgaactg 660 aaaaaggaaa tacgtactag ttttgcgacw ctagctaatt cttcatctcg tacaccaata 720 tcatcgaaaa ggattgcctc aaacgcaatt cctagtcgcc gcctatttag caaatctcaa 780 acgcatgtat ttcagaaaca acctatggat tgcggaaccg ctgaatcaat atccccttca 840 cttggaatag gtgttgtatc tatgcccgcg agtcaaccga aattctggct ttacctctca 900 cgtatctctc gtgaagtaac gactgaacaa gtgtgcgaaa tggcaaaaaa acgtcttggg 960 tctgaggaca tcacaactat tcggctagtt gccactggtc gtgacatcaa cacgctatca 1020 ttcatctcgt tcaaaattgg tatcgactgc aacttgaagg ccaaggctct ctgctcttca 1080 acatggccaa aaggcattct ttttcgtgag ttcaaggaca acaaatcaaa cgcaaatttt 1140 tggaaacccc agcaaacacc gaacgtctca cctacggcga caacgtccag ctcaacatcc 1200 tcatctgtgg aggcgatggc agaatagatg cagtatcacc acgacgcacc gcactaagca 1260 gtttggagac ctcttcacaa tttgacacag tcccgccttt tgctgccagc gctcatctta 1320 gtcgtcgtgg tcctgatgtc aaatctgtcg aagaggttct ccagcctgcc atttcaggca 1380 agtattttca tcaattagaa ctctcagctt ctgatattac tcccgctttc tgtcagaaac 1440 catctggatc tgtatgcctc accgattttg tttcatcgag tagcagagac caaacttcgc 1500 cgaagctggg atgcacgtta tatggtacta tggaagcccc taagccccct agttcagtcg 1560 agcccgtaga aaaattcaac ctccatcagc attcctgcct tcaaagtcat cccggtcctg 1620 aatctagggt cggcaatggg gtcttccaaa ccgctcccat cggcaagtac ccgacgctga 1680 acaattgtgc ctctcctcat gacattctca ttcgcagccc tataccgatg attttggaac 1740 ccgctaattc ggacgtattg atgtattatc aaaatgtcgg tggcatgaat agcactctgg 1800 ttgactacaa actagcgtgt tccgatcaat gctacgacat ttacgctttt acggagacct 1860 ggctaaacga aaatacgccc tccaacctat tgttcgacga cacttatagc gtgtaccggc 1920 aagatcgttc tccgtccaac agcaataaac gatccggtgg cggagttctg ctagccatcc 1980 gttcctgctt caagtcacgc cagatgttca ttccggaaat taatgtcatc gagcagctat 2040 gggtctcagt gtcactctcc gaagcaaaaa ctttcatttg cgtcgtctat tttccacccg 2100 atcgtgtcaa cgacgttaac ctcatagaaa aacatgtcga atcgatwaat tggatcatag 2160 gtcagatgtc cccaaatgac aacatcatca ttgtcggcga tttcaatctt cgttttgtta 2220 actggaattg gaatgcccaa ggattttatt atcccgatgt cgcccattct agtggaacgg 2280 tttcgtctgg atgcctgctg gatggctaca gtactgcttg tctcagacaa ctgaatggtg 2340 ttaaaaacga caacggacgc acgttggatt tatgcttcgt tagcgatgaa atgtgcgctc 2400 attgcaatat cacacgagct ccatctatgt tggttaaggt gtgtaggcat caccctcctc 2460 tacaactttc gataaagacc aatactaaat ttgccttcga ggatgcggaa gttagtactt 2520 tattctatga tttcggtagt gccgattttg atagtatgaa ctcgtttcta atgaatgtac 2580 attgggctga agtacttcgc gattgtgatg ccgatcgatc tgcttcaata cttagcaata 2640 ttttactcta cgccattgac cagtatgtac caaaaaagct taaccgagta atagctcacc 2700 ctgcctggtc gaactcgcac cttaaacgtc ttaaaaggga aaaacggaaa gctttgagaa 2760 aattcagcaa acataaaact ccatcatcca gggcacggta tttacgagca aatcaacatt 2820 ataagcgatt gaatgggaaa ctatttcttg ctcaccaaga acgcttacag agacgcttga 2880 aatcaaaccc taaaagtttc tggcggtatg ttagcgatca acgcaaggag actggactcc 2940 cccgttcaat gaccaacgga atgttggaag cctcttccac tgaagaaatt gccgaactct 3000 tccttgtgca attcagcaac gttttttcca atccatcgac tagtagccga gacataatgg 3060 atgcagctgc tagcgtacca gtttattcgg ctgtaggacc tcatcacttg ataactgatg 3120 agatggtttt gtctgctgcg aagcgcttaa aaaaatcaac cggctgtggc cccgacggta 3180 ttccatctgt cgtcctcaaa caatgtatta atgcaatatc ttctcctttg acgttgattt 3240 tcaactcatc gctggaaact ggattgttcc ccagctgttg gaaagaatca ttcgtttttc 3300 cagttcataa aaaaggttgt aagcgcaatg tgtcaaacta tcgtggaatt gcctccttat 3360 gtgctacttc taaactattc gaactgatag tcttggactt cttatcatca aaatgctcgc 3420 attacatttc atcagatcag cacgggtttg tggctaaacg ctcgacaaat actaatcttg 3480 tcacttacac atctttcatc acacggcaaa tggaaaacgg cttacaagtt gatgcaatat 3540 acacggatct gtcagcagct ttcgacaaaa tggaccacga aatcgcctta gccaagtttg 3600 atagactggg aataagtggt agtctgctca cctggttgcg atcgtatcta acaggacgca 3660 ctatgtgcgt aaagattgga gatcatcgtt ccgcatcgtt tcgtgttaca tccggagtac 3720 cgcaaggcag tcatcttggg ccgtttctct tcctgctcta tatgaacgac attaatttca 3780 ttttaaagtg tcaaaaaaaa tcttacgcag atgacctcaa actgtactgc gtaattwcaa 3840 aaccaagtga cgccctattt ctgcaacagc aactagaagt gtttgcaaac tggtgcgaaa 3900 aaaacagaat ggttctcaat gcttccaaat gctctgttat ttcgtttggc cgtaaacgtt 3960 caatgtatct ccacgaatat actttgkccg gtgcagctgt aaagcgtgaa aatgctgtga 4020 aggatctcgg agtaatcttg gactccaagc tgacgtataa agagcatgta tcctacatcg 4080 tatcgaaggc gtcatctcaa ctaggctttc tatttcgttt tggcaaacac ttccaggaca 4140 tttattgttt gaaagccttg tattgtgcct tggtgcgttc tacgttggag tactcggccg 4200 tagtgtgggc accgttctac caaaacggta tgcgcagaat agagggtgtg cagcggaaat 4260 ttgtacgtta cgcgcttcgt aatttaccgt ggaacgatcg taataatcta cccagctacg 4320 ttgaccgctg taaattgata caccttgacc tcttgagtgt tagacgtgac gtagctaaag 4380 cctgcttcgt cagtgacctc cttaattcga acgttgattg tccggagctt ctgagtcaac 4440 tgaacgtgaa tattagaaga cgagcactac gttcgaatga ttttctgcaa atcgacagat 4500 cgaggacaaa ctacggcatg aatgatcctg tgaaaagtat gtgtcgtgtt ttcgttgcct 4560 gtttcgagag ctttgaattt ggtttaagta gaaatagatt aagacaaaat tttttgagtg 4620 tgctttctgt gtaatgttag atttttgtgt tttataatct aagttttgtc atttgggagt 4680 ttataacctg ttgacgtaca aaaatataaa taaatataaa taaataaata taaaaa 4736 // ID Chapaev-9_HM repbase; DNA; INV; 2805 BP. XX AC . XX DT 15-MAR-2008 (Rel. 13.03, Created) DT 15-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2805 RA Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(3), 176-176 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1066..2526 FT /product="Chapaev-9_HM_1p" FT /translation="MERENIVSGEQFVLSTGGNLLSVTVGVNENKSKRKKV FT NQVSFQTIMELSNVLELSKNKTKKLCSTLRSNLTGVVESNINIKMTELQDT FT LETLYECKTEEFLDGDEIVVRDIVYVKNTTEFIKFIIDERGIDTPNAIARI FT SIDGGQNFLKVIINVFDPKNHYSSSEMYEDSGVKRCFILAIVEMVSEDNGN FT LQKLLEPLKLEEVDFSLAFDLKCANSVFGLSSHSGKYACLYCEGECSLKAG FT KLRTLGSIDLCYDQYVCEGKKRLKMQDYKNVINPRLIYLKENQETILEHIV FT PPPELHIMMGVVDKLCTMLLCVWPPFQNWLKTHYILMRGYHGVGLDGNNAN FT KFMSLLDVLERDVTLTATIDILPIINCLRKFSLVKSAVFGLEMGADISAKI FT EDFKCSFSSLQLYAKDIFDYDLKVSWKIHILVCHILPFIEHNNISLGNYAE FT QCGEAVHHKFKKTWVNYKRTIGHKNYGKKLKSAVVNFNLNNQ" XX SQ Sequence 2805 BP; 976 A; 360 C; 476 G; 993 T; 0 other; actgttctcc aaattgaccg ccaggttgaa actctcccca ctttaaattt ttttgcattt 60 aattaatcat atgcattagc aaacacaatt taagttgtca gaaaaatctg aactaagtgc 120 taagtatgta tatttttaaa gttaaaactt ataatttttt tctttttttt ttcgtaagtt 180 taattaaagt cagccattat ggtagtcaaa taagctattt atggcaatga tataattgaa 240 ttacggatat aactttcact tttcaacatg gcgtcagact tcaaattatg tacttgtgca 300 gtttgcttga taaataaagg tatcagatca aaatctttaa cacctataac accttcagtg 360 cgcgaaaaaa ttcaagaata catctggcca aacttcgatt ctgatcttga aatttgtcct 420 tctgtagttt gttccaactg cagaagaaac ttatttagtc taaacaaagg ggagactgca 480 tatctttcca actggttaga gactatatca aaggtacttt aatgtggatg aattgctact 540 ttttgtgttt gtgtggttgt ctatatatat tataagtata cacacacact atataataaa 600 aatcatgtag tttttatgaa taaatatatt agcactgatt taatttataa agcacatttg 660 tattttagat aaatagaaat actattagga gaacctcttc actgcaatgt atccagacag 720 atgttgagcc agaattgaat ttttcaagtg tgaataagaa aatctgtggg ctctgctatt 780 tgaatcttgg tatgttaata ctatgatttt ataatttaaa taagattaat tgaagtttga 840 ttctctttat gttacattat aatgtttgat tcaatgtatt attgtttttt acgtaatttc 900 aagctactgt taaaatagtt ttttaatttc tcactattac ttttaggtag aggaatttct 960 catgattgtt gcaaaagtaa agctgttaac aatataattg atattagtga atcgcttgga 1020 tataaaggtg cagagcaagt agcttctgga ttgttaaaac gtaaaatgga acgtgaaaac 1080 attgtaagtg gagagcagtt cgtgttgtct actggtggaa atcttctctc tgttacagtt 1140 ggagtaaatg aaaataagtc gaaaagaaag aaagtaaatc aagtttcatt ccaaactatt 1200 atggagttat cgaatgtatt agaactctct aagaacaaaa ctaaaaaact gtgttctact 1260 ttacgcagta atctaactgg tgttgtagaa tcaaacatta atattaaaat gactgaatta 1320 caggatactc tagaaacttt atatgaatgt aaaactgaag aattccttga cggtgatgaa 1380 atagttgtta gagatatagt ttatgtcaag aatacaactg aatttatcaa atttataatt 1440 gatgaaagag gtattgatac ccctaatgca atagcaagaa tatctataga tggtggtcaa 1500 aacttcctta aagttattat taatgttttt gatccaaaaa atcattattc ttcatccgaa 1560 atgtatgaag attctggtgt gaaacgttgt tttatccttg cgattgtgga gatggtttca 1620 gaggataatg gcaatctcca aaaattattg gaacctttga aactcgaaga agtagatttt 1680 agtttagcat ttgatttaaa atgtgcaaac agtgtttttg gactttcaag tcattctggc 1740 aaatatgctt gtctttattg tgaaggagaa tgctccctca aagcagggaa actaagaaca 1800 ttgggttcca tagacttatg ttatgatcaa tatgtatgtg aaggaaaaaa aagactaaaa 1860 atgcaggatt acaaaaatgt tataaatcct cgtttaatat acttaaaaga aaatcaagaa 1920 acaattcttg aacacattgt tcctcctcct gagctccata ttatgatggg agttgtggat 1980 aaactttgta ctatgctttt atgcgtttgg ccaccttttc aaaattggtt aaaaacacat 2040 tacatattaa tgagaggata ccatggagta ggtttagatg gaaacaatgc taataaattt 2100 atgtccttgt tagatgtttt ggagagggat gttaccttaa ctgcaacgat tgatatttta 2160 ccaataatta attgtttgcg taagttttca ttagtaaaat cagcagtgtt tggtttagaa 2220 atgggtgcag atatttctgc taaaatcgaa gacttcaaat gttctttttc aagtctccaa 2280 ttatatgcaa aagatatttt tgactatgat ttgaaagttt cctggaaaat acatatttta 2340 gtgtgtcata ttcttccctt tatagaacac aataatatta gcttaggaaa ctatgcagaa 2400 caatgtggtg aagctgttca ccacaaattt aaaaaaactt gggtgaacta taaaagaact 2460 ataggtcaca aaaattatgg aaaaaagcta aaatctgctg tagttaattt taatttaaat 2520 aatcagtagc attaatatca aatatgtgtt atatatttat ttgttatatg tgttttttgt 2580 aaacgaagtt aataaatgtt gatggtttat tttgaccacc ataatggctg actttaatca 2640 aaaagcgaaa aattataagt tataattttg aaaaatatgt acttagcact tagttcagat 2700 ttttctgaca acttaaattg tgtttactaa tgcatatgat taattaaatg caaaaaaatc 2760 aggagtgggg agagttttaa cctggcggtc aatttggaga acagt 2805 // ID BEL-119_AA-LTR repbase; DNA; INV; 629 BP. XX AC supercont1.255; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-119_AA_; KW BEL-119_AA-I; BEL-119_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-629 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.255; Positions 1243887 1243259. XX SQ Sequence 629 BP; 196 A; 125 C; 120 G; 188 T; 0 other; tgttctggca acacggatgg cactcgtcag cgcggggtgc agaatcctac tcgtcgcaca 60 gaagactgcg aagccaacaa tgaattgcca tagtacaata gctgttgtta aaagccacga 120 tgtgctagcg tgaaaaacct taagttagat acaaattact atttttctaa tcagttaatc 180 taattaagtg aatttgatcc atttaaattt gtttagtaca aagagtggag cttatgtgaa 240 ttggattctc agcttttctg tgtaacaggt aaagataagg ctcacatatg ctaacaggct 300 atatctcatt tctaattatg ccattctaat atgtagagtt cattcagcgt ctcgttaaga 360 gtaccaccgt gcgcataaaa ttcatgctac taggtatgtt aaaaccctga ttcgttgaat 420 tatagttata attagttacc ggtactctca ggcggcggca acaagcgctt tcagccccaa 480 cttcccgaag agaaacgcat tggtataggc aagactaaat gtacatgatg ttacatgact 540 ttttcaacag caattataat aaaattatat ttttagcttt gagctgcaca cataaactgc 600 tgctagaaga cccttcgcct atccgaaca 629 // ID hATm-47_HM repbase; DNA; INV; 3841 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-47_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3841 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1941-1941 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 723..3044 FT /product="hATm-47_HM_1p" FT /translation="MLKQFKFYIFCRRCHLPVKEISVVLADLVLKQWSQSN FT ALFKTPVVISQKTLSDKILTAWHKFRDISNQKETKEKTVKLWEGKLDKLLD FT ITKCRCNILLCTDKNSPCKEVKNCSAGVHILCICSKAIKLPQLDLLWLRSQ FT RDKVGEKSGYQIYQLDKEETERQVKAVKRKITDIASLEKRQNLIKKPHINT FT QVQIENIEDKVYESVGHDLNQETSPEKIKNVNEESSEDSSNNINQKTSPFI FT NTKQNYMKIPNTALASLRFDISPAATSAIVSAFLQDLIKSNYLXPKMASLA FT CDPKKIWRARQEVMAKTRVLSEKLIIEAPIKGIFVDGRKDPTLILIEDSIT FT KTFCRRTEKQENISVTSEPDGSYLTHYTPSKGPNKPAKEAAIGLYNWMVPR FT GIDQTIVVIGSDSTNSMTGSGNNGGLLTHLEKLIGRKCFWSICMLHTNELP FT LRHLIANLDGPTNSKDGFIGPIGKLLSKVNQLKRLEKFEPIRQIEPLIKIP FT DDILKNMSTDASLSYKLVSCLESGILNPSLVNRKCGNLCHSRWLTTGQAIL FT WLYISEHGLEGDTLKNLKVLSQFVAQVYFHMWFRIKVKHSIVDGPHHLIKL FT LYLLRSQTVEVINAVSKSVQNGAFHAHSESLLISLLASSSIENRAFAVKMI FT MKVRGDLEQGNVSVRNRKKPTLNFDAMTLLELIDWSNEQILEPNFTCNMTK FT KDLQKVIDSPMEVPYYPLHTQSCERVVKQVTEAAAAVCGFQRRDGFIRARI FT EHRDVVPNLKSKKDLIKLFSFD*" XX SQ Sequence 3841 BP; 1393 A; 554 C; 601 G; 1272 T; 21 other; tagggtacgt catactaaaa awtttttgaa aattacattt ggggagtgat tttattatgg 60 gttggggtaa gctaagcaat attctaaaar ttaacttgaa aaatgtaaaa aagcgctaaa 120 aaatacattt ttacgacaaa aaattaacaa aactttacta acgtatatta aaattttaat 180 yatccaattt ataacaagtc catttgtakt tgtaatctat aaattataga acacagttay 240 tttgtacttg ttatttgctt tgmgttagta gtgaatttaa ttgtgtatat ttcaatawta 300 aatatttgca atatttgttg attttacttg tgatttgtat tgtttttgta ttacagtaaa 360 aaaaaaaaaa aaaaaaaata gaatattgtg ttatggagag tttaaaattg actagacatt 420 cttctacaac taatttaact aagtatcttg gagtaggaaa agattggttg gaaactgaag 480 ttcctacact gagagatgta ctaaggaaag gtctacwtat tcagaaaagt ttgcttcttt 540 ctgatgatct tgataggtga gttttttgaa atgatattga aaagtgaatt tctaataaac 600 ttttagtagg cacaggacgt ccaataacat ccatagaaca tccatctgga cacccgctgg 660 gcgtcttagg acatttaaag gacgtccaag tattagttca agttcaagta tttcaaacaa 720 caatgttaaa acagtttaaa ttttatattt tttgtaggag atgtcaccta cctgtaaaag 780 aaatctcggt ggtattggca gacttggtat tgaaacaatg gtctcaatcc aacgctttgt 840 ttaagacccc tgttgtcata tctcaaaaaa cgctgtcaga taaaatttta actgcctggc 900 ataagtttag agacatttct aatcaaaaag aaacaaaaga aaaaactgtc aaattgtggg 960 aaggtaagct ggacaaattg ttagatataa caaaatgtag gtgtaacata ttactttgta 1020 cagataaaaa ttctccttgt aaggaggtta aaaattgttc tgcaggagta catattttat 1080 gcatttgttc taaagctatc aagttgccac aacttgattt attatggctt agatctcaaa 1140 gagataaggt tggtgaaaaa tcaggctacc aaatatatca actagacaag gaagaaacag 1200 aaagacaggt taaagcagtg aaacgtaaaa taacagatat tgcatcttta gaaaaaagac 1260 aaaatttaat taaaaaacct catattaata cacaggttca aattgaaaac atcgaagaca 1320 aagtttatga atcagtgggg catgacttaa atcaagaaac ytcacctgaa aaaattaaaa 1380 atgtaaatga agaatcctca gaagacagct ccaataatat caatcaaaaa acttccccat 1440 ttattaacac taaacaaaat tacatgaaga tccctaatac agcactagca tctcttcgat 1500 ttgacatatc tcctgctgct acatctgcaa ttgtatctgc tttccttcaa gatttgatca 1560 aatcaaatta cttgmctcca aaaatggcta gtttagcatg tgatcctaaa aaaatttgga 1620 gagccagaca agaggttatg gcyaaaacta gagttttgtc agaaaaactc attattgaag 1680 ctcctatcaa aggtatattt gtagatggaa gaaaagatcc tactctaatt ttgattgaag 1740 attcaattac taaaacgttt tgtcgaagaa cagaaaagca agaaaacata agtgttactt 1800 ctgaaccaga tggaagctat ttgactcatt acacaccttc aaagggtcca aacaaaccag 1860 ctaaagaagc tgcaatcggt ctttataact ggatggttcc tagaggaatt gatcaaacta 1920 tagttgtgat tggatcagat tctacaaatt caatgactgg atctggtaac aatggcggat 1980 tattaactca cctggagaaa cttattggaa gaaaatgttt ttggtcaatt tgcatgttac 2040 ataccaatga gctaccttta agacacctta ttgcaaatct tgatggacct acaaactcca 2100 aggatggatt tatagggcct ataggcaaac tattatcaaa agtaaatcaa ttaaaacgac 2160 tagaaaagtt tgaaccaatt agacagattg aacctttgat taaaattccg gatgacattt 2220 taaagaatat gtcaaccgac gcttcactaa gctacaagtt agtttcttgt cttgaatcag 2280 gtatcttgaa tccttcactt gtcaacagaa aatgtggtaa tctttgccat agcaggtggc 2340 tcacaactgg tcaggcaatt ctttggcttt acattagtga acatggctta gaaggtgata 2400 ctttaaaaaa tctaaaagtt ctttctcagt ttgtggctca agtatacttt catatgtggt 2460 ttagaatcaa ggtaaagcac tctattgtcg atggacccca tcatctaatc aaacttcttt 2520 atctattaag aagtcaaact gtcgaagtta taaatgctgt tagtaagtct gttcaaaatg 2580 gggcatttca tgctcattct gagtctcttc ttatttctct tcttgcaagc tcatcaattg 2640 aaaatagagc ttttgcagtc aagatgatta tgaaagttag gggagattta gagcaaggta 2700 atgtaagtgt aagaaaccga aaaaaaccta cacttaactt tgatgctatg actctccttg 2760 aactcattga ctggagtaay gaacaaatct tggagccaaa ttttacatgt aatatgacaa 2820 aaaaagattt acaaaaagta atagattctc ctatggaagt accttactat cctctccaca 2880 cccagtcttg tgaaagagtt gttaaacaag tgactgaagc agcagctgct gtttgtggtt 2940 ttcagagacg tgatggtttt attagagcaa gaatagagca cagagatgtt gttccaaatc 3000 tgaaatccaa raaagattta atcaaattgt ttagttttga ttgattgatt ttaaagaaaa 3060 ttttatttyc cataaaaaca tgttgttgtt ttatttaata ttttgtgtaa ttataatata 3120 tttggacaaa cctgtgtgcr cccacagaac attttgatgt tttacatata acagtttcag 3180 gaaatttatc tgcacatttt aaactttaag taatactttt ttagagttca acaccctcac 3240 ttatgaggcc aaatgaactc aaaagcatga gaaaattatt ttggtgtttt tattttcaga 3300 gttcttggtg ccattttttg caaaacattc tcatttgaag cataaagata aacttaagtt 3360 tagagttgaa aatggctgaa gatatcatgt aaaacataaa aaaaactttt gtgggtggag 3420 ttaaatctaa ttcaaaactc tartttttac ctaatttgct tgattttata ctactttaaa 3480 aagtactttt tcaaatcaat aatgcatata tatatatata tatatatata tatatatata 3540 tatatatata tatatatata tatatatata tatatatata tatatttgtc tggtaaaatg 3600 ttttaacaac catktgtaat aaaamttaat tttcattata actttttttt aattttatcy 3660 attgttttca ttttaacgtt aattatytaa atattgcaat tttcaagttc atttttaccc 3720 aaaaaaaaaa aaaaaaaact tttatttcaa tgttgcttaa aatacatacc acacaataaa 3780 atcactcccc aaaytgtttt tttttttaat ttaaaatttt ttaaaagtat gacgtaccct 3840 a 3841 // ID Gypsy-1_HAS-LTR repbase; DNA; INV; 190 BP. XX AC AEAC01014456; XX DT 20-JAN-2011 (Rel. 16.02, Created) DT 20-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Harpegnathos saltator genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_HS_; KW Gypsy-1_HAS-I; Gypsy-1_HAS-LTR. XX OS Harpegnathos saltator OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. XX RN [1] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Harpegnathos saltator genome."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AEAC01014456; Positions 191 2. XX SQ Sequence 190 BP; 57 A; 32 C; 46 G; 55 T; 0 other; tgtattgtta gtattaagat agtcataaga ggattttacg attttgttat taatcattat 60 atgaagaata tctattattt gttataaacg ccgtcagcgc ctgtgcgcgc agcgacagcc 120 gaagcgtcat catgaacacg gtatagagtg cgagtggccg gcgtctgcga taagtctaaa 180 aaagcgagca 190 // ID BEL-9_DWil-LTR repbase; DNA; INV; 349 BP. XX AC scaffold_181100; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_DWil_; KW BEL-9_DWil-I; BEL-9_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-349 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181100; Positions 208042 207694. XX SQ Sequence 349 BP; 115 A; 62 C; 61 G; 111 T; 0 other; tgttggagaa gaactcctga atttacatta attaatttaa ttattcggaa atattaacca 60 ttgggttaaa tagctaaact atcgatatat cgcaagctca aagagagttc aatctagcat 120 cactgttgtc tctcttcttg cgtttccctt tacattttac tttgtattac tctctctcgt 180 agcatataag aattgaagaa tttactgtac caagagaact cgaatgcgac ggacgcataa 240 taaatgtaaa actactaacg tcgtctaatc tgattgtggc aagggaagtt ttgtttcatc 300 gaaaccaacg agtagcagat tataaatatg gccgagaaat tggccaaca 349 // ID DNA8-57B_AP repbase; DNA; INV; 215 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-57B_AP. XX NM DNA8-57B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-215 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1990-1990 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 215 BP; 74 A; 38 C; 36 G; 67 T; 0 other; accatgacta aataaacagg catgtcagct cgataacact gcgcataaaa ttgtgataaa 60 atgttcccca gacttggaat aaacgctaaa atattgtacg ctataataat aatctacatt 120 atacatttta tattattctt tccatgtctg gggaacatgc gcgacgatag cgaaaatgaa 180 attttagctg acatgcctgt ttatttagtc atggt 215 // ID Gypsy13-I_Dpse repbase; DNA; INV; 7268 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy13_Dpse; KW Gypsy13-LTR_Dpse; Gypsy13-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-7268 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1088-1088 (2009). XX DR Genome; Unknown_singleton_87; Positions 18523 11256. XX CC Positions [6339-6830] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2957..3904 FT /product="Gypsy13-I_Dpse_1p" FT /translation="MITQAVRAMQNDLLTQLSEQMAQIIQTNVAAHLQLTG FT RDQHTRNIAPDHSSQGQEAGGGRESAGFYREGRDTPRSLASDLSQRPDKVV FT HIMNGWKLRFSGDAEGISADNFIYRVEALTHATLESNFAMLCSNASILFDG FT KAREFFWRFHKSVVVVRWDVLCQALKKQFRDTRTDVDIREAIRDRKQKEKE FT GFDAFYDAIVQLMDSLESPLSEKSVVDILRRNLRPEVRHELLNIKITSVGE FT LREICRRRESFMEDVRRNYGYQKSVPFRRQVAELVEEHMSDDATEFSEGEG FT EAEIGALALVCWNCHKEGHRYQDC" FT CDS 4512..7199 FT /product="Gypsy13-I_Dpse_2p" FT /translation="MQVAEICQESHVLIPSQKEELQKVVSEFPSFAVSGLG FT RTSVLSHSIDAGDAKPVKQRHFPVSPAVEKLLYAEVDRMLKLGVIEESDSA FT WSSPVVLVQKPGKVRLCLDSRKVNAVTRKDAYPLPQIDGILSRLPRAMFIS FT SLDLKDAYWQISLDPESRDKTAFTIPGKPLYQFKVMPFGLTNASQTMTRLM FT DKVIPAELRNEVFVYLDDLLIVSDSFESHMKVLRVVAQHVRLAGLTLNVEK FT SKFCMRSVKYLGHIVGEGVIRTDPGKISAMIDFPVPNSLRALRRFLGMVGW FT YRKFIANFASIASPLTDLLKPKCRFVMTPEGRTAFEELKKLLCSAPVLRSP FT DFSQPFFVHCDASKSGVGGVLVQKTPEGDEFPIAFVSKKLNKAQRNYSVTE FT QECLAAIVCINRFRAYIEGHDFTVITDHASLKWLMTQTDLHSRLARWALKL FT QGFNFKIEHRSGRLNVVPDALSRVNEEDLAAVDVGYGQAVDLESPAFQSDE FT YVELVNRIKANDVNLPDLRVVEDRVYRKCDFATGSTLHDHFIWKLWIPQSL FT IPEVLANAHDHPLASHGGIHKTLERVRQFLYWPSLVKDVKAYVLACDTCKA FT TKAPNSVQRPPMGVAPESQRFFQKLYVDFLGPYPRSKSGNIGIFIVLDHYS FT KFVFLKAVKKLTADLVIKYLQQDLFHTFGFPETIVSDNGSQFKSEVFQKFL FT KAHKISHTATAVYSPQSNASERVNRSVIAAIRSYVRSDQKNWDEELSSICC FT ALRTAVHSSLGTSPYYMVFGQQFVSSGETYKVLRALRLLEDKSLAFSKEDS FT LELVRSKALTVMEKQRKKNERNYNLRSREVSYQVGQEVFRRNFKQSNFQAG FT YSAKLGPAYVKARIRKKIGNAYYELEDLQGRSIGRYHAKDLKQ" XX SQ Sequence 7268 BP; 2064 A; 1484 C; 1848 G; 1870 T; 2 other; gcctaggtaa gattgagaag ggtaaaggaa aattggacta gtcagataat ttcatccgtg 60 taggaccggc cattggcgca gtggataccg caaaggagga acgccgcctc gccgtatttt 120 ctttgcaaag cggcccggat cagcttcgcg gatccaaaga cgcgaggcgt cgaggagatc 180 gagcgtccaa gggacgccag aaggggtcca gaggagcagc caaagcagcg gtggaaaaaa 240 attgcaggcc caagtacacg ggcaccgacg tcacgctagc gggggttgag tggtttttgg 300 ggaagaggaa aaaaaaaact caatcgtttg gttggagagc gagccaaggc agggctatat 360 agcgtagcta gagcgaaagc aagagcagtc cgagatttgg cggggtcttt cgagtggctg 420 cgcgagtgat ttttatcgtc gggaaccgat gaagtgattt cggaagggaa aatctaaaac 480 taaaagaaac cgggaatggc cgagatctct agcaataata ggattcagtg ttaagtgatt 540 agtgttaata atttaatgag ttaagtaata tttctttgtt aggttttttt gtgcgttaaa 600 gtctgcgaca gtcggcgaag ggcangcgaa ggaaaggaaa aaaaagaaag ggcgagagag 660 tttttttgtg gtactcgagt ggaatgagtc gctcaaacta agagtggagc gagcaaagag 720 tgaatttgcg agtgaggttt ttttgtgtgc ctttgtttta ctcccgattg gagttcactt 780 tacaggtcca aatctcctcc gggaagcgct tcatcggtgg atatgcggat catcgggtga 840 gtgagaacga acaagaaaat ttaaccctgg caattccact catgagagaa aagaaaatag 900 aaatttagat aattacatac ttccccctat tgtttgttcc gtttcttccc cccctttcgg 960 tctggctgtt gtactgtctt tatacataaa ttccgagata attgttagtt ttattcctca 1020 tcattattaa gcaattttat tgatattatt ttttttattc tttttacttt ttatgcgtac 1080 tttatgttga gttatgttta gttttaattt aggaaccgct cagtttttag tcataagtca 1140 ataaatttta taatgtttta gttcacagtt ttttttgcct tcggctagcc aaatgcccct 1200 gaggatccca taacgacgca aaggggccat tccatataaa ttgcgaccta ggaatggcca 1260 aggtagggag ggtcaatccg ccctcaacga gatctgaggc ggggaaaagc gttcggggac 1320 aggtcggaga aagggccagt tatgtgttca agcaatttta ggtccacctt aactctcgtc 1380 gtagcacgca taaattttaa cttgttaata atgaagatag gcatcaggtg agaggagggc 1440 aagatcacca ccccaattag ccgacgagca tcccagccgg gaataaccgc gtgcgccctt 1500 ctcccactcc ccaacaccca gcctggttcg ttacagttat aagtaattat taatttcatt 1560 attataaagt tagtggcgcc caacaatggg acccaaaata attggtagaa ccgcatcccg 1620 atgtggtcaa cgatatcgca gaaacacaca catcaaaaac actttcggtt tttctggcaa 1680 acgatgtaca tatgtagtgg gcttgagaag agaggcatat acgtttattc gctttattct 1740 caaactacgg ttgctgcagt atctgggcca gaataccgat cttcatagtc agctaaattt 1800 tgcaagtcag tcgcatagga tttattgttg agaagttgct agcagcgatc gacaagccac 1860 ggctgaggac gtagagctta gtaggcgatt aaatacccta cagtaatgga gtccgtagcc 1920 caacgagtta gacgtttgga aaagcaggga agaagtcttt aaataagtcc accctgacca 1980 cgaataacga gttttagcaa agacccatcc atagtttaca atagtcatgc ggcagagcta 2040 gccatcaaac gctcggaaat acagaacaaa atctttaata agagtttgcc tggccacggg 2100 tgatggaaaa ccagcatacc tgtccgcgga agcaagagtg cgaaggagag taaagaagac 2160 gaaactgatt agcttcgatt tactcgcact cgagcttgca gtcgttgttt ttgtcttgcg 2220 gatcggcgct ccaaagcaat gaaacgtttg gttgttgttg cgagggctga ttagcggcac 2280 aagcagcggt aaattatctt tgttctttct tctcacacca gttccgatat gatttttttt 2340 caggggaact aagctatgct tgaaggcaaa taaatgtttt ttttgatttc tttatatatt 2400 gcgattttca caaaatatta ttcctatttt taattataaa ttgttttatg ttttattatc 2460 agctacaaac gcgtctcaat tcctcaagct gtaagcaaag agtttttatt tttttgtgtc 2520 ataagaaaaa aaaagttgct agaacgccga aacagctgtc tcggttgaga taggtcatcg 2580 accggcaaag aattgtgagc actgggaaga aaaaagctag gtgacgtggc gaagggaatc 2640 aataatggag ggagaacctg taaatccgtc cgtttgcatt ctgtgtgngg aagagtttgg 2700 atccctgcag ttatacaaga ccaggtgtgg ccacgaattt cataaagctt gcatcgtccc 2760 ctactcgaag gaaaatgcta aatgtcccag atgtggaaga gtcacttttg agttgcaagg 2820 cggcgcatcg gtttcgggca cgcaaaccgc caaacagcag gcaggaagca aagaaaaagc 2880 ttcaggcacc ccagcgacgg gatcagcaaa taataatcca ggttcagctt cagttgatgg 2940 cgggaatata ggaatcatga ttacccaagc agtgcgagcc atgcaaaacg accttttaac 3000 ccagttgtcg gagcagatgg cgcaaataat acaaaccaat gtcgctgctc atcttcaact 3060 gactggccgg gatcaacaca ctcgaaatat agctccagac catagttcgc agggtcagga 3120 agcaggcgga ggccgagaat cggcaggatt ttacagagag ggcagagaca ctcctagaag 3180 tttagcttcc gatctgtccc aaaggccgga caaggtcgta cacattatga atggttggaa 3240 gctacgattt agcggggatg cggaaggaat tagcgctgat aattttatct atagggtaga 3300 agctctcacc catgcaacgc tggaaagcaa ctttgcgatg ctatgtagca acgctagcat 3360 cctctttgac ggaaaggctc gggaattctt ttggaggttt cataagtcag tggtagtggt 3420 ccgatgggat gtgctttgtc aggcattaaa gaaacagttc cgagacactc gcacggatgt 3480 ggacattcga gaggcgatca gggaccggaa gcagaaagaa aaagaagggt tcgatgcctt 3540 ttatgatgcc atagtccagc taatggatag tttagagtct cccttgtcag aaaagtctgt 3600 ggtagacata ctgcgaagaa atttgaggcc cgaagtaagg cacgagttgc tcaatataaa 3660 aataacttcg gttggggagc tcagggaaat atgcagacga agagagagtt tcatggagga 3720 cgtgagacgg aattatgggt atcaaaagag tgtccctttc cgtaggcaag tagctgagtt 3780 ggtggaggag cacatgtctg atgacgcaac ggagttttcc gaaggtgagg gcgaagccga 3840 aataggagcg ttggccttgg tatgctggaa ctgccataag gaaggacatc ggtaccagga 3900 ttgctaatcc aaacgcaaga tcttttgcta tgggtgtggc attccaaaca tttacaaacc 3960 agcttgcgaa aaatgtcagc aaaaaaacgg cagggcgagc acacagcctc gtcagcttca 4020 gagtgtgcgt cgccaaaaat ccgcgagccc agacgtcagc cagtagaaaa gcaacaccag 4080 gcagttatag atacgagcat tccagtccag cccgtcagta cagtaaaaac aaatttatgg 4140 cataatttca ttaattctaa ttcagatgca gaggctgtag aaacactgtt ccgtcgtaga 4200 tccaattcaa gacactccaa acggatgaga tcattttgga aggcagctaa atcaataaat 4260 gcgattagat acaaagagct cgataaacgt ccgtaagcag cgtaatacca gcaggattcc 4320 atttaaacgg ggcatctcag ttgctcatac tgcagatggc aggcggcagg aggttctggg 4380 tcgattgaga actccagtca ccctatgata acgtcgtgaa ggatttggaa ctacatatca 4440 ttccgtcgct tagccaggac ttatatctag gaatcgattt ttggtcaacg ttcaatttat 4500 tacctccgga gatgcaggta gcagaaattt gtcaggaatc gcacgtacta attccaagtc 4560 agaaagagga gttgcagaag gttgtgtcgg aatttccatc ctttgcagtc tctggtttgg 4620 gtagaacatc ggttttgtcg cactcgatcg atgcaggaga tgctaagcct gtaaaacaac 4680 ggcattttcc cgtgtcacca gccgtggaga aactgctgta tgccgaagtg gataggatgc 4740 tcaaattagg agttattgag gaatctgata gcgcctggtc ctctccagtt gtgctagtcc 4800 agaaaccggg aaaggttcga ctttgcctag acagccgaaa agtcaatgcc gttacacgaa 4860 aggacgccta tccgttgccg caaatagacg gtatactaag tcgcctcccc agagccatgt 4920 tcatctcgag tctggatctc aaggatgcct attggcagat aagcttagat ccggagtctc 4980 gggacaaaac tgctttcaca attccaggta aacctttata tcagtttaag gttatgccgt 5040 tcggtctcac caacgcgtcg caaaccatga ccagattgat ggacaaggtg atccctgcgg 5100 aacttcgcaa tgaagtcttc gtctacttgg acgacttgtt aattgtttcc gatagctttg 5160 agtcgcatat gaaggtgttg agagttgtag cgcagcatgt tcgattggct ggattaacgc 5220 tcaacgtaga aaagagcaaa ttctgtatgc gttcagtgaa gtatcttggt catatcgtcg 5280 gggagggtgt cattcgaacg gatcccggga aaatttcggc catgatagac tttccagtgc 5340 cgaactcttt gagagcccta cggagattcc ttgggatggt tggatggtat cgtaaattta 5400 tagcaaattt tgcgtcgata gcttcccctc tcactgacct gctgaaacct aagtgcagat 5460 ttgtcatgac tccagaagga cgtaccgcgt tcgaggaact taaaaaactg ttgtgctctg 5520 cgccagttct tcgtagtccc gattttagcc aacctttctt tgtgcattgt gatgccagca 5580 agtcaggggt tggaggtgtc ttagtgcaaa agactccgga aggagacgaa tttccgatag 5640 cttttgtgtc gaaaaaactt aataaggcgc aaagaaatta ttctgtgacc gagcaagagt 5700 gtctggcggc gattgtttgc atcaatcgtt ttagggcata catagaggga catgatttta 5760 cggttatcac cgatcatgca tctctgaaat ggctcatgac gcagacggat ctacattcgc 5820 ggctggcccg atgggcattg aaattacagg gctttaactt caaaatagaa caccggagtg 5880 ggcgtttaaa tgttgtaccc gatgccttat ccagagttaa tgaagaagat ttggcggccg 5940 tggatgttgg ttatggacaa gcagttgatc tcgagtcacc agcttttcag tccgacgagt 6000 atgtggagct ggtaaacaga ataaaggcca atgacgttaa cttgccggat ctgagggtgg 6060 tcgaagatcg tgtgtacagg aagtgtgatt tcgccacagg gagtactctt catgatcatt 6120 ttatctggaa gttatggatc ccacagagtc tcattccaga ggtactagcc aatgctcacg 6180 accatccctt agcatcgcat gggggaattc ataagacttt agagcgagtg cgacagtttc 6240 tctattggcc aagtttggtc aaagatgtaa aagcgtatgt actcgcgtgc gacacttgta 6300 aagcaactaa agcaccgaac tctgtccagc gacccccgat gggagtggca cccgagtcgc 6360 agaggttttt ccagaagcta tacgtagatt ttctcggccc ttatccacgt tcaaaaagcg 6420 ggaatattgg catatttata gtactagacc actattctaa gtttgtcttc ctgaaggctg 6480 taaaaaaact aacggctgat ctcgtgatca agtatctgca gcaagatcta ttccacactt 6540 ttggctttcc ggagaccatc gtatcggaca atggctcgca atttaagtcg gaagtcttcc 6600 aaaagtttct caaagcgcac aagatatcac acacggccac agcagtatac tctccgcagt 6660 cgaatgcttc agagcgagta aaccgatctg taattgccgc cattcgttcc tatgtgaggt 6720 ccgatcagaa aaattgggac gaggagctga gcagcatttg ttgtgccttg cgaaccgcgg 6780 tccattctag ccttggaact tcgccctatt atatggtatt tgggcagcag ttcgttagct 6840 ccggggaaac gtataaagtt ctgagagctc ttcgattgtt ggaagataaa tccctagcct 6900 tctccaagga ggattcgttg gaattggtac gtagcaaggc cttaacggta atggaaaagc 6960 aaaggaagaa aaacgaacga aattacaatt tgagatcaag ggaggtttcg taccaagtgg 7020 gccaagaggt gttccggagg aactttaagc agagcaactt ccaggctggc tatagtgcca 7080 aactgggccc agcctatgtg aaggccagaa tccgcaagaa gatagggaat gcttattatg 7140 agttggagga tctacaagga cggtctattg ggcgatacca cgcgaaagac ttaaaacaat 7200 agctgatccg cgcagtttgg atccgacttg tgtgatacca ctaagtcaga tcttcggcgg 7260 ggggtatt 7268 // ID Mariner-1_NGr repbase; DNA; INV; 3209 BP. XX AC . XX DT 04-APR-2011 (Rel. 16.05, Created) DT 04-APR-2011 (Rel. 16.05, Last updated, Version 1) XX DE Mariner/Tc1-like DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_NGr. XX OS Naegleria gruberi OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; OC Naegleria. XX RN [1] RA Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B., RA Carpenter M.L., Field M.C., Kuo A., Paredez A. et al.; RT "The genome of Naegleria gruberi illuminates early eukaryotic RT versatility."; RL Cell 140(5), 631-642 (2010). XX RN [2] RP 1-3209 RA Jurka J. and Kojima K.K.; RT "Mariner/Tc1-like DNA transposons from Naegleria gruberi."; RL Repbase Reports 11(5), 1733-1733 (2011). XX DR [2] (Consensus) XX CC This family encodes a DD34D-type transposase, which shows a low CC similarity to transposases of Mariner, Pogo, and IS630 DNA CC transposons. Since it is closer to Mariner/Tc1/Pogo than Zator, CC it is classified into Mariner/Tc1/Pogo superfamily, but it could CC be derived from bacterial transposons independently. 68-bp TIRs. CC TA TSDs. XX FH Key Location/Qualifiers FT CDS 205..2964 FT /product="Mariner-1_NGr_1p" FT /note="DD34D-type transposase." FT /translation="MSNKQSSSYDTSPQSFSSYSNLIDLLPSVSQVPSWTC FT TQDQPDNTPIEKKDEKTPGKKKEEKPKELKKGTKRKFENVEEAIHDXLTMK FT NVKFLKEFLKSEFGETGGNLRKDALVAKLVTRICSSVEEETDERDESFEDN FT IKKKALLIVERNQETYNESQAKKAKLVNLEETEKKENPDDKIVLIGFKNKR FT KVGYVQKTREEQIKLVEKYKEQYSTPDRGSQQEFAKQNNISPSQFSRYLSA FT CKRNELTLKVERGHPPPLLLKDQILQVIDITRKERSEKKQVSNIRLGEIMK FT EVAGLKKTPSVDYISKFSKKFLYKRKIKTTITKRERIETAYEQLYRTYYYY FT CFYAAIMFRNLKSIDRSKIVVFDETGTNTKATERTNTPIKDDDAVVLQLAD FT DIKDTFLVAVTAEGGVLPVSIVESVPGKKSTINGNKVTVEEKIAGVHIEHI FT EQWVEKVYIPNSKEGDILLWDNLSHHKCKSVQKILKQHNRINILLPVGGHH FT QSPLDNMCFRYAKRELADWKKNNSNSDRKSRNAAFEQIISNMNPDVIVNSF FT RKCLMDIWNYDSEEEYLKYVRASLKMNNDLYDLYIREFNPMCIGQFYVELP FT TPSVKVETDEKLDQDPILDDVNNPWIDSDSMKENFDQKQVEQISKIILHIQ FT NKTEFNHNIQQNAANLQLYVLSTETIACSITLSTEALETIKDDTVFLPGNT FT FDIFGICFSKISSQYLQYIPSDFSQRVTNQSKKFIETNLPKSMKFTSKGFI FT IFPVARSLHWYLLIYDCQSTDWFLFDSFAKDFSVIESETSELFEMLPSFIV FT KPKTVRKTLQKKIQVDGNHCGDYCVLLMDFLSAGFSLSDAVDIMTKPFSIS FT NYRAILVARLKHLHVVEFEPQSPLSESEGFQSEPKKSSKIELGDYDEELDE FT LFNESWEDSIFGKKIQ" XX SQ Sequence 3209 BP; 1250 A; 475 C; 552 G; 931 T; 1 other; ctgtggttac gcaatgaagt gcaagcaatt ctacaatgaa atgcaagcaa ttattttttg 60 aactgcaagc aattctacaa tgtagtacaa ttacttgctg acgaactgca agaaagtcta 120 aatataaatt tccactttga tcaagaaact tgcaaaacac acatcaaaac ttccaatcag 180 cagaagaagc aataaattta taaaatgtca aacaagcaat cgagtagcta tgacacttca 240 ccacaaagct tctccagcta tagcaattta atagatttgc taccttcagt atctcaagtt 300 ccttcatgga cctgtacaca agatcaacca gataacacac ccattgaaaa gaaagatgaa 360 aaaacaccag gcaagaaaaa agaagaaaaa ccaaaggagc taaaaaaagg aacaaaacga 420 aagtttgaga atgtggaaga ggcaatccat gatmaattga caatgaaaaa tgtcaagttt 480 ttgaaggagt ttttgaagag tgaatttgga gagacaggag gtaatcttag aaaagatgct 540 ctggtagcaa aattggtaac tagaatttgt tcttctgttg aggaagagac ggatgaaaga 600 gatgagtcat tcgaagacaa catcaaaaaa aaagccctat taattgttga acggaatcaa 660 gaaacataca atgaaagtca agcaaagaaa gcaaaactgg taaacctgga ggagacagaa 720 aaaaaggaaa atcctgatga taaaatagta ttgatagggt ttaaaaacaa aagaaaagtt 780 ggttatgttc aaaaaacaag agaagaacaa attaaacttg ttgagaagta taaagaacaa 840 tattcaacac ccgaccgtgg atcacaacag gaatttgcta aacaaaataa tatctcacct 900 tcacaattta gcagatattt atcggcatgc aagaggaatg aactaacatt aaaggttgaa 960 cgtggtcacc cacctccatt attattgaaa gaccaaattt tacaggttat tgatattact 1020 cgtaaggaac ggagtgagaa aaaacaagtt agtaatataa gacttggaga aataatgaag 1080 gaagttgcag gattaaagaa aacaccttca gtagattata tttccaaatt ttccaagaag 1140 tttctttata aaaggaagat caaaacaaca attaccaaga gagaaagaat tgaaacagct 1200 tatgagcaat tatatagaac ctactattat tattgcttct atgcagcaat aatgtttaga 1260 aatttaaaat ccattgatag aagtaaaatt gtggtttttg atgaaactgg aacaaatact 1320 aaggcgacgg aaagaacaaa tactccaatc aaggacgatg atgcagttgt tttacaattg 1380 gctgatgata ttaaggatac ttttttagtt gctgtaacgg cagaaggagg tgtattacca 1440 gtgtctattg ttgagtcagt gccaggaaag aaatcaacta ttaatggtaa taaggtaact 1500 gttgaggaaa aaattgctgg tgttcatatt gaacatattg agcaatgggt tgaaaaagtt 1560 tatattccta attcaaaaga aggagacatt ttattatggg ataatctctc tcatcataag 1620 tgtaaatcag ttcaaaagat tttgaaacaa cataatagaa ttaacatttt acttcctgta 1680 ggtgggcacc atcaatcacc attggataat atgtgtttca gatatgcaaa aagagaatta 1740 gctgattgga aaaagaataa ttctaattct gatagaaaat caagaaatgc agcatttgag 1800 cagattattt ctaacatgaa tccagatgtg attgtaaaca gttttagaaa atgtttaatg 1860 gatatttgga attatgattc agaagaggaa tacttgaagt atgtcagagc ttcactgaaa 1920 atgaataatg atttgtatga tttatatatc agagaattta atcctatgtg tattggtcaa 1980 ttctatgtag aactacctac accatcagtt aaggtagaga cagatgagaa attagatcaa 2040 gatcccatct tagatgatgt aaacaatcct tggattgata gtgattccat gaaagaaaat 2100 tttgaccaaa aacaggtaga acaaatatct aaaataatac ttcacataca aaacaaaaca 2160 gaatttaacc ataacatcca acaaaatgct gccaatctac aattgtacgt tctttcaaca 2220 gaaacaattg cttgttcaat cactttatct actgaggcat tggagactat caaagatgac 2280 actgtattct tacctggaaa tacttttgat atatttggaa tttgtttttc caaaatcagc 2340 tcacaatatt tacaatacat accgtctgat ttctctcaaa gagttactaa ccaatccaaa 2400 aaattcatag aaacaaactt accaaagagt atgaaattta catcaaaagg atttataatt 2460 tttcctgtgg caagatcatt acactggtac ttactaatct atgattgtca atccactgat 2520 tggtttcttt ttgattcttt cgctaaggat ttttcagtga tagaatcaga gactagtgaa 2580 ttatttgaaa tgttgccatc atttattgta aaaccaaaga ctgttcgaaa aacattacaa 2640 aaaaaaattc aagtggatgg aaatcattgt ggtgattatt gtgtattact aatggatttt 2700 ctttcagcag gattttcact atctgatgcc gtcgatatta tgaccaaacc attttctatt 2760 tcaaattaca gagctatact tgttgctaga ttgaaacatt tacatgtagt agaatttgaa 2820 ccacaaagcc cactttcgga atcagaaggt tttcaatctg aacccaagaa gagctctaaa 2880 attgagcttg gcgattatga cgaagaatta gatgaattat ttaatgagtc atgggaggat 2940 tcaattttcg gaaagaaaat tcaatgaaca tgaagagtga tcaatcatct taataacaat 3000 acaaaatatc ttatataata aacaaattca ttatcaatat tcgaaacaag cagaactaca 3060 gcttgatctt acagtgtgac ttctctttga tctattttaa gaattgtggt tcaagaaact 3120 ttcaattcat taaattattc gttgcagttc aaaaaataat tgcttgcatt tcattgtaga 3180 attgcttgca cttcattgcg taaccacag 3209 // ID hAT-17_SM repbase; DNA; INV; 2511 BP. XX AC . XX DT 10-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-17_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2511 RA Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 67-67 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 480..2303 FT /product="hAT-17_SM_1p" FT /translation="MDKFLLNKSVKAPNSVTKKIEKKRKYAEDFIEFGFIA FT TEKNGQVFPFCLVCNETLSNASMVPNKLIRHKETKHPELSSKSKQYFETLR FT AQSEKQAFKMKNFCKIPEKAQIASFQVAYLLAKKRKPHTEAESIIAPAVSI FT IVSTMLGCEAAEKVKNVPLSANTISRRIQDLSNDIESQIIENFQSSEEEFS FT KLWSLQADETTDISGKAQLITFIRFVHGTKVMNQYLFCQELIETTTGRDIF FT NLIDDNVKKYQLQWQKCVGVCTDGAPSMQGKLKGLAALVRLVSPSVTITHC FT MIHREALVSKSIPSQLLIVMNQVVSIVNIIKSKPLNSRIFGLLCDAMDSDF FT KTLLYHTEVRWLSKGKVLERVVHLKTEIISFLEVEDITLDFDFRDSDWWLK FT VSFLNDLFNRLNVLNKSLQGSAENIMTATSKLLSFKEKLHLFIRKIKEKSF FT DFLPTVDKFANKDEISELALQTMEMLLKALDNYFPSLNVNLYQWILNPFGT FT NECLEFSTKEEEQLIDLKNDMVNKINFEQKGLEEFWISLGKDYPELSIKAL FT KLLLPFSTSYLCELGFSALVEIKSKKRERLQTVDEEMRVSLSSLLPRFDVV FT CSEHQAHPSH" XX SQ Sequence 2511 BP; 860 A; 432 C; 461 G; 757 T; 1 other; cagtgttctc caaactgggt gtcgcgtagg attttttggg tgtcgcgaga tttcttaaga 60 ttccataaaa attttattga gtgttaccga ttttaaaaca aattcaaaaa tttcagctaa 120 aaaatctcag gaacttccac tgagattgtt tttttattaa tcttaacgtt cattttattt 180 tcaaaaagtg atcttaagtg attgaaggcg actagaaaaa tattctcttt ctctaagttg 240 tttaacttgt attttggaaa aacaactttg cgaaaaatgt aaattgtcat tttaaacaca 300 acataaacaa aagaaacaat aaatttgatt aaaaagtgca aaaaaacatt atttccataa 360 gaaggtaaag aagtaacctc caatttgtgt tatttatcaa gtgaaaaatt gcatgtttta 420 agtaagtttt attcgataaa ttgtaaataa taatataatc gagtttattt tttattagga 480 tggataaatt tttgttgaat aaatcagtta aagcaccaaa ttcagtgacc aaaaaaatcg 540 aaaagaaaag aaaatatgct gaagacttca tcgaatttgg attcattgcg actgagaaaa 600 atggacaggt gtttcccttt tgtctggtgt gtaatgagac attgtcaaat gcgtcaatgg 660 ttcccaacaa attgattaga cacaaggaga ctaagcatcc agagctctct tcaaaatcaa 720 aacaatattt tgaaactctg cgagcacaaa gcgaaaaaca agcgtttaag atgaaaaatt 780 tctgcaagat tcctgaaaag gctcagattg caagcttcca agtagcttat ttactcgcga 840 aaaaaagaaa accacacaca gaagctgaaa gtattattgc tcctgcagtt tcgattattg 900 ttagcacaat gctgggatgt gaagctgctg agaaagtgaa gaatgtgcca ttgtcagcaa 960 acacaatttc aagaaggatt caagacctat caaacgacat tgaatctcaa atcatcgaga 1020 attttcagag ctcggaggag gaattttcca aattgtggtc gctgcaagct gacgaaacca 1080 ccgacatcag cggaaaagca cagctcatta cattcatcag attcgttcac ggtacaaaag 1140 taatgaacca atatttgttt tgccaggagt tgattgaaac gaccacagga agagatattt 1200 tcaacttgat cgatgacaat gtgaagaaat atcaacttca atggcagaaa tgcgtcggag 1260 tttgtacaga tggcgctcca agtatgcaag gaaaattgaa aggactcgcg gccttggttc 1320 gtcttgttag cccttcagtc accatcacac actgtatgat tcaccgagaa gctttagttt 1380 caaagtcaat tccttcgcag ctcctcattg tcatgaatca agtcgtctcg atcgtcaaca 1440 ttatcaagtc aaagcctttg aactccagaa ttttcggcct tctttgcgat gcgatggatt 1500 ctgatttcaa aactctgctc taccacacag aagttcgatg gctgtccaaa ggaaaagttc 1560 ttgagcgtgt tgtacacctc aaaactgaaa taatttcatt tcttgaagtt gaggatatca 1620 cactcgattt tgattttcga gacagtgact ggtggttgaa ggtatcattt cttaatgatc 1680 ttttcaacag actcaatgtt cttaataaga gtcttcaggg atcagctgaa aatattatga 1740 cggcaacatc aaagcttctt tcattcaaag aaaaacttca cttgttcatc agaaaaatca 1800 aggaaaaaag cttcgatttt ctccccaccg ttgacaaatt cgctaataag gatgaaattt 1860 ctgagcttgc tctccaaacc atggaaatgc ttttgaaagc actcgacaat tatttcccct 1920 ccttaaacgt caacctttat cagtggattt tgaatccatt tggaacaaac gaatgtcttg 1980 aattttccac gaaagaagaa gaacagctta ttgatttgaa aaacgacatg gtcaacaaaa 2040 ttaattttga acaaaaagga ctggaagagt tttggatttc tctaggcaag gattatcccg 2100 agttgagcat caaagcgtta aaattattgc ttccgttttc tacatcatac ttatgtgaac 2160 ttggattttc ggcactggtt gagatcaaat caaagaaacg tgaaagattg cagacagttg 2220 atgaagaaat gcgagttagt ttgtcgtcgc tcttgcctcg tttcgatgtt gtttgctcgg 2280 aacatcaagc acatccatca cattgaagtt tgataaggaa aaaataaata aaaatacgta 2340 aaataaaaat aatttgtgtt aattttttyt ctcaaacccg caagttaaac ggcattaaac 2400 tttcataaaa tttgatgcaa aattttacat cgaaaatatt tgtcaggggt gtcgcgaaat 2460 tttttttaag tgaatggggt gtcgccatcg aaaaaagttt ggagaacact g 2511 // ID Chapaev3-6_HM repbase; DNA; INV; 2362 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE an autonomous Chapaev DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2362 RA Bao W. and Jurka J.; RT "Chapaev3 DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 363-363 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 156..1544 FT /product="Chapaev3-6_HM_1p" FT /translation="MFFHSYFYSIVIKKVTISLFLLDQIYIMMASSNRHKC FT KHNPNSFCYICGCYALVRQRRNITNFVKLAYESYFDIKLGDQDKKWAPHTV FT CHICEESLRDWTKGKRNNLPFGVPMVWREPLNHIDDCYFCLVDTSGVGKIK FT RQKITYPNIPSAIRPIQHSNEIPFPIFKGLFLSEDEQSVSATETEEFLPDL FT EDFSSNAKQFSLPQCFSQIELNDLVRDLGLSKQAAEVLASRLKEKNLLDNF FT AKVSYFRTRDETFVDFFSEVNLFVYCHDISGLLLQLGISVYNPAEWRLFID FT SSKRSLKCVILHNGNIFGSVPIGHSVYLRETYDDIKMVLNLIKYHEHNWII FT CVDLKMVNFLLGQQKGFTKFPCYLCMWDSRARNQHWIQKEWPIRKTLTVGM FT QNIVNEPIVSRNKIVFPPLHLKLGFMKQFVKALKTDSDCFQYIVKSLPGLS FT IEKKKQEFLTGHKFAIL*" XX SQ Sequence 2362 BP; 826 A; 339 C; 393 G; 804 T; 0 other; acacaggtct acaaaaaaaa actttttttc tgttgcatga gatattgaaa cttattttta 60 tgtattttta aacgctgaat ccaaatataa ggtccgtttt tgttggtaag ctctagtttt 120 tttgcaattc tcaatttatc atttttttaa aaattatgtt ttttcatagc tatttttatt 180 caatagttat aaaaaaagta accatctcat tgtttttatt agatcaaata tatattatga 240 tggcttcttc aaatcgacat aagtgcaagc ataatcctaa ctcattttgt tatatatgtg 300 gttgctatgc acttgttcga caaagacgaa acattactaa ttttgtgaaa ttagcatatg 360 aatcgtattt cgatattaaa cttggtgacc aagacaaaaa gtgggctcca catactgtat 420 gccatatttg tgaagaaagt cttcgagatt ggacaaaagg aaaaagaaat aatttacctt 480 ttggagtccc tatggtttgg agggaaccac taaatcatat tgatgactgc tatttttgtc 540 ttgttgatac tagtggtgtt ggcaaaatta agagacaaaa aataacttat cccaacattc 600 cttctgctat tagaccaatt caacattcta acgagattcc atttccaatt ttcaaaggtt 660 tatttttgtc agaagatgag caaagcgtgt ctgctacaga aacagaagaa tttttaccag 720 atttggaaga cttctcatct aatgctaaac aattcagttt gcctcagtgt tttagtcaga 780 tagaactgaa tgacttagtt cgtgatttag gactttctaa acaagcagca gaagtattag 840 cgtcacgttt aaaagaaaaa aacctacttg ataactttgc aaaagtatct tattttagaa 900 caagagatga aacttttgtt gacttttttt cagaagtcaa tttatttgtt tactgtcatg 960 atatttcagg tctcctacta cagttgggta tttctgtata taatccagca gaatggagac 1020 tttttatcga tagctcaaag cgaagcctca aatgcgttat attgcacaat ggaaatattt 1080 ttggttctgt accaattggt cattctgttt atttgcgaga aacttacgat gacattaaaa 1140 tggtgcttaa cttgattaaa tatcatgaac acaactggat tatttgtgta gatctaaaaa 1200 tggttaactt tctacttggt cagcagaaag gttttacaaa atttccatgt tatctctgta 1260 tgtgggacag tcgagcgaga aatcaacact ggatacaaaa agaatggccc attcgcaaga 1320 ctctcacagt gggcatgcaa aatattgtga atgaaccgat tgttagtcga aacaaaatag 1380 tttttccacc actccatctt aaacttggat tcatgaaaca atttgttaaa gcactaaaaa 1440 ctgatagtga ttgttttcag tatatagtta aatcattacc aggattgtca atagagaaaa 1500 aaaagcagga gtttttaacg ggccacaaat tcgctatctt ataaaagata aagaattcat 1560 taaaacgatg aatatcaaag agaaaacggc ttggttatcg tttgtgattg ttataaaaaa 1620 ttttcttgga aataaaaaag cagaaaacta tgcagattta gttgataaaa tgttgaaagc 1680 cttttgtgat cttggatgca aaatgagtat aaagcttcat tatataaata gtcatcttga 1740 ccagtttcct gaaaatcttg gagatgtgag tgaagagcaa ggagagcgat tccatcaaga 1800 tcttaagaca atcgaagatc gttaccaagg acgttgggac atacacatga tggcagacta 1860 ctgctggggc atcaagcgag aagacacagg aaaagtttat aaacgaaaga gcaacaagca 1920 aaagtttctt cctgattaag tagttgtaaa taagcaatat aatctgcaat cttttttaaa 1980 actaaagtaa aaaagatttt gatctttcac taaagtaaaa gattttgtta ttaatcaata 2040 aaaagttaat aactttgcta ggacttggat atttgaaaaa tatttatttt aatcagttta 2100 catttagttt catttgcttt ctgtttttgt tttagttttg taaacaactt ttattctaaa 2160 aagtatatcc tagcatttcc aaatgagttt caaatctgta ttttagaact atttctactg 2220 ttttgcaaaa ttaaaaaaat ataaattgca aaaaatctgt agctttccat aaaaaatgaa 2280 tgccagattt gaattcagcg tatcaaaatt atgtaagaac aagtgtcaaa tttaatgcaa 2340 taaatttttt gtagacctgt gt 2362 // ID I-2_BM repbase; DNA; INV; 5480 BP. XX AC . XX DT 28-JUL-2009 (Rel. 14.07, Created) DT 28-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons from a silkworm - DE consensus sequence. XX KW I; Non-LTR Retrotransposon; Transposable Element; I group; KW I-2_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5480 RA Kapitonov V.V. and Jurka J.; RT "New families of I non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1532-1532 (2009). XX DR [1] (Consensus) XX CC The consensus sequence was derived from multiple alignment of CC several copies ~98% identical to each other. The 3' terminus is CC composed of the (GAA)n microsatellite. XX FH Key Location/Qualifiers FT CDS 152..1618 FT /product="I-2_BM_1p" FT /note="ORF1." FT /translation="MSMSSSVGENVPPDRGRSSLDSEMDDLSLILNSPVKA FT SQKRPAEDVLSVPEKRSAPHLPAAASNQHTYTRPGFDNNSVLNYSETDSGP FT FVVHVSRSDGDFAPVSNHTRTLKIAQIIYNGKISGIDEIKNMGRNRAAIIF FT KTYNEANSFLTNPLLSSNKLSAIIPRFQVTRMGVVRQIPVEWSLEDLVSWI FT ECRSVSTTVIKARRMNKKKKIDGKTVWEPTGTVVLSFLGQVLPKHVYCCSV FT SLPVNIYTLPTIQCLKCCRFGHIRDQCRSQARCSRCAGPHEGNSCSVSENN FT ITCLFCSGCHPASDPKCPEHARQRSIKMVMSEENISYIEAARRFPSVRTSY FT ADVANSSLPQKPIFLSRSAPSSNTLPSSSPSSSRHTPSTSYKKTVFIEKSS FT KPILSKSYDRHAHNDIISPPNSCLPNGCALSTPTPCAPATNDNLAELMSML FT LLNILSKFNDILPNNVLIQIHALISSLVNSQPKHNDEGPTMELS" FT CDS 1590..5276 FT /product="I-2_BM_2p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="MTKVLQWNCRSIRNKKQEIIYLSNQYNPVLFAVAESW FT LRPDTLLRVPGFSCLRDDRDDGWAGTAILVRRNISFSRITLPPHSSAINIV FT AVRCFNITVVSVYIPHPNSNLIQEFSTLLTSLPCPLLVLGDFNIRHTMWGS FT ELCDSNSSLFLDKLDELNLCLLNDGTPTRRVSPSQSPSAPDLSLSSPSLVN FT SISWSVLSNSHGSDHLPILLTLADSSIPSSIPYEPLLKFRLNKADWQKYSS FT LVQNNLESLPDANDENTLFLYLEFCSILSSAAELTIPLKNSASNKIPSPAW FT WDAECSQIVQKRKEAESEFSKILSLENFLEFQKISAISKRFLRQKKSLAWR FT NFCESLSPRTPASLIWKKIKAFRKSQSDNNINSNSLSWLPDFISKLCPPFV FT PFKPCPLIPSSASNDPMNSSFSFAELCCVLDNLRDSAPGIDGFPYSFIKKL FT SDSSKKIFLNLLNHIFLSSSIPASWKTQLIIPILKPGKDPAQSCSYRPIAL FT SSVLAKIMEHLIKNRLEWIVENRSILPNSQFGFRRGYGTLDSLSLLTSDIR FT VAFSQNRHVIAVFLDISSAYDNVDLHLLRQKLINLNMPPRIVNFIFNLFYD FT RSIIVRILGENSSHRTWKGLPQGSVLSPLLYNIYTAILDTTVNRFCRILQY FT ADDIALYTISNSFSQACNCLNSALFNLNSWLNDHCLSLSIPKCSAVAFSRK FT RVIPEINIEICNQSIPVLSKVKFLGILLDSKLSGVHHLNYICNKVEKSINV FT LRALSGVKWGSHPYSQKLLYNALIRSHFDYGCFVLEPCNKMALTKLNRIQS FT KCLRIISGAMKSSPINALQVECFEPPLALRRQFLADRFFYKVISLSNHPLL FT ALLSQLAQLTETSRYWRHKSVPPLVNSYKKLQNLASPISQYDKFPLFLFSF FT DCLLFRPTTFLNIGIHKDSPGANLQFQNIVQLQWPNHYHLFTDASKLSQNG FT PVGAAVWIPKFKLSLKYKLPQTSSVYSGEAVALLEAALFIKTRGLSKSVIF FT SDSLSCLQDITKFPSHSKVNFEIILKIKETLFQCHCSGLDVTLVWIPSHSG FT ICGNELADSCAKEAVVMGCNKYNKIFPRDLHSLAKSDMLQSWNESWQISRQ FT SVGKHYGNLQPTLPSKPWFSSHCHLPKKIVSAIIRLRLGHVCSPVFLAKIR FT VRDHSLCECGLDEGTLDHIFFDCPNIVSPLYDILPNDIPRPININCLLTLV FT YSKHVNILCKFLSANNIYL" XX SQ Sequence 5480 BP; 1493 A; 1273 C; 882 G; 1832 T; 0 other; gagagggtag ttggacatct gtcagatcac acgcgtttcg attattttta gtgtttttcc 60 cgttttttct gtgtttttaa gaattgtgtt gccctatttt tgttggtggt ggattagttt 120 aagtttcatt taactggtaa gtgtattttt tatgtcaatg tcgtcctcgg tgggcgaaaa 180 tgtcccgccg gacagaggga ggtcttcttt agattcagag atggacgatc tttctctaat 240 acttaattct cctgtaaaag cttcacaaaa acgccctgcg gaagacgtat taagcgtccc 300 agaaaaaaga tcagctccac atttaccggc agctgcctct aatcaacata cctacacccg 360 gccaggtttt gataacaatt cagttcttaa ttactctgaa actgactctg gcccttttgt 420 tgtccatgta tctcgttcag atggtgactt tgccccagtt tcgaaccaca cccgtaccct 480 taaaattgct caaattattt acaatggtaa gatctcaggg atagatgaaa ttaaaaacat 540 gggtagaaat agggctgcta ttatttttaa aacatataac gaagcgaatt catttttaac 600 aaacccatta ctttcatcta acaaattgtc agcaattatt ccacgctttc aagttaccag 660 gatgggtgtc gtgagacaaa ttcctgtaga gtggtctctt gaggacctag tctcatggat 720 tgagtgtcgg tctgtttcaa ccaccgtcat taaagccaga agaatgaata agaaaaagaa 780 aatagatgga aaaacagtat gggaaccgac gggtacagtt gtcctctctt ttcttgggca 840 agttcttcct aaacatgtat attgttgcag cgtttccctc cctgtaaaca tatatactct 900 tcccactatt cagtgtttaa agtgttgtcg ttttgggcac attcgtgatc aatgtcgatc 960 gcaagcccgc tgctctcgat gcgcaggccc ccatgaaggt aattcttgta gtgtttcaga 1020 aaataacata acctgcttgt tctgctctgg ttgtcatccc gcctctgacc cgaaatgccc 1080 cgaacatgcc cgtcaaagat caattaagat ggttatgtct gaagaaaata ttagttatat 1140 tgaggcagct cgcagatttc cttcagtccg cacctcctat gccgatgttg ccaattcttc 1200 tcttccccaa aaacctatct tcctgagccg atcagctcca tcttcgaata ccctcccgtc 1260 ttcctccccc tcatcttctc gtcatactcc gtcaacttca tacaaaaaaa ctgtttttat 1320 tgaaaaatcg tctaaaccaa tactttccaa gtcttatgat cgtcatgctc acaatgatat 1380 tattagccct cctaactctt gtttacccaa tggttgtgct cttagcactc ccaccccctg 1440 tgcccctgct acaaatgata atctcgcgga attaatgtca atgctgcttc taaatatttt 1500 atcaaaattc aatgatatac taccgaacaa cgtcttgatt cagattcacg ctttaatctc 1560 tagtctcgtc aactctcaac caaagcataa tgacgaaggt cctacaatgg aactgtcgta 1620 gtatcagaaa taaaaaacaa gaaattattt atctatctaa tcaatataac cctgtcctct 1680 ttgctgttgc ggagtcatgg ttgagaccgg atactctgct tagggtgccc gggttctcgt 1740 gtctccgcga tgacagagac gacggctggg caggaactgc tatcttggtc agaagaaata 1800 tatctttttc tcgcattacc cttcctcctc attcctcagc aattaacatt gtggcagttc 1860 gctgctttaa catcacagtg gtctcagtgt acatccctca ccctaacagt aaccttattc 1920 aagaattttc gacactctta acctctcttc catgtcctct cttagtctta ggagatttta 1980 atatacgcca tacaatgtgg ggttctgagt tatgtgactc aaattcctca ttatttcttg 2040 ataaacttga tgaattaaat ttgtgtctac ttaatgatgg aactcccacc cgtagagtat 2100 ccccatccca aagtccaagt gccccagacc tgtccttgtc ctctccctct ttagttaact 2160 cgatttcttg gtctgttctg tccaactccc atggcagtga tcatcttcca atattactaa 2220 ctctagccga ttcttccatc ccatcttcaa ttccatatga accccttttg aaattcagac 2280 ttaacaaggc cgactggcaa aagtattcat ctttagttca aaataatctt gaatctcttc 2340 ctgatgcaaa tgatgaaaat actctatttt tgtacctaga attttgttcc attttatcct 2400 ccgctgccga actaaccatt cccttaaaga attcagcttc aaataaaatt ccctcaccag 2460 catggtggga tgctgaatgc tcccaaatag tccaaaaacg caaggaggct gagtctgaat 2520 tttctaaaat tctctcttta gagaatttcc tagaatttca gaaaatttcg gccatctcta 2580 aaagattttt gcgacaaaag aaatctcttg cttggcgtaa tttttgcgag tctctctccc 2640 caagaactcc tgcctccttg atttggaaaa aaataaaagc atttcgtaaa tcacaatctg 2700 ataataatat taattctaat tccttgtctt ggcttcctga ttttatctcc aaactctgcc 2760 caccctttgt gccttttaaa ccttgtccct taattccctc ctccgcctct aatgacccca 2820 tgaattccag cttctcattt gcagaactat gctgtgttct tgacaacctc cgagattcag 2880 ccccaggtat agatggcttc ccgtactctt ttataaaaaa attatctgac tcttctaaaa 2940 aaatctttct aaatctttta aatcatatat ttctctccag ttctattcca gcctcatgga 3000 aaactcaatt gataatccca attctaaaac caggcaagga tccagctcaa tcttgttcat 3060 atagacctat tgctctctcg agtgtgctag ctaaaattat ggaacatttg ataaaaaatc 3120 gtctagaatg gatagttgaa aacaggtcta ttctccctaa tagtcagttt ggctttagac 3180 gaggttacgg aacattggac agcctgagcc tcctcacttc agatattcgt gttgcattct 3240 ctcaaaatcg tcatgttata gcagtttttc tagatatttc ctctgcatac gacaatgtcg 3300 accttcattt acttagacaa aaacttatta acctgaatat gcctcctaga attgtgaatt 3360 ttatcttcaa tttgttttat gacagatcta ttatagttcg catcttaggt gaaaattctt 3420 ctcatcggac ctggaaaggt ctgcctcagg gttctgttct cagtcctctg ctttataata 3480 tttacactgc tattttagat actactgtga accgtttttg ccgtatcctc caatatgctg 3540 acgatattgc cctctatacc atatctaatt ctttctcaca agcttgtaat tgtctaaatt 3600 ctgctttgtt taacctgaat tcttggttga atgatcactg cttatccctg tccatcccta 3660 aatgttctgc ggttgctttc tcccgtaaga gagttatccc tgaaataaat attgaaattt 3720 gtaaccaatc tattccagtc ttgagtaaag ttaagttttt gggcattctc ttggattcta 3780 aactctccgg tgttcaccat ttaaactata tctgcaataa agttgagaag agcatcaatg 3840 ttctacgcgc actctccggt gttaagtggg gttcacatcc ttacagccag aagcttcttt 3900 ataatgcttt aattagaagt cactttgact atggttgctt tgttttggag ccttgtaata 3960 aaatggcact tacaaaacta aatcgtattc aatcaaaatg cctcagaatc atctctggcg 4020 ccatgaaatc ttcccctata aatgccctac aagttgaatg ttttgaacct cctttagcct 4080 tgcgtagaca gtttctagct gatagatttt tctacaaagt aatttctctc tcaaaccacc 4140 cccttttagc tctactttcc caacttgctc aactgaccga aacctctaga tactggcgtc 4200 ataaatctgt acctcctctt gtcaactcgt ataagaaact gcaaaattta gcttctccaa 4260 tttctcaata tgataaattc cccctctttc ttttctcttt tgactgtctt ttatttcgtc 4320 ccactacctt cttgaatatt gggatccata aagactcccc aggagcaaat ttacaatttc 4380 agaacatagt acaactacaa tggcctaatc attatcacct attcactgat gcatctaaac 4440 tatcccaaaa tggtcctgta ggtgctgcgg tatggatccc caagtttaaa ctctccctaa 4500 aatacaaact ccctcagact tcctctgtgt actctggtga agctgttgct cttttagaag 4560 cagctctttt tattaaaacc cgcggtctaa gtaaatcggt aattttttcg gactccttaa 4620 gctgtcttca agacataact aaatttccat cgcattcaaa agtcaatttt gaaattatat 4680 taaaaattaa agaaactctg tttcaatgtc actgctccgg acttgatgtc actctcgtct 4740 ggatcccaag tcatagcgga atttgtggga atgagcttgc tgactcctgt gccaaggagg 4800 ctgttgttat gggctgtaac aagtataaca aaatttttcc tcgagattta cattctctgg 4860 caaaatctga tatgcttcaa tcttggaatg aaagctggca gatttctcgt cagtcagtag 4920 gtaagcatta tggtaacttg caaccgaccc tgccctctaa gccttggttt tcctcccact 4980 gccatcttcc taaaaaaatt gtctcagcta taatccgtct tagacttggc catgtttgct 5040 cccctgtttt tcttgccaaa attagagtcc gtgaccattc tctttgtgag tgtggattag 5100 atgagggtac cctggatcac attttcttcg actgccccaa tatcgtatcc cctctttatg 5160 atatccttcc caatgacatt ccccgaccta ttaatattaa ttgtttatta actttagtct 5220 attctaaaca tgtaaatatt ctgtgtaaat ttttatctgc taataatatt tatctctaaa 5280 gtagttagtc taacatatat ttgtttgtgc tgttcttaaa accacagatt cgcgtatgga 5340 aaaaccgacc aaatttattt gtatttagat ttagtaattt tttccctctc taagcacctg 5400 tcatttttgt aaatactttg acattggcaa aatcaatgcc aaaggcgttg gagccaaatt 5460 atctaaaaga agaagaagaa 5480 // ID YREP_CC repbase; DNA; INV; 2819 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Ceratitis capitata Y-specific repetitive DNA sequence. XX KW AT-rich DNA repeat; YREP_CC. XX OS Ceratitis capitata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Tephritoidea; Tephritidae; Ceratitis; Ceratitis. XX RN [1] RA Zhou Q., Untalan M.P. and Haymer S.D.; RT "Repetitive A-T rich DNA sequences from the Y chromosome of the RT Mediterranean fruit fly, Ceratitis capitata."; RL Genome 43(3), 434-438 (2000). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [1] (Consensus) XX SQ Sequence 2819 BP; 917 A; 506 C; 406 G; 988 T; 2 other; gaactcaaat acatatatta tatacttatg tatatatgta cccacttgat acatttaagt 60 atatcgttaa atatacaaat gtgatgtgtg tgacaaatct tcggtcggct gagtgatgca 120 aagaatcaat ttttgctgct taacagctcg tcgaatcaat gtagtgtttc atttaaagca 180 aaatcattga aacattaaaa aaaaattaga aatttaattt tataaatttt ttataaattg 240 aattttgtaa ctttcattac aaaaaaaaaa tcattatcac aatttcacaa atcgaaagtt 300 tttctagtca aatttttctt ttaaagtcaa gagattaaat ttcttttaat tgtcaaatat 360 tcaaaaatgt tatggtaaaa caattagcca tcaaaatgca tgtctaagct gataagtcaa 420 aattttacgt acctcgacca caaccggcgt cctcacgccc aaccccaact attgctattg 480 accgaaactc ctgatcaata ttgctaaaaa cttttccttt gcccaattta ccagataaat 540 tcgttcttta agcatgaaaa aaaataattc agctcgaaga catgcattga aattcaaaga 600 gttgtcggta cttattcctt aattaattcc atttctttgc cctgcacaat catcaatttc 660 tcgccactcg atgttagctt cggttccttc aagccctctt tcagatacaa tccaaccaca 720 agtcccttct gtgaagtgta tttggtaaaa tttttaacat ctcaacacaa caggctgcta 780 tgcgtgcgga tattcgtaac gaaaatcact taccgtcacc cgcgaaccat cggcgtaata 840 cttaagagat acttgtttga gattttccac ttttgaaatc tcagtagaga ttattaagcc 900 aattcgttat tcgtagcgca gacatttttt ggatgtttga ttaatcaaat tttattatcg 960 tcttttatgg aaatttttat ataggaaaac gttttcagct taaatatttt ctgaatgttt 1020 tccaaagcta cggttcgttt gagtttctag aaacaaaatt tttgtatagt aggttcgtac 1080 tgagagccaa ctatcaatct tatgggtttt tttaatttat ttggtttgga aagatcaaaa 1140 tttagttttt tagtgtttaa aatgtgatgt gcttacttga aacgactttc tcgaggccag 1200 acatatatga ggatacgagc ataattgaat tataaatgat tcgatacata taatacacaa 1260 gaatagagta gttcttttta ggtccacaaa tttccaaacg aaaaaatcaa aaaaaagact 1320 taaattttta gaatttagat aaaaatttta gtttaatgct ttgtaactaa aattgatata 1380 cttatgtaat atgtacccac ttgcatacat ttaagntatg cgtttaaata tacaaatgtg 1440 atgtgtgtgc acaaaatcwt gcgcgtcggc tgagtgatgc accacacaca tagaatctat 1500 gcaccactca gacgacgcgc aagattttcg cttctgcttt gaacagacca ctcgtccagg 1560 catcaatgtc gggcgaatcg tataactcca gtttgggtat ccctttgcga tccttcttca 1620 ttttattatc ctgatagcgc caaattgcca aaagcacact tcggtggcct attgtttcat 1680 ttaaagcaaa atcattgaaa catttaaaaa aaatttagaa atttaatttt ataaattttt 1740 tataaattta attttataac tttcattaca aaaaaaatca ttatcacaat ttcacaaatc 1800 gaaagttttt cttgtcaaat ttttctttta aaatcaaaga gattaaattt cttttaattg 1860 tcaaaaattc aaaaatgtta tggtaaaaca attagccatc aaaatgcatg tctacccgct 1920 gataagtcaa aattttacgc gcctcgacca caatgtaatg gaacttacca tgccctcgtc 1980 caacatctcc aattcattga aaccggcgcc ctcattccca atcccaacca ctgctattga 2040 ccgaaactcc tgatcaatat tgctaaaaac tttttctttg cccaatttcc agataaattc 2100 gttctttaag catgaagaaa aataattcag ctcgaagaca tgcattgaaa ttcagagagt 2160 tgtcggtact tactaattaa ttaattccgt tactttgccc tgcacacgat catcgaattt 2220 ctcgccactc gatgttagct tcggctcctt ctcgccctct ttctgataca atccaatcac 2280 aagtcctttc tgtgaagtgt atttggtaaa atttttaaca tctcaacaca acaagctgct 2340 aagtgctgcg gatattcgta acgaatggca cttaccgtca cccacgaatc atcggcgtaa 2400 tacctaagag agacttgttt gagatttttc acattttgaa atctcagtag agattttgta 2460 agccaattcg ttattcgtag cgcaaacatt ttttggatgt ttgattaatc taattttatt 2520 atcgcctttt tttatttaaa atttttatat aagaaaacgt tttcagctta aaaattttca 2580 ataaaaaata taattgaaat ttcgaactga ctgaatgttt ttcaaagcct gttacggttc 2640 gtttgagttt ctagaaacaa agtgttcgta ccatgaattt ttgtatcatg ggttcgtact 2700 tgagagccaa ctatcaaatc ttattggttt ttttttaatt tatttgcttt ggaaaagtaa 2760 aatttagttt tttggtgttt aaaatgtgat gtgcttactt gaaacgactt tctcttgaa 2819 // ID Gypsy-226_AA-LTR repbase; DNA; INV; 1709 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-226_AA_; KW Gypsy-226_AA-I; Gypsy-226_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1709 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1056-1056 (2011). XX DR [2] (Consensus) XX SQ Sequence 1709 BP; 463 A; 317 C; 385 G; 542 T; 2 other; tgtaaggaca ttgtgaacca actttgaagc gtttgctttt cgtgtttgtt tttttctgta 60 ttgctgtcag ccgttatgct gtcagtctgg gtaagtccgg tacaatgctg tcacttgtag 120 aagttgtcca tagacaacaa ccaacaaaac gacacctagt ggttgatcat gtaaacccaa 180 tggaatattc ctaaatttaa atctagtgga cagcaaatgc ctaattgcgt cgtagagaat 240 aaatcggggc attatagtcc aaataatagc gagaattgac gatttagata atgcatctct 300 cttaaatcga acaatagatg ggatttgcgc tgcagcgatt aggatcagcc caatcttact 360 cgcgtcggtt tcatmattct tgtggagtgt ttaataccga tcagtcgaag tttttgaaaa 420 gtgaaattga aaaktattgg aagtaattat agtgacttat actaaactcg aagtaagttc 480 gcgtccgttt tagtctattt aaatagtttt aaatgtttaa tttgaattta tttctgtgtt 540 tagcaaatca atttgtcgag tgtgaacgaa attggtagaa agttaccatt attgttaaag 600 tgagtgttga attcaagtcc taaattcagg taagcctatt tgattaacat gcaagtgagc 660 taataagtga gattatccgc gttgagcagg acaaccgagc ccagtcaacg tgggcggtta 720 gtgcggagtc actgcaatcc agcgaccacg tgatcgatat actatcccag tctaccaaac 780 cgacgcctac tagacgccgt ccaatttagt gtgaagcgta agccacatct cgtcgcattg 840 ctgtcgcaaa agtcgtgcga cgaaacattc agtgtatcac gactcgcaag tcaatcagtc 900 gcgtggcgca ggtcgtgcca tatctaacgg ccatcactcc acagtgctag gggatatctg 960 taccccgaga aacgtcacca gcgcgcgagt ctagtcagtg gcgcgagttg ttggatgcaa 1020 cattttgggt gagtcacact gggtggtcga ttttcgtctc gcaacaaccg ttggtggact 1080 gcttgttcat tgaatttcga cgtggattgt caccgtggtt ccgaaaggac atccgtcgcg 1140 ggatcccgaa agaagaaagg tcagatctgt cctttttagt tgctcacaag tcagaagcta 1200 gaaaaagttg taaatttaaa acgtgacgtc acacacggtt aggaaattag gctttcctta 1260 atttaataaa atcttttctg aaatcttctc gatcatttct cgataaattg atcgctttgt 1320 actttcgcgt tcgcaacttg aatcttgaaa taaaacgttt acttctagtt tagaatacac 1380 tccactttgc ttacgtcacg gatttctttt aaaatagatt taagattgga tttttcttat 1440 tagttgctaa gcttattctt ctttgtttga tatttgagag ttgtaaatat ttgctttagt 1500 ttttagtgtg aaattgatgt tggtaagttt gaatcggttt attgcctacg acgttttggg 1560 tttgttgaag tgcgatttcg gttcgtttct ataattgacc tgccctgagg aaggcagtct 1620 cgtccgggta atttatcaga gctgtcggtc cttctttaaa gaaggtggcg cgtaagccgg 1680 cagcttaagg aacggaggag attgttaca 1709 // ID Gypsy-14_SI-I repbase; DNA; INV; 5120 BP. XX AC AEAQ01023414; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_SI_; KW Gypsy-14_SI-LTR; Gypsy-14_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-5120 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023414; Positions 5586 467. XX CC Positions [2554-3096] - Reverse transcriptase CC Positions [4117-4593] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 640..4959 FT /product="Gypsy-14_SI-I_1p" FT /translation="MEQLGQIPTIKQKIATASRAVIVVMHKFIFEEDGDRN FT NRRRLREFRGFEFSDDSAEFRAKLQYAMRFSIGDLISICNILDIAYNGNAE FT QLRERIVRALMDIGSLQSVHDEDEDDEEDVDNEGDGNNVENDTYGINDEDN FT DEDADADEEGGGTVSVERGNNRRSERDRRQRNANNDSKQFTLSYKDVEDSV FT RAFNGTDSYPIERWINDFEEAATMFDWDDLHKVVFAKRSLKGVAKLFIQSG FT GIIKTWKKLKNALIEEFATKISSAELHKMLEKRRIKKEETVQEYFLAMREL FT ASRGAVEVDALIHYVINGINDEIGNKMILYGAKTVREFKDKLETYEKIRKS FT QADKTSKYTKNRDEWTRNAGRKFSREDGRKTTDSREDGALIRCFNCGISGH FT HARNCDKKSLGKKCFSCNKFGHEAKNCTDKRVSGSNSNAEQAVNLVSVSLT FT NVIKQVVINNENLRALIDTGSPASLMREDAFTKLKKFELVQSTRVFSGFGG FT GESKAMGYFQAVVLIDGDDFPLTFYVVPYETISVEVIIGRDIIEQANLNFS FT QEGVTIYKKLETDFLAHINIPEDSQADIEAIPDAETKNRVKKLIIDYRPKK FT CKTTGIKLKITLKDEKPIFQRPRRLALPEKKIVEKQVEEWIRDGIIEPCSS FT EYASPVVVVKKKDGSPRVCIDYRPLNRIIERDRHPLPLIEDQVDKLKGALL FT FSKLDLKNGFFHVEVDEESRKYTAFITHEGQYQFLKAPFGLSNSPPVFQRF FT ISQVFRPLVNDGIMTLYLDDIIIFAANFRQAADRLEAVLITARDYGLELNA FT KKCEFMKERIEFLGQIIENGTVSPSPEKVKAVTQFPEPTTIKEVQSFLGLT FT GYFRKFVSGYSMTAKPLSDLLRKDIMFQFGEVEKEAFNKLKQILSSEPVLQ FT IFDQTLETELHTDASQDGLGAILLQRSRIDGKLHPVQYMSRKSRQAERNYK FT SYELEVLAVIEALEKFRIYLLGLKFKIVTDCAAFTQTMRKKEVSPKIWRWA FT EKLEDFDYTLEHRSGTKMRHVDALSRNAVMTIIEDGILARVKAAQADDSEL FT RMIIETLKLKPDNKYTLMGGVLYKFVEGRDVLVVPNRMQNEVIQAVHHRGH FT LSVKRTEDAIRKEYFMPDLKKKVEKLIANCIPCILANRKQGKQEGELHPLP FT KGDVPLHTYHIDHVGPLESTSKKYNHILVVIDSFTKFTWLYPTKSTTSQEA FT IKKLESQQQIFGNPACIISDRGTAFSSKEFQDYCDNEGIKHVLITTGLPRA FT NGQVERINRTIIPILTKLSLEDPTKWYRHVRQLQNILNSSYQRSINTSPFE FT LLTGVKMRCKEDLRLKEILEQALQEQFNDERTSLRSKAKEHILKIQQENCK FT TFNKHRKASRKYKVGELVAIKRTQVCPGRKLRAKYLGPYVVTKVKYNDTYD FT VKRANPGEGPGCTSTCAEFMKSWSSV" XX SQ Sequence 5120 BP; 1890 A; 850 C; 1238 G; 1142 T; 0 other; atattggggg ctcgtccgtg ccaaaaaaca agccaaagag aaagtgctaa tgagtgttta 60 aaaagaagcg acgaaagacg aagaacgagg caatacgaag aataggaaac gacgaaaaga 120 cgaaagacga aggaaagacg gaagacgaag aaaagacgaa agacgaagga aagacgaaag 180 acgaagaaaa gacgaaagac gaagaaaaga cgaaagacga agaaaagacg aaagacgaag 240 caaagacgac agacgaaaaa cgaaaggacg aaagacgaag gaaagacgaa aaacgtgaag 300 aaacagtcaa gaaaacgagt aagcaagaat cgagaacgtt cgagaaaatc agggaaaaac 360 gcaagacgag agacaaagag gagcagacgg agtagggaaa agagagacaa acaacgagta 420 ggtaagaatc aagaaccttc gagaaaacgg gggaaagacg tggaaaaaaa aacggagtag 480 gaaaaaagag agacaaacga ataggcaagg attgagaaca ttccagaatc caaaaagaga 540 acatatatac gaacaacgca cgtgtgaccg agaacattcc agagccgaac gcaagaactc 600 caattgccgt ttgtgaaagt aaaatcgata gacgaaatta tggagcaatt agggcaaata 660 cctacaataa aacaaaaaat tgctacagca tctcgagcag tgatagtggt catgcataaa 720 tttattttcg aggaagacgg agataggaat aaccggcggc ggttacgaga gttccgcggg 780 tttgaattta gcgacgactc tgccgagttc agagctaaac ttcaatatgc tatgagattt 840 tcaataggcg atttaatatc aatttgcaat attcttgaca ttgcatacaa cgggaatgct 900 gagcaattaa gagagcggat agtgagggcc ctgatggata taggatcttt gcaatcggtt 960 cacgacgagg acgaggatga cgaggaggac gttgacaacg agggtgatgg aaataacgtc 1020 gagaatgata cctacgggat caacgatgaa gataatgacg aggatgctga tgccgacgaa 1080 gagggaggag gcacagtcag tgtggaacga ggaaacaatc gtcgatctga aagagatagg 1140 agacaaagaa atgcgaataa tgactctaaa caatttactt tgagctataa agatgttgag 1200 gattcggttc gtgcattcaa tggaaccgat tcgtatccaa ttgaacgatg gattaatgat 1260 tttgaagaag ccgctaccat gtttgactgg gacgacctac acaaggtagt ctttgcaaaa 1320 agatcgctaa aaggcgtagc caaattattc atacagagcg gaggaatcat taagacgtgg 1380 aagaaactga agaacgcgtt aatagaagag tttgctacaa aaatctccag cgccgagctg 1440 cataaaatgc tagaaaaacg aagaataaaa aaggaagaaa ccgttcagga atatttttta 1500 gcaatgagag agctggcttc aagaggtgca gtggaggtag acgcattgat tcactatgtg 1560 attaacggca tcaacgacga gataggtaat aaaatgattc tatatggagc gaaaaccgtt 1620 cgtgagttta aggataaact cgagacctac gaaaaaataa ggaaaagtca agcggataaa 1680 acatcgaagt acacgaagaa tcgagacgaa tggacgagga acgcgggcag gaaattctcg 1740 cgcgaggacg gaagaaagac gaccgacagc cgggaggacg gagcgctaat acggtgcttt 1800 aattgtggta taagcggaca tcacgccagg aactgcgata agaagtcgct aggaaaaaaa 1860 tgtttcagtt gcaataagtt cgggcacgaa gcgaaaaatt gtactgacaa aagagtaagc 1920 ggttctaatt cgaatgcgga acaagcggtc aatctggtaa gcgtgtcctt gactaacgtt 1980 atcaagcaag ttgttattaa taatgagaat ttgagagcgc tcatagatac gggaagtcca 2040 gctagtctaa tgcgggaaga cgcgtttaca aagttaaaaa aatttgaact ggttcaatcg 2100 acgcgcgtat tttctggatt tggaggtggc gaatctaaag ccatgggcta ttttcaagct 2160 gtagttctta tagacggtga cgactttcct ttgacgttct atgttgtacc ttatgagaca 2220 ataagtgtag aagtaattat tggtagggat ataatagaac aagctaattt gaattttagt 2280 caagaaggtg tgactattta taaaaaattg gaaactgatt ttcttgccca tattaatatt 2340 ccagaagatt cgcaagcgga cattgaagcc attccggacg cggaaacaaa gaatagagtt 2400 aaaaaattaa taatcgatta taggccgaaa aaatgtaaaa caacgggaat taaacttaaa 2460 ataacgctta aagacgaaaa gccgattttt cagagaccac gtagattagc tttgccggaa 2520 aagaaaatag ttgagaaaca ggtagaagaa tggatccgag atggtataat tgagccatgc 2580 tcttccgaat atgcgagtcc ggtagttgta gtaaagaaaa aagacgggtc tccacgagtg 2640 tgcatcgact atcggccact caatagaata atagaacgag ataggcatcc gttgccattg 2700 atcgaggatc aggtagacaa attaaagggt gcactactat ttagtaagct tgacctcaaa 2760 aacggatttt ttcatgtaga agttgacgaa gaaagtcgaa aatatactgc atttattaca 2820 catgaaggtc aatatcaatt tttaaaggct cctttcggtt tgtctaactc tcctcccgta 2880 ttccaaagat ttattagtca agtttttcga ccactagtta acgacggaat aatgacgctg 2940 tatttagacg acattataat ttttgcggcc aattttaggc aagcggcgga cagattagaa 3000 gccgtgctaa taacagcacg cgactacggg ttagagttaa atgctaagaa atgcgagttt 3060 atgaaagaac gtattgaatt tttaggtcaa ataatagaga atggaacggt atcgccatcc 3120 ccggaaaaag taaaagcagt aacacaattt ccagaaccaa cgacgattaa agaggttcaa 3180 agcttccttg gtcttacggg atattttcgt aaatttgtta gtggatattc aatgacggcg 3240 aaacctttaa gcgaccttct gcgtaaagat ataatgtttc aattcggaga agtagaaaaa 3300 gaagctttta ataaattaaa gcaaatatta agcagtgagc ctgtgttaca gatttttgat 3360 cagacactcg agacagagct gcacaccgac gccagtcagg atggattggg cgcaatatta 3420 ttgcaacgat cgcgtataga cggtaaatta catccggtac aatatatgag tcgtaaatcg 3480 cgacaagccg aaagaaatta taaaagttac gagttagagg tgttagccgt gatagaggca 3540 ctcgagaaat ttcgtatata tctacttggc ttaaaattta agatagtgac agactgcgca 3600 gcgtttacgc aaaccatgcg gaagaaagaa gtctctccaa aaatctggcg atgggctgaa 3660 aagttggaag attttgacta tacattggag cataggtcag gcactaaaat gagacatgtc 3720 gatgctttaa gcaggaacgc agtcatgacg ataatcgaag acggcatctt agcaagagta 3780 aaagcagctc aagcagatga ttcagaatta agaatgataa tagagacatt gaaattaaaa 3840 cccgataata agtatacgtt aatgggaggt gtgctgtata aattcgtcga gggtagagat 3900 gtattggtag ttccaaatcg tatgcagaac gaagtcattc aagcagttca tcacagggga 3960 catttatcgg taaagagaac agaagatgct attcgaaaag aatactttat gccggattta 4020 aagaagaaag tggagaagtt aatagcaaac tgtatccctt gtatacttgc caatcgaaag 4080 caaggcaaac aagaaggtga actccatccg ttacctaaag gtgatgtgcc tttgcatacg 4140 tatcacatag accatgtagg tcccttggaa tccaccagca agaaatacaa tcacatttta 4200 gttgtgatag acagcttcac taaattcact tggctgtatc caacgaagag tacgacttca 4260 caagaagcaa ttaagaaatt ggagtcgcaa cagcaaatat ttggcaaccc ggcctgcatc 4320 atttcagaca gaggaacagc attctcgtcg aaagagtttc aagattattg cgacaacgaa 4380 ggaattaaac atgtcctgat tacgacgggc ttgccgagag ccaacggaca ggtggagaga 4440 attaatcgaa ccataatccc aatactcacg aagttgtcac tggaagatcc taccaagtgg 4500 tatcgccatg tgcgccagct acagaatatc ttaaattctt cttatcaaag aagcattaat 4560 acaagtccgt tcgaactgct aacaggagtg aaaatgcgat gtaaggagga tctacgctta 4620 aaagaaatcc tggagcaagc cttacaagaa caatttaatg atgaacgcac ttctcttaga 4680 agcaaagcta aagagcacat attaaaaatt caacaggaga actgtaaaac gttcaacaaa 4740 catcggaagg catcacggaa atataaagtc ggagaattag ttgcgattaa acgcactcaa 4800 gtgtgtccag gacggaaatt acgcgcgaag tatctaggac cgtacgtagt aacaaaagta 4860 aaatataacg atacatatga cgttaaacga gcaaatccag gagagggtcc aggttgcact 4920 agtacttgcg ccgaatttat gaaatcctgg tcgagtgtgt agcaattact cgaggcgaat 4980 aattaatcaa gtaggccgaa ttttggataa ataataataa taacaataat gatacagtta 5040 gtaattattc gaggcgaata attagtcagg agggccgagt gtgggtttag agatagttat 5100 cttgagtcga gctagggcga 5120 // ID Gypsy-3_SI-I repbase; DNA; INV; 4783 BP. XX AC AEAQ01004791; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_SI_; KW Gypsy-3_SI-LTR; Gypsy-3_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4783 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01004791; Positions 1292 6074. XX CC Positions [1710-2135] - Reverse transcriptase CC Positions [3315-3800] - Integrase core CC 'TATA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(75..2324,2328..4217) FT /product="Gypsy-3_SI-I_1p" FT /translation="MGRHKSCHRDSKKKTSRGKRRHSSSSSSSSSSSSSDK FT KRKRLERFERLERIVNSLDRRSSHGHGSCIHRGDELMIPPFDPAKDDMTIE FT KWIQHVDELSIQYGWDDRAIMRLIPSRLKGHARQWYDTRPRLAVTWAETKN FT ELIQQFRKSVPFSKLFRDAAMYETAPGQSLGDYCFTKLNKLRKLNIVIPDE FT YIIDAVIGGIKDPNVARAVRSVQIDTASRLYAHMTALGDLPVKTEQRKAPF FT TSSHGRDGKNRQPKQSRHSQSQTANDETATSANPTDNDENRKQTKCFNCGK FT TGHFARKCCEPRAECEKCTRKGHITAMCPQGKDVNAVKSMESASNPYERTM FT FVNGQRIRGLIDTGSSCTLIRTSIAERYDMTVSLVPSVVLRGFAGQVTTSD FT RSTLCEIRIMNARAQVNAILVPDECLVYDVIVGRDFIGQEHIVMIKRGSAL FT SLKQLPAPDNDPENIIDVNFLNAESEVNIRVGTIPEDSKQRCIDLIREYGD FT CVASSMKDLGKTNAASMSLRCTTDVPVVYRPYRLPESEKRVLRSMIDELLV FT YNIIRKSSSPYASPVVLVKKSNGEHRMCVDFRKLNAITVKDKYPMPLIDDR FT MDKLGGNRYFTGLDLALGYYQVPMASDSIEKTAFVTPEGHYEFLRMPFGLT FT NAPAVFQRLMDQVLGDLRNSMAFPYLDDIIIPSRTIEEGMSRLRQVLDAFR FT KHHLTLRLEKCSFFKESIEYLGREISEQGVQPGRHKIEAVRSMEAPRSVKV FT RQFLGLASYFRRFIANFATIVEPITRLTKKNEPWSWGKAQNEAFTTIKDRL FT TTRPVLTIFDPSRSTELHTDASAIGVGAVLLEDCLQEVEGQMKVVAYFSKQ FT TTADQRCYHSYELETMAVVLALRHFRVYVLGIPFKVVTDCNALRATFAKRD FT LLPRIGRWWLEMQEYTFEVEYRAGSKMAHADALSRNPVPMALEVAQVDITE FT GDWVLAAQLQDEQLVRIRTILLGGTPTPEIKHYFDEYLIKDGKVYRRLGGN FT SKAWVVPRDARMQICRLCHDDVGHLGVEKTLERIQRNYWFAGMRRFVTKYV FT GACLHCAYYKHSAGKKQCKLNIIEKVPVPFHTVHIDHVGPFETSRKGNKFL FT LVIVDAFTKFVIVEAVKSQKTSYVVKALTNLMYLFGAPARIVSDRGTAFTS FT QTFWTFCLTYGIKHVLNAVAAPRANGQCERYNKTITQALATTTAGLDSNEW FT DSAVKQVQSALITLHNKSINATPTKALIGCEARSVTEARLLAEIQDVVHQL FT DLQELSAGIKEHIDAKQLEQKERYDRSRRAAVRYTDGELVLVRITTDPATE FT SSKKLHPKFKGPFRVRKVLPNDRYEVEDLREGGRRRRTVAAADNIKPWITV FT QDTRADNDPVSDAAGDQ" XX SQ Sequence 4783 BP; 1272 A; 1119 C; 1372 G; 1020 T; 0 other; tatcagaagt gggattgtcg cgtgcctaac accgctagga aattctacag ttgacgaaaa 60 tcgaagcgtg gacgatggga cgtcacaagt cgtgccaccg cgattcgaaa aagaagacgt 120 cgagagggaa aaggcgacat tcgtcttcct cgtcttcttc gtcgtcctcg tcgtcgtcgg 180 acaagaaaag gaaaaggctc gagcgtttcg agagactcga gcggatagtg aacagtctgg 240 atcgacgatc gtcgcacgga cacggttcgt gcatccatcg gggagatgag ctgatgattc 300 ccccatttga cccggcaaag gacgacatga ccatcgagaa gtggatccag cacgtggacg 360 agctgtccat acaatacggc tgggatgatc gcgccataat gagattgata ccaagcagac 420 tgaaaggcca tgcccggcaa tggtacgaca cgcggcctcg gttagctgtc acgtgggcgg 480 aaacaaagaa cgagttgata caacaatttc gtaagtccgt tccatttagc aagttattca 540 gagacgccgc gatgtacgaa actgcaccgg gtcaatcgtt aggcgactat tgtttcacga 600 aattaaataa gttgcgcaaa ctaaacatag tcataccgga cgaatacatt atcgacgccg 660 tgatcggcgg gataaaagac ccgaatgttg caagggcggt acgttccgta caaatagata 720 ccgcgagtag attatacgcg catatgacgg ctcttggcga tttgccggta aaaaccgaac 780 aaaggaaggc accgtttaca tcgagccacg gacgcgacgg gaagaaccga cagccgaagc 840 agtcacggca cagccaatcg cagaccgcga acgacgagac tgccacatcg gcaaatccga 900 cggacaacga tgagaaccga aaacagacaa agtgctttaa ttgcggaaaa acgggtcatt 960 tcgctaggaa atgctgcgag ccgcgagctg agtgtgaaaa gtgcacgcgc aaagggcata 1020 ttacggccat gtgtccgcaa ggaaaagacg tgaacgcggt caaaagtatg gaatccgcat 1080 cgaatccata tgagcgtacg atgtttgtca acggccaaag aatcagaggc ttaatcgata 1140 ccggcagtag ttgcacttta atacgaacct cgattgcgga gaggtatgac atgacggtgt 1200 cactcgtacc aagcgtggtg ttacgagggt ttgccggaca ggttacgaca agtgatcgat 1260 cgaccctctg cgagattcga ataatgaacg cgcgagcgca agtgaacgca atccttgtac 1320 cggacgaatg cttggtctac gatgttatcg ttggtcgcga tttcatcgga caggaacaca 1380 ttgtgatgat taaacgtggt agcgcgttat cattgaagca actaccggcg cccgataatg 1440 atcccgagaa catcatcgac gttaactttc taaatgcgga atcggaagtt aatatacgcg 1500 tcgggacgat acccgaagat tctaaacagc gatgtatcga cttgattcgg gagtacggcg 1560 actgtgttgc atcctcaatg aaagacttag ggaaaactaa cgctgcgtcg atgagtttgc 1620 gctgtacaac cgacgtccct gtggtttatc gcccatatcg gctacccgaa tcggagaagc 1680 gagtgcttag gagtatgata gacgagttgt tagtatacaa tattatccgt aagtctagtt 1740 cgccatatgc aagtcccgtg gtgcttgtga aaaaaagtaa cggtgaacat cgtatgtgcg 1800 tcgacttccg taagctcaat gcaatcacgg tcaaggacaa gtacccgatg ccgctgattg 1860 acgatcggat ggataagctt ggcggaaatc ggtattttac cggacttgac ctagcgttgg 1920 gatactatca ggtaccgatg gcatccgatt caatcgaaaa gacggctttc gtgacacccg 1980 agggtcacta tgagttccta cgcatgccat ttggactgac gaacgcaccc gcggtcttcc 2040 aacggttgat ggaccaggtg ttgggcgatc ttaggaactc aatggctttt ccgtatttag 2100 atgacattat cataccatca agaacgatag aagaaggtat gagccggttg cgccaagtgc 2160 tagatgcgtt ccgcaagcat cacttaactc taaggttgga aaagtgctca ttctttaaag 2220 aatccattga gtacctaggc agagaaatta gcgagcaggg agtccaacca ggacgtcaca 2280 aaatcgaagc tgtaaggagt atggaggctc cacggtccgt caaataagtg cgacagttcc 2340 tgggattggc aagctacttc cgacgattca ttgcgaattt tgcaacgatc gtcgaaccaa 2400 tcacgagact cactaagaag aatgagccgt ggtcgtgggg aaaggctcaa aacgaggcgt 2460 ttaccaccat caaggacagg ctcacgactc gaccggtgct aaccatcttc gatccaagcc 2520 ggtcgacgga attacatacc gacgcgagcg cgattggggt aggtgctgtt ctgttggagg 2580 actgtctgca ggaagtcgag ggccagatga aagtagtggc gtactttagc aaacagacga 2640 cggccgacca acggtgctac cattcctacg agctggagac gatggccgtt gtgctggcgc 2700 tgcgacattt cagggtatat gtcctgggga tacccttcaa ggttgtgaca gattgcaacg 2760 cgctgcgtgc aacgttcgca aaacgagact tgctaccccg cattggacga tggtggctgg 2820 agatgcagga atataccttc gaggttgagt accgcgcagg gtctaaaatg gcgcatgcag 2880 acgcgctcag ccggaaccct gttccaatgg cattggaggt cgcccaagtt gatatcaccg 2940 aaggcgattg ggtcctcgca gcccaactgc aggatgagca actggtgcgg attcggacga 3000 tcctgttggg cggaacgccg acgcccgaaa tcaaacacta cttcgatgag tacttaatta 3060 aggacggtaa ggtttatcga cgtctaggtg gtaacagcaa agcctgggtg gtgccgcgag 3120 acgcccggat gcagatttgc aggctgtgcc acgatgacgt tggtcacttg ggagtggaga 3180 agacattgga gcgtatccag cgaaattatt ggttcgctgg gatgagacga ttcgtcacca 3240 agtatgtggg tgcttgtttg cactgcgctt attataagca ctcagctggt aaaaagcagt 3300 gcaagctaaa catcattgaa aaggtgccag tgccattcca cacggtgcac atcgaccacg 3360 tagggccctt cgaaaccagc cggaaaggta acaaattctt actggtgatt gttgatgcgt 3420 tcaccaaatt cgttatcgtc gaagcggtaa agagtcaaaa gacgtcctac gtcgtgaagg 3480 ctctcacaaa tttgatgtac ctgttcggag ccccggccag gatcgtcagt gacaggggaa 3540 ctgcgttcac gtcccagacg ttctggacgt tttgccttac ctacggcatc aagcatgtgt 3600 tgaacgctgt cgctgcccct agagctaacg ggcaatgcga acgatacaac aagactatca 3660 cgcaggcgtt ggccactacc acggcaggat tggattccaa cgagtgggac tcggctgtga 3720 agcaggtgca gagtgcgtta attacgctgc acaataagag tattaacgct accccgacga 3780 aggcgctcat tgggtgtgag gcgaggagcg taactgaagc gcgtttactg gcggagatcc 3840 aagatgtggt gcatcagctg gatctgcagg agttgagtgc cggtatcaag gaacatatcg 3900 acgcaaaaca actggagcag aaggaacgat atgaccggtc tcgccgtgct gcggtgagat 3960 acaccgatgg cgagctggtg ctggtgcgga ttacgaccga tcccgccact gagagcagca 4020 agaagctgca cccgaagttc aagggtcctt tccgggttcg aaaagtgctg cctaacgatc 4080 ggtacgaggt cgaggatcta cgggagggag gtcgacggag acggacagtc gcggcggccg 4140 acaacatcaa accgtggatc acagtccagg acactcgggc ggataacgac cctgtgtcgg 4200 acgcggctgg tgaccagtag agccgaagga gcttacgggg gtcaactgtt gtaaaccagt 4260 gacccgccat cccgtcctcg cggttgctgc cggaggaaat aatgccattc tcacggttgt 4320 agccgggaga gagagtcagt tgtcatggga cgatgactcg ctgtgtcatt cttacggttg 4380 tagccggcgg tcgattgttg taaaacaacg attcgctatt ccgttcctgc ggttgttgcc 4440 gagggaatgg atgtcattct cacggttgta gccgggagaa aatcagttgc catgggacga 4500 cgactcgctg tgctcttttt cgattgtagc caggggtcga ttgttgttaa acaacgactc 4560 actatcccgc tcctgcggtt gttgccgagg gaaatggatg tcgttagtac ggttgtagcc 4620 ggaaatcaga catcgcacga gaatgagcga aaaccgtcgt atggtcgcag ccattctttg 4680 aaacttacag ggagcagagc caggggtgaa gccgcaatga caggaaggga ttggaaaact 4740 atagacgtgc ctgaggacag gccagaactt caggacggcc gaa 4783 // ID BEL-15_DWil-LTR repbase; DNA; INV; 215 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-15_DWil_; KW BEL-15_DWil-I; BEL-15_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 702044 701830. XX SQ Sequence 215 BP; 69 A; 38 C; 43 G; 65 T; 0 other; tgttcaggaa cggattgcgt ttatcgataa gtcagatgta gttgtaagaa tcgacgcata 60 cttaagcatt gtacacacac acatatttgt gcgccagcgc tttttatcgt ttggtttagt 120 aataaattgg cactcaactg attcagtagt gaagaaatac aaggtgaagt tatccaacat 180 aatttcaagg tacagtccac tgctagatca aatca 215 // ID Copia-28_DPu-I repbase; DNA; INV; 4489 BP. XX AC scaffold_92; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-28_DP_; KW Copia-28_DPu-LTR; Copia-28_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4489 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_92; Positions 259869 255381. XX CC Positions [1850-2374] - Integrase core CC 'CCTAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(719..2563,2567..4456) FT /product="Copia-28_DPu-I_1p" FT /translation="MSHISAIENLAEQLSNMGQPVSEAQLITKIICTLPPS FT YRGFIPAWDNVVEEEKTVALLTARILKEESMTAMYNDGQADPQDTAFFAAG FT TPQQTSSFRGNHSRGTTQRGSRGGFGSRGGQTPIQKKPKVECTYCINLGRS FT GQGHIASECRNRERDERSKRETANPATVNRDNIDFCFPSCTFDAETSNNAD FT WFADSGATQHMTDQKSKLANFTMVEIGTWTVKGINEALVNVHGYGDVYFTS FT AVDGVIRTGAFKNVLFVPKLGINLISIGAATEMGAEVHFVNNKVLFSKNGT FT VQITGQRKGKTLYLLNLTVKSAQEEVALAAKCPISLSVWHQRMAHVNVRTI FT QRMESQQLVEGLSIKSEKDETVPVCTGCVNGKMHRLPFPTGRIRAKEVGEL FT IHSDVCGPMQETSLSGARFYVLFKDDCSGWRVVNFLKNKSEVATHFKLYAA FT RLETETGKTVKTLRSDNGGEYRGEEFNNWLRSKEIRHETSIPYTPEQNGVA FT ERDNRTVVEAARSSLHAKKLPLRLWAEAVAYAVYTLNRTLSSTGKVTPYQT FT WNGGKPNISHLRIFGSKAFIHIPDSIRRKLDPKGREVTFLGYSETSKGYRV FT LIPETQKVQIVRDIIFDNHKAEERHEPADPPTLPVHPSTLPVHQSKPSAQD FT EPTENDLLPAISMTIDPVHTIEEETGPPLCNHGNESNENNIEERPTCANRP FT YPLRTRKPKVWISMKAAETSDNFEPRTYQEAMQSKNRHSWKPAILDEYKSL FT IENDTWEVVPLPPDRKVIRSKWVFTEKAGHNNTPTRFKARLVAKGYTQKQG FT IDYDETYAPVVKHDSLRILLSIVASLNLEMIQLDVKTAFLYGVVKEELYLE FT QPEGFVLAGREKEVCRLKKCLYGLKQAPRVWNQKFNEFLLLFGLSRSTADP FT CIYFRRREEELTGTLIAIWVDDGLVCSSSAEVIKDVLKHLNTYFEMRALPA FT DCFVGMEITRDRARRQLYVNQAKFIATILDRFNMDSANPKATPADPSSRLT FT MSMSPTTESGTFVMRKVPYREAVGCLMYLTIVSRPDIAFAVGQVSRYCENP FT GPAHWEAVKRILAYISGTRKHGLLFSGSSQPLNGFSDADYGGDLDHRRSTT FT GYIFTHNGGPVAWCSRRQTCTTLSTTEAEYVAASETSREAIWLRHLLQDID FT LANPNPVKIHCDNQSAIQLVRNPVFHARTKHIDIRYHFIRDQLESNKIDII FT YIETKMQIADILTKPLATPLFERLRDQIGIIPVPI" XX SQ Sequence 4489 BP; 1404 A; 1144 C; 952 G; 989 T; 0 other; ggttatgggc ccaggttcta acgaaaccgc catactgtaa aaacagaaat gaactgctct 60 gggtcaaggg atgtaagcca catcattaaa ttcaatggaa ccaacttccc actttggaaa 120 ctcggtctat ttgtatcact ggaagaacag gaagtgctgt ccatagttga tggaaccact 180 cgtatcccag atgaggtaac acacacttat ataaaaactc gcataatatg taccaaagtt 240 ttttaattgt gatgggaaaa acccaagtga tgcctatcgt cacccaagtg attccaccac 300 tgagactgtc agtatcaggt taaactaacc tataccattt atctcgtgaa ttctttcttt 360 ctctcagaat tatagaatga ttccaaatga aaacgatcca gaagctggtg tacttgagtt 420 gacgaacccc gctgaaatca tggcttggaa aaagaaagat gttctggcac gtcgtatcct 480 cctatcgacc atcgacccca acttacagaa tactctcctt ggttgcaaaa ctgcaaacga 540 aatttgggtg agattgacct cccaacatct taagaacgct gctgagaaca aatatgtagt 600 tatgcaacgg ttttatcact acaaattcca agcaggtaac cacataaatc attatggtat 660 ctggacataa aactgatttc actcggtcct ctttctcctc ttactaggcc acgacgcaat 720 gagtcacata tccgccattg agaacctggc tgaacagctc agcaacatgg gccagccagt 780 ctcagaagcc caactcatta ccaagattat ttgtactctc ccaccaagct acagaggctt 840 cattcccgct tgggacaatg tcgtggaaga agagaaaacg gttgcacttc taacagcaag 900 aatactgaag gaagaaagca tgacagctat gtacaacgat ggacaagccg acccacaaga 960 cacagctttc tttgctgccg gcacacccca acaaacgagt tccttcagag gcaaccatag 1020 tcgtggaact acacagagag gctcaagagg aggatttggc tcaagaggag gccaaacccc 1080 aattcaaaag aaacctaaag tcgaatgcac ctactgtatc aaccttggcc gatcaggaca 1140 gggccacatc gcctcggaat gccgaaaccg ggaacgagat gaaagatcaa aacgtgaaac 1200 agccaatccg gcaaccgtga acagagacaa cattgatttc tgctttccat cctgcacgtt 1260 tgacgcagaa acatcaaaca atgcagactg gttcgccgat tcaggcgcaa ctcagcacat 1320 gacggaccag aaatcaaaac tagcaaactt caccatggtg gaaatcggta cctggaccgt 1380 aaaaggaatt aatgaagcac tggtcaacgt tcacgggtat ggagatgtat atttcacatc 1440 cgcagtcgat ggagtcatcc gtacaggagc attcaaaaat gttctcttcg tgccaaagct 1500 cggtatcaac ctcatctcca ttggagccgc aactgagatg ggagctgaag tccactttgt 1560 gaacaacaaa gttttattct ccaagaacgg caccgtacag ataactggtc aaaggaaagg 1620 aaagacactt tatcttttaa acctcacagt caaatctgcc caagaagagg tcgcccttgc 1680 agctaaatgt cctatctcac tctctgtctg gcaccagcgg atggcccatg tgaacgtgag 1740 aacgatccaa cggatggaat cccaacaact ggtcgaagga ctgagcatca aatccgaaaa 1800 agacgaaaca gtccctgtgt gcacaggatg tgtcaatgga aagatgcacc gcctcccatt 1860 tcccaccggg cgcatcagag caaaagaagt aggagaacta attcactcag atgtttgtgg 1920 gccaatgcag gaaacatccc tgagtggagc tcgcttttac gttctcttca aagacgactg 1980 cagtggttgg cgagttgtga acttcctgaa aaacaaatct gaagtagcca cccatttcaa 2040 actatatgct gcacgactgg aaactgagac cggtaagaca gtaaaaactt taagatctga 2100 taatggtgga gaatacagag gagaagaatt taacaactgg cttcgttcta aagaaattcg 2160 ccacgaaaca agcatcccgt acactccgga acagaatgga gtcgcagaga gggacaatcg 2220 aacagtggtc gaagcagccc ggagctcact gcacgcaaag aagctacctc tacgcctatg 2280 ggcggaagca gtagcctatg ctgtctacac gctgaaccga accctatcca gcaccggaaa 2340 agtcacacca taccaaacat ggaatggagg taaaccaaat atctcacatc tccgaatttt 2400 tggttctaaa gcgttcattc acattccaga ctcgatcagg cggaaactgg atccaaaagg 2460 acgtgaagta acatttttgg gatactcaga aacatccaaa ggctaccgag ttctcattcc 2520 ggaaactcaa aaggtgcaga tagttcgtga tatcattttt gattaaaacc acaaagctga 2580 agagcgacat gaacctgctg acccaccaac tttgcctgtt catccatcaa ccctgcctgt 2640 tcatcaatca aagccatctg ctcaagatga accgacggaa aatgatctct tgccagccat 2700 cagcatgaca attgatcccg tgcacacaat cgaggaagaa actggaccac ctctctgtaa 2760 ccatggaaac gagtcgaacg aaaataatat cgaagaaagg ccaacatgtg caaatcggcc 2820 atacccgtta cggactcgaa aacctaaggt ttggatcagc atgaaagctg ccgaaacatc 2880 agacaatttt gagccacgaa cctatcaaga agcaatgcag tccaaaaatc gccattcctg 2940 gaaaccagca atccttgacg agtacaaatc acttattgaa aatgacacct gggaagtagt 3000 gccacttcca ccagatcgaa aagtcattag gagcaaatgg gtcttcacag aaaaagctgg 3060 ccacaacaac accccaacac gcttcaaagc ccgcctagtt gccaaggggt atacacaaaa 3120 gcaaggtata gactatgacg aaacctacgc tcctgttgtc aaacatgact ctctgcggat 3180 tcttctatca atcgtggcct cactcaacct ggaaatgatc cagctggatg tgaagaccgc 3240 cttcctgtac ggagtggtca aagaagaact ctaccttgaa caacctgaag gttttgtcct 3300 ggccggaaga gaaaaggaag tttgcagact caagaaatgt ctgtacggac tcaaacaggc 3360 accccgggta tggaatcaaa agttcaacga atttttattg ttgttcggtc tttcccgaag 3420 cactgccgac ccctgcattt acttccgccg tcgagaggag gaactcacgg gcacgctcat 3480 agcaatttgg gtcgacgacg gccttgtctg cagtagcagc gcagaagtca tcaaagacgt 3540 cctcaaacac ttgaatactt actttgaaat gcgcgcactt cccgctgact gttttgtcgg 3600 aatggagatc acaagagacc gtgccagacg ccaactgtac gtgaatcaag caaaattcat 3660 tgccacaatt cttgaccggt tcaatatgga ctcagcaaat ccaaaagcta caccagccga 3720 ccccagttcc cgtctgacaa tgtccatgtc accaacaaca gaatccggaa cgtttgtgat 3780 gagaaaagtt ccataccgcg aggctgtcgg ctgcctgatg tacctaacca tcgtatccag 3840 accagacatc gcctttgccg tcgggcaggt atcgcgctac tgtgaaaatc caggtccagc 3900 acactgggag gccgtcaaga gaatcttggc ttatatctca ggtactcgca aacacggact 3960 ccttttcagc gggagcagcc aaccactcaa tggattcagt gacgcagact acggaggtga 4020 tctcgatcac cgtcgctcaa cgactggcta cattttcact cataacggag gtccggttgc 4080 ctggtgtagt agacgccaaa cctgcacaac gctatcaaca actgaagctg aatacgtcgc 4140 tgccagcgag acatctagag aagcgatctg gctccggcat ctacttcaag acatcgacct 4200 ggccaatcca aatccagtta agatccactg cgacaatcag agcgcaattc aactagtccg 4260 taatccggtc tttcacgcaa gaacgaaaca tatcgatata cgatatcact tcatccgtga 4320 ccagctagaa tccaacaaaa tcgacatcat ctacattgaa acaaaaatgc agattgctga 4380 tattctcacc aaaccattag ccacgccact gtttgaacgc ctacgtgatc aaattggaat 4440 tattcctgtc ccaatctaat ttcttcagtt actctggttt gagcgggag 4489 // ID REP-5_CQ repbase; DNA; INV; 706 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A repeat family from Culex quinquefasciatus - consensus. XX KW Repetitive element; nonautonomous; REP-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-706 RA Kojima K.K. and Jurka J.; RT "Repeats from the southern house mosquito."; RL Repbase Reports 11(1), 608-608 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 50 sequences with >90% CC identity. No TIRs. XX SQ Sequence 706 BP; 235 A; 119 C; 135 G; 217 T; 0 other; tatatacagc aattccccac gaaaacagca tgattcgaaa aaaaagttct ccgatcgggc 60 tcaaaatttt tctgggggat ccttggccga aataattaga cccgtatttt tttgtttggc 120 cattagggtg acctacgccg tgttagggtg gttcgaaaaa tggcaatttt cgtcgatttt 180 cgcaaaaacc acttttttca aaaaatcata tctccgcgcc atttcatccg attttagctg 240 tcctagacgc aaaagaaagg tgattagttt ggctatttgg gaaaaatagt aagaagtttc 300 aaaaatctag cttaacattt gaaaaggtcg tatgaaaact taaaatgctg ttttgaaggt 360 ctcgggacca aagagcctat gtctgaaaat atttttatcg gattcctcgg aaaatttcac 420 ataacatatc aaaaaatggt gaagttatgt tttcgatact ctgagatacg attttttgaa 480 aataaaaact gggtttttcg acgcgccacg cgcaaaaatg ggaaaatgac gaaaacggga 540 aaaaatcgac ttttttcact aaaactgcga taactttaaa atttcagcga tgacctatag 600 atgttatggt accaaaagtt gcgtctttta attacgaaaa ttttggtacc ctaacatgta 660 taggtcatcg ctgaaatttt aaagttatcg cagttttagt gaaaaa 706 // ID Sola2-4_AAe repbase; DNA; INV; 4146 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola2-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4146 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1302-1302 (2011). XX DR [2] (Consensus) XX CC >99% identical to consensus. 4 bp TSDs. TIRs are ~550 bp long. XX FH Key Location/Qualifiers FT CDS 1173..3308 FT /product="Sola2-4_AAe_1p" FT /note="transposase." FT /translation="MSSRFLNCCRPFAEKKCSKELRNLTESTIDKLKAAGY FT APMSTLNTNLRICTSCRLNVDKRAICTSSVDQVAGSSKTTTTEELLDAPTT FT TEELPEVPSADSLATVPSATSVSTNQSEDECIQKVNIERFNEGIAGIKVTP FT IKWTKMGYVNYPEKKYREINEAVRRNLFKLGPEDVENTDYDEVIMNMKERF FT SNLATTRKEKLLILSMLPSSWSIQDAIDEFKTNRNTAKEAKQFKNNCLATK FT NARSSTSLTDETKEKIIQYFEDDEVSRAMPGQKDYVSVKKDGKRQAIQKRL FT MMTTLKEAYTRFKEINENIKVGFSSFASLRPRQCKLLSNSGTHNVCVCTTH FT ENINLILHSLKRINLSKDIKMLTGSLLCENTTSNCYLRSCSDCPDSSSLEN FT TLFAEFEENYIDQLSFEQWVTTDRCDLETIVKPVDEFVSFFCLKLESLIPH FT DFIKTEQSRFLKNTKNTLQDGEFLVICDFSENYSFVLQDEVQSHHWNVQQA FT TIHPFVIYFNGSTQIEHFSFIVISEDLRHDSVSVNLFIAKMINFLRVDKDK FT EIRKIYFMSDGAASQYKNRKNFSSLCQFKSKYGIDAEWHFFATSHGKGPCD FT AIGGTIKRMATRASLAKEREHPIKTAKELFDWANRRKEEDLTKLSFCFTTT FT EEYELTASELSEQYNNAKTIQGTQKFHCFIPLSENKIKAKLYSNCTDNDAK FT VFDIVKKLNNNK" XX SQ Sequence 4146 BP; 1520 A; 668 C; 695 G; 1263 T; 0 other; gagggattcc acggaggcat gcaagtcgat tctgatcgac catttcaaaa atatctgaaa 60 ctttgcacag tttttcagtt ccatctaaat cgtcattttc cgatatcaaa tcttcaagtt 120 gagtcacgac taacttttca aaagggtgta tgtgaaaatg gttcaaaaat attcaaaaag 180 ctgcacagca aaaacggttc gttcgattgt tagacaacta aagaaacaaa gttagacaac 240 taaataaaga ttccaaaaaa aatacacaca gtaaaaaaaa tttttttttg cattaaaaaa 300 catcattttt gtcacaaaaa ctcaaatatc tcaaaaccct atcggaatac caacgtaatt 360 ttttgaggga aaacggtcca ttatattagc tatctaccat aaaaatttgg tgatggtaag 420 ccaataaaca aaaaagttat gacatttcaa atatttcaca aatttgacac ttagtgaatt 480 tttttttttc attgttaatt ttttttagga ccgcagtttg ttgctgaatt ttttgttaag 540 ggtaccacat gaggttaaca agttgttttc atgatatttt atttaattat tcataactat 600 tatagcatct attagaaagt tagacgcgat ccagtgttgt gatctaaagt cttgatagtg 660 tcatattttt tattgtacgt aactgaagaa aaattctctc aatagtgttg aaaccttttg 720 ataaaagaaa cctataagaa atctaatatt gaagacaaaa gctacaaaaa aagttttcta 780 tacaggtata cgactagttt acctgcaaaa agtttacctc gtagagaaca aggcgggtct 840 atacccaggt gatctagtat acgtaaagca ggtcggtttt ctcggcgctg gttttctcca 900 caaagttaaa atctgattac acctgggtat agacccgcct tggtagagaa tagactttgt 960 taatcatatt tgggagaaag cgagtaactt caacaacatc aaaaacatga taggtacata 1020 ccatgaacat actcacgaaa aagctaaaaa tatactacca ctatggttat aaaagattta 1080 attgtaaaat ttttcttcag tcaagcaaac gacaccaaac tagtcagtaa aaacttaaaa 1140 gtgattttgt ttattaaaat ctaacgagca acatgagtag tcgctttctc aactgttgca 1200 ggccgtttgc agaaaaaaag tgttcgaaag agctacgaaa tctcaccgaa agcaccatag 1260 ataaactgaa agcggctggt tatgctccaa tgtctacatt gaatacaaat ttacgcattt 1320 gtacgtcctg ccgtttaaac gttgacaaac gggcaatctg tacatcatcg gtggatcagg 1380 ttgcaggaag ttcgaaaaca acaacaactg aggaattact agatgcaccg acaacaactg 1440 aggagttacc agaagtacca agtgcagata gtcttgccac ggtaccatca gcgacatctg 1500 tttcaacaaa tcaatcagaa gatgagtgca tccagaaggt caacatcgaa cgcttcaacg 1560 aagggatagc tggaataaaa gtgactccga ttaaatggac gaagatgggt tacgtcaatt 1620 atcccgagaa aaaataccgt gaaatcaacg aagctgtacg aagaaacctc ttcaaattag 1680 gacctgagga tgtggaaaat acagactacg atgaggtaat tatgaatatg aaggaaaggt 1740 tctcgaatct agccacgaca aggaaagaaa aattattgat tttgtcgatg ctgccaagct 1800 cgtggtctat tcaagacgcc attgatgagt tcaaaaccaa tagaaataca gcaaaagagg 1860 caaaacaatt caaaaataac tgtcttgcaa ccaaaaatgc taggtcgagt acttcattaa 1920 cagatgagac aaaagaaaaa ataattcaat attttgaaga cgatgaagta agtagagcta 1980 tgcctggcca aaaagattat gtatctgtaa aaaaagatgg aaagcgtcaa gcaatccaaa 2040 aacgattaat gatgactact ttgaaagaag cgtatacacg cttcaaggaa attaacgaaa 2100 atattaaggt aggtttttcc tcatttgcaa gccttcgtcc aaggcaatgc aagcttctat 2160 ccaattcagg aacacataat gtttgtgtgt gcacaacaca cgaaaatatt aacctaatct 2220 tacatagttt gaaaagaatc aatttatcaa aggatattaa aatgttaact ggtagtcttt 2280 tgtgtgaaaa tacaacatca aattgctatc tacgatcttg ttcggattgt ccagattctt 2340 catcattgga aaatacttta ttcgctgagt ttgaagaaaa ttatattgat cagttatcat 2400 ttgagcaatg ggtgaccacg gataggtgtg acctagaaac tattgtaaaa cctgtagatg 2460 agtttgtgtc atttttttgc ttgaaattag aaagtttaat tcctcacgac tttattaaaa 2520 cagagcaatc ccgcttttta aaaaatacga aaaatacatt acaagatggt gaatttttag 2580 tcatttgtga tttttctgaa aactatagct ttgtattgca agatgaagtg cagtcccatc 2640 actggaacgt acaacaagct acaattcatc cattcgttat ttatttcaat ggaagtacgc 2700 aaattgaaca ttttagtttt attgtaattt ccgaagattt aagacacgac tcagtatctg 2760 taaatttgtt cattgccaaa atgattaact ttttacgcgt tgataaggat aaagaaatca 2820 gaaagatata tttcatgtct gatggagcag catcgcagta caaaaaccgt aagaattttt 2880 cgagcctatg tcaatttaaa tcaaagtacg gaattgatgc agaatggcat ttctttgcta 2940 cgtcacatgg caaaggtcct tgtgatgcta ttggaggaac cataaagcgc atggccacaa 3000 gagcaagttt agccaaagaa cgtgagcatc caattaaaac tgcaaaagaa ctatttgatt 3060 gggcgaatcg cagaaaagaa gaagatttaa caaaattatc attttgtttt actactactg 3120 aagagtacga attaacggca tcagagctca gcgagcaata taataacgcg aaaacgatcc 3180 aaggaaccca aaaatttcac tgtttcattc cattgtcaga aaataaaatt aaagcaaaac 3240 tatactcgaa ctgtactgat aatgatgcaa aagtgttcga tattgtaaaa aaattgaata 3300 acaataaata aataaataag tattaataaa atgttttcat gatctcataa cgcataccca 3360 aatccaaaac ttttaaagtt tatatataac atttagaaac tcatcgatca tactctaaaa 3420 aaaaatatcc tggtaaaaaa aaaattattt tcgtgtaatc tgttaattca ttttaattta 3480 tacatatata catatacata taatatttat acatatacat aatatttata catataatat 3540 acataatact aatgaataca tgaaaacgac ttgtttaatt cacgcaaggc ccttaacaat 3600 aaaatcagca acaaactgcg gttcccgtaa taatttacac cgaaaaaaaa attttttttt 3660 ctactaaatg tgaaaattgt gacatgtttg aaatgtcata acttttttgt ttattggttt 3720 accatcacca aatttttatg gtagatagct aatataatgg accgttttcc ctcaaaaaat 3780 tacgttggta ttccgatagg gttttgagat atttgagttt ttgtgtcaaa aattatgttt 3840 tttaatgcaa aaaaaaattt tttttactgt gtgtattttt tttggaatct ttatttagtt 3900 gtctaacttt gtttctttag ttgtctaaca atcgaacgaa ccgtttttgc tgtgcagctt 3960 tttgaatatt tttgaaccat tttcacatac acccttttga aaagttagtc gtgactcaac 4020 ttgaagattt gatatcggaa aatgacgatt tagatggaac tggaaaactg tgcaaagttt 4080 cagctcaata gaaaaaaatg aattaaaaaa tttaccaaat tttggtgctg ttgcttggaa 4140 tcactc 4146 // ID Gypsy-187_AA-LTR repbase; DNA; INV; 165 BP. XX AC supercont1.90; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-187_AA_; KW Gypsy-187_AA-I; Gypsy-187_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-165 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.90; Positions 1952816 1952980. XX SQ Sequence 165 BP; 48 A; 30 C; 36 G; 51 T; 0 other; tgtagtggtc actgtgagtg attgagtgcc actacgggtt cgtcatactt ttgaaataaa 60 gttctgatca gtgtttgtca gagcatgcaa gctgaaacaa ctcggtgttg tatctcaagc 120 atccgaagat tattataaaa ccctttactg tgcatataac agaca 165 // ID DNA8-9_AP repbase; DNA; INV; 354 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-9_AP. XX NM DNA8-9_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-354 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1751-1751 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 354 BP; 111 A; 61 C; 65 G; 116 T; 1 other; cacgattaaa gatagtatgg ttgcacaacg aaccataata aagtgatgct ccgagtttcc 60 ttgttatcac aacgattagc ggaaccgagc attggcgctc caagcgttta aatattgaat 120 catatatata aatcatataa tttgtatgcc tagtagtggc tagtacttaa tttttacgct 180 atcagcgcta gtaaaagctt ctagngtatg cctagtagtg gctagtactt aatttttacg 240 ctatcagcgc tagtaaaagc ttctagagta tcgttgataa taaaagaaac tatgagcatc 300 attaaattgt ttcttatcat agttcgttgg gcaaccatac tatctttaat cgtg 354 // ID Ingi-1_AC repbase; DNA; INV; 4383 BP. XX AC . XX DT 28-JUL-2009 (Rel. 14.07, Created) DT 17-AUG-2010 (Rel. 15.09, Last updated, Version 3) XX DE A family of Ingi non-LTR retrotransposons from a sea slug - DE consensus sequence. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; I group; KW I-1_AC; Ingi-1_AC. XX NM I-1_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4383 RA Kapitonov V.V. and Jurka J.; RT "New families of I non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1529-1529 (2009). XX RN [2] RP 1-4383 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX CC [1] I-1_AC can be considered as a member of the Ingi clade in the CC I group. CC The consensus sequence was derived from multiple alignment of CC several copies ~97% identical to each other. The 3' terminus is CC composed of the (GA)n microsatellite. Target site duplications CC are not present.The consensus does not contain ORF1. The 5' CC terminus is precisely defined by 4 different copies of I-1_AC. CC Therefore, it is unlikely that the consensus is incomplete due to CC 5' truncations. In addition to the APE and RT domains, the ORF2 CC protein contains the RNase H domain. CC [2] Renamed. XX FH Key Location/Qualifiers FT CDS 2..4351 FT /product="Ingi-1_AC_1p" FT /note="ORF2: PHD finger, AP endonuclease, RT, RNase FT H." FT /translation="GANDEPARWCSRQLLPKDVSLPTDKPRRQLQNASTHL FT EHQDLNNPPGFFLGGQHNPAWLARLLILAGDVEQNPGPRWPCGVCGDSVPA FT KAVSARCQECLLWIHLQCSDLTRQQMLSLPSTKKGRRITAEWTCPSCTIGA FT AQHPLRAATQPVYTPPTQLDPNRPMTRQRHVRQLRRKNRCTDSNARNHERT FT RCMKKENNRALNILQWNIRGLRSAKIELLRHIDTNHPDIICLQETWLSRNI FT DICVPGYSIERSDRDDGNNGGGVATLIKDGITYHRFHDGVVANPDSTTEFV FT GTLLYLPGRTLKVVNLYIPPRHKGNFVPHVLKTDANTIIVGDFNCHHPSWD FT PYVDQCATGTEIDDWALDNDMIALNDGCETRIDPSSMLKSVPDISFCHVSQ FT VAYVSWKTSDTLVSDHKAIEIKLCRHSAPTRRSKAKWSFKKADWSLFNAHM FT ETETLANDMPNMTDATKLNTCLTEAILKAAKQSIPKGARKLPKPFWNEEAD FT KLIEEKVQLRKKAMATKTREDIGNYQEASKRTTSRLNEMKRDCWRNFTSEM FT SLSTDPQQVFRVINAISGKKTPSTGSALTENNKTLKGDRSKAEAFCREYAS FT VSRLKLDKADKHRHYQLNEKLKLLPIDQNTGYDSLLTMAELTVCIGKLKNG FT KSPGLDGVTNEMIKHLGPIANSALLRLFNLSWEMAIVPNSWRRAKIIPIPK FT KNKPAGKIGSYRPISLTSCLAKLMETLVKNRLMYFLESNNLLSNNQSGFRK FT LRSTEDQVLCLTQHISDGFQARPKMKRTVLALVDFSRAFDKVWRTRLMEKM FT IEKSTPSRITQWIGSFLRDRYACVEHGNATSRLRRFEQGVPQGSVLSPTLF FT LVFIDDIEAGLPETVKTGLYADDYTIYATNEDINQAENSVQAAINLLEQWA FT KQNKMEISTEKTECTLFTTSTHEAKRKPALSLKGAPITFNSNPTLLGVTLD FT RTLSSNAHIQNLELKLSSRLRPLKALTGTTWGASRECIKPIYYATCRSAID FT YCAPAYMPLAKPSNLQKLERKQNEAARIITGCSRDTRVDSLLIEADILPLR FT HRADALTAISFEKSMRLPTTNQRKASATKSVPTRTGKSNWRKHAQNLVAET FT ELDTYPREELVVLPPTQPWKAASYNVTFSTDLVADCKRANEPAVRKAAAEA FT TIRSLPLPDVEVWTDGSAEHGTEYGGSGILIKTGDTELDQRVPAGRYCSSY FT RAELVALDTALDMLVALHTAGSLPNSARIHVYTDSRSAVMRLSYGPTNQSE FT KVACSIWQHLHTLCQRSECSIHLQWIPGHAGLAGNEEVDSIAKEATALTQI FT DTPIDFSTAKAVIHRSTRAKWKREAKPKLPFAQPPSFQLEASLRRQDRRLL FT SQMRTGGKSPKLRSYLHKIAPADNPDPSCRACKEEDETMDHVFLRCPASYR FT ARTRLLGHFSNPLSALFKDPRASVEFLRAVGIADRL" XX SQ Sequence 4383 BP; 1246 A; 1285 C; 1001 G; 851 T; 0 other; gggagcgaac gacgagcctg caagatggtg ctcccgccag ctgttgccga aggatgtcag 60 tctaccaacc gacaaaccga ggcgacagct gcagaacgcg tcgacccacc tggaacacca 120 ggacctcaac aacccgcccg ggtttttcct tggcggccag cataacccgg cctggctggc 180 ccgcctgctg atcttggccg gggacgtgga gcagaatccg ggaccccgtt ggccctgtgg 240 agtatgcggc gacagtgtgc ctgcaaaagc agtatcggcc cgatgtcagg agtgtcttct 300 gtggatccac ctccaatgct ctgacctcac ccgacaacag atgctaagcc tgccatcaac 360 gaagaaggga agacgcatta ccgcagaatg gacgtgcccc tcctgtacga ttggagcggc 420 gcaacatccg ctacgagcag cgacacaacc cgtctacaca cctccaacac aactcgaccc 480 aaaccgaccc atgactcgac aacggcacgt cagacaattg cgccgtaaaa accgttgcac 540 ggactctaat gccaggaacc atgagcgcac ccgctgtatg aagaaggaga acaatcgagc 600 cctaaacatc ctccaatgga atatcagagg cctgcgcagt gccaaaattg agctccttcg 660 gcatatcgac accaaccacc cagacattat ctgccttcaa gagacatggt taagccgaaa 720 tatagacatt tgtgtgcctg gttactctat tgaacgaagt gacagagatg acggaaacaa 780 tggaggcggg gtcgcgacat tgattaaaga tggtattacc taccaccgat tccatgatgg 840 agtagttgcc aaccccgact ccacaacgga attcgtaggc acactcttgt acttacccgg 900 cagaaccctt aaagtcgtca acctgtacat cccaccacgc cacaaaggta acttcgttcc 960 ccacgtactc aagactgatg caaacaccat catcgtaggc gatttcaact gccatcaccc 1020 ttcgtgggac ccttacgtcg accagtgtgc caccggaaca gaaattgatg actgggccct 1080 cgacaatgat atgattgccc tcaatgacgg ctgcgaaaca agaatcgacc cgagcagtat 1140 gctcaaatca gtccctgata tttccttctg tcatgtttca caggttgctt atgtctcttg 1200 gaaaaccagt gacaccctgg tctctgacca caaagccatc gaaatcaaac tctgtagaca 1260 ctctgccccc acccgtagga gcaaagccaa atggtccttc aagaaggctg actggtcgtt 1320 gttcaatgcc cacatggaaa cagagacttt agcaaatgac atgcctaaca tgacggacgc 1380 cacaaaatta aacacgtgcc taactgaggc cattttaaaa gcagccaaac aaagtatccc 1440 taagggcgct cgtaaacttc ctaaaccctt ctggaatgag gaagcggaca aactgataga 1500 agaaaaagtg caattgagga aaaaagccat ggcaaccaaa acacgagaag acattggaaa 1560 ctaccaagag gcgtctaaaa gaaccacatc ccgcctaaac gagatgaagc gcgactgttg 1620 gcgcaacttt acctcagaga tgtccctctc gactgaccct caacaagtct tccgagtcat 1680 aaatgccatc tcaggcaaga aaaccccctc caccggcagt gcactcacag aaaacaataa 1740 aaccctaaaa ggagatagaa gcaaggctga agccttttgc cgtgaatacg cctcagtcag 1800 ccgcctaaaa ctcgacaagg cagacaaaca ccggcactac cagcttaatg aaaagcttaa 1860 actcctccct attgaccaga acacaggcta tgattctctc ctcaccatgg ctgaactgac 1920 agtttgtata ggaaagctga aaaacggcaa atctcccggg cttgatggtg taaccaatga 1980 gatgatcaaa cacctcggac ccatcgcaaa ctctgccctg ctccgtcttt tcaacctatc 2040 atgggaaatg gccattgtcc caaacagctg gcgacgtgcc aaaatcatac caatacccaa 2100 gaagaataag cctgcaggaa agattgggtc ctaccgcccc atcagtctca ccagctgcct 2160 tgctaagctg atggagaccc tcgtcaagaa ccgcctgatg tactttctgg agagtaacaa 2220 cttgctgagc aacaaccaat cgggcttccg caagttacgg tctactgaag accaagtcct 2280 gtgcctaaca caacatatct ccgatggatt ccaggctcgt cccaaaatga aacgaactgt 2340 gctggccctc gtggacttct cgcgtgcctt cgacaaggta tggcgcacga ggttaatgga 2400 gaagatgatt gagaaatcta ctcccagccg catcacccaa tggataggtt cgttcctcag 2460 agacagatac gcctgcgtcg agcacggcaa cgcaacaagc aggcttcgcc gtttcgagca 2520 gggagtcccc cagggcagtg tcctgtcccc cacactgttc cttgtcttca tcgatgatat 2580 tgaagcaggc ctgcctgaaa ccgtgaagac gggactctat gctgatgact atacaatata 2640 cgccaccaac gaagacataa atcaggctga aaacagtgtc caagcagcca taaacttact 2700 tgagcagtgg gcaaagcaaa acaagatgga gatcagtacg gagaaaacag agtgcaccct 2760 cttcaccacc agcacccacg aggccaagcg aaaaccagca ctctcactca aaggcgcccc 2820 catcaccttc aactccaacc ccaccttact tggggttaca cttgatcgta ctctatcctc 2880 caacgcccac atccaaaacc ttgagttaaa actgtcaagc agacttcgac cattgaaagc 2940 ccttacaggc accacctggg gtgcctccag ggaatgcata aagcctatat attatgctac 3000 ttgtcggtca gctatcgact attgtgcccc tgcctacatg cctcttgcca aaccctccaa 3060 tctgcagaaa ctggaaagaa agcagaatga agcagcaaga atcatcacag gctgctcaag 3120 ggatactcgt gtggacagcc tcctcatcga agcagacatt ttaccactaa ggcatagagc 3180 tgatgccttg actgccatct cattcgagaa gtccatgcgt ctgcctacaa cgaaccaacg 3240 caaagcctcc gcaacaaaat cagtgcccac gcggacaggc aagtcgaact ggagaaagca 3300 tgcccagaac ctcgtagccg aaactgaact ggatacgtac ccacgtgagg agctggttgt 3360 tttacccccc acacaaccgt ggaaggctgc atcatacaac gtgactttta gcaccgatct 3420 tgtcgccgac tgcaaacgtg ctaatgaacc agctgtcagg aaagcagcag cagaggctac 3480 aattcgttcg ctccccctgc ccgatgtaga agtctggact gacggttcag ctgagcacgg 3540 tactgagtat ggaggaagtg gaatactgat caagaccggt gataccgaac tcgaccaacg 3600 tgtacctgcg ggccggtatt gttccagcta cagggccgag cttgtagctc tcgacacagc 3660 gcttgacatg cttgtggcac tacatacagc aggcagcctt ccgaacagtg cacgcatcca 3720 tgtatacact gattctagat cagcagtcat gcgcctctca tacggcccta ccaaccaatc 3780 ggaaaaggtg gcctgctcaa tctggcaaca tctgcacacc ctctgtcaac gctctgaatg 3840 ttccatccac ttacagtgga ttccaggcca cgccggactg gctgggaacg aggaagttga 3900 cagcatcgcc aaagaagcaa cagcgcttac ccagattgac acgccaatcg atttctcgac 3960 agctaaagcc gtcatccacc gatcgacccg cgcaaagtgg aaaagagagg cgaaacccaa 4020 actgccgttt gcccagccgc cgtcattcca gcttgaggcc tcgctcagac gacaggaccg 4080 ccggcttctc tcccaaatga gaactggtgg aaaatcgccc aagctgagat catatctcca 4140 caaaattgcg ccagcagaca acccagatcc gtcctgtcga gcctgcaagg aggaggacga 4200 aactatggac catgtgttcc tgaggtgtcc cgccagctat cgtgctagaa cccgtctgct 4260 gggacatttc agtaacccac tttcagcact ttttaaggac ccccgggcga gtgttgagtt 4320 cctgcgtgct gtcggcatcg ctgaccgctt atgaagccac agcgccagaa gagagagaga 4380 gag 4383 // ID BEL-115_AA-I repbase; DNA; INV; 6655 BP. XX AC supercont1.79; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-115_AA_; KW BEL-115_AA-LTR; BEL-115_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6655 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.79; Positions 1250122 1256776. XX CC 'CTTAA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1272..2861 FT /product="BEL-115_AA-I_2p" FT /translation="MSLASYKVLVAKLSDVQQTFNDIWSYVDDLPEDAPIS FT QMTVRLERLDDLWEKFSEYLIEIKTHAVHDIEKNPYTKERKDFSDRYYHAK FT AFLLDAIKLKQESVELNTSARFLNTSGGGNLDHVRLPQIKLQTFSGNIEEW FT LSFRDLFSSVIHCKADLPEVEKLYYLKGCLQGEPKNLIDSLQITSANYKVA FT WDLLQKRYDNSKHLKKLQVQALFKLPSLTKESSSDLHTLVDGFERIVQTLD FT NVVLPGDYKDLLLVNILSARLDPVTRRGWEEFSSTKQQDTVVELTEYLQRR FT IRMLESLPGKQSESKNSAQFHQPGKFKTSSVKTSFNTVQTSGASCVACSAN FT HQLFQCSVFQRMPQLEREAVLRNHSLCRNCFKSGHHAKDCQSKFSCRYCKK FT RHHSLVCFRPDKDNSSKSVVAPKPTRSNNSRDEQDSTETATPGSSTTGNST FT SQVSNMAATDAVVAHGSAQYSKVLLATAIVIVEDNSGHRYPARALLDSGSE FT SNFVTERLSQRLRVTRGKIDGMDRPWNRPSKDQR" FT CDS 3232..6045 FT /product="BEL-115_AA-I_1p" FT /translation="MARFWSCEEVGSAENYSPREARCEQHFVHTVQRGEDG FT RYTIALPTDEDVLERIGESREIAVRRLQATERRLAKNADLRTQYCAFMAEY FT FQLGHMSKVENVPGSVKRCYLPHHPVVKEASTTTKVRVVFNASSETSTRVT FT LNDALLVGPIIQDDLRSIIMRSRTKQVMLVADVEKMFRQILVREEDRPLQS FT ILWRSSPEEEISTYELNTVTYGTKPAPFLATRVLKQLSDDEEGRFPLAAKA FT VREDTYMDDVITGSNDVDEAVQLRVQLQELMNAGGFRLRKWASNYPEALDG FT VGTEDLAIPITEISLDPDPAVKTLGLTWMPRTDTLRFQFTIPALEDVEPLT FT KRKVLSTIATLFDPLGLIGAAITSFKVFMQQLWTLESEDGRRLDWNQPLPH FT TVGETWRNYHRQLPILNEIRINRCVMVPQTERIDLHCFSDASEKAYGGCVY FT VRSEDSSGNVLVRLVASKSRVVPLSCQTIPRLELCGALLTAELFEKVNQAI FT RLPADVHFWTDSTCVLDWLVATPSTWSPYVGNRCAKIQRLTEGWQWRHVPG FT VQNPADLISRGITPQNISGNRFWWHGPDWLLGSPEGWPHSTVSSNAEEVEK FT EKRRTVTVVTATTAPEFVEEYVAKFSSYSEMIRTAAIWQRLMKLLRKPKAE FT RRGGFLSSDELKDAEYTILRQVQKVALADEWRALSEGVPVSLKSSLRWFHP FT YISADGLIRLGGRLRHSAESEDFKHPIVLPKGHRLTRLLVEQYHGRLLHAG FT PQLLLSTIRLKYWPLDGRSIVRQVVHRCHKCFRAKPSPVKQFMGELPAERV FT TISRPFSRTGVDFFGPVYVRPGPRRNAVKAYGAIFICLATKAVHIELVSDL FT STDRFIQALRRFVARRGKCADIFSDNGTNFVGARNRLQELLRLLKDVGVGT FT LALQVVPISVAFGRQRFVPRSTTCRESSETHQYP" XX SQ Sequence 6655 BP; 1703 A; 1596 C; 1733 G; 1623 T; 0 other; catggtcctt cgaaccggat agtgctgaac tggagtggtc acggatttga gcgtgtagga 60 tttccgccat cgcggtttca cgttacgacg ctttgttacc gcgctgtaat agcgaatcgc 120 catcctttcc gtatatgaaa aggagtttcg tgacaaaaga agaataggct cttggcgcgt 180 acccaagcca agccaagcct ggacgccgcc ctgttggttt cctgtactaa agattcggga 240 cggtgactgc tagcatcgga cttggaaggc aacactaagt ttgctccgcc tgtgggtttt 300 ccaggtaatt aattagacac ttagtgtatg tcttgtcgga gctaggcatt tccctttgct 360 aacaccggtt ttattcattc acactgtcat cgctattacg gagccatcat cactctacgg 420 ggatcgtcgt tccgcttaac cagataggaa gctctggagt tgtggatact tggtatacga 480 ttatcagccc ggattggaat cggcgctggt gctgatcgat tcatcggatt actactcatc 540 gcctgcacga ccgacgggtg gactcaagca aggaggcagc aacgtggagc gttatccagg 600 ttagtgaaca tttagtgtat gtccgccgaa gcgtgggcgt ttggatactc cacaatttta 660 aataaaattt atcctccttg gcttgcgaca tctacaccca cacatccctg ctatatacta 720 cggattattg aggctatata aactataatt gaacgttttt tggatacgtt cggcatcatt 780 tggcacgggc attggtacac cgttgttttc tccgggtgag ttgcacggta tatgtcccgt 840 cgaagcaatt ttttggatac tatagttaat ttccctccct ttcacgcgac ttgatgaaat 900 tgaacaagat ttgatccaca tctggctacg ctttgctacc aaagttggac gattttaccc 960 tattgctgca cccgtgacgc tgttcacttg gttaggtgag tcgaacagta tatgtgcagc 1020 cgaagttaaa tttttggata ctaccgccac gattctctcc tctattcaca ccaatccacg 1080 agattccgaa caatcgacgt tgttgtcaca acattgtgtg gccgaatacc gccgactgcg 1140 tagggtgtcc tgtatcactc tcataggtta gttagagcac gatatatgtc ctagcgaagc 1200 taggcaatta cgcagtatac tacaccatca ttttcatccg ttcaacaacc gattgatttc 1260 gccgagtcat catgtcgctg gcatcgtaca aggtcctggt ggcgaagctg agcgatgttc 1320 aacaaacctt caacgatatt tggagctacg ttgacgatct gccggaggat gcgcccatca 1380 gccaaatgac agttcgtttg gaaagactgg acgacctgtg ggagaagttt tccgagtatc 1440 tcatcgaaat caaaactcat gccgtccacg acatcgagaa gaatccgtac acaaaggaga 1500 ggaaggactt cagcgatcgc tactaccatg cgaaagcgtt tctactggat gcgatcaagc 1560 tgaagcagga atcagtggag ttgaacacgt cggctaggtt cctaaatacg tccggtggtg 1620 ggaatcttga tcatgtcagg ttgccacaga tcaagcttca aacttttagt ggaaacattg 1680 aggagtggct cagtttccgc gaccttttct cctcggtcat ccactgcaag gcggatctcc 1740 ctgaggtaga aaagctctac tacctcaagg gatgcctcca aggagaaccg aaaaatttga 1800 tcgactctct ccaaatcact agcgccaatt acaaggtcgc ttgggatctt ctacagaaga 1860 gatacgacaa cagtaaacac ttgaaaaagc tccaggtgca agcacttttc aagcttccta 1920 gcttgaccaa agaatcctca tcggatctgc acactctcgt ggatggtttc gagaggatcg 1980 tgcagacctt agataacgtc gtcctgcctg gtgattacaa ggacctattg ttggtcaaca 2040 ttctctctgc gcgattggat cctgtgacac gtagaggatg ggaggagttt tcttctacaa 2100 aacaacagga tacggttgtt gaactcaccg aatatctcca gcgacgcatc cgtatgctgg 2160 aatcgctgcc agggaagcaa tcggagtcca agaactcagc tcagttccac caacccggga 2220 agttcaagac atcttcggtc aagacgagct tcaacaccgt ccaaacatcc ggagcaagtt 2280 gtgttgcctg cagtgccaat caccagctgt ttcagtgcag tgtattccaa aggatgccgc 2340 aattggaaag ggaggcagtt ctcagaaatc attctctctg tagaaactgt ttcaagtccg 2400 gtcaccacgc taaggattgc cagtcgaagt tctcttgccg gtattgcaag aagcgtcatc 2460 attcgctggt gtgcttcagg cctgacaagg acaattcttc gaagtcggta gtggcaccga 2520 agccgacgag gtccaacaat tcgagggatg aacaggattc tacggagacg gctaccccag 2580 gttccagcac tacgggtaat tctacatcgc aagtatccaa catggcggcc accgatgccg 2640 tggtggctca cggatcagcc cagtattcca aggtactgct ggcgacggct atcgtgattg 2700 tcgaggacaa cagtggacat cgctatccag cacgcgctct gttggactca gggtcagaga 2760 gcaatttcgt tacggagagg ctcagccagc ggttaagggt aacccgcggg aagatagatg 2820 gaatggatcg gccatggaat cggccaagca aggaccaacg ttaggcagca gatcgaaacc 2880 accattcgat cacgtatttc ggattactcc cgacggatga actttctcgt tctaccgaag 2940 gtgacagtca atctacctac aactacgatc aacgtttcaa gctggaggct accagatggc 3000 atcgaattgg ctgatccatc gttcaatgtg tccgtggcga tagatatggt tctgggaatc 3060 gaatcattct tcgatttctt caggaatgga cggcagattt cgttaggaga gcgactgcca 3120 gcactcaacg aatctgtatt cggttgggtg gtttgcggtg gagtatcggt tcccaatcag 3180 tcggtgaaca ttagctgcaa cgtgtcggct tgagaaggtc tagaggagtt gatggcccga 3240 ttttggtcct gcgaggaggt cgggtcggct gaaaactatt cgccacgaga agcgcgttgc 3300 gagcagcact ttgttcatac cgtacaaagg ggggaggatg gccgttacac tatcgctcta 3360 ccgactgacg aagacgtttt ggagcggatt ggtgaatcaa gagagatcgc agttcgacgt 3420 cttcaggcta cggagcgtag attggcgaaa aacgcagatc ttcgtacgca atactgcgcc 3480 ttcatggcgg agtactttca gcttgggcat atgagcaagg tcgagaatgt cccaggttcg 3540 gtgaaacgct gctatctgcc acaccatcca gtagtgaagg aagccagtac taccaccaag 3600 gttcgcgttg tattcaacgc ttcgagtgaa acgtccacaa gggtgacgtt gaacgatgcg 3660 ctactggtcg gaccgatcat tcaagatgat ctgcgatcaa tcatcatgcg gagccgaacg 3720 aaacaggtga tgcttgtcgc ggacgttgag aaaatgttcc gccaaatcct ggttcgtgaa 3780 gaagatagac cgctgcaatc gattctgtgg aggtcatcac cggaggagga aatcagcacg 3840 tacgagctga acaccgtcac ctacggaact aaaccggcac cctttttggc gactagggtc 3900 ctgaagcagc tctcggatga tgaggaagga agatttccgc tggcggctaa ggcagtccgg 3960 gaagatacat acatggacga tgtgattact ggttcgaacg acgtggatga agctgtgcag 4020 ttgagggtac agctgcagga acttatgaat gccggaggat tcaggcttcg taaatgggct 4080 tccaattatc cagaggcgtt ggatggagtt ggaacggagg atttagccat tcctatcacg 4140 gagatcagtc tagacccaga cccggcggta aaaactcttg gattgacttg gatgccgaga 4200 accgacactc tgcgattcca attcacgatt ccagcattag aagacgtaga acccttgacc 4260 aagcgtaagg ttctatcgac catcgccact cttttcgacc cactaggcct cataggagct 4320 gcaatcacat cattcaaggt gtttatgcag caattgtgga ccctagaaag cgaggatgga 4380 aggcgattgg attggaatca accattacct cacacggtgg gtgagacatg gcgaaattat 4440 caccgacaac taccgattct caacgagatc aggatcaaca ggtgcgttat ggttccacag 4500 actgaacgaa ttgaccttca ctgcttttcg gacgcctcgg agaaggcgta cggtggatgc 4560 gtttacgtac ggagtgagga ttcaagcgga aacgttctgg ttcgtctggt cgcgtctaaa 4620 tcaagagttg tacccttgag ttgccagaca attcccagat tggagctatg tggagcactt 4680 ttaactgcgg agctgttcga aaaggtcaac caagcaatca gactaccagc agatgtacat 4740 ttctggacgg attccacgtg cgtcttagac tggttagtag ccactccatc tacttggagt 4800 ccgtacgtcg gaaatagatg cgcgaaaatt caacggttga cggagggttg gcaatggcgg 4860 catgttccag gagttcagaa tcccgctgat ctgatctcac gaggaattac accgcagaat 4920 atctcaggga atcgtttctg gtggcacggg ccggactggc tgttaggaag tccagaagga 4980 tggccacact caacagtttc atcgaacgcc gaggaggtgg aaaaggaaaa gcgccgaact 5040 gtaacagtag tcacagcgac aaccgctccg gaatttgtag aggagtatgt tgcgaagttt 5100 tcatcgtatt cggagatgat tcggactgct gcaatctggc aacgattgat gaagctcctc 5160 aggaaaccca aggcggagag aagaggtgga tttctctcat cggatgaact taaagatgcg 5220 gaatatacca tccttcggca agttcagaag gtagccctcg ccgatgaatg gcgggccttg 5280 tcagaaggag taccagtgtc actaaaatcg tcactacgct ggttccatcc ttacatttcc 5340 gcagatggtt taattcgtct cggtggccga ttgagacatt cggcagaatc tgaggacttc 5400 aaacacccaa ttgttcttcc aaaggggcat cggctgactc ggcttctcgt tgagcaatac 5460 catggaagac tacttcatgc tggacctcag ctgctattga gcaccattcg gctgaaatac 5520 tggccattag atggaagaag catcgtccgc caggttgttc atcgttgcca caagtgcttc 5580 agagccaaac cgtctccagt gaagcagttc atgggtgagc tgccggctga gcgagtcacg 5640 atatcgcgcc cattttcccg gacaggcgta gattttttcg gaccagttta cgttaggcca 5700 ggtcctagac gtaacgcggt gaaagcgtac ggagccattt ttatatgtct tgcgacaaag 5760 gctgtccaca tagagctggt ctccgactta tcgacggatc ggttcatcca ggcgttacga 5820 cggttcgtag caaggcgagg aaagtgtgcc gatatatttt cggacaacgg caccaacttt 5880 gtgggtgccc ggaaccggtt gcaagagctg ctgcgtttgt tgaaggacgt tggcgttggc 5940 actttagccc tccaagtggt ccccatttcg gtggcctttg ggaggcagcg gttcgttcca 6000 cgaagtacca cctgcagaga gtcatcggag acacaccaat atccatagag gacatggtta 6060 cgctgttggt ccaggtggaa ggatgcctca actctagacc aatcattcca ctctcaaccg 6120 atcctagtga tttatagcct ctcacgccgg gccattttct aatcggttcg tcaatacagg 6180 ctctcccaga aaccgacgta accgatgttc agttgaaccg actaaaccaa atagagctgg 6240 tgcaacggaa ggtccaggat ttttggaagc gttggagaca ggaatacctg agccaacttc 6300 aaggacgaaa caaacgctgg cgaccgccag ttagtattga agttggaaaa ctggtagtaa 6360 tctgcgacga caacctgccg cccatgcgtt ggaagttggg cagaatcgag gaaacgcacc 6420 ccggagctga tggcgtagta cgcgttgtta cgttgaggac cgcttccgga aatctaaaac 6480 gaccggtgga gaaaatctgc ctcttgccgg aaccaatcct cgagcacgaa ttctactcca 6540 aatcccaaaa ctaaaatcca tttcccaatc caaaatccat ctcccgtcct agccgaagag 6600 gagttttttt tttctttctt ttcagaaatt catggaattt cagggtgggt gagaa 6655 // ID hAT1_DYa repbase; DNA; INV; 2455 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version 2) XX DE hAT-type sequence: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2455 RA Jurka J.; RT "DNA transposon families from fruit fly."; RL Repbase Reports 9(5), 938-938 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(771..1103,1373..1666,1662..1943) FT /product="hAT1_DYa_1p" FT /translation="MIGDTGLKEFATFLIGVGATYGANVDVDNLLPHSTTI FT SRNVASLYEERFGPVKAEIGKFKRFGYAVTSDVWTDNHLRLSYLSCTIHYI FT KEGILVNRLLAVKCMKGESCTGNIVKFFKKSGSNFFLNETLKSLCPTRWNT FT VFFLLKSIEGNWIEISNTLKEKNQTARIESIHINELTGITRVLETFTDLSK FT KFESSQFLLFISQSHILTNQIKKSLSTTIVPTNNTNNSSLFLYFYFHQQIS FT CFNLLKKKKMLFRIARHSWPEMYPFDSKYVVTTVHIDQEINMYINTNTTYF FT DGFNCLAERA*" XX SQ Sequence 2455 BP; 839 A; 423 C; 442 G; 751 T; 0 other; tagagggctg cataacacca ttctctttgt ttgcatttag agtactcatt cgttttagta 60 tgtattcaga atacatacca ttcagttgtc gcgatctctt tgcgttcact tcgtatgcgt 120 tcggagtttt gggcacccga gcgaaatgaa gtcaatgaat gcgcgcgaaa actaaatgaa 180 gccaatgaat tttcaataga aattcagagt actctgaata ctttatgttg tcattgtgta 240 aaatgagttc ggacagcgcg gaggtaagtc taagttttct gtgtttgtgt attattatgt 300 gcgtgtgcgt cttatttaat aatattgtag catttgtcta atacgaatat ataaattgtg 360 ttgtcaaata taaatattta aaaaagaata gtaaaaaaaa aataataaaa tcgtttatat 420 agctaacgtc aggcgacgca tcgtttatcg aggagtcggc gactgtaata aaacagaaat 480 tggagaacgg tgtatattcg ttaaagggga aaagaggtcg tagcaaagtc tgggattatt 540 ttttacaagt aaatgatgaa gatggaaagc agataaaaga ttttgtggcc tgcaagatct 600 gtaaaagtgt ttataaattt tccggcaaca cctcgaattt agttaaacat aagtgctata 660 ttataagctg tcgaaataac catgtcctaa ccaatgttaa ttctgaaaca aaaaaggaag 720 ccaccgcaat tgctacgaag tggatagtaa ggaattgtcg gccatttaaa atgattggcg 780 atactggcct caaagaattt gccacgttct tgatcggcgt gggagcaacg tacggtgcta 840 acgtagatgt cgacaatctt ttgccccatt cgacaacaat ttcaaggaac gtggcatcgt 900 tatatgaaga gcgctttggt ccagttaaag ctgaaattgg aaagttcaag cggtttggct 960 atgcagtcac cagtgacgta tggacagaca atcatttaag attgtcgtat ctgtcctgta 1020 ccattcatta tataaaagag ggcattttgg taaaccgact tttagcagta aagtgtatga 1080 aaggagaatc ctgcacaggt aattgaattt aactataaat tgaatgacaa aatgattaat 1140 gctagttatt taattatttt taggtgtaaa tattcgtacc aaactcatgg atattttaca 1200 aaattttggt tgcaaccttg atgaagacca gcctgtgatt gttacggata gaggttctaa 1260 catgattgcg gctttccaaa aatacgatca tattcactgc gtcaatcatt tattgcacaa 1320 cattgtagag gcaacggtaa aacaaattac agaattctct aaactgcagt aaattgtcaa 1380 gttttttaag aagtcaggct caaatttttt tttaaatgag acgttgaaga gcttgtgccc 1440 aacacgatgg aacacagttt tctttctatt gaaatctata gagggaaatt ggattgagat 1500 ttcgaacact ttaaaagaaa aaaaccaaac tgctagaatt gaaagcattc atataaacga 1560 gcttacaggt atcacacgag ttcttgaaac atttaccgac ttatccaaaa aatttgaaag 1620 cagccaattc ctactattca tttcacaatc ccatatatta acaaattaaa aaaagcctgt 1680 caaccacaat tgtcccaaca aataacacaa acaattcttc cctatttctt tatttctatt 1740 tccaccaaca aataagttgc ttcaatttgc tgaagaagaa aaaaatgtta ttcagaattg 1800 caaggcactc ttggcccgaa atgtacccat ttgactcgaa atacgtagtg acaacagtgc 1860 atatcgatca agaaattaat atgtacatta atacaaacac tacatatttt gacggattta 1920 attgtctggc ggaacgagca taaaaaatgt tcccaaactt gtataaagcc agctgcaaaa 1980 ttttttgtat accagcaact agcgctgcct ctgaaagagc attctcagat gctcgcaatt 2040 taataacaga taaacgttct gccatttcgc ttaactcgga aaatattaat aaaattatgt 2100 ttttacacaa taacattgaa aattaaagaa aaaatatata tatatttata agataagaat 2160 ataatgtatt atcagttaag taataataaa caataatatt aaaacacgaa tttagcatct 2220 tttattgaaa ctcacacaca cacacacata cgcagacaag acattcgaat cacccattca 2280 ctttactcgg tcttcacttc attcgtttca gaatgtgccg tttgaattga atgaattgaa 2340 gtcggcgcgc gttgccgact tcacttcgtt catttgcaaa cgcgccgaat tcacttcgtt 2400 cgtttcaaca cgaatgtctc atttgccggt ttgaatggtg ttatgcagcc ctcta 2455 // ID Gypsy-30_AA-LTR repbase; DNA; INV; 198 BP. XX AC supercont1.271; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_AA_; KW Gypsy-30_AA-I; Gypsy-30_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-198 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.271; Positions 650152 649955. XX SQ Sequence 198 BP; 39 A; 43 C; 46 G; 70 T; 0 other; tgtggtagac gggtctattt atactgaacc ctattcggta tctatccatt gagaatttgc 60 accaacacta ccgtatgctc tgcattgggt cagtctgtga ttttgcgctg ctcggtgtga 120 tcggcttctc gcgtctctgc ggctttgagc aagtaatact gtttgttcgc gaagtgctca 180 ttttaattga ttactaca 198 // ID Copia-131_AA-LTR repbase; DNA; INV; 487 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-131_AA_; KW Ty1_copia_Ele140; Copia-131_AA-I; Copia-131_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-487 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 487 BP; 140 A; 88 C; 131 G; 128 T; 0 other; tggaatatta atttttcatt attttgttgt acggtgtaaa ttgtcggtat aatgaaagtc 60 ggtgtaattg gcaacacccc ccgaaacata aatagagagc gagagcaaga gcgcagctgt 120 cataataaag aaaacgagag cgaacagcag gtcgcgtgcg cgcgtaggta ggtagcaagt 180 tagacatcgc tagtgtacaa tgttcagtga agagttaggt cgggtgggtg caccaaccga 240 aatcattctc aatcggatgt gatcggaccc cgatggaagt tcactaataa agattgcgtt 300 aggcaaattt atttgagact tcgaatattt ttgacaccga tccgaaatcc gaaaagccgg 360 ttgaagttga aggaagccct gcggaagcgg attagttctg gtgagtatat aagtgaactt 420 aggaagtggt gacgacccct tctggttgct cgctttcttg gttcaaggtt tgctggctct 480 gccaaca 487 // ID Copia-120_AA-LTR repbase; DNA; INV; 185 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-120_AA_; KW Ty1_copia_Ele186; Copia-120_AA-I; Copia-120_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-185 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 185 BP; 54 A; 42 C; 30 G; 59 T; 0 other; tgttgaagca atcgaagcaa tcattgatat attcaacatc attgcgttgc tatgtaactt 60 ccatagcaac cacccttgta gatacgagcc tgaataaatt ttcattagta gttcgaccat 120 taactagaac acgtgtcaag ttattttctc ccggaaaagt cttttccctg cggcttgaat 180 ctcca 185 // ID Gypsy-619_AA-LTR repbase; DNA; INV; 592 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-619_AA_; KW Ty3_gypsy_Ele152; Gypsy-619_AA-I; Gypsy-619_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-592 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 592 BP; 202 A; 144 C; 106 G; 140 T; 0 other; tgtcgtgggt tgagaaccac tgatcaccga aacccacaat cataaacaaa ctcaaaggcc 60 atccgcctga aaccgaccaa tgcactcgaa aagtgcaaga tattgaaatt acaaccatat 120 gcgacaaccg cacctcaagt gccagaatta tttccattcc tttttccctc ctccgcgtga 180 gccctacacg gacgggcaaa cgaaatcgaa aggtttcaaa ttgcatcgcc ctggccggac 240 caccgagata cgaattgccg aaatagcaag atgcacattt ttgggcacgg accacctagg 300 catacaaaat gccacatttg caaagatgaa taatttttaa gtatggactc cagttgtctc 360 attctcacca atagaaaata gattaagcag ttaaacattg taagcttttg taacaatata 420 taaacccaga aacgacgaaa taataaacag attcttacga accagaaact gggaaaggag 480 tctcgacttt tattgagtcc ttactggtga tagctctcga gctcaccata gtttggatac 540 tctacaaaga aaagaccttc tccaatattt ctaaataccg taggatgcca ca 592 // ID I-4_CQ repbase; DNA; INV; 5624 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE An I non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5624 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 110-110 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >99% CC identity. It is phylogenetically close to Loner elements in CC Anopheles gambiae. XX FH Key Location/Qualifiers FT CDS 407..1672 FT /product="I-4_CQ_1p" FT /translation="MSGGEGGMESDGEDDEASNVRTKIYPNGSTGPFIVFF FT RPKLKPLNLISITRDLTRKFSGVSEIKRVHANKIRVVVNNISHANEIVTCE FT LFTLEYRVYIPSRIVECDGVVTEEGLTLDELYECRGYFRNPAVNPVKIIEV FT KQLFSSSTQDGKTVYSPSNSFRVTFEGSALPKYIEIDKARLPVRLFVPKVM FT NCQKCKQLGHTTAYCCNKARCIKCGEEHDDSSCTQAATKCLYCDEDALHKL FT SDCPTYKQRQEKLKLSLKQRSNRTFAEMLKQATEPLNSGNIYNILPSDETV FT ADSINAGASTSGTGNSRKRNNGSPSIRRKEIKLSPQQDRIPNFQPTPPGIN FT PPGFPPLPRPPPLTPKPNPNKPKQGLIGFTVLINQILDALQISTGVRTVVI FT TLIPFVRTFLIKLSEQWPLISTIISFDG" FT CDS 1668..5354 FT /product="I-4_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MDNSTSKMNEEISVLQWNCRSIVPKLDSLKILAHETK FT CEVFALCETWLPPNDDGLNFPNFNIITKNRDDSYGGVLLGIRHGLTFQRLN FT LPSQPGIEVVAIQVQIKNKCFSIASVYIPPKTSVNRQQLKNIVEMMPEPRL FT ILGDFNSHGTGWGELYDDNRANLIYDLCDEFNLTIKNSGEITRIARPPARE FT SRLDLSICSRTLSIDCTWNVIQDPHGSDHLPILISIATGNQPVEPVSYTYD FT LTKNIDWKRYALIITEAIESIDPLTPQEEYTFLANLIHSSAIQAQTKPIPS FT ASSRMRPPSLWWDKECSEVYSEKSNSFKIYRRTGQIESYEQYLLLEIKFQN FT LVKCKKRNYWRTFVDGLSRETSMRTLWTTARRMRNRAPKNASEEYSDRWLH FT NFARKVCPDSTIPKQKRYSNDLVFPELSSAFSMIEFSVALLSCNNTASGMD FT GIKFNLLKNLPSVAKCRLLNLFNIFLEQNIVPEVWRQVRVIAIQKPGKPAT FT DHHSYRPICMLSCVRKLLEKMILFRLDKWMESNGLLSDTQFGFRRGKGTQD FT CLALLSTEIQLAFAKKEQMASIFLDVKGAFDSVCIEVLADKLHKSGLPPLL FT NNFLYNLLSEKHMNFIHGNVTITRSSFMGLPQGSCLSPLLYNFYVNAIDSC FT LDNGCTIRQLADDCVVSVTGQSANHLSEPLQNTLNNLSRWAMELGIEFSTE FT KTEMVVFSRKHNPPSLKLYLLGKLIIQSLVFKYLGIWFDSKGTWACQIRYL FT KQKCQQRINFLRTITGTWWGAHPTDLIRLYQTTIRSVLEYGCFCFQSAAKI FT HMIKLERIQYRCLRIALGCMHSTHTLSLEVLAGVLPLKTRLYQLAHRTLIR FT CEIRNPLVIQNFDLLLDKNPQTRFMTIYHNHITKEISPSNFTPNRSTISST FT HNPSVLFDLSMQQEIKMIPASQLSQLVPHIFLSKYNHIKAENMFYTDGSLI FT EGSTGFGVFNTKVSAFHKLQNPATVYVAEQAAIHYALGIINLQPQDHYYIF FT SDSLSTIEALRSLKSPNSSSFFFHKIKEIMSLLVEKKYKITLVWIPSHCSV FT LGNEKADSLAKQGALEGSTYDRIITYDEYFTIPRQESLVSWQTKWDKSEMG FT RWLYSIRPKVSTTSWFKHMNVERDFIRVISRLMSNHYLLNAHLYRINLKDD FT NLCGCGEGYHDIEHIVWNCPENLHARSQLLDSLRAQGRQSDFPVRDILASQ FT DVPYLLCLYRFLKSIKVHL" XX SQ Sequence 5624 BP; 1736 A; 1314 C; 1091 G; 1483 T; 0 other; cagtcggtat ctgatcgagg tacaagcaag acgtgtttgt tttcgctaga gcaccgaagc 60 ttctaatcgt tcaacggatc aagggatcca ccgccgactt ctcgaaggtt gaccttacga 120 gtgcgtacgt gacacgcgtg acagttcgtg aacgtgaata gtgcgtgtaa gccaattatc 180 gccggtcaag ttcaccatca gcagccgaac cacaaaggca accctccaaa ccgcgaagcg 240 tgttttggag ttttcgaaaa cgtttacaca aggcattgtc taaccgcgag agctagacaa 300 aggagcacca ccgaatacaa tagcaaacgg acggtttgtt tcatcctcat caactcggcg 360 tggcaaaatc tagagttaag tagaaaatca tctcccccct ccccccatgt ctgggggcga 420 aggcggcatg gaaagcgatg gcgaagatga cgaggcttcc aatgtgcgca caaagatcta 480 ccctaatggc tcaacagggc cgtttattgt cttctttcgg cccaaattga aacccctaaa 540 cctgatcagc atcacccgag atctaacgag aaagttttct ggcgtatccg aaataaaacg 600 tgtccatgct aacaagatcc gcgtagttgt gaataacatc tctcacgcca atgaaattgt 660 cacctgcgag cttttcacgc tcgaatatcg agtttacatc ccttcacgca tcgttgagtg 720 tgatggtgtt gtcacagaag agggcttaac cctagatgag ctgtatgagt gccgtggtta 780 cttcagaaac cctgccgtga accccgtgaa gatcattgag gttaaacaac tgttctcctc 840 ctccacacag gatggcaaaa cggtttactc cccttcgaac tctttccgag tgacctttga 900 aggatccgct ctgccaaaat acattgagat agacaaggct cgtctacctg ttcgactttt 960 tgtccccaag gtaatgaact gccaaaaatg caaacaactc ggccatacca cagcttactg 1020 ttgcaacaag gctagatgca tcaaatgcgg tgaagagcac gatgatagta gctgtacgca 1080 agctgctacc aagtgcctct actgcgacga ggacgccctt cacaaactct cggattgtcc 1140 gacgtataag cagcgtcagg agaaacttaa gctttctctg aagcaacgat cgaatcgtac 1200 ttttgcggaa atgctcaaac aagccaccga accactcaat tctggaaaca tctacaatat 1260 actgccttcc gacgagacgg tcgccgactc gatcaatgcg ggcgcgtcaa cgtccgggac 1320 gggtaactcg aggaaaagga acaatggatc accaagcatc cgccgaaaag aaataaagct 1380 atccccacaa caagacagga tccctaattt tcagccaact ccccctggaa tcaacccccc 1440 cggtttcccc ccattgccaa ggcccccacc tctgaccccc aaaccaaatc ctaacaaacc 1500 taagcaagga ttaatcggtt tcacagtgtt gattaaccaa attctcgatg cgctccagat 1560 ttccaccggt gttcgaaccg tggtaatcac tctgatccct ttcgttcgga catttttgat 1620 caaattatct gaacaatggc cccttatttc aacaatcata tccttcgatg gataattcaa 1680 cgtcgaagat gaatgaagaa atttctgttc tccagtggaa ctgcaggagc attgttccaa 1740 aattagattc tcttaaaata ttagctcacg aaactaaatg tgaagtattt gctctctgtg 1800 agacatggct tccacccaac gatgatggtc tgaattttcc caattttaat atcattacca 1860 aaaatagaga cgactcctac ggaggggttt tgttaggcat aagacacggt ttaacattcc 1920 aaagattgaa tcttccttct cagcctggaa ttgaagtagt tgcgattcag gttcaaatta 1980 agaataaatg tttttcaata gcttctgtat atatcccgcc caaaacaagt gttaatcgtc 2040 aacagttaaa aaacatcgtt gaaatgatgc ctgagccaag acttattctc ggcgacttca 2100 attctcatgg gacaggatgg ggtgaattgt acgacgacaa tcgagcaaat cttatatatg 2160 acttatgcga tgaatttaat ctaactatta agaacagtgg tgaaataact cgaattgcta 2220 gacctcctgc aagggaaagt agattggatt tgtcaatttg ctcaagaaca ctctcaatag 2280 attgcacctg gaacgtaatt caagatcccc atggtagcga tcaccttcct attttgattt 2340 caattgcgac aggaaatcaa cctgtagaac cagtcagcta tacatacgat cttacgaaaa 2400 atatagattg gaaaagatat gctctcatta tcaccgaggc gattgaatca atagatcctc 2460 ttacccccca agaagaatac accttccttg caaatctcat ccacagtagc gcgatccaag 2520 ctcaaacaaa accaatacca tcagcttctt cccgaatgcg acctccatct ttatggtggg 2580 acaaggagtg ctcggaagtg tactctgaga aatcaaattc tttcaaaatt tacagacgaa 2640 cgggtcaaat tgagtcttac gaacagtacc tccttttgga gattaagttc caaaatttag 2700 taaaatgtaa aaaacgaaac tattggcgaa cgtttgttga tgggctttca cgcgaaacct 2760 ccatgcgtac tctttggact acagcaagaa gaatgagaaa ccgagctccc aaaaacgcta 2820 gtgaagagta ttctgatcgg tggttgcata attttgccag aaaagtgtgc cccgactcca 2880 cgattcccaa acagaaaagg tattcgaatg atcttgtatt cccggaacta tcatccgcgt 2940 tctcgatgat agaattctcg gtcgctctcc tttcatgcaa taacactgcc tctggaatgg 3000 atggaattaa atttaatctc ctgaaaaatt tgccttccgt tgcaaaatgt cgactattaa 3060 acttattcaa tattttcctt gaacaaaaca tcgtcccaga agtctggaga caagtcagag 3120 ttatagctat tcaaaaaccg ggtaagccgg ccaccgatca ccattcatat aggcccattt 3180 gtatgctatc gtgcgtgcga aagttattgg aaaaaatgat acttttcaga ttggataaat 3240 ggatggaatc aaacggatta ttatcagata ctcagtttgg atttcgtagg ggcaagggaa 3300 cgcaggattg tttagcgctg ctttcaaccg aaattcaact agctttcgct aaaaaagaac 3360 aaatggcttc aattttctta gatgtaaagg gagcatttga ttcagtgtgc atcgaggtgc 3420 tagcagataa actccacaaa agtggactcc cacctttatt gaacaatttt ttgtataact 3480 tactctcgga aaaacacatg aatttcattc atggtaacgt gacaatcaca agatctagct 3540 ttatgggcct tcctcaagga tcatgtttaa gccctctctt gtacaatttc tatgtaaatg 3600 caattgactc ttgcctcgat aacgggtgca caataagaca attggcagat gattgcgttg 3660 tatcagttac tggtcagtcg gccaaccatc tttctgaacc tctgcagaac actttaaaca 3720 atttatctcg ctgggctatg gaattaggaa tcgagttctc aactgagaaa acggaaatgg 3780 tcgtcttctc cagaaagcac aaccccccct cactgaagct gtacctactg ggaaaactta 3840 taatacagtc cctggttttc aaatatctcg gtatttggtt tgactcgaaa ggtacttggg 3900 cttgtcaaat aagatacctg aaacagaaat gccaacagag aataaacttc ctccgaacaa 3960 tcacgggtac gtggtggggc gcacatccca cggacctcat taggctatac caaacgacga 4020 tacgttcagt attggaatat ggatgttttt gctttcaatc cgccgcgaaa atccacatga 4080 tcaaacttga aagaatacag tatcgttgtc tgcgcattgc cttaggatgc atgcactcaa 4140 ctcatacgct gagcctagag gtacttgcag gcgttcttcc gctgaaaacc agattgtatc 4200 agctcgctca cagaacgttg attcgttgtg agattaggaa tccattagtg atccagaact 4260 tcgatcttct tctcgacaaa aatcctcaga ctaggtttat gactatctat cacaaccaca 4320 taaccaagga aatctcacct tcaaacttta ctcccaaccg cagcacaata agcagcacgc 4380 ataacccatc agttttattt gatttatcta tgcaacaaga aatcaagatg ataccagcaa 4440 gtcaactttc gcaattagta ccgcatattt ttttgtctaa atataaccat attaaggcgg 4500 aaaacatgtt ctacacagac ggatcgctaa tcgaagggtc cacaggcttc ggggtattta 4560 atacgaaagt aagtgccttc cacaaacttc aaaatcctgc tacagtatac gtagcagaac 4620 aagctgcaat tcattatgca ctagggatca ttaacctgca gccacaagat cactactaca 4680 tattttctga cagccttagt acaattgagg ctctccggtc gttgaaatca cccaattcct 4740 cgtcgttctt ttttcataaa attaaagaaa tcatgagttt actggtagag aaaaaataca 4800 aaattactct tgtttggatc ccttctcatt gttctgtatt aggaaatgag aaagcggact 4860 cgttggcaaa gcaaggtgcc ttggaaggat ccacttacga tcgtattatc acttatgacg 4920 aatattttac aatccctcgt caagaatctc ttgtaagctg gcaaaccaaa tgggacaaaa 4980 gcgaaatggg tcgatggctt tactctatca ggccaaaagt ttctacaact tcgtggttca 5040 aacacatgaa tgttgaaagg gatttcatac gcgtaatatc aagattaatg tcaaaccact 5100 acctactcaa cgctcactta tatcggatta acttaaaaga tgacaatctc tgcggttgtg 5160 gagagggtta tcacgatatc gaacatattg tttggaactg tccagagaac cttcacgcta 5220 gatctcaact cttagactcc cttagggccc aaggaagaca atcagacttc cctgttcgtg 5280 acattttggc aagtcaagat gtgccatatc ttctctgctt gtaccgcttt ctaaagtcaa 5340 ttaaagtgca cctgtaacag catcaatctc gcaagcatcg ccaccctgca acctagcaat 5400 agtaacatct gataaaaact agaaccttag cccgcacaga agcaaaagtc cgtccttaaa 5460 cataatgtat tattaacctc gaaacagccg cgagtattcg gctttccccc tttactaacc 5520 ctagctttaa gtaattatgt aaaaatgata tccggctccg taaaactttg gtagatgagc 5580 ctaaataaat aaagacagtt ataaaaaaaa aaaaaaaaaa aaaa 5624 // ID hAT-19_HM repbase; DNA; INV; 2269 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-19_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2269 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2008-2008 (2008). XX DR [1] (Consensus) XX CC This is a very young family: individual copies are >99% identical CC to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 316..2124 FT /product="hAT-19_HM_1p" FT /translation="MASASANKRRRAETESGRGGRIFNPSWTNEYFVMEQN FT NSIMCLICFEKIAVCKMYNVNRHYTTKHAATYDKFKGQFRVDKVELLKKNF FT IGQQSIMKSKVISSYSATKISFLIAENIAKSGQPFSTGDLIKNSIKQFCNE FT VCPEKIQFAEDLSLSHQTIARRVEDLSKNIELALKEKLCKCEAYSLALDES FT TDRSDTAQLAIFIRFITSNFEIIEELLDFRHMKGTTKGEDILSEVKKTMIK FT FDLPETKLSGVTTDGASSMKGKNIGFVALFKKSINHNILSYHCIIHQEQLC FT AKVLEMKEVMEIVIQTVNFIRSRGLNHRQFKQLLEDCGSEAEDVIYFCQVR FT WLSRAATLKRFWILIPEIIKFLKIKDKDTSFLENNDWLNDLAFLVDITQML FT MELNIKLQGKDQLISKLYENVETFVLKLKLLKQQLSQKSLIHFKTLSERNT FT NTVDYEKYCNLILKVIDEFDTRFCDFKEEKNELDLFSHPFSIKVETVRDEF FT QMELIELQNNKDLKDAYKDVELLELYKKYMNIEVYPHLCKHAMKYFSLFGS FT TYICEQFFSRMKHVKSEQRHRLKDEHLTDTLRISSSTIKADIDQLCKNKQC FT QVSH*" XX SQ Sequence 2269 BP; 853 A; 311 C; 383 G; 722 T; 0 other; cagggttgtc caacgcgtgg cccgcgggcc acaaagtggc cctcgttgtc ttttgccatg 60 gcccgcgttt tcaattctta acttatcaaa aaccattcaa aacttttaac aaaatcaagt 120 gaatattata tttttaaaac tattttttct gtagctcaga attagggcag ttttttaata 180 taattttaaa aatcagattt tttttaaaat tgcaacacgg ttaaaactac cgccttattt 240 tttttataaa accggttaaa acggtattga ctatttggtt agcggaacca tagttatatt 300 gatttttttt taaatatggc ttctgctagt gctaataaaa gacgtagagc tgaaactgaa 360 agtggaagag gaggtcgtat atttaaccca tcttggacaa atgaatattt tgtaatggag 420 caaaataatt caattatgtg tctcatttgt tttgaaaaaa ttgccgtgtg caaaatgtat 480 aatgttaata gacactatac tacaaaacat gcagcaactt atgataaatt caagggccaa 540 tttcgtgttg ataaggtaga actgctaaag aaaaatttta tcgggcaaca atcaataatg 600 aaaagtaaag taattagctc atacagtgct accaaaataa gttttttaat tgcagaaaat 660 attgcaaaaa gtggtcaacc tttttctact ggagatttaa taaaaaactc tataaaacaa 720 ttttgtaatg aggtatgccc tgaaaaaata caatttgctg aagatctcag tctttcacac 780 cagacaattg caagaagggt tgaagatcta tcaaagaata ttgaattagc attgaaagaa 840 aaactatgta agtgtgaagc atatagtctg gcacttgatg aatcaacaga tagaagtgat 900 acggctcagt tagctatttt tattagattt ataacaagta attttgaaat aattgaagaa 960 ctgttggatt tcaggcacat gaaaggcact actaaagggg aagatattct ttctgaagtc 1020 aaaaaaacaa tgataaagtt tgatttgcca gaaacaaaac tctctggtgt cactacagat 1080 ggagcaagtt caatgaaagg aaaaaatatt ggatttgtgg cattatttaa gaaatccatt 1140 aatcacaaca ttctttcata tcactgtatt atacatcagg aacagttatg tgcaaaagta 1200 ttagaaatga aagaagtcat ggaaattgtt atccaaactg ttaattttat aagaagtcgt 1260 ggccttaatc acagacagtt caaacaattg cttgaggatt gtggaagtga ggcagaagat 1320 gtaatttatt tctgccaagt tagatggctt agtcgagctg caactttgaa aagattttgg 1380 atattaatac ctgaaataat aaagtttcta aaaattaaag ataaagacac aagctttctt 1440 gaaaataatg actggctgaa tgatttggca tttttagttg acattacaca gatgttaatg 1500 gaattaaata tcaagttgca gggtaaagat caacttatta gtaaattata tgaaaatgta 1560 gaaacatttg ttttgaaatt gaaacttctc aaacaacaat taagtcaaaa atcacttatc 1620 cattttaaaa cattgtcaga aagaaatacc aacacagttg actatgaaaa atattgtaat 1680 cttattttaa aagtaattga tgaatttgac acaagatttt gtgattttaa agaagaaaag 1740 aatgagttag acttattttc acatccattt tccattaaag ttgagacagt aagagatgaa 1800 tttcaaatgg aattaataga actacaaaac aataaagatt tgaaagatgc ttacaaagat 1860 gttgaattgt tagaacttta taaaaaatac atgaacattg aagtttatcc acatttgtgc 1920 aaacacgcta tgaaatactt ttcccttttt ggaagcacat acatctgtga acaatttttt 1980 tcaagaatga aacatgttaa atcagagcag agacataggt tgaaagatga acacctcact 2040 gatacacttc ggatttcatc gtccactata aaagctgaca ttgatcaatt gtgtaaaaat 2100 aaacaatgcc aagtttccca ttaaatacat tttttttgac aatattgaaa tggaattagt 2160 tgttttaaat tagaagttta aacaaaatga tttttcttaa tttgtggccc tcgtgttgtc 2220 atcaaaataa aataatggcc caggctgtaa aaaggttgga caaccctga 2269 // ID BEL-86_AA-I repbase; DNA; INV; 6387 BP. XX AC supercont1.279; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-86_AA_; KW BEL-86_AA-LTR; BEL-86_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6387 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.279; Positions 650890 644504. XX CC Positions [5420-5980] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1178..6337 FT /product="BEL-86_AA-I_2p" FT /translation="MDQPEAATVPNRSNHPLQQPLYRLSSAHQPHPSDRVA FT MHNHASFTQRHPMFPSEPRSELPRKNRQDADDTREQRQSNNMETEFTPMVI FT NASQLAARQVMGKELPNFSGNPEDWPIFICSFELSTAACGYTDAENLIRLQ FT RCLKGHALESVRSRLLLPSSVPQVINTLRTLYGRPELLIRTLIEKVRYTPA FT PRHDRLETLVEFGLVVQNLVDHLKAAKQYTHLSNPVLMQELVEKLPGSLRM FT DWAIYRSKQPYATLATFGDFMTGLLEAASQVTFELPNSSRSSRGEQRRTKE FT KGLLHAHSSIPVSSLEPPSSPSPKRSGKPCAACEREGHRVADCHKFKSLSN FT DERWKIVHQGGLCRTCLNGHGKWPCRSWQACAIEGCNEKHHTLLHPPSPIQ FT SSLNVSVNNIMQKNGSSPMFRVLPVVLYAGKRREVVFAFIDEGSSTTFLEK FT TVADRLGVSGPVEPLTLQWTGNITREERKSQRVQLKISGEDSLNIHKLCEV FT RTVNCLVLPTQSMKYGELCHQYPHLRGLPLKDYELIQPKLLIALDNLRLAV FT PLKVREGGVSDPIAVKCRLGWSIYGCVQGSSSPRAVVHFHVAASADSDRQL FT NDQLRDYFALEDAGTSEQMRLLESDEEKRARTILQQTTKRTDRGFETGLLW FT RDDNPDFPDSYPMAIRRLKALERKLSKNNWLHERVREQISEYLRKGYAHKA FT TESELNNADRKRMWFLPLGVVVHPRKPNKIRLVWDAAAMVDGVSFNSKLLK FT GPDLLTPLPAVLSRFRQFPVAICGDIKEMFHQLSIREQDRLAQCFLWRDSP FT TNPVQIYVMDVATFGATCSPASTQYVKNINAEEFSEDYPRAATAIVKNHYV FT DDYLDSFQTVDEAADVIKEVAWVHSLGGFEIRGFRSNSTELLREIGGPSAV FT DEPKNLMLERSATSESVLGMSWDPTNDCFSYCFNLRDDLRSILDVNHIPTK FT REVLKVVMSLFDPLGLVSHFLVHGKVIIQGTWAARTRWDEPINEDLNRKWR FT QWVSLFPKLNELKISRCYFSQFPSSIKNLQVHTFVDASDIAYSCAVYFRLV FT HEDCVQIALVGAKCKVAPLKTLSTPRLELKAAVLGVRFVAAILEYHSLPVS FT QRFFWSDSTTVLAWIRSDHRRFHKFVSVRIGEILTLSEPQEWKWVQSKLNV FT SDDATKWKDGPNLKPESTWFDGPRFLYFDEAAWPEQRITTTTHEELRRVQT FT HWSREPMIDYSRFNEWTRLQRTMAYVVRFINGVRQRNDGLNIQLKVLTQEE FT LAQAEVLLWKTAQKEVFAEERSALEKTMGTPEARHAVVSKSSPIYKTWPYM FT DGQGILRMRGRIEAAGYLPFHARYPVILPKTHPVTTLIVGWFHRLYRHANR FT ETVTNELRQRFEIANMRSIVQKVAKDCSWCRINKAFPKTPVMAPLPKERLE FT PYVRPFTYVGLDYFGPVLVKVGRSQAKRWIALFTCLSVRAVHMEIVHSLST FT ESCIMAVRRFISRRGSPAVIYSDNGTCFQGANKQLTEEIAARNEAMAHTFT FT NADTKWKFIPPATPHMGGVWERLVRSVKTAIGLALEYPRKPNDETLETIIY FT EAEALVNSRPLTYIPLESADQESLTPNHFLLGSSTGNKIETTEVSGYAALR FT SSWKMAQHIVNEFWKRWIKEYLPVITRRSKWFEETKDLMVGDLVLIVSGSG FT RTQWTRGRIEEIFIGSDGRVRQALVRTANGVLRRPACKLAVLDIEGGCEPM FT " XX SQ Sequence 6387 BP; 1827 A; 1501 C; 1553 G; 1506 T; 0 other; actattaaaa ctatattatc ctatctaatt ctgaaactat ttacactaag cttttcctga 60 atcgtgaatt tgtgcttaaa actagaggtc agattttgaa atcatctgta agtaataaaa 120 attaggccta attgaactta aaaactaata tgtacaatac gattgtagca aacgcctatt 180 cacgagggtt atcgatacga tccttccaaa ctctataaaa cgtgcactaa aatcgtaagt 240 aaattgtata ctttttgcgt atcagttaat aaaaatgaac tttttagctt gaagcttaca 300 ttaccacaaa aacgtgtttg ctgtcgagat ttagtgacaa cccctccgaa taaaaatctt 360 caaggatttt atctacgggg tctcagaatg gaagcacatc agaaacccgc cggatacagc 420 tgcaagtcct gcgatagacc ggacaccgac gaggagaaat gggtcgctcg cgatagctgt 480 aagctctggg agcactttgg ctgtgcgggg gtggatgatt cggtaaagaa ccggccctat 540 cgctgtcagg agtacaaaac caaaaccgct gtccataaaa tgccgtcatc tcgcaatttg 600 ctaccaccac cttcagaagg tagatcgctg agatcatcaa cggccaagtc tggtctgagg 660 caggtcaata aagtgactca attggcatca caattcgctc ggagcgctca tagcacgact 720 tcgagtgtcc gtgctgcccg tctgcaagcc caaatgaagc tggtcgaaga ggagcaatta 780 ctgaaggagc aggagttaga agcccaagaa gcgatgcgga agaaagaaat ggaagaagaa 840 gaacgccggc tagaagaaag gaaagcacta ttggaagaag aagcgcggct acggcaacga 900 aagctgcaag acgaaaagga gtatcagaag cagcaacagt tgatcaggat ggaatctctg 960 gaaaaaaaaa aacaccatcg ctcggcaact aagcgagtgc agcagcagag gtacatcgat 1020 ccccgattca gatcaagagg tggcccgttg gcttcaacaa tgttcgccaa atttactcga 1080 cgatagaatt cgatctctaa gaattgatcc tgaagaatca gtaccagaag ccttacctaa 1140 ttcatccttg tttgcggact gccaacaaca agtagaaatg gaccaaccag aggcagcaac 1200 cgtgccgaat cgttctaacc atccgttgca gcaaccatta tatcgcctct cttcagccca 1260 tcaaccacac ccctcggatc gtgtagcgat gcacaatcat gcgagcttca ctcagcggca 1320 tccaatgttc ccctcagaac cgagatcaga attgccacga aaaaacagac aagatgccga 1380 tgatacacga gagcagcgac agtcgaataa tatggagaca gagttcactc ctatggtaat 1440 caacgccagc caacttgcag ccaggcaggt gatggggaaa gaattgccca atttctcagg 1500 taaccccgaa gattggccga ttttcatctg tagcttcgag ctatcgacag ctgcttgtgg 1560 atacaccgat gcagaaaatc tcattcggtt acagcgatgc ttaaagggac atgcactaga 1620 atcggtgcga agtagactcc ttctcccttc gagtgtccct caagtcatca acacccttcg 1680 aacgctttac ggcagaccag agctactcat aagaacactt atcgagaagg tgcgttacac 1740 tccagcacct agacatgacc gcctggaaac gttggttgaa ttcgggctcg tcgttcaaaa 1800 tctagtggac cacctgaagg cggcgaaaca gtacacgcat ttatccaacc cagtactaat 1860 gcaggaatta gtagagaagc tccccggttc actcagaatg gactgggcaa tttaccgaag 1920 caaacagccg tatgctacgt tggcaacatt cggggacttc atgacgggac tactggaagc 1980 ggctagccag gtgaccttcg aacttccgaa ctcaagtcgt agctccagag gtgaacaacg 2040 acgtaccaaa gaaaagggtc ttctccatgc tcattcatca atacccgttt cctctttgga 2100 gccaccgtca tcaccaagtc caaaaaggtc aggtaagcca tgcgccgcat gcgaacgtga 2160 gggacaccgg gtggctgatt gccataaatt caagtccttg agcaacgacg agcggtggaa 2220 aatcgttcat caagggggat tgtgtagaac ttgcttgaat ggtcatggaa agtggccttg 2280 taggtcttgg caagcgtgtg caattgaagg ctgtaacgaa aaacatcaca cccttcttca 2340 ccccccttcc cctatccaat catccctcaa tgtctccgtc aataacatta tgcagaaaaa 2400 tggttcctct cccatgtttc gagtccttcc tgtggtatta tatgccggta aacggaggga 2460 agtcgtcttc gcctttatcg acgaaggctc atcgacaaca tttttagaga agacggtcgc 2520 tgatcgtctg ggtgtttcag ggccggtcga acctttgacg ttgcagtgga cgggcaacat 2580 aactcgagag gaacgaaaat cacagcgcgt gcaactaaaa atttctggtg aagatagtct 2640 caatatccat aagctatgtg aggtccgaac ggttaattgt ctagttttgc cgacgcaatc 2700 catgaagtac ggcgaactct gtcatcaata tcctcatttg cgtgggcttc cattgaaaga 2760 ttacgaactc atccaaccga aactactaat tgcattggac aaccttcgtt tggctgttcc 2820 tttgaaggta agggaagggg gagtttcaga ccccatagca gtaaaatgcc gactgggttg 2880 gagcatttac ggatgcgtgc aaggttcatc atccccacgc gctgtagtcc atttccatgt 2940 tgctgcatct gctgacagtg accgccagct gaacgaccag ttacgtgact attttgcgct 3000 agaggatgct ggaacaagtg aacagatgag gttgttagaa tcagacgaag agaagcgagc 3060 gagaacgata ctgcaacaaa caactaaacg cacggatcgt ggatttgaaa ccggactgtt 3120 gtggcgggat gataatccgg attttccaga tagctatccc atggctattc gtcgtttgaa 3180 ggctttggag cgaaaattat caaaaaataa ctggttacac gagcgtgtgc gcgagcagat 3240 ttctgaatat ttgaggaagg ggtacgctca caaggcaacc gaatcggagc tgaacaatgc 3300 tgatcgcaaa cgcatgtggt ttctaccact tggcgtggtc gtacacccca ggaaaccgaa 3360 taaaattcga ctggtgtggg atgcagcagc gatggtggac ggagtctctt tcaactcaaa 3420 gcttttgaag ggacctgatc tactaactcc acttccagcc gttcttagcc gtttccgtca 3480 gtttcctgta gcaatctgtg gcgacatcaa agagatgttt catcagctct caatccgtga 3540 acaagaccgt ctggcacaat gttttctttg gcgcgacagt ccaacgaatc ctgttcagat 3600 ctatgttatg gacgtggcaa catttggcgc aacctgctcg cccgcctcaa cgcaatacgt 3660 aaaaaacata aacgcagaag agttttccga agactaccca cgagcggcga ctgcgattgt 3720 taaaaatcat tacgtcgacg attacttgga cagctttcag accgtcgatg aagccgctga 3780 cgtgattaag gaagtagctt gggtccactc gttaggaggt tttgagatcc gaggtttccg 3840 ctccaattct acagagcttc ttcgtgaaat agggggacct tcggcggttg atgaaccgaa 3900 aaacttaatg ctggaacgaa gtgctaccag tgaatcagtg ctggggatgt cctgggatcc 3960 cacgaatgat tgtttctcct attgcttcaa tctacgggac gatttgcgct caatactgga 4020 tgtaaatcat attcctacca aaagagaggt tttaaaggtc gtaatgagcc tattcgatcc 4080 gctcgggctc gtttcacact ttctcgtcca cggaaaagta ataattcagg ggacgtgggc 4140 tgctagaact agatgggatg agccaattaa tgaagacctt aacagaaagt ggcgtcagtg 4200 ggtgtcactc ttcccgaagc tcaatgaact aaaaatttcc cggtgctact tctcacaatt 4260 tccgtccagt atcaagaatc tacaagttca cacctttgtg gatgccagcg atatcgccta 4320 ctcttgtgct gtatattttc gtttagtaca cgaagactgt gtccaaatag cgctggtagg 4380 agctaagtgc aaagtggccc ctctaaagac actatccact ccacggctag agctgaaggc 4440 agccgtcctc ggagtaagat tcgtggcagc aattcttgaa tatcattcac ttccggtgtc 4500 ccaacggttc ttctggagcg actcaactac ggtgctagcg tggattcggt ctgaccatcg 4560 tagattccac aagttcgtgt cggtgcgaat aggcgaaatt cttactttga gcgagccgca 4620 agaatggaaa tgggtacaat ccaagctgaa cgtttccgac gatgcgacaa aatggaagga 4680 cggtcccaat ctcaaacctg agagtacctg gttcgatgga cccagattcc tttacttcga 4740 tgaagcggca tggccagagc aacggattac gacgacaact catgaagaac ttcgacgtgt 4800 tcaaactcac tggagtcgcg aaccaatgat agattactcc cgattcaacg aatggacaag 4860 acttcaacgc acaatggcat atgtcgttcg tttcatcaat ggcgtgcgtc aacgaaatga 4920 tgggctgaat atacaactga aagtactcac tcaagaggag ctagcacagg ccgaagtttt 4980 gctatggaag acggcacaaa aggaagtctt tgctgaagag cgatctgctt tggaaaaaac 5040 aatgggcact ccagaagcta ggcacgctgt cgtgtcaaaa tctagcccaa tttataaaac 5100 ttggccgtat atggatggtc aaggaatcct acgaatgcga ggccgcatag aagcggcggg 5160 ctatctaccc ttccatgcaa ggtaccctgt aatactacca aaaacacacc cagttacaac 5220 tttgatcgtg ggatggttcc atcgtctcta tcgtcacgcg aatcgagaga cggtgacgaa 5280 cgaacttcgg caacgtttcg aaattgccaa catgagatcc attgtgcaaa aggtagcaaa 5340 ggattgttcc tggtgtcgaa tcaataaggc cttcccaaaa acaccagtaa tggcccctct 5400 tcccaaggaa cgattggaac cgtatgtgag acccttcacg tacgttgggt tagattattt 5460 tgggccagtg ctggtcaagg ttggccgaag tcaagccaaa cggtggattg ctctttttac 5520 gtgtctgtca gttcgagctg tgcacatgga aatagtacac agcctttcta cggagtcgtg 5580 catcatggca gtccgaaggt ttatatcgcg ccgtggctcg cctgctgtga tatattcgga 5640 taatggcacc tgcttccaag gggccaataa acaacttaca gaggagattg ctgctagaaa 5700 cgaagcaatg gcgcacacct ttacgaacgc cgatacgaaa tggaaattca ttccacctgc 5760 cactccgcac atggggggcg tttgggagcg cttggtaaga tcggtgaaaa cagcaattgg 5820 attggctctt gaataccctc gtaagcccaa cgatgagact ctagagacca tcatttatga 5880 agctgaagcg ttagtgaact ccagaccgct cacgtacatt ccattggagt cagcagacca 5940 agagtcgctg actcccaacc attttcttct gggcagctct acgggtaaca aaatcgaaac 6000 aaccgaagtg tctggatacg ctgcactccg tagcagctgg aagatggccc aacatatcgt 6060 taacgaattc tggaagaggt ggataaagga atatctgcct gttataacac gccggagcaa 6120 gtggttcgag gaaaccaaag acctaatggt aggagaccta gttctaattg tcagtgggtc 6180 tgggcgaact cagtggacta gaggacggat tgaggaaatc ttcatcggca gcgatgggag 6240 agttcgtcaa gcacttgtac gaacggcaaa cggagttcta cggcgacctg cctgcaagct 6300 agctgtgttg gacatcgaag gaggatgtga acccatgtga ctccagaaga attcgagaca 6360 tcacttaggt tcacaggcgg gggtatg 6387 // ID Copia-3_CQ-I repbase; DNA; INV; 4167 BP. XX AC AAWU01023967; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_CQ_; KW Copia-3_CQ-LTR; Copia-3_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4167 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 321-321 (2011). XX DR Genome; AAWU01023967; Positions 10573 14739. XX CC Positions [1474-1977] - Integrase core CC 'GTATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 19..3243 FT /product="Copia-3_CQ-I_1p" FT /translation="MEADKVGLPQFDGTHFAIWKFRVELLLEEKELSDHIE FT KEPTASNLALAEWKLKDVKARNLIVKCLTNDYVDSVRGKTTARQMWKTLLE FT AHAKTGTVSEMFLRKKLLALKYVDGESMEKHLASFDSIVTDLRAAGADVKT FT PEAITYLLETLPRSYDNVATALMTIDPETLTVAKVKANLLNEYVKQKERSG FT TGQEETKPAVFLGKVDRRRKPFPGQCWRCKQTGHKSFECQAPEEGAGSSRE FT SGGEVAVRLPTKKKAFQLRRPDAEEGEESVPRGGFAFMTKTTELKRKGGSE FT VTWIIDSGCTDHMVNDSSLFLAEQTLRNSYPVSLAEEGQAIAATKVGFISA FT MSYVQGKEYLCEVSDVLYAPGLRHNLLSVSRLEKAGFSVVFRNGVVELLDG FT NTIFAVGKRTQDLYELSFTLNRVEANSIQANVDRNILWHRRLAHLGMKSVR FT DLARRQMAKGIDSSITDSTELCEACIMGKHSRPPFRSRESRASRPLERVHT FT DVCGPITPATWDGKRYFVTFIDDCTHFTVIYLIASRDEVLPRFKEYKAMVE FT AGFDLPLSKLRCDNGGEYVSNEFKSFCRDSGVVVEYTMAYTPEQNGEAERM FT NRTIVEKGRAMIYDSGLPKEMWGEAVMCALYVTNRSPTRALKADVTPAEEW FT YGEKPDLSALRVFGCKAYAHIPKHKRGKFDPKSRECIMIGYDTNGYRLWDP FT VNRKLLLSRDVIFKEDEFPAIENPCERGSELRVTSADRFVSEDRVTSADRF FT VSEDRVMSADGFISDDRAEEVGGVTGVDGGPTAEEPVPEVEEEVSAVSDDE FT FFDSENPEQEERVQEAEVVLVEQEVSTVSAGGEEVNNEQDPLLVRKERSRR FT PPLWHQDYDMESSALLAEGNPDDVPVSYADIEGRSDETCWRRAIEEELSSL FT KINDTWDIVPNSAGKKVISAKWVFKRKLDKDGNPDRYKARLVARGFLQTHG FT VDYQETYAPVAKLATIRFLLAIGIQRGYHFCHMDVVTAFLHGNLEQPVYMK FT APDGVEAANGSVLKLKRSLYGLKQSPKCWNDRFHYYVISIGFVQSKADYCL FT YVKEDAAGLLYLLLYVTI" XX SQ Sequence 4167 BP; 1072 A; 940 C; 1246 G; 909 T; 0 other; ggttatgggc cctgcacgat ggaggcggac aaagtaggtc ttccgcaatt cgacggaaca 60 cattttgcga tctggaagtt tcgcgtggag ttgttgctgg aggagaaaga gttgtcggac 120 cacatcgaga aggagcccac ggcatcgaat ttggcactag cggaatggaa gttgaaggat 180 gtcaaagctc gaaatttgat cgtcaagtgc cttacgaacg attatgtgga ctctgtccgg 240 ggcaagacga cggcgcgtca gatgtggaag actcttctgg aagctcacgc gaaaactggt 300 accgtgagtg agatgttcct gcgcaagaaa ctgctggccc tgaagtacgt cgacggtgaa 360 tcaatggaaa agcatttggc aagttttgat tcgatcgtga ccgacctgcg ggcagctgga 420 gcggatgtca agacacctga ggcaatcacc tatcttctcg aaacgctgcc acggtcatac 480 gacaacgtcg ccaccgccct gatgaccatt gatccggaga cccttacggt cgcaaaagtg 540 aaagctaact tgctgaatga gtacgtgaag cagaaggaaa gaagcggcac tgggcaggag 600 gagaccaagc cagctgtttt tcttgggaaa gtcgatagac gccggaagcc gtttccgggt 660 cagtgttgga gatgtaaaca aacgggccac aagtcgttcg agtgtcaagc gcccgaagaa 720 ggagcaggaa gttcacgtga gagtggtggg gaagttgctg tgaggttacc aaccaagaag 780 aaagcgttcc agctccgacg accggatgca gaggaaggcg aggaatctgt cccaagggga 840 ggtttcgctt tcatgaccaa gacaacggag ctgaagagaa aaggtggctc ggaagttact 900 tggatcatcg attccggctg cacggaccac atggtgaatg acagctccct gttcctggca 960 gagcagactt tgaggaactc gtaccccgtg tcgctggcag aagaaggaca agcaatagca 1020 gcaacgaagg ttgggttcat ctctgccatg agctacgtgc aaggaaagga gtacctctgt 1080 gaggtttccg atgtgctgta tgcccctgga ctccggcata acttgctgtc ggtgagtcgt 1140 ctcgaaaaag ctggattttc ggtcgttttc cgtaacggtg tagtagagct gcttgatggg 1200 aacacgatct tcgcagttgg aaagcgaact caggatttgt acgagttatc gtttacactg 1260 aaccgtgtgg aggcgaattc gatccaagcg aacgtcgaca ggaacattct ctggcaccga 1320 cgtctagcgc atcttggtat gaagagtgtt cgagatttag cacggcggca aatggctaaa 1380 ggaatcgatt cgtcaattac agattctact gagctctgtg aagcgtgcat catgggaaag 1440 cactccagac cgccgttccg gtctcgtgaa agtcgtgcca gccgtcccct ggagcgcgta 1500 catacggatg tctgtggacc gatcacacca gcaacatggg atggaaaacg gtattttgtc 1560 acgtttatcg acgactgtac gcacttcaca gtaatctacc tgattgcttc cagagatgag 1620 gtcctcccac ggttcaagga gtacaaggcg atggttgaag ctgggttcga tttgcctttg 1680 tccaagctgc ggtgcgacaa cggcggggag tacgtgtcaa acgagttcaa gagtttctgc 1740 cgggacagtg gtgttgttgt cgagtacacg atggcataca ctccggaaca gaacggcgaa 1800 gctgagcgga tgaatcggac gatcgttgaa aaaggcagag cgatgatcta cgattctggg 1860 ctgccgaaag aaatgtgggg agaagccgtg atgtgtgctc tctacgttac gaaccggagt 1920 ccaaccaggg cgttgaaagc ggatgtcaca ccggcggaag aatggtatgg agagaaacca 1980 gatctttccg ctctgagagt ttttgggtgc aaagcttatg cgcacatccc gaagcacaag 2040 agaggaaagt tcgacccaaa aagccgggag tgcattatga ttggatacga cacaaacggt 2100 tacaggctct gggatccggt caaccgaaag ttattgctgt cacgggacgt gatcttcaag 2160 gaagacgagt ttccagccat tgaaaatcca tgtgagcgcg gatcggagct tcgcgtcacg 2220 tcggcggatc gcttcgtctc ggaggatcgt gtcacgtcgg cggatcgctt cgtctcggag 2280 gatcgtgtca tgtcggcgga tggcttcatc tcggatgatc gtgctgaaga agtcggtggt 2340 gttaccggtg tcgatggtgg tccaactgca gaggaacccg ttcctgaagt ggaagaggag 2400 gtgagtgctg ttagtgacga cgagttcttc gactcggaga atccggaaca ggaggagcgc 2460 gtgcaggagg ccgaagtcgt gttagtggag caagaagtgt cgactgtgtc agcgggaggc 2520 gaagaagtga acaacgagca ggaccccctt cttgttcgca aagaaaggtc gcgtcgacct 2580 ccactgtggc atcaagatta cgacatggag tcatcggcat tgctcgcaga aggaaatccg 2640 gatgacgtgc cagtctcgta cgccgacatt gaaggacgca gtgatgaaac atgctggagg 2700 cgtgctatcg aggaggagtt gtcatctttg aagatcaacg acacgtggga catcgtgcca 2760 aattcagcgg gaaagaaggt gatctctgcg aaatgggtct tcaagcgaaa gctcgacaag 2820 gacgggaacc cagaccggta taaggcgcga ctcgtggcac gcggcttcct tcaaacacat 2880 ggagtcgatt atcaggaaac atatgcgcct gtggcaaagt tagcaacgat ccgatttcta 2940 ctggcaattg ggattcaacg aggttaccac ttctgccaca tggacgtggt aacagcattc 3000 ctccatggaa acctggagca gccggtttac atgaaagctc cagatggtgt tgaagcagct 3060 aacggatctg tcctgaaatt gaaacgctca ctctacggcc tgaagcagtc tcccaagtgt 3120 tggaacgatc gcttccacta ttatgtgatc agcattggat ttgtgcagtc gaaagcagat 3180 tactgcttgt acgtgaaaga agatgccgcc ggactgctgt acctgctgct gtatgtgacg 3240 atctgatgtt tgcgaagtga tctgaaagcc atgagactga aggtgtgctg tctacagagt 3300 ctcatgaaag actcggctgt gtcgaacatt tcatgggaat gcgcatcgcg atcgatcgag 3360 atcgtgggag agtcacaatc agtcaggcgt ttttgctgaa tccattttgg agcgtttcgg 3420 catgcaatcg tgcaattcag taaccactcc catcgaaccc agcaccaaac ttcgccggcc 3480 tgagggggca acgaacacgg acactcggta cagagagtta attggaagcc tgatgtatct 3540 catgttggga agccgtccgg atctttgctt tgcggtaagc tacttcagtc gctttcaaga 3600 ttgtgcgggc gaacagcact tcaaccactt gaagagggtg cttcgttacc tgcggggaac 3660 gagtgcgtac gaactggttt actctcggaa tcctgacgca gcgccaatca caggatacgt 3720 cgactcggac tgggcgaacg atttggatga tagacggtcg acaagcgggt tcctattcca 3780 agtctacggg aacattgtga gttgggctac ttgaaagcag ggtgtcgtag ctcaatcaac 3840 aacagaagct ggatacatcg cagcagccaa cgcagtctcg gaggccctgt ggttcaaaaa 3900 gatgttcgcg gacatccttg aaccaattcg tcacccaatc ccaatccagg aggacaatca 3960 aggttgtttg tttgtggcca gaaatccgga gacgaagaga accaaacatg tggacgtcaa 4020 gtatcatctg gtacgagaaa aggtgtgtaa caaggaggtg gtgctggagt atgttccgtc 4080 tgaacaacag gcggccgata tcttaacgaa gccgctccac gaggagcgtc gaaaatccgg 4140 gatgtcctgg gatggaaaga ggaggag 4167 // ID I-75_AAe repbase; DNA; INV; 6306 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-75_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6306 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1346-1346 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 513..1859 FT /product="I-75_AAe_1p" FT /translation="MATGGSYGGTRPPGRTVPEFLDPQDRFGALTVLLLQK FT AGNNDLPNPFIVGLSIERAAGQIENTKSVNDGEKYILRVRNPTQVKSLLAM FT KTLTDGTIVSVVYHPTLNTCKCVISCPELLKMTEDDILTELTAQKVTEVRR FT IYRNVNQEKIKTPTLILTLCSTTFPEAIKVGPLYVRTRPYYPGPMLCYHCF FT KYGHTKSRCPGPARCKTCSDTDPKDDCKKDAYCINCQGGHQPSARKCPVYQ FT KEVEIIKTKIDHNLSYPEARRRVNEGSGSYAKITAQGRMMESDLVNELKKV FT IAAKDAQIQALTQQMEKMKDFIKTKLVMNQTSLSQHDTLLSPCETVENINH FT QDETNNSPNEITPGTGLALSRYISVNNGMTTPPKGARKPTDNSRATQKRIN FT VSPLSSPKATPAKQVVLEVEPLRNDDDALWMIECLDDTVAVNQNAVADSFD FT RQKH" FT CDS 1870..6171 FT /product="I-75_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANTSPSSNSPTYNRSIINSSSSSSKRFEGGECNVPP FT SLPGREALSVRKIVRDAIPQPVDLQPTTVTVCPQQSSHEINNHTANNRPYH FT HQAYNVMGTYCSIQTTRSNTLSAEEYDAIYCINQQPSTSRAPDKHVARKSY FT GSSRARPPMSTLTRDVPDFSDEPRAAVDMGCLTSQASRTSKISEYENDLQE FT NIFQNIFENAQQNQSSSEHRSASVSSCSSSTSTNQNDSKKIIALQWNISSL FT PKNLNDLIVLTNKLQPIVLALQETHVKQTRCIDSWLGGTYKWYLKSRPISH FT HSIALAVNSTTPHRVVPLKSDLLALAIQLYQNSKLTIVSIYVPPRNCENFE FT NKLQDLLNELEPPFLLLGDFNAHNMAWGSPQNDKRGNIIAQMAEDKDLIIL FT NDGSNTFSRGVAESAIDLSICSSGVARNISWRRADDTSGSDHYPIFILGND FT ALPETTRRKKWLFDSADWESYESNLLALFDQDPQYSIQQIAELMRKAGKSS FT IPMSSSKVGLKATYWWNPQVEEAVKLRRKALRKMKRINSQHPDKELYIGRF FT REARRTCRKIMAESKRKAWESFLDGINASSSPVELWRRINSLSGKRRIKGF FT SLMIGDTFSDNPREISEKLAQYFAEMSATNAYSERFLMERERNRPFPFCPG FT VVNHDLLEACNYRFSLDELVFALDTCKSKSCGPDGIGYPMIKKLPNIIKCQ FT LLDGYNKIWQTGEFPDDWHSSIVVPIPKAGPKSCQLRDFRPISLTSCMAKL FT MERMVNRRLMESLEKSKLLDQRQFAFRRGKGAGAYFASMGEVLAHTKANNL FT HADIACLDISKAYNRVWRKGVLEQLGRWGYAGNVLNFINGFLANRTFRVAI FT GGSLSSTFTEENGVPQGSVLAVSLFLIVMNSVFQHLPSGVHVFVYADDIVI FT VVAGPNATRVRRKIQSAIKMVFEWASRVGFQMSTEKCEITHCCAFRHNPML FT RPVKVNDNAIPFRKAPKVIGVSLDRNLNFCQHFQRIKQEVKSRVQLVKTIS FT SRHTCSNRHVLQRVANALICSKLLYGIEITYSGFENLIQILGPVYNSSIRM FT TSGLLPSSPTLSSCVEAGHLPFDLTIATAIATRAVRLAEKAGLNKFKELLT FT SAQNLFHRLTNATFPPVSQLCRVGDRRWDVPAPPIDWTIKKAIRAGEAPAK FT AKAVFQSTIHEKYQGYHQIYTDGSRRDNKVGIGIYSSTEEINYRLPDQMSV FT FSAEAVALNLAAKMACRLSGKSVIFTDSASVLTALENPRSKHPCVQSLEEF FT NSGNIVFCWVPGHSGISGNVKADMLAGRGCTARRLRIKTPGDDIKHYIKSS FT IDDTFAARWNVERNAFLRKIKSSTKKWNDQSNRKHQINLSRLRVGHCRFSH FT PHVCGGSREDRCECGARTNTVEHVLVNCPVYDDLRKRYHLELSIGQILDND FT PVVEENLLLFLKESGLFKKI" XX SQ Sequence 6306 BP; 2008 A; 1382 C; 1331 G; 1584 T; 1 other; cagttttgtc gggatactgt catctgtgat atcgatagtt tgtatgctca tcgaatttta 60 tcgtaaaatc tgagctaaga attattctaa aaacatagat agtatctaaa tacgaaaatc 120 gcgtatcggg acactgatat ttgatggagg tgtagtgaga aatactcaat ttgcgtattg 180 tcgaacaatt ttatgattca gttcagaatc aagaaagcat cggtactgtg atagtattct 240 attgttgtcg tcgtcgtagc acgtgctgtt attgtgtacc tatatcagtg agtgcccgag 300 caacwgatag catcacacag tgaacaatcg tcgtttattt cccattaaac ctcataccta 360 ggggtagata gtcggtgacg gctatcacac tggctggtgt tttaggggtt taagagacag 420 cttggtggag tggtaaaatt ataatcagtc tatactgcat ggtaggaaac tacattgctc 480 ttctgctgat cgactaactg aaatcgtggc ccatggccac aggtggcagt tatgggggca 540 caaggcctcc tggaagaact gttcctgagt ttctggaccc acaagaccga tttggagcac 600 taactgtcct attactgcaa aaagcaggaa acaacgatct gcctaatccc tttattgtgg 660 gtctgtcaat tgaacgagct gctgggcaaa ttgaaaatac caaatcggtg aacgatggcg 720 aaaagtacat cctcagagta agaaacccta cacaggtgaa aagtctgcta gcgatgaaaa 780 ctttaactga cggtacgata gtctcggttg tttaccaccc tacgcttaat acatgcaaat 840 gcgtcatatc atgcccagaa ttgctgaaaa tgactgagga tgacattttg acggaattaa 900 cagcacagaa agtaacggaa gttcggagaa tctaccgaaa tgtgaaccaa gagaagatta 960 aaacgcccac attaattctc acactgtgta gcactacgtt tcctgaggct attaaggtcg 1020 gccctctgta cgtccggacc cgtccttatt atcccggacc catgctttgc taccattgtt 1080 ttaagtatgg tcacacaaaa tcgcgatgtc ctggtccagc acggtgtaag acttgctcag 1140 acacagatcc caaggatgat tgtaagaagg atgcatattg cattaactgc caaggaggtc 1200 atcaaccgag tgccagaaaa tgtcctgtct atcaaaaaga ggttgaaatc atcaagacta 1260 agatcgacca caacctctcc tatccggaag ccagaagaag agtaaatgaa ggttcgggca 1320 gttacgccaa aataactgcg cagggtcgaa tgatggagtc agatctcgtc aatgaactga 1380 aaaaagttat tgctgcgaaa gatgctcaaa ttcaggcatt aactcagcaa atggaaaaaa 1440 tgaaagattt catcaaaact aaactggtta tgaatcaaac atctctttca caacacgata 1500 ctcttttgtc tccatgcgaa actgtggaaa acatcaacca tcaagatgaa actaacaatt 1560 cgccgaacga aataacgcct ggcaccggtt tggccctcag ccggtatata tcggtcaaca 1620 acggaatgac tacaccaccg aaaggagccc gaaaacccac tgacaactca cgagcaactc 1680 aaaaaagaat taatgtctcc ccgctgtcct ctcctaaggc cacaccagcc aaacaagtgg 1740 tattagaggt agagccctta agaaatgacg acgacgcttt atggatgatt gaatgtctcg 1800 atgatactgt tgctgtcaat cagaacgctg tagccgacag ttttgatcgc cagaaacact 1860 aattcgatca tggccaatac atccccatca tcaaatagcc caacttacaa tagatcaatc 1920 atcaatagca gttcttcatc atccaagcgg tttgagggag gggaatgtaa cgtacccccc 1980 tcacttccag gaagagaagc actgtcggtt aggaagatag tccgggacgc aataccccaa 2040 cccgtcgacc ttcaacctac cacagtgacc gtctgtcccc agcaatcatc gcatgaaatc 2100 aataaccaca ctgcaaacaa tcgaccatat catcatcaag catataatgt catgggaacg 2160 tactgttcaa ttcaaactac ccgatcgaac accttatccg ctgaagagta cgacgcaatc 2220 tactgtataa atcaacagcc atccaccagt agagcaccag ataagcacgt cgcccgtaaa 2280 tcgtatggct ctagtagagc aaggcccccc atgtcaaccc tgacccgaga tgttccagat 2340 ttttctgatg agcctcgggc ggcagttgac atgggttgcc taacatctca ggcaagtaga 2400 actagtaaaa tatcggaata cgaaaacgat ctccaagaaa atattttcca gaacatcttc 2460 gaaaacgctc agcaaaacca gtcatcgagt gagcacagat cagcctctgt gagttcctgt 2520 tcatcctcga catcaacaaa ccaaaatgac tccaagaaga taatcgctct gcagtggaat 2580 ataagcagcc tccctaagaa tcttaatgat ctgattgtcc taacaaacaa actgcaaccg 2640 atcgttcttg cgcttcaaga aacccatgtt aagcagacac gctgcattga ttcttggtta 2700 ggaggaacct acaaatggta tcttaagagt agaccgattt cacatcattc catcgcactg 2760 gctgtgaatt ctacaacacc tcatcgtgta gtccctttga aaagtgattt gctagcactc 2820 gcgattcaat tgtaccagaa cagcaaactt acaattgtgt caatctacgt tcctccgaga 2880 aattgtgaaa actttgaaaa caaacttcag gatctattga acgagctcga acctccattt 2940 ctcttactag gcgacttcaa tgcccacaac atggcctggg gatcaccaca aaatgacaag 3000 agaggaaata tcatcgcaca aatggcagaa gacaaggacc ttattatact caatgacggc 3060 agcaacacat tttcacgtgg tgtagctgaa tctgctatag atttgtcgat atgttctagc 3120 ggagtagcta ggaatattag ttggcgtagg gctgacgaca cgtccggcag tgatcactat 3180 ccaatattta tcctgggaaa cgacgcactt cctgaaacca ctcgccgcaa aaaatggctg 3240 tttgattcgg ccgactggga gagctacgaa agtaatctac tagcactctt cgaccaagac 3300 ccacagtatt cgatacaaca aatcgcagaa cttatgagaa aagcagggaa atcaagtata 3360 cctatgtcat cgtctaaagt gggactcaaa gcaacctact ggtggaaccc acaagtagag 3420 gaagccgtca aattgcggag gaaagcactt cgaaaaatga aaagaatcaa ctctcaacat 3480 cccgataaag agctgtacat aggaagattc cgtgaagctc gtcgcacctg tcgaaaaata 3540 atggcggaat ctaaaaggaa ggcctgggaa agcttcctag acggaataaa tgcatccagc 3600 tctccagttg aactctggcg tcggattaac tctttgagcg gaaagaggag aataaaagga 3660 ttctctttaa tgattggtga cactttttca gataatccaa gagaaattag cgaaaaactg 3720 gcccaatatt ttgccgaaat gtctgctacc aacgcatatt ccgaaagatt tttgatggaa 3780 cgtgagcgaa ataggccctt cccgttctgt cctggcgtag ttaaccatga tcttcttgaa 3840 gcttgcaatt accgtttctc cttggatgaa ctcgtttttg ctctcgacac gtgtaaaagc 3900 aaatcatgtg gacccgatgg aataggttat ccaatgataa aaaaattacc gaatatcatt 3960 aaatgccaat tattagatgg ttataacaaa atctggcaaa ctggcgaatt tccagatgat 4020 tggcattcca gcatagttgt gcctattcca aaagccgggc caaaaagttg ccagcttcga 4080 gatttccgac ctattagttt gacgagttgt atggcaaaac ttatggaaag gatggtaaat 4140 agaagactta tggaatccct agaaaaatca aagctgcttg atcaacgtca gtttgctttc 4200 cgtagaggaa agggtgcagg tgcttatttt gcatcaatgg gggaggttct cgcccacaca 4260 aaagccaaca atctgcatgc tgacatcgcg tgcctggaca tctctaaagc atataatcgc 4320 gtatggagaa aaggagtgct tgagcaattg ggaagatggg gatatgctgg aaatgtcctt 4380 aatttcatca atggattcct ggctaatcga acctttcgag tggccatcgg aggatcactc 4440 tcttcaacct tcactgagga aaatggcgtc ccccaagggt cagtgcttgc cgtctccctg 4500 tttttgatcg taatgaactc cgtgttccag catttaccat ctggtgttca tgtatttgta 4560 tacgccgatg atattgtcat cgttgtcgct ggaccgaatg ctactagagt aagaaggaaa 4620 attcaaagtg ccatcaaaat ggtatttgaa tgggcatcta gagtaggctt ccagatgtct 4680 acagagaaat gtgaaataac tcattgttgt gcttttcgtc acaatccaat gctgaggcca 4740 gtgaaggtga atgataatgc aattccattt cgtaaagcac ccaaagttat tggagtgtca 4800 ctagatcgca atctaaattt ctgccagcac ttccaaagaa taaaacaaga agtgaagagt 4860 cgagttcagc tggtaaaaac gattagcagc agacacacct gtagtaaccg tcatgtacta 4920 caacgtgttg caaatgcact tatctgcagc aaattgttat atggcatcga gattacttat 4980 tctggtttcg aaaacttgat tcaaattctc ggcccagtat ataattcttc gattcgaatg 5040 acatctggac tccttccaag ttcccctaca ctatctagct gtgttgaagc cggacatctt 5100 ccctttgacc tcacgatagc aacagctatc gctactagag cagttcgtct agcggagaag 5160 gctggcctaa acaaattcaa agaattgcta acaagtgctc aaaatctgtt tcaccgacta 5220 actaatgcta catttccgcc tgtgtcccaa ctttgtcgag tcggagacag aagatgggat 5280 gtacctgctc caccgattga ctggaccatc aaaaaagcga ttcgagctgg cgaagcgccg 5340 gccaaagcga aagcagtctt ccagagtacc atccacgaaa agtatcaagg ataccatcaa 5400 atatacaccg acggttctcg cagagataac aaagttggca ttggaatata cagctccact 5460 gaagaaataa actaccgtct tccagatcaa atgtctgtat tttccgcaga agcagttgcc 5520 cttaatcttg cagccaaaat ggcgtgtaga ctgagtggaa aatcagttat ttttacggat 5580 tcagcaagcg tgttaacggc gttggaaaat cctagatcaa aacatccctg tgtacaatca 5640 ctggaggagt tcaactcagg aaacattgta ttttgctggg taccagggca cagtggaata 5700 tctggaaatg ttaaagcgga tatgttagct gggcggggtt gtactgcaag gcgcttacga 5760 atcaaaacac caggagatga cataaagcac tacattaaat catcaataga tgacactttt 5820 gctgcaagat ggaatgttga acgaaatgcg ttcttaagga agataaaaag ttcaacaaaa 5880 aaatggaacg atcaatcgaa tcggaaacac caaattaatc tttcaagatt acgcgtggga 5940 cattgcagat tttctcatcc acatgtgtgt ggcggaagca gagaagacag atgcgaatgt 6000 ggtgctagaa cgaacactgt ggaacacgta ctagtaaact gccctgtata tgatgaccta 6060 cgaaaaaggt atcacctgga gttatcaatt ggacaaattc tcgacaacga cccagttgtt 6120 gaagaaaatt tactgctatt tcttaaagaa tctggtttgt ttaaaaaaat ctaattttta 6180 tcaagataaa atgaattgta attaagatag aatgaattgt atgtaactaa gcaaaatcaa 6240 tctcaagacg cgaacgactt aaccagttaa agcgtctcaa ataaagacaa ctaactaact 6300 aactaa 6306 // ID Gypsy2-I_SM repbase; DNA; INV; 3513 BP. XX AC . XX DT 18-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; LG_I; Gypsy2-I_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3513 RA Jurka J.; RT "LTR retrotransposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 91-91 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 247..1797 FT /product="Gypsy2-I_SM_1p" FT /translation="MNQNNFFVNKKKIPESVIRENIDFSSFQKLPLEPYEG FT DPIAFTDFMNDYFIYASAYGWSETIMIQRFPLYIKGSARDAWKQINKNKIG FT NDWSKLNDAMKDKLVCKEIRRYFEQVFHERKQNPEESVLEFGYGLISLAKK FT AFSVNSVDDINIQNLIVDQFLSGLSSNLKMVSSIVEYDDIFDLIRKISYIE FT LKMADCNKVVKFNNLCSKQNIRKCSFQDKFKSEDLEINESNKNICNTLRSF FT KSLNFEKKLLLNDDYDVYGQKSYSITNKCNLNYETIENELWLEDMAVEFQC FT NDSSLEIIDQIECTHYKMCCFEKIEVVYSFNNYCVSNQLVLLKEKKIVFDD FT HSKFISMLSNFTTFNVHLSFVRSKVIRYESIVITENIINLNLLLPKCDIPL FT IPKPDEVIIDSESLSEAIMIQNECKVLKKYFIKNNITSKTFNIIYIIIKLI FT WTHLKFYENKHTEEIIRYHFEYINSRMKDRICLPMICYINVIYLKCGNRIG FT SDGELNIKTYTSLVNRLMIEF" XX SQ Sequence 3513 BP; 1342 A; 353 C; 577 G; 1241 T; 0 other; tatttggtga gcccgacgtg atcaaagtcg gtgaaaattt attgaataga aattttgttt 60 aaacgtgatt taaaaggaga aatttcaaat aattgtgata ttttgagaaa ctttgtgtaa 120 aaaatttgaa gtgaaattta atgaagattg tttgaattta aatttgaaaa ggtaaaattt 180 ttttgtttta acaatttgaa atttaggttt atgtgatttg aaaaagaaaa taatcaatta 240 tttaaaatga atcaaaacaa tttctttgtg aataaaaaga aaattccaga atcagttatt 300 agagagaata tcgatttttc cagttttcag aaattaccat tggagcctta tgaaggcgac 360 ccaattgctt ttactgattt tatgaatgat tactttatat atgcttcagc atatggatgg 420 tctgaaacca taatgataca aagatttcca ttatatatta aaggatcagc gcgtgatgca 480 tggaagcaga ttaataaaaa taaaattgga aacgattggt caaaattaaa tgatgctatg 540 aaagataaat tagtatgtaa agaaattaga agatattttg aacaagtctt tcatgaaaga 600 aaacaaaatc cagaagaatc tgttcttgaa tttggatatg gattaatttc attagctaaa 660 aaagcatttt cagtcaattc agtcgatgat attaatattc aaaatcttat tgttgatcaa 720 tttttaagtg gacttagcag taatttgaaa atggtatctt ctattgtgga atatgatgac 780 atatttgatt taattcgaaa gatttcatac attgaattaa agatggctga ttgtaataaa 840 gttgtaaagt tcaataactt atgctccaaa caaaatatta gaaagtgtag ttttcaagat 900 aaatttaaaa gtgaagatct tgagatcaat gaaagtaata aaaatatttg taacacttta 960 agatctttta aatctttgaa ctttgaaaag aaactattat taaatgatga ttatgatgta 1020 tatggtcaaa agagttattc aataactaat aaatgtaatt taaattatga aactattgaa 1080 aatgaacttt ggctggaaga tatggcagtt gaattccaat gtaatgatag ttctttggaa 1140 attattgacc aaattgaatg tacccactat aaaatgtgtt gttttgaaaa aattgaagtt 1200 gtttactcat ttaataatta ttgtgtttcc aaccaactag ttttattgaa agaaaagaaa 1260 attgtatttg atgatcacag taaatttatt tcaatgttat cgaattttac gacgtttaat 1320 gttcatttat cttttgttcg gagcaaagtt ataaggtatg aatctatagt cattacagaa 1380 aatattatta atttgaattt actgttacct aaatgtgata ttccactaat tccaaaacct 1440 gatgaagtta taattgattc tgaatcttta tctgaagcta taatgatcca aaatgaatgt 1500 aaagtactca aaaagtattt tataaagaat aatattacaa gtaaaacatt taatattata 1560 tatatcatta tcaaactaat ctggactcat ttaaaatttt atgaaaataa acataccgaa 1620 gagattatta gatatcattt tgagtacatc aactcgcgta tgaaagatcg tatttgcctt 1680 cctatgattt gttatatcaa tgttatatat ttgaaatgtg gaaatcgtat tggatctgat 1740 ggagaattaa atattaaaac ttatacctca cttgtaaata gactgatgat cgagttttaa 1800 taaaatataa tgaaaatatt tgagaaaatt atgaaattta taaacctcaa tccaaagtta 1860 ttttaatcag gttctattta ctttaaatta ttcttatgga ctacgtaaac caaagaatta 1920 gaaattatta tttttacgct aaatttgaat ttgagctact tgtatataga tgaaattttt 1980 attacttatc acttttaaat gattaaaagg tttgagtcgt gaatgattga caatatttaa 2040 atttttcaga aaaatattga attataattg taattgggaa ctgttctgag ttaaactgtt 2100 atgatcaaat attttgtaaa tttgtgtaat gagatgtatt caagcttcaa gagtctaaag 2160 ggtcaaacct tgaccaaaaa ttttattttt gatattatgt agagaacgat gagctgagta 2220 aaaatgccaa ttgaactatg gttgtatttt ctaaaggttc gaagttacaa agggtcaaac 2280 tttggccaaa aaaatttatt tttgatatgt tgtacaggac ggtgagttga gtataattgt 2340 caatggaact atagtcgtat cttatcgaac ttcaaaggtg taaagggtca aaccttgacc 2400 aaaaatacaa tttttgatat gttgtagaga agaatgagct gaataaaatt gccaatagaa 2460 ccatggtcat atcttctaaa ggttcgaagc taaaaaggct caaactttgg tcaaaaaaaa 2520 ttatttttaa tatgttgtag agaacgatga aatggcaaaa gtggtgataa aattaggatc 2580 ttccaacctt tggataaaaa gttattaagg gtcaaacctt caaaaattct aatttttgat 2640 atgttgcaga gcttgacgag ctgagcaaat tttacaatgg aactatggtt gtcagatttt 2700 tggatcaagg gttattaagg gtcaaaatct tgggcaaaaa aatttatttt tgatatgttg 2760 tagagaacga taatatggca aaagtgacga taaaattatg atttttcaac ctttggataa 2820 gacgttataa agggtcaaac cttcaaaaat tttaattttt gatatgttgt agagcttgag 2880 aagctgagca atatttacta tagaagtatg gcagtctgac ttctgggtca aaagttataa 2940 agggtcaaaa ccttggtcaa aatttttaat ttttgatatg ttgtagagat cgacgaaaca 3000 agcaaaattg ccaatagaac tttagtcatc aaagctttag ataaaaagtt ataaagggtc 3060 acaactttgg ccaaaaattt taatttttga tatgttgtgt agaataataa gacgatcaaa 3120 aattaccata gaactataat cgtgtgaact tttgataaaa agttacaatg ggtcaaaacc 3180 ttcgaatact ctagatttca gaaaattgtg aagaaattaa aaattttgaa agtgaatagt 3240 aaatattttt gtaaatcagt ggagatcgtt caatttattt catttgcata ggcgcaaatt 3300 tcatgaaatt tataaataac ataaaaaatt acattaaaca tttgcatgtt aaatttttaa 3360 acaataagtt gtatgaaatg ttttaaaagc agttgttatt tttattgata attgaattaa 3420 tgtaaagtta taaagagttg tacagcatga tcaaaatgga attgagtgtc aaattttaat 3480 tcgaggacga atttctttta aagaggggag ata 3513 // ID Gypsy-5_SI-LTR repbase; DNA; INV; 589 BP. XX AC AEAQ01012110; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_SI_; KW Gypsy-5_SI-I; Gypsy-5_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-589 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01012110; Positions 10384 9796. XX SQ Sequence 589 BP; 224 A; 120 C; 101 G; 144 T; 0 other; tgcaaccgcg ggaaaatctt ccagtggttg cccttcagag gatcccgtga gaaggggaac 60 tccccattaa gacatcctcg aaaccaaata tggagccaaa gagcagggac aaggcaccag 120 agtacaaagg gtcaaatgcc aagataatcc cggaccaggc gagactcaca gatgacagcc 180 agagtgacaa caacgatcag gaaataaaca atcccacaaa ggaagcagac gagacagcgt 240 tcacgcccat ggacataacg attccggacg acaacacgca atcccaacct ataaaggcaa 300 cagaggagta agcgacaaca acaacggaga attagggtta caaaacacta acaagaatag 360 tcgcaagagt aattctatca taattttttt tatttttaat tattttatac tcattgttat 420 ttttattacg actttaaact atattattat aattgtaacc ataaacataa acatagaaac 480 ataccataat ttagatttaa acgtttacta acacacaacc taatgataca tatatattag 540 cagaatagtt tcatttatta cttaatattt tagcatttat attgtgtca 589 // ID Lian-Aa1 repbase; DNA; INV; 4476 BP. XX AC . XX DT 24-SEP-2010 (Rel. 15.1, Created) DT 24-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE A LOA clade non-LTR retrotransposon family Lian-Aa1 from Aedes DE aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; Lian-Aa1. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Tu Z. and Guzova J.A.; RT "Structural, genomic, and phylogenetic analysis of Lian, a novel RT family of non-LTR retrotransposons in the yellow fever mosquito, RT Aedes aegypti."; RL Mol. Biol. Evol 15(7), 837-853 (1998). XX RN [2] RP 1-4476 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (24-SEP-2010). XX DR [1] (Consensus) XX CC This consensus is generated from 30 sequences with >99% identity. XX FH Key Location/Qualifiers FT CDS 641..4210 FT /product="Lian-Aa1_1p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MQNGAAQVALVQEPYFRRGNFYLGNLVDPVFATFSKL FT EMANSRSMPRACVLVNKAIVATLISELTTRDVCAVTIDVSVGDLNRKYVYC FT SVYLPHDEPSPTDDFKRVVVHCVTKGLPLIVGSDANAHHIIWGSSDINLRG FT SSLMEYLSSTDLGLLNIGNRPTFMVSNREEVLDITLCSNRISHELTNWHVS FT DEESLSDHRYIFFEHSNVTAQTLRFRNPRSTNWELYIELVATKFHGYSPSI FT ENPSDLDDAVDTTTSYIMEAFEEACPLRSVKTTRGTPWWNSDLTRLRKQCR FT RSWNRRRSAGSESFKSARKAYKKALRSAERSGWKNLCTNVSSLSEVSRLNK FT ILAKSKDFQVNEIRLPNGDFTSSDEEVLECLFNTHFPGCVDIASTDEPNVF FT SCSYESLASARSIVTTESIQWALNSFAPFKSPGADGIYPVLLQKGFEFIKH FT VLKKLLVSSFATGYIPKSWRDITVKFIPKGGRASYEEAKSFRPISLTSFLL FT KCLERIIDHHIRDVYLANMPLHVNQHAYQSGKSTVTLLHKVVYDIEKAFAQ FT KQSCLGVFLDIEGAFDNVSFDAILEAARNHGLPTMITNWIHQMLKNRHLFS FT TLRQAAIRKLSVCGCPQGGVLSPLLWNLVADTLLRQLNNCGFPTYGFADDY FT LALIVGMCISTLFDLMQSALQVVESWCRQYGLSVNPNKTSIVLFTERRNRD FT GIRPLRLFGTEINVTDQVKYVGVILDSKLSWTPHIDFRVKKACMAFGQCRR FT TFGKTWGLKPKYIKWIYTTVVRPILAYGCLVWWQKGEVRTIQSKLGHLQRM FT CLMAMSGAFSTTPTAALEALFDVAPLHIYLKQEALSCSYRLWVLDLLEKNP FT VNRRSTHTSLFPLLVNWDKIVLAPSDLTIACNFPYRTFTTQFPSREEWTSG FT YLERSISNNIVCYTDGSLLEGRAGAGVYSRELRLNQFYSLGRNCTVFQAEI FT FALMCGVQSALQQRVMGKVIYFCSDSQAAIKALASANSRSKLVIACRTQIE FT ELNSVNSVNLVWVPGHSSIAGNELADELARDGASHDFIGPEPAIPISKCWV FT KLQINSWAATQHKQYWNSLESCRQTKLYITEPSPKVAKYLTNLSKQNCSLL FT VRALTGHCRLNYHMANIQRADSFVCDSCDSDYGTSYHLICNCPVFAQMRFQ FT LLGKHLLSETEYRSLNLQDILLFLTRCGNEL" XX SQ Sequence 4476 BP; 1180 A; 987 C; 994 G; 1315 T; 0 other; ttagtggctg cttgggcacg tcaggagtta atcatcaata ttaacctcct tttcccgagc 60 agcctagata gccgcgtagt gtcggtagcg gttgtttcaa ctggctaaga attaacacta 120 cggactgcct gttccggtgg taaaagtcca cctcacaggt gacccctaat ttatggtgtg 180 atgcgtaccg tgcctaagaa tgaatggtta ggggggtcta aataaaacct aaccgcaaac 240 ggagcctgtg gggtaccagg gcgccctcca cagtattgag tccttcctgt gctacccgga 300 gcaatggtgc aggtgacctt gtgtttctcc gagataatcg gctgcccttc ttcagtctct 360 atcttgaggc ttaataaggg tgggattatg aatatgttga catttaattt aaattatcac 420 ctatatggat tcgcattatg cgttttacac agtgtattct gtgctctttc gcctttggcg 480 actttaaata gataccgatc tggttttttc gttgctggtc tttttcgtgc tgagattgca 540 aaaagcttaa tcctgcctag tttgggtagt ggctacggtt aggttagctc agatcaatct 600 tcagcataaa agaacagcaa cgatcaatct ttgcagactc atgcaaaatg gtgcagccca 660 agtggcgcta gttcaagaac cctactttcg tagagggaac ttctatctag gtaaccttgt 720 ggacccagtt tttgctactt ttagcaagct tgaaatggca aactcgcgct ccatgccccg 780 cgcatgcgtg ctcgttaata aagcaatcgt tgctacactc atttctgagt taactaccag 840 agatgtatgt gctgtcacaa tcgatgtttc tgttggtgac ctcaacagga aatacgtcta 900 ttgttcggta tatttaccac atgatgaacc atccccaacg gatgacttca aacgagttgt 960 cgtacactgc gtaacaaaag gccttccgct gattgtgggc agtgatgcca atgctcatca 1020 catcatctgg ggcagctcgg atatcaattt gagaggctcc agtctgatgg aatacttaag 1080 tagtacagac cttggattac ttaacatagg caatcgccca accttcatgg tttctaatag 1140 agaagaagtg ttagacataa cgctctgctc gaatagaatc agtcacgagt tgacgaattg 1200 gcatgtatca gatgaggaat cattatctga tcatcgctac atcttctttg aacattcaaa 1260 tgtaactgcg cagactttgc gttttaggaa tccccggtca acaaactggg aactctacat 1320 tgaattggtt gcgaccaaat ttcatggata ttctccgtcc attgaaaatc caagtgatct 1380 ggatgatgcc gttgatacta caacatccta cattatggaa gcttttgaag aagcatgtcc 1440 tctgcggtct gtaaagacta caagagggac cccatggtgg aattccgatc tgactagact 1500 caggaaacaa tgtagaagga gttggaacag acgccgttca gctggatcag agtcgttcaa 1560 gtcagctcgc aaggcttaca agaaggctct tcgttctgct gaacgatccg gctggaaaaa 1620 cctttgtaca aatgtttcca gtttgagtga agtcagtcgg ttgaacaaaa ttcttgcaaa 1680 atctaaggat ttccaagtga acgaaattcg cttacctaat ggtgacttta cttcttccga 1740 tgaagaagtt ttagaatgtt tattcaatac acacttcccc ggatgtgtgg acatagcatc 1800 tacggatgaa ccaaatgtct tttcatgtag ttacgagtct ctggcctcgg ctcgcagtat 1860 cgtaactact gaatcgattc aatgggcact taatagtttt gctcctttca aatctccagg 1920 agcggatggg atttatcctg ttctgctcca aaagggattt gagtttatca aacatgtttt 1980 gaaaaagcta cttgtaagca gttttgctac cgggtacatt cccaaatcct ggcgtgatat 2040 tactgtaaag tttatcccaa aaggaggacg tgcgtcgtat gaggaagcga agagttttag 2100 accaatcagt ctgacctctt ttcttctgaa atgtctggaa cggattatcg atcatcacat 2160 ccgtgatgtt tatttggcaa acatgcctct tcatgtgaat caacatgctt accaatctgg 2220 aaagtccact gtgactcttt tacacaaagt tgtatacgat atcgagaaag cattcgctca 2280 gaagcaatcg tgcttgggtg ttttcttgga tattgagggt gcctttgata acgtgtcttt 2340 cgatgccata ttggaagccg cacgaaacca tgggctacct acaatgatta ccaattggat 2400 tcatcaaatg ctcaaaaacc gacatctctt ctcgacattg cgtcaagcag cgattcgaaa 2460 attgagtgtt tgcggatgcc cccaaggggg agtcttgtca ccacttttgt ggaatctcgt 2520 agcagatacg ctattgaggc aactcaataa ttgcggtttt ccaacttatg gatttgccga 2580 cgactatcta gctctgatag ttggtatgtg cataagcacc ctattcgacc tgatgcaaag 2640 tgctcttcag gtagtcgaga gttggtgtcg ccaatatggc ctttcggtta acccgaataa 2700 aacatctatt gttcttttta cggaaagacg aaaccgcgat ggaattcgac ctttacgtct 2760 ttttggcact gagattaatg tgactgatca agtaaagtat gtcggagtca ttctagattc 2820 caaactttca tggacacctc acattgattt cagagtcaaa aaagcttgca tggccttcgg 2880 tcaatgccgg cgaacctttg gtaaaacttg gggcctcaaa cccaaatata tcaaatggat 2940 ttacacaaca gttgttcgac caatattggc atatggatgt cttgtgtggt ggcaaaaggg 3000 cgaagtgaga acaatccaat caaaattggg ccatctccaa aggatgtgct tgatggcgat 3060 gtctggtgcg ttctctacaa ctcccacagc agcgctcgag gcccttttcg acgttgcgcc 3120 actacacata tatcttaaac aagaagcact ttcttgctct taccgtttat gggtactgga 3180 tctactggag aaaaatccag tgaatcgtag atctacacac acttcgttgt ttccactttt 3240 ggtgaattgg gacaaaattg tccttgctcc aagtgatctc acaattgctt gtaactttcc 3300 ttacaggaca tttaccacac aattcccttc acgggaagag tggacgtctg gctatttgga 3360 aagaagtata tcaaacaata tagtatgtta cactgatggc tcccttcttg aaggtagagc 3420 tggtgcagga gtatattctc gtgagctaag gctgaatcag ttttactcac ttggtagaaa 3480 ctgcaccgtt tttcaggcgg aaatatttgc tcttatgtgt ggagtgcaat cagcacttca 3540 acagcgcgta atgggtaaag tcatatactt ctgttcagat agtcaggctg ctataaaagc 3600 tctcgcttcg gccaactcaa ggtcgaagct tgttatcgca tgtcgaactc aaattgagga 3660 actgaattca gtcaactctg taaaccttgt atgggtacct ggccattctt ccatcgctgg 3720 aaatgaattg gctgatgagc tagctcgcga tggagcatcg catgacttca ttggccctga 3780 gccggctatt ccaatttcga agtgctgggt gaagcttcag ataaactctt gggcggcaac 3840 tcagcacaag caatattgga atagtttgga gtcgtgtcgt caaacaaaat tgtatattac 3900 tgagccatct ccaaaggtgg cgaagtattt aacaaatctg tcaaagcaga attgcagtct 3960 cttggtcaga gcgttgacag gccactgccg actcaactat cacatggcaa atattcagcg 4020 tgctgactca tttgtgtgtg atagttgtga ctccgattat ggaacttcgt atcacctgat 4080 atgtaactgt ccagtttttg cgcaaatgcg attccaatta cttggtaaac acttattaag 4140 tgaaactgaa tacagaagcc tgaatcttca ggacatcctg ttattcttaa cccgctgtgg 4200 taatgagcta taggctctct ttacgctcac gcgttttgca gtgccctttt tagggcgctg 4260 ttcgaaccca ttgtggtatg gagctacatg ctctcatttc gcttatgcga tcttccctct 4320 tcaagggacc ccactcctat ttcctcccat ctttcccttc cctttcctct cccatcgggt 4380 agatgatgaa ataggctcaa atatggcgat ggcacaaatc tcccaactgg tggggaacgt 4440 gcctttggag ccggccttct gatacctgat acctga 4476 // ID hAT-81_HM repbase; DNA; INV; 3182 BP. XX AC . XX DT 16-SEP-2009 (Rel. 14.09, Created) DT 16-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-81_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3182 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1923-1923 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 603..2855 FT /product="hAT-81_HM_1p" FT /translation="MANACRGTSKPTTRQETELWLIGQMSEILSFTKLPSK FT KEVMALFFYYKEAAKQTVREASHSTTNDVIEVWAKARIPTRLKKHVVEKVE FT CMFHEYDKLKKNKENKAKRSESLLKKEEEWKDGLESLFDIAHADAMKMISI FT QEDREFLLAQREAGRRGKMGSVDKALAKRERDVHNKEENFKRRKEREEQDR FT VAREEKAILQTSESEQESGNDDEAFGEPSSSTTSKRAKRIRGTQKILNDKL FT AVCLDMAKVSDRNAALVLTPALQHLGYDPADFNINRSSIRRERMKRRQRIA FT ENLKAEFKPTVPLTIHWDGKLLEDISSTEIVDRLPILVSGLGVDQLLCVPK FT LPSGTGEASATAVYDATVAWGITDKVKCVCFDTTAVNSGPRNGACILLEQK FT LDKNLLWFACRHHILEIVLEATVTLSLGPSKGPDIMIFKRFKSSWEFIDQS FT KFQTSSSNDTVSSAVSSVAIEIIAFADNQLQQFQPRDDYRELLELTIMFLG FT GIPSRGCSFKAPAGLHRARWMAKAIYSLKIWMLRSQFKMTKKEENGIADVC FT LFTVTLYVKAWFRAPSAPSSPRVDLELIKEIDKYKAQNRAVSEIAMKKFLG FT HLWYLSEELIALAFFDDEVSDDTKRLMVSALQAPGAEHPLKRITVDPSLVS FT SKNLQDFVTESSHRFFTITGLPSTFLNKDVKLWSADADYQLAKNIVSSMRV FT VNDIAERGVALMDEYNKLHTNDEEQKQFLLLIVKKYRQRFPDRAKNTLAMD FT " XX SQ Sequence 3182 BP; 1006 A; 602 C; 700 G; 874 T; 0 other; gggtggtcct taattttcaa agaatgattt ttgtatgggg cacccctgaa ttttgtttat 60 aagcatgcat acaaaattca caaaaaaact cagctaaatc tgaaaatatt tagaggtccc 120 taccacatat actagttaag tatcatttgg ggttccgtgg ctgcatttca caagttgtta 180 tgccggtgaa ctctttcact atacacacca tactgcctgc atattttatc aatttgatgt 240 tacttattta ttgttaggtt tcccagaatg catttcacca gctgtcagcc atttttaatc 300 agacattaac ggacaatcac acaatttaaa aataatattt cattaatctg aattagtgcc 360 tagtattgac tactgacttt ctcagggaaa gttccataac aacggacctc gtggatactg 420 gaacaaaagt aagtgttttc tttgtttatt tttgcttgaa tgggttttgc attgtttgca 480 ttattgcata acataaacat tgtcagttgg attactgggt tctgttagtt ttggtagaaa 540 ttataattaa tttgtataat taaattggat ggtaacaagt acaatttatt attttgaagg 600 agatggccaa tgcttgccgt gggacttcga aaccaactac aagacaagaa actgagttgt 660 ggctgatagg ccaaatgtct gaaattctca gctttacaaa gttaccgtca aagaaggaag 720 tcatggcact attcttctat tacaaagagg ccgccaaaca gaccgttcgt gaagcatcac 780 attcaacaac taatgacgtt attgaagtat gggctaaagc tcgaatcccg acgcgattga 840 agaaacatgt tgttgaaaaa gttgaatgta tgttccatga atatgacaaa ctgaagaaaa 900 ataaggaaaa taaagcgaag cgctctgaaa gtctactgaa gaaggaagaa gaatggaagg 960 atggcttaga gagtcttttc gacattgctc atgcggatgc catgaagatg attagcattc 1020 aggaggacag ggagttcctg ttagctcagc gtgaggcggg acgacgtggt aagatgggaa 1080 gtgttgacaa agccttggcc aaaagagaaa gagatgttca caacaaagaa gaaaacttca 1140 agaggagaaa ggagagagaa gaacaggaca gagtggccag agaggagaaa gctattcttc 1200 agacatccga aagcgaacaa gaatctggta atgatgatga agcatttggt gagccatcat 1260 caagcacaac ttcaaagaga gctaaacgta tacgtggtac acaaaaaata ctgaatgaca 1320 agttagcagt atgtctggat atggcaaagg tcagtgatag aaatgctgca ctagtcctga 1380 ctccagcact gcagcacctt ggttatgacc cagcagattt taatatcaat cgatcgtcca 1440 ttcgaagaga aaggatgaaa cgtagacaaa gaatagcaga gaacttgaaa gcagaattca 1500 agccaacagt tccactaaca atacactggg atgggaaatt actggaagat atcagtagca 1560 cagaaattgt ggaccggctc ccaatcctcg tgtctggact aggagttgac caacttcttt 1620 gtgtaccgaa gctgccatct ggaactggag aagcttcggc gacagctgtt tatgatgcga 1680 ctgtagcgtg gggcataact gataaagtca agtgcgtgtg cttcgatact actgccgtta 1740 atagtggtcc aagaaatggt gcttgcatcc tactggagca gaagcttgac aaaaatttgc 1800 tctggtttgc ttgtcgccat cacattttag aaatagtctt agaggcaacg gttactcttt 1860 ctcttggccc ttcaaagggt ccagacatca tgatttttaa aagattcaaa agcagttggg 1920 aatttattga ccagtcaaaa tttcagacat caagttcgaa tgacaccgtt tccagtgccg 1980 tttccagtgt tgcaattgaa attattgcat ttgctgataa ccaactgcaa cagtttcagc 2040 ctcgtgacga ttatcgtgaa ctgctagagc tgacaattat gtttctggga ggcattccat 2100 ctcgaggatg ctcattcaaa gcaccggcag gacttcatcg agccagatgg atggccaaag 2160 ctatttattc attgaaaata tggatgctga gaagccaatt caagatgaca aagaaagaag 2220 agaatggaat agcagacgtg tgtcttttca cagtgactct gtatgtaaag gcctggttta 2280 gagcaccgtc ggcaccatct tcgcctcgtg ttgatctcga gctcatcaag gagatcgaca 2340 aatacaaggc acagaatcga gcagtctctg agatcgcaat gaaaaaattt ttaggacatt 2400 tgtggtactt gtcagaagaa ctgattgcac tggctttctt tgacgatgaa gtctccgatg 2460 acacaaaacg tctgatggtc agtgccttgc aagcaccggg tgcagaacac cctttgaaga 2520 ggataactgt tgatccttcg ctggtgagct ccaagaactt gcaagacttt gtcaccgaga 2580 gcagtcatag attcttcacc atcactggtc taccgtcaac tttcctcaac aaagatgtga 2640 agctctggtc tgcagatgct gactatcagt tggcgaagaa cattgtcagc agtatgagag 2700 ttgtgaatga cattgcagaa cgtggagttg ccttaatgga cgaatacaac aagttgcaca 2760 caaacgatga ggagcagaaa cagttccttc tcttaattgt aaagaagtac cgccagcgat 2820 ttcctgatcg agcgaagaat accctagcaa tggactaact agacggtttt tctcacttcg 2880 acagtctcaa cgtagaagga caataataat tacaatatgt aacagttttt taacgtgctt 2940 caataaagca atgctattct ctgtgacagt ttaaatttgt cctgatcatt catattgtgc 3000 tgagcaaata agcctccggc gggcactttt attgatacga actctgcggt agggacccct 3060 aaattatgtt ttattcaaat aaaattttca gtacaatcat tttttcttaa tagtcgacca 3120 tttgagaggg tgccccatca gaaaactgaa aaaaaatttt tttttataaa caaggaccac 3180 cc 3182 // ID Gypsy-27_DWil-LTR repbase; DNA; INV; 291 BP. XX AC scaffold_181141; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_DWil_; KW Gypsy-27_DWil-I; Gypsy-27_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-291 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181141; Positions 564373 564663. XX SQ Sequence 291 BP; 87 A; 49 C; 57 G; 98 T; 0 other; tgatatgctt acgatatgct gaaatatgag cacaagtcat ttaatacgtt gctatgtggg 60 aacgtgccaa atgcatgttt gggcagttgg caatattgat agtatattgt atgccgctac 120 cacttgtttg ggcgcttgct gactatttct aactggctcg tattattctt atttggaggg 180 aggaacatgg aacacaatca tattgcaaca caacattgac tcgtattaat taattttata 240 aatgattaat aaatttaatt catcgaacct gaacttatta actggcgccc a 291 // ID Gypsy-13_DWil-LTR repbase; DNA; INV; 370 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_DWil_; KW Gypsy-13_DWil-I; Gypsy-13_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-370 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 446439 446070. XX SQ Sequence 370 BP; 112 A; 84 C; 94 G; 80 T; 0 other; tgtaacgtag acagcctggg tggcaactcg tgtagtagga aaacgcaaac tgtccccctt 60 aagaagcggt cccattctct acggctgggc acactaataa cggcgccacc gggagggcta 120 gtttttttga gcgagcgcgg ccaacttaac gaagagaaga agatagggaa aaagagaaga 180 agggcttgaa agttttcgct gccgccaaga caaggaaaat aaagagtttg aataaacgaa 240 agaaactatc tctgttttct ttacgattct gctgagttat cttaggcctg acgatcgtac 300 gcccaaccgt aaaccaaccg ctcggccacg ggtggcgtta ttttgttcca caaaggcaaa 360 acacgttaca 370 // ID R1C_NVi repbase; DNA; INV; 6404 BP. XX AC . XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 08-NOV-2010 (Rel. 2.02, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia DE vitripennis. XX KW Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Retrotransposable element R1; NVR1; R1_NV; KW R1C_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-6404 RA Burke D.W., Eickbush G.D., Xiong Y., Jakubczak L.J. RA and Eickbush H.T.; RT "Sequence relationship of retrotransposable elements R1 and R2 RT within and between divergent insect species."; RL Mol. Biol. Evol 10, 163-185 (1993). XX RN [2] RP 1-6404 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [2] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. A partial sequence of this family was previously CC registered as R1_NV. XX FH Key Location/Qualifiers FT CDS 412..2391 FT /product="R1C_NVi_1p" FT /translation="VGGYGMHGSWRGVASPPTPGWLWRLVRVALPSRGKGA FT LLRVTVVSSRIEKSLPYGGHGELCAGYHRRDGARCGCPGSAWGRGRTKQMD FT SIKTKLRKMTRSKGAGGSREDFLRKDSKGGGGDSEDTRSSVYSADSLRERS FT RSRSPSGLKPTKDFELVETKAARKQRKAEEKRKRRENVSASENDNRNDRVD FT KSVNCSVDGNQGVSKKVSVSRSENDVSRIIETESMDVGMEVCASMNECANG FT KRTVKRNAVGQDRFKGARGELFVVCENLRQPPKRIKREVVDAEVPVDGKSV FT GVNESAEVSEVMVGELRAVTDGLRGHLLSDANKFTKWQASSVLDHASKYEG FT LVQRLMLENAKLRGELTAHKCMKAELANVCETVRRVDEGMNVVKMRVAASS FT RSPPAAAEVVPGSRGLGANVGPKPSFALVVRGAKEQLTCDEVRRRMIESTS FT EDVNVRVRTIRPARGGGVVVETASDGERKALSRCAGLVEAGLRAVEPKVMD FT PRVIVYSVPNEMTNEHLLRGMYEKSLREHVSVNEFTKRVKIVRREDGQRLG FT NVIVELPLPWRDRLLQDGRVFVGWNSFRCCSYERVMCCFRCQGYDHRAREC FT KSEPLCYKCGKSGHRMNECKAAEDCSNCRARKLPSEHLARSPRCPMYAWKL FT RLLRSRFVNNG" FT CDS 2384..5560 FT /product="R1C_NVi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TMAESQMNGTDVGVNVRILQVNCQKSYAAMCDIANCA FT LEDGIEICLFQEPYVYKDRVCGLPAGSRMYLSKSGEAAVVVFGKRYECMLL FT NEGAHEDAVCVWVKGPVGEILVVSLYCRPNGSMQECVDYLDRVVGTRNGRR FT LLVGMDANAASELWHSKSMVRAWQAVRRGAVLGDWVVQAEMDVLNVPTLAY FT TFSGARGESDIDVTLYKGSECQFEWMLKDDWGISDHNPIVITMSTGENVDV FT NGERMHKWNARKCNWLLYRGLIETFASDYGYDEYSVLGAEEKLTLLYKWMT FT EANEVCMEKVVSRPAPKRKSVVWWNESLSEKKRKVREWRRAYQSERSRTGD FT PDRTRWREWKECEREYRRMMKDAKESNWHGTVERKGETDPWGVISTFCMGK FT LNPESLAGLRTANGCTKTWMESARVLLDEFFPADDGIPAEEVHGVQMDMYE FT FCMGELDEAVLGMKMRKAPGMDGLTNEMLRQVWRAAPLFLKGLFDTCLSEG FT LFPHRWKEARVVVLLKGADKDVAESRSYRPISLLGSPGKVMERMMVARLMR FT HMEGKWNARQYGFMRGKCTEDAWARAKENVREAESEYVLGIFVDFKGAFDN FT LLWRVALQKLREAGCTYEELRVWHSYFSDRSVCMYNGMDVVEKRARRGCPQ FT GSISGPPVWNLGMNDLLNELSELGVEVVAYADDLLLLVQGNRRNELEQSAS FT EALSVVYRYGTNIGVEVSDSKTVCMMLKGSLNMLNRVVHVSTNGMDDKRIR FT CVDRVKYLGVNVGIGMDFSVHIDGMKRRVTTVIMRLRGVLRKSWGLKRGVV FT SMVVKGLFLPAVMYGASVWYEQLHKRKLRGSRRLSEELVSCQRVVLYACTR FT VCRTVSTEAMQILFGSLPWDIECFRRANLHKVRKGLPMNESDLVTDEDLYE FT LSLHECRELVDQRALAAWQDRWEATSNGRVTYEWIRDVGFSGRSMKYFEPS FT LRVCYVLTGHGSMNSFLFSRNLSNSPACACGTEREDWIHVLCECDMYAAFR FT DLDSIGVRRTEVGWDVSGVLLDRAKYECLCAFVERAFRMRELIVQRMRENE FT ES" XX SQ Sequence 6404 BP; 1476 A; 1216 C; 2248 G; 1464 T; 0 other; ccccctcacg atcactggct tgtcgcgccg agggggcccg agttcaggag tttgcgaaag 60 cgaattcctg ttccgctaaa gtgatccaaa cccaggaagt ggtggaggat ccggtgcttt 120 ggctagccga ccccactctc aggaaccgat ccgcgaaagc ggaaaggaga ccccgagaag 180 tgtgggaacg cgaaccggag cgccggatcc gaagccaccg ggcttgacaa ccccggggtt 240 cctagtccca agcgatgagc gccgtcgctt ggcccgtaaa ccttgcaccg tcttcaacaa 300 ccttcgcagg gcgtggcgtt ggtacgttgg cggtgtgggg agcgggggct tggcttaacc 360 gcccgtccgc gaggtggggt cctctataaa accccccaat cctacgctta ggtgggcggc 420 tatgggatgc atggctcttg gagaggagtc gcgagcccac caacccccgg ctggctgtgg 480 cggcttgttc gggttgcgtt gccttctcga ggtaagggcg cactgcttcg agtgaccgtt 540 gtctcgtccc gaatagagaa gtcccttcct tatggtggac acggtgagct atgcgctggg 600 taccatcgta gagatggtgc tcggtgcggg tgccccgggt ctgcttgggg gaggggacgt 660 acgaaacaga tggattcaat taaaacaaag cttcggaaaa tgacgcgtag taagggggca 720 ggaggtagta gggaggactt cctgaggaag gattccaagg gcgggggtgg agatagtgag 780 gacacgcgct cctccgtata ctctgctgac agcctgcgcg agcgttcacg ctcccggagt 840 ccatcgggcc tgaaacccac aaaggatttc gagctcgttg agacaaaggc ggcgagaaag 900 cagcgaaagg ccgaggagaa gaggaagagg cgcgaaaatg tgagtgcgag tgagaatgat 960 aataggaatg atagggttga caagagtgtg aattgtagtg tggatggaaa tcaaggtgta 1020 agcaagaaag tgagtgtcag taggagtgaa aatgatgtga gcaggattat agagacggag 1080 agtatggatg ttggtatgga agtgtgtgcg agtatgaatg agtgtgcgaa tggcaagagg 1140 acggtaaaga ggaatgcggt gggccaggat aggttcaagg gggcgagggg cgagctcttt 1200 gtggtgtgtg agaatttgcg gcagccgcct aagcgcatca agcgcgaggt ggttgacgcc 1260 gaggtgcctg tggatggtaa gagtgtgggt gtgaatgaga gcgctgaggt ttcggaggta 1320 atggtcggcg agctgcgtgc agtcactgac ggacttcgtg ggcatctgct ttcagatgcc 1380 aacaagttca cgaagtggca ggccagtagt gtgctcgacc atgcctcgaa atacgaaggg 1440 ctggtgcagc gcttgatgtt agagaatgcg aagttgcgtg gtgagcttac tgctcacaag 1500 tgtatgaagg ctgagttagc gaatgtgtgt gagactgtcc gaagagtgga tgaagggatg 1560 aatgtcgtaa agatgagggt agcggcgtcg tcacggtcac ctccagcagc ggctgaggtt 1620 gttcctggta gtaggggttt gggagcgaat gtggggccta agcccagctt cgcgcttgtt 1680 gtgcgtggcg caaaggagca gctcacgtgc gacgaggtgc gaagaagaat gattgagagc 1740 acgagtgagg acgtgaatgt tagggtgagg accatcagac ctgctcgtgg tggtggggtc 1800 gtggtggaga cggctagcga tggagagaga aaggctctct cccggtgtgc cggactcgtc 1860 gaggcgggac tccgtgctgt ggagcccaaa gtgatggatc ctcgagtgat tgtgtacagt 1920 gtcccgaatg agatgacgaa tgagcatctc cttaggggta tgtacgagaa aagtttgcgt 1980 gagcatgtta gtgtgaatga attcacgaag cgtgtgaaga tcgtcaggag agaggatggg 2040 cagcgactcg gcaatgtgat tgtcgagtta cctctgccat ggcgtgatag gctgttgcaa 2100 gatggtagag tgtttgttgg atggaacagc tttagatgct gttcgtatga aagggtgatg 2160 tgctgtttcc gctgccaggg ctacgaccat cgtgccaggg aatgtaagag tgagcctctg 2220 tgctacaaat gtggcaagag tgggcacagg atgaatgagt gtaaggctgc ggaggactgc 2280 agcaattgca gagcaagaaa gcttccttcg gagcatttgg cgagatcgcc gcggtgcccg 2340 atgtatgctt ggaaactgcg gttgttgcgc tctcgattcg tgaacaatgg ctgagtctca 2400 aatgaatgga acagatgtgg gtgtaaatgt gcggatcttg caggtaaatt gtcaaaagtc 2460 ctacgctgcg atgtgcgata ttgcgaactg cgcgcttgag gacggcatag agatatgcct 2520 attccaagag ccgtatgttt ataaggatag ggtttgtggt ttaccagcgg gatccagaat 2580 gtatctcagt aagtctggag aagcggctgt agtagtgttt gggaaaaggt atgaatgcat 2640 gttactaaat gagggagcgc atgaggacgc cgtatgcgtc tgggtgaaag gcccggtggg 2700 ggagatactt gttgtctctc tctactgtag accgaatggc agtatgcaag aatgtgttga 2760 ctaccttgac agagtggtcg gcactaggaa tggacgtcgg ttgcttgttg gaatggatgc 2820 gaatgctgcg tccgagcttt ggcatagtaa gtccatggtt cgggcgtggc aagcggtgcg 2880 tcggggtgct gtgttgggtg actgggttgt gcaagcggaa atggatgttt taaatgtccc 2940 taccctggct tacaccttca gtggagccag aggggagagt gacattgatg tcactctcta 3000 caagggtagt gagtgtcagt ttgaatggat gttgaaggat gactggggca ttagtgatca 3060 taatcctatt gtgatcacga tgtctacggg agaaaatgtg gatgtaaatg gcgagaggat 3120 gcataagtgg aatgcaagga agtgcaattg gctgctgtac cggggcctta tcgagacctt 3180 tgccagcgac tatgggtacg acgagtactc tgtgttaggg gcagaggaaa agttaacgct 3240 cctgtacaag tggatgactg aggcgaatga ggtttgcatg gagaaggtcg tttcacgacc 3300 ggctcctaag cgtaagagtg ttgtgtggtg gaatgaaagt ttaagcgaga agaagcgaaa 3360 ggtgcgtgaa tggcggagag cgtatcagag tgaaagatca agaacgggtg acccggatcg 3420 tacgagatgg cgagagtgga aggagtgtga aagagaatat aggcgaatga tgaaggatgc 3480 aaaagagagt aactggcatg gcacggtgga gcggaagggg gaaactgacc catggggtgt 3540 catctcgaca ttctgcatgg ggaagttaaa ccctgaaagc ttggctgggc tgcggacggc 3600 gaatggatgt acgaagacgt ggatggagag tgcaagagtg cttctggacg agttcttccc 3660 cgcagacgat ggaattcctg cggaggaggt gcatggagtc cagatggata tgtatgaatt 3720 ctgcatgggt gagctagacg aggcagtctt gggcatgaaa atgcgcaagg ctcctggaat 3780 ggatgggttg acgaatgaga tgttgcgcca ggtgtggaga gcagcccccc tgtttcttaa 3840 ggggctgttt gacacgtgtc tgagtgaggg actctttcca cacaggtgga aggaggccag 3900 agtggtcgtt ctcctgaaag gagccgacaa ggatgtggcc gagtctaggt cctacaggcc 3960 tattagcttg ttgggtagcc cgggcaaggt catggagcga atgatggttg cgcgtttgat 4020 gaggcacatg gagggtaaat ggaatgcacg tcagtatggc tttatgcgtg ggaagtgtac 4080 ggaggatgcc tgggcgagag cgaaagagaa tgtaagggag gctgagagtg agtatgttct 4140 tggaatcttt gttgatttca agggtgcgtt cgacaactta ctgtggagag tagctctaca 4200 gaagttgaga gaggctggct gtacatatga ggaactgcgt gtgtggcact cctattttag 4260 cgataggagt gtctgtatgt ataatggtat ggatgtagta gagaaacgtg cccgaagagg 4320 ttgcccgcag ggatccatat caggacctcc tgtgtggaat ctcggaatga acgacttgtt 4380 gaatgagttg tccgaactgg gggtggaggt cgtcgcgtat gctgatgatc tcctgctact 4440 agttcagggc aacaggagga atgaacttga gcagtcggcg tctgaggcac tgagtgtggt 4500 atacaggtac ggtacgaata ttggtgtgga agtgtctgac tccaagacag tgtgcatgat 4560 gctgaaaggt agtttgaata tgttgaatcg ggttgtgcat gtgtcaacga atgggatgga 4620 tgataagagg attaggtgtg tggaccgcgt gaagtacctg ggtgtgaatg tgggcatcgg 4680 tatggacttt tcggtccata tcgatggaat gaaaaggagg gtcactacgg tgatcatgcg 4740 cctcagggga gtcctcagaa agagctgggg actcaagcgg ggcgtagtga gtatggtggt 4800 gaaaggcctc tttctgccgg ctgttatgta tggagcgagt gtttggtacg aacagctgca 4860 taaaagaaag ttgcgtggtt ctcggagact gagtgaggaa ctcgtcagct gccagagagt 4920 ggtgttatat gcgtgcacgc gtgtgtgtag aactgtctca acggaggcga tgcaaatctt 4980 atttgggtcg cttccgtggg acattgagtg ttttaggcgg gcgaatttgc acaaagtgcg 5040 aaagggcctg cccatgaatg agagtgacct ggtgactgac gaggacctgt atgaattgtc 5100 gttgcatgaa tgccgtgaac tggtggacca acgtgccctt gcagcttggc aggaccgttg 5160 ggaagccacg agtaacgggc gtgtgacgta tgaatggata cgggacgtgg gattctccgg 5220 ccgctcgatg aaatatttcg agccgagcct gagggtctgc tacgttctga cgggccatgg 5280 gagcatgaac tcgtttctct tctcgagaaa cctgagcaac tccccggcct gcgcgtgtgg 5340 aacagagaga gaggactgga tacatgtgct gtgtgaatgt gacatgtatg cggccttcag 5400 ggatcttgac tccatcggag tcaggagaac tgaggtagga tgggatgtga gcggagtgct 5460 tcttgaccgt gcgaagtatg agtgtctgtg tgcctttgtc gagcgcgcat tcagaatgcg 5520 tgagttgatt gtacagagaa tgagagagaa tgaggaaagt tagattagga ttagggtaat 5580 cggtgagggg gtgagggggg tagagggtag gtataagggg taaagggtag ggggtagggt 5640 aagggaagtt gtgggttggg ggtagggtaa ttgcgtgtgt gagagcgtgt ggagtgtgtg 5700 ttagatgagt gtgggaatga atgtgtgcgt gttggctggc cagctgctcg ctggtcggct 5760 ccccgcaggg ggatcccact cttgctcttc taatcgaggc tggcctgtgc cggactcgtt 5820 ggaggaccaa gagaggcatc tctgggtttt gcgaacccac ggacctgagc agcccttcca 5880 gaggcgggat ggtaagatcc caactggaac cctcaccagg gttaaaacgg taccatgggc 5940 gaccggggtg cccgctgggg gagaattgcc tccctcgccc ggtcacttgg gtttggattc 6000 gtggtggcag tggttgaaag cccacatcgc ttggggttag ggattggcac tgggtgaaag 6060 actccttggg tgctctgcac catcggagac tggaaccctc tgccagcctc gacgtgtgag 6120 ttgcggtctc aactcgggga gcggcctgct taaaccgtta gggattggat gggtcccggc 6180 cccaaccgag ggtctccaaa ggtcttacca acctgcggag gaatcggtag tcgcggctta 6240 gtagagggcc taattggttg gcaatgtttc ggcattgccg tctaattggt ctcaaagcta 6300 ttccgcgatg cgttggccga gcgtatctcg gcccctcgcc ccgtgggggg ccgtgtgggt 6360 gggccgaaag gcaggtactg cacgttaaaa caaagagacg atct 6404 // ID Gypsy-26_AA-I repbase; DNA; INV; 5003 BP. XX AC supercont1.85; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_AA_; KW Gypsy-26_AA-LTR; Gypsy-26_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5003 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.85; Positions 1518352 1513350. XX CC 'CTGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 774..2630 FT /product="Gypsy-26_AA-I_2p" FT /translation="MQDSRPVPAFRCDQIESGKLAKEWKIWKTALECYFAA FT YDVTDQYAKRAKLLHLGGPALQVVFNNLKDHDHVPLVTLEPRWYDSAVEKL FT DEFFEPRHQNTSERRKLRLLKQKQGERFADFVIRLKQQVSQCGFEKYGTEV FT SHILSDIYMIDAVVEGCSSNEVRRRVLLMDLSFPEIEALGIAQESVDQQIA FT EMAIGQPPEKVFRIEQQGRFRGKSTGRTAKPTEKSCFNCGRLGHFAASPVC FT SARGKQCRNCKCYGHFEKLCRKAKRSAVKDTDKLIRVVEETPQQETLKMEP FT EKHEAKVYYAFYSGNESNVITCMIGGVSLEMLVDSGADANLVSGEMWEKLK FT NENVAVFSSTKGSSRILRAYGSQDPLMILGSFVADIEVGDQHSRAEFFVVH FT GGQRCLLGDKTAKQLGVLKVGLDINSVAVSPFSKMNGIKAKIQFDPAVVPV FT YQPMRRIPVPLEEAVDRKLDEMLNRDIIEVKTGPTTWVSPLVVVGKSNGEP FT RLCLDLRRVNEAVLREHHPMPSVDDYLAKLGRGQLWSKLDIKEAFLQIELD FT EDSRDATTFITRRDLYRFKRLPFGLVTAPELFQKAMDETLSGCEGTVWYLD FT DVLVEGRDLQEHDQRLEQVKL" FT CDS 2857..4860 FT /product="Gypsy-26_AA-I_1p" FT /translation="MPAKSKIQAIQSFREPQCEAEVRSFLGLANYLNKFVP FT MLATLDEPLRRLLHKDTRFEWTQEQSKSFNAIKEAMGRIQNLGFYRVEDRT FT AVITDASPYGLGAMLIQFDEADRHRAIGFASKALTDTERRYCQTEKEALAI FT VWGVERFQYYLLGKSFDIFTDCKALSFLFSKRSKPCSRIERWVLRLQAFDY FT RIVFMSGKHNVADALSRLPVQEATPFDPSEEIFVREVATHAASSSALRWSD FT IKEGSQSDQEIQEVLELLVSGETQKLPITYRVIASELCELEGVLLRGDRIV FT VPVSLRNRVLVTAHEGHPGITMMKNHLRSNVWWPKMDAHVEQFVKNCRGCT FT LVGAPEPPEPLLRSQLPSSPWHTIALDFLGPLPEGQHLLVVIDCYSRFIEV FT CEMDTTTTSDVIRELSIMFSRYGIPLSMKADNAPQLSGECAELKEFCDANG FT VKLLNTIPYWPQSNGEVERQNRSILKRLRIAQELGRDWRQELYLYLLTYHS FT TKHPTTGKSPGEIMFGRRIKSKLPTVSSFQEDAGVRERDLVIKEKGKLYAD FT KRRGAMESSIQEGDHVLVKRMRKSNKLDADFSNEEFIVRRKTGTDTVIQSK FT ETGKQYRRTSAHLKKIRDRSDSSSCEGQHSKPLMDVDTSPAKDEEPVDSFP FT ALAKRSRQPPMKHRDYVPH" XX SQ Sequence 5003 BP; 1462 A; 983 C; 1306 G; 1252 T; 0 other; atctggcgac gaagattttg gatcgtaaat tgaaaagcgt ggaaaatgta agtataaatg 60 gcggaaaaaa actggttgat atgaaaaatg gagataaatg aggataatta acgaccggga 120 aataataatg agtttgcgaa agatggaaat aattgtgcac taagtggggt tggcggaaaa 180 ttgaggttat gcatggcgtg ggttggaaaa agtgaaaatg tgcatttacg cgaggattga 240 ttgaattctc cagattgtag ggtttgtggc ggaagcttgt gtcccccgga agaagcagaa 300 atcgtgagga tttctctaga gcaggcggaa aatgtaaagt tcccctgtat acaaagcaga 360 gacgcgagga ttgttcgaat tctccagatg gaagcaatta attgtggcgg aagcaatgtg 420 ggtttccctg gaagtggcag aaatcattag gatttctccg gtgcaggcgg aaaatgagaa 480 gttcccctga aaaaaaagcg aggattagtt gaatcctccg gaataattcg tggcagaaat 540 caaacatgga gtcgacggaa attgagtttc tccgaaacgg cgaaggttcg aattggaatt 600 ttggagaaat attggctgct gtgtggaagc tagtgtggaa gtgctactca atttctagcg 660 ttgtgtacat atacaaacaa attattggac actgaaaaaa aaatactttt agttttctag 720 ttcatttaac aagaaaatta aaacgatatt tttctttaaa tcttgctttc cagatgcaag 780 actcacgacc agttccagca ttccggtgcg accagatcga atctggtaag cttgccaagg 840 agtggaagat ttggaagaca gcgctagaat gttattttgc ggcgtacgac gttaccgacc 900 agtacgccaa aagagctaaa ctgttgcatt tgggcggtcc ggcactgcag gtcgtgttca 960 acaatttgaa ggatcatgac catgtcccat tggtaacttt ggagccaagg tggtatgatt 1020 ctgctgtaga gaagctggac gagtttttcg aacctcgtca tcagaataca tcggaacggc 1080 ggaagcttcg cctactgaag caaaagcaag gggaacgctt cgcggatttt gtgatccgac 1140 ttaaacaaca ggtctcgcag tgtggattcg aaaagtatgg cactgaagtg agccacatct 1200 tgagcgacat ctacatgata gatgcagtgg ttgagggatg ttcctccaac gaagttcgac 1260 gaagagtgct actgatggat ttgtctttcc cggaaatcga agctctggga attgcgcaag 1320 agagtgtaga tcagcaaata gcggagatgg cgatcggtca accgccggag aaagttttca 1380 gaattgaaca gcagggaaga tttcgaggaa aatccacggg aagaactgcg aagccaaccg 1440 agaagtcttg tttcaactgc ggtcggttgg gacacttcgc tgcttcacct gtttgctcag 1500 cccgcgggaa acaatgccga aattgcaaat gctacggcca ctttgaaaaa ctatgccgga 1560 aggcgaaacg atcggcggtt aaagatacgg acaaactcat cagggtcgta gaagaaacac 1620 cgcagcagga aacgctgaaa atggaaccag agaagcacga agcgaaggtg tattacgcct 1680 tctactcggg aaatgaatct aatgtcataa catgcatgat tggaggagtt tccttggaga 1740 tgctggttga ttccggcgca gatgccaact tggttagcgg tgagatgtgg gagaaactga 1800 aaaacgaaaa cgtggcagtg ttttcatcga caaagggaag ctctcgtatc ttgcgcgctt 1860 atggaagtca agatccactc atgatcctcg gttcattcgt tgcggatatt gaagtaggtg 1920 atcagcatag tcgcgcggag ttttttgtgg tgcacggagg acaacgatgc ttgctaggcg 1980 acaagacagc taagcagctg ggggtactga aggtcgggtt ggacatcaac agtgtggcgg 2040 taagtccttt ttcgaagatg aatggcataa aagcaaagat ccagttcgat ccggcggtag 2100 ttccggtgta tcagccgatg cgtcgaatcc ctgtgccttt ggaagaagcg gttgacagga 2160 aactggatga aatgttgaac cgggacataa tcgaggtgaa aactggacca acaacgtggg 2220 tatcgccatt agttgtagtc ggaaaatcaa atggagagcc tcgactgtgc ctggatctac 2280 gaagagtcaa cgaggctgtg ttgagggaac accatccgat gccatcggtt gacgattact 2340 tggcgaaact ggggcgggga cagttgtgga gcaaactcga catcaaggaa gcattccttc 2400 agattgagct agacgaggat tccagagacg caacaacttt cattacgcga cgggatttgt 2460 atcgcttcaa gcgattgccg tttggcttag ttacagctcc tgagctgttt caaaaagcaa 2520 tggatgagac actatccggt tgcgagggaa cagtctggta tctcgatgat gtgcttgtcg 2580 aaggccgaga cctacaagaa cacgatcagc gcttggagca ggtaaagttg taatttgata 2640 aatcataata aaatctcttt tttttgtgat ggcaaatcct gttgtttgac ctttactgag 2700 cttcttccct aatacgcatt gttgttattt ctgtcttcct gtctgcaggt cataaaacga 2760 ttgaaaaacc ggggcattga acttaattgg gagaaatgcc aactgcgagt aaccgaattg 2820 gacttcttgg ggcatagagt ttctcgggac ggaattatgc ctgcaaaaag taaaattcaa 2880 gcaattcaat cgttccgtga accacaatgc gaagctgaag ttcgcagctt tctggggctt 2940 gcgaattatc tcaataagtt tgtaccgatg ctagctaccc ttgatgaacc actgcgcagg 3000 ttactacaca aagatactag gtttgaatgg acacaggagc aatccaaatc gttcaacgcc 3060 ataaaagaag ctatgggaag aattcaaaat ttgggtttct accgagttga ggatcgcaca 3120 gctgtcatca ctgatgcgag cccgtatgga ctaggtgcca tgctaattca atttgatgaa 3180 gctgatagac atcgtgccat tggctttgct tccaaagctt taaccgatac ggagcgtcgc 3240 tattgccaga cggagaagga agccctggca atagtctggg gcgttgaacg gtttcaatat 3300 tatctcctgg gaaaatcatt tgacatcttc acggactgca aagcattaag ttttcttttc 3360 tcaaaaagat ctaagccatg ctcgcgaata gaaaggtggg tgctgcgcct acaggctttc 3420 gactatagaa ttgtattcat gtcagggaaa cacaatgtgg ctgacgcctt gtctcggctg 3480 ccagtacagg aagcaacacc atttgatccg tcagaggaaa ttttcgtgcg agaggtagcc 3540 acacatgcag ctagttcttc ggctttgcgc tggagtgata ttaaggaagg tagccaatca 3600 gaccaagaaa ttcaggaagt cttggaacta ctggtcagtg gagagacaca aaagttgcca 3660 atcacatatc gcgtgattgc tagcgagctt tgtgagctgg aaggggttct tcttcgagga 3720 gatcggatcg ttgttccggt ttctctcaga aatagagttt tggtaactgc tcatgaaggt 3780 catccaggca ttacgatgat gaaaaaccac ctgaggtcca atgtatggtg gcccaaaatg 3840 gatgctcatg tggaacaatt cgtgaaaaac tgcaggggtt gtactcttgt tggtgctcca 3900 gaacctccgg agccgttgct gcgaagccag ctgcctagtt ctccatggca tacgatagcg 3960 ttggattttc ttggtccttt gccagaagga cagcacctcc tagtagtaat cgactgttat 4020 agccgattca ttgaggtatg cgaaatggat acaaccacta ccagcgatgt cataagggaa 4080 ttatccataa tgttcagtcg ttacggaatt ccactttcca tgaaagccga taatgcccct 4140 caacttagtg gagagtgcgc tgagttgaag gagttttgcg acgcgaacgg tgttaaactc 4200 ctgaacacca tcccctactg gccccaatct aacggggaag tcgaacggca gaaccgttcg 4260 attttgaaac ggcttcgcat agctcaagag cttggacgag attggcgaca ggagctgtat 4320 ttgtacctac tgacgtacca ctctaccaag catccgacca cgggcaaatc tccaggagag 4380 attatgtttg gccgccgcat taaaagcaaa ttacccacag tttcatcttt tcaagaagat 4440 gcaggagtga gagagaggga tttggtaata aaggagaaag ggaaactata cgcagacaaa 4500 agacgtgggg cgatggaaag ctctattcaa gagggtgacc atgtactggt caaacggatg 4560 aggaagtcta ataaacttga tgcagatttc agcaatgaag agtttattgt tcgtcgtaag 4620 acaggtaccg atacagtgat ccagtcaaag gaaactggta agcagtatag aagaacttca 4680 gctcatctta agaagattag agaccgttct gattcatctt cctgtgaagg acaacattcc 4740 aagccattga tggatgtaga taccagccct gccaaggacg aggaacctgt tgattcattt 4800 cctgcactag ccaaacgatc gagacaacca ccaatgaagc atcgcgacta cgtgccacac 4860 taggacacct acagaagttt aaatgtatgt aaaatccttc attaaacatg aattcaatta 4920 catgaatgaa taaaactgtt tcgtttgaac taattgttaa agcccttgca aggtattcaa 4980 ttttctttat tcaaaggggg ggt 5003 // ID Kiri-22_AAe repbase; DNA; INV; 4509 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-22_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4509 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 717-717 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 273..1049 FT /product="Kiri-22_AAe_1p" FT /translation="MMSRRDTTVSKRVGNVKTGEVYNDDTDEMTISQLASS FT ISEQLTNTKREISQQLYDEVRGLKKTIKTDLEEMRSALEKSVSELSRCVDL FT NKESIQHNAHAISRSHLRNDLVISGVPFVQGENLREYFEKWCQALGFNNND FT HPSVDIRRLHKSVMVPGKSYMILVQFAITNHRKDFYFKYLQSRSFTLDQLG FT FSSRDRIFVNENLTVTARSIKSKALLAKKNGKLYSVITRDGIIYVKKTSES FT ESRRVETEDELLKITQTC" FT CDS 1527..4352 FT /product="Kiri-22_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MASMILNNASTNICLPGAVVKAALIPEKLSICCLNMQ FT SMCARQMTKFEEFSQILNISEVDVACVCETWLNETKSSDMIKVEGYHTLRS FT DRVGKIGGGLLIYVKKSLKYKLLECSFVETRRKTIEFMFVELSTASDKILL FT GLFYNHPDLDCSDLISDKMIEYGTHYNNLILIGDFNTNIMKASPKSNRFKD FT TLMSLSLFNVGLEPTFFHPTGASQLDLMLTDNVENVLRFNQISVPNISHHD FT LIFASLDFDTTKTNNDVFYRDYKALDPEIVLNKFNESDWERFYSITNPDVL FT IDMFNNTLNSIHEECVPLRKVRKSKNKVNPWFNSQIQIAFIDRDRAYKKWK FT HSADSDDFARYKILRNLSNRLVTQAKRSYYNAQFSGEMTTKDFWHRIRHTG FT LGKPNSNIESDFNADEINASFQRHFTSQIRTITHDPRNNNDRRFAFQPVRI FT FEIINAVHEVKSNAIGLDNVPIKLIKSLLPLLLAPLSHIFNNIIESCVYPQ FT VWKLTKIIPFRKKSNVCSLNNLRPISIISAISKAFERILKNQICCFIHENN FT MLSSLQSGYRSGHSTKTAMLKVCDDIGVVLDRDSKVVLVLLDFSKAFDTIS FT HSIMCHKLENLFNFSRNAVILIHSYLNNRQQAVFCNNTLSSFLPVSSGVPQ FT GSVLGPILFSLYINDLPNVVKYCSLHLFADDVQLYLDCTKKSIEEITRLVN FT FDLEAIRFWSLENTLKLNATKTYGIMISRETNVEKPLLRIDNESIEFVDSA FT NVLGFTIQNNLKWDKYVLKQCGTIYAYLRVLYANGNLLNRSTKIKLFKSFI FT LPHFISSDFLIGSTSVHVQSRMRIALNSCIRFVFNLRRLDPVSHLQPMLLG FT YSFQNFIKARCCILLHKISTTKMPGYLYDKLRPFHSPRAKQFIIPTHFSSA FT YGKTFFVRGVVLWNSLPNYMKNQQSVSVFKKQCQEYFN" XX SQ Sequence 4509 BP; 1438 A; 860 C; 832 G; 1379 T; 0 other; ttgagtgact gtggtatgga gtggtagtaa tattcttgct cgcgatgcta attgcttgaa 60 atcgtgaaaa acgctctaaa tatcctccta gtgaagttag atactcacgt catattatat 120 accacaaagg gaaagcggtg cacctagcca aaattcttat atctctgtga tattggctgg 180 aaaataatcg tcagaatcac tgtagttatc gtcgacgcca ttacgctgct acttgtttcg 240 atatccaacc gaagagaact gctgagttcc aaatgatgtc gaggagagat acaacagtgt 300 ccaagcgcgt gggcaacgtg aagactggtg aggtgtacaa cgacgatacc gacgaaatga 360 ctatctccca attagctagc tcaatatcgg agcagctaac caacaccaag agagaaatat 420 cgcaacaact gtatgatgag gttcgtggtc ttaaaaaaac catcaaaact gatttggaag 480 aaatgcgctc tgcactggag aaatcggtat ccgagctaag ccgctgtgtt gatctaaaca 540 aggagagcat acaacacaat gcccatgcta tatcacgttc tcatctacga aatgatctcg 600 ttatcagcgg agtaccattc gtgcagggag agaaccttcg cgagtacttt gagaagtggt 660 gccaagcatt gggatttaac aacaatgatc atcctagtgt ggatatccgc cggctgcaca 720 aatcagttat ggttcccgga aaaagctata tgatactggt ccaatttgcg atcacaaacc 780 atcgcaaaga tttttacttc aagtatttac aatcaagatc gtttacattg gatcaactag 840 gcttctcttc gagggacagg atttttgtca acgaaaacct gaccgtcact gctagatcta 900 tcaaaagcaa ggctctcttg gccaagaaga atgggaagct gtattcggtc atcacgcgag 960 atgggattat ctacgtcaag aagacctcag agagtgagag ccgccgggtg gaaaccgaag 1020 atgaactact gaagatcaca caaacatgct agacctttcc taataagcca tctctttcct 1080 tgttgatacc atccttgaat tatcccatgt ttccaatcct aaaagtcaag atttaattaa 1140 aaactgcctt tcctgcgaag tggcctctgt ccttccaatc cttcctgact atagtccatg 1200 tatccactcc taaaagttgt tgctgttgtt gctgttgggt tggatcggag aggatgacat 1260 cattgctgtt gctgctgctg ctttgtgtgc aattcgtaat atgaaattta atgaatggaa 1320 gaaccatacc gtacgtttgt gcaaaattat actcttgttg aaaattgttt gcattgtaaa 1380 ttcactagat tagttgcatt tcaagtctat tggcatcgcg aaaacgaaaa tggaaattaa 1440 tatgtttaat ttagtagact gtagttatat gattcttgac cagttgattg ctttgcattg 1500 ctcattgaac acaattatca gagataatgg ctagtatgat cctaaacaac gcctcaacaa 1560 atatatgttt gccgggagct gttgtcaagg cagctctcat tcccgaaaag ctttctattt 1620 gttgcttaaa catgcagagt atgtgtgcta gacaaatgac aaaatttgaa gagtttagtc 1680 agattttgaa tatctcagag gttgacgttg catgtgtttg cgaaacctgg ctgaatgaaa 1740 cgaaatcaag tgacatgatc aaagttgaag gttaccacac tttgagaagt gatagagtgg 1800 ggaaaatcgg tggaggactc ctaatttacg tgaagaaatc tttaaaatac aagcttttag 1860 aatgttcctt tgtagagacc cgcaggaaaa ctatagaatt catgtttgtt gaactttcca 1920 ccgcaagtga taaaatatta ttaggtctgt tctacaacca ccccgacttg gactgttccg 1980 atcttattag tgacaaaatg atcgaatatg gcactcacta taataatttg attctgatag 2040 gagatttcaa tacaaatatt atgaaagcaa gtcccaaatc aaatcgcttc aaagataccc 2100 ttatgtctct ttcactattt aacgttggac tggaacctac cttttttcac cccactggtg 2160 cttcccaact agatcttatg ttaacagaca acgtagaaaa cgttcttagg ttcaatcaaa 2220 tcagtgttcc caacatatcc catcacgatc ttatcttcgc ttcgcttgac ttcgatacta 2280 cgaagaccaa taatgatgtt ttttatagag actataaggc acttgatccg gaaattgttc 2340 tcaacaaatt taacgaatct gactgggagc ggttttattc aatcactaat cctgatgtat 2400 taattgatat gtttaacaat actttgaatt caatccatga agaatgtgtt cctctacgta 2460 aagttagaaa atctaagaat aaggttaacc cttggttcaa cagtcaaatc caaatagctt 2520 tcatagatcg agatcgagct tataaaaaat ggaagcattc tgctgattca gatgattttg 2580 cacgctataa aatacttaga aatttatcaa atcgactagt gactcaagca aagcgaagtt 2640 attataatgc gcaatttagt ggagagatga caacaaaaga cttctggcac agaattcgtc 2700 atacaggatt aggtaagcca aattccaaca tagaaagcga ttttaatgca gatgaaatta 2760 atgcttcatt tcaaaggcat tttacatcgc agatacgcac tattacgcat gatcccagaa 2820 ataacaatga cagacgtttt gccttccagc cggtgcgaat tttcgagatt attaatgcag 2880 tacatgaagt aaaatcaaat gcgattgggc ttgataatgt tccaataaaa ctgataaaga 2940 gtctactacc attgttgttg gcaccccttt cacatatttt caataatatc attgaatcct 3000 gtgtttaccc tcaagtgtgg aaactaacaa aaataatccc attcaggaaa aaatctaacg 3060 tttgttcatt gaacaactta cgacccataa gtataataag cgcgatatca aaagcatttg 3120 aacgaattct gaaaaaccag atatgctgct ttattcacga aaataacatg ctatcatctt 3180 tgcaatctgg ttaccgtagt ggtcatagta ccaaaactgc gatgttgaaa gtttgtgacg 3240 atataggcgt cgtattagat agagatagta aagttgttct tgtactttta gacttttcga 3300 aagcatttga tacgatctcg cactctatta tgtgtcacaa attagaaaat cttttcaact 3360 ttagcagaaa tgcagtaatt ttaatacatt cttatctcaa taatcgccaa caagctgttt 3420 tctgtaacaa tactctttcc agtttccttc ccgtttcctc tggtgttccc caagggtctg 3480 ttcttggccc tattcttttt agtctttata ttaatgattt gccaaacgtt gtaaaatatt 3540 gttctcttca cctttttgca gatgacgttc aattatattt agactgcacc aagaaaagta 3600 ttgaagaaat cacaagactg gtaaattttg acctcgaagc aattcgattt tggtcactag 3660 aaaataccct caaactaaac gcgacaaaaa cttatggtat aatgatcagt cgagaaacca 3720 atgtagaaaa acctttactg agaatcgaca atgaaagcat cgaattcgta gattcggcaa 3780 acgttctagg gttcacgata cagaataatc tcaagtggga taaatatgtt ttgaagcaat 3840 gtggtacaat ttacgcttac cttcgagtat tgtatgcaaa cggaaatttg ttaaatcgct 3900 ctaccaaaat aaaactattc aaaagcttta tcttgcctca tttcatatcg agcgatttct 3960 taatcggctc tacctctgtt catgttcaga gtagaatgag aatagctttg aattcttgta 4020 ttcgctttgt ttttaatctc cgtagattag atcctgtttc gcacctgcaa cctatgcttc 4080 tgggatactc gtttcaaaat tttataaaag ctcgctgctg catattattg cataaaataa 4140 gcacaaccaa aatgcctggg tatctctatg ataaactccg accatttcat agtccaagag 4200 caaaacaatt tatcattcct acgcactttt cctccgctta tggcaaaaca ttttttgtta 4260 gaggtgttgt tctgtggaat tcattaccga actacatgaa aaatcagcaa tctgtgtctg 4320 tgttcaaaaa gcaatgtcag gagtatttta attagtcaag aatcaaaatt gtgtatttta 4380 ttgtattttt gtttatgttc tttgatgaga tgtttaaaaa tccatgccag tatgctgtgt 4440 agcatttaaa aagacattag tcttaagcta tcggagaaat aaaaatcaaa tcaaatcaaa 4500 tcaaatcaa 4509 // ID LOA_Ele3C_AAe repbase; DNA; INV; 5807 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW LOA_Ele3C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5807 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1423-1423 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. The consensus is ~77% identical to LOA_Ele3 and ~76% CC to LOA_Ele3B_AAe. XX FH Key Location/Qualifiers FT CDS 335..1975 FT /product="LOA_Ele3C_AAe_1p" FT /translation="MQLLLHYGRVMMQLTFISLATRGTNKMNTSVSKPNKG FT DDCLTQSEKMEVENSLSQLGLTEDQLLCSSQEEMESTPSQKPNAPGKSSAG FT HSEDPGIPSSIESVNTDDDKDDGIKVVINTPKPVPSKKGPKMDDPMDGEEA FT EGNVTPRKNLTRSQRKQLKALRESGLSRNEALSKILANEGTVLASSKRTRN FT DLDKSATVEEDSKPKKMKHQLDPRERAEKSSSSVPQNGGSKRQATGQSYSD FT MTKRVKVGIIPKDFPTAQLTTAQLDVLQDALLLKVEQQRDEPLKPKFCNLV FT YKSGYMVLICKDQETAEWVKEITPSVNTWEGAELVALNEEDIPRQELLRAF FT FPQSSTFADERIKALIESQNGLKTTNWRIVKRSILNDIHVEWIFTVDGSSM FT DMLSKSKFILNYRFGEIQLRKIKRTPSVSNHKPNKELAQENSEEAPDSNSG FT KRNAISRPKTTMKASVSAPKGKVPSSASIPCSSGTNPKVSKTSSGKEMSEV FT LTTGAKLAGLDVMGGKNKVLKPNPKQNRDDPQHPKKGDARLDKSGTPKYGA FT " FT CDS 1908..5630 FT /product="LOA_Ele3C_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MIRNIQRRAMLAWIKAELLNMALKAVQVNLHHAQSAT FT DVLCRRFTKEKLTVALIQEPWVNKTRILGIQLNSCKLVYDDSQLSPRAAIL FT IRNNTKCFPITEFIKKDIVAVRMEVPTARGKSEILMVSAYFPGDAENIPPP FT EIAAIVHYSRIHNIPFVIGCDANAHHTVWGSTDINTRGEYLLQFLSSKNIN FT ICNVGDKPTFSNSIRQEVLDLTLCSPSISDKIKNWHVSDETSMSDHKHIIF FT DWEGGLTIERTFKDPKKTDWETYSAILQSESYIIENNIKSIMQLESASTCI FT KNKILNAFQESCPTRTTSSSRDVPWWNKTLDKLRKTARREFNRAKRTSDWS FT LYKKALTDYNKELRRSKRKSWVSMCENIEMTPVVARLQKTLSKEHSNGLGS FT IRRVDGSFTVEPSDTLSEMLRIHFPDSIPQSRISEEGTEIVVSVPQQTISL FT VSGTKRDAIKVAKEAFTCARVERAVRSFEPFKSAGMDGIFPALIQKAEKTL FT IPHMVEIFRASLILGHVPDDWRQVRVVFIPKTGKKDKTNPKAFRPISLSSV FT MLKIMEKVLCEYINSKFMKAMPLSQHQFAYQSGKSTISALHSLVNKIEKTL FT QAKEIALVAFLDIEGAFDNASYSSIGSAMLRRNFDPCIATWVHAMLANRQI FT SSELSGSCVTVKATRGCPQGGVLSPLLWSLVVDELLDSLERRGFEVVGYAD FT DVVIIVRGKFEDVIFERMQLALDHTFSWCQREQLGINPSKTTIVPFTKRRK FT VQLRTLLLNETQLTCSKEAKYLGVILDSKLNWNSHLQTVVQKGLNSLWVCS FT RALGKKWGLKPSMIMWIYKTIVRPRITYASLIWWPKTKEATARAKLNKIQR FT IACIAITGAARSTPTSALDAMLNLPRLDQFIKLDAEKSALRLKRTKTLLSG FT DLTGHLSILNEFSINPIVEKCSDRMEMVVNYDIPYKVIMSSRQEWEEGGPN FT IPPGSIKFYTDGSKMNNLAGSGVYGPRTKTSIHLGQWSTVFQAEIYAILEC FT ALLCLKRKYRHASICILSDSQAALQALNTYTCNSKLVWECILALKKLALCN FT RVNLYWIPGHAGLEGNEIADELARNGSANRFTGPEPFFGISDSFLKMELNN FT WLVSQTSSNWDASPNTSQSKRLVNINPKKTQQLLGLSKRDLSLYTGLVTGH FT CPSRYHLQKIGVVQNSNCRFCDDERETSEHLICYCSAQIHCRSRIFGKPFL FT EPADIRIATPSNVIGFIKLIAPDWGNPHTAA" XX SQ Sequence 5807 BP; 1858 A; 1272 C; 1296 G; 1381 T; 0 other; tttaacaccg ctgacgctat gacggatccc tgtagtaact tgtccgtatg aacatgaaac 60 tattgagaac cagagctacc ggtccaaagc ccggataaag tgtggatggt tgtgatggtt 120 gttatccatg gccatcatgt aaagaaaatg gatacagtgg gggaccgaaa tccgtttagt 180 ctcgaaatcg tagacgtacg aatctcggcc ccccacaatg aaccccaacg ccaaggtgtg 240 atgcgacccg tgccgatgga agaatggttg agggggtttc ataaatgctc aatcacacac 300 ggagcctgaa gagcaccagg gcgaactctt cagtatgcag ctcttgctgc attatggcag 360 ggtaatgatg cagttgacct ttatttccct agcgactcgt gggaccaaca aaatgaatac 420 cagtgttagc aaaccaaaca aaggtgacga ctgcctcaca cagtcggaaa agatggaagt 480 ggagaactca ctaagccagc ttgggcttac agaagaccag ctactgtgca gttcacaaga 540 ggagatggaa tctactccgt cccagaaacc caacgcaccg ggcaagtcca gcgcgggaca 600 ttcagaagat cctggtatcc cctcgtcgat tgaatcggtg aacacggacg acgacaagga 660 tgatggaatc aaggtcgtta tcaacacccc taaacctgta ccatccaaaa aaggacccaa 720 aatggacgat cctatggatg gtgaggaagc tgaaggcaat gtaactccca gaaaaaacct 780 aacgcgctca caaaggaagc agctcaaagc tctccgggaa agcggactaa gccgaaacga 840 ggctttatcc aaaatcctgg caaatgaggg tactgtgctt gcttcttcta agcgaacgag 900 aaacgacctt gataaatctg ctacggttga agaggattca aagccgaaga agatgaaaca 960 ccagttagat ccccgagagc gcgctgaaaa atcctcaagc tcagtgccac aaaatggtgg 1020 gtccaaacga caagccactg gtcagagcta cagcgatatg accaagcgtg tgaaggtcgg 1080 gataataccc aaagactttc caaccgccca gctcacgacg gctcaactag acgttctaca 1140 ggatgctctg ttgcttaaag tggaacaaca gcgtgatgag ccactgaaac ccaaattctg 1200 taaccttgtc tacaaatctg gctacatggt tctaatttgc aaggatcaag agacagctga 1260 gtgggtgaaa gaaataacac cttcagtaaa tacctgggaa ggtgctgagt tggttgcact 1320 gaacgaggaa gacattcccc gccaagagct tcttcgagca ttcttccctc aaagctcaac 1380 cttcgcggac gaacgcatta aggctctcat agagagccag aacgggttga aaacaacgaa 1440 ttggcgtatc gtgaaaaggt ccatcctcaa tgacatccat gtcgaatgga tctttacggt 1500 agatggatcg tcaatggaca tgctgtcgaa atccaagttc atcctcaact accgatttgg 1560 agaaatccag ttgaggaaaa ttaagaggac accctccgtt tccaaccaca agccgaataa 1620 ggagctggcc caagaaaatt ctgaggaagc ccctgattca aacagtggta aaaggaacgc 1680 aatttctcgc ccaaagacca ccatgaaggc tagcgtgtcc gctcctaaag gaaaagttcc 1740 gagctcagct tcaattccct gctcaagtgg taccaaccca aaggtatcta aaactagcag 1800 tggaaaagaa atgagtgaag tcttgaccac aggggcgaaa ctcgcaggcc tggatgtgat 1860 gggtgggaaa aacaaagttc tgaaacctaa cccgaaacaa aaccgtgatg atccgcaaca 1920 tccaaagaag ggcgatgctc gcctggataa aagcggaact cctaaatatg gcgcttaagg 1980 cagttcaagt gaatctccac cacgcacaga gtgcaacaga tgtactctgt cggagattca 2040 caaaagaaaa attgacggtg gcgcttattc aggagccgtg ggtcaataaa accaggatac 2100 taggtattca actaaactca tgtaagttgg tatatgatga tagccagcta tctccaagag 2160 cggctatctt aatacgcaat aacactaaat gttttcctat tacagaattc attaaaaagg 2220 acatcgtggc agtcaggatg gaggtgccca ccgctagggg aaaatctgag atcctcatgg 2280 tttcagcgta ttttcctggc gatgcggaga atattcctcc accagagata gctgcgattg 2340 tccattacag ccgtatccac aacatcccgt tcgtcattgg ttgcgacgcg aacgcgcatc 2400 acaccgtatg gggaagtaca gatatcaaca ccagaggtga gtatctttta cagtttcttt 2460 cctccaaaaa catcaatata tgcaatgttg gcgacaagcc tacattttca aactccattc 2520 gacaggaagt gttggatttg actttatgta gtccatctat ttctgacaaa ataaaaaatt 2580 ggcatgtttc tgatgaaaca tcaatgtcag accataaaca cataatattc gactgggaag 2640 ggggtctaac gatagaaaga acgtttaaag atcctaagaa aactgattgg gaaacctatt 2700 cagcaattct ccaatctgaa tcatacataa ttgaaaataa tattaaatcc ataatgcagt 2760 tggaatcggc gtccacctgc attaaaaaca aaattcttaa tgcttttcaa gagagttgtc 2820 cgactagaac aactagttcg agtagagatg ttccatggtg gaacaaaact ctagataagc 2880 tcagaaaaac agcgcgtaga gaatttaacc gcgctaagcg tacttccgat tggagcctat 2940 acaaaaaagc tctaacagac tataataagg agctaaggcg ctcaaaacgg aaatcatggg 3000 ttagtatgtg cgagaacata gagatgaccc ccgtggtagc caggcttcaa aaaacactct 3060 caaaagaaca ctccaatggt ctgggtagta tccgtagagt cgacggttca ttcactgtag 3120 agccaagcga tacattgagt gaaatgttaa gaattcactt ccctgactca attcctcaat 3180 caagaataag tgaagaaggt actgagattg ttgtttcagt tccacaacag accatatctt 3240 tggtttctgg gacaaaaaga gacgcaatta aggttgccaa agaagctttt acttgtgcca 3300 gagttgaaag ggctgtgaga tcttttgagc cattcaaatc tgctggcatg gatgggattt 3360 tcccagcgtt aatccaaaaa gcggagaaaa cgctaatccc acacatggta gagattttta 3420 gggcgagctt gattttaggg catgttccag atgattggcg tcaagttcga gttgtcttca 3480 ttcctaaaac aggtaaaaaa gacaaaacaa atcccaaagc atttaggccg atcagtttgt 3540 catccgtgat gcttaaaatc atggaaaagg tattgtgcga atacataaat tctaaattta 3600 tgaaagcaat gcctctttca caacaccaat ttgcgtatca gagtggcaaa tccacgatct 3660 cagcactaca ttcgttagtc aataagatcg aaaaaacact acaagctaaa gaaatcgctc 3720 ttgtagcatt tcttgacatt gagggtgcat tcgataatgc ttcttattcg tctataggat 3780 cggcaatgtt gaggaggaat ttcgacccat gcattgctac ctgggtacat gctatgctag 3840 ccaatcgaca gatctcctct gagctaagtg gttcgtgcgt cactgtaaag gctacaaggg 3900 ggtgcccaca aggtggggta ctttcaccgt tgctttggtc actggtggtg gacgagcttc 3960 tagatagctt agaaaggaga ggcttcgaag tggtaggata tgcagatgac gtggtcatta 4020 ttgtacgagg caaatttgaa gacgttatat tcgaaaggat gcagctagct cttgatcata 4080 ccttttcctg gtgtcaacga gagcaactag gaataaatcc ttccaaaacc accatcgtac 4140 ctttcacaaa acgtaggaag gtacaattga gaacccttct cctaaacgaa acacaattga 4200 cttgctcaaa ggaagccaaa tatcttggtg tcatacttga ctccaagcta aattggaatt 4260 cacatcttca aacagtggtg caaaaaggcc tcaattcact ctgggtatgc tccagagcac 4320 ttggtaagaa atggggcctg aaaccaagta tgattatgtg gatctataaa accattgttc 4380 gtcctagaat aacttatgct tctcttatct ggtggcctaa aacaaaagag gctacggcaa 4440 gggcaaagtt aaataaaatc caacgtattg cctgtatagc aattactggt gcagcacgca 4500 gtacacccac ctctgcctta gacgcgatgc ttaatctgcc ccggctggat caattcataa 4560 agctggacgc tgagaaaagt gcgttgaggc tgaaacgaac aaaaacccta ctatcaggtg 4620 atcttacggg tcacctcagt atattaaatg aattttctat aaatccaatt gtagaaaagt 4680 gtagtgacag gatggaaatg gtagtcaact atgacatacc ctataaggtg atcatgtctt 4740 ctcgtcaaga atgggaagaa ggtggaccca acatccctcc agggtctatt aaattctaca 4800 cagatggttc caaaatgaat aatctggcag gttctggagt ctacggaccc cgaacaaaaa 4860 cctcaattca cttaggacag tggtctacag tatttcaggc agaaatatat gctatcttag 4920 aatgtgcgtt actatgcctg aagaggaaat acagacacgc aagtatatgt attctctccg 4980 acagccaagc ggctcttcag gcgcttaaca cttacacgtg taactcaaaa ctagtgtggg 5040 aatgcattct agctttgaaa aaactagctc tctgcaatcg agttaactta tattggatcc 5100 cagggcatgc gggtctagag ggaaacgaaa ttgctgatga gctggccaga aacggatctg 5160 ctaataggtt cactggcccc gagccattct tcggaatatc agacagtttc ttaaaaatgg 5220 aactgaacaa ctggctggtt agccagacct catcaaactg ggatgcatct cctaatacga 5280 gtcagtcaaa aaggcttgta aatataaacc caaagaaaac ccaacaacta ttaggtctca 5340 gcaaaaggga tctcagttta tacactggtc tagtaacagg acactgcccc agcagatacc 5400 atttacaaaa gatcggtgtt gttcaaaact ccaactgtcg cttctgtgac gatgagcgag 5460 aaacatcaga acacctcatc tgctattgca gtgcacaaat ccattgtagg tccaggatat 5520 ttggtaagcc cttcttagag cctgccgata taaggatcgc aacccccagc aatgttatag 5580 gctttattaa gctaattgca ccagactggg ggaatcctca cactgcagct taggaccatt 5640 atttctcaat aataaggtgt tcatgagctc agaagtacat agatcattta gtgacataca 5700 tgtcactgga tgatgtatgt actatacata cagtaaaagg gtatatcaca atagttcaag 5760 ttaattggac gcagtgattc aacacccgac aaagaaggaa aaaaaaa 5807 // ID hAT-78_HM repbase; DNA; INV; 3746 BP. XX AC . XX DT 11-MAR-2009 (Rel. 14.03, Created) DT 11-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-78_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3746 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(3), 653-653 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 680..2647 FT /product="hAT-78_HM_1p" FT /translation="MKQLKLKLSQQHLSSPNLTIIRHHLVVPSRRNTTPVS FT DSKKKPQTRRATELFGIGAAICADDALITGIRLPTCLQVLRCMMYHSKEAT FT HGKRPGSLGATSRFTSAKIVLKQVKALYEKANIPMVTERRACEKIVKLLDD FT NNKLRSIDKSRRETPATQHKLLEMQSMLATTFQLWPANVDSLMNNDEDLAF FT LQSMKTDRAASFGVFDKALAQKINRRHCRDVAALGRLKRARGEIEASTKTV FT SPEVVTDSECSDESSESASEDDMESEFRSPSKEMELQKPNRIKPKGTLSFI FT PPDLLSRPNLVSLATRLKMTPTQQSAFTQGVIAESGGDVSMVASSYATADR FT ARRKVVREISIEIHNSWEPPKLCTLHWDGKLTPTLTNNRVTEERMTVVVGD FT ASQLKLLGVPSYFKGTDQSVGEIIAKLTMNLMIEWHCNDRIVNMTFDTTSS FT NTGHLTAACIAIQDKLERAVLWSGCRHHVGEVLLSHVFTDLKIEASKSPEV FT TVFTRLRSNWNLVPHDSSQILPFCPTDHDAEAQQLLSKMKDEVIARAIEDV FT DCLRDDYREFTELTLVYLSSSKVEVKFRRPGALHKARWMAKLINGLKIALL FT EKQILRNYQQELSLLVTKCRRYEHSLRLLLMFTVCGGSHVGTRLIPHTMIY FT SFTNVY*" XX SQ Sequence 3746 BP; 1202 A; 716 C; 736 G; 1091 T; 1 other; tattagggtg catcacaaaa aaattttttt taatattgat ytgtctatta tctcaagttg 60 ttccaatcac caaaagatgc aatgtgttaa aaaataaatt ttattagaca atatttagag 120 gtcccccatg ccctaacaag tttcacctta tttaccaaaa ttaagtgcta aataattttt 180 ttataattat tagctttgtc tttatattta tgttctttgg ctaccatgat tgaaaatgta 240 attttataaa catagtcaag tctgctcaaa aaatcaaaat aaaatttaat tatatattag 300 ttaacccagt ccagtgacaa ttggtaaatt tacaattcca ttaaatatat taatttctgc 360 attattttgt gtatatacat attacatgca gtaaatcagt aattcattgt ttatgtagat 420 actataattg tataatgtat actgtatata tatagttaaa cctatgagac atgttaggca 480 ttacctaagc aatgtgaaat atctgtaatg ttgtgcaatc caataaataa taattcaaat 540 aatattccaa aaaatgaata ataaattaca caaaatattc aaataatttg ctaatttgat 600 aaaccaagtt cataacttaa taacaacagt tacaacacta ttctcttttt ttagaaccct 660 aattccttaa aaaacatgaa tgaagcagtt gaagctgaag ttgagccaac aacacctaag 720 cagcccaaat ttgacgataa ttcgtcatca tctggtagtg ccaagtcgcc gaaatacaac 780 tcctgtctcg gactcaaaaa agaagccaca gactcgtaga gctacagagt tgtttggtat 840 aggtgctgca atctgtgctg atgatgcact aatcactggt atccgattgc caacctgctt 900 gcaagttttg agatgcatga tgtatcacag taaagaggct acccacggta agcgaccagg 960 atcactcggt gcaacatcaa gattcacatc tgccaagata gttctgaaac aggttaaagc 1020 tttgtatgaa aaagccaaca tcccaatggt caccgagcgt agagcatgcg aaaaaatagt 1080 gaaacttctt gatgataaca ataaacttcg ttcgatcgat aaaagtcgcc gtgaaacacc 1140 tgcaactcag cacaagcttc ttgagatgca gtccatgttg gcaacaactt tccaactctg 1200 gccggccaac gtggatagtc ttatgaacaa tgatgaagat cttgcattcc tgcagtcaat 1260 gaaaactgat agagccgcaa gttttggggt atttgacaag gcgcttgctc agaagataaa 1320 tcggcgtcat tgtcgtgatg ttgcagcatt aggacgcctg aagcgtgctc gtggtgagat 1380 tgaagcatca accaagactg tgtcgccaga agttgtaact gactcagaat gtagtgatga 1440 gtcaagcgag tctgcttctg aagatgacat ggaatcggaa tttagatctc catcaaaaga 1500 aatggagcta caaaaaccaa accgtataaa accgaaagga acattgtcat tcataccacc 1560 agatttgcta agccgaccaa accttgtgtc attagcaaca cgtctgaaga tgacgccaac 1620 acaacaatct gcattcacac aaggagtcat agctgagtct ggtggcgatg tatctatggt 1680 tgcttcttcg tatgcaactg cagatcgcgc acgacgcaag gttgtacgcg aaatttctat 1740 agaaattcac aatagctggg aaccaccaaa actctgtact ctgcattggg atggaaagct 1800 gacaccaact ctaacaaata atcgtgtcac tgaagaacgt atgacggtgg ttgttggtga 1860 tgcatcgcag ttgaagctac tgggagtgcc tagttacttc aagggtacag atcaatctgt 1920 tggagaaatc attgccaagc taacaatgaa cctaatgatc gagtggcatt gtaatgatcg 1980 aattgttaat atgacattcg acacaacgag ctcaaataca ggtcacctaa ctgcagcttg 2040 cattgccatt caagataagc tggaacgtgc tgtcctctgg tcaggatgtc gtcatcacgt 2100 tggtgaagta cttctatctc acgtgtttac tgacctcaag atagaagcat caaagtcacc 2160 agaagtcaca gtgttcacaa ggctacgaag taactggaat ttagtgccac acgactcatc 2220 tcagatactg ccattttgtc caactgatca tgacgcggag gcccaacagc tgctgtcgaa 2280 gatgaaggat gaggtgatcg ctcgtgcaat tgaagatgta gattgcctgc gtgatgacta 2340 ccgtgaattc acagagctca cacttgttta tttatcttct tccaaggttg aagtaaaatt 2400 ccgaaggcca ggtgctcttc acaaggctcg ttggatggca aagctgatca acggcctgaa 2460 aattgcactt ctagagaaac aaatattgcg gaactaccag caggaactat cactactcgt 2520 caccaagtgc cgaagatacg agcattcgct acgtttatta ctcatgttta cggtgtgtgg 2580 tggctcacat gtaggaacac ggttgattcc ccatacaatg atttacagct ttacaaacgt 2640 ttactagagt acgagactgt ggacaaggtg atttcgcagt cagcaattcg agctctgaat 2700 cggcacctgt ggtatctcac tgaggaaatg gtgccacttg cactgttcag taagcttgta 2760 ccatcgacgg aacgacgagc tcttgcagat gccttgctca aattgaaacc atcgagtgat 2820 ctgcaagctc caatgaatcg atttggcaac ggatggggca agccacattt tccattgtcc 2880 attgatcgta gaacacaact gagtgatctg gttggagttg actcctggtt tacagtttac 2940 cgtctacaat tagacaccag ctttctggaa ctttcggtcg atgaatggga aaagacacct 3000 gcttacattg ccagtgtcga aaattgtgca gcagtgaacg tggtgaatga ttgtgccgag 3060 aggggtgtca agctggcatc ggactttgtt gaaactgcac gatcggatga gcattatcaa 3120 aacgttcttc aagtggtcga gaaagatcgg aaagagacac caaatttgcg acgcaaacga 3180 tgcaagaagg aataaacatt aacagttcat ctaattttgg attcatttga catctttcta 3240 ctactattaa aacaaataaa gattcattac tatagttaat tagtcattaa ttagacctta 3300 aactgcaatg aggattgaat tttattaaaa aagaactgac tattagtcat agtagtcaat 3360 gatttttgca ttttttggtt aatgattttg tgctttttgt agaaaaaatt gaatagaaaa 3420 tgattttagt gatctcaatt tggttttatc tgttcagttg ccttcattat acaattaaaa 3480 caatttaggt ttattactat agtcattatt ctcattaaag gacaataagt attgaatttt 3540 atgaaaaaac ttactagtcg atcgactttg tgtcttgcgg ttggtttcct ggatttcaaa 3600 ttctcatggg ggacctctaa acttactcag atttggaaaa tatttcactc aagttattat 3660 tattcacatt ggaacacatt ggaaacacag acagctcaaa aataaacaac taattttttg 3720 ccctaatgtg gtgatgcacc ctaata 3746 // ID Gypsy-41_OD-I repbase; DNA; INV; 5184 BP. XX AC CABV01004654; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_OD_; KW Gypsy-41_OD-LTR; Gypsy-41_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-5184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004654; Positions 2762 7945. XX CC Positions [2107-2493] - Reverse transcriptase CC Positions [3975-4451] - Integrase core CC 'AAAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 290..1357 FT /product="Gypsy-41_OD-I_2p" FT /translation="MADVKLIKEWAESTKAELDKMDRSDPNFKVKSDLYAN FT LLAKAVSAVGGATVKGAPSSNGDLTRDQIAALRSAETALSALEFTGRDFDL FT TAKFVACVDKIHDTYIEDDNALAGNFTKQVKMRFAENPYARLRAEAVQPSN FT WTLLKDWINTTYDCGLTSIQLLSRAIETDYKRGSCWKAFATSVDSRMLAAE FT KAVIAQIRKRKHEQNQSSDINDDCNAPAVTDVFAFFSASICADRLKSNCKH FT IHALMAQEWRQINNAADLACKAEFLLAQTGRSSDVFYSRQTQPASSNSPKN FT NQSEQKKGKNWREPCKWGKDCRKADNCKFYHGPGFTKKSSAFVAQKAKPEN FT DCELPVENSTSFV" FT CDS 1492..3720 FT /product="Gypsy-41_OD-I_1p" FT /translation="MSILPLSVIPTALHNLIKPCQMTFNGIGTTSAMGTIF FT GRLSFRATSHTFYNVRFYIVKQNMPCILGRDFFINNSELRAGHFALEGSGL FT KLTLQNGQQICVPWSKSPSPSGSLLKATTTAQLRKQLSEEHEITIETALFA FT GADLDKLINLVYEYRDIFNKDTDPIGEFKTSARIPTIEGLTRAQRERPIPK FT HQHDEVKEQIDQMAREGVIEPCPNSHGFNSPLIAVRKKNGKLRVCSDFKSS FT LNQVLTETTELWPLPAMDTLFANIQHGQKIFSSLDLSKAYWNIKIDERDRH FT KTCFTFDNKAWQYCRLPFGLKFSGDAFCKAISEMLTQVKLNDHYCNYVDDI FT LAFSADCASHLEVLEQIFAACRQFGARLGARKCSFGQSKVTFMGRNISAEG FT ISIPAENMEHILALKPPTTRKQLQSLIGNFCWLKNWVSANLGEPIASNCFS FT DVMSEITRLNKPGKKFIWSSEADFAFGQAKKRIATDKVFALPDFSQPFCLL FT TDASDRAAGAILMQKIDGRQRLISVSSKSFNETERRWSATERECFALVHGC FT EKFAYYLKGPLGFVCLVDHKALLAIDKKFLNNSKLQRWQCRLAEFKMTIQY FT VEGRSHVFADMLSRPFDTPEPAAITDDSCAGRFYKFSSDPTLKVYIPSWVM FT PSEQFKRSMLLEQTEQVSDLFTVRAVLSGELKPGAPILEMREIEQAQSEDC FT LVSAVHKLVASSTPLAKWSWPDDVYGCYNYKKHASLLSIHPAQ" FT CDS 3939..4994 FT /product="Gypsy-41_OD-I_3p" FT /translation="MQPSRPDQKHLYRASRPHEIIYCDFISMPSSNSGKRY FT CLTIICGFSRWLQVYPTSRCRSIDAARALMNYFLQFDFPKILSSDRGKHFD FT CEVLADLCQMMKIKQNLHVAYRPESSGVIERAHKTLKSALWGMVRDNPHLD FT WELTLPSVVSAMNRSVNSATKITPYKCLYGRDPSFNGIELDDNVSARDPES FT YARLTAEVLDRAHKFVRLAQEATDKLALERGKSKIIPEEITAGHYVMLKRA FT LAAAAKPTKQKWLGPFKVLQTDSVILQIDYDNKPTWVHRYHVVKAKLRPTH FT LDHDLVDLYDEPDPRSSSSDQNQGSPAASTSPASGSVESLRRSSRSRRPPD FT RLGYSRHSR" XX SQ Sequence 5184 BP; 1280 A; 1570 C; 1152 G; 1182 T; 0 other; taaagtggta ccagatctac tccgaaccta caagagacac ctcgccatcg tcaacttcaa 60 aattaaattt ttgttggaat ctacagtctt caaccttcac ctcctttcgc cgcagacaag 120 agtcaccaaa accaacgcca agccttccgg cacctcatct tagctggaac gtgtaatcaa 180 accgcgcaca gcgcgtcact cttttcagtt gccttacggc agcagtaaga gtgccctaga 240 aaggtacgag ctcttcaaag tacacaagac ctcgatttta tttcgcgcca tggcggacgt 300 caaattaatc aaagagtggg ccgaatccac gaaagccgag ctcgacaaga tggacaggtc 360 cgacccgaac ttcaaggtta agagcgatct ttacgccaac cttctcgcca aagccgtctc 420 cgctgtcggc ggcgcaactg ttaaaggcgc accttcttct aacggcgatc tcacgcgcga 480 tcagatcgcg gctctccgta gcgctgaaac tgctctcagt gcgctcgagt tcaccggtcg 540 agatttcgat ctcaccgcca aatttgtcgc ttgcgttgac aagatccacg acacctacat 600 cgaagatgac aacgcactcg ccggcaactt cacgaaacag gtcaagatgc gatttgctga 660 gaacccgtac gctcgacttc gtgctgaagc agtgcaacct tctaactgga ccttgctcaa 720 agactggata aacacaacgt acgactgcgg tctcaccagc atccagttgc tttctcgcgc 780 gatcgagacc gactacaagc gaggctcctg ctggaaagcc ttcgccactt ctgtcgactc 840 ccgcatgctc gctgctgaga aagcagtcat cgcgcagatc agaaagcgca agcacgagca 900 gaaccaatct tccgacatca acgacgactg caacgctcca gcagtcacgg acgtgttcgc 960 gtttttctcg gctagcatct gcgcagatcg acttaaatcg aactgcaagc atattcatgc 1020 gttaatggcg caagagtgga ggcagattaa caacgccgct gacctcgcct gcaaagctga 1080 gtttctcctt gctcagactg gtcgctcctc cgacgttttc tactcccgcc agacgcagcc 1140 agccagctcg aattctccga aaaacaatca gtccgagcag aaaaagggca agaactggcg 1200 cgaaccctgt aaatggggca aggactgtag aaaggcagac aactgcaagt tctaccacgg 1260 ccccggcttt acaaaaaagt cgtcagcttt cgtcgctcag aaggcgaagc ctgaaaacga 1320 ttgcgagctg ccagttgaaa attctacatc ttttgtctag ctgctgagac aaccacgtgt 1380 gctctctcag cagacctctt tacttgccac acgtacttta atctagatct tcaacttcgc 1440 atcaaaggct tcaatctcga tttccgcgcc ttgtgcgact ctggcgctga catgtctatt 1500 ctgcctctca gcgtcatccc gaccgctctt cataacttga tcaaaccatg tcagatgacc 1560 tttaatggca tcggcacaac ttctgccatg ggcacgattt ttggccgctt atctttccgc 1620 gccacatctc acacattcta caacgttcgc ttctacatcg tcaagcaaaa catgccatgc 1680 attcttggcc gcgacttttt catcaacaac agcgagctca gagccggcca cttcgccctt 1740 gaaggctctg gtctcaagct aactcttcaa aacggccagc agatctgtgt tccatggtca 1800 aaatcgccaa gcccttctgg ctcgcttctg aaagcaacta ctaccgctca actacggaaa 1860 cagctttccg aggagcatga aatcaccatc gagaccgccc tcttcgctgg cgctgatctc 1920 gacaagctca tcaacctcgt ctacgagtat cgtgacatct tcaacaagga tactgatccg 1980 atcggcgagt tcaaaacttc tgcgcgcatt cctactattg aaggactgac tcgcgctcaa 2040 cgagaacgcc cgattccgaa gcaccagcac gacgaggtga aagaacagat cgatcagatg 2100 gctcgcgaag gagtcatcga gccttgcccg aacagccacg gcttcaactc gcctttgatc 2160 gctgtccgga agaaaaatgg caagttacga gtttgcagcg acttcaaaag ctcgttaaac 2220 caggtgctca cagagactac tgagctctgg cccttgcccg caatggacac cttgtttgcg 2280 aacatccagc acggccagaa aatcttctcg tctctcgacc tcagcaaggc ctactggaac 2340 atcaaaatcg acgagcgtga caggcacaaa acctgcttca ccttcgacaa taaggcctgg 2400 caatactgcc gtcttccttt tggcctcaaa ttttcgggcg acgcgttttg caaggcgatt 2460 tcagagatgc tcacccaagt caagctcaac gaccactact gcaactatgt cgacgacatc 2520 ctcgccttca gcgccgactg tgcttctcac ttggaagtac ttgaacagat cttcgccgct 2580 tgccgtcaat tcggtgctcg cttaggcgcc cgaaaatgct cgttcggcca gtcgaaggta 2640 acttttatgg gccgaaatat ctccgccgaa ggaatttcga tcccggccga aaatatggag 2700 cacattctag ctctcaagcc gcccacaaca agaaaacagc ttcaaagcct aatcggcaat 2760 ttctgctggc tcaagaactg ggtcagtgcg aacctcggcg agccaatcgc cagcaactgc 2820 ttctccgacg tgatgagcga aatcacgcga ctcaacaaac ctggtaaaaa gttcatctgg 2880 agttcggaag ctgacttcgc ctttgggcaa gcaaagaagc gaatagctac cgacaaggtc 2940 ttcgcccttc cagacttcag ccagcccttc tgcctgctca ctgacgcttc tgatcgcgct 3000 gcaggagcca ttctgatgca aaaaatcgat ggtcgccagc gcctgatctc agtctcaagc 3060 aagtctttca acgaaacaga acggcgctgg tcggctacag agcgtgaatg cttcgcgctc 3120 gtccatggct gcgaaaaatt cgcatactac ctcaaaggtc ctctcggctt cgtctgcttg 3180 gtcgatcaca aagctcttct agcgatcgac aagaagttcc taaacaactc gaagctccag 3240 cgatggcaat gtcgtctggc agagttcaag atgaccattc aatacgtcga gggacgctca 3300 cacgtcttcg ctgatatgct gtcccgccct ttcgacactc ctgaacccgc tgctatcacc 3360 gatgacagct gcgctggtcg tttctacaag ttcagctctg acccgacgct caaagtctac 3420 attcctagct gggtcatgcc aagcgagcag ttcaagcgct caatgcttct cgagcagacc 3480 gagcaagtct ccgacctctt taccgtccgc gccgtcctct ctggcgagct aaagccaggc 3540 gcgcctattc tcgaaatgcg agagattgag caagctcaaa gcgaagattg tctcgtctca 3600 gctgtccaca aactcgtagc cagctctaca cctcttgcca agtggtcctg gcccgatgac 3660 gtctacggct gctacaacta caaaaaacat gcttcgctgt tgtccatcca cccagcccag 3720 taacttgctc atcatcaact gggggggata agaagcgcat cgtcattccc gattctctcg 3780 tttccaggta ctgcaaatca gctcatgacg atcgagctca tctcggagtt gatcgtactg 3840 cgcagttcct aaactgggct tggtggccgt ataaactaga agacatccgc tcctacgtct 3900 cgagctgcgc gaattgcctg cagcagaaag gattcgacat gcagcccagc cgccctgatc 3960 agaaacatct ctaccgcgcc agcaggccgc acgaaataat ctactgcgac ttcatctcca 4020 tgccaagctc caactctgga aagcgttact gcctgaccat catctgcggc ttctctcgct 4080 ggttacaagt ctaccctaca agccgctgca gaagcatcga cgctgctcga gctctgatga 4140 actactttct acagttcgat tttccgaaaa tactgtccag cgacagagga aaacatttcg 4200 actgcgaagt tctagctgac ttgtgccaga tgatgaagat aaagcagaac cttcatgtcg 4260 cttatcgacc ggaatcgtct ggagtgatcg aacgcgcgca caagacgctt aagtctgcac 4320 tttggggaat ggttcgagat aatccgcatc ttgactggga actgacgttg ccgagtgttg 4380 tctccgccat gaaccgatcc gtcaactctg cgaccaagat cacgccctac aagtgtctct 4440 acggccgcga ccctagcttc aatggcattg aactcgatga caacgtcagc gcccgcgacc 4500 ctgaatcata cgctcgcctc actgctgaag tactggaccg cgctcacaag ttcgtccgac 4560 tcgctcagga agccactgat aaactagctc tagaacgtgg taaatcgaag attattcctg 4620 aggaaattac cgctggccac tacgtcatgc ttaaaagagc gctcgctgct gctgcaaagc 4680 cgacgaagca aaaatggcta ggccctttca aagttctaca gaccgactcc gtcatccttc 4740 agatcgacta cgacaacaag ccaacatggg tgcaccggta tcacgttgtc aaggctaagc 4800 tccgccctac tcatctcgat cacgacctgg ttgacctcta cgacgaacct gacccgcgct 4860 ctagctcatc tgaccaaaat caaggctcac ctgctgcctc tacttctcca gcttctgggt 4920 cagtggaatc tctacgccga tcttctcgat ctagacgccc gcctgaccga cttggttatt 4980 ctcgacattc tcggtagctc gctgttaaag cgagttattt tacttgagcg tttcacttct 5040 tttcagtgtc gctctactag ccaatcaccg catccgcttt ccgtcgacgc tgtcaacttt 5100 cgagtctccg cgcgtgcgct taaaacctgc gccgcgctct tcccttcgct tcccgtcgct 5160 tctagagatt cgactctgga gggc 5184 // ID DNA-TA-7_AAe repbase; DNA; INV; 2833 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2833 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1276-1276 (2011). XX DR [2] (Consensus) XX CC ~94% identical to consensus. TA TSDs. TIRs are 24 bp long. XX SQ Sequence 2833 BP; 944 A; 405 C; 449 G; 1035 T; 0 other; caggtatacc tcgatttagt ggaccctcga ttatatagac cctcgatttt atgtacttcg 60 attttatgta cattttacct cgattttatg gacatcaaaa tttcatgtta aattttttga 120 tgtttatcca attctaaaca tgcattatgt aaacttaaac aatgtaaggg atttaggcta 180 tgcttataga agggttgggt catgttttgt tgtaaaatat tgaaaaaaat attaataaat 240 atacaacaaa gcttttacaa tatgggaaag tttaaaaatg aggagacttc agttcgactt 300 actcaacccg ttttttttgg ctttcgagtg ctttcaattt ggattttgtc cctctgtcaa 360 cttttaacat gttgccattc acgacttgct tcaactaagt ttatagtgat ttgtgcgaat 420 tttaattatt aataagaagt tacctcttgt tcttaggatt gttccaaaaa tcccacaaat 480 tcgcatttct cataaggggg ttttatactt ttcagagatt gattcaagtg cccttccaga 540 gttttatcga agtaggaaga gttgttaaat caagcaagaa tttatcaaca aattactcaa 600 acgattattc atctttcttc agtattattt agacgaacct gctgaaatta ttgcagtagt 660 ttcttttttc tatatattcg ttttgaagca cgacaatgag gatagatttt tttcaatgat 720 tcgtgcacgt tcatcactcc aatcggtact acgttgagtt tttcagaggg ttccatcaag 780 agttttccac agttccagga gtttttttta agacaaaaat tcctcaatga agtcgttcga 840 aaattgttga agaaaacttt atagtgattt atctagaagt tatttgaaaa tttcaaaaat 900 aaattctatt tattatttta tacgtttgtt acttgacaca cactgaggaa actggtcata 960 ctggaacatc tcgagattaa aatatgaaat tttgctacaa gaaaaagttt tgcttacata 1020 cgatactcta atggaaaacc tttaccagaa ataaaagagt tatgagttaa ctattttaaa 1080 atatttgatt tagttagaaa aacactttta gtatattttt tgaaatatat gaagacggta 1140 cgcatgttga tgaaaccgat ttaataaaaa tttaatcatg cattttttac taaattatat 1200 tctacatgga aagtaaactc ttattcttca tgctaggtgg attaggtcac caaaaatttg 1260 ataaattata cacgcggaga aatagattgt acggtcaacg aaaaattggg tgatactctc 1320 agggctaaca aacatttttg ttgaattcaa cccgaaatgt ccgttggttc tacaatcatg 1380 attgttgtca ctcagtttga catgtttgtt aatccagcaa tcatttttgc catgttgaaa 1440 agtttgtttg ttaaaccgat aatgttaggt taagtattgt tttgattcca tttcgatgtt 1500 ttattttatt tgaaaataat aaaacaagcg aatgtatatg tagaaaaaaa gttttcgaaa 1560 ataattcttc acaaatgtat tgaacaatcg ataatatgga aaactcatta tgttttggat 1620 taaatagaag tcaaatctta ttttccagtc tcatattctt ataatattaa gcaacaatat 1680 accattttta tttttttaaa tgaaaaattt acgcaaacct caataaaagc ttcatttata 1740 tttttcctgg attaaaaatt gtaaaattga ttttaccttt gcgaatttgg agtagatact 1800 gtcccatgtc ccattacgct tgttttgttg gctgtcatgt ccgatcgata ggatgccgtt 1860 tgtggaacca tagcattggc aataaattcg ctggaatttt tggaccttga ttaaaatttt 1920 accgagctgc cgttctacgc gtctcgcgat ccttaaggtt tacatagaac atgggacaat 1980 ttaattacat aatttaacaa acatcaaatg ttataaatca ctttgccttc aataaaacca 2040 attgtatcat ttttgaataa tgaaaataac gatgttttga ttggtgtgta cacagtgcag 2100 ttattctaag cgtgtagaaa atgcattgtt atatcaaaat aacattgtta atttgacaaa 2160 ccttgattat tgaagtgaca tgttataaaa aaagttgtgg atttgaactc aataaacagt 2220 tttttttatt aaaccttgca aaattttcta tgtgtgataa ttttatttca tgtaaacata 2280 aataaaaccg gtattttgta caactttggc gacctgtagc tcaaaattgt tacatgctgg 2340 aaaacttcta agagtggtat cagattcagc aatcccaaat tcactggaga catataattt 2400 gatcctttag atacgcagaa atgttatttt tttatgctgt ggaaggaatt gtaaataaaa 2460 tttcaaatgg gtttggctcc gtaatgcctg atagcgcttg agcttttgaa taaacaaata 2520 agtaaaacat gcgaaaaata gtttatgtcg ttctttttaa tcgtattctt atgaaaaata 2580 tttatatttt gaattagaat gcaatacttc attgacggac tgatttcatc aagaatttgt 2640 taactcaagt caaccttgga acattgaccc tggtatcgaa aataagtttt tcagcgataa 2700 tgatacattc taatgaagtt ttaactgaat agaaagcaaa aagtgaaata aaaattttgc 2760 ttcgatttta tgtacaattc gattttatgg acaattttga aatcaaaatg tccactaaat 2820 cgaggtatac ctg 2833 // ID P-15_HM repbase; DNA; INV; 3257 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3257 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 361-361 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 403..2685 FT /product="P-15_HM_1p" FT /translation="MYYSTNNGGKRLLQSAIPTIFNIPNPPPLLGCKRKPP FT VERHSVPAKKQVVCLPVNSDSNLENKMSTNKINFNKLQRLLLTLRVKVSRL FT RKQVRHLKSKQKTQKNCDGSIKHILNLSRKYLSPIQVSFFKSQLEMSKRVN FT KGKRWAVCDKVLALRILFHSPQTFNVLRTIFCLPSQTCLLLFLSRVFSDLQ FT EGFSTRLFSLLKLRADAMNPLDRNVSLVMDEMSLKQHLEYDRNSDRVYGMK FT NGKLLNQALVIMVRGLANKWKQPIAYFYNNSTIATADLASLLRETISKVQE FT TGLHIRCVVCDQGSTNIAALRLLGFSNNLPYFPNPSNNKNIHVIFDPPHLV FT KSIRNNLRRHNIDINGEIVSWQHIQSLYNLDKINSVRLAPKLTDRHLDPGP FT LLSMKVKLATQVFSYQVASALYCCSNTNSLPLSVLPTARFVERMDTLFDIL FT NSYKLYADKPARCALTLKGASISQLEELKHWIEKWKFCNVRSQASISCHWG FT LSVTINSVLTLSRELFSEGFQFVCTSRFNQDCIENFFSIIRSKGGWNDRPN FT VRQFRAAYRNALLLLSVEKSKSNSNCIKDSNFETAFNLNDFMKCCNQATTN FT CGIYETENNIPKGISYFHPGSFYMFTGKILMQPKDQALLHMAGWLIQKVKL FT CHRCADVLFLSSADNPSAFMSLVCEMETTFTKFIDKYLERDGLAMILINKI FT KKKCHFQFLHNQHEEHALYLEKNIVLFFVITRIFYFIKFLNRDINPRNKSN FT AKKIARILHN" XX SQ Sequence 3257 BP; 1143 A; 469 C; 474 G; 1171 T; 0 other; tctgggatgt cctgccggac ctcagaattt tataatatta agtagcgaat agttattttt 60 gtatgtactt aaaagaattt ttatcttagt tataacaatg ccatcatttt gtgctgctat 120 aaactgtgga aataagagtg gaaaaaactg taaagatatt tcattttttc gttttcctaa 180 agacgaaaaa aggcaagtta attcatatat tatttataat aacttttctt tcataaaaag 240 ttaataaaat tattattgta gtattttaaa acattctttt attgtttaat ttttagatgt 300 aaacagtggg tcatcaactg tcgtaggaaa gatttagaca aaaaagattt tgtattttta 360 aacaaaaact tttatttgtg cagtaatcat tttgaaaata caatgtatta ttctacaaat 420 aatggtggaa aacgtttgtt acaatcagca attcctacta tttttaacat cccaaatcct 480 cctccacttc ttggttgcaa aaggaaacca ccagttgaac gtcattcagt tccagcgaaa 540 aagcaagttg tttgtttacc tgttaattca gatagtaatt tagagaacaa aatgtctact 600 aataaaataa atttcaacaa acttcaaagg ttactattaa cattgcgagt caaagtttct 660 agactacgta aacaagttcg acatttaaaa tccaaacaaa aaacccaaaa aaactgtgat 720 gggtctataa aacatatttt aaatttatcc aggaaatact tatcacctat acaagttagt 780 ttttttaaaa gtcaattaga aatgagtaaa cgagtcaata aaggaaaacg ctgggcagtt 840 tgtgacaaag ttttggcttt gcgaattctg tttcatagtc ctcaaacttt taatgtcttg 900 cgaacaatat tctgtttacc aagtcaaacc tgtttattat tatttttgtc ccgtgtattt 960 tcagatcttc aagaaggatt ttctactcga ttgttttctt tattaaagtt acgtgctgat 1020 gcaatgaatc cgcttgatcg taatgtcagt cttgttatgg atgaaatgtc attaaaacag 1080 catttagagt atgatagaaa ttctgacagg gtttacggta tgaaaaatgg aaaactttta 1140 aatcaggctc ttgtaattat ggttagaggc ttggcaaata aatggaaaca accaatagct 1200 tatttttata ataattcaac aatagcaaca gctgacttgg cttctttact acgtgaaacc 1260 atttctaagg ttcaagaaac tggtttacac attagatgtg tagtttgtga tcaaggatct 1320 actaatatag ctgccttacg tttgcttggg ttttccaata atttaccata ctttcctaat 1380 ccttccaata ataaaaatat tcatgttatt tttgatcctc cgcatttagt aaaaagtatc 1440 agaaacaatt tgcgaagaca taacattgac attaatggtg aaatagtttc ttggcaacac 1500 atccagtctc tttacaattt agacaaaata aactctgtgc gtctagcgcc aaaactaaca 1560 gatagacact tagaccctgg tcctctttta tctatgaagg tcaaacttgc cactcaagtt 1620 tttagttatc aggttgcttc tgcattgtat tgctgctcta atacaaattc acttccttta 1680 agtgtattac caacagcaag atttgttgaa cgcatggata ctttattcga tatattgaat 1740 tcatataaat tatatgcaga taaaccagct cgatgtgctc taactttaaa aggagccagt 1800 atttctcagt tggaagagtt aaagcattgg attgagaaat ggaaattttg taatgttaga 1860 agtcaagcaa gtatatcatg tcattggggt ttaagtgtca caattaatag tgttctaacc 1920 ttaagtagag agctattttc tgaaggtttt caatttgttt gtacttctcg ttttaatcag 1980 gactgcattg aaaatttttt ttcaattatt cgaagcaaag gtggttggaa tgaccgtcca 2040 aatgttagac aatttcgagc tgcttataga aatgctttat tacttttatc tgtagaaaaa 2100 agtaaatcca atagcaactg tataaaggat tcaaattttg agacagcttt caatctaaat 2160 gattttatga agtgttgtaa tcaagcaacc actaattgtg gaatttatga aacagaaaat 2220 aacataccta aaggaatatc ttattttcat ccagggtcat tttatatgtt tactggaaaa 2280 attttaatgc aaccaaagga tcaagccctt ttacacatgg ctggatggct tattcaaaag 2340 gttaaattat gtcataggtg tgcagatgta ctatttctta gttcagctga caatccatca 2400 gcttttatgt cactagtttg tgaaatggaa acaactttta caaaatttat tgataaatat 2460 cttgaaagag atggactggc catgattctc ataaataaaa ttaaaaaaaa atgtcatttt 2520 cagtttttgc acaatcaaca tgaagaacat gcgctttatt tggaaaaaaa cattgttctg 2580 ttttttgtta ttacaagaat tttttatttc ataaaatttt taaacagaga cataaaccca 2640 cgcaataagt caaatgcaaa aaaaattgca cgtattcttc ataattaata tattatgata 2700 ctattttttc aatttttttt tttgtaataa gatgcactaa taactttaaa aaaaattatg 2760 ttgtttatta caagaatttt ttatttcata aaatccaccc aaaaagtcca aaaaaataat 2820 attgcttcca aattgcaatt tcaaaaatgc acgtattgcc ctttggtata ttattatatt 2880 attttttcaa tttaatcttt gtaatgagac tttgttatgt tataatgtat attaataact 2940 tttgttaatg tataataata ataactttta tgtataataa tatactttta tgttataaaa 3000 aaattacact aactttatgt tctacaaaac tctttttgtt ctacaaaacg tttttttccc 3060 cattatttta ttcttattga aacattctaa tggtaatctt aacaaaattg atgtatctta 3120 atgttacaaa attgagatat cttgaataca aaattgagat atttacaaat atcacataaa 3180 acaaagatgc aaagtttatt taaaaaaaat aaactatttg tgttagttgc ggaagaagtc 3240 cggcaggaca tcccaga 3257 // ID WUNENG repbase; DNA; INV; 257 BP. XX AC U88303; XX DT 21-AUG-1997 (Rel. 2.07, Created) DT 21-AUG-1997 (Rel. 2.07, Last updated, Version 1) XX DE Mosquito miniature inverted-repeat transposable element Wuneng. XX KW DNA transposon; Transposable Element; Nonautonomous; MITE; KW nonautonomous DNA transposon; Wuneng. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-257 RA Tu Z.; RT "Three novel families of miniature inverted-repeat transposable RT elements are associated with genes of the yellow fever mosquito, RT Aedes aegypti."; RL Proc. Natl. Acad. Sci. U.S.A 94(14), 7475-7480 (1997). XX DR GenBank; U88303; Positions 1 257. XX CC 19 bp terminal inverted repeats; TTAA (TTAT) target site CC duplications. XX SQ Sequence 257 BP; 80 A; 51 C; 43 G; 83 T; 0 other; ggctaagtag cccgtcattc attttggcaa caataatgac ttttcagctt gcatttcaaa 60 gtggtaaaac tcagtcttca tagattaaat tgacttgaaa aagtatcact gtacgcgcta 120 acatgcataa agtatgctca tactttttca gctttgtccg tgcaaaacta tctgattttc 180 tttgattcga aatcgtgaga tgaattagca acaataatca acgacgcgta caaatttcaa 240 tgacggctta ctttgcc 257 // ID Crack-1_CS1 repbase; DNA; INV; 4856 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Crack-1_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-1_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4856 RA Kapitonov V. and Jurka J.; RT "Crack-1_CS1, a family of Crack non-LTR retrotransposons."; RL Repbase Reports 9(7), 1343-1343 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 378..1028 FT /product="Crack-1_CS1_1p" FT /note="ORF1: L1-like domain." FT /translation="MTTRKFSVDMKAFLTDPDGATLLRDTICGHLCSMIES FT LKTDLKRRDDRIDHLETRLEELEERNDTLEQYTRRNSVRINGISESAAEDC FT ENKVLSTINEKMSLSPPMTMDSIDRMHRIGKPVPGKHRSIIVKFATYHQRK FT RVMSKRSSLRGSDIFINEDLTKHRNNLLYIARQFKKKGGITDCWSYDGRIL FT IKDNKNLIREIKKHTDLSSLQTESQPN" FT CDS 1336..4386 FT /product="Crack-1_CS1_2p" FT /note="ORF2: AP endonuclease and RT domains." FT /translation="MVFNTFLIDEDSIMDELNDIDPDLCYYNQVIANMNCE FT YYSENNFNEKLNNRNNIQATTPFSLIHSNVRSFFANGNDLTHSMDLCDHQF FT SIICLSETWLTPSTYLSAAIPQYNHECRYRSKKSGGGVSIFIHNSIQYSPR FT HDISIFNEDIESIFVDIDKSVINTDRNVIAGCIYRPPKSSISEFNESLRSI FT LGKLSKENKDAYLAGDFNIDLLSVDSHLQTSEFLEILFSFSFFPSINKPTR FT ITASSATIIDNIFINNLHSKSITAGILAIGISDHFPVFCSTPVNMSQSPPR FT EHFSNVRILSERNKIKFESMLRQFDWTTCIQQTCCQSAFSLFYFQYKKCFD FT ECFPLIKTKSTYKNKKPWLTGALKTSIKAKNKLYRKYIKSKLEQDLNTYKE FT YKSKLKSCLRRAERAHYDQLFTQYKSNLRKSWALIKDLINKKSTNKSQSFK FT INVNGVETSNMSTITNAFNDYFVNIGPSLAQSIPNTSKNPIDYIRASASRS FT MYLTPVTELEITNIIREMRDCAPGPDAIPSAILKETATIFIPVLTQIINLS FT FSEGVFPNDMKCAKITPLFKSGDKTSINNYRPISLLPSFSKIIEKCMSKRL FT LAFIDEHSILYKYQFGFRPKYSTNMAIHLLVDKIVSCLDKGENLVGVALDF FT RKAFDTVDHLILLKKLEAYGVRGNAYDWFASYLARRTQYVEINHTRSSSLE FT IKCGVPQGSILGPILFLIYINDLPFSSKLMPIIFADDTNVFLGGKSINQCI FT ETINIEMNNLSKWIQCNRLSLNVTKTNYIIFSKSKVLSANLLPLTINNTVI FT DRVYSLKFLGIVIDDKLSWSEHIKYIRGKISRSIGMLSCARKNLDRKTLIQ FT LYYAFIYPYLSYCIDVWGHCSQQLFLSVFKTQKRALRIISFSKRLAHTEPL FT FKSFEILPLQAIYILALSKFMYKFYFRLLPSVIDELFQSNNSVHSVPTRQQ FT NLLHVPHVISETSRKSIRVRGVSTWNRLAQNININSFSFNKFTRIMKDKLL FT NDNVFSISITQ" XX SQ Sequence 4856 BP; 1503 A; 987 C; 719 G; 1647 T; 0 other; gagtagttcg gtgtcgtctg tgaaactctg cttttacaga gctctctaat ttttcaacaa 60 ttatatacgt tattttcatc atacatcaat gtgattaata tcattggatt cctttttggt 120 tagtgatcat tgtgtagtgt ttttttgtgg ctaattttgc tttctctgtc ctttaattga 180 actttttcga tatcacattt atatttctca gttatttaga acttctctca tttcatttcc 240 gtcacggttg attgcacact attcactgca cgcattaact tcttgatttc tcttccctct 300 gtttattttt tttccctcgt tgttatttgt atttcttttc ctctgattat ttttcaattt 360 ttgaccctta taccgaaatg acaacaagga aattcagcgt tgacatgaag gccttcttga 420 cagatcctga tggtgctact ctactccgag acacaatatg tggtcatcta tgttcaatga 480 tcgaatccct gaaaacggat ctcaaacgga gagatgatcg gatagatcat ctggagacga 540 ggctcgagga gttagaggaa agaaatgaca ccctcgaaca atatacacgc aggaattctg 600 tgcgaattaa cgggatcagt gagtcggcag ccgaagactg tgagaacaaa gtcctctcaa 660 ccatcaacga aaagatgtca ctctcaccac cgatgactat ggacagcatt gaccggatgc 720 atcggatagg caagccagtc cccggaaaac accgatcgat catcgtaaaa tttgcaactt 780 accatcaacg caaacgagtg atgtcaaaac gttcctccct tagaggctcc gatatattta 840 taaatgagga cctcaccaaa cacagaaaca atctcctcta tatagcgaga caatttaaga 900 agaaaggtgg aatcacggac tgctggtcct acgatggtcg aatcctcatc aaagataata 960 aaaacctcat ccgcgaaata aaaaaacaca ctgatctctc ttctcttcaa actgaatctc 1020 agcctaatta gccaattacc agtatccagc gactgaccaa aaacattccc acacactcct 1080 ctcaatcctt accgttcccc cttttctttt catattcctt tattactatt attattatta 1140 ttattattta tttatttttt ttctttcttt catttttcac cactctctct cttcataatt 1200 ccttgttcct tacatttaaa cttttttatt acctgcgcac catgcaccct ggctgaactt 1260 ccattttaca atgtaaataa tcagcaaacg cccgcctctc aaaattcttt tctagacatc 1320 aaaaggctta acgcgatggt atttaacact tttttaattg atgaagactc gataatggat 1380 gaattgaatg atatcgatcc tgatctttgt tattataatc aagtcattgc aaatatgaat 1440 tgtgaatatt actcggaaaa taattttaat gagaaattga ataatcgaaa taacatacag 1500 gctacaaccc ccttctctct catacattca aacgtaagaa gctttttcgc taacggaaat 1560 gacttaactc actccatgga tctttgtgac catcaattct caattatttg tttatcagaa 1620 acttggttaa ccccctcaac gtacctcagc gcagctatac ctcaatataa tcacgaatgc 1680 cgatatagat ccaagaaatc tggtggagga gtttctatct ttattcataa ctctattcaa 1740 tactcacctc gccatgatat atctatcttt aatgaagaca ttgaatccat ctttgtagat 1800 attgataaaa gcgttattaa caccgaccgt aatgtaattg ccgggtgtat atatcgccca 1860 cccaagtcct caatctcaga gtttaacgaa tcattaagat cgatacttgg aaagttatcc 1920 aaagaaaata aagatgctta tcttgcaggg gattttaata tcgacttgct cagtgttgat 1980 agccatcttc aaacatctga attcctggaa attttattct cattttcctt tttcccttct 2040 ataaacaaac ctactcggat aacagcttca tcagctacca ttatcgataa catatttatt 2100 aataatctac attcaaaatc aattactgct ggaatactag caattggtat atcagatcat 2160 tttccggttt tctgttcaac ccctgtcaac atgtcacaat ccccaccgag ggagcacttc 2220 tccaacgtac gaatactgag cgaacggaat aaaattaaat ttgaatccat gttaaggcaa 2280 ttcgactgga ccacatgcat ccagcaaacc tgctgtcagt ctgctttttc gctcttctat 2340 tttcagtata aaaagtgttt tgatgaatgt tttcctctta taaaaactaa atctacttac 2400 aaaaacaaaa agccttggtt gactggtgca ttaaaaacat ctataaaagc taaaaataaa 2460 ctataccgta aatatataaa atctaaactt gaacaggatc taaatacata taaagagtat 2520 aagagtaaac tgaaatcttg cctgaggcga gcagaacgag ctcattatga tcaattattc 2580 actcagtata aaagtaactt acgcaaatcg tgggctctga ttaaggattt gatcaataaa 2640 aaaagcacta ataaatccca aagttttaag atcaatgtga atggtgtgga gacaagtaat 2700 atgtcgacta taaccaatgc attcaacgat tactttgtta acatcggtcc ttctctagca 2760 caatcaattc ctaatacgtc caaaaaccca attgattaca ttcgtgcctc ggcctctaga 2820 tcaatgtatt taactccagt gactgagctg gagattacga atatcataag agaaatgagg 2880 gactgtgctc ctggtccaga tgctattcca tcagccattc tgaaagaaac cgccaccatt 2940 tttattcctg tccttactca gataatcaac ttatctttct cagaaggcgt tttccctaat 3000 gatatgaagt gtgctaaaat tacacccttg tttaaatctg gtgacaaaac ttccattaat 3060 aactatcgac ccatttcact tctcccatcc ttctcaaaaa taattgaaaa atgcatgtca 3120 aaacgtcttc ttgcttttat cgatgaacac tctattctat ataaatatca atttgggttt 3180 cgaccaaagt attcaacaaa catggcgata cacttgcttg ttgataagat tgttagttgt 3240 ttggataagg gtgaaaattt agtgggtgtt gcactggact ttagaaaagc ttttgatact 3300 gttgatcatt taattctctt aaagaaactg gaagcatatg gggtgcgagg aaatgcatat 3360 gattggtttg caagttactt agcaagaagg actcaatatg tcgaaattaa tcacactcga 3420 tcttcttcat tagagatcaa atgcggtgta ccccaggggt caatattggg cccaatctta 3480 tttttaatat atatcaatga cttacctttt tcgtcaaaat tgatgcctat tattttcgca 3540 gacgatacta atgtttttct tggcggcaaa agtattaacc aatgcattga aacaatcaat 3600 atagaaatga ataacttaag caaatggatt caatgcaatc gtctatctct gaatgttaca 3660 aaaaccaatt atatcatatt ctcaaaatct aaagtattgt ccgcaaattt gttacctctt 3720 actattaata atacagtgat tgatagagta tattccctga aatttctagg aattgttatc 3780 gatgataaac tctcctggag tgaacacatt aaatatatcc gaggcaaaat atctcggtca 3840 attggaatgt tgagctgtgc gaggaagaat ctggaccgta aaaccttaat ccaattgtac 3900 tatgcattta tttatccata tttgagctat tgtatagatg tctggggcca ctgtagtcaa 3960 caactctttc tttcggtttt taaaactcaa aaacgtgcac tgcgaataat atctttctcc 4020 aaacgcttag cccataccga acctttattt aaatcttttg aaattctgcc actgcaggct 4080 atctacattt tagctcttag taaatttatg tataaattct attttcgact ccttccttct 4140 gtaattgatg aactatttca aagcaataac tctgtacatt ctgttcctac aagacaacaa 4200 aacttacttc atgtgccaca cgtaatctcc gaaaccagcc ggaaatcgat aagagtaaga 4260 ggtgtttcca catggaacag actcgctcaa aacataaata ttaattcatt ttcattcaat 4320 aaattcacta ggattatgaa agataaatta ttaaacgaca acgtattttc tatttcaatc 4380 acccaataat gctatttcag ccttaacaat ctatttttct attttcttat ttttttccct 4440 tattcttttc attttttttt ttcacttcat caaacctgta tatttaagca ttttcattat 4500 cctggccagt ttaaaaaaaa tacttttttt ttgttcttct ctctatacat gtattttttt 4560 ttgtcaatta tcatcttctt tgtttttgtt tttgttttgc gttttttttt cttgtaggta 4620 ttgtaaatag cgtcaggtta gattaggaat tgtaaataaa taagtaggaa ttttactctt 4680 aaccgctgca cattgcacct tgcacttttt ttttcgaatt tctaccacag acgacaccga 4740 aagttagtta gtattcgttt ttcgttcttt ttgtagcaca tacccaagtc tgggtcagct 4800 accaattcta tgtattatta ccttttttgg aaaataataa aattgtctct ctctct 4856 // ID BEL-634_AA-I repbase; DNA; INV; 6791 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-634_AA_; KW BEL-634_AA-LTR; Pao_Bel_Ele201; BEL-634_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6791 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5828-6385] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 60..1250 FT /product="BEL-634_AA-I_3p" FT /translation="MDPSESHCMLCDRPNHVDNLVQCDRCDGYLHYSCAEV FT GDSIADPDRSFTCKRCIESDEVVTVSSHSSRRTSNSSRSAARAALRLQQLE FT KERQIRLRELEDAEKFQRMRRQVELEFEKEQFAILEQQLQEENEDGSTSTS FT RVSSRRIRDNTRKWVRAASQTGGVKCPIATTSTQIGSTIPTGTSQNPAIAP FT EGSTEATLAVPFSSGITSRVPAPVVAMRVIGNSGEQHSNHPTEVRTTTTTN FT SDQGMITRAEIVPPANLQIDQQLPGRMQQIAGAARTTTDCNMSSGTGGERV FT GAPISVGLSTDQVRTSISTLHVRLQPIDEAAQTSTPKQAPVVQTSVGNQSG FT PREKPNYLAKSGNAIGAVQGQLPHSTLLNILHPQGNLLRVQRGLVYYHPLN FT QPC" FT CDS join(3401..4117,4121..5383) FT /product="BEL-634_AA-I_1p" FT /translation="MSNLFRQFLAVDEASVKRSPQSKEDQRALDILHNTTR FT RVDGRWETGLLWRSDEPSLPNNYNMAVRRMEALERKLAKDEELQKKVQGMI FT EEYLAKGYAHRITAAELESSMPGRVWYLPLGVVRNPRKPEKVRLIWDAAAR FT TEGVSFNDMMLKGPDMLTALPVVLLRFRQNRIAFSGDIKEMFHQFRIRQED FT RQAQRFLYREHPGAQAQIFVMDVATFGAACSPCIAQYLKNKNAEEYKEQPE FT AARAIVDKHYVDGYLDSVETLEEAAKLIEDVKHVHAMAGMEIRNFASNSTE FT LLQRIGEVRETQQKSMSLEASAERVLGMMWKPLEDIFTFQLDLKEEVRNIV FT TNKIVPTKRQVLRTIMSVFDPLGLVAHFVVHGKLIMQRIWRAGLNWDEQID FT GEILEDWRKWSGLLTTITEVSVPRCLLPGSGDTSQEPQLHVLVDASENAYA FT CVAYLRTSCNGLPRCTLIAAKTKVAPLKPLSIPRLELQAALIGSRLVDTIT FT KALTIPISARYLWTDSSTVLAWLRSETRQYHQFVGFRVGEILSTTTVNEWR FT KVPSKLNVADQATKWKDGPSFDPKDWWYAGPSFLSDPEEKWVEDSAEKFTT FT TVDMRASFLLHRNQGQASMVAVERFSKWVRLLRTHRLRRSGGQAIPRTESD FT RPADAGRDTDS" XX SQ Sequence 6791 BP; 1868 A; 1686 C; 1822 G; 1409 T; 6 other; aaaatcttca aagattttga cgtggattcc atgcgcacgt ggttaacatc cggacaggaa 60 tggatccgtc agaaagccat tgtatgctct gcgatcggcc gaaccacgtc gacaatttag 120 ttcagtgcga ccgatgtgac gggtatttgc actattcttg tgccgaagtg ggggactcga 180 tagccgatcc cgatcggagt ttcacctgca agaggtgcat cgagagcgac gaagtcgtca 240 cggtatcctc ccatagctcc cgcaggactt caaacagcag ccgcagtgcg gcccgagcag 300 cactacgtct ccagcagctg gagaaagaga gacagatccg cctcagagag cttgaggacg 360 cggaaaagtt tcagcgaatg cgccgccagg tcgaacttga gttcgagaag gaacagtttg 420 ccatcctgga gcaacaactc caagaagaga acgaagatgg tagcaccagt acgagcagag 480 taagctctag aaggatccgc gacaacacca ggaagtgggt cagagcagcg agtcagaccg 540 gtggagtgaa atgtccaata gctaccacat cgactcaaat tggttccaca attcccactg 600 ggacctcgca gaatccagct atcgctcccg aaggtagcac tgaagctacc ctagcagttc 660 cgttttccag tggcataacc agtcgagtgc cggcacctgt cgtggcgatg agggtcatcg 720 gcaattctgg ggaacagcat tccaaccatc ctacggaagt tcgtaccacc acgacaacca 780 acagcgacca aggaatgatc acgcgagctg aaatcgttcc tccggcaaat ctgcagattg 840 accaacaact tccgggcaga atgcagcaga tagcaggagc agctcggacg acgaccgatt 900 gcaacatgtc atcgggcacc ggcggtgagc gagtaggagc accaataagc gtaggtttgt 960 ctacagacca ggtgagaact agcattagca ctttacacgt cagattacag ccgatagacg 1020 aggctgctca aactagtacg ccgaaacagg cacctgtagt acaaacaagc gtaggaaatc 1080 agtcagggcc tagagaaaaa ccaaattatc tagccaaatc aggcaatgcg ataggagcag 1140 ttcagggtca gctaccacac tctactctct tgaacatctt gcaccctcag gggaacctcc 1200 tccgggtcca aaggggattg gtctactacc accccttaaa tcaaccttgt tawctaatcc 1260 gaatccatcg attagtcaac aaccgaacag tcaaacgatc ggtcgaaaaa cgtcggtagt 1320 ttcgtcttta ttggaacgtg gacagtgcgg tcaatcaaat ccatcacttg ttttgccagc 1380 gtcggggcat cagcagttca gtgtgtcgaa tccccaagaa tctcaaccac cgttgggacg 1440 agaacagtgc atggtgttta gtgaacgaaa ccaatcagtg atccagcccc caccgggata 1500 cgaaaatttc gtgaggaatt ccgagacagc accagccggt tcctggaaca cgtcgtatcc 1560 aacgctcgcg aatcctggtt cagcaagcct tttccggaca tcaggttcgt atcagcagac 1620 acgccgcctc ccagaccatg cgttgaattc gacggaatat ccaccggcgt tgggtcatca 1680 gcaatgccaa ttaccgagct cgacgatact gggagcccaa ccaggatgtc aagcatttcc 1740 ggcacctgga ccggatcaat cgggaccacc aatcgggacg aaatcatcag tatgtgccag 1800 atcagtcagc tcgcgtgcag tggaaccatg ccagtgattc tcaaccgcga cgtgggacaa 1860 ggcccacagc cgaacaaatg gctgctcgac aagtcatcgg aaaggaactg ccaatctttg 1920 cgggagatcc aagagattgg ccactcttcc ttagttcctt caacaactcg accgaaacct 1980 gtgggtacaa cgatgccgaa aaccttgcgc gtttgcagcg gtgcttacgt ggacatgcac 2040 tggagagcgt aaaaagtcgg ctgctgattc cggagtcagt tcctgttgta ttggagacac 2100 tggagcgact atatggaaga ccagaggtgg ttatccatgc tcttttgaag cagatgagag 2160 agatttcctc ccctcggggc gacgacctca aaacattgat ccagtttgga atgggcgtag 2220 gaaacatggt ggagcacatg atcctggcgc aacaatatca gcatattagc aatcctatgc 2280 tactgcagga attggtggat aggcttccac cgaatctaaa actccagtgg gcctgcttca 2340 agcgcaacta caacccagtc aacctggcca cgttcaacga ttttatgaaa gaccttgttg 2400 cgatggccag tgacgtcact ctttttsctg atttgggact gcaaacggtc aagaaacaag 2460 aaaaaggaaa gcgagagaaa cccacgaagg agaagctttt gttcatcaac tgtcatcaac 2520 ggcggctggg acgtctaccc ctgagagtct accggcaaag ccctgtatcc attgcggtca 2580 aacgggccat cgaatcgccg actgtatggc attccggcga ttgaatgtgg acgagcgatg 2640 gaaggtgcta cgacagaaag gattgtgcag gatatgctta atacctcatc gatcttggcc 2700 ctgccggtca aaactggaat gcggagtagg agactgtcga atgcgtcatc atggtttgct 2760 acacgttagg aaggaggagt caggagtacc atcgtcatct acgctggaga agaacgttgt 2820 tcagcaacat ctcattcagc aacttctagt gcactgttgc gctatcttcc tgtgacgttg 2880 cacttcaatg gaaaaagcgt ggatgttttc gcatttctag acgacggatc ttcctcgacc 2940 atggtggaag cagaagtagc ggatcagctg ggcgctgtag gacctggcga accgctccat 3000 ctcggttgga cgggagatat tacaaggacc gagaaagaat cccagcacat ccaaatcgtc 3060 atctccggac tcaacatgga aaatgaattt ccattgaagg ctagaactgt gagcagcttg 3120 aagttaccga gtcaaacggt ggattacgac gcgctctgtt cagaccatcc tatttgagga 3180 agctgccact gtgtagctac acaatgcctc acctcgtctc atcatcgggg ttgataacgc 3240 gaagctgata agtgcactga aaagccggga aagcaacaca ggagaactag tggcggttaa 3300 gactcggcta ggatggtgtc ttttcggaaa aagctctagc gggagcagtc ctggagaata 3360 cgtgaacatt catgctgagc tgcccgagga agacgccgag atgagcaatc tgttcaggca 3420 gtttcttgca gtcgatgaag cgagcgtcaa gcgaagtcca caatcgaagg aagatcaacg 3480 agcgctggac atcctacaca acaccaccag aagagttgat ggtagatggg aaactgggtt 3540 gctgtggcga agtgatgaac cgtcgctccc aaataactac aatatggccg tgcgaagaat 3600 ggaagcattg gagcgcaagc tagcgaagga tgaggaactg cagaagaaag ttcaggggat 3660 gattgaggag taccttgcta aaggatatgc ccatcgaata actgctgcgg agttggagtc 3720 ttctatgcct ggtcgggtat ggtatttgcc acttggggtg gtaaggaacc cacgaaagcc 3780 agaaaaggtg cgtctgatat gggacgccgc agcccgaaca gaaggagttt cattcaacga 3840 catgatgttg aaaggaccgg acatgttaac agcacttccg gtcgtccttc tgcgcttccg 3900 ccagaatcgc attgccttta gtggagacat aaaagaaatg ttccaccaat ttcgcatacg 3960 acaagaggac aggcaggcgc aaagattcct ttaccgggaa catccaggag cgcaagcgca 4020 aatatttgtg atggatgtag ctacctttgg agctgcatgc tcaccatgca tcgcgcaata 4080 tttgaaaaac aaaaatgccg aggaatacaa ggagcagtwt ccagaagcag cgcgagcaat 4140 agtcgacaaa cactatgttg atggctacct ggatagtgtc gaaacgttag aagaagcggc 4200 gaagctgata gaggacgtaa agcacgttca cgctatggct ggaatggaga ttcggaactt 4260 cgcttcgaat agtacggaac ttctccagcg cattggagaa gtacgcgaaa ctcagcagaa 4320 gtcgatgagc ctggaagcaa gcgcagagag agtgctcggt atgatgtgga aaccgttgga 4380 agacatcttc acgttccagt tggatctgaa ggaggaagtg cgaaacatcg tgacgaacaa 4440 gattgtgccg acgaagcgcc aagtcctgcg gacgatcatg tctgtttttg atccgttagg 4500 tctggtagcg cattttgttg tgcacggaaa gctgattatg caacgaatct ggagggctgg 4560 cctaaactgg gacgagcaga tcgacggaga aatattggag gactggcgta agtggagcgg 4620 tctgctgacg acaataaccg aagtgagcgt cccaagatgc cttcttcccg gcagtggaga 4680 tacttcacaa gaaccacaac ttcatgttct ggtggacgct agtgagaacg catacgcttg 4740 cgtagcatac ctcagaactt cgtgcaatgg actccctcga tgcacgttga tagcagcaaa 4800 gacaaaggta gcgccactaa aacctctatc aattccacga ctggagttgc aagcagcgct 4860 gatcggtagc cgtctggtgg acactatcac caaggctcta acgattccaa tatcagctcg 4920 atacctatgg acggattctt caacggtctt agcttggctg agatcggaga cccggcagta 4980 tcatcagttc gtcggtttcc gagttggaga aatcttgagc acgactacag ttaatgagtg 5040 gcgaaaggtt ccatctaagc tgaacgtggc ggatcaagca acgaagtgga aggacgggcc 5100 tagtttcgac ccaaaggact ggtggtatgc tggaccaagc ttcctatccg acccggaaga 5160 aaaatgggta gaagattcgg cggaaaagtt cactacaacg gtagatatgc gagcgtcgtt 5220 tctacttcat cgaaaccaag gtcaagcttc aatggtggcc gtggaacgat tctcaaagtg 5280 ggtgagactg ctgcgcacac accgcctacg tcgttcgggc ggccaagcga ttcctaggac 5340 tgaaagcgac cggccagctg atgcaggaag agatacagac agctgaaaca cttctgtggc 5400 ggcaggcgca aatggaggcg taccctgacg aatacgcggc actcgagtac aacaaggaac 5460 atacaagcga ggagccgaag cagttagcaa aatcgagttc gctgttcaag atgtcaccga 5520 tgattgacga cgacggagtg ctaaggatga acagtcgcat akccgcggcg ccagtagttt 5580 ccaccgatct caaatacccc atttttcttc caaaagagca tcgagtaact atcctgctcg 5640 ttgaaagcta ccacatccgc tttctgcatg gaaacaatga gacggtattt aacgatgtaa 5700 ggcaacgctt ccaagttccg cagctccgtt cggtcattgc gaaggtggcc aagcagtgtc 5760 aacattgtcg agtcagaaag cagttccgcc ccaatgatgg caccacttcc tgaaatcagg 5820 ctaacgccgt tcatccggcc ctttacccac accggagtgg actatttcgg gcccattctt 5880 gtcaagcaag gcgcagcaca gtcaagcgat sgatcgccct atttacgtgc ctgtcataag 5940 ggcagtgcac ctagaagtcg tccacagctt atcgacacaa tcctgtgtta tggccatccg 6000 gagattcgtg gctcgcagag gttcacctgc taccttttgt tctgacaacg gaaccaactt 6060 catcggagcg aacaatctgc tgaaggaaca actccggacg atcagcaagg actgtgcgac 6120 gacatttacg aactcggcta cgaagtggct cttcaatcct ccgcttgcgc cccatatggg 6180 aggtccatgg gagcgcatgg tcaggtccgt gaaggtggcc atggcggcga ttgcggacca 6240 tccacgtcat cccaatgacg aagtccttga aaccgtgctg ctggaagcag aatcagtagt 6300 caattcaagg ccactcacct acgttccgct ggaccatgca acgcaggaag ctttatcccc 6360 aaaccacttt ctgctatacg gaacacaagg catcaaccaa cctagtcggg acttcgtgca 6420 gagcactcga cactgagaga tagttggaga ctagcgaact acctcgtcga caccttctgg 6480 acacgttggg tacgsgagta cctccctacg ttgacgcgcc gcaccaaatg gttccagcca 6540 gtacgaccac tagaacctgg agatctggtc attgttgtcg aagaggggaa gcgtaacgga 6600 tggattcgtg ggagaattac ggaagttctc ccagggaagg atggacaagt acgaagagcg 6660 gtagtacaaa cggctcgtgg actgatgaac cgacctgcca ccaagttagc actgttggaa 6720 gttcgaagtt cggaagtcga aggatcgcaa gcccgggaaa ttggcgtacc ggaactacac 6780 gggcgggggg a 6791 // ID Polinton-1_HM repbase; DNA; INV; 20689 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Polinton DNA transposon; Maverick; Polinton-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-20689 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2101-2101 (2008). XX DR [1] (Consensus) XX CC TIR is 238-bp long, and TSD is 6-bp long. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1860..2774 FT /product="Polinton-1_HM_1p" FT /translation="MNDDEINEISEFQKEFKLFEEIRKFEFEESDYDSEDD FT TYYTSDEATYFTSDDEDEIQEKSIVPIPSKTVQIMELYKDKLKWINYVPTY FT NDIIYDDYPAIDNFFIKKHNKNRKFSFEKPFNVDINFTEDGKPSTIHFVEK FT LPIFCKNILNDNFCPFGYKCKYNHDIPYCNTVKERLICTKSSCQYRHIHIC FT YEYPKCKNKECVLFHPKKSELLKLRKAKTKLCKYILDKKECKNINCKYAHS FT IKEIEENVEICHYTPCKLVESKLKKNKKGEKILLYKNINDNKKCYFLHEGE FT SILNYIVRTLKK*" FT CDS 17568..17987 FT /product="Polinton-1_HM_14p" FT /translation="MDKHINVEGIILPNEPLTNFQLIKAAKKLKIKNFRGV FT FVRDELPKNTQNKECGILNTGDSSTNGFHWICWYKDGEEKLSFDSYALPPP FT VELVQYLHSPVYYNSKRIQFGDTSFCGHLCLYVLKKLDEGVDFQSIENSLY FT *" FT CDS 13064..13567 FT /product="Polinton-1_HM_12p" FT /translation="MSDKIKYTKKYLLNLLKDNNIQQTSKDPMALVMIAMD FT NNLIDKDSVKTNIITKKQKKPKIERVKRVKKEKIVKEKKSAGRPKFLPTKE FT KKERDPKYDYLTKIRTDPWGIKLTNMETGEVKNYKSFYDYNKKEKHSKTYI FT FNRNGKIVDGVKIEIIRASKEKDALSQK*" FT CDS 13714..17484 FT /product="Polinton-1_HM_13p" FT /translation="MENRNVKELKQIARERRIRYYYRMRKADLIRAIEQTT FT PILDIPIPNNITQNTLTPTQYVPISYINTIKNKVNSFANWIIDYVPEPIKN FT VANEKLTSLKSTVSNFFRYKREPLDSKKEKKIENRPIEFKLSQSSLKNVTK FT QFSAEGVKGYDALSFMKSAENNVIKILNSNKGSKIYIVLSCEMERTDLKTG FT ETITTIASFSTKAEVVLESTDLNDFYERAEQKILESLSAFQQLGSSWIFVS FT VKKMDINIIEYKPIKGKSYIPLPKELAAKKAIINMKNEDNECFKWCVARFF FT NPKEKNSERVDKDLKEQSEKLNWEKIKFPVSLQQITQFEKNNQDISVNVFG FT YENSVYPLRISENKNRQHKIDLLLISNDETNHYCLIKSLSRLLSSQISKNE FT HEMFYCRNCLLGFCTEESLSNHKLYCDTHDSVRIELPKPNTMIEFKNYNKS FT MRVPFVVYADFESFIKPINTCSPNPNESYTKQYQKHTPSSFCYYIKCFDEK FT IYQSKLVTFTASNEDEDVAQKFVNMLEEDVKKIYNDYLKFPKKMIFTMKDK FT NNFDNAKICHICEKDLNEDRVRDHCHITGKYRGAAHSDCNLQFKIPKFIPV FT LFHNLSGYDSHLFIKKLSEGGNINCIPNNEEKYISFSKELKVNEFMNREGK FT KVEVKLYLRFLDSFKFMAASLDSLTKNLSKDQCKNISRYYSGNELNLLLRK FT GVYPYEWVDSIDKLNETQLPPKESFYSRLNDEGISDEDYLHAQNVWKEFNC FT ETFRDYHNLYNESDVLLLADVFENFRDVCXNNYXLDPAWYYTAPGLAWDAA FT LKITEVKLELLSDYDMILMIKEGIRGGISMVPNRLGTANNKYMENYDESKE FT STYIQYLDANNLYGWAMSKPLPTHGFEWMNEEELKNWKSTSCILEVDLEYP FT EHLHDLHNDYPLAPERLKIDKVEKLVTNLNHKKNYIIHYENLKLYERLGIK FT LTKIHRGIKFEESAWLSKYIKLNTDLRTKATNDFEKDFFKLMNNSVFGKTM FT ENIENRVDVRLVSKREEAIKLASKPNYESRTIFDENLIAIHMKRTKLVYNK FT PIYLGMCILDLSKTLMYEFHYDYIKNKYGDKAKLLYTDTDSLIYEIKTKDF FT YADIANDIESKFDTSEFNKDHPAVQNGFKVGVNKKVIGMFKDESAGKQITE FT FIGLRSKLYSYKIDEEDKKRCKGVKRNVVKNYITHEDYKDCLMNKKDQMRK FT MNVIRSHCHDVYTEEINKIALSAEDDKRVIQEDGIHTLAYGHYKLK*" FT CDS 11040..11501 FT /product="Polinton-1_HM_16p" FT /translation="MLVINNKGYTHKYEVGGSGLFTPFINMFNKQALSNAL FT NASRIFASRAAATDLGKTAVDAAKSAGKELATSAISTAKEIVINKGKKLIE FT NVNKSSKLTPENKQELKNLINTLDNKLNEAVPDINKIMMGSSIKSKPVRIQ FT DLVKKHKGDGLRII*" FT CDS 9782..10711 FT /product="Polinton-1_HM_10p" FT /translation="MSKTIHITSNDTVINTNMAVPIELDEDKEYGIALKKL FT MTYNSFPNIIEGLNNTINITIEASDSPGFQQIKIPTGSYELRNINTIVTRY FT VMTTLKDQLAKYKESFKNVSSTEVENYKKKQKYDSIKLDPTSIIFEANFNT FT EKSQIIINKNSNIRIQFDDDSIKDLLGFNNRDRDGNLITFDSSKVYISDKV FT IDINHVNTIRVTNSLVTGSIIDGAYSNVIYSFYPSTPVGYKIVEVPNNPTY FT YRLLSKKIWDMKTTIMDQDGHILTLQGEPISIEYEIKEIVSKCKSEHLLER FT QVSLLEEQNELLKKQLR*" FT CDS 11553..12836 FT /product="Polinton-1_HM_11p" FT /translation="MVKSDIFNITDKLRFDEEIKKYEEYEFTPSVNSNLNS FT GEIRIFIENSDSLFHPHESYLEIEGRLVKADGTAYADDYAITLTHNGLMHL FT FERIEYKFYDSVVESVNFPGIATTMLGMLKYPNDFQQSKAMNQLWYKDTTA FT TADLVNNTGFXARQQFIIQKPTTKGSFEFSIPLRHIFGFCDDYDKVFYGLK FT HELXLLRRSDDNAIFRAAGVAAGKVNITRISLMMRRATPSLVADLELAKII FT KSQETLDIGFRSRFLDKTNVPQNTSFDWRLGLRTTEKPRYILVGFQTNREG FT NQEQNXSIFDHCDLXNMWIELNEERYPATNYNLSFPNMKITRAYRHASNFA FT EDYYNMTNLISLCGITPSDYRDLYPIMYFDVSKQSERMKDKTVNIKLKAEF FT NTPVPANTVIYALIISDRIAKITSNGNRLRFEY*" FT CDS 7297..8253 FT /product="Polinton-1_HM_8p" FT /translation="MSKKLPAQRKKSKNLKWTDELANELHKPVIKHFRKRK FT VIANGIDEIWAADLVDMQAYSKFNNGIKYLLMVIDIFSKYGWIIPLKSKTG FT IEVADAFNKIFKDRKCAKLWVDKGLEFYNKHVKALGVHLYSTENEEKSCVV FT ERWNRTMKEKMFKYFSANSTRKYIDVLDEMVKQYNNTKHSSIKMTPVEASN FT KKNENKVWLNLNSKVRSEHNKLKFTIGDKVRIIKKKELFEKGYTPRWTEEV FT FTVSQIQYTDPPTYKITDYNSEEIQGTFYEQELQKTSQEIFRIEKVIKKQG FT NKSFVKWYGYPDTFNSWVNNTELKAI*" FT CDS 5214..5798 FT /product="Polinton-1_HM_5p" FT /translation="MTKYYDTLINLSDSQKEKIKKAIESDTNASIKLTXQD FT LNGEHKIALTDSQRNRMKAAYLKNKGITLSLSRAQLKHNAKVNGGFLPLLG FT LIGSVIASKVVPAIATGLLTGTAAAAGSTIVNKIAGNGIIYVKKNGSGFKI FT RKVGKGVFLNPWNGSYNYLDGYYSKNGNEYSSVGRGLLLGENSPFKNIPLL FT GLLL*" FT CDS 6686..5895 FT /product="Polinton-1_HM_6p" FT /translation="MEIIDLSWNVNKSKRHNNELLPKSIRGIIVGKSGCGK FT TTLLLNLLLRPGWLDYNHLQVFGKSLFQPEYKILKNSFENKLPKEVILKLF FT ELQDQINKNNVNPEFIIKNISRDLKDKTDIECQFFETAEDVPEPENLNCIK FT NNLMIFDDLQLTKQNKCEKYYIRGRHSNVDCFYLAQNYFELPRRTIRENAN FT FICLFKQDSKNVNHIYNDHVSNDMKLDEFKKFCNEAWSKKHGFVVIDLSSD FT LLNGKYRSGLDLFYYPTSLWNR*" FT CDS 3482..4165 FT /product="Polinton-1_HM_3p" FT /translation="MQCNNECKQLYPDLNQFSENPSAPPLSIDQNFRLVKI FT NEIQNKIECERIRREILSKKYHKALRIVSAIDNSLLASTMTLGAVGIGFLT FT TIVGSPVAIGCEIAALGMGVLSLIGSQAIKKLTIKAEKHEKIKVLADAKLN FT TISSLISKAISDQYIDDVEFNMILSELEKFQAMTEQIRTKSKKDIDEQTKE FT SIFNKGVEKGTNDIIEALNMKFNKNQSIGVMNNTLKN*" FT CDS 6755..7342 FT /product="Polinton-1_HM_7p" FT /translation="MYCVKCKNKTDTLNLQHAVSKNKRNMICGNCAICGTR FT KCQFTKGTERAKKGGDLVESINKFTSNRKLPLQKFPGEMHIPGMNFAGPGT FT NLDERLTSTEMPKEWSMPVDRVDKTAYIHDLAYKHYPDKEHRNLADKMMLE FT QMDAIPNPTPREKLERKIIKPIISAKVRFGMGNKEPMSIKELCQKNYQRRG FT KKAKT*" FT CDS 4298..5155 FT /product="Polinton-1_HM_4p" FT /translation="MYAKATDIDTRLKIAKELDXLKKSVYQDSINEKLGYD FT XLQTNLEKLYKPIIDSQSGIKEGISKLENKADQIANTFSNYPALLDSETKA FT IMPPEVVDKMSLGPIATEYLKLYTTKNSKKVSDDIFGIHYNEGDKKFYIGK FT KPITIEGNDITIEGKKYTGTPGLWEIIVKSHPVRYTNDDRNKYKEILDQTD FT AIRSDLNSAKPRSSRSYKYTNVIKPIWEEMIGKSGKGVVILPSDPNALFEM FT LKLRLAALQAGNTGARNEIVAICDELLRQGQIDDDEYKALQKAI*" FT CDS 8536..9192 FT /product="Polinton-1_HM_9p" FT /translation="MSYVDIENFDEANLYVKEIKKFNKPVQYQKIVIEYKY FT SNFSEPLRVKTPKSLSYGVCENRDMKTKELTGYSIPFVCENDIFIAIIEAI FT EKFCRNKVIENEATLKKGGMKSLNINNLNILKYRNDDFNAPPVIYAKIPTD FT YKSKKMDQVFRNKNGDILGESLIKKRCDVVGCLAIEGIYIGSMIQSIQVKL FT LDAVVSKNEYKIERFYIKDVDLESEDEE*" FT CDS 18018..19730 FT /product="Polinton-1_HM_15p" FT /translation="MSMNIAGSNSMYGYTTDGLNININVGNTATTDVDLNS FT HKLINVADPTDAQDAATKNYVDNNQTSSSACLKLDGTNSMSANINLNSHKL FT INVVDPTNAQDAATKNYVDRLIPYQHKYKVLIIAGQSNTYAGREWGLSQSI FT TNVNYMNRRIVQFAMDNGQNDVLLPCSLRLDVNETDNNCWAGYGSILAGLI FT MQDAVGANTLNMINPDECLMILPCMLTAKSFSNDYFMPYGTGFKNLTRRIN FT YISSHYNAEFCCMTWSQGEDDSKTGWCDNYAYILMNFIQTIRDYIAYSSNQ FT FDTLPNSQIKANKMLFITFQMLSSWVQANSTTATQVQNALGNIINYSPYTA FT SISLDGMPPSLRSANFDGVHYDSRQQIFLANKFFEAIPIAMNNIYGGGYGT FT PYISGEVLLVRQVAGYLFLNNYYSAYRVFNSTPSTAQIYSSLFTNWQYKNK FT DGYYKYRLKYKINGVWNSIIFKQAFLPMMSMXYQNVAQLISTDTPFGCTEL FT GQGFCGLNLTTTKSMMNSGAVLTLDTSGGYWAPVCQNMNYVYNGQPAMPIF FT DNYIQNSTLPFVATELELYAIRD*" FT CDS 2867..3442 FT /product="Polinton-1_HM_2p" FT /translation="MYKRRMFLDYEKHEKYTITFNINSDIEHRAINDRHIK FT VFAQEDFTVKLNSYNNISLMCGVKIDKGHVLISLAENYKPTLNLLNSVVIE FT STDDIKITLYNNDNKDINIKKGDLLCCISSYNGVFNANYISELDKKALEQV FT RSQRDSGTKSQSDKESVKSILSSIGGIIKETICDEDKPSIDKSDKALFDFG FT K*" XX SQ Sequence 20689 BP; 8564 A; 2427 C; 2948 G; 6723 T; 27 other; agtagttaga gaatgccggt tctagctcat tttcattttc aaccatattt tcaaccatat 60 attaatatta atatatggtt caatgttatc catgttgtgt ataaattatt tagataagaa 120 ataaattttt tcaattttta aaaatgaaaa aaattgaatt tttttaaata agtgttaatt 180 tttatttctt gtcgaaaaat gtcacgtgat tttcttgctg tgacgtcaca tgacttttct 240 tgctgtttct gtaccttttt attctataga atctaaatat tcaagtactt caaggtgatc 300 atttttagca gcaattgatt ttgcataaaa tgtaccttta tatcctaatg aatctaaata 360 ttttagtact tcaaggtgac cattttttgc agcataatta attgcataat ctgttccttt 420 atatcctata gaatgtaaat atttaattac ttcaaggtga ccwttttttg cagcataatt 480 aattgcccaa tctgtacctt tatatcctaa tgaatgtaaa tatttagtac ttcaaggtga 540 cctttttttg cagcaaaatt aattgcatat tctgtacctt tatatcctat tgaatgtaaa 600 tatttaagta cttcaaggtg wccattttct gcagcataat taattgcata ttctgtacct 660 ttatatccta ttgaatgtaa atatttaagt acttcaagrt gaccattttc tgcagcataa 720 ttaattgcay attctgttcc tttatatcct attgaatgta aatatttaag tacttcaagg 780 tgaccatttt ttgcagcata attaattgca tattctgttc ctttatatcc tatagaatgt 840 aaatatttaa atacttcaag attatttgat gcagcaaaaa ttgtatcttc tgttccttta 900 tatcctattg aatgtaaata ttttagtact tcaagatgac cattttttgc agcataatta 960 attgctaaat ttgtaccttt atatcctatt gaatgtaaat atttaagtac tttaatgtat 1020 ccattttttg cagcattatt aattgtatct tctgtacctt tatatcctaa ttaatgtaaa 1080 tatttaagta cttcaaggtc accatttttt gcagcattat caattgccca atctgtacct 1140 ttatatccta atgaatgtaa atatttaagt actttatgat taccattttt aaagcaaaat 1200 caattgcttt ttctgtccct ttttctccta attgaattaa taatttaatt aratcaattc 1260 tacctatttt tgcagcaaaa tcaaccttat taccttgttt aaacccatat ttttttaata 1320 tttcattaat cttaaaatta ttttctaatt ccattctttt tatatataaa agaaaattat 1380 ttaactcatt ttcaattttt tttattttta caaaataaaa taatgaaaaa tatttaattc 1440 tgtgtacttt catcatcttc attaacatat tcagtagttt ctgtaatttc agtagttact 1500 gtagtaactg tagtaacttc ttcatcttca tcatcttcat tatcttcatt atcttcatca 1560 tcttcatctt cttcagaatc tgattcttct tcatattcag tagtttctgt aatttcagta 1620 acttcagtag tagtaactgt agtaacttca gaatcttcag tcatttcaat atcatgagta 1680 tcaataatat ttaatttttc ttctattaat cttaatcttt catctattaa ttttaagttt 1740 aagcataatt cttcaagttt cttttctaga tccattcttt ttatatacat aagaaaatta 1800 atttaactca ttttaaattt ttttgattta aaatatttta tcaaataata aaaataaaaa 1860 tgaatgatga tgagattaat gaaatatctg aatttcaaaa agaatttaaa ctttttgaag 1920 aaataagaaa atttgaattt gaagaatcag attatgactc tgaagatgac acttattata 1980 catctgatga agctacttat tttacatctg atgatgaaga tgaaattcaa gaaaaatcta 2040 tagttccaat tccttcaaaa acagtacaaa ttatggaatt gtataaagat aaactgaaat 2100 ggattaatta tgttccaact tataatgata ttatttatga tgattatcct gctatagata 2160 atttttttat taaaaaacat aacaaaaaca gaaaatttag ttttgaaaaa ccatttaatg 2220 ttgatattaa ttttactgaa gatggaaaac cttcaacaat tcattttgta gagaaactac 2280 caattttctg taaaaatatt ttaaatgata atttctgtcc ttttggatat aaatgtaaat 2340 acaatcatga tattccttat tgtaatactg taaaggaaag actaatatgc actaaatctt 2400 catgtcaata taggcatatt catatttgct atgaatatcc taaatgtaag aataaagagt 2460 gtgtattatt tcatcctaaa aagtcagaat tattaaaatt aagaaaagct aaaactaaat 2520 tatgcaaata tattcttgat aaaaaagaat gtaaaaatat aaattgcaaa tatgctcatt 2580 ctataaaaga aattgaagaa aatgtagaaa tttgtcatta tactccatgt aaattagtag 2640 agagtaaatt aaagaaaaat aaaaaaggag aaaaaatatt attatataaa aatattaatg 2700 ataataaaaa gtgttatttt ttacatgaag gtgaaagcat tttaaattat attgttagaa 2760 ctttaaaaaa atgaaaaaaa tattatattt aatatttatg acttaggtct ctttgagatc 2820 tagggatcac atagtgatct aaaaaaatct ttacatttat aaaaaaatgt ataagagaag 2880 aatgttttta gattatgaaa agcatgaaaa atatacaata acttttaata ttaattcaga 2940 tatagaacat agagcaataa atgatagaca tattaaagta tttgcacaag aagattttac 3000 agtaaaatta aattcataca ataacatttc tttaatgtgt ggagttaaaa tagataaagg 3060 acatgtattg atttcacttg cagaaaatta taaacctact ctaaatttac taaattctgt 3120 tgtaatagag tctactgatg atattaaaat aactttatat aataatgata ataaagatat 3180 aaacattaaa aaaggagatt tgttatgctg tatttcatct tataatggtg tttttaatgc 3240 aaattatata tcagaacttg ataaaaaagc tttagaacaa gttagatcac aaagggactc 3300 aggaactaaa tcacaaagtg ataaagaatc tgtaaaatct atattatcaa gtataggggg 3360 aataattaaa gaaactatat gtgatgaaga taaacctagt atagataaat ctgacaaagc 3420 tctatttgat tttggtaaat aaaagtgatt taaaacaaat ttttctttat agaaaaaaga 3480 aatgcaatgt aataatgaat gtaaacagct ttatccagat cttaaccagt tttcagaaaa 3540 tccatctgca ccaccactat ctatagatca gaattttaga ttagttaaaa taaatgaaat 3600 tcaaaataag atagaatgtg aaagaattag aagagagatc ttaagtaaaa aatatcacaa 3660 agctttaaga atagtttcag ctattgataa ttcattacta gctagcacaa tgacacttgg 3720 agctgttggt attggttttt taacaacaat agttggatca ccagtagcta ttggatgtga 3780 aattgcagca ttaggtatgg gtgtattatc attaatagga tcacaagcta ttaaaaaatt 3840 aacaataaaa gcagaaaagc atgaaaagat taaagtacta gctgatgcaa aactaaatac 3900 aataagtagt cttatttcta aagctatatc agatcaatat attgatgatg tagaatttaa 3960 tatgatactt tcagaacttg aaaagtttca agctatgact gaacaaatta gaacaaaatc 4020 taagaaagat atagatgagc agactaaaga gtcaatattt aataaaggag tagaaaaagg 4080 aacaaatgat attatagaag cgctaaatat gaaatttaac aagaatcaaa gtataggagt 4140 gatgaataat actcttaaga attaaataaa aaaacatata ttatgtatta cttgaacata 4200 tattatgtat tacttgaaca taatatatgt tgatttagat ccctttaggg atctgaggat 4260 cacatagtga tttaaaaaaa ttattgctta tataaaaatg tatgcaaaag ctacagatat 4320 tgatacaaga ttaaaaatag caaaagaact tgatrggtta aagaaaagtg tatatcaaga 4380 ttctattaat gaaaagttag gttatgatrc tttacagact aacttggaaa aattgtacaa 4440 accaataata gatagtcaat ctggaataaa agaaggtata tctaaattag aaaataaagc 4500 tgatcaaata gctaatactt tttcaaatta tcctgcttta ctagattctg aaactaaagc 4560 aatcatgcca cctgaagttg tagataaaat gtcgttagga cctatcgcaa cagaatattt 4620 aaagttatat actactaaaa atagtaaaaa agtatcagat gacatatttg gtatacatta 4680 caatgaaggt gataaaaaat tttacattgg taaaaaacca ataactattg aaggtaatga 4740 tataactatt gaaggtaaaa aatacactgg tacacctggt ctttgggaaa tcatagtaaa 4800 atctcatcct gttagatata ctaacgatga tcgtaataag tataaagaaa ttttagatca 4860 aacagatgca attcgctctg atttaaattc agctaaacca agatcatcta gaagttataa 4920 gtacactaat gtaataaaac ctatttggga agagatgatt ggaaagtctg gaaaaggagt 4980 tgtcattctt cctagtgatc caaatgcact atttgagatg cttaaattac gcttagctgc 5040 tttacaagca ggaaatactg gagcwagaaa tgagattgtt gctatttgtg atgagttgtt 5100 acgacagggt caaatagatg atgatgagta taaagcatta cagaaagcaa tttagatccc 5160 ttcgggatcc aaactcatgc aaatgaatta aaaaaattat tttatatata aaaatgacaa 5220 aatactatga tactcttatc aatctttcag atagtcagaa agaaaaaatt aaaaaagcaa 5280 ttgaatctga tactaatgct agtataaaac taacayatca agatttaaat ggtgaacata 5340 aaatagcttt aacagattca caaagaaaca gaatgaaagc agcttattta aaaaataagg 5400 gtataacact ttcactgagt agagcacaac ttaaacataa tgctaaagta aatggtggat 5460 ttctacctct actaggatta attggaagtg taattgcatc aaaagttgtt ccggctatag 5520 caacaggatt attgactgga acagctgcag ctgctggaag tactatagtt aataaaattg 5580 ctggaaatgg tattatttat gtgaaaaaga atggtagtgg atttaaaatt agaaaagtag 5640 gaaaaggagt atttttaaac ccatggaatg gtagttataa ttatttagat ggatattatt 5700 ctaaaaatgg aaatgaatat tcatcagtag gtagaggatt gttattagga gaaaattctc 5760 catttaagaa tataccatta ttaggtcttc ttctttaggt cacttcgtga tccaggcttc 5820 gctaagttct gaaagaacca atgcttcgct agatcttcat aaaaaaaata tatatttttt 5880 taaaaaatat atatttatcg attccacaat gaagttggat aataaaataa atctaatcca 5940 cttctatact ttccatttaa taaatcagaa gacaaatcta taacaacaaa tccatgcttt 6000 tttgaccaag cttcattaca aaacttttta aactcatcta atttcatatc atttgaaaca 6060 tgrtcattat aaatatgatt tacattttta gaatcttgtt taaacaagca tataaaattt 6120 gcattttctc ttatcgttct cctaggcaac tcaaaataat tttgtgccaa gtagaaacaa 6180 tctacattgc tatgtcttcc tcttatatag tacttttcac acttgttttg ctttgttaat 6240 tgtaaatcat caaatatcat taaattattt ttaatacagt taagattttc aggttctggt 6300 acatcctctg ctgtttcaaa aaattgacac tctatatctg ttttatcttt taaatctctt 6360 gaaatgtttt taataataaa ctctggattt acattgtttt tatttatttg atcttgcaac 6420 tcaaatagtt taagaataac ttcttttggt aatttatttt caaaagaatt ttttaatatt 6480 ttgtattcag gttgaaaaag tgactttcca aaaacttgta aatgattata gtctaaccaa 6540 ccaggtctta agagcaagtt taataataat gttgtctttc cacaacctga ttttccaaca 6600 ataatccctc ttatactttt aggtaataac tcattattat gacgtttact tttatttaca 6660 ttccatgata aatctataat ttccattttt tatttatgaa aagtattttt ttaaatagat 6720 ttgtgttaaa taaattaaaa tacatatata aaaaatgtac tgcgttaaat gtaaaaacaa 6780 aacagataca ctcaacttac aacatgctgt gagtaaaaac aaaagaaata tgatatgtgg 6840 aaattgtgca atctgtggaa caagaaaatg tcaatttaca aaaggaactg aaagagctaa 6900 aaaaggaggt gacttggtag aatcaattaa taaattcaca agtaatagaa aattaccttt 6960 gcaaaagttt ccaggtgaaa tgcatatacc aggtatgaac tttgcaggac ctggtactaa 7020 tttagatgaa agattaactt ctactgagat gcctaaagaa tggagcatgc ctgtagatag 7080 agttgataaa acagcttata ttcatgatct tgcttataaa cattatccag acaaagagca 7140 tagaaattta gctgataaaa tgatgcttga acaaatggat gctataccta atcctacacc 7200 tcgtgaaaaa ttggaaagaa agattataaa acctatcata tctgctaaag taaggtttgg 7260 tatgggaaat aaagagccaa tgagtattaa agaattatgt caaaaaaact accagcgcag 7320 aggaaaaaaa gcaaaaacct gaaatggact gatgaacttg caaatgaact tcataaacct 7380 gttataaaac attttagaaa aagaaaagta attgcaaatg gaatagatga aatatgggct 7440 gctgatttag ttgacatgca agcttattct aaatttaata atggtataaa atacttatta 7500 atggttattg atatattttc aaagtatgga tggattatcc cattgaaaag taagacaggt 7560 attgaagttg ctgatgcatt taataaaata tttaaagata gaaagtgtgc taaattatgg 7620 gtagataaag gcttagaatt ttataacaag catgttaaag cattaggtgt tcatctctat 7680 tcaactgaaa atgaagaaaa aagctgtgta gttgaacgat ggaatagaac aatgaaagaa 7740 aaaatgttta aatacttttc tgctaattct actagaaaat acattgatgt tttagatgaa 7800 atggtaaaac aatacaacaa cacaaagcat tcttcaatta aaatgactcc agttgaagca 7860 agcaataaaa agaatgaaaa taaagtttgg ttaaatttaa atagtaaagt aagatctgaa 7920 cataacaagc ttaagtttac aataggtgat aaagtaagaa taattaaaaa gaaagaatta 7980 tttgaaaaag gatacacacc aagatggact gaagaagtat ttacagtatc acaaattcaa 8040 tacacagatc cacccactta taaaataact gattacaata gtgaagaaat acaaggtact 8100 ttttatgaac aagaactgca aaaaacaagt caagagatat ttagaattga aaaagttatt 8160 aaaaaacaag gtaataaatc atttgtaaag tggtatgggt atcctgatac ttttaactca 8220 tgggttaata atacagaatt gaaagcgatt tagatcyctt tgagatccag ggttctttta 8280 gaacctagcg aagcattgat cacatagtga tttaaaattt ttttactatt aaaaaaatga 8340 acaactcaaa tgtaaatgtg aatatatgtg taacagctac ttctggtaat gctaatacaa 8400 atactactgt aacaacaaca acttcaaatt acaactctag caaatgatgt aagataaaat 8460 ttatatattt ttacaagaaa attaaaataa tttttttaaa atgagttaaa taattttctt 8520 ttatatataa aaagaatgtc atatgtagat attgaaaatt ttgatgaagc aaatttatat 8580 gtaaaagaaa taaaaaaatt taataaacct gttcaatatc aaaaaatagt aattgaatac 8640 aaatactcta acttttcaga acctttaaga gtaaaaactc caaaatcgtt atcatatgga 8700 gtttgtgaaa atagagacat gaaaacaaaa gaacttacag gatactcaat tccttttgta 8760 tgtgaaaatg atatatttat cgcaattatt gaagctattg aaaaattttg tagaaataaa 8820 gttattgaaa atgaagcaac gttaaaaaaa gggggtatga aaagcttaaa tattaacaac 8880 ttgaatattt taaagtatag aaatgatgat tttaatgcac ctccagttat atatgcaaaa 8940 ataccaacag attataaaag taaaaagatg gatcaagtat tcagaaataa aaatggtgat 9000 attttaggag aaagtcttat taaaaaaaga tgtgatgttg ttggatgctt agcaattgaa 9060 ggtatatata taggatcaat gattcaatct attcaagtaa aattattaga tgcagttgta 9120 tctaaaaatg agtataagat agaaagattt tatattaaag atgtagactt agaaagcgaa 9180 gatgaagagt aaatattaat tttatatatt agcaaataat atataaaatg agatcgcaaa 9240 gcaatctagc gtctctttga aattaatcta aataattatt ataaatataa aaaatgtata 9300 ttgaaataga aaaaaataga caaaatccaa ctacaattaa atttgatgat attattaaac 9360 ttgatgaaaa taaagagtat gaaatggcac ttgatagatt tcaaactatt ctctatagtg 9420 attcaactaa aaacaaaaaa gtatacatat taaataatct tgtagatgga aaaattgtat 9480 ctaataatgg caatgctaag aaaacgactg taatgtattc atttatacct attattgttt 9540 ctaaaaattc agttattaat gaacaaccga tatctaagaa atattttcct ttaaaaacaa 9600 actccatttc agaaatgaaa attgatatca ttgataactt taataaacct atagatttta 9660 ataatgaaag ttataccata gaatttcata taagagagag aaaaaagtag attctaaaag 9720 aactaatata taaatttttt tatattgaaa acaatataaa aaattattac atatataaaa 9780 aatgtcaaaa acaatacata taacttcaaa cgatactgtt ataaatacta atatggcagt 9840 tcctattgaa ctagatgaag ataaagagta tggaatagct ctaaaaaaat taatgactta 9900 taatagtttt ccaaatatta ttgaaggatt aaacaatact ataaatatta ctatagaagc 9960 tagtgatagt cctggttttc aacaaattaa gatcccaaca ggatcctatg aacttagaaa 10020 tataaataca attgtaacta gatatgtaat gactactctt aaagatcaac ttgctaaata 10080 taaagagtct tttaaaaatg tttcaagcac agaagttgaa aattataaaa aaaaacaaaa 10140 gtatgatagt ataaaacttg atcctacatc tattatattt gaagcaaatt ttaatactga 10200 aaaatcacag attattatta ataaaaatag caatattaga atacaatttg atgatgactc 10260 cataaaagat cttttaggtt ttaataatag agatcgtgat ggtaatctta taacatttga 10320 ctcatcaaaa gtatatattt cagataaagt aatagatata aatcatgtta atactattcg 10380 tgtaacaaat agtcttgtaa caggatcaat tatagatggt gcatattcaa atgttatata 10440 ttcattttat ccatctacgc cagttggtta taaaattgta gaagtaccaa ataatccaac 10500 ttattataga ttattatcaa aaaaaatttg ggatatgaaa acaacaataa tggatcaaga 10560 tgggcatata cttactttac aaggagagcc gataagtatt gaatatgaaa taaaagaaat 10620 tgtgtcaaaa tgtaaatcag aacatctttt agaaagacaa gtatctcttt tagaagaaca 10680 aaatgaactt ttaaaaaaac aattaagata atagcaaagt gaattattga tcataatcac 10740 aaagcgattt aaaaaaatat atatagtaaa taaaatgtta gatataaata aaaaagcata 10800 tactcacaag tatgtagtag gtggaagtgg agtatttaca gatataaata atatgtataa 10860 tcaggcaaaa tcttctgatt tttttagagc attatctggt atgaaaacaa tgttaactaa 10920 tcattttaat aaaaatggac ctataataaa ataagtctct ttgagactca gattcatgca 10980 agcgatttgg caaagcccag atcacagagt gatttaaaaa aatatatata gtaaataaaa 11040 tgttagttat aaataataaa ggatatacac acaagtatga agtaggtgga agtggactat 11100 ttacaccttt tataaatatg tttaataagc aagcactaag taatgcactt aatgcatctc 11160 gtatttttgc tagtagagca gctgctacag acttgggaaa aactgctgta gatgcagcaa 11220 agtcagcagg gaaagaactt gcaacttctg caatatctac tgctaaagaa attgtaataa 11280 ataaaggtaa aaaattaata gaaaatgtaa ataaatcatc taagttaaca cctgaaaata 11340 aacaagaact taaaaattta ataaatactt tagataataa attaaatgaa gcagtgccag 11400 atattaataa aatcatgatg ggatcatcaa taaaaagtaa acctgttaga atacaagatt 11460 tagttaaaaa acataaaggc gatgggctta gaataatata attttataaa tagctaaaag 11520 tgatttaaaa aaattatatt gcatatataa aaatggtaaa atctgatata tttaatatta 11580 ctgataaatt gagatttgat gaggaaatta aaaaatatga agaatatgaa tttacaccct 11640 ctgttaattc taatttaaat tcaggagaaa taagaatttt tattgaaaac agtgattctt 11700 tgtttcatcc acatgaatcc tatttagaaa ttgaaggaag gcttgtaaaa gcagatggaa 11760 ctgcatatgc agatgattat gcaataacac taacacacaa tggacttatg catttatttg 11820 aaagaattga atataagttc tatgattcag tcgttgaatc agttaatttt ccaggtatag 11880 caactacaat gcttggaatg ttaaaatatc caaatgactt tcaacaatca aaagcaatga 11940 atcaattgtg gtataaagat actacagcaa cagcagattt agttaataat acaggatttt 12000 yagcaagaca gcaatttatc attcaaaaac caactacaaa aggttcattt gaatttagta 12060 ttccattaag acatattttt ggattttgtg atgactatga taaagttttt tatggattaa 12120 agcatgaact ttwtctgtta agaagaagtg atgataatgc aatttttaga gctgctggtg 12180 tagctgcagg aaaagtaaat attactagaa tatcattaat gatgagacgt gcaactccat 12240 ctcttgtagc agatttagaa cttgctaaaa taattaaatc acaagaaaca cttgatatag 12300 gttttagatc aaggttctta gataaaacta atgttccaca aaatacatca tttgattgga 12360 gattaggatt aagaacaact gaaaaaccta gatatatact tgttggtttt caaacaaata 12420 gggaaggaaa tcaagaacaa aatkcttcaa tatttgatca ttgtgatttg araaacatgt 12480 ggattgaact taatgaagaa agatatcctg caactaatta taatttatca tttccaaata 12540 tgaaaattac tagagcatat agacatgcat caaattttgc tgaagattat tataatatga 12600 ctaatcttat tagtctatgt ggtatcactc cttcagacta tagagattta tatcctatta 12660 tgtattttga tgtaagtaaa caatcagaaa gaatgaaaga taaaacagta aatattaaac 12720 ttaaagcaga atttaataca cctgttccag caaatactgt aatatatgct ctcatcattt 12780 cagatagaat agcaaaaatt acttctaatg gaaatagatt aagatttgaa tattaggtca 12840 ctawgtgatc aatgctatgc taggttccaa aggaacccag acttcgttag gtcttgaaaa 12900 ractwatgct tcactaagtc ggaacccata gaaactaggt taaacgctaa aatatagtaa 12960 agaaaattaa ttattttaaa aaaagtatat gcaataaaaa aaatactatt atttaaatta 13020 tttgaaaata atttaaataa attttcttta tatataaaaa agaatgagtg ataaaataaa 13080 atatacaaaa aagtatttat taaatttact aaaagataat aatatacaac aaacatctaa 13140 agatcctatg gcattagtta tgattgctat ggataataat ttaattgata aagattctgt 13200 aaaaacaaat atcattacta aaaaacaaaa aaaaccaaag attgaaagag ttaaaagagt 13260 taaaaaagaa aaaatagtaa aagaaaaaaa atctgctgga agaccaaaat ttcttccaac 13320 aaaagaaaaa aaagaaagag atcctaaata tgattatctc acaaaaatta gaacagatcc 13380 atggggaatt aaacttacta atatggaaac tggtgaagtt aaaaattaca aatcatttta 13440 tgactataat aaaaaagaaa agcactctaa aacatatatt tttaatagaa atgggaaaat 13500 tgtagatgga gtaaaaattg aaataattag agcaagtaaa gaaaaagatg cattatcaca 13560 aaaataaaat agataatgaa aaataaatta tttttataga ctttttctat aaaaatataa 13620 aaagaaaacg tttttttgct atcatttaat tttctttaca taaaaaatag aaaaattgaa 13680 aatgatataa ataattttct tgtgtatata aaaatggaaa atagaaacgt aaaagaatta 13740 aaacaaatcg caagagagcg aagaattaga tattattata gaatgagaaa agcagatctc 13800 attagagcaa ttgagcaaac aactcctata ttagatattc caatcccaaa caacattaca 13860 caaaatactt taacacccac acaatatgtt ccaattagtt atattaatac tattaagaac 13920 aaagtgaatt catttgctaa ttggattata gactatgttc ctgaacctat taaaaatgtt 13980 gctaatgaaa aacttacatc gttaaagtct acagtttcta atttctttag gtataaaaga 14040 gaacctttgg actctaaaaa agaaaaaaag atagaaaaca gaccaataga atttaaatta 14100 tcacaatcat ctcttaaaaa tgtaactaaa caattttcag ctgaaggtgt taaaggatat 14160 gatgctttgt cctttatgaa atcagctgaa aataatgtca ttaaaatact aaacagcaat 14220 aaaggatcaa agatctatat tgttctttca tgtgaaatgg agcgtactga tcttaaaaca 14280 ggagaaacta taacaactat tgcgtcattt tcaacaaaag cagaagtcgt attagagtca 14340 acagatttaa atgactttta tgaaagagct gaacagaaaa ttttagaatc cttgtcagca 14400 tttcaacaat taggatcaag ttggatattt gtttcagtta aaaagatgga cattaatatc 14460 attgaatata agcctattaa aggtaaatct tacattcctc ttcctaaaga gttggcagct 14520 aaaaaagcaa ttattaatat gaagaatgaa gacaacgaat gctttaagtg gtgtgttgct 14580 agatttttta atcctaaaga aaaaaattct gaaagagtag ataaagattt aaaagaacaa 14640 tctgaaaaat taaattggga aaaaataaaa ttcccagtat cactacaaca gattactcaa 14700 tttgagaaaa ataatcaaga catcagtgtt aatgtctttg gttatgaaaa ttctgtttat 14760 cctttaagaa tatctgaaaa caaaaataga caacataaaa tagatttatt actcatttca 14820 aatgatgaaa ccaatcacta ttgtttaata aaaagtttaa gtagattact ttcatcacaa 14880 atatctaaaa atgaacatga aatgttttat tgtagaaatt gcttgttagg attttgtact 14940 gaagaatctt tgtcaaatca taaattgtat tgtgatactc atgattcagt acgaattgag 15000 cttccaaaac caaatacaat gattgagttt aaaaattata acaagtctat gagagtgccc 15060 tttgtagtct atgcagactt tgaaagtttt ataaaaccaa tcaacacttg ctcacctaat 15120 ccaaatgaat cttatacaaa gcaataccaa aagcatacac ctagttcatt ttgttactat 15180 attaaatgct ttgatgaaaa aatatatcaa agtaaactag ttacattcac tgcaagtaat 15240 gaagatgaag atgtagcaca aaaatttgtt aatatgctag aagaagatgt aaaaaagatt 15300 tacaacgatt atttaaaatt tccaaaaaag atgatattta ctatgaaaga caaaaataat 15360 tttgataatg caaaaatatg tcacatttgt gaaaaagatc ttaatgaaga tagagtacga 15420 gatcattgtc atattactgg aaaatataga ggtgctgcac atagtgattg taatttacaa 15480 tttaaaattc caaaattcat tccagtacta tttcacaatt tatctggtta tgactctcac 15540 ttgttcataa agaaattgtc agaaggagga aatataaatt gtatacccaa caatgaagaa 15600 aaatatatta gcttttctaa agaacttaaa gtaaatgaat ttatgaatag agaaggtaaa 15660 aaagttgagg ttaaactgta tctacgtttt ctagatagtt ttaaatttat ggctgctagt 15720 ttagatagtt taactaaaaa tttgtcaaaa gatcaatgta aaaatattag tagatactat 15780 tctggaaatg aactcaattt attgttaaga aaaggtgttt atccttatga atgggttgac 15840 tctattgata aattaaatga aacacaatta ccaccaaaag aatcatttta ttcaagatta 15900 aatgatgaag gaataagtga tgaagattat ttacatgctc aaaatgtatg gaaagagttt 15960 aattgtgaaa cattcagaga ttatcataat ttatataacg aatctgatgt attattacta 16020 gctgatgtct ttgaaaattt tagagatgtm tgtwtwaaca attacaratt agatcctgct 16080 tggtactata cagcaccagg attagcttgg gatgctgcgt tgaaaataac agaagtaaaa 16140 ttagaattat taagtgatta tgatatgatc ttaatgataa aagaaggaat aagaggtgga 16200 attagtatgg ttcctaacag gttaggaact gctaacaata aatatatgga aaattatgat 16260 gaaagtaaag aatctactta catacagtat ctagatgcaa ataatttata tgggtgggca 16320 atgagtaaac cccttccaac acatggattt gaatggatga atgaagaaga attaaaaaat 16380 tggaaatcta cttcttgcat attagaagta gatttagaat atcctgaaca tttacatgat 16440 ttacataatg attatccact tgctcctgaa agattaaaaa tagacaaagt tgaaaaatta 16500 gtaactaatt taaatcataa gaaaaattat atcatacact atgaaaactt aaaactgtat 16560 gaaagattag gaataaaatt aacaaaaatt catagaggaa taaagtttga agaaagtgca 16620 tggttaagta aatacattaa actcaataca gatctaagaa caaaagcaac aaatgatttt 16680 gaaaaagact ttttcaaact tatgaacaac tcagtatttg gaaagacaat ggagaacatt 16740 gaaaacagag ttgatgtgag attagtttct aaaagagaag aagctattaa attagcatca 16800 aaaccaaatt atgaaagtag aacgatattt gatgaaaatt taatagctat tcatatgaaa 16860 agaacaaaac tagtatataa taagcctatt tatttaggta tgtgtatttt agatttgagc 16920 aagactttaa tgtatgaatt tcattatgat tacataaaaa ataaatatgg agataaagct 16980 aaactattat atacagatac agattcatta atctatgaaa taaaaacaaa agacttttat 17040 gcagatattg caaatgatat tgaaagtaaa tttgatacaa gtgaatttaa taaagatcat 17100 ccagctgttc aaaatggatt taaagtaggt gtaaataaga aagtaatagg aatgtttaaa 17160 gatgagtcag caggaaaaca aattacagaa tttataggac tcagatcaaa attgtactct 17220 tacaaaatag atgaagaaga taaaaagaga tgtaaaggag taaagagaaa tgtagtaaaa 17280 aattatatta cacatgaaga ttataaagac tgccttatga ataagaaaga tcaaatgaga 17340 aaaatgaatg tcataagatc tcattgtcat gatgtttata ctgaagaaat aaacaaaata 17400 gcattgagcg cagaagatga taaaagagta attcaagaag atggtattca cacattagca 17460 tatggacatt ataaattaaa ataaacaagt aaccaggcag gcaaacacac aggtaagtta 17520 ttttttatat ctaaagagat ctaaaaaaat ctaacgaagt aataaaaatg gacaaacata 17580 tcaatgttga aggaattata ttaccaaatg aacccttaac aaactttcaa ttaataaaag 17640 cagcaaaaaa attaaaaatt aaaaatttta gaggtgtatt tgtaagagat gaattaccta 17700 aaaatacaca aaacaaagaa tgtggaatat taaatactgg agattcaagt acgaatggat 17760 ttcattggat ttgttggtat aaagatggag aagaaaagtt aagttttgac tcttatgcac 17820 taccacctcc agttgaactt gttcaatatt tacacagtcc tgtttattat aatagtaaaa 17880 gaattcagtt tggagacact tctttttgcg gacacttgtg cctttatgta ttaaaaaagc 17940 ttgatgaagg tgttgatttt caaagtattg aaaattcatt atattgaatt aaaaaaaata 18000 tgtatagtat ataaaaaatg agtatgaata tagccggtag taattcaatg tatggttaca 18060 caacagatgg attaaatatc aacattaatg ttggcaatac agccactact gatgttgatt 18120 tgaattcaca taaattgatc aatgtagctg atcctacaga tgcacaagat gcagctacaa 18180 agaattatgt agataataat caaacttcta gcagtgcatg tctaaaacta gatggaacta 18240 atagtatgag tgctaatata aatttgaatt cacataaatt gatcaatgta gttgatccta 18300 ctaatgcaca ggatgcagct acaaagaatt atgtagatcg cttaattcct tatcaacata 18360 aatataaagt tttgattata gctggtcaat caaatacata tgctggtaga gaatggggtt 18420 tatctcaatc aataactaat gtaaattata tgaacagacg aatagtccaa tttgcratgg 18480 ataatggaca aaatgatgtg ttacttcctt gttcattgag gctagatgta aatgaaacag 18540 ataataattg ttgggcagga tatggttcaa tacttgctgg acttataatg caagatgctg 18600 taggtgcaaa tacattaaat atgataaatc cagatgaatg tttaatgata cttccatgca 18660 tgctaactgc taaaagtttc agtaatgatt attttatgcc gtatggaact ggttttaaaa 18720 atttaactcg tagaattaat tacatttcat cacattataa cgcagagttt tgctgtatga 18780 cttggagtca aggagaagat gattctaaaa ctggttggtg tgataactat gcatatattc 18840 taatgaattt tatacaaacc attagagatt atattgcata ctcatcaaac caatttgata 18900 cattgccaaa ctcacaaata aaagcaaata aaatgctatt cattacattt caaatgctat 18960 cttcatgggt tcaagcaaac tcaactacag caactcaagt tcaaaatgca ttaggaaata 19020 ttataaatta ttctccatat acagctagta tatctctaga tggtatgcct cccagtctta 19080 ggagtgcaaa tttcgatgga gtacattatg attcaagaca acaaatattt ttagcaaaca 19140 aattttttga agcaatacca attgcaatga ataatatata tggtggtggt tatggaacac 19200 catatatttc tggagaagta ttattagtta gacaagttgc tggatattta tttcttaaca 19260 attattatag cgcctataga gtatttaatt ctactccatc tactgctcaa atatatagtt 19320 cattatttac taattggcag tataaaaaca aagatggcta ttataaatat agacttaaat 19380 ataaaattaa tggtgtatgg aatagtatta tatttaaaca agcattcctt ccaatgatgt 19440 ctatggmtta tcaaaatgtt gcacaactca tatcaacaga tacaccattt ggatgtacag 19500 aattaggaca aggattttgc ggtttaaatt tgacaactac aaaaagcatg atgaattctg 19560 gagcagtact tacactagat acaagtggtg gttattgggc acctgtatgt caaaatatga 19620 actatgtata taatggacaa cctgctatgc ctatatttga taattatata caaaatagta 19680 ctcttccatt tgtagcaaca gaactagaat tatatgctat aagagactaa taaatacaaa 19740 tctagatttt gcataagtaa atccagcgca tgcgcattaa aacaagaaaa atgagttaaa 19800 taattttctt gtgtatataa aaaagaatgg aaaacgaaat gcaaaaagaa attaatccag 19860 catatttaaa ctttaaaaaa agattagaar taatgagtaa agatttaata ttaaaatcta 19920 aaaaattcag taaagaaaca aaatcagaac tagaatcaga tttagaatac ataaagttaa 19980 taataaaaaa tatatataat aatgttaaaa aagaattatt agaattaata araccagaat 20040 tattaatatt agagtgtgta atgatatcaa caagggatat gttgttaaaa atatgcgatg 20100 atgaaaatgt agaaatatta aagttaaaag cagattattt aatagaaata ataaracaaa 20160 tattagatat aataaagtat ttctaaatct aaataaaatc tagcgcatgc gcacagatct 20220 gaaagatcaa ttagcgcaca agaaaaaaaa ttgaaaataa atttctttac ataaattata 20280 agaaatggat tgtacaagta aagatgtaga tattaaatta acaagttata aaagaagatt 20340 aagtttagct tgttatttgc aaaaaaaatt taaattacaa gctgaaaagt tagaaaagga 20400 aagattagaa gaagaaaagt taaaaacacc aaataatatt tattattaga aaaagtcatg 20460 tgacgtcaca gcaagaaaat cacgtgacct gtgacgtcat tttttcgaca agaaataaaa 20520 attaacactt attttaaaaa attcaatttt tttaaatttg aaaaattatt tcttatctaa 20580 ataatttata cacaatatgg ttaacattga accctatatt aatattaata tatggttgaa 20640 aatatggttg aaaatgaaaa tgagctagaa ctggcatatc tttactact 20689 // ID HYDARGOS1_EM repbase; DNA; INV; 1865 BP. XX AC . XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Hydargos-Em1 (HYDARGOS1_EM), a Tc1-like member of the Tc1/mariner DE DNA transposon superfamily from the single-celled eukaryote DE Entamoeba moshkovskii. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; Hydargos-Em1; KW HYDARGOS1_EM. XX OS Entamoeba moshkovskii OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-1865 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Repbase Update (16-MAY-2005). XX DR [1] (Consensus) XX CC Hydargos-Em1 is a consensus sequence reconstructed from an CC alignment of multiple members of a new family of Tc1/mariner CC elements identified in the E. moshkovskii genome. The consensus CC is 1865-bp long and displays 23-bp TIRs, flanked by a TA putative CC TSD. The element contains a single ORF, which can potentially CC encode a 345-aa protein with a DD35E motif; 51% similar to those CC encoded by the Tc1-like element Tucur from Anopheles albimanus. CC Thus, Hydargos elements appears more closely related to members CC of the Tc1 clade than to other clade of the Tc1/mariner CC superfamily. There are several families of transposons related to CC Hydargos-Em1 in the E. moshkovskii, E. invadens and in the E. CC terripinae genome. XX FH Key Location/Qualifiers FT CDS 304..1339 FT /product="HYDARGOS1_EM_ORF" FT /translation="MNSLSTSPLYKKSNSAKYHRIIVMRKEGGKSFEEISK FT TLGVSKSTISNVLKHFEKEGTLPECKKLGRPQGITGVIVDSIIQATEDDIF FT CSCKSVAQQVGIAKSTVNKIRHQEGYNYVPQTPRPEMKPRHIEKRIKFSKK FT ALDNILRWLESTIISDESRFSLHSDARFVWRKRGEKKEETYFDKPKYDVSV FT MVWGGIGKNFKTKLLRCKPRMKSNDYIEMLEDGGVIKMANEKWGRSKYYFQ FT QDGATPHTCSITRQFMEKKKVKLVRNWPPNSPDLSPIEHLWAIMERKLREI FT NQNFESADDLFETLCKVWDEIPMETINNLIDSTEKRFRLCLEHDGKFIGDY FT LNX" XX SQ Sequence 1865 BP; 698 A; 246 C; 317 G; 602 T; 2 other; taaactgttt acaaaaaatg agtaccrctg aaaatagtgt atttttcaaa aaaataattt 60 tttttttaat ttttttgtgg tgataatttt gaaaaaaatg ggaaaaaatt agttctgagt 120 gaaaaatttt gtcgttgtaa aaagtgaaaa atgggaaatt tttttacttg atatccaaaa 180 aatgcactaa aaccaaactc tgtcattttt tttctttacc aaaaagtttc aaatgccgca 240 atgatatttt tgttttttaa taacaatttc caactggatt tgaaaaacta aaaaaattaa 300 taaatgaatt cattatccac ttcacctcta tacaaaaaat caaattcggc aaagtaccac 360 cgcattattg tgatgagaaa agaaggtgga aagagcttcg aagaaatatc aaaaacactt 420 ggtgtttcta aragcacaat ttcaaatgtg ttaaagcatt ttgaaaaaga aggcacattg 480 cctgaatgca agaagctcgg aaggccacag ggaataactg gagttattgt ggacagcatt 540 attcaagcaa cagaagatga tattttctgt tcatgtaaaa gtgttgctca acaagtgggt 600 attgcgaaat ccactgtgaa taaaatccgc catcaagagg gatacaatta tgttcctcaa 660 actcccaggc ctgaaatgaa accgagacac attgagaaaa gaataaaatt ctcaaaaaag 720 gcattggaca acatcttaag atggttggaa tccaccatca ttagtgatga aagccgcttc 780 tctttgcata gtgatgccag gttcgtttgg agaaaacgag gagagaagaa agaagaaacg 840 tactttgata aaccaaagta tgatgtgtct gtgatggttt ggggtggtat tggcaaaaac 900 ttcaaaacta aattgttgag gtgtaagcca agaatgaaaa gtaatgatta tatcgaaatg 960 ttagaagatg gaggtgtaat caaaatggcc aatgagaaat ggggaagaag taaatattac 1020 tttcaacaag atggagccac cccacatact tgttccatta ctcgacaatt catggaaaag 1080 aagaaagtga aattggttag gaattggcca ccaaactctc cagacctaag tcctattgaa 1140 catttgtggg caattatgga gagaaagtta agagagataa atcaaaactt tgaaagtgcc 1200 gatgatttat ttgaaactct ctgcaaagta tgggatgaaa tcccaatgga aactatcaac 1260 aacctcattg atagtactga aaaaagattc agattatgtc ttgaacatga tggaaagttt 1320 attggtgatt atctgaatta atttattttt taactttttt gtttttgttt ttttgttttt 1380 caatttttcg ttgacaaaaa gtcgccacat aattttcata atttaaacaa tctactaaag 1440 aattcaaaat cagttttctt tcgaatgccg aagacaccac ataaaaaatt tgtttgtttt 1500 tattttattc atttatttgt tcaaagcttt atgaaaacaa ttactaatgt catttagttt 1560 aattaatatt ttaattaaca attgccgata aaaacgtttg ttttaatttt atttcgcaga 1620 acaaagcgtg aaaactaatg aaaagtggga atgggaaaaa ttttaaaaaa aaaccaaaaa 1680 ttcttagtgg aaattttcgt ttttttttat gtaatttact aaaattttta ctaaacttgg 1740 ttgattattg atttttttaa aaaaggagta ctaaaagaaa aaaatgttcc atttttttga 1800 ctctaaaaat cgggaaaaat ctttaacttt tttttacaca gtactgattt aatagaaaca 1860 gttta 1865 // ID Copia-25_SI-LTR repbase; DNA; INV; 207 BP. XX AC AEAQ01023844; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-25_SI_; KW Copia-25_SI-I; Copia-25_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023844; Positions 2 208. XX SQ Sequence 207 BP; 54 A; 65 C; 33 G; 55 T; 0 other; tgcttgtcaa atctcgcgct accgctctca cagtaggaga gcagcactcg cgcgcactga 60 gcgccaccgt acacatttac tcgttcatac actcagacat tcattactga acacactctc 120 ttctatggca tgtaccgctt tgtatcagca cgaataaata gatcagtcta atccaatcgc 180 ttcgctcatt tcacacccac gttatca 207 // ID Crack-24_BF repbase; DNA; INV; 2074 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-24_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-24_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2074 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2074 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 829-829 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..1600 FT /product="Crack-24_BF_2p" FT /translation="IPNTVSQFSPLQFVNKAVSTFAFTPITDSFVVNQLST FT LKVGKATGIDSFDNKLLKLGAHILGPPLTKLFNNSLMTGHFPKSWKQSKIV FT PIHKKGDKTNPGNYRPVSLLSGVSKILERAVHNQLYQYLTQNRLLTQYQSG FT FRKKHSCETALHSVIEEWVSSIDKRELTGVVFCDLSRAFDTLDHNIMVRKL FT QQYGADDMACKWFQSYLTGRIQHTCINSVLSSAINVTCGVPQGSILGPLLF FT IIYVNDLPNCMQLCKIALYADDTIIYFAARNVQAIEQAIQSDSARLSQWFA FT ANKLSLNPTKCKTMLVGSSRANDINTELDITLAGTTLDQVDKFKYLGVIID FT NMLKWHDHCDHLNKKLAQIIGIMKYLKHYLDREALLTIYKALFLPHIQYCS FT TVWDQGGKGSLDKLQKLQTRAGRVIMGYDRHTSTSNVLRSLGWVQIVDVHK FT RSKATLVFKALNGMTPPHLTALFSQLSEIHTHNTRLRSQGGLLLPKAKSEY FT RKKAFSYSGAVLWNSLSSQLRSARTLSQFKRLYDQD*" XX SQ Sequence 2074 BP; 613 A; 448 C; 396 G; 617 T; 0 other; tatcccaaat actgtttccc agttctctcc tttacagttt gttaacaaag ctgttagcac 60 gtttgccttt acccctatca cggactcatt tgttgtgaac cagttatcca cgctcaaagt 120 tggaaaagcc acgggaatag acagttttga caacaagctt ttaaaacttg gagcgcatat 180 acttggtcct cctctgacaa aactgttcaa caactccctg atgactggtc acttcccgaa 240 aagttggaag cagtccaaaa tcgtcccaat ccataagaaa ggagacaaaa caaatcctgg 300 caactatcgc cctgtatcac tcttatctgg ggtatccaaa atacttgaaa gagctgtcca 360 caatcaacta tatcagtatc tgacccaaaa tagactctta acacagtacc agtctgggtt 420 cagaaagaaa cactcttgtg agacagcgct tcactcggtg attgaggagt gggtaagctc 480 tattgacaag agggaattaa ccggagtggt attttgtgac ctgtctcgag ctttcgatac 540 ccttgaccac aacataatgg tacgtaaact ccaacaatac ggtgcagatg acatggcttg 600 caaatggttc cagtcatact tgactgggag gatacaacat acatgcatta actcagttct 660 atcctcagct atcaatgtca cctgcggagt ccctcaaggg tcaatactag gcccacttct 720 gtttatcata tatgtgaatg acttaccaaa ttgcatgcaa ctctgtaaga ttgcactgta 780 tgccgacgat acgataattt acttcgcggc acgcaatgta caggcaattg agcaggccat 840 ccagagcgac tcagccagac tttcccagtg gtttgcggct aacaaattgt cgttaaaccc 900 cacgaaatgt aaaacgatgt tagtgggttc gtcaagagca aacgacatta acactgaatt 960 agatattact ttagctggca ctactcttga ccaagttgat aaattcaagt atcttggagt 1020 cattattgac aacatgttaa agtggcatga tcactgtgat catcttaaca agaagctagc 1080 tcaaataatc ggaataatga agtaccttaa acactacctg gatagagagg cgctgctcac 1140 aatttacaaa gccctatttc tcccacacat acagtattgc agcactgtct gggatcaagg 1200 gggtaaagga agtctggata aactacagaa actccaaact agagctggtc gcgtgatcat 1260 ggggtacgat cgccatactt ctacatcaaa cgtcttacgt agtcttggat gggtccaaat 1320 agttgatgta cataaaagaa gtaaggcaac gctcgttttc aaggcactta acggcatgac 1380 ccctccccat cttactgcat tgttctccca actctcagaa atacacacac ataatacccg 1440 tcttagatca caaggtggac tcctgctgcc taaagctaaa tctgaatata gaaagaaagc 1500 tttttcatac tcaggggcgg tgctctggaa ttccctgtcg tcccaattaa gatctgcgag 1560 gacactgtca cagtttaaac gattgtatga ccaggactaa aggattctga ttgtatatat 1620 tttaacttta gttcctgtca tatcacagtg gatccttttg tttgatttct gtattatgtt 1680 ttgatgctaa gactgacatt tatgttctct tctgactgtt atttttcatt tgcaaatcaa 1740 ctttgaaaca aacactgccc gaccagcgac acattgatat gcaaattcca cttttcgtta 1800 ctttgatttt aagttgatcc actgcctgaa tgtgttatta ttgttattat tattaccatt 1860 attatattaa gttatcctga gttatttgta ttgtaaatag ttgacacctg acgtttagtt 1920 atttaagttt gacctttgat ttcaattttg attcagttgt tgtgattttg tgttatttta 1980 tgttggggtc tcccggccca ccaggaggta ttgaaaaacg ccagtaggcg atgtattttc 2040 ctggataaat aaataaactg aaactgaaac tgaa 2074 // ID Gypsy-2_Cfl-I repbase; DNA; INV; 4817 BP. XX AC AEAB01029725; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE LTR retrotransposon from the Florida carpenter ant genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_Cfl-I. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-4817 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01029725; Positions 472 5288. XX CC Positions [3299-3784] - Integrase core. CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 93..992 FT /product="Gypsy-2_Cfl-I_1p" FT /translation="MGRHKSRRDSDRGNSKRKERQSRSPSTSHEEKKRREE FT RFERLERIVESLDHRSHVRGSCIHRGDEQIIPVFDPSRDDLVIEKWIEHVD FT DLAMQYDWDDRAIMRLIPGRLRGHARQWYDTRPRLAIMWTEMKETLMQQFR FT KSVPFSKLFKEAALYESTPGQALGDYCFQKLNKMRKLDINIPDKYLIDAVI FT GGITDENVARTVRSAQHSDANALYVYMTSLGNLSAKGEKNKAASAPIFKND FT RKGQTSNRSTNQSQATNDENAKSADDKTAASDIKLNNRDRNVLIAVNRATS FT RGNAVCRA" FT CDS 983..3214 FT /product="Gypsy-2_Cfl-I_2p" FT /translation="MPRVECEQCRRLSHNADKCPFKKDVNAITEAKYLPNL FT YERQIIVNGHKIKGLLDTGSSCSLLRESIVEKYKLPTAITPDNVLRGFAGQ FT VTTSNQSAPCDIRVMNATARVDAVVVPDSHLVYDIIVGRDFLEQEHIVTIK FT RGHKLILKQLPMINAKCENIVDVNFAELRTHDTVKINVESWEAGAQCTELI FT REFRDCVAFSIKELGKTEATSLSIRCTVDIPVVYRPYRLAEAEKQIVRGII FT QELLANSIIRESKSPYASPIILVKKKSGEHRMCIDFRKLNLVTVKDKYPMP FT LIEEQIDKLGGNRYFTGLDLASGYYQVPVAADSIEKTAFVTPEGHYEFLRM FT PFGLTNAPAVFQRLMDNVLGDLKNSVAFPYLDDIIIPSKTIAEGMMRLRQV FT LKALRKHHLTLKLEKCSFFAQSIEYLGREISESGVQPGRCKIEAVRDMQAP FT RSVKQVRQFLGLAGYFRKFVENFATIVEPLTKLTKKDVPWEWKIEQERAFN FT IIKNKLTTRPVLAIFNPEVPAEVHTDASAIGVGAVLLQPVEGKMVAVAYYS FT RQTTADQRCYHSYELETMAVVFALRYFRVYLLGTKFKVVTDCNALRTTFAK FT RDLLPRIGRWWLEVQEYTFEIEYRAGTRMAHADALSRNPSPILLEVLQIDI FT TEGDWVLAAQLQDEQLLRIRKILTDKRKTSETKHYFEEYLIKDDKIYRRLD FT EKTQTWVVPRDARLQICRLCHDDAGHLGAEKTLERIKRTIGLQE" XX SQ Sequence 4817 BP; 1473 A; 1007 C; 1246 G; 1091 T; 0 other; tatcagaagt gggattacaa agaacgagtg cgtgcttagt taatcgatct attaaaatat 60 ataaatatat tttcctgttg taacgcaacg agatgggccg tcataagtcg cgccgcgaca 120 gtgatcgcgg caattcaaag cggaaggaac gccagtcccg gtcaccatcg acgtcgcacg 180 aagaaaagaa gcgacgagag gagcgtttcg aacgactgga acgtatagtg gaaagtctgg 240 atcatcgatc gcatgtgcga ggctcgtgta ttcatagggg ggacgagcaa ataataccgg 300 tgtttgatcc atcaagagac gatctagtaa tcgaaaaatg gatcgaacac gtagacgacc 360 tggccatgca atacgattgg gatgatcggg cgatcatgcg attaataccc ggacggttaa 420 ggggccacgc cagacaatgg tatgatacgc ggccacgatt agccattatg tggacggaga 480 tgaaagagac tctaatgcaa caatttcgta agtccgttcc ttttagcaaa ctgtttaagg 540 aggccgcgtt atacgagagt acccccggtc aagcactcgg agactattgt tttcaaaagt 600 taaacaagat gcgtaagtta gatattaata tacccgataa atatctcatt gacgcggtga 660 taggcggaat tactgacgag aatgttgcta gaacggtacg gtcggcgcaa catagcgacg 720 cgaacgcgtt atacgtttat atgacgtcgt taggcaatct gtccgcgaag ggtgaaaaga 780 ataaagccgc gtctgcacct attttcaaga acgatcgaaa aggccagaca tcgaatcgtt 840 cgacaaatca atcgcaagcc acgaacgatg agaacgctaa gtcagcggac gacaagacgg 900 ccgcgagcga tataaagcta aacaatcgcg accggaatgt tttaattgcg gtaaaccggg 960 ccacatcgcg aggaaatgcc gtatgccgcg cgtagagtgc gaacagtgcc gccggttaag 1020 tcataatgcc gacaaatgtc cgttcaaaaa ggatgtaaat gccataacag aagcaaaata 1080 cttgccgaat ctatatgaaa ggcaaataat tgtaaacggg cataaaatca aaggactgct 1140 tgatacaggg agtagttgta gtttgttgcg agaatctatt gttgaaaagt ataaactgcc 1200 gacagcgata acacccgata acgtattaag aggtttcgcc gggcaagtta caacgagtaa 1260 tcaatcggca ccctgtgata ttcgagtcat gaacgcgacc gcccgagtcg atgccgtcgt 1320 agtaccggat agccatcttg tttacgatat tattgtcggc cgtgatttct tggaacaaga 1380 gcacatagtc accatcaagc ggggacataa attgatactc aaacaattac ctatgattaa 1440 tgccaaatgc gagaatatcg ttgatgtaaa tttcgcggaa ttacgaacgc atgacaccgt 1500 aaaaataaac gttgagagct gggaggcggg ggcacagtgc acggagctaa ttcgagaatt 1560 tcgggattgc gtagcgttct caataaagga actaggtaaa acggaagcta cgtccttgag 1620 catccgttgc acggtcgata taccggtagt ataccgccca tatcgattgg ctgaggccga 1680 aaaacagata gtaagaggga ttattcagga actcctagct aatagtatta ttcgcgaatc 1740 gaagtcacca tacgctagtc ccatcatact cgtgaaaaaa aagagtggcg agcatcgaat 1800 gtgcatagat ttcaggaagc taaatttggt tacggtaaag gataaatatc cgatgccctt 1860 gatcgaggag cagatcgata aattaggggg aaatcgatat ttcaccgggc tagatctggc 1920 ttcaggatac tatcaggtgc cggtagcggc agattctata gaaaagacag catttgtgac 1980 gccggaaggt cactacgagt ttttgcgtat gccatttgga ttgactaatg caccggcagt 2040 ttttcagcga ctgatggata atgtattagg cgacttaaag aattccgtag cttttcctta 2100 cctggatgac attattatac cgtccaagac catagcggaa ggaatgatgc ggctacgaca 2160 agtgttgaag gcactacgca aacatcactt aacactaaag ctggagaaat gctccttctt 2220 cgcacagtcc attgagtatt tgggcagaga gatcagcgaa tctggagtac aacctggtcg 2280 atgcaaaata gaagcggtgc gggacatgca agcacctagg tcagtgaagc aggtacgcca 2340 gttcttaggg ttagctggtt attttaggaa gttcgttgaa aactttgcaa ccatcgttga 2400 gccgctaacg aagctcacaa agaaagacgt accctgggaa tggaaaatcg agcaggaacg 2460 agcgttcaac atcattaaaa acaaactgac gacgcgacca gtgctagcga tattcaaccc 2520 tgaagtcccg gccgaggtac acaccgacgc aagcgcgata ggcgtagggg ctgttctatt 2580 acaaccagtt gaaggtaaaa tggttgcagt agcatattat agtcgacaaa ctacggctga 2640 ccaacgttgt tatcattctt acgagttgga gacgatggct gtagtatttg cgttgaggta 2700 ctttagagtc tatctactag gtacgaaatt taaggttgtg actgactgca acgcattgcg 2760 tacaaccttc gcaaagcgag atttgctgcc acgcatcggt cgatggtggc ttgaagtgca 2820 agaatatacc ttcgagatag aataccgcgc aggaaccagg atggcacacg cggatgcact 2880 tagtcgcaac ccgagtccga ttttattaga agtgctgcag atcgacatca ccgagggcga 2940 ttgggtattg gcagctcagt tacaagacga gcaattgtta cgaattcgga agatcttgac 3000 ggataaaaga aagaccagcg agacgaaaca ttactttgaa gaatatctta tcaaggatga 3060 taagatatac agacggctgg acgaaaagac tcagacttgg gttgtgcctc gagatgcacg 3120 gctgcaaata tgcagattat gtcacgatga cgccggacac ctcggcgcag aaaaaacatt 3180 ggagcgaatc aaaagaacta ttggtttgca ggaatgagac gctttataac gaaatatgta 3240 aacgcttgtt tgagctgtgc atattacaag aacacagctg gcaagcgaca atgtaaacat 3300 tgagaaggtt ccagtacctt tccatacaat acatgtggat catgttggtc ctttcgaaac 3360 gagccggaag cataataagt tcctcttggt agtagtggat gcattcacca aattcacaat 3420 tgtggagccg gtaaagagcc agaagacgtg ttatgtcatt aaaatattaa ctaatatgat 3480 ttatttgttt ggagtcccta acagaattat caacgataga ggaacggctt ttacatctca 3540 gggatttggt acgttctgtc gccattacgg aataaaacat gtactaaacg cagttgcaac 3600 accacgagct aacggccaat gcgaacggta caacaagacg attgtacaag cgctggcgac 3660 cactacggcc ggacgggatc cacgcgattg ggattcggtc gttaagcagg ttcaaagtgc 3720 gctaaacacg acgcacaaca agggcatcaa tacgacgccc atgaaagcac tcattggatg 3780 tgagacacgg agcgctacag aagcgccgct attatcacag atccaagacg ttgtacatca 3840 actggattta gacgagttac gtaaagatat caagatgcat attaataacg agcaacgagc 3900 gcagaaggag cgctatgatc ggatgcgtcg agaggcaacg cgatacgatg aaggtaccct 3960 agttttggta caaattacta gtgacccagc aacgggcagt agtcgaaagt tacatcctaa 4020 gtttaaaggt ccattccgag ttcacaaagt tctcatcaat gatcgatatg aggtccaaga 4080 cctacgagaa ggacataaac gaaaacgaac ggtggtggcc gccgatagaa tgaagctctg 4140 gatcacgacc ccgagcgagt gagcgagtag gatactcgta tttattaagg attgatctgg 4200 ggcggcacga cggtgcctct tcaggtcagt tattacgaac tgattccggc tgacacggtg 4260 gtgtctagcc gagtcggtta ccacggattg acttggaatg acactacggt gtctttctaa 4320 gccagttatt acggactgat ctgaggcggc acgacggtgc ctcttcaggt cagttattac 4380 gaactgattc cggctgacac ggtggtgtct agccgagtcg gttagtgata ctacggtgtc 4440 ttcctaagcc agttattacg gactgatcct ggcaggcaca gtggcgctgg tgaggtcagt 4500 taacgatgac ggaccgacag aaagctatgg caagtaactt gatcgatcag acgaattacc 4560 agccggtaat atgaaaatat aggaaaacat cgcgaacgaa aactccaaga gtctgaggac 4620 agactccaac gcaggaaggc cgaatgtaag aaaattgcta cgggcctgta atgattagcg 4680 cggggcccga atcgatggcg atacataaga cctacgatat cgccccgcca cgatgatacc 4740 gagtggaggg gataggacat gcacgctagt gtcctgcgca cgtggtagac tggaagctct 4800 cgctcaacga ggagtac 4817 // ID Gypsy-1-I_MH repbase; DNA; INV; 7162 BP. XX AC ABLG01001295.1; XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon from Meloidogyne DE hapla. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_MH. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-7162 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Meloidogyne hapla."; RL Repbase Reports 9(2), 461-461 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 373..1389 FT /product="Gypsy-1-I_MH_1p" FT /translation="MPVVTRQKARKDKALEQEEETDPTITNFVNSDMDEND FT ILDGSEMGGNELSADLTPEEEQQLLDGNNINKKKNGEENPKKKLNQATTQT FT SPAPINKNAQAQNRLLPQIPPQSAQNVVYPQQFPNFPVFAQQMPNFQMYPQ FT FPSIPMGSQSTSIMPGAQQFPNLMPVYTPQVMYPQMPFNQLQNFHPQANFN FT APMNQTQNFPQPSPTVAQPPINTRNDQRPTTDFMGFMHSFDRMSLSWALDK FT IQDLKGPEGVDQIKTFFKKFDSATEGISDVVRLKALESKISGRAERAFNTA FT LGNSPFIYSSVRREMLRILETTDTKEICAFDELMSGVKRRENENID*" FT CDS join(1431..2027,2031..2663,2667..7094) FT /product="Gypsy-1-I_MH_2p" FT /translation="MTNNLLDEYSIKHLIRSIGDSNLALNLEIARQDGMSF FT DSFVSLAARAESTQKATRHINFQSKPVVQQFPQRNNNFTPRNEITCFTCGK FT PGHFSRDYYTNRNNQYGGKENMSGATGANKTFLNNKNNVNEASQNTAFWRG FT NSQPRALPPSNNQLKNGCITIQDEIPENHVNSCGVNILEIEEELKKIPEQE FT SEKVEIIKKKNMSGLNVKGLTATGMLDGGAQISLISSNFLLKMLKDKEIEL FT KNIGFSRTKAKFSDVNGQKLECHGIVELEVKRENLNEVKTCFHITDAPFGY FT DLLFGTNSLGSLGFKLMDIPNNSLIDFGNVNRDQNMNTARVIYKTRIQALS FT VKNIEVSIGKEWEGKEIVLHSENQRKESAIKVEPSIGKAESGKNFARIFNS FT SCVPIELEENEVIGNVSFEHCQNAIEFSSSMKVKESWRHTNVKLSAKIDEL FT NKYIKQNTGNLSEKENSALKELIIDFEEIFALDDKELTQTNLIEHQIETAN FT AQPVKTRCRPIPYAYREKVALMIQDYLERGIIRGSNSPWASPIVIVPKKDG FT SLRFCVDFRGVNSVTIKDSYPLPNIDNILLTLGGKKFFSCLDFLSGYWQIK FT MEPQSIEKTAFITEFGLHEFLVLPFGLCNAVATFQRFMNKLFEDVVNKFVF FT IYIDDILIASESWEEHIEHLKIIFKRISEAGLKLKIAKCKFACTEIPFLGH FT ILTREGIKKDAEKTKAVVNCPIPKTRKQLSSLLGFLSYYRKFIHGFSTLAS FT PLFKLLRKDNDFKIGETEKTAIELLKEKLLENAILYFPDFKEAINNPKRRF FT IMLTDASKIGIAAILCQTDKEGNVRPIFFASRQCNKHEAKYCATELEALAI FT RFGSKKFFQFIAQIPTRVLTDHRALVSMFKSKKETGNTRVDKWLLELNSRF FT TLQVEYNPGKQNIVADFLSRAPTWDESKIKEIDPPEEEEIAFVGQLTQIDI FT EEKEIEKREEWVEKTKESEFSLLYEFLEKRNWPDETKERQKVINWCSDFTI FT IEGLLYKYENDGKIRLFVPEPFRNKLLSESHSGKCAGHLSGRKIFKQLAVK FT YYWPNMQSECINSALKCRICAHSRNPKSNEPPMKIVRTSEPFELICLDILD FT VGPSNANSRYIFVAVDHFSKYVIAVPIPDKSAESIAKALVENVILIYGAPK FT RIHSDRGKEFLNSIIKEITNTLKIEQTVTAGYDPNANGLVERINQIIIGML FT KRSTASNWTWPERLPYLIFAYNSTPSETTGFSPYNLILGKPPNLPFEEMTL FT SVNPLYTIDDETYIQLFRENLLQLIDKAKENSEKAQEKSKKWYDSQPKVTA FT NKFKIGERVMVIFPGKNRNAPHKKLLWNHFGPYKILEMNESSATLTPVDKS FT TEKIKVPLERLIRVPPGVPDVATLPRGKNPFKNILMSILLAGIEQIENKNK FT QIELGIKEKNLGQEKSQKVFLLNASFSDENMELGTEELAWTKCSSNENLKV FT GDLDPVLNKETGIGARSINTAMKTLLATFLLRTNEIKATSAARNILNKILD FT TIPGTELIKYVEERGDTVKSGLFPNEQQIESSLSIWFEYCQEARKTLEATK FT FEKIRMTIPEVVERMNEAEKRGWQNVFKIANKKLEKIRAITIHKAKQLIHH FT QRVFVGDSSANQLSKIFNPSYFLGKEGGSLNEVIRKFNEAVLSSEVQVVVV FT MVGRDALLGGETVDQVIEEARRLKQLLERFNQIHVIWLPPPYVRQKHTEYE FT ELLPRLIYLFNEEESDKFEFITTTTTGRSFLELTRFGNSFNGLMVEADGKL FT KQNGIEAMKAWLVTQVPGFPGDHELGVRNVRSQVVVPVNRRREERTFRGRD FT YSRGRGLGDRTRSSRGQMERNFRGRGLGDRTPRRSVFERIYRSPYRYDRSR FT SRRDVSRSRERT*" XX SQ Sequence 7162 BP; 2641 A; 1312 C; 1396 G; 1813 T; 0 other; cgtggcgctc ctgaatttgt acgttctcaa tttggtgaat tgatttgttt aattaattgg 60 aaaattaaag gctctaggcg agcaagactt taaaaaacaa attttaagct ctaggcgagc 120 aaaacttcaa aagaaaaatt ttttttttga aatccctagg cgggtataaa atttttattt 180 tcttcagatt ggactcttct ttaaataaac caaattcgtt atatttgaaa aaaaaaaatt 240 aaaagaaaaa gacaaagaaa tttttcaaat ttagaagcga ctcgtgcaat cgatttaatt 300 gctgaaaaaa aaaagtgact catcaaacag ctctaggtga gcataaaaat tttaatcaat 360 aaagagaaaa gtatgcctgt cgtaacacga caaaaggccc gaaaagataa ggctttagag 420 caagaggaag aaacggaccc gacaattact aattttgtaa attcagacat ggacgaaaat 480 gacattctgg atggaagtga gatgggagga aatgaactat ctgctgattt aacacctgaa 540 gaagaacagc aacttttaga tggtaataat attaataaaa agaaaaacgg ggaggagaat 600 cccaagaaaa agctaaatca ggccacgaca caaacctcgc cagcaccgat taacaaaaat 660 gcgcaagcgc aaaatcgctt gttgccacaa attccgcctc agtctgcaca gaatgtggtt 720 tatccacaac aatttccaaa ctttcctgtg tttgcacagc aaatgccgaa ttttcaaatg 780 tatccacaat ttccgagcat tcctatgggt tcacaaagca ctagcattat gccaggagca 840 cagcaatttc ctaatttaat gcctgtgtac actccacaag tgatgtaccc ccaaatgccg 900 tttaatcagc tacaaaattt tcatccacaa gcgaatttta atgcaccaat gaatcagaca 960 caaaattttc cacagccttc accaacagtg gcgcagccac cgatcaatac tagaaatgac 1020 caaagaccca ctactgactt tatggggttt atgcacagtt ttgataggat gtctctttcg 1080 tgggctttag ataaaataca agatttgaaa ggaccagaag gtgtagatca gataaaaact 1140 tttttcaaaa aatttgattc ggcaaccgaa gggatatctg atgtagtgcg attaaaagct 1200 ttagagtcaa aaattagtgg aagagcagaa cgcgccttta acactgcatt agggaattct 1260 ccgtttattt attcatcggt ccgacgtgaa atgcttcgta ttcttgaaac gactgacaca 1320 aaagaaattt gtgcattcga cgaacttatg tctggtgtta agaggcgtga aaatgaaaat 1380 attgattagc taacagaatc gcgtctttag taaaaagagc ttatccagga atgacaaata 1440 atttattaga tgagtattcg ataaagcatt tgataagaag catcggcgat tcaaatcttg 1500 cactcaacct cgaaatagct cgtcaggacg gaatgagttt cgacagtttt gtttctctag 1560 cagccagagc tgaatcaact cagaaagcaa ctcgtcacat aaattttcaa tctaaacctg 1620 ttgttcaaca atttccacag cgaaataata attttacccc acgaaacgaa ataacctgct 1680 ttacctgcgg aaaaccaggt cacttttcac gcgattatta cacaaatcga aacaatcaat 1740 acggaggaaa agaaaacatg tcaggggcta cgggagcaaa caaaactttt ctgaataata 1800 aaaataatgt aaacgaagca agccaaaaca cagcattttg gcgaggaaac tcccaaccca 1860 gagcacttcc accatcaaat aatcaactta aaaacgggtg cattaccatc caagacgaaa 1920 ttccagaaaa tcacgtgaac agctgcggag taaacatttt agaaatcgag gaagagctta 1980 agaaaatccc agaacaagaa tcagaaaaag tggaaataat aaagaagtag aaaaacatgt 2040 cagggcttaa cgttaaaggt cttacagcaa ccggaatgct tgatggaggt gcacaaatct 2100 ctttaataag ctcaaatttt cttctaaaaa tgcttaaaga taaggagatt gaattaaaga 2160 acatcgggtt ttcgagaaca aaagcaaagt tttcggacgt aaacgggcaa aagctagaat 2220 gtcacggaat tgttgaatta gaagtcaagc gtgaaaatct taatgaagta aaaacctgct 2280 ttcacatcac ggatgctcct ttcggttatg atttgctgtt cggcacaaac tcactaggct 2340 ctctaggatt taaattaatg gacatcccta ataattcttt aattgacttt ggaaatgtaa 2400 atagagacca aaacatgaac acagctagag ttatctacaa gacgcgaatc caagctctca 2460 gcgttaaaaa catagaggtg tcaattggaa aagagtggga aggtaaagaa atagtgctgc 2520 attcagagaa tcaaaggaaa gaaagcgcta ttaaagttga accatccata ggaaaggccg 2580 agagtgggaa aaattttgcg cgtatattta atagctcttg tgttccgatt gagttagaag 2640 aaaatgaagt aataggaaat gtgtagagtt ttgagcattg tcaaaatgca atagaattta 2700 gttcgtccat gaaagtaaaa gaaagttggc gacacacgaa tgttaaattg agtgcaaaaa 2760 ttgatgaact aaataaatac atcaaacaaa acacaggaaa tttgtcggaa aaagaaaatt 2820 cggcattaaa ggagctaatt attgatttcg aggaaatttt cgcgctcgac gataaagaac 2880 tcactcaaac taacctgatc gaacatcaga tagaaacagc caatgcacaa cctgtaaaaa 2940 cacgctgcag gccaattcca tacgcttatc gcgaaaaagt tgctttaatg atacaagact 3000 atttagagag aggaataata agaggatcaa attctccatg ggccagccct attgttatcg 3060 ttcctaaaaa ggatggatca ttaagatttt gtgtagattt tcgaggagtc aattcagtaa 3120 cgataaagga ttcataccca cttccaaaca tcgacaatat tctgctaacc ctaggcggga 3180 aaaagttttt cagttgtcta gacttccttt caggatactg gcaaataaaa atggaacctc 3240 agagtatcga aaaaacagca tttataacag agtttggcct acatgaattt ttagtccttc 3300 catttgggtt atgtaacgct gttgcaacat ttcaacgatt tatgaataaa ttgttcgaag 3360 atgttgtcaa taaattcgtc tttatttata tagatgacat cttgatagca tcagaaagtt 3420 gggaagaaca catcgaacat ttaaaaataa tttttaaaag aataagtgaa gctgggctaa 3480 aactaaaaat cgcgaaatgt aaattcgcct gcacagaaat tccattctta ggacacattc 3540 tgactcgcga aggcattaaa aaggacgccg aaaaaaccaa agcagttgtg aactgtccca 3600 ttcccaaaac aaggaagcaa ctaagttcac tgctcggatt tttgtcttat tataggaagt 3660 ttatccatgg atttagcact ttagcctctc ctttattcaa acttctacga aaggataatg 3720 acttcaaaat aggagaaacc gaaaaaacgg caattgaact tctgaaagaa aaacttttgg 3780 aaaacgccat cctttatttc ccggatttta aagaagccat taacaaccct aaaagaagat 3840 ttatcatgct aacagatgcg agcaaaatcg gaattgctgc aatattatgt cagacggata 3900 aagagggaaa cgtccgtcct attttctttg cctctcgaca atgtaataag cacgaagcga 3960 aatactgtgc tactgaactc gaggcactag caattcgttt tgggtccaaa aagttcttcc 4020 aatttattgc acaaatacca actagagttt taaccgacca cagagcactt gtttcaatgt 4080 tcaaaagcaa aaaggaaaca ggaaacacca gagttgacaa atggctttta gaattaaact 4140 ccagatttac ccttcaagtc gaatataatc cggggaaaca aaatatagtc gcggactttc 4200 tttcgagagc tccaacttgg gacgagtcaa aaataaagga aatcgatcca cctgaagagg 4260 aagagatagc ttttgtcgga caactcacac aaattgatat tgaagagaaa gaaatcgaaa 4320 aaagagaaga atgggtagaa aagacaaaag aaagcgaatt tagtcttctt tatgaatttc 4380 ttgaaaaaag gaattggcca gatgaaacta aggagagaca gaaagttatt aattggtgtt 4440 cagacttcac tattatcgaa ggacttctct acaaatatga aaatgacgga aaaataagac 4500 ttttcgtacc agaacccttc agaaataaat tattaagcga atcccattca ggaaaatgtg 4560 caggtcatct aagcggtaga aagatcttca aacagttagc agttaaatat tactggccta 4620 atatgcaatc agaatgtatt aactctgctt taaagtgtag aatttgcgcc cattctagaa 4680 acccaaaatc taacgaacct cctatgaaaa tagtgcgaac gagcgaaccg ttcgaactca 4740 tctgtctcga tattctcgac gtcggcccaa gcaatgcaaa ctccagatac atctttgtag 4800 ctgtcgatca tttttctaaa tacgtaatag cggtacccat tcccgacaaa tcagcagaat 4860 ccatcgcgaa agcactcgtt gagaatgtaa ttttaattta cggggcgcca aaaagaatac 4920 attcagacag agggaaggaa ttcttgaatt caattataaa ggaaataaca aacaccctca 4980 aaatcgagca aactgtaact gcggggtatg accctaacgc gaatggattg gttgaaagaa 5040 ttaatcaaat aatcatcgga atgctaaaac ggtcaacagc ttcaaattgg acttggccag 5100 aaagacttcc ttatttaatt tttgcttaca attcaacccc aagcgaaaca accggctttt 5160 cgccttacaa tttgattcta ggaaagccac caaacttacc ttttgaagaa atgacacttt 5220 cagtaaatcc actttacact attgacgatg aaacttatat ccagcttttt cgggaaaatt 5280 tactccaact aatcgataaa gcaaaagaaa attctgagaa agcgcaagaa aagagtaaaa 5340 agtggtacga ttcacagcca aaagtgactg ctaataaatt taaaattggc gaaagagtca 5400 tggtaatttt cccaggaaag aatagaaacg caccacacaa aaagctatta tggaatcatt 5460 tcggcccgta taaaattctg gaaatgaacg aatcctcagc aactttaacc ccggttgaca 5520 aaagcacaga aaagattaaa gtaccgctcg agcgtttaat tcgtgttcca cccggagtac 5580 cagacgttgc aacactacca cgcggcaaaa atccttttaa aaatatttta atgtccatac 5640 tgctggcggg gattgaacaa attgaaaata aaaataaaca aattgagctg ggaattaaag 5700 agaaaaatct aggacaagaa aaatctcaaa aagttttttt attaaatgct tctttttcag 5760 acgaaaacat ggaacttggt actgaagaac tcgcatggac aaagtgtagc tcaaatgaaa 5820 atttaaaagt cggagatctc gaccccgtac ttaacaaaga aactggaatt ggcgcccgca 5880 gcatcaatac ggcaatgaaa acccttcttg ctactttcct tctccgtaca aatgaaatta 5940 aagcaacgtc agcagcacgc aacattctaa acaaaatttt agacacaatt ccagggacgg 6000 aattaatcaa atacgtggag gagagaggtg acacagttaa aagtgggtta ttcccgaacg 6060 agcagcagat agaaagctct ctttcaattt ggtttgaata ctgtcaagaa gcaagaaaaa 6120 ctttggaagc aacaaaattt gaaaaaatta gaatgacgat tccggaagta gtagaaagaa 6180 tgaatgaagc cgaaaaaaga ggatggcaaa atgtttttaa aattgctaac aagaagttag 6240 aaaaaattag agcaatcacc attcacaaag ccaaacaact tattcatcat caaagagtgt 6300 ttgttggtga cagcagcgcc aatcaacttt caaaaatatt taatccatct tattttttgg 6360 gaaaagaagg aggatcttta aatgaagtta ttcgaaaatt taatgaagct gttctttctt 6420 ctgaagtgca agttgtagtt gtaatggtag gaagagacgc tcttctaggc ggagagacag 6480 ttgatcaagt aatcgaagag gctaggcgcc tgaaacaact tcttgaaaga tttaaccaaa 6540 ttcacgtgat ctggctacct ccaccttacg tccggcaaaa acacactgaa tatgaagaat 6600 tactgccaag attaatctac ctttttaatg aagaagagtc agataaattc gagtttatta 6660 caacaacgac aacaggaaga agctttttgg aattaactcg atttggaaat tcttttaatg 6720 gattaatggt tgaagctgat ggaaaattaa aacagaatgg gattgaagca atgaaagctt 6780 ggttggtaac tcaagtacca ggatttccag gagatcatga actaggcgtt cgaaacgtga 6840 gatctcaagt tgttgtgccg gttaatcgaa gaagagaaga aaggacattc agaggaagag 6900 actattcgag agggcgaggt ctaggcgacc gaacccgatc aagtcgtggc caaatggaaa 6960 gaaactttcg gggaagaggt ctaggcgacc gaaccccacg tcgttcagtc tttgagagga 7020 tttaccgttc gccgtaccgt tacgaccgca gcagaagtcg ccgtgatgtg agccgatcca 7080 gagaacgtac atgaaccaac caccttttga cagccgaatg gcgaaaagaa gtcgagggac 7140 tcgcctcttc agagggcggg ga 7162 // ID SMARN3 repbase; DNA; INV; 360 BP. XX AC . XX DT 21-FEB-2008 (Rel. 13.02, Created) DT 28-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Consensus sequence of non-autonomous Mariner-type family of DE repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW SMARN3. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-360 RA Jurka J.; RT "Non-autonomous Mariner-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 8(2), 160-160 (2008). XX DR [1] (Consensus) XX CC The closest autonomous Mariner is SMAR16. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 360 BP; 111 A; 57 C; 50 G; 142 T; 0 other; cagtcaattc tttttttatg cgtttttgaa gttatgcggt tttccccctt cggtccagaa 60 aaggactaat ctttgtcaaa cctaaaccaa ccagatattt ttgaatagag aatcattcaa 120 atcacaaggt tccgtttcaa atttttaaaa atattaaaag caaaaaaagt tattaaactt 180 tatatttatt attacgtaat tttatgtttt ttcaatttct ccgccaactc ttaatgattt 240 tttggtttta tgtgcttttc gaaaattatg gaacgtatct ctatttatac aacgtaagtc 300 gtgtcttttt tatgcggttt tctgaggaac gtatctaccg cataaaaaaa gaattgactg 360 // ID Copia-98_AA-LTR repbase; DNA; INV; 295 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-98_AA_; KW Ty1_copia_Ele66; Copia-98_AA-I; Copia-98_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-295 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 295 BP; 70 A; 73 C; 64 G; 86 T; 2 other; tgttgagtgt catgtacctg gcgatatttc ctccatcgac cgttctgtcc ttccccccat 60 gcgaagtgtg acagcgacgg tgtgacagag atagaagagc gagtaaagag aagaaacaat 120 ttttcatttc ccatttccac atgtaacctt actgaaataa agacgcgktt atacaatcag 180 tkttactttt cgatcggtgc gcgtattccg aagtttcttt ccgccggtta tcatcggtta 240 agtgtgtcgg aattccctgc agaacctgtc cactcgtacc gggttctgtc ccaca 295 // ID BEL-25_AA-I repbase; DNA; INV; 7241 BP. XX AC supercont1.26; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-25_AA_; KW BEL-25_AA-LTR; BEL-25_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7241 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.26; Positions 817250 824490. XX CC 'GTGGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 902..6010 FT /product="BEL-25_AA-I_1p" FT /translation="MLEATCKEFYTVREQIELILETTSAKTTDDEDAVLEL FT NEKNELVLSEFEDRYCVAKASLLSMQPSPPLPSVSTAAGVGPLGHRSDSSF FT PKVKLPEIRLPSFNGKIKEWITFRDSFRSLFHDNMQLTDMDRFSYLKSTLS FT GEALQEIDSIELSSANYSVAWKALETRYENKKLIVKAHLDSLFAVDSMKRE FT SYEGLSNLIGGFEKNLQMLQKIGEQTDAWSTILAYMVYSRLDPATLRQWET FT HHNSKNVPTYQNIITFLRSYCSVLQSIAPTKSPTCPDQHSSRPSICHTVVK FT TSNKCPFCNGSWHSAFLCGNFQRLLVPERNEAVFKAKLCRNCLNPGHFARN FT CEKGACHHCQQKHHSLLHAGPSRSSVPQSQSSPPEQNQQQPQTPNQQTHNA FT SSATPSTSQGQSTSSQATNTSTATSQTHVALPITPTQNILLSTALVKVRDR FT NGNSMLARALLDSCSQHCLMTKEFSSRLKFNETPSYLSVQGIGTSSSTSTS FT LISAVVSPRLDRISSFVQEMQFYVLPRLTVSLPTAACDPSSWNTLDSSMLA FT DPRFYEPGPVDVIIGAEFYMDLLKNEQQKATEAGPTLQNTELGWIVSGKVP FT DSTRSIPQSLTFVCSTSELQDQLAKFWELETCRSPSTYSLEESACEKHFNR FT TTVQDASGRYIVTLPKREAMISRLGHSESTAIKRLIGMERRFAVNPALKEM FT YVEFIREYLELGHMREVTREAESGVSRYYLPHHAVLKPDSTTTRLRVVFDA FT SCQTSTGVSLNDALMVGPVIQDDLTSISLRFRTHRIAINADVAKMYRMIGV FT QEQDYPLQSIKWRYETSEPIRTYELTTVTYGTSSAPYLATRCLQRLADDGK FT ATHPIASKVIRKDFYVDDMLTGADTVEEGRRLVRDVIELSKSAGFTLRKWN FT SNSKELLEVIPDELRDVRSVLEIDSSDTAVKTLGLVWEPDHDVFRFTFPSF FT NWPATITKRVIVSDVSRLFDPLGLVGPVIIQAKIFIQELWKQGCGWDEPLD FT IKLQDKWQEYRRNLQGMESLTVPRWVGTGTQIVSTEIHGFCDASNKAYGAC FT IYIRTISETGEVNVRLLTAKSRVAPLDSLKKGNKRLSTPRLELSSALLLAH FT LFEKVTNSLEIEANCFFWTDSTIVKCWLASSPSRWKQFVANRVSEIQHITK FT HDKWNHVMSLDNPADIISRGMNPLQLPYESLWFNGPVWLMSDEQHWPSTPD FT ISEDDFDPTELEVKSVSAVLPANSPSEIFCLKSTYIDLLRLTAWVRRFRHN FT LNPANRYDRRYGQLTTIELEEALNTLVRLSQHESFPQEIADLSSGKEVQES FT SKISNLHPELTSGILCVGGRLRHASIATNRKHPYILDHRHPFSYIVVAHYH FT QKLLHGGQQLMISCIRERFWPTSIRNIVRKRVGVDYCGPFNVAYPYRRNRP FT VKIFVAVYVCLVTKAIHLELASDLSAQGFIATLKRFASRRGKPEILMCDNG FT RNFVGARRKLEELRRLFNNQQFQQSVISSAAEDQIQFRFIPARSPNFGGLW FT ESAVKSFKLLFKRTVGSHILLYDEMQTILTQVEAVLNSRPLTPLSSDPDDY FT EALTPGHFLIQRPLTAIPEPDLDGIPENRLAAYQKSQRFTQQLWKKWSELY FT LSDLHNRTKWTVRRKNIVEGAMVVLKEENQPPLTWQLGRVTEVHAGRDGNV FT RVVTVKTKDGSYRRAISKICVLPIRDNIDHHNEE" XX SQ Sequence 7241 BP; 2028 A; 1710 C; 1687 G; 1816 T; 0 other; ttttggtcct tcgaaccgga tcagtttcgg tatcggacac ttttgggcaa ttttagactg 60 ttgttgattg gtgctcaaca tctaacgaac tatcagtgcg cgaaagccgt cggcaaattg 120 attgcacggc tcggaagttt tgttggtgaa aattcctccg cgagctacga attcctccga 180 aaggcgtatc cggcgtgtaa agggtctggc aagacggcaa gttaaggctt ccagcgccat 240 cgtcaattgg attggcgcga tttttccttt gctggccatt ataaagccaa tacgtcgcca 300 ttgtgaacat tatcacggat aaggaccatc gagttagcgg acattttgag cgaccagcca 360 ttggacgttt cgaagcatac gacatcaggg aatcagcctt ggattggact tcttggagct 420 tgaaagggaa gtagtacatg ctgtagtgtg accaattgct tgtggcctaa tatatttatt 480 gaaagtgcat gcgagtgttg tcctattttt atgctgctta gccctttgag tcctccgcta 540 gcttaagttg gtcttggaat tttggtagat ttgttcggct tggtttttct ttggattgct 600 tctcccctcg ctgctgtcac tccgtttagc agtcaatctg tctcgtgagg tcgcgacctg 660 tggaacgtcg agtcgatcgg gagtgatagt gcaaagtgct gcagtgtagt gattgtatta 720 ttgattaatt acgtcaaccg tagttagtcg cagagcagtg cacagtggtt caaaagtgtc 780 ggacctgaga gaactaacga aacgtgaacg gcgtttattg ggaacgctgt taggtgtaga 840 gaagttcgtg agcaattatg agcatgcccg tgacgaaaag caaatcggtg tccgtttgca 900 aatgctagag gcaacatgca aagagtttta tacggtacgt gaacaaattg agctgatatt 960 agaaacaact agtgccaaaa cgactgatga cgaagacgca gtactagaat tgaacgagaa 1020 gaatgaactt gtgctgagcg aattcgaaga tcgatattgt gtagcaaagg caagtttgtt 1080 gtcaatgcaa ccctcacccc cacttccatc cgtttcgact gctgccggag ttggtccact 1140 tggccaccgc tccgattcct ccttcccgaa ggtgaaacta cctgaaattc gcctgccgtc 1200 gttcaatggg aagattaaag agtggatcac ctttcgtgac agttttcgca gcctgtttca 1260 cgataatatg cagctaacag atatggaccg attcagctat ttgaagtcta cgctttcagg 1320 agaggccttg caagaaatcg attcgataga gctttcgtcg gcaaactaca gcgtagcatg 1380 gaaggcttta gaaacccgct acgaaaacaa gaagttaatc gtaaaagcac acctggattc 1440 cctctttgcg gtggacagca tgaagcgaga aagttacgaa gggctgagta atttaatcgg 1500 gggattcgag aaaaatcttc aaatgcttca aaagatcggt gagcaaacag acgcctggag 1560 cacaattttg gcgtacatgg tgtactctcg actagatcct gccactttgc ggcagtggga 1620 aacgcatcac aattccaaaa atgttccaac gtatcagaat atcatcacct ttctacgaag 1680 ttattgctca gttcttcagt caattgctcc aacgaagtct cctacctgtc cagaccagca 1740 ttcgtcaagg ccgtccatct gtcacaccgt cgtcaaaaca tccaataaat gtcccttctg 1800 caatggatcg tggcattctg cgtttttgtg cggcaatttt caacgtctgc tggttccaga 1860 gcgtaacgag gctgtgttca aggctaaatt atgccggaat tgcctaaacc ctggtcactt 1920 tgcacgcaat tgtgagaaag gtgcctgtca tcactgtcag caaaagcatc actccctgct 1980 acatgctgga ccatcaagat cctccgttcc acagtcgcag tcaagtcctc ctgaacaaaa 2040 ccaacaacag ccacaaactc cgaaccaaca gacccacaat gctagttcag ctacaccaag 2100 cacctcgcaa ggacaaagca cttcgtcaca agccacaaac acaagcacag ccacaagtca 2160 gacacacgta gcactcccaa tcacacccac acagaacatt ctcctttcaa cggccttagt 2220 caaggtcagg gatcgcaacg gcaattccat gcttgcgcga gcgctactgg attcgtgctc 2280 tcagcattgt ctgatgacaa aagagttttc cagtagactt aagttcaacg aaacaccgtc 2340 gtacctgtcc gtccaaggaa ttggaacttc ttcaagcact tcaacaagcc ttatcagtgc 2400 agtagtcagt ccgaggttgg accgcatttc atcatttgtt caagagatgc agttctacgt 2460 cctcccacgt ttgaccgtgt cattaccgac agctgcctgc gatcccagca gttggaatac 2520 cctggattca tcgatgctag ctgatccacg cttctacgaa ccaggtccag tggatgtcat 2580 catcggtgca gagttctaca tggatctctt aaaaaacgag cagcaaaagg ctactgaggc 2640 cggccccacg ttacagaata ctgaactggg atggatcgtc tccggaaagg ttccggacag 2700 cactcgtagt ataccacagt cacttacatt tgtctgttca acctccgaac ttcaagatca 2760 gcttgcaaaa ttttgggagc tggaaacctg tcgaagtccg agtacgtatt cgcttgaaga 2820 atcagcgtgt gaaaaacatt tcaataggac tacagtacaa gacgcgagtg gcagatacat 2880 cgtgacgcta ccgaaaaggg aagcgatgat cagcaggcta ggtcattcag aatcaacagc 2940 cataaagcgt ttaataggga tggaacgacg cttcgctgta aatcctgcat tgaaagagat 3000 gtatgttgaa ttcattcgtg aatatcttga gcttggacac atgagggaag ttactaggga 3060 agctgaatca ggagtcagtc ggtattatct gccacatcac gctgttctaa aaccggacag 3120 cacaaccacg aggctacgag tagttttcga tgcttcttgt caaacatcga caggagtttc 3180 gctaaatgat gccctcatgg tgggtccggt catacaggat gatctgacaa gcatatccct 3240 tagattccgt acacatcgca tcgccattaa tgccgacgtc gccaagatgt atcgaatgat 3300 tggggtacaa gaacaagatt atccattgca gagcatcaag tggagatacg aaacatctga 3360 accaatccgt acttacgagt taacgaccgt aacgtacggt acctcgtccg cgccatattt 3420 ggccacaagg tgtctgcaac gccttgctga tgatggaaaa gccacccatc caattgcatc 3480 caaggtaatc agaaaagact tctacgtcga tgacatgctc accggcgctg ataccgttga 3540 ggaaggtcga agattggtta gagatgtcat cgagttgtca aagtcagcgg gtttcacact 3600 gagaaagtgg aattccaaca gcaaggagct ccttgaagtc attccagatg aattaagaga 3660 tgtgcgatct gtgctagaga tagactcctc tgataccgcg gtcaaaactc ttgggctagt 3720 gtgggagcct gaccatgatg tgttccgatt cacatttcca tccttcaatt ggccggcgac 3780 aattacaaaa cgcgtgatag tctccgatgt gtcacgacta ttcgacccgc ttggcctagt 3840 tggccctgtc atcattcagg ccaaaatctt cattcaggaa ctatggaagc aaggctgcgg 3900 ttgggatgag cccttggata taaaattgca ggacaaatgg caagagtatc gtcgcaacct 3960 acagggtatg gaaagcctta ctgttccccg gtgggtagga acaggtactc aaatcgtctc 4020 caccgagata catggcttct gcgatgcgtc taacaaggct tatggagcct gtatttacat 4080 ccgcacaatc tccgaaaccg gggaggtgaa tgtgaggctg ttaacagcaa agtcacgagt 4140 tgcacccctc gacagtctga aaaaagggaa caagcggctg tctactccac gacttgaact 4200 ttcgtcagca ttgttactag cacatctttt tgagaaggtt acgaacagtt tggaaattga 4260 agcgaattgt ttcttctgga cggattctac aatagtaaag tgttggttag cctcgtcacc 4320 gtcaagatgg aagcagtttg tagccaatcg agtttcggag atccagcata tcacgaagca 4380 tgacaaatgg aatcacgtga tgagtctaga taatccagca gatataattt cccgcggaat 4440 gaaccctttg caactgccgt atgaatcgct ctggttcaat ggcccggttt ggctaatgtc 4500 ggatgaacag cactggcctt caactccaga tatttccgag gatgattttg atcctacgga 4560 actcgaagtc aagagcgtct cggcagtgct tccagccaat tccccaagcg aaattttctg 4620 tttaaaatcg acgtacatcg atcttctacg actaacggcc tgggtacgac ggtttcgtca 4680 caatttgaat ccagcgaatc gatacgatcg tcgttatggt cagctgacta caatcgagtt 4740 agaagaagcg cttaatacgt tggtacgtct ctcccaacac gaaagctttc cgcaggagat 4800 agcggatcta tcgagtggaa aagaggtcca agagtcttca aagatctcga atctccatcc 4860 ggaactcacg tcaggaatac tctgcgttgg tggccgactg cgtcacgctt ccatagcaac 4920 aaacagaaaa catccctaca ttcttgatca tcgtcatcca ttctcgtaca tcgtagtcgc 4980 acactatcat caaaagttgc tgcatggcgg acagcaactc atgatatcat gcatccgaga 5040 acgcttctgg ccaaccagta tccggaatat tgtacggaag agagtaggcg ttgactactg 5100 tggtccgttc aacgtcgcgt atccatatcg tagaaatcgt ccagtaaaaa tcttcgtcgc 5160 cgtttacgtc tgtcttgtga ctaaggcgat ccatttggaa cttgcgtcag atctgtccgc 5220 tcaaggattc attgctaccc tgaaacgatt tgcgtccagg cgtgggaagc cggaaattct 5280 catgtgcgac aacgggcgaa attttgtcgg cgcgagacgt aaattggaag agttacggcg 5340 tctgttcaac aatcaacaat tccagcaatc tgtcataagc agtgctgctg aagaccagat 5400 ccaatttcgc tttattcctg cacgttcgcc aaactttggt ggactgtggg aatcggctgt 5460 caagtcattc aagttgcttt tcaaaagaac ggttggctca cacatccttt tgtacgatga 5520 aatgcaaaca atactaactc aggtggaagc agttctgaat tctcgaccct taacgccgct 5580 cagtagcgat ccggacgatt acgaagcatt gaccccgggt cattttctta ttcagcgacc 5640 cctcactgcc attcctgaac ctgatctaga cggaatcccc gaaaatcgtt tggcagctta 5700 tcagaagtct cagcgattca cgcagcagtt gtggaaaaaa tggtcagaac tttacctctc 5760 agaccttcac aacaggacaa agtggaccgt gagaagaaaa aacatcgtgg aaggggccat 5820 ggtggtcctg aaggaagaaa atcaacctcc tctgacatgg caactggggc gcgtgactga 5880 agttcacgct ggtcgggacg ggaatgtacg agtcgttaca gttaaaacaa aggatggtag 5940 ctatcgtaga gcgatctcca agatctgcgt tctaccaatc cgagacaata ttgatcatca 6000 taatgaggaa tgaggactcc tccatcagcg gaggtcgacg atttcgacct ccgcgcacca 6060 gttaagttgt ttattgttaa tgttattcgt aaaaagtaat ccgctcatat ttcccccagc 6120 tacgattcgc cctagttcca gtgttccagg aaatttctgg ctttgaggaa ctcaacctaa 6180 gccagctaag cactctttag tcgaggtcgg tatgtctatg gtgatggtcg ttgattgcat 6240 gtgaatttca atatgttcaa aatgtgaaca cattcagcgg tgttcaatgc cagcgcatcg 6300 gcgctcgaat caataaaacc ttcatagaga tctacgtttg atcgttcgag gtgatctgca 6360 tcagtatcag cgagaggaat aaaatcggaa ttcgtcttca gtgttctacc tatttgcgac 6420 agcaaacaca accagtacgt ctaacgaaca caccaagtag cagcagcatc acaagatcat 6480 gcgaagatcg cgacggtgaa tcttattcat gagagggtca tcggtgtcaa ccggcgctcg 6540 cggaggcctt ggaggcctcc gcatcaggca agttactttt tattgtgatt aaaagggtaa 6600 aaaagcaccc tctcatattt tgcagcaaga tgtcaagcaa cacaccacca ctgagagtca 6660 ccgaaaccta ccagacagtc tgtatcgtta aagcaaaccg taattcgaat cgaacatcta 6720 agtcccgtca gcatattgtc accgtgagcg tatcaaccca ccgtcaagtc aacgcagtgt 6780 ttaactggaa agcaactaac gcatttaatt gacccatgat cgaccccagc tggagttcat 6840 cgatgccatc cggcgtcagc ggaggccacg gaggcctccg tttcaggcaa gtcgttggtt 6900 aatttgatta taaaggtaaa aaagcacctg ctcatctctt ctagcgatat gatctgcagc 6960 gacatcaaca tcggaaatat gaaattgatc ctcatgacaa gtaaggcgta tcgaacgcct 7020 atccaatcgt aaacaacgca acaaattttg acctaggaaa acagctacga aaccagcgaa 7080 gaaactaaat cactggagca gcagcaatag aagtgatagt cgttttattt ttgtgcaagc 7140 gaatcatacg agcaaaacaa aaataattaa atagaactcg atagtcagta agttagagta 7200 ggatagcagt taaaattaat aatttcaagg cggccggcat a 7241 // ID BEL-44_AA-LTR repbase; DNA; INV; 491 BP. XX AC AAGE02018502; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-44_AA_; KW BEL-44_AA-I; BEL-44_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-491 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018502; Positions 52641 53131. XX SQ Sequence 491 BP; 170 A; 99 C; 84 G; 138 T; 0 other; tgtggcgacg gacccctcgt tgcggcgaag cgtgagtgac gctttgcttt agagcgtaga 60 tggtagtagc tgtcaacgac tatcacacat gcaattaaga gaaaaacatc agccacaagt 120 gatcgagatt gaaattccct taaaagtaat taaaattagc tcctaaaagt aacgattatt 180 gaactattga acattaattg aactagaact attcgtgcat atcctattat acaaaagtaa 240 gcttattact aaaaatctag cctaatacag tgaattaatg taatcttaac ctatatctag 300 cctacaccta catcgtgaaa ttcgtcgact caccacgatc gttacactat cggttcacca 360 aaattgtgag taagctattt gaaatttgat ccttaaaata aatactaaat tgtttcagct 420 taaagctaac agcatcaaaa actgcgtttg ctccttgaga gttggaaaac ccattcgcgt 480 acgaccgaac a 491 // ID BEL-12_CQ-LTR repbase; DNA; INV; 504 BP. XX AC AAWU01032924; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-12_CQ_; KW BEL-12_CQ-I; BEL-12_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-504 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 178-178 (2011). XX DR Genome; AAWU01032924; Positions 36102 35599. XX SQ Sequence 504 BP; 93 A; 154 C; 106 G; 151 T; 0 other; tgttgcgtca gccagtcttt cttaaaaagt tctttatatt ttgcagttag caccgtcgac 60 agctggtgct gaccggcatt attatttttc gccccacaat tctggcaccg ctgcgctgtc 120 agaggcaagc cacccaaatg tgtctcgtca ctctcgtcgt cattccaaat ccaacctctt 180 ttgaagacag aagtgctgaa aataaaaagg aagattaaaa ttagtttgtt tcgctaaata 240 aagtgcgttg ttacttacct tcgcgtgttc tacttacctg ctgttgccac tgctctgctc 300 ctgctgccgc tgctctgctc ctgctgcacc tgctgctcct gttgcacctg ctgctgcctg 360 tacctgctgc tgcgcttgca cctgctgctg ctcttgcacc tgctgctgct cctgctggac 420 cacctgcttc cgccgccgtt gctctaatct gatcctcccg cacaaaacac aacgtagggt 480 cccggttggt gagttgccac tcca 504 // ID Penelope-6_HM repbase; DNA; INV; 2245 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2245 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2096-2096 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 266..2035 FT /product="Penelope-6_HM_1p" FT /translation="MYKLSKEDYNNLLTNAITTTYKKANPKLKDHINKEGK FT QILTNHEVYKKIEINGSSNCFFTLKDHKENFANNPSVRLLNPSKNEVGRIS FT KVILTKINCELKNKLFLNQWQSTQNVLDWFKKITNKHLYKFLVFDVNDFYP FT SIKESTLNNAMNFADHHIVIKSEDKALIKHARKSLIFNNDEAWIKKRGGLF FT DVTMGAFDGAEVCELIGIFILYQISQHYNKEDFGLYRDDGLAVFKNKNGQQ FT MEKIKKHITLIFKKNDLQITIQSNLKIVNYLDVTLNLSNDTFQPYFKPDNQ FT IRYIHSNSNHPPSIIKAIPRNIEQRLSSMSINEKFFKKAIPPYEEALKKSG FT YKSNLTYQPQIISSQNQQKKQRKRNIIWYNPPFSLNVKTKIGNRFLALIDL FT HFPAGHKLHKIFNRNSVKISYSCMPNLKSIINSHNFKILHNKNQLKTNQCN FT CVDKAFCPLNNQCLSNNIVYQATVSSSHHDRNKKVYFGISNTPFKLRYANH FT LKSFASIKYRNDTELSKEIWSLKDQNIKPIIEWKIVKRCKPYNLATKICHL FT CLYEKYVILSHNGKNLLNKKNEIVSQCRHSKKFLLSLFDTGD*" XX SQ Sequence 2245 BP; 913 A; 383 C; 279 G; 668 T; 2 other; taaaatatat aatatataat aatacgatta agtgttacaa tgaataatag atctatatat 60 aattatattt tataaataca ggaattaaag tttcatgtcc aaggcaatyt cagccgttaa 120 tacatacaaa aaaaaaaaaa cttaaaaaac ataaaaattt tataaaaatt tactaaaaaa 180 caaatcaaaa aacgaaatat aaaaagatat cataaataat ygtaaatctc acaaaactct 240 aacacatgct gacaaaacat caaatatgta taaactgtct aaagaagatt acaataatct 300 gttaacaaat gctattacca ctacatacaa aaaagcaaat ccaaaattaa aagaccatat 360 aaacaaagaa ggaaaacaaa ttttaaccaa ccacgaggta tataaaaaaa tagaaataaa 420 cggatcctct aactgttttt tcactttgaa agatcacaag gaaaacttcg ctaacaatcc 480 atctgtgcgc cttttgaacc cttcaaaaaa tgaggtagga cgaatttcta aggtaatttt 540 aacaaaaatt aattgtgaac ttaaaaataa actttttttg aatcaatggc aaagtacgca 600 aaatgtttta gactggttta aaaaaatcac caataagcac ttatacaaat tcctagtttt 660 tgatgttaac gatttttacc cttccataaa ggaaagcacc ctcaacaatg ctatgaattt 720 tgctgaccat catatcgtta tcaaaagcga agacaaagca ttaataaagc atgcaagaaa 780 atctcttatc ttcaacaacg atgaagcatg gattaaaaaa agaggtgggt tgttcgatgt 840 aacaatgggg gcatttgatg gtgctgaagt ttgcgaatta ataggtattt tcatactgta 900 tcaaatttca caacattata ataaagaaga tttcggctta tatcgtgacg acggtcttgc 960 tgtattcaag aacaaaaacg gtcagcaaat ggaaaaaatc aaaaagcata taacactcat 1020 ttttaaaaaa aacgatcttc aaatcaccat tcaatctaat ttaaaaattg ttaactacct 1080 cgatgttacg ttaaatctaa gtaacgatac gtttcaacca tattttaaac ctgataatca 1140 aataaggtac atccactcta attctaacca ccctccaagc ataataaaag caatccctcg 1200 caatattgaa caaagattat cttccatgtc aattaatgaa aaattcttca aaaaggctat 1260 tccaccgtac gaagaggctt taaaaaaatc gggatacaaa tcgaacctta cctaccaacc 1320 acaaataatc tcaagtcaaa atcaacaaaa aaagcaacgt aaacgtaata ttatttggta 1380 taaccctccg ttcagcttaa acgtcaaaac aaaaatagga aaccgttttc tagcccttat 1440 tgacttacat tttccagccg gtcacaagct acataaaata ttcaaccgaa attccgttaa 1500 gattagctac agttgcatgc ctaacctaaa atctatcata aattcgcata atttcaaaat 1560 tttacacaac aaaaatcaat taaaaaccaa ccaatgtaac tgcgtagata aagcattttg 1620 ccctttgaac aatcaatgtt tatctaacaa tattgtctat caagcaaccg taagttccag 1680 tcatcatgat cgtaataaaa aagtatattt tggcattagt aatacaccat ttaaactaag 1740 atacgcaaac catctcaaat catttgcatc aattaaatat agaaatgaca cagaactatc 1800 taaggaaata tggagtttaa aagaccaaaa cataaagccc attatagaat ggaaaattgt 1860 caaacgttgt aaaccttaca atctagcaac aaaaatatgc cacctttgtc tctacgaaaa 1920 atatgtaatt ttatcgcata atggaaaaaa tctactgaat aaaaaaaacg aaatcgtatc 1980 tcagtgtcgg cattcaaaaa aattcctatt atctttattt gacacagggg actaataaac 2040 tataacggct ttttgttttt tgttttttga tttgtttttt acgtagattt gtttataacg 2100 tcagatgtca ttctacagta atttttacgt ttttgtaacg gtttgtttac gttttttgtt 2160 tgtattaacg gctgatgatt gccttcgggc atgaaacttt aagtcccgct aatcagttgt 2220 ttttattcat ctaaatataa ttata 2245 // ID Gypsy-215_AA-LTR repbase; DNA; INV; 181 BP. XX AC AAGE02027318; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-215_AA_; KW Gypsy-215_AA-I; Gypsy-215_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-181 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027318; Positions 22635 22455. XX SQ Sequence 181 BP; 55 A; 39 C; 32 G; 55 T; 0 other; tgtagtattg tagaacacct tcatatataa ataccttgaa atggcattga gtattctagc 60 aacactgtca aagactacct tctatgcgac cgttacgtta tgggattaaa ggagttttct 120 tcataccagc acctgagaca gacgcgttct ctgttctagt ccgaaagact caaattctac 180 a 181 // ID Gypsy-61_AA-LTR repbase; DNA; INV; 206 BP. XX AC AAGE02020262; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-61_AA_; KW Gypsy-61_AA-I; Gypsy-61_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-206 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020262; Positions 22241 22036. XX SQ Sequence 206 BP; 75 A; 33 C; 42 G; 56 T; 0 other; tgttatattc ggacaaatag ccaagaccat ctcagagaga aaggaatgta aacgattatt 60 gagagatgag agataaaacg ataaaatacg atgtaccggt tattacaata aacgcaggcc 120 tctgtcggtc tcttgtactt tgaacgttaa acctagtggt tgtgttttat aaattagtgc 180 tacgagaaaa ccccaaaatt tgaaca 206 // ID CR1_Ele21 repbase; DNA; INV; 4181 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele21. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4181 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4181 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 25 CC sequences with >97% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 362..1219 FT /product="CR1_Ele21_1p" FT /translation="MEICNSCAVIMNTSEVACGGFCKATFHYKCAKISESF FT YKEICGNSAVFWMCKGCCDMMKNSRFKNAMISTNAATLELKDAYQKVVEDL FT KTEIKDSLIAELKQEIQGGFNKLSPAVLSPVPRHFQFRGRNTPKRIRDEDA FT SEPTEQPTKIFCGTSQSASNALEARSDENFWVYLTKISPEVSENDVLNLAK FT ERLQTEDVVVRSLVPRGKPLSMLSFVSFKVGVHNDYKSKAMDPATWPKGIQ FT FREFVDNDSNSRHFWKPPQRIDPGATSSIQSQQLSSHPVQEPLPT" FT CDS 1102..4122 FT /product="CR1_Ele21_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QRFQFSTFLETSATHRSRSNLFDPIAATVESSSPRAT FT SNVDESRSHLKYPSNFSSTFLTVYYQNVRGIRTKTQELLLNLCSCDYDIVV FT LTETWLRPDISNDEFASSYNVFRCDRSSATSSLQRGGGVLIAVKAELNCRS FT VELENCESLEQAVVQIKLQKFSVYVCGIYLRPNSQPELYYSHSMAVQQLCE FT RTASADKVIVMGDYNLPQLIWQNDEDIDGLLPCNASSEQEIALVETMVASG FT LRQINCLRNLNGRLLDLVFVNEARDIEVIEPPSPLLKIDYHHKPLVLRMDL FT NDARMQFAEDSSQQRDFDFQRCDFAELNDAISAIDWEIAFDGRGTDETVCI FT FYDILYGILDEHVPRRRRTRCSYKHPWWTSELQHLRNVVRKSRRRYFRART FT DENRNNLRLIETSYNECQTAAFRSYVARLETTAKHDPTAFWNFIRNRKHST FT RFPAEMTYKKTIASCPEDVANMFADCFKNVYSSNPPTFSSDNLRNCPTYDL FT NLAPFDISRQDVYSALKNLDVTKGAGTDNLPPLVLKECAESLQMPLEIIYN FT KSLRNSTFPSIWKKASITPIFKAGSSHAVENYRGVSILCSVAKVFEGLVHN FT VLYAASTPIISELQHGFVKKRSTTTNLMMFTNFLSTEIEKRSQVDAIYFDF FT SKAFDKVPHELAIAKLRHLGFPDWITEWLRSYLTQRVAFVNINGTHSSVFS FT IASGVPQGSVLGPLIFILFINDLCFRLKSGKLFYADDLKIYRTIASHLDCC FT ALQADIKELELWCMENGMELNIKKCKSVVFSRRQSCIEFGYSIGSEPLERV FT ESIRDLGVILDSKLRFNEHVNATTAKAFAALGFVRRNASCFKDIYALKALY FT CSLVRSILEYAVCVWSPQHTTHIIRIEKVQRSFIRFALRQLPWSDPFNLPD FT YTSRCRLIALQTLSSRRSNLQRLFMFDLIRGNLDCPSLLQNVSFYAPNRHL FT RQRDLLLIRRHRTSYGFNNPLCVCSRLFNSVSDLFDFNVSKSVFKSRIRVL FT E" XX SQ Sequence 4181 BP; 1161 A; 980 C; 901 G; 1139 T; 0 other; tacgcaaacc ctgaaagcgt catacacgca tccaattgta tccgtcaagc caatacactg 60 tacctgctgc tgcttatctt ctcaatccgc aatccgaaat ccccgccaga gccacgtcaa 120 gagcctcaac cgttttccat tccgtatcgc tgcaagttca tcttctccgc ttcatcatcg 180 ttcaagccag gtaaacaatc cacaattcgt cggctgttgt gtgagtagag cgattataga 240 gaacacacat tttcgcatct gtcactgcca gtgttgccag tattctcgac ttttttggag 300 taacgcttga gtgacgctcg tgtagtcaaa cgaccgattt gaatcgaccg ttgttttcac 360 gatggaaatc tgcaacagtt gcgcagtcat tatgaataca tcggaagtgg catgcggcgg 420 tttttgcaaa gcaacattcc attacaagtg cgccaaaata tcggaatcgt tttacaaaga 480 aatttgcgga aactcggccg ttttttggat gtgcaaggga tgttgcgata tgatgaagaa 540 ctctcgattc aaaaatgcaa tgatatcgac aaacgccgct actttggaac tcaaagacgc 600 ttatcaaaaa gtcgtcgagg atcttaaaac tgagataaag gatagtttga tcgccgagct 660 gaagcaagaa atccaaggtg gtttcaacaa attgtcccct gctgtgcttt caccagttcc 720 acgtcatttc cagttcaggg gtcgcaacac tcctaagcgc atccgtgacg aagacgcctc 780 tgaacctacg gaacaaccaa caaaaatctt ctgcggtacc agccagtcag caagcaacgc 840 actggaagct agatctgacg aaaacttctg ggtgtattta actaaaatat cacctgaagt 900 ctcggagaac gacgtgctca atctagctaa ggagcgcctg cagactgaag atgtggtagt 960 aagatccctg gttcctagag gtaagccgct atcaatgttg tcgtttgtgt cctttaaggt 1020 aggggtccac aacgattaca aatcgaaagc aatggatcca gcaacatggc ctaagggaat 1080 acagtttcgt gagtttgttg acaacgattc caattctcga catttttgga aacctccgca 1140 acgcatcgat ccaggagcaa cctcttcgat ccaatcgcag caactgtcga gtcatccagt 1200 ccaagagcca cttccaacgt agatgaatcg cgcagccacc tcaagtaccc ctccaacttc 1260 tcttcaacct tcctgacagt ctactatcag aacgttagag gcatccgaac aaaaacgcaa 1320 gaactactct tgaatttatg ctcttgtgac tatgacatcg tagtactcac cgaaacttgg 1380 ctacgcccgg atatttctaa tgacgaattc gcctcaagct acaacgtatt tcgatgcgat 1440 cggagctcgg ctactagcag ccttcaacgt ggtggaggag tgctaattgc agttaaggcg 1500 gaattgaatt gcagatcggt ggaactagaa aactgcgaat ccctcgaaca agccgtcgtt 1560 cagatcaagc ttcaaaagtt ttcagtttac gtttgtggga tatatctacg accaaactcg 1620 cagcccgagc tttattactc acattccatg gcagtgcaac aattatgtga acgaactgcg 1680 agtgccgaca aggttatcgt catgggagac tacaatcttc cccagttgat ctggcagaac 1740 gacgaagata ttgacggtct actgccttgt aatgcttcat ccgaacaaga aatcgccttg 1800 gtcgaaacaa tggttgcttc aggactgcgt cagatcaatt gcctacgaaa cttgaacggc 1860 cgcttactgg accttgtttt tgtcaatgaa gcaagggaca ttgaggtaat cgaaccaccg 1920 tcaccgctcc tcaaaattga ttatcaccat aagccgctgg ttcttcgtat ggacttaaac 1980 gatgctcgta tgcagtttgc cgaggattcg agccaacagc gtgacttcga ttttcaacgc 2040 tgtgattttg cggagttgaa cgatgctatt tcagccattg actgggagat agcgtttgac 2100 ggtaggggca ccgacgagac ggtatgtatt ttttacgaca ttctgtacgg aattttggac 2160 gagcacgtcc ctcgtagacg acgcacgcgt tgttcttaca aacatccttg gtggacatcc 2220 gagctgcaac atcttcgcaa tgttgttcgc aaatctcgaa gacgttattt tcgagcgcga 2280 actgatgaaa atcgaaacaa tttacgtctt atcgaaacca gttataatga gtgtcaaact 2340 gctgcatttc gaagctacgt cgctcgtttg gagacgaccg ctaagcacga tcccacagcc 2400 ttttggaatt ttattcgaaa tcgcaaacac tcaactcgat ttcctgctga aatgacttac 2460 aaaaaaacta tcgctagctg ccctgaagat gttgctaata tgtttgccga ctgctttaag 2520 aacgtgtaca gttcgaatcc accaacgttt tcatcggaca acctcagaaa ttgcccaact 2580 tacgatctga atctcgcgcc tttcgatatt tcgcggcaag atgtatattc agctttgaaa 2640 aatctggacg taaccaaagg ggctggtact gacaatttgc cgccgttagt cttaaaagaa 2700 tgtgcagaat cgttgcaaat gccgctggag attatctata ataagtcgct tcgtaatagc 2760 acttttccgt cgatatggaa gaaagcctca atcactccta tcttcaaagc tggttcttct 2820 catgctgtcg aaaactaccg tggcgtttca attttatgct ccgtagctaa agtttttgaa 2880 ggtttagttc acaatgttct ctacgctgca tctaccccaa taatttcgga actccagcac 2940 ggcttcgtca aaaagcgatc aacgacaact aaccttatga tgttcactaa cttcctatcg 3000 acagaaatcg aaaaaaggag ccaagtagat gccatatact tcgatttctc caaagcgttc 3060 gacaaagtgc cacacgaact agcaatagcg aagctaagac acttgggctt tccggattgg 3120 ataacggagt ggttgcgatc gtatctgacg caacgcgtag cattcgtcaa cattaatggc 3180 acacactcta gcgttttctc tattgcttct ggtgttcctc agggaagcgt tctggggcca 3240 ctgattttca tattgtttat aaacgacctg tgcttccgac tgaaatcggg aaagctgttc 3300 tacgctgacg atttaaaaat atacagaact atagcatctc atctggactg ctgtgcgcta 3360 caagcagata taaaggaact cgagctgtgg tgcatggaaa acggcatgga gttgaacatc 3420 aaaaagtgta agtccgttgt attttcccgt cgacaatcgt gcatcgaatt tggatattcg 3480 attggatcag aaccattgga acgcgttgag tcaatccgag acttaggtgt tattctggac 3540 agtaaactgc ggtttaatga acacgtcaat gctacaactg cgaaggcgtt tgcagcctta 3600 ggatttgttc gacgaaatgc tagctgcttc aaggacatct atgctctcaa ggcgttatat 3660 tgctcgcttg ttcgtagtat tttggagtat gctgtatgtg tatggtctcc acaacatact 3720 acccacatta tccggattga aaaggtccag cgtagcttca tccgcttcgc ccttcggcag 3780 ctgccatggt cggatccgtt taacttgcct gattacacgt cacgctgcag actcattgct 3840 ttgcaaacac tttcttcgag acgcagtaac ctacaaaggc tttttatgtt tgacttgatc 3900 agaggaaatc ttgactgtcc atcgctcctg caaaatgttt cgttctacgc accaaaccga 3960 catctgcgac aacgtgattt gctgctcatc agacggcata gaacttccta tggcttcaac 4020 aatccgttat gtgtatgttc tcgattattc aatagtgtga gtgacttgtt tgactttaat 4080 gtatccaaat ctgtgtttaa aagtagaatt agagttttag agtaagagac agtctgggga 4140 atcgtactag attcaagaca gtgaataaat aaataaataa a 4181 // ID CR1-14_CQ repbase; DNA; INV; 5595 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-14_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5595 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 17-17 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 404..5362 FT /product="CR1-14_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MPKTGKIEVVACCDAGSRGTKIRCEFCAKSWHTKCAN FT ITANAIKTLRDTGGACWLCPTCRTPDGRQPASSDTSKLILQRVTSALLLIG FT NLTDTMRVFCRMYSASLSASTPSTTVLPPTAALEPDLSDAFHANLSDLFDS FT ILGDGNKRPRTSSLSRASEVQPEKILRVDVPVDQTSTSNITVPPVSGAADA FT SPSATLETAVTTTPSAVNLLAVADPATSPLAAAVAAVTGVANADASADTLL FT GSSELAASLPISPHQSSTXATAAAVAAMPPTIQPEEPPLTMLSVQPLVTPQ FT PPPPXIMPSLQPQQVANIPQVLQPLQIMPQAPPQPLTMPSSQPLVMLSPTM FT LSAQPLVTPQPPLPPPRKLAPPPPLIMQPLQTAPPASTAASTNEFAVRSSE FT AVDSNXQSTTPTVNFIPCPTSFTVTDGNNINVYGPSPALLEQLTVSTNXNC FT QTRQTNCMPHTTQNNAMLNQAANNPSPPGNINQSHPTLLVAPLEQPVSELK FT WYYVSRFLPTETPENLIRYIRHQTNCDALIVCQKLVRKNRNANRPLTFLSF FT KLNVPESIENRITAADFWPEGVTIKPFLVNRQPPPTNWSSLDQSQPAEQPH FT AYQEQSTHSNPSSPALTXHDTEAIVDAADSLSVPSQPILACVSSQPADGAF FT XIPVHAAVQSNARVQQHSTIDNGLIPSSAHSVSSIQNAQAAVSTNLLLYYQ FT NVGGINTTIANYALAISSASYDLYAITETWLTSATLSSQIFGPEYEVFRGD FT RTASNSCKGSGGGVLLAVRSNLKPRQLFPPDCTVPEQVWVAVPLAASTMFV FT CVIYIPPKFDNEKPLFDQHRXSLTWIVSKMKANDSLMVLGDFNFPAIRWTR FT TPTNKLIPNLALTPTNELKHKLLDEYSTANLSQLNDMCNNSSNVLDLCFAS FT SGTPINFTLLPAPLPLVKDVRHHFPFLVSISCTTLDFRDVAGNTFMDYRKG FT NYDGMNNFLTNINWNQLLANLDADTAADTWTGVLTDAINTFVPRKQRQPPR FT HPPWSTPRLQILKSRKRAALKKYTKHPTDRWRNHYRSRNRKYSILNKQLFR FT RHQHRIQSRLKRDPKKFWNHVNEQRKETGLPTAMILDGEEATSTESISDLF FT RRQFSSVFTNEAVGETHIAKAASNVPLRPPIGPHPVVTSESIRRACASLKG FT STSCGPDGIPAFVLKKCCDALAEPLAQLFNTSLATGVFPCCWKKSFVFPVH FT KKGPKRDVRNYRGIAALCAVSKLFEVIVLDFIKFNCCDYVALEQHGFMAKR FT STNSNLVSYSSFILRTMQQRKQIDAIYTDLSAAFDKLNHRIAVAKLERLGF FT GGPMLDWLRSYLTGREMSVKIGDVISAAFSVFSGIPQGSHLGPLIFLLYMN FT DVHHLLGCHKLSYADDIKLFTVVENDTDCQFLQEQLDLFANWCSENRMVLN FT ASKCSVISFTRKRNTMSFHYTLSNTTIPRTSCVKDLGVMLDSKMTFADHIT FT YTVSKASKTLGFIFRIAKNFRDLSCLKALYCSLVRSTLEYCCIVWAPFYQN FT AIQRVESVQRKFVKYAQRHIIWPDPANPPSYAERCKMLNLELLTVRRDVSK FT ATFVADLLRSSIDCPAVLQLVNINTRPRVLRNHSFLTVHRALTNYGQNEPV FT SSMCRVFNLCSDLFDFDISRDTIKKRFLNHLKSHP" XX SQ Sequence 5595 BP; 1402 A; 1631 C; 1153 G; 1401 T; 8 other; taccaacaac actttcggat waaatttttg ttgacatcag tccgcagttt tatttcgata 60 tttgattgca aagtgctttt attttcatta aaacgcgtta cttcgccggt tgaaacacct 120 gtgtttacgc gcatagtgaa gttaagtgcc ttaattggat atttggattg tttttcgtgt 180 gattaatatt acactccttg catctcgccg ctgtttttgc cgccctcgtg aagcactgta 240 gtctcggaac actacacttg gctcccgtag tcccgcgacc tcacgcgcac atttggagca 300 gtccttcgca gtgtgcgttt gtttaccctc cgcaactacc cagtctgaag aacgggccga 360 aagcgagtcg aaagcgtgca tcgtctgtac gatttactcg aagatgccta aaactgggaa 420 gattgaggtg gttgcgtgtt gtgacgccgg ttctcgtgga acgaaaattc gatgcgaatt 480 ctgcgccaag agctggcaca ctaagtgcgc aaatattacc gcgaatgcaa tcaaaactct 540 tcgtgacacc ggtggtgcat gttggctttg cccgacgtgc agaacaccag acggaagaca 600 acctgcatcg agtgatacca gcaagctcat cctacagcgt gtcacatcag cgctgttgct 660 catcgggaac ttgacggata ccatgcgagt cttctgtcga atgtactctg catcgctctc 720 tgcatcgacg cccagtacaa ccgtgctgcc cccaactgct gctcttgaac ctgacctgtc 780 cgacgccttc cacgcaaact tgagcgatct attcgattcg atccttggtg acgggaataa 840 aaggccccgt actagcagct tatccagagc atctgaagtc caaccagaga aaatactgag 900 agttgacgtt ccagttgacc aaacctccac gtcaaacatt acagttcctc ccgtttcagg 960 cgctgctgac gcttctccgt cggccacact tgagactgct gtcaccacta ctccgtcggc 1020 cgtgaacctg ctggccgtcg cggatcccgc cacttctcca ttggctgccg ccgttgccgc 1080 tgtcaccggc gtcgccaatg ctgatgcttc agctgacact ctgctgggtt cctctgagct 1140 cgccgcttct ctaccgatca gccctcatca atcgtctacc wccgccactg ccgccgccgt 1200 cgcagcaatg ccgcctacaa ttcagcccga agaaccaccg ctgactatgc tttccgtcca 1260 gcctctggtc acgcctcagc cgccgccgcc asaaatcatg ccctcgcttc agcctcagca 1320 agtagccaac ataccgcagg tcctgcagcc gctgcaaatc atgccacagg ccccgccaca 1380 accgctaact atgccctcgt cccagcctct agttatgcta tcgccgacca tgctttccgc 1440 ccagcctctg gtcacgcctc agccgccgct accgccgccg cgaaaactgg ctccgcctcc 1500 gcctctgatt atgcagccgc tgcagactgc gccgccagca tcaaccgctg cctctaccaa 1560 cgaatttgca gttcgctctt cagaagccgt cgactccaac maacaatcga caacacctac 1620 agttaatttt ataccgtgcc cgactagttt taccgtaacc gatgggaata atattaatgt 1680 ttatggacct agtcctgcac tgcttgaaca gcttacagtc agcaccaacm agaattgtca 1740 aacacgccaa acaaattgta tgccacacac cacacaaaac aatgcaatgc taaaccaagc 1800 agcaaacaat ccctcaccac ccggcaatat caatcaatcc caccctacac tacttgtagc 1860 tcctctagag caaccagtgt ctgagctgaa atggtactac gtctctcgat tcctcccaac 1920 tgaaacccct gaaaacttga ttcgctacat tcgacaccaa acgaattgtg atgccttaat 1980 cgtttgtcag aagctagttc gcaaaaatcg caatgcgaac agacctctca cattcctctc 2040 attcaaactt aatgtccctg aatcgatcga gaacagaatc actgctgctg acttctggcc 2100 tgaaggagta acgattaaac cttttttagt gaatcgacaa ccccccccaa caaactggtc 2160 ctcccttgac caatctcaac cagcagaaca accacatgcc tatcaggagc agtccacgca 2220 cagtaacccc tcgtcgcctg ccctcacawa ccatgacacc gaggcaattg tcgacgcagc 2280 agacagttta tccgtaccct cccaacctat cttggcctgc gtttcatccc aaccagctga 2340 cggtgccttc cktataccag tacatgccgc agtacagtcg aatgccagag tacagcaaca 2400 ttcaacaatc gacaatggtc tgatcccgtc atctgctcat tcagtctcat cgattcaaaa 2460 cgctcaagct gctgtttcta cgaatctgct actttattac cagaacgttg gaggcattaa 2520 taccactatc gccaactacg ccctcgcaat ctcttccgcc tcatacgacc tgtacgcaat 2580 aactgagacg tggttgactt ctgctactct gtctagtcaa attttcggtc ccgaatatga 2640 agtattccgt ggagatcgga cagcctcgaa cagctgtaaa ggatcaggcg gaggagtcct 2700 gcttgccgtc cgctcgaacc taaagccacg ccaattgttc cctccagatt gtaccgttcc 2760 ggagcaagtc tgggttgcag ttccactcgc tgcatccacg atgtttgtgt gtgttatcta 2820 cattcctccc aaatttgaca acgaaaagcc gctgttcgat cagcacagac amtccttgac 2880 gtggatagtt tccaagatga aagcgaacga tagtcttatg gtgctaggtg acttcaactt 2940 cccagccatt cgctggacgc gcacaccgac gaacaaactg attccaaact tagccctcac 3000 tccgaccaat gagttaaagc acaaactcct ggatgagtat tctaccgcaa accttagcca 3060 actgaacgac atgtgcaaca actcaagcaa cgttcttgac ctatgctttg ccagctctgg 3120 gacaccgatc aactttaccc ttctaccagc tcctctgcct ttggtgaaag acgtgcgcca 3180 ccactttccg tttcttgttt ccatttcctg tacgacgctc gattttcgtg atgttgctgg 3240 taatacgttc atggactatc gtaagggaaa ctacgatggc atgaacaact tcctgaccaa 3300 cattaattgg aaccaactac tggccaacct tgacgccgac acagctgctg atacttggac 3360 aggtgttcta acagacgcca tcaacacctt cgttccaagg aaacagcgcc agcctccaag 3420 acatccaccg tggtcaacac ctcgactgca aattttgaag tccaggaaac gcgctgccct 3480 caagaaatac accaaacacc cgacagatcg atggagaaac cattataggt caagaaaccg 3540 gaagtacagt atcctgaaca aacaactttt tcgtcgccac caacaccgaa tccaaagccg 3600 attgaaacga gaccccaaga agttctggaa ccacgtaaac gagcagcgga aagagacagg 3660 tctaccaact gcgatgatac tcgacggtga agaggctact tccaccgaga gtataagcga 3720 tctgtttcgt cgccagttca gcagcgtatt caccaacgaa gcagtagggg aaacgcatat 3780 tgctaaggct gctagcaacg ttccactgcg acctcccatt ggacctcacc cggtggtcac 3840 ttccgagtcc atccgtcgtg cctgcgcctc tctcaaaggt tctaccagct gcggcccaga 3900 cggcatcccc gcgtttgtgc taaaaaagtg ttgcgatgca ctcgcggaac cattggctca 3960 actcttcaat acctcgcttg ctactggagt tttcccgtgt tgctggaaga agtcctttgt 4020 tttcccagtc cacaagaagg gcccaaaacg tgatgtccgg aactatcgcg gaattgctgc 4080 cctctgcgca gtcagcaaac tgttcgaagt aatcgtgctg gatttcatta agttcaactg 4140 ctgtgactat gtcgccctgg aacagcacgg cttcatggcg aaacgttcca ccaactctaa 4200 cttggtctcc tactcgtcct tcattcttcg aaccatgcag caacggaagc agatcgatgc 4260 catctatacg gacctatcag cggctttcga caagctgaac caccgcattg ctgttgcgaa 4320 actggaacgc ctaggtttcg gcgggcccat gctcgattgg ctacgctcct atctcactgg 4380 ccgtgaaatg agcgttaaaa tcggtgacgt gatttccgct gctttctctg ttttttcagg 4440 cattccgcaa ggaagccatc tgggccctct gatcttcctc ctctacatga acgacgtgca 4500 tcatctttta ggctgtcaca aactgtcgta tgcggatgat atcaaactgt tcaccgttgt 4560 cgagaatgat accgactgcc agtttcttca ggagcagctc gacctgttcg ccaactggtg 4620 ctccgaaaac aggatggttc tgaacgcttc caagtgctcg gttatctctt tcacacgcaa 4680 gcgcaacaca atgtctttcc actacacact ttcaaacacc accataccta ggacctcctg 4740 tgtgaaagat ttaggtgtga tgctggatag caaaatgacg tttgctgacc acattacgta 4800 tacagtctcc aaggcttcca aaactcttgg cttcatcttt aggatagcta aaaacttccg 4860 ggatttaagc tgtctcaaag ctctttattg ttcgttggtc cgctctactt tagagtactg 4920 ttgtattgtt tgggctccct tctaccagaa cgcgattcaa cgcgtggagt cggtccagcg 4980 gaagttcgtt aagtacgcgc aacgtcacat tatctggcct gatcccgcca atccaccgag 5040 ttacgcagag cgctgtaaaa tgcttaatct cgaacttctt acagtaaggc gtgacgtttc 5100 caaggcgacc ttcgttgcag atctccttcg atcgtccatc gattgtcctg ccgttttgca 5160 actggtaaac ataaacactc gccctcgcgt actccgcaat cactcgttct tgactgtcca 5220 cagggctctc acaaactatg ggcagaacga accggtttca agtatgtgtc gtgtttttaa 5280 cttgtgctca gatctgtttg actttgacat ctcccgtgac actatcaaaa aacgattcct 5340 taatcacctg aaatcccatc cctaacctga cgatacacac gtagatttta gaactgtgat 5400 atttttttaa tttattgagt tagttttaag agtgaccccg tcttgtatca tttgagtttt 5460 gtgtacttgt tgatgcgata agatgaggtg gttttgtgcc tttttgagaa agtgtctttt 5520 atcgataacc agacacagct caagggggct tttgtccacc tccaataaag aaaaaaaaaa 5580 aaaaaaaaaa aaaaa 5595 // ID MuDr-3_HM repbase; DNA; INV; 8390 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE MuDr-type DNA transposons from Hydra magnipapillata. XX KW MuDR; DNA transposon; Transposable Element; Mutor; MuDr-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-8390 RA Bao W. and Jurka J.; RT "MuDr-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 443-443 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1635..3068 FT /product="MuDr-3_HM_1p" FT /translation="MCSACMMSFTRKDNMKRHALQAHKTIFRDNQSKDKRT FT PCHYQNCSQVFFQNVKLIKHIEECHEGIFTKESYIFLSESEFLAWKEKEEL FT NSYVFFSKQTGSFFTGKNINKDVKTSYYICQNDGHSKPHRSVSEPARKTNK FT RYYRGSVKSGAFCPARMIVKVNINGVTNVQYIKTHNHSIGITNTMYQPIPG FT NIKKNISAKLSIGVPVNTVYRDLRESFGDRQNRNDEDTVLTKSHLLTKKNI FT SDISRAVKKGCRLHPDDSVSTFLLVQKLKSEDFNSILVYKSQGQQTVIGPK FT VYDDIDLNKDSFVVGIQTKHQLSMFETHSSQIVCIDATHCTNQYAFPLVTL FT LVRDEFKRGYPVAFLISNHADEQTITPFLEEIKRRCNNSVKVNAVMTDDDL FT SGWNAFTNVFGDVRHLLCKWHKKRAWRNKLPLVGPTNLQEEVYRILETIID FT EKNPDVFLSTMNGFVKAYEHKCPNFISYFVTY*" XX SQ Sequence 8390 BP; 3052 A; 1160 C; 1225 G; 2943 T; 10 other; tgaattttac gtwggggaga aatactcccg acggaaacat tagcccgtat tttgacctgc 60 ataacatttg tcgcatgtcg ctacgtcgtg aataataaag ttctgaagtt cctggtgtta 120 taaaagtaaa taaacttatt gtaaaattaa acaaagatat gtttaacaag cttagtgtca 180 agttatcatc tgttaatgct acagaaataa atcctaataa aggtagggtt tttagttttt 240 tgagttctta tgttatatcg atggagagct ctctttgaat aaaaacttag gagagaatat 300 ttacttttaa tgagacttac tctctctaga gagagtaagt ctcattaaaa gtaaatgtta 360 tgttatatcg atggagagct ctctttgaaa aaaaacttaa gagagaatat ttacttttaa 420 gagagtaagt ctcattaaaa gtaaatattc tctcttaagt tttttttttc atatgagagt 480 aattctcscc ttttcaaaat taaataatca aaaagaataa ctttattgat aaagtgaacm 540 tatttggcgg tggtctgtgg tgagagtaat tctccccttt ttaaagtttt tgttgtaaga 600 rtaagtttta atttkttcaa ttramarart araccactct taagtttttt ttttcaatga 660 gagtaattct ctctaaaaga taagtttatt aaaagtaata tgtttatgat agagagtctc 720 tttgtaatct taagagagag tatttacttt tataaattac ttcttgagta agttttattt 780 aaaagagtaa atctcctctc ttaagttttt tttttcatat gagagtaatt ctcccctttt 840 caaaattaaa taatcaaaaa gaataacttt attgataaag tgaatatatt tggcggtagt 900 ctgtggtgag agtaattctc cctattttaa agtttttgtt gtaagagtaa gttttacttt 960 tggttcaatt gacaaaattg acaaaacttg atcaagtttt atttaaaaga gtaaatctcc 1020 tctcttaagt tttttttttt cagatgagag taattcttcc cttttcaaaa ttaagttatc 1080 aaaaagaata attttattga taaagtgaac ctatttggcg gtgttctgtg atgagagtaa 1140 ttctcccctt tttaaagttt ttgttgtaag aataagtttt acttttttca agtttttttt 1200 aagagtaaat ctccactctt aagttttttt tttcaaatga aagtaattct cccctttttt 1260 ttaaattaaa taatcaaaaa gaataatttt attgataaag taaacctatt tggcagtgtt 1320 ctgtggtgag agtaattctc ccctttataa agtttttgtt gtaagaataa gttttacttt 1380 tttcaagttt taattaaaag agtaaatctc ctctcttatt tataagaatc ttctttttta 1440 aaggtttaaa ctagataagt tagattttaa aggaaatcaa tttacatcag ttttttttca 1500 gcattgtcat caatcaatcc atcaactagt ttaacagaaa ttcagttaaa gaaacctagg 1560 taaaatacct aatttatatc aataagtaac taaagtatat aaagtcctaa ctaaaagttg 1620 ttgtttttag ttttatgtgt tctgcttgta tgatgtcgtt tacaagaaaa gataacatga 1680 aaagacatgc attacaggca cataaaacca tttttagaga taatcaatcc aaagataaaa 1740 gaacgccatg tcattatcaa aactgcagcc aagtattttt tcaaaacgtc aaacttatta 1800 aacacataga ggagtgtcat gaaggtatat ttaccaaaga atcttacatt tttttatcag 1860 aatctgaatt tttagcatgg aaggaaaaag aggaacttaa tagttatgtt tttttcagta 1920 aacaaactgg tagttttttc actggaaaaa atataaataa agatgtaaaa acaagttatt 1980 atatttgtca aaatgatggt cattcaaaac cgcatcgctc tgttagtgaa ccagcaagaa 2040 aaactaataa aaggtactac agagggagtg ttaaatcagg tgctttctgc cctgccagga 2100 tgattgtcaa agtaaacata aatggtgtca ctaatgttca gtatatcaaa actcataacc 2160 acagtattgg tattacaaat accatgtatc aacctatacc aggtaatatt aaaaaaaaca 2220 tttccgctaa attatcaata ggtgtgcctg taaatacagt ttatcgtgat cttagagaaa 2280 gttttggaga cagacaaaat agaaatgatg aagatactgt tttaactaaa tcacatcttc 2340 ttacaaaaaa aaacatttct gatatcagca gagcagtaaa aaaaggatgt cggcttcacc 2400 ctgatgatag cgtttcaact tttttacttg ttcaaaagtt gaagtctgaa gattttaata 2460 gcattctcgt ttataagtca caaggacagc aaactgttat tggcccaaag gtgtatgatg 2520 acattgacct taataaggat tcctttgtgg ttggtataca aacaaagcat caactatcaa 2580 tgtttgaaac acattcatct caaattgttt gtattgatgc tacacattgt accaatcagt 2640 atgcattccc tcttgtaact ttactagttc gagatgaatt taaaaggggt tatccagtag 2700 cttttcttat ttccaatcat gcagatgaac aaaccataac accattttta gaggaaataa 2760 aaagaagatg caataattct gtaaaagtta atgctgttat gacagatgat gatttgtcag 2820 gatggaatgc ttttactaat gtttttggag atgtgcgcca cttgctatgc aaatggcata 2880 aaaaacgtgc atggcgcaat aaacttcctt tagttggtcc aactaattta caagaagaag 2940 tttataggat cttagagaca atcattgacg aaaaaaaccc tgatgttttt ttatcaacta 3000 tgaatggttt tgtaaaggca tatgaacata aatgtcctaa cttcattagt tactttgtta 3060 catactaagc tccacgacat ctaaaatgga cttcttgtta tcgcaacttt caacatgcag 3120 acactgatac taatatgtta tgtgaattat ttcataacaa attgaaaaca gtttatatgg 3180 aaagacgacc taataaaagg ttagacgatt taattaattt acttttaaca atcgaagaaa 3240 atagttattg gcaccacaaa agagacacta tttatcttgg aacaatatct ctcccaaaaa 3300 agtataaaaa aagacatgaa aaaggaatgt tgattttgat aaatgtatta gtaaatgttc 3360 attaactcag tacacagtta aaagtcaatc aaaagatgta acatacaatg ttgacattct 3420 aactgaagct tgtaaggaaa gtttgtgttt ggagtcatgt ataaatccag cctgtaatgg 3480 tttgtgtgag catttatata ggtgtagttg taatgacatt gtaatccttt acaagcatgt 3540 tcataagtta caatcatttc ttctgagatc tcaagaaaaa aactgcccaa aagaatcttt 3600 agctgttggt acaggtaata atcaagtgaa gcaaaaatct caaacattta ttaatgagtc 3660 tgaaaggttt caaaaaaata ttgatgaact atctacatta ttttctcatc tcatgataat 3720 aaatttgcaa cttccttcaa taaacagaca actggaagac atgattagac aagctgaagc 3780 catcaaaaat attaatttaa ttgagatacc tacagagcca acatcacaca ataacctttt 3840 aaattttgaa ccaaataaga agttggataa tcaatggcag cctaataaat ttaagagaac 3900 aagaaaagga aataggaaaa aggaaataaa atcatatcaa tatcctgcac aaaaccaaaa 3960 agaggagata gttctgaaat tgtctcagca acatgttgat ctaattcatg ataagcaaaa 4020 tacaagagta tgcgataaca ttgaaccaag tgatttcact aacactctca ttacgtaagc 4080 tttcttaaat acatattcaa atatttgtta ctaatcaatt tatctccaaa tattttagct 4140 tttaaacaag ttggtttatt caagttaatt ggatgtaatg gataatatat tacagagtaa 4200 ataagaaaat aaactatatt tccttcagta tattagttat atttgtgtgt aaaatataaa 4260 ttatttgcat atatgtgtgt gtgttgtata tatatatata tatatatata tatatatata 4320 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4380 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4440 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4500 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 4560 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatatg 4620 tatatatgta tgtatgtatg tatgtatgta tgtatgtatg tatgtatgta tttaggcatg 4680 gcgctcacca actatctttt atttcccttt taacattaga aaacattata ccatctgacg 4740 ttttgtttaa tttaaaaaaa agcactccaa gttttcaaaa aggctggctt caagatgaag 4800 taataggagg ttttctgcat tgcttagaaa aaaggtaccc tttaattaaa tatttaggtc 4860 caactgaagc attagctttc agctgttcaa gaagtttaag gcttttatgg aaagatgttg 4920 atactaaagt tatagaacaa gtttttattc attttaaccc aactaattct cattggattt 4980 taatacatct aaaagttata acaaaagaag taactctact tgatccactc cattgcaata 5040 ttatgccaaa taatttatca cacaataaat ccattgctgt tgctaaagac ctttttagac 5100 taaaatttaa cattttaaac ccaactgtta tttcagcccc aagtcattgt ttacagaagg 5160 atagttacaa ctgcggtgtt tacgtatgct attatgcatt acatctttgt gaaggtatcc 5220 ctatttttca ataatatatt gtataatata tattcatatg ttagtatttt ttttagttgt 5280 caattttgca actacttttt tgttatgttg cttataatta attcatatat aaattcattt 5340 atttattaac tatttcattt attgcataaa attatttaat gctgtaattg tccagtaaat 5400 tataattgtt aattgcttgc attacttata atttgcgact attcaaaaat gcatcataac 5460 ttatgatgta taaaaaaacc gtgttatgat tcataacacg attttttatt ttctgtattt 5520 atataaattc atttaaaatt gtgttttcag aaaaaaatat taatgatcca tttaacgaag 5580 tacaatttcg taaatatatt tacgagactt tatgcggtag gtgtttaaag aaccttgaaa 5640 gtggttttga acgttttagg aaatattttt gttcttatcc tttattattt tgttcgacga 5700 caatgaatta aatgcgattc ggttggcgaa aaacaaatat ataccgttgt tacatttaga 5760 aattaggaaa tataaaaaat atatgaaaaa gtctatgaaa tttgagatag agtaatagat 5820 gttgttgtag tgaatacagg aataaccttt ttacattata ttattacaga tttgcaagaa 5880 cacatccttt gagaagaact attgggtgca atgcacaaaa tgtccacagt ggtatcattg 5940 tgagtgtgtc catatttcaa tcaaagatgc agatatcagt gatactttcc aatgtcaatg 6000 atacaatcct atcaatttta aaaaacggct attttcatag ccacaaatcg ttttaaaaac 6060 attaaatgac atgaaaaata aataaattaa acataactag ttcaattcta ttctaattac 6120 tggaccgtag ccccaggcgc caaaattggg agggcgcaaa ttcaccaatt gaattaaaaa 6180 aatatgttag ttatttatgt ttttttcaac gccattttaa gtctttgcgc ccagggcaat 6240 tttcattttt gccccccgac cccctcctcg ttactccact aaaataaaca gacgcggcgt 6300 agtgctaaca cgcttgccta agaagaaaga attctgcgtt tcaaagcgac ctctggggaa 6360 gttttgtgac agtggttggg ataacttcct gtaaaatgct cctccgcgat gctctgtaat 6420 aagactctaa ggacttcttg aggcacctaa aataaaaaac ataggcaggc cctttcagag 6480 gtttagagag gcctgggcca gttataaatt ggaggcccca agatgacatt ttgcgtaaaa 6540 tgcaacacaa gttgatgcaa agccaattta agtcttcaga taaattggga ccatataagt 6600 ctgtaactat tacaataaaa acagattgtc aactaaagca attaaagaca gactgtcact 6660 catatacgag gccctttttg tttattttag caagccaaca agccgagaaa tttgcttgtc 6720 tactaataac acatttctta tggaggcctg ggacgttact tttcttagtg taattagaaa 6780 tttcgaatta ttattttgat gtttaaaaaa tatattttga aaatgtgagg ccccaacata 6840 atcgaaggcc taggccatat gacccatgtg gctaccctct cgtagggcct gcacatagtt 6900 agattaatct acattgcaat gcatgtagtt gttgttatta ttataagtgt agtcataaat 6960 atttacatac aatgagtagt aggtaaattc agcgacccct ctaaaaacaa caagttttat 7020 aaaaatttgc tctaaatttt taatccaacc aggagcctag tcctctttga aaaaaaaggt 7080 ctagtggact tttaagttca agtagtttgt tctgacgcct tattttataa aggtgaccat 7140 ggggcaaaga tttttgtaga tacatattat caactatatc aaaacaatgt attagattat 7200 aaatttggat tggatcgcat cgtaagtgcc aagactcaag acatgtgatg gtaaatttaa 7260 ctttttaatc ttacccagta taaagttgtc tcacgctgga agctcatttt gcgttaaaca 7320 gattcaacaa aggctatatt agtttttaaa taagaagccc aaacagacac tgcatattca 7380 aggtgcagtt ttacaaacac tttatttaac tagccatatt tccatcaatg aaaatgaatg 7440 ccttctttaa cataccaaaa tttttaagaa atttgtgaat gaatatgtgt gctccacgtg 7500 tggttagatg taaaaacaaa tcccaaatca cgctctttag tatagcttct taagcattca 7560 ttcatacctt agttattttt taaaagtaac agctttgatc atagtgaata aaaaagcatt 7620 ttaaaatctt taacttaagc aaccaagtgt tttcatttac aaagcgtaga caatccagca 7680 ggtctgcagc gaaattagtg tgcgataaaa ttttagaaaa aaagtttagt atctttttta 7740 tccgtaaaaa tttacatact atgtaatgaa taaagcaagc agcgattatt tttaaatagc 7800 ctgtttacta agccacctaa tttctttcag tttttttaac aacgtgttgt gttaaatatt 7860 ggggagagtg gccccctaaa gtaggccccc ttttaaaaat gatataatca actagctgcg 7920 cacaaaaaaa actacaaatg atatgttatt caggatattt aatctatgct ctacaatttt 7980 ttatctaaaa attaaagatg catacaacaa ataaacaaaa acgttttcct aaccaaaaaa 8040 taatctcttt caaaaagtaa aataaaaaac taaacaaata attcaaaaat atgattatat 8100 gaaaaaatag agaaaaaaag ttttttgtat ttttttttaa tataaaaaaa tctcctaagt 8160 ataaatactt tgtaatataa agtataataa atacttaaaa atagaagtta tttataatat 8220 ggagattttg taaattaaaa aatatatttt gaaaaatatt ttaaataaat tacgcatgtt 8280 gtttacgaaa aataactcat gtaaaattat cacagattta tacgtaatgc gggatgcgga 8340 tacgggtttt ttataattca ataggagtat ttctccccta cactcgattc 8390 // ID Gypsy-8_SI-LTR repbase; DNA; INV; 224 BP. XX AC AEAQ01018735; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_SI_; KW Gypsy-8_SI-I; Gypsy-8_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-224 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01018735; Positions 1686 1463. XX SQ Sequence 224 BP; 54 A; 41 C; 55 G; 74 T; 0 other; tgttatatcc ttgcgtgcga ttggtcgcgt gtctagtttc cggtacgcgg cgtaagctgt 60 aaaagggtgg tagccctgct tttcgagaga gcgctcgctc tcttgggttt tgtgtgcgat 120 tgggacatgg cgtgttttct gtaccacgat tacaaaaata tacaatttaa taagaacgag 180 aataaggtac ctctcatttt tactttcata gtaagattat aaca 224 // ID Gypsy-23_AA-LTR repbase; DNA; INV; 107 BP. XX AC supercont1.118; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_AA_; KW Gypsy-23_AA-I; Gypsy-23_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-107 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.118; Positions 63223 63329. XX SQ Sequence 107 BP; 38 A; 12 C; 22 G; 35 T; 0 other; tgtgctgtta tattatgtca gtaggctatg gaaatgtgtt aggaataata aatcagatta 60 ttgtatgcat tcaaagaaca cggaagtatt tactacaagg tatcaca 107 // ID Polinton-1_SM repbase; DNA; INV; 14867 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Autonomous Polinton from Schmidtea mediterranea - consensus. XX KW Polinton; DNA transposon; Transposable Element; Polinton-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-14867 RA Kapitonov V.V. and Jurka J.; RT "A model Polinton family from the freshwater planarian genome."; RL Repbase Reports 8(12), 2243-2243 (2008). XX DR [1] (Consensus) XX CC The planarian genome contains many families of Polintons (several CC thousand copies). Polinton-1_SM was reconstructed as a consensus CC sequence built from ~100 copies, which are ~98.5% identical to CC the consensus. The genome contains several hundred copies of CC Polinton-1_SM. This transposon is characterized by 6-bp TSDs and CC canonical TIRs. In additions to the standard ATP, INT, POLB, PX, CC PY, PW, and PRO proteins present in diverse animal Polintons, CC numerous families of planarian Polintons code for the TBPpol CC protein similar (Evalue=1.0e-10) to a TATA-binding TBP CC transcription factor present in archaea and eukaryotes. TBPs bind CC promoters and induce transcription. Given that TBPpol is present CC in diverse families of Polintons, it is important for the CC transposition of these transposons, probably by regulation of the CC Polinton transcription. Both order and orientation of all CC proteins in planarian Polintons are the same as in Polinton-1_SM. CC Unusual features of Polinton-1_SM: its integrase is encoded by CC two exons; protease does not contain the methionine at its CC N-terminus; it contains a 45%-identical copy of PX (PXa-4_SM). CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1715..1284 FT /product="TBPpol-1_SM" FT /note="TBP transcription DNA-binding factor." FT /translation="MLSNINYKGRYSIPDMKFPFGKPQQVVDRSGKYPIIF FT FRSGKCRVMGCKQPLKTGDLKYNIHDLQIQSITVTIDIGKFINLYELQRKV FT NCIFEPEIFPALRLQKYNPSCVNIFSSGKVVILGIKSLDYNDYVENIKSEI FT TMLIN" FT CDS 7813..8187 FT /product="PRO-1_SM" FT /note="adenoviral cycteine protease." FT /translation="EILLIIYFLIIISSIQIDSIGRKLLGKDFLGVFPLDK FT IPYIDVNKALIVNTQSSNLSGEHWLAIYVKPNKINVFDPMGFYYPPILINK FT LESMLMPIEYNRIRYQNPMTKTCGQHCLAWLKTMKNK" FT CDS 6541..7764 FT /product="PY-1_SM" FT /note="PY protein from animal Polintons." FT /translation="MISAKSELCLFDRPSPQAVIEHGSFEDIFPMNSITDA FT GTSSIQFYINGSQTEYLDLNDTLLYIQLKVVNSLGENITQEADVTANNFFF FT QTLFKDAELVFNSTKIEGANDSFAHKALIETIINYNQDTKHTSLGGMGYTE FT HDLMRKTWIAKSKAFSLCSPLQFDFFDQPKYLLPGVSVQIKLTRTNPEKSL FT TCTKFVPKIVLLDAKLMVRRVRVEPSVLAGHQIGLATRNAIYPMRSKEVVE FT FSLPIGSSSFYKEQIFGDRRLPNFILVTFQGNKRFSGSFLDTFTMFDHFNV FT KSITLSKGSDYREKYSQDFENDNYSTSYMQSIVRNMGYLNKNLNCGISMSD FT FRNKYPFFTFVLAPDFDLNQSQLPQSGNLRLDIKFGSPLTESVTVIVYGVF FT EKEIQITSNRTVLV" FT CDS 6146..6541 FT /product="PW-1_SM" FT /note="PW protein from animal Polintons." FT /translation="MSNPDIVNFYSSSQSGGELPYFIGKQYGTGWLKTIGR FT FALPILRRIGNFGMKTANDVINNNSDFLPALKSNAISEIKESLPSVMQTLP FT SVINKVSDVFNHENEQKGGTLKRRKIMKKHINKRMRGFGTIFEK" FT CDS 1792..2412 FT /product="ATP-1_SM" FT /note="Polinton ATPase." FT /translation="MELRNDAVYQVVGPSGCGKTLFVCNLLQSNLFKSKFN FT KIYWHRGADEEHGLTQNQFCKLKMTIIKGFDKNWSSRLHQGDVIIIDDLYQ FT EANKEKDFNNLFTKIARHIGVTVIFITQNLFHQGGGHRTRNLNVQYLVIFK FT NPRDATIIDFLARQAYPNDRKFLIDSFQDATKVPHGYLFFDFTQQCPDELR FT IRTDIFNEKGVTIYKQN" FT CDS 4113..5141 FT /product="PX-1_SM" FT /note="PX protein from Polintons." FT /translation="MKENQFYMILPSNSCPLIFPNNEASNFNVEFQNPIYL FT DGEWEVALIDFTFVYKTFPFYSNSYLSYETMKSGTEIKYLKINNQNETYEL FT INSKHTPDVDVFIEKYENLVIVCPHVPFKVTFPDIQHAFSFGLKSISVINE FT DDHCRLKLPKFTDDGVMEQFGIIISYEEKTIFEKYFEDNLLINTLSELQGH FT LMSFIPSPFKELKIENNELVKIKLADNISKIYFDNNLIKTLGLNKQTFSDK FT DIVGNKVPRLLHHHQQMFIYSNIIEPITVGDVRVPLLKSIWVKNYEQGEIV FT QLEMKNPMYLPISTSTINNIEINIRDDSGKLIKFSKDTKTHLSIHFRKINE FT " FT CDS 5137..6150 FT /product="PXa-1_SM" FT /note="PX protein from animal Polintons." FT /translation="MNKEFYINLPSNSCLLTHPNNKADKFNVEFQNPIYLE FT GEWQVALTDFTFVYNNFPFYNNSNIKYSKSTLHTSYEYFYYNDAKNEYTIS FT HDSIFYLVYIDGNTLNINCKYFPFKISFDSIAEAQKLGFNKKEINFVENLN FT QIKIPSNNRKIKFDTIRLTINYFYDVEYTDKFEENLIITNLSQINDQLMTF FT YPHPFKSCEIEQGRLVKIELEDDVSKVKFSKFLAQSLGLSEESYSEKVIVG FT LRLPEILQSNQQMMIYTNIIEPILVGDVLIPLLKCVWVEKHKKDDLVQIIE FT KNPMYLSVSSTYINNIEINIRDDSGKYIFGKEDKTFLTLHFRKINE" FT CDS join(3000..3429,3473..4104) FT /product="INT-1_SM" FT /note="Polinton integrase." FT /translation="MKETLTKNRNTVKKLEKVWTDPKHPAGFGGVRKLAKE FT VVNSKKETQNWLSGQMAYSLNKPMLKKFKTRSYKTMGVDDLWQMDLMEMIP FT FSKINKGYKYILTCVDIFSRFARAIPTKTKSANEMAEAIKELFIDGKPRNL FT QTDLGKEFYNSKVREIIKGINHYSVFSQFKAAHVERFNRTLRDRLKKYFVY FT KGNKTWINVLQKAIYSYNYSPHRGLNGMRPIDVYSTNNMDNWISQQSDSII FT KKPKYKVGNHVRISKISASPFIKNFDNNWSDEVFQISEIDTNQNPVMYFIK FT DEENNQVSGKFYEQELQVIEAPTVFRIQKILKTKNVGKYKQYYVKWHGYET FT PTWISENQLIK" FT CDS 9417..13379 FT /product="POLB-1_SM" FT /note="DNA polymerase B." FT /translation="MIYFNDFMIIYLNFSFIITGNLYPNFCNSPQHDGNCD FT TIMENYYSHIYNHITYFICPTCKTTHKMNYTDFYLNHKEHLTQFNCKRKYI FT FKLPLSDSNNNNINNNNNNNNNNNNNNNNNNNNNNNNNNNNNIITKKKRKT FT NNNNNPRSSKVPKIIETIILSDSDESKMVDDVIIISDSDDVFPVDIPSVIV FT NQYGGSIAPIVNFDLNQSHAIVRMDVSNLNLKLITVPTQLINKLNNVINQM FT GSVKIQFCIDVEFEKDGEFQSFIISNSARIMNDSFLDEGATLLDEKITLIN FT DKGSNWVINKIKEIHFVLTKFSQINRLSGHGFIPTPLSLLNKHCTVNVANS FT DNLCFIYSILAILNHENIGNHHRNETFSYNTSELDFSPSDLPMKLCDIPKF FT EAKNPNLSINVLTYDENNLINDFSNPEVFKHPYVEIIHRSKVVGGQPINLI FT LLESLDSYHYIAVTKLDRLLNYRNHSFTDTRVQSIWCHRCLNGFRNIISLN FT KHKGLCDKNQLGTTLYTLPLNKNLEFKNWAKTVTPPFVLYADFEAILPSDD FT IHFQIHMPISAGLLLINNFTGEKSYKSFLGDDCVVNFLQEINNISINTVLP FT FYQNNYKKKIILTDSDILSFSSATCCYLCDKILSVPVKDHDHFTGKYLGAA FT CSKCNISRQIRFEFPVVFHNLRGYDLHHILKYGMSDFNTWKISCIPVTSEK FT FLSLSARISGLTVKFIDSCQFINSSLSNAVKTLTTLPLTRSEFVGSIIDTK FT GIFPYNFATSQEVLETTYELPPIWEDVTEAEYSKAQSIWEETNCNSLLDYM FT MVYLKLDVFLLADFFQQFRSKSLAHNRLEPLNFYGIPGMSWASALMTLDEP FT IELLTDMEMYNFFEGGIRGGLTFVNKHHVVSDDNTELLYIDINNLYGWALS FT QPLPYGNFEWVTDTNQLNNLISFLNKDLESLSFAVTFEVDLEIPTFLHDKL FT NDFPVAAEKMCAPGSKVEKLMLTHYPKKNYVLHWRLLQLFISLQIKITHVH FT RAIKYNQNSIFKKYIDTNTQLRAQSTNDLDKNYYKLLNNSLYGKTVENLKK FT RINLRLCNSANKLINYCSKPTFRKSFKIDNDLVSVLLNKESVCLDRPSYIG FT QTVLDLSKLRMYQLQYLELEKYRNEFTCEINIVAGDTDSFFLEIKNCKLDI FT LLSAMVRDNLLDTSNYDPKHPLYSRNLDSVIGKFKDENKGLKYKEWVFLRP FT KCYSLLSEVESIKAKGITLKGTDLKHKSYLDCYNKNLIFSVPQTRITSKKH FT QLFTIKNSKTALTNKDDKRVWVGKNISVAYGHYRGGDVVLGNSPEEFSEFN FT EEYDDYEEF" XX SQ Sequence 14867 BP; 5647 A; 1884 C; 2008 G; 5328 T; 0 other; agtagtgaca tatgaggact atgccccgtg gatatttggc ttgaaaaaaa accgagaccc 60 ccccctaaaa tttttttcag ggggaggggg ggtttaatta tttttttaaa attatttgtt 120 ttaatgaatg aaaaaatttt ttttaattat ttgttttaat gaacgaaaat ttttttttat 180 ttgtttaaat gttcaaattt aaaattttaa aaaaattttt aattcatgat aaaaaagtct 240 aagtattttt tctcgattaa aaagcctaaa aaataaaaat tggcttaaat cgcctatgat 300 tggtcgtact gaggagtatt ttcgaataaa aatatcgaat aaaaagccta aatatttttt 360 catacttaaa aacccataaa aataaaattg gattaaatcg cctgtcgact aaaaatcatg 420 aatttttttt atttttattt actttaatta tttgtttaag tcaataatat tatttattta 480 tattttaatt atattattat taattttaat caagtcaatt attttattta tttttaaatt 540 atattattat taaattgttt gttttaatga agtgaattaa ataattattt tttattttca 600 tttaattatt tatttaatat tattattttt attaatttaa ttatttattt tatattatta 660 ttattttatt ttattattat taacattttt tttttatttt tatttttaaa atttaattcc 720 aagtctaaga ataatttttc aatgtaaaaa agataaatca tcgataaatt ttgaattgta 780 aaaataacaa actttcaatc catacaaata tttttattga caataaagat taaataaaaa 840 agattaattt ttatttcaaa tcaaaaatat ccgagtaata tgctttacgc atttcggcca 900 acaatatatg gattgtttgg tcacgtttga aatcgtggca acatggacat ccatttgaac 960 atgttactct catactatct ttacaaacaa tgcaatttgc aacacatcgg tcaccagatc 1020 cctgaacgtt gtcatagtga atctcgagaa gttcttcaat gttaatccct gtgaaatggt 1080 aaatttttgt tgcgagaaac tctggtgcgc aataatccaa cgtattttta attcttggat 1140 aaagttcctg aattttcttt aaaattactt caaagagata aactaaataa taaaataaaa 1200 tacatattta aatatacatg atttaccttt tgacattttt taatacttaa taaatgaaac 1260 aaattcaaac aaaatattta ttaatttatt aacatagtta tttctgattt aatattttcc 1320 acataatcat tataatccaa ggattttatt ccgagtatta ctactttccc acttgaaaat 1380 atattgacac atgatggatt atatttttga agacgtaatg ctggaaatat ttctggttca 1440 aatatacaat tcacctttct ttgtaattca tatagattta taaattttcc aatatcgatt 1500 gttacagtga ttgattgaat ttgtaaatca tgtatattat atttcaaatc cccagttttt 1560 agaggttgct tacatcccat tacccgacat ttccccgatc tgaaaaatat tatcggatac 1620 ttacctgatc gatctacaac ttgttgtggt tttccaaatg ggaatttcat atcaggtatt 1680 gaatatcttc ctttataatt tatgttactc aacattttca aataatttca attcaagtaa 1740 ttctaaaaat taaaacaaat ataaacaatt aaataaataa ttaataataa aatggaatta 1800 agaaatgatg ctgtatatca agtagttgga ccaagtggtt gtggaaagac tttatttgta 1860 tgcaatttat tacaatcgaa cttgttcaaa tcaaaattca acaaaatata ttggcataga 1920 ggagctgacg aagaacatgg acttactcaa aatcaattct gtaaattgaa aatgacaata 1980 attaaaggat ttgataaaaa ttggtcatct agattacatc aaggcgatgt tattattatc 2040 gatgatctat accaggaagc taataaagaa aaggatttta ataatttatt tacaaaaatt 2100 gctcgacata ttggagttac tgttattttc attactcaaa acttgtttca tcaagggggc 2160 ggtcatagaa ctcgtaattt aaatgtccaa tacttggtta ttttcaaaaa ccctcgagat 2220 gcaaccatta tagattttct tgctagacaa gcttatccca atgatagaaa atttttaatt 2280 gattcgtttc aagatgcaac aaaggttccc cacggatatt tgttctttga ctttacacaa 2340 caatgtcctg atgaattacg tattagaact gacattttca atgagaaagg tgttacgatt 2400 tataaacaaa attgaaatct gaaaataatt acttgaaatg acgtttaaaa aagttattgt 2460 tcaacgtgaa aaatccacaa agaaaaaagt tacaaagaaa agggttacta aacgtaaaat 2520 aataaaaacg agtaagaaga ttgttaaaga taaaaagtgg attaaaatat aaacaaaaca 2580 actagatgaa aaatgtattt aatatgaaag ttgaacgtag tctaccatac tttaatgcct 2640 tgatgaaggc tcctaataat aaaagaatga atattttaca gaattttcct tcatttgtta 2700 tagatgatct agttgaagtt attgttaatg tagttcgtgg taatgtaaac attactaaag 2760 ttaaaaaact tgtattacaa aagcataaaa gatcactttt atctttagta aacactaaga 2820 atcgtcgtct gatgcgaaaa ataatatata aacaaaatgg tggatttttg ggggcattaa 2880 ttccaatagt attatctgca ataagcgcaa tttcatcatt aacttcttaa tgggtaccaa 2940 gaattcatta ccggataaac ataatttaga tttaaatatt ttgacaactt gttgtaataa 3000 tgaaggaaac attaaccaaa aatcgaaaca cagtgaaaaa gctcgaaaaa gtatggacag 3060 acccaaaaca tcccgctggt tttggtggtg taagaaaact ggcaaaggaa gtagttaatt 3120 caaagaaaga aactcagaat tggttatctg ggcaaatggc gtacagttta aataaaccaa 3180 tgttgaagaa attcaaaacc cgtagttata aaacaatggg agttgatgat ttatggcaaa 3240 tggatttgat ggaaatgata ccattctcta aaattaataa gggatacaaa tatattttga 3300 catgtgttga tatatttagt cgattcgctc gtgctattcc aacaaaaacc aaatcggcca 3360 atgaaatggc tgaagcaatc aaggaattat ttattgatgg caagcctcgt aatttacaaa 3420 ctgatttggg tatgcaatta aataaataaa ttaaattaaa tttattcttt aggtaaagaa 3480 ttttataata gtaaagttag agaaattatc aaaggaataa atcattactc tgtattttct 3540 caatttaaag ctgcccacgt ggaaagattt aatcgaactc taagagatcg cttgaaaaaa 3600 tatttcgtat ataaaggaaa taaaacctgg ataaatgttt tacagaaagc aatttatagc 3660 tacaattatt cgccacatag gggtttaaat ggaatgcgac caattgatgt ttattcaacg 3720 aataatatgg ataattggat atcacaacaa tctgattcta tcattaaaaa gcctaaatat 3780 aaagtgggta atcacgttag aataagtaaa atcagtgctt ctccattcat caagaatttc 3840 gacaacaatt ggagcgatga agtatttcaa atctcagaaa ttgatacgaa tcagaatcct 3900 gtaatgtact ttatcaaaga tgaggaaaac aatcaagtta gtggaaaatt ctacgaacag 3960 gaactccaag ttatagaagc ccctactgta ttcagaattc aaaagatatt aaaaaccaag 4020 aatgttggta aatataaaca atactatgtt aaatggcatg gatatgaaac tccaacttgg 4080 atatctgaaa atcaattaat taaataaata ttatgaaaga aaaccagttt tatatgattt 4140 tacctagtaa tagttgtcct cttatttttc ctaataatga agctagtaat tttaatgtcg 4200 aatttcaaaa tcctatttac ttggatggtg agtgggaagt tgctttgatt gattttactt 4260 ttgtgtataa aacatttccg ttttatagta attcttacct aagttatgaa acgatgaaga 4320 gtggtactga aataaaatat ttgaaaataa ataatcaaaa tgaaacttat gaattaataa 4380 attctaaaca tacccctgat gtcgatgttt ttattgaaaa atatgagaac ttggttattg 4440 tctgtccaca tgttccattt aaagtaacat tccctgacat tcaacatgcg tttagttttg 4500 gattaaaatc tatatccgtc attaatgaag atgatcattg tcgattaaaa ttaccaaaat 4560 ttacagatga tggtgttatg gaacaattcg ggataattat aagttatgaa gaaaaaacaa 4620 tatttgaaaa atattttgaa gataatttac tcataaacac attatctgaa ttacaaggac 4680 atttaatgtc atttattccg tctccgttta aagaactaaa aattgaaaac aatgaattgg 4740 ttaaaataaa acttgcagat aacatttcaa aaatatattt tgacaacaat ttaatcaaaa 4800 ccttaggatt aaataaacaa acttttagtg ataaagacat cgttggtaat aaagttccaa 4860 gattattaca tcatcatcaa caaatgttca tttattcgaa tattattgaa ccaataactg 4920 ttggagatgt tagagttcct ctattaaaat ctatttgggt taaaaactat gaacaagggg 4980 aaattgtaca attagaaatg aagaatccaa tgtatttacc aatttcaact tcaactataa 5040 acaacattga aattaatatt cgcgatgata gtggaaaatt aattaaattt agtaaagata 5100 ccaaaacaca tttatcaata cattttagaa aaataaatga ataaagaatt ttacataaat 5160 ctaccgagta atagttgtct tcttactcat ccaaataata aagctgataa atttaatgtt 5220 gaatttcaaa atccaattta cttggaaggt gaatggcaag ttgctctgac cgatttcact 5280 tttgtataca ataattttcc attttataac aactcaaata ttaaatatag taagagtaca 5340 ttacatacaa gttatgaata tttttactat aacgatgcaa agaatgaata tacaattagt 5400 cacgactcta tattttatct agtatacatt gatggaaata ctttaaatat taattgtaaa 5460 tattttccat ttaaaatttc atttgacagt attgcagaag ctcaaaaatt aggatttaat 5520 aaaaaagaaa taaattttgt tgaaaacttg aatcaaatta aaattccttc aaataatcgt 5580 aaaataaagt ttgatacaat tcgattaact ataaattatt tttatgatgt cgagtataca 5640 gataaatttg aagaaaattt aatcattact aatttatcac aaataaatga tcaattaatg 5700 actttttatc cgcatccatt taaatcttgc gaaattgaac aaggaagatt agtcaaaatt 5760 gaattggaag atgatgtcag taaagttaaa tttagtaaat tcttggctca aagtttggga 5820 ttaagtgaag aatcatattc agaaaaagtt attgttggat tacgtttacc agaaattttg 5880 caaagtaatc aacaaatgat gatctataca aatatcattg aacctattct tgttggtgat 5940 gttttgatac ctctattgaa atgtgtatgg gttgaaaaac ataaaaagga tgatctagtt 6000 caaattatag agaaaaaccc aatgtatttg tctgtatcgt caacatatat aaacaatata 6060 gaaatcaata ttcgtgacga tagtggaaaa tatatatttg gtaaagaaga taaaactttt 6120 cttacattac attttcgtaa aataaatgag taatcctgac attgtaaatt tttatagtag 6180 tagtcaatct ggtggagaat taccatattt tataggtaaa caatatggaa ccggttggtt 6240 gaaaacaata ggtagatttg cattacctat tcttagacgt ataggcaatt tcggtatgaa 6300 aacggcaaat gacgtaataa ataataattc tgatttttta cctgctttaa aatcaaatgc 6360 aatatctgaa attaaagaat ctttacctag tgttatgcaa actttaccta gtgttattaa 6420 taaagtaagt gatgtattta atcatgaaaa tgaacaaaaa ggagggacac ttaaacgtag 6480 aaagattatg aagaaacata taaacaaaag aatgagagga ttcggaacta tatttgaaaa 6540 atgattagtg caaaatctga attatgtctt tttgatagac catcccctca agccgtgatt 6600 gaacatggtt catttgaaga tattttccca atgaattcaa taactgatgc tggtacctct 6660 tcgattcaat tttacattaa cggatcacaa acagaatatt tagatttgaa cgatactttg 6720 ttatatattc agttgaaagt tgtaaatagc ttaggtgaaa atattacaca agaagctgac 6780 gtaacagcaa acaatttctt ttttcaaact ctttttaaag acgctgaatt agtttttaat 6840 tctactaaaa ttgaaggagc aaatgatagt tttgcacaca aagccctaat tgaaactatt 6900 ataaattata atcaagatac taaacataca tctttagggg gaatgggtta cactgaacat 6960 gatcttatgc gaaaaacatg gattgcaaaa tcaaaagcat tttcattatg ttcaccactc 7020 caatttgatt tctttgatca acccaaatat ttgttaccag gtgttagtgt ccaaattaaa 7080 ttaacaagaa caaatccaga aaaaagtctg acatgtacta aatttgtacc taaaatagta 7140 ttgctagatg caaaattaat ggttagaaga gttcgtgttg aaccttctgt tttagcgggt 7200 catcaaatag gtcttgctac aagaaatgca atttatccaa tgcgaagcaa agaagttgtt 7260 gaattttctt tacctattgg atcttcttct ttttataagg aacaaatatt tggtgataga 7320 cgattaccta attttatcct tgtaactttc cagggaaata aaagatttag tggttcgttt 7380 ttagatacat ttactatgtt tgatcacttt aacgttaagt caataacttt aagtaaagga 7440 agtgattatc gtgaaaaata tagtcaagat tttgaaaatg ataattatag tacaagttat 7500 atgcaaagta ttgttaggaa tatgggatat cttaataaaa atttgaattg tggtatcagt 7560 atgtcagact ttagaaataa gtatccattc tttacattcg tattggcgcc agatttcgat 7620 cttaatcaaa gtcagttacc acaaagtgga aatttaagat tagatataaa gtttggatca 7680 cccctaactg aatctgttac tgtaattgta tatggagttt ttgaaaagga aattcaaatt 7740 acttccaatc gaacagtatt agtttgaact ataaaccaga ttgagatttt ttaattgtca 7800 agtatgaagt aagaaatttt attaattata tattttttaa ttattattag ttctattcag 7860 attgattcta ttggtagaaa attattggga aaagattttc ttggagtttt ccctctagat 7920 aaaattccat atatcgatgt taacaaagct ttgattgtaa atactcaatc atctaatctc 7980 agtggcgaac attggcttgc aatttatgtg aaaccaaata aaataaatgt atttgatcca 8040 atgggtttct attatccccc tatattaatt aataaactag aatcaatgtt aatgccaatt 8100 gaatacaatc gaattcgtta tcaaaatccg atgacaaaaa catgtggaca gcattgctta 8160 gcttggttaa aaacaatgaa aaataaatga atttattgaa taaagtttct tttcaaaatt 8220 gttttttaat tataaatgta attaaagatg aattatgtaa tacgtattac ttttaaagga 8280 ttaagttcag aggattatat gtatttaaat gtttttgatg ttatgtaatt aagtgtgttt 8340 tataaactaa attttaattg gtcgatcatg taaccactta catgtttgtt aatatataac 8400 tgaatgcatt ttttaaaaaa ttcactttta tcaaatacca aactttagta agttttttat 8460 catattttta attattttta tttttagaaa atggaatcca tcaataatat aattaaagat 8520 ttaaaatcat taataaaaat tatcaacaaa aatgttcaaa atgttaaaac tccgagtcgc 8580 aaaaataaaa cagtaaaatg taagtactat taaattttta ctatcatatc ataattataa 8640 aatgataatt aattatattt gaattaaatt taatattaac attttattaa tttaaacaca 8700 ataaattcaa aataatatga taaaaaataa aaaactagga cgaataaagt taaataaatt 8760 ttatttaaat acatttggct tcaataattt tccaaattgg tcgaatagtc tggtttcctt 8820 ctttatatcc ttgaattctt attttaatca taaaatgttc attttctttt ttattttcat 8880 aattacattc tttttcacaa ttcataaaag tgaattgttc attaaaacta atatctttgt 8940 tgatggctcg taataactct gttatatttt ttcgatcaat ttctccaatt tcaccaactt 9000 tcatattata taggctttgt ttcaatgttg ttggttccta ttaaaattta ttaataatat 9060 tacttttaat gaaaatactt actctaaatg aatagatgat tggaaatttt aaatattcca 9120 ttgaatattg taaatacatt actggtgaat ttctgcttaa actgatagca tttacggaat 9180 tttcctcaag aattaacttg acagcttctt ttctaaccga ggtttctata aaaacaatta 9240 atgaataatt acataatatg aaaaccattt acttttattc attttatgat tttaaactat 9300 tttaatctac ttatataaga ttttcaaaat aaaattaacc aatcaaattc atttattcat 9360 tgtgtgtaaa attaatttaa tgtttatatt ttaatgattt attttaatga tttttaatga 9420 tttattttaa tgattttatg attatatatt taaatttttc ttttattata acaggtaacc 9480 tttacccaaa tttttgcaat tctccacaac acgatggaaa ttgtgatacg attatggaga 9540 attattattc tcatatctat aaccacatca cctattttat ttgtcccacg tgtaaaacca 9600 cccataaaat gaattataca gacttttatt taaaccataa ggaacattta acacagttta 9660 actgtaagcg aaaatatatt tttaaattac cattgagtga tagtaataat aataatatta 9720 ataataataa taataataat aataataata ataataataa taataataat aataataata 9780 ataataataa taataataat aataatatta ttacaaaaaa gaaaagaaaa acaaataata 9840 ataataatcc acgctcttcg aaagttccta aaattataga aactatcata ctttctgact 9900 ctgacgaatc aaagatggtt gatgatgtaa taatcatttc tgattcagat gatgtttttc 9960 ctgttgatat tccttctgtt attgttaatc aatatggtgg ctcgatagca cctattgtta 10020 attttgattt aaatcaatca catgccattg ttagaatgga tgtttcgaat ttaaatttaa 10080 agttaattac agtacccaca cagttaatta ataaattaaa taatgttatc aatcaaatgg 10140 gatctgttaa gattcaattt tgtattgatg tagagtttga aaaagatgga gaatttcaat 10200 ctttcatcat ttcaaactct gccagaataa tgaatgattc atttttagat gaaggtgcta 10260 ctcttcttga cgaaaaaata acattaatta atgataaagg tagtaactgg gttataaata 10320 aaataaaaga aattcatttt gttcttacca agttttcaca aataaatcgg ttgagtggtc 10380 atggatttat tccaacacca ttatcattat taaataaaca ttgtactgta aatgttgcta 10440 attccgataa tttatgcttc atttacagta ttttagctat tttaaatcat gaaaatatag 10500 gtaaccatca tagaaatgaa acctttagct acaacacatc tgagcttgac ttttcacctt 10560 cggatttacc tatgaaactt tgtgacattc caaaatttga agctaaaaat ccaaatcttt 10620 caattaatgt tcttacgtat gatgaaaata atttaattaa tgatttttct aaccctgaag 10680 ttttcaaaca cccttatgtc gaaataattc atcgttctaa agttgttggg ggacaaccta 10740 ttaatttaat tttattagaa agccttgatt cctatcatta tatcgctgtt actaaattag 10800 ataggctttt aaattatagg aatcattctt ttactgatac tagagttcaa agtatatggt 10860 gccatcgatg tctaaatggt tttcgcaata taatttcatt aaataaacat aaagggttgt 10920 gtgataaaaa tcaattaggc acaactcttt atactttacc tttaaataaa aacttggaat 10980 ttaagaattg ggctaaaact gttactcctc ctttcgtcct ttacgctgat tttgaagcta 11040 tcttaccttc agatgacata cattttcaaa tacatatgcc gatttcagcc ggtttacttt 11100 taattaacaa ttttacaggt gaaaaatcat ataagagttt cctgggtgat gattgtgtag 11160 ttaattttct acaagaaata aataatattt caatcaatac tgttttaccc ttttatcaaa 11220 ataattataa aaagaaaata atattaacag actcagacat tctgtcgttt tcttcagcaa 11280 cttgttgtta tttatgtgat aagattttat cagtaccggt taaggaccat gatcacttta 11340 ctggaaaata tttgggagca gcttgttcca aatgtaatat ttctcgacaa ataaggtttg 11400 aatttccagt ggtttttcat aatttgagag gttatgatct tcaccatatc ctcaagtatg 11460 gtatgagtga ttttaacacg tggaaaataa gttgtattcc tgtaacaagt gaaaagtttc 11520 tttctttgtc ggcaagaatc agtggattaa ctgtaaaatt cattgattct tgtcaattta 11580 ttaatagttc attaagtaac gcagttaaaa cattgaccac tcttccttta actcgttcgg 11640 aatttgtagg tagtattatc gatactaaag gcattttccc ttataacttt gcaacttctc 11700 aagaagtttt agaaactaca tatgaacttc caccgatttg ggaagatgta actgaagccg 11760 aatatagtaa ggctcaatct atttgggaag aaaccaattg taactcttta ctagattata 11820 tgatggttta cctcaaacta gatgtttttc ttttggctga ctttttccaa caatttcgtt 11880 caaaatcatt agctcataac cgattggaac cacttaactt ttatggaatt cccggtatgt 11940 cttgggcatc ggctcttatg actcttgacg aaccaattga acttctaact gatatggaga 12000 tgtataattt tttcgaaggg ggaatccgtg gtggattaac atttgttaat aaacatcacg 12060 tagtatctga tgacaacacc gaactgcttt acattgacat caataatctc tacggttggg 12120 ctttaagtca accattacca tatggaaatt ttgaatgggt tactgatact aatcaattaa 12180 ataatttaat tagttttttg aataaggatc ttgaatcact gtcatttgct gtcacatttg 12240 aagtagattt agaaatccca acatttcttc atgataaact taatgatttt ccagtagctg 12300 ctgaaaagat gtgcgctccg ggttctaagg ttgagaagtt aatgctaacg cattatccaa 12360 agaagaatta tgttcttcat tggagacttt tgcagctttt tatttcttta caaattaaaa 12420 ttacacatgt acatagagca attaaatata atcaaaattc aatatttaaa aagtatatag 12480 acactaacac acaacttaga gcgcaatcaa ccaatgactt agataaaaac tattataaac 12540 ttttaaataa tagtctctac ggaaaaaccg ttgaaaatct taaaaaacga attaatttaa 12600 gattatgtaa ttctgcaaac aaattaatta attattgctc aaaaccaact tttcggaaat 12660 catttaaaat cgacaatgat ttggtctctg tgttattaaa taaggaatcg gtatgtttag 12720 atcgaccgag ttacattgga caaactgtac tcgatttatc taaacttagg atgtatcaat 12780 tacaatattt agagttagaa aaatatagaa acgaattcac ttgtgaaatt aatatagttg 12840 ctggggacac cgattccttt tttctggaaa ttaagaattg taagttagac attcttttat 12900 ctgctatggt tcgtgataat ctcctcgata cttccaatta tgacccaaaa catcccttgt 12960 atagtagaaa ccttgattct gtcattggaa agttcaaaga cgagaataaa ggacttaaat 13020 acaaggagtg ggtgtttctt cgtccaaaat gttatagcct cttgtctgaa gttgaaagta 13080 tcaaagctaa gggtataaca ttgaagggta ctgacttaaa acataagtca tacctagatt 13140 gttataacaa aaacttgatt ttttcggtac cacaaactcg tattactagt aaaaaacatc 13200 aattatttac aattaaaaac agtaaaactg cattaacaaa taaggatgat aaaagggttt 13260 gggtggggaa aaatataagt gtagcatatg gtcattatag aggaggtgat gtggttttag 13320 gtaacagtcc agaagaattt tccgagttta atgaagaata tgacgattat gaagaatttt 13380 gaatgaataa attatttgaa aatattcttt gtttttgtat atttttgttg atgtttataa 13440 tttttatgtt tttaagttag tataattttt tattaagtat taaaaaatgt caaaaggtaa 13500 atcatgtata attaaatatg tattttattt tattatttag tttatctctt tgaagtaatt 13560 ttaaagaaaa tccaggaact ttatccaaga attaaaaata cgttggatta ttgcgcacca 13620 gagtttctcg caacaaaaat ttaccatttc acagggatta acattgaaga acttctcgag 13680 attcactatg acaacgttca gggatctggt gaccgatgtg ttgcaaattg cattgtttgt 13740 aaagatagta tgagagtaac atgtttaaat ggatgtccat gttgccacga tttcaaacgt 13800 gaccaaacaa tccatatatt gttggccgaa atgcgtaaaa catattactc tgatattttt 13860 gatttgaaat aaaaattaat cttttttaat ctttattgtc aataaaaata tttgtatgga 13920 ttaaaagtta ttttttacaa ttcaaaattt atcgatgatt ttttattctt agacttacaa 13980 ttaaaaataa aaataaaatg ttaataataa taaaataaaa ataatattaa tataaaataa 14040 taataataat aataatataa aataaataat taaatgaaaa taaaaaataa ttatttaatt 14100 cacttcatta aaacaaacac tttaatttaa aaatataatt taaaaataaa taattgactt 14160 gattaaaaca aacaataaaa tttaataata attaaaatat aaataaataa tattattgac 14220 ttaaacaaat aattaaattt aataataata taattaaaat ataaataaat aatattattt 14280 acttaaacaa acaattaaag taaattaaaa tataattaaa atataaataa ataatattat 14340 ttacttaaac aaataattaa agtaaataaa aaataaaaca ttcatgattt gtagtcgaca 14400 ggcgatttga gccaatttta tttttatggg attttaagta tgaaaaaata cttaggcttt 14460 ttattcgata tttttatgtt ctttaatcaa ttttgagcca attttattaa aaaatactac 14520 tcagtacgac caatcatagg cgatttaagc caatttttat ttttaggctt ttaaatcgat 14580 aaaaaatact tgatcttttt tatcatgaat taaaaaattt ttaaaaattt taaatttaaa 14640 catttaaaca aaaatttatt taaaaaaaaa aatttcattc attaaaacaa ataattaaaa 14700 aaaatttttc attcattaaa acaaataatt tttaaaaaat ttaacccccc cctcccccca 14760 taaaaaaaat tttttagggg gggtctcggg attttttcaa gccaaatatc cacggggcat 14820 ttttcaagcc aaatatccac ggggcatagt cctcatatgt cactact 14867 // ID DNA-2_AAe repbase; DNA; INV; 4771 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4771 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1257-1257 (2011). XX DR [2] (Consensus) XX CC ~96% identical to consensus. TIRs are 719 bp long. Terminnal CC TTAA can be TSDs. CC The region 1017-1300 is an inserted FEILAI_AA (~87% identical to CC consensus). XX SQ Sequence 4771 BP; 1528 A; 898 C; 866 G; 1477 T; 2 other; ttaaccttca tccagccaac atacttttca catgcaaaaa ctacactaaa atgtgcataa 60 cttttttgat tctcgatgga ttttgttgaa attttcagaa cttcccgaaa aactcttcta 120 gtttatggtc caggacactt gggaatatgg tccacgtggt tccggagtta ttccggattg 180 tcttggggta ccaaaattgg ccacatcgtc cattcaacgg atatctcaaa acccagatga 240 gctagaaggt tgatgtcttc ggcaaagttg ttcagcaaac taagggctat ccgtcagtaa 300 caatcttaat tgggaattta tccgctaggt ggcgccagtt acaatttttc ttcaatcttc 360 tatatctcaa gatcctgatg agctagaagg ttggtgtctt cggtgaagtt gttcagcaga 420 ccctaagcta tttgtcagta acaatcttat ttgggaattt atccgctagg tggcgctagt 480 tacaaatttt cttcaatctt ctatatctca agatcctgat gagctagaag gttggtatct 540 ttggcaaaga tgttcatcag attaagggct atcaggcaat aacaaaatta gttgaggatt 600 tctccactag gtggcgccag ttacaacttt tcttcaatca tctatatcta aagatcttga 660 tgagctataa ggttggtgtc ttcggcgaag ttgttcagca gatcaaaagc tatctggcag 720 caattaatct aattgggaat ttatccgcta agtggcgtta gtaacaaatt atcttcaatt 780 ttatgtatca aaagttaaca atcagatcaa tgttgggtgg ctccagtttg tcttcgacat 840 agttgtttac atcgtggcga aaattttaaa ttctcataat tttatgttcg agatttttca 900 cgtttgattt gtaacttacg cagctcaaca gccacaatac ataagttgga tcatcaattt 960 gttttatatg ctttttttca ttcaaagcct tatattttat ttacgagatt ttttggtgga 1020 gccttccttc accgagtggt tagagtccgc ggctataaag caaagccatt ctgaagacat 1080 ctgggttcga ttcccgtaga cccaggatct tttcgtaaga gaaatttcct tgacttctct 1140 gggcatagag tatcatcgta cctgccattc gatgtacaaa tactaaaaat ggcaactttg 1200 gcaatgaaag ctctcagtta ataactgtgg aagttctcat tgaacattaa gctgagaagc 1260 aggttctgtc ctagtgagga cgtaatgcca agaaaaataa cgagctatgg gctagttcat 1320 ctcgggacca acggctttac ttcccttcag aaggaagtcg tcactaaaat ttgtaatgat 1380 tatctcttag atgggtcgta tcccaagtcc taagcgtatg ttttaacgac tataccgagt 1440 ccatttccac atacacagaa agaaaaatcc gtgtaaattt cagcgtaaaa tcgtgcacat 1500 aaagggaatg ccagatatgt cgcaaatcta cacgtcacat cgtgtaaaat tacatcaaca 1560 tcatgtaagt ttccgctaaa catcgtgtaa tctagcatgg actctaaccg attacgcgtc 1620 aattcgcata aatttacacg atgttgatat aattttacac gatgtgtcat gtaaatttgc 1680 gacatatctg gcattccctt tatgtgcatg atttttcgct gaaatttaca tggatttttc 1740 tttctgtgta gagtcctaat tcctagtttt tctgctagaa ttataacgaa cctttctaat 1800 gtaaacaata acattgggtt ttgacagaca ctcaatgtgt ttcatgattg aatattttcg 1860 acggcatgag cgtgagcaag tgattttgca tccaaaaatc attcgtgaga atttcgaagc 1920 atatctcaaa agatttggct tttgtgggag aacctcatgg ttgagatttg agtgtcaatg 1980 taccaacact ggatgttcaa tacatcaagg gcaatctgaa atggaaaact gaaccaatat 2040 tcattcgcta tatggtgcta gtaacctctc caatattctg aatttggtga gctggtaatg 2100 caattgattc aaattttatt cattagatat cactagtgat attttcatag aacttatcta 2160 atttaacaac atgataagca ggaaggttag tttttcgaca aataatattg gatggctgaa 2220 atctacctaa tagggatcaa gttggtttga gttttagtgg cgctgccgta tatttttgag 2280 gaagaatttt attccaagat cctcgtacgc taaaatatcg gtcttttagg atttttttca 2340 agcaacaaaa atcttcctta aaacaatcca acggaaatgt tatagacaac ggtttaaatg 2400 ctcaacttga tgctcaccgc taaggtttaa agttagagca gtctgagtac aagtaaatgc 2460 ttaagtaata acaaataatg tcactgataa caaaacttgt tgattttttt tttgagtttt 2520 gagttattaa aaactccgtg agttgatgtt ctttaactcg atatcgaaaa cattaattta 2580 ctgggattgt gtgaaaagtt atttccatga ataatcaaaa ttgaagggga ttcctgtacc 2640 gaaaaaatcg atactcaacg cgtaatgctt atatatatat gcacataaat caaaaagcta 2700 tatggaatma ttttgtatgg caaaattggc cagataaact atacacttca cgatattaaa 2760 aatatattca gagaaatatt tttagtaccg gttgtacata atggagaaaa acaaaaatcc 2820 ccacaaacgt ctttttcaaa caaacgtttt gatattcggg aaatttctag tgaaatatct 2880 ttctcaataa aaattaaacg attttaatta ttttgctcaa ctatagggat aatcgtagcc 2940 gatagtatat tcttaaaata ttaggaattt tatcaggatt aaaggatagc aaaaatccta 3000 ccatcgcagt ctagaaatat atactgtaga caacaacctt ccagcggatg aattcaatca 3060 aatttttcac cgccagatat aaagtgcatg ttcgattttg gcaacatgcc aaaaaacctt 3120 tatgatgccc aaatcaaacc gtgcaaaata aaacggttgt ttcaattgaa acttctatca 3180 acgtgataat gaccacgaat taaattgaac acggtaagtt tcaagacggc tcacagaaat 3240 ttcgagatta gcacgattgt tgactgaccc atgatccatt gttttgcata tgtattcccg 3300 tgttgccaaa atcaaaccat gccaaaataa aaaaaaaaac tttgccgaac atataaactt 3360 tctcaataca tagaaaagaa atactgttat tggcgccacc tagcagatga attatcatta 3420 atatttttta ccgctagcct tctagctctt cacgatctca atatatagca gattaaaaca 3480 ttattgctag caacacctag tggatgaatt ctcaaccgga tgttgcactg ctaaatagac 3540 cttgatctgt tgaacaaatt tacggaagac accttccttc tagcttatcg gaatctcgag 3600 acatataatg gggattaatt tttaataagg tttgcaactg cccaaaaatc cttgatctgc 3660 tgaacaacta tgtcgaaaac accaaacctc tagcttatca atatctcgtg atataggaga 3720 ttgaagaaaa tttgttacta gcgcaaacta gcgaataatt tcccaatcac taccagatag 3780 ctcttggtct actaaacaac tttgccgaag acaccaacct tctagcccac caggatcttg 3840 agatatagaa gattgaagaa aatttgtaac tagcgccacc tagcggataa attcccaatt 3900 aagattgtta ctgacggata actcttggtc tgctgaacaa ctttgccgaa gactccaacc 3960 ttctagctcw ccaggatctt gagatataga agatcgaaga aaatttgtaa ctagcgccac 4020 ctagcggaga aaatctaaac taattttgtt attgccagat agctcttagt ctgctgaaca 4080 cctttgccga agacaccaac cttcaggctc atcaggatct tgagataaag aagattgaag 4140 aaaaattgta actggcgcca cctagcggag aaattcttaa ctgattttgt tattgccaga 4200 tagcccttag tctgatgaac aactttgccg aagacaccaa ccttctagct catcagtatc 4260 ttgagatata gaagattgaa gaaaaattgt aactggcgcc acctatcgga taaattctca 4320 actaagattg ttactgccag atagcccagg gtctgctgaa caacttcgcc gaagacacca 4380 accttctagc tcatcaggat cttgagatat agaagattga agaaaatttg taactagcgc 4440 cacctagcgg ataaattccc aattaagatt gttactgacg gatagccctt agtttgctga 4500 acaactttgc cgaagacatc aaccttctag ttcatctggg ttttgagata tccgttgaat 4560 ggacgatgtg gccaattttg gtaccccaag acaatccgga ataactccgg aaccacgtgg 4620 accatattcc caagtgtcct ggaccataaa ctagaagagt ttttcgggaa gttctgaaaa 4680 tttcaacaaa atccatcgag aaacaaaaaa gttatgcaca ttttagtgta ttttttgcat 4740 gtgaaaagta tgttggctgg atgaaggtta a 4771 // ID BEL-227_AA-LTR repbase; DNA; INV; 362 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-227_AA_; KW BEL-227_AA-I; BEL-227_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-362 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 910-910 (2011). XX DR [1] (Consensus) XX SQ Sequence 362 BP; 107 A; 88 C; 65 G; 97 T; 5 other; tgttgagagc agcatctgta attactaact tawaattaaa cacmcagaat tgtacaaaca 60 tacwwacctt aacaacatat ataccttgat ccctattgtg attttgcatt gaacacatac 120 actcgtgaac gctasacaga taacctaaca gaaagtgccc taagatcagt ctaataaaag 180 ccatcgtgaa gtgaagtgcg cgttccacat ctctcgaagt ttcacggtcc gagaacaagt 240 tcttacagtt tgtcggtcaa gccaagcttg tccgtaatcg aagctctccc acacgtgtcc 300 gaaagtgtcg taggccagtg tccggtcgtt ttccgtcgaa ttcgcaatct tttcccgtaa 360 ca 362 // ID Copia2-NVi_LTR repbase; DNA; INV; 342 BP. XX AC AAZX01002725; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2-NVi; KW Copia2-NVi_I; Copia2-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-342 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1128-1128 (2007). XX DR Genome; AAZX01002725; Positions 4164 4505. XX SQ Sequence 342 BP; 98 A; 63 C; 74 G; 107 T; 0 other; tgttaaggaa aattcctcga cattgtatct agaatattcg acgactgacg caagcgctta 60 agcgagtatg tataatgtat atatgaaggt gtgtatagat atgtaaacat gaatgttttt 120 ctggacggta ggctcggcgc gagcatcaaa cgacacgata ggcagttcta gtctggccct 180 cgatatatat ggacgggtga tattttcgga agccctcgag tatgttagac taagaggtgg 240 agagtgcttt gttgagatca tctctcttgt aactacaata aactctatta taactacact 300 gcgcctattt tacattttac tctgctccta taatccttaa ca 342 // ID R1_Ele4 repbase; DNA; INV; 6005 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-sequence-specific R1 clade non-LTR retrotransposon family DE from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele4. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6005 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6005 RA Kojima K.K. and Jurka J.; RT "Non-sequence-specific families of R1 clade non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >99% identity, and ~99% identical to the original CC sequence in [1]. This family shows no sequence specificity. XX FH Key Location/Qualifiers FT CDS 1170..2705 FT /product="R1_Ele4_1p" FT /translation="MINQNKQLAECEGEASGPISNRLRFPPRKEATNFIEN FT SVGKSLDTVTTAGTGRSSATCVVTDGPGLIEAMDRNREYLPKMQVAAEQLD FT VIIEYVKSKTNISKDLKTSLLKLRQSVLAAKQDYEKAKEQLGKVEGQKDTR FT SCQTEVFSFTGNANISDDIIRYAMRKREASGDEYVAKRRMVAKGKVTYAAV FT LQSGKAGTSQAPRSRGHGNKGEANTNPKAKKAKKKEPRVNRNQAVHPQEPR FT AQGNSNPWIRVGNKKKQRKPKTEPKESAKPKTAKRRNLGEALVIKTEEAKY FT AEVLKAMRSTEKLSPLGADVRSIRRTRIGEMILVLKKDAKEKGAVYKQLAQ FT EVLGDEVDVRSLTAEATLQCKNLDEVTDAAEVSAALKEQCDIDVASKAIHL FT RKGPQGTQVAAIRLPVAEANKAKNLGILKVGWSVCRLSIQQPPEVCFKCFG FT KGHKSWNCNGPDRSKMCRKCGAEGHRASDCKQIAKCLICVDRADNGHLTGG FT PKCPGTSGVSKQPKRK" FT CDS 2618..5668 FT /product="R1_Ele4_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MCRQSRQRTLNGRTQMSGHKRSIKAAKTEVTQLNLNH FT CDAAQQLLWQSVSETKTDVVLLSDPYRIPANNGNWVADRSQQLAAIGTTGR FT YPIQEVVSSSNDGFAIAKINGVFYCSCYAPPRWSIEEFSHMVDRMMAELAN FT RQPVVIAGDFNAWAVEWGSRCTNQRGQLLLESLAALNVELANVGTVSTFRR FT NGAESIIDVTFCSPNLLGTMNWRVDDGYTHSDHQSIRYSIIPGGQKAARCN FT STQARGWKTARFDGEVFTEALRRERNTLDLNGEELVAMISRACDASMPRKA FT PPRENRPPVYWWCESIANLRAICLRARRRMQRARTEAQREERGAAFREAKL FT ALKKEIKSRKRACFESLCESANSSPWGDAYRVVMAKTKGAIAPQEKSPELL FT RSIIDVLFPHHPISPWPPAPYAADEGEEVARVTNEELAEVVKSFASNKAPG FT PDGIPNVALKAAVNTDPDMFRTTMQRCIDQGIFPDVWKRQKLVLLPKAGKP FT PGDPSAYRPICLLDTTGKLLERLILNRLVPYTESADGLSNNQFGFRKGKST FT LDAIQSVVQTAEVAIEHKRSGIRYCAVVTLDVKNAFNSASWEAIAHALHRL FT KVPVQLCKLLESYFDGRILLYDTEEGQKSVRITAGVPQGSILGPLLWNAMY FT DDVLRLPLPTGVKIVGFADDITLVVYGESMEEVELTAAHSISLVEEWMKSR FT KLGLARHKTEVVVVNNRKSEQRALISVGDCTIESKRSLRHLGVMIDDKLSF FT ASHVEYACKRASTAIAALSRMMSNSSAVIASKRKLLASVALSILRYGGPIW FT SKALRTNRNLKRLESTYRIMCLRVASAYRTVSKEAVCIIAGMTPIGLIIKE FT DVQCFNQRGTRGVRDTCKEETLRSWQQEWDNSTKGRWTHRLIPNVSDWYGR FT SHGEVNFHLTQFLSGHGCYRQYLHRFGHSESPACPNCAGVEETAEHVVFDC FT PRFIVVRGRMLTTCGGDTSPDNIIERMCADAECWNAVTTAVTHIMLELQRL FT WRADQELAAED" XX SQ Sequence 6005 BP; 1692 A; 1396 C; 1669 G; 1248 T; 0 other; caggagttaa tcatcaatgt taacttcctt ttcccgaaca gcctagatag ccgtgtagtg 60 tcggtagcgg ttgtttcaat tggctaagaa ttaacactac ggactgcctg ttccggtggt 120 ataagtccat ctcacaggtg acccctaatt catgccgctt taagcttacc gtgcctaaga 180 atgaatggtt aggggggtct aaaaaaaacc taaccgcaaa cggagcctgt ggagtaccag 240 ggcgccctct acagtattgt gcccttcctg tgctacccgg agcaatggtg caggtgacct 300 tgtgtttctc cgagacaatc ggcttccctt cttcagtctc aaacctgagg ctaaataagg 360 gtgggattat aaggatgttg taaattagtt taaatttttc acccttatgg cttcgcatta 420 tgcgtttttt tttttgttgg ggcagcaggg acgacagtcc aggggcttaa cggacccccc 480 cccccaggac ctctgcgagt tggggagttg cccaggatgt ggtgaagttc gcagtgggct 540 ctgatgaacc ttcataaaaa ccacaaaaat ccgaatgcaa ctctgtcaca gcgaccggtg 600 ccgcttaaag taccctagcc caaactaaca ggagtgccaa ccggcacatc aggattgata 660 ccagtgagat cctgattatg gcatactggt cgcgatacaa acggacaagg catggaatta 720 cgagtaggag tgcttcgagc acattgggac caacgccagt acggccccaa ttatgtatac 780 tggcagcgca cgacaaagga caagtaacgg tatatgtact ggataatatg ctttgaggct 840 gacagcggcg acattctact cccttagtag ggtcgggtga cactggcccg aaacggcgag 900 tgggttaatg gcgcaggctg cccccgtccc gtaaaaccgt ggcaggcctt agagtacgtt 960 ccaaatccgt cgcctggttg ttccaactgg gcgggagtag gctcagatgc cattacctga 1020 cctgcgccga tccggctctg aacatagtac cccataaagg actatgtcaa ccctagcatg 1080 gcctccctgc ttgcatagat aaccatggga ttataaaggc gactattcca gcggagatgg 1140 atagcggctc ttagggggac ccataagaga tgatcaacca aaacaaacaa ctagcggaat 1200 gtgaaggaga ggcttctgga cctatcagta accggctaag gttcccgcca aggaaggagg 1260 cgaccaactt tattgaaaac agtgtgggca aaagcctaga tactgtgacg acggcaggca 1320 ctggccgctc cagtgcaaca tgtgtggtga ccgatggccc gggactaatc gaggccatgg 1380 atcgcaatag ggagtacctc cctaaaatgc aggtagctgc tgagcaactt gatgtcatca 1440 tcgaatatgt taaaagcaaa acaaatatca gcaaagactt gaaaacaagt ctactgaaat 1500 tgcgacaatc agtcctagct gcaaagcagg actacgagaa agccaaggaa caactgggca 1560 aggtagaagg ccaaaaagac actaggtcgt gtcagaccga agtgttttcc ttcacaggca 1620 acgcgaatat atctgatgat attattcgat acgcgatgcg gaagagggaa gcttccgggg 1680 atgaatacgt ggcgaaaaga cgtatggttg ctaagggaaa agtgacctac gcggccgtcc 1740 tccaaagcgg aaaagctggc acgagccaag ccccgcgatc aagaggacat ggcaacaaag 1800 gagaggccaa caccaatcct aaggccaaga aagccaagaa aaaggagcca cgggttaacc 1860 ggaaccaggc agtgcatcca caggaaccac gggcgcaagg caacagcaac ccgtggatac 1920 gagttggaaa taagaaaaaa cagaggaaac cgaaaacgga gcccaaggag agcgctaaac 1980 cgaaaacagc gaaacgtagg aatttaggcg aagccctcgt tatcaaaacg gaggaagcaa 2040 aatacgccga ggttctgaag gcgatgcgaa gcacagagaa gctatcgcca ttgggcgcag 2100 acgtgcgaag cataaggcga actaggattg gtgaaatgat cctagtctta aagaaggacg 2160 ccaaggaaaa gggtgcagtc tataagcaac tggcacaaga agtacttggt gacgaggttg 2220 atgtaagatc ccttactgct gaagcgactc tccagtgtaa aaatctggat gaggtcaccg 2280 atgcagcgga ggtctcggct gccctcaaag agcagtgtga catcgacgta gccagtaagg 2340 ccatccacct tagaaagggc ccgcagggca cccaggtggc ggcgatcagg ctgccggtcg 2400 cagaggccaa caaagcgaag aacttgggca tactcaaggt aggctggtcg gtttgccggc 2460 tgagcataca acagcctcct gaagtttgct tcaagtgttt cgggaagggc cataagtcgt 2520 ggaattgcaa tggtcccgac agaagtaaga tgtgcagaaa atgcggcgct gaaggccata 2580 gggcaagcga ttgtaagcaa atcgctaagt gcctaatatg tgtcgacaga gcagacaacg 2640 gacacttaac gggcggaccc aaatgtccgg gcacaagcgg agtatcaaag cagccaaaac 2700 ggaagtaaca cagttaaatt taaaccactg tgatgcagcc cagcagctgc tttggcagtc 2760 agtctcggaa accaagactg acgtggtgtt attatcggac ccataccgca taccagccaa 2820 caacgggaac tgggtggcgg ataggtctca acagctagca gctataggga cgacaggacg 2880 atacccaatc caagaagtag tctctagctc aaatgacggt tttgcgatag caaaaatcaa 2940 cggcgtgttc tattgcagtt gttacgcgcc cccaaggtgg tcgattgagg aattttccca 3000 catggttgac aggatgatgg ccgaactagc taatcggcag ccagtagtga tagctggcga 3060 ctttaatgca tgggcagtgg agtggggtag ccgctgcacc aatcaaagag gccagctatt 3120 gttggaatcc ctggccgcat taaatgtaga gctggcaaat gtgggtacag tgagtacctt 3180 ccgcagaaat ggtgcagaat cgatcatcga cgtgacgttt tgtagtccta accttttagg 3240 taccatgaac tggcgcgttg atgacggtta tactcatagc gatcatcaat cgattcgcta 3300 tagtataata ccaggcgggc agaaggcagc gcgatgtaac tcgacccaag cccgaggatg 3360 gaaaacagcg cgcttcgacg gcgaagtatt cactgaagcc ctgaggcgcg aacgaaacac 3420 tctggaccta aacggtgaag agctagttgc tatgatatca cgagcgtgtg atgcttccat 3480 gccgagaaaa gcccctccta gagagaacag gccgccagtg tattggtggt gcgaatcaat 3540 tgcaaatctt cgagcaatct gccttcgggc tagacgaaga atgcaacgcg cacgcacaga 3600 agcacaaaga gaggaacgcg gtgcagcatt cagagaggca aagttggctc tcaaaaagga 3660 aatcaagagt cgaaaacggg catgctttga aagtctatgc gagagtgcca attctagtcc 3720 atggggtgac gcctacagag tggtaatggc taagaccaaa ggagccatag cgccccaaga 3780 gaaatcgccc gagttgctgc gatcgataat cgatgtactc ttcccgcacc atcccataag 3840 cccatggccc ccagcgccgt atgcggcaga tgaaggagaa gaagtggcga gagtgacgaa 3900 cgaagagctg gcagaagtcg tgaaatcatt cgcgtcaaac aaggcaccgg gacccgatgg 3960 tatcccgaat gttgccttaa aagcggcagt gaacacggat ccggacatgt tcagaaccac 4020 gatgcaacgc tgcattgatc aaggaatctt cccggatgta tggaagcgac agaaattggt 4080 gctactacca aaggcgggga aaccaccagg tgacccgtcg gcgtatagac ctatctgtct 4140 actagatacg acgggcaagt tattggagag gttgattctc aacagactag taccgtatac 4200 ggagagtgcg gacggcctgt ccaacaacca gtttggattc agaaaaggta aatctactct 4260 ggacgccatc cagtcggtcg ttcaaacagc tgaggtggca atcgagcata aaaggagcgg 4320 catccgttac tgcgcggttg tcactctgga tgtgaagaat gcgttcaaca gcgcaagctg 4380 ggaagcaata gcacacgcgc ttcaccgcct caaggtaccg gtgcagttgt gtaagcttct 4440 agaaagctat tttgatggta ggattctact gtatgacaca gaggaggggc agaaaagcgt 4500 tcgaattacc gcgggagtac ctcaaggttc aatcctgggc ccgctgttat ggaatgcgat 4560 gtacgatgac gtactgaggc tgccccttcc gacgggtgtt aagattgttg gcttcgccga 4620 cgatatcacc ctagtggtct atggtgaatc gatggaggaa gtagagttga cagcagcgca 4680 ctctatttcc ctggttgagg aatggatgaa gtctaggaaa ctaggactgg cccgtcacaa 4740 gactgaggtg gtggtggtca acaaccgcaa gtccgagcaa cgggcgctta tctcggtagg 4800 tgattgcacc atagagtcca agcgatccct taggcatctt ggggtaatga tcgatgacaa 4860 gctgagcttc gctagccacg ttgaatatgc ctgtaaaagg gcatctacgg ctatagcggc 4920 gctttcgagg atgatgtcta acagctctgc tgtaattgcc agcaaacgca agctgctggc 4980 aagcgtggcg ctatccatac taaggtatgg aggaccaatc tggtcaaaag cgcttagaac 5040 gaacagaaac ctaaagcggt tggaaagcac gtacaggata atgtgcttaa gagtagcaag 5100 tgcataccgg acggtatcta aagaggccgt gtgcatcata gccgggatga cgcccatcgg 5160 gctcatcatc aaggaagatg ttcaatgctt caaccaaagg ggtaccagag gagtccgcga 5220 cacgtgtaaa gaggaaacgc tcagaagctg gcagcaggaa tgggataact ccactaaggg 5280 tagatggacc catcgactaa taccaaatgt gtcagattgg tatggtagaa gccatgggga 5340 agtgaacttc cacctgacgc agtttctgtc aggacatggt tgctatagac agtacctgca 5400 taggttcggg cactcagaat ctcctgcgtg ccccaattgt gctggtgtag aggaaacagc 5460 ggagcatgtc gtgttcgatt gcccccgttt cattgttgtg agaggtcgca tgctcactac 5520 atgcggaggg gacacgtccc ccgacaatat tatagagaga atgtgtgcgg atgccgagtg 5580 ctggaatgca gtaactacgg ctgtcactca cattatgtta gaattgcagc gtctatggcg 5640 cgccgaccaa gagttggctg cagaggatta gccctgccga ggctggtccc ttgtaacatt 5700 gtttaagtcg gctaggagaa gcagtttgcc taggctactt ctgctacacg tgttgtgcta 5760 tatgcactgg tcccttcccg aagaaatacc gtaaggtggt tccggggaga tgagggtctg 5820 agtccaaggg tcatgtcgat gcactgttca ccacttgagc aattctaaat gaattgctca 5880 tgcacagtgc ataggctggt tttagcgggt cgtcgttggt gcgtcatccc cgcattccct 5940 gagttatctt ctcaggggat ctgtttgcag atttccccct tgtaaaaaac aaaaaaaaaa 6000 aaaaa 6005 // ID BEL-155_AA-LTR repbase; DNA; INV; 651 BP. XX AC AAGE02018864; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-155_AA_; KW BEL-155_AA-I; BEL-155_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-651 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018864; Positions 14429 13779. XX SQ Sequence 651 BP; 191 A; 159 C; 159 G; 142 T; 0 other; tggtgtgcca aaaacattgg ccaccagtgt agcgatcgaa cgatcgccat caaccgatga 60 agcaagcgaa aaacatccgg tcggtcggtg gtccgaagtt gcatcagcct gaccgatgga 120 ctccacaccg acatactgta acatcagcag tttccacgcg ctgccttttc aactataaac 180 gaggcagcaa caatgtaagt cgcagtagca acagaaattt cgacgaagca ccgcacgtaa 240 taggagacat gtggccgttc tcaaatgaca acccagaagt caccgtggag aataacatct 300 ccacgcacac gatggttggg tacaccattg ataccctctt cgtagtcatc atcatcatca 360 tcgtcattgc ctggcgatgg cggaaaaatg cgcgccgact caaggcggtg gagagtgcag 420 caaaacggct gcatcatcaa aatcccgtac aactcaaact ctgtagataa aatgaaataa 480 atgaataagt tagtctcaac tcagaattca agcgtgtcgc gtcgtttcta gtatcacggt 540 cttgtgatcg agctaggtgg ggctagttcg agactaatta atttcgatta acgatcgttc 600 ggtggaagcc ctagcgggag cggggccaat cgaaggttgt ttgctagaac a 651 // ID BEL-6_AA-LTR repbase; DNA; INV; 349 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-6_AA_; KW BEL-6_AA-I; BEL-6_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-349 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 862-862 (2011). XX DR [2] (Consensus) XX SQ Sequence 349 BP; 95 A; 84 C; 80 G; 90 T; 0 other; tggaagcaat aacaacaaca gcagcagcat cgcacatctg tcaagctcta cactcactga 60 gcgccttcca gtgagtgacc gagcgcatcg cagtgagtga ccgtcgccag tccagcatac 120 acatgcatca tcaatcatca tcgccgtcat cgtcgtcatc atttgcgcct ggaaacagcg 180 gaaatgttct acgcgtttct agaacatgtg gtgcgggata taaatacgcg tgatgtgcgc 240 ggctgaaaat cagttatatt ttcaaattcg aacaataaag tgaagtgttt tgaatcagta 300 taacttgcct agtgctttct tcgcgtgcgg aaagtgtcgt tttcgtaca 349 // ID BEL-630_AA-LTR repbase; DNA; INV; 456 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-630_AA_; KW Pao_Bel_Ele231; BEL-630_AA-I; BEL-630_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-456 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 456 BP; 170 A; 100 C; 82 G; 104 T; 0 other; tgttcaggac ggcacccagt cgtccgaatt attagctgcg gggcgagcgc cagcaacggc 60 tgagtgaccg gccgtccaac gagagaaact gactctgcac accacacata tcacactcag 120 tattatttga gacagcacag caatcacaca cgctagttgg acgaaaatac gaacgaaaag 180 tgaattatta attgttagtt tgttggacag gacaggactt agtacacgaa aaggactgaa 240 aatgtaagac aaacaatcat acaaatatgt aaactaacca acttatgcta cttacaacta 300 atcatgcaaa attaacttaa aattagtgaa ctaaaccaac caaatgtgaa cttaaatcta 360 attaatgaaa atgcaatttc agctaaaagc gaactctacc ataaaatccg agttttgcta 420 aacggattgt ccgaaatccc atcacctgtg tcaaca 456 // ID Copia-17_CQ-LTR repbase; DNA; INV; 221 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_CQ_; KW Copia-17_CQ-I; Copia-17_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-221 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 350-350 (2011). XX DR [2] (Consensus) XX SQ Sequence 221 BP; 66 A; 59 C; 41 G; 55 T; 0 other; tggtaatcac accgtgtgcc gacccctcgg cctcagcagc gagatgacac taacacaatg 60 acaacgatcg agcgagcgca cgcaaagcct gctgcgccat attgtttttc tcattcccaa 120 acgaacgctc aaagtgtaaa gacgcgagaa atataatttt acctttgttt gctaaattaa 180 aactaatttc gccttttatt ccacttggac gaaaagtccc a 221 // ID CR1-43_HM repbase; DNA; INV; 4527 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-43_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4527 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1871-1871 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 117..1391 FT /product="CR1-43_HM_1p" FT /translation="MAPREIGELFLTIQSSNNILENYIINVEEICSHPRDV FT IIYGFSQFKEAEVFKLRVELFQEILERFDIEQFSLLNIILDAEDPVKKNLR FT KRYKILTCLDDIYLMSISLAENQLHKDFIKLISNSKQASQQMYQPPDINMA FT LILKEVIDRLKALQLENKKIMADLHLLQVKYDELQKTNNEIVKNQCASQFN FT NTSNCEPHKSQENIKSIVSNSIKYHSPNQRNTLEEVTAHSSTEISSLYTDR FT KNKIRDSYATVVKMQNPEQNNRKPEKEKDCVTDYQEFKVVGKNNKVNKSHM FT DKQKKEPVFGNKVASNRSSIAGKRIIRESYIFVGGVCNSISEEDLSNYMNK FT EIGITPLDIKLNRENMYNRSFKVKINSTEKGMVLKPDVWDNNIIVKPFRMR FT REALTIEKPHQNTENHLPQFLHHSSAFMQQ*" FT CDS 1406..4453 FT /product="CR1-43_HM_2p" FT /translation="MGEKELNVTNTNITLCAYNCQSFKSNSYYISKLLKSF FT DIIFISEHWLLNIESFLLNNIALLTHKVFFHAAEKNTHGRPYGGNAFMVRK FT KLSSFYIIHEDENILAIKGTLNNRNLIFIGVYLSSCRNNNESYAKYSSQLQ FT TITSIMNNYEDEGECIISGDFQSFPYKIYDSELRKNKTRNNFSKHLTNFIE FT SNQLAFVDIINGAGPTYTYQHKTLSNSSYIDHIASYKQTELCFTNCQVMQS FT SPLNLSDHLPVSTNINFEGTGLSHEINNIMQKYNSIPKHAWNNERFIDFYN FT LNLSKAFTSYAFKEDKIEEEIQKTCLYIKNAALLAVKQCFGQKEFYKYSKP FT WWTPELKKIKDNLSFHFRQWQKSGYNKSTECISYKRFIYSRKNFRKAVKAA FT HNKKIHEKTKNIEHLRNTNPQRFWGNIRKIKKNTNNRLFTINKLQNKNEIV FT EEFSHHFQKLLNTPQYTNNEFNPLQIPHLSKEPNSKVISTADIEKCILKLK FT NNKSYDCYEISAEHLKNSHNKNLLILLSSFYNNMLTNGKVPHGLSTAKIIP FT LVKSYKTSLENPNNYRGISIIPIFTKLLEYLIIHICPDITESHSHQFGFKE FT NSSTLHAEFLLSETIKHYNHNNSPLYICSLDANKAFDSCNWELLFEKLYYQ FT KKIPLSIVHTILSLYQTGTSNISYLGCTSNKFRLSQGVRQGSILSPHLYNI FT YTENLLEKIVSESKVGTSINGVYTGIIAYADDIVLQSSTISGLQSLINIVQ FT KYGLSNFIKLNTEKTEFLISGISSILINVIYINGDPIKPQNNLKHLGFLWD FT NKNKKMTATLQKQNITERVTQFLSVAKGLIRNGLRFCQPATIVHLFNSLAV FT PTLLYGLELCGSSNKFLNSIDIAGRSVLKSFFNISKQSRNYLHSYFKVDAV FT SDILFRNKLNLFIRLLNNPICYSIIRTQMPLMNQRSFVGEVMIMCKKADIN FT MLQFMIEGKKIIIARPQEALEANVAAILKQSFQHWNLKEQREVFRTLMEEN FT IPRTMQENT*" XX SQ Sequence 4527 BP; 1762 A; 759 C; 659 G; 1347 T; 0 other; ttgcgcataa tttggtgcga acggacgtgt tttttattaa gaaatttact tcagcaaaaa 60 aaaaaaaaaa aaaaaaaaaa agattttaat ctcgatgtat ttaaacttta tgctatatgg 120 cgccaagaga aattggtgag ttgtttttaa caatacagtc aagtaacaat attttagaga 180 actatattat aaatgtagag gaaatctgct cacatccaag ggatgtaatc atctatgggt 240 tttctcaatt taaggaggcg gaagttttca aattaagagt ggaattattt caagaaatat 300 tggaacgttt tgacatagag cagttctcac tgctaaacat tattctagac gcggaagatc 360 ccgttaaaaa aaatctgaga aaacggtata agatattaac atgtcttgat gatatttatc 420 tgatgtcgat ttctttagcg gaaaatcaac ttcataaaga ctttattaaa ttgatttcta 480 actccaaaca agctagtcaa caaatgtatc aaccgccaga tataaatatg gcattgattt 540 taaaagaagt tattgatcgg ttaaaagcat tacaattaga aaataaaaaa attatggcag 600 atttgcattt attacaggta aaatacgacg aactacaaaa aacaaataac gaaatcgtga 660 aaaatcaatg tgcgtcacaa tttaacaaca caagtaattg tgaacctcat aaatcgcaag 720 aaaatataaa atctattgtt tcaaattcaa taaaatatca ttccccaaat caaagaaata 780 cgctggaaga agttactgca cactccagca cagagatttc aagtttatat accgatagaa 840 aaaataaaat tagagattcc tacgctaccg tggttaaaat gcaaaacccg gaacaaaata 900 accggaaacc agaaaaagaa aaagactgtg taacagacta tcaagaattc aaagtggtag 960 gaaaaaataa taaagttaat aagagccata tggataagca aaaaaaagaa cccgttttcg 1020 gaaacaaagt cgctagcaac agaagctcga ttgcgggcaa acgtattatt cgagaatcat 1080 atatatttgt aggcggagtt tgtaattcca tatccgaaga agatctatca aactacatga 1140 ataaagaaat aggtattact cctttagaca taaagcttaa tagagaaaac atgtacaacc 1200 gctctttcaa agtaaaaatt aatagtactg aaaaaggaat ggtacttaaa cccgatgttt 1260 gggataacaa cattattgtt aaaccgttta gaatgcggcg agaagcacta acaatcgaaa 1320 aaccacatca aaataccgaa aatcatttgc cacaattctt acaccactct tcagcattta 1380 tgcaacaata aaaattgaaa tcataatggg agaaaaagaa ttaaatgtaa ctaacacaaa 1440 tattacgcta tgtgcatata actgtcaaag ctttaaatcg aattcatatt atatttctaa 1500 gttgttaaag tctttcgata ttatttttat atcagaacat tggcttttaa atattgagag 1560 ttttttgtta aataacattg cattactaac acacaaagtt ttttttcatg cggcagaaaa 1620 aaatacacac ggtagaccat atggaggaaa tgcttttatg gtacgaaaaa aattgtcgtc 1680 tttttacata atacatgaag atgaaaacat tcttgctatt aaaggcacac ttaacaatcg 1740 taacttaatt tttattggtg tttacctttc atcttgtcgt aataataacg aatcatacgc 1800 aaaatactct tctcagttac aaacaatcac ttcaataatg aacaactatg aggatgaagg 1860 agaatgtatt atttccggag actttcagtc atttccatat aaaatatatg attcagagct 1920 tcgtaaaaac aaaactagga acaatttctc aaaacatctc actaatttta ttgaatcaaa 1980 tcagctcgct tttgtcgata taatcaacgg tgctgggcct acttatacct atcaacataa 2040 aactctttct aattcttctt acattgacca tattgcatcc tataaacaga cagaactatg 2100 ttttacaaac tgtcaagtta tgcaatcttc accgcttaac cttagtgatc accttccggt 2160 gtctacgaat attaactttg aaggtaccgg cttatcacac gaaataaata atataatgca 2220 aaaatataat tcaataccaa aacatgcatg gaataacgaa aggtttattg acttttataa 2280 tctcaatcta tcaaaagcat ttactagtta cgcatttaag gaagataaaa tagaagaaga 2340 aattcaaaaa acatgcttat atataaaaaa tgcagctctc ttggcagtta aacaatgttt 2400 cggacaaaaa gaattttata agtactcgaa accttggtgg acacctgagc ttaaaaaaat 2460 taaagataat ctctcctttc actttagaca atggcaaaaa tcaggctata ataagtcaac 2520 agaatgtata tcctacaaaa gatttattta ttcccgaaaa aactttcgta aagctgtaaa 2580 agcggcacat aacaaaaaaa ttcacgaaaa aactaaaaat atagaacact tacgcaatac 2640 aaatcctcaa aggttttggg gtaatattcg aaaaatcaaa aaaaatacta acaatcgact 2700 ctttacaatt aataaattgc agaacaaaaa cgaaatcgtc gaagaattta gtcatcattt 2760 ccaaaaacta cttaatacac cgcaatatac aaataacgag ttcaatcctc tccaaattcc 2820 acatttaagt aaagagccta attcaaaagt tatctcgaca gctgatattg aaaaatgtat 2880 tttaaagctt aaaaataata aatcttacga ttgctatgaa atcagtgctg aacatcttaa 2940 aaactctcac aacaagaatc ttcttatttt actttcgtcg ttttataaca atatgttaac 3000 aaatggaaag gttccacacg gtttatcaac cgccaaaatt atcccattag tcaagtctta 3060 taaaacatca ctagaaaatc caaataacta tcgaggaatc agcattatac caatttttac 3120 aaaacttctc gaatatctta tcatacacat atgtccagat ataacagaaa gtcattccca 3180 tcaattcggt tttaaagaaa acagctctac cctccacgca gaatttcttc tgagtgaaac 3240 aataaagcac tataatcata ataactcacc gctatatata tgcagtctag acgccaacaa 3300 agcttttgac agttgtaatt gggaattact attcgagaaa ctctattacc agaaaaaaat 3360 cccattatct atagttcata caatattatc actatatcaa acaggaactt ctaatatatc 3420 atatcttgga tgtacatcaa acaaatttag gctgtctcag ggagtgcgtc aaggatccat 3480 actgtcgcct catctctaca atatctacac agagaacctt ctagagaaaa ttgtttcaga 3540 atcaaaagtt ggaacttcaa taaatggtgt atacacaggt atcattgcat atgctgatga 3600 tatagtttta caaagttcca ctatatctgg tcttcagagc cttattaaca ttgttcaaaa 3660 gtatggacta tcgaacttta ttaaactgaa tacagaaaaa actgaatttc taatatctgg 3720 aataagttcg attttaatta acgttattta tattaacggt gatcctatca agcctcaaaa 3780 caatctaaaa catcttggtt tcctttggga taataagaac aaaaaaatga cagcaacact 3840 ccaaaaacaa aatataactg aacgtgtaac gcaattttta tctgttgcaa aaggtcttat 3900 aagaaatgga cttcgctttt gccagcctgc taccatagtt catttattta attctttagc 3960 tgttccaacg ttgttatacg gattagaact gtgtggaagc agcaataagt ttctaaatag 4020 cattgatata gctggacgat ctgtcttaaa atccttcttt aatatttcaa agcaaagtag 4080 aaattatcta cattcttatt tcaaagtaga cgctgtgtca gacattcttt ttcgcaacaa 4140 gttaaatcta tttatcaggc ttttaaacaa ccccatatgt tactcgatta tacggactca 4200 gatgccttta atgaatcaac gatcatttgt gggtgaggta atgataatgt gtaagaaagc 4260 tgatataaac atgctccaat ttatgatcga gggtaagaaa attataattg cacgccctca 4320 ggaagcattg gaggccaatg ttgcggcgat cttaaaacaa tcgttccaac actggaattt 4380 aaaggaacaa agagaagtat ttagaacgct gatggaggaa aacattccaa gaacgatgca 4440 agaaaatact taaatgtgct ttgttttttt tttaatactt ttacctatag tttatgtaat 4500 tattgggtgt ttaaaagaat aaatata 4527 // ID Chapaev-20_HM repbase; DNA; INV; 2326 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Chapaev-type DNA transposon - a consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-20_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2326 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1914-1914 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 362..2053 FT /product="Chapaev-20_HM_1p" FT /translation="MSSSRRGCLNNPNAFCHICGEYCQESQRRNITNFVKQ FT AYLAYFGMKLGDQDKSWAPHIVCKTCVENLRQWKNGQRKGLKFGVPMVWRE FT QKNHDSDCYFCLVNVKGINRFKKRKFEYPDLESARRPVPHSDDVPIPVFTT FT LPDLTVSDEEDMQNLVCEHADSGSEYEDSTSKQQQFSQEEINDLVRDLSLS FT KQDSELLASRLSEKNSLKAECKITFFRTREAALLPYFIQEEKIVYCRNIPG FT LLLQMGLPEYRPEDWRLFIDSSMRSLKCVLLHNSNSYASIPILHSTKLKEE FT YENIRMMLQKLRYNEHQWSICVDLKMVNFLLGQQSGYTKYPCFICLWDSRA FT KEDHWKKVVWPARENMTVGASNIINEPLVDRGKIIFPPLHIKLGLMKQFVR FT ALDKDGQCFKYICRFFPGLSMEKIKAGIFSGPQIRQIIKDEQFLYTMTDIE FT ASAWKSFVLVVHNFLGNHKSPNYEEIVQKMLSDFKTLKANMSIKVHFLHSH FT LDRFPDNLGSYSEEQGERFHQDIKVMEERYQGRLDTHMMADYCWSLQRDCP FT HSPHKRQSYRHRFLMH*" XX SQ Sequence 2326 BP; 786 A; 385 C; 444 G; 711 T; 0 other; cactggccaa caaaaaaaaa tttttttttt gctgccgagg atatttttac gaatttatag 60 tatatcgggt gtgctgattt cagaaatgac attagttttc cacgattggc tctagttctt 120 gagatacagg gtgttccaga tatattctta tacaaatatc ttcgttcatg catattttac 180 agtaaaatag ttttatttct ttttaaaggg cataccttta tatatatata tatatatata 240 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 300 tatatatata tatatatata tatatatata tatatatact ttttcctgtt tttagaacac 360 aatgagttct tcacgaagag gatgtttgaa taaccccaat gcattctgcc acatctgtgg 420 tgaatattgc caggagagcc aaagaagaaa tattactaac tttgtgaagc aagcatatct 480 ggcatacttt ggaatgaaac tcggagacca agataagtct tgggctcccc atattgtatg 540 taaaacgtgt gtcgaaaatc tacgtcagtg gaaaaatgga caaagaaaag gtttaaaatt 600 tggtgtgcct atggtatggc gggagcaaaa gaatcacgac agtgactgtt atttttgttt 660 ggttaatgtg aaaggtatca atcgtttcaa gaaacgtaaa tttgagtatc cggatttgga 720 atcagcaagg cgacctgtac cacactcaga cgatgtcccc attccagtgt tcactacact 780 cccagatcta acagtgtctg atgaagaaga tatgcagaat ttggtgtgcg aacatgcaga 840 tagtggtagt gaatatgaag acagtacttc taaacagcaa cagttctccc aggaagagat 900 caatgattta gtacgagact taagtctatc caagcaagat tctgagcttt tagcatctag 960 actaagtgaa aagaacagtt tgaaggccga atgtaaaata actttttttc ggacaagaga 1020 ggcagcactt cttccatatt tcattcaaga agaaaaaata gtatactgta ggaacatccc 1080 tggactcctc ctccaaatgg ggttacctga ataccgacca gaagattggc gattgtttat 1140 agatagttcg atgagaagtt tgaagtgtgt tttattgcac aatagcaata gttatgcatc 1200 catcccaata cttcactcaa caaaactgaa agaggaatat gaaaatataa gaatgatgtt 1260 gcagaagctt agatacaatg agcaccagtg gtctatatgt gttgatttaa agatggtaaa 1320 cttcttacta gggcagcaaa gtggatatac taagtaccct tgcttcatat gtttgtggga 1380 tagtagggct aaagaagacc actggaaaaa ggtggtatgg cctgctaggg aaaatatgac 1440 tgtaggtgca tcaaatatca ttaacgaacc actggtagac aggggaaaaa ttattttccc 1500 accactccac atcaaactag gactgatgaa gcaatttgta agagccctag acaaagatgg 1560 tcaatgtttc aaatatattt gcagattttt tccaggcttg agtatggaga aaatcaaagc 1620 tggaatcttt agcggccctc aaattcgcca aatcatcaag gatgagcagt tcctatatac 1680 aatgacagat attgaagcat ctgcctggaa aagttttgtc ttggttgttc ataacttttt 1740 gggcaaccat aaatctccca actatgaaga aatcgtacag aaaatgcttt cagacttcaa 1800 aacattgaaa gctaatatga gcataaaagt tcactttcta catagtcact tagatcggtt 1860 tcccgataat ctaggaagtt atagtgagga acaaggggag aggttccatc aagatatcaa 1920 ggtaatggaa gaaagatacc aaggaagatt ggatacacac atgatggctg actattgttg 1980 gagccttcaa cgtgattgtc cacattctcc acacaaaaga caatcataca gacaccgatt 2040 tttaatgcat taattgtgat ttcattgtac tgatttgaat gttttgatct attagcattg 2100 tatgtacttc cccataaaca atttgatata tatatctagt gttttttttg tgaattacgc 2160 tacaacctaa cataaattgt tgctattaat gtactttcgt gaccaccctg tatctcaaaa 2220 actagagcca atctaggaaa acggaggtca tattcggaat cagggcaaca agttacctat 2280 aaaatgaccc ccaaatgttt tgccgcaaaa tctttgttgg ccagtg 2326 // ID Crack-21_BF repbase; DNA; INV; 2531 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-21_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-21_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2531 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2531 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 826-826 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..2328 FT /product="Crack-21_BF_2p" FT /translation="YDLTNLITEPTRVTERSSSLIDVILTSNSSKYSSSGV FT YKGGLSDHHLVYTHRGLKQPKPAPKWISARIFKRFREEDFIADLERVPWST FT ITVFDSVQDMWSXWKSLFETVCNKHAPIKKFKVKGTERPPWLSDDIREIMS FT LRDQARCTAERTGNEKDWETYRHLRNHTKRLVASRKRSHYTEEINKGSTND FT MWKSLKSLLGKSKSGSILSMKDDTGETSSSPDIAHKLNNYFGTVAEKLADS FT IKKTMPSFSPLVYVRRCLARFSFKSITADFVKQELKLLSSNKATGLDHLNN FT RLLKAAAEVIAQPLTTILNESLKKHEFPEDWKRARVTPIHKAGDRSLPNNY FT RPVSILPAVSKILERAVHIQLYEYCTENNILSEVQSGFRPKHSTQSATHLL FT VEKWFTAMNSGNLTGAVFIDLSKAFDTLDHSILLQKLFKYGIQGGALDWFA FT SYLSGRQHCTSVNGVLSSFHTVKYGVPQGSILGPLLFIIYVNDMPNCIQNC FT EISMYADDTVIHFSSKDSGTIENALQTDLERLSQWFAVNYLSINEGKCKCM FT LIGTDKKLKTCTNPRLAVNGQFINLCYMYKYLGIFIDNQLNWKRHVKDVLS FT KLRRALSIMKHVSPFVSTSALRTLYNTIFLPYITYGSTVWDTAPEQDLQKL FT QRMQNRAGKLLLRAHYRTPSAEVLTRLGWKNIKSIHRQQKALLTYKALNNL FT LPVYMRNLFTYCRERSTRSTRQSESNLLYLPMVHREAFRRCITYSGTVLWN FT SLRENVRQAPSLSSFKNLTRLEIM*" XX SQ Sequence 2531 BP; 780 A; 566 C; 560 G; 623 T; 2 other; tatgatctga caaatcttat aacggagcca acacgagtaa cggagcgtag ttcatcactc 60 attgacgtaa tcttgacgtc aaactccagc aaatactcca gcagtggtgt atacaaggga 120 ggactgagtg atcatcacct cgtctacact caccgtggmc tgaagcagcc caaacccgca 180 cctaagtgga tttccgcgag gatctttaag agattcaggg aggaagactt catagcggat 240 ttagagagag tcccgtggtc aacgatcacg gtctttgact ctgtacagga catgtggtct 300 rcctggaaat ctctgtttga aacagtgtgt aataaacatg ccccaatcaa gaagttcaag 360 gtcaaaggaa cggaacggcc accctggctc tccgacgata ttcgggagat catgtcccta 420 cgtgaccagg cgagatgtac ggccgaacga acggggaatg agaaggactg ggaaacatac 480 agacatctga gaaatcacac caaaagactg gttgcctcca gaaagaggag ccattacacc 540 gaggagataa acaagggttc aactaacgat atgtggaaat ccctcaagtc tcttttgggt 600 aaatccaagt caggcagcat tctaagcatg aaggacgaca cgggagagac gagctcctcg 660 ccagatatag cccacaaact gaacaactat tttgggacag tagctgaaaa gctggctgat 720 tctatcaaga aaaccatgcc atcgtttagc cccttggtct acgtaagaag atgcctcgcg 780 cgtttctctt tcaagtcaat aacagcagac tttgtcaaac aggagcttaa actgctatct 840 tcaaacaaag ctactggcct cgaccatctc aataacaggc ttttgaaggc tgcggccgag 900 gtgatcgctc aacctctaac aaccattttg aatgaatcgc tgaaaaaaca tgagttcccg 960 gaggactgga agagagcgag ggtaacacca atccacaagg ctggcgatag gtccctacca 1020 aacaattaca gacccgtcag tattttaccg gcagtgtcaa aaatcctgga gagagctgtg 1080 catatccagc tttacgagta ttgtacggaa aacaacatac tctctgaggt tcaatcaggt 1140 tttcgaccca aacactctac tcaatctgcg acgcacttac ttgttgagaa atggttcaca 1200 gccatgaact ctggaaacct caccggggct gtgtttatcg atctgtctaa agcgttcgat 1260 actttggacc attctatcct gcttcaaaaa ttgttcaagt acggtatcca aggtggagct 1320 cttgactggt tcgcttccta cctctccgga agacagcact gtacttctgt aaatggagtc 1380 ctttctagct tccacacagt gaagtacggc gttccacaag gctcgatttt gggaccactt 1440 ttattcatta tttatgtaaa tgacatgcct aactgtattc aaaattgtga aatttcaatg 1500 tatgccgatg atacggttat tcacttcagc agcaaggatt ctgggaccat cgagaatgcc 1560 cttcagacag atctcgaaag gttgtctcaa tggtttgcag tgaactatct gtcgataaac 1620 gaaggaaagt gcaagtgtat gctaataggt acagataaaa aactgaaaac ttgtacaaac 1680 cccagattgg ccgtaaatgg tcaatttatc aatctatgct acatgtacaa atacttgggt 1740 atattcattg ataatcaact gaattggaag cggcacgtca aagacgtcct tagcaagtta 1800 agacgagcat taagtattat gaaacatgtc agtccatttg tatctacatc agcacttcgt 1860 actctataca atacaatctt cctgccgtac atcacatatg gaagtactgt ttgggatact 1920 gcaccggagc aagaccttca aaaactccaa cgaatgcaga acagggccgg gaagctgcta 1980 ctgagggccc actacagaac accctcggcc gaagtgctga ctcgtctcgg atggaagaac 2040 atcaaatcca tacacagaca acagaaggcg ttgctcacat acaaggctct aaacaactta 2100 ctgccagtct acatgaggaa cttgtttaca tactgtaggg aaaggtcaac tagatcaacg 2160 cgacaaagcg aatccaacct actatactta cccatggtcc acagggaagc gtttcgaaga 2220 tgtataacgt actcaggcac agttctgtgg aatagtctga gagagaacgt gaggcaggcg 2280 ccctctttat cgtcgtttaa gaacttaact cgactagaaa ttatgtaact atggactatg 2340 agatggactg atattatgtt gaagacgaaa ggatctatgg aatggtgaat tgtatgatga 2400 tgaattgtat cttttgtgta gtgcattgta tttagaaggt atttagacct ctgacctcct 2460 ccacccttga aatcttgaaa aacggctcag gccgaatgat gtatcaaggt taaataaagg 2520 ttgaaaaaaa a 2531 // ID Copia-3_DPu-LTR repbase; DNA; INV; 247 BP. XX AC scaffold_55; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_DPu_; KW Copia-3_DPu-LTR; Copia-3_DPu-I. XX NM Copia-3_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 670-670 (2010). XX DR Genome; scaffold_55; Positions 581052 580806. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 247 BP; 63 A; 47 C; 55 G; 82 T; 0 other; tgttaatggg gagatctggc aatctcataa ttgagggttg ccgaaaccca ttagcagtta 60 aaaagaagtc gtctgctgca acgtgtgacg tgaccaaaat taatgtgtct tctctctctc 120 tctacaggtc tccaggtacg ggtgtatgat ttgtgtctct ggtgtataag tgtacagtgt 180 agattcatct catgtgtgtg cttataataa agtccctttc agcagctaca acatttatgt 240 tctatca 247 // ID Copia-25_AA-LTR repbase; DNA; INV; 101 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-25_AA_; KW Copia-25_AA-I; Copia-25_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-101 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 951-951 (2011). XX DR [2] (Consensus) XX SQ Sequence 101 BP; 31 A; 16 C; 14 G; 40 T; 0 other; tgttagatat aagtaagcca tgactttggt ttaaataaat ttttcacttt ctgttcacac 60 gttaaacgaa caagtcgttt tctttcttat ctgaagatac a 101 // ID Aara5_AA repbase; DNA; INV; 652 BP. XX AC AJ006564; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Anopheles arabiensis Aara5 pao-like retrotransposon. XX KW BEL; LTR Retrotransposon; Transposable Element; Aara5_AA; KW pao-like retrotransposon; reverse transcriptase. XX OS Anopheles arabiensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Anophelinae; Anopheles. XX RN [1] RA Cook M.J., Martin J., Lewin A., Sinden E.R. and Tristem M.; RT "Systematic screening of Anopheles mosquito genomes yields RT evidence for a major clade of Pao-like retrotransposons."; RL Insect Mol. Biol 9(1), 109-117 (2000). XX DR Genbank; AJ006564; Positions 1 652. XX SQ Sequence 652 BP; 179 A; 161 C; 174 G; 138 T; 0 other; gagaaaccaa gctcaagggc ggccgctacg aaactggact cttatggcgt tatgatgatc 60 ccgacctacc ctgtaacaag gctgcggcga tgaaacggta tgtatgccta aaacaaaaat 120 taagtaaaga tcccgttcta gctgaagccg tgcaggccaa gatggaggag tacttagcta 180 aaggctacat cagaaagctc gctgacaccg tttcgatcgc gcgccagaag aacgatggta 240 cctccccata tttccggtaa ccaaccccaa caagccgggg aaaattcgca tggtcttcga 300 tgcagcggct aaggtaggcg gggtaagcct gaactctcgt ctgcttcccg gtccggatat 360 gcttgccgga ctgttagcgg tgctgctgaa attcagagag aatcgtgtag ctattgctgg 420 agatatacga gagatgttcc atcaagtggc catcaaagaa acggatcaac ggagtcagat 480 gatcctttgg gacagtggac gtcccggcac cggaccggca acttacgtcg tgaccgtaat 540 gactttcggc gcagcgtgct cccccagtag cgctcaatat gttaagaact tgaacgcgga 600 caggttttca gatgcatttc caagagcagc agaatgcatt aagcatgaac at 652 // ID Copia-20_CQ-I repbase; DNA; INV; 2937 BP. XX AC AAWU01016025; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_CQ_; KW Copia-20_CQ-LTR; Copia-20_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2937 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 355-355 (2011). XX DR GenBank; AAWU01016025; Positions 41345 38409. XX CC Positions [1459-1983] - Integrase core CC 'ATCAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 97..2919 FT /product="Copia-20_CQ-I_1p" FT /translation="MADQRYALPRLTSVNYHTWKFKMEMLLIKERLWSVIS FT KEKPQVPSEQWVEKDEVARATIGLWVDDNQARLIKTCKSAKEAWEKLKDHH FT DQRSVVYLLKKLARLDLPEGGEVEEHIQVFNELVQRVEDAWQEIPDKLRVA FT LMMVSLPDSFDPLVTAIQQRPVEEITSDLLQASLMAEAEKRRERSSAVESG FT DKALRSNVIKGKGGKGGTKQQQQQLCNHCHQPGHFWRKCPERKKEESTNMK FT PESSKEGEVKAKQADVSDEPLAWMTGHGGSAEWYVDSGASRHMTGDKRFFK FT ALQEKVSVSVTLADGKKAEVRGTGSGTIVGVNGEGNPVDIELHDVLYVPKL FT TSGLISVSTLAAKGFEVVFNGERCEVRNRAGVVAAQGTRHGSLYRLCTPDQ FT AMITVQRSHTANCLHTWHRRLGHRDQEVVQAIPGKFAEGIKVEDCGVRMLC FT SSCMEGKMSRTPFPKQAVHDTKDKLDLVHTDLSGPMEETPSGNRYYLSLID FT DYTRMTFVYLIRNKSDAAGKIENFVKFCKTQIGKSPRVIRSDGGGEYVNKA FT LQEFLRREEIVSQFTAPYSPQQNGVAERKNRYLKESSLCMLRDAELEPKYW FT GEAVLTATYIQNRLPSRSIGISPFEKWYGRKPSLEHMKIFGSEAWVQVPAQ FT KRKKMDVKARKLVFVGYSNEHKAYRFVDKQSERVTISRDVKFVEDLRMPDE FT RKCSEFESESVKHDREEQEVSPMEPIVRNELSDDDSFEGYESAEDLFNGEE FT VVQEAAAPGDPPGPGGVDGEPPALEAAPRAPDTEPAGQGQQLRVLPDRSTR FT GQLPVRLENYEVGLVVMEQDEPAAYEKAVRCPEKEEWVRANNASKKKQTKV FT CRLLKSICGLKQAARVWHRTIAGKLSRLRFERCNSDSSIYKQPIGKDGIFL FT LIYVDYLVACKQEEKINRIERDKERAVPKLDDGGAQG" XX SQ Sequence 2937 BP; 774 A; 711 C; 962 G; 490 T; 0 other; ggttttgtgc ccaggaacgc agcggaagag gagttcggtc gcgcggaagc tgaggaaaaa 60 gttgcagccg gacgattggt gaaaagttcc ggcaagatgg cggaccagcg ctacgccctt 120 cccagactga ccagcgtaaa ctaccacact tggaagttta aaatggagat gttgctcatc 180 aaggagcggt tgtggtcagt aatctccaag gagaagccgc aagttccctc ggagcaatgg 240 gtggagaagg acgaagtggc gcgtgcgacg attggcctgt gggtggacga caaccaggct 300 aggctgataa agacctgcaa atcagcgaag gaggcctggg agaagttgaa ggaccaccac 360 gaccagcgtt ccgttgtgta cctcctcaag aagctggccc ggctggacct tccggagggc 420 ggcgaggtgg aggagcacat ccaggtgttc aacgagctgg tgcaacgagt cgaggatgct 480 tggcaggaga ttccggacaa actccgagtg gccctcatga tggtttcgtt gccagactcg 540 ttcgatccgc tcgtcacggc cattcagcaa cggcctgtcg aggaaatcac gtcggatctg 600 ctgcaagcca gcctgatggc agaggcggag aagcggcgtg aacgttcaag cgcggttgaa 660 agcggagaca aggctctacg ctcgaacgta atcaaaggaa agggcggcaa gggtggcacc 720 aaacagcagc agcagcagct ctgcaatcac tgccaccagc ctggccactt ttggaggaag 780 tgccccgaac ggaagaagga ggagtcgacg aacatgaagc cagagtcaag caaggagggc 840 gaggtcaagg cgaagcaagc agacgtttcg gacgaacccc tcgcttggat gaccggtcac 900 ggaggatcgg cagaatggta cgtcgacagc ggcgcgtcgc ggcacatgac cggcgacaag 960 cggttcttca aggcgcttca agagaaggtg agcgtcagcg tgacgctcgc agacgggaag 1020 aaggcggaag ttcgaggcac cggaagcgga accatcgtcg gagtcaatgg tgaaggaaat 1080 ccggtcgaca tcgagctcca tgacgtcctc tacgttccca aactcaccag cggactaatt 1140 tcggtgagca cacttgcagc gaaaggattc gaagttgtgt tcaacggcga acgctgtgaa 1200 gtccggaacc gtgcaggagt agtagcagcc caaggaactc gccacggaag cctctaccga 1260 ttgtgcactc cggaccaagc aatgatcaca gtgcagagga gccacactgc caactgccta 1320 cacacctggc atcgtcggct cggtcaccgt gatcaagagg tggttcaagc gattccgggg 1380 aagtttgcgg aagggatcaa ggttgaggac tgcggcgtac ggatgctctg cagcagctgc 1440 atggaaggga agatgtctcg cactccattc ccgaagcagg ctgtgcacga cacgaaggac 1500 aagctggatt tggtgcacac ggatttgagc ggcccgatgg aggaaacccc gagcggaaac 1560 aggtactacc tgtcgctgat cgacgactac accagaatga cgttcgtgta cctgatccgg 1620 aacaagtccg acgctgcggg caagatcgaa aactttgtca agttctgcaa gacccagatc 1680 ggaaagagcc cgcgcgtcat tcggtccgat ggaggcggcg agtacgtcaa caaggcgctg 1740 caggaatttc tgcgccgaga agaaatcgtg agccagttca cggcgccgta ctctccgcag 1800 cagaacggag ttgccgagag aaagaaccgg tatctcaagg agtcgtctct gtgtatgctg 1860 cgcgatgcag agttggagcc gaagtactgg ggcgaagccg tgctgacggc gacctacatc 1920 cagaaccgtc taccgtcgcg gtcgattggg ataagtccgt tcgagaaatg gtacgggagg 1980 aaaccaagcc tagagcacat gaagatcttc gggagtgagg cgtgggtgca ggtgcccgca 2040 cagaaaagga agaaaatgga cgtgaaagcg cgaaaactag tgtttgtggg atactccaac 2100 gaacacaaag cataccgttt tgtcgacaaa caaagtgaac gtgtgacgat aagtcgtgat 2160 gtgaagtttg tcgaggattt gcgtatgccg gacgagagaa agtgcagtga attcgaaagt 2220 gaatcagtga agcacgaccg tgaagagcaa gaagtgtccc cgatggagcc catcgtgcga 2280 aacgagctgt ctgatgatga ctctttcgag ggttacgagt ccgccgagga tctgttcaac 2340 ggcgaagaag tggtgcaaga ggcggcggcg ccgggtgacc cacctggacc gggtggggtg 2400 gatggtgaac cgccagcact ggaggccgca ccgcgagcac cggatacaga gccagcaggt 2460 caaggacaac agctgcgagt gctgccggac agatctacgc ggggacagct gccggtcaga 2520 ctggagaact acgaagtcgg cctggtggtg atggagcagg acgaaccagc ggcgtacgag 2580 aaagctgtac gttgtcctga aaaggaggag tgggttcgag cgaacaacgc cagcaagaag 2640 aagcagacca aagtgtgccg actgctgaaa agcatttgcg gtctcaaaca agcggcgcga 2700 gtgtggcacc ggacgatcgc cggaaaactg tcgcggttga gattcgagcg gtgcaattcg 2760 gattcgagca tctacaagca gccaatcggc aaggacggga tatttctact catttacgta 2820 gattatctcg ttgcttgcaa gcaggaggag aagatcaaca ggatcgagag agataaggag 2880 agagcggtgc caaaactgga cgacggagga gcccagggat gaatccgcga ggaggag 2937 // ID TTAA2C_AP repbase; DNA; INV; 423 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA2C_AP. XX NM TTAA2C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-423 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2070-2070 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 423 BP; 136 A; 75 C; 76 G; 134 T; 2 other; gaggacgtcg cacccgcatg tgttgtctcc gtcttacaca cgtacgacat agcaaatttt 60 cgttcgctag tttcaatagg gtgctgtcan ttttgatatt agagtgaatt gacctattat 120 caaactttta ggtaacaaca ttatctgtgt tctctcgttg gttttttacg atattttaat 180 ttttaagtga gttatgagta tgtaaaacat taatatttga aaatgctcat aactcactta 240 aaaattaaaa tatcgtaaaa aaccaacgag agaacacaga taatgttgtt acctaaaagt 300 ttgataatag gtcaattcac tctaatatca aaactaacag cacnctattg aaactggcga 360 acgaaaattt gctatgtcgt ccgtgtgtaa gacggagaca acacatgcgg gtgcgacgtc 420 ctc 423 // ID Gypsy-98_AA-LTR repbase; DNA; INV; 700 BP. XX AC supercont1.272; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-98_AA_; KW Gypsy-98_AA-I; Gypsy-98_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-700 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.272; Positions 32127 31428. XX SQ Sequence 700 BP; 187 A; 158 C; 186 G; 169 T; 0 other; tgtaacattt gtcaatgtcc cattaatata aattaaaata atttaataaa attgtaatag 60 tattgcgtgt ttaggagtta ggtttttgta ttatataaac ttattaaaaa taaaaccgcc 120 cattgacacg ccgtccgcta ccaacgtcgc tggttaccag ctgccacaac cacgatatgg 180 ttggtatacg tggttagcag gaaatgcgag agagtgggag agaaaaggcc catagcaggg 240 ggtttgttag agctcgagct cgattgctgg gccaattaag attctaccag caaggtgtgc 300 ggatcgtaga aaatcggtcg aagtgtgata attcatgtcc ttcccaaagt ttcgagtgtg 360 tggtcgataa ccgggagcag actgtgtccc ccgagggcaa taatccttca gaggagtcgt 420 taaaactacc tccgaaaccg agatacgaag tggtatgtta aacagcccaa actccgcccc 480 tttaactgtg acgcgagtga cccacgtttc gcgccaagaa tttccgcgag tgccctaaat 540 gtgtggtggt cgctgagacc gcgtcccggt gcccgtattc gacgcccaat catccaccgg 600 cggcggtgac atcgctgcga gcggagcggc ggcgcgggcg caacttaaag tttctcacga 660 gtggcagagt aggcagcccc tgaggttaag gtaagttgca 700 // ID hAT-N10_AP repbase; DNA; INV; 372 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; hAT-N10_AP. XX NM hAT-N10_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-372 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2110-2110 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 372 BP; 132 A; 62 C; 57 G; 121 T; 0 other; catagataat aaagatatgg ataggccatt gaagcttatc atgcgttcat gtattactat 60 caattttgag gttatatggg gtaaaatggt aaatacacct actacattta tcattaatat 120 aatgccgaaa actactcaaa accaaaagtc ataggtatat tatattatta ttatagttcc 180 atgttagaaa tttagaatat ttcatattat accggcgggc ggcaattatt tgaagtttga 240 acataacctt aaatagtacc ccatgcagga gagagaacat tcttaaagat attttctctc 300 tcccacaaaa aagaacgcga cctcatacgg tgttaatata gccatggcct atccatatct 360 ttattatcta tg 372 // ID CR1-103_AAe repbase; DNA; INV; 5224 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-103_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5224 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1191-1191 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 24 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 384..1388 FT /product="CR1-103_AAe_1p" FT /translation="MDSCQVCSMTLDSGRALSCNGTCSSIFHFTCVGLTKT FT QYASLTAKLGMFWFCNSCRLNFEPAVYDRERTIMKALRELLIRTDSMDTRL FT GNYGENLRKINKTLYDSQVSMPADSSSFAKRIEQLTLDDSSDDPINRSRSC FT NETSFFEVLDAVNCSIAQPPEKFVVGNNKRVQILDNPSTIPANFSNGSRTD FT VSTPAALPHCSNRQSRNILDASVSPTRTEGAARPNNALKVASRVQHSNDVE FT AFYATPFEPEQSEEEVKSFVADITNAHPSLIKVTKLVPRGRSLADLSFVSF FT KITICKTYSKVVSDVWYWPDGISVRPFEANSKNETAVRLQNSS" FT CDS 1388..5119 FT /product="CR1-103_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNDKITTAESDPDSPYLGRTTSSTLEASSPPIIVESI FT QPAHISRPSPMFGMVEGVFQSVHSGKYEANKDITSSEDVPSRSVSRIAQSS FT FTTAITGATDFQQQPERTLSSCTLEALEPPITVEPIQPAFSSRSGPVYGSG FT EKVFQSVLSGKYQQHKNTIRSDITPARSYHESHHSSQLSTPGCTPASSKEG FT PNSLVTVEPFLPATCSHPGPVYEQEEGVFRTHFAGKYAHRQSNTFAVNSTA FT SSPMYHQLHQRTSSQTEPAISIASGCSTQSQTHQKIVTNSEASSDVLIYYQ FT NVGGINSAISDYKLACSDACYDIYAFTETWLNDNTLSNQLFDNSFSVYRQD FT RSSTNSCKSSGGGVLLAIRSSFKSRLLNPPNDLTVEQLWVAVTSLENTIFV FT CVLYTPPDMVNNDALIDKHLASLDWIVSKMGVKDNIIILGDFNLRGITWQR FT DSIGTLFPDSTCSSMSLATRELLDAYSTVGLKQLSSVLNENNRSLDLCFAS FT EDLCLDCSVMRAPSPLVKNCRHHPPLLITMRRAPQLHYVDSASSVFYDFSK FT ADFCGMNVFLRNVDWXQILNEDDANEAASSLTSILLYTIGQFVPLKTPREP FT VRPAWSNSHLKHLKRLKRAALRKHSKHRTDTTRAAYAESNALYTNLNERLY FT RAHLVQIQDRLKSDPKSFWRHVSDQRNQTGLPSSMTNGADLEANSSTETAE FT LFRAQFSSVFTDEILTAEDISTTTRNVPQLSNPGLPVSVTSNMVISALDKL FT KSSKGYGPDFIPSVLLKKCASSLALPLAKIFNTSLLTGEFPACWKDSFVFP FT VYKKGCRQNVGNYRGIAALSAISKLFELIVLDKLVQTCAHYISPNQHGFMP FT KRSTATNLTVLTTFVIREIQKGHQVDAIYTDLSAAFDKMNHEIALAKFDKL FT GFKNNSLKWLRSYLIGRSMSVKIGDHISSPFPVTSGVPQGSHLGPFLFLLY FT MNDVNLILNCLSLSYADDLKLYYVIKHKNDVVFLQQQLQVFADWCRRNRML FT LNVSKCSVISFGRKHSLLVEDYYLNNTKLQRETTVKDLGILLDTKLTFKNH FT VSYIVCKASSQLGFVLRFAKQFKDIYCLKALYCSLVRPILEYSAIVWSPYY FT QNEVQRIEKIQRKFVRFALRNLYWRNPSNLPSYESRCMLINLELLHARRDV FT SKACFISDLLQNQIDCPALLARLDINIRSRNLRSHSFLHLNAARTNYGLNE FT PLRNMCRTFNKCFEGFDFHVSRNVNKSRFKRILC" XX SQ Sequence 5224 BP; 1451 A; 1225 C; 1046 G; 1501 T; 1 other; ccactgttct atcgttgaag ttgtttttcg tgctcgcgag ttttcgatcg ttctgtcagt 60 gtttttgatc tgaaaaattc ctatagtgga ctcgaactat tcctaatgga cttgtgaatc 120 actgataatt acaatttgtg caaataagac tacgtgtgtt gtgtcattga gtgatttgtt 180 tttgtgcgat ttggacaatt attatcgcca ttggcagtat accctttctc gctttcggat 240 ttttgaagcg ccatctagca gcaaaattgg aaagcttatt tacggctact atcaaggctg 300 catacttgta ggcgcttaac atcgattatt gctaaccgta ccacagcggt gcttcggaac 360 agtattgctt tcgtcgttac atcatggatt catgccaagt gtgttctatg actttggact 420 cgggaagagc actttcatgc aatggaacat gcagctcgat attccatttc acctgtgttg 480 gcttgaccaa aacacaatat gcttccttga ctgcaaaatt ggggatgttt tggttttgta 540 actcatgccg cttaaatttt gagccggccg tatacgaccg cgagagaacc atcatgaaag 600 cattacgaga actgttgata agaacagatt cgatggacac ccggcttgga aattatggag 660 aaaatctccg taaaataaac aaaactctct acgacagcca agtatcgatg ccagcagatt 720 cctcatcgtt tgcaaagcgc atcgaacaac tgactctaga tgattcgtct gatgatccaa 780 taaaccgttc gagatcttgt aacgaaacat cctttttcga agtattggat gctgtaaact 840 gctcgattgc gcagccgccg gaaaagtttg tagtaggaaa caataaacgt gttcaaatac 900 tggacaatcc ctccacaata cctgccaatt tttcaaatgg ctctcgcact gatgtttcca 960 cacctgcagc tttacctcat tgttcaaatc gccagagcag aaatattctg gatgcatcgg 1020 tttctcctac acgtacagag ggcgcagcta gaccgaataa tgctctaaaa gtcgcaagca 1080 gagtccaaca ctcgaacgat gttgaggctt tctacgcaac cccttttgaa cctgagcaaa 1140 gtgaagagga agtaaaatcg ttcgttgccg acatcactaa tgctcatccg tccttgataa 1200 aagtgacgaa attagttcca cgtggaagaa gtcttgcgga tctttcattc gtgtccttca 1260 agataactat ttgtaaaacc tattcgaaag ttgttagcga tgtctggtat tggccggatg 1320 gaatctctgt tcgtccgttt gaagccaact caaaaaacga aacggccgta cgtcttcaga 1380 actcgtcatg aatgacaaaa taaccactgc agaatctgat cctgattcac catacctggg 1440 acgcacaaca tcaagcactt tggaagcctc ttcgccaccc atcatagtcg agagcatcca 1500 gccagcgcac atcagtcgtc ccagtcctat gtttgggatg gtcgaagggg tcttccagtc 1560 tgtgcactca ggcaagtacg aagcaaataa ggacattacc tcttctgaag acgttccctc 1620 ccgcagcgta tctagaattg cccaatcatc atttactaca gccattactg gagctacaga 1680 ttttcagcaa cagccggaac gcacgttatc atcttgcaca ttggaagcct tggagccccc 1740 catcacagtc gagcccatcc agccagcgtt cagcagtcgt tccggccctg tgtatgggtc 1800 tggcgaaaag gttttccagt ccgtgctatc aggcaagtac cagcaacata agaacactat 1860 ccgctctgat atcactcctg cccgcagcta tcatgaatcg caccacagct ctcaactttc 1920 tacaccggga tgcacgcctg ctagctctaa ggaaggcccc aattccctcg tcacagtcga 1980 gccattcctg ccagcgacct gcagccatcc cggtcctgtg tacgagcaag aagaaggggt 2040 cttccggacg cactttgcag gcaagtacgc acaccgccaa tctaatacgt ttgcagtaaa 2100 ttctaccgct tccagcccca tgtatcatca actccaccaa cggacgtcat ctcaaacaga 2160 accggctatc agtattgcct ctggatgttc tactcagtcc caaactcacc agaaaatcgt 2220 taccaactcg gaagcctctt cagacgtcct gatctactat caaaatgtcg gtggtataaa 2280 tagcgccata tcagactata agctagcgtg tagcgacgcc tgctacgaca tttacgcttt 2340 taccgaaact tggctgaacg ataatacctt atcaaaccaa ctgttcgaca actctttctc 2400 tgtttatcgt caagatcgtt cgtccacgaa tagttgcaag agttctggtg ggggcgtctt 2460 gttagctatc cgctccagct tcaaatcgcg ccttctgaat cctccaaatg atttgacagt 2520 tgaacaattg tgggttgcag ttacaagcct tgaaaatact atttttgttt gcgtcttgta 2580 taccccgccg gatatggtta acaatgacgc gttaatcgac aagcatttgg catcacttga 2640 ttggattgtc tcgaaaatgg gcgtcaagga taacatcatt attctgggcg attttaatct 2700 gcgcggcatt acttggcaac gcgactcgat cggaactctt ttcccagatt caacttgttc 2760 atctatgtcg ctggcaacgc gagagttgct ggatgcttac agtactgtcg gactcaagca 2820 gttgagttct gtcttgaacg agaacaatcg atcacttgat ctgtgttttg ctagcgagga 2880 tttgtgtctt gattgctctg tcatgcgcgc gccttcgcct ttagtaaaaa actgtcggca 2940 tcaccctcca ctgcttatta ccatgagaag agcgcctcag cttcactacg ttgacagtgc 3000 ttcaagtgtt ttctacgatt tttcgaaggc cgatttttgc ggaatgaacg tctttctcag 3060 aaatgtagat tggamccaga ttcttaacga agatgatgct aacgaagctg cttcatcgtt 3120 gactagtatt ctactctaca ctattggtca gtttgtgcca ctgaaaacac ctcgtgagcc 3180 agtcagaccc gcttggtcta attcacatct gaagcatcta aaacgtttga aaagagcagc 3240 actcagaaag cacagcaagc accgcaccga caccaccaga gctgcttatg ccgaatcaaa 3300 cgcactgtat accaatctca atgaacgtct ttaccgtgcg catctagtgc agattcaaga 3360 tcgcctcaag tccgacccca aaagcttttg gcgtcatgtc agtgaccagc ggaaccaaac 3420 aggtctacca tcttcaatga cgaacggagc tgatttggaa gccaactcat caacagaaac 3480 tgcagaattg tttcgagctc agttcagcag tgtattcacc gacgaaatac tcaccgctga 3540 ggacatcagt acaactactc gcaatgtacc acagctgtcg aatccaggtc ttcctgtatc 3600 agttacttcc aacatggtga tcagcgcttt ggacaaacta aagtcttcga aaggatacgg 3660 tccagatttc atcccatcag ttcttcttaa aaagtgcgct agttctctgg cgttaccttt 3720 agctaaaatt ttcaatacgt ctctgcttac tggcgaattt ccggcttgtt ggaaggattc 3780 atttgttttt cccgtataca agaagggttg cagacaaaac gttgggaatt atcggggaat 3840 agctgcatta agtgcaatct ctaaactttt tgagctaatc gttctcgata aacttgttca 3900 aacatgtgca cattatatct cgcctaacca acatggcttc atgcccaaac gatcaacagc 3960 cacaaattta actgttctaa ctaccttcgt aattcgcgag atacaaaaag gtcaccaagt 4020 agatgccatt tacactgatc tgtcggctgc attcgacaaa atgaatcatg aaatcgcact 4080 tgctaaattc gataagttag gattcaaaaa caactcactc aaatggttac gctcatatct 4140 gatcggccgc agcatgtccg taaaaatcgg cgaccatatt tcatcacctt ttccggttac 4200 ttctggcgta cctcaaggca gccatctcgg gccgtttcta tttttattat atatgaatga 4260 tgtcaattta attttgaact gcctaagtct atcatacgct gatgatttga agctgtatta 4320 tgtaatcaaa cacaagaacg acgttgtgtt cttgcagcaa cagcttcaag ttttcgctga 4380 ttggtgtcga cgtaatcgga tgctactgaa tgtatcgaaa tgctcggtga tatctttcgg 4440 ccgaaaacac tcattattag tggaagatta ttacttaaat aatactaaat tgcaacgaga 4500 aacgacagtt aaggatctag gcattttatt agatactaaa ttaactttta aaaatcacgt 4560 ttcctacatt gtgtgtaaag cctcttcgca gttaggtttt gttttacgat tcgcgaaaca 4620 atttaaggac atatattgcc tgaaagcttt gtattgttcc ttagttcgac ctatactgga 4680 atattcagcc atagtttggt ccccatacta ccaaaatgag gttcaacgaa ttgaaaaaat 4740 ccagcgcaag tttgttcgct tcgctcttcg taatctctac tggagaaatc cctcgaattt 4800 accaagctat gagagccgct gtatgctgat taacttggag cttcttcacg caaggcgaga 4860 tgtttcaaaa gcttgtttca tcagtgatct acttcaaaat caaatagatt gtcctgcttt 4920 actcgccagg ttagatatta atattcgaag ccgtaaccta cgatcacact cttttctgca 4980 tttaaatgcc gcaagaacaa attatggctt aaatgagcca ctgcgtaata tgtgtcgtac 5040 ttttaataaa tgtttcgaag gatttgattt tcatgtttcc cgcaatgtga acaagagtag 5100 atttaagaga attctttgtt agttttaagg tgtatgttat agccgttaag catgtcattg 5160 gggtgtataa ttttaacctg ttgacagtaa taataataaa taataaataa ataataataa 5220 ataa 5224 // ID BEL-1_RP-LTR repbase; DNA; INV; 686 BP. XX AC ACPB02040023; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_RP_; KW BEL-1_RP-I; BEL-1_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-686 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02040023; Positions 6362 7047. XX SQ Sequence 686 BP; 171 A; 113 C; 139 G; 263 T; 0 other; tgttggatcc tcgcaaggag atgcggcccc ccttgtggga aatttagaat gatgcagtgg 60 agggttggct gcattggagg tctgtaggga gaaaggaact gcgtacgtga cctggctcag 120 gcagttgtcc ttggacttca aaaaaagttg ttgggtcgtt tttcaatgtg tgttttgccg 180 gtgtctctgt gcttagcatc agaaaagatt ttcgggcttc ttctggttag gtgtgaattt 240 ttgtgccgtg atcagatcac tatagtaatc tattggatcc attggatgtt ttagttttta 300 taattggtca tgaccattct agaaaataat gtaatgccta ctcatgtaag tctcaaactt 360 tataatacgc catagactaa tgtgttccta tttcgtcaag aaatgttttt attattttga 420 attctagctt taagatttcc cgtttgtcaa gaaatgtttt tattattttg aattctagct 480 ttaagatttc ccgtttgtca agaaaggttt ttattatttt gaattctagc tttaagattt 540 cccgattgtc aaaattaata aatatttgtt tctttataaa aggtattttg agtggtggtg 600 tcccccacct agttccatcc cctatttaac tgttaattta aaatattatg tctaattgtc 660 cttcctccct ctacgtcgac taatca 686 // ID MARINER_AM repbase; DNA; INV; 937 BP. XX AC U19902; XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Apis mellifera mariner transposon, transposase pseudogene, DE partial sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; AMMARINER; KW MARINER_AM; mariner; transposase. XX OS Apis mellifera OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Apis. XX RN [1] RP 1-937 RA Ebert R.P., Hileman P.J. and Nguyen T.H.; RT "Primary sequence, copy number, and distribution of mariner RT transposons in the honey bee."; RL Insect Mol. Biol 4(2), 69-78 (1995). XX RN [2] RP 1-937 RA Ebert R.P.; RT "MARINER_AM."; RL Direct Submission to Genbank (17-JAN-1995)Paul R. Ebert, RL Entomology, University of California, Davis, Davis, CA 95616, RL USA. XX DR GenBank; U19902; Positions 1 937. XX SQ Sequence 937 BP; 335 A; 163 C; 168 G; 271 T; 0 other; ttgggttggc aactaagtaa ttgcggattt cagttgaagt tgatgacgat ctaatcaaag 60 caataatcga ttcggatcgt cacagtacaa ctcgtgagat tgcagagaag cttcatgtat 120 cacatacacg cattgaaaac cacttaaaac aacttggcta tgttcaaaaa ctcgatacat 180 gcattcctca cgaactgaaa gaaaagcatt taatgcaacg tattaacagt tgcgatttgc 240 taaagaaacg tagtagaaat gatccatttt taaaacgact gataactggc gatgaaaaat 300 ggattgttta caacaatatc aagcggaaaa gatggtggag caagccacgt gaaccagctc 360 aaacaacatt aaaaactggt attcatcaaa agaaggtttt gttatcagtt tggtgggatt 420 acaaagaaat tgtctatttt gaactcttac cacccaaccg aacgatcaat tctgttgtct 480 acattgaaca actaacgaaa ttaaacaatg cagttgaaga aaagcggtcc gaattgacaa 540 atcgaaaagt tgttgtattc catcatgacg atgcaagacc acaaacatct ttggtcattc 600 ggtaaaaatt attggagttt ggttgggatg ttttgccaca ttcactatat agtcctgacc 660 ttgcaccatc tgattacttt ttgcttcgat ctttacaaaa ctccttgaat gataaaaatt 720 tcaataatga tgatgatatc aaatcgtacc tgattcagtt ttttgctaat aaaaaccaga 780 agttttatga acgtgggatt atgatgctgc ctgaaagatg gcaaaagatc attgatcaaa 840 atgggcaaca cattacagaa aaaagttatt tagttccatg aaaaaattgt ctttgatttt 900 ctaaaaaaaa tccgcaatta cttagttgcc aacccaa 937 // ID R1-1_DYa repbase; DNA; INV; 5381 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.06, Created) DT 15-MAY-2009 (Rel. 14.06, Last updated, Version 2) XX DE R1-type non-LTR retrotransposon: consensus. XX KW Non-LTR Retrotransposon; Transposable Element; R1-1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5381 RA Jurka J.; RT "LINE-type retrotransposon families from fruit fly."; RL Repbase Reports 9(6), 1155-1155 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 636..1724 FT /product="R1-1_DYa_1p" FT /translation="MDSPAGGSAPKGDDPFRRSSRMSRSPTRGVEITVQGA FT GTHAPAVEKGDQTGPHTTAAITSNRTAVVASLAPAATITTSQKKISDLISV FT PTQNIASKSPPGSPIRPLDLSALQHENLRSILDMMDMCAFETQRHVTKETK FT GAITDLAALNTRAIQLQQSTVNVAAPAKDIATQTEAEAQKKRPASSALKSV FT TEKFPVLPPPNKAPKAAATKAKVPNPASFAGVAKNASSTVEWTKVKPKRIR FT KKPEAFILKKTGEASYADILLKMRADPNLSEFGNQVRRIRRTQQGELLLEV FT KGKASENVPLYRGAIEESLKEMAAVRTGTQRMALTCSGMDEATTAEELHRC FT MVSQFECCPRRCEGPAQNA*" FT CDS 2072..5038 FT /product="R1-1_DYa_2p" FT /translation="MMLDAIKIIQLNVNHCAAAQSLLAHTAAERSVDIMLL FT SEPYSPGIGNPSMILDESGKAAIKCCSPLHVQERAYLPMRGIAYARVKGVH FT IYSCYAPPSDSTDQFEEMLDTLVNHARGRCPTIIAGDFNAWAVEWGSRVSN FT PRGRAVIDAMMMLDLTLLNDGRVPTFNNDRGTSFIDVTFVSRGLVANSSWT FT VLDVVTLSDHALITFSVSSTGTTRSQPRRSLGQAWDTRKFDKDMLQYQIDT FT LLIPTGDAETMAAELMKILVAMCDATMPRKKKVQRKPPVYWWSPVLQQMRS FT ECLKARRRAQRARGSPHQTELLEAFKAKRTAFKHGISAAKAQAYKNLLDSV FT DDDTWGLAYKLVSNKLRKRDVPPSDPDVLANIVAELFPMQSTTWQPLAAAP FT VSDFPSITAQEVVEAARRIKANKAPGLDGIPGVVVKAVALSRPDIFRDTFQ FT QCLLDGVFPARWKSQKLVLLPKGKGPAQAAVSYRPLCLLDIVGKLFERILY FT ARIEAVTESTXGLSSHQYGFRKGKSTLNALTAVRNIAKNALEGDRWLGGKK FT EYCAIVTLDVKNAFNTARWPLILDAMYTMGIPEYLRIAVGSYFRNRILWYD FT TENGPKSYRVSAGVPQGSVLGPILWNIMYDGILAIRKPTGAELHCFADDVA FT ITAVAKTIMETQEKCNATIRAAIDWFEKAGLAIAAHKTEAVLLSSRKKVES FT MQVSVSGTQVSSAESLKYLGVLIDHRLSFKDHAKYASRKAAITSAALSRLM FT PNVGGPRDPARRLLVTVAKATLLYAAPIWSDATKKGSYLNGARMVIRSMAL FT RLIRGFRTISQDAALVLAGMPPADLEIKALVLISNGATRDEAYERVVGEWQ FT TRWQMSQRGRWTYQLIPEIDAWTQCSHKTLDYHMSQFLTDHGCFRAYLHRF FT RHVDSAQCLYCIDAVETAEHVLMHCSRFSAEREQLKALAGDPLSPNGLITA FT MTANVNKWELGHRIIINMMKRVRADETANRAGN*" XX SQ Sequence 5381 BP; 1476 A; 1331 C; 1490 G; 1083 T; 1 other; aaagtgtttt ctaatacgga cgtgtttttt atacgctgcg caaaatttcg attattttgg 60 tgtcaccgtg gccagaacaa ttggttttcg tgcagtgttc catagcggta gtgcgcgggt 120 gttcgccgca ctatcgaatc gacataaagg agcgcgtaaa caataaaaca gaaggaggta 180 ttattgcaat cggactgaca ctgcaatttt gcaatttttt gttgtaggtc agctgttaaa 240 tcagctgttg gtgctgccaa cctatcgcag ttggcgatct agtccgagcg gcgttgccag 300 acaaacggcg ctaccagacc tgcattggag atcggttagg gcgcgcaacc agctaaattc 360 aaattcgaat tcaaattcga atctgctttt agagcggtcg ctttcggaaa agggagtcgg 420 cacaaaggtc gcggacagac cgaaaatcaa cctctcaccc ttgcggtgcc taagtagagg 480 ctctggtgtc cgagaagagg cggaataacc tccagcagtc aacgcctagg ggaaatctgt 540 ggcctacgtg ggaactctct tttcaaaaaa gctgatccgc gtgttgggac agaccgagaa 600 gcatcctctc acccttcttg tgccacgttt gtacaatgga tagtccagcg gggggcagtg 660 cccccaaagg cgacgacccc ttcaggagaa gctccaggat gtcgcggtct cctacgagag 720 gagtagaaat aacggtgcag ggtgctggta cccacgcgcc ggcggttgag aaaggggatc 780 aaacgggccc ccatacgacg gcggcgataa cgagcaaccg cactgctgtg gtagctagcc 840 tggccccggc agcaaccata acaacaagcc agaaaaagat cagtgatctc atcagcgtgc 900 cgacgcagaa tatagcgtcg aaatctccac ctggttcgcc aatacggcct ttagacctgt 960 cagcgctcca gcacgagaat ctgaggagca tcttggacat gatggatatg tgcgcatttg 1020 agacgcagcg acatgtcacc aaggagacta aaggtgcaat aacggatctg gctgctctta 1080 atacaagagc gatccagctg cagcagagca cagtcaacgt agcggccccc gctaaagaca 1140 tcgctacaca gacagaggca gaggctcaga agaagcgtcc agcatcgtcg gccttaaaaa 1200 gcgtaacgga gaagtttcca gtgttacctc ctccaaacaa ggctcccaaa gcggctgcta 1260 ccaaggcaaa ggtaccaaac ccagccagct ttgcaggggt tgctaagaac gctagcagca 1320 cggttgaatg gacaaaggtg aagcctaagc gcatacgcaa aaagcccgag gcgttcatat 1380 taaagaaaac tggcgaagca tcgtatgccg atatactgct taagatgaga gcagatccga 1440 acctaagcga gtttggcaac caagtgagga gaattagaag gactcaacag ggggagctac 1500 tgctcgaggt gaagggcaaa gcctccgaaa acgtaccctt atacagagga gccatcgaag 1560 agtcccttaa ggaaatggcg gcggtacgca cgggtacgca gagaatggcg ctaacttgta 1620 gcggaatgga tgaggcgacc acggctgagg aactccatag atgcatggtc tcacaatttg 1680 aatgttgccc aagaagatgt gaggggcctg cgcaaaatgc gtgacggcac acagatagcg 1740 actgtgatgc tcagtgcaaa cgatgcaatc accgtgctta aaaggggctc ggttaacgta 1800 ggctggtctc ggtgccgcat caatcaggat gtacgtccta caagatgctt caggtgcttg 1860 ggatatggcc atagggcaac caactgcaag gcagccgacc gctccgactg ctgtctgcgg 1920 tgcggcgtga aagggcataa ggcaaaagga tgcgtggccc caccaaaatg cctcatttgc 1980 agcgagaatg ttgataggaa tcactcgaca ggtggctttg cgtgccccac ctataaggcg 2040 aacatcgtaa aaggagtcaa gtgccgtcgc gatgatgctg gacgcaatta aaatcatcca 2100 gcttaatgtc aaccactgcg cagctgctca gagcctgctg gctcatactg cggcggagcg 2160 cagcgtagac atcatgctct taagcgagcc ctattccccg ggaattggaa acccttcgat 2220 gatcttggac gagtcaggca aagcggccat aaagtgttgc agccctctcc acgtccagga 2280 acgggcctac ctgccgatgc gcggcatagc atatgccagg gtgaagggtg tgcatattta 2340 tagctgctac gccccgccta gcgacagcac tgaccaattc gaggagatgc tggacactct 2400 tgtcaatcat gcgagagggc gctgtccgac aattattgct ggggacttta atgcctgggc 2460 agtcgaatgg ggcagcaggg tatccaatcc cagaggcaga gcggtgatag atgctatgat 2520 gatgctggac ctgaccttgc tgaacgacgg ccgcgtgccc acgtttaata atgacagggg 2580 aacctcattt attgacgtca cttttgtcag cagaggccta gtggctaact caagctggac 2640 ggtactggac gtggtaacgc tgagcgatca cgctctgatc acctttagcg tctcttcgac 2700 tggcacaacc agaagtcaac cgagaagatc actggggcaa gcatgggata ccaggaagtt 2760 cgataaggat atgctgcaat atcagatcga caccctacta atcccaacgg gtgacgcaga 2820 gacaatggcg gcggagctta tgaaaatact cgtagcaatg tgcgatgcga caatgcctcg 2880 caaaaaaaag gtgcagcgta agccccccgt atactggtgg agccccgtcc ttcaacagat 2940 gcgatcggaa tgcctcaagg ctaggaggag agcacaacga gccagaggta gccctcatca 3000 aacggagctt ctggaagctt ttaaagccaa gcgaaccgcg ttcaagcacg gaatctcggc 3060 ggccaaggca caggcctaca agaatctgct ggatagcgta gacgacgaca cctggggtct 3120 ggcctataag ctggtaagca acaaactgcg gaagagagat gtcccccctt cggaccctga 3180 tgtcctggcc aacatcgttg cggagctttt tccgatgcag tcgaccacat ggcagccatt 3240 agccgcagct ccagtctccg acttcccgag catcactgca caggaggttg tggaggcagc 3300 aaggcgcatc aaggctaaca aagcccctgg acttgatggc attccgggag tagtcgtcaa 3360 agcagtggcg ttgtctagac cggacatctt cagggacacc tttcagcagt gccttctgga 3420 cggagtcttt ccagcaaggt ggaaaagcca gaagctggta cttttgccta aagggaaagg 3480 tccagcacaa gctgcagtca gctaccgccc actatgcctt ctggacatag taggaaagct 3540 atttgagcgc atcctatatg cccgaataga ggcggtaacc gagagcacam acggtctaag 3600 cagccaccaa tatggctttc gaaagggaaa gagtaccctc aacgcactca cggcagtgag 3660 aaacattgcc aagaacgcgc tcgagggcga cagatggcta ggtggcaaaa aagaatactg 3720 tgcgattgtc acgctggacg ttaaaaacgc atttaacacg gcaagatggc ctctaatcct 3780 cgatgctatg tacacgatgg gtatcccgga gtacctcagg atcgctgtag gcagctattt 3840 tagaaatcgt atcctctggt atgatacgga aaatggacca aaaagctacc gagtctcagc 3900 tggtgttcca caaggatctg tacttggacc gatcctgtgg aacatcatgt acgatggtat 3960 actggcaatc aggaagccca caggagctga gctgcactgc ttcgcagatg acgtggctat 4020 aactgcagtc gccaaaacga ttatggaaac ccaggagaag tgcaacgcaa caattagagc 4080 ggccatcgac tggtttgaga aagctggact agcaatagcg gcacataaaa ccgaagctgt 4140 cttattaagc agcaggaaaa aggtggagag catgcaagtc tcagtcagcg gaacccaggt 4200 gtcttcagct gagtccttga agtacctcgg agtcctcata gaccacaggc tatctttcaa 4260 ggaccatgct aaatacgcta gcagaaaggc agccatcaca tcggcagcct tatctcgact 4320 tatgcccaac gtgggaggcc ccagagaccc agccaggagg ctattggtaa ccgtagcaaa 4380 ggcaacgctt ctatacgctg cgccaatttg gagtgacgcc acaaagaaag gctcatactt 4440 aaatggagcg cggatggtga taaggtcgat ggcccttagg ctaataagag gctttagaac 4500 gatatctcag gatgcagcat tagttctagc aggcatgcca ccagccgatc tggagatcaa 4560 ggctcttgtt ctcatcagca atggtgctac acgggacgag gcctacgaac gggtggttgg 4620 agagtggcag acgagatggc aaatgtcgca acggggaagg tggacctacc aactcatccc 4680 ggagatagac gcatggacgc agtgcagcca caaaaccttg gactaccaca tgtcccaatt 4740 cctcacggac catggctgct tccgggccta cctacatcgt ttccgccatg tagactcagc 4800 ccaatgccta tactgcattg atgcggtgga aaccgcggaa catgtgctaa tgcactgttc 4860 taggttctcg gcggagaggg aacagctgaa agcgttagct ggagacccgc ttagtccaaa 4920 tggattgatc acggcgatga cggcgaatgt aaataaatgg gagctgggac accgcatcat 4980 cataaatatg atgaaacgag tccgtgccga cgagacggcc aacagagctg gcaattaaaa 5040 cacagaagga gtgttcctgt gaggtggcgg gataagaata ctacctcagc gttcccggct 5100 tgtcgtaaaa ggcgactaaa gggtggaaag gaggagcgct acatggccaa tgcatcagga 5160 agggagagcg acctggcttc acatcctgct catcgaagtc ttaccttgac tggcagttcc 5220 ggtgagcgtt tataggacag aggagcatac ggaggttttt gttttagtac gtaggcataa 5280 tcccaccagg ggttatgaat cgtgcatgcc acctacggac ggtaggtggt atctttagaa 5340 gattttaatt ttcctaccgt aaggcgatac aaaaaaaaaa a 5381 // ID Copia-107_AA-LTR repbase; DNA; INV; 226 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-107_AA_; KW Ty1_copia_Ele166; Copia-107_AA-I; Copia-107_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-226 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 226 BP; 60 A; 45 C; 51 G; 70 T; 0 other; tgaagggcgc tgacgtcctg gtgacgtttc agaaacggaa gcaagactag ctgtcatttg 60 tcaaacagcg gaagagaaaa tattgccaac ctttgtattt ccttctgtat tttagaataa 120 aagttaatgc gtaataagcg agcacgattt cttcgcacgg gtcttttaaa atccgagttt 180 cgtgagtttt tcgccactaa tcgcctgatg tttcgttcta tggcca 226 // ID Zator-N1_AAe repbase; DNA; INV; 165 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 23-DEC-2010 (Rel. 16.02, Last updated, Version -1) XX DE A Zator DNA transposon family from Aedes aegypti. XX KW Zator; DNA transposon; Transposable Element; nonautonomous; KW Zator-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-165 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 657-657 (2011). XX DR [2] (Consensus) XX CC ~98% identical to consensus. 3-bp TSDs; usually TWA. TIRs are CC ~30 bp long. XX SQ Sequence 165 BP; 50 A; 26 C; 46 G; 43 T; 0 other; ggggccgttc aaatattacg taacgcaaca ggggggggag ggggtcttac atagcgttac 60 ggtccataca aaaatttaaa attttccata caaaagctgt tacgtggggg tgggaggggg 120 tctaaaattg acaaattttg cgttacgtaa tatttgaatg aaccc 165 // ID Copia-1_TCa-I repbase; DNA; INV; 4104 BP. XX AC ChLG6; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_TCa_; KW Copia-1_TCa-LTR; Copia-1_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4104 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG6; Positions 1875805 1879908. XX CC 'ATATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 120..4040 FT /product="Copia-1_TCa-I_1p" FT /translation="MENTTIEKLGITQFNGKDYDHWKFRMEIILDHHDVKR FT CIEEENATPDDAFLKMDKKCKSLIIQCIANSHLQYVKDKTTAYQMWISLEA FT GFQRKGITSQLFLRKKLLTMKLSSNDTMEKHFLKFEETIRELKSVGAKLEE FT MDVVCHLLLTLPKSFDPLVTALETMEPSKLTLDFVKGKLLDCELKRKNQCS FT EIEVPSSSAFNSRKFNKQQSVWNKKVFPFTCHNCGKKGHKRADCHLLKQAN FT QTSSEHVAFVSYSGNAQDETVCKKMKWYVDSGATDHMVNSKEHLTNVRKLE FT SPVKICVAKDNVKLLATEIGDVNAVLRVNNTVTRATIKNVLYVKNLKHNLL FT SVQKIELASLNVSFEHGKVVIKRNSKVLAEGKRIDNLYEICFEVENKCKVV FT CSNVCEVSASLKLWHRRLGHLSNKNLVTLSKNNMVSGLNIRNSNCNESQIC FT EVCVKSKITKLPFGKRSDNKTTRVLELIHSDLCGPITPETHDGKRYFLTFL FT DDYTHFCVVYLLKSKSEVFEYFKCFESMVSAMFGTKIATLKCDNGREYLPN FT DLVSFCKGKGIIIKTTIPYTPEQNGKAERLNRTLLEKARAMILESELSKDL FT WGEAILCAAYVTNRCPTSCLENVTPSEMFYGRKPNLNNLRVFGSVAYSHIP FT KQKMKGKFDRKCDVCIMIGYTHNGYRLWIPETRKVICARNVIFDENKTITS FT VNSKNERHVSETDKEIMTYTELQNESEGNVEENKAIAEDEEHKINEDEESD FT EDKEPDETQTEGLRKSNRKKKRPAHFSDYELEHFALSVESFIDEVPESYEE FT ARSRTDYHDWEKAIAEELNALNRNGTWTLVEKPKNAKLIGNKWVFRLKRNQ FT NGDIVRHKARLVAKGFMQREGFDYEETYAPVAKLTTVRTLISIINHKNLHA FT QQMDVKSAFLHGKVKEDIFMSVPEGLEAEDNVVCKLNKALYGLKQASFCWN FT NRFNDFAEENNLIRSKNDLCLYCKKTDETELYLLIYVDDIIIASNNLEEIR FT NLKEKLINVFEMQDLGSLHYFLGIKIRSEKEGMYLSQKNYLQGLLKRFGME FT NCKGTGTPMKKGSLLTKDKIDEENLKNKPIRELVGCLMYVMLASRPDLSVS FT VNMCSRYQSTPTEELWQALKRILRYIKGTIDYELFYPKQECEQLVGYADAD FT WAGGIDDRKSTTGFLFKVCGATVSWCTRKQSVVAISSTEAEYIALAEAARE FT GLWLLHLIKDFGFDDTAFKIFEDNQSCIRLADHSEHKRLKHIDVKYNFIRE FT LVQNKTIKIEYVCTTEQIADILTKSLDKSQFNKLSLKLGLTNVNVN" XX SQ Sequence 4104 BP; 1491 A; 588 C; 876 G; 1149 T; 0 other; ggttatgggc ccaggttatg ctggaatgtg tatataaagg aatttggtcg tattgttcgg 60 tttgttaaaa agtgttacta aaaacgttcg gtgcacgtgt aacgacaaac ggaagaacaa 120 tggaaaatac aacaatcgaa aagttaggaa tcacccagtt caatggaaag gattacgatc 180 actggaaatt caggatggaa attattcttg atcatcatga tgtgaaacgg tgtattgaag 240 aagaaaacgc aactcctgat gacgccttcc taaagatgga caagaagtgt aaatcgttaa 300 taatccaatg tattgcaaat tcgcatctgc agtatgtaaa agataagaca acggcctacc 360 aaatgtggat aagtctggag gctgggtttc aacgaaaggg catcacgagc caactatttc 420 tacgaaagaa gttgctgaca atgaaattat cgtcaaatga caccatggag aagcatttct 480 taaaattcga agaaacgatt agggaactca agtcggttgg tgcaaaattg gaagaaatgg 540 atgtggtgtg tcatctatta ctgaccttac ccaagagttt tgaccctctt gtaacagcgc 600 tggaaacaat ggagccttct aaattgacac tagactttgt caaaggaaaa ttgctggatt 660 gtgagttaaa acggaaaaat cagtgtagtg aaattgaagt gccaagttca tctgcattca 720 acagcaggaa gttcaacaag caacagtctg tttggaacaa aaaagtgttt ccgtttacgt 780 gtcacaattg tgggaagaaa ggccataaaa gagctgactg ccatttatta aagcaagcaa 840 atcaaacctc gtcggaacat gtcgccttcg tttcgtattc aggaaatgca caagacgaaa 900 ccgtgtgtaa aaaaatgaag tggtacgtgg actctggagc gactgatcac atggtaaaca 960 gcaaggaaca tctgacgaac gtgcgtaagt tagaaagtcc agtaaaaatt tgtgtggcaa 1020 aagacaatgt aaaacttttg gctactgaaa ttggtgatgt taatgcggtc ttacgtgtga 1080 ataatacggt gactagggct actattaaaa atgttttgta tgttaaaaat ttaaagcaca 1140 atttactgtc tgtccaaaag atagagttag caagtttgaa cgtgtcattt gaacatggaa 1200 aggttgtaat taaaaggaat tcgaaagtac ttgcagaagg taaaagaata gataacttgt 1260 acgaaatttg ttttgaggtt gaaaataaat gtaaggtggt ttgctcaaat gtatgtgaag 1320 ttagtgcctc tttaaagtta tggcatagaa ggttaggtca tttaagtaat aaaaatctcg 1380 taaccttaag taaaaacaac atggtttctg gcttaaatat aagaaacagt aattgtaatg 1440 aaagtcaaat ttgtgaagtt tgtgttaaaa gtaaaattac gaaattgccc tttggtaaaa 1500 ggtctgacaa caaaacgaca cgtgttttgg aattaattca ttcagacttg tgcggaccta 1560 taactccaga aacacatgat ggtaaacgtt attttctaac ctttcttgat gattatactc 1620 atttttgtgt tgtgtatcta ttaaaaagta aaagtgaagt ttttgaatat tttaaatgtt 1680 ttgaatcaat ggtttcagcc atgttcggta caaaaatagc tactttgaaa tgtgataatg 1740 gtagggaata tcttccgaat gatctggtaa gtttttgcaa aggaaagggt atcataatta 1800 agaccacaat accatatacg ccagagcaaa acgggaaagc cgaaagatta aatagaacat 1860 tgcttgaaaa agctagagct atgatactgg aatctgaact ctcgaaggac ttatggggtg 1920 aagccatatt atgtgctgct tatgtaacaa ataggtgtcc cacatcttgt ttggaaaacg 1980 ttactccgtc tgaaatgttc tatggaagga aaccaaattt gaataattta agagtattcg 2040 gatccgtggc atacagtcac ataccgaaac agaaaatgaa aggcaaattt gatagaaaat 2100 gtgatgtctg cataatgatt gggtatacac ataatggata cagattatgg ataccggaaa 2160 ctagaaaagt tatttgtgcc aggaacgtca tttttgatga gaataaaaca attacttctg 2220 taaacagcaa aaatgaacgc catgtttcag agacagataa agagattatg acgtacacag 2280 aacttcaaaa cgaaagtgaa ggaaatgttg aagaaaataa agcaattgca gaagatgaag 2340 aacacaaaat taatgaagat gaagaaagtg acgaagacaa ggaaccagac gaaactcaaa 2400 ccgaaggact aagaaaaagt aacagaaaga aaaaacgtcc ggctcatttt tctgactacg 2460 aactggaaca ctttgcatta agtgtagaat catttattga tgaagttcct gaaagttatg 2520 aagaagcaag atccagaact gattatcatg attgggagaa ggctattgct gaagaattaa 2580 atgcacttaa tcgtaacgga acatggactt tggttgagaa accaaaaaat gctaaactaa 2640 taggaaataa atgggttttt agattaaaaa ggaatcagaa tggagatatt gttagacaca 2700 aagctagact tgttgctaag ggctttatgc agagggaagg attcgactat gaagaaactt 2760 atgctcctgt ggctaagctc actactgtca gaactttgat atcgataata aaccacaaaa 2820 atctgcatgc acaacaaatg gatgtaaaat cagcattttt acacggaaaa gttaaggaag 2880 atatcttcat gtcagttcct gaaggtctag aagcagaaga caatgttgtg tgtaaattga 2940 acaaagcact ttatggtctt aaacaagctt cattttgttg gaataatagg tttaatgact 3000 ttgcagaaga aaataatttg ataagatcaa agaacgatct ttgtttgtac tgcaaaaaaa 3060 ctgatgaaac ggaattatac ttattaattt acgtcgatga tattattatt gctagtaaca 3120 acttggaaga aattcggaac ctcaaggaga agctaataaa tgtgttcgaa atgcaagatt 3180 tagggtcttt gcactatttt ctaggtataa aaatccgatc tgaaaaggaa ggaatgtatc 3240 tgagtcaaaa aaattactta caaggattgc taaaacgttt tgggatggaa aactgtaaag 3300 gtacagggac tccaatgaaa aagggatcac tattaacaaa ggacaaaatt gatgaagaaa 3360 atttgaaaaa taaacctata cgagaattag ttgggtgtct aatgtatgtc atgttggcaa 3420 gcagaccaga tttaagtgtc tccgttaaca tgtgtagtcg ttatcaaagt acaccgacag 3480 aagagctgtg gcaagcccta aaacgtattt taagatacat caaaggcacg atagactatg 3540 aactgtttta tccaaagcaa gaatgcgagc aacttgtggg ctatgctgat gcggattggg 3600 cgggcggcat tgatgacaga aaatctacaa caggttttct attcaaagtg tgtggtgcaa 3660 cagtgtcctg gtgtaccaga aaacagtccg ttgtagcaat ttcttctact gaagcggaat 3720 atatagctct tgctgaggca gctcgtgaag gactttggtt gctacacctg ataaaagatt 3780 ttggttttga tgataccgct ttcaaaattt ttgaagataa tcagtcatgt attagacttg 3840 cagatcattc agaacacaaa agactgaaac atattgatgt taaatataac ttcatacgag 3900 aacttgttca aaataaaact attaaaattg aatatgtatg tacaactgaa caaattgcag 3960 atattttgac caagtctctt gataaatctc agtttaataa gttgtcattg aaactaggtt 4020 tgacaaatgt taatgtaaat tagaattttc ttgtttcact gatattgttt tattaccgta 4080 aagtgtgctt gaattgaagg ggag 4104 // ID BEL-159_AA-I repbase; DNA; INV; 6150 BP. XX AC supercont1.186; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-159_AA_; KW BEL-159_AA-LTR; BEL-159_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6150 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.186; Positions 538062 544211. XX CC Positions [5204-5764] - Integrase core CC 'GGTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 75..4754 FT /product="BEL-159_AA-I_2p" FT /translation="MNSARTCGKCRIDGWVNHMVGCDMCDMWLHRECADVT FT SETLDPNKSWRCERCLHDEATEVASHRTHRTTRTSSSIRAQRAELALKLLE FT EEQELIKRNREKEDEFRRREQEIQKKRAEEDAKLLKQKHELLQSLEDENGS FT VRSVMSSKASRKKVESWLGTAPGSVPAGIAEKPQASVDIDQSGIQKVITAS FT APISNLFDTNATMEVNRPKSTSTSQKHLSTPSVIPYTTVPLSNLPLQGTSS FT SSVSKWVGYYPGANVHTSSALPTIFESQSNATNVAQTNVSFGSPGITPTAP FT FASKLGDSQLPPVYTAGSHKPSSESMSLGQIVSVNSGSQFGRVSNIVTTHS FT GVSCSGQWNRPWTTSAVTTTPVSGPSFGISGQSGPGCPGFPATIGNSDAQR FT IDGSHCTVPPCGQQFDTSFQQAPSGVQIAARQVMPKDLPSFTGDPQDWPLF FT SSSFYNSSAACGFSDAENLARLQRCLKGHALESVKSRLLMPQSVPHVMETL FT RMLYGRPEILIHSLLQKLRSIPSPKNENLQSIITYGLAVRNLVDHMHIANL FT EDHLRNPMLVHELVEKLPPQMRMQWSWYKRSICYVNLATFGEFMSELVKTA FT TDVTIPSDSTIQQTKSINTGREKQKLYVHAEANEGQSAEAAVNLQNSLREA FT EIVKRLCAYCSDENHEVARCPQFKALDMDGRWKAIRSKGLCRTCLIPHRKW FT PCRSGKECGIDGCRFRHHALLHSRIATEPASEQVVRQNFHSVTKFSLFRYI FT PVTLEGNGKKVDTFAFLDDGSESTLMESGLAAELNISGSSEPLWLGWTGDI FT SREEKGSQRISVAIAETGLKGQYRLDNVRTVQTLKLQSQSFQYSELQKIYP FT HLRGLPLRNYSNAVPRMIIGIEHAQMLTSLKVREGGANDPVAAKTRLGWCV FT YGKQIDGNSTVTRLHVHSEEIGNRELHEQMKQYFSIEEVAVATPLESEEDR FT RARGILEKTTRRIYGGFETGLLWKYDRPSFPNSYPLAVCRMHSLEKKLSKE FT PALQERVNELIAEYEAKGYAHRITQAEIQSTNSDRVWYLPLGIVRNPKKSE FT KIRLIWDAAARVNGVSFNDMLLKGPDMLTSLFAVLLRFRQRPVAVCGDIQE FT MFHQVRIITQDKQYQRFIYREHRDQAPQVFIMDVATFGATCSPCSAQHVKN FT TNARDFASLFPKAAEAIVNAHYVDEYLDSVDTIDEAVQLVEEVRYVHAQGG FT FHIRNFSSNSPELLQRLGQTKQIEKKSMVLDDIIDTERVLGMMWKPAEDVF FT TFDVNLKDDLAKLLLEGITPTKRQVLRVVMSLFDPCGFIAHFIIHGKILIQ FT HIWRSGTDWDEGITDGLLGLWHHWTRLLQQLIQVKVPRCFFGNTSSKLHSG FT IQLHVFVDASELAYACVAYLRIVKNKKVCCVFVASKTKVAPLKPLSIPRLE FT LQAGLIGCRLMENICTALDLPLERRYLWTDSRTCLSWIRSDSRRYHPYIGF FT RVGEILNVSVVDEWHYVPSKQNVADDATKWGCGPSFSPNSRWYTGPNFLYL FT TEEEWPEQPLKEMTTDEELRVVFHHHREVRQTYCTGNSFLDKSR" FT CDS 4895..6148 FT /product="BEL-159_AA-I_1p" FT /translation="MDEAGVIRMNSRINAAPTISFEMKYPIILPKDHPLTT FT LLIDSYHRRFKHQNRETVLNEIQQQFRIPKLRPLIQRVAKNCQYCKVRKAM FT PRPPMMAPLPKVRLTPHIRPFCYTGVDYFGPLLVKQGRSLVKRWVALFTCL FT TTRAIHLEIVHSLSTQSCIMAIRRFGARRGFPTDFYSDNGTCFRGASNLLT FT KQIEAIHEGCALTFTNTRTTWHFNPPSAPHMGGSWERMVRSVKEAMGAISE FT HSRHPSDEVLETVVLEAEAIVNSRPLTYVPLDNAEQEALTPNHFLLYGSSG FT IVQPRSSLSDEGVVLRDSWKLVQYLVDEFWRRWVREYLPTLTRRTKWFETV FT KPLEPGDLVIVIDESKRNGWLRGRIIEVMRGKDGQVRSAMVSTADGIMKRP FT AVKLALLDVHDDPTQGTPELHGRG" XX SQ Sequence 6150 BP; 1789 A; 1385 C; 1528 G; 1448 T; 0 other; aactttaaag attcactgat tgaatctatc actgcgcagt aagcagtagg aagttctgcg 60 aagtaagcag caacatgaat tccgctcgga cttgcggaaa atgtcgcata gatggctggg 120 tgaaccatat ggttggatgt gacatgtgcg acatgtggct tcaccgggaa tgtgcagacg 180 ttacgagcga aaccctcgat cccaacaaaa gctggcgctg tgagcggtgc ctacatgacg 240 aggcgacgga agtggccagc catcgcacgc accgcactac aagaacatca agcagcatcc 300 gtgctcaacg ggctgagttg gctttaaagc tccttgagga ggaacaagag ctgatcaaga 360 ggaatcgtga gaaagaggat gagtttcgcc gaagggaaca agaaatacag aagaagagag 420 cagaggaaga cgccaagcta ctgaaacaga aacatgaact tctgcagtct ttggaggacg 480 aaaatggaag cgttagaagc gttatgagca gtaaggcaag tcggaagaag gtagagtcgt 540 ggcttgggac tgctcctggc tctgtaccag ccggaatagc cgagaaaccg caagcgtctg 600 ttgatattga tcaatctgga atccaaaagg tgataacagc gtctgctccg atatcgaacc 660 tgttcgacac aaatgctacg atggaagtca acaggcccaa atcgacgtca acatcccaaa 720 aacacttaag tactccttca gttattcctt atacaacagt ccctctatcg aacttaccac 780 tccaagggac cagcagttca tcagtttcaa agtgggttgg gtactatcct ggggcaaatg 840 ttcacacatc gtccgcactt ccgacgattt ttgaatccca atcaaatgcg acaaacgtcg 900 ctcaaacaaa cgtatccttt ggaagtccag gtattactcc aacggctcca tttgcttcca 960 aattaggcga tagtcaattg cctccggtgt atactgccgg ttctcacaaa ccatcttccg 1020 aaagtatgtc cttgggacaa atagtatctg tcaattccgg atcacaattc ggacgtgtat 1080 caaatatagt tacaacgcat tcgggggttt cgtgttctgg gcagtggaat cgaccttgga 1140 cgacgtcagc tgttactacc acacctgtat cgggtccgtc atttggtatt tctggacaat 1200 ctggacccgg ctgtccaggg tttccagcaa caataggaaa tagtgatgct cagcggatcg 1260 atggttcaca ttgtacagta cctccttgtg gacaacagtt tgatacgtct ttccagcaag 1320 caccgtcagg cgttcaaata gcggcacgac aggtaatgcc aaaagatcta ccgtcgttca 1380 ctggtgaccc acaagactgg ccgttatttt ccagttcatt ttacaactct tcggcagcgt 1440 gtgggttctc agacgcggaa aatctagccc gattacaacg ttgcttgaag ggacatgcgc 1500 tagaatccgt aaagagtcgt ctactaatgc cccagtccgt accgcacgtg atggagactt 1560 tgcggatgtt atacggacga ccagaaatct tgatacactc actactgcag aagttgcgaa 1620 gcattccttc acctaaaaat gaaaacttgc aatcaattat cacctacggt ttggcggtgc 1680 gtaaccttgt cgaccacatg cacattgcaa atttggaaga tcacttgcgc aacccaatgc 1740 tggtgcatga actagtggag aaacttccac cgcaaatgcg aatgcaatgg tcctggtaca 1800 agcgctcaat ttgctatgtc aacctggcga ctttcggaga atttatgtca gagttggtaa 1860 agacagcgac agacgtaacg attccatcgg attcgacgat acaacagact aaatccatta 1920 atacggggcg tgaaaagcaa aagttgtatg ttcacgcaga agctaacgaa ggtcaatcag 1980 cggaagcggc agtaaactta cagaactctt tgagagaagc ggaaatcgtg aaaagactat 2040 gtgcttactg ttccgatgaa aatcacgaag ttgctcgatg ccctcaattc aaggctttag 2100 acatggatgg cagatggaag gcgattagat caaaaggact ctgcagaact tgtttaattc 2160 cacatcgaaa gtggccgtgt cgatctggta aggagtgcgg aatagacggt tgtcgattcc 2220 gccaccatgc gttgctgcat tcgcggatag ctacggagcc agccagtgaa caagtcgtac 2280 gacaaaattt ccattctgtg acgaaattct ctctatttcg ttatatccca gttacgctgg 2340 aagggaatgg aaagaaagtt gacacgtttg cgtttttgga cgatggcagt gaatccacgc 2400 taatggaatc cggacttgca gcagagttga atattagcgg ttcttcggaa cctctctggc 2460 tgggctggac aggggatata tctcgagaag agaagggttc tcagcggata tctgtggcca 2520 tcgcagaaac cggattaaaa ggccaatacc gattggacaa tgtgcgtacg gtacagacac 2580 taaagctaca aagtcagtca ttccaatata gtgagctcca gaaaatctac ccacacttac 2640 gaggactccc attgcgcaac tattcaaacg cggtaccgag aatgatcatc ggcatcgagc 2700 atgcacaaat gcttacttcc ctgaaggttc gagaaggcgg tgcaaatgat ccagttgcag 2760 caaaaacgcg gctgggctgg tgcgtgtacg ggaaacagat cgatggaaac agcacggtta 2820 cccgacttca tgttcactcc gaggaaattg gtaaccgaga actacatgaa caaatgaaac 2880 aatatttcag catcgaagaa gtcgcagtgg caaccccgct ggagtcagag gaagatagac 2940 gagctcgagg tatacttgaa aaaacaactc gtagaatcta cggaggtttt gagaccggtc 3000 ttttgtggaa atatgatcga ccatcgtttc caaatagcta tccacttgct gtatgcagaa 3060 tgcattcgtt ggagaagaag ctttcaaagg aacctgcgct tcaagaacga gttaatgaac 3120 tgatagccga gtatgaagcg aaaggatacg ctcatcgaat tacacaagcg gaaattcagt 3180 cgacaaattc ggaccgcgtg tggtatctgc cgctcggcat cgtgaggaat cccaagaaat 3240 ccgaaaaaat tcgcctaatt tgggatgctg cggcgcgagt gaacggtgtt tcattcaacg 3300 acatgctcct aaaagggcca gacatgttga cgtcactgtt cgccgttcta ctaaggtttc 3360 gtcagaggcc agttgctgtg tgtggagata tccaggaaat gttccaccaa gtacggataa 3420 tcacacaaga caagcagtat cagaggttta tttatcgcga acatcgcgat caggcaccac 3480 aggtattcat catggatgtg gcaacatttg gtgcaacgtg ctcgccatgc tccgcgcagc 3540 atgtcaagaa tacgaacgcg agggatttcg catcactttt cccgaaggct gccgaagcta 3600 ttgttaatgc acattatgtc gatgagtact tagacagcgt cgatacgatc gacgaagcag 3660 ttcaactagt agaagaagtg agatacgtac acgcccaagg tggatttcac atccgaaact 3720 tctcatccaa ctctcctgag ctgctacaac gtttaggcca aaccaagcaa atagagaaga 3780 aatccatggt actggatgac atcatagata cggagcgggt gctagggatg atgtggaagc 3840 ctgctgagga cgtttttact ttcgacgtta acttgaagga tgatcttgca aagctgctat 3900 tggaaggaat aacgcctact aaacggcagg tgttgagagt cgttatgtcg ctgtttgacc 3960 catgcggatt catcgcccac ttcataatcc atggcaagat tctcatacag cacatctgga 4020 gatctggcac tgattgggat gagggtatta cggacggttt actcggctta tggcaccatt 4080 ggactcggct cctgcaacaa ttgatccaag tgaaggttcc tcgctgtttt ttcggcaaca 4140 caagtagtaa actacatagt ggaatacagc ttcacgtctt tgtcgatgca agcgaactag 4200 cttacgcatg tgttgcgtat ctgcggattg ttaaaaacaa gaaagtgtgc tgtgtattcg 4260 ttgcgtcaaa gacgaaagtt gcgcctctca agccgctctc tattccacgt ctggagctgc 4320 aagctggatt aatagggtgc cgactaatgg aaaatatatg tacagcccta gatctcccgt 4380 tggaaaggcg atacctgtgg acggattcaa gaacatgtct ctcgtggatt cgttcggata 4440 gccggcgcta tcacccatac ataggtttca gggtcggtga aattttgaat gtctctgtag 4500 tagatgaatg gcattatgtc ccgtcgaagc aaaatgtggc cgatgatgca acaaaatggg 4560 gatgtggacc tagcttcagc ccaaacagtc gctggtatac cggaccaaac tttctatact 4620 tgacggaaga agaatggccc gagcaaccac tgaaagaaat gacaaccgat gaggagttac 4680 gcgtagtatt tcatcaccat cgagaagttc ggcaaacata ctgtacgggc aacagctttc 4740 ttgataagag ccgttaaact gttcaaaaag gaccaccaat ctggaccgct gaccagcgaa 4800 gaactactgc atactagagt tcaacaaggt taaccacagg aaggaacaga aacaaattga 4860 gaagaatagc ccgttgtatg atcagtctcc gttcatggat gaggcaggtg taatacgaat 4920 gaatagtagg ataaacgctg cacctaccat atcattcgaa atgaagtatc ccatcatatt 4980 gccgaaagat catcctctaa caacccttct tatcgacagt taccaccgta gattcaaaca 5040 tcagaaccgt gaaacggtgt tgaacgaaat tcaacagcag ttccgaatac caaaattacg 5100 tcccttaata caacgtgttg ccaagaactg tcaatactgc aaggttagga aggcaatgcc 5160 aagaccacca atgatggcac cacttccaaa ggttagacta acacctcaca ttagaccatt 5220 ttgctatacc ggtgttgact acttcggacc acttctagtg aaacaaggtc gtagcttggt 5280 caaacgctgg gtggcactct ttacatgtct cacaacacgg gcaatacatt tggaaatagt 5340 gcacagtctc tccacacagt cctgtattat ggctattaga agattcggag cacgaagagg 5400 atttccaacc gacttctatt ccgataatgg gacctgtttc cgaggagcca gcaatttgtt 5460 gacgaagcag atcgaagcca ttcatgaagg ttgtgcatta acgtttacca atacgagaac 5520 gacttggcac ttcaacccac catctgcacc ccacatggga ggttcctggg aacggatggt 5580 gcgttcggta aaggaggcga tgggagctat ttcagaacat tctcgtcatc caagcgacga 5640 ggttctagaa actgtggtgc tggaagcgga agcgattgtc aactcaagac ctttgacgta 5700 cgtaccactg gacaatgccg aacaagaagc cttaacgccg aaccactttt tgctgtacgg 5760 ctccagcggt atagtgcagc cgaggtcatc actatcggac gaaggcgttg ttctgcgaga 5820 tagttggaaa ttggtgcagt atctagtgga cgagttttgg agacgctggg ttcgtgaata 5880 tttaccgacg ttgacaaggc gaaccaagtg gttcgaaacg gttaaacctt tggaacctgg 5940 agatttggta attgtaatcg acgaaagcaa gcgaaatggg tggctacgag gacgtattat 6000 cgaggtcatg cgaggtaagg acggtcaagt ccgaagtgca atggttagca cggcagatgg 6060 gataatgaag cgaccggcag taaaacttgc tctgctagat gttcacgatg atccgacgca 6120 gggtaccccg gaattacacg gaagggggaa 6150 // ID L2B-1B_CQ repbase; DNA; INV; 1597 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-1_CQ; KW L2B-1B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1597 RA Kojima K.K. and Jurka J.; RT "L2B non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 141-141 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. CC The consensus is ~86% identical to that of L2B-1_CQ. XX FH Key Location/Qualifiers FT CDS 2..1171 FT /product="L2B-1B_CQ_1p" FT /note="reverse transcriptase." FT /translation="CESALNLLLLKWKQCIEKGNIVLSVFVDLKRAFETID FT RTKLIRVLKKSGIRGTVLKWFSSYLENRKQVTRYNDAVSAETAVDLGVPQG FT SVLGPLLFILYINDIKQALKRVQVNLFADDTVLFVVGDSFDECFDIMNEEL FT GGFSEWLKWKKLQLNITKTKCMLVTTRQTKDCRRSVQMDGGIVERVETMKY FT LGVMLDEKLNFNEHIDYTIRKAARKFGVLCRINRYLTAETKVQLYNSLIAP FT HFDYCSSILFLATQRQMKRMQVLQSKIMRLILRCDRLTPRRSMLECLQWMS FT VKQRIEYNTLVFVFRVKERMAPQYLTEAMVRGRDIHEHDTRGADDLRLQIW FT RKACTQNSLFYKGFKLYNQLPEAAKMTSNINEFKRSCKEFVRQRPLE" XX SQ Sequence 1597 BP; 492 A; 310 C; 427 G; 368 T; 0 other; gtgcgagtcg gcgctgaatc tcctgttgct gaagtggaaa cagtgcattg aaaaaggaaa 60 tattgttctg tcagtcttcg tagatttgaa gcgggccttc gagaccattg accggactaa 120 attgatacgt gtgctgaaga aaagcgggat tcgtggaacg gtcttgaaat ggttcagcag 180 ttacctagaa aaccggaagc aggtcacgag gtacaacgac gcagtctcag cggaaacagc 240 agtggatctt ggagttccac aaggaagcgt gctagggcca ctattattta tcctctacat 300 aaacgacatc aaacaggcgc tgaagagagt acaggtaaac ctgttcgccg atgacacggt 360 actatttgtt gttggtgaca gcttcgatga gtgttttgat ataatgaacg aagagctggg 420 cgggttctca gaatggttaa agtggaaaaa actgcagctg aacattacga agacaaagtg 480 catgttggtg acgacacggc agaccaagga ttgcagaagg agcgtacaga tggatggagg 540 aatcgtggag cgtgtcgaga cgatgaaata cctcggagtc atgctggatg aaaagctgaa 600 tttcaacgaa cacatcgatt acactattcg gaaagcagca cgaaagtttg gcgttttgtg 660 taggattaat cgttacttga cagcagagac aaaggttcag ttgtacaact cgctgatcgc 720 gccgcacttc gactactgct catcgatctt gttcttggca acacaacggc aaatgaagag 780 gatgcaagtt ctccagagca aaatcatgcg gctcatcctt agatgtgatc gactgacacc 840 aagacggagc atgctcgaat gtttgcagtg gatgtcggtg aagcaacgga tcgaatacaa 900 cacacttgtt tttgtttttc gtgtgaagga aaggatggca ccacaatact tgacggaggc 960 catggtacgc ggaagggata tccatgagca cgacactaga ggagctgacg atctcaggtt 1020 gcagatttgg aggaaggctt gtacacagaa ctcgttgttt tacaaaggtt ttaaactata 1080 caaccagctt ccagaggcag caaagatgac gagcaacata aacgagttta agcggagctg 1140 caaggagttt gtacggcaac ggccgttgga gtagaagtga cccacgatgg tactgtgacg 1200 aagagcacgt tatgacggtc ggccatcttc attatcggta cacatcgcat gggatcactt 1260 tgggccgcat atgataaatt tattcaaaag taacgcgaat ctgggcgcgg tttaacccta 1320 tgggctcata tgtgcgtgga atagcaaatg gttccaatac cctaaatgga taaaaatact 1380 gcaatctgat tgataaaatg actcgacact ggatttgtta gagcgccttg agataggaca 1440 cgagagaaat ggatgggcat acacggaagt agagtcaaga ttacaggaca ctctcaaaac 1500 acgaacgacg aattatcgaa agatatctgc tcgtaaacct tccatgctac aaaaactgtg 1560 tatgggtaag aggtgggcca tccaaggaaa aaaaaaa 1597 // ID Gypsy-3_AC-LTR repbase; DNA; INV; 193 BP. XX AC AASC02058326; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_AC_; KW Gypsy-3_AC-I; Gypsy-3_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02058326; Positions 4615 4807. XX SQ Sequence 193 BP; 46 A; 41 C; 47 G; 59 T; 0 other; tgttgatatg tgtacattgt tcgtgtttcg taccatttcc gatgaatgtt tttgtaacca 60 ctgagttcag cacgagatta aaggactggt gagccagtgg gtcattccct actcagccgg 120 caaacgtgca agtcgagtct gttcttcttg gtggtgtaca tgttgagcca aacggtatac 180 acgctcacaa aca 193 // ID DNA8-61_AP repbase; DNA; INV; 647 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-61_AP. XX NM DNA8-61_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-647 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1995-1995 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 647 BP; 329 A; 66 C; 56 G; 196 T; 0 other; cagggatgga aaacgttttt aaaaaacgtt attaaacgga aaaaaaaaca aaaacgaaaa 60 caattaataa aacaaaaaca aaaaacaact gaaaaacgat aacaaaaaat cccaaaaacc 120 gaaaaaaatt taataacgaa atcaaaaacg aaaacgaaaa taaattatag tattatacat 180 tattaaatta taccacgaaa aaaaaatttt tgaaaaaaat tcacttgtct ttacttttta 240 atctttaact ttaaaaacat tttaatttga aatcagaaat tttgaatttt ctaattttta 300 tataaattat aattttgatt taaaatttaa atattaagaa taattaattt taatttttaa 360 cttttaagtg catttctgta ttgcgcgtta ctattataac ttaaaagtta aaagttataa 420 gttataagaa tataatataa aaaaaaaaaa tattatcaaa aacgttattt ggaaacaata 480 ggaaaataac gtttttaaaa acgttaatcg aaaacaatat ggaaaaataa cgtttttaaa 540 aacgttaatc aaaaacgata tagaaaaata acgtttttaa aaacgttaat caaaaacgta 600 ataaacaaaa acgaaaacga aaaaattaaa aacgttttcc atccctg 647 // ID Gypsy-1_DPu-I repbase; DNA; INV; 4622 BP. XX AC scaffold_50; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_DPu_; KW Gypsy-1_DPu-LTR; Gypsy-1_DPu-I. XX NM Gypsy-1_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4622 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 717-717 (2010). XX DR Genome; scaffold_50; Positions 515409 510788. XX CC Positions [3585-4052] - Integrase core CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1394..2929 FT /product="Gypsy-1_DPu-I_1p" FT /translation="MAILRLMGEDVNNLLQNGVDSLTAANNSSLECAGRLE FT LQLHYRGRSLTTSILFSPKHDGMLLSWFVCVELGLLPPCYPEPIMINTIVT FT ASPPPIDMSDIPALEQKLLSDFEVVFDEEGPLKTMSGPPMSIELLPDAVPF FT AVSGARPIPYALRDKAKHMLDDMEANQIIAPVTEPTLWTHPMVIVEEPNGK FT LRFCADLTKLNQYVKRPYYPLVSPKDAVSSIASGAKVFGTFDARHGYWQIP FT LDEESQLLTTFITPWGRYKFLRGPMGLSSTGDEFCRRMAAALSHLPNLHCV FT VDDLLSTHADLPTHYNTVREILTACRENQITLGAAKFKFASNSVPFAGYVV FT GSDGIAADPEKLSAIADFPRPVNITQLRSFLGLVGQLADFSDEISAAAGPL FT RPLLRQGNQFVWSADQEKAFETVKRALISPPVLANFDPSAETVLQTDTSRK FT NGLGYVLLQCQGDHWKLIQCGSRFVSDTESRYAMVELELLAVVWATKKCHL FT YLVGLPKLPSSSIISR" FT CDS 3522..4622 FT /product="Gypsy-1_DPu-I_2p" FT /translation="MVRACETCQKALPSLQREPLMSDPSPTRVFEDVSVDI FT FCHAGNHYLIYADRLSGWPAVFEFVKRDLCSRDVIRSCMRCFADFGVPVRV FT RFDGGTNLTSVEILNFSKNWEVISVNSTPHYPHSNGHAEAAVKAMKNLVQK FT AAPNGHLDTDAFQQGLLEWRNTPREAGLLPAEIVFGHPIRSILPAHHTAFA FT SKWRELMDLSDRLDDQAERVKAAYDQHARPLRPLRVGDSVRVQDADYKRWD FT KQGITVSVGESRDYRVKMQSGRIYWRNRKFLRFVYPPPATEADPAAPSPAI FT DANPEEEPGPLPPPDHSSNQRRRRVNFDLPPRRSNRKRHPKCLTMPKLISL FT LSFDPDCKNFVFVKLSCNENLVGS" XX SQ Sequence 4622 BP; 1116 A; 1278 C; 1123 G; 1105 T; 0 other; tggcgcagtt gataacttaa ccacttattt cgaccgactg ctacaattct actttgccaa 60 agtgggtgcg ttctgagagc attttcccct cgtggctcac gtgttggccg ccatcttaga 120 cagtttttga tctcgtgttc cctgtgtggt ttatccgtcc tcatctgccg aaatgtctgt 180 catcggcagt attggcgata tgccacgaga ctatatgatc attgacattc ccgacgatga 240 aatccagcat ctctctcacg cccaactcat cgctttggtg cacgagcaaa gtcagaaatt 300 tcgaacggcc atggaacgcg aaaggtcggc tcatgactaa gagcttgaga ctcacatgga 360 agaagatcgc cttcgtcgat ccgatgttcg tcccagaagc catagtgaag tgccggacga 420 acttgaccgt gatcgccatc gcgaacgtcg ccgcccaatc aacgtgtcgt gcatcgataa 480 aatggcgggt aatatttcct accgtgactt tttgacttgg cgcaacaagt gggacgattt 540 ttgttatctt caaaggattg ctgaatatcc gatacgtgaa caagctgctg cattgcgtat 600 gactctgtcc ttggaaatgc tgcagacggt cgagatggta cttgatatct ctccttacga 660 cctcttatcg gcagatgaaa ttcttgcaca gatcgcgcga tacattcgga agaaacgcag 720 tgtagctttg gatcgcgtcg agtttgaaga atatcgtcag gaccacggcg ccactttcga 780 cgagttttaa attgggctgc aacgcatcgc caaatgcgct gatttatgcc aacattgctt 840 tgatcaacgt atgacaacgc gtgtcatgag tggcatctga gaccaggagg tgcgtaaaaa 900 actgctcgct atcacaccgt tcccatcgct acagaccgca gttgacctat gccgcagtga 960 agaagcggcc ggcaaaaacg aatctttgtt gagccgttct ggtgcaactt cactgaacgt 1020 ggtaaaacag ccggaggaaa agggaaatcg accccgcacg cattttcacc gccgtggaga 1080 acacagggat ggactcaggt gcggaaattg tggccacccc cctcatgatt ctggcaagga 1140 ttgcccagct aaaggtagag aatgtcataa ttgcggcaag ccgggtcatt tttcgacggt 1200 ttgtgacaag cataaagcta ggagcgagaa acaggactca agcgtcaaga aactcgcaag 1260 cattcgcctc gcacacgtcg cagctgcacg tcgtgcccca accatccaga tcggtattca 1320 taatcggtcg ggtggacacc tctacaccgc caccgctatt ccggacagcg gtgcggaagc 1380 aacggtaacc agtatggcaa ttcttcgatt gatgggtgaa gacgtcaaca atctgttgca 1440 gaatggtgtc gatagtctga cagcggccaa taattcgtca ctggaatgtg ccggtcgtct 1500 tgaactgcag ttacactaca gaggccgatc acttactaca tccattttgt tcagccccaa 1560 gcatgatgga atgctgcttt cttggttcgt gtgtgtagaa ctaggccttc ttccaccctg 1620 ctacccagaa cccatcatga taaacacaat tgtcaccgcc tccccacccc caatcgacat 1680 gtcagatatc ccagctttgg aacaaaagct cttgtctgac tttgaagttg tcttcgacga 1740 ggaaggtcca ttgaaaacga tgtcgggccc gccaatgtca attgaattac tgccggacgc 1800 cgttcctttt gcagtaagcg gcgctagacc aattccatat gccttgcgtg ataaggctaa 1860 gcatatgctg gatgacatgg aagccaacca aatcattgcc cctgtcaccg aaccgacgct 1920 gtggactcat ccaatggtaa tcgtcgaaga gccgaacgga aaattgcggt tctgtgctga 1980 cctgacgaaa ctcaaccagt acgtgaaacg cccttattac ccgttagttt ctccaaaaga 2040 cgctgtatca agcatcgcca gcggtgcaaa agttttcggc acgtttgacg cccgacatgg 2100 atattggcag attccgctgg acgaagagag ccaactcctg accaccttta tcaccccgtg 2160 ggggcgatac aaattcttac gtggaccaat gggcctctca tcaacaggcg acgagttctg 2220 ccggaggatg gcagcagcac taagtcattt gccaaatctt cattgcgtcg tcgatgattt 2280 gttatccacc cacgctgatc taccgactca ctacaatact gtccgggaga ttctgaccgc 2340 atgccgagaa aatcaaatca ccctcggcgc ggccaagttc aagttcgcgt ccaactcggt 2400 acctttcgcc gggtatgttg taggcagtga cggcattgct gctgacccag aaaagctatc 2460 ggcgattgct gattttccac gcccggtcaa tatcactcag ctgcgatcat ttcttggttt 2520 ggttggtcag ctggccgact tttccgatga aatttccgct gccgccggtc cattgcggcc 2580 gctcctacgc caaggcaatc aatttgtgtg gtcagctgac caagaaaaag cttttgagac 2640 cgtcaaacgc gctttgatat cccctcccgt cttggcgaac ttcgacccat cagctgaaac 2700 agttctgcag acggacacat caagaaaaaa tggcctcgga tacgttttgt tacagtgcca 2760 gggcgaccat tggaagctca tccagtgtgg ttcgagattt gtttcagata ccgagagccg 2820 ctatgctatg gtcgaactgg agctgttggc ggttgtctgg gctacaaaaa agtgccatct 2880 ttacctggtc ggcctcccga aattaccctc gtcgtcgatc atcagccgtt agtaacgatc 2940 ttggaccgtt atacactgga ttgtgttgaa aacccccgtc tgcagcgact aaaggagaag 3000 ttacagcact acgtcttcca aaccgtttgg cgccgtggta aagaccacgc catccatgac 3060 gctctttccc gcgcacccgt tgctgatcca atgcccgacg atatgttaat tcgaagtctt 3120 gacgaccagg cacgcgtcac aattttcaac atcgttgcag ctatcgagtc agatgaagac 3180 gttgaaccag cctcacactt gccagatccc ttgctggccg atctacgttc cgcagcgtcg 3240 tcagacgacg aatattgcgc tatgatagcc gcaatccagg acgtgttccc ttcgcgccaa 3300 gaggagtggc cgattgcaat ccgcggattc tggaagatcc gcaacgatct gtggaccgat 3360 gacggcctag tgttgtacaa cagccgtata gtgatccccc cccctccaaa cgtgccgaaa 3420 cattacgcaa attacattct gcgcatcagg gcatagagcg cacaaaacgt cgtgcccgtc 3480 aactcgtcta ttggtcgggt ctatccagcg aaatcaccaa catggtcaga gcgtgtgaaa 3540 catgccaaaa ggcgttacca agcctccaac gcgaacctct catgtctgat cccagcccaa 3600 cgcgcgtgtt tgaggacgtc tccgtcgaca ttttttgtca cgccggtaac cattatttga 3660 tctatgctga tcgtttatcg ggttggccag cagtgttcga attcgtgaaa cgtgatcttt 3720 gttcccgcga cgtgattcgg tcgtgcatgc ggtgcttcgc tgatttcggt gttcccgtcc 3780 gagtacgttt tgatggtggc accaacctga catcggttga aatcctaaat ttttccaaga 3840 actgggaagt aatatctgtc aattcgacac cccattaccc tcactctaat ggccatgcag 3900 aggcggccgt caaagctatg aagaatctgg tccaaaaagc agccccgaac ggccatctgg 3960 acaccgacgc cttccaacag ggcctattgg agtggcgcaa cactccccgt gaagctggcc 4020 tattgccggc cgaaattgtc tttggccatc ccatccgatc cattctgccg gcccatcaca 4080 ccgcttttgc cagcaagtgg cgcgagctga tggacttatc ggaccgcctg gacgatcaag 4140 ccgaacgtgt gaaagccgca tacgaccagc acgcacgccc attacgcccg ctacgagttg 4200 gtgactctgt ccgtgtccaa gacgctgatt ataagcgctg ggacaagcaa ggcatcactg 4260 tgtcagtcgg cgagagccgc gactaccgcg tcaaaatgca aagtggccgg atttactggc 4320 gaaacagaaa gttcctccgc tttgtttacc ccccgcccgc caccgaggcc gacccagctg 4380 ccccgtcacc cgcaatcgac gcgaatccag aggaggagcc cggtccactg cctccgccgg 4440 atcattcgtc aaaccagcgc cgccgccggg ttaattttga tctcccccct cgtcgcagca 4500 accgtaaaag acacccaaag tgcctcacga tgcctaaatt gatttcccta ttgtcatttg 4560 atcctgattg caaaaatttt gtgtttgtta agttgtcatg taatgaaaac ttggtgggga 4620 gc 4622 // ID Gypsy-12_IS-LTR repbase; DNA; INV; 176 BP. XX AC ABJB010305691; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_IS_; KW Gypsy-12_IS-I; Gypsy-12_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-176 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010305691; Positions 5361 5186. XX SQ Sequence 176 BP; 36 A; 46 C; 38 G; 56 T; 0 other; tgtagtgttg tttgttaccg tcaggtgtcg ctcttaacac gactaacact aattctaaca 60 tactatcata gagggcgctg cctcactgtt acgtttgtat atattccgtg aataaagact 120 gccgtgggtc tttgcccatc cggcccagct gactctgtct ggtcccggtc tctaca 176 // ID Kolobok-3_TV repbase; DNA; INV; 2429 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; KW Interspersed repeat; Kolobok-3_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-2429 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 120-120 (2007). XX DR [1] (Consensus) XX CC Kolobok-3_TV is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the T. vaginalis genome CC in a last few million years. The Kolobok-3_TV transposon is CC characterized by 14-bp terminal inverted repeats, TTAA target CC site duplications, and it encodes the 424-aa Kolobok-3_TV1p CC transposase. Kolobok transposons, including numerous families of CC non-autonomous elements, constitute >2% of the T. vaginalis CC genome. See also comments in Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS 717..1988 FT /product="Kolobok-3_TV1p" FT /translation="MCDHLFKKPSSYSSFFMPRRTKAVAKRIKNLVQSAKN FT RVEPYVVNTVEFVLSVLLSGATFCQSEFQFMLNNIKVPSEATFHRVQEKVG FT RVIIEVARESVNYWKSRMRKCSGLLFDGSWSQRRNAMFCYVQFVEEKLKKI FT VDWEVISKSFKNFKGNFNGKSNEMEFEGLKRMLKRWNNEKRVNFFVHDGDV FT KIVSTIKNTFKGIREYRDPGHFLNNIQKKLKLPEFRILSSISKNLLRWLRQ FT LLNDTHMSIKTKKFLWLNSAKHYAGNHKFCPDPEKCKMIKPWKYAKNKTAI FT KTLKKFLEDTVKIFDMVKKIHSTQVVESINHIKAMLANKNINWHASWPIRM FT AVTILHFNESMFETIVAIRYRLNLPTMPEMMNRYFRMYDTTKDLIKAFKNS FT KQVQKKFAALRAIKRGLQATDDRITLKSHK" XX SQ Sequence 2429 BP; 900 A; 343 C; 373 G; 813 T; 0 other; ggatgggcac tcattagggt gatgaaaaaa atcaagtcta gcaaagtcaa tatattaaaa 60 acagtgaaga ctattatata gcattatagt gcacaagctc aaatatttag tgatgcattt 120 ctcgaactta ttttatttat aagttaacct tgaaacatga attttagtat atcatgagca 180 tatttttttc taattttgca atacttataa tatcatacca ggattcatgt ggaagtctaa 240 tatttggtat ataatattct tacaataaat gtattaattc aattatattc aaaatgatgg 300 gtcgcaattt ttactgaaat tgcaaattta atttcaacaa tgaaattatt gtttaaaaaa 360 aatagcacaa cagcaagatg gacaatcata tatagaatta taatgctcca tataatgctt 420 tgatttgatg cgtatataat gttatttttc gaacatcttt gtaaaaaatt gatttttgta 480 atttttgaaa aaattcgatt ttatttcgag atttcctcta taggtgaagt atattgaaca 540 atattcctat gggagaaaat aattgtaagt atgagtactt tcattaatac tatgggtgcc 600 atggttatcg ctcaaacgat caagttatat ctcgattgcg taaattgaaa ttttgctatt 660 ttctaaattc acggcaacaa cattctttta tcctgatcgt acatcacaat atttgaatgt 720 gcgaccattt gttcaagaaa ccgtcaagtt attcttcatt tttcatgccg cggcgaacaa 780 aagcagttgc aaagcgtata aagaacttag ttcaaagcgc gaaaaatagg gtagagccat 840 atgtcgtcaa cactgttgaa tttgtgctat ctgtattatt gagcggcgct acattttgcc 900 aatcagaatt tcaattcatg ctgaataaca tcaaagttcc gtcagaagct acatttcatc 960 gagtccaaga aaaggttggt cgtgttatca tagaagttgc cagagaatct gttaattatt 1020 ggaaatctcg aatgagaaaa tgttcgggac ttttgtttga cggttcttgg agtcagagaa 1080 gaaatgcaat gttttgctat gtacaatttg tagaagagaa actcaaaaaa attgtagatt 1140 gggaagtgat atctaaatct tttaaaaatt tcaaaggtaa tttcaacgga aaatcaaatg 1200 aaatggaatt tgaaggctta aaaagaatgc tgaagcgttg gaataatgaa aaacgtgtaa 1260 atttttttgt gcatgatggc gatgtcaaaa ttgtttctac aatcaaaaat acgttcaaag 1320 gtatcagaga atacagagat cctggacatt ttcttaacaa tatacagaaa aagctcaaac 1380 tgccagaatt cagaatctta tctagcattt caaagaattt attgcgttgg cttaggcaac 1440 ttctcaatga cacacatatg tcaattaaaa caaagaagtt tttatggttg aattccgcta 1500 aacattatgc cggaaatcat aaattttgtc cagatcctga aaaatgcaag atgattaaac 1560 catggaaata tgcaaaaaat aaaactgcta taaaaacact taaaaaattc ctcgaagata 1620 ccgtaaaaat ctttgatatg gtaaaaaaga ttcattcaac tcaagtagtt gaatcgataa 1680 accatatcaa agcaatgctt gctaataaaa atattaattg gcatgcttct tggccaatta 1740 ggatggcagt cacaatattg catttcaatg aaagtatgtt tgaaacgatt gtagcaataa 1800 gatacaggtt aaatttacca acaatgccag aaatgatgaa cagatatttc agaatgtatg 1860 atacaacaaa agatttaatc aaagcattca aaaattcaaa gcaagttcaa aaaaagtttg 1920 cggcattaag agctataaag agaggtcttc aagctaccga tgatcgtatt actcttaaat 1980 cacataaata attattaaat ttatttcacc ctcaaaacat cataatcttg caccttcttt 2040 gaataaacac tttcaccttc tatttttaat taatgttttt taattgtcca aaatttttag 2100 attatataag atataatttt ttataaaagt attattttct aaaagataag tatatattct 2160 taatcacaaa acctagcatt ttatcatatc tatatatcag cttcctaaat ataatactca 2220 tatagttagt atgattttaa tgtttttgaa catgaaacga aaatatgatt acaatcatat 2280 aacataatgg agttattgat ataacttata tgcatgatac attaagaagt cttccaacaa 2340 tatagatata gttgtagatt cctatgattc cttttatatt gtgagataat aacattttta 2400 tgctgtcagg agcctatgag tgcccatcc 2429 // ID Gypsy-13_DWil-I repbase; DNA; INV; 5038 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_DWil_; KW Gypsy-13_DWil-LTR; Gypsy-13_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5038 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 451477 446440. XX CC Positions [2460-2966] - Reverse transcriptase CC Positions [4131-4451] - Integrase core CC 'GTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2283..4451 FT /product="Gypsy-13_DWil-I_2p" FT /translation="MLKELLDRYRLIFARNDDETGQTNLASHHIDTGPARL FT IKQMARRIPLARKAEVQQLVADMENRDIIRRSVSPWSSPVVLVKKKDGATR FT FCVDYRRLNEITVKDSYPLPKMDDILSTLYGSQWFSTMDLQSGYWQVRMDE FT PSRPKTAFTCDAGLFEFNLMPFGLCNVPATFERLMDNLLSGVMWKFALVYL FT DDVIVYSRTTEEHLTHLQDVFKRLADAGLKLSPKKCSFFKKQISYLGHIVS FT RDGLQACPSKIDKVLSWPTPRTKEVRSFLGLCSYYRKFIPNFATIARPLHQ FT LTEKLRRFEWDTHCSHSFDTLKQCLTTAPVLAFPNDTGQFILDTDASNSGA FT GAALHQLQDGQERVIEYYSKCFQPVERNYCVTRRELLAVVRATKHFHHYLY FT GRPFLVRTDHASLQWLYNFKEPEGQMARWIERLQQYDMQIQHRSGTQHGNA FT DSLSRRPCSDCPTCERVERKNGVVVLFTQVLLPDLRKEQEADQALGKVRDW FT MNKSERQPKTTLWGESPENRALWHLYDRLAFDEIAVITIKNSENIPVALMP FT RSLVDAAIRGVHDSPSGGHFSAKKTLAKAKKRFYWIDQSADITLWCARCTI FT CNARKGPPRRTVGRLQPSLMGAPFDRLAIDILGPLPTTPRGNRYVAVVMDY FT FTKWPECAAIPDQTAETVARALVQEVFSRFGVPFVLHCDQGRNFEAGVFQA FT CMRLMTVEKTRTTPLHPQSDGMM" XX SQ Sequence 5038 BP; 1289 A; 1331 C; 1290 G; 1128 T; 0 other; ttggtgtcag gtgtggggtt atgttcggtt actgcgaaga acctcaccac gtgtaaaatg 60 cctaagaaat cgacgcaggc tcggaacagc agaatagaag aggacagcgg cagttctatt 120 attgctatag cacaatcagt ggaggcggtg aataaaaaga tcgaccgaat gtgcgaagcc 180 ttagaacggt cctttataag cgcgcagccc acagcggtgc cagggcccgt ttcgccgccg 240 gaacacacca cgtggggtgg gttaccaacc aaggcaggac gcgctgccac gggccgcttc 300 caccacctac ggagcggcgt gggcgcgatc accccccgtc ccgcaattta atcctggttt 360 gttgccgttg tttgatgaaa ctggatcttg gggcgaattt gagcgactgg tagacaaggt 420 ggcgaattac tatcagtggg cagaggatga gaaatgtttg gcagtggtgt tgaatctgcg 480 cggtcgcgcc tcaacatttt atggcacctt tcctacacaa cctacactta cctactcagc 540 agtgtgtgcc gccatgcgtc aacgattcat gctagattgc cggtcgccca gggaatcatt 600 gcccgatttc gcattggaat tggaacgctt ggcccaatgc gcttttagtg gaatgcctgg 660 acatatgtat gaccagattc tggtgcaaca atttattgac ggtatgcgcg aaactccgct 720 acagggatta gtaatttcta gtcgaccgat tcagctacaa gaggcattgg acgtagcgat 780 gcgagtgcaa gcccagtttg gacgcacaca cggtaatccg gttctgcaca gtcaggtggt 840 cgagagtgga gacatccata caccaaaacg tcgccgacga agacgaggtc gcagccagct 900 cacgaaagag gcaccactaa caacaagtca cccttcggaa aacagcagca tgccaatgtc 960 gcgtggcgaa cattggcaat agtggacaac gcccccaccc aacaatacag ctattctatc 1020 tccaacacca gtaaatgtac cggcgaccct cagccccaat tcagtgaaga acgcatcagt 1080 tgctttctaa cgccgagccc ccaggcatca cgaacggaga ggctagatga gggtgaaagc 1140 gcggaaccgg cagtggaaaa tattctggcc cctatcgacc gtccagatac ccgaaattca 1200 gccaccagta ccaggaacgt tcgccattct aacgtttcca ccgttaatgg agttgagcgg 1260 tctgccatat ttgagcattc cgttcctacc acattagagc catccacgga ccagagcagc 1320 aggcatccaa ccacgccgtt gcacgagtca ctatggacag agcccgcggt tggatgcgag 1380 aaccctggct tgaacgaaga aaccgtattg gcgcgattgt ttcgtgaacc cttggccgcc 1440 gtcccaacaa atatatgcgc taaatgcccc cctgaagtaa acaggaaaga tgctgctatc 1500 ttaaatggcc gacgtgggac ggggaatcct atccaagcta cgtattcacc tgtaccaaat 1560 gctaaggaca gcttacattg tattgttttc tttcagcaga ggccattgcc aatagtggtg 1620 gataccggcg cggcagtctc ggtgttgaat tggcgagatg tgaagcagcc aatacgaacc 1680 tcaccatgcc aggtaaccat gtatggtatg gcgtcgtcaa cgctcacctc cactgaagaa 1740 acgatggtag agttgcaagt gctgcgcaag cccctgattc atcgatttgt cctggtgaag 1800 gattctacgc ctacactgct ggttactgat ttttggctag aacacaaggt catgatcgac 1860 tttgataaga tgcacttaag gttccactgg ggtaccatgc ccttgctccg cgcctccgaa 1920 gcgccacgaa atccgatata cttgatgact cccgtctgtt tggcgccacg ctccgacacg 1980 catgtcagag tgcaatgtgc gacgactaag ccggcgttac cgagcttact gaaggaggac 2040 tgttacacag agctgccgga cgatatcgtc ctgagcaccg gcgtcgtcca accagtggcg 2100 gccaattttc tgccaggata tcgaatctaa gcgcacacga gagaacgctg ccacccacac 2160 tagttttggg cgacttagaa accgtagaag tggtattacc agacagatgc gaagatgcga 2220 gctcagattc tgtcgattgg gaccgattcg gggaagtgga cctaccgcct acaaaacaac 2280 taatgctaaa ggaacttttg gatcgctatc ggcttatttt tgcccgaaac gatgacgaga 2340 ctggtcaaac caacttagct tctcaccaca tcgataccgg gccggcacgc cttattaaac 2400 agatggcccg aagaatacct cttgcgcgca aagctgaggt gcagcaactg gttgctgaca 2460 tggagaacag agacataatt cggcgatccg tcagtccgtg gtcatcgccg gtggtgctag 2520 tcaagaagaa agatggagcg acacgatttt gcgtggacta taggcgcctg aatgagataa 2580 cagtcaagga ctcatacccg ctgcccaaaa tggacgacat cctatccacg ttgtacggct 2640 cccagtggtt ttcgacgatg gacttacaaa gcggatactg gcaggtgcgg atggacgaac 2700 cctcccgacc aaagacagct ttcacttgcg atgctggact tttcgagttc aatttgatgc 2760 catttgggct ttgcaacgtc cccgctacgt ttgaacgttt gatggacaat cttctgtctg 2820 gggtgatgtg gaagttcgcg ctagtctatc tcgatgacgt gattgtttac tctcgcacga 2880 ccgaggagca cctcacacac ctgcaggacg tgtttaaacg actggcagac gctggcctta 2940 aacttagtcc gaaaaagtgc agcttcttta agaaacaaat tagttaccta ggccatatcg 3000 tgagtcgtga tggattacag gcgtgtccca gcaagattga taaagtgctg agctggccta 3060 cacctcgcac caaggaagtg cgatcgtttt tgggtctttg ctcatactac cgcaagttta 3120 tccccaactt cgctactatc gctcgaccgc tccaccagct gactgagaaa cttcgacgct 3180 ttgaatggga cactcactgc agtcactcct ttgacacctt gaagcagtgt ctaacgacag 3240 cccccgtctt ggcgtttcct aatgacaccg gtcaatttat tctagacacg gacgcctcga 3300 attctggggc gggtgcggcg ctccatcagc tacaagatgg acaagaacga gtaatcgaat 3360 attatagtaa atgcttccaa ccagtagaac gtaactattg tgtaacacgt cgagagctcc 3420 tcgccgtagt ccgtgccact aaacactttc atcattattt gtatggccgt ccgtttctcg 3480 tacggactga ccatgcgtct ttacagtggc tgtacaactt caaagaacca gaagggcaga 3540 tggcacgttg gatagaacgt ctgcaacagt acgacatgca gattcagcat cgaagtggta 3600 cacaacatgg taacgctgac agcctctctc gccgaccttg tagcgactgt cccacatgcg 3660 aaagagtcga acggaaaaat ggagtggtcg tgttgtttac ccaagtcctg ttacccgacc 3720 tgcgcaagga gcaagaggcg gaccaggccc tcggtaaggt gcgcgattgg atgaacaagt 3780 ccgaacgaca gcctaaaaca acattgtggg gcgagagccc agaaaataga gctttgtggc 3840 atctctacga ccgattagcc tttgatgaaa tagcagtgat taccataaaa aactcggaga 3900 atataccggt agcccttatg ccccgctcac tggtagatgc tgccattcga ggcgtgcacg 3960 actctccatc cggaggtcac ttcagtgcta agaagacttt agccaaagcg aagaaacgat 4020 tttactggat cgaccaaagt gccgatataa ccctctggtg cgcacgctgc accatctgca 4080 acgcacgcaa aggaccacct cgaagaaccg tcggtcgcct gcaaccaagc ctaatgggag 4140 ccccgttcga tcgtctagcg attgacattc taggccctct acccactacg cctcgtggga 4200 accgttacgt cgcggtcgtt atggattatt ttaccaagtg gccggaatgt gcggccatac 4260 cggaccagac agccgaaacc gtagctcgag cattggtaca agaagtcttc agtcgattcg 4320 gcgttccatt cgtcttacac tgtgatcaag gacgcaactt cgaagcgggc gtcttccagg 4380 cctgtatgcg gctgatgact gttgaaaaga cacgtaccac gccgctgcat ccacagtcag 4440 acggcatgat gtaacgcttc aatcgcacgt tgctagatta cttggcaaag tttgtatcta 4500 ctagccagca tgattgggat gagctgctgc ccatggcgtt attggcctac aggtcagcgg 4560 ttcatgaaag ctccggttat ttgcccgcaa ggctaacgtt tggaagagaa cttaggctgc 4620 ccaatgattt actatttgga acaccagctc ctgtcccaac gaaagtcccg gaattcttgg 4680 ctaagctgca aaatcaactg tgggtggtaa atgcgaacgc tcgccgccaa ctgctaacgg 4740 ctgcccagac taacaaaatc cgctacgacc tacgatcgca ccctcaggaa ttctgccccg 4800 gcgacgaagt atggctctat tcacctagcc gaagagtagg ccgttgccca aagctacaat 4860 gcgattggat tgggcccgct acaatagtac gtcggatttg ggatcttgtc tacgaaattc 4920 gaccccctgg aaaaagagtt accagaaccg tgcatgtgaa ccgtttggct gtttaccggc 4980 ccgatcctcg caacgacttg gtactcgggc aggagcctcg agtctaagga gggggtaa 5038 // ID Gypsy-11_RP-I repbase; DNA; INV; 2015 BP. XX AC ACPB02037242; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_RP_; KW Gypsy-11_RP-LTR; Gypsy-11_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-2015 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02037242; Positions 30664 32678. XX CC 'ATAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 86..1660 FT /product="Gypsy-11_RP-I_1p" FT /translation="MQLGNIKRIESAIMFAPVKEAKRFYEIIFGLDEGRLS FT RKKLKEFEGFKLNDVDLCNKVEELKDTFSLWELKSVANLLGIQLSCRQDVA FT SELCKFLNDISLLLEAISEEQPEEEENEYFEEVSGNSTSSKHVQQCRPYQF FT SSRDIEESLRPFTGEDNYTINLWLTDFEELAQVMRWQELEKWVFAKKSLKG FT LAKLFVSGERGLTSYSALKEALVEEFSSEDSVKESCFRFNKFVSDSKVETR FT KCFNCYEEGHLVAECPKPRREKGSCFECGKMGHLQRECERRKQSTTSSVSL FT VAETSALPPSTADILREGATTSGEVEALPGSVISLIVEDRVSEDDILREVA FT INDGSEVMLIDYSVEEIVNPELTVNYVKPIEKPSEPAVDFEVEMKVIPEQN FT LSMLQENDKGNCESSEYLKNKMLKRFEICNELVYRKKKGKILFVVPGKMEH FT QNSRKFYDYVGHLGINKGAELILRSCRFSKLQDKIKDYIDSCVNCIAYIPI FT LGKKEMWWTMFCSMLGTCLGDISGNSDSF" XX SQ Sequence 2015 BP; 647 A; 235 C; 487 G; 646 T; 0 other; acttcaggtg tggggtaaag aactggtaaa gtgagttaaa gttcggttaa tttggttaag 60 aatagtcaat ttgtgagaaa ccaaaatgca gcttggtaac attaagagga ttgaatccgc 120 aattatgttt gctccggtaa aggaggcaaa aagattttat gaaattattt ttgggttaga 180 tgaagggcgt ctaagcagaa aaaagttaaa ggaatttgaa ggtttcaagt tgaatgatgt 240 ggacttatgt aataaagtgg aagaattgaa agacacattt tcattgtggg aattaaaaag 300 tgtcgcaaac ctattaggta ttcagttaag ctgtagacag gatgtggcgt ctgagttgtg 360 taagttcctt aatgatatta gcttattact tgaggcaata agtgaagaac aaccagaaga 420 agaagaaaat gaatactttg aggaagtaag tggaaattct actagttcga aacatgttca 480 gcaatgtcga ccatatcagt tttcctctag agatattgaa gaaagtctgc gaccgtttac 540 aggagaggat aactacacca ttaacttgtg gttaactgat tttgaagaat tagcgcaagt 600 gatgaggtgg caagagctgg aaaaatgggt ctttgcaaag aagtcactga agggactggc 660 aaaacttttt gttagtggtg aacggggttt aaccagttat agtgcattga aggaagcact 720 ggttgaggaa ttttctagcg aagacagtgt caaggaaagt tgtttccgat ttaataagtt 780 tgtttcggat tcgaaagtag aaacaaggaa atgttttaat tgttatgagg aggggcattt 840 agtggctgaa tgtccaaaac ccagacgtga gaagggaagc tgttttgagt gcggcaaaat 900 gggacacctt caaagggaat gcgaacgtcg gaagcagagc acaacatctt cagtttcgct 960 ggtggctgag acatctgctc tacctccatc tacagcagac atcctcagag aaggcgctac 1020 tactagtgga gaagtagaag ccttgcccgg ttcagtaata tctttgatcg tggaagatcg 1080 tgtaagtgaa gacgatatat tacgtgaggt agcaattaat gatggttcgg aggtaatgtt 1140 aattgattac agtgtagaag aaattgtaaa ccctgagctt actgtaaatt atgtgaaacc 1200 tatcgagaaa cctagtgaac ccgctgttga tttcgaagta gaaatgaaag tgattcctga 1260 gcaaaatctg tcaatgctac aggaaaatga taagggaaat tgtgaaagta gtgaatattt 1320 aaagaataaa atgcttaaaa gatttgaaat atgtaatgag ttggtgtata gaaagaaaaa 1380 ggggaaaatt ttatttgtag ttcccggaaa aatggagcac caaaatagca gaaagtttta 1440 tgattacgtt ggtcacttag gaattaataa aggtgccgag ttaattctaa ggtcctgtcg 1500 attctcgaaa ttgcaggaca aaattaaaga ttatattgat agttgtgtta attgtattgc 1560 atacattcca atcttaggta agaaagaaat gtggtggaca atgttttgtt ccatgctagg 1620 tacctgttta ggggatatat caggtaactc agattccttt tgagggaatt tctgagtctg 1680 gaattgggaa agtattagca attcctgtgt aaagttatag tagattttat ttataaattt 1740 tgtttgtgtt aggtatgtta cgatattatt tgctttggtt atgttaatta tgattgtaat 1800 agttttttta ggtttgtttt attaatcatt tttgtaatat tgttatgata gttttaggtt 1860 gttttaatta tgatttattt agattaattc tgtttatgta ttagcttgtt ttatgttcag 1920 ttatgtttat tcgtaggatt agttgaaact ttttattaaa ttgtaattta tcactcgtgt 1980 gatcgagggc gatcacgggt caggattggc cgagt 2015 // ID Copia-133_AA-LTR repbase; DNA; INV; 266 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-133_AA_; KW Copia-133_AA-I; Copia-133_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-266 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 266 BP; 58 A; 67 C; 61 G; 80 T; 0 other; tgttagcagt gtataagtgg gggcaatatt tttacccggc gccactacac tgcaatcatt 60 actgcagtag agaaaaagtg caatcaaaac tcctgtgaat tttcattccc gtgtaacagt 120 ccgatcatta aacacgttct agagtagtcg gtacgagttg cgttttattt cctcgttatc 180 gcgagttccc cgggttaatt ccgtccgctt tcggtgttgg cctgtccact tcgctacctg 240 ctgggtgtcc gccggtgaat cccaca 266 // ID hATm-17_HM repbase; DNA; INV; 3609 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3609 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1911-1911 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(565..1422,1463..3073) FT /product="hATm-17_HM_1p" FT /translation="MAPKTRISTCTQISVAFGTGKQFLVSELPTNRDVIRY FT AILLREGCEDIKKRNYPVKEMAKDILPALLHQWSSANLEFTSPKLITDKTI FT CDKIVTLWNNASAIACPRKKGAKQIIKFKEKLDKLFDILVCKCPIVKCDDF FT ICSDDCDKEVHCKCKCIKEYKIPSIDVKFIYFQRIKTGTVSKFQIGLPDSI FT ETRKQFKRLLRKDREKYFENKDSNVILTKKQKIESDQINFQKESDFSNEIE FT LNDNIDDITNVELPLVLEKSLGKKRCYNTLEISHIAMNSVRYSSFMYIYIY FT LKVILFLCKFCSHRYGVSARATAAIATATLIDAGLITSDDKKLIIDHSKVS FT RAQEKVISDLSKDFDKIAKSGEVKCIFFDGRIDQTKQMIELDDSNAKFPVT FT VKEEHYSVCMEPGGKYLFHFTPEKATKSVKHAEIIANKIVEWLTERGIDKS FT LLAVGGDSTNVNTGWEGGVIKHIEIKLGKKLVWLICQLHTNELPLRHLITD FT LDGQTKSNNKWSGVIGSMLDHATDLDINPVFETIKIGDPLISLDDKIINDL FT SADQFYGYKVVCAIRSGILPPKFHLLQIGPVSHSRWLTTANRICRIWISKH FT GIQGKDLENLRSITEYIVGVYFPCWFKIKVEHSWIEGPKHVLYQLQQLKFQ FT KEDVIKIVSPSIQRSAWFCFSECILQTLLCSNTEEDRSVAINKIKDIRGTG FT NEDLQKGDSSPRPRKTPVLNFNAEKLVDLISWESPVYEPVLTTSLTTSEIN FT VFLSNPMQVPIWPCHGQSVERCVKQVTEASSRVYTHEKREGFVRTQEASRK FT LMPKNNSKQDLESLTQMFLY*" XX SQ Sequence 3609 BP; 1399 A; 461 C; 555 G; 1194 T; 0 other; ttagggtgtg tcaattttag actttttttg aaaaaaaagt ttaataaaat ttggggtata 60 actaaaagtt gcgaaatttt acggagaatc taaatctgtt aaaatttttt aaatcagaaa 120 aatatttact taggtgatta gcctttttat gaaagtatta acggttttta tttcctttta 180 aacggaacaa cgacgttacc ggacaatact gtacgtaaaa aaaaacagac ttttaaacaa 240 caaaacaaat ttacattaag tttataaaaa aatcatttaa cttaatatgt ataaagtatt 300 taaaaaattt ttgtattaaa tgcgtaaaat aactttttat taaaaaaaaa aatttactca 360 aacgtaattt tttaatctta ataatggtca gtaaattaat ttttcattgc cgcatttagt 420 aaacaactgc gaactgcata ttgttaaata taaatttaag aacttttgat caattgtttt 480 cattcttaaa atattaaaaa aaaatttttt tcattactaa acaataataa attctcttta 540 aatattttat aacttggagg aataatggca cccaaaacca gaataagtac ttgtacgcaa 600 attagtgttg catttggtac aggaaaacaa tttctagttt cagaattacc aaccaacaga 660 gatgttataa gatatgcaat tttattaaga gagggctgcg aagacataaa aaaaagaaat 720 tatcctgtaa aagagatggc caaagatata cttccagctt tattacacca gtggtcgagt 780 gcaaatttgg aatttacatc gcctaaatta attactgata aaacaatttg tgataaaata 840 gtgactttgt ggaacaacgc ttcagcaata gcttgtccaa gaaaaaaagg agctaaacaa 900 ataataaaat ttaaagaaaa acttgataaa ctttttgata tactagtatg caaatgtcca 960 attgtaaaat gtgatgattt tatttgttct gatgattgtg ataaagaagt tcattgtaaa 1020 tgcaaatgta tcaaagagta caaaatacca tctattgacg ttaagtttat atattttcaa 1080 agaattaaaa ctggaactgt tagcaagttt caaattggat tacctgatag tattgaaacc 1140 aggaaacagt tcaaaagatt acttagaaaa gatcgagaaa aatattttga gaataaagac 1200 agcaatgtta ttttaactaa aaaacaaaaa attgagtcag atcagataaa ctttcaaaaa 1260 gaaagtgatt ttagtaatga aatagaatta aatgataaca ttgatgatat tactaatgta 1320 gaattacctt tagtactaga gaaatcatta ggaaaaaaaa gatgttacaa taccttagaa 1380 atttcacaca tagctatgaa tagcgtaagg tatagttctt tttaaataaa tttgtttttt 1440 agtaatgtat ttatccttat aaatgtatat atatatatat ttaaaagtaa ttctatttct 1500 gtgtaaattt tgctcacata gatatggggt atctgcacgt gctacagctg caattgctac 1560 agcaacacta attgatgcag gattaataac atctgatgat aaaaagttaa ttattgatca 1620 tagtaaagtt tcgagagcgc aagaaaaagt tatctctgat cttagtaagg attttgacaa 1680 aatagcaaaa agtggagaag tgaaatgcat attttttgat gggagaattg atcaaaccaa 1740 gcaaatgatt gaacttgatg actcaaatgc aaaatttcca gtaacagtga aagaagaaca 1800 ttactctgtt tgtatggaac ctggaggaaa atatttgttt cactttactc ctgagaaagc 1860 aacaaaaagt gttaaacatg cagagattat agcaaataaa attgttgaat ggctgactga 1920 gagaggtatt gacaagtctt tactggctgt tggtggagac tctacaaatg taaacaccgg 1980 ttgggagggt ggagtcatca aacatataga aataaaatta ggaaaaaaac ttgtttggct 2040 tatatgtcaa ctccatacaa atgagttacc actgagacat cttataaccg atttagatgg 2100 tcaaacgaaa tcaaacaata aatggtctgg tgttattggt tccatgcttg atcatgctac 2160 agatctagat ataaatccag tatttgaaac tattaaaatt ggcgatccac ttatttctct 2220 cgatgataaa ataataaatg acctgtctgc agatcaattt tatggttata aagttgtttg 2280 tgccattaga agtggtattt taccaccaaa atttcattta ttacaaattg ggcctgtttc 2340 acattcaaga tggttaacta ctgctaacag aatatgtaga atatggatct ctaaacatgg 2400 aattcaaggt aaagatttag aaaatttaag atctatcacc gagtacattg taggtgttta 2460 ttttccttgt tggtttaaga ttaaagttga acattcttgg attgaaggtc ctaaacatgt 2520 actttaccaa ctgcagcaac ttaaatttca aaaagaagat gtcattaaaa tagtatcgcc 2580 ctctattcaa agatcagctt ggttttgttt cagtgagtgt atactacaaa cattgttgtg 2640 ttctaatact gaagaggata gaagtgtagc tattaataag ataaaagata ttagaggtac 2700 tggaaatgaa gatctacaaa aaggtgattc ctctcctaga ccaagaaaaa cgcctgttct 2760 gaactttaat gctgaaaaat tggtagatct aatatcatgg gagtcaccgg tttatgaacc 2820 agtcttaact acctctctaa caacttctga aattaatgtg tttttatcaa atcctatgca 2880 ggtgccgatt tggccatgtc atggacaatc tgttgaacgt tgcgtaaagc aagttacaga 2940 agcttcaagt agagtgtaca ctcatgaaaa gagagaagga tttgtccgaa ctcaagaggc 3000 gagtagaaag ttaatgccta aaaataatag taagcaagac cttgaaagtt tgacacaaat 3060 gtttttgtat taaatttata atatttattg aaataattcg ggtttttttt ttaatatctt 3120 gctgtatttc tagcatcaat accaataaaa ttattacgcg atgttaaggc aaactgtatt 3180 taaaatacat tgaaaaatat tttaatatat ataaattaat ctcaatctaa tttcagtttt 3240 aaattaatat gcatattaaa aattaatttc aaagtctaat tttgaaataa gaaatatttc 3300 atttatattt tatacaaatt cttaaagaaa gtactataat atctaaaatt agttttaatc 3360 aacatttata tttcaactga tatttatcaa acaaagtaga aaaaaagctt attaatttcc 3420 aagtaaaaac cgggtttttt tgagaaacca ggtcaaaatt ttattgaaat ataaaaagtt 3480 aaatcaccta agcaaaaaaa tttttaattc aatttttaaa aacatatttg gattctccgt 3540 caaattttgc aacttttagt aactccccaa tttttttctc gaaaaaaatc aaaaatcgac 3600 acaccctaa 3609 // ID BEL-1N_AA-I repbase; DNA; INV; 3484 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; nonautonomous; KW BEL-1N_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3484 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 849-849 (2011). XX DR [1] (Consensus) XX CC 96% identical to consensus. XX FH Key Location/Qualifiers FT CDS 316..2439 FT /product="BEL-1N_AA-I_1p" FT /translation="MAEVKRLFAQRSLVEAKVKRLQEQLVDRDGTNRPSFA FT RVKLYVSELQFLYREYQRVYGELLSAFPAERYADLDENYERFEDLHFDACL FT LVETLLLSFPTTGNRYQHSRALSYNYFAATTAPRWRTPIIRSRSPSPPTEQ FT SVPIPDVMEKPNTSPPKELSPEQLFPAQANELKVSQFALNPIHSLQPPSAE FT TSAMKLIPVCGAALVRSEGDLAEVIKSKSAPMPIVPPSVAPIAQKCHREED FT STHRGKMPTTCEEVTTVAYEDESNPPCKQTVHQATNPIPVPIDPVRTHPKP FT TSFQSATGFVPKNPVASSQSFHCATGSRPVLQRFPCASGFRTIPIKFPSAI FT GLRPVAKRFPCRTGFRPAIPSRSAGFHPTPQRFPCAAGSIPALKHSSNLTE FT SNSVVKPFQRAFPDVAGFPPVMKQPSSVIGYQPVSQRFPAASVFHSVAKKF FT QCLASIHLIREQPSCATGPRSVLKQFPPSVGSHPAPEKPPDATGIPPPVPK FT PPGVTGIPPVSELAMPSTEMPARFKPIRMSAGIPPISKPILKMSEICPAKH FT PCPVSTGSHAVSKWLRCSTTIPPIAMPFLTTAGIHPVKESTPMPTGKRPVL FT SEARNQRLTKKMMPPVNPTGDSKQPLTKSRPEKPPDEAPMRRRASEAIPEC FT VSRAHPPKDVHQVVSKLSICGVCLFPILAQSGFPSPPLKNPIETLVLLSFH FT CQLEVLFH" XX SQ Sequence 3484 BP; 853 A; 967 C; 791 G; 867 T; 6 other; tatggtccan tcgagccgga tagaggagcc agttccttga aggctgcaac agcagcagag 60 tcagtcgtca atcaacagat tgagacaaat ccattccaat tgctatcgcc gatagcaatc 120 agctttccta ccaaaggatt gctgttgttc ctgcccggta gctcgacaca agagcgaaaa 180 gaatattcct ggtttccatt ggcggagagg agacgaaatt ccccaagtga agtgaacaga 240 gacagtgaaa aagtcccgaa gtccgaatca tttccacccc gaaagtaagt cgcgcgcgtg 300 tgaagtggtt ccagaatggc cgaagtgaaa aggctgttcg cccaacgtag cctcgtcgag 360 gcaaaagtga agcggctcca ggaacagttg gttgacaggg atggtaccaa ccgaccaagc 420 tttgcacgag tgaagcttta tgtaagtgag ctccagtttc tgtaccgaga ataccagaga 480 gtgtatggcg agttgctgtc tgcattccca gccgagcggt atgcagatct tgacgagaat 540 tatgagcgct tcgaggacct ccattttgat gcctgcttac tggtggaaac gttgttgcta 600 tcctttccga ccaccggtaa caggtaccag cacagtcgag cgttatcata caactatttc 660 gctgcgacca ctgcgccacg ctggcgcacc ccgatcatac gaagccgatc accgagtcca 720 ccgacggaac agtccgttcc gattccagac gtgatggaga agccaaacac gtcgccaccg 780 aaagagttgt ccccagagca gctgttccca gcccaagcga acgagttgaa ggtgagccag 840 tttgccctca acccaatcca ctccttgcag ccaccaagcg ctgaaacttc agccatgaag 900 ttaattcctg tgtgtggagc agccctggtc cgttccgaag gtgatctcgc cgaggtcatc 960 aagagcaagt cggcaccaat gccgattgtc ccgccttctg ttgctccaat agctcagaag 1020 tgccatcgtg aagaagattc cacccaccgt ggaaaaatgc caacgacgtg tgaagaggtg 1080 acaaccgttg cctacgaaga cgagtcgaac cctccgtgca agcagacagt tcatcaagcg 1140 acgaacccga ttccagttcc aatcgacccg gttcgaaccc atccgaagcc gacgtcgttc 1200 caaagtgcta ccggatttgt gcccaagaat ccggtagcaa gttcccaatc gttccattgc 1260 gcaaccggtt cccgtccggt gctccagcgg ttcccttgcg cgagcggatt ccgcacgatt 1320 cccatcaagt tcccgagtgc gatcggattg cgtccggttg caaagcgatt cccatgtaga 1380 accggattcc gtccagcgat tccatcccgg tcggccggat tccatccaac gccgcagcga 1440 ttcccgtgtg cggccggatc cattccggca ttgaagcatt cctccaactt gaccgaatcc 1500 aattcggtag tgaagccttt ccagcgtgcg ttcccggatg tggccggatt ccctccggtn 1560 atgaagcagc cgtccagtgt aatcggatac cagccggtgt cccagagatt tccagctgcg 1620 agcgtgttcc attcggtggc caagaagttc caatgtttgg ccagcatcca tctgattcgt 1680 gagcagccat cttgtgcgac cggacctcgt tcggttctga aacaattccc gcctagtgtc 1740 ggatcccatc cggcgccaga gaagcctcca gatgcgaccg gtatccctcc tccggttccc 1800 aagcctccag gtgtgaccgg tatccctccg gtatccgagc tagccatgcc gtctacagaa 1860 atgcctgcga gattcaagcc gattcggatg tcagccggca tccctccgat cagcaagcca 1920 atcctgaaga tgagcgagat atgtccagcg aagcacccgt gtccagtgtc aaccggaagc 1980 catgcggtat ccaagtggtt gcggtgttca accacgattc caccgatagc gatgccattc 2040 ctaacaacag ccggaatcca tccggtaaag gagtcaaccc cgatgcccac cggaaaacgt 2100 ccggtattaa gcgaagcccg aaaccagaga ttgaccaaga agatgatgcc gccggtcaac 2160 ccgactggtg attccaagca accccttacc aagtccaggc cggagaaacc accagatgaa 2220 gcccccatgc gacgtcgagc cagcgaagcc atcccggagt gcgtgtcccg agcgcatcca 2280 cccaaagatg ttcatcaagt ggtaagcaag ctttcgattt gtggtgtgtg cctattccct 2340 atcctggctc aaagtggttt ccccagtcca cctttgaaga atcccataga aacattagtg 2400 ttgttgagtt tccattgcca acttgaagtt ttgttccatt gagtaagatg tgtatccatg 2460 taagatttcc acccacaata cagtccactt gacgagtcct acaagaaatg ataattccat 2520 caaacgatct ttcaagtttt gtgccttacc gagtttgttt cccttatgtt gattccttat 2580 cctaatatga gtatctagta ttgcaacaag taacatacga atccctgtga agtgtttgcc 2640 cattaattcc aaattccaag atatgattat tgtgatccgt gagtccaagt gaattttcct 2700 cccattgagt tttttatcgt tgaatccttc cctaccaagt tgtatgatgt atgataccgt 2760 gcccactaag agagtgtagc cgattccctc aagttactag tgttccattt cgtgacagtt 2820 agatagcgtc gtaaccaacc ttttatccaa ccatgaattt tacatctgaa agttgtgtcc 2880 tccttatttt gcctcgattc cttatccagt gtttgtttag ccctttcacc gagtcaagat 2940 agttcgccaa tccttattga agcttatgta cccacaatgc ctaatagtat ccagttatcc 3000 aaatacagtc cacaagtatt gatacatccg tgtaaatccc aacatttgag ttagtatggc 3060 agtctaccat aaactgccca ccaatgtttt cttcaacacc tgtgcccttt gtgtaattaa 3120 aggttccttg cagtgcctta agtacaattc ccttttatct cgattatttc cctttgtgac 3180 atacagccac caaagttgac ctattccaga agtgtggaaa gagtttcttc tgcctcaagt 3240 gtgtcgccag ttgatgattc cgttttgtga tctcctgtga aacaagcagt tccttccgca 3300 cgtaagtccg tagttccttc catccctttg agattatatg ttaaatcgta gaaagctatt 3360 gtttccaccc tgacagtcct tnngagtnga gtccctgagt tccccatcac cgtgaatttt 3420 tgccgtcatg cccggtcctt accatcgaat ccaacttccc tttncctatt ttggccgggg 3480 agta 3484 // ID Crack-30_BF repbase; DNA; INV; 1948 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-30_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-30_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1948 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1948 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 835-835 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..1740 FT /product="Crack-30_BF_2p" FT /translation="RLTGSQKSSEIPSLVNSDGKVESSPSAIARELNSFFG FT HAAGKLAAMIQRCTSLFHPLQFLQHSKVMFSFQPISESFVLNELLTMKVNK FT ASGLDGLNCRLLRAAAPVIAPSLTSIMNLSLLTSEFPTDWKKARIVPIFKA FT GDRTEAGNYRPVSILPAVSKILERAVHIQVYQFLTENNLLSDRQSGFRRKH FT STETALHLVTEEWYKAIDNGNITGAIFIDLSKAFDTIDHSILMQKLSAYGI FT IGQSAKWFQSYLSGRSHCTSVNGHTSDFCTVNYGVPQGSILGPLLYLIYVN FT DMPNCVCKCHIALYADDTILYVSHPDLASVQLALQSDLDRLSNWFVANSLS FT INANKCKAMLIGTNKRLLNTHKLSVHINNDMLNVVPAYKYLGVTVDKELKW FT DMHVETVLSKLKRALNMMRQVRPLVTKAALLTMYETIFLPHITYCSSVWDG FT ASTVQIQRLQRMQNRVGKLILGVQSRTPTTVVHHLLGWKDIQSIHKMHKLL FT VVYKSLNNMLPAHMGELFMTCRENAARATRQSSTLKLIAPRVSRECVRRSL FT AVAGPAVWNVIPEDVRNSPSLRAFRHGLISNC*" XX SQ Sequence 1948 BP; 553 A; 436 C; 408 G; 551 T; 0 other; aggttgactg gaagccagaa gtcctctgag atacccagct tggtaaacag tgatggtaaa 60 gtagagtctt caccctctgc tatagccaga gaacttaact ccttttttgg ccatgcagct 120 ggcaaattag ctgctatgat acagagatgc acgtctctct ttcacccttt gcagtttctg 180 caacacagta aggttatgtt tagtttccaa cccatctctg aatcatttgt cttgaatgaa 240 ctcctaacca tgaaagtcaa caaagccagt gggctggatg ggctaaactg tcgcctgctt 300 agagcagctg ctcctgtcat tgcaccatca ttaacttcca ttatgaattt gtccttgtta 360 acctcagaat ttcccacaga ctggaaaaaa gcacggattg tgccaatctt caaggctggg 420 gacagaacgg aggcgggtaa ttatagaccg gtcagtatcc taccggcagt ctctaagatc 480 cttgaacgtg cagttcacat acaagtttat cagtttttga cggaaaataa cctgctgtct 540 gacagacaat ctggtttccg tcggaaacat tcgactgaaa ctgccctaca ccttgttact 600 gaggagtggt ataaggcaat tgacaatgga aacatcacag gtgcaatttt tattgacctg 660 tccaaggcct tcgataccat agatcatagt atccttatgc aaaaattgtc agcctatggt 720 ataattggtc agtccgctaa gtggttccag tcctatctgt ctgggcgatc ccactgcact 780 tcagtcaatg gtcatacatc agatttttgt acagttaatt atggagtccc ccagggctct 840 atactaggcc ctctgttata tctaatttat gtgaacgata tgcccaactg tgtatgtaaa 900 tgccatattg ctttatatgc tgatgacact atactttatg ttagtcatcc tgacttagca 960 agtgttcaac ttgctttaca atctgaccta gacagactgt caaactggtt tgttgccaat 1020 agcctatcaa taaatgcaaa caaatgcaaa gcaatgctca ttggaaccaa caaacggctc 1080 ttgaacacac acaaattgtc cgttcacata aacaatgaca tgttgaacgt agtcccagca 1140 tataagtatc taggtgttac tgtggataaa gaacttaaat gggacatgca tgtagagacc 1200 gtgttaagca aactgaagcg tgcacttaac atgatgcgac aagtccgccc acttgtgaca 1260 aaggcagcac tcttaacaat gtatgaaacc atttttctac ctcacattac ctattgttcc 1320 tcagtgtggg atggagccag tacagtgcaa atccaaagac tgcagaggat gcaaaacaga 1380 gtgggtaaac ttattctggg tgttcagtcc agaactccaa ctactgtagt gcaccatctc 1440 cttgggtgga aagatataca atccatacac aaaatgcata agctcctggt agtgtacaag 1500 tcgcttaaca acatgctacc agcccacatg ggtgaacttt ttatgacatg ccgcgaaaac 1560 gcagctagag ctaccagaca atcttcaacc ctgaagttga ttgcgccccg tgtgagcagg 1620 gagtgtgtga ggaggtcgtt agcggtggcg ggtccggcag tctggaacgt catcccagaa 1680 gatgtacgga attctccctc tttgcgagcc ttccgacatg gattaatttc gaactgctag 1740 ctgatatcga tattatgagt ttgaatttga tattatgatt gtatctagtg taaataactg 1800 tattaatgtc tattttgttt atctatttat tttctcccgt gacacctccc ctcaactggt 1860 ccttaactgg tcccctagga agaacggcct acatgttggc cgatagggct tatcagtata 1920 aataaaggtg aactgaactg aactgaac 1948 // ID CR1-98_AAe repbase; DNA; INV; 5578 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-98_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5578 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1186-1186 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 1706..5509 FT /product="CR1-98_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MESPVCRYGEGVFQSANHGKYNHASNSTDSDPPSGFR FT EESLNHFVSSPTLTPGRNDESIMEAPTPPATVEPFLPTTCSRPGPVCRYGG FT GVFQSANHGKYSHVSNSTDSDPPSGFRNESLIHIASSPTLTPGRNNESFME FT APTPPATVEPFLPTTCSRPGPVCRYGGGVFQSANHGKYSHDSNSTDSDPPS FT GFRDESLIHIASSPTLTPGRNNESFMEAPTPPATVEPFLPTTCSRPGPVCR FT YGEGVFQSANHGKYNHASNSTDSDPPSGFREESLNHFVSSSTLTLEGINAG FT FKDTPSPAITVQPSRTPINHPDRDHLFLSVYYQNVRGLRTKTTELRLMLSS FT CDYDVIVFTESWLRPDIRNSELTSDYALFRCDRSEATSDLSRGGGVLVAVK FT NELQCLGISLVGCTQLEQMAVCVKLPHRSVYICAIYLRPNSDVALYSTHAS FT AVQHIADQCSVNDVILVLGDYNLPRLQWIFDNDLNGFLPSNASSEVELSFT FT ESISSCGLVQLNPFVNANGNLLDLTFSSSPDVIEILQPPLPLLPIDAHHPP FT YVLQIDVHTSMSDHVSARIDALELDFKHCDFDGLNNILSTVDWGNYLDGHS FT VDANLALFYEKIYEVLDSNVPRYRCPKGCGYRKPWWNAELRNLRNKLRKVR FT KRFFSDKSVENKQMLRLVEGSYNELLIRSHQEYLIRLEENLKQNPKTFWKY FT VKSLRSSNRVPNNVMLGNTTASSSKDAANLFSSFFKSVYSSTSPQPHPGCF FT RNVPSHNVHLQRLNFTNDDILRALAKIDASKGAGVDGLPPAFVKNCAISLV FT TPLTLLFNRSLEEKTFPSLWKTACMIPVHKSGSLNHVENYRGVSILCCIAK FT VFETLVHEVLLNAVKPLVSDYQHGFVPHRSTTSNLLCYTTVLFREIEHRNQ FT VDSIYIDFSKAFDTVPHAYAVEKLKHIGLPDWIMDWIFSYLSERKAFVRVN FT SARSDEFEIPSGVPQGSVLGPLIFVMFVNDLCSRISSGKLLFADDLKIFRV FT IASALDCNVLQNDIDELLRWSNENGMRINIKKCKSISFSRCRNPIVHNYRV FT NSVQLERVESIVDLGVTIDNKLRFNEHVATMTAKAFSVLGFVRRNAAQLTD FT IYALKSLYCALVRSILEYASPVWSPYYASEVIRIERVQKQFIRFALRNLPW FT NDPLHLPHYQARCRLINLELLSERRKKLQQLTIFDILCQNIDCAELLQQIS FT FNVPQRPLRNHAFIIIPGHRTNYGFNSPLSSCIRAFNTVSENFDFNLTKQS FT FKARLRY" XX SQ Sequence 5578 BP; 1430 A; 1222 C; 1109 G; 1817 T; 0 other; cactgaatta tgcacactga aaaagattat ggccaatccc agccgatctt ctagttgatt 60 ctttgtgcat tttcactgac ttcggtcaat cacggaatag caaccattga tatgtgcagt 120 cagtctaagc taagctaagc taagtttcct tcgggattca ctttggactt tatttcggaa 180 tttcctttgg aataattttt ggagcatcct tcggaatttt actcggaatt tcctttggaa 240 ttacatttgg aattttcttg ggaatttctt ttggatcttg gaatgttctt cagactttcg 300 atggcatttc ctctgttatt ttctttggaa tttcttccgg aatttcatgt ggagtttcct 360 tcggattttt ttttttaatt tcctgtaaga ttccctttgg agtttctttc ggcaattcct 420 ttggaattac ttttgaagct tccttgaaat tgcatttgaa attttatttt tttgtggaat 480 tttcttttaa atttcttatg gagcttgaaa ttttttttgg aatttcgttt ggagcttcca 540 tcggaatttc gtctggagtt ttcggaattt ttcggaaatt cctttggagt tcctttggag 600 tttcttttgg aattcacttt ggaattatct atggagtttc cttcggaatt tccttcggat 660 ttttcttggg gatttctttt ggagcttgga atttcttccg gaatttcatg tggagtttcc 720 ttcggaaatt tatttgaaat tttcttcgga tttttttttt ttattttcca ttaagattct 780 ctttggagtt tctttcagaa tttcctttgg aattactttt ggagcttcct ttgaaattgc 840 atttaaaatt ttatttgttg tggaatttta tacggaattt tcttaagaac taccttttgg 900 aactttattt ttttttattt ccattgtagt tttctttgga atttctcccg gaatttcttt 960 cgggaagttg tttcaagtag ttcctttggg aagttcgtgc gagaattttt ttttcggtaa 1020 gtttcttcgg aaaattcctt tgaaaagatt ccttggaaat tctttggatt tattttttat 1080 ttgaagttcc ttcgtaattt cctttggatt ttggattgga gtttccttcg aaattttctt 1140 aggaatcacc tttggagctt cctttggaat ttcatttaga atttaattcg gtattacatt 1200 aggaattttc ttttggaatt tcctgtggaa ttttctttgg atgtcttgta atttgaaatt 1260 ttgaaatttt gttcgaattt tcctttgaaa tttcatttga aatatacttc agaatttact 1320 ctgtaatttt catcgaaatt tccttcagaa ttttatccgg agtttccttt ttatttttct 1380 tttgaatatc ctttggaatt tccttcagag tttccttttc tttaaaattt tctgtgcaat 1440 ttctttcgga gtttccttcg gaattttctt tggattttat tttgttattt tcttcggaat 1500 tttgcattgg tacgtagatc gaattccaat tttgaaacat aaaattgttt tccgggtagt 1560 acaatgggat tccgtttttg gcaacaaagt tgaaattttc agttgccaaa aacggaaccg 1620 tcccaaaatc ggaacccata tttcaaattt aatttttcaa ctattggttt tgaaagcatc 1680 attacaatgt taatacatca tttttatgga atcacctgtg tgtaggtacg gagaaggggt 1740 cttccaatcg gcaaatcacg gcaagtacaa ccatgcttcg aacagtacgg attctgatcc 1800 accttccggt ttcagagaag agtcgctcaa ccatttcgtt tcatcaccaa cgctgacacc 1860 gggacgcaac gacgaaagca tcatggaagc cccaactccc cctgccacag tcgagccctt 1920 cctgccaacg acctgcagcc gtcctggtcc tgtgtgtagg tacggaggag gggtcttcca 1980 atcagcaaat cacggcaagt acagtcatgt ttcgaatagt acggattctg atcctccatc 2040 cggtttcaga aacgaatcgc tcatccacat cgcttcatca ccaacgctga caccgggacg 2100 caacaacgaa agctttatgg aagccccaac tccccccgcc acagtcgagc ccttcctgcc 2160 aacgacctgc agccgtcctg gtcctgtgtg taggtacgga ggaggggtct tccaatcggc 2220 aaatcacggc aagtacagtc atgattcgaa tagtacggat tctgatccac catccggttt 2280 cagagacgaa tcgctcatcc acatcgcttc atcaccaacg ctgacaccgg gacgcaacaa 2340 cgaaagcttt atggaagccc caactccccc cgccacagtc gagcccttcc tgccaacgac 2400 ctgcagccgt cctggtcctg tgtgtaggta cggagaaggg gtcttccaat cggcaaatca 2460 cggcaagtac aaccatgctt cgaacagtac ggattctgat ccaccttccg gtttcagaga 2520 agagtcgctc aaccatttcg tttcatcatc aacgctgaca ctggaaggca tcaacgcagg 2580 cttcaaggac acccctagcc cagccatcac agtgcaaccc tctcgaacac cgatcaacca 2640 tccagaccgt gaccacctat ttttgtccgt ctactatcaa aatgttcgtg gtttacgtac 2700 caaaacgaca gaactaaggt taatgttatc ttcctgtgat tatgatgtaa ttgtgttcac 2760 tgagtcgtgg ctccggcctg acatacgcaa ctcagaacta acaagcgatt atgcattatt 2820 tcggtgcgac agaagtgagg cgacaagtga cctttcgaga ggcggtggcg tccttgtagc 2880 agtgaaaaat gaattgcaat gcctcggtat atctctagtt ggttgcacac aactcgagca 2940 aatggcggtg tgtgtaaaat taccccatcg ttcagtgtac atctgtgcta tttatctacg 3000 tccaaattcc gacgtcgctc tatattcgac tcatgcctcg gctgtacagc atatcgctga 3060 ccaatgctcg gtgaacgatg ttattctggt tctgggcgat tacaaccttc ctcgtttaca 3120 atggatcttt gacaacgacc tcaatggttt tctgccttca aatgcatcca gcgaagttga 3180 attgtcattc actgaatcaa tttcgtcttg tggattggta cagttgaatc catttgtcaa 3240 tgcaaacggc aatttgcttg atctaacctt ctcgtcctct cccgatgtca tcgaaatatt 3300 gcagccaccg cttccgttat tgcctattga tgctcatcat ccaccttacg tgcttcaaat 3360 cgatgttcat actagtatgt ccgaccacgt ttctgcacgc atcgatgccc tggaactgga 3420 tttcaaacac tgtgattttg atgggctcaa taatattctg tcaactgttg actgggggaa 3480 ctacctggac ggccattccg ttgacgctaa ccttgctttg ttctacgaaa aaatatacga 3540 agtgttggac tcgaacgtgc ctcgttatcg atgccccaaa ggctgcggct acagaaagcc 3600 ttggtggaat gctgagttgc gtaatttgcg caataagctg agaaaagtgc gtaaacgttt 3660 cttttccgat aaatctgtgg aaaacaaaca aatgctacgc ctcgttgagg gttcctacaa 3720 tgaactactt atccggagcc accaggaata tttgatcaga ttggaagaaa atttgaaaca 3780 aaatcccaag acgttttgga agtacgtgaa gtcgctaagg tcaagcaacc gagtaccaaa 3840 taacgtgatg ctcggaaata ctactgctag ctcttccaaa gatgctgcaa atcttttctc 3900 cagttttttc aaaagtgttt acagctcaac gtcccctcaa ccacatcctg gatgcttcag 3960 aaatgtgccg tcccacaacg tgcacctcca acgtctcaac tttaccaacg atgatatact 4020 aagagcacta gcgaagatcg atgcttcgaa gggagctgga gttgatgggt tgccacccgc 4080 ctttgtgaaa aactgcgcca tatctttggt cacacctctt acgctgctct tcaaccgttc 4140 cttggaggaa aaaacattcc caagtttgtg gaaaactgct tgcatgatcc ccgtgcacaa 4200 atcaggcagc ttaaaccacg tggagaacta tcggggtgta tcaattctct gttgtatagc 4260 caaagtcttt gagacgctag tgcacgaagt tttgcttaac gctgtgaaac cgctcgtctc 4320 cgattaccaa catggtttcg ttccacatcg ttcgacaacc tctaacttgc tatgttacac 4380 gaccgtactg ttccgagaga ttgagcatcg aaatcaagtc gattctatat atattgactt 4440 ttcaaaagcg ttcgacaccg ttccgcatgc gtatgcagta gaaaagctga agcacatagg 4500 actaccggat tggattatgg actggatttt ttcatacttg tctgagcgaa aagcatttgt 4560 ccgggttaac tcagcacgct ccgatgaatt cgaaataccg tctggagtac cacaaggcag 4620 tgtgttgggt cctctcatct tcgtaatgtt cgtaaacgat ctctgttctc gaatttcctc 4680 cgggaagtta ctttttgctg atgatctaaa aatattcaga gtgatcgcgt cggctcttga 4740 ttgcaacgtt ctccagaacg acatcgatga gcttttgcgg tggagtaatg agaatggcat 4800 gcgtatcaac attaaaaaat gcaaatcgat ctcttttagt cgatgcagaa atcctattgt 4860 gcataactac cgtgtcaact ccgttcagtt agaacgagtt gaatctatag tggacctcgg 4920 tgtgactata gataacaaac tccgctttaa tgagcatgtt gccacaatga ctgccaaggc 4980 tttttccgta ctcggcttcg tccgacgtaa tgctgcacaa ctaacagata tatatgcact 5040 aaaatcacta tattgtgccc tggtgcgaag catcctggag tacgcatccc ctgtttggtc 5100 gccatattac gcatcggagg ttattcgcat agaacgtgta caaaaacagt ttattcgttt 5160 tgcgcttaga aatcttcctt ggaatgatcc tttgcatctt ccacattacc aagcacgctg 5220 tcgccttata aatttagagc tactctcgga aagaaggaag aagttgcaac agttgacaat 5280 attcgacatc ctatgccaaa acattgactg tgctgaatta ttgcagcaga tttcattcaa 5340 tgtacctcag cggccgttaa gaaatcacgc atttattata attcctgggc atagaacaaa 5400 ctacgggttc aacagtccac ttagttcatg cattagggct tttaacactg taagtgagaa 5460 ctttgacttc aatttgacaa agcagtcctt caaagctagg ttaagatatt agattaggtt 5520 agtctgtacg attttatcga agaccatata caataaataa ataaataaat aaataaaa 5578 // ID ITmD37E_Ele6 repbase; DNA; INV; 1298 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37E DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37E_Ele6. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1298 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1298 RA Kojima K.K. and Jurka J.; RT "ITmD37E-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. >91% identical to consensus. TIRs are 28 bp CC long. TA TSDs. This consensus is ~93% identical to the original CC sequence in [1]. This family encodes a DD37E-type transposase and CC is similar to Tx_mos from Toxorhynchites amboinensis. XX FH Key Location/Qualifiers FT CDS 96..1121 FT /product="ITmD37E_Ele6_1p" FT /note="transposase." FT /translation="MCICLRCVSFTSRCAIVVKMESEEQQRRKEILRTYLE FT NPDLSHRAISRKLGMVHSTVSRVLNRYHERLTIDRKEKSGKNGSPYSVKDH FT KRVVQAFKRNPNSSVRDVAKKLNLSKTFVQDAKKREGLHTYKVQKAPNRDE FT RQNTVGKTRARKLYTQMLTKPHCLVMDDETYVKADFRQLPGLQFFTAHHKF FT DVPEDVRKQKMSKFAKKYMIWQAICSCGKRSTPFVTTGTVNGQVYLKECLQ FT KRLLPLLRSHDGPTIFWPDLASCHYSKEVLEWYKAKEVNFVPKHLNPPNAP FT ELRPIEKYWAIMKQALRKHPKEVKSEEDMKKKWTATQKKSWSKCCTGPYE" XX SQ Sequence 1298 BP; 397 A; 252 C; 316 G; 333 T; 0 other; aagggtgtac ggaatcaaat tgcaccactt ttaaatggct gtaacttttt acccgttggg 60 tattttctat tgaaattttg ggtgtaattt gttcaatgtg tatttgttta cgctgtgtaa 120 gtttcactag cagatgtgca atcgttgtga agatggagtc ggaagaacag cagcggcgca 180 aagaaatttt gcgcacgtac ctcgaaaatc cggatctatc tcatcgtgcc atcagcagaa 240 agctgggaat ggtacattct acggtgtctc gtgttttaaa tcggtaccac gaacgtttga 300 ccatcgatcg gaaagaaaaa agtggcaaaa atggatctcc gtatagtgtt aaggatcaca 360 aacgggtagt tcaagctttc aagaggaatc ccaacagttc agttcgagat gtggcgaaga 420 aactgaatct atccaaaact ttcgtgcaag atgcgaaaaa acgtgaagga ctacatacgt 480 acaaagtcca aaaggcgcct aatcgcgatg aacggcagaa tacggtaggt aaaacacgtg 540 cacggaagct ctacacccag atgttgacga aaccacattg cctcgttatg gatgatgaga 600 cgtatgtaaa agcagacttc cggcagcttc cggggctcca gtttttcact gctcatcata 660 aattcgatgt ccctgaggac gttagaaagc agaagatgtc aaagtttgcc aaaaaatata 720 tgatttggca agcaatttgc tcgtgtggaa agcggagcac cccattcgtg acaactggaa 780 cggtcaatgg gcaggtgtac ctaaaggaat gcctccaaaa gcgtctactt cctcttctga 840 ggtcacacga tggtcctacg attttctggc cggatttggc ctcgtgccat tactcgaaag 900 aggttttgga gtggtacaag gctaaggagg tcaatttcgt gccgaagcat ctcaaccccc 960 cgaacgcacc ggaactgcgg ccaattgaaa agtactgggc catcatgaag caggcacttc 1020 ggaagcatcc taaggaagtg aaatcggagg aagatatgaa gaaaaaatgg actgccacgc 1080 aaaaaaaaag ttggtccaag tgttgtacag gaccttatga gtagtgtgaa gagaaaagtt 1140 cgtgcatttg gatatggaat tgaaattgaa taaaacaaac atgggaaaat attcataaca 1200 tgttttattt tactccctga aagtttgaag atgattggtt gaacgtgcga attttgccaa 1260 atatttgtgt gtgtcgcaat ttgattccgt acaccctt 1298 // ID CR1-1_LG repbase; DNA; INV; 5495 BP. XX AC . XX DT 14-JUL-2009 (Rel. 14.07, Created) DT 14-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE CR1 non-LTR retrotransposon - a consensus sequence. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_LG. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-5495 RA Kapitonov V.V. and Jurka J.; RT "A family of CR1 retrotransposons from Lottia gigantea."; RL Repbase Reports 9(7), 1331-1331 (2009). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed from several copies less CC than 1% divergent from each other. XX FH Key Location/Qualifiers FT CDS 264..1001 FT /product="CR1-1_LG_1p" FT /note="ORF1." FT /translation="MAGLLEHREALTRMIEEAVKVAVSTAIELFDKNLRLY FT VDGRLKCLDDRMCQIEESHHSVSITHDQFYDQLTDLQNKLDAKTIEFEKLS FT SKINSLEQYTRKAHIRIFGIVESDGEDCIAIAKKLFREKLGITSEISIDAA FT HRVGVPKGRGSGTNTEIPRPRGIIVRFIRRDERFRVLSNRKNLKGTGVVIV FT EDMTVENVKLLDSARNSRRVKSAWFTNGKVFVVGQNDYKFRLERIGDLNSK FT LPPFP" FT CDS 1875..5219 FT /product="CR1-1_LG_2p" FT /note="ORF2: AP endonuclease and RT domains." FT /translation="FIIIFIFTTRYRLVYRSFLWGDYFSLLNHVSSKVTLC FT LLFTLLNDTFSSILYYCFILIIRSGDIELNPGPTSTKLYDKISFFHLNSRS FT IRNKLDKIFDESHDNHVCAFTESHLDGSISDQSLIRDGFSIPYRKDRNCHG FT GGVLLYIKDYIFSKRRQDLESPLVESIWVEIHVLHKILLVGAIYRPPNSPA FT QVWQSIDASIELAFSSSKNIILLGDFNSDLLTPNSRLSNIILSNNLKNVCE FT KPTRVTSRSSTLIDVIIVHDEIDICFTDTVDFPNYISDHKGVHATLDLSIP FT SYHSFQRVIMIYSRADYVRLNDLIRNFDWGNCFENFDINYSCSKFYITLNK FT FIDICIPKKSVVIRPSDKPWFNSDIRKAIRLRDRLRKISIKSKNINDFNKY FT KTQRNRVNNFIKSLKSKFYVKLTDVINDPNSSNHEYKWNCMKFFLGTSCTP FT PLPPLVSNGAMVYSDADKAQLFNTYFASISNVTPTDNLPDFNERTSSKISL FT PIISEEEIIDLIKCLKNKKANGFDKISNIILKNIASSISYPLHLLFNKSLE FT SGTFPSLWKEARVMPLYKKGESWDVSNYRPISLLSCISKLFERVIFKHLYN FT YFHENKLLYPYQSGFRPGHSTVHQLIEIYNDILVAIEKKQFACFAFFDISK FT AFDRVYHPGLILKLSKYGIGGKFLDWLSSYLTNRFQRVFLNDALSPAELLN FT SGVPQGSVLGPLLFLIYINDIQDVLQNLSRLFADDTAVGCTKSLISEIEIS FT MNNDLSKLYQWSSDWLVSFNPSKTKVVLFSTKPISQLPNLILGQEPLEFVP FT FHKHLGITLSNDLKWSIHIDTICKSAYKLIGVLRKLKFTLNRKTLLSIYTT FT YVRPRLEYACEVWDCCNISDLDRLEHVQLEAARVITGLTKYCKIESLYFET FT GLHTLSERRKTRRLSLLYNIVHNNAPNYLIDILTPIDATPRFYNLRNSNEI FT PIPFARTTAYQRSFFPSTTHDWNLLDAQLKDSPSIKAFKINLKRSFLSNSA FT PVYFDAGDRRENILLTRIRHNCSGLNADLFKINVITNSSCKCGYHTENSFH FT YFFDCQIYRAQRLVLLSNLSDYVVNLELILCGNRNFSVDTNIIIFKHVHDF FT IKHSKRF" XX SQ Sequence 5495 BP; 1629 A; 1017 C; 893 G; 1956 T; 0 other; cacgtgacaa gatggcggca ataagaagtt ttttgaatgc tctctaaata attatggaag 60 tttaacttta tttaggcgtc atttgtcatt atgaatgagc taaaataatg tagttagagt 120 atgcgatctt attatagtga tttgtgaata taatcgagga attcatgtgg ttatttggat 180 caattcgtgt gtcagtagtc catattcgta tatgatgaac tcgaactcag tgtgtttaaa 240 atagtaactc gcaaaagatt aaaatggctg gtttacttga acacagggaa gctctgacac 300 gcatgattga agaagcagtt aaagtggcag taagtacggc aattgaactt tttgataaaa 360 accttagatt gtatgttgat ggtcgattaa aatgtttgga tgaccgtatg tgtcaaatcg 420 aagagtctca ccactctgtc agtataacac acgaccagtt ttatgatcaa ttaactgatt 480 tgcaaaataa actagatgca aagactatag aatttgagaa actgagctct aaaatcaaca 540 gtttagaaca gtacacgagg aaagcccata ttcgcatttt tggaattgtc gaatcagacg 600 gcgaggattg tatagctatt gccaaaaagc tattccgtga gaaattaggt attaccagtg 660 aaataagtat cgacgccgcc cacagagtag gagtacccaa agggagagga tcgggtacga 720 atactgagat accccgccca aggggcatta ttgtgagatt catacgccgt gacgaaagat 780 ttcgtgtttt atcaaacaga aaaaatctta aaggtactgg tgtagttatt gtggaagata 840 tgactgttga aaatgtgaaa ctattggact cggctcgcaa tagtcgtcgg gtaaagtcag 900 cctggttcac caatggtaaa gtctttgttg ttggccaaaa tgactataaa ttcagactag 960 aaagaatcgg tgaccttaat tctaagcttc ctccgtttcc gtgatttgtg tgtgtatccc 1020 tatattttgt acccatgtat ttgttaaacg gcctgttaca tagtttgttt acttctacta 1080 atactttatt taaatcataa taatgtgtcg gtagtgtacc tttaaaaata cccgttttgt 1140 ttgttatcaa caagtgctca ctgtaattat catacaatcc gtactgatac actcgaaccc 1200 gctaacaagc gagttgttat aactcatcgt cctattgtca gtcatcgtcg tgagccaagt 1260 tttagaaatt tatttcaaag actctgtcta tatatacact tttgtgtctt aagtaattca 1320 tgcgtatggt gttttaatat tcttactctg gttatatttt tgatctcttt tgtatatcac 1380 ctatctcatt ttgtatgtaa aaagtgttta ctgaatatac aggacagccg cttaaattgg 1440 ttttatctat accgagggct cttcacgaac tgaccggcaa ttcaattgct attaatatta 1500 taacgagttt ctctcatctt cttgagccat gtttgaatgt catttactat tttaattaaa 1560 gtttcaatcc ctcaaaaaaa aaagtatata tctatacttc gttttggtat ttagcactat 1620 aatttgtttg acatgcgcct atattttcct attttttttt tttctctcta ttatgttttc 1680 catttcctat ttgtctatag tttgtttttt tcgttcttct atttttcttt ctactattct 1740 tcactttctt atcatttgta ctgtatataa tatgttttac tccgtcaacc agttatatcg 1800 acaactaaca tcttgcatgt tgcttacatc gtattttggt tcatgtttct cccctttctt 1860 ctggtatatt ctgatttatc attatattta tatttacaac tagatataga ttagtatacc 1920 gaagttttct atggggagac tatttttcat tacttaatca tgtatcttct aaagttacgt 1980 tgtgtctctt atttacattg ctgaatgata ctttcagttc catcttatac tattgtttta 2040 ttttaatcat acgttctggc gatatcgaat tgaatccggg tcccacctcg acgaaacttt 2100 acgataaaat atctttcttt cacctgaact cgaggagcat tcgtaacaag cttgataaaa 2160 tatttgatga atctcatgac aaccacgttt gtgcttttac agaatctcac cttgatggta 2220 gcatttctga ccaatcgtta attagggacg gattttccat tccatatcgc aaggacagaa 2280 actgtcatgg cggcggtgtc cttctgtaca tcaaagatta tattttcagt aaacgtagac 2340 aggacctaga gtcaccattg gttgaaagta tttgggtaga aattcatgta ttgcataaaa 2400 ttttgttggt tggtgctatt tacagacccc ccaatagccc tgctcaagtt tggcagtcaa 2460 ttgacgcttc gattgaactt gcgtttagtt catccaaaaa tataattcta ttaggcgact 2520 ttaatagtga tttattaacc cctaatagcc gattgagtaa tatcattctg tccaataact 2580 tgaaaaatgt ttgtgaaaaa ccaacaaggg taacttctag gtcttctact ctgatcgatg 2640 taatcattgt tcacgatgaa atcgatattt gttttactga taccgttgat tttccaaatt 2700 atatttcaga ccacaaagga gttcatgcca ccctcgatct ctcgattcca tcgtatcaca 2760 gcttccagag ggtgatcatg atctactcga gggctgacta cgttcgtctt aatgatttga 2820 ttcgaaattt cgattggggc aactgttttg aaaatttcga cattaattat tcttgttcta 2880 aattctatat caccttgaac aaatttattg atatatgcat acctaagaag tctgttgtga 2940 tcagaccttc agataagccc tggtttaaca gcgacatacg aaaggctatt agattacgtg 3000 ataggcttcg taaaatatcg atcaaatcaa aaaacattaa cgattttaat aagtataaaa 3060 ctcaacgaaa tcgtgttaat aacttcatca aatcattgaa atctaaattt tacgtcaagt 3120 taactgatgt catcaacgat ccaaattctt ccaatcatga gtataagtgg aattgtatga 3180 aattctttct aggaacttct tgcactcccc cgttaccacc tcttgtttct aatggtgcta 3240 tggtttattc tgacgccgat aaagcccagt tatttaacac ttattttgct agtatttcga 3300 atgttacacc aacggataac ctaccagatt ttaatgaacg aacctcatct aagatatctc 3360 taccaatcat ttctgaggag gaaattattg acctcatcaa atgcttgaaa aataaaaaag 3420 caaacggatt tgataaaatc agtaatatta tccttaaaaa tatcgcatct tctattagtt 3480 accctcttca cctcttattt aacaaatccc ttgaatcagg tacattccca tcactatgga 3540 aggaagcgag ggttatgccc ctttataaga aaggagaatc ctgggatgta agtaattata 3600 gaccaatttc gttactgagc tgtatcagta aattatttga aagagtaatc tttaaacatt 3660 tatataatta ttttcacgaa aacaaactac tctatcccta tcaatccggc tttagaccag 3720 gccattcaac agttcaccaa ctgattgaga tttacaatga cattctagtt gctattgaaa 3780 agaaacaatt cgcatgtttc gcgttctttg acatatctaa agccttcgat cgtgtgtacc 3840 atcctggtct tattttaaaa ttgtcaaagt atggaatagg tggtaagttt cttgactggt 3900 tatctagtta cctgactaat agattccaac gtgtatttct taatgatgct ttatcaccgg 3960 ctgaattgtt aaattctggc gtccctcagg gatctgtgct aggcccccta ctctttctca 4020 tttatataaa tgacattcaa gacgttcttc aaaacttatc gcgattgttt gctgatgaca 4080 cagctgttgg ctgcacgaag tcacttatat ctgaaattga aatctctatg aataatgatc 4140 tttctaaatt gtatcaatgg tccagtgact ggcttgttag ctttaacccc tctaaaacca 4200 aagtagtcct tttttcaacc aaacccataa gccaattacc aaacttgatt ctcggtcagg 4260 aacctttaga attcgtacca ttccataaac atcttggcat cacattatcg aatgatctaa 4320 aatggtcgat ccatatcgat acgatttgta agtctgccta taaattgatt ggtgttttac 4380 gtaaacttaa attcactctc aacagaaaaa ccctcttaag catttataca acttatgtaa 4440 ggccgcgtct agaatatgcc tgcgaggtgt gggattgctg taatatctca gatttagacc 4500 gtctcgaaca tgtccagctt gaagccgcca gagtcataac aggtctcact aaatattgta 4560 aaattgaatc cctatatttc gaaaccggcc tgcatacact ttccgaacgt aggaaaaccc 4620 gtcgtctttc tcttctctat aatattgtcc acaataatgc gcccaattat cttattgata 4680 tcttaacccc tattgatgct actcctagat tttataatct cagaaattca aatgaaatcc 4740 cgattccctt tgcccgtact actgcttacc aacgatcgtt ttttccctct accactcatg 4800 attggaatct attagacgca cagttaaaag actccccctc cattaaagct ttcaaaatca 4860 atcttaaaag aagtttctta tcaaattcag cccctgttta ttttgatgct ggagacagac 4920 gtgaaaatat tttattaaca cgaattcgtc acaattgtag cggtttaaat gctgatcttt 4980 tcaaaataaa tgttattacc aattcttctt gtaaatgtgg ctatcatacg gaaaattcat 5040 ttcattattt ttttgactgt cagatttatc gagcccagcg attagtactt ttgtctaact 5100 tgtctgacta tgttgtaaat ttagagttaa ttctgtgtgg taatcgtaac ttttctgtag 5160 atactaatat tataattttt aaacatgtcc atgactttat taaacactct aaacgctttt 5220 aattgcaagt tttacatcta taatgaatct attaccatta ttttctggtt gtcggtttat 5280 gttgttgttt atattttgtt ttcttctcta atttttcttt tcttttctaa agattccaag 5340 ttattttgct tttattaact ttgtttgtaa aatatgttcg tatcttgtca atattattgt 5400 gtataacttt gtgtggagcg gatttttcct aagttcttga aacttgcatc caatcccttt 5460 gattttccaa taaaatatgt ttaaactaaa ctaaa 5495 // ID Harbinger-2N1_BF repbase; DNA; INV; 1293 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-2N1_BF non-autonomous DNA transposon - DE consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger-2N1_BF; Harbinger-2_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1293 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1293 RA Kapitonov V. and Jurka J.; RT "Harbinger-2N1_BF - a family of non-autonomous DNA transposons RT from the amphioxus genome."; RL Repbase Reports 8(8), 797-797 (2008). XX DR [2] (Consensus) XX CC It is a non-autonomous derivate of Harbinger-2_BF. This CC transposon is characterized by 34-bp TIRs and TNA TSDs. XX SQ Sequence 1293 BP; 346 A; 304 C; 286 G; 357 T; 0 other; ggcccacttt acattacgct tttcgcgtta gcggtaaatt tcccgttatc gggacgccgc 60 tatcggcacc tctttagaca caccgcccgg aaatctgaca ggacacccgc ggcgcgcggt 120 ggaatgtaaa atcttggtga aagtctaccc gtaaagatcg cagttctttt tagctgcacg 180 ttgttatatt gaacaaacgt taattttttg tttgcgtacc gctaccaagt gattgaaaat 240 tcaaatacgt gacttgcatg agaaatacaa gtccctgact acgttgttgt ttcgctgaca 300 cttccggtaa cttcgaccac aaaattttcg ataaaaaagc acttaaaaac cctttcaaag 360 cgaactgttt gatagaagat ccctttcttt cagtgatgac ctgctaaatt atggctacgt 420 actgaaataa ccgtgaatat acgcataatt cctgcaggta taataactac ctcgtacgta 480 aaaactcaag ctgccagcag ttcatgtggc gggaatttcc ccaccaatgt catagcaacg 540 gccaccactg tcatagcaac ggccaatcgg agacgcgtaa tgggattact ttatacaaaa 600 tttgcagaga tagcggcttc tggtttcggt acaacgtaac gatgtattgc aggagaaaat 660 attcggaatc gaacttgtat atttttttac aggtcttcca cgaaaattgt gcaaaagtag 720 gtcgtccaaa aatgtgagaa tgagactatt tttcgctgac atcccgcctg ctttgaaggt 780 tgactgcaga cccgattgac gtaagagcct gcgtaggtgg ccattacgga aaaggcaggc 840 cgtgaaaact attttttctg cgtagttcgg gttgattttt gggttgacat tcattctact 900 agtatataaa aaagtaacta tcgggagact gttttcttat cactgtattc ttacccagcg 960 ttatgaacat gatcgctgct ggcaattttc ctccaatgtt ttgcaaatct gtacccacca 1020 ttttcgcgcc atatgctaat aagcgcgcgc tttatgctaa acgtctaaac cctgactccc 1080 gctaacgcga gcccgacgtg ttctctttat acgcagtttg gcgttgacgg ggactcccgc 1140 tgagcctacg gccgtacgct tcggagtgga cgggagaaaa tttggcgtta gcgggactac 1200 gttaacggga actctcttta cacggggctc ccgctaacgc agtcccgcta acgccagtcc 1260 cgctaacgcg aaaaacgtaa tgtaaagtgg gcc 1293 // ID DNAX-2_AP repbase; DNA; INV; 467 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-2_AP. XX NM DNAX-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-467 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2052-2052 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TA or TAA TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 467 BP; 162 A; 45 C; 48 G; 212 T; 0 other; cagggtgtaa cagatagaac tgacttttgg aataactttt gttctaatca atatttaaaa 60 tttttttttt atatataatt tatagtcact tgcgctacac atatattata caaaaaatta 120 tttttatttt ttaacactga aaataaaaaa tttaaaattt gatttaaatt tttttggata 180 tattcaaaat agccaatttg tgaatctgat tataagaaca attttgatag taaaaacatt 240 tatctaagat caagtagttt attttttaga atttttttaa atatcaatat ttttttgatt 300 aaatgaattt ggaacgtaat aacttttttc aaaatatttt ttttgggaat ttgctgacat 360 gagtatattt cttaaattat ttactaatca tataatttta acaatttttg tcatacgttt 420 aagtagcatc attttttttt ttcgtcaagt ctttctgtta caccttg 467 // ID Gypsy-97_AA-LTR repbase; DNA; INV; 187 BP. XX AC AAGE02017326; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-97_AA_; KW Gypsy-97_AA-I; Gypsy-97_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017326; Positions 8975 9161. XX SQ Sequence 187 BP; 63 A; 21 C; 39 G; 64 T; 0 other; tgtgctatgc tgagtttgaa ttataaatag taggctttga ttctcaatga aacgaaaggg 60 cagaataaat tgtgactctg gaagtgtaag acgtgtttgg tgaaattgta tgttaatgtg 120 aataatccga aacataagtt agtacagaat cattactttt cattaattct gtatttggaa 180 taccaca 187 // ID BEL-648_AA-LTR repbase; DNA; INV; 667 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-648_AA_; KW Pao_Bel_Ele213; BEL-648_AA-I; BEL-648_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-667 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 667 BP; 205 A; 111 C; 140 G; 211 T; 0 other; tgttctggca gcacggacga tgacaaccgg agcggtatgc agaatccgac tgtatgatag 60 atcgagagct acctagacat taagaattga aaaggccagt gaagtgggtg aaacgtgcac 120 cacattgttg agctctatga atttggcaca tttacgattt atttctatgc tttttattga 180 gttaagtgat tgctgattac caaaatacca catttttggt gtagatcgtg gattgcaata 240 ttgaggtggt ttgccaaagg gcgccggtaa tgatcagtgt ataattaatt tgataaccta 300 agaatctatt gtattgaact cctattgtac tattcctatc gcagattcga taattaatag 360 ctgttttcca gttataaatc cgataaattg aagctcaaat tgttaaatac gtgcgacaag 420 gtatgaatta ggtcgattat gtgatacaat gagtaattaa attatcgtca tcttgctagc 480 gcaggtatat ctaccatagg aatcttggct ttatacggta gggttctgaa ccgataggcg 540 aaaccaaatg tatggattat ggcacaattt tccgaaaaga agtctaataa acaattaatc 600 ttttttcagc tttgagctgc gcaaattgaa acgctgctgc gagatttctt ttacctatca 660 accaaca 667 // ID Shinagawa-1_CQ repbase; DNA; INV; 1642 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A non-autonomous DNA transposon family from Culex DE quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1642 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 93-93 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. 5-9 bp TSDs. Related non-autonomous elements, named CC Shinagawa, are found in Aedes aegypti and Culex CC quinquefasciatus. An insertion of DNA transposon ~89% identical CC to DNA-AATT-1_CQ at AATT (1261-1264) is excluded from the CC consensus. XX SQ Sequence 1642 BP; 540 A; 299 C; 323 G; 480 T; 0 other; gaatctatga catttccccg aaacccatat taccgaaggg acatttcccc gaatgccact 60 tccccgaaaa gacacttccc ctaatagtca tttccccgaa ttccatttac ccgaatttcc 120 catttcccct aatcgtccac atccccgaat gccattttcc cgaatgggcc acaatcccga 180 atgccactta cccgattagt catatccctg aatagtcatt tccctgaatg ccatttcccc 240 gaacgcggtc aagcggaaat gcccccggag cgcgcgctcc tgggcaccgg gcatttccgc 300 ttgaccgcgt tcggggaaat ggcattttaa aagaaaaaaa atgatgacaa tttttatcac 360 atcaaactac atatttaaag gttcgtattg gccttgaatg atcaactttc tttttggctt 420 cattcagaac agacgtagca ttttaggggg gagggttcaa aggttttaca atattcaaat 480 acaatgaata ttgttacaga ctttttttta tacatatatt ctcaaaacaa atactttttt 540 cttatagaca tgataataat aatgcaatag aagttttgtt attttttatg attcaaaaaa 600 cataccgtgc gattttcgta gtacagaacc ccgtacacac tgatcatttt attcacattg 660 ctggtttggg atttggcgaa aagtataatc aacaaaaact ttaaataaaa tttgtaccag 720 ccttatgtac ctatttgttc gatttttggg ttcttgaaca aaattcgaaa aaagacaaat 780 ttccttgcaa atttctcaaa aacaacatct aattatgcgg ttgaatacaa atgttaattg 840 aacaaaaaaa aactagtaca gtccagactt gattatccga aggcttcgga aaaatttcac 900 ttcggataat cgaatcacga ttttttttga tgcctatttt ttattgtcga gcgtatgtcc 960 cctaaactac gttaaagtga tttagaactt ttaaatccaa gatggcggcc aatatggcgg 1020 tgttgaaata ttgaaaaaat gcattttatt atttaatagg caatcaacta ttcaaatttg 1080 actaaaatgg ggttgcagaa ctcaaatttg atgtttaaaa caagaaaaag aaaaaaaaaa 1140 cgaaaaaaaa attgttcgtg attcgattat ccgaagtccc atacaaacct tcggataatc 1200 gaacttcgga taatcgaaac ttcggataat cgaggcttcg gataatcgag tctggactgt 1260 aatttacaac atttaatcaa cagcataccc gaaaaagctg tttgttcggt aaaattgaga 1320 aagaaaataa ctgttagtca tggaaaaagg ctttagcgaa aatgatgatt aggggatatg 1380 aaaattcggg aaaatttaaa tttaaataaa cggcaattgg cgtaagtgtc acttcgggga 1440 aatggcattc ggggaaatgg ccgattaggg gaaatggcat tcggggatat agccgattag 1500 gggaaatgat attcggggaa atggcattcg gggatgtggc tgattagggg aagtggtatt 1560 cggggatgtg gccaattagg ggaaatgatt ttcggggaaa tggcattcgg ggaaatgtca 1620 ttcggggaaa tgagctagac cc 1642 // ID BEL-88_CQ-LTR repbase; DNA; INV; 648 BP. XX AC AAWU01005995; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-88_CQ_; KW BEL-88_CQ-I; BEL-88_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-648 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 310-310 (2011). XX DR GenBank; AAWU01005995; Positions 9636 8989. XX SQ Sequence 648 BP; 206 A; 122 C; 136 G; 184 T; 0 other; tgttggcacc gcgatgaacc gtcctagtgc agcacgcgaa acacgccagc cgaaatgcta 60 gttgaaagaa gaacgcagaa aagggacgga gcatccggaa agccggctcg ttcagatcaa 120 aaacaaaatc tattgtatta tcacctaacc aacagcagaa attcgtgcag gagataattt 180 gctttatttg cgtcgataat taactttttt gaacagtgac cagtttgtaa actgttcatc 240 tcgttgtaag tgatcatttg atacaatatt tgtgcgcaca gttgtataat taatcttttt 300 attagcgata ttacttacac cgcgtggctc gaggccgcaa ggggcgaaga gtagagataa 360 ttagccagaa ttaaggctgc aaagccgttt tcggatatgt aagttaaaga ataattcgtt 420 ttcgcacaaa gttccttata tttgattatt tcgattagcg cgcagaagtt ccgctagcaa 480 agctagcctg tgagaagagg ttaacggagt tacaaggaaa ctattcatgt gagtttgttt 540 actacacttg aacccacaca atgctaaata aatattactt tttagtttta agctgttaca 600 acaaagaact gctttaaaaa tctagttgtc cgaactacgg tccgaaca 648 // ID Gypsy-58_CQ-LTR repbase; DNA; INV; 1331 BP. XX AC AAWU01036277; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-58_CQ_; KW Gypsy-58_CQ-I; Gypsy-58_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1331 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 496-496 (2011). XX DR Genome; AAWU01036277; Positions 7319 5989. XX SQ Sequence 1331 BP; 298 A; 388 C; 308 G; 337 T; 0 other; tgtgaaatta acatgagttc gcgttggaca gcagcgtcca ataagacgcg tttttgtatg 60 ggctggaaat ttttgctgca attgcgaaag cccacgccgg gtttcgtacg ccgtcctgcg 120 ggacgttcct ctacgttttg gatcagcctc tcggcgcgtc ttcgccgcca ttttggtgac 180 tctacattcc gtgaccgtgt gacgctccgg cccgccatct tggcagtcaa acggagtcgt 240 ggaagtccgt caattcccgt tggcagaaaa cacacaccgg tctccaattg ccggtcgtga 300 aaagggtcca ccacgttaca acccccggac ccttccgaga gccgtacgta gtcaaatcgt 360 cgtacacacc cgggtcggga atcctgttgg ttttaactta aagcctccaa tggtggggtg 420 ttcgtttgcc tccaggacga agacccagtc cggacagcgc ccgacactcc gtgtccatcg 480 cgcgtgagct gagctcggct cgctctttat tggctctgcg gaattgcact cttgtgaaat 540 cgagaagctt cccgtcgcgc cgcccacact ccgcaactta gttttcggcc acgagcgtcc 600 acgtagtccc cactgcacca acgagggagt ggacgacact gacagcgcta cttctcgggc 660 tgtcacgcac ggcaatcagc atcgagcgtc gctcgacggc ggattcccgc caaagcagag 720 gaaccaactc gacacctcca gactgccagc atctaccaga acatcgcatc agcagctttc 780 gccggccctc gacggccggg cgtcttcgca gcagcatgtg caccaacacc aacaaaaccc 840 tgcgcaggta tgtgagagac ccttgaatca tggccatgtt tcctctctcc catttccctt 900 acccacacaa acaattgtct agctcgaact ctcgcggtag ttttatgctc cgggaaatgg 960 aatcccgcaa atacacagtg aaaccatgat tttgcgaaca tgaatttatt cttttttttc 1020 cctcatgcgt taccttgaat aaatctctat gaacgtgcac tcttgataac cttttcgttt 1080 tcgttttttg aaatcatacc ttcttttttc cctaactttc cgttgtctga aagaaccaga 1140 atcgaagatc tccgtgggtc attacacgtt caatttgata agatcccatt ttagttattt 1200 ttggtaattt cgtctgcgta aaagccgtat gccgactaac gaccctgaga gctaatgtcg 1260 ggcaactaac acgacgggtg gcctcttgtt tcggcaagag tggcgcataa gccactgcgt 1320 ttaaaactac a 1331 // ID Gypsy-181_AA-LTR repbase; DNA; INV; 228 BP. XX AC AAGE02024317; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-181_AA_; KW Gypsy-181_AA-I; Gypsy-181_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024317; Positions 13654 13427. XX SQ Sequence 228 BP; 78 A; 29 C; 74 G; 47 T; 0 other; tgtagggtct gggaacactt ctagagtgaa tgagacgaga aggtagatgg agcaacaggc 60 tgtgacttct gcagggacgt ggataggaga gtaggaagga agaggaaaag gagagagggg 120 agtgtgagga gaaaagacag gaataaaccg ggctctggat tgtctatatt acgacagtgt 180 aactaaagtg aatttaagtg ataaatccga atatctgact cctctaca 228 // ID Gypsy-95_CQ-LTR repbase; DNA; INV; 188 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-95_CQ_; KW Gypsy-95_CQ-I; Gypsy-95_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 570-570 (2011). XX DR [2] (Consensus) XX SQ Sequence 188 BP; 54 A; 25 C; 66 G; 43 T; 0 other; tgtgggaact gagtacgtcc ctgacgtgct cgaggtaaga gagagagagg gagagcgtta 60 gggggaactg tcagacacag gggggaagag gattgatcgg ataggagaac acgcggagaa 120 taaacggcga tttggtatca gaaggtaatt gcttgtttca tttatttatt tgacggtcgg 180 atagtaca 188 // ID BEL-74_AA-LTR repbase; DNA; INV; 712 BP. XX AC supercont1.271; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-74_AA_; KW BEL-74_AA-I; BEL-74_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-712 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.271; Positions 1306265 1305554. XX SQ Sequence 712 BP; 210 A; 147 C; 175 G; 180 T; 0 other; tgtctgatag caggctcaac cccccctgtt acttcaaccc tagaactgct tcttaactag 60 gccaacgaac gcagcagcaa actggcatct ataaacacta gcgtaaaccc agctacaata 120 gatgtttggg atatctacgc agacggctat catgagcggg agcagatcgg atcggggatc 180 atctcttaca cagggagtct gcatactcag attaatgacg tcaacccgat acgttctgca 240 gtggtgaata atctatgaga ggagatgaga ctgagtgata ctcacaagac cgatggactg 300 agccgaataa aaacatagaa tgctgctttc gtaggtaggc gaggatcaac tgtagagtgg 360 aagttcagtt gcatctcgat cggtcggcag atatgtgggc ataggagagc ccactcggtt 420 gaattttaat gtgttagact taagcgttca atgttagttt ttaagatcca aaataaatgt 480 agtgatgtgt agatcgtgtt ttaaattcag taaacgtgag tgtttttata agtgcccgga 540 agaagccctc gaaaaggttc caaaggccca ataactcaag gagagtgagt tggaaccact 600 aagaccagtg ttcttctgcc agcaacatcg accttggtca aggtcgaata catcttcgag 660 tgttggagga attattgcaa ggctatcgaa gtgagtttct cgtcccccaa ca 712 // ID Env-1_SK repbase; DNA; INV; 1715 BP. XX AC . XX DT 17-MAY-2010 (Rel. 15.11, Created) DT 17-MAY-2010 (Rel. 15.11, Last updated, Version 2) XX DE Putative envelope protein. XX KW Nonautonomous; Env-1_SK. XX OS Saccoglossus kowalevskii OC Eukaryota; Metazoa; Hemichordata; Enteropneusta; Harrimaniidae; OC Saccoglossus. XX RN [1] RP 1-1715 RA Kojima K.K. and Jurka J.; RT "Putative retrovirus related to Chromovirus-type Gypsy LTR RT retrotransposon from the acorn worm."; RL Repbase Reports 10(11), 2019-2019 (2010). XX DR [1] (Consensus) XX CC This sequence encodes a protein related to envelope protein of CC Gypsy-2_SK, invertebrate herpesviruses, insect baculoviruses, and CC insect errantiviruses. This could be a part of Chromovirus as CC Gypsy-2_SK. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: the acorn CC worm genome. XX FH Key Location/Qualifiers FT CDS 114..1700 FT /product="Env-1_SK_1p" FT /note="putative env protein, similar to class I FT membrane proteins of Ostereid herpesvirus. It is FT distantly related to envelope fusion proteins of FT baculoviruses and errantiviruses (gypsy-type LTR FT retrotransposons in insects)." FT /translation="MNFPVPTPENIRTNNSTLDKSFTKVVNLPEYLGISSH FT DYTYIEISANDLMHCSGDKVKSCDHPFTIFQSDSSSCAHAIYLDKQDRVSK FT FCSVQILVNYNPRPSAFYLPNSKALIQGLTADTYVVCPGRETRSLPPCELC FT LVDMGCNCKLTSKEISLHTSLSQCTGNDTFLVHYVANLHFTRAIFQSYSLT FT NVTGSQTFVSKWKAELPDLSILQHAFNQSLATDAALTLDLAKVANSHNKRS FT IMYASNLDQLAFQCRPNFDLFSTPYSYLAQLFSPTMILAIISLTLYLLMFI FT KIRRQGGLIRQLLIATSTSNIIHTTKGAPILYLHSSTSEPISDPNICNYPS FT SLSFSFMHVAILLTTMVVFYLSKAMVLKLKRLLFSSSSITQGSKAKIILEL FT SNPMEAISFFVRDVSYPVSSITRTARQPIRSAWITQGFLTSKLNLKWRNLQ FT FLLEDSRVEEISLPPQISIPFGKSSSTIRILQTRYHIRILVGDAPVFTEIP FT IINTKNREIENSYEHLKAIPPIPTNIVTNPADQ" XX SQ Sequence 1715 BP; 498 A; 401 C; 278 G; 538 T; 0 other; gtaatatttt tcgccaaaag agtgttatat ttcaccaaac ttcaaattat ctatatattc 60 aactccgcat cccattgtca ttggtttacg cagagtatga tttatatcgt gttatgaact 120 tcccagtacc aacgccagag aacatacgta caaataattc tactctggat aaatcattta 180 ccaaagttgt caaccttcct gagtatttag gaatttcatc acacgactat acatacatag 240 aaatttcggc taatgatctt atgcattgta gtggagataa ggttaagtca tgtgatcacc 300 cctttaccat atttcaatct gattcttctt catgcgctca tgcaatttat ctggacaaac 360 aagaccgtgt ttcgaaattc tgtagcgtac aaatcttagt aaattataac cctagaccat 420 cggcattcta tttaccaaac tcaaaagcgc taatccaagg tttgactgct gatacatatg 480 tggtctgccc tgggcgtgag actagatccc tgcctccatg tgagttgtgt cttgtagaca 540 tgggatgcaa ttgcaaactt acttccaaag agatatcact tcataccagt ttgtcgcaat 600 gcacaggcaa tgatacattt ctggtacatt atgtggctaa tctccatttc acaagagcta 660 tctttcaaag ctatagctta acaaacgtta ctgggagcca gacgtttgta tcgaaatgga 720 aggcggaact tcccgaccta tccatacttc aacacgcatt caaccagtct ctggcaacgg 780 atgcagccct taccttggat ttagcgaaag ttgccaattc acataataaa cggtcgatta 840 tgtacgcctc taatttggac caattagctt tccaatgccg acccaatttc gatttgtttt 900 ctacgcctta ctcatatttg gcgcaattat tctcccctac aatgatatta gcgatcatat 960 ctctaacact gtatcttctg atgtttatta agattcgtcg ccaaggtggt ttaatacgac 1020 aacttttgat agccacttcg acatcaaaca taattcatac cactaagggt gcacctattt 1080 tatatttaca tagtagcacc agtgagccta tttcggaccc aaatatatgc aattacccca 1140 gctcattatc attttctttt atgcatgtgg ctattttatt aactacaatg gttgtgttct 1200 atttaagtaa agctatggtg ttgaaactca aacgactgtt gttttcatct tcttccatta 1260 cccagggttc caaagcaaaa attatccttg agctatctaa ccccatggaa gccatttcgt 1320 tttttgtaag agatgtgagt taccctgttt cgagcattac acgaaccgcc cgacagccca 1380 ttcgttcggc ttggataact cagggcttcc tcacatccaa actcaacctc aagtggcgaa 1440 atttacaatt tttgctagaa gactctaggg tagaagaaat ttcgttacct cctcaaattt 1500 cgatcccatt cggcaaatcc tccagcacta ttcgaattct ccaaacccgc taccacattc 1560 gtatcttggt tggcgatgcc ccagttttca ccgaaattcc catcattaat actaaaaatc 1620 gtgaaatcga aaactcatac gagcatttga aggccattcc acccattcct accaatattg 1680 tgacgaatcc agctgatcaa tgatatgtat aacta 1715 // ID BEL-54_CQ-LTR repbase; DNA; INV; 351 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-54_CQ_; KW BEL-54_CQ-I; BEL-54_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-351 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 262-262 (2011). XX DR [2] (Consensus) XX SQ Sequence 351 BP; 101 A; 102 C; 55 G; 93 T; 0 other; tgttggcgca caaagattgg aaacccctga aatgtgatcc acaagtcgga gatctactga 60 tctccctttg attcccacga atgatcgtcc tattcctgtt cctcaaaatt ctctcttctc 120 ccatagccca aatcccttat cattccctct ccaaattccc tgcacaaaac cgatcacaat 180 aaactgctcc agtatcagtc gcaattagac accgtcgaga acacaaccgt cgcgtaataa 240 aactaacttt tacaccgtcc gaactgttag cgtttaagtt taataaaccg tttctgtgcc 300 gtaatcgcgc gttttaagtg cagtgtccga aaccaaaacc gtgcacgaac a 351 // ID Gypsy12-NVi_I repbase; DNA; INV; 2989 BP. XX AC AAZX01024254; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy12-NV; KW Gypsy12-NVi_LTR; internal portion; Gypsy12-NVi_I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2989 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1142-1142 (2007). XX DR Genome; AAZX01024254; Positions 8418 5430. XX CC Positions [2169-2651] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 9..2972 FT /product="Gypsy12-NV_I_1p" FT /translation="MFLNVTINSKRIRMEIDTGVYATIFSEKVKNEFFNDL FT SLSQTKHSLEDYVENVLKPVGSLENLQVTLNDKTIKLGCFVLPGKGPPLIG FT RQWLAAFGLWPLMANSTNSISLNKIQDCNIPSIREQLVIEFKLLFGDTPSL FT FNKGKLKIYLKENAKPIALKARHVAYAMKPLVEDEISRLVRLGHLVPVESS FT EWATPIVIVNKSDNTIRICGDYKLSINEYIIVNKHPLPRIEDVFAAMQGGK FT KFTELDLAHAYMQFEVEESSQQYLTIATHLGLYRYTKMPEGISTCPGDFQT FT FIENTIRGVKNTTAYLDNIYVTGSTYEEHLENLKQVCTRLQESGLRVRPEK FT CKIIKKKIELLGFAIDKDGLHKSSSKVKAIVEAPKPENAKQLLSILGLINF FT YERFLEHRSDKLKPLYDCANSKEFFWSEACDNAFKVIKEELISPRVLAHYD FT PNEQLVLACDASDYGLSAILSHKYKDGSERPIAYASKRIPKKEMHRAIIDK FT EAAAIVFGFKKFYDYIFGKEIILRTDHEPLKRIFGPKTGIPLTATSRLQRW FT AYLLSGFKYKIEHIKSKDNGNCDVLSRLPIEDDSDVFESNYTAINYITSEL FT DILDSNAIAAETKSCKVLSKIMMYITSDWPSYDKLTDHEKKYYSKRTEFFV FT DENCILWGHRVVVPMSLQPFVLKELHLSHLGIVKIKMLARSYVWWPGIDND FT IEQLVNSCKVCLEERKKPPNIPLTPWPWPNKAWSRIHCDFLGPFMGHMYLV FT VIDAHSKWPEVIDFHNNTKAEKLISKFRDIFARHGLANHIVTDNGPQFTSD FT LFQNYLKKLGIKHTFSPPYHPATNGAAENFVGIFKDKVNKIVKGGRALEDA FT INIFLFDYRSVPHSTTGKSPAFLMTKREMRTRFDSLRPRTENTVYEKQHAQ FT IVQRKGSRRVSLEEGDTIMIDNYGKGDKKIEGTIVKQLSPSTYDVQIKPDK FT ICKRHADQIISVAPGKKTVRRSERLKNKCKVSS" XX SQ Sequence 2989 BP; 1069 A; 535 C; 605 G; 780 T; 0 other; aaaaaccgat gtttttgaac gtaacgatta attcaaagcg tattcgaatg gaaatcgaca 60 cgggcgtata cgcaactatt ttttccgaaa aagtaaaaaa tgaattcttt aatgatttaa 120 gcctctcaca aacgaaacat tcccttgaag actacgtaga aaacgtatta aaacctgtag 180 gctcgctaga gaatttgcaa gtaactttaa atgataaaac gattaaacta ggatgtttcg 240 tcctacccgg taaaggtcca ccgcttatag ggagacaatg gttagcagca ttcggtttat 300 ggcctctgat ggcaaactca acgaattcta taagcttgaa taaaatacaa gattgtaata 360 taccaagcat acgcgaacag ctagtaatcg aatttaaatt actgttcgga gatacaccaa 420 gtttattcaa taaaggtaaa ttaaaaatct atttaaagga aaatgctaaa ccaatagccc 480 taaaagcgag acacgtagct tatgcgatga agccactagt ggaggacgaa atatcacggt 540 tagtacgttt aggtcactta gtaccagttg aatcgagtga atgggcgacg cctattgtaa 600 tcgtaaataa aagcgacaac acaattcgca tttgcggaga ctacaagtta tcgataaacg 660 aatatatcat tgttaataag cacccgcttc cgcgtatcga agatgtgttt gcagccatgc 720 aaggtggaaa aaaatttaca gaattagatt tagcgcacgc gtatatgcaa ttcgaggtcg 780 aagagagtag tcaacagtat ttgacgatag caacgcattt aggcttgtat cgatatacta 840 aaatgccgga aggtataagt acttgcccgg gtgattttca aacatttatt gagaatacta 900 taagaggtgt taagaacact acagcttact tagataatat atatgtaacg gggtcaactt 960 atgaagaaca tttagaaaat ttaaagcaag tatgtacacg tttacaagaa agtggtttgc 1020 gagttcgccc agagaaatgt aaaattataa aaaaaaaaat tgaactttta ggttttgcta 1080 ttgataaaga tggactgcat aagtcgagct caaaagtgaa agcaatagta gaagcaccaa 1140 agccagagaa tgcaaaacaa ttgctatcta tattaggatt aataaatttt tatgaaagat 1200 ttttagaaca caggtcagat aaattaaaac cgctgtatga ctgcgcgaat agcaaggaat 1260 ttttctggtc cgaggcatgc gataacgctt tcaaggttat taaagaagaa ctaatttctc 1320 ctcgcgtact cgcacactac gacccgaatg aacaattagt tttagcttgc gatgcctccg 1380 actacggctt atcggcgatt ctatcgcata aatataagga cggatcagag cgacctatcg 1440 cgtacgcgtc gaaaagaatt ccaaaaaaag aaatgcatcg agccatcatc gataaagagg 1500 cagcagcaat cgtattcggt tttaaaaaat tttatgatta tatttttggc aaagaaatca 1560 ttttgcgaac agaccacgag ccattaaagc gaatttttgg gcctaagacg ggaatcccat 1620 taacagcgac tagtagattg caaagatggg catatttatt gtcaggattt aaatacaaaa 1680 tagagcacat taaatcgaaa gacaacggaa actgtgatgt gctatctaga ttgccgatcg 1740 aagatgattc ggacgtattc gaatccaatt atacagcaat taattacata acaagcgaac 1800 tagacatctt agatagtaat gcgatagccg cagaaactaa atcatgtaag gttttaagta 1860 aaataatgat gtacataaca agcgactggc cgtcttacga taaacttaca gatcacgaaa 1920 aaaaatatta ttcgaaaaga acggaatttt ttgtagacga aaactgtata ttgtggggcc 1980 atcgagtcgt ggtaccaatg tcattgcaac cgtttgtatt aaaagaacta catttgtcac 2040 atttaggaat cgttaaaata aaaatgctag cccgatccta tgtatggtgg cccggtatcg 2100 acaacgacat agaacaatta gttaattcct gtaaagtctg cttggaggag aggaaaaagc 2160 caccaaacat tccattaaca ccatggccgt ggccgaataa agcgtggagt cgcatacatt 2220 gtgactttct aggaccgttc atgggtcata tgtacttagt tgttattgac gctcactcca 2280 agtggcctga agtaatcgat tttcacaata atactaaggc agaaaaactc atttcaaagt 2340 ttcgcgatat ttttgcacga catggtctcg cgaatcatat tgttacggac aatggaccgc 2400 aattcactag cgacttattt caaaattact taaaaaaact cggtatcaag catacgtttt 2460 ctccacccta tcatcctgct acaaatggag cagccgagaa ctttgtagga atatttaagg 2520 acaaagtcaa taaaattgta aagggaggaa gagcgttaga agatgctata aatattttcc 2580 tattcgatta ccgtagcgtt ccgcacagca caacaggaaa aagtcctgcg tttttgatga 2640 cgaagcgaga gatgcgcacg cgcttcgact ctttgcgacc tcgcacagaa aacactgttt 2700 acgaaaaaca gcacgctcaa atagtgcagc gcaagggttc tcgtagggta tcactcgaag 2760 aaggagatac tatcatgatt gataattacg gtaaaggtga taaaaaaata gaaggaacca 2820 tagtaaaaca attatctcct tctacgtatg acgtccaaat aaaacccgac aaaatatgta 2880 agaggcacgc ggatcaaata ataagcgtcg cgcctggaaa gaaaactgta cgtcgatccg 2940 agcgtttaaa aaacaaatgt aaagtgtcat cttaaagagg gaggaattg 2989 // ID BEL-19_CQ-I repbase; DNA; INV; 5847 BP. XX AC AAWU01039046; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-19_CQ_; KW BEL-19_CQ-LTR; BEL-19_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5847 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 191-191 (2011). XX DR Genome; AAWU01039046; Positions 62744 68590. XX CC Positions [4792-5373] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2305..5847 FT /product="BEL-19_CQ-I_2p" FT /translation="MCNWSQFTGISIRLASRWFGKFGQTQDSSNGCCLNIS FT TSKFWALEEVTSAEAKSPLNSNNRCEDFFVQTTTIREDGKYVVRLPFKEEP FT VLLGDSFEQAKKRLLSLERKLARNPTVYDQYRAFLKEYLELDHMEMVEQKD FT ICKVRYFIPHSCVIKPDSTSTKLRVVFDASAKTTNGRSLNDAMLSGPPIQS FT DQFDLLLDFRCHDKVLMGDIAKMYRQVVVHEVDSWFQTILWRNTPEEPIQV FT YRLKTVTYGEAASSFLACRALHQVGEELRTELPEIAAMIQRRFYVDNLMMG FT GDSAEELLDRRRAVEAALMKRGFPLRKWAANDLSIIADIPNHDRETEIRIG FT DHEIIKTLGVAWSPREDTFKFLVEEQQAIRTSMTKRQLASEVLRLYDPLGV FT MQPIIITAKILLQGLWKTKLGWEDHIPDETLHEWQNLKKSLPKLAELKFPR FT RAMPSNPVRLELHGFSDASSRAYGAAIYAFAVDAQGNKSFNLLCSKSRVVP FT IKELSLPRKELLGAKLLAELMYRALGIVPHTVDKVHYWCDSQVVLAWIHSN FT VPHHEVYVRNRINIIQNLSEKTNWKYIPTDLNPADIISRGISVRKLLNVKR FT QLWLHATTYVLENHGEVQIAAVQQMAADPPRPEINDLIASYKYCNFYRYTR FT RHFAWLYRAKRNLMARSNLLKSRGICVQVKTGPLDVEDLEAGMCLIVKTMQ FT SICIPEEVKSIELTGQPTSQGPLQHLFPFIEDGVVRVSGRLDLADLPADQK FT RPIFLPREHPFIKIILIHIHRSNNHAGLEIVLAQFQAKYYMRGLRKTAQTI FT LRKCVLCARARPRRFEQQMGQLPRPRVNPSAPFTHTGIDLCGPFEVLPSKR FT AKVKLTMYACIFVCFSTKAVHLEIVENQSTSAFIASLMRFVSLRGRPEVIY FT SDNGRNFVGASRELDALRKVYNDEAFQQELIGSIAEEGIRFSFIPPRSPNF FT GGLWESNIKVAKRLFSAAARGASFNVLELQTVFYQVAAIMNSRPLTSVLSN FT AGAPEPLTPGHFLIGRAMNALPIPGSHLEERNLSMRWKRIQAQTHQFWHKW FT QNEYLQHLRCLAKWTKKQPNLQPGQIVLIGDDNNPVAKWPMGIVVETQAGS FT DGVVRVATIRVGANLYKRNVRLLAPLPIEMSSIEDYIEPCNEASRLEHENN FT VPSSVGPDKIWADRLRPKGGRK" XX SQ Sequence 5847 BP; 1735 A; 1333 C; 1364 G; 1415 T; 0 other; gttcatgtgt aacagcgaag ctcatcaggt cgtcttcatt cttccagtga ttatcacaga 60 tccctgcggg agcgggactt gagtgcaaat tcactttttc catttggtga ccccgacgtg 120 atcgggagga tcgcgttaat taattcaaga cgtcgttggt gtatcgctgc tggtgttcgg 180 cggctcttca gatgctgata ggtgcatcgc tgctagttca tcggtgttat cacaaatcca 240 gtgttcggcg gctcttcagc tgctgatagg tgcatcgctg ctagttcatc ggtgttatca 300 caaatccagt gttcggcggc tcttcagctg ctgataggtg catcgctgct agttcatcgg 360 tgttatcaca aatccagtgt tcggcggctc ttcagctgct gataggtgca ccgctgctag 420 ttcatcagtg ttcatcagtt ttatcacaaa tccagatcat cgttggtgca tcgctactag 480 tgtccggcgg ctccgcagct gctgatcgat catcgttggt acatcgctgc tagtattcgg 540 cggctcctca gctgctgata gttcatcgga ttaataagat aagtttgata gtaagacttt 600 ttcaatttat aatattccac cctgcctcgt cgaggaggga aagctaataa acccttaaag 660 gaaatggacg aaaacgcaca aatagcgttt tttaaaaaaa aacaactaat agcgttagaa 720 gggtccataa aaagcatggc caacgatgtt ggcaatgttg aaaaggcccc aaaaacaaaa 780 cctgagatta aggtggtaac tagcatgcta gaccaattat atgatcaagc atgtaacgcc 840 atcagcaaat tagaaggatt taccgagccc ctcctggaac gcaggaatgg gataatgctg 900 gtatacagtg gagcacgagt ggcccttgaa gacgcactcg gacagttcga acctttggct 960 cgagccacag ctccactgat ggagcaaact ttccagcagc agccaggtcg tgcagatcat 1020 ctgcccagac ttgagctgcc aacatataac ggcaacccaa tagagtggtt ggcgtttaaa 1080 ggccgattcg agaagcgaat cgcaaacata acagaagatt ccgacaaata tgccttctta 1140 atgaagtgcc tagaaggatt ctcgactgct cgaaacaaaa tcgacgcatt agagaattca 1200 ggtgctaaat ttcaagatgt cattaggttc aagagaataa ctactccgaa ccctaaggcc 1260 atcatgaact tgatggacgt cgtggacacc gctatccacg cagccaaaca aattcatgcc 1320 gacgctaatc cagcattgga ttgtgtcgct gacggactgt tggtggcgtt ggtcaaatct 1380 aaacttgacc cagaaacctc ctcaaggatc gaggacgcga tggacatcca tacagtgtat 1440 ggatgggacg ccttccgcga agaactagaa aagcggacta gccaaatggt gagtttcaga 1500 aaacgaaaaa gaaaagcacg ctaaaacggt gggtgctata accgccaatg attctgaaaa 1560 taaaaggccc aaaaagccta agaatcctaa ggataacgca tgctttgtct gtggcgagaa 1620 acacgccatt aggtactgcg ccaatttcaa aaaaatggcg ctgacgaaga gagtggacac 1680 ggtaaaagcc gcccacaggt gcttcagctg ccttaaccgt gggcattcat tccagaactg 1740 cccatcacaa aagtcttgcc ttgagtgcgg tctaaagcac cacactctac ttcacaaaaa 1800 tgcagaaggt gaaaataccg ctgccacttt accggcaatc atgccatcta cttcggccca 1860 gtcaccacaa tgactggagc gggaatatgt atttctggca actgccgaaa tacacatatt 1920 aggtgcgtcc aactggtatg taataagatg cttgttggac tcaggaagcc aggctgaagc 1980 aatcaccgaa gaagcagctt cagcccttgg aataccaatt atgcatagta atatgcagat 2040 taaaggtatt ggaggcagcc taaatattac cgggaagata gacacggtaa tagcttccgg 2100 gtacggaaac ttccgattac caattacgtt aattgttgtt cccaagatgc tagacgatca 2160 acctagcatt tcaattgaag ccaaagacat atcaattcct aagcatatgg aattggctga 2220 tcccaccttt tatcataaaa gaagcgtaga gataattctt ggagcaagag ttctcttcca 2280 aatcttaggg cctaggcgca tgcgatgtgc aactggtccc aatttacagg aatcagcatt 2340 aggctggcta gtaggtggtt tggtaaattt gggcaaaccc aggacagcag caatggctgt 2400 tgcctcaaca tctcaacatc taaattttgg gcgctggaag aggtgacttc cgcggaagcg 2460 aagtcaccac taaacagcaa taataggtgc gaagatttct ttgtccagac cactaccata 2520 agggaggacg gcaaatatgt tgttcgactc cccttcaaag aagagccagt gcttttagga 2580 gattcattcg aacaggccaa gaaacgcctg ttaagtcttg agagaaagct agctcggaac 2640 cctactgttt atgatcagta tcgagccttt cttaaggaat atctagaact ggaccatatg 2700 gaaatggtcg agcagaagga tatttgcaaa gtgcgctatt tcataccgca ctcatgtgta 2760 attaagcctg actcaacgtc taccaaactt agggtagtat ttgatgccag cgccaaaact 2820 accaacggaa ggtcactgaa cgatgcaatg cttagtggac ctccaatcca atcggatcaa 2880 tttgatctgt tgttggactt cagatgccat gacaaggtgc taatgggcga catagcaaaa 2940 atgtaccggc aagtggtcgt ccacgaagta gactcatggt ttcaaactat cctatggagg 3000 aacaccccag aagaacccat tcaagtatac cgtctcaaga cggtaactta cggcgaggct 3060 gcgtcatcat ttttggcatg ccgagctctt catcaagttg gagaagagct ccgaacagag 3120 ctaccagaga tcgccgcaat gattcaacga cgattttacg tcgacaacct catgatgggg 3180 ggagattcag cggaagagtt actggatcga aggagagcgg tggaagctgc gcttatgaag 3240 cgaggcttcc ccctccggaa atgggcggct aatgatttaa gcatcatagc ggacatcccg 3300 aatcatgatc gtgaaactga aatccgcatt ggcgatcatg agataattaa aacattaggt 3360 gtagcctggt caccacgaga agacaccttc aagttcctcg tggaagaaca acaggctatc 3420 aggacttcaa tgacgaagag acaactagct tcagaagtgt tacggttgta cgatccatta 3480 ggagtaatgc aaccaattat cataactgcc aaaatacttc tacaaggttt gtggaagact 3540 aaattgggct gggaagacca catccccgat gaaactttac atgaatggca gaatctaaaa 3600 aaatcattgc ctaaattagc tgagcttaaa tttcctcggc gggctatgcc aagtaatcca 3660 gtacgtttag agttgcacgg attttccgat gcctcatcaa gagcatacgg agccgctatc 3720 tacgcgttcg cagttgacgc acaaggtaat aagtccttca atttattatg ttccaaatcc 3780 agggtggttc caataaagga gctctctctt cccaggaaag aactactggg agccaagtta 3840 cttgcagaat tgatgtacag agcgcttggc atcgtcccgc acactgttga caaggttcac 3900 tattggtgcg attcccaagt ggtgttggcc tggatacact ccaatgtgcc tcaccacgag 3960 gtctacgtgc gcaatcgaat taacatcatt caaaacttat ctgagaagac aaactggaag 4020 tacattccga ctgatcttaa cccagcggat atcatatctc gagggatttc cgtcaggaaa 4080 ctgcttaatg tcaaaaggca gttgtggctt catgcaacta catatgttct tgaaaatcat 4140 ggcgaagtac aaatagctgc ggttcagcaa atggcagctg acccaccgcg accagaaata 4200 aacgatctca ttgcaagcta caaatattgc aacttctata gatataccag aaggcatttt 4260 gcttggttat atcgggccaa gagaaacctc atggcccgat caaatttgtt gaaatctagg 4320 ggaatatgcg ttcaggttaa aactggcccc ctagatgtag aagatctaga ggccgggatg 4380 tgccttattg tcaaaactat gcaatcaata tgcatacccg aagaagtcaa gagtattgag 4440 ttgacaggtc aacccacatc tcaagggcca ctgcaacatc tgttcccatt tattgaagat 4500 ggtgtagtaa gagtatccgg taggttggac ttagccgatc tacccgcaga tcagaagcgg 4560 ccaatcttcc ttcccaggga gcatccattc atcaagataa tattaattca tatacatcgg 4620 tccaataatc acgctgggtt ggaaattgtg ttagcccagt ttcaagccaa atattacatg 4680 cgcggcttaa gaaaaactgc acaaactatt ttgagaaaat gcgtgctttg cgcgagagcc 4740 agaccacgta gattcgaaca gcagatggga cagttacctc gcccacgcgt aaacccatct 4800 gcacctttta cccacacagg aattgacctg tgtgggccat tcgaagtttt accaagcaaa 4860 agggcaaaag taaaactcac aatgtatgcc tgcatattcg tgtgtttctc aaccaaggcc 4920 gtacatcttg aaatcgtaga gaatcaatcc acaagtgcat tcattgcctc gctcatgcga 4980 tttgtatcgc tccgaggaag accagaagtt atctattctg ataacggtcg caattttgta 5040 ggtgccagtc gtgaattgga tgcattgcgt aaggtgtata atgatgaagc ctttcagcaa 5100 gaactaatcg gaagtattgc tgaagaaggc atcagattct catttattcc gcccagaagt 5160 cccaattttg gaggactctg ggaatcaaat ataaaagtgg ccaagaggct attctccgca 5220 gcagcacgag gcgctagctt taacgttcta gagctgcaaa cggtatttta ccaggtggcg 5280 gccattatga actcacgacc gctcacttcg gtattatcaa atgcaggagc tccggagcca 5340 ttgactccgg gtcacttcct aataggtagg gcaatgaatg cactacctat tcccggtagt 5400 caccttgaag aaaggaatct ctctatgagg tggaagcgca tacaagcgca aacacatcaa 5460 ttctggcata agtggcaaaa cgaatatttg caacacttac gctgcctggc caagtggact 5520 aagaaacagc ccaacttgca accaggacaa attgttctta taggagatga caataacccg 5580 gtggctaaat ggcctatggg gattgttgtc gaaacccagg cagggtccga cggagttgtc 5640 cgagtggcaa ctatcagagt tggagcaaac ctttacaaga ggaacgtcag actactagct 5700 cctctcccta tagagatgtc ttcaatcgaa gactacatag aaccctgcaa cgaagcaagt 5760 aggctcgagc acgaaaacaa tgtgccttct tctgttggac ctgacaaaat ttgggccgac 5820 agacttcgac ccaaaggggg gagaaaa 5847 // ID Ginger1-2_AP repbase; DNA; INV; 4441 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.12, Last updated, Version 2) XX DE Ginger1 DNA transposon from Acyrthosiphon pisum. XX KW Ginger1; DNA transposon; Transposable Element; integrase; Ginger; KW Ginger1-2_AP. XX NM Ginger1-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4441 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC The sequence is not complete at both ends. XX FH Key Location/Qualifiers FT CDS join(478..1203,1203..1685,1689..1823,1826..2101, FT 3299..3436,3440..3670) FT /product="Ginger1-2_AP_1p" FT /translation="MFRPREMCLTAAVEADEPQQQKNTHVNLKWVMDYFEY FT KVYPASIPDKGSRANFRRCCRPFILKDGVLYYQKTMAKVIITPEERTQIIK FT LVHDGADSSLEASALSSHHGRDATQNLLKKRYFWPSMLNDVREYIKQCDAC FT QKANPATLKVIPDLQSVSVPKQVFKQIGVDIMTLPVVDDMKYVVVAIDYFS FT KWSEARALPDKSSDSVARFLYDDIICRHGCPLIHITDQGGEFLNKLISELF FT SLTGTKQRVTSAYHPQANGLVEHQNRTIKNCFLKVLQDNSNKWPYILQGVL FT FAPRTTQHTSTNFSPFQVLYQREPILLVDICNLKLIDEDTIISEDLGIISD FT DDVFDKVAFNKTFEKMLNMRSIMEDEVYTNIEKAQVRQRVSYNKRHKTDNA FT FNVNDKVLLNLKRDDRKGGWSALPWKPKIGYYIIDSINSNKTCVLMYKGKV FT MKQHHLSNLKHYFDKNIQLADDIEDDVVEIPGNAYQNVVLRYFNPVSYVWQ FT KMQCRYFNLTVEHFHKLSSVPKFLNKPSIIIHIIGDGNCFYRVLTCITTGF FT IFHSTTYILQNILYQIKIYCIMILLIIQKNKINYLIKVLIITYISEIWATE FT VELLAAALLMNTTIVVYTFSNKQDWQVFHKSGRIPDTFNMHEKCIYLVNSN FT SNHFDVVTSVEDNK" XX SQ Sequence 4441 BP; 1645 A; 613 C; 633 G; 1550 T; 0 other; aatattaaat tgttaaatta acattttttc aatctttgaa ttcaacgaat accctataat 60 atattatttc atacattata cttattaaat aatgaaaata aattactaat tataagtcat 120 tacttatggt aaaatatatt gatttgcctg ataaataacc taggaaatgt agaccctgaa 180 aacaagaacc ccgttaaaaa ttaccctacc cataataaaa atgactttca tagtgtactt 240 aggtacctaa taattagagc gtggatattt ataatttatg cactagaaaa ttgtgttaaa 300 gtggcttgca caatgaacta tagtttattt atttattaat ataacaatat tgtacaccat 360 aataatatta catcaaaatt aataattaac accttcccaa taatcagata ttcagataga 420 taggtatgta tgtatagtaa tatttattat taatgtcaat tcacttatag ataaaatatg 480 tttcgaccac gggaaatgtg cttgacagct gcagttgagg ctgatgaacc ccaacaacag 540 aaaaacacgc atgtaaattt aaaatgggtc atggattact ttgagtataa agtttatcca 600 gctagtattc ctgataaagg tagccgagca aacttcagga gatgttgtcg accattcata 660 ttaaaagatg gagttctgta ctaccaaaag actatggcca aagtgataat tacgccagaa 720 gaacgaacac aaatcattaa attggttcat gatggtgctg atagttcttt ggaagcatcg 780 gctttgtctt cacatcatgg tcgcgacgcc acacaaaatc ttttgaaaaa gcgatacttt 840 tggccctcaa tgttgaacga tgtgagagaa tacatcaagc agtgtgatgc ctgccaaaaa 900 gcaaatcctg cgacattgaa agttattcct gacctacaat cggtttcagt acctaagcaa 960 gtgttcaaac aaattggtgt tgatattatg acacttccgg tagtcgatga catgaaatat 1020 gttgtagtag caatcgatta tttttcaaaa tggtctgaag caagagcact tcctgataaa 1080 agttcagatt cagttgcccg cttcctgtat gatgacataa tctgcagaca tggctgtcct 1140 ttaattcaca ttactgatca aggaggagaa tttctgaaca aactcatatc agaattgttt 1200 tcttaactgg taccaagcag agagtgactt cagcgtatca tcctcaggcg aatggtttag 1260 tcgaacacca gaacagaaca ataaaaaact gcttcctcaa agttctgcaa gataatagta 1320 ataagtggcc ttatattttg caaggtgtac tatttgcacc tcgtacaact caacatactt 1380 cgacaaattt ttctcctttc caagttttat atcaaagaga acccatattg cttgttgata 1440 tatgtaattt aaaattgatc gatgaagaca cgataatatc tgaggatctt ggtattatat 1500 ccgatgatga cgtattcgat aaagttgctt tcaataaaac atttgaaaaa atgttaaata 1560 tgagaagtat aatggaagat gaagtctaca caaacattga aaaagctcaa gtccggcaac 1620 gagtttcata taacaaacgt cataaaactg ataatgcgtt taatgtgaat gacaaggttt 1680 tactttgaaa cctcaagaga gatgaccgca aaggaggttg gagtgcactt ccttggaaac 1740 caaaaattgg ttattacatt attgattcga taaattcaaa taagacctgc gtcttaatgt 1800 ataaaggcaa agttatgaag caaggcatca cttgtcaaat ttaaaacatt attttgataa 1860 aaatatacaa cttgccgatg atattgaaga cgatgttgta gaaattcctg gaaatgcata 1920 tcaaaatgtt gttttgagat attttaatcc ggtttcgtac gtgtggcaaa agatgcaatg 1980 tagatacttc aatttgacag ttgagcattt tcataagcta tcttcagtgc ctaagttttt 2040 gaataagccg tcaataataa tacatataat tggcgacggt aattgcttct atagagtctt 2100 aatcgtggtg gattacaggt gataaagatt ctcatacaat aattaggaaa gatctgaaaa 2160 aggtacctat ttataattat ttattataca atttcattgg caccataatt tgtatttttg 2220 aatttttttt tttttttttg atagtttgta gcaaatgatg atcaggttat taaatttatt 2280 ggtggtcaaa ctcagatgga agactattta ataaccaaca attgtaaaca caatttaata 2340 tttttggatg attattcatt atatatttag gtatattgct ataatattag ctaatatgtg 2400 tcacctgtgt tatatttaaa taattataat tatatttcaa taagtactat gtttcagtaa 2460 acaataattt tactataacc tatataattt ttttatgtta ttattaacat acacattact 2520 ttgcactccc ccccccccca tttatctatt ttatccaatt tttaatccac cctcagttat 2580 gtttccaaag agaataaagt gaaactagga tattttacta tctgcagaaa aaaatactac 2640 aagatactgg aaatttatat acccgatgta acaggaagaa ctgaccaaaa aaaaaaaaaa 2700 atgatgctat attcaacaaa tttttttaac attatatgat cagaaaataa tttaaaaaat 2760 atactcgtgt cagcaaattc ccaaaaaaaa tattttgaaa aaagttatta tgttccaaat 2820 taatttaatc aaaaaaatat tgatatttta aaaaattcta aaaaataaac tacttgactt 2880 agacaaatgt ttttaccata aaatttttac ttataatcag attcacaaat tggctatttt 2940 taatttatcc gaaaaaattt taatcaaatt ttaatttttt attttcagca ttaaaaaata 3000 aaaataattt ttttttaatt atccgtgtag cacaagtggg gtttaaatat agcttaaaat 3060 caaattttgt atgtggtcta gggaaatgtg caagaatgga caactgatta ggacttgcaa 3120 attttaggtc tgatttgatc tggtccaaat ttgtgctaaa tattggtata ggaatgaatt 3180 ccaacatgaa tataatatta tgaatcataa tacatgctaa ttgcataaaa aaatgtttga 3240 gtatgtgtgt ggtagttggg gggagggcag agagataata tatgctatta ttttatttac 3300 ttgtattaca actggtttta tatttcattc aactacctac atcctacaaa acattttata 3360 ccaaattaaa atatattgta ttatgatatt attaattatt caaaagaata aaattaatta 3420 tttgataaaa gtattataaa taattaccta tatttcagag atttgggcta ctgaagttga 3480 gttattagct gcagctcttt tgatgaacac aactatagtg gtatatactt tttcaaacaa 3540 acaggactgg caagtgttcc ataaatctgg tagaattcca gacacattta atatgcatga 3600 gaaatgtata tatttagtca actcgaatag taaccatttt gatgttgtta cttctgtaga 3660 agataacaaa taaaaactat aatatatcat taataggtaa ttaatagcaa caaaatatta 3720 ataattaata gttataaaat attatttaat atgtatcatc tcatttatta taattttata 3780 cttgaaaaat gacaagtact aagcagtcta tattgttaat ttttatattc gttattttaa 3840 caaattagat tagattattt tacacattaa aatattcaac tagttaagtt acaaagttaa 3900 tatgaattaa aactaactag ttaatttaga aagttaccaa aaaaggaact ttctaactta 3960 cctttaactt accaactata gttagtgcct agctttgata aaatgttata ttaatagtta 4020 aaattaataa ttataaaaca gcttttacta ttagtaccta ctacatattt taatatttgc 4080 agataatctt tttctaattt tttacatcat ggttatatgt gggttaatat tatgtgtttt 4140 ttgtttgcac tttagacaat ttactttttc catttattaa aaattaattt tatacctacc 4200 aaataattta gcctacatta tctaaatatg tatgggatga tacaaaaatc aaaaactggg 4260 ttcagaatag ataatttaat ctaatgcatt ttctcaaata tttgagcata atgtgagtac 4320 caaaaaataa aatacctaac tcgggatact atgtacaaaa aaaagtaatc atatttgcgt 4380 gtgtgctttc attatttata attattacct attattacat atacctaaat taattaaaat 4440 a 4441 // ID CR1-5_HM repbase; DNA; INV; 3728 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-5_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3728 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1833-1833 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 149..3205 FT /product="CR1-5_HM_1p" FT /translation="MENNLSYNINAKNIVFSDNNSDNHTETLLQSEYYSIP FT ESTQYLCENKNKFSILHINIRSMNKNFENFKCLLGELNHDFKIICFTETWL FT KSNETNSNFELRNYTSVHQMRDICVGGGISIYVHNSIDFIQRKDLNVNNTD FT CEALCVEIINKLAKNFIINAIYRTPAGSLKTFKTYLRTFLTTKNILQKHVY FT VVGDINIDLLNHALNSEAKTFIDILLEYNLIPTINKATRVTKKSSTLLDNV FT ITNNFHNSRFKTGIIKTDLTDHFPIFFITESVTLNNATHKSTVFMRQINES FT SICQFKNLLNNYVDWNLVLQSHDVNNAYDLFLNQFSKMYDKAFPLKVKLIN FT SKSVXSPWMTKGLLKSSRKKQKLYDKYLKNKTYKNETNYKNYKNLFEKTKK FT QSKVNYYAKLLEKNKGNPQKTWSVIRDLIGKNKIEKNNLPQKLIIEGKMIY FT HKEVIIEKLNNFFLDVGPNLAAKIPIGQKKFDSYLATTDLIMEEPILTKSE FT LHTAFNSLKKNKSAGIDQINVNVIKSVFDIIEPSLFHIFNLSLKSGHIPDK FT LKIAKITPIFKSGDETNISNYRPISVLPCFSKLIERIMYNRLYKYLSENKI FT LYNNQYGFKKNHSTDHAIXELVKHISNGFNSDCYTIGVFIDLSKAFDTVNH FT EILLKKLENYGVKNQSLLWFKSYLTNRKQFILKESKDSKNNLITCGVPQGS FT ILGPLLFLLYINDLYLSSKVLNTILFADDTNLFYSHKDITVLFKIFNEELD FT KINEWFISNRLSLNVEKTKFILFHKPSKAENIPLKLPNLLINNKIIKREST FT TNFLGVLLDEKLSWKFHIKYIEGKISKNIAMLYRTKPFLNNESLKNLYFSF FT IHSYLSYCNISWGSTNHAKLKKIYSKQKHACRIVFGANRKTQCEPLLRQLG FT ALNIYKLNIHQVMIFMFKAKLGFSPVTFQSYFNEISHKYPTKFSVNNFVVP FT INFLKLSSYQIQYRGPLVWKEFLPIFSKKQNFVLNCTLQSFKDDSKRHLLN FT MEFDILDFKYLF*" XX SQ Sequence 3728 BP; 1406 A; 544 C; 464 G; 1310 T; 4 other; agaagctgga atgtttgcgt atatttcata cgacaagttg gtggttcgcg atggtctcga 60 aaacccacgt aaatatttct tttaacgtta ctttattttt attttttatt tatcaatagt 120 ttttcaaatt ttaatttaac ttttagaaat ggagaacaat ctctcttata atattaatgc 180 taaaaatatt gtttttagcg ataataactc tgataatcat actgaaacct tattacaaag 240 tgagtactat tcaatacctg aatctactca atatctctgt gagaataaga ataaattttc 300 aatattgcat attaatattc gaagtatgaa taaaaacttc gagaacttta agtgtttatt 360 gggagaactt aatcatgatt ttaaaataat ttgtttcaca gaaacttggc ttaaaagcaa 420 tgaaacaaat tctaattttg agttaagaaa ctatacatca gttcatcaaa tgcgcgatat 480 ttgcgttggt ggtggtataa gcatatacgt tcacaattct attgatttta ttcaacgtaa 540 agatcttaat gtaaacaata cagattgtga ggcattatgc gtcgagataa taaataaatt 600 agctaaaaat ttcataatca atgctattta tagaacgcca gctggtagtt taaaaacatt 660 caaaacttat ctgcgcacat tcctaaccac aaaaaatata ttgcaaaagc atgtttacgt 720 agttggggat atcaacattg acttactcaa ccatgcctta aatagtgaag caaaaacttt 780 tattgatatt cttctcgaat acaaccttat cccaactata aacaaagcaa caagagtaac 840 taaaaaatca tcgacattat tagataatgt tataactaac aattttcata atagccgttt 900 taaaactggt ataatcaaga ctgatttaac tgatcatttc cctatattct ttattactga 960 gagcgttact ctaaataatg ctacccacaa atcaacagta ttcatgagac aaatcaatga 1020 aagttccatw tgccaattta aaaatttatt aaataattac gttgattgga atcttgtact 1080 gcaatcacac gatgtaaata atgcttatga tttatttcta aaccaatttt ctaaaatgta 1140 tgataaggca tttcctttaa aagtgaaact aataaactca aagtcggttg wwtccccatg 1200 gatgaccaaa ggcctactaa aatcatcaag gaaaaaacaa aaattgtatg ataaatattt 1260 aaaaaataaa acttataaaa atgaaactaa ttacaaaaat tataaaaatt tatttgaaaa 1320 aaccaaaaaa caatcaaaag taaattacta tgctaaactg cttgaaaaaa acaaaggaaa 1380 ccctcaaaaa acgtggagtg ttattagaga cttaattggg aaaaataaaa ttgaaaaaaa 1440 taacttaccg caaaaactta ttattgaggg aaaaatgatt tatcataaag aagttattat 1500 tgagaaatta aacaattttt ttcttgatgt tggcccaaac ctagcagcga aaattccaat 1560 tggtcaaaaa aaatttgatt catatcttgc aacaactgat ttaattatgg aagaaccgat 1620 tttaaccaaa agtgaactac atactgcttt taatagtctg aaaaaaaata aaagtgcagg 1680 tatagatcaa attaatgtta atgtaattaa atctgttttt gatattatcg aaccctcatt 1740 attccatatt tttaatctct cccttaaatc aggtcatatt ccagataaac taaaaattgc 1800 aaaaataaca cctattttca aatctggcga tgagaccaat atttcaaact acagaccaat 1860 ctcagtctta ccctgctttt ctaaactaat tgaacgcatt atgtacaaca gactttataa 1920 atatctatct gaaaataaga ttttatataa taatcaatac ggttttaaaa aaaatcattc 1980 tactgaccat gcaatarttg aattagttaa gcatatatca aatggattta atagtgattg 2040 ctatacaata ggggttttta ttgatttatc caaagctttc gacacagtaa atcatgaaat 2100 tcttttgaaa aaattagaaa actatggtgt taaaaatcaa agtttacttt ggtttaaatc 2160 atacctaact aacagaaagc aatttatact taaagaatca aaagactcaa aaaacaactt 2220 aataacttgt ggagtacccc aaggttctat tttgggacca ttattgttct tactttatat 2280 aaatgattta tatttatcat caaaagtatt aaatactatc ttatttgcag atgacactaa 2340 tttgttttat tctcacaaag atattactgt tctttttaaa atatttaacg aagaattaga 2400 caagatcaac gagtggttta taagtaaccg actatcatta aatgtagaga aaacaaaatt 2460 tatcctattc cataaaccta gtaaagccga gaatattcct cttaaactac ccaatctttt 2520 aatcaataat aaaataatta aaagggaatc gactacaaac tttctgggag tgctcttaga 2580 tgaaaaattg tcatggaaat ttcatattaa atatattgaa ggtaagatct caaaaaatat 2640 tgccatgtta taccgaacga aaccttttct taataacgag tctttaaaaa atttatattt 2700 ttcgtttata cacagctatc tttcatactg taacatttcc tggggtagca ctaatcacgc 2760 taaactcaaa aaaatttata gtaaacaaaa acatgcttgc aggatagtat ttggggcaaa 2820 tagaaaaaca caatgcgaac ctctcttacg tcagcttgga gctttaaata tttataagct 2880 taatattcat caagtaatga tatttatgtt taaagcaaaa cttgggtttt ctccagtaac 2940 atttcaatcc tactttaatg aaatctccca taaatatcca accaaatttt cagtgaataa 3000 ctttgtggtt cccataaact ttcttaaatt aagttcgtac caaattcaat atcggggacc 3060 tttagtttgg aaagaatttc ttccaatctt tagtaaaaaa caaaacttcg ttttaaactg 3120 cactttgcaa agttttaagg atgattcaaa acggcattta ctgaacatgg aattcgatat 3180 actagatttc aaataccttt tttgaaaaca atttaaaggt ttaaactaaa tcacttaaaa 3240 ataaattaaa tctttaacac tttctttagc ttttattaaa atggtttcaa accattagta 3300 taaaagataa cgaaaataaa attgtctttt tcgttcatat ttttgatgat attaaataat 3360 cagcccaaat acttatgttt acatatattg cttttttttt tttttaatta aacaacacat 3420 tttttttgtg tgtttttata actttcatga ctttttgtcg tttttacttt tttattttta 3480 aaattattct gcttgaagat tgttagcaac aaattatttt attgctgaag tttttgattt 3540 gtttttgttt ttgtatttat actgtctttt tgtcttttat catttattgt gcatgtaaat 3600 ttttgtgtga cggggctcgg tgataagaca acctgtcttc ttcttgctcc ggccaggatt 3660 ttcttttatt gtaatttctt ttttgtaata tttatcaacg gcaaattaaa taaataatta 3720 aaaaaaaa 3728 // ID Troyka-1-LTR_BF repbase; DNA; INV; 171 BP. XX AC . XX DT 29-APR-2008 (Rel. 13.04, Created) DT 29-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE LTR of the amphioxus Troyka-1_BF autonomous LTR retrotransposon - DE consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Troyka group; KW Troyka-1-I_BF; Troyka-1-LTR_BF; Troyka-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-171 RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., RA Salamov A., Terry A., Shapiro H., Lindquist E. et al.; RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire RT and genomic organization."; RL Science 317(5834), 86-94 (2007). XX RN [2] RP 1-171 RA Kapitonov V.V. and Jurka J.; RT "Troyka - a distinctive group of gypsy-like LTR retrotransposons RT inducing 3-bp target-site duplications."; RL Repbase Reports 8(4), 512-512 (2008). XX DR [2] (Consensus) XX SQ Sequence 171 BP; 31 A; 46 C; 40 G; 54 T; 0 other; cgccaacatg gtatgttagg attataggtc atgacccctg accccgaggt catggggcct 60 cgtgcgtttg tccgccatct tgtctctatt gtatgcattg ctgaagtctt cattaaatcg 120 tcctcgcggc caattaccat cttgagtctt cgttagtgtc cctaactggc g 171 // ID Gypsy-84_CQ-I repbase; DNA; INV; 5365 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-84_CQ_; KW Gypsy-84_CQ-LTR; Gypsy-84_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5365 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 547-547 (2011). XX DR [2] (Consensus) XX CC Positions [4490-4963] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 475..1680 FT /product="Gypsy-84_CQ-I_1p" FT /translation="MHSRNHELAELSEWDLENVDLYYRRILNSTMMEQHRF FT TFCTSNGQQYGQNSGQQYGQPPRPNLAPPYGTPFNGQPPFGQPYSTNPALS FT VGRLVPRVPIPNPWNFPAPAAYPVPAGFGYISPYVPNIPPRMPGHVRFDTT FT TANNSQNTAATGASTTNPGGGQGTTAGNNTPWPMDPLLGNGDHTNQNGGEN FT GNEEETGNGNREESETNNSAVRSQLNRMTTTIGTLVNHVLAMERTQAAQQS FT QLLSMMQGAVPYPPGTYQGPILPDGFNQGVDPNLFQYPNQWQANSTPLGGE FT SAPNNPNIPEGYDGNYEVTRGFKTCEPPKPYFTGDLEVSHPMEFLLDIDRY FT TFACRLDPACRLQLVTSCLEGEAKTWARGFGYLLHRLQPVLCTFPATVLGT FT ARAASGSG" FT CDS 2543..5320 FT /product="Gypsy-84_CQ-I_2p" FT /translation="MSKTANMNIISLNRIEAAVNRAATVETSGGTSPLFGS FT SILTDAEQTQLEDLLREFEDIFSDKPGLAHQYEHEIRVTDDSGFNQKQYPI FT PYKYMDKVRCQIQKMEEWGIISRAPTSYINPLVVTLKKSGEARVCLDARRL FT NKVLERDNEKPPEVQRLLQKLEGTRYFSLIDLTSSYWQIPVKKEHRKFTGF FT LFDQRSYVFNVLPFGLNTAVASFSRAMDFHLGPEVLQFTEKYLDDLLVHST FT TFDEHLGHLRVLFTKLREAGLTVSPSKSEFAKHSVKFLGHVVGPAGVSVDP FT EKLKAIEDFPCPTNEKALKSFLGLVSYVSRFTPNYAATAKPLYALLKSTVG FT WKWERNEQDAFEATKQLFLTFTTLRYPITGKQFRVQTDSSYSGLGAILFQV FT DERGNRMVISYASRLLQTAESNYSATELEALAVVWALNKWRQYLLGGSFVV FT FTDHKALIFLKRCQLLNGRLTRWILFLQQFDFDIVHCRGAENEAADALSRY FT PVGAIGLRSAGDGRKLLIARMAAEVKEFRKLMSKLPAQLASDSDWRAMWES FT AGDVELRRGKRRYMKAKGVLFVKEERDEQWRIIIPRSLIETTLKYFHDDSG FT HFGLQKTYQALRSSVFWKGMRKSAKEHVRACLTCQYSKPCLRPLCGSTRSI FT LTKGPNELISVDLFGPLPAGPAGVRYVFVMIDVFSKYVTFDSVKKPTATVL FT WRKLERRIQHLGKPSAVLCDQGTQFTAKFWRKTLKDNDIKLIFTSVRHPQA FT NPVERSMRELSRLCRAYCQLNHRSWAQQLNRFADWINCAYHESTERTPHEL FT QFGERPRDALQELFEYPGTGSSPVDIDKVYVMLRNKAKKRNRNAKPQTTFF FT KVGDEVLLRSNPVSSEADAIVKKFINVYEGPYVIKEVVQTDVFILAYPNHN FT KQRGMFHINLLKPFVTPNITGG" XX SQ Sequence 5365 BP; 1440 A; 1365 C; 1418 G; 1133 T; 9 other; ttacaacgtg ggggctcaac cgggatcttg gccgatccca aacgctaatt tgttgctttt 60 aatcggaaaa ctattgaact taaacagttc aaatacgaaa tatataaata aaaaaaaaat 120 aatagtgcaa aaatcgcgaa atatctgcat tggtgcaatt cggacattgt ccgtagaaaa 180 caaaggcggg tcgccattgc agtcgaggaa agcacatggt tggcgaggca gctgaaatat 240 agatacaagg ctttgcgaaa gcctaaaata gctacctcga gtcggatcga cattaattgc 300 cctgcgagtc agccgcacga atttagaatt gacattggaa acgaacctaa caccggccgg 360 cacgattgaa gttgtttaaa ggcttgcgaa agcctcaagg aggtgagtag cgttccttca 420 atgtttacct ttcttggttg agctgctggg tcttacgcat tccaggatcc ccgtatgcat 480 agccgtaacc acgagttggc cgagctgtcc gagtgggatc tcgagaacgt agacctgtac 540 taccgccgca tactgaactc aacaatgatg gaacagcacc ggttcacgtt ctgcacttca 600 aacggtcaac agtatggcca aaattcgggc cagcagtacg ggcaaccgcc acggccaaac 660 cttgcaccac cgtacggtac accattcaat ggtcagccac cgtttggtca accgtactcg 720 acgaatccag cgctttcggt gggtcggttg gtaccacggg ttccaattcc taatccctgg 780 aacttccctg cacctgccgc ttaccccgtg ccagcgggct ttgggtatat cagcccgtat 840 gttcccaaca taccgccgag aatgccgggt cacgttagat tcgacactac gaccgccaac 900 aactcacaga atacggcggc taccggagca tcgactacta atcccggcgg aggtcaagga 960 actactgctg gaaataatac accttggccg atggacccct tgctgggaaa tggtgatcac 1020 accaaccaga atggtggaga gaacgggaac gaagaagaga ctgggaacgg taacagagag 1080 gagagcgaaa caaacaactc ggcagtgcgc tcgcagctta accgaatgac caccacgatc 1140 ggaacgctcg ttaaccatgt actagcaatg gaacgtacac aagcagcaca acagtctcag 1200 ctactgtcca tgatgcaggg agcagtcccg tacccgcccg gaacctacca aggacccatc 1260 ctcccagatg gtttcaatca aggggtcgac ccgaatctct tccagtaccc taatcagtgg 1320 caggcaaatt ctacaccttt aggaggagag tcggcaccga acaaccctaa cataccagag 1380 ggttacgacg gaaattacga agtgaccaga ggcttcaaga catgtgaacc gccaaaaccg 1440 tattttactg gtgatctaga agttagtcac cctatggagt tccttctaga catcgaccgg 1500 tacacttttg cttgtcgatt agacccagcg tgtcggcttc aacttgttac ttcctgccta 1560 gaaggggagg ccaaaacgtg ggcaagagga ttcggctacc tcctacacag actacaacca 1620 gttctgtgta catttcctgc aacagtactg gggacagcgc gcgcagcgtc tggttcggga 1680 tgaaatgatg tacggaacgt actcgagcaa gtccacttcg agaatggccg agtacttcct 1740 gggtttggta gcgaaggscc ggtacctggt aggcgcacct tcggaagtag agcttgtctt 1800 gaacttgagc caccattttc cccgaaatgt ctgctctcga ctcagctcct gcccggacat 1860 caagtcggct tactcgatgt tgcagatcga agatcaccac aacaacaaca cgcaaaggac 1920 taactggaaa cggacmccag aaggwgcmaa caacacggcg gcggtggcta gcggtggcag 1980 cggaacaggt ggcaacagtc gcaattggag agacgccggt agcaacaatc gggctgtacg 2040 agctaccacc gctgaagcgg acgacgacgc agaaccggag actcatccca tcgctaatat 2100 ttttgttgga cccgaagagc tgctggcgga ggacaacgtc gacaactcat gcgtgaaagt 2160 ggaaaaatcg ccagtcatcg aggtgttgat tggcgagcga ccgatgctgg ttttgctgga 2220 tagcggcagt gaactcagct gcatcgatgg taacctctac cgagaagtta aggcgattgg 2280 agttgagatg tgtgagctgc cggtgcaaaa gacgaccata caaggggctt ttggagggaa 2340 gaaaaagacc atctccaatc agattttcct acctctttac atcagcgggg aagccgtcga 2400 tgtaccacta atcgtcgttg aagacttgtg cagtcccatg atatttggca tggacaccct 2460 tcaccacctc caagcttscc tcttgttcaa cacaaagagg tatgcttggt gttgaacgam 2520 aaagaagtaa agctcccstt caatgtccaa gacggccaac atgaacatta tttccctgaa 2580 ccggatcgaa gctgccgtca acagggccgc aacggtcgag acatcgggag ggacttcacc 2640 actcttcggc agttcgattt tgactgatgc tgaacaaaca caactggaag acctgctgcg 2700 agagttcgaa gacatattct ctgacaagcc ggggctagct caccaatacg aacacgaaat 2760 ccgagtgacc gacgattctg gctttaacca gaaacagtac cctataccct acaaatacat 2820 ggacaaggta cgatgccaaa tccagaagat ggaagagtgg ggcataatct ccagggcgcc 2880 aacgtcgtac atcaacccac tcgtagtcac gctaaagaaa tccggagaag cacgggtatg 2940 tttagatgcc cggagattga acaaggtgct agagcgggac aacgagaagc ctccggaggt 3000 tcaacggctg ctgcagaaac tcgagggaac acgatacttc tcgctgattg acttgaccag 3060 ctcctactgg caaatccccg ttaaaaaaga gcatcgcaag ttcaccggkt tcctgtttga 3120 ccagagaagc tacgtgttta acgttctacc gttcggcttg aacacggccg tggctagctt 3180 ctcacgagcc atggacttcc atttgggccc ggaagtcctc cagttcacgg aaaagtattt 3240 ggacgacctg ctggttcact caacaacatt tgacgaacac ctgggccact taagagttct 3300 tttcacgaaa ctgcgcgaag cgggactcac agtcagccct tcgaaaagtg agttcgccaa 3360 acatagcgtc aaattcttgg gtcatgtggt aggtcctgct ggcgtttcgg tagacccaga 3420 aaagctcaag gccatcgaag actttccctg tcccaccaac gagaaagcac tgaaaagctt 3480 tttgggcttg gtgagttacg tgtccaggtt cactccmaac tatgctgcca ccgcgaaacc 3540 actttacgct ttgctgaaat ctacggtcgg ttggaagtgg gagaggaatg aacaggacgc 3600 gttcgaagct accaagcagc tgtttctcac cttcactacc ctgcggtacc cgataaccgg 3660 gaaacagttt cgtgtgcaaa ccgacagctc gtacagtggt ctgggtgcga tcctcttcca 3720 ggtggacgag agaggcaacc ggatggtgat ctcgtacgcg agccggttac tgcaaacggc 3780 tgagagtaac tactctgcga ctgagctaga agcgcttgcg gtggtatggg cactaaacaa 3840 gtggagacaa taccttctcg gaggcagctt tgtagtcttc accgaccaca aggccctcat 3900 ttttctgaaa aggtgccagt tgctgaacgg ccggctgacg cggtggatac tgttcctgca 3960 gcagttcgat ttcgatattg tgcattgtcg cggcgcggaa aacgaagccg cggacgccct 4020 ttcacgttac ccagtgggag ctatcgggct acgaagcgct ggtgacggca gaaagctatt 4080 gatcgcgcgg atggctgctg aagtgaagga gtttaggaag ctgatgtcaa agctcccagc 4140 tcaacttgcc agtgacagtg attggagggc gatgtgggaa tctgctggtg acgttgagct 4200 gcgacggggc aaaaggcggt acatgaaggc caagggagta ctgttcgtga aggaagaacg 4260 tgatgaacaa tggcggatca tcatcccccg aagcttgatc gaaaccaccc tgaagtactt 4320 ccatgatgac tcgggtcatt ttggtttgca aaaaacgtac caggccctgc gtagttcggt 4380 gttctggaag gggatgagga agtcggcgaa agagcacgtt cgcgcatgcc tcacttgtca 4440 gtactccaaa ccttgcttga ggccgctgtg tggttccact agaagcatcc tgacaaaagg 4500 accgaatgag ctgatctccg tcgacttgtt cggacctctg ccagctggac ctgctggagt 4560 ccgttacgtg ttcgtgatga tcgatgtctt ctccaaatac gtaacgtttg acagtgtgaa 4620 gaagccaacg gctaccgtac tgtggcgcaa gttggaacgg cgcatccaac acctcggaaa 4680 accgtcagca gtactgtgcg atcaaggaac tcaattcacg gcgaagtttt ggagaaaaac 4740 gctcaaggac aacgatatca agttaatctt tacctcggtg cgacatccac aagccaaccc 4800 agtggaacga agcatgcgag agctgtccag attgtgccga gcatattgcc agctaaatca 4860 ccgatcctgg gctcagcagc tgaatcgttt tgccgactgg atcaactgtg cgtaccacga 4920 gtcgacggag cgaacgccac acgagctaca atttggagaa cggcccaggg acgcgctgca 4980 ggagttgttc gagtaccctg gcactggtag cagccccgtg gatatcgaca aagtgtacgt 5040 gatgctgcga aacaaggcaa agaagcggaa ccggaacgca aaaccacaaa cgacgttctt 5100 caaagtagga gacgaggtgc tgctgaggtc gaatccggtc tcgtcagaag cggatgccat 5160 tgtgaagaag ttcatcaacg tttacgaggg accgtacgtc attaaggagg tagtacagac 5220 ggacgtgttc atcctggcat acccaaacca caacaagcaa cgaggtatgt tccatattaa 5280 cctgttgaag ccattcgtta ctcccaacat caccggaggg tagccgcaaa cttttttacg 5340 tagagcgttt gctgaggggg ggagt 5365 // ID CR1-49_BF repbase; DNA; INV; 1803 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-49_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-49_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1803 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1803 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1620-1620 (2009). XX DR [2] (Consensus) XX SQ Sequence 1803 BP; 529 A; 381 C; 312 G; 581 T; 0 other; ccgtcgtttt gcgtaggtgt gcacacgagt tagcacctca gctaacggca ctgttcaacc 60 aaagtttacg tctcggctgg gttccgacac agtggaaaga cgctaatgta tgcccagtcc 120 tcaagaaagg ccgcaaggaa tttgttgaaa actatcgtcc catctctctg ttaagtattg 180 ttagcaaagt aatggaaaga tgtatgttta accgcatctt cccgtatctt aaggagcaga 240 tccacccttt tcaacatggt tttattaaag gccgctccac tgcaacactt ttactccaaa 300 tttatcacaa aattggctct attctagata atggcggcca agttgacgtt gtattgcttg 360 atttttctaa agctttcgac tgtgtctctc accgcctcct agtccacaag ctgaaaatgt 420 atggcgtaca ctccaacttg cttgcatggt tcaaaagtta tttgtcttgt cgtagacaac 480 gagtcattgt agaaaatgtg cattctgatt ggctaccagt tgtctcggga gttccccaag 540 gatccatact ggggccacta ctttttttat tattcataaa cgaccttcct tatgtggtca 600 gtaacacaat ggctttatat gccgacgact cgaagtgttt taaacaaatc tcaacagtta 660 ttgattgtgt ttctcttcaa aaagacattg agaacatgta caattggggc aatacgtgga 720 tgatgaagtt taacacagac aaatgcaaga tactcactat aggtagagga aaaaatcaaa 780 ttcagtttca gtacaaaatg gaagacagat ttcttgaggt tgtctctgaa tttaacgacc 840 taggagtttc tgcatctggt caattaacat ggaaatttca tatccaaaac attatctcca 900 aagctaactc aaccttaggt ttcattaaaa ggtctgtagg ctttcacgct cctttgtctg 960 tcaaaaaagc tctctatatg tcacttgtcc gaagaaaatt agaatattgt tctacggtat 1020 ggtctcctca cactcatgaa ctcatagaaa gactggagag tgtacagagg cgaggaacca 1080 agtttatcct caacaattac caaactgact acacgtacaa agacaggctg ctcagttgct 1140 cacttttacc tttgtcatat agacgtgaaa ttttagattt agttttcatt ttcaaatgtt 1200 ttttaggata ttacgaaata gatgtcaaca attacttaac tttcccacat cctctcctta 1260 gatcacatga tcaggccaag ttaaccccag gcaaatgtaa tactactacc tttatgtact 1320 cattttttct tagaattgct cacatttgga attctctccc tacagaaata agacaactca 1380 ggctaatccc agactcatca attaggcgct ttaagcaaga catttcaggc cattatctat 1440 cccttttgtc aacacacttc gacgctcata aaacttgcac atggaccact tgttgaaaat 1500 gctgtctgac atagttaact cgacaagttt cttcaatacc attactttat atctctaaga 1560 ctttcactca cagtttggtt gtatcgattc cttttgtttt tacagctatt cagataattt 1620 cttagaatgt tctgtcactt atgtgaatgt attaagtatt agttatgtgc tcttatttgt 1680 cacacatgta ttattttttt taaaatgtac cccggaggtg gcggccctga aaaggatatg 1740 ctaatcctgt tgccgtcacc tttcagattc cttctgaaaa gcttaataaa taaataaata 1800 aat 1803 // ID BEL-66_AA-I repbase; DNA; INV; 6845 BP. XX AC supercont1.276; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-66_AA_; KW BEL-66_AA-LTR; BEL-66_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6845 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.276; Positions 206386 199542. XX CC 'ATATT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 22..3678 FT /product="BEL-66_AA-I_1p" FT /translation="MPRSTRSAPGSISHCMACTGPETMRMVSCCQCKTWWH FT FECVNVDESIAAPDRPFICPNCQKPSPTIPSLKPTSEVSGRKAISLSGVSS FT TSSVRAKRARLLLEKLEAHKALMEKRLEHARREQMIRHEQEKQMQEAEIEQ FT ARLRMEQTIIEETFRVREEELLNEESDGQSVSSEQSSISKVRQWQKEQPYS FT ARSSTMMETAPDANGMIHTQVALTIGEEGNVVEGELEGRLQSIKQTYIEPV FT LGGFIDRTASTLGTNVGENSVANLNIPSTSQHIVMSRFSDKQSQIAARNTI FT QIPSSVSFEPQGRSSLVEPKLPANPSILVPHPFGRSEPGMSSSNVNFSDQV FT NRVPVDSEVHQQRSSAANSSAPSVRAPLPRPRRMELETTCPRQRTSFDDPL FT RSEPPPSVQGVPPERAIPEVHRRLPYSLPDENHWNQAWGPTPQQLASRHVM FT SKELPFFSGNPEDWPLFISSYNNTTQACGYSEAENLARLQRCLKGHALEAV FT RSRLLFPQAVPQVIETLETLYGRPELLIHSLLQKVRAVPAPKHDRLDTLIG FT FGMAVRNLCDHLEAGGQEAHLNNPMLLFEMVEKLPANMKLDWSLYKQRCGE FT VNLHEFAQYMSTLVRAATDVTLHYGPRPTPQPRDSRPEKGSKDKNFCGAHS FT VEDSKTSKKTDESMNTSTPSCLICKDPKHRVKECNGFTRKSVDERWKVIQN FT LGLCRNCLGAHGRRPCKVNKRCEVEGCQLKHHPLLHSKREKQEVKQGNTEQ FT LQGSNAVTNHHYAGKSTLFRIVPVTLFGNNRAVSVYAFLDDGSERTLIEEK FT IADELGIEGEHLPLCLQWTANVKRKENNSQRIALQISGANGSKHTLSDVRT FT VSRLDLPRQSLPYSKLAKLFPHLQGLPVHDYDQAMPCIFIGNDNAHITATL FT KLRGGQPGDPIAAKTRLGWTIYGKNSENQGNLAHTFHICECRDEQSLHELV FT VQFFSIESLGISTTMVLEPEEVQHANRILCETTKRVGTRFETGLLWKFDHW FT EFPDSYPMAVKRLQGLERRIQSDPVIGGSIKRQIAEYIVKGYIHKASEQEL FT NEADPRRVWYLPLGVALNPKKPSKVRIFCDAAAKVDGISLNSLLLKGPDLL FT NNMFEVLFGFREKRIAVCADLKEMFHQIQIRKEDRHAQRLLWRDDPTKSPE FT VYLMDVTTFGATCSPCSAQFVKNLNATEHAKEYPVAAVAIQRKHYVHSTNE FT TCKIIASRM" FT CDS 4261..6606 FT /product="BEL-66_AA-I_2p" FT /translation="MKIAEEVRHVHSRGGFHLRNWLSNSKEVLLRVGENDP FT VSEKNIKLDKGSATEHVLGLFWRPDKDDFTFSTNSARHVEQPTKRQALSVV FT MSTFDPLGFLAFFLIHGKILIQELWRAKIQWDQLLPEKLSEQWTRWIAYFQ FT CLDQISIPRCYFPRHSVKDILSLQLQIFVDASEEALACVAYLRAEFSNGIE FT IALVAGKSKVAPLKALSILRLELMAAVIGARLRQTIIERHSLQIDRTYLWS FT DSRTVLAWINSDHRNYRQFVACRVGEILSKTHANQWRWIPSKENVADDATK FT WGRGLCVSPSSRWFRGPDFLYLTEDHWPVNTQFEVSQTNEELRACLIHQEA FT MVEHQVNWKRFSQWSRLWRAIAYVYRFVQNLRRKANNTLLLLGPLTQAELA FT KAETTIWRWVQGEVFPDEVAILSKMRAKHTESLGKLERTSTIRKLSPFMDE FT NGVVCMDSRISAATFASFDTRFPIILPKDHTVTELVIGWYHRKFLHGNGET FT VVNEVRQRFHVPNLRTVIRKEKKKCAWCKLKQAAPAVPRMAPLPAARLTAF FT ERPFPYVRVDYFGPIAVRVNLGNCKRWVALFTCLTVRAVHLEIVHSLTTES FT CKMAFRRFIARRGAPVEIYSDRGTNFVGSSNELQREMCVIEQQLVETFTNT FT NTRWIFNPPAAPHMGGAWERLVRSVKTALAAMYTTRIPNEETLETLVAEAE FT GVVNSRPLTFIPLEKEQQEALTPNHFLLLNSNGVVQTPRTFANSTEVSRSQ FT WNLCRVMVDRFWQRWVREYMPELARRTKWYE" XX SQ Sequence 6845 BP; 1934 A; 1532 C; 1780 G; 1599 T; 0 other; aaatctaaaa gatatttcac tatgccaagg agtactcgca gtgcaccagg aagtatatcg 60 cactgcatgg catgtaccgg gccggaaacg atgaggatgg tgtcttgctg tcaatgcaaa 120 acatggtggc attttgagtg cgtgaatgtg gacgaaagca ttgctgcacc tgaccgtccg 180 ttcatttgtc ctaattgcca gaagccttcg ccgactatcc cgtctttgaa accgacatcc 240 gaggtgagcg gcaggaaagc tatcagttta tctggagttt cgtccacctc tagcgtaaga 300 gctaagaggg ctcgcctact gctggaaaaa ctcgaggcac ataaagctct gatggaaaaa 360 cgtttggagc atgcacgtcg agaacagatg attaggcatg agcaggagaa acagatgcag 420 gaagctgaga tcgagcaggc acgcttgcgg atggagcaaa ccatcattga agaaaccttt 480 cgagtcaggg aagaggaact cttgaatgag gaaagtgacg gacagagcgt ttcatctgag 540 caaagtagta tcagcaaggt gagacagtgg cagaaggagc aaccttactc tgcacggagt 600 tcaacgatga tggaaactgc gccggacgcc aacggaatga tccatacaca agtggctcta 660 accattggtg aagaaggtaa cgttgtagaa ggcgagctag aaggtaggtt acagtccata 720 aaacagacct acattgaacc agtgctagga ggcttcattg ataggacagc tagtacgtta 780 ggcacgaatg taggagaaaa tagcgttgct aaccttaata ttccgagtac aagtcagcat 840 attgtcatgt cacgtttctc tgataagcaa tctcaaatcg cggcacgtaa taccattcaa 900 atcccaagta gtgtgtcttt cgaaccacaa ggtcgttcat cgctggtaga gccaaaattg 960 cccgcgaatc cgtcaatcct cgtaccccat cctttcggtc gaagtgaacc gggcatgtca 1020 agtagtaacg taaactttag tgatcaagtg aaccgagttc cggtggatag cgaagtgcat 1080 caacagcgtt ctagtgctgc gaactcatca gcacctagtg tcagagctcc gttaccgcgt 1140 ccacggagaa tggaactgga gactacgtgc ccacgccaaa gaacgagttt tgatgatccg 1200 ttgcggtctg agccaccacc ttcagtacaa ggagtgcctc cagaacgtgc gattccagaa 1260 gtgcatcgaa ggctgcccta ctcgttaccc gacgaaaatc attggaatca ggcttgggga 1320 cccacgccac agcagctagc atcgagacat gtgatgtcta aagaactgcc cttcttttcc 1380 ggcaacccgg aggactggcc actctttatt agttcataca acaacacgac tcaagcgtgc 1440 ggatattcgg aggcggagaa tcttgccaga ttacagcggt gtctaaaggg gcacgcgctg 1500 gaggctgtac ggagccgtct attatttccg caagcagtgc cacaagtgat cgaaacactg 1560 gaaactttgt atggtagacc ggaactgttg atccattccc tactgcagaa ggtgcgagca 1620 gttccggcgc caaagcacga tagattggac acgttgattg gtttcggaat ggcagtgagg 1680 aacctctgtg accatttaga agccggaggt caagaagctc atttgaataa tccaatgctt 1740 ctcttcgaga tggtggagaa actgcccgca aacatgaagc tggactggtc gctatacaaa 1800 cagcgatgcg gtgaggtgaa tcttcatgag tttgcacaat atatgtcaac cctggtacgt 1860 gcagctacgg acgtgacttt acattacggt cctcgcccga cgccgcaacc acgagactct 1920 agaccggaga aaggtagcaa ggataaaaac ttctgtggag ctcattcggt ggaagattcg 1980 aagacgtcaa aaaagacaga cgagagcatg aatacgtcta ctccatcgtg cctcatctgc 2040 aaagacccaa agcatcgtgt gaaagagtgt aatgggttca ccagaaagag tgtggatgaa 2100 cgctggaagg ttatccaaaa tttaggattg tgccgtaact gtctaggcgc gcatggaaga 2160 cgaccttgca aggtcaacaa gcggtgcgaa gtagaggggt gccagttaaa acatcatcca 2220 ctactgcatt ccaagcggga gaaacaggaa gttaagcaag ggaatacgga acagctacaa 2280 ggctccaatg ctgtcaccaa tcatcactac gcagggaaat caaccctttt ccgcatcgtt 2340 ccagtgacgc tgtttgggaa caaccgtgca gtttcagtgt atgcgttttt agacgatgga 2400 tcggagcgaa ctctgatcga ggaaaagatt gccgacgagc tgggaattga aggtgaacat 2460 cttcccttgt gtttgcaatg gactgcgaac gtgaagagga aggaaaacaa ctcgcaaaga 2520 atcgctctgc aaatctccgg tgcaaacgga tcgaagcata cgctgtcaga tgtccggaca 2580 gttagtagat tagacctacc acggcaatct ttgccctact caaagctggc gaagctattc 2640 ccgcatctgc agggactacc agttcatgat tacgatcaag caatgccttg catctttatt 2700 gggaacgata atgctcacat cacagctact ctaaagttac ggggaggaca acctggagat 2760 cccatcgcag cgaagacccg gttagggtgg accatatatg gaaagaattc ggaaaaccag 2820 ggaaatctag cgcacacctt tcacatctgt gaatgccgcg acgaacaatc actccatgag 2880 ttggtagtac agttctttag catagaaagt ctgggaatat ctaccacgat ggttcttgaa 2940 ccagaagaag tacagcatgc gaatagaatc ctttgcgaga ctactaagag agttgggacg 3000 agattcgaaa ccggacttct ttggaagttc gatcattggg aatttccgga cagttacccg 3060 atggctgtta agcgattgca gggcttagag cgccgaatcc aaagtgatcc tgtgataggt 3120 ggaagcatca aacggcagat agcggagtac atagtgaagg gatatatcca taaagcatcg 3180 gaacaggagc tcaacgaggc agatccacgg cgcgtttggt atttaccctt gggtgtcgca 3240 ctgaatccca aaaagccttc caaagtacgc atattttgcg acgctgcagc caaggtagat 3300 ggtatttcgt tgaacagtct gcttctgaaa ggtccggacc tactcaacaa tatgtttgaa 3360 gttctcttcg gttttcgcga aaaacggatt gcagtttgtg ccgacctgaa ggagatgttt 3420 catcagatcc agatacgaaa ggaggatcga catgctcaac gactactctg gcgcgatgac 3480 ccaacgaaga gcccggaagt ctacctaatg gacgttacca cgttcggggc cacctgttca 3540 ccgtgctcag cacaattcgt caagaaccta aacgccaccg agcatgcgaa agagtatcca 3600 gttgctgctg tcgccattca acggaagcat tatgtacaca gtacgaacga aacctgtaaa 3660 attatagcga gtcgaatgta acaaaaggac tccatcatat atgatagata tttatgtgcg 3720 atcttaaaat tacattattg ctggtttgaa aatgaactat aattagctac caaactagat 3780 gaatatataa aaaagttcta atcgagattc aaactcgcga cctttggagc gacagctgca 3840 tgcccgagct ctctcggctg tctcaccacc ttagtgagtg gctgacgaaa tttgaacacg 3900 ttctgcgtaa aattagtgta tcacagaatc acttcgacgg ttggcggatt catcgagaag 3960 tggttgaaag agcagtcaac aatgcggagc gttctctcgg tcgcgtgatt gttgagcgtg 4020 gctgggtttg ttttgtggca gattcatgta aatttctaac ggatttaact gaaatatatt 4080 tggaagggcc ctgagtggtt tacaggtggt catatttgct tcagctgttg gctaacgctt 4140 gtaattttca aatgcctgcc acaattttcc gtcgatgttc aaattgcgtg ctgcttaata 4200 ttcacaaatt tttgctgtgt agatgactat gtggatagcg cggatagtgt cgaagaagcg 4260 atgaaaattg cagaagaggt gcggcacgtg cattcacgag gtggtttcca tttacgtaac 4320 tggttgtcga attccaaaga agttctctta cgggtcgggg aaaacgaccc agtctccgag 4380 aaaaacatca aattggacaa aggaagcgca acggaacacg ttcttggact gttttggaga 4440 cccgataaag atgatttcac gttttccaca aattcggcca gacacgtgga gcaaccgacc 4500 aagcgtcaag ccttgagcgt ggtgatgagc acttttgacc cattgggctt cctggcattc 4560 tttctgatcc atggcaaaat ccttattcaa gagctctggc gagcaaagat ccagtgggac 4620 cagttgttac cggaaaaact gtccgaacaa tggacacggt ggattgcgta tttccaatgc 4680 ctggaccaga tcagtattcc gcgctgctat tttccacgtc attcggtcaa agacattctg 4740 tcactacaac tgcaaatttt cgtagatgca agcgaagagg cgcttgcttg cgttgcatac 4800 ctgcgggccg aattttcgaa tgggattgaa atagcacttg ttgccggaaa atcgaaggtg 4860 gctcctttaa aagctttgtc catactgaga ctagaactta tggcagcggt gatcggggcc 4920 cgtttgcgac agaccataat cgaaagacac tcgttacaga ttgatcgaac ttacctttgg 4980 agtgattcta gaacggtgct cgcctggatc aactctgacc atcgtaacta ccgacaattc 5040 gtcgcatgtc gtgtaggaga gatactttcc aaaacgcatg ccaatcaatg gcgctggata 5100 ccatccaagg aaaatgtagc ggatgatgcc acaaaatggg gaagaggact ctgtgtgtca 5160 ccgagtagcc gatggtttag aggtccggac ttcctatacc taacggaaga ccactggcct 5220 gtgaataccc aattcgaagt ctcgcagacg aatgaggaac ttagggcttg cttgatacac 5280 caggaagcga tggttgagca ccaagtaaat tggaaacgat tttcgcaatg gtctcgtttg 5340 tggcgagcaa tagcatatgt atatcggttt gtgcagaact tgagacgaaa ggcgaacaat 5400 acgttgctgc tgcttggacc gttaacccaa gcagaactag ccaaagcgga gacaactatc 5460 tggcgctggg tgcaaggtga ggtgtttccg gacgaggtag ctatactttc taaaatgcgt 5520 gccaagcata cagagagtct tgggaaattg gagcgaacta gcactattcg taagctgtcg 5580 cctttcatgg acgaaaacgg tgtagtatgc atggacagtc gaatatccgc agccacgttc 5640 gcctcatttg acacccgctt tccaatcatc ttgccaaagg atcatactgt aactgagctg 5700 gttattgggt ggtaccatcg gaagtttctt catgggaacg gcgaaaccgt tgtaaatgaa 5760 gtaagacagc gattccacgt gcctaatcta agaacggtaa ttcgcaagga gaagaagaaa 5820 tgtgcatggt gcaagttgaa acaagcagcg ccagcggttc cgcgaatggc gcctcttcca 5880 gcagctagat tgacggcgtt cgaacgacct tttccgtacg tgagagttga ttacttcgga 5940 ccgatagctg tgcgtgttaa tctagggaat tgcaaaagat gggtagcgtt gttcacgtgc 6000 ctaaccgtcc gagctgttca tctagagatc gtacattcgc tcactacgga gtcctgcaaa 6060 atggctttcc gcaggtttat agcacgcaga ggagctcccg tagaaatcta tagcgaccgg 6120 ggaaccaact ttgtcggatc tagtaacgag ctgcagcgag agatgtgcgt aattgaacaa 6180 cagctcgtcg agaccttcac taatacaaac accaggtgga tattcaatcc gcccgccgct 6240 cctcacatgg gcggcgcgtg ggaacgtcta gtaagatcag ttaagacagc acttgcggcc 6300 atgtatacta ctagaattcc caatgaagaa acgctggaaa cgctggtggc cgaagcagaa 6360 ggagttgtta actcgaggcc tttaacattc atcccgctag aaaaggagca acaggaggcg 6420 ttaacaccca accatttcct tttgctgaac tccaacggtg tagtccagac tccaaggaca 6480 tttgcgaatt ctaccgaagt cagtaggagc caatggaacc tctgtcgcgt catggtggac 6540 agattttggc agagatgggt gcgagaatac atgccagaat tagcccgccg tactaagtgg 6600 tacgaatagg tgacccctat ccaaacgggc gacttagtaa ttattgtaga ggacaacgtg 6660 cgtaacggat ggattcgagg acgggtcaca gaagtgatca acggaagaga tggtcgtatt 6720 cgtcaggcgg tggtgcagac agctacggga ttgatgcgac gtcctgttgc taagctggct 6780 aagttggatc tacaggaaag taaggttgac tcgggcaagc cggatcaact tacgggtcgg 6840 ggaga 6845 // ID ITmD37D_Ele6 repbase; DNA; INV; 1300 BP. XX AC . XX DT 13-OCT-2010 (Rel. 15.1, Created) DT 13-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An ITmD37D DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; ITmD37D_Ele6. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1300 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1300 RA Kojima K.K. and Jurka J.; RT "ITmD37D-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (27-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~98% identical to consensus. This consensus CC is ~99% identical to the original sequence in [1]. TA TSDs. 29-bp CC TIRs. XX FH Key Location/Qualifiers FT CDS 160..1173 FT /product="ITmD37D_Ele6_1p" FT /note="transposase." FT /translation="MTKWQRIRDLSHAQMSQKDISIAVGVSRWTVKRVLDR FT EKKGQLSAKVATGRPRSARVPRIVAAIKRKIRANPVRSMRKMAKEHGISDF FT SVRKIVKDDCGAKSRARKKTHMITNRIRELRVERCKKIMNYLKRKKTVILF FT TDESMFTVDPVSNSRTDRYISSLPVKDVPDEHKNVFLTKHPASVMVFGLVA FT SDGKKMDPVFIPEGDKVNTEVYIGILSKHVLPWIKEQYGPNPNVVFQQDGA FT PCHTSNRTQKWLQEHLPFWAKDMWPPSSPDLNPLDYSIWRVMKAKACSKRH FT PSTASLRQTLTRTWREMDPRYIIRTCRAFRKRVEAVIAANGNSIDD" XX SQ Sequence 1300 BP; 387 A; 311 C; 312 G; 290 T; 0 other; ccgtatgcgt gaaaatgttt gcaccatcgc acaaataagg cgctcgttgc acctgtgcat 60 acaatgcctt ggttttcttt gcatgcacgt aagctataac tcataccaca gtaggctgcg 120 agtaaaagaa accgatttct ctagtttgac agttccacca tgacgaagtg gcagcggatt 180 cgtgatttat cccacgcgca aatgtcgcaa aaagacattt caatcgccgt gggagtgtcc 240 cggtggaccg tgaagagggt tttagaccga gaaaagaaag gacaactatc cgcaaaggtg 300 gccacaggtc gtccgaggtc agctcgggtc cctaggattg tcgctgccat caaacggaag 360 attcgggcaa atccagtgcg ttcaatgcgg aaaatggcga aggaacacgg tatcagtgat 420 ttttcggtac ggaagatcgt gaaggacgat tgcggggcga agtcgagagc acgaaaaaaa 480 actcacatga ttaccaaccg catccgggag ctacgagtcg agcgatgtaa gaagatcatg 540 aactacctca agcgaaaaaa aacggtaatc ctgttcaccg acgagagcat gtttaccgtc 600 gatcccgtat caaattccag aaccgaccgc tatattagtt ctctcccggt aaaagacgtg 660 cccgacgaac ataaaaacgt gtttctgacc aagcatcctg ccagcgttat ggtatttgga 720 ttggtggctt ctgatggcaa aaaaatggac ccggtgttca ttccagaagg cgacaaggtg 780 aacacggagg tttacatcgg gattttaagt aagcatgtcc ttccgtggat caaagaacaa 840 tacggcccga atccaaatgt tgtgtttcaa caagatgggg ctccgtgcca cacctcaaac 900 cgcactcaga agtggttgca agaacatctg ccgttctggg caaaggatat gtggccgcca 960 tccagccccg atcttaatcc gctggattat tcaatatggc gcgtcatgaa ggccaaggcc 1020 tgttctaaac gccacccaag tacagcaagc ctccgacaaa cattgacccg tacctggaga 1080 gaaatggacc cccgctacat tattaggacc tgccgtgcgt tccgaaagcg ggtggaggcc 1140 gtaattgcag caaacggaaa ctcgatcgat gactaacata gtttaagact caaaaaaaca 1200 tccccccttt aaataaataa aaaacataaa aaccgaaatt tttcagattt ttttttaatt 1260 ttgacgcagc gcaatggtgc aaacattttc acgcatacgg 1300 // ID hAT-4_SM repbase; DNA; INV; 2494 BP. XX AC . XX DT 02-OCT-2007 (Rel. 12.1, Created) DT 08-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-4_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2494 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1033-1033 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 505..2283 FT /product="hAT-4_SM_1p" FT /translation="MSKKRCWNDDYVRYGFTFTTEKDGTQRPQCILCCNKF FT SNANLKPSKLNEHFTKQHGGVDAGYDFESLKSKRVRFDKSGTITTFGFISH FT EKPLLQASYQVAYICAKKKKPHTIAEELVKPCALEMAKIVLGTEAQQKLQQ FT IPLSNDVIRTRISDISKDILQQVIEDIKASNVPVGIQLDESTDVENCSQLM FT VFVRYVKEKEIVEEFLFCEPLELTTKGIDIFNKVKDFLLQYEIPMNKIGSI FT CTDGAPSMLGKNSGFVAYVKKEVPHVTITHCILHRHALATKTLPEKLKKTL FT ITSVKVVNFIRGRALNHRLFRALCEELGAEHTVLLFHTEVRWLSRGRMLNR FT VYELRSEIMQFLNNQGSSLADEFSNRDAIIQLAYLADIFTHLNELNISIQG FT FGVNTITAKEKLTAFSRKLSIWINRVDKINYANFPLLEEVLVSDDERNNIT FT TEIKDHLQKLSESFEGYFATGDIDISEKWILDPFIYNLDSMDDSNLLKDDL FT VELRTNGHNQMEFEAENLESFWCAQVMSYPRLAKIALQHLVPFATTYLCEM FT GFSSLVHIKTKARNRLNASNDMRVAISKKIPRFSKIIEEKQQQKSH" XX SQ Sequence 2494 BP; 901 A; 347 C; 438 G; 808 T; 0 other; cagtggttct taaccggtgg tgcacgcacc ccttaggggt gcgagcttga atattttggg 60 gtgcgaaatt gtaattttac aaattcacgt gtaggtagca aaataaaaaa ctaatcttaa 120 aaattcattt tggttagtaa attttttatt ccctaaacat tttttaaata tgtttgtaaa 180 ttataaacca tcgagacaaa ttaatttttg tattttttct acaattacaa aaattcaaat 240 gcgtttctag ttagaatata aatattcaac taaacttgta aaattaactt gtattaatta 300 ttttattatt aatatttaac ttttaatatt caacacaaaa attcaaactt gtaatagtca 360 acactaaaat tcgaatctaa ttattagaat ataatttggt acattaaagt caaattcaaa 420 aggtaagcta aaatatagca ataatagata gaagtttaat atttttttta gtgttttatt 480 aaaaaattta atttagttta cgccatgtca aagaaacgat gttggaatga cgattatgtt 540 cgttatggtt ttacatttac aacggagaag gatggaacgc agcgaccaca atgtattctg 600 tgctgcaata aattttcaaa cgccaatctc aagccgtcga aactaaatga gcattttacc 660 aaacagcatg gtggtgtaga tgctggctat gattttgaaa gtttgaaatc taaacgagtt 720 cgatttgata aaagtggaac tatcacaact tttggattta tatcacacga aaagccgctg 780 ttgcaagcgt cttatcaagt tgcatatatc tgtgccaaga agaagaaacc acatacaata 840 gctgaagaac ttgtaaagcc ttgtgcatta gaaatggcta aaatagtgtt gggtactgaa 900 gcacaacaaa agcttcaaca aattcctttg tcaaatgacg taatacgtac tagaatttct 960 gatataagta aagacatctt acaacaagta attgaagata ttaaagctag taatgtccca 1020 gtgggtattc aattggatga atctacagat gttgaaaatt gttctcagct gatggtgttt 1080 gtacgatatg taaaagaaaa agaaattgtt gaagaatttt tgttttgtga accattagaa 1140 ttgactacaa aaggaattga tatatttaat aaagtcaaag actttttatt acaatatgag 1200 ataccgatga acaaaatagg atcaatttgt actgatggag ctccttccat gttaggcaaa 1260 aattctggat ttgtagcgta tgtaaaaaag gaagtgcctc acgttacaat tacccattgt 1320 attttacatc gacatgcgct agctacaaag acgttgcctg aaaaattaaa gaaaacatta 1380 atcacttcag tcaaagttgt aaattttatc agaggtcgag ctttaaatca tcgtctcttc 1440 cgtgctttat gcgaagaact tggtgccgag catactgttc ttttatttca cacagaagta 1500 aggtggctat cgcgtggacg tatgctaaat cgtgtatacg aattgcgaag tgaaataatg 1560 cagtttctta ataatcaagg cagtagttta gctgatgaat tcagcaatag agatgctata 1620 attcagctag cgtatttggc agacatattc acgcacctta atgaattaaa tatttcaatc 1680 caaggatttg gagtgaacac tataacagcg aaagagaagt tgactgcgtt tagtcgaaaa 1740 ctttcaattt ggattaaccg tgttgacaag ataaattatg ctaactttcc tttgcttgaa 1800 gaagttcttg tttcagatga tgaaagaaac aatattacca ccgaaattaa agatcatttg 1860 caaaagttga gcgaatcttt tgagggatat tttgctactg gagatataga tatatcagaa 1920 aaatggatat tagatccatt tatttataat ctggactcaa tggatgacag taatttattg 1980 aaagatgacc ttgttgaact acggacaaat ggacataatc aaatggaatt tgaagcagag 2040 aatttggaaa gtttctggtg tgctcaagta atgtcatatc cacgattagc taaaatagcg 2100 cttcagcatc ttgttccatt tgccacaaca tatttgtgtg agatgggatt ctcgtcactt 2160 gtacatataa aaacaaaagc tagaaataga cttaatgcta gtaatgatat gcgtgtagct 2220 atttcaaaaa agattccccg gttttcaaaa ataattgagg aaaagcaaca acaaaaaagc 2280 cattaataaa aactttgata atttataatt gattatttca gctaacaatt aaaatttata 2340 aaaaatttgt tacgcacata aaataaataa aaaatatgtt ttgcataatg tttgtttttt 2400 aatttatata actcaataat ttctagtaag gggtgcgaga atattgtttg aaaattttag 2460 gggtgccggt agctaaaaag gttaagaacc actg 2494 // ID Academ-2_Lgigantea repbase; DNA; INV; 6638 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-2_Lgigantea. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6638 BP; 2090 A; 1335 C; 1403 G; 1810 T; 0 other; tagagcagga gcccagaggg gtcagcctag ctgaaaaacg aaccaattct tatatgtatc 60 actattctcc atggacatta cctttaaagg aaatggacgt ctgccgggca tgcattggtg 120 ggtgtggcca ggagaggtca aagaagagta caattaagat aggctagttc agcttgtcaa 180 ataactaata gaaactacta ccaaagtagt cagcatacag ccatatacta taaatgacta 240 ttgccagaca tttccagtgg tggactgtgt ccgccagaca atgatatccg atacaagaaa 300 attgagctag atcagcttat gagctaactg catgaaacct tttgtaaaat cttcgtcttt 360 aacatgtgta tcaagaatga ccactgccag acattttctg tggtggaaaa cccctgccag 420 acgcccccaa atgaccccac cccctaaaat aacctaggtt agcccatgag ctaaccgcat 480 gaaacctttt gcaaaatctt cgtctttaac atgtgtatca agaatgacca ctgccagaca 540 ttttctgtgg tggaaaaccc ctgccagacg cccccaaatg accccacccc ctaaaataac 600 ctaggttagc ccatgagcta accgcatgaa accttttgta aaatcttcgt ctttaacatg 660 tgtatcaaga atgaccactg ccagacattt tctgtggtgg aaaacccctg ccagacgccc 720 ccaaatgagc ccacccccta aaataaccta ggttagccca tgagctaacc gcatgaaacc 780 ttttgtaaaa tgttcgtcat tgtttcgtgt atcaagaatg accactgcca gacattttct 840 gtggtggaaa acccctgctt gacatttata ataattctga aaagaaaata ttaaaaaagt 900 gaccatgtta gctaatggtt taattttatg atattgtggt ctgaaaatgt tatctatacc 960 attagaatgg cttccctgga aataaaatgg aggcctgaat tgaaaggcct ttctgtcact 1020 tttacactca aagaaaacgg taagtagttg tttttttaat acgtctcagt attaccacaa 1080 gacggcattt acctctttgt gcacctatat tgaaactgag gtaatagaca acaagcgatc 1140 tgtcttagtc tccgatttac tagacgtgta caaagttgag tttacaggta ttgggggcga 1200 aagggcagac attgtgacct acacagctca gaatcttctc agaaagatta atgacaagtt 1260 aggatcaaaa atcaatacca agttagctga ttatcggaag ggaaacttca ttcacagttg 1320 tactgtaaca gaagatgcca gagcccaact tcatgaggat gctgaacgac accaagagga 1380 tgacaaatta cgatgggcag ctcttcatat tcgttcaaag ataatgcagc tgccaaaatt 1440 taaaactccg aatcccacaa cagtgcaaaa tttgaaggaa tgtgcccaag aaataccaga 1500 gcagttggat ctcttttttc aggagtctct cctctgtgga ttgacaaaaa gtccccaagg 1560 ctcattcaag gatacaatag atcgcaaggt gactgcaatg gcttctgatg caatttacaa 1620 tgtttctcgt ggtacagtta aaccatggaa gcacattgct ttgggtcttg gcttagcatc 1680 tctcaccgga tccaagctat ccaagcagat tttaaacaga gcgggacaca gcatcagtta 1740 cagtgaaaca aaaggtctgg agacagaatt tgcatattca gtagaatctg atgaacgtga 1800 cactcctgat ggtattcgac ttgaaccagg tttagcaact gcctgtgtct gggacaacaa 1860 tgatgccaac atagaaacat tagatggcaa agaaacatta cacgccactg tcggtcatac 1920 ataccagaat attttgcagc atgacagaga aacaaataac attccaatag aatttcgaga 1980 tggaaggaac cgacgaaggt ttgttggaag tcaacgagag atacccccat ttaggaaacc 2040 tctcaatacg gctatgtttg tcacgagtgc taatatgtct gaatcttcca acgcaattac 2100 agctgaatct acagatacgg ctgaaacatc caatatgatt acatctgaat ctacagaacc 2160 aaatgtaatc cagaggaaag taaaactggc aatgaaagca ctggatctct gttggctttg 2220 gaaattgttt gagggcaaca caccactgta tgctggattt atcagcatgt acatcaagga 2280 tgtcttaccg atgcaacgaa tttgctacat ggaccctata tctagatcgc caactaacaa 2340 tgatgtagtc aaagagacaa tgatccgcac catgaatgtt gcaagggaaa caggacaaga 2400 ttattccatt gttacctatg atttggctgt ggctttgaaa gcatactcta tacaagcaat 2460 agaaagtcca atgtttgaca acctgttaat catgcttggt aattttcatg ttgaactaga 2520 tttctacggt gcagttggca cactgataaa tgaaacagga atagagttca tacttacaga 2580 ggcagacatc cttgctgaag gatcaatgat gggattcatc aaaggcaaat tttacaactg 2640 ctgcacaaga attcatgaat tgctggcaaa tgtcttggag cagaagctat atcaacgttt 2700 tcttctggac ttacccgaag aagaatatga gtcagtcgta ctagtgatgc acacattacc 2760 ttcaaatgca agccaggcag aagtgcatct tttagatcca gttgtattgc agcatcttga 2820 gaagtatgag gaattcttcc agatgattat tgatggtagc caaggaccaa cagctcaatt 2880 ctgggcaact tacatcttcc tcatcaatcg tctgcacaga gagttgcaga gatgtgtgaa 2940 gacaaatgat gtcaatagtt atatcaatgt gtttccaatg atacttggtg tattcttcag 3000 cctcaatcgt cctaactatg ccagatgggg aacgcttttc cttcagaagc tgaaatcatg 3060 tgaccctaag ttgattgaga ttttggaaaa gggtgcattt tcggtcagac gtactaccaa 3120 agattacagt agatcagctg ttgacctgtc tttagaacag tctgtgaacc gtgatgcagc 3180 atctcaaatg aaagggattg tcgcattcag aaactgccat gcgaaggtgg tcgttaagca 3240 tgacccagag agctatggct gtaactgagt taagaacatt agctgggctg gaactagggg 3300 aaactgcagc tgcccagtgt cgtccttcaa gaataaaaaa ggacaacagt caaatggcag 3360 cattgagtgc aaaaattgat gaattttgca atcctttcaa aagtgaaact ttaggttgct 3420 tggtcaatgt tgccactggt caagcagcat caaaggacac tgcatcatat ctgctgaata 3480 cactgaaaag gggtgaggat gcaagggaaa aattcttaca tgaatgggac agcaacaacc 3540 atcgctttct tcaaccagtc aaacgtacac gagtacaaaa ctttgctgct gaaaatgtca 3600 agaagaaaac caagtcaccg gccttacagg gagcaaagac aaacgctgaa agcctaaggg 3660 atatgtttat acgtatgatt attgttgttg cagaggagac aaactttgat ctaaaaaatg 3720 ttctctccta cccaattact acatacccat tatcgcttgc acattgtgat gggacccatg 3780 ttaaaactaa taaatcggta cttctgaaga aactggagtc cctacagact gaacctgtca 3840 cagagtcgga gcttccaagg aattatgttc atgtctatga tggtggcctt cttctccatt 3900 cgatcctcac gcaaacaaac ataggcgcat cctatggatc aattgctagg acaatattgt 3960 cagcagtctg ttcatatgat gcaaatgaag cacacgtttg tcttgacaaa tatgtagaga 4020 attccatcaa agacagtgag aggagactac gaggggccgt tgattctgtc tgtgtcataa 4080 cagggccaga acaaacaata cgccaaagtg gacaaaaact tcttaccaat ggagttttca 4140 agaacgaact tgcaaaattt cttctcaaag aatggggaaa agatcactat tggaacatcg 4200 tcaggggaaa gactctggtt gtttcatatg gcggagagtg cttccaatac gtcccaaaag 4260 aactccagca aatctccgtg acgaggccac cacaccttca aggtgaccac gaagaggccg 4320 acacactaat tgcatttcat gttgcaaata tcacggcgga caatatcatt gtgagagcat 4380 cagacactga tgtcttagtg atactgattg gtgcacttgg acagaaacgc ccagaagtaa 4440 gatgcatggt caatatcatc atggactgtg ggatgggaaa cagcagaaga tatatcaatg 4500 taaaaaacat tgctgatatt cttgagcacc gcaaatctgg aatgtcaaga gccctccctg 4560 gataccatgc cttcactggg tgtgacttca catcagcctt ttacaggtta ggctatacat 4620 gttattgtca aaatataata ctaagcacta ttttcacatt ctcgttccaa tttcacatat 4680 tttattattg attttattat tgataatgaa cctatttgct ttgcaaggac agttggatgt 4740 aaaaaacaca caatgtgttc aggtatagat agcagactag caagatacta agcattgata 4800 gtattgttgt tattcaatat acatgtgata ctcaattaga agtggacaaa tgtataaatt 4860 tgttttttgt tttataggaa gggcaagaca aaggccctag atatcataga gaaagacgag 4920 agtggtcgtt ttgttgaagc atttatcagc atgggagatg cacgtcacac ggttgatttt 4980 gatgtcatat cagaatttgt ctgctgcatg tatgcccaaa gtcagactcg cgacattgat 5040 gaagcacgtt acaacaaatt gatgcaaatg acaggcaaag ttcaaaaggt attaactgaa 5100 attgatataa ttacctgcca gctaaattta gacttattgt catagcagtc atgaaatgct 5160 tatttctaat gacatattta attgtctttc aggataatcc gctggcaaac gtcaaacgta 5220 tcgactgtgc attgcttcca cccactcgta aaacgctaga gatgaagatc agccggttgc 5280 aatatgttac agtgttgtgg acccttgcta caaaagcttg tccaggagat ggtttgtcaa 5340 tgacacactc ctgaaaccaa tctggttcca tgggcctgcg attcctggta gtctgttctc 5400 aaatcatcaa ggtggcaaca gagtggtaga caatgacact gatgaaacaa cgattatcac 5460 tggtagtggg tccgaaactg ttcaattgag tgattcggat gatgaaccgt ggactgagga 5520 ttcagattca gaccaagatg aagaatagaa gcactatact tattacttac aactttacat 5580 tttcccataa tgttaatgaa tagaaagaca ttaaatcaga acgctcacat taaaaaaaaa 5640 aagagtatcc aattagtata tcaaaaaggt ttagttaatc catgctagtg aatgtgataa 5700 cttttttgat atattcagta tattcataaa cataatttaa acttgttagt gtttggggtt 5760 ggggtcactt atggcgtctg gcaggggttt tccaccacag aaaatgtctg gcagtggtca 5820 ttattgatac acgaaacaat gacgaacatt ttacaaaagg tttcatccag ttagctcatg 5880 ggctaaccta ggtcatttta tgggtagggg tcacttgggg catctggcag gggctttcca 5940 ccacagaaaa tgtctggcag tggtcattct tgatacacga aaaacaatga cgaacatttt 6000 acaaaaggtt tcatccagtt agctcagggg ctaacctagg tcattttatg ggtaggggtc 6060 acttggggcg tctggcaggg gctttccacc acaggaaatg tctggcagtg gtcattctcg 6120 atacatgaaa caatgacgaa catttcacaa aaggtttcat gcagttagct catgggctaa 6180 cctaggtcat tttagagggt ggggtcacat ggggtcgtct ggcaggggct ttccaccaca 6240 gaaaatgtct ggcagtggtc attcttgatt cacatgttaa agacgaagat tttacaaaag 6300 gtttcatgca gttagctcat aagctgatct agctcaattt tcttgtatcg gatatcattg 6360 tctggcggac acagtccacc actggaaatg tctggcaaaa gtcatttata gtatatggct 6420 gtatgctgac cactttggta ctagtttcta ttagttattt gacaagctga actagcctat 6480 cttaatggta ctcttctttg acctctcctg gccacaccca ccaatgcatg cccggcagac 6540 gtccatttcc tttaaaggga atgtccatgg agaatagtga tacatataag aattagttag 6600 tttttcagct aggctgaccc ctctgggctc ctggtcta 6638 // ID TransibN1_DP repbase; DNA; INV; 1194 BP. XX AC . XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 13-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE TransibN1_DP is a nonautonomous DNA transposon - a consensus DE sequence. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW TransibN1_DP. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1194 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR [1] (Consensus) XX CC TransibN1_DP belongs to the TRANSIB family of DNA transposons. CC This element is characterized by 5-bp target site duplications. CC TransibN1_DP has 36-bp terminal inverted repeats. XX SQ Sequence 1194 BP; 400 A; 207 C; 202 G; 385 T; 0 other; cactatggtc cgaacgacca cttttgttgg ccaaaaccta atttttcaaa gtcaaaactg 60 aaacctggtt ttgatcgctc tgcattagaa aaagatcgac cattaaatct gcgtaccaga 120 aatttccata gatggcgcca cattcgcaaa atcagctgtg caacccaact gtttttatag 180 taaagcacgt gaagggaaaa agtggcaaat taataaccac tttagatctc tcaaaaattg 240 aaaaacgcca agacgtgatt aaattttttt gatgaatctt aatctgtcgg gttcttacta 300 cgtgttttga gtttttaact gtctctagta gttgtgatgt cttcgtaatt gaaggttaaa 360 gtgaggcctt taacatgtgc caacttgtaa tgaaaactaa tttttgaaaa ggtgtacaaa 420 gtgagtccta cctagtccca ccctgaaacc ttttctgtgt cgtagattaa taaattcttt 480 ttgaaatgta tatagttaga aatgccactt tttaccacaa ttcttggaat ttgcattaat 540 agagcataac ctcaaattcg aaaatgcaga tttgtttgcc gtatggaatt cagagaaggg 600 taaaattatc aattattgaa aacggatctt aaggcgacat atcatatgaa attaataaag 660 cttttgaaga taccagcaag aaaatttgtt accgtttggc aacataccgg aacaaatgcc 720 ggtaaaattc aaagatgttg gaacgaaaat atgtggattg gccacaagcc tttatttcaa 780 tacatttatc tttatgtcca aagaatgcta actaagttcg taaggctgta caggacccag 840 tcggaaagcc ggcacatatc tcacctaacg aggcgttaac cttgcttcta gacaaggatt 900 tcggcaataa tttacggaaa cctttaaacg attttatttt tatttaaatc aaaatttttt 960 ttagtcgtgc aacgccgttt tcttaaataa aatatgctta atactttttt tcactataac 1020 taaaaattta aatatgcatt attaaaaaaa ggggcggtgc cacacccatt ttttttctta 1080 aatccatttc ttattgatta tatatatcca atacaaaaaa ctaaatgaaa aaatcttgtt 1140 ccgttcggga gttattaatt ttggccaaca aaagtggtcg ttcggaccat agtg 1194 // ID DNA8-70_AP repbase; DNA; INV; 545 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-70_AP. XX NM DNA8-70_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-545 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2005-2005 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 545 BP; 271 A; 61 C; 41 G; 172 T; 0 other; cagggatgga aaacgttttt aaaaaacgtt attaaacgga aaaaaacaaa aaacgaaaac 60 aattaataaa acgaaaacaa aaaacaactg aaaaacgata acaaaaaatc ccaaaaaccg 120 aaaaaaattt aataacgaaa tcaaaaacga aaacgaaaat aaattatagt attatacatt 180 atatttatac cacaaaaaaa aatttcactt aagtctttac tttttaatct ttaccaatat 240 gcacttttaa aatttaaaaa cattttaatt tgaaatcaga aatttttaat ttttaatttt 300 tatataaatt ataattttga tttaaaattt aaatattaag aataattaat tttaattttt 360 aacttttaag tgcatttcta tattgcgcgt tactattata attaaaaata aaaacaatat 420 tatcaaaaac gttatttgga aacaatagga aaataacgtt tttaaaaacg ttaaacaaaa 480 acgatatttt ttaaatcaat aaacaaaaac gaaaacgaaa aaattaaaaa cgttttccat 540 ccctg 545 // ID Gypsy-10_SI-I repbase; DNA; INV; 4005 BP. XX AC AEAQ01022779; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_SI_; KW Gypsy-10_SI-LTR; Gypsy-10_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4005 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022779; Positions 508 4512. XX CC Positions [1358-1780] - Reverse transcriptase CC Positions [2825-3307] - Integrase core CC 'AAGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1664..3712 FT /product="Gypsy-10_SI-I_1p" FT /translation="MDDILIATESVKENLGVLKEIMIRLKTYSFELNLKKC FT QFLKKKIEFLGYIVSQDGITLSPRHTEAIRSFRVPRNVHELQRFLGLVSYF FT RKFTQDFSLKARPLYDLLRKNVAFKFDSSCIESFESLKDELTRAPTLALYN FT PEAETELHTDASRVGLGAILLQKQANRKWSAVAYYSQTTNKAEKQYHSYEL FT EMLAVVRAIERFHLYLYGIPFTIVTDCNAIVYAVNKASLNPRIARWTLTLQ FT NYDFKMSHRPGDKMQHVDALSRCVGYVGELPLERELEFHQLADPRIKEICD FT ALELEDSEKFEMVDGLLYRKLEDRSKFVIPESMVASVIRAHHDDAAHVGYE FT KTLRGISENFWFPAMRKRVRDHISNCLTCVMANDASNRFEGETSLYPAPTA FT IMQVVHVDHFGPLAETPGHYKHILVVVDAFSRFTWLTPVKSTTSRETIHRL FT EDIFSLSGNPAELVSDRGTAFTSREFIDFIDRSRIKHRKVAVAAPWANGIV FT ERVNRFIKGALTKILDEPDQWSVKLPVLQYLVNNTFHAGTETTPSKIVFGY FT DQRNHSDYPLARFTKDWASIESDLESEKSRAHEAAYKATERLRNYNKMYRD FT SKSRKPSLYCVGEYVLIRDDRTKPGENSKLKAKYKGPYVIEKVLGNNRYVV FT KDIPGFNITSRPLNTILSSDRIKRWVKVPSPAGE" XX SQ Sequence 4005 BP; 1133 A; 996 C; 1044 G; 832 T; 0 other; atttctcaga agtgggatag gcctcactat cccatgccgt gtcggaaata aaaactttac 60 gtcgtgtaaa gccggggaaa accggcgccg agtgtcgtgc cgaacggata gcgtggagct 120 tatccatatc gtgtcgagtc gagatccagt cggtaatcct tgtcgtccgc gagtttgagg 180 agcgggatat ctctcgtatc cgcggaagtg tcgagtataa aattcgtcga gtcgttcagg 240 cgcgccgcgc gggaacccgg taccagcggt accgaacgcg cgcgaattcc tgccgagccg 300 gcacgcgcgg accgcgtgat aacgggtgtg gcgcagtgca aaacaccgtt tcggaagcag 360 gccgagaggt gaaggtctta gaacgcgata atagcatcgg gaacagtgtc atcttgaagg 420 tgtctgccat gaatcgagac gaaatggaaa atgcgactat ggagacgctc cagagggaag 480 cgatcgcttg cggcattcca tttacaccag acagagtggc cttgatggag gcgatcctct 540 cctaccggga gaggcacagg ccacccgcga tcagcagcgg agtaactaag ggagcgggga 600 cctcggggag gaggcgcaac tgaagctgtc agcagcgtgc gtaaagttaa gtaatctttg 660 cgacgaaacg tgtaatatcg aggcactaat cgataccggg agcccagtgt caattataaa 720 atattccaaa tacaaaaagt acgtcgaacc gcatcagacg aatttgctgt ccccgaccaa 780 aaaattaaaa agtatcgaac gacaccctca gggtgctggg ggtggccccc gttaatctaa 840 agctgcagaa gctcggggac aggaaattcg ctgtaaagtt atacgttgtc gaggatcgcg 900 agttccgagg ggacatgctc ctaggccgtg actttatcgc cggcgaaaac atatcgctca 960 agtacgacct cgagaacgct gaaggagacg aggccgtcgc gacggtaaat ttgttcgatt 1020 cattggaatg cgaggccgtg tcagagacca acgaacaagc gatccgcaac gccgaaatag 1080 attacgatgc cgaggcaaaa caacaattga tccaaacaat cgtcgatgcc gaaaatacca 1140 ccgtcgagcc gatagtcgac gactactgcg taaaagtccc ccttaaagat aattcggtgt 1200 acgcgtatta tacgcgccgc gccgctttgc gcacatcgag cggctggaaa tcagaaaaat 1260 aatagacgat ttattggcgc gaggggtaat aaaacagagc aggtcgtctt attgcgcgag 1320 agtgattccg gtcaaaagga aaaacggaaa actcagactg tgtgtcgacc tgagaccgtt 1380 aaatagcaga gttgcgaaac agaaatatcc ctttcccctc atagaggatt gcttatcgcg 1440 gttgtgtggc aaaacagtat tcaccttact agacctgaaa gacggctttc accaaatcag 1500 agtccaccct gaccatacac agtatttcgc tttcgcaact ccggacgggc aattcgaata 1560 taactatctg ccgttcgggt actcggaagc cccggccgaa tttcaaagaa ggatcctaca 1620 gatcctagac ccgctcgttc gggggggtag gattgtcgtc tatatggacg atatccttat 1680 cgccaccgaa tcggtgaaag aaaatctggg ggtccttaag gagataatga tcagattgaa 1740 aacatactcg ttcgaattga atttaaagaa gtgtcagttt ttgaagaaaa aaatagaatt 1800 cctcgggtac atcgtctcgc aagacggaat cactctgagc ccaagacaca ccgaagcaat 1860 acgttcgttc agagtaccga gaaacgtgca cgaattgcaa cggtttctag gattagtcag 1920 ttattttcgg aagttcacgc aggacttctc cttaaaagca aggccactgt atgaccttct 1980 caggaagaac gtcgcgttca aattcgacag ttcgtgcatt gagtcgttcg agtccttgaa 2040 ggacgaactc accagagcgc ccacgttagc gctctataat cccgaggccg aaaccgagct 2100 ccataccgac gcgagcagag tcggccttgg cgctatcctc ctacagaagc aggcgaatag 2160 gaagtggtcg gcggtagcgt actacagcca aactacgaat aaagccgaaa agcagtacca 2220 cagctacgag cttgagatgc tcgcggtagt tagagcaatc gagcgttttc atttgtacct 2280 ctatggtata ccgttcacca tagtgactga ctgcaacgcg atcgtctatg ccgtaaataa 2340 agccagcctg aatccgagga tagcgcgatg gaccctgact ctgcaaaact atgactttaa 2400 aatgtcgcac cgaccgggag ataagatgca gcacgtagat gcgctgagtc gatgcgtcgg 2460 ctacgtcggc gaactgccgc tcgagcgcga acttgaattt catcagctcg cggatccccg 2520 gatcaaggaa atctgcgacg cattagagtt agaagatagt gaaaaattcg agatggtaga 2580 cggtttactc taccgaaaac tggaggacag gtcgaaattt gtcataccgg agtcgatggt 2640 cgcgtcggta ataagggcgc accatgacga tgcggctcac gtgggatacg aaaaaacgct 2700 tcgcggtatt tcggagaatt tctggtttcc cgcgatgcgc aagcgcgttc gagatcacat 2760 cagcaactgt ctgacgtgcg tgatggcgaa cgacgcgtcc aaccgcttcg aaggagaaac 2820 ctcgttgtac ccagcgccga ccgcgattat gcaagtcgtt catgtagacc acttcggtcc 2880 gctagccgaa acgccgggac attacaaaca tattttagtc gttgtcgacg cgttttcgcg 2940 attcacgtgg ttaacaccgg tgaaatcaac cacgtcaaga gagacaatac accgcctcga 3000 agatatattt tcgctctccg gaaatcccgc cgagttagtc tcggaccgag gcaccgcgtt 3060 cacttcgcgc gagtttatag atttcatcga tcgcagccgg atcaaacacc ggaaagtggc 3120 cgtagccgcc ccctgggcca acggaatcgt ggaacgagta aaccgattca taaaaggggc 3180 ccttactaaa atcttagatg aacccgacca atggagcgtc aaactgccgg ttctccagta 3240 cctcgttaac aacacctttc acgcgggaac cgaaaccacc ccatctaaaa ttgtattcgg 3300 ctacgatcag cgtaatcatt ccgattaccc cctggcgcgg ttcaccaagg actgggcgtc 3360 aatcgagtcc gatctcgaaa gcgaaaaatc cagagcacac gaagctgcct ataaggcaac 3420 ggagagactt agaaactaca acaagatgta ccgagacagt aagtcccgca aaccctcgtt 3480 atattgcgtt ggcgaatacg tcctaatccg cgacgacagg acgaaacccg gggagaatag 3540 taaactgaag gccaaataca aagggccgta cgtaatcgag aaggtactcg ggaataacag 3600 gtacgtcgtc aaagacatcc cggggttcaa tataacctcg cgacccctga acacgatcct 3660 ctcatctgac cggataaagc gttgggtcaa agtaccctca ccggcgggcg aatagcgccc 3720 gcggacacca aaggtgatca tagtaatata ggcctctttt gaaaaaaaaa aatgtatata 3780 tatatttgcg accatgcctg tataaagatg gtctcataat gtgtccgtcc gaaccgtcga 3840 catgtacaaa attagttgta agggctcgcg ctgtgtcaaa ggcacgctgc ctcgcatgct 3900 ataaatgtaa taagtactcc taccggagga cgaggtatac attagactag ctcgtacgcc 3960 ataaagaagg cgcttaggga ctaagcgcat gcaggacggc cgagc 4005 // ID Gypsy-6_DWil-LTR repbase; DNA; INV; 231 BP. XX AC scaffold_180699; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_DWil_; KW Gypsy-6_DWil-I; Gypsy-6_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-231 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180699; Positions 653287 653057. XX SQ Sequence 231 BP; 88 A; 42 C; 42 G; 59 T; 0 other; tgtaaacatt aggaatattt taaataaacc cgctcctcgt ctcagtgaat ttccccgata 60 accgaatttg ggcacactaa aagggtataa tacgctaagg cggggtacgg gcacactctc 120 ttgaatatag tcaataagac cgcatgtgca attaataaat tcagaaataa aatccggaaa 180 tataaagaca atcccaaggg attaattact ttaaaaagta gagagtttac a 231 // ID BEL-28_CQ-I repbase; DNA; INV; 1690 BP. XX AC AAWU01010753; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-28_CQ_; KW BEL-28_CQ-LTR; BEL-28_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1690 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 209-209 (2011). XX DR GenBank; AAWU01010753; Positions 28449 30138. XX CC 'GCAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 42..1688 FT /product="BEL-28_CQ-I_1p" FT /translation="MDKLEHLRKVCLVRVSGELVAAGKLAVRKASFSEATE FT RLERLRELAGKFRRMQQSIEETLEDQEEVAAAHGKSVEFFTAYQAAEELLE FT VHIKNTTPVRAVTRKSSSGSLLDEIRQFCEKVTAATVGQFGTIPSYPNGSA FT QVSSCAPHSGPTTTSDGQSGAVGYGNPDGKQRLDCASVTDPTTVSSREEGE FT PMSNWSPLPEKHHSSIRVSTFHPGQATCDGGKDGPETPMEGVEKASVEKAT FT TQETNKPEEVDGAAINMAVTQETRRVKWKALMPTEAISVRGIDVALPQASL FT DSKTCAELCPFHGIEEEQLPAIGMSTISPVRLSCNGGKGGPETSATGVEMA FT LAGGEASQNQVVRPAAEQVEEVIINVVEAQDPRGAQRKAHLLPTEPVSCRG FT IDLALPRASLDSRFCVWIGLPSIQSINTSSVRAVGPDVKIQKAKHNINDER FT FMEMEGDQALVDTDSTESTRRFEDVGGYVTRSRFRDATFSFKQLWIRGHIR FT HLLKVERIGNWTHHQTASAGEGKGLRLKIKHTTEKIVAAGGVGLRQMCLKA FT GE" XX SQ Sequence 1690 BP; 433 A; 426 C; 530 G; 301 T; 0 other; ttggtccttc gtcgccggat gggtttcgcg aaccggtcag catggacaag ctggaacacc 60 ttcgaaaggt gtgcttggtc cgcgtgagtg gggagctggt ggcagccggg aaactcgctg 120 tacggaaggc gtccttcagc gaggcaacgg aacggctcga acggctccgg gagcttgctg 180 ggaagttccg gcggatgcag cagagcatcg aggaaacgtt agaggaccag gaagaagtcg 240 ccgcagcgca cggaaaatct gtggagtttt ttacggctta tcaagcggcc gaggaactgc 300 tcgaagttca catcaaaaac acaaccccgg ttcgagcagt tacccggaaa tcgagcagcg 360 gcagtctgct agacgagatc cggcagtttt gcgagaaggt aaccgcagcg acagtcggcc 420 agttcggaac cattccaagt tatccgaatg gcagcgctca ggtttcgagt tgcgcacccc 480 actcgggtcc aacgacaaca agcgacggcc agtctggagc cgtgggatac ggcaatccag 540 atggcaaaca gcggttagat tgtgcgtccg ttacggaccc aacgactgtt tcgtcgcggg 600 aagaaggtga accgatgtcg aattggtccc cgcttccgga gaaacatcac tcgtccatca 660 gggtgtcgac tttccaccct ggtcaagcga cgtgtgatgg cggtaaagat ggtccggaga 720 cgcccatgga gggcgtagag aaggcatcgg tggaaaaggc aacaacgcaa gaaaccaaca 780 aaccggaaga agtggatgga gctgccatca acatggcggt gacccaggaa acgaggcgag 840 taaagtggaa ggctcttatg ccaactgaag ctatcagtgt tcggggaatc gacgtggctc 900 ttccccaagc atcactggac tccaagacct gtgccgagct ctgtccattt cacggaatcg 960 aggaggaaca acttccggcg atcgggatgt cgaccatcag ccccgttcgg ctgagttgta 1020 atggaggtaa gggtggtccg gagacgtccg caacgggcgt agagatggca ctcgcaggcg 1080 gcgaagcatc acaaaatcag gtagtgcgcc cagctgcaga acaagtggag gaggtaatca 1140 tcaacgtggt ggaggcgcag gatccacgag gagcacagcg gaaggcgcac ttactgccga 1200 ctgaaccggt cagttgtcgg ggaatcgact tagcacttcc tcgagcatcg ctggactcca 1260 gattctgtgt ctggattggt ctgccaagca tccaatctat caacacatcc agcgtccgcg 1320 cagttggacc ggatgtcaag atccagaaag caaagcacaa catcaacgac gagagattca 1380 tggaaatgga gggagatcaa gcactggtcg acactgattc gactgaatcg acgagaaggt 1440 ttgaggacgt tggaggttac gtcacacggt cacgcttcag ggacgccacg ttctcgttca 1500 agcaactgtg gattcgtggc cacatccggc acctactcaa ggtcgagcgg atcggaaact 1560 ggacacatca tcaaacggcg agcgctggcg aaggcaaagg attgcggctc aagatcaaac 1620 acaccacgga gaagatcgta gcagctggcg gagttggttt gaggcagatg tgcctcaagg 1680 cgggggagaa 1690 // ID BEL-175_AA-I repbase; DNA; INV; 6123 BP. XX AC supercont1.27; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-175_AA_; KW BEL-175_AA-LTR; BEL-175_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6123 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.27; Positions 1378547 1384669. XX CC Positions [5186-5632] - Integrase core CC 'GTACA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 4121..5632 FT /product="BEL-175_AA-I_3p" FT /translation="MALIGGKAKVAPLKALSIPRLELMAAVIGVRLLKTIR FT NGHSMKIGRTTMWSDSKTVLAWINSDHRSYRQFVACRVGEILSKSDVAQWR FT WVPTKENPADLATKWGKGPCLSPNSRWFNGPSFLRLPETEWPTGVKPMEET FT TVEELRECLVHTEATRMWVVDWQRFSNWNRLLRSVAYVHRFCANLQRKVKK FT EPQVTGALRQHEIAQAEMTIFRLVQSEQYPDEMAILSDALNENRTKCGKLE FT RTSKIRTLSPYMDDSGVIRAESRISAATFVSYDTRFPIILPKAHVVTRLLL FT EWYHRRFLHANNETVVNEVRQRFHISALRVAVREVTKMCTLCKVKKAVPAV FT PRMAPLPAAILKPYERPFSYVGLDYFGPILVRVNCSTVKRWIALFTCLTTR FT AVHLEVANSLSTESCKQAIRRFIGRRGAPVEIRSDRGTNFVGANNDLRKEM FT MEMDQQLAETFTNTYTRWVFNPPAAPHMGGAWERLVRSVKTAIAAIQTTEI FT PKVWRHLW" FT CDS join(41..2665,2669..4093) FT /product="BEL-175_AA-I_1p" FT /translation="MPMSTRNAKSAARGTSHCQACGKPDTDRMVSCCHCKS FT WWHFECVGVNDSIAETDRTFTCPKCQQPPLVNPATNETTNAGKPASKPSSE FT ASGRTSSSVRARRAQLQLEKLEAQKALALKRLELESKKLELEAEVLEESFR FT LREEIEREERSDTSSVTSQQSSRSKVEEWQKKQDEILSSTIVTTTATGTEA FT NLQQRIATTTAIGGRQGSRAQPAGNILSGALAGISLKDTASEVTLGGMIGR FT TSETIPQIGTSSKLGLSSVPPVFDTPSVCESNKRSLGLSYPLPTHDRPQNE FT SVRPYSNNTRTGRVELLPRSDRRFGDFDNVRNVDPDPREASEPLRQIPGRP FT LLSRCQPAEDPGVRFMANPDEQSGDFTRGTVMPSWGPSPQQIAARQVIAKE FT LPTFTGHPEDWPLFISSFTNTTQACGYSEAENLARLQRCLKGNALEAVRSR FT LLLPEAVPHVIATLETLYGRPELLIHTLLQKVRGAPAPKQDRLDTLIGYGM FT AVQNLCDHLEAGGHHAHLNNPMLLFELVDKLPANMKLDWSLYKQRCAEANI FT RAFSQYMTTLVRAATDVTLHYDPKHQSQQPPPQRGGKSGKDKDFCGAHSSE FT EALKTPVKEEQIVGGKVEPSDPACLVCKDPAHRVKNCSVFAKKTLEERWKL FT TGLLGLCRICLGLHGKRPCRIRQKCEIDGCQFRHHALLHSKQRPSDTKEKT FT DPKPTGSNVITNHHSAGRAVLFRIIPVKLHGNNGTVATYAFLDDGSAKTLV FT DEEIVKQLGVTGETLPLCLQWTANVKRVEADSQMVALDISGESGTSRFALK FT DVRTVRKLDLPRQSLRYATLTQEFLHLKGLPITGYEVALPRILIGSDNAHI FT TAMLKVREGLPGEPIAAKSRLGTVYGLQSSNCDRVHSFHICECQNDQTLHD FT LVKEFFSVESLGIDAVACPESAEEQRAKHILQSTTKRIGQRFETGLLWRYD FT RFEFPDSYPMAVRRLQCLERRIRKDPVIGESVVRQWSEYQAKGYIHKATQE FT ELTEADPMRTWYLPLGVAINPKKPSKIRIFCDAAAKVDGISMNTMLLKGPD FT LLNTLLDVLFGFREKRIALCADLKEMFHQIQIRPEDRHAQRLLWREDPSQT FT PDVYLMDVATFGATCSPCSAQFVKNKNAEEHAPEYPEAAEAIIRKHYVDDY FT LDSADSVDEAVKIASDVRHVHSLGGFHLRNWLSNSKEVLARVGESNAIAEK FT SLQLDKNSSTERVLGMFWKPEEDVFTFSTTLPVDRDHPTKREALRVVMSPF FT DPAGLLCFFLIHGKMLIQELWRSKTDWDQRIPMQLKEMWTRWTNMFERLDE FT VRVPRCYFPQHGQNEIVDLQLHIFVDASEEAYACVAYF" XX SQ Sequence 6123 BP; 1622 A; 1411 C; 1793 G; 1297 T; 0 other; atctataaga tttactaagg ttagtaggtc attgattaag atgccaatga gcacccggaa 60 cgcgaaaagt gccgctagag gaacgtccca ctgccaggca tgcggaaaac cggacacgga 120 caggatggtg agctgctgcc actgcaagtc atggtggcac ttcgaatgcg tgggcgtgaa 180 tgacagtatc gcggagacgg atcggacgtt cacctgccca aagtgccagc agcccccctt 240 agtcaaccca gcgacaaacg aaacgacaaa tgcgggtaag ccagctagta agccgagcag 300 cgaagcgtcg ggaagaacta gctccagcgt tcgagcaaga cgagcgcagt tgcagctgga 360 gaaacttgaa gcgcagaagg ctctcgcttt gaaacgcctt gagttggaga gtaagaagtt 420 ggaactggag gcggaggtgt tagaagagtc tttccggctg cgggaagaga tcgagcgaga 480 agagcggagc gacacaagca gtgtaacttc acagcaaagc tcacgcagca aggtggagga 540 atggcagaag aagcaggacg aaattttgag ttcgacgatt gtgacgacaa cagcgaccgg 600 cacggaagca aatctgcagc agagaatcgc gacgacaacc gctataggag gaaggcaggg 660 cagcagagcg cagccggctg gaaacattct aagtggagcg ttggcaggta tttccctcaa 720 agatacagct agtgaggtaa cgttaggagg catgattggt cggacgtcag agacgattcc 780 gcagataggt accagtagta agttaggtct gtcttcggta ccccctgtct ttgatacacc 840 cagcgtgtgc gaaagcaaca aacgttcact cggtttgtca tatccgcttc ctactcacga 900 ccgtccccaa aacgaatccg ttcgtccata ttcgaacaat acgcgaacag gccgtgtaga 960 attgctcccg cgaagtgatc gaagattcgg tgatttcgat aatgttcgaa atgtagatcc 1020 agatccacgt gaagcaagtg aaccgttgag gcaaatccct ggacgaccgc ttttgtcccg 1080 atgccagccc gcagaggatc caggagttcg gtttatggcg aatccagatg agcaatctgg 1140 tgatttcact cgtggtactg tcatgccttc ttggggaccg agtccgcagc agatcgcagc 1200 gcgacaagtt atagcgaagg agttaccgac gtttacggga cacccagaag actggccgct 1260 gttcatcagc tcgtttacca acacaactca agcgtgtggg tattcagaag cagagaattt 1320 agctaggcta cagcggtgct tgaaaggaaa cgcacttgag gcagtgcgaa gtcgactact 1380 ccttccggaa gcggttccac atgtcatagc cacactcgag acattgtacg ggagaccgga 1440 gctgctaata cacacgttgc tgcagaaggt ccgtggagct ccagcgccga agcaagaccg 1500 gctcgacacg ttaatagggt atggcatggc agtgcagaat ctctgcgatc acctcgaagc 1560 tgggggacat catgcgcatc taaacaatcc gatgcttctt ttcgagttgg tggacaagct 1620 gccggcaaac atgaagctag actggtcgct gtataaacaa cgatgtgcgg aggcgaacat 1680 tcgcgcattt tcgcagtata tgacgacctt ggtccgagcg gcaacggatg taacgctaca 1740 ctacgaccca aagcatcaat cacaacagcc cccgccgcag cgaggtggaa aatctggtaa 1800 ggacaaagat ttttgtggag cgcactcttc ggaggaagcg ctgaagacac cagtaaagga 1860 ggagcaaata gttggcggaa aagtcgaacc atcggatcct gcgtgtctcg tctgcaaaga 1920 ccctgctcac cgggtgaaga attgttccgt gttcgctaag aagactctag aggagcgatg 1980 gaaattaacg ggactgttag gattgtgccg gatttgcctg ggtctacacg gcaagcgtcc 2040 gtgcagaatt cggcagaagt gcgagattga cgggtgccag tttcgacacc atgctctgct 2100 gcattccaag cagaggccta gcgataccaa ggaaaagact gatccaaaac ccactggatc 2160 caatgtgatc accaaccatc attccgccgg tagggcggtt ctgttccgca taattcctgt 2220 gaagctgcac ggcaacaacg gaactgtggc aacctacgcg tttctggacg acgggtcggc 2280 gaaaacttta gttgacgaag agatcgtgaa gcagttgggc gtcacaggcg aaactctacc 2340 actttgccta cagtggacgg cgaacgtgaa gcgagtggaa gcagattcgc agatggtagc 2400 gttggacata tcgggagaga gtggaacttc aagatttgca ttgaaggatg tgcgtacagt 2460 gcgcaaactt gatctaccgc ggcaatcgct gcgatacgcc acgctgacac aggagttcct 2520 gcacttgaag ggacttccga ttacgggtta tgaagttgct cttccacgaa ttcttatcgg 2580 cagtgataac gcgcacatca ccgcgatgtt gaaggttcga gaaggactgc ccggagaacc 2640 tattgcggct aaatcacggc tcggatagac ggtgtacgga ctgcaaagct caaactgcga 2700 ccgtgtgcac agtttccata tctgtgaatg tcagaacgat cagacattgc acgatctcgt 2760 gaaggagttc ttctctgtgg agagcttggg aatcgatgct gtagcatgtc cggagtcagc 2820 ggaagagcag cgagcgaaac atattctgca gagcactacc aagcgcatcg gccagcggtt 2880 cgagaccgga cttctttggc gatacgatcg ttttgagttc ccggacagtt acccgatggc 2940 ggttcgacgt ctgcaatgct tggagcggcg cattcggaag gatccggtga ttggtgaaag 3000 tgttgtcagg caatggtccg aataccaagc aaaaggctac atccacaaag cgacgcagga 3060 ggagctcact gaagcagatc ctatgcggac gtggtatctt ccattaggag ttgccatcaa 3120 tcccaaaaag ccgtcgaaaa tacgaatctt ttgcgacgcg gcggctaaag tggacggcat 3180 ctccatgaac acaatgctgt tgaagggtcc ggatctcctg aatacattgc ttgacgtctt 3240 gttcggattt cgggagaagc gtatcgcact atgcgcagat ctgaaggaga tgttccatca 3300 gattcagatt cggccggagg accggcatgc ccaacgattg ttatggcgtg aagacccatc 3360 gcaaaccccg gacgtgtacc tgatggacgt cgccacgttc ggagccacct gttctccgtg 3420 ttcggcgcag ttcgttaaga acaaaaacgc cgaagagcat gcacctgagt atccggaagc 3480 agcggaagca atcattcgga aacactatgt ggacgattac ctcgacagcg cggacagtgt 3540 cgacgaagcg gtgaagatag catctgatgt ccgacatgtc cactctctcg gaggattcca 3600 cttgcgcaat tggctatcaa actcgaaaga ggtgctcgca cgagtcgggg agagtaatgc 3660 gattgctgag aagagtctac agttggacaa gaacagctcg acagaaagag tgctcgggat 3720 gttctggaag ccggaagaag acgttttcac cttttcaact actttgccag tcgataggga 3780 tcatcccaca aaacgggaag cgttacgagt cgtgatgagc ccgttcgatc ctgctgggtt 3840 gctatgtttt ttcctaatcc atggaaagat gttgatccag gagctatgga gatcaaaaac 3900 tgattgggac cagcggatac cgatgcagct gaaggaaatg tggacgcgct ggacgaatat 3960 gtttgagcgg ttagatgaag ttcgcgttcc tcggtgttac tttcctcagc acgggcagaa 4020 tgaaatcgtg gatctgcagt tgcacatctt tgttgatgcc agcgaagaag cgtacgcttg 4080 tgtggcgtat ttttgagcgg agttcgtaga cggtatagag atggcattga tcggaggaaa 4140 agcgaaggta gctccgctga aggcgctatc gatcccaaga ctggagctga tggcggcggt 4200 gatcggggtt cgcctactga aaaccatacg taacggacat tcgatgaaga taggaagaac 4260 gaccatgtgg agtgattcga agacggtatt ggcatggatc aactcggacc atcgtagtta 4320 ccgacaattc gttgcatgcc gtgtagggga aatcctttct aagtcggacg tagcacaatg 4380 gcgttgggta cctactaagg aaaatccagc agatctggcg acgaagtggg gaaaaggacc 4440 atgcctatct cccaatagtc gttggttcaa cggaccgagc tttctgcgtt tgccagaaac 4500 ggagtggccc accggtgtga aaccgatgga ggaaacaacg gttgaagagc tccgagaatg 4560 tctagtgcac acggaagcga ctaggatgtg ggtggtggat tggcagcgat tttcaaactg 4620 gaaccgccta ttgcgttcgg tagcctatgt ccatcgattt tgtgcgaatc tgcagcggaa 4680 agtgaagaag gagccgcaag tgaccggagc gttgaggcag cacgaaatag cacaagcaga 4740 gatgaccata tttcggctgg tgcaaagcga gcagtatccc gacgagatgg caatcctttc 4800 cgatgcgttg aacgagaacc gcacaaagtg cggcaaactg gagcggacaa gcaaaatccg 4860 aacgttgtcg ccgtacatgg acgatagcgg agtaattcgt gcggagtcca ggatttccgc 4920 tgctaccttt gtgtcatacg atacgagatt cccgattatt cttccgaagg ctcatgtagt 4980 cacccgattg ctgctggaat ggtatcatcg gagattcctg catgctaata atgaaacggt 5040 agttaacgaa gttagacagc ggttccatat atcagctctg cgcgtcgcag tacgagaggt 5100 aacgaagatg tgtacgttat gcaaagtgaa gaaggcagtt ccagcagttc ctcgaatggc 5160 gcctctccca gctgcgatat tgaaacccta cgaacgaccg ttttcgtacg tgggactcga 5220 ttatttcggg ccgatcttag ttcgggttaa ttgcagcact gtcaagaggt ggattgcact 5280 attcacgtgc ctcacaactc gtgccgtaca cctggaggtc gcaaactcgc tttcgactga 5340 atcgtgcaag caagcgatac gacggttcat tgggcgcaga ggcgctccag tggagatacg 5400 cagtgacagg ggtacgaatt tcgtaggggc gaacaacgac ttgcgcaagg agatgatgga 5460 aatggatcaa caactcgcag agacattcac caacacttat acccgatggg tgttcaaccc 5520 gccagctgcg ccgcatatgg gtggcgcttg ggagcgtttg gtgcgttcgg tcaagacagc 5580 gatagcggcg atacagacga ccgagattcc gaaggtttgg cgacatttgt ggtagaagca 5640 gagagcgtag ttaactcgag acctctaacg ttcattccat tagagacgga gcagcaagag 5700 gcccttacgc cgaaccattt tcttttgctg aattcaacgg gagtggtgca atctccgaag 5760 acgttaacgg agccgatggt gtgccgcgga agttgggagc tctgccgctc tatggtggac 5820 caattttgga gaagatgggt aacagagtat ctgccgatca tcgctcgccg aaccaagtgg 5880 tttgaagaag tgaagccgat tgaagttggc gatttggtca tcgtggtgga agagaaggtc 5940 agaaacggtt ggctacgtgg aagagtagtg aaggtgaaag aaggaagtga tggcagagta 6000 cgagcagccg tcgtgcagac agcaggcggg ttgatggaga ggccggtagc taagatagcg 6060 cgactggata tagtagacag taaagctgat tagatgtcca accagcctta cgggtcgggg 6120 aag 6123 // ID Copia-4_CQ-LTR repbase; DNA; INV; 141 BP. XX AC AAWU01030460; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_CQ_; KW Copia-4_CQ-I; Copia-4_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-141 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 324-324 (2011). XX DR Genome; AAWU01030460; Positions 4626 4766. XX SQ Sequence 141 BP; 49 A; 40 C; 19 G; 33 T; 0 other; tgttgcatgt gttgagtgta gaataaaagt gcaatttgcc taaccctaat tcaccgtaca 60 ccggaactgc cacaccttac actccaaaac cttgtctcac acaaacgtca gaataaacat 120 tccaatccga actaccaaac a 141 // ID L1-6_CQ repbase; DNA; INV; 5306 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5306 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 136-136 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS join(2185..2706,2592..5264) FT /product="L1-6_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MRAHEIDVFFLQEVENDTINIPGYTMLYNIDNRRRGV FT AIGLKSTVEHVTVEKSLDGRLIVVRLRNGTTLCNIYAPTGSQNRQSRENFF FT NCTCAHYLRNLSGPLILGGDFNSIVNDRDATGATPKSEMTSRLMTSLDLID FT VWREYIGIKQTTLSFEEVLVQDWIVFLCRSQQESIPDRCXEGVHRNQTNYT FT FVRGGSGSRLDRFLVSKSTREHLRTIQHAVTCFSDHKAVIVRMVMPVAGTN FT IGRGVWRLNAATMDDEDTMAELQRKWDYLVRQRRNFTSWLIWWIEFAKPKL FT AAFFRWRSSIQNREYRDSMELLHGELARRYELYIDDASQLTRINHIKSLML FT RMQRERSNKSRLLYDTYLQGEPTSTFHLAEKSQNRKSTYIHQLEIDGRIIQ FT GPEVVQHAVDYFENLFSNNIVDFHSPDFVPTCSIPPNNERNEAIFEPVSEE FT EIIKYIKQSCSRKSPGEDGLPKEFYTKAWDIIKQEFTLIMNEALQAPDTTS FT KFMNGIVVLVKKKGTGKDMKAYRPITLLNFDYKVLTRIIKQRITPLVETVL FT SRHQKCSNGKHNIFEATTKILDKISALKFRGQPSMLVSFDMDHAFDRVNHS FT FLFRVMEQMNFNPRLINFLQKITRNSFSRILVNGNLSRPFQILRSVRQGDP FT LSMLLFVLYMQPLVDKIVSEGLAGDALNVYADDLSIFVPDGRRLTQLVNVI FT KAYEPVSGAVLNLAKTVALKIGNVDTNGIPPWLNFEDSCKILGLIFTDTVK FT KSMELTWQNILKQLKWRLWTCKSRVLNLIQKVIHINTFVSSKLWYAASTLP FT LPKKFETQILKEFRNFLWIGGKQHIRLETLSLPKSRGGLNLHSPGLKSTAL FT ITNRMLKNESELDLFRDMRYNTPFFQIPAAYPQIKTTVLEISTLSPRAIQQ FT PSSFVIFQELMEGECDPAILETQREWRRIFKQLGDQKLDSTLRSNWYCAVH FT GKIQHNELLHRQNRRSSPNCDHCPRTVDTLEHRLFECRIVSPVWSYAKTKL FT NAISPNLRHKRNXYFLYPSFSRLSREHSIQTKLIVAKYLYFVCNNSIDNIS FT VENFKFEL" FT CDS join(122..1132,945..1646,1445..2107) FT /product="L1-6_CQ_1p" FT /translation="MNEMRGYRKNTVVIDFGPVPKKPTITQIRKFAFEQLK FT LDITQIRNLQLSMTKSFVYIEMSPEINAEELVAEFDMKLTMTSEDGVQTPI FT PLHTADGATEVKIHDLPPYMPTQIIAQHLAEFGEILSITDELWKEHFPGVP FT TGTRIVRMRIRKPIPSYVNIGEEVGYVRYRNQRPTCRHCSRYLHVGQKCSD FT VKKAITGSVNSRLTLADIVSGVNPSTEEKQNTETLPAPPPLMPRPSLPIET FT LNEEQLLADEEAPAIPEVVPEQESKEDSTFPELDFAGDENYEISSDEDAKM FT QTEEXESEADSAEDXTAGKKRPARPKPQKKSKRLSGSKSKSGSSKILLEMK FT TTRFQVMRMQKCRRKKXSRKPIQLKIQXQERNAPHXPSRRRSXNGCPDPSP FT NLAPPSNIVFNXIXITTLSTSNXSSSSQXRHSSAXIYSAQHIXPKGKKLRY FT MSCRSVHRYCHPTQXPXTARRSSEHXAPGSGNXRTSSLXVGPPLRXXYHHA FT ASPVXIGKKXRSRYMSCRSTRYCPPFKSPXTAHRLPGRSAPXTRIRGPTPS FT XXTLXXPPAXLXXEPAAXXXXARREAQVQVYVLPKHQILPXVQKPSNCXQI FT ARPLXTRXPHPRPDAEXXNARXSPXRAXXGTRSRXTXXPAMRTTYQQAASP FT XSTGKKHRYMSCRSTSGCCPLCKRISITXRSXXXPXCAXGLXGXXXXGXPP FT XQXXVIXXTRKXKLASXHGPXRXXVRXAXVVRKXXXHDSXARTAVLXNXLX FT EPXTXNGHXNXKGXXRXCDTSDICWAIAALRXHFPKGSKWI" XX SQ Sequence 5306 BP; 1556 A; 1332 C; 1154 G; 1157 T; 107 other; agttgacgtc aaaacctcgg ttggggcgca cgtctttaca ctwtcgcgct gcgagttgtt 60 ttttcgctwa tagttgcaac cactctccgc gctaggcaaa tattgcctac tggcccgcat 120 catgaacgag atgcgtggct accgtaaaaa cacggtggtg atcgattttg gaccggtccc 180 aaaaaaacca acaataacac aaatccgcaa gttcgccttc gagcagctca aactagacat 240 cacacaaatt cgcaacctcc agctcagcat gaccaagtca tttgtttaca tcgagatgag 300 tccagaaatc aacgcagagg aactggtggc agagttcgac atgaagctca ctatgacgtc 360 ggaagatgga gtacaaactc cgatcccgct gcacactgca gatggagcta ccgaagttaa 420 gatccacgat cttccsccct acatgcccac ccagattatc gcgcaacacc tagcagagtt 480 cggcgagatt ctgtccatca cagacgagct gtggaaggaa cacttcccgg gtgtaccgac 540 aggtacgcgg attgttcgaa tgaggatccg gaaacctatt ccatcctatg tcaacatcgg 600 ggaagaagtg gggtacgttc gttaccgtaa tcagcgaccc acctgcagac actgttcccg 660 ctatctccac gttggacaga agtgttctga cgttaagaag gctattacgg gcagcgttaa 720 cagccgcttg acactggccg acatcgtgtc gggtgtaaac ccgagcaccg aagaaaaaca 780 aaacaccgaa accctaccag caccacctcc attgatgcca cgaccctctc tgccgatcga 840 gacgctgaac gaagaacaac tccttgccga tgaggaggct ccggcgatcc ctgaagtcgt 900 tccggaacaa gaaagcaaag aagactcgac ttttccagaa ctagattttg ctggagatga 960 aaactacgag atttcaagtg atgaggatgc aaaaatgcag acggaagaag scgagtcgga 1020 agccgattca gctgaagatw caackgcagg aaagaaacgc cccgcacgmc ccaagccgca 1080 gaagaagtcs aaacggctgt ccggatccaa gtccaaatct ggctcctcca agtaacatcg 1140 ttttcaaccw watttkcatc accackttat ctacatcmaa ctmctcatcg tcatcgcaaw 1200 ctcgccactc ttcagcaamc atttattcmg cgcaacacat agwtcctaaa ggtaaaaagc 1260 tccggtatat gtcctgccga agcgtmcaca gatactgcca tcccactcaa asccccmaaa 1320 ctgcccgcag atcktccgag catwatgcac ctggttccgg caacmgccga acatcgtcgc 1380 tgcwcgtcgg cccgccgttg aggamagwat accatcacgc cgcttcacct gttckcatag 1440 gtaagaagcw caggtccagg tatatgtcct gccgaagcac cagatactgc cckccgttca 1500 aaagccctcm aactgcwcac agattgcccg gccgctcwgc acccgwmacc cgcatccgcg 1560 gcccgacgcc gagcmcasaa acgctcgmag wtccccckgc cgmgctgmac ktggaacccg 1620 cagccgmaam acammckgcc cggcgatgag gacaacatat caacaagcag cttcacctak 1680 ctccacaggt aagaagcaca gatatatgtc ctgccgaagc acctccggat gctgccctct 1740 ctgcaaaaga atctcaatta ccmacagatc gmagmacmca ccswgstgtg cggamggcct 1800 gcamggaamt tmcscgsckg gtkcwccgcc tcsccaatss ckggtgatam kcamaacacg 1860 gaagwccaaa ctcgcatcaw cgcacggccc wagmagaaaw cwggtgagag magcgmcagt 1920 ggtacgtaaa macaamaasc atgacagcak sgcaagaaca gctgttctgw caaackactt 1980 gwcagaacca gwgacckgga acgggcacma aaacgwaaag ggtcwsgmaa gamcgtgcga 2040 caccagcgac atctgttggg caatagcagc actacgcsgg cattttccta aagggtctaa 2100 gtggatttga aaagttataa tttcgtatct atcaacatma acaacatcac gagccaaact 2160 aagctggatg cgctggccag tttcatgcgc gctcacgaaa tagacgtttt cttccttcaa 2220 gaagttgaaa acgatactat aaatatcccc ggctacacca tgctgtacaa tatagacaat 2280 cgaagaaggg gcgttgcaat tggwttgaaa tctacagttg agcatgtaac ggttgagaaa 2340 tcactggacg gacgattaat cgtcgtacgc ttgagaaatg ggacaacgct gtgtaacatc 2400 tacgcgccta ctggttcgca aaaccgccag tcacgtgaaa acttcttcaa ctgcacctgt 2460 gckcactact tgcgcaatct ttctggtccg ctaattttgg gaggagactt taactcgatc 2520 gtgaacgacc gtgatgctac aggtgccacc cctaagagtg aaatgacttc acgattgatg 2580 acttcgcttg acctgatcga tgtktggagg gagtacatcg gaatcaaaca aactacactt 2640 tcgttcgagg aggttctggt tcaagactgg atcgttttct tgtgtcgaag tcaacaagag 2700 agcatctaag gacaatccaa catgctgtta cttgctttag tgatcacaaa gctgtgatag 2760 tgcgcatggt gatgccagtt gcgggtacaa atattggtcg aggagtttgg agattgaacg 2820 cagcaactat ggacgatgag gacacaatgg ctgagctgca acggaaatgg gactatctag 2880 taaggcagcg acgaaacttc acatcctggc taatctggtg gatagaattt gctaaaccaa 2940 agctggctgc cttcttcaga tggcggtcat cgatccaaaa tagggagtat cgagactcaa 3000 tggagctctt gcatggggaa ttagctcgac gctatgaact gtacatcgac gacgcttcgc 3060 agcttactcg catcaatcac atcaaaagcc tgatgctgag aatgcagagg gaaagatcca 3120 acaaatcacg cctgctatat gatacctacc tacaaggaga accgacaagc acgtttcacc 3180 tggccgaaaa aagccaaaat cgaaagtcaa catacattca ccagctagag attgacggaa 3240 gaatcatcca aggaccggaa gtcgtacaac atgcagtgga ctactttgaa aacctgttct 3300 caaacaacat tgtggacttt cattcacctg actttgtgcc aacctgcagc atccctccaa 3360 acaatgaacg caacgaagca atctttgaac ctgtgagcga agaagaaatt atcaagtaca 3420 tcaagcaaag ctgttcgagg aagtcacctg gcgaagacgg tctacccaaa gaattttaca 3480 cgaaagcgtg ggacatcatc aagcaggaat tcacgctgat aatgaatgaa gctcttcaag 3540 caccagacac tacttccaaa ttcatgaatg gaatagttgt gctggtgaaa aagaagggta 3600 ccggaaagga catgaaggcg tacaggccga ttacgctgtt aaactttgat tacaaagtct 3660 taactagaat aatcaaacaa cgaatcacgc ctcttgtcga aacagttctt tcacgtcacc 3720 agaagtgctc caacgggaag cataacatct tcgaagcaac gacaaagatc ctggacaaaa 3780 tctctgcact gaaattccgt ggccagcctt ccatgcttgt ctcttttgac atggaccatg 3840 cgttcgatcg ggtcaaccac agcttcctct tcagggtgat ggagcagatg aactttaatc 3900 ctcgcttgat caacttctta caaaaaatta ctaggaactc gttctctaga attctggtga 3960 acggaaatct atcacgtccg tttcaaatac tacggtccgt aaggcaaggc gaccctctca 4020 gcatgctgct cttcgtccta tacatgcaac cattggtcga caaaattgtt tcagagggac 4080 tcgctggcga cgcgctgaac gtttacgcag acgatctaag catatttgta ccagacggaa 4140 gacgactcac gcagttagtc aacgtgatca aagcttatga accagtttct ggtgcagttc 4200 tcaaccttgc gaagacagtg gcgctgaaaa ttggaaatgt agacacaaat ggtattccgc 4260 cttggctgaa tttcgaggac tcctgcaaga tcttggggct tatcttcacg gataccgtga 4320 agaaatccat ggagcttacc tggcaaaaca ttttgaagca gttgaaatgg cgactgtgga 4380 cgtgcaagtc tcgagttcta aatctgatcc agaaagtcat ccacatcaac acctttgtgt 4440 cttccaaact gtggtacgca gcttcaacac ttccgcttcc caaaaagttt gagacgcaga 4500 ttctgaaaga atttcggaat ttcctgtgga ttggtgggaa acaacacatc cggctggaga 4560 ctttgtctct tcccaaatct cgtggtggtc tgaatcttca ctcacctggc ctcaaatcaa 4620 ctgcgctgat cacaaacagg atgctgaaaa atgaaagtga actagacctc ttcagggaca 4680 tgcgatacaa cacacctttc tttcagatac cagccgctta cccacaaatc aaaacaactg 4740 ttctggaaat atcaacactg tcaccacgtg caattcagca gcccagctcg ttcgtcatct 4800 tccaagagct catggaaggt gaatgtgatc ctgccatcct agaaacccag cgagaatgga 4860 gacgtatctt caaacaacta ggagatcaaa aactggactc tacacttcga tctaattggt 4920 attgcgcagt acatgggaaa atccaacaca acgagctgct gcatcggcaa aacagacgat 4980 ctagtccgaa ctgtgaccat tgtccgcgga cagtggacac actagagcat aggctattcg 5040 agtgtcgcat agtttcccca gtatggagtt atgcgaaaac aaaactgaat gctattagtc 5100 ctaaccttag acataaacgg aatgmgtatt ttctgtatcc atcatttagt mgattgtcac 5160 gagagcattc aattcaaaca aaactcattg ttgccaaata tttatatttt gtttgtaata 5220 attctataga taacattagt gtagaaaact tcaaatttga actttgaamt tttaagwgaa 5280 taaacaagga ctaccawawa aaaaaa 5306 // ID Copia-99_AA-I repbase; DNA; INV; 4159 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-99_AA_; KW Copia-99_AA-LTR; Ty1_copia_Ele2; Copia-99_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4159 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1566-2063] - Integrase core CC 'GAGAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 144..2876 FT /product="Copia-99_AA-I_2p" FT /translation="MADPGKFAISKLGNTNYASWKFQMEMFLIREDLWHVV FT ADAKPEPVTDAWNKADRKARATLGLCIDESQYVLIKDSVSAKAVWDALKAY FT HEKSTASSQLSLLNRLCDAKLSENGDVEKHLLELDSLFERLVNAGLELAEK FT LKIAMVLRSMPESYHFLASALEARPDADVTMQLVRSKLMDEYHKRQERSGK FT SSSGEQVLKTQKVNKEKLCFFCKKPGHFKNDCRKWQSVKEKNSGEGLSGVK FT PGGSGQQPAKAKQVKNPEINSVCFSAHTEGEAICAAAKVDRLWTIDSGASC FT HMTSSEGFFNELEKSSVTVVMANGNCSESGGIGCGSLKSVNGTGEPVDINL FT DKVLFVPNLDGGLLSVSKMADKGYSVLFTKNSVEVRNASDTVVALGERRGG FT LYFLKEPERVAVANIQCHTVNCQHTWHRRFGHRDPAVFEKIQKEDLATGMK FT LVNCGTKIVCEPCMEGKMARLPYPQQAERKSTQPLQLIHTDLCGPMSNVTP FT GGKRYFLTLIDDFSRYVIMYLLTDKSEAKHCIMSFVRMVENKFGRKPQIIR FT SDRGGEFVNRELEQFYREEGIQMQLTAGYAPQQNGVAERRNRYLQEMAVCM FT LLDANLDKKYWGEAIATAAYIQNRLPSRSVAKTPMELWCGEKPDLSRLKVF FT GCEAFVYIPDAKRVKLDSRASKLLFVGYACGSKAYRFLDKQTNKIVISRDA FT RFLELGSQLQEELPAGAKDQKPADAEVVPQRNTDRLVSIESVSAENESIDE FT ESECEGFVEEDSDDDVFYGADAEDPDGDQAEPGRPQRGNAGVLPARYQDFV FT VGVAKMDDPEPKNYREAVNSACSDKWIKAMNDEYKSLLSKGTWSLVELPKG FT RQAIGSKWVFKKKKDSVDRWCATKRGSLPRDLDNGTAVTLVKCLRRFQCRR FT RSGCSWR" FT CDS 2723..4141 FT /product="Copia-99_AA-I_1p" FT /translation="MGLQEEKGLSGQVVRHKARLVAQGFGQRYGSDFSEVF FT APVSMPSTFRVLLALAGHKKLQVRHIDVKNAYLNGRLSEELYMRQPPGYAV FT PGKEELVCRLHRSLYGLKQAAHVWNATMKKVLLSLGFKQSDSDACLFAKQL FT DNGEWIYLLIYVDDIVVVSGDERQMDLLENQLSKHFEISSLGPISQFLGIK FT VEKSSDGFYSLSQKAFINEIAERHGLDKAKTSKYPLDTGYMKQADSEPLAD FT NVQYHSLVGALLYLATNTRPDIAAAVSILSRSSSRPTQRDWVELKRVVRYL FT IGTGDLVLRLGLKRGDGLTLTGYSDADWAGDTSDRKSTSGFVFMLGGAAIS FT WGSRKQNCVSMSTMEAEYVALSEAAQEAVWLRRLLCELGAEQRQPTKINED FT NRSCIDFVSLDRQNKRSKHIDTRQHHAKDLCSKGVIQLAYCPTDRMMADIF FT TKPLGPQKINQYVADLGLAKVNRDRFVQ" XX SQ Sequence 4159 BP; 1122 A; 825 C; 1242 G; 970 T; 0 other; ggttatgtgc ccagagagga atcgtggaat tcggtcggtt taatagtgaa attcgcggaa 60 gttcgtccga ttagttttgg cgttgtcgcg tgcatcgaat tcggctggtt gttgcctggt 120 ggtgatttgc cggtgggcac aaaatggcgg atccgggaaa gtttgcgatc agcaaacttg 180 gaaacaccaa ctacgctagt tggaagtttc aaatggagat gtttttgatc cgggaggatc 240 tttggcatgt cgtggcggat gccaaaccgg agccagtaac cgacgcctgg aacaaagctg 300 accggaaggc ccgtgctaca ctaggactgt gcattgacga aagtcaatat gtgttaatca 360 aggatagtgt tagtgcaaag gcggtatggg atgctttaaa agcataccat gagaagtcga 420 cagcatcgtc gcagttgtca cttctaaacc ggctatgtga tgcgaaattg agtgaaaatg 480 gagacgttga gaagcacttg ttggaattgg attcgctttt cgagcgctta gtgaatgcgg 540 gattagagtt ggccgagaaa ctgaaaatcg ctatggtgct aaggagcatg ccagaatcgt 600 accatttctt ggcatctgct ctagaagcac gccccgatgc ggatgttacg atgcagttgg 660 tgcggtccaa gctcatggac gaatatcaca agcggcagga gcggagcgga aaatcttcat 720 ccggtgaaca agtgctaaaa acgcagaaag taaacaaaga aaagttgtgt tttttctgca 780 agaaaccggg acattttaag aacgattgtc ggaagtggca aagtgtaaaa gagaagaaca 840 gcggcgaagg actatccggt gtgaaacctg gtggcagtgg ccaacaaccg gcgaaggcta 900 agcaagtgaa aaatccggaa atcaacagtg tttgtttttc tgcccatact gagggtgaag 960 ccatctgtgc agcggctaaa gtggatcgtt tgtggacgat cgatagtggg gcttcgtgcc 1020 acatgaccag cagtgaaggg tttttcaatg agctggagaa gtcgagtgtt acggttgtga 1080 tggcgaatgg aaattgctca gagtctggtg gtattggctg cggttcgttg aaaagtgtta 1140 acggtaccgg cgaaccggtg gatatcaatc tcgataaagt gttgttcgtg ccaaacctcg 1200 acggtggatt gttatccgtg agtaaaatgg cagataaggg ttacagtgtg ctctttacga 1260 agaacagtgt tgaagtgcgt aatgcgtcgg acacggtcgt tgccttgggt gagaggcgcg 1320 gcggactgta ttttttgaaa gagccggaac gtgttgcggt tgcaaatatc cagtgccata 1380 cagtgaactg ccagcataca tggcaccgaa gattcggtca ccgtgatccg gcggtgtttg 1440 agaagattca gaaagaagat ctggctactg gtatgaagct agtgaactgt ggtaccaaga 1500 ttgtgtgtga gccgtgtatg gagggcaaga tggctcggtt gccatacccg cagcaagctg 1560 agcggaaatc aacacagccg ttgcagttga tccacacgga tctgtgcggt ccaatgagca 1620 atgtgacacc cggcggtaag aggtactttc taaccctcat cgatgacttc agcaggtatg 1680 tcataatgta tttgcttact gataaatctg aagcgaaaca ctgcatcatg agtttcgttc 1740 ggatggttga gaacaagttt gggaggaagc cgcagatcat acgatctgat cgtggcggcg 1800 agttcgtcaa ccgtgagctt gaacagtttt atcgggaaga gggcattcag atgcagctga 1860 cggcaggata cgcgccacaa cagaacggcg tagccgagag gcgcaatcgc tacctgcagg 1920 agatggcggt atgcatgctt cttgatgcga acttggataa gaagtattgg ggagaagcaa 1980 tagctactgc agcttacatt cagaatcgtt taccctcacg gtcagtagcg aagacgccga 2040 tggaactatg gtgtggagag aaacctgatt taagccgcct gaaagtgttt gggtgtgagg 2100 cattcgtcta cattccggac gcaaaacgag tgaagctaga cagccgagcg agtaagctct 2160 tgttcgttgg atatgcttgt ggctccaaag cgtatcgttt tttggacaag cagacgaata 2220 agatcgttat tagtcgggat gccaggttcc ttgaactcgg atcacaactg caggaggagt 2280 taccagctgg agcgaaggat caaaagcctg cggatgcgga agttgttccg cagaggaaca 2340 cggatcggct cgtgtctatc gaatcggttt ccgcagagaa tgaaagcatc gacgaagaat 2400 cggaatgtga aggttttgtg gaggaagatt ccgacgatga tgttttctat ggtgctgatg 2460 ccgaagaccc cgatggagat caagcggagc caggtagacc tcagcgcgga aatgcaggag 2520 tgctgccggc aagatatcag gattttgtag tcggtgttgc gaagatggac gatccggaac 2580 cgaagaacta ccgtgaagct gtgaatagtg cgtgtagtga taagtggata aaagcgatga 2640 acgatgaata taaatcgctg ttgtcgaagg gtacgtggtc gttagttgaa ctaccgaaag 2700 ggcgacaagc aatagggtca aaatgggtct tcaagaagaa aaaggactca gtggacaggt 2760 ggtgcgccac aaagcgaggc tcgttgccca gggatttgga caacggtacg gcagtgactt 2820 tagtgaagtg tttgcgccgg tttcaatgcc gtcgacgttc agggtgctcc tggcgctagc 2880 agggcacaag aagcttcaag ttcgtcacat cgatgtgaaa aatgcgtatc taaatggaag 2940 actgagcgaa gaactctata tgcggcagcc cccgggatac gcggtacccg gtaaggagga 3000 gctggtgtgc agactgcacc gaagtttgta tggactcaaa caggcagccc atgtctggaa 3060 tgctaccatg aagaaggtat tgttgagctt aggatttaag caatcagatt cggacgcttg 3120 cctgtttgcg aaacagttgg acaatggtga gtggatttat ctcttaatat atgtggatga 3180 cattgtcgtg gttagtggtg atgagcgtca gatggatctc ttggagaatc aattaagtaa 3240 gcattttgaa atctcgtcgt tggggccgat aagtcagttc ctaggcatca aggttgagaa 3300 gagcagcgat ggattctatt cgctgagcca gaaagcgttt atcaacgaaa tcgcggaacg 3360 ccatggactg gacaaggcca agacatcgaa ataccctcta gacacagggt acatgaagca 3420 ggctgacagt gagccactgg ctgataacgt gcaataccac agtcttgttg gtgcgttatt 3480 gtacttagcc actaacacca gaccagacat tgctgcagcc gtgtcgattc tcagtcgtag 3540 tagcagccgt cctacgcagc gtgattgggt cgaactcaaa cgcgtcgttc gatacctgat 3600 cggaacggga gatttggttt tgcgacttgg gctcaagcgt ggagatggat tgacactgac 3660 tggatacagc gacgcagact gggctggaga tacctcggat cgtaagtcca caagcggatt 3720 tgtgttcatg ctaggtggtg cggcgatcag ctggggcagt agaaaacaga actgcgtatc 3780 aatgtccacc atggaagctg agtatgtcgc tctgtcagaa gcggcacagg aagcagtgtg 3840 gttgcgacgc ttattgtgtg agttaggagc cgagcagcga caaccgacga agatcaacga 3900 agacaaccgt agttgtatcg attttgtttc cctcgatcgg caaaacaaac gcagcaagca 3960 catcgatact agacaacatc atgcgaagga tttgtgctcg aaaggagtga ttcagctggc 4020 ctactgtcca accgatagga tgatggccga catattcacc aagccattgg gaccacagaa 4080 gattaaccaa tacgtagccg accttgggct ggctaaggta aacagggata gattcgttca 4140 gtagctcagc gaggaggag 4159 // ID R1-1_DGr repbase; DNA; INV; 5561 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE grimshawi. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-1_DGr. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-5561 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. D. grimshawi contains two subfamilies of R1. XX FH Key Location/Qualifiers FT CDS 187..1644 FT /product="R1-1_DGr_1p" FT /translation="IPYRYLRVCALKKSIYRLSRVVMPPRKKRSAELEAGG FT AQMVSSDDDTSQSSDAASLAEVRRRGVLSKQQLRKAASETAVDSAGKNVER FT RRAEGNLVVTLRVPREKGEVCNSGTPSVSAGVPAVEEPIAACSASNALPLR FT QRGTPIEKMKAISKELVECALAECMTGSLVGSMLGYATQYEELLFSLIAEN FT ERLKGRLEAVRFNATNEHAGHVTAGQTRVPPAAVHRGGPMSTMSSAPEMPP FT PVETWSLVVRSKTAGKTAKEVVDKVVKEVGPTLGVRVHEVKPLRDGGAIIR FT TPSVAERRKIAGNAKFSEVGLEVSVKERLGPKVVVQGVHAVISPDEFMAEL FT YELNLKDKMSKEAFTKGVRIASKPWSQEGSAAVNVVLEGAGLAMQTLLDVG FT RCYVKWFSFRVRNFDPVVGCYRCLGFDHKVAECRLKQDVCRRCGQMGHRVA FT QCSNALNCRNCAFKGRPSEHLMMSPACPVYSAIVARASARH" FT CDS 1589..4714 FT /product="R1-1_DGr_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="CLQLALCTVQLWRGRVLDINMSSRLLQLNCQKSYAVM FT CDVGGMIVRRGSVVALLQEPYVTNGCVRGLPAGMRVFPDSRANSAVVVNDV FT SIECTVVNSTDWGVCVSLSGNFGRLFVASVYCKFGDPLEPYIAYMDEVLLL FT ASSVPFILGIDANAPSPLWFSKISRSARYLNRSRGEVLAEWAVSQDVRVVN FT EPSEWYTFAGPMGQSDIDVTLANVAATSVFGFQWSVLGGHGVSDHNPIEIV FT ITHTSTTRESDGGNRWRTCGANWPLHGIFVSEAATQVPLSTFSAMNVDEQV FT VCVNRWVTCANDRLFERHRKVNLKRVKWWSHELSVKRRSVRSLRKRFQRAR FT SANAENAGQLRLAYSQCMNEYKQMLVRVKEDEWRSFLERNKDDPWGRAYKV FT VRGRRREADVSGLRVGDVQLTAWSDCMNVLLNEFFPRADHQNLPPSVVGDV FT DPLLDSELEVAFSMLKSRKSPGMDGFTGEMCKSVWKSIPDYMNVMYGKCMN FT EGYFPNEWKCARVIVLLKSPDRVRSNPRSFRGISLLPVLGKVLERVMVERL FT QERVSSQMSDRQFGFRKGRCVEDAWRFVSDSVESSNSRHILGIFVDFKGAF FT DHLSWPSVLERLSECGCRELAIWESYFSGRRACAVGRHESVSLNVVRGCPQ FT GSICGPFIWNLMMDTLLWQLERVCKCCAYADDLLILVEGQSRADVEASAAT FT YLRVVYEWGLRVGVSLAMDKTVTMLLKGRLSRSRPPLVRLNGVSLRHVSEV FT KYLGIVFGERMCFTPHIAYVKGRLLSLVGQVRRILRSDWGLSRSAARTVYD FT GLFVACATYGSSVWCKAVLTVVGRKNVLACQRVMLLGCLPVCRTVSTEAMQ FT VLLGVAPLDLEIRRRSLSYRIKRRLPLLQNEWLADRDVESLGLSECKKLLN FT ECVLSDWQVRWDTSENGRVTHRFIREVTFAVDRPDFRLHLSFGFLLTGHGS FT LNAFLHSRRLCDSPECPCGWVGETWEHVLCECPLYADLRDLSVLGITRGIS FT GYDVSQVLSTCEGVRRMSEFARAAFARRRLIRGEVG" XX SQ Sequence 5561 BP; 1222 A; 1118 C; 1771 G; 1450 T; 0 other; cagtacgagt tcagacttgg gaacgaacgg acgtgtcttt gctgtcgcgg ttaaacagag 60 ataacggtac aacaagtaac gggatcttgt ttactttcca gcgcgtttac agttgcgctg 120 ttctacatct gctttcgcat ttgcgttcgc tgtgcgaaaa ctgtacattt tttcgtgtgc 180 gtgtaaatcc cttatcggta cttacgtgtg tgtgcgttga aaaaatctat ttatagattg 240 tcacgtgttg tgatgccgcc acgtaagaag cgttcggcgg agcttgaagc ggggggggcg 300 cagatggtgt cgtcagatga cgacacgtcg cagtcctctg atgcagcgag cttagctgag 360 gtaaggagga ggggtgtgct gagtaagcaa cagttgagga aggcagcttc cgagacggct 420 gtagactcag caggtaagaa tgtcgaaagg cggcgcgcgg agggtaatct ggtggttacc 480 ctcagagtgc cgagagagaa aggcgaggtg tgcaattcgg gcacgcctag tgtgagtgct 540 ggtgttcccg ctgtggagga gcccattgct gcgtgttcgg cgagcaatgc gttaccgttg 600 aggcagaggg gaacgccaat agaaaaaatg aaagcgataa gcaaagaact ggtagagtgt 660 gcactcgctg agtgtatgac gggttcactg gtcggtagca tgttaggcta cgcgacgcag 720 tacgaggaat tgctgttttc gcttatagcg gaaaatgagc gcctcaaggg gcgtttggag 780 gctgttcgct ttaatgcgac caatgagcat gctgggcatg tgacagctgg ccaaacgcga 840 gttccgcccg cagccgtgca ccgggggggt cccatgtcga caatgtcgtc tgccccggag 900 atgcctccac cagtcgagac ctggtccttg gttgtgcgca gcaaaactgc tggtaagacc 960 gccaaggagg tggtggataa ggtagtgaag gaggtagggc ctaccttagg tgtgcgcgtc 1020 catgaggtga aacctctgcg ggatggcgga gcgatcattc gcacaccatc tgttgccgag 1080 cggaggaaga ttgcggggaa cgcaaaattc agcgaagttg gtctggaagt gagtgtgaaa 1140 gagagactgg ggccgaaagt ggtcgtgcag ggcgttcacg ctgtaatctc gcctgacgag 1200 ttcatggctg agctatacga gctgaacctc aaggacaaga tgtccaaaga ggccttcact 1260 aaaggtgtcc gcattgcgag taagccttgg tcgcaagagg gtagcgcagc agtgaatgtt 1320 gttctcgagg gtgctggcct ggccatgcaa accctcttgg atgtcggacg ctgctacgtg 1380 aaatggttct catttcgtgt gaggaacttt gacccggtag tgggttgcta tcgctgcctt 1440 ggcttcgacc ataaggtagc ggaatgccgg ctcaagcagg acgtctgtcg tcgctgcggc 1500 caaatgggtc accgcgtggc tcagtgcagc aatgcactga actgtcgcaa ctgcgcattc 1560 aagggtaggc cgtctgagca tctcatgatg tctccagctt gccctgtgta cagtgcaatt 1620 gtggcgcggg cgagtgctag acattaatat gtctagcaga cttcttcagt tgaactgtca 1680 gaagtcatat gcagtcatgt gcgatgtggg tggcatgatt gtgagaagag gcagcgtagt 1740 tgccctgctg caggaaccct acgtgacaaa tggttgcgta aggggattgc ccgcgggaat 1800 gcgagtattt cctgacagca gggccaactc tgctgtcgtt gtgaatgatg tcagtatcga 1860 atgcactgtt gtgaattcga ctgactgggg tgtgtgtgtg agccttagtg gcaattttgg 1920 tagattgttc gtagcaagcg tatactgtaa gttcggggat cccctcgaac cgtatatcgc 1980 gtatatggat gaggtgctac tactggctag tagcgttccg ttcatccttg gtattgacgc 2040 gaatgcaccg tcccctttgt ggttcagtaa gatatctaga tctgctaggt atctgaaccg 2100 ctctaggggt gaggtgctgg ccgagtgggc tgtgtcccag gatgtccggg tcgttaacga 2160 acccagcgag tggtacacgt ttgcgggccc gatgggccag agtgacattg atgtcactct 2220 tgcgaatgtg gcagcaacga gtgtgtttgg ttttcaatgg agtgtactgg gtggacacgg 2280 tgtgagtgac cacaatccga ttgagattgt catcactcac acttccacca cgcgtgaaag 2340 tgatgggggt aaccgctggc gcacttgtgg tgcgaattgg ccccttcatg ggatttttgt 2400 gagtgaagcg gcaacgcaag ttccgcttag cacttttagt gcaatgaatg ttgacgagca 2460 ggtcgtatgt gtgaataggt gggtgacttg tgcgaatgat cgcttgtttg agaggcaccg 2520 aaaggtcaac ctcaaacgag tgaagtggtg gtcgcatgag ttgagcgtta agcgtcggtc 2580 agtgcggtcc ctaaggaagc gattccaaag ggccagatct gccaatgccg agaacgctgg 2640 ccaacttagg ttagcttaca gccagtgtat gaatgagtac aagcaaatgc ttgtaagagt 2700 gaaagaggat gaatggcgtt cctttttgga gcgcaataag gacgacccct ggggtcgtgc 2760 ttataaagtt gtgagaggta ggcgtaggga agcggatgta agtggcctcc gtgtcggtga 2820 cgttcagtta acggcatgga gtgactgcat gaatgtctta ttgaatgagt tcttccctag 2880 agcggatcat cagaacttgc caccgagtgt ggttggtgat gttgacccgc tcctggatag 2940 tgaattggag gtggctttct cgatgttaaa gtcgaggaaa tcacctggta tggatggttt 3000 tactggggaa atgtgtaaga gtgtctggaa gtcaattcca gactatatga atgtaatgta 3060 tggaaagtgt atgaatgagg gatatttccc caatgagtgg aagtgtgcaa gggtgattgt 3120 gcttttgaag tcgcccgata gggtcaggag caatcctcgt tcctttcggg gcatcagtct 3180 cctaccagtg ctgggtaaag tgctggaaag agtcatggta gaaaggctcc aagagagagt 3240 gagtagccaa atgtcagatc ggcaatttgg ttttaggaag ggcagatgtg tggaagatgc 3300 gtggagattt gtaagtgact ccgttgagtc cagcaactcc aggcatattc taggcatctt 3360 tgttgatttt aaaggggcgt ttgaccacct gagttggccg agtgtgttgg agaggttgag 3420 cgaatgtggc tgccgggaat tggctatttg ggagagctat ttctctggca gacgtgcgtg 3480 tgctgtaggt cggcatgaaa gtgttagcct gaatgtggtt cgtggctgcc cacagggatc 3540 catctgtggt ccatttatat ggaacctcat gatggatacc ttgctatggc agctcgagcg 3600 tgtatgcaag tgctgtgcgt atgcggacga cctgctcatt ctggttgagg ggcaatcgcg 3660 agcggatgta gaggcaagtg cggcaacgta cttgcgagtt gtgtatgagt ggggtctcag 3720 agttggcgtc agtctggcaa tggacaagac cgtgacaatg ctgcttaagg gcagattgtc 3780 gcgtagtcga cctccattgg ttaggctaaa tggcgtcagc ctgaggcatg tgtcggaggt 3840 gaaatacctc ggcattgtgt ttggcgagag gatgtgcttc actcctcata tcgcatatgt 3900 gaaagggcga ttgcttagtt tggttggaca agtgcgtcgg attttgagaa gtgactgggg 3960 cctcagcaga tctgctgctc gcaccgtata tgatggtcta tttgttgcct gtgcaactta 4020 tggatcgtcg gtatggtgta aggcagtctt gactgttgta ggcagaaaga atgtgctggc 4080 ttgccagcgt gtgatgttgt tagggtgtct gcctgtatgc cgcactgtct ccacggaggc 4140 aatgcaggta ttgttaggag tagcccctct ggacttggag atcaggcgtc gaagcctgag 4200 ctataggatc aagaggcggt tgccgttgct gcagaatgaa tggttagcgg atagggatgt 4260 ggagagttta gggcttagtg agtgcaagaa attgctaaat gagtgtgttc tgtctgactg 4320 gcaggtcaga tgggatacta gcgagaatgg gagggtcact cataggttta ttcgggaggt 4380 cacatttgct gttgaccgtc cagacttcag gcttcacctg agttttggat ttttgttgac 4440 gggccacggg tcgctgaatg cattcttgca ttcaagacga ctctgtgata gcccggaatg 4500 cccttgtggc tgggtgggag agacatggga acatgtcctc tgtgagtgcc ctttgtatgc 4560 agatctgcga gatctaagtg tgcttgggat aacgcggggt attagtgggt atgacgtaag 4620 tcaagtgctc tccacttgtg aaggggtaag gagaatgagt gagtttgcac gggctgcatt 4680 tgccagacga cgtctcatac gtggagaagt tgggtgaatg ttgaatgtct gttgggggta 4740 tgaatgtggg ggtgtgtgaa tggattgctg agttgcgttc gggggtcacc agtcccggct 4800 ttatggagca aaactggaag tatccttgtg gtacgagttc tgaccggagg actgatccgg 4860 taccacgggc gttggggtgt tcaggggcgg tctcgaccct cggctctcag cgctgttgtt 4920 ggggatatca gctggccgcc tagcggtctg aattcgttca gttattaatg ccgctgtttg 4980 cggaagagga aggaataggc ctcgctcccc acagtgagaa aaccatgtcc ataaagcatg 5040 gcgtatactg ttccatctgc gatttggtgt ttacatcaat tcgcattcta ttccgtacag 5100 tgatatcttt ccttcttttt gggtcaccaa cccgtaactg tttggagttt aaattgggag 5160 taccctatgg gtacgaggct tgaccggagg cttgtttccg gtaccacggg taatcaggag 5220 cccgcggaac cttgtttccg tcctggtttt tggttgcggc ccttcgggga gtttcgtggt 5280 ggctgtggtt tgacacccaa atgcgggtag agcaattgac tcggcgtgtt gttgcgctat 5340 acaacagggt gccgtgaccc atagatcgga agtcgtttta ggtaggcggc cctccaaacc 5400 aaggtggaag ttcacgacca aacagtagtg acttcaaatt ggtacctgcg gaatattagt 5460 tccaatgggg cggtaattga cgcttaaatt aattcctgtg cttagcaccg tgagattaag 5520 ccatcccggc aggtgctcac gttaaaccaa ttgactttaa t 5561 // ID Copia12-NVi_LTR repbase; DNA; INV; 508 BP. XX AC AAZX01023343; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia12-NV; KW Copia12-NVi_I; Copia12-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-508 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1163-1163 (2007). XX DR Genome; AAZX01023343; Positions 14210 13703. XX SQ Sequence 508 BP; 195 A; 86 C; 126 G; 101 T; 0 other; tgcagaggaa agctggaaga agtaaagttg aagaattttg ataaattaga cgatttttcg 60 cagaattcga aaaatctgtg aacgagttca aggcggctgg tggaaaacta gatgaaacag 120 agaagatgag gtatttaata cgagcactac caccaagcta cagctacata ggagacttca 180 ttgatgtcat tccagaagat taaaggacag tagactatgt caaatcgaaa attaaagaaa 240 aaaattatgc aaaaaacgat tctgataaaa agagtaatgt aagtactttt acagccaagt 300 tcgaaggcaa atgttacaac tgtggaaaag ttggacatta caaaaatgca tgcacgagac 360 caccacagca gagtaaccga ggacgaggtg ctcaatccag tcaaggtcag cgaggtcatc 420 agagaggcta caacagaggt ggaaactaca gaggtcgtgg ccgaggcaga ggacaaggcc 480 aacccagagg tgacttttca agcgagca 508 // ID Copia-136_AA-I repbase; DNA; INV; 3984 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-136_AA_; KW Copia-136_AA-LTR; Ty1_copia_Ele126; Copia-136_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3984 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1415-1942] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 98..2332 FT /product="Copia-136_AA-I_2p" FT /translation="MEGSKISIVKLNNSNYAVWKFKAEMLLTREDTWKYIE FT DQAPEPMTDEWKGGDRKARATIALLVEDQQHALIRDTKTAKEAWENLQKHH FT QKTSLCSKVAVLKRICNKTYKDGDDMQNHLFELEELFGKLSCAGLDLGEPL FT MVALVLRSLPDSFDTLATALESRSDDDLKLELVKSKLLDEVAKRDHGSADS FT AMKASFLKKKKPLVCHFCKKTGHKQKECKLYAKEAESEKGRSPKARTARSE FT DAVTFALMAGVQANSGEWIVDSGASRHMCGERGFFEDLEESAVKSVSLADG FT KNAKVEGVGSGRLLCYHDNGKTTTVTLGNVLFVPSLEINLLSVPKLVEKGA FT AVTFDEVGCRILVGGQLAAAAIKTGGLYKVRTSKEVLMAAKDHHNVDCQHV FT WHRRFGHRDPNALGEMVQKDLVNGLKVYDCGVREGCDCCQKAKMSRKPFPK FT SVERKSTAVLDLVHTDVCGPVENVSVSGCRYFMTIIDDYSRFCVLYLLRKK FT EDVPEKIMEYVAMMKTQFGRPPKIVRSDQGGEYTGSALRAFYKREGIRAQF FT TASYSPQQNGVAERKNRYLVEMSRCMLFDANLPECYWAEAVSTAVYLQNIL FT PSKSIPTTPFELWHGRKPDVTHLKVFGSAAHVHVPKQKRKKLSPTSTKLTF FT VGYSPDHKAYRFLDRSTRRVIISRDANFVEEERLAQAENVNRHHEAENVVD FT FELGIKPTAIEVTPAVNSEPLDASDMDEIFWTMKIAIMTLLMNQR" FT CDS 2419..3984 FT /product="Copia-136_AA-I_1p" FT /translation="MVLDSIAEPRSYEEALASPESEYWKTAMDEEFRSLTE FT NDTWEITTLPAGRKKVGCKWVFKRKEDESGDVVRYKARLVAQGYSQKFGTD FT YDEVFAPVAKQVTMRSLLTVAAERNLVVKHVDIKTAYLYGKLDEEIFMKQP FT PGYETGNPAGVCRLKRSLYGLKQAARVWNRTIDDVFKRAGFLQSSADVCLY FT VRTRNGQQAFILIYVDDILVACSSEVEFVEIVAHLQKCFKLTVLGDIKHFV FT GIKVTKHQNGYLINQQGYITKLAARFGQTDAKGSKIPMDPGYQQKEEDSAV FT LSNSDQYQSLVGGLLYIAVNSRPDISISASILGRSVSQPRVRDWTEAKRVL FT RYLYSTRNHQLQLAANGKGLEVYCDADWAGDTKDRKSTSGFLIFLGGGVVS FT WASRKQSCVALSSTESEFIALAEACQEIQWIRKLLKDLGKKVDHPICIRED FT NQSCIRQVENEKVDQRSKHVDTKYCFIKDLKNKNIISLSYCPTEDMLADMM FT TKPLTSVKLGKLRIAAGIVAIDVEEE" XX SQ Sequence 3984 BP; 1080 A; 890 C; 1148 G; 866 T; 0 other; ataggttgtg ggccccagtc aaagcaatcc gcttgagcga atttgaattt tttcactttt 60 cgctgccgaa gtagaagttt ctggaagtag gagtaccatg gagggaagca agatttcgat 120 cgtgaagctg aacaattcca actatgctgt gtggaagttc aaggctgaaa tgctgctgac 180 tcgagaggac acgtggaagt acatcgaaga tcaagccccg gagccgatga cggacgaatg 240 gaagggtggt gaccgcaaag cgagagcgac gattgcgctg ctcgtggagg atcagcaaca 300 cgccctgatt cgcgatacga agacggccaa ggaagcttgg gaaaatctcc agaaacacca 360 ccagaagacg tccctctgct cgaaggttgc agttctgaag cgaatttgca acaaaacgta 420 taaggacggg gacgacatgc agaaccatct attcgagctg gaagagctgt ttggtaagct 480 gtcgtgtgct ggactggacc taggtgagcc actgatggtt gcattggtgc ttaggagcct 540 ccccgattct ttcgacacgc tggcgacggc cctagaaagc cgttccgacg atgacctgaa 600 gctggagctg gtcaaaagca agctgctgga tgaagtggcc aaacgggacc atggatcggc 660 ggattcagcg atgaaggcaa gtttcctgaa gaaaaagaag ccgctggtgt gccatttttg 720 caagaaaact ggccacaagc aaaaggagtg taagttgtac gccaaggagg cggaatcgga 780 aaaaggtcgt tcaccaaaag caagaactgc acgcagtgaa gacgcggtga catttgccct 840 gatggccggt gtccaagcga attctggtga gtggattgtg gactcgggag catcccggca 900 catgtgcggc gaacgaggat ttttcgaaga tctcgaagaa agcgctgtga aatctgtcag 960 cctggcggat ggaaaaaatg cgaaagtaga aggtgtcggt tccggtcggt tgttatgtta 1020 ccatgacaac ggtaaaacaa ctacggttac ccttggcaac gtgctgttcg taccaagctt 1080 ggaaatcaac ttgctctctg taccgaagct cgtagagaaa ggcgccgcag tgacgttcga 1140 cgaagtaggg tgccggattc tcgtcggtgg ccaattggca gcggccgcga tcaagactgg 1200 tggactctac aaggttagaa cctcgaaaga agttttgatg gctgcgaaag atcaccacaa 1260 cgtggactgc caacacgtgt ggcatcggcg ctttgggcat cgtgacccca acgctctcgg 1320 agaaatggta cagaaggacc tggtgaatgg actgaaggtg tacgattgcg gtgtgcgaga 1380 gggatgcgac tgctgccaga aggccaaaat gagtcggaaa ccatttccga aatcggtgga 1440 gaggaaatcc actgccgtgc tcgacctcgt gcacaccgac gtttgcggcc ctgttgagaa 1500 cgtgtcggtg tctggatgcc gttattttat gaccatcatc gacgactaca gcagattctg 1560 tgtcctgtac ctgctaagaa agaaggaaga tgttccagag aagataatgg agtacgtggc 1620 catgatgaag acccagttcg gccgaccgcc gaaaatagtc agaagcgacc aaggaggcga 1680 atacacagga agtgcgctga gggctttcta caaacgtgaa ggtattcgag cgcagttcac 1740 cgcaagctat tccccccaac aaaatggtgt tgccgagagg aagaatcggt acctggtcga 1800 gatgagccgg tgcatgttgt ttgatgccaa tcttccagag tgctactggg ctgaggcggt 1860 aagtactgcc gtttatttgc agaacatttt gccatcgaaa tcgattccaa caaccccgtt 1920 cgagctctgg catggaagga agccggatgt gacgcacctg aaagttttcg gcagtgcggc 1980 gcacgtccat gtccccaagc agaagaggaa gaagttaagc cctacatcga cgaaactaac 2040 gtttgtgggg tactcaccgg accacaaagc atatcgcttc ctggaccggt cgactcgacg 2100 agtgattatc agccgtgatg cgaatttcgt tgaggaggag cgcctcgctc aagccgagaa 2160 tgtcaatcgg catcatgaag cggagaatgt tgtcgatttc gagttaggaa tcaagccaac 2220 cgctattgaa gtaacgccgg cggtcaattc agaaccgttg gatgcgtcgg acatggatga 2280 gatcttctgg acgatgaaaa ttgcgattat gacactgctg atgaatcaac gttgaacgat 2340 gaagccgctg cgacccctcg tagatcgaat cgtcaatcga agggtatacc accagtacgt 2400 taccgtgaaa ctattggcat ggtgttggat tcgattgctg agccacgcag ttatgaagaa 2460 gctttagcaa gtcctgagag cgagtactgg aagactgcga tggatgaaga attccgttct 2520 ctgacggaaa atgacacatg ggagatcacc acactaccag ctggccgcaa gaaagttggt 2580 tgcaagtggg tcttcaagcg caaggaggac gaatccggtg acgtcgtgcg atataaagcc 2640 agattagtcg cccaaggata ttcgcaaaaa ttcggaaccg attacgatga ggttttcgcg 2700 ccggtggcta agcaagtcac tatgcgttcg ctgctgacgg tggcggctga gcgaaatctc 2760 gttgtcaagc atgtagacat caagactgca tacttgtatg gtaagttgga tgaagaaatc 2820 tttatgaagc agccccctgg ttacgaaacc ggcaatcccg ccggtgtgtg tcgtctgaag 2880 cgaagtctgt acgggctcaa acaagcggca cgagtatgga acagaacgat cgacgatgtg 2940 ttcaagcgag ctggattttt gcagtcatca gccgatgtgt gcctctacgt tcggacgagg 3000 aatggacaac aagcgttcat cctgatctac gtggacgaca tcttggtggc ctgctctagc 3060 gaagtcgaat tcgtcgagat agttgcccat cttcaaaaat gtttcaagct gacggtgttg 3120 ggtgacatca aacatttcgt tggcatcaag gtgaccaaac atcagaatgg gtacctcatc 3180 aatcagcagg gctatatcac caagcttgct gcacggttcg gccagacgga tgcgaaaggg 3240 tcgaaaatcc caatggaccc tggttaccag caaaaggagg aggattcagc tgttctttcg 3300 aatagcgacc agtaccaaag tctggtgggt ggtttgctgt acattgcagt aaattcacgg 3360 ccggatattt cgatcagcgc atcgattctc ggacgatccg ttagccagcc ccgagttagg 3420 gactggacag aagcgaagag agtgttgagg tatctctaca gcactcgaaa tcaccagtta 3480 cagctagcag cgaatggtaa aggcttggaa gtttactgcg acgccgactg ggccggtgac 3540 accaaggacc ggaaatcaac ttctggtttt ctcatttttc tcggtggcgg agtcgtgagt 3600 tgggcttcca ggaagcaatc gtgcgtggct ctgtcatcca ctgaatcgga gttcatcgct 3660 ctggctgaag cgtgccaaga aatccaatgg atacgcaagc ttcttaagga tctggggaag 3720 aaagtggatc atccgatctg catccgtgaa gacaaccaaa gctgcatccg gcaggtggag 3780 aacgagaaag tggatcaacg atcgaagcac gtcgacacaa aatattgctt catcaaggac 3840 ctgaaaaata agaacattat cagtttgtcg tattgtccta cagaagacat gctggcagac 3900 atgatgacca aaccgttgac gagtgtgaag ctgggaaaat taagaattgc agcaggaatt 3960 gtagctatcg atgtcgagga ggag 3984 // ID BEL-17_DPu-LTR repbase; DNA; INV; 410 BP. XX AC ACJG01006857; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-17_DPu_; KW BEL-17_DPu-I; BEL-17_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-410 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01006857; Positions 102622 102213. XX SQ Sequence 410 BP; 119 A; 97 C; 75 G; 119 T; 0 other; tggaaaatct cgtcgttacg agagccggcc ggaaggtgcg ccctagaaag agactggacc 60 tctgagcggg agaagtggag ccgagtcgag tggcgctact agcgacgatc cgtcgaccta 120 attcttttcc cttcagagca aaatcagaaa aaaaaacaac atctgtctca gaaagaacaa 180 ctcgaaaacc gctctgcgcg aaacaagtgt caagtgtttt aaccccctta tgttcgttca 240 ctttcgttca ctctcattga attatcgcca tctcattatt ccattagatt tatcaagtgt 300 caagttccca tttatagttt tgtgtacttc atcaatattc atcaatcact catacatcat 360 acaagtaaac ttcccagcgt gtcatttatt catttaattt gtgatttaca 410 // ID BEL-1_IS-I repbase; DNA; INV; 3952 BP. XX AC ABJB010932093; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_IS_; KW BEL-1_IS-LTR; BEL-1_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-3952 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010932093; Positions 4697 8648. XX CC 'CAGCA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 430..2166 FT /product="BEL-1_IS-I_1p" FT /translation="MLQSRAVQLRSGEPARPSTLAIASRSEERRRTGIKLP FT KLQLQSFKGELAEWQPFWQQFKRAIHENGELSSGEKFQYLRTLVSGPAKAS FT IDGLQVTDACYNDAIEILTRQFGDYRRIQQDHLTKLRTLPSVTYSNDTKGL FT RRLYDQVQTHIRGLHELGVSSDSYSTMLMDIVIKALPSEIVVDFYRRAGHS FT ERKTVHPYEGDVAAHSSPPTLQAEELGSISGTSRTAELYARPTHRNEDAQL FT NSRRAEGDSQQLLEYLLIEIESRERSGVRDDRPPRTNREQLHQRAHLPNSA FT VLHAKAQDNKEECIFCKTTGQSTENCTSSISLMEKKKKLSGEMRCFRCTLR FT GHRSKDCHRKLSCTSCAGRHVTSMCDPTWKPSANTSATQTMTTSVQSAGIG FT STGISIDDDVLLQTFRTWVVEDGNTVYARAILDGGSQRTFIREDVSRKLSL FT KVLGETRLRLNTFGSSTSLPKKRRLVQVKLRSQYNNEECSVEAVEIPFICE FT DIVQVPVNHEFVRSIEEDGGCIADKLLFPGVSTEPGMSLLIGSDQMWKVAS FT EPVRRCESNCSLMASDTTFGWTFKALHHFRVP" XX SQ Sequence 3952 BP; 1153 A; 960 C; 1009 G; 830 T; 0 other; actgaatggt gccgaaaccc gggacttcaa gccaggaacg accctcagcg aaacgaagtc 60 agttgctgag cagcgaacga gtgaacgcca tggactaacc tcgaacgaaa tatggatcga 120 catcggaaca agtaagcaac tctattaccc gaagcttacc aacctctcga tttattcaaa 180 catggaaagg tgcgaacgaa aaggactgct cgcagagcac aaaacacccg aattgttaga 240 gaggccaccg ctattctcaa cactgcggat gtaattcaaa tcgaggccat agcaagaaag 300 cttgaggtaa attatgaaga actcaagaag ctgaacaccg agctcgagcc cttcattgcc 360 gatggagatc ttgagcacga ataccagtcg gtaattgagt acgaggaaga aacgacccac 420 accttgagca tgcttcaaag tcgagcagtg caactacgtt ctggtgaacc agcgcgacca 480 tcaacattgg caattgcctc acgaagcgaa gaacgaagac ggactggtat taaactaccg 540 aagttgcagc tgcaatcctt taagggcgag cttgctgagt ggcagccatt ttggcaacaa 600 ttcaagcgtg ccattcacga aaacggagag ctgtcaagcg gcgagaagtt tcaatactta 660 aggacgctgg ttagtggacc cgccaaggca tctatcgatg gtctccaagt gactgatgcg 720 tgctacaacg acgcaattga aatcttaact agacagtttg gagactacag gcgaatccag 780 caagaccacc ttacaaagct tcgaacgctt ccgagtgtca cctactcaaa cgacaccaag 840 ggtttgagga gactttacga tcaggtgcaa acccacatcc gcggtcttca tgaacttggt 900 gtcagctcag acagctactc gaccatgctg atggacattg tcatcaaggc tcttccatct 960 gagatagtgg tggatttcta ccgacgtgct ggccattctg aaaggaagac cgtccatccg 1020 tatgagggag acgtggctgc acattcatct ccacccacat tacaagcaga agagctgggg 1080 tcaatctcag gtacttccag aactgctgag ctgtacgcca gaccaacaca tcgaaatgag 1140 gacgctcaac tgaacagcag aagggcggag ggagattcgc aacagttgct tgaatatctg 1200 ctaatcgaga tcgagagtag agaaagatct ggtgtcagag atgatcgtcc accgagaaca 1260 aatcgagagc aactacatca acgagctcat cttccgaatt ctgcggttct ccatgccaaa 1320 gctcaagata acaaggagga gtgcattttt tgcaaaacta caggacaaag cacggaaaac 1380 tgcaccagta gcatctctct aatggagaag aagaagaaac tttcaggaga gatgcgatgt 1440 ttccggtgca ccctaagagg gcataggtca aaggactgtc acaggaaatt gtcttgcact 1500 tcctgtgcgg ggcgacacgt aacatccatg tgcgatccga cctggaaacc ttcagcgaac 1560 actagtgcaa cacaaacaat gacaactagt gttcaaagcg ctggcattgg tagcactggg 1620 atttcgatag acgacgacgt cttacttcag acctttcgaa cctgggtggt cgaggatggg 1680 aatactgtct acgcacgtgc catcttagac ggtggaagtc agcgaacgtt tatacgagaa 1740 gacgtttctc gcaagctgag tcttaaggtg ctaggagaga ccagactacg cttgaacacg 1800 tttggaagtt ctacttcact gccgaaaaaa cgaaggctgg ttcaagtgaa gctgcgtagt 1860 cagtacaaca acgaagaatg cagtgttgaa gcggtagaaa tcccatttat atgcgaggac 1920 attgttcagg tacctgtaaa ccatgagttt gtccgcagca ttgaggaaga cggaggctgc 1980 atcgctgaca agctcctctt ccccggcgtt tctaccgaac caggcatgag cctgcttatt 2040 gggtcagacc aaatgtggaa agtcgccagt gagccggtgc gacgttgtga aagcaactgc 2100 agcctgatgg cttcagacac aacttttggg tggactttca aggccctaca tcacttcaga 2160 gttccttaac ctgcagtaca aacaaggggg gctctaataa aaaagacgac gacaccgaga 2220 aaaccaccgc caagacgaag cataaacgtt ccgtcggatc cccgatttcc gtaaacctgc 2280 agtggcaaga gaacgaccac gagtacgaag aacgaattca aggagacctc acttgcttcg 2340 gaaagtacac ttgcaaggat ataggcggca gcctcaagtg cagtcacccg ccgcattcgc 2400 agcgctcccc gaaattaacc aatcaaagga acgaccggca gtgtcagctg tcttcgtcaa 2460 aacgctctga catcgatggg ggccagaggc tggatgtttc aagcgaatct gacagtgttg 2520 ggcccgaggt tcaaatcaga ctgaataaga cttccgacag ccgctcgcaa tcaaagaagt 2580 cgagagtatc cgacacgtgg cagtcttcga acatggttgt gtcgtgcgct cacaataatt 2640 ttgagaaagc gaatgtcagc gaagaggaca acggcgagcg taatttgcag tccgacagtg 2700 tgttttgcaa tcagacgtcg accaggaagt ttctcaaacg aaagagaaca aaccgaagag 2760 tggccgacaa agagctcttc gcttactccg tggtagttgt agttgccgtc gtctcactta 2820 cgctggccgt cgctctgagc agtcgaggaa tacatgggtt aggactagac catcatctcg 2880 tcttcaacct ttccaacctg aaggctatga aggaagatcc aacctttcgg catgctgcac 2940 ccacgagacg cagggacgcg cgttacttga ccagtgggcg aatgtttcta aaaattctcg 3000 gactgaacta cttccaccgt gcgtgccgca atttctacaa gtttgactgc aaacgtttgc 3060 cagtacgtcc agcgactaca tcgcaagaca atttgataga agggctgctc gctgtcgtca 3120 aatcagaaga caaatacccg gagctgaaca cggcgcggag gttgtacaaa gaatgtcttc 3180 agaaagatgt gatcgcggaa cagggcgggg acacgctggc agagcttcat actgaccccg 3240 gactccatgg cttatagaac gtgctgaaag acaagaatgc aagaatacta ccgaataacg 3300 aatacagtgt aaattcattg ttgaccgagc tgcccggtgg ggtttgtttt gggagcgtat 3360 ggtgtggggg ggagcgctgc caaaacatgc ctcggaaagt tcttagaaga aattgctgca 3420 gttttgaaga gcttaccact atactccatg agattgaggc agttgttaac tcacgacccc 3480 tctcatttct gcaagcctct cccgacgagc ccgaagcgtt gaccccggag cattttttta 3540 ctggaaagcg accgacagct cttccaactg cgggtgctga gaaattccaa acctctacga 3600 gagaaaatgt aacccggcgt taaagaaagc atcaacaact gctagaccac ttttggcagc 3660 ggtggcgaaa agaatacttg ctccagctgc gatcagcaca cgaaacaact tgtcaccgaa 3720 acagcgaaac agccagaagg cgatccggtt ctagttgagg aaaacaaaac gcccagacac 3780 ctgtggcgga ctggacgaat tgggcaggta ttctacggcc gggatagtat agtgtgatcc 3840 tgcgctgtga acgtgccagg cggacaagtg ctgaaaagac ccgtgcaact tttatatacc 3900 ctagagcttt atgttgaatg agcgatctaa aaacactcat ttgggggcag tg 3952 // ID piggyBac-N1_DWil repbase; DNA; INV; 658 BP. XX AC . XX DT 23-FEB-2007 (Rel. 12.02, Created) DT 23-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of piggyBac non-autonomous DNA transposons - a DE consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW piggyBac-N1_DWil. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-658 RA Kapitonov V.V. and Jurka J.; RT "piggyBac-N1_DWil, a family of non-autonomous piggyBac DNA RT transposons from the Drosophila willistoni genome."; RL Repbase Reports 7(2), 125-125 (2007). XX DR [1] (Consensus) XX CC This is a family of non-autonomous piggyBac DNA transposons. The CC piggyBac-N1_DWil elements are ~5% divergent from their consensus CC sequence and are characterized by the TTAA target site CC duplications. XX SQ Sequence 658 BP; 218 A; 97 C; 92 G; 251 T; 0 other; cactagatct accacaccag tcaaaatgac tggttttact ttttatttca taaatattga 60 ctaagtgttt ttattttttt tcgatattat gttatgactt tttaaataac cactaaatac 120 atacaatttg ataagttaat ttcattttgt gttcatagga aaagagattt ttttttgttt 180 ttggcactta taccgtaacc agtcattttg actgggtata tgattgatat gcaaatattt 240 tatattttgc ccttttagga agttcccttg tttgaaacat cactgtttgt aagtttgttt 300 acacatccct gaagatctca cagctgaata tcagttatca agccgaagaa aggcaaatga 360 aaatcaccgt ccagtaacgc agaatccccg gcctgtacgt aaatggtgcc aaatagggca 420 ttgtaacaac aacaagaaaa aagtttctat ttgtaaaaca tgtaacaaac atgatttttt 480 tgattaaaac taaaacacaa ttgatgcatt tttattattt tatttatatt tttttatttc 540 tcattaaatc ttttaaagga ttacgaattt taatataaaa taataggact tttcttaacc 600 agtcaaattg actggctctg gtataaatag gtatacaaat ttcttggtaa atctagtg 658 // ID I-5_CQ repbase; DNA; INV; 5023 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE An I non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5023 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 111-111 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >90% CC identity. It is phylogenetically close to Loner elements in CC Anopheles gambiae. XX FH Key Location/Qualifiers FT CDS 112..1269 FT /product="I-5_CQ_1p" FT /translation="MTTTEATTRRVRQYPDEATGPFLVIAEATGEDNLRPT FT NLSKTIIGMYQQDYVRAVVLSRRRMKILMRTGQAANDLVATRTIKNVQFSI FT PQRLVEVLGVAHIELEVDEDDLANATTFDKGKMMQLHNPKIVETRRMRKRV FT DGVDKMLTTVVVTFEGQKLPTHLIINQVLYPVKEYVYHTRQCRKCWRMGHG FT EKNCRGKARCKKCAQEITTTIAEHACEVAVPVCINCKGNHEADDTKACPKA FT IKRKEDDQKRKEAHSQGKTDWFSTAGLAEETSGSQPVSTLTEETIPAPSTS FT GAQSKPAKRKCLDDNTDDSSEEEEIPRLTVHIEEGIRNAIQQAINSDEATL FT AISGVLGAPPSEIPDDRLEDALKARLFDILAQRTDDYIGTLRL" FT CDS 1276..4938 FT /product="I-5_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="TTKGTPAQAFPLQFLQWNCRGALSKKASIINLINSTH FT SSILCICESHLDSAANFDIPGFHLFRKDHQSNSRGVLVAARTDLNPTSLDI FT PTPQEIDAVACQVRCRLGKLTIVSVYIHPNTVVSQNTFEDFFNSIPQPCIL FT AGDWNAKNPAWGGDIQNQRGTNLLAALDVSRFVVLNDGSYTRFDANRLPSA FT IDLTIASADISLLFEWAPLDFPYGSDHLPITFGTTTEVVEEQQPGINYKRL FT DWEKFSTLLDERVKLLSPTASYEVFFETVWRSLTDATPTKSSTQVRKLPQP FT YWDTELQTALEDRRDKFRAWRRKLDYESYCIYTDAEQHFKVLVNRKQRESW FT RSLCDSFDSQTSVQKLWRLGRRYKNRVTGANHRLRDEVQLNKLLDKLAPPS FT TSREPLEFNQCNCSCTTAGAYFTTNDLDSAIKPGQDTAPGVDGICYSVMRN FT LPFTAKMKLLRIYNDIFDTGRIPEAWKVFKVIPILKPGKPPSDASSFRPIA FT LASCFRKCYENMIKEKIEWYIESNKLFPHEICGFRKGKGTLDALHILVDQV FT QLALNNNQHVIACSVDVEGAYDNVQIEVLVAQMRRLGISECLVKAVYSLFK FT ERLLHAVLDGNPIQRVTWVGLPQGSPLSPLCFNIVIFSLFGFRIAGVFYLD FT FADDITVACRGANLDESIRNIQTAIDEVVGRINALALRVAPSKCSSIIFSR FT RAVDDDRTPVLTVDGSPVPYLRSIKLLGLHLTPTLSWRKHFLYVKHRAATY FT TNFMRSVAGQSWGADPAALLTIFKSCIRPILEYASIFFTGAPQADTIILDR FT IQWSCIRIALGSTKTTHTGSLEVLSGLMPLKHRREMATMKFVERRFSLLPW FT HERFVNTTLEGQSSTWIRRNILQYRAFSGWLSTRDTLPCFQFDLDTRRVAV FT KVDLSVHRAMQEGLHQPASDRVEAEIQRNYSNATLLATDGSKDGNGVGYAV FT VDSRMRTVHSTKTHKLLSIFHAELLALRQAVEIIAGSGAGEYVVLTDSLSS FT LMSLSNSRVSSHQPSVWFEIKRFISQISERGANVTFMWVPAHRGVPLNEAA FT DRAANQARITGGSENYNLTSLDISFPARKRAMEHWQLDWDNGTKGRFCNKI FT VPTVDTSPWFADRDXNRREIVVLSKLISNHSRLPAHLWRNNIVEDATCQCG FT ESSATPDHLLFTCDLYDDDRRALWRAIVSQKEIPDLELVLKSQNDVVLRAI FT VKFFDDAAIDL" XX SQ Sequence 5023 BP; 1369 A; 1285 C; 1266 G; 1098 T; 5 other; agttctgttt gagcctccgt gagtcacagt cgtttcgcac tgaagctgct acacaatagc 60 accaccgtcc acattgttcg ccgattgaat agctaccgca agccgagaag aatgactacg 120 acggaggcca ctactcgtcg ggtgagacaa tatccagacg aggcgaccgg tccgttcttg 180 gtcattgcgg aagctaccgg agaggacaat ctacgaccaa caaacctctc taagaccatc 240 attggcatgt accagcagga ctacgtacgt gccgtagttc tgtccaggag acgcatgaaa 300 attcttatga ggaccgggca ggctgcgaat gacctcgttg ccacccggac aatcaagaac 360 gtccaatttt cgatcccgca acgcctcgtc gaagtactgg gagttgcaca cattgagttg 420 gaggttgacg aagatgactt ggcgaacgct acgacgttcg acaaggggaa aatgatgcag 480 ctccataacc ccaaaattgt cgagacgcgt cgtatgagaa agcgcgtcga cggtgttgac 540 aagatgctaa ccactgttgt ggttaccttc gaaggncaaa aactaccaac ccatctcatc 600 atcaaccagg tgctgtaccc ggtcaaggag tacgtctacc atactcgcca atgccgcaaa 660 tgttggcgta tgggccatgg ggaaaaaaac tgcagaggta aagcgcgctg caagaagtgt 720 gcccaagaaa taacaaccac gattgctgag catgcatgcg aggtcgccgt tccagtctgc 780 attaactgca agggcaatca cgaagcggac gacaccaaag cttgtccgaa agctatcaag 840 cgcaaagagg atgaccaaaa acggaaggaa gcgcactctc aaggaaaaac ggactggttc 900 tctaccgctg gtctagctga agagacctcg ggcagtcaac cggtgagcac tcttactgag 960 gaaaccatcc cggcgccgtc aacctcggga gcacagtcta aaccagctaa gcgcaagtgt 1020 ctcgacgaca acactgatga cagctctgag gaagaagaga tcccgcgact caccgtacac 1080 atcgaagaag gaatccgtaa cgctatccaa caggccatca actccgatga ggccacacta 1140 gcaatcagcg gcgtgctggg ggcgccgcct agcgagattc cagatgaccg actagaagat 1200 gctctcaagg ccagactgtt tgacatccta gcccagagga ccgatgacta catcggtact 1260 ctccggttat aataaaccac caagggcacc ccagcccagg cattccctct gcagttccta 1320 cagtggaact gcagaggagc cttgagtaaa aaggcaagta tcatcaattt gattaattct 1380 acccactcct cgatcctatg tatctgtgaa agccacctgg attctgcagc caatttcgac 1440 attccaggct ttcatctgtt ccggaaagat catcaaagca actccagggg tgtactcgta 1500 gctgctagaa ccgacctaaa tcccacatcg ctggacatcc caaccccgca ggaaatagat 1560 gcagtggcct gtcaagttcg ctgccgactc ggaaagctga cgatcgtttc ggtgtatatt 1620 catccgaaca cggtcgtgtc gcaaaacacc ttcgaagact tcttcaacag catccctcaa 1680 ccgtgcatcc tggcagggga ctggaacgct aaaaatccag cttggggcgg agacattcag 1740 aaccagcgtg gaacgaacct tctggcggct ctggatgtgt cgagatttgt agttttgaac 1800 gatggcagct acacacggtt cgacgcaaat cgtttaccaa gtgcaattga cttgaccatt 1860 gcctcagcgg acatcagctt actcttcgag tgggctccgt tagatttccc ctacggtagc 1920 gaccatcttc cgattacttt cggcacgaca acggaagtgg tggaagaaca gcaaccggga 1980 ataaactaca aaaggcttga ctgggaaaag ttctccacgc tgctggacga aagagttaag 2040 ctgctgagcc ctacagcaag ttacgaagta tttttcgaga cggtgtggcg aagcctaacg 2100 gatgcaacac caactaaatc gtccacccag gtgagaaagc taccgcagcc ctactgggat 2160 acggaactgc aaacggcgct ggaggacaga agggacaagt tcagagcatg gcggagaaaa 2220 ttagactacg aaagttactg catctatacg gatgcagagc aacatttcaa agtcctagtc 2280 aacaggaagc aacgagaatc gtggaggtca ctttgtgatt ccttcgattc ccaaacgtct 2340 gtccagaaac tttggcgtct cgggcggcga tacaagaacc gtgtcactgg agcgaatcat 2400 cggttgcggg acgaagttca gcttaacaag ctactcgaca agttggctcc cccatcgacc 2460 agcagagaac ccctggaatt caaccaatgc aactgcagct gtactacagc cggagcttat 2520 ttcaccacaa acgatctgga cagtgccatc aagccggggc aagacaccgc accgggcgtg 2580 gacggcattt gctactcggt tatgcgcaat ttgcctttca ctgcaaaaat gaaactcctt 2640 cgcatataca atgacatctt cgacaccggc cggattccag aagcctggaa agtcttcaag 2700 gtcataccga tcctcaaacc cggtaaacct ccgtctgacg ccagttcctt ccgcccgatc 2760 gcactcgcct catgttttcg taaatgttac gaaaatatga taaaagagaa gatcgagtgg 2820 tacatcgaga gcaacaaact ttttcctcac gaaatatgcg gttttcggaa agggaaaggt 2880 actctagatg cgctacacat cctggtggac caagttcagc tggcgttgaa caacaatcag 2940 cacgtcatcg cgtgtagtgt tgatgttgaa ggcgcttacg acaacgtaca gattgaggtg 3000 ctagtagcac agatgcgccg attgggaatc agcgagtgcc tcgtcaaggc tgtgtacagt 3060 ctgttcaagg aacgtctgct ccacgcggtt ttggatggta acccaattca acgggtgaca 3120 tgggttggac taccgcaagg gtctccgttg agtcctctgt gtttcaacat cgtgatcttc 3180 agcctcttcg gattccgtat cgcgggggtt ttctacctgg attttgcaga cgacataaca 3240 gtggcgtgtc gaggagcgaa cctcgacgaa agcattcgta acatccagac ggcgatagac 3300 gaagtggtag gcagaatcaa cgcattggct ctacgggtag ctccgagcaa atgtagttcc 3360 ataatcttct cgagacgagc agttgacgac gataggacac cggttctaac cgtcgacgga 3420 tcacctgtac catatctgcg aagcatcaaa cttttggggc tgcatcttac accaacgctt 3480 tcctggcgaa aacattttct ttacgtaaag catcgagctg ccacctacac taatttcatg 3540 cggtcggtag ctggacagtc ctggggtgct gatccggcgg ccttgttgac catcttcaaa 3600 tcctgcatca gaccaatcct ggagtacgcc tcaatcttct tcaccggtgc accgcaagct 3660 gatacgatca tccttgatcg cattcaatgg agctgtatca ggattgcatt aggatctaca 3720 aaaacgacac acacgggttc tctagaagtg ctgagtggac tgatgccatt gaaacaccgg 3780 agagaaatgg cgacaatgaa gtttgttgag cgccgttttt cccttctacc atggcatgaa 3840 agatttgtca acacgacctt ggaagggcaa tcgtcaacgt ggattcgacg taatatcttg 3900 cagtatcgag ctttcagtgg ttggttgagc acgagagaca ccctgccatg tttccagttc 3960 gacctagaca ctcgccgcgt ggctgtaaag gtcgaccttt ctgttcatcg agctatgcag 4020 gaagggttgc atcaacccgc cagtgatcga gtagaagcag aaatccaacg taattattct 4080 aatgcaactc tgctagcaac ggatggatcg aaggacggaa acggagtggg ctacgcggtt 4140 gttgacagcc gcatgcgcac cgtacacagt accaaaacgc acaaactgct gtcmattttc 4200 cacgctgagt tgctggcgct gcgacaagcg gtggaaatta ttgctggatc cggtgctgga 4260 gaatacgttg tgctaactga cagcctgagc agcttgatga gtctgtctaa cagccgggta 4320 tcctcccatc agccgagtgt gtggttcgaa ataaagcggt tcatctccca gatttctgaa 4380 cggggcgcta atgtcacttt catgtgggtc cccgcacacc ggggtgtccc attgaatgag 4440 gcagcagatc gcgcagctaa ccaagcacgg attactggtg gatcagaaaa ctacaatctt 4500 acgagcctcg acattagttt cccagcacgc aaacgtgcga tggagcattg gcagttggac 4560 tgggataatg gaacmaaggg gagattctgc aacaaaatag ttcccactgt ggacacttcc 4620 ccgtggtttg cggatagaga ctwcaaccgg cgggaaatag tagtgctgtc gaaactaatc 4680 agcaaccatt ccagacttcc tgcgcatctg tggaggaaca acattgttga ggacgcaacg 4740 tgccaatgcg gcgaatcatc tgctactcca gaccacctgc tgttcacctg tgacctgtac 4800 gatgatgatc gacgtgccct gtggagggcg attgtgtccc aaaaggaaat cccggacttg 4860 gagcttgtgt tgaaatcaca aaacgatgtt gttttgcggg ccatmgtgaa gttttttgat 4920 gatgctgcta tcgatctgta aagtgctgta gaagaatcaa aacttggcca acttatggca 4980 acaagaggcc agcaacctaa taaaaaaaaa aaaaaaaaaa aaa 5023 // ID Crack-3_CQ repbase; DNA; INV; 4675 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4675 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 34-34 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 1569..4472 FT /product="Crack-3_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TTISATHYATNAYVGNPLNLSLPASSNVIKILQINIR FT GINRFEKLDSLCLFIRNLNTVVDVIVVGETWLKEARSQYYNIPDYQAIHSC FT RRTSAGGLAIFIRKELPFAVTANTTNQGYHHIAVKLHAGKNQVLIHGFYRP FT PDFDAVHLASSIEAILANADTADPCLLVGDMNLPVNNPEARDVQQYTQLLN FT SYGMAVTNSFVTRPSSTNILDHVVCSVANSSRVSNYTMDCDLSDHSYVLTV FT LKLKMEQERRTLTKTWTNYTAVNDEYRSFLQEYQLDLLVPNERLQAITERY FT ALLTKKHTRTTTVEVKVKHNCCPWFNYDIWKLSNIKDKVLQKWKANRLDEV FT QTELLARANLKLNEAKRKAKANYHSRLFCTSNPKLLWSRINEVLGGKSKAN FT SKVQLEVNHEIVEDPSSVSNIFNDYFASVGVNLGSQLTSDGNIHKFGTMKY FT ATRSLFLRPVTSLEVQNIIFALDPKKAQGYDGFPVAALKRHSTLLASVIKD FT CFNECVSQGEFPACLKRALVHPVFKGGDPTNPSNYRPISVLPVLNKVFEKL FT LFARIYNFLIATEQLYRHQFGFRKGSSTEVAILELVDEVSKTLDDKLSAGS FT IFLDLSKAFDTINHQMLLKKLESCGLRGLPNALLQSYLSDRIQQVVVSGIR FT SEIQFVRCGVPQGSVLGPLLFLIYVNDMSKLQLHGKTRLFADDTAISYACP FT SPVVVVQQMKEDMELVFNYLENNLLALNLSKTKMMIFRFLHSQLPEYPSLS FT VRNTVIEEVACFKYLGVYLDNRFNWDNHVRQVIAKCSSLCGILRRLSRSVP FT HHVLLKIYLAYIHSRYRYAIAVWGSCPKVYLKELQVQQNRCIKAVYKLPFL FT HPTRELYSMPQHKILPMYGLYTQSVAVIMYRILNNVNIHHNWEFEAAIHDH FT YTRGSTDLRRTGFRTELGRRRFKIRGPTLYNELPENMKEALTVNEFKRKLT FT IYTRENLNNFIVR" FT CDS join(226..882,689..1531) FT /product="Crack-3_CQ_1p" FT /translation="QNRAAHPSKSVTSASVKPLHCIIGRHTVDLVAEFHSY FT SQLLASPPNKICCNQHKCCCNATTESLNQHTNYKHCQTKSPIVHVTSDESR FT IKIIVLLVVFIRAESPPHTSHPPSRHRLQNLDAKYFAVCLLLRCCLLQEIL FT LERNRRISVVSCAAIVLVLALHTATPPCTKFQTTAPSVSEHCQHEFKFGRV FT ARTHCIYEEGFIKLPEGCCSEPTISFEQIYWYLHCTLPHHPAQNFKQQRHL FT LASTANMSLNLAELPELIASMRKDSSNCLKDVVASQQFLSNKFEEIVGHMK FT LLQNQIQLLRSENHCLKQSIKNLSENAKSITQVVHQAELDIDCQQRSELST FT NAIILGIPRTAQENTDDIVLKTCEALGYSTAKHDMVSCCRIPCAKSENSPI FT RITFKSAHCKEKLMEHKKQYGTMDISAIQNQKLPMGTQGRVVIRDELTPLS FT RRLLQELKGLQAALDLRYVWPGRNGAIMVRKTEKSKAIPILSRHDIQKLLQ FT TPKK" XX SQ Sequence 4675 BP; 1442 A; 1164 C; 953 G; 1116 T; 0 other; ctggtaacac tgctcgacta gtgaacttgt tgtgattgaa ccctccagat aattcagcaa 60 aaattaaccc gaaaaccacc aataaatcgc cagagacatc gctgtacccg ttgatcacgc 120 ctcgctgtac gtattgatgc tgaaaatcag tgatagtgca tgcagtaaac cgagttaaga 180 gcgttggaaa tttcgaacta atctcagtac actgcaccac cataacaaaa ccgggccgcc 240 cacccgtcga aaagtgtgac atccgcttct gtcaaaccac ttcattgcat tattggtcgt 300 catactgttg atttagttgc cgaattccac tcttactcgc aattgttggc ctcgccaccc 360 aacaagatct gctgcaatca gcataaatgc tgctgtaatg ccaccaccga gtctctgaac 420 cagcacacaa attataagca ctgccaaaca aaatcgccaa ttgtacatgt aacatctgat 480 gagtccagaa taaagatcat cgttctgctg gttgttttca ttcgtgccga aagcccgccc 540 cacacgtcgc atccaccgtc tcgccatcgt cttcaaaacc tggacgccaa atacttcgcc 600 gtttgtttac ttcttcgctg ctgtctacta caagagattc tactcgaacg gaataggcgt 660 atcagtgtgg taagttgtgc tgcaatagta ttggtacttg cactgcacac tgccacacca 720 ccctgcacaa aatttcaaac aacagcgcca tctgttagcg agcactgcca acatgagttt 780 aaatttggca gagttgccag aactcattgc atctatgagg aaggattcat caaactgcct 840 gaaggatgtt gtagcgagcc aacaatttct ttcgaacaaa tttgaagaaa tagtgggtca 900 tatgaagctg ctacaaaacc aaattcagct attgcggtcg gaaaatcact gtttgaagca 960 gtcaatcaaa aatctctcgg aaaatgccaa atcgatcacg caagtcgtcc atcaagcgga 1020 attagacatc gattgccaac aaaggtctga actgtctacc aacgcgatta ttctgggcat 1080 acccagaact gcccaggaaa acacagacga cattgtgctg aagacctgcg aagctcttgg 1140 atactctacg gcaaagcacg acatggtttc ctgctgcaga ataccctgtg cgaaatcaga 1200 aaattcaccg attcgcatca cattcaaaag cgcgcactgc aaggaaaaat tgatggagca 1260 caaaaaacaa tacggtacca tggatatcag tgcaattcaa aatcaaaagc tgccaatggg 1320 gacgcaaggt agagtagtca tccgagatga gctgacacca ctatcccgga gactattgca 1380 agagttgaaa ggattacaag ccgccctgga tcttcgttac gtatggcctg gacgtaatgg 1440 tgctatcatg gtgagaaaaa ctgaaaaatc gaaggccata ccaattctgt cccgtcacga 1500 catacagaag ctgttgcaaa ccccgaaaaa gtaatctgtg cagcttacca tgcagtatct 1560 acaactaaac caccattagt gcaacccatt acgcaaccaa cgcctacgta ggaaatcctt 1620 tgaatctgtc ccttcctgcc tcgtcgaatg tgatcaaaat tctgcagatc aatattcgag 1680 gaattaatcg atttgaaaag cttgactccc tgtgcctatt catacgcaat ctaaacacag 1740 tcgtcgacgt aatcgtggtt ggtgagacgt ggcttaaaga agctcggtcg cagtactaca 1800 acataccaga ctaccaagct atccactcct gccgtagaac gtcggccggc ggtctggcaa 1860 tcttcatcag gaaggagcta ccctttgcag taacagctaa cacaacgaac caaggctacc 1920 accacatcgc agtcaaactc cacgcgggaa aaaatcaagt tctcattcat gggttttacc 1980 gtccaccgga ctttgatgct gttcaccttg cctcatccat tgaagcgatc ctggcgaatg 2040 cagatacagc ggacccgtgt ctcttagtgg gtgacatgaa tctaccggtc aacaaccctg 2100 aagctagaga cgttcaacag tacacacagc tgctaaattc gtatggcatg gcggtgacta 2160 actcttttgt tacaagaccc agcagcacca acatcctcga ccacgtggtt tgcagtgtgg 2220 cgaactcatc ccgagtttcc aactatacga tggattgtga cctgagtgat cacagctatg 2280 tccttacagt gcttaaactc aagatggagc aggaacggcg cacactaact aagacttgga 2340 ctaactatac agctgtgaat gatgaatacc gatcatttct gcaagagtac caactagact 2400 tactggtacc aaacgaacgt ctgcaagcca taacagaacg ctatgctctt ctaacaaaga 2460 aacacacgcg aacgacgacg gtggaggtca aggtgaagca taactgttgt ccctggttca 2520 actacgacat ttggaagctg agcaacataa aggacaaagt gttgcagaaa tggaaggcca 2580 atcgactgga tgaggttcaa accgaactct tggctcgtgc aaatctaaag ctgaatgaag 2640 caaagcgtaa agcaaaggcg aactaccact cgaggctgtt ctgtacaagc aacccaaagc 2700 tactctggag tcgaatcaac gaggttctag gaggtaagtc gaaggcaaac tccaaagtgc 2760 agttagaagt aaaccatgaa atcgtggaag acccaagcag tgtctctaac atcttcaacg 2820 actactttgc ctcagtgggt gttaatctcg gcagccagct gacttctgac ggcaacatac 2880 acaagtttgg cacgatgaag tatgctacgc gatccctgtt tcttcgacca gttacatcac 2940 ttgaggtcca gaacataatt ttcgcactag accccaaaaa ggcgcagggt tatgatggtt 3000 tcccggttgc tgccctaaaa agacatagta ctcttctagc aagtgtcatc aaggactgct 3060 tcaacgagtg tgtcagtcag ggggagtttc cggcgtgctt aaagagagct ttggtccacc 3120 ccgtcttcaa aggaggcgac ccgactaacc cgtcaaacta caggcctatc tcagtgcttc 3180 ctgtcttgaa caaggtcttc gagaagctgc tttttgccag aatctacaac ttcctaatcg 3240 ccactgaaca gctatatcgc catcaatttg gcttccgtaa gggttcatcg acggaagtag 3300 ccatacttga gcttgttgac gaagtttcca aaacactcga tgataaactg tctgctggct 3360 cgatcttctt ggacctctcc aaagcattcg acacgataaa ccaccaaatg ttgctgaaaa 3420 agttagagtc gtgtggatta cgcggattac cgaacgccct tctacaaagc tacctgtccg 3480 accgcataca gcaagtcgtc gtgtctggaa tccgcagtga gatccagttt gttcgttgtg 3540 gagttcccca aggaagcgtt ctcggaccgc tactgtttct gatctacgtc aacgatatgt 3600 ctaaactgca gctccacggt aagacccgac tctttgccga cgacaccgcc atttcgtacg 3660 catgcccatc accggttgta gtagtacaac aaatgaagga agacatggag ttagtgttca 3720 actacctgga aaacaatctt ctggctttga acctcagcaa aactaaaatg atgattttta 3780 gatttctgca ttcacaattg ccagaatatc cctcacttag tgtacgcaac acagtcatcg 3840 aagaagtagc atgttttaag tacttgggcg tctacctcga caaccgattc aactgggaca 3900 atcatgtgcg gcaagttatt gcaaagtgct catcactgtg cggaatcctc aggagactgt 3960 ctagaagcgt cccacaccat gtgctcctga aaatctactt agcatacatt cacagccgtt 4020 accgatacgc cattgcagtt tggggctcgt gtcccaaagt ctacttaaaa gaacttcaag 4080 tgcaacaaaa ccgctgtatt aaggctgtct acaagctacc ctttctacat ccaacacgtg 4140 aactctattc aatgccacag cacaaaatcc tgccaatgta tggcctctac acacagtcag 4200 tcgcagtaat tatgtaccgg atcttaaaca acgtcaacat tcatcacaat tgggagtttg 4260 aagctgctat ccacgaccac tacacccgag gatcaactga tttaaggagg acaggatttc 4320 gtactgaact aggaagaaga agattcaaaa tccgaggacc aaccctatac aacgaactcc 4380 cggaaaacat gaaagaagca ctcacagtca atgaattcaa aagaaaatta acaatttata 4440 cacgtgaaaa tttaaacaac tttattgtac gttagatcta agcgatgaat tgaattaaaa 4500 taataaagtg ttaccagcta cacaccacaa cacacaacac aacacgacac aagaaaacac 4560 aacacccata acataacata aaaacccttt aaaagaacta atgttcacta ggggaattat 4620 ggaatgcatt atgaattgaa atgaaataaa gattattatt attattatta ttaaa 4675 // ID Academ-1_Dpulex repbase; DNA; INV; 4985 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-1_Dpulex. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC incomplete on the right. XX SQ Sequence 4985 BP; 1643 A; 970 C; 919 G; 1453 T; 0 other; tagtctggga gctgggcgcc aaaaaaatgg caattgttta gaagaaggtt tgatgctaaa 60 atatggtagg gggggtatga cacacacgaa aagccacgtc tgtcgactgg cggccgggcg 120 gttcaagcgt gacgcgatcc cgaagtcggc gggaaaggta aaaaaataca ctttgtttag 180 caagcccaaa ttttcactag aaagcaagct aacattatgt tgatcaataa tcccgcaaga 240 cagacgtcga attgcgggac tataatatca acagaccatc ttaaaaatta aaaaaaaggt 300 aaatttttac actttgttta gaaaatacaa tttttacatt aaaaaataga gcaacatttc 360 atttataata atcccggtag acagacgtcg atatctgagc ccttgggggc aacagacgtc 420 ggtgtgtaac ggtttttcac gaactgtcac tcgttatttg ataccggcaa cgctgtatcc 480 agggcgctta attcaaaaaa aaatcttgtt ttcgtgggat aaataaacaa aaaattgcat 540 ttaattaaaa aaatcacttg tttgtatttc ttttcatgta tgtaattata ttatcaatat 600 taagcttctt agtcatccaa cgggaagtcg gcaacgtctt ttgtttacaa ttgtatgttt 660 tgcgcctaat agtataatga cgtgatgcaa ttagtatcac gatattttta acgtattcct 720 ttaaaatgaa agagtttcca tagtcgagta caatggagaa agaaacagtc aaagacaaaa 780 gatgtgtttt gcacactgat aaccttccat catcagcaaa tattattttg tttcaagact 840 ttactctaaa aaaatgtcaa gatgctgaac ttgttcacaa atctcgtttg attaagggta 900 cttcaatcta taaagacata gttttgcctt cagttcctaa tagtcattca ggttatcatt 960 ccaaatgtta ttcgaaatat acatctgtgt caagtgcttc aaagcagaaa gttgttcagc 1020 aaaaagaaag tgacgacact ccttcaccag aagcttcagc aggtatacat atattttgtt 1080 tttcattgtt aatttgtgct tctacttata agttgtttac tttgttttgt agaaccactt 1140 caacaggtta atcataacat agggagacgg tcactcaggt caaaaccccc agacactgaa 1200 accgataaaa gttctcatat gttcccaaca gtttgtttat tctgtgaaaa gaaagaaaag 1260 aagatttcag gcaaaagaga ggtccttcat caaatagtta caatagaatt tcaagaaatg 1320 gttttgaagg aggctcagat aaaacaagat gatcgactaa ttagaaagat actgggagta 1380 gatttagtag ccaaagaagg acagtatcac aatgcatgta ggaaagcata ttcctaccaa 1440 gcaactgtta tcagccgccg cctatctgaa caatctgaaa atgctacaag taacagtcag 1500 agaaaatccg ctattagaaa aaatgctttt gattcaacaa catcttatat tgaagcccac 1560 attttcctaa gcgaagaggt atgttattca atgtccaaat aagtatattg gttttcttgt 1620 taaactaaac ataataattt gtttgatcac ttaaggttca tcgatttcat accattgcag 1680 aacattatac tatgttgctc gctgaattcg gatttgatgc tgacgactta gctgacgtaa 1740 ggaaggatcg tttccttgac aaacttcaaa atcattttgg agaaaggctt aaagtaattc 1800 gtcacccaac aaaaggtgtt gggaagatat tgtttaaatc ttcattaccc acggataaag 1860 ctatagtgac tacctttgat ttgaaagtca acttgagtgt taaggtataa catactgagc 1920 gatataaaat ctgcagtgaa tatttatctt atagcgtaat gtttcatctc aataggtaaa 1980 agaagttgcc tttcttttaa ggaatgaaat acagagatca caagtcaaca aattgccaga 2040 agctatttca acaaaggatg ttatcgcagg tgaaattact gtaccgactc tagtccacac 2100 attttttgaa aatcttgtca cgacggaagg acatcgaaat aagaagcggc ccatttccga 2160 taacaaaaag agaaaaattg aatcattggc acatgatgct atttatgctg ttacaaatgg 2220 aagaattatg cccgcaaagc acctccttct tgcgttggcg atgaaagcac tgacagggag 2280 cagaaaagtt attgaacttc tcaacagata tggccattgc gttagctaca gtgtagcgga 2340 agaaattgaa actgagctca tttattctgc tatggaaaaa tctcgactac ttccggacgg 2400 attacaccaa cttccattcc ttcacacagg tgtaggcttc gacaacttcg atttatttgt 2460 tgacacactt tccggaaaag aaactctaca tgacacggtt ggaatcgtct accaagatgt 2520 tccagcaatt ccaattcagg aaaacttagc ttcttcccat gcaacttcaa caccttcagt 2580 tcaacgtcag ccagccaaaa aaagaagacg cgcactagaa gtagaggagt ttcccatagt 2640 gccgtaccac aaaagtttaa aacttaagac gcagtcttta actgacctgg aggacgaaac 2700 aagagccgtg attgcaccgc tttattctac agcgcaaacg ttcgacttca tttggatgct 2760 acaaatttgt atgaaagttg agaacactcc aatgtggact ggatggaatt caaagctggt 2820 tgaaaaccga cacccgatgc aggtcattca atatctacct caaatagatg catcaccaac 2880 aatcgacgca gtagttgttc tgactatgga aatgtcatta cggatagctg ccgaatgtga 2940 acaaagatac atatcactta gttctgattt ggccatttgt aagaaatact ttgccattca 3000 agcgcttgaa agccctaaat tcgacaaaat ttttgttcag ctaggtaact tttatacatg 3060 acaaatattt taacctcata tgtgtcctaa tactattatt gatttttttt caaatgcagg 3120 ttggtttcat atagagctct cttactacaa agctattggt aaattcatca gtgaatctgg 3180 gatgccatat attctggtcg aaagtgggct tcttgcttct ggatctctat ccggtttctt 3240 ggaaggcaaa aactacaacc gatgcaggcg aattcatgaa ctgaccgcaa tcgccttgca 3300 aactttacat tttgaagctt tcctcgatag tttgaagggt gaaattacca aagaagatat 3360 agctgtactt cttcgatgta ttgattttga aaatgcgaac gaagcatctt ttatttcctc 3420 tatccctgaa gaaatcaatg agcttttcct aaaatacgaa atgtattccc aattgacatt 3480 ttcgggagaa catggacctt tagccaaata ttggatgctc tacataagca agattgacgt 3540 gcatcgcaaa atgtcccgag ctattcgaat gaaagatcat agactctaca cacaaatgtt 3600 tccacaagta ctttcgatgt tctttgtgtt caaccaccag aactacgctc gatggggaac 3660 aaagtatgca tcaattctta tatattttca ttcaattaaa atttactatt tacatttgtt 3720 attataggta ccatgacaac ttgatgaata tggaaactac acatcctggt attacacagg 3780 actttgatga tggtatcata tcaatacgac ggaccagtaa acctttctct gcctctgcca 3840 ttgatcttac tctagaacag acgcagaata aagatgctgc gaattcttct actggtaagc 3900 tacttattta agtacatatt gcaacttttt attcaatctg ccattatttt ttactaggtg 3960 ttgtatcggt gactaacaat ctaagtgcgc gacagaagtt cgccaaaggg cactttatca 4020 aaacaaccct tttatctcac gtcttccaac attgtaattt gagcagaaaa gaagacgtgt 4080 caaaggaaac ttatcccagc cggatcagaa aagacacaaa gcagctgaat gatcttctgt 4140 cgtatatcaa gaaatgtagg aatcccttca ttggaaatga cgaagaactc atcaacatag 4200 gaactggcaa ggctgcctcg aaagaagttt cttcatttct tttgaacatc acgacaattg 4260 gcaacaacga atatgaaaaa ttcgttctaa gtgtcattaa agatccttta gcattcgaaa 4320 aacctatcaa gaagcagctc atcaataatt tcgcttccat gggagaaaaa gtaaaacgag 4380 ttcgaaacaa ccaagttgaa gttttgaaaa tggagcggaa ctacttaagc actctccttg 4440 ttactgccct agagaaaaaa atcgacatgg agctcgttct tcaatatccg ctcagtccga 4500 taccccttgt ttttgctcac ctgaatggtt ctatgaacaa aactgttaaa tccgtcctat 4560 acgacatatt ggacggacgt gtgaagagca tcccaccaag aacaatagat gttttaatta 4620 tcgacgggct tttctttctt caccttcacg gatctcattt accaaacact tttggtaaaa 4680 ttacgcaaca tcttctagcg cagttgtgtc gtactacagc aaaatgcatc cacgtagtat 4740 ttgaccgctg tgtttctcca tccatcaaag accatgagag agaccggagg agtactgggg 4800 gacgtgacgc gttttacgaa atatccggtc ctcaccaaac caaaccaacc aacttcctca 4860 aggaactgag atgcgatagc ttcaaaaaag cattcatttc attttttctc aaagcattag 4920 aagacgactc ccttgcatca gtattaggag agaagaagtt gtacgtcact gaagaatcaa 4980 aatgt 4985 // ID BEL-166_AA-LTR repbase; DNA; INV; 698 BP. XX AC supercont1.390; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-166_AA_; KW BEL-166_AA-I; BEL-166_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-698 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.390; Positions 471558 470861. XX SQ Sequence 698 BP; 223 A; 127 C; 146 G; 202 T; 0 other; tgttgcatac cgttcttaac agacaaagcg taacgtgaat cccgagcgct cccttacatg 60 aactgtcagc agaacgacgg gaacaatagg aaatcgcgga gcgagagatg cgtgtgtgta 120 ctggtcgact ctgttcacca tgatacatcg gcgtatgctg tagaatagat gagaagccag 180 ttaatcgcat aaatttagtg atcggtagca tcggatagaa ttgtgaaagc caaatttagc 240 tgtttattaa ccggtatcac ggcagcctgg actctgcacg gtgatataca gtttgaaccg 300 attgtgacac caatttgtgg gcaacccaga aagacaattg agcgctatag tgtagaacca 360 atatttgcta tagagttgta agtgataaca taatttgtta tgccttaata tgaattcatt 420 gctgatagca aaagctgtta tttcttatga tcgatccaat tattcatttt gttctcttca 480 gccatgatta ggatcaatga aaatcgtgtt gaaattgagc tgaattagtt tgccttgcgg 540 tttcttggag agcagaagaa acacttaatg taagcgttaa attcaattgt aatctttaaa 600 aaactaatga aatctaataa atttcagctt aaaatctggg aaacatccaa cctgctaaaa 660 agacagtttt cttccttgga aacccgcaaa ctcgaaca 698 // ID UrukSat_Cis repbase; DNA; INV; 108 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; UrukSat_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-108 RA Smit A.F.; RT "UrukSat_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000268. XX SQ Sequence 108 BP; 51 A; 9 C; 15 G; 33 T; 0 other; ctggagaaaa agttaacata ttataatagt ttaaacctgg agaaaaagtt aacatattat 60 aatagtttaa acctggagaa aaagttaaca tattataata gtttaaac 108 // ID MBOI repbase; DNA; INV; 62 BP. XX AC M34369; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE B.malayi MboI repeat consensus sequence DNA. XX KW MBOI; MboI repetitive element. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-62 RA Natarajan S., Werner C., Cameron M. and Rajan V.T.; RT "Isolation and characterization of a repetitive DNA element from RT the genome of the human filarial parasite, Brugia malayi."; RL Mol. Biochem. Parasitol 43(1), 39-49 (1990). XX DR GenBank; M34369; Positions 1 62. XX SQ Sequence 62 BP; 24 A; 12 C; 4 G; 22 T; 0 other; ccatttctct acagatataa caatatcact agaagacatt ttgattaatt cattaactca 60 ta 62 // ID Gypsy-16_IS-I repbase; DNA; INV; 4065 BP. XX AC ABJB010391617; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_IS_; KW Gypsy-16_IS-LTR; Gypsy-16_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4065 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010391617; Positions 9755 13819. XX CC Positions [1820-2281] - Reverse transcriptase CC Positions [3245-3715] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1694..3136,3140..4063) FT /product="Gypsy-16_IS-I_1p" FT /translation="MSHEKKLWLKRELQGMLDAGIIRPSTSTFASPITIVP FT KEDGTFRLCTDYRQINSQTDLFPFPMPRVDDIISETGGCTWFSRIDLCKGY FT WQVPLQEDTKRFTAFVTPFDVYEYNRLPFGWKNSGAWFQKMINEILKNFLG FT NFCNVYVDDIIIYSRTKEEHMDHMSQVLEALSTARLKINTKKSEFFQRTVV FT FLGRVFDGKTKSTKEESVQRISKLTKPYDLHSLRVLLGLARHFRAFIKDYA FT TKTKCLTDLTQKDTPFVWTEECESVYDYLVKVISSDPVLVLPDFQLPFELC FT TDASGYGTGAVLYQRHATEPPNRQLKVVGYYSYTFSKSQENYATTEKEALA FT VVMALRYFRSYLEGRQFKLFTDNQALTYLLSLTQPKGRMARWVTEIQQFSF FT EVAHRPGEKLPDADALSRLLIPRDTLNGSINHARIWEGTQHLELQDGKFHV FT PEFMIPRVLQLYHDSPESGGHDGFWRTYWKLTRKFTKNMKHDIANYVRTCH FT VCQIHKAKYRPRGDEMVTPQYSDVPFETIHLNFAEIKKKGEGVKKTQAFLI FT AIDECTRIVAARAGREDTNSVISLIEREVFENTKVIVSDNGPAFRSKKFQR FT WAQEKGITLRFPAPYHPEGNALAERAIRDIKKFIKMYPELTGGWKCALEAA FT IAHHNRSHIAGLGCSPHFAARGESPWLPADHLLGLTDKITLQEQKRPEKYR FT SSMKRNYDTRHLSAVPETQPGDMILVRKGLQGSKAAFSGPYMVTKTCSQQG FT ILKTLYYMGPNNTTEMAFIKNIIPYHPRRVEDKSPSVC" XX SQ Sequence 4065 BP; 1233 A; 988 C; 958 G; 886 T; 0 other; gtcgacagga ttacttgaaa gatgtggatt tacttaattc tgacggttgg cgaagaagga 60 cagccaatac tggagagcta tcagctggac ggtgaggaaa cagttccaat ggcccgggaa 120 gctaacaaca tggaacttat ctccttcatc gccgggaaag cgaaattgtc gccgaaagag 180 cttcttctaa aacggacggt gaaactaaac atcagagtgc aggaattcca ggagtggttt 240 cagaagccga ccactacttc tggtcaagtg atttcggctg gtggagacat gacatcggtt 300 gttgctcagc tgctacagat gcaagtatct caaatggaac agcaaaggca gttcatgacc 360 accatgttag aaactgagag tgcaagaacg taggcgtgaa gagaccgtaa agccagatat 420 gttcgacggt gaatcttcac ctgcttctac gtggctggac ttctacgaat atgcctgtga 480 gaaaaaccac tggacagaaa acagcgagcg gatcaagaat ttaagactct ttctcgcagg 540 gaatgcgaaa aagtggtacg aacttcgtgt ggctgagcac accagcgatt cgtgggatat 600 gtggaaggag agcttcacga catcttttga caaaaatgcg gttgaacggt gggatgatgc 660 aatattttac aaatatagag gaggaagcct actagattac ttttatgaga accggagact 720 tctacagctc gccgattcta atttacccga aacatcgatc gtcccactcg tcatccacgg 780 gatgagcaga gatgcacaaa gacaagtgca agcgcactct ccaaagacca tcgaggatct 840 cctgtattca ctgaagggtg tgttcgtgga ccactccatt ccctacggac aacaatgacc 900 actaacgaac cccacaagcg tgatcgtcga cacccacagc ggccaaaagc ttccagtaat 960 ccaggcaagt aacaagcctt ggatgaacgc aagacccaat tcaggcagcg cgaaacacgt 1020 gcgaatacac gcgcttgcca cacatcgtgg aagggcctca atgcgcgtgc tacgatacat 1080 gccgctgcat cgtgaacagc ggtgctaacg accagccgaa aaactaataa aagagggagc 1140 acagtccctc ccgatgacga agaaagcgac tgtctacttg gttaaataca acatccttta 1200 tgcctctgtt atcgtcagtg ggagacacgt ggatgcctta gttgacactg gagcatccat 1260 cagtcttgtc aagaagaata tcgtggaccc agaaaacata cgaagtgggc aactcatcga 1320 ggttcacagc tatgatgaca atgttaagct gatgcaaaac tggacgacgt tggaagtcga 1380 gtttaaggga aagaaaatcc cggtcgaggc actcgtcgca gaagacgtcg agttcgtgtt 1440 catattatca cgacccgata tgaagcgctt ccaaatgaac atatcctgga gggacgaagt 1500 aacgttggac cacgatgcgg atagcccacc tgctactcca gttcaaatgc tcagaactgt 1560 gtagaggtct gaggacgtac caacatcatt cccagagttg atatgcgtag aatcatatcc 1620 accagcaacc tcggttattg aggttccatt caaactacgc gacacttctg tagtgcgaaa 1680 aaaaccctac agcatgtccc acgaaaagaa attatggctt aagcgggagc tgcaggggat 1740 gctagatgcc gggatcatca gaccctcgac gtccacgttc gcatcgccaa tcaccatagt 1800 accgaaggaa gatggcacgt ttcgtctctg cactgattat cgccaaatca attcccagac 1860 cgatcttttc cctttcccta tgccacgcgt cgatgacatc atcagcgaga caggagggtg 1920 cacctggttc tccaggatcg atctatgcaa agggtattgg caggtccctc tacaagaaga 1980 caccaagcgc tttactgcct ttgtaacacc attcgacgtg tacgaatata accgactacc 2040 atttgggtgg aagaactctg gagcctggtt tcagaagatg ataaatgaaa tactcaagaa 2100 ctttttgggt aatttctgca atgtttatgt cgacgacatt atcatctact cccgaacaaa 2160 agaagaacac atggaccata tgtctcaggt gctcgaagca ctgagtactg ccaggttgaa 2220 aatcaacacc aagaaaagcg aattcttcca aagaacagta gttttcctgg gacgagtttt 2280 tgacggaaaa acgaaaagca cgaaagaaga atcagtgcaa agaatatcaa agctgaccaa 2340 gccctatgac ctacactcat tacgtgtttt actaggcctg gccagacatt ttcgggcatt 2400 cataaaagat tatgctacga aaacaaagtg cttgaccgat ttgactcaga aagacacccc 2460 gtttgtctgg actgaagagt gcgagagtgt gtacgactac cttgtcaagg tcatctcttc 2520 tgacccggta ttggtactgc ctgacttcca actgccattc gaactctgca ctgacgcgtc 2580 aggctacggc acaggagctg tcctctatca gagacacgca acagagcctc ctaaccggca 2640 gttgaaagtc gttggctact actcctacac attctcgaaa agccaagaaa actacgccac 2700 gacagaaaaa gaagccctag ctgtggtgat ggctttgagg tattttcgaa gttacctcga 2760 aggtcgtcag tttaagcttt tcacagacaa ccaagctctt acttacctct tgagcttaac 2820 ccaaccgaag ggacgaatgg ccagatgggt gactgaaata caacaatttt catttgaagt 2880 ggctcaccga cccggggaga agcttcccga cgctgacgcc ctctctagac ttcttattcc 2940 aagagataca ctcaacggga gcataaacca tgcccgaatt tgggaaggta cccagcactt 3000 ggaattacaa gatgggaaat tccatgtgcc agagttcatg attcccaggg tgttacaact 3060 ctatcacgac agtcctgaat caggaggaca tgacggattt tggaggactt actggaagct 3120 gacaaggaaa ttcacatgaa agaacatgaa acatgatatc gccaactatg tgcgtacgtg 3180 ccacgtgtgc caaattcata aggccaaata ccgacctcgg ggagacgaaa tggtgacccc 3240 tcaatactct gatgttccct tcgagactat tcatctcaat tttgcagaaa tcaaaaagaa 3300 gggagaagga gtgaagaaaa cgcaagcttt tctaattgcc atcgatgaat gcacgcgcat 3360 cgtagctgca agagctggca gagaagacac taactctgtc atttcattga tagaacgaga 3420 ggtcttcgag aacacgaaag tcattgtttc agacaatgga ccggcgttcc gcagcaagaa 3480 gttccaaaga tgggcacagg aaaaaggaat cacacttcga tttccagctc cttatcaccc 3540 ggaaggcaac gcgttagctg aacgggcgat acgcgacatc aaaaaattca taaaaatgta 3600 ccccgagctc acaggtggat ggaaatgcgc acttgaggca gccattgcac accacaatcg 3660 ttcccacatc gctggattgg gatgtagtcc gcatttcgca gcgcgcggag agagcccatg 3720 gctacctgcc gatcatctcc tgggcctcac cgacaaaatc acgctacaag aacagaaacg 3780 acccgaaaaa tatcgatcgt caatgaaaag aaattacgac acacggcacc taagtgcagt 3840 cccagagact caacccggcg acatgatttt agtgcgaaaa ggcctacaag gatccaaggc 3900 tgccttttct ggtccttaca tggttaccaa gacgtgcagt caacaaggca tcctcaagac 3960 gctgtactac atgggaccca acaacaccac cgaaatggca tttattaaga atataattcc 4020 atatcacccc aggcgggttg aggacaaaag ccccagcgtg tgtag 4065 // ID Gypsy-12_RP-I repbase; DNA; INV; 3902 BP. XX AC ACPB02012584; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_RP_; KW Gypsy-12_RP-LTR; Gypsy-12_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-3902 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02012584; Positions 2252 6153. XX CC Positions [2419-2895] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1534..3693 FT /product="Gypsy-12_RP-I_1p" FT /translation="MARALTQLLRNGVRFKWGKMQEEAFQQLKDALCSDAV FT LAYPDFKTPFFLATDASGVALGAVLSQNQEGRERPTAYASRHLSPAEQRYS FT ATERELLAVVWATSLFRCYLFGRRFSLITDHAALKWMLHLKDPSALLTRWS FT LKLAEFDYDVVHRPGQKHRNADALSRAVCTVEEQARSSANVDLGLAQERDV FT WCREVRESHKENVTRGSDGIWYMRRGENKTWKALVPETWRTHVIRLHHDTP FT WSGHPGIERTLERLERIFYWPGMARDVENYVKECHLCNQRKTPAGLTVPLG FT EPYITSVPFEQISLDLMEFPKSSEGNKYLLTFIDNFSRYAEAIPLKGQTAK FT ETAYAFIEGIILRHGVPKRVLTDQGRNFVSEFFRTVCKALGIRKIQTTVYR FT PQSNGLKERLHRTLTDSISHFARRDGRDWDHWIRYALLAYNTTRHSSTGYS FT PYFLIHGRDVKLPFDQELTHIPLQASSDQDFIRTLQERLATAREIARKRDS FT DTREARTEMFNKGKKMREFQVGDRVYLLEPAIKEGHAKKLYRPWTGPHQVS FT KKLSPWNYELTLASGGNYITHINRLKPTPSHHNIAAKRRRKNRGKNKQGKK FT VPEENDDRETGGWIGSRSNSEEGLTDFEIITASGERQGEGSSGWPEATTYI FT EDDNGEETEGASSGISVDPDPSWLPERGTGTGENRSPYQLRNRGQSPPGEI FT PTPVRVNARPRMRTVPEHQP" XX SQ Sequence 3902 BP; 1176 A; 815 C; 1022 G; 889 T; 0 other; actggtgtca ggtgtggggt aaatggcgtg gccataccta ttcagggaat acaaaccgtg 60 gaagtgttgc tgaatggaca aaagttctat cagcaatttg ccattgctga agttactacc 120 cgtggagatg gcattatggg ctgggatatg atgaaacaag tagggatagt agtagatgcg 180 tcccggggaa taatttcatt ccagaatcaa gaagtggggg atcaccctaa ccgtgaaccc 240 aggaataaag aacctgttgt cgaggggtta gcaagagttt catccgtacg gaaggtcgct 300 gttccacctc ctagtgagca gctaatagaa gcaaaatggc caaaaaattt aaacggcgaa 360 atcctacttg agccgagtcc tttagctaac ccacaaatac gggtagccag aagtgtacat 420 cgggcagacg gaaggagaac atgggtaaag gtaataaact ccgatgacta tagagaaagg 480 tcagaggttg ggagttattg aggaaattgc ttccgaaaaa ggggggaaag atgggcacag 540 tttgataggt tctccaaatg aggatgaaca cctcgtatat tcagtgctag cagagaaaga 600 gaccgatggt ggggtagcac ctgaacttga agttaaactg ggacacttgg acgagaggaa 660 taagcaaatg atattggcag tcttgaggag tcatgccaaa ctctttgacc ctcctggtca 720 cgaaggttgc agtcttccgg tgtttcataa aattgaaacc ttagaggaag ggccggtcac 780 caaacggccg tttagagttc cttatcacca aagacctgtc attcatcaac atttgcagga 840 gatgctggat aagggggtaa ttgcgccttc cacgagtccg tggtcagccc ccatagtttt 900 agttcccaag aagtccagcg aagggaagac ggaatatcgt ttctgcacgg attttagagg 960 attaaacaag atcaccaaag cagatgtgta tcccctacca ttaataaatg aaaccctaga 1020 aactttaggt aatagccaat ttttctccac tcttgatctg gcgacaggct atcaccagat 1080 accgatacat cctagtgatt gtgagaagac ggcattcact acaattggcg gtcattttga 1140 ataccgcaca atggcattcg gattagccaa tgctccagcc acctttcagc gtactatgga 1200 ccaattattg gctgaattaa aaggggatga atgtctagta tatcttgatg atattattgt 1260 ctttggagca acgatagaac aacactgtag gcggctggat tgagtccttg cccgactagg 1320 tcggctaacc ttaaggtaaa gctagagaaa tgttggttcg cccataccga agtacactat 1380 ctaggccaca ttgtctccag ggaaggtatt aagccagatc ctgggaaaat ctcagcagta 1440 aagtcctttc cagaaccaag gaaggcacgg gaggttagag gatttttggg actagccggg 1500 tattatagac gatttattcg caattttgca gacatggccc gtgcgctcac tcagctgttg 1560 cgaaatggcg tacgatttaa atggggaaag atgcaagagg aagctttcca gcaacttaag 1620 gatgcacttt gttcagacgc ggttttagca tatccggatt ttaagacacc attctttctc 1680 gccaccgatg cttcgggagt agcgcttggg gctgtccttt cgcagaatca agagggacga 1740 gagagaccca ctgcgtacgc gagccgccac ttgagcccgg cggagcagcg atactcggca 1800 accgaacggg agctgttggc cgttgtttgg gccacttctc tttttagatg ctacttgttc 1860 ggaaggaggt tctccctgat taccgatcac gcagcactaa aatggatgct gcatcttaaa 1920 gatccgtccg ctctgttaac gcgttggtca ctgaagttgg ccgaatttga ttatgacgta 1980 gttcatcgtc cggggcagaa acataggaat gctgatgcgt tgagccgcgc agtatgtact 2040 gtggaagagc aagccagatc gagtgcgaat gtcgatcttg gattagctca agaaagggat 2100 gtatggtgca gagaagtgcg cgaatctcat aaagaaaatg tgacaagagg gtcggacggt 2160 atttggtaca tgcgccgagg agaaaataaa acctggaagg cattagtgcc cgaaacctgg 2220 cgaacacatg tcatacgcct acaccatgat actccttggt cagggcatcc gggaatagag 2280 cgtaccctag agagattaga gcgaatattc tattggcccg ggatggcaag ggatgtcgag 2340 aactatgtta aagaatgcca tttatgcaat cagcggaaga cgcctgctgg acttacagtt 2400 cccttaggtg agccatacat tacatcggtt ccatttgagc aaatctctct agatctcatg 2460 gagttcccga agagttcgga aggaaataaa tatttattaa catttataga taatttttcc 2520 cgttatgcgg aagcgattcc tctaaagggt cagactgcca aggagacagc ttatgcgttt 2580 atagaaggca taattttaag gcatggagta cctaagcgtg ttttgacaga tcaagggcgg 2640 aattttgttt cagaattttt ccgaaccgtg tgcaaagcgc tgggtattcg aaaaatacaa 2700 acaactgttt accgacctca gagtaacgga ctaaaagaga gactgcatag aacgttgacg 2760 gattctatat cccacttcgc gagaagagat ggaagagact gggatcattg gattcggtat 2820 gccctgttag catacaacac tacgcggcat tcttccactg gctatagccc ctactttctt 2880 attcacgggc gggatgtaaa actcccgttc gaccaggaac tcacccacat accactgcaa 2940 gcatccagcg atcaggactt cattcggaca ttacaagaac ggctagcaac ggcgcgagag 3000 atagcaagga aaagggattc cgatacgcga gaggcgcgaa cggaaatgtt taataagggc 3060 aagaaaatgc gtgaatttca ggtgggcgat agagtgtact tgttagagcc agctataaaa 3120 gagggccacg ctaaaaagtt ataccgcccc tggacggggc ctcatcaagt tagcaaaaag 3180 ctatcccctt ggaattatga gttaacttta gcctcgggag gtaattatat tacacacatc 3240 aataggttaa aaccaacacc ctcacatcac aatatagcgg ccaagagacg cagaaagaat 3300 aggggaaaga acaaacaagg taagaaagta ccagaggaaa atgatgatag ggaaaccgga 3360 gggtggatag gaagtagaag taattccgaa gagggcctaa cggattttga aattataacg 3420 gcttccgggg aacggcaggg agaagggtca tcgggatggc cggaagcaac cacctatatc 3480 gaagatgata atggggagga aaccgaaggg gctagtagtg ggatatcggt ggatcctgac 3540 ccctcttggc taccggaacg aggcaccgga accggagaaa accgatcacc ttaccaacta 3600 agaaatcgcg ggcagtcgcc gcctggggag attcctactc cggtcagggt caacgcacgg 3660 cctcggatga gaaccgttcc tgaacaccaa ccctaaaaaa atccattaac cttacaattc 3720 cgatatgtat ccccgtaaaa ataaatacag gttacaagac tcacactcga cgagaggaag 3780 aggaatgccg gagagaagca ccgacaagca gacgccatca tgaagaggac aagtcgaact 3840 gggcaaagaa ttaataaaag tgaagtcctt aagaggtgct tcacttttgg aaaaagaggg 3900 at 3902 // ID TTAA26_AP repbase; DNA; INV; 581 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA26_AP. XX NM TTAA26_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-581 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2095-2095 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 581 BP; 166 A; 102 C; 138 G; 169 T; 6 other; ggggatcggt ggtggnantt ntttaaggag ttcganatag gcaattttta aattgcgtgc 60 gtgcgggtct tgtaaatgtg tgcgtagtaa gtgatcatta gtgtgtgtga tacgtgtccc 120 gcactacaaa agcggagcct acgcacgcgg cgcgngcaca agtcaccgcc gcggcgtcgc 180 gcgcgtgaac gatccagaac gtgtgtctta tttgactaaa agtatgtact ttttgtccat 240 cctaccgatc gacctagtaa tgcgataata cttttgaaaa ctgatttgaa ttcgtcataa 300 tgtctacttg agatcggtca gtccacattt taagatttct gatttaaatt ccgaataatc 360 aacgtttgaa aatttcgaaa tttcaaaata ttcaaactaa atttatagtg gttggggggg 420 ggggggtgaa attgggaaaa gtggactgac cgatctcaag tagacataat aacgaattca 480 aatcagtttt aaaaantatt atcgcattac taggtcgatc agtatgatgg acaaaaagat 540 tggtatgttt tggctaaaat aactgtccgc caccgatccc c 581 // ID Gypsy-31_CQ-LTR repbase; DNA; INV; 134 BP. XX AC AAWU01031725; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_CQ_; KW Gypsy-31_CQ-I; Gypsy-31_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-134 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 442-442 (2011). XX DR Genome; AAWU01031725; Positions 18507 18640. XX SQ Sequence 134 BP; 39 A; 20 C; 37 G; 38 T; 0 other; tgtagtgttt tgtctgtaca aatatctggc aacactgtaa tcggtggagg gagagaagaa 60 ctgtcagagg agggagccat tataataaag atcttgtgag agtacgcacg tgtctcgggt 120 ataattcttc taca 134 // ID MAR32_SM repbase; DNA; INV; 1209 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.01, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 2) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MAR32_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1209 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 8(12), 2246-2246 (2008). XX DR [1] (Consensus) XX CC ~20% divergent from consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 438..983 FT /product="MAR32_SM_1p" FT /translation="TKKMDKWVPHELNEKQKMKRLEICSSLILRNKMMPFL FT NRIITCDEKWVFYDNRKRSGQWLDADERPKHMPKPPLHPKKTMITVWWGIK FT GIIHYSFLQPGTTITTESYCREIDIMHDKLCNKNPALINRHGPILLHDNAK FT PHVSKATVKKLTGLGYEILPHPPYSPDISPTDYHLFRHLEFF*" XX SQ Sequence 1209 BP; 432 A; 182 C; 214 G; 381 T; 0 other; ggttgtttca tatgaaatgt cgttttttaa aattgtttaa aatctaatta tctaagtatt 60 atataatttt tcaacaaggt gtttagtttg aataatattt catattatac ggctgtttaa 120 tattgcaagg taatattaaa attaattaat taatctttac tttatataga tatttattta 180 tgatggattc taagaattta agactacttt ttctatatga atttaaatta ggacatacgg 240 gaacagatgc tactaacaat ataaatttgg cttttggaga aggtagcaca aatgttagaa 300 ccttacagag gtggtttgca aggtttcgtt caggaaatat ggatctggat aatttgccta 360 ggtgaaaaac gacattttgc aattgaacta ggaagtttac ctacgacaac ttggaggcac 420 ttggaatcaa taggtaaacg aaaaaaatgg ataaatgggt tccccatgaa ttgaacgaaa 480 agcaaaaaat gaagcgtttg gaaatttgct cttccttaat tcttaggaat aagatgatgc 540 cttttttaaa tcgaatcatt acttgtgatg aaaaatgggt gttttacgat aacagaaaaa 600 ggtctggaca gtggttggat gcggatgaac gtccaaagca catgccgaag ccaccattac 660 acccaaaaaa gacaatgatt actgtctggt ggggaataaa gggtattatt cattattctt 720 tcctacaacc aggaacgaca ataacaacag aatcgtactg tcgagaaatt gatataatgc 780 atgataagtt gtgcaacaaa aacccagcac tcataaacag acatggtcca atattattgc 840 atgacaacgc caaaccccat gtttcgaaag caactgtcaa aaaattaact ggtctcgggt 900 acgaaatatt acctcaccca ccttactccc ctgatatttc cccaacagat taccaccttt 960 tccgtcactt agaatttttt tgaggaataa gaatttccaa aagcaagaca atgtcattga 1020 agcagttgaa gaattctttg gctcaaaaga agaaagtttt tataaacatg gcatagattt 1080 actagtttct cgttggaaca aagtgtatga agctgatgga agctattttg attaaataaa 1140 ttatgttttg ttatgaattg agcattttga tttcaaaata aaataaacga catttcatat 1200 gaaacaacc 1209 // ID Crack-15_AAe repbase; DNA; INV; 5789 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-15_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5789 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1231-1231 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >95% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 380..1474 FT /product="Crack-15_AAe_1p" FT /translation="MMSESENDVFCYVCEKAEKDSQRLIECSYCGKCAHFR FT CKKLYGSAVASAKSKPFFCSVECCEMCLRGSKQTATNDDIIRELQLLGKAV FT NEVKQDSDKFRAVLEKSQQQISEIVATSKQIERSQDFLAEQFDHLQADFKS FT FKEEVGEVKAENSKIRMELKAWQETCGELAGTVDRLEVDLDRVNRAAKVKN FT AVLLGLPMIENENTVKLASKVCELLKCGFNSATAIVSARRIIGKQQTNGVS FT PILVSFNSEQEKEELFQRKRAHGTVLTSNICAAYNGSTRTVTIRDELTVYG FT RELLQEAKSLQETLKIKYIWPGRDGKILLKRQDGSKIEKITSKLQLASLQP FT TNLKRVLNVSGTSPPLIPQPKR" FT CDS 1641..4421 FT /product="Crack-15_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNNINKFDWLKETLELYSGVIDVIVIGETWVKSDRQQ FT MYNIDGYQSIFSCRPDSQGGGLAVFVRSVISFDEIANEHTNGFHHVHLRLD FT IGESPFHIHAIYRPPSFDLSSFFSNLDQICAAAGRCGSSIIIGDINIPTNR FT AGCRIVNEYINLLDCYNFSITNTYPTRPSSGNVLDHTICSESLQNVVTNET FT IXTDMSDHCLVLSTFSLQKPVKKLHLEKTIVSNHKLNMAFQNAMTNIPNGS FT AEVRLQFVLNTYNTLKIQFSKTVSVEAKIKGFCPWMSFDVWKLIKIKENIL FT RNWRRNPANVRCKELLDHISKKVQQAKDHAKKSYYRNLFTTANQRNTWKNI FT NKILGRDRRATENVKLTVNGHQTTDSPTVANAFNDFFCSIGPQLAATIRSS FT QDINKFNTLIPQRQSIYLEPTTEQEVILLIKELKNGKSSGVDGISAEFMKH FT HHNVFAVLLRDIFNECITLGTFPDCLKIARVIPVHKGGSKLDVNNYRPISI FT LSVLSKVLEKLLVTRVDNFLRQHDVLYCRQYGFREGSSTWTAASELVDEIY FT NXLDNRKIEGVLFIDLKKAFDTIDHRVLLRKLNYCGIRGLANKLFESYLTG FT RKQYVSVNGAFSSERDVTVGVPQGSNVGPLLFLLYINDLAKLKLHGNPRMF FT ADDTSLSYQNANPDLIIQHMKEDLTVLHHFFAENMLSLNLLKTKFMLFHSP FT RLKIPRHADLVVNTTVVDKVQSFKYLGLTFDSKLQWNEHISSLQRELSATC FT GVMWKVSKFMPQKELLAMYHAFVQSKLQYLVSVWGAATKTAIRALQSTQNR FT CLKILYQRPLLFSTLRLFKEAADAILPIAALREKEGLVKMHNILTNPRTLH FT NRQLQHTSTRYPLRSQAVLVIGRPKTEAGKKSFAYFGYNRYNALPPHMKAE FT RNMERFKKMVLCLIRSKIATYVD" XX SQ Sequence 5789 BP; 1731 A; 1296 C; 1215 G; 1543 T; 4 other; ggccctggat tttaattaca cagttgcttt catagtgctc cgctataaat agctaatata 60 acgaaaatac tgactaaacg gtcgaggatt gcgtggaagt ggaaagttaa gacaaggcag 120 ctggggtctc catcgctatg taaagcattt cacaactgaa tttgctaaaa taaggtattt 180 taagtatcgt tcgtaagctg atatatcatc tattcatcca cattgaactc cgacgcgtgt 240 tgtgatgtac tcgcatgctt actccgacaa tcaccaacac acttgaggct ttgtcgtttg 300 gttgatattg tttttcatat cgtcattcca ctgtttcaca acaaaatttc ttccatccat 360 cgcgtaatat atggtctgaa tgatgtctga aagtgaaaac gacgttttct gctatgtttg 420 tgaaaaagca gagaaggatt cgcagcgatt gatcgaatgt tcatactgtg gcaaatgtgc 480 tcattttcga tgcaagaaac tttacggaag tgcagttgca agcgcaaaaa gcaaaccgtt 540 tttctgctcg gttgagtgct gcgagatgtg tctacgagga agcaaacaga cggcaacaaa 600 cgacgatatc atccgtgagc tgcaactgct tgggaaggct gtgaatgagg tgaaacagga 660 ctcggataag tttcgtgctg tgctagagaa atcacagcaa cagatctccg aaatcgtcgc 720 tactagcaag caaatagaga ggtcccaaga ttttctggct gagcagtttg accatctaca 780 agcggatttc aaatcgttca aggaagaagt aggggaggtg aaggccgaaa actcaaaaat 840 ccggatggag ctgaaggcgt ggcaagagac gtgcggcgaa ctagctggaa ccgtggatcg 900 cctagaagtt gatctggata gagtcaacag agcagccaaa gttaaaaacg ctgtgctgct 960 aggcctacca atgattgaaa atgaaaacac cgttaaatta gcatcaaagg tgtgcgagct 1020 gctgaagtgt ggattcaact ctgctactgc gatcgtttct gctaggcgta tcattggtaa 1080 gcaacaaaca aacggtgttt ctccaatcct agtttcattc aattcagaac aagagaaaga 1140 agaactattc cagcgtaaac gtgcccatgg aactgtgctg acatccaaca tatgcgcggc 1200 ttacaacggt tctacacgga cggtcactat tcgagatgaa ttgacggttt atggaagaga 1260 gctactacag gaagctaaaa gcttgcagga aacactgaag atcaagtaca tctggccagg 1320 aagagacggg aagatactat tgaaaaggca ggacggatcg aaaatagaga agatcaccag 1380 caaactccaa ctggcaagtc tacagcccac aaacctcaaa agagtgctca acgtgtctgg 1440 cacatcacca cctttaatcc cgcagccaaa acgttaacat gtctaatttg taattttctt 1500 attctttgta tttgtttaaa ctgatcttaa taaatgtgat taattattat tttgattcta 1560 ttgctcaact aaataaaaac ttcaacagta gtcttgtagt taattttcaa ctgaaagtat 1620 tgcaacttaa tataagagga atgaataata taaacaagtt tgactggctt aaagaaacac 1680 tagaactgta ttcgggagta attgatgtta tagtaattgg cgaaacgtgg gtgaaatcgg 1740 atcgacagca gatgtacaac atcgacggat accagagtat tttctcgtgt aggcctgatt 1800 cccagggtgg aggtttagca gtgtttgttc gatcwgtaat ttcattcgat gagatagcta 1860 atgaacatac caatggattt catcacgtgc acttacgcct tgacatcgga gaatcaccgt 1920 tccatataca tgcaatatac cgacctccat cgtttgactt gtcatcattc ttttcgaact 1980 tggatcagat ctgtgcggca gcaggacgtt gtggctcctc catcatcata ggtgatatta 2040 acatcccaac caatcgtgct ggatgtcgca ttgtaaatga atatataaat ctcctagact 2100 gctataattt ctctattaca aacacctacc ctacaagacc gtctagtggc aatgtacttg 2160 atcacacaat atgttctgaa tcgctgcaaa acgtagtaac taatgagaca atttwcactg 2220 atatgagtga ccattgctta gtgctatcta cgttctcact acaaaaaccc gttaaaaaac 2280 tgcatctaga gaaaaccatt gtttccaatc acaaactgaa tatggcattt caaaatgcaa 2340 tgactaatat tccgaatgga agtgctgagg tcagactaca gttcgtcctg aacacctaca 2400 acacgttgaa gatacagttt tcaaaaactg ttagtgtaga agccaaaatt aagggatttt 2460 gtccgtggat gtctttcgat gtttggaaac tgatcaaaat caaggaaaac attctgagga 2520 attggaggcg taatccagct aacgttcgct gcaaggagtt attggaccat atctccaaga 2580 aagtgcaaca ggcaaaagat catgctaaga agagctacta tagaaactta ttcacgaccg 2640 ctaatcaaag aaacacctgg aagaacatca ataaaatcct aggccgagac agaagggcaa 2700 cagagaacgt gaagctaact gtcaatggac atcaaacaac tgatagtccc accgtagcga 2760 atgccttcaa cgacttcttc tgttcgatcg gaccccagct tgccgcgacg attcgcagta 2820 gccaagacat caacaagttc aacacattga ttcctcaaag acagtcgatc tacttggaac 2880 ctaccacgga gcaagaagta attctgctga taaaggaact gaaaaacggt aaaagtagcg 2940 gagttgatgg tatctcagct gagttcatga aacaccatca caatgtattt gctgtacttc 3000 tgcgtgatat cttcaacgaa tgtatcaccc tgggaacctt tcccgattgt ctaaaaattg 3060 cacgtgttat tcctgttcac aaaggaggca gcaaactgga tgtcaacaat taccgcccta 3120 tttcaatcct gtcggttcta agcaaagtac tggaaaaatt gctagttacc agggtagaca 3180 actttctacg acaacatgat gtactttact gccgacaata tggattcaga gagggatcaa 3240 gcacatggac ggcggctagc gagttggttg atgaaatcta caatkgcctg gacaaccgga 3300 aaatagaagg agtattgttt atcgacttga agaaggcttt cgatacgatc gaccatcgcg 3360 ttctattacg aaagctgaac tactgtggga taagagggct ggcaaacaaa ttgttcgaga 3420 gctatctgac tggaagaaag caatatgtgt ctgtaaacgg agcattcagt agcgaacgtg 3480 atgtgacggt aggagttccc caaggcagca atgtaggccc tcttctgttt ctactgtata 3540 ttaatgactt ggcaaaactg aaattacacg ggaaccctag aatgtttgcc gatgacacgt 3600 ccctatcgta tcagaatgca aatccagatt tgatcattca acatatgaaa gaagatttga 3660 ccgttctgca ccactttttt gctgaaaaca tgttgtcact gaatctctta aaaacgaaat 3720 ttatgctgtt tcattcacct cgcctcaaaa ttcctcgaca tgcggatctg gtagtaaata 3780 cgactgtagt tgacaaagtc caatctttca agtatctcgg cctcaccttt gattcgaaac 3840 tacagtggaa cgagcatatt tcgtcactac agcgagaatt aagtgccacg tgtggtgtaa 3900 tgtggaaagt atctaaattt atgccccaga aagaattgtt ggcaatgtac catgcatttg 3960 ttcaatcgaa actccaatat ctggtatcag tctggggagc tgcaactaaa actgcaatca 4020 gagcactaca atcaacacaa aaccgctgct tgaaaattct ttaccaacga cctttgctgt 4080 tttcaactct gagattgttc aaggaagctg ccgacgcgat tctacccatt gctgcacttc 4140 gtgagaaaga aggactagtt aaaatgcata atattttaac caatccccgc actttgcata 4200 atcgacaatt gcaacacaca tctacccgat acccactcag aagccaggca gtcttggtga 4260 ttggccgtcc aaaaacggaa gctgggaaaa aatccttcgc atactttgga tacaaccgtt 4320 acaatgctct acctcctcat atgaaagctg aaagaaatat ggagcgtttc aaaaaaatgg 4380 ttctgtgtct gataagaagc aaaattgcaa cgtacgtaga ctaagcatca gattgctagg 4440 tcagggagaa tcgccagtgt cacgtccagc tccttcactt ttcaacgcca accacactcc 4500 gctgttccgt caacgcctgc gcctgcttgc tcggtcagga agaatcgcca gttacttttc 4560 caggtaatga agactggtat atgtctggcc gaagccagtg gtgctacgct cgttaaattt 4620 gctcgtaaat ggttccagct cctccacttt tcaacggcaa tcacaccccg ctgttacgtc 4680 aacgcctgcg cctggtcgcg gtcaagaaga atcgccagct acttttccag ctcctccact 4740 gttcaacggc aatcacactc tgctgctccg tcaacatctg cgcctggtgg ttcggtcagg 4800 aagaatcacc agttcctttt tccaggtaat gatgactggt atatgtctgg ccgaagccag 4860 tgatgctacg ctcgttaaat ttgctcgtaa atggttccag ctcctccact tttcaacgcc 4920 aaccacactc cgctgttccg tcaacgcctg cgcctggttt ctcggtcagg aagaatcgcc 4980 agctactttt ccagctccat cactctacaa cggaaattag tttgtcctgc atctttacgc 5040 taccgatgac cctgaacatg ctgtccccac gctacttcaa cagtactcag ctgtttcgtc 5100 aacatctgcg cctggtggct cggtcaggaa gaatcaccag ttcctttttc caggtaatga 5160 tgactggtat atgtctggcc gaagccagtg atactgcgcc cattaaaatt gtcttcaaat 5220 ggttccagct ccatcacacg tcaacgccaa tcagtttgcc atgcaccttt acacaagcga 5280 tgacccgatc cttgagcatg acgcccccac gctgctccaa cagaactccg gtgctccgtc 5340 aacaacagcg cctggttgct cggtcaggaa gattcgccac ttacttctcc agcttttcac 5400 tgcttaacgc caaccacact tcactggtgc gtcagcatcg gcgcctggtt gctcggttag 5460 gaagaatcgc cagaagacct ttcctgcaga tatcaccgct gatcatcaat cctcgccatg 5520 tgtttttctc gttgctcgaa ttgttatttt ttgaattcaa attwtttgta gccgtgtaaa 5580 ttagtaatta ttaatttagg aagttaatta attgaattag atttaccgca ctcccttcaa 5640 agagaattat ttctcactgg aagtgcatgt ttattgtaat gttttgatga aaagaagaga 5700 aggttttatg cctctaggag aagtgcttcg aaaggaacgc cactcctagg ggcttttccc 5760 tactccaata aataaataaa taaaaaaaa 5789 // ID Uhu_DAn repbase; DNA; INV; 1656 BP. XX AC . XX DT 04-MAR-2010 (Rel. 15.06, Created) DT 04-MAR-2010 (Rel. 15.06, Last updated, Version 1) XX DE Tc1-like DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; UHU; tc1-like; KW Uhu_DAn. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-1656 RA Styles P.; RT "Uhu_DAn."; RL Repbase Reports 10(6), 932-932 (2010). XX DR [1] (Consensus) XX CC Consensus sequence for Uhu, a Tc1-like DNA transposon, in D. CC ananassae. There are 17 complete or partial copies of Uhu in this CC species. Uhu is also found in members of the Hawaiian Drosophila, CC including D. grimshawi, where its consensus sequence is almost CC identical in length at 1655bp, compared with 1656bp in D. CC ananassae. XX FH Key Location/Qualifiers FT CDS join(373..978,1093..1350) FT /product="Uhu_DAn_1p" FT /translation="NGTRYNNRAARTRPAPLQKWQKWRAIDEIVSLSPSTV FT QYIIQRFVRKDRIEDKGRNAPNKIFYEHEERRIVLKIKESPKLSASKIVAE FT VAQEMGKKFSADTVRRVLHDHNFNGRVARKKPLISKKNIAARLKCSKEHLL FT HPISFWDDVIFADESKFNLFGSDGRQFVWRRPKTELDNKHLKATTKHGGGY FT VMVWACMAASGVYQDNDPKHSASIVKTWLVWNCSQVVITPAQSPDLNVIEN FT LWEILDQDIRKRKISNKNDLKTALLEEWGKFPVEMTKKLFFFHCEPD" XX SQ Sequence 1656 BP; 545 A; 291 C; 343 G; 477 T; 0 other; tacagtgggt cgcagcttat ttcgtgcagg ggaataaaaa atgtttaagt cgctattgcc 60 gctaaacgaa tagggatatt ttaaaaattt aaacgcaata ttttcgttag ggatattaac 120 atatttaata actaaaaaat aagttcataa acataaaaac aaaaaagtta cataccaaaa 180 accaaattaa cagtcatttt cgaggccgca gcttatttcg tgcagcatat ttatatacca 240 aaaatattaa attttaagcc aaactcccaa tttttttttt gctttttggt attatataac 300 tttaactatt tatataatct acatctcggg gattgatcac tctaaatgca aaagattaag 360 gggaaaaact gaaatggcac caggtacaac aatagagcag cgagaactcg tcctgcacca 420 cttcaaaagt ggcaaaagtg gcgtgccatt gatgagatcg tatccttaag tcctagtact 480 gtacagtata tcatccagcg atttgttcgg aaggatagga tagaggacaa gggccgaaat 540 gcgccaaaca aaatttttta cgaacacgag gagcgccgga tagttctaaa aattaaggag 600 agcccgaagt tatccgcctc aaaaattgtg gctgaagttg cccaagaaat ggggaaaaag 660 ttcagtgctg atacggtgcg ccgcgtgtta cacgatcaca actttaatgg tcgtgtagca 720 cggaagaagc ccttaattag taagaagaat atagcggctc ggttaaaatg ttcgaaggag 780 catctccttc atcccatttc cttttgggat gacgttatat tcgcggacga atccaaattc 840 aatctgtttg gatcggatgg gagacaattt gtttggcggc gaccgaaaac ggagctcgac 900 aataagcacc taaaagctac gacgaagcat ggcggaggtt atgttatggt ttgggcctgt 960 atggcagcct caggagtctg aaacttagtt ttcatcgacg gcattatgga tgccaaaatg 1020 tatttaaacc tcctgaagga aaatctgctt caaagtgctg ataatttggg catacgggac 1080 agatttagat agtaccagga caatgatccg aagcattcag ccagcattgt gaagacttgg 1140 ctcgtctgga attgttccca agtggttata acaccagccc aatccccaga cttaaacgtt 1200 atagaaaatt tgtgggaaat attggaccag gatatccgta aaaggaagat ttccaataaa 1260 aatgacttaa aaacggcgct gttggaagaa tggggaaaat ttcccgtgga aatgacaaaa 1320 aaattgtttt tttttcattg cgaacccgac tgactgctgt ccagaaccag aagggaggac 1380 atactaaata ttaagttata aaataagagt tttatgttaa gcctaattaa gtgcacgaaa 1440 taagctgcgg cctcgaaaat gactgttaat ttggtttttg gtatgtaact tttttgtttt 1500 tatgtttatg aacttatttt ttagttatta aatatgttaa tatccctaac gaaaatattg 1560 cgtttaaatt tttgaaatat cccttttcgt ttagcggcaa tagcgactta aacatttttt 1620 attcccctgc acgaaataag ctgcgaccca ctgtat 1656 // ID Gypsy11-I_Dpse repbase; DNA; INV; 8359 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy11_Dpse; KW Gypsy11-LTR_Dpse; Gypsy11-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-8359 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1080-1080 (2009). XX DR Genome; Unknown_singleton_87; Positions 21706 30064. XX CC Positions [5409-5885] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1432..2778 FT /product="Gypsy11-I_Dpse_1p" FT /translation="MGVKWISVRRKNELEMITAELGLDSAGTVEELRRRLA FT TFANSADLPSTARSRLEELEAQYGVAVTPEVKPLSPRPSTLAPLSLQPHPP FT AAYDVVPASSRGIVASGSAVPAESLLHGIDVGAAQRNLHGSKDPPTKGSTW FT NEEHFNAAITEKMVRWGIVFDGSGDPLSFLEHVQERAATYKIDLRHLSQGI FT LVLLTGRAESWFRTSRLSGEPWETFRKEFSEFFLPPRYFQRLEDEIRTHFQ FT RPKEAFKTYLVDIRLMMQRAGYTEAQELERIYDNMLPEYQLFTRRKDFDSL FT SELTQLVVNFEVTRNKGPSELTGYANQARDTHPRETLGPTAGQRGPRNHAA FT ARTQDLRHDRTQVRDWSTDSQQPDVRNGTIFQDADTIIDVQHACRNCGGSG FT HFSRECKQPQVLYCWDCGRRGLRTIECCRRNSSGNGQALHSAGDRAKTDNP FT TPRQ" FT CDS 2991..6347 FT /product="Gypsy11-I_Dpse_3p" FT /translation="MADIRLGDQQVKMPVLVMKDVLDDVLLGMDFLCGIDA FT TLQCGGVPLRLQFKHSHDTRTQDIGSTARLDQPTNLSRDRAAQHQEPTRND FT VPNEERVADPLGSMMVRWEKLARDDVPNETCVAVPSRNSRNSGEVSTPAGY FT QPGWTSALPGLDHRAMLKNPFRVHEVQQRSKKRVRFEPGIGDNGPRRRRTR FT ERTQGATVATVSISQLPDVLAPGFEPEDDEPLEPRITEWLQQELKQFDGLS FT GVSPIAEHTIVMRDNKPIKQRYYPKNPAMQAVINKQVDELLKEGQIEPSKS FT PHSAPIVLVGKKTGDIRMCIDFRQLNARSIPDAYPLPRIQHILEQLRDARY FT ISTLDLKSGYWQIPLAPTSRECTAFTVPGRGWFHWKVMPFGLHSAGATFQR FT ALDSVIGPDMEPHAFAYLDDIIVIGHTVEDHMRHLGTVLQRLRKANLRLNK FT DKCSFFKRRLKYLGHVISGNGIHTDPDKIAAVRNLKPQAGVKELRRCIGMA FT SWYRRFVPNFSEIVEPMTALLRKDRQWKWSEEQEKAFEELKIRLTEAPVLA FT CPDFSEKFSLQTDASEQGLGAVLTQKIRDEERVIAYASRRLSKAEENYSVT FT EKECLAIVWSIRKLRCYLEGYRFDVITDHLALKWINSIDNPTGRIARWALE FT LQQYQFDIRYRRGKQNVVADALSRQPLELLHQALEEESQCKWIRKMLARIR FT QQPGRYEDYRSESGQLYRRLHTVPEEEDSTPWKLCVAAEHRRRVLKECHDH FT PTAGHLGIRKTSTRVAQKYYWPGLFREVARYVRQCVVCQKYKVSQFKPAGK FT MFTRQVDEPFSVLCADFVGPLPRSKHGNTVLLVFFDAFSKWVELVPLRKAT FT TPHLERSFRERILSRFGTPRTFVCDNGAQFTSRSFKGFCKRLGMELQHTAP FT YTPQQNPTERASRTIKTRVAQYIDGKQNTWDELLPEITLAINSSVSDSTRF FT SPAFLIQGREPRLPGALYDEVTPGTGTSIADPETRARQMREVFEIAKTNCD FT KASEEQKRHYNLRRREWKPAIGSQVMLRRHTLSKAAEGFASKLATKFEGPY FT RVTKFLSPNLVRLQVLNGRKQRSASLNDLKAFYPTIDEDDEKTDQIRTDSE FT DDPDRTNIQQS" XX SQ Sequence 8359 BP; 2187 A; 2465 C; 2254 G; 1453 T; 0 other; acgccgcttt cccccccccc cccccctccc taatttctcc cgaaggcccc atcgttgacg 60 gcggctcggc gatccctgcg ccaacgactc gcctcccccc ccagtctcaa cgtctcagca 120 acttacaaat aaactgtagg gtgaccctgg gcagactgag actgacccca gcctaggatc 180 actactttac agatggcgcc cgagcaggga ctgttgagac aattcaaacg atagttttct 240 gggggggatt tcacccttcg tggcctaaag acccacaagt agatcgcgta ggtagcgccg 300 catcaaaccg aggagaaatt tatagggtat caataagaaa atacgatacg aaggtacagg 360 ccagcagcga gcaaagcgtg gctgaaaagc tgtacaatcc ccgttagccg tcggaaaatg 420 aagaggagag aacgggaaag aggaaatgag agaggaaaac gacagaatga aagtacagcc 480 cgcagccaag caaagcatag tgatgccctt cccatgtcca gcagaaagag agaaaaaaaa 540 aaacatccaa gcgaagcacc agcgacgaag cagcacgcaa agcagaacga aaattccgtg 600 gtccccgtta tactcaggca aaaccatgcg ggatcatggc gcccggtacc gtggggaaag 660 agccaagggc agccagacct ctgttacatt tcagtgggga gagagggggt gaagaccgga 720 gaacagttct gggcgtcggc ggcgggcatg tgccgaacga cccacgagta taatacatga 780 aattatccct tcctcatact acccgcaccc cccgatcttc gactatttac ctaaccattc 840 gcccgcggct ctctcgcgtg gtgttctcgg ccgattccga gacgtcgacg tcggcagaga 900 ctcggccgtt cagccagacc caatataagc cgggttgtat acaacgcaag ggagagtcat 960 cctcagcatt cggtcaaaca tacaacgtgt gaaacagagt gatcccctgc cttcgatcag 1020 tacacaacgc gcaaaataca gtggtcctta tcccgcagca tacagtggtc ccaatcccgc 1080 agcatacaac gtgcgaaaga cagcgatccc cgcgtcttct gtccacctcg cgctcccccc 1140 gaaaccgaac acgaaccaac gcattctaac cggaacaatc ctgcagaaac caacgactaa 1200 agtacccctg aacaagcgcg tgttaattgc cagcccccgc gtccgggccc acgaccaaga 1260 cggcccaccg gtaattcacc accgaatagc ctcgcacgcg tcctcgccgt ttaaggattc 1320 agagccccgt ctttcgccgc gtccttagct acgtttttcg actcctcggg cgacgcccca 1380 gacatttgat cgtagaccac gtgcagcaac agtgacccgg accaactcaa catgggagtc 1440 aagtggattt cggtcaggcg caagaacgag ttggagatga ttaccgcgga gcttgggtta 1500 gacagcgctg gaactgtcga agagttgaga cgtcgactcg cgaccttcgc aaactcggcc 1560 gatttaccaa gtacggcgcg ttctcggttg gaggaattgg aggcccagta cggagtcgcc 1620 gtgactccag aggttaaacc actttctccc cgcccttcta cactcgcgcc tcttagccta 1680 caaccccatc ccccggctgc gtatgatgtc gtacccgcca gtagccgtgg tatcgtggcg 1740 tcgggatccg ctgtgccggc agagagcctg ctccatggca tcgacgtcgg tgccgctcag 1800 agaaacctgc acggtagcaa ggacccacca acgaaggggt ccacctggaa cgaagagcat 1860 tttaacgctg cgatcaccga gaaaatggtc cgctggggaa tcgtgtttga tgggtccggc 1920 gatccgttaa gtttccttga gcacgtgcaa gagagggcag ccacttataa gatagatcta 1980 cgacacttat cacaaggaat cctcgtgttg ttgaccggtc gggccgagag ttggtttcga 2040 acgagccgcc ttagtggaga accatgggag acctttcgca aagagttttc agagtttttt 2100 ttgcctcccc ggtactttca acggctagag gacgagatcc gaacccattt tcaacgacca 2160 aaagaagcgt ttaaaacgta cctcgtggac attcgactaa tgatgcagcg cgcgggatac 2220 accgaggctc aggaattgga aaggatatac gataacatgc tcccggaata ccagttgttt 2280 acgaggcgaa aagattttga ttctctctcc gaattaacgc aactggttgt gaacttcgag 2340 gtgacgcgta acaagggacc gtcggagctc acgggctacg cgaaccaagc acgagacacc 2400 catccgcggg aaactttagg gccaacggct ggtcaacgag gaccaaggaa tcacgcggct 2460 gcgaggacac aggatctccg gcacgataga acccaggtga gggattggtc gacggactcg 2520 cagcagccgg atgtcaggaa tggaaccatc ttccaagacg ccgacaccat aatcgacgtg 2580 cagcatgctt gtaggaactg cggcggatcc ggacactttt ccagggagtg taaacagccc 2640 caagtcttgt actgctggga ctgcggccgt cgcgggctac gtaccatcga atgctgtcgc 2700 aggaattcgt cgggaaacgg acaggcgctt cactcggcag gggaccgggc gaagaccgac 2760 aatcccaccc cgaggcagta aagggaaccc tgcggctgga gagaggacga attcgcgcgc 2820 aagtaaccat ggaaggagag gacctactcg ctactctgga cacaggcgcc acacggagct 2880 tcgttagcga acgcaaagcc ctggagctgg acaacggaca gagtatagcg atggtacgaa 2940 cgacgatcag attggcggat gggtccctcc gggaattgac ttaggcgctg atggcagaca 3000 tacgactagg agatcaacaa gtaaagatgc cggtgctagt aatgaaagac gtgctggatg 3060 acgtcctcct cgggatggat ttcctatgcg gaatagacgc gaccctgcag tgtggaggag 3120 tccccttgcg cctgcaattt aaacacagcc atgataccag gactcaagac ataggatcca 3180 cagcccgcct ggatcaacct acgaatctca gccgagaccg ggccgcccag catcaagaac 3240 cgacgaggaa cgacgtgccg aacgaggagc gcgtggccga tcccctggga tccatgatgg 3300 tgaggtggga aaaacttgcg agggacgacg taccaaacga gacgtgcgtg gccgttcctt 3360 caaggaattc taggaattcg ggcgaggtct cgaccccggc aggataccaa cccggatgga 3420 catcggcgct ccccggcctc gaccacagag ccatgctgaa aaaccccttc cgggtacacg 3480 aggtacagca gcgatccaag aaacgagtgc gattcgagcc cggcatcggc gacaacgggc 3540 cccgccgtag gcgcaccagg gaacggacac agggagcaac agtggccacc gtgagtatat 3600 cccaattgcc cgacgttttg gccccgggat tcgagcccga agacgacgaa cccctggaac 3660 cccggatcac cgaatggttg caacaggaac tgaaacaatt cgacggcctc agcggagtgt 3720 cccccattgc agaacacacc atcgtgatgc gcgacaacaa gcctataaaa caacgttatt 3780 atccaaaaaa ccccgctatg caagccgtca ttaataaaca ggtcgacgaa ctgcttaagg 3840 aagggcagat tgaaccttcg aagagtccac atagcgcccc aattgtcctc gtgggcaaga 3900 aaacaggtga catcaggatg tgcatagact tccgacagtt gaacgcgcga tcgatccccg 3960 acgcgtatcc cctccccagg attcaacaca tattggaaca gctgagagat gccaggtaca 4020 tatcgaccct ggacctgaag agcggctact ggcaaattcc gcttgcccca accagccgcg 4080 agtgcacggc gttcaccgtg ccgggccgcg gatggtttca ctggaaagtg atgccttttg 4140 gccttcactc cgccggcgcg accttccagc gagcactgga tagcgtgatt ggccccgaca 4200 tggaaccaca cgcattcgcc tatcttgatg acattatcgt catcgggcac accgtggagg 4260 accacatgcg gcatcttggg acagttctac agcggctgcg caaggctaat cttcgactca 4320 acaaggacaa atgcagcttc ttcaaacgca gactgaagta tttaggacac gtcatcagcg 4380 ggaacggcat acacacggat cctgacaaga tcgccgcagt gcgcaacctg aaaccccagg 4440 caggtgtcaa ggaacttcgc cgatgcatcg gtatggcctc ctggtaccgg cggtttgtac 4500 ccaacttttc cgaaatagta gaacccatga cggcgttgtt aaggaaagat cggcaatgga 4560 aatggtcgga agaacaggag aaagccttcg aggagttgaa gatcaggtta actgaagcgc 4620 cggtcctcgc ctgccctgat ttttctgaaa aattctcatt gcaaaccgac gctagcgaac 4680 aaggattagg agccgtcctt acacaaaaga taagggacga ggaacgagta atagcctacg 4740 cgagtcgacg cttgtccaaa gccgaggaga actactccgt tactgagaag gaatgcttgg 4800 caatagtgtg gtcaatccgc aagctgcgct gttacttgga aggataccga tttgacgtga 4860 tcacggacca tctcgccctc aaatggatca actcgatcga caaccccact ggacggatag 4920 cacgttgggc tttggagctg cagcaatatc aattcgatat tcgctatcgc cgagggaagc 4980 agaacgtggt agcagatgca ctttcccgcc aaccattgga gttactacat caagctctgg 5040 aggaggagtc acagtgcaag tggatcagga aaatgttggc gagaatcagg cagcagccgg 5100 gcagatatga agactaccga agcgagagtg gacaactgta tcggcgcctg cacaccgtgc 5160 ccgaagaaga agactccacc ccatggaaac tctgcgtcgc tgcagaacat agacgacgcg 5220 tactaaaaga atgccatgac cacccgacgg ccggacatct aggaatccga aaaacaagca 5280 cacgagtcgc ccaaaagtac tactggccag gcctgttccg agaagtcgcc agatacgttc 5340 gccaatgcgt agtgtgccaa aaatacaagg tgagccagtt caaaccggcg gggaaaatgt 5400 tcaccagaca agttgacgaa cccttcagcg tgttatgcgc cgactttgtt gggccgttgc 5460 cccgctccaa acatggaaac accgtcctcc tggttttctt cgacgcattt agtaagtggg 5520 tcgaactggt gcctctacgc aaggcaacta cgccacacct ggaaagatca ttccgagaaa 5580 gaatcctaag ccgctttggt acccctcgta ccttcgtttg cgacaatggg gcacagttta 5640 ccagccgatc gttcaaagga ttctgcaaaa ggctagggat ggaacttcag catactgccc 5700 cgtacactcc ccaacaaaat ccaacagaac gcgcgagcag aacaattaaa acgagggtgg 5760 cgcaatacat cgacggaaaa cagaacactt gggacgagct cttgccagaa atcacgctcg 5820 cgatcaactc aagcgtgtcc gattcgacgc gattcagccc cgcgtttttg atccaaggtc 5880 gcgagccaag actgccaggc gccttgtacg acgaggttac gccaggaacg ggaactagca 5940 tcgccgaccc ggagaccagg gcacgacaga tgagagaggt gttcgaaatc gcaaaaacaa 6000 actgcgacaa agcatcagaa gagcaaaaac gacattacaa cctccgccgt agggaatgga 6060 aaccggctat tggatcccaa gtgatgttac gtcgccacac tctgtccaaa gcagccgaag 6120 gcttcgcctc caagttggcc acgaaattcg agggaccata ccgggtgacc aagtttctct 6180 ctccgaatct cgtccgatta caggttctaa acggacgtaa gcagcgatca gccagtttga 6240 acgacctgaa ggcgttctac ccgaccatcg acgaagatga cgaaaagacc gaccagatca 6300 ggacggacag tgaagacgac cctgaccgaa ccaatataca acaatcttga ttacaacgca 6360 gatctactca aaccgccacg acgatgctga caccccggat acacagagac gaaaccaatc 6420 aggaacccgg atatcttgcg aggaacgacg caccgaacga ggcgtgcgtg gctgctcctc 6480 ctggacacat ccgcttcggg aacccggata tcttgcgagg aacgacgcac cgaacgaggc 6540 gtgcgtggct gctcctcctg gacacatccg cttcaggaac ccggatatct tgcgaggaac 6600 gacgcaccga acgaggcgtg cgtggctgcc cctcctggac acagccgctt caggaacccg 6660 gatatcctgc gaggaacgac gcaccgaacg aggcgtgcgt ggctgcttct cctggaccca 6720 tccgcttcag gaacccggat atcttgcgag gaacgacgca ccgaacgagg cgtgcgtggc 6780 tgcccctcct ggacacagcc gcttcaggaa cccggatatc ctgcgaggaa cgacgcaccg 6840 aacgaggcgt gcgtggctgc ttctcctgga cccatccgct tcaggaaccc ggatatcctg 6900 cgaggaacga cgcaccgaac gaggcgtgcg tggccgcttc tcctggacac atccgcttcc 6960 gggacccgga catccttcga agaacgacgc accaaacgaa gcgggcgtgg ctgttccttc 7020 aagggcaaac cgcaaccgtg attgtaccga agggaccaac caaggccgac attttggaga 7080 ttctcattaa gtctataacc gctccgccgc actcatagag tatcaaagta tttcttgcct 7140 gtagtcaacc agtccattcc tattcacagc tcatcgaatg cgagagaccc ccgccgacgg 7200 agaactacga caaaagccag gccggagccc ccagtcgact gggagatcat cgaggaactg 7260 attcttcgcc ccactccgcg agaatgccaa gtccacccag aggtcctcag ctgggaaaat 7320 ttatgcggcg acgtcgtaag ctggccccgc atcgaccaga tcatcgaggg gcacgattca 7380 gaccccgatc acgtcaatat tacaggagac gtggcccaga cgtacaggga cccaagccct 7440 ggcaacgccg ccagagccac gttcacccac gctcagggcc ccagccaggt caaccccgac 7500 caggtcgccg aagagcacgc gcagggccct ggacacgcca ccgtaggaga agcagccgac 7560 ttcaccgctc cgggtccgaa ccccgacgag ggggcagagg ccatcactat ccctgacggg 7620 ctgccaggtg ccaccgcacg cggataccgc cagcaattcg agccgacgga gttcgaaaaa 7680 gaggtgctcc gctgggaaga gccaccgctt ccggcactct tcggggccct tgatacgctc 7740 aaccttctcg aacggataga cgggttggac atcgccccgg aagatgcagg actaccggcc 7800 gtcagcaact ttgacgccat cctgcgtccc gaagacatcg acctcgaaga tgcaggacta 7860 ccggccgtca gcaactttga cgccatcctg cgtcccgaag acatcgcccc cgaagatgca 7920 ggactaccgg ccgtcagcaa ctttgacgcc atcctgcgtc ccgaagacat cgcccccgaa 7980 gatgcaggac taccggccgt cagcaacttt gacgccatcc tgcgtcccga agacatcgcc 8040 accgaagacg cagagctacc ggccgtccgc gatctggacg ccatcctacg tatcgacggc 8100 ggccgtccgg agataccaga aacgtcgacc cccaacccag ccgttcgttg ggtgactggg 8160 gaggcggatt ggagagtacg tacgcccgac ggccggctgc cagacaccct gaaagcccgg 8220 ctcctggcac aactcgtcaa cccgcggcgt aaagatgtcc gattcttggc agaggcggac 8280 aacaccttct tccaggtcca catcagcaag acgagcaaag tgactattcg ctcacgggcc 8340 ccccgaagta aggagaggg 8359 // ID DNA-TA-4_CQ repbase; DNA; INV; 241 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-241 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 54-54 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >90% CC identity. TSDs are TA. XX SQ Sequence 241 BP; 80 A; 42 C; 38 G; 81 T; 0 other; cacacttaga tttttttacc gaattcggta aagtttaccg aattttcaac tgctgaacag 60 ttcggtaaac tgaaaattac cgaattcggt aaaaacttac cgaaattcgg taatgaagtt 120 cattattgtt catcgtacaa ccgaaattcg gtaaaatttt gttcattttg acagatggtt 180 taccgatcta aactttaccg atttttcaac taccgaattc ggtaaaaaaa atctaagtgt 240 g 241 // ID Gypsy-4_AC-I repbase; DNA; INV; 3064 BP. XX AC AASC02003433; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_AC_; KW Gypsy-4_AC-LTR; Gypsy-4_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3064 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02003433; Positions 7204 10267. XX CC Positions [2055-2534] - Integrase core CC 'CGTCTC' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 987..2951 FT /product="Gypsy-4_AC-I_1p" FT /translation="MSQSTGNEVSEVNRLQKGKFYRQGQSQGQGQGPKQKF FT QRKENIGKKYHRCGRSPHDKQECPAKHAECYTCKKRGHFSTACRNKDKKSV FT RQVEEIFLGTLQLENEYGVRAVQSNDDWWHAEITTIGKPVKYRVDTGADVT FT VIPGRSFKKNSPLIKKTEKKLFGAGHQELQVKEVVRATLATRNTSSEQDLY FT VVTNLNEPLLGRPAIEALKLLEEECELHARSVVEGFPASDKKLQEIREQQD FT ADEICQKLKEYCQSSWPATTKGDPVMKPYWTSRDELAVEQCLLLYQCRIVI FT PTILRQDILKRLHEGHQGIVKCRALARSSVWWPGLSQQIETLVTNCPDCEK FT ERKVSPEPLKPTVTPDDPWQRVGMDLFVWKDHTYLLKVDYYSRWIEIAHLR FT EPTASNVVEHCKSIFARYGVPEVVVSDNGFHFVAHEFQKFSQYYCLTNLRS FT SPLHPQSNGEAERAVKTIKMLLSKSDDPYLALVNYRSTPLQQGQSPAELLM FT GRKIRTRVPIFPEKLVPKGGERQVFRKIDAAFRQRQKQDYDRRHCAKPMSD FT LSKGQPVWVKTPRDAKATVVQRSRDRSYLLKTDNGLKVRNRHQIRLRTEGD FT DSSHLVIPRESSMLPHHQPELPAESLESDTTLHDTFTTSGYITKSGRHVRP FT PQRLDL" XX SQ Sequence 3064 BP; 1025 A; 597 C; 740 G; 702 T; 0 other; tggtgtcaga agtggctact gaacagccaa ctacttcaag aaaacaccca aattcaatga 60 agtgctagag tttatctatc aagaacagtg ctctgtgaga gtagaggtct agactagtta 120 ggtttaggca gttaaatgtt tgaaaacggg aaggattttg gatgtaatct agatctggat 180 tctggagcta gaaactagag atctggatag aagtagatta tctagatgtg gtaataggcc 240 tggattaact agaatctaga tgttttgaac tatgaacagt gttgagttgg aactgtgact 300 tgtttgtttg ggtctggaga ttctagactt ctagacttag agtcttttat ttgagtagat 360 ctagatctat tctggatttc tagttccaca acaagtgaac agaaatggca caacataaca 420 gggaaaacca gcagaagcaa cacacagcca acgtacccct tccatcaaaa ttaaacgtaa 480 cagatggcac agacctagca caagggtgga aaaagtttaa gagaaacttc gaaaattatg 540 cgatcgccat cagattaaac aaggaagaag aagatttcca atgtgcagtg tttctagcga 600 cgattgggga agatgctgta gacattttcg aaggtttcca cttcgaggac gaggtagata 660 agaaaaatct ccaaagcgtc atacaacctt ttgacgactt ttgcataggg aagacacatg 720 aagcctacga gtcttacaag tttgacatga ggaaacaaaa acaagacgaa acaatagagg 780 cctacattac accacttcgt cagctagcca agtcttgcaa cttcggagac gcacagttgc 840 aagaccgctt aataagagat caggtggtga taggagtgag agaagaaagc ttaagataaa 900 agtttttgga agacaaaaat ctcaccctta caaggtgtct agatatagga cgagcatatg 960 agtcatctag gacacaatcg catacaatga gccaaagtac tggaaacgaa gtttcagaag 1020 taaacaggtt acagaaagga aagttttaca ggcaaggtca aagtcaaggt caaggtcagg 1080 gccccaagca aaaattccaa agaaaagaaa acataggtaa aaaatatcat agatgtggaa 1140 gaagtccaca cgataaacaa gaatgtcctg ccaaacatgc ggaatgctac acatgcaaaa 1200 agagaggaca cttctccaca gcttgtagaa acaaagacaa gaagtcggta agacaagttg 1260 aagaaatttt cctgggaaca ctacagctgg aaaacgaata tggtgtcagg gccgtgcaaa 1320 gcaacgatga ctggtggcac gcagaaataa ctacaatagg aaagccagtt aagtatagag 1380 tggataccgg tgcagacgtc acggtgatac cgggcagatc tttcaagaaa aactcacctc 1440 taataaagaa aacagagaag aagttatttg gagcaggaca ccaggaacta caggtaaaag 1500 aagtagtcag ggctacattg gctaccagga atacatcttc agaacaggat ttgtatgtcg 1560 tcacaaacct gaatgagcct ctgttaggta gaccagctat agaagctttg aaactcttgg 1620 aagaagagtg tgagctacac gccagatctg tcgtggaagg attcccagca agtgacaaga 1680 aattgcaaga aattagagag cagcaagatg cagacgaaat ctgtcagaaa ttaaaagaat 1740 actgtcagtc ctcatggcca gcgacaacca aaggcgaccc agttatgaag ccatattgga 1800 cgtcgagaga cgagctggct gtggaacaat gtttgctact ataccagtgc cgaattgtta 1860 ttcccacaat attaagacag gatatattga aacgtcttca cgaaggacac caggggatag 1920 tcaagtgcag agcgctggct cgaagcagtg tatggtggcc tggactgtca cagcaaatcg 1980 aaacgttggt tacaaactgt ccagactgtg aaaaagaaag aaaggtcagc ccagaaccat 2040 tgaagccgac agtcactcca gacgaccctt ggcaaagggt gggtatggat ctgtttgtgt 2100 ggaaagatca cacatatctc ctaaaagtag actattactc gaggtggatt gagatagctc 2160 atctaagaga gcctactgct tcaaatgtcg tagaacattg caagtcaatt tttgctagat 2220 atggagttcc tgaagtcgtc gtctcagaca atggattcca tttcgttgca cacgagtttc 2280 aaaagttctc acagtactat tgtttaacga acctgagaag cagccctcta catcctcaga 2340 gtaatggaga agctgaaaga gccgtcaaaa ctatcaaaat gttgctgagt aagtctgatg 2400 atccttactt ggctctcgtg aactaccgct ccactccttt acagcaagga cagagcccag 2460 cagaacttct aatgggtagg aaaatcagga caagagtgcc cattttccca gaaaaattgg 2520 tcccaaaggg gggagaaagg caggtgttca ggaagattga cgcagctttc aggcagaggc 2580 agaaacagga ttacgatcga agacattgcg ccaagcccat gtcagacttg tcaaaaggac 2640 agccagtttg ggttaaaacc ccaagggatg ccaaagctac ggttgtacag aggtcgagag 2700 acagatcata cctcttgaaa acggacaacg gcttgaaggt tcgtaatcgt caccagatcc 2760 ggctgaggac ggaaggcgac gacagtagcc atctcgtgat ccccagagag tcttctatgt 2820 tgcctcacca tcagccagaa ttaccagcgg agtcactaga gtcagacaca acccttcatg 2880 atactttcac tacgtctggc tacataacga agtcgggacg ccatgtcagg cctccgcaga 2940 gactcgactt gtagacaaag tttctgctgt gtacagtgta tattaggttt cagagttata 3000 gtttcagttt ctggttgatg cgtttagcat tttgaattgt tcttgaaatt gcttgtgagg 3060 ggga 3064 // ID ORTE-5_AAe repbase; DNA; INV; 7282 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-LTR retrotransposon family encoding cysteine protease from DE Aedes aegypti. XX KW Non-LTR Retrotransposon; Transposable Element; ORTE; ORTE-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7282 RA Kojima K.K. and Jurka J.; RT "A lineage of non-LTR retrotransposons encoding an OTU cysteine RT protease from the yellow fever mosquito."; RL Repbase Reports 11(4), 1128-1128 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >95% CC identity. This family encodes OTU superfamily cysteine protease CC upstream of apurinic-like endonuclease. It is positioned at the CC sister lineage of the lineage including RTE and RTEX in CC RTclass1. XX FH Key Location/Qualifiers FT CDS 1594..6273 FT /product="ORTE-5_AAe_1p" FT /note="OTU cysteine protease, endonuclease and FT reverse transcriptase." FT /translation="MNNRMEENVKWVIRDTFVWLEGTALKVQSFKVETDFV FT HMILEQVVVKMGEGIVLREDVFEMXSIILDYIMINQKDFIDHAISRNEVGN FT HEEGRLWFEAYVRAIVSGDQVFEREMLWVIAEICQCPVNIFYETGGKMSFG FT EKYHNERMGSLDFLITTAGIIENVVEVDKRTIHTQSGTGDELLEVLQLKDN FT DDELKKALKWQANQDEDDYKERRDKEIDDLFKEITIQGQDMNRKDLIMRKI FT GETATRMNKEIVIYKMEGSETVLGVTEGKKKRKECKLLLKERYGRMHFDIV FT VTSREIIRNNWINIIEELATDRYSMIPKENTYRFTHPHEGVLTVKQIVGDG FT NCMLRAMLDQIGKKYNKYCMSDYNVRKLRKVIVDYVHQEKEIYEEFMFDYK FT EPTESFETAIERMRENGRWLGHEAMLAATEVLKIAINVVGKDGANHLIGES FT FVNNGTINIFYTGETEQNHYDSIVGRQDGFYMSAQYNTHKSDSIISNNNNE FT KFKEATTIRLDGDDRNNSVQDWYETFMQNKTNEREAQIGNKNSRSTEQEAQ FT VSTADSDKNGKRSGSNEKKDTRNENTNKDIKSNRLNKESKPIEENSKLLRI FT ASWNIKGCRKNDKRNEIDEILTAYNIDVAALQEVNTIGDEINTENFRWKVV FT DNHFNKTRGLALLIRNNSGIEVISVKIVRKGIMWSKIRVNGNVLIIINIHA FT PNKKQSNFLSKLSSLTGGEKDRRHLILVGDFNAQLGWNDLGKEDEKWIGKL FT LGHDSCNENGEVFKMFLHTARLKNISSKIGKGTKVTWRSGNRQSQIDHILG FT PTMTDYRIKFIKGYWTHTNSDHKLITVGLQFQKVVRTKLRSQEPTKVNIEL FT LKDDKIRKKYQEQLISLPTYKDTKIHSTEECYIELITKIKRAANRTVRKSK FT IPSTPRRRKAFERLKAAIKVSKKHPDMQNYKYRLGNRRQEFNQAVKEHKEK FT EISNFFKNLNDFXVGQRIRKTHKYLKGFRKQKQKKRADISAKTWERILKES FT EGPGIDYCLNHDYTPLIQPPTDEEIKDIIKTSCCGKSAGPDRIFMEYLKHA FT DDDTIKTLIEIIQKGFTENKLPSDWMKSTQIPIPKKANASDADHFRRISLC FT NVVYKIYAKWMAMQLTKYAGEPDLYQAAFTNSRSTDDHIFVVRRVMEEHWN FT AGKDLYILALDIKKAFDTVKVQNLREILLSLNVPSRLVDRVIACIKDEMTR FT VLWDNQLSSEIKRGKGIKQGCPISPILFNYIMQDVIRNVAEKIPEMKLMNL FT NCLKIPLLLAFADDIIIIAKSKEELEKLLKELVEQLSIVGLELNYDKCQLL FT LRFPNNKTPKPQDIKLNGRQYKVCDKIRYLGVCLTDTLDRKSTNRLRCVSA FT YKTSRVVIEFCKKFKPSWDIGKLIYKTVLSPAITYGNKAAVLTKKSRIGMA FT NYEKLILRNIFNNCKKPLNLKFNARKLLDGKTVNRRVRVGRINYYGHILRR FT ENNHPIKLAYKMRFETKKEGRPSLTWKDSLNNDLNRYNNIDTEEWKQLAKD FT RDKLKSKAEEIYKETNSEISDGQTSEEEDDRQKPKYKHWKRKLG" XX SQ Sequence 7282 BP; 2862 A; 1025 C; 1588 G; 1803 T; 4 other; aagcggaaat ttcgcaaaat cacaaaaaaa caaaatcttc aggtagctag gaggataacc 60 aaaaaatcat cacgaaccct gatcacggca tggcgggagg gggaagggaa cagcaggaag 120 gaaaatagag atgcgcagtg ttgtcagatt taaatttaca tcttacaatg aaactcaaaa 180 ctttagatgt agtattcttt tctttgcgaa ctatagaaag gaagataaag gtgtaaaaga 240 aatggaaaaa tcgaatagac tcgtaaaaag cgtacgattc taaaaggtaa atacgtatta 300 aatcacaata ttcacaaacg aaaataactg aggataaaaa gaataaataa tggtagcact 360 gcatgggtta gtgcaaatga tgatgattat ctagatctga tgatggttca accgctaacg 420 aaacgagagg aaggggatat atattttttt tctggctatc tctttcgggc gcgtacatca 480 gtttcattca ctggtaccag acatgctatg ctcaaagtaa tcaatggtga gacgaggtac 540 ggaggttagg actgcgctag tgtgtgtcac gaggagtatt gtattcaata cagaatgaat 600 gactaacaca aataaagtcg gggctgaccg gactgacagc ttgacagtga gttgcaaacg 660 gtaattagtg gatgacacaa acaagttaga cactcacact atcaacctat atgatatagt 720 tgcaatgatg tgcaatacac acaacgaaaa ccattgatag tgacaagttt ggcgcaatgt 780 gaagagcggg aaaatcagtt catctttcgt ttggttgctt accgagtaag acgtgtaatg 840 tgtttaacag aaggaaaccc gcacaaagca tatttccgcg ttcaatttga ttacgttaac 900 gattgtgcca ttagtaattg aagcggacat atgctttgtg cgatagaaca gttttttaaa 960 agtgaaaata tcaaaaaatt aggacctgat agcgtcgagg aaaccgcaag cgttttggag 1020 cattgtactg gtggaacgag cgttgaaaag gaggtgctgt gcaaagtggt tttgaaaagt 1080 tgagtgaaga agcgagaaag gtgcaatttt ggtcaagcag tggagcggga ggctaagatt 1140 tattcctaga ggaaggaacc ggaacgggac agaaaagcgg atattctagc gtgattccga 1200 atagggaaat agtggtcggt tggcaacagt agtgacaatt ttcacacgcg agagagaaga 1260 agaaattgtg agtgctggtc attcggagag aagagagaag taaggctgag cgagcgtcaa 1320 agtgtcaaat gtagacgatt ccacggagag agtgatattt gttttacgtg attcagcaaa 1380 tttattatta ttgtctggta tagcgcattt cgaatactaa ttcattgtct tcaagcatca 1440 attggttttg gaacatagtt agatctcttt catggtaaaa tatttaacgg ttattgaatg 1500 aatacgaaga ggtgagctga tgaggtaagt tatatttaat atttaatatc taattktatt 1560 ttttgaactg caatagttta ttattagata gggatgaaca acaggatgga ggaaaatgta 1620 aagtgggtaa ttagggatac atttgtttgg ctagagggca cggctttaaa agttcagtcg 1680 tttaaggtkg agactgattt tgtgcatatg attctagaac aagtagtagt aaaaatgggg 1740 gagggaatag tactgaggga ggatgttttc gaaatgaamt ctattatatt ggattacatt 1800 atgatcaacc aaaaagattt cattgatcac gcaataagcc gaaatgaagt gggaaatcac 1860 gaagagggcc ggttgtggtt tgaggcatat gtacgggcaa tcgtttcggg tgatcaagta 1920 tttgaaagag agatgttgtg ggtgattgcg gaaatttgtc agtgtccagt gaacattttc 1980 tatgaaactg gtggtaaaat gtccttcggc gagaagtacc acaatgaacg aatgggatct 2040 ttggacttct tgatcacaac agctggaatc atcgagaacg tggtagaagt ggacaagcgg 2100 acaatacata ctcaatccgg tacaggagac gaattattgg aagtgttaca attaaaggat 2160 aatgatgatg aactgaagaa ggcgcttaaa tggcaggcta atcaggatga ggacgattat 2220 aaggaaaggc gtgacaagga aatagatgat ttattcaaag aaatcacaat tcaaggacag 2280 gatatgaata gaaaagacct aattatgaga aagataggag agacagccac taggatgaat 2340 aaagagattg taatatacaa aatggaagga agtgaaacag tgctaggtgt aacagaaggg 2400 aaaaagaaga ggaaagaatg taaactgctg ctaaaggaac ggtacgggcg aatgcacttt 2460 gatatagtgg tgacaagcag agagataata agaaataatt ggatcaatat aatagaggag 2520 cttgcaacag acagatattc gatgatacca aaagagaata catacagatt cacgcaccca 2580 catgaaggag ttttaactgt gaaacagata gtaggagatg gaaactgcat gctaagggca 2640 atgttagacc aaataggaaa gaagtacaac aaatattgca tgtccgatta taacgtcaga 2700 aaattacgaa aagtaattgt agattatgta catcaggaga aagagatata tgaggaattc 2760 atgtttgact ataaagagcc aaccgaaagt tttgagacag cgatagaaag gatgagggag 2820 aatggcaggt ggctaggaca cgaagcgatg ctggcagcga cagaagtctt gaaaatagcg 2880 attaatgtag taggaaagga tggagcaaat cacctaattg gagaatcttt tgtgaataac 2940 ggtacaatta acatattcta tacgggagaa acagagcaga atcattatga tagcatagtt 3000 ggaaggcaag atggctttta catgagtgca caatataaca cccataaatc agatagcata 3060 atatccaaca acaataatga aaaattcaag gaagcaacaa ctatacgatt agatggtgat 3120 gatagaaata acagtgtaca ggactggtat gaaacattta tgcaaaacaa aacgaatgaa 3180 agggaagctc aaataggcaa caaaaatagt cgtagcacag aacaagaagc gcaggtatcg 3240 acagctgata gtgataaaaa tggaaaaaga tcaggatcaa atgaaaagaa agatacaagg 3300 aatgaaaaca caaataaaga catcaaaagc aataggctaa acaaagaaag taaaccaata 3360 gaggaaaaca gtaaattgtt aaggatagcc agctggaata taaagggatg cagaaaaaac 3420 gacaaaagga atgaaataga cgaaattctc acagcctata acatcgatgt agcagctcta 3480 caggaggtca atactatagg agatgaaatt aacactgaga actttagatg gaaggttgtt 3540 gataatcatt tcaataagac gagagggctt gcgcttttga tcaggaacaa tagtggaata 3600 gaagtaataa gtgtgaaaat agtcaggaaa ggaataatgt ggagtaaaat aagggtgaat 3660 ggcaacgttc ttattattat caacatacat gcaccgaaca aaaaacaaag taatttctta 3720 agtaaactta gcagtttaac aggaggagag aaagatcgca gacatctgat actggtagga 3780 gattttaatg ctcaactagg atggaacgac ttaggaaaag aggatgaaaa atggataggg 3840 aaactattag ggcatgatag ttgtaatgaa aatggagaag tttttaaaat gtttttacat 3900 acagcaaggc ttaagaatat atcgtcaaag attgggaaag gaaccaaggt gacatggaga 3960 agtggaaata ggcaaagcca gattgatcat attttaggtc ctactatgac tgattacaga 4020 ataaagttca ttaaaggtta ttggactcat acaaattcag atcataagtt aataactgta 4080 gggttgcagt ttcagaaagt tgttagaaca aaattaagaa gtcaagaacc aacaaaagta 4140 aatattgagt tactaaaaga cgataagatc agaaagaaat atcaggaaca attgatatct 4200 ctaccaacat acaaagatac aaaaatacat tcaactgaag agtgctacat agaactaatc 4260 acaaaaataa aacgagcagc aaatagaact gtcagaaaat ccaaaatacc atctacgccg 4320 agaagaagga aggcattcga acgtctgaag gcagccataa aagtttcgaa gaaacatcca 4380 gacatgcaaa attacaaata tcgactagga aacaggagac aagaattcaa tcaagcagta 4440 aaagaacaca aagagaaaga aatcagcaac tttttcaaaa acttgaacga cttcgawgta 4500 ggacagcgaa ttagaaaaac acataaatac ttgaaaggat tcaggaaaca gaaacagaag 4560 aaaagagcag atattagcgc aaagacttgg gaaaggattt taaaagagag tgagggacca 4620 gggatagatt attgcttaaa tcatgattac actccactta tacaacctcc aactgatgaa 4680 gaaattaaag acattattaa aacatcatgc tgcggcaaat cagcggggcc ggatcgcata 4740 ttcatggaat acttaaaaca tgcagatgat gatacgataa aaacacttat tgaaattatt 4800 caaaaaggat tcacggaaaa taaactaccg agcgattgga tgaagtcgac gcaaataccg 4860 attcccaaaa aagcaaatgc atcagatgca gatcacttca gaaggatttc attatgtaac 4920 gtggtatata aaatttacgc aaagtggatg gcaatgcaac ttacaaaata tgcaggagaa 4980 ccggatttgt accaagcagc ctttactaat agcaggtcga cggacgatca tatatttgtc 5040 gttagaagag tgatggaaga acattggaat gcaggcaaag atctgtatat attagcattg 5100 gatataaaaa aagcatttga tacagtaaaa gttcagaatc ttagagaaat attgttaagt 5160 ctaaatgtac cttcgagatt ggtagataga gtcattgctt gtataaaaga tgaaatgact 5220 agagttttgt gggataacca attatcaagc gaaataaaaa gaggaaaagg cataaagcaa 5280 ggatgtccta tatcacctat actattcaat tacataatgc aagatgttat tcggaatgta 5340 gcagagaaaa taccagaaat gaaattaatg aaccttaatt gtttgaaaat acccttgctt 5400 ttagcctttg cagatgatat aataataatc gcgaaaagta aagaagaact agaaaagcta 5460 ttaaaagaat tagtcgaaca gttgtcaatt gtgggattag agctaaatta tgacaaatgt 5520 caactgctgc taagattccc aaataataaa acacccaaac cacaagatat aaaactaaat 5580 ggaagacaat ataaggtatg tgacaaaata aggtacttag gagtatgtct cacagacact 5640 ctagatagaa agtcaaccaa tagattaaga tgtgtgagtg catataaaac tagtagagta 5700 gtaatcgaat tctgtaaaaa atttaaaccc tcatgggata taggcaaact aatatataaa 5760 acagtattat ctcctgctat aacatatgga aataaggcag cagtgttaac caagaaaagc 5820 agaataggaa tggcaaatta tgagaaactt attttaagga acattttcaa taattgtaag 5880 aagccattga atctgaaatt caatgcacga aaactattag atgggaaaac tgttaacaga 5940 agagtaagag tagggaggat aaactattat ggtcatattt taagaagaga aaataatcat 6000 ccaataaaac ttgcttacaa aatgagattc gaaacgaaga aggaaggcag accaagttta 6060 acatggaagg attcacttaa taatgatttg aatagataca ataacataga tacagaagaa 6120 tggaaacaat tagctaaaga tagagataag cttaaaagta aagctgaaga aatctacaag 6180 gaaacaaata gtgaaatatc agacggacag acttcagaag aagaagacga tcgacaaaag 6240 cctaaatata aacactggaa gaggaaatta ggatagcttg agtttagtca aaagttagct 6300 gtaaaaaaaa aatggatcga ttaaactaac tatgcgtctt taatagggct aagaaaaata 6360 tcataaagaa cacagtatat gttttatgaa ataatttaag gaaaatttgg caatgacact 6420 caaaatgtta gacgtttcat tgcaaatcaa aaagtttaat cccaattaaa tatcatttta 6480 cagatgtctt catttttaaa aaataataga tagaactttt tccgattaaa tatagccaac 6540 agaaatgtta cactcatcag acacctctca acaccgattt ttgttttatt gattcaaaca 6600 agtacataaa aaaaaataat taaattgttt aatcgagtaa gaatgacaat gaattagtaa 6660 gcgctaaata ccagaacgaa actgaatgtt tatatttact ctgatggaaa atgtctatgg 6720 atataaatgt cgtaaaccac cggaataaat aatataacaa aagatgaacc attgccgtta 6780 ctaaacaata aacattacca gtacaataac cataaccatt cgtttcagca tatatatcca 6840 acagtaaggt aagcgattta aacaatttaa tgtattgcct attgaaactg actataatga 6900 aaataataac aaccaacagg gcacccgatt aaagcgagat ggataggaag agtgaaactg 6960 tcaacttcct atcgctaaca tttcgtggaa agagggcggg ctcttctccc catctattgg 7020 gtctttctgg caagcaaggg cttgattgta cgactttttg tgtgaactta cttctggaag 7080 gagattctct gtcccttgcg ggtttcgcgt aagtgtctat gcttacgggt ccgccattcg 7140 cttttggtcc ttcagacagg agtatgattg aatataaata gtcgtacagt cttgatcaaa 7200 ttgtttcgtc agcaaagacc attgctaccg gtggatacct tgctacaata gatggtagtc 7260 tatatcatca tcatcatcat ca 7282 // ID LARP1 repbase; DNA; INV; 214 BP. XX AC L42484; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Leishmania aethiopica DNA repeat. XX KW LARP1; Repetitive element. XX NM LARP1. XX OS Leishmania aethiopica OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania aethiopica species complex. XX RN [1] RP 1-214 RA Piarroux R., Fontes M., Perasso R., Gambarelli F., Joblet C., RA Dumon H. and Quilici M.; RT "Phylogenetic relationships between Old World Leishmania strains RT revealed by analysis of a repetitive DNA sequence."; RL Mol Biochem Parasitol 73(1-2), 249-252 (1995). XX DR GenBank; L42484; Positions 1 214. XX SQ Sequence 214 BP; 50 A; 69 C; 65 G; 30 T; 0 other; gcaagaagca agaggcagtg tcacagagat gggcgaaggg ggacggtggg agcgggagag 60 agaccgcggg cacgtggcga cgcccgtgaa atgaaaaaaa gcagaagacg cgtattccct 120 tttgctgatg tgtgcctacc tctctgccac agatcacgac ctcagctccc ctccacccca 180 acgcctcccc cgcgcggccc tgtcacaggc tccc 214 // ID Mariner-4_BM repbase; DNA; INV; 1291 BP. XX AC . XX DT 26-APR-2010 (Rel. 15.07, Created) DT 26-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-4_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1291 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 939-939 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. XX FH Key Location/Qualifiers FT CDS 174..1223 FT /product="Mariner-4_BM_1p" FT /translation="MELTRENSRAMIYYDFRSGLTQKQCVDRMISAFGDEA FT PSKTTIYRWFAEFQRGRVKLSDDPRQGRPKTAVTQENVDAVRKLIEEDRHV FT TYREIQATLDIGMSQIQIILHEQLGVKKLFSRWIPHSLCEEQKAARVTWCV FT RTLERFHAGSSNAVYNIVSGDESWIYAYEPETKNQSRVWVFENELKPTKIV FT RSRSVAKKMVATFVSKTGHVTTIPLEGQRTVNAEWYASICLPQVVSELRKE FT NCNRRIILHHDNASSHTAHRTKEFLEQENIELLDHPPYSPDLSPNDFYTFP FT KIKNKLRGQRFSSPEEAVDAYKTAILETPTSEWNGCFNDWFHRMEKCVKFR FT GEYFEKQ" XX SQ Sequence 1291 BP; 405 A; 266 C; 269 G; 351 T; 0 other; cgagggctgc actaaaagta tcgggaatgg aatatttcca ctgttcctgt catattaaaa 60 tctttttaat tgaaaactcc ttggttttaa aaatcgaata ccatttattt atttaaaaaa 120 agattctcgg tcttgtcacg aggttttgtc aaacttgttt agtcgttgag aaaatggaat 180 tgactcgaga aaattcaaga gcgatgattt attatgactt tcgaagtggt ttaacacaaa 240 aacagtgtgt tgaccggatg atttctgcat ttggtgatga agccccatcc aaaaccacaa 300 tttatcgctg gtttgctgag tttcaacgtg gacgtgtcaa gctcagtgat gatccccgtc 360 aaggtcgtcc aaaaactgca gtcacccaag aaaacgttga tgctgtgcgt aagctgattg 420 aggaagatcg acatgtgaca taccgcgaaa ttcaggcaac tttagacatt ggcatgagtc 480 aaatacaaat aatcttgcat gaacaattag gtgtaaaaaa gttgttttcc cgatggatac 540 cgcattcgct ctgtgaagag caaaaagcgg ctcgcgttac ttggtgcgtc agaactctcg 600 aaagattcca cgcaggatcc tcaaatgctg tatacaacat tgtatcaggt gacgaatcct 660 ggatatacgc gtacgaaccc gaaacaaaaa accagtcacg agtttgggtg ttcgaaaatg 720 agttaaagcc aacaaaaatt gttcgttcac ggagtgttgc aaaaaaaatg gtggccacgt 780 ttgtctccaa aaccggccat gttacgacta ttcctcttga gggacaaaga acggttaatg 840 cagaatggta tgctagcatt tgtttgccac aggtcgtttc tgaactccgt aaagagaact 900 gcaaccgccg catcatcctc catcacgaca atgcgagttc tcacaccgcg cacagaacaa 960 aagagttttt agagcaagaa aacatagaat tattagacca tccgccgtac agccccgacc 1020 taagccctaa tgatttctat actttcccta aaataaagaa taaattgcgt ggacagagat 1080 tttcatcacc tgaagaagct gtggacgcct acaaaacggc cattttggag accccaactt 1140 ccgaatggaa tggttgcttc aatgattggt tccatcgtat ggaaaaatgt gtcaaatttc 1200 gcggagaata cttcgaaaag caataaatac atttttaaat agtaatgttg tgtcacttcg 1260 ttaattcccg aaattttcag tgccgccctc g 1291 // ID Gypsy-74_CQ-LTR repbase; DNA; INV; 179 BP. XX AC AAWU01042220; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-74_CQ_; KW Gypsy-74_CQ-I; Gypsy-74_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 528-528 (2011). XX DR Genome; AAWU01042220; Positions 14583 14761. XX SQ Sequence 179 BP; 57 A; 29 C; 46 G; 47 T; 0 other; tgttatggta cggagtcgcg gtcttggcca cgacaaggag tttgggtatg acccacaaga 60 ggagagttaa gtagagacag ccatcttgct agagcagacg ttgtgaatta gagctcaata 120 aaattggagt tgttcagcat tgaaatcgag tatttctatt taaaataccg aacataaca 179 // ID CR1-10_HM repbase; DNA; INV; 4310 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4310 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1838-1838 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 751..2835 FT /product="CR1-10_HM_1p" FT /translation="MIFLRFYIIFLNFNIALSYFLFKMADETVLTNLNFNS FT FESKKSILLNNYSDPDINFYNNDEVIKNINPLYYNPNSKNIANGMDDNSFS FT MLHINIRSIQKNFESLKQFLYKIKINFQIICLSETWCHDKNIENNSNFQLP FT NYKVIHQVRASNKEGGGLCIFIKNSLLYKVKPNLSSTTNDYESLCIEIINE FT TSKNIIIHALYRPPSGSIKVFENHIKKIIKNKSSINKTVYFVGDINLNILD FT YDKNKNIKNFFNVIFQNSYIPLINKPTRITKESATSIDQIITNEFINKKIK FT TGIFKTDISDHFPIFIXSQKCIYNAQNKRKKIITTRLINDNSIKHFHNLLS FT CVNWDTLKLNLNAXKAYDIFLTEFLKLYNKAFPEITKFIKTKTLLNPWITK FT GILKSSKKKQRLYNKFLKKKTLKNETDYKNYKRLFESILKRSKKHYYSEQL FT IKHKNDSQKTWQIIKEVIGKKNLYRNLLPKNLKFNNKYIVNKSLVAETLNQ FT FFVNIGPTLASKIATTQLNFKSYLTSNNINVMXNYKLTEKELLDAVSLLKP FT NESLGFDDISSNVIIKSIKYITIPLLHIFNLSLKQGVFPEKLKIAKVIPIF FT KSGDPSNVANYRPISILPCFSKILERIMYNRLYSFLNINNILYNKQFGFKS FT GHSTDHAIIHLVHDIFKAFDEKKFTLGVFIDLSKAFDTVDHKILL*" XX SQ Sequence 4310 BP; 1856 A; 519 C; 430 G; 1468 T; 37 other; gggaaatttt tcaagaaatg ttycaaaaac aacaaaanga tattttaama ttaataagtg 60 gaaatttaaa actgactaat gatcgaatcg atggattgct aaaagaaatt actacattaa 120 aagaawcttg tgaawcttta aaaaaagaaa acgaaaacmg aaawgccgat caaaataaaa 180 acaaacaagc agttaataaa gtatgaaaaa atacaaaaag acattgaaga aagtctaacg 240 ttctatcaag atagccawga taaaaaagtt agcgayttag aaaaaaaaaa gtcatcgaat 300 gcaaatatag gtgagaatga aaaaaataaa tttagacarc tggaagatcg acaaagaaga 360 aataatcttc gaattgaagg agttaaagaa aatgataacg aaagttggga tgatacggaa 420 ataaaaataa taaacatttt tgaaaataat ctcaatgtaa atggagtaat catagaaaga 480 gcgcatagaa ctggaattat tgaaaaaaaa aaaccaagaa caattgtaat taagttacta 540 aattataaag acaaagtaaa aattytaaaa aatgccaaca aactaaaagg wtctggaatt 600 tatatcgacg aggatttttc gctagaaact acaataataa gaaaaaaact tcttgaagaa 660 agcaaaatgc ataggaaaaa tggtaagtat tctgttgtta tatacgataa rcttattgtt 720 aaagaattta taaaaaaaaa ataatgataa atgattttct tacgttttta tattattttt 780 ttgaatttta atattgcttt aagttatttc ttatttaaaa tggctgatga aactgtttta 840 acaaacttaa attttaattc atttgaatca aaaaagtcaa tacttttaaa caactattcg 900 gatcctgata ttaattttta taataatgat gaagtaatta aaaatattaa ccctttatat 960 tataacccaa attctaaaaa tattgctaat ggaatggatg ataattcatt ttcaatgctt 1020 catattaata ttagaagtat tcaaaaaaat tttgaatcat taaaacaatt tttatataaa 1080 attaaaatta attttcaaat tatttgtcta agcgaracat ggtgtcatga taaaaatatt 1140 gaaaacaatt caaattttca attacctaat tataaagtaa ttcatcaagt tagagcatct 1200 aataaggaag gtgggggttt gtgtatattt ataaaaaatt cattattata caaagttaaa 1260 ccaaatttaa gttcaaccac taatgattat gaatcattat gcattgaaat tattaacgaa 1320 acctcaaaaa atatcatcat tcatgcttta tatagaccgc cctcaggaag cattaaggta 1380 tttgaaaatc acattaaaaa aataattaaa aataaatcat ctataaataa aactgtttat 1440 tttgttggtg atattaatct caatatwtta gattatgaca aaaataaaaa cattaaaaac 1500 ttttttaacg tcatttttca aaatagctat attccactaa tcaataaacc aacgcgaata 1560 actaaagaaa gtgcaacatc aatagatcaa attataacaa atgaatttat aaataaaaaa 1620 ataaaaacag gaatatttaa aactgacatt tctgatcact ttccaatttt tattmtatca 1680 caaaaatgca tatataatgc ccaaaataaa cggaaaaaaa taattacaac acgattaata 1740 aacgacaatt ccattaaaca ttttcataat cttctatcat gtgtaaactg ggacacccta 1800 aaactaaatc taaacgcara taaagcatac gatatttttc taacwgaatt tcttaaactt 1860 tataataaag catttccaga aattacaaaa tttatcaaaa caaaaacact tttaaaycct 1920 tggataacta agggaattct taaatcttca aaaaaaaaac aacgattata taataaattt 1980 cttaaaaaaa aaactcttaa aaatgaaacc gattataaaa actataaacg tctatttgag 2040 tcaattttaa aacgctcaaa raaacattac tattcwgaac aattaataaa acacaaaaat 2100 gattcgcaaa aaacatggca aattattaag gaagtaattg gtaaaaaaaa cttatacaga 2160 aatttacttc ctaaaaatct taaatttaat aacaaatata tcgtaaataa atccttagtk 2220 gcygaaacac ttaatcaatt ttttgtaaat ataggtccta ctttagcatc aaaaatagca 2280 actactcaat taaactttaa gtcatactta acctcaaaca acattaatgt tatgkctaat 2340 tataaactta ctgaaaaaga attattagat gcagtctctt tgttaaaacc caacgaaagt 2400 ttaggatttg atgatataag tagtaatgtc attataaaat caataaaata tattacaatt 2460 ccacttttac atatttttaa tctatcttta aaacaaggag tttttcctga aaaacttaaa 2520 attgcaaaag ttataccaat ttttaaatca ggtgatcctt ctaatgttgc aaattataga 2580 ccaatctcaa ttcttccttg tttttcaaaa atattagagc gaattatgta taatagactt 2640 tattcttttt taaatataaa taatattctt tacaataaac aatttggatt taaatcaggt 2700 cattcaactg atcatgcaat tattcacctt gttcatgata tatttaaagc gtttgatgaa 2760 aaaaagttta ctttaggtgt ttttatagat ctaagtaaag cttttgatac wgtcgatcat 2820 aaaattcttc tataaaaact caaaaattat ggtataaaaa atacaaattt agcttggttt 2880 aaaagttact tgactaatag aaaacaatat atttcatatg atggmggaaa aactgaatat 2940 atgacaatta cttgcggtgt tccccaagga tcaattctag gaccactttt atttctaatt 3000 tatataaacg atttaaataa attttctaac attttaaact caattttatt tgcwgatgat 3060 actaacttat tttattctaa taaagatatt aatattttat ttaaaacagt aaacaaagaa 3120 cttataaaaa ctaaccgaat ggtttaaatc aaataaattg tccttaaata taaataaaac 3180 aaaatatact ttatttcatc gtcttcataa aaaagaaaat attccattaa aacttccaaa 3240 tctttttatt gataactcat taataaaaag agagcaatca ataaaatttt taggcgtkat 3300 tcttgatgaa aatwtaacat ggagggamca cataagtata attgaaaata aaatttcaaa 3360 aaatatwggt atattgtata aagcaaaaca gtttttaaat caaacttgtt taaaaaactt 3420 atatttttct tttattcatt gytatctaaa ttatgcaaac attgcttggt gcagcactaa 3480 tgcaacaaaa ataaaaaaac tgtttagtaa acaaaaacat gctataagga twattacaaa 3540 tgtagatcgt ttctctcata ctaaaacact atttaataaa cttaatattc taaatgtata 3600 tcaactaaac ctttatcata ttcttatttt tatgtttaaa cttgataata aaatatcacc 3660 aattttattt aattcatttt ttaaaaaaat aaatcatata taccctacaa gattttcaaa 3720 aaacaattat attcaatcca aaacacatta ttcagcaact aaattctcaa ttgcaaatcg 3780 aggtcctaaa ttatggaata caatattaaa taatgagctt aaaactaatt cttctttaaa 3840 ccaatttaaa acaaaactta arcaaaaact tttgataaat gaaaatgaat taaatttctt 3900 ctaaaaagct ttttaaatta gcaaacaaaa ataaaaatta atctattaat tactctttaa 3960 tattaattta atattttaaa caattgtctt attgaaatct atttattatc ttataagttt 4020 tatttacgac gattttttaa gttttattat tttttttkaa tttcttttta ttattttctt 4080 ttttattart acttacttat tttactaatc aattattatt ataaattttt cgaattacwa 4140 aatagttata tattttattt tttattttgt taacgtttta cttatgaatg cattttgggg 4200 cttagtgata agacgaattt tgtcttcttc tagccccagt catgtaaata tttgttttta 4260 taaattacat tgtaaattat tttcaacggc aaataaataa taaaaaaaaa 4310 // ID Gypsy-56_AA-I repbase; DNA; INV; 4752 BP. XX AC AAGE02021956; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-56_AA_; KW Gypsy-56_AA-LTR; Gypsy-56_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4752 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021956; Positions 99949 95198. XX CC 'CTGAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1784..3484 FT /product="Gypsy-56_AA-I_1p" FT /translation="MIEFHVIADKHMGLYDAIIGRDVIMRPDIRTVIENGI FT QIFEVEKGYLQIEDPSNTKVKQHKIFNIDSDTYKKLSSNIGEISDDRKAKC FT FKTKYYFNNKPSKPVTEYEMTIAFKKEEYFHFKPRRLSFDQKNKVEVKIKE FT LLKDGIIKESNSPYASPIVIIPKKGNDFRLCVDYRKLNKNTVRDNYPLPVI FT DDLIDTLNYKEIFSIIDLKSGFHPIRIATESTKYTSFVTPNGQYEYLRVPF FT GLCNAPAVFQRFINKIFKELIDDGKIIVYIDDILIASTSFDEHIDTLGKVF FT QILNDNLLEINLSKCQFLFEEIDYLGYTINKFGKKLNKSHVDSISNFPVPK FT NNKDVQMFLGLTSYFRKFVHNFSTIARPLYILLKKDAKFDFGKEQLDAFNS FT LKNKLISSPILATFDPQAETQLHCDASSFGFGAILLQKQKDNNFHPVSYFS FT RKTDEFESKLHSFELETLVVVYAIKRFHIYLAGTHFKNFTDCNALVQTLSK FT KNVNPKICRWALYLESYDFDLEYRDGSKMQHVDALSRQSTIVIKQFTEEEL FT VNKKEDKLSKDYYVNKTVNI" XX SQ Sequence 4752 BP; 1736 A; 695 C; 882 G; 1439 T; 0 other; attcagaagt aagataagcc atgcaggaac ggagtcgaaa aaatttcatg gaatcaggaa 60 gttttgcttc gatcggagaa aacactgcgg aaactggagt ctgaactgta gctgagagaa 120 atctaagaca agacgtgaca ggtcaaagta caaaaacgaa accaattgcc tgtcaaaaaa 180 gaccacttct cattggtcgt tttggcaaaa gcctactttc acatcgttga gttggattgc 240 aacagggcac tttgttgcta gcccggccgt gttgctagtt catagattag acgttgcatc 300 aaaagttcgt aaatttttca tgaaatcatt ttgtttcaaa tctgtttata ttcgatgtta 360 agttcaaagc atatgttcat acaatactaa aacatatgaa aaatacctaa ttgagaccaa 420 cataatggaa attgtgtaaa ttttctctaa aaaatatcgc cggttttgaa tttcaaccag 480 agttttgaaa agtatagcat cgtctattgc agtcaattaa taaagcgtgc aagaaacaat 540 caaattatgc gccttgatat tttaatttgt tcacaaggtt tgctcaaaaa ggattctgat 600 aacactggca tttgtatcgt caaggttatc ggctgaaaaa caacggggca gtcttggttg 660 agctgtcacg tcctgtccta gagagaaata agatgtcggg aaattcactt cacgtaggcg 720 gatcacttgc tgcaggtatc gtcgttgcaa gcgcaatgaa tgctgctgct gccgctgttc 780 aaccatcaac cggattatcg tcgaattgcg cggataacgg aggttgtaac tacgtctgtg 840 aagttgatcg agttgttcca gtgaaagttt ttggccattt caaaccagaa taaatctcct 900 caatgttgcc atcgttttct ggagacactg gagaagatgt aagccatttc atatccgttg 960 tagaaaatac gaaacgtgcg ttgggtgtga atgagttcat catgaagctt gtcgttattc 1020 aaaatttaaa agggaaggct aaactctggt tgcactcgca ggcggatttt atgctaaaaa 1080 attactacga aatcatacga agcttgaggg agttttacga taatcctgtg aacacaattg 1140 agattcgaag actgttggaa aagaagacat ggaatggaaa atttgaaacg tttatggaat 1200 actgtcaaga caaaaaagtt atcgcccaaa aattgaagat gcaagaatct gaaatagtcg 1260 aatacatagt tgaaggaata ttgaatccag tactgaaaaa tcaagcacga atgatgaatt 1320 tcaaaacggt tgaagagatt gttcaagctt ttcgtatgat aggaaatgac tcaagattgg 1380 agacgttttc aaggaggcct gtgaaatgct attgctgtaa ccaatttgga catattgctg 1440 cctattgctg gaagaaacga aatgttgttg gaccatatcg gtgtgagacg aacgcttgct 1500 gtcaatcaat atcaagaaac attgcggctg cagtaaagaa cgacttatta acttcaggag 1560 atggacgtaa cggacagata gaatttcagc ttattaatat aaataacatt tttagagctt 1620 tatacgatac aggaagtccg atttcactaa tacgacgagg tttagttaag aaatcggaaa 1680 tatctaaatt taatgaaaaa caaatgtatc ggagtattag tggaaacaga ctcaagatat 1740 taggagtttt caaagcgaag ttattattca aagtattcat ttcatgattg aatttcacgt 1800 tattgctgac aagcacatgg gactctacga tgccataatt ggaagagatg taatcatgcg 1860 tccagatatt cgaacggtca tcgaaaatgg tattcagatt tttgaagtcg agaaaggata 1920 tttacagatt gaagatccat ctaacacaaa ggtaaaacaa cacaaaatat tcaatattga 1980 ttctgatacg tacaaaaaac tatcgtcgaa tattggagaa atatcagatg atagaaaggc 2040 aaaatgtttt aaaactaaat attattttaa caacaaacca tccaaaccag taacagaata 2100 tgaaatgact attgcattta aaaaggagga atattttcat tttaaaccaa gaagattatc 2160 ttttgaccag aaaaacaaag ttgaggtaaa aataaaagag cttttgaaag atggaatcat 2220 taaagaaagt aattcaccct atgctagtcc aatagtaata attcctaaaa aaggtaatga 2280 ttttagatta tgtgtagatt ataggaaact aaataaaaat accgttagag ataactaccc 2340 tttgcctgtt attgacgatt taattgatac actgaattat aaggaaatat tttcgattat 2400 tgatctaaaa tctggttttc atccgattag aatagcaaca gagtcaacta agtacacttc 2460 ttttgtaacg ccaaatgggc aatatgaata tttgcgtgtt cctttcggat tgtgtaacgc 2520 ccctgcagtt ttccaaagat ttataaacaa aatattcaaa gaattgattg atgatggtaa 2580 aattatagtt tatattgacg atattcttat tgcaagtaca agttttgatg aacacataga 2640 tactttagga aaagtttttc aaatattaaa tgataattta ttagaaataa atttgagtaa 2700 atgtcaattt cttttcgaag aaattgatta tttgggatac accattaata aatttggaaa 2760 gaaattgaat aagtctcatg ttgatagtat atctaatttc ccagttccaa aaaataataa 2820 agacgttcaa atgtttttag gtctcacaag ctatttcagg aagtttgtac acaatttctc 2880 aaccatcgcg agacctttgt atattctttt gaaaaaagat gcaaaattcg attttggtaa 2940 agaacaatta gatgctttta actcattgaa aaataaattg atttcgtcac cgatactagc 3000 tactttcgat ccacaggctg aaacacaact tcattgtgat gcaagctctt ttggatttgg 3060 agctattttg cttcaaaaac aaaaggacaa caactttcat cctgttagtt attttagtag 3120 aaaaacagac gaatttgagt caaaattgca cagctttgaa ctcgaaacat tagttgtggt 3180 ttatgcaata aaaagatttc atatttattt agccggtaca cattttaaaa attttacaga 3240 ttgcaatgca ttagttcaga ctttgtcaaa aaagaatgta aatccgaaga tttgcagatg 3300 ggcattgtat ttagagagtt atgattttga tttagagtac agagatggct caaaaatgca 3360 acatgttgat gctttaagcc gtcaaagtac tatagttata aaacaattta ctgaggagga 3420 attagtcaat aaaaaagaag ataaattaag caaagactac tatgtaaata aaactgttaa 3480 tatttaaaat gattgtgata ttgaacaaaa tattattgtg gcgcaagaac aagatgttaa 3540 actgttgaat attaaagaat tacttgagcg atctacattt ccagattttg agctaatcaa 3600 tggagttctt attagaaaag aaggtcagaa acgtttatta gttgttccaa aattaatgat 3660 tgatactgtc attaaactat gtcacgacaa ttgtggacac attggaatag agaaaacttt 3720 atatgaaata agaaagcaat tttggtttag tagaatgaaa aaaatggtga aaaattatat 3780 aaacaattgt ctgacatgta ttttttatag ccctgctggg agaaaagaag ggtttgtcaa 3840 tattattgat aaaggaaata aaccgttcga aactgttcat ttagactact atggaccaat 3900 acaataatct tcttcaaata gagcaaaata tattttggta attatagatg ggttcacaaa 3960 gtttacgaaa ttttttccat caaaatcaac aagctcagat gaaactattc gccaattgac 4020 aaattacatg aacttttaca gtaaacttaa cagaattatc acagatagag gtacttgttt 4080 tacgtcacaa aaattcaaag attttactac catcaatcaa atcgaacaca ttaaaactgc 4140 tgcgtatact cctgaagcta acggtcaaat tgagcgtgta aataaaactt taactcctat 4200 gcttgcaaaa atgtgcaatg aaacagctaa accgtggaat gttatcaaaa ttggaattta 4260 tatttattta tatttttact tttaataggt caataggtaa ttatcctagc gtgttattat 4320 ttggtgtgca acagaataat ttgggctgtg aaagtcataa tatttcgaat tttattgcaa 4380 ataggcagat gtcaactagt ttaacgagtt tgacagatat ccgagaaaat gctcaaagac 4440 ataatataag aagtcaactt tgtaacaaaa gaaatgcaga taagaaacga aaaaagagca 4500 cagtatatgc agagggtgat ttcatagttc tgaaaactgg cgaaacacat aaattagctc 4560 ctaaatatca aggtccgtat aaaattcata aagttttgcc gaacgatagg tacgttgtaa 4620 ctgatttaga tggatttcaa attgcaaata tcccattcaa ttctcctcaa aatatgagga 4680 aatggatgag tgattacata cctgacaaca gtgatatcga tgaggacatc gatagtgtca 4740 ggatgcccga ga 4752 // ID Gypsy-257_AA-I repbase; DNA; INV; 5602 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-257_AA_; KW Gypsy-257_AA-LTR; Gypsy-257_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5602 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1115-1115 (2011). XX DR [1] (Consensus) XX CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1084..2856 FT /product="Gypsy-257_AA-I_1p" FT /translation="MSINPTSEPFIPGTIPFSQYLEQLEWMFEHNNYTEDR FT FKTSFLAVCGTEVFSQLKLLFPGQNLKDLTYKQITDKLKQRYDKKDSDVIH FT SYKFWTRRQGQFEKAVDFVLDVKNLAELCEFGEFKDRAIRDVLVIGIYDRN FT LQKRLFDEEDLSVAKVEKMIVNQEIASDRTQFLRKDDGVIARLGRRPNRAP FT RRSNFNGRGRSRSDSRNRSFSFRSKSGGSYDRKSSYTNKEFLCSYCKKKGH FT TRKYCYKLKNKSPRKFQSSVKFMDSPKPSVSGSSGLFKRLKEDMQSDSEDD FT APCLMISSVNRINEPCYVDVKVENKRITMEIDSGSAESVISEDLFLRSFKH FT LPVKTCNKRLVVIDGKKLKVLGKVEVSVKLGDIQEKLYLIILRCENSFVPL FT VGRTWLDIFYVGWRNAFARPTATMEYISAINDDETVLDLKRKFPKVFDKEL FT SNPIVGFEGDLVLKDNTPIFRKAYEVPLRLRQKVLEHLDGLERDGIITPIE FT ASEWASPVVVVVKKNQDIRLVIDCKVSINKVIVPNTYPLPLAQDLFATLAG FT SKVFCSLDLTGAYTQLLLSKRSRKITQRNKIDTFACLKDQIMCF" FT CDS 3134..5455 FT /product="Gypsy-257_AA-I_2p" FT /translation="SPGKTLKTVKKKLYLVLERLSKANIKVNFKKCKFFVK FT DLPYLGHILTDKGLLPCPDKVETIREAKAPQNVSELKAFLGLVTYYAKFIP FT NLSTRIKCLYALLRKNTKYIWDCECEKVFNECKHFLLKPNLLEYFDPEKPV FT VIVTDACSYGLGGVIAHQVKGEERPISFTSFTLNDAQRKYPILHLEALAVV FT CTVKKFHKFLYGKKFTIYTDHKPLIGIFGKNGKNALSVTRLQRYVLELSIY FT EYDIVYRPSSKMGNADFCSRFPTAHEIPKEIAREYIKSLNFTREFPLDYKE FT VAKESSRDTYLLKIFEFLRKEWPSRIDRNYKDIYSHHQDLEEIEGCILFQD FT RVVIPRVMKLKILKMLHMNHSGINKIKQLARRTVYWFGMNQDIEDYVKTYR FT ICLETTTLSKKPPYSKWIPTNKPFSRIHADFFHFDKKVFLVVVDSFTKWIE FT VEHMRNGTDHKKVIKVFLNIFARFGLPDVLVTDGGPPFNSEIFINFFQKQG FT IVVLKSPPYHPESNGQAERTVRLVKNVFKKFLVDPQIRTLDTDEQISYFLI FT NYRNICLETDGKFPSERMLSYKPKTLVDLINPKHNFKEKLANTHDDHQSGF FT VKAGSTSDPFNNLKCGDLIYYKNNNPTDIRQWLPAKFLKRISTNISQVSLG FT GRVLTAHKRQLKVVPASRRKTPRSFVFHGERDFGPTEPNKQQMQIEPPSVR FT LNRKRRREDDDEEQSNASDASSDFYGFPADSFIFSGEEITTASHHQTQAED FT LGNGSRRSKRKFKKKKKEGYIYY" XX SQ Sequence 5602 BP; 1713 A; 877 C; 1254 G; 1757 T; 1 other; gtgggacgag gataaagata gttttttttt tcaaatttta accgcgaatt gaattattaa 60 gttcggttta aagtaagttc gcgagtgaaa ggagataagt gaatttaaga aggaagattt 120 gcgtgcaaac ggccttttgt aaattgtgag ttttaccaca gtttttaccc ttcttgtagt 180 gcaatctttt ggcgtggcgt cagtttttta attttccttt gataaaagta actggtgttc 240 ttcgggcagt agataggcta gacctttctt tttgcttgat tttaccggat ttgtggagcg 300 ctgaaagaaa gaaatttgga aagtgtgtaa aaagaaattt acagagaagc gcatttattt 360 tgagaaaatc gccattttgg aagcctacaa tttagtggat ttcttttgtg agaagataat 420 aggcggatta ttaaatcatt ttttgtgtct tcttttgttc acacagcgag catcgacaca 480 aagcagttga gtgaccgctg ggaataaagc gttttttcat catttgtgtg gtgtttggtt 540 ttgggctctg agagagtgag ttcctagtga gataccacca gtgagtagtg agcgtgtggt 600 gaagaattgc agttttgtca ggggtgactg accttctttg gctgcatagc gaagtttaaa 660 aagcgcattg gaagttcatc gaccttattg gggttagctg ctgctgttgt tgctgaaagt 720 tgctgttgat aggaagtgct gttgttgccg gattttgttg ccgctggaga agtttgctgt 780 tttggtgaga ttggctgctg tttaccgccg cgattaaggc agatcgttca ttttgcatgc 840 tgtgactgtg agtgccttgc cagtttttgg atacgattgg ctttttcccg cagctagggt 900 agttgtgtgt attgtgcttc atctggaatc aggtatgtgg tattttcctt gatgcttatt 960 ttatcaattt ttactgtttt gtcgtgtgac tatttgaaag tgtttcattt ttcttgtgtc 1020 cgtataaaga tcagctattt attttgaatt ttgtgtattt tgattgtttt ttttattaca 1080 aggatgtcaa tcaatccaac ctcagaacct tttatcccgg gaaccatacc attttcgcaa 1140 tatttagaac aattagaatg gatgtttgag cacaataatt atacggaaga tcgatttaag 1200 acttcgtttt tagctgtttg tgggactgaa gttttttccc agctaaagct tttatttccc 1260 gggcagaatt taaaggatct tacgtacaag cagattacgg ataaactgaa gcagcgttac 1320 gacaaaaaag actccgatgt gattcacagt tacaagtttt ggacgcgtag acaaggacag 1380 tttgagaagg cagttgattt tgtgcttgat gttaaaaatt tagctgagct ttgtgagttt 1440 ggtgagttca aagacagggc tattcgtgac gttttagtaa ttggaattta tgatcgaaat 1500 ttgcaaaaac gtttgttcga cgaggaagat ctttctgttg ctaaggtaga gaaaatgata 1560 gtaaatcaag aaatagcttc ggacagaact caatttttga gaaaagatga tggtgttata 1620 gctcgtttgg gtagaaggcc aaatcgtgct ccgcgtagat cgaatttcaa tggaagaggt 1680 agaagtagga gtgacagtag aaatcgatca ttttcattca ggagcaaaag tggaggaagc 1740 tatgacagaa agagcagtta tacaaacaaa gagtttttat gttcatattg caaaaagaaa 1800 ggacacacca ggaaatactg ttacaaactt aaaaacaaaa gtcctcgcaa atttcagtcc 1860 agtgtaaaat tcatggattc gcctaaacct tctgtatcag gttcttcagg attgttcaag 1920 cgactgaagg aagatatgca gtcagattca gaggatgatg caccttgttt gatgatttct 1980 tctgtaaaca gaattaatga accatgttat gtagatgtca aagttgaaaa caagagaatt 2040 acaatggaga tagacagtgg ctcagcagaa agcgtcatat ctgaggatct gtttctgcgt 2100 agctttaaac atcttcctgt gaaaacgtgt aacaagcgtt tggtagttat tgatgggaag 2160 aaactgaagg ttttaggaaa ggttgaagta tccgtgaagt taggagacat tcaggaaaag 2220 ctttatctga taatcttacg ttgcgaaaac agttttgttc ctttggtggg gcgtacatgg 2280 ctggacattt tttacgttgg atggaggaat gcttttgcga gaccgactgc gaccatggag 2340 tacatcagcg caattaatga cgacgagaca gttcttgatt tgaagcgtaa gtttccgaaa 2400 gtatttgaca aggaattatc taatccgata gtaggtttcg agggtgattt ggtattgaaa 2460 gacaatacac ctatttttcg taaggcatat gaagttcccc tccgtttgcg acaaaaagtt 2520 ttggaacacc tagatggttt agaaagggat ggtataatta caccgatcga ggccagtgaa 2580 tgggcttcac cggttgtagt tgttgttaag aaaaaccaag acatacgatt agtaatcgac 2640 tgtaaagtct caataaacaa ggtgattgtt ccgaacacat accctttgcc attagctcag 2700 gatctttttg ccactctcgc aggttccaag gttttttgtt cattggattt aacgggggct 2760 tatacacaat tgttgctatc aaaacgatca aggaaaatta cacagcgtaa caaaattgac 2820 actttcgcgt gtctcaagga tcaaattatg tgtttctagt agattttggg tcgctgaatc 2880 tgatgccgtt gtcagaaatg tagcagcacg tcacaatgtt tagctacagg tcgccaaagt 2940 tgtataaaac actggtttta ttgatgttta catgaaattt aaagtgttat gatcattaat 3000 acaattaaag gactttatgc atacaaccgt ttgccacagg gggcatcctc aagcgcatcg 3060 atatttcaga aggttatgga tcaagtgctt caagggctag aaggtgtatc gtgttatctc 3120 gatgacgttt tgatctccgg gaaagacttt gaagactgta aaaaaaaaac tttatttagt 3180 cttggagcga ttgtccaagg ctaatataaa agttaatttt aagaaatgca aattttttgt 3240 aaaggattta ccgtatttgg gacacattct gacggacaag ggattacttc cttgcccgga 3300 taaagtggaa actatccgcg aagcgaaagc tccgcagaat gtttctgaac ttaaagcttt 3360 tttaggtttg gttacttact atgccaaatt cattccaaat ttgtctaccc gaataaaatg 3420 tctttacgct ttgctacgaa aaaacaccaa atacatttgg gattgtgaat gtgagaaggt 3480 ttttaatgag tgcaaacatt ttttattgaa accaaatctg ttggagtatt ttgatccaga 3540 gaaaccagtt gttattgtga cggatgcatg ctcctacggc cttggtggag tgatagcgca 3600 tcaagtgaaa ggtgaggaac ggccaattag cttcacttca ttcacgttaa acgatgctca 3660 gcgaaaatat ccgattttac accttgaggc attggcggtt gtatgcactg twaaaaaatt 3720 ccataaattt ttatatggga aaaaatttac catttacact gatcataagc cgctcatcgg 3780 aatttttggc aagaatggga aaaatgcact ttctgttaca cgactacagc gatacgtgtt 3840 ggagctttct atatatgagt atgatattgt atatcgtcct tcctcgaaga tggggaacgc 3900 ggacttttgt tcgagatttc caacggctca tgaaatacca aaagaaatag cgcgtgaata 3960 tataaaaagt ttgaatttta cgcgtgaatt tcctttggat tataaggagg ttgcgaaaga 4020 atctagcaga gatacatatt tattgaaaat ctttgaattt ttgcgaaaag aatggccgag 4080 ccgaattgac agaaattaca aagatattta ttctcaccat caggatttag aggaaattga 4140 agggtgcatt ttatttcagg accgagtagt aatacccagg gttatgaagt taaagatctt 4200 aaaaatgttg catatgaatc actccggtat taacaaaatt aagcaactgg cccgtagaac 4260 ggtttattgg ttcggtatga atcaggacat tgaggattac gttaaaacat acagaatttg 4320 tttagaaact acaactcttt caaaaaaacc tccttactcc aaatggattc ccaccaacaa 4380 accatttagt agaatccacg cagatttttt tcattttgat aaaaaggtgt tcctagttgt 4440 agtggatagt ttcactaagt ggattgaggt ggaacatatg agaaatggaa cggatcataa 4500 aaaagtgata aaggttttct tgaacatttt tgcccgtttt gggcttccgg atgttcttgt 4560 gacagatggc ggtccaccct ttaactccga aattttcatt aattttttcc aaaaacaagg 4620 aattgttgta ctgaaaagtc ctccgtatca cccggaaagt aatggtcagg ctgaaagaac 4680 agttcgtttg gtgaaaaacg ttttcaaaaa gtttttagtc gatccacaaa taaggacatt 4740 ggataccgat gagcaaattt catatttctt gataaattat agaaatatat gcttggaaac 4800 tgatggaaaa tttccatctg agcgaatgtt atcgtacaag ccaaagactt tagttgattt 4860 aattaatcca aagcataatt ttaaagaaaa actagctaat acgcacgatg accatcaaag 4920 tggttttgtt aaggcaggta gcacatctga tccttttaac aacctgaaat gtggagatct 4980 gatatattat aaaaataaca atccgactga tatacgacaa tggttgccag ccaaatttct 5040 aaaacggatc tctacaaata tttcacaggt ttctttggga ggcagagttc tcacagcgca 5100 caaacgtcag cttaaggtgg taccagcttc tcgccggaaa actcctagaa gttttgtttt 5160 ccatggagag agggattttg ggccaaccga gccaaataaa cagcaaatgc aaattgaacc 5220 accatcagta agacttaata ggaaacggag aagagaggac gatgatgaag agcaatcgaa 5280 tgcttcggat gctagcagcg atttttatgg gtttcctgct gactcattca tcttttcggg 5340 agaggagatc acgacagcaa gccatcatca gactcaagcg gaagacttgg gaaacggatc 5400 acggcgttca aaaagaaaat ttaagaagaa gaagaaagaa ggatacatat actattgaaa 5460 tagtgatcga atatttgaat ttgtcttaat atgtttttgc atgaaaattg aattttcgtt 5520 ctgagatcgt aattttataa tatatcttga aacgttcgaa ggtttagaaa tatttatagt 5580 ttttgcttaa gggttgagga gc 5602 // ID Transib-2_HM repbase; DNA; INV; 3140 BP. XX AC . XX DT 29-JAN-2008 (Rel. 13.01, Created) DT 06-FEB-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3140 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 2-2 (2008). XX DR [1] (Consensus) XX CC Transib-2_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome just a few CC million years ago (they are ~3% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of ~20 copies; it codes for a 671-aa Transib CC transposase. Like other Transib transposons, Transib-2_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC Hidra is a first species whose genome is massively colonized by CC Transibs. Approximately 1% of the genome is made of recently CC fossilized Transib transposons. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 872..2884 FT /product="Transib-2_HMp" FT /note="Transib transposase." FT /translation="MQTISRLDFYNSIQQQEAPNFNEKMDLADQNLLSRNQ FT YTEELKTVYKRQMSRLKAIFRKMWSSSSRTHEIFIKKHGPWLQGSFQIPGT FT PSTPSSAGRPRKSFVDSSERSKRRKTEQLRKDVEPEAIVFAAETCLTTSGK FT RSAATVLKDLKTSPKRAGKYKKAYRSSLEPTKKSLTPLEALLIFTDAGLTK FT SQYEIVRTSDKKQWPCYSILVEEKKTCYPDPDSFTVTESLAEVDLQSLLNH FT TAERLVLYLEEVIKTLTESGRKHMKLSTKWGCDGSQQTQFNQKFEDDSNSD FT ASIFQCSMVPLQLTCGVNKKVIWQNPTPSSPRFCRPIRIRFIKETADVIAE FT EIAYMENKIAGLKPTQLQDIAVEHVMMLTMIDGKVCNAATTTKSTMRCYIC FT GATSRQFNDLDRTCVDNQSSYKFGLSILHARIRCFESLLHLSYKLPVKKWQ FT KRLNEDEKRLISDRKKKIQDEFKSKMGILVDIPKVGFGNTNNGNTSRRFFN FT DPETSADITGVDFRLISRMKIILEVISSGHKIDVEKFRQYTLDTAKLYVEL FT YPWCPMTPTLHKVLVHGPDIVENALLPIGQLSEEAAEARNKHFREYRLSYA FT RKFSRKDCNQDIINRLLLTSDPLLSSIGKKKHKLTKPFSKEAVELLLPGEV FT QDSDTENNENEEYSDNDEVESN" XX SQ Sequence 3140 BP; 1121 A; 523 C; 538 G; 958 T; 0 other; cacgtaggag tatttgacaa ttttagctgg ccaaaattgg aattattaaa agtggccgta 60 cccacttaat tacctaccct tattatctct cgataatggt cgcttaatga gcagttaatg 120 gtttggttat aaaagtagta attggaacta cacagtctag ttaaatatcc gatctatatc 180 tgcgtggttc attttattta cgagcgattt tagtggtgtt tattgaagct gtgtacaagt 240 gtttacaaca gaagttttac agtgttatat catggatgaa tctatgacag gtatgtacat 300 tatttactct tactaccgtt ttgaaatatt aattaccaca aaagttatgt accatgaaac 360 ttagcatttt tttaacgaat tttaaagtaa attttgactt taaaattgaa aaagttgtag 420 tgcgcattga aaccctttta gtttttttcg tttttctatg agtttatacc actagctatt 480 taaaaatctt aatttcaact gcctgttgat gccatctaaa ttttaatcat gttctaagca 540 catcaaggtc agggttatat atatatatat atatatatat atatatatat ataatatata 600 tatatatata tattatatat atatatatat atatatatat atatatatat atatatatat 660 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 720 atatatatat atatatatat atatatatat atatatatat atacatcaac tgcctgttga 780 tgccatctaa attttaatca tgttctaagc acatcaaggt ccaagatgaa tttttcaact 840 taattttttt tttctagatg ggtcatcagg aatgcaaaca atctctcgtt tggatttcta 900 caacagtatt cagcagcagg aggctccaaa tttcaatgaa aaaatggact tagctgatca 960 aaatttgtta tcaagaaatc aatataccga agaactaaaa acagtataca agcgtcaaat 1020 gtcgaggttg aaagctattt ttagaaaaat gtggtcgtct tcatccagaa ctcatgaaat 1080 atttattaag aagcatggcc cttggctgca agggagtttc caaataccgg gtaccccatc 1140 cactccatca tcagccggac gaccaagaaa aagttttgtc gattctagtg aacgttcaaa 1200 gaggagaaaa acagaacagc tcaggaaaga tgtggaacca gaagctattg tttttgcagc 1260 tgaaacttgt ttgacaacaa gtggcaaaag aagtgctgca accgtcctca aagatcttaa 1320 aacatctccc aaacgagcgg gaaagtataa aaaagcctat cgcagcagtc ttgaacctac 1380 aaaaaaatca ttgacaccgt tagaggctct tttaatattt actgacgcag gcctaacaaa 1440 atctcaatat gaaatcgtac gtacaagtga caagaagcaa tggccctgct acagtatttt 1500 ggtcgaagaa aaaaaaacat gctatcctga tcctgattcg ttcacagtca cagaaagttt 1560 agctgaagta gacttacaaa gtctgttaaa ccatactgcc gaaagattgg tgttgtattt 1620 agaagaagtt atcaagactc taacggaaag tgggcgtaaa catatgaaat tatctacgaa 1680 atggggttgt gatggctccc agcagacaca gtttaatcaa aagtttgagg atgactctaa 1740 ttctgatgcc agtatatttc aatgctcaat ggttccatta cagctaactt gtggagtaaa 1800 taaaaaggtt atctggcaaa accctacacc atcgtctcca aggttttgca ggccaattcg 1860 aattaggttc ataaaagaaa ctgctgatgt aatagctgaa gaaattgctt acatggaaaa 1920 caaaattgca ggcttgaaac caactcaact tcaagacata gcagtagaac atgtgatgat 1980 gttaactatg attgacggaa aagtctgtaa cgcagcaacc accactaaat cgaccatgcg 2040 atgttatatt tgtggagcaa catcaagaca atttaatgat ttggacagaa catgtgttga 2100 caatcaaagc agctacaaat ttggactatc cattctgcat gctagaatac ggtgttttga 2160 aagtctcctt cacctatcat ataaattgcc tgtaaaaaaa tggcagaaaa ggctgaatga 2220 agatgagaag agactcatct cagataggaa aaagaaaatc caagatgaat tcaagagcaa 2280 aatgggcatt cttgtcgaca tacctaaggt tggatttggg aataccaata acggcaacac 2340 aagcaggaga ttcttcaacg acccggaaac atctgcagat atcactggag tagacttccg 2400 tttaatttcc agaatgaaaa ttatcctaga agtaatttcc agtggccaca agattgatgt 2460 agaaaagttc agacagtata ctttggatac tgcaaagctt tatgttgaac tatacccctg 2520 gtgtccaatg actccaactc tgcataaagt tttagtacat ggtcctgata tagtagagaa 2580 tgcgcttttg ccaattggac aattgtcgga agaggctgcc gaagccagaa acaaacattt 2640 ccgggagtac agacttagct atgctagaaa attttctaga aaagattgta accaagatat 2700 catcaatagg ttattgttaa catccgaccc tctgttgtcc agcattggga aaaagaaaca 2760 caaacttaca aagccttttt caaaagaagc tgtggaattg cttcttcctg gagaagtcca 2820 agattccgac actgaaaaca acgaaaatga agaatattct gacaatgacg aagtggaatc 2880 taattaaata tcaaaatata gtatgttatt taccataata aagtttgaaa aataatatat 2940 cttttatttc ctcaaatcct attgaataga ataaaatgta atccctaaag tacccatctt 3000 tcccctatat tgaacgccac tggcttggat ttgtacaaaa aacatttaat ttacaatttc 3060 ttgtcatatg gtaaagtttt cagctgttaa aaagagagaa ataattttgg ccagctaaaa 3120 tcgtcaaata ctcctacgtg 3140 // ID Gypsy-120_AA-LTR repbase; DNA; INV; 147 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-120_AA_; KW Gypsy-120_AA-I; Gypsy-120_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-147 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1004-1004 (2011). XX DR [2] (Consensus) XX SQ Sequence 147 BP; 59 A; 21 C; 26 G; 41 T; 0 other; tgatatgaga taccaagatc ctcactgagt gaccacaaac atgtataaaa agcgaatgaa 60 taaacgatga tgagtgagtc tataaaagtc tatctttgag atcgttactt taaagttaga 120 aagcttaacc ttatagtgaa tataaca 147 // ID Merlin-1_Aplcal repbase; DNA; INV; 3076 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Merlin; DNA transposon; Transposable Element; Merlin-1_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3076 BP; 876 A; 425 C; 877 G; 897 T; 1 other; ggctggatcc aagcagatcg ggtcagcggg gccgaaggcc ccatcgcggc cgatgaatgc 60 gcggggattt cgggattata ttacggctat aaatggaagg gatggctccc tatatagaac 120 gtattggtca aaaaagacaa aaacgggcgt tttggagttt ctagacttaa aaaatgaatt 180 gagcgacccc acaaaatgct aaagtgtaac tttagctgtt cgggaaattg aaatgagaac 240 ttgaaatgct cgttaaggat tgcaaaaaaa tatttacgga ttgggtcgga actacttttt 300 gctccctaaa tttagctgac ttgggaactc tctccggaac tacttctggt caaatctgtt 360 attctgtaac tgtaatagta ccgttagtta aattaggtca atattttatg attgtgtgag 420 atagtaatta tctgcagttt aactagctac aattaataaa gaaaggtttt tttttgcatg 480 taatgggttt agaatgttat atgctatcaa attgtgaaga aaatatttga agttacattc 540 ttatatatgg ggaaaaacga tcataagagg agtaaataga aatcaatgca tggcttatta 600 aggctttgtt tatggtacct tattaatgat gagagttata tgacgctaag aggtgcaata 660 tttaggacaa tgacgctaat caacgcatgt aaatagggca agggtatata aagcgtgagt 720 actcacgcca agctttagtt tctgcattgg tgccaagctt tagtttctgt attcgtacca 780 tggtgtttac tgtgtttgac cttcatgttt tgatttctaa taaagctgtt ttacgggact 840 ggttgcgtta taataagtta ttactggatt tttctgattg tgtttgtgac tgtggtgcaa 900 agtactgtga atttgcagac agtggttatc cagaaggatt ttgttggagg tgttctaata 960 gagtctgcag gaagaagttt tctattaggc ttggatcctg gtttgaggat tccaagttga 1020 gttttgagac aattttaaaa ctgacatatt tttggtgcaa ccagtattcc aacaagttgt 1080 ctgctagaga agctcaagtc agtgaaaata cagttgttga ctggtttaac ttctgcaggg 1140 aagtgtgtat tgacgtgttg caggagtgcg agtatcagag cataggagga gagggtgtat 1200 tggtggagat tgacgagtct aaattcggtc ggaggaagta caataggggt aagcgtgttg 1260 atggtgtttg ggtgtttggt gggattgagt cgagagacaa aagtaggtgt ttttttagtg 1320 ttgtggagaa caggagtgca gacactttga ttccgttgat tcagaaacac atcaagcctg 1380 ggagcattat tgtttccgat gaatggaaag gatattcaac gttgaagaac attggctatc 1440 aacattacac tgtgaatcac agtagggagt tcaaaaacgc cgagacaggc gcccatatca 1500 accacattga gacaacttgg aaccaactga aagctgttaa aaaacattca ggttatgcaa 1560 aaactttaat aagtacatat tttagtgagt tcattttcag gaggaaattc ttaaatgaat 1620 cagaggaccc ttttgttgta tttattaaag agggcatcaa tgcagtgtat tctttggcgc 1680 gtgcccgtga caaattgaaa aagaaacagg agagaaagag aaatcagcag gaagaaccag 1740 ggaatctgtg ggaagaccca aagcggggta gaagcagcac agcaactgca acgtacacgc 1800 gggccgagat agaggcggct gagacagaga agcagaagaa ccaggaaacc gaagcaaaca 1860 cccaaagcgc ggtagttttg tccatggatg caacctacac gcgggcggag atagaggctg 1920 ctgagactgc tgtgtcgtcg gattctgaat ctgaagcatc ccgttcactt tctgtggttt 1980 tgtcaatgga tgaatcagat gaggatgctc ctgtttttga tttgggagta aaaatcgatg 2040 atgatagcag cagcaaggac agcctaacac cttttttgga ccaaatataa atggtaagtc 2100 cttccataag gggaggactt ccgtaacgac cactatgaga cagagcagaa ttgagggcta 2160 cgggtagggg cagatctagg ggtaggggta ggggtattga gggctagggg tagggttggg 2220 ggtcgaccac taggggtagg ggtaggagtg ggggtagggg taggggtagg ggtattgcta 2280 ggggtattgc taggggtagg gataggggta gggatagggg taggggtatt gagggctagg 2340 ggtaggggca gatctagggg taggagtagg ggtaggggta ggggtattgc taggggtagg 2400 ggtaagggta ttgagggcta ggggtagggg cagaactagg ggtaggggta ggggtagggt 2460 tagggttagg ggtagaccac nactatgaga cagagaagaa tggagggcta ggggtagggg 2520 taggggtagt atagggttag ggttaggcct cttctttata tttaattgtg tttgatgggg 2580 tgatcgctca ttacggaagt ccttacagtg gaaggtaagt tgtgagatat ttcattgaga 2640 tttaagatgg ttaaaaggtc ggggtagttg tcattcgaat aaccgtcagt aaatacaaac 2700 tatacacatt atttgattaa atgagctatt ataatgacta ggggctaggg aagtattcga 2760 gttgtagtga ttggtcaaac gcgcgtatga atacttgttg gaggatgtaa acaaagatgg 2820 cgcatgcagg gacattcgag tggatttttc taattctttt gtcgactttc gacgcagtgg 2880 gtgcttttat atcgtgttta gaactcaaat tatgtttttt gaatggatat tgtagaaagc 2940 gttttaattg ttaagtggtt aatagttttt ctttctcaat ttgaagggaa attaacactt 3000 ttgctggcgt attaatattt ttcccgattt ttctctatta tttcattgta ttgacccggt 3060 ctgcttggat tttacc 3076 // ID TTAA19_AP repbase; DNA; INV; 528 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA19_AP. XX NM TTAA19_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-528 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2086-2086 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 528 BP; 182 A; 83 C; 80 G; 183 T; 0 other; gggggctgca gcagatgaaa aaaaagtgac ttttagacca ataagctaga caaaactgga 60 agaatttggt ttgacagatt ttaattttga aaacggatta tgattgatca acaagtttac 120 tagggaatga tcgaggcgat tttgaatatt ttttttttca gctgtgatat ccaagaataa 180 aaatatcgaa aaaatatgat atttttgtat ttatattatt cattatgaca ctaacaataa 240 ttggaatatt caaaatcgcc tcgatcattc cctagtaaac ttgttgatca atcataatcc 300 gttttcaaaa ttaaaatctg tcaaaccaaa ttcttccagt tttgtctagc ttattggtct 360 aaaagtcact tttttttcat tgactggagc cccctccagc ccccttaagg tttattttat 420 gatacggaat ttaaatatat tgaaaaacat tataacaaac aaaattaatg tttagattgg 480 tcaaatctac attagttttg atttttgaca aaacctgctg cagccccc 528 // ID HERO-1_PP repbase; DNA; INV; 3967 BP. XX AC . XX DT 29-MAY-2009 (Rel. 14.06, Created) DT 29-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE A family of HERO non-LTR retrotransposons - a consensus sequence. XX KW Hero; Non-LTR Retrotransposon; Transposable Element; HERO-1_PP. XX OS Physarum polycephalum OC Eukaryota; Amoebozoa; Mycetozoa; Myxogastria; Myxogastromycetidae; OC Physariida; Physarum. XX RN [1] RP 1-3967 RA Kapitonov V.V. and Jurka J.; RT "A family of HERO non-LTR retrotransposons from the slime mold RT genome."; RL Repbase Reports 9(6), 1149-1149 (2009). XX DR [1] (Consensus) XX CC HERO-1_PP is a first family of HERO non-LTR retrotransposons CC found outside Deuterostomia. The slime mold genome contains >1000 CC copies of HERO-1_PP that are ~5% identical to their consensus CC sequence. XX FH Key Location/Qualifiers FT CDS 27..3626 FT /product="HERO-1_PP_1p" FT /note="contains the reverse transcriptase and FT restriction enzyme-like endonuclease." FT /translation="MVTTRSMTSDAGRQKLPQASKPQDKKVKPQRSRLKSK FT EPDQHTSYVDAPVPTNKQTGTRSCVCGYSHKFHHRVILHMRTCSSCKDSLT FT PEQQTKSLEANAEPSINPHSPSDTASPDCSDQDQQFSPYQTLDQDHFNKYT FT PQPKLKLPKPNDHTQWSSINKELEAVLHQRLNLEELHYRGITATVNAFLSI FT VHWFISIKCGTVNPAQTKKERQQEDKTRHRDQTKEMLRKLKAEIRKKWKKA FT KQTKDKEAQKEARAKFRKVITALNKLRKQEQQVEKELIASTSRREFKRDPW FT QFCESRLGQYSSKGTRSSRSPSFDKGHADAFFSATYKDVGRDKSYTTPPQL FT EKVPPPPKSPFNLAPPTLGDVKYLLHKKPNNSCPGWDAIPYLVYKRCPILQ FT RYIHSFFTRLWDEQVVPEGWQIAYIILISKTEDTSKADQFRPIALGNTLGK FT LFFTLVQARTTEYVLQNSYLDISVQKAFLPKMAGCVEHTQTLVSALRDAHQ FT SQRAITVTFLDLKNAYGTVHHSLIAYALKRYHFPHSILEIILSYYDLLVAQ FT VVTPNFSSKWVHYSIGLFQGCTLSTILFNITYNMLEEWLAPLNVPGYCFKK FT TELEVKHLFFADDTTFVTGRYRENQTLLDHTGDFFDWTETMQARPSKCMSL FT AYARVEKGHNHLVEVLEQKSYAAINPRLVLQREVIPVLFKQSFKFLGRRIG FT GTLDDEKQKQHLLSQCENMLESTHQLSLSGAMKLWIYNNYIVAVLSWQMMI FT YDHAKVWGKRLEEAANPYLKIWAGLAYSATPGMLYQPTEQKGLGLKSLSIV FT LSQLQVSKFHLLKHSQDPKVQALYNYKKEQTEGKKRWNAVSELEQLERTVA FT LNQLTKGSQTGRAGLGNQKPPKTKDPNKEHRREVLDACKKKELEERLIHLH FT SLAIQGQWLKWDEHMTTDRTWTNLLYHLPQELFEWANNAQLRTLPTPDNLQ FT RWNRKTTGDCALCHTKNITLLHILNCCSYSLHHNRYTWRHNGILKLIAQVV FT ESNLNSLYKPKATQHITFIKAGEKAAKSTPKDPVGILSLTNDWKMSVDLDR FT DTYSFPAHITQTSLHPDILIWSDKVKHILFLELTVPLEENTAGAQIRKLKR FT YHDLEQACKANGYQTHSLTLEVGSRGWVAPSVGTCMRKLGIPRDQIKSLIR FT TLSNTALCMSYLIYVNRENRSWKPWEWKVSLLVGEQPALSS" XX SQ Sequence 3967 BP; 1264 A; 1029 C; 812 G; 862 T; 0 other; caacgttgga ttaccaacaa cacattatgg ttacaacacg atcaatgacc tcagatgctg 60 ggcgccaaaa gctaccacag gcaagtaagc cccaggataa aaaagtgaaa cctcagcgct 120 ctaggctcaa gagcaaggag cctgaccaac acaccagtta tgtggatgcc cctgtcccta 180 ccaacaaaca gactggcacc cgcagttgtg tttgtggata ctcacacaag tttcaccatc 240 gtgtcattct ccacatgaga acctgctcat cttgcaaaga ctctctaacg ccagagcaac 300 aaactaagtc tctggaggcc aatgctgagc cgagtataaa ccctcacagt cccagtgata 360 cagcctcacc tgattgttct gaccaagatc aacaattctc accctaccag accctagacc 420 aagaccattt caacaagtac actccacaac ccaagctcaa actccccaag ccaaacgatc 480 acacacagtg gtcatcaata aataaggaac tggaagcagt cctacatcag cgactcaatc 540 tcgaagagtt gcactacaga ggcatcacag ccactgtaaa tgcgttcctt agcattgtgc 600 actggttcat cagtatcaaa tgtggtacag tgaaccctgc acagacaaaa aaagagagac 660 aacaagaaga taaaacccga catagagacc agacaaaaga gatgctccga aagctaaaag 720 ctgaaattcg aaaaaagtgg aagaaggcca agcaaaccaa agataaggag gcccagaagg 780 aagcacgagc taaatttcga aaagtgatca ctgccctcaa caagctgaga aaacaagagc 840 aacaagtaga gaaggagctg atagcatcca cttcacgtag agaattcaaa agagatccat 900 ggcaattctg tgaatccaga ttgggtcaat attcaagtaa aggaacacgg agcagcagat 960 cccccagttt cgataaaggt catgctgacg ctttcttttc agctacttat aaggatgtag 1020 gcagagataa gtcctatact acacctccac agcttgaaaa agtcccaccc cctccaaaaa 1080 gcccattcaa ccttgctcca cctacattag gtgatgttaa atacttgctc cacaaaaagc 1140 ccaataattc ctgcccagga tgggatgcca ttccttatct tgtatataag cgatgcccaa 1200 tattgcaaag atacatccac tctttcttta cacgcttatg ggatgaacaa gtagtccctg 1260 aagggtggca aattgcttat atcatcctaa tcagcaaaac tgaagatacc agcaaggctg 1320 accaattccg ccctattgcc cttggaaaca cactaggaaa gctcttcttc accctagtcc 1380 aagctagaac cactgaatat gtcctgcaaa acagctatct tgacatctca gttcagaaag 1440 cgtttcttcc caaaatggca ggctgtgtag aacatacaca aacccttgtc tcagccctac 1500 gagatgccca tcagagtcaa agagcaatca ctgtcacctt tctagatcta aagaatgcct 1560 atggaacagt gcaccactct ctcattgcat atgctctgaa gcggtaccat ttccctcact 1620 ctatactaga aatcattcta tcctactatg acctcttagt agcacaagta gttactccca 1680 actttagctc caagtgggta cactatagca ttggcctatt tcagggttgt acactctcca 1740 ccatcctatt caatatcacc tacaacatgc tagaagagtg gctggccccc ctcaatgtcc 1800 cagggtactg cttcaagaag acagagttgg aagtcaaaca cctcttcttt gctgatgaca 1860 ccacctttgt cacaggacgc tatcgagaaa atcaaaccct cctagaccac acaggggatt 1920 tctttgactg gacagaaacc atgcaagctc gcccctccaa gtgcatgagt ctagcctatg 1980 caagggtgga aaagggtcac aatcaccttg tagaagtcct tgaacaaaag agctatgcag 2040 ccattaaccc acggctagtc cttcaaaggg aagtaattcc agtccttttc aagcaatcct 2100 tcaagttctt gggaagaaga attggaggca cactggatga tgaaaaacag aagcaacacc 2160 tcctaagcca gtgtgagaac atgttagaaa gtacacacca gctatccctc agtggtgcta 2220 tgaagttgtg gatctataac aactatattg ttgcagtact atcatggcaa atgatgatct 2280 acgatcatgc taaagtatgg ggaaagcgcc ttgaagaggc agcaaacccg tatctaaaaa 2340 tctgggcagg tcttgcctac agtgcaaccc ctgggatgct ctaccaacca actgagcaaa 2400 agggacttgg gctgaaaagc cttagcattg ttttgtccca gctacaagtt agcaagttcc 2460 acctgcttaa gcacagtcaa gacccaaaag tgcaagctct ctacaactac aagaaagaac 2520 agacagaggg caagaagagg tggaatgcag tatcagagtt agagcaactg gagcgaacag 2580 tagcactgaa tcaattaaca aaaggaagtc aaactggaag agctggacta gggaatcaga 2640 aaccacccaa aaccaaagac cccaacaagg aacataggcg agaagtgcta gatgcttgca 2700 agaaaaagga gctagaggaa aggctgatcc acctacacag tttagcaatt caaggacaat 2760 ggctcaagtg ggatgaacat atgacaacag atagaacttg gaccaatctg ctctatcacc 2820 tccctcaaga gttgtttgag tgggcaaaca atgcacagct tcgcacacta cccacaccag 2880 acaacctgca acgctggaac aggaagacca caggtgactg tgcactctgt cacaccaaaa 2940 acatcacact cctccacatc ctcaattgct gcagctattc actacaccac aatcgctaca 3000 catggcgaca caatggcatc ttaaagctga tagcacaggt agtagagtca aacctcaact 3060 ctctctacaa accaaaggca actcaacaca ttacctttat caaagcagga gagaaggcag 3120 caaagagcac ccctaaagac ccagtgggta tcctctccct cacaaatgac tggaaaatga 3180 gtgtagatct tgatcgagac acatactctt tcccagctca catcacccaa acctctctcc 3240 acccagacat cctcatctgg tcagataagg tgaagcatat tctcttcctt gagctcacag 3300 ttccacttga ggaaaacaca gctggggcac aaatacgaaa gctcaaaagg taccatgacc 3360 tagagcaagc ctgtaaggca aatgggtacc aaacacactc ccttactttg gaagtaggga 3420 gcagggggtg ggtagcacca tcagttggga cgtgtatgag gaaactgggt attcccaggg 3480 accaaatcaa atcccttatt cgcactctct caaatacagc cctgtgtatg agctatctga 3540 tctacgtcaa ccgagaaaac cgctcctgga aaccatggga gtggaaagtc tctctccttg 3600 tgggtgaaca acctgcatta tcttcttaac tctcaacaac cccctggctt ggggacactt 3660 tgaaacaggg accaagtaat gaaatggagt cttggaccaa cttctgcgag gtggggcagc 3720 agtttgtttg tttgtttgtc agtgagagac tggctcattg actatagcag gtagaggacc 3780 ttcctggcat tcctcacgcc taggcgaccg ctgggaaggg ggaggacgcg tgaggcaccg 3840 agcacctccc acctctacct gttttctgcc agtactagta gcatgtacac gtgcgcagca 3900 atagcacgct gctgatttgt ggctactagt atgtatctgt acgcttctgt actttctaca 3960 ctttctt 3967 // ID Gypsy-7_AC-LTR repbase; DNA; INV; 270 BP. XX AC AASC02007709; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_AC_; KW Gypsy-7_AC-I; Gypsy-7_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02007709; Positions 15190 15459. XX SQ Sequence 270 BP; 77 A; 62 C; 73 G; 58 T; 0 other; tgtagggcca atattttagc ctttcacgtc tccaaaactt gactagtttt gggggcgact 60 ttctggccaa ttggttgtgc acgggtagta gtttttcccc gtgcactgat gagtacaaac 120 aggatcgacg ggagactaaa aactcaacgg gagaagagag agcgtcagtc ccaaccgtaa 180 cggagtcaac acacagcagt cgccgggcgc ccggggatcc acgagatcta cgccgttgga 240 agtgaaaggt aacataagat aaatactaca 270 // ID BEL-157_AA-I repbase; DNA; INV; 6378 BP. XX AC AAGE02018937; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-157_AA_; KW BEL-157_AA-LTR; BEL-157_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6378 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018937; Positions 24700 18323. XX CC Positions [5425-6009] - Integrase core CC 'GTATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1151..2911 FT /product="BEL-157_AA-I_2p" FT /translation="MERKLENLCDRLELIMEKMLRIKEALNQDAANNVHFL FT NLQLQTIQKSYEEYDSVYNEAVSLVAKDKKGEWKNDYLNFETLHAELYVTA FT QTRIADLQRGESNRALTLNANASEFSPRPQVVHSVPHLQVPLPTFDGKLEN FT WHSFKCMFQTVMGRYPNESPAIKLYHLKNSLIGNAAGKIDQEVINNNDYES FT AWRLLEDAYEDERLIIDTHIDALLDLPKLTRENGDEMRKLVETCTKHVDAL FT KNQDLPVEGLSEMILINVISKRLDRESRKLWESSLNRGDLPSFDDLIDFLK FT ERSRVLQKLTSYAHINQPQTTAKSGQLASTTKAKQTKMFVQTNKEACPCCS FT NPHIIYKCADFKKLSVAERFDKVKRMGLCFNCLRPGHRTVDCSSDQHCKAC FT AKHHHSLLHNERSEDPKKKVKATQPKNQVEAEKKEELSPAAEQRTVSCCTQ FT TRAAPKQIFLSTAKIVVFGSGNTTTTCRALLDCCSESNIITERLAKKLNMQ FT LLVINPLISICGLNGMETTANKVVQTKVSSRDGEFSAILDFIVTPSITELP FT TGKVETQSWPLPAGIELADPAFNVPDDVDVIIGAERTTMF" FT CDS 3112..6378 FT /product="BEL-157_AA-I_1p" FT /translation="MTATERAVEQHFAETHTRNINSRYVVRLPFNDLKSQL FT GDSYENAKRRFGRLMVSLAKNPSKQEAYKQFMKEYETLGHMQEVSDNPEHG FT YFIPHHAVYKDSSSTTKVRVVFDASAASSTGVSLNDTQLIGPTVQSDLVTI FT MLRFCTHQVVLTADVPKMYRQVQIHPDDRVYQRIVWLNDSNEMTTFELKTV FT TYGCSSAPYLATKALTQLATDEADAFPLAARVVKEDSYVDDFITGGKSAAD FT VKETYSQLTNMLSRGGFGVHKFCSNSAEVLQLIPPELQEKQVDFEMSEINN FT TIKTLGLIWNPSYDYFVFNVPLPAYNEARPTKRIVLSEISKLFDPLGFLGP FT VVTTAKLIMQELWRLKLDWNDELPDDQMQLWLAFREQLVAVRQVRKKRCVI FT PGNTVKIELLGYCDASKRAYGAVLYVRSELNDGTINIQLVCSKSRVAPLKP FT TTIPRLELCGALVLAQLVQKTIESLKVKFDSVTLYSDSMICLSWLKKSPAL FT LNEFVCNRVATIVELTQNYKWDYVKSENNPADVLSRGLYPEQLIEEELWWR FT AAPEQWQHGEIEDEAVPVLADDELPELRSSKCVLATTVTKSCNNWIRVSNF FT NRLQRAWAYVWRFVEITRTKKRNPVSNPITASELSRAAQTIFKIVQQEAFK FT EEFKYLQSTDKKRNDLSSLAPFVDDKGLLRVGGRLKYSNIPYEGKHQLLLP FT ERHPVVELLVRYFHEKNLHVGQNALISIIRQQYWPIKVKTTVKRITSRCFR FT CYKHKPQQLNQFMGQLPSYRVTPAPVFASTGVDFAGPFILRESGKKPKFVK FT AYAAVFVCMAVKAVHLELCTDLRSETFLAALQRFTSRRGIPSDLFSDNATT FT FTGAANELAELRELFRSQLHKDKLAEFCTTKGITWHFIPPRSPHFGGIWEA FT GVKAMKHMLKRVVGETRLTYEEMTTFLSEAEAIMNSRPMCPLSDDPNDFQA FT LTPFHFLIGRSGQAIPEPSYDQEKIGRLSRWQHVQYMRDHFWKRWSADYLH FT TLQTRQKWKDGVLDIKVGALVLLREENVPPQQWKLGRITATHPGEDKVVRV FT VTVRTASSEYKRAVTRICLFPDVESIDPTGGV" XX SQ Sequence 6378 BP; 1851 A; 1377 C; 1655 G; 1495 T; 0 other; tttttggtcc tatcgtgacc cggatatgga cagtttttgc ggttcccgga tggtttgatc 60 ggtttcgttt gtgatcgcga tcgcgagtgc aagaaagtgg tgaatttgca ccaagtgcaa 120 aaaattgcgc cgccattgtt gtaaaatcga tgcggcggtg tgcaagcaga aacgtgttcg 180 aaggagcgcg atgcttctag aagcacgata aacagtgaaa aagtgaaatg atccgcatgg 240 tttgcggatg atcgccaggc ggtcattgag tgtacggttc ccggtttggc gttccgcccc 300 ggtcggaact tacgtttcgg cacgtttgcc gaagtgaggt tatgaaagtg ttacacacat 360 agggtgtgct agtgaaatgc ttctgaaaga agcattggtg tgagtgtggt ttcccaccag 420 agtggtgaga ccacgtacat caacagcagc actaagctga caagaacaag tgtactgtgg 480 actgattgct gtgtgcatag ggcatgctag caaaagaaaa cgaagcgcaa ggcttcagaa 540 gaaaagtttt tagttgtgta tgtgggcact gtaccccagt ggtatggtga agaatggtgg 600 tttcacgcat aggacgtgct gacagtaatg ttcgaagcac aaagcttctg gtgaaagtgt 660 agttgtgtgc gccggagctg tctgcgtgtt gtggccgtac agaagcgtag gtgtgttgaa 720 ttgctgcaca catagggtgt tgccagcaac agcaaagaaa gcacgaagct tcaggaaaag 780 tgtggttatg tgcaaacttg aattgctaaa gtaatagtgc gaagattaca aaagaacaaa 840 cgacagccag tgtactcgaa aaaagaaaaa aaaacgaatg ataaatgagt gaacaaaaaa 900 agaattcaaa tgaagattat cgcgtaactc caagagtaac aagaagtgtt gtggcaagaa 960 gaaacctatt ggcagcttta acgggatcga gcgctcgtga ggacgctgct aatactgtgg 1020 aagaagaaaa tcagtgtaat ttggtggact caagtgttcg tacacagggc ttaaatccct 1080 tgcaaaggcc actagagctc agtgttgcat atcaaaaaga gctagatcga gtgaaaaaga 1140 gcaacgagaa atggagcgca aattggagaa tttgtgcgat cgtttggagc tcatcatgga 1200 gaaaatgctt cgcatcaaag aggcattgaa ccaagacgct gcgaataatg tacacttctt 1260 aaatctgcag ctgcaaacaa ttcagaagtc gtatgaagag tacgactcag tgtacaacga 1320 agcggtaagt ctcgtcgcca aggataaaaa aggtgagtgg aaaaatgatt acctcaattt 1380 tgaaaccctt cacgccgagc tgtacgtaac ggcgcagaca aggattgcag acctgcaacg 1440 gggtgaatca aatcgagctt tgacgctaaa cgctaatgcg tcagagtttt ctccacgacc 1500 tcaggtggta cacagtgtcc ctcacctcca agtgccgcta ccgacatttg acggaaaatt 1560 ggagaactgg cactccttca aatgtatgtt tcaaaccgta atgggtcggt atccaaatga 1620 gtcgccagcg atcaaattat accaccttaa aaattcccta atcggcaacg ctgcaggtaa 1680 gattgatcaa gaggtgataa acaataatga ctacgagtca gcatggcgtt tactggagga 1740 tgcttatgaa gatgaacgat tgattatcga tactcatatc gatgcgttgc ttgatctgcc 1800 gaagttgacc agggaaaacg gcgacgagat gcgtaagctt gttgaaacgt gtacaaaaca 1860 tgttgatgcg ctaaagaatc aagatcttcc agtagagggc ctgtcggaaa tgattcttat 1920 caacgttatc agtaagcgac tggacagaga gagcaggaag ctgtgggagt catcattgaa 1980 tcgcggtgat ttgccttcat ttgacgattt gattgacttt ctcaaagagc gtagccgtgt 2040 tctccagaag ttgacgagtt acgcccatat taatcaaccg caaacgacgg caaaatctgg 2100 tcaactggct tccacgacga aagccaagca aacgaagatg tttgttcaaa cgaataaaga 2160 agcatgtcct tgttgctcca acccccacat catctacaag tgcgcagatt tcaagaagct 2220 ttctgtagca gagcgtttcg ataaggtcaa gcgtatgggc ctgtgcttca actgcctgcg 2280 tccagggcat cgcacagtgg actgctcatc ggatcagcat tgcaaggcct gcgctaagca 2340 tcaccacagc ctcctccaca acgagcgctc agaagatccc aagaagaaag ttaaggctac 2400 tcagccgaag aatcaagtgg aagcagagaa gaaggaagag ttatcgcctg cagcagagca 2460 gcgcacagtc agctgctgta cacaaacgcg agcagctccg aagcagattt ttctgtctac 2520 agcaaagatc gtagtgtttg gttccggtaa tacaacaaca acctgtagag cgttactgga 2580 ttgctgctcc gagtccaaca tcatcacgga aagattggcc aagaagttga acatgcagct 2640 tttggtgatc aacccgctga tttctatctg cggtctaaat ggaatggaga caacagccaa 2700 caaggtcgtt caaacgaagg tgtcatctcg agatggtgaa ttttctgcaa ttctggactt 2760 catagttacg ccgtccatca ccgagttgcc aacaggtaag gtagaaactc aaagctggcc 2820 actccctgct ggcatagagc tggcagaccc ggcattcaac gtgccggacg atgtagatgt 2880 gattatcggt gctgagcgta ctacgatgtt ctgaagaaag ggcgtctcaa gattggagcc 2940 gacttcccga ttctcgctga aacagtattc gggtgggtcg taagtggtcc tgcaaaatct 3000 cagcaacagg ctagccagaa acgaatatgt cagctgaaca ccacccacga agatgtcaat 3060 cgcaccctct ccaagttctg ggaattggag acgggatgct tcgtcagtaa gatgactgca 3120 acagagcgtg ctgtagagca gcactttgca gagacacaca cacgcaacat caatagtagg 3180 tacgtggtca gactgccgtt caacgatctg aagagtcaac tgggcgactc atacgagaac 3240 gcaaaacgtc gatttggaag gctgatggtc agtcttgcca aaaatccaag caaacaagag 3300 gcctataagc aattcatgaa ggaatatgag accctcggcc atatgcagga ggtaagcgac 3360 aacccggaac atggatactt cataccgcat cacgcggtct ataaggactc gagctcgact 3420 acgaaagtgc gagtagtgtt cgatgcttct gcggcgtctt ctacaggagt atccctcaac 3480 gacactcagc taatagggcc cacagttcag agtgatttag tcactatcat gctacgattc 3540 tgtacgcacc aggttgtttt gacagcggat gtacctaaga tgtacaggca ggtacaaata 3600 caccctgatg atagagtgta ccagcgaatt gtgtggttaa acgactcgaa cgaaatgact 3660 accttcgaat taaaaacagt cacatacggc tgttcaagcg ctccctacct ggcaactaag 3720 gcattgacac aactggctac ggacgaagct gacgcctttc cattggcagc tcgtgttgtc 3780 aaggaggata gttacgtaga cgatttcatc actggtggca aatcagctgc agatgttaag 3840 gagacctact cgcagttgac caacatgttg agccgcggtg gatttggcgt gcataaattt 3900 tgctcaaaca gcgcagaagt tctacagtta atcccaccag agttgcagga aaagcaggta 3960 gacttcgaga tgtccgagat caataacacc atcaagacat taggactgat ttggaatcca 4020 agctatgact acttcgtgtt caacgtgcca ctcccggcgt acaatgaagc tagaccaacg 4080 aaaaggattg tgctttcgga aatcagcaag ctttttgatc cgttgggatt cctgggaccg 4140 gtggtgacaa ctgcgaagct gataatgcaa gagttgtggc gtttgaagct agattggaac 4200 gatgagttgc cagatgatca gatgcagctg tggctagctt tccgtgaaca gttggttgca 4260 gtacgacaag taaggaagaa aaggtgtgtc attcccggaa atactgtgaa gatcgagtta 4320 cttggatatt gcgatgcctc gaaacgtgca tatggtgcgg ttttgtacgt tcgaagcgaa 4380 ctcaacgatg gcactatcaa catacaacta gtgtgtagca aatctcgagt tgcacctcta 4440 aagccgacca ctatacctag acttgaattg tgtggcgcac tagtgttggc acagttggtg 4500 cagaagacga ttgaatcctt gaaggtgaag ttcgacagtg taacgctata ttccgattca 4560 atgatttgtt tgagttggct aaaaaagtca ccagcgctgt tgaatgagtt tgtgtgcaat 4620 cgagtggcta ccatcgtcga actaactcag aactacaaat gggactacgt taaatccgag 4680 aacaatcctg cggatgtgct gtctcgggga ttgtatccag agcagttgat cgaagaagaa 4740 ctttggtgga gagccgcacc cgagcaatgg cagcatggag aaatagaaga tgaagccgtt 4800 cctgtattgg cagatgatga gctaccggag ctgcgaagtt cgaaatgtgt gcttgcaaca 4860 accgtgacaa aatcgtgtaa caactggatt cgagtcagca acttcaaccg attgcaacgt 4920 gcctgggcat atgtttggag atttgttgaa ataacccgta cgaagaagag aaaccctgtc 4980 agcaatccca tcacagctag cgaattgtcc agggctgctc aaactatctt caagatcgtt 5040 caacaggaag catttaagga ggaatttaag tacttgcagt ccacagacaa aaagcggaac 5100 gacctttcaa gcctggcacc cttcgttgac gacaaaggcc tccttcgagt tggtggacga 5160 ctgaagtatt caaatattcc atacgagggc aaacaccagt tgctgcttcc ggaacgacat 5220 ccggtagtcg agctgctggt ccgctatttt catgagaaga accttcacgt tggacagaac 5280 gcattgattt cgatcattcg gcagcaatac tggccgatca aggtgaagac aactgtcaag 5340 cgaatcacca gccgatgttt tagatgttac aagcacaagc cccagcagtt gaaccaattc 5400 atgggtcagc taccaagtta ccgcgttaca cctgcgccag tctttgcttc gaccggtgta 5460 gattttgctg gaccattcat tctcagggag agcggaaaga aaccgaaatt cgtaaaagcg 5520 tacgcagctg tattcgtgtg tatggctgta aaggcagtac atttagagct atgcacagat 5580 ttgcggtcag agacgttctt ggcagcgtta caaaggttta ccagtcggcg tggcatccca 5640 tctgaccttt tctcggataa tgctactacg tttaccgggg ctgctaatga gctagctgaa 5700 ctacgtgagc ttttccggag tcaactgcac aaagacaagt tagccgagtt ctgtaccaca 5760 aagggcatca cttggcattt cattccgcca cgaagtcccc acttcggcgg tatttgggag 5820 gccggagtca aggcaatgaa gcacatgctg aagagagttg ttggagagac gagattgacg 5880 tatgaagaga tgacgacgtt tctgtccgaa gcggaagcaa tcatgaattc gcgtccaatg 5940 tgccccttat cagacgaccc caatgatttc caggctttaa ctccatttca tttcctaatc 6000 gggcgtagtg gtcaagcgat tccggagcct tcgtatgacc aggagaagat tggtaggttg 6060 tcaagatggc agcatgtgca gtacatgaga gatcattttt ggaagaggtg gtccgcggac 6120 tatcttcata cgctgcaaac acgccaaaaa tggaaagacg gtgtattgga catcaaagta 6180 ggagcattag tgttactgcg agaagaaaac gttccacccc agcagtggaa gttgggccgc 6240 atcactgcta cacatccagg cgaggataag gtggtgagag tggtcaccgt cagaacagcg 6300 agcagcgagt ataaacgggc agtgactaga atctgtctgt ttccagatgt tgaatctatc 6360 gacccaacgg ggggtgta 6378 // ID Copia-33_DPu-I repbase; DNA; INV; 4477 BP. XX AC ACJG01006013; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-33_DPu_; KW Copia-33_DPu-LTR; Copia-33_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4477 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01006013; Positions 19932 15456. XX CC Positions [1977-2450] - Integrase core CC 'GACTG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 799..2070 FT /product="Copia-33_DPu-I_1p" FT /translation="MITPRSHLTSSLSLDDDMTTHIQKLTSISDKLRELGQ FT PLDEMQLVTKALATLPEKFRVVRSVWASVPLNERTIDNLLQRLRSEENVLK FT SYEREDAAKDAAYAAKNYNSSRGRGRGRGRGGKPFHQVQEGVINQHKKLDG FT PRCGYCFIPGHETHECRKKKRAEAEKREKQDQGLVSSTQSNSTSSLDFFAD FT SGATKHMSDQKHLLQDFTPVTAGTWFISGIGDTQLEVMGRGNIRATVEVLG FT KTYSRIINDVLYVPGLGINLFSIAAATEIGLEARFNNNEVTFYRKNLPVLV FT GERADNTLYLLNLKPYLYTYGSRTDTALEANLRASLTIWHQRLGHVNHRTI FT QRMISEDAATGLNLTGERSIPKTLCTACELGKFHRRTLDVGRTRARQIGEL FT VHSDAAPYQLPALAMHVTMLHLPTTTADGE" FT CDS 2082..4004 FT /product="Copia-33_DPu-I_2p" FT /translation="MKKKSEVPALFRQFVASVLNETNNTVRTLRSDNGGEY FT VGHEFKKYLAERGIRHETSAAYTPAQNGVSERGNRTLMDGARSLLFASNLP FT SSLWAEAVAYVVFIRNRLLSSTNNVTPYEAWYGRKPNLADIRIFGSRAFVR FT CPNVRKLDQRCEEGAFVGLSDTQKASRIYVASTPPRIVVSHDVKVDETTMY FT FKASTSELSFTSDDWTEKTILDHDTHKSIADLPVLKTSLQDVMDPTPMDTN FT NNEAEEILPQETIHHHSETTLPNLADPPNEAAEITTNDGDKVISEDNTTSS FT PQTGSRRSSRLPAYTESYLEYKRSLANQATILDVSTQIRANNTQPTEPSSY FT TEAISCADAEHWIPAIFDEYESLIQNGTWSLCPLPTGRKAIQGKWILKFKP FT GYKDVAPRFKARFVVKGYSQIFGLDYIETFSPVVKHYSLRVVLAIAATKDL FT EMIQLDIKIAFLYGDLQEEIYMEQSEGFVVPGKEDQVCRLLKSIYGLKQAS FT RAWNKKFHDFILKFGLTQSRADPCVYFRHRREGEDDEEFTVLIIYVDDGII FT FSNQKHILTDILEHLKTVFEIRSFPADRFVGVDILRDRSLRTIHISQPDYI FT SKIVEKFNMVNCNPLAAPADPSPQNTFSRNTLWREQHCTYRLL" XX SQ Sequence 4477 BP; 1403 A; 1123 C; 900 G; 1051 T; 0 other; ggttatgggc ccaggtttac gcattttcta aagtgaaatg gccggaaatt taattgacaa 60 tcatcttaag gatgtcaatc acgtacccaa gttcgatggc accaattttc gtgagtggag 120 ctatgaactc cgcatgatgt tccaacaact gggactgatt ggacttgtgg aagcaagagc 180 tggacacact ctcccagaag aggtaatccc aacgatctga aatactcatt ttttttgttc 240 agatcactca catgtacatt tagacatgtt cgatccacat gttccatgca acattagtgc 300 tacactaaac acatgtacac ttacacatgt tcgatactgt tacagtagtt aaccacacat 360 gttcgacact tgacaagtga tcaactacac acatgtacag tcatatgtaa tctgattacc 420 caacactgct caagcataca gtaattaacg tgtacacaat catgatagca actcttttaa 480 tcaagagcga tgcatgtaca aacatattac ctaaattaat caacatccat tatatcacag 540 gcaatggacg ataatcaggt tgtaatcaac gctgcagaaa tcgaagcttg gcacctgaga 600 gatgtcaccg ccagaaatta catattcgca actctgacga aagtcatgaa gcaaaatctc 660 tactcatgtg agactgcagc tgccatgtgg actagactgg acactcaata tcaactcaga 720 gcagctgaaa atttacatct actgtggtaa agcttctatg acttcactca ccatgctggt 780 acgttaacac aatgcaagat gataacaccc agatcacact taacgtcctc tctctcacta 840 gatgacgaca tgaccacaca tatccaaaag ctaacaagca tttcggacaa actgagagaa 900 cttggtcaac ctttagacga gatgcaattg gtgacgaagg ctctagccac ccttccagaa 960 aaattcagag ttgtcaggtc ggtttgggcc agtgtacctc tcaacgaacg aacaatcgac 1020 aatctccttc aacgattgag atcagaggaa aacgtcctca agtcatatga aagagaggat 1080 gcagccaaag atgcagcata cgcagctaag aattacaact catcaagagg acgaggtcgt 1140 ggtcgcggcc gaggaggaaa acctttccac caagtccaag aaggtgtcat caatcaacac 1200 aagaaattgg atggtcctcg ctgcggatat tgtttcattc ctggtcacga aacccatgaa 1260 tgcagaaaga aaaagagagc agaagccgaa aaaagggaaa aacaggatca aggcctcgtg 1320 tcatccaccc aatccaattc aacaagttct ctggacttct tcgccgactc tggagccact 1380 aagcatatgt ctgaccaaaa acatttgcta caagatttca cacctgtcac agctggcaca 1440 tggttcattt ccggaattgg cgatactcaa ctggaagtaa tgggcagagg caacatcaga 1500 gcaactgtcg aagtactcgg aaaaacgtac tcaagaatca tcaacgatgt cctatacgtc 1560 ccaggccttg gtatcaacct tttttctatc gctgcagcca cagaaattgg acttgaagcc 1620 cgattcaata ataacgaggt taccttctac cgcaagaacc tacctgtact agttggagaa 1680 cgagcagaca acaccctgta tcttcttaac ctcaaacctt atctctacac atatggaagt 1740 cgaaccgaca cggctctaga agctaacctt cgagcgtccc tcactatctg gcatcagaga 1800 cttggacatg tcaatcaccg aaccatccag agaatgatat ccgaggacgc agcaactgga 1860 ctcaatctaa caggtgaaag gtcaattccg aagactttat gcaccgcatg tgaacttgga 1920 aagtttcacc ggcgcaccct ggatgttgga agaactagag caaggcagat tggtgaactc 1980 gtccactcag atgcggcccc ataccaactc ccagcattgg caatgcacgt tactatgttg 2040 catttaccga cgaccacagc ggatggagag tagtatactt catgaagaaa aaatccgaag 2100 tcccagcgct gtttagacag ttcgtcgcat ctgtcctcaa cgaaaccaac aacaccgtcc 2160 gaaccttacg ttcagacaac ggaggggagt atgtaggaca cgaattcaag aaatatcttg 2220 ccgaaagagg gatccgccac gagacgagtg cagcctacac tccagctcaa aacggagtat 2280 ctgaaagagg aaatcgaaca ctcatggatg gagctcgtag tctactcttc gctagtaatc 2340 taccatcatc actttgggca gaagcagtgg cttacgtcgt cttcatacgc aatcgtcttc 2400 tatccagcac gaataatgtc accccctacg aagcttggta cggaaggaaa cccaaccttg 2460 ccgacattag aatttttgga tccagagcat ttgtgaggtg ccccaacgta agaaaactag 2520 accagcgatg tgaagaaggt gcttttgttg gactaagcga cacacaaaaa gcgtcacgca 2580 tatacgttgc ttccacaccg ccaagaattg tagtaagtca cgacgttaaa gttgacgaaa 2640 ctactatgta cttcaaggca tcaacttccg aactatcatt cacctcagat gactggaccg 2700 aaaaaaccat cctggatcac gacacccaca aatccatcgc tgatctacct gttttaaaga 2760 catctctcca agatgtcatg gatcctaccc ctatggatac caacaacaat gaagctgaag 2820 agattctccc tcaagagaca atccatcatc acagtgagac tacccttcca aaccttgctg 2880 acccaccgaa cgaagccgca gaaataacaa cgaatgatgg tgataaggtc atcagcgaag 2940 ataacacaac atcttctcca caaacgggct ctcggagatc ctcacgtctc cccgcttaca 3000 ctgaaagcta tttggaatat aaaagatccc tcgcaaatca agccacaatc ctcgacgtgt 3060 caacccaaat tcgagcaaac aacactcaac ctactgaacc atcgagctac acagaagcaa 3120 tctcctgcgc tgatgcagaa cactggattc cagccatatt cgatgaatat gagtccttaa 3180 tccaaaatgg cacatggtcc ctttgtccac taccaactgg aagaaaagca attcaaggaa 3240 aatggattct caagttcaag cctggatata aagacgttgc accaaggttc aaagctcgtt 3300 ttgttgtcaa aggctattcc caaatctttg gccttgacta cattgaaaca ttctcaccag 3360 ttgtgaaaca ctactccctg cgagtagtac tcgccattgc tgcaaccaag gacctcgaaa 3420 tgattcaact tgacatcaaa atagcgttcc tctatggaga cctccaggaa gaaatttaca 3480 tggagcagtc ggaaggattc gtggttccag gcaaagaaga ccaagtgtgt cgccttctaa 3540 agagtattta tggattaaaa caagcctccc gggcatggaa caaaaaattt catgacttta 3600 ttctcaaatt tggcctaact caaagtcgtg cggacccgtg tgtgtatttc cgccatcgac 3660 gtgaaggaga agatgatgag gaatttactg ttttgataat ctacgtcgac gacggaataa 3720 tcttcagcaa ccagaaacat atcctgaccg acattctgga acatttgaaa actgtttttg 3780 aaatccgctc ctttcccgcg gaccgatttg ttggagtgga cattcttcgt gaccggtctt 3840 tacggacaat acacatttcc caaccagact acatttccaa gatcgtcgag aaattcaaca 3900 tggttaactg caacccgttg gcagctccag cagatccatc tccacaaaac acgttttctc 3960 ggaatacgct atggagggaa caacactgca cttaccggct actgtgattc agattatgca 4020 ggagatctgc aaactcgacg ctccacatcc ggctttgtct tcattcatct tggaggacca 4080 gtttcgtggg ccagccgacg tcaaccgtgt gttgcccttt caaccacgct gaatttgttg 4140 ccgccagtga agccactaag gaagcagtat ggctgaaaca actacttgcc gaactcggta 4200 tggaccatca accaactcgc ttatactgcg ataaccaaag cgccatcgcc ttggtgaaga 4260 atccagcctt tcacaaaagg accaaacaca tcgatgtccg gctctttcac atcagagaag 4320 ttcaagaaag cgggaccgtc aacatcgaat atgtttgttc agaacaacaa ctggctgaca 4380 tttttacgaa accgttagca atccctagat tcgagaaact ccgaaatgat ttgaatgttg 4440 tacaaattcc agcgtaagcg ttcagtttga ggagaag 4477 // ID hAT-47_SM repbase; DNA; INV; 2570 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-47_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2570 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1850-1850 (2009). XX DR [1] (Consensus) XX CC It contains 3 slightly overlapping ORFs. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(280..1095,1062..1949,1937..2437) FT /product="hAT-47_SM_1p" FT /translation="MKRNSNICDYFNKKVKNNEEEENISNSVCQESPTESC FT STILSANPEKDKSAVNIFPDCWNGDQLKLFQDKYDGLDVKNKKLGCIKCSS FT FDNLGICTEQNIHISQVWRNFEVQPSGTTKSIQQASLRKKMSEHFQSKTHL FT LVLKMLELKKRNELTKVIDSMNDKNMTNTNKIFNIVYSLIKRNRPLSDMEA FT EVELQLKNGLDIGISLHSRFTAIRIAEHIAKQMKKRIVEKIITSNSKICVI FT IDEASTISSKSVLVIFLKFEMELENPTFFWINGARKPYLFLDLVHLESQTA FT EHIFNILLQTLIINGFSMKFLKENLIGFCSDGASTMLGQKSGVATRLVKEF FT PYIIVWHCLNHRLQLVLDDAIYEIKQVNHFKSFIDKIYVIFHQSNKNQLLL FT QNISSDLGIEIIKIGRVLGPRWAACSLRSALAVWRAYVPLYKFFTEDSKYL FT GMANRLANQYFLKDLALMIDILQEISYLSIGLQARFVSLQKATKLVNRTVN FT VMIQMKENLGIFEKKVEEMLSSPAFADIPFIENKKFESLPRKKLIETIIDK FT MNSRMLNCGHIRMNKNEKKTMEKNDVAELFNLLEGSDWNLEEVTCPWIYGE FT EKLRNLNEIIRYEIPINDFRDYVQNIIEGKNSVIHTIQKAKNKINTIAISS FT AEAERGFSLMNIICEKKRANLSVEHISSLMTVNMVGPPLDRWNAEEYVKTW FT LRNHNSADDTRVRNKAKKILMKIMNLFGNCYNNI*" XX SQ Sequence 2570 BP; 990 A; 338 C; 413 G; 829 T; 0 other; cagtgatgag ctgtagccgg ttcgcaccgg ttcgatagaa ccggttgtta aattttttaa 60 tagttaatcg aaccggttgt tatttcggaa taaattaaat tttaaattaa taaaatctat 120 tatttattac aaaaactatt tataatttcg attaaatagg tttataatat ctatctatag 180 aaatacgatt tccagttata attatattgt gagtacactt atgttttcca gtttatatct 240 aaatattttg tagccctttc atttagaaaa taatatttga tgaaacgcaa tagtaatatt 300 tgtgattatt ttaataaaaa agtaaaaaac aatgaagaag aggaaaatat ttcaaattct 360 gtatgccagg aatcaccaac agaatcttgc tctacaattc tatcagccaa tcccgaaaaa 420 gataaatccg ctgtgaatat ttttccagat tgctggaatg gtgatcaatt aaagctattt 480 caggataaat acgatggttt ggacgtcaaa aataaaaaac ttggatgtat aaaatgctcc 540 tcattcgata atttgggtat ttgtacagaa cagaacattc acatttcgca agtatggcga 600 aattttgaag ttcaaccatc gggaacaacg aaatctattc agcaggcttc attaaggaaa 660 aaaatgtctg aacatttcca atcaaaaaca catttattgg ttttaaaaat gcttgaacta 720 aaaaaaagaa atgaactaac caaagtcatc gattcaatga atgacaaaaa tatgactaat 780 accaataaaa ttttcaatat tgtttacagc ttaataaaaa gaaataggcc gctttcagat 840 atggaagcag aagttgaact tcagttgaaa aatgggctag atattggaat ttcgctacat 900 tctagattta ctgcaattag aattgcagaa catatagcaa agcaaatgaa aaaaagaatt 960 gttgaaaaaa taattacatc caattcaaaa atctgcgtaa tcattgatga agcctcaact 1020 atatcaagta aatctgtcct agtcattttt ttgaagtttg aaatggagct agaaaaccct 1080 accttttttt ggatttagtt catttagaat cccaaaccgc tgagcatatt tttaatatcc 1140 tgctgcaaac tttaataata aatggctttt ccatgaaatt tttgaaagaa aatttgattg 1200 gtttttgttc tgacggtgca agtacaatgc ttggtcaaaa gtcaggtgtt gctacaagat 1260 tagtaaaaga attcccttat ataatagttt ggcattgctt aaaccatcgc cttcagcttg 1320 ttttggatga tgcaatatat gaaataaagc aagtgaatca ttttaaatca tttatcgata 1380 aaatttatgt tatttttcat caatcaaata aaaatcaatt acttttacaa aatatatcaa 1440 gcgatttggg aattgaaata atcaaaatag gccgtgtttt aggtccaaga tgggcagcat 1500 gtagcttgag atctgctttg gctgtttggc gtgcttatgt accactttat aaatttttta 1560 ccgaagattc aaaatatctt ggtatggcaa accgacttgc aaatcaatat tttttaaagg 1620 atttagcgct tatgatcgat attcttcaag aaatttctta tttatcaata gggttgcaag 1680 ccaggtttgt ttctcttcaa aaggcaacca aactagtaaa tagaactgta aatgttatga 1740 tacaaatgaa agaaaattta ggaatatttg aaaaaaaagt tgaagaaatg ttatcatcac 1800 ctgcatttgc agatattcct tttatagaaa ataaaaaatt tgaaagtctc ccacgaaaaa 1860 aacttattga aacaataatt gacaaaatga attctcgaat gttaaattgt ggacatataa 1920 gaatgaacaa aaatgagaaa aaaacgatgt agctgaatta tttaatttac tagaaggaag 1980 tgattggaat ttggaagaag ttacatgccc gtggatttat ggagaagaaa aattgagaaa 2040 tttgaatgaa ataattaggt acgaaattcc cataaacgat ttcagagatt atgtacaaaa 2100 cataattgaa ggaaaaaact ccgtaataca tacaattcaa aaagctaaaa ataaaattaa 2160 tactattgct ataagttcag cagaggccga acgagggttt tcattaatga atatcatttg 2220 tgaaaagaaa agagctaatc tttcggtaga acatatttct agtttaatga ctgttaatat 2280 ggttggacca ccacttgaca gatggaatgc tgaagaatat gttaaaactt ggttaaggaa 2340 tcataattca gctgatgata ccagggttcg aaataaagca aaaaaaatat taatgaaaat 2400 aatgaactta tttggaaatt gttataataa tatataattt cagcagaggc taatcaaata 2460 aataaaaaaa ttaattctat tcaaataaaa cgatcttcta ttaaaaaaaa tttttgttag 2520 tttttagttt tgagagaacc ggttgttaaa attttggcag ctcatcactg 2570 // ID Jockey-N4_CQ repbase; DNA; INV; 1793 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1793 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 587-587 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >91% CC identity. CC This family encodes a protein similar to Jockey ORF1p but does CC not encode ORF2p. Thus it is a non-autonomous non-LTR CC retrotransposon derived from Jockey, like HeT-A. XX FH Key Location/Qualifiers FT CDS 110..1549 FT /product="Jockey-N4_CQ_1p" FT /translation="MPKSGRGRGSSSAASKHSRSSSAGRVQKGGKQTIAKT FT TAIAKTSDSVIDKNLLHPEYRPRGISPRGFSPRSPVRTRGRVAAGSSQVNV FT PKLDGNKSTGGITISNPYEPLSGEEEEDGETTSDDDDDNSNIPESSAKEKN FT KTRSQERRPPPIYCLGHLADDIDELLEGNKYSLKLGKTAVQVITLDKKSFA FT DVIATLKANGIRHYTFNHADDVPVKVVLGGYMDRPISQLEEHLQNAKVRPR FT EIKVLSRKTTETGTHTLYLLYFNRGTVKIQDLRRIKTLDGFWVNWRFYSKH FT PNDAAQCHRCQKYGHGSRHCNLQPRCVKCGGTHLSEVCSLPRKVDLGDDAA FT QTKPRIKCANCEGNHTSNYRGCSARKAYIEDQEKRKKKPAASRPPHRSTST FT TVPAAGQRTVPSNNSASPPGWGRSFASVVAGSGDSAQQEITGEDLFTLPEF FT FTLAGEMLTRFKACRNKAEQFMALGELMIKYIYTR" XX SQ Sequence 1793 BP; 496 A; 481 C; 444 G; 371 T; 1 other; aaccctggca acgcttgagc cagcaacaac acggtgttcg acgctcctca agttttttcg 60 attttaacgt cgcgagttac cagtttatcc gcgaaactgc ctccccgaga tgccgaagtc 120 cggccggggc cgtggcagct cgagtgcggc ctcgaaacat tcgcgaagtt cctccgccgg 180 ccgagtgcaa aaaggtggta agcaaaccat cgccaaaacc accgccatcg ctaaaacgag 240 cgacagtgtc atcgacaaaa acctcctgca cccggaatac cggccacgtg ggatctcccc 300 acgtggattc tctccacgta gccccgtgag aacacgtggg agagtcgccg ccggttccag 360 ccaggtgaac gtgccgaaac tcgacggcaa taaatcaacg ggtggaataa ctatcagcaa 420 cccgtacgag cccctcagcg gggaagagga agaagacggc gaaacgacca gtgacgacga 480 cgacgacaat agcaacattc ctgagtccag cgcaaaggag aagaacaaaa cgcgatcgca 540 agaacgaagg ccgccgccta tctattgtct gggccatctc gcagacgata ttgatgagct 600 cctcgagggc aataaatatt ckttgaagct aggcaaaact gctgtgcaag tgatcacact 660 cgacaaaaaa agttttgccg atgtgattgc gacgctcaaa gcaaatggca taaggcacta 720 cacatttaac catgctgatg acgtccccgt gaaagttgtc cttggtgggt atatggatcg 780 ccccatttcg cagctggagg aacacctcca aaatgcgaaa gtacgaccgc gggagataaa 840 agtgctctcg cgcaagacaa cagagacagg tacgcacacg ctgtacttgt tgtacttcaa 900 ccgcggcacc gtcaagatcc aggacctgcg gcggattaaa acgttggacg ggttttgggt 960 aaactggcgg ttttactcga aacacccaaa cgacgcagcg caatgtcacc gttgccagaa 1020 gtacggccac ggctcgcgac actgcaacct ccagccacgt tgtgtgaaat gcggtggcac 1080 acacctttct gaggtgtgtt cactgccacg gaaggtggat ttgggggacg acgcagccca 1140 aaccaagccg cgcatcaagt gcgccaactg cgaaggtaat cataccagca attaccgtgg 1200 atgcagcgca cggaaagcat acatcgagga ccaggaaaag aggaagaaga aaccggcagc 1260 gtcccgccct cctcaccgca gtacgagcac aaccgttcct gcagctgggc agcgtacggt 1320 cccgtcgaac aattcagcgt ctcctcctgg ctggggacga tcttttgcca gcgtcgttgc 1380 cggtagcggc gattcggcgc agcaagaaat aactggcgaa gatcttttca ccctgccaga 1440 gttcttcact ctcgcggggg agatgctgac gcggttcaag gcctgccgta acaaggcgga 1500 acaattcatg gctctaggcg aattaatgat caagtacatc tatactcgat aaatttgaac 1560 tgtgatctag ttgtaagctt ttctccccta tcctttttcc ttagtaactc tagtaagttt 1620 tttgaaactt ttcttacttt ttgttaagcc catagtcaaa acaataattc ctccaacatg 1680 gactagtgta acacacagct gcaaggaact ccaaaactct gtattgtaca caaattgaac 1740 ttattgtaac ttgtttattc aaaaataaaa actgaattga attgaattga att 1793 // ID Crack-28_AAe repbase; DNA; INV; 4919 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-28_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4919 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1244-1244 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 717..1745 FT /product="Crack-28_AAe_1p" FT /translation="MCCVYTFHFVFVYFCLFKYLFLKMNSENLCNNCHKIE FT PDGNKLIACKYCFTKIHIKCPKNLGRSIKQNESKLHFCSSKCSSMYERIIN FT SRMQNSKDMSKLLEELKSAVRANMQTVVEQVKAVTXAIESSQEFLSSKFEA FT ILTDFDVLKKENIRLRQEIEEMRSSQSSLTALVYKTEVEMDKQNKISVMNN FT AVILGVPFKKTEDLPGICRKIFETIGVESYNHSIVSVARIQANDKNAMSPI FT RIVFKDKKSKDVVLEKKKRFGQLVSTMVDPKLMCENKTTTVTIRDELTPLS FT TYLLNELRQAQGKLNLKYVWQGRGGVILAKTDENSKPHEIRNRVDLNKLLK FT " FT CDS 1772..4696 FT /product="Crack-28_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLLSVIIFMSITMANSLQNYFHYSIDDFNKLYSNKTE FT KSLRFFQWNIRGMNNLDKFDSILEFLDQCDVPLDVLVISETWLKAENTCLY FT GIPGYQSIFSCRNSSSGGLAMYISNQINYRLIASETLDGLHHIHIELKVHE FT HQYEIHGVYRPPSYDVNCFLDKIEEWINNTNQNHVCFMLGDFNIPINLTHN FT NIVVKYRSLIESYGLVCSNTFTTRPSSNNLLDHVLCRLEDVERIRNDTIFS FT DKSDHTQIISSIKTSVDRKSTTVTKTIVNHRQVEFEFETFLNTIDAIDNVS FT ATLSRIQNVYNDILHRCTKTVSRKIKIKNHCPWMSFDLSVLIRIKNSYLKK FT AKRNPNDANLKAMLNHISKKVQYTKIACKKQYFEKLLNTTCHSKLWSNINQ FT IFGRTKEKQKICINNAQNVPSSSVEESCEILNNYFTTVGEELARKIPRLPG FT TSPFNHLTPINSSIFLRPTNPNEVISLINDLDSNKCPGADNIPPSILKMNY FT AKFSTILSNMFNEIIASGEYPDCLKIAKVTPIFKSGDPNLACNYRPISTLS FT VFNKIFEKLIVNRIIQFFKLHNVLYPLQYGFRQGCSTTSAVVELVDYVIKQ FT IEAKKIVGGLFIDLKKAFDTLNHNILLKKLEYCGVRGLPNRLIGSYLTNRK FT QFVAVEGKTSKTLGINTGVPQGSNIGPLLFLVYINDIGKLDLKGIPRLFAD FT DSALFYPGASTDEIVSHVEHDLCILNDYFCLNLLSLNLAKTKYMLFHSPRK FT KPVPRNHPKLKDIVIELVSEFKYLGIVLDPIFSWDKHIKSIEKKVAILCGS FT IRRVKSFVSQAALLKYYYACIHSLLQYLNIVWGYACKSKLKRLQTLQNRCL FT KHIFNFPPLYSTVNLYTNRNHCILPIRGLCELQTCLFVYDAVYNDNTHNNL FT PIMLNNQLQNTRQAHHLLRTQAKTTLGQKRITFIGPQKYNVLPNYLKSVNN FT RNIFKSRLKMHLKEKIHEFV" XX SQ Sequence 4919 BP; 1663 A; 869 C; 882 G; 1504 T; 1 other; aatttaatat tggtgttgaa tagtgaagtt aaatggtcaa gcgcatcaat gatgttaatg 60 atttatgcat gctgctactt aaagtgatgt aaaatgcaaa ataaaattaa atcttcttga 120 aaacgttgat aacaatgtac aattagtagt agtgcattct ggctgatagt ggttgctcgt 180 tgttgctgtt gttggtggag cttcgatgga cacactttgt attagttccg tgttgtgaaa 240 acgaaagagg ttcgattgat gctagccctg cccaagtacg tcgcaggtac ttctcgtagt 300 gttgtccacc aaaacaatat tctaggaggt tctggctcta cggacaagta gccaaattcg 360 ccatcggggt agatttaaca ttgtaacgga aaagctatac cttagtacgt cgcatgtaca 420 tcgtgaggtg tcgttgaaaa acacgatacg catgtatggt tctggctcca ttactagcga 480 tcggccgttg tatttgtaga gtagatttga cactgtcatg gaaagggtgt gcctagctat 540 acattgaaac tttagtagtg agtagcttcg tgggatctta gtgagtacac cgaacatacc 600 ttagattgat aatttatagt agagacccta taaccactca gaaaacacat tgaagtcaca 660 ctcagtcaga gataccgtat atattacatt ccttttattt ttgtatttga tattgcatgt 720 gttgtgtata cacattccac tttgtttttg tttatttttg tctttttaag tatctgtttt 780 tgaagatgaa ctcagaaaac ctttgcaaca attgtcacaa aatcgagcca gatggcaata 840 aattgattgc gtgcaaatat tgcttcacaa aaatacacat taaatgtccc aaaaatttag 900 gcagatctat caagcaaaat gagagcaaat tgcatttttg ttctagcaaa tgctcaagca 960 tgtacgaacg cattatcaat tcgcggatgc aaaatagtaa ggatatgtcc aagctattag 1020 aagaacttaa aagtgctgtg cgtgccaata tgcagactgt tgtagagcaa gtaaaagctg 1080 ttactwgtgc aatcgaatcg tctcaggaat tcctttcttc aaaattcgaa gctatcttga 1140 ctgatttcga tgttcttaag aaagaaaaca ttcgattaag acaggaaatt gaagagatga 1200 gatcgtcaca aagctccttg acggctctgg tttacaaaac ggaggtggaa atggataagc 1260 aaaacaaaat atccgtaatg aataatgcag ttattttggg tgtccctttt aaaaaaactg 1320 aagacctacc cggcatctgt aggaaaatat ttgaaacaat aggagttgaa tcatacaacc 1380 actcgatagt ttctgtagcc agaattcagg caaatgataa aaatgcaatg tcaccaatca 1440 ggatagtttt taaagataaa aaatcgaaag atgtagttct cgaaaagaaa aaacggtttg 1500 gtcaactggt ttcaactatg gttgatccca agctaatgtg tgaaaacaaa actactacag 1560 ttacaattcg cgatgagctc actccattat caacttactt actaaatgaa ttgcgacagg 1620 ctcaaggaaa acttaatcta aagtatgtct ggcaaggtag aggtggcgtc attttggcaa 1680 aaacagatga aaactctaaa ccccatgaaa ttcgcaatag agttgacttg aacaagttgc 1740 tcaaatagtt tcatcaaatt tcgatgtgcc tatgttgtta tcagttatca tattcatgtc 1800 aattacaatg gctaattcat tacaaaacta tttccattac tcaatagatg attttaataa 1860 actttattct aataaaacag aaaaatcctt aaggtttttt cagtggaata taagagggat 1920 gaataattta gacaaatttg attctattct tgaatttttg gaccaatgtg atgtcccttt 1980 ggatgtactt gtcattagcg aaacttggct taaagcggaa aatacgtgtt tatatggtat 2040 accaggatat cagtctattt tctcttgtag aaactcttcg agtggtggtt tggcaatgta 2100 tatttcaaat caaataaact acagattgat agcatcagaa actttggatg gcttacacca 2160 tatacatatt gagctgaaag ttcatgaaca tcaatatgaa atacatggag tctatcgacc 2220 tccgtcatat gatgtaaact gttttttgga caaaatcgaa gagtggatca ataatactaa 2280 tcaaaatcat gtatgtttca tgctaggtga tttcaacatt ccgattaacc tcacgcataa 2340 taatattgtc gtgaaatata gaagtttaat agaatcgtac ggcttggttt gttcaaacac 2400 cttcacaaca agaccgtcta gtaacaatct gctggatcac gtactttgtc ggttggaaga 2460 cgtagaacgc attcgaaatg atacgatatt ttcagataaa agtgatcaca cgcaaattat 2520 ttcatctata aaaacatctg tggatagaaa atcaacaaca gttacaaaaa ccatagtaaa 2580 ccatcggcaa gtagagtttg aatttgaaac attcctaaat acaattgatg ctattgataa 2640 tgtatcggct acactgagta gaatacaaaa tgtttacaac gacattttac atcgttgcac 2700 aaaaactgtg tcgcggaaaa ttaaaataaa aaaccattgt ccctggatgt cctttgatct 2760 ctcggtgttg ataaggatta aaaatagtta cctaaaaaag gcgaaacgca acccgaacga 2820 cgctaatctt aaagcaatgt tgaatcatat aagtaaaaag gtgcagtata ccaaaattgc 2880 ctgtaaaaaa caatactttg aaaagttatt gaacacaaca tgccattcga aattgtggag 2940 caatatcaat caaatttttg gaagaacaaa agaaaaacag aaaatttgta tcaataatgc 3000 acaaaatgtt ccatcaagct ctgtagagga atcttgtgaa attctcaata attacttcac 3060 gacagtagga gaggagttgg ctaggaaaat ccctagactc ccaggaacaa gtccatttaa 3120 tcatcttaca cctataaata gttccatttt cctgcgtccc actaatccca acgaagtgat 3180 ttcccttatt aacgatctag attctaacaa atgtcccggt gcagacaata ttccgccatc 3240 aatacttaag atgaactatg ccaaattttc aacaatttta tccaacatgt tcaacgaaat 3300 tattgcatca ggagaatatc cagattgtct aaaaatagcc aaggtaactc cgatatttaa 3360 gtccggggat cctaacttag cctgcaacta tcgtccaatt tccacacttt cagtatttaa 3420 caaaattttc gaaaagctta tcgtgaacag gataatacaa ttcttcaagc tgcataatgt 3480 tctatatcca ctccagtacg gatttcggca aggttgcagt accacttctg cagtggtaga 3540 gttagtcgac tatgtaataa aacaaataga ggcgaagaaa atagttggag gacttttcat 3600 tgacttgaaa aaggccttcg atactctgaa ccataacatt ttattgaaaa aattggaata 3660 ctgtggtgtt cgaggtcttc ccaatcgtct cattggtagt tatttaacca ataggaagca 3720 atttgtagca gttgagggca aaactagcaa aactttagga ataaatacag gagtgccaca 3780 agggagcaac atcggtccac ttctgtttct tgtatacatc aacgacattg ggaagctcga 3840 tctaaaaggt ataccaagac tttttgccga cgactctgct ttgttttacc ccggagcatc 3900 cacagatgaa attgtttcgc acgttgaaca tgatctttgc atactcaatg attatttttg 3960 tttgaacctt ctctcgctca atttagcaaa aaccaaatat atgttattcc actcaccaag 4020 gaagaaaccc gtaccaagaa atcatccaaa attaaaagac atagttattg agctggtctc 4080 ggaatttaaa tatttgggaa ttgttttgga tccaattttc tcatgggata aacacatcaa 4140 gagcattgag aagaaagttg caattctctg tggttctatc agaagagtta aatcatttgt 4200 atctcaggct gccttgttaa agtattatta tgcttgcatt cattccttat tacagtattt 4260 gaatatagtg tggggatacg cttgcaaatc taaactcaaa agactgcaaa ctttgcaaaa 4320 tcgttgccta aaacatatat tcaatttccc tcctttgtat tccacagtca atctttatac 4380 aaatagaaat cattgcattc ttcctattcg tggtttatgt gagcttcaaa cttgcctatt 4440 tgtctatgat gcagtatata atgataatac gcataataat ctaccaatca tgttaaataa 4500 tcagctgcaa aacactcggc aagcacatca cttattgcga actcaggcaa aaaccactct 4560 tggacaaaaa aggattactt tcattggtcc acaaaaatac aatgtactgc ctaattattt 4620 aaaatcagta aataacagaa acatcttcaa aagcagattg aaaatgcatt taaaagaaaa 4680 gattcatgaa tttgtttagt gtgatctaat taccataatc ataactgtat tcctatatat 4740 ctccggagtt ccttcaaagg atttttatcc actggaactt caactcactt tactactcct 4800 atttcaatgt tactctgtct tgtttttatc gttattaata tctaataaat ggtgtccgct 4860 accagggggc tctgtagagc tttttggtgc gggggtagtt ggtgggccat taaaaaaaa 4919 // ID Mariner-21_HM repbase; DNA; INV; 2952 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 2) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-21_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2952 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1955-1955 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1125..2558 FT /product="Mariner-21_HM_1p" FT /translation="MPKSWMENHCAGRDWFTSFLQKNPTLSLRAPEATSIA FT RAISFNKHNVEMFFNNLQVCLQRHQFTPADIWNMDETGVTTVQKPDKVISR FT KGQKQVGAITSAERGALVTLACAASAIGNSIPPFFVFPRVNFRDHFLTSAP FT PGSDGDANPSGWMKEDTFCKFLTHFVKNARPSKEHPVLLLVDNHFSHLSLN FT ALDYATANGIIMLSFPPHCTHKLQPLDRTVYGPFKKAVNTAINSWMVSNPG FT KGMTIYNIPEIVKVAFPLSMTPNNILSGFSSCGIHPFNPQIFTDLDFAPSS FT VTEMPMPVKTNPQLTMPQASEVTVSPEQSTCASTSSLTLSSQEPSTSQAKS FT ISECFELASPDKVRPLPKALERKPSKNTRKKRTSSILTDSPFKNSMRSAQP FT LPKKCKSSAKGKGKGKATKKHTQDSSSSSEDETYCIICMDTWANSRDAEKW FT IKCCVCSSWAHEECTAGGSSYICHNCDSDDDY*" XX SQ Sequence 2952 BP; 977 A; 531 C; 521 G; 923 T; 0 other; ggggagagcg gggctaaaag tcataagggt tagttgtcat agtcgctatt tcggaatgtt 60 atagctggat tgatgctgcg atttttttat attatataaa tggtgctgaa agatgactgc 120 tcagttgatt tcatgaaatt cgcgcgaata gtttttctac aaatccaaaa aaagtaaaaa 180 tttcgttggt aagtaacttt ttaacttttt tgttcaaatg ttcaatacag taacaatatg 240 caatatttgt gtttaataca ttaaaatgtt tatagttttg gaaattcttt tattggtgtg 300 ctagtcattt ttcagattca cggcaaggat aaaaagttat ttaggtcaaa ccttcctcta 360 ccccctgtgg ggcaagttgt cctattttaa agggggcaag ttgtcatata aaaaaaattg 420 cctccatact ttcatatagt tgtaaaagtc ttaaatgaat gttaataata catacaatgc 480 ttattataag taataaacat caataaaatg aattaggcat aatatttaag attaataata 540 agccttagta aacattaata aatgatagtg atagttatag acattagaca gttataaagt 600 tttgcttact tggactttta tcctaacatt tttttttttt tttgctttat cagattttaa 660 atcatggttc gcaattatca aaggaaaacc agtagaggta ccactgctaa atgcacaatc 720 cttgaagcag tgaaagctgt aaagttacat ggcagaagca tagcccaggt gtctctagat 780 ttcaacatca actatagaag cctacaacgc tatttgtcaa aagcctctac agaagatctt 840 caagaaaaat cgacaatacc ttcatttcac actggctacg tacaacctag gtttgtagat 900 tgttaatatg tttaaggatt ataatatttg ttattataag tgactccata aaataagtat 960 ttatactggt atatatttta tattatttca ggcaagtgtt gacacaggag cttcatcttg 1020 aactctgtag ctacttgcaa agagcttcag atatttactt tggccttaca ccaaaagatg 1080 ttaggaaact agcatatcaa ttagcagatg ctaataaatt ggctatgcca aaatcatgga 1140 tggaaaatca ttgtgctggg agagattggt ttacgtcttt cctacaaaaa aatccaacat 1200 tgtctttaag agctccagag gccacaagca ttgcacgtgc cataagtttc aacaaacaca 1260 atgttgaaat gtttttcaat aatcttcagg tgtgtcttca acgtcaccaa ttcacaccag 1320 cagatatttg gaacatggac gaaacagggg taactacagt tcaaaagcct gataaagtga 1380 tctcaagaaa gggccaaaag caagttgggg caataacctc agcagaaaga ggtgctttag 1440 ttaccttggc atgtgctgcc tctgctattg gtaacagtat tccacctttt ttcgtatttc 1500 cgagagtcaa ttttagggac cactttttaa cttctgcacc acctggcagt gatggggacg 1560 caaatccatc tggttggatg aaggaagaca ctttttgtaa atttcttact cattttgtaa 1620 agaatgcacg gccttcaaag gagcatccag ttttgctact ggttgacaat cacttttcgc 1680 acttgtcatt gaatgcacta gactatgcga cagcaaatgg aattattatg ttaagttttc 1740 caccccattg cacccacaag ttgcagccac tagatagaac agtctatggc ccttttaaaa 1800 aggctgtaaa tacagccatc aacagttgga tggtcagtaa ccctggaaag ggtatgacaa 1860 tttacaatat tccagagata gtgaaggttg cctttccatt gagcatgaca ccaaataaca 1920 tactgagcgg attttcttcc tgtggcatcc acccatttaa ccctcaaata tttacagatt 1980 tggactttgc tccaagttca gtaacagaaa tgcctatgcc agtcaaaacc aatcctcaat 2040 taacaatgcc tcaagcttca gaggttacag tctcaccaga acaatccact tgtgcttcaa 2100 cttcaagttt gactttgtca tcacaggagc cttcaacatc tcaagccaaa tcaatttctg 2160 aatgctttga gcttgcatct ccagataaag taaggccatt accaaaggca ctagaaagaa 2220 agccttccaa aaacacacgc aaaaagagaa cttcgtctat attgactgat tcccctttca 2280 aaaattctat gagatcagca cagccactgc caaaaaaatg caaaagttct gctaagggaa 2340 agggaaaagg caaagcaaca aaaaagcata cacaagactc atcatcatca tcagaagatg 2400 aaacctactg tattatttgc atggatactt gggctaatag cagagatgct gaaaaatgga 2460 tcaaatgctg tgtgtgctca agctgggcac atgaagaatg tacagcagga gggtccagct 2520 acatttgcca taactgtgac tctgatgatg attattaatg gtcattttca aagtgtttta 2580 gagtataaat tataatataa tttatatgtt atttttaata atttataaat gattaatata 2640 atacttgttt tctatttgaa aatatttttt tttacctttt agacctacac ttgtgataaa 2700 atataattat gacaacttgc ccctcttatc atgacaactt gccccgacgg tggggcaagt 2760 tgtcatacca tggagccttc tttagaaatg gtggcatgtt atactcaaat ggtctgaagc 2820 tttccaaagt tgttttttaa ataactataa tgataaacta ccatataaaa tagtttttat 2880 gaaaattgac caattggttt ttgaaatatc ttgaaaaata taaaaagtat gacaacgtgc 2940 cccgctctcc cc 2952 // ID Tx1-11_BF repbase; DNA; INV; 6084 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-11_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-11_BF; KW Tx1-11_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6084 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6084 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 848-848 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 285..1094 FT /product="Tx1-11_BF_1p" FT /note="ORF1p is incomplete." FT /translation="MEALLIKLDELKAENATNFTKLNGNFDEMKNDMKKMG FT DILCCHEEKIKSQDELIKNLQGKIESLEARQFHSEQYSRKNCLRVWGFPDT FT YDQKDLPNKFVEFVKEYLQVNILPGEIDNIHYLGGKGKNRHLIVKFATFLA FT KRKVYEERKALKGKKNGEGSFISIRPDLSPASRALFHQAKQLITNGGDNNP FT LSAVWVTIEGRIWATRGDRRVLLTCQEDLEQGVAGDTTGEFVVQRRKTFAK FT RQASRSPEQQPDVKKTNVFGPLAQEMDCP" FT CDS 2087..5887 FT /product="Tx1-11_BF_2p" FT /note="endonuclease and RT." FT /translation="MTQPKVEICSYNCNGLGDRRKRNEVFTWLKDKSYKMY FT CLQETHSTQAVETEWRSEWGGDIFFNHGLSNSRGVMILFKANISYTVHQTL FT SDNSGRWILLDVSLESTRFCIINIYGPNSDDPSIFTNLETTLQSCIDPGSS FT FIAVGDFNTVLNSDMDRTGHRFSSYHPKCLQAINSFCANLDLIDIYRHKNA FT NERRYTWRRRFQASRIDYFLISFSLVSSVQYVKIEDSFRSDHSLIGISLIT FT SPIPRGQGFWRLNQDLLSDPVFVRETSDFIKYFFLCNKGSANPHIVWDSFK FT CCVRGHIIKYSSWKYKQRKKAENSLVNEICRLQVELDSVPNSTTFDKLQEK FT KHDLEILYQIKSNNMIASRRAKWMEQGEKFFMNLVTRDKARKNMFRFVTTD FT GQVKTNPLHVLEEQANFYQNLYSFEEPPMSVDDDVFEPFFPTSNRVSLSPE FT QRGGCEGLITEGELKDAINSFPWGKGPGLDGLSVEFYKVFFNCLKQPMLEC FT FNFSFSEKQLTGTQKEGLISLLLKQDPSGQDKDPSYIKNWRPITLLNCDTR FT VLSKCLALRIKNVISNLISYDQTGFLQGRCISDNIRRILEIMEYYEKNKLP FT GLIFLADFEKAFDTIRVEFIVKALRYFGFGDSFITWIKILYSDISSKIINY FT GYISKPFNLSRGVRQGCPLSPYLFILGVELLASKIRNNIEIKGLDTFGITS FT QLSLYADDSSFPLAPELGSLYALVKDLEIFNLLSGLRSNFDKCRILRIGSL FT SGTNFYLPCSFPILWTDGPVDILGIHIPRHMSDIVHSNFDRKFEKIDKILM FT LWKSKSLTLMGKIILVNTLIVPQFIFLLTSLPTPPSDFFDRYEKKIFKFLW FT GDGPERVKRKTLYAVLDQGGLKLRNLRAVDFTLKASWLPKLYLNKEWFCSK FT LILLSNSIFRTDLFPFAQLNKKDFESLLGKISPLSTFFTHVIQAWLKYQLR FT PPEEAVEVKQQLLHLNSNITISGKPFFSQVFLSRGVIFVNDMLDAENSFLS FT YVDFVTKFGVVCPEFKYNQLMSAIPPFWKAKLKLNSPSLGVCLPVCRNIGW FT LRNCNINKCLYQFFMNEASLWGSSERIKLSWEEYFDTPLPWQNIYSLPYKL FT SIDSYTRIFQYKLLFKFLPCNKILYLWGLIDSPLCNLCNEEEETVVHLFWD FT CCVTSNFWRQVDEWFFNINGTHLHINALSIMFGISSDRCPVLSNLILLLGK FT IFVFKNRKHSLNLSAFISFVSYFYKLEKNIAIRQGKLLKHRGKWGNLNNLF FT SVSTIS" XX SQ Sequence 6084 BP; 1779 A; 1079 C; 1186 G; 2040 T; 0 other; catcatctct tgtccattcc tttttggaca cagttttgct gccgggtttg ctatttgtct 60 gtttcagagc gaccccccac cctctcgcaa ttactgtggc tgaccaggtg gcgcacgaga 120 caacatttaa gttttggcgc atcaacattc aagatctagg tttgccgtga ccggacaact 180 aattgtttaa aacatgttct aaaccagtag tacagtgcaa cgtcaatggt cgtgagtact 240 tgatttatta ttctcactcc tctataaact gaacgatagt aaacatggag gctctattga 300 taaaattaga tgaactcaaa gctgaaaacg caacaaactt caccaaactg aacgggaatt 360 tcgacgaaat gaaaaacgac atgaaaaaga tgggggatat tctctgttgt cacgaagaaa 420 aaatcaaatc tcaagatgaa ctgataaaga atctccaggg gaagatcgaa tcactggaag 480 cacgccagtt ccatagcgag cagtattcta ggaagaactg tttacgtgtt tggggcttcc 540 cagacactta cgatcagaaa gacctcccaa acaaattcgt tgaatttgtc aaggagtatc 600 tacaggtgaa cattttgccg ggggaaattg acaacatcca ctatttgggc ggaaaaggga 660 aaaaccgaca cctcatagtt aagttcgcaa ctttcttagc caaaaggaag gtctatgagg 720 aaagaaaggc tctcaaaggg aagaagaacg gcgaaggttc tttcatctct attcgtcccg 780 atttgtcacc agctagtagg gcgcttttcc accaagcaaa acagttgatt actaatggag 840 gtgataacaa tcccctgtca gctgtttggg ttacgatcga gggaagaatc tgggcgacac 900 gtggggatcg ccgggtgttg ttgacatgcc aggaggacct ggagcagggc gtggcaggcg 960 acacaacggg ggagtttgtg gtacagcgaa ggaagacctt cgctaaacgc caagcttcca 1020 ggtctccaga acaacagccg gacgtgaaaa aaactaatgt tttcggacct ttggctcaag 1080 agatggactg tccctgattt ttgttctatc tccggttgat ttttccccgc tttgcgttct 1140 gagtctgata gacacgggtg aatgtattca gtttgtaatt aatccggtgt tttttcacat 1200 gggagaaaat gttgagacct ttggttgctg ttgttgtttc ctccgagtag gatggaggct 1260 gaatccgcat gacttctccc ttagttttgt tccttttttt ttgctctgca ttttagcaac 1320 aagatagatg acttgattaa ttatacctta gttcctattt tcttttctct cttgttgctg 1380 ttattggatt gttttgtatg cggaaatgtt taaatagtgt aagactaaag tcattgatat 1440 gctaaagttt aaaaaatgta tgtatataat atacaatgta gtttcaaaag taccaggtag 1500 ttgctttttc ctctgaatta aaagtcagtt gtgtatttct cattcgtagt atggcaaaaa 1560 cagatttgtt attaatggca actccttaga catgttttat ttttattatt cttattttat 1620 gttgagagga aatgccagct tatttttctt tttgttggat ctttttgctg agatagtatc 1680 gtcttttttt ttctgcatac caacaaataa acactggtac agtaaagtaa attttgatgt 1740 ttgtgaaact acagtcaatg tcaatgtcaa tcattatttt gttggaatcc tgtatttgag 1800 gaaaatgatt aaggattagt cgaccattgt ttgagttgct ttgtagtaga aatatgcgtg 1860 cttttttaga gcatgtttgg ttatgtgcat ggcatgtttt tctatgttcc tgttttctca 1920 aatgcaaatg tgttctttac agtactgtta actatgaaaa accctcattg cccagtagat 1980 acaatataaa tatttcgcat tgttgtcaaa ctttaattct tgcttcctgc ggactccact 2040 ccttttttga tctttataaa acatttacta ttcatgtacc tatttcatga cacagcctaa 2100 ggtagaaatc tgtagttaca attgcaatgg actgggagac cgcaggaagc gaaatgaagt 2160 ttttacttgg ttaaaagaca aatcttacaa aatgtattgt ttacaagaga cccactccac 2220 acaagcagtg gaaacagaat ggcggtccga atggggggga gacattttct tcaatcatgg 2280 tttaagcaac agcagaggtg taatgatctt atttaaggca aatatttcct atactgtcca 2340 tcagaccttg agtgacaact ccggtcggtg gattttgttg gatgtttcgt tagagtctac 2400 acgtttctgc ataatcaata tttatggccc taattcggac gacccttcta tttttacaaa 2460 tttggaaact actttacaaa gttgcataga tcctggatca tcctttattg ctgtaggaga 2520 ctttaatact gttttaaatt cagacatgga cagaaccggg catcgatttt ctagctacca 2580 tcctaaatgt cttcaggcaa ttaactcttt ttgcgcaaat ctcgatttaa tagacattta 2640 tagacataag aatgctaacg aaaggagata cacttggaga cgtcgttttc aggccagccg 2700 cattgattac tttttaatat ctttctctct agtgtcttcc gtacaatacg ttaagataga 2760 agatagtttt agatctgatc attcccttat tggaatttca cttatcacct caccaatacc 2820 caggggtcaa ggtttctggc gacttaacca agatttgtta agtgatccag tatttgttag 2880 agaaacctca gacttcataa aatatttttt cctttgtaat aagggttctg caaatcctca 2940 tattgtctgg gactctttta aatgttgtgt tcgtggacac atcattaaat actcttcttg 3000 gaaatacaaa caacgtaaaa aagcggaaaa ttctctagtt aatgaaatct gtagactgca 3060 agtagagtta gactctgttc caaattcaac aacttttgat aagttacaag aaaagaaaca 3120 tgatttagaa attttgtatc aaataaaatc taataatatg attgccagta ggagagccaa 3180 gtggatggag caaggggaga aattttttat gaatttagtt actagggata aggccaggaa 3240 gaatatgttt agatttgtaa cgacagatgg tcaggttaaa actaatccgc tgcatgttct 3300 tgaggagcaa gctaatttct atcaaaacct ttattcattt gaggagccgc ccatgtcggt 3360 ggacgacgat gtttttgaac cctttttccc gacatctaat cgtgtgtctt tgtctcctga 3420 gcagcgtggg gggtgtgagg gtctgattac tgagggggaa ctgaaggatg caattaactc 3480 tttcccctgg gggaaaggtc ctggtcttga cggcctttca gtagaatttt ataaagtttt 3540 ctttaattgt ttaaagcagc ctatgcttga gtgttttaat ttttctttca gtgaaaaaca 3600 gttaactggt actcaaaaag aaggtttgat ctctttgtta cttaaacaag acccatcggg 3660 acaggacaaa gacccttcat atataaagaa ttggcgccct attaccctcc ttaactgtga 3720 tacacgtgtg ttatccaaat gcctcgccct tcgcataaaa aatgttatct caaatttaat 3780 atcttatgat caaactggtt tcctgcaggg gcgatgtatt agcgataaca tacggagaat 3840 tttagaaatt atggaatatt atgagaaaaa caaattacct ggtttaattt tcctggcaga 3900 cttcgaaaaa gccttcgaca ccatcagagt agaatttatt gttaaagcac ttagatattt 3960 tggttttgga gactctttta taacatggat taaaattctt tatagtgaca tctctagtaa 4020 aattattaat tatggttata tttccaagcc ctttaacctg tcccgtgggg tccgacaagg 4080 ttgtccttta tctccgtatc tttttatatt aggagttgaa ttgcttgcta gtaaaattag 4140 aaataatata gaaatcaaag gtttagacac ttttggaatt acatctcaac tctctctata 4200 cgcggacgac tctagttttc cccttgctcc cgaacttggc tctctatacg cgttggttaa 4260 ggatttggag atttttaacc ttctatctgg tttacgctca aattttgata aatgtagaat 4320 attacggatt ggatctttaa gcggtacaaa cttctactta ccttgttctt tcccaatact 4380 ttggacagat ggccccgttg acattcttgg aattcatatt ccgagacata tgtccgacat 4440 tgttcatagt aattttgata gaaaattcga aaaaatagac aagattctaa tgttatggaa 4500 atctaaatcc ttaaccctta tgggaaaaat tatcttggtt aatactttaa tcgtcccgca 4560 atttattttt ctattgactt ctctcccgac acctccctct gacttctttg atcgatatga 4620 gaaaaaaata tttaaatttc tatggggaga tggacctgag agggttaaaa gaaaaacttt 4680 gtacgccgtt cttgatcagg gaggcctgaa attaagaaac ttgcgggccg tagattttac 4740 tttaaaagct tcttggctac ccaaactata tttaaataag gaatggtttt gctctaagct 4800 aattctttta tcaaattcca tctttcgtac tgacttgttt ccctttgcgc aacttaacaa 4860 aaaggatttt gaatctcttt taggtaaaat atccccttta agtacattct ttacccatgt 4920 tatacaagca tggttgaagt accaacttag gccacctgag gaggcggttg aggtgaagca 4980 gcagcttctg catctaaatt caaatataac tatcagtggt aagcctttct tttcacaggt 5040 atttcttagt aggggggtca tatttgtaaa cgacatgtta gacgctgaaa acagcttctt 5100 atcatatgtt gattttgtta caaaatttgg agtagtgtgt cctgaattca aatataatca 5160 acttatgtct gctattcccc ctttttggaa ggctaagctg aaattgaatt ctccaagttt 5220 gggtgtctgt cttcctgtat gtagaaacat aggctggctt cgtaactgta atattaataa 5280 gtgtctgtat caattcttca tgaacgaagc cagtctatgg ggctcatctg aaagaattaa 5340 actttcatgg gaagaatatt ttgacacccc actaccatgg caaaatatat atagtttacc 5400 ttataaactt tcgattgatt catatactcg aatatttcaa tataaactat tattcaaatt 5460 tttaccttgc aataaaatat tatatttatg gggtttgata gactccccat tatgtaacct 5520 ctgtaacgag gaagaagaaa cagtcgtcca ccttttttgg gactgttgtg taacttcaaa 5580 tttttggcgt caggtggacg aatggttttt caatataaat ggtactcatt tgcatattaa 5640 tgctctcagt ataatgttcg gtatttctag tgatcgttgc ccggttctgt cgaatcttat 5700 cctcctttta ggcaaaatat ttgtttttaa aaatcgtaag cattctttga atctgtctgc 5760 tttcatatcc tttgtttcat acttttataa attagaaaaa aacatagcaa ttcggcaagg 5820 caaactgctt aaacacaggg ggaagtgggg aaacttaaat aacttatttt ctgtcagcac 5880 catatcttaa acaaatgtat agtgatattg ttgaatgaat atgggcaaag gggtttagga 5940 ctaacgttaa ctgttgttac tggtttttgc ttttattttg tggttttaca tgtgctttgt 6000 ttaacgtatc tgtagctgcc atgctttctt tttgttgttt gttttgcaaa aagtaataaa 6060 gaatgattga aaaaaaaaaa aaaa 6084 // ID BEL-35_CQ-LTR repbase; DNA; INV; 368 BP. XX AC AAWU01003719; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-35_CQ_; KW BEL-35_CQ-I; BEL-35_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 224-224 (2011). XX DR GenBank; AAWU01003719; Positions 67808 68175. XX SQ Sequence 368 BP; 116 A; 86 C; 88 G; 78 T; 0 other; tgttcgccat cgcgtccgtt tccgaggctc ttcaagcgta aaataaaaat ttctgacaat 60 tggcaacact ggttgctgcg attggcaaac ccccgcgccg ctgtcaaaac gtcaaaggcg 120 aaaagcaacg gcaacctgat cgtcagattt agcgcgaacg aaaagcgcgc gcaaagaagg 180 ggagaaaaag gaaaccgaac caaaattcat tccgttcgca aactttgttc agtacagatg 240 taaataaatc ggtcgtttta gaaatttgtt agagaaaacg agtgtttcac tgccgtgcca 300 gaacgtggtt tacagtcccg cgaaaataga atcaaaatcg tcccgggttg aggcaatcgg 360 ccgcaaca 368 // ID DNA8-11_AP repbase; DNA; INV; 667 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-11_AP. XX NM DNA8-11_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-667 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1753-1753 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 667 BP; 179 A; 93 C; 115 G; 280 T; 0 other; cagggcttga aaccggtatt aagaaatttc ggtttcggtt tcgattttgt ataccggttt 60 cgaaatttta cggtttcggt ttcgacttag caataccggt attaagaaat ttcggtttcg 120 gtttcggttt tgtataccgg tttctaaagt attttcaggg ggctatgtcg tctatacaat 180 tatcattctt tgttccgtat tgtcaattgt catatatagt atttgctagt aactttaatt 240 tatgtcatta ctatttataa agcaactaat aattttgcga ataatgaaat gtttatcaac 300 aaaattaatt atgatctcaa ttttattatg attattaatt atgcgtatta atggaattta 360 cgtatttaaa aacacgtcag ttaaaaaaaa aataacggtt tccaattttt tcagatttcg 420 gttttgaatt taataccggt ttccaatttt ttacggtttc gattttgaat ttaataccgg 480 tatccaattt atttcggttt cggttttaaa ttataatacc ggtatccaat tttttccggt 540 ttcggttttg actttaatac cggtatccga tttttttcgg tttcggtttt gaaacggttt 600 cggtttcggt tttaaaacgg tttataacgg tttctgaaat cggttttgaa aacggtttca 660 agccctg 667 // ID hAT-80_HM repbase; DNA; INV; 3840 BP. XX AC . XX DT 16-SEP-2009 (Rel. 14.09, Created) DT 16-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-80_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3840 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1922-1922 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1215..2120 FT /product="hAT-80_HM_1p" FT /translation="MMAKAVTIEHEHASMLGFGNDIGLWPELASTDMITFW FT AEKGPTSLLQNCDQDTLRKYSISQKQEKPDGSTFFRKCTPNLFLRKTTNSE FT TCQRNWLCFSSHTGKVYCFICKLTGSKSITTNFSSSGFCYWKHASVRVSEH FT ESSKGHLEAIITLAQRGKKSGGIYHELAKQESEICVYWNQILQRVISTIKF FT IAERGLAFRGDNEIVGSPHNGNYLGILELLSEYDTFLAAHIKMHANKGRGH FT TSYLSSTICEEIINLMGSQVLKYIFCSIKKSKYFSVSVDSTPDEAHIDQLT FT IIIRYMDMKS" FT CDS 2180..3550 FT /product="hAT-80_HM_2p" FT /translation="MTEALLKYLESDGISLHDCRGQSYDNAANMSGKYCGM FT QTLIKEQNDLCIYVPCCAHSLNLVGKAAANTCSAAVKFFDFVQEVHVFFTA FT SPSRYQILTEALSSSQEKTNKKKLLTPKGLSDTRWSCRHDAVKALVVGYDK FT FKVALKKISINKEEKDIVRNAAAGLHKQMCKIEIALYSLFWNEILERFNAT FT NKILQSPEMVLQSAVDALTSLKAFVESKRECFAQYEERAKKLSGSTEYSQA FT QRRQRNRNVRLNPLGYETGDEVQQSPSDQFRTTSFIPVIDQFNVSLTDRIN FT AYKNVCEYFGFLNRLDEMEPNEIMMSANNLVNKYKPDLEESLGNELVQFSA FT LLKLYMDDYDANANVPKELFFYKILNANNFISSFRNVEILLRMYLVLMVSN FT CSGERSFSKLKLIKNRLRTNMSQNRLSYLTLMSIESDLLREINFSDIVKTF FT VLQKARKAYISL" XX SQ Sequence 3840 BP; 1324 A; 611 C; 700 G; 1205 T; 0 other; cagggccgga tgtaaagggg ggggcaaccg gggcgctagc ccggggcccc cacaaattat 60 gaaaaaaaaa aaattcaatg acatgcctta aataagggta acgataaaaa tagaggtata 120 agttattata tttaatataa agtttatgta aactttgtat tttagagatt acttatattt 180 aatttatata tattaggtat attagccaaa ttcatatcgc gcggtttgta gggaaaaaaa 240 taaaaatggt agcaaataaa taacaacaaa ctttcgcaaa gtctatttgt ttcaatgtaa 300 atggttttgt ttgtatcaaa tgttaatcta ttttattaaa aatagtggtt taagttgaaa 360 tttgtgtcat tttagaaaaa attggtttaa atttttgaaa aatctttttt taaatgtaaa 420 tggttttgtt tatattaaat gttaaactta ttttatcaaa aatttttgtt tactttttaa 480 gttatataag gtcaaaaaag atctaatctt tgaataaagt gaaatgtgta ctgtggctca 540 aagaaactag taaattagta cgcatttaat agcttttata agttattatc catataaaaa 600 gttcaacatt tgggtcaaac ttctgttgga tagcatttat tttcaaatat acattaggaa 660 cagctttagc attagctgat catcaacatt ttaaacagtg atacagagtc tcaaaaaaag 720 acttcggctt tgaaagtcat atagggttta gggatagagt cagaaagggc atatagggtt 780 atgaaattgt catatcccta atatatatat atatatatat atatatatat atatatatat 840 atatatatat atatatatat atatatatat atatatatat atatttacat attaagttat 900 agcagtaaca tttaacacta aaatttaatt ttttttagaa aaataatgtc tgacagacat 960 aaatcttttc gcaactatga aagtggatct tcaaagcgta agtcagctca agaaaaagaa 1020 aaaagcacag gaagtttttg aaatcaagaa aaattagtgc cttttttagt aaacgagatg 1080 gagatttaga ttcttcaaag tcactacctg gaacttcagc tatactacca ccagttgagg 1140 agtttctatc gcaagacatt ggtgacaatc aaacaatagc agaagcagat gagcaacagg 1200 aagcaacagt agctatgatg gcaaaagctg taacaattga acatgaacat gcctctatgc 1260 ttggatttgg gaatgatatt ggtttgtggc cagaattagc ttcaacggat atgataacat 1320 tttgggcaga gaagggacct acctcgttgt tgcaaaattg tgatcaagat accctaagga 1380 aatattcaat atcacagaaa caggaaaaac ctgatggctc aacctttttt cggaaatgta 1440 ccccaaattt gttcctacga aaaacaacaa atagtgaaac atgtcaacgt aattggttgt 1500 gcttttcctc acacacagga aaagtatact gctttatttg caaattgact gggtccaaaa 1560 gtatcactac taatttttca agttctggtt tttgctattg gaaacatgcc tcagtgcgtg 1620 tcagtgaaca tgaatcatct aaaggccatc tggaagcaat tataactttg gcacagcgtg 1680 gcaaaaagtc tggaggaatt taccatgagc tagcaaagca ggaatctgaa atatgtgtgt 1740 actggaatca aatactgcaa cgtgtaataa gtactataaa gttcattgca gaacgagggt 1800 tagcttttcg aggtgataat gaaattgttg gatcaccaca caatgggaat tatctaggaa 1860 ttttagaact gctgagtgaa tatgacactt ttcttgctgc tcacattaag atgcatgcga 1920 ataaaggaag aggccatacc agttatctgt cttccactat ttgtgaagaa attataaatt 1980 tgatgggctc acaagtttta aagtacattt tctgttctat caaaaaatca aagtatttct 2040 cggtatctgt cgattctaca ccagatgagg ctcatattga tcagttaact attataattc 2100 gttacatgga tatgaaaagt taataccgaa tgagagattc ctaactttta tccctaatac 2160 aggtcacact ggtcgtgaaa tgacagaggc attattgaaa tatctggaat ccgatggaat 2220 cagtttacac gactgtcgag gtcaatcata tgacaatgct gccaacatga gcggtaaata 2280 ttgcggaatg caaactttga taaaggaaca aaatgaccta tgcatttatg tcccatgttg 2340 tgcacattcg ctgaatttag ttggtaaggc agcagcaaat acatgttctg cagcagtgaa 2400 attctttgac tttgttcaag aagtgcacgt ctttttcaca gcatcacctt cacgttacca 2460 aatcctaaca gaagcactgt ctagctctca agaaaagaca aacaagaaaa aactgctgac 2520 acctaaaggg ttgagtgata ctcgttggtc atgcagacat gatgcagtta aggcacttgt 2580 tgttggatat gacaagttta aggtggcttt gaaaaaaatt tcaattaaca aggaggaaaa 2640 ggatattgtt cgcaatgctg ctgctggtct tcacaaacaa atgtgcaaga tcgaaattgc 2700 gctatattca ttgttttgga atgagatatt ggagcgattc aatgccacaa acaagatact 2760 ccagagtcct gaaatggtat tgcagtctgc agtagatgcc cttacttcac tgaaagcatt 2820 cgtggaatca aaacgagaat gttttgctca atatgaagaa cgtgcaaaga aattatctgg 2880 ttcaactgaa tattcccaag ctcaacgtcg tcagcggaat agaaatgttc gcctcaatcc 2940 attgggttat gaaactggag atgaagttca gcaatcccca agtgatcagt ttcgcacaac 3000 aagttttatc ccagtgattg accaatttaa tgtctcactt acagacagaa taaatgctta 3060 caagaatgta tgtgagtatt ttggctttct gaatcgcctt gatgaaatgg aaccaaacga 3120 aataatgatg tctgcgaaca atttggttaa caagtacaag ccagaccttg aagaaagtct 3180 cggaaatgag cttgttcaat tttcagccct tctaaagctt tatatggatg actatgatgc 3240 taatgcaaat gtgcccaaag aacttttctt ttacaaaatt ttaaatgcga ataacttcat 3300 ctcaagtttt cgaaatgtag aaatcctatt aaggatgtac ttggtcctaa tggtctctaa 3360 ttgttcggga gaacgatcgt tttccaaatt gaaacttatc aaaaacagac taaggaccaa 3420 tatgagccaa aatcgacttt cttatttgac actaatgagc atagaatccg atttactacg 3480 ggagattaat ttctcagata ttgtgaaaac atttgttttg caaaaagcta gaaaggctta 3540 catttcttta taatcttaca aatttttaca aagttgaaca aaatacatgt ttatatcatt 3600 atatgtttca tgtctattga ttgtttttat ttgtccatac attgctttag cattaccccc 3660 aatttatatg acggtcaaac cttatgcgga gttactttag attttaacgc attacattac 3720 aatacaattt tagtaccggt aaacggtact aaacgttcgc tatgcttgct tgaaaatctt 3780 tctcggggcc cccacaaatt tcagtgcccc agggccacac aattgtttaa tccggccctg 3840 // ID I_Ele43 repbase; DNA; INV; 6823 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele43. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6823 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6823 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with 93-96% identity, and ~94% identical to the CC original sequence in [1]. Both termini are frequently associated CC with (TAA)n or (TTA)n microsatellites, indicating CC sequence-specific insertion into microsatellites. XX FH Key Location/Qualifiers FT CDS 560..1849 FT /product="I_Ele43_1p" FT /translation="MAAASSGDPGGSAKRRLPEYMDPTNQFGELTFLQLSG FT KNGAPLPINPYITGKSVEACAGGPIESAKTEAQGTKYTLRVRDPAQVAKLL FT KLTKLIDGTEVEVVPHPNLNVSRCVISCXDLIQMEEKDILTEMISQKVIRV FT QRITRNEGGKRVNTPALILTFCRTTYPEYMKVGLLRVATRPYFPNPMLCYG FT CFSYGHTRVRCPGPQRCVNCSQNFHGEECGEAPSCRNCKGDHRPTNRQCPV FT YKKEVQVIKIKVSENLSFPEARKRVEQQAGSFAQVAAQQSVFERKLKELEA FT AMLQKDKEIARLQEDNKKKEERIEQMMAFIKQVKQQSNPERVHHVSETVVA FT EKPRHSREQRVAQSTAGPMTRSRNNSPAVQETKRGRPPKFVYPKPAXSPDT FT SPPPKKTAPTTHDLTQMEYSGEESEVSETPPNQRLR" FT CDS 1785..6665 FT /product="I_Ele43_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="LRWSIPAKSLRYRRHPLTSVFDNPTLEQRSEYNDDPR FT TNTDNETFRFGSRESVVPLPTQVRQAEGVIRDVLPQPVDDEFTSVADGPND FT YTTTLQTQSAPTSSNSIESNNRHTDDTTQSLSPVPAIAGVSGSSRVVVSAM FT AGTVGQDVPQVSLLNSQPQTGTAAPAIFSDTHHTTPTAANPNQQRPTSTNR FT LAINNRPEETTQSLSPVPAVAGVSGISRVVVSAMAGTVGQDVPQVSFRLSL FT PQSGTAAPAILSDTHQTPPTTANTNRHQPASTYRLVFNNQPTDDTTQSLSP FT VPAVAGVSGSNRVVVSAMAGTVGQDVPQVSLSKFPSQTVAAAPDYLSGYAA FT SPPSPGEGPSRVHRRNITQSASPVPAVVGIGRTSVMVSTAAGTVGQSVPQA FT SDSLVPSSSEATALPLSPTTDPPIRRSSTSSSTTRSATDSRSGGCFALQWN FT IRGLRANISELKLLISDLEPCVVALQETKVNQTVVPPDFVGRNYTLLLQST FT NANYWQHGVGLAIREGIPFERIQVDTSLHLVAARLHAPIQATVVSIYVPPS FT SAQCQTALDELFDQLEGPILLLGDFNAHHIAWGSHQSSALGRFIAETTLTK FT QMVILNDGSATRIDPASGNTSAIDVTICTESLARRFTWRTLTDTHGSDHFP FT IAVSIPGWSNQQTTRRKWLYDRADWELYERITAETIRPEVEWNVGSFTEKL FT IEAASKSIPRTSGRIGPKAVPWWCPEAKAAIRRRRKCLRSLRRLQQVNPDQ FT PEALAEFQAAKAEARKAIKEAKEKSWESFVGKISPHSTTTELWRTVNTLRG FT NRQKRSVILKRPNGFTDNPEEVAEELATYYGERSATSSYPPSFQMAKMAAE FT RESIDVSSNTGDAYNVDITLAELLWALDKGRGASTGTDGIGYPLLQRLPVS FT VKFVLLELLNKIWRNGEFPASWRNALVVPIPKPNCQDSGPAAFRPISLTSC FT MAKTLERIINRRLITELESNGRLDKRQHAFRVGRGTDTYFAELERSIPNAD FT EHCLIASLDLSKAYDTTWRHGILRTLKSWRIRGRMMNLLQSFLSERTFQVS FT LGGYTSSEHKLENGVPQGSVLSVTLFLVAMQPIFRVLPDGVDILLYADDIL FT LVVKRAKSEGLHTKLQAAVKAVCKWAKSVGFAMSAAKSHVFYCSPNARREP FT ARNVIVDQVSVPKTNRLRVLGVTLDRTLTFKPHCKMVKKACESRLRILQMI FT GAKLPRGNRTSLLQVGSALVTAKLTYGIGLVSRGGPATLQTLAPAYNKMVR FT YASGAFVTSPIISVMAEAGTLPFELLAVQCVARTAIRILAKNSRNNTLPLI FT RRASDRLEELTGTALPAVGQLVRQGDRAWNERKPSIVWDVKRSIRAGDPPE FT KVRPVVQQLLSTRFRNSTVVYTDGSKCEDTVGAAFYNNGISGTFSLPKECS FT VFSAEAYAIKKAVSIPNIRNEVVILTDSASCLQALEAGSSKHPWIQEIEHI FT VRNKNVRFCWIPGHAGITGNNEADRLANAARQQPAIDTPIPGEDALRATKE FT AIRCRWDRQWFAIRDNKLREIKHDTKRWTERGNAADQRVLTRLRIGHTRLT FT HTFLLKKESPPTCECCGTSLDVRHLILHCRKYEQERRKYDIDTLSLKEALD FT DTKERTKSC" XX SQ Sequence 6823 BP; 1779 A; 1849 C; 1789 G; 1401 T; 5 other; cagatcgttc cggcacactg tgttgaccgg ttgtcttcgt tcacgcgttc gagttaaacc 60 aggtgttttt ccgagttttt cgcggttttc gcgatcgaaa cgcgtgtgaa tctgtggttg 120 cggtggtgaa gaataagtgc atattgaagg tttggtgaaa atcggtgtta aaacggccga 180 aaacggcgaa aaaaagtgac acccacgtgc tttgaaacgg ccaaacaaat cgccgtcagc 240 cggctccccg gccgttctca ttcagcattc ggtggtgcac agtgtgcttg aattgttttt 300 cgctgaaaca aaaaaaaaag tgtcggaagt ttgggaaata ggtggaattg agaaagtgag 360 cgattctcat caggggtttc actacagtag tgtccagtag tgcctacagt gcggcacaaa 420 attttgggaa cgcctggcca tcggtctcta cgtccgagga gaaaataagg taggatcttt 480 ccttttactt ttccgtcgtc caattggggt aggtaacggg tgaccgttgc cagtggaagt 540 gttttccacg taaaacaaca tggcggccgc tagtagcggc gacccagggg ggtccgccaa 600 acgtagattg ccggagtaca tggacccaac gaaccaattt ggcgagttga cgttcctcca 660 gctgtctgga aaaaacggag ctccgcttcc gatcaacccg tacattaccg ggaaatcggt 720 tgaggcatgt gccggcggac caattgagag cgcgaagacc gaagcgcagg gtacaaaata 780 caccctaaga gttcgcgacc cagcccaagt agccaagtta cttaagttaa cgaagctaat 840 tgatggaact gaggtagagg tcgtccctca ccccaatctg aacgttagta gatgcgtcat 900 ctcstgcwtg gacctmattc agatggagga aaaggacatt ttgacggaga tgattagcca 960 gaaggtgatt cgtgtgcagc gtatcacccg aaatgaaggt ggcaaaaggg tcaatacacc 1020 ggcgctgatc cttacgttct gtaggaccac gtacccggag tatatgaaag ttggcttgct 1080 ccgtgtcgct actcgcccct actttccgaa cccgatgctc tgctacggct gcttcagcta 1140 cgggcacact cgtgttcgtt gccctggacc gcaacgctgc gtcaattgct cgcagaactt 1200 ccacggggaa gaatgcggtg aagctccgtc gtgccgtaat tgcaagggtg atcatcggcc 1260 aaccaatcgc cagtgtccgg tgtataaaaa ggaagtgcaa gtgattaaga tcaaagtgag 1320 cgaaaacctg agtttcccgg aggccagaaa acgcgtggag caacaagctg gtagcttcgc 1380 ccaggtagcg gcccaacaga gcgtttttga gaggaagctg aaggagctgg aagcggccat 1440 gctccaaaaa gataaggaaa tcgccmggtt acaggaagac aacaaaaaga aagaggagag 1500 gatcgagcag atgatggctt tcatcaagca ggtcaagcag cagtctaacc cagagagagt 1560 gcatcatgtg agcgaaaccg tcgtcgctga gaagccccgc cacagccgag agcaacgagt 1620 ggcccagtcg acggctggcc cgatgacacg ctcaaggaac aactccccgg ccgttcagga 1680 gaccaagcgc ggaagacctc caaaattcgt ctaccccaaa ccagccwcct cgccagatac 1740 cagcccgccc ccgaagaaga ccgcacccac tacccatgac ctgactcaga tggagtattc 1800 cggcgaagag tctgaggtat cggagacacc ccctaaccag cgtcttcgat aaccccactc 1860 tcgaacaacg ttcggaatac aacgacgacc ctcgcacgaa cacggacaac gagacttttc 1920 ggttcggaag tagggaaagt gtagtacctc ttccgacgca agtacggcag gctgaaggag 1980 tcatccggga cgtcttaccc caacccgttg atgacgagtt cacctccgta gccgatggac 2040 ccaacgacta taccacgaca ctacaaactc aatcggcacc aacatcaagc aacagcatcg 2100 aatccaacaa ccggcacacc gacgatacta cacaaagtct ttccccagtg ccagctattg 2160 ccggtgtttc tggaagcagt cgtgtagtag tttcggcaat ggctggcact gtggggcaag 2220 acgtcccaca ggtcagtctt ttgaattctc aaccacaaac aggtactgcc gcgcctgcaa 2280 tcttttcaga tacccaccat accacaccaa ccgcagcaaa tcccaatcaa cagcgaccaa 2340 catcaaccaa tcgtctcgct atcaacaacc gccccgaaga aactacacaa agtctttccc 2400 cagtgccggc tgttgccggt gtttctggaa tcagtcgtgt agttgtttcg gcaatggctg 2460 gcactgtggg acaagacgtc ccacaggtca gttttcgctt atccctacct caatcaggta 2520 ctgccgcgcc tgcaatactt tcagacaccc accaaacacc accaaccact gcgaatacca 2580 atcgacatca accagcatca acctaccgcc tcgtattcaa caaccaaccc accgacgata 2640 ctacacaaag tctttcccca gtgccggctg ttgccggtgt ttctggaagc aatcgtgtag 2700 tagtttcggc aatggctggc actgtgggac aagacgtccc acaggtcagt ctttctaaat 2760 tcccatccca aacggttgct gctgcgcctg attatctttc agggtacgcc gcatcaccac 2820 catctcccgg agaaggccct tcgagagttc accgcagaaa cattacacaa agcgcttccc 2880 cagtgccggc tgttgtcggt attggcagaa ctagtgtaat ggtttcgaca gcagctggta 2940 ctgtgggaca aagcgtccca caggcaagtg acagtttggt tccttcgtcg tcagaggcta 3000 ccgcactccc tctttctccc acaacagacc cgccgatccg aagatcatca acctcgtcat 3060 ccaccacacg atcagctacg gatagtcgct ccggcggctg cttcgccctc cagtggaata 3120 tccgtggtct tagggccaac atcagcgagc taaagctgct aatctccgac ctcgaaccgt 3180 gcgtcgtagc tttgcaggag accaaggtga accagactgt tgttccgcca gacttcgttg 3240 gcagaaacta cacgttgctg ctacagtcga cgaacgctaa ctactggcaa catggtgtag 3300 gccttgccat ccgggaaggc ataccgttcg aacgcatcca agtcgatacc tctctgcatc 3360 tcgttgctgc tcgccttcac gcgccgatcc aggcaactgt tgtgtcgata tacgttccgc 3420 cgagttcagc tcagtgccag accgcactgg atgaattgtt cgaccagctg gaaggtccga 3480 ttttactcct cggagacttc aacgctcacc acatcgcatg gggatcgcac caatcaagcg 3540 cgcttgggcg attcatagcc gaaacaaccc tgacgaagca aatggtgatc ttgaacgacg 3600 gctcagctac ccgcatcgac ccggcttcgg gcaacacttc ggcgatcgat gtgaccatct 3660 gcacagagag tctggctcgg aggttcacct ggcgaacctt aacagacacg cacggtagcg 3720 atcacttccc gattgcagta tccatacctg ggtggtcgaa tcagcaaaca acgcgacgaa 3780 aatggctgta cgatcgggct gattgggagc tatacgaacg gattacagca gaaaccatcc 3840 gcccagaggt cgagtggaat gtaggaagct tcaccgaaaa actaattgaa gcagcgtcta 3900 agtccatccc gcggactagt ggccgaatcg gtccgaaagc ggtaccatgg tggtgccccg 3960 aggcaaaggc ggcaattcgt cgacgacgaa agtgtcttcg gtcactccga cgtttacagc 4020 aggtcaaccc agatcaaccc gaagcgttgg cggaattcca agcagcgaaa gcggaggcgc 4080 gaaaggcgat taaagaagcg aaggagaaat cgtgggaaag cttcgtaggg aagatctccc 4140 cgcacagtac aacgaccgaa ctgtggcgca cggttaacac cttgcgtgga aatcggcaga 4200 aacgctcagt tatcctcaaa cggcctaacg gtttcacgga caaccccgaa gaagtagcgg 4260 aagaactagc gacgtactac ggcgaaagat cggcgacgtc cagctaccct ccatcgtttc 4320 agatggcgaa aatggcagct gagagagagt ccatagatgt ttcgtctaat accggcgacg 4380 cgtacaacgt tgacatcacc ctagccgaac ttctgtgggc tctcgacaaa gggcgaggtg 4440 cctcgacagg taccgatggg atagggtacc cgctgcttca acgtcttccc gtgtccgtaa 4500 aatttgtact gctggagctg ctgaacaaaa tctggcgcaa cggtgagttt cccgccagct 4560 ggcggaacgc cctcgtcgtc ccgattccga agccgaactg ccaagattcc ggtcctgctg 4620 ccttccgccc tatctcgctt accagctgca tggcgaagac actcgagcga atcatcaacc 4680 gtcgtcttat cacggagcta gagtcgaacg gacgacttga caagcgtcag cacgcttttc 4740 gtgtgggacg cggtaccgac acgtacttcg ccgagctgga gaggtcgata ccgaacgccg 4800 acgaacactg tttgatagcg tctctggatc tgtcgaaggc atacgacacc acctggcgac 4860 acggtattct tcgcaccctg aaatcatggc ggatacgtgg tcggatgatg aaccttctgc 4920 aaagcttcct ctcggagcga acgtttcagg tgtcgttggg tggttacacg tccagtgaac 4980 acaagctgga aaatggtgtg ccgcagggat cagtgttatc cgtaacgctg ttcctggtgg 5040 cgatgcagcc catcttccgg gtactgccgg atggagttga catactcctg tacgccgacg 5100 acatcctcct cgtcgttaaa agggctaaaa gcgaagggct acacacgaaa ttgcaggccg 5160 ccgtgaaggc tgtatgtaaa tgggcgaaga gtgtcggctt cgcaatgtct gctgccaagt 5220 cgcatgtgtt ctactgcagc ccgaatgcgc gtcgggagcc ggcgcggaac gtcatcgttg 5280 atcaagtctc cgtcccgaaa accaaccggc tgagggttct aggtgtcact ctggatcgca 5340 ccttgacatt caaaccgcat tgcaaaatgg tgaagaaagc atgtgagtca cgtctgagaa 5400 tactccagat gatcggagcc aaactaccgc gaggcaatcg gacaagtctg ctgcaagtcg 5460 gatcggcgtt agtaactgca aaactgacgt acggtatcgg attggtgagc cgaggaggac 5520 cagcaacatt gcagaccctc gctccggcat ataacaaaat ggttcggtat gcttctggag 5580 cgtttgtcac cagcccgata atttcggtca tggccgaagc gggcacctta ccgtttgagc 5640 tgctggcggt tcagtgcgta gcgaggacag ccatccgcat tctagcgaag aacagtcgaa 5700 acaacactct tcccttgatt cggcgcgctt cggatcgttt ggaggaactc acaggcacgg 5760 cactccctgc tgtcggtcaa cttgtgagac aaggcgatcg tgcttggaac gaacggaaac 5820 cctcgattgt gtgggacgtg aagaggagta ttcgagccgg tgacccaccg gagaaagtcc 5880 gcccagtcgt ccagcaactt ctgtcgaccc gcttccgcaa ctcgaccgtc gtctacaccg 5940 atggttcgaa gtgtgaagac acggtaggag ccgcttttta caacaatggc atctccggaa 6000 cgtttagtct accgaaggag tgcagcgttt tctctgcaga agcgtatgcg atcaagaaag 6060 cggtttcaat cccaaacatc cggaacgagg tggtgatcct aacggattcg gcgagttgcc 6120 tgcaggcctt ggaggcagga tcttccaaac acccgtggat tcaagagatc gagcacattg 6180 tgcgaaacaa aaatgtacgg ttctgctgga tcccgggaca tgctggcata accggcaata 6240 atgaagccga tcgcttggcc aacgcggcta gacagcaacc ggctatcgac actcctatcc 6300 ccggcgaaga tgctctgaga gcgacgaaag aagccatacg gtgccgatgg gatcgccaat 6360 ggtttgcaat tagagacaac aagttgaggg agatcaagca cgatacgaag aggtggacag 6420 aacgtggcaa tgcggccgac caacgcgtgc taacgagatt gagaatcgga cacacccgtc 6480 tcacacacac ttttctcctg aaaaaggaat ctccgcccac ttgcgaatgt tgcggaacgt 6540 cgcttgatgt gcggcatctc attctgcact gcagaaaata tgagcaggaa agaagaaaat 6600 acgatatcga cacgctcagc ctgaaggaag ccttggacga taccaaagag agaacgaaaa 6660 gctgttgaag tatctacacg atacaggact gtacgggaaa ctgtgaaatg aaatgtgtat 6720 gtgaaacgtg aatttgtaaa taaatagatt aagtttcttt ttccccgaca cgaatgcacc 6780 cttctggtgt aaagtgtcac taataaacaa aaaaaaaaaa aaa 6823 // ID BEL-77_AA-LTR repbase; DNA; INV; 815 BP. XX AC supercont1.90; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-77_AA_; KW BEL-77_AA-I; BEL-77_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-815 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.90; Positions 1641322 1640508. XX SQ Sequence 815 BP; 287 A; 150 C; 160 G; 218 T; 0 other; tgttccgcca tccgcggacg cgcctccgct accaataagc ccccacaagg attcgacgcc 60 actgactcag cgtgatgcgc ttgtgtcggt ggacaaacct caaaggcttg cgtatgtata 120 gctgagaatg tgtaggcata cgaagtaggg agacagaaaa aatatccaag tataaacaaa 180 caacctgtga acacttccct gaattaaaat ccacttcatt tgcaaataat tataaaaaaa 240 cagtagtgaa ttagatcgtt tttgaattag tcggtagtgg gtttatcact gaagaatcag 300 caggtaaaaa tgaatttatt atcctagatt gtaaacttaa aactatatgt acttatacta 360 tgttgtaaat atcacaggtt ctatcacagt cggcaaagaa aataaaccac tagatttgta 420 gctaatcctg aggcagtgat ctgaaacaag aactatagga tagtcaaggt aacttacttt 480 agtaatacat tatcagaagt ttacaaaaca tgcaatattg aaatgcaggt atcaggcact 540 gcacccgcag caataccgta cagtaccgat tttcggaccc aacggaaatt aggaaataga 600 cactaaacgt aagtaaatgt aaaatttcga gatttaacta aagtgactct ggcaaagaat 660 tcgtcaagat gaataatcca taaatttgaa taattgtatc ccaacaggga aaatttaacg 720 tgccagccgt aataataaac gttgtaaatt gaattcggcg agtcgtttaa gtttttggaa 780 gttcgtctac cgtattgttg ccgattgcgg gaaca 815 // ID Gypsy-26_CQ-LTR repbase; DNA; INV; 239 BP. XX AC AAWU01010854; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_CQ_; KW Gypsy-26_CQ-I; Gypsy-26_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-239 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 432-432 (2011). XX DR GenBank; AAWU01010854; Positions 2198 1960. XX SQ Sequence 239 BP; 49 A; 59 C; 56 G; 75 T; 0 other; tgttgggttc ccacagaacc accagccgtt ccaacaattt cgcgactcgc tgtgctcctt 60 ggtagcaccc aggtcgagag aggcgcgatc ggagatgcaa ttgtactttt agttttctgc 120 atatcattac tgttagtcgc tgttacagaa taaacgtgtt ctatttgtta agcttaagtc 180 tagtttattc ttgcgacttt gtggccgctc gtttggttcg cgcccccggg tacacaaca 239 // ID Gypsy-219_AA-LTR repbase; DNA; INV; 208 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-219_AA_; KW Gypsy-219_AA-I; Gypsy-219_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-208 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1042-1042 (2011). XX DR [2] (Consensus) XX SQ Sequence 208 BP; 62 A; 48 C; 40 G; 58 T; 0 other; tgtcatatac ctgcgcattc taagcgccgt cgaactgtca ttcgatgcga agttcgaata 60 acgaacaagc gaaccgaagg gttcgatctc gaactataat aaatagacga ttagtctgat 120 ctgaaacttc aagcttatcg gtctacaaca tcacacgtcc aatcgctgtc ataaagttca 180 gagtatcttt taaggtcggt cttttcca 208 // ID CR1-28_BF repbase; DNA; INV; 3358 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-28_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-28_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3358 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3358 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1599-1599 (2009). XX DR [2] (Consensus) XX SQ Sequence 3358 BP; 953 A; 769 C; 575 G; 1061 T; 0 other; attacagaaa cgtggcttgg tagtacggtc gattctgaag ctatttcgct tgaaggtttt 60 cagtccccag ttcgccgtga ccgtaaccga cacggcggag gtgtcctaat atttgtatcc 120 gaccagatag cgtttagacg tcgctgcgac ctggaatcgc taactgatga gagtatctgg 180 ctcgaactat tccccggcac gttccgtatt ttgttctcag tttattaccg gccacctggt 240 caagactcag tcactgttga cagttttatc gattctctcc aaacatccgt ttttacagct 300 gccgcttcaa atgctgactc aattatcatt acaggtgatt tcaatgctaa gcattcggcg 360 tggtggcatc tcgactcaaa ttcttccctt ggctcgaaac tgctccaggc atctcagatg 420 ctaaatcttt cccaaatgat cacggaaccc acatgcgacc tatctcgatc accttcactc 480 cttgatttac tgtttactga tacgccaaat cttgttaaat ctacaagtgt tctctctcca 540 ctgtctggtt gccaccactg ccctgccatt gcgaactttg acttctttat caaacccgtc 600 cgtatgttcg cactctctgg gatttctcta aaattgacta cgccacactt gttgagtcct 660 ccacagattc gaaatggtta gatatctttt attgtaattc tgttgatgct gcctgcaaca 720 tgttaattga tcttatttcc gaaacaaaaa caaagtccgt ccctaagaga aatatcacga 780 taagacctcg tgacagacca tggatgacag cccgaataag agaacttatg cgtaaacgcg 840 acaaaagtca taaaaaggcc aaattatgta acagtccttc tctctgggcc tcttatcgta 900 agataaggaa taaacttgtt aacgagatag cttcagctaa attgaaccat aatactcgcg 960 taacaaactc actgttaaat tctcctacag gaagtaaaac atggtggcat ttagttaaaa 1020 ccatttatag atgcagtgcg acaaccacca ttcccccact taagtgtaac gatcattaca 1080 tatgtgattc aagcgaaaag gcttcagaat tcaacagata tttctccctc caatctacaa 1140 ttgatgatac taatgctaga ttaccgactt ttgactacct gaccaatgca cgtttgttta 1200 tctgtgaaac tactgccact gaggttgaac tatatgtctc cgagttggac gtctctaagg 1260 cgtacggctg ggataacatt gacaatcgtt tcttaaaaat tatctgtcca tacatatctg 1320 acaagatagc ctatgtcttt aacctatcga tttgtcatgg aatatttccc gaagtctgga 1380 agagagcgaa tgttgtgcct atctttaaaa aaggcgatcc ttgtttagtg tcaaactacc 1440 gccccgtttc gttattgcct accctctcta aaattcttga aaaaatagtt tataaacatc 1500 tgtacaacca tttaatgtct caaaatctgt tatactctta tcaatcagga tttatcaggg 1560 gggattcaac tgtaaatcaa cttgtctaca tctcaaacaa aattcttcat gcctttgacg 1620 aaaataaaga agttagagct gtctatttgg atttctcaaa agctttcgat aaagtttggc 1680 acaaaggtct actcttcaag cttcaacgaa atggggtgga aggtcctctt cttaactgga 1740 tacaaagtta tctgtttggt agaaaacaac gagtaacaat tgaaggtcaa aattctgatt 1800 ggagtgagat aagcgcaggt gtgccacaag gatcagtgct cggtccactt ttgtttctca 1860 tatacattaa tgacatgata gaaggccttg acacccattc ctttcttttc gcagatgata 1920 gttccctgct cgatgttgtt gaatccccac tcttatcagc aaatagactt aactcggatc 1980 tttctaagat atcatcctgg acaactatgt ggttgatgga actcaatcct tctaaaacca 2040 tcgaaatgtg ttattctaca aaggctaagc ccccgaccca cccaccactt ttccttggta 2100 gtgtacaaat caaatctgtt aattgtcata aacacatcgg ggtccatctt tcatctacta 2160 tgacatggaa tactcatata tctaaaatgc tagcaaatgc atctaaaaaa gtctcagtct 2220 tcagtaaact caaatttaag ctccctcgta aagttctaga aataatatac aaatcgttta 2280 tccgtcctct acttgaatat ggcggcgtag tttggcacgg ttgcactact tcagattcgg 2340 atcttattga acgtatgcaa tacgaatgtt ctttaactgt agctggtgcc atcagggggt 2400 cctcatactc ttctctacta actgaactgg gatgggaaaa actatccgac cgccgacaca 2460 ttcagactct tatcttattc tataaaatcg ttcatggcca tactcgacaa tatttaaatg 2520 atttgatacc acccttagta tcagactctt cttcctatga ccttcgtaat aaaaacgatc 2580 ttctgtcact aaaatgtacc acaacccgct ataagaaatc gtttattccc tatgcaacgt 2640 atcactggaa caaccttacc ctagcagatc gttccttaag tcttcctcaa tttaagaaaa 2700 aaatggtcaa atccgtgcgc cctgtctctg ttccatactt cagcataggt cctcgtttca 2760 cttgcgctat tctcactcgt ttccgtctag gtactcatag tttaaactcc aacttgtact 2820 ctcgcaatct tgttactagt agtgcttgcg cctgtggcca tcattgtgaa agtatttatc 2880 attatttctt gtactgccct aactacaccc aataccgtat ctccctcctc agcaatttgc 2940 ataagctggt aggatttgcc ataaacatgg acgatctctc ggatcatgat ctagtccatt 3000 taatgactcg agggtctgca tttctatcca actcaattaa cacaaaactt ctacagcatg 3060 tacaatcatt cataaaagaa acaaaaagat ttgaataagt gaatctctat cctaccagct 3120 tggcccggag tacctgtccg aatatccacc tatattcaaa tgtattttct tatgttctat 3180 gattttgtta tatggtgtta cttaacatta cttttgttat atgctttgct tgtctgtatt 3240 tgtttcaatg ttcttttgtg ctctcagttt gtttttgatg acggcatgaa aataagcatg 3300 ttgagcttga gtgtgccgcc atcatgttat gtatttatct gttgcaataa aaaaaaaa 3358 // ID BEL-232_AA-LTR repbase; DNA; INV; 302 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-232_AA_; KW BEL-232_AA-I; BEL-232_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-302 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 920-920 (2011). XX DR [1] (Consensus) XX SQ Sequence 302 BP; 36 A; 91 C; 83 G; 92 T; 0 other; tgagtgctga cggtggcctt caatctgtgg ccattccgtt ccgccgactg gattttacct 60 tggcccttct tctccggcag gtgagtgctg acggtggcct tcaatctgtg gccattccgt 120 tccgccgact ggattttacc ttggcccttc ttctccggca ggtgagtgct gacggtggcc 180 ttcaatctgt ggccattccg ttccgccgac tggattttac cttggccctt cttctccggc 240 aggtgagtgc tgacggtggc cttcaagctg tggccattcc gttccgccga ctggatttta 300 ca 302 // ID Gypsy-238_AA-LTR repbase; DNA; INV; 205 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-238_AA_; KW Gypsy-238_AA-I; Gypsy-238_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-205 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1078-1078 (2011). XX DR [1] (Consensus) XX SQ Sequence 205 BP; 66 A; 30 C; 45 G; 63 T; 1 other; tgtggcatct gagtttcatg attatatttg tatcagtgta atcataaggg acgagttgga 60 aaacggtatc gataggasag aagtgttgaa gtacgctgcc ggagtaaacg tgcttttaag 120 taactcgacc gacaaataat aaaaagttaa agtatatttc atatcgtgtt attaagtgca 180 ccattcgaag atctccgttt tcaca 205 // ID Copia6-NVi_I repbase; DNA; INV; 3936 BP. XX AC AAZX01000798; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia6-NVi; KW Copia6-NVi_I; Copia6-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3936 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1133-1133 (2007). XX DR Genome; AAZX01000798; Positions 15533 19468. XX CC Positions [1569-2099] - Integrase core CC 'GCATT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 144..2750 FT /product="Copia6-NVi_I_1p" FT /translation="MIQNIEKLSGADNYHDWAVLMKAYLEHEDVFDTIVAP FT TGEQISTDATKRRLARTKIILAVERVVLSQIQNLESPKDIWDTLKNAYAIE FT SLYRKTSLIIEYTRIRLEDCNSTEEYIDKIITIHQKLKNIGKDLGDDFAAC FT IMLEGLPSEYEPMRMALESLPEKEITTENVKTKLLMQVKWRQGANKSNEVG FT LYAYNQNKQGGGYSRGPNYRRNFGARPMNSAGIRCYRCNKYGHMSTDCSLP FT DKRNHPNRYQANHVTEENEDENNEDDDQIAAFAFVGLAAGNGASTDDWIVN FT SGASRHICNNSSIFAEIHKSDLTYITVANRSKVQVRGEGGVYLRLENGITV FT KLLNVLQVPDITVNLISVGQTNLKGYSTVFSSSGCKIMNQRGDLVLYAKLQ FT NGIYKVPRFSNPQDIDKQENNRRDRSVALVASAQPQDMELWHRRLAHLNYN FT SLNQLRDGAALGIHMKPGKAEQCEICALEKICKQPFKLNEKRANAALEIVH FT TDLCHVTPKSNGNATYILTFLDDYSRKAFVYFLKKKDEVYKSFVHFNNFVE FT NETGNKIKSLQSDNGTECVNSKMNQLMDQSGIKRRLSIPGTPQQNGRAERL FT NRTLLDKARCMILDAGVAKSFWAEAVAVAAYVHNRTPKRCTKYKTPQELWT FT GKIPDLSHIKVFGCKAIVHVPESKRGKMDAKGTRCIFIGYCPTRKAYRLWD FT PSAKGVVISRDVTFYEHGSLKPVVPDNINGTKPVFLIADDPDDNSEEEAYS FT VDQLKQVDTSMEESTENENPCHQPTGHEGLPAIPDQPHALDQDAPPDGQPG FT VKSQTEGIRRSSRVRRMPTKYTEYYLSRKRNSEAAHLADASDIPDKKSNKT FT IKRDIKPDLWLGVVNN" XX SQ Sequence 3936 BP; 1258 A; 880 C; 953 G; 845 T; 0 other; ggttatgggc tcagggtccc acagtgagaa acaagtgtac cgtatttctt tagtggaacc 60 attattggta gatcagcttt aaaccttcga ggagtctcag gtgcaacgac gaaatggctg 120 ttaataaagg gaaaaattag gccatgattc agaacatcga aaagctctcc ggagccgaca 180 actaccacga ttgggcagtt ttaatgaagg cctacttgga gcacgaggac gttttcgata 240 ccatagtagc cccaacgggt gaacaaataa gtacagatgc aacgaaacga agattagcaa 300 gaaccaaaat aattttagcg gttgagcgcg tagtactaag ccaaatacaa aacctagaat 360 cccctaaaga tatttgggac acattgaaaa atgcgtacgc aattgagagt ctgtaccgca 420 aaaccagctt aataatagaa tatacaagaa tccgtctaga ggactgcaat tccacagaag 480 aatacattga caagataatt accatacatc aaaaactgaa gaacatcggc aaggatctcg 540 gagatgactt cgccgcatgt attatgctcg agggcctacc gtccgaatat gaaccgatgc 600 gcatggcgct cgaaagcttg cccgagaaag aaatcacaac cgaaaacgtc aaaactaaac 660 tattgatgca agtaaaatgg agacaaggag caaacaagtc caacgaggta ggattatacg 720 cgtacaacca aaacaaacaa ggcggcggtt attctagagg acccaactat aggcgaaact 780 ttggagcgag gccaatgaac tcagcaggta tacgctgtta ccgatgtaat aaatatggac 840 acatgtccac tgattgctct ctaccagaca agcgaaacca cccaaaccgc taccaagcaa 900 atcacgtcac cgaggaaaac gaggatgaaa acaacgagga cgacgaccaa atcgcggctt 960 tcgccttcgt cgggcttgca gcaggaaatg gagcaagcac ggacgattgg atcgtcaact 1020 ccggtgcttc acggcacata tgcaataata gttcgatctt cgccgaaatc cataaatccg 1080 atttgacgta cattaccgtg gcaaatcgca gtaaagtgca agtcagagga gagggaggcg 1140 tataccttcg gctggagaac ggaataacgg ttaaattgct gaatgtcttg caggttcctg 1200 acattacagt aaacctcata tcggtcggtc aaacgaattt aaaaggatat agtactgttt 1260 tctcatcatc aggctgcaaa attatgaacc aaaggggaga cttagtgctg tatgccaagc 1320 tgcagaacgg tatatacaag gtaccgcgat tttcaaatcc tcaagacatc gacaaacaag 1380 aaaataaccg acgcgaccgg tcggtagctc tcgtggcttc agcgcagcca caagatatgg 1440 agctgtggca tagaagactc gcccacttaa attacaacag cctaaatcaa ttacgcgatg 1500 gagctgcgct cggaatacac atgaagcccg gcaaagcgga gcagtgcgag atatgcgcct 1560 tggaaaaaat ttgtaaacaa cccttcaagt tgaacgagaa acgagcgaac gctgcactgg 1620 aaatagttca cactgatttg tgccatgtca cccccaagtc gaacggcaat gcgacgtaca 1680 tactaacttt tctcgacgac tacagtcgaa aagcgttcgt ttacttccta aagaagaagg 1740 acgaagttta caaaagtttc gtccacttca ataatttcgt cgaaaatgaa accggaaaca 1800 agattaaatc gttgcagagc gacaacggaa ccgagtgtgt aaactcaaag atgaaccagc 1860 ttatggacca gtccggcatc aaacggcgac tcagtatccc gggcactccg caacaaaatg 1920 gcagggccga aagactaaac agaaccctgt tggacaaggc aagatgcatg attttggacg 1980 ctggtgtcgc aaaatctttt tgggctgagg ctgttgcggt ggcagcgtat gtccacaaca 2040 gaacaccaaa acgctgcacg aagtacaaaa ctcctcaaga attatggaca ggtaaaatac 2100 cagatttatc acatataaaa gtttttggct gcaaagccat agtccacgtt ccagaatcca 2160 agagaggcaa gatggacgct aagggcacga ggtgtatatt catcggatac tgcccaaccc 2220 ggaaagcata cagactgtgg gacccgtcag caaaaggagt ggtcataagc agggacgtca 2280 ccttctacga gcacggcagc ctcaaacccg tagttccaga caacatcaac ggaaccaagc 2340 cagtcttctt gatcgccgat gacccggatg ataacagtga ggaggaagcg tattctgttg 2400 accagctaaa gcaggtcgat acgagtatgg aggagagtac ggaaaacgag aacccatgcc 2460 atcagccgac tgggcatgag ggacttcccg cgataccgga ccaaccccat gcgctcgacc 2520 aggacgcgcc gcccgatggt cagccagggg tcaaatcgca aactgagggc attagaaggt 2580 cgtcgagggt cagacgaatg cccaccaaat acactgaata ctacctgagc agaaaaagaa 2640 attccgaagc tgctcatttg gcggacgcca gcgacatccc cgataagaaa tcgaacaaga 2700 caatcaaacg agatataaag ccagatttgt ggctaggggt tgtaaacaat tgaatggagt 2760 agatgtcacg gagacatatt ctccggttgt aagatataca accctaatat tactgttctc 2820 ttatgctgtt aggaaaggcc tggagatcga acacctggac gtcgagacgg cgtttttgca 2880 gggagaactc gaggagcagg tgtacataca gcagccggaa ggatacgtcg acccgcatca 2940 cccggacttg gtctgcagat taaagaaagc aatctatggc ctcaaacaaa gtggacgcgt 3000 ctggaacgta aaactgggta gcatcctaaa cgaaatgggc ctccaacgct gtgaatatga 3060 cccatgctta taccatctgc aaaggagcgg tgcggagctc aagatggctg tatttgtcga 3120 cgacttgctg gtgttcgcaa gttccagagc cctcatccag gaggtcactg cggagctgcg 3180 gaagcgcata acgctaagag atctcggcga catcacaaga tgcttcaacg tcaacgttac 3240 cagggataga gagagagagg aattctatct atggaccaac gggatagcat agtgcgcatc 3300 ctcagggact acggaatgga gggttgtaac ccatgtagca ttcccatgga cgcaggctcc 3360 tcattggtgg cagacgcatc cccgaagtca aagtctgaat tggatgagct gagtaagatt 3420 ccttaccaaa atgtaattgg atctctcatg tacctcctcc agatgaccag accggacctg 3480 gcctatacgg tgagcactat gagccgcttt aacacctgtt acggaatgga gcactggaag 3540 gtgctgaaga aaacaatata aatagtgtcg acaacaagga gatacagatc acccacgtat 3600 caactccgta tatggtggct gattcgctaa caaagccagt gccggctcct aagatgaaca 3660 actttaaatt aagtgtggga ctgtataatt aaaaatttat tgagattttt cctttgcgtg 3720 tttttataac gataacgacg atacctatgt atagcttgtt tttaaatgtg aactttaagt 3780 gagaaatgtt gtttgatgca gtgtaagtca ttagatcaaa aggtatagaa caatgttgtc 3840 tgcgaagtct tttgtatcgc gaacttaaga gtgtacatta gttgttcttg ccttttgtat 3900 tgttcaatgt aataagcgaa ttcaaacatg ggggcg 3936 // ID BEL3-LTR_AP repbase; DNA; INV; 104 BP. XX AC Contig25132; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL3AP; BEL3-I_AP; KW BEL3-LTR_AP. XX NM BEL3-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-104 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 434-434 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 104 BP; 22 A; 29 C; 26 G; 27 T; 0 other; tgttcgagca ccatcgacat ttgtaattgc tcactgttac gcgccactgt ggcgcatgat 60 ggtgtcagca ctgctaacag ttggtcagac cattaggccg ccca 104 // ID DNA-10_AAe repbase; DNA; INV; 389 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-10_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-389 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1265-1265 (2011). XX DR [2] (Consensus) XX CC ~92% identical to consensus. Present in >2500 copies in the CC genome. 8 bp TSD. XX SQ Sequence 389 BP; 133 A; 72 C; 78 G; 105 T; 1 other; cctgagagag aggagacagc tggcttagat tgcgagctcc caaaagtcan gggaacctgt 60 caccaactgt cactggaaaa ccggcacatt cgaagggcag agcgtgaaaa accgtcaaaa 120 ttcaagtaaa cgtaattaaa atttctaggt gtttttcgat gatttcacat tttaaaaagt 180 tgaataccta aatatagttg tgtgttggca aactgctatt gattttttta tattcatatc 240 ctttttcata gtgaacaaca agaagtgaat atgattttga ccaaaataag ttcatctgac 300 cgttgtgcac aacccaagaa ttaaagaaag aagtcaaaga aggcaagaaa acatgccaac 360 tgtcagtgag ctgtctcctc tctctcagg 389 // ID P-30_HM repbase; DNA; INV; 6273 BP. XX AC . XX DT 29-DEC-2008 (Rel. 13.12, Created) DT 29-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-30_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6273 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(12), 2083-2083 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 171..2900 FT /product="P-30_HM_1p" FT /translation="MPGANCSIYGCSTSRRHKGVSIFKIPASSDSFNTSWR FT KNLIAIILKDRIEDCNLKAQIESKSLFICEKHYLPEMILHNNNRKSLKPGS FT LPTMCLPKKGFMSSIPVQRESASSIIAKRDASMNSNNIITFKPIYKSYEEF FT TNRISKLKICEWTVSVSLVVAEFKLYDTIHLCPRYEIYVDDSLAFTIRVYS FT WLIPDDHSIYTKYRRSMRNISASNLISVVTSFSLCSGITNIFSLKSNLYIK FT HCISKKFDLFSSRTEFPLNQTEYFRSQSCIMLIDLMLSCSQCKSIEKSEII FT SLKKNITNACIPAKLNAPIKFTSPDRLKLTLQSVRLENKELKMELALMKQE FT IENSSLTISKDTESDLVKIMSGADKSKISPFMNLFWEEQQKYLKSSKQGTR FT YHPAIIRYCLGLAAKSPAAYEALRLNDKNNSGFLILPSRRRLRDYKNYVKP FT KLGFNKNIITELKDKIKDFSAQEKFIVLVLDEMKIQENLVWDKHSGELIGY FT VDLGDPSVNFATLQQADKLASHVLVFLIRSVVNPFKYSFANFATINITSVQ FT LFPLFWKAVGILEISVGLKVVAVTCDGASSNRKFFKMHFNLTEKEDRNGNV FT DVTYRTDNLFADDGDKRFIYFISDVPHLIKTARNSLSHSGAGKSNRYMWNQ FT GQHILWSHITDLFYEDLKCQLHACPKLTVEHIKITPFSAMNVRLAAQVLSN FT SCSTALEIYAPSDTSGTAQFCKYFNDFFDCLNVRNKNDYILKRNSFLKPFS FT DTNDVRFTWLKENFLKYLSDWFLSIESRPGKFSKSDRSNMFITWQTHEGLK FT ITVNSIVELVSFLINSNVSYVLTERFCQDPLENYFGQQRSIGKRKDNPSLF FT DFGFNDNTIRNQHIFHPIAGNCLNINESNRTTIKESMSRIPRRKWTYSQNS FT IDKKS*" FT CDS 6159..4963 FT /product="P-30_HM_2p" FT /translation="METDIELNSEFDIETLFFSLEDDFLFQDENFNKDLDA FT LVLELSIEPVKSTFSCTICSKKCISQRGLTRHTNSQHKGIPSDLTLQNPKP FT PILLSEFKSLLDKSITTLIDDECYPDLYRNELRINHAMLLETSQQTLSLLE FT TTIASFQANGDAEKFYPQFYKDINCDIFINYLSKKTTRLLGCELCNHVLAF FT LTNSFYNNSLINLNDLSSFELSDRDKEIILYLSGYVFSTFYCRVRSSRLWQ FT QKLSTDCLSLLNAGKIENFLKNPEPSTEFYHKFVNAKNRGGLWKVSPPVLE FT IFTKVEVLFRQQSNGFVKKIDSELIVTEILKNSVVMASISELRSLCTNQVS FT KELSLNLFYHLVTLYVRVRSFSYAKEKLSIHKLKENQTKSKSLRSKMKKTS FT IGFE*" XX SQ Sequence 6273 BP; 2183 A; 912 C; 1000 G; 2178 T; 0 other; catagtcata agaaatagat agtctggcca cttgcggtat ttctttgatc aactctttgt 60 ttgccttttt ttagtgtcca gtcccacgca gagtgcacaa agtttatttt cgtcgattgt 120 ttgtctatga attgcttgtt aaaatacatc ttcttcatcg aattaattaa atgcctggtg 180 ctaattgttc tatttatggg tgcagtactt ctaggaggca taagggagtt tcaatattta 240 aaattcctgc atctagcgat agttttaata cttcttggag gaaaaatcta attgccatta 300 tacttaaaga tcgtattgaa gattgtaatt tgaaagctca aatagaaagt aaatctctgt 360 ttatttgtga aaagcattat cttccagaaa tgatacttca taacaataac agaaagtcac 420 taaaacctgg atcattaccg actatgtgct taccgaaaaa aggttttatg tcatcgattc 480 ctgtacaaag agaatctgct tcttctatca ttgctaaaag agatgcttca atgaattcaa 540 ataatataat taccttcaag cctatatata aatcatatga agaatttaca aatcgtatta 600 gcaagttaaa gatttgtgag tggacagtga gtgtgagtct tgttgtggca gaatttaaac 660 tttatgacac aattcaccta tgtcctcgat atgaaatata tgttgatgac agtcttgctt 720 ttaccattcg tgtatattct tggctcattc cagatgatca ctcaatatat acaaaataca 780 gaagatcaat gcgaaatatt tcagcttcaa atcttatttc tgttgttact tcttttagcc 840 tttgctctgg aataactaat atattttctt taaaatcaaa tctttatatt aaacattgta 900 tttctaaaaa gttcgattta ttttcatcta ggactgagtt tcccctgaat cagacagagt 960 attttcgatc tcagtcatgt attatgctga ttgatttgat gttatcatgt tctcaatgca 1020 aaagcattga aaaaagtgaa atcataagtt tgaaaaaaaa tattactaat gcttgtatcc 1080 ctgctaaatt aaatgcacca attaaattta catctccaga tcgtctgaag cttacattac 1140 agagtgttcg cctggaaaat aaagaattaa aaatggaatt agctttgatg aagcaagaaa 1200 ttgaaaattc atcactaact ataagtaaag atactgaaag tgatttagtt aaaataatgt 1260 caggagctga taaaagcaag atttctccat ttatgaacct gttttgggag gagcagcaaa 1320 aatatttaaa aagctcaaaa caaggaactc gttatcatcc agcaataata cgttactgtt 1380 tgggattggc tgcaaaatca ccagctgctt atgaagcact aagattaaat gataaaaata 1440 attctggttt tttaatttta ccaagtcgta ggcgactacg tgactataaa aattatgtga 1500 aaccaaaatt aggatttaat aaaaatataa tcacagaatt aaaagataag atcaaagatt 1560 tttcagctca agaaaaattt attgtcctag ttcttgatga aatgaaaatt caagaaaatc 1620 ttgtttggga caaacattct ggagagttaa ttggttatgt tgatttaggt gatcctagtg 1680 ttaattttgc aacattgcaa caggcagaca agcttgcttc tcatgtatta gtttttttaa 1740 taagaagtgt tgtaaatcca ttcaaataca gttttgcaaa ctttgcaact ataaatatta 1800 catcagtgca attatttcct ctattttgga aagctgtagg cattttagaa atctctgttg 1860 gtttaaaagt agttgcagtt acttgcgatg gagcatcgtc aaatagaaaa ttttttaaaa 1920 tgcattttaa tctgactgaa aaagaagata gaaatggtaa tgttgatgtg acttaccgca 1980 ctgataattt atttgctgat gatggcgata aaagatttat atattttatt tctgatgtcc 2040 ctcatttaat aaaaacagca aggaacagcc tttctcattc tggagctggt aaaagcaatc 2100 gctatatgtg gaatcagggt caacatattt tatggtctca tataactgat ttattttatg 2160 aagatttaaa atgtcagttg catgcatgtc caaagctaac tgttgaacac atcaaaataa 2220 ctccattttc agctatgaat gttcgtttag ctgcacaagt tttaagtaat tcctgcagta 2280 ctgcattaga aatctatgcg ccatcagata cttcaggaac tgcacagttt tgcaaatatt 2340 ttaatgattt ttttgattgc ttaaatgttc gtaataaaaa tgattatata ttaaaacgca 2400 actcattttt gaaaccattt tctgatacaa atgatgttcg atttacatgg ctaaaagaaa 2460 attttttaaa atatctttct gattggtttt tatcaattga atcaagacct ggaaaatttt 2520 caaaaagtga tagaagtaat atgtttatta cttggcaaac ccatgaaggt ttaaaaatta 2580 cagttaattc tattgttgaa cttgtgagtt ttttaattaa tagcaatgta agttatgtat 2640 taactgaaag gttttgccag gatcctttag aaaattactt cggacaacaa cgatcaattg 2700 gtaaaagaaa agataatcct tccttgtttg attttgggtt caatgataat accattcgaa 2760 atcaacatat atttcaccca attgcaggta actgtctaaa tattaatgaa tcaaacagaa 2820 caacaataaa ggagagtatg agtcgtatac cacgcagaaa atggacttac tctcaaaatt 2880 ctatcgataa gaaaagttga tttttaatgt gatttgataa aaaactctct tatgtctcct 2940 cgtactgctg aaaacaaata aatgaatttg cataaaaacg gctattttca tagccacaaa 3000 ttttttaaaa acattaaatg gcatataaat aaaataaatt gttttttgtt tgttttactt 3060 ttgttacaat ataaacctat gctttttata gaacctctaa tttataaaga tttatataaa 3120 tattttacaa aacctttaat aactgcaatt ttatatggaa ccgtagttag gggtgaaatt 3180 cttaaagctt ttctattttt cataggtacc actgcaggcc aacacctctc ctccaaaacc 3240 tgtcattagg gccccacagt ttgaaaacat attttgaata ttagcaatga cattagctcc 3300 tcaagtttgt ttatgggctc tacaatttta tgggccctaa aattttctac ccactgatta 3360 agggcctggg ggagcctata tatatgtata tatatatata tatatatata tatatatata 3420 tatatatata tatatatata tatatatata tatatatata tatatatata tatatataag 3480 gggctgtcca ttaattacgt caacaaaata ggggggaggg gggtctgaag ttttttgaca 3540 gttgttgaca cgggggaggg ggaaggtcaa gtaaagttga cgtcaacaat gtttttgttt 3600 ttaaaaattt cattactcaa aaaacagaaa attaggcggt atttttttca tttgaggggg 3660 gggggaggat aatcaaaagt tgacgtaatt ttataggggg ggtctctaaa agtttgacaa 3720 attgttgaca agggggaggt aaaaatttca aaaaatttgt tgacgtaatt aatggacagc 3780 ccctaagaga gatagagttg tttctaggaa agtggggagg caacaggata attttgccct 3840 aggccctaat gattttaggg acatcaggca aaaatcaagg acattcttta tattttgaaa 3900 caccaaattt gcctcttact catcaataag gttgttgtgt agaggcaggg ctattataag 3960 gtataatacc ccctcctccc accccatcta gatttgtagc caacaagttg acaaatttgg 4020 atctagatgt caaatataaa aatcaagaac tttgttttta agttagaaga tatgcttatt 4080 acccccaccc ccaaaaaaaa attatttgca cacaaataaa tctggaacac atgaataagt 4140 aagaataaca aaaaaaactt ttactttgca cctatttatt tagatgtatg gtgtcataat 4200 cttgacccag ttgagctggg agaggcccat tttatatata atctcagata taatctgatt 4260 aatacaagag atgactagaa ttttgacatt tgggtaagat ttaataggcc aactaggata 4320 cttttttttt cttttagggg actaacttaa aaaattgatt ttgcctagct aattatataa 4380 aatattttgc ccctgattat ataaatctgg cacaaatata cacactaaat aaacacacac 4440 acacacacac acacacacac acacacatat atatatatat atatatatat atatatatat 4500 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 4560 atagcatttg ttgtaggttg ttcaaaccga ggtaggaggt agtatcacta gcgatagtat 4620 cacaataaaa ttatatatat attatatcag gagtctatta gtcagtcaga gcagaaagtt 4680 aacatgaaag tttttgttta ttataaattt aaaactatta tgtgtatatt ataattttgt 4740 actactattt cataagagtc ctttggggtt aaggtcctaa attccttatc aatcacttat 4800 tgacacaaac aaatcaaaac atttgacaca aaaaaaaatt ctttgaaaaa aaaaatattt 4860 tttatgaaaa ataaataaca gagttactat gccaattatg cttttatgaa atttgttgct 4920 atgaaaatag cagtttttga atgtgatatt tcattaaact atttattcga atccaataga 4980 tgtttttttc atttttgaac ggagtgattt tgactttgtt tgattttctt taagtttgtg 5040 gatacttagt ttttcttttg catatgaaaa agatcttact ctaacataaa gtgtaactag 5100 atgataaaac aagtttaaag acagttcttt ggatacttga tttgtgcaaa gactacgaag 5160 ttcagatatg cttgccataa caacactgtt ttttaaaatt tctgttacaa tcagttctga 5220 atcaattttt tttacaaaac cattactttg ttggcgaaag agaacctcga ctttggtaaa 5280 gatctctaaa actggaggag acactttcca caaaccaccg cgattttttg catttacaaa 5340 tttatggtaa aactcagtgg atggttcagg attctttaaa aaattttcaa ttttaccagc 5400 attaagaagt gaaaggcagt ctgtgcttaa cttttgctgc cataaccgtg aactccgaac 5460 tctgcaataa aatgtactga atacatagcc acttaaatac aagattattt ctttatctcg 5520 atctgaaagt tcaaaagatg ataaatcatt taaatttatt aaagaattat tgtaaaatga 5580 gtttgtaaga aacgctaaaa catgattgca tagttcacat ccaagaagtc tagttgtttt 5640 tttacttaga tagtttatga aaatatcaca attaatatct ttgtaaaatt gaggataaaa 5700 cttttctgca tcaccattag cttgaaaact ggctattgtt gtctctaata aagatagggt 5760 ttgttgtgaa gtttctaata gcattgcatg atttattctc aactcgtttc tatacaaatc 5820 aggatagcat tcgtcgtcaa ttaaagtagt aatgctttta tcaagcagac ttttaaactc 5880 gctaagaaga atagggggtt taggattttg taatgttaag tccgaaggaa tacctttatg 5940 ctgtgagttt gtatgccttg taaggcctct ttgagatatg catttcttag agcaaattgt 6000 acaagaaaat gtgcttttaa ctggttcaat ggacaactcc aacaccaagg cgtctaaatc 6060 tttattaaaa ttttcgtctt gaaagaggaa atcgtcctcc aaagaaaaaa atagtgtttc 6120 aatatcaaat tcgctattta actctatgtc cgtttccata ttttgtgcac tctgttcctg 6180 gccgttcgac agttttcaag tgtttttaac gaatattaga aaaatcaaat aatacggccc 6240 gagtgaccag actatctatt tcttatgact atg 6273 // ID Kiri-1_AAe repbase; DNA; INV; 4510 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 06-JAN-2011 (Rel. 16.02, Last updated, Version 3) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; L2_Ele1; KW Kiri-1_AAe. XX NM Kiritsubo-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4510 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4510 RA Kojima K.K. and Jurka J.; RT "A distinct group of non-LTR retrotransposons from the yellow RT fever mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as L2_Ele1. CC [2] Consensus update and re-classification. This consensus is CC generated from 8 sequences with >97% identity, and ~99% identical CC to the original sequence in [1]. This family does not belong to CC the L2 clade and is renamed as Kiri. It could constitute a new CC clade with other Kiri elements. XX FH Key Location/Qualifiers FT CDS 166..984 FT /product="Kiri-1_AAe_1p" FT /translation="MHQNKHITRSVSTSSVNDGNITKRPRDETXSPAGVED FT ENRDLXNKIQXMFETSNSKIEAKIDASISRLEQRISGVEKQFATFQSECTD FT NINKLATAVTEVRVGLKTTTQRIDMLEKSNDLLISGLPYVVNENLHQLFRN FT IAASLAYADSDLPLVDLKRLMRPPITAGSSPPIVCQFAFRNAREEFYKRYL FT KTRNMTLRTVGFDSDQRVYVNESLTLQARAIRTEAIKMKKNGILLKVFTRS FT GIVYVQRXEGSAAEPINDISHLRAGFSSNLSX" FT CDS 1528..4353 FT /product="Kiri-1_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDEFPLNNASPNTIIPRIVLNAALQPEFLNICHMNVQ FT SLTARNFSKFHELKLNFTDSKVDVICFTETWLNSTINNSMLSINGYKLIRN FT DRNRHGGGICIYLRNNLSYKVVSMSNNSADDFCGTEYLNIEVKVGEDKILL FT GVVYNPPNIDCTDTLYDVLENCTIKYKGSYIIGDFNTDLLKHFSRSRKFIE FT MLXTLSFACTNSEPTFYHRSGCSLLDLFITDAPDTVVKQDQLSMPGVSHHD FT LIFCSLRYSPNRQKDYVSYRDYANFDSVALFDGFHSINWNLFFNMDNPDEL FT LEYLNYHLLKLHDQYIPLRRKKDNKTPWFNNDISHAIINRNLAYNKWKRTK FT EQIDRNSYKRLRNKANELIIQAKAKYDKQKLKVDLPSKQLWNNIKELGLSK FT DSRFLEQNDLTADEINNYFISNFTNDDSTPFRNLSTDGFKFLEFLEHDIIN FT CIFSVKSNAVGLDNIPIRFIKILLPLALPIYKYLFDSIIKTSIFPRAWKNS FT KIIPIKKKGNCSAXSNLRPISILCALSKVFEKLIKCQINKFISDNDFLHPH FT QSGFRSNHNTNSALIKVHDDISHVIDKKGLAILLLIDFAKAFDRVSHSKLV FT KKLSTKFSFSQTAAELIQSYLSGRQQAVFHNGLLSTFSEIKSGVPQGSVLG FT PLLFSLFINDLPAVLEYCSIHMFADDVQIYICISGEVDMNEIARKINHDLV FT NISHWSRNNLLEINPSKTKAMLISKRKNPPNPPIISMDGQIIQFFDRLNNL FT GVIFTTNLSWAAFINGQCGKIYGTLKRLNMISRHLDTSMKIKLFKSLILPH FT FTYGEFVYSHASALSLNKLRVALNACIRYVFNLNRFSHVTHLRDVLLGCQF FT SQFYTLRTCVQIYKIIKTNSPRYLYSKLQFLRNTRTLSLVIPHHSSAYYGQ FT SLFVRGIHNWNCLSPLLKSSSSILVFRKGLLSEIERMR" XX SQ Sequence 4510 BP; 1408 A; 904 C; 777 G; 1410 T; 11 other; tcatttttgt acctcattgg tggccaaaat aactttcgat aaagtttttg tgaagtgact 60 agccgatatc ttcatctgac atcaattccc ttcagaagtt tcagtctttt tcgagcaact 120 acgcacgtat acattgtagc tcatttktga tcgccaacaa aagaaatgca tcaaaacaaa 180 cacatcacac gttccgtgtc aacatcatcg gtaaatgatg ggaatattac taagcgaccc 240 agagacgaaa ctkttagtcc cgcaggagta gaagatgaaa atcgcgactt gmtaaataaa 300 atccagcawa tgttcgagac ctcaaactct aaaattgagg ccaagatcga tgccagtata 360 tcgaggctgg agcagagaat ctctggtgtg gaaaaacaat ttgcgacatt ccaatcagag 420 tgcaccgata acatcaacaa actagctact gcagtcactg aggttcgggt tggacttaaa 480 actacgacgc aacgtattga catgttagaa aaatccaacg atctactcat ttctggactg 540 ccttatgttg tcaacgaaaa cctacatcaa ttattccgaa acattgctgc tagcctagca 600 tacgcagatt cagatttgcc cctagtggac ctcaagagac tgatgcgccc tcccatcaca 660 gctggatcct ctccaccgat tgtctgccag ttcgccttca gaaatgccag agaagaattc 720 tacaagcggt acctcaaaac aagaaacatg acccttcgca ccgttggatt cgacagcgac 780 cagagagttt acgtcaacga gagcctaaca ctgcaagcca gagccatacg aacggaagca 840 atcaaaatga aaaagaatgg aattctgctt aaggttttca cgcgtagtgg gatagtctat 900 gttcagcgta gwgaaggatc tgcagctgag ccgatcaacg acatcagtca tcttagagct 960 ggattctcat caaacctttc cwwatagttt ctcttctcct tcctgaactc ttccatgcag 1020 ccttccttaa actagttccg tgcttccatc cgmtcctgaa agttaacttt aaaccgttga 1080 tcaaatgaac ctwtccaaaa ttcgctctct ctcctgccta agctcatccg tgcttccctc 1140 cttaatttat cccatgactc catccactcc tgaaagtcag ctgttgaaaa accacggacc 1200 ccggattttg ctgttgctgt tgctgttgtt ggagctacct tgttgctgtg gctgttgctg 1260 ttattgttaa ttactgctat tatcattcgc aatacttgtt acgctgatgg tggttgagat 1320 agttatagtt ttggctttgt taaaatcgaa atttgagttt tctttttctt ctcatctaca 1380 ttatcaactg aacttagacc attgatgtat ttgtaatgta caagttagtg tttaaagatt 1440 tcttttcgtt tgaccttgac attgcatacc gcttcggttt catgagtgat ctttttctga 1500 ggcaaatctt aatcaactgc tttgataatg gatgaatttc cgcttaataa tgcgtctcca 1560 aacactatta ttcctcgaat cgtacttaat gctgctttgc aacctgaatt tctaaacatc 1620 tgtcacatga acgtccaaag cttaaccgct cgaaattttt caaaattcca tgaactgaag 1680 ttaaacttta ctgacagcaa ggtcgatgta atttgtttta cggaaacatg gctgaatagc 1740 accattaata actcaatgct aagtatcaat ggctataagc tcatacgcaa cgatagaaat 1800 aggcatggtg ggggtatttg tatctatctg agaaataatt tatcgtacaa ggtcgtgtca 1860 atgtcaaata attccgctga tgatttctgt ggcaccgagt atttgaatat tgaggttaaa 1920 gttggagaag acaaaatatt gctgggcgtc gtttataatc ctcccaatat tgattgcact 1980 gatactttgt acgatgtttt agaaaattgt acgattaagt acaaaggatc ttacattatt 2040 ggggatttca atacagattt actaaaacat tttagtcgat cacgtaagtt cattgaaatg 2100 ttagawacct tgtcgtttgc atgtactaat tctgagccaa cattttacca tcgatcagga 2160 tgctccctac tagatctttt cataactgat gcaccagata ctgtcgtgaa acaggatcag 2220 ctctctatgc ccggtgtttc tcaccacgac ttgatttttt gttcacttcg gtactcacca 2280 aacaggcaaa aagactatgt tagctaccgt gactatgcta attttgactc tgttgcgtta 2340 tttgatggtt tccatagtat taactggaat ttgtttttca acatggataa ccctgatgaa 2400 ttgctcgagt atttgaatta tcacttgctc aaacttcatg accagtatat acccctgcgg 2460 cgtaagaaag ataataaaac cccatggttc aataatgata tttctcatgc gattattaat 2520 agaaacctag catacaataa atggaaaagg accaaagaac aaattgaccg aaattcttac 2580 aaaagactta gaaataaagc taatgaactg attatacagg caaaagctaa atatgataaa 2640 caaaagttga aagttgacct accgagtaaa caactatgga ataatataaa agaattagga 2700 ttatctaaag attcccggtt tcttgaacag aacgatttga ctgcggatga aattaataat 2760 tacttcatat caaatttcac taacgatgat tctactcctt ttcgaaatct ctctactgat 2820 ggttttaaat ttctagaatt tcttgaacac gatattatta actgtatttt ttcagtcaag 2880 tccaatgcag ttggattaga taatattcct atacggttta ttaagatttt attacctctg 2940 gctctgccaa tttataagta cttatttgac tccataatta aaacttctat ttttcctcgg 3000 gcttggaaaa actcaaaaat aattccaatc aaaaagaaag gaaactgttc agcaakctca 3060 aatttgcgcc cgatcagcat tctttgtgcc ttgtcaaaag tgttcgaaaa gttgattaaa 3120 tgtcaaatta acaaatttat ttccgacaac gatttcttgc atccacatca atctggattc 3180 cgtagtaatc ataacacgaa ctctgcgtta atcaaggtac atgacgatat ttcccacgta 3240 attgataaga aaggattagc catcttgctc ctcatcgact tcgcaaaagc gtttgatcgt 3300 gtttcacact caaaactagt gaaaaaatta tctactaagt ttagtttttc ccaaaccgca 3360 gcagaactga ttcagtccta tttgagtgga aggcagcaag ctgttttcca taatggactc 3420 ctttctacat ttagtgaaat aaaatctgga gtaccgcagg ggtctgtact tgggccttta 3480 ttattttcac tgttcataaa tgacttgcct gctgttcttg aatattgctc catacacatg 3540 ttcgcggatg atgtgcaaat ttatatatgt atttccggtg aagttgatat gaacgaaatc 3600 gccaggaaaa taaatcacga tctggtgaat atttcacatt ggtcacgtaa taacttactt 3660 gagataaatc catctaaaac taaggctatg cttataagca agcgtaaaaa tcctcccaac 3720 cctccaatca tctctatgga cggacagata atacagtttt tcgatcgact aaataatctt 3780 ggtgtgatat ttaccacaaa cttatcatgg gctgctttta ttaatggtca atgtggtaaa 3840 atctatggaa cacttaaaag actcaacatg attagcagac accttgacac atctatgaaa 3900 attaagctat ttaagtcgct aatactacct cattttactt acggggagtt cgtgtatagt 3960 catgcttcag ctttgtcatt aaataagtta cgagttgccc taaatgcgtg tatcagatac 4020 gtttttaacc ttaataggtt ctcacatgtt acccatttac gagatgttct tcttgggtgc 4080 cagtttagcc agttttacac tctcagaact tgcgtacaaa tatataaaat tattaaaact 4140 aattcacctc gctatctata cagtaaactt cagttcttga gaaacacaag gacattaagt 4200 ctagtcatac ctcaccattc ttcggcatat tatggccaat ctttgtttgt gagaggcatt 4260 cataattgga attgtttatc tccattactc aaatcttcgt cctcaatctt agtcttcaga 4320 aagggtctac tcagtgagat cgaacgaatg cggtgaaaag gtcaaatagt caaattcagg 4380 aacttaaatt agatgtagag ttaaaattgt tcattaataa actaaataca ccacagtgtg 4440 acattttaaa aaggattatc cttacgttac atggattgaa ataaaaataa ataaataaat 4500 aaataaataa 4510 // ID DNA-4_PPac repbase; DNA; INV; 796 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Non-autonomous DNA transposon from the Pristionchus pacificus DE genome. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-4_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-796 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 956-956 (2010). XX DR [1] (Consensus) XX CC ~96% identical to consensus. XX SQ Sequence 796 BP; 216 A; 185 C; 200 G; 192 T; 3 other; taggggccac aactcagatt gccgaattat cactatccag tcaaaaccgg gtggaaactg 60 tccctcatgg tcccaataac atcatacttt tggccgtttg tcgatatctc gtctccgagc 120 tgcgctacag cccgaaaagt acgcgcgaaa cgcactgttg tggagagggg aaaggcgaga 180 cgagagggaa agggggcgga gcctatttcc gactcattcc caaaattcgg aggcaagagc 240 ttcgccatgt cgtgagagac aantcaaggg agaaattacc tcttanccgg gcagagcgtg 300 cgctagtcac gacgggttag aaacagcagg gagaggcgag gcggagggag aggggaaggg 360 gtgcgcctct cgcgtctctc ctctcatttg actagccccg cccacgttca cactgtgttt 420 agcgctaatc ggaatcgaaa aaatacgggc gtactgaaat ctttaaaaca aatccgcaag 480 cttttgaagc atgctaaaat tctatttttt tcactaaact tcctaaaata ttcgtattcg 540 tcaggaaatt tatgcccaaa caanatgttc acactgcaac ccgtcaaatc gacctattcc 600 catcgttaaa gttcaatttt gacctactca atgaagatgc gcacattttg ggcagtgctc 660 aatcgattgg gaatgccagt agctatccag cgcactttag aatgaaggat ttggtttaga 720 atgaagcttg cggattcgat tctcgttttt tggcgaaatc atgtcagacg aaagtgggcg 780 gggcttgtgg ccccta 796 // ID BEL-20_CQ-LTR repbase; DNA; INV; 481 BP. XX AC AAWU01039777; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-20_CQ_; KW BEL-20_CQ-I; BEL-20_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-481 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 194-194 (2011). XX DR Genome; AAWU01039777; Positions 16163 15683. XX SQ Sequence 481 BP; 167 A; 95 C; 84 G; 135 T; 0 other; tgttgcgagc acaccgtgcc gactgtgctg actgtgccga ccgtacccct cgaatcattg 60 gccaaacgcg acaaactcaa aaagactata gaatagaacc gtcacgctga cacagtagtt 120 gaccgtagac gtagagtaga gatagagaag aattttgttt tcccattatc agtcctcaat 180 cagataccta aggaattaaa ctgattgccc ttttggtaag aaatttattg aattatttgt 240 gcttattttg ctaattaaaa ttatcttaga ttaaggaaat taaaacgctc taagaagcct 300 acaatttcaa aagttgtctg ttgcgatttg aattaaacta ggagactaaa ttgtaagtaa 360 cccgaccgaa catttgtgaa aattattaca attaaatcct aattgcagct ttcagctata 420 acaccatcta aaaaaaggag ttttgctaaa aagaccctcc gaaacatcaa tcgtcgcaac 480 a 481 // ID Chapaev-N3_AAe repbase; DNA; INV; 2429 BP. XX AC . XX DT 12-OCT-2010 (Rel. 15.1, Created) DT 12-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous Chapaev DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; Nonautonomous; KW m4bp_Ele14b; Chapaev-N3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-2429 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2429 RA Kojima K.K. and Jurka J.; RT "Chapaev-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (13-OCT-2010). XX DR [2] (Consensus) XX CC [1] Named as m4bp_Ele14b. CC [2] Consensus update and characterization as non-autonomous CC Chapaev. ~98% identical to consensus. This consensus is ~99% CC identical to the original sequence in [1]. 4-bp TSDs. TIRs are CC ~140 bp long and ~85% identical to those of Chapaev-1_AA. XX SQ Sequence 2429 BP; 769 A; 438 C; 428 G; 794 T; 0 other; cacggtgtct ttgtggagta actttattac aaaaaaatag gtaagtctga aatttcgaac 60 taaaaacaat tagactttgc tctttcattt gcgaccaaaa tcaaaataat cggtcggggg 120 gtccagaaca ttttttttta ttttgtgtga agtattcatg gatatgtgtt tttgtaccga 180 cacacattgc gttgaacata gatacgggac aaactgcaag atcaaaccat ttgaacacct 240 tggcttggaa ggtaaagttg ttcttgctca aaagagctgt aataagcata tatgacatat 300 gatatacgca taattgcaaa actgtctcaa acgtcatttt agttattgtc cacgaaaaac 360 cttgaaaatc gtacttaaat tggtcataat gatgtaagaa gttattagga aatattttta 420 ataggtgaag atgcatcgat atcaaacctc gaattttcaa gagcttgttt gtagatttgt 480 atcttgagaa cctgacaacc ttttgcgttg aacattttat tgattagtcg ccaccagcga 540 gtgaccagtg agttaccggt tgtctgcttc tctagacttg tgcttttgaa aattcgaggt 600 ttggcttcta tgtatattca cccttaaaac cattgataac cgagtgtgta gctcgttgtt 660 attaagcaga taaacggacc aaaataaaaa ctgtcaccgc tgggagacag aggaggatct 720 tctcttcgtc gtagcattgt gaaataaagt gtaaatccat tcgtttgctc agtaataaca 780 agctatggct tttaggacga gatcataaat tatgcgagat ttctcttttt actcgtaggg 840 gatggtacac aaattatgtc acgctaaatt tcaatttttt cgaccccccc ccccttgtca 900 cactttctgt atgagtcctc cgataatttt gtaaggcttg tcacgcatgg cttaaccccc 960 cccccccctt ggagcgtgac gtaatttgtg catgacccct agatacaaaa gtgacttatc 1020 cgcctttcaa acaacacaat ttcagttatt cgaattaact agtagagggc ttcgcatgtg 1080 ataaaccttg ttataaggca tgtaatatac ttgccccatg gtgctgaaaa atacgctaat 1140 ttggatggta tttgagaagt ttcaaatctt atatttttta tgttattttc ttaatggcac 1200 agtgcttatg aaaagatcag aaactaacat catgaggcta aaatgggttt gggatttttg 1260 tacttgtttg cagtactata accccatatt aacccctcta ccggcagctt cattttttca 1320 aaatattcaa atcgcgataa cttttttgtt tctcaatatt tttgaaaaaa aattccacaa 1380 cttctcaaaa aactctctta ttttcagaat ctgtatcggt tttgagcatt ggtcatctat 1440 attcggagat attccaaaat tccttggggg accgacgcat agccataact ctcgtaatta 1500 tctctggcta cagatttttt ctcatattcg gttattcgct tttcaaaact aaactttgat 1560 gaaagtctat tgtgaagaaa ttaaacaaat cggtgtaact gtttttgagc aatgagcttt 1620 tatgtttttg gtaccactct agccgtaacg agatcttgaa aacttcttaa aacatcatat 1680 tgaaatttta tatgggaagt gtcggtcccc caaggaacac cgaaatattt ttaaaaccag 1740 ttgaccaatg gtcaataccg acatagattc caaaagtaga agagtttttt gagaagttgt 1800 gaaaaaatgg tggaaaaata tagagaaaca aaaaagttat cgcgattcaa atatattttt 1860 gtgctgaaaa aatgaagctg ccggtagagg ggttaagcta aataatagca aacaaactca 1920 tgaatatctc ttctgctatc tgtcgaattg aatttctctc ttcagcaaat ttctttatta 1980 tggtttgaag aatgagcttt tacaatggga taataagttt gtacttacat tacagtacaa 2040 gcgtgtttgg aacatacctc aaaatttctt catagtaatt tccatataaa cctacattgt 2100 ctttgggcca ccttaaatca ccacctagaa agctgcgaat ttcacagaac actattttta 2160 tagtaaggat agtaaggaac atatgtgaga ttgaattctc aaacattttt taattttgaa 2220 ccgccctaat gtacataaac acatagatag gcaaattcaa acatatgtgc aagtctctcg 2280 aaatcaaaca aaaaaaaata ctctggaccc cccgaccgat tattttggtt ttagcagcaa 2340 atgaacgcga gaaacctaga ctttattgtg gagaagtttc agacttacct atttttttgt 2400 aataaacttg ttctacgaaa cacaccgtg 2429 // ID BEL2_MH-LTR repbase; DNA; INV; 1417 BP. XX AC ABLG01001008; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_MH; KW BEL2_MH-I; BEL2_MH-LTR. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-1417 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1519-1519 (2009). XX DR Genome; ABLG01001008; Positions 2621 4037. XX SQ Sequence 1417 BP; 461 A; 234 C; 277 G; 445 T; 0 other; tggcttaaaa aatgtgaaaa aattacagaa tatataagag agtttatgac cgaagaacag 60 gcaccagata gtgggatagg cccaggattg gtcgtaaaca aaccgtgggg acaagaattt 120 taccctataa attgtgtatt tttaacaaat atccccttca ggagggagca aatcctgacg 180 aaattaattt tcagttgagt gatcaagaag aatccaatgg agatcatgaa gaaagtccaa 240 agaccactgt cgaagtgaat gtggagaatt gtgctggaga tttaaaaaag gtaaaaattg 300 attttaatga aaataaaaat tggaccattc aggcagaagt taaacctgat atccattgtg 360 atgcccaagg gaaagaggca aagacatttg aagtaaaatt gaagggaaag gtcttaaagg 420 gagcaactag taccttgttg attattgtat tgtttgctct tcttattggt tcgtgtttgg 480 ccattcccgg aaaggatctc cctcgggaga aacaagaagg agattcagat agctcccgag 540 gagagtctat tttagatgga tgatatggaa aaaatgagaa atatgatgcg tgaaacgctt 600 caggacttta tctcacaaca accctccact tcttctgctt ctgcgcctac aacatcagct 660 gctactccta aaagatctaa accttacgtt attcctaaaa tacaacaact tcaatccaac 720 acttcaattt tgcaacaaca attagagcaa ttatataata atgctggtgg tggaccaata 780 agacacctaa aggttaatcg gagatgttat tcttgtaata aggtgggtca cacagcaatt 840 gcctgcccta cagcagcaca caatatagac gacagttcgt ctacacgctc tttgacaact 900 tccatccctt caataaatca gggatctttt cgttctgcta gccacgaact tgcagaactt 960 gctcgtcgat ttagtcattt tccaccaatc atgcagctcg ttatgtcatt ggcaatcttc 1020 ttcaaacaga cagcagacaa tatgcgcaag tgagataagt cctgcggcgg agtgtcaaaa 1080 tgacgaattt gatttaaaga atttattgat aattttttgt gatattttgt gagtcactaa 1140 agttttggtt aattttgtcc ttttttatga tgtttttatt tcatcatgct tatttttgga 1200 aatttgctgg ctataaaagc tagtgaaatt tctattattt ttcaatgttt tcgtgaacat 1260 tcgatttgaa gttaatatat ctgtgtgata aatatttgaa gttatttgtg tttaaatttt 1320 tgtttcttga attaattagt cttagggaaa ttaattgggg acattttatt ataccctatt 1380 tctcggaata ggaaggtaaa gtctataaaa ctttaca 1417 // ID MuDR4x_AP repbase; DNA; INV; 2343 BP. XX AC Contig42151; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR4x_AP. XX NM MuDR4x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2343 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1353-1353 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(543..1448,1472..2002) FT /product="MuDR4x_AP_1p" FT /translation="MNFTIVSSERKNDLILYNDYKYYKKDYVKYLDAFKWQ FT CVKKKCTAKMYVNKATNKKLKDESIHNINHKEETTSILKKSFSNGLKRKSE FT DELERPSKIINRGILKNPEVTQTFTSTDINNIYQCLYRSRRSSYPKLPTNL FT NEVIDVMSERQIKTVQDENFVLVVNSDNKIICFSTKSNLKLLCSVDKIFVD FT GTFTYCAKYFLQLFTIHIFKNGHYVPLVYFLLPDKCKDTYARSFGYVIEKF FT SEINLSFSPSKIVSDFEEAIHIGAKTIWPEIQIVGCRFHLTQNWWKHIQNL FT GKYKFEYSYKFIYFLGLSKEYKDESSEIGNWLKLVFGLPLLNSEEVSDCFA FT TDLMSIAPDDERVGKFCDYLVEYYIDEGAKFNPHIWASREITSERTTNSCE FT SYHSKFNSQFTKAHPNIFIFTHVLNTKIQTDTYMLINGININTISKNSAFN FT KKKQNIKNLSNDLNDNKISKFTYLKHVSKYYQK*" XX SQ Sequence 2343 BP; 912 A; 301 C; 294 G; 836 T; 0 other; gtctaaattg tataaaatgt taataataat tgtataattc aaaacttaaa tgtatatatt 60 ctgattaata tgtgtatgtg aatatattat aaaatattaa aatgtatagt ctaaattgta 120 taaaatgtat aaaatgataa taattgtagg tattataata aaatgtatag tctacaatta 180 tacgcggtgt atgtgaatat tttataaaac gtatacctac ctagtctaaa ttgtattagt 240 gtacgtattg ctataatata ttaggtacgc tacatataaa atgtcaatta tatgattctg 300 cctgaaatct tatctttaag atacgatata tcttgatatc tatatgacct atatttatat 360 gatattttat atagttaaaa tgtggacttc ccaatcttcc catgttactt ctcgcctagt 420 tactaatcag tatcgatcca gttcgtggat tctcatcgat tctccatcgc gtgcatagag 480 tcgcacattc gtattattat atttcttaca gctttattac ttattagtta taattaatta 540 aaatgaactt tactatagtt tctagtgaaa gaaaaaatga tttgatttta tataatgatt 600 ataaatatta taaaaaagac tatgtcaagt atttagatgc ctttaaatgg cagtgtgtaa 660 aaaaaaagtg tactgcaaaa atgtatgtga ataaagcaac aaataaaaaa ttgaaagatg 720 aaagtattca taatatcaac cataaagaag aaactacaag tattttaaaa aaatcatttt 780 caaatggact taagcggaaa tcggaagacg aattggaaag gccatcaaaa attattaata 840 gaggcatttt aaagaatcca gaagtgacac aaacttttac aagtacagat attaataata 900 tttatcaatg tttataccgt tcgcgtagat cgtcatatcc taaactccca acaaatctta 960 atgaagtaat cgatgtaatg agcgagcgtc aaattaaaac cgtccaagat gaaaactttg 1020 ttcttgttgt taattcagat aacaaaatta tttgtttttc aacaaaatct aacttaaaac 1080 tattatgttc agtagacaaa atatttgtgg atggaacttt tacatactgt gcaaaatatt 1140 ttctccagct ttttaccatc cacattttta agaacggaca ttatgttcca cttgtgtatt 1200 ttttactacc cgataaatgt aaagacacat atgcacgatc atttggttat gttattgaaa 1260 aattttcaga aataaattta tcgttcagtc catcgaaaat agtatccgat ttcgaagaag 1320 caatacatat tggagcaaaa acaatttggc cagaaataca aattgttgga tgccgatttc 1380 atttgacgca aaattggtgg aaacacattc aaaatcttgg taaatataaa tttgaatact 1440 catacaaata aaaattaatg aattgtatta atttatttat tttttaggtt tatcaaaaga 1500 atataaagat gaatcaagtg aaataggtaa ttggctaaaa ttggtattcg gcttaccatt 1560 gttaaattcg gaagaagtaa gcgattgttt tgcgacagat ttgatgtcaa ttgctccaga 1620 tgatgaaagg gtaggaaagt tctgcgatta tttggtggaa tattatattg atgaaggagc 1680 aaaattcaat ccacacattt gggcatcaag ggaaattaca tcagaacgta ctactaatag 1740 ctgtgagtct taccactcaa aatttaattc acaattcaca aaagctcatc ccaacatttt 1800 catatttacc catgttttaa atacaaaaat acaaactgac acttacatgc ttataaatgg 1860 aataaatatc aatacaatca gtaaaaatag tgcttttaac aaaaaaaaac aaaatattaa 1920 aaatttatca aatgatttga acgataataa aatttccaag ttcacttatt taaaacatgt 1980 gtccaaatat taccaaaagt agttacctat atttatatta atacatttaa gttttgaatt 2040 atgcaattat taacatttat cattttatac aattattatt aacattttat acaatttaga 2100 ctatacattt tactatttta taatatattc acatacacat attaatcaga atatatacat 2160 ttaagttttg aattatgcaa ttattaacat ttatcatttt atgcaattat tattaacatt 2220 ttatacaatt tagactatac attttaatat tttataatat attcacatac acatattaat 2280 cagaatatat acatttaagt tttgaattat acaattatta ttaacatttt atacaattta 2340 gac 2343 // ID Mariner-26_SM repbase; DNA; INV; 2754 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-26_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2754 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1875-1875 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(653..1078,1082..2428) FT /product="Mariner-26_SM_1p" FT /translation="MAKEKIQKKNMKRKFISLETKIKILDRLRNNERVIDI FT ARFFSMNEATIRTIRKNEDSIRKSVAAGMCTSMNTTSHVRNSAMEMMEKAL FT VIWLEDQNQKRIPIDTNAITAKALKIYEKLENQLSSSSSNKKPFTASHGWF FT EKFKRHSLHSLKLKGEQASADIDAAQQFPQKFSETIASKSYSPDQIFNADE FT CGLFWKKMPERTFLAKQNKSASGHKAAKDRITILFCSNASGDCIMKPVVIN FT KAKHPRSFKGININNLPVYWNANKKAWVTATLFNDWFHNSFVPDVEKYLVK FT KELPFKALLLLDNAPGHPQDLQHENVEVVFLPKNTTSLLQPLDQGIIATFK FT ALYIKSTFMYILDKLENDGSLSIIDAWKKFTILDSVKHIGISYRKIKKSTL FT HACWKAVWPDVVDENINLTVSLGQEYSDIIELAHTIGGEGFDDLAQRDINE FT VMEDQELNEDDLIEFVNKVSENDVETDVDVEDEPDPFTAKFVREGLAIGRK FT LGNYFQQHDPNVERALRFQRAINESLNQYEEVYKDLTKNKSQLLITDFISK FT SHRTEINPISTTECYEILSSDDSDIIEATKRRLIVESDSDSNLN" XX SQ Sequence 2754 BP; 1000 A; 387 C; 471 G; 896 T; 0 other; cacctatccc tcgttttaca cggcaatcgt ttaacacgtt ttcgatctta cacggttttt 60 aaaatcaaaa tttcaaaaaa aagactgcta aaaaaacaaa tttcgatatt atatgtacat 120 attatttaat tattttattc tattctattt atttttaaca attttctggt accacttatg 180 atgtgtttac aagttgtcaa gaaaaaaagc gagaaaatgc aaaaaaacat aagttttttg 240 agcttgtaat aaaagagggt atacaaagtt tcaaaattct attaattatc cataatcgat 300 tattatgtaa attatttaat tgtaaatcga ttaattaatt gatcattctt atcagatgct 360 aaccttaata aaaaattaga gaaacgtcaa tattcggtca acttgcaagc cgccatcatt 420 aacaaaaaaa atattgtttc aaatacaaaa aagggaaatc accttttcgt gttgtgtggc 480 acgtaaattc tctgagtgaa tcagttgatt ttaattgttg gatggattag tttgtatcta 540 agtgaagtaa tttagcgtta aaattaagaa cagaacaatt agaaggtaga tataaaactt 600 atatgcatct cttaactttt tatgattgat ttttcgaatt ttagttttac ccatggcaaa 660 ggaaaaaatt caaaaaaaga atatgaaaag aaaatttatt tccttagaaa caaaaattaa 720 aattttggac cgattacgta ataacgaaag agtaatagat attgcaagat ttttttcaat 780 gaatgaagct acaataagaa cgatccggaa aaatgaagac tcgatcagaa aaagcgttgc 840 agctggaatg tgtacgagca tgaatacaac atctcatgta agaaatagtg ctatggaaat 900 gatggaaaag gcattagtaa tatggcttga ggaccaaaat caaaaacgaa tacctattga 960 tactaatgcc ataacagcca aagcattaaa aatttacgaa aaacttgaga atcagttatc 1020 atccagttct tcaaataaga aaccttttac tgcaagtcac ggttggttcg aaaaatttta 1080 aaaaaggcat tcattgcata gtttgaaact aaaaggggag caagcatcag ctgatataga 1140 tgcagcacaa caatttcctc aaaagttttc tgaaacaatt gccagtaaat cgtactctcc 1200 tgatcagatt tttaatgcgg atgagtgtgg attattttgg aaaaaaatgc ctgagagaac 1260 atttttagca aaacaaaata aatcagccag cggccacaaa gcagcaaagg atcgtataac 1320 aattcttttt tgtagcaatg catcaggtga ctgtattatg aagccagtgg tgattaataa 1380 agcaaagcat ccacgttcgt tcaaaggaat aaacataaac aatcttcctg tttattggaa 1440 tgcaaacaaa aaagcttggg ttaccgccac tttatttaat gattggttcc acaactcttt 1500 tgtgccagat gtagaaaaat atttagtcaa aaaagagctg cctttcaagg ctcttctttt 1560 attggacaat gctccaggtc accctcaaga cttgcagcat gaaaatgtag aagttgtttt 1620 tttaccaaaa aacacaactt ctttgttaca acctttggac caagggataa ttgccacatt 1680 caaggcattg tacattaaaa gcacctttat gtatatttta gataaattgg aaaatgatgg 1740 ctctctttca attatcgatg cctggaaaaa gtttacgatt ttggattctg ttaagcatat 1800 aggaatttct taccgtaaaa ttaaaaaatc aacactccat gcgtgttgga aagcagtgtg 1860 gcctgatgtt gttgatgaaa atataaattt aacagtatct ttgggtcaag aatattcaga 1920 cataattgag ttggctcata cgataggtgg agaaggtttt gacgatttag cacagagaga 1980 catcaatgag gtaatggaag accaagagtt aaatgaggat gacttaattg aatttgtcaa 2040 taaagtaagt gagaatgatg ttgaaactga tgttgatgtt gaggatgagc cagatccatt 2100 cacagcgaaa tttgttcgcg agggactagc aataggaaga aaattaggaa actacttcca 2160 acaacatgat ccgaatgttg aacgagctct ccgatttcag cgagctatta acgaatcttt 2220 gaatcaatat gaagaagttt ataaagattt gactaaaaac aaatcacaat tgttaattac 2280 agattttatt tctaagtctc accgaactga aatcaatcct atatcaacca cggagtgtta 2340 cgaaatatta tccagtgatg acagtgatat tatagaagct actaaaagaa gactgattgt 2400 tgaaagtgat agtgacagca atttaaatta agttagtttt tattgaggtc aacatgttag 2460 tacataattg catattttgt atataatgta tatttttggt gaatttttta aaaatttatt 2520 ttgttttaca tttttttgtt tgaaaattaa ttgaattatt atgtttttgt tttatattat 2580 cagactgcta tgttgaactc ttttactgat attatttttt tctcaataaa tgattaaaaa 2640 taggtaaatt tgtggttatt ttcgtttaac acggtttttt catataaatt taattttttc 2700 gcttaacacg gttttgccag gaacctatct tccgtgtaaa acgagggata ggtg 2754 // ID hATm-3_HR repbase; DNA; INV; 2767 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 30-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hATm-3_HR, a family of autonomous hATm DNA transposons - a DE consensus sequence. XX KW hAT; DNA transposon; Transposable Element; KW Autonomous DNA transposon; hAT superfamily; hATm group; KW hATm-3_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-2767 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1051-1051 (2007). XX DR [1] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM, hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-3_HR is a young family of haTm transposons identified in the CC leech genome. The consensus sequence was built based on multiple CC alignment of 5 copies that are ~2% divergent from the consensus. CC TIRs are 15-bp long. This transposon is target site specific: CC hATm-3_HR copies are usually inserted into (TA)n. XX FH Key Location/Qualifiers FT CDS 229..2490 FT /product="hATm-3_HRp" FT /note="transposase." FT /translation="MAESTRKKQDIYLLGQPLKTAILGNKLPAKGEVMRRF FT LYIHLSEKMTVRDSETAVIREVLPIWQRARIPTTQEYNAIKKLDDMFKLWQ FT GLKKHSKRNSDAQRRQEQAFVHSRKDLFDIAHANAIELIQIQEDRKFLLAQ FT REPGRRGYMSGVDKNLALKEARSYERKRRQENYKLKSVQAQEEITAGACSA FT EIEHSSSDSSSECESEEDSIAKRVQVKRSRKSFSMSRPHCAVSRETAAALD FT RTKTSDRKATYIISGVAKHLGLNAGNLAINRSSIRRARHKHRQELALEIRR FT NFSPDVPLAVHWDGKLLLDLTGNQKVDRLPIVVAGAGVEKLLSVPKLESGT FT GERQATAVHECLLEWNLEESVKALVFDTTASNTGKKNGACTLLQQKLGRNV FT LHIACRHHISEIILEHAFSACMSSNTSGPEIALFLRFRNEFNSINQADYHT FT IMDDESMNSKVEQHKEGIVSFCLTQLEQFQPRCDYRELLELSLILLGETPP FT RGVRVLQPGALHRARWMARLIYAFKLYMYRHQFLLTRVEEKGICRFIVFGI FT CVYLRSWFTAPDLLSAARHDLQLVKRIQQFRDVDREVSVAAANALSRHLWY FT VSEELVGAAFFDDQISHEEKVLMVDALSVESKTTNDFTKRVQMNINDISDE FT MSVKDFVSSNTMQFFAILGLPHSFLNKRPSEWREDEQYKKAFEVVSGIKPV FT NDFAERGVALMQDFNRAIVSSEEQKQYLLQVVEYHRTQYPNPKKETLVGGN FT TSP" XX SQ Sequence 2767 BP; 873 A; 524 C; 624 G; 746 T; 0 other; tagggtggtc cccaaaatga aaaagtttta aattttttgt cttaccccct aattttgtta 60 ccacccatta ggtaatactt ggtgtaaaat cttaggtcaa ttggtcaata tttagaggtc 120 gcccaagggc tctgaagttt taccttatgg gcctaataat taatccaaat tgctaataca 180 acgtctttgt tttttattaa cagatctttg tctgaaaaag taaacaacat ggctgagtca 240 acaagaaaga agcaggatat ttatcttcta ggccagcctc taaaaactgc cattctgggt 300 aacaagttac cagcgaaagg tgaagtgatg cgtcgctttc tgtacattca tttgtcagag 360 aaaatgacag ttcgcgatag tgagactgct gttataagag aggtacttcc aatatggcaa 420 cgggccagaa ttcctacaac tcaagaatac aacgccatta agaagctgga tgatatgttt 480 aaactgtggc aaggattaaa gaaacattct aagagaaact cagatgctca aagacgtcaa 540 gaacaagcat tcgttcattc acgaaaagat ctgtttgata tagcacatgc taatgctatt 600 gaactgattc agatacaaga agacagaaaa tttttgctgg cacaaaggga accgggacgg 660 cgtggttata tgagtggtgt agataaaaat ctggcgctga aagaggcacg ctcgtatgaa 720 aggaaaagaa gacaggagaa ctacaaatta aagtcagtcc aggcgcagga agagatcact 780 gccggtgcat gttcagctga aatagagcac agttccagtg attcatcgtc agaatgtgaa 840 tcagaagaag acagcattgc caaacgtgtg caggtaaaac gaagtaggaa aagcttctcg 900 atgtcccgtc cacattgtgc agttagcagg gaaactgctg cagcacttga tcgaacgaag 960 acttctgata gaaaggcgac gtacatcatc agtggagttg ccaaacacct tggcttaaat 1020 gctggaaacc ttgccatcaa tagatcctca attcgacgag caaggcataa gcatcgtcaa 1080 gaacttgcgt tggaaattcg aaggaatttt tctccagacg ttcctttggc tgtccattgg 1140 gatgggaagc tcttacttga tttgaccggt aatcagaaag ttgatcgtct tcccattgtt 1200 gttgcaggag cgggagttga aaagctctta tcagttccaa aactagaatc tggtacaggg 1260 gagagacagg ctacagcagt gcacgaatgc ctcttagaat ggaacctaga ggagtctgtt 1320 aaggcattag tgttcgatac tacagcgagt aatactggaa agaaaaatgg agcatgcacc 1380 ctgctgcaac aaaaattagg gcgtaacgtg cttcacatag cttgccgtca ccacatatct 1440 gaaataatac ttgagcatgc gttttctgca tgcatgtcgt ctaatacgtc aggtccagaa 1500 atagcactct tcttgcgctt tcgaaatgag ttcaacagca ttaaccaagc tgattaccat 1560 acaattatgg atgatgaaag catgaacagc aaagttgaac aacacaaaga aggtatcgtt 1620 agcttttgcc taacgcaact tgaacaattt cagccaaggt gcgattacag ggaactactg 1680 gagctttcac ttattttgct gggagagaca ccgccaaggg gagtcagagt tctgcaacct 1740 ggagctctgc atagggcacg ctggatggcg agactaatat acgccttcaa attgtacatg 1800 tatcgtcatc agtttctgct gacaagagtt gaagaaaagg gaatctgtcg cttcatcgtt 1860 tttggcatct gtgtgtatct tagatcgtgg ttcacggctc cagatcttct tagtgccgct 1920 agacatgatc tccagctggt aaagagaatc caacaattca gagatgtaga ccgtgaagtt 1980 tctgttgccg ctgcaaacgc actttctaga catctatggt atgttagcga ggaacttgtt 2040 ggagctgcgt ttttcgatga tcagatttct catgaggaga aagtgttgat ggttgatgct 2100 ctgtcagtgg aatcaaaaac aaccaacgac ttcaccaaaa gagtgcagat gaatattaat 2160 gatatatctg atgaaatgtc tgtcaaagac tttgtgtctt cgaacaccat gcaattcttt 2220 gcaatcctcg gactgcctca ctcatttcta aacaagagac cctccgagtg gagagaagat 2280 gaacagtaca agaaagcttt tgaagttgta agcggtataa agccagtcaa tgattttgca 2340 gagagaggag ttgctttgat gcaggatttt aacagagcaa tagtatcgtc tgaagaacaa 2400 aagcagtatc ttctgcaagt tgttgagtac catcgtaccc agtatccaaa cccaaagaaa 2460 gagacattag tgggaggaaa tacatctcct taacaactaa atggtttaac agtgacactg 2520 ggtactggac ataataattt tatcaggtag tttgtaacag ttttgtcatg ttttgcaaca 2580 aaatgctaat acgtataatt aggtggtagc acaggacact ttgactcgct tgggcgacct 2640 ctaaattttg tctatggtca ataaaatttt atcatattac ttgttatata tgagagaaac 2700 aaataaaagg gtaagacctg aaaaattacg cgttttattt tttccaatgt aattggggac 2760 cacccta 2767 // ID Gypsy-87_CQ-I repbase; DNA; INV; 5956 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-87_CQ_; KW Gypsy-87_CQ-LTR; Gypsy-87_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5956 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 553-553 (2011). XX DR [2] (Consensus) XX CC Positions [4792-5301] - Integrase core CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS 976..1965 FT /product="Gypsy-87_CQ-I_1p" FT /translation="MELHFEKLSKILNNLSKAPTRRYRNSTLIAKLHEAKQ FT VYNAAIIEIELYKESEQLQLLKNLRYLYGEAYTKLTARLDNNIEPLSFKNL FT VNVILTINKLYRKVKMANTLDLSLAIRVVGKYDGDAAELENWLNDVTVLRA FT GQPTVDEAVFVQFMNNRLTGAARGELTGIATIIEARNTLRSRFGIKLTPVA FT VTAELRGLKQKGKSLVDFGNEIEKVAARLAAAWVSKQPQLFPTEAATRPIV FT EPIAVEAFINGLKDQSKVLQMRSRNPETLTKALSDALEIHAQPMPEEVMWT FT YASYSNPGFRGRGRGKRGKNRGNRGYQNNRNRNCETKR" FT CDS 3412..5646 FT /product="Gypsy-87_CQ-I_2p" FT /translation="MAGLNVERCLVYLDDIIVFGKTLGDHNRNLFDVFERL FT RQTNLKLNPLKCNFLKKELIYLGHTVSEEGIKPDPSKIEIIKKWSSPKTAD FT EVKRFVAFSNYYRKHIKNFAKLCSPLNRLTRKGVDFEWSEECENSFQHLKN FT CFITPPILDYPDFSDKNTFTLHTDASGKAIGAVLSNQNGRPIAFASKALNK FT AEINYSTIEKELLGVVWAIRHFRPYLYGRRFDVYSDHRPLVYLFTLTDPSS FT RLTKFRLALEEYKFDVIYKKGSENVIADALSRISIEDLQALTPQINFVCTR FT AQNKNKIKSAEKNEDKSEVATDYIQIQIGSNENEFKIEDGTLIINPKTSIS FT HLRGVLKRLVVFTEKNPIKGLVVQNTQTYKKIKEQIEALKINGMPPIIVIG FT NEIKHVENKDEQQLIINNYHILPTAGHAGIKRTQNTIKQRYYWKSMDQDIS FT NFIKKCVKCQQNKSFRTKVPMTMTSTAKTAFEKIFLDIVGPLLPDAYGNQY FT ILTTQCDLSKFITATPIIDKSTNAVAKAFVEAVILNYGVPDQILTDRGTEF FT MSSVFTKICDMLKIEKLNSTAYHHETIGALENSHKVLGNFLRIQTNNSYGY FT WSAWVPYYKFAYNTTVHSATSKTPYELVYGKLCNFPSELQGETEISPVYNI FT DDYSAILKFKLQTSQKQARDSLLKSKQKRIAKNTGHDDKNLYKINDLVLVK FT NETGSKLEQKFEGPYEVLEDLGTNLKMQIKNKIDIIHKNRVRKFI" FT CDS 2513..3439 FT /product="Gypsy-87_CQ-I_3p" FT /translation="MILREPLFHENFVIPARTEVVTFIETSFKEDLVILNQ FT EIEPLVFIANALVSPVDGKIPVRLMNLKNKAVKVNDLKLLAEPFKNYDLIK FT IGSAYPHNVDRANKLLSELNFDGINEKDKVEITKLCLKFSDIFCLTDDKIT FT VTKIYQPSLKVKPDTQPVYTRPYKLPQSQRDEVQRQVDKMLNDNIIEETVS FT EWNSPLLLVPKKSTDDSKKWRLVIDYRKLNNVLQDDKFPLPNIEEVIESLA FT GAKYFSHLDLTQGYYQCEIRPEDRSCTSFCTNRGQYQMTRLPMGLKNSPSI FT FSRLMTVAWLVSMWKGV" XX SQ Sequence 5956 BP; 2216 A; 1019 C; 1159 G; 1558 T; 4 other; tggcgatcct gccagctcta acaaaagttt ttctttttcg ggaattcaaa gcttaaagtg 60 aagactaaaa gtgaatgaaa tattcacaac agtgatttat gccagtgtag tgaattgtaa 120 agtagcaaaa gcattaaaat gagttggttt agcagtgaag tgaatactgt taaaattaca 180 gatcatgtaa gcgccatcgc gcttgtggta ctagctgcag cagcaataat ttacatgctg 240 ggtaaatttt ttgccaaaca cgtgcaaaga acggcaacgg ccgccgcgag tcgtgagact 300 aggttgaata atatcgctac gcagcgataa gtgaacattg atttaaacaa agagaagaaa 360 gtttgtgaat aaacatttaa aatcaaattg aagaaaaaaa aaatcwgtga gaaaaaaaaa 420 aaaaagcatg atgaacatga agagtgaaag tgaaaaattg aaaattaata tctaaaaaaa 480 aggaagcagg atgaacacaa aaagtgaaag tgaaaaattg aaaattaata tctaaaaaaa 540 atttttgtga tgatattgat gacaaaaaaa tatcacaaca gcaagaggag cgtcggttgt 600 gagtacggtc cgtactacca caatccggat gtagactcta tcctgaaaac aagagaaata 660 aaacaaatca gggaagcgcc ggttgtggga acggtcggtt ccgtcacaac ccgggttgca 720 ggcccttggc aattcaacaa gcatcaaaca tcaaatgagc aggagagcaa tggttgtgag 780 aacggacggt tctgacacaa gccatatgca gatcctcaag gcgaacacca tcaataacca 840 accaacatct catcgcagaa gggaagcgtc ggttgtggga acggtcggtt ccgtcacaac 900 ccgataagca ggccctggga aaatgattgg atcaactgga gatgtccgaa tcatcaaagc 960 gtgagaatac tttaaatgga actacatttt gaaaaattat caaaaatatt aaataattta 1020 tcaaaagccc caactaggag gtatagaaat agtacattaa ttgccaaatt acacgaggct 1080 aaacaagtat ataatgcagc aataatcgag atagaactct ataaggagtc ggaacaatta 1140 caattattga agaatttaag atatctctac ggagaagcat atacaaaact tacggcacgg 1200 ctcgataata atattgagcc gctttctttt aaaaatttag tgaacgtaat actaacaatt 1260 aataagctat acagaaaggt aaaaatggcg aatacwttag atttgtcatt agctattcgc 1320 gtcgtgggta aatacgacgg cgacgcagcc gagttggaaa actggctcaa cgacgtgacg 1380 gttctccgag cggggcaacc aactgtcgac gaagcggtgt ttgttcaatt tatgaacaac 1440 cgtttaactg gtgccgcccg cggtgagtta acgggaatag ctaccatwat agaagcacga 1500 aataccttaa gatcgcgatt tgggattaaa ttaaccccag ttgccgtgac agccgagtta 1560 cgggggttga aacaaaaagg caaatcttta gtcgatttcg ggaatgaaat cgagaaagtg 1620 gcagctaggt tagccgcagc gtgggtatcg aaacaaccac aacttttccc aactgaggca 1680 gccacgcggc ccatagtcga accaattgct gtagaagctt ttataaatgg tcttaaggat 1740 caatcgaaag tattgcaaat gcggtcaagg aatccggaaa ccttaaccaa agcattgtcg 1800 gatgctctgg agatacatgc gcagcccatg cccgaggaag tcatgtggac ctatgccagt 1860 tatagtaacc caggcttccg aggccgaggc cgaggaaaac gcggtaaaaa cagagggaac 1920 cgtggctacc aaaataaccg aaatagaaac tgtgaaacaa aacggtaatc agcaaggtta 1980 caataaccag cgcagcaagg ctacaactcg aatcgaggcc accgtggaaa tcgaggttac 2040 caacataacc gaggaggtgg taatcgagga gttgctaacg tggttcagga agatccgcga 2100 ccccaacagc agcaacaact acaacctcag caaccacgtg aagaggttaa tgtaggcgag 2160 ttttttcgtg catagttcaa atgccgaaag agccctacgc ctaaaattta aaattaataa 2220 ctcggaaatt tcgttaatag tagacagtgg agcatcttgc tgtctactag acgtaaatta 2280 tttaccccaa aaatttcgtg aaaaaatcaa tgctacgcag tccatagagg ttcgtggttt 2340 aaacggagtc acgcacactc ttggcactgt atcattgttt atagaatata atggttatga 2400 ataccccata acgtttcata ttgttgaaaa cctatcgcct tctatagcag gattggttgg 2460 aacgaatttt ttaagaaagt tcggagcagt aatagatttt gaaaattcta ctatgatttt 2520 acgggaacca ttgttccatg aaaacttcgt tatacccgca agaacggaag tagtaacatt 2580 tattgaaaca agttttaagg aggatttagt tattttaaat caagaaatag aacctttagt 2640 ttttatagct aatgcgttag tttctccagt cgacggaaag attccggtta gactgatgaa 2700 tttgaaaaat aaagccgtaa aggtaaacga cttgaaactt ttggccgaac cttttaaaaa 2760 ttatgatttg ataaaaatag gtagtgctta tcctcataat gtggacagag caaacaaact 2820 tttatcagaa ttaaatttcg atggaattaa tgaaaaagat aaagttgaaa ttactaaact 2880 ttgtttgaaa tttagtgata tattttgttt aacagatgac aaaataactg ttacaaaaat 2940 ttatcagcca tctttgaaag ttaaaccgga tacacaacca gtgtataccc gtccatataa 3000 gttaccwcaa tctcagcgtg atgaggtaca gagacaagtt gataaaatgc taaacgacaa 3060 tataatcgaa gagactgtat cagaatggaa tagtcctttg ttattagttc ctaagaaatc 3120 aactgacgat agtaaaaaat ggcggttagt catagattat cgtaagttaa acaacgtttt 3180 acaggatgat aaatttccac ttccaaatat tgaggaagtt atagaatctt tagcaggagc 3240 aaaatatttt tcgcatttgg atctaactca aggttattat caatgtgaga taagaccgga 3300 ggacagatcc tgtacttcat tttgtactaa taggggacag taccaaatga ctcgtctgcc 3360 tatgggttta aaaaacagtc cttcaatttt ctcaagattg atgaccgttg catggctggt 3420 ctcaatgtgg aaaggtgttt agtttatctt gatgacataa tcgtctttgg caaaacactt 3480 ggagatcaca atcgaaatct ttttgatgtt ttcgaacgat tacgacaaac taatttgaaa 3540 ttaaacccgt tgaaatgtaa ctttttgaag aaggagttga tctatttggg tcatactgtt 3600 tctgaagaag gtatcaaacc agatccttcg aaaattgaaa taattaaaaa atggagcagc 3660 cctaaaacgg ctgatgaagt aaagagattt gtagcttttt ctaattacta tagaaagcac 3720 atcaaaaatt ttgccaaact ttgcagtcct ttaaatagac taacaagaaa gggtgtagat 3780 tttgagtggt cggaagaatg tgaaaacagt ttccagcatt tgaaaaattg ttttattact 3840 ccaccaatat tagattatcc agatttttcg gataaaaaca cgtttacttt acacacagat 3900 gcatcaggaa aagctattgg tgctgtttta agcaaccaaa atggtagacc tattgcattt 3960 gcgagtaaag cattgaataa agctgagata aactacagca caattgaaaa agaactactt 4020 ggagtggttt gggcaatcag acatttcaga ccatatttgt atgggagaag atttgatgtt 4080 tattcagacc accgtccttt agtttattta ttcacattaa ctgacccttc gagtagatta 4140 accaaatttc ggttagcact cgaagaatat aaatttgatg tcatatacaa aaagggatca 4200 gaaaacgtga tagcggacgc gttgtcacgt atttctattg aagatctcca ggcacttacg 4260 cctcaaataa attttgtgtg tacaagagca caaaataaaa ataaaattaa atcagctgag 4320 aaaaatgaag ataaaagcga ggttgcaaca gattacatac aaatccaaat tggaagtaat 4380 gaaaatgaat ttaagattga ggatggaact ctgattatta accctaaaac gtcaatatct 4440 cacctacggg gagtattgaa acgactagtg gtgtttacag aaaagaatcc gataaaaggc 4500 ctagtcgtgc aaaatacgca gacatataaa aaaataaaag agcaaataga agctctaaaa 4560 ataaatggca tgccacctat cattgtaata ggaaatgaaa taaaacatgt tgaaaataaa 4620 gatgagcaac aactcatcat taataattac cacatattgc caactgcagg gcatgctggt 4680 ataaaacgaa cacaaaatac catcaaacaa agatattact ggaaatccat ggaccaggat 4740 atctcaaatt ttattaaaaa gtgtgtgaaa tgccaacaaa ataagtcttt ccgcacaaaa 4800 gtacctatga caatgacaag cacagcaaaa actgcttttg agaaaatatt tttggatatt 4860 gttggtccat tattaccaga cgcctacggg aatcagtaca tactgactac ccagtgcgat 4920 ctgtcaaaat tcataactgc aacgccaatt attgacaagt caacaaatgc agtggccaaa 4980 gcatttgttg aagcagtaat tttaaattat ggcgttcccg accaaatatt gacggacaga 5040 ggtacagaat tcatgtcctc tgttttcaca aaaatttgtg atatgctgaa aattgagaaa 5100 ctcaattcaa cagcatatca ccatgaaaca attggagcgt tagaaaattc gcataaagtg 5160 cttgggaatt ttctacgaat ccaaacaaat aattcttatg gatattggtc tgcttgggtg 5220 ccgtattata agttcgctta taatacaact gtgcacagcg caaccagtaa aactccatac 5280 gagttagtat acggaaaatt atgcaatttt ccatctgagt tacaaggaga aacggaaata 5340 agtccggtct ataacataga cgactattcc gctattctga aatttaaatt acaaacatcg 5400 caaaagcaag cgcgcgacag tttgttaaaa tccaaacaaa agcgaattgc taaaaacacc 5460 ggacacgacg acaaaaatct ttataaaatt aacgatctag ttctcgtcaa aaatgaaacc 5520 ggttccaaat tggaacaaaa atttgaggga ccatatgaag tcctagaaga tttaggtacc 5580 aatctaaaaa tgcaaattaa gaataaaata gatattattc ataagaatag agtaagaaaa 5640 ttcatctaaa aaaaaagagt gaaggaagaa gcagaacaaa agaaaataac aataaaaaaa 5700 aaaaaaatat aaaaacacaa aattacattt ccattataac ataatacact acacttacat 5760 atattataac attgcaacac atacttacat atagtactta gattatattt tcacactaat 5820 aatagtacat taaaaattat cactatagaa taatttttaa tttttcccag aagaggcgtg 5880 tagtgggtca acacatgttt cgtgccagtt cttactgact catattacaa cacttgaggc 5940 agtaacaaat tattta 5956 // ID Gypsy-169_AA-LTR repbase; DNA; INV; 163 BP. XX AC supercont1.339; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-169_AA_; KW Gypsy-169_AA-I; Gypsy-169_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.339; Positions 134769 134931. XX SQ Sequence 163 BP; 44 A; 34 C; 38 G; 47 T; 0 other; tgtggtggac agcgcatgct ataggtatat acaacgatac tcggtaagta ccacttgtgt 60 attagtatta acttaatgct atcgatggtc tgactaacta gcgagcgaac acagctattc 120 attctttgag ttagacgcgt gtgccacgga tctcggtacc aca 163 // ID I-5B_AAe repbase; DNA; INV; 5651 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-5_AAe; KW I-5B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5651 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1355-1355 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 16 sequences with >96% CC identity. The consensus is ~79% identical to I-5_AAe, and CC ~76% identical to I-10_AAe. XX FH Key Location/Qualifiers FT CDS 318..1625 FT /product="I-5B_AAe_1p" FT /translation="METDGDEGGPKENSDCAESSSLTKPFRVKVYPSSFSG FT PFVVYFRKKEKPINVLLISSEIYKKYKSVKEIKKISLDKLRIVFGSRDEAN FT ALLESKLFFNSYRVYAPCDSCEINGIIYDESLNCDDIRNHGSGFFRNKSIS FT PVEILDCVRLSKLFIDGNDTKYVHSNCIKITFSGSVLPDYVNVDNVIFRVR FT LYYPKLMHCDRCLLFGHTSNFCSNKQKCSKCGEFHSSSTCKNQSNLCIYCK FT QEHNSLKECAVYIENQSKFNQKIKNKNLLSYAEVMKSTDNIASANTFEILS FT DDDGNVDHENFGNYVYKPPSKRKRINNKTSNNHNQIFEPQPSTSFDLHFPS FT LNDTTSPKNIPGFRKIETDPTSNNKNQNSNNLSENKAKVDTDNSILNILEE FT IVEFLGLSDFWKKIIKIFLPILASFLNKLNSFGPLLSSFFSS" FT CDS 1628..5317 FT /product="I-5B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MAQIKNNDMNILQWNCRSIIPKIDSLKVLISSYNIDL FT FCLNETWLVSSKFFHVSKFNVIRKDRDSSYGGVLIGIRENIEFKYLDLSIH FT SQVEYIAISVKKEKCEFSVICFYIPPNTTFSLSQIRSILNEVPPPFYILGD FT FNAHNFAWGSEKTDGRGSLIMELIDDLNLHILNDGSFTRIVVPPNHHSCID FT LSLCSNSLSMKSSWKTIDDPNGSDHLPILIEIQNPVQDNFVHEFNAPDLCT FT NVNWTKFSDLVSLSLINIVELLSPIENYNNFSKLLIDCLHKSQNKKNKSCS FT IRKNRPSFWWDNECSIALKTKSDAFKLFRRSGSRNHFFLYRKAEAQFIRLT FT KSKKRNYWKNFVQNLDRETSLSTLWSVARNLRNYDPTIPNVLEYSEKWIHE FT FASKICPDFVPRSIKFKNSSLNYFPELCGSFSLAEFNLALSITKNTAPGID FT NIKFIVLKNLPDDGKLHLLSIYNAFLTQNIIPYEWRLIKVISILKPGRDPS FT SADSRRPISLLSCLRKLLERMILNRLELWAENNKIFSSSQFGFRRGRGTRD FT CVSLLASQIQLSFNKKQDLVSTFLDVSGAYDSVLIDLLFEKMHNLKIPNMI FT SNFLCNLFSFKIMNFFHNGDSKLLRYSYFGLPQGSCLSPFLYNLFTSDMAS FT IIPNGCYLLQFADDNVISISGKNREVISHFMQCALNNIDSWAHENGFSFSV FT QKTKYIIFSRKHSNISINLYLDGIEIEQVDEFKYLGIWFDSKLNWNIHIKQ FT IQKTCSKRINFLRTITGTWWGANPSDLITLYKTTIRSVMEYGCFTFGSAAH FT THFVKLEKIQFRCLRICLKLMNSTHTQSTEVLAGVVPLKIRFQELNCKFLI FT NCLSNNHPIIDILKSLFEINPTSRILESYIICSTENVAPISYNSFYNFEIN FT IHTYLPLIDFSLFHELQQIPNHLYSHFANILFMRKFDGITSAQFYFTDGSL FT VQGIAGFGIYNEYVAHFFKLKSPCSIFIAELIALYFACNLIKDCPPNIYIV FT CSDSLSCLRALGTISFNFKTHHVLFMLKKLLFDLHCQGFIIKFLWIPAHCR FT IYGNEQADSLAKFGVVRGIIYDRDIFPSEYYPKLKTHSLDNWQTCWNSSNK FT GRWCYSICPKVSNFPWFRNIPATRNFICSFSRLISNHYICNSHLYRINIVD FT SNLCECGESYEDIDHIVLQCSRFILPRTTLFNTLMNYGHSIPVSIRDILGT FT KCYQILKLLYSFLNEISYTI" XX SQ Sequence 5651 BP; 1782 A; 857 C; 857 G; 2154 T; 1 other; catacccggt aagtaggcct tgaccaggtt agacggtttt tttcctcgct tgtcattttt 60 tttccatttc agatccgaat tttccgaagc tttgaatttc aattttgacg atttgcttgg 120 aggattttcc cagtgtgaat aattcaagag ctgcctatca agtatttgga attcaagttt 180 tcggtcactt ttatcgtcat tggagtttcg ttcgttgcaa aaccttgagt ataattttac 240 ctttattatt tatttctttt cctgttattg tagttctcat ttgacattgt tatttctatc 300 tgtgtgaagt ttccattatg gaaactgacg gggatgaggg gggacctaaa gaaaattccg 360 attgtgctga atcctcttct ttgaccaaac catttcgtgt aaaagtttac ccttctagtt 420 tttctgggcc ttttgtcgta tattttcgaa aaaaggagaa acccattaat gttttgctga 480 tttcctctga gatttataaa aagtataaat ctgtaaagga aattaaaaaa atttcattgg 540 acaagttgag aattgttttt ggatcgcgag atgaagcaaa tgctctcttg gaatccaagt 600 tattttttaa ttcctatcgc gtgtatgcac cttgtgattc atgcgaaatt aatggaataa 660 tatatgatga atcattgaac tgtgatgaca ttagaaatca tggttcaggt ttttttagaa 720 acaaatccat ttctcctgtc gagatactgg attgtgtccg tttatcaaaa ttatttattg 780 atgggaatga taccaagtat gtacattcta attgtatcaa gattacgttt tctggatctg 840 tactgccaga ctacgttaat gttgataatg ttatttttcg cgtgagactt tattatccaa 900 agctcatgca ttgtgatcga tgccttcttt ttggtcacac ttcgaatttt tgctctaaca 960 aacaaaaatg ttctaaatgt ggtgaatttc attcgtcatc cacttgcaag aatcaatcta 1020 atttatgtat ttattgtaaa caggaacaca attctttaaa agaatgtgct gtttatatcg 1080 aaaatcaatc aaaattcaac caaaaaataa aaaataaaaa tctcttgtcg tatgctgaag 1140 taatgaaatc cactgataac attgcttctg caaatacttt tgaaattttg tcagatgatg 1200 atggtaatgt agaccatgaa aattttggta attatgtgta caaacctcct agcaaaagga 1260 aaagaataaa caataaaact tctaacaacc acaatcaaat atttgaacct caaccatcta 1320 cttctttcga tttgcacttt ccatccttga atgatactac ttctcctaaa aatattccag 1380 gatttcgtaa aatagagact gatcctacat ctaacaataa aaatcaaaat tcaaacaatt 1440 tatcggaaaa caaagctaaa gtagatactg ataactcaat tttgaacatt ttagaagaaa 1500 tagtagaatt tttaggattg agtgatttct ggaaaaaaat aattaaaata tttttaccta 1560 ttttggcttc ctttttaaat aaattgaatt cttttggccc tcttttgtct tcatttttct 1620 cttcgtaatg gctcaaataa aaaataatga tatgaatatt ttacaatgga attgtcgtag 1680 tattattcca aaaattgata gtttaaaagt tttgataagc agttataata ttgatttgtt 1740 ttgtttaaat gaaacttggt tagtgtcatc aaaattcttc catgtttcta aattcaatgt 1800 tattcgaaaa gatagagatt cttcgtatgg aggagttctt attggaattc gtgaaaatat 1860 agaatttaaa tatttagatt tgtcaataca ttcacaagtt gaatatattg ctatttctgt 1920 taaaaaagaa aagtgtgagt tttcagttat ttgcttttat attcctccta ataccacttt 1980 ttctttatcg caaattagaa gtatattgaa cgaagttcct cccccatttt atatattagg 2040 tgatttcaat gcacataatt ttgcttgggg tagtgagaaa acggatggta gaggttcatt 2100 gattatggaa ctaattgatg acttaaattt acatatcctc aatgatggat cttttactag 2160 gattgttgta ccaccaaatc atcattcgtg tattgactta tctttatgct ctaatagctt 2220 gtctatgaaa tcttcttgga aaactattga cgatcctaat ggtagtgatc atttaccaat 2280 tttaattgaa attcaaaatc ctgtccaaga taattttgtc catgaattta atgctcctga 2340 cctttgtaca aatgtaaatt ggacaaaatt ttctgatctg gtatctcttt cgttaattaa 2400 tattgttgaa ttgttatctc caattgaaaa ttacaataat ttttcaaaat tattgattga 2460 ttgtttgcat aaatctcaaa ataaaaaaaa taagtcatgt tctattagaa aaaatcgccc 2520 ttctttttgg tgggataatg aatgttccat tgctttgaaa actaaatctg atgcttttaa 2580 attatttcgt cgatcaggct ccagaaacca tttctttcta tatcgtaaag ctgaggcaca 2640 attcataagg ctaactaaat ccaaaaagag aaattattgg aaaaattttg ttcaaaattt 2700 agatagagaa acttctttat ctaccttgtg gtctgttgcc agaaatctaa gaaattatga 2760 tcccaccatt cctaatgttt tagaatattc ggaaaaatgg atacatgagt ttgcttcaaa 2820 aatttgtcct gattttgtcc cacgttccat taaatttaaa aatagttcac taaattattt 2880 tcctgaactt tgtggttcat tttcattagc tgagtttaac ttagcattat caattactaa 2940 aaatactgct ccaggtattg ataatattaa atttattgtg ttaaaaaatt taccggatga 3000 tggaaaactc catttacttt caatatataa tgctttccta actcaaaata ttattcctta 3060 tgaatggcgt ttaatcaaag ttatcagtat tctaaagcct ggtagagatc cgtcatcagc 3120 tgatagtagg agacctatta gtttattatc ttgtttgcgt aaacttttgg aaagaatgat 3180 tttgaatcgt cttgaattgt gggccgagaa taataaaatt ttttcatctt ctcaatttgg 3240 atttaggaga ggtcgaggta ctcgcgattg tgtttcacta ttagcttcac aaattcaact 3300 ttcattcaat aaaaaacagg acttagtttc cacttttctt gatgtttctg gagcctatga 3360 ttctgttcta atcgatttat tgttcgaaaa gatgcataat ttaaaaattc ccaatatgat 3420 ttctaatttc ttatgtaatt tattttcttt caaaattatg aacttttttc acaatggaga 3480 ttcaaagtta ttaagatata gctattttgg acttccacaa ggttcatgtt taagtccatt 3540 tttatacaat ttatttacca gtgatatggc atcaataatt ccaaatggtt gctatttgct 3600 tcaatttgct gatgataatg ttatttctat cagtggcaaa aatagagaag tcattagtca 3660 ttttatgcaa tgtgccttaa ataacattga ttcatgggct catgaaaacg gattttcatt 3720 ttcagttcag aaaactaaat atatcatatt ttcgagaaaa cattcgaaca tatccattaa 3780 tttgtatctt gatggaatcg aaatcgaaca agttgacgaa tttaaatatc ttggtatatg 3840 gtttgactct aaattaaatt ggaacattca tattaaacaa attcaaaaaa catgttcgaa 3900 acgaatcaat tttcttcgta ctattacagg tacttggtgg ggtgccaatc catctgactt 3960 aattacgctt tacaaaacaa ctatccgttc agtcatggaa tatggttgtt tcacttttgg 4020 aagtgctgca catacacatt ttgtcaaact tgaaaaaata caatttcgtt gtttgagaat 4080 ttgtttgaaa cttatgaatt caactcatac tcaatcaaca gaagtactcg ccggtgttgt 4140 accacttaaa attcgttttc aagaactgaa ttgtaaattt ttgattaatt gtctctcgaa 4200 taatcatcca attattgata ttttgaagtc actattcgaa attaacccaa cgagcagaat 4260 attggaatct tacattattt gttcaactga aaatgttgca ccaatttcat ataatagttt 4320 ttataatttt gaaattaata ttcacactta tctacctttg attgatttct ctctattcca 4380 tgaattgcaa caaattccaa atcatctata ctctcatttt gccaatattc tatttatgcg 4440 taaatttgat ggaatcactt ctgcacaatt ttatttcaca gatggatcct tagttcaagg 4500 catcgctgga tttggaatat acaatgagta tgtggctcat tttttcaaat taaaatctcc 4560 atgttccata ttcattgcag aattaattgc tttatatttt gcttgtaact tgataaagga 4620 ctgcccacca aatatttaca ttgtatgttc tgatagctta agttgccttc gtgctttggg 4680 taccatatct ttcaatttta aaactcatca tgttttattc atgttgaaaa agttattatt 4740 tgatttacat tgtcaaggat ttataattaa atttttgtgg atccctgctc attgtagaat 4800 atatggtaat gaacaggctg attcattggc taaatttggt gttgttcggg gtataattta 4860 cgatcgtgat atttttcctt ctgaatatta tcctaaattg aaaacacatt cccttgataa 4920 ttggcaaact tgttggaatt ccagtaataa agggagatgg tgttattcaa tttgtccaaa 4980 agtcagtaat tttccttggt ttaggaatat accagcaact agaaatttta tttgttcatt 5040 ttcacgactc atttcaaatc attatatttg taatagtcat ttatatcgta tcaatattgt 5100 ggattctaat ttatgtgaat gtggtgaatc ttatgaagac attgatcata ttgtccttca 5160 atgttcaagg tttattttac caaggactac attatttaac actttaatga attacggaca 5220 ttcaatacca gtatctattc gggatatatt gggtacgaaa tgttatcaaa tacttaaact 5280 tttatatagt tttttgaatg agatttcata tactatttga tatttctgct kgtgtgtgtg 5340 tttttttttt cttgttttta tttgcagata tgaagatgga ctcttttttg gtagaccccc 5400 tccctcatcc cgttttggaa gccaccccat accccattac ttcacttgga tccatgttgc 5460 agtttatggc tctgctatgg ttaattttta ccgtatgagc ctttagtttt aatttttctg 5520 ttataacgtt atatgaaaag ataaagaggt tttgtgcctc tttgagaaag atttcgaaca 5580 gatatcactc aaaggggttt ttccctcttt caaaattttg gttaaataaa taaataaata 5640 aataaataaa t 5651 // ID Mariner-33_HM repbase; DNA; INV; 3541 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-33_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3541 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1967-1967 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 909..2957 FT /product="Mariner-33_HM_1p" FT /translation="MYTYSSIVEFVLVYFFVFPSFSNNKIMKKIKRFQWSK FT TNMQNAINAVKSSSLSQRSAAIEYKIPRATLGKYLRGDSLIGVKPGPPPKL FT PNHIEKKLVDYATSRANRGIGFGKRQFMKYVSQLSKKHQINFKHDTPSEKW FT WRLFKQRHKTMVLRKPEGTSSVRHQCMSVQKVSKYFLALQQVFNKLGTHVM FT PSSIWNMDETGLQLDFKPPKIVAARGAKHLQSRTSGKRETITVIAAVNAAG FT KTIPPHLIPKGKTIKSLQAFNTSDAPVGSKWSVSETGWTKQGIAYLWFTNT FT FLPNIGTNRPQVLIVDGHDSHNSVELLSVAIDNNIEIVEMPAHCSHWLQPL FT DRTVFGPLKTYYNTCCHDLMNTYSVTINKSNFCGLFKKAWDQALTATNIIS FT GFQSCGIFPYNPSAIPYEAYLPNSLYSDQLLGNENLLKSSFDSFKNDTINY FT QSGISLNNAFEKQVDPNSNCTFINDVDTTETIGILTGSDILNVDLAQLDND FT TALMLWNSCLTPEHLTAFNFCYQNGFDISDNLYLAYKDFRINQNAISDVSF FT DLSLREAAETIALSEETDIHSVSIPNTSVDYSSPISVKTDSLKSTIDNDHD FT VISYPNFYVNKSKNNRNTKQKYFVLTSPEAMQAKLNDLKKKEEKVEAAKIK FT KLKQLDAKEKRLKLQQDRELKRQLKHIDFVINK*" XX SQ Sequence 3541 BP; 1256 A; 500 C; 498 G; 1286 T; 1 other; gggcaaactg acctattagt ggccacgtcc tattaagagc cgcttttttg agggctcttt 60 tcaaagattt tttaaacagg ttgatcaaat atttggattt tttcttcagc tatttgtact 120 ataaacaata tgcaatatat atatttaagc tacagtttat aaatagaata tttatattat 180 gttaaagata tcgctttttt tggggagaaa agtagccggt tttggatttt ttaaacatat 240 ttaaaatagt ttttatgatg cattttttta gttcagttat ttaagaacgt ttattttaaa 300 atttaagtca tacatttatc ttttaaacac tttgttgaac tgcaaacaat aagttttaga 360 attttgaaag gctttgtata gcaccactca catcgcccaa ttattggccg tataatgtac 420 caatttgtag ccattacaat ttttcaatat tacctatatg ttatttgttt gtttttattt 480 tttattgtaa acttagttat tttttgtgct aatacaaaca taatattaag aaatttattt 540 aatttactca cttagttatt caaataccta tttctttcat ttgtaaggta aaacaagact 600 gttacaaaac aatataataa taacaatatc tctgaaaaac tcagactatt aaagaagcaa 660 agtattttaa cctaagtact aagcttactt aaggtaatta tgctctgttc tttaatgatt 720 taaattatta ttgttattat tattatcata tatatatata tatatatata tatatatata 780 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 840 tatatatata tatatatata tatatatata tatatatata tataatattc ttatatttaa 900 ttcrattaat gtatacatat tcttcaatag tcgaatttgt tttggtatat ttttttgtct 960 ttccttcttt tagcaataac aaaattatga aaaaaattaa acgttttcaa tggtcaaaaa 1020 ctaacatgca aaatgccatt aatgcagtta aaagtagttc attatcacaa agatcagcag 1080 ctattgaata taaaattccc agagcaacac taggaaaata tcttcgagga gactccttaa 1140 ttggagttaa acctggacct ccaccaaaat taccaaatca tatagaaaaa aagttagttg 1200 attatgccac atctagagca aatcgcggta ttgggtttgg gaaaagacaa tttatgaaat 1260 atgtctcaca gcttagtaag aaacaccaga taaattttaa acatgatacc ccatcagaaa 1320 aatggtggcg gttatttaaa caacgacata agactatggt tttaagaaaa cctgaaggaa 1380 catcatctgt tcgacatcaa tgcatgtcag tccaaaaagt ttctaaatat tttttagcac 1440 ttcaacaagt ttttaataag ttgggtactc atgttatgcc ttcttcaatt tggaatatgg 1500 atgagactgg acttcaatta gactttaaac cacccaaaat tgttgcagca cggggtgcca 1560 aacatcttca gtctcgtact tcaggaaaaa gggaaaccat cactgtaatt gctgctgtga 1620 atgcagctgg taaaactata cctccacatc taataccaaa aggtaaaaca ataaaatcac 1680 ttcaagcttt taatacaagt gatgcaccag ttggttcaaa atggagtgtt agtgagactg 1740 gatggacaaa acaaggaata gcataccttt ggtttacaaa tacatttttg ccaaacattg 1800 gaactaacag gccccaggta ttaatagttg atggtcatga ttctcacaat tcagttgagt 1860 tattatctgt tgctattgac aacaacattg aaattgttga gatgccagca cattgttctc 1920 actggttgca gccattagat cgtactgtat ttggtccctt gaagacttac tacaatactt 1980 gttgtcatga tttaatgaat acatactctg ttacaatcaa taagtctaac ttctgtggat 2040 tgttcaagaa agcttgggat caagcattaa ctgccactaa cattatttct ggttttcaat 2100 catgtggtat ctttccctat aatccttctg caattccata tgaggcctac ttgcccaatt 2160 cactttactc tgaccagtta ttagggaatg agaatttgct taaatcaagt tttgattcct 2220 tcaaaaatga taccataaat tatcaatctg gaatatcact aaacaatgct tttgaaaaac 2280 aggtggatcc taattcaaat tgtactttta ttaatgatgt tgatacaact gaaactatag 2340 gcatcttgac tggatcagat attctgaatg tagatttagc tcagctggat aatgacacag 2400 cattaatgtt atggaatagt tgtttaactc cagagcatct gactgctttt aatttctgtt 2460 atcaaaatgg ttttgatatt tctgacaatt tgtatttagc atacaaagac tttcgtataa 2520 accagaatgc aatcagtgat gtttcttttg atctctcatt aagagaagct gctgaaacaa 2580 ttgctttatc agaagaaact gatattcact ctgtctctat accaaatact agtgttgatt 2640 attctagtcc aatatctgtt aaaacagatt cactgaaaag taccattgac aatgatcatg 2700 atgttatctc ttacccaaat ttttatgtca ataagtcaaa aaacaataga aatactaaac 2760 aaaagtattt tgttctaact tctcctgaag ctatgcaagc caagttaaat gatctaaaaa 2820 agaaagagga aaaagttgag gctgcaaaaa taaaaaaatt gaagcaactt gatgcaaaag 2880 aaaagagatt aaaactgcaa caggacagag aattaaaaag gcaactgaaa catattgatt 2940 ttgttattaa taagtgaaca agttaatatt ttttatcaaa ttttgttagt actttttgat 3000 gtatttcttg agttttgatt gttaaaattt ctttgaatca atatacagtt ttacaaataa 3060 tttattttgg ccattgttga aggggttact tatgtctaag ctagttttta ttgagtaatt 3120 ataacgttta aaaaatttta aaaatttgat gtgtaacagt taataatcaa tatttaacta 3180 gtttatattg attatttaca ttttaaagcc tttttttttt aattcattat aaatcttatt 3240 aaaattaaat tattttcaca acattttaac ctgtggcaaa gttagtagaa tctaatattt 3300 caataaaaag tgaactggct cataattagt caacccggcc attaattggt gtttattcaa 3360 aattttaatt caaatgaatt gtttacattg ttgttttttt atattttggt aacaattact 3420 caaaataaat taaaagaatg tctattatta agtatagcaa aaaaaaattt aaatctgaca 3480 atattactgt gttcagagtg cctttaaaaa aagatgtggc cattaattgg tcagtttgcc 3540 c 3541 // ID ALSAT2 repbase; DNA; INV; 603 BP. XX AC K00079; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE A.lumbricoides eliminated chromatin satellite. XX KW SAT; Satellite; Simple Repeat; ALSAT2; ASSAT2; KW Repetitive sequence; Satellite repetitive element. XX OS Ascaris lumbricoides OC Eukaryota; Metazoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. XX RN [1] RP 1-603 RA Mueller F., Walker P., Aeby P., Neuhaus H., Felder H., Back E. RA and Tobler H.; RT "Nucleotide sequence of satellite DNA contained in the eliminated RT genome of Ascaris lumbricoides."; RL Nucleic Acids Res 10(23), 7493-7510 (1982). XX DR GenBank; K00079; Positions 1 603. XX SQ Sequence 603 BP; 173 A; 95 C; 117 G; 218 T; 0 other; ggcctttttt gtgcattatt tctgcaattg ggatttattc gatgatcaat tataaaggaa 60 taattaatca ttctgctgtc aaatttgatg cagttcagcc gaacttgtac cacgataaaa 120 cgcatttttt gtgcattatt gctgcaattc ggatttattc gatgatcaat tataaaggaa 180 taattaatca ttttgctgtc aaattcgatg cagttcagcc gaacttctac cacgataaag 240 ggcatttttt gtgcattacc gctgcaattg ggatttattc gatgatcaat tgcatgggaa 300 taattgatca ttttgatgtc aaatttgatg cagttgagcc gaacttgtac cacgatcaag 360 ggcatttctt gtgcattatt gctgcaattg ggatttattc gatgatcaat tataaaggaa 420 taattaatca ttttgctgtc aaattcgatg cagttcaacc gaacttgtac cacgataaag 480 ggcatttttt gtgcattgtt gctgcaattg ggatttattc gatgatcgat tgcatgggaa 540 taattaatca ttttgatgtc aaatttgatg catttcagcc gaacttgtat tacgataaag 600 gcc 603 // ID BEL-23_AA-LTR repbase; DNA; INV; 547 BP. XX AC supercont1.344; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-23_AA_; KW BEL-23_AA-I; BEL-23_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-547 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.344; Positions 453117 452571. XX SQ Sequence 547 BP; 145 A; 153 C; 108 G; 141 T; 0 other; tgttccgtcc caacgtcaca agaagcaatt tgagaagcgg tggcagcagg aaacagcgga 60 atattccatc gtcacccttg catttcgcta cccggtttgc ggcatcagga tagggagcgg 120 aaaatacccc actgttgcca tattgtcgca tcccctttga gcttattcgt ttcaccctcc 180 atccagcctg ctatgtacag cgaacgcacc accacaaccg tccccatcgt tgagcgcacc 240 ccattgcagc atcgttcagc aatacataag agagtggaag tttccatcgc aacccaccgc 300 tgcgcatcca atgttttgca tctacctgta acatatataa gtggcttgca tgcgcaagca 360 attcattcag tttattttca acttttgtgc gatcatcaag tgaagaatat agtacagtta 420 aagttccaaa ccgcctttct ctcctccgga agccaaaaca gtctttttca agacttgagt 480 tgttaagagt ccattcctgt ggagccatcg gtccttaacc aagcaaatct agttgcaagc 540 cgcaaca 547 // ID BEL-10_DWil-LTR repbase; DNA; INV; 299 BP. XX AC scaffold_181117; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_DWil_; KW BEL-10_DWil-I; BEL-10_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-299 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181117; Positions 194922 194624. XX SQ Sequence 299 BP; 88 A; 77 C; 66 G; 68 T; 0 other; tgttatgccc aaccgggcaa cacaagggtt aaaggaggcg gaagctttca cctcgcgccg 60 ctctcccgat gatgcccatc gctgtcccca caaaagaatt ccccacactt cggtccaagc 120 ggggcttggt gccctgctct taagtaagct agttttagtt gaattatata cctgaatata 180 cgtacaagtg aagtgaaatt tctctgcgtg tcaacaattt actgggagga aagaagaaaa 240 agggaaaatt catctagctg ccccaaacca gaaggaagtt cgacccccct acataatca 299 // ID BEL-148_AA-LTR repbase; DNA; INV; 510 BP. XX AC supercont1.155; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-148_AA_; KW BEL-148_AA-I; BEL-148_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-510 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.155; Positions 744349 744858. XX SQ Sequence 510 BP; 175 A; 86 C; 104 G; 145 T; 0 other; tgtcatatgg agcatagatc gttcaaaacg tgttatgagc taggaagata tttggacaac 60 taaagtgaat tctattataa attatactac aagtggcaaa ttagagtttg gggagcattt 120 atcggccgac aggtttgaca ataagctaaa atgtaattat actacaacta aatgtaacat 180 atttagcagt tggagcgtta cacggctagc tatacggatt actggactaa acctgatccg 240 actagtacgt aattggaact gtgagtagat ctgaaattac ttaaggcctc aattgacatc 300 gttgttatta tagtacagat cggttggagt cagcactacg ttaggaagtt caccagaatt 360 gtaggaacaa acgtaaacgt aagatgaact actatataaa gaaccattag agataatcca 420 tcgcttaaaa taaactttca gctttaaagc tgcgctgact acaggctact aaaaaggcgt 480 ttgcattctc ctggtagtga cttcccaaca 510 // ID Copia-22_AA-LTR repbase; DNA; INV; 153 BP. XX AC supercont1.65; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_AA_; KW Copia-22_AA-I; Copia-22_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-153 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.65; Positions 754512 754664. XX SQ Sequence 153 BP; 45 A; 33 C; 21 G; 54 T; 0 other; tgaagagcaa actattcact gggtaaaacc catgaaaaat attcgtagtt caactttacc 60 gtgtatcttt tgacaaataa aattcattct gttcttttat actcgaccgt aacacacgtg 120 tttgttttta ctctcgtcgc gtatcacatt cca 153 // ID BEL-3-I_NVi repbase; DNA; INV; 5597 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.04, Created) DT 21-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia vitripennis: internal portion - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5597 RA Bao W. and Jurka J.; RT "BEL type LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 746-746 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(404..1612,1616..5482) FT /product="BEL-3-I_NVi_1p" FT /translation="MDALVVQWNNQTEQLRDLLTDLSIVIKSKSAVLHECK FT ARMEEIVMHYKTLGNLFYKLDAVHKEAEREKFNDLKNFYFKVTAYIRSAED FT KQSKTLGSDTTVPLGHSTLLDISRFSGLPTIDLPKFDGRLEKWNAFKTTFR FT NEMVRLNIDNITAFHYLKKSLVGAAANSLHILDPSDGSYDEAWRLLGTQYE FT HVRLIAASHIDAILYYPVITEVSHRSLKDMETSVRQHLADLKSLKVVPSDY FT FVIRTLERALPTNIKDKWIKRMSIDAIPDLESFFRFLQETSNYLYAQEGES FT KDAKASAKRTVAQSEKGTKFRRGNGQARALVTSSSISCYYCSGEHTIYRCS FT SFAALSVQQRWDAVRTKKLCRKCLRTHEGKCESRNCKKCDKDHNTLLHEER FT KTDSSGKKPLGVGVLSTVSRALQSQLMMSAVVLIRDADGAYIEARALLDTC FT ANANFMTADLAKRLRLPTRPCSIPIGAINDLRTTTKHCVDAQFKALHSDFN FT KRLTFLTVPNITSVTPEEPFPRDSIRLPKNIRLADPQFHLPRQVDILIGSG FT ATLSMLSIGQVNLSREGSELYLQKTQLGWVIVGGIDTDKISTVSCHLSQLE FT KQLVQFWSLEKCSSPVEQADDRLECEKWYVDTTVRLSDGRYMVRLPFRKKD FT PVLGDSRAQALKRFRGLERRCSRSSGMMEEFSRVFQEYLDLGHMSLVVDEN FT DSEYYLPFLIVVKILSLTTKFRVVFDASAKSSLGVSLNDLLHVGPTIQDKL FT YVHLVRFRSHRYVLTADIEKMYRQILIHPEDRRYQRIFWYRDGKLQVYELN FT TVTFGVSSAPYLAIRTIHRLADDESAQYPRAAEILKRDMYVDDLLTGSDSL FT DEIMKARDEIITVLKHGGFNIRQWASNHNHALDNLDEKVLSLAPDGEDAVL FT KTLGVSWASKRDELAIVVKPVEIPPNVTKRVILSEIAKIFDPVGLVGPVGL FT HAKWIMQECWKEKVSWDESVPQSLFTLWMDFASQLSSLSSVSAPRHVVCVN FT PSRIEIHGFSDASKKGYGACLYARSTDKDEKITVRLICSKSRVAPLKEQTI FT PRLELCGAVILVRLLLETLPALEFSIDRIVLWSDSTIVLHWLRKSPQDLKL FT FEANRVREIQELGERASWRYVPTQHNPADALSRGQLPKEFLRNETWFTGPT FT WLRQTEASWPANIVESSPDIPGLKKGVCLFVDSAKDRMFLSRFSKHRMMLN FT VVAICLRWLPSGRQYRGEPITVAEKRDAETRVFRSIQQDAFSDSIACLTKG FT IQVKGSHIAALNPCLDANGVVRVGGRLKNADIPIASKHPILLPSRNHVTDL FT IIREAHESNYHSGVQSTLFYLRQRFWLIDAKNQVRHVIRECVTCIRHKPPP FT IHCKMADLPTARVTESHVFSHVGVDFFGPLSIKEKKRYNRTALKAYGCVFV FT CMATKAVTIEITSDLTTEGFLGAFARFVGRRGIPQHVYSDNGTNFVGANNQ FT LRELFALLNSEEFKSKVNAKALSLDIQWHFNPPLSPHFGGLWEAAVKSFKH FT HLKRVVGGQLLTFEELYTLVIEIEAILNSRPLWSISADPNDPIALTPAHIL FT IGRPITVLPSADLTSIPDNRLSIWKFITKARQDFWKRWHLEYLHELQQRQK FT WHDSTGELRKGMVVILIDKNQPCTQWQLAVVQEVHPGTDGLARVATVRTSR FT GTLKRNITQLCPLPTSSTEEAE*" XX SQ Sequence 5597 BP; 1437 A; 1346 C; 1382 G; 1432 T; 0 other; tggtgcctcc tgtgaggtgt ggatcgacag ttcggccagt cagctaagga caaagggttc 60 atcaaggttc cccaggtcat cgcatagacg tcttcccctg gggtcacctc tccagctgcc 120 gcttcaaggt caacgcattg tcaacgcaac agaaaagata agtagcgttt atctttttca 180 aagtgactct acgcgtgtgt gaatttcgaa cacagcgcgg cgtttgtttg acttcgcgcg 240 taaacacgag gtcgcgcgtt gcggcgagcg gcaaaccgtt gttgtgccgg acgagcgaat 300 tccgagaatc tgagcgcgtc gagacacgtg ctcgcgcttc agtccgtata agggatttcg 360 gtatttatta tttagcgtgc gtattcactg actactaggc aacatggacg ctttagtggt 420 acagtggaac aaccagaccg agcaacttag ggatttattg acagacctgt cgatagttat 480 taagtcgaaa tctgctgttc ttcacgagtg taaagcgcgc atggaggaaa tcgtcatgca 540 ctacaagact ctcggtaact tattctataa attagatgca gtccacaagg aagccgagag 600 ggagaagttc aatgacttga agaatttcta cttcaaggta accgcgtaca tccgatccgc 660 cgaggataag cagagcaaga ctctcggtag cgatactacc gttccgttag ggcattcgac 720 tcttctagac atctctcgat tttctgggtt gccgactatt gacttgccga agtttgatgg 780 tcgtctagaa aaatggaacg cgttcaaaac gactttcagg aacgagatgg tgcgattgaa 840 tatcgacaac atcacagcgt tccactatct aaaaaagtcg ttagtcggag cagccgctaa 900 ttcgctccac atcttggatc caagcgatgg cagctacgat gaagcctggc gacttttggg 960 cacgcagtac gagcacgttc ggctcattgc tgcttctcac attgacgcaa ttttatatta 1020 tcctgtcatc acggaggtct cccatcgatc gctaaaggac atggaaactt cggtccggca 1080 gcatctggca gacttaaaat ctctcaaggt cgttccttcc gattacttcg tgatccgcac 1140 tctcgaacga gcattgccca ctaacatcaa ggacaaatgg atcaaaagga tgtccatcga 1200 tgcaattccc gatctcgagt cgtttttcag atttttgcag gaaactagca attatctata 1260 tgcccaggaa ggggaatcca aggacgccaa ggcttcagcc aagcggacgg tggcgcagag 1320 cgagaaaggg accaagttcc ggcgaggtaa cgggcaagct cgagcgcttg taacgagctc 1380 gtccatttcc tgctactatt gtagcggcga gcatacgatc tatcggtgtt cgtcgtttgc 1440 tgcgttgtcc gtccagcaac gatgggatgc tgtgaggacg aaaaaactat gtaggaagtg 1500 cttgcgaacg cacgaaggca aatgcgaatc ccgcaattgc aaaaaatgcg acaaggatca 1560 caatacgctc ttgcacgagg aacgaaagac agattcctca ggcaagaagc cgtgactagg 1620 cgtaggagtc cttagcaccg tgtccagggc tctccagtct cagctcatga tgagtgctgt 1680 tgttttgatt agagatgccg atggggcgta catagaggct cgcgcgctcc tggacacgtg 1740 tgcaaatgcg aactttatga ccgcggattt agctaaacga ttgcgtctcc ctactaggcc 1800 ttgctcgatt ccgatcggag caattaacga cttgcgaact accacgaaac attgcgtcga 1860 tgcccaattt aaagcgttac attcggactt taataagcgt ttaactttcc tcaccgttcc 1920 caacataact agcgtcactc cggaggaacc gtttcctcgc gattcgatca ggctacccaa 1980 gaacatcagg ttagcggacc ctcaattcca tctccctcga caagtcgaca ttttaattgg 2040 ttctggagca acgttgtcca tgctttctat aggtcaagtt aacttgtcta gggaaggaag 2100 cgagttatat cttcagaaga ctcagctagg atgggtgatc gttggtggta ttgacaccga 2160 caagatttca acggtgtcgt gtcatctttc tcagttagaa aagcagctgg ttcaattttg 2220 gtctctggaa aaatgctcca gtccggttga gcaggctgac gacaggttag aatgcgagaa 2280 gtggtatgtc gacaccacgg ttcgactgtc tgacggacgg tacatggtga gattgccgtt 2340 tcggaaaaag gacccggtac taggggactc gagggcgcaa gctctcaagc ggttccgggg 2400 attggaacgg cgatgcagcc gttcgtccgg catgatggaa gaattcagtc gtgtctttca 2460 ggaatatctc gatttgggtc atatgtcttt ggtggtcgac gagaatgact cggaatatta 2520 tcttccattc ttgattgtcg ttaagatttt gagccttact accaagtttc gagtcgtttt 2580 cgacgcatcc gctaagtcgt cgttaggtgt gtctttaaac gatctgttgc acgttggacc 2640 tacgattcag gacaaacttt acgttcatct agtaaggttt cgatcgcata ggtacgttct 2700 cactgctgac attgaaaaga tgtatcggca gatacttatc catcccgagg atcgccgata 2760 ccaacgcatt ttctggtatc gcgacgggaa actacaggtg tacgagttga acacagttac 2820 gtttggcgta tcttctgctc catatttagc gattcgaact attcataggc tcgcggacga 2880 cgaaagcgcg caataccccc gagcggccga gattctcaag agggatatgt atgtagatga 2940 tctgctgaca ggctcagatt ctttagatga gattatgaag gcgcgtgatg agataatcac 3000 cgtcttaaag catggtggat ttaacatccg gcaatgggcc tctaaccata accatgcatt 3060 agataattta gatgagaagg ttctgagctt ggcaccagat ggcgaggacg cggttttaaa 3120 aacattaggc gtttcctggg cctctaaacg cgacgaattg gctatcgtag tcaagcccgt 3180 agaaatcccg cctaacgtaa ctaaacgcgt aatcttgtcg gagattgcga agattttcga 3240 cccagtaggt ttagtaggac cggtaggttt gcacgcgaag tggatcatgc aggagtgttg 3300 gaaggagaaa gtttcttggg acgaatcagt tccacagagt ttgtttacgc tttggatgga 3360 cttcgcgagt cagttgtcgt ctctctccag cgtttctgca ccacgccatg tagtttgcgt 3420 taatccctcg cgcatagaaa ttcacggttt cagcgatgcg agcaaaaagg gctacggcgc 3480 ttgcctgtac gctaggtcca ccgataaaga cgaaaagatc actgtacgat tgatctgctc 3540 caaatctaga gtcgcccctt tgaaagagca gactattccc cgtttagaat tatgcggagc 3600 cgtgatcctt gtaagacttc ttttagaaac tcttccagct ttagaatttt ccattgaccg 3660 gatcgtttta tggtccgatt ccactatagt cttacattgg ttacggaaat ccccgcaaga 3720 tttgaagcta ttcgaagcta acagagtacg cgaaattcaa gaattaggag agcgcgccag 3780 ttggcgatac gtccctaccc aacacaaccc agctgacgct ttatcccgcg ggcaactccc 3840 gaaagaattc ctgcgaaacg aaacgtggtt taccgggcct acctggctac ggcaaacgga 3900 ggcttcatgg cccgcgaaca tcgtcgaatc gtctccggat atacccggat tgaaaaaagg 3960 agtctgtctc ttcgtcgaca gtgctaagga tcgcatgttc ctatcgcgtt tttctaagca 4020 ccgcatgatg cttaatgtcg ttgcaatatg cttacgatgg cttccttcgg gccgacagta 4080 ccgaggagaa cccatcacgg tcgcggagaa acgcgatgct gaaacccgcg tgtttcgaag 4140 cattcagcag gacgcgtttt ccgattcaat tgcatgcttg acgaagggta ttcaggtgaa 4200 ggggtcgcat attgcggcac ttaacccgtg tttggacgcg aatggagttg tacgtgtggg 4260 aggacgtctg aagaatgcag atatccctat agctagtaag caccccatac tattgccttc 4320 aaggaatcat gtcacggatt tgatcatccg tgaagctcac gagtcgaact accactccgg 4380 tgttcaaagc actcttttct atttgcgaca acgtttttgg ttgattgacg caaaaaatca 4440 ggttcgtcat gtcatccgcg agtgtgtcac gtgtattaga cacaagcccc ctccaattca 4500 ctgcaagatg gcagatctcc cgaccgctag ggtcaccgaa tctcacgttt tttctcacgt 4560 aggcgtagat ttctttggtc ccctttccat taaagagaaa aagcgatata atcgtacagc 4620 cctcaaggcg tatgggtgcg ttttcgtatg catggccaca aaagcggtga ccatagagat 4680 aaccagcgat ttgaccaccg aaggtttcct aggcgcgttc gctagattcg taggacgtag 4740 aggtattccc cagcatgtat actccgacaa cgggaccaat ttcgttgggg ccaacaacca 4800 gcttagggaa cttttcgcac tcctcaattc agaagagttc aaatctaagg tgaatgccaa 4860 agctctttcg ttagatattc aatggcattt taacccgccc ttatctccgc attttggtgg 4920 tttatgggag gccgcggtaa aatcatttaa gcatcatctc aaacgcgtag tgggtgggca 4980 acttctcaca ttcgaagagc tctatactct cgtgatagag atagaagcaa ttttaaattc 5040 tcgacccctc tggtcaatct ctgccgaccc caatgaccca atagcgttaa ccccagctca 5100 tattctgata ggtaggccta ttacagttct gccaagcgct gatttaacat ctattccaga 5160 taatcgactg tccatttgga agttcatcac caaagcccga caggacttct ggaagcggtg 5220 gcatttggaa tatctccacg agctccagca gcgtcaaaaa tggcatgact ctactggaga 5280 actacggaaa ggaatggtcg tcatcctcat cgataaaaat cagccttgta cgcagtggca 5340 gctggcagtc gttcaagagg tgcacccggg aaccgacgga ctcgctcgag tcgccaccgt 5400 tcgaacttca cgagggactc tgaagcggaa catcactcag ctgtgccctc ttcccacgtc 5460 ttcaacggaa gaagctgaat gaattatata tatttacact cacatacata cacactcaat 5520 cacatgtaca accgtcgaat gtacaccctc tcgaacgatg atttcgtttg ctcgtgttac 5580 tcgcaacggg gggagaa 5597 // ID MuDr-1x_TCa repbase; DNA; INV; 3123 BP. XX AC . XX DT 10-MAR-2008 (Rel. 13.03, Created) DT 10-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE A distinct, diverged MuDr-type family. XX KW MuDR; DNA transposon; Transposable Element; MuDr-1x_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-3123 RA Jurka J.; RT "Highly diverged MuDR-type families."; RL Repbase Reports 8(3), 239-239 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS join(659..1237,1245..1754,1735..2370) FT /product="MuDr-1x_TCa_1p" FT /translation="MKHDTKSKYVAERGSQGGKDGTTIIKYCCHRSGKYVA FT RGKGLRHLKTQGSNKIDGFCPARLKVIIDKNNQCKVSFIKQHVGHDEEDLG FT HLFLTLQERKCLANKMALKIPFPQILDEIRDSVVGSQLDRIHLLTRKDLSN FT IERSYHLQSSVVRHESDAVSVDAWVKQRESSVVFSSTNPKVHFAQFIFILV FT YAFGTQSESNSDLREEDFVLIIMNLGQEEILKKYGSDCICIDGIHGLNQYD FT FEMHTLLVIDDVREGFPCAFLISNRSDETVLKIFFHHIKERIGFQLTSKVF FT MSDMAEAYYKAWNFIMGPAKYRLVKRALSNYFYYNICNRLFCTWHVDRSWR FT KNLSKIKTKEKQVSNAIQLQVTQYSCNEVCIFTSSRMHQVMVYKYLRTLLQ FT ERDENAFLRMLNDFIRTITNDPETNEFSEYFKNNYINNRHCWAYCYRLHSG FT LNTNMHIERMHRTIKYIYFGGRKVKRLDKALHEIEKFIRDRLFNRLTVLFK FT GKLSHKLNELRTRHVTSQKLSTALLVSNGAGWKIPSNNSFEVYELEEKEGG FT IACNNCKLMCMDCGVCIHKYTCSCVDPV" XX SQ Sequence 3123 BP; 1034 A; 578 C; 611 G; 898 T; 2 other; actagcggct aggccccgcc ggtctacaaa tgttcgctca gtcgaaatgt tgcgagcgag 60 gtcggaacag tacgaacgga acggagtaac ggaccataac ggacttctca tttcagctgg 120 gctggttaag tctgtgttaa aagttctaaa gtggtcttga aacttgaaat gttaaattgc 180 attgacttgc atcggcagtt ttgaatttca aaagatgtca gggaaagata tcaaatgttg 240 tgtgtgccaa aaacagttta gtgcagtttt taatttggca aaacacgcgc ggaagtctta 300 ttcacgtgtg tgcatttttt aacttcttaa actaaccata actaacttct aatttgtatt 360 taatttgcag aacaagcagc tgatatcatc ccaaagaaga gttacaaata ccagtgttta 420 caatgtaaca aaaatttttc cctgttaaaa aattttaatt accataagaa gacgcataac 480 caatcgcata acccaccttc aaagaacaca tatgatgctg tgtgtccgat ttgccagttg 540 aaggcaccaa aagataattt gatattgcat tcattttaaa gtagagcatg atataaaatg 600 tgaaacacta gcaatagaat ttactacctt tgaggaattc tcaaaatgga aatcagatat 660 gaaacatgat accaaaagta aatatgttgc agagcgggga tctcaaggtg gtaaagatgg 720 gaccacaata atcaagtact gttgtcatcg ttctgggaaa tatgttgctc gcgggaaagg 780 gttaagacat ttaaaaacac aaggttcaaa taaaattgac ggtttctgcc ctgctcgttt 840 aaaagtaatc atcgacaaga acaaccaatg taaagtgtct tttatcaaac aacacgtggg 900 tcacgatgaa gaagatttgg ggcatttatt cctcaccctt caggagagaa aatgtttggc 960 aaacaaaatg gcattaaaaa ttcctttccc ccaaattcta gacgagatac gagactcagt 1020 ggtgggatca caattagatc ggattcactt actgaccaga aaagatctga gcaacattga 1080 aaggagctac cacttacaat catccgtcgt tcggcatgag agtgatgccg tcagcgttga 1140 tgcctgggta aaacaaaggg aaagttctgt agtattctct tctacaaacc ccaaggtaca 1200 ctttgctcag tttattttta ttcttgttta tgcattttaa ttaaggaacc caatctgaaa 1260 gcaattccga tttacgagaa gaggattttg tactaattat tatgaacctg ggtcaagaag 1320 aaattcttaa gaagtacggt tcagactgca tttgcattga tggcatccat ggcttgaacc 1380 aatacgattt cgaaatgcac actttgttag tgattgatga tgtacgggaa ggatttccat 1440 gcgcattttt aatttccaat cggtcagatg aaactgtact aaaaattttt ttccatcata 1500 tcaaagaaag aattggcttc caacttacta gtaaagtttt tatgtctgat atggcagaag 1560 cctattacaa agcttggaac ttcataatgg gccctgcaaa gtataggtta gtaaagagag 1620 cattatccaa ctatttttat tacaatattt gtaacaggct tttttgtaca tggcatgtgg 1680 atcggagttg gaggaaaaac ctttccaaaa ttaaaaccaa ggagaaacaa gtgagtaacg 1740 caatacagtt gcaatgaagt ttgtatcttt acttcttctc gtatgcatca ggttatggtt 1800 tacaagtacc tacggacttt attacaagag agagatgaaa acgcattctt gcgtatgctg 1860 aatgacttca ttaggactat caccaatgac ccagagacta acgagttttc tgagtacttc 1920 aaaaataatt acattaacaa tcggcattgc tgggcatatt gttatcgttt acactccggg 1980 ctaaacacca acatgcacat agaacgcatg cacagaacca tcaaatatat ctattttggt 2040 ggacgaaaag tcaagcgttt ggacaaggca ttacatgaga tagaaaaatt tataagggat 2100 cgtttattca accgccttac agtcctgttt aaaggtaaat tgtcacacaa attgaatgag 2160 cttcgcacac gccatgtcac tagccaaaag ctttctacgg ccttgcttgt atcaaacggt 2220 gctggatgga aaattccttc aaataactct tttgaagtgt acgagcttga agaaaaggag 2280 ggtggtattg catgtaataa ctgcaagttg atgtgcatgg attgtggtgt ttgcatacat 2340 aagtacactt gtagctgtgt ggatcctgta taaaattcac atgtgcaaca catacaccta 2400 ttgtgtagat atstmaaaca cagacacttc ctacccaaag agttgaaaag cagtttattg 2460 ccaaagaaag gcagagactg ggtgaaactt ttattcaact gctagaaggt gcgacttgca 2520 tggaacaact tgaagctgcg aaaaacatgt gtatagctct agaacccaca ctgcaagcaa 2580 tagcaacgaa aagggagcta acaacaccgt ctacacgtgc aagtcctagc aacaaaaaaa 2640 ttacatctca acgaggtgta ttgcattcca ccaagaaaaa aaacaagaga ccatcaatga 2700 ctctagttgc gcctacaaat gaagagatgc aagcaatcgc gatacgaacg ttgtccaaat 2760 aattttacta ctgaacaata attaacttta ataaatactt acgcattagc tttaaattat 2820 tttattgtac ccttttccac tgagtagact taaacgcagt cttgtcggtt gtagcacata 2880 atgcttctcc aataataaat caaaaattaa tgtatacaca actgtaaatt ttgtcttcat 2940 gcctaaattt tatggtacag ccactgatct gtgaatttat gtatttacta ttctcggatt 3000 cttttgaaga tgaaatttat aatagtaatt acccgctttg gccaatattc gggctgttcc 3060 gactccgctc gcaacatttc gacagagcgg acatttgtag accggcgggg cctagccgct 3120 agt 3123 // ID Gypsy-162_AA-I repbase; DNA; INV; 5335 BP. XX AC AAGE02018483; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-162_AA_; KW Gypsy-162_AA-LTR; Gypsy-162_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5335 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018483; Positions 21161 15827. XX CC Positions [2359-2859] - Reverse transcriptase CC Positions [3994-4464] - Integrase core CC 'CGTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 808..5163 FT /product="Gypsy-162_AA-I_1p" FT /translation="MAQSNINTTLEPFRRGNSFGDWVERLGFFFNMNKVPV FT EEKRDHFITLSGPVIFRELKLLYPNSNLAEVPYDEMINKLKARLDKTESDL FT VQRLKFNVRVQQPDESLEDFVLSVKLQAEFCNFENFKQMAIRDRIVAGIRD FT KALQQRLLNEEKLTLETAEKLIATWEIARNNAKNMDYGNTADQIASLKASS FT FPGARLNKLAATMHMAGRSDGDDERNRGSVKNRLGYTPYQRGQWKNKQWQS FT RQRQDLEVRNRSSARPNYSQMVCDFCGVQGHIKRKCFKLKNMHRDAVNMVN FT PDTSGSNPDEYLSELVNRLRADTDSENEDDGETNVLQCMHVSTVGKISDPC FT LVNVKIDDVWCDMEVDCGSSVSVMGRKQYFEHFDKALSKSHRNLIVVNGTS FT LLIEGEANVCVTFRGISKDLKLLVLNCENNFVPLLGRPWLDVFFPKWRNFF FT VNAISSGIEPIQDTVDEIKTKYKNIFAKSFTSPIKGFEADLVLKTENPIFR FT KAYDVPYRLRDKVLEYLKKLEEEAVITPVKTSEWASPVIVVMKKNNQIRLV FT IDCKVSVNKLIIPNTYPLPTAQDVFAGLAGCKIFCSLDLEGAYTQLSLSEK FT SKKIMVINTIKGLYTYNRLPQGASSSAAIFQQVMDQVLNGIENVSVYLDDV FT LIAGKDFDDCRQKLFIVLDRLKCANIKVNWEKCKLFVNELIHLGHIISEKG FT LMPCPDKISTIEKAKVPRNETELKSFLGLINYYHKFIPRLSTKLYHLYNLL FT KSGVKFTWDDNCNEAFEESKNALLEAPFLEFYDPNKPIVIVSDASGYGLGG FT VMAHIIDGVEKPIYFTSFSLNAAQRNYPILHLEALALVCTIKKFHKFVFGK FT KFYVYTDHKPLVGIFGKEGKNSIYATRLQRFVLELSIYEFEIQYRPSKRMG FT NADFCSRFPVDMAVPDEYDVGLINSINFSRQFPIDFCIIAAQTKEDEFLKN FT VIYFMTNGWPEKISKQYIDVYANQKDLELIDECLLYQNRVVIPMSLKKQVL FT KLLHANHAGIVKMKRLARQNVYWFGINSDIENYVAVCDTCNSMMIVPKLKT FT KSKWIATTRPFSRVHIDFFHFDHRTYLLVVDSYSKWIEVELMKNGTDCGKV FT LKKLVGIFARYGLPDVLVSDGGPPFNSHAFVDFLKRQGINVLKSPPYNPSS FT NGQAERLVRTVKEVLKKFLLDPEFIDLDLDDQINLFLINYRNNCLTVEGNY FT PAQMIFSYKPKTILDLLNPKSHYKKQLCVEPMHDKLTNNKDTNLVHDDNSK FT DVGNANACIPDPFENLMQGDEIWYKNHNPHHTARWLKASYIKRHSRNTFQV FT LIGNVPTTAHRGQLRIYKEGESFEKPNVLLRRQQRETHKLKDDEEFRGFSD FT EEVRSSKPRLDGFGELNKKRNVSLRRQQLEPEVDRGEPYSAPEDVRCRRKR FT RIVVDEALQDVEMSTDCVPRRSKRIRQANMDKDFVYMI" XX SQ Sequence 5335 BP; 1776 A; 841 C; 1139 G; 1579 T; 0 other; gttggcgacg aaaaaaagtg gttttggaca agttagtgaa gtgtttcact gcacttagtg 60 aggaagatta attaaatcgg ctaccttgtg acggatcgac gccatcgcgt aggaatttca 120 atcaaactta tcaaatcgct ggaggcaatt tacgttcggc gttgttggac aggctataat 180 cactgatcac tgatttcggt tgtgcagtaa ccgtgaataa gaattacggt tctggaaggg 240 tgtggtaagt tttgagcaga aattttagcg cgttagctga aaccaccatt ttgtatggta 300 atcaaagaaa gatccggaaa gtataagagc tattgttcct gtataataag attaaaataa 360 taagttcaga atataaaaga aatcgtacaa cacaatagct ttattgtaga tgctttatta 420 agagagtttt tcttctattt attagatctc atagtgattg tgacctgaca aacaattgat 480 catcattggt gtaggcttta cacagtggtc ccggtgtgtt tttttgctga ttggctagct 540 gctgtggtca agcagttacg gcagatcgaa aggttgtaag cggagatttc cttcccaggt 600 agcggtttcc attgacgtca actgacggct cgtgtgttag ccaaagggct gcgagtagtg 660 acctacacaa cgtcggcaaa ggctgaaaat cattctggtg agtttttact tttctttact 720 cttgagggaa aagatccaga gaatccggca gttaagcaaa gtcgtttatt ttagcttttg 780 gatattttta ctgaccaata tactgtaatg gctcaatcga acattaacac taccttggaa 840 ccattccgta gaggaaattc ctttggagat tgggttgagc gtctaggttt cttttttaat 900 atgaataaag tacctgtcga ggaaaagcgt gatcatttca ttacgctcag tggtcccgtt 960 atttttaggg aactaaaatt attgtaccca aatagtaatc ttgcggaggt tccatatgat 1020 gaaatgatta acaaattaaa agcacgacta gataaaacgg agtccgatct ggtgcaaaga 1080 ttaaaattta acgtcagagt gcagcaaccg gatgaatcat tagaggattt cgtgttgtct 1140 gtaaaactcc aggcagagtt ctgcaatttt gaaaatttta aacagatggc tatcagagac 1200 cgcattgttg ccgggatccg ggacaaagct ctccagcaaa ggttgcttaa tgaagagaag 1260 ctaactttag aaacagcaga aaaactaata gcgacctggg aaattgccag gaataatgct 1320 aaaaatatgg attacggtaa cactgcggat cagatagcat cattaaaagc atctagtttc 1380 cctggcgcta gactcaataa gctagcagct acaatgcata tggcgggccg gtcagatggt 1440 gatgatgagc gaaatcgtgg atcagttaag aataggctag ggtatactcc ttatcaacga 1500 ggccaatgga agaacaaaca atggcaaagc agacaaagac aagacctgga agtaaggaat 1560 cgctcatcgg cccgacctaa ttattcacaa atggtttgtg acttctgtgg ggtgcaaggt 1620 cacattaaaa gaaaatgctt taaactaaaa aatatgcatc gagatgctgt gaatatggtt 1680 aacccggaca cctcaggatc taacccggat gaatatttga gtgaactggt caacagatta 1740 cgagcggaca ctgacagtga aaatgaggat gatggtgaga caaatgttct tcaatgtatg 1800 cacgtttcaa cagttggtaa aattagtgat ccttgtctgg tgaatgtaaa aatagatgat 1860 gtttggtgtg acatggaggt ggattgtggt tcgtctgtat ctgtgatggg aagaaaacag 1920 tattttgaac attttgacaa agctttatca aaaagccata gaaatttgat agtagtaaac 1980 gggacaagtc ttctaattga aggtgaagct aatgtttgtg taacttttag gggaatttca 2040 aaagatttga agcttctggt gctaaattgt gaaaacaatt ttgttccttt attgggtaga 2100 ccttggcttg acgtattctt ccccaaatgg agaaattttt tcgtcaatgc aatttcttct 2160 ggaattgaac caatccaaga tacggtagat gaaataaaaa ccaaatataa aaatattttt 2220 gcaaaaagtt ttacttctcc aatcaaaggt tttgaagcag atttagtcct gaaaactgaa 2280 aatccgatat ttaggaaggc atatgatgta ccctaccgtt tgcgggataa agttttagaa 2340 tatttgaaga aattagagga agaggcggta attacccctg tcaaaacaag tgaatgggcc 2400 tccccagtaa ttgttgtaat gaaaaaaaat aatcaaattc gattagtaat agattgcaaa 2460 gtttcagtaa ataaattgat cattcctaat acataccctc tacccaccgc ccaagatgtt 2520 tttgctggat tggctggttg taaaatattt tgttcattag atctagaggg agcctatacg 2580 cagttatctt tgtcggagaa atcaaaaaaa attatggtaa ttaacacaat aaaaggactt 2640 tacacgtata acagattgcc ccaaggagct tcgtccagtg cagcaatatt tcaacaagtg 2700 atggaccagg ttcttaatgg aattgagaat gtgtcggtat acctagatga tgttttgata 2760 gcaggaaaag attttgatga ctgtagacaa aaattgttta ttgtacttga taggttaaaa 2820 tgtgcaaata tcaaagtaaa ttgggaaaaa tgtaaacttt ttgttaatga attgattcat 2880 ttgggacaca ttattagtga aaaaggatta atgccatgcc cagataaaat atctactatt 2940 gaaaaagcta aagtacctcg caatgaaaca gaattgaaat cttttctagg actgattaat 3000 tattatcata aatttattcc aagattatcc actaaactgt accatttgta taatttattg 3060 aagtcgggtg ttaaatttac ttgggatgat aactgtaatg aagcttttga agaaagtaaa 3120 aatgcattat tagaggcacc ctttttggag ttttatgacc ccaataaacc cattgtgata 3180 gtttcagacg cgtcggggta cggcttgggg ggtgttatgg cacatataat tgatggggtt 3240 gaaaaaccga tttacttcac atccttttct ttaaatgcag ctcaacgaaa ctatcctatt 3300 cttcatttgg aagcactggc tttagtttgt acgattaaaa aattccacaa atttgtattt 3360 ggtaaaaaat tttatgttta cactgatcat aaaccactag taggtatttt tggcaaggag 3420 ggaaaaaact ctatttatgc tacaagattg caaagatttg tactggaact ttcaatttac 3480 gaattcgaga ttcaatatag accttcaaag cgaatgggca acgcagattt ctgctcacgt 3540 ttccctgtag atatggctgt tccggacgaa tatgatgtag gattaataaa cagtattaat 3600 tttagtagac aatttcctat tgatttttgt attatagcag ctcagacaaa agaagacgaa 3660 ttcttgaaaa acgtaatata ttttatgaca aacggctggc ctgagaaaat aagcaaacag 3720 tatattgatg tttacgccaa ccagaaagat ttagaactga ttgatgaatg tcttttgtat 3780 caaaataggg tagtgatacc tatgtcgtta aagaaacagg ttttaaaatt attgcatgcc 3840 aaccatgctg ggatagttaa aatgaaaagg ttagcaagac agaacgtgta ttggtttgga 3900 attaactctg atattgaaaa ttacgtagcg gtttgtgaca cgtgcaatag tatgatgatt 3960 gttcccaaac tgaaaacgaa atctaaatgg atagctacga cacgaccgtt cagtagggta 4020 catatcgatt tttttcattt tgatcaccgc acatatttgt tggtggtcga tagctattcc 4080 aagtggattg aagttgaact gatgaaaaat ggaacagatt gtgggaaagt gttaaagaaa 4140 ctagtgggca tatttgctag atatggatta ccagacgttc tagtgtctga tggaggtcct 4200 ccatttaatt ctcatgcttt tgtagatttt ttgaagcgac agggaatcaa tgttcttaag 4260 agtcctccgt ataatccctc cagtaacggt caagctgaga ggttggtaag gaccgtaaaa 4320 gaggtgctaa aaaagtttct acttgaccca gaatttatag atttagactt ggatgatcag 4380 atcaatctgt ttttgatcaa ttacagaaac aactgtctta cagtcgaggg aaattaccca 4440 gcgcaaatga tattttctta taagccaaaa acaattttag atttgttaaa tccaaaatca 4500 cattataaaa aacaattatg tgtagaaccg atgcatgata agttaactaa taataaagac 4560 acaaacctag tgcatgatga taattcgaag gatgtaggga atgcaaatgc atgtataccc 4620 gatccatttg aaaatctgat gcagggagat gagatttggt ataaaaatca taatccacat 4680 catactgcta gatggctcaa agcaagttat attaaaaggc actctcgcaa cactttccag 4740 gttctaattg gaaacgtacc aaccacagca catcgaggac aactacgcat ttataaggaa 4800 ggtgaatcct tcgaaaaacc aaatgtcctt ctgcgtcgac aacaacggga aacccataaa 4860 ctcaaagatg atgaagaatt tcgtggattt tcggatgagg aagttagaag tagcaaacct 4920 cgccttgatg gatttggaga gcttaacaaa aaacgaaacg tatcattgcg ccggcaacag 4980 ctggaaccag aagttgatcg aggggaacct tacagtgctc cagaagatgt caggtgcagg 5040 aggaaaagga gaattgtggt tgatgaagcg cttcaagatg ttgaaatgtc gactgattgc 5100 gtaccgaggc gatcaaaacg gattcgccaa gctaacatgg acaaagattt tgtttacatg 5160 atttaaattg tgattatatt cacgttctga aatttctcta ttatatacaa tgaattctga 5220 gttcaatcga agttcctatt agtaatgtta acgacttaga atataagttt cgaattattg 5280 aaatattaat caatcaaaca ctgtaatact gatagctatt tgaagaggga agagc 5335 // ID DNA8-74_AP repbase; DNA; INV; 902 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-74_AP. XX NM DNA8-74_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-902 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2010-2010 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 902 BP; 273 A; 110 C; 106 G; 413 T; 0 other; tagagttagg atgttgatga cttttgcatt ttttttcctt ttattctgta caatgctatc 60 ctagctaaaa atttgtttca tgctcccttg atgcagaata ttctaatttt attttgcatt 120 tttttgcatt tttagctata aatgcatttt ttttcgtttt ttaacgtcat ttttcaacat 180 ttttaagtca ttttttcatc ttttagggca ttttttgtac tttttataca atgtggggca 240 tttttcacaa attatagatt ttagggcaaa tgttacaccc aagtttggaa acaaatttaa 300 tttatttaaa atctaacttt ggatttggac tattagtaaa aaataatttt aatattatta 360 ttgtggggtt tttgttgtga ttacatttat cgcgttccga taagatttac tctttgagtg 420 ttatatattg atgaaacgag tgctctttca gacatacgcc gatcatacac gcttatcgag 480 tattatattt ataaattgta tggaataaat ccgataaaca agattaaatt gacaattaaa 540 ttgacacata ttaattgtac agtgcaacaa tgcaatatga atattattat aacatcaaca 600 tcctaacact aataatgaaa atatttttaa tatatttttt ttttatttat attttacttt 660 ttaatttatt tttaatattg aaatttattt ataaattatt taaatgcttc tactattaca 720 cccattatta tacccatatc tctaataaaa attattattt ataaatacgt tttttcactt 780 acttaattag tattttagat aatttttaag gtcatttttg aagtttttag ggcatttttg 840 cgctttttta tttttttagg gcatttttgt gtgtttttta ggtcatcaac atcctaactc 900 ta 902 // ID TTAA7_AP repbase; DNA; INV; 434 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA7_AP. XX NM TTAA7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-434 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1787-1787 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 434 BP; 150 A; 73 C; 72 G; 139 T; 0 other; ggggattcga taccgtgatt ttctgttttc gtctaacaca cgcgtgacat agtattttag 60 acgtgttttt tgccaaacat accaattgat ctaatgaagc gataagaatt ctgaaaacag 120 atttgaattc gtcatcaagt ctacttgaaa tcgactttcc cacatttttg atattatctc 180 ccaaattcca gaaaaagtca tataaaaaaa gagtatttga acatttttgt atttcaattt 240 ttggagaagc taaaatcaaa aaatcaaaaa tgtgggaaag tcgatttcaa gtagacttga 300 tgacgaattc aaatctgttt tcagaattct tatcgcttca ttagatcaat tggtatgttt 360 ggcaaaaaac acgtctaaaa tactatgtca cgcgtgtgtt agacgaaaac agaaaatcac 420 ggtatcgaat cccc 434 // ID CR1-2_CQ repbase; DNA; INV; 4790 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4790 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 2-2 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 152..1762 FT /product="CR1-2_CQ_1p" FT /translation="MPCHINNCQITDASHLMQCSGSCGRSYHAACAGTQRN FT YENEITNYMIPLCRDCQGVISVEVTTRAVLLQQQKLTCDIKALIDANFKAC FT NAFKQMDNKIEVVQDEYEGIATLLGEMQQQLNRLDKKIDDSTLNVCKKVSS FT ILDDVAPADNASTLVKIDALTSVTQQIGESQSIIGERLKNINTELIFLSGR FT DPGPAVNEILEEIKAISANLPAGTEQTEAHHETLADELASSSASGSPNLKD FT NSGWRTLGNKRLWKADWTDYDIRTARRKEQSKLRRKANQRKRRRQHRRSSY FT TGTDDLDEIDYELDYSFFDNNFHLEQRNDRLHRLCHNQLAQHLVKQPVNLL FT QNPQHSQSQQQQHNQQQQSQQQQHQQHIQLQQQQSQQRQHQPQQHQHQSQQ FT QQQLHQLQQPYQDQQQQQQHQQLQQPSLQQHQQQTFQHLQQPQQAHPLQQQ FT QQQQHQQQQPQPHQQHHQNLQQQLQAQSHQQQQPQFHQQHPPAQQQPQLHQ FT QQQPTQAQLIHRSAQVYSETPSWMFRPAQRSSFSETGNFMN" FT CDS 1765..4701 FT /product="CR1-2_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MTPSCEASDNLILTSPTTASNNLIEILTYCQNFNRMK FT SAAKIDTINQAISGCLYTAILGTETSWDASILSEEVFSSNFNVFRNDRNLL FT SSGKKSGGGVLVAIKTDFDSEKLDSDVFAHFEHVWVKAQIAKETHIFASVY FT FPPLSPLSSYEDFFRNAEKIISSLDPESKIHLYGDFNLGAVDFMVDAENES FT ILIPILGESDRLQXIFDKMYSLGLNQINHVKNQNNRFLDLLFTNMTEDFCV FT TEANNPLWKNEAHHTAIEFSLFVHKANNRPDGSDYEEIPDYNNANYPLIKR FT RLLEVDWKALLEKETNIELAVNTFYSIIYDILDESVPIRRKRRTIDKHPPW FT FTREVKNLKNRKQKAHKAYKSNSQKLDEYLQICSQLEVTIQFEFEKYHRKV FT ESEIKSDSKQFFNFVKSKLKSSNFPSNMSLDNKIGKNSAEICELFANFFES FT VYTSFDENDRDRSYFSFIPQANNEITVDSLSLDEIQAALKALDSSKSTGPD FT GISPKFAKNVASELASPLYWLFNLSLKSGIFPSAWKKSYLVPIFKSGKKSD FT VKNYRGIALISCIPKLFEAIVNNKMFAQLKDYITQKQHGFFKGRSTATNLL FT SFVNYTLNEMDNKNSVQTLYTDFSKAFDRVDIPMLLFKLDKIGIDSNLLSW FT LKSYLTKRTQCVKFNGLISRLINVTSGVPQGSHLGPLLFILYVNDVSFIFK FT HINVSIYADDMKLFMKIKDETDASRFQNEINIFFEWCCRSLLQLNIKKCTS FT ILFSRKINKPNINVKLDNQIVETCSNVRDLGVILDSKMTFIEHYNTMVFKA FT TNMLNFIKRFSYNFKDPYTLKTLFIAYVRSILEYCSVVWSPHIKTHETRIE FT SVQKQFLLFALRKLGWTVFPLPSYIARCMLISLETLVKRREFASLSFVNDL FT IANRINSPELLQTLNFYEPTRVLRAREPFALHFHRTDYAKHGPMNRMMETY FT NKYNNIIDFTMTKSTMKKQFYNQR" XX SQ Sequence 4790 BP; 1644 A; 1089 C; 803 G; 1248 T; 6 other; tcctwtgtwg tmagtcgtaa tcgagctgtc gtggtaatcg gtcgcatttt taattaagtc 60 cgtgtcgttc cgcgcgcttt ttgaagtgta agcataaacc gcgaattcat tcattttttt 120 gtgtcgactt aaccctctca actcgcgaaa catgccttgc catatcaata actgccaaat 180 caccgacgca tcgcacttga tgcagtgttc cggttcgtgt ggcagaagtt accatgctgc 240 gtgcgctggg acacagagga actacgagaa cgaaataact aattatatga tcccgctgtg 300 ccgagactgc cagggagtta tttcggttga agtcacaact cgcgccgttc tcctgcaaca 360 gcaaaagctg acttgcgata tmaaagctct gattgatgct aactttaagg cttgcaatgc 420 cttcaagcaa atggacaaca aaatcgaagt tgttcaagat gaatatgagg gcattgccac 480 cctccttggc gaaatgcagc agcaactcaa cagactggat aaaaaaatcg acgactctac 540 attaaacgta tgcaaaaaag tttcgtcaat tttagatgac gtagcaccag cggacaacgc 600 cagtacgttg gtcaaaatcg acgcactgac ctccgtcact cagcaaattg gtgagagtca 660 gagcattatc ggtgaaagac ttaagaatat caacacggag ttgatttttt tgtctgggcg 720 tgaccctggt cctgctgtca atgaaatcct cgaggagatt aaggcaatct cagccaatct 780 cccggctggt actgagcaaa ccgaggctca tcacgaaact ctcgctgatg agttagcatc 840 tagctcagct tcaggtagtc ccaatcttaa agacaattct ggatggcgca cactcggcaa 900 taaacggcta tggaaagctg actggactga ctacgatatc cgaacagcac gccgaaagga 960 gcaatctaaa ttgagacgta aagccaatca gcgaaaacgg cgtcggcaac ataggcgctc 1020 cagctacacc gggacagacg acctcgatga gatcgattac gagctcgatt acagcttctt 1080 tgacaacaat ttccacttag aacaacgtaa cgaccggctt caccgcctgt gccacaatca 1140 gcttgcacaa cacttagtaa aacaaccagt caacttatta caaaatccac aacattcaca 1200 atcccagcaa caacaacata atcaacaaca acaatcccag caacaacaac accaacagca 1260 tatccagctc caacaacaac aatcccagca acgtcaacac caaccccagc agcatcaaca 1320 tcaatcccaa caacagcaac aacttcatca actccagcaa ccctatcaag accaacaaca 1380 gcagcagcaa catcaacaac ttcagcaacc atctctacag caacaccaac aacaaacatt 1440 tcaacaccta cagcagccgc aacaagccca ccccctccag caacaacagc aacagcaaca 1500 tcaacaacaa caaccacaac cccatcaaca acatcatcaa aatctacagc aacaactaca 1560 agcccagtcc caccaacaac agcaaccwca atttcaccag caacacccac cggcacaaca 1620 gcaaccacaa cttcatcaac agcaacagcc cacccaagct cagctaatac atagaagcgc 1680 acaggtatac agtgaaacac ctagctggat gttccggcca gcgcaacgta gctctttttc 1740 ggaaacagga aattttatga actaatgacg ccttcatgtg aagctagcga taatttaatt 1800 ttgacaagtc caaccactgc gtctaataat ttaattgaaa ttttgactta ctgtcaaaac 1860 ttcaatagaa tgaaaagtgc cgcaaaaatc gacactatta atcaagctat cagcggatgc 1920 ctttatactg ctattcttgg cacggagacc agttgggacg ctagtatttt aagcgaagag 1980 gttttctcta gcaactttaa tgtattcagg aatgatagaa atctgttatc atccgggaaa 2040 aagtcaggag gcggcgtttt agtcgccatt aaaaccgatt ttgactcaga aaaacttgat 2100 tctgacgttt ttgcacattt tgaacacgta tgggtcaaag ctcagattgc aaaagagact 2160 cacatctttg catctgtgta ttttcccccg ttatcgccac tcagctcata tgaagatttc 2220 ttcaggaacg ctgaaaaaat tatctccagt ctagatcccg aatccaaaat acatttgtat 2280 ggcgatttca atctcggtgc tgtcgacttc atggttgacg cagaaaacga gtctattctt 2340 attcccatct taggggaaag tgatagatta caaktaattt ttgataaaat gtactctctc 2400 ggcctaaatc aaattaatca tgttaaaaat caaaataatc gctttcttga tcttcttttt 2460 actaatatga ctgaagactt ctgtgtgact gaagcaaata atccactttg gaaaaatgaa 2520 gctcatcaca cagcaattga attttcactt ttcgtgcata aggctaataa cagaccagat 2580 ggttcagatt acgaagaaat tcctgattac aataatgcca attatccctt aatcaaacgt 2640 agactattag aagttgattg gaaagcatta ctagagaaag aaacgaacat cgagctagca 2700 gttaacactt tctatagcat tatatacgac atcctcgatg aaagcgtccc aatacgaaga 2760 aaaagacgca caattgacaa acacccgcct tggtttacaa gagaggttaa aaaccttaaa 2820 aatagaaaac aaaaagctca caaagcttac aaaagcaata gccaaaaact tgatgagtat 2880 ctacaaatat gcagtcagtt agaagtcacc atccaatttg aatttgaaaa gtatcataga 2940 aaggtagaat cggaaatcaa gagtgattcc aagcagtttt tcaattttgt caaatcaaaa 3000 ttaaaaagta gtaattttcc gtcaaatatg tctctcgaca ataaaattgg caaaaacagt 3060 gcagaaattt gtgaattatt cgcaaacttt tttgaaagcg tatacacctc ttttgatgaa 3120 aacgatcgcg accgaagcta cttttccttt ataccccaag cgaataacga aataacagtc 3180 gatagcttat ccttagatga aattcaggcg gctcttaaag ctcttgacag ttcaaaaagt 3240 accggaccag atggaatatc tccaaaattt gcaaaaaatg tagcctctga actggcttct 3300 ccgctttatt ggctatttaa tctatcatta aaatcaggaa tattcccaag tgcttggaag 3360 aaatcatatt tagttccaat tttcaaatcc ggtaaaaaat ctgacgtgaa aaactaccgt 3420 ggaattgctc ttatttcatg tattcccaag ctctttgagg caatagttaa caacaaaatg 3480 tttgctcaat taaaagacta cataacacaa aaacagcatg gatttttcaa agggcgctca 3540 accgcaacta atcttctctc gttcgtaaac tacacattga atgaaatgga taacaaaaac 3600 tctgtacaga cactctatac tgactttagc aaagccttcg acagagtgga cattcctatg 3660 cttcttttca aattggataa aatcggaatt gattctaatc tactttcatg gcttaaatct 3720 tacttaacaa aacgcacgca atgtgttaaa tttaatggcc tcatatctag gctcatcaat 3780 gttacttccg gggtacctca aggctctcat ttaggcccat tactttttat tctttacgta 3840 aatgatgtat cctttatatt taagcatatc aacgtttcga tttacgcaga tgacatgaaa 3900 ttgtttatga aaattaaaga cgaaactgac gcttcaagat tccagaacga aatcaacata 3960 ttctttgaat ggtgctgtcg tagcctttta caacttaaca ttaaaaagtg tacatcgatc 4020 ttgttcagca gaaaaataaa taagccaaat ataaacgtaa aactagacaa ccagatagtt 4080 gaaacctgtt caaatgttag agatttagga gtcatacttg attctaaaat gactttcatc 4140 gaacattaca acacgatggt attcaaagca actaatatgc tcaacttcat aaaacgattc 4200 agttataact ttaaggaccc ctatacattg aaaacattgt tcatagctta tgttcgctct 4260 attcttgaat attgtagcgt cgtgtggagt ccacacatca aaacacacga aacccgtatt 4320 gaatcagtgc aaaaacaatt tttactattt gctcttcgta aactagggtg gacagtgttt 4380 ccccttcctt cgtatattgc acgctgtatg ttaataagct tagaaacact tgtaaagcgt 4440 cgtgaattcg cttctctctc gttcgtaaat gatcttattg caaaccgaat taattctcca 4500 gagttgttac aaacattaaa tttctatgaa cccacacgcg tattgagagc acgtgagcca 4560 ttcgcattgc attttcacag aactgactac gctaaacacg ggcccatgaa tagaatgatg 4620 gaaacttata acaaatataa taacataatc gatttcacaa tgacaaaatc caccatgaaa 4680 aaacagttct acaatcaaag atagatgttc cccccttgta attagaaact aagcaacgat 4740 catgtagtct acgtatgttt gacgaataaa taaataaata aataaataaa 4790 // ID hATm-41_HM repbase; DNA; INV; 2984 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-41_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2984 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1935-1935 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(676..744,748..2664) FT /product="hATm-41_HM_1p" FT /translation="METRAIIAKLVTILDKYKKLKKNGRKTDQRKTSETIL FT KGELSQLFDVSHKNAMSLITINEDKDFLIDQRTVRKMAMAGEDKELAKKEE FT KMQARENLMKERIEKEKKKMEAQSLKVQLVDSSPSSSDSESSEHGSPQIRN FT RRSKRSTPSEKLKMTPELVCALDRSNISDRRAVHILNASAATPLSNRSISR FT SSVRRARIHSRAITAATVRDSFINEVQQKDTFLVLHWDGKLLPDCTGNGDG FT LKVDCLPILVSSPDIEFEKLLAIPKLSSSTGAAMANATVRIVREWKLEERI FT EALSFDTTSSNTGIHAGCCQLIEVQLGRPLMHLACRHHIMELILATAFKSV FT MAASSGPDIQLFKRFREQWSFISKDEISAFKDPRVDEHKDWKDATITSMQS FT SLARKSIRDDYAKLCQLSLYFLTGELCAPIRKPGAYHHARWMAKAIYVMKI FT LLFRKQMKLTKSEETGLHEVSLFIILIYSRVWIEAPQACEAAVNDKALLND FT LVSYNNVNEKIAKATLTTFLXHLWYLGGDLVGFSLFSNKLSTDEKEDIVRE FT MKKEKNMDRIKWVPKKGEETTMNLSDLASSASLQFLKSFRINETFLELPPD FT QWDVDESFQEGRRKLEKLKIVNDGAERGVAMISTFNDSLTRDEETKQALLQ FT VVEHHRRLHPLK*" XX SQ Sequence 2984 BP; 962 A; 531 C; 600 G; 890 T; 1 other; ttagggtggt ccattttaac atgtaaattt tttttttttt aaattaatga ggcatgttga 60 cttattactt ccttttggtc taaaaatcaa ctgtgcaaag tttcaggttg aaaaaaatat 120 atttaggggt tgctcaagca tatcaaagtc tagaatattt tatggttttg ccaaaaaagt 180 tttcaaaact agagtttttg tgcagactgt gtattttttg ttgtgtttaa tcattgtgtt 240 tcattatcaa tgtagaaata ttgtagaaat ttcttattca ttcagtagat gtcagcacta 300 tacttagtat acttctagaa taattattac atacaatttt aaacaccctt atttggtaat 360 attataatta ttaattactg aattaacttt ttgaattaac atttcttcat tcattttaat 420 agaatttgtt tgcataattt attgcaggtc aaataattta aaataaaaaa tggctacagc 480 tgcaataaat ataccaacac gaaaagcaac agaaatatat ttgattgggc aaccattaga 540 gaattttggt gagcgattcc ttccaacaga aaaaatgttc taaagcgtat cttctttgga 600 accaacagtc aagagaatcg caagaaagtc gttgacgaag tgctgttctt gtggacaaag 660 gcacgtattc cagttatgga gacccgtgca atcattgcca aacttgttac aattcttgat 720 aagtataaaa aacttaagaa aaattgaggt agaaaaactg accagcggaa aaccagtgag 780 acaatattga aaggcgagct ttcacaactc tttgatgtat ctcacaagaa tgcaatgtcc 840 ctcatcacaa tcaatgaaga caaggacttc cttattgacc agaggacagt cagaaagatg 900 gcaatggcag gagaagataa ggagctggca aaaaaagaag aaaagatgca agcgcgagag 960 aatctgatga aagagcgaat agagaaagag aagaaaaaaa tggaagctca aagcttaaag 1020 gtgcaacttg ttgactcgtc accatcctct tcagactctg agtcatctga acatggctct 1080 cctcaaattc gtaatcggag atcaaagcga tcaactcctt ccgagaaatt gaaaatgact 1140 ccagagttgg tttgtgcatt ggaccgtagc aacatttccg accggcgtgc tgttcacatt 1200 ttgaatgcct cagcagcaac acctcttagc aacagatcaa tctccaggag ctcagttcga 1260 cgtgctagga tacatagcag ggccatcaca gcagccactg ttagggattc cttcatcaat 1320 gaagttcaac agaaggatac ctttttagta ttacactggg atggcaagtt gctcccagac 1380 tgtacaggaa atggagatgg cttaaaggtg gattgtttgc caattcttgt ttcaagccca 1440 gatattgaat tcgagaaact gctggcaata ccaaagttat catcgagtac cggagctgcc 1500 atggctaatg caactgtcag gattgttcga gagtggaaac tcgaagaaag aatcgaagct 1560 ctttcctttg acacaacatc atccaacact gggatccatg ctggttgttg ccaattaata 1620 gaggtacaat tgggccgacc tctgatgcat cttgcatgca gacatcacat tatggagctg 1680 attttggcta cagccttcaa atcagttatg gctgcatcat ctggaccaga tattcagcta 1740 ttcaaacgtt ttcgagaaca atggtccttt atctcaaaag atgaaataag tgctttcaag 1800 gatccgagag tagatgagca taaagattgg aaagatgcta ccataacctc catgcagtct 1860 agcttggctc gaaagagtat acgtgatgat tatgccaaac tgtgtcaact gtcattgtat 1920 tttctgacgg gagaattatg tgctccaatc aggaagcctg gtgcatacca tcacgccagg 1980 tggatggcca aggctatcta tgtgatgaaa atcttgttgt tccggaagca aatgaaactg 2040 acaaaaagtg aggagactgg tctacatgaa gtgtctctct tcattatctt gatttactcc 2100 agagtctgga tagaggctcc gcaagcttgt gaggcggccg tcaatgataa ggctcttcta 2160 aatgatttgg tcagctacaa caatgtgaac gagaagattg cgaaagccac attaacaact 2220 ttcctaygcc atctatggta tctaggtggt gaccttgttg gcttttcact tttttccaac 2280 aagttgtcta ctgatgaaaa agaggatatt gttagagaaa tgaagaagga gaagaatatg 2340 gatcgaatta agtgggttcc aaaaaaagga gaagaaacga caatgaacct cagtgatctt 2400 gcttcttccg catccttgca atttttaaaa tctttcagaa tcaatgaaac cttcttggag 2460 ttacctcctg accaatggga tgttgatgaa agttttcagg aagggcgcag aaagcttgag 2520 aaactgaaaa ttgtaaatga tggggcagag aggggagttg caatgatttc cactttcaat 2580 gattctttga caagggacga agagacaaaa caagcactac tacaagttgt agagcaccat 2640 cgtcgtctcc accccctcaa atgacttttc tgacattaat gaacgatgtg aatgattgtg 2700 ttctagtttc gtcacgtttt ttgcatgttt tgatgttcaa taacttttca gtgattttta 2760 aatttaaatt tgtcttaaat tcatttattt gttcctgcaa tttgcatttt tttgataata 2820 tgattcattt gatgtatagc ttgaaacttt gatagcattg agcaacctct aaatacagtc 2880 caaactatgt aatattttgc acagtggctt ttttgaacaa aatgaacata acaaacaaca 2940 tgcctctgaa atatattaaa cttgactttt ttggaccacc ctaa 2984 // ID Copia-8_CQ-I repbase; DNA; INV; 2741 BP. XX AC AAWU01044740; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_CQ_; KW Copia-8_CQ-LTR; Copia-8_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2741 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 331-331 (2011). XX DR Genome; AAWU01044740; Positions 2895 155. XX CC Positions [1595-2134] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 149..2728 FT /product="Copia-8_CQ-I_1p" FT /translation="MAEEDKGERFFMPLFDGTEFPAWKYRLLVMLEDAELL FT HCVEVEAVDDPALTEAAGDSEQQKAAKKALLAKRLKQDRRCKMLMVSRIHN FT SQLEHVQDLKTPKEILDALVGIFERKSIAGRMAAQRKLMAHRYVSGPLKDF FT FLEHDRLVRQLRGTGAVISEIDVVCNLLLNMGSAYATVVTTLESMEEKKLT FT MQTVKCRLLDEEVKRSAIGVELVTPKTESAAFAGDKESRPKKKLKCFGCQQ FT EGHKIADCPKKMQEQKEPKPSSGRTSKANAAESSSGEICFAAEKDAGYPPE FT KSKVRWIVDSGCSDHLVNDETLFDELTLLAKPVEIAVAKDGESIVAHYSGT FT VRLHTIVGGERRRCSVSNVLYVPELRMNLFSVLQVEKKGMKVVFDKGSVKI FT FSNSELVANGARRGLFYELDLYRIESVESESLLACGRITKSLQLWHRRFGH FT LNMNQLNQLIAKGFVDGLKPQSGDNSVVVCEPCVVGKQTRKPFSVRDGKRS FT SRVLEIVHSDVCGPISPEGANGERYFVTFIDDWSHFLMVFAMVTKDEVLER FT FMKYEAYVTAKFGVQISRLRCDNGGEYKNKAFEDFCDRKGIRMECTVPYTP FT EQNGTSERMNRTVVEKGRAMLEDAAIDKSFWVQAIQAAAYLTNRSPTNANE FT GNKTPFEVWEGRRPNVSNLRVFGSKVYVHVPKERRKKLDAKAWKGIFIGYS FT NNGYRVWNPETKQESVARDVDFVENPDAVEVAEGVIQSTKKSDVPMILIRR FT DEEESVVEHDEVVEEDEDFESCTEPDEDVPATEPAPVPIDGAPAPAARPDR FT NRNPPAWHKDYEVEYGHSASSALSYVDYSLELLDELRERLDREKRRILTSA FT GVPDCCV" XX SQ Sequence 2741 BP; 676 A; 630 C; 874 G; 561 T; 0 other; cttcaagaag agttaccaaa cgaaatacac gcgctttatc tatcgtttaa ctaaaaccgc 60 gttttattga ccgcgtcggt cgcccttccg ccgtaacact ttaggttatg ggccctgcaa 120 cgcgcagcga gtgagtgaaa gttaaattat ggcggaagag gacaagggcg agcggttctt 180 tatgccgctg ttcgacggga ccgagtttcc cgcctggaag taccggctcc tggtgatgct 240 ggaggacgct gaacttctgc attgtgtgga agtggaggcg gtggacgatc cggcgctgac 300 ggaagcagcg ggcgattccg agcagcagaa ggcggcgaaa aaggcgttgc tggcgaagcg 360 gctgaagcag gatcggaggt gcaagatgct gatggtgtcg agaatccaca actcccagct 420 ggagcatgtg caggatctga agacgccgaa ggaaattctg gacgccttgg tcgggatctt 480 cgagcgcaaa agcatcgccg gtcggatggc cgcgcagcgg aaactgatgg cccaccgcta 540 tgtttccggg ccgctgaagg acttcttcct cgagcacgat cgactggtgc gacagctccg 600 cggaacggga gcggtaattt ccgagataga cgtcgtctgc aacctgcttc tcaacatggg 660 atcggcgtac gcgacagtgg tcacgacgtt ggagtcgatg gaggagaaga aactgaccat 720 gcagacggtg aagtgtcgtc ttctcgatga agaagtcaag cgatcggcga taggtgtcga 780 gttggtcacc ccgaaaactg agtcggcggc ttttgctggt gacaaagagt cgcgaccgaa 840 gaagaagctg aagtgcttcg ggtgtcaaca agaggggcac aagatcgctg attgcccgaa 900 gaagatgcag gaacagaagg agccgaagcc atcgtccgga agaacatcga aggcgaacgc 960 tgctgaaagc agtagtggcg agatttgttt tgccgctgag aaggatgcag gatatccacc 1020 ggagaagtcg aaggtgcgat ggattgtgga ctccggttgc tcggatcacc tcgtgaatga 1080 cgagaccctt ttcgatgagt tgacgctgct ggccaagccg gtggagattg ccgtggccaa 1140 agatggagag tccatcgtgg cgcactactc ggggacggtg cggctgcaca ccatcgtcgg 1200 cggtgagcgt cggcgctgtt cggtgtcgaa cgtgctgtat gttccggagc tgcggatgaa 1260 cctgttttca gtgctgcaag tagagaagaa aggcatgaaa gtggtgtttg acaaaggaag 1320 tgtgaagatc ttcagcaact ccgaactggt tgccaatggt gcgcgtcgcg gtctgttcta 1380 cgagctcgat ctgtacagaa tcgaaagtgt tgaaagtgag tcgttgttgg cctgtggacg 1440 cattacgaag agcctacagt tgtggcaccg caggttcgga cacctgaaca tgaatcagct 1500 aaaccagctg attgcgaaag gctttgtcga cggtttgaag ccgcagtccg gtgataacag 1560 cgtcgtcgtg tgtgaaccgt gtgtcgtcgg gaaacaaaca cgaaagcctt tctccgtgcg 1620 cgatggcaag aggtcgtcgc gagtgcttga aattgttcac tcggatgttt gcggacctat 1680 ctcaccggaa ggtgcaaacg gtgaacggta cttcgtcaca ttcatagacg attggagtca 1740 tttcttgatg gtgttcgcga tggtgacgaa agatgaggtg ctcgaacggt tcatgaagta 1800 cgaagcgtac gtgacggcga agttcggagt gcagatctcc cggctgcggt gcgataacgg 1860 aggcgagtac aagaacaagg ccttcgagga tttctgcgac cgcaagggga tccggatgga 1920 gtgtacagta ccttacaccc ccgaacagaa cggcaccagt gagagaatga accgcacagt 1980 cgtcgagaaa ggacgtgcca tgctggaaga tgctgcgatt gataagagtt tctgggtcca 2040 agctattcag gctgcagcct acttaacgaa tcgcagtcca acgaacgcca acgaaggaaa 2100 caaaacaccg ttcgaagtgt gggaaggacg gcggccgaac gtttcgaacc ttcgtgtgtt 2160 cggctccaag gtgtacgtgc acgttcccaa agagcgccgc aagaagctgg acgcaaaggc 2220 ctggaaaggg atcttcatcg gctattccaa caacggatat cgggtgtgga atccggaaac 2280 gaagcaggag agcgtcgcaa gagatgtcga cttcgtggag aatccggatg cagtggaagt 2340 tgccgagggt gtgatccagt caacaaaaaa aagtgatgtg ccaatgattc tgattcgtcg 2400 agatgaagaa gaaagtgttg tggaacacga cgaagtcgtt gaagaagacg aagactttga 2460 gagctgcacc gagccggacg aagatgttcc tgctaccgag ccagcacctg tgccgatcga 2520 cggcgctcct gctccagccg ccagaccgga ccggaatcga aatcctccag cgtggcacaa 2580 agactacgag gtggagtacg gtcattctgc ttcaagtgct ctgtcctacg ttgactattc 2640 gttggaattg ctggacgagc tacgggagcg actggaccgt gagaaacgga ggattctgac 2700 cagtgctggt gtaccggatt gctgtgtttg agcgggcgta t 2741 // ID Mariner-38_SM repbase; DNA; INV; 2237 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-38_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2237 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1887-1887 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 289..1959 FT /product="Mariner-38_SM_1p" FT /translation="MSGRKESVKDLTKNSRRNLSMDIKLKVIRKIENGERQ FT VDVCRELNLPGSTVRTIMQNAEKIKRSVQSTTPLSATKTSRTRNTMIEKME FT KLLTYWIEDQNNRDMPLSQAVIMEKARGLFEELEKQNSDEPSTSSSVETFA FT ASRGWFEKFKVRANIHNIRICGEAASADVGAAKDFTKSLKSLIAEKKYPPE FT LIFNVDETGLFWKRMPSKTFISKEEKRSPGFKAAKDRLTLLLGGNASGDFK FT LKPLLLYQSGNPRAMKGYSKDNLPVIWMSNRKAWVTAAIFELWFVKYFCPA FT VKDYCESKKLESRALLIIDNAPGHSKNIAALSKTIPVEVIFLPPNTTSLIQ FT PMDQGVIANFKAYYLRRTFKQLLHAVDVQKETILKFWKSFNIMMAIETIRD FT SWNEVTNDSMRGVWNKILLKENENSVPDNLHSIIVETVQIAHDVGFDDLMN FT EDVEEIINSHDAGLSNEDLIEIGDESYQNHESEDDENVEPPKKLTKKVLQE FT NLEIIAKALNVLTDNDPDSERAGFIKRGVSKSLTPYTEILKEKQSNRQQSK FT LDQFFLRNM*" XX SQ Sequence 2237 BP; 794 A; 349 C; 407 G; 687 T; 0 other; cagtaatacc tcgtataacg ctccctctta caacgctgtt tcgttataac gctcaaaaca 60 aaaaaaaatt gatatgaaaa aatcattaaa aagaatacat atacatatat agtaaaattg 120 atttctgtgt attttatttc aaatttccat acatttttgt ttatattttg gggattcaca 180 taaatcattt gaattttttt agaaacaaaa attaaacagt taatcagtgt ttcttgttga 240 aatgttccta tttatttttg taaataacta aataatttct tatcaataat gtcgggacgg 300 aaagaaagtg tgaaagattt aacaaaaaat tcgagaagaa atttatctat ggatataaaa 360 ttgaaagtaa ttcgaaaaat agaaaatggg gaacgtcaag tagatgtatg ccgtgaattg 420 aatttacccg gatcaactgt tagaacaatt atgcaaaatg ctgagaaaat taaaagatct 480 gtgcaatcaa caacaccact ttcagctaca aaaacctctc gaacgcgtaa cacaatgatt 540 gaaaaaatgg aaaaactgtt gacttactgg atagaagacc aaaataatag agatatgcca 600 ttgagtcaag cagtcataat ggagaaagca agaggtttat ttgaagagtt agagaaacaa 660 aactctgacg aaccttctac cagctcatca gtagaaacgt ttgcagcaag ccgaggatgg 720 ttcgagaaat ttaaggttcg tgctaatatt cacaacataa gaatttgtgg agaagccgct 780 agcgccgatg taggtgccgc gaaagatttt actaaatctt taaaatcgtt gatagcggag 840 aaaaaatatc ccccagaatt aatattcaac gtagatgaga cagggttatt ttggaaaaga 900 atgccttcca aaacgtttat ttctaaagaa gagaagcgat ctcctggatt taaagcggcc 960 aaagatcggc ttaccctttt attaggtggg aacgcttcag gcgattttaa attgaaaccg 1020 ttgttgctgt atcagtcggg aaatcctcgg gcaatgaagg gttactccaa ggataattta 1080 cctgtcattt ggatgtccaa tcgtaaggct tgggttacgg ctgctatttt tgagctatgg 1140 tttgttaaat atttctgccc ggcagtaaaa gattattgcg agagtaagaa actggaatcc 1200 agagctcttc ttataattga caatgcacct gggcattcaa aaaatatagc tgcgctttca 1260 aaaaccatac ctgtggaagt tattttttta cctccgaata cgacttcttt gattcaaccg 1320 atggaccagg gagttatagc gaattttaaa gcctattacc ttcggcgtac attcaagcaa 1380 cttttacatg cagttgatgt acaaaaggaa acaattttga aattttggaa aagtttcaat 1440 atcatgatgg cgattgaaac tattagagac tcctggaatg aagttacaaa tgattccatg 1500 cgaggtgttt ggaataaaat tcttcttaaa gaaaatgaaa attctgtgcc cgataactta 1560 cactccatta tcgtggaaac agtgcaaatt gcacatgatg tggggttcga cgatctaatg 1620 aacgaagacg tagaagagat aatcaattcg catgatgcag gcctttccaa tgaagattta 1680 atcgaaatcg gagacgaatc ataccaaaac cacgaaagtg aagatgatga aaatgtagag 1740 ccgccaaaga aattgacaaa aaaagtttta caagaaaatt tagaaataat tgccaaagct 1800 ttaaatgttc taactgacaa tgaccctgat tctgaacgcg ctggttttat taagagaggt 1860 gtttcaaaaa gtcttactcc gtacactgaa attttaaaag agaaacaatc gaatcggcaa 1920 cagtctaaat tagaccagtt tttcttacgg aatatgtaga atttaattta gtttttattt 1980 tttatgtttt tgatacatac atatatttaa tttacataat gaaaacttat ttattgaaga 2040 agtatcgaat atttttattt tgaagtatat attgaactga attaaatgta tgtaccttca 2100 atttattgat accatctttt ttatttttgc cttcaaagta ccccatattt taatgtaaat 2160 aatacctctt ttaacgctgt ttctgttaac gctcaatttt caggaacgca ttatgagcgc 2220 taaacgaggt atgcctg 2237 // ID Gypsy-3_AA-LTR repbase; DNA; INV; 1285 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_AA_; KW Gypsy-3_AA-I; Gypsy-3_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1285 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 976-976 (2011). XX DR [2] (Consensus) XX SQ Sequence 1285 BP; 417 A; 266 C; 267 G; 335 T; 0 other; tgtaacaatg ttacaaaatc ccttagtcct aaatcctgtg attcaaaaca aaaagaacat 60 aaattaaatt taaataaata tatggattcg aaaatgagcc tttactgtca gagtcggagt 120 tgcgttccag gtctctgggt acatgaacgt atagggcaat ccgataaatg aggccgatct 180 ttaaaaatta gttctctttc aaaatcgggc cccgacaata accgaacgaa taggaacggg 240 tcgcggttaa ggctgacatg gcaaaaatct gcgttgaaga ttctctttca ttccaactgt 300 ctacggatag agacgcattg tggcgaagtt taactagtga tctaagtagg gagaagctag 360 aagaattgtg gctgtggagt tgaatagttt ggaaaagtaa aggctgtcta agtggaagaa 420 aaaaaactgt gaataattta gaatgttaaa attatatttg cagataaatt aaaagtgaac 480 ttgaaaaaaa ctcgatattg tccgagctaa agttagacac gtgttccgag taaggtttgc 540 acaaaagaat gtattgttct ctttataaaa gatttttttt acattcaggc ctaagccgct 600 cgcttgtttt ccttaaaaca gcattcttac ttaattaagt gcaaatttgc ataaaacgaa 660 caggtaattg gtaaaaatgt tttttgatgt aacctgaata gtaatttcaa atcgtttgaa 720 gatcccttca aacacaaggg tgcaccccta ggattcggag cgtgcgacaa aatccacttg 780 atcgtttggg caatgctgtt tggacgatga ggccgcgacc gagcatatgt cgtgcagcat 840 aatgaactac atcgtcattt ggtgactccg ttaactcacc acccgggtta gccaggccac 900 aacccaccaa accgttcgcc aacgaaaacc gatagagggc cgccgtcaac atacaagcga 960 tcggttgcat gcaagaccga caatcgaaac ttgcaagact ccaccttcgt tccatacaca 1020 tgaccggtca taggagccac tttgaccccc ttccgtcatc agtaagtgca cttaccatcc 1080 gccgctggac ctcgagggac ctctccgcca tcaccggtag ctgcaaacac aagtaaggtc 1140 gaccaaattt gtacactact aacccctaga caatgtaaat gcaaattgag atcagaagca 1200 aaatgtttta aatgggtatc cttttgagcc actgccatac ggatcgaaaa agagaataaa 1260 agtgcatggc actacgtttt tatca 1285 // ID SR2A repbase; DNA; INV; 552 BP. XX AC AF025680; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Schistosoma mansoni SR2 subfamily A non-LTR retrotransposon. XX KW Non-LTR Retrotransposon; Transposable Element; SR2A. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-552 RA Drew C.A., Minchella J.D. and Brindley J.P.; RT "A second family of non-LTR retrotransposons from Schistosoma RT mansoni: Subfamilies and nested insertions."; RL Unpublished. XX RN [2] RP 1-552 RA Drew C.A.; RT "SR2A."; RL Direct Submission to Genbank (19-SEP-1997)Molecular Parasitology RL Unit, Queensland Institute of Medical Research, Post Office, RL Royal Brisbane Hospital, Brisbane, Queensland 4029, Australia. XX DR GenBank; AF025680; Positions 1 552. XX SQ Sequence 552 BP; 131 A; 129 C; 102 G; 190 T; 0 other; ttctcgctca cactagaagc aagacaattt acactattat tattattatt accattatta 60 ttattattat tactattatt attattatta ctattattta ctacgtcgct ttttcactcc 120 ctttattctc aaattgtgta tccttccttt ccaacttatg caggagctcg cccatggtgg 180 aaaatctgtc ctatctaaac gatctccagt cactgtctgg ttgaaagtga atgcttccac 240 cgattatgaa tactgaagca gtctttctga aaccgcgtac accgttcaag ctggcttcct 300 tcaacgttcg cacactaatg cattatcaga caacagatgg ggctggctat gtctttggaa 360 ggtcttaatg ttgatgtttg ttgtctatcc gagacctgta ttcaagactc tagtgaagta 420 ctacaaattc gctctccatc tgtctcctcg aaaagcttgt ttcacgtgcg cttatccggg 480 gaccctgtgg catcttcgtc tggtcttgct ggcgttggtg tcgcactaag cgctagggct 540 gaggcagcac ta 552 // ID Gypsy-61_CQ-LTR repbase; DNA; INV; 179 BP. XX AC AAWU01037281; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-61_CQ_; KW Gypsy-61_CQ-I; Gypsy-61_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 502-502 (2011). XX DR Genome; AAWU01037281; Positions 11322 11500. XX SQ Sequence 179 BP; 57 A; 30 C; 60 G; 32 T; 0 other; tgtgggaact gaatacggcc ctgacgcacg cggtaggcaa cggggagaga ggggcggaag 60 acgaaccgtc agagtgaggg ggacagttga ggcgggaata cgatcggaga ataaagatca 120 ataattggag ctctaattga atacgtgttt tattgaacaa ctaatcgcag agtagcaca 179 // ID Gypsy-13_RP-I repbase; DNA; INV; 4131 BP. XX AC ACPB02039362; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_RP_; KW Gypsy-13_RP-LTR; Gypsy-13_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4131 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02039362; Positions 15166 11036. XX CC Positions [3163-3675] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 26..982 FT /product="Gypsy-13_RP-I_2p" FT /translation="MPKRRKAQPEPSDDVEVGVPPHPANKMNTSSRSSSTD FT KCLILPHPASKVTAPPFAPEQVDVWLLQFDTSLQIAGVQDSGSKYNHLITA FT LPTDVLAQAMRVVEASVQDKYEHMRKYLKERYQVTIDQRIERLFAEGELGD FT RRPSQLLNDLRRLAAGTDIGQHTIRQLWLQRLPIHTRTIVAAQEGSLDSLA FT QTADRVWGVECPEHEIGETRCIRSIQNTQATSYATPIERVDDVPFPHSTHN FT QHSPQPREVAQIKSPKERRRETRSAQAAKLKEQIAQLEKELRSLETDELCF FT FHRKFGSKAYKCKPSCGWRPENCLGKR" FT CDS 952..4116 FT /product="Gypsy-13_RP-I_1p" FT /translation="MAAGKLFGQALDAASSLPPLSSRLYVTDRLTGTMFLV FT DTGAEVTAIPPPAGRQSLSTGEPKLTAANGTPIPLYGRRPMHLDFGEDLAA FT HWDVLIAGVSRPILGADILKAYGWTVDLQNGQLIASTTGNTKKEALTVHTH FT QTRRGENAADRYEAILNEFPRVVGDEPMTDRVCSHTTHTILTTGRPTYAKP FT RRLPPDKLAAAKLEFDHMCRKGICRPSKSHWASPLHMVRKKTGEWRPCGDY FT RQLNAQTVPDRYPLPHIQDFTSHLHGKTVFSTIDLERAYHQVPVAPEDVPK FT TAITTPFGLFEFPQMTFGLRNAAQTFQRFMHGVLQGLPECFVYIDDILVAS FT SSEDEHHDHLRTLLSRLDQHGIKINRQKCHLGQQEVQFLGHTVSAQGIRPT FT GVKCEAITSYPQPSTVEQLRRFLGMFNFYRRFVPHAAQLQSPLLKFLKGAR FT KRDTRKIEWNKESEGAFHDCKTALADAALLAHPEANPTLELVVDASESALG FT AALHQVRAQGREPLAFFSRRLSDAERRYSAYDRELLAAYAAVRHFRHWVEG FT RQFTLLTDHKPLTFAFQQRPDKASPRQLRHLDYIAQFSTDIRHISGSCNTV FT ADALSRVEALSTHQGLDYSQLAREQLQDVELQQLFQPSSDTHPLSTGLRLK FT RLSMQGGGPNVICDLATGRPRPFIPRSQRKQVFLLVHSLAHGGPKATARAV FT AQRFVWPRMQRDCREWARDCEDCQRGKIGRHTKAPLTAFSQPGERFAHLHI FT DLVGPLPPSQGYRYCLTCIDRYTRWAEAIPLQSATAPAVASALYAGWIARF FT GCPASITSDQGTQFTSRVYHELAALTGTQLKRTTAYHPQANGVIERWHRTL FT KTALRCHGTDSWMDRLPTVLLGLRAAVREDTGLSPAEYVYGTTLRLPGEFV FT GEEFTNQTQTEVVQQLKRSMRALRPAPTVWHARQKPFVHQQLGTCSHVFLR FT IDAHRSPLAPPYEGPYPVLSRTDKTLCIDRRGKSKTVTIDRVKAAYGLATP FT GLVITPAAPEPTQVQEKQLGAHPIDTTQNHATRSGRISKTPQRYGTQRC" XX SQ Sequence 4131 BP; 1087 A; 1203 C; 1088 G; 753 T; 0 other; agtggtgacc ccaacaataa caagcatgcc gaaaaggaga aaagcacaac cagaacccag 60 cgatgatgta gaagtcggtg taccaccaca tcccgccaac aaaatgaata catcctctag 120 gtccagcagt actgataaat gccttatact accgcatcca gccagcaagg taacagcgcc 180 cccattcgcc ccagaacagg ttgacgtgtg gctgttgcag ttcgacacct cgctgcagat 240 agccggcgtg caggactctg gtagtaagta taaccatttg attacagccc tgcctaccga 300 cgtactagca caggcaatgc gagtggtgga ggcgtcggtg caggacaaat atgaacatat 360 gcgcaaatac cttaaagaac gatatcaggt cactattgac cagcggatcg agcgactgtt 420 cgcagaaggg gagctagggg atcgacgccc ctcgcaatta ctaaacgacc tgcgacgatt 480 ggctgccggc actgatatcg ggcagcatac catccgtcag ttatggctcc agcgactgcc 540 aatccacacg cgcactatcg tggcagcgca ggaaggttca ttggacagcc tagcacaaac 600 cgcggacaga gtatggggag tagagtgtcc agaacacgag atcggtgaaa cgcgatgcat 660 ccgtagcatc caaaatacgc aggcaactag ctacgcgaca cccatagagc gggtagatga 720 cgtaccattc ccgcatagca cacataacca acactcgcct cagccgcgcg aagtagcaca 780 gataaaatcc cctaaagaac ggcgacgcga aactcgcagc gcacaagccg cgaaattgaa 840 agaacagatt gcccaactag aaaaggagct gcgcagcttg gagactgacg agctctgttt 900 ttttcatcgg aaatttggca gcaaagccta taaatgtaag ccctcgtgcg gatggcggcc 960 ggaaaactgt ttgggcaagc gttagacgcg gcgagcagct tgccccccct ctcaagccgc 1020 ctttatgtaa ctgacaggct caccggcacg atgtttctcg tcgatacagg ggcggaagtg 1080 accgcaatac cacctccggc gggcagacaa tcactatcta ccggagaacc taagctcaca 1140 gccgctaacg gaaccccaat tccactttac gggcgacggc cgatgcacct ggacttcggc 1200 gaggacctag cagcacattg ggacgtactc atagcaggcg tgtcgcggcc aatcctgggc 1260 gcggatatct taaaggctta tggctggaca gttgacctcc aaaacggcca gctgatagcg 1320 tcgacgactg ggaacaccaa gaaggaggca cttaccgtgc atacgcacca aactaggcgc 1380 ggggaaaacg cagccgaccg atacgaggca atattgaacg agttcccaag ggtcgtgggg 1440 gatgagccca tgacagacag agtgtgttcg cataccacac acactatact aacgaccggg 1500 cgccccacgt acgccaaacc aagacgcctc ccaccagata agctagcggc ggctaaacta 1560 gaatttgatc acatgtgcag gaagggtatc tgccgaccct ccaagagcca ctgggccagc 1620 cctctgcaca tggtccgtaa gaaaacggga gaatggcgcc cgtgtggaga ctaccggcaa 1680 ttaaatgccc aaacagtccc agatcgctac ccattgcccc acatccagga cttcaccagt 1740 catctacacg ggaaaacagt gttctcgaca atcgacctgg agcgggccta tcaccaagtt 1800 ccagtggcac cggaagacgt accgaaaacg gccattacca cacccttcgg gctctttgag 1860 tttccgcaaa tgaccttcgg cctccgtaac gcggcccaga cgttccagcg gttcatgcac 1920 ggagtcctac aagggttgcc agaatgtttc gtctatatcg atgacatttt ggtcgcctcc 1980 agttcggagg acgaacatca cgaccacctc aggacgctgt tgagccgctt ggaccaacac 2040 ggtatcaaaa ttaaccgcca aaagtgccat ttgggtcagc aggaagttca gttcctcggg 2100 catacagtct ccgcacaggg aatacggcca acaggagtaa aatgcgaagc gataacgtcg 2160 tatccccaac ccagcactgt ggagcagctt aggcgattcc tgggaatgtt taatttttac 2220 aggcgattcg tgccgcatgc ggcacagctg caatccccgc tgcttaaatt cctaaagggt 2280 gcacgtaagc gcgacacgcg taagattgaa tggaataagg aatcggaagg cgcattccac 2340 gattgcaaaa ctgcactggc cgatgcggca ctcttggcac acccagaagc taaccctact 2400 ttggagctgg tcgtggacgc gtcggaatcg gcgttagggg ccgctttaca ccaagtcaga 2460 gcccagggac gagaacccct cgcgttcttc tccaggcgac taagcgatgc agaaaggcgt 2520 tatagcgcgt atgataggga gctcttagcc gcatacgccg cagtgcggca cttccgccac 2580 tgggtcgagg gacgacaatt cacattacta acggaccata aaccactgac ctttgccttt 2640 caacaacgtc cggacaaggc atctcctcgt caactacgtc acctcgacta tatcgcgcaa 2700 ttctccaccg acatccgcca catcagtggt agctgtaata ccgtcgcaga cgctctgtca 2760 cgcgtagaag ccctaagcac tcatcaaggc cttgattaca gccagctggc gcgagaacag 2820 ctacaggacg tggagctaca acagctcttt cagcccagct cggataccca ccccctcagc 2880 actggcctac gattgaagcg actgtcgatg caaggaggag ggccaaatgt tatttgtgat 2940 ttagctacag gccgaccacg cccctttatc ccacgcagcc aacgcaaaca agtctttcta 3000 ttagtacaca gcctggctca tggaggtcct aaggcgaccg cacgggcggt agcccaacgg 3060 ttcgtctggc cgcgcatgca gcgggactgc cgtgagtggg cacgcgactg tgaagattgc 3120 cagaggggta aaatcgggag acacacgaaa gcgccgttaa cagctttctc ccagcccggg 3180 gaaagatttg cgcatttgca cattgatctc gtgggaccgc tcccgccctc gcaaggctat 3240 aggtactgcc tgacatgtat agacaggtac actcggtggg cagaggcgat accgctacaa 3300 agcgctactg cgcccgccgt cgcttctgct ctgtacgccg gctggatcgc aaggttcggc 3360 tgcccagcct caatcaccag cgaccagggg acgcagttta cgtcaagagt gtatcatgaa 3420 ctggcggcgc tcacaggaac tcagctcaaa cgaacaaccg catatcaccc gcaggctaat 3480 ggtgttatag agcgatggca caggactctg aagacagcgc tacgctgcca tggaacagat 3540 agctggatgg acagactgcc caccgttttg ctgggtttac gagcagccgt gcgtgaggat 3600 acgggactgt cccctgcaga gtatgtgtat ggaaccacgc tacggttacc aggcgaattc 3660 gttggggagg aattcactaa ccagacgcag acggaagtgg ttcaacaact taaaaggtct 3720 atgagagccc tgcgcccggc accgacggtc tggcacgccc gccaaaaacc atttgttcat 3780 caacaattgg gaacctgttc gcatgttttc cttaggatcg acgcgcacag gtccccgctg 3840 gcgccgcctt atgaaggtcc atatcccgta ctctcacgaa ctgacaagac cctatgtatc 3900 gaccgacggg gcaaaagtaa aacggtcacc atcgaccggg tcaaggcggc ctatggactg 3960 gccacaccag gccttgtcat aacaccagcg gcgccagagc ctacccaggt ccaagagaaa 4020 caactgggag cacacccgat cgacaccaca cagaatcacg ctacacgatc cgggcgaatc 4080 agcaaaaccc cgcagcgcta tggtacgcag cgctgctagg aaggggggtg c 4131 // ID SCAI_EH repbase; DNA; INV; 118 BP. XX AC X61182; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE E.histolytica ScaI repeat. XX KW SCAI_EH; Sca I repeat. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-118 RA Bhattacharya S.; RT "SCAI_EH."; RL Direct Submission to Genbank (01-AUG-1991)S. Bhattacharya, RL Jawaharlal Nehru University, School of Environmental Sciences, RL New Delhi 110067, INDIA. XX RN [2] RP 1-118 RA Mittal V., Sehgal D., Bhattacharya A. and Bhattacharya S.; RT "A second short repeat sequence detected downstream of rRNA genes RT in the Entamoeba histolytica rDNA episome."; RL Mol. Biochem. Parasitol 54(1), 97-9100 (1992). XX DR GenBank; X61182; Positions 1894 2011. XX SQ Sequence 118 BP; 49 A; 12 C; 15 G; 42 T; 0 other; tttaatgaaa agtactaaat acaaagtaca ataatttcta attgtgaaaa tcaaaggaat 60 tattcaaaat ggtcgtcgta tagatgaaat aattttttac taatttacac cgttgatt 118 // ID Gypsy-203_AA-I repbase; DNA; INV; 4143 BP. XX AC supercont1.83; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-203_AA_; KW Gypsy-203_AA-LTR; Gypsy-203_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4143 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.83; Positions 1403700 1399558. XX CC Positions [1633-2082] - Reverse transcriptase CC Positions [3166-3675] - Integrase core CC 'CGCTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 160..4122 FT /product="Gypsy-203_AA-I_1p" FT /translation="MPDEENKVMSAASASIKPPKPLIIEENMAAKWKQWWR FT QFKWYSVATELETKPPPIQAATFLSCIGDECIRVLDTFGLTDAQEADIKVL FT KDKFEAYFVPKSCITYERYVFGKIVQNAGEQCDSFFTRVREQAKRCAFSVL FT HDSMVKDKIIAGTIYTNLIPQLLNDDNDLQKTIDIVRNYEQSVKQTKVMLE FT KSVLEVDAVRQKKYESNVQSDAPFPCNRCGLEHRRRMCPAFNKKCLKCGKK FT GHFADRCFGGGASGGSSVSSGSMASGGSIKTGNNRYVKTVEVEEEEELSVE FT ELYIAAVDDDDENSDEVWYETVSINGKSVTLKLDSGAACNVLPWNIFRKLG FT KELQQSKTKRLISYSNHKVNVKGEAELPVVVRGRKETATFKVIEGDMMPIL FT GRKTSVRFKLIARIDEVNMEKSLFNGLGCIKGFVYDIDLVENPMFRNESPR FT RIPHALRSAVKAELDSMLQMGVIEPITEPTPVLNAMVIVRQKGKLRICIDP FT SEVNKNLLRRVHPLSTVEEISARICGSKHFSILDMKKGFWQIPVSERTKKY FT LAFGTPWGRYTCRRLPFGLASAPEIFQNLMSSLLEGIDGVESSMDDILIHA FT PTEEKLNKITKLVLKRIEDAGLKLNKDKCIFNKPTVKFLGHLVTDVGLKAD FT PEKLEVIQKLKRPTNKLQLQRILGTITYLGKFVENLSAITEPLRRLLVKDV FT EWIWDHDQEEAFLKIKQIMMTPPVLAYYDVNSDVTLSVDASSKAFGAVLLQ FT NDKPVAYASKSLTKAQENYPQIEKEAAAIRFACNRFHDYIYGKKIKIETDH FT KPLESIFKKSIDRAPPRLKRIMLDVVQYAPEIMYKRGSDIPLPDILSRDMD FT NLESRDPSDELEVHIILPMSKPARQEIVEECLRDPEIVKLTQVIMNGWPNE FT KRLVPDVLRKYWSFRDEMAVYEGLVFRSHQILIPKSLRNKMLKEIHKGHTG FT IQGCMRRAKQSLFWVGMTTEIAGMIEQCAVCEKHQRSNVKHELINNEIPTL FT PFEIVSSDLFHFCGKEYLLIADSYSGFFDFAQLREPTARVVIEQLKRWFAT FT HGIPRVLYTDNGPQYSANEFATFSRLWSFDHVTSSPHFPRSNGLAEKFVQT FT AKNLLKRCAEDGSDVQLALLLIRNTPRDEKLQSPSQRLMNRTLRSTIPAAD FT VVLKPRVINQVTENLTQKRLQQKKYADRSSVKPQEYHVGQAVMLRDEKSSF FT WKGGRITEKLEQPRSYMVQIGDGKVVRRNVRDIRDMKARDSCFEVTGNEIQ FT SAVNYHQDQREANPEDDEELGGTQQGRQSIVPLRPSTDSPIRTRCGRIIRS FT NRNSEFEYY" XX SQ Sequence 4143 BP; 1336 A; 764 C; 1037 G; 1006 T; 0 other; tggtgtcaga aaaagtgagt tatcgaaaat agttattgat tttcgtgacg aacaagtgaa 60 aaaatgcaat tgcagtgtac agtaaagttg aatcgaatta attttcaaac tagctcaccg 120 gaataagcct gaaaatagtg agcagtttcc acaacaagga tgccggatga ggaaaataag 180 gttatgtcgg ccgcttcagc gtcgattaaa ccaccgaaac cactaatcat agaggagaac 240 atggctgcca aatggaagca gtggtggcgg cagtttaagt ggtattcagt cgcaacagaa 300 ctggaaacca aaccaccacc aatacaagcg gcgacgtttt tgagctgtat tggtgacgag 360 tgtattcgtg ttttggacac tttcggactg acagatgccc aagaagcaga cattaaagtg 420 ctaaaagaca aattcgaagc gtattttgtt ccgaaatcgt gcataacgta tgagcggtac 480 gtcttcggta aaatagtgca gaatgcagga gaacaatgtg attcattttt tactcgagtt 540 cgcgaacaag ccaagagatg tgcttttagt gttttacacg actccatggt caaggataaa 600 atcatcgctg gcacgattta cacgaatctt attccgcaac tactcaatga tgacaacgac 660 cttcagaaaa cgattgacat agtgcgaaat tatgaacaat cggtgaaaca aaccaaagtg 720 atgctcgaga aatcggtttt ggaagtggac gccgtgcgtc agaagaagta cgagagcaat 780 gtacaaagtg atgcaccttt tccatgcaac cgttgcggct tggagcatcg gcgaagaatg 840 tgcccggcct tcaacaaaaa gtgcttgaag tgtgggaaga aggggcattt cgcggatcgg 900 tgttttggtg gtggtgcttc gggaggcagt agtgtttcga gtggcagtat ggcgtccggt 960 ggttcgatca aaacgggaaa taacagatac gtgaaaacag ttgaggttga agaagaagaa 1020 gagctgtcgg tggaggagct gtatatagca gctgttgatg acgacgacga aaacagtgat 1080 gaagtgtggt atgaaacggt atcaatcaac gggaagagtg taacactgaa actggacagt 1140 ggagctgcgt gtaatgtgct accgtggaat atttttcgaa aactgggtaa agaattacaa 1200 caatcgaaaa cgaaaaggct gatatcgtac tcaaatcata aagtgaatgt gaagggtgaa 1260 gcagaacttc cagtagtagt gagagggcgc aaggaaactg cgacctttaa agtgatcgaa 1320 ggtgatatga tgccaattct gggaagaaaa acaagtgtcc gtttcaaact aattgcgcgt 1380 atcgatgaag tcaacatgga gaagtctctg ttcaatggac tggggtgcat caaaggattc 1440 gtctacgaca tcgatcttgt ggaaaaccca atgttcagaa atgaatcacc cagacgaatt 1500 ccacatgcac ttcgatcggc ggtgaaagca gaattggatt cgatgttgca gatgggagtt 1560 atcgaaccaa ttacagagcc tacaccggtg ttaaacgcca tggtcatagt acggcagaaa 1620 ggtaagctac gaatttgtat tgatccatcg gaagtaaata agaacctcct tcgccgagtg 1680 catccgcttt caaccgtgga agaaatttca gcacgaattt gtggctccaa acatttttcg 1740 atcttggata tgaagaaggg tttttggcaa atacccgtat cggagcgcac gaaaaaatat 1800 ttggcttttg gcacaccgtg ggggagatac acttgtagaa gattaccatt cggtttagca 1860 tcagcgccgg agatattcca aaacttgatg agttcactac tagaaggaat tgatggggtt 1920 gaaagctcaa tggacgacat tctcatccat gcgccaacag aggaaaagtt gaacaagatt 1980 accaaattgg tcttgaaacg aattgaagac gcagggctaa agctgaacaa agacaaatgt 2040 atttttaaca aacctacagt aaaatttttg ggacatttag tcacggatgt agggttaaaa 2100 gctgatcctg aaaagctgga agtcatccaa aaattgaagc ggccgacaaa taaattgcag 2160 cttcaaagaa ttttgggtac aataacctac ctggggaaat ttgtcgaaaa cttgtcagct 2220 ataaccgagc ccctaaggag attgttagta aaagatgtag aatggatttg ggatcatgac 2280 caagaagaag cttttttgaa gattaaacaa atcatgatga caccaccggt tctagcgtat 2340 tatgacgtca attctgatgt cacgctatcg gtagatgcca gctctaaggc attcggtgca 2400 gtactgttgc aaaatgacaa gccagttgct tatgcgtcaa aatcgttgac aaaagcacaa 2460 gaaaattatc cgcaaataga aaaggaggca gccgctattc gatttgcgtg taatagattt 2520 cacgactata tctatggtaa aaagataaaa atcgagaccg accacaagcc attggaatcg 2580 atattcaaaa aatcgattga tagagctcct cctaggctaa aacgtattat gctggatgta 2640 gttcaatatg cacccgaaat aatgtacaaa aggggatctg atattcctct tccagatatc 2700 cttagtagag atatggacaa tttggaatca agggatcctt cagatgagtt agaagtccac 2760 atcatattgc caatgtcgaa gccagcgcga caagaaatag ttgaagaatg tttgagggac 2820 ccagaaattg tcaaattgac acaagtcatc atgaatggct ggccgaatga aaagaggttg 2880 gtcccagatg ttcttcggaa gtactggtct ttcagagatg aaatggcggt atatgaagga 2940 ctagtatttc ggtcacatca aattttgatt ccaaagagtt tgaggaacaa gatgttgaag 3000 gagattcata agggccacac cggaatccaa ggctgcatgc gtcgtgcgaa gcaatctttg 3060 ttctgggttg gaatgaccac tgagatagca ggtatgatag agcaatgcgc cgtctgtgag 3120 aagcatcaga gatccaacgt caaacatgaa ctgatcaata acgaaattcc gacgctgccg 3180 tttgagatcg tgtcgtctga tttgtttcat ttttgtggca aagagtactt gctcatagct 3240 gacagctatt caggattttt tgatttcgct cagctaaggg aaccaacagc gagagtagtc 3300 atcgagcagc tgaaacgttg gtttgcaaca catggcattc caagagtgtt gtataccgat 3360 aatgggccgc aatattcagc caatgagttt gccacgttta gccgactgtg gtcgttcgat 3420 catgttactt ccagccccca tttccctagg agtaacggtc ttgcagaaaa atttgtgcaa 3480 actgcaaaaa atttgcttaa gcgatgcgcg gaggatggtt cagacgtcca gttggctcta 3540 cttttgatta gaaatactcc aagagatgaa aagttgcagt cgccgagtca aaggttgatg 3600 aatcgtacat tgcgttctac cattccagca gcagatgtcg ttctcaagcc tcgagtaatc 3660 aaccaagtta cggaaaatct gacgcagaaa aggctgcaac agaagaaata tgccgacaga 3720 agttcagtca agcctcagga gtatcacgta ggacaagccg tcatgctgag agatgagaag 3780 tcaagttttt ggaagggagg aagaataacg gagaaactag agcagccaag atcatacatg 3840 gtacaaatcg gagatgggaa agtggttcga agaaacgtac gagacatcag agatatgaag 3900 gcccgagata gttgcttcga ggtaaccgga aacgagatcc agtctgcggt gaactaccat 3960 caagatcaac gtgaagcaaa tccagaggac gatgaagaac ttggcggaac acagcaaggt 4020 cggcaatcga ttgtgccgtt acgcccatca acggacagtc ctattaggac aagatgtgga 4080 agaatcattc gttccaatcg gaatagtgaa tttgaatact attgatttgc ttaaatgggg 4140 aga 4143 // ID Crack-30_AAe repbase; DNA; INV; 4633 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-30_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4633 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1246-1246 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 867..1343 FT /product="Crack-30_AAe_1p" FT /translation="MNVVGAPGKDDGIEEAKRIQSKNVSRNAVPIIVRFSN FT ENTKEAFFEKKRAYGVLLASSICEAFAGSSNRITVRDEITSYGKDLLXKTK FT EFQTSLDIKYVWMGRDGKVLIKRQDGAKVEKISSCEDLEKLSKLSQKRLLN FT LSGASSTSSPKGPAAKRVSA" FT CDS 1476..4235 FT /product="Crack-30_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNSLEKFDCMKELLSAYTEEIDIVVIGETWLKEARCQ FT LYSIDGYXAVFSCRSESDGGGLAVYVRQSLAFXNLSNEHLRVAQNGSHVDI FT HAMYRPPSFLIPQFFDILETVLSSTKGGAPCIIVGDXNLATNQSNNRSVEE FT YLXILNCXNFXVTNTHPTRPTSNNILDHAVCSEALSQNVVNETIFSEISDH FT CLVLTTLQNCRTVRTCTLEKVIVDHTKLNDDFATAARVLPPRMTATQKLQH FT VINLYHQLKERCSRKVSVQAKIKGSCPWMTFDLWKMMQLKDKILSRSRRYP FT QDSRVQEQLGHVSKLLQRTKELAKRSYYGKLFENPNPKNMWKILDQVLGRN FT INESQXIELEHNGRSVKDPLGVAEIFNNFFSDIGPQLASSISSQREINKFG FT SLRPIRSSIYLKPATKQEVILQIKDLDNNKSAGPDGISATFLKTHHDFFAN FT LLANVFNESINSGSYPDFLKTARVIPIHKAGSKKDPSNYRPISVLSVFNKI FT LERLLVDRLNSFFKINDVLYHYQYGFRGGSNTLTATSELVDDIYKAMDKRK FT VAGILFLDLKKAFDTIDHDVLLQKLDSYGIRGPANRLIKSYLTGRRQYVXL FT QNCASSNRNLSVGVPQGSNLGPLLFLLYINDLPHLPLHGKPRLFADDTSIL FT YICNEADQIIDFMKQDLEKLTEFFSENLLSLNLNKTKYVLFHSPRLTITKN FT IDLVVNCTKIDEVNCYKHLGITLDSSLSWSNHIAILQKDVSTTCGVLWKIS FT KFVPSKQLLLLYHAFIQSKICYLVXVWGAASKTKLKIIQTIQNRCLKVIYR FT KPRLYPTIDLYSEADKSVLPVAALRDLQTLIQMQNLLHNPLALHNQEFPRT FT SHRYETRNPPALVISRSNTEFGKKSYSHYGKSRYNALPIHIRAEINPLQFK FT HRVIAWIRSQLSNFIL" XX SQ Sequence 4633 BP; 1460 A; 976 C; 909 G; 1265 T; 23 other; ttggcaacac tggattcaca ttgaaaggct mttttgcaat ccgttctawg aatcgaaata 60 tcagttgatt tcaacatcga aaattcctgc atcaggcatt agaaggcaat tcggagcagg 120 cagcwggggt cccatcgtta tctcggaggt ttgaatcgtt tgcaccaagt tctggccagg 180 tatttatacg aatcatcatc aatcatcaat tcgtttacac ctcgtcatca gtmggccgac 240 ttctgtatgc attgaacgtg ctattggtgc tatcggtgct gtcgtwcacc aaataggatt 300 gtgctagcct attgtgttat attttggtct gttgatgtct gcttaggcac tgatgtacga 360 catggtcaac tcggaagaag aaagtgtgtg cttcatttgc gacaaatctg aaaacgatgc 420 gtcgcaaatg attgaatgtg cctactgtaa taaatsttgt cactttcgtt gmaagaaact 480 gcacgggaat gcgatctaat caatcgaaaw cgcatagtga tgtggacgtg gttgctgagt 540 tgcgattgct cacggagact gtgcttgaga tcaaacagga gtctatggtt tctcgttcgg 600 cgtttgaaga gaccagactt cagatggctg catccatgaa aacaatggaa accamtaaac 660 atatagaaga gtgataagcg agacaacatc aagctgtgga aagatgtcaa tatgctgcaa 720 gaccagaatc accaactggt aacgactgtg aatcatttgg aactcgaact agatagagtc 780 aatcgtgcca tgctctccaa aaatgccgtt attctaggac tcccgtcctt ggaaaacgag 840 aatacaagga actagtgcgc catttgatga acgtcgtagg agccccaggc aaagatgatg 900 gcatcgaaga agctaagcgg attcaaagca aaaatgtcag caggaatgcg gttccgatta 960 tcgtacgttt cagtaacgaa aacaccaaag aggctttctt tgagaagaaa cgtgcatatg 1020 gagttttgct ggcgtcgtcg atttgtgaag ctttcgctgg gtcatcgaat cgtatcaccg 1080 tccgtgatga aataacgtcg tatggaaagg accttctcca kaagactaag gagtttcaaa 1140 cttcgctgga catcaagtat gtmtggatgg gcagagacgg aaaagttctg attaagcgac 1200 aagatggtgc aaaagtggaa aaaatatcct cttgtgaaga ccttgaaaag ctctccaagt 1260 tgtctcaaaa acgactacta aacttatccg gtgccagctc aacatcatca cccaaagggc 1320 cagcagcaaa acgggtttcg gcttaatgaa tatcgttcwt agagcgtagc tgataatatt 1380 actgaattca acaataaaaa ttactatttt aattctatat ctgakctgaa taaaaatatt 1440 gtaaacttat cwaaccgtac atcaatattc gaggcatgaa ctcacttgag aaatttgatt 1500 gcatgaaaga acttctttcg gcgtatactg aagagattga tattgtagtt attggggaaa 1560 cttggttgaa ggaagcacgt tgtcaacttt acagcataga tggatatwac gctgtttttt 1620 catgccgttc tgaatctgat ggaggtggat tagctgtata tgttcgtcag tcgctagcgt 1680 ttgmaaatct ttcaaatgag catctacgag tcgcacagaa tggttctcat gttgatatcc 1740 acgcaatgta caggccacca tcgtttctta ttccacagtt ttttgatatc ctggagacag 1800 ttttatcatc cacaaaagga ggtgctccct gtattatagt tggtgacwtt aatctggcaa 1860 caaatcaatc aaataaccga tcagttgagg aatacctgaa mattttaaac tgctwtaact 1920 tcasagtcac aaacacacac ccaacaaggc ctactagcaa taatattctc gatcatgcag 1980 tatgttctga agcgctttcc caaaatgtag tcaatgaaac aattttctcc gaaattagtg 2040 atcattgttt agtgctcacg acattacaga actgcaggac tgttcgcacc tgtactttgg 2100 aaaaggtaat agtagaccat acaaaactta acgatgattt cgccactgct gcgcgagtcc 2160 ttcctccaag aatgacagcc acgcaaaaat tacaacatgt cattaacttg tatcaccagt 2220 tgaaggaaag gtgctcgaga aaggtatccg ttcaagcgaa gattaaaggt tcatgtccct 2280 ggatgacgtt tgatctttgg aaaatgatgc agttgaagga caagattcta agcagaagta 2340 ggagatatcc tcaggatagc cgcgtacaag agcaacttgg tcatgtgtcg aaattgctac 2400 agcgtactaa agaactagct aaaagatcgt actacggaaa actgtttgaa aatccaaacc 2460 cgaaaaatat gtggaaaatt ctcgatcaag ttcttggtcg caacatcaat gaatcccaas 2520 agattgaact tgaacataac ggacgtagtg ttaaggatcc tctgggcgtt gcggaaattt 2580 tcaacaactt cttcagtgat attggacccc agctagcctc cagcatttcc agtcaacgag 2640 aaatcaacaa gtttggtagt ctgcgaccta tcagaagctc catttacctg aagcccgcta 2700 ctaaacaaga agttatcctc cagatcaagg atttagacaa caacaaaagt gctggaccag 2760 atgggatatc cgcaacattt ttgaagactc atcatgactt tttcgccaac cttcttgcaa 2820 atgtgttcaa tgaaagcatc aactctggtt cttacccgga cttccttaaa accgcaagag 2880 tgataccaat tcacaaagct ggctccaaaa aagatccatc caactaccga cccatctccg 2940 ttctctcggt tttcaacaaa attttggaac gtctcctggt tgacagactc aacagctttt 3000 ttaaaatcaa cgatgtcctt taccattatc aatacggttt tcgtggtggt tcaaatacgt 3060 taactgctac cagtgaacta gtggatgaca tctataaggc tatggataag cggaaagtag 3120 ccggaatcct gtttctagat ctaaaaaaag cgttcgatac gattgatcac gatgttttgt 3180 tacaaaaact tgacagctac ggtataaggg gaccagcaaa cagattgata aaaagctatt 3240 taactggaag aagacaatac gttcwactac aaaactgtgc cagtagtaac agaaatctgt 3300 cagtaggagt accacagggc agcaatttgg gcccacttct attcctattg tacatcaacg 3360 acttgccgca tcttccctta cacggaaaac ctcgtttatt tgcggacgac acctctatct 3420 tgtacatatg taacgaagct gaccaaatca tcgatttcat gaaacaagat ctggagaagc 3480 tcacagaatt cttttctgaa aatttgcttt ctttgaacct gaacaaaacg aaatatgttt 3540 tgtttcactc accccgtctt accattacga aaaatatcga cctggtggtg aattgcacta 3600 aaattgatga agtcaattgt tacaaacatc tgggcataac actcgattct tcattgtcct 3660 ggagtaatca catcgccata ctacaaaaag atgtcagtac aacatgtgga gtattatgga 3720 agatatctaa atttgtacca tcaaaacaac tgttgctact gtatcatgct tttatacaat 3780 ctaaaatctg ctatcttgtc tmagtatggg gagcagctag caaaacgaag ctgaagataa 3840 tccaaaccat acaaaatcgt tgtctgaagg tcatttatag aaaaccgagg ctgtatccaa 3900 ccattgacct atattcagaa gcagataaat cagtcttgcc agttgctgct ttacgagatc 3960 tgcaaactct aattcaaatg cagaatctgt tacataatcc gttagcacta cacaaccaag 4020 agtttccacg cacttctcac cgttacgaaa ctaggaatcc tccggcctta gttataagcc 4080 gttcaaatac tgaatttggg aaaaaatctt atagtcacta tggtaaatca aggtacaatg 4140 ccttgccaat acatataagg gctgagataa acccacttca attcaaacac cgtgtaattg 4200 catggattag atcacagctt tcaaacttta tattgtaaat ctatgagtcc agtgtatcac 4260 tcgtaaactg ccatcaatgc tcccttcaaa gagttaaggc tcactgggag caaaaagccg 4320 tagtttcact aaatactaat gctgcaaata cgattttttt atctcttccg cctcctaccg 4380 cctagcaacg ccaccctata ctgccacccg ctgctgccgc ctaccaccgc cacccactcc 4440 aacaacgcca ccctccgcca ttcaacgttc tccatcaact ttaatacgtt attgaactaa 4500 ttgagaaaaa atgtaaaaca tgtatttaat gaaaagatga gaggttttgt gccttttgga 4560 ggaacaaact tgaaaaaagt ttacctccag ggggttttcc tactccttaa taaataaata 4620 aataaaataa ata 4633 // ID LOA_Ele5 repbase; DNA; INV; 4482 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A LOA clade non-LTR retrotransposon family from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; Lian; KW LOA_Ele5. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4482 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4482 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >98% identity, and ~99% identical to the original CC sequence in [1]. The consensus is ~76% identical to Lian-5_AAe. CC Alignment with other Lian elements shows that this family has an CC internal deletion corresponding to ORF1. XX FH Key Location/Qualifiers FT CDS 689..4351 FT /product="LOA_Ele5_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MKFIKFLQVNLHHAKGASSVLSRRFRKEGLDVALIQE FT PWYNKGKILGIETNKCNLFYDGNQSSPRTAVLINNTLKCYPLTEFIQRDIV FT AVMVEVPTTRGITEVVIASAYFPGDALEVPPPEIVSFVNHCRKINKSFIIG FT CDANAHHTVWGSTDINSRGEYLLDYLCSSNIDICNKGNEPTFSNAIRQEVL FT DLTLCNAAIYDKISNWHVSEETSLSDHKHIMFEWSGGDYPRIAFRNPRKTN FT WEQYAQCLNSNSFTKDENIDSIQKLESFSQEVKGKIIHSFHTNCPMKQSSS FT SRDVPWWNTKLERLRKLSRKLFNKAKQTSDWAQYRRALTEYNREIRKSKRR FT SWIQTCENISATPVVARLQKALAKDHTFKLGSLKREDGVSTKTPDETLNLI FT METHFPGSISCVDSNSSEPIEYERCSVEALSSDRTVISEPDLADEIFTNAR FT VKHAIRSFQPYKSAGVDGIFPALIQNGEAVLISPLIEMFKASLRLNYVPSD FT WRAVKVIFIPKIGKRDKMHPKSYRPISLSSILLKTMEKVLYDYINSTYMLK FT CPLSDSQFAYQSGKSTVTALHTLVTKVEKTLSVKEIALCSFLDIEGAFDNA FT SYSSMTRAMKKRNFNKNIINWINNMLAKREITSELGSSSITVRATKGCPQG FT GVLSPLLWSLVVDDLLKSLEEKGFEVVGFADDIVIIVRGKFDNIISERMQL FT ALNLTNSWCIQEGLNINPSKVVIVPFTKKRKFILKALKLGGVQMHLSEQVK FT YLGVILDSKLNWNAHIEYVTNKATSSFWACSKIFGRRWGLKPKMIMWIYLA FT IIRPKVTYASLVWWPKTKEATTQAKLSKVQRLVSIAITGAMQSTPSKALDA FT ILHLLPLYEFVQLEAEKSFLRLNRLNKFKSGDLVGHLNISQHIHCGPVMTM FT HEDWMKTVDNSDIPYNVCDTTRTDWDVGGPNVRQGSIKFFTDGSKIGTQTG FT AGIYGPGTNVSVAMGSYPTVFQAEIFAILECANICLKRKYRHANICIFSDS FT QAALKALGAYKCSSRLVWECILSLRKVCQENSVCLYWVPGHNGIVGNEKAD FT ELAKQGSNTQFIGPEPFCGISYCALNMELKSWERQRVMTNWMVTTRCNQSK FT RFLVPNETITKRLLELNKRVLCIYTGLLTGHCPSRYHLKNIGQVQNDICRF FT CNMERETSEHLLCSCSALYKRRSKFLNGGYLPPSEIWTGNPSKVVGFINSI FT LPDWEKTRQSA" XX SQ Sequence 4482 BP; 1475 A; 779 C; 918 G; 1310 T; 0 other; gcccactgtt ctcaagtgca caagtgtttt tccggctcac atagaaattg actttctatg 60 tttaattacc ggtataaaaa tcgtgagtgg cagtgcaaaa ggtgcattag ataccgcttt 120 gaagattttc gcgtgtgtaa gagtgagaaa cagtgaaaat aaacgacttt tcctcataca 180 aaaatctggt gtgacggtaa ataaaagata taaaacgatt aaagaacacg gccatcatgc 240 atgatgatag tgacagcaag aattttcttg catagcatgc tattatatgc ttggatggta 300 agttgcatgc aacattttgt caaaacttgg atgatatgac atctgtgaaa cttgcatcaa 360 atcagtgctg caagtttcgc tggaaaattg taagcttaaa ataaaagtaa aaaaaaaaaa 420 tgtttttttt ttaataacag taaaccataa aatatattaa actcattatt cattatcatt 480 attgtttttt tttttaactt attgacattt ttttttttaa ttcttattta tttttttttt 540 ttttgtggat gataaacgtc ttgattagtt ttaaaagttg tgtcacaggt gacatggttg 600 gtcataattc ttcaagagta ttacgtcata aaggttgcct aagacacaca acaataaaac 660 attttctgtg acataaataa tcacaaatat gaagttcatc aaattcctgc aggtgaacct 720 ccatcatgca aaaggtgcat cttctgtttt gagtcggagg ttcagaaagg agggattgga 780 cgtggcacta atacaagagc catggtataa taaaggaaaa atacttggaa ttgaaaccaa 840 taagtgtaac ttattttatg atggtaatca gagttcaccc agaactgctg tactcattaa 900 taacactttg aaatgctacc ctttaactga atttatacaa cgtgacattg ttgcggtcat 960 ggttgaggta ccgaccacta gaggaattac agaggttgta atagcctcag cttatttccc 1020 tggtgacgct ttagaggttc ctcctcctga gatcgtttca ttcgtcaatc attgcagaaa 1080 aatcaataag tcgttcatta ttggctgtga tgctaatgca catcataccg tatggggtag 1140 tacggatatt aatagccgtg gtgaatatct tctggactac ctgtgttcta gtaacataga 1200 tatatgtaat aagggtaatg aacctacctt ttcaaatgca atcagacagg aggttttgga 1260 tttgacattg tgtaatgctg caatttatga taaaatttca aactggcacg tgtcagaaga 1320 aacttccctg tccgatcata agcacattat gtttgagtgg agcggaggtg attatccgcg 1380 tattgctttt cgaaatccca ggaaaaccaa ctgggaacaa tatgcacagt gcttgaattc 1440 caactcattt acaaaggatg aaaacattga ttctattcaa aaactagagt cgttttctca 1500 agaagtaaag gggaaaatta ttcattcgtt ccatactaat tgtcctatga agcaatcttc 1560 ttctagtaga gacgtgcctt ggtggaacac taaacttgaa aggttacgaa aactttctcg 1620 taaacttttt aataaagcga aacaaacttc agactgggct caatacagga gagctttgac 1680 tgaatacaac agggaaataa gaaaatctaa gagaagatct tggattcaaa catgtgaaaa 1740 tataagtgct actcctgtag tcgcaagact acagaaagcg ctggccaaag atcacacgtt 1800 caagctaggg tcgttgaaac gagaggacgg agtttctacg aaaactccgg atgaaacctt 1860 aaatttgatt atggaaactc attttcctgg gtcaatttct tgtgtggatt caaacagctc 1920 tgaaccaatc gagtatgaaa ggtgttctgt cgaagctttg tcatctgata gaacagtaat 1980 ctctgaacca gatcttgcag acgaaatatt tacaaatgca agagtgaaac acgctataag 2040 atcttttcaa ccttacaaat ctgcaggagt ggacggaata ttcccagcac tgattcaaaa 2100 cggagaagca gtgctgattt cgcctctaat tgagatgttt aaggcgagct taagattaaa 2160 ctatgttcca tctgattgga gagcagtgaa ggttatcttt attcctaaaa taggaaaacg 2220 agataaaatg catccaaagt catatagacc tattagtctc tcatcaattt tgttaaagac 2280 tatggaaaaa gtgttgtatg attacataaa ttcaacatac atgctcaagt gtccattatc 2340 tgactcgcag tttgcttatc aatctggtaa gtcaacagtt acggcacttc atacattggt 2400 aacaaaagtg gaaaaaactc tttcagtgaa agaaatagca ctttgctcat tcttggacat 2460 tgaaggagcc ttcgacaacg cttcctattc ttcaatgact cgtgcaatga agaaaaggaa 2520 tttcaacaag aatatcatta actggatcaa taatatgctt gcaaaaagag aaatcacttc 2580 cgagctggga agttcgtcta taacagtaag ggctacaaaa ggttgtccac aaggaggagt 2640 cctctcgcct ctattgtggt ctttagtagt tgacgatctt cttaaaagct tggaagaaaa 2700 aggtttcgaa gttgtaggat ttgcagatga tatagtcatt attgtgagag gaaagtttga 2760 caatatcatt tcagaacgaa tgcaactggc cctaaactta accaattctt ggtgtattca 2820 ggagggcctt aacataaatc cttcaaaagt agtaattgtc ccttttacta agaaaaggaa 2880 gttcatccta aaagctttga agcttggagg agtacaaatg caccttagtg aacaagtcaa 2940 atacttagga gtaattttgg attctaaact gaattggaat gctcatattg aatatgtgac 3000 taataaagct acaagttcat tttgggcatg ctccaaaatt tttggcagaa gatggggctt 3060 gaaaccaaaa atgataatgt ggatttattt ggccataatt cgtccaaaag taacatatgc 3120 ttcgctagtg tggtggccaa aaacaaaaga ggccacaaca caagcaaaac tatcaaaagt 3180 acaacgtctt gtgtccatcg ctataacggg agcgatgcaa agcactccgt caaaagcatt 3240 agatgctatt cttcatctgc taccgttgta cgaatttgta caattagaag cggagaagag 3300 ttttctaagg cttaatagat taaataagtt caaatcaggt gatcttgttg ggcacttgaa 3360 tatttcacaa catatccatt gtgggccagt gatgactatg catgaggact ggatgaaaac 3420 tgtggacaat tctgatatcc cttacaatgt atgcgatacg acgcgtacgg attgggatgt 3480 tggaggtccc aatgttcgtc aaggctccat aaaattcttc acagatggct caaaaattgg 3540 gacacaaact ggagcaggaa tctacggccc tgggacaaac gtctccgtgg cgatgggaag 3600 ctatccaaca gtattccaag cggagatttt tgccatacta gaatgcgcta atatttgtct 3660 taagagaaaa tacagacatg caaatatctg tattttttct gatagtcaag ctgcgttgaa 3720 ggcattgggc gcatataaat gctcatcaag acttgtctgg gaatgcattc tctcattgcg 3780 aaaggtgtgc caagaaaact ctgtatgttt gtattgggtt ccgggacaca atggcattgt 3840 aggaaatgaa aaggcagacg agcttgctaa acagggttca aatacacagt tcattggacc 3900 ggagccattc tgtggcatat cgtactgtgc actaaatatg gaactaaaaa gctgggaacg 3960 acaaagggtg atgaccaatt ggatggtcac cacaaggtgt aatcaatcta aaagattttt 4020 agttccaaat gaaactatca cgaaaagact cttagagctc aacaagagag ttctttgtat 4080 atacactggc ctattaactg gacactgtcc gagtagatat catttaaaga atattggtca 4140 ggttcagaat gatatctgtc gtttctgtaa catggaacgc gaaacctcgg aacacctgct 4200 ctgcagttgt agtgcattat acaagcgcag atccaaattt ctaaacgggg gttatttacc 4260 accaagtgaa atttggactg gaaatcccag taaggttgtg ggttttatta actctatttt 4320 gcctgactgg gaaaagacgc gtcaaagtgc ttaagctgtt cacttgattc atggtggaca 4380 gttttagttc cttcaatgat gcgaatagta aaaagggaca tgccacaaaa gttcaatcca 4440 atggacgcag tggcttaacc ttcccaacaa aaaaaaaaaa aa 4482 // ID BEL-608_AA-I repbase; DNA; INV; 6427 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-608_AA_; KW BEL-608_AA-LTR; Pao_Bel_Ele157; BEL-608_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6427 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5484-6041] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 1550..3034 FT /product="BEL-608_AA-I_1p" FT /translation="MPRELPDFYGDPEEWPLFISSLRNSTTACGYSRAENL FT ARLQRCLKGNALKSVRYNLLDPDSVPEVIRTLQTLYGRPEVIISKLIKTVR FT DTPPPKSERLETLIDFGMAVRNLVSHLIAADQRSHLSNPVLLQELVEKLPA FT SVKMQWAQHLAEFPEATLQTFSNFMISVVESVSKVVLYTGYQSQKHEKPKS FT RERGYMHSHVEAMDPGQPSQYKSGVPKPCLICGKGGHHIKDCESFKKSSTD FT SRWKTVSSLKLCRCCLGQHGRRACRSSARCEIDGCQFRHHPLLHTKSNTNN FT PSTTRTTESHTYHHCGQSVLFRIIPVTISGPTKSIDTFAFLDEGSSATLVE FT QSLAKQLNLQGPMIPLCLKWTANMSRSEESSQIVSFEISELARKKRFRLEN FT ARTVKQLNLPSQTIRFEELQEQYHHLVGLPIRSYEKAVPRLLIGLRNLSLA FT VPQRIKEGKRGPIAVKTRVGWCVYGSLTGALRVKPLQLPHLRMRRRRKVR" FT CDS join(3219..3869,3873..6425) FT /product="BEL-608_AA-I_2p" FT /translation="MAFQRLKCLEGRMSRDPVLKENLHRQLQEYQNKVYAH FT QATDDELNDTDLRRTWYLPLGAVVNPKKPNKVRLIWDASAKVDGVSLNTFL FT LPGPDLLVPLPSVLFRFRQYSVAACGDIKEMFHQIHVTAADRQAQRFLWRD FT TEAEPPTIFLMDVLTFGSTSSPTSAQFVKNRNAKDHEAQFPRAAEGIIKGH FT YVDDCLDSFEDEQEAKLVVEQVRHVHSGGFEIRNWSSNSKAVLEHLGQESR FT PTMKDLTAAGFSESERVLGMLWMTETDMLCFSTTFRPEIDALIRSNTRPTK FT RQVLKTVMSLFDPLGLLASFLVHGKIIMQDVWRSGVKWDECVDDQIEQRWR FT KWIELFERVGEIQIPRCYFESVNTGRYRTLQAHLFVDASEAAYSAVVYFRI FT LDAGGNPQCSLVAAKTKVAPLKYVSIPRLELMAAVLGARLLTFVGENHTIP FT IHQRFCWSDSNTVLAWLRADHRRYTQFVACRVGEILTLTQENEWRWVPSRF FT NVADEATKWGKGPCFDAESRWINGPEFLGLSEDQWPRSTALNPATDEELRP FT CHLHHEEAQSLIDFERFSNWNRLLRTLAFVFHVFTIRKARETNAKVGKEPT FT HEDLKAAEIQILKIVQWSVYPDEMSIISKNQDLPVDQQRPIDKSSFLYKLS FT PMMDQNGVLRIDSRTGAARVDAFDLKYPVILPRKHHVTYLLIDHYHRKYLH FT CNAETIVNELRQKYYIPRIRVAVRTVTKLCQWCKVYKVQPSVPRMGPLPEA FT RLSPGVRPFSYIGIDYFGPILVKVGRSNAKRWICLITCLTIRAVHVEVAYD FT LSTQSCVACIRRFVCRRGAPLEIYSDNGRNFVGADRILRDQIKRIDEATAA FT TFTNALTKWFFIPPSAPNMGGSWERLVRSVKAALTNIPQDRKLDDEALLTY FT LAEAESIVNSRPLTYLPLDAPEQEALTPNHFLLGSSSGVKQPPANLGSSRI FT VRNTWDTLQANLNHFWTRWIREYLPTLTRRTKWFGNVKSVGLNDLVVIIED FT NKRNGWTRGKILEVVKGRDGIVRQAVVQTSSGVFRRPVSKLAVLDVAGSSK FT AEADTHPYGEG" XX SQ Sequence 6427 BP; 1822 A; 1503 C; 1589 G; 1509 T; 4 other; aaaacttaaa gaaattcgac acagaatgtc gagtacgaac tccgataacg atgatgccta 60 cgtctgtgct ggatgcgatc gccccgagac cgccgatgac atcgtacagt gcgatacatg 120 ttctacttgg tggcactttc tcctgtgcca aggtagacgc atcggtagca gataagtcca 180 gttggatatg cgtcaaatgt ttgcctccgc cgcctccatc ccgtccggca tctgttcgca 240 ctacatcatc gaatcggaga gcccttctgg agatcagctt acgaagattg gcagaagaaa 300 aggagctgcg caaaaaagag ctggagatca acatggagag agaatttgtg aaggcaaagt 360 acgatcttct ggagcagtgt gcgttggacg aggaaacaga aactcgcagc gtccgcagca 420 gagtagagct tagtgacatt tgaacacacg tagactagca tacactcgtc atacacagtt 480 tgtttttgtt ttcctcaaag aaacttgcac accctttcca gattcatcaa aatgcatatg 540 gaaatattac ccaatttgaa aaatttacac ccggattctg ggtgtttttc gagacagagt 600 gcatattgtt ttcctctaag gttcacaact acttctacac ttgggcaccc ataccattgg 660 gcgtgataac cactatagaa cagattgaag cccgagatcg agagaaccaa gttcacgaat 720 gggtcggcca agcagcagcg tcgctccgtg agagcagatc gctcactgac ccgacgacac 780 cctctttccc tgtggacatc agtactaaca gaacccccgt cggtcaatat catgacgatg 840 aagacggtgc gataggcaca accgaaacaa acgctggtcg ttccacgcca aaaccaaaca 900 cagatgtcga tcgagctgac aatctgaatg atgatgcgtt gagcgaggtg aacgattgca 960 acaagagcaa aaatatttcg aagcgcagag ttgtcattaa acgtagccat aagggagacg 1020 acgagaagat gacgaaagga acctctgaag actaccagct attaaaatcg cagctcaaaa 1080 aaaaaaatgt caggaagcat cgaagctcga tcgccaggct ctgcaagatc ttcaagacca 1140 gcttcgggaa tgtcagctga ggctgcggca gcgcagtgtg gaaccccggg cgctcgaaaa 1200 tcaatcaatc tggccgctta gagatgtgtc gaggggagca attgctaaaa ctaatcttag 1260 gccagcgccg acggaatcct ctttacagca aatgaatagg aatcaggcca gggtaaccga 1320 gaatgtagtt ccggtgtccg atacccgcga ttttgattcg ataggcccct tccaacacga 1380 aggtcgtggt acgcacgagc agtcgcgtaa caacttcgcg cgctcagstg agatcggcct 1440 tgtcagcacc ccacaaagac cgccgccagc caatcacggt caaacatcgg attcattgtt 1500 tctggaagtg cgtcgaccgt caccagaaca gctggcagca aggcaggtga tgcctcgaga 1560 attgccagat ttctatggcg accccgaaga atggccgtta tttatcagca gtcttcgcaa 1620 cagcactaca gcatgtgggt acagcagagc agaaaacctg gcacggttgc agcgttgctt 1680 gaagggcaat gcactcaagt cggtgagata caacctgctg gatcccgact cggtgcctga 1740 agtgatacga acgctacaaa cactctacgg acgtcccgaa gtgataatta gcaaactaat 1800 aaaaaccgtt cgtgatacac cgcctccgaa gtcagagcgg ttagaaaccc tgattgattt 1860 cggcatggct gtcaggaact tggttagcca ccttatcgct gcagaccaac gcagtcacct 1920 ttccaatccc gttctgttac aagagctggt agaaaagcta ccggccagtg taaaaatgca 1980 atgggcacaa catttagcgg agtttcccga agcgacgcta caaaccttca gtaatttcat 2040 gatatcagtc gtggaatcgg tgagcaaggt cgttttgtac accggttatc aaagccagaa 2100 acatgagaag ccgaaatcca gagaaagagg atatatgcat tcgcatgtag aagcgatgga 2160 tcctgggcaa ccgagccagt ataagagtgg tgttccaaaa ccctgtctga tctgcgggaa 2220 aggcggtcac catatcaaag attgcgagtc gttcaagaaa agtagcacag atagccgttg 2280 gaagaccgtc tcatcattga agctttgtcg ttgctgtttg ggacaacatg gacggagagc 2340 gtgtagaagt tcagctcgtt gcgagattga tgggtgccag ttccgacacc atccgctact 2400 gcacacgaag agcaacacca acaatccttc aactacgaga acgacggaaa gtcacacgta 2460 ccatcactgt gggcagtccg ttttattccg tattattcca gtgactatat ccggcccgac 2520 aaaatcgatt gacaccttcg cttttctaga cgagggttct tcagcaacgt tggtggagca 2580 aagtctggcg aaacaactaa accttcaagg tcctatgatt cccctttgtt tgaaatggac 2640 cgccaatatg tcccgttctg aggaaagttc tcaaatagtt tcctttgaaa tatcggagct 2700 agctagaaag aagcggttcc gtttggaaaa tgccaggacc gtgaaacagc tgaacctgcc 2760 ttctcaaaca attcgttttg aagaactcca agaacaatat catcatctcg ttggtttgcc 2820 tatccgcagt tatgaaaaag cggtacctcg attgctgata ggattgcgaa atttgtcact 2880 agcagtgcca caaaggatca aggaaggaaa gaggggacct atagcagtaa agacacgtgt 2940 tggttggtgc gtatacggga gcctaacggg agcattacga gtcaaaccac ttcaattacc 3000 acatttgcga atgcgacgcc ggagaaaagt tagatagcct gatccgtgag tattttgacg 3060 tggaagacgc cggactccga tcggtcaagc cgctggagtc aaatgaagtt caacgagcca 3120 aacaaattct ggaacaaact actcgccgga acgggaaccg ctttgctact gggctattgt 3180 ggaagtacga ccagtttgag cttcccgaaa gcttcccaat ggctttccaa cgtttgaagt 3240 gtttggaagg acgaatgtct cgcgatccag tattgaagga aaacttgcat aggcaactgc 3300 aggaatacca gaacaaagta tatgctcacc aagccacgga tgacgagctt aacgataccg 3360 atctccgacg tacktggtat cttccgctgg gtgcagtcgt caatccgaag aagcccaaca 3420 aggtkcggtt aatttgggac gcatctgcga aggttgatgg cgtgtcactg aacacatttc 3480 ttctccctgg ccccgatctt ttggtaccat tgccctccgt tctatttcga ttccgccaat 3540 actcggtggc agcatgcggt gatataaaag agatgtttca tcaaattcat gtcaccgctg 3600 ctgaccgtca agcacagcgt ttcctctggc gcgacaccga agcagaacct cccacaatct 3660 ttttgatgga tgtcttgaca tttggatcta caagctcgcc aacgtccgca caatttgtta 3720 aaaataggaa cgcaaaggat cacgaagcac agtttcctag agcggctgaa ggcattataa 3780 agggccatta tgtcgacgat tgcctcgaca gtttcgagga cgagcaagag gcaaagttag 3840 ttgtcgagca agtacgtcat gtacattcga mtggcggatt tgaaatccgc aattggtcta 3900 gtaacagtaa ggctgttttg gagcatctgg gacaagaatc taggccgacg atgaaagatt 3960 tgacagcagc tggatttagc gagtcggaaa gagtcctcgg catgctttgg atgacggaaa 4020 cagatatgtt atgtttctcc acaacgttta ggccggaaat tgatgcactc attcgttcga 4080 acaccaggcc gaccaaaaga caagtactca aaactgttat gagccttttc gacccgctcg 4140 gtctacttgc ctcattcttg gtacacggga aaatcattat gcaagatgta tggagaagcg 4200 gagttaaatg ggacgaatgt gtggatgacc aaattgaaca acgatggcgc aaatggatag 4260 agttgttcga acgagttggc gaaatccaga tacctaggtg ctatttcgaa tcggtcaaca 4320 cgggacggta cagaacactg caagcacatc tgttcgtaga cgccagtgaa gcagcctatt 4380 ctgctgttgt gtacttcagg attctggatg ctggaggcaa cccacaatgc tctctagtgg 4440 cagctaaaac caaggttgct cccttaaaat atgtgtcgat accacgtttg gaactaatgg 4500 cagccgtttt aggagcgcgt ttattaactt ttgtgggaga gaatcacacg atcccgatac 4560 atcaacgatt ctgttggtct gactccaata ctgtattagc ttggcttcgc gcagatcacc 4620 gaaggtatac gcaattcgtc gcctgtcgag tcggagaaat tctgacgttg acccaagaaa 4680 atgaatggag gtgggttcca agcagattca atgtcgctga tgaagccaca aaatggggca 4740 aaggaccatg tttcgacgct gaaagccgat ggataaatgg accggaattt ctgggtctgt 4800 cggaagacca atggccgcga tctacagctc tcaaccctgc aacagatgag gaactacgcc 4860 catgtcactt gcatcatgaa gaggctcaat ctcttatcga ttttgagaga ttttctaatt 4920 ggaatcgatt gctgcgcact ctagccttcg tctttcatgt tttcaccatc cgaaaggcac 4980 gggagacaaa cgccaaggtt gggaaagaac caacgcacga agacttgaag gcagcagaaa 5040 ttcaaatcct aaaaattgtt cagtggagcg tatatccaga cgaaatgtcc atcatatcga 5100 aaaatcaaga tttgccggtt gatcaacagc gccccataga taaatccagt ttcctctaca 5160 aactttctcc gatgatggac caaaacggag ttttgcgcat cgacagccgc actggtgcgg 5220 cacgtgtgga tgcttttgac cttaagtatc ctgtgatact tccacgaaag catcatgtga 5280 cttacttgct tattgaccac tatcacagaa agtatttgca ctgcaatgca gagacaattg 5340 taaatgaact ccgacagaaa tactacattc ctcggattcg agttgctgtg cgaaccgtta 5400 caaaactgtg tcagtggtgt aaagtttaca aggttcaacc gtcggttccg aggatgggac 5460 cattgccaga agctcgccta tctcctgggg tacgaccgtt cagttatata ggcatcgact 5520 attttgggcc tatcctcgtc aaggttgggc gatctaacgc aaagaggtgg atttgtctta 5580 tcacgtgcct cacgattcgt gcggtacatg ttgaagtagc ttacgacctc tcaactcaat 5640 cttgtgtggc ctgtatacgg aggtttgtat gtcgaagagg tgcaccgttg gagatatatt 5700 ccgacaacgg gcgaaacttt gttggagctg atcggattct tcgtgaccaa ataaagcgca 5760 tagatgaagc gacggctgcc acgttcacta atgcgctgac taaatggttt tttatccctc 5820 catccgctcc caacatggga ggctcatggg agcgcctagt tcgttctgtt aaagcagcgt 5880 tgaccaacat acctcaggac cgaaagctag acgatgaagc cttgttgacg tacctagcgg 5940 aagcagagtc aatagtgaat tctcggcctt tgacctattt gccgcttgat gctccagaac 6000 aagaggcact caccccaaat cacttcctct tagggagctc aagtggcgtg aagcaacctc 6060 cagcgaacct aggatcttca cgaatagttc gcaacacttg ggatactcta caagctaatc 6120 taaatcattt ttggactcgc tggatacggg agtatctccc aacgcttacc aggcgcacca 6180 aatggtttgg aaatgttaaa tcggtgggct tgaatgatct agtggttata atcgaagata 6240 acaaacgcaa cggatggact cgaggaaaaa ttttagaagt agtaaagggc agagacggaa 6300 tagttcgtca agctgtagtg cagacatcga gtggagtatt cagaaggccg gtatccaaac 6360 tggcagtgct ggatgtcgca ggaagcagta aagctgaggc tgacactcat ccttacgggg 6420 aggggga 6427 // ID BEL-3_AA-I repbase; DNA; INV; 3468 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3_AA_; KW BEL-3_AA-LTR; BEL-3_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3468 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 855-855 (2011). XX DR [2] (Consensus) XX CC 'TACGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(57..2264,2268..3353) FT /product="BEL-3_AA-I_1p" FT /translation="MTSDSESVKDKTVLNFTENPCEICGPSTHDEAMVGCD FT ECSGWFHTRCVGIKEGEMPKSWYCQSKACQEKAQEYQKRKKQTRNRKQTDE FT SDKSSVNSRLGASSVNVKVRALEERQKRQMEELETEMHLRKKERELQRAYE FT KKIMELELQIRAEEEEEELAWQAEMLRKKTEQVERMKANQESFEERLASMD FT KELAGLSNLKVPKPASKIDTSVLRVEPSSKINRNALKLMKGNFQKFVEDDE FT NTEEEDDDDEYSKNSDLRSQSSGSSMERKVSNNASRVMRNKVDSVSQDGLG FT QQQTGPTKAQLAARKGLTYKLPKFSGNPAQWPLFYAAYKASNDACGYMNHE FT NLMRLQEALEGDALELVSGQLLLPETIPRVIEKLRRHYGRPEQLLESLLDK FT VNRLDPPQPDNLKSFISFGNTVEQLCGHLEAADLRQHLVNPLLIKSLVAKL FT PDREKREWVHYRKGRGETTLRTLTDFLMEIVEDACEANVDVEFKPPTRSSQ FT SGRSMPMARGGLYCHSAISSSPANTGEKILKPCNQCRRTEHRLRHCSEFKK FT MRYADRLRLVKREKLCHVCLNEHGGQCKFKIRCNIGECREFHNPLMHPVGN FT VVGLSAHIRTNCTVLFRIVPVQLHCGEKSATVLAFLDEGASVTLVEKGLAD FT RLGAVGVQERLVIRWTGNVSRVEDTRRTSLWVSSTRASVVDKMLLHTVHTV FT GKLMLPHQKLDFEGDRRAVPSHARIVGSSLTMDSPNLPMRNNIHSFVPIEA FT RVGNPMEPIAVRTNLGWTVYGPKRSTTVASSNYLGYHQQVNDENLQELLKS FT QYALEESVMVLSEQSEEKRVQEILEGTTKRVGDHSETRLLRSVEEERPAVP FT KNCNREWQKPVRAVGVIWDPNMDQCACYKMEYEELKSCRHESERSAKRFAR FT GCVKRIFGPSILVPRVRSTSTIRGKIVRQHRSRSKQNWRGLPTKWNVAHGL FT LKWRQGPPLQSNATQSSAIEETEEAAKNMLHKAIDDELISRWSKLLWEEAS FT VGCFVINNRRKGKGKLAENTRVPAKGRHRAHWDSFPDEKNTFTRCLKYHPD FT GSAEVTKRSGPIYQSSLDDGKIRKSWKRGIAAAVIPGRDML" XX SQ Sequence 3468 BP; 997 A; 788 C; 1031 G; 652 T; 0 other; ttcttaaaaa agaagtgaca ctaataaagt gtcccagtga taatccggcc cacaaaatga 60 cttccgattc cgaatccgtg aaggacaaaa ccgtcctaaa tttcacggag aacccttgtg 120 aaatctgcgg cccgtcgacg cacgatgaag caatggtcgg ctgtgatgaa tgttcgggat 180 ggttccatac ccgttgcgtg ggtatcaaag aaggagagat gcccaaaagt tggtactgcc 240 aaagcaaagc ctgtcaggaa aaggcccagg agtaccagaa gcggaagaag cagactcgta 300 atcggaagca aacggacgag tctgacaaat ccagcgtcaa ctcccgctta ggtgcgtcta 360 gtgtaaatgt gaaagtcaga gcgctagaag aacgtcagaa gcggcagatg gaagagctgg 420 aaacggagat gcatctgcgg aagaaggaaa gggagctgca gcgtgcgtac gagaagaaga 480 taatggagtt ggagctgcag atacgtgcag aggaagaaga agaagaattg gcttggcaag 540 cggagatgct tcgaaagaag acggagcagg tcgagcgtat gaaggcaaac caggagtcgt 600 tcgaggagcg gttggcatcc atggataagg agttagcggg attgtcgaac cttaaggtgc 660 caaagccggc gtcaaaaata gatacgagcg ttttgcgagt ggagccttcc agcaaaatca 720 accgaaacgc gttaaagctg atgaagggga acttccagaa gttcgtagaa gacgacgaaa 780 acaccgagga ggaagacgat gacgacgagt attctaaaaa ttcggatctg cgaagccaga 840 gttccggttc gtccatggag aggaaggtca gcaacaacgc ttcgcgagtt atgaggaaca 900 aggtcgacag tgtgagccaa gacgggctgg ggcaacagca gaccggaccg acgaaggcac 960 agctggctgc acgaaaggga cttacctata agcttcccaa attctcgggc aatccagcac 1020 agtggccctt gttctacgcg gcctacaaag cgtccaacga tgcctgtggc tacatgaacc 1080 acgagaacct gatgcgactg caggaagcgc tggaaggaga tgcactcgag ctggtatctg 1140 gccagcttct tcttcccgaa accatcccaa gggttatcga gaagttgcgt cgccattacg 1200 gccgtccaga gcagctgttg gaaagtctgc tggacaaggt caaccgcctg gatcctccgc 1260 aaccggataa tttaaagagc ttcatttcat ttggaaacac ggtggagcag ctctgcggtc 1320 acttggaggc cgctgatcta cggcaacatc tcgtcaaccc gctgctaatt aagtcgctgg 1380 tcgccaagct gccagatcgt gagaagcgcg agtgggttca ctaccgtaaa ggtcgcggag 1440 aaacaacgct gcggacgctg acggacttcc tcatggaaat agtggaagat gcctgcgaag 1500 ccaacgtcga cgtagagttc aagccaccca caagaagttc gcaatccgga agaagtatgc 1560 caatggcgag aggtggactt tattgccaca gtgcaatcag cagttcaccc gctaacaccg 1620 gtgaaaaaat cttgaaacca tgcaaccaat gtcggaggac ggaacatcgg ttgcggcact 1680 gcagtgaatt caagaagatg cggtatgctg atcggctaag gctggtgaaa cgcgagaagc 1740 tgtgccacgt gtgtctcaac gaacacggtg gccagtgtaa attcaaaatc cggtgcaata 1800 tcggtgaatg cagggaattc cacaatccgc taatgcatcc tgttggcaat gtggtcgggc 1860 tcagtgcaca catccggacc aactgcacgg ttctcttccg gattgttcca gttcagctac 1920 actgtggtga aaaatcggct actgtactag ctttcctgga cgaaggtgcg tccgtcacgc 1980 tggtggagaa agggctcgcc gatcgcctgg gcgcggtcgg agtacaagag cgactggtca 2040 tcagatggac gggaaatgtt tcgcgcgtgg aagatacgag gagaacgagt ctgtgggtgt 2100 ccagtacacg tgctagcgtc gtcgacaaaa tgttactgca cactgtccac actgttggca 2160 agttgatgct accgcatcag aagctagact tcgagggaga tcgtcgcgca gtaccctcac 2220 atgcgaggat tgtcggatcg agtcttacga tggacagccc caactaattg ccgatgcgaa 2280 acaatatcca ctcgtttgtt ccgattgagg caagagtggg aaacccgatg gaaccgatcg 2340 ccgtccgtac aaatctggga tggacagtgt acggtccaaa acgatcgacg acagtagcgt 2400 ccagtaacta tctgggctac catcagcagg ttaacgacga aaatctgcag gaactgctca 2460 aaagccagta tgcgctggag gaatcggtga tggtgctatc agaacaatcg gaggagaaac 2520 gcgtccagga aatactggag ggcacaacaa aacgtgtagg cgaccactcc gagaccagat 2580 tgttgcgaag tgtcgaagag gaaaggccgg cggtgccgaa aaattgcaat cgcgagtggc 2640 aaaagccggt gagagctgtt ggtgtcatct gggatccaaa catggaccag tgtgcatgtt 2700 acaagatgga atacgaggag ttgaagtcgt gccgacacga aagcgaaagg tcggcgaaaa 2760 ggttcgccag aggctgcgtc aagcgaatct ttggtccaag tatactggtg cctcgagtaa 2820 ggtcaacgtc gaccatccgc ggcaaaatcg tcaggcagca caggagccgt tcgaaacaaa 2880 attggcgagg gcttccaacc aagtggaatg tagcacacgg gctgcttaag tggagacaag 2940 gtcctccatt gcaaagcaac gcaacgcaaa gttcggcgat cgaagaaacg gaggaggcag 3000 ctaagaacat gctgcataaa gcgatcgacg atgaactcat ttcacgctgg tcaaaattat 3060 tgtgggaaga agcaagcgtt gggtgctttg tgatcaacaa ccgacgcaag ggaaaaggaa 3120 agctggcaga aaatacacgg gttccggcaa aggggcggca ccgggcgcat tgggacagtt 3180 ttcccgacga aaagaatacg tttacacgct gcttgaagta ccacccggat ggatcggcgg 3240 aagtgaccaa aaggagcggc ccaatctacc aaagttctct ggatgacggg aagattcgga 3300 agtcatggaa gagaggtatc gcagcagctg taattccggg tcgagacatg ctgtaactcc 3360 aggtgcagac atgatgacta ttgatgggaa ggtgcagaga caaggtgatg aatttagcgg 3420 cgctggaaat caagtaaatc cgggagtgcc ggatgttacg ggctgggg 3468 // ID Gypsy-19_AA-I repbase; DNA; INV; 4522 BP. XX AC supercont1.227; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_AA_; KW Gypsy-19_AA-LTR; Gypsy-19_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4522 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.227; Positions 681661 677140. XX CC Positions [3453-3971] - Integrase core CC 'TCTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 231..4496 FT /product="Gypsy-19_AA-I_1p" FT /translation="MNSEQFSMFMQHQSGVFKEMVNALRQIGAPQQVVQQP FT VEKAAALPVPLPPPLQLEGDMEQNFEFFVQNWQNYVSAVGMDQWPPEQNRQ FT KTSVLLSVIGKSALKKYFNFELTNEQQQDPDAAIAAIKLKVVRERNKIVDW FT YDYFSIMQEVQENIDEFVTRLKSLAKLCRFGALEEEFVMYKVVTSNKWPRL FT RAKLLTTQNLTLVKAIDVCRAEEITEKHVASVGQSSGDVNMVKKKKKMKCK FT YCGDWHDFSKGSCPALGKKCRGCGGRNHFEKVCRADGKKKNKERKVKKVKD FT GSFSSTSQSEDSDSSETESEEEVVIGKVYDFSDAGGNVLADLEMFVGDKWQ FT SVRCELDTGANTSLVGVDWLKKAAGGVCPELLPSKYRLQSFGGASIPVLGE FT VRIPCRRSDCKYTLALQVVNVNHMPLLSAKVCKKLGFVKFCHSVVLQPPKS FT EQDLLNIYRIKAEQIIHQHKQLFEGYGKFSGVVSLEVDPDVVPSVQQPRRV FT PISMRELLKDELKKLERDGIIVKETQHTDWVSNIVLVKRSGVEGSVRICLD FT PIPLNKALKRPRLQFTTIDEVLPELGRAKVFSTVDLKKGFWHILLDEKSSR FT MTTFWTPFGRYRWTRMPFGIASAPEIFQVKLQEAIQGLKGVECLADDVLVY FT GTGENMKEALDDHNRNLKELCVQLEKNDIKLNSSKLKLCETSVKFFGHVLT FT NQGLKADEGKITTIKEFPTPTDRKQLQRFIGMVNYLGRYIKNLSAESTILR FT RLISEKEPWTWTATEEDEFRRMKSIVADVKTLQYYDVKKPLVVECDASSFG FT LGAAVFQEKGVIGYASRTLTATEKCYAQIEKELLAILFACVRFDQLVVGNP FT KTIVRTDHKPLLAVFRKPLLSTPRRLQHMLLSLQRYNLELQFVTGRDNVVA FT DAISRAPLNEKHCEDHFEKRNIYHIFKDVQTINMSNYLSITDERLNEVITE FT TKTDPAMQMIMQYISQGWPSSCDKVPDAVKVYFKHRNELSSQDGIVFRSDR FT IVVPQSLRKKLTEKVHLSHNGIEATLKLARANLFWPGMSWQIKEAVMQCGI FT CAKFAPSQPSPPMKSHAIPVHPFQVVSLDVFFAEYRGDKKKFLVTVDHFSD FT FFEVDILKDLTPKSTINICKINFSRHGKPQLVITDNGTNFVNADWRQFAQD FT WDFRQSTSAPHHQQANGKAEAAVKIAKRLLQKSKESGVDFWYALLHWRNVP FT NKIGSSPVVRIFSRSTRCGIPMSAENLIPKPQTGVSEAIENNRRKAKYHYD FT KKSRNLPTLEVGDPVYVQLQPESSKQWTPANISSKLNERSYVVDVDGARYR FT RDLVNIKPRNEPSVTPSQPLMNVASAPVEVSAPLGLPDKPADLSSYSAIDN FT SAMSQEEISETSQDIVVSPTISESPKKASRKLNHLPTETMTRPKRTMRLPS FT RYRDYDMN" XX SQ Sequence 4522 BP; 1369 A; 879 C; 1149 G; 1125 T; 0 other; tggtgtcaga agccatcgtg gtgttctgaa agtttattcc ggcgtcatcg tcaatatcgc 60 ggaagtgttt gggcgaggaa aaaaacgaac tgaggcagtg aaagaatcat tcgcggtgtt 120 cgatttttcg ggagacagtg tgaacgtgtc gaaacttgct tggcggccat atttcttttg 180 ctgtgctggc ttcgtgtgat tttcccaaaa gtgctgaaaa ctgtttcaga atgaattccg 240 aacaattttc aatgttcatg cagcaccaaa gtggtgtttt caaagaaatg gtgaatgcgt 300 tgcgccagat aggtgctccc cagcaagtag tgcagcagcc ggtagaaaaa gcggcagctc 360 ttccagttcc ccttcctcca ccgctgcaac tagagggaga catggaacag aattttgagt 420 tttttgtgca aaactggcaa aattatgtca gtgcggtggg aatggaccag tggcctccgg 480 aacaaaaccg gcagaaaaca agtgtattgt tatcagtgat tggcaagtcg gcactaaaaa 540 aatactttaa ttttgagctg acaaacgagc aacagcaaga tccggatgcc gcgattgcgg 600 ccattaagtt gaaggtggtt cgcgagagaa acaaaatagt tgactggtac gactactttt 660 cgataatgca ggaagtgcaa gagaacatcg atgaatttgt aactcggttg aaatcactag 720 caaagttgtg tcgtttcggt gccttggagg aagagtttgt gatgtataaa gtggtaacct 780 ccaataagtg gccaaggcta cgagcaaaat tattgacgac ccagaatctc acgttggtga 840 aagcaatcga tgtgtgtcgt gcggaagaga ttacagagaa gcacgtcgcg agtgttgggc 900 agtcaagtgg tgatgtgaac atggtaaaga agaagaagaa aatgaagtgc aaatactgcg 960 gcgattggca tgacttttcg aagggttcgt gccccgcatt ggggaagaaa tgtcgcggat 1020 gcgggggaag aaaccacttc gagaaagtgt gtagagcgga cggtaagaag aagaacaagg 1080 agaggaaagt gaaaaaagta aaagatggta gcttttcgag caccagtcag agtgaagaca 1140 gtgacagttc ggaaacggag agtgaagaag aagttgttat tgggaaagtg tatgattttt 1200 ctgatgctgg aggaaacgtt cttgcggatt tggagatgtt tgttggtgat aaatggcaat 1260 ctgtgcgctg tgaactggac accggcgcaa atacaagtct ggttggagtt gattggctga 1320 agaaagcagc aggtggagta tgtcctgaac tgcttccatc aaagtaccgt ctacaaagct 1380 ttggaggagc ttcgattcca gtacttggtg aggtccgaat tccttgtcga cgcagcgact 1440 gtaagtacac gctggcgttg caagttgtaa acgtcaacca tatgccactc ctgtcagcaa 1500 aggtatgtaa gaaacttgga tttgttaagt tttgtcattc agtggtgctt cagccaccaa 1560 aatcagaaca agatctcctc aacatttatc ggatcaaggc agaacaaata attcaccagc 1620 ataagcagct ttttgaagga tatggaaaat tttctggagt tgtttcattg gaagttgacc 1680 cggatgttgt cccgtctgtt caacaacccc gacgtgttcc aatttcgatg cgcgaacttc 1740 tcaaagatga attgaaaaaa ctggaacgag atggcattat tgtgaaggaa actcaacaca 1800 cggattgggt tagtaacatc gttcttgtta aaagaagcgg tgttgaagga tcagttcgta 1860 tatgtttgga cccaattcca ctcaacaaag ccttgaaacg tccccgtctt cagtttacca 1920 ctattgatga agttttgcca gagctaggac gagcgaaagt gttctctact gtggatttga 1980 agaaaggttt ctggcatatt ctcttagacg agaagagcag tcgaatgacg actttttgga 2040 ctccgttcgg acggtaccga tggacccgaa tgcctttcgg tattgcatcg gcgccagaga 2100 tttttcaagt taagctgcaa gaggcaatcc agggattgaa aggagttgaa tgcttagctg 2160 atgacgtgtt agtttatgga accggagaaa atatgaaaga agcgttggat gatcataatc 2220 ggaacctgaa ggaattgtgt gtgcaattag agaaaaatga catcaagctg aattcgagta 2280 agttaaagct ctgtgaaaca tcggtaaaat tcttcggaca tgttctgaca aaccagggct 2340 tgaaagcaga cgaaggtaaa attactacca ttaaggagtt tccgacacca acagatcgca 2400 agcagctgca gagattcatc ggcatggtca attacctcgg gcgctatatt aaaaatctca 2460 gtgcagagag tacaatacta cggagattga tttcggaaaa ggaaccttgg acgtggacgg 2520 caacggaaga agacgaattc cggcggatga aatcgatagt tgcagatgtt aaaacattgc 2580 aatactatga tgtcaaaaag ccactggtgg tagagtgtga cgcaagttcg tttggattgg 2640 gcgctgcggt atttcaggag aaaggagtca tcggttatgc atcacgaacc ctgacagcaa 2700 cagagaagtg ttatgcgcag atcgagaagg agttactggc aatactattc gcgtgtgtac 2760 gatttgacca attagttgtc ggtaatccta agacaatagt aagaacggat cataaaccac 2820 tgttagctgt cttccgaaag ccgttgctct cgactccgcg tcgtttgcag catatgcttc 2880 tgagcttgca gaggtacaat ctggaattgc agttcgtcac agggagagac aatgtggtcg 2940 ccgatgctat atcaagagcg ccattgaatg aaaagcattg tgaagatcac ttcgagaagc 3000 gaaacatcta ccacattttc aaagatgtac agacgatcaa tatgagcaat tacctcagta 3060 ttaccgatga acggctgaac gaagtaatta ctgaaacaaa gactgaccct gcaatgcaga 3120 tgatcatgca gtacatcagt caaggatggc cgtcgtcttg tgacaaggta cctgatgctg 3180 taaaagtgta cttcaagcat cgtaacgaat tgagttcgca ggacggaatc gtgtttagga 3240 gtgacagaat agtcgtgcca cagtcgttga ggaagaaact tactgaaaaa gttcatctca 3300 gccataatgg cattgaagcc actttgaagc tggctcgtgc gaatttgttt tggccaggaa 3360 tgagctggca gataaaggaa gcggtgatgc aatgcggaat ctgtgccaag tttgctcctt 3420 cacaaccctc accaccaatg aaaagccatg ctatcccggt gcacccgttc caagtcgttt 3480 ccttggatgt attttttgcg gaatatcggg gagataaaaa aaagtttttg gtaactgttg 3540 accatttttc cgatttcttt gaggtcgaca tactaaagga cctaacccca aagagtacaa 3600 taaatatttg caagatcaac ttctcccgtc atggtaaacc ccagttggtc ataacagata 3660 acggaactaa ttttgtcaat gccgactgga gacagtttgc tcaagattgg gatttccggc 3720 aatctacatc ggctccacac catcaacaag cgaatggtaa agcagaagca gcagttaaga 3780 ttgccaaacg cctgctacaa aaatcaaaag aatccggtgt tgacttttgg tatgctcttc 3840 ttcattggcg aaacgtaccg aataagattg gatcgagtcc cgtggttcga atattttctc 3900 gaagtactag atgcggaata ccaatgtcag cagaaaatct aattcccaaa ccccaaactg 3960 gagtttcaga agcaatcgaa aataacagac gtaaagcgaa atatcattat gataaaaagt 4020 ctcgaaatct gccaactttg gaggtaggag atcccgtata tgttcagctt cagccagagt 4080 catcgaagca gtggacgcca gcgaacatca gcagcaagtt gaacgagcgt tcctacgtcg 4140 tggatgtaga cggcgcacgt tatcgccgag atctagttaa cattaagcca cgtaatgaac 4200 catcagtaac gccttctcag ccattaatga acgttgcaag tgctccggtt gaggtatcag 4260 ctcctttggg gctaccggac aagccagcag atttatctag ctactcggcg atcgataatt 4320 cagcaatgag tcaggaagaa atttcggaaa cgagtcaaga catcgtggtg tcgccaacca 4380 tcagcgaatc tcccaagaaa gcatctcgta agctgaacca tctaccgacg gaaaccatga 4440 ccaggccgaa gagaaccatg agattgccga gccgttatcg tgattatgat atgaattaag 4500 tttttaatca aaagggggaa ga 4522 // ID I-80B_AAe repbase; DNA; INV; 7481 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-80B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7481 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1351-1351 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >97% CC identity. The consensus is >77% identical to I-80_AAe. XX FH Key Location/Qualifiers FT CDS 1001..2629 FT /product="I-80B_AAe_1p" FT /translation="MAASSSGPPPTSWGDGILDFPTLGRVDGTFTNATLPS FT YMDPEGMYGELHLLRISGVNGPLPNRPFQIRRSVEKFVGGKIEGAFPEANK FT ATYALKVRNKRQFDQLLTMKKLNDGTAITITEHPTLNSIRCVVSCRDVINM FT SDDELLEELKEQGVKEVRRITKKNGQSRENTPAIVLTCHGTIRPEVIEFGY FT IRCRTRPYYPSPMQCFNCWLFGHTKLRCQAKSATCGTCSRNHPISENKECN FT TEQFCKTCDSNDHRISSRSCPLWQFENTVQRVKVDQGISYPAARRIVEQNR FT GGKSFANVVGPPPKDSLQVTNERIDQLTAVLASKDAEIAELRAALVTSETP FT PAVINHEIESLKTIVANQARQIQLLTDQISVFLKAVMPAANLSASSTNPMP FT TSVTPTITSVSATSRPPTTTPVPINIPIAAAPVDTVSAVVSKTEPTVQLID FT PLLETVFTDSDSPSAELSTNEEFFSPRTEKILSSPSTPKPTRLKATLPVTA FT RINTDPIMTRNKRTISTLSRTENLSQQQKKSKHKSSSDSLGAIPKSR" FT CDS 2633..7147 FT /product="I-80B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="DHQPYFIFTPPPQRNIITNHTSPSRYELPNQQRNAEH FT MASGRLGTGELEALSQPESPGRPWHLANHYDPSILLDSRGASVPEVLPQPE FT SLGRLRHLGSDADEELRQNYRCPAISPGSRGTVGADVSHTGTGGLPSAPGG FT DLDEGSLAHTPTTLDDVGSEGMQTVGTRRALPPTPKDYVGSEVVYNQVNVQ FT APNSSNASSSNNKKRLLSPLATPFFPSVGTSLHSVGSSSGPAVCGALSCSA FT TNDENEVPTRSFLSTVESSSTLVLSPLRRLTGDVLDNSYSDTTHYQSSTAN FT ISYSIPTLPKSSKFCLQWNMNGFLNNLADLELLVSYDPPWILALQEVNRVT FT IEQLNRSLGGKYVWTSKREGNIRRSVALGILKSIPFTFLDIDSQLPVVGVK FT LHGSITISVACLYLPCGNIPDLRGTVENLIRQLPEPRMIVGDLNAHHPAWG FT GLRSDARGIMLLDLLEDSDLTILNDGSPTFFKGYYSTAIDVSAVTRSEIRR FT FHWNVNSDPCGSDHHPLLISVAAEAPATTRRPRWLYDQADWAAYNNNICAS FT ISNHQPSSISEFSAILTQAASVSIPKTSNTPGRKALPWWSPEIKKAIKRRR FT KALRAVKRLDLNHPNRAAALDRYRSQRNECRQVIRDAKRACWEKFLDSINA FT SQSTSDLWAKVNALNGKRSATPPTLLIDGNNTADPTVIANGLGKYFASLAA FT FGSYEPEFVRRSGATAGDIINFNVPASNSQLHINLPFSAGELQHALSRSNG FT KSAGPDDIGYPMLKKLEGPGKLVLLEQINNLWLGDSYPLEWRESFVIPIPK FT PGNLIREPASYRPIALTCCLSKVVERMVNRRLTHHLQQHGLLDHRQHAFRP FT GYGTNTYFAALGEVLKEASDRNHHTEMISLDISKAFNRTWTPFVLEKLASW FT GLTGHILHFLKNFLTNRSFRVVIGDTKSTTFNEETGVPQGSVISVTLFLVM FT MDGVFDDLPTGVQIFVYADDIVIVVSGPTTVATRRKAQAAVSRVAKWAASV FT GFTMSASKSIRCHICPSGHRVSGPPIVINNQAIPLRKTAKILGVTFDRSLS FT FKPHCEAVRSNCLSRLNLIKSLSSPHRSNNRAIRFRVAAALIDSRLFYGLE FT TTFLAMNRLIDTLSPIYNRYVRIVSGLLPSTPADAACAEAGLLPFRYLITA FT TLCMKAAAFAEKTAGNARIRLLEEADNLLDSIAATKLPPVAKVHWYGERDW FT HRSSLKFDSRIKNNFRAGDSSTRLRQSVAEVLRTDYAAHQRRYTDGSLSVQ FT GVGIGIADDELAISLSLPGQCSIFSAEAAAIFFAATEPTTRPIIIITDSAS FT CYFALQSEAPRHPWIQGIIRHASPDVTIMWVPGHCGIPGNSTADHLAGSGY FT SSRRYTTTVPLADIRRWVKSTVRRAWQTEWTNSRGAYLRKTKESVDAWADL FT KSMKDQKLISRLRTGHTRISHNLGGTPFHRVCEVCSINNTVEHVICVCPLY FT EHLRQMYGIPGSIRDALQDDPSSIASLISFVKSAGLYFKI" XX SQ Sequence 7481 BP; 1949 A; 2103 C; 1661 G; 1766 T; 2 other; cagtttgcct tgcgtttctg actgaagtag acgtgcgggc gtgtctcttc agtttttgta 60 agcattttaa ccgttttaac gtgtgtataa gtttcgtttt agtgagtgtt ccttacgggg 120 tacttagcta aggtaccact aattgataac gtttactcga gttattccgc aaaaaccgcg 180 aagcgagtaa ttttcgcgtt cggaaaacga aaaaaaagcc cccttgagcc ggctatacgg 240 maagggggcc cgtgttgtgt ttcatcacat tcgttcagtg gcctaaattg atacatctgt 300 aggcaacagt tacactcttg accgaggtta accccaccta aagtgtgtgt gcaccagttg 360 tttgcatatc tgggtaaaaa gtattacgga gaagaaacac gggatcaagc gtaaaaccgc 420 agtgatagtg caatttaccc ctctggtgac gatcgagaag agtagcacca ttccccgctg 480 gatcggatac ccgatcggga gtcggtgatt gattttctcc aactagtgta gtgaacagtc 540 catctcccag gtggaatctt tccccggagt gaataccccc ccttcccctc gggacatctc 600 ccccccttcc tcatacccct cacccccaaa ctcaacaaat ttaccccttc cccttccacc 660 tcccccccct ctaaaattac cccttcccct aaaacttgtt cctcttccct tcacccatct 720 cctggtactt ctctcacccc tcagtgttac acacagaaaa agttatccca ctctaaagtt 780 gggtaaaaag tcgaattttt tcgaaaaagt gcggaaagtt tttcctcccc cacgtaaagg 840 ggttctgagg ccaaagacac ttaagtcgtg cgcgggtgag ccgttgaaac ccagcgagta 900 ttcttttcga gtgttttgtg ttcgtgtaag ccgttgtttt caccgaggag catcaacaaa 960 gcacaggtta gttggtggtg gcggtgttct ttgggccttc atggccgcwa gttcatccgg 1020 gcctcccccc acatcttggg gggacggtat tttggatttt cccacgttag gtagggttga 1080 tggcacgttt accaacgcga cccttccgtc atacatggac ccggaaggaa tgtatggcga 1140 attgcatttg ctcaggattt ccggagtcaa cggaccactt ccgaacagac catttcaaat 1200 aaggcggtca gtagaaaagt ttgtcggtgg taaaatcgaa ggagcctttc ccgaggcaaa 1260 caaagcaacc tacgcactta aggtacgaaa caagcggcaa ttcgaccagt tgctcaccat 1320 gaaaaagttg aatgacggaa cagctatcac gatcacggaa catccaaccc tcaattcgat 1380 taggtgtgtt gttagctgta gggatgtaat caacatgtcc gatgatgagc ttcttgaaga 1440 gctgaaggaa caaggagtaa aggaggtcag gagaatcacc aagaaaaatg gacaatcacg 1500 agagaatacg ccggcgatcg tacttacctg ccacggcaca atccgaccgg aagtcattga 1560 attcggttac attcgatgtc ggacacgtcc ttactatcca agcccaatgc agtgctttaa 1620 ctgctggctg tttggacata caaagctacg ctgccaggct aaatcggcca catgtgggac 1680 ttgttctagg aatcatccga tttccgagaa caaggaatgc aatactgaac agttttgcaa 1740 aacttgcgac agcaatgatc acaggatctc cagccgttct tgccctttgt ggcagtttga 1800 gaacaccgtt cagcgagtaa aggtcgacca aggaatctcg tatccagcag cccgccgaat 1860 cgttgaacaa aaccgtggtg gaaaatcttt cgctaacgtt gtagggcctc ctcccaaaga 1920 ttccctgcag gtgacgaacg aaaggatcga tcagctaaca gctgtcctgg cttcaaagga 1980 tgccgaaatt gccgaacttc gcgcagctct tgtgactagc gaaacccccc cggcggttat 2040 caatcacgaa attgaatccc ttaaaactat cgtcgccaac caagcaagac aaatccaact 2100 gctaacggac caaatatccg tctttctcaa ggctgtcatg cctgccgcca acctctccgc 2160 gtcatcaacc aatccaatgc caacctccgt gacaccaact atcacctctg tatcagcaac 2220 ctccaggcca ccaaccacca ctcccgtgcc catcaatatc cctattgccg ccgctcccgt 2280 agacactgtt tctgcagtag taagtaaaac cgagccaaca gttcaactaa ttgacccact 2340 attggaaacc gtttttactg attccgatag tccctctgcg gaactcagta caaacgagga 2400 atttttttct cctcgcacgg agaaaatatt atcctcgcct agtacaccaa agcccacccg 2460 tctcaaagct actcttcccg taaccgctag gataaatact gatccgatta tgacacggaa 2520 taagcgtact ataagcacgc tatcccgcac tgaaaatctc tcccagcagc agaaaaaatc 2580 taagcacaaa agctcctcgg acagcttggg ggccattccc aagtctcggt aagaccatca 2640 accctatttt atcttcacac cgccacccca acgaaacatt atcacaaacc atacctcgcc 2700 ttcccgctat gaactcccca accagcagcg caatgcagaa cacatggctt caggtcgtct 2760 gggcaccggt gaactggaag ccttgtccca accagaatcg ccgggtcgac cctggcacct 2820 agccaaccat tacgatccgt ccatcctcct tgatagtcgg ggcgccagtg taccggaagt 2880 cttaccccaa ccggaatcgc tgggccgact tcggcatctt ggaagcgatg cggatgaaga 2940 gctccggcaa aactatcgct gcccagctat atctcccggt agccggggca ccgtcggtgc 3000 ggacgtctcc cacaccggaa ctggcggact accctcggcg ccggggggag acctggatga 3060 gggaagtctg gcacatacgc ctactaccct ggacgacgtg ggaagtgaag gcatgcaaac 3120 agtcgggacg cgtcgggctc taccccctac cccgaaggac tacgtgggaa gtgaagtcgt 3180 atacaaccaa gtaaacgttc aagcacccaa ctccagtaac gcatccagct cgaataacaa 3240 gaaaaggcta ttatcgcctc tcgcaactcc cttttttccc tccgtcggga catctctaca 3300 ctcggtgggc tcatcttccg gtcctgccgt atgtggagct ctgtcttgtt cggcaaccaa 3360 cgacgagaac gaagtaccta ctcgatcctt tttgtccacc gttgaatctt cttcaaccct 3420 ggtgctttca cccctccggc gactcactgg cgatgttttg gataattcgt actccgatac 3480 cacccactac caatcttcca ccgcaaatat aagctactcc atcccgacat tgccgaaatc 3540 gtctaagttc tgcctacaat ggaacatgaa cggcttcctg aacaacctag cggatcttga 3600 acttctggtg agctacgacc ctccttggat cttggcactc caagaagtca accgagtaac 3660 aatcgaacag cttaatcgtt ccctcggtgg caaatacgtg tggacctcga aacgagaagg 3720 taacatccgc cgttctgtcg ctctcgggat attaaagtca atcccgttca ccttcctgga 3780 cattgactct caactaccag ttgtcggagt taagcttcat gggagtatta caatatccgt 3840 agcctgcctg tacctaccat gcggaaacat tcctgaccta cgtggaactg tagagaacct 3900 catccgtcag cttccagaac caagaatgat agtcggggac cttaacgcac accatccggc 3960 atggggcggt ctccgttcag acgcaagagg cataatgttg ctcgatcttc tcgaggacag 4020 tgatctcact attcttaatg atggctctcc aacattcttt aagggctact actcaacagc 4080 aattgacgtt tcagctgtta cccggtcaga gatccggcgt tttcactgga atgtcaactc 4140 ggatccatgc ggaagcgatc atcaccccct gctaatctca gtagccgccg aggctcctgc 4200 caccactaga cgaccccgtt ggttgtatga ccaggcggat tgggccgcct ataacaacaa 4260 catctgcgca tcaatcagta atcatcaacc aagtagcata tctgagtttt cggctatcct 4320 aacccaagcg gcaagcgttt caattcccaa gacaagcaac acacctggcc gtaaggctct 4380 cccttggtgg tccccggaga ttaaaaaagc tatcaaaaga cgtagaaagg ctctccgagc 4440 cgttaagaga ttggacctca atcatccaaa cagagcagcg gctttggaca gataccgatc 4500 tcagcgcaac gaatgccgcc aagtcattcg ggatgctaaa cgggcttgtt gggaaaaatt 4560 tctcgacagc atcaatgctt cacaatcaac ttctgaccta tgggctaagg tcaacgccct 4620 taacggcaaa cgatccgcca ccccgcccac tcttcttatt gacggaaaca acacagcgga 4680 cccgaccgtt attgcgaacg gcctaggcaa atattttgcc agcctggctg cctttggtag 4740 ctacgagccg gaattcgtac gacgttccgg ggctaccgcc ggcgacataa taaattttaa 4800 cgtgcctgcg agcaactcgc agctccacat caacctacca ttctcagcag gagaactaca 4860 gcatgccttg agcagaagta atggcaaatc cgccggccct gacgacatcg gctacccaat 4920 gctaaagaaa ctggagggac cagggaagct agttttgctc gaacagatca ataatctttg 4980 gctgggtgac tcctacccgc tagagtggcg agagagtttc gttattccga tccctaagcc 5040 cggaaatctg atacgggaac cagcaagcta ccgtccgatt gcgctcactt gctgcctctc 5100 gaaggtagta gagcgaatgg tgaaccgccg gctaacacac catctacagc aacacggtct 5160 actcgaccac cggcagcacg cttttcggcc tggttatggc acaaacacct actttgcagc 5220 tctcggcgaa gtgctgaaag aggcaagtga taggaaccat cacactgaga tgatctctct 5280 ggatatctcc aaggctttta accggacctg gacaccattt gtcctagaaa aactagccag 5340 ctggggccta accggccaca tactgcactt cctaaagaac tttctcacca accggtcctt 5400 tagggtggta atcggtgata ccaagtccac cacctttaat gaagaaactg gcgtaccaca 5460 gggatccgtc atatcggtaa ccctttttct agttatgatg gacggagtat tcgatgatct 5520 gccaaccgga gtgcagattt ttgtctatgc ggacgatata gtcatcgtag tatctggtcc 5580 aacaaccgta gctacacgca gaaaagcaca agcagctgtt tctcgagtgg ccaaatgggc 5640 tgcctccgta ggcttcacca tgtcggcatc taaaagtatc cgatgccaca tatgtccatc 5700 cggacatagg gtcagtgggc ccccgattgt aatcaacaac caagccatcc ctcttcgtaa 5760 aactgcaaaa atccttgggg ttaccttcga ccgcagctta tctttcaagc cccactgtga 5820 agcagtcaga tccaactgtc ttagccgcct gaatttaata aaatctctct caagtccaca 5880 ccgcagcaat aatcgagcta ttcggttccg tgtcgctgca gctctaatcg atagtcgact 5940 tttctacggt ctggaaacca cattcttggc gatgaacaga ctgattgaca cactgtctcc 6000 catctataac cgctacgtgc gaatagtttc cggtctcctc ccatctaccc cagccgacgc 6060 agcttgcgct gaagccggcc ttctgccttt ccgttaccta attactgcga cgctttgcat 6120 gaaggcggcc gccttcgcag agaagactgc tggtaatgct agaatccgcc tcctagaaga 6180 ggccgacaat ttgctcgaca gcatagctgc cacgaaactc cccccagtcg ccaaggtcca 6240 ctggtacggc gagcgtgact ggcaccgctc atcactcaaa ttcgacagta gaataaaaaa 6300 caacttcagg gcaggagact cgtcaacgcg gctgcgacaa tccgtcgctg aggtcttgcg 6360 gacagactac gcagcacatc agcggcgtta taccgatggc tctctctcag tgcaaggggt 6420 tggaataggc atcgcggatg acgaacttgc gataagcctc agtcttcccg gacaatgctc 6480 gattttttcc gccgaagccg ctgctatttt ctttgccgct actgaaccaa ccactcgtcc 6540 gatcatcata ataaccgact cagccagttg ttacttcgcg cttcaatcgg aggctccccg 6600 tcatccgtgg attcagggaa taatcagaca tgcatcaccc gatgtcacca tcatgtgggt 6660 tccagggcac tgtggcattc cggggaatag caccgctgat cacctagctg gatccggcta 6720 ctccagccgt cggtatacca caacggttcc actggccgat attcgacgtt gggtaaaatc 6780 aacagtacgc cgagcatggc agacggagtg gactaactca cgaggagcct acctgcgaaa 6840 aacgaaggaa agcgttgatg cttgggcgga cctcaaatca atgaaggacc aaaaattaat 6900 ctccaggtta agaactggtc atacaaggat ctcccacaac ctagggggga cacctttcca 6960 ccgggtctgc gaggtttgca gtataaacaa tactgtcgag cacgtaatct gcgtatgccc 7020 gctatacgaa catttgagac agatgtacgg aattccaggc agcatccgag acgcccttca 7080 agacgaccca tcctcgatag catcattgat atcctttgtc aaaagcgctg gattatattt 7140 taagatttaa gctttaggca accggaccta acgctttagt taccccacta tatgttagtt 7200 tgcaatgttt cagttcctcg gattctgttt gttcttagtt ttaaaccatg tgtgatagtc 7260 ccttagttat ccttatgtgt tgacgtgtga ttttgctttt ttcatctttt aattgtccca 7320 gtgataaact ctggcaagta actctgcgaa gatcctagat ttaagccaaa ctactttgta 7380 gctgttttcc agacgagccc atctggctcg ttctgtttac cggaccgggg atgaacctgc 7440 ctacggcaga aaatccctaa ataaaaaaag agaaaaaaaa a 7481 // ID ISL2EU-1_BM repbase; DNA; INV; 2695 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW ISL2EU; DNA transposon; Transposable Element; ISL2EU-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-2695 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2695 BP; 922 A; 436 C; 462 G; 875 T; 0 other; ggggttggca actgtcaaag tggcgctggt gtgcattgtt cccataacat tcaatatttg 60 tattggaatt tttgattaaa aattatttta agcacaatga cgcgttgttt tgtgccaata 120 tgctcagaaa ttggagtaca tcaattcccg aaggacaaga aaattggaag actgtggtta 180 aaggccattt gacgcgaaaa aagtactccc acaaaaagtt cgaggttatg tcgtaaacac 240 tttgtggaat ctgactatga aaacattagt aaatacacag gtatgcttct ttctgtcaaa 300 tatttataat gcatgactag gtgtagcaca catctttgtt ctcttattaa aaaaatccaa 360 aaataattag ttaggcacac tattagttgc taacatattt tgcaagtaga tttatgctgt 420 aaggaacctt ttaacaaaca caaaatcaga ctggaggaaa cacattcctc atgagctgtt 480 gttcattatc agctgttact attgaaagtt ctatctaaat cgctgcatcc attgtaataa 540 caaatatttt gaaactatgg tcaggtttcc ttgtgatagt atatttaaat gtttttcttt 600 gtgttgtagg tgtcaaacat taacataaat atctgaagaa aggtgctgta ccatctatat 660 tttcatggaa tatgaaacca gtttctgaag aaacaaaaag tagagaacaa aggttacaaa 720 ctcgtaacct taaaaaacag ctttttagcc tgcccaacac ttcccaatac tgtgaagata 780 tagtacccct tgaagaacac agcaatgtgc attttgattc aaatgtctct caagaaattc 840 taatagaaaa taataactct agtatagatg aaacaattac aaggcatcat attactattg 900 gaactcaaac atccaatata ttaagattgt tctctacaga gctattactt gcagacgatg 960 agacagtgca atactacacc gggttggaaa cagcatctaa gttttcatta gttttaatca 1020 cacttttgcc aatgtcaaat gatataaggt atcgctggag tagagtggtc ggcatatcaa 1080 tagaagatca atttttaatg ttactaatta agcttagaag aaacacaaca gattttgaat 1140 taagtaaaat tttcggtgta agtaaaactg aagtgtctaa ttttgttgta acatgaataa 1200 attttgtcag tgatatctag aacctaattg acatttggcc tagccgcagc ttagtaaatt 1260 attatatgcc taattgtttt agaacacatt atccctccac aagggtgatt atagatggga 1320 ctgaaatagg catacagaag cctagtcaac ctgatgctca aaaagcgtca tttagttctt 1380 ataaacataa aaataccttg aaatttttgg taggagcttc accaggggga ttattatcat 1440 atgtttcaaa cggctacgct ggttctgcaa gtgatcgaca gatagtagaa cgaagtaagc 1500 tgctgcagat ttgtgattct ggagatagta ttatggcaga cagaggtttt aatgtacagg 1560 atctgtttgc ctctaaagga attagtataa atatcccatc ttttttaaaa ggtaaatcac 1620 aaatcccagg tgttcagtta aagatagatc aaaaattagc tagtcagagg gttcacatag 1680 aacgattaat ttcatatgtg taatgttatg taattttaga gaaggaattg tcaataaaaa 1740 taattaaaaa tattcctgat tttttatttc gacaagtgtg tatagttata atacagatat 1800 ttttcataca atgcttcttt gaaaatattt tcataaaaga tatctaattt atttaacata 1860 ttatttataa aattattgtc tttttcaaca taaatcactt tgatgtcctt gtaggtgtaa 1920 atgaccaagt tgcagtatgt tttgcctgag cagtataact gcccttggat ttgataataa 1980 tatgggcagg tttttttaga gacaatatcc cgttacattg ttctaagtaa ggcaccgttg 2040 ttatatttat ttcatgtttt ctgctagcgt atggacattt cacttcaatt atagtttctt 2100 cgcccaacac accatcagga gatgcagcaa gatatgggcg ctcttggcta atgaaaatgc 2160 cacatggctc cactagcaaa tcatatatat tagcatattt tgctagagca attttttcat 2220 tttgtttcaa aattaaaatt ttggtcatat agctctaatc tgaaaaaagt aaaataatac 2280 attagttggt aggtaatgtg tgtaatatga gcaattcttt ttgcacatgc tacaaggaat 2340 aaagttccta aggtacacaa tttaatatta ttattataat agctaccgga aacagtttga 2400 gagaagtttt ctaagttaaa ataaaacatc aacattgtac ctttcgacaa aatccacttt 2460 ttttcctcgt aaagatgcac cacgttttct caactcctcc tttaattctt taactatcca 2520 gttggaatac atgttgtaaa actgtttagc actacttaga aagtaataag aaccgtatca 2580 aagtcacaaa atgccgaaat atcacgaacg caaaacacac aaaatggcgg ctctctgtga 2640 caagggaaca agctacgctg gcgacatctg agaaaagctt taaccggcca acccc 2695 // ID Gypsy-21-I_NVi repbase; DNA; INV; 7799 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-21-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7799 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 779-779 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1023..2654 FT /product="Gypsy-21-I_NVi_1p" FT /translation="MLNGMEDDVGNWVLNLTRGELRQTLSSLRLRQDGNAE FT DQWVRLHKWLMGEYDPSDFEPTGELGAAAVKQERAESRRSFLETTCTEEKR FT IGINPHPLEELISDPGTQGRLQRTLDNAVEENVIVPENVHEEAGADKDEFK FT DCIAGVDGQEYTMPPGMVDYLEKMQRTVRSLQEEVSQLKQKRTEARASSTR FT TDGESRQPPPRVTFSDATNFETSTPATGARGATPRNRWREWREAAERHSPA FT PNRGTTRTFVRDIVTIVRKWDLRFSGSMTDRIDRFLDRLESNMRLAQMTEA FT EKMESLPHVFTGVAAEWYENNKDNWRCWEEFRTAATRWYGTNRRFQQRLAS FT DAEKRYQGAHEPVRDYVTCLGGMMKEMRPRPCLEKQLDLLHKNLKPELQRL FT TPRMDCYDWDSFREKAEEAETVLASSQEYRAPPPPEQTMLASLAYVSPTKR FT KPATAPPKVAAVEVAQANDGNTQLAEMFSKMLDERLAKFEKKIRSESERSN FT KYEQKRHSTHHNKHQGNKPKSEALCYNCNKPGHFKRDCPEGKEEASA*" FT CDS 6680..7648 FT /product="Gypsy-21-I_NVi_3p" FT /translation="MEPAEKSFEEEARRETGAARMLRLAAATGIRWPAVHS FT EVVLQAFSELVTLSEVASRAREEAEQVADEWAEKEAEGRAAERAADXAEDD FT LDETAATVIEKSGRPTEDSPADNKPASEGAGAWGVTPEPSTPGASSLGDEE FT RADSWVELRKELCILRDTIQESEDWEVXLFTRFDAYLQAAGSQADPEAQVW FT ARRIEDRREEQVLLDGARRRRQDRRRALEAEERLDRLREKRERLEEKARLE FT KQAARVRAMRKKWPARRSSSARSTSGRRGQRNRAVTCAAIRECGDPTSARS FT GSATRRYWRRHHETTRRSRGDVEDGWSWRE*" FT CDS join(2615..5176,4981..6528) FT /product="Gypsy-21-I_NVi_2p" FT /translation="ARLSGGKRGSERVRTVDTRSRNSKKTENKAVSRSASP FT RVTRAEKRSVSAAQASRAEKRSARLGEACAGEPRTASESSPEKQERHGAII FT TPSPKNKNKARKAGESMQEPAALSSLSAETADEGDEAGQCTQEHREPGRSE FT RDADDVIDSILESQAGEEXNLPGDEDNPQEKRWWMEARVGSFRFRALYDNG FT ASRVCMGPIGLQLASALGRQLMPSHGRGAKLADGSYTKIVGYVWLPFRVGN FT VEKEVRVAIIPDLPTDCIVGVNFMLTFKAVLDPTTSKLHFKESKEYVNVEL FT AAMEGGALTLASMGLADVEEQQREALRALLDKIIGPETGEIGCTTWIEHHI FT EVQTTRPIKQKYFPVSRKIEEEMHEQVKAMLAAGIIEPSNSAFASPVVMVR FT KANGKYRFCVDFRKVNAITRNDAYPLPQMNAILRKLQQARYISTIDLSSAY FT HQIPLSEESKQYTAFTVPGMGLFQFKRLPFGLSEAGATFQRLMDKIITPEL FT QPHAFSYLDDVIVATETFEDHVKVLERVLKRIKEAGLTVNREKSVFCKEEV FT KYLGVLVNRDGFRPDPEKIAPIVNFPQPKNLKQLRRFHGMASWYRQFLEGY FT AALAEPLTRLTRKDQPFVWGEDQQSAFEAIKALVASAPVLHCPDFDQQFVI FT QTDASDTGLGVVLTQNINGQERVLEFASRVLTPAERNYTVSERECLAVLFA FT VRKFRQYIEGYEFKVITDHSSLRWLCNLRNPTGRLARWALELQGHKYTIEH FT RKGADHHVPDALSRMYEETEPEIATVSQDPQPQDCWYNGRFKLWGGGGGCT FT STGRILTSRAPWETTTMPGSSSSQRKNDCKCCKNVMISRRLDISAAKKHTR FT EWPVQIVGGGGRMYKYRPDPDITGAMGDDDDAWKLVVPAEKRLQVLQECHD FT QPTAGHFGREKTYKRVTEKYFWPRVYRDVARYVQRCRLCQLGKVEQRAPQG FT LMGKRVVRKPWQWVAGDIMGPLPKSAKGHEYALVFQDLFTRWIECIPLRKA FT NGKSILTALKERVILRFGAPEIFHSDNGTEFRNKTINEYLAAQGIHHSYSP FT PYHPQANPVERVNRTIKREIAAFAEENHRAWDEHLSEIAFSFNTAVHEATG FT ASPAMLNFGRQPAMPASLRVKEDQEAARQGAVGDWQARLAKLPALHERAKA FT LSGEAQDRHASYFDARRRAATFKVGDIVAKRRHVLSSAAGGIAAKLALPYV FT GPYTITAQVGSNTFELTGKDGKIEKLVPAEEMKIFYDTAEDEAGDEVDDEA FT DDNPTRPPRDDSEAEATRDPPAACGDEPELEPSDVGTGVEDKREKRTRKRV FT TAPAGACGGRAXKDSVGAAGEPPRRPRGRPRKIVTASEPPEEQRIVKRKPG FT PTEGLD*" XX SQ Sequence 7799 BP; 2171 A; 1939 C; 2390 G; 1293 T; 6 other; aactggcgcc tcaacgtgta gaggtcagag gggcgcgaag cacgcataag cgtacgtaaa 60 gacgaggaaa gtcaggtacg cggataaaca ttcggataga tattcggtcg agccgaattc 120 gtcgcggcgc gtagcaacag cgtacacgac gcgcagcgag cagcggcgag aggacggcga 180 gcgagcgagc ggcgccgacg taaatagcgt ttgtttacat cgcgcgcggg ctagcgcagt 240 arcggcgcga gccataatct gtgctgtgtg agcgcgatcg cataggactc ggaggggggg 300 cttctctcgg tgtttcgcga caagttgagg tgggcctagt ggcaggccta cgcgacgaga 360 cgtcgcaatt aattctttta tttttcacga attttccggg ttgaataacc ggaccgaaag 420 caccgagggg ctcgatactc gagtcgagct tgccgcgccg gcataaacag ggcttcagag 480 aggcgcgcaa cggttcgaag ttgcgcgcac caaaagtaaa tacgcctgct ataaacaggc 540 gacggctcgc cgtgcgggac ggcgagcagg ttagaaattt aaattttggc cgccgagcgc 600 ggggggcgat cgtactcaca cacaaagcgg ggagtcgaga gcgtgtgtta gttaggcatt 660 ttcgcgttta cagcatagga aaaattgtaa gaattcctgt aaaatagcaa caggagtggc 720 agtgtcaccg gtcggattat cgagtggctc gacacggggt cgacgaccac atcaattaat 780 tttttttctt tggttatgaa tttagcgagc gacaaggtaa gcgggatttg tgtcatcttt 840 ttgttttctt tttttctcac cgcctgctac ttgtcgcacc gactggggac acacaacggc 900 gtaagcgagc gtaacgcaca gttaggaatt gcgaaacaca tgcacacgcg cgctcgactc 960 tagcgagcaa tacttgttgc taaagcgtgc gcagagtaaa aaaatcagaa gaaacccaga 1020 taatgttgaa cggcatggaa gacgacgttg gaaactgggt actgaacctc accaggggtg 1080 agctgcggca aacattgtcg agcctacgac taaggcagga cggcaatgca gaagaccagt 1140 gggtgcgact gcacaagtgg ctgatgggcg aatatgaccc gtcggacttc gagccgacgg 1200 gcgaactagg agccgcggca gtgaaacaag agcgagcgga gagccgcaga tcattcttgg 1260 agaccacttg tacagaagag aaaagaatcg gcattaatcc gcacccactg gaagagctga 1320 tttcggaccc aggaacccaa ggacgactac agagaacgtt agacaacgcc gtcgaagaga 1380 acgtcatcgt accagaaaat gtacacgaag aagccggagc ggataaggac gagtttaaag 1440 actgcatcgc gggggtggac ggacaagagt acacgatgcc accaggcatg gtagattacc 1500 tggaaaaaat gcagcgaaca gttagaagtc tgcaagagga ggtgagccag ctcaagcaaa 1560 agcgcacgga agcgcgtgct tccagcacca ggacggacgg cgagtcgaga caaccgccgc 1620 caagagtcac gttcagcgac gcgacaaatt ttgagacgtc aaccccggca acaggcgcga 1680 gaggcgcaac tcccagaaac aggtggagag aatggcgcga agcggcagaa agacactcgc 1740 ccgcaccaaa ccgaggcacg acgagaacgt tcgtgagaga tatcgtgact atagtccgca 1800 agtgggactt gcgattttct ggragcatga cagatagaat cgacaggttt ctggatcgcc 1860 tagagtcgaa catgaggcta gcacagatga cagaggccga gaaaatggaa tcactgccac 1920 acgtgttcac gggtgttgcg gcggagtggt atgaaaataa taaagataac tggaggtgct 1980 gggaagaatt tcgaacggca gctacaaggt ggtacgggac gaacaggcgc ttccaacagc 2040 gcctggcctc cgacgccgaa aaacggtacc aaggcgcgca cgagcccgtg cgcgactacg 2100 tgacgtgcct cggcggaatg atgaaagaga tgaggcctcg tccgtgcttg gagaaacagt 2160 tagacctgct gcataaaaac ttaaaaccag aattgcaaag attgacacca cgtatggatt 2220 gctacgattg ggatagcttc cgagagaaag cagaagaggc cgaaactgta ctggcgagca 2280 gtcaagagta ccgcgcgcca ccgccacccg agcagactat gctagcatca ctagcgtacg 2340 tctcaccgac gaaacgcaaa ccagccacag caccgccgaa agtcgcagca gtcgaagttg 2400 cacaggcaaa cgacgggaac acgcaactgg cagaaatgtt tagcaaaatg ttggacgaaa 2460 ggctggcaaa attcgagaag aaaatacgat cagaaagcga gcgcagcaac aagtatgagc 2520 agaagagaca cagcacgcat cataacaaac accagggcaa taagccgaag agcgaagcgc 2580 tgtgttataa ctgcaacaaa cccggacact ttaagcgaga ttgtccggag ggaaaagagg 2640 aagcgagcgc gtgaggacgg tcgacacacg ctcgcgaaac tccaagaaga ccgaaaacaa 2700 ggcggtgagt aggagcgcga gccctagagt caccagggca gagaagagat cagtctctgc 2760 agcacaagct tcaagagcag aaaagaggag tgcacgcctc ggagaagcgt gcgcgggaga 2820 accgcgcacg gcgagcgaaa gctcgcctga gaagcaggaa cgacacggcg cgataataac 2880 gccgagtccg aaaaataaga ataaagcgcg gaaggcgggc gagagcatgc aggagcccgc 2940 cgcgctctca tcactgtcag cagaaacagc tgacgaggga gatgaggcag gccaatgtac 3000 gcaagagcac agagagcctg gcagaagcga gcgagatgcg gacgacgtaa tcgactcgat 3060 tctggagagt caagcgggag aggaarataa cctcccaggc gacgaagaca atccccagga 3120 gaaacgctgg tggatggagg caagggtcgg aagttttcga ttccgcgcct tatacgacaa 3180 cggcgcatct cgtgtatgca tgggtccgat cggactgcag ctagcgagtg cgctgggacg 3240 gcaactcatg ccgagtcacg gccgcggcgc caaactagca gacggtagct acacgaagat 3300 cgtaggctac gtctggctgc ccttcagagt cggcaatgtc gagaaggaag tccgtgtggc 3360 aataataccg gaccttccga cggactgcat tgtcggcgtc aactttatgc taacattcaa 3420 ggcagtgtta gatcctacga catcaaagct gcatttcaag gagagtaaag agtatgtcaa 3480 cgtcgagttg gcggcgatgg aaggaggcgc gctaactcta gcctccatgg ggctagcaga 3540 cgtagaagag cagcagagag aagcgctgcg agcgctcctc gacaaaatta ttggcccaga 3600 gacaggagaa ataggatgca cgacgtggat agaacaccac atcgaggtgc aaactacaag 3660 gccaataaag caaaaatatt ttccggtctc caggaaaata gaggaggaaa tgcacgagca 3720 ggtgaaagcc atgctcgcgg caggtataat cgaaccatcg aacagtgctt ttgcgagccc 3780 agtggtgatg gtgagaaaag cgaatggaaa atatcgcttt tgcgtcgact ttaggaaagt 3840 caacgcgatc acacggaacg acgcttatcc actgccacag atgaatgcga tactgaggaa 3900 attgcagcaa gcgcgctaca tttctacgat cgatttaagt agcgcgtacc accagatacc 3960 gctgagcgaa gagagcaagc agtatacagc tttcacggtg cctgggatgg ggcttttcca 4020 gttcaagcgc ctacctttcg gtttatcaga agcgggagcc acgtttcaaa ggctgatgga 4080 caaaataatt acgccagagc tgcagccaca cgccttttct tatttagatg acgtaattgt 4140 cgcaacagaa acgttcgagg atcatgttaa agtcctcgag cgcgtgttaa aaagaatcaa 4200 ggaagcaggg ttgactgtca acagagagaa gagcgtgttc tgcaaagaag aggtgaaata 4260 cctcggcgtc cttgtaaaca gagacggctt caggccagat ccggagaaga tcgctccgat 4320 agtgaatttt ccacagccga agaatctaaa acaactgagg cgtttccacg gcatggcatc 4380 gtggtatagg caatttctag agggctacgc agcgttggcc gagccactga ctcggctaac 4440 aagaaaagat cagccgttcg tatggggaga agaccagcag tcagcgttcg aggcgataaa 4500 agctctcgtc gcctcggcac cagtactgca ctgtccagat ttcgaccagc aattcgtcat 4560 acaaaccgat gcgagtgata cgggtctcgg tgttgtattg acgcaaaata taaatggaca 4620 ggagcgagtt cttgaattcg ccagccgagt gctcacgccc gctgaaagaa actatacagt 4680 cagcgagcgg gagtgcctag ccgtgctgtt tgcggtgcgc aagttccgcc agtacataga 4740 aggctacgaa tttaaagtaa taacagacca cagcagtttg agatggttgt gcaatctgcg 4800 caatccaaca ggcagattgg ctcgctgggc actcgagctg caagggcaca aatacacaat 4860 cgaacatcgg aaaggcgccg accaccatgt gccagatgct ctgtctcgga tgtatgagga 4920 gacagagcca gagatagcaa cagtgagtca ggatccgcaa ccccaagact gctggtataa 4980 tggccggttc aaattgtggg gggggggggg cggatgtaca agtacaggcc ggatcctgac 5040 atcacgggcg ccatgggaga cgacgacgat gcctggaagc tcgtcgtccc agcggaaaaa 5100 cgactgcaag tgctgcaaga atgtcatgat cagccgacgg ctggacattt cggccgcgaa 5160 aaaacataca agagagtgac ggaaaagtat ttctggccaa gagtgtatcg agacgtagca 5220 agatacgtac agcgctgtcg actctgccag ctgggaaaag tggaacagcg cgcaccgcag 5280 gggttgatgg gcaagcgcgt ggtgcgaaaa ccctggcagt gggtggcagg ggatatcatg 5340 ggccccttgc caaagtcagc caaaggtcat gaatatgcac tggtcttcca ggatctattc 5400 actcggtgga tagagtgcat acctctccga aaagcaaacg gcaaatcgat actcacagca 5460 ctgaaagaaa gagtcattct gagattcgga gcgcccgaga tctttcattc cgataacggc 5520 accgaattca gaaataaaac gataaacgag tacctggcgg ctcaaggcat ccaccattcg 5580 tactcgccgc cgtaccatcc gcaagctaac ccggttgaac gggtaaaccg aacgatcaag 5640 agagagatcg ccgcgttcgc ggaagaaaat catagagcgt gggatgagca tctcagcgag 5700 atcgcattct cgtttaatac cgcggtacac gaggctacag gtgcctcgcc tgcaatgctg 5760 aatttcggca gacaaccagc aatgccagcg tcgctcagag taaaagaaga ccaagaagcg 5820 gcgcggcagg gagccgtggg cgattggcaa gcccgcctcg ccaagctacc agctttacac 5880 gagcgagcaa aagccctgtc tggcgaagcg caagatcgcc acgcttcata tttcgatgct 5940 cgtcgccgag cggcgacctt caaggtcggc gatattgtcg ccaagagaag gcacgtactg 6000 tcgtcagctg ccggcggaat agcggcgaaa ctagctttac cgtatgtagg cccgtacacc 6060 atcaccgcac aggtcggctc aaacacattc gagctaactg gcaaagacgg aaaaatcgag 6120 aagctggtgc cggcagagga gatgaaaatc ttctacgata cggcagaaga cgaggcaggc 6180 gacgaagtag acgacgaagc ggacgacaac ccaacccgac caccgaggga tgatagcgaa 6240 gcagaggcga cgcgcgaccc gcctgcagcg tgcggagacg agccggagct cgagccgtca 6300 gatgtaggga cgggggtcga ggataagcgc gagaaaagaa caagaaagcg tgtaaccgcg 6360 ccagcgggcg cgtgcggcgg cagagcarcg aaagactcag taggagcggc gggggaacct 6420 cctcgccgtc cgcgcggaag accgcgaaag attgtaacgg caagcgagcc gccggaagaa 6480 cagcgaattg ttaagcgcaa gcccggcccg accgaagggc tcgactaaag cgaaagttag 6540 tgataaggcg gaaacatcac cccgaaagac gcgtgcgatg aaccgcgccg cgcgagcatg 6600 aacgcatgat tgtttatatt tttgatagct agctttcatt accataatca tcacccttct 6660 cttacccgaa cagccgaaga tggaaccagc agagaagagc ttcgaagaag aggcccgtcg 6720 cgaaaccggg gcggcgcgta tgctgcgtct cgcggccgcg acgggcattc gatggcccgc 6780 agtccactcg gaggtggtgc tacaggcctt ctcggagctg gtgaccctct ccgaggtggc 6840 gagccgtgcc cgcgaagagg ccgaacaagt ggctgatgaa tgggccgaga aggaggccga 6900 gggccgggcc gccgagaggg ccgctgacra ggccgaagac gacctcgacg agacggcggc 6960 cacggtgatt gagaagagtg gccgtccgac tgaagactcg ccggcggaca acaagccggc 7020 gagcgaaggg gcaggagctt ggggtgtaac ccctgagcca tcaaccccag gggcgagctc 7080 cctcggcgac gaagagcgcg ccgacagttg ggtggaactg cgcaaggagt tgtgcatcct 7140 aagggacacc atccaggagt cagaggactg ggaggtgkac ctttttacgc gcttcgacgc 7200 ctacctccag gctgccggat cgcaggccga cccagaggcg caggtgtggg cccgccgtat 7260 agaagatcgg cgggaggagc aagtcctcct tgatggggct cggcgacgga ggcaagaccg 7320 tcgccgcgcg ctcgaggcag aggagcgcct tgaccggctc cgcgagaagc gcgaacgcct 7380 ggaggagaag gcgcgtctcg agaagcaggc cgcacgcgtg cgcgccatga ggaagaagtg 7440 gcccgccaga aggagcagca gcgcgcggag tacctcagga agaagagggc agaggaaccg 7500 cgctgttacg tgtgcggcta tccgggagtg cggcgatccc accagtgccc ggagtggaag 7560 cgccacaagg agatactggc gtcgacacca cgagaccaca agaagaagta gaggagatgt 7620 agaagacgga tggagttgga gggaatagag taccttggtt ttctacaatc accgtgtttt 7680 tctttcgcta cgacgatcct tcctaccttg tcattacccc atttacagct tcaccaagtc 7740 aggcgcgcga cgatgttgca gagctgcttc gcagggctgc gcagcgtctg gaagggggg 7799 // ID PIGLET1NA_EI repbase; DNA; INV; 182 BP. XX AC PIGLET1NA_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE mPiglet-Ei1 (PIGLET1NA_EI), a nonautonomous pogo-like MITE from DE the single-celled eukaryotic reptilian parasite Entamoeba DE invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Tc1/mariner superfamily; MITE; pogo-like; KW PIGLET1NA_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-182 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; PIGLET1NA_EI; Positions 1 182. XX CC mPiglet-Ei1 is a representative copy of a subfamily (less than 50 CC copies) of nonautonomous pogo-like MITEs present in the genome of CC E. invadens. The TIRs are 23-bp long, have similarities with CC Piglet-Ei1 and pogo-like elements from other species and are CC flanked by TA TSD. Other mPiglet copies are 94-70% similar to CC mPiglet-Ei1 and range in size from 179 bp to 183 bp (excluding CC copies truncated at one end), indicating that they have amplified CC from a single ~180 bp master copy. This master copy was CC presumably an internal deletion derivative of a larger, CC autonomous Piglet-like element (but not directly Piglet-Ei1). XX SQ Sequence 182 BP; 48 A; 41 C; 41 G; 52 T; 0 other; cagtaaaacc tcccaagagc ggtacctctt aacagcgggt tctctcaaga gcgggcattt 60 tttggactga aactataaat agagttctct attgagcggt ttttaaggtt tgattggcta 120 aaaatgcccg ttcttgagag aacaaaattc ccacacccgc ccgctcttgg gaggttttac 180 tg 182 // ID L2-3b_Cis repbase; DNA; INV; 1883 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE CR1 Non-LTR Retrotransposon from Ciona savignyi. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-3b_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-1883 RA Smit A.F.; RT "L2-3b_Cis - CR1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000420 A non-autonomous L2-3 derivative that appears to have CC maintained a GAG protein (ORF from pos 57-905) (most of ORF2 is CC deleted). In this way L2-3b resembles the HAL1 elements in CC ancient mammals. Many other variations and subfamilies of CC L2-3_Cis seem present in the C savigny genome. XX SQ Sequence 1883 BP; 670 A; 288 C; 304 G; 616 T; 5 other; ttcattctat ttgaataagg tgttaagttt aattgatatt aaatttaatg aattaaattg 60 atcttaataa aaaaatgcat aaaaatactg gtgctaaatt gggcaaagag aagactggca 120 aaccaaatga gttgtcattt tttgctgagt tggwagaaaa actggataat ttaaccgctg 180 atcttaaact cggccaacaa tcaagtgaag cgaaatttct caccgtaata aacgaacttc 240 aagaaataaa agctggtcaa aactttttga cagccagatt agaagaaatc aaaatagaaa 300 tttctgaaca tgacaaacac ctaaaaaagc tgaaaaatga aaatggaatt ttaaaagaac 360 gaataaaaga actggaaaac aagtctgttg aaaactccaa tacaattgat gcattagaac 420 aatatggaag acgtgaatgc ttagagtttc atgggataag ttcacatgct gatgagtcaa 480 ctgatgatct ggtagtcgcc actgtaagga aacttggctt gcccatcaac aagaatgaca 540 taagtgtctc tcatagactg gctaagccat cacatgaccg accacgacca cctatcattg 600 caaaattttt aagtcggaag atacgagaca acatatatgg tagtcggtcc aaactaaggc 660 aagtcaatga aaatttaccc accggccaag caagaatata tattaatgaa agcctaacca 720 agagtaacaa agacagattc ataaaatgta gaacatactg taaagcaaac aaaataagat 780 acatctggac aagaaatgga acaaccttta taaaagaaaa tgatgggagc tcaacattta 840 gtatcaaaaa tgacaangac ctgcaacatc tgcttaaaca actaccacta ctaaaaacaa 900 cttaaaagtt tattagtatt gttttgctta atgtaattta tactttaaat gactgtaaaa 960 tattcatgta gaataagtaa cttaccttgt aaatccaacc agaatagtat tgcatgtgat 1020 atttgcaatt actggctcca tgtaaaatgt ttaaatatta ataagcatga tcttgccaat 1080 tttgccactg atcagcttcc atacttttgt ctaatttgca tgggggaaag tccacccctt 1140 gtttactact aactttgggc ataatgagat taaggtgtgg aagctgatca atgattccct 1200 ttaacatgcc tcctactcct tgttttgtaa aaaaaagtgc aattaataag taattgtatt 1260 taggtttagt atgcggtaca tcatacactc agttaatatt tatttattcg gagtattatt 1320 cacaatctta atatttgctc ttcatacagt ggtattcatt aaatggccat aacaagtaat 1380 tgtatatatg gcatagcatt gcaaattcag tgcaccccat aagattgtta ggtcacaagt 1440 tatattgtac tgtattgcgt gagaatgaga tgtaggctat tgttctgact tccgttctat 1500 ataactgtac tactgtttgc aacaaaaaaa ctattttaat gcttatatgt aacagtttat 1560 taatcattta atataaatca ttctgaattg tcaatcccta ctgttacaat tatggtacaa 1620 gtcaaatctt ttttaattat atatatatat atatatataa tatatatttt ttttttttct 1680 tttttttttt ttttttttca gggaggcgcc aaactagata acttttgggt tttttttggt 1740 gccttcctgt tttgctgtag ttttttgcca attgccacag tatcttttat ttttttttta 1800 ttgtcaactg ttgatttgta aattgttggt ggtaaacgct gcagcgaaag aaacaaaaat 1860 aaaaattgwa ttgwattgwa ttg 1883 // ID HHAI_BMA repbase; DNA; INV; 322 BP. XX AC M12691; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Brugia malayi Hha I-repeat family element. XX KW HHAI; HHAI_BMA; Repetitive element. XX OS Brugia malayi OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RP 1-322 RA McReynolds A.L., DeSimone M.S. and Williams A.S.; RT "Cloning and comparison of repeated DNA sequences from the human RT filarial parasite Brugia malayi and the animal parasite Brugia RT pahangi."; RL Proc. Natl. Acad. Sci. U.S.A 83(3), 797-801 (1986). XX DR GenBank; M12691; Positions 1 322. XX SQ Sequence 322 BP; 117 A; 32 C; 36 G; 137 T; 0 other; gcgcataaat tcatcagcaa aattaataaa actttcaatt aatcatgatt ttaattgaat 60 gtaagaattt aaattaaatt taaattcaaa tttaaatttt taatttttta aaaattttaa 120 aatttgttat agttttcctt cattagacaa ggatattggt tctaatttat caattttaat 180 tctaattaag tgccaaaact actaaaaaaa gcttattttg aaattaattg actacgttag 240 ctgcattgta ccagtgctgg tcgtgtattg tgttgtcatt ttatagttta aatattaaaa 300 tacgcttttg taattaagtt tt 322 // ID BEL-14_AA-LTR repbase; DNA; INV; 184 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-184 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 868-868 (2011). XX DR [2] (Consensus) XX CC LTRs identical to each other. XX SQ Sequence 184 BP; 55 A; 37 C; 26 G; 66 T; 0 other; tgttcagaat aaaaatttat tgctgtgtat tagctacttc ctaaccctca cgcaatttga 60 gtttatacgc ttcttttgtt tgctttgtta gttatataaa taaacagaaa cgttcggtcg 120 ctacagtaca tcgcgattct tatttcgacc aaaattccga aatatacgct acaattcgca 180 atca 184 // ID BEL-624_AA-LTR repbase; DNA; INV; 608 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-624_AA_; KW Pao_Bel_Ele61; BEL-624_AA-I; BEL-624_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-608 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 608 BP; 189 A; 101 C; 109 G; 208 T; 1 other; tgacgattat tgggacccct acggtacagc gtcgccgcac tatgtattgt catcttctgg 60 cagcctacgt cagtcgagag gaaaagtgaa ttgtcattgt aaacatataa catattgaac 120 tagttggaga ttagtggagt ttattataat ttacgatatt agttactgtt ggcatataaa 180 aacaaataat ttagcgttat ctttgattaa cggagctgta agtttgccag aatatkaatc 240 tactgttatg ttatcatgac tattactatg tacttacatg caaacaatgt tgcttacagg 300 ctttccctga tttgtacaat cagccagtaa ctattactag cgagtggatt taaaaattac 360 acggttactt agttaatatg taggtatgtc atgaatttat agctaggcga tattaatatg 420 aattttacca tcgctagctc cctaacctca ttcgactttg tgggattctt aggtgtaaga 480 tcaaatgagt tgcgaaaact aatttgtaag tactttatta tacgtaagct actttttcta 540 aacaataaaa cattatttca gttgagtctc tctcacacca cgaaaggact gcgaaaagtt 600 tctttaca 608 // ID Gypsy-10_TCa-LTR repbase; DNA; INV; 123 BP. XX AC singleUn_1004; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_TCa_; KW Gypsy-10_TCa-I; Gypsy-10_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-123 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; singleUn_1004; Positions 6929 6807. XX SQ Sequence 123 BP; 35 A; 21 C; 30 G; 37 T; 0 other; tgagcggtga gacggcgcgg cagcagtgtc gaatcaacgg cgacacggtg gttgtgtgct 60 tttactttgt atttaaaact ttttcttgaa aataaagtta aacttcgaaa aagtcttcta 120 aca 123 // ID Copia-34_AA-LTR repbase; DNA; INV; 143 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-34_AA_; KW Copia-34_AA-I; Copia-34_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-143 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 959-959 (2011). XX DR [2] (Consensus) XX SQ Sequence 143 BP; 47 A; 25 C; 28 G; 43 T; 0 other; tgttgaagag taggcgcttg atgaaccagc aagccatgaa atagttgtaa tttgacgtta 60 aataaatgta tgagttagat cgttcccaac ctttgaacag aacagttgtt cttaattaat 120 aatccgatct acggcatatc cca 143 // ID Mariner-28_HM repbase; DNA; INV; 2851 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-28_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2851 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1962-1962 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 478..2406 FT /product="Mariner-28_HM_1p" FT /translation="MPRQRQRTTTHRSYSIDALKTATEKIIKGELSIRKAA FT ASHQIPKNTLARYVLKVKNTPDGVKVNFRPSNEHLLVFNSSEELLLKKYIQ FT DAVNMHHGLSFTQVRRLAFEFAIANNKNVKCNWTEEKMAGIEWLHSFMRRQ FT SLALRTPEQTSLSRCTSFNKTNVGAFFSNLEGVINRFNLQAQDIFNLDETG FT LTSVHKPVKVLALKGQKQVGQATSAERGVLVTACCIVGATGTAIPPYLLFP FT RVFFKEHMIKGAPVGTKGAATKSGWTISDIFIDVLEHFVDHVRPTTDSPKL FT ILLDNHESHLSIAALNYAKTNGIMLLTFPPHCSHKLQPLDVSVFGPLKKHY FT NRACNDWMTNHPATPITIYDIGELLGTAFPLAFTPRNISQGFKAAGICPFD FT SQIFTEEDFLAATVTDRPDPTVSDTTIESTTLPSTSVSTTVEIPTSLTDRQ FT LISPEQIRPFPKAGMRKNNRKPRQKGKSAVLTDTPVKRQLEEAAQMRANKS FT KPKSIAPRKEKRKILIESSDEGSSEVEYADSSLDISELSSGEDTDDFNQPI FT TVDDFVLAMVHGAKKKSSRHYIAKIISKTQDGFSVKWLKNQNRTTKFYITD FT EELTFVPASDIVRKLPTPLFHGETGRFANLYSFQFDFSDYSIVM*" XX SQ Sequence 2851 BP; 961 A; 525 C; 500 G; 865 T; 0 other; gggtaaagtg aggttaactg ggtacaggtg ttaactgggc cactggctgt tcatacttct 60 gcagtttgaa tacaccgctt ttattgatgt ttcccaaacg atccactagg cggcactact 120 tgctgctgat tttgaatgca aatggcagcc attttaaaat tttattaagc atttgttgtt 180 cttggaagtc acatgtattg ttttactgca gcgttagaaa gttgatagaa tctgtgttta 240 tcaagttttg tgagttttaa taactgattt ctgaacttta atatagtgcc tatttgaata 300 attatcacaa attacgcaaa tatttaaaat actggtaagt ttttgtaaaa atatgaattt 360 tgatatagtg cttaacccca ccaatgaaaa attaatttaa tataatcatt taattaacat 420 ttatatcaac taattatgtt ttttttttat tccagtttat aaataaatta aattaaaatg 480 ccccgtcaac gtcaacgtac cacaacgcac agatcgtatt ccattgatgc tttaaaaacg 540 gcaactgaaa aaattataaa aggagaacta tcgataagaa aagcagcggc cagtcatcag 600 ataccaaaaa atacattagc caggtatgtg ctcaaagtta aaaatacacc tgatggagtt 660 aaggttaatt ttcgaccatc aaatgaacat ttactcgttt tcaactcaag tgaagaattg 720 ctattgaaaa aatatattca agatgcagtc aacatgcatc atggactttc atttacccaa 780 gttcgtcgct tagcttttga gtttgccata gcaaataaca aaaacgtgaa atgtaattgg 840 acagaagaaa aaatggctgg aatagaatgg ctgcattcat ttatgagacg acagagtttg 900 gcgctaagaa cccccgaaca aactagtctt agccgttgca ctagtttcaa taaaactaat 960 gttggagcat ttttttcaaa tttagaaggt gttattaacc gatttaatct acaagcacaa 1020 gacattttta acttagatga gactggtctc acaagtgtac ataaaccagt aaaagttttg 1080 gctttaaagg ggcaaaagca agttggccag gcaacatcag ctgaacgagg cgtcttggtt 1140 acagcatgtt gtatcgttgg agctacagga actgcaatac ctccatattt gctgtttccc 1200 agagtgtttt ttaaggaaca catgattaaa ggcgcacctg ttggaacaaa aggtgcagca 1260 acaaaatctg gatggacgat ttcagatatt tttattgatg tcctcgaaca ttttgtcgat 1320 cacgtaagac ctacaacaga cagtccaaag ttgatattat tagacaacca cgagagccac 1380 ttaagtattg ctgcgttgaa ttatgcaaag actaatggca taatgttgct gaccttccct 1440 ccgcactgca gccataagtt gcagcctctg gatgtgagcg tattcggtcc cttaaaaaaa 1500 cactacaaca gagcctgcaa tgattggatg acaaaccatc ctgcgactcc aatcacaatt 1560 tatgacatag gagaattact ggggactgct tttccgttag catttacgcc aaggaatatt 1620 tcccaagggt tcaaagcagc aggaatctgt ccatttgatt cccaaatatt tacagaggaa 1680 gattttttag ctgcaacagt tacagataga ccagatccaa ctgtttcaga tacaaccatt 1740 gaatcgacaa cattaccctc aacatcagta tcaaccaccg tagaaatacc aacctcgcta 1800 actgaccgtc aacttatttc tccggaacaa atacgaccat tccctaaagc cggaatgcgt 1860 aaaaataaca gaaaaccaag gcagaaagga aagtctgctg ttttaacgga tactccagtc 1920 aaacggcaat tggaagaagc tgcacaaatg cgtgcaaata aatctaagcc taagtccatt 1980 gccccaagaa aagaaaagcg taaaatatta atagagtcca gtgatgaagg atcatcagaa 2040 gtagagtatg ccgattcttc attggatatt tcagaattat cttctggaga agatacggat 2100 gattttaacc agcccataac cgtcgatgat ttcgtgcttg ctatggtgca cggagctaag 2160 aaaaagtcta gcaggcacta tattgccaaa ataatttcga aaacgcaaga tggtttcagc 2220 gttaagtggc tgaaaaatca aaataggaca acaaaatttt atattacaga tgaggagctg 2280 accttcgttc ctgcatccga tattgttcga aaattgccaa ctcctctatt tcacggagaa 2340 acgggacgat tcgccaattt atattccttt caattcgatt tttccgatta ttccattgtt 2400 atgtaaacat agtgttttat taatgtttaa tagttgaaat attcatacaa ttttatcaca 2460 tcgaaggtgt attttttcac taatttgtgt tttttttatt gtataatttt attgttatca 2520 cgtttatttt gtttattaat tctatctttc ttaactaaat atattatcaa aataaaaact 2580 aatattattg tagtttcact agtatgtacc cagtcaaccc catagggtgg cccagttaaa 2640 cccgtcactg tggttaactg ggccacggga cacatgtcta aaaaatgcaa tatttacaag 2700 gccacagcat atattaacta attcatttga atttattaga tggaatatta atattctcat 2760 agttatgaac agaaaaaata ttgaaaagca gtaatttgtt acaattacat cacccaaaaa 2820 aaaactgtac ccagttaacc tcactttacc c 2851 // ID Sola2-6_AAe repbase; DNA; INV; 4052 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; Sola2-6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4052 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1304-1304 (2011). XX DR [2] (Consensus) XX CC ~97% identical to consensus. TIRs are ~600 bp long. XX FH Key Location/Qualifiers FT CDS join(1197..2696,2683..3435) FT /product="Sola2-6_AAe_1p" FT /note="transposase." FT /translation="MKYKMNTINKGCVNVFSKKCSKSYRNLTESNIAKLVE FT LGFADENQLNKQFKICDSCRLQLAKRTALSSSDDLSSSEEEDVCMEEETIE FT ASASATSIQSTDSNMNIPSQESLEATSATSIESTNSNLNVSSKETLEATSA FT TSIETTDSNVPEYIQKVNIDIFNESISSILVAPIDKTKISSTKYVTKKYHE FT IVNSIRQNIFALNKEAKLDENRTGEFYEIIEQLKEKFTSEGTTRAEQFQLL FT TVLPKSWTAETIKNTFDTTLYTATEAKQLLNKSGVHSTIGPRATTGLDENI FT KHAVMQFYEDDETSRAMPGQRDCVTIRKDGKRQAVQKRLMTTTLREAYNRF FT QELNTEIQIGFSSFAKLRPKNCKLLTSSGTHNVCVCTIHENVNLITHSLKK FT YGLSNELKVFTDSLTCENATVDCFLRRCENCIDTTSLEKKLLEEMDEKFVD FT EIIFEQWVTTDRCDIETFTKQKEEFVSYFIQKLEKLIPHDLIKKEQSTFLK FT NKKKKIKKNNLQEREFVAICDFSENYTFVLQDEVQSHHWNAQQATIHPFAI FT YFKENGMLNHLSFVVISEDLRHDSISVNLFISKMINFIRQEKHLNLNKIYF FT MSDGAASQYKNRKNFSSLCQFKKNYDIDVEWHFFATSHGKGPCDALGGTIK FT RMATRASLAKEREHPIKNAKELFDWAQKRKEEQLTQIFFSYATTTEYEHIK FT EQLNEQYSKAITIQGTQKYHSFIPVSVDKIEVRQFSNCNDSKKIVNIMKKL FT " XX SQ Sequence 4052 BP; 1532 A; 621 C; 686 G; 1213 T; 0 other; gagcggttcc acagaaaatg acgacttttt ttcccaaatt tcatttttat attttttgat 60 ttggatgaaa ttttgcacat gctttcttta tgcccaaaaa tgcctttttg catcatcggt 120 tcgccatttt gactctagcc ttacttttga gaagggccta agaaaaaaat ccttaataat 180 tttcaaaaaa ttataactta gaaacggttt gtccgatcag tttggtgtct tccgcaaagt 240 tttaggttat tgttgggact atctggaaaa aaatacactg taaaaaaatg ttgtaatttt 300 ttatatttcg aaaataaagc ttaaaaatca attttctcaa aaatcgtatt tttgaatttt 360 ttttattttt ttatatgtta aagtagacaa aaaatgaagt cttttgcaca gtgggtcaag 420 atggagaaat catggacaaa aaagttatga tttttttaaa aaaaggtgaa tttttgaaaa 480 tggtcataat aaacttttta aatgtttttc aactcgattt tatgaaagta cgcaaaattt 540 acaacaaaaa aggtatacgg gaaaatcagg ctaactttta ccgttttaaa gatacagcga 600 ttttaataca aaattatgat ataattttca attttcggtc attttcgaat gtttttcgag 660 gctcataagc ctagaaattt gttcattgtc tgaaagagca gataattttt gactaaaatg 720 tttgaaaaaa tattgatttg gtaatttttc atggtatgag gcgaaataaa aaatgtgccc 780 tgtaaaaatg atttatttat ttttttaacg aaacacaact tcaacattca acattgtgat 840 attttgatat gtaattatca agaaaatatg gaacaaaatg ttaaaagtta aaaaataaca 900 aaataacgaa tataaagcaa tcaatacgtg aaatgcattt atatcgattt aagataatga 960 aatatttcca ttcgaaaaca aaagctttat gagtagagga gtaacgtgcc cagttgtctc 1020 acacaaatat acctgatgtt ttcttaaagt acagcacgtt ggtatatttt tgtggtcgga 1080 cagaaacgta cctaatagtt gattcaagta cccatgttgg tataaaaaag agctgatgca 1140 gcaaagacaa acaagcaagt gataatcaag tgaaaagtaa aagatagctt agaacaatga 1200 agtataaaat gaacaccatc aacaaaggtt gtgtaaatgt gttttcaaaa aagtgttcaa 1260 aaagttaccg aaatctcacg gaaagtaata ttgccaaatt agttgaacta ggatttgcag 1320 atgaaaatca gcttaataaa caatttaaaa tttgcgattc gtgtcgttta caattggcca 1380 aaagaacagc attatcaagc agtgatgact tatcaagtag tgaagaggaa gatgtttgca 1440 tggaagagga aacaatagaa gcatcagcat cagcaacttc aatacagtca acggattcca 1500 atatgaatat tcctagtcag gagtcattgg aagcaacgtc agcaacttca atagaatcaa 1560 cgaactccaa cttgaatgtt tctagtaagg agacattaga agcaacgtca gcaacatcga 1620 ttgagacaac ggattccaac gtccctgaat acatccaaaa agtgaatatc gatattttca 1680 atgagtcaat atcaagtata ctcgtagcgc cgatcgataa gacaaaaatt agttctacta 1740 aatatgtcac gaaaaaatat cacgagatag tcaacagcat tagacaaaac atctttgcgc 1800 taaataagga agctaaattg gatgaaaata gaaccggcga attctatgaa ataattgagc 1860 agctcaaaga aaagtttaca agtgaaggga cgacaagggc agagcaattt caactcttaa 1920 ctgtattgcc taaatcatgg acagccgaga caattaagaa cacatttgat acaactttat 1980 acacggcaac ggaagcaaaa caattactga acaaaagcgg cgtgcattct actattggtc 2040 cgagggcaac cactggactg gatgaaaata tcaagcacgc ggttatgcaa ttttacgagg 2100 atgatgaaac cagtcgagca atgcctggtc agcgtgattg tgtaacaatt aggaaggacg 2160 gaaaacgcca agctgttcag aaaaggctca tgactaccac attgcgagag gcctacaacc 2220 gattccagga attgaacact gaaatacaaa tcggattttc atcctttgct aaacttcggc 2280 ctaaaaactg taaactctta actagctcgg gaactcacaa cgtttgtgta tgtaccattc 2340 acgaaaatgt gaatctcata acccacagct taaaaaaata cggtttatct aatgaattaa 2400 aagtgtttac agatagtttg acatgtgaaa atgcaacagt agattgtttt ttacgccgat 2460 gtgaaaattg cattgacaca acttccttag aaaaaaagtt attagaagaa atggatgaaa 2520 aatttgttga tgaaattatt tttgaacaat gggtcacaac tgatagatgc gatatagaga 2580 cgttcacaaa acagaaggag gagtttgttt cctatttcat tcaaaagtta gaaaaattga 2640 taccacatga tttaattaaa aaagagcagt ctacattttt gaaaaataaa aaaaaataac 2700 ttgcaggaaa gagaatttgt agctatatgt gatttttcag aaaattacac ttttgtactt 2760 caggatgaag ttcaaagtca tcactggaat gcgcagcaag caactattca tccatttgcg 2820 atttatttta aggaaaatgg catgctcaat catttaagct ttgtagtaat ttccgaagat 2880 ttgcggcatg attctatatc cgttaactta tttatctcaa aaatgattaa ttttatacgc 2940 caagaaaagc acttaaattt gaacaaaatt tatttcatgt ccgatggagc tgcatcacaa 3000 tataaaaata gaaagaactt ttctagtctt tgccaattta agaaaaacta tgatatagat 3060 gttgaatggc attttttcgc aacatcgcat ggcaaagggc cctgcgatgc attaggggga 3120 acaataaaac gcatggctac aagggctagt ctagctaaag aacgtgaaca tcccatcaaa 3180 aatgccaagg aattatttga ttgggctcaa aagcgaaagg aggagcaact aacacaaatt 3240 ttcttttctt atgcaactac aacagagtat gaacatatca aggagcagct taatgaacaa 3300 tattcgaaag caataactat tcagggaaca caaaaatacc attcatttat tccagtttcc 3360 gtagataaaa tagaagttag gcaattttca aactgtaatg atagtaaaaa aattgtaaac 3420 attatgaaaa agttgtgaat tttcagtaaa tttaatttta aaatcgctgt atctcggaaa 3480 cggtagcaat tagcaaaatt ttctcatata ccttttttgt tgcaaatttt gcgtactttc 3540 ataaaatcgg gtcgaaaagc attcaaaaag tttatttaga ccatattgaa aaatcaacct 3600 tttttaaaaa atcataactt ttttgtccat gatttctcca tcttgaccca ctgtgcaaaa 3660 gactccattt tttgtctact ttaaaatata aaaaaaataa aaaaatcaaa aaacaatttg 3720 tgagaaaatg gatttttaag ctttattttc gaaataaaaa aaaattacaa catttttttt 3780 acagtgtata tttttttcag atagttccaa caataaccta aaactttgcg gaagacacca 3840 aaccgatcag acaaatagtt tctgagtttt aattttttga aaattgttaa ggcttttttt 3900 cttaggccct tctcaaaagt aacgctaggg taaaaatggc gaaccgatga tgcaaatagg 3960 catttttggg tataaagaaa gcatatgcca aatttcagcc aaatcaaaaa atacaaaagt 4020 aaaccgacat tcgaattcaa atggaaccgc tc 4052 // ID BEL-197_AA-LTR repbase; DNA; INV; 613 BP. XX AC supercont1.1408; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-197_AA_; KW BEL-197_AA-I; BEL-197_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-613 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1408; Positions 19138 18526. XX SQ Sequence 613 BP; 204 A; 106 C; 102 G; 201 T; 0 other; tgctgacgca caaagcgaat aggccctttg gataggtagg actgaacaga tccatactat 60 aataacctga cagattgcat ttttctatag aggagaaaaa tagctcacac gtttttgcca 120 caaactttgc aatataatta gtttttttcc ttattatttc aatataatta gccaataggc 180 gatgaaattg ttagcttagt tgcttctaat attaaaataa cctcttgagc attatatcag 240 gtaaatatcg ataacattga attcgttatg ttgatctaca tgtggtatac ttaacaaagt 300 tttcaattat gtatctattt aggaagaatc acgtttattt ggaagatgag catatttcac 360 tacccctacg aagttgtgat caggtgttaa ccgtgggtag cagtttgtat tggccaagca 420 aaatgtctaa gaattacatt gaaatacaca ggtatctact acaagctgtc ctagcacata 480 tttgaggtcg ctttctacta caccactgta aaactcaatt gtaagtacta ataatgttta 540 aactacacat tattaacatg caataaaatt tcagtcgagt ctcgcttcgc tcagactacg 600 gaaaagttta aca 613 // ID Gypsy-18_IS-I repbase; DNA; INV; 2529 BP. XX AC ABJB010623531; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_IS_; KW Gypsy-18_IS-LTR; Gypsy-18_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-2529 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010623531; Positions 2766 238. XX CC Positions [1618-2079] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(325..1239,1243..2514) FT /product="Gypsy-18_IS-I_1p" FT /translation="MDTILAGLPKIAVYLDDILVVSETAKEHERMLTDVFE FT RLDKAGLRVQANKCELFQESLEFLGHRIDKCGIYPSEVKVEAIHKAPAPTN FT KKELQSFSGMVNFYNIFLRGRSEIAEKLHRLLDDNATWKWDEEHQRAFEGL FT KKLLTSQSLLVHFNESAPLIISCDASPVGVGAVLAHCDETAKEQPIAYASR FT TLTKAERNYAQIDREALAVVFGVRHFHQYLAGRKFVILTDHKPLLGIFRRE FT KQIPDVLSPRMLRWSLMLSAYEYQLDYRKTQDHANADCLSRLPVPAFRLEA FT ETPGDVLLFEAAEPPVTAAEVAGCTSQDSRLSEVRKWIIDGWPKEGVPEDY FT GAYEIRQNELSVHRDCVVWGNRVVIPEVLQKRVLILIHANHPGVTAMKAIA FT RSYVWWPKMDAQIEEFVRHCKACQENRQSEPKAPTHFWTKPERAWSRVHID FT FAGPVDGAVFLVAVDAFSNWAEVEIMTSMHSSAVIERLRKMFATYRLPDLI FT VSDNGTAFTSRDMQSFLKLNGIEYMYTAPYHPASNGRAERMVRELKCALRK FT QRQGSLACKVSRFLFKQHSTPHSESGKTPAELMLGRSLRSGLSKLHPDAMD FT VECKRQPPEQSFQCGDSVYARNFRQGPKWLAARVLRRRGHVMYEVQTSDGS FT IHRRHRNHVRRAWEPTTPTGGTTASFALPGTPGSEPGPLEAGVSLPGPSSL FT GARSPAVETPVLRRSSRQRRPVHRYGIDD" XX SQ Sequence 2529 BP; 610 A; 629 C; 745 G; 545 T; 0 other; ctacattggc aacgaggcac ggtctggccc gtgcgggccg ctggaaacag gcgtgatggc 60 gttgggtggc cgcatttcgg agtttgacgt tcatggaaat tcgacatggg aagaatacgt 120 cgagcgcatc gaactgtact gtgccgtcaa caaggtgttg gaggccgtgg agaagagggc 180 ggtgctgctg agtgcagcta gcagcagctt gtggtcgacg atacaactgc cgaactgctc 240 acgataaaca ccgtacaagg gctctacaaa gttcgccgcc taccgtacgg cgtatctgtg 300 gcaccaggac tgtttcaaag ggcaatggac actatccttg ctgggctacc taaaatagct 360 gtgtacctgg acgatatcct ggtggttagc gaaacggcca aggagcatga acgcatgctc 420 acggatgtat tcgaacgcct agataaggct ggactcagag tgcaagcaaa caaatgcgag 480 ttgtttcagg aaagcctcga gttcctgggt catcgcatcg ataagtgtgg catctaccct 540 tccgaggtga aagtcgaggc catccataaa gccccggccc caactaacaa gaaggaactc 600 cagtccttct cgggtatggt gaatttttac aacatttttt tgcggggcag atccgaaata 660 gccgagaagc tgcaccggtt acttgatgac aatgctacct ggaaatggga cgaagagcac 720 caacgggcgt ttgaaggctt aaaaaaattg ttaacatcgc agtccttgct tgttcatttt 780 aacgaaagtg ctccgttgat aatctcctgt gatgcgtctc cggtgggtgt gggtgctgta 840 ctcgctcact gtgacgagac ggcaaaggag caacccattg cctatgcgtc aaggacgtta 900 acaaaggctg agcggaatta tgctcaaata gaccgagaag cattagccgt ggtgtttgga 960 gtgcgtcact ttcatcagta cctagccggg cgaaaatttg ttattcttac ggaccacaaa 1020 ccgctgctag gaatatttcg gcgagagaaa caaataccgg atgttctgtc accacgcatg 1080 ctacgatgga gcctaatgct gtcggcctac gaataccagc ttgactaccg caaaactcag 1140 gaccatgcga acgctgattg cttgagcagg cttccggtac cagcatttcg gttggaagca 1200 gaaacgccgg gcgacgtatt gcttttcgaa gccgctgagt aaccgccagt caccgccgca 1260 gaagttgctg gctgcacgtc gcaagacagt cggctatcgg aagtaaggaa atggatcatc 1320 gacggctggc caaaagaagg ggtaccggaa gactacggtg cgtacgagat aaggcagaac 1380 gaactgtccg tgcatcgtga ctgtgtggtt tggggaaacc gcgtggtaat ccctgaggtg 1440 ctgcaaaaac gtgtcctcat cttgatacat gctaatcacc cgggagtgac ggcaatgaaa 1500 gctatcgcga ggtcctacgt ttggtggccg aagatggatg cgcagatcga agaatttgtg 1560 cggcactgca aggcgtgcca ggaaaaccgg caaagtgagc cgaaagcgcc tacgcatttc 1620 tggacgaaac cggagcgagc atggagccga gtgcatatcg attttgcggg tcctgtcgat 1680 ggtgcggttt tcctagtcgc agttgatgct ttctcaaact gggcggaggt ggaaataatg 1740 acgtcaatgc actcatctgc agtgatcgag cggctgcgta agatgtttgc aacatacaga 1800 ctgccagatt tgattgtttc agataatggc acggcgttta ctagccggga catgcagtcc 1860 tttttgaagt tgaacggcat tgagtacatg tatactgctc cctaccaccc tgcgagtaat 1920 gggagagcag aacgcatggt tcgggagcta aagtgtgcac tgagaaagca gaggcaaggc 1980 tctctagcct gcaaggtgtc tcgcttcttg ttcaagcaac attccacgcc gcactcggaa 2040 tccggcaaga ccccagctga gcttatgctg ggcaggagcc tgcgctcggg cctgtcaaaa 2100 cttcaccccg atgcgatgga cgtggagtgc aaacggcagc ctccagaaca aagctttcag 2160 tgcggtgact ctgtctacgc aaggaatttt cgccaaggtc cgaagtggtt ggctgctcgt 2220 gtgctacggc gtcggggtca cgtcatgtac gaggtccaga catccgacgg gagtatccat 2280 cgacgccacc ggaaccacgt gcgaagggcc tgggaaccaa caactccaac cggcgggaca 2340 acagcttcgt ttgcccttcc gggaacgccc ggttcggagc ctggccctct ggaggccggg 2400 gtttcccttc cgggcccgtc aagtctgggg gcgaggtcac ctgcggtcga gactcccgta 2460 ttgaggcgtt cttcccgcca gcgtcgcccg gtgcaccgat acggcataga tgactagggg 2520 ggagggatg 2529 // ID Gypsy-77_CQ-I repbase; DNA; INV; 5064 BP. XX AC AAWU01044424; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-77_CQ_; KW Gypsy-77_CQ-LTR; Gypsy-77_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5064 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 533-533 (2011). XX DR Genome; AAWU01044424; Positions 13534 8471. XX CC Positions [3857-4327] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 800..4894 FT /product="Gypsy-77_CQ-I_1p" FT /translation="MPLTGTTDPYLPGSIHFHQYLEQLEWVFQHNKLAAND FT YKISFLAVCGQEVYTMLKKLFPGEDFKELSYAQLTDKLKKHYDKSDSEVIH FT SFKFWSRKQGQHEKAEDFILSVKVLAERCCFGDFKDRAVRDLLVMGVYDRS FT LQKRLFDEENLTAEKAERIILNQELSTNRTRILNNDGDRGSLVNRLGRRPE FT RTPRRVGFRNRSRSYSKNRSFSSDRNSGKPVFECSFCKRTGHTRKFCFKLN FT KSPRKQKHSVKFIDSTPANSASSGLFKRLKKDLSDDDDMPCLMISSVNKVN FT EPCYVEAVIEKRRLTMEVDCGSAESVISEELFNRNFKNLVVKPCNKRLVVI FT DGKRLTVLGKVSVHVILEGVQQQLDFIILRCEVDFIPLMGRTWLDVFYGNW FT RSTFAKPTLPTQRVNAVENDQVVVDLKSKFSTVFDKDFSSPIKGFVGDLVL FT KHNTPIFKKAYDVPLRLRQRVLDYLDYLEKDGIITPVEASEWASPVVIIIK FT KDQSIRLVIDCKVSINKVIVPNTYPLPLAQDLFAALSGSKVFCALDLAGAY FT TQLLLSDKSREIMVINTIKGLYIYNRLPQGASSSASIFQKIMDMILHGIPG FT VFCYLDDVLIAGDNYEDCRDKLYLVLERLAKANIKVNLKKCNFFVNRLPYL FT GHILTDQGLLPCPDKVETIREAKAPRNVSELKAFLGLVTFYSKFIPNLSPR FT LHTLYNLLKKNVNFVWSQECNKVFNDCKQFLLKPNVLEYFDPDKPIVVVTD FT ACSYGIGGVIAHEVNGEEKPISFTSFTLNNAEKNYPILHLEALAVVSTVKK FT FHKFLYGKDFKIFTDHKPLIAIFGKEGRNSISVTRLQRYVHEMSIYEYEIF FT YRPASKMGNADFCSRFPLAQEVPRELSFDFVKNLNFSQEFPIDFKQIAKFT FT SKDEVILTIIKYMREGWPERMEKRFRDIYSLHRDLEEVDGCVLFQDRVIIP FT EAMKDQILKLLHKNHSGISKIKQLARRSVYWYGMNGDIEEFVKACRVCHQT FT TSVTKKSPYSQWTPTTKPFSRIHADFFYFERKVFLVIIDSYTKWIELEYMR FT HGTDHRKVIKVFIGVFARFGLPDVLVTDGGPPFNAEAFVSFFEKQGIKVLK FT SPPYHPESNGQAERSVRLVKDVFKKFLLDPDIRGLDTDEQICNFLFNYRNT FT CLSKTHKFPSERMLSFKPKTLLDLINPKTNFKHNLTESQNSNPIDKHDDSL FT AETFTNLKNGDLIYYKNINPTDTRRWLPAKFLMHLSTNVLQISLGGRVLSA FT HKRQIKIPSQPRKNTPHFVFHGEQVLSPQHAPHTSNKRSREDAFEDSDAES FT DFYGFAADSSIFEESVPTSSRPEPVRQDAERVLPVRRSARQIKKKTKVDFK FT YY" XX SQ Sequence 5064 BP; 1486 A; 891 C; 1128 G; 1559 T; 0 other; ataaagtaag ttaatttaac tggcgacgag tgataaaatt tcggagtttt cgcgttattt 60 ggtgcctcgc gtgtgaaatt tggcgagtga aattatcgca agcagtattt tgggcagtga 120 ttttttttta agaaaccagc ggagttcatt ttttgtggtg cacatacatt tgcacgtttg 180 attggcacat attgttttgc tgtaaatttt aaagcccact gataattatt actgcagtgt 240 tcagtcgttt aaaaatataa aaaaattgag aacaaacgtt taacaaagaa agggcttgag 300 gtgacgccaa ccaggcgcca ttttgtagca tttcttcatt tcgcctccac aaaggagagg 360 aagatttatt cttcttttat gctttgtgtt tttttagcgt tggctttgtt cgctgttgag 420 gaaagttctt ctcaaccagt gatttgacac acacacaaac acacgtgtgt gagaagcttt 480 agtgagcagc ggctgaaaag cgagcggcgg ctgaaaggtg agcagcggct gagaagtgag 540 cagcagtttc gcgtgaaagt gagcatcagc ggcggcgaag cagtcggggc acggtgcagt 600 gttgcggagt ttacggtttt gcgagacatt ttggattttg atttttcctc cgtctggtga 660 cgtcgagtag attttgattt tgactaccga ccaaggagaa taaggtacgt gatttgtttt 720 ctgttttgtt ttgagattat tttgggtgat tttttttgtg atatattgtt tttttttgtg 780 tcattatttt tgattaagga tgcctttgac gggtacgact gatccgtatt tgcctggttc 840 aattcatttc caccaatacc tagaacagct tgaatgggta tttcagcata ataagctggc 900 tgcaaatgat tacaaaattt cgttccttgc tgtctgtgga caggaagttt ataccatgct 960 taaaaaactt tttccggggg aagattttaa ggaattaagt tacgctcaac tcacagacaa 1020 acttaaaaag cactatgaca agagtgactc ggaggtcatt cacagtttta aattttggtc 1080 tcgcaagcag ggacaacatg aaaaagcaga agattttatt ctttctgtga aagttttggc 1140 tgaaagatgc tgttttggtg attttaaaga cagagctgtg agagatcttt tagtcatggg 1200 tgtttatgat cgttctttac aaaagcgttt gtttgatgag gaaaatttga cagctgaaaa 1260 ggcggagagg atcattttga accaggagtt gtccactaac agaaccagaa ttttgaacaa 1320 cgatggtgat cgtggaagtt tggtgaaccg tttaggacga aggcctgagc gaactcccag 1380 gagggttggt ttcaggaaca gaagtaggag ctatagtaaa aatcgttcgt tttcgtctga 1440 cagaaattct ggcaagcctg tttttgaatg ttcgttttgt aaacgaacag gacacaccag 1500 aaagttttgt ttcaagctta acaaaagtcc tcgtaaacaa aaacatagtg taaagttcat 1560 tgattcgaca cctgctaatt cagctagttc agggctattc aaacgtttga agaaggattt 1620 gagtgatgat gacgacatgc cctgtttgat gatttcttcg gtgaacaaag tcaatgaacc 1680 ttgctacgtc gaagctgtca tcgagaagcg acggctgacc atggaggtcg attgtggttc 1740 agcagagagc gttatttccg aggaattatt caacaggaac ttcaagaatc tggtggtgaa 1800 accctgcaac aagcgattgg tggtcattga cgggaagcgt ttgaccgttt tggggaaagt 1860 ttcggtgcat gtcatcctgg aaggtgttca acaacagtta gacttcatca tcttgcgctg 1920 tgaggttgat ttcatccctc tgatgggtcg cacctggttg gatgtcttct atgggaattg 1980 gagatcaact tttgctaaac cgactttgcc tacgcaacga gtcaatgcag tggagaatga 2040 ccaggttgtt gttgatttga aaagtaagtt ttctacagta tttgacaagg atttttcaag 2100 tccaataaag ggttttgtag gagatttggt tttgaaacat aacacaccta ttttcaaaaa 2160 agcgtacgat gttcctttac gtttgcgaca gagagttttg gattatcttg attacttgga 2220 gaaagacggc attattacac ctgtggaagc ctctgaatgg gcctcacctg tggtgatcat 2280 tatcaagaaa gatcagagta ttaggttggt catagactgt aaggtctcaa tcaacaaagt 2340 gattgttcct aatacttatc cacttccact ggcacaagat ctttttgctg cgctttctgg 2400 atccaaagtt ttttgtgcac ttgatttggc aggtgcatac acacaacttt tactttctga 2460 caaatcaagg gaaattatgg tcataaatac tatcaaaggt ctttacattt acaacagatt 2520 gccacaagga gcttcctcga gcgcatctat tttccaaaag attatggaca tgattttgca 2580 cggaattccc ggtgtttttt gctacctgga cgacgttttg atagcaggag ataattatga 2640 agactgcaga gacaaacttt atttagtctt ggagagatta gctaaggcta atataaaagt 2700 gaatttaaaa aaatgcaatt tttttgtaaa tcgtttgcca tatctgggac acattttgac 2760 agatcaaggt ctgctcccat gtccagataa agttgaaact attcgcgaag cgaaagctcc 2820 acgaaatgtt tcagaactta aggctttttt gggattggta acattttatt ccaaatttat 2880 tcccaattta tctcctcgat tacacaccct gtacaatctt ttgaaaaaga acgtcaactt 2940 cgtttggtcc caggaatgca acaaggtctt caatgattgc aagcagtttt tgttgaaacc 3000 taatgttttg gaatattttg accctgacaa gcccattgtt gtggtcactg atgcttgctc 3060 gtatggaatt gggggagtga ttgcacatga agtgaatggg gaagaaaaac ctatcagctt 3120 tacttctttt accttgaaca atgcagaaaa aaactacccc attctacacc ttgaagcttt 3180 ggctgttgtg agcactgtaa aaaaatttca taaatttttg tatggcaaag attttaagat 3240 ttttactgac cataagcctt tgatcgctat ttttggaaaa gagggacgaa attcaatttc 3300 tgtaacccgt ttgcagcgat acgttcatga gatgtcaatt tatgaatatg aaattttcta 3360 tcgccctgcg tcaaaaatgg gaaatgctga cttttgctca aggtttcctt tggctcaaga 3420 agtccctaga gagttgtctt ttgattttgt caaaaatttg aattttagcc aggagtttcc 3480 aatcgatttc aagcagatcg cgaagtttac cagcaaagat gaggttattt tgacaattat 3540 aaaatacatg cgtgagggtt ggcctgagcg tatggagaaa cgtttcagag acatttactc 3600 tttgcatcgc gatctagaag aagtcgatgg ttgtgtgctg tttcaggatc gggtaatcat 3660 ccccgaagcc atgaaagatc aaattttgaa acttttacac aaaaatcatt ccggcatcag 3720 caagatcaag caactcgcga gaaggtcggt atattggtat ggcatgaacg gtgatattga 3780 ggagtttgtc aaggcctgca gagtgtgtca tcaaacaaca tcagtgacta agaaatcgcc 3840 gtactcacaa tggacaccta caacgaaacc attcagcagg attcatgcag atttttttta 3900 tttcgaaagg aaggtttttc tagtcatcat tgatagttac accaagtgga tcgagctgga 3960 gtacatgaga cacggtaccg atcacaggaa ggtaataaaa gtgtttatcg gtgtttttgc 4020 caggttcggg ttgccagacg tgttggtgac tgatggtggt ccaccgttta atgcagaagc 4080 ttttgtcagt ttctttgaga agcaagggat caaagtgctg aaaagcccac catatcatcc 4140 tgagagcaat gggcaagctg aaagatcagt tcgtttggtg aaagatgttt tcaagaaatt 4200 cctcctagat ccagacataa gaggattgga taccgatgag caaatttgca attttctctt 4260 taactacagg aacacatgct taagtaaaac acacaaattt ccctcagaac gaatgctatc 4320 cttcaaacca aaaactttat tggatttgat taaccctaaa accaatttta agcacaactt 4380 gacagaatca caaaactcta acccaattga caagcatgat gatagtctag cagaaacatt 4440 cactaatctc aagaatggag acctgattta ctacaaaaac attaacccta ctgacactag 4500 aagatggttg ccagccaagt ttttaatgca cctttctacg aatgttttac agatatcact 4560 tggtggtaga gtgctttcgg cgcacaaacg ccagatcaaa atcccctcgc aacctcgcaa 4620 gaacacaccg cattttgtgt tccatggcga gcaagttttg tcaccgcaac atgcaccgca 4680 cacaagtaac aaacgaagcc gagaagatgc atttgaagat tctgatgcag aatcagattt 4740 ttatggattt gctgctgatt cgtcgatttt cgaggaatct gttccgacgt catcgaggcc 4800 ggaacctgta cgacaggatg ctgaacgagt tttaccggtt cgtcgatcgg cgagacaaat 4860 taagaagaaa acaaaagtgg atttcaagta ctattaggag aatttttgaa ttttaaaacg 4920 tttagctcta aatgtagaat atttgttaaa agttgaattt tgtggaatat tattccaatc 4980 gattttgtgt tagctaaaca agcttatctg aaattaggat tttttcaaat tgtttgtttt 5040 caaacctttt aaaggataaa ggac 5064 // ID BEL-17_DPu-I repbase; DNA; INV; 6466 BP. XX AC ACJG01006857; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-17_DPu_; KW BEL-17_DPu-LTR; BEL-17_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-6466 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01006857; Positions 109088 102623. XX CC Positions [5329-5880] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 969..4973 FT /product="BEL-17_DPu-I_2p" FT /translation="MDTLQRSKAQVKRHFGTKIREIDTALADETTSEVRLK FT TLQAQIKQKWETMEKAYTEIQKLVADTNGDAASLSVLDKEMDEREEMRDAW FT IRVETELTERLEAASRAAAPSRSIPTTTKRVKLPDLKIKQFDGDVFKWRSF FT WDIFKINFDQNADLSDVQKYSYLREYLTGKALRAVEGFEVTDDSYPKAVKT FT LKELFGNKDIAVQAHMSRLYNLQNTKQATDTASLERLYTEVNTHVRSLETL FT GENIKAFGGFIVTIMLHKLPDELILIWNREKKRSATDLEAMLTFIRDELAA FT RDRCKQLKNSVGPAQQTTTRPEHQRHNQDRPTTSALTTGSQRQTESRADTG FT SNSNAPPCMFCFATTHRSFKCHLGVKERKEAMKKKGRCFICTTKGHRASEC FT DSKRSCTNCSGSHHVFICDVPHAKSAKKVGETTSAHSSSAAKDIFFKTITL FT ALTGPKGSEKFRCLLDAGSHRSYITSKASTALGLSKIDQERLKISGFGGRS FT STKNLPLVNVGLGSEIGGQPFIILSCLETNRICDPIPFVPNGPWTESMKAR FT HLNLAEDPSQSKGGATWKGEIDLLIGTEHYYRVLTGAQIRLTQDLMAVESI FT FGWFVHGAVCTSDEQQDQTIALLIRAQEEESERPETMLSRLFDLDGVHEQE FT YDAQAQLEADPVMMHFKSTVTKRGDRYCVKLPWKEHQEELPSNEGPAIQQA FT QSLIQRLKRAPKTLKSYHELILDHLKRGFIEEVLGEDPKSWNRILHYIPHH FT AVIRMDKVSTKIRQVFNASFGSPSLNDLLNSGPNLVPPMQDIVIRFRSRRV FT ALVADMEKAFLQIEVDEGDRDVLRFIWVEDPTANLLKFKIFRWKRVVFGVT FT SSPFHLAAVIQHHLSLQESEHPEVVRMLKRNCFVDDVIIGTENRVMGLKIA FT EEATEIAAKAGMPLRRWRTNDAELQESLAKLNQDERVEETMEFNNDQMKVL FT GTGWHRASDCLTFSTATLIEYCATVRHLSCLRTILSIAARVYDILGLISPV FT IITIRILMQNLWKLKLSWDEEVTEPVKKEFWSWVDQLNLLSTIKIPRWYHH FT GLPNKIENQQLHLFCDASTKAYGAVGYLRSEDEDGNVIISIVLSKVRVAPV FT KQITLPRLELLGGHVAVKVASAIKVALEIEQIETFYWTDSQVSLAWIKGEP FT SRWQQYVSNRVWFIQERSTPAEWCFVPGSENPADLCSRGVQPSILADPKNV FT WWTGPSWLSKSRSEWPSVKEDPDSNYDEINKEKKKSTIVLTNIAAPAALFD FT ETRYGTLTTLLRYTGYILRAYGNVIADRLDRQNALRQGLITSEEMQCARFY FT WMRHLQQQAFGAEAARQEQIHSS" FT CDS 5332..6465 FT /product="BEL-17_DPu-I_1p" FT /translation="MSPPFDVTGVDFAGPVYLRGSNAKSWICLFTCAVTRA FT LHLELVQTLTAVGFLMAFDRFVSRFGICRIVYSDNAKTFHRANKDLAEIWK FT GAEPEILIDLANRGITWKFIPEGAPWWGGFWERMVKTTKQALIKITGTAHL FT TEEELRTTLCKIEAVVNSRPLTYVYNDHEEAQPLCPSHFFAGRRLTTLPPI FT PQSATEVNATRRELVERVEYYGKLTADFKKRWHTEYLQERALHFKHQHRIE FT PIRIGEVVLIQEDNKKRQQWDIGVIKETIPSSDGVVRQVLLRTASGQLRRP FT VQRLCALEINENNPWEKELPPVVQNDAEEEEIHPSVSFAEAEEIHPSVSSA FT DAIPQHENIIELPPPEENQPDLLPDVPQDVQQGGS" XX SQ Sequence 6466 BP; 1910 A; 1486 C; 1540 G; 1530 T; 0 other; atatggtcct tcgaaccgga gataattttt caattttctt tgattaattt aaaaccactc 60 tagacgccat taaatattta agtcacgctg gttgaaactt tatccaacgg aatccattcc 120 atcattgatt gaattttgac agtatcactt gaatttcatt tagtattgag tttatcaagg 180 attagaagca aaacaatttg atttgagttt tttttttttt ttccaaaaat aattttcctt 240 tgtaacattt gccactgggc tgtcaattct cctcctcatt atctctttcg attttttttt 300 ctccggtccc attcatttgt gttgaacact tgtgtgcaat tgaactgctt gaaactgcct 360 cgcaagcgaa acagtcccga gtcgcgtttg cgacgggcca ctggaggccg agtcaacaag 420 tctcgccgat cgccacaaga aaaccaagag ccgaggcgaa gagttataat tctggaagtt 480 gctcccttac aagctgaaga tcaacgccga gttgtaattt taccgaaggc ttttaatccc 540 gaagaagaaa aattggagaa gattgagatc attaaagaga aagaggccga cccagagcca 600 attcaagaac caattcaaga gccaattcaa gaaccaattc aagaacaaac tgaagaagaa 660 gttttgcaaa ttctgggaca agaattcgac gaagagtaaa gcattttcaa caaacatatt 720 tttattaatt cttctttaag ctattttcag ataaaaattt caacgacggt ccgatactta 780 tttggtccga tttcaacgcc ggtccggttt cgagcctggt cctattgaac atttctggtc 840 cggagtaaaa tcctggtccg tttgatcaat tctggtccgg cgaaaggcct ggtccgaccg 900 aagatcctgg tcccgcgcaa gcctggccca atttcaattg tgctcaaacg caatcaacct 960 caagggacat ggacacactt caaagatcca aggctcaagt caagcggcat tttggcacga 1020 agatccgaga aatcgacacc gcgttagcag atgaaaccac atcggaagtg cgactcaaga 1080 ctctacaagc acaaatcaag caaaaatggg agacgatgga aaaggcctac acagaaattc 1140 agaaactggt ggccgacacg aacggagatg ctgcgagcct gtcggttctt gacaaagaga 1200 tggatgaacg cgaagaaatg cgagacgctt ggattcgagt cgaaacggag ctcacggaaa 1260 ggcttgaggc agcaagtaga gcagcagcgc cgtcaaggag cattccgact actacaaaaa 1320 gggtcaagct acctgacctg aagataaagc aatttgatgg agatgttttt aaatggagaa 1380 gtttttggga catattcaaa attaattttg atcagaatgc tgatctttcg gacgtgcaga 1440 aatattccta tctgcgcgaa tatcttactg gaaaggcgct gcgtgctgtg gaaggtttcg 1500 aagtgaccga cgactcttac ccgaaggccg tcaaaacttt aaaagaattg tttggtaaca 1560 aggatattgc agtacaagct catatgagtc gtctgtacaa cttacaaaat accaagcaag 1620 caacggacac cgcctcattg gaacgattgt acaccgaggt gaacactcat gttcgttctc 1680 tcgaaacact tggcgaaaat atcaaggcat tcggaggatt catcgtcacg ataatgctcc 1740 acaagctgcc cgacgaattg attctcatct ggaaccgaga gaagaaaaga agtgccaccg 1800 atctcgaggc catgttaacg ttcatccgtg acgaattggc cgctcgggat agatgtaagc 1860 agttgaagaa tagtgttggg ccagctcaac agacaacaac cagaccagag catcaacgac 1920 acaatcaaga ccggccaact acatccgctc tcactactgg cagccagcga caaacggaat 1980 caagagcaga cactggatcc aacagcaatg ctccaccttg catgttctgt ttcgccacaa 2040 cccatcggag tttcaaatgc catcttggag taaaggaaag gaaggaagcc atgaagaaaa 2100 agggcaggtg ttttatctgc acgacaaaag gtcatcgggc ctcggaatgt gattcaaagc 2160 gatcatgcac gaattgcagt ggatctcatc atgtattcat ctgcgatgtc ccacacgcaa 2220 aatcggcgaa gaaagtaggc gagactacaa gcgctcattc cagcagtgcg gccaaagaca 2280 ttttttttaa aacaattacg ttagctctga cgggacctaa aggatcagaa aaatttcggt 2340 gcttgctgga tgctggaagt caccgttcat acatcacatc taaggccagc acggcgctcg 2400 gacttagcaa aatcgaccaa gaacggctga aaattagtgg atttggaggt cgatcgtcta 2460 ctaaaaatct accgctcgtg aatgttggat tgggaagcga gattggaggc cagccgttca 2520 tcattcttag ctgccttgaa acaaatcgca tatgcgatcc catccccttc gttccgaatg 2580 ggccctggac agagagcatg aaggccagac acctgaatct ggcagaagat ccttcccaat 2640 ccaagggggg agccacatgg aaaggagaaa ttgatctgct cattggaact gagcactatt 2700 atagagtgct aaccggtgcg caaattcgac tcacgcaaga tctgatggcc gttgaatcga 2760 tattcggttg gtttgtacat ggagctgtgt gcacgtccga tgagcaacaa gaccaaacga 2820 tcgctcttct catccgagca caagaagaag aaagtgagag accagaaaca atgcttagtc 2880 gtctatttga tttggatggc gtgcatgaac aagaatacga cgcgcaggct cagctggaag 2940 cagatccggt catgatgcac ttcaaatcga cagtcaccaa acgaggagat cgctactgtg 3000 tcaaattacc atggaaagag catcaagagg agctgccgtc aaatgaaggt ccggctatac 3060 agcaagcgca gagtctcatc caacgattga aacgagcgcc gaaaactcta aagagctatc 3120 acgagctcat tttggatcac ctcaaaagag gattcatcga agaagttctt ggagaggacc 3180 cgaaaagttg gaacagaatt ctccattaca tccctcatca cgctgttatt cggatggata 3240 aggtcagcac aaagattcgt caagtattca acgcttcttt cgggagcccg tctctaaacg 3300 atctactaaa ttcagggcca aatctagtcc cacccatgca ggatattgtc atcagattcc 3360 gcagtcggcg agtggcactc gttgccgata tggagaaagc atttttacaa atcgaagtgg 3420 atgaaggcga tcgcgatgtc ttgcgattca tttgggtgga ggatcctaca gcaaatttat 3480 taaaatttaa gatttttcgc tggaaacgag tcgtgtttgg agtgacaagc agcccgttcc 3540 atcttgctgc agtcatccaa catcatctat ccttacagga aagtgaacac ccggaagttg 3600 tgagaatgct caagcggaac tgcttcgtag atgacgtcat aattggcact gaaaatcgcg 3660 tcatgggcct gaaaatcgca gaagaagcca ccgagatagc cgcgaaggcc ggaatgcccc 3720 tccgacgctg gcgaaccaat gatgctgaac tacaagagag tttggccaag ctgaatcaag 3780 acgaacgagt agaagagacc atggaattca acaacgatca gatgaaggtt ctcggaacag 3840 gctggcatcg ggcaagcgac tgtcttacct tctcaacagc aactttaatt gaatattgtg 3900 ctactgttcg tcatctatca tgcctacgaa ccatcttatc aattgctgcc agagtctatg 3960 atattcttgg attaatatcc cccgtgataa tcacaatccg cattttgatg cagaatttat 4020 ggaagttgaa actaagctgg gacgaagagg tgaccgagcc agtgaaaaag gagttctgga 4080 gttgggtcga tcaactcaat ttgctgtcaa caatcaagat tccaagatgg tatcatcacg 4140 gattgcccaa caagatcgaa aatcaacagc tacatttgtt ctgcgacgcc agcaccaagg 4200 cttatggagc ggttggatat ctgcgaagtg aagacgaaga tggaaacgtc atcatttcca 4260 tcgtactcag caaggtacgt gttgcaccag tcaaacaaat cactctgccg aggttggagc 4320 ttttaggagg ccatgtagca gtgaaagttg cctccgccat caaagtcgca ttggaaatcg 4380 aacaaatcga aacattctac tggacggact cgcaagtttc tctcgcctgg atcaagggcg 4440 agccaagtag atggcagcag tacgtctcga atcgagtgtg gttcatacaa gagcgatcga 4500 ctccagcaga atggtgtttt gtgcccggca gtgaaaatcc agcagattta tgttcaagag 4560 gagtccaacc gtcgatttta gccgacccaa agaatgtttg gtggactggg ccgtcttggt 4620 tgtcaaaatc aagatcggag tggccatctg taaaggaaga tcccgattca aactacgacg 4680 aaatcaacaa ggagaagaag aagtcaacga tcgtgttaac taatattgcc gcgcctgcag 4740 ctttgttcga cgaaaccaga tacggaacgt tgacaactct cttaaggtac actggatata 4800 tccttagagc ttatggtaat gtgattgccg atcgactaga ccgacaaaac gctcttcgac 4860 aaggattgat tacatcagaa gaaatgcaat gcgccaggtt ctactggatg aggcatctgc 4920 agcaacaagc gtttggtgcg gaagcggctc ggcaagaaca aattcattca tcgtgactca 4980 aagctatcga attttcatcc ttatcttaat aaaaatggac tcattacatt gaagaatcga 5040 actaagcttt ctagatcgct tccggaacag cctgaggttc ctattctgcc gaatcgactc 5100 ccgaatgaaa agagggagcc gcatttcatc accctgcttg ttcgtgatgt gcatcggcgt 5160 cttttgcatg ctggtgttcg agatacgttg acagcattac gtcaaacgag ctggattcta 5220 aagggcaggc aagtggtgaa gaagattttg gcaagatgtt cgacatgcaa tcgagtcaac 5280 agaaagccgt atgaccagcc gactggacct ttgccagtcg atcgatgcac gatgtcgcca 5340 ccgttcgacg tcaccggcgt cgatttcgct ggaccagttt atttacgtgg atctaacgca 5400 aaatcttgga tttgtctctt cacatgcgca gtgacccgcg ctcttcattt ggaattggtg 5460 caaacgctga cagccgtcgg tttcctgatg gcgttcgacc gtttcgtatc cagatttggg 5520 atttgtcgaa tagtctattc agacaatgcg aagacatttc accgagccaa caaggattta 5580 gccgaaattt ggaagggagc cgaaccagaa atcttgatcg atctcgccaa ccgcgggatc 5640 acgtggaaat ttatcccaga aggcgcgccc tggtggggtg gcttctggga gcgaatggtg 5700 aaaactacaa aacaagccct tatcaagata accggaacgg cccatttaac cgaagaggag 5760 ctccgaacta cactttgtaa gatcgaagcg gtggtgaatt cgcgccctct cacttatgtc 5820 tacaacgacc atgaagaagc gcagccattg tgtcccagtc atttttttgc tggacggcga 5880 ttgacgactc tacctcccat ccctcaatcc gccacggaag ttaatgcaac ccgaagagag 5940 ctggttgagc gagtcgaata ctatgggaag ctcacagctg atttcaagaa aagatggcac 6000 acggaatatc tgcaagaacg agctcttcat tttaagcacc agcatcgaat tgaacctatt 6060 cgaattggag aagttgtcct gattcaagaa gacaataaga agcgtcagca gtgggacatt 6120 ggagtcatca aggagaccat cccgtcgtct gacggcgtcg tccgccaagt tcttctccga 6180 actgcttccg gccagttgcg ccgacccgtg caacggcttt gtgcgctgga aatcaacgaa 6240 aacaatccat gggagaagga gctcccaccg gttgtgcaga atgacgccga agaggaggaa 6300 attcatccgt ccgtgtcatt tgccgaagca gaggagattc atccgtcggt gtcatctgcc 6360 gacgcgatcc cacaacacga aaacattatc gagctcccac caccggaaga aaatcaaccg 6420 gatttattgc cagacgttcc tcaagacgtc cagcaagggg ggagtg 6466 // ID Gypsy-22_DYa-LTR repbase; DNA; INV; 350 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_DYa_; KW Gypsy-22_DYa-I; Gypsy-22_DYa-LTR. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-350 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 3236743 3236394. XX SQ Sequence 350 BP; 110 A; 58 C; 95 G; 87 T; 0 other; tgtcggagcg cgtccccagg aattttggga accgttacca tgatttcaca atctgatgaa 60 attgtagcat tgtggtgaaa ttctgaaaag cactgcagca gaggcgcaga cggcctctgc 120 agctgagtga ggtccgatct agttgtcata aatagtgtgt tagagaggga tgacaatata 180 gtgaacgcca gtgttgtgta tagaccgaat agaatatgag gaggattaga aaaaggagaa 240 aatcaggtca gtgaagttta gctggtggag taaatcggaa cttgctgtgc tgctatacca 300 aaattgtgaa acttacgaat aaaatggata agactcccct attgccgaca 350 // ID Outcast-18_AAe repbase; DNA; INV; 5647 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Outcast non-LTR retrotransposon from Aedes aegypti. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; KW Outcast-18_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5647 RA Kojima K.K. and Jurka J.; RT "Outcast clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1432-1432 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 290..1864 FT /product="Outcast-18_AAe_1p" FT /translation="MNGDVDNGIEIEDCSISEGKTFDERNKRKLHEKDDID FT PKRLQTEMGRKKENGGIATGINTTDANMADTGXITNTTHQTRTQKLIKYTT FT KDLTQTQTQPTRMTHTQQDNCDTHDTRSQRDKNKNEKDVLKLDQWEKNNVH FT TVFFIERQNKDEKSIHPMLLAKILHNVGVKQYKELKSAGYGRFRITFHKPK FT DAEVLINSALLKENFKFNIYIPNMLKQTIGVVRNVPPTITDEEIINNIQAG FT KKKVFKVERINKMKKIDGDNVLVPTFTIKIYVEGQELPEEVSIYGIVAKVD FT FYIFPVKQCVRCWRFGHRIKTCRSSRPRCVVCGLDHESDICWDAPTCVHCN FT GSHKANDRNCPERIRQDMILQAMAREKLTFTEANIKFPRNRSVQDRLQPAI FT NSLQEFPTLPSNPNHEKQKQNNTVNKYQQSTNLNSIQNTNIEEIVNRVKTE FT LIKQLNLDTIMDKIKTMQETISNNNQLKQSGKKSKDAQTLLTEIMNEMKSI FT SNPEVTVTQKSNSKPKQNGQVQKSQVNTT" FT CDS 1830..5450 FT /product="Outcast-18_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDRYRNLKLIQHNITSIRPLETRKILKNFLSGNNIDI FT AILQEIWMKPNEDFKFPGYNFIKSLRINGYGGVGLILKNEIKYEEIKLPKL FT EPIEAIAIKTLNTITEMIIISIYIPPLPINNNDIKEPLKKLFDSIENWNTT FT TILAGDFNAHSKIWNNKQQDCARGKLLSSLIDITNLVILNDGAPTLIKSPN FT TTPSTIDLTIVTPDIAGKIEWKVIDDEMESNHKIIAIDIQNTAKQYTTNKE FT VLNKKQAIENINKIVPNEIHNQYQILTSLKEQIKNAKFKPKNKKQTPKPWW FT NDEINNLYRIKNQHLIEYFRNQTYENYLKFKKSKAKLKQTIRRESRKSWNK FT LIETINPDMDQRNLWNTVRRISGGFPSKNNLVLLNDTNMANSFMNFNFPEI FT VTDLEFTPTTTDEKIIITYAEVLQLINSKKDHSSPGIDEISFYMLKNINPL FT LLSKIVELLNIVVNSGEIPDDWRDIKIIPILKHLKNPNNINSYRPLAMLCV FT ILKIINFVIKRHLNSFLEQNNIIPDYSFGFKKKTSAINCVNTLISHVHNAK FT RNKKVVIATFLDLSKAFDNVDIKILLKKLSELKVPTAIVNWIYYYLKKRNT FT KLILNDGTVLTTTTNKGLPQGCPLSPILFNIYTTELHSLAEDDKILFQFAD FT DFTVMIIADNLQLATEKMNLFLSKIAEKLNNLGMIINSDKCATIIFTNKFH FT PNTNISINNNPIEVKAFHKFLGIQIDHKLNYKTHIDATVVKAKKKINILKM FT VGKRKNGAHPETMLKVAKAIVQPHLDYGLSIIAKASKTVFAKLETVQHLYI FT RTCMRFLRSTPNHVILAESGILPLKERAEYLTLKEIIKTLYYQNSPLTKFI FT KSFITSEDLPTQCSYLERIASINNYHLIQLGPKIPFEKTTKLKVFTEIDNL FT KKKETTIAVQRQMTIELISHKYKNDYKIFTDGSKIDGKAGYGIYDSREKIS FT YSGRLKTQFTIMSAEIFAILKAVEYLMTKNVKNAVILTDSKSAAILIKNNT FT TNDNFLVGKLIREINKSQIEKLTIQWIPGHAGLIGNERADLAAKLGTNKQQ FT IENYPLTKDDLILNLKIETKCYWNQRYKIISEEKGIYHYMIDQNVRSKPWF FT SDFKLSTQHTITINRLRTGHLATKDRLNAWGLVPNDKCEHCNVKEDIIHIL FT HYCSKFDHIRREYPLLQNKEDLITILAGTNFKKIKEIAKFIGETGTNV" XX SQ Sequence 5647 BP; 2359 A; 934 C; 940 G; 1413 T; 1 other; cattctcgca ccaacttgct ttgcagcaac acgtggaaag taagagctga gtcttggttt 60 gctaccgcga agtcgatttt ggttgtgaaa aagaaatctg gattgtattt tcttccatct 120 tggatgctag tggtgaaagt gtaaatatta tatcatagtt ttcgcgtttt tttttttaat 180 tccgttagtt taaggccctc caatttatta gcataagtgg tgttggttag attagttaag 240 aataaagtgt acatagttgt tataagagaa aaagtggtgt agatagtgga tgaacggtga 300 cgtcgacaat gggatagaaa tcgaggattg ctcgatttcg gaaggtaaga cttttgacga 360 gagaaataaa agaaaattac atgaaaaaga cgatattgac ccaaaaagac tgcaaacaga 420 aatgggacga aaaaaagaaa atggcggaat tgcgacagga ataaacacga ctgatgccaa 480 tatggcggac actggaawca ttacgaacac aactcatcaa acacgcacac aaaagttgat 540 caaatacacg acaaaagacc tcacacaaac acagacacaa ccaacaagaa tgacacatac 600 acaacaagat aactgcgata cacatgatac cagatcacag agagacaaga acaagaatga 660 aaaggatgtt ttgaaactgg accaatggga gaaaaataat gtacacacag tatttttcat 720 tgaacggcaa aacaaagatg aaaaatcaat ccaccccatg ttgctagcca agatccttca 780 caatgtcgga gttaagcaat acaaagagct caaaagtgca ggttatggca gattcaggat 840 tacattccat aaaccgaaag atgctgaagt attgattaac tccgcattgt tgaaggaaaa 900 cttcaagttt aacatttaca taccgaacat gctaaaacaa acaataggag ttgtccgcaa 960 tgtaccacct acgatcacag atgaggagat aatcaataac attcaggcag ggaagaagaa 1020 ggtctttaaa gttgaaagaa tcaataagat gaagaagata gatggagaca atgtgctagt 1080 tccgacattt accataaaaa tctatgttga aggacaggag cttccagaag aagtatcgat 1140 atatggaata gtagcaaagg tcgatttcta cattttccca gtgaaacagt gtgtgagatg 1200 ttggaggttt gggcacagga ttaagacatg caggagttca aggccgaggt gcgtggtatg 1260 tgggttggat cacgaaagcg atatttgctg ggatgctcca acttgtgtac attgcaatgg 1320 tagccacaaa gcaaacgata ggaattgtcc ggagagaata agacaagaca tgattctaca 1380 agctatggcg agagagaaac taactttcac tgaggcaaac ataaaatttc cccgaaacag 1440 atcagtacag gacagactac aaccagcaat aaactcacta caggaattcc caaccctacc 1500 gtcgaaccca aatcacgaaa aacaaaagca aaataataca gtaaataaat accaacaaag 1560 cacaaatttg aacagcattc aaaacacaaa cattgaagaa atagtaaaca gagtcaagac 1620 agaattaatt aaacagctaa atctagacac aataatggat aaaatcaaaa caatgcagga 1680 aaccatttcg aataacaacc aattgaaaca atcaggtaag aaatcaaaag atgcacaaac 1740 attacttaca gaaatcatga acgaaatgaa atcaatttca aacccggaag tgacggtaac 1800 acagaaaagt aattcaaaac caaaacaaaa tggacaggta cagaaatctc aagttaatac 1860 aacataacat aaccagcata agaccactgg aaactcgcaa aatccttaaa aacttcctta 1920 gtggaaataa tatagatatt gctattttgc aagaaatatg gatgaaaccg aatgaggatt 1980 ttaaattccc tggttacaat tttattaaat cactaagaat aaatggatac ggaggagttg 2040 gattgatatt gaaaaatgaa ataaaatatg aagaaattaa actaccaaaa ctagaaccca 2100 ttgaagcaat agcaattaaa acattaaata caattacaga aatgataata atatcgattt 2160 acataccacc tttaccaata aacaataatg acattaaaga acctttgaaa aaactctttg 2220 actcaataga aaactggaat acaactacaa ttttggcagg tgactttaac gcacatagta 2280 aaatttggaa taataaacaa caagactgcg caagaggaaa attattatca agcctgatag 2340 acataacaaa cctggttata cttaatgatg gagcacccac acttataaaa tcgccgaata 2400 caacgccttc tactatagac ttaacaatag ttacaccaga tattgcaggt aaaatagaat 2460 ggaaagtaat tgacgacgaa atggaaagta atcacaaaat aatagcaatt gatattcaaa 2520 atacagctaa acaatacaca acaaataaag aagttttgaa taaaaaacag gcaattgaaa 2580 acattaacaa aatagtccca aacgaaatcc ataatcagta tcagatctta actagcttaa 2640 aagaacaaat taaaaatgct aaattcaaac ctaagaacaa aaaacaaact ccaaaaccat 2700 ggtggaatga tgaaatcaat aatttatata gaattaaaaa ccaacacttg attgaatact 2760 tcagaaatca aacatacgaa aattacttaa aattcaaaaa atctaaagct aaacttaaac 2820 aaacaattcg cagagaatca agaaaaagct ggaataaatt aatagagaca ataaatccag 2880 acatggatca aaggaattta tggaatacag tcaggagaat cagtggtggt tttccaagta 2940 aaaacaattt agtactattg aatgatacaa acatggcaaa ctcatttatg aattttaact 3000 ttcccgaaat agtcaccgat ttagaattta caccaacaac aacagacgaa aaaataataa 3060 ttacatacgc agaagttttg caattgatta actctaaaaa ggatcactct agccccggaa 3120 tagacgaaat ttcattttac atgttaaaaa acatcaatcc attgttatta agtaaaatag 3180 tagaattgct gaatatagtt gttaatagcg gagaaatccc agacgactgg agagacataa 3240 aaatcatacc tatcttgaaa catttgaaaa acccaaataa tattaactct tacaggcctc 3300 ttgcaatgct ttgtgtaatt ttgaagataa tcaactttgt gattaaaagg cacctcaatt 3360 cttttttgga acaaaacaat attataccgg attattcttt tggatttaaa aagaaaactt 3420 ctgctataaa ttgtgttaac acattaatct cacacgtcca taacgcaaaa agaaataaaa 3480 aagtagtaat tgcgacgttt ctagaccttt ctaaggcttt tgataatgtt gatataaaaa 3540 tattgctaaa aaaactgtca gaacttaaag tccctacagc tattgttaat tggatttact 3600 actatctaaa gaaaagaaat acaaaattaa tactcaatga tggaacagtt ttgacaacaa 3660 ctaccaacaa gggactgccc cagggctgtc ccctatcccc gattttattc aatatataca 3720 caacggaact ccatagttta gctgaagacg ataaaatctt atttcaattt gccgatgatt 3780 tcactgttat gataatagct gataacctgc aactagccac agaaaagatg aacctttttt 3840 tgtctaaaat tgctgagaaa cttaacaacc taggtatgat aatcaattca gataaatgtg 3900 caactataat ttttactaat aagttccatc ctaacacaaa tatctcaatt aacaataatc 3960 ctatcgaagt aaaggctttc cataaatttt taggtatcca gattgatcat aaactaaact 4020 ataaaacaca cattgacgcc actgtagtta aggctaaaaa gaagataaac attttgaaaa 4080 tggtaggaaa aagaaaaaat ggagcacatc cagaaacaat gttaaaagta gctaaagcta 4140 tagtacaacc acatttggat tacggactat cgataattgc taaagcatca aaaacagttt 4200 ttgctaaact agaaacagta caacacttat atataagaac ctgtatgaga tttctgagat 4260 cgactcccaa ccatgtgata ctagcagaaa gtggcatatt acccttgaag gagagggcag 4320 aatacttaac gttaaaagaa ataattaaaa cattatacta tcaaaactct ccattgacta 4380 aattcataaa atctttcatt acatcagaag atttaccaac tcaatgttct tatttagaaa 4440 ggatagcatc aatcaacaac taccatttaa ttcaactagg acctaaaata ccatttgaaa 4500 aaacaacaaa attaaaggtt tttacagaga tcgataattt gaagaagaaa gaaacaacaa 4560 tagctgttca aagacagatg acaatagaac taatatcaca taaatacaaa aacgattaca 4620 aaatatttac agatggttcc aaaatagatg ggaaagcagg atacggcatt tatgatagta 4680 gagagaaaat atcatatagt ggaagattaa aaacacaatt tacaataatg agtgcagaaa 4740 tatttgctat tttgaaagca gtagaatact taatgacaaa aaatgtaaaa aatgcggtta 4800 ttttaacaga ttccaaaagt gctgccattc tcattaaaaa taacacaaca aacgacaact 4860 ttctagttgg aaaactgatt agagaaataa acaaatctca gatagagaag ttaacaatac 4920 aatggatacc gggacacgca gggttaatag gcaacgaaag agcagattta gcagcaaaat 4980 tagggacaaa taaacagcag attgaaaatt atccgttaac taaagatgat ttaattttga 5040 atttaaaaat agaaacaaaa tgttattgga accaacgata taaaataatt tcagaggaaa 5100 aaggtattta tcactacatg attgatcaaa atgttcgaag taagccatgg ttttcagatt 5160 tcaaactttc aacacaacac acaataacta taaatcgact aaggactgga catttggcca 5220 ctaaagacag actaaacgca tggggactcg ttccgaacga caaatgtgaa cactgtaatg 5280 ttaaagaaga tatcatccac attttacact actgctccaa gtttgaccat attcgtcgcg 5340 aatatccatt actgcaaaat aaggaagact taataaccat tctagctggc acaaacttta 5400 aaaaaatcaa ggaaatagct aaatttattg gagaaactgg aactaatgtt tgagttgggg 5460 aactgcgtta tggtaatttt tactatgatg tgacgtttac atagtgcacc cggcactgaa 5520 attgaagaaa aattagcttg aaacggtcgt ttagttaaga cagtgacgag ttgtagtcac 5580 acatacaaga cttggttgtt atggaccaga aggtctgcac caaaaagcga gaaaagaaaa 5640 aaaaaaa 5647 // ID Gypsy-5_BM-LTR repbase; DNA; INV; 270 BP. XX AC nscaf3063; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_BM_; KW Gypsy-5_BM-I; Gypsy-5_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 986-986 (2010). XX DR Genome; nscaf3063; Positions 956584 956315. XX SQ Sequence 270 BP; 79 A; 57 C; 52 G; 82 T; 0 other; tgtaggaacg atacctgtgt gctcagtact cgtattgtaa aatacaaaat cgcttgcgct 60 cggtcgcagc gactagtcga gccacgatct gacggcgaac gcacgcgcaa tttcaactcg 120 gccaccacgt ctcaatatca cgcgatctca tgtttaagtt ttattttagt acacccggaa 180 attttctgtt ttgatttgtt ataacattta aataaactat tgtgaagtga aaagacttta 240 attcgaacta taaagagctt gtgaacttca 270 // ID Gypsy-7_IS-I repbase; DNA; INV; 4095 BP. XX AC ABJB010066853; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_IS_; KW Gypsy-7_IS-LTR; Gypsy-7_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-4095 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010066853; Positions 28905 24811. XX CC Positions [3124-3558] - Integrase core CC 'AAGT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 1270..3558 FT /product="Gypsy-7_IS-I_1p" FT /translation="MRRNMLFDTDTQLSVQVLLSKVTPLGPSLPRAEMDTP FT WKNVVNEFPRVFRPPVPDESVKHNVTHHIDTKGPPVFAGPRRLTTDRFRIA FT KDEFDHMLELGYIRPSSSPWSSPLHMVPKKDGNWRPCGDYRALNKATVPDR FT YPIPHIQDLGGMVEGDTIFSKVDLVKAYHQIPIEPTDIAKTAIVTPFGLFE FT FLRLPFGLRNAAQIFQRFVDQLLRGWTFTFAYIDAILVFSRTPEEHANHLW FT QLFSRLQGYGLVVNATKCDFGAEEIRFLGHHVDSRVIRTLPETVDIIKNFP FT TPTTTTKLREFLGLSNFYRRFIPHFADKSQPLFAMLQGNETKTSLLQWNEA FT AHHAFEAVKNAIANATLLIHPKRYAPISIMVDASDGAVGAVLQQQVDDSWN FT PLAFFSRKLSPTEVRYSTFGTERLAIYLANKHFRHFVEGQNFFVLTDHKPL FT TFALTSNSTAQTPREVRQMAFISEFTTDIRFVKGTENAAADALSRIGVNRC FT SFEPHLIPYGELAAAQRADAELQKLRTTSATLVFEDVRMHDSDDTVACDMS FT TGQPRPFISAHLRRTIFDSPHGLSYPGIRATQRLIVPRFIWPRINAVVRAW FT TKTCVQCQRAKIHRHTITPSGAFNPPDARFAQIHVDIVGPLPPSHEYRFIL FT TAIDRFTRWPEVAPMRDITTQTVAQTLIEMWVSRFGVPQTITNDQGRQYES FT SLFRSLSHFLGVKHIHTLAYHPISNGLVERFHRQLKAAIKAQPNSLAWTIS FT CARHTNSFQNGY" XX SQ Sequence 4095 BP; 1045 A; 1185 C; 958 G; 907 T; 0 other; aattggttac ccggtaacaa gagacgactg ccctgagcca accacaccgt tccagaacct 60 tggtttttgg tgagctcagc tttctacttg cattattagt caaggcgatg acaccggctc 120 tgctcagacc ccgtcaatct tgggggcgac gacaacgacc attaagtttc ccgagttctg 180 gtcggccgac ttcgaactat gatttctgac aacagaatcc attttccgaa agcatagagt 240 tgcgtcgtcc tcgacgaagt ttgattacgt ggtttcggct cttccgcaag caacggcggc 300 catggtgcga gatatacggc gctcacctcc agacgatcaa ccttacgaga aactacgaga 360 ggagctgata cgacgcacaa cggaatccga accgcgccga ctccaacagc ttttgacttt 420 cgagaagttg accgatcgaa agcctaccga atttcttcgg cgaatagaac agctcctcgg 480 agacaaggca acaacaattg acacagcaat tttcagggag ctgttccttc aatgccttcc 540 agcccaagtg cgcatgatct tgagcgcaac ggcgacagac tggatcgcca ctctggctcg 600 tatggcggac cagattatcg atgaaggtgg gtctagcatt tctgcactgg aagcacatca 660 gccccgccac ctacgccatc tgaggttgga gctcaactca ctcaaatcat ggactttaaa 720 aagcgcatgg aggtcggggt ggaccaactg agtcaagagc tccgcaggct acgcgtacga 780 ccgtcgcctt ctccgaccgc gaaagagaat gtccgctgcc accagaacta cgcacgccac 840 gagtccaccg actctgaatc agaggtattc tggtaccacc gacgctatgg acaacacgca 900 ggacaatgtc gcccgccatg tacttacccg ggaaacggca tgggcacgcg ctgaaggccg 960 ccatcgtcga gagccccacc ccaggtcgtc tattttacgt caccgaacgc cgaacgaagg 1020 ttcgtttcct gataagacac tggtgcagag gtaagcgttg tgcccccacc gatggaggaa 1080 cgtcggcaac gccaagaatc tccgccgctc caagcagtaa acgggtcttc attcaaggcg 1140 tgcggagaca agtcgctgac actggacatc ggactccgca ggacgttccg ctggcttttc 1200 accatcgccg acgtccaaca agccgtaatt ggagcagact ccttgtgaaa gttcgccctc 1260 acagttgaca tgagaaggaa catgctcttc gacaccgaca cccagcttag cgtccaagtt 1320 ctcttgtcaa aggtgacgcc tcttggtccg tctctccccc gagcggaaat ggacactccg 1380 tggaagaacg ttgtcaacga gtttccccga gtcttccgcc cgccggtccc tgatgaaagc 1440 gtgaagcaca acgtcacaca tcacattgat accaagggcc ccccggtttt cgcaggacca 1500 cgacgtttga caaccgaccg ttttcggatt gccaaggatg agtttgacca catgctcgaa 1560 cttgggtata tacggccttc ttcgagtccg tggtcttccc cactccacat ggtaccgaaa 1620 aaagacggaa actggcgacc ctgtggggac tatcgagcac taaataaggc tacagtgcca 1680 gatcgctacc cgatcccaca tattcaagac ctcggcggca tggtagaagg ggacactatt 1740 ttcagcaaag ttgatttggt taaagcttac caccaaatcc ccatcgaacc tacagacatt 1800 gccaagactg cgatcgtcac accatttggt ttgttcgagt tcttgagact gccgtttgga 1860 cttcgaaacg ccgcgcaaat cttccaacgt tttgttgatc agcttcttcg cggctggaca 1920 tttacattcg cttacatcga tgccatcttg gtgttcagtc gcacaccgga agaacacgcc 1980 aaccatcttt ggcagctctt ttctcggttg caaggatacg gactcgtcgt caatgcgaca 2040 aaatgtgact tcggtgccga ggaaatacgt tttcttggac accacgtaga tagccgcgta 2100 atccgaacac tgcctgagac ggtcgacatc atcaaaaact ttccaacacc gacgacgaca 2160 acgaagctcc gtgagttctt ggggctctct aatttttatc gtcgattcat cccacatttc 2220 gctgataaaa gtcaaccact tttcgcaatg ctgcaaggaa acgagacgaa aacgtcgcta 2280 ctgcagtgga atgaggctgc ccatcatgcc ttcgaggcag ttaagaatgc cattgccaat 2340 gccacgctcc taatacaccc caagcgatac gcacccatct ccattatggt tgacgcctct 2400 gatggtgctg tgggagctgt tctacaacag caggtggacg atagctggaa tcccctggcg 2460 ttcttctcaa gaaaactttc accaacagaa gttcgctaca gcacctttgg gacggagcgg 2520 ctggctatat acctcgcaaa caagcacttc cggcattttg ttgaaggcca aaattttttt 2580 gtcctcaccg accataaacc actgacattt gcactaacat ccaacagcac agcacagact 2640 ccgcgagagg tgcggcagat ggctttcatc tcagaattca cgaccgacat ccgcttcgtc 2700 aagggcacag aaaatgctgc ggctgatgcg ctctccagga tcggtgtgaa tcgatgctcc 2760 ttcgaacctc atctgattcc ctatggggag cttgcagcag cgcaacgcgc tgatgccgaa 2820 ctccaaaaac tccgaacaac ttcagcaaca cttgtcttcg aggatgtgcg gatgcatgac 2880 tctgacgaca cagtagcatg cgacatgtcg acaggccaac ctcgaccatt tatatccgcc 2940 catttacgcc ggacaatctt tgattcccct catggcctat cctacccagg aataagagcc 3000 acgcaacgtc tcattgtgcc aagattcatc tggccgcgca tcaacgctgt tgtgcgggca 3060 tggaccaaga catgtgtgca atgccagcgt gcaaagatac accggcatac tatcacacca 3120 tcaggcgcgt tcaacccacc agatgcccgc ttcgcccaaa ttcatgttga cattgtagga 3180 ccgcttcccc catcgcatga ataccgcttc atcttgactg ccatagatcg ttttacgcgg 3240 tggcccgagg tcgctccgat gagagacata acaacacaaa cagtggcaca aacgttaatt 3300 gaaatgtggg taagtcgatt tggcgttccc cagaccatca ccaacgacca aggacgacag 3360 tacgaatcat cactgtttcg aagcctttcc cattttctcg gagtcaagca catccacacc 3420 ctagcttatc atccgatatc taatggcttg gtagaacgat tccatcgtca gctcaaagca 3480 gcaatcaagg ctcagccaaa cagccttgcg tggaccatta gttgtgctcg gcatacgaac 3540 agctttcaaa acggatatta actgttgcac ggccgagttg gtgtacggta cgtcacttcg 3600 tctgccatta gaattgttca accaatcaag cgacaccgaa attgagagtg ccgaaaacta 3660 tgtgaaacga ctccgtatcc tcatgcagga cctccgacct acctacacaa gcgtaccgta 3720 gacgcgttac acgtccgttc tggctgatgt tctgctctgt acacacgttt ttctcagaaa 3780 cgccggagtc aagaaacccc tacagcccac atacgatgga ccgtttccgg ttctgagacg 3840 cagtgacaaa acggtgacac ttgtggtaaa aggcaatgag aaagtcgttt cgctagacag 3900 agttaaggct gcccaccttg acttcgattg ttgtatacaa gatgcaacac cgttccgaag 3960 acctcgacaa gctcagttcc agccaggcaa tccatgtaac ttcagtcggg gcacagcctg 4020 ggaaacctca cgacactggc ttccgcaggg acgtaaacgc tcgtcacctc cgcctcctca 4080 ctagggggga gtact 4095 // ID BEL-54_AA-LTR repbase; DNA; INV; 396 BP. XX AC supercont1.270; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-54_AA_; KW BEL-54_AA-I; BEL-54_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-396 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.270; Positions 659565 659960. XX SQ Sequence 396 BP; 100 A; 90 C; 84 G; 122 T; 0 other; tgttggattt cgatgttatt tgccgtatat atagttagcc aacaccttgt ggagagattt 60 ctctcgtttg aaagtagctc tcattttctc tccccagctt cagatgctgt gcggtatcta 120 gctcatagat tgtaactacg ctcgtcagtt gcactgttag ccatcaaccc gatcagttat 180 aatatagaag tgtttgaaga acaactcgct ttcaatctcg ctttcgattt attgtgtagt 240 gcgtcgtatc acatttgctc tggcttggca gaatcagcct tccgcgaaaa ttaaaaccaa 300 gccggagcgt aaccactgag gatttgatac cccggaaagc ttgactgttt cttagtgtga 360 gatacagata aaggtgagcc cagacctgcc ccaaca 396 // ID piggyBac-22_SM repbase; DNA; INV; 2408 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-22_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2408 RA Jurka J.; RT "Families of autonomous piggyBac elements from planaria."; RL Repbase Reports 9(8), 1832-1832 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 388..1785 FT /product="piggyBac-22_SM_1p" FT /translation="MFISLKMSKKWDLTNPLHLEKAMEYLFSLPEEDELSD FT VEIDDGAREELDELNTTTTSRFLESVNEEMVETEAEMDLEMDIPAPIEPEM FT NEDVQDWNYDCSFFQCDAETNFRQNASFDDNFSNYKAIDFFTQLFDQCMIE FT NVVSETNRYARQKQSSMWVDTSQEEIKAFVGSLIIMGIHRLPHLEDYWSSD FT PFLSVPAVATVMTKNRFKKLIENIHFNDNDKNVDRSDPNRDKLHKMRPLID FT KLNENCNKIITPSSFLSIDESMIPFKGRSSLKQYMPMKPTKRGYKVWCMAD FT STLGYIVKFDIYTGKSVAPLCKETTLGERTVLNLTKDISSRGTHVVVFDNF FT FTSSKLLLKLMSKGIFSCGTTRITRKDVPPIFKENKKILRGEYQYQTRGQL FT AAVRWMDNKPVHFLSTIYNPRDTTTVLRKNKNGQRETVSCPLVVSKYNKIM FT GGVDLFDQYRERSIVLVGDL*" XX SQ Sequence 2408 BP; 840 A; 355 C; 444 G; 768 T; 1 other; ccctttcgtg cccagtggga atacgtattc ccatttttgt tttgccctga taagaccgat 60 ggaatacctt ttcccgatat ttactactac tgccatctag tatcgatcat tagctaatcg 120 gtaaaacagg aatctgtttt atcagacaca ctttgatttg gcggttagtt atcacacgtg 180 gcttgttgtt gtggtcgtgt aaaagtgggc gaaattagtg tttttttgac aaaattcttc 240 attgtatgtc tcaataagga ttttaacggt tgattttagt tccagagaca ttcttttaca 300 cattgataca taaatacgtg cttatagtgt ttaattttga aagtaagtca tattttatat 360 atcatttcat aaatcaattt ataaaccatg tttatttctt tgaagatgtc taaaaaatgg 420 gatttgacta accccttaca tttggaaaag gcaatggaat atttattttc acttcctgaa 480 gaggatgaac tgtcggatgt tgaaattgat gatggtgccc gagaggaact agatgaactg 540 aatactacaa caacttcacg gtttctagaa tcagtgaatg aggaaatggt tgaaacagaa 600 gctgaaatgg atttagaaat ggatatacca gctccaattg aaccagaaat gaacgaggat 660 gttcaagatt ggaactatga ctgcagtttt tttcagtgtg atgctgaaac caatttcaga 720 caaaatgcta gttttgatga taattttagt aattataaag ctattgattt ttttactcaa 780 ttgtttgatc aatgtatgat cgaaaacgtg gtctcagaga ccaacagata tgctcgtcaa 840 aagcaatcat ccatgtgggt cgacacctca caagaagaaa taaaagcatt tgtaggatcg 900 ctgataatta tgggnattca taggttacca cacctcgaag attattggag ttcagaccca 960 tttctaagtg taccagcagt cgctacagtt atgacaaaaa acagattcaa aaagttgatc 1020 gaaaatattc atttcaacga caatgataaa aatgtagatc gtagtgatcc taatcgcgat 1080 aaattgcata aaatgagacc tcttatagac aaactaaatg aaaactgtaa taaaattata 1140 acgccatcat catttttgtc tatagatgaa tcgatgatac ctttcaaagg tcgttcatct 1200 ctgaagcaat atatgccgat gaagccaacc aaaagaggct ataaagtatg gtgtatggca 1260 gattcaactc taggatatat tgtaaagttt gatatataca cgggaaaaag tgtcgctcct 1320 ttatgtaagg aaactacatt aggcgaacga actgttctaa atttgactaa agacatatca 1380 agtcgtggta cgcatgtcgt agtctttgac aattttttca ctagttcaaa attgttgtta 1440 aaattaatgt caaagggtat attttcttgt ggtactacaa gaattactag gaaagatgtg 1500 cctccaatat ttaaagaaaa taaaaaaata ctacgaggag aatatcaata tcaaacaagg 1560 ggacaacttg cggcagtccg ttggatggat aataaaccag ttcatttttt atcgacaata 1620 tataatcctc gtgataccac cactgtgcta agaaaaaata aaaatggtca aagagaaacc 1680 gtttcttgtc cactagtggt ttcaaagtat aacaaaatta tgggcggagt ggatttattt 1740 gaccaatacc gtgaacgtag tattgtattg gtaggcgatc tgtaaagtgg tggcatcgca 1800 ttatgtacta cttgatagat atttccattg taaacggatt tttaatgtat aaaaataaaa 1860 agcaaaataa agaagatcaa ctgtcattta ggatctcttt agcaagacaa ttgatcggaa 1920 aattttcttc aagaaaacgt cgtggccgac caacacaatt cctagctaat aaaggtcgtg 1980 taccagatga tgtacgttta gaaggtgtag gtaaacacat gcctttgaag ggtgacacat 2040 atagaagatg tagactatgc agtacaaaag gtcaagagaa acgaactaag tgtatgtgta 2100 agggctgtaa tgtgccatta tgcatttctc catgttttga tatatttcat aagaaataat 2160 ttttattgta tttttttaat ttatttcaat tgtaattaac tgtcgtaatt tcaatatttt 2220 tttatttttt attttaagtc atgtatggcc gtaaagacca gtgggaagta tagatcccag 2280 tttttttaca taaaaaaata aattaaacaa aaaatatttt gtgcatataa taaaatatat 2340 attaatacat aagaatttaa aataaaaagg ttaaaaatcc caatataaat aatttgggca 2400 cgaaaggg 2408 // ID hAT-60_HM repbase; DNA; INV; 3293 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 24-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-60_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3293 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2048-2048 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 913..2883 FT /product="hAT-60_HM_1p" FT /translation="MSCYSNNSCANLNDTFKKMFPDSKVAEDFTMSTSKVS FT YIVNHGLAPYFKTLLKEEITKSDCYIVSFDESLNDITQTCQMDLLVRYWDG FT NDKVKVRYWTSAFLGHSAASDLLTHFNENLSRFDSSKMYQVSMDGPSTNLK FT FLDDLNKHRKDNEMPQLINIGSCSLHVIHGAFKTAIESTTWNIKETLKGCW FT QILHDSPARRQDYETVTGATKYPLFFCGTRWVESKSVADRCIEIWPNICKL FT IEFWEKLPSSKQPKSKSFLNVKKAVKDKLIILKFEVFSFVASILEPYLLAY FT QTDKPMIPFMYSDLERLLRTILSLFVKQDVIDNSCISVQLKEVDFHKKDNL FT LKGNQIKLGFAAETRLAALRKKDIITIADIQSFHKDCTNMYVRLIEKFMDR FT TPLGSIIVRNSQVLNPNAMVVMTQEDAEKKYKSLLTHLISLKYVSPIFSDK FT AINQFADFFRESRVYQDIFHEFNRERDSLDDFYFKKFNLNKYKEFALTVKI FT LLTLSHGQASVERGFSINATVLEQNLNEKSITARRLVKDHMLSNNLQPHSI FT EITSKMIISVKSAHERYRSYLNSIADTNKKEASQLAKKVVSDEIKDVETRR FT DQLKKTSEMLQFDFVKFVEEAEEKQDLGLISKANAMKQKANEKNEEIKKLE FT LALKQRN*" XX SQ Sequence 3293 BP; 1232 A; 479 C; 554 G; 1028 T; 0 other; tacagggttc atggcactca gggaaaacct ggaaaaccag ggaaaagttt taaaaattta 60 gaaaatcagg gaatacctgg aaaagtcagg gaatatttgt taaaaatctg taaaatcagg 120 gaaaactcag ggaatatttt tcagttttat atttagcttg aagttatttt gcaaaataca 180 cattttgagt gtttattgtt gttgaaaatt gctcactaaa gtggtttttt aacttttata 240 ttatatagta actatatgtt tagctaaatg tataaataaa tataataata taagacatga 300 cttcagaaaa gacttatttt caagaggatt ggttacaaca agatatttat aaagattggt 360 tagttcccga cgcacatgca aaaactaaag ctcattgcaa aaaatgtaaa aaatcatttg 420 aattgtccaa tatgggtatt caggctgtta aaagtcatgc agcaggtatt tacattaact 480 tttttaatgt gagagtgaga attaaaaaga aatgttgctt gattgattca ttattttata 540 ttttaatatc agtttcaaac acattgtaat caacttttga ttgcaacctg tttcagttgt 600 tttccttgtt ggttggttgc ttatattgtt atttcaaact ttgcaagcta aacagattga 660 gaaaaaagat aaaaaatttt agacatttta aatgtttcat tatttattta tcttgtttta 720 ggtaagaagc atcagtctgc tacaaagcct gttagttgtt tctttattaa tggaaaagag 780 ttgcatgctg aaccacttgg aagtgaagaa aaaatatcca catttttcac caaaaaacaa 840 gcaactattg tctctttaat taaatcatcg ttatcaacta atgctgagat catatgggcc 900 ctgaaatcta caatgagctg ctactctaat aactcatgtg caaatttaaa cgacactttt 960 aaaaagatgt ttcctgatag caaggtagca gaagatttca ctatgagcac ctcaaaagta 1020 tcatacattg ttaaccatgg actagcccca tacttcaaga cacttttaaa agaagaaatc 1080 acaaagtctg attgttacat tgtatctttt gatgagagtt taaacgacat aacccaaact 1140 tgtcagatgg atctacttgt tcggtattgg gatggcaatg acaaagttaa agtgagatat 1200 tggacatcag catttcttgg tcattctgca gcttctgacc ttcttacaca tttcaacgaa 1260 aacttatcaa gatttgattc aagtaaaatg tatcaagttt ccatggatgg accctctaca 1320 aatctgaaat ttctagatga tcttaataaa cacagaaagg acaacgagat gccacaactt 1380 ataaatattg gaagttgctc tttgcatgta atccatggtg ctttcaaaac agctatcgaa 1440 tcaacaacat ggaatattaa agaaacttta aaagggtgtt ggcaaattct tcatgatagc 1500 ccagctagac gtcaagatta tgaaacagtc acaggtgcaa caaaataccc tctttttttt 1560 tgcggaacaa gatgggttga aagtaaatct gttgcagatc gttgcattga aatctggcct 1620 aatatatgta agttgattga gttttgggaa aagttaccat cttcgaaaca acctaaatca 1680 aaaagctttt taaatgtaaa aaaggctgtt aaagacaaac ttattatttt aaaatttgaa 1740 gttttcagct ttgttgcaag catattagag ccatatttgt tggcatacca aactgacaaa 1800 ccaatgattc cattcatgta tagtgactta gaaagactgt tacgaaccat actgagcttg 1860 tttgtgaaac aagatgttat tgataatagt tgtatttcag tccaactaaa agaagttgat 1920 tttcataaaa aagataatct acttaaagga aatcaaatca aactagggtt tgcagctgaa 1980 acaagactgg ctgctttacg aaagaaagat ataataacca ttgctgatat tcaaagtttt 2040 cacaaagatt gcactaacat gtatgttcga ttgattgaaa agttcatgga ccgaactcca 2100 ctaggatcta tcattgtacg caactctcag gttcttaacc caaatgcaat ggtagtaatg 2160 actcaagaag atgctgagaa gaagtataaa tctttattga cccacttgat tagcttgaaa 2220 tatgtttcac caatcttttc tgataaagct ataaatcaat ttgcagattt ctttagagaa 2280 tcaagggttt atcaagacat ctttcatgaa ttcaatcgag aaagagatag cttagacgac 2340 ttctatttta aaaaatttaa cctaaataaa tataaagagt ttgcattgac tgttaaaata 2400 ttattaacac taagccatgg acaagcctca gttgagagag gttttagtat aaatgccaca 2460 gtactagaac agaatttaaa tgaaaaatca ataactgcaa gacgtttagt caaagatcat 2520 atgctatcaa ataatcttca gccacatagt attgaaatta caagcaaaat gatcatcagc 2580 gtaaaatcgg cacacgaaag atatagaagc tacctaaatt caatcgcaga cactaacaaa 2640 aaagaagcaa gtcaattggc aaaaaaagtt gttagcgatg aaataaaaga tgttgaaacg 2700 aggcgagatc aactgaagaa aaccagtgaa atgttgcagt ttgattttgt aaaatttgtt 2760 gaggaagcag aggagaagca agatctaggg ttaatttcta aagctaatgc gatgaaacaa 2820 aaagcaaatg aaaaaaatga agaaataaag aaattagagc tagcacttaa gcaaagaaat 2880 tagacgcatc ttagagcaaa aaagaaagtt gctaaaataa tatattcgtt ctcacatgaa 2940 gtttatattt tgttaacaaa acacattagt tttttcaaaa catatataaa ttgttatttt 3000 tttaatatat ttcttttagt tatcagatta atatggctct gtttcttgtt taaaacacct 3060 ttaacggatt gctataaatt aattaaagac agtctctgat ttcttaaaaa atagtgataa 3120 aaagtattta gggaacaagg attcagttgc aaaagtatta tatttttcaa cattgatagg 3180 caaaatattt agacttttca ccgcacgaac tcagggaaaa aatacttgaa aatcagggaa 3240 aatcagggaa aactcaggga aaatggtttt tcaaaactgc catgaaccct gta 3293 // ID BEL-604_AA-I repbase; DNA; INV; 5868 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-604_AA_; KW BEL-604_AA-LTR; Pao_Bel_Ele53; BEL-604_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5868 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4882-5451] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 32..1549 FT /product="BEL-604_AA-I_2p" FT /translation="MAHKQPLNESYCLCESCHQSPSADRGMVACDECGKWF FT HYMCVNVDESIKDHPWCCEACLLIRSRRHRSVSSTSSRRRFRELELKRLDE FT EYALQQEYMRKRYEVMASCLEDEEEDNSQDGSSVNSSVSRTRKWINSQREA FT HGNFGAGQACSLPSIPNVGPSYVVGHGESYEFTPFGRPESRVPTRKNTVTA FT INQFEANVSRHVPSLPEPLTSVSSPHGLGRSQTLNAASYRGGSSIIPALWQ FT IVSPGLAPTEHPNVRNATAQSSMPNNQPLVSREQQSTMIAPIAGPSPAQIS FT ARQVWPKRLPAFSGVAEEWPIFISSFENSNAACGFSDVENLIRLQESLKGP FT ALEIVRSRLVLPANVPKVIDTLRRIYGKPEVLIRSLIAKVRKIDPPKSERL FT DTIIKFGIAVQQLCDHLDAANQQDHLRNPVLLEEMVEKLPASFKLEWVRFK FT REEQQVSIRTFGNFMEKLLDEASEVTLLNPSSETTTKQESKSSAKPKGYVH FT THSESEAVPN" FT CDS 1822..4308 FT /product="BEL-604_AA-I_1p" FT /translation="MVHQRISKSSVVFRVVPVILINGSKEVATFAFLDEGS FT SKTLLESSMARKLGLKGAVRPLDLKWTSGVTRTEKSSMNVRLTVSGKGSGE FT RFLLDEVHTVERMNLPKQSVNFDDLVSRFKHLRGLFVENQADAEPTILIGS FT DNLDLMAPIESRIGKKGEPVGLRCKLGWTVSGPVSSNASEEYCGVHQCDVS FT MEQSLKNYFTLEEAGIACPYRVESDEDRRARGILEKTTRRIGNRFETGLLW FT RTDEIRMPDSYAMAFRRMQNLEQRFKRNPDMQEKVARAIEDFVEKGYAHKA FT STDELRRSQSGQEWFLPLNVVVNPKKPGKIRLTLDAAAKAGGTSLNDMLLK FT GPDMLVSLLDIINHFRGRRIGFGGDVKEMFLQIRMRQPDTRFQKFLFRQDP FT SKNPDVYVMDVVIFGAKCSPCNAHFAKDTNAREHASQFPAAADAIINRHYV FT DDYFDSADSEDEAIQRAREVREVHSLGGFEIRNWVSNVPKVLEALGEEGKP FT TKTLCYDKTLESERVLGLLWVPEEDVFTFSMGLKEKLIPYLVNKVRPTKRI FT ILQCVMSFYDPVGFLSPLLVHGKIIIQETWRSGTDWDEPVTDEIFNRWLSW FT TRILPAIETVKVQRCFFGDESINNITSVQLHIFSDAGDDAYGCVAYLRYRV FT AGEVRCTLVGSKAKVAPLKLTSTPRLELQAAVIGARMMTTLLASLPVSIQE FT QFLWVDSMTVLSWIRSDSRKYKPYVAHRIAEILHSTDINSWRYVPSKKNVA FT DDMTKWGPGTMMDADGRWFNGPPFLYQAEELWPEQPKTRATVTEEIRPCYL FT LHHVGITQPIIDASRISNWKILLRTVA" XX SQ Sequence 5868 BP; 1597 A; 1305 C; 1596 G; 1343 T; 27 other; taatctcaaa gattaatttc ggtagctcac gatggctcat aaacaaccgc tcaacgaaag 60 ctattgccta tgcgaatcgt gtcatcaatc accttcggct gatcgtggta tggtagcatg 120 cgacgaatgt ggaaagtggt tccactacat gtgtgtcaac gtcgatgaat cgataaagga 180 ccacccctgg tgttgtgaag catgcttgct gatcagatca cgacgacatc gtagcgttag 240 ctccacatct tcaaggcgca gattcaggga actcgagctg aaacggctag acgaggagta 300 cgccttacag caggaataca tgcgaaaacg gtatgaggtg atggcatcgt gcttggagga 360 cgaggaagag gataatagtc aggatggtag ctccgtaaac agcagcgtta gtaggacgag 420 aaagtggata aacagtcagc gagaggctca cggaaatttt ggtgccggtc aagcctgtag 480 tttgccttcg atcccgaatg taggtcccag ctacgttgta gggcacggag agtcctatga 540 attcacgccc tttggtcgtc cggaatcgag agtgccaaca cgaaagaaca ccgttactgc 600 cataaatcag ttcgaagcga atgtatcacg tcacgttcca agcttaccgg aacctttgac 660 ctcggtgtcg agcccacacg ggctggggcg tagtcaaacg ctaaatgcag catcgtaccg 720 aggaggctcg tcgattatcc cagctctgtg gcaaattgta tcgccgggtt tggcgccaac 780 ggagcatcct aacgttcgaa atgctactgc tcaatcgtct atgccgaaca atcagccgtt 840 agtgtcgcga gaacagcaat ccacaatgat cgcaccgata gctggcccaa gtccagcgca 900 aatttcagcg agacaagtgt ggcctaagcg gctgccagcg ttttctggag tggcagagga 960 gtggcccatt ttcattagtt cgtttgagaa ttcgaacgca gcgtgtggat tttcggatgt 1020 agaaaatctc ataaggcttc aagaaagtct aaaaggtcct gcactggaga tcgtccgtag 1080 cagactggtg ctccctgcaa acgtcccgaa agtcatcgat actttacgtc ggatttatgg 1140 gaaaccagag gttctcattc gctcgctgat tgcaaaggtt cgaaagatag atccaccgaa 1200 atccgaacga ttggatacga tcatcaagtt tgggatcgcc gtacaacagt tgtgcgacca 1260 cttggatgcc gccaaccaac aagaccatct tcgaaatccg gtgctgctgg aggaaatggt 1320 ggaaaagctt cccgccagct tcaaattgga gtgggtccgt ttcaagagag aggaacagca 1380 agtttcgatc cgtacattcg gaaactttat ggaaaagtta ttggatgaag ctagcgaagt 1440 gacactgtta aatccgtcat cggagactac aacaaagcaa gaaagtaaat cgtctgcgaa 1500 accgaaaggg tatgtgcaca cgcacagtga atcggaagct gttccaaats ctggtacgaa 1560 aaccgaagct gttaaatcgc acgatgataa gccgtgcgca acatgttgca aggtcgggca 1620 tcgcgcgcga aactgcgaag agttcaaacg attgacggtc aaagagcgtc ttcaattggt 1680 gaaggaccga caactgtgcg ccgtctgctt atacaatcac ggaaacatgc gttgtcgtaa 1740 caaaatgcag tgcaacgttc gggttgtaat ggcaggcacc acgctctact gcatcaggta 1800 gaaccccaag agtcgagctg catggtacat cagcgaatca gcaaatcctc tgtagtcttc 1860 cgcgtagtac cggtcattct cattaacggg tcgaaggaag tggcaacctt cgcctttctc 1920 gacgagggtt catccaaaac gctattggaa agcagcatgg cgaggaaatt gggactcaaa 1980 ggagcggtca ggccattgga tctgaaatgg acgtcagggg ttaccaggac tgaaaaatca 2040 tcgatgaacg tacgtctgac ggtgtccgga aaaggatcgg gtgaacgatt tttgctggat 2100 gaagtacaca ccgttgaacg gatgaatcta ccgaaacaaa gcgtaaattt tgacgatttg 2160 gtttcaaggt tcaaacacct ccggggcttg ttcgtcgaga atcaagcgga cgcagaaccc 2220 acgattttga ttggttcaga taatctcgat ttgatggcgc caattgaatc tcgcatagga 2280 aagaaaggag aaccagtcgg tctgcgttgc aaacttggat ggacagttag tggaccagtg 2340 agctcaaatg cctcagaaga atactgtgga gtgcaccaat gcgatgtttc gatggaacag 2400 tcgctgaaaa actattttac gttggaagaa gccggaattg cttgtccgta tcgcgtggaa 2460 tccgacgaag atcgaagagc tcgcggtatt ttggagaaaa ctactaggcg catcggaaat 2520 cggtttgaaa ccggcctttt gtggcgaacc gatgaaattc gaatgccgga tagttacgcc 2580 atggccttcc ggcgtatgca gaacttagag cagcgcttca aaaggaatcc tgatatgcag 2640 gagaaggtag cccgtgccat cgaagatttc gtcgagaagg gctatgctca taaagcgtca 2700 actgatgagc ttcgtcgcag ccaatcagga caggagtggt tcttgccatt gaacgtagtt 2760 gtgaacccga agaaaccggg gaagattcgg ttgacgttgg acgcagccgc caaagcaggc 2820 ggaacatctc tcaatgacat gctgcttaag ggtccagaca tgttggtctc tttactggat 2880 ataatcaacc acttcagagg acgacgaatc ggatttggag gcgatgtgaa ggaaatgttt 2940 cttcaaatcc gaatgcgcca gccagataca cgattccaga agttcttgtt ccggcaggat 3000 ccatccaaaa atcccgatgt ctacgtcatg gacgtcgtca tcttcggcgc gaagtgttcg 3060 ccgtgcaacg ctcattttgc aaaggacacc aatgctcggg agcatgcaag ccaatttcca 3120 gccgccgcag atgccattat caatcgacac tatgtcgatg actattttga tagtgcggac 3180 agcgaagacg aagccattca gcgtgcacgt gaagtaaggg aagtccattc tctcggcggg 3240 ttcgaaatcc gtaattgggt gagtaacgta ccaaaagtac tcgaagcgct gggagaggaa 3300 ggtaaaccaa cgaagacact ttgctacgac aaaacgttgg agtctgaacg agtgctgggg 3360 ttgctttggg tgcctgaaga agacgtgttc acgtttagta tggggctgaa ggaaaagttg 3420 atcccctatc ttgtgaacaa agtacgacct accaagcgca tcatcttaca gtgtgtcatg 3480 agcttctatg accctgtagg attcctttct cctttgctcg tgcatggcaa aattattatc 3540 caggaaacct ggagaagtgg aacggactgg gatgaaccag taacggatga aattttcaat 3600 aggtggctaa gctggacaag aattctccca gccatcgaaa ctgtgaaggt tcaacggtgt 3660 ttcttcggcg acgaatcaat caacaacata acttcggttc agctgcacat tttctcagat 3720 gccggtgacg acgcctacgg ttgcgtggca tacctgcggt atcgagtagc gggagaagtt 3780 cgctgcactc tggtgggctc caaagcgaaa gttgcgccac tgaagctgac ctcaacaccc 3840 cgattagaac tgcaagccgc agtgattggg gcgaggatga tgacaactct gctggcttcc 3900 ttacctgttt cgattcagga acaatttctt tgggtagatt ccatgacggt tctctcctgg 3960 attcggtctg acagccgaaa atacaaaccg tacgtggcgc atcgtatagc tgaaattttg 4020 cactcgacag acatcaactc atggaggtac gttccatcga aaaagaatgt tgccgacgac 4080 atgacgaagt ggggaccggg aactatgatg gatgcggatg gaagatggtt taacggaccg 4140 ccttttctat atcaagcsga ggaattgtgg ccagagcaac caaaaactag agcgactgta 4200 accgaggaga tcagaccatg ctatctactc caccacgttg gcatcacgca acctattatt 4260 gatgcctcac gtatttcaaa ctggaagata ctcttgagaa ckgtagcast agtgtatcgc 4320 tttgtttcga attgtaagcg caaacgtkcc ggacaatcta tcgaagcggt aagaagcgcc 4380 aaatccagga agtcgacctt accagccgta ctggtggaaa tacgacagga agagttagcg 4440 aaagcggaac gtctactgtg gaggatcgtt caagccgatg cttttccaga tgaagtgaag 4500 attttgaaaa ggaataggga wcttcctccg gatcagtgga tctccatcga caagtcgagt 4560 maactgtaca aaacatcgcc atttttggat gaagacgacg ttatcagaat ggaagggaga 4620 accgckgcag cggtgttcac cgmgatgtgt actcgattcc ctgttatatt gccgaggaat 4680 catatcatta catcaaagat tttggaggat taccacgttc ggtatgcgca cgggtcmaag 4740 gaaaccgtgg taaacgaagt acgacaggck tatttcattc ckaagctacg tacagctgtc 4800 ggaaaagtca tcaacagctg cctgaaatgc cgacttcgga aaagcaagcc agtggtaccg 4860 cgtatggcgc cactccctgt tcagcgaatg cagcctttcg tgcgtgcctt cagttacgta 4920 gggctcgatt actttgggcc tatcgacgtt accgtaggaa gacgcaagga aaaacgttac 4980 gtagcactgt tcacgtgtct ggtcgtgcga gctgtccatt gtgaggtggc gtatagtcta 5040 agcactgagt cgtgcaagca ggcaatccgk cgattcattc ggcgacgtgg ttctcctgtc 5100 gagatatttt cagacaacgg tacgaatttt cagggggcaa gtagagagct tcgtgatgag 5160 ctggagagaa ttgatcgaga ttgtgcaaat acgtttacgg gtacaaacac gaagtggacc 5220 tttaacccgc cgtctgctcc tcatatgggc ggagtatggg agcgaatggt gaggtccgtt 5280 aaggaagcaa tggcagcatt aagcgatgga cggaggatga ctgatgaaat tctagtaacg 5340 accctggccg aagccgaata tcttgtcaac tctcgtccgt tgacgtattc tggaacggag 5400 gacgctgaac tggatgctat tactccgaac cactttctcc ttggcagcac ttcgggacaa 5460 cacctgccgt tccaagtacc aatmwctgtg gcagaagagc tgaggagcag ttacaagcgt 5520 tckttggcat tggcgaacga attctggamc agatggtgca gagaatactt gccgacattg 5580 aatcaacgga gcaagtggaa cgtwgaaagc agatcgwtma aggcgggaga cctagtcttc 5640 gtcatggatg atggtaaggg tgctgcagga gtccggggca tmgttgaaga ggtttttgct 5700 ggagctgatg ggcgagtgcg acaagctgtg gtwcgaacga atggaggggt cttcaggcga 5760 ccggcckctc ggttagcggt ccttgaaata gmtgataaac ctggcmaagc ttccgaaggg 5820 gaktatgacc agggtttatg ggctggggas tgttcggata agcgggaa 5868 // ID Gypsy-266_AA-LTR repbase; DNA; INV; 195 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-266_AA_; KW Gypsy-266_AA-I; Gypsy-266_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 195 BP; 42 A; 51 C; 46 G; 55 T; 1 other; tgtggtgacc accctatcga gttatgctgg ctgctggctc agccatagca gaaaggaata 60 aaacwattca gtttattact gactacgaac gcgcgcgtac cgtttcgtat actcggtcat 120 atcacacacc gtctattgtg tagtccctat gcgtgcgtgt ctggccggtc tctaccgttt 180 ggtccggtca taaca 195 // ID Gypsy-36_CQ-I repbase; DNA; INV; 4730 BP. XX AC AAWU01012843; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_CQ_; KW Gypsy-36_CQ-LTR; Gypsy-36_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4730 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 451-451 (2011). XX DR GenBank; AAWU01012843; Positions 93386 98115. XX CC Positions [3697-3936] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 516..2102 FT /product="Gypsy-36_CQ-I_2p" FT /translation="MEKWEIPQFKFRSIPNSIIRDEWVKYKRNFQYIAAAN FT EEKNATRLKNVFLAKAGPDVQEVFASLPGADVEEDKAKNIDPFKVAIEKLD FT EYFSPKQHEAFERNTFWNLKPQDEETMEKFLWRTLEVAQKCNFGTTLEESR FT SIAVIDKVILYAPVELKQQLLQKEKLTLDDVTRVVSSYESVKKQSMLMEGK FT SGSTPAYSEHGGGGGQKSELFNVQSSSRGPPTTGECYRCGHKDHYGNDPRC FT PARSVQCTRCKRYGHFSKRCKTEFGRRPTHPQSQNVVPAKRPGGSQNHWDN FT KKPKYNPRHERIAQVEDNTSDPDENTSFVYNISDGDEFIWVKVGGVLIQML FT IDSGSAKNILDDRAWEHLRVQEAVTTNFRTDCDQIFRAYGRESAPLKIVCV FT FEADIAIDGAADRIEGSDTFYVVEGGTQPLLGRTTAKKLGVLKIGLSSVNA FT GEVQERRPFPKMKDIKLIVPVDESVTPVVQNVRRPPIALLTRVEEKLDQLL FT ASEIIEPVKGKKGRRIVLLGLYIDCFITDTRCE" FT CDS 2167..3936 FT /product="Gypsy-36_CQ-I_1p" FT /translation="MRRANEAIKREHHMMPTIEDFLPRLKEAKFFSRLDIR FT ESYYQMELAEQSRNITTFITHKGLFRFTRLMFGISCAPEMFQKHLEQILVG FT CNNTVNYIDDIIVFGATEEEHDQALSEVMKVLKERDVLLNHEKCLFKVQEL FT EFLGHIISVNGIRPTVGKTEALEKFREPRSVEEVRSFLGLVTYGGRFLPNL FT ATITAPLRELLRGDKRFTWTAQHTGSFQKLKEMISDIRTLQFFDNTLRTRI FT IADASPVALGAVLVQFEDLHNINPRIISYASKSLSDTEKRYCQTEKEALAL FT VWAVERFDVYLIGRTFELETDHKPLESIFSPTSKPCARIERWVLRLQSFKY FT RVEYKRGQNNIADSLSRLVQDETSGEDFEKENKFLILAIKESAAVDVEEVE FT RASREDRQLQLAAQCLRDGKWGNPEVKEFEIFRNELGLIGDLLIRGHMLVV FT PHTLRNRILSLAHEGHPGESLMVSRLRDRVWWPSMDREAKAWTKACESCRL FT VGLPGKPAPMVRRPMPTQPWVDVAIDFMGPLPSNEYLLVIIDYFSRYKEVE FT VMIKITAKETVNRLDKIFTRLGYPRTITLDNARQFVSKDFDDY" XX SQ Sequence 4730 BP; 1387 A; 988 C; 1277 G; 1078 T; 0 other; catttggcga cgaagggata gtttttcttc agtttacagg taagagttta tggaaatcga 60 caaaatattt gatgttatgt tgaaaaatca aatttcggct gatgcgatca ccgaaagcca 120 cggggcggcc atcttacttt ttgttttcct ttttaactgg aattttattt ccaactggac 180 ttttgcataa atctaataag attgtattgt ttacatacta cttcacagta gtaaccggtg 240 ctttgcacgg tggaggcagg aagccgtcag gttggaaaaa ccgaaatcaa ggcaggaagc 300 cgttaggttg gagaaaccgt aaccgaggca ggaagccgtc aggttgaaaa aaaaaaccga 360 aatcaagata ggaagccgcg aagttggaaa accgaaccga ggagagtacg aatggattga 420 aatatacaac agaaactgag gtgagcaaat gtaaacacag ggaaaaagaa taagaaataa 480 tagtcgttgt tacttatttt agtcctttga acaaaatgga aaagtgggag attccacaat 540 tcaaattcag atcgattccg aacagtatta tcagagatga gtgggtgaaa tataagcgga 600 atttccagta catcgcggcg gcaaatgaag agaaaaatgc gacccggctc aagaatgtgt 660 ttctggcaaa agctggacca gatgtccagg aagttttcgc ttcgcttccg ggggctgatg 720 tggaggagga taaagccaag aatattgacc cgttcaaagt ggccatcgag aagcttgatg 780 aatacttctc gccgaagcag cacgaggcgt ttgagcggaa cactttctgg aacctgaaac 840 ctcaagatga ggaaaccatg gagaagtttc tgtggcggac gctagaagta gctcagaagt 900 gtaatttcgg caccacgttg gaagaaagcc gttcaatcgc tgttattgac aaagtgatcc 960 tgtacgctcc ggtcgaactg aaacaacaat tgctgcagaa ggaaaaactc acgctggacg 1020 atgttactcg agtggtcagt tcgtacgaat ctgtgaaaaa acagtcgatg ctgatggaag 1080 gcaaatcggg atcaactcct gcatacagcg agcacggagg cggtggcggt cagaagagtg 1140 aacttttcaa cgtccagagc tcctcaagag gaccacccac cacgggtgaa tgttaccggt 1200 gtggccataa agatcattat ggaaatgatc cgaggtgtcc ggcacgatcc gttcagtgca 1260 ccaggtgcaa gaggtatggt catttctcga aacggtgcaa aactgagttc ggccggcgac 1320 caacacatcc gcaatcacag aacgtggttc ccgccaagcg gccgggcggt tctcagaatc 1380 actgggataa caaaaagccg aagtacaatc cgagacacga acgcattgcg caggtggaag 1440 acaacacgag tgaccctgac gaaaatacaa gcttcgtcta caacatcagc gatggagatg 1500 agtttatttg ggttaaagtt ggaggcgttc ttatccagat gctcatcgat tccggaagtg 1560 ctaagaatat tctggatgat cgagcctggg agcacctgcg agtacaagaa gcggtgacaa 1620 caaacttccg aaccgattgt gatcagattt tccgagcgta cggacgggaa tcagctcctc 1680 tgaagatcgt ttgcgttttc gaagcggata ttgcaataga tggagcagcc gaccggattg 1740 aaggaagcga tacgttctat gtggttgaag gtggaacgca accgttgttg ggcagaacga 1800 cggcgaagaa actgggagtt ttgaagatcg gcttgagtag tgtcaacgct ggtgaagttc 1860 aggaacgcag accgtttccc aagatgaagg acataaagct gattgtcccg gttgatgaaa 1920 gtgtcacgcc tgtggttcag aacgttcgac gacctcctat cgcgctattg actcgggtcg 1980 aagagaaatt ggatcaactg cttgcatcag aaataatcga gccagtaaaa ggtaagaaag 2040 ggcgaaggat tgttttgcta gggttgtaca ttgattgttt tattacggac accaggtgcg 2100 agtgattggg tgtcgccatt ggttactatt gttaaagata atggtgactt gcggctttgc 2160 gtagatatgc gcagggcgaa tgaggctatc aaacgagagc accacatgat gcccactatc 2220 gaggatttct tgccacgcct gaaagaggcg aaatttttca gccggttgga cattcgggag 2280 tcgtattacc aaatggagct ggcggagcag agtcgtaaca taaccacctt catcacgcat 2340 aagggtttgt ttagattcac gcgtttgatg ttcggcataa gttgcgcacc ggagatgttc 2400 cagaagcatc tggaacaaat tctggtcggt tgcaacaaca cggtgaacta catagacgac 2460 atcatcgttt ttggagcaac cgaagaggaa catgaccaag cgctgagtga agtgatgaag 2520 gtactgaagg agagagacgt tctcctcaat cacgagaagt gtttgttcaa ggtgcaagag 2580 ctggagttcc tgggtcatat aatctctgtc aacggaataa ggccgacggt cggtaagaca 2640 gaagcgcttg agaagttccg ggaaccaaga tccgtcgaag aagtccgcag ttttttgggt 2700 ctagttacct atggggggcg gttcctgccc aatctcgcaa caatcactgc tccacttcgt 2760 gagctgcttc gtggagacaa acgatttacc tggacagccc aacacacagg atcgtttcag 2820 aagctaaagg aaatgataag cgatatccga acgctgcaat tcttcgataa tacgctgcgc 2880 acacggatta ttgcagacgc ttcgcctgtg gccctcgggg cagttttggt tcagttcgaa 2940 gaccttcaca acatcaaccc ccgtatcatc agctatgcga gtaaaagtct gtcggacaca 3000 gagaaacgtt actgccagac agagaaagag gctctcgctc tggtatgggc ggtggaacgt 3060 ttcgatgtgt atctgatagg aagaacattc gagctggaaa ctgaccacaa accgctggag 3120 tccatattct caccgacatc gaagccgtgc gcccggattg agcgctgggt gttacgtttg 3180 caatcattca agtacagggt ggagtacaag agaggtcaga acaacattgc agattcgctt 3240 tcgaggttgg tgcaggacga gacatcaggc gaggattttg agaaggagaa caagtttttg 3300 atcctagcga tcaaggagtc agctgcggta gatgtagagg aggtagaacg tgcgtcgcgg 3360 gaagatcgtc aactgcagtt agctgcacag tgcttgcggg atggaaagtg gggtaatcca 3420 gaggtgaagg aatttgagat ttttcgcaac gaactcggat tgattggtga tctgctgata 3480 agaggacaca tgctggtcgt tccgcatacg ctgaggaaca ggattctcag cctggctcac 3540 gaaggacatc cgggcgagtc actgatggtt agtcgtttga gagatcgggt ctggtggcca 3600 tccatggatc gcgaggcaaa ggcatggact aaagcgtgcg aaagctgcag gctggtcgga 3660 ttacccggaa agcctgcccc gatggttcgc cgtccgatgc caacccaacc ttgggtcgac 3720 gtggcgattg atttcatggg tccacttccg tcgaacgagt atctgttagt gataatagat 3780 tactttagta gatacaaaga ggttgaagtg atgatcaaga taacggctaa agagactgtg 3840 aaccgattgg acaaaatttt cacacgtctc gggtatccca gaacgatcac cctggataac 3900 gccagacagt tcgtgagcaa ggacttcgat gactactgag acacccacgg catcaccctc 3960 aatcacagta caccatactg gccgcaagaa aacgggctcg tggagagaca gaaccgatcg 4020 ctgttgaaaa gattacagat cagtcatgct cttcatcgtg attggaagaa ggatctacat 4080 gattatcttg ttatgtatta tacgacacca cacacaacaa cgggtaagac tccttctgag 4140 ctgatgttcg ggagaacgat tagatccaag atgccatcag tgggtgacat cgaaacggct 4200 cctgtgaaca ctgattattc tgatcgcgat tacattctga agcaaaagga gagagaggat 4260 tgcagacgag gtgccaaaga ggtccaaatt caaccaggtg atacagttct tgtgaaaaac 4320 ttactaccag gtaaaagtac tctgtattgg gttattcaag taacggtatc atacattaaa 4380 aaaacaaatc taatatcgat cttattacag acaacaaact cacgacaacg ttcaatccga 4440 cgaagttcac cgtagtagat cgctcaggtg gtcgagtcac cgtgtcggac caggagaacg 4500 ggaaaaccta cgaacggaac gtggctcatc tcaagcgagt ggttgatccg ccagcagacc 4560 tcggagaaca gttcgcggat tccgacacgg acggagaaga cttcagagga ttcgatcaaa 4620 tcgaaattca ggagaagtcc cctccgaaag gaaagcgcac ccgccgttgc ccgaccaaat 4680 ttaatgattt tgtgatgtga agaagataag catttccata aaaagggaga 4730 // ID DNA8-3_CQ repbase; DNA; INV; 1769 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1769 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 80-80 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >87% CC identity. 8bp TSDs. XX SQ Sequence 1769 BP; 630 A; 230 C; 222 G; 687 T; 0 other; caagggtttc cagaagcgtt tattacttgc cagtctgcca attaattact caaattactg 60 gtccaaaatg attaattaca taaattactt aattactttt cactacgata aaaaacattt 120 aattttaatt taagtattca tttctacggt tctgaactaa attatttata aatagctcat 180 tcgatagctc ttctaaaaat tatgctgttt attttttttt ttgtaccgat ctggatattg 240 cgaattataa ttttaattga aaatagctat aggcgatcaa acttcaaaaa tgcccccact 300 agaggacgct gaaactttgg gtgatgttta cataatatct tacagcgaat taattggttt 360 gaaatttgga attgttttat tttattataa aattgacatc atttcaggta aaaaataagt 420 agctcaattg ctataaacaa taatatttgt tttatggcca atttaacctg ttcattatct 480 gattcgaaat tccattatgt ttcacgtttc aatcaaccgt acccttatca atcaattgca 540 aaagaagatt attgaagtcg tgtttgtcat aaaggcaaaa caacatactg tactgtttta 600 cttcaatgtt tgagttttaa tataattaat tattaattta aaattgatat tcagtttatt 660 gcaagtttat cgaagttgtt tgaaaaaagg ccccctgatt tttcgagcca aacataaact 720 ttgacaaata tttgcagcgg ctcttcatga aaattgtgaa attcatcaaa ataacgttaa 780 gttttaaaat gtttttcaac attggtttga cattactttg catttttttt tctcaagcat 840 attaaaaaaa tccgttactg aaaatacgga ttatgtgaaa tgtgacaatt acattggatt 900 taatttgtgc atctgaatat tagtcgattt taatttcact gagtttgatt tttaacattt 960 gtctcagaca tatttggaaa tttttcggga aaatagttta gctctttaga gagataagga 1020 ctgtcaattt ctgctgatta tgaaaaaata taaaaattgt atttttcgta ttatttttga 1080 taattttttt attattttca atatttttta aattattttt aatatacttt tattgatttt 1140 taaattgcta tcacttgaaa aattgttttt aaaaatgata attcactatt ttttttaatt 1200 atatgatcca cataaacttt ttttgataat tttgactgcc tattagaagg aaaagcataa 1260 aaaattaatc aattctaatg tagttcttgc aaaacaaaaa aaaaaacata tttttataat 1320 tacttaatga aaatgaatag atttcggatt cactaaacat ttttcactgg gtaatatatg 1380 atggcgttgc cttttgttgt taatcagctt ctataaacaa catgtaaaaa taaattttag 1440 gagagtttta ggagagaact atggcgaatt aggccttcta catacagagc cagtcttttt 1500 taatctttat gtttcaatct tatttgaaat taataaaata ctgttataat agattatatg 1560 taaaaagtaa atagtttcgt agaagaagta caattaaaaa aaatcaatta cttaaattac 1620 tcaaaaatta cttaaattac caatcaacta cattaaatta ctctaattac cataaattac 1680 taattaatta aagaattacc taattaccag ctaaaaatca aagtaattat taattacttg 1740 ccagtccgag gccttgctgg aaacccttg 1769 // ID Copia-36_DPu-LTR repbase; DNA; INV; 313 BP. XX AC ACJG01003741; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-36_DPu_; KW Copia-36_DPu-I; Copia-36_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01003741; Positions 3618 3930. XX SQ Sequence 313 BP; 77 A; 77 C; 52 G; 107 T; 0 other; tgttggaaca agctcattga ggcggtgcgg caacatgctc cgggcgtctg gttgctgaaa 60 cccatgcgca cacagggtgc tgcttagcat caattctttt ccactatccc tctttttagt 120 tcttccctgt tcttcttttt tttgtcacct cgcgaagaag tcactctttc tttcgtatct 180 atctaagtgt tcaatattca actgtacctc atcacataat tgtaagtact cgttgaatat 240 tcaatgtatc ccaatacaga tagtaaaaca caacagtatg ttcaacgtca gttatcctca 300 ctatgcttta aca 313 // ID RTE-9_BF repbase; DNA; INV; 3295 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-9_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-9_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3295 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3295 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1707-1707 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 171..3281 FT /product="RTE-9_BF_3p" FT /translation="RMDPQHVSYAARWQPHQLTTPASGPLPVMAELNHVTT FT PTGSIRTDLSRNRPLRATDKRVLNFRNKVRVATWNVQTLLRTGYATLLSHE FT LTKYDLSLAGICEVRWPGSGETIVGDHTFIWSGPEDRTGLYGVALAIPSRL FT RGSLISWKPISDRLLTARLLHTHGKMTVIVAYAPTEVAEDTDKDAFYDQLQ FT EAVQDAPPHDITLVLTDANATLSANTHPSAQQDAMGPISVDTTTNDNGNRL FT LELCRATNLCIADTWFPRKRIHHWTWYSPDGRTRKAIDHILISGRWKSFIT FT NCRVYRGAQLGNTDHRLLMAHMRLKLKADPSSKKQVRVDSARLKDPLIQEA FT FSCAISNRYNALAMDATTDWQLFKEEVLKATEETVGKSRPSPKQPWISPET FT LDIIDSRRAARLRGDLTEYRRLNGIRNIAIRQDREKYWTDQAAKLEAAAER FT NDQRQVYRLLRQAKAGPRQRSFLIKDSVGNIISTESDCIHRWKEHFSQLLN FT HPPVPEDPTLSEAASATDATNADCPVTPVTPDEVKAALGKLKNGKAPGICN FT ITAEMLKAGGDHMIQWLTQIVNHVWVLETLPDDWRRGIILPFWKRKGDQLV FT CSNHRGITLLSIPGKLFTRILLTRAIPAIRSRRRLQQAGFMPNRSTIDHIS FT ALRLAIEKAREFRKDRHLYIAFIDLRAAFDTVDHASLWKILRLLGAPQKTI FT SLFQQLYNSAESCVRVNGKESEWFSINSGVRQGCVAAPDLFNCIIDYLMSR FT VCERVPGVSFGQYNLADIEYADDTSLLADDLTRLQDALTVYDEEAKKLGLS FT INWGKTELMHVGDGPDPEPFIFSGCPVKFVSTFKYLGSIISKTGDLKPEIN FT QRRAQASAVLQSLWKPLWRHRHVSLQTKQRVYNSSVVSVLLYGSETWPLSN FT TLAARLDGFDSRALRRITCTRWQEHVRNSDLREQTGQPPASSLAAMRRVRW FT YGHVLRLPPEHPTRALLDFDPKQAGWRRPRGAPRTRWMDVVTRDLRSINIT FT PQQAQPAALNRGRWRTLVHAVGSTHHVQEDE" XX SQ Sequence 3295 BP; 817 A; 1019 C; 833 G; 626 T; 0 other; tcagggggat gccctaacac tctcacgggg atgccctaac atcggctccc ggtccatccg 60 ccttgccgtg gtgtgagagc ccgaatgtac caatctccca gtcccggggt aggtgccaca 120 ttcctactct gtaacctcga atgagggggt ggagcttggg tagtcgctaa aggatggacc 180 ctcagcatgt cagttatgct gcgaggtggc agccccacca gctgactacg ccagcgtcgg 240 gcccactacc cgtcatggcg gaacttaatc atgtaacgac accgactgga agcattagga 300 ctgacctttc tcggaatcga ccactgcgcg ccacagataa acgagtactg aatttccgta 360 acaaggtacg cgtcgcgacc tggaatgtac aaacccttct tcgaacaggc tatgcaacac 420 ttctgtcgca cgaactcacc aaatatgacc tcagtctggc gggcatctgc gaagtcagat 480 ggcccggctc aggagagacc attgttggag accacacctt catatggagc ggccctgagg 540 acagaaccgg cctctacggt gtagccctag caattccctc acggctgaga ggctcactca 600 tcagctggaa acccatctct gacagactgc tcacagcccg tctccttcac acacatggaa 660 agatgactgt cattgtcgcc tacgcgccaa cagaggtagc cgaggacact gacaaggatg 720 cgttctacga ccagctgcag gaagctgtcc aagatgctcc accgcatgac atcaccctcg 780 ttctcaccga tgctaatgcc accctttcag ctaacaccca cccctctgcc cagcaggatg 840 cgatgggccc catatctgtg gacacaacta ctaacgacaa cggaaaccgc ctactggaac 900 tctgcagagc tactaacctc tgcatcgctg acacctggtt tccacgcaag aggatccatc 960 actggacctg gtacagccct gacggaagaa ccagaaaagc catcgaccac atcctcatct 1020 ctggccgctg gaagtcattc atcaccaact gccgagtcta cagaggtgcc cagcttggca 1080 acacagacca caggctgctg atggcccaca tgcgactcaa gcttaaagcc gacccttcct 1140 ccaaaaagca agtacgagtt gactccgcac ggctcaagga tcccttaatc caagaggcct 1200 tcagctgtgc catctccaac aggtacaacg ccctagcaat ggatgccacg acggactggc 1260 agctctttaa agaagaggtc cttaaagcga ctgaggaaac agttggcaag agcagaccct 1320 cacccaagca accctggatc tcacctgaga ctctcgacat cattgactca cgccgtgctg 1380 cacgcttacg tggagatctg acggagtatc gccgactcaa tggaatccgc aacatagcta 1440 ttcgccagga ccgtgagaag tactggacgg atcaagctgc taagcttgaa gccgcagcag 1500 agcggaatga ccagcgacaa gtctacaggc tcctacgtca agccaaggct ggcccacgac 1560 aacgcagttt cctcatcaaa gacagcgtcg gcaacatcat ctctacagag tccgactgta 1620 tccacagatg gaaggaacat ttcagccagc tgctcaacca ccccccagta ccagaggacc 1680 ctacactctc cgaagcagcc agtgcgactg atgccacaaa cgcagactgt cccgtaactc 1740 ctgttactcc ggacgaggtg aaggctgccc tgggtaaact gaagaatggg aaagcccctg 1800 ggatctgcaa cataacggct gagatgctga aggcaggagg ggatcacatg attcagtggc 1860 ttactcagat agtgaaccat gtgtgggtcc tggagacact ccctgatgac tggagacgtg 1920 ggatcatcct gccgttctgg aaaaggaagg gagaccagct ggtgtgcagc aaccacagag 1980 gcatcaccct actctccatt ccaggcaagc tgttcacccg tatcctcctc accagagcca 2040 tccctgccat caggagcagg cggcgcctgc aacaggcagg cttcatgcca aaccgctcca 2100 ccatcgacca catttctgca ctccgtctgg ctatcgagaa ggcccgggag ttcagaaagg 2160 accgtcacct ctacatcgca ttcatcgacc tccgagcggc gttcgacacg gtagaccacg 2220 cctccctctg gaaaatcctg agacttcttg gagcacctca gaagacaatt tccctcttcc 2280 agcagctgta caactccgcg gagagctgcg tcagagtaaa cgggaaagag tccgagtggt 2340 tctccatcaa cagtggggtc agacaaggat gcgtggctgc accggatctc ttcaactgca 2400 tcatcgacta tctgatgtcc agggtctgtg agcgtgtccc cggggtatct ttcgggcagt 2460 acaacctggc ggacatagag tatgccgacg acacttccct gcttgccgat gacctgaccc 2520 gacttcaaga cgctctcaca gtctacgacg aggaggccaa gaagctcgga ctcagtatca 2580 actggggcaa gacggaactg atgcatgttg gagacggccc tgaccctgaa cccttcatct 2640 tcagcggctg ccctgtcaag tttgtctcca cattcaagta cctgggttct atcatctcta 2700 agactggtga cttgaagcca gagataaatc aacgacgtgc ccaggcctcc gctgtcctac 2760 agtccctgtg gaaaccactc tggcgacaca gacatgtctc cttacagacc aaacagcgtg 2820 tgtacaactc ttccgtcgta tcagtccttc tgtacggctc cgaaacctgg cctctgagca 2880 acaccttggc agcccgtctc gatgggtttg acagcagagc cctgaggaga atcacatgca 2940 ctcgctggca agagcatgtg cgtaacagcg acctccggga gcagaccgga caacctcctg 3000 cctctagcct cgctgccatg agaagagtcc gctggtatgg ccatgtcctt cgcctccccc 3060 cagaacaccc tacgagagcc ctgctagact tcgatcccaa gcaggcagga tggcggcgac 3120 cacggggggc gcctcgcacc cgctggatgg acgtggtgac cagagatctc cgcagcatca 3180 acatcacacc ccagcaggcg cagccagcag cattgaacag agggcggtgg agaactctag 3240 tgcacgcagt cggctctacg caccatgtgc aagaggatga gtgagtgagt gagtg 3295 // ID R1_DPs repbase; DNA; INV; 5447 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE pseudoobscura. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DPs. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5447 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 398..1843 FT /product="R1_DPs_1p" FT /translation="VTPIEVATTRRAMEIETEADASDTSVASVGASSSSSV FT PARRRGRPKSSAAKQGAKKIGLGIGLAGERPIPAERSPPTSLPFEQAPSTS FT AAAAATTAASTSAAAATTAAATAAATAAATAATTAATTAATCPATAIVAAG FT YSAIIAEMSAIRMAVSEAVLKGLMPPEASMEILVATNRYDELVMALAGEKV FT RLEERARMPPPPRPAAAAHTGTTAVTAAAYATAFPGLPAPSAVAAPIPKPR FT DTWSALIKSKNPEETSKELVERVRKTVVPTLGVRVHEVRELKSGGAIIRTP FT SVSELRKVVASSKFTEAGLEVKKRPETKPQVVVYDVDTSITPEEFMEELFT FT KNLEETMTAAEFKKSVHLGSKPWSVTDGATINVTLEVDAKAQEALRECVYI FT KWFRCRCRSLVRTYACHRCAGFDHKVSQCRLKENVCHRCGQNGHNVARCPN FT PVDCRNCRFKGYPAAHSMLSAACPIYGAVLARVQARH" FT CDS 1840..4959 FT /product="R1_DPs_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TLMFSFIQANCGRGRAAVMELGAIMRSSGRQFALVQE FT PYVDGAGRITGLPSGMRVFQDRRGKAAIIVDEPDAICMPMETLTTDFGVCV FT RVTGRFGSIFLCSVYCQFDTDLAPYLRYLDAVLLLGSRTPVIFGLDANAVS FT PAWFSKLSGHNRGQANYARGELLSEWITEMRAGVLNEASRVFTFDNRRGQS FT DIDVTIVNQAATMWATYDWEVKEWDTSDHNMIHVVVTTDPNDTVEPIAPVP FT SWKLSNARWRLFEEEVVREIAELPEDIAESPLDNQVSALRSVVHSVCDRVL FT GRSTPRAARKVVWWTAELHSKRREVRRLRRRLQDARRHETDAAEELVLALR FT ISSAQYKKLILRSKEDNWRRFVGENRDDPWGHVYRICRGRKKSTEIGCLRT FT DGRLVATWRDCAGVLLRNFFPVAETNAHIAIPIEAPPALEAFEVDACVARL FT KSRRSPGMDGITGAICKALWRAIPQHMTAMYSRCLETGYFPTEWKHPRVVP FT LLKGPDKDRTDPTSYRGICLLPVLGKVLEGIMVNRLKDTIPDGCRWQFGFR FT QGRCVEDAWKHVVSTVSASRSKYVLGVFVDFKGAFDNVEWNAALRRLLELG FT CREASLWRSFFSGRSASIVSRHGVATVPVTRGCPQGSISGPFIWNILMDVL FT LQRLEPLGALSAYADDLLLLIEGNARSELEQKGGELMSIVGAWGVEVGVAV FT STSKTAIMLLKGILRRPPVVRFAGASLPYKRKYRYLGITVGERLSFLPHIT FT GLRDKLTGVVGALSRVLRVDWGLSPRAKRTIYAGLMVPCALFGASVWSVVT FT TTQVVARRHLLACQRIVLIGCLPVCRTVSTMALQVLAGAPPFDLVARRLAI FT SFKLKRDYPLEESDWLYGEDLANLSWKQKMARLDEDLLCEWQRRWDDGDSP FT GRVTHRFIPNAGFVYSERRFGFTLRAGFLLTGHGSLNAFLHGRSLSDTPAC FT LCGAEREDWQHFLCACPLYTDLRDLDGLGVQYTDGDWTFSDVTASQERMRT FT LDRFAGLAFSRRQQLLNAQAAHGLGGFPIQAGRG" XX SQ Sequence 5447 BP; 1227 A; 1452 C; 1649 G; 1119 T; 0 other; cagttgcttt ttgactgcca ctcgaagcag acgtgttttt caagcggtcc gctataccgt 60 cgttcgaata aagtaaaaag tgatttttcc aagtggaaaa tacatcaaaa ttgccgcgag 120 atcacgcgct gttcgtgagt gaaatcgggt gttaatttgg tgaaagaaca aagcagatat 180 ttgcaaatta atatttgcag aataaacgct aaaaacaacc acgtgcggaa ccctagttcg 240 taaacaagtg cgaatagttc gacaacaagt cggaatagtt cgacagcacg ttcgagttcg 300 aagctttggt tcgaagcttt ggctcgagtt cgaagcgttc ggttcgtgtc ttcgctgaga 360 cgttggttcg agtctccact aagtgccgat tagttgagtc acgccaatag aggtagccac 420 aacccgccgg gctatggaaa tagagacgga ggccgacgcc agcgacacta gcgtggctag 480 tgtaggggcg agctcgtcgt cgagcgttcc cgcccgcagg cggggtcggc cgaagtcctc 540 agccgccaaa cagggagcca aaaagatcgg gcttgggatt gggctagctg gagagcggcc 600 gatcccagcc gagagatcgc cgccaacgtc gctccccttt gagcaggctc catccacctc 660 tgccgctgcc gctgccacca ccgctgccag tacttctgcc gccgctgcca ccaccgctgc 720 cgccaccgct gccgccaccg ctgccgccac cgctgccacc accgctgcca ccaccgctgc 780 cacatgcccg gcaactgcca ttgtcgcggc aggttactcg gccatcatcg cagaaatgtc 840 tgcaataagg atggcggtga gcgaggctgt ccttaagggg ctgatgccgc ccgaggcctc 900 aatggaaatc ctcgtcgcga ccaaccggta cgacgagttg gtcatggccc tggctgggga 960 aaaggtgcgc ctggaggaga gggcaaggat gccgcctcca ccgagacccg ctgctgctgc 1020 ccacacgggc accactgccg tcacagccgc tgcgtatgcg acagccttcc caggcctgcc 1080 cgcaccgagc gccgtcgctg caccgatccc aaagcctagg gatacctggt ctgctctaat 1140 aaagagcaaa aacccggagg agaccagcaa ggagctagtg gagcgtgtta ggaagaccgt 1200 ggtgccgact cttggagtcc gcgtccacga ggttcgcgag ctgaagagcg gaggagcgat 1260 tattcgcacc ccttcagtga gcgaactgag gaaggtagtg gccagcagca aattcaccga 1320 agcaggattg gaggtgaaga agaggccgga gaccaagcct caggtcgtgg tgtatgacgt 1380 ggacacgtcc ataacaccgg aagagttcat ggaggagctc ttcacaaaga acctggagga 1440 gaccatgaca gccgcggagt tcaaaaagtc ggttcacctg ggcagtaaac cctggtcggt 1500 caccgacggc gccacgatca acgtgacgct agaggtcgac gcaaaggcac aggaggcgtt 1560 gcgcgaatgc gtatacatta agtggtttag atgccgctgc cgctccttgg tcagaacata 1620 cgcctgccac agatgtgcag gcttcgacca caaggtgtcg caatgtcgcc taaaggagaa 1680 cgtgtgccac cggtgtggac agaacggcca caacgttgca cggtgtccca accccgtgga 1740 ctgccgcaac tgccgcttca aggggtaccc tgcagcacat tccatgctgt cagcggcgtg 1800 cccgatctac ggagcggtac tggcgagggt gcaagctaga cattaatgtt tagcttcatc 1860 caagctaatt gtggccgtgg ccgagcggct gtgatggaac tcggagccat catgcgcagc 1920 tctggccgtc agttcgcact ggtccaggag ccgtacgtcg atggagcagg gcggattacc 1980 ggccttcctt ctggaatgcg agttttccag gaccgccgag gaaaagctgc tatcatcgtc 2040 gacgaaccgg acgccatctg catgccaatg gagaccctta ccacggattt tggagtatgc 2100 gtcagagtta cgggaagatt cggctcaatc ttcctatgct ccgtgtactg ccaattcgac 2160 accgacttgg cgccgtacct caggtactta gatgcggtgc tgctgctggg cagccgcact 2220 cctgtcatct ttgggctcga cgcgaacgca gtatccccag cgtggtttag caagctctcc 2280 ggacacaatc gggggcaagc taactatgca cggggtgagc tgctgtctga gtggataacc 2340 gagatgagag ccggcgtgct caatgaagcc agtcgggtgt ttacattcga taaccgtaga 2400 ggccaaagcg atatcgatgt gacaatcgtc aaccaagctg cgactatgtg ggccacatac 2460 gattgggaag tgaaggaatg ggacaccagc gatcacaaca tgatccatgt tgtggtgacg 2520 actgacccga acgacacagt tgagcccatt gctcctgtgc cgtcatggaa gctttccaat 2580 gcgcgctggc gattgttcga ggaggaagtg gtaagggaga ttgccgaatt accggaagac 2640 atcgccgaat cgccgttgga caaccaagtg tctgcactgc gctctgtagt gcacagtgtg 2700 tgcgacagag tgctgggacg cagcacaccg agagccgcga gaaaagtagt ttggtggact 2760 gccgaactac actccaaacg ccgagaggtc aggagactga ggcgaaggct ccaggacgct 2820 cgtcggcatg agaccgacgc agcagaggaa cttgtgctcg cgttgaggat ctcctcagcg 2880 cagtacaaga agctcatcct gagatcgaag gaagacaact ggcgacgctt cgtgggagag 2940 aacagagatg atccatgggg gcacgtctac aggatttgcc gaggccgcaa aaagagcacg 3000 gagattggat gccttcgaac ggatggtagg ctggtcgcaa catggcgcga ctgtgcgggt 3060 gtgctccttc gcaacttctt tcctgttgcg gagacgaatg cacacattgc catcccgatc 3120 gaggctccac cggccctcga agctttcgag gttgatgcat gcgtcgccag gttgaagagc 3180 agacgctctc ccggcatgga cggcatcaca ggtgccattt gcaaggcatt gtggcgtgcc 3240 atccctcagc acatgacagc gatgtattcc cgctgcctgg agacagggta tttcccaacg 3300 gaatggaagc atcccagggt ggttccactc ctgaagggac ccgataagga ccggaccgat 3360 cctacctcat atcggggtat ctgtctgttg ccagtgctgg gcaaagtgct tgagggcatc 3420 atggtgaatc gtctgaagga caccattccg gatggctgca gatggcaatt tggctttcgc 3480 caaggacgtt gtgtggagga tgcttggaaa cacgtcgtga gtactgtttc ggccagccgg 3540 tcgaaatacg tgctcggagt cttcgtggat ttcaagggag ccttcgacaa cgttgagtgg 3600 aatgctgcac tacgccgcct ccttgagctg ggatgccgag aagcaagctt gtggcgaagt 3660 ttcttctccg gccggagtgc gagcatcgtc agtaggcatg gagtagccac tgttccggtg 3720 acaagaggtt gcccgcaggg gtccataagt ggtccgttta tatggaacat cttaatggac 3780 gtgttgctcc agcgcttaga gccccttggt gcgctcagcg cgtatgctga cgacttgctc 3840 ctcctcatcg aagggaatgc ccgatcagag ctcgaacaaa aaggaggtga gttaatgtcc 3900 atcgtaggcg cttggggagt tgaagtcggc gttgccgttt caactagcaa gacggcgatc 3960 atgctgctca aaggcatact tagacggccg ccagtggtac ggtttgctgg agcaagcctg 4020 ccatataagc gcaagtatcg gtacctaggc atcacggtcg gcgagcggtt gagttttctc 4080 ccgcacatca cgggcttgcg tgataagctg accggagtcg taggggcatt gtcgcgcgta 4140 ctgcgggtcg actggggact cagtccccgc gcaaagcgga caatatatgc cggactcatg 4200 gtgccctgtg cactatttgg tgcctcggtt tggtctgtcg tgacgacgac gcaagtggtt 4260 gccaggaggc atctgcttgc gtgccaaagg atcgtcctga ttggatgcct accggtatgc 4320 cgaacagtgt ccaccatggc gctgcaagta ctagctggag cccccccgtt tgatctggtt 4380 gccagacgcc tggcaatcag cttcaaacta aagcgtgact acccgctgga ggagagcgat 4440 tggctgtacg gcgaagattt ggcaaatctt agctggaagc agaagatggc gcgactagac 4500 gaagacctgt tgtgcgagtg gcaacgcaga tgggatgatg gtgactcccc aggacgggtg 4560 acgcaccgct tcatcccgaa cgcaggcttc gtctacagcg aacgaaggtt tggcttcacg 4620 ctgcgcgctg ggttcctgct gacgggccac ggatcgctca atgcatttct gcatggaaga 4680 agccttagcg acacgccagc atgtctatgc ggcgcagaac gcgaggattg gcagcacttc 4740 ctatgtgctt gtcccctcta tacagacttg cgagacctcg atggacttgg agtgcagtac 4800 acggatggcg actggacctt ctccgatgtg acggcttctc aggagagaat gcggactctc 4860 gacaggttcg ccggactggc gttctccagg cgacagcagc tgctgaatgc gcaagcggcg 4920 cacgggctgg gtggattccc tatccaggcc ggtcggggct aaaagggagg atcgagctga 4980 gtccttaaga tcggtaccac gggttgtgca gttccaaggc tgcacattga ggtcggcccc 5040 ctagtgggag tatcgtggtg gctgtggttg atacccaaac gttacgcggg gagagccgct 5100 aggctcctcg tggagttgcg ctctcaaccg ggtgccgaga cccatagatc ggaagacgtg 5160 ttagatacac ctcgcccctc accaaggggg attgtatgcc cgaccaggat actttcaatt 5220 ggtaccagaa gagtcgctat gtacatagct atagcttctt tttaggggcg ttaactggcg 5280 cattgtacca ggttcctgcg tatgtagcgg tggtgcgtga tggcgtgata tatgtaaata 5340 tatcgcacta atcgaggcta agctgcagat aaacattaga cgccgtggtt gaaatccctc 5400 cctgaggaac cgccacgtaa aataaagctg agagatcaga ttcattc 5447 // ID Copia-12_SI-LTR repbase; DNA; INV; 320 BP. XX AC AEAQ01030007; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_SI_; KW Copia-12_SI-I; Copia-12_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-320 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030007; Positions 1309 1628. XX SQ Sequence 320 BP; 94 A; 64 C; 59 G; 103 T; 0 other; tgttgtaatg tgcaatttgt tgttactcaa ctacgtttaa acattatgat tctattggtt 60 ggtaagagct ttcgcgcctt gtgactacga acggactgag tgctctcgcc ctctctccct 120 aaccacgctc tgccacgcgc tcatacgttt ttgtattttg taaaggaaca ataaatgtat 180 tattacatta tagtaattag aaggtctaat ctaccctgtc gctacaaact gtaagtagta 240 gagatttaac aaaaggagga atcaaagtca gactactaaa tcggtgtcag tcatattcgt 300 cggaacctct aatattaaca 320 // ID Copia-16_CQ-I repbase; DNA; INV; 4357 BP. XX AC AAWU01015501; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_CQ_; KW Copia-16_CQ-LTR; Copia-16_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4357 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 347-347 (2011). XX DR GenBank; AAWU01015501; Positions 16395 12039. XX CC Positions [1737-2276] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 381..2555 FT /product="Copia-16_CQ-I_2p" FT /translation="MLSLLEEREFLECIEANVADVPALKDEEGDDAAKQLE FT KARLRERRTKRDRSCKNLIISRINDDQLEHVQDKTTPKDIWDALKHIFQRK FT SLARRMYLKKEILTLALGNTSLQQHFLTFDKLVRQYRATGADMDELDAICH FT LLLTLNASPKYAAVVTSLDTQPEEQLSLEFVKCRLLDEETKQKSVQIVIPK FT KEETAFVGAKAQQQKPKQKKKLRCYNCKEEGHKMSDCPQKKKKEVKASANM FT AEEQSAVCFAGLSRGESVPEQPVTWYIDSGCSDHLANDKSLFVELEPLKCP FT VEIAIAKNDECIVAKHSGKVKVISKVNDKDILCTIENVLFVPELRCNLFSV FT MRVDDAGMRVTYAEGEVAIYRGSELVASGKRDGKLYRLNFVRARVGENKSM FT LTCGLVPKNVQLWHRRFGHLNAKSLDQLIRRDMVSGLKLNGGQNGKELVTC FT EACVLGKQTRKPFVVREGKRSSRVLELIHSDVCGPISPDGLDGAKYFVTFI FT DDWSHFVVVYLVEAKSEVFRYFKMYEAMATAKFGKKVCRLRCDNGGEYRSN FT EFEQFCQEKGIQVEWTAPYTPEMNGTSERFNRTLVEKSRAMLEDSRVDKKF FT WGQAIQTAAYLTNRSPSSALPPDVTPYEQWEGKKPDVSKLRAFGCPVYAHV FT PKELRKKLDPKAWKGIFLGYTRGGYRIYNPELQRIVHTRDVDFLELEKPEH FT KAKWNDADVALLPSPEVDEDSIAAD" FT CDS 2857..3954 FT /product="Copia-16_CQ-I_1p" FT /translation="MKSLTKNGTWTLAKLPDGRAPVSCKWVFAVKRGLDGD FT RVRYKARLVARGFSQRKGFDYGETYSPVAKLDTLRTVLAVANREKLVVHQM FT DVRTAFLNGTLEEEIFMTQPEGFEEGTDLVCCLKKSLYGLKQASRAWNERF FT NNFVVDRLKFERSLNDQCLYTRKDERQTLIIVLYVDDIVIAGTTLKAVETI FT KRCFFNEFDMTDVGEIKCFLGMQVERNREEGVMRISQKQYMESLLRRFGME FT DCKPVSTPIESRLKLHKGTEEERTSQPYRELIGCLTYACLTTRPDLAAAVN FT FLSQYQSCPNEAHWKHLKRILRYIKGTLDVGLVYRKNLEAPTVEVFADADW FT ANDVLDRRSVSGGIFKVFGSTVA" XX SQ Sequence 4357 BP; 1075 A; 1082 C; 1363 G; 837 T; 0 other; ctttaggttt tgggcccggg taacgtttcg gatctaccgg ttatcgcggt caaagtgttt 60 tcggtgatgg aggggacaag tcggaaaagt tgttccttcc gttctttgac ggcaccaact 120 acactcaaac cccgatggtt tgacaccaac tgttgtcaaa cgaacggggt cactttttag 180 tttgacaccc cttttacacg gatttcacac acactaccaa acgtttgttt tgatagtgtg 240 cgtgagggcc gtgtaaaaag tgacagttcg tcacttttta gtttgacttt gaccaaccaa 300 cggggtacaa attaaaaaag tgtcaagcga aaaagtgacc aaccaccggg ggttgagtgt 360 acaatgcctg gcggtatcgg atgctgtccc tcctggagga acgcgagttc ctggagtgca 420 tcgaggcgaa cgtggcagat gttccggcgc tgaaggacga ggaaggtgat gacgcggcca 480 agcagctgga aaaggcgcgg ctgcgggaga ggaggacgaa gcgggatcgt tcgtgcaaga 540 atctgatcat ctcccggatt aacgacgatc agttggagca tgtccaggac aagaccaccc 600 ccaaggacat ctgggacgcg ttgaagcaca tttttcaacg gaaaagtttg gcgcgccgga 660 tgtacctgaa gaaggaaatc ttgacgctcg cgttggggaa cacgtcgctg cagcagcact 720 tcctgacctt cgacaaactg gtccggcagt accgtgcgac tggagcggac atggacgagc 780 tcgacgctat ctgccatctt cttctgacgc tgaatgcgag ccccaagtac gctgccgtcg 840 tgacgagtct ggatacgcaa ccggaagagc agctctcgct cgagtttgtg aagtgcagac 900 tacttgacga ggagacgaag cagaagagcg tccagattgt gatcccgaag aaggaggaga 960 cggcgttcgt cggggcgaag gcgcagcagc agaagccgaa gcagaagaag aagctgcggt 1020 gctacaactg caaagaagaa ggacacaaga tgtccgactg cccgcagaag aagaagaagg 1080 aagtgaaagc aagtgcaaac atggcggaag aacaaagtgc agtgtgtttc gctgggttga 1140 gcagggggga aagtgttccg gagcaaccag tgacctggta catcgattcg ggctgctcag 1200 accatctcgc gaacgacaag tccttgttcg tggaactcga accactgaag tgtccggtgg 1260 agattgcaat cgcgaagaac gacgagtgta tcgtggccaa gcattctggc aaggtgaaag 1320 tgatctccaa agtgaacgac aaggacattc tctgcacgat tgaaaacgtt ctgtttgtgc 1380 ccgaattgcg ttgcaatctc ttctcggtca tgcgtgttga cgatgcggga atgcgtgtga 1440 cctacgccga gggtgaagtc gcgatctacc gtgggtcgga gctcgtcgcg agtggaaaac 1500 gagacgggaa gctgtaccgt ttgaacttcg tgcgcgcgcg cgttggcgaa aacaagtcga 1560 tgttgacgtg cggactcgtg ccgaagaacg tccagctgtg gcaccgtcgg ttcggccact 1620 tgaacgcgaa gtcgctcgac cagctgattc gccgggacat ggtctcgggt ttgaagctca 1680 acggtgggca gaatggcaaa gaactcgtta cgtgcgaagc ttgtgtgctc ggcaaacaaa 1740 cccggaagcc gtttgtagtg cgcgaaggca aacgttcgtc gcgagtgctc gagttgatac 1800 actcggatgt gtgcggaccc atttcgccgg acggcttgga cggagcgaag tacttcgtga 1860 ccttcatcga cgactggagc cacttcgtcg tggtttacct ggtggaggcg aagagcgagg 1920 tcttccggta cttcaagatg tacgaggcca tggcgaccgc caagttcggg aagaaagtgt 1980 gtcggctccg ctgcgacaac ggcggggagt acagaagcaa cgagttcgag cagttctgtc 2040 aggagaaggg catccaggtg gaatggacgg cgccatacac acccgagatg aacggcacaa 2100 gcgagcggtt caaccgcacc ctcgtcgaga agtctcgagc catgctcgag gacagtcgtg 2160 ttgacaagaa gttctggggt caagcgatcc agacagcggc gtatctgacg aaccgcagtc 2220 cgtcgagtgc gcttcctccg gatgtgacgc cgtacgagca gtgggaaggt aagaagcctg 2280 acgtctccaa gttgcgtgcc ttcggctgtc ccgtgtacgc gcacgtgccg aaggagttgc 2340 ggaagaagtt ggatcccaaa gcttggaagg ggatcttcct cggatacacc cgcggcggct 2400 accggatcta caacccagaa ctgcagcgca ttgtgcacac acgagacgtg gacttcctgg 2460 aactggagaa gcccgaacac aaggcgaagt ggaacgacgc agacgtggct ctgttgccgt 2520 cacctgaggt cgacgaagac tcgattgcgg ccgattgagc aagaaccgga agaaagccag 2580 gtggagacag acgaagaata cgaaagtttc caggaggaat ccgaaggtga accacacgag 2640 gaagagccga acgtcgagaa tccgccagaa ccaggaccga gcggcggcag gccgcagcgt 2700 ccgcgtaatc caccggactg gcacaaggac taccaggtcg agtacgcagc tttcgcgctg 2760 aacgcaacca gctacgtgga cgatgtcccg agttcgctgg ctgaggcgcg caagagaagt 2820 gactggctga actggaaggc ggcggtggac gacgagatga aatcgcttac gaagaacgga 2880 acgtggactc tggccaaact cccggacggc cgagcccccg tgagctgcaa gtgggtgttt 2940 gcagtcaagc gcggattgga tggagatcga gttcggtaca aggcgcgcct cgtggcacga 3000 ggtttcagcc aacgcaaggg attcgattac ggggaaacct attcgcccgt cgccaagctg 3060 gacaccctgc ggacggttct tgccgttgcc aatcgtgaga agctcgtcgt gcaccagatg 3120 gacgtgcgca cggcgttcct gaacggaacc ctggaagagg aaatcttcat gactcagcct 3180 gagggcttcg aggaagggac agatctcgtt tgttgcctga agaagtcgct gtatggcctg 3240 aagcaagcat cgcgcgcctg gaacgagcgg ttcaacaact tcgtcgtgga tcgtttgaag 3300 ttcgagcgga gcctgaacga tcagtgcctt tacacacgaa aggacgaacg gcagactttg 3360 atcatcgtgc tctacgtcga tgacatcgtt attgccggca cgacgctgaa ggctgtggag 3420 acgatcaaac gctgcttctt caacgagttc gacatgaccg acgtcggaga gataaagtgc 3480 tttctcggga tgcaagtcga acggaaccgg gaggaaggcg tgatgcggat cagccagaag 3540 cagtacatgg agagtttgct gcggcgattt gggatggaag actgcaaacc tgtttccact 3600 cccatcgaaa gtcgcctgaa gctgcacaag ggcacggagg aggaacggac gagtcagccc 3660 taccgagagc tcatcgggtg cctgacttac gcctgcctca ccacaaggcc ggatctggca 3720 gcagcggtaa acttcttgag ccagtatcag agttgcccga acgaagcgca ctggaaacac 3780 ctcaaacgga tactgcgcta catcaaagga actctcgacg ttggccttgt gtaccggaag 3840 aacttggaag cgccaaccgt ggaagtcttc gccgatgcag actgggcgaa cgacgtgctg 3900 gacagaaggt cggtaagcgg tgggattttc aaggtgttcg gatcgacggt ggcctgaatc 3960 acacggaagc aacagacggt ctcgctatct tcgactgaag ctgagctcac tgcgctttgt 4020 gctgcggtgt gtcacgagtt gtggctggga cgtctcctgc aagatctcgg attcaagcca 4080 gaagaaccgg tccgagttca cgaagacaac caatcaacga tccgggttgc ggaggaagca 4140 aaagacttcg ggcgtctcaa gcacgttgat gtcaagattc acttcatccg tgacctcatc 4200 aagcagaagc ggatcaagct ggagttcata ccgtcggcga accagcaagc ggacatgatg 4260 acgaaggggc tacctgtagc agcttttcgc aagcagtgtt ctgctatcgg actggagcgt 4320 tgcagcggtt gagcaggggt gttaggaagg caacctc 4357 // ID Sola2-3_HM repbase; DNA; INV; 3224 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola2 DNA transposons from Hydra magnipapillata. XX KW Sola; DNA transposon; Transposable Element; Sola3; Sola2-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3224 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 913..2544 FT /product="Sola2-3_HM_1p" FT /translation="MKMPLTHAERQRKYRERQMQKFGPDFIKEKDSIRKKE FT KRNSNVELARLKDKLRKKKSRVSSSKITSNISPAYKSSCTLGKAVKKAIRA FT LPTSPLKKATVVKQLAIQFNKDLPDMQPTHGPNALNDETKLCVISFFEQDG FT ISRQAPGVRDTIVLRSNKEKKKMQKRYLTMNISEAYQIFKNEHTDFSISKS FT KFASLRPGHILLSSLMPRNVCSCQKHENIILSLKALHFIDLKIPLYSHALP FT NTLVCDEKNDNCWNNLCEACKDGKLFLKTYQLDEKNHDKEITWYLWEKVTQ FT LNGTKYLEKAIKKDKAIILYNNVCKMLPKFLLHYFIKGKQSMSYENQKKVL FT KSNLDSVLLQVDFAENFSTFWQDEVQSAHWNKKLVTLFTAVSWHGDSCKSA FT VIISDDLNHSRSSVIYFIDLLITNLIRKDVKYLQVWSDGPSSQFKNCYIVA FT SIPWLESKHKIKICWNYFATSHGKGPCDGIGATIKRLATQKIVRREALIND FT ATAFYKAVRDESQVNVFMVAAEKIKTAFDGTELGNIINNACKTPWNI*" XX SQ Sequence 3224 BP; 1173 A; 455 C; 474 G; 1120 T; 2 other; gggctattcc acgttgtaca aaaagtttaa tataaatttc aaatcctaga aaataaaaaa 60 ctaaatgtat cattcaaaag ttaaaggttt gtagtttcta aaaacatgta acatgtagaa 120 ctggttttcg aagaatttta sgatttgatg cttaaacttt aaacaaaatt ttttagactt 180 caaatttaaa ttatttttct agtaaatcat tttgaaacct tttttttgaa acttttttaa 240 atttctatgt agatttgtct cacgcaaaat atagatgggg ctcttgcttt tcaaattgaa 300 aatttattat aataattaaa tagtctttca ttgagaaatt tctatttggt cttctttaag 360 tttgtcacgc tgtgacatgt cacgccgtga cactaatagt aacatttttc attctagcgt 420 aagaatattt atttatttta aataaatatt tataaattac ttgtaactaa aatagggcaa 480 taaatatcaa tagatattta aatttaaata tttttagttt ttgtgttatg tgtcttaagt 540 ttatgtcacg ttgtgacgac acggcgtgac attttttaaa attaaatttt ttaaagacta 600 atttttaata atttttctaa taaaaatgtt acgcaaaaaa catgttggtt taacatttgt 660 agtattttta ttgtttttta tttgtggata agttttgaaa cctcctttaa tataactaac 720 attgtttttt tttattaaac ttgtaggtgt acctgtaaat atgtagacat attttactaa 780 cacctttgtt atttagtttt tcttaaattt tgtcgcgcca ttgctgaagt tattctgaat 840 gtttattgtt tatacaattc ttataaaaag taaactttca caaacaaatt gtttatttct 900 tcgttgaaag taatgaaaat gcctctgaca catgctgaac gccagagaaa gtatagagaa 960 agacaaatgc aaaaatttgg tccggatttt ataaaggaaa aggactcgat acggaaaaag 1020 gaaaaacgaa attctaatgt tgaattagca cgtttaaagg acaagttaag aaaaaaaaag 1080 agtagagttt cttcatctaa aataacatct aacatctctc ctgcttacaa atcatcttgt 1140 acacttggta aagctgttaa aaaagcaata cgtgctcttc caacttctcc tcttaaaaaa 1200 gcaactgttg ttaaacaact tgcaattcag tttaacaaag atttaccaga tatgcaacca 1260 acacatggac ctaatgctct taatgatgaa acgaagctat gcgtcattag tttttttgaa 1320 caagatggta tatcccgtca ggcacctgga gtaagagata cgattgtttt aagaagtaat 1380 aaagaaaaaa agaaaatgca aaaaaggtat ttaacaatga atataagtga agcatatcaa 1440 atatttaaaa acgaacacac tgatttttca atcagtaaat ccaagtttgc tagtttgaga 1500 ccaggccata tacttttatc aagcttaatg cctagaaatg tttgttcatg tcaaaaacat 1560 gaaaacatta tcttgtcact caaagcatta catttcatcg atttaaaaat tcccctttat 1620 tcgcatgctt taccaaatac tttagtgtgt gacgagaaaa atgataattg ttggaataat 1680 ctttgtgaag cttgtaagga cggaaagtta tttctcaaaa cataccaatt agatgaaaaa 1740 aaccacgata aagaaattac ctggtacctt tgggagaaag ttacgcaact taatggaact 1800 aagtacctcg aaaaagcaat aaaaaaagac aaagccataa ttttgtataa caatgtttgc 1860 aaaatgctac ccaaatttct tttgcattat tttataaaag gaaaacaatc aatgagttat 1920 gaaaaccaaa aaaaagtttt aaaaagcaat cttgattcag ttttgttaca agttgacttt 1980 gcagaaaact tctcaacttt ctggcaggat gaagttcagt cagcacattg gaacaaaaaa 2040 ttagtaacat tatttactgc tgttagctgg catggtgact cttgtaaatc tgccgttatt 2100 atttctgatg atttgaacca ttcaagaagt tcagttattt attttattga cttgttaata 2160 actaacctta ttagaaagga tgtaaaatac ttgcaagttt ggtctgatgg tcctagcagt 2220 caatttaaaa attgttacat agtagcttca attccatggc ttgaaagcaa gcacaaaatt 2280 aaaatttgct ggaattactt tgcaacctct catggtaaag gaccatgcga tggtatagga 2340 gcaaccataa aaagattagc aacacaaaaa attgttcgaa gagaagctct gattaatgac 2400 gcaacagcat tttataaagc cgtcagagat gaaagtcaag taaacgtttt catggttgct 2460 gctgaaaaaa ttaaaactgc ttttgatggc acagaacttg gaaacatcat taataatgct 2520 tgtaaaacgc cttggaatat ttagtgccca ttgcttaaaa cagtttaatg ataaaatgga 2580 aatgaaacct tattcaagtg catcatactt agttaactag ataaaattta ataaaaaaac 2640 gtagttgaat tttacaagac tttttttgaa attattgtaa ataaactgtt ttgttaaacg 2700 ttattgttat accaatctta taaaaatccg aataaattgg tcattctttt gtttcaactg 2760 tctttttgat gatgttattg tcttaatatt caaagatttt gtactggtca caccgtgacg 2820 tcacggtgtg acrtaaactc cgttgatata tttatgtggt gtctccattt aaatttataa 2880 aacattttta aaatgttatg aaactattat aacacctttt aagtatccaa gacataacaa 2940 aaaaatattt caaaatatcc acaagagctg aactactgtc atttcttcta atgacggaaa 3000 atgctagata atttttgtca cgtcgtgacg caacctttta atgcgtatac ctaaaagtct 3060 ataaaatatt aaaactaaat aattttgttt ttagaaactg taatgtatat gaaaattttt 3120 tagtatcttt aatttttaaa tttttaattt taaatgcgct aaaatctgat gtttttcccc 3180 caaaaaaaag tcaaatgtca cgccgtgaca acgtggaata gccc 3224 // ID BEL1_MH-I repbase; DNA; INV; 8853 BP. XX AC ABLG01000037; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 14.07, Last updated, Version -1) XX DE LTR retrotransposon from northern root-knot nematode: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_MH; KW BEL1_MH-LTR; BEL1_MH-I. XX OS Meloidogyne hapla OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne. XX RN [1] RP 1-8853 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from northern root-knot nematode."; RL Repbase Reports 9(7), 1516-1516 (2009). XX DR Genome; ABLG01000037; Positions 62662 53810. XX CC Positions [4730-5281] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1586..5557 FT /product="BEL1_MH-I_1p" FT /translation="MLPLQRQAQYRALRKYLQKMEVPIKIGNILQNSALVQ FT KPRQKRILLCRETFIFNPANPERKIKGMVFIDCGSQRSYIRKGIAQKLGLR FT SSQKEVQRIQPFPANRSPVIEFQADIFRLGIEKYSYGQLTVEVGQLENLNS FT LITQMPVVLAAEDEREFDLKSGKIPGSIVEPDVLLGIEYFFELNIRRRKKL FT NCGLWLADTAIGPVIGGKGNITNVSSAIDKSNVIITEEEKFEDETLEEKLD FT KLYSLDGIGLGDTVNPKINEKIRKEFEAKLTFQNGRYEAGWLWKEPCPKLP FT SNYRTAKAMLLTQLENKQKSNPHHLALYEETFADQVKKGIIEEIPRNQSDG FT GQIHYLTHFPVVREDKTFTKFRIVFNASGKSRKDNPSLNDCLHQGPVTIQD FT LCGLLMRSRFPTYLIAGDLQAAFLQISLRPPDRDSVRFLWVSDIKKPSEAQ FT TDEDLRIFRYARVTFGVNCSPSILGMTVDHHLSKYDSTTAEKLKKAGYVDN FT FLIGAEENAEVEKDILEARKIFQDAGWNFREIFTNASKQLNDIPFEKLPIE FT ARTSNLCSGAKQKILGISWDTFSDRLTFKLPTPKLKPKQEWTKRQVLASIA FT ECFDPCGIISPTLLKGKWLMQKLWESKMTWDNPLNEELKIEWELIKDEYLS FT CNTIQVTRRLISWKMTKPTVYELCAFSDASDKALGFAVYLRSQNGREIQTA FT LIFAKSRVKPKAQLGKCETKMTIPRMELHAALIATKALDYLEEMLEIKIAK FT KMLWTDSQIVIHWLKSPTEQEAFIERRLRPIRKNTVKYVRTEDNPADLTSR FT GISPNELTSHLTWFSGPQWLRLDENEWPEKVLEYVPGERTEKLNPQPFHEM FT CLMTIANAPLIEHNRFSTFKKLKKTIAYVFRVLEKLAIKNSDPQNELLQKI FT KEITQTHSGQIEPREMELAEKFLLKEAQKMHPPDEDTIQNLQLFKDSEKMW FT RSRGRIEFGNLNEDTKFPIWLPKKAFLTELIIWEMHEKMAHAGTNVVLASL FT RMKYWISRKKVQDSTKKCQSCKKFWIKPYAQPPFPSLPAMRVNQFRPFMYS FT SVDYFGPMTIKLKNAEHIKRWVAVYTCTSTRAVHLEIAEDLSAQAFLKTFR FT RFIAHRKKPVLMISDNGTNFKAAEKILKSIFKETQIQKFASDHEIKWIFVT FT ERAPWKNGLVERMVGLIKYSLRRTVGRRLLEEEEFRTLITEVQTVVNSRPL FT NFIPDSEQVIPIRPIDFLIPFKEGNTELPHLPKADEDDPDYRPQKMTTKEN FT LVEEMRRASDRLNKFWLVWTSDYLLALRERKNMTKKVLSAENSPKAGERET FT TKLFRYEHVSIG" FT CDS 5868..8852 FT /product="BEL1_MH-I_2p" FT /translation="MKKDWWKLKNEKTTSYEAGFLFIDQILDARKPGQIIS FT IIESRSCPVGNLDKDWRPRILKPALALWKAACEWAEEAFSTKPWLPVSLRG FT RENPMKEEGHGLFLKIAEQTERALSFMRLNEYYPRATPILEEVIIIGDNNA FT KLLSEYFQRSYCAAPIPANELKYLFNDVIPLTPAPYFTAILCIGSEVIDEG FT GHPEALKTQLEPLFKKLETFSCTIYILPPPFKPKEVDYWQRIVDNLKKERK FT EKRGLKYIEIETGGFEALKKELINEEGVITDKGVKEVVQKLKEVDTRLAKQ FT KDVPLIKEESDSEMNEERVENNQRYRRTIYVRRPENPQGGRRPNWRGRGQI FT RDMSPIRDRRPNIVNRLSFVSMNLALIVGMALLIILPVSETGEEPMICPHG FT ITSKIYRIGHPTAHCDELLVPERQMIKDVRLSIYRPNEETQEILGFHCLKV FT MEKMTYYLNWIGYKIISPIQRMKIGVTREECLKMIHLEKCFTGTEQSMDGP FT DSSKRTSNLLHPEFSYFSGGKEATVINCFLTKIHIYFSPNQEVIESPLLEL FT EHCRLKDKACLLADQSMLIWNSPTDEQQCNYVKMQKFEGSMYDNWFMSNDQ FT TFSLSITSPKIFTDCGKEMVKTDQLYAITMLEYQKHFVMAKSQLRRSLLAT FT KFSGNKEKIENNLDGTVQVSQFSAGETAQRLTNEIQLKKAYNHLAKVMCQQ FT QSELNFLTSLDATMIARQRLADPYVEAKWIGEDKNQVWIEVWSCYPINWLN FT ITIRPSENECSKNIPVNVTIEEGKSEAMFLDPLKLILRATSEIGSCAKYRK FT QKIKIREAIYTVDQKTGKLNKIEEERIIDTPKGLIWTEKPPKLDAKTFRGH FT ILTKIKDPKKVMLQAFQEFSLSREGEKPHIHGENEFGAQISSSSNHSMWPL FT LDSWEMIKQKMILTVCLIWTLVWTLKLTYYIYLLWLLYNSGQLLEKWKNWG FT KPKNDVVQKTRKNELLIKVEPQQVLEEEELVDKQFSRGGS" XX SQ Sequence 8853 BP; 3301 A; 1614 C; 1782 G; 2156 T; 0 other; aaaagtggcg acccagcaaa ttaaagaact ctactaaaat aaataaattt taatttaaat 60 ttcttcaaaa aattggaatt tattaaattt tctacaaaaa ttatttgcat caaaaaatta 120 attcaacaaa aatcaaagaa taaaaatttg attagtgagt tgcattagtg agttggaaaa 180 aagtgagttt aaacaaaaat tactcattaa aaataaaaat taatttgaac taaattattt 240 gaaaataaaa atattaaaaa tgagccgtcc gttaaaatcc ttgacaaagc gagcggctga 300 aacagtgcaa aaattattag acagaggtgc tccaaaagag cccgatgtca ctgatgaaga 360 ttacgttctt ggtctgtgcg aaacactaga ccagctttcc atagagatag aagaattatc 420 cgaggctcac tcggatttga aagacgtaca ctctaaatgg acgtctctaa tccagagatg 480 cgcccagaca gataaagttg ccaacgaaac tgcctacgac gaaacggcct ctttctacaa 540 aatagaagag cgactaaacg aagcaggaat acgtcttaga gaacttagag agttagaaag 600 acgtcttaaa tcagaaaaaa gaaaagaaga acgtagagaa aggacctcta tgactaatac 660 atctcgtggc caacagcagt tcggtttcag atttaaacca cccaggcaag aaataggaaa 720 attctctgga aagcaaattg attggcctga atggtggcag atttttgaag ctacagtaca 780 cgaaaccgag ggaagtgaag aagtaaaaca cgctgtatta aaacagtgtg tagagggaga 840 agcaaaagcc ctcatcgcgg gcctaaaatt gagcgactat caagtcgcaa tagatttact 900 aaaacagaga tatggctgtg aagaagaata cacgagaagt ttacacacgc agcttgaaaa 960 tctcagaccc tgtcagtttt caggactgcc gtaaattttc catagaagtt gaaagaattt 1020 gtagacttct tgaaaataat ggccaaaaca tctcaggaca aggaacatgg atgagcttag 1080 agaaaaagct caccattcca attttgcgcg aagtacaaac caaaaaagcg ctagcaaaaa 1140 gtctaagaat tgaatgggac actgctgcct tcagaagagc cctcagggaa gtaattgaga 1200 aggaggaact tgtcctctca attcacggaa aatcacctga aaaaggggcg gaggtaaagc 1260 ataacccaaa aggttttagg aactttccaa ataaaagaag aaattccaac gaaaaggaaa 1320 tcaccagcac tttcgctaac acgcaaaaat ttgaaaatca agcgaagaca aataaaaaac 1380 caaataattt cgcagcaaaa aatgccaaag gcgggaggag ccccccaatg tatccatgca 1440 tattctgtgg taaaagtgga cattgggggg acgaatgccg caaaacaaat acagcaaatg 1500 agcgacgcaa gaaattaatt gacttgaaaa gatgcactct atgtataaaa ccagaacatg 1560 gtggaaattg ttataaacca atcagatgct tccattgcaa aggcaggcac aataccgcgc 1620 tttgcgaaaa tacttacaaa aaatggaagt cccgatcaaa attggaaata ttttacaaaa 1680 ttctgcttta gtccaaaagc ctaggcaaaa aagaattttg ttatgtagag aaacatttat 1740 atttaatcca gccaacccag aaaggaaaat aaaaggcatg gtgtttatag attgtggatc 1800 acaaagatct tacattcgaa aaggaattgc tcaaaaacta ggccttcgca gttcccaaaa 1860 ggaagtgcaa aggattcagc cattccctgc caaccgatca ccagtaattg aattccaggc 1920 agatattttc cgacttggaa ttgaaaaata ttcttacgga caattaacgg tggaggttgg 1980 acagctagaa aatctaaaca gcttgataac acaaatgcca gttgtcttag ctgcagaaga 2040 tgaaagggaa ttcgacttaa aatcaggaaa aatccctgga tctatagtcg aacccgatgt 2100 tttgttaggc atcgaatatt tctttgaatt aaatattagg cgcagaaaaa agctaaattg 2160 cggattatgg cttgcagata cagcaattgg cccagtaatt ggaggaaaag gaaatataac 2220 taatgtttcc tccgcaatcg ataaatccaa tgtaatcatt actgaagaag aaaaatttga 2280 agatgaaaca ttggaagaaa aattggataa attgtattct cttgatggaa tagggttagg 2340 agatactgta aatccaaaaa ttaatgaaaa aattcgaaaa gaatttgaag caaaattaac 2400 gttccaaaat ggacgttatg aagcaggatg gttgtggaaa gagccctgcc ccaaacttcc 2460 tagcaattat aggacggcca aagctatgct tctcacgcaa ttagagaata aacaaaaatc 2520 aaatcctcac cacttggccc tatatgaaga aactttcgca gaccaagtga agaaaggtat 2580 aattgaagaa atacctcgaa accaatcgga cggaggacaa attcattatt tgactcattt 2640 tccagtggta agagaggaca aaacatttac taaattccga atagtcttta acgccagcgg 2700 aaaaagtaga aaagacaatc catcattgaa tgactgcctt catcaaggac cagtcacaat 2760 tcaagattta tgcggtttac tgatgcgcag ccgctttcca acttatttaa tcgctggaga 2820 cctacaggca gccttcttac aaataagttt acggccacct gatagggaca gcgttcgatt 2880 tctttgggta tctgacatta aaaaaccctc agaagcacaa acagatgaag acttacgcat 2940 ttttaggtat gcgcgagtca cctttggtgt caattgttca ccctcgattt tgggaatgac 3000 agttgatcac catctgtcca aatatgattc tacgaccgcc gaaaaattaa aaaaggctgg 3060 atatgtagat aatttcctaa ttggtgcaga agaaaacgct gaagtggaaa aggatattct 3120 ggaagcgagg aaaatatttc aagatgccgg atggaatttt cgcgaaatat ttactaatgc 3180 ctcaaaacaa ttaaatgaca ttccgttcga aaagctccca atagaagctc gaacttcaaa 3240 tttatgttct ggcgcaaaac aaaaaatttt aggaataagt tgggacacat tctccgatag 3300 attgacgttt aaacttccaa cgccaaaatt aaagcctaaa caagaatgga caaaacgcca 3360 agtactagca tcgatcgctg aatgtttcga tccttgcgga atcatttccc ctaccctgct 3420 aaaaggaaaa tggctaatgc agaaattatg ggaaagtaaa atgacgtggg acaaccctct 3480 caatgaagaa ttaaaaatag aatgggaact aataaaagat gagtacttaa gttgcaatac 3540 aatacaagta accaggcgtt taataagttg gaaaatgacg aaaccaacag tatatgaatt 3600 gtgcgcattt tccgatgctt ctgataaagc gttaggattt gccgtgtatc tccgttccca 3660 aaatggaagg gagatacaaa ccgcgctaat ctttgccaaa agcagagtaa aaccaaaagc 3720 acagttagga aaatgcgaaa caaaaatgac aataccgcga atggaactcc acgcggctct 3780 aatagcgact aaagctctag attatttaga agaaatgcta gaaattaaaa tagcaaaaaa 3840 gatgctatgg actgactctc aaatagtcat ccattggttg aaaagcccca ccgaacaaga 3900 ggcattcatt gaaaggagat taaggcccat tcgaaaaaat actgtcaaat acgttcgaac 3960 cgaagataat ccggcggacc tgactagtag aggcatctct ccaaatgaat taaccagcca 4020 tctaacttgg ttcagtggac cccaatggtt aaggctagac gaaaacgaat ggccagaaaa 4080 ggtactcgaa tatgtacctg gtgagaggac agaaaaatta aatcctcaac catttcacga 4140 aatgtgtctc atgacaattg cgaatgctcc gctgattgaa cataatcgtt tttctacatt 4200 caaaaaattg aaaaagacga ttgcatatgt ttttagagtt ttggaaaaat tggccattaa 4260 aaatagcgac ccacaaaatg aattgctaca aaaaataaaa gaaataactc agactcactc 4320 aggccaaata gagcctcgag aaatggagtt ggctgaaaaa ttcttattaa aagaagcgca 4380 aaaaatgcac cctccggatg aagatacaat ccaaaatctc caattgttta aagactccga 4440 aaaaatgtgg agaagcagag gtaggattga gtttggaaat ttaaatgagg acacgaaatt 4500 tccaatttgg ctaccaaaaa aggcctttct aactgaatta ataatttggg aaatgcatga 4560 aaaaatggca catgctggca ctaatgtcgt tctagctagc ctgcgaatga aatactggat 4620 ttcgcgaaaa aaggtgcaag attcgacgaa aaaatgtcaa tcttgtaaaa aattctggat 4680 aaagccatac gctcaaccgc catttccttc gctacctgcc atgagagtaa atcaatttcg 4740 gccttttatg tactcttctg tggactattt tggcccgatg actataaaat tgaaaaatgc 4800 cgaacacata aaaagatggg tagcagttta cacttgtact agcactcgag cagtacattt 4860 agaaattgct gaagacttat cggcacaagc atttctgaag acctttaggc gatttatcgc 4920 ccaccgcaaa aagccggtgt tgatgatttc agataacggc acgaatttta aagccgccga 4980 aaaaattctt aaatctattt ttaaagaaac gcaaattcaa aaatttgcct cagaccacga 5040 aataaaatgg atattcgtga ccgaaagagc tccttggaaa aatgggttag tggaaagaat 5100 ggtcggatta ataaaatact ctctgcggcg aaccgtcggg agaaggcttc tcgaagaaga 5160 agaattccgt actttgatca cagaagttca aacagtcgta aattccagac cattgaactt 5220 cataccagac tctgaacaag tgattccgat cagaccaatc gattttttaa tcccatttaa 5280 agaaggaaac accgaactcc cccatttgcc aaaagcagac gaagacgatc ccgattatcg 5340 acctcaaaaa atgacaacta aagaaaattt agttgaagaa atgagaagag cgtccgaccg 5400 ccttaataaa ttctggcttg tctggacctc ggattacctc ttggccttgc gagagcgaaa 5460 aaatatgacc aaaaaagtct tatcagctga aaattcgcca aaggcaggag agagagaaac 5520 aacaaaattg tttaggtacg agcacgtctc gattggataa attcaagcaa caaaattgtt 5580 tggaaaataa aaaagaagta aaaaaggtag agcctgaatt aaaggaagag tcctttaaaa 5640 gagaagttaa aaatggaaag gtcttattac acaatggatg ggcattagtc aacggatgtt 5700 tacctggttc tgataaacac aaaagatgtt ctcagataac attaggaaat gttatagacg 5760 tagctcgaaa agattctgaa gaattaaaaa gagatcatcc agccttccaa cataaaccgg 5820 tccaatcttt aattcaactt ctcctatgct tctggacatt gttaaaaatg aagaaagatt 5880 ggtggaaact taaaaatgaa aaaacgactt cttatgaagc gggttttcta tttattgatc 5940 aaattctaga cgctcgaaaa cccggccaaa taatttcaat catcgaaagt cgctcgtgtc 6000 cagttgggaa cttagacaaa gattggcgcc caagaatctt aaaacctgca ttggctctgt 6060 ggaaagcggc ttgtgaatgg gctgaagaag ctttctcaac aaaaccatgg cttcccgtta 6120 gtttaagagg aagagaaaat cccatgaagg aagaaggcca tggattgttt ttgaaaattg 6180 cagagcaaac agaaagagca ctttcattta tgaggttgaa tgaatactac cctcgagcaa 6240 cgcccattct tgaggaagta ataattattg gtgacaacaa cgccaagtta ttaagtgaat 6300 attttcaaag aagctactgc gcagcaccca taccagcaaa tgaattaaaa tatttgttta 6360 atgacgttat tcccttgaca ccagctcctt attttactgc aattctatgt attggtagtg 6420 aagtaatcga tgaaggaggt catccagaag ctcttaaaac acaattggag cctctcttca 6480 agaaattaga aaccttttct tgcaccatat acatcctgcc tccgccattt aaaccaaaag 6540 aagtggatta ttggcaaaga atagtcgata atttaaaaaa ggagaggaaa gaaaaaagag 6600 gtttaaagta tattgaaata gagactggag gatttgaagc tctaaaaaag gagttaataa 6660 atgaagaggg agtgataaca gacaaaggtg ttaaagaagt cgtacagaaa ttaaaagaag 6720 tagataccag actggccaaa caaaaagatg tccctctaat aaaagaagaa tcagattctg 6780 aaatgaatga agaaagagtc gaaaataacc agcgatacag aaggacaatt tacgtgcgcc 6840 gccctgaaaa tccacaagga ggaaggcgtc caaattggag agggcgtggc cagattcggg 6900 acatgagccc tattcgagat agaagaccta atatagtcaa ccgattatcc ttcgtctcga 6960 tgaatttggc tctaatagta ggaatggccc ttttaataat tctgcctgta agcgaaacag 7020 gtgaggaacc gatgatttgt cctcatggaa taacaagtaa aatttatcgt atagggcatc 7080 ccacggcaca ttgcgatgaa ttattagtac cagaacgcca aatgattaaa gatgttcgac 7140 tctcgattta cagacccaat gaagaaacgc aggagatcct gggatttcat tgtctgaagg 7200 taatggaaaa aatgacttat tatctgaact ggataggata taaaataatc tctcctattc 7260 agagaatgaa aattggcgtt acaagagaag agtgcctcaa aatgatccat ctggagaaat 7320 gctttactgg cacggaacaa tcgatggacg ggccggattc ctcaaaaagg acatccaatt 7380 tgctccatcc ggaattttca tacttctctg gagggaaaga agctacagta ataaattgct 7440 ttttaacaaa aatccatatt tatttttcac ctaaccagga agtaattgaa tctcctttac 7500 tggaattgga acactgtcgg ttaaaagata aagcgtgttt attagctgat cagtccatgc 7560 ttatttggaa ctctccaact gacgagcaac agtgcaatta cgtgaaaatg caaaaatttg 7620 aaggctcaat gtacgataat tggtttatgt ctaatgacca aaccttttct ttaagcatca 7680 ccagtccaaa aatcttcaca gactgtggca aagaaatggt aaagacggat caattgtatg 7740 caatcacgat gttggagtac caaaaacatt ttgtaatggc caaaagccaa ctgagacgtt 7800 cattgcttgc tacaaaattt agcggtaata aagaaaaaat tgaaaataat ttagacggga 7860 cagtacaggt ttctcaattt tcagcaggag aaacggcaca gagactgaca aacgaaattc 7920 agttaaagaa ggcctacaat caccttgcga aagttatgtg tcagcaacag tcggaattga 7980 actttctcac ttccttggat gcaaccatga ttgcgagaca gcgactggcc gatccttatg 8040 tagaagctaa gtggatcgga gaagacaaaa accaggtatg gattgaagta tggtcctgtt 8100 acccaataaa ttggctaaac attacaatta gaccctctga gaacgaatgt tcaaaaaaca 8160 ttccagtaaa tgtcactata gaagagggaa aaagcgaagc catgttctta gatccattaa 8220 agctaatcct cagagccaca tctgaaattg gatcgtgtgc caaatatagg aagcaaaaaa 8280 taaaaataag agaggccata tatactgtag accaaaaaac agggaaatta aataaaatcg 8340 aagaggaaag aataattgat accccaaaag ggttgatttg gacagaaaag ccgcccaaac 8400 ttgatgccaa gactttcagg ggacatatcc ttacaaaaat taaggaccct aaaaaagtga 8460 tgttacaagc cttccaggaa tttagtctga gcagagaagg cgaaaaaccc catattcatg 8520 gggaaaatga gtttggcgca caaatttcta gctcctccaa tcattcgatg tggcccttat 8580 tagacagctg ggagatgatt aaacaaaaga tgattctaac cgtttgtctc atctggaccc 8640 tcgtttggac cttaaaattg acctactata tatacctttt atggttattg tacaattcgg 8700 gtcagctttt agaaaaatgg aaaaactggg gaaaacccaa gaatgacgtg gttcaaaaaa 8760 cccgcaaaaa cgagttacta attaaggtag aaccccagca ggtgctcgaa gaagaggaac 8820 tcgtcgacaa acaattctct cgaggcggga gtg 8853 // ID Copia-127_AA-LTR repbase; DNA; INV; 157 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-127_AA_; KW Ty1_copia_Ele190; Copia-127_AA-I; Copia-127_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-157 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 157 BP; 49 A; 30 C; 27 G; 51 T; 0 other; tgttggcgat gagatctagt tggcgactat caatcatagc aaacatagat tagttgattt 60 ttaaataaaa ttcattccaa gtttgtctct tgtcaagatc agaagtttta ttttcaacca 120 agagtagaat acccggtcta cggccattca ccatcca 157 // ID BEL-41_AA-I repbase; DNA; INV; 6224 BP. XX AC supercont1.245; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-41_AA_; KW BEL-41_AA-LTR; BEL-41_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6224 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.245; Positions 455219 461442. XX CC Positions [5275-5835] - Integrase core CC 'CAGAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(22..3000,3004..6222) FT /product="BEL-41_AA-I_1p" FT /translation="MSNPDKTTSTSKAATPEKSPTGCMCCDRAEDADDCVQ FT CDQCNGWWHMSCADVTASVADRSFTCAHCLPLSVSSRTTTSSARAARLALK FT KKMLEEQQAMEQRHLAEKYKLLEEELQEVEETASNRSRISKRTSQERVKEW FT QQKCAEQSKGAPGVSLTNQSKDSAAVSLAPQDAPHDQKDPEETNPVQVDGP FT SASNAPLPSLGNLRMLSETATKRDEPVSGKFTSTMQQLQQNSIIAEGCPAT FT GFGKPQWNSTAKPFGLRNPNIIGQVQQPYTGAIPKTVPYQQQVVHRSGNVA FT SKMPPVAHQGKPLRDPSHDFSIPHENENSLQQIINQFGTMAPSATDSSFCA FT NMRPFGVSALPQSGTRVYQPSTANPISSTGLVSPPLGLPNNTFMPANPTSS FT FGFVPPSVGLPNVGPSPQPNPAFSIGTVPPPPGQPIANQFTPSPSQLAARQ FT VMSRDLPTFSGDPADWPIFISSFMNSSLACGYNSAENLARLQRCLKGPAYE FT SVKSRLLLPESVPQVIDTLRLLYGRPELLIGALLQKVRSVPAPRAEKLETI FT IDFGIAVRSLCDHLEAAGQQEHLSNPTLLMELVEKLPTHTKMQWADYMQQH FT AVVNLKAFGDFMLGVVTAVSRVTTYAGGNSGGLQRSKHKGNVNAHNSEAEP FT VRQSVQEKERFCVCCKKTGHRVSECAVFKAYTVDNRWKFAQTNGLCRSCLN FT GHGRRSCRNATQCVIDGCQYRHHPLLHSDRSSSQGQRTQVSTMQNHMHRRT FT KHSLLFRIIPIIISGPLATIETFAFLDDGSDLSLIESSLVQQLGIDGWEKP FT LCLKWTGNVTRVEPESKQVQVTVRGVSNKQLFQLNDIHTVKELTLPEQSLD FT YEELSQRYRYLQGLPIVSYEKAVPRLLIGVNNARLTVPQQVREGKKSEPIA FT VKTRLGWCIFGGRGNQASHSLNYHTCECSSDQELHNVVKDYFAMEDAGVTP FT PVMLESEEDKRARAIMEETTIRIGERFETGLLWRYDNIFPDSYNMAVRRFE FT CLERKMLRDPELAANLNKQIAEYQQKGYAHRATEEELVRADPKRVWYLPLG FT VVTNPRKPGKVRLIWDAAAKVDGISLNSMLLKGPDQLTSLPAVLTRFRQYK FT VGVSADIQEMFHQLLIREADRHSQRFLFRSDPLKPLEIYLMDVATFGSTCS FT PASAQYVKNKNAEEFAVLYPRAAEEILENHYVDDYLSSYESAEEAERVSRE FT VRSIHQEGGFKLRNWLSNSSAVLHGLSEEEPKATKNLCLSATESSDRVLGM FT LWQTADDELWFPMKMKEEVQRVIDSRERPTKRQMLKCLMGIFDPLGLLCAF FT LVHGKILLQDVWRANVQWDETVTGKIFDRWVKWTGLFSRVGDLRIPRCYFR FT GATAQMYEHLQLHVFVDASEAAFAAVAYFRVVNAEGEAECSIVAAKSKVAP FT LKPLSIPRLELQAAVLGSRLMSFVQENHSVKVKQRFIWSDSALVVGWLRAD FT HRRYKQYVACRIGEILTTTDVSEWRWVPGKLNPADAATKWGNDPCPAVTDV FT WFKGPEFLRFPEVNWPKQVEASDDPEEELRSCFMIRGTLIPESIVDFARFS FT KWRRLLRAVAYVHRFIGNCRRKRQGEKLELLHLSQEELKKAKNTLMWTAQW FT QEFPDEMALFSSGKTELPKTSSLYRLTPILDEYGVLRVDGRIGAAPNAQFD FT ARFPVILSKTHRVTKLIVDDFHQMFHHGNSETVVNEVRQFYHISRLRVVVK FT QVAAACQWCRVTRAVPKVPRMAPLPVSRLSSFSRPFTYTGVDFFGPFLVKV FT GRSAAKRWICLFTCLTIRAVHVEVAHSLSTPSCIKAIRRFVDRRGAPAEIY FT SDNGTNFQGAEHLLREQLKEIHGNLAATFTNTDTKWIFNPPAAPHMGGAWE FT KMVRSVKSAMQASYNSDRKLDDEGLETLAVEAEGIVNSRPLTYLPLDAAEG FT EALTPNHFLLGSSNGVRQPAVPFSDPASAVKNTWCLIQQQLDVFWKRWIRE FT YLPMLTRRMKWFGEVKPVAVGDLVLIVDETRRNGWIRGKVMGVMTAKDGRV FT RQATVQTARGMLRRPVSKLAVLEVELCGKTGTGGQCYGGE" XX SQ Sequence 6224 BP; 1671 A; 1471 C; 1674 G; 1408 T; 0 other; aaatctttaa gaaatttcgt catgtcgaat ccggataaga cgaccagcac ttccaaggca 60 gccacaccgg agaagagccc caccggctgt atgtgctgcg atcgtgctga agatgcggat 120 gattgtgtcc aatgtgacca gtgcaatggg tggtggcaca tgtcgtgcgc ggatgtaaca 180 gcctccgtgg cggataggtc gtttacgtgc gcccactgct taccgctaag tgtctcctcc 240 cggacaacaa cgtccagcgc tagggcagca aggttggccc tgaagaagaa gatgctggaa 300 gagcagcagg cgatggagca acgccacctc gctgaaaagt acaagcttct ggaagaggag 360 ttgcaagagg tggaggaaac ggctagcaac aggagccgga tcagcaaaag gacgagtcaa 420 gaaagggtga aggagtggca gcagaaatgc gcagagcagt ctaagggcgc gccgggtgtt 480 tcgttgacta atcaatcgaa agactctgca gcagtttctc tggccccgca ggatgcaccg 540 cacgatcaga aagacccgga ggaaaccaat ccggttcagg tagatggccc atcggcatcc 600 aacgcgccgc ttccatctct agggaatctg cgtatgctgt cggagacagc aaccaaacgt 660 gacgagccag tttcggggaa gtttacctcg acgatgcagc aactacaaca aaacagcata 720 atcgcagaag gatgtccagc gactggtttc ggcaagccac aatggaattc gacggcaaaa 780 ccgttcggtc ttcggaaccc gaacatcatt ggacaggttc agcagccgta caccggcgcg 840 atccctaaaa cggttccgta tcagcaacag gtagtacacc gatcaggtaa cgtcgcctcc 900 aaaatgcctc ccgtagcaca tcaaggtaag ccacttcgtg atccatccca tgatttttcc 960 ataccacacg aaaatgaaaa ctcccttcaa caaataatta atcaatttgg aaccatggca 1020 ccatcagcca cggactcctc attctgcgcc aatatgcgcc cgttcggtgt aagcgctctt 1080 ccacagtctg gaacgagagt ttaccaaccg tcgacggcaa atccgatatc atccaccgga 1140 ttagtttccc cgcctttggg actgccaaat aataccttta tgccggcaaa tccgacctca 1200 tctttcggat tcgttccccc atccgtggga ctgccgaatg ttggcccttc gccacagcca 1260 aatccggcct tttccatcgg aactgttccc ccaccgccgg gacagccgat agccaatcaa 1320 tttactccct ccccctcgca actagctgcc cgacaagtaa tgtcgcgcga tttaccgacg 1380 ttttccggcg accccgccga ttggccgata tttatcagca gcttcatgaa tagctcatta 1440 gcttgtgggt acaatagcgc tgaaaacctc gcgcgtctcc agcgttgttt gaaggggccc 1500 gcgtacgagt ctgtcaaaag tcgtttactt ctccctgaat ctgtgcctca agtgattgac 1560 actcttcgct tgttgtatgg acgaccagag ttgttaattg gtgctctact ccagaaagtg 1620 cgtagtgttc cagcgccaag agcggagaaa ttagaaacca tcatcgactt cggcatagca 1680 gtacgcagtc tttgcgatca tctggaggct gctggacagc aggaacacct ttctaatcca 1740 acgttactga tggagttggt ggagaaactg cctacgcaca ccaagatgca gtgggcagat 1800 tatatgcaac aacatgcggt ggtcaacctt aaagcgttcg gcgacttcat gcttggagtc 1860 gtcacggctg taagccgagt tacaacgtac gccggcggga acagcggcgg tctacaaaga 1920 tcgaaacaca agggaaatgt gaatgcccac aacagtgaag cagagccagt tcgtcaatca 1980 gtgcaagaaa aggaacgttt ttgtgtttgc tgcaagaaga ccggtcatcg tgtgtccgag 2040 tgtgctgtgt ttaaggcgta tacagtagac aatcgatgga aattcgcgca aactaatggg 2100 ctgtgccgga gttgtctcaa cggacacgga agaagaagtt gcaggaatgc gactcagtgc 2160 gtgattgacg gatgtcaata tcgtcatcat ccgctgttac attccgatcg atcaagctct 2220 caaggacaac gtacacaagt gtcgacaatg cagaatcata tgcaccgacg aactaaacac 2280 tctcttttgt tccgcatcat tccaataatc atttccggtc cgttagccac catcgaaacc 2340 ttcgcctttc tggatgatgg ctccgatcta tcacttatcg agagcagcct agtacagcag 2400 ctgggcatcg atggttggga gaaacctctg tgcctgaaat ggactgggaa cgttactcga 2460 gtcgagcctg agtcgaagca agtccaagtc acagtcagag gagtgagcaa taagcaacta 2520 tttcaactca acgacattca caccgtgaag gagctcactt tgccagagca gagtctcgac 2580 tatgaagaac tgtctcaacg ttaccgctat cttcaaggtt tgccgattgt cagttacgag 2640 aaagccgttc cacgcctgct gataggtgtg aacaatgcga gattaacggt cccgcagcaa 2700 gttcgagaag ggaaaaaaag cgaacctatt gcggtgaaga ccaggctagg atggtgcatc 2760 ttcggcggcc gtggtaatca ggcgtcgcat tcgctaaact atcacacctg tgaatgttct 2820 agcgatcagg agctgcataa cgtagtgaaa gattactttg cgatggaaga cgcaggggtc 2880 acaccaccag taatgctgga atcagaagaa gacaaacgtg ccagggcaat tatggaggag 2940 acaaccatcc gtatcgggga gcggttcgaa acggggctac tgtggaggta tgacaacatc 3000 tagtttcctg acagttataa catggcggtg agacgatttg aatgtctgga gcgaaagatg 3060 ttacgggatc ccgagttagc agccaacttg aacaagcaga tagcagaata ccagcagaaa 3120 ggctacgcac atagagcgac agaagaggaa ttagttcgag cagacccaaa gcgtgtgtgg 3180 tatttacctc tgggtgtagt gacaaaccca cgaaaacctg ggaaagttcg attaatttgg 3240 gacgcggcgg ccaaagtaga tggaatatct ctgaactcca tgctactcaa agggccggat 3300 caactgacgt cgcttccagc ggttctgaca cggtttcggc agtacaaggt gggagtatca 3360 gcagatatac aagagatgtt tcaccagcta ttgatccgtg aggcagatcg tcattcccag 3420 cgtttcttat tccgaagcga tccgttgaag ccattagaaa tctacctaat ggatgtggca 3480 acctttggct caacgtgttc gccggcttca gcacaatacg ttaaaaacaa aaatgctgaa 3540 gagttcgctg ttctctatcc acgagcagcc gaagaaattt tggaaaacca ctacgtggat 3600 gactatctca gtagctacga gagtgcagaa gaagcagaac gggtgtcacg agaagttcgt 3660 tccatacacc aagaaggagg attcaaactt agaaattggc tatcaaatag ttccgctgtt 3720 ctacatggat tgtctgagga agaaccgaaa gcgactaaga acctatgcct aagtgcaacg 3780 gaaagcagtg atcgggtgtt gggaatgttg tggcagactg cagacgacga gctgtggttt 3840 ccaatgaaga tgaaggagga agttcagcga gtgatcgaca gcagagagcg accaacgaaa 3900 cgacagatgc tgaaatgctt aatgggaatc ttcgatccgc tcggactttt atgcgcattt 3960 ctcgtccacg gaaaaattct actgcaggat gtttggcgtg cgaatgtgca gtgggacgaa 4020 acggttacag ggaaaatatt cgatcgttgg gtcaaatgga ccggtctttt cagtagggtc 4080 ggagatctac gtatcccgcg ttgttatttc agaggtgcaa ccgcgcagat gtatgagcac 4140 cttcaactac acgtcttcgt tgacgctagc gaagccgcct ttgccgctgt tgcatacttc 4200 agggtggtga atgccgaagg agaagcagaa tgctctatag ttgccgcaaa gtcgaaggtc 4260 gcacccctaa aaccactttc gatccctcgc ttggagctac aagcggcagt tctgggcagt 4320 cgcttgatgt cgttcgtcca agagaatcac agcgtaaaag taaaacaacg attcatttgg 4380 agtgattcag cattagtagt gggctggctg cgagcagatc accggcgtta taaacagtac 4440 gtagcgtgcc gaataggaga gattctgaca acaacggacg tatcggaatg gcgttgggta 4500 cctggcaagc tgaaccctgc agatgctgcc actaagtggg gaaatgatcc atgccctgcc 4560 gttaccgacg tctggttcaa aggaccagag tttcttcgat ttccggaagt aaattggcca 4620 aagcaagtag aagcgtcaga tgatcctgaa gaggaactac gatcatgctt tatgatacgt 4680 ggtacactga ttcctgaaag tatcgtagat tttgcacggt tttcaaagtg gaggcggctt 4740 ctgagagcag tagcctatgt gcatcgtttc attggcaact gtaggcgcaa acgtcaagga 4800 gaaaaactgg agttactgca tctaagtcag gaagagttga agaaagcgaa gaataccttg 4860 atgtggactg cgcagtggca ggagtttccc gatgaaatgg cgctgttttc gtcagggaaa 4920 acagaactac ccaaaacgag cagtttgtac cggctgacac cgatattaga cgagtatgga 4980 gtgctgcgtg ttgacggaag gattggcgcc gctccgaatg cgcagtttga tgctagattt 5040 ccagtgatcc tctcaaaaac gcaccgtgta acgaaactaa tagttgacga ttttcatcaa 5100 atgtttcatc acggaaattc agagacagtt gtcaacgaag ttcgacagtt ctaccacatc 5160 tcgcggttga gagttgtggt gaagcaagtt gcggctgcat gtcagtggtg cagagtgaca 5220 agagctgttc cgaaggttcc gcgtatggca ccattgcccg tatctcgatt gtcgtcgttc 5280 tctcgtccat ttacctatac tggtgttgat ttcttcggac cgtttttggt gaaagtagga 5340 agaagtgctg ctaaacgctg gatctgcctg ttcacttgtc tcactattcg cgccgtccat 5400 gttgaggtag cacacagcct atctacgcct tcgtgcatta aagctattcg ccgattcgtc 5460 gatcgccgag gagctccggc tgaaatctat tcagacaacg gaaccaactt ccaaggtgct 5520 gagcacctgt tgcgcgaaca gttgaaagaa atacacggca atctggcagc gacattcacc 5580 aacacggata ccaagtggat tttcaatcca ccagcggcac cccacatggg cggtgcatgg 5640 gaaaagatgg tgcgttcggt taagtcagct atgcaagcat cctacaacag tgacaggaaa 5700 ctggacgacg aaggattgga aacgttggcc gtcgaagcgg aaggcattgt caacagccgc 5760 ccgctcactt acctgcccct agacgctgct gaaggagaag ccctcactcc caatcatttc 5820 ctgttaggga gttctaatgg tgttcgccag ccggcagtac cattcagtga tcctgcctct 5880 gcggtgaaga atacatggtg cctgatacaa caacaattgg acgtcttctg gaagcgttgg 5940 attcgggagt atctaccgat gttgaccagg cggatgaagt ggtttggcga ggtgaaacct 6000 gtagctgtgg gagatctggt tctcattgtg gatgaaaccc ggaggaatgg atggattcgt 6060 ggaaaggtga tgggagttat gactgcgaaa gatgggagag ttcggcaagc tacagtacag 6120 actgcgaggg ggatgttacg gagaccggtt tcgaagttgg ccgtgttgga ggttgagttg 6180 tgtggtaaaa ctgggaccgg tggccagtgt tacggggggg agga 6224 // ID Mariner-26_HM repbase; DNA; INV; 2923 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-26_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2923 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1960-1960 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 511..2511 FT /product="Mariner-26_HM_1p" FT /translation="MPFLFKKKNKRREIPKDVILRALEEVSKGGKIKTTAI FT KYDIPRSNLQRYIKQGGAVKDTSCKYISSQIFTKDEEEKISEYFVTLFKLN FT HGMSKLKARELVYEYATAINKKIPDNWTEKKLASKDWLRGFFKRQPQLSIR FT TPEATSLSRATSFNKKNVGDFFENLKTVYERHRFGPESIYNIDETGLSTVQ FT RTQKVIALRGTKQVGQVTSAERGTLVTVCCGINALGNSIPPFFIFPRVIFK FT TYMLNDAPVGSDGAAHPSGWMTASSFLKYIHHFKKYAKPTPSSPVLLLLDN FT HESHISVPVLDFCIESGIVLITFPPHCSHKLQPLDLTVYGPLKTYYNTAVT FT DWMVSNPGKTVTIYEIPKLAAKAIPLAFKLQNIQTGFEKSGIWPFNSNIFS FT DEDFVCSSITDRYLPXENNVATEKSISDQPIMEQPSTEENLRLEDRTSSNV FT EVTGPSQLESLSKSQTSGTVNVTPDLVRPYPKAGPRKLGGMKRKKGKTRIL FT TDTPEKRLIEIEEEKKQRKNKIKELNKNRKCSKNEKNRKISSDDEIKNVSV FT SDEAFVEIDSSKEDFDVDDVIVLDRNFQVNNFVLVKFDTKKTVYHYVGIVE FT EVNHSMATVKFMRASKIRNTFMFPDIEDVASVPLEDIKTKLPTPSTWIKRG FT SLKYKFDKPLWVIPNLR*" XX SQ Sequence 2923 BP; 1006 A; 508 C; 541 G; 865 T; 3 other; ggggaggtag gggagcattg ggcagttggg aggtttgggc accccctcat ttctcacaat 60 cttttctaga tattttttgt tttttaccat agcattatag ttttataatt ggcaacacyt 120 ttcatcaaaa gtgtcagctg tcgtttcact tggttgtcaa gttacatgcg cttaacagtt 180 tgttgcgttc aaaagtaaat attttgttta tgatttgaag aaagataaca aagaaatctg 240 caagaaattg taagtatctg tgttaacttc ttcttttgtg tttaatgaac attatattta 300 cctgacaatt aggattcaac atttgataaa aataaactta ttcaaatctg tttaaaaagt 360 gaggttatgg ggagcattgg gcacgcaggt aggagggagg tttgggcatg cccaatgctc 420 cctctttttt tagtgtgttt ttttgtcata tgtttgtgtt tttttgctat gactattgga 480 ttgaatggtt aatcatttct tttcaggaca atgccgtttt tatttaaaaa gaaaaataaa 540 cgacgtgaaa tccctaaaga tgttattttg cgtgcattgg aagaggtttc taaaggagga 600 aaaattaaaa ctactgcgat taaatatgat atacccaggt ctaaccttca aagatacata 660 aaacagggtg gcgctgtgaa agatacttct tgtaaatata tatcatccca aatattcaca 720 aaagatgaag aagaaaaaat ttcagaatat tttgtaacat tgtttaaact taaccatgga 780 atgtcaaaac tgaaagcacg cgaattagta tacgaatatg ccactgccat taataaaaaa 840 ataccagaca actggacaga gaagaaactt gcatcgaaag attggctaag gggatttttt 900 aaaagacaac ctcagctttc gataaggacc ccagaagcta ctagtctctc acgtgccaca 960 agtttcaata agaaaaatgt tggagatttt tttgaaaatc ttaaaactgt atatgaacga 1020 catcgttttg gccctgagtc tatttacaat atagacgaaa ctggattatc aacagtacaa 1080 cgaacccaga aagtgattgc tctcagaggc actaagcaag ttgggcaagt aacgtcagct 1140 gaaagaggga cacttgtaac tgtttgctgc ggcattaatg cattgggtaa ctcaatacca 1200 ccctttttca tattcccacg agttattttc aaaacatata tgttaaatga tgcacctgtt 1260 ggttcagatg gagcagcaca tccttcaggg tggatgacag catcaagttt tttaaaatat 1320 atacaccatt ttaaaaaata cgcaaaacca acgccttcat caccagttct attgctgctg 1380 gacaaccacg agagccatat ttctgtacca gttctcgatt tttgtataga atcaggcatt 1440 gttttaataa ctttcccacc gcactgcagc cataaactgc aacccctaga cctgactgtc 1500 tatggaccac tcaaaactta ttataacact gctgtaacag attggatggt gagcaatcct 1560 ggcaaaacgg taacaattta tgaaatccca aaattagcag caaaggccat cccacttgca 1620 ttcaagttgc aaaatattca aacgggattt gaaaaatctg gcatttggcc attcaattca 1680 aacattttct ctgatgaaga tttcgtttgt tcttctataa ctgatcgcta cttgccggra 1740 gagaataatg ttgctacaga aaaatcgatt tcagaccaac ctatcatgga acagccttca 1800 acagaagaaa atctgagact agaagatcgt acwagttcaa atgtggaagt cactgggcca 1860 tctcaacttg aatctctatc taaaagccag actagtggca ctgtcaatgt aacacctgat 1920 ttagtaagac catatccgaa agccggcccc aggaaattgg gtggcatgaa acggaagaaa 1980 ggaaaaacta gaatattgac agacacaccc gagaaaagac ttatagaaat tgaagaagaa 2040 aaaaaacaaa gaaaaaataa gatcaaagag ctcaacaaaa atcgaaaatg ctctaaaaat 2100 gagaaaaaca ggaagattag tagtgatgat gaaataaaaa atgtttccgt ttctgacgaa 2160 gcctttgttg aaatcgattc atcaaaagaa gattttgatg tagacgatgt gattgttctg 2220 gatagaaatt tccaggttaa taactttgtg ttggttaaat ttgacaccaa aaaaactgtt 2280 tatcattatg ttggaattgt cgaagaggtt aatcactcca tggcaactgt caaattcatg 2340 agagcgagta agataagaaa tacgtttatg tttcctgaca tcgaggacgt agccagtgtc 2400 cccctggagg atataaagac aaaattgcct acaccatcaa catggataaa acgtggcagc 2460 ttaaaatata aatttgacaa gcctctttgg gtcataccaa atttaagata agtcttatta 2520 ggtccaacat tgtggagtat tgattccaag taacattgtt taatcttttg taaagtctat 2580 gatatgaaaa aatgtttgtg ttacaatttt ttatagcaat gctatataac acttaagcat 2640 atgagtatag taaaacaata tcgttcatat taaatacagt gcccaaacct ccccataggg 2700 gtgcccaaac ctcccaacca cggggaggtt tgggcacttt ggacacgtgt tagaaaatct 2760 tttgtaactt tgcaactaat tataatacta aaactcttct tagcaataat taatgccaaa 2820 tatgtgtgct agatatattt gaaataaaat tttgattatg ataaaaaata acagatttat 2880 tagcgtttat gtaaaaaagt gcccaatgct cccctacctc ccc 2923 // ID Gypsy-47_AA-I repbase; DNA; INV; 7029 BP. XX AC AAGE02018863; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-47_AA_; KW Gypsy-47_AA-LTR; Gypsy-47_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7029 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018863; Positions 97547 90519. XX CC Positions [3543-4004] - Reverse transcriptase CC Positions [5028-5504] - Integrase core CC 'TACC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 434..2188 FT /product="Gypsy-47_AA-I_1p" FT /translation="MEGLGRSIASVEDLKARFGDLRVDFLNEDELDYELQI FT RGIVCEEGAQMVRKRGNLREALKREKESRVTEHVKYELDSEVDYAFCVEKY FT NQVLDCLVGAKIVPPRCQSSLLHLGVRIERLMPHVDETKQGTLRVFVDKII FT ELLNLYFYQKRDGQRTASVESDLFLTDLLTVPTSFSGHSVPAIVTSSETVR FT EENAVALLDPEDDDLIQCLQRLGCLDPNNSSGDMKKDVRDAMLAFEHELYL FT LRRYKQASQSHCSTKTIFTTPITKTINVTSISSERSPNLTTSVVNILTSPR FT VTTSVSRPTPPTVSSVCVSNTTGLNSNASIPYSRSYGCSGYPVSSSVYAHQ FT SAYPTQFTSAFTGLPANTGWNPNYYGIHPVGFQNYPPVGYYPPHVPFSVPM FT PIISTAPPVSVVPHVSILNPVMSTAWNPSTVSAPIVPPPVPIVPPQPAYTH FT KSLPVSKWKIEKYAGTDQGSNLNEFLVLIQQLSLAERISEQELFDSAFHLF FT TGPALNWFMMMSSQGLLRDWSHMVQELRKSFVHPELDSLVRSKIYQRRQQR FT NESFQDYYYDMEKLFRSMTVQMGDLEKLDVVKRNMRGD" FT CDS 3456..5882 FT /product="Gypsy-47_AA-I_2p" FT /translation="MLRIGIIERCHSDWSLNVVPVIKPTGKVRLCLDARKI FT NERTVKDAYPLPHPGRILGQLPKAKYLSTIDLSEAFLQIPLEVSSRRYTAF FT SIQGKGMFQFTRLPFGLVNSPATLSRLMDQVLGHGELEPRVFVYLDDIVIV FT SETFDEHVNLLKKVANRLRAANLAINLEKSKFGVQELPFLGYLLSTEGLRP FT NPDKIRAVVDYERPTTITKLRRFLGMSNYYRRFIQDFSGVTAPLTDLLKTK FT SKILGWNEQAEDAFCSIKEKLISAPILVSPDFGKEFCIQTDASDVAVAGVL FT TQEQDGSERVIAYFSHKLTTPQRNYHACEKETLAALLAVEAFRGYVEGAHF FT TLITDSSALTHIMTTKWKTASRCSRWCLTLQQYNMTILHRKGRDNVVPDAL FT SRSLAVVTTRSASQWYDKILEEVRKNPDDFVDFRIEDGKLFKFVSTPNLPY FT DHRYEWKYVPPAADRDDIIRDNHDGAMHLGVDKTIDRIRQRYFWPGLAAEV FT REYIQDCTTCKEVKGASVPVVPLMGDQRITKHPWQIIAMDYIGPLPRSKKG FT YQHILVILDLFSKWIMLTPVRKIDSTSLCTIVRDQWFFRNSIPEVIITDNA FT SCFLSKEFKSLLERFNIRHWLNAKYHSQANPVERVNRTVNAAIRTYVKEDQ FT RLWDLKLSEIETILNSSVHSVTKMTPFFITHGHELFIQGSDHKDLPCEQEQ FT TLEQRSGMQRKLLEKIYDVVEKNLKDAHESGKHRYNLRHRQHAKSFEIGQN FT VYYRNMKQSSAVQNYNAKYGSMYLPAQIKSKLGSSSYEIQDSNGKSLGIWH FT ASHLKPA" XX SQ Sequence 7029 BP; 2214 A; 1403 C; 1483 G; 1929 T; 0 other; attggcgatc ctcgtaaaaa aatagatttc aaattgtttg gccataccac acaatttaac 60 tgcactaggg gagactcact ccaccctttt caaagtttgt cgtgattagc ttgatattca 120 atttcatttt tgcaacattc tgagtgatag gacagtccgt ttgatctctt gcaaaaacgg 180 caaaaaagta gccaccgttg gtggcaagcc agaaaaacaa ctattgttct ctggaagcgc 240 gatacaaaaa aatatagatc gcgtggttcg atacgattgc tcatcctttc attcttttga 300 cgagatacgt tctctagtta aagttgtttg tgcgaagtaa aatattttct gattctgctt 360 tggtgcgttt ctcgaacgct ggtcgcatta ttaggttttt ttttaatttt gtttgtgcaa 420 aacatcataa acgatggagg ggctggggag atcgattgct agcgtggaag atttgaaagc 480 gcgctttggt gaccttcgag tcgatttttt aaatgaggat gaactcgact atgagttgca 540 aattagagga atagtttgcg aagaaggcgc gcagatggtt agaaagcgcg gtaacctcag 600 agaagctcta aagagagaaa aggaatctcg ggttacagaa cacgtgaagt atgagcttga 660 ttcggaagta gattatgcat tttgcgtaga gaagtacaat caggtgctcg attgtctggt 720 tggcgcgaaa attgttccac cacgttgtca gtcgtcgttg ctacatttag gcgttaggat 780 agaaaggcta atgccacatg ttgatgaaac aaagcaaggc acgttgagag tatttgtcga 840 taaaatcata gaattgttga atctttactt ttatcaaaag cgtgatgggc aacgaacggc 900 tagtgtggaa agcgacttgt ttcttacaga tttgttaaca gtgcccacgt cttttagtgg 960 acactcagtc ccagcaatcg tcacgagttc agaaaccgtt agggaagaaa atgctgtagc 1020 tcttttagat ccagaggacg atgatttgat tcagtgttta cagaggctag gatgtttgga 1080 cccaaataac agttcaggtg acatgaagaa agatgtacgt gatgctatgt tggcttttga 1140 gcatgaatta tatcttttgc gtaggtacaa gcaggcgtca cagtcacact gcagtaccaa 1200 aaccattttc accacaccta tcacaaaaac gattaacgta acctcaattt cgagtgagcg 1260 atcaccgaat ctcactacat ctgtggtgaa tattttgact tcgcctagag ttactacatc 1320 ggtgagcaga cccacgccac cgacagttag ctcagtttgt gtttctaata caactggctt 1380 gaattcaaat gcttcgatcc catattcaag atcatatgga tgcagcggtt atcccgtctc 1440 gtcgtcggtg tatgcccatc aaagtgctta cccgacccag tttaccagcg ctttcacagg 1500 attaccagca aataccggtt ggaatccaaa ttactatgga attcatccag taggattcca 1560 aaattaccca ccggttggat attacccacc acatgtgcca ttttcggtac caatgccgat 1620 tatttcaaca gcaccacctg tttctgtcgt accacacgtt tcaatattga acccagtcat 1680 gtcgaccgca tggaaccctt ccacagtctc agctcccatt gtaccacctc cagtgccgat 1740 tgttccacca cagccggcat acactcataa gtcactacct gtctcaaaat ggaaaattga 1800 gaaatatgca ggtaccgatc agggcagtaa cttgaatgag ttcttggttt tgattcaaca 1860 gttatcgctt gctgagagaa tctcagagca agagttgttt gattcggcct tccatttgtt 1920 tacagggcct gctctgaatt ggtttatgat gatgagttct caaggacttt tgagagattg 1980 gagccacatg gtacaggaat tgagaaaatc gtttgtgcat ccagagttgg attctctggt 2040 gcgatctaaa atttatcaac gtcgtcaaca aaggaacgaa tcgttccaag actattacta 2100 tgatatggaa aagctcttta ggtcaatgac ggtacagatg ggtgatctcg aaaaattgga 2160 tgtggtaaaa cgcaatatgc gaggggatta aaaaaagtat ttattgtgga aacccactac 2220 gaccttacaa caattggtcg aagctggtcg tcaaattgac gcctcaaatt tttcgcttta 2280 caacaaaatg tttggatcgg aaaaaacgtc caacgttgtt aatgaagtta aatcattcaa 2340 gcaaaacatc aaacctcaga acccacgacc aaatgataaa aaatggtcgg ataaccaaaa 2400 atcaaaacca cctaacagtc ccaataaatc agttaaagaa ccacccaaat cacaaccaaa 2460 accgaaaaga ccttctgagg acactgatcc caaacctcaa gttggaaagg aaaatcctca 2520 agagggaaat cataacaata ttagacctct cagcagttta atcgcggctc ataaacctcc 2580 tcaaatggat cagtgtattt attgccgaat gtcaaatcac gctgttgagc agtgtagatc 2640 gattaaaggt cttttctgta ggatgtgtgg atttaagggg ttcgataccc aaaactgtcc 2700 gttttgtcaa aaaaacgggt ttcaggcgac caaaagtcgc aggtcgtcga cgtcatccgc 2760 gtgaacactt ttgcaaactc ggttacccca tttgactttt gggaaccaac tgtagacgaa 2820 atgtacttta aaccggatga aaacgtgcaa gttattacca cctctattcc gaatgataat 2880 agaccgtatg caaaagtcaa actttacggc catataacca aaggtttact tgattcagga 2940 agcaacaata ccctgataag tgatcgcatt taccacaaat taaataaacc taaattacac 3000 gagttaaaaa aacccggttg atctccggtc ggctagtggc accaagctaa atatacttgg 3060 caagctgtat ataccaattt cttttgacaa ccaagttcgg ataatttcca cattagtaat 3120 tgaaaacttg gtgttggatt gcattctggg gatggatttc tggaaaagat tcggaatttc 3180 tccaaccata caacaatgcg cggtaataga acaagaacaa gaaccccacc ctgacgagtc 3240 aaagtctgca ttgactgaga ccgaaactag tgaactagaa aatgtaaaaa ggctatttaa 3300 agtggcagtg ccagaggaaa tttctttgac acccttgata acccatcgta tagaaataca 3360 agacgactgg aagaaaaaac cagcagttcg tcagtcagta cccttatacg gtgtcaccca 3420 aaattcaaca aaaggtatcg gaagatctgg atcgaatgct aagaatcgga atcatagaaa 3480 gatgccattc agactggtcg ttaaatgtag tccctgttat caaacccacg ggcaaagttc 3540 ggttatgtct tgacgctcga aaaataaatg agcgtaccgt aaaggatgca tacccacttc 3600 cacatccggg tagaatcctt ggacaattac ccaaagctaa gtatcttagt acgattgacc 3660 tttctgaggc atttttacag attcctcttg aagtgtcttc acgccggtat accgctttca 3720 gtatacaagg caaaggtatg tttcagttca ctcgcctgcc ctttggcttg gtaaatagtc 3780 cggcaaccct gtctcggcta atggaccaag tcttagggca cggtgaactg gaaccccgcg 3840 tatttgtcta tcttgatgac atagtcatcg tgagcgagac ttttgacgaa catgtcaatc 3900 ttctcaaaaa ggtggcaaac cgcttacgag cagcaaacct agcaattaat ttggaaaaat 3960 ccaaatttgg tgtacaagaa ttaccgtttt tgggatattt gctgtccacc gaaggtttaa 4020 gaccaaatcc cgataaaata agggcagttg tcgattatga acgtccgaca actataacaa 4080 agttacgtcg ttttttagga atgagtaact actatcgtag gtttattcaa gattttagtg 4140 gcgttacggc tccacttaca gatcttttaa aaacaaaaag caagatcttg ggatggaacg 4200 agcaagccga agatgctttt tgtagcatca aagaaaaact catatctgcc ccaattcttg 4260 tgagtccaga ttttggaaaa gaattttgca tacaaacgga cgcaagcgat gtggctgtgg 4320 ccggtgttct tacacaagaa caggatggca gcgaacgggt catcgcgtat ttctcgcata 4380 aattaacgac cccacaacgc aattatcatg cgtgcgagaa agagacgtta gcggcgcttt 4440 tggctgttga agcattccga gggtatgttg aaggagccca ctttactttg ataaccgatt 4500 cgtcggctct aacgcatatt atgaccacga agtggaaaac cgcctcccgc tgtagcagat 4560 ggtgtctaac attacagcaa tataacatga ccattttgca cagaaaaggg agggataatg 4620 ttgtcccaga cgcgctgtcc cgaagcctag ctgtggtaac aacgcgctca gcgtcacagt 4680 ggtacgataa aatcttagaa gaggtaagaa agaacccgga cgattttgta gatttccgaa 4740 ttgaggatgg aaaacttttt aaatttgttt caactccgaa tttgccttac gatcacaggt 4800 acgagtggaa atatgttcca cctgcagcag atcgagatga tatcatacgt gacaatcatg 4860 acggtgcaat gcatcttgga gtcgacaaga ccattgaccg aataaggcaa cgttacttct 4920 ggccaggtct agctgcggaa gtacgcgagt acatacagga ttgtacgact tgtaaggagg 4980 tcaaaggtgc ttcggtacct gtcgttcctc ttatgggtga ccaaagaatc acaaaacacc 5040 catggcaaat aatagcaatg gattacattg gtcctttacc gcgaagcaaa aagggatacc 5100 aacatattct agtaatacta gatctcttca gcaagtggat tatgctaacg ccggtgcgta 5160 aaatcgacag tacgtcgtta tgcaccatag tacgcgatca atggttcttc aggaattcta 5220 ttcctgaagt gatcataact gataacgcat catgctttct atcgaaagag tttaaatctc 5280 ttttggaacg cttcaatata cggcactggc tcaatgccaa gtaccactcg caagcgaacc 5340 ccgtagaaag ggtgaatcgt acggtaaatg cggcaattag aacgtacgtg aaagaagatc 5400 aacgattatg ggatttaaaa ctatccgaaa tagaaacaat ccttaattcc tcagttcatt 5460 cggtaactaa aatgaccccc ttttttataa ctcacggtca tgaactattt attcaaggat 5520 ctgatcataa agatcttccg tgtgaacagg aacaaacctt agagcaacgt tctggcatgc 5580 aaaggaaatt gttagaaaaa atctacgatg tcgtagaaaa aaatcttaaa gatgcacacg 5640 agtctggaaa acacagatat aatctcagac atcgccagca tgcaaagagt tttgaaatag 5700 gtcagaatgt atattaccgg aacatgaaac aatctagtgc agtccaaaac tacaatgcta 5760 agtatggttc catgtatcta ccagctcaaa ttaagtccaa actaggatct tcttcttacg 5820 agatacagga ctcgaacggg aaatcccttg gaatttggca tgcgtctcat ctgaaacctg 5880 cataatgatc agtaggctaa atatatccta ttgatacaat ttgtctgtta tctcgtctga 5940 aaagcctcga gtaaaactga atcgacgttt cccaatgtca ataaatttca tccgaaagaa 6000 ggatccaagg cctttgtggg cagctaaaaa taaacaatcg agaagaaaaa gtatgttaga 6060 agcagtatgt tcgaaaaccg tgtcacgatt cgcttcaaat ccaatgcaat tttcacattt 6120 ttcttcgtta gttgaaaatg tgtgttgtgt tgttttgatc aagctcgaac tgaagtgatc 6180 aattgtcatt gctttgaaag attgtatctt caattcgaac atgaatattg gctcacatcc 6240 tccacaggcg acccagttcg aatttagcac atatttgagt ttgtttcaag tgagatcaag 6300 tttgtgggtc tgtgttggag gtgtatctct ggattcgctg gagaccggag gtataatccc 6360 gcggatgatc agtaaaagca tccttctgaa ttcatacgga gtcttaggat aaggaacgta 6420 tatgacaccg aatagttgag atcgctttct gttcactcac ggtaaaacat cttgtatact 6480 taattagccc ttgagtatgt ttcactttta gacgattgcg tctatagttt cactaccgtt 6540 agttcaaatt ttgataaaat aggctgtaat tctctttttt ttcttcaatg tacagatagt 6600 agtaagctag gttaaagaag tgcgaatgag tatgaatgaa tgggactgaa tgcataagta 6660 gagggtaccc gcatgagtgt taggtgaatg acagagtgaa tgagattaat aatggaccag 6720 taatcatgaa ccgcacaaag attagttttt ttttctcaaa acttgaggaa gtattaataa 6780 cgctctttga cagagagaca ggttttctca gaaaagggtg aatagaatga gcacataata 6840 tcgcaatcaa cgatgatcta aaaaaagagt atcacaaata cttttaaaca atggaaatat 6900 aataatttaa atgatccgga cctactgcat aaaaattgtt aaaaattttg aatttgaata 6960 tcaaataaca tgacgaatag agaaaaatta cccaatactt gtgtaatttt tccccgagga 7020 ggaggatat 7029 // ID Copia-35_CQ-LTR repbase; DNA; INV; 130 BP. XX AC AAWU01005677; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-35_CQ_; KW Copia-35_CQ-I; Copia-35_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-130 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 374-374 (2011). XX DR GenBank; AAWU01005677; Positions 5065 4936. XX SQ Sequence 130 BP; 46 A; 31 C; 28 G; 25 T; 0 other; tgtaacgaag caacctttga gaaagctgag gtgcgatcga caggccagct tctagttcat 60 tcagttgcga acactcaagc gagcaagacg caaaataaag attccttacg aaaaacccta 120 aaccgagtca 130 // ID Gypsy-9_SI-I repbase; DNA; INV; 4918 BP. XX AC AEAQ01022575; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_SI_; KW Gypsy-9_SI-LTR; Gypsy-9_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4918 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022575; Positions 5343 10260. XX CC 'ATGT' target site duplication CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 1804..3408 FT /product="Gypsy-9_SI-I_1p" FT /translation="MKLQGVIEESHSPWVSPAVLVKKKDGSLWFCVDYRKL FT NAMTKKDSYPMPRIDDLFDYLFGSSWFSTLDLKSGYWQVKLRYQDKEKTVF FT SIGNGLWQFTVIPFGLCNAATFEHLMEKVLRQIINKICLVYIDDVIIFSKS FT FEGQLDNLRKVFFCFRESNLKLNPKKCSFFGRRVYYLGHIILEDGISIDPE FT KIKPIRDWPIPRNTKQLPGFFGCCSYYRKFIKVFSLFAKPLYVLTKNQSKF FT IWNEQCQSAFEKLKQFLLSSLILCFPSEIGEFVLDTDASNHGIGAVLSQKQ FT EEKEKVIAFYSQVLSKSERNYCVTRRELLAIVAFLKNFHHYLYGRKFTIRT FT DHISLRWLLSFKELDGQLARWLEHLQQYDFEIVHRAGKLHGNVDALSRHPC FT AEFSCRFCDKIKSREEELKNNSLGRIVFEESEFENWKKAQLEDQSIVNIYH FT EKEIGVRPSRQELAPNDSSLKVYWILWDSLVIKDGVLFKKWISLDFHTEVL FT QTIVLKKLIRQVLEEAHDSASGGHFGVNKTLAKIRKRFY" XX SQ Sequence 4918 BP; 1511 A; 721 C; 1027 G; 1659 T; 0 other; ttttggggct tcgtccggga tcgagacgga ctatcttttc gtttacgtgt aaacaaactt 60 tgagccatgg cttttgccac atattccacc gagcccacgg aagggtatgt gcacgcgttt 120 ctcaacaatt tctggacgtc ctttgggaga cgaagaacat tcatgcacgc ggccttcgga 180 tgatactgtc gagcgaacaa cggaagatgt cgtcgaaccg acgatgggtg ataaaacgac 240 gggcactgcg gagttactgc gtgctgcttt agaggagctc cgtaagatga aggaggaccg 300 tctcagtttt tagtagcgtg aggaggagct ttgtaaacgg ttacaacaag cagaacaagg 360 ccagtcattc accaatttta actccgattt aattgttgaa aataatatag aatttaaatt 420 aaagccagac gtatatcata gtagtgctcc ggtgcgcgaa tttctcacgc aattccattt 480 acacgctcgt gtgaataatt ggtctgattc ggttaaagta gcgattttgg cttcatgtct 540 aagagggaag gcacgttcgg ttttggatgg aatttcagat gtagaaaatt tacaattttc 600 cgaattaaaa tcgcgtttag aattacgttt tggagaaagt catttaactc aatcatttta 660 cacacaattc gttaatagaa ggcagcgttt tggtgaagat attcccactc taggagctga 720 ttttaaacga ttatctcgtc ttgtttatcc agattgctcc gcgatagttt aagataaaat 780 tgcgtgttct caatttgttg ctgcgctttc agatagtttt gttaagcgga cccttcaact 840 ggagggaata atctctttac gtgcagccgt agaaaaagca atggctatta aaaaaacagt 900 ttttccagat tttcagaaaa atcaaaattt atacagaaaa gtaaagaaag gaaagattct 960 gttaataagt ttttaaaaat tgaaaatttt aaggaaaaag aaaaaatttt tcgaaaaaat 1020 attgaatgct ggcaatgtgg ttccaaagga catttttatt cagaatgtcc attacttacg 1080 ttagcaggga aacagggaaa cttggagtaa ccgagcttta caggctaatt tcgcatggct 1140 cggcgggaga aaaaaaaatt aagaaaaaaa tttctttgaa gtctaatttt atgctaagtg 1200 ataatttttg ttttcctggc tctctggatg gtcaatcttg tacttttgaa atagatacag 1260 gttccgatgt ttccgttctc aacagaaagt ttgttaagga aaataaacag cggattttca 1320 ttaaacattg tacgttgagg tatcctactg gagaagaggt ctccattgat tttaaactca 1380 atgcaaaaat tgtcttaaga aagtactctt tggagttccc catgtttgtt tcagagattg 1440 gggatgattg tattttgggt gcagatttct taattaaaac cggtcatggg gaaatttttg 1500 ctttagcttt tggaaaagat cgtgcagaat taaaaaattt ttcttgtact taataatctt 1560 gaagttccat cttttctgtt aaattgtttc gggaaaaatt cttctgagct caatattgct 1620 gaaaagagac agtttgcaaa ttttttgaag aaatttcaag atatattttc tgaggaagtc 1680 attgcaggaa attgcacagt cgtggaacat tctattaatt tagagaaaaa agatcctatt 1740 aaacagacac caagacgaat accatttcaa ttgcgtcaca aagtggaaaa aattttagaa 1800 gaaatgaaac ttcaaggggt tatcgaggag tctcatagtc cttgggtttc tcctgctgtt 1860 ctagttaaaa aaaaggacgg ttctctttgg ttttgcgtag attacagaaa gttaaatgct 1920 atgactaaaa aagattctta tcctatgcct agaattgacg atctttttga ttatttattc 1980 ggaagctctt ggttttctac attggattta aagagcggtt attggcaagt taaattacgt 2040 taccaagaca aggaaaagac tgtattttcg ataggaaatg ggttgtggca atttactgtc 2100 ataccttttg gtttatgcaa tgcagcaact tttgagcatt tgatggaaaa agttcttcgt 2160 caaattatta ataaaatttg ccttgtttat attgatgatg tgattatttt tagtaagagt 2220 tttgaaggtc agttagataa tcttagaaaa gttttttttt gttttcgaga atcaaattta 2280 aaattaaatc ctaaaaagtg ttctttcttc ggaagaaggg tttattattt gggtcatatt 2340 attttagaag atggaatctc tattgatcct gaaaaaatta aacctattcg agattggcct 2400 attcctcgta acacaaagca attgcccggt ttttttggtt gttgttctta ttacagaaag 2460 tttatcaagg ttttttctct gtttgctaaa cctttgtatg ttttaaccaa aaatcaaagt 2520 aaatttattt ggaatgagca atgtcaatca gcttttgaaa aacttaaaca atttttgttg 2580 tcttcattaa ttttatgttt tccttctgaa ataggagaat ttgttcttga tactgatgct 2640 tctaatcatg gcattggtgc agttttgtct caaaaacagg aggaaaaaga aaaggttatc 2700 gctttttata gtcaagtatt aagtaaatcg gaaaggaatt actgtgttac tcgtcgtgaa 2760 cttttagcaa ttgttgcttt cttgaaaaat ttccatcatt atctttatgg acggaaattc 2820 acgattagaa cggatcatat ttctttacgt tggttgttgt cttttaaaga gcttgatggt 2880 caattggcac gatggttgga acatctccaa caatatgatt ttgaaattgt tcatcgagca 2940 gggaagttac atggcaatgt tgatgcattg tctaggcatc cttgtgctga gttttcttgc 3000 agattttgcg ataagataaa atctagggag gaggagttaa aaaataattc tttggggaga 3060 attgtttttg aagaaagtga atttgaaaat tggaaaaagg ctcagttgga ggatcagtct 3120 attgtaaaca tttatcatga aaaagagatt ggtgtgcgtc cttctcgtca ggagttagct 3180 cctaatgact cttccttgaa ggtttattgg attttgtggg attctctggt catcaaagat 3240 ggagtattat ttaaaaaatg gatttctcta gattttcata cggaagttct tcagacaata 3300 gtactaaaga agttaattcg acaggtgttg gaagaagctc atgattctgc ttctggtggg 3360 cattttggag tgaacaaaac tttagcgaaa attcgaaaac gattttatta ggcttcttgt 3420 aagcaagatg tcaaacatta gtgtaaaact tgtaaggttt gtttttctaa aaaaggtctt 3480 tctaaaaagg ggaaatctcc ttttcagatc tataatgttg gagttccttt tgaaagagtc 3540 caaatagata ttcttggtcc tcttcctacc tcttcgtctg ggaaaaaata tttattagtt 3600 attactgatt gtttttttaa gtgggttgaa gcttttcctg ttggaaatat taaagcgaat 3660 actgtagcaa aaatttttgt taaccaggta gtttctagat ttggcgttcc tttagaaatt 3720 catactgatc aaggcagaaa ttttgaatct cggttatttc aggaactttc tcgtttgttg 3780 ggaatcaaaa aaaattcgta ctactccgct tcatcctcaa tctaatggac aaatggagcg 3840 tcagcatcat gcacttttga atttcttagc aaaatttgtt tcagagaatc aaaaagattg 3900 ggatcgttgg atttttctgg atttgttagc ttataggtct tcaaagcatg aaagtactgg 3960 ttttactcct tttgaattat gccaagatag agagttaagg ttacctttag atctttttcg 4020 tggacgtcct ccagaaaaaa ttcagaatca gagagaggat acgtctctaa acttagagaa 4080 aaattagttt taattcatga tgttgtcaga cagcgtttgg gaatcaaatc tcaaaaattg 4140 aagttgtggt atgatcaaaa agataggcgt ttttcttttg aactaggtca gaaagtttgg 4200 ttgtataacc ctcgaagaat acttggtagg gctcctaaat tacaaagtaa ttgggaaggg 4260 ccttacgaag tcattaagaa aattaatgag gtagtttatt gcgtacggaa atctaaaagg 4320 cataagaaca aggttgttta tttagatcaa cttgctgtat ttcaagacag aaaatctttg 4380 taggggaaga gagatttgtt taaaatattg atagagccta gaatatattt ttttttattg 4440 tttttaagtt ctttaataac ttgaaaaaca cttttcgctg ttaattgtat ttttctttgc 4500 agaagtttgt tttcttcaaa tctttggatt gaagcaagaa gcatcacgac gaaaaggagt 4560 tgattatttt aagttaataa ctcttgatag aagtccgtct ttccctcggt tttttttgaa 4620 ccgtggtttt tccgaaggga cttagggtaa atgccgatac gggaacgcga ggagtctcgc 4680 aagagacgct gtgtccatgg gaaaatttat tttctttctc ccccttgtcc gaagaattaa 4740 gaaagaagtg ctcaagactc aaaatgaaga gtgagaacga gccactcaaa gacaaagata 4800 cgcagctcag aaatcccttt gcagcagctt gggaagagca agaagtcttt cgggagctcc 4860 ctgtcggttg tcgggacaac aacttgaaaa aggggggcag tgttacgacc gtgaatgt 4918 // ID Gypsy6-I_Dya repbase; DNA; INV; 6290 BP. XX AC chrX; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6_Dya; KW Gypsy6-LTR_Dya; Gypsy6-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-6290 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1058-1058 (2009). XX DR Genome; chrX; Positions 20665459 20659170. XX CC Positions [4650-5126] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 759..2255 FT /product="Gypsy6-I_Dya_2p" FT /translation="MPVVNTPPRTSSEGNIENNCALCSTVMDNTNPCYHLA FT CKHVFHKNCVEAWLTENSCCYSCNAPINKKEIKVVVGNCSPFRAQPLPVPT FT APVQRGVTTRNLARALAQLEEANPANIPEPLPLNTSGAPSSVLDASQSTEH FT REGIARHPRGRPRSGRPVRTIDSRRTQAVDYNTIQDMIEQTVTRVLSSLSI FT QHLPPPSNSQGPDMTREQASFSSSSISSRQTVDIMQKWGVHFDGSTDGLGV FT EEFIYRIKALTDKTLGSDFTSMCKNMHVLFTGKARSWYWRYHKQVDGIVWS FT NICASLRQHYKDYRSDFMSMELIRARKQKHGEPFSAFYEAVASLIDKASIK FT IEEEEFIEILKNNLLPETRQKLLYQPVHSVGHLRRLVQMSENLSHEISCRT FT EQVKTKSIPNRRQVFALEDHSSEVEEISPTGAEIAALRRSTEKLKCWNCEE FT IGHIWENCLADRRVFCYGCGRPDTYKPQCQNCLQKQSENRPTGAFNKNRMP FT PKI" FT CDS 2177..3796 FT /product="Gypsy6-I_Dya_1p" FT /translation="MSKLPSKAIGKPTYGSIQQEPNAPENLNPKATLHKPT FT MANCIPMRLRPYEERVKNYAEVRKRLFQDIKPHRNRRTLRLKSFYAEVRRN FT KQLQISTIFNKSSDLRSYAEVSCMGFTELGLLDTGANISCLGEELAQQDFS FT SHPQFKPIKSRAKTASGENQTIIGFLDIELTYNKVRKPLRLYIIPSIKQRL FT ILGHDFWRTFQIAPNIISSVHVPIEDNKLAGENMYPLTSSESQRLAVVKQL FT FPKASERLGRTSLIEHCIDVGNAKPVKQRFYPVSPAVEKLLYAEIDRMLRL FT DVIEPSVSPWSSPMRLVVKPNKIRLCLDARRLNDATKKDAYPLPSIDGIFS FT RLPKANIISKLDLKDAYWQIQLSAESKPLTAFTVPGRPLHQFKVMPFGLCN FT APSTMCRLMDQIIPADLRYCVLGYLDDLCVVSEDFESHSLVLIRLAEEFRK FT ANLTLNMDKSQFGVTSINYLGYVIGSGGICTDPAKISCIVNWPAPKNLKQV FT RGFLGVCGWYRRFIPNFADLTCPITDVLSKKLKFAWPPEASSDG" FT CDS 3792..5402 FT /product="Gypsy6-I_Dya_3p" FT /translation="MDRLKIILTSAPVLTNPDFEKKFYLHSDASDFGIGAV FT LVQLAEDKAEKPIAYMSKKLSRSQRNYSVTERECLAVVLAIEKFRCYLELQ FT DFEVVTDHSSLLWLMRQQNVTGRLARWIFRLQQFKFSFSHRKGKDHVVPDA FT LSRLFENEINALDMAPIIDLSSEAFNEDTYTTLKTKFTNSEEKFPDIKIVD FT KFLYIRTEHADRNEFQDKQSWKLWVPETLRPKVLRQAHDSLTSSHAGMQKT FT IARLRRNLFWPGLAKDVRDYIRSCDACKENKAPNYTLRPPMGEHCPSYRPF FT QRLYIDLHGPYPRSKQGHVGLLIVLDHFSKYPWLQPLKRFTSTAIIDFLLK FT QVFHSVGVPETIVSDNGVQFKAVEFNAFLTELGVNHVYTALYSPQSNAAER FT VNRSLLAAIRTYLKTDQTEWDRNITSISCSLRSALHQSLGCSPYRALYGLD FT MITHGSDYKLLKNLSLLEEPITPISRSDNLAIIRKDIQANIRRAYDRNVRQ FT YNLRAKPISFKEGQEVFRRNFAQSKFAQNFNSKLGPQFVKS" XX SQ Sequence 6290 BP; 1938 A; 1354 C; 1335 G; 1663 T; 0 other; taacttttgg cgcccaacgt ggggcaggcc acggtacgat tacaacgttc agtggtatag 60 atttcctaat ccaatttatt gtcctttatg ggatttccca aggatactga agtgtggaag 120 tagttagatc ggcgtgatct ttctatcgca tttcagaggg aaaagtgcga cctacgacgt 180 tggtattcgg acgcggatat caaaccgaga aacctgaagt gtacaccaac ccatgaattt 240 aacaaaacat cagctattcg gaaatcttgt gtccctaact atcaatgctc ttttgttcag 300 tggtagggtt tgtgcagaat accagtcgct tatttgcaag cgaaagggac gagcacttaa 360 agaatattga tgatttgact ctgatcgaat atcaatactt aaggatcgct cagctggtag 420 agtgttgttt ccacaggccg ctactgctgc aaatccctta aaacacatct gatagctctg 480 tgagttgcga gttgtcagta aaaatacgat caagaagtat ctgttgtctg gaacttgtga 540 ggatgcagct taatgcttcc acttgccgaa cagaaacctg tagaaagtta gtcaacgcac 600 tcatttcaat cgggaaatca gactttcaat aatccaatcc tcaacgctag ttatagatac 660 agcaggctaa tacaataacc tccattttat taagttgcca tcaaatgtca aattcaggat 720 acaaatctaa agaaaaaaaa aaattaagat aaccagcaat gccagtagtg aatacgcctc 780 ccagaacttc ttctgaaggt aatattgaga ataattgcgc tctttgctca acggttatgg 840 ataacaccaa tccttgttat catttagcgt gcaagcatgt ctttcacaaa aattgtgtag 900 aagcctggtt aaccgagaat tcgtgttgtt attcttgtaa tgctccgatt aataaaaaag 960 agattaaagt agtggttgga aattgttcgc catttcgagc gcaaccattg cctgtgccga 1020 cagcgccggt tcagcgtgga gtaactacta gaaacttagc cagagctcta gcccagttag 1080 aagaagctaa tccggccaac ataccggaac cacttccttt gaataccagt ggtgctccgt 1140 cttctgtatt ggatgctagt caaagtaccg agcatcgtga gggtattgct agacaccctc 1200 gtggtaggcc ccgttcgggt agaccagtca gaaccatcga ctcacgtcga actcaagcgg 1260 tggattataa tacaattcaa gatatgatag agcaaaccgt taccagggtt ttatcgtcct 1320 tatccataca gcaccttcca ccaccaagca attcccaagg gccggacatg actcgggagc 1380 aagcctcttt cagttcgtcc tcaataagtt ctcggcagac agtcgacatc atgcaaaaat 1440 ggggagtaca ttttgatgga tctactgacg ggttaggcgt tgaagaattt atatatagaa 1500 ttaaagcgct cacagacaaa actttgggca gtgactttac aagtatgtgt aagaacatgc 1560 atgtattgtt caccggtaag gctagatcct ggtattggag ataccacaaa caggtagatg 1620 gaatagtttg gtctaacatt tgtgcgtcac tccgtcaaca ttataaggac tatcgatcag 1680 attttatgag catggagttg ataagagcac gaaagcaaaa acacggtgaa cctttttccg 1740 ccttctatga ggcagtggct tctttaattg ataaggcatc gataaaaatt gaagaagagg 1800 agtttattga aatcttaaag aacaacctgt tacccgagac taggcaaaag ttgctgtatc 1860 agcccgtgca ttcggtggga cacttacgtc gcctagtaca aatgagtgaa aacttatccc 1920 atgaaatcag ttgccggaca gaacaagtta agacaaaatc aattccaaat cgccgacagg 1980 tttttgcgtt agaagatcat agctcggagg tagaagaaat ttctccaacg ggtgcagaaa 2040 ttgctgcttt acgtagatct acggaaaagc taaagtgttg gaactgcgaa gaaataggtc 2100 acatctggga gaactgctta gcagatcgtc gagtattttg ctacggctgt ggcaggccag 2160 acacctacaa gccgcaatgt caaaattgcc ttcaaaagca atcggaaaac cgacctacgg 2220 gagcattcaa caagaaccga atgcccccga aaatttaaat ccaaaggcca cattacataa 2280 gccaacgatg gctaactgca ttccgatgcg attgcgccca tatgaagaac gggtcaaaaa 2340 ctatgctgaa gtaaggaagc gtctatttca ggatatcaag ccgcaccgca atcggcgcac 2400 acttagatta aagtcatttt atgccgaggt taggagaaat aaacaattac aaatttccac 2460 tatctttaac aaatcgagtg acttgcgaag ctatgctgaa gtatcgtgca tggggtttac 2520 agaacttggt ctgttggaca caggagcgaa cattagttgt cttggagaag agctagcaca 2580 acaggatttt tcaagtcatc ctcaatttaa acctatcaaa tcgagagcta agacagcaag 2640 tggtgaaaat caaaccataa tagggttcct agacatcgaa ctaacctata ataaagttag 2700 aaaacccctg cgtttgtata taattcccag tatcaaacaa agacttattc ttggccacga 2760 cttttggaga actttccaaa ttgctccaaa tattatatca tccgtacacg taccaatcga 2820 agacaataag ttggccgggg aaaacatgta tcccttaacg tcgtctgaaa gtcagcgtct 2880 agcagtagtg aagcaattat ttccaaaagc ttcagagaga ttaggtcgca cctcacttat 2940 tgaacattgc attgacgtcg gaaatgcaaa gcccgttaaa cagaggttct atccagtcag 3000 tcccgcagta gagaaattac tgtatgcaga aattgatcga atgctgcgat tagatgttat 3060 tgagccttct gtaagcccat ggagttctcc aatgcgtctg gtcgtcaaac ctaataaaat 3120 tcgactttgc ctggatgcac gcaggcttaa cgatgctacc aagaaggatg cttatccgtt 3180 gcccagcatt gacggaatat tttcccgatt gcccaaagct aacatcatat ctaagctcga 3240 tttaaaagat gcttattggc aaatacagct cagtgctgag tcaaaacctt taactgcatt 3300 cacggtgcca gggcgaccgt tacaccagtt taaggttatg ccattcgggc tttgtaatgc 3360 accatctacg atgtgtcgct taatggatca gatcattcca gccgatctgc gctattgtgt 3420 cttgggatac ttagacgatc tttgcgttgt gtccgaagat tttgagtctc actccttagt 3480 actgattcga ttggccgagg aatttcgcaa agccaatctt actcttaaca tggacaaaag 3540 ccagtttgga gtaacaagca tcaattattt gggttacgtc attggtagtg gaggtatttg 3600 tacggaccca gctaaaatct cgtgtatagt aaattggcca gctcccaaaa atctgaaaca 3660 agtaagggga tttcttggag tttgtggctg gtaccgtcgg ttcataccca atttcgcaga 3720 tctcacatgt ccaatcacag atgtgctttc taaaaagtta aaatttgctt ggcctccaga 3780 agcaagtagc gatggataga ctaaaaatca tattgacctc tgcaccggtt ttaacaaacc 3840 cagactttga gaagaagttt tatcttcaca gtgacgccag tgatttcggc atcggcgcag 3900 tgttggttca actagcggaa gataaagctg aaaagcccat tgcctatatg tcaaaaaaat 3960 taagtcgatc acaacgtaat tatagcgtca ccgaacggga atgtttagca gtggtattag 4020 ccattgaaaa gtttcgctgc tacttagaac tgcaagactt tgaagtagtt accgatcatt 4080 ctagcttatt atggctcatg cggcaacaga atgtaacagg aagactggct aggtggattt 4140 ttagattgca acagttcaaa ttttcattct cgcatcgaaa aggaaaagat catgttgtcc 4200 ctgacgcttt gtctagacta ttcgagaacg aaataaatgc attagacatg gcccctataa 4260 tcgaccttag ctcggaagcg tttaacgagg acacttatac gacgcttaaa actaaattta 4320 cgaatagtga agaaaaattt cccgacatta aaatagtaga taaatttttg tacattcgta 4380 cagaacacgc tgaccgaaat gagtttcaag acaaacagtc ctggaaactc tgggtgcccg 4440 aaacgttaag accgaaagtg cttcgacagg cacacgatag tcttacgtcc tcccatgccg 4500 gaatgcagaa gaccatcgca aggctacgcc ggaatctatt ttggcccggg ttagcaaaag 4560 atgttagaga ttacattcga tcctgtgacg catgtaagga gaacaaagcc ccgaattata 4620 cgttacgtcc acccatgggt gagcattgtc catcgtaccg cccatttcaa cggctatata 4680 tagacttgca cggaccctat ccccgtagta agcaaggaca tgttggattg cttatagtat 4740 tagaccactt tagcaaatat ccttggttac aacccctaaa aagatttacc tcaacggcaa 4800 tcatagattt ccttttaaaa caggtctttc atagtgtagg tgttcccgaa accatcgtaa 4860 gcgacaatgg agtccaattt aaagcggtgg aattcaatgc ctttcttacg gaattaggag 4920 taaaccacgt ctataccgca ttatactcac cacaaagtaa tgcggccgaa agagtaaatc 4980 gttctctatt agctgccatc cgcacttact taaaaaccga ccaaaccgag tgggacagaa 5040 acatcactag tatctcatgt tcattaagat cagcgttgca tcaatcgtta ggatgttcac 5100 cataccgtgc gttatacgga ttagatatga ttacccatgg atctgattat aaactgctaa 5160 aaaatttatc cctattggaa gaaccgataa caccgatttc ccgatctgac aatttagcca 5220 tcattcgaaa agacattcag gccaatattc gtcgtgccta tgatcgcaat gtgcggcaat 5280 ataacctacg cgcgaaaccc atttccttta aggaaggaca ggaagtattc cgtcgaaatt 5340 ttgcccaaag caaattcgcg cagaatttta actcgaaatt aggcccacaa tttgtaaagt 5400 cctgaatcaa gaaaaaacta ggaaattgtt attaccagct tgaaaattta caaggaaaag 5460 aggtaggaac gtatcatgca aaggacattc gaagctaatt cccgctaata ttgcatctaa 5520 cgaagaaata ttatcgggtg gtgtaatggt cggcatggta agcgaaaaga aattgcaaag 5580 aaaaaaaatt ataaatgcaa tgttggccac ttacaactga gaacttgact ctagggatgg 5640 gttagttcat tcagtgggta tgcttttagg ctgtggtcga taacctcagt ctaatcatcg 5700 attaacatgt ccgaaaacgt tgcccaacac tgagagcgtc ttccaacttt ccaatagaag 5760 cacagctata attattggaa tttggtggta tataaatgcc cacgcggcta cagtcccgct 5820 cttttcgtaa ccagaaacac attatgctgc taaatcattg tgcaagtact cttaatagag 5880 ttaaagatta agcgccacta cttacaagcg tcgtctgatc ggtgacgtga tctgatcaca 5940 tactggaggg aaagaacgaa actgccccaa aattttaagt gacaactggg ttcttcgcgg 6000 tggttgcctc gccttggagc cccaaccacg tgatttgcat aaccccctcg gagggaattc 6060 tcccaagaga ataggtatgt cgccgacttg cactagtgcc cccaactaaa acccgctttg 6120 ttggcagtat caagttttca tctggccacc gggcctctgc aaatactggc cggtcggagg 6180 gattctgacc accaactagc gttattatac ccctggctac ggataactga agcaattgtc 6240 tttctgggag ctacgccaac gtcccgaagc cttaaccttc gattaaccac 6290 // ID Copia-1_BM-I repbase; DNA; INV; 4037 BP. XX AC nscaf2830; XX DT 19-MAR-2010 (Rel. 15.04, Created) DT 19-MAR-2010 (Rel. 15.04, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_BM_; KW Copia-1_BM-LTR; Copia-1_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4037 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(4), 583-583 (2010). XX DR Genome; nscaf2830; Positions 424200 428236. XX CC Positions [1464-1964] - Integrase core CC 'GTAAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 159..4037 FT /product="Copia-1_BM-I_1p" FT /translation="MGPNMFNLEKLVGRENFATWKFSMKAYLEHEDLWCCV FT ECHDGKPVDAVKDIKAKSKLILLLDPQNYVHVQDCKTVKQVWENLQKAFDD FT NGLTRRVGLLKDLINTTLDSSNRVEDYVSKIMNTAHKLRNISFDVNDEWLG FT TLMLAGLPEEYKPMIMGLESSGIKISADSVKSKLIQEVNTSKSAAFYTNSR FT KPRNNTSSIKTKGPRCFNCNRNGHFAKYCKLPKKQTPNKGESPSFVAAFSA FT TVSKNVSNEWYIDSGASMHMTSRIDWMYNVTESSVKNITVANREPLAVNGV FT GCVNIQLSQENKIQVKNVLFVPGLATNLLSVGAIVKNGYKVTFNDKGCEVG FT NSKGEVICSAKLKNSLYVLETCKEVAHLVSRNKNSNNTYLWHLRMGHLNMS FT DVKKLPTCSEGVTLIPDVNDVTCTHCMEGKQARLPFKNTGTRATRPLELIH FT SDLCGPMENISFGGMKYFITFTDDFTRMVHVYFLKDKLKILEIFKDFKLKV FT ENELNYKIKRLRTDNGKEYCNYNFEKYLSNHGIIHQTSTPYTPQQNGLAER FT MNRTLVERARCMLFYANLEKKFWTEAVATAAYVVNRSPTKSLEGKTPYELW FT KGKKPNLSHLKIFGSEAMVHVPKEKRHKWDKKSVKMIFMGYCDSSKGYRLM FT DPKTLKIIKSRDVIFIENVETSIPIEDNRTVTVSSVPSETEDSENSSVSRE FT QEIHTQESSATAPLMSSKDSNIRKGDSEDSSEYDTDTGELDATYHPPYPIR FT QDYSSSITLRQRNKQCDTQDGKTSLLCLYSDLTDPQTVEEVLTSSQAAEWK FT RAMDEEYSSLMKNKTWSLTELPPGKKALPCKWVFKIKTDQNGNVTRYKARL FT VIKGYAQRKGTDYEETYSPVVRYTTIRYLFALTAKYGLDIDQMDAVSAFLQ FT GEIDTEIYMVQPEQYKQGSKVCKLHKSIYGLKQASRLWNLKLNCVLHELGM FT QQSRTDPCIYYNVEKNTFIAIWVDDLILLTSQKKTKNLLKEKLKEHFEMKD FT IGSTSYCLGLQITRDGGKVMIDQEKYIKEMLARFKMSDCKPVRSPFEVGMK FT FNEKKEEDELIDCPYQQAIGSLLYLAQGTRPDISFAVNTMSRFNKDYTAAH FT WTAVKRIFRYLQGTKDFKLVYTKDGNENITGYCDADWASDVRDRKSSTGYV FT FMLQDGAISWRSQKQQTVALSTAEAEYMSMSSAAQEALWLQQLHTELGQQK FT NNPLIIFCDNQSAIKLSNNYCYLPRSKHIDIRYHFLKDHVSNLDIKYCYIK FT GEEMVADNLTKGTSVDKHLYCLTKMGLHSKGG" XX SQ Sequence 4037 BP; 1422 A; 646 C; 850 G; 1119 T; 0 other; tggtagcaga gcgtggttca gttgtgttta gttgtggtat agctggaaat aataataaag 60 tggacaaaac ttcaatccga aaaaattgtc gttgcaattc aaattgaagg gtaggatcgt 120 tcatcaaaaa ttgaatattc ctcgtgaaga taagcaagat gggcccaaat atgtttaatt 180 tagaaaaact ggtaggaaga gaaaactttg ccacgtggaa attcagcatg aaagcatatt 240 tggaacatga agacttgtgg tgttgtgtcg aatgccacga cggcaagccg gtggatgctg 300 taaaggatat caaagcgaaa tcaaagctga tcttactact ggatccgcaa aattatgtcc 360 atgttcaaga ctgtaaaacc gtcaaacaag tgtgggaaaa tttgcaaaag gcatttgatg 420 ataatggttt aacaagaagg gtaggtttgc ttaaagattt gatcaacacc acattagatt 480 caagtaacag agttgaagat tacgtaagta aaataatgaa cacagcacac aagttacgta 540 atattagttt tgatgtcaat gatgagtggc ttggcacctt gatgttggct ggcttaccag 600 aagaatacaa acccatgatt atgggcttag agagctctgg cattaaaata agtgctgatt 660 cagtaaaatc taaacttata caagaagtta atacatcaaa atcagcagcc ttctatacaa 720 attccagaaa acccagaaat aatacctcat cgataaaaac aaagggtccg agatgcttta 780 actgcaatag gaatggacat ttcgctaaat attgtaaact accaaagaaa caaacgccaa 840 ataaaggaga aagtcctagt tttgtggcag ctttttcagc tacagtcagc aaaaatgtaa 900 gtaatgaatg gtacattgac tctggtgcat ccatgcatat gacaagcaga attgattgga 960 tgtataatgt aactgaatca tctgtaaaaa acatcactgt ggctaacaga gaacccttag 1020 ctgtgaatgg agttggctgt gtgaacattc agctaagtca agaaaataaa atacaagtta 1080 aaaatgtttt atttgtacct gggctggcta ctaatcttct ctcagttgga gctattgtga 1140 aaaatggata taaggtcaca tttaatgaca aaggatgtga agttggaaat agtaaaggtg 1200 aagtaatatg ctctgcaaag ctgaagaata gtttgtatgt attagagacc tgtaaggaag 1260 tggcccactt ggtctcaaga aataagaata gcaataatac ttacctgtgg cacctgcgga 1320 tgggtcattt gaatatgtct gatgtaaaga agctacctac atgctctgag ggagtgactc 1380 ttattccaga tgtaaatgat gtcacttgca cacattgtat ggaaggtaag caagctaggt 1440 taccttttaa aaatactgga acaagagcta cgagacctct agaattaatc cattctgact 1500 tgtgtggtcc tatggagaac atttcatttg gaggtatgaa atacttcatt actttcactg 1560 atgatttcac acgaatggtt catgtctatt ttcttaaaga caagttgaaa atattagaga 1620 tttttaagga ttttaaactg aaagtagaga atgaattaaa ttataaaatt aaaaggcttc 1680 gcacagataa tggcaaagaa tactgcaact ataattttga aaaataccta tctaatcatg 1740 gtataattca tcagacctct actccgtata ctccacagca gaacggcctg gccgagagaa 1800 tgaacaggac tttagttgaa cgcgcaagat gcatgttatt ttatgctaac ttagaaaaga 1860 aattctggac tgaagcagta gcaactgctg cttatgttgt gaatcgttcc ccgacaaagt 1920 cattagaagg taaaactcca tatgaattgt ggaaaggtaa gaaacctaac ttatcacatc 1980 taaaaatatt tggctctgaa gctatggtcc atgtgccaaa ggaaaaacgc cacaaatggg 2040 ataaaaaatc agtaaaaatg atatttatgg gttattgcga cagttccaaa ggatatcgtt 2100 taatggaccc taaaacactt aaaataatca aaagcagaga tgttattttt attgaaaatg 2160 tggaaacttc tattccaatt gaggataatc gcacagtcac tgtcagttct gttccttcag 2220 aaactgagga ctcagagaat tccagtgtca gccgggaaca agagatacac actcaggagt 2280 catctgcaac tgcaccttta atgtctagta aagactcaaa tatacgaaaa ggtgatagtg 2340 aagactcaag tgagtatgac actgacactg gggaactaga tgccacttat cacccgcctt 2400 atccgataag acaagactat agtagtagta taacattaag gcaacgaaat aaacaatgtg 2460 atacacagga tggcaaaacg agcctattgt gtttgtattc agatttgact gatccacaaa 2520 cagttgagga ggtactgaca tcttcacaag ctgcagaatg gaagagagcc atggatgaag 2580 aatatagttc ccttatgaaa aataaaactt ggtcattgac agaacttcct ccaggtaaaa 2640 aggctctccc atgcaagtgg gtattcaaaa ttaaaacaga ccaaaatgga aatgtcaccc 2700 gttacaaggc aagacttgta ataaaaggct atgctcaaag aaagggcacc gattatgagg 2760 aaacatactc tccagttgtc aggtacacaa ctattcgcta cttatttgct ttaacggcta 2820 aatatggatt ggatatagat caaatggatg ctgtatcggc ctttctacag ggagaaattg 2880 atactgagat ctatatggtg cagccagaac agtataagca agggtctaag gtatgtaagt 2940 tacataaatc tatttatggt cttaaacagg ccagtagact gtggaacctg aaattaaatt 3000 gtgtactaca tgagttaggt atgcaacaat caagaacaga tccttgtatc tattataatg 3060 tagagaaaaa cacgtttata gctatctggg tcgatgattt aattttgttg acttctcaaa 3120 agaaaacaaa aaatttgttg aaagaaaagt taaaggaaca ctttgaaatg aaggatattg 3180 gttcgacaag ctattgctta ggtttacaga ttaccagaga tggaggtaaa gtaatgatag 3240 atcaagaaaa atatataaaa gagatgttag caaggttcaa aatgtctgac tgtaaacctg 3300 taagatcacc ttttgaagtt ggtatgaaat tcaatgagaa gaaggaagag gatgaattga 3360 ttgactgtcc atatcaacag gcaattggct ctttgttgta tttggctcaa ggtacccggc 3420 cggatatatc atttgcagta aatacaatga gcagattcaa caaggattat actgctgcac 3480 attggacagc tgtgaaaaga atttttagat acttgcaggg tacaaaagat ttcaaattag 3540 tttataccaa agatggtaat gagaacatca caggatattg tgatgccgac tgggcgagcg 3600 atgtacgtga tcgtaagtcg tccactggtt acgtatttat gttacaggat ggtgctatat 3660 cttggcgctc tcaaaagcaa cagactgtcg ctctctccac tgctgaagcc gagtatatgt 3720 cgatgtcgtc agcggcgcag gaagcgttgt ggcttcaaca attacatacc gagctcggtc 3780 aacagaagaa caaccctctc atcatctttt gtgataatca gagtgctata aaattatcaa 3840 ataattattg ttatttaccc agatcaaaac atattgatat tcgttatcat tttctcaaag 3900 accatgtcag taacttagat ataaagtatt gttatataaa gggtgaggaa atggtagctg 3960 ataatttaac taagggtact agtgttgata aacatttgta ttgtttaacg aaaatggggt 4020 tgcattccaa gggaggg 4037 // ID BEL-9_SI-LTR repbase; DNA; INV; 266 BP. XX AC AEAQ01023320; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_SI_; KW BEL-9_SI-I; BEL-9_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-266 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023320; Positions 339 74. XX SQ Sequence 266 BP; 78 A; 58 C; 51 G; 79 T; 0 other; tgttctgatt ggacgttgat tataattttg ttagtgagaa aaatgttcct agcgagcgtg 60 gcgctgttag cgtaacttag ggtactatac tcttaatggt tcatgaccga tctctctctc 120 ttcgcctagc tacggacgtg tgtgccacga aagagcaagc tctcctgtaa actataataa 180 agcaagttgt aatcaaccca agaagcctac aagattaatt cgtctaatcc cacggcttca 240 ctacagtcaa acacaattaa tttaca 266 // ID L1-39_AAe repbase; DNA; INV; 4436 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-39_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4436 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1392-1392 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 148..1182 FT /product="L1-39_AAe_1p" FT /translation="MGRRENTFRIDFSSVPKKPDVAKVYKFCATVLGLQPC FT DVLRIQNSRALGVTFVKVVDLTLAQRICEEHDNQHEISVDGKKYPLRITME FT DGAVEVRLYDLSEDVSDEKITEFLENYGDVIEIREQRSGIDVDFPGVLTGV FT RIVKMIVKQNIESWVTIDGELTAVGYFGQRNTCKHCRDYVHVGLTCVQNKK FT LLVQKSYADAAKQMSHTSPKAAHITIAAGTNQGNKTKNQTASKANKTNPDS FT VQAESMPPPKPIPQQKPQQPPPSIASEQNFPPLGGSVTATTGQQPSASCGM FT FRRSASGTSDGNETDSSVGSAASKRQLRVRPPGKKPRVEENQENKREGGYE FT AI" FT CDS 1185..4361 FT /product="L1-39_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MAEFTSYNLATVNINNITNATKIEALRTFIRTLDLDI FT IFLQEVENDQLSLPGFTVICNVDHARRGTAIALKDHIRFSHVEKSLDGRLI FT ALRIQNTTLCNVYAPSGTAARAERERFFNGTIAYYLRFNTQHVILAGDFNC FT VLRQGDSTSPNTSPALQATVQQLRLHDVWLKLYPNTPAPTYITHNATSRLD FT RIYVSTGLCEHLRNAFTHVCAFTDHKALTARICLPHLGREQGRGFWSLRPH FT LLQAENIEEFEIRWQFWTRQRRNFPSWLAWWLSYAKPKIKSFYRWKSREAF FT NIFNAEHQRLYGQLRIAYDGYYQNPGMISTINRVKAQMLTLQRQFAQNFIR FT INETFVAGESLSTFQVGDRRRKKTVITELQNEQGDQLRGSLEIEQHMFRYF FT RNLYAREETRQDAEERFETERAIPANDEDNIAVMAEITIDELKSAIKTSQK FT RKSPGSDGLPAEFYQRSFDVIYRELYFVMNEAMTNELPPEFADGTIVLIKK FT RGGDGSARGYRPISLLNVDYKILSRVMKARLEKILQTHRVLSDAQKCANTE FT RSIFQATLSLKDRVAQLIARKQRAKMISFDMDHAFDRVSHEFLHRTLLSLG FT LNPDFVDWLSRVANASSSRLLINGRLSQPFRIERSVRQGDPMSMLLFVVYL FT HPLLTRLERVCGGDLCVAYADDITVIVTATSTINRIFGLFSCYELVAGAKL FT NMQKTIAIDVGFINGDPLTIPGLQTAETVKVLGVIFTNSVRQMVKLNWDSL FT VSRIAQIIWMHSMRSLTLQQKVVLLNTFISSKVWYLASIVPPQCAHTAKLT FT ASMGTFLFRGLPARIPMFQLARSRDNGGLKLHLPAIKCKALLINRYLKEID FT STPFYHSQLAQNIAPTTEFPCLKLLSQQIPLLPHHIKQNPTSEQIHLFYLD FT QIDLPRVERNEPGQNWKRIWRNISSKRLSSKQRSDLYLLVNEKIEHRRLMS FT IMGRANDPNCQHCVVAIETLQHKFCECPRVAQAWSLVQRKITTILGGWQRL FT TFNDLYRPALSNIVEVNRNKSLKIFIQYINYVNESVNNRIDVDGLDFMLNL FT AD" XX SQ Sequence 4436 BP; 1316 A; 1089 C; 992 G; 1039 T; 0 other; cagtgagcgc tcaacttccg agctgaatag acgcgttttt gctatcgttc cgtggaagtc 60 gaacccaaac caaattataa atttcgtttt cggttatttt tctcgtcgct aagccatcgg 120 tgcctatgtg cataccgcaa ctccgtcatg ggccggcggg agaatacatt tcgcatagac 180 ttttcaagtg tgccgaagaa accagacgtt gctaaagtct acaaattttg tgcgacggtt 240 ctgggtcttc aaccttgtga tgtgttgcga attcaaaaca gcagagctct cggtgttacg 300 ttcgtgaagg ttgtcgatct aacgctggcg caacgtattt gcgaagaaca cgacaaccaa 360 cacgaaatat ctgtcgatgg aaaaaagtat ccattgcgga taacaatgga ggatggagcg 420 gtagaggtga gactctacga cttgtccgaa gatgtgtccg atgaaaaaat caccgaattt 480 ctcgaaaact acggtgatgt gatcgagatt cgtgagcaac ggagcggaat cgatgtcgat 540 ttcccaggtg tcttgaccgg cgttcgaatc gtcaagatga ttgtgaagca aaacattgaa 600 tcgtgggtta cgattgacgg agagttgaca gccgttggct atttcgggca gcgtaacaca 660 tgtaagcatt gccgggatta tgtgcatgta ggactaacct gcgtgcaaaa taagaaactg 720 cttgtgcaaa agtcttatgc tgatgccgcc aagcagatga gccacaccag ccccaaggca 780 gcccacatta cgatcgccgc tggtacaaac cagggcaaca aaacaaagaa ccaaaccgct 840 tccaaagcga acaaaactaa ccccgactcc gtgcaggcag aatcaatgcc accgccgaaa 900 ccaatcccgc aacaaaagcc acaacaacct ccaccatcaa tcgcctccga acaaaatttc 960 cccccgttag gtgggtcagt gaccgccact acaggtcaac agccttctgc tagctgtgga 1020 atgtttcgca gatctgctag tggaacatca gatggcaacg aaacagattc gtccgtcggc 1080 tcagctgcct ccaagcgaca actccgagtc cgaccaccag gcaagaagcc ccgcgtggag 1140 gagaaccaag aaaacaagcg agaaggaggg tacgaagcaa tctaatggct gagttcacca 1200 gttacaatct cgctaccgta aatatcaaca acatcacgaa cgccacgaaa atcgaagcac 1260 ttcggacctt catccgaaca ctcgatctcg atatcatttt tctacaagag gtagagaatg 1320 atcagctttc tctccccgga ttcaccgtca tttgcaatgt agaccacgcc aggcggggaa 1380 cagcaatcgc actcaaagac cacattcggt tctcccatgt agagaagagc ctagatggtc 1440 gtctcatcgc tttgcggata cagaacacga cgctctgcaa tgtctatgca ccgtcgggca 1500 cagctgcccg ggccgagcgg gaaaggtttt tcaacggtac gattgcctat tatcttcgct 1560 tcaacacgca gcacgtcatc cttgcaggag actttaactg cgtgttacgt cagggggact 1620 caacaagtcc caatacaagt ccagctctgc aggcgaccgt acaacagttg cgactacacg 1680 atgtttggtt gaaactatat cccaacacac ctgcacccac atacatcaca cacaacgcga 1740 catctaggct agatcgtata tatgtgagca cagggttatg cgaacatctt cggaatgcgt 1800 tcacgcacgt gtgtgcattc accgatcaca aggcattaac agcgcgaatt tgtctccccc 1860 accttggccg cgaacaagga cgcggcttct ggtcccttcg acctcacctt ttgcaagctg 1920 aaaatatcga agagttcgaa attcgctggc agttttggac ccggcaaagg agaaattttc 1980 ccagctggtt ggcatggtgg ctgtcgtacg cgaagccaaa aattaaatcg ttttatcgtt 2040 ggaaatccag agaagcattc aacattttca acgccgaaca tcagcgcctt tacgggcaat 2100 taagaatcgc atatgacggc tactaccaga atccaggcat gatatcgacc atcaaccgtg 2160 taaaagcaca aatgctaacg ctacaacgtc aatttgcgca aaatttcatc aggatcaacg 2220 aaacttttgt ggcgggagag agcttgtcca ccttccaggt gggcgacaga cgaaggaaaa 2280 aaactgtgat aacagaatta cagaacgagc aaggagatca acttcgaggc tcgcttgaaa 2340 ttgagcagca catgttccga tatttccgga acctttatgc gagagaagag acgagacaag 2400 acgcggaaga aaggtttgaa actgagagag ctatcccagc aaacgacgaa gacaatattg 2460 ctgtaatggc agagatcaca atcgacgaac tcaagagcgc aatcaaaacc tctcagaaac 2520 ggaaatcacc cggctccgat ggtctaccgg ctgagttcta tcaacgatca ttcgacgtca 2580 tatatcgtga actgtacttc gtaatgaatg aagcaatgac caacgagctt cccccggaat 2640 ttgctgatgg gaccattgtg ctgatcaaaa agagaggcgg cgatgggtcc gcgagaggat 2700 accgtccgat cagcctgcta aacgtagact acaaaattct cagtcgggtc atgaaagcac 2760 gattggaaaa aatcctacag actcatcgag ttctgagcga cgcacaaaaa tgtgcaaata 2820 ccgagcgttc catcttccaa gctactcttt cactgaagga tcgtgttgct cagctgatcg 2880 cgcgaaagca aagagcgaaa atgatctcat tcgacatgga tcatgcattc gaccgcgtat 2940 cccacgagtt cctgcaccga accctcctct ctctgggact caatccagat ttcgtcgact 3000 ggctctctcg agtggcaaac gcgtcttcgt cacggcttct gattaatggg cgactatctc 3060 aaccgtttcg aattgaacgc tccgtgcgtc aaggagaccc gatgtcgatg ctgctttttg 3120 tggtttatct tcaccctctt ctcacacgat tggagcgggt atgtggtggt gatctttgcg 3180 tcgcatacgc tgacgacatc actgttatag taacagcaac cagtacaatc aaccgtatat 3240 ttggtctgtt ttcctgttac gaactcgtag ccggcgcaaa gttgaacatg caaaaaacaa 3300 tagcgattga tgttggattc atcaatgggg atccactgac gatacctggt ctgcaaacag 3360 cggagacagt taaagttttg ggtgttattt ttaccaactc cgtgagacag atggtgaaat 3420 tgaactggga cagccttgtg tctagaatcg cacaaatcat ctggatgcat agtatgcgat 3480 ccctcactct acaacagaag gtggtgctgt tgaatacgtt catctcatcg aaggtatggt 3540 acctcgcgtc catagttcca ccacagtgcg cgcacacggc gaaactaaca gcatcgatgg 3600 gaacgttctt gtttcgaggg ctgccagcac gaataccgat gtttcaacta gcgcgcagca 3660 gagataatgg ggggcttaaa ttgcatcttc cagcaatcaa atgcaaagct cttctaatca 3720 accgttatct aaaagaaatt gattccactc ccttctatca ctcccaactt gcccaaaaca 3780 ttgcacctac gaccgaattt ccctgcttaa aacttttatc ccaacaaatt cctttgcttc 3840 cccatcacat caagcaaaat cctaccagcg agcaaataca tcttttctac ttggatcaaa 3900 ttgatttacc aagagtagaa cgaaatgaac caggacaaaa ctggaaaaga atctggagaa 3960 atatctcatc aaaaagactt tcgtcgaagc agcgtagcga tctttacctt ctagtcaacg 4020 agaagattga acatcgaaga ttgatgagca tcatgggacg agccaacgat ccaaactgtc 4080 agcactgtgt ggtggccatc gagacactgc agcataaatt ttgtgagtgc ccacgagttg 4140 ctcaggcatg gtctctcgtg caacggaaaa ttacgacgat tctgggaggg tggcaaaggc 4200 taaccttcaa cgatctctat cgacctgctc ttagtaatat tgtagaagtg aaccgaaaca 4260 aatctctaaa aatcttcatt caatacatca actatgttaa tgaatcagtt aataatagaa 4320 tagatgtaga tggactagat ttcatgctga atctagcaga ttaatatttt atattatcaa 4380 tatgttattt attatccaga actaaataaa caatatttta caaaaaaaaa aaaaaa 4436 // ID Nematis_Pp repbase; DNA; INV; 2649 BP. XX AC . XX DT 15-DEC-2006 (Rel. 11.12, Created) DT 09-JAN-2007 (Rel. 11.12, Last updated, Version 1) XX DE Nematis_Pp is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW retrotransposon; Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Nematis_Pp. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-2649 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Nematis_Pp is a Penelope-like element (PLE) from the nematode CC Pristionchus pacificus. It belongs to the Nematis group of PLEs. CC Its ORF contains regions homologous to reverse transcriptases and CC to GIY-YIG endonucleases. The element is apparently inactive, CC most copies are 96-98% identical. Consensus sequence was CC assembled from trace archives. XX FH Key Location/Qualifiers FT CDS 3..2273 FT /product="Nematis_Pp_1p" FT /translation="ITKLSREIGDRLRASERTRLYKKLQYLCPSIIPSCSS FT PSSSISRVSVASDVVLSPDAISVLSLGPKFVPSRPFPRDITLNLSISLHQL FT SFSLRWAHTLLGNSSSSIDPLVHKIPFPSVRSGVPPPVLSIDSKISRLSTH FT IRDIVSNHRVLPLCNNLSRAQSSALRDLRSQISSGSLRVTKSDKGGDFFVV FT PQSLERDIVKLHLSDPSIYVTSSSAQFKRTCDSIHRQFRAISPNISLPKRT FT VDCLLSGVPQVPVMYVLYKTHKCDLASAGNDPSRYPIRPIIVGCGGPTDRI FT SWFLSQLLIPLLKHVPTHLTNTRDALTALSSLDSNPDLCFESFDVSALYTN FT VSNRDAIRATMSLLSSHYRSLDLRGLSLLDIEDLLNSCLDCNIFAFDGLFY FT AQIRGLAMGNRLAPILAVIYMHYIEAQSIRDGIVWLGRYIDDYIVVAVTQN FT HLDSLFTSLNTIGAPHIKLTRETPTDNWLPFLNFQLSLTPDGVFNTKWYRK FT PANRNILLHASSCNPSHSKRSIVISTIRTAINVSSHTYRSQSRSLANHIVS FT SNGHCPPPRSKCVPNTRVSRRVDKPLPVFTIPFISDEFTRDIRDALCKVDL FT EVRIVEKPGPNLLNMLSRNRYLDPPKCPLRERCFFCPTGGEGCCQTAGVVY FT LIECSLCSHTYVGETGRPLHLRAYDHYKSSRNPTCNSYKHNAFAIHAISDH FT NSNPISLKGTIIHRESNTCRRKIAEAMFIVKLDPALNNRIELSKESAPITL FT QTLNI*" XX SQ Sequence 2649 BP; 597 A; 755 C; 418 G; 879 T; 0 other; gcataaccaa actctctcgg gagattggag accgtttacg tgcctcggaa agaactcgtt 60 tgtataagaa gttacagtac ctttgtccct ctatcatacc ttcttgttcc tctccttcct 120 cttctatctc acgtgtctcg gttgcatcgg atgttgttct aagtcccgat gccatctctg 180 ttctctcact tggtcctaag tttgttcctt ctcgcccatt ccctcgcgat attaccctca 240 atctctctat ttcattacat caattgtctt tctccctgag atgggctcac actctattag 300 gcaattcgtc cagtagtatc gatccgttag ttcataagat tccgtttccc tctgtccgct 360 ctggtgtccc tccacccgtt ctatccattg actccaagat atcgcgattg tccactcaca 420 ttagagatat tgtatcgaac catagggttc tccctctttg caataatctc tcacgtgccc 480 aatcctccgc ccttcgtgat ttacgttccc aaatttcctc tggctctttg cgtgtaacca 540 aatcagacaa gggaggcgat ttctttgtag ttccccaatc tctagagcgt gacattgtta 600 aactgcacct ttccgatccg tctatctatg tcacttcctc ctctgcccaa ttcaaacgca 660 cctgtgatag catccataga caattcagag caatttcccc caacatttct ttgcctaaac 720 gcacagttga ttgtctcttg agtggcgtac cccaagtacc tgttatgtat gtgctttata 780 aaacccataa gtgtgaccta gcttctgctg ggaacgatcc ctctcgttac cccatccgcc 840 ccatcattgt aggttgtggt ggccccactg atcgtatctc atggttctta tcacaattgt 900 taattccact acttaaacat gttcccactc atctgactaa tactagggat gctctaaccg 960 ctctttcctc tcttgactcc aatcccgact tatgctttga gagctttgat gtttccgcat 1020 tatacactaa tgtaagtaac cgagatgcca ttcgcgcaac catgtcattg ttatcttctc 1080 attatcgctc tctcgatcta cgtggattgt cactacttga tattgaggac cttctcaatt 1140 cttgcctaga ctgtaacata ttcgcttttg atggtctttt ctatgctcaa atccgtggtc 1200 ttgccatggg caatcgatta gctcctatcc tcgcagtgat ttatatgcat tacattgagg 1260 cccaatctat ccgtgatggc attgtgtggt taggtcgata catagatgac tatatcgtcg 1320 ttgctgtaac ccaaaaccac ttagatagcc ttttcacttc cttaaatact attggtgctc 1380 ctcacattaa gcttactaga gaaaccccaa ctgataattg gctcccattc ttaaatttcc 1440 aactctcact tactcctgat ggtgttttca acactaaatg gtaccgtaag cctgccaatc 1500 gtaacattct cttgcacgct tcctcgtgta atccgagtca ttctaagcga tctattgtca 1560 tttcaactat tcgcactgct attaatgttt catctcatac ataccgtagt cagtccagat 1620 cactagcaaa tcatattgtg agtagtaatg gtcattgccc tcctcctcgt tccaaatgtg 1680 tccctaatac tcgtgtttca cgtcgtgttg ataagccact tcctgtattt actatcccgt 1740 ttatcagtga tgagtttacc cgcgatattc gtgatgcact gtgtaaggtt gatctagagg 1800 ttcgaattgt tgagaaacct ggccctaact tgctcaatat gttgtcacgg aatagatatc 1860 ttgacccacc taaatgccca ctacgtgaac gttgcttttt ctgtcctacc ggtggtgaag 1920 gctgttgcca aactgcggga gtcgtatacc tcatagaatg ctctctttgt tcccacactt 1980 atgttggtga gactgggcgt cccctccacc ttagagccta tgatcactat aaatcctccc 2040 gcaatcccac ttgtaacagt tacaaacata atgcatttgc tattcatgct attagcgacc 2100 ataacagtaa tccgatctca ctcaaaggta ctatcattca tcgcgaatct aatacttgtc 2160 gtcgcaaaat tgccgaagct atgtttattg tcaaactgga tcctgctttg aataacagaa 2220 ttgagctctc gaaagagagt gcccccatca ccctccaaac cctcaatatc tagcccattc 2280 gctcaacatt ccttcactcc cctcttccat ctcttctcat ctctcatctc tcatatcccc 2340 tctctcctct cctctctccc tacactctct actcacccct tcccatcctc cctcttcccc 2400 tcattcatcc atcctttctc aatttgtctt ctcatttcct ttctctctcg ttcttacttg 2460 cacttcctat ggtcctaaca ttgtcttttc atgtgttatt ccgactgatg atgggctgct 2520 gcccgaaacg tctcaaccga tgataaaaac atctttcatc taccttccct ttcttgtccc 2580 ttctccggca ttactaagaa gcccacccag tattcccgat atgacaagaa tcactgagta 2640 tccactatt 2649 // ID PiggyBac-4_HM repbase; DNA; INV; 3066 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE PiggyBac-type family: consensus. XX KW piggyBac; DNA transposon; Transposable Element; PiggyBac-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3066 RA Bao W. and Jurka J.; RT "PiggyBac families from Hydra magnipapillata."; RL Repbase Reports 9(2), 453-453 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(889..1089,1034..1657,1584..2672) FT /product="PiggyBac-4_HM_1p" FT /translation="MEDCGDSEDDDIEDSDSFFAYTTTSSSLTASESXSED FT NSTEESDDDQLTERSSSSHCRRFFNAAEKSQNVHLAATAEDFLMPLKRART FT TGGLAYRQNQNKAQQVNLIRNILPDKTTSETNPSRNLNQWKDEXNIVKXIK FT FTANPGLKINMESKKPLDFFRLFVTEELINTMVVETNRYAMQEINKQRPLR FT RSSRFKDWKPINSEDMRQFLGVLLHMGCVKLPSFEHYWSKNSLYRFPLFSR FT IMPRNKFQLMLRFWHFINNEDSGSGRLCKIIGLHCYGSGILLTMKIRVVDA FT CVKLLDFIDHLNNTMDNIYCPNKNISIDESMMLWKGRLVFRQYVRQYVKNK FT RHKYGIKFYELCESDGIVLKVKIYSGEATLDKHLLGQTGAIVLDLMEKFLG FT KGYHLYTDNFYNSFELTKHMISQKTYICGTLRTDRKSNPKECTKAKLKQGD FT VISRSREGVVVAKWKDKRDVLMISNLHSLQMIEVTNRRGEKKMKPNIIKDY FT NQYMSGVDKADQMVSYYDCLRKTIRWYKKVALHLFDIFVFNAYCLSSKYGV FT DKTISLLKFRELVVTDLLGDRLNEIVPRITNNSLHYLSSIPPNEKKKLPTK FT PCRVCSKIKRKETRYECAVCENRPPLCVGECFRMYHSKE*" XX SQ Sequence 3066 BP; 1080 A; 451 C; 529 G; 993 T; 13 other; cacgttcgct gccgagctat ttttagctat ggtttatttc tttgccgagc cgtaaattct 60 gatatgatta acttttgccg aggctgtttt gcatggtttc attcccattg ccaaagttag 120 atatcaacat ttcaataatt ataatatttt taatctaaaa taattttata gtacttttta 180 tactgaatat aactgacaaa aaataagcac tacgagtata ctcgtacttt ggcactggtg 240 aataattttt tttctccagt gccgaagtac gagttatact cgtaagtatt ttaatgagta 300 taactcattg tggtgactaa caatgtagct tatttgatgc attaagcatc cgcattttat 360 tgtttacttc aaattttatt gtgtacttca aataagtaat gtcactgcaa tgagtataac 420 tcattgcagt gacattacta gtcatcgcga ctgtgtcaca ttgtgcggtg acattactag 480 ttaccacatt aaataatagt tgwtgcatta ttattataar ttgtgtttca atttttttgc 540 acatgcacat tggcagttat catgagtaaa aaaggtaatt tatttatttc gatttttttc 600 ttaattctaa gttattgtat tttttggact ataatctaaa ataaatttta taactacatt 660 atactgcatg ttatgaataa agggtaaaaa aacctaagtg tgcaataaaa tgttttttta 720 gtttattaaa aaatatttta gtgtgcactt caaattttta acacagaata ctaattaatg 780 tgttttataa ataagtaatt tcccttwtat aaatatataa cttataaatt tatctatcta 840 atagtagttc aacacagata ttccgcaatt gaagcagcac atatagttat ggaagattgt 900 ggrgatagtg aagatgatga tattgaagat tcagattcat tttttgctta tactacaacg 960 tcttccagct tgactgcatc agaatcagaw tctgaagata atagcactga agaaagtgat 1020 gacgatcagt taacagaacg ttcatctagc agccactgca gaagattttt taatgccgct 1080 gaaaagagct agaacaacgg gtggtcttgc ttacagacaa aatcaaaata aagctcaaca 1140 agtaaatctt ataagaaata ttttgcctga caaaacaact tctgaaacaa atccatctag 1200 aaatttaaat caatggaaag atgagcmgaa tattgtaaaa maaattaaat ttactgccaa 1260 tcctggtttg aagatwaaca tggagagcaa gaaaccattg gatttcttta gactatttgt 1320 tacagaagag cttataaata ctatggtagt tgaaactaat cgctacgcaa tgcaagaaat 1380 aaataagcag cgtccgctta gaagaagctc gcgttttaaa gattggaaac caataaattc 1440 agaagatatg cggcagttcc ttggagttct tcttcatatg ggatgtgtta aattgccatc 1500 ctttgaacac tactggtcta agaacagtct ctatagattt cccttatttt ctagaataat 1560 gccacgaaat aaattccaac taatgttacg gttctggcat tttattaaca atgaagattc 1620 gggtagtgga cgcttgtgta aaattattgg acttcattga tcacttaaat aatacaatgg 1680 acaacatcta ctgtcctaat aaaaatatat caatcgatga atcaatgatg ctttggaagg 1740 gacgtcttgt attcaggcaa tatgttaggc agtatgtgaa aaataaaaga cataagtatg 1800 gtatcaagtt ctatgagctt tgcgaatcrg atggtattgt actaaaagta aaaatctact 1860 ctggtgaggc aactctcgat aaacatttgt tgggtcaaac tggagctatt gttttagatt 1920 taatggaaaa gtttctagga aaaggttatc acctgtacac cgataacttc tacaattcct 1980 ttgaactaac aaagcacatg ataagtcaaa aaacatacat atgtggtack ttaagaactg 2040 accgaaagtc aaatccaaaa gaatgtacaa aagctaaact gaaacaggga gacgttataa 2100 gcagaagcag agagggtgta gtggttgcta aatggaaaga taaaagagac gtgctaatga 2160 ttagcaactt gcattcgttg caaatgattg aagtgacaaa caggagagga gagaagaaaa 2220 tgaaacccaa cattattaaa gattayaatc aatatatgtc aggagtagat aaggcggatc 2280 aaatggtatc atattatgat tgcctaagaa agacaattag atggtataaa aaagtagcac 2340 ttcatctctt tgatattttt gtgtttaatg cttactgtct aagcagcaaa tatggagtag 2400 ataaaacaat ttccttgctt aagtttagag aattagttgt tacggattta ttgggtgatc 2460 gcttaaatga aattgtgcct cgaatcacca ataatagtct ccactacttg tcttcaatac 2520 caccgaatga aaagaaaaaa ttgccaacaa aaccatgtcg tgtttgctct aaaataaaac 2580 gaaaagaaac acgatatgaa tgtgctgttt gtgaaaatag acctccgtta tgcgttggcg 2640 aatgtttcag aatgtatcat tcaaaagaat aggtttacac tttaattttc atgaaataaa 2700 attaaaacaa tatcaatatt ttacttttat ctgttttaar ataattctgt ctttatccgc 2760 atttaaatta ggttcccggg atttttaaag taaaaactta tgttgtaatc ttttctgaga 2820 atcaaatgtg aagctaagaa aaaaaaattt aaaaatagay aaagccaatc atgcacaaat 2880 attatttgaa ataagttagt ataactacga gtttactcgt agttggcaaa ctcatgacgc 2940 cggcttacta cgagtatact cgtagttggc aaatatcaag cgtttactta ctacgagtat 3000 actcgtagtt ggcaaatatt aagcgtttac ttactacgag tattctcgta gttggcagcg 3060 aacagg 3066 // ID Mariner-20_SM repbase; DNA; INV; 1246 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-20_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1246 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1869-1869 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 77..1081 FT /product="Mariner-20_SM_1p" FT /translation="MKNQNINNEDRNRIIQAYIKGHSASDLSSILGFKRTT FT IYSIIKKYINHESIEKKQRGGQRRKSLSEQDKEAIRGWIDENCALTLKQIK FT SKIELERGVVVSQSTINRCIGGFGYSMKRINIMPIKRNTVEVIELRAQYAN FT RFMELLSTIDDSKIFFIDEVGFNVSIRSRRGRSLKNKRAVQNVSSIRARNI FT SVCCAMNKEGIFKYFTQTKSYNTQSFLEFIRLIIEKMSSLNISGAIFIMDN FT VPFHKNIAIKEEIEISGHRILFLPPYSPFLNPIENLFSKWKQFIRSSRPEG FT EDQLFENIENGSSLISSDDCKNYFRHMLGFLPICIRNEPVIDE" XX SQ Sequence 1246 BP; 461 A; 160 C; 202 G; 423 T; 0 other; tcaattttga cattttggca atcaattttg acattttgtc aatcaatttt gacattttct 60 aaaacctaac cccttaatga aaaaccaaaa cattaataac gaagacagaa atcgcattat 120 acaagcctat ataaaagggc atagtgcctc ggatttatca tcaattttgg gttttaaaag 180 aaccactatt tatagcataa taaaaaagta tataaaccat gaatctattg aaaaaaagca 240 aaggggaggt cagcggagaa aaagcttatc agagcaagat aaagaagcaa tcagaggatg 300 gattgatgaa aattgcgcgc ttactttaaa gcaaataaag tcgaagattg agcttgagag 360 aggtgtggta gtaagtcaaa gcacgattaa tagatgtata ggtggatttg gttatagtat 420 gaagcgcatc aatataatgc cgataaaaag gaataccgtg gaagttattg aacttagagc 480 ccaatatgcg aatcgtttta tggaattatt gtcaactatc gatgattcta aaattttttt 540 tatcgatgaa gtcggattca atgtttctat aagatcacga agaggaagat ctttgaaaaa 600 taaacgtgct gttcaaaatg tttcttctat aagagcacga aatatatctg tttgctgtgc 660 catgaacaaa gaagggattt ttaaatattt tactcagact aaatcgtata atacacaaag 720 ttttcttgag tttatacgct taataattga aaaaatgtca tctttgaata tttcaggtgc 780 aatttttata atggacaatg ttccatttca taaaaatatt gcaataaaag aagaaattga 840 aatatcggga catagaatcc tatttttgcc accttattct ccatttctaa accctataga 900 gaacctattt tcaaaatgga aacaatttat acgtagttcg agacctgaag gtgaagatca 960 gttattcgaa aatattgaga atggttcttc ccttatttca tccgatgatt gtaaaaatta 1020 ttttaggcat atgttagggt ttttacctat ttgcataaga aatgagccgg ttatcgatga 1080 ataaaagtag tatttttatt tactttctca ttttttttaa ttttattaat ttattaattt 1140 attaatttaa tttttaattt attttttaat tttcttctaa aattaacaaa atgtcaaaat 1200 tgattgacaa aatgtcaaaa ttgattgaca aaatgtcaaa atatga 1246 // ID BEL-180_AA-I repbase; DNA; INV; 5606 BP. XX AC supercont1.7; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-180_AA_; KW BEL-180_AA-LTR; BEL-180_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5606 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.7; Positions 3241817 3247422. XX CC Positions [4653-5237] - Integrase core CC 'ATCTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 258..5606 FT /product="BEL-180_AA-I_1p" FT /translation="MTEETPHPKRSKTPRKILDVLSGTPSSIYVSVQQQKQ FT EQAKKEMAKQLERILDRREFAKERMFRIREGLEKNVVNIHWFKLQLETLRQ FT CNEELQRTFIEICDLVPRNQRDEHRENHFRSEELHNELFVYIQTEIAKLNA FT AEEEKQRSTMNVLAPAFVPQQPVVMNNSMPHLPVPLPTFDGSLENWYSFKC FT MFKTIMNRYPHESPAIKLYHLKNSLTGSAAGKIDQDVINNNDYDSAWRMLE FT ETYEDERLIIDTHIDALLTLPKMTSENGTELRNLLDNCLKHVDALKNRSLP FT VEGLSEMILINVVAKRLDKETRKLWESQIPSDDLPTYPDMVDFLRERSRIL FT QKMRSYTEAASSIPSKQRGKQSDLKLTPGRNFAQTSQSNEWCACCNGEHVI FT YKCDVFESLTVNDRYAKVKQAGLCFNCLRRGHRTGDCNSERTCKTCKRKHH FT SLLHEGKSTVNQLQVNQADAEAPAVAEEHREDAEEIPGSVNCAKTTMPKQV FT LLSTAEVLVLDSNNRSVLCRALLDSGSDSNLISKALATTLNIPMESVNIPI FT TGVNNAETRIKHKLRTKILSRVSSFDAVLDFLVVPTVTANLPTMKVDIHSW FT SIPTNIALADPLFHIPDEIQMIIGAELFFELLKNGRMNLAEGSPMLIETDL FT GWVVSGPVKVNQNGPSRSVCQLNITDEQLNRTLVKFWEIESCNEASPLTPA FT EQAIEKHYEETFSRDETGRYIVKLPFNENKSQLGDSLEMARTRFKRLLRSF FT ANNEKKRRYTEFMTEYQTLGHMIQVQHNPEDCYFLPHHAVYKESSSTTKIR FT VVFDASAKTTSGISLNDALAVGPTVQNDLVTILLRFCSYPVVLTADIPKMY FT RQVRIHENDRKYQRILWLDTNNEIATFELTTVTYGCASAPYLATRTLIQLA FT KDEASELPLGSKVIEENSYIDDFLTGGNSEQEVIAIYQQLTELLRRGGFGI FT HKFCTNSEVVRKIIPPELQETLFDFEDADINSVIKTLGVIWNPNDDYFTFN FT VSPLHGKVTNAIPTKRSVLSAIGQLFDPCGYLGPVTTTAKLLMQDLWRLKL FT SWDDELPEEQYDLWTTFQEQLPLMNELRKKRCVITRGAAAVELHGFSDASK FT RAYGAVLYTKCISPDGSVDVELVCSKSRVAPLKPMTIPRLELCGTLLLARL FT VEKTAAAMKIPFSNVTLYTDSQVCLSWFAKSPLALNQFVANRVATVHELTQ FT DYKWCYVRSQENPADIISRGMLPAELLTEEKWFKGASALWQPNCSANEDTI FT CLDDDELPELKPTVVATSVRQKPQIDLTRMSSFRRLQRAWAYVLRFIKNVR FT QKKRDTSELQTQEITKATQIIMMLVQRETFYDLLHALKEGKKTLKQYRGLA FT PFIDKDGLIRVGGRLKYSSIPYDGKHQIMLPEKHHVTQILVRQLHTDHFHV FT GQRGLLSIVRERYWPIKVKTLIKQLVSKCYVCFRQNPTQVDQFMGDLPDYR FT ITPSPVFSNTGVDYAGPVYLKETGRKKTTYKAYIAVFICLATKAIHIELVS FT NLTTENFIAALQRFISRRGMVTNMYSDNGTTFVGANHELAELRKLFEDQTH FT QRQLNDFCISKGIEWHFIPPRSPHFGGIWEAGVKSIKHHLKRVVGETKLTF FT EEMTTFLAQCEAILNSRPLIPVSDDPNDIEVLTPSHFLIGRSAVSIPEPSY FT AEEKIGRLNRWQHVQLMKEHFWKRWSSEYLHYLQSRPKWHSETAKIDIGDV FT VVLKDENAPPHQWRMGRIVATHPGHDGIVRVVTVRADTKEFRRAVSKVCFL FT PKVDPLDSTGGV" XX SQ Sequence 5606 BP; 1647 A; 1298 C; 1338 G; 1323 T; 0 other; ttttggtcca ttcgagccag atgaacattc tgtgatgggt ttttcgttaa tttgtggtga 60 aattccggtc gaattcgcga gagttcattg atttccaccg tgtggaaaat ccgatcggca 120 acgacgccga acgacgcaac gaacagtgtg tttgtgcgtg cacagaacgt gttcaaacgt 180 gttgaacgca gagtgaaaaa caagtgaaag acttcgaaca aactttaacc gtttgtattc 240 agtgaagtgt tggtctaatg actgaagaaa ctcctcatcc gaaacggagt aaaaccccga 300 ggaaaatcct ggatgtttta agcggcacac cgtcatcaat ttacgtgtcg gtgcagcagc 360 aaaaacaaga gcaagctaaa aaggaaatgg cgaaacaact ggaacgaatc ttggatcgcc 420 gcgaatttgc gaaggaaagg atgttccgga tccgcgaggg attggagaag aacgtcgtca 480 atattcattg gttcaagcta caactcgaga cgcttcgtca gtgcaacgaa gaactgcagc 540 gaacgttcat cgaaatatgc gacctggtac cacgaaatca gcgagatgaa cacagggaaa 600 accatttccg gtctgaggag ctgcacaatg agctctttgt gtacatccaa acagaaatcg 660 cgaagttgaa tgcggcggaa gaggaaaaac aacgcagcac gatgaatgta ttagcccctg 720 ctttcgtgcc gcagcagcct gtggtaatga acaactcgat gcctcacttg ccagtaccat 780 tgccgacatt cgacggcagc ctcgagaatt ggtattcatt caagtgtatg ttcaaaacca 840 taatgaacag gtatccacat gaatccccag caataaaact gtaccacttg aagaactcgc 900 ttactggaag tgctgctggt aaaatcgacc aagacgtaat caacaacaac gactacgatt 960 cagcgtggag aatgcttgag gaaacctacg aggacgagcg actaattatc gatacacaca 1020 tagacgctct actcacgtta cccaagatga cgagtgagaa cggcactgag ttgcggaact 1080 tgcttgataa ttgtttgaaa catgtcgacg ctttgaagaa ccgatcgtta ccagttgaag 1140 gactctccga gatgatccta atcaatgtag tggcgaaacg cttggataag gaaaccagga 1200 aattgtggga atcacaaata ccatcggatg acctaccaac gtacccggat atggttgact 1260 tccttcgaga gcgaagtcgc atcctacaga agatgagaag ttacaccgaa gcagcgtcat 1320 caattccatc gaaacagaga ggcaaacaat ccgacctgaa acttactcca ggcagaaact 1380 ttgcgcaaac ttcccaatcc aacgaatggt gtgcatgttg caacggtgag catgtaatct 1440 acaagtgcga cgtatttgaa agccttacag tgaacgaccg ctacgctaaa gtaaagcaag 1500 ccggcctctg ctttaattgc ctgcgacgtg gtcaccgtac cggtgattgc aattcggaga 1560 ggacctgcaa aacatgcaag cgtaaacatc acagtctcct acatgaaggc aaatcgacag 1620 tcaatcaact tcaagtaaac caagctgacg cagaagcacc agcagttgca gaagaacatc 1680 gtgaggatgc tgaagaaata ccaggatcag ttaactgtgc caagacgacc atgccgaaac 1740 aagttctcct ttcgacagca gaagtgcttg ttctggattc caacaacaga agtgtactat 1800 gccgagcttt gttggattct ggttcggact caaatctcat cagcaaggcg ttggcaacga 1860 cgctgaacat ccccatggag agcgtgaaca ttccgataac cggagttaac aacgctgaga 1920 ctcgaatcaa gcataagctc cgtaccaaga tcctgtcacg tgtcagttca tttgatgcgg 1980 ttctggactt cttggtcgta ccgacagtta ctgctaattt gccgacaatg aaggtggata 2040 tccattcctg gtcaataccc actaacatag ccttggcaga tccattgttc catattcctg 2100 atgaaattca aatgatcatc ggcgctgaac tgtttttcga gcttctcaaa aacggacgga 2160 tgaatctagc tgaagggtcg cctatgttga tcgagaccga cttaggatgg gttgtcagcg 2220 ggccagtcaa ggtaaaccaa aacggtccct ctcgtagcgt ctgccagttg aatattacgg 2280 atgagcaatt gaatcgcact cttgtcaaat tttgggaaat tgagagttgc aacgaagcat 2340 caccattaac gccagcggaa caagccatcg aaaagcacta tgaggaaaca ttttcccgag 2400 atgaaacagg tcggtacatc gtcaaacttc ctttcaatga aaacaagagt caacttggcg 2460 actccttgga aatggctagg acacgattca aacggctact acgatcgttt gccaacaacg 2520 agaagaaaag gcgctacact gagttcatga ccgaatacca gacactgggg catatgattc 2580 aagttcagca caatcccgag gattgctact ttcttccgca ccacgcagtt tacaaggaat 2640 cgagttctac tacaaaaatc cgtgtagtat tcgacgcatc ggcgaaaaca acgtctggca 2700 tttcattgaa cgacgcccta gctgtgggcc caacagttca aaacgatctc gtcacaatac 2760 tgctgcggtt ctgttcctat cctgtagtcc tgacagcaga catcccgaag atgtaccgtc 2820 aggtacgtat tcatgaaaac gatcgaaaat atcagcgtat cctgtggctt gacaccaata 2880 atgaaatagc cacgtttgaa ttaacaacgg ttacgtacgg ttgcgccagt gcaccctatc 2940 tggcaacgcg cacgttgata caattggcga aagatgaagc aagcgaatta ccacttggat 3000 ccaaagtcat tgaagaaaat agctacatcg atgacttcct caccggaggt aacagtgagc 3060 aagaggtgat tgcaatctac cagcaactca ctgagctact gcgacgagga ggctttggaa 3120 tccataaatt ttgcaccaac agcgaggtgg tccgtaaaat cattccgcct gaactccaag 3180 aaacactttt cgacttcgag gacgccgaca tcaacagcgt tatcaaaact cttggcgtca 3240 tttggaatcc gaacgacgac tatttcacct ttaatgtgag tcctcttcac ggcaaggtaa 3300 cgaatgcaat accaaccaaa cgcagcgtgc tttctgctat cggacagctt ttcgatccat 3360 gtggctatct tggtcctgtg acaacgacgg caaaattgct gatgcaagat ttgtggagat 3420 tgaagttgag ctgggacgat gaactaccgg aggagcaata cgacctgtgg actacattcc 3480 aggagcagct gccgttgatg aacgagctgc ggaagaagcg atgtgttatt acacgcggag 3540 cggcggcagt agagctccac ggattttcgg acgcttccaa gcgtgcgtat ggagcagttt 3600 tgtacactaa gtgtatttct ccagatggat cagtggacgt cgaactcgtg tgcagcaaat 3660 caagagtggc tcccttgaag ccaatgacca ttcctcggct ggaattgtgc ggaacacttc 3720 ttctggcacg tctggttgaa aaaacagctg ctgcaatgaa aattcccttt tccaatgtga 3780 cactttacac agactcacaa gtatgtttga gctggtttgc aaaatcaccg cttgctttga 3840 accagtttgt tgcaaatcgt gtcgccactg tgcacgaatt gacgcaggac tacaagtggt 3900 gctatgtgcg atctcaggaa aacccagcag atattatttc acgtggtatg ctaccagcgg 3960 aactgctaac agaagaaaag tggttcaaag gtgcatcggc tttgtggcag ccaaattgtt 4020 cagctaatga ggacaccatt tgtttggatg acgacgaatt gccggaattg aaaccgacag 4080 tagtcgccac ctccgtgcga caaaaacctc aaatcgattt gacacgtatg agcagctttc 4140 gacgactaca acgagcttgg gcttatgtgt tgcggttcat aaaaaatgta cgtcagaaga 4200 aacgtgacac ttccgagctt caaacgcaag aaatcaccaa ggcaacccaa attatcatga 4260 tgcttgtgca gagggaaacg ttttacgacc tgctccacgc cctgaaagaa ggtaagaaga 4320 ctttgaagca atatcgtggt ttggctccat tcatcgacaa ggatggatta attagggttg 4380 gcggacgtct taaatactct tcgatcccct acgacggcaa acaccaaatc atgttacctg 4440 aaaagcatca cgtgacccaa atacttgtgc gccaactcca taccgatcat ttccatgtcg 4500 ggcagcgtgg tttgctgtct atcgtacggg aacggtattg gccgataaaa gtgaagacgc 4560 ttattaaaca gcttgtctcc aaatgctacg tttgctttcg gcaaaatcca acgcaggttg 4620 accaattcat gggcgactta ccagactatc ggattacacc atcacctgta ttttcgaaca 4680 ccggggttga ttacgctgga ccggtatatc tcaaagaaac tggtagaaag aaaacaacgt 4740 acaaggcata catcgcagtg tttatttgct tggccacgaa ggccatccat attgaattgg 4800 tttccaattt gaccacagaa aattttatcg ctgccctgca acggtttatc agcagacgtg 4860 gtatggtcac caatatgtat tccgataatg ggacgacgtt tgtgggagcg aatcatgagt 4920 tggccgagct acgcaagttg ttcgaggacc agacacatca acggcaactg aacgatttct 4980 gcatttcgaa aggaatcgaa tggcatttta tcccgccacg tagtccccac ttcggcggta 5040 tatgggaggc tggagtaaaa tcaattaagc accatttgaa gcgtgtcgtc ggtgaaacga 5100 agttgacctt tgaagaaatg actacttttt tggcgcagtg tgaagcaatc ctgaacagtc 5160 gaccgttaat tcctgtttcg gacgatccga atgacatcga agttttgact ccatcgcact 5220 tcctcatcgg gagatctgct gtgagcattc cggaaccgtc atatgcagaa gagaagatag 5280 gccgattgaa ccgttggcaa cacgtacaat tgatgaagga acatttttgg aagcgctggt 5340 cgagtgagta tttgcattat ttacagtcca gaccaaagtg gcacagcgaa actgcaaaga 5400 tcgatattgg cgatgtcgtc gttttgaagg atgaaaacgc acccccacat caatggcgaa 5460 tgggacgcat cgtggcaacg caccccggac atgatggcat tgtgcgagtc gtcacagtac 5520 gtgcagatac caaggagttc cgtagagctg tatccaaggt ttgcttcctt ccaaaggttg 5580 atccgttgga ctcaacgggg ggagta 5606 // ID BEL-33_CQ-I repbase; DNA; INV; 3451 BP. XX AC AAWU01003985; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-33_CQ_; KW BEL-33_CQ-LTR; BEL-33_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3451 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 219-219 (2011). XX DR GenBank; AAWU01003985; Positions 56454 53004. XX CC 'AAACT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 232..1398 FT /product="BEL-33_CQ-I_1p" FT /translation="MFRRSPAALPDHVLEVSPTLPDPILEVFSALSDRVPE FT ANHLLPNTVSEASLALPDPVPEVLPSLSDRVPKSSHVLSNTTSEASLALPG FT PVPGVLPPMSDRVPKSSHVLPNTTSEASLTLPGPVPGVLPPVSESAPEANH FT VLLNTASDASLALPGPVPGARPALTDRVSEALQNLDELAATVDRPENVPPS FT TPAVSSSTPESELLTTKCSDRAAGTVQTAADFSTWTSAPQREVAMNPTAKK FT LADAFRQRHQAEQKLIFVEELLLGMCQPTLAQLTVLYDFLCTAYREQSQHH FT LTIVGIIPDEDLAQQEAEFEKHDANFYKVATVVAELQIEAERTEPVSTSAA FT AKSAVNRQRLQAESWLKQMLRAWESNRVDPLDSNVQLQAPGSQKWS" FT CDS 1302..3449 FT /product="BEL-33_CQ-I_2p" FT /translation="MAEADAEGLGIEPSGPSGFKCPTPSTRKSEVVLRDCA FT PTVCIPKPEPSLNQLKSLAKQIATKSTCELASDSRIILRSEQPSAVDSRPR FT EDFNMNPIADKFDFLEPTGYLKNVTAEAEPGPVMPAIPSVQVSLRYRSITH FT PDPRRVHSEEPKNLPDFELQQNCTEPVGSFKKDFETVMTVPKYQLQLPSAA FT KAPDKRRHICPSRVTTNSDSKSDAPNEVESRETSAVSSKLDPADRRVSIRV FT SRDVATRVSLKLSLSAMAACSDNQQSEHSNRHRGTKSDVANETRSQRLEIR FT LPTVPANRAFAEVPEDVANSENSERDEESPKSRQRASSCEVHADSIPEQTP FT EAVILDRTVALDPDINRFHSDRDLTNPEDKADFSRVTKDSSSKVTKMIPRS FT RVPGVHNSVKNTHRVHLDIVPETSFRMVKTVAEKNPLMPPVSNAPEKLKQY FT RTMKKTQPRAFGVPPTRCQPRQVCQECAKPQNQTKEVTNQEINHYKDSDPV FT TDDCALQVPNRINGRVTRRDSVTTTPTGINPIKPGGRCRIVHSETASYNPF FT RALQRQETISVSKKQGVVDAPDRTDDLALNGGAERTTPRKIWHAVAVLQLL FT QEPRDTFQNKDKKPATQSKPNDSLYDEKIAQNWLIDRELIATKMLNTSYLY FT RAEANLTPDERETLRPLEEARPTFTILKRPWDPGSTCHTSKPISDPDQTAV FT QERRFRNAVERTQRGE" XX SQ Sequence 3451 BP; 892 A; 1036 C; 884 G; 639 T; 0 other; taaatggtcc gtaccgaacc ggattcaaag tgactgcttc ggctgctgcc cgcgtgtccg 60 atcgtgttgg tttcctgaac accgcgcgcg agtgaaaatc tgtgtgtgtg tgtcacttcg 120 acataaaaaa gtgacattcc aaaatgtttc agtgacgaat tccgtgccac ggatttcttc 180 cacctgtctg gccggaccca gttccggaga caaccctggc cttgccgaac catgtttcgg 240 aggtcacccg cagccttgcc ggaccatgtt ctggaggtat cccccacctt gccagatccc 300 attctggagg ttttttccgc cttgtcggac cgtgttccgg aggcaaacca ccttctgccg 360 aataccgttt cggaggcatc cctcgccctg ccggatcctg tcccggaggt tctcccgtcc 420 ttgtcggacc gtgttccgaa atcaagccac gtcctgtcga ataccacttc ggaggcatcc 480 ctcgccctgc cgggtcctgt cccgggggtt ctcccgccca tgtcggaccg tgttcctaaa 540 tcaagccacg tcctgccgaa taccacttcg gaggcatccc tcaccctgcc gggtcctgtc 600 ccgggggttc tcccgcccgt gtcggagagc gctccggagg caaaccacgt tctgctgaat 660 accgcttcgg atgcatccct cgccctgccg ggtcctgtcc cgggggctcg ccctgccttg 720 acagaccgtg tttcggaggc actccagaat ctagacgagc tagcagcgac agtcgaccgc 780 ccagaaaatg ttccaccatc aacaccagct gtgtcgtcgt cgactccaga aagcgagctc 840 ttaacgacca aatgcagcga ccgagcagca ggaactgtcc aaactgcggc agatttctcg 900 acgtggactt ccgcacccca gcgggaagtt gccatgaacc caacagcgaa gaagctggcg 960 gatgcttttc ggcagcggca ccaggcggaa cagaagctga ttttcgttga ggaactgctt 1020 ttgggtatgt gccaaccaac gctagcacag ctgacggtgc tgtacgattt tctgtgtaca 1080 gcgtatcgag agcagagcca gcaccatcta acaatcgtcg gaatcatccc agatgaagac 1140 cttgctcagc aggaggctga gttcgaaaag cacgacgcaa atttctacaa agttgctact 1200 gtggtggccg aattgcagat cgaagcagaa agaaccgagc cagtatcaac aagcgcggca 1260 gccaaaagtg cagtaaaccg acagcgccta caagctgaat catggctgaa gcagatgctg 1320 agggcctggg aatcgaaccg agtggaccct ctggattcaa atgtccaact ccaagcacca 1380 ggaagtcaga agtggtcttg agagactgcg cgcctaccgt atgcatccca aagcctgagc 1440 cgagtttaaa ccagctgaaa tccttggcga agcagatcgc caccaagtcc acctgcgagc 1500 ttgccagcga ttcccgaata atcctgagat cggaacaacc gagtgcagtc gactcccggc 1560 cgagggagga tttcaacatg aacccgatcg cggacaagtt cgacttcctc gaaccgaccg 1620 gctatctcaa gaacgttact gccgaagcag aaccgggccc agtgatgcca gcaatcccct 1680 ctgtccaagt ttcccttcgc tatcgatcaa ttacacaccc cgatccacgc cgtgtccaca 1740 gcgaggaacc gaagaacctt ccggatttcg agctccagca aaactgcacc gaaccggtag 1800 gtagtttcaa gaaggacttt gagacggtga tgaccgttcc gaagtaccag ctacagctgc 1860 cgtcggcggc aaaggctcca gacaagcgta gacatatctg tcctagccgt gtcaccacaa 1920 acagcgactc caagtctgac gctcccaacg aagtcgagtc acgggagact tccgcggttt 1980 caagcaagct agatccagct gaccgccgag tgtctattag agtgtccagg gatgttgcca 2040 caagggtgag tctcaagttg tcgttgtcag ccatggcagc gtgtagtgac aatcagcaaa 2100 gcgaacattc gaaccggcac cgaggcacca agtccgatgt agccaacgaa accagatcgc 2160 agcgtttgga gataaggctg ccgacagtcc cggcgaaccg agcgttcgca gaagttcctg 2220 aagatgtcgc caacagtgaa aattccgaac gtgacgaaga atctcccaaa agtcgccaac 2280 gtgcgagttc ctgtgaagtc catgcggaca gcatccccga gcaaacccct gaagctgtga 2340 ttctggaccg aacggtggca cttgacccgg acatcaaccg tttccattcg gatagggatc 2400 tgaccaaccc agaagataaa gccgatttca gtcgtgttac gaaggacagc agctccaaag 2460 tgacgaagat gatcccccgc agcagagtgc ctggagtgca caattccgtc aagaataccc 2520 accgcgtcca cctggatatc gttcccgaaa ccagctttcg gatggtgaaa accgttgcgg 2580 aaaagaaccc cctgatgccg ccagtgtcaa acgctccgga gaaactcaag cagtaccgca 2640 ccatgaaaaa aacccaacct cgcgcgtttg gtgtccctcc taccagatgc cagccacgcc 2700 aagtatgcca agaatgtgcc aagccccaga accagaccaa ggaagtgacc aaccaagaga 2760 tcaaccacta caaggacagc gacccagtca ccgatgattg tgcgttgcaa gtacccaatc 2820 gtatcaacgg cagagtgaca cgccgcgact cagtaacaac cacgcctact ggcattaacc 2880 cgatcaagcc tggcggacgg tgtaggatcg tccattccga aacagcgagc tacaacccgt 2940 ttcgtgcgct gcaacgacaa gaaacgatca gtgtctccaa gaagcaaggc gtcgtcgatg 3000 caccggaccg aaccgacgat ctggcactca acggaggtgc tgaaagaacg acaccacgga 3060 agatctggca tgctgtggca gtgctgcagt tgctccaaga acctcgagat actttccaga 3120 acaaggacaa gaaacctgcc acccagagta agcccaacga ttcactctac gacgaaaaga 3180 ttgcgcagaa ttggctcatc gaccgagagt tgattgcaac caagatgctg aacacaagct 3240 acctgtaccg cgccgaagca aaccttaccc cagatgaacg cgaaactcta cgaccgcttg 3300 aagaagccag accaacattc acgatcctga agcggccctg ggacccgggg tcgacatgcc 3360 acacgtcgaa gccaatcagc gacccagacc aaaccgcggt acaagagcga agatttagaa 3420 atgctgttga gaggactcaa cggggggagt a 3451 // ID CACTA-2_AA repbase; DNA; INV; 6076 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-2_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6076 BP; 2046 A; 1094 C; 1163 G; 1773 T; 0 other; cactgaacca agaaatatca tgaaaagtat taaaaacgtc acatacttct ccgttaatcg 60 aattcctatt cattttatgt gtacctaata acatccatgt acaaaaccaa taaaaatcgt 120 tatgaaacct cataatattc atctaccaat cctctggatt gtacatgaac aaaagtcatg 180 tatgctgtca gataaatgct atgtgatttg ctgatgtgaa aatttagtcg ctttatagca 240 gttttttcaa gatggcgaat ctggttgacg aacttctggc ggatttagga ttctcggcgg 300 attcactgca agtatttcaa cgtatgttcc ttaaaattac gtaaaagcat aaaaaaatta 360 aattgttttt tttttcagtc tgcggtattc gtttttatga gttaattaca tattcagtgg 420 aagattttga gtctagctta agtcagagcg atataaattg gaattaccac aaagtgttcg 480 aatacgtaaa tgcctggcgg aaacgaaatg taagttaaat gttttcaaac aaatattaaa 540 agtaaatcta ttgtcaatgt ttacagaaag cagcgattct caaagtccta gcacctgact 600 tactggtaaa acaggaaaac gttcagggag gaccggtgca agcggaagac gacggaaaac 660 aaccggaaca aacaccagac aatccagtgc agtatcaaaa agtggccggc ggtgcggagt 720 caaacgccaa tccggtgcaa gctaaagacg acggaaaaca gccggaagaa acaccagaca 780 atccagagca ttatcaagaa gcggccggag gtgcggatac aaacgccaat ccagtggaac 840 aggtgaaggg aggctccgag acaacaagaa cggttgattc tgatcagcaa gagtccacgc 900 ccaagacagt ggccgagcca gatgggacat cagcagtaga tgcgggaaca ccaaacaacg 960 agaaacactc ctattctgag gaggaacatg tttcggaaaa ggccatcacg cttacagctg 1020 gtctgcttct gtctcttttg aatggcaacg aagagggaaa aagtttgatt gagaaggcaa 1080 aaatttgtga actatccgat agtaatcagc atgtacttgc cggaaccatc gctcgattcc 1140 acctgaacca cagacgtaaa ttactcacgg aagatctgga aagttattgg ttggctataa 1200 aaagtctttt caagtttgag cggaaggtaa gcaacagtaa cgggaattga ggaaaattgt 1260 aatgtttaca tgttgatttg tttgtaggaa aattacttca tctgcagaag cggccaacga 1320 cggaaccacg gtggtaagat cgccaacaaa atttctaatc tgaagcaaaa gaaacgtaaa 1380 acagatctga acgaagaaca acactgcaag caagccaaac taacagcaag gcccgagaca 1440 tctgctgatc cttcttctgt cgaagccaca gaatggttaa ccttgaacca ggaaccttgg 1500 tctgttgtac tggagaagtg gaaatccagt ttcccgatta gaagaaaata tcttcaaaat 1560 cgagattcac tgcctatgct gttgtccaag tactcacact tcaagactgc tcatggattt 1620 caactagtaa gtaatagctt tatacttaaa tagtttgaaa aataccgtca gatggggctt 1680 ttccacacta ttgcgacata tttcaacttt acaaatgttg atatttcaaa ctattacaaa 1740 tgttgatgaa acataacgat tgcatagctt tgtgatatac cttttacata gggtaaactg 1800 tccatattcg cataacagtc tcacatgcta ttgtatttcc tatctacatg ggactgtcat 1860 gcgaatatgg gcagaaaaga actacttggg cactttcaag attcactttg gcatgggggt 1920 attcccaacc gaattgacct gaaactttgc aataagatgc ttattaatgc gatgcatatt 1980 gtggccaaat atgagttcga taacttcaaa aaaaaaacca ttgccgaagt tattctaaag 2040 tgccaagaac aggattcggc tccctagcaa atgttttgta gcttgctcag acttttttga 2100 aaaatattga ctagtgtatc ctgttattgt tgatattatt acaataaatt tgcatgtcgt 2160 tttgtagagc atagggagcc ggacccaatt cttggcactt ttgattgact tcggcagtgg 2220 gttttttttt tgaaagctac aaagctcata tttgaccaca atatgcgttg tagtgaagtg 2280 ctccatattg caaagtgtca gacaattcgg ccgagaaaac ccccccatgc caaagtgaat 2340 catggaagtg cccaagtggt tctgcaccct atgtgggata ataaaagcgt tgaagacaca 2400 aaattgctca gattaatatt tatgctgcaa ataaatccaa aatgtggctg ttcaaagtag 2460 ccacgtctga cggtaatctt atttcaattg tcatttttca tttataattc ttttatcaaa 2520 aggtggatgt cgacttcaaa ttcctgaata ttggttgctc agatggactg cttaagcttg 2580 aatcggcagc agttgcaata actacgttca tcgctaaaaa agccaccgat ccctcagctg 2640 tgaagttgtt gaaatattta accagcagta atgttagcca aggtttgcac atcaaaaaag 2700 aaacactagc aattcattta actctcatat tttatgtttc tagataccaa gctgtgcgca 2760 ctgctacttg gattgaatac ggtgcttcca ccaatcgtcg ctagtgcaaa gttcaaacca 2820 accatttgtg tggctcaaga ggatacggtt gtatttgtcg attcatgcga gcaaatcgtt 2880 gagaaagttc aatcgattta cgcgtcgtat gttgatcgaa agctacctat ttcaccaaaa 2940 ctagtcgccg taggaacagg tccggaaaac ttaaccggcc gttactatgt gagttacacg 3000 gacttgtgct atgagtttac atcatttgca agggcaatcg acgtgctggt caaactcaca 3060 catctgttcg gtttacccta ctcgaagata tccaagcttg tttggcattt catcagcaac 3120 tctatctatg gcattgagca acgtgagtcg tacgcaagtg taaatcgtct ccacaatttc 3180 ctgactcaag cggaaggtcc tttaacagtg aatcatgatg aactgcatgt ttgaacactg 3240 taagtctgat tttgagagcg ctgtcgtata tctcgatcat ctgaaaaacc atcatagagt 3300 tccagtaaac taccgctata gatgcaccgt accaaccatt cctgtatgca accaaatttt 3360 ttctaaacta tatccgttca agaagcacat tatcggacac agcagctcag aaagcgttca 3420 cagaacgaat gtcccgcaaa gtaatacaac agatttggat gatcgaactg tggctgagag 3480 tatagatcac aatacaaaac ggcagagaat cagcatggaa ctcagtagcg agagcgaagg 3540 gggattttcc gacaaaatga atgaaatcga gagatcagct acagcattta ctcttagttt 3600 acatgttaaa atgaatgtta gcaggaaaga tgtatatgat attcaaagag gaatagattc 3660 aatgttcaca gagatagcta atcagttaca agatcttcaa atgatagcac ctaatccaga 3720 tttgtttttt tgttatgaaa gctacatctc aaaaatgaaa aacattttca aatccgttag 3780 ctccgatcat aaactgttca atcacttgaa gcaaaagcaa gggttgcaat tgccatccat 3840 aatatgtatt gataagactg aagatctttt gaaaagagaa atatgtattg aagaagaaaa 3900 agaaatcgaa aacaacaaaa gttatatgat aatgatgccg aatgaatttc aaatttcaac 3960 ttttctggaa tcaaataata cattggacac aatattggac cacactacaa aacttcaaga 4020 atcagacagc tttttgaatt tcgtgaatgg ctccaaatgg aaacaaattt gtcaaaagta 4080 tgaaggggac attattgttc cagtttggct gtattctgac gaattcgaga taaacgactc 4140 gcaaagttca cattctaaca gacactcgat ttgtggtatc tactataatt tcccaaccgt 4200 acctgaccag ttccgctcga agctttctca tatatttgtg gcaggaatga tcaaaaaagt 4260 ggatatgaaa gttactggca tcaataagct attagctgta atgattgaca aattcaagtc 4320 tatggaagaa aatggaatca aattgaacgt taaaggaaag tctgtaaatg tacgaattat 4380 tctgtgcttg ctgcagggag ataacttggg gttgcataca atgttgcaat tttccagtgg 4440 tttcaacgcc acgttttact gtcgtttctg tcggagacca aaggacttgc ttcaagttga 4500 tgtcaaggaa catgcagatt gcatgcgcag aaaacatgat tacgacgaag acttagaact 4560 gaataaccag agtgaaacag gtatagcagg taattcaatt tttaacgact taccatcatt 4620 ccatgtaacg gaaaatagaa gtgtagatgc aatgcatgat ctttttagcg gtgggatatg 4680 caagtatgga ttggtagaaa ttttggatta ttgcattcac gaaaaacatt ttttcactgt 4740 tcaacaattc aaccaccaaa gaaaaatatt tagtaaacat tgctctgata gcgagctaca 4800 aaggatgccc gatataacta gtagttacaa tagcaaagag aaaaaaaggt cagtttcgtt 4860 taaagtaaca tccagtgaga tgcgattatt aattcattac tttcctttga ttgttggctg 4920 ctttgtacca aggggagatg aaatttggaa ttattgcaaa gcattaatta agctagttga 4980 tatgtgtctc aaacgatctt ttaccgaagc agaaatagac aatcttaaag aagctatagc 5040 aatacaccat gaaatgtaca aacggttgtt caacaaggac ttgaaaccta agcatcattt 5100 tgtcgtgcat tacccaagtt tgattcgaag ttctggacca atagaaaaga tgatgtgttt 5160 cagaaacgaa gcaatgcaca aaaattttaa gcaatatgca catattatgt cgtctcgcaa 5220 aaatatatgt tatactttat gtgtgaaagc ttctttgcaa tttgcctatc atttgcttaa 5280 tggtattttt atgaaggatg aagtttatgg agatttcttt gtcagtgata ttcgactaat 5340 tagctactat gataatattt gtaaacctat ggtcgttaaa aatacttttg atgttgccat 5400 ctcaaacaaa atagtttaca aaggtacaac ttacagaagc ggaaattttt taactttgtt 5460 tgtcagtaat attttatgtt tgttccaaat actagacatt atgcaggaaa atgaacaata 5520 ttattttctt gctagaatgt ggaaatctgg aatttttgac gaccattaca tggcatatgt 5580 agcatacgaa gctttacctc tagttgaaat aataccaatc aatgcattca gtagtccacc 5640 aattacggta catagaatag aagaacattt atattttcgt ttaaagcata attttaattc 5700 gattgaatgt aattagcttt taaggattaa tacagattat ttgttcaata aaggatttta 5760 attacaatta ttgtttcatt caattttact aaagcaaacc ccatatctaa actcacctaa 5820 aatctatcat aatcttatat aaaaatcatt aaaaaatcaa ttgtttatca cctgagtatc 5880 acatacaatt catcgacaaa gttttatgat tcatatatgt ttttcacatc aaatgtacat 5940 gctttaactg atgattttaa tcaaaatcaa ctcgccaaag tctatgtgat attatgatga 6000 acgttatgtg aaaaactaca tggaaaaaaa ctatgtatga tttcagataa atcttacatg 6060 attcctctgt tcagtg 6076 // ID hAT-30_HM repbase; DNA; INV; 2803 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-30_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2803 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2019-2019 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 763..2604 FT /product="hAT-30_HM_1p" FT /translation="MPKESDIWKFFEKSSKFEAKCKICFKSYKINCGSTST FT ITSHVKTKHEASWKEHLLKTTEILNKQKFKEKETKTTTIESCIIKNQKYGL FT NDPKQLKFDLDLTKFICKLGLPFEILDNEGAQEFIDEVNSKFTVKSASTFR FT RQKLPMLYEKVKEGLQIKFLKDFPEVTGVSFTTDLWTSRNNDSYMSLTIHY FT ISSDFXLVSFLIACTPFTGRHTGVAIAVNLDAFIANLNLNEEVHRACVNDN FT GSNIVLAAKESDEINSELRCNDHTIQLVILKAIKNSIELSKAIKNCTDLAS FT HTHRSALSTTKLQDACEELGIKPRKLIVPCTTRWNSDYMCIKSVLEVKDAI FT KTLAKTNDEFERSCPSDDQWKTLEYSIPFLEKIYNVSVTLSADKRPTIQDV FT IPELYSLHQELLSHQHHKHKPTRTFIKMLICELQERYPLNGAEQLINCFAN FT FLDPRYKGLHLIEYKKCDEIKEAILTQEKEKKHEKASMSSSKTDNESNENV FT KVNHHELLRKRLKQDILARSASTTTKLGLEIDHYLAAETAEETTDILEWWK FT TMKLQYPILSNYARKYLCIPATSATSERVFSSAGNVVTARRTTLAVENVEK FT MVYIKENIKKIKINF*" XX SQ Sequence 2803 BP; 1091 A; 415 C; 434 G; 862 T; 1 other; taaggccgtg cggttaaccg aaccgaaccg ttctttcggt gaaaaccgaa acaaaaaaac 60 cgaatcgaac cgaaccgatc taattattta aatcaaaata aaaaccgaac cgtaactgaa 120 aaaactttta aaaaccgatt taaaatttta aaatttcaaa aacattattt gacatcaaat 180 ttttaaataa aatataattg ttatttaatt taagaaattt caaaccgcag aaaagcaaag 240 ttaaaaaggg aacaatgaaa aataaacata ataaaatagc ttgccgcgaa attagaacaa 300 atgttgcgga ccacgatgga gtaagatttt ttaagggttg gatatcaata tcgtgtagta 360 taagcttttt ctaaaaagac tgtatttttt taatcttatt taaatttcta atattgttta 420 taatatttat cttataatat tattattttg gaaacctcgt tttctaaact gttttcattg 480 tataaataat aaaactgata taatcaaaat aatttgttgt ttaaatatta ttaatgcctt 540 taggagcatg caaggcgtaa aatagtttaa tatttgcatt gtaactaata atgttctaaa 600 aaataattaa aatacatttt aaggattttg aaaaattatt aagttaaact tatttatatt 660 gttatctgat attatacttt ttcaaaatta aactatgttt atttatgttt attatacggt 720 ttggttgtct tactaatcat ttttatttta ttcgcattaa agatgccgaa agaaagtgat 780 atttggaaat tttttgaaaa gtcgagtaaa tttgaagcca aatgtaaaat ttgctttaaa 840 tcttacaaaa taaactgtgg cagtacttca acaataacca gccacgtcaa aactaaacat 900 gaagcatctt ggaaagaaca tcttttgaaa actacagaga tacttaataa acaaaaattt 960 aaagaaaagg agacaaaaac aactacaatt gaaagttgca ttataaaaaa ccaaaaatat 1020 gggttaaatg atccaaagca attaaagttt gacttagatt taacaaaatt tatttgcaag 1080 cttggtctac cttttgaaat tttagacaat gaaggagcac aagagtttat tgatgaagta 1140 aatagcaagt ttactgtcaa atctgcatca acatttagac gtcaaaagct tcctatgttg 1200 tatgaaaagg taaaggaggg tttgcaaata aaatttctca aagattttcc agaagtaact 1260 ggagttagtt ttacaacaga tctctggaca agtcgaaaca atgattctta catgagtctt 1320 acaattcact acattagttc agactttart ttagtaagtt ttctcattgc ttgcacacca 1380 tttactggta gacacactgg ggttgcaatt gcagtaaacc tggacgcttt tattgctaat 1440 ttaaacttaa atgaagaagt ccatcgagcc tgcgttaatg acaatgggag caacattgtt 1500 ttagctgcta aagaaagtga tgaaattaat tctgaattgc gctgcaatga tcacacaatc 1560 cagctagtta ttttaaaagc aatcaaaaat tcaattgaac tttctaaagc aataaaaaac 1620 tgtacagatt tagcatcaca cacacataga agtgctctct ctacgaccaa gcttcaagat 1680 gcttgtgaag aacttggaat taagcccagg aaattgattg ttccatgtac aactcgatgg 1740 aattctgatt acatgtgcat taagtcagtc cttgaagtca aagatgcaat caaaacactt 1800 gcaaaaacca atgatgagtt tgaaaggtca tgtccctcag atgatcaatg gaagacatta 1860 gaatacagca ttccatttct tgaaaaaata tacaacgttt cagtaacttt atcagctgat 1920 aaaagaccaa caatccaaga tgtgatacct gaattatatt ccttgcacca agagcttctt 1980 tcccatcaac atcacaaaca taagccaaca agaacattta taaaaatgct gatttgtgaa 2040 cttcaggaac gatatcctct taacggtgct gagcagttga ttaattgttt tgcaaatttt 2100 ttagatccac gttacaaagg cctgcatttg attgaatata aaaaatgtga tgaaataaaa 2160 gaagcaatat tgactcagga aaaagagaaa aaacatgaaa aagcttcaat gagtagcagc 2220 aaaacagaca atgaaagtaa tgaaaatgtt aaagttaatc accatgaact gcttagaaaa 2280 agactgaaac aagatatttt agcacgcagt gcttcaacga cgactaaatt gggtttagaa 2340 attgatcatt atttagcagc cgaaacagct gaagaaacga cagatatttt agagtggtgg 2400 aaaactatga agttgcagta cccaatttta tcaaactatg cacgaaaata tctttgtata 2460 ccagcaactt ctgcaacttc agagagggtt ttttctagtg ctggtaacgt tgtcactgcc 2520 agaagaacta cgttggcggt tgagaatgtt gaaaaaatgg tttatataaa agaaaatatt 2580 aagaaaataa aaattaattt ttagtttttt ttggtaaaaa tctgtaagtc taaatttatt 2640 taataaatga cctctaaaac tttgttttgt ttttaaccga aaccgaaccg aaccgatcaa 2700 aaataaattt aaaccgaacc gaaaaaaacc gaattgttag tttaattttt aaaaccgaat 2760 cgaaaaaaac cgaaccgaaa aaaaccttaa ccgcgtggcc tta 2803 // ID Gypsy-2_PPc-I repbase; DNA; INV; 5065 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PPc_; KW Gypsy-2_PPc-LTR; Gypsy-2_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-5065 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 996-996 (2010). XX DR Genome; chrUn; Positions 96507831 96512895. XX CC Positions [3789-4271] - Integrase core CC 'TAAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 380..2374 FT /product="Gypsy-2_PPc-I_1p" FT /translation="MGEIEEDMADLKRLVASMAQLLKNQSESKTVADERSN FT ELSIASLNAIESRIQEFVYSPEDGATFEKWWNRHEDIFMIDLKELDELKKV FT RLLIRHISTTVERTFTELIAPTKWIDMKLDQVKAKMLTLFGDNTSIFDRRR FT TMLDLKMSKENIDDVRVLAARVNLTVENAQVNDATINEWKVLTFLHSLDLP FT RYSDVHLKMMQTAKQKGKDCTLDDLLSVFNDMSQLKKDSSSITDSRRNVNY FT VDKKRDNQQNKGFGNKSQNRQEKPKYTHNEPCAGCGSKSHARAECSFKESN FT CNKCGKKGHIARVCKSKKTYTVSVATIATSDYHIQLRLNGYTSSVKIDTGA FT DITIISETMWRSIGEPKCSTADCTATCANGMTLGLRGKFKAKAEYGGVQAT FT GDVYVTPKNINLLGKNFIKMLNLVEIREPGPRIHEVTTPLSTSIEYTEWVK FT KEYPDVTASGLGRCTEMTASLQLKPDAKPIFVKARPVPYALTENVETELNR FT LEKSGVIEKVEYSTWAAPILTVSKPNGSIRMCADFSTGLNAAINLPAHPLP FT VPEDIFATLNGASIFSQIDLSEAYLQVPLDQEAQKLLVINTHKGLFRYKRL FT PFGVKAAPGIFQRLMDTMLSGVKHAVPYLDDIIIGGRTKKEHDNTLVEVEQ FT RRNMIILSWRSNKEGT" FT CDS 2772..4976 FT /product="Gypsy-2_PPc-I_2p" FT /translation="MVNYYGQFIDGMHKLRSQLDHLLKDNVKWKWTKECSK FT AFTEIKGKLNQQLSLVHYDPQKEIVVAADACEDGIGAVILHRFPDGSLRAV FT SHASRKLKAAEKNYGQIEKEALALIFAVGKFHKYVFGRRFTLQTDHKPVLS FT IFGSPKGVPAYTAKRIYRWAETLLMYDFRIEYINTDSFFYADALSRLISEC FT QSAQEVDIALVQSEEDVHNMLDSAIRRLPVRSVDIQSETENDALLQEVIKK FT HIKGWSERDNKQLNLKPFYLRRHALSMVKGCLLFGSRVIVPKTLQSRVMKD FT LHAEHPGVVRMKSLARSICFWPGMDQEIENTVLKCDRCAKAAKAPVKVPLQ FT PWPTAARSWERVHIDYAGPVRGEYFLVVVDAHSKWPEVYCTQRITASITVD FT FMKDSIARYGIPEVIVSDNGTQFTSELFSQMCASYGIKHITIAPYHPQSNG FT QAERFVDTLKRSLKKMNGDAPNQEIIRQFLMTYRRTPNPNVPEGKSPAEVF FT IGRSIRSKLDLIRPTKRSDKVNERMKDQFDKRNGTKDRWFSVGDQVYYRAP FT DGPNRFQWLPAVITGKKGRVMFEIEVNKKKQRAHANQLRKNAVSQPPAEKG FT DKQLPLDLLLDTFDLDRNNQVEIDHHPEVDQREMVIEDNPMMDYAMEDIHE FT DVPVMNRMIEPEEELEENENDLINDGISEASTPNSSPFASAPTSPQPSPVK FT QPAAPLPTRPKRATKPIVRLDIDPSKKTYAKETKQ" XX SQ Sequence 5065 BP; 1508 A; 1134 C; 1293 G; 1130 T; 0 other; tattggcgtt caggacactc gaactgcgga ttaaggagag ttttccttcc cccttttcga 60 ctcattgtgt tgtcgttgta agtcttgggc gtggatcttg ctggttgtgc cagtaaggga 120 gcttgagcag ggttgttcgg tagcctgaaa gcgtggatct tgtggtcgtg ccacaaggga 180 gcttgcaagc cgaggtgaaa tcgaaattgg acgtaagtgc tgctcaattt tcgaatttcg 240 tcagagtgat aaataaggac agttgactcg ccggtcgaga actgagagta aagccttgta 300 ggccgattcc tgtgattgag cccgcgttga ttttcatcaa cggggaatat tcactatcac 360 ttaccgacac aaccagagaa tgggagaaat agaagaagat atggcggatt tgaagagact 420 ggtggccagc atggcacagt tactcaagaa ccagagtgag agcaagactg tggctgacga 480 gcgatcgaat gagctgtcga ttgcctctct caatgcgatt gagtctcgca tccaggagtt 540 tgtctactcc ccggaggatg gtgcaacctt tgagaaatgg tggaaccgac acgaggacat 600 cttcatgatc gacctcaagg aattggatga gctgaaaaag gtgcgattgc tgatcagaca 660 catttccacc actgtggaga gaacattcac agagctaatc gcccctacca aatggatcga 720 tatgaaactc gatcaggtaa aggcaaagat gctgactctg tttggcgaca acacgtccat 780 cttcgatagg agacgtacaa tgctcgattt gaaaatgagc aaagaaaaca tcgatgatgt 840 gagggttctc gcagcacgtg taaatctaac ggttgagaat gcccaagtga atgatgcaac 900 gattaatgaa tggaaagtgc ttactttctt gcattcgctt gatctccctc gttactctga 960 tgtacatttg aagatgatgc agacggccaa acagaaaggc aaggactgca cactggatga 1020 tctgttgtct gttttcaacg atatgtccca gctgaagaag gattctagtt ccatcacaga 1080 ttcacgtcga aacgtgaact acgtggacaa aaagagagat aatcagcaga ataagggatt 1140 cggaaacaaa tcacagaaca gacaagagaa accaaagtac actcataatg agccgtgtgc 1200 tggatgtggc agtaagtccc atgcgagggc ggaatgttca ttcaaagaat ctaattgcaa 1260 caaatgtggc aagaagggcc acattgctag agtgtgcaag tccaagaaga cttacactgt 1320 gagcgtagcg acgattgcta catcggacta ccacatccaa ctgagactca atgggtacac 1380 atcttccgtc aaaatcgata ctggtgcgga tatcaccatc atatctgaga cgatgtggag 1440 atccattggt gaaccgaagt gttccactgc cgactgtacc gccacgtgtg ctaatggaat 1500 gacactgggg ttgagaggca agttcaaagc aaaggctgag tacggtggtg ttcaagcaac 1560 gggagacgtg tacgttacac ccaagaacat caacctcctc ggaaagaatt tcatcaagat 1620 gctcaatctc gtcgagattc gagagcccgg tccgagaatc catgaggtca ccacacctct 1680 gtcgaccagc attgagtaca ctgagtgggt gaagaaagag taccccgatg ttactgcaag 1740 tggactggga agatgcacag agatgacagc atctctacag ttgaagcctg acgcaaagcc 1800 catcttcgta aaggcaagac cagtaccata tgctctcaca gagaatgtcg agacggagct 1860 caaccgtctg gagaagagcg gagtgattga gaaggtggag tacagtacat gggcagcacc 1920 catactcact gtgagcaagc cgaatggatc gattagaatg tgtgcagact tcagcacagg 1980 cctgaatgca gctatcaatc tgcctgctca tcctcttccc gttccggaag acatattcgc 2040 cactctgaat ggtgcctcga tcttttcaca gatcgatctc tcagaggctt acctgcaggt 2100 acctctcgac caggaggctc agaaattact tgtaatcaat acccacaagg gactcttccg 2160 atacaagaga ctgcccttcg gtgtcaaagc tgctccaggc atattccaac gactaatgga 2220 cacaatgcta agtggagtga agcatgctgt tccgtatctg gatgatatca tcatcggagg 2280 tcgaacaaag aaggaacatg ataatactct cgtggaggtc gaacaaagaa ggaacatgat 2340 aatactctcg tggaggtcga acaaagaagg aacatgataa tactctcgtg gaggtcgaac 2400 aaagaaggaa catgataata ctctcgtgga ggtcgaacaa agaaggaaca tgataatact 2460 ctcgtggagg tcgaacaaag aaggaacatg ataatactct cgtggaggtc gaacaaagaa 2520 ggaacatgat aatactctcg tggaggtcga acaaagaagg aacatgataa tactctcgtg 2580 gaggtcatgt gcaagctcaa gcaatacgga ctacgcactc gagcagagaa gtgcacgttc 2640 ggattgaagg aagtgagctt tctcggattc atcatcaata aggacggtcg ccacaccgat 2700 ccgaagaaga cggaggcaat ccgtacgatg ccagaaccgg agaatcagat gatgctccgc 2760 attttctggg tatggtgaac tactatggtc agttcatcga tggaatgcac aagctgcgtt 2820 ctcagctgga tcatctcctg aaggataatg tgaagtggaa atggacaaaa gagtgctcga 2880 aagcattcac ggagatcaaa ggcaagctca atcagcagct aagtctcgtc cattatgatc 2940 cacagaagga gatagtggtt gcagcagatg catgcgagga tggcatcggt gccgtcattc 3000 tgcacagatt ccccgatgga agtctacgtg cagtgagcca tgcatcgaga aagctgaaag 3060 cagctgagaa gaactacgga cagattgaga aagaggctct cgcccttatc ttcgccgtgg 3120 gaaagttcca caagtatgtg tttggtcgtc gtttcacgtt gcaaaccgat cacaagccag 3180 ttctatccat cttcggctcg cccaaaggag tgccagccta cactgcgaag aggatataca 3240 gatgggctga gacattgctg atgtacgatt tccggatcga gtacatcaac acagactcgt 3300 tcttctatgc tgatgcactc tcaagattga tcagtgagtg tcagtcggca caggaggtgg 3360 acatcgctct ggtacagagt gaagaggatg ttcacaacat gctcgattca gcaattcgcc 3420 gtttgccagt cagatcggtg gatattcagt ccgagactga gaatgatgca ctcctccagg 3480 aagtcatcaa gaagcatatc aaaggatgga gtgagagaga caacaaacag ttgaatctga 3540 aaccattcta cttgagaaga catgccctga gcatggtgaa gggatgctta ctcttcggat 3600 ctcgtgtaat cgttccgaag acactacaat ccagagtgat gaaagatcta catgcggaac 3660 atccaggtgt tgttcgcatg aaatcactcg cgagaagcat ctgtttctgg ccaggaatgg 3720 accaggaaat cgagaatact gtactcaaat gcgatagatg tgctaaagca gcaaaggcgc 3780 cagtgaaggt tcccttgcaa ccatggccta ccgcagcaag atcatgggag cgtgtccata 3840 ttgactacgc aggtcccgtc cgaggcgagt atttcctcgt cgttgtcgat gctcatagca 3900 aatggccgga ggtatactgc actcagagaa tcaccgcatc gatcacggtt gatttcatga 3960 aggattcaat cgcgagatac ggtatcccag aagtgatcgt gtcagataat ggcacacagt 4020 tcacttcgga actgttcagt cagatgtgtg cctcttacgg catcaaacac atcacaatcg 4080 ccccgtacca tcctcagtcc aacggacaag cagagagatt cgtagatact ctcaagagga 4140 gtctgaagaa gatgaatggt gatgctccca accaggagat cattcgtcaa ttcctaatga 4200 cgtacagaag gacacccaat cccaatgtac ctgaagggaa gagtccagca gaagtgttca 4260 tcggaagatc aattcgatcg aaacttgatc tcattcgtcc caccaagaga tcagacaagg 4320 tgaatgagag aatgaaggac caattcgaca agcgaaatgg taccaaagac agatggttct 4380 cagttggtga tcaagtgtac tatcgtgcac cagatggacc caatcgcttt cagtggcttc 4440 cagcagtgat cactggaaag aagggtcgag taatgttcga gatcgaagtg aataagaaga 4500 agcagagagc tcatgcgaat caacttcgca agaatgcagt ctctcagcct ccagctgaga 4560 aaggagacaa acagcttcca ctcgatctct tactcgatac attcgatctt gatagaaaca 4620 atcaagttga gattgatcat catccagagg tggatcagag agaaatggtc attgaagata 4680 atccaatgat ggattatgca atggaggata tccatgagga tgttcctgtg atgaatagaa 4740 tgatcgagcc tgaagaggaa ctcgaggaga atgagaatga tctcatcaac gacggaatct 4800 ccgaagcgtc tacacctaat tcatcgccat ttgcctctgc gccgacatca cctcagccga 4860 gccccgtcaa gcagcccgcc gcccctcttc caacgagacc gaagcgagct acgaagccaa 4920 tcgtgagact ggacatcgat ccgagtaaga agacgtacgc caaggagacg aagcagtaat 4980 cccgtcccct attctacgtc ctcacctcac cgctttcccc tgtttctgta ttttaagatc 5040 ggcagtgcca atcttgaagg ggagg 5065 // ID piggyBac-19_SM repbase; DNA; INV; 2481 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-19_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2481 RA Jurka J.; RT "Families of autonomous piggyBac elements from planaria."; RL Repbase Reports 9(8), 1829-1829 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 422..1879 FT /product="piggyBac-19_SM_1p" FT /translation="MSSERFPPAAKEKAEKSLRGNELQCPLPLRERLGLRN FT DPPPSNVVSTGRGLMLRKNLPGASGISTLPINRPLPQVPVERKRSMRHAAL FT QALAILTNVRADQSGSDVSSEDEDSNSENSSENSTSSSSPSSDGQEDIHIQ FT SGRDGTFWKKIDDNGVVTGRTPAHNVFRNKPGLTAYSRSVSSELEAWRLII FT DEGFIRHIRHCTEKYASQSHSNWSVSDSEMDKFIGLTYLRGVMNQTEFPLN FT LLWSEDMGISAFRRTMARNRFRELQRFLRFDDKSTRSERLHSDKFCLASLI FT LSRLVENSQKSYKPEESLTIDEQLFPTKARCRFTQYMPTKPDKYGIKFWVL FT VELNSKYCVNILPYLGKDETRQDSLGTHVVMKLMEPYFGLGYNVTTDNYFT FT NNELAQKLHARRTTLVGTVRANRRELPPIPKLELYESAFYENGPSNLTIYR FT CKPTKHVVMLSTMHKGSHCSDNDKHKPSTIIYYNKNKCAVK" XX SQ Sequence 2481 BP; 769 A; 472 C; 533 G; 703 T; 4 other; cactagaaag accaaaggcg tcaattgacg cctttaaaat tatttaacag aattttattt 60 attcatctgt tgaaatgcta tttttttcaa taaattttat taaattaaag gcaagagtta 120 tcgctggtca atttagtggg ttacaacagg gaaaatggta acagtccccc aaggtctgta 180 ataggaaact ntgaaagtta ggaaaattca atacgaatcc taaaagatgt attgatgatg 240 ttttcgtcaa taaattcaat tttttgattg tgtacaattg ataaggttat tatgaaatta 300 ttattattaa agagtatgca taaagccttt atgtagttta gngaaaaagn tttgtcattt 360 ttaaaaaatt ggagtatgtg aaaatttttg gttcattttc tagtcggagc attttagaaa 420 tatgtcaagt gagagatttc ctccagcggc aaaagaaaaa gctgaaaaat ctctacgtgg 480 taatgaattg cagtgcccac ttcctctacg tgagcgtttg ggtttacgaa acgaccctcc 540 tccttccaac gtggtatcca caggccgagg actcatgctt cgaaaaaatc tccctggagc 600 cagtggaatt tcaacattgc cgattaaccg accattgcca caagtaccag tggaaagaaa 660 gcgatcaatg cgacatgcag ctcttcaagc tttggccatt ctaacaaacg tcagagctga 720 tcaaagtggg agtgacgtaa gttctgagga tgaagacagc aatagtgaaa atagttctga 780 aaatagtacc tcatcatcat caccatcatc tgatggtcaa gaagacatcc acatccaatc 840 aggtagagat ggcacattct ggaagaaaat agatgacaat ggagtcgtta ctggtcgtac 900 accagctcac aatgtattcc gtaacaagcc tggattgact gcatatagtc gcagtgttag 960 ctcagaactg gaagcttgga gattgattat tgatgaaggt ttcatccgtc atattcgaca 1020 ctgtactgag aaatacgcga gtcagtccca ttccaattgg tcggtaagtg acagtgaaat 1080 ggataagttt attggactga catatttacg aggagttatg aaccagaccg aattcccgct 1140 gaaccttttg tggtcagaag atatgggtat ttccgcattt cgtcgcacca tggcacgtaa 1200 tcgattcaga gagttgcaga gatttctgcg atttgatgat aaatccactc gatcagagcg 1260 tctacattct gacaagttct gtttggcgtc attgattttg tccagattgg tggaaaacag 1320 ccaaaagagt tataaacccg aggagagtct gacaattgat gaacaattgt ttccgaccaa 1380 agccagatgt cgctttaccc aatacatgcc aacaaaacct gacaagtatg gcattaagtt 1440 ttgggttttg gtcgagctca acagcaagta ttgcgtcaat atattgccgt atttggggaa 1500 agatgaaacc cgccaggaca gcttgggaac acatgtggtc atgaagttga tggagccata 1560 ctttggattg ggttacaatg tgacaacaga taactatttc accaacaatg aactggcaca 1620 gaaactccat gcaagacgca caacgcttgt tgggacagtc cgagcaaacc gcagagaact 1680 gcctcccata ccaaagctgg agctatatga atccgcattt tatgagaatg gtccgtccaa 1740 tcttactatt tatcggtgca agccgaccaa acatgtggta atgttatcga cgatgcacaa 1800 aggatcgcat tgcagtgaca atgacaagca caagccatcc acaatcattt actacaacaa 1860 aaacaaatgt gccgtcaaat gacgtcaaaa gcaggatgta ggcgctggcc tctggctgtg 1920 ttctacaaca tattggattt ggcagccata aactcatgga ttatattctg taaggccact 1980 ggtaaaaata tttcgaggcg cacgttcctt atgtccctat cgcagcagtt gataaacgct 2040 tctcaaacta ctccaacggc tatctctaac ttggctatcc cggaaaaatt gagtacgcgc 2100 atcaattgca gagtcgaatt gaactgtaat cacaaccgta caacgactgc ttgcactgtt 2160 tgtaagcttc cagtatgtgg acaatgtatg gcgaacattt gtctcaaatg tgctcatttt 2220 tgatgttatt attgaagttg tacaacatat gttgctgttt tggttttgtg tgaggcgtca 2280 attgacgccc ttggtcattc taggtatatt tgagtgctgc attcaaatga gagctctcag 2340 gctgtcatgg gctgggttta ttatagtttc agtaaaaata ataaangtca gaaaaaatga 2400 cttgaatcca tttagtagtg tctgagtaat ctagtaaaga aaatgaaaaa gcgtcaattg 2460 acgccgttgg ccgttctagt g 2481 // ID SMARN2B repbase; DNA; INV; 1150 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of non-autonomous Mariner-type family of DE repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW SMARN2B. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1150 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1891-1891 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 1150 BP; 493 A; 135 C; 135 G; 386 T; 1 other; cacctgtttc ggatgaatag caactcctat aataaaatga aaaaatcaat gttataatat 60 ctgtgtctgc atgtaatttg tcaaatgaat aaaaacttat attataaaga araaaatttg 120 aagaaatttt tttaacatac aaacataata catagtaaat aatgcataca aactttcaaa 180 atgtagaaat gtccagatga atacaaactc tatatattac tatttttaac ttttttagac 240 aattttttaa aaaaattttt gaaaaaattt cgataactct actttgagtt gtcgagaaaa 300 ttatgaaata atttcgtaag atttttcatg aaattcaatc aatttaatat ttttacattt 360 cagattttca gaaatatcca tcgaaactct agatttttgc ggagatgaag aaataacaat 420 gtaagattta ccttgaagtt aaaataaatt cattttttac ctcatttgta atttaccagg 480 agcaatggaa ccagaacctt caaatgtgga gtttaaaaaa cagaaaacgt aagttttccg 540 ttaacttaag tttaaaaata atatttttat agaatttcag aagtagacaa aatccgaatc 600 atcaatgcca ttgaaaatgg cgaagatttt tataaatttg ggaaattaat gggaatttcc 660 aaagcctcaa taaaatcaat aattataaaa tcaataaata tcaaaaattt ggatttctgt 720 caaaaggaaa aaagatggca gccaaaacga atttccaagg aacagagaaa taatttattg 780 gaatatagaa actttttgat aataaatgaa ttttatttta ataaatttta gttttaaatt 840 aatctgactt acattaaatc gaaaaataaa tccaataatt gacataaatt gaataaaatt 900 cctaaaaacc taaagaaaat taattacaaa taaatttaaa ataaatgttg ataaaaaata 960 ttatttaaga gcttgaataa tgatctcata attttacgag atcacaatcc aagccttaaa 1020 atttacttta ttaaaaaaat ttacttacca gccattttaa aaatggctaa tacataaact 1080 gtataataac tgcagtcgga tgttcattca aacttctatt atagaagttg ttaatgaact 1140 ggaacaggtg 1150 // ID Sola2-N1_CQ repbase; DNA; INV; 1930 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Sola2 DNA transposon family from Culex quinquefasciatus - DE consensus. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola2-N1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1930 RA Kojima K.K. and Jurka J.; RT "Sola DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 628-628 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >96% CC identity. 4bp TSDs. ~90bp TIRs. XX SQ Sequence 1930 BP; 688 A; 289 C; 279 G; 671 T; 3 other; gagcaattct ctgagatttc ggtcattcga ttttttctgt attttttaat ccggctgaaa 60 cttttttkgt gccttcggta tgcccaaaga agccattttg catcactagt ttgcccatat 120 aattttccat acaaatttgg cagctgtcca tacaaaaatg atatgtgaaa attcaaaaat 180 ctgtatcttt tgaaggaatt ttttgatcga tttggtgtct tcggcaaagt tgtaggtatg 240 gatatggact acactggaaa aaaatgatac acggtaaaaa aaatttggtg atttttwatt 300 taactttttg tcactaaaac ttgatttgca aaaaaacact atttttaatt ttttttattt 360 tttaatatgt tttagaagac ataaaatgcc aacttttcmg aaatttccag aatgggcaaa 420 aaatctttga ccgagttatg attttttgaa tcaatacaga ttttttcaaa aaatcgaaat 480 attggtcgca aaaatttttc aacttcattt ttcgatgtaa aatcgaattt gcaatcaaaa 540 agtactctag tgaatttttg ataaagtgca ccgttttcaa gttataacca attttaggta 600 acttttttga aaatagtcgc agtttttcat tttttaaaat tagtgcccat gtttgcccat 660 ctttgaaaaa aatatttttg aaaagctgag aaaattctct atattttgct ttattgaact 720 ttgttgatac gacccatagt tgctgagata ttgccatgca aaggttaaaa aacaggaaaa 780 ttgatgtttt ctaagtctca cccaaacaac ccaccatttt ctaatgtcga tatctcagca 840 actataggtc cgatttacaa tgtaaaaaca tgaaacattc gtgaaatttt ccgatctttt 900 cgaaaaaaat attttcaaaa atttaaaatc aagactaaca tttcaaacgg gccaaacatt 960 caatattacg cccatttgaa atgttagtct tgattttaaa ttcttgaaaa tatttttttc 1020 gaaaagatcg gaaaatttca cgaatgtttc atgttttaac attgtaaatc ggacctatag 1080 ttgctgagat atcgacatta gaaaatggtg ggttatttgg gtgagactta gaaaacatca 1140 attttcctgt tttttaacct ttgcatggca atatctcagc aactatgggt cgtatcaaca 1200 aagttcaata aagcaaaata tagagaattt tctcagcttt tcaaaaatat ttttttcaaa 1260 gatgggcaaa catgggcact aattttaaaa aatgaaaaac tgcgactatt ttcaaaaaag 1320 ttacctaaaa ttggttataa cttgaaaacg gtgcacttta tcaaaaattc actaaagtac 1380 tttttgattg caaattcgat tttacatcga aaaatgaagt tgaaaaattt ttgcgaccaa 1440 tatttcgatt ttttgaaaaa atctgtattg attcaaaaaa tcataactcg gtcaaagatt 1500 ttttgcccat tctggaaatt tctgaaaagt tggcatttta tgtcttctaa aacatatcaa 1560 aaaataaaaa aaaataaaaa tagtgttttt ttgcaaatca agttttagtg acaaaaagtt 1620 aaatataaaa atcaccaaat tttttttacc gtgtatcatt tttttccagt gtagtccata 1680 tccataccta caactttgcc gaagacacca aatcgatcaa aaaattcctt caaaagatac 1740 agatttttga attttcacat atcatttttg tatggacagc tgccaaattt gtatggaaaa 1800 ttatatgggc aaactagtga tgcaaaatgg cttctttggg cataccgaag gcaccaaaaa 1860 agtttcagcc ggattaaaaa atacaaaaaa taaaattaag aaaaaagacc gattccgtag 1920 agaactgctc 1930 // ID Gypsy-36_DWil-I repbase; DNA; INV; 5346 BP. XX AC scaffold_181150; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_DWil_; KW Gypsy-36_DWil-LTR; Gypsy-36_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5346 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181150; Positions 4248662 4254007. XX CC Positions [2757-3263] - Reverse transcriptase CC Positions [4326-4802] - Integrase core CC 'GTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 28..1593 FT /product="Gypsy-36_DWil-I_1p" FT /translation="MDINIEAHNLPTLKKWLELLDLPTTGSKAELVARLND FT VPVDRRGACSISIGTETVEESTEDESNRQLSETTNEKEGDNNTDVLLKKLC FT DLLRPQLFDIVAPQRQTENETHDSVPTSNDSGSENEMAAELIGAQNQRHEP FT VLVNQRSEEVLESGFSSAAKFQLAKEALLEFNGNSCGKKWVTQLTNISGVY FT GIKGVQMHMLLVSKLKGRAHEWLHANSTRILEPVGNICEQLVESFGKTVSQ FT ADARRLFESRIWSANESFVSYIDEKARLSDAICISEEERVDKAIEGIPVEG FT LRIQAQIQGFKCIAELRRAFSGIQLPVQRAARETTKVAATGRRSKEGFRCY FT NCNSKGHMSKDCKKPDRVPGSCFGCGALDHWVAKCPLNKEKLAKGCDDVRF FT VKIYFKTPTNPYLITACFIDSGSDVSLIKISCLPEKIHLISLNHKYYGLNK FT SPLLSYGKVKCFVFKNSIKVFFHLIVVDNESMSHEVILGRDFMKACNIKLD FT LNALKMVTIERKEGENELGERSEFES" FT CDS 1995..5195 FT /product="Gypsy-36_DWil-I_2p" FT /translation="MSHEVILGRDFMKACNIKLDLNALKMVTIERKEGENE FT LGERSEFESLSLQTCHTDLQIARNVNVVRTVDEIVKTKNCKNRIVTKECNP FT KREDEEIVSEIVKRDDSEIVSEMVNEIDSKIVSETENGDRSKMVTKSRNEA FT VIELVSEKFRSDRKCEERKETADVQSLSEVEFELLNIECFNTGVDCKIGER FT LDFKIRERFEQLFERSYLVAQRPELPDVKCEMKLNVEGSKAFSCAPRRLSY FT SEKGKLQAMLDGYLEKKIIRPSESEYASPIVLVPKKAGELRLCIDFRRLNK FT VLVKDNYPIPLIDDLLDKLGDKRFFSKLDLKDGYFHVFMEENSVKYTSFVT FT PLGQFEFLRMPMGLKNSSSVFQRFVNKIFTDMIRDNKVIVYQDDIMVASRT FT AEEHLDILKEVFRRIIRNKLELRIDKCEFFQSSIRYLGFIISEEGIRADDK FT GLEAVKNFPIPTNVHTVRSFLGLASYFRRFIRNFSLIAKPLYDLLKKDRVF FT KFGKEELDCFSLLKDKLVESPVLAIYNHKDEVELHCDASALGFGAMLMQRK FT EDKKLHPVFYYSKRTSDVESKYHSFELETLAIVYAVRRFRVYLQGRYFKVL FT TDCNSLTLTLNRIELNPRIARWALELQNYDYEVVHRAGKQMQHVDALSRCT FT NILVVDTNSFEDNLVICQNRDEKLQTIKRKLEFSEDKLFEMRNGIIYRKTN FT DGRLLFCVPSEMEESILFKYHNELGHIGRDKVMDAISKTYWFPNIKEKVVK FT HISNCLKCVAYSPKSGKEEGMLHSIPKGNRPFELIHIDHCGPIASGRSKKH FT LFVIIDGFTKFVRLYPTKTTNTKEAVLALKDYFRAYSRPFAIVSDRGSSFT FT SSEFEMFVTQNNVKHIKIATGSPQANGQIERVNRTLGPMLAKLSEPEKGIY FT WDLIVENVEHCLNNTLHRSIKQLPSKMLFGVEQKGKVADELREGLEQLGIV FT NDQKNLDTIRKDGEIHQKQSQKYNEQQYNSKRVAPTGYKAGDYIMVKNFDS FT TIGASKKLIPKHKGPYVIDKVLKNDRFLIKDVEGFQLSRNPYQGVWSSHNI FT KHWIGKRKSD" XX SQ Sequence 5346 BP; 1806 A; 835 C; 1260 G; 1429 T; 16 other; atatcagaag tggtgaactt taacgcaatg gacatcaaca ttgaggctca caatttgcca 60 actttaaaga agtggttgga gctattggac ttgcctacaa cagggtcaaa ggccgagctt 120 gtggcgcgct taaacgatgt tccagtggat cgacgaggag cttgttccat atctattgga 180 actgagactg tcgaggaaag tacagaagac gaaagtaacc gtcaactgtc agagaccaca 240 aatgagaaag agggagacaa caacactgat gtcttgctaa aaaaattgtg cgatttactg 300 cgtccacaat tgttcgacat agtggcgcct caacgccaga cagaaaacga aacgcacgat 360 tcagtgccaa cgagcaacga cagcggcagc gaaaacgaga tggcagcaga actgattggc 420 gcgcaaaacc agcgtcatga accagtgttg gtaaatcaac gctcggagga ggtgttggaa 480 agcggttttt cgtcggctgc aaaattccaa ttagcaaaag aagcgttgtt ggaatttaat 540 gggaactcgt gtggaaagaa atgggtcacg cagctgacaa acataagcgg tgtatatgga 600 atcaaaggtg tacaaatgca catgcttcta gtctccaagc ttaaaggacg ggctcatgag 660 tggttacatg ccaatagtac gcgcattttg gaacccgtgg gcaacatctg tgagcagctc 720 gtcgagtcgt tcgggaaaac tgtgtctcaa gcagatgcca ggcgcttatt cgagtctcga 780 atttggagcg ccaatgagag tttcgtcagt tacatcgatg agaaggccag gctgtcagat 840 gctatctgca tcagcgaaga ggagcgagta gacaaagcca ttgaagggat tcctgtggag 900 ggactccgca ttcaagcgca gattcaagga ttcaaatgca ttgcagaatt acgccgtgca 960 ttttcaggca ttcagctacc ggtacagcga gcagcaagag aaactacaaa ggtggccgct 1020 acgggcagac gttccaaaga aggtttccgt tgctacaact gcaactcgaa gggccacatg 1080 agcaaggact gtaagaaacc cgacagagtg ccagggtcct gtttcggatg cggtgccttg 1140 gatcattggg tggctaagtg tccacttaat aaggagaagc tggctaaagg atgcgacgac 1200 gtaagatttg ttaagatata ttttaaaaca ccaactaatc cttacttgat tacagcatgc 1260 ttcatagact caggcagtga tgtttcactt attaaaatct catgtttacc agaaaagata 1320 catttaattt cacttaatca taaatattat ggattaaata agagtccact tttgtcatat 1380 ggaaaagtca aatgttttgt tttcaaaaat tcaataaaag tattctttca tttaatagta 1440 gtcgataatg agtctatgag ccatgaagta attttgggaa gagattttat gaaagcttgt 1500 aatattaagt tagacttaaa tgccttgaaa atggttacga ttgaaaggaa agaaggcgaa 1560 aatgaacttg gagagcggtc ggagtttgag tcctnnnnnn nnnnnnnnnn gccacatgag 1620 caaggactgt aagaaacccg acagagtgcc cagggtcctg tttcggatgc ggtgccttgg 1680 atcattgggt ggctaagtgt ccacttaata aggagaagct ggctaaagga tgcgacgacg 1740 taagatttgt taagatatat tttaaaacac caactaatcc ttacttgatt acagcatgct 1800 tcatagactc aggcagtgat gtttcactta ttaaaatctc atgtttacca gaaaagatac 1860 atttaatttc acttaatcat aaatattatg gattaaataa gagtccactt ttgtcatatg 1920 gaaaagtcaa atgttttgtt ttcaaaaatt caataaaagt attctttcat ttaatagtag 1980 tcgataatga gtctatgagc catgaagtaa ttttgggaag agattttatg aaagcttgta 2040 atattaagtt agacttaaat gccttgaaaa tggttacgat tgaaaggaaa gaaggcgaaa 2100 atgaacttgg agagcggtcg gagtttgagt ccttgtcatt gcagacatgc catacagatt 2160 tgcaaatagc tagaaatgta aatgttgtga gaacagttga tgaaatagtt aaaacaaaaa 2220 attgtaagaa tagaatagtc acgaaagaat gtaaccccaa aagggaagac gaggaaattg 2280 ttagtgaaat agttaagagg gatgatagcg agattgttag cgaaatggtg aatgaaattg 2340 atagcaagat tgttagtgaa acagagaatg gagatagaag caaaatggtt acgaaaagta 2400 ggaatgaagc tgttattgag ttggtcagtg aaaaatttag atctgataga aaatgtgaag 2460 agagaaaaga aactgctgat gtccagtctc taagcgaagt agaatttgaa ttattaaata 2520 tcgagtgttt caacactgga gttgattgca aaataggtga aagattagat tttaaaatcc 2580 gggaaaggtt tgagcaattg tttgagagat catatttggt agcacaaagg cctgaattgc 2640 ctgatgtaaa atgtgaaatg aagctcaacg tggaagggtc gaaggcattc agttgtgcgc 2700 ctagaagatt gtcatattct gagaaaggga aattgcaggc gatgttagac gggtacttgg 2760 agaagaagat cattcgccct agtgagtcgg agtatgcatc acccattgtg ttagtgccta 2820 aaaaagctgg cgagctaagg ttgtgcattg actttagaag attgaacaaa gttttggtta 2880 aagataacta ccctatacct ctgatagacg atttattaga caagcttggc gacaaacgat 2940 ttttttctaa attggatcta aaagatggct attttcacgt gttcatggag gaaaactcag 3000 ttaaatatac atctttcgtc actccgttag gccagtttga gttcctcagg atgccaatgg 3060 gtctgaaaaa ctcctcatca gtgtttcaaa gatttgttaa caaaattttt acagatatga 3120 ttagagataa taaagtaata gtgtatcaag acgatatcat ggtggccagt agaactgccg 3180 aagagcacct ggatatactg aaagaagttt ttagaaggat aatcagaaac aagctggaat 3240 tgcgtataga taagtgtgag ttttttcagt ccagcattag atacttggga tttatcatat 3300 cggaggaggg aattagagcc gatgacaagg gcctagaggc ggttaagaac tttcccatac 3360 caacaaatgt tcacacagta agaagtttct tgggattggc ttcttatttt cgtcgtttta 3420 ttagaaattt ttcgttaatt gcaaaaccac tctatgacct attgaagaaa gatagggtgt 3480 ttaaatttgg aaaggaagag ctagattgtt tttcgttgct caaagataaa ttagtagaat 3540 cgccggtgtt ggcaatatat aaccacaagg acgaagtaga gctgcattgt gatgcgagcg 3600 cgttaggctt tggggcaatg ttaatgcaaa ggaaagaaga caaaaaactg caccctgttt 3660 tttattactc aaaacggacg tctgatgtag aatcaaagta ccacagtttc gaattagaga 3720 ccttggcaat agtatatgca gtccgcaggt tcagagtgta tttacaggga agatatttta 3780 aagtacttac agattgtaat tcactgactc tcactctcaa tcgaattgaa ttaaacccaa 3840 gaattgcgag atgggctctt gagctacaga attatgatta tgaagtagtg cacagggcag 3900 gtaaacagat gcaacatgta gatgcgctta gtagatgtac gaacatactt gttgtagata 3960 caaatagttt tgaagacaat cttgttatat gccagaacag ggacgagaag ttgcagacaa 4020 taaaaagaaa gttggaattt tctgaggata aattatttga aatgagaaac ggaataattt 4080 acaggaaaac caatgacggg aggttgttgt tttgtgtacc gagtgaaatg gaagagtcaa 4140 tccttttcaa gtatcataac gaactaggtc atatcggcag agataaagta atggatgcca 4200 tttcgaaaac gtattggttt cctaacatta aggagaaagt tgtgaaacac attagcaatt 4260 gcttaaagtg tgttgcgtat tctccaaagt ctggaaaaga ggaaggaatg ttgcacagta 4320 ttcctaaagg gaatcggccc tttgagctaa ttcacattga tcattgtggt cctatagcat 4380 caggcagaag caaaaagcac ttgtttgtta ttattgatgg ttttacgaaa ttcgtcagac 4440 tatatccaac aaagacgacg aatactaaag aagcggtgct agctcttaaa gactatttta 4500 gggcctacag tcgacccttt gccatagtat cggatagagg aagcagtttc acctcatcag 4560 agttcgagat gtttgtaact cagaataacg ttaagcacat taagattgcc acaggttccc 4620 ctcaagctaa tggccaaatt gagagggtaa atcgtacact gggacctatg cttgctaaat 4680 tatcggaacc agagaagggt atatattggg atttgatagt tgagaatgtt gaacattgtt 4740 taaataacac acttcatagg agcattaagc agttgccgag caaaatgctc tttggagtag 4800 agcaaaaagg caaagtcgca gatgagttaa gagaaggact agagcagtta ggaatagtga 4860 atgatcaaaa aaatttggat acaatcagaa aggatgggga aatacatcag aagcaaagcc 4920 agaagtacaa tgaacagcag tataatagca aaagggtagc acccacagga tacaaggcag 4980 gggattatat tatggtgaag aatttcgatt caactatagg agcgtcgaaa aagctcattc 5040 caaaacacaa agggccttat gtgatagaca aggttctcaa aaatgaccga tttttaatta 5100 aggatgtgga aggttttcag ctcagtcgta acccttatca aggtgtttgg agtagccaca 5160 atataaaaca ttggataggt aaaagaaaaa gtgactaaaa gttatgaatt aagcaaatga 5220 aaataatgaa aactgtaata aacttttatg taatgtacct taactaattt gtagttataa 5280 gatttcaact tattgtaacg ttaattatgt aaggatcagg agatcttacg gtcaggacgg 5340 ccgagt 5346 // ID BEL-629_AA-I repbase; DNA; INV; 6083 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-629_AA_; KW BEL-629_AA-LTR; Pao_Bel_Ele167; BEL-629_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6083 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5136-5696] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1262..5041 FT /product="BEL-629_AA-I_1p" FT /translation="MVRLRKCLQGKALEAVKCQLLHPSNLDQVIATLKMLF FT GRPEIIVHSLLQKINSLPAPKADRLGTLVDFALAVRNMVATVKACELEEHL FT CNLTLLHSLSERLPPMIRLNWATHRQSLRSATLSEFSDWLYKLAEAASTVT FT MPQFPGVVDNKPRRGRKEDGFLNAHTETVPKPKQFDASAGCLVCQGSCAAI FT EKCKRFLSFSLSARWDALREHKLCRSCLTIHRGPCKSAKICGKSGCQYKHH FT RLLHNDTKGKPETSSSGSQSQAKGHQQASVPVLQDVESCNTHRGGNKSVLF FT QYIPIVLYNNGIELHTHAFLDSGSSLTLMEESLAKQLNLKGEKSPLCLRWT FT ADTCRYEKDAIIVSLNVSGTHPGSGQHTLREVYTVKELKLPSQSLPADNLA FT DRYSHLRGLPIEPYSNVQPKILIGVSNARVMHVLDDREGKLDEPVAVKTRL FT GWTVYGTYTSANNSVQPVLPHSFHICSHFHGTDESLHEAVKNYFALDSLGI FT SAPKNQLLSKEDERALAMLRKVTTFQDGRYQAGLLWKYDDVRLPNNRSMAM FT RRHRCLTKRMEREPRLAETLRAKMEDYERKGYIRKLTQEESRMTGDRTWYL FT PIFPVFNHNKPGKIRIVFDAAASLGGVSLNSVLMKGPDQLNALPPVLYKFR FT ERLIGIGGDVAEMFHQIRMRPEDEHSQRILWCASEDTTEPCDYVMQVVTFG FT ATCSPSTALYVLNENASRFENQYPVAVDAICRRHYVDDMLTSVDTEEEAIQ FT LANDVRYIHLQGGFHMRNWVSNSSAVLEALGENPKPEKSMEMNAELAMEKV FT LGMWWNTTSDVFSYKLCTDRNRELLSGAKHPTKRDVLRTLMAIYDPLGLIA FT HYLMYLKVLLQEIWRAKTVWDDPIEEKHLEKWLTWLRILPDLESVKIPRCY FT FRHEARIDQATIELHTFVDSSESGFAAVSYFRFEVYDYVECALIGSKTRVA FT PLKFVSIPRLELQAAVVGTRLARSIAEGHSIKIDRRYFWTDARDVMCWLQS FT DHRRYSQFVAFRVGEILEATDITEWRWLGTKHNVADDGTKWKFRPDLKPTS FT RWFTGPPFLWKPKSEWPSSSLNGEETIIELRANFLGHIERTVRTILQAENF FT SSWQRMLRVTALVHRFIGNVRHKQKRKPMILGPLTQEELSTAESFLFRQAQ FT ADVYGEELAVVAAGNRLKKKSSLFKLNPFIDDHGVMRISSRLGECDFVDES FT ERHPIVLPRDHPTTRLVVASVHQRYRHQCHETCVNEVRREILHSTSAQSL" XX SQ Sequence 6083 BP; 1532 A; 1597 C; 1546 G; 1389 T; 19 other; tcttaaaatt ttcgtttctc ggatcgatcg aagatgccca gaaggagccg aaggcggaat 60 sgggtgccgg aggaaagcat ccagcagact tgcctgcttt gcaacctccc ggacagcagc 120 gatatggtga gttgcgacaa ctgcgctcaa tggttccatt ttgagtgcgt cggagtcaac 180 gcggacgtgg ccaacaggtc gtggagctgc ccggaktgct ccccagctac agatccggct 240 ccacaactgc cacaatcttc gacaccgtca gcatcaaatt caggtgagcc acatattcag 300 tcgtcttcac cgccaccgcc agctatgccc acagcagttc cgccaatccc aactctscga 360 atgccaaccc ckcgccctcg ttcgtcgcag atacaaaatc cgcccccacc aacacctctt 420 ccgcgtaccc cacttccacg aaccgcatca gtgccaattg ttgagactga agaagtgcaa 480 atcgttgatc aggaagtgcc ggacgaagtg aatctcgtgg aaagattgcc agcagacctc 540 cgtctccagt ttctggaaca gcaagaagcc atcgagcaga gatatttgcg gcgtaggttt 600 cagttgttgc tgggcatacc aacgaacgta aacaatgaaa attcgcaagg tgtcagtaga 660 aatgttccag caaatcgtgt tgtcgatctt ccccccgtgw gccattcatc accagtgtgt 720 gtcaatcgat tgcccccacc acccatgcca tcgaackgga atgtttcgcc gttcgttcca 780 atctckcaks mtgcgaaagg caggatgatc catcggtcat tccgamaatg cttaatgttc 840 ggccggatgt ttcgacattc cctttgcggc agaccaatca gcaaccaaat tttcccgctt 900 ctcatttacc tgcctccatt ctccgcccca ctgcttcgcc gttcgcacca cctccgactc 960 gtttgcgcca taacttgcct gagttctcta atccaatggc cctcaacgca atcgagcttc 1020 tgccttcgtc ccttttgcgg gcgattcgca tcccgctgca gccgagtatt acccctcggt 1080 tttmcamgac gacccaacca cagtaggagg gacggcgctg ctaaatagaa gccagatcgc 1140 tgctcgtcat gcagttccga aggacttgcc tacattctcc ggtgagccag aggagtggcc 1200 gctatttttc gmmtcktttg agaacacgac tcacctgtgc ggatttacag cggaggagaa 1260 catggtccga ctgcggaagt gcctgcaggg gaaagcattg gaagcagtaa aatgtcagct 1320 gttgcaccct agcaatctgg atcaagttat cgctaccctg aaaatgcttt tcggtcgtcc 1380 ggaaattatc gtccattcgt tgctgcagaa gatcaacagt ctgcctgccc ccaaagcaga 1440 tcgtcttggg acactcgtcg acttcgcttt agctgttcgt aacatggtgg ctacagtcaa 1500 ggcttgtgaa ctagaagagc acctatgcaa cctcacacta ctacacagtt taagtgagcg 1560 gcttcctccc atgattcgcc tgaattgggc aacccatcgt cagtctcttc gttcagcgac 1620 gttatctgaa ttcagtgact ggttgtacaa gttggcggaa gcagcaagca ccgtaacaat 1680 gccacagttt ccgggagtcg tcgacaacaa acctcgtcgg ggcagaaagg aggatggatt 1740 cttaaatgcg catacagaaa cagttccgaa gccgaagcag tttgacgcta gtgctggatg 1800 tctcgtctgc cagggtagtt gtgctgcaat agagaaatgt aagcggttcc tgtctttcag 1860 tctgtctgct cgctgggacg cactacgaga acacaagctg tgccgcagct gcctaaccat 1920 tcatcgtggt ccctgcaaat cagcaaaaat atgtgggaaa agtggatgcc aatacaagca 1980 ccatcggctg ctgcacaacg acactaaagg taagccggaa acgtcatcgt ctggttccca 2040 gagccaggct aaaggtcacc agcaagctag tgttccagtg ctacaagacg tcgaatcatg 2100 caacacacat cgaggaggaa acaagtccgt gctgtttcag tacattccga ttgtgcttta 2160 taacaatggg atcgagcttc acacccatgc gttcctagac agtggatcat ccctgacgtt 2220 gatggaggaa agtttggcga aacaactgaa cctgaaaggc gagaagagcc ccctatgtct 2280 gcgctggaca gctgatacct gtcggtatga aaaggacgcc attatagttt ccttgaacgt 2340 ctccggaaca catcctggta gcggtcaaca tactttacgc gaagtctaca cggtgaagga 2400 gttgaagcta ccctcgcaat ctctacctgc tgataacctt gctgacaggt actcgcacct 2460 caggggtttg ccgattgagc cctacagtaa cgtccagccg aagattttga tcggagtgag 2520 caacgctcga gtgatgcatg tcttggacga tcgcgaagga aaattggatg aacctgttgc 2580 agtgaagaca cgacttggtt ggacggttta cggtacttat acatcggcga ataattcggt 2640 gcaaccagtt cttccccaca gcttccacat ttgctcgcat tttcatggaa cggacgagag 2700 cctacacgaa gcggtcaaaa actatttcgc tctcgatagt ctaggaatca gcgccccaaa 2760 aaaccagctt ctttccaagg aagatgaacg agccctagca atgctacgca aagttacgac 2820 ctttcaagac gggaggtacc aagccggcct actttggaaa tacgacgatg tacgtctacc 2880 taacaaccga tccatggcga tgagacgtca ccgttgtctg accaagagga tggaacgtga 2940 gccccggtta gctgaaacac tacgtgcgaa gatggaggac tacgaacgaa agggatacat 3000 tcgtaagctg acacaagaag aaagtcgtat gactggagac cgaacatggt atttgccgat 3060 ctttccagta ttcaatcaca acaaaccggg gaagataagg attgtcttcg atgcagctgc 3120 atccctagga ggtgtatctc tcaattccgt gctgatgaag ggtccagatc aattgaacgc 3180 tctgccgcca gtactgtaca agtttcgaga gcggttgatc ggcattggag gcgacgtcgc 3240 cgagatgttt catcagatca gaatgcgacc tgaagacgaa cacagtcaac ggatcctgtg 3300 gtgtgctagc gaggacacaa ctgagccatg tgattacgtg atgcaggtag taacattcgg 3360 cgctacgtgt tcacccagta cagcactgta cgtcctgaac gagaacgcgt cccgattcga 3420 gaaccagtat cccgttgctg tcgatgctat ctgccgtcgc cactacgtcg acgatatgtt 3480 gactagcgtt gacacagagg aggaggcaat ccagttagcg aatgacgttc gctacattca 3540 cctccaaggt ggattccata tgaggaactg ggtatcaaat tcgtcagcgg tacttgaggc 3600 cctcggagag aatccgaagc ccgaaaagtc aatggagatg aatgcggagc tcgccatgga 3660 aaaggttctc ggcatgtggt ggaacacgac gtccgatgtc ttcagttaca agctttgtac 3720 ggatcgcaat cgagagcttc tgtctggcgc caaacatccc accaagcggg acgtactacg 3780 tacactgatg gctatctacg accccttggg actcatcgca cactatctga tgtatctgaa 3840 ggttttgcta caagagatct ggagggcgaa aaccgtgtgg gacgatccga tcgaagaaaa 3900 gcatctggag aaatggctga cctggctacg catactgccc gacctggagt ctgtcaaaat 3960 acctcgttgt tacttccgac acgaagccag aatcgatcaa gctaccatcg aactccatac 4020 cttcgtagat tccagtgaaa gcggctttgc tgctgtgtcc tatttccgtt tcgaagtgta 4080 cgactacgtc gaatgcgccc tcatcggaag taagacaagg gttgcccctc tcaagttcgt 4140 ctccattcct cgtctggagt tgcaagcagc tgtcgtcggt acacggttag ctcgaagtat 4200 tgcggaagga cactccatta aaatcgatcg tcgctatttc tggacggacg cgcgagacgt 4260 catgtgttgg cttcagtctg atcaccgacg atattctcag tttgtggcct ttcgtgttgg 4320 tgaaatactg gaggcaacag acatcactga atggagatgg ctgggcacaa aacacaacgt 4380 tgcagatgat ggtacaaagt ggaagttcag accagacctt aagccaacaa gccgttggtt 4440 caccggtcca ccgttcctgt ggaagcccaa aagtgaatgg ccatcttcct cgctgaatgg 4500 cgaagaaacc ataatcgaac ttcgtgcgaa ctttctgggc cacatcgaaa gaacagtccg 4560 taccattcta caggcagaaa acttctcgtc ttggcaacgt atgcttcggg tgacggcctt 4620 ggttcatcgt ttcattggga acgtcaggca caagcaaaag agaaaaccaa tgattcttgg 4680 accgctcaca caagaagaac tttctacagc ggaatcgttc ctttttcggc aagcccaggc 4740 agacgtatac ggcgaagagt tagcagtagt tgcagcagga aaccggctga agaagaagag 4800 tagcctattc aagctgaacc cgttcatcga cgaccacgga gtgatgcgga tcagcagtcg 4860 attgggagag tgtgacttcg tggacgaaag tgaacggcat cctattgtac tccctcgaga 4920 tcatccaacc acccgtctcg tcgtggccag cgtgcatcag cgttatcggc atcagtgcca 4980 cgaaacctgc gtgaacgaag tccgtcgaga aattttacat tccacgagtg cgcagagtct 5040 gtgatcaggt tcgkcggagc tgccagcaat gtaaggtcaa caatgcgaag cctgagccgc 5100 cagcgatggg atcccttccg aaagcacgag tagcagcctt catgcgtccg ttttcgtacg 5160 tgggcgtaga cttctttgga ccgtttctcg tcctcattgg ccgtcgacac gagmaacgtt 5220 ggggtgtgat agtcacttgc ctgtcaactc gggcgataca tcttgagctt gcagcgtcgc 5280 tgaacacaag ttcttgcatc ctagccctga ggaactgctt tgcacgccgt ggaactccaa 5340 tagagatccg cagcgaccgt gggacaaamt ttgtgggggc ggataaggaa cttaaggctg 5400 cggtggcggg acttgaccag gacaaactga tgactgaatt cactactccg gcaacatcgt 5460 ggcgattcaa ccctccagcg tccccacata tgggtggctg ctgggaacgt ctgatccagt 5520 cagttaagaa ggttctagcc atcatcaaac ctcaacggat acctacggaa gaggttctac 5580 ggagctacct tattcaggta gaaaacatcg tcaacagccg tccactaaca cacgttcccg 5640 tagataactg ttcgtcgcca gcgctgactc ctaaccactt cttggtagga tcgtcaagcg 5700 gatccaaacc tctcgttccg tacacagatt gtccctcagc ggttaagcag tcttggaagg 5760 gttcggaagc tctggcaaat cgtttctggc agcggtgggt aacagagtac ttgcccacta 5820 tcactcgtcg caccaagtgg ttttacccag tgaagcctat tgtggtaggt gacgtggtcg 5880 tcgtagccga tcctgagcta gcgcggaatt gctggcccaa aggacgcgtt gtatcggtaa 5940 aaacctcttc ggatgggcag gttcgttcgg cagttgtgca gactgcttcc ggattctacg 6000 acagacctgc gaccaagtta gctgttttgg acataggtgc aaacgatagt aagccggacc 6060 agggtccaac tactgggggg gac 6083 // ID Copia-116_AA-I repbase; DNA; INV; 1734 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-116_AA_; KW Copia-116_AA-LTR; Ty1_copia_Ele92; Copia-116_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1734 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC 'CTTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 132..1658 FT /product="Copia-116_AA-I_1p" FT /translation="MADVNKFAFARLNNQNWQIWKFRMEMLLTREELWYVV FT GDARPAVVTDQWTKDDRKARATIGLCIDDNQFALVKEANSAKAFWDQLRAY FT HEKNTVTSRVSLLKKLCSLNLAEEGDLESHLVVLDDLFDRLTNAGQQLEES FT LRIAMILRSLPDSYGGLVTALESRADADITMQLVKSKLIEEFERRKERSGE FT TCETKAMKSVVVRNEVQSVGGLGNRAPVRRCYFCDKPGHLRRNCRSFMLAK FT QELERESEEKKKPDDRVRVRQNARQATDENCGASVCFMAGVDQRKCWYIDS FT GASRHMTSDRNFFKSLKERKGPSVVLANGKVVTAAGCGDGIVHGVDGNGSE FT VEIKLTDVLYVPSLASGLVSVDRLTKKKFTVKFMKDSCDICDTSGKKVVIG FT EKSGSLYRLKLAKVEKKVEGQRPIIGEKKTTKGRSSPVSVVSWYSEKKPNR FT AEEREEEEYFDANTSSEEVEPLHVAESNSGSPDSDADNWREVRRSRRSNRG FT VRPSRLVDYVFDA" XX SQ Sequence 1734 BP; 480 A; 302 C; 588 G; 364 T; 0 other; ggttattggc ccagcagtgc cgaagggaag gttccggatt gcgtaggaaa aaggttccga 60 gagagcgaaa gtgatcacgc cgtgtgtgag tgtgagttgt ttctccgcag tggtgatttt 120 acgtggacaa aatggcggat gtgaacaagt ttgcgtttgc tagactgaac aatcagaact 180 ggcaaatatg gaagttccgg atggagatgc tcctgaccag ggaggagctg tggtatgtgg 240 tcggtgatgc gaggccggct gtagtgaccg atcagtggac gaaagacgac cggaaagctc 300 gtgccactat cggattgtgt attgatgata accagtttgc actggtgaag gaagcaaaca 360 gtgcaaaagc gttttgggat cagttgcgtg cttatcacga gaagaacacg gtaacgtcac 420 gtgtatcttt gctgaagaaa ctgtgctcgt taaatctcgc ggaagaaggt gaccttgaga 480 gtcatctggt ggtgttggac gatttgttcg accgtttgac gaatgcaggc caacagctgg 540 aggaatcgct gcggattgcg atgatcctgc gtagcttgcc tgattcgtac ggtggtttgg 600 tcacggcgtt ggagagtcgt gccgatgccg atatcacgat gcagctagtg aagtcgaagc 660 tgatcgaaga gtttgaacgt cggaaggagc gttccggcga aacgtgtgag acgaaggcga 720 tgaaaagtgt tgttgtgcga aacgaagtgc aaagcgtggg tggactcggc aatagagcac 780 cagtgagaag atgctacttt tgcgacaagc ccggtcattt gcgtcgcaac tgtcgttcgt 840 ttatgctagc gaagcaggag ctagagagag aaagcgaaga gaagaagaaa ccggacgaca 900 gagtgcgtgt gaggcaaaac gcgagacaag cgacggacga aaattgcggt gccagtgtgt 960 gcttcatggc gggagtggat cagcgaaagt gctggtacat tgacagtgga gcaagtcggc 1020 acatgacgag cgacaggaac tttttcaagt cgctgaaaga aagaaaaggc ccgagtgtag 1080 tgttagcgaa tggtaaagtg gttaccgccg ctggttgtgg cgatggtatt gtgcatggcg 1140 tggacggaaa cggaagcgaa gtggagataa agcttaccga tgtgttgtac gttccgtcgt 1200 tggcgagtgg acttgtgtca gtggacaggt tgacgaagaa aaagttcaca gtgaagttta 1260 tgaaggatag ttgtgacatt tgcgatacat ccgggaagaa agttgtgatc ggagaaaagt 1320 cagggtcgtt gtatcggctg aagctagcga aagttgagaa gaaagtggag ggccagcggc 1380 cgataatagg cgagaagaag acaacgaagg gaagaagttc accagtcagt gtggtgagct 1440 ggtattcgga gaagaagccg aatagagccg aagaacgtga agaagaagaa tacttcgatg 1500 cgaacacttc aagcgaggaa gtggagccgc tgcatgtggc ggagtccaac agtggaagcc 1560 cagattcgga tgctgataac tggagggaag ttcggcgatc caggaggagc aatcgaggag 1620 tacgacccag ccggttggta gattacgttt tcgatgcttg aagcaggagg agatcaactc 1680 ggcatggaag gcagagatga gaaggacgag tacgaggaca acattgagga ggag 1734 // ID Copia-35_DPu-I repbase; DNA; INV; 5484 BP. XX AC ACJG01003352; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-35_DPu_; KW Copia-35_DPu-LTR; Copia-35_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5484 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01003352; Positions 362 5845. XX CC Positions [2404-2976] - Integrase core CC 'GATTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1870..3297 FT /product="Copia-35_DPu-I_1p" FT /translation="MTDQRSFFSSFKEIDCETWKVNGIGGAQLNALGIGTI FT PVHSYVHGERKNGEFHDVLFVPGLGTNLFSIGIATDSGIDAHFIKDTVTFV FT KHGTEIMSGQRVGKSLYHLKVIAKNSDQDSTYAAAASTTRPIPIWHQRLAH FT LNCKTILKMARSNAVVGLDIDLSSLDYQLCEGCIFGKMSRSPFPTSSTQAE FT HVGHIIHSDIGIVPVPTPNGERYYSIFKDDYSNWTSMALMKKKSDAADFFI FT RFVAFVKTATGKTVKILRTDGGKEYDNGYLNNFLATSGIVHQTSNSYTPQQ FT NGVSERMNRTAMESTRSSLHMRSNRLTNLFKKADNSILELWGEFLKSAIYV FT LNRTLSSSATSNSFTKTPHELFYKKKPNIENLRVIGCRAYVHVPDCKRKKL FT DSKATPCWLVGYGEETKGWRLWDPVSRKIILSRDVTFDENLLISDFKDDSN FT HNASNQSYSTIFDPFLLATGILGLVYHVFIS" FT CDS 3777..5354 FT /product="Copia-35_DPu-I_2p" FT /translation="MTFYLDHFKIKHATALFTESFEPQSYKEALESEHVDK FT WMAAFKEEYDSLIANKTWEIVPLPPGSTAINCKWIGKVKPAYDSIPERYKG FT RLVAIGSRQKYGVDYDEVFAPVPHQEAVKAAFAEIASLDLEIIQFDIKTAF FT LYAKLDKTIYMKQPEGFVVPGKEDHVCLLVKSLYGLKQAPRLWHHRLDEVL FT IKFGLKNCAADRCIYIRRTPDETTIVIAHVDDGIAASSKRSVLVDIGTHLG FT AEFIMHTVPPTRYIGLNISRDRPNKRIFVSQSHMIEKLSSRFGMSNLAPKS FT IPADPSIRLIANKSPKSEGEKTTSPYPYREAVGALLYLALMTRPDISYAVG FT QVSKYCQNPNESHWNAVIQIFAYLNGTMDFGIWLGGERTGLIGYTDADYAG FT DKNDYRSTSGSIFFFHGGPVSWSSKKQTCTALSTTEAEYIAACEATKTAVW FT LSCLLQDFSGTDQRKVPMFCDNESAVRLAYNAEFHQRTKHVLVRYHYIRQQ FT VAEGKIEVKYISLMINWQIFLLKHFQVQNL" XX SQ Sequence 5484 BP; 1693 A; 1164 C; 1162 G; 1465 T; 0 other; ggttatgggc ccaggatatt cacgtgttat ttaagtatat tcattcatca tggccgaaag 60 tcccgataga cgacgtgact atgtcaaatt caatggaatc aactttcccc aatggaaatt 120 tgctgtcatg cttaagctga agaagaaaaa gcttgacgga atagtgcttg ggactgaaac 180 aagaccagtt gaggtaacaa aattcattcc aataattaga caacttgaaa ctttagattg 240 ttcctgtatt gactaagttg tctcacacac atgtgtcaca ctcgtagaaa ttctgtatag 300 cctgaatgtg tcatcgaaca atgattcata tttgtaacaa agtattagtc aaatatttca 360 acgagtgaga cacaaatgat tgattcaaaa agaacttcag acgtaatcaa ctgatcaata 420 gagtacgaca aatgtgaaat tgactgttga gtgcaacagg taacttattg ttggctgtgt 480 catgagtgac attgactgtt gggtgcaaca ggtaacttat gcgacatatt gactgtgatt 540 tgagtatcac aggtaacata ttgaatatgt ctgttgattg aatattgaga ataccgcaag 600 taacttaccg attgcgatga ttgagtagca caggtaactt aatagctgac tgtgaaaacg 660 agtgtgtatt ttgaaactac agaggattct ataattattt caacctattc ttgatgatag 720 aatttgaaac agtactgtgt tgtggatatc attttgacat atattcagag agttctattt 780 ttgaatcaca acaacaggtg cctgaagaga gagatggact aaatgtggtg actaatcaga 840 atgcaattga ctcatggaac gaaagagacg tggatgcctg ttccatcata ttttacaaca 900 ttgagccaac gtatcaaacc tccatcgaag gatctgctac tgctcatgaa atgtggaata 960 ggttgatact gcaatatgcc caagtagctg tggccaactc tgcccatctt cttggaaagt 1020 tccatcaata tcgaatggat cccggtaaat aattattgga ggaactcgtt tagacacgaa 1080 atgtgaatga ttgaataatg tttgacattt tctttattta ccagatcatt cagtcatggc 1140 tcacataaat agactcagat tgatggctga cgaactaaag agtgtcaatt ccgcaatatc 1200 tgaagaaact ctaattgtga gaattcttca aacactacca cccagttatc gatccttttt 1260 gtcggcctgg gatagtgttc cacgagctga tcaaactatc gctaacttaa cggggagact 1320 cgtaacggaa gaactcagag ccaaaaccca tggtgtagca gatcctgctg atgttgcctt 1380 ctttgcctca catccatcaa gaatccagca acaacaaatg atacaacaac atcactcaga 1440 agctaatcct gctcggagcc gaggttgtat tcttttatca acagtacagt tcgctaagta 1500 agtggaacta aaatcatatt tctgcaactt ataggctcaa aaggtgacta ctacaaccgg 1560 ggaaataagc ataatttccg tggcgaacag tcaggctatc gcaacgatca caatcaacgt 1620 agctacagcc atcgcggtgg acaacgtgga agaggtggaa gaggctgttg gagttgcgga 1680 atgacaaacc acaaatccta aaactgcaga aacaagaaag atgaagaaag aaaagatgca 1740 agaaatgatc agtttgacaa aaaccgcgac tacgactcta aaaacaactc ctttgccgct 1800 ctctcttctc tctgttttgt tgcacgaaaa cccatcgact ggtatgccga ctctggcgcg 1860 acgcatcata tgactgatca acgttccttt ttcagctcat ttaaggagat cgactgtgaa 1920 acttggaagg tgaacggcat cggtggtgct cagctaaacg cgttgggaat cggcactatc 1980 ccagttcatt cctatgtcca cggagaaaga aaaaatggcg aatttcatga tgtccttttt 2040 gttcctggac tgggaacgaa tcttttctcg attggaattg ccaccgactc tggtattgac 2100 gcgcacttca tcaaggacac tgttacattt gtgaagcatg gcacagagat aatgtctggc 2160 cagagagtcg gcaagtcact gtatcatctt aaagtaatag ccaagaattc cgaccaagac 2220 agcacttatg cagccgctgc cagcactact cgtcccattc caatctggca tcaacgtcta 2280 gcccatctca actgcaagac tattttgaaa atggcaagaa gtaatgctgt cgttggactc 2340 gatattgact taagcagtct tgactaccaa ctatgtgaag gatgtatatt cggaaagatg 2400 agcaggtctc cttttcctac cagctccact caggcagaac atgtaggtca catcatccac 2460 tcggacatcg gaatcgttcc tgttccgact cctaatggcg aacgctacta ttcaattttt 2520 aaagacgact atagcaactg gacatcgatg gcgcttatga aaaagaaatc cgatgcagct 2580 gactttttca ttagatttgt tgcttttgta aaaactgcaa cgggaaaaac ggtgaaaatt 2640 ttaaggacgg acggcggtaa agagtatgac aatggttacc ttaacaactt cctggccacc 2700 tccggaatcg ttcaccaaac aagcaactcc tacactcctc aacagaatgg tgtctctgaa 2760 agaatgaacc ggacagcaat ggagtcaacc cgaagcagtt tacacatgag aagcaacaga 2820 cttacaaatc tgtttaagaa agccgacaac tcaattctgg agctgtgggg agaatttctg 2880 aaaagcgcta tttacgtact gaaccgcact ctgtctagtt ctgccacgtc taattcattc 2940 accaagacac ctcatgaatt gttctacaag aagaaaccaa acatcgagaa tcttcgcgta 3000 attggctgca gggcctatgt ccatgttccc gattgtaaaa gaaagaagct ggactcgaag 3060 gcaactccat gttggctggt agggtatggt gaagaaacga agggatggag actatgggac 3120 cccgtctcta gaaaaattat cctaagccgg gacgtgacat ttgatgagaa tctgctgatt 3180 agtgacttca aagacgactc caatcacaac gcatcgaacc agtcctacag tactatattt 3240 gatccattct tgctggcaac tggaatactt ggcctggtat atcatgtttt tatttcttag 3300 atcgtgcatg tgataaacag tattcgtatt ttttcccaat gtgtcaaatt aggaccctga 3360 tacaccactt gaggatgaaa cgacgggtcc cgatatgatt aatctggttg aagagcagca 3420 tgacgtcgac attcaacacg aggaagatca gccaatgaac gtcttggaac cagcagatga 3480 accacctaac cttgatatgg aaatcggtga tgatcatcaa gtggtagagc ttgaaaacct 3540 tccactccct caaaatgctc aacgtggact cgaggataat gctctacttc ctgaagttat 3600 tgaacctctt gaccaacatg atactgaaca ggaaaatgct ttgcgaagat cagaccgcac 3660 tccaaagtac agtgcgagat atcaagagtt ccgacgatca ctcgggttga caacactaat 3720 cggtaaattg aacgtcaaac taaatatctg taggaaagtg aactaatctg taatttatga 3780 cattttacct agatcatttc aaaattaagc atgccactgc tttgttcacc gagtcttttg 3840 aaccacagag ctacaaggaa gctcttgaat ctgaacatgt agacaaatgg atggcggcat 3900 ttaaagaaga atatgattcg ttgattgcaa acaagacttg ggaaattgtt cctcttcctc 3960 ccggcagtac cgcaatcaat tgcaagtgga ttgggaaagt taaaccagct tacgacagca 4020 ttcctgaacg ctacaaagga agacttgtgg ctattggttc gagacaaaaa tatggcgttg 4080 actacgatga agtttttgct ccggttccac atcaagaagc cgtcaaagcc gcatttgctg 4140 aaatagcatc ccttgatctt gaaatcatac agtttgacat caagactgcc tttttgtatg 4200 caaaactcga caagactatc tacatgaaac aacccgaagg atttgttgtt ccgggaaaag 4260 aggatcacgt atgtttactc gtcaaatctc tctacggact caaacaagct ccccgattgt 4320 ggcaccacag actcgacgaa gttttgatca aatttggact taaaaactgt gcggctgatc 4380 gatgcatcta tattcgacgc accccagatg aaacgactat cgtaatagct cacgtggacg 4440 acggcattgc agcaagcagc aagaggagcg tcctcgttga catcgggaca catcttggcg 4500 ctgaatttat aatgcacact gttccaccca ctcgctacat tggccttaat atttcgcgtg 4560 atcgaccaaa taagagaatc tttgtatccc agtctcacat gattgaaaaa ctgtctagcc 4620 gattcggaat gtctaatcta gctccaaaat ccattccggc agatccatca atccgtctta 4680 ttgcgaacaa gtctccgaag agcgagggag agaagaccac ttcaccttac ccgtacagag 4740 aagcggttgg cgctcttcta taccttgctc taatgacccg tccggacatt tcctacgcag 4800 taggccaagt atccaaatat tgtcaaaacc ccaacgaatc tcattggaat gccgttattc 4860 aaatctttgc ttatttaaac ggaactatgg attttggcat ctggcttggt ggagagcgga 4920 cgggactaat tggttacacc gatgctgatt acgccggcga caaaaatgat tatcgatcaa 4980 catctggaag tattttcttc ttccacggag gtccggtatc gtggtcaagt aagaagcaaa 5040 cctgcacggc tttgtccacg accgaagctg aatacattgc tgcatgcgag gctacgaaga 5100 ctgctgtatg gctcagctgc ctcctccaag atttctctgg aacggatcaa cgaaaggtcc 5160 caatgttctg tgataacgaa agcgccgttc ggctggcgta caatgctgaa tttcaccaga 5220 gaacaaagca cgtgctggtt cgataccact atattcgaca acaagttgca gaaggaaaaa 5280 ttgaagtcaa gtacatttca ctaatgatca actggcagat atttttacta aagcacttcc 5340 aagtccaaaa tttataacta tgcgaaaaag aattggtgtt ggaaaaagat ctgactaaaa 5400 atcttatgtc catgtttatg ttttgtgtaa tgttatgttt gttaatttct tttgatgaat 5460 gagggttatt ggtttgaggg agag 5484 // ID piggyBac-17_SM repbase; DNA; INV; 2288 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-17_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2288 RA Jurka J.; RT "Families of autonomous piggyBac elements from planaria."; RL Repbase Reports 9(8), 1827-1827 (2009). XX DR [1] (Consensus) XX CC >96% identical to consensus. Low-copy. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 400..2091 FT /product="piggyBac-17_SM_1p" FT /translation="MFIINXRIHXXEDLSDTESDAHVTKLSESDDEIAEEN FT DEIELSDSEDEHIRNICRIRTRMRILSDSESESDIDITETSYNENSLTEKS FT SDGTIWQILEEGGESGRPPAYTIFNDVSGPTAYAKRNIMLGNVISAFQLII FT DNNIMDYIKTCTETEACKVLKKEWTITNSQLRAFLAILYARGAYEAKCLKA FT SYLWSKKWGPAFFSQTMSRDKFMEVLRFIRFDKRNERSERLKTDKFALVSR FT IWNRFIENSQNCFKPDANITVDEQLFPTKARCRFLQYMPNKPDKFGIKFWL FT AADVKTKYLINGFPYLGKDEMRTSNIPLGEYVVLKLADPYLGCGRNITTDN FT FFTSLPLAKKLLLKKTTLVGTIRGNKRNLPKIAKTKNDNMIRYSTKLYKSE FT NYILTIYKSKPKKKVLLLSSRHKSVKIENNEKLVPETVAFYNKTKCGVDVL FT DQMARKYSVKAGSRRWPLQVFYNILDLAAINAWILYKEVTGVNISRKNFIF FT QLAEELANDNRDAPSTSFKLPSNISNIRNRKSCQIGLCKKNRSNNKCNKYV FT CGKCTLKIDVICKKCKK" XX SQ Sequence 2288 BP; 859 A; 331 C; 385 G; 709 T; 4 other; cctattttta cggacacccg tcaaattgac gggttatgaa tggacccatt gttttccctt 60 cgcttgctcc agttttatac agcgatgcac gtgtaccatt agcatgtgta tttattaggt 120 cattgcactc tagatttgac taaatgagac aacagtacat aaaaaaacaa aagatatgtc 180 atcactcttg aacttacaag cgactgttgc acttcgaacg cgaaagtgaa cttttgaaaa 240 gtgtatttgt tgtgaaaaaa cttgtttact gtgtatatca catgtgagac ggataattta 300 ttcgatttgt gatatatatt tttatgtgat ttacgtttta ctttttaatt gaaatctgaa 360 tttcagaggt atgtttttgg aaaaccaatt cttatattta tgtttattat aaattanaga 420 attcatatnn acgaagactt atctgatact gaatcagatg cacatgttac taaactaagt 480 gaaagcgatg atgaaattgc ngaagaaaac gacgaaattg aactttccga tagcgaagat 540 gaacatattc gtaatatatg tcgtatcagg acaaggatgc gtattctttc agactctgaa 600 agcgaaagtg acatagatat tactgagaca tcttataatg agaacagttt gaccgaaaaa 660 tcttctgatg ggactatatg gcaaatttta gaagaaggtg gagagagtgg tagaccacca 720 gcttatacta tattcaacga tgtctctggt cccaccgcct atgctaaaag aaacatcatg 780 ctcggtaacg taattagtgc attccagctc ataattgata ataatataat ggattatatc 840 aaaacttgca ctgaaactga agcatgcaaa gttttgaaaa aggaatggac cattacaaac 900 tcccaattac gggcttttct agcaatttta tatgcccggg gagcatatga agcaaaatgc 960 ttgaaagctt cttatttatg gtcaaagaaa tggggacctg ccttcttttc gcaaacaatg 1020 tctagggaca aatttatgga agtccttaga tttatacgtt ttgacaaaag aaatgaacga 1080 agtgaacggt taaaaacaga taaattcgct ttagtttcca gaatatggaa tagatttatt 1140 gaaaatagtc agaattgttt caaacctgac gcaaatatta cagtcgatga acaactattt 1200 ccaacaaagg ctagatgccg atttcttcag tacatgccaa ataaaccaga caaatttggc 1260 ataaaatttt ggttagcagc ggatgtaaaa actaaatatc ttataaatgg tttcccatat 1320 ttgggtaaag atgaaatgcg aacgtcaaac atacctttgg gtgaatacgt agtgctcaaa 1380 cttgctgacc catatttagg ttgtggacga aatataacca cagacaattt ttttactagt 1440 cttccactag caaaaaaatt acttttgaaa aaaactactt tagttgggac tatccgtggg 1500 aataaaagaa acttgccgaa aattgcaaaa actaaaaatg ataatatgat tcgatattca 1560 acaaaacttt ataaatcaga aaactacatt ctcaccattt acaaaagcaa accaaaaaag 1620 aaagtattgt tactaagttc aaggcacaaa tcggttaaaa ttgaaaacaa tgaaaaactt 1680 gtcccagaaa ctgtagcgtt ttacaataaa actaaatgtg gagtggatgt tttggatcaa 1740 atggcgcgaa aatatagtgt aaaagcagga tcacgtagat ggcctctaca agtgttttat 1800 aacatcttag acttagcagc aataaatgca tggattttat ataaagaggt tactggagta 1860 aatatatcaa gaaaaaattt tatttttcaa ttggccgaag aactagcaaa tgacaacaga 1920 gatgccccca gtactagttt caaactacca tctaatattt cgaatattcg taatcgtaag 1980 tcttgtcaaa taggtttgtg taaaaaaaat agatctaaca ataaatgtaa taaatatgta 2040 tgtggaaaat gtacgttgaa aattgatgtt atttgtaaaa agtgtaagaa ataaaactat 2100 tataataaaa atacattgtt tcaaattaaa attaattagt tttaaattaa actaataaac 2160 gtaaatttaa tatttttata tttataatat aataaaatta ataaacaatt ttaatatttt 2220 taatataata aaattaataa agtaatttta atatttccgt caaaatgacg ggtttcagta 2280 aaaatagg 2288 // ID CR1_Ele34 repbase; DNA; INV; 5032 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele34. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5032 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5032 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 22 CC sequences with >93% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 434..1222 FT /product="CR1_Ele34_1p" FT /translation="MACNKCQETIADTERIICRGYCGHSFHLTCVRMDHSF FT RDVQVAHERNVFWMCDGCADLFSSDHFRKISSRYSNEIVHDEASIKCLKDD FT IAELKQIVGLLSSKVEAKPTTPIMSTPWSGLNRMRSSSNVPNTPKRQRVES FT LPMERPTIIRGSKAASELVKTVSPPEELFWLYLSAFDPCTSEXDVSNFVRN FT CMGLSAEVEPKVVKLVPKDKDPATLNFVTFKVGMKLSLKTSALSKETWPEN FT IYFREFENQPKNQRKIVRVTAE" FT CDS 1438..4902 FT /product="CR1_Ele34_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MFVVSSSPASISFHRTSTFGSPGRTPASSLLEASDSP FT NTVVPLLPATCSRPGPVFGSEEEVFQHSSAGKYITPTNYSLPEIFVVSSSS FT MSNDSCCTSSFLPPGRMPAPSLLEASDSPSTVEPLLPATCSRPGPVCGSEE FT EVFQNPTAGKYTVPPLSCSRTEIRVVSSFYDDAPTATPASSCSIAHPTNQD FT ITIYYQNVGGMNSDIDDFRVAISDHCYDIIVFTETWLDSRTLSHQIFGNAY FT EVFRCDRNADNSRKTTGGGVLVAVNSRLKTKKVESPSWVCLEQVWTSIKLG FT DRSLFLCALYIAPDRIRDRELIETHCQSVYSVLESTKAKDEVIVLGDFNMP FT GISWEKSSNGFLHPDIGHSVIHSNASFLLDNYSSATLSQINHVTNQNNRRL FT DLCFVSTQDTAPFLCEAPVPLVKLTLHHPPLLLAIGSKLPHDYVISPNVVY FT YDFRKADHRGIAQLLSGFDWSVILDSDDIEEAAQTFSNVLSYVLDRHVPKK FT VQHHSFRPPWQTSELRRLKSKKRAALKKFTTHRTPLLKRHYARVNNEYKQA FT AKDCFLRYQRGIQTKLKYRPKQFWKHVDEQRREAGLPSCMSLNGNVASSPQ FT DICNLFADKFASVFNDEILSDEHITLAAANVPQFGQTLSSVDINEDMIIRA FT SSKLKTSYNPGPDGVPSSFLKTHMRDLISPLLHVFRLSLTLGVFPSTWKLA FT HMFPVHKSGNKHDVGNYRGITSLCAVAKLFELVMMEPLLAHCKPYISVDQH FT GFVTGRSTATNLLCLTSYITESMSKRFQTDVIYTDLTAAFDKLNHNVAVAK FT LDRLGIHGCFLQWLQSYLTGRHLAVLIGDCYSSSFAATSGIPQGSHLGPXI FT FLIYFNDVNLVIEGPRLSYADDLKLYLRIRSIDDCYFLHQQLNAFADWCNL FT NRMEVNPAKCSVISFSRKKQHILHSYTLSGEVIKRVSQVKDLGVILDAQLS FT FKQHVDYAVGKASRVLGFIFRTAKEFTDIYCLKSLYCSLSRSILEYCSVVW FT TPHYNNGVLRIESVQRRFLRYALRRLPWSDPFRLPSYQSRCQLIDLESLSV FT RRDTARALLLSDVLQGRIDCPYILNEININVQPRALRNRSMLRLPLQRTNY FT STHGAINGMQRVFNRVATLFDFNLSRHSLRTRFSSFFFKCFLICPFL" XX SQ Sequence 5032 BP; 1277 A; 1269 C; 1084 G; 1395 T; 7 other; ttggcaacac tgttaagcaa atagtcgtgt tagtacggag ctcatattta tttatgtttt 60 taattgtgtt tatcgcgttt aaattcgacc cgtattgtgc cttcgcgagt ccgtagtttc 120 ggttctccac catcagtata gtgcagtgca cgaaccgttt taaagtgtgg aattttgtgg 180 atgctagttg gcgataattt atcatcagaa gtgtttgtga ctacctcmtt cacttcmcag 240 cccttcgctc attttatcct ccaaagtgac atctggcggg aataatcgga acctttgcga 300 ttgcatgcaa catcgatcma tcaccgaaag cttacttcaa tacacgtgaa ctttgctggt 360 tgcatacccg taggcgaatc gctctgaaat agtccaacag ttttccgtcc tgtttcacgg 420 ttctacgacc gcaatggctt gcaacaagtg tcaagaaacg atcgctgata cggaacgaat 480 catttgtcga ggatattgcg gacactcgtt ccacttgacc tgcgtgagga tggatcattc 540 tttcagagat gttcaggtgg cccatgaaag aaatgtattt tggatgtgtg atggttgtgc 600 cgacttgttt agtagcgatc attttcggaa aatctcgtca cgttattcta acgaaattgt 660 tcatgacgag gcatctatca aatgtctcaa ggacgatatc gcagagttga agcaaatcgt 720 tggactttta tcttccaagg tcgaagccaa accgaccaca cctataatgt ctacaccgtg 780 gtcaggattg aatcggatga gaagttcgag caatgtcccg aacacgccaa aaaggcaacg 840 cgtggagagt ctacctatgg aaaggccaac gattattcgc ggatctaaag cagcatccga 900 attggtaaaa acagtttcac caccggagga gttgttctgg ctgtatttat ctgctttcga 960 tccctgcaca tccgaastgg atgtttccaa tttcgttagg aattgtatgg gattatctgc 1020 mgaggtggag cccaaagtcg tgaaactggt tccaaaagat aaagacccag caaccctcaa 1080 cttcgttacc ttcaaagttg gtatgaaatt atcgctcaag acatctgcac tatccaagga 1140 aacctggccg gaaaatattt acttccgtga gtttgaaaac cagccaaaaa accaacggaa 1200 aatcgtcaga gtgacagcgg agtagtgcca acttggaaag ctacgactca gcgatgggaa 1260 ccgggacgca cacttgccct cagtagtttg gaagcctctc acccacccac cacagtcgag 1320 cccctcctgc cagcgaccca cagccgtccc ggtcctgtgt gtggatttgg ggaagaggtc 1380 ttccgaaatt cgaattcagg caagtacwtc gctctattga acaattcgtc cgctgaaatg 1440 tttgtagttt ccagttctcc agcatcaatc agcttccatc gtacttcaac attcggatca 1500 ccgggacgca cgcctgcctc tagcctcttg gaagcctctg actcgcccaa cacagtcgtg 1560 cctctcctgc cagcgacctg cagccgtccc ggtcctgtgt ttgggtctga ggaggaggtc 1620 ttccaacatt catcagcagg caagtatatc actcccacga actattcgtt gcctgagata 1680 ttcgtagttt ccagttcatc catgtcaaat gattcctgtt gtacatcttc gttcctgcca 1740 ccgggacgca tgcctgcccc tagtctcttg gaagcctctg actcgccctc cacagtcgag 1800 cccctcctgc cagcgacctg cagccgtccc ggtcctgtgt gtggttctga ggaagaggtc 1860 ttccaaaacc caacggcagg caagtatacc gtccctccgc tgagctgttc tcgcactgaa 1920 atacgtgttg tttccagctt ctacgacgat gccccgactg ccactccggc ttcttcatgc 1980 tccatagccc atccgaccaa tcaagacatc accatctact accaaaacgt cggcggtatg 2040 aacagtgata ttgacgattt ccgagtggcc atttccgatc attgctacga cataattgtc 2100 ttcacggaga cctggttgga ttcacgcacg ctctctcacc aaatattcgg taacgcttac 2160 gaggtcttcc gctgcgacag gaacgcagac aatagccgga aaacaacagg tggaggcgta 2220 ttagtagccg ttaactctag acttaaaacg aaaaaagtcg aaagtccctc ttgggtctgc 2280 ttggagcagg tctggacttc gattaaactc ggtgatcgaa gcttgtttct ctgcgcattg 2340 tacattgctc ctgatcgtat acgtgatcgc gaactcatcg aaacgcattg tcagtcggtc 2400 tactccgtat tggaatccac caaagcaaag gacgaagtta tcgtgttagg tgacttcaat 2460 atgccgggga tttcgtggga aaaatcaagc aatggtttcc tccatccaga tatcggtcat 2520 tccgtcattc attccaacgc ttcctttctt ttggacaact atagttctgc cacactctcc 2580 caaatcaacc acgtcactaa tcagaataat cgccgcctag atctttgctt cgtgagcaca 2640 caggacacgg ctcctttttt atgcgaggct cctgttccgt tagtgaaact taccctccat 2700 catccaccat tgttgctagc catcggctct aagctgcctc acgattatgt catttcacct 2760 aatgtcgtct attacgattt tcgtaaggcc gatcatcgcg gtattgctca attgctctcg 2820 ggttttgatt ggagcgtaat cttggattca gatgacatcg aagaggcagc acaaactttc 2880 tccaatgttc tttcatacgt attagacaga catgttccca aaaaggtcca acatcattct 2940 ttccgtccgc cgtggcaaac cagcgagctc cgccgattga aatcaaaaaa gagagctgca 3000 ctcaagaaat tcaccacaca ccgcactccg ctacttaaac gccactacgc gagggtcaac 3060 aacgagtaca aacaagccgc gaaagattgc tttcttcgat atcagcgagg tattcaaacg 3120 aagctcaagt accgccccaa acaattttgg aagcatgttg acgaacagcg acgtgaagct 3180 ggactgccgt cgtgtatgtc gctgaatgga aacgtggcct cctcaccaca ggacatatgc 3240 aatctattcg ccgataaatt cgccagtgtt ttcaatgatg aaatattgag cgacgaacat 3300 atcacactcg ccgccgctaa tgttccacag ttcggccaaa ccctgagtag cgtagacatt 3360 aacgaagata tgatcatcag agcatcctcg aaactgaaaa cgtcttataa tccgggaccc 3420 gatggtgttc catcgtcatt tctcaaaacg cacatgcgtg acttaatttc accactgctt 3480 catgtttttc gtctttcgtt gacccttgga gtatttccat ccacatggaa gctcgcacat 3540 atgttccctg ttcacaagag cggcaacaag catgatgttg gcaactatcg tggtattacc 3600 tcgctctgcg ctgttgcaaa actgttcgaa ctcgttatga tggaaccgtt acttgctcac 3660 tgtaaaccat acattagcgt cgatcagcat ggattcgtga caggtcgctc tacagctacc 3720 aatctactct gtttaacttc atacataacc gagagtatgt cgaaacgctt tcaaaccgat 3780 gtaatttaca cagacttgac tgcagccttc gataagctga atcataacgt tgctgtcgct 3840 aaacttgatc ggcttgggat acatggctgt tttctccaat ggcttcagtc gtatctcact 3900 ggtcgacacc tagccgtcct cataggagac tgttattcgt cttcgtttgc tgctacatcg 3960 ggaatcccgc aaggtagtca cttgggacca wtgatcttcc tgatctactt caatgatgtg 4020 aacctggtaa ttgaaggacc acgattatcg tacgcggatg acctcaaact gtacctccgt 4080 attcgttcca tcgacgactg ttactttctc caccaacagc tgaatgcctt tgcggattgg 4140 tgtaatctaa atcggatgga agtgaacccg gcaaagtgct cagttatttc gttttcgcgc 4200 aaaaagcagc acatattgca cagttataca ttgtcaggag aagtaatcaa acgtgtcagc 4260 caagtcaagg atttaggagt gatcctggat gcacagctat cgttcaaaca acatgtggac 4320 tacgccgttg gtaaagcctc cagagtactt ggattcatct tcagaacagc taaggagttc 4380 acggacattt actgtctgaa atctttgtac tgttctctat ctcgctccat cctggaatac 4440 tgttcggttg tttggactcc ccactacaac aacggcgtct taaggatcga gtctgttcaa 4500 cgacggttct tgagatatgc gctgcgacga ttaccttggt cggatccttt tcgcctgcct 4560 agctaccaaa gtcgttgcca attgatcgat ctggagtcac tatcggttcg tcgagatacg 4620 gcaagagcgt tactgttatc ggacgtactt caggggcgca ttgattgccc gtatattttg 4680 aatgaaatca acataaatgt ccaacctcgt gctctcagga acaggtctat gctgcgactg 4740 ccgcttcagc gaacaaacta cagcactcac ggtgctatca atggcatgca gcgagtattc 4800 aacagagttg ccactttgtt tgacttcaat ttgtcccgcc attcgctccg tacgagattt 4860 agttcgtttt ttttcaagtg cttcctaatt tgtccgttct tgtaagttat tgtttgtgaa 4920 agctattagt gttgacatgt tcatttgttg aatgttttaa aaacgtgtac tattagaatt 4980 taagttttat cattggggct ttacttagcc cgttgactaa taataataat aa 5032 // ID Gypsy16-LTR_Dya repbase; DNA; INV; 425 BP. XX AC chr2h; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dya; KW Gypsy16-I_Dya; Gypsy16-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-425 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1103-1103 (2009). XX DR Genome; chr2h; Positions 613158 613582. XX SQ Sequence 425 BP; 127 A; 110 C; 83 G; 105 T; 0 other; tgtttaaatc cagaacaatg aaccccaata agcaagcttc aatctgcaag ctcagcgtct 60 gcataatttc ctgcacacgc cgagctgcca agcagaattc gataatttcc tgcacacgcc 120 gagctgccaa gtagagttcg aagaatctga ttggtaacaa catccgctcc caccaatgca 180 agtgattcca agaagactgc actgatttga ccagcaaaac atcagccgat gtcgtggcct 240 attggaaagc tacggagccg acgccgatcc tctgccacca agggcagcgc agctcacagc 300 caaagactca cagcttgcta gggacagctt agggtaattt agttgcaaat tggatttata 360 caaataaagt cgttcttaac tgaactccaa tttattttat tatttttttt accttgctaa 420 ccgca 425 // ID SMAR5 repbase; DNA; INV; 1521 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR5. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1521 RA Jurka J.; RT "SMAR5: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 994-994 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 278..1132 FT /product="SMAR5_1p" FT /translation="MTARKSLSDNERASIFRWKKEGMKQKKMAELLQVTPR FT CISYVLKTYTEQDFKSKRNNSGRKKSLSDKELVELKKIIDVDRFISGPKLA FT NKVEKKFKKVISSKTIKRFAKFVGFKAMRPKKVPFISKKNKIARLEFAKKF FT IVMPKSYWRKIIWSDESKINLFNSDGITYVWRRAGERLKNKCTRKTVKHNG FT GSIMVWGCIGENGVGNLWKIDGKMDSLKYIKILQENLLPSVEKLNLSDFVF FT QQDNDPKHTSKIATKFFEENRIKKMQWPAQSPDINIIEHCWAFV" XX SQ Sequence 1521 BP; 634 A; 154 C; 239 G; 494 T; 0 other; tacagtatgg gacaaaataa atgtaatttt ttaaaaaatg cccaaagtac aaaaaaatgg 60 gcgagaagtt ttatttagtt taaaattaca ttcctagtat agttttataa aattattaaa 120 ttaaataaac tttataactt acagtgttgg aaacattttt taatattcaa aaagctggat 180 tggacaaaat aaatgtaatt ttttaattaa tagtaaaata tgatcccgcc atttttgaaa 240 ataatttata tatatttaaa attattaact aagggttatg actgctagaa agtcactttc 300 tgataacgaa agagcttcaa tttttagatg gaaaaaagaa ggaatgaaac aaaagaaaat 360 ggctgaatta ctacaggtaa cacctagatg tataagttat gtattgaaaa catatactga 420 acaagatttt aaatcaaaac ggaataattc tggtaggaaa aagtctcttt ctgataaaga 480 attagttgaa ttaaaaaaaa taattgatgt agatagattt atatctggac ctaaacttgc 540 aaataaagta gagaaaaaat tcaaaaaggt tatctcatca aaaacgatca aaagatttgc 600 taaatttgta gggtttaagg ctatgagacc taaaaaagta ccttttatat ctaaaaagaa 660 taaaatagct agattagaat ttgccaaaaa atttattgtg atgcctaaaa gctattggag 720 aaagattatc tggtctgatg aatcaaaaat aaacttgttt aatagtgatg ggattactta 780 tgtatggcgt agagctggag aaagattgaa aaataaatgt actagaaaaa ctgttaaaca 840 caatggtgga agtattatgg tttggggctg tataggtgaa aatggagttg gaaatttatg 900 gaaaattgat ggaaaaatgg attcacttaa atatattaaa attttacaag aaaatctttt 960 accttctgtg gagaaattga accttagtga ttttgttttt caacaagaca acgatccgaa 1020 acatacaagt aagattgcaa caaagttttt tgaagaaaac cgaattaaaa agatgcaatg 1080 gccggcacag tctccagata taaacataat tgaacattgc tgggcatttg tttaaaaaaa 1140 tatggtgaag ctccggctag taatttaaat gaggcattca taaaaattaa acaaatatgg 1200 gaaaatatac cgcaggaatt tatacaaaaa cttgtagatt caatatatga acgtctatct 1260 gaggttattc gaaacaaagg aggtgctaca gactattagt taataaaaaa taaaaaaatt 1320 acatttattt tgtcctattt ttttaccaat taaaaaacca tattttgtat taataaaaat 1380 aaaaataaat gttttgaaat catacatgtg tatataatat tcgatttgta ttatttgata 1440 attttttttt cttttttgtt acagaaacat agataaaaat taaaaattaa aaaaattaca 1500 tttattttgt cccatagtgt a 1521 // ID CR1_Ele25 repbase; DNA; INV; 5444 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele25. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5444 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5444 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 12 sequences with >98% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 320..1234 FT /product="CR1_Ele25_1p" FT /translation="MRGGLDQTISSVTVEKCFASNMSLVCCSCAGDIGEIQ FT VECQGFCKAIFHPRCCGVAAEVFEQVMRNNQLFWFCPSCTALMKDMRLRNT FT ARAAYEVGQGHALNSHSDIMANLKTEIMDELKAEIRNNFAKLVNSNSFTPK FT SSRRVGIDPRFTRSRRLFSTAANPIPNNQPPLLLGTGSTPSPSIEIATVPP FT PQPKFWLYLSRIAKDVSVDQIRALAKKRLGSDDVQVVRLVARGRNIDTMSF FT ISFKIGVNLDLKPKALSTSTWPKGIVYREFTDNNKNENFWRPEPVAATDDP FT LSFASEEEVVLME" FT CDS 1084..5382 FT /product="CR1_Ele25_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="GFVHFNVAKRNCLQGIHRQQQKRKFLASGTCGCNRRS FT TELRIRRRSGFNGVKDNEYNSCFIPGRTLAPSLEEAPIPFTAVEPLLPATS FT SRPGPASELEDGVFRNPPSGKYLYLLRTSPAVPFVAFSPSSSSQMSSFHQP FT TSHNRTPGRRFAASLKEASIPLTAVEPLPPATSSRPGPACELEDEVFRNPI FT SGKYADILSTSPAVPYVASSTTSPIRVYSTSSFWNPGRILAASLKEASNPL FT TAVEPLPPATSSRPGPACEQGNEVFRNPAAGKYIDILRTSSAVPYVDSSLP FT SNIQASSTSSNRTPGRNLAASRKEASIPLTAVEPLPPATSSRPGPACEQGN FT EVFRNLSTGKYDLIPRTSPAVSLVVSRSRIPKDALTNSRLYDQWRISEGDS FT QLAQAGPSRNATWLSIPMPIPERTDTSSMEASYPTTVEKTGSSICRRPSLG FT CGVEDGVFQAATIGNASSDARDRIAYFNTSSLPDSRSLCVYYQNVRGLRTK FT THDLKLRLSSCDYDIVVLTETWLRPDISNSELSSDYTIFRCDRNKTTSNLS FT RGGGVLVAVKGDLQCTEVSLSDCADLEQVAVCVKLPNQTLNIFGIYLRPNS FT EPSLYSTHSAAVQRVSDSTSNNDVILVFGDYNLPHLQWSFDEDVNGYLPTN FT ASSEQEVSLTESIMACGLVQINSFTNRSHRILDLVFTNFSDISELSQPVLP FT LLPIDDHHPPLVLQIDVCFSAPSQLDPDPNILGLDFRRCNFSSLNNLLDSV FT DWDNQLLGHSVDESVGLFYDKMYEILRSVIPRRRCVNLGVDRKPWWNPELR FT NMRNRLRKVRKRYFASKTVENRNLLCLVESSYNELIDMSYRRYLENLQLTA FT KNNPSKFWKHMKAQKASHGIPNNVTYGDINATTHVEAAELFAKFFQSVYSA FT TSPRVYPGCFQNVRTHNIHIPPFQFSHQEVLNALLSLDTTKGAGVDGLTPH FT LLKSCASSLVAPLTLLFNRSLNENTFPQLWKIATMVPVYKSGSIRLVENYR FT GISILCCLGKMLESMVHKVVMSASISIISQYQHGFIPHRSTTSNLLCYTNV FT LFREIERRKQVDSIYVDFSKAFDTVPHLYAVEKLRHMGFPDWISDWILSYL FT TDRKAFVKVNSSRSSTFSIPSGVPQGSVLGPLIFVLYINDLCHRFSSGRLC FT FADDLKIFRVINSTLDCVALQDDIDSLLQWCIENGMTLNIGKCKVITFCRS FT LSSINYTYAINGTTLERVGSIRDLGVVIDSKLRFNEHITVITAKAFSVLGF FT IRRNASQFTDVYALKALFCALVRSVLEYAAPVWAPYHTSLMIRIERVQKSF FT IRFALRRLPWNDPINLPDYPARCQLVDLELLSNRRLKLQRLLVHDILTNNI FT DCPDLLSEVQLSVPNRRLRHTTLIAIPLHRTSYGYHDPFTTCLRNFNSVCE FT MFDFNVSKDCFRARIKNIV" XX SQ Sequence 5444 BP; 1421 A; 1359 C; 1153 G; 1511 T; 0 other; cgttatgacg taaacattgt tcgcatctag ctttttcgct tggagcaata atttatttta 60 tacatgttaa atttacctgt ttccgctgtg ttatatatcg cggtttagtg tttcgtacag 120 tgcatctcct agtagtgaat gatttacgta accacagaaa acgattgaga atttttagtt 180 ttgattttct acaatcccgc ggtgcaatac aatttgcttt gtttttgttt gtgttctcat 240 tgcacttcgt caaggcaacc gtacttcgtg accgtagcgt acactgctaa aaagactcgc 300 ggagtggcat agtttttcga tgcgtggtgg tcttgaccaa acaatttcaa gcgtaacagt 360 tgaaaagtgc ttcgcaagca atatgtcact tgtttgctgt tcctgcgccg gtgatatcgg 420 agaaattcaa gtcgagtgcc aaggcttttg caaagcgatt tttcatcctc gatgctgtgg 480 agtggctgcc gaagttttcg aacaggttat gagaaataat caattgtttt ggttctgccc 540 gtcgtgtaca gcacttatga aagatatgcg cctccgtaac actgcacgtg ccgcttatga 600 ggtgggccaa ggacatgcgc tcaactccca tagcgacatt atggcgaacc taaaaaccga 660 aattatggat gaattgaagg ccgaaattcg aaataatttt gctaagctgg ttaactcgaa 720 ttcgtttacc ccaaagtcct ccagacgtgt tggcattgac ccaaggttca ccaggagcag 780 gaggctgttt agcacggccg ccaatccaat cccgaataac cagccgcctc tattacttgg 840 aactggcagc acaccctctc cgtcaattga aatcgctacc gttccgccgc ctcaaccgaa 900 attctggtta tatctatctc ggatcgcaaa ggatgtatca gttgaccaaa tacgtgcttt 960 agcaaaaaaa cgcctcggtt ctgatgacgt gcaagttgtt cggcttgtgg ctagaggaag 1020 gaacatagat acaatgtcct ttatctcgtt caaaattggc gtgaacttgg atttgaaacc 1080 taaggctttg tccacttcaa cgtggccaaa aggaattgtc tacagggaat tcaccgacaa 1140 caacaaaaac gaaaattttt ggcgtccgga acctgtggct gcaaccgacg atccactgag 1200 cttcgcatca gaagaagaag tggttttaat ggagtaaaag ataacgagta taattcctgt 1260 ttcatcccgg gacgcacact tgcccccagc cttgaggaag ctcctattcc tttcaccgca 1320 gtcgagcccc tcctgccagc gaccagcagt cgtcccggtc ctgcgagtga actggaagat 1380 ggggtcttcc gaaatccacc ctcaggcaag tatctttacc ttttgagaac ttcgcccgct 1440 gtaccattcg tcgctttcag cccatcttca tcaagccaaa tgtcatcatt ccatcaacca 1500 acgtctcata accggacacc gggacgcaga tttgccgcaa gccttaagga agcctctatt 1560 ccactcaccg cagtcgagcc tctcccgcca gcgaccagca gccgtcccgg tcctgcgtgt 1620 gaactggaag atgaggtctt ccgaaaccca atctcaggca agtacgctga tattttgagc 1680 acttctcccg ctgtaccata cgtcgcttcc agcactacgt caccaatacg agtttattcg 1740 acatcttcat tttggaatcc gggacgcatt cttgccgcca gtcttaagga agcctctaat 1800 cctctcaccg cagtcgagcc cctcccgcca gcgaccagca gtcgtcccgg tcctgcgtgt 1860 gaacagggaa acgaggtctt ccgaaatcct gctgcaggca agtacatcga cattttaaga 1920 acttcttccg ctgtaccata cgttgattcc agcctaccgt cgaatatcca agcttcttcg 1980 acatcatcaa atcggactcc gggacgcaat cttgccgcaa gtcgtaagga agcctctatt 2040 cctctcaccg cagtcgagcc cctcccgcca gcgaccagca gtcgtcccgg ccctgcgtgt 2100 gaacagggaa acgaggtctt ccgaaatctg tctacaggca agtacgatct tatccctaga 2160 acttcccccg ccgtatcgct cgttgtttcc agaagccgca ttccgaagga tgcactcact 2220 aactcgcgcc tgtatgatca atggaggata tcagaaggcg attctcagtt agcgcaagca 2280 ggcccctcac ggaatgctac gtggctctcg attccgatgc cgataccgga acgcaccgat 2340 accagctcaa tggaagcctc ttatcctacc accgtcgaga aaactgggtc atcgatctgc 2400 agacgtccta gtcttgggtg cggagtcgaa gacggggtct tccaagccgc cacaatcggg 2460 aatgcctcat ccgacgcgcg tgaccgaatc gcttatttca acacctcatc attgcctgac 2520 agtcggtccc tctgcgttta ctaccaaaac gtcagaggtc tacgtactaa gacacacgat 2580 ttaaaattgc ggttgtcaag ttgtgactac gacattgttg ttctcacaga aacgtggctt 2640 cggcctgata taagtaactc ggaactgtca tctgattaca caatctttcg ttgtgaccgc 2700 aataagacta caagcaattt atcaagaggt ggtggggtgc ttgtcgcagt caaaggcgat 2760 ttgcagtgta cagaagtgtc attgtccgat tgtgctgacc ttgagcaagt ggcagtatgt 2820 gtgaagcttc ctaaccaaac gttgaatatc ttcggaattt acctccgtcc aaattccgaa 2880 ccaagtttgt attctactca ttctgctgcc gttcagcgcg tatccgactc aacgtcaaac 2940 aacgatgtga tcttggtgtt tggtgactac aatcttcctc atctacagtg gagctttgat 3000 gaagatgtaa atggttacct gcctacaaat gcgtccagcg aacaagaagt ttctttgaca 3060 gaatcaatca tggcctgtgg gctggtgcag attaactcct tcacaaaccg gagtcatcga 3120 attcttgatt tagtttttac aaacttctcg gacatatccg aattatcgca gccagtttta 3180 ccgctgctgc caattgatga ccatcatccg ccgcttgttt tacaaattga tgtttgcttt 3240 tccgcaccca gtcagcttga cccagacccg aatatcttgg gcctcgattt ccgacgatgt 3300 aacttttcct cgctcaataa cttactggat tcggttgatt gggataatca actacttgga 3360 cactccgttg atgaaagtgt cggcttgttt tacgacaaaa tgtacgaaat actccgctca 3420 gtcatccctc gtcgccgatg cgttaatctt ggtgttgata gaaaaccgtg gtggaatcca 3480 gaattgagaa acatgcgcaa cagactgaga aaagttcgta aacgatattt tgctagcaaa 3540 acggttgaaa acaggaatct gctttgccta gtggaatcct cttataatga gcttatcgat 3600 atgagctatc gcaggtatct tgagaatttg cagcttaccg ccaaaaacaa cccctcgaag 3660 ttctggaagc atatgaaggc acaaaaagca agccatggta tcccgaacaa cgttacttac 3720 ggggacatta atgccaccac acatgttgaa gccgctgaac tatttgcgaa atttttccaa 3780 agcgtgtaca gtgccacatc gcctcgtgtc tacccgggat gcttccaaaa cgtacgaact 3840 cacaacattc atatcccgcc tttccagttt tctcatcaag aagtattgaa tgccctactt 3900 agcttagata caacaaaagg tgctggggta gatggcttga ctccccatct gttgaaaagc 3960 tgtgcttcgt ctttggttgc accgttgacc ttacttttca atcgatcact caatgagaac 4020 acgtttcctc aattatggaa aattgctaca atggttccgg tttacaaatc tggcagcatt 4080 cgtcttgtgg aaaactaccg aggaatctca attctctgct gcctcggaaa aatgctggaa 4140 tctatggtac ataaagttgt aatgtctgca tcaatatcga ttatttctca gtaccaacat 4200 ggattcattc ctcaccgctc gacaacttcc aacctcttat gttacactaa tgtactgttt 4260 cgtgagatcg aacggcggaa gcaagtagac tcaatatacg tggacttctc caaggcgttt 4320 gacactgttc cgcatttgta cgctgttgaa aagctgagac atatgggttt tcccgattgg 4380 atatctgact ggattttgtc ctatctcact gatagaaaag ctttcgttaa agtaaactct 4440 tcgcgatcca gtaccttcag cattccttcc ggtgtgccac aaggcagtgt attgggacca 4500 ctgatttttg tactctacat taacgacctg tgtcatcgat tttcatctgg gagactatgc 4560 tttgctgacg acctcaaaat tttccgagtg atcaactcta ctctcgactg cgttgcttta 4620 caagacgaca ttgactcgct tcttcaatgg tgtatcgaaa acggaatgac gctcaatatt 4680 ggaaagtgta aagtcataac attttgccgc agcttgtctt cgatcaacta cacatacgct 4740 ataaatggaa caactctgga gcgtgttggg tcaattcgcg atctaggagt ggtaatagac 4800 tcaaaattgc gctttaacga gcatattaca gttatcaccg ccaaagcctt ttcagtcctt 4860 gggtttatcc gaagaaatgc atcgcagttc accgatgtgt acgctttaaa agctttgttc 4920 tgtgcccttg tgcgcagtgt tttagagtat gctgctcctg tatgggctcc ataccacaca 4980 tccctaatga ttcggataga acgagtacaa aaatcattta tccgttttgc tcttcgccgc 5040 ctaccttgga acgatccaat caacctaccg gactatcccg cacgttgtca gctagtggat 5100 ctggagctac tttcaaacag aaggctaaag cttcaaagac tacttgtaca cgatatcctg 5160 acgaataaca tcgactgccc tgatcttctt tccgaagtgc aactaagcgt tcccaatcgt 5220 agactgcggc atacaaccct catcgccatc cctttacata gaacgagtta cggatatcat 5280 gaccctttta ccacttgttt gaggaatttt aactctgttt gtgaaatgtt tgattttaat 5340 gtgtcaaaag attgttttag agctaggatt aagaatatag tttaattaat tagtctgtac 5400 ggtagtatta ccgaagacgt ttacacaata aataaataaa taaa 5444 // ID Gypsy-25_CQ-LTR repbase; DNA; INV; 211 BP. XX AC AAWU01010957; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_CQ_; KW Gypsy-25_CQ-I; Gypsy-25_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-211 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 430-430 (2011). XX DR GenBank; AAWU01010957; Positions 42439 42649. XX SQ Sequence 211 BP; 68 A; 37 C; 68 G; 38 T; 0 other; tgtagtactc gacacgaggt gagaaagtga agcctactta ggggaagaga ataaactata 60 ggacgggaga gagaagggac caccctgtgg tagcgaggaa gtgtttgggg agacaggcag 120 tggagtggaa gtaagagtgg aataaagaag cattcaggag catcacctcg acggatttta 180 ttcccaccgg aatacccggc agattctcac a 211 // ID BEL-11_AA-LTR repbase; DNA; INV; 509 BP. XX AC supercont1.187; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_AA_; KW BEL-11_AA-I; BEL-11_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-509 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.187; Positions 552631 553139. XX SQ Sequence 509 BP; 159 A; 102 C; 94 G; 154 T; 0 other; tgtgccgacg ggacccctcg tcgcagcaac gctgtcaatc gcggcgcttc gttcatcgat 60 acgtctactg acagtgacgt tctatgagga atgtcatgct gatcgggctg tcactgaatg 120 atattcaaaa ctactttgtc atcctgaaag aagatacggc agtgccacaa atttattgaa 180 tttctatttc taatccttat cactaaagtc gaagctgaat ttaattgaat tgtataaaac 240 tgttgcttaa tagtaagtag tacggctaaa tctttggttt gtttaaaatc tcaaaagtaa 300 ttatttaggc acaggaaatc tcaccgagaa atctttgcga cccacacctt ctaaatctaa 360 tcggttaacc aaaaatgtaa gtcgaatttg agtccctaaa atggattgta accactgatg 420 caaataaata tttcttttta gcttaaagct aacgtacaac aaattctgcg tctgctataa 480 ggagttggta gtccgaagtt aacccaaca 509 // ID CR1-24_CQ repbase; DNA; INV; 4297 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-24_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4297 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 28-28 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 1..4164 FT /product="CR1-24_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MEDPRLSVTVVPPHSRNXAXCTSHPSRPGPVFGPRRG FT VFQNRSVGKYPYAENDLVSLPDARLTSSNVSTNESIRQDSAYRSQDAASQQ FT SNDNADPGRPPTCYMEDLRLSVTVVPPHSRNYAFCTSHPSRPGPVFGPRRG FT VFQNRSVGKYPYAENDLVSLPDARLTSSNVSTNESIRQDSAYRSQDAASQQ FT SNDNADPGRPPTCYMEDLRLSVTVVPPHSRKYAFCSSHPSRPGPVFGPRRG FT VFQNHLVGKYPYAENDLVSLPDAHLTSSNVSTNESLRPDSAYRSQDAASQQ FT STDNADPGRPPIRIMEDPDPPDTVVPPHCCDSSRSGHQSRPGPVFGSREGV FT FQTPNAGKYQAAFQVGYALDPDPRLLSSDVQTTPVHSDDPTTPVHHDNRRS FT SAGQPVIFEPIRFYYQNVRGLNTKVDDFFLSCCDCDHDVILLTETWLDGRV FT TSAQLFGNNFSVYHTYRNPANSKRKVGGGVLIAVRNSLDSVECVQAVAENL FT EQMFVTIRTPSKRIFVGCWYLPPEKRFETELMEKHLSCIENIRRIAGPHDV FT IVVAGDYNQGSLVWKHLSGNRSYVDPQLTRTRTEKERLTHGVMLDGMANCN FT MHQCNLVKNDRNRILDLIFVSDDDCTVVAAADEPLVPLDAPHPALVFDIDA FT SRTVFVQVDNNDRVLNYKRTNFDELQRTLADVDWTPVTSAMDVDTAVREFS FT NLLNFHLAQHTPQFRTPPKPEWSNARLKRLKKERAAALKTFTLRRCRMTKL FT AFNAASSRYRRYNRYRYNVYIRKTQSQLRRNPKKFWTFVKSKYKERGLPAT FT MALGDSTATTNDEKCELFARHFSSVFREPEIISSTHHLDAVPRDLVDINTF FT VISRDMIEKATKKLKSSFSPGPDGIPSAVLKRCANQLIEPLLVIFNLSLRQ FT ATFPTAWKKSFMFPIFKKGSKRSIENYRGITSLCSGSKLLEIIVSDVILFN FT CRSYVAPEQHGFMPKRSVNTNLLEFTSFCIDNIAEGRQVDAIYTDLKAAFD FT RIDHDIALQKFSKLGFSPMLCRWLESYLKGRIIQVKIGASVSLEFTNGSGV FT AQGSNLGPIVFSGFFNDANFLFSGNCKLSYADDFKLFIPIDTLADCYVLQS FT RLDKFSEWCGHNRLELSSAKCYVISFNRKQSRELFTFDYHLGSHVLERLEV FT VKDLGVLLDLKLDFHLQLSSVIDKANRKLGFMFKAAQEFDDPLCLRTLYCA FT LVRSHLETSAVVWAPYHQNWIDRIERVQRKFVWFALRRLPWNNPAQLPSYE FT TRCNLLGIETLQNRRTISRAVFAAKVITSEIDSPNLLFQLNARVQPRNLRP FT TPGFLSRPLVRTVYAENSPIRALATAFNEFYHLFDFHEPVSRFLERLRTEL FT RDRSRSSRPAENPQQRASRTRRSSR" XX SQ Sequence 4297 BP; 1059 A; 1213 C; 954 G; 1069 T; 2 other; atggaagatc cccgcctctc cgtcacggtc gtgccaccgc attcccgcaa ttwcgcctwt 60 tgtacgagcc atccaagtcg tcccggcccc gtgttcggtc caaggcgtgg ggtcttccaa 120 aatcgatccg taggcaagta cccgtatgcc gaaaatgatc tcgtctcgct ccctgatgcc 180 cgcttaactt ccagcaacgt ttcaacaaac gagtcgattc gtcaggattc cgcgtatcgc 240 agtcaggatg cagcttctca acagtcgaat gacaacgctg acccgggacg cccgcctaca 300 tgctacatgg aagatctccg cctctccgtc acggtcgtgc caccgcattc ccgcaattac 360 gccttttgta cgagccatcc aagtcgtccc ggccccgtgt tcggtccaag gcgtggggtc 420 ttccaaaatc gatccgtagg caagtacccg tatgccgaaa atgatctcgt ctcgctccct 480 gatgcccgct taacttccag caacgtttca acaaacgagt cgattcgtca ggattccgcg 540 tatcgcagtc aggatgcagc ttctcaacag tcgaatgaca acgctgaccc gggacgcccg 600 cctacatgct acatggaaga tctccgcctc tccgtcacgg tcgtgccacc gcattcccgc 660 aaatacgcct tttgttcgag ccatccaagt cgtcccggcc ccgtgttcgg tccaaggcgt 720 ggggtcttcc aaaatcacct cgtaggcaag tacccgtatg ctgaaaatga tctcgtctcg 780 ctccctgatg cccacttaac ttccagcaac gtttcaacaa acgagtcgct tcgtccggat 840 tccgcgtatc gcagtcagga tgcagcttct caacaatcga ccgacaatgc tgacccggga 900 cgcccgccca tccgcatcat ggaagacccc gaccctcccg acacagtcgt gccaccgcac 960 tgctgcgact catcacgttc cggccatcaa agtcgtcccg gccctgtgtt cggctctagg 1020 gaaggggtct tccaaacacc aaacgcaggc aagtaccaag ctgcttttca agtcggttac 1080 gcgctggacc ctgatccccg cttgctttcc agcgacgttc agacgacgcc cgtacacagc 1140 gacgatccga cgacacccgt acaccacgac aatcgacgtt ccagtgctgg tcaacccgtt 1200 atcttcgaac ctatccgttt ctactaccag aatgtacggg gtctgaatac gaaggtcgac 1260 gatttcttcc tgtcatgctg cgactgtgat catgacgtaa ttctcctaac cgaaacctgg 1320 cttgatggtc gagtcacctc ggcccaactg tttggtaaca atttctctgt ctaccataca 1380 taccggaatc ctgcaaacag taagcgaaaa gtcgggggcg gtgtccttat cgccgtgcga 1440 aattctctcg attctgttga atgcgtccaa gctgtagccg agaatcttga acagatgttc 1500 gtcacaattc gcactccgag taagagaatt ttcgtcggct gctggtatct tcccccagag 1560 aagcgttttg agaccgagct gatggagaag catcttagct gcatcgaaaa catacgtcgt 1620 atcgccggac cgcatgacgt cattgttgtt gcaggtgact acaaccaagg aagcctagtc 1680 tggaagcacc tttctggtaa ccgatcctac gtcgatccgc agctcacgcg tactagaacc 1740 gagaaagaac gactcacgca cggagtaatg ttagatggaa tggctaactg caacatgcat 1800 caatgcaatc tggtgaaaaa cgatcgtaac cgcatcctag atctgatatt tgtcagtgac 1860 gacgactgca ccgtagttgc tgctgctgac gagccactcg tgcctctgga cgccccccat 1920 ccagcgctgg tgtttgacat tgacgcatct aggaccgtgt ttgtacaggt agacaacaac 1980 gaccgggtat tgaactacaa acgaactaac ttcgacgaac tgcaaaggac tctggcggat 2040 gttgactgga ccccagttac ctctgccatg gacgtagaca cagctgttcg ggaattttca 2100 aacctattga attttcatct tgcccaacac actcctcaat tcagaactcc gcctaaaccc 2160 gaatggtcga acgctcgtct taaacgcctg aaaaaagaac gtgccgctgc actgaaaacg 2220 ttcaccctgc gccgctgccg tatgacgaaa ctggccttca acgccgctag ttcccgttac 2280 cgccgataca atcgttacag gtacaacgtc tacatccgta agacccaatc gcaactgcgg 2340 cgaaatccca agaaattctg gacttttgta aaatcgaagt acaaggaacg tggccttcct 2400 gcaacgatgg cgttgggcga ttcgactgca actacaaacg acgagaaatg cgaactattt 2460 gccagacact tttcgtccgt attccgggag cccgaaatca tttcttccac tcatcatctt 2520 gacgctgttc cccgtgatct ggtcgacatc aacacgtttg tcatttcgcg tgacatgatc 2580 gagaaggcga cgaagaagtt gaagtcatca ttctcaccag gacccgacgg aatcccttct 2640 gctgttctca agcgctgtgc aaatcaactg atcgagccac ttttggttat atttaacctt 2700 tccttgcgtc aagccacatt tccaacagca tggaagaaat ctttcatgtt tccaattttc 2760 aagaaaggtt caaagcgttc cattgaaaac tatcgaggga tcacgtcttt gtgttccggt 2820 tctaagcttc tagaaattat cgtcagcgat gtgatcctgt ttaactgtcg aagttacgtt 2880 gcaccggaac aacatgggtt tatgccgaag agatcggtga acactaacct gctcgagttt 2940 acatcgttct gcattgataa cattgccgag ggacgccaag tcgatgcaat ctacacggac 3000 ctgaaagctg cttttgatcg gattgaccac gatattgcac tacaaaagtt ctccaaactg 3060 ggtttctcgc cgatgctatg ccgttggctt gaatcttatc tcaagggtag aatcatccag 3120 gtgaagatcg gcgcatccgt ttcactggaa tttaccaacg gctccggcgt ggctcagggg 3180 agcaacctcg gcccaatagt tttttccggg ttcttcaacg acgccaattt cctgtttagc 3240 ggaaactgta agctatccta cgctgacgac ttcaagctgt tcatccctat cgacacgttg 3300 gccgactgtt acgtgctaca atcaaggctg gacaaattct cggagtggtg cgggcacaac 3360 cgcctggaac tcagctccgc aaaatgctac gtcatatcct tcaaccggaa acaaagtcga 3420 gaattgttca cattcgatta ccatctcggt tctcacgtgc tggaacgcct agaagttgtc 3480 aaagatttgg gtgtacttct tgacctgaaa ttggattttc acctgcaact atcttccgta 3540 attgacaaag ccaaccggaa acttggtttt atgttcaagg ctgctcaaga atttgacgat 3600 ccgttatgtc ttaggacatt gtactgtgct cttgtgagat cgcaccttga gacgtcagca 3660 gttgtctggg caccctatca ccagaattgg atcgatcgca ttgaacgtgt acaaaggaaa 3720 ttcgtctggt ttgctctgag acgcctcccg tggaataatc ctgctcagtt accaagctac 3780 gaaactcggt gcaacctgct tggaatagag acgctccaaa accggcgaac aatctcaagg 3840 gctgtattcg ctgcgaaagt tattacctct gaaatcgact cccctaatct tcttttccag 3900 ttaaatgcca gagtgcagcc aaggaatcta cgcccaaccc ctggattcct ctcacgcccg 3960 ttggtccgta cggtttatgc ggaaaactcc cctattcgtg cgctggccac agctttcaac 4020 gagttctacc atctgttcga cttccacgaa cctgtgagcc gatttttgga aagacttcga 4080 actgaactcc gagatcgttc tcgttcatct cgtcctgctg aaaacccgca acaacgtgca 4140 tcaagaacaa gaagatctag ccgttaaatc atgttttgtt taaatttatg ttttgataac 4200 ttgactatat ttatgtttgt aaagttctta ttagttttaa gcaaccaaac ttcattgaga 4260 ccaccgggtc agatgatttc aaataaacaa atcaaac 4297 // ID Gypsy-9_DVir-I repbase; DNA; INV; 4183 BP. XX AC scaffold_13045; XX DT 10-MAR-2011 (Rel. 16.03, Created) DT 10-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_DVir_; KW Gypsy-9_DVir-LTR; Gypsy-9_DVir-I. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-4183 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (09-MAR-2011). XX DR Genome; scaffold_13045; Positions 1191762 1195944. XX CC Positions [1477-1902] - Reverse transcriptase CC Positions [3213-3470] - Integrase core CC 'CATA' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1114..2685 FT /product="Gypsy-9_DVir-I_1p" FT /translation="MDKYLKHEVLIGREVLSQGFGVSIDADKFCFYKTKRV FT NVLSLVKYEEIINSDTNSNYSDKTILNNMLRCHSDRFIDGIPMTRVTAGQL FT EIRLIDPQKTVQRRPYRLSEDERKHVRCKIDELLTSKIIRLSCSPFASPMM FT LVKKKNGTDRLCVDYRVLNGNTIADKYPLPLISDQIPRLRGARYLSCIDMA FT SGFYQIPVHPDSIERTAFVTPDGQYEFLTMPFGMKNAPSVFQRAIIKALGP FT LAHTFAIVYIDDVMVVAKTKDEALERLEIVLDTLSKAGFSFNITKCSFLKT FT KIEYLGFEITNVEIRPNARKIQALIGLPPPETVTQLRQFIGLASYSRQFIP FT KFSEILKPLYMLTSKNNTNFGWLSEHEQIRLKIIGVLTQAPVLTIFDPQYL FT IELNTDASSIGYGAILSHKIENKSYVVEYYSKCTSSAEAKYHSYELVTLAV FT VNAMKHFRHYFQGRKFVVYTDCNSLKASRTKMEITPRVHRWWGFLQSFDFD FT LEYRKGERMAHVEVTDNWLVSEQHEQHQ" XX SQ Sequence 4183 BP; 1383 A; 730 C; 942 G; 1128 T; 0 other; tcagaagtgg gattagctcc gccaaaagag gacagctctt acttattaag ggcagcctta 60 gaagagcaaa atcgtaatct aatggaaata ataaaatcca tgcaggcacc aagagcaccg 120 acaatagacg ggagagcact ccacgtcact ctacccaagt tttgccctga caccgctata 180 gcagatgcat cagcttggtg tactacagtg gacctaatct tcgcggataa tgctctcgaa 240 ggcagtgcgc ttgtgattgc gttaagcaaa gcattagaag gcagtgcatc gcaatggctg 300 tcccaaatat gcttcgctgg catcacatgg ccgcagcttt aggaactctt catccaacgt 360 ttcgtgggca ttgagacaac ttctgccata ctgatgaaca ttttgaacgg acgtcccaaa 420 actggagaga gttttgctga gtacggcagt cgcatagtca ccctattgct ctttaagtgg 480 aaggcaaaga atttggagga gatcgctgta tcagtagcgt tagcccacat ggctcaaatt 540 gataataatt tgttgcgttg gctgttaaca acgaatgtga cgactcgtaa tgagctgcag 600 cagcagctac aggctttcaa gaagcgtaac aacgaggagg acggatattc gactggtccg 660 gaaaggctgg ccacaaatat gcagaatgcc gctctcgaat gggtaaagca tcagccccaa 720 cagaaaggag ttatagcgtg agcagcacgt ccggagtgaa ggatcgatca aaaattaaat 780 gttttaagtg cgatgaaatc ggacatgtgg cttctgtatg ccccaaaaat cgtactggtg 840 gaaacagcaa gagctacgag aagcgacctg acgtttgtac agtagctcca ccaagtggac 900 caatacctta tccggtgggt ctattccata ttgtttcgat tctggagccg aatgttcact 960 ggtcaaggag actgttgtcg aaaaattcac cgaaaaacga tttcacaata ttgtaacttt 1020 aaatggcata acgattcaat ttgtagcaca ttgcaagttc tgagtaatat taatattgaa 1080 caatattgtt tacaagtatt ataacatgtg tgtatggaca aatatttgaa gcatgaagtg 1140 ttgattggcc gtgaggtttt aagccagggc tttggtgtat caattgatgc agataaattt 1200 tgcttttata agacaaaaag agtaaatgtt ctgagtctag tgaagtatga agaaatcata 1260 aattctgata ccaattcaaa ttattctgat aagacgatat tgaacaatat gttgagatgt 1320 cactctgata ggtttatcga cggaattcca atgactcgtg tgacggcagg ccaattagaa 1380 attaggctaa tagatccaca aaagactgtg caacgccgac cttataggct tagcgaagat 1440 gagaggaagc acgttcggtg caaaattgat gaattgctta catcaaaaat aattcgtctc 1500 agctgttcac cattcgctag tcccatgatg ctggttaaga agaaaaatgg cacagatcgt 1560 ctttgtgtag attaccgcgt tttaaatggc aatacaattg ccgataaata tcctttgcct 1620 cttatttcgg atcaaatacc gcgtcttcgt ggtgcaaggt acttgtcgtg tatagatatg 1680 gcaagtggat tttaccaaat cccagtacac ccagattcaa ttgagcgaac ggctttcgtt 1740 acacctgatg gacagtacga attcttaacc atgccctttg gtatgaagaa cgcaccttct 1800 gtgtttcagc gtgcaataat aaaagccctg ggtccgttgg cacacacatt cgcaattgtt 1860 tacattgacg atgttatggt agtagctaaa acgaaagatg aagcgttaga acgactagag 1920 atagtgttgg acaccttaag taaagccgga ttttcattta atataacgaa atgttcgttt 1980 cttaaaacta aaatcgagta ccttgggttt gaaataacga atgtagaaat tcgtccaaat 2040 gcacgaaaga tccaggcatt gatagggttg ccacctccgg aaactgtaac tcaattacga 2100 cagttcattg gtttagcttc ttactctcgg caattcattc caaaattttc cgaaatattg 2160 aaaccattgt acatgcttac ttcaaagaat aatactaatt ttggttggtt gtcagagcat 2220 gaacaaatac gtttgaaaat aataggtgta ttgacacagg ccccagtatt aactatattt 2280 gatccacaat acctaataga attgaacaca gatgccagct ccattggtta cggggcgata 2340 ttgtcacata aaatcgaaaa taaatcctat gtagtggagt attatagcaa atgtacttct 2400 tcggctgaag caaaatacca ctcgtacgaa ctcgtgactt tagcagttgt caatgctatg 2460 aagcatttta ggcattattt tcaaggtcgt aaatttgtcg tttacacaga ttgtaactct 2520 ttaaaggcca gtcgcaccaa aatggaaata acaccgagag tgcatagatg gtgggggttt 2580 ctacagtcat ttgatttcga tctcgaatat agaaaagggg agcgaatggc gcatgtcgaa 2640 gtgaccgaca attggttggt gtccgaacag cacgaacagc accagtaacg agttaccgga 2700 aaatttaagt aaaacatttg aattaaggaa gggcatattg catagaaaaa tacagagaaa 2760 tggtaaaaca aggtgcttgc cggtaattcc aagggcgttt aggtgggcag tgatcaacca 2820 cgtacatgaa tctgttatgc atttggaatg ggaaaaaaac attggatgag gtatatggat 2880 attactggtt cgagaatatg tccaagtatg tacgtagatt tgtggaaaat tgtatcacgt 2940 gcaaactagc gaaaccccca aatggaaagg tccaggcaga attgcataca attcctaaga 3000 tagagatacc atggcacaca gtacacatcg acataacgag taaaatgagt ggcaaaaacg 3060 acttgaagga atacgtcatt gttttaatag atgccttcac ctagtttgtt tatttttacc 3120 acacactgaa tatagacact gaaagttgta ttaaaactgt gaaatcggtt gtggcgctat 3180 ttggggtgtc aactagaata atagcggact aaggtcgttg ttttgctagc tcaggctttc 3240 gcaatttctg ttccgctcaa aaggttcaac tgcatttgat cgctaccgga gctagtaggg 3300 caaatgggca gcttgaacgt gtgatgagca cacttaagac aatgctgacg gcagttgaaa 3360 cgagtcagag atcatggcag gacgcattgg ctgaagtaca actggccatg aattgtactg 3420 caaatcgagt caccaagttt agcgctttag agctattaat aggaaaagaa gctaggccgt 3480 ttggtttatt gtcaataaac gaagaagaca atgatgtaga tagagaacaa attaaaattc 3540 aagccaagga aaacatggaa aataatgcaa gatatgacaa acaaagattt gacaaaaata 3600 aggcaaagat tatgaagcat aacgtaggtg atcatgttac tgaaaaacga agaacggcat 3660 caaacaaagc tcgaccctaa atttaaaggt ccatttgagg tgatcaaagt cttagacggc 3720 gatcgatata aactaaaatc attggttaat aagcgaacgt ataaatattc acatgagtgg 3780 ctaagagcgc taccaagcag acaaataatt aactaattaa acgatgaatt cgatggccta 3840 agtgattggg tgtgattggg tgtaagtgtg cacaccaaat taaattgaga aactgagaga 3900 caattctaag ttatgtaaaa tctgaattat tggaattaaa tgatatgaat aagatcatga 3960 agttgaaaaa gtattatgtt tttaaacaaa aggaaattgt gtttcttaga taagaaaaat 4020 tgtcagttaa tgtttgtgta agcgactttg taagtttgta aattgtaaga gaaaataatg 4080 ttattaaaaa caagagatat cataattaaa agagtctggt atgtgacttg gacatgggtg 4140 gttgatgtag catacgagga cgtgtgatag gtcaggaaga ccg 4183 // ID Transib-9_HM repbase; DNA; INV; 3734 BP. XX AC . XX DT 30-JAN-2008 (Rel. 13.01, Created) DT 30-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3734 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 9-9 (2008). XX DR [1] (Consensus) XX CC Transib-9_HM is a family of autonomous Transib DNA transposons CC that were active in the hydra genome more than a few million CC years ago (copies are ~8% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of a few copies copies; it codes for a 678-aa CC Transib transposase (3 exons). Like other Transib transposons, CC Transib-9_HM is characterized by 5-bp target site duplications CC and short terminal inverted repeats. The Transib-9_HM transposase CC forms a distinctive cluster together with Transib-1_HM and the CC sea urchin RAG1-like protein (GenBank, NP_001028179). CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(751..987,1202..2417,2503..3083) FT /product="Transib-9_HMp" FT /note="Transib transposase." FT /translation="MVLYTEQEVFKIILDNNVNSVQALITVFKMLENINDI FT DEHSELVLTNTFRQFSNLKKSMFIAKRIKIDKEALVKLGKVEVLLSFNEDM FT INQEDDYNSEEDTQRNRKKWDSISRKSKIRRTEELWKNVNKLAEIENIDLK FT EVCDFLLSRCSSSSCVKTTQITTVTAVSICCDSLLGRSAYRKLRKMLRANN FT IDVYPSWKEVQDFKNSITPLRTVPLPEPYIGVYSTFTESIKITVAQILKLE FT NVHITNELTNLVLEIKFGYDGSGGHKMYNQLHNVNTNNIIMAMFCPLKLLN FT SNADGNNLIWEQISLNSDFSQRPLMLQMGKESRENAQSLTIFNAEIKTMKE FT EGFIIIFNGQYLNIKVNIVADSFDRKASNIYQGLGGAYCDLCDLSKEQCED FT ICTIEQGFVINRDIETMEQIFLDLVQKDGSLTKKSKDYDVRHGQTASPIAG FT SNVRSVQVLHALLRGFDHFMKVVVHLAAGVHAWTESKTANSSGQGGSTTTG FT NVARSLLYNEKNRKLITDYISNKHARMLAERYGLLLSNIIRAMSSSQKINI FT KKYRDICKETNILLRQEMPWVSLTPTIHKLLGHTWELIELNNDTGLKSWSE FT EGMEANNKQLRIFREKLSRKTNQLQNLEDCFKRLWVGSDPLVTEQRNKGRI FT QCKLCSEIGHSKKTCFKVHENDIVNTLFM" XX SQ Sequence 3734 BP; 1383 A; 546 C; 609 G; 1196 T; 0 other; cacagtggtc cgtttgggct gttttgggga caaatttata taactatttg aaggcaggtc 60 aaaattccct cagattgcgg cctatgtgtt aacattaggg tagcaatcgg gcaaaaaatg 120 ggatgttaat ccgcccaggg cgtcactttt ttatgggtgt ttaaaaaaaa cttttttttt 180 tcttttttct agtttgaaaa tttcctaaca gcactacctc cttcctttac ttgcttgcat 240 tttccataaa agagcggcaa aaaccatttt ccaacccctg caattactgt gtatacagcc 300 taaattttta tgccgaatca aactttttaa ttatgtaatt ttgtaaacca taaataatga 360 tttgtattta caatatgaaa atattgtata gtattttatt tatgttgcgt agttataact 420 acgattattc atgcaagcta tagcttgcat gaaacacagg aattaggtaa tgacattcat 480 cactgaatta aaacttccta tttacgaatt ctagcaatag tttatattga attaaaaccc 540 tcaaaaatta acattaaaag ccatatatac caattaacaa taattaacaa taaaaactgt 600 ttaaccacat aatcgtcacc aaatggctta aagcttttta caataaaatt attgaattca 660 gaataagaac aaacattaaa cactgaacac gttaaaaaaa ccttttagta aaagtatatt 720 actttgtttc tggtaatttc tctgttcaat atggttctct atacagaaca agaagttttc 780 aaaataatat tagataacaa tgttaattca gtacaagcat taattacagt gtttaaaatg 840 ttagagaata ttaacgacat tgatgaacat tcagagttag ttttaacaaa tacatttagg 900 cagttcagca acctaaaaaa atcaatgttt attgcaaaac gtatcaagat tgacaaagaa 960 gctctagtaa agctgggaaa agtggaggta attacaaact tcaaacatat atgtaaatta 1020 aataataaaa tgacataatt attatagaat attttttaat tagtgcagat attactgcat 1080 ttataaacaa aacttaattt tgacgaaaat cattattatt acttttgtga ttataaatca 1140 taaaatttga tattactaat aataataatt aaaattttta gtgttataat ttcatatata 1200 ggttttgctg tcatttaatg aagatatgat aaaccaagaa gatgactaca attcagaaga 1260 agatacgcaa agaaatagaa aaaaatggga tagtatttct cgaaagtcaa aaattcggag 1320 aacagaagaa ctctggaaaa atgtaaacaa acttgctgaa atcgagaata ttgatctgaa 1380 agaggtttgt gatttcctat tatctagatg ttcatcctct agctgtgtca aaactaccca 1440 aataacaact gtgacagctg tttcaatctg ctgtgattca ctgctgggtc gatcagcata 1500 cagaaaactg aggaaaatgc tgagagctaa caatatcgat gtttacccaa gctggaaaga 1560 agttcaagac ttcaaaaatt ctattactcc tctgagaacc gttcccctac ctgaaccata 1620 tattggagtt tactctacct ttacagagtc tatcaaaatc acagtagccc aaattcttaa 1680 gcttgaaaat gtccatataa caaatgaact aacaaattta gttcttgaaa ttaaatttgg 1740 atatgatggc agtggaggac ataaaatgta taatcagcta cacaatgtaa ataccaataa 1800 tataatcatg gcaatgtttt gccctctgaa acttttaaac tctaatgcag atggaaataa 1860 cttaatatgg gaacaaatat ccctaaactc agatttttca cagaggcctc ttatgcttca 1920 aatgggaaaa gaatcacgtg agaatgcaca gtctttaacc atttttaatg cggaaatcaa 1980 aacaatgaag gaggaaggtt tcattattat cttcaatgga caatatttga atattaaagt 2040 aaacattgtt gcagacagtt ttgacagaaa agcatcaaat atatatcagg gccttggagg 2100 tgcttactgt gatctttgtg atttatctaa ggaacaatgt gaagatatat gtacaattga 2160 acagggattc gttataaata gagacattga aaccatggag cagatatttt tggatttagt 2220 tcaaaaagac ggctctttaa ctaaaaagtc aaaggattat gatgttcgcc atggacaaac 2280 agcaagtcca atagctggat ctaacgtaag gtctgtacaa gtactccatg cattactccg 2340 tggatttgac cactttatga aagttgtggt tcatttagct gcaggagttc acgcatggac 2400 agagtctaag acagccagta ctacgcactt ccttgaaata gagaaaaaat atatacagga 2460 gttaattggc aaaagaaaca ggaatcaagt gggattttgc agattcatca ggtcaagggg 2520 gtagtactac aactggcaat gtagctagaa gtcttcttta caatgagaaa aatagaaaac 2580 tgattactga ctacatttca aataaacatg ccagaatgtt agcagagcgg tacggtctct 2640 tgttgtctaa tattattcgt gctatgtcat ctagtcagaa aattaacata aaaaaatata 2700 gagacatatg taaggagaca aacattcttc ttcgacaaga aatgccctgg gttagtttga 2760 caccgactat tcataagtta ttaggacata cttgggagct tattgagtta aataatgaca 2820 caggacttaa atcatggagc gaggaaggca tggaggccaa caacaaacaa cttagaatat 2880 ttagagaaaa actatcaaga aaaacaaatc agttgcagaa cttggaagat tgttttaaaa 2940 gactctgggt gggttctgac cctttagtta ccgagcaaag gaacaaaggt cgaatacaat 3000 gcaaattgtg tagtgagata gggcatagta aaaaaacatg tttcaaagta catgaaaatg 3060 acattgttaa tactttgttt atgtaaaata aaaaaattaa acttgagcgt ttgcttgtat 3120 ttttttagtc tttttagaaa tttagataaa atatctagtg cattatgctt aaagaaacaa 3180 taaaacatag tttttataac cacaaataac ttaaaaaccg ggtgtttggt aaatatatgt 3240 ggcataccaa attaatggca acagtatgaa gtttccaaat aatatatggt ttagtattac 3300 caaatctccg gtttttgaga tatttgagga agtatacaaa atttaagaga tttttttttt 3360 gcggaaactg acttttccat tcaatttttt taaaagaaat gaaatttttt tattaataac 3420 gttaaaaaaa gaaaatgaaa ttaatacttt tattttgttc attcaatttc gtgagaaatg 3480 aaatattttt tttcattttt gtctataaat ctgctgcata taattcaaat tagaaaatgt 3540 atgtgaaaaa tctaactctg cggtatgaac gttttttaaa actttgaaag gttgtcattt 3600 cttaaccagg catgataatg acctctagtc ttttccattt aattcgaatt tcattgtagt 3660 ttcaaaaaat cataaaaaaa ttttagtccc gaaatataga ttttttcatt ttgtccacca 3720 tttggaccac tgtg 3734 // ID BEL-77_AA-I repbase; DNA; INV; 6274 BP. XX AC supercont1.90; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-77_AA_; KW BEL-77_AA-LTR; BEL-77_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.90; Positions 1647596 1641323. XX CC Positions [5328-5888] - Integrase core CC 'CTAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 27..6274 FT /product="BEL-77_AA-I_1p" FT /translation="MSSGSVAAAGEFSRKKRDSPGGEGASCGSCRRVDNSR FT MVQCDECDVWYHYDCVNVDDSIRNHDWSCNGCLRTALEKQRCALSEQMKRL FT EQQQRQWQQEQQKRVDQMEERQKLEQLRKQQELQRQKQEQFYQTQTAQLQL FT QPQSLRFEAAISELEAPSKSLGVIPKQQQQRVTILEKAASKGVETNDRRDL FT EPLANSTVTNAKPRSRTDGASSKSSRRSTKLRLLQARALEARQSLEKKQLE FT ERLALEREMLETSDSEVESIISIEKINDWLEKTENMGEEAGISMNDTPLPI FT YDELLRKATVSSSQLRPTCSGATAGHSVQQQFIGPIDSIQSRIPRQQPGGS FT CTHAKHPQSGVNISKAYQPAFGEYPPALSGFASLPVHGVASQQPSSTAYPA FT ANHPRHVVQSMTSAVQGIGQPPLTTSTPRQNVESSQQQYPLQMSSDAVPLN FT SGHLAARQTVKDLPKFGGDPEDWPRFIAAYERTSRMCAFRNDELLDRLERS FT LHDKALNAVKSLLLHPDNVPVIMSRLKTLFGNPESIVETMVRRIRMMPPPK FT AEKMETIVDFGVAVQNLCATMQVCQMDERLYNVALLQELVDSLPSALKMKW FT AFHRKDKGAVTLLDFSDWIGELVDALCQVIHPAVRAKPQRSESKPERKVRR FT DEAFVHAHSTDPQEQSDRKECLACENECTSLEKCDSFLKMTPNARWTLVNQ FT KKICRKCLTKHFKACERRVPCNQGGCSFLHHRLLHDDSRHQRPSPSIPTQV FT ASVNTHSLLGRVLLKYVRVTVYGKGKSITTYAFLDAGSTSTLMEHSLWEEL FT NLSGEKSPLCISWTGGQGRYEKDSVVFSAEITGAHTPKQSYRLPEVHTVRS FT LDLPAQTLSVSTLAKKFHHLSGLPIESYVAVKPRILLGVDSSRLEYPLDSK FT EGDENQPTAVLTRLGWIVYGPCSAPGQPIPKKEVSYGYHICQCEGLHSTVK FT NYFSMDSLGVQLADKPLMSKDDERALELLQKNTIRRGKRYETCLLWRYDDL FT RMPESKAMALKRHECLNKRMAREPRLAKELHEKIMDYKAKGYIRKLSAQEE FT ATHTERSWYLPIFPVVNPNKPGKLRIVWDAAAKVGKLSLNSFLLKGPDLVM FT PLQHVLQRFREFRTAVTGDIREMFHQVRINPEDQHCQRFLWNDGVPGKVPS FT TYAMQVMTFGASCSPSSAQYVKNLNAERFRSQFPDAVEAICNGTYVDDMLC FT SVESEAEAVRLAQDVRAIHAEGGFEIRGWLSNSEKVVEAMGESKLIQKDLN FT KNGELSTEKVLGMWWHTTNDTFTFKIPKRCQHELLSGAQVPTKREVLRILM FT SVFDPLGLLANVLMFLKVLLQEIWRSKIGWDEPITDSQLERWRTWLSVLHQ FT VESVSVPRCYRTITTSSVKTNEIQLHVFVDASENGYAAVAYFRFEERSTIE FT CSFVTAKTRVAPLKYVSIPRLELQAAVIGTRLAKDIAETHRIPIRKRFFWT FT DSRDVLCWLCSDHRRYSKFVGARIGEILENTGHSEWFWIPTRENVADEGTK FT WQKLPDLSPSSRWFKGPDFLWRSYAVWPTQPATHGITTTEINASVNVHAVH FT EPILNFAKFSSWRKLLRVTAYVFRFISNIRARLQNRPSSTGILDQPELIAA FT ERCIYRQAQLEVFGEEISQLKLSTNAEGKSVAPIPKSSSLYKLSPFLDIHG FT VLRASGRTAGCQFIDEDAVHTIILPKKHSVTTLVVQFTHERYHHLNHETVV FT NELRQHYWIPQLRRICYKVRLHCQYCKNAGARPQPPVMADLPPARLAAYSR FT PFSHTGIDFFGPMQVIVGRRIEKRWGVLITCLVVRAIHIEIAHSLNTSSCI FT MALRNFMARRGTPVELFSDRGTNFVGANRELSEALRALDKDRLMEEFVTPN FT TKWTFLPPSSPHMGGSWERLVQSVKKILGNMHLPRTPTDEVLRNSLMEVES FT IINSRPLTYVPIEDAECEALTPNHFLLGTSGGTKPLVPYDDSPATLVNSWK FT TSQVFANMFWKRWLREYLPTITRRTKWHYPAKPIQIGDIVIIVDPDLPRNS FT WPKGRVVDVNLKNGQVRSATVRTPSNVYERPAVRLAVLDVGAIGSTHQPGS FT CVPGGT" XX SQ Sequence 6274 BP; 1723 A; 1551 C; 1652 G; 1348 T; 0 other; aattaaattt tccgtttaat tccggaatgt caagcggaag cgtagcggct gccggggaat 60 tttcccgaaa gaaacgggat tcccctggtg gtgagggagc gagttgcggg tcgtgccgtc 120 gtgtcgataa tagtcgaatg gtgcaatgtg acgagtgtga cgtgtggtac cattacgact 180 gtgtgaatgt ggacgacagc atccggaatc acgattggag ctgtaatggc tgcctaagga 240 cagcactgga aaagcagaga tgtgctttga gcgagcagat gaagcgactt gagcagcaac 300 agagacagtg gcaacaagag caacagaagc gtgttgatca gatggaggaa cggcagaagc 360 ttgaacagct gcggaaacaa caagagttgc agcggcagaa gcaggagcag ttctaccaga 420 cgcagacagc acagttgcag ttgcagccac agagtttgcg gttcgaagca gctatatcag 480 aacttgaagc tccttcaaaa tcgcttggtg ttatcccgaa gcagcagcag cagcgtgtca 540 cgatcttgga gaaagctgca tcgaaaggag tcgaaacgaa tgatcgacgt gatctggaac 600 cgttggcaaa ctcaaccgtc actaatgcga aaccacgtag cagaaccgac ggagcttcat 660 cgaagtcatc gaggcggtca acaaaacttc ggcttttgca agctagagct ctggaagcga 720 gacaatcgtt ggagaagaag cagctagaag agcggttagc tttggagcgt gagatgctcg 780 agaccagcga ttccgaggtg gagtcgatta tcagtatcga aaagataaat gattggctgg 840 agaaaacgga gaacatgggg gaagaagcag gaatttcgat gaacgatacg ccattgccga 900 tatacgacga attattgcgg aaagcgaccg tgtcatcatc tcaactacgt cccacatgca 960 gcggtgctac agctggacac tccgttcagc aacagttcat cgggccgatt gattcgattc 1020 agtcacgcat tcctcggcag caacctggtg gaagttgcac tcacgcgaaa catccacagt 1080 ctggcgtaaa catctcaaaa gcttaccagc cagcgtttgg tgaatatcca ccagcattat 1140 ccggtttcgc ttcgctacct gtacacggcg tagcatcgca gcagccatcg tcaacagcct 1200 acccggcggc aaaccatcca cgccacgtcg ttcagagcat gacgtccgct gttcaaggta 1260 tcgggcagcc acccctgacc acatcaactc cgcggcaaaa tgtcgagtcc tctcagcaac 1320 aatacccact gcagatgagt agcgatgctg taccgctgaa cagtggtcat ttagcggcga 1380 ggcagacggt gaaagattta cccaagttcg gaggtgatcc agaggattgg ccgcgattca 1440 tcgctgccta cgagcgcact agcaggatgt gtgcattccg gaacgatgaa ctgctcgatc 1500 gtctggagcg tagtctgcac gataaagcgc tcaacgccgt taagagtctg ctgttgcatc 1560 ccgacaacgt tccggtgata atgagtcggt tgaaaactct cttcggcaat ccggaatcca 1620 tcgtcgagac tatggtccgg agaatccgta tgatgccacc accgaaggca gaaaaaatgg 1680 aaacaattgt cgacttcggc gttgcagtac agaatctgtg tgcgactatg caagtgtgcc 1740 agatggacga gcgactatac aacgtcgcat tacttcaaga gcttgtcgat agcctgccat 1800 cagcgttgaa aatgaagtgg gcgttccatc ggaaagataa aggagctgtc acactattgg 1860 acttcagcga ctggatcggt gagctggtgg atgccttatg ccaagtgatt catccagcgg 1920 tgagggccaa accacaacgg tccgagagca agcccgagag gaaggtaagg agagatgaag 1980 cctttgtcca cgcgcattcg acagacccgc aggaacagtc agataggaaa gaatgtctag 2040 cgtgtgaaaa cgagtgtact tccctagaaa aatgcgacag ctttctcaaa atgacaccga 2100 atgcacgatg gacgctagtg aaccagaaaa agatctgccg gaagtgcctt accaagcatt 2160 tcaaagcctg tgagaggagg gttccctgca accagggtgg gtgtagcttc ctgcatcatc 2220 ggctgttgca tgatgacagt agacaccaaa ggccgtcacc cagcataccg acgcaggtcg 2280 cttcagtcaa cactcacagc ttgctgggaa gggtactgtt gaagtacgtt cgtgtcacgg 2340 tatatggcaa aggtaaatca atcaccacat acgcatttct cgatgccgga tcgacttcaa 2400 cgctgatgga acacagctta tgggaggaac taaacctcag cggagagaaa tcccctttgt 2460 gcatatcgtg gactggcgga caaggccgat acgagaaaga ttcggttgta ttctctgcag 2520 agatcaccgg cgcccatacg ccaaaacaaa gctaccgtct cccggaggtg cacacagttc 2580 gaagtcttga tcttcctgca caaacactgt cggtatcgac tctagcaaag aaattccatc 2640 acttgtcggg tttaccgatc gaatcatacg tcgcagtgaa accacgaatt ttgctcggcg 2700 tcgacagttc cagactggaa tatccgctcg actccaagga gggtgacgaa aatcagccga 2760 cagcagtcct gacgcggctg ggttggatag tttacggtcc gtgctcggcg ccagggcaac 2820 caatcccgaa gaaagaagtt tcttacggct accatatttg tcaatgcgag gggcttcatt 2880 cgaccgttaa aaactacttt tccatggaca gcctcggagt acaacttgcc gataagccac 2940 tgatgtccaa ggacgacgag cgagcgcttg agctgctgca aaagaatacg attcgccgag 3000 gaaaaagata cgagacgtgt ctgctgtggc gttatgacga ccttcggatg ccggagagta 3060 aagccatggc cttgaagcga cacgaatgct taaacaagcg catggctcgc gaaccaaggc 3120 tagccaagga attgcatgag aaaatcatgg actataaagc caaagggtat atccgaaagc 3180 tatcggcaca ggaggaagca acacacaccg agcgttcgtg gtatctacca atctttccgg 3240 tcgtgaaccc aaacaaacct ggaaagctgc gaattgtttg ggatgccgcc gccaaagtgg 3300 gaaagctttc gctaaattca ttcctgctga aaggccccga tctagtcatg ccactgcaac 3360 acgtcctaca acggttcaga gagtttcgaa cagcggtgac aggagacata cgcgaaatgt 3420 tccaccaagt ccgaattaat cctgaagatc aacactgtca gaggttcctg tggaatgatg 3480 gcgttcccgg aaaagttcct tcgacgtacg cgatgcaagt gatgacgttc ggtgcgagct 3540 gctccccaag tagcgcacag tacgtgaaga acctgaatgc agaacgattc agaagccagt 3600 ttccagatgc ggttgaggct atatgtaacg gaacgtacgt ggatgatatg ctctgcagcg 3660 ttgagtccga agcagaagct gtgaggcttg cgcaagatgt tcgtgcaata catgcagaag 3720 gtggattcga gatccgggga tggttatcaa actccgagaa agttgtggaa gccatgggcg 3780 agtcaaaatt gattcagaag gacctgaaca agaatggtga gctgtccacc gagaaagttc 3840 tgggaatgtg gtggcacaca accaacgaca cattcacctt caagattccg aagcgatgcc 3900 aacatgagct gctatccgga gcgcaagttc ccacgaagcg tgaagtgcta cggatcctaa 3960 tgtccgtctt cgacccactg ggactccttg cgaacgtgct tatgttctta aaggtactac 4020 ttcaagagat atggcgttct aaaatcggat gggacgaacc gataaccgac tcccaactag 4080 aaaggtggag aacctggctc agcgtgctgc accaagtgga gagtgtttct gttcctcggt 4140 gttatcggac aataacgacg tcgtcagtca aaaccaacga aattcaactg catgtctttg 4200 ttgatgccag tgaaaatgga tacgcagcag ttgcctattt ccgattcgaa gaacgaagta 4260 caatagaatg ttcatttgtc actgcgaaaa cacgtgttgc acccctcaag tacgtttcca 4320 tccctcggct ggagcttcaa gcggcagtga ttggtactcg tttggcaaag gatattgctg 4380 aaacccatcg aatcccgata agaaaacggt tcttttggac cgattctagg gatgttctat 4440 gctggctgtg ctcggaccac agaagataca gcaaattcgt cggtgccaga atcggtgaaa 4500 ttctggaaaa tactggacac tcggagtggt tttggattcc aacaagggag aacgtggcag 4560 acgaaggcac aaagtggcag aagcttcccg atctttctcc atcaagccgc tggttcaagg 4620 gaccggattt tctgtggcgt agttatgcgg tgtggccaac tcagccggca acccacggca 4680 taaccacgac cgaaataaat gcttcagtta atgtccacgc ggtacacgaa cccatcctaa 4740 actttgctaa gttctccagc tggcgcaagc ttttaagggt gacggcatac gtattccggt 4800 tcatcagcaa cattcgagcc agattgcaga atcgtccatc atctactggc attctcgatc 4860 aaccggagct gatcgcagcg gaacgttgta tatatcggca agcccaacta gaagtctttg 4920 gcgaggagat ctctcaactg aagctttcca cgaatgcaga aggaaagtca gtggcaccaa 4980 ttcctaaaag cagctcactc tacaagctat caccgttcct tgatatccac ggagtgctgc 5040 gtgcgagtgg acggaccgcc gggtgtcaat tcatcgatga agacgctgtt cacaccatca 5100 tactgccgaa gaagcactca gttacgacac tcgtcgttca gttcacccac gaacgttacc 5160 atcaccttaa ccacgaaaca gtagtcaacg agctgcgaca acattactgg attccccagc 5220 tgcgaagaat ttgctacaaa gtccgtctgc attgtcagta ctgcaaaaac gctggagctc 5280 gaccacagcc acccgttatg gctgatctac cacctgctcg tctagccgct tactcgagac 5340 cgttttccca taccgggatc gacttttttg gaccaatgca ggttattgtt gggcggcgaa 5400 tcgagaagag gtggggagtg ctaatcacct gtttggtggt acgggctatc catattgaaa 5460 tcgcccattc actcaacact agttcttgca tcatggcctt gcggaacttt atggcacgac 5520 gtggaacccc tgtggagcta ttcagtgata ggggaacaaa ctttgttggg gctaaccgtg 5580 agttaagcga agcgctccga gctctcgaca aggacaggct catggaggaa ttcgtgacgc 5640 caaacaccaa gtggactttc ctgcccccaa gcagcccgca tatgggagga agttgggaaa 5700 ggttggtgca atcggtgaag aagatccttg gcaacatgca ccttccaagg acacccaccg 5760 acgaagttct tcgtaacagc ttgatggagg tggaaagcat catcaactcg cgccctctca 5820 cctatgttcc catcgaagac gccgagtgtg aagcactaac gcccaaccac tttcttctgg 5880 gtacctctgg tggcacgaaa ccattagtgc cctacgacga tagccccgct acgctagtga 5940 acagttggaa aacttcgcag gtttttgcca acatgttctg gaaacgttgg cttcgtgagt 6000 atctgccaac gattacccgc cggactaaat ggcattatcc agctaagcca atccaaatcg 6060 gtgacatagt tataatcgtc gatccagacc tacccagaaa ctcgtggcca aaagggcgtg 6120 tcgtagacgt gaaccttaaa aatggacagg tacgtagtgc aacggtacga acaccctcga 6180 acgtttacga acgcccagca gtgagacttg cagtattgga cgtaggcgca ataggtagta 6240 cacatcaacc cggatcctgt gtaccggggg ggac 6274 // ID LIN11_SM repbase; DNA; INV; 3614 BP. XX AC . XX DT 28-MAR-2008 (Rel. 13.03, Created) DT 28-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LIN11_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3614 RA Tempel S., Bao W. and Jurka J.; RT "Non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 8(3), 345-345 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 311..3460 FT /product="LIN11_SM_1p" FT /translation="MPIYSKTSSWSFFFLNKKRRIALIIDPTADDSHTLHF FT ELATDILKTILNIQNIFGDLNFPLADIDYPICHEANLSALYVCHFIKCLMT FT XLPITIPDIAHMKETMKPIIQKYNCSKFPDNDARKYRVFIEDLIYQLDLDA FT ITCEEILCEVERINGRLNPKRYFKETKPKTDIIHLRIKKSAELLCVKRLKF FT QINQKNEIMKIWENDDIDHRPPMAKFLKTFANSECPSSNTANLILPYSTDT FT DTNQDTDCENMAXIMKNLDNTAPGMDLITIGDWKTISPKHGLITAICNNIL FT RNGICPKRWKLFRTVLILKPGKADESFKTNSWRPLAIMDTAYRIFTTLLNN FT RLLQWIKNGNLMSPNQKAIGIPDGCAEHNATLHLAIDQAKRCKTELHIVWL FT DIADAFGSLPHDLIWYTLSNMGLKKETISLIKELYKDVKTFFDCQGTLSEP FT VPITKGVKQGCPLSMTLFCLSIDYILKTILTNCPFFLHDLNISILAYADDI FT VLLSDSFQKIEKALEKTVELASFANLKFKPSKSGYLSINNVNSDIFKLYLY FT NEEIPTISDENKYKYLGVDFSYKRNQNIDGRLESALALTSSLFKSYLHPAQ FT KLNAYKTFIHSKLIFSLRNCVIGHRILDCDRKRVTQGREKQLGFDQKIKGL FT LKTMIGDKFQAINNFFPYTHCKLGGLGITSTFDEYLIQSITGITRLFHSSN FT LXFRNMLITELAHARGGKNFEAGLKWLNCEINKKFPNTSFFKKFQKSALAL FT KRKFCICVKLNFVDDNFSLEMSYKKRTSYINFQNLNTLSKELHDFVGLYYA FT EQWCQMHVQGHIATAIGDSITAKYLITSDILNDAQYSFLIRARNNLLNLNY FT NPYRLKYNIGTKCRLCHLDEETQAHVFNHCPAKPNARRIKHENVLISIVTF FT LEKIGFEIDVEKSPKYISNPTKLKPDMVIRSKRNKDIHVLDLKVPYDSGEG FT FEKAREGNYVKYKDLALEIGQAFNQKATISAIVIGCLGTWDKKNNPALSKI FT GLTKAEIQSLARIACPNAVIACYHIYREHVSYTKSAMALPFSLT" XX SQ Sequence 3614 BP; 1278 A; 720 C; 576 G; 1031 T; 9 other; tcattaatac tactaccaac ttcacctcta aactttgctg ggatactatc atcttgtgaa 60 actcaaaata tggatgaaga agttatcaaa ccaagagatc taccagaaaa tctaggcttc 120 aagaaacatc gaaaatgaac taagttgatg gtcgcagcac ttgatcaaag catttatctt 180 cacttatgct attaagactt ctacaatcta catcraccct attacttgca atgctttgat 240 ccagtgcaat tacaaaactt tctttgaaac tttccctttc aaagactttg ccaactggaa 300 tgagataatc atgccaattt acagtaaaac ctcttcttgg tctttcttct tcttaaacaa 360 gaaaaggcgt attgcgttga ttattgatcc aactgctgat gatagtcata ctctacactt 420 tgaattggca actgatatcc twaaaactat cctcaacatc cagaatatct ttggggactt 480 aaatttccct cttgctgaca ttgactaccc tatatgtcat gaagcaaatc tttctgcatt 540 atatgtatgc cactttataa aatgcttaat gactyactta ccaattacta ttccggatat 600 cgctcacatg aaagaaacaa tgaaaccaat cattcaaaaa tataattgct ctaaattccc 660 tgataatgat gctagaaaat atcgcgtctt tatcgaagat ctgatatatc aattggatct 720 agatgcaatt acatgtgaag agatattgtg cgaggtcgaa agaattaatg gaagactcaa 780 tcctaaaaga tacttcaaag aaaccaaacc aaaaacagac ataatacatc tacgtattaa 840 gaagtctgcg gaacttcttt gcgttaaaag attgaaattc caaatcaatc aaaaaaatga 900 aatcatgaag atttgggaaa atgatgatat agatcacaga cctccaatgg ccaaattctt 960 gaaaaccttt gcaaactcag aatgtccatc atcaaatact gccaacttaa tcctaccata 1020 cagtactgat actgatacga atcaagatac tgaytgtgaa aatatggcam acatcatgaa 1080 aaacttagac aacactgcac ctggaatgga ccttattaca attggagatt ggaaaactat 1140 ctccccgaaa catggactca taacagcaat atgcaataat atactacgaa atggtatatg 1200 cccaaaaaga tggaaactat ttagaacagt actaattcta aaaccaggaa aagcggatga 1260 aagcttcaaa actaactcat ggagacctct ggcaattatg gatactgcct atagaatctt 1320 tacaactttg ctgaataacc gtctccttca atggataaag aatggyaacc taatgagccc 1380 taaccaaaaa gcaattggta taccagatgg atgtgctgaa cataatgcta ctttgcacct 1440 tgcaattgat caagcaaaaa gatgtaaaac tgaactacat attgtttggc ttgatattgc 1500 tgatgcattc ggttcactac ctcatgacct catctggtat acactgtcta atatgggatt 1560 gaaaaaggaa acaatatctt taattaaaga actttataag gatgtaaaaa ccttctttga 1620 ttgtcaaggg accttatctg aacctgtccc cataacaaaa ggagttaaac aaggttgtcc 1680 actttcaatg accctttttt gcctctcaat tgactacatt cttaagacaa tacttactaa 1740 ttgtcccttc tttcttcatg acttaaacat cagtatcttg gcttatgctg atgacatagt 1800 ccttttatca gactcttttc aaaaaattga aaaagctttg gaaaaaactg tggagttggc 1860 atcctttgca aatcttaaat ttaaaccttc aaaatctgga tacttatcca tcaacaatgt 1920 taactcagat atctttaaac tttatcttta taatgaagaa ataccaacga tatcagatga 1980 aaacaaatac aaatatcttg gagttgactt ctcttacaaa cgaaatcaaa atattgatgg 2040 acgactggaa tctgcacttg cacttaccag ctctttgttc aaatcatact tacatccggc 2100 acaaaagctt aatgcctata aaaccttcat ccactccaag cttatcttct ccctacgaaa 2160 ttgcgtaatt ggtcatagga ttcttgactg tgaccgcaaa cgagttacac aaggtcgtga 2220 aaaacagctt ggctttgatc aaaaaatcaa aggacttctg aagaccatga ttggcgataa 2280 atttcaggca attaacaact tctttcctta tacacactgc aagcttggwg gacttggtat 2340 aacttcaact tttgatgaat atttgataca aagtatcact ggaataacaa gacttttcca 2400 ctcatccaac ctcarcttca gaaatatgct tataactgaa ctagctcatg ctagaggagg 2460 gaaaaacttt gaagctggac taaaatggct taactgtgaa attaacaaga aattcccaaa 2520 cacttctttc tttaaaaaat tccaaaaatc agcacttgct cttaaaagaa aattctgtat 2580 atgcgttaaa cttaactttg tagatgacaa cttttcactt gaaatgtcct acaaaaagcg 2640 cacttcttat ataaactttc aaaacttaaa cactctttct aaagaacttc acgacttcgt 2700 gggtctttac tatgcagagc aatggtgtca aatgcacgtg caaggacaca ttgcgactgc 2760 gattggggat agcattacgg ctaaatacct cattactagt gacatcctta atgatgcaca 2820 gtacagcttc ttgatacgtg ctagaaacaa ccttcttaat cttaactaca atccttatcg 2880 tcttaaatat aatattggca caaartgcag actgtgtcat ctagacgaag agactcaggc 2940 ccatgtgttc aatcactgcc ctgccaaacc taatgctaga agaattaagc atgaaaatgt 3000 cttaataagc atagttacct tcttagagaa aattggattt gagatagatg tggaaaaatc 3060 acccaaatat atctcaaatc caacaaagct gaaacctgac atggtaatta ggtctaaaag 3120 gaacaaagat atacatgttc tggacctaaa agtaccatat gactcaggag aaggctttga 3180 aaaagcgcga gaaggtaact atgtaaaata caaggatcta gccttagaaa ttggacaggc 3240 tttcaatcag aaagcaacta tatctgctat agtgattgga tgcctgggaa catgggataa 3300 gaagaacaac cccgcacttt caaagattgg attgactaag gctgagatcc aatctcttgc 3360 caggatagca tgcccaaatg cggtaatcgc atgctatcac atctaccgtg agcacgtctc 3420 ttatacaaag agtgccatgg ccctaccctt tagcctgaca tgaatgtatg ttatgcagca 3480 atgctggtaa ttacatcggc gttgcagatt tgtgtatgtg aataaaaaaa caatagaaac 3540 agatgctgag cccagctcgc atatttagcc gaaaggcagc gtatatcgat taaaaacaaa 3600 ttttgaaaaa aaaa 3614 // ID RTEX-6_BF repbase; DNA; INV; 4834 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-1_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; CR1-41_BF; KW RTEX-6_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4834 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4834 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1722-1722 (2009). XX DR [2] (Consensus) XX CC The 5' terminal portion is missing. The reconstructed RTEX-6_BF CC consensus sequence contains a C-terminal part of ORF1 and the CC complete ORF2. The 3' terminus is composed of the (CATT)n CC microsatellite. XX FH Key Location/Qualifiers FT CDS 851..4366 FT /product="RTEX-6_BF_2p" FT /note="AP endonuclease and RT domains." FT /translation="EADAEPVWEKINLVHWNVNGWTSLNCKLREITLQHLS FT PDIISINETHLKTDEVIELDDYTWYGHNRKTHRDAPKGSGGVGIFVKNNLR FT LQYKITIIDRDVDGMLGLHFEHTKSEYTFVVFSCYLSPEQSPWGRNATEYF FT SHLLCQMYALESVDAIYVCGDFNSRTGDRDDCITEVDGIPPRATIDTTFNN FT HGASLIEFLKDSKMCMLNGRVCQHNDNFTSVSTRGKAVVDYILTPHDCLDT FT CHDFNVITASDIVEECGYSCISLIGDRCRVPDHSVLSLTFYVNSQYVENVP FT SQSVKSRRKRRIVTENFLSSDISRRALLDLIDSLQVVQKTQQSIDLWYNSL FT CEVIYSDCDVNETGTNANTPRSRRLRSPGKPFWNAELDELWVQMRDAEREY FT VRCKGSKRCKAVLRDVFKCKQYAFDKNLRFYKRKYQRGQALELEGLKTNNP FT QRFWNEVNKLGPKRSTKIPMKVSDETGNLMEEVDKVLEKWEKDFSNLFAGN FT DGEGKFDDEFLQEICDLKLRLESEMNEVSYNPNEFLNAPITLDEIKAAVEN FT AKTGKAMGADEISNEMLKAPDMLNTLYMLVQTCFESGLIPSVWYKSIIKPI FT PKSSKKDPTVPLNYRGISLMSTVYKLYSCVLNNRLTRYLECSNLLEDEQNG FT FRRARACIDHIYTVCTVIRARMEQGKSTFSCFIDFQKAFDWVNRDLLAYRL FT IQNGVDGKFYKAIMTLYKSPVACVQVNEYQTQWFPTPFGVKQGDVLSPTLF FT AMYVNDLAQKVKESGLGVRLDDMTLSLLLYADDIIIVAENENELQEMLNIV FT AEWCSKWRLMINQGKTQIMHFRKKGARRTSYEFRFGETKLDLVTSYKYLGL FT HLDEHMSFNSASSVLADSAGRALGGILGKVKALKELGYHAYTQLYESCVCP FT ILDYAAGVWGFKKYDKCSVVQNRAIRCFLGVHKFAPVLAITGDMGWMPCEI FT RGKGAVISLWNRLINMSNDRLTKKVFMWDLQNHGPWSRDVQRILRSVEQEH FT IFLNNLRANTKQIKEKLFDMYQEDWASDIWLKPKLRTYRLFKDAYCTEHYV FT TMNLCKSQRSLCAQFRAGILPLYIETGRYQAIPEENRICTMCSLGEVESEA FT HFLFYCPFYDDLRESIFCKMFDKCPEVIWERDEFRMKWLFEEGVSEISNFI FT RKAWNKRRRQLYT" XX SQ Sequence 4834 BP; 1454 A; 889 C; 1155 G; 1334 T; 2 other; aaactattgt gttatttcag tgagttgaac gccaaatttg tgtatttgga gatatttgga 60 ggtcagcaga acaatgtctg acaaaggtac cggaaccaga acccgggata gtgccaggcg 120 agatgcggag attgaagccg ctagcgtcag gaaggaggga gacgccgcca ttatgaacga 180 gctgaaagcc ctggatagga agctcgacgg tatgcaggcg tcgttcgaga agaagctgaa 240 cagtaaggtt gacagcctac aaacgtctgt tgaaaagctg gtttctgcga ctaaggatga 300 gctgaaagcc gagctggaaa ggaaaacgaa agaaattgga gacaacatcg acctggagat 360 gggtcacatg agggcacgta tcgacgccat ggagaccaag atggcaggaa taagcaacag 420 ccgagccgag tttgatcctg acgagtcggt gattgtgtcg ggcttaccgt ttgaggaagg 480 ggaagatgtc aaggagaaga ttgaggagct gttccacaac ggtcttcagt ccgatgtgcc 540 tgttgttgct gccgagagaa tgaagtcgag aggccgtggt cccggcgtgg tgaaagtcag 600 gctccgctcc gtccaggaaa aggtggccat actgagggcg aaaccgaagc tgaaaagtga 660 cccctcttat gagaaagtgt tcctccgatc agcgaaaagc cacacggata ggctgataga 720 gctcaacttt aggactctcc tgcgggaaat tccgggcgga aaggattact tcatcactgg 780 cagcggacgg gtccagaagc ggggcgctcg tgacttacct gccgacaaca acctgccgac 840 acaaccgtga gaagcggacg ccgaacctgt gtgggagaag atcaaccttg tgcattggaa 900 tgttaatggc tggacaagtc tgaactgtaa gttaagggaa ataacactcc agcacttatc 960 accagatatc atttctataa atgaaacaca cttgaaaact gatgaggtaa tcgagttaga 1020 tgattacact tggtacggac ataacagaaa gactcatcgt gatgccccca aaggctcagg 1080 tggtgttggc attttcgtta aaaacaatct acgtcttcag tacaagatta caattattga 1140 tagagatgtt gatggaatgc taggactgca ctttgagcat acgaagtctg agtatacttt 1200 tgttgtattt tcatgttact tgtctcccga gcagtcgcca tggggacgga atgccaccga 1260 atatttttct cacctgttgt gccagatgta tgctttggaa tctgttgatg ctatttatgt 1320 ttgtggagac ttcaacagtc gcacaggtga tcgtgatgat tgcataacgg aggttgatgg 1380 cattccaccc agagcgacaa ttgacacaac atttaacaat catggggcta gtttaattga 1440 atttttgaaa gactccaaaa tgtgcatgct aaacggcaga gtttgccaac acaatgacaa 1500 cttcacatct gtctccacga gaggaaaagc agttgttgac tatatcctta caccccatga 1560 ttgccttgat acatgccatg actttaatgt gattacagcg agtgatattg tggaagaatg 1620 tggctattca tgtataagcc tgataggcga tcgatgtaga gtaccagatc actctgtgtt 1680 atctttaact ttttatgtaa atagccagta tgtggaaaat gtaccctcac aatcagtaaa 1740 gtcgcgtcgt aaacgtagaa tcgtgacgga aaactttctg agttctgaca taagccgtag 1800 agcattgtta gatctcatag acagtttaca agtggtgcaa aaaacacagc agagtattga 1860 cttgtggtat aactcacttt gtgaggttat ttacagtgat tgtgatgtaa atgagactgg 1920 tacgaatgca aatacacctc ggtctaggag attacgtagc ccgggcaaac ctttctggaa 1980 tgcagagctc gacgagctct gggttcaaat gagggatgca gagcgtgaat atgttagatg 2040 caagggaagc aaaagatgta aggccgttct tagggatgtt tttaagtgca agcaatatgc 2100 ttttgataag aacctccgtt tttataaacg aaagtatcaa cgaggacagg ctttggagct 2160 tgagggtcta aaaactaata atcctcaaag attctggaat gaagtaaata agctgggtcc 2220 aaagagaagc acgaaaatcc ctatgaaagt ctcagatgaa actggaaatc tgatggaaga 2280 ggtagataaa gttttggaaa aatgggaaaa ggatttcagc aatttatttg ctggcaatga 2340 tggtgagggt aagttcgatg atgaatttct gcaagagatt tgtgatctta aacttcgact 2400 agaaagtgag atgaatgagg tctcttataa ccctaatgag ttcctgaatg caccaatcac 2460 tcttgatgag attaaagcag ctgtggagaa tgctaagaca ggaaaagcaa tgggagcaga 2520 tgaaatctcg aatgagatgc taaaagcgcc agatatgctg aatactttat atatgcttgt 2580 acaaacctgt tttgaaagtg gcctcattcc ttctgtgtgg tacaagtcca ttattaaacc 2640 gattccgaag tcatcaaaga aagatcctac agtccctttg aattaccggg gaataagtct 2700 aatgagcact gtgtacaaac tttattcttg tgtactaaac aataggctaa cacggtactt 2760 agaatgttca aacctcctag aagacgaaca aaatggtttc cgtagggcca gagcgtgtat 2820 agaccacatc tacacagtgt gtacggtgat cagggcccgg atggagcagg ggaagtcaac 2880 cttctcctgc ttcattgact tccaaaaggc tttcgactgg gtaaacagag acctcttagc 2940 ctatagacta atacagaacg gtgttgatgg gaaattttat aaagctataa tgactcttta 3000 taagtcccca gtagcttgtg tacaagtgaa tgagtatcag actcagtggt ttcccacccc 3060 ctttggagtt aaacaagggg atgttctgtc accaacccta tttgctatgt atgtgaatga 3120 tcttgctcaa aaagtcaaag aatcggggct gggtgtcaga cttgatgata tgactttaag 3180 cttgttgttg tatgcagatg acatcatcat tgtagcagaa aatgaaaatg aattgcaaga 3240 aatgttaaac attgttgctg aatggtgctc taagtggaga ctgatgataa accaaggaaa 3300 aacacaaatc atgcacttta ggaaaaaggg agcaagaagg acttcttacg aattccgttt 3360 tggagagacg aagcttgatc ttgtgacttc atacaagtat ttaggactcc acttagacga 3420 gcacatgtcc tttaactcag cctcgagtgt tcttgccgac tctgctggaa gagccctagg 3480 cggtattctg ggtaaagtta aggctctgaa agaacttggt taccacgcct atacacaact 3540 ctacgagtcg tgtgtttgcc ctattcttga ctatgccgca ggtgtctggg gatttaagaa 3600 atacgataag tgttctgtgg tccagaatag ggcaatacgg tgtttcttag gtgtacacaa 3660 atttgcaccc gtccttgcca taacaggaga catgggttgg atgccatgtg aaataagagg 3720 gaaaggagca gtaatctcct tgtggaatag acttatcaat atgtctaatg acagacttac 3780 caaaaaagtc tttatgtggg atttgcaaaa ccatggccct tggagcaggg atgtacagag 3840 aatactacgt agtgttgagc aagaacacat tttcctaaac aacctaaggg ctaatacaaa 3900 gcaaataaaa gagaaattgt ttgatatgta ccaagaagat tgggcttctg acatatggtt 3960 aaagcccaaa ctgagaacct atagattatt taaggatgct tactgtacag aacattatgt 4020 tacaatgaac ctgtgtaaaa gccaaagatc cctctgtgcg cagttccgag cagggatatt 4080 accattgtac attgaaaccg ggcggtatca agcaatacct gaggaaaata gaatttgcac 4140 aatgtgtagc ctgggtgagg tagaaagcga ggcacacttt cttttctact gcccttttta 4200 tgacgactta cgagaaagta tcttttgcaa aatgttcgac aaatgtccag aggtcatctg 4260 ggaaagggat gaattcagaa tgaaatggtt gtttgaagag ggggtctccg aaatttctaa 4320 cttcatacgc aaagcttgga acaaaagaag aaggcaatta tacacatgaa tacacaacct 4380 gtccacttat gtacaactat tactgtacta catgtattac gttttcctta ttttctgtat 4440 aaactctgct ctgtttagct ctgcgaaacc cttgtttgat ggttttgatt tatgtgatta 4500 tgatttgata tctggtctgt tatccaattt atataatggt taccgattat gttcctgtct 4560 actamtatgt ttttccttat cttctgtata aactcagctc tgtttagatc tatgaaacct 4620 ttgtttgatg gttttgattt atgtgattat gatttgatat ctggtctgtt atccgatttg 4680 tttawtagca accgattatg ttcctgacgt tcttctatgt tataatatcc tatgttcaat 4740 tatgatttat attgtgtgtg tcttataagc ccacgtgggc tgggcgttct aacgcatgac 4800 acgttaataa aattcattca ttcattcatt catt 4834 // ID Copia10-NVi_I repbase; DNA; INV; 4595 BP. XX AC AAZX01001646; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia10-NVi; KW Copia10-NVi_I; Copia10-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4595 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1107-1107 (2007). XX DR Genome; AAZX01001646; Positions 9259 13853. XX CC Positions [1671-2201] - Integrase core CC 'AAAGT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(459..1286,1290..4487) FT /product="Copia10-NVi_I_1p" FT /translation="MEATTTKFESCRSMEDYVARIISATKKLATIGTKLPD FT DLIGALLLAGLPPAYKPMIMALSNSGAEITGDLIKNKLLEESQSVTDDFES FT RGLHVQAQAAQFSNRSHRGRGNYHRTRGSRQSAAPSSRNPATNVRCYNCNE FT FGHYANKCTEPKRNKNHACTMSAEGDSDTEETVCALFVPSSVTHTHTARLD FT TSAADSETAVLKADNALLAVGLLSTTSKQEWILDSGASTHMCCEGVHLTNI FT RAPKTTEVTAADKAKIRVQAEGDLHITNTGYVGSKILLQNVLHIPGLTANL FT VSVSAITRQGGAVTFKEKKCQVMDHTGSVVLEGNMCNNNVYKLNLQPNPGQ FT SSKVSTAVAFKAVQKHSITLWHRRMAHLNVNYLKQLKEVAVGVDFDDKLLH FT KCEICVAGKLTNKPFQATSTRASRVLQLVHSDVCQVEDLSLNKAKYFLTFL FT DDHSRKLFVYFLQKKDEVPEVTQKFVLFAENQTGQRLKILRTDNGREYVNE FT RLRTFLQSKGVKHETSIPYHPQQNGRAERVNRVLLEKVRCMLIEASLRNKF FT WAEATPTACYLSNRSPKKCLGGKTPEELWTGQKPDLTLLRVFGCQARAFIP FT SNLRKKIAPTSKQAIMIGYCENQKGYGLWSHEEQRIFTASNVEFFEDEQPK FT NVPANRIYLSIENHEPLYEHNQDEPAVRVEEPAKSESEEKVPEKEVLKGEV FT SEKESKLKRRLPISKSIREEEPGNKILKKDLPVNPDFNRRDVQNYAENEQV FT QIKKGGKSSVKPAASNLSRPLTRSVSNQLDEEASSDEEYKTEPVRKTQRRR FT KKPKHLDDFVTYSATIEEGEPKNYQEAISCSESDQWRRAMAEEYSSIVKNN FT TWQLCNRPSGESVVGSKWIFKKKRSADGSTRYKARLVARGFSQQKGVNYDE FT TYSPVVRFVSLRLLFAYAVRRDLDIYHLDVETAFLHGDMTDTVYLQQPEGF FT VAKQHENQVCLLKKAIYGLKQGSRNWNIKLDGALKKLNLRQSKLDSCVYSF FT SNSVKLIIVALFVDDLIVFTNSIDFLDILKQGLMNICVVKDLGPLKRCLGV FT NVHYDKTKGVMKLEQTDYINTLLRNFGMEECRSVSTPLETNINSRSGSPSS FT AKFNPAVVPYQNAVRSLLYLVQATRPDLAHAVGEISRHNLDYDESHWAMIK FT RVLRYVQGSKHLRLRYTRDGDDFISGYCDTSWAGEQGDRRSTSGYIFMVQG FT GVISWNSRRQSTVALSSIEAEYLSLSAAAQEALWLGTLASELRIRDDKPMP FT IYCDNKGAIDLSKNSRFSPRTKHIDVRHQFIKEYIEKGEIEVIFIPTTQQL FT ADTLTKPVGPAKLNQFINAASMRVAEKRED" XX SQ Sequence 4595 BP; 1482 A; 1017 C; 1076 G; 1020 T; 0 other; ggttatgggc ccaggaagca tctcaacgaa aactgcacaa gtattccgtt gccgaaaaga 60 cttgcgacca tgacacgcca tagccgttgg cgtccttgac ctgaaggtca cagctggagc 120 acagtggttg ttccttaccg aaaatggcga gccgaaatca aatttcatcc ttccagatac 180 tggataaact gtcatgaaga gaaaattatc gatcttgggc agtagctatg cgcgcatatt 240 tagagatcga agacttatgg gatacggttg aacaaccagc cggtggaaca atcagcacag 300 acccgaagaa actggccaag gcgcgagggc gcatgatatt agcagtggag cccgaaattt 360 acccgtaact ggagaatgcg gttactccga aacaagtgtg gggcgaactg gccaaaacct 420 acgacgacaa aggattggca cgaaaggtac accttctcat ggaagctaca acaacaaagt 480 ttgagagctg ccgctccatg gaggactatg tcgcccgcat catctccgcg acgaagaagc 540 tcgcaacaat tggaaccaag ttgccagacg acctaattgg agctcttcta ctcgcaggac 600 tcccacccgc ttataagcct atgataatgg cgttaagcaa ctcaggtgcg gaaattacag 660 gtgatctcat taaaaacaaa ttgctggaag aatcgcaatc cgtaacggat gattttgaat 720 cacgcggact tcacgtgcaa gcgcaagccg cgcagtttag taatagatcg cacagaggac 780 gtggcaatta ccacagaacg cgaggatcgc gtcagagcgc cgcccccagc tccagaaacc 840 ccgcaaccaa tgttagatgc tataactgca acgaattcgg acactacgca aataagtgta 900 cggagccgaa acgcaacaaa aaccacgcgt gtacaatgtc ggcggaaggc gatagtgaca 960 ccgaggagac tgtgtgtgcg cttttcgttc cgtcatcggt gacacatact cacacagctc 1020 gactcgacac ctctgcggcc gattccgaga cagcagttct gaaagcagac aacgcgttac 1080 ttgcagttgg tctactgagt accaccagca agcaagagtg gatcctggac tccggtgctt 1140 cgactcacat gtgctgtgaa ggagttcatc tgacgaacat acgtgcaccg aagaccacag 1200 aagtcacagc agccgacaaa gcaaaaatcc gggtacaagc tgaaggagat ctacacatta 1260 ctaatactgg atacgtaggc agcaaataga ttctacttca gaacgttctc catatccctg 1320 gtttaacagc gaatctagtc tcagtcagtg caatcactag acaaggtggt gcagtaacgt 1380 tcaaagaaaa aaaatgccaa gtcatggatc ataccggatc agttgtcctg gaaggtaata 1440 tgtgcaataa caacgtctac aaactgaatt tgcagcccaa cccaggtcaa tccagcaagg 1500 tcagcacagc agttgcattc aaggcagtgc agaagcacag tataactctg tggcatcgac 1560 gaatggctca cctcaacgtc aattacctga agcagctgaa agaggtagca gtcggagtgg 1620 attttgatga caaactactg cataaatgtg agatttgcgt ggcaggcaaa ctcacaaata 1680 agccattcca agccaccagc acaagagcat ccagagtact gcagttagtg cacagcgacg 1740 tgtgtcaggt ggaggatttg tctctgaaca aggccaaata ctttttaaca ttcctggatg 1800 accactcgag gaaactgttt gtttattttc tccagaagaa agatgaagtc ccagaggtaa 1860 cacaaaagtt tgttttattc gcagagaatc agacaggcca aagactcaag atcctcagga 1920 cagacaatgg acgagagtat gtaaatgaga ggctcagaac atttctccag agcaagggcg 1980 tcaaacatga aacgtcaata ccctatcacc cccagcagaa tggtcgtgca gaaagggtca 2040 acagagtgct gctggagaaa gtgaggtgta tgctgataga agcatcgctt cgaaacaaat 2100 tttgggcaga agccacacca acagcctgct atctctccaa tcgaagtccc aagaaatgtc 2160 ttggaggaaa aactcctgag gagctgtgga caggacaaaa accagatctc acactactga 2220 gagtttttgg atgtcaagcc agagcattca taccaagcaa tctcaggaaa aagattgcac 2280 ccacgtcgaa acaagccatt atgataggct attgtgagaa ccagaagggt tatggattgt 2340 ggagccacga agaacagaga atattcacag ctagcaacgt tgaattcttc gaggatgaac 2400 agcccaaaaa tgtacctgca aacaggatat acctatctat tgagaatcat gaacctctgt 2460 atgaacacaa tcaagatgaa ccagccgttc gcgtagaaga acctgcaaaa agcgaatcag 2520 aagaaaaagt acctgaaaag gaagtgctga aaggagaagt atctgaaaaa gaaagtaagc 2580 tgaaaagaag attaccaata agtaaaagca ttcgcgaaga agaacctgga aacaagatct 2640 taaaaaaaga cttaccagtg aacccagatt ttaacagaag agacgttcaa aactacgcag 2700 agaatgagca agtccagatc aagaaaggag gcaagtcatc tgtcaagcca gctgcttcta 2760 atctctctag acctctgaca agatcagttt caaatcaact ggatgaggag gccagctctg 2820 acgaagaata taagactgaa cctgtaagga agactcaacg cagaagaaag aagccaaaac 2880 atttagatga ctttgtcacc tactcagcaa cgattgaaga gggtgaaccg aaaaactacc 2940 aagaagctat ttcctgctcc gaatctgatc agtggaggcg agctatggca gaggagtatt 3000 cttctatagt gaaaaataac acctggcaat tgtgcaatcg gccttcgggt gagtccgtag 3060 ttggcagtaa atggatcttt aagaagaaga gatcagctga tggaagtacc cgctacaaag 3120 caagactagt tgccaggggt ttctcacagc agaaaggagt gaactacgat gaaacgtact 3180 cgccagtagt tcgtttcgtc tcattgagat tactctttgc gtatgcagtt agaagggact 3240 tggatatcta ccatttggat gtagaaactg cttttctgca tggtgacatg acagatacag 3300 tctatctcca acagcctgaa ggttttgtcg ccaaacaaca cgaaaaccaa gtctgccttc 3360 tcaagaaggc catttatggt ctaaagcaag gaagtaggaa ttggaacatc aaactcgatg 3420 gtgctctgaa aaaattaaac ctcagacaat ctaaactcga ttcgtgtgtt tatagctttt 3480 caaacagcgt taaactaata attgttgcct tattcgtaga cgatctcatc gtttttacca 3540 acagcatcga cttcttggat atccttaaac aaggattgat gaatatctgt gttgtgaagg 3600 atctgggacc actcaagaga tgcctgggtg taaatgtgca ttacgacaag accaaaggag 3660 tcatgaaact ggagcaaacc gactacatca acacacttct aagaaacttt ggcatggagg 3720 aatgcagaag tgtcagcaca cccttggaga ccaacatcaa ttctaggtca ggctcaccgt 3780 caagtgccaa gtttaatcca gctgttgtac cttaccagaa cgcagttaga tcgcttcttt 3840 atttggtaca agcgacaaga ccagacctcg cacatgcagt cggtgagatc agtcgacaca 3900 acctggatta cgacgagtca cattgggcca tgatcaagag agttctccgt tacgtacaag 3960 gaagcaaaca tctcagacta aggtatactc gtgatggaga tgattttatt tcaggctatt 4020 gtgataccag ctgggctgga gaacaggggg atcgacgatc aacttccggc tacattttta 4080 tggtacaagg tggtgtcatc tcctggaaca gcagaaggca atcaacagtg gccttgtcat 4140 ctatcgaagc tgaataccta tcattgtcag cagccgctca ggaggcgtta tggctcggca 4200 cactcgccag cgaactccga atcagagacg acaaaccgat gccgatttac tgcgacaaca 4260 agggagccat cgatctgagt aaaaactcac gttttagccc aaggaccaag cacatcgatg 4320 tccgtcatca gttcattaag gaatatatcg agaaagggga gatcgaggtg atttttatac 4380 caactacaca acaattggct gatactttga caaaaccagt tggacctgcg aagctaaacc 4440 aattcatcaa tgcagcgagt atgagagtcg ctgagaagag agaagattga agattttgtg 4500 aaagatttta ttttacctat ttatgttatt gtttttatat ctatgtatag tctcgcgaca 4560 tattgtaacc tttgaattgt ataattgagg tggtg 4595 // ID Gypsy16-LTR_Dpse repbase; DNA; INV; 611 BP. XX AC Unknown_singleton_14; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy16-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-611 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1102-1102 (2009). XX DR Genome; Unknown_singleton_14; Positions 8671 8061. XX SQ Sequence 611 BP; 209 A; 109 C; 128 G; 165 T; 0 other; tgacggatcg aagaatggta tgaggatgga agattttata tatcgaatac gctctttaac 60 tgaacagaac ttaagtaatg attatcaagt actttgtgat cacctgtatc tcttgttcgc 120 taataaggcc agtgattggt tttggaaatt ccaccgtaac cataccaatt tttcttggga 180 aaccttttgt tttgaatttc gcaaacgatt tgaggacgct gaaactgacc ttgacatatg 240 ggaaaagata aggaaacgca gacaaggcga aaatgaagtt tttgatgatt atcagtgcgc 300 aatagaggat ttagcagata gacttcaaca aggcatgtcc gaaagtcaat tagtcgagtt 360 gctgattagg aatggtaaac caagtttgca ccatgagcta cttcatttaa gaatatcaaa 420 tacatttaag aatatcacaa ttacgtgtcg aagtgcgaaa acatgaacaa ttctacaatg 480 acataaaagc ttgtaagcca cgaacgttgc gttcctatat taacgagcta actgacgcgt 540 acgagaaaca aactcaggta gaatcgaatg cgccagatcc agaagtcctc gcgattcaac 600 ggaagccatc a 611 // ID P-31_HM repbase; DNA; INV; 3388 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-31_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3388 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(12), 2084-2084 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(210..749,739..1800,1788..2981) FT /product="P-31_HM_1p" FT /translation="MVNSCAAYGCSNRYIKGGKKSFHKFPFQNSELCKKWI FT IALKRENFLPSKHTCICSDHFLESDYNYCIPDKKNLILDHYHKPILKCNAV FT PSVFVFSTNIQKTKRKLPSIRTSVNSLSKFDSFSKKTKFDVCPAIDKPSKE FT INSTHNDQPESTIDQNDHTSQSPTQSIKSLSPAKEKLKKKSKKKVKVLQQK FT LRRKEKKICSLQDMVSLLKSKNYLTTDAGTVLSENFSGLSYEIIKNQFSNQ FT NIKPKGHRYNDEVKKFALTLHFYSPRAYNFLRPMLCLPAASSISHWTSSVN FT CDPGLFLDVFSYLGDKQKTDINFKHCALIIDAMAIKNSIIYDKSSGHYIGF FT CDFGKDISVCEPDTPATEALIFMLVGLRGHWKTPIGYVLSNKVKASDLTCF FT IKNILDVALTYELCIKSITFDGTSTNLNSVVPFGCKFGNVIDNIVSTFLYN FT NQLLHVFLDPCHMLKLARNSLNDLGVFIDDEGCYIKWSYIKALFELQELEG FT LKFANKLSRKHIEFHRHKMNVKIAGQTLSSSVADATEFCNGILIEFLMVSQ FT HPSFLGASSTIRFIRVIDKLFDMLNSRNPNGKHFKKPLYLNNQLYWKDFLN FT TTITYLSKLTDINGIPLLLHRRKTFVLGLIVDAKSTLKLSFDLLTLHEKPF FT KYVLTYKYSQDHLELLFACIRGKNGYNNNPDVVQLKSSLKRILLRNSIVGS FT SHANCLMFEPYSNGSIFSLSWKKRLCSVDNFDQEDIDFEEVFLYCDELQMS FT SYKEAILGYISGFIVRNLINTISCNVCAQSLTDILSYEHSYAKSHSSLNNV FT KNRGGLFLPSRDVVFVVEMCEKVFNFFISGKDFKNPKITSDKNIKHKMIHC FT VNKLIIKKKVFKGLDSHDIELWTENEDLHSSQLVRKICEYYFRIRMFRYAQ FT DYTTKVLRKSSIGLRQQTNKLILFKGL*" XX SQ Sequence 3388 BP; 1184 A; 507 C; 509 G; 1187 T; 1 other; caatgacgta aataaataac tagatgtcta acttcgggat ttttataaaa taaaagtgaa 60 gccgatttat ttgtttacta ttttaatcag ccattattat atcgggcaga aaacagtaaa 120 tatcatatat aatatagaga gtaaattcgc gacattattt cttttacctt gctgaagtat 180 ttatttaatt caactatttt catcacaaaa tggtaaattc ttgtgctgca tatggatgtt 240 ctaatagata tataaaaggt ggcaagaaat cctttcataa atttcctttt caaaattctg 300 aattatgtaa aaaatggatt attgctttga aacgtgaaaa ttttcttcca tccaaacaca 360 cctgtatctg cagcgaccat tttttagaat cagactacaa ttactgtatc ccagacaaaa 420 aaaatctaat tcttgatcat tatcataagc ctattttaaa atgtaatgct gtgccatcag 480 tttttgtatt ttctacaaat attcaaaaga caaaaaggaa actcccatca atacgaacat 540 cagtaaattc attatctaaa tttgattctt tttcgaaaaa aacaaagttt gatgtttgcc 600 ctgctataga caaaccaagc aaagaaatta attcaacaca taatgatcag ccagaaagca 660 ctattgatca aaatgaccac actagtcaat ctccaacaca atctataaaa tctttatcac 720 cagcaaaaga aaaattaaaa aaaaaaagtt aaggttttac aacaaaaatt aagaagaaaa 780 gaaaagaaaa tctgttcttt acaagacatg gtttcccttt taaaatctaa aaattatttg 840 actactgatg ctggaactgt tttaagcgaa aatttttctg gactctcata tgaaatcatt 900 aagaaccaat tttccaatca aaatattaaa ccaaaaggtc acagatacaa cgatgaagta 960 aaaaaattcg ctttaactct tcatttttat tctccacgtg cgtataactt tcttcggcca 1020 atgttatgtc ttcctgctgc cagttcaata tcacattgga catcatctgt aaattgtgat 1080 cccggtttat tccttgatgt tttttcatat cttggtgata aacagaaaac agatattaat 1140 tttaagcatt gtgctcttat tattgatgct atggcaatca aaaacagtat aatttatgac 1200 aaaagttcag gtcactatat tggtttttgt gattttggta aagatatttc tgtatgtgaa 1260 cctgatactc cagctacaga ggcattaatt ttcatgttag ttggtttgcg aggacattgg 1320 aaaacaccca ttggttatgt tttatcaaat aaagttaaag caagtgattt aacatgtttt 1380 atcaaaaata ttttagatgt ggctttgaca tatgaacttt gtatcaaaag tattacattt 1440 gatggtacta gtacaaattt aaattccgtt gtcccatttg gatgtaagtt tggcaatgtc 1500 atagacaata tagttagtac ttttttatat aacaaccaac tgctccacgt tttcctagac 1560 ccatgccata tgcttaaatt agctagaaat tctctaaatg atttgggtgt ttttattgat 1620 gatgagggct gctatattaa atggagttat attaaagctt tgtttgagtt acaagagctt 1680 gaaggtttaa aatttgcaaa caaactttca agaaaacata ttgaatttca tcgccacaaa 1740 atgaatgtca aaatagctgg ccaaacactt agcagctctg ttgctgatgc aacggaattt 1800 taattgaatt tttaatggta tcgcagcatc catctttctt aggagcaagt agtacaatcc 1860 gttttataag agttattgat aaattatttg acatgctcaa ttccagaaat ccaaatggta 1920 aacactttaa gaaacctcta tatctaaata atcagcttta ttggaaagat tttttaaaca 1980 ctacgattac ttatctgagc aagttgactg atattaatgg cattccattg cttttacatc 2040 gaagaaagac atttgtttta ggtttaattg tagatgctaa aagcactcta aaattatcct 2100 tcgatctgtt aacacttcat gagaaaccat ttaagtatgt tctaacttac aaatattccc 2160 aagatcatct tgaactacta tttgcttgca ttcgtggaaa aaatggttac aacaacaatc 2220 ctgatgtagt gcagttaaaa tcatcgttaa aacggattct tctgcgaaac tccatwgttg 2280 gatcgagtca tgctaactgc ctaatgtttg agccatattc taatggctct attttttctc 2340 ttagttggaa aaaaaggttg tgttctgttg ataattttga ccaagaggat attgactttg 2400 aagaagtgtt tttgtactgc gatgaacttc aaatgtctag ctacaaagaa gcaatattgg 2460 gatatatctc tggttttata gtcagaaact tgataaatac tatttcatgt aatgtttgtg 2520 ctcaatcatt aactgatatt ttatcatacg aacattccta tgcgaaatct cattcttcgt 2580 taaataatgt gaaaaatcgt ggtggtttgt ttttgccatc aagggatgtt gtttttgttg 2640 tagagatgtg tgagaaagta tttaacttct ttattagtgg taaagatttt aaaaatccaa 2700 aaataacctc agataaaaat ataaagcata agatgatcca ttgtgttaac aaattaatta 2760 ttaaaaaaaa agtctttaaa ggacttgata gtcacgacat agaattgtgg actgaaaatg 2820 aagatttgca ttccagccag cttgttagaa agatatgtga atattacttt agaattcgaa 2880 tgttcagata tgcccaggac tataccacta aagttctaag aaagtcgtca attggtttaa 2940 gacaacaaac aaataaatta atattgttca agggtttatg aatatttcat tttattgtat 3000 aaaactttta ttaaaataat aaaatatatt atacgatgtt ttagtttgat gttctaaatg 3060 caaatatgtt tttaaatata tatttttgtg gctataaaaa tagccgtttg ttttagtttg 3120 ttataacaaa cttaacatgc taacgttagc atgtccaatc tttgaccaac aaatatgtat 3180 gtattgactg tatgctcact aaattgatcg tttgtgattc taatcatctt taattccaag 3240 aataaggacg ttaaaattgc caatatcttt ttatatgaat gtgatcgcta aaagcttttt 3300 gcccgatata ataatggctt ccaacgcatt aaattggctt cactaaaaac ggcgaagtta 3360 gatatctagt tatttattta catcattg 3388 // ID Gypsy-17_RP-LTR repbase; DNA; INV; 132 BP. XX AC ACPB02041249; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_RP_; KW Gypsy-17_RP-I; Gypsy-17_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-132 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02041249; Positions 1141 1010. XX SQ Sequence 132 BP; 29 A; 12 C; 28 G; 63 T; 0 other; tggcgtctta tgtgtcttat ggcgattgct tgcgatcaat gttatgttta tgtttgttgt 60 aaaagataaa ataaatattg tgttttggtt tttgttattt tgttgttgtt gagctaccat 120 tttagtacta ca 132 // ID Gypsy-76_AA-LTR repbase; DNA; INV; 165 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-76_AA_; KW Gypsy-76_AA-I; Gypsy-76_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-165 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 2939067 2939231. XX SQ Sequence 165 BP; 52 A; 30 C; 37 G; 46 T; 0 other; tgtagtatat taccactacc ttaaccttaa taaataaaga ttatctctaa tgtatattac 60 ctttcattct gagaactttg agctacacac tggggacggt gggaatggga ctgggaaggg 120 gacattcata cgcacggtag caagacgagt ggacgttata ctaca 165 // ID I-79_AAe repbase; DNA; INV; 5994 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-79_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5994 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1350-1350 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 27..1337 FT /product="I-79_AAe_1p" FT /translation="MAGEPPAVLAGAPGDPAGSQRGGRVPGWMLNKDDLGQ FT PMVLVLRCKVPDGVTDDPQLPEPFVIGLSVESAIGIKEARSVKATREGRGS FT RYLLRTTSRSIVNKLTQLTELSDGTEIEIFPHPTLNTVQGIVYEPDSINTD FT EQVIHKHLISQEVQAVRRIKKRVNGKLQNTPLLILAFHGTTLPDHVFFGLL FT RIPVRTYYPSPLLCYNCGSYGHPRKACQKPGICLRCSQPLHLAEGEQCNNT FT PHCYHCEKEHPISSRDCAKFKEEDKIIHIKVDQNISFGEARRLFNEENRRE FT TIARTIQNQLKQELAVKDKLIATLQKQVADLTKKLAALTPTPRESLRPSSQ FT PSPFMDNQTTSSNPVPNIPHKTTHQSRKDKTFVSPPAKRKDNRDIGTDVGA FT RTRSRSGKRIFEISPTSSSANRGKRVQNLPGTSSSATTVDNGS" FT CDS 1198..5892 FT /product="I-79_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MWAPALGAEVANVYLKSLQQVPVPIVESVYRISQVRL FT AVQPQWIMARKTPQNSSPNLNIDLASKTDTITVERRLSTDARLDPCKYTPG FT PDHGQQFCTTLDYTNQDDKLFVAEESRVSSATPTPPKEKQKNETIQVSNPL FT SADLQAPMSINWKLLALPCAVVSDSVAYDHYNNLKPVPSSSSTNPDSPSAR FT LTTSTLTRDVPDISDEPLAAVDVVNFLSKASIPASLSLTVNQYTPSNSPIL FT IGLKSTTTPESNNINTNNNSKPSTSSPVRGRPRRGEGERVVTSPLPNLGVD FT VWGDLNEACHERLAFSPSSWNPVESSNTTFGPDSNRYQQYPRPPHSNSRPS FT TPSSASDVAERFPPSSFAMQWNICGLRSHRSELQILVTKHQPVVICLQETN FT VDGHKLPNKFLGNGYEFLFSKCSTHGRQGAGMAIKTGTPFQRILLKTNIQA FT VAIELLAPLKITVVSIYLPPKDKEAVLLLKNLLEELPKPFLLLGDVNAHHA FT AWGSRTGTGATEASRRGEKILELVVENDMIVLNNGSHTRIDPVTGNSQALD FT ISMCSTSMAGKFNWKILTDSSDSDHLPILINTLDVSNEPLTRAKWIYTKAD FT WKCFEELTSNTLRPGHLLTVEEFSDYIISAAAASIPKTSGRRGTKSVAWWN FT EDVKSAVKQRRKRLRALRRLSDEDPQKLDALRQFQIARSKARKIVKEAKQK FT CWEDFVESINPNSPASEVWSKINRLQGKKSTTTITLNLPTGLTNNGRDVSE FT ALADEYESKSSNATYPDKFRRQHKSDWNAFRQTQRPHLYKKYNQDFTLEEL FT LWALERKAGSSIGADNIGYPMLQRLPISSKVAMLELFNRIWDNGTFPDQWK FT LGTIMPIPKPDSDRSRADGYRPIALLSCMGKVFERMVNRRLITELETNQRL FT DPRQHAFRSGKGVDSHLAALESLLSFENDEHVEIVTLDISKAYDTTWKPGI FT LHTLKEWKISGRMMNMLSSFLANRRFQVFANGSTSSLRKADNGVPQGSILS FT VTLFLVAMQPIFKKIPSSAEILLYADDVLLVVKGNNRRLVRQSMRKTVGIV FT TEWAASVGFSIAPAKSKLMHCCRQRHRKRGRAININSIPIPQVRKIKILGI FT MVDSKLNFKQHLATIKKSCSSRINILRILGSRLKRSNRSTLLSAGSALIFS FT KLFYGLGLTSTNTEDMEQMLGPTYNEVVRLSSGAFVSSPTTSIMAEAGCLP FT FRLALTLRLAKLAVRLLEKDLLSSKYPVVSRAKDIFQQTTGYSLPNICITL FT RNTDREWYTLPPRIDNFFRNSVKAGTNSRIVIPMFQDFTTNRYRHYVKSFT FT DGSKDGSFTGVGVVMDEEEENYSLPDECSIFSAEAHALLTAVSKVNGNHST FT IIFTDSASCIDALRGGRSKHPWIQSVERLSKDRNITFCWIPGHCGITGNER FT ADSLAKSARNKHKLNTPLPSQDVKMALKRSVWSTWESEWRQVQSQLRLIKC FT SPVKYPDRNNTSEQRTLTRLRIGHTRLTHSHLIDKTLPPICRFCGTQITIP FT HILIDCRGYSQQRVRCEITGSLSEILAHDAKKEANILQFLKDCNIYNQI" XX SQ Sequence 5994 BP; 1821 A; 1581 C; 1263 G; 1329 T; 0 other; aattagtgct tctgacgctc ggcctaatgg ccggcgaacc ccccgccgta ttggcggggg 60 cccccggtga cccagctggg tctcagcgcg gtggaagagt gccgggatgg atgcttaata 120 aggatgattt gggtcaaccc atggttttag ttctgcgatg caaagtgccc gacggtgtaa 180 cggacgatcc tcaacttcct gaaccgtttg tcatcggcct atctgtagaa tcggcaatcg 240 gtattaaaga agccagaagc gtaaaagcta cccgcgaagg tcgtggttca cgatacctac 300 ttcgcactac ctccagaagt atcgttaata agctaacgca gttgactgag ctctccgatg 360 gtacggagat agaaatattc ccacacccaa ccctaaacac ggtgcagggc atcgtatatg 420 agccggactc aatcaatacg gacgaacaag tgatccacaa gcatctgatc tcccaagaag 480 tacaagcagt ccgtcgcatc aaaaagcgag taaatggcaa gctccagaac accccgctgc 540 tgatcctagc ttttcacggt acaacgcttc cagatcatgt cttctttggg ttgttgcgta 600 ttccagtgcg gacatactac ccatctccgc tactgtgcta caactgtgga tcatacggcc 660 acccccgaaa agcctgtcaa aagcccggca tttgcctacg ttgctcacaa ccgctccatt 720 tagcagaagg agaacaatgc aacaacaccc cacactgtta ccattgcgaa aaagagcacc 780 ccattagttc acgggactgc gctaaattca aagaagaaga caaaatcatc cacataaaag 840 tggatcaaaa catctcgttt ggtgaagcta ggcgcctctt caacgaagaa aataggagag 900 aaactattgc ccgcacgatc caaaaccagc tcaaacaaga actagcagta aaggacaaat 960 taatagcgac actccaaaaa caagtagccg acttgactaa aaagcttgct gccctcacac 1020 caactcctcg agaatcactg cggccaagtt ctcaacccag cccattcatg gacaaccaga 1080 ccacttcatc aaacccagtc cccaacatcc cccacaagac aacacatcaa tcgagaaaag 1140 ataagacatt tgtctcacca ccagctaaaa ggaaagacaa ccgcgacatc gggaccgatg 1200 tgggcgcccg cactcggagc cgaagtggca aacgtatatt tgaaatctct ccaacaagtt 1260 ccagtgccaa tcgtggaaag cgtgtacaga atctcccagg tacgtctagc agtgcaacca 1320 cagtggataa tggctcgtaa aacccctcaa aactcctcac cgaatctcaa tattgacctg 1380 gcatcgaaaa cggacacgat aacggtagaa cgacgactat ccaccgacgc aagattagac 1440 ccctgcaaat atacccctgg acccgatcat ggacaacaat tctgcaccac gctagactat 1500 accaaccaag atgacaagct ctttgttgct gaggaaagta gagtgtcctc agcaacccct 1560 acacctccga aagagaagca gaaaaatgag acgattcagg tatccaaccc tctatccgcg 1620 gacctccaag ctccaatgtc catcaattgg aaacttctag cgctaccatg cgctgtcgta 1680 tctgattctg ttgcctacga ccactacaac aacctgaaac ctgtgccatc gtcaagttct 1740 acaaaccctg attcgccaag tgcgaggctc accacgtcga ccctgaccag ggatgttccg 1800 gatatttccg atgagcctct ggcggcagtc gacgtggtca attttctctc taaggcaagt 1860 attcctgcat cactttccct aactgtgaac cagtatactc cctcgaactc tcccatcctt 1920 ataggactta aatcaactac cacgccagag agtaacaaca tcaacacaaa caacaacagc 1980 aaaccaagta cctccagccc agttcgcggg cgtcctcgac gaggggaggg agagcgtgta 2040 gtaacctccc ccctcccaaa cttgggagtg gatgtctggg gggacttgaa tgaggcctgt 2100 cacgagcgcc ttgcattctc gccttcctcc tggaatccag tcgagtcgag caacaccaca 2160 ttcggccctg atagcaacag ataccaacag tatcccaggc caccgcattc caactctaga 2220 ccctccaccc cttcatcggc ttcggacgtc gctgaacgat tccctccgtc ttcattcgct 2280 atgcagtgga acatatgcgg tcttcggtcg caccgtagcg aattgcagat tttagtcacg 2340 aaacaccaac cggttgtgat ttgtctgcaa gaaacaaacg tagatgggca caaacttccg 2400 aacaaattcc ttggaaacgg atacgagttt ttgttcagca aatgctcaac gcatgggaga 2460 caaggtgctg gcatggcaat taagaccgga acacctttcc aacggatctt gcttaaaacc 2520 aatattcaag ctgtcgccat cgaacttttg gcgcccctga aaattaccgt cgtctcgatt 2580 tacttgccac cgaaggacaa ggaagcagta ctcctactga aaaatctgtt agaagaactt 2640 cccaaaccct tcctactatt gggtgacgtg aacgcacatc acgccgcttg gggaagtcgt 2700 accggtacag gagctacaga agcaagtcga agaggtgaaa aaatcctcga gctcgtggtc 2760 gaaaatgaca tgattgtatt aaacaacggt tcccatactc gaatagatcc agtaactggt 2820 aactcgcaag cgctcgacat ctctatgtgt tccacatcta tggcaggaaa gttcaactgg 2880 aaaattctga cggattcctc ggacagcgac catctaccga tcctaatcaa cacgctcgac 2940 gtctcaaacg aaccactaac ccgggcaaaa tggatttata ccaaagccga ttggaaatgc 3000 ttcgaagaac tcacaagcaa cactctacgc ccaggacact tgctgacagt ggaggaattt 3060 tcagactata tcatttcggc ggcggcagca tccattccaa aaacttccgg aagaagggga 3120 acgaaatctg tcgcatggtg gaacgaagac gtcaaatcag cagtgaagca aagacggaag 3180 cggttacgtg ctctccggcg gttaagtgac gaagatcccc aaaagctcga tgcacttcga 3240 caatttcaaa ttgcccggtc taaagctcga aaaatcgtca aagaagcaaa gcaaaaatgc 3300 tgggaagatt tcgtcgaaag cattaatcct aatagtccag ccagcgaagt gtggagtaag 3360 attaacagac tccaaggcaa aaaatccacc accacaatca cacttaatct accaactggc 3420 ctcactaaca atggtaggga cgtatcggaa gctttagccg atgaatatga aagtaaatcc 3480 tccaatgcta cttaccctga taagtttaga cggcaacata aatctgactg gaatgcgttc 3540 aggcaaactc agcgaccgca tctctataaa aaatacaacc aagactttac tttggaggaa 3600 ctcttatggg ccttagagcg caaagcaggt tcatcaatag gcgcagacaa cattggatac 3660 ccaatgctgc aacggctacc aatatcatcg aaagttgcaa tgctggagct ttttaatcgt 3720 atctgggaca acggtacatt ccccgatcag tggaagctag gaacaataat gccaattccc 3780 aaacctgatt cggaccgtag cagagcggat ggctatagac caatagctct tctgagctgc 3840 atgggtaaag ttttcgagag aatggtcaac cgtcggctta tcactgagct cgaaacaaac 3900 caaagactgg acccacgtca acacgccttc cggtcgggaa aaggtgtgga ttcccacttg 3960 gctgcactcg aatctttgct cagctttgaa aacgacgagc atgttgagat tgtgactctg 4020 gacatatcca aggcctacga caccacctgg aaacctggca tcctccacac actgaaagaa 4080 tggaaaatct ctggtcgaat gatgaacatg ctatccagtt tccttgcaaa cagacgtttc 4140 caggtgttcg ccaatggatc tacatcaagc ctcagaaaag cggacaatgg cgtgccacaa 4200 ggatctattt tgtcagttac gctgttcttg gtagccatgc agcctatttt caaaaaaatt 4260 ccgtcgagtg ctgaaattct tttatacgct gatgatgttc ttctcgtagt gaagggaaac 4320 aaccgtcgat tagttcgcca gagcatgagg aaaaccgtcg gaatcgtaac ggaatgggca 4380 gccagcgtag gtttctccat tgctccagct aaatccaagc ttatgcactg ttgtcgtcaa 4440 cgccaccgta aacgaggtcg agcgatcaat atcaacagca ttcccatccc acaagtgcgg 4500 aaaattaaaa tcctgggaat tatggttgac tcgaaactta atttcaaaca gcacctagca 4560 actatcaaga aaagctgtag cagtagaatc aatatcctcc gaattctagg gtctcggttg 4620 aagagaagta acagatcaac cctactgagc gcaggttcag cactgatctt ttcaaaatta 4680 ttttacggcc tagggctaac aagtactaac accgaggata tggagcagat gctagggcca 4740 acatacaacg aagttgtgcg cctatcttct ggagcatttg tatctagccc cactacatct 4800 attatggctg aagccggctg cctacctttc cgccttgcac taactctacg actagcgaaa 4860 ttagctgtac gacttctgga gaaagatctg ttatcttcca aatacccagt ggtatcaagg 4920 gcaaaggaca tcttccagca aacaacaggc tactccctac cgaacatttg cattactctc 4980 agaaacaccg accgagaatg gtacaccctt cctccacgca ttgataactt ttttagaaat 5040 agcgttaaag ctggcacgaa tagtaggatc gtgataccaa tgttccagga cttcacaaca 5100 aatcgttacc ggcactacgt gaaatccttt actgacggct cgaaagatgg ctcctttact 5160 ggggttgggg ttgtcatgga tgaagaagaa gaaaactatt ccctaccaga cgaatgcagc 5220 atattctcag cagaggccca tgctctgctt actgctgtgt ccaaagtaaa tggaaatcac 5280 agcaccatca tcttcactga ctccgccagc tgcatagatg ctctacgagg tgggcgatcc 5340 aaacacccat ggattcaatc agtagaaaga ctatctaagg acaggaacat caccttctgt 5400 tggatcccag gtcactgtgg tattacaggg aacgaacgag cagacagcct agctaaatca 5460 gcgcgaaaca agcacaagct gaacactcct ctaccctctc aagacgtaaa aatggctcta 5520 aagcgaagtg tttggtccac atgggaatct gaatggcgtc aagtccagtc tcaacttcga 5580 ctgatcaagt gttctccagt taaatatcca gaccgtaaca atacttccga acaacggact 5640 ctgacaaggc ttcgcatagg gcacactcgt ctcacccatt cacatctcat cgacaaaacc 5700 ttaccgccta tatgtcgatt ttgtggaacc caaataacaa tcccacacat cctgatcgac 5760 tgcagaggat attctcaaca acgggtcaga tgcgaaatca ccggatcact ttctgaaata 5820 ctagcgcacg atgcaaaaaa agaagcaaac attttgcaat ttctcaaaga ttgtaatatc 5880 tataatcaaa tctagtttta aacgatgtta tgtaattgaa gaatgaatac gactatttcg 5940 acacgaatgc cacacgtatg gtaaagtgtc ctaaataaat attaataata ataa 5994 // ID DNA3-1_AP repbase; DNA; INV; 126 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-1_AP. XX NM DNA3-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-126 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1942-1942 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 126 BP; 32 A; 28 C; 30 G; 36 T; 0 other; gggccagttg caccgtctct gattaaagtt aaccggagtt taatcggccg aatttgcccg 60 gttaaatatc tgttatcagc tgattaagct taatcgaagt ttaaccgtag acggtgcaac 120 tggccc 126 // ID SMAR21 repbase; DNA; INV; 2414 BP. XX AC . XX DT 05-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR21. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2414 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1079-1079 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 377..1966 FT /product="SMAR21_1p" FT /translation="MSRNPKSKERNELSLDKKIAVLNHLKEKCSSERKTAE FT LFKISKTAVNRIKKQEFEIRKRFEHENSNLCRKRKKNCFDEINNAVLIWFQ FT KMRSINARISGPMIQEVALAFAERFGITTFQASNGWLARFKNREGISQKIL FT DGESNEVSTETVTSFIERFHLLTEGFEEKNIFNADECGMMYRALPNKSLVQ FT KGDQCKNGKQSKERITVLLAASNTGEKLKPLVIGKYENPRCFKNIKKGNLP FT ITWRSSKKAWMNSSIYLQWLNDLNNEMKKANRKILLFVDNCPAHPPIDELS FT NLTVKFLPPNTTSKLQPLDQGIIKAFKGYYRKNLMKSIIIDNIEKDIKEFT FT LSINILSAIKWIDLAWKNVSTKTICNCFKKAGFTKFNNGFLAVNENVINDL FT IDDDFDNINFNNVISSDHEKWNEYLSFDDDLATCEEICEENLIDDVMEELN FT EKNGVSDEPILDANCETFIQEQEINFNEAKVYSRKLANFFLREIPDSVSKL FT YEIEEHLNNYGLTSMIKKNNQTKIEMYFKTSEVI" XX SQ Sequence 2414 BP; 891 A; 339 C; 397 G; 787 T; 0 other; cagtaaaacc cccacaaagg accacctctt caagaggggc aactctttaa acggggcatt 60 tttaggtagt cccggcaaat ttcccagtga aataatggtt tcagtgctta ataaaaggac 120 cacctcttca ttcgtggcag ggaccactat tttgtggtcc ctcgcgtcta ttttctatca 180 taaattgtct tgatcatgga ccacatcttt gaattaaatg actaaagtaa acctttattt 240 gatatgaaac acgtatgaaa ttatttaaat attatcattc atatttaaat aggcatactt 300 tattatatcg aatagagtaa tttcgatcaa tatataaata aaaattatcc ctagactgtc 360 aataaattta ttgctgatga gtagaaatcc caaatcaaag gagagaaacg agttatctct 420 cgataaaaaa attgcagttc ttaaccattt aaaagaaaag tgctcgagtg aacgaaagac 480 agcagagtta ttcaaaatta gtaaaacagc tgttaatcga ataaaaaagc aagaatttga 540 aatacgtaaa agatttgaac atgaaaattc aaatctttgc agaaaacgta aaaaaaattg 600 ttttgatgaa ataaacaacg cggtcttaat atggtttcaa aaaatgagat caattaacgc 660 cagaatatct ggaccaatga tccaggaagt tgcacttgcg tttgccgaaa gatttggcat 720 tacgacattt caagcttcaa acggttggtt ggctagattt aaaaatcgcg aaggaatttc 780 tcagaaaata cttgatggtg agtcaaatga agtatctact gaaacagtca ctagttttat 840 tgaacgattt cacttattaa ctgaaggatt tgaggagaaa aatattttca atgccgatga 900 gtgcggtatg atgtatcgtg cactgcctaa taaaagtttg gttcagaagg gagatcagtg 960 taaaaacgga aagcagtcaa aagaaaggat aaccgtttta cttgctgcta gtaatacagg 1020 tgaaaagcta aaaccactgg ttattggaaa atatgaaaat ccaagatgtt ttaaaaacat 1080 taaaaagggc aatttaccga ttacttggcg ttcaagtaaa aaagcctgga tgaattcttc 1140 aatttattta caatggttaa atgatctcaa caatgaaatg aaaaaagcga atcgaaaaat 1200 tttgcttttt gttgataact gccccgcaca tcctccaatt gatgaacttt caaacttaac 1260 tgttaaattt ttacctccga atacaacttc gaaacttcag ccacttgatc aaggcatcat 1320 taaagctttt aagggctact acaggaagaa tttgatgaaa tcaattatta ttgataacat 1380 tgagaaggac attaaagaat tcactttatc aattaatata ctttccgcta ttaaatggat 1440 tgatctagct tggaaaaacg tatcaaccaa aaccatttgt aattgtttta aaaaggcagg 1500 ttttacgaag tttaataatg gatttctggc tgttaatgaa aatgtaatca atgatttaat 1560 tgatgatgat tttgataata taaatttcaa taatgtaatc agttcagatc atgaaaaatg 1620 gaatgaatat ctttcattcg atgatgattt ggcaacttgt gaagaaattt gcgaagaaaa 1680 tcttattgat gatgtgatgg aagaactaaa tgagaaaaat ggagtttctg atgagcctat 1740 acttgatgct aattgtgaaa cctttattca ggaacaggaa attaatttta atgaggcaaa 1800 ggtatattca agaaaattag ccaacttttt tctgagagaa attcctgatt ctgtttccaa 1860 attatatgag attgaagaac atcttaataa ttatggatta acctcaatga ttaaaaaaaa 1920 caatcaaaca aaaattgaaa tgtattttaa aacttctgag gttatatgat actatttata 1980 atatggaatg ttttcctatt gattaaatct gaaataaggt attaaagcga ttaataaact 2040 tcattgtttt aatttttttc cttttcaatt aattgttaaa atagtttatt taagtttcac 2100 taattggctt taatttatta tttatagtta tcttggaagt agaatgtaac agagtgagat 2160 attacgtcat acttcaatat gtaatttttg atctatgggg aacgaattaa tttatcttct 2220 ataattcgat taaaaacaat acttttttcg atatttttca ataaatatct agaatttatg 2280 aatttttact taactagaat atacattatt aaattaccat aaatttgaat ggaatttcct 2340 ccaagaggac cacctctata aaaggaccgc atattgcaat cccccttagc ggtccgttta 2400 aaggggtttt actg 2414 // ID STTREP_Mp repbase; DNA; INV; 169 BP. XX AC AF039654; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Myzus persicae subtelomeric tandem repeat sequence. XX KW STTREP_Mp; tandem repeat; Telomeric repeat. XX NM STTREP_Mp. XX OS Myzus persicae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Myzus. XX RN [1] RP 1-169 RA Spence J.M., Blackman R.L., Testa J.M. and Ready P.D.; RT "A 169-base pair tandem repeat DNA marker for subtelomeric RT heterochromatin and chromosomal rearrangements in aphids of the RT Myzus persicae group."; RL Chromosome Res 6(3), 167-175 (1998). XX DR Genbank; AF039654; Positions 1 169. XX SQ Sequence 169 BP; 54 A; 39 C; 26 G; 50 T; 0 other; ccaaaattaa tcgaatttcg ggaatttcaa agttctcgtt ctcccgttct attcaaccaa 60 atgacttcaa accttcacca aactcaagag gaagtcaata gctttctttt gataccactc 120 cgacttggtt agctcaaata cacaggtcgt gagacaattt tgaaaatgg 169 // ID CR1-26_BF repbase; DNA; INV; 4682 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-26_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-26_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4682 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4682 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1597-1597 (2009). XX DR [2] (Consensus) XX SQ Sequence 4682 BP; 1279 A; 1104 C; 896 G; 1403 T; 0 other; actttcagaa taatccaaga tggccgccac ctacggacgc atgtaagtgg cgtctctcca 60 gtgccgcgtg tttttcatag tttaactccg tttgtgtcac cccactaaac ctgtgtgagc 120 accgtgatgg gcctgacagt taaagaaaag gaggaggtga tccccaacct catagacgct 180 cggattaaag caactattga ggctgctctg gttactgctc tcaaacagag cgatgttctt 240 gccaccattg tgcctaccat cgtcagcgca gtgaaggagt ctgtgcgctc atccatccag 300 gatgagctga agaagttgca ggaccagctt gtgcagcagg aaaggaagtg tacctcactc 360 caagaagcct acaacaacct tgaagcaaaa tacaacgacc tagagcagta ttccaggcgt 420 aattgcatcg tcatctctgg aattccagaa ccacctgtgg aagagggcac ggataatgtc 480 gtcatggagc tcgtcaacaa caagctgaaa tgtgacccgc ctatcaacaa ccaacacatc 540 gacaggtccc atcgccttgg caagccgcgc accgacggta aaccacgtcc tgtcatcgta 600 aagtttgtct cctaccgtca gaagtccgcc gttatgcgcg ccaagaccaa ggtcgacgcc 660 agagatgttc tcaggaaaga caaagtgtac atcaacgaca acctcacaaa gggaaacatg 720 cagcttctga aagaggctcg gtcactagtc aagtcaaatc atctaaacca gtcatggtca 780 tatgacggaa agatttttgt gaagactttg aacggggaga gacaaatgat cttcaagcaa 840 gctgatctgg acatcttccg ttcagcgtaa tctgctcagc ggcactgaca actgaatctc 900 tattccaagt cattgactgt aatgtctctg ccatcgttta cttgtgttat tctattgtat 960 ggttatcttg tgagatcgtg ttcttctatg tgtacagcaa tgggtaagtc taattcattt 1020 attgacgggt caatacttct gaatagattg cagagagggt gcgcgaagcc gctttcatag 1080 gtcttgatat gtacagggca gtcattggtt tgttttctgc acattcacgc ctaatttcta 1140 aggaaagatt tagacttaca ctgaatctag atatcttcct atcctacact actgttttca 1200 tatcttttgt agtaatttta gtagcgggag acgtccatcc caatccaggg ccagtttgcc 1260 gtaaacagtt taacgttatg catctaaatg tcaatagcct agtagctggc actaagatgg 1320 acgaactatc tgctctagct gttagactca acctcgatgt tattgctgta tcagaaactt 1380 ggttgggaga ttctgttgat tcttttgcaa tttctcttga cggttttcag ttgccagtcc 1440 gtcgtgacag aaataggcat ggtggcggtg tcctcattta catatctgac cagatagcgt 1500 acaagcgacg cttagatctg gaatcaccgt ctgttgaatg catttggctt gaactctttg 1560 taggacgcag ccgtttactc ttttctgtct actaccggcc acctggccag gattccaaaa 1620 ctgttgatga atttgttgac ttgtttagtg actcagtatt cacagctgaa tcctctcaac 1680 acgatggcat cattgtcaca ggtgacttta atgccaaaca cgaggcatgg tgggcccatg 1740 acccaatcac ttcggtcggc acaaaactat tgcaagcttc taaaatgtta aatctcacac 1800 aggtgataaa ggagccaacg tgcgatttgt cacggtctcc ctccctgatt gaccttctat 1860 tcatcaacga gaaaaaactt atcaagtcta cctctgtgtt atctcccctc tccggatgcc 1920 accacagccc tattgtcgct actattaact tgtcactcag gccccctcgt ccttatgtcc 1980 gtaccatctg ggattattcc aaaaccgatc acagcttgct ggcactcttc tcgtctgagc 2040 caatatggca tgaagttttc tattgtgata ctgttgaaga agctagcaag aagttgatgg 2100 aactgatttt ggaagccaaa attaaatgtg tgcctcataa ggacattctc gttagacctc 2160 gtgacaaacc ttggatttca cctgacatta gacgcttaat gcaccagaga gacaagttgc 2220 ataagaaagc caagttaacg aacagcccta ttcattggtc ttcatatcgt cttattagaa 2280 ataaacttgt ccatcagtta tcactagcca agtctaatta tcacaaccgc ctggttgcct 2340 ccttaagtag tcccccctca tccaacaaaa agtggtggca cattgtgaaa catttctacc 2400 aaagtaaagt taattctact attccaccgc ttaagtctgg caactgtttt ataaccgatt 2460 caagagaaaa ggcaactatg tttaatttgt atttctcttc tcaggcctca gtggatgact 2520 ccagtgctag acttccaccc cttgatttcc ttactagtgc tcgtcttaca cagtgtgtta 2580 catcagctgc tgagatagaa ctttatgtct cagatttaga agtatctaaa gcatgtggct 2640 atgataatgt agacaatcgt tttcttaaat ccatatgtcc ctatatctct gacaaaattg 2700 catatgtttt taacttgtct ctaaatcacg gagtcttccc tgaatcctgg aaaagagcaa 2760 atgtggtacc aattttcaaa aagggggacc ccgaggatgt gtcgaattac cgacctgtat 2820 ccctactccc ctgtttgtct aaaatccttg aaaaaattgt atttaaacac ttgtacaacc 2880 acctgatgtc ccaaaactta ttatattctc tccagtctgg atttatccgt ggagattcaa 2940 cagtaggcca acttgtttgt gtaactgaca aaattcttga agctcttgac tctaatagag 3000 aggtccgagc ggtttattta gatttttcac gtgcatttga caaagtttgg cataagggca 3060 tcattttcaa gcttcaaaga aatggggtcg agggtccggt cctaaattgg ttccacagct 3120 atctttctga tcgtgtacaa agagtagtta tagatggcca gcactcagat tggtgtaata 3180 ttcgcgcagg cgtaccacag ggttctgtat taggccccct cttatttctt gtttacatca 3240 atgatatggt ggacgattta tcatcttgcc cgtttctctt tgcagatgat agttctttag 3300 tagatatagt ggatagccca accgaaactg cgtcaagtct aaattctgat cttagtaaga 3360 tatcatcttg ggcttccaca tggcttatga aactgaatcc tttaaaaact gaggaaatgt 3420 gtttttccag gaaaatcgat cccccttgcc atcctcccct ttttcttgat aactgtgtca 3480 ttcaatctgt aaaagaccac aagcacattg gtgtttttct cacgtccacc atgtcttggg 3540 aaaaacatat tacacacatg attgccaaag tatcaaaacg tgtgtccaca ctgaacaaac 3600 ttaaatttaa attgccacga catgtactcg aagttatcta caagtccttc attcgtccac 3660 ttcttgagta cgccgacata gtatggcatg gctgcactat atcggactcc cgacgtatag 3720 agagagtaca atatgagtgc tcactcacag tatcaggtgc tgtgagagga tcctcatatt 3780 tgtcactcct ttctgaactt gggtgggaaa aactgtctga ccgtcgccat attcactctt 3840 tgacaatgtt tcataaaatt gttcatggtc acactcgcca gtaccttcag gagttgatca 3900 ctcctgaagt ctctgcctct actccctacg accttcgtaa caaacacaac ttacagtcac 3960 cggcttgtac caccactcgt taccaaaggt cctttattcc gtatgctaca tatcactgga 4020 atggtcttga tatagctgtt cggtcattta atccttcaac attcaaaaac tacttaataa 4080 aattggttcg cccttcccct aacacacatt ttagtcaagg ccccaggtac gcctgcgttt 4140 tactcacacg cctccgcatc gggacctgta gtttgaactt tagcttattc acacgtaatt 4200 tagtcgatag tccttcttgt agctgtggct cccgatgtga aagcgttgtc cactatttat 4260 tgtactgccc taactacaac caacttagaa aaacccttct ctgcaatctt caagaccttt 4320 taacccttga tttgaaaact acgtctgacc cagtcttaac tcaactttta ctcaggggtt 4380 cacccacttt accaacagaa acaaataacc atattatgca gcttacacag acatacatac 4440 tccagaccaa ccgttttcaa actgattgaa cattttcctt aaatcttgta tgtatataca 4500 tttatttgtt ggatccgtca ccccactgct gtactactgt tttatttgat ctcacattgt 4560 tttgtccttt gtttatttta ttttcaaatg tcctagtggt aatgttaata ttagctttgc 4620 tagagttcat ttaccactgt gtcttgtctt atatgtatac caataaaaaa aaaaaaaaaa 4680 aa 4682 // ID I-51_AAe repbase; DNA; INV; 7023 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-51_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7023 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1322-1322 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 568..1944 FT /product="I-51_AAe_1p" FT /translation="MASTSGGDPGGSNRRTLPLYMDPHNEFGAVTTLLMTG FT KDGTNLPIEPFIIGKSLEDWAGPIEDSRSVSRCTKYVIRTRKPAQVEKLLK FT MTHLCDGTEVSVVLHPTLNISRCVISAYDVIEKDEQEIVENLSSQGVIKAR FT RILKSNKDRTAAIILTFNKSVYPANVKIGVLNFKTRPYYPNPMLCFSCFEY FT GHPRLSCTNSKRCYNCSQDHEEREKCEDAPFCRNCEKDHRPTSRQCPIYKK FT EMEVIKTKIDCNLTEADARKRVEAGNGSYARIAAQPRLDQIKLSDLSTQLA FT EKQKKIEELESSLAHVTQVLEAKFNDIVTKNAEKDTQIKELLADLKQRDER FT IAKLEAGNQGMKKYIEDIKMRARTNSQSSEPSHSKSNNKKHKTKRTTPSNE FT NDNERASRSNMSPPPKKQPNTVRSPILTRNTINKSRNEAVDPSETDDPMHC FT DSSNSTDYGPSRTQ" FT CDS 1925..6916 FT /product="I-51_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MDQVEPNNLALRSTIECTANNQQHTVNSNQFLLGHQH FT SRFEECECAVPHSLLKEAQQEGKSIRDAIPQPVDGGPYCSAGSSSSRTNDI FT LSTNTIQLLDHPSLNNPKNPTPNSSHTYNPNSDQISQNSKIINRNESSAKL FT STVARQKSIPIKGSYRTNRINNSSDNQQISPINGGQDIRITNSSSEGFGVG FT SVNPRQVPALALTTDVPSSWIEPVAAAGAGSLSVSQASNIISSSLSSNDAA FT EFSTATSRQSTSRFSSTDANPVNQDIRIINSSSGGVGVGSVNPRQVPALAL FT TTDVPSSWIEPVAAAGAGSLSVSQASNIISSSLSSNDAAEFSTATSRQSTS FT RFSSTDANPVNQDIRIINSSSGGVGVGSVNPRQVPALAPTTDVPSSWIEPV FT AAAGAGSLSVSQASNIVSSSLSSNDAAAFSTATSRQSTSRSSPNDANLAPI FT NQHIHSVAESESSNPSGGNNQTFVLQWNIRSVWRNYAELRKIVDDSNPAII FT CLQEMMTTRTNNLLSNNYAWSIRNRPESMGSGISGLGVRRDIPHEFINFAS FT DLLVCIARIGKPWNLTIVSIYVPQSVSSDALTSKLTDIVQSLEPPFLLCGD FT FNASHEVWGSSRSNNRGRSLLEWAVDNNLISLNNGSPTHYHASSGTFSAID FT VSFFSNCLASKFSWICDDDLHGSDHFPISIQLNNPLPTIQYTKRWIYKDAD FT WGKFETIVANKIPPNTVTSIEEITQAIHEAALESIPRTKGMPGKKRQIWWN FT DEVKQVVKNRRKALRKLRKLKDDSPGKEAAKQQFNEAKREARRMIEEAKTN FT SWTSFQEIFTTKSSTSQLWQSFNKLSGKRRTNKLGMIINGEHNSNPTAIAD FT FLVEFFASTSANEGYSEEFLRKKLLREGTPLNLISNNVSPMDQKFSIQELF FT QALESANGTSTGVDNIGYPMIKHLPFHCKLTLLEAFNHIWMTGDIPLTWKE FT SIIIPIPKPGESANGANGFRPISLTSCMAKTMERMVNRRLMDFFEFNKRLH FT PRQFAFRKGKGTTCYFAELDEFITEAAKKGEHCEVVTLDIKKAYDKVWHRN FT ILDTIIENNLGTNMNKFLNNFLQHRSARVSFAGALSKSKHLENGVPQGSVL FT SVSLFLLAMNSVFSIVPKNVQVFLYADDIILVAAGKRVSYLRKRLQKAVNL FT VDKWATDIGFDLSSSKSSTMHCCKLKRHRRWHLEGAKIILNDNELDKPKVT FT RILGVLFDRKCTFNAYTKHVKEECGSRLNLLKAISKGADRKTLLYIGNSIV FT ISKLFYGMELLGNVNIEKLSPVYNQVVRTASGALKTSPILSLMVESGCLPF FT QHAATLTIIKKACATLEKSASGTSNLWVKADTLFQTLTGKPLPKLSKLHTM FT FHRAWNKKKPFIDWTVKSKIRAGQTPHLAQDIFRRLVSENYENFDHWYTDG FT SLAEGKVGYGVIGPGIRIEQSLPEQCSVFTAEAAALAHAAHRAKTRSVIFS FT DSASSLQALESGKSRNSYIQKIEKVIEDKDLRFCWIPGHSRIPGNEEADSA FT ANCGRHKPVGEEDVPAQDIKVWVTREIWRAHQSAWDQAPRNNLKIVKPNVG FT RWIDRSNRKEQIVLTRLRIGHAWFTKRYLFEKNPREKCTLCDRYLSVHHVL FT ASCQSYNAARRELGISNELRRILQNEKTTETKLISFTKKIGIFKCI" XX SQ Sequence 7023 BP; 2256 A; 1632 C; 1505 G; 1630 T; 0 other; cagttcgtgt tgagcaaccg aagtgtacgg tcgttcttca ataaatcgga gctaagaccc 60 ctgttggttc gtacctcgtt aatatctcgg agcgctagcg ctacgccttc cgacgaaagg 120 atttcgtgag tttagtgtta ttggtcaacc taaccgcgaa gttttgccac ctctaaccgc 180 gattttattc gcgagcgggg ttacctgtcg taccaggcga tcatacgcgt gaaaagtgtc 240 ggggtattgt aaaccacctg tttgggcgac aaagcgtccc caccttgtgc atgccgtgta 300 tgtgaactac gaaggaccaa gctactcctt aatgctatag tggttgtgaa gtgcaagaag 360 tagagtgcaa gaagacatct gtaggcgata agcgacgaac ttgatcgcta ctgagtcacc 420 agaagagcat cgttataaaa ggtagggtca gattttccaa ttcctaattt ccaattccct 480 ccgcgctggg gtaggtagtg ggtgaccact atcactttat gcggtatttg atctcgctga 540 agctgttggt atcaagttaa gtgttgaatg gcctccacca gtgggggcga cccaggaggg 600 tcaaatagaa gaactctgcc gttgtatatg gacccccaca acgagttcgg tgctgtgaca 660 accttgctaa tgaccggtaa agatgggaca aatctcccaa ttgaaccgtt catcattggg 720 aaaagccttg aagactgggc aggaccgata gaagactcca gaagtgtaag cagatgtact 780 aagtatgtta tacggacccg gaagcctgcg caagttgaaa aactcttgaa gatgacgcat 840 ctctgtgacg gaactgaagt ttcagtagtg ttgcatccaa cgcttaacat tagccgctgt 900 gtgatttccg cctatgacgt gatagaaaag gacgaacaag agattgtgga gaacctaagc 960 agccaaggag tgatcaaagc ccgtcggata ttgaaaagca acaaagatag aactgcagcc 1020 attatcttga cgttcaacaa gagcgtatat cccgcgaacg ttaaaattgg agtactcaac 1080 ttcaaaacac gcccatatta cccaaacccg atgttatgtt tcagctgctt tgaatacgga 1140 catcctcgtt tgagctgtac gaattccaaa cgctgctaca actgctcgca ggatcacgaa 1200 gaaagagaaa aatgcgaaga cgcccctttt tgtagaaact gcgaaaaaga tcaccgaccc 1260 accagtcgac aatgcccaat atacaaaaaa gagatggagg tcatcaaaac aaaaatcgac 1320 tgcaacctga cagaggctga tgcgagaaag cgtgttgagg cgggaaatgg cagttatgct 1380 cggattgctg cacagccgcg attggatcag ataaaactaa gtgatttatc gacccaactt 1440 gcggagaaac agaaaaaaat tgaagaactt gaaagtagtc tggctcatgt cactcaagta 1500 ttggaagcca aatttaacga tatcgttacc aagaacgctg agaaagatac tcaaataaag 1560 gagttgctcg ctgacttgaa acaacgagac gaaaggatcg ccaagttaga ggcgggaaac 1620 caaggcatga aaaagtacat cgaggatata aaaatgagag ctagaactaa ctcgcagagc 1680 agtgaaccga gtcactcaaa gagcaacaac aaaaagcata aaacgaaacg aacaactccc 1740 tcgaatgaaa atgacaatga gcgtgcctcc aggtcaaata tgtccccacc accgaaaaaa 1800 caaccaaaca cagtgagaag tcccatcttg accagaaaca ccatcaataa atccagaaac 1860 gaagcggtag acccatccga aacagacgat ccaatgcatt gtgactcttc caactccacc 1920 gattatggac caagtagaac ccaataattt agcgttacga tcaaccatcg aatgcaccgc 1980 aaacaatcag cagcatacag taaactccaa tcaatttctg ttaggtcatc aacacagccg 2040 gtttgaggag tgcgaatgtg cagtaccaca ctcacttcta aaagaagcgc aacaggaagg 2100 aaaatccatc cgggacgcca taccccaacc cgtagatgga ggaccctact gttcggccgg 2160 ctccagttct tcaagaacca atgacattct ttcaacaaac acgatacagc ttttagatca 2220 tccttccctc aataatccca aaaaccctac accaaactcc tctcatacat acaatccaaa 2280 ttctgaccaa attagccaaa acagtaaaat aataaacaga aacgaatcaa gtgcgaaact 2340 atcgaccgtt gccagacaga agtcaatacc aatcaagggt agctatcgaa caaataggat 2400 aaacaattca agtgacaacc aacaaatatc accaatcaat ggtggtcaag acattagaat 2460 aaccaattca agttcagaag gcttcggagt ggggtccgta aacccccggc aagtccctgc 2520 actggccctg accacagatg ttcccagttc atggattgag cctgtggcgg cagctggtgc 2580 ggggagcttg tctgtgtctc aggcaagtaa catcatatct tcttctctaa gctcaaatga 2640 tgccgcagaa ttctccacag caacctctcg acaatcaaca tctcgctttt catcaaccga 2700 tgccaatccc gtcaaccaag atattaggat aatcaattca agttcaggag gcgtcggagt 2760 ggggtccgta aacccccggc aagtccctgc actggccctg accacagatg ttcccagttc 2820 atggattgag cctgtggcgg cagctggtgc ggggagcttg tctgtgtctc aggcaagtaa 2880 catcatatct tcttctctaa gctcaaatga tgccgcagaa ttctccacag caacctctcg 2940 acaatcaaca tctcgctttt catcaaccga tgccaatccc gtcaaccaag atattaggat 3000 aatcaattca agttcaggag gcgtcggagt ggggtccgta aacccccggc aagtccctgc 3060 actggccccg accacagatg ttcccagttc atggattgag cctgtggcgg cagctggtgc 3120 ggggagcttg tctgtgtctc aggcaagtaa catcgtatct tcttctctaa gctcaaatga 3180 tgccgcagca ttctccacag caacctctcg acaatcgaca tctcgctctt caccaaacga 3240 tgctaattta gctcccatca accaacatat ccattccgta gcagaatcgg aatcatcaaa 3300 cccatctgga ggaaacaatc aaacttttgt gctacagtgg aacataagaa gtgtatggcg 3360 gaactatgct gagctccgca aaatcgtcga tgacagcaat ccagcgataa tttgtcttca 3420 agaaatgatg acaaccagaa caaataacct actcagcaac aactacgcat ggtcgatccg 3480 caatcgaccg gaaagcatgg gcagcggtat cagcgggctt ggcgttcgtc gtgatattcc 3540 tcacgaattc atcaactttg catctgatct tctcgtatgt atcgcaagaa taggtaaacc 3600 gtggaacctc acaatagtgt caatttacgt cccacaatcc gtaagtagtg acgctctcac 3660 aagtaagttg actgatattg tacaatccct tgagccgcca tttctcttgt gtggcgactt 3720 caatgcttct cacgaagtgt ggggtagttc acgttctaac aatcgcgggc gatctctctt 3780 ggaatgggca gttgataaca atctcatatc actcaacaac ggttcaccaa cgcactatca 3840 tgcttcttcg gggacgtttt ctgcaatcga cgtctcgttt ttctctaatt gtctcgcatc 3900 caagttttct tggatctgcg acgacgactt acatggaagt gatcatttcc caatttcaat 3960 tcaattgaac aatccgctcc cgaccatcca atatacgaaa agatggattt acaaggatgc 4020 tgactgggga aaatttgaaa caatagtggc taataaaatc cctccaaaca ccgtcacctc 4080 tatagaagaa attacacagg ccattcacga agcagcattg gagtccattc ccagaactaa 4140 aggtatgccc gggaagaaac gacaaatttg gtggaacgat gaagttaagc aagttgtgaa 4200 aaaccgaaga aaagccctgc gaaaactgcg aaagctgaaa gacgacagtc ctggtaaaga 4260 agcagccaaa caacagttca acgaggctaa gcgggaagcg agaagaatga tcgaggaagc 4320 caaaacgaat tcctggacaa gttttcagga aatattcaca actaagtcta gtacttcaca 4380 gctatggcaa agctttaaca aactcagtgg taagaggcgc actaataaac taggaatgat 4440 aatcaacgga gaacacaata gcaaccctac tgctatagcc gactttcttg tagaattttt 4500 cgcttctacg tcggctaatg aaggatactc cgaggagttc ttgcggaaga aactgttaag 4560 agaaggtacc ccacttaatc tcatttcaaa caacgtctct cctatggacc aaaagttctc 4620 cattcaagaa ctttttcaag ctcttgaaag cgcaaatgga acttcaacgg gcgtggacaa 4680 cataggttat cccatgataa aacatctgcc ttttcactgt aaactaaccc tcctcgaagc 4740 cttcaaccat atctggatga ctggggatat cccattgaca tggaaggaga gcattattat 4800 tccaattcca aaaccgggag aatctgcaaa tggtgctaat ggttttcgcc ccatttcact 4860 gactagttgc atggccaaga caatggaaag gatggtaaac cgccgcttga tggatttctt 4920 tgagttcaac aaaagactcc acccgcgtca gttcgcgttc cgcaaaggaa aaggcactac 4980 ttgctatttc gccgaactag atgaattcat cacggaagcg gcgaagaaag gagaacactg 5040 cgaagtcgtc acccttgaca ttaagaaggc ttacgataaa gtttggcaca gaaacatctt 5100 agatactatc atcgagaaca atcttggaac aaacatgaac aaattcctca acaattttct 5160 tcagcaccga tcagccaggg ttagttttgc aggcgcccta tcaaagtcta aacacctgga 5220 aaatggagtt cctcaagggt ctgtattatc agtttccctg ttcttattgg ctatgaactc 5280 ggtgttttct atcgttccaa agaatgtgca agtctttctg tacgcagacg atataatcct 5340 agtggctgca ggaaaacgcg tcagttacct gagaaaacga ctccaaaaag cggttaatct 5400 agtagacaaa tgggctacag atatcgggtt cgacttatct tcttctaaat catccacaat 5460 gcactgctgc aaactaaagc gtcaccgtag atggcatctt gagggcgcca aaatcattct 5520 aaacgacaat gagctagata aaccgaaggt aacaagaatc cttggagtgc tatttgaccg 5580 gaaatgcacc ttcaatgcat atacgaagca cgtaaaagaa gaatgcggaa gtcgactaaa 5640 cctactcaaa gctatatcga aaggtgcgga tcggaaaaca ttgctgtata ttggaaactc 5700 gattgtaatt tcgaaacttt tttacggaat ggaactatta ggaaatgtaa atatagaaaa 5760 actttctccc gtatacaatc aagttgtcag aacagcatct ggtgctctaa aaacatctcc 5820 aatactttct ctgatggtgg aaagtggatg cctcccattt cagcacgcag caacactgac 5880 aatcatcaaa aaagcgtgtg caactcttga aaaatctgca agcggaacta gcaatctatg 5940 ggtcaaagcc gatactttat tccagacgct aacaggaaaa ccattgccaa aattaagtaa 6000 actgcataca atgttccaca gagcgtggaa caagaaaaaa ccttttatag actggactgt 6060 aaaatcaaaa atcagagctg gacaaacacc tcatctggca caggatattt tccgccgact 6120 agtttcggaa aattatgaaa actttgacca ttggtacaca gatggatcac ttgcagaagg 6180 aaaagtgggg tatggagtta ttggacctgg cattagaatt gaacaaagtc ttcctgaaca 6240 atgttcagtg ttcacggcgg aagctgcagc tttggcccat gcagctcatc gtgctaaaac 6300 caggtctgta attttcagtg actcagccag ctcccttcaa gcgttggagt cagggaaatc 6360 aagaaattcg tacatacaaa aaatcgaaaa agttattgaa gacaaagact tgaggttctg 6420 ctggataccg ggtcactcta ggattccagg aaatgaagaa gcggacagtg cagcaaactg 6480 cggcagacac aaaccagttg gagaagaaga tgtaccagcg caagacataa aagtatgggt 6540 tacacgagaa atctggcgtg cacatcaatc agcctgggac caagcgcctc ggaataattt 6600 gaagatagtt aaaccgaacg ttggaagatg gattgaccga tcaaaccgca aggagcaaat 6660 agtcttgact agacttcgta ttggtcacgc ctggttcacg aaacgctacc tctttgagaa 6720 aaatcctcgt gaaaaatgca ctctctgtga ccgctacctg tccgttcatc atgtattggc 6780 atcatgccag tcgtacaacg cggcgcgtcg agaactagga attagtaacg agctacgaag 6840 aattcttcag aacgaaaaaa cgaccgaaac caaactcatt agctttacga aaaagatagg 6900 aattttcaag tgcatataat atgtaacagt tactaatgtt tatttttcaa ataaagtatt 6960 caatagagac gaatgcccct tctgggtaaa gtctctctaa acaaaaaaaa aaaaaaaaaa 7020 aaa 7023 // ID Gypsy-2_BT-LTR repbase; DNA; INV; 502 BP. XX AC AELG01001160; XX DT 15-JAN-2011 (Rel. 16.02, Created) DT 15-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the buff-tailed bumblebee: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_BT_; KW Gypsy-2_BT-I; Gypsy-2_BT-LTR. XX OS Bombus terrestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Bombus; Bombus. XX RN [1] RP 1-502 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the buff-tailed bumblebee."; RL Direct Submission to RU (15-JAN-2011). XX DR Genome; AELG01001160; Positions 116930 116429. XX SQ Sequence 502 BP; 146 A; 125 C; 102 G; 129 T; 0 other; tgtagcggta ctcgtcgaca aaaatcataa tagtttttgc ccagtaaatc cgctacacaa 60 gaaaccgtgt ttacccaaag acgagctaat tatagatcca tcgcccttag gcggttccga 120 tcgttagtcc aaatccgtcg tttttgccgt gttcataggc cctcaagtca actacccaaa 180 aagccaacaa ttgtaaacaa ggtcgagagc tcggcgcccg ttgggacgtt tcgagaacat 240 cttcgaaatt caaatcccaa cgcctttcgt caactcccgc gaatataaat atagcggaca 300 gggtagcgac gcattcttag ttatcgagag acacgagcag cagcaattaa ccgatacttc 360 ttcgcgccga tctattagcg accgcgaacc gagacggcaa gtgtattgta cgactagtgt 420 acgcattggg tatattctaa taaactacgt agttaggtat actcttgtgt ttgtgaaaat 480 taatccacac cttatcacct ca 502 // ID Gypsy-34_NVi-LTR repbase; DNA; INV; 303 BP. XX AC . XX DT 29-JUN-2009 (Rel. 14.07, Created) DT 29-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-34_NVi-LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-303 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1382-1382 (2009). XX DR [1] (Consensus) XX SQ Sequence 303 BP; 60 A; 100 C; 84 G; 59 T; 0 other; tgtcacggac gagctttggg ctttccggcg gcatctgcga tggcctgcat ttccggcgat 60 gcgccagacg catcgtcagc ggaagaacac cgcggcccga aagtttccgc gcgcagaagt 120 gcaacgctgc acttcggcac gtggaaccgc tcgccgtcga cgagtcgcgt cgacgagcga 180 gcgggcagac caccggcgac tctccaagcg agcggagctc cgatattttg ggataataaa 240 tacagtgaca ccgaacttcc ctgcctcttc tctctcctcg tgtacaccca ttcttccgtc 300 aca 303 // ID DNA8-1_AAe repbase; DNA; INV; 770 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-770 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1281-1281 (2011). XX DR [2] (Consensus) XX CC >97% identical to consensus. 8-bp TSDs. TIRs are 297 bp long. XX SQ Sequence 770 BP; 234 A; 159 C; 146 G; 231 T; 0 other; tagagtggtt caaaaaatcg tttttgctcc acaccgctca ttcgattcta gatcaaattc 60 tgagtgtcct cccaaaattt gagctcattc ggatgaaaac tgagactgca caagcccttt 120 aaagtttata tgggaattac tatgggaaaa gcaagcaatt cattcaatcg gtcatagtgt 180 ttgcccatgt gctcttgagg attagtgcta cgttgatact gtgagataca ttcatcagct 240 acaactttgc cgaagaccgt tttcaaatcg gacgtctcag taattagtta ttgatttata 300 tccagtcaca aattcttcaa cagtgctcat ttcacttctg aacaggcaac attgctgcat 360 ctggcgcgaa agatagcacg cacaaatcat ggctactatg ttttactgca tatatccagt 420 gatgcctgca gaattgccca attattatag ccaaagtttt acaactcata caaaaatcaa 480 taactaatta ctggggcgtc cgatttaaaa atggtcttcg gcaaagttgt agctgatgaa 540 tgtatctcac agtatcaacg tagcactaat cctcaagagc acatgggcaa acactatgac 600 cgattgaatg aattgcttgc ttttcccata gtaattccca tataaacttt aaagggcttg 660 tgcagtctca gttttcatcc gaatgagctc aaattttggg gggacactca gaatttgatc 720 tagaatcgaa tgagcggtgt ggagcaaaaa cgattttttg aaccactcta 770 // ID SMAR20 repbase; DNA; INV; 1920 BP. XX AC . XX DT 05-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR20. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1920 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1078-1078 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 832..1755 FT /product="SMAR20_1p" FT /translation="MLKNESSKGCKTSKERLTVLCCASMSGSKEKLLVIGK FT SANPRCFKGVKQLPVDYYANKNAWMTTLIFNEWLLKWDNKLNHKIVLLVDN FT CTAHIVKVNLKQIKVIFLPANTTALIQPCDQGIIRTLKAYYRREMRSRVLE FT NMEDSQKLTANELAKKTNILDALHLLAMSWKSVSEKAIKNCFSHGGFTTED FT LESEEIVENPTDLTDEAFEEWMLIDQNIPIAEKLTDSEICDTVTQTSPASI FT EDEEEDEDESQTNNKPPTACEIRNALQILRLGVQHRSSEFHKQYEYESFIN FT SLLRKNTRQSSLHDFF" XX SQ Sequence 1920 BP; 683 A; 309 C; 371 G; 557 T; 0 other; tacagtggcc gccgcttatt ataatcacgg ttaatgttat cattcggcta atgttatcac 60 tttcactaag tcccggccgc atattccttt ttatactgaa aataatctgc ttattgttat 120 cagaatctat taaaaaattc gcttattgtt atcacttttt taatttaaaa tatcaattaa 180 aacctaatta aaatctttaa attcaatgtt tcattaaaaa aattaattaa caaatcaaat 240 ccttgtctga tgtgaagctt aaaccaaaat gtccgagaaa cagaaaaaaa gaaaagactt 300 aacactgaag gaaaaagttc aaattattga gaactatgac aagttaccaa agatgggtca 360 acgcaatgca tctgttcaat taaagatttc tcagccttta ctttgtaaaa ttctaaaaaa 420 tcgagatgaa attttgaggg gagcaaaact aaatgaaaat ttaagttgta tgcgaaaacg 480 agttggaaag gatgctcagg ttgaatctgc tttaaaactt tggtttactc atgttcgaga 540 aaatgatgca cgtgttgatg ggcctctgat gagacagaaa gctgaagagc ttgccactag 600 aatgggaaaa gatgattttg ttgctacaga tgggtggttt aatcggtgga aaaaagggaa 660 aacattgttt acaaacgcgc tcatggtgag caaaaagatt ctgatttttc agcagctgaa 720 gcatggctca ttgaggagtg gccaaagatt attgcagagt attcaccgga aaatgtttac 780 aatgcagatg aaactggcct gtactatcgt gccttgcctg agcacacttt tatgctcaaa 840 aatgagagtt ccaaaggctg caaaacttca aaagaacgtt tgacagtact ttgttgtgca 900 agcatgtcag gcagcaaaga aaaactgttg gttattggaa aatccgcaaa cccaagatgc 960 tttaaaggtg taaaacaact tccagttgat tactatgcta acaaaaatgc ttggatgaca 1020 actctgattt ttaatgaatg gcttctgaaa tgggataaca aactgaacca caaaattgtt 1080 ttgttggttg acaactgcac cgcacatatt gttaaggtca atttaaaaca gatcaaggtg 1140 atcttcttac cagcaaatac gacagctttg attcagccct gtgatcaagg aatcattcga 1200 acactgaagg catattatcg aagggaaatg cgatcaagag tcttggaaaa tatggaggat 1260 agtcaaaaac tgactgcaaa tgagttagcc aaaaaaacaa acattcttga tgctcttcat 1320 cttttagcaa tgtcgtggaa aagtgtctct gaaaaggcaa tcaaaaattg cttcagtcat 1380 ggcgggttca caacagaaga cttggaatca gaggaaattg ttgaaaaccc aactgatctg 1440 actgatgaag cttttgagga atggatgtta attgatcaaa acataccaat tgcagaaaaa 1500 ctgacagatt cagaaatctg tgacacagta actcagacta gcccagcatc aattgaggat 1560 gaagaagaag atgaagatga aagccaaaca aataacaaac cgccgactgc ttgtgaaatc 1620 agaaatgctc ttcaaatttt aagactgggt gtacagcaca gatcttcaga atttcacaaa 1680 cagtatgaat atgagtcttt tattaacagt ttactgagaa aaaataccag gcaatcgtct 1740 ttacatgatt ttttctaagc tttttatgca tgttgagatt tctgaatgaa atgtattgat 1800 aaatgtttaa ttaaaaattt gaaaaaaaat tgcatttttt ttcgggtatt gttatcattc 1860 ggataatgtt atcagtttca tgttggccca aagtgattac aataagcggc ggccactgta 1920 // ID Homo5 repbase; DNA; INV; 2432 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo5 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo5. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2432 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 628..2064 FT /product="Homo5_1p" FT /translation="MISIASHFKSTVLYDSNSKRKAEIDKALTEMIVKDVQ FT PYNIVDNEGFVQYTQILDPKYKLPSKTHLRDVLIYNYFKATTEKLSVILEN FT IPDIAITCDLWTSSANACFLTVTSHFVHHNELKTVSLATKKLLNKINHCSQ FT NIADTLREILIEWNILNKTVCIVTDNASSMLKACEILQIRNLPCFAHSINL FT VVQDVLKVDDVIIQDLFTKCKSIVRFFKQSTIANEKFKLAQEGSAYTLLQE FT TPTRWNSFYFMIERILKTNDAIAKVLLSTTNAPQPFTAEEILVLKDIEKVL FT AFFQQASEQISGGKYVTISLIIPMAYGLLRKIESVSPLLQTLQGRTIKNIS FT MESIKKRLSIYEQRTVCRMATILDPRFKKDGFQHSSNGEQAATFFENELAN FT FSAKEFPINQPVARQTSSQSTLFFDFVGERSMTKVGNSRVDAILIKRQYLE FT RPTVSQDTDPLLWMKVCLKLNYKFNIILFINQII" XX SQ Sequence 2432 BP; 820 A; 456 C; 450 G; 706 T; 0 other; tagaggtgga gaacaatcga tgtcactatc gaaactatcg aatttagggc acaatcgagc 60 tatcgaaata tcgatggtca aataatgcga tgcttatcga tagtgtcgtc gatatttttt 120 cttacatcac tgcaatcaat gttggcgctt tattcatctt gaccgcagaa agtttagtta 180 atctaacact attattatat tacattttta aacaaaatgg atcgcttcct ccaagaaggt 240 aatgcccata acatattatt agaatgtatt ttgcaatatt tgtgtttgat ttaaggtaag 300 cgcagcattg ccacggcgca cagtggggcc tctgtcggcg cgccggcttc tgattgcagc 360 aatgcgaata ctgcagcaat aacgccctca caggcgaaaa gaaataagat ttccgaagtg 420 tggaagtatt ttaagaggtc tggagacaaa caagttgcta agtgccttac ctgtggtaag 480 gaatataaaa caagtggcaa cacatctaat ctccatgacc acttgaggag gtttcatcct 540 ggtttagaga ttaaaaatcc agactctagc acacctgctg tatgtgctga agggaacgac 600 cctgctgcca gtagttgtaa atcaaatatg atatctatag cgtcccattt caaaagtact 660 gtcttgtatg actcaaattc caaaagaaag gcagaaattg acaaagctct aactgaaatg 720 attgtcaaag atgtgcagcc ctacaatatt gttgataatg aaggctttgt tcaatatact 780 caaattctgg atccgaaata caaactaccg agcaagacac atttacgaga tgtgctaata 840 tataattatt tcaaggcaac cactgaaaag ctgtcagtaa tacttgaaaa tattcctgat 900 attgcaataa catgtgactt gtggacatca agtgccaatg cttgtttttt gactgtaacg 960 agtcattttg ttcatcataa cgaactaaaa acagtatcct tggctacgaa aaaactgtta 1020 aacaaaatta atcactgttc gcaaaacatt gcagatacat tgcgagaaat attgattgaa 1080 tggaatattc tcaataaaac tgtatgcatt gtaacagata atgctagctc catgcttaaa 1140 gcatgtgaaa tattacaaat ccgcaatctg ccatgctttg cgcattcgat caacttggtg 1200 gtacaagatg ttttaaaagt tgatgatgta ataattcaag atttatttac taagtgcaaa 1260 tccatagttc gattttttaa acaaagcaca atagcgaatg aaaaatttaa actagctcaa 1320 gaaggttcgg cgtatacttt gttgcaggaa acacccacca ggtggaatag tttttatttc 1380 atgattgagc gcattttaaa aacgaacgat gccattgcca aagtcttatt atcaactaca 1440 aatgcaccgc agccgtttac tgctgaggaa atccttgtcc tcaaagatat agaaaaggta 1500 ttggctttct ttcagcaggc aagtgaacaa atttcaggtg gaaagtacgt taccatatcc 1560 ctaataatac ccatggcata tggactttta cgcaaaatcg aaagtgtttc acctctgctg 1620 caaactttgc aaggaaggac tattaaaaac atatcaatgg aatcaataaa aaaacgcctt 1680 tccatatatg agcaaaggac agtatgcaga atggctacaa tattggatcc acgttttaag 1740 aaagatggat ttcaacacag ctcaaacggt gagcaagcag caacattttt tgaaaatgag 1800 ctagcaaatt tttctgccaa agagttcccg attaatcagc ctgtcgcacg acagacaagt 1860 tcccagagta ctcttttttt tgattttgtg ggtgagcgat caatgaccaa agttggaaac 1920 tcacgagtgg atgcaatcct tattaaaagg cagtatctgg agaggccaac tgtctctcaa 1980 gatacggacc ctctgttatg gatgaaggta tgcttaaaat tgaattataa gttcaatata 2040 attctattca ttaaccaaat aatataattt attttgcaga tgaatcaatc tgattttccc 2100 tctattggaa gccttttctg caagtacctc tgcattcccg caacgtcagt agaatcggaa 2160 agagcattca gcaaggcggg ccaaattatt tcagagaggc gaacacgact caaagaaaaa 2220 aacattaaca ttcttttatt tttaaatcgt aacttttgga tcaaataaac aatatatata 2280 tcttatattc aattcaaata tatttggctg ttggggtaca ctttcgatgg cgctatcgat 2340 actgtcgatg acaccatcga caatatcgat ggttcaaaat aaacaatcac caaaaatcga 2400 gagcgtccaa tatcgatagt tctccacctc ta 2432 // ID R1_DSe repbase; DNA; INV; 5417 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE sechellia. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DSe. XX OS Drosophila sechellia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5417 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 357..1790 FT /product="R1_DSe_1p" FT /translation="TSIRLLASTVGGATIVFMPMESDSSASALSGSSASMK FT SRRGRRRSHLASKSSAPTQAKLVALASNGVPEPVGVLEEPFSSLEDARAAT FT ENAAIDAAPPHAAATAADPTAAPAADHTAASAVFTAAKIVATTATAATAAA FT RAGQAAMMAELSATQRMVRSSFRRLGEVDTEELSYAISRYDELVLALMLRC FT GELETRLAMPPPPPPSLNLLKNTAANAPQMQQVAPIAAPRTTKVRETWSAV FT VKCDDPALSGKDIAEKVRTMVAPSLGVRVHEVRELRRGGGAIIRTPSVGEL FT QKVVASKRFAEVGLNVARNAAEKPKVVVYDVDTAIGPEEFMKELHENNFDS FT EMNLAQFKKSVHLVTKAWSVADGATVNVTLEVDDRAMAKLDVGRVYIKWFS FT FRCRSQVRTYACHRCVGFDHKVSECRQKDSVCRQCGQQGHTAAKCQNPVDC FT RNCRHRGQPSGHYMLSSVCPIYGALLARVQARH" FT CDS 1787..4849 FT /product="R1_DSe_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="TLMFSFIQANCGRGRAATIELGVRLRRSESMFALVQE FT PYLGGDGMDVLPEGMRIFIDRRGKAAILVDHQEAICMPVETLTTDYGVCLV FT VKGSFGSIFLCAAYCQYDAPLEPYIRYMDAVLLQASRTPAILGLDANAVSP FT MWLSKLSRHAEGQANYRRGELLSEWMLEARVAALNQSTEVYTFDNYRATSD FT IDVTIVNEAASMWATYEWRVDEWELSDHNIITVVAEPTTARAVESIAPVPS FT WNFSNARWRLFKEEMVSRAAELPENFSESPLDQQVSTLRSIVHNVCDIALG FT RKSIRLPNRRARWWTADLCDARREVRRLRRLLQNGRRHDDDAAIERVLVDL FT RRASANYKKLIWRAKMDEWKRFVGDHADDPWGRVYKICRGRRKCTEIGCLR FT VNGEMITDWGDCARVLLRNFFPVAESEAPTAIAEEVPPALEVFEVDACVAR FT LKSRRSPGLDGINGTICKAVWRAIPEHLASLFSRCIRLGYFPAEWKCPRVV FT SLLKGPDKDKCEPSSYRGICLLPVFGKVLEAIMVNRVREVLPEGCRWQFGF FT RQGRCVEDAWRHVKSSVCASPAQYVLGTFVDFKGAFDNVEWSAALRRLADL FT GCREMGLWQSFFSGRRAVIRSSSGTVDVPVTRGCPQGSISGPFIWDILMDV FT LLQRLQPYCQLSAYADDLLLLVEGNSRAELEEKGAELMSIVEAWGAEVGVT FT VSTSKTVIMLLKGALRRAPTVRFAGANLPYVRSCRYLGITVSEGMKFLTHI FT ASLRQRMTGVVGALARVLRADWGFSPRARRTIYDGLMAPCVLFGAPVWYDT FT AEQVAARRRLASCQRLILLGCLSVCRTVSTVALQVLGGAPPLDLAAKFLAV FT KYKLKRGYPLEENDWLYGEDTTCLSWKQRKTRLEECLLQNWQNRWDDDSEP FT GRVTHKFIPYVTLAYRDPSFGFSMRTSFLLTGHGSFNAFLQGRALSDTTAC FT ACGDPYEDWMHILCACPLYADLRNLDGLGVQRLGENWTFNRILEDQQRIQR FT LAAFADEVFRRRRGI" XX SQ Sequence 5417 BP; 1211 A; 1353 C; 1654 G; 1199 T; 0 other; cggacgtgtt ttcgttgcgc tcgtgtacag attgcgaaga acttggtttt ccgtgtttgg 60 aaagtaataa aatcggtgaa ttagtgctcc gcgaaagtcg tgtgctaatt ttcgtgtgtt 120 ataaacaagc ggtttggaag taattaacaa ttaattgttg ggaattttcc acttagcacg 180 tgctgaaggc gaacttacga gtaagttttt cgagtaagct ttttgatcag ctgtcacaaa 240 gcttattggt gggagtcact gctaaggttt gtgtctaggg agagttttag tgctaccaga 300 ctctaccgat aggtggagct cgagttccag agcaactacc ttttgtcagc gagtaaacga 360 gcatacggtt gctggcgtcg acggtaggtg gagccaccat cgtatttatg ccgatggaga 420 gcgacagcag cgcgagtgcc ttgagcggaa gtagtgcctc gatgaagtcc agacgaggca 480 ggcgcagaag ccatctggca tctaagagct cggcgccaac gcaggcgaaa ttggttgccc 540 tggcatcgaa tggagtgcca gaacccgttg gggtactgga ggagccgttt tcgtcgctgg 600 aggacgcccg ggcggctacg gagaacgctg ccattgatgc tgcccccccc cacgctgctg 660 ccaccgctgc tgatcctact gctgcccctg ctgccgacca cactgctgcc tccgccgttt 720 tcactgctgc taaaattgtt gccaccactg ccactgccgc caccgctgcc gcccgtgctg 780 ggcaagcagc catgatggca gagctgtcgg ccacacagcg catggtgaga agcagcttcc 840 gcagactagg agaagtggac acggaagagc tctcatatgc tatcagccgc tacgatgagc 900 tggtgttggc gctaatgctc cggtgcggag agctggagac gcggcttgct atgccgccac 960 cgccgccgcc gtcgttgaat ctgttgaaaa atacggccgc caatgctccc cagatgcagc 1020 aggttgcacc catcgctgcc ccgcggacta ccaaggtccg cgagacgtgg tcagcggtgg 1080 tgaagtgcga cgaccctgcg ctatcgggga aagacatagc ggaaaaggtg cgaacgatgg 1140 ttgcaccctc tctcggagtc agagtacacg aggtccgtga gctccgtcga ggtggtggtg 1200 cgatcattcg cactccttcg gtgggagagc tgcagaaggt ggtcgcttca aaaagattcg 1260 ccgaggttgg cctaaatgtg gcacggaatg cggccgagaa gccgaaggtc gtcgtatatg 1320 acgtcgacac agctatcggc cctgaggagt ttatgaagga gctccacgaa aacaacttcg 1380 acagcgaaat gaatctggcc cagtttaaga agtcagtgca cctggtgacc aaggcgtggt 1440 cggtagctga cggcgccaca gtaaatgtga cgctggaggt tgacgaccgg gcgatggcga 1500 agcttgatgt aggtcgtgtc tacatcaagt ggttctcatt ccgatgccgg tcacaggtcc 1560 gcacatacgc ctgccacaga tgtgtgggtt tcgaccacaa ggtcagcgaa tgccggcaga 1620 aggatagtgt ctgtcgccag tgcgggcaac aaggccacac tgcggcaaag tgccaaaacc 1680 cggtggactg ccggaactgc cgccacagag ggcaaccctc ggggcattac atgctctcga 1740 gcgtttgccc gatatacggg gcgttgctgg cgagggtgca agctagacac taatgtttag 1800 cttcatccaa gcgaactgtg gccgaggccg agctgcgacc atcgagctcg gagtccgact 1860 caggagatcg gagtctatgt tcgcgctggt gcaggagccg tatctcggcg gggacggaat 1920 ggatgtgctg cctgaaggaa tgagaatttt catcgatcgg cgagggaagg cagccatcct 1980 agtggatcac caggaagcca tctgtatgcc agtggagacc ctcaccacag attatggcgt 2040 atgtctggtc gtgaaaggga gttttggctc aatcttcctt tgcgccgcat actgccagta 2100 tgatgcacct ctggaaccgt acatccggta catggatgcg gtcctgctgc aggccagcag 2160 aacccccgca atcctgggcc tcgacgcgaa tgcagtgtcc cccatgtggc ttagcaaact 2220 ctctcgtcat gccgaggggc aagctaacta cagacggggt gagctgctgt cagagtggat 2280 gctggaggca agagtcgccg ccctaaacca gtcaacagag gtgtacacgt tcgataatta 2340 cagagctaca agcgatatcg acgtgacaat cgtcaatgag gcagcatcta tgtgggccac 2400 atatgagtgg agagtggacg agtgggaatt gagtgaccac aacatcatta ctgttgtggc 2460 cgaaccaact accgcgcgcg cagttgagag catagctcct gtgccgtcct ggaacttctc 2520 caatgcacgt tggcgattgt tcaaggagga aatggtgagt agagcagccg aacttccgga 2580 aaacttctca gagtcgccgt tggaccagca agtttcgacc ctgcgcagta tagtacataa 2640 tgtatgtgat attgcgctgg gaagaaagtc catccgattg cccaacagga gagcacgttg 2700 gtggactgcc gacctctgtg atgcaaggcg cgaagtccgg agacttcgtc gcctacttca 2760 aaatggaagg cgtcatgatg atgatgccgc aatagagcgt gtattggtcg acctgaggcg 2820 ggcctcagcc aactacaaga agctcatttg gagggcgaaa atggatgagt ggaaacgctt 2880 cgtgggagat catgccgacg acccatgggg gcgcgtctat aagatttgcc gaggccgcag 2940 gaagtgcacg gagattgggt gcctccgcgt gaatggcgag atgatcaccg attggggtga 3000 ctgtgcacga gtgctcctcc gcaatttctt cccagttgcg gagtccgaag caccgactgc 3060 catcgcggag gaagtcccac cggccctcga agtattcgag gttgatgcat gtgttgcccg 3120 gttgaagagc aggcgctctc ccggcttgga cggcattaat ggcactatct gcaaggcagt 3180 ctggcgcgcc atacccgagc accttgcatc gttgttttcc cgatgcatcc gattaggata 3240 ctttcccgct gagtggaagt gcccacgagt tgtctcgctg ctcaaagggc cagataagga 3300 caaatgtgag ccctcctcat atagaggaat atgcttgcta ccagtctttg gtaaggtgct 3360 cgaggccatc atggtgaatc gtgtgagaga agttcttccg gaaggctgca gatggcaatt 3420 tggatttcgc caaggacgat gtgtggagga tgcttggagg cacgtgaaga gcagtgtttg 3480 tgccagcccg gcgcaatacg tgctcggcac attcgtggac ttcaaaggag cattcgacaa 3540 cgtcgaatgg agtgctgcac tccgccgact agccgacttg ggatgccggg agatgggctt 3600 gtggcagagc tttttctccg gccgaagagc agtgatccga agcagttccg gtactgtgga 3660 tgtaccggta actagaggct gcccgcaggg gtcaatcagt ggcccattta tctgggacat 3720 actgatggat gtactgcttc agcgtctcca gccgtattgc cagctgagtg catacgcgga 3780 tgacttgctg cttctcgtcg agggaaattc ccgagctgag ctagaggaaa aaggtgcaga 3840 gctgatgtcc atcgtagaag cgtggggagc ggaagttggc gttaccgtct cgaccagcaa 3900 gacggtaata atgctgctga aaggtgcctt gagacgtgcg cctacggtga ggtttgctgg 3960 agcgaacctt ccgtatgtgc gtagctgtcg gtaccttggc atcacggtca gtgaaggaat 4020 gaaattcctc acgcacatag cttcgcttcg ccagcggatg accggagtcg ttggagcatt 4080 ggcgcgtgtg cttcgagccg actggggctt cagtcctcga gccaggcgga ccatatatga 4140 cggactcatg gcaccttgtg tgctgtttgg tgccccggta tggtatgaca ccgccgaaca 4200 ggtagccgcc cggagacgac tagcctcctg ccagaggcta atcctgcttg gatgcctttc 4260 ggtatgccga acagtgtcca cagtggcact gcaggtactt ggcggagctc ccccgcttga 4320 cttggctgct aagtttttag cggtcaagta caagctgaag cgtggatacc cgctggagga 4380 gaacgactgg ctatacggcg aggacactac gtgtctaagc tggaagcaga ggaagactcg 4440 cctagaggag tgcttgctgc agaattggca aaacagatgg gatgacgaca gcgaaccagg 4500 acgggtgacg cacaagttca tcccatacgt cactctcgcc tatcgagatc caagctttgg 4560 attctcgatg aggacgtctt tcctgctgac aggtcacggg tcgtttaacg catttttgca 4620 agggagagcc ctcagcgata ccactgcttg cgcttgtggt gatccatatg aggactggat 4680 gcacatattg tgcgcttgtc ctctatatgc agatttgcga aacctcgatg gacttggagt 4740 gcagcgcctt ggcgaaaact ggaccttcaa tagaatcctg gaagatcagc agaggattca 4800 acggctggca gcgtttgcgg acgaggtgtt ccgtaggagg aggggtattt agcccaaaac 4860 ttcgccgtgt ggttagcggg cgagaatact tccacagccc gctattgctt gtcgtaagag 4920 gcgactaata tagcgactgg ttcctctaac catgcttgtc ggagcaaaag gaggaggccc 4980 accgagcctc tctttcggta ccacgggttg tgctgctcca agacagcaca ttgaggtagg 5040 ccccctggtg ggagtatcgt ggtggctgtg gttgataccc atatcgcggg tagagccttc 5100 gtgttcgacg tttgagttac ggtgctagtt gcgcaaaact cgggtgctgt gacccagaga 5160 tcagtagaga ttttaggtag atctcgctcc tcagcaaggg ggagtgcttg cccggcaagc 5220 aagtactcga attgctaccg gggtggtcgc tatgtacata gctatagctt ccagtccggg 5280 acgtttgtct ggcgtatcca gactcatgca ccatgttgat acatgcacca cttgtgggtg 5340 ttcagggtgt cgtggttgta atcccttcag tgtggaacac gccacgtaaa acaagttcgg 5400 agaggtccga aagtcac 5417 // ID BEL4_Cis_LTR repbase; DNA; INV; 449 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of BEL LTR Retrotransposon from Ciona DE savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL4_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-449 RA Smit A.F.; RT "BEL4_Cis_LTR - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC 5 bp dups. XX SQ Sequence 449 BP; 103 A; 62 C; 87 G; 192 T; 5 other; tgttactgca ccgctgcagt agttttggtt catkgatcgt tttggttcat tgattgtttt 60 ggttcattga ttttgattac attttatttc tatgatttgt ccttggtgat ttcacgtttt 120 ttaatttgaa tagtcctttt cgatttcggc tgcgaggaag cagcagtttc ttttggtgaa 180 taatacttaa ggtatgtgtt ttgatacgtt ttggtttggt atnttcgttc aagttattaa 240 tggtgtttta atttaatatt ataggagtaa acattcaaca ttcctcgaag taacggacgc 300 gcaattgtta aggtattatt ttgtgatagt tttttcattt ttccttttta takcgtctta 360 catgcattta tgttttttta gttaccgctg aaacaccaaa taaaggaatt ggananagcg 420 cgtcaacttt attgcgtgcc gccgtaaca 449 // ID BEL-241_AA-I repbase; DNA; INV; 6402 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-241_AA_; KW BEL-241_AA-LTR; BEL-241_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6402 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 936-936 (2011). XX DR [1] (Consensus) XX CC Positions [5449-5847] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 985..2589 FT /product="BEL-241_AA-I_1p" FT /translation="MSEVVLPQKEHRQPAILTVSQTQSDFASFNTRQIKQP FT NSSLFNAASNTPTFIPHTNSIVTEPCLPPHSSEYTTVFDPIHMSSPRVRFS FT SPPSVPYVYNNLSPIVPPIISSYSTVHTVQSNPSPMVSSIISTCTPTSAIS FT STVTTPASVSMSHAWISVPNAQQLAARHVVPKDLPIFSGDPVEWPLFLSYF FT QNSTEMCGFSNGENLIRLQRSLKGNALEAVRSFLLHPSSVPSIIGTLRTLY FT GRPDLIVNSLLEKVRNTPAPKPDKLESLISFGLACQNLCGHLRAAGQETHF FT SNPSLLHELVNKLPANIKLDWAIFKQKCPEVNLCTFGDYMSQIVLAVSDVT FT PFQEPEGFPNDKRKKQKLYVNAHSFGTSSNTEEARGSIDKQVFLKPCLVCK FT KQGHNVRECETFADSSLEDRWKYAQELHLCRRCLNAHGRYPCKTAIVCGFN FT GCQERHHKLLHPRKPQSELSSQSSNSTGIVNVHHKPLVTVLFRIIPVTLFG FT NGKSIETFAFLDEGSSTTMVETRIAQELGIQGEVQPLSL" XX SQ Sequence 6402 BP; 1821 A; 1513 C; 1510 G; 1544 T; 14 other; aaacttcaag attttatccg tgatcggaaa attgattcgc aacaaccaaa agtgaacatc 60 atggcgacag cggtagcccc aggcacgtgc ttctgtggac aattgataga tcccaaggac 120 aaaatggcgc gctgccaaaa gtgctcacgt catgtgcatt tagcatgcgc agtggaggat 180 ctcaccatca atggaagttg cctcgtttgt ttacgctgtg ttgaagacgt accaccgccg 240 ccccgatcgc tctcaagaaa atcatcgagc tctacagccc gtgctagaat tttattagac 300 attcgtcgtc tggaagagga aaagatcatg caggaaaaag cggcagaaga taaagcacgg 360 cgagatcagg attttttgtc aaaaaagtac gatcttctac atgctcaatt ggacgaagaa 420 gaagatggaa gtgtaagaag tcgtcgaacg gtttcaagca ggcagamagt agaaaattgg 480 ctccggcgtg aaccaatggt aaccgctgtt tcggaaatca ggtgtgagtc agatcccgtt 540 ggtgaatcca ctaggactgg aaccactcca aagcagggtc accaaacacc atcaaaaaac 600 cctgcaacat cctccttcca agctaatact caatcctgct ttaccgttgg gatccaatcg 660 tctgttggaa gtatggcaaa gtggagagca tctttagacg gtggcgwaca acagccagat 720 acatcggata accaaacgca aaaccctctc gattggacct tcgatcgaga agctcgtcgg 780 gattctcaac cacccccgtc tagtcggctg gcagtcgact tgcaggagat tcagcaaaag 840 ttcgactcaa tgaaactgtg gcccactgat caatcacctc cagtcgawcc agggcaaggt 900 cagcgacaac cgttaggtgc ttcagccgct gtgtcggaga gcatacagat tgcaactcta 960 ccggccatca aaagtggtcc acggatgtcg gaagtcgtgt taccacaaaa agagcatcgt 1020 caaccagcta ttttaacggt gagtcaaaca caatctgatt ttgcgtcctt caacacgcgt 1080 caaataaagc aaccaaattc gtcgctgttt aatgcggctt ctaatacccc aacgttcata 1140 ccgcacacaa actccattgt aaccgaacca tgcttacccc cccattcttc tgaatacaca 1200 acggtcttcg atcccataca catgagttct cctcgggttc gtttttcgtc acccccctcc 1260 gtgccgtatg tctacaacaa cctgtctcca atcgttccac caatcatctc gtcgtactca 1320 actgtgcata cagtacaaag caatccatcg cctatggtgt cctccatcat ctcaacttgt 1380 acaccaacat cggcgatctc gtccaccgta accaccccag cgtcagtctc gatgtcacat 1440 gcgtggatta gcgtaccgaa tgcacaacag ctagctgcca gacacgtcgt tcccaaggac 1500 cttcccatct tttcgggcga cccagttgag tggccgttgt ttttgagcta tttccagaac 1560 tccaccgaaa tgtgcgggtt ttcaaatgga gaaaatttga ttcgacttca aagaagcctc 1620 aaagggaacg cgcttgaagc agtacgaagt tttctgctac atccatcctc ggtgccaagc 1680 ataatcggaa cgctgaggac actgtatggc cgaccagacc taattgttaa ctcgctccta 1740 gagaaggtgc gtaatacacc agctccgaaa cctgataaat tggagtcgct gatttccttc 1800 ggcctcgcct gccaaaacct atgtggacat ctgcgcgctg caggtcagga aacccatttc 1860 tccaacccct ctttgttgca cgaactcgtt aacaaattgc ckgccaacat taaattggac 1920 tgggcaatct tcaaacaaaa atgcccagaa gttaatcttt gtactttcgg tgactacatg 1980 tcacagatag tgctagccgt kagtgatgtt accccatttc aagaaccaga aggcttcccc 2040 aatgacaagc gcaagaaaca aaagttgtat gttaatgccc attcatttgg tacctcatcg 2100 aacacagaag aggcccgtgg ctcaattgat aaacaagtgt ttctaaaacc ttgtttggtg 2160 tgtaaaaaac agggtcataa tgttcgagaa tgtgaaacct ttgctgacag ttcattggaa 2220 gatcgctgga agtatgcaca agagctccat ctgtgtagaa ggtgtttgaa tgcacatggg 2280 cggtatcctt gcaaaaccgc aatagtttgt gggtttaacg gatgccaaga acgacatcac 2340 aaattacttc atccgaggaa accgcagtca gagttatcat ctcagtcttc aaacagcacc 2400 ggaattgtca atgtacatca caagcctttg gtaacagtcc tcttccgtat cataccggta 2460 acattgttcg gtaacgggaa atccatcgaa acctttgcat ttcttgatga aggttcttcg 2520 acaacaatgg tagaaacaag aatcgctcag gaactaggaa ttcaaggtga agtgcagccg 2580 cttagtctac astggaccag cgacgtcgaa cgwacagagg aaggttcgca ggtagttagt 2640 ttggaaatct gtggacgagg aacccacaag cgtcacgtat tgaaaggagc tcatacggta 2700 cagaagcttg gactacctac ccaaagctta caactcgacc tactcattga gaaattttct 2760 tacttacgtg gacttcccat ccaggactac caaaatgcaa caccatcggt tctgatcgga 2820 ttggataaca cccatttgaa aatacctctc caaatccgcg agggaaaaag tggtcaaccc 2880 gctgctacca aaaccaggct tggttgggca gtttatggag gaattctggg agcaacttca 2940 tcaatcaacc aataccagtt tcacatgcag gcagaaccta ggaaggcgga tgaagactta 3000 cacaatctag tgaaagagtt cttctcgatt gaaaattttg gcgttgcagc ggcccctgct 3060 ctagaaggat cagaagtcaa acgagcaaat aaaattttgg aagacacaac ggtgcgatta 3120 ccctcagggc agttccaaac cggccttttg tggaaatatg accacataca ctttccagat 3180 agcaggccta tggcagagcg taggttgaaa tgtttagagc gtcgtttgat gcggwcccca 3240 cagctgtacg aaaacgtaaa gcagcaaata gctgagtacg aggtcaaggg atatgctcat 3300 aagatcaccc aagatgagct tgctagctca gatcctcgaa gaatatggta tcttcctttg 3360 ggggtcgttg tgcatccaaa aaagcctggt aaaatccgtg ttgtatggga tgcagcagct 3420 acggtgcaag gacaatcact taactccgtt ttgctgccag ggccagatct attgacatcc 3480 cttccagcag ttctctccca atatcgccag aggcaggtgg caatcagcgg tgatattcag 3540 cagatgttcc accagctcaa aatacgtcca gaagataaac aggcgcaacg atttttgttt 3600 agagatgacc cgtcgaaggc tgttgtcaca tacgtcttgg acgttgcgac tttcggtgcg 3660 acctgctcgc catctgctgc acagtttata aaaaamcgca atgcgagaga tttcgaggac 3720 cagtatcctg aggcagcaca tgccattatc cacaaacatt atgtggatga ctatctcgat 3780 agtctggata cgatagacga agcagtagat ctggcgcaac aggtaagaac tgttcatgct 3840 aaggctggct tccatatccg caattggcag tccagctcca aagaggttct tgcacggatc 3900 ggcgaatcag cagcggaagt accaaaatgc ttcaaagctc ataactctct gggtacagag 3960 cgggttctgg gaataagttg gttgccgaat actgacgaat ttgtcttcag tggacttttc 4020 cgagaggatt tcaagccgtt attacacggt gatgtcgctc ctaccaaaag gcagttgctc 4080 caggtggtca tgtctatatt tgatcccctt ggacttgtct ctctcattgt ggttcatggc 4140 aagatattac tacaaaatgt ttggcgttca aaaatcgact gggacacaaa aattgatgga 4200 aacttgtttg gcgaatggta taggtgggcg attcttttga agcaactaaa cacggtgcac 4260 atttcgagat gctacttccc tggatacgct cctgaaagct acgacactct ggaacttcac 4320 gttttcaccg acgccagcga ggaagcatac gcagctgttg cctatttccg tatcatcgat 4380 aggggtcaag ttcgatgtac cctagtaggg gccaaaacaa aagtggcacc actgaagcca 4440 ctgtctatcc ccagacttga actacaagct gctttacttg gtgccagact agcaaaatcg 4500 atagaagaga accacacact gaatatcaac cgacgtgtct tttggagtga ttcatctacc 4560 gttctatcat ggctacaatc cgatcaacga aaatatcgac catttgttgc gtttcgtgtg 4620 gctgagatta tagattcaac taaactaacg gaatggcggt gggttccttc acggtcgaat 4680 gtkgctgacg acgctaccaa atggaagggc aatttacgac ttgataacga tcaccgctgg 4740 tttcaaggac cggagttctt acgactagac ccgacaatgt ggccggaaaa tcatgtgtgt 4800 agtaaggaaa caagtgaaga attgcggtct acaaatgtcc acgtggtgtg cccctccgag 4860 cagttsatac ccttcggcag attttctaaa tgggaacgat gcttacgagc tgttgcttac 4920 gttcatcgtt tcatccgtca acttcagcga agaatacgat gtgaagcttt ggaaaactcg 4980 aaatggttaa ctcgagaaga gcttcaacaa gccgaagcaa ctatatggag attagcacag 5040 agcgatgtgt tcacgaaaga gatcacgtta ttacaaaaga acaaccaagt tccgcagggc 5100 caacagcagc aattggataa gagaagcaaa ctttataagt tatcaccatt cttggacggc 5160 aatggtgtta ttcgattgga gagtcgaatt ccagcattca gcgacgcacc atacgacttt 5220 aaatgtcctg tactattacc aaaaggacac tacgctacca agcttatggt tgactggtat 5280 caccggcagt atatgcattg caatggcgaa acagcagtga atgagatgcg gcaaaaatac 5340 catttgtcag aaatgagggt agctttcaaa aakttgcgga aggattgttt gtggtgcaaa 5400 gtatacaagg cgtcacctag cactcccaga atggctccgc ttcctgaagt cagagtcaca 5460 cccaatgtaa gggcgtttag cttcgtcgga ctcgattatt ttggcccgct tttggtaaag 5520 caaggtcgac atgaagtgaa aaggtgggtg gcacttttca cgtgtttaac cgtgcgtgca 5580 atccatttgg aagtggtaac tagcctatca acggagagct gtaaaatggc gattcggcgt 5640 tttatagcac ggaggggggc accgcgagaa atctacagcg accgtgggac caactttata 5700 ggtgttagcg gtgagttgcg agagaaaagt cgaaatatca accaaaaatt ggcagcaact 5760 atcacgaata tcaataccga gtggcgcttc aatcctcctt cggcgcctca catggggggg 5820 gtcttgggag cgcatggttc ggtcggttaa gtgtgcgctt gcatcgctct cttctgagcg 5880 taaaccagat gaagaaacct tggaggcctt gttgatcgaa gcagaggcga ttgttaactc 5940 taggccattg acttatatgc cccttgatca cgctgagaac gaggccttaa ctcctaactg 6000 cttcttaatg ttaagcacga gtggagtaaa tcagccacaa agtgatctgg tcgatgaagc 6060 atctacactw aaatctagct ggaacttgta ccaacgttta ttggaccgat tttggacccg 6120 ttggatcaaa gaatatttac caacaatcac caaacggact aaatggtttg tggacacgaa 6180 gccaattgaa gctggcgacc tggtgatcgt tgtagaagaa caccttagga atggttggag 6240 aagaggccga gtcctacgcg tgtttcctgg gcgagacggt agaagccgca gcgcggatgt 6300 tcagacgaca gctggagtgt ttcgacgacc agtgacaaag ttggccgtcc tggaagtagc 6360 aggtattgct gcwgagaaca ccaagcaata cgggccgggg ga 6402 // ID Copia2-I_Dmoj repbase; DNA; INV; 4048 BP. XX AC scaffold_2198; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2_Dmoj; KW Copia2-LTR_Dmoj; Copia2-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-4048 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1022-1022 (2009). XX DR Genome; scaffold_2198; Positions 4724 677. XX CC Positions [1555-2058] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 184..3198 FT /product="Copia2-I_Dmoj_1p" FT /translation="MEEDDDVLAVCEGSLTCPTQESATYERDSKRFLKADK FT TARKIIIHSIERRPMELIVSCKTAREMWIKLNSVYDMKSEENLSAIQKQFF FT DFKWIESESVDYNISKLEVLWSKMKNLGSAVPETMLMSRILSSLPQKFNYF FT HSAWDSVEDSKRTLNNLTTRLMTEEMRSKEQEVVPEESTALVAGNGKNKRE FT HENKKKIMKCYNCGVMGHLKKDCYRCYICKKKGHKSNTCHQRRNAGSQAEN FT NQGELGESSRKGLIASRDGREDDVWVIDSGASDHMSRRRDWYSTYEEFLEP FT QRIGVGNGKYITAHGKGNIDIETYVNNNKRATTMFNVLYVPEIVYNLFSIM FT SAAKRGVDCLIKNKGTQCLLRRDNKIIATGSRFGNLLKLDANVLMPKLCNL FT TRKIESKESDSLQVWHERFCHQNVRHVRDYLKNKNIVFLENDFFCEGCAYG FT KQHRLTFHKNIDRATKTREIIHTDVCGPIEVESIGKKLYFVSFKDDYSSYR FT EVYFIRHKSEVFEKLQIFIQAIQNQFHENIKEIHSDGGREYINKEVKLFFN FT KKGIKHIVNAPYTPEQNGVAERENRTIIEAARSMLHSNSELPLFLWAEAVN FT TAVYVINRTGPTKHGNKTPYELWFKKEPDIGNLKIFGTECFVHVPKEKRRK FT LDKKSIKGFIVGYVENCKGYRVYVPNMRDVIISRDVCFKPENFTAQFKKPE FT ILVDNSESETNESEYENNKIEDESTEHNNSEMGRRLRDRSQIRMPDFYGCP FT VTFITQSLPLNYNEALKSDDKEYWQTAMQEEVNSLHENGTWVLVPKPENQK FT VINSRWVYTKKLNPDGSERYKARLVIKGYSQKEGIDYKETFRPVVRFDTVR FT FLLSVAAQENLFLGQFDVKTSFLYGTLKENIYMHQPEGFEDGTSSVCKLLK FT SLYGLKQAPRCWTEHFANVIKNLGFTQSIADPCFYIYSKNNDRILLTMYVD FT DGLIAASNEFLIDNLFENLSKEFKITNTKNVTCFLGIEILKQPNGSIFIY" XX SQ Sequence 4048 BP; 1474 A; 516 C; 877 G; 1181 T; 0 other; ggttatgggc ccagacagtg ctaaacgtgt aatgtagtgt gaaatcgagt gcaagtacaa 60 aagtgaatta gtggcaaatt tgtaactaaa taaagacaat gaaaatggag ccattaaata 120 tgagaattga acgattacgc gacaagacaa ttggcatttg tggtgtttta tgattcgtac 180 gctatggaag aagacgacga tgttcttgcc gtttgtgaag gtagtttgac gtgtccgaca 240 caggagtctg caacatacga gagagactcg aaaaggttct tgaaagctga taagactgca 300 aggaaaataa ttattcattc tatcgagaga agaccgatgg aattgattgt gagctgcaaa 360 acagcaagag aaatgtggat aaaattgaac tcagtgtatg atatgaaatc agaagaaaat 420 ttatcggcta ttcagaaaca gttttttgat ttcaaatgga ttgaaagtga gagtgtggac 480 tacaatattt caaagttgga agttttgtgg agcaagatga aaaatcttgg aagtgctgtg 540 cctgaaacaa tgttaatgtc gcgaatcctg tcatctttgc ctcaaaagtt taactatttt 600 catagtgcgt gggattcggt cgaagacagc aaaagaacat tgaacaattt gacgactcgt 660 ttaatgacag aagaaatgcg gtcaaaagaa caagaagtgg tacctgaaga gtcaacagcg 720 ttagtcgccg gcaacggaaa aaataaaaga gaacatgaaa ataagaagaa aattatgaag 780 tgctataact gcggcgtgat gggacatctg aagaaggact gctacaggtg ttatatatgc 840 aagaagaaag gccacaagag caacacatgt caccaaagga gaaatgcagg aagtcaagcg 900 gaaaataatc aaggagaatt gggtgaatca tcgaggaaag gattgattgc gagcagggac 960 ggcagagaag atgacgtgtg ggtgatcgac tcaggagctt ctgatcacat gtcaaggcgt 1020 cgagattggt attctactta tgaagaattt ttggagcccc agaggatagg cgtaggtaac 1080 ggtaaatata tcacggcgca cggaaaaggt aatattgata ttgaaactta tgtcaataat 1140 aataaaaggg cgacgacaat gtttaatgtt ttgtatgttc cggaaatagt ttacaattta 1200 ttttctatta tgtcagctgc aaaaaggggt gttgattgtc ttattaaaaa taaaggaaca 1260 cagtgtttgt tgagacgtga taataaaata attgcaacgg gttctaggtt cggaaactta 1320 ttaaaattag atgcgaatgt tttgatgcca aaattgtgta atttaacaag aaaaatagaa 1380 tcgaaagaaa gtgattcttt acaagtatgg cacgagaggt tttgtcatca aaacgttcga 1440 catgttcgcg attatttgaa aaataagaat attgtttttc tagaaaatga tttcttttgt 1500 gaaggatgtg cttatgggaa gcagcatcgg ctaacatttc acaaaaatat tgaccgtgca 1560 acaaagacac gagagataat tcatactgat gtttgcggtc cgatagaggt agagtctatt 1620 ggtaaaaagt tgtattttgt tagctttaaa gatgattatt caagttatcg ggaagtttat 1680 tttatacgtc ataagtctga agtatttgaa aaactccaaa tttttattca ggcaattcag 1740 aatcaatttc atgaaaacat taaagaaatt cacagcgatg ggggtagaga atatattaat 1800 aaagaagtta aattgttctt taataagaag ggaataaaac atattgttaa tgctccttat 1860 acacctgaac agaatggagt tgcggaacgt gagaatagaa ctattattga agctgcacga 1920 tcaatgttgc attctaattc tgagttacca ttatttctat gggcagaagc agtaaacact 1980 gcagtttatg taattaaccg cacaggtcca acaaaacatg ggaataaaac tccatatgag 2040 ttgtggttca agaaagagcc agatatcgga aatcttaaaa ttttcgggac tgaatgtttt 2100 gtacatgtac caaaagaaaa aaggagaaaa ttagataaaa agtctattaa aggatttata 2160 gtaggatacg ttgaaaattg caaaggatat cgtgtatatg ttccaaatat gagagatgta 2220 attataagta gagatgtatg ttttaaacca gaaaacttta cagctcagtt taaaaaacct 2280 gaaattttag ttgacaattc tgaaagtgag actaatgaaa gtgaatacga aaataataaa 2340 attgaagatg aatcaactga acataataat agtgaaatgg gtcggcgatt gcgagataga 2400 tcacagattc gtatgccaga tttttatggt tgccctgtta catttattac acaaagcttg 2460 cctttaaatt ataatgaagc gttaaagtct gatgataaag agtactggca aactgcgatg 2520 caagaagaag ttaattcttt acatgaaaat ggaacttggg ttcttgttcc aaaacctgaa 2580 aatcaaaagg ttatcaatag tagatgggtt tatacaaaga agctcaatcc agacggctcg 2640 gaaaggtata aagcgcgttt agttattaag ggttattcac aaaaggaagg gatcgattat 2700 aaagagactt ttagacctgt agttagattt gataccgtga gattcctgtt aagtgtagct 2760 gctcaagaaa atctatttct tggtcagttt gatgtaaaaa catccttttt atatggtact 2820 cttaaggaaa atatttatat gcatcagcca gagggtttcg aagatggtac ctcttctgtt 2880 tgcaaattgt tgaagagttt gtatgggtta aaacaagctc cgaggtgctg gacagagcat 2940 tttgcaaatg ttattaaaaa tctgggattt acacaaagta ttgcagatcc atgtttttat 3000 atatactcta aaaataatga cagaattttg ttaacgatgt atgtagatga cgggttaatt 3060 gctgcatcca atgaattttt aattgataat ctatttgaga atttgagcaa ggaatttaaa 3120 ataacaaata caaaaaatgt aacatgcttc ttaggaatag aaatactgaa gcagccaaat 3180 ggttctatct tcatatatta aggaaattat ataaggaaaa ttttagaaaa atttaatatc 3240 ataaatgcga acattgtctc cacgccagtt gaatgttatt atgatgaaaa tgtatctgaa 3300 caaaataatt gtaatgcacc ttaccgcgaa gcagtaggaa atttaatgtt tttacaagtt 3360 gtttcaaggc cagatataag ttttctgtta atgtagtttc ccgagagcta gaaaacccga 3420 agcagcatca ctggacgtta gttaagagaa tatttcgtta tttgaaagga acgatggatt 3480 tgggaatttt atacagtaaa aataataata tttttgagac atatagtgat gctgattatg 3540 caggtgataa gaaaactcgg aaatctacat ctggaatttt gtgtaaatat gcgaatgctg 3600 caataacgtg gcaaagtaaa aaacaacaat gtgtagcttt atcaacaaca gaagccgaat 3660 atgtaagtgc agctctagca agtaaggaaa tgatatggct aaagaaattg ttttttgatt 3720 gtaacattaa tttaagtaac tacatcttgt ttatagataa tataagtgcg attaagctaa 3780 tcaaaaatcc tgaatttcat caacgaagca aacatataga tgttaaatat cattttgtac 3840 gcgatttatt tgagaaagat gagatagatg taaggtatgt aaagagtgaa gagcagatcg 3900 ctgacatttt tactaaagct ttgccaaagc ccagattttt gtatctaaga gagaaattgg 3960 gaatgaagaa taagggacaa atttgttata gagatgttga gttttaggga gggtgttgag 4020 gattgatctc gaaaaactca tcatcaag 4048 // ID HAT2a_Cis repbase; DNA; INV; 381 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; HAT2a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-381 RA Smit A.F.; RT "HAT2a_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000001 hAT classification based on 8 bp target site CC duplications (no bias); 17-20 bp terminal inverted repeats. XX SQ Sequence 381 BP; 107 A; 76 C; 80 G; 118 T; 0 other; ccagtatgga aagtacggca gctgcacact cgagtttttt ttccctgcgt ctttgcacac 60 ttgggtaatt tcacacacgc tcgcgagctg gcagtataaa catcgtataa tttccaacaa 120 acctgtaaac ctacctctgg acctcgggta atttactgtc agtacgtttt ttgttacgta 180 gatagatatg ttagctgatg gacacgatag atttaggata gaataatacg tttgtttaag 240 atttttacaa aatagtaata aaaaggagat tttgtgcaca cttgaagtgt gcactcttgg 300 acgtttgcac actgcacact taaatatttg gcgtgtgcac actcgagatc tcggcgaaaa 360 ttgacgtact ttccatacgg g 381 // ID DNA8-26_AP repbase; DNA; INV; 261 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-26_AP. XX NM DNA8-26_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-261 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1768-1768 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 261 BP; 88 A; 44 C; 49 G; 78 T; 2 other; accatagata tatacaacaa actagatagc cgtctcctcg ttgtcatgga agtcggccgc 60 catntgcaaa gcaggcnacg tctacaaaat aacgttatca catagactag tatatatgta 120 tagataaata tatatgtata taaataaaat gtaaaataaa tgtaaacgtt gcctgcttta 180 tagatggcgg ccgatttcaa catattggtc gcgactgtag tattcgaaga cggctatcta 240 gttgttgtat atatctatgg t 261 // ID Copia-40_AA-LTR repbase; DNA; INV; 120 BP. XX AC supercont1.304; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-40_AA_; KW Copia-40_AA-I; Copia-40_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-120 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.304; Positions 829744 829863. XX SQ Sequence 120 BP; 39 A; 22 C; 23 G; 36 T; 0 other; tgaagagacg tatcgaagga gcagttgatt ttatgaccac cttggatact gggagttgtt 60 tatcctctat caactagtga aaatttaatc attaataaac caaccttgct actgagcaca 120 // ID CR1-76_AAe repbase; DNA; INV; 4089 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-76_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4089 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1164-1164 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 226..1014 FT /product="CR1-76_AAe_1p" FT /translation="MAVSVCCSCAGELEKTEEIVCNGFCRSSFHLKCVKQT FT AATRDIVAKCSQLFWMCNACTKIMANSTFRNALSSANNAMEAIHAEQNNAL FT VELRQEMEQNTEKINMILRQLPTALQERTGRKTSTSSTSVPSSRRKRPRID FT EDAIQTEIRETEGTKEIDSSVMIPLADRTTSESKFWLYLSGFNPNATDDDI FT RSLVQRNLNTSDAVDVRKLVPKGKNLDELSFVSFKVGIDSQLKDMALVSST FT WQKGIIFREFDFHPRATFQFQQ" FT CDS 923..4018 FT /product="CR1-76_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RIWRWYPLLGRRVSYSANSISILGLRFSFNSNSSNYV FT ETFFSPIHATCCQHVDIASTNATTPIALQYPDHVNSNTDGKLNRSTPSITV FT YYQNVRGLRTKTNELFAALSVCDYDIVVLTETWLNKNVLDAELTDEYVIHR FT CDRSASTSQCLRGGGVLIGVKKNIRCNVFTINGADRLEQVVIRASVGSVSL FT VVCAVYLPPNTELGLYEQHAACINELHRKTSIHDRIVVLGDYNLPLLRWCF FT DNDVGCFIPTNASSEQEISLVENVIASGLQQVNYLTNENGRLLDLAFVNCM FT TICEIIEPPLPLMNVDRHHQPFVMVIEDQRTTLENENNSNVVYDFNHCDFD FT VLNVAISQINWREILNSGTLDEAVDSFYGGLDSIFRQHVPLKRQSRQHGSR FT RPWWNGELRNLRNRLRKARRRYFRLRNDDSKAVVRDLECQFNSLNASRFRS FT YISRLERNIKDDPKQFWTLIRNRTSTKAIPHTVNYLGDTSSTPLESASLFA FT SFFQSVLSNNSPPLVESYLNRLQSFDLNLPATAFTERDVCIKLREVDGCKG FT PGLDGLLPYFIKQCSSSLAVPATLLFNRSLEESTFPSRWKEASITPIHKTG FT NVHDVTNYRAISILNCLPKVFESMVLDFLYPAVRNIIAVDQHGFMKKRSTT FT TNLMSYVSLLVDLIDKRQQIDAVYIDFSKAFDRVPHQLAIEKLKRIGFPDW FT LTKWFSSYLSKRSASVKLGTVKSDPFCISSGVPQGSHLGPLLFVLFVNDLC FT SELESLKSMYADDMKIFRVVSSPVDCCALQADVDKVLIWCERNGMEVNIQK FT CNIITFSRIRAPILFEYSMQSCTLKRVSTIKDLGITLDNKLRFNEHISSVI FT AKAYAVLGVIIRNTRDFNDIYCLKTLYISLVRSLLEYGVVVWSPYHAVQID FT RIERVQKCFVRFALRRLPWRNRFRLPAYENRCALISLPTLATRRTYLQRLF FT VFDLLENNVDCPDILEKLRFNAPGRTIRRTDFFRRSAHRTQYGQHNPLDVC FT CQVFNDVYHMYDFNLCKSAFKSRISN" XX SQ Sequence 4089 BP; 1143 A; 883 C; 890 G; 1173 T; 0 other; cacaccataa ggtaaagtta aacgttgatt atctgttatg atcgttttct aatcagtgtt 60 ttcttgtgtt tcatatacct gattattgca atcgctgttt tctccgtgct gatcaacttt 120 ttccccgtgc ggtacatctg gagtattgct gtttccgcgc cagaataagt gattaaatta 180 aggaatttta tttcacatcg ttttccgtat tcaagtacac aaacaatggc ggtttcagtt 240 tgttgttcat gcgccggcga gttggaaaaa actgaagaaa tagtctgcaa tggtttctgt 300 aggtcgtctt tccatcttaa atgcgtgaag caaactgctg ccactcgtga tattgtcgcc 360 aagtgctccc aactcttctg gatgtgcaat gcttgcacca aaattatggc aaactctact 420 tttcgcaatg cactatcgtc agctaataat gctatggaag ccatccatgc tgagcaaaac 480 aatgcacttg tcgagctgag acaggagatg gagcaaaaca cagaaaaaat caacatgatt 540 ctacgtcaat tgcccactgc attgcaagag cgcactggca gaaagactag cacgtcaagc 600 acgtctgtcc ccagcagccg ccgtaaacgt ccccgaattg atgaagatgc gatccagacc 660 gaaatacgcg agaccgaagg aaccaaagaa attgattcaa gtgtgatgat accgctggca 720 gataggacga ctagtgaatc caaattctgg ttgtatctat ctggcttcaa tccaaatgct 780 actgatgatg atattcgaag cttggtgcaa cgaaatctga ataccagcga tgccgttgac 840 gtgcgaaagt tagttccaaa aggcaaaaat cttgacgaac tttccttcgt gtcgtttaag 900 gttggtatcg actcgcaact aaaggatatg gcgctggtat cctctacttg gcagaagggt 960 atcatattcc gcgaattcga tttccatcct agggctacgt ttcagtttca acagtaacag 1020 cagtaattat gttgaaacct tcttctcgcc gatacatgcc acttgttgtc aacacgttga 1080 tatcgcctca acgaatgcta ctactccgat cgcgctacaa tatcccgatc acgtgaattc 1140 gaataccgac ggcaaactga accgatctac gccctcgatc acggtttact atcagaatgt 1200 gagaggatta agaaccaaaa caaacgaatt atttgcagcc ttgtctgttt gcgattacga 1260 tatagtggtg cttacagaaa cttggctaaa taagaatgtg ttggatgccg agctaactga 1320 cgagtatgtg atacaccgtt gtgatcgtag tgcgtccact agccagtgcc tccgtggtgg 1380 tggggttctg attggtgtca aaaagaatat tcgatgcaat gttttcacta tcaacggggc 1440 tgatcgtcta gaacaggtag tcattcgagc tagtgttgga agcgtatcac ttgtggtgtg 1500 tgcggtttac ctccctccca atactgagct tggattatat gaacaacatg cagcgtgcat 1560 caatgaactt cacagaaaaa cttccattca tgacaggatt gtagttctgg gtgactacaa 1620 tttgcctctt ctccgctggt gtttcgataa cgacgtcgga tgcttcattc caacgaatgc 1680 atcctccgag caggaaatct ctttagttga gaacgttatc gcgtctggtt tacaacaagt 1740 aaactatcta acaaatgaaa acggaagact gctcgattta gcttttgtga attgcatgac 1800 catatgtgaa ataattgagc ctcctctgcc cttaatgaat gtggaccgtc atcatcagcc 1860 gtttgtgatg gtcattgaag atcaacggac tacgctcgag aatgagaaca actccaatgt 1920 agtttatgac tttaaccatt gcgattttga tgtattgaat gttgcaattt cacaaatcaa 1980 ttggcgagag atcttgaact ctggtacttt ggacgaagct gtggatagtt tctatggcgg 2040 attggattct atttttcgac aacacgttcc tttgaaaaga caatcacgac agcatggaag 2100 cagacgacct tggtggaacg gagagttgcg gaatcttcga aaccggctac gtaaggctcg 2160 tagaagatac tttcgactga ggaatgatga cagtaaagca gtggtgcgcg acttagaatg 2220 ccaattcaat tctttaaacg catcacgctt tcggtcgtac atttcccgac tagaaagaaa 2280 tataaaggat gacccgaagc agttttggac tttaattcga aaccgtacat ctacgaaagc 2340 aattcctcac accgttaact atcttggtga tacatcttca acgccgttgg agtctgcaag 2400 tctttttgcg tcgttctttc aaagtgtgct aagtaataac tcaccacctt tggttgaatc 2460 gtacctaaat agattgcaga gtttcgacct caacttgcct gcaactgcct tcactgagcg 2520 cgacgtgtgc atcaaactga gagaagttga cggttgtaag gggcctggat tggatggatt 2580 attgccgtat ttcatcaaac agtgttcttc ttcgttggct gtaccggcta cactactctt 2640 caatcgttcg ctcgaagaaa gcacatttcc atcgagatgg aaggaagcgt cgatcacgcc 2700 aattcacaaa actggaaatg ttcatgatgt tacgaactac agagcaattt ctatcctgaa 2760 ctgtctgccg aaagtttttg agagtatggt attagatttt ctttacccgg cggtgcgtaa 2820 catcattgca gtcgatcagc acggatttat gaaaaagcgc tcaactacaa caaacctcat 2880 gagttatgtg tctttactgg tcgatctaat agataaacga cagcaaatcg acgcagtata 2940 catcgacttc tcgaaggctt tcgaccgtgt cccacaccaa cttgcgatcg aaaaacttaa 3000 gcggattgga tttcctgatt ggctgaccaa atggttttct tcgtatctca gtaaacgctc 3060 tgcttccgta aagcttggaa cagttaaatc ggatccgttt tgcatttcgt cgggtgttcc 3120 tcagggaagc catctcggtc ctcttctctt cgtactcttc gtgaatgatc tttgcagcga 3180 attagaatcc ctgaaaagca tgtacgctga cgacatgaaa atttttcgtg tggtgtcctc 3240 tccagtggat tgctgtgcac tacaggcaga tgttgacaaa gttctgatct ggtgcgagag 3300 aaacggaatg gaggttaaca tacaaaaatg caacatcatt accttttcac gaatccgcgc 3360 tccaattctt tttgagtatt cgatgcaaag ctgtacgttg aaaagagtct caaccattaa 3420 agatctaggt atcaccctgg acaacaaact gcgtttcaac gagcacatat cgtcggtgat 3480 agctaaggcc tacgcagtac tcggtgtaat catcaggaac actcgggatt tcaatgatat 3540 ttactgcctg aaaacattat acatctcgtt ggttcgcagc cttttggaat acggcgttgt 3600 ggtctggagc ccgtatcacg ctgtgcagat tgaccgcatc gaacgtgttc agaaatgctt 3660 tgtcaggttt gcacttcgta gactaccatg gcgtaataga tttcgactcc ctgcttatga 3720 aaaccgatgc gcactcatca gtcttccgac gttggcaact aggagaacat atcttcagag 3780 gcttttcgtg ttcgacctac tggagaacaa cgttgactgt ccggatattt tggaaaagct 3840 tcgtttcaat gctccaggta gaacgatacg gcgtactgac ttctttcgac gatcggcaca 3900 tcgcacacag tatggacaac acaacccact cgatgtttgc tgccaagttt ttaatgatgt 3960 gtatcacatg tatgatttta atttgtgtaa atctgcgttc aaatctagaa taagtaatta 4020 agttttagtc tgtgcgatgt taaaatatcg aagactgtaa ataaataaat aaataaataa 4080 ataaataaa 4089 // ID CR1-88_HM repbase; DNA; INV; 4087 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-88_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4087 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1930-1930 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1086..3953 FT /product="CR1-88_HM_1p" FT /translation="MFFTETWFNDNSDTNIEGYQAIIRNRDGRSGGVAIYI FT KNNLTVSETNIEILQKPANIEQIWRIIKCQNDSVLVGCIYRPHDLDDNVLA FT QIINSIKAAKRSLESLKCSSLLLYGDFNFSHTTYEQLDTGGNSATAAHILN FT EHQTDIKFQKCLDECHLTQTITFPTYRNSLDVRPVSTLDLIITDEPDRLIF FT LEECEPLGRTPRGQAHCMIKGTYALPESSLQGTLTNPRFIWSRANFDMLSA FT YITAHDWENTVGLSSTRENYKILIEKYKEAVKLYIPSTTKPFKEKKEPWNT FT QSVNEAVNNKRDTWRKFLAAGSSLKDSIRLEHKLACKKVSKEVRKAVVDYE FT MGIAYVFKSDPIKLHNHVKHKKLVKDTIRTMEDLSGNITTDKNEVAKILND FT YFQSVFTTDPLDQLPNFEARTEVICDPDDSFITESNIRKRLESLKESKSMG FT FDGIHPRVLRNCAKAFARPLALIFRKSFNEGIVPDLWKRSNVSPIHKKDSK FT QKAANYRPVSLTSVPCKVMEGTIHETIMNHCIANDLITKEQHGFLKKKGCT FT SNLLETRDILTESVHQGSAVDVIYTDFAKAFDIVSHRRLLHKLQAYGIRGA FT LLKWIEAWLSNRQQRVVIDGAESEWKEVTSGVPQGSVLGPLLFIIYINDLP FT DNLQKHIKIYADDSLQKHIKIYADDSKIIRVIDNDDDVKSLQTDIDSTVAW FT SHEWQMSLNLDKCKVMHVGKAKNKSTQLYTMTGRNGIVHNLETTTLERDLG FT VLVSDDLKVCQQVESASARANSMLGRLKKAFRSRGLVLWKTLYTTYVRPHL FT EFAIQAWSPHLRKDIEKLEQVQHRATKLITEIKNDSYETRLQKLGITTLET FT RRTRGDLIQNFRIQNGFEVVSFFHKQQLAPSLSTYNLRGHNQRLERQNVNN FT CEERAHFFTNRVVNPWNALAPNAIRAQSVNAFKDHIDGKRASSVILLNDLS FT A" XX SQ Sequence 4087 BP; 1488 A; 877 C; 749 G; 973 T; 0 other; ataaaacata cagaacagaa gtgggcgcaa tgttcagaac agcacttacg atagtactgt 60 cagaaattca acaacaatac aataagcaaa tcaatgagtt gaaggaggaa attgtagcac 120 taaaaacagg aaatgaacaa acgacaacaa ttcgacctat taatcgacca actaatgctc 180 cggtttctta ctcatccacc gttaacctaa taggtagagt aaatactact cagcgtaatc 240 atcttcttaa cataactgca gccgaagtta atacccgaaa agagagagag tcaagtgcaa 300 ttatcgtggg tttagaagaa tctagggttt agaagaatct accgagcaag aggattcaga 360 aatcgttaat gaattccttg acagcattaa aattgatagg agtaacgtcg tacatattca 420 acgtttcaaa aacagaaccc aaacaaacgt ggactcttcc gcaaaaaaac aaatcggaat 480 tgtcaaagtc gttttcaaat cgtctgaata tcagaaggaa gccttaaaaa acggtaacta 540 tcgtaatatt tctaaattta aagacgtctt catcagagag gacagaacac cagctgagca 600 gtcggaattt aataaactgt tagcgaccaa aaaacgtaaa aacgaggaat tcaacaaaat 660 tgccgctcaa tctattaagc cctttcgttg gatcatccac aggcgcacta aagaagtcag 720 atgcattgac gcaggcgagt cagcgagcca aaaaaaacgc atctgggcct caaagccaca 780 actcgatcag atcagagccg atctcaaaaa gtttcaagaa ttgcaaaaat cccaaacccc 840 cgctgtcgcc accaatagat tcggcatcat ttcaatcgag gaagaagcct aaaacaaatc 900 aactcaaagc caacagcaca caaaaatcga aagctaaaca gtcgaatccg caacccacaa 960 ttcaccaaga atccgtaaat aaattgcact tctgggcaca caacccttgc tcactaaacg 1020 gttcaaaaaa ggacgaattg acagcacgaa ttcaatcaga aataacaaat aataaaccgg 1080 atatcatgtt cttcacggag acatggttta acgacaattc ggacacaaac atcgaaggat 1140 atcaggctat tatcagaaat agagatggtc gatcaggcgg agtagctatc tacatcaaaa 1200 acaatttgac agtatccgaa actaacatcg aaattctaca aaagccggca aacatcgaac 1260 agatatggag aatcattaaa tgccaaaatg attcggttct agtaggctgc atataccgac 1320 cacacgacct agatgataac gttctagcac aaatcatcaa ctcaatcaaa gctgcaaagc 1380 gcagtctcga gagccttaaa tgctcatctc ttctcttata cggcgatttt aacttttcgc 1440 acacaacata tgagcaatta gatactggag gtaactctgc tactgctgct cacatattaa 1500 acgaacacca aacagacatc aaattccaga aatgcttgga cgaatgtcat cttactcaga 1560 cgattacgtt cccaacttac cgcaattctc tagatgtcag accagtaagt actcttgact 1620 taataattac cgatgaaccc gatcgtctta tatttctcga agaatgcgaa cctttgggtc 1680 gcacacccag gggacaagcc cactgcatga taaaaggtac gtacgctcta cctgaatcta 1740 gtcttcaagg tactctcacc aacccacgtt tcatatggag tagagcaaat ttcgatatgc 1800 tctcggcata cataacagcc catgactggg agaatacagt aggtttgtct tcaactcgcg 1860 aaaactacaa aatactgatc gaaaaatata aggaggccgt aaaactatat attcctagta 1920 cgacaaagcc tttcaaggaa aagaaagagc cgtggaatac acagtcagta aacgaagcag 1980 taaataacaa aagagacaca tggcgaaaat tcctagctgc cggttcatca cttaaagact 2040 cgatacgcct ggagcacaaa ctcgcttgta aaaaggtctc aaaagaagtt cgaaaggccg 2100 tagtggacta tgaaatgggt atcgcttatg tcttcaaatc agacccaata aagctacaca 2160 atcacgttaa acataaaaag ctggtcaaag acactattag aactatggaa gacttgtcgg 2220 gaaatataac taccgacaaa aacgaagtcg ctaaaatctt gaatgattat tttcagtcag 2280 tattcacaac agacccactt gatcagctac caaatttcga agcacgtacg gaagttatat 2340 gcgatccaga cgattccttc attaccgagt caaatataag aaaaagactc gaaagtctta 2400 aagaatccaa atctatgggt tttgatggta ttcaccctcg agtgctacgt aattgtgcga 2460 aggccttcgc acgtccactg gctctaattt ttcggaagtc cttcaacgaa gggatcgtcc 2520 cagacttatg gaaacggtcg aacgtttcac ctattcataa gaaagacagt aagcaaaagg 2580 ccgcaaacta tcgaccagtc tctcttactt ctgtgccctg caaagttatg gaaggcacaa 2640 ttcatgaaac gatcatgaat cactgcatag caaacgatct cattacaaag gagcagcatg 2700 gtttcctcaa aaagaaagga tgtacatcga accttcttga gactcgagac atcttaacag 2760 aatcagttca ccagggcagt gcagtagacg tcatctacac agacttcgcc aaggcttttg 2820 atatagtttc tcatagaaga ctgcttcata aattacaagc gtacggcatt cgcggagctc 2880 ttttaaaatg gatagaggcg tggctttcaa atcgccaaca aagagttgtc attgacggag 2940 cagagtcaga atggaaagaa gtcactagtg gagttccgca agggtctgta ctaggccctt 3000 tattatttat tatttacata aacgatttgc cggacaacct acaaaagcat atcaaaatct 3060 atgccgatga cagcctacaa aagcatatca aaatctatgc cgatgacagc aaaatcataa 3120 gagttatcga taatgatgat gatgtaaaat ctttacaaac agacatagac agcacagtag 3180 cctggtcaca cgagtggcaa atgagtctaa atctagataa atgcaaagta atgcatgttg 3240 gaaaagctaa aaataaatct acacaactgt atacgatgac aggcagaaat ggaatcgtac 3300 ataatctaga aacaactaca cttgagcgag acctaggagt tctagtatcc gacgacttga 3360 aagtttgtca acaagtcgaa tcagcttcag caagggctaa cagcatgtta ggtagactca 3420 aaaaagcttt ccgcagtcga gggttagtac tatggaaaac cctatacaca acgtacgtga 3480 gaccgcactt ggaattcgca attcaagcgt ggtctcccca tctcaggaaa gatatcgaaa 3540 aactcgaaca agttcagcac agggcaacta aactcatcac agaaatcaag aatgattcgt 3600 acgaaactcg tcttcagaaa ctcggtatta caacactaga aacccgtagg acaaggggag 3660 atctcattca aaatttcaga atacaaaacg gtttcgaagt agtttcgttt tttcataaac 3720 aacaacttgc accatctcta tcaacgtaca atttacgtgg ccataaccaa cgattggaaa 3780 gacaaaacgt taataattgt gaagaaagag ctcatttctt cactaacaga gtagtaaacc 3840 cttggaacgc actcgcgccg aatgcgatca gggctcaatc cgtaaatgcc tttaaagatc 3900 atatagatgg aaaaagagca tcttcggtca ttttattaaa tgacctttca gcatagaggg 3960 gattggttat cacctcttca taaaatattt taacattttt tttaaaaaca aaaactctac 4020 taatcaattg ttattttact taaatatttt atttctgaaa taaaacaaat tcaaattcat 4080 atatata 4087 // ID Gyp1b_Cis_LTR repbase; DNA; INV; 508 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gyp1b_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-508 RA Smit A.F.; RT "Gyp1b_Cis_LTR - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000068 and Ci000091. XX SQ Sequence 508 BP; 149 A; 108 C; 107 G; 143 T; 1 other; tgtagcgacc tcatgtgctt acataagatt tacgatcgca cagccggcat ttgcgctgac 60 aatgcgtcat cactgaaata ttattattag gtttacgatt gttaccgcag ctgccggata 120 ctttgcctat ataagtttat ttcacttttg ctcgagagag tcctaacgca ttaagtaata 180 ttcttgtatg tggagaatac tgagagttta tctgaaatct atatcgttgc aacatctaac 240 gatttaactc gcaattggtn ataagttatt cgcctcaatt gctcaataca attgatgtta 300 taaaactact gaggcgttta tcggcgaact acaggagatc agtcaaagag acatttacgc 360 agaaaacctc aattaattag cgcacctccg agagtcgaga tagcgcgtag ggaatcgcag 420 atcgcatcag gtgttgtaac cggcttacct gggtgcctac aaggaacaat ctagcgaaat 480 aggcagcgac ctgcaacggc tccctaca 508 // ID Gypsy-4_PPc-I repbase; DNA; INV; 4893 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_PPc_; KW Gypsy-4_PPc-LTR; Gypsy-4_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-4893 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1000-1000 (2010). XX DR Genome; chrUn; Positions 23867806 23872698. XX CC Positions [2082-2618] - Reverse transcriptase CC Positions [3741-4199] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 280..1770 FT /product="Gypsy-4_PPc-I_1p" FT /translation="MEMTHGGVPSSSGTQPSTNGDVLPPGAPPGNTRHESE FT GVPRHSTEGVSRVKPDGVPGHDTEGAYKQNADISGSRSSSSDSKTSTLDTY FT STMRSASTLDVFNVDAPPQQAAPQAVHTTVELAPYQGKPPGAYLPGSDWRI FT WLKRLENFMKLRKVHNDSKKYVFLDEIGDLNYAILEGLLPGKELEEYSYDQ FT LKEAMTERFQPKLLVLSERFRLTQLTQKHTQNLAEFLAELQSAAKTCKFET FT VKDVRDAFVSLAFISGIKSDDTRKKLIEQNDKNSQELLSLAEAHERAGKGA FT VDIRHTGESVHGVDNAKRPTQVRFAPQNKSYHNGKSRTHHKSHDKGNRGFQ FT KKRNFSKSTTKPNSYNRGPSCTVCHKGGHTADRCWHKEKRPLKNAHGVFSD FT HDLSDNEVTLQYGIFAVNSVKGCEYPKCMITTELEEKTLKFELDTGSSLTI FT IGSQTWEELGKPSLERSSHSASAYSGTPLLFKGTLRALVKWKNLSRLLEID FT Y" FT CDS 1734..4280 FT /product="Gypsy-4_PPc-I_2p" FT /translation="MEEPESTTRDRLLEVLDRPAPALWGRDAIARFNMDLG FT PVYEKGIHQVTTNKVELASKVKEILGRNAEVFGKDLGKCTVAKATLKFKKK FT NPQPKFFRARPVPFALKPKVDETLDNMVSQGSLKLVDHADWATPLVVVPKP FT GGKVRLCGDFKVTVNPVLDINQYPLPKPDDMFHQLNGGKKFSKIDLKDAYT FT QVELDEESKKYLVLNTQKGLFQYQRLPFGEASAPAIFQKIMEQTLVGIPGV FT LAYLDDITITAPTDEEHLDRLNQVLTRMRNAGFRLSREKCEFLVEKMEFLG FT HIVDKEGIRSSPEKVKAMLEMPEPRNIKQVESFLGMINYYGKFIKDLSTLA FT APMNALRKKNADFIWGKEQQKAFMEIRKRLSETDVLTHYDPDTPVVLATDA FT SDYGIGAVIYHKYPDGNEKVIAYASRSLTKCEKNYAQIEKEALGIVYGVDK FT FSQFLYGRKFTLLTYHQPLVRIYGPKHELPVIAAKRLHRWGLKLMMYSFDI FT EYRNTEEFGNADGLSRLPQETELPTIQSVKDDDEITEWDKKTLQCLPISAS FT SLVEETQKDPILKEVKIKMRKGFRPKEKDIEMKIFADKKEFLSQGDGYLMM FT DGRVVIPTKLRLTILKNLHANHYGVARMKALARMKVWFPRIDTAIEKMAKS FT CPVCAVLGNGLTKTPLHPWEIPERPWQRVHIDFAGPFQGHMWLVAVDAKTK FT WPEVKKMSNITTKATTQTLQGIFATHGIPEQIVSDNGPQLTAKEFKEFCAA FT QNIEHILTPPYHPNSNGEAERFVQTFKRGVEKGIRGGKKVDDILCGLLQEY FT RATPHPATGMSPAQMLMGRQIRTNLLPVPARTKERENKSISEKYQEQ" XX SQ Sequence 4893 BP; 1631 A; 1047 C; 1159 G; 1056 T; 0 other; ctaactggcg ttcagtcttc gagctacgcg gaggatatct acagtaaccg aaggcagttt 60 ggatcaattc ccgatacccc ctcgaattac tgtagtattt cttggagatt gatctccacc 120 ccccgtatca aaggctagta ccacatatac ccagataccg tacaccgaga tgagccagcg 180 cactcagtca acaccgaaat cgagttcagc caagtcagcg tcgaggacac cgtctcgccc 240 gaaggagagt cggtattccc ctcgacccct accgtatcca tggagatgac ccatggcggg 300 gtacccagta gctcgggtac ccagccaagc acaaatggag atgttcttcc tcctggtgct 360 cctcctggta ataccagaca tgaatctgaa ggcgttccca gacatagcac cgaaggcgtt 420 tccagggtaa aacctgatgg cgttcccggg catgatactg aaggcgcata taaacaaaat 480 gctgacatat ctggctctcg aagctcatcc agtgattcta aaacatccac cttagatacg 540 tacagtacta tgagaagtgc ttcaacactg gatgtgttca acgttgacgc tccaccccag 600 caggcagcac ctcaggcggt gcatactacg gtagaattgg ccccctacca aggtaagccg 660 ccaggtgcat acctacctgg aagtgattgg cgcatctggc tcaagcgtct agagaatttc 720 atgaagctta ggaaagttca caacgattct aagaagtatg tgttcctaga cgaaatcggt 780 gatttaaatt acgctatcct agaaggactc cttccaggaa aagagctaga agagtattct 840 tatgatcagt tgaaagaagc gatgacagaa aggttccagc ccaaactttt ggtgctatca 900 gaaaggttcc gattgacgca actgactcag aagcatactc aaaacttggc agagttcctt 960 gcggaactcc agagcgcggc aaagacctgt aagtttgaaa ctgtgaaaga tgtcagggat 1020 gcatttgtct cattggcatt catctctggt atcaaatcgg acgatactag aaagaaacta 1080 atagaacaga acgataaaaa ttcacaggaa ttactgtctt tagccgaagc tcatgagaga 1140 gcaggtaaag gtgcggtaga catccgccac accggagagt cggtccacgg agtggacaac 1200 gcgaaacgac ccactcaagt tcgtttcgcg ccccagaaca agtcgtatca caatggcaag 1260 tcaaggactc atcacaagag ccatgataaa ggtaatcgtg gtttccaaaa gaagagaaac 1320 ttctccaaaa gtacgaccaa acccaattcc tataaccgag gtccgagctg taccgtatgc 1380 cacaaaggag gtcacaccgc ggacagatgc tggcacaagg agaaaagacc cttgaagaat 1440 gcccatggag tattttctga ccatgacctg tctgataatg aagtgactct acagtatggc 1500 attttcgcag tgaacagtgt gaaagggtgt gaatacccta agtgtatgat aacaacagaa 1560 ttagaagaaa agacactgaa gttcgaacta gatactggat ccagtttgac tatcataggg 1620 tcacagacat gggaggaatt gggtaaacct tcgttggaac gatcaagtca ttcggctagt 1680 gcttactcag gaacgcctct tctgttcaag ggaacgttga gggcattggt caaatggaag 1740 aacctgagtc gactactaga gatcgactac tagaggttct ggatcgtccg gcacccgccc 1800 tgtggggccg tgacgcaatt gcaagattca atatggatct aggcccggtc tacgaaaagg 1860 gaattcacca ggtcaccaca aataaagtgg aactagctag caaggtgaaa gaaatcctag 1920 gaagaaacgc tgaagttttt ggaaaggatt tggggaagtg cactgtagcg aaagctaccc 1980 tcaaattcaa aaaaaaaaat cctcagccca agttctttcg agcaaggcca gtgccatttg 2040 cactgaaacc taaagtagac gaaacattag ataatatggt ctctcaagga agtttgaaat 2100 tagtggacca cgccgattgg gcgaccccat tagtggtcgt accaaaacca ggcggaaaag 2160 tgagactgtg cggagatttc aaggtaacag taaacccagt actggacatc aatcaatatc 2220 cgttacctaa accagatgat atgtttcatc aacttaatgg tggaaagaaa ttctccaaaa 2280 tcgatctcaa agacgcatac acacaagtgg agttggacga agaatccaag aaatatctcg 2340 tattgaacac acagaaaggc ctatttcaat accagagact tccatttgga gaggcgtcag 2400 caccggcaat tttccagaaa ataatggaac aaacgctggt gggaattcca ggagtgctag 2460 cgtatttgga tgatatcacc attacggcac ccactgatga agagcactta gatagattga 2520 atcaagtgct aacaagaatg agaaacgcag gtttccgtct tagcagagaa aaatgcgaat 2580 tcctagtaga aaagatggaa tttctcgggc acatcgtgga taaagaaggt atccgatctt 2640 ctccggaaaa agtaaaagca atgctagaga tgccagaacc acgaaatatc aagcaggtgg 2700 aatcgttcct aggaatgatt aattactacg gtaaattcat aaaggacctt tccacactag 2760 cagcgcccat gaatgcactg cgtaagaaaa atgcagattt catctgggga aaggagcagc 2820 aaaaggcttt catggaaata cggaaaagac tgtctgaaac agatgtgctg actcattacg 2880 atcccgacac tccagtcgta ttagcaaccg atgcatcaga ttacggtatt ggagcagtga 2940 tttaccataa gtacccagat ggaaatgaaa aggtgatagc atatgcatca cgatcactca 3000 cgaaatgtga gaagaactac gctcagatag agaaagaagc cctgggaatt gtctatggag 3060 ttgacaaatt ttcgcaattt ctctatgggc gaaagttcac gttactgaca taccatcagc 3120 cgttagtcag aatctatggt ccgaagcatg aacttccggt aatcgcagca aaaagactgc 3180 accgatgggg actgaaattg atgatgtatt cattcgatat tgagtatcgt aacacagagg 3240 aatttggaaa tgctgatgga ttatcccgac tcccacagga aactgagtta ccaaccatcc 3300 agagtgtaaa ggacgacgac gagataaccg aatgggataa aaagacgctt caatgtcttc 3360 ccatatcggc gtcaagcctg gtcgaagaaa cacagaaaga tcccattctc aaggaggtaa 3420 aaatcaaaat gagaaaaggt ttccgcccaa aagagaaaga tatagaaatg aagattttcg 3480 cggataagaa agaatttctc agccagggag atggatatct catgatggat ggcagggtcg 3540 tgattcccac gaaattgaga ctgacgatac tcaagaactt gcacgcaaac cattatggtg 3600 tggcaagaat gaaggcatta gcacgaatga aagtgtggtt ccccagaatt gatacagcca 3660 ttgagaagat ggcgaaatca tgccccgtat gcgcagtgtt agggaatggt ctaacaaaga 3720 ctcccctgca cccgtgggaa ataccggaaa gaccgtggca aagagttcac atagatttcg 3780 cgggtccatt tcaagggcac atgtggttag tagccgtaga tgccaaaaca aaatggcctg 3840 aggtaaagaa aatgagtaac attactacga aagctacgac ccaaacactc caaggaatat 3900 ttgctacaca tggaatacca gagcaaattg tgagtgataa tggaccccag ctgactgcca 3960 aagaattcaa ggaattttgt gcggctcaga acattgagca tattctcact cctccctatc 4020 acccgaattc gaacggtgaa gctgaacgtt tcgttcaaac cttcaaaaga ggagtggaga 4080 aagggatccg gggaggaaag aaggtagacg atatcttgtg cgggttgtta caagaatatc 4140 gggctactcc tcaccccgca acgggaatgt ctcccgcgca gatgctgatg ggacgtcaaa 4200 ttagaacaaa cttgttgccc gttccggctc gtaccaaaga acgagaaaat aagagtatca 4260 gtgagaaata ccaagaacaa tgaaaaagaa ccacgataaa accactgtct cacgaaattt 4320 cgaggtagga gaaaatatct tcatggctaa cccgagagat agtggagatc gttggattcc 4380 cggtatcatt accaaaaagg tgcgaaacaa tcagtttgag attcattttg gaaatggtca 4440 gagagtttgt cacgcaaacc agctaaagaa gaggacccaa atagtactat gggaaaacta 4500 tgaagacttc atgctgacca agaaaactac gaaaaaagga gtgaaaagac cagaaccggt 4560 gatcagaaga gaggttgtca aacgcccaag tatgacacca gttcaacccg aaagggcagg 4620 aaaaatacca agaatcgcac cacaagaagt tcagaagaaa ccagaagtta cggtaatcag 4680 acagtcgtca agactgcaac agaaggaaaa agtagattat cgatctattg caagtggaaa 4740 tgcaaaaaac cccagtttct aataagattc tgaagaataa tcaatttcca ctcgtactat 4800 gcaaatgaaa tttttaatca aaggatcaca gttccaaggg ataggggatc ttgggaaaac 4860 agataaagtg gaagagacca cttggggagg agt 4893 // ID BEL-615_AA-LTR repbase; DNA; INV; 595 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-615_AA_; KW Pao_Bel_Ele51; BEL-615_AA-I; BEL-615_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-595 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 595 BP; 218 A; 105 C; 98 G; 174 T; 0 other; tgttacgatg agacccctcg tagaagcggc gcctctacaa cagtgccgtc aaattcatgc 60 aaaccgacga aacgtcaagc atccacgtca acgactgctg aactgtatgg aaaggcgtaa 120 gagaagaatg tgatgtgctc aaaaaccaca tgaagtgaag actccaaatt attgaattat 180 tctagtttat attgaaaatc agtgaattga aatcttttat atatatatat ttattcacac 240 catagtttcg aaagcatata ggtaaagacc gaatcattta ttaaatctgt acaagataac 300 ttacttctaa attttagttg tattgattgc actaataacc taaaaaggtg tcccatcatt 360 cgctaatttg taattcctaa cgaattagtc aaatagtagg tatgatatat gaagattcaa 420 tgaaatattg ctaaatttgg acttaaattt tagataggct tactagtaga ccggctgaag 480 cgaaagaatc actctgtgct tgcatagatt caccaaaatt tgtgagtaaa ccaaccaaca 540 gccaaaaact atgaactaat ctaaataaaa tttcagcttg aagcttacta tcaca 595 // ID Gypsy-30_DWil-I repbase; DNA; INV; 4497 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_DWil_; KW Gypsy-30_DWil-LTR; Gypsy-30_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4497 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 1825705 1830201. XX CC Positions [3157-3426] - Integrase core CC 'CATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 97..3426 FT /product="Gypsy-30_DWil-I_1p" FT /translation="MQTPRAPATDEKGKHVTLPKFNPEGAGADATAWCTTV FT DLIFADKMPEGSTLMITLSKALEGSASQWLSQICFAGITWPQFKELFVQRF FT VGIETSAAILMKILNGRPKSGESYTEYGSRIVTLIMSKFKSNDLEEIAVSL FT ALAHMAQMDDKLSRLVFTNNIKTRNEMQQQLQAHTFKKRNLDDDSSTTGPE FT RKKPKYLSQVKCNYCGKSGHKYAECRARQRTPIQGKSHYGSSSQGSKDRST FT VKCFKCDELGHFASACPKGRPRGNDRVIEKRVDICTVAPPSGTICSSTGES FT IPFCFDSGAECSLVKESVADKFSGKRINNIVKLNGISNDSICSTQQLLCNI FT SIDQCYFEILLHVIMDKYLKHDVLIGREIFSQGFGVTMDADKFQLYKTKRV FT EALIISNYEDVIESNSELDDLDKAALINILKYYSEWFIDGIPKTRVTTGHL FT EISLIDQQKTVQRRPYRLSEEEKNKVRGKVDELLKAQVIRPSCSPFASPMM FT LVKKKNGTDRLCVDYRMLNDNTVADKYPLPLITDQIARLRGAKYFSCIDMA FT SGFYQIPVHEDSIERTAFVTPEGQYEFLTMPFGLKNAPSVFQRAIMKALGA FT LAYTYAIVYIDDVLIIAESKEDALVRLRTVLETLSKAGFSFNVTKCSFLTT FT RIEYLGFELSNGEIRPNSRKIQALNGLRPPQNVSQLRQFIGLASYFRQFVL FT NFSQTLKPLYLLTSKKNTDFVWLPVHEQIRTKIITILTQTPVLIIFDPQYP FT IELHTDASCIGYGAILLHKIDNKPHVVEYYSKCTSPAESKYHSYELETLAV FT VNAIKHFRHYLQGRKFIVYTDCNSLKASRTKLDLTPRVHRWWAFLQAFDFE FT IEYRKGERMAHVDFLSRNPVLEQHKRPNKIPELRINLTEVSDNWLVSEQQR FT DSEIQEIILKLKSNELPGNMDKTYELRKGILHRKIQRNSKTKCLPVIPRAF FT RWAVINHVHEAIMHLGWEKTLDKVYGYYWFENMSKYVRRFVENCITCKVSK FT PLVGKVQVELHPIPKIEIPWHTIHIDITGKMSGKNDLKEYVIVLIDAFTKF FT VYFHHTLNIDSESCIKAVKSAVSLFGLPTRIIADQGRSFASKSFREFCSSQ FT KN" XX SQ Sequence 4497 BP; 1572 A; 781 C; 930 G; 1214 T; 0 other; tcagaagtgg gatctgcctt gcctacaaat accaccgatg catacttggc agctttacaa 60 ctgcaaaaca acaacctttt ggagatcatt aaaaatatgc agacaccaag agctccggca 120 actgatgaaa agggaaaaca tgtaactctg ccaaaattta atcctgaggg agctggagca 180 gacgccacag cttggtgtac gacagtggat ctcatatttg ctgataaaat gccagaagga 240 agcaccctga tgataacttt aagtaaggcg ctagagggaa gcgcatcaca atggctgtcc 300 caaatatgct ttgctggcat tacatggcca caattcaaag aactctttgt gcaacgcttt 360 gttggcatag aaacgtctgc ggctatactg atgaaaattt tgaacgggcg accaaagtct 420 ggagaaagct acaccgagta tggcagccga attgtcaccc taataatgtc aaaatttaaa 480 tctaatgatc tggaggaaat cgcggtttcc ctagctctgg cccacatggc acaaatggat 540 gacaaattgt cgcgcttggt ttttacaaac aatattaaaa cccgaaatga aatgcaacaa 600 caactacaag cgcacacttt taagaagcgg aacctagatg acgactcatc tacaacaggc 660 ccggagcgta aaaagcctaa gtatctctct caagtgaaat gcaactattg tggaaagtct 720 gggcataaat atgctgaatg tcgtgctcgg caaaggacac caattcaagg aaaaagccac 780 tacggaagtt catcgcaagg atcaaaagac cggtcaactg tcaaatgttt caaatgcgac 840 gagcttggac acttcgcatc cgcttgccca aaaggacgac cacgaggaaa cgaccgagta 900 attgagaaac gtgtagacat ctgtactgta gctccgccat caggaacaat atgcagttca 960 accggtgagt ctattccatt ttgtttcgac tccggagctg aatgctcctt ggttaaggaa 1020 agcgtcgccg ataagtttag tggtaaaaga attaataaca ttgtaaagtt aaacggtata 1080 agcaatgatt caatatgtag cactcaacaa ctattatgta atataagtat tgatcaatgt 1140 tatttcgaga ttttgttaca tgttattatg gataagtatt tgaaacatga tgtactgatt 1200 ggtcgtgaaa tttttagtca gggctttggc gtaactatgg acgctgacaa atttcaactt 1260 tacaagacca aaagagttga agcgttaata atttccaatt atgaagatgt catagaatcg 1320 aatagtgaac ttgatgatct tgataaggca gccttgataa atattttaaa atattattca 1380 gaatggttta ttgatggaat accaaagact agggtcacca cagggcattt ggaaattagt 1440 ctaatagatc agcaaaagac agtacaacgg cggccatata ggcttagtga agaagaaaaa 1500 aacaaagtgc ggggtaaagt tgacgaatta ttaaaggctc aagttattcg cccaagctgt 1560 tcacctttcg ctagtcccat gatgctagtt aagaagaaaa atggcaccga tcgactgtgt 1620 gtagactatc gcatgctaaa cgacaataca gttgctgaca aatacccgtt gcccctcatc 1680 acagaccaaa ttgccagact tcgtggtgct aaatattttt cgtgcattga tatggcaagt 1740 ggcttctatc aaattccagt tcacgaggac tcaatcgaac gaactgcatt tgtgacgcca 1800 gaaggtcaat acgagttctt gacaatgccg ttcggtttga agaacgcacc atctgtgttc 1860 cagagggcta taatgaaggc cttaggagct ctagcttaca cttatgcaat cgtgtatatt 1920 gatgatgttc tcataatagc agaaagtaaa gaagatgcgc tagttagatt acggacagta 1980 ttggagacat taagtaaggc tggattttca tttaatgtca ctaagtgttc ctttctcaca 2040 actagaatag agtatcttgg atttgaatta agcaatggag aaatcaggcc aaattcacga 2100 aaaattcaag ctttaaatgg attacgacca cctcaaaatg tttcacagct aagacaattt 2160 ataggtttag cttcttattt tcggcaattt gtgttaaatt tctcccaaac acttaaaccc 2220 ctgtatcttc tcacctctaa gaaaaacact gactttgttt ggttaccagt gcatgaacag 2280 atacgaacaa aaataataac tatattgaca cagacgccag ttttaatcat attcgatccg 2340 caatacccaa tagagctaca tacagacgca agttgtattg ggtatggagc gatcctgtta 2400 cacaaaattg acaataaacc tcatgtggta gagtattaca gtaagtgtac gtctcccgcc 2460 gaatctaaat accattcata cgaactggag acgttagcag tagtaaatgc tattaaacac 2520 ttccgacatt atttgcaggg acgtaaattt atagtttata ctgactgtaa ttctttaaag 2580 gccagtcgca caaaattaga tttaacaccg agggtgcata gatggtgggc gtttttacag 2640 gcatttgatt tcgaaataga atatagaaaa ggcgagcgaa tggcgcatgt tgactttctc 2700 tctcggaatc cagtgcttga acaacataaa agaccgaata aaatccctga attacgaatc 2760 aatttgacgg aggtatctga caactggttg gtctcggaac aacagcgaga ttccgagata 2820 caagaaataa ttttaaagtt aaaaagtaat gagttaccag gaaatatgga caagacctat 2880 gaattgcgaa aaggaatact acataggaaa atacagcgaa atagtaagac caaatgtttg 2940 ccggtaatac caagagcttt tcgatgggcc gtcataaatc acgtccatga agctataatg 3000 catctgggtt gggaaaaaac gctcgacaag gtctatggtt attattggtt cgaaaatatg 3060 tccaaatatg tgcgtaggtt tgttgaaaac tgcatcacat gtaaagtctc taagccactt 3120 gtgggaaaag tgcaagtcga gctgcatcct attcctaaaa tagaaatacc atggcatacg 3180 atacatattg acattacagg caaaatgagt ggtaaaaatg atttgaagga atacgtcatc 3240 gttcttatag atgctttcac taagtttgtt tattttcatc acacactaaa tatagattct 3300 gaaagctgca ttaaggctgt aaaatcagcc gtatctttat ttggtttacc cactcgaatt 3360 atagcagatc aaggacgcag tttcgcaagt aaaagtttcc gcgaattttg ctcatctcaa 3420 aaaaattaat ctacatttaa ttgctacggg tgccagtaga gcaaacggtc aggtagaacg 3480 tgtcatgagt accctgaaag gtatgctgac tgctgtcgaa accagtcaga gatcatggca 3540 ggatgcgttg gctgaagtgc agctagcaat gaattgcacg attaatcgtg tgactaaatt 3600 tagcccttta gaactattga ttggaaagga agctcgacct tttggcttgt tgccgataaa 3660 cgaagataac aatgatgaag tagacaaaga aaatttgaga aaggaggcta aaaagaatat 3720 ggaaactcaa gcaaaatatg acaaaaatag gtttgataaa aataaagcta aaatagtaaa 3780 tttcaaagtg ggtgatcacg ttttgctcaa aaacgaagaa cgtcatcaaa ccaaacttga 3840 tccaaaattt aaagggccat ttgagataat tgaaacattg gatggagaca gatatttact 3900 gaaatcttta acatgtaatc gaaagtataa gtattctcat gaatgtttaa aagcaattcc 3960 caataatcag atactcgatg agttatgtga agatagagac agcacaatgg aagttgcatc 4020 tgttgtccaa tatgataaat gaagactgca ccatagcaga aaagacatgt tatggcgggg 4080 gtcgtatgta tgtgtagaag gaattacaca aatgattagt ttatatgtgt agaaggaatt 4140 acacaaatga taagtttata tgtgtagaag gaattacaca aaagataagt tagttatgtg 4200 taaaaggaac tacacaaatt tgtttggtta aaaaaaaaaa aaacaaaaaa aaaaaaaaaa 4260 aaacaatatg ttaaaggaat tacacgaatg ttatgatttg ttagattaaa attatgttac 4320 gaaaatgtaa ttatgctatg acaagaatga attttaaact gatttgaatt gtgttacata 4380 atgaataaga atatcgaata aaaatgtatt gaaataccct ttgaattctg aagcgttgtc 4440 acttatgtgg ttgagttaag acaacacacg aggacgtgta tattgtcagg aaggccg 4497 // ID Mariner-1N1_BF repbase; DNA; INV; 1265 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-1N1_BF DNA transposon DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW non-autonomous; TA TSD; Tc1-1N1_BF; Mariner-1N1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1265 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1265 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-1265 RA Kapitonov V. and Jurka J.; RT "A family of Mariner-1N1_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX SQ Sequence 1265 BP; 377 A; 252 C; 264 G; 372 T; 0 other; cactcggcac aaaaagtttg gaatatcaac tttggtatat catatttccg ttatttatgc 60 atcaatttca atgtattata tatcaataga aagcttgtgt gattttcttt cctatggtat 120 caaacttatt atgattgtaa acacatgaaa cgagcacaag gcctgcttac gtgagtgggt 180 cacgaaaaaa aagtgcccaa attcccctac ttctgttcaa cactgcatcg tcctgttgat 240 attggcgcgt gtgtcactgg ttcaatcaat cctcttttca cctcacattt tgtatacgga 300 acatacaagg ttttagtaca cttgagtaga gttcatctgc cctttcgata gttggatggc 360 agaatcaaag cgtggatgcg gcaaggagaa ccgcacgctg acaaggcaag agtaactttc 420 ggcatttggt ggtggaggca gcgtcatggt gtggggcggt ataaccacca tagggtaaac 480 aagactcgtc atcattccag gcaaattaat gtggtgacat atagggacac catcctccat 540 ccagtagcaa cattctacct cttcaatatt ggaccaaatg ccattctgca agattacaac 600 accagtccac acagagcaag gaccattgga tgctggagca cagtcttagg gtggaatagc 660 cttcctacgg cttatggcta gcactcgatt gatcacctct tgccatggtt cagatttttt 720 gcttgaataa agagtaatag tagctttgat tgacatgtcc tgtttattca acttgccttc 780 attcacaagt caatactgac ggaagaaaat gagcagaatg gagtgattga ctttagtagg 840 gtaattttgg cactttatct ttaggacccc ctgtacgtaa gcgggcctag tcctcgttcc 900 atgaatttac actcataaaa tactctactc agatatgcta aaaagaaact tggatgttct 960 gtataccaaa cgttaggtga taaaaggatt gattgaatca ttgtcacccg cggccataac 1020 aacggttgaa ccggacgata cagttttgaa cagaaacggg ggaatttggg cacttttttt 1080 tcgtgaccca ctcacgttag caggcctagt gtatgttcca tgagttaaca atgataatga 1140 gtttgatacc attggaaaga aaatcatgca agctttctat agatatataa tatattgaaa 1200 ttgatgcaga aacaacataa atatgatcaa ccgaagttga tattccaaac tttttgtgct 1260 gagtg 1265 // ID ITmD37E-N1_AAe repbase; DNA; INV; 944 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous ITmD37E DNA transposon family from Aedes DE aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; nonautonomous; KW ITmD37E-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-944 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1285-1285 (2011). XX DR [2] (Consensus) XX CC ~93% identical to consensus. TA TSDs. TIRs are 27 bp long. XX SQ Sequence 944 BP; 318 A; 172 C; 186 G; 268 T; 0 other; aggtgtggcc cacttagaat tggaacaaac atctttggtt cttacaacga ctctagtggt 60 ccaatttaga agctgaaccg tccatgtgtg ctctataaca ataataacat cacatgattt 120 ttttcgcagc ttaattcgaa cgcacttctg agaaattcaa catggaagaa gagagaaggg 180 aaataattgc gcacaaaaat ttgaaaaaac cgtcttgatc aggaatagaa attgcaaaag 240 tgttaggttt ttcaataaat actgtgtggc tcgtactgaa taagtacgaa gtaactattt 300 tgacagatcg caagccacaa aggaagcgtc gggcagttac tgtgcaccgt cagctgtgga 360 ttaaaatcag gaccattgat tcaaatccaa acattttaga ccgcgatttg gccaaaaccg 420 aactttttgc cgtgcagtac cgtgcaatgg attcgacttc ggatattgaa gcctccgatc 480 ataccatgtc tgcagacaac caaataggga acaaaattag agaaatgtgg ccaaattccg 540 taccatgaag ttgtttgatc aagtgttgac ggaatttaag gattttttca tgataggtga 600 tacccaccta attgacccca actcagccca attaagagat tttgggctat tatgaagcgg 660 aagctaagag cgaatcaaag tatcgccaaa acatcgacga actgtaaact aaatgggatg 720 aacttgtcaa aaatatggat gaaaatagtg tgcgcaaatt aattaaccgt attccgggaa 780 aagtccgaca atttcttata aaaggggatg aatatttttt tcttattttt cctttgaagc 840 ttacaagcaa tgcaacattt gagttaaaac agaattcatt tttgttgaat tatatccaaa 900 ttattccaat ttgtgtatgt tccaattcta aatgggccac acct 944 // ID Gypsy-128_AA-I repbase; DNA; INV; 6675 BP. XX AC supercont1.366; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-128_AA_; KW Gypsy-128_AA-LTR; Gypsy-128_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6675 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.366; Positions 585916 592590. XX CC Positions [5024-5500] - Integrase core CC 'ACAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 486..2681 FT /product="Gypsy-128_AA-I_2p" FT /translation="MDLQTMYASMDVSHLSVDEVEYELLIRNILFHFDEHE FT SIKRRKLKDKLKSEKELKTFAFSQPWRSLDEELVTISLKLKVIGGLLENPK FT TDARQRQKLKTRLVHYRVRNYILSKASGADKCRNEIVKVGRQATELFRRFF FT PEVNEADVHSEDSERLESDLNNVLEEVRSEIEILNETTASGKEIEEHLEQA FT DLPNQVLESKKKEMNESVRRSEEILKVLSGYEEGKKENPLELIAVFKSFVQ FT QTTEQQKQMREKQIAEEERKIREAKDSMERKKRLEKVLITLNDRLKREPES FT VEEKNPVRDLEQEQTKDLFENRKKGEKTISFNYDSDDYRNYRRKDSSEDSR FT EKTGRKYKKESRASNMPHKSSKRGKKKRHRRPSFSATSSTTSGTESSEFSS FT SESSTDSSKEWKGRRERREGRRNRDLKRIPVAEWRLKYDGKDQGRKLTEFL FT KEIKMRCISEEISERELFRSAIHLFSGRAKDWFIEGIENRDFRNWHELKKE FT LKREFLPPDLDFQLEVQATERRQARGEKFTDYLHDIMKIFHSMTRPISERR FT KFDIIWRNLRFEYKNAMTGAGVRSLSKLKKYGRIIDENNWTMFHKSHDYMA FT RGRNTQVSEVSATDSSKSKAPFQNPNSASRVFTRSKSKNEPTEKIKPLSET FT KTSGERKEEKSADPVEGSSKGTLIAMAENYRRPPIGVCYNCRLSGHHYADC FT PKPKGKFCRICGFGDVITPTCPVCQKNEAFSA" FT CDS 2750..5887 FT /product="Gypsy-128_AA-I_1p" FT /translation="MSGFEPVSENDYSPVSNVDELFVRVDGDTRPFVKVSV FT LGREIIGLLDSGAHRSVLGAGCRKLIKLLKLKMFPSDTHVQTASGSTVDVE FT GFVYLPVTFNNENRIIKTLVAPKLKRRLILGFDDFWRAFQIQPVVLSRESR FT KKEKEFEESLRVEELEWKEGGVKEETLSETQKGQLEKVKLKFKIAIEGQVL FT DVTPLATHRIDLKDEFKNSPPIRINPYPTSPEMQKRINKELDNMLQQQVIE FT HSKSEWSQSTVPVIKPTGEVRLCLDARRLNDRTKRDAYPLPHQDRILSRLG FT ASKYLTTIDLTKAFLQIPLHPDSRKYTAFSVLGRGLFQFTRLPFGLVNSPA FT TLARLMDDVLGHGELEPNVFVYLDDIVVVSDTFEAHVQTLSEVARRLKAAN FT LSINLDKSRFCLKELPYLGYILSPEGLKPNPDRIEAIVNYERPASLKSLRR FT FLGMTNYYRRFIPGFSEHSAPLTDLLKKKPKTLLWNSKAEQAFLNLKESLI FT ASPVLANPNFDLPFQVQSDASDSAIAAILTQQHEEGEKIIAYFSQKLSPAQ FT KAYAASEKEGLAVLCAIEKFRPYIEGTHFTVVTDASALTHIMKGKWRTSSR FT LSRWSIELQGYDMEIRHRRGKDNVIPDALSRSVELAEILSDQNVWYSDLFK FT KVECSPEDYLDYKIENQQLLKLVPNKTEVLDYRHEWKLCIPESDRERILHH FT EHDESLHIGYDKLLDKLKARYFWPKMAEHAKKYVGRCQVCKECKPSTISQH FT PTMGSPRLATKPFQILAVDFIQSLPRSKAGNMHLFVLLDVFSKWTVLVPVR FT KISAELIVKILEEQWFRRYSVPEIIITDNATSFLSNDFKTFLAKYEVKHWA FT NSRHHSQANPVERLNRSINACIRTYVKSDQRQWDTKISAIEHTINNTLHSS FT TNFSPYRVLFGHEIITTGQEHRRDPDTIDISEKERNEQRLKVDDVVHALVR FT KNLEKAHDRSTRAYNLRFRQPAPVYQIGQQVYKRNFAQSSAGDNYNAKLGP FT AYVPCTIVSRRGTSSYELADESGKNLGVFSAADIKPGISN" XX SQ Sequence 6675 BP; 2159 A; 1295 C; 1524 G; 1697 T; 0 other; attgacgccc aactaaataa gctctggtca aggattcaag cagccactct tccggagtga 60 acagaacgat aggttatcaa attaatgaaa ttcacgttaa atacgtggtt gtgaagaaaa 120 ttaggttatc aaattctaac aaaaggtttc tgaaaaatca aattagcaaa ggcagtgaaa 180 tcaaaccggt agcaccaatt tgttctcgcc gataacgtga aaagtttcta gaaagcatac 240 tactagtgcg cgtaccctca caaaagccca cagtgttgct gtgtattcac gaaatagcga 300 ttaattagaa aacgattgat atttcaatta aattcttttg ttaatgggtc ttgattcgac 360 taaataacct ccattgtgat tttttcattg aaagtgttac ttgcgatttg tattgtgcat 420 gaataaagcg attatctaca caatagtctg cattgatttc ttgaatagct ccattcaacg 480 aaacaatgga cctacaaaca atgtacgcgt ccatggatgt ctcccatctt tcagtcgatg 540 aggtagaata cgagttacta atcaggaaca ttttgtttca cttcgatgaa cacgagagta 600 tcaagcgaag aaaactcaaa gataaattga aaagtgagaa agaattgaaa acgtttgctt 660 tttcccagcc gtggcgaagc ttggatgagg aattagtcac gataagtctg aagctaaagg 720 tcataggtgg tttgctcgag aaccctaaaa cggatgctag acaacgtcag aagttgaaaa 780 cgcgtttagt tcattaccga gttcgtaatt atatcttgtc taaagcttca ggcgcggata 840 aatgcagaaa tgaaatcgta aaagtgggac gacaagctac agaactattt cgtagatttt 900 ttcccgaagt gaatgaggct gacgttcatt cagaagattc cgaacggctt gaatcggatt 960 tgaataacgt gctagaagaa gttaggagtg aaatcgaaat cttgaatgaa acaaccgcgt 1020 ctggtaagga gatagaggag catttagagc aggcagattt gcctaaccaa gtgcttgaat 1080 caaagaagaa agagatgaat gaatcggtta ggaggtcaga agaaattttg aaagtacttt 1140 cggggtatga ggaaggcaaa aaggaaaatc cgttagagct tatcgcggtt tttaaatcgt 1200 tcgttcagca aacgaccgag cagcaaaagc agatgagaga gaagcagatt gctgaagaag 1260 agcgaaagat aagagaagcg aaagatagca tggaacggaa aaaaagattg gaaaaagttt 1320 taatcacttt aaatgatcgt ttgaaaagag agccagaaag tgttgaagag aaaaatccgg 1380 tgagagattt agagcaggaa caaacgaagg atctgtttga gaataggaag aaaggggaga 1440 aaacgatcag ttttaattac gattcggatg attataggaa ttatcgtagg aaggacagtt 1500 ccgaggacag tagagagaag actggtcgta aatacaaaaa agagagtaga gcatccaaca 1560 tgcctcataa gtcgtccaaa cgcgggaaga aaaaacgaca tcgtagaccg agcttttcgg 1620 ctacatctag tactacatcc gggacagaaa gttccgagtt ttccagttcg gaaagtagta 1680 cggattctag caaagagtgg aagggtagga gagaaaggag agaaggtagg agaaacaggg 1740 atctgaagag aataccggtt gctgagtgga gattgaaata cgatgggaaa gatcagggcc 1800 gaaaattaac tgaatttttg aaagagatca aaatgagatg catttctgag gaaatttctg 1860 agagagagtt gtttaggagt gccattcacc ttttctcagg gcgcgccaaa gactggttca 1920 ttgaagggat cgagaacaga gactttcgga attggcatga gctaaaaaag gaactaaaac 1980 gggaattttt accacccgat ttggatttcc agcttgaggt tcaagctaca gagcggcgac 2040 aggctagggg tgaaaaattt actgactatc tgcatgatat aatgaaaatt ttccactcga 2100 tgacccgccc gatctctgag cgcaggaagt tcgatattat ctggcgcaat cttcggttcg 2160 aatacaagaa cgccatgacc ggtgcaggag ttagatcgtt gtccaaactg aaaaagtacg 2220 gacggattat tgacgagaat aactggacaa tgttccacaa gtctcatgac tatatggccc 2280 gtggtagaaa tactcaggtc agtgaagttt ctgcaacgga ttcgtcgaag tccaaagcgc 2340 ctttccaaaa tccaaactct gcatcacgtg tttttaccag aagcaaatct aaaaatgaac 2400 ctacagagaa aatcaaaccg ttatcggaga cgaaaacctc tggggagaga aaggaagaaa 2460 agtctgctga tccagtagag ggttcttcta agggcacttt gatagcgatg gctgaaaatt 2520 atcgtcgacc tccaattgga gtgtgctata actgcagatt gagtggacat cattatgctg 2580 attgtcccaa accgaagggc aaattttgtc gaatttgtgg ctttggagat gtaattactc 2640 ccacttgccc agtgtgtcaa aaaaacgagg cgttttcagc ttgagggggc aagctgaagt 2700 agatcgccga aaccccccaa caaatgatcg tgtcactgaa gaactagaaa tgagtggatt 2760 tgaacctgtc tctgagaacg actactctcc cgtgtccaac gtggatgagc ttttcgttcg 2820 tgttgatgga gatacccgac cgtttgtaaa agtgagtgtg cttggaaggg aaataatcgg 2880 tcttttagac agtggagctc accgctcagt cttgggagca gggtgtagaa agctcataaa 2940 acttttgaaa ttgaaaatgt ttccttccga tactcacgta cagactgcat cagggtccac 3000 tgttgatgtg gaaggtttcg tgtatttgcc ggtcactttc aacaatgaaa accgcattat 3060 taagactctc gtggcgccca aactaaaacg aagattgata ttaggttttg atgatttttg 3120 gagagctttt caaattcaac ccgttgtgtt aagtcgagag tctcgaaaaa aggaaaagga 3180 gtttgaggaa agcttaaggg tggaagaatt ggaatggaag gaaggagggg taaaggagga 3240 aactctaagc gaaacacaaa aaggccagtt agagaaggtc aaactgaaat tcaaaattgc 3300 aatagagggg caagtcttag atgttactcc ccttgcaact cacagaattg accttaaaga 3360 tgagtttaaa aattcacccc caatcagaat aaacccatac ccaacatcac ctgagatgca 3420 aaaacgcatc aataaggaac tcgataacat gctgcaacag caagtcatag agcacagtaa 3480 aagtgagtgg tcacagagta ccgtcccagt aattaaaccc actggggaag tacggttgtg 3540 tttagatgcg agacgcctaa atgatagaac aaaacgcgac gcttatccac tcccacacca 3600 agacagaata ttgagcagat tgggtgcaag caagtattta actactatcg atttgacgaa 3660 agctttctta caaataccgc tacacccaga ctctcgcaaa tatactgctt tttcagtgtt 3720 ggggagagga ttgtttcagt tcaccaggct cccattcgga ctagtcaata gtccagctac 3780 attagcgaga ctaatggatg acgtcctggg gcatggtgag ctggaaccaa atgtgttcgt 3840 ctaccttgac gacattgtcg tggtaagcga tacatttgaa gcacacgtac aaaccctctc 3900 tgaagtagct cgtcggttga aggcagcaaa cctgtcaatc aaccttgaca aatctcgatt 3960 ttgtttgaaa gaactaccct atttgggata tatactctca cccgaagggc tgaagccaaa 4020 tcccgatcgt atcgaggcca tagtcaatta cgagagacca gcatctctca aatcgttgcg 4080 caggtttctc ggtatgacaa attattatag acgattcatt ccagggttca gtgagcactc 4140 tgcacctctc actgatctcc tgaaaaaaaa acctaaaact ctactgtgga actcgaaagc 4200 ggagcaagct ttcctaaacc tcaaagaaag tttgatcgct tcccccgtac tcgctaatcc 4260 caactttgac cttccatttc aagtccaatc agatgccagc gatagtgcca tagccgcaat 4320 cctgactcaa caacatgaag agggtgagaa aataattgct tatttttctc aaaagctctc 4380 ccctgctcag aaagcttacg cagcttcaga aaaagaaggg ttagcagtcc tctgtgccat 4440 tgagaaattc cgtccataca ttgaagggac acatttcact gtggtcacag atgcgtcagc 4500 tctcactcat ataatgaagg gtaagtggcg cacatcatcc cgattgagta gatggagtat 4560 agaattgcag gggtatgaca tggagatcag acacaggcgt gggaaggata acgtcattcc 4620 cgacgcactg tcccgatctg tggaattggc cgagattctg tctgaccaaa atgtgtggta 4680 ctctgactta ttcaagaagg ttgaatgttc acctgaagat tatttggact acaaaattga 4740 gaatcaacaa ctcctcaaac tagttccaaa caaaactgaa gtactcgatt accgccatga 4800 atggaagctc tgtatcccgg agagtgatag agagcggatt ctccatcatg aacatgacga 4860 gtcattacac atcggatacg ataaactgct cgataaactg aaagcccgat acttctggcc 4920 gaaaatggca gagcatgcga aaaagtacgt tggcagatgt caggtgtgca aggaatgcaa 4980 acctagcacc atatcacagc acccaactat gggcagtcca cgtctagcca caaagccatt 5040 tcaaattttg gcggttgact tcattcagtc tttaccgcga tcaaaagccg gaaacatgca 5100 cttattcgtc cttctcgatg tgttctcgaa atggacagtt ctagtcccgg ttaggaaaat 5160 ttctgcggaa ctgatcgtga agatcctcga ggaacagtgg ttcaggcgct actccgttcc 5220 cgagatcata atcaccgata atgccacaag tttcttgagc aacgacttca agacattcct 5280 ggcgaaatat gaagtgaaac attgggcaaa ttcgcgtcat cactcccagg cgaaccctgt 5340 ggagcgattg aatcggagca taaatgcgtg tattcgcact tatgtgaaat ccgatcagcg 5400 ccaatgggac accaagattt ccgctatcga gcatacgata aacaatacgt tgcattcgtc 5460 cacaaacttt tctccttatc gcgtcttatt cggccatgag atcatcacca ctggtcaaga 5520 gcatcggaga gatccagaca cgatcgatat ttctgaaaaa gaacgaaatg aacaacgatt 5580 aaaagttgat gatgttgtgc atgccctggt tcgaaagaac ttagagaagg ctcacgatcg 5640 aagtaccaga gcgtacaacc ttcgttttcg ccaacccgct cctgtgtacc aaattggtca 5700 acaggtttat aaaagaaatt ttgcccaatc gtctgctgga gacaattata atgccaaact 5760 tggtccagcg tatgtaccat gcaccattgt ttcgcgaaga ggcacgagct catatgagct 5820 tgccgatgaa tccggcaaga atttgggtgt attctcagcc gcagatatta agcccggtat 5880 ctcaaattga tcagcacaga tttctccatc gttcagtcct gtttttagaa ttttaggaag 5940 ttaataatca ctaatatcga gaatttagaa actcgaccgt aatcaacgtg tccgcgtcgg 6000 tgaatcgtat tgtatagata gagcaaaaat cgttagagtg agcgatattg atgtagatga 6060 atgtaggtac gtgttttctg tagtagatct gtctctgttc gacaaaagta gaacttcaat 6120 ttttaaatag aacgttgttt agagggatcg tagcttgcga gtgaagaaca ccaccgacca 6180 atgaaatatg actagtagtg accaaaattc ttcctcgtga taatgtagat atttttaaac 6240 cggttatatc tgcactgcgg tcctaattaa tgacatccgt tgcaccaaat ccaatcatcc 6300 caatccaaaa aaaaaacata taatagccca gaatgcagaa aactattatc tattcattcg 6360 atcgatttgc atcaagtaat ttcgcaagac tagtttttta ttttggttca tctttcacga 6420 aacttcacag tttggataaa aaggaagatg ataaggccaa ttagctgaat cgggaatttg 6480 aagcgttaca aatagcattc cgtgcatgaa aggaatgaac tatttggtga ttacacgcga 6540 caatgtctgt ttttgggaaa gtcgaatgca tgaatgttgc atctcaaacc tggtatgcta 6600 tttagttagg taaactttat taaaattaat tgataataaa ttatacaatt aattttgggg 6660 ggagatagaa ggtgg 6675 // ID Gypsy-25-LTR_NVi repbase; DNA; INV; 363 BP. XX AC . XX DT 08-MAY-2009 (Rel. 14.05, Created) DT 08-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-25-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-363 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 987-987 (2009). XX DR [1] (Consensus) XX SQ Sequence 363 BP; 86 A; 111 C; 97 G; 69 T; 0 other; tgtgggggct cccgccgacc accaccggag ccaccgacgg aaacagcgcc tggtccccag 60 gcccgacttc acttaagcag tcgcgctttc cggccaccac cgctccgcgg cgccccctcc 120 gagccgccac gatgggtggc gatcaggcga cgcggcgggg ctcgagcgcc atgttagaac 180 cagagagaac gctctggtgc ttcgaacgag cgacacgcat cagagcttat aagagagaga 240 gagtttgagc taaataaaga gtcttatatt ttattaaatt gacttcacat ttattacgac 300 accctatcga gccggcgagt tatctgcgaa cctggagaat cccgattcgc acacgtttct 360 aaa 363 // ID Gypsy-15-I_HM repbase; DNA; INV; 4212 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-15-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4212 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 402-402 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 110..4165 FT /product="Gypsy-15-I_HM_1p" FT /translation="MTESGKGGVFPPMLDLTVDRFAAFRSWIEKWHDYVLL FT SDLEKKPPAYQAAMLRYTFSSETRNIYESLNLTEYEKTDPAIILEKMEVFA FT KGIINETLERHKYFKRFQEDGECFDDFLTEVKLLSKNCNFCNTEDCFGSLL FT RDKIVYGIRCDKVREKLLSEKTLTLDKAVEICRSSEKAQDGVSELRNNNSE FT SVDRIGKQLYSKNKFNSPTQNSNNRDFKFNSLLCKFCLKKHPFIRGSCPAW FT GKKCRDCKIMNHFSNSSVCLKTKPNIPDTCVDQQNDSNNSSSMQHLGALFL FT GRVGADCAERPWEIQLKVKYGKITFKIDTGAEVTVIGTNHLKKFGIKIEHL FT YTTNKRLIGPDYKPLNCLGYFQKSFSINGRNSELITIYVCDNVQTALLGKP FT ALQKFNLVKVDIPERFMCAMISKSKSNIIEQFPLVFKGLGTIKGNPVHITT FT QENATPYHIGAPRRVAFPLLEPLKIELERMKEMGVIKIVDQPTDWCHPIVV FT VKKPNGLLRICIDLTQLNKHTKREFYELPSVDDTLAQLGNRCKYMAKLDAN FT SGYWQLPMDIESQLKCTFTTPFGRFCPTRGPFGLTSLPEIFSKKMDQVIDG FT LKGVVKSMDDFLVFGNTEMEYNENLVALLNRFAENGVTLNVEKCLFDQTEV FT EFLGHKISAHGVKPLTKKVEAIKNFPKPTNITSLRSFLGMAQQLSKFNPSL FT AKVAEPLRDLLSSKSAWFWTENHTRAFNEVKNSLTNPPILAHYDVKKPVKV FT RTDGSLLNGLSVIVYQNHNGIYKPIDCASRFLTTTEKNYYPIEMEMLAVTW FT GCTRMSKYLYGLPNFILETDHKPLIPILNYRSLIDMSPRIQRLRMKLLRFS FT FTAEHVAGKSITDADAMSRAPVSNPTKEDEIAENDLNVYVNSIIKSMPATE FT SRLEEIKQKTAEDKLLNQLQESIISGWPNSKKYCPANIQPYWDFKDEIVKI FT DGLLLYQNRIIIPVSMRREILSKLHEGHLGMEKCKRRARQSVFWPGLNNQI FT EQLIRKCEACMKYLPSKPKEPMLTPDVPTRPWQKLGTDLFQWANKNYLIIV FT DYYSLWTEVFLLPNTGSANVIQACKESFSRNGIPEELVSDNGTQYSSKVFR FT QFSQQWQFKHITSSPHYAQSNGLAEATVKSVKLLIKKCYMSNEDIYKGILI FT LRNTPIKCGLSPAELLHGRQLRDNLPRFQSQNKEQSKYSREIVKERVQSKK FT YYDKNIGVFKPYVYRQGQTVAIQNEVTREWSLRGKIMKCVAPRSYEVKLNH FT NGHILRRNQIQLRKVYAISAPGVTRDNGAIMLRRQAIPSLEECNSGTTSSE FT MESENNIIEKNNLVENTKDRLSSRGRRIKNKVPIDYEDL*" XX SQ Sequence 4212 BP; 1555 A; 628 C; 784 G; 1245 T; 0 other; acacaacatg gcgctacgag tatgtaaaga aacgtgaaaa aactaatcgt ttttgtttaa 60 agattatctt gtttaatctt ttgaatcgtt gcattaaaat aaataaaaaa tgacagaaag 120 tggtaaggga ggtgtttttc ctccaatgtt agatttaact gtagatagat tcgctgcatt 180 tcgctcatgg attgagaaat ggcatgatta tgttttattg tcagatttag aaaagaaacc 240 accagcatat caagctgcta tgttaagata tacattttct tcagaaactc gaaacattta 300 cgaatcatta aatctaaccg aatatgaaaa aacggatcca gctattatat tggaaaaaat 360 ggaagtgttt gctaaaggta ttattaatga gactttagaa agacataaat attttaaacg 420 ttttcaagaa gatggtgaat gctttgatga tttcttaacc gaagtaaaat tactaagtaa 480 aaattgcaac ttttgcaata ctgaagactg ttttggcagt ttattaagag ataaaattgt 540 atatggaata cggtgtgaca aagtacgtga gaaattactt tctgaaaaaa ctttgacact 600 cgataaagca gtagaaattt gtcgttcttc agaaaaagca caagatggtg tatctgaatt 660 gcgaaataat aacagtgaaa gtgttgaccg cattggaaaa caattgtaca gtaaaaacaa 720 atttaattca cccacacaga acagtaataa tagagatttt aaatttaact cattgctatg 780 taaattttgt ttaaaaaagc atccttttat tcggggatcg tgtccagcct ggggtaaaaa 840 gtgcagagac tgtaaaatta tgaatcactt ctcaaatagt tcagtttgcc taaaaacaaa 900 gccaaatatt cctgatacat gtgttgatca acagaatgat agtaacaatt cgtcaagtat 960 gcagcatctt ggtgcattgt ttttaggaag agtcggagca gattgtgctg aaagaccttg 1020 ggaaatacaa ttaaaagtaa aatatggaaa aataacattt aaaattgaca caggtgctga 1080 ggtaacagtt ataggaacta accatttaaa aaagtttgga attaaaattg agcatttata 1140 cactactaat aaacgattaa taggtcctga ttataagcca ctcaattgtc ttggatattt 1200 tcaaaaatca ttttctataa acggtcggaa cagtgaatta attactatat atgtctgtga 1260 taatgtgcaa actgctttat tgggaaaacc tgccttacaa aagtttaact tagttaaagt 1320 agacattcct gaaagattta tgtgtgctat gatttccaaa tcaaaaagta acattataga 1380 acagtttcct ctagttttca aggggttagg gactattaaa ggtaaccctg tacatattac 1440 aactcaagaa aatgcaacac cgtaccacat tggagcacca cgacgtgtag cgttcccact 1500 actagaaccg ttgaagatag aactagagcg tatgaaagag atgggtgtta tcaaaatagt 1560 agatcaacct acagattggt gtcatccaat agtagttgtg aaaaaaccaa atggactact 1620 tcgtatttgt attgatttaa cacagttaaa caaacacacc aaacgtgaat tctatgaact 1680 gccaagtgta gacgatacat tggcccaact tggaaacaga tgtaaatata tggcaaagtt 1740 agacgctaac tcaggatatt ggcagttacc aatggatatt gaaagtcagt taaaatgtac 1800 atttacgaca ccttttggtc ggttctgccc aacaaggggg ccatttggat taacatcgct 1860 ccctgaaatt tttagtaaaa aaatggatca agttattgat ggtctaaaag gagtggttaa 1920 aagtatggat gatttcttag tttttggaaa tacagaaatg gaatataatg aaaaccttgt 1980 agcattgcta aatagatttg ctgaaaatgg agttaccctt aatgtagaga aatgtttatt 2040 cgatcaaaca gaagtggagt ttttaggtca taaaatatca gcacatggag taaaaccatt 2100 gaccaaaaaa gttgaagcaa taaaaaattt tccgaagcca acaaacatta cgtctttacg 2160 tagtttttta ggcatggccc agcaattatc taaattcaat ccatcgcttg ctaaagtagc 2220 cgaacccttg cgagatcttt tgagttctaa atcagcatgg ttttggactg aaaatcatac 2280 aagagcattc aatgaggtaa aaaacagttt aaccaatcca cccattttgg cgcactacga 2340 tgttaaaaaa cctgtaaaag taagaacaga tggaagttta ttaaatggac ttagcgtaat 2400 agtttaccag aatcataatg gtatatacaa accaattgat tgtgcatcta ggtttcttac 2460 gacaactgaa aaaaattatt atccgatcga aatggaaatg ttagcagtaa catgggggtg 2520 cacacgaatg tctaaatatt tgtatggttt accaaatttt atattagaaa cagatcacaa 2580 accattaatt ccaatattaa attatcgttc acttattgat atgtctccaa gaattcaacg 2640 tttaagaatg aagttattaa gattttcatt tactgcagag catgttgctg gcaagtctat 2700 tactgatgca gatgcaatgt ctcgtgcacc tgtatctaat cctacaaaag aagatgagat 2760 tgcagaaaat gatctgaatg tgtatgttaa ttctataata aaaagtatgc ctgctacaga 2820 gagtcgttta gaagagatta agcaaaaaac tgcggaagat aaacttttaa accaattgca 2880 agaaagtata atttctggat ggccaaattc aaaaaaatac tgtccggcaa atatccaacc 2940 ttattgggat tttaaagatg aaatagttaa aattgatgga ctcttattat atcagaatag 3000 aataataatc ccagttagta tgcggagaga aattctttca aaattgcacg aaggtcattt 3060 gggaatggag aaatgcaaac gtcgtgcacg tcagagtgtt ttttggccag gtttaaacaa 3120 ccaaattgaa cagttaatta gaaagtgcga ggcatgtatg aagtatctcc catcaaagcc 3180 aaaagaacct atgctaactc ctgatgtccc aacacggcca tggcaaaaac tgggaactga 3240 tttgtttcag tgggctaata aaaactattt aattattgtc gattattata gcttgtggac 3300 tgaggtgttt ctgcttccaa atactggatc agcaaatgtt atacaagctt gcaaagaatc 3360 tttttctaga aatggtatcc ctgaggaatt agtgtcagat aatggtacac aatatagttc 3420 gaaagtgttt cgtcaattct ctcaacagtg gcaatttaag cacataacaa gtagtccaca 3480 ctatgcacaa tcaaatggtt tagcagaagc aactgtaaag tcagtaaaac tgttgataaa 3540 gaaatgttat atgtcgaatg aggacattta taaaggtatc ttaattttac gtaatactcc 3600 aataaaatgt ggtttatctc cagcggaact attacatgga cgtcaactga gagataattt 3660 acccaggttt caatctcaaa ataaggaaca gtcgaaatat tcaagagaaa tagtcaaaga 3720 gagagttcaa tctaaaaaat actatgataa aaatattggt gtttttaaac catacgtcta 3780 tagacaaggt caaactgttg caattcaaaa cgaagtaact cgtgaatgga gtcttagagg 3840 taaaataatg aaatgtgttg cccctagatc atatgaagta aaattaaatc acaatggaca 3900 tatccttcgt cgaaatcaga ttcaacttag aaaagtttac gctatatcgg ctccaggggt 3960 gactcgggat aatggggcta taatgctcag aagacaagct attccgtcgc ttgaagaatg 4020 caatagtggt acaacgtcat cggaaatgga gagcgagaat aatataatag aaaagaacaa 4080 tttagttgaa aatacaaaag atcgattatc aagtcgtgga agaagaataa agaataaagt 4140 accaatcgat tatgaagatt tatgaactaa cttgatttct tgacatttat atatatatat 4200 atgtttgttt at 4212 // ID Sola1-N3_AAe repbase; DNA; INV; 1046 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1046 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1293-1293 (2011). XX DR [2] (Consensus) XX CC ~94% identical to consensus. 4-bp TSDs. TIRs are 27 bp long. CC This family is >82% identical to both termini of Sola1-1_AA. XX SQ Sequence 1046 BP; 340 A; 180 C; 189 G; 337 T; 0 other; tctcgcgttc agtatcataa gggcgtatct gcagttttgg cgggaaatga gatttcgtaa 60 tatttgaatt ctataggtcc gaaactataa ccctaccaaa aattattact aaaaacattc 120 tagaaatacc atattttcaa aaacataatt tgacgtaact acagaactgc aatgggaaaa 180 aagatacgtc aggcgtaact tttctggtca tcatctgttg aagttacgcc aactgcgata 240 ctgttccgtg tttacaaaat gtctaacaga aaatatttgc tgctccgtat tatttcttca 300 ttttacacac gatatggcgc agttgctaag ttcgatgcat tcgaggatgt agtatggttt 360 acacctgaag aaatggtgaa ggaatatact gcagttgtag gtgagtgatg ttaaaaaagc 420 agtccaaaca tttttcagta aatttgtgtt accatcattg aacacagcac agcggattcc 480 cgcacaggca atgttgtagg aacagaaaag gacaatctgg aaccagtgta caataactcg 540 taataattcg atcaaattct ggttgaaaat tgacaaactt tccctgtatc ttagccgtaa 600 acgatacatg agcaccaaca tggtattatc gaagaaagtt ttgtttcatt aattgtaaat 660 atgtccaaag tgtacgtact tgagaagctg gttctattcc agtaggcatg aaattttaac 720 aaaaagttat tagagactct cagagtctgt ttggtgttat aaaaattgtt aatttatcaa 780 aaactaatga ttttgcttcg tggcgtaact ttttccaatg cgacataccg gcgtaactac 840 aatttatttc attttccagc atattttacg tgaacttcat ttggcaccgg atactctaat 900 gtcgtgatag tatttgtata aattttcaga taaattcact taaaaacaaa aaaatacgtg 960 aatttcttgt ttgttgaaaa cttcaatatc gattttctcg gtttcttcga aactgcagtt 1020 acgcccttat gatactgaac gcgaga 1046 // ID CR1-2_TCa repbase; DNA; INV; 2099 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.04, Created) DT 04-APR-2009 (Rel. 14.04, Last updated, Version 3) XX DE CR1-type retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW CR1-2_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-2099 RA Jurka J.; RT "CR1-type retrotransposons from Tribolium castaneum."; RL Repbase Reports 9(4), 736-736 (2009). XX DR [1] (Consensus) XX CC The 5'-end not determined. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 183..2099 FT /product="CR1-2_TCa_1p" FT /translation="CLSTTRKQYPTEQNKELYTEYKNKHLLLLNSTKKSFY FT ENKISVSDNRIKTIWSIIKSETKSVTNQLKLPTNVSADNFNDFFLHNPENI FT IKNLTNGHNDSHTYLNKKYIKTNASMFFQPVIEDDIRIIINKLKNKTAEDI FT YGISIHLLKVSATPILSIITNIINCCLSEGVFPSKLKISRVLPIYKKGDEN FT DLKNYRQISILPAFSKILESVMCQQLVNFLEKNKYLSQQQFGFRKGFTTTD FT AILSLVKDIQKAFENKQTYVGVFTDLSSAFDCVDHEILLDKLRFYGVRGTC FT LKLFRSYLEDRQQYVKINASLSKAGHIKYGVPQGSILGPVLFLVYINDLEN FT SMDKNILTIMYADDTSLGIPINKKKNVVPIKERMIRDADEWFAANKLALNK FT TKTVTKVFELQSQLKQTEHIKFLGLFLDNKLSWAPHVYEVCNRLSTAVFAI FT RKIRVSVNEYAALSTYYALFHSQINYSILLWGDAAYVHLHKILVLQKAAVR FT AIAEICSDVSCRAYFKKYKIMTLFSLIIFHKVYDVRKNISSYPSLADVHSH FT DTRNKTNLVNPRFRLAKSDNLSIRLYNSLSQNLKDQHLPLKTFKTKLINML FT TECSLYSINEYLCVCVCVCYHYIFILLTSLILILLLLLLLL" XX SQ Sequence 2099 BP; 814 A; 345 C; 304 G; 635 T; 1 other; actcaatatt tagaaccaca aactggataa taaaattaac aacggcagaa gacaaattta 60 acgaattctt taaaatattt ttggagggtt ttaaaccatc ttcaactata ccttcaaaaa 120 taaaaataca aaccgcaata aaataaaatc atggataaca cctgatataa aaaaaaagtt 180 aatgtttatc tactacgcgt aaacaatatc ccacagaaca aaataaagaa ctatatacgg 240 aatataaaaa taaacatcta ctgttgctaa actctactaa aaaatccttc tatgaaaaca 300 aaatttcagt atcagacaat cgtataaaaa caatatggtc tatcatcaaa agtgaaacta 360 aaagtgttac caatcaatta aaacttccca caaatgttag tgcggataat tttaatgatt 420 tctttttaca taatccggaa aatattatca aaaatttaac taatggacat aatgattctc 480 atacttactt aaataaaaaa tatattaaaa ccaacgcttc aatgtttttc caacccgtca 540 ttgargatga catccgtata ataataaata aacttaaaaa caaaacggcg gaagatatat 600 atggtataag cattcattta cttaaagtga gtgctactcc tattttaagt attattacaa 660 atattataaa ctgttgtctg agcgaaggtg tatttccatc aaaacttaaa attagtagag 720 tcttaccaat atacaagaaa ggagacgaaa atgacctaaa aaattaccgc caaatatcaa 780 tattgccagc attttctaaa atattagaat cagtaatgtg ccaacagctc gtaaacttct 840 tagagaaaaa taaatatcta agtcagcaac agtttggttt cagaaaaggc ttcaccacaa 900 ccgacgcaat attgtcatta gttaaagata ttcaaaaagc atttgaaaat aaacaaacat 960 atgtaggtgt attcacggac cttagctctg catttgactg cgttgatcac gaaattttgc 1020 tagataaact acgcttctat ggagtaaggg gaacatgtct aaaactgttt agatcctatt 1080 tggaagatcg acaacaatac gtgaaaatca acgcatcact aagtaaagca ggacatataa 1140 aatacggggt acctcagggg tccatactgg gtcctgttct ctttctagtt tatataaatg 1200 acttagaaaa ttctatggac aagaacatcc taacaataat gtatgctgat gatacgtcat 1260 taggtattcc tataaacaag aagaagaacg tggtaccgat taaggaaaga atgatcagag 1320 atgcagatga gtggtttgca gccaataaat tagccctaaa taaaacaaaa acagttacaa 1380 aagtatttga acttcagtcg caactcaaac aaactgaaca cataaaattt cttggactgt 1440 ttttagacaa caaactatcg tgggcacccc atgtttatga agtttgcaat aggttgtcca 1500 cagcagtatt tgcaatacga aaaataagag taagcgttaa cgaatatgcc gctttatcaa 1560 catattatgc cttgtttcat tcccaaataa attacagtat tttattgtgg ggagacgcag 1620 cctacgttca tctccataag attcttgttt tacagaaagc tgcagtcaga gcaatagccg 1680 aaatatgtag tgatgttagt tgcagagcat atttcaaaaa gtacaaaatt atgacactat 1740 tttctcttat tatatttcat aaagtttacg atgttagaaa aaatatctca agttatccct 1800 ctttggcaga cgtgcatagt catgacacaa gaaacaaaac aaatcttgtt aatccgcgtt 1860 ttcggttagc aaaaagtgac aatctaagta tacggcttta taactctcta tctcaaaatc 1920 taaaagatca acatcttcct ctcaaaacat ttaaaactaa attaataaat atgttaactg 1980 agtgcagcct atacagtatt aatgaatatt tgtgtgtgtg tgtgtgtgtg tgttatcatt 2040 atatttttat cttactgaca agtcttattt taatattatt attattatta ttattatta 2099 // ID Ci000010 repbase; DNA; INV; 211 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE DNA transposon from Ciona savignyi. XX KW DNA transposon; Transposable Element; Ci000010. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-211 RA Smit A.F.; RT "Ci000010 - DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC 7 bp duplication site (very clear), 53 bp TIRs. XX SQ Sequence 211 BP; 63 A; 61 C; 31 G; 55 T; 1 other; cgggcaacac taaaaggagg gaaagcggga aggagggaaa ctgctcacta aaactatgga 60 aactaattta aaccctgccc taaacactaa tctcaacccc agaactaacc ctaactccaa 120 ccccatgttt aaaactaact ctaatgtcnt aatcttcatt ttagtgtgca gtttccctcc 180 ttcccgcttt ccctcctttt agttttcccc g 211 // ID BEL-53_AA-I repbase; DNA; INV; 5659 BP. XX AC supercont1.141; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-53_AA_; KW BEL-53_AA-LTR; BEL-53_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5659 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.141; Positions 378811 384469. XX CC Positions [4681-5262] - Integrase core CC 'GTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 226..5658 FT /product="BEL-53_AA-I_1p" FT /translation="MWRKLVGQEDDEKLESSHVENTEKDTPNTKQQKRKLK FT MAEQELKSLITKRGQVKGKVTRIWNAIQTDPGQELTISQQQLRVHARNIEK FT YYTEYNEVNDRIISLVPDKQVEVHEGKLIEFEDIYNNTLVVIETLLDAHAT FT AVQQENPQSGSVAAGKPQYVIHQQSLKAPLPTFNGKYENWPKFKSMFVDLM FT RNSPDSDAIKLFHLEKALVGEAEGILDAKTIQDNNYQQAWRILEERYENKR FT RIIDIHIEGILQLKKVEKKSSRELRELINECTRHVENLRFHDQELTGVSEL FT IVVHILASCLDNETRELWESTIEHGELPEYEETVEFLKKRCLVLERCETSK FT SVISSAKPVVSKGSTSMKSSTKVSAAVSTTSEIVCELCGGSHLNFKCSAFR FT SMSVSQRYAKAKQANVCFNCLRKGHRSAACTSERSCSKCSAKHHGMLHPEE FT ESSRSLPPPKQEDKANVSAVQATGPLVQSGAVEPTSSVPVESNTRNSASVT FT TSCYNGGMLQSTKQVFLQTAIIDVVDKHGRLHPCRTLLDSGSQAHILSESM FT ARVLELPFTKCNVMVVGANAVKTQARKGMIIEFSSRYSNYKDRIECLVSDR FT PTGTIPGATINVTEWNIPDGIQLADPRFYEPHEVDLILASNYVWDLLRMNT FT VTLVNGTTSLRETDLGWIVTGTFDPYAQVSQSILLSNVTLQEPLHDAIEKF FT WTVEEIGDSSPNTNEEMEVEAHFLETYRRDETGRFVVKMPFKDIVSELDDN FT RELALKRFHQLERRLSRNPELKRQYTAFIEEYEALGHCKEVYEKDDVPGKG FT SYYLPHHAVLKPSSSSTKLRVVFNASSKSGRYSLNEVLKIGPTVQSDLFSV FT LLRFRCYLYAFSGDVTKMYRQIGIDASQTAFLRIFWRKHPADPIRVLELAT FT VTYGTSSAPYLATRCLVQLAEEEAERYPVASEMVKKDVYVDDLLSGAATQE FT EAIERQREVTELLAEGGFPIRKWCSNSPAVLKNVPEENQEKLIKIQECGEA FT TKALGVLWDPREDKLLFGIECKQEESDKITKRYVLSTIAKLFDPLGLVSPV FT VVLAKMLLQKLWACQLNWDEELEGDILQEWKNFQRSMPTLTKISVPRCVLS FT SGVVRLELHGFADASLLAYGACVYLRCLGEGDLITTSLLCAKSRVAPLKGI FT TIPRMELLAALLLSRLVSKVATIVQINVSATVLWSDSQIVLAWLKRAPGTL FT EVFVRNRVVQINNLTASCTWKYVRTTENPADIVSRGQMPLLLSVNKLWWQG FT PVFLQQSEYVVEEPPPVPENEIPEIKTLVISDSICVEEMLPVFTRYSDFRK FT LQRVIAYVRRFIFNCRSRVINQPKIVSKYLTVPEMRQALKYIVMVIQMMEL FT PEEVRAAERGEPSKRFAGLSPFMDRDKLLRVGRRLENANLPFESKHQLILP FT PHPVTEALIRTLHVENMHVGPTGLLAIVRRQFWLLRGRSAVRKIIRRCLNC FT FRTKPSRIQQLMGSLPDERVNVSAPFEYTGVDYAGPVTVKQGKYRPKLVKG FT YIAVFVCLATKNIHLELVSELTTEAFIAALERFVNRRGLVRKMFSDNGTNF FT VGASRELRQLYQQLHDDVQSQGINNLLLPRDIEWHFIPPKAPNFGGLWEAG FT VKSTKTHLKRTFQNAVLTFEEFATALTHVEAILNSRPLFSLNDNPDDPMPI FT TPAHLQIGRPLQSVSKPTCLDVPVNRLSRWRYLDRLKEHFWGRWSREYLTT FT LQTRSKWTSRESNVQPGMVVLLIEDNLPPQTWTMGIILKTYAGRDSLVRVV FT DVKTSSGIFKRPISKLAPLPIEDNDYLQESSKSASQPRGR" XX SQ Sequence 5659 BP; 1596 A; 1225 C; 1460 G; 1378 T; 0 other; ttggtccatt cgatccggat gagttccgga caagagcgtt tcacgaggct ccaacatcgt 60 gaatctcgtc gagtcgcaaa gtatcgtagt gcattgaccg cagcgcgagt gcaagaagcg 120 tcgtgtttgg gttaagtgcc gccattttgt cgagaaattt ccaaagcgtg gagattcctc 180 ggtgaaaatt ccagaaagca gttccgttag catcccgtgg ttcctatgtg gcgaaagctt 240 gtgggtcaag aagacgatga gaagctggag tcatcccatg tagagaacac cgagaaagat 300 actccgaaca caaagcagca aaagaggaaa ttaaaaatgg ctgaacaaga gctgaagtca 360 ctcatcacca agcgtggcca ggttaaaggg aaagttaccc gaatctggaa cgctatccag 420 actgatccag gacaggagct gacaatctcc cagcagcagc tacgagtcca cgcaagaaat 480 atcgagaagt actacaccga gtacaacgag gtcaacgacc ggatcatttc gctggtcccg 540 gataagcaag tggaagttca cgaaggaaag ctgatcgagt tcgaagacat ctacaacaac 600 acactggtgg tcatcgaaac gcttctggat gctcacgcta cagcagttca gcaggagaac 660 ccacagagtg gttcggtcgc tgctggcaaa ccacagtatg tgattcacca gcaatctttg 720 aaggcgcctt tgcccacttt caacgggaaa tatgaaaact ggccgaagtt taaatcaatg 780 ttcgttgact tgatgagaaa ctcgccagat tccgatgcta ttaaattatt ccacctggag 840 aaggccctag taggcgaggc tgaaggtatc ctggacgcca agaccatcca ggacaataac 900 taccagcagg cctggaggat cctggaggaa cgatacgaaa acaagcgacg aatcatcgac 960 atccatattg aaggtatttt gcagctgaag aaagttgaga agaaatcatc cagagagcta 1020 agagaactaa ttaatgagtg tacaaggcat gtcgagaacc tgcggttcca tgatcaagag 1080 cttacaggtg tgtccgagtt gattgtggtg cacattctcg cttcgtgttt ggacaatgaa 1140 acccgtgaat tgtgggaaag taccattgaa catggcgagc tgccggaata tgaagaaaca 1200 gtggagttct tgaagaaaag gtgtcttgtt cttgagcgtt gtgaaacgag taaatcagtg 1260 atatcatctg ccaagccggt agtttccaaa ggatcgacgt caatgaagag ttccactaaa 1320 gtgtccgctg cagtatcaac aaccagtgaa attgtttgtg agctatgtgg tggatctcat 1380 ttgaacttta agtgcagtgc atttcgaagc atgagtgtca gtcaaagata tgcaaaagcg 1440 aagcaagcaa atgtgtgctt caactgtctg cggaaaggac atcgtagtgc tgcatgtaca 1500 tctgagcgaa gttgttccaa gtgtagtgcc aagcatcacg gaatgttgca tccagaggaa 1560 gaatcatctc gatcgttgcc gccacctaag caggaagata aagcaaatgt tagtgctgtg 1620 caagccactg gtccactggt gcagtcaggt gctgttgaac cgaccagttc cgtaccagta 1680 gagtcaaaca ccaggaatag tgcgtcggta acaacgtcgt gctacaacgg aggaatgctc 1740 cagtcaacta aacaggtatt tctgcaaaca gccattattg acgtagtgga caagcatggg 1800 cggctgcatc cgtgccgcac gttgctggat tccggctcac aggctcatat tctgtcggag 1860 tccatggcac gagtcctgga attgccgttt acaaaatgca acgtcatggt tgttggagca 1920 aatgctgtga aaactcaggc tagaaagggg atgattatag aattttcgtc ccgatactcg 1980 aactacaagg accgtattga gtgtttggtc tcagatcgtc caaccggaac gatacctgga 2040 gcaacaatca acgtgacaga atggaacatc cccgacggaa tacagctcgc ggatccacgt 2100 ttctacgagc cacacgaggt tgatcttatt ttggcatcaa attacgtttg ggacctgctg 2160 cgaatgaata ctgtgacatt agtcaacggt actacttcac ttcgagaaac agacttggga 2220 tggatcgtaa ctggtacgtt tgatccttat gctcaggtga gtcaatccat tttactttcc 2280 aacgttactt tgcaagaacc cttgcacgat gccattgaga aattttggac ggtcgaagaa 2340 ataggcgact cttcgccaaa caccaatgaa gaaatggaag ttgaagcaca ctttttagaa 2400 acgtatcgcc gagatgaaac cggcaggttt gttgtcaaaa tgcctttcaa ggatatcgtc 2460 agtgagctag acgacaatcg tgaacttgcc ctcaaaaggt ttcatcagct tgaaaggaga 2520 ctgtcgcgga atccggaatt aaagcggcag tacacagcat tcatcgaaga atatgaggca 2580 cttggacact gcaaagaggt ttacgagaaa gacgacgtac ccggcaaagg aagttattac 2640 ttaccgcacc atgcggtcct gaaaccatcc agttcgtcga caaagttgag ggtggtcttc 2700 aatgccagtt caaaatctgg ccgatattcg ctaaacgagg tattgaagat cggcccgact 2760 gtgcagagtg acctgttctc tgtccttctt cgatttcgct gctacttgta tgccttttcg 2820 ggtgacgtaa ccaaaatgta taggcagatt ggtatagacg ccagccaaac tgcttttttg 2880 agaattttct ggagaaagca tcctgcggat cctattcgag tacttgaact ggcaacggtc 2940 acatatggaa catcgtcggc tccctatctg gcaactaggt gcttggttca attggcagaa 3000 gaggaagcag aacggtatcc ggttgcgtcc gaaatggtca aaaaggatgt gtacgtggat 3060 gacttgctgt caggtgccgc aacgcaagaa gaagccatcg aacgccaacg tgaagtgacg 3120 gagcttctag cagaaggagg attccccata agaaagtggt gctcgaattc gccagcagtg 3180 ttaaaaaacg tacctgaaga aaatcaagaa aaactaatta aaattcaaga gtgtggtgaa 3240 gcaacaaagg cattgggtgt attgtgggat ccacgagaag acaaattgct ttttggcatt 3300 gagtgcaaac aagaagaaag tgataagatt acgaaaagat atgtactttc gaccattgcg 3360 aaactgttcg acccgttggg cttagtgtca ccggtggtag tgttagctaa aatgttgttg 3420 caaaagttgt gggcatgtca attgaactgg gacgaagagt tggagggaga tattctccaa 3480 gagtggaaga attttcagcg ctcaatgcct acacttacca aaatcagtgt gccgcggtgt 3540 gtcttgagta gtggcgtcgt gcgcttggag ctccatggat ttgcggatgc ctcactcctt 3600 gcgtacggtg catgtgtata cttacggtgc ctaggagaag gcgacttgat aactacgagt 3660 ttactgtgtg caaaatcaag agtcgctccg ttgaagggca ttaccattcc gaggatggag 3720 cttcttgcgg ctcttttgct atcccgcctc gtttccaagg tggcaaccat cgtccagata 3780 aatgtgtccg ctactgtgtt atggtccgat agccaaattg tactggcatg gttgaaacga 3840 gcacccggaa cccttgaggt gttcgttcgt aatcgggtgg tacagatcaa caatctcact 3900 gctagttgta cgtggaagta tgtgagaacc actgaaaacc ctgccgatat tgtatcacgt 3960 gggcaaatgc cattactgtt gagtgtgaac aagctgtggt ggcagggacc agtgtttctg 4020 caacaatcgg agtacgtggt agaggaaccc ccaccggttc cagaaaatga gattccggag 4080 attaagaccc tggttatatc agattccatt tgcgttgaag agatgctgcc ggtcttcacc 4140 agatacagtg attttcggaa actgcaacga gtgattgcgt atgtgcgtag attcattttc 4200 aattgtcgaa gccgcgtcat caatcaacca aagatcgtat cgaaatacct cactgttcct 4260 gaaatgagac aagcgttgaa gtatatcgtt atggtgatcc aaatgatgga gttgccagaa 4320 gaagtaagag ctgctgagag aggtgaaccg tcgaagcgtt ttgccggttt gagccccttc 4380 atggaccgtg acaagttgtt gagagtgggg aggcgtttgg aaaatgccaa cctgccattc 4440 gagtcaaagc atcagttgat actgccgccg catcctgtga ctgaagcact gatccggacc 4500 ctccacgtag aaaatatgca cgttgggccg accggactgc ttgcaattgt tcgtcggcaa 4560 ttttggttac tgcgtggccg atctgcggtt cgaaaaatta ttcgtcggtg cttgaactgt 4620 ttccgaacca aaccatcgag gatccagcag ctaatgggta gtctacccga tgaacgagta 4680 aacgtctccg cacctttcga gtacactgga gtcgattatg cgggccctgt aacagtaaag 4740 caaggtaaat atcgtccgaa actggtcaaa ggttatattg ccgtttttgt atgcttggcg 4800 acaaaaaata tacatttgga gttagtgtcc gaactaacga cagaagcgtt catagctgcg 4860 ttggaacgct tcgtgaaccg tcgaggattg gtgcgaaaaa tgttctctga taacggaaca 4920 aatttcgtgg gagcgtcacg tgagcttcgt cagttgtacc agcagctcca tgatgatgta 4980 caatctcaag gtataaacaa tttgttgttg ccaagagaca tcgagtggca ttttatccct 5040 cccaaggcac caaattttgg tggcctttgg gaggctgggg tgaaaagcac caaaactcac 5100 ctgaaaagaa ctttccagaa tgcagttctt acttttgaag agtttgccac tgccttgaca 5160 cacgttgagg ctattttgaa ttcacgtcca ctgttcagct tgaacgacaa tccagacgat 5220 ccgatgccaa ttactccagc tcatctacaa atcggcagac cgttgcagag tgtttcaaag 5280 cctacctgtc ttgatgttcc ggtgaatcgt ctatcgaggt ggagatattt ggataggctg 5340 aaagagcact tctggggccg atggtcacgg gagtacttga caacgcttca aacccgtagt 5400 aaatggacat caagggaatc caacgttcag ccaggaatgg ttgtgcttct gattgaggac 5460 aatcttccgc cgcaaacctg gactatggga ataattttga aaacgtacgc aggaagagat 5520 tcacttgtac gagtcgtgga cgtgaaaact tcttcaggga ttttcaaacg gccaatctcg 5580 aagctggctc cgctaccaat tgaagacaac gactacctcc aggaaagttc gaagtcagct 5640 tcgcagccgc ggggcagga 5659 // ID L1-3_Cis repbase; DNA; INV; 6114 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-3_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6114 RA Smit A.F.; RT "L1-3_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC 3% div. XX SQ Sequence 6114 BP; 2455 A; 1264 C; 937 G; 1445 T; 13 other; ttctatttga ataaggcaag aatttcctta agtggttatc tgccgaatag caaatagaat 60 ttctgtccat tttggattac tgtcatctat cgtattcctt tggatatatt ttggatatac 120 ttccctcttg ttggcaagaa gaccgcaaac aaaaggtaaa atatctaact gccttgcttt 180 aaaatttctg tagataactc cccgtttttg tctttttttc gttaaaacca gccgcacagc 240 cttttttcgt ggcgtacggc gaatagacca ccaccccttt tttgtatcac gtgtgttttt 300 ttggtgtaag atattatctc tgtttgccac gtgtacgaac gcacgtgcaa cactggtaat 360 atcttaacaa gtgtgttttc ttnacttcta cccagcagca ataatatagt gtttccacgt 420 ctctcagtcc aaaaagacct ggaaacactg ttaattgctt acccgctgat nacctccgcc 480 tttgtgtaaa ttccctccca ctccgccttc tcctcttttc tctttattaa gagaaaatct 540 gagccaataa ctccttgtga tagcgttcta gtaagtgacc agcgcattaa gtgggtttcc 600 tggaaagaac ctaaatcaat aacataaata cctgcacaaa atcaacataa aagcataata 660 taaaacacac ggacttaccg caagccaaag aatgatgagc tccgactaca tccccgagat 720 tgaaggaaca ttggaccgtt cggttgtttt ccagctaacc gggacagttg agcacatgag 780 cgtcagctcg ttcatggagc tttttgcaga aggtgctcct cttgcaggga tatctgaaat 840 agtggaaagt gtcattttgg aaaacttcaa ggacgcaacn tttctggtaa cgcttaaaaa 900 cccaagcgaa actcagatag tccccaaaag gcagcatatc atagattatt tcntaaataa 960 tcagatcact tttacaacgg aaagcggagc aaacatggcc attaaagcgg attttccaca 1020 aggagagagc gagatcgtct ccttgcaccc attcccattg gacatagaca taaaaaaatt 1080 ggaagacctc attgagaaag aacagtgggg caaattgaaa catatcaatt atggtacaca 1140 ccggaacttt aacaaaatta agaacggntg gcttaacgtc acactaacag aaacaaaggt 1200 naaaaatatc ccaccgctcc taaaaattgc cggacgatta ataacagtga cgagaccggg 1260 agaggctcac ctacccctct gtcggtattg caagataaga ggacatcacc aacagtcatg 1320 ccccaaaaaa ggcttctgca caacttgcaa gacacacggc cacctcactc ggaactgtag 1380 aaacggaaca acgtatgaga ccagaccaag aactgtcttt tactcagaaa catccactgc 1440 tccgctcctc agcacaccga gtcgtaatga caaaaatgtc attataaacg aatggcagac 1500 ggttcaaagg gtaggaaagc aacacctcat accaacaaga catgcgccaa tacgactgac 1560 taaccatttc ggcttactcg atgcggatga catgctgcag aaagcattat tcggagaaac 1620 gtccaatgat atcgtccaag accttttcaa accagacgac ttcccggaac taattgaaaa 1680 ccacaccagc ccgacattaa gcgcacccaa atcacaacgt tcaccccggc gtaagaaaca 1740 aaaaaaaggc aaaacatcac cagagccaac caaacctgac caacaacacc agaatcccgc 1800 ccatgaggaa ccaaatgaac aaagcaaacc tgaaacaatg gaaaccacaa acctgtcaag 1860 cagcactaac acaagtgacc tccttctaga agagatcaca gacaaaacag accaacaatt 1920 caaatccctg ctgaacagaa aaccttcaca aacagaatca gtttacgtaa ccccggacga 1980 caacaaaacc ccaacacatc caacggcaca aaacaatgat ttcaacaccg aactctacaa 2040 cctaagaaca aactggaata tcaactctcc aaaacgacca agaccatcag actcttgaac 2100 taacatcgcg ctaagcgcca aaaacccgaa acaccttgca taaaacaatc acacccacac 2160 acgaaaccca tgtaatatat catttgaaat ggagattgaa caaaatttgg acaaaaattt 2220 aaaaattggt tcaattaaca caaatggagt tatacggaaa ataaaacaaa taatatatta 2280 catggactca aataaaatag atattttgct aatccaggaa acccacggtc tcaacgaaga 2340 ccacatttta aactttgaaa cacaatatag catcaaattt ttcttaaacg cgcctgatcg 2400 aaataacaaa ttctaccgcc aaggcacagc ctttatcatc aagaagcata tactctcaat 2460 atatgaaatc acacatgaca ctctttttga caatagaatt cacagaataa aaatcactta 2520 caaacataac gaaattcacc tatacaatgt ctatatggaa gcaggcaact gccatcacaa 2580 tcttttaaac agagaaaaca tgctaaacaa tcttaaaaac aagttgaatg aaacaaacgt 2640 gacaatcgac cttataatgg gtgattttaa tatggtttta gacgaaatcg atgtacaacg 2700 actgtttgac aaaagaaaaa aacgtgatag aatggcgtta aagagactac taaccgaaaa 2760 atcgttcaac gacgctttta gacttaagca aaaacgcaaa atcgaattta caagaattac 2820 aaccacaagc gcaacacgac tagacaggat ctacgtaaac aatttggcaa aagataaggt 2880 aacaaacttt tcacacgtgc gtaattattt ttccgaccac aacaactgcc caataatcac 2940 tttaaaaata aacagtaaca gtaaatgggg aaattccttt tacaaaataa acaacgctat 3000 tctggactac agcgacccga tcgagaatgt atgcgcactg ttagaaaatt ggaaaaaaca 3060 gaagagaaaa tacaggaatc cactactatg gtgggatgac tgcaagaaat taataaccaa 3120 tgatctgata agatactcac aantaatcaa ttatgctgaa aagagaagat atataaacaa 3180 agtaaaagaa ttggaatcac tggaaaagct gaccccggta gaacaaatta acgaaaagat 3240 aataagacta aagaaaaacg tagaaagata tgaaagtaaa attaatgaag gagccataat 3300 acgatccaaa ataaagatca taccaaatga agaaaaacca acaaaagaat tcttccgata 3360 tgaacaaacg aaaggaaata gggacaccat acacacagta tacaacaaac atggacaaat 3420 aaccaacaac ccaaacgaaa caatgcaagc aatacacaat ttttacaacg acctctggac 3480 aacaagagga acaaatatac aatcattaac aaactatcta tcaataatac aaccgataca 3540 aatacatgaa tcagaaataa aagaaatgac caaaccgata cacatgaagg aaatccacga 3600 ggcaataact gacatgaacg acgacagctc ccctggcatt gacggattaa caacaaaatt 3660 gtacaaaaaa ttatggccat atatcaaata cgaattggaa gaattataca acaatatata 3720 tttacaaggc actatgacta aaacaatgaa gacagcaatt attaaactaa tatataaaag 3780 gggggataaa aaagacataa aaaattggag accaatatca ctcttaaaca ccgattacaa 3840 aattattagt aaaattatag cccaaagatt aaacataata ctaaatagaa taataagccc 3900 caaccaaaaa tgtggaatcc ccggacgaac gatggacgat tgtctttaca atataaccgc 3960 ctgtatagag gcagcaaaac attttaacag aaatttaacg ataattgcta tcgacttcga 4020 aaaagccttt gatcgcgtaa actatacttt tatccacgaa atccttaata aattaaacgt 4080 ccccacctat ataattaatt ggataaaaat aatatacaac caaattatta gcaaaattga 4140 aatcaacgga gcttttacag accaaataaa tataactaga ggaattagac aagggtgccc 4200 gtgtagcatg ttgctatttc tgataggaat tgaaatactn actcggaaaa tacaaagtaa 4260 tcacaatata aaaggattca aattaaataa gattgaatta aagaccgaac aatatgcaga 4320 cgacctttcc attattctat ctgacgccac aagtattaac gaactaatga aggaactacg 4380 cgaattcgaa atagtatcag gacaaaaaat aaacacnaat aaaacacaag ctatatcaaa 4440 cgaccctaca atacatcaca caatcaacac ggaaatatca aatgaatgca ttaaagatac 4500 aattaaaata ttgggacttc atattagctt aaaatccgac tgtgtaaaag agaacctggc 4560 taaatcgcga cgcataataa accacttata ccataaacat atcaaacgaa aattgacgat 4620 aaaaggaaga atcatgctaa ttaattcttt attcgtacca cagcttatca acacaggaag 4680 acatataatc gcaccacaaa gctttctaaa cgacataaat agaacattgt ttaaatttat 4740 ttggtatcca tttaaaatcg atcgcatcgc gagacggaaa ttaatagcgc cacctcacga 4800 tggaggatta aatgcaccgg acattaaact aaaactacag gcacttagag catctcgcat 4860 atttgaaata acaaaacttc aacaaataga atcaatcnca catgaatgga cacgatttaa 4920 tttaggatca accatgaaag taataaatag caaactttat actaattctg caccaaacgc 4980 aacgcaacca aattttctac acgccgaaat aagacgaaca gtacgaatac tagaggaaaa 5040 tgatttcgcc tgggaatcaa aaaaaataaa accaatttac ataatactac taaaagaaaa 5100 atccgaaacc actaacatcc tagcaaataa cgaaaccatg aaatggtcta ggattacatt 5160 aaacgaacca ccctataaac gcttttttac caataaagaa cgcgatacaa actacaaaat 5220 agcccacaat gcatacaact ttggagactg gtttagggac aaaatcgcca ctcaatatta 5280 caatggacaa ctacaaatca gaaactgtaa attttgcggg gaacaaacag acaatattaa 5340 acatattttg acggaatgcc aaattacgaa gctaatcatg aacgatatcg aaaanntaac 5400 aaccgactgc tgcgaaaaac cgactaaatt aaccaaatca gtaatattgt ataatcatac 5460 cacggagaat gcgcacccca atatatttgt tacaaaagta ataaatatat tcaaaactga 5520 aataattagg aaaaaattga aactagatat agctaaccaa tacatacaca atacatacca 5580 attcaaacaa caaactctat gggtgataaa cacaaaaatc aaacaaatca tacagcaaga 5640 attaatattg agaggaaaag aagatacgta caaaatattc ggcttaaaac atacgtacgt 5700 aatctaattt aggtnaacgt ttttgtcttt tctcccctga ataaaaattt ggaaaatcga 5760 tgaaaaatat tactcggaaa attacaatgg tcacttttat tttatataaa tatatatata 5820 taatatttaa gaaaaagcaa ggcaattaga tattttacct gtacatactt agagtaatat 5880 ttggaaaatt tcaattttac taacctccaa aacaagtgtt gcaggtcgtt cttgtcaatg 5940 aagatcgcct cttccagtac ccctctacag tgacctgcaa atatatgaag tatcgttaat 6000 tttttggtgt tgttgtgtat ttcatatttc aagtgtatta ttgttgtgaa ttattgtatc 6060 ggccgatttg tcggtgaaat gcgtcgttta gtcgcccaat aaaaaaaaaa aaaa 6114 // ID BEL-112_AA-I repbase; DNA; INV; 2925 BP. XX AC AAGE02020778; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-112_AA_; KW BEL-112_AA-LTR; BEL-112_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2925 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020778; Positions 122911 125835. XX CC 'ACCAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 500..2695 FT /product="BEL-112_AA-I_1p" FT /translation="MLEKQLEEEAKFVSERKKLHERFEDAKKFLAESLSNA FT EGATGAAKLKGDNELSFDKKVQFWLKQQNQGRSNATPKLAGSVAESEEYED FT PVEEETSKNEDHLDGCDAEDFGSAHEERQAFDRIVRNSRTHGPGETVPSCY FT SRQPQQTVKLSQEQLAARQAVSKNLPVFKGEAEKWPMFISCFEYTTKACGF FT SNLDNLKRLQDCLQGDALEAVRSRLVLPESVPDVINDLRRIFGRPEKLLKT FT LLTKVRNAQAPRADRLETFMQFGITVKQLCDHLEAAKLADHLNNPMLVQEL FT VEKLPSSYKMDWVRFKRGRIGSPLRIFTDFMNDIVADASEVSEFSTLFLGD FT CRETSQSSKGKPKRKEFMHVHNTSSNKPDLPVPFKSTKPCFVCKRTDHKLR FT FCDDFKKFSIADRLKVVDQHKLCALCLNNHGTSRCSFKIRCNISGCRGDHH FT PLLHHAEGSVQLMDIACNTHGALNRGIIFRMIPVTLYFENYALDVLAFLDE FT GSSSTLVENAVAHELNARGFAEPLVVTWTGNVKRLETKSKRISLMISARGS FT DDKLPLKDVRTVAELKLPKQSLQYERVADRYPHLKNIPVESYELEEPRIIL FT GLDNLSVFAPLESRSGSPGEPIGVKSKLGWTIYGPQSCEAVVSEIRVNFHS FT FEPMSNQQLHDAMRTQFLLDEPSLVSVMESAEDARARRIMQETTKRVGHRF FT ETGLLWRQDERRFPDSYPMAVSRLKALERSRI" XX SQ Sequence 2925 BP; 876 A; 618 C; 771 G; 660 T; 0 other; caaatcaaag aatttgcctc aaaagatgga ctcgacagta ggggattgca tgaaatgcaa 60 taaatccaac aaaatatgtg acatggtgca gtgtgatact tgccaattgt gggcgcacta 120 ctcatgcgtt ggagttacgg aaacaatcaa agactcggat tggagctgtg acaagtgtag 180 caacgagctg cagattccaa gaataccgag aaagatatcg agcaagaagg ggagctcgaa 240 gtcgaagagc gatggaggat caaccaagtc ttctctccag gatgggacct cggggctaaa 300 cgatgcgctg cgtaagctag aagccgaaca atgtgcaagg gagaaagcct tagaggagga 360 gatgactctt cgagagaaac gactcgaaat ggaacgaatt cttcaggaaa agcgtcgaca 420 gaaggaaaag gcatttcgcg aaaagcagct tcagcaggac agagagctga aagagcgcca 480 gttgaaggaa gaataagaga tgttggagaa gcaactggaa gaagaagcta aatttgtcag 540 cgagcgcaaa aagctgcacg agcgattcga agatgcaaaa aagttcttag ctgagagtct 600 ttcgaacgca gaaggtgcta ctggagctgc aaagctgaag ggcgataacg agttgtcatt 660 cgataagaag gttcaattct ggctgaagca gcagaatcag ggccgaagta atgcaacgcc 720 gaaactagcc ggctcggttg cggaaagcga agagtacgag gatcctgtcg aggaagaaac 780 atcaaaaaac gaagatcatc tggatggatg cgatgcagag gattttggat ctgctcatga 840 agaacgtcag gcgtttgata gaatcgtacg aaattcaagg actcacgggc cgggggaaac 900 tgtgccatcg tgctacagcc ggcaaccaca acagaccgtg aaactctctc aagaacagct 960 agcagcaagg caagctgtgt caaaaaatct gcctgtcttc aaaggcgaag cagaaaagtg 1020 gccaatgttc attagttgtt ttgaatacac cactaaagcc tgcggtttct ccaacttgga 1080 taatctgaaa cgcctccaag attgtcttca aggtgatgct ctcgaagctg tcaggagccg 1140 gttggtcctg ccggagtcag taccggatgt gataaatgac cttcgtcgta tttttggaag 1200 acctgagaaa cttttgaaga cgctgctcac gaaagtccga aatgcccaag cgccacgagc 1260 ggaccgcttg gaaactttta tgcagtttgg aataactgtg aagcagttgt gtgatcacct 1320 agaggcagct aaattagcag atcaccttaa caatcctatg cttgtccaag agttggtgga 1380 gaagttaccg tccagctaca aaatggactg ggttcgtttt aagcgtggca gaataggttc 1440 tcctctgagg atatttaccg acttcatgaa tgacatcgtg gcagatgcgt ctgaagtgtc 1500 agagttctcg acactctttc tgggcgactg tcgtgagaca tcgcaatcca gtaaaggaaa 1560 accgaagcga aaagagttta tgcacgtgca taatacatct tccaataaac ctgaccttcc 1620 ggttccattt aaaagtacca agccctgctt cgtttgcaaa aggacggatc acaagttgcg 1680 cttctgtgac gatttcaaaa agttcagcat tgcggacaga ctcaaagtcg tggatcaaca 1740 taagctctgc gctttgtgtc tcaacaacca cggcacaagt cgctgcagtt tcaaaattcg 1800 ctgcaatata agtggatgtc gaggagacca tcatccccta ctacatcatg cagaaggatc 1860 cgtgcagctc atggacattg catgcaatac gcatggtgcg ctgaatcgcg gcataatctt 1920 tcgaatgatt cccgtcaccc tgtatttcga gaactatgcg ttggatgttt tagcattctt 1980 ggatgaagga tcgtcatcta ctctcgttga aaatgctgtg gctcatgagt tgaatgccag 2040 aggcttcgcg gaaccgttgg tagttacatg gacaggtaac gtgaaaaggc tcgaaactaa 2100 gtccaaaaga atcagcctaa tgatttctgc acgtgggtca gacgacaagt taccattgaa 2160 agatgttcgt actgttgccg agctaaaact accaaaacag agccttcaat acgaaagagt 2220 ggccgaccga tacccacatc tgaaaaacat ccccgttgag agttacgaac tcgaagaacc 2280 gaggatcatt ctcggattgg acaacctaag cgtatttgct cctctcgaat cacgatctgg 2340 atcacctgga gaaccaattg gtgtgaaatc taaactaggg tggacaattt acggccctca 2400 atcgtgtgaa gccgtagtat ctgaaattag agtaaacttc cattcgtttg agccaatgag 2460 taatcagcaa cttcatgatg ctatgcgcac tcagtttctt ttagacgaac cctcattagt 2520 ttctgtgatg gaatcagccg aagatgcgcg agctcgtaga atcatgcagg agacgacaaa 2580 acgagttggt catcggtttg aaaccggatt actatggcgc caagatgaac ggagattccc 2640 tgatagctat cccatggcag tatcccggtt gaaggcatta gagaggtcaa gaatttaaaa 2700 gctggtgatc tggtttatgt tgttgacgga aacaagcgga agtgttgggt gcgcggcgtt 2760 gtggaggagc ccataccttc ggaagacggg agagtacgac aagctgtgat tcgcaccaac 2820 agtggagtac tgaaaagggc cgtagcgaag ctagcgctca tggagattag tgatggtgac 2880 gctgctccgg gaacgatatc cggaccaggt tcacgggccg gggta 2925 // ID Gypsy-26-LTR_NVi repbase; DNA; INV; 1147 BP. XX AC . XX DT 08-MAY-2009 (Rel. 14.05, Created) DT 08-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-26-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1147 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 989-989 (2009). XX DR [1] (Consensus) XX SQ Sequence 1147 BP; 342 A; 298 C; 240 G; 267 T; 0 other; tgtcacgtca acctactatg acctagactg accggtcaac gcacacgcgg ataacacgtt 60 caaccaacga agtacctcac cacggtaatt gggaatattc cacggcaggc ataacagagg 120 ccaacgaacc aacaacggtc ggcaacacag agcacgacaa aacaaccaag ccactgacac 180 gcgcggcagc agatgaaagg gcaacgatcc tgaataaact aggtcaaaca aggggagata 240 ccccagagct accggacaat aaggaggcca taaatctcag acaaacgacc caaataagtg 300 gaaatcgata caacaatgaa gggaaattac aataaagaaa accattacag ctggccagac 360 taggggccgc gtactgaggg acattgatta ataaaagggg cgcccacccc tactgagaca 420 tacgccctcc ctaaatcctg attggccaga accttcccta tgcttatgca ttaccctcat 480 ccctataaat accaatggag agcaaggagg agtattcttt cgtaacccgg agttcttact 540 ggtagggcga agctctgccc gactcgcgac ttaaactatt tttctgtaaa cgaacataga 600 gtcagagcgt tcgccaagtt tttggacata tttctacctc agacacgagt tgtactcagt 660 cgaggacttg tatcttgtac catttataat ctgacgagtt gaataaacca tttctctcaa 720 ttacttaatc cttgcgtacg aattattaca tttcaaactc cgcgcccttt gagacgagct 780 acagcgccag aacgagagcg agccacccta tcggtaacta caagtcagct tcgtcaaggt 840 aagtcgtttc tcctatcaat cttctcaaat tcacgtcagt gtcgcatgca acgttattgt 900 gtcagtacat agtgtttaaa cctagctagt ccaatacatt tagaagtcac tctgagtttc 960 gttactccgt gagctatcac tatcgcacag tcagccgagc tacaggtcaa gtcgcgcgct 1020 tctttacctt ttccctaggc ggtatcgcgt gagcggactg cgctaaataa aaggtctcat 1080 cgactcggaa tttgtaatta tttctatcgt gcgagtattc gctctgtgcg gccacccgga 1140 cgtaaca 1147 // ID Gypsy2-I_Dya repbase; DNA; INV; 4925 BP. XX AC chr2L; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_Dya; KW Gypsy2-LTR_Dya; Gypsy2-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4925 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1034-1034 (2009). XX DR Genome; chr2L; Positions 21207534 21202610. XX CC Positions [2476-2904] - Reverse transcriptase CC Positions [4081-4530] - Integrase core CC 'CTAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2182..2919,2923..4077) FT /product="Gypsy2-I_Dya_1p" FT /translation="MGVLFKEKEKDFATAFKGMRVTEIIDESSVLDKIDLS FT HLEKSQKLKVENLVKDYKPVKSLASPVNMKIILEDDVPVYQRPRRTSYENK FT CFIDKQVQDWLSEGIIVPSCSEYASPVVLVSKKDGSKRFCCDYQKLNQKIK FT KDNFPMSLIDDVLDRLQGSKILTTLDMMNGFFHVPIDEDSRKFTAFVTHNG FT QYEFTHVPFGISNSPAVFCRYVSEVLRELIQNQTIVVYMDDIIVPSKDEND FT GIKALIVLSAASKAGLRIKWEKAQILKRRINFLGYTIENGTISPTDEKIKA FT VVNFPIPQSRQALQRYLGLTSYMRRFIEDYALAAKPLSDMLRKEGNQKNDR FT NLVMTELGIASFNHLKQLLVSAPVLKLFDPLAVTEVHTDASKYGFGGVLMQ FT KDPDDQQFHPIEFMSRKTNNCEEKYSSYELEVLAIVNALKKWRVYLLGKPF FT TVVTDCNAFAMTIKKDDLPPRVARWAMSLQEFEYKVEHRAGTKMRHVDALN FT RLSCFLIADSTKSRLIGAQNRENWIKAVKVILASNKEHEDFYIKNNIFCKD FT PVNELIVVPDSMVDEIIATAHKEGHFGIKKTRDILEKSYYIPGILPKVERV FT VRACVECLIIDSKRGKKEGLLSPIDKAQEPLGT" XX SQ Sequence 4925 BP; 1735 A; 873 C; 1176 G; 1141 T; 0 other; aattattggg ggctcgtccg ggatcgttca aagaataaaa ttaataaaag tgcattaaca 60 ctaacgaatt tgagagaacg aaggagaaga taaactggcg tgcgaccatt cgacgaagca 120 aaactggagc aaattctgga tcttttctcg acaaagaaga gaccctttga aagaatattt 180 aagtgacggt tgcaacgacg acgcaagagc aacagtgaga ctagtagaaa gtctttcgac 240 gtgcacatga aagttcttgg gcattttgaa gacgtgtcga aacaagaatt tttcgtggtg 300 aaattccgta tcgaagagcg gatagagctg gcgaaagaaa cgacgaattt taaaggaacg 360 acaacgagag gataacgaca aagagaggat aacgacaacg agaggataac gacaacgaga 420 ggacaacgac aacgacaaac caaataaaga aggaaacagt aaacgagtag aggagatcat 480 acgacgcaaa aacaagagaa ggatcgacgg cgtgaaaaaa tacgtgcatt tcgacgggaa 540 cgacgaagaa ataaaaacag cagctgagac gcaaatagag gcaagacttg gagctggcaa 600 tattgcaaca actagttttg cacagcgacg agtatataag tacatatttt tgaacaaaac 660 gacgaaataa cgaagagcag gagcagcgac ggcgacgtaa gagcaatcga cgacgtgatc 720 tgcaacgcaa tacggaccga ggttacggcg aagacgaact tcgcaacgca gaagaactta 780 aatacgacga cattgaagag acgagaagtg cggcggaaga gagtcgtaac aactggttgg 840 gtgagcgaga ggctttgttt aacattgttt aatatttttg aattcattgt ttaacattta 900 aattaatttg taaacggctc acgcactaac atggatgcta gacaaattct cgaattgacc 960 gtgccttttt ttaaaagcaa aattaacgga attgggtttg gataccaatg ggcgaaagag 1020 cgctttgcag gacagattat ttgaacgcta tggcatagat gcgggcggtg aggaagagga 1080 tgcagtaagt gtttcgggtt catctaatta tcaagatgtt agtcaccacg taggctcgag 1140 atttactcta cgcgatgtcc aagatagcat ttcatcattt tctggctcag accacgaaga 1200 cgtcaatcat tggttgtctg agttcgaaga cgttgcagta accgtaggct gggacaattt 1260 gcaaagattt atctatgcta aacaactttt gaccggagcc gccaagcttt atatcaaaag 1320 cacaaaccga attagagggt gggtcgagtt acagcgcgcg ttaaaagatg agttcggcaa 1380 aaaattatgc tcggctgaga ttcataagct tctgcggcaa aggcaaaaac aaccaaaaga 1440 gacatgcatt gagtttttgt acaacatgat ggagatcagc aaatcgataa atcttgatga 1500 ggagagttta gtttcttatg ttgtagaagg cataccggat ggcaggataa ataaggctgg 1560 gctgtatcgc gcaaaaacta tcagagagct gaaggacgaa atttctattt acgaaaagat 1620 aaagggaaaa ggaattaata aggaaattgt taagaaagag gtggcagtga aaaaaggaca 1680 gagtaatggc cagagaaaat gctttaagtg caacagtgaa ggccacattg ccagagattg 1740 taagggcgac gtagtttgct atcattgcaa ccagaaaggg cacatatcca agaactgcgt 1800 ggaaaagaaa aataatgttg tcgtcaagaa tgagaaagcg aatgtcctgg aagtcaatgt 1860 tgaaaaaggc tcaatataaa aaaatgatag tagcaaatgg tttgacattc gtagcgatga 1920 tcgattcggg atgcgacttg tgcttgatga gggaggatat tttcagaagc tttgaggatt 1980 taaggctgaa gcctgaagcc taaacatctg agaggcattg gcaaaggtga gctaaggaca 2040 ataggatgct ttgatattaa ccttcaggca gatggcgtat gctttacctc aacttttcac 2100 gtagtaaaac agaatgaatt ggagtacgca gccattattg gaaatgatgt gctccaatgt 2160 atcgatgtcg tgttcagttc gatgggggtc ttatttaaag aaaaagaaaa ggatttcgct 2220 acggcattca aaggaatgcg tgtcactgag atcatcgatg agtcctccgt gctcgacaaa 2280 atcgatctat ctcacctgga aaagagtcag aagctaaagg tagaaaatct agttaaagat 2340 tataaacctg taaagtcatt ggcgtctcca gttaatatga agatcatatt ggaagacgat 2400 gtacccgttt atcagagacc acgacgaact tcgtacgaga acaaatgttt catagacaag 2460 caagtgcaag actggctttc agagggaata atcgtaccaa gttgctcaga gtatgcatcg 2520 cctgtggtac tagtctcaaa aaaggatgga tcgaagagat tttgttgtga ctatcagaaa 2580 cttaaccaga aaatcaagaa ggacaacttt ccaatgagcc tgatagacga tgttctggat 2640 agattgcagg ggtcgaaaat cttaaccact ctcgacatga tgaacggatt tttccacgtg 2700 ccgatcgacg aagattcaag gaagttcaca gcattcgtga ctcacaacgg acagtatgaa 2760 tttacgcatg ttccatttgg catctcaaat tccccagctg tgttttgtcg ttatgtttca 2820 gaagtcttgc gagaactcat tcaaaatcag acaatcgtag tatacatgga cgatataata 2880 gtaccaagca aggacgagaa cgatggaatc aaggccttgt aaatcgtatt gagcgcagcg 2940 tcgaaggcag gtttgcgaat taagtgggag aaagcacaga ttttgaaaag aagaatcaac 3000 ttcttaggct atacgattga gaatggaacc atatccccaa ctgacgagaa aatcaaagct 3060 gttgttaact ttccaatccc tcagagcagg caagctttgc agaggtactt gggcttaacg 3120 tcatacatgc gacgctttat cgaagactat gctttggcag caaagccttt gagtgacatg 3180 cttcgaaagg aaggcaatca gaagaacgat aggaacctcg tgatgactga gttgggtatc 3240 gcttccttca accacttaaa acaactattg gtatcagcac cagtactgaa actgtttgac 3300 cctttagctg ttacagaagt acacacagat gcgagtaaat atggctttgg cggcgttttg 3360 atgcagaagg acccagatga tcagcaattt cacccgattg aatttatgag ccgcaaaaca 3420 aacaattgtg aggagaaata cagttcctat gagctcgagg ttttagccat tgtgaatgcg 3480 ttgaaaaaat ggagagtgta cttgttagga aagccgttca cagtggtcac agactgcaac 3540 gcttttgcaa tgaccatcaa aaaagacgat ctgcctccta gagttgcaag atgggcaatg 3600 tcattgcagg aattcgagta taaagtggag caccgagccg gaacgaagat gagacacgta 3660 gatgcactca accgtttgtc gtgctttctt atagcggatt ccacaaaatc acgcttgatc 3720 ggagcacaga acagggaaaa ctggataaaa gccgtcaaag ttatattggc aagtaataag 3780 gagcacgaag atttttatat aaaaaataat atcttttgca aggatccagt aaatgagctg 3840 atagttgttc ccgactcgat ggtagatgaa atcattgcta cagcacacaa agagggtcac 3900 ttcggaataa aaaaaaccag agacattttg gagaagtcct attacattcc aggaattctg 3960 cctaaagtag aaagagtggt tagagcatgt gtagagtgcc tgatcattga ctctaagcgc 4020 ggaaaaaagg agggattact atcgccgatc gacaaggctc aagaaccact cggaacctaa 4080 cacatcgacc acgttggccc acttaccgat acaaaaaaga agtataacca catattagca 4140 gtcgtggatg gtttttctaa atttgtttgg ctttatccaa caaagtcaac aggaacaaat 4200 gaggttatag aaaaactcga gaagcaagca gcagtttttg ggaaccctcg cagaatagtt 4260 acggacagag gaaccgcact cacttcagga tcctttactg aatattgcaa tcgcgaaagc 4320 attcaactac tccacattgc gacaggtatt cccagaggca atggacaagt cgaacgaata 4380 cacaaaattg tcatcccgat gctcggcaaa atgtgtcaag aaaacccact aaattggtat 4440 cgccacgtag acagagttca acaaatcata aacaatactc cacctcgaac cactaaatat 4500 agtccattta gaattttgac aggattgaat atgagactta agctgaatga tataaaagac 4560 ctaatagtcg acttagaact agaagaattg aacgatgaga gggaagagat tcgaaagaaa 4620 gcacaggaaa acatagaaaa agtgtagcaa gagaatagga aaaactttaa tgagaaaaga 4680 aaggttgaaa atgaatatga agaaggagat ttggtagcca ttgaaaaaac acagtatgga 4740 acaggaatga aacttcgtcc aaaatttttc ggtccgtata aaattacaaa aaaaaactaa 4800 accacggtcg ctatgaagta ataaaggaag ggaatcagga agggccaaaa acaacaacga 4860 ctgcagccga atatataaaa aaatggagtt cattcagggg cgaatgaata ttcaggatgg 4920 ccgat 4925 // ID DNA8-94_AP repbase; DNA; INV; 419 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-94_AP. XX NM DNA8-94_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-419 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2031-2031 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 419 BP; 170 A; 43 C; 49 G; 157 T; 0 other; cagtgttgag ccgagataaa ttataataat tttatctaga taaagataaa gataactaaa 60 ttatttattt atctagataa agatgaaaaa aatttttttt atctagataa attatcggat 120 aaattttttt atattggttt atggcactct ccgtaagctc gtagcaatat agaataactt 180 aatagtatgt tatcgaaaat gtcgatagtt attagtaaat actacctact taataaatta 240 cttaattgtc tacctataaa ttataataaa taaaagccgt ttaataaatt aaattatctg 300 gataaattca tctagataac ggtgattttt atcttagata actgtgtaaa ttaatattct 360 ctatttatct agataagata aaagataaaa gaatatttat ctagataaac tcaacactg 419 // ID Academ-1_HM repbase; DNA; INV; 6815 BP. XX AC . XX DT 30-APR-2010 (Rel. 15.04, Created) DT 30-APR-2010 (Rel. 15.04, Last updated, Version 1) XX DE This family belongs to the Academ superfamily of DNA transposons. XX KW Academ; DNA transposon; Transposable Element; Academ-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6815 RA Kapitonov V.V. and Jurka J.; RT "Academ - a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 10(4), 645-645 (2010). XX DR [1] (Consensus) XX CC Academ is a novel superfamily of DNA transposons that populate CC genomes of metazoans, including cnidarians, insects, sea urchins, CC lancelet, and fish. The autonomous Academ transposons encode a CC ~1500-aa protein composed of a novel Academ transposase domain, CC which is not similar to transposases encoded by any other CC transposable elements reported previously, the XPG domain, and CC the putative Cys8 zinc finger. The XPG domain is structurally and CC functionally related to FEN-1; divalent metal ion-dependent exo- CC and endonuclease, and bacterial and bacteriophage 5'-3' CC exonucleases. The Cys8 zinc finger is a conserved set of eight CC cysteines: CC Cys-X-Cys-X3,4-Cys-X3,10-Cys-X-Cys-X6-Cys-X3-Cys-X1,2-Cys. CC Academ transposons generate 3-bp target site duplications and CC contain terminal inverted repeats whose length varies from 6 to CC 530 bp. CC Usually, Academ transposons have the 5'-TAG and CTA-3' termini. CC Academ-1_HM is a young family. The consensus was derived from CC multiple alignment of several copies >95% identical to it. CC Imperfect TIRs are 14 bp long. XX FH Key Location/Qualifiers FT CDS join(553..714,949..1272,1368..1424,1486..2235, FT 2337..2684,2878..3120,3360..3644,3834..4070, FT 4220..5461,5710..6333) FT /product="Academ-1_HMp" FT /note="Contains the Academ TPase, XPG nuclease, and FT Cys8 zinc finger." FT /translation="MNLKLKIKDLLCTRGRKSTRKCNSSNSNRDRNKTHIL FT PEICLVCEKVEKYFMDKMQLKRSVQYLTLAETKDAGLLHEAAEQKNDGRIL FT LQIKDKDCVAVGVRYHKICYTLYTQEVRNIQRCPKPNLDISKNYDIAFQKL FT CEDIVMERIINNNEIMTLKSLLKYFINQVIDLKVKLFIYRTVCNSHDDNTK FT EIPKQNISIELIHFYMVALELRKVIQGIADFTGIWPPVSHLFTIENCVESV FT PIVLYNFLCLVLGFSDEPYLYNQRHSITTPLMLKVISIAQDLIYVASKGRT FT NTYKHLALGMTVRQLTGSSKLIDILNGFGHCVSSSTVTRHESALAALNVMV FT NTIVPCNVAKKKFTTLVYDNADFLEETLSGSGTTHVTHGICIQEKYNCNFS FT NSIQVSRRLKTVPLPKQVVQPYHLGKKPSFSVSYDSQLPTISSEILPSWTG FT LNTIMSDKNMLTTIAYLPVIDAPVTEISTINEILKQALTIANTLELAHIVL FT VFDEAVYSKIQLVRWKTEEYLSRIVVRLGDFHMLMSYCSGISKIYADAGMQ FT DIFIESGIVASGSINGVLSGKHYNRSVRCHKTLYEALQRLCFQSFLDSLVN FT EENCDIIEFLSAMRECIVKENDDFYYHNYKDYIESQKFENLCQRYKDFVDV FT QCQENATFNFWYNYIDMIQILLLHIRATRTGDWALHLSAVRSMLPWFFITD FT RVNYARYATSYWVEMKRLAITHPVNDEIHNNWTSQRQEQYGFSRVACDQTI FT EQTFNRDSKTQGGIVGFTRKRSAVHRWIMAQHQRCAIFKQCEIISGSISHT FT QRKDLGVSRIKKDENAILDVLSTIKTMANPFDLETSVLLHLSTGAVATVPL FT TTDMRNMLCTGEEKVVSFINTRILSSEQDLHSSIPKSKIVTFSSMLKKYSA FT KTSNGDLVTVKNTKDLFAKLILIAKSRNVEMREVLKYPLRPYPMPFATTTG FT GLVKTQKSKLISLIETPVVDAIVDNVEKGNALMIDAMALLQTLKIITPTFK FT ELSDFLLCTVISMGNSFGSSRIDFVSDRYPEVSIKDLERDKRASLGSQTIK FT IMNPNQKVPKQWKKYMSVGSNKEDLVDFLYQSWQLSDPSLFCGISVYITHG FT EFCHQFVPILKTVVVNEVAALHCNHEEADTRLLAHVEHAITNGIKNIIIQS FT PDTDVFVISLSNTFCKDSCLYFLTGIGNKRRIISINAVIQHWNKNWCQAFI FT GFHTFTCDTTSSFMGKGKGKPLKVLFEYTEFLDTFSRLGDSFIISESVLSS FT LEKFVCYMYQKNSTCNSVNEMRYQLFKSGIYDEELLPPTLDSLRCHIKRVN FT YQSYIWRNATKPIHNLDDFQNHGWMVSESGIAIEWLMNPVAPDSILNFVKC FT NCTTGCDTKRCSCVKSALKCSELCNCSKCDNTAFADDEDVYDNGVEDYDEY FT EDEFLEDDLL" XX SQ Sequence 6815 BP; 2382 A; 986 C; 1101 G; 2346 T; 0 other; tagaccacag gccatcgatc tgtatcgttc agtttcgtag tgcagtaatg agctctaacc 60 gttttacaaa ttatttcgcg ttttaaggta taataaattg atgggtaaaa tttctcaact 120 tgtaaaagta actgatttat tgtttgtgta ttttacagaa cacaataaac tctaaaaaaa 180 actttaaatt atgttcgaca aattacacaa aaacacaata taattaattt tttttcatta 240 tccataaaca aaatgtcagc cgagtgccac aaaccgttct gtattgttca tttaactgga 300 tgcagtggcg tgttaaataa gttttctaaa aaaacgtttg aaaatttttg tataaaaaga 360 acgaaatggc tttctcttgc ggaggaaggg aaaactgttg cggaaatttc tctcaatttt 420 gttgggaaag atgtttcatt tgagaatctt tatgctacaa caacaggaag cacttgctct 480 ccatattatc atttaaaatg taccaaaagt tcactgacaa agtccttatt gcaagaatag 540 aaaaaaaaat taatgaatct aaagctaaaa ataaaagatc tactttgcac tagaggtaga 600 aagtccacaa gaaaatgcaa cagctctaac agtaatcgtg atagaaataa gactcacatt 660 ttacccgaaa tttgtttggt ttgtgaaaag gtggagaagt actttatgga taaggtaaga 720 ttgtctgata atatatatat atatatatat atatatatat atatatatat atatatatat 780 atatatatat ataatttgtg tgtgtgtgtg taatatgtat agcttagcaa ttctttacag 840 atcatatatt gcatagctac tcatattatt ttatctatta aatatactat tatattatat 900 gtatattttt gaattctctt taatttactt attaaaattt tttattagat gcaactaaag 960 cgatctgtgc aatatctaac tctggcagaa acaaaagatg caggattact tcatgaagct 1020 gcagagcaaa aaaatgatgg gcgtatactc ttgcagataa aagacaaaga ctgtgttgct 1080 gtaggggtaa gataccacaa aatatgctac actttataca cccaagaggt acgcaatatt 1140 cagcgatgtc caaaacctaa tttagatatt tctaaaaatt atgatattgc ttttcaaaaa 1200 ctttgtgaag atattgttat ggaaagaatt ataaacaata atgaaattat gactctcaag 1260 agccttctaa aaaagttttt ttataaatca atttgaaaat agtgacataa taatatacca 1320 gtcaactaga ttaaagaaca gacttgtttc taaataccct aatttagtat ttcataaacc 1380 aagtaataga tctgaaagtg aaattgttta tatatcgaac tgtaatgtaa gatttgctgc 1440 agaatctttt gttaataata gttcaactga aaatacagat gaagatgtaa tagtcatgat 1500 gacaacacaa aagaaattcc taaacaaaat attagtatag aattgatcca tttttacatg 1560 gtggcattag agttgagaaa ggttattcaa ggaatagctg atttcactgg gatttggcct 1620 ccagtttcac atttgtttac cattgaaaat tgtgttgaaa gtgttccaat agtactatat 1680 aactttttat gtcttgttct tggtttttct gatgaacctt atctttataa tcaacgtcat 1740 tctattacaa caccacttat gcttaaagtt atttctattg cacaagatct tatttatgtt 1800 gcatcaaaag ggcgaaccaa tacttacaaa catcttgcac tcggtatgac agtcagacaa 1860 cttactggat ccagcaagct tattgacatt ttaaatggat ttggtcattg tgtatcaagt 1920 tcaactgtca ctcgtcacga atctgcgtta gcagcattaa atgtgatggt taataccatt 1980 gttccatgta atgtagcaaa aaaaaagttc acaactctag tatatgataa tgctgatttt 2040 cttgaagaaa ctctctctgg atctggaaca actcatgtta ctcatggaat ttgcattcag 2100 gaaaaataca attgcaattt tagcaactct atccaagtta gcagaagatt aaaaactgtt 2160 ccattgccaa aacaagtggt tcaaccatat catttaggaa aaaaaccatc atttagtgtt 2220 tcatatgata gccaaagtta ttttactcaa aataatctat tgatttgaca ccttccacaa 2280 atgtcaaaga acaattgcat tttattaaat atgatctggc ctactgtttt atcaagttgc 2340 caactataag ctcagaaatt ctaccttcat ggacaggttt aaacactata atgtctgata 2400 aaaatatgct aactactatt gcttacttac cagttattga tgcacctgta actgaaatat 2460 caactatcaa tgaaattttg aagcaagcac ttacaattgc taacacgctt gaactagcac 2520 atattgtgtt ggtttttgat gaagctgtct actcaaaaat acagttagtt aggtggaaga 2580 ctgaagaata cttaagccgt atcgtagtac gtcttggtga tttccatatg ttaatgtcat 2640 attgtagtgg aatttcaaaa atttatgctg atgcaggaat gcaggtaatt tttttgaaat 2700 ttgtatagat taacacaatt ttaactgtaa aaataattat tgtacattat attgtataca 2760 atattacaat atacaatatt gtatatatat taggttgaaa cttatataca acacagtaac 2820 attaaagaaa tttttttgca atataattag gcataaacag ttattaaaat ttattaggat 2880 atatttattg aatctggaat tgtggcatct ggttccatta acggagtcct atcagggaaa 2940 cattacaatc gatctgtgcg ttgtcataaa acattgtatg aagcacttca acgcctctgc 3000 tttcaatcat ttttggattc cttggtgaat gaggaaaatt gtgatatcat tgaatttttg 3060 tctgctatga gagaatgtat tgttaaagaa aatgatgact tttattatca caattataag 3120 gtattttagt gaaactttat ttttcttttt aaggcttttt aaaatacact ttaagataaa 3180 aacaaacaaa aaacttctta aaacttcttg caaaattttt tctaaaatta tagtataaaa 3240 aaatttgtat cactttatat ttatagcagt aaatattgca tattatttct cataaaagga 3300 aattaagggg atattgttaa acaaaaattt gaataaacca aaatgtaatt atttttcagg 3360 attatattga gagccagaag tttgagaatc tgtgtcaacg gtacaaagat tttgttgatg 3420 tacagtgcca agaaaatgcc acattcaact tttggtataa ttacatagat atgatacaaa 3480 ttttgctttt acatatacga gctacgcgca ctggagattg ggcgttacat ttatcagctg 3540 ttagatcgat gctcccatgg tttttcatta cagatagagt taattatgcg cgatatgcta 3600 cttcttactg ggtggaaatg aagagattag caataactca tccaggtagc tttaatacat 3660 ttagctttaa aataaatatt tggtaatctt cctttgtttt aagaatatgc gtagtgtatt 3720 cattgataat gcaaaatgca actacatagt ttaaattgtt ttatataaat aataatataa 3780 tttcaatttc tgaaaatata ctctagctta ttttaatata atatatttta gctgttaatg 3840 atgaaattca caataactgg acttctcaac gacaggaaca atatggattt tctagagttg 3900 catgtgacca gacaattgag caaactttta atagggattc gaaaacacaa ggaggcattg 3960 ttgggtttac tcgaaaacga tcagctgttc atcgttggat catggctcaa catcaacgct 4020 gtgctatatt taaacaatgt gaaatcattt cgggtagtat atctcataca cggtaacttt 4080 gttttcttac tttttgctta attttaataa agtaatctct ctctttacta tctctcttta 4140 ctatcatagc ttcacactgt attgtttact gtatttagag tagtagttag taaaagattc 4200 aattatttca tgttgtaggc agaggaaaga ccttggcgtt agtaggatca aaaaggacga 4260 aaatgctatt cttgatgttc tatcaacaat taaaaccatg gcaaacccat ttgatttaga 4320 aactagtgtc cttcttcatt tatctacagg agcagttgca actgttccac tgactacaga 4380 tatgagaaac atgttgtgta caggcgaaga gaaagttgta tcatttatta acaccagaat 4440 actatcaagt gaacaggatt tacactcatc cattcctaaa tcaaaaattg taacattttc 4500 cagtatgtta aaaaagtatt cagctaaaac ctcaaatgga gacttagtta cagtaaaaaa 4560 cacaaaagat ttgtttgcaa aactaattct aattgccaaa tccagaaatg ttgagatgag 4620 agaagtttta aagtatcctc ttcgtccata tccaatgcct tttgctacaa caactggtgg 4680 cttagtaaaa acacaaaaga gcaaattaat cagtttaatt gaaactccag ttgttgatgc 4740 tatagtagat aatgttgaaa aaggtaatgc tttgatgatt gatgctatgg cgttactaca 4800 gacgttaaaa attatcactc caacatttaa agagttgtct gattttcttc tgtgtacagt 4860 tatttctatg ggaaactcct ttggatcatc tcgtattgat tttgtcagcg acagatatcc 4920 agaagtcagt ataaaagatc ttgaacgaga taaaagagct tcattgggaa gccagacaat 4980 caagattatg aaccctaacc aaaaggttcc aaagcagtgg aaaaaataca tgtctgttgg 5040 aagcaacaaa gaagatttgg tagacttttt atatcaaagc tggcaattga gtgatcctag 5100 tttgttttgt ggaatttcag tatacataac acatggtgaa ttttgtcatc aattcgtccc 5160 tatattgaaa actgttgttg tgaacgaagt agctgctttg cactgcaatc atgaggaagc 5220 agatacacgc cttttagctc atgttgaaca cgccatcacc aacgggatta aaaatatcat 5280 tattcaaagt ccggatactg atgtgtttgt aataagctta agtaacacgt tttgtaaaga 5340 tagttgtcta tactttctca ctggaatagg aaataaaaga aggattatct caataaatgc 5400 tgtcatacag cattggaata aaaattggtg ccaggcattt attggttttc atacatttac 5460 aggtattttt tattaatcaa gtgattagta tttatttcag ttttcttatg attaattata 5520 tattagtagg aagttctttg tgattatatt tagtaggtta gttgtttgag tttgaaacac 5580 aattctccag gttcgaaccc ttggggtcat ttgccaatta gatggatttt attttcttat 5640 tatgcaacaa gatatatatt acaaaattgt atgctatgaa tttctctaaa atatttaaat 5700 tttttaggtt gtgacacaac aagctcgttc atgggaaaag gtaaagggaa accccttaag 5760 gtcctctttg agtataccga atttcttgac acatttagtc gattaggaga ttcctttata 5820 atttcagaaa gtgttttaag tagtctggag aagtttgtct gctatatgta tcaaaagaat 5880 tctacttgta attctgtgaa cgaaatgagg tatcaattgt ttaaatccgg aatatacgac 5940 gaagaactgc tcccaccaac tttagattct cttagatgtc atataaaacg agttaactat 6000 caatcatata tatggcgaaa tgcaactaag cctattcata atttagacga tttccaaaac 6060 catggctgga tggttagtga atcaggaatt gccattgaat ggctaatgaa tccagtagca 6120 ccagacagta tcctgaactt tgtaaaatgt aactgtacga caggttgtga tactaaacgc 6180 tgttcttgtg taaaatcggc tttaaagtgt tcagaactat gtaactgctc aaagtgtgat 6240 aacactgctt ttgctgacga tgaagatgtt tatgataatg gtgttgagga ttacgatgag 6300 tacgaagatg aatttttaga agatgatctt ttatagacca aattcattac ttttaagtaa 6360 tagtttttaa ttgttgtatt tacgtatatt ttaaaaggct tttttgaaaa gaataaaatt 6420 ttaaaatatt gtttaactgt ctaaatcggt tatgaatcga taaaatcgct atttttacac 6480 gataataaca cgttagtgcg aaacttttct ttaataatat atattttttt aaaattgtct 6540 taaaaccaat caaaattgct aatccaatta cattctgaaa aaaaaaaaaa aaaaaattca 6600 tagttaacag ctaaaatcga taaaaatcga tagatcgttt tgatggtttt ttaaagctcg 6660 acataaaaaa aactgcgggt ttttgaactg agaaatttat agtttataat cttaacacct 6720 taaaacgctc atttgcatca aaatttttta gagctcatga ctgcaccacg aaaattgatt 6780 ttaaagcatc tttttatgct ttggcctacg gtcta 6815 // ID DNA8-40_AP repbase; DNA; INV; 634 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-40_AP. XX NM DNA8-40_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-634 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1970-1970 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 634 BP; 241 A; 73 C; 69 G; 249 T; 2 other; cagtgttggg aattatctag ataagatttt ttgaaataaa ttatctttta aagtatctag 60 ataattatgt tacgatttat ctaatgttta gcttagataa aaaatgggct tatctagata 120 aatattatag tgcccctaca aaattataaa accaatgtta ctaaaattct aatataatat 180 aaattattta taaaattaga agaatatatt tcttgttatt attgccaacg acggtattac 240 gataataaga gaacctgcat gcgggcttgc actatacatc gttctctact agccataatg 300 agcatattaa aattaattag gtataggttt atttcccnat ccattttaaa aataaattaa 360 aattaacttc gtagaactgt atttcaaagg ctcaatccgt tgttattata aatattatgt 420 cattatattg atataaatct aatattacaa aaatactaaa aaacaataaa aataagtgta 480 aaaaaatgat atttatctag ataaatntat tgttgtattt atctttatct agataaatta 540 cgtttatgtt atctttatct ttatctagat aatattttgt taaaattatc ttatcttatc 600 tagataattt tttagttatc tattcccaac actg 634 // ID MSAT-2_AAe repbase; DNA; INV; 276 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Minisatellite-type sequence: consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-276 RA Jurka J.; RT "Tandemly repeated DNA from Aedes aegypti."; RL Repbase Reports 11(4), 1448-1448 (2011). XX DR [1] (Consensus) XX CC 23-bp unit. XX SQ Sequence 276 BP; 72 A; 60 C; 60 G; 84 T; 0 other; tctcaggatt ctgagagaat ccttctcagg attctgagag aatccttctc aggattctga 60 gagaatcctt ctcaggattc tgagagaatc cttctcagga ttctgagaga atccttctca 120 ggattctgag agaatccttc tcaggattct gagagaatcc ttctcaggat tctgagagaa 180 tccttctcag gattctgaga gaatccttct caggattctg agagaatcct tctcaggatt 240 ctgagagaat ccttctcagg attctgagag aatcct 276 // ID Copia-39_DPu-I repbase; DNA; INV; 3451 BP. XX AC ACJG01005078; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-39_DPu_; KW Copia-39_DPu-LTR; Copia-39_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-3451 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01005078; Positions 12500 9050. XX CC Positions [1660-2184] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 583..2805 FT /product="Copia-39_DPu-I_1p" FT /translation="MTKIICTLPPIYQSFVTAWDSVPFADRTMALLTSRLL FT KEEEMAKRWNTGKPNSQDAAFFAHQNPSYPQTFNLNSCGRDRSRRGGRSNY FT RQQPYRFCKYRRCNIAGHTIEVCRIRIRDEEDARKEKNNASAAIAASNQEK FT EDADNPEHHLNDDAYFSSSCFIGRSNSDWFLDSGATQHMSDQRSFFCTFKP FT VLSDSWTVSGIGSTRLFVRGYGSIEFIVLVGTIKRIVTIENVLYVPSLGTN FT LISIAAVTAVGLSVHFIETKVTFKKNKAVVMMGERIGKTLYHLSILPNLSE FT MQQQTDTALLSVPPSTPINVWHQRLAHKSYKTILKMSSKQLVKGLQLPANI FT SIPKQPCLGCVSGKMQRSPFPVGRERANKVGQLIHSDVCGPMHVPTPCGAK FT YFALFTDDYSGWQAVYFLKQNSEVAESFKNFVNTLRSETGQIVHTLRADNG FT GEFTNQSLKAWLSDRAIRIETSAPHSPEQNGVFERANRTIMEGTRSLIYAK FT HLPLELWAEAIACAVYSLNRVCNSTSPLTPYENWYGKKADFSHLRIFGSTA FT FIHVPKTERRKLESKIFKCYFVGYSLTQKAYRFWDPIGRKIKISRDVIFDE FT NCNVFSHFSPQLDDLCELYPKPTKPATTPQQQITNIADVMPLVAGEMDIQP FT ESPPDCSEQPDTTSLHPEDDPAPTEDIPPDIGHSTERGHDPLPSSTPPIRH FT SPYPLRVHGAHFFQLMSKRQKYMSRHPIQMQWSMQMHIYGK" XX SQ Sequence 3451 BP; 1067 A; 806 C; 673 G; 905 T; 0 other; ggttatgggc ccagtacctt atagcagtta ttgataacat acaagatgaa ttcaatgcga 60 gatgtcaatc atattgccaa attcaatggt cagaattttc ctctttggaa acttggattt 120 tggattcttc tccaacaaca cgagttggtc aaagttgtaa ctggagaaga acaggttcct 180 actgaggtat acagtaacat ttatcttaca taaatgatcc agtagatgtt ttacatgcct 240 tatattaatc catagatacg caatgaagca caagtagtca ctaatgctgc tgctatagca 300 agctggcaca ccaaagacac acttgcccgt ggttatctca tctcaactat tgaaagtcag 360 caacaaagat ctctaatcaa ctgcaacaca gctcacgaga tgtgggtacg actatcggca 420 caatacctta gaaatgctgt tgagaatcgg tatgtttttc aaaccagatt ctttgagtat 480 cgtttcaagc cagacaacga catcatggcg catataactg agatggaaac catggccaca 540 taacttacta atattggagc agaaatcgac gccatttcta tcatgacgaa gatcatatgt 600 accttgccac ccatctatca aagttttgtc acagcgtggg atagtgtccc atttgcagat 660 aggaccatgg cattattaac atcccggcta ctaaaagagg aagagatggc gaaaagatgg 720 aacactggaa aaccgaattc tcaggacgca gctttcttcg cccaccaaaa cccgtcttat 780 ccacaaactt tcaacctcaa ctcatgtggt cgtgatcgtt ccagacgtgg tggccgttct 840 aactacaggc aacaaccgta tagattttgc aaatatcgac gctgcaacat tgctggccac 900 accatagaag tttgtcgaat aagaattcga gatgaagaag atgcaaggaa agaaaagaac 960 aacgcaagtg ctgccatagc tgcatcaaat caagaaaaag aagatgctga taaccctgaa 1020 catcatctca atgatgacgc ttatttttcg tcttcatgtt tcattggacg aagcaattca 1080 gattggttct tagactcagg tgccacacag cacatgagtg atcaacgttc ctttttctgc 1140 accttcaaac ccgttctttc tgattcgtgg actgtaagtg gaatcgggtc tactcgtcta 1200 tttgtgcgtg gatacggaag tattgaattc attgttttag taggtaccat caagcgaata 1260 gtcactattg aaaatgtcct ttatgtaccc agtcttggca ccaatttgat ttcgattgct 1320 gctgttaccg ctgttggcct gtctgtacat ttcattgaga ccaaagtcac ctttaagaag 1380 aacaaagcag ttgtcatgat gggcgagagg attggaaaaa cgctttatca tttgtctatt 1440 cttccaaatt tatcagagat gcaacaacag acagacacag cactcctttc agtgccacca 1500 tctacaccaa tcaacgtgtg gcaccaaaga ttggcgcata agagctacaa gaccatccta 1560 aagatgtcat ctaaacaact ggtaaaaggt cttcaacttc cggccaacat cagcataccc 1620 aaacaaccat gcctgggatg cgtttccgga aaaatgcagc gttccccttt tccagttggg 1680 cgtgaaagag cgaacaaggt tggtcaacta attcattccg acgtgtgcgg accaatgcac 1740 gtaccaacgc catgcggagc aaaatacttt gcactcttca ccgacgacta cagtggatgg 1800 caagcagttt attttctaaa gcaaaattca gaagtagctg agtctttcaa gaatttcgtc 1860 aacactcttc gaagtgaaac tggacaaatc gtgcatacac tgagagcaga caatggcggt 1920 gaattcacga atcaatcctt aaaagcctgg ctatctgata gagccataag gatcgaaaca 1980 tctgcacccc attcacctga acagaatgga gtcttcgaaa gagctaatcg gacaatcatg 2040 gaaggcacaa gatccctaat ctatgccaaa cacctaccac tggaactttg ggcagaagcc 2100 attgcctgcg cagtatactc actcaaccgc gtctgcaaca gcacgtcccc actcacaccg 2160 tatgaaaatt ggtacggtaa gaaggctgat ttttctcatc ttcggatttt tggttcgact 2220 gccttcattc atgttccaaa aactgagaga cgcaagttgg aatcgaaaat tttcaaatgc 2280 tatttcgttg gatactccct tacacaaaaa gcttatcgct tttgggaccc aatcggtaga 2340 aaaattaaaa taagtcggga tgtaattttt gatgaaaatt gtaacgtttt ttcccatttt 2400 tcaccacaac tagacgatct ctgcgaattg tatcctaaac cgacaaaacc tgctacaaca 2460 ccacaacaac aaataaccaa catcgctgac gtgatgcctc tagttgcagg ggagatggat 2520 attcaaccag aatcaccacc tgattgttca gaacagccag ataccacatc gttacatcca 2580 gaagatgatc cagctccgac ggaagatatt ccacccgata ttggacattc aaccgaacgt 2640 ggacacgacc ctcttcccag ttcgacacca ccaattcgcc actcgcccta cccgttacga 2700 gtgcatggag ctcatttctt tcaactgatg tccaaacgac agaaatatat gagccgacat 2760 cctattcaga tgcaatggag tatgcaaatg cacatttatg gaaaatagcc atccaggaag 2820 aatatgactc actcatgtcc aacaaaactt ggcacctcac accactccca atcaatcgta 2880 cgcccatcaa gactcgttgg gtttttacat tcaaacaagg tacccaagat tctcccccgc 2940 gatataaagc aagattggtt gcaaagagtt tctcacaacg acctgggatt gatttttaag 3000 agactttctc acccgtagtt aaacatgaca ctcttcgagt aatcctatct cttgctacca 3060 ctcttgatct agaaatgtcg caactagatg taaagactgc tttcctgtac ggcgaaatag 3120 aagaagaaat ttatctcctg caacccgaag gatatgttca agccggacaa gaaaattccg 3180 tatgccgcct ccagaaatgt ttatatggcc tcaaacaagc gtaacgagtc tggaatcgcc 3240 actttgatag tttcctcgaa cgtttcggcc tacaatcaag tgatgccgac ccgtgtcttt 3300 actttcgtcg aaacaacaaa gaagttcttt tcgtcctaat ctgggttgat gatggattag 3360 taatcagtga tgacgggaat ctggtaaaac aagtaattga atatttgaag aaatatttcg 3420 agatgcggtg tactccggct aatcattttg t 3451 // ID Gypsy-42_CQ-I repbase; DNA; INV; 5578 BP. XX AC AAWU01014973; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_CQ_; KW Gypsy-42_CQ-LTR; Gypsy-42_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5578 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 463-463 (2011). XX DR GenBank; AAWU01014973; Positions 38603 44180. XX CC Positions [4369-4839] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1045..5385 FT /product="Gypsy-42_CQ-I_1p" FT /translation="MSIASTIEQYRKGSSFGDWAERLEYNFKANKYTDDLM FT KTHFMNLCGSYLYSELKTIYKKSDLDKATYTELVEKLKQKLHKTEPDLVQR FT FRFSKRIQHPDETAEEFVQAVKLQAEFCNFGEFRDNAILDRVLVGLTDDDL FT KEQLLKEEKLTIAKMDKYITTWNIAKSNVHAINARTHTPPGNINQIRRSVR FT ERLGFNPYNYRQNSQYRNNYNRANNYNRQDNFNNQQRAHISPRAHNSQRTV FT RFQHSNRNFNRNANYNNNNNGYNGGNNNIGYNGGNNNFRSVHCGQLGHIKR FT KCFRLKNQRREAVNFVTEDRAGPSGRPSAGPSGGSSAETGAERQLSSMMGR FT LSTGGNVDLDTSDTDCEEWNAGDLQCMCVASLNKISEPCLINVLLDNIFVE FT MEVDCGSTVTVMGKLQFDNLFNKRLLKSNKQLLVVNGNKLKIWGETDVLVQ FT LNGLKRNLKIIVLDTDFKFIPLFGRNWMDVFFPQWRQFFSNNTAINEQVNR FT VTAPNGDKLIEQIKSDFPEVFIKDFSTPIKGFEAELVLKINVPIFKKAYDV FT PYRLREQVLEYLDRLEKENVITPVKSSEWASPVIIVMKKDNKIRLVIDCKV FT SINKVLVPNTYPLPIASDLFAKLANCKVFCALDLEGAYTQLALSERSRKFM FT VINTIKGLYTYNRLPQGASPSASIFQQMMDQVLGGIENVYCYLDDVLIAGE FT TLDNCHNKLIIVLNRLAMANIKVNWEKCKFFVSDLKYLGHIISEKGLMPSP FT DKISTIQKAKVPKNENELKSYLGLINYYNRFIPNMSSKLFYLYNLLRKNVK FT FNWDHNCDKAFHDSKQALVEANILEFYDPKKDIIVVSDASGYGLGGVIAHV FT IDGVEKPISFTSFTLDDAQKKYPILHLEALALVCTIKKFHKYLFGQEFIAY FT TDHKPLLGIFGKEGKNSIFVTRLQRYILELSIYKFELRYRPSAKMGNADFC FT SRFPLEQSVPAELDQDIVRSINFSREFPLDSVSIAKATVDDVYLQQIMNYL FT RVGWPQKIDKRLLDVYANQSDLEEVDGCLLYQDRVVIPRKMQSGILKLLHA FT NHAGMVKMKQLARKQVYWFGINKDIEKYVSTCDVCASMAVVPKTKIESQWT FT PTTRPFSRIHIDFFYFSNHTFLLIVDSFSKWLEVEWMKRGTDCAKVVKKLV FT SYFARFGLPDVLVSDGGPPFNSYSFTSFLERQGIKVMKSPPYNPQSNGQAE FT RLVRTTKDVLKKFLLDPDLDSVDLEDQINLFLMNYRNNIVTSSGQFPSDRV FT FLYKPKTALDLLNPKKHYKQFLTVPPTPNDEVVDVPKAIRKASKDELDELI FT AGDLVWYKNNKNSIPNRWIKASFIKKFSPNVFQVLAGSGEILAHKDQLRPY FT RAREEARTNILIPTASGANMNLEQTATLRASGSADEDGDFRGYSNAELARV FT RKRKVDAAQLPEVCLRRSKRLRKPKQDFEFKYV" XX SQ Sequence 5578 BP; 1669 A; 895 C; 1311 G; 1703 T; 0 other; gcgtacgttt tagtggcgac ggggatcaag gatttttttt ctctccggtg aaaaagtagt 60 gattttggta atttttcatc gcgtttcgcg gtacgcgttt tgcgttttta taactccaaa 120 tcatctacgg ctttgccgag gtaaagaagt tcttttgtga gagtacgggg tgaatttcgc 180 ggcagtgtgc ttcagaggtg atcggctcga gtctaccaag caagcggcca ttttgaagct 240 tcttcggagg gcgtcggtgg agcgaattcc gtgcgagaga tccggctgat cacggttccg 300 gagtggacgc agtaggctgt ggctgttgga atctgggagg agaatcgtta gcggcaagta 360 gcagaagcgg aggtcggcaa ttttggcaaa atctgtcggt tttggttggc gtcggcgtgg 420 atcggacaaa ggctgccacg cggtacgttt cttttgtttg agctgtttct taaggtaggg 480 gcgattgggg ccaaataagt tccttttgct ttttttcttt attttaaaat catgttccac 540 tgatcgtatc acagttggaa caagttcgtg attttggttg tcctaacatt ttaacatttt 600 agaaaattga gtgtgcgtgt gtgtgagctc tttttgttct gggtagaaca aaggaggata 660 gaaaattttc atcccatttt aagagtgttc ttctttttta tttattcagt gcggtgaatt 720 tgccgaaggt cagtgaaggt gcttattggc ggcgtaggac gttcacgttc tgtctgctga 780 acttccgggt gaaatcctcg gtgcttttca acaaggtcgg agttctggta taggagagcc 840 ggaattctgg tgaagtgcgt cggctttttc tgctgtcaga acgacggtgc ttcagcacag 900 gagcaaaggt ttggtaacct tggtggtgag tttttaagca gatttttttt gaaaactgtt 960 agagcctaat ttggccattt ttggtgtttt cttttctttt atttttgatc ttttgtttaa 1020 aaaagtggtt ctaaatagaa caagatgagc attgcttcca caattgagca gtaccggaag 1080 ggttcgtcct tcggtgactg ggctgaaaga ttagagtaca attttaaagc caacaagtat 1140 accgacgact tgatgaaaac tcactttatg aatttatgtg gttcatactt atactctgaa 1200 ctgaaaacta tttacaaaaa atctgatctc gataaagcaa cttatacaga attagttgaa 1260 aaactcaaac aaaaactgca taagactgaa ccggacctgg ttcagcggtt ccgtttcagc 1320 aagaggattc agcatcctga cgaaaccgct gaagagtttg tccaggcggt caaactccag 1380 gccgagtttt gcaattttgg ggagtttcgc gataatgcaa ttttagatcg ggttttagtg 1440 ggtttaactg acgatgactt aaaagagcaa cttttgaaag aggagaaatt gacgattgct 1500 aaaatggaca aatatattac aacatggaat attgcaaaat caaatgtaca cgcaatcaac 1560 gcacgaacac acacaccacc agggaacatc aatcaaatta gaaggtcagt tcgggagcga 1620 cttggcttta acccatacaa ttataggcag aactcgcaat atcgaaataa ttacaataga 1680 gcaaataact acaatagaca ggacaatttt aacaatcagc aacgtgcaca catttcacca 1740 cgagctcaca attctcagcg tactgttcga tttcaacaca gcaacagaaa ttttaatcga 1800 aatgcaaatt acaacaacaa taacaatggt tacaatgggg gaaacaataa cattggttac 1860 aatgggggaa acaataactt cagatcagtt cactgtggtc agttgggtca catcaagcgg 1920 aagtgcttca ggctgaagaa ccagagaagg gaagcggtaa actttgttac tgaggacaga 1980 gcgggacctt caggtagacc gtcagctgga ccttcaggtg gatcgtctgc tgagactggt 2040 gcggagagac agctgtctag catgatgggg cggctgagca ctggtgggaa tgttgatctc 2100 gacacatctg atactgactg tgaggagtgg aatgcaggtg atttacagtg tatgtgtgtt 2160 gcgtccttga ataagattag tgagccttgt ttgatcaacg ttttgttgga taatattttt 2220 gttgaaatgg aagtggactg tggttcaact gtgactgtga tgggaaaatt acaatttgac 2280 aatcttttta ataagcgttt gctgaagagt aacaagcaac ttttagttgt gaatgggaac 2340 aagttgaaga tctggggaga aactgatgtt ttagtgcaac ttaatggttt gaaacgcaat 2400 ttgaaaatta ttgttttaga tacggacttc aaatttattc ctttgtttgg tcgaaactgg 2460 atggatgttt tctttccaca gtggagacag ttcttctcta acaatacagc aattaatgag 2520 caggtaaaca gagtaacagc gccaaatggt gacaagttga ttgagcaaat taagagtgat 2580 tttccagagg tttttatcaa ggatttttct acacctatca aaggttttga agccgaattg 2640 gttttaaaaa ttaatgtccc gatttttaag aaagcatatg atgtgcctta ccgtttaagg 2700 gagcaagttt tagaatattt ggacagattg gaaaaagaaa acgttattac tccagtgaaa 2760 tctagtgaat gggcttcacc agtaataatc gtaatgaaga aggataataa gataagactg 2820 gttattgact gcaaagtttc aattaacaag gtcttagttc caaatactta ccctttgcca 2880 attgcgagtg acttatttgc taaactggca aattgtaagg ttttctgtgc tcttgatttg 2940 gagggagctt acacacagtt ggcattgtcg gaaaggtcca ggaaattcat ggtgataaat 3000 acaataaaag gtttatatac atacaatcgc cttccacaag gtgcttcccc aagtgcttcg 3060 atatttcaac aaatgatgga tcaagtcttg ggaggaattg aaaatgttta ttgctatctt 3120 gatgatgttt tgattgcggg agaaaccctg gataactgtc ataacaagtt gataattgta 3180 ttgaatcgtt tggcaatggc taatattaaa gtcaactggg aaaagtgcaa gttttttgtt 3240 tctgatttga aatatttagg gcacattatc agtgagaaag gtctcatgcc cagtccagat 3300 aaaatttcga caattcaaaa ggcaaaagtt cctaaaaatg agaatgaact gaaatcttac 3360 ttaggtttga ttaattatta caatcgtttt atacctaaca tgtcttcaaa gctgttttat 3420 ttgtacaatt tgttaagaaa gaatgtgaag tttaattggg atcataattg tgataaagct 3480 tttcatgaca gtaaacaggc tttggttgaa gcaaatattt tggagtttta tgatccaaag 3540 aaagacatta ttgtggtttc tgacgcatcg ggctatggcc ttggaggagt gattgctcat 3600 gttattgatg gtgttgaaaa accgattagt tttacttctt ttactttaga tgacgcgcag 3660 aagaaatatc ctattcttca cttggaggca ttggctttgg tttgtacaat taagaaattt 3720 cataaatatt tattcggaca agaattcatt gcttacactg atcataaacc attgctaggt 3780 attttcggca aagaaggtaa aaattcaatt tttgtaacga gactccaacg gtacattttg 3840 gaactttcta tttataaatt tgaacttcgg tataggcctt ctgcgaaaat gggaaatgcg 3900 gatttttgtt cgagatttcc attggagcag tcggttcctg ctgaattgga tcaagatatt 3960 gtacggagca taaactttag tagggagttc cctttagatt ctgtttcgat tgctaaggca 4020 acagttgacg acgtttattt acagcaaatt atgaactatc tgcgtgttgg atggccgcaa 4080 aagattgaca aacgcttact ggacgtttac gcaaatcaaa gtgatttgga agaagtcgat 4140 gggtgcttat tgtaccaaga cagggtggtt ataccacgta aaatgcaaag cgggattttg 4200 aagcttttgc atgccaacca cgcagggatg gttaagatga aacaactggc aaggaaacag 4260 gtttattggt ttgggattaa caaagacatt gaaaaatatg tttctacgtg cgacgtctgt 4320 gcaagtatgg ctgtcgtacc aaaaactaag attgagtctc aatggacacc tacgactaga 4380 ccttttagta ggattcacat tgatttcttc tacttttcca atcatacttt tttgttgatt 4440 gttgattctt tttcgaagtg gttggaagta gaatggatga agaggggcac ggattgtgca 4500 aaggttgtga agaaattggt atcgtatttt gcgagatttg gattaccgga tgttctggta 4560 tcagacgggg gtcctccttt caactcttat tctttcactt ctttccttga aaggcaagga 4620 ataaaggtga tgaaaagtcc accttataat ccacaaagca atggacaagc cgaaaggcta 4680 gtgagaacaa ctaaggatgt tttgaaaaag tttttgttgg atccggattt ggacagcgtt 4740 gatttggagg atcagattaa cttattttta atgaattatc gtaataacat tgtgacgagt 4800 agtgggcaat ttccctcgga tagagtattt ttatataaac caaaaacagc tttggacttg 4860 ctgaatccca aaaagcatta taaacaattt ttaactgtgc caccaacccc aaatgatgaa 4920 gtggtagatg taccaaaagc aatcagaaag gcgtcaaagg acgaattaga tgagctgata 4980 gctggggatt tggtgtggta caagaataac aaaaatagca taccaaatag atggattaaa 5040 gcaagcttta ttaagaaatt ttctccaaac gttttccagg tgttggctgg aagcggggaa 5100 atcctcgctc ataaagatca gctgcgacca taccgggctc gtgaggaagc gaggacaaat 5160 atcctaattc cgacggcatc cggtgcaaac atgaacctgg aacaaacagc gacgctgaga 5220 gcgtccggaa gcgcagacga ggatggagat ttccgtggat actcgaacgc cgagctggct 5280 agggtcagaa aaagaaaagt tgatgcggct caattgccgg aagtgtgctt aagacgttca 5340 aaacgtttgc gtaaacctaa acaggatttc gaatttaaat atgtttaact ttggattttt 5400 ttcatgtcgt tgaattgcga tttcggattt atatagagaa atgtattttg atcagaatta 5460 aaataaattg attttgttag catttgaagt gaattctttt actttggaaa cagtgaattg 5520 taaaatattt ttagtttaat cttagactat agaaacaact tccgaaaggg gaaggagt 5578 // ID BEL-24_CQ-I repbase; DNA; INV; 2391 BP. XX AC AAWU01010212; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-24_CQ_; KW BEL-24_CQ-LTR; BEL-24_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2391 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 201-201 (2011). XX DR GenBank; AAWU01010212; Positions 21750 24140. XX CC 'GATAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 530..2347 FT /product="BEL-24_CQ-I_1p" FT /translation="MEEQLKALVRQRDAVERRLKRVQADLKTSTAAPNENL FT RNVHFVKHQLSNVESAFVKFGEYQEKIYAMALAVEEQAKHEKCELEFEALR FT RGLTVKLKGLVEDLEKPGLNATNVPAASPHYLPPLNVPLPKFDGTYETWLS FT FKSMFQHVMARYTAEAPAIKLYHLRDSLVGKAAGVIDQDIVNNNDYEAAWA FT VLEELYGDKRVIIDRHIDVVFALPKISRDNAAELRKLIDTCGKHVEGLKSL FT QLPVNGLGEQMLLNLLASRMDRDTRKAWEAEQKVGVLPTYAATIAFLKERC FT RVLERVEPCGETNVNPQRSVALVVASGAKCYTCNQQHEIKDCEQFKSRSVN FT ERFSQLRKHGLCFNCTKRGHRAGECTSTNSCERCKKRHHTMLHKDSPRKPE FT IAPNSTAADAANPKQRRHFGRGPTCCATTKKLVMLPTAVVQVYGENGVPHL FT CRTLIDSCSQNHFVTERFANLLAVNRERVNCEVSGLNEDTTRISHLVRATV FT KSRVGDYAADVELLLTPKITGDVPAKMIDVAGWNLPSNVELADPNFNHSGR FT VDMLLGAELFWDLVKDGKISLSENLPSLRETEFGWIVGGALPIRTPVTASS FT FCGVSTRG" XX SQ Sequence 2391 BP; 580 A; 556 C; 785 G; 470 T; 0 other; ttttggtcct tcatggccgg attgctggag cagtgatccg tggtatgaag agacgggttg 60 ccggagtggg aactcgaaca ttcgcggtcg aagtgtccgc gtgtgtgtgt gaagcgcggg 120 gcgtagcacc cacgctgcta agaaaaagtg attgaacttt ggaagtctgt gaccggacgg 180 ctggagagtg gttcgggaag cggccttttt gaaggaaagc atcgggcgga gtgtccggga 240 gtgaaaaagt gcgaccggac gactggagag tggtctgaag gtgacatttg acggtggaaa 300 ccgaaccgaa aacaccagca aaggcttcgt gtgtgtgtga cagcagtgag cgtgttggaa 360 acttccggcg cggggcatag catcatctag tggaagaggg gctgtgtaat tcgaagagtg 420 ccagaaagaa gtgtccgaag aagaagccgg caacgggggt gagaaggagg aagtgacacc 480 aatctgcaag cagttgctgg tgcagcggaa gaaggaagac gcccgtaaga tggaggagca 540 gctgaaagct cttgtgcgtc agcgagatgc tgtggaaagg aggttgaagc gggtgcaggc 600 ggatttaaag acgagtacag cggcgccgaa cgaaaatctg cggaacgtgc acttcgtgaa 660 gcatcagctg agcaacgtgg agagcgcttt tgtcaagttc ggtgagtacc aggagaaaat 720 ttatgcgatg gcactcgcgg tggaagaaca ggcgaaacat gagaagtgcg agctcgagtt 780 cgaggcgctg cgccgcggac tgactgtgaa gctcaaaggt ttggttgagg atctagagaa 840 gccagggctg aacgctacga acgttccggc tgcctcgccg cattaccttc cgccgctgaa 900 cgtgcctttg ccgaagttcg acgggactta cgagacgtgg ctctcgttta agtcgatgtt 960 tcagcacgtc atggcacgat atacggctga ggctccggcg atcaagcttt accacctgcg 1020 agactcgcta gtcggaaagg ctgccggtgt gattgaccag gacatagtca ataacaacga 1080 ctatgaggcg gcgtgggccg ttctcgaaga gctgtacgga gacaagcgtg tgatcatcga 1140 tcggcacatc gacgttgtct tcgctttgcc caagatctcc cgcgacaacg cagccgagct 1200 ccgcaagctg atcgacacat gtgggaaaca cgttgaggga ctgaagtcgc tgcagcttcc 1260 ggtcaacgga ctgggggagc agatgctctt gaacctgctg gcgtccagga tggatcggga 1320 cacgcggaaa gcctgggagg cggaacagaa agtcggagtg cttccgacgt acgcagccac 1380 catcgcgttt ctgaaggaga ggtgccgtgt cctcgaaaga gtggaaccgt gcggtgagac 1440 caacgtcaat ccacaacgat cggtcgccct ggtggtcgcg agcggagcga agtgttacac 1500 ttgcaaccag cagcacgaga tcaaggattg cgagcagttc aaatcgagga gcgtcaacga 1560 aaggttcagc cagctacgga agcatggact gtgctttaat tgcaccaaga gggggcatcg 1620 cgcaggcgag tgcacttcta caaactcgtg cgagaggtgc aagaagcggc atcacactat 1680 gttgcacaag gatagcccga ggaagccgga gatcgctccg aattcgacgg cggcggatgc 1740 ggcgaacccg aaacagcggc ggcactttgg acgaggtcct acgtgctgcg ctacaacgaa 1800 gaaactggtc atgctgccga cagccgtagt ccaggtatac ggtgaaaatg gcgtgccaca 1860 cctgtgccgg acgttgatcg attcctgctc gcaaaaccac tttgttaccg aaagattcgc 1920 caatcttctg gcggttaata gagaacgcgt caactgtgag gtaagtggtc tgaacgaaga 1980 cacgacgaga ataagtcacc tggtccgtgc gacggtcaag tctcgcgttg gcgactacgc 2040 ggccgacgtc gagttgctgt tgacgcctaa gatcactggc gacgtgccgg cgaagatgat 2100 cgacgtcgct gggtggaacc tgccatctaa cgtcgagctt gcggatccca acttcaacca 2160 cagcggtcga gtcgacatgc tcttgggagc tgagttgttc tgggatttgg tcaaggatgg 2220 aaagatttca ctgtcggaga acttgccctc gctcagagag actgagtttg gctggatcgt 2280 cggcggtgcg ctgccgattc gaacacctgt gacggcgagc tctttctgcg gtgtttcaac 2340 acgaggatag ttggagatga attgttgaac tgtttcaaca atgggggagg a 2391 // ID NeSL-1_TV repbase; DNA; INV; 6092 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 01-AUG-2009 (Rel. 14.07, Last updated, Version 1) XX DE A family of NeSL non-LTR retrotransposons - consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; NeSL-1_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-6092 RA Kapitonov V.V. and Jurka J.; RT "A family of NeSL non-LTR retrotransposons from the T. vaginalis RT genome."; RL Repbase Reports 9(7), 1337-1337 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 20..5683 FT /product="NeSL-1_TV_1p" FT /note="RT ans restriction enzyme-like nuclease FT domains." FT /translation="MIPVLGTGGPEKLPLQSYVYCGNTAITDSFTPTAKTI FT LKPEEQNLDIVLKNIAALNPENYSDLIRSLSKMEFRLDYPKEIENYWISEK FT LFSQSIASLPISLLVASMFSPEDRDLSTEPFHCNADGCNFHCDNCERMVEH FT IREHHNTDPMINTFETTEDTFRRITAIKIDKTGIEELNPLKYRCSYCDELF FT TEAEDHAIHMISHLTEKLSPDISFFFNDILRLYKTIDKPTVQNLFPETQVA FT IFDTLEETNRFRLIVGREAIETIEEAFPPSPPGTDRKPSIIITDTCQLRFV FT PCMDEPPKGDLGILTLLLRDFSAHNIPIKSLNNKELIADKDIDYSPDFVEG FT ALANAEEHDTTNSQNNNGRYINSAEKLTEFLIQCEDYLTNIKTLEDLERFY FT TTIKDYRVNKEVIAEDTPIFVYFLVEEGKLPKPGLRCPLESYEGHEDKAFE FT SLRKLCDHFKGEIAKTSFDPKVHTIDIWVEFLAQAYGTGTFVYKDENGNID FT LDTHVFKCPYADCSYTNNDRSKLMDHMKTKKHAKNVYIERYGFFWGIVIEG FT VNRPKGIVYPTLKDIKEHACRKCPEAGCNTYVTELSDIKEHLKKKHKSTTA FT GVDGEIAHTDATYCWITKEELDALHAERARERAEQVDNTPVQQIINADNNE FT ENNENQEDNGNNEEADALDPPNNTTETEDEAVHAVIINPPATEEEEVAIIA FT EARRNIPELQQAEERGCVTPKMTSLVRLKLLKGGGELFNKKLTPLATRYAA FT TGNTEADKIKVDYLTLKCNAALREMIYTNNHSESKFMTAENGEDTAPPPRI FT SEDTRDRIQKAANEIKGTLIKVVKHISHARCLKDSTRDDEHNKFVEMIAKI FT KNDLRDNKFEQYNIEEIFQGPISDQSILNIVNTEDNNEFIKKMDYINRILG FT TPQDASPYARKKLQACFADNPTKTLRNIILADKVPQQSLKPSEYLDYYGPQ FT WANEAEGYENFLHHDYALPERYGQVFANDFLDFMTNESKIIEVIRNKNHLS FT AHGLDGIPNSVYMLFPVSAAKFLSILFRSIIISGHIPDCWKLSKTVMLFKK FT DDPSLAKNWRPIGITSCTYRIFMTLVNKALQMIPMFHAMQKGFVRGATLSE FT HIAVANEVLCQSTRTQSEMFQTAIDFTNAFGTVPHQLIFDSLEAKKVPDSI FT INLLKDLYKGARTAIYTRHAHSEIVPVRRGVIQGCPLSPILFNCCLDPLLY FT AVQRRHFEDGYRFQDKAGQYSIAIQAYADDVLVISPTHEGMQRILNTVDEF FT QKIAKLKVAPQKCVTLAKTSTAIQPFRIGPDEIPIKTSMDNITYLGIPISG FT TKTSRFAAATGILEKVKAQIRVVFASHLALSQKIIALRVFILPQLDFYMFH FT NVFRVNDLKATDQMIRGLIDKEAPTSNIPVSFFYMPKNKGGFGLVKLELRQ FT PQLVLTKFARLWLSQQAETKAFFHTMAQEEKSFRKVVEDQENGFLGIKMEN FT GKIVQKNERSKRTNCFITQAAKAADKLEVRFKEWDKGGIQVRGVGENATDW FT YRSKHIGQISPLIGRVIQQRQYEEFKKDETHSHTFCEPAALAESHDIMKRP FT QAVPNNLYSAAIALRTNTAPTPANMHFHNPEVLANCPLCGCQSCTLFHTLN FT MCRNRFSLYKWRHNIICDDIYQFIHDHYPGVTIKCSARITSDGYQTTGPEL FT DDTVKDLLPDLVVYDEANKMIKIIEVTCPYGTDNNVGNSLDAAYDKKVNKY FT KSLAEQTERLFNWTTTLSIIVVSSLGVIPLRTKLDALRISPADHIQLLKRL FT SMHAIAASACIVFEKVPEFFGMRCRPLPGRVTAPNAAIPPNNNENNNDTDH FT GQENQQATSEEQPTNNGNAQEDNGQGEQINNSTEQTISVDQIIEEDAENNA FT IEQALDQPDEDEFLN" XX SQ Sequence 6092 BP; 2163 A; 1497 C; 1161 G; 1271 T; 0 other; gggtgagtag tctagtggta tgattcctgt tttgggtaca ggaggtcccg agaagcttcc 60 actgcaatcg tacgtgtact gtggcaacac agctataaca gacagtttca cgccaaccgc 120 gaaaacgatt ttgaagcctg aggaacaaaa tttagatatc gttttgaaaa atattgcagc 180 gttgaatcca gaaaattact ccgacttaat caggagccta tcgaagatgg agttcagatt 240 agattacccg aaagaaatag agaattactg gatttcggaa aaattattta gccaatccat 300 cgcatcattg cccatcagtt tgttagtcgc atccatgttc tcacctgaag accgtgactt 360 gagtacagaa ccgttccact gtaacgctga tggctgtaat ttccattgtg acaattgtga 420 aagaatggtt gaacacatca gagagcacca taacactgac cccatgatca atacatttga 480 aacaacagaa gacacattta gaagaataac ggccatcaaa atagacaaga caggcatcga 540 agaacttaac cctctaaaat acagatgctc gtattgcgac gagttattca ccgaagcaga 600 agatcatgcc atccatatga tttcacatct cacagaaaaa ttatcaccag atatatcttt 660 ctttttcaac gacattttac gcctttacaa aactatcgac aaaccaacag tacaaaattt 720 atttccagaa acacaagtcg caatttttga cacacttgaa gaaacaaaca gattcagact 780 tatcgtagga agagaagcca tagaaacaat tgaagaagca ttccctccaa gtccaccagg 840 aacagatcgg aaaccatcca taatcatcac agacacctgt caactcaggt ttgtaccatg 900 catggatgaa ccaccaaaag gagatctcgg aattctgact ctacttttaa gagatttcag 960 cgcacacaat atcccgatta aatcactgaa caataaggaa ctaattgctg ataaagacat 1020 cgattacagc ccagattttg tcgaaggagc tctagccaac gcagaagaac atgatacaac 1080 gaacagccag aacaacaatg gaagatacat taactcagcc gaaaaactta cagaattttt 1140 aatacaatgt gaagactact taacgaacat caaaacactt gaagacttag aacgtttcta 1200 cacaacgatt aaagactaca gagtcaacaa agaggttatc gccgaagata caccaatctt 1260 tgtatatttc ctagtagaag aagggaaatt accaaaacca ggtcttagat gcccacttga 1320 atcatacgaa ggacacgaag acaaggcatt cgaatcactg agaaaacttt gcgaccactt 1380 caaaggagaa atcgcgaaaa cgagctttga cccaaaggtt cacaccatag acatctgggt 1440 tgaatttttg gcccaagcct atggcacagg cacgtttgtc tacaaagatg aaaacggaaa 1500 catcgacctt gatacgcacg tattcaaatg cccttatgca gactgctcat acacgaacaa 1560 cgacagatca aaactcatgg accacatgaa aacgaagaaa cacgccaaga acgtatacat 1620 cgagagatac ggcttctttt ggggtattgt catagaagga gtcaaccgac caaaaggaat 1680 cgtctacccg acactcaaag acatcaaaga acacgcttgt cgcaaatgtc cagaagcagg 1740 atgcaacaca tatgtaacag aattgagcga catcaaagaa catctaaaga agaaacataa 1800 gtctacaaca gcaggagtag acggagaaat cgcgcacact gatgctacat actgctggat 1860 taccaaagaa gaactcgacg cattacatgc cgagagagca agagaaagag cagagcaagt 1920 agacaacact ccagtacaac agataattaa tgctgacaac aatgaagaga acaacgagaa 1980 ccaagaagac aacggaaaca acgaagaagc agatgccctc gacccgccaa ataacacaac 2040 agagacagaa gatgaagcgg ttcatgccgt catcatcaat ccaccagcaa cagaagagga 2100 agaggtagcc atcatcgccg aggcaagaag aaacattcca gaactccaac aagcagaaga 2160 gagaggctgc gttacaccga aaatgacatc actcgtccga ttaaaactat tgaaaggagg 2220 aggagaactt ttcaacaaga aactcactcc attagccaca agatacgcag ctacaggaaa 2280 tacagaagca gacaaaatca aggtagatta cttgacacta aaatgcaatg ccgccttgag 2340 agaaatgatc tacaccaata accacagcga atcaaagttt atgacagcag aaaatggaga 2400 agacacagca ccaccgccaa ggatatcgga agacacaaga gatcgcattc aaaaagcagc 2460 caatgaaata aaaggaactc tcatcaaagt agtcaaacac ataagtcacg cgagatgcct 2520 caaagacagc acgagagacg atgaacacaa taaattcgtc gaaatgattg caaaaatcaa 2580 aaacgatctc agagataaca aattcgaaca atataacatt gaagaaatat ttcaaggacc 2640 gatctccgac cagagtattc tcaacatcgt caacacggag gacaacaacg aattcatcaa 2700 gaaaatggat tacattaacc gaattctcgg aacaccacag gatgcatcac catatgcaag 2760 gaagaagtta caagcatgtt tcgccgataa cccaacaaag actctcagaa acataatctt 2820 agccgacaaa gttccacaac aatcattgaa gccaagcgaa taccttgatt actacggacc 2880 tcaatgggca aacgaagctg aaggctacga aaacttcctg catcatgact acgcgttacc 2940 ggagagatat ggccaagttt tcgcaaacga cttcctcgac ttcatgacaa acgaatcgaa 3000 gatcatcgaa gtaatccgca acaagaatca tttatcggca cacggcctcg atggaattcc 3060 gaactcagtt tacatgctat tcccagtcag cgccgcaaaa ttcctcagta tattattcag 3120 atcaatcatc atatcaggtc acatcccaga ctgctggaag ctctccaaga cagtgatgct 3180 ttttaagaag gacgacccat cgttagcaaa gaactggaga ccaatcggca tcacgtcatg 3240 cacttacaga atcttcatga ctttagtcaa caaagcgtta cagatgatcc caatgttcca 3300 cgcaatgcaa aaaggtttcg ttcgcggagc aacactgagt gagcacattg cagtcgcgaa 3360 cgaagtcctt tgccaatcaa ccagaacaca gtctgaaatg ttccaaacag caatcgattt 3420 cacgaacgct ttcggcacag ttcctcatca attgatcttt gattctctcg aagcgaagaa 3480 agttcccgat tcgatcatca atctgctcaa ggacctctac aaaggagcaa gaacggctat 3540 ctatacaaga catgcacact ccgagatagt tccggttcgc agaggtgtca tccaaggctg 3600 tccactcagt ccaatcctct tcaactgctg cttagatcct ttattatatg cagtccagag 3660 gagacacttt gaggacggtt acagattcca agacaaagca ggacagtatt caattgccat 3720 tcaagcttac gctgacgacg ttctagtcat ctctccaaca catgaaggaa tgcaaagaat 3780 cttaaacaca gtagatgaat tccagaaaat tgcgaaactc aaagttgcac cacagaaatg 3840 cgtcacactt gccaaaacat ccactgcaat ccaacctttc cgcattggtc cagacgaaat 3900 cccaatcaag acgagcatgg acaacatcac atatcttgga ataccaatct ctggaacaaa 3960 gacatcaaga tttgcagctg caactggcat tctggaaaag gtcaaagcac agatcagagt 4020 cgtcttcgcg tcacatctcg ctctctctca gaagattatc gctctcagag tcttcatctt 4080 gccacaactt gacttttaca tgttccacaa cgtattcaga gtcaatgact tgaaagcgac 4140 agatcagatg atccgaggcc tgatcgacaa agaagcgccg acgtcaaaca ttccggtttc 4200 atttttctac atgccgaaga acaaaggcgg ctttggactc gttaaattgg aacttcgcca 4260 gcctcagctc gttctcacta aatttgcgag gttatggtta agtcaacaag cagaaaccaa 4320 agccttcttt cacacaatgg ctcaagaaga gaagtcattc cgcaaggtcg tcgaagacca 4380 agaaaatggt ttcttaggca tcaagatgga aaacggcaaa attgtccaga agaacgaaag 4440 atccaaacgc acaaattgtt tcatcacaca ggcggctaaa gcagcagaca aactggaagt 4500 cagattcaaa gaatgggaca aaggaggcat acaagtcaga ggtgtaggag aaaatgcaac 4560 agactggtac cgctcgaaac acatcggcca aatctcaccc ttaatcggtc gcgtcatcca 4620 acagaggcag tacgaggagt tcaagaaaga cgaaacacac tcacacactt tctgcgaacc 4680 agcagcgcta gcggagtcac acgacatcat gaagagacca caagctgttc caaacaacct 4740 ctactcagcg gctattgctc tccgtacaaa cacagctcca accccagcaa acatgcactt 4800 ccacaaccca gaagttttgg ctaattgtcc attgtgcgga tgccaatcct gcactctctt 4860 ccacacattg aacatgtgca gaaaccgttt cagtctatac aaatggcgcc acaatatcat 4920 atgcgatgac atttaccaat tcattcacga tcactatcca ggagtaacca tcaaatgctc 4980 ggcgagaatt acaagtgacg gctaccaaac aacaggccca gagctcgacg acacagttaa 5040 agatctcctc ccagaccttg ttgtctacga tgaagcgaac aagatgatca agatcattga 5100 agtcacatgc ccttacggca cggacaacaa tgttggcaac tctcttgacg cggcatacga 5160 caaaaaggtt aacaagtata agagccttgc tgaacaaaca gagagattat ttaactggac 5220 cacgacgctc tcaattatcg tagtctcatc actaggagtc atccctctcc gtacaaaact 5280 cgacgcattg agaatctcac ctgcagatca catacagcta ctcaagagac tttcgatgca 5340 cgcgatagct gcgagtgctt gcattgtttt tgaaaaagtg ccagaattct tcggtatgcg 5400 ctgccgtccc ctcccaggac gagtcacagc tcccaatgca gcgatcccac caaacaacaa 5460 tgaaaacaat aacgacacag atcatggtca ggagaaccaa caggcaacct ctgaagagca 5520 accaaccaac aatggaaatg ctcaagaaga caatggccaa ggcgaacaaa taaataattc 5580 aaccgaacaa actatctctg ttgatcaaat catcgaagaa gatgctgaga acaacgcgat 5640 agaacaagcc ttagaccaac ccgatgagga cgaattcctt aactaagaag agataagacg 5700 agtgagaaga acagaagcat agtaggattg gcagagctta agcgatgtca ctcggtacga 5760 aacgtgtacc aaacaccgga ttccgtgcta ggaatcacaa gccaaaataa aagagacacc 5820 acgaaaatta ctcaccctcc ctcaaacaga taataatatt aacctcccat ccatcagtcc 5880 gtatggtctg ataacagact agcaccacat ccatgataca ctcattggag tgaaaaccac 5940 caacaacaaa tccacctaga ccaaatcctg ccccacctcc acccaagtag ctcgcttcgc 6000 tcgctcacct aaaactttgc tcgctcgctt cgctcgctcg tcttaaccct ttccgaataa 6060 acacttacaa ttcccggctc gccccatttt tt 6092 // ID SIRE2_TC repbase; DNA; INV; 811 BP. XX AC AF227606; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Trypanosoma cruzi SIRE repeat region, related to VIPER DE retrotransposon. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE element; KW SIRE repeat region; SIRE2_TC. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Vazquez M., Ben-Dov C., Lorenzi H., Moore T., Schijman A. RA and Levin J.M.; RT "The short interspersed repetitive element of Trypanosoma cruzi, RT SIRE, is part of VIPER, an unusual retroelement related to long RT terminal repeat retrotransposons."; RL Proc. Natl. Acad. Sci. U.S.A 97(5), 2128-2133 (2000). XX DR Genbank; AF227606; Positions 1 811. XX SQ Sequence 811 BP; 225 A; 103 C; 206 G; 202 T; 75 other; gggcggtnaa aaaaanatta gtnagaggnt ttnccggggg tttntngata aaggannana 60 gagnggggaa aaatanaagg gagggggagg aanagaacaa nttnttaaag nannnngggg 120 agaaancnag gtntnttgna annaatggaa ttaggggaaa nttaaaanag ggaatttnat 180 antaatntag agaaanttgn gcccctttng gngncaagga attntggnan ggagggattt 240 tanngtttta tggcngcnng gttngagagg ntgttgcgag agntntttan nggncnncng 300 antttnttgn ggtttcntgg gggggaangg tgattttgga aagcggtaaa aatgggttcg 360 acgatagtgg gagtagggac cgcttttttn ggtaagngtt antgcccacc aaagtngccg 420 cggaagggaa cgacagatta ggggantatc cggcagccca tcagccgcgt cttttgantg 480 tgcgggcatc cccggctttt tntaaatgag ccaatgagcg gaaactgcac gcggctgttn 540 acaaattggn ttcctccgca catacgttac acaaatttta tttatttatt tatttaattt 600 ttattttgcc atcctaccca cccgcttttg atcaccccan ttcgcggcgg ggtnttgtgg 660 ttggagggga agttaagatc attcattttt taagtgataa atatatttat ntttaatntg 720 aacagaatgc gacatgccac aatattggag gtaacggaat atataaagac gaacatacat 780 tttgagcaaa aaagnaaaaa caaacgaacn a 811 // ID R1-1_NVi repbase; DNA; INV; 5582 BP. XX AC . XX DT 18-AUG-2009 (Rel. 14.08, Created) DT 18-AUG-2009 (Rel. 14.08, Last updated, Version 1) XX DE non-LTR retrotransposon of R1 clade from Nasonia vitripennis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5582 RA Bao W. and Jurka J.; RT "R1 non-LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(8), 1801-1801 (2009). XX DR [1] (Consensus) XX CC non-LTR retrotransposon of R1 clade. XX FH Key Location/Qualifiers FT CDS join(894..2309,2282..2677,2538..3305,3235..5307) FT /product="R1-1_NVi_1p" FT /translation="MCWGHVSCLHRRGDPGGRTHTRTKFDFSMDEATERNN FT LRMEGCDVPADGDAVSGGGSKNLVGSLATINDIAAQNREGTGDIIGGSPQN FT WXAAERKLEDQLLIIRSNLVKMKAKTNVQKNVSRVITDGMCEIYEAVEVGM FT HELQRMKQLRQTLTEVQRTDTEETPRLKRKRDRQTPTPEPGTSDKQELXAK FT DTWATVTNEETKRPRKRRRMLIKXDETKEPKARKTDRSNLKPRRSAILIKP FT AEGKTYADVLLSXRQXVNLTDSKVDVRAIRKTKDGSVLLQLDKEQERQVFQ FT QTIQNVLGQTATVKDVTSKVTLEFMDLDCVTTAQDVADAINRETGTTTDRN FT VHVFDANNRGQALAVCEFDAKEATDLLKKGRIKIGWINCRIRPRLRVPRCY FT RCLGYGHVRVDCKGPDRRDCCWKCGRKGHVGKACTSAPSCFLCAGRANQDL FT AHVPGTASCAVFKAALTEYRTKQTKWTIQDETDKMDLTILQCNLNGSSVAH FT SLLPQIAAETDADVLVISEQYRNLSEPSWIASSSNTAAVWVRGVNAARIID FT SGAGEDYAWVRVGSVTIVSVYLSPNNSAQCARIRRQAGSPGRRRQXPYWRC FT NRGGRLGWALSLLSVSTFLRTIVRSAREYEDKLEALEDVVRXLTGDVIVAG FT DFNARAIEWGMPTTNRRGRLILQMAARLELEVINDGNVTTYRRPGFGNSIP FT DITLATDRMLTRLRGWRVIEDYTASDHQYIVFNLTNDAALRQRQSMRTTRW FT DIGRINRDEIKRQLQNVAIPSADLSQGRTDRAAAERSANDLEKYLQRICEA FT AMPRKRYRQDRRQTYWWTQEIAEIRKECLRLRRLAGRERDSDDKRGYLAGI FT QARETATREGNGTRTIREAISREYKLARRLLRQTINSTKRRCWRRLEDEVD FT SDPWGEGYKIVTRKLGAWNPPETKDADTMERIVHDLFPTHQDRTDDTDMDT FT AVECPLFTVAELESAAAKLKPRKAPGPDGVPGELLQIIVAEQPDALLRLYN FT GCLTAGVFGGPWKGARLVLIAKGKGDPASSSSYRPLSLLNTPGKLYELLLR FT PRLLQAVEAAGDLSDRQHGFRRGHSTIGAITHVIQIVDKTNDICHGARPLV FT LLATLDVRNAFNSARWADILEALETTFRVPAYLVRVVKDYLRDRYLTYETS FT DGTRRRKLTAGVAQGSILGPDFWNILYDGLLRLDMPDDTCLIAYADDVAAV FT ITARNTHLAQLKLNQVMRRVSTWMTDHGLQLALQKTELLLLTRKRIDTIIP FT MNVGTEHTITRNEVKYLGVTLDTKLTFWAHIRNATAKAAETTKALSRLMAN FT TRGPRPSIRRLLMSVTHSIMLYGAEVWADALQVKKRAKAMTSVQRTGALRI FT ACAYRTVSENAIFVIAGVIPIDLLARERKRLYVRSSNVGRADAATEERTRT FT MEEWQTRWTTNGQGRWTHRLIPEVGPWKDRKHGEVDFYLTQFLSGHGYFRR FT YLYRMNRVTTPECTYCGQAWDDAEHTFFRCNRWTEQRQRLETLLGTTITPD FT NVTKLMITSEDAWRSIATYVGSILRGKKEDGCLDN" XX SQ Sequence 5582 BP; 1575 A; 1305 C; 1682 G; 1010 T; 10 other; taataataat aataatccca tcgatatgtc cgctccttgt cgtaggcggt tgctttaaca 60 tcttcgcaac ctgccgggtg atgggcttac gagagcagca gtcctctgag ggagtgaatg 120 tatggcattc tggtgggagt gtaaatgagg gacgattgaa cgaaactcct taaaaagagt 180 atcttgatga atcagcgcat ggagcggaga tgagctgcag tagcagaaag ggaaaccgaa 240 tatgcaagct gcatgccaac cacagtcgga gccgtaggtc ctttaaaaag atcttgaccg 300 gtcacacaga gacgcaggaa ggcacgcact atcgaagacc tctataaacg ggtgttcttt 360 gagaagcgaa atcacacgcg gagcacaagc cgggactatg ccctttctgc ggcgaccccg 420 cagcaatggt agcgtcttgg tggagagaat tgttgatgca tgattgaggg ggtcctataa 480 acggatatct cccggacgta tcagagtgat agatgggagt gtcttgttgc cgtctaagtc 540 aggggaaggc tgctctcaaa ctaatcaagc ggaccgccgt gttggccgtg cggtgcgacc 600 cgtgagctgc caaaaggaga tcctgggtgg ggagacgatc ttaggctgcg tgccgaagtg 660 actcgagcca cctttggctc ccgctgacgg caagacggga ggtcggctgc ctagccagca 720 ccgatcaatc ggcttggcga gtgcctgctg tcgtgtgacg cctgctgtaa acttttcaga 780 gagctgaaca cttttgttcg ggggtgggaa gctcgagaga gcggaggtgg gctagtgcgg 840 cagtgggttc tatatcctgg tatgatgtac ccagtctggc atcgcctgca ggcatgtgtt 900 ggggccatgt cagctgcctc catagacggg gggatccagg gggtcggaca cataccagga 960 cgaagtttga tttcagcatg gatgaggcaa ccgaacggaa caatttgagg atggaggggt 1020 gcgacgtgcc tgcagatgga gatgcggtca gcggaggtgg aagtaagaac ctggtaggta 1080 gcctggctac tataaatgac atcgcggcac aaaatcgtga agggacgggc gacataatcg 1140 gagggtcgcc acaaaactgg raggctgcag agaggaagtt ggaagaccaa ctactcataa 1200 tacgcagcaa cctggtgaag atgaaggcga agacaaacgt ccaaaagaat gtcagtcgcg 1260 taattacgga tggaatgtgt gagatctatg aagctgttga ggttgggatg catgaacttc 1320 aacgtatgaa gcaactgcga cagactctca cagaagtaca aaggacggac acggaagaaa 1380 cgccacgact caaacgtaaa cgggacagac aaaccccgac tccggaaccg ggaacttcag 1440 acaagcagga gttgcyggca aaggatacgt gggcaaccgt cacgaacgaa gaaacgaagc 1500 ggccacggaa acgaagacga atgctgatca aayaagacga gacaaaagag cccaaagcgc 1560 gcaagacaga tcgctcaaac ctgaaaccca ggcgaagcgc gattttgatt aagccagcag 1620 agggcaagac gtacgcggac gttctacttt ctatrcgaca garagtgaat ttgacggaca 1680 gtaaagtgga cgtacgagct attcgtaaga ccaaagacgg ttctgtactg ctacagcttg 1740 acaaagaaca ggagcgacag gtttttcagc agacgataca aaacgtactg ggacagactg 1800 ccacagttaa ggacgtgact tccaaggtga cactggaatt catggacttg gactgcgtca 1860 caacagcaca ggacgtcgcg gacgcgatca accgcgaaac ggggaccacg actgaccgaa 1920 atgtccacgt ctttgacgcg aataaccgcg gacaagcctt agctgtctgt gagtttgacg 1980 cgaaggaggc aacggatcta cttaagaaag gccgaattaa aatcgggtgg attaattgcc 2040 ggatcagacc caggcttcgc gttccgaggt gctacaggtg ccttggctat ggacacgtga 2100 gggttgactg caaaggaccg gacagacgag actgctgctg gaaatgcggc cgaaaggggc 2160 atgtaggcaa ggcgtgcaca agcgcgcctt cctgcttcct ctgcgcgggg cgagccaacc 2220 aagatctggc acacgtgcct ggcacggcct cgtgcgcagt attcaaggca gctctgactg 2280 aatacaggac gaaacagaca aaatggacct gacaatactc caatgcaacc tgaacgggag 2340 cagcgtggca cattcgctgc tcccgcaaat tgcggcggaa acggacgcgg acgtactcgt 2400 gataagcgag cagtacagaa acctctccga gccatcgtgg atcgctagca gctcgaacac 2460 ggcggcggtg tgggtgcgag gggtgaacgc ggcgcggatt atagacagtg gggccggaga 2520 agactatgct tgggtaaggg tgggctctgt cactattgtc agtgtctacc tttctccgaa 2580 caatagtgcg cagtgcgcgc gaatacgaag acaagctgga agccctggaa gacgtcgtca 2640 garaccttac tggcgatgta atcgtggcgg gcgactttaa cgcgagagcg atcgaatggg 2700 ggatgcctac gacgaacaga cgaggacgac tgattctaca aatggccgcc cgactagaat 2760 tggaagtcat aaatgacgga aacgtgacca cctacagacg acccggcttt ggtaattcta 2820 taccagatat aaccctagcc acggacagga tgttgacaag gctgagggga tggagagtga 2880 tcgaggacta cacagccagt gatcaccagt acatcgtctt caacctgact aatgacgccg 2940 cgttgaggca aagacagagc atgcggacaa cacgatggga cattggacgc ataaacaggg 3000 acgaaatcaa gaggcaactg cagaacgtag caataccttc tgcggatctg tctcaaggac 3060 ggacggaccg agccgcggct gaacggagtg ctaatgatct ggaaaagtat ctgcagcgca 3120 tctgcgaggc cgcgatgcca cggaaacgat acagacagga cagaaggcag acctactggt 3180 ggacgcagga aattgccgaa atcagaaaag aatgcctccg actccgcaga ttagcgggaa 3240 gggaacggga ctcggacgat aagagaggct atctcgcggg aatacaagct cgcgagacgg 3300 ctacttagac agacgatcaa cagcacgaaa cgacggtgtt ggagacgact ggaagacgaa 3360 gtagattcag acccctgggg ggaaggctac aagattgtga ctcgcaaact tggcgcctgg 3420 aatccccccg agacgaagga cgcggacaca atggagagga ttgtccacga tctctttcct 3480 actcaccagg acagaacgga cgatacggac atggacaccg cagtggaatg tccacttttc 3540 acggtggcgg agcttgagag tgctgcagcc aaactgaagc ctcgaaaagc cccgggacct 3600 gacggcgtgc ctggggagct gctacagatc atcgtagctg aacagcccga tgcgctgttg 3660 aggctctaca acggctgcct gacggcggga gtgtttggtg ggccttggaa gggcgcgcgg 3720 ctggtcctca ttgccaaagg taagggggac cctgcttcgt cgtcctccta taggcctcta 3780 agtctcctga acacgccggg taaattgtac gaattactgc tacgaccgag actgttgcaa 3840 gcggtggaag ccgccggaga cctctcggac cgacagcacg gatttagacg aggacactca 3900 acaataggag ctataacaca cgtaatacag attgtggaca agacaaacga catatgtcat 3960 ggagccagac cactggtact cttggcaacg ctggacgtac ggaacgcttt caattctgcg 4020 aggtgggcgg acatcttgga ggcgctggag acaaccttcc gggtgcctgc ctacctcgtg 4080 cgagttgtaa aagactatct acgggaccgg tacctgacgt atgagacatc ggacggtacy 4140 cgacggagaa agttgacagc aggagtagca cagggctcaa ttctgggtcc tgacttctgg 4200 aatatactgt acgacggact gttaagacta gatatgccgg acgacacgtg cctcatcgca 4260 tacgcggacg acgtggcggc tgtaattact gcacggaaca ctcacctggc gcagctaaaa 4320 ctgaatcagg tgatgagacg cgtaagcaca tggatgacgg atcacggact ccaactggcg 4380 ctacaaaaga ctgagctact gctactaaca aggaaacgta tagacacaat tatacctatg 4440 aatgtgggga cggaacatac gattaccaga aacgaagtaa agtatctggg ggtgactcta 4500 gatacgaaac tgactttctg ggcacacata cgcaacgcta cggctaaagc agcagagacg 4560 acgaaagcct tgagtagact catggccaac acgcgaggac caagaccgag tatacggcga 4620 cttcttatga gcgtcacaca ttcgatcatg ctctacggag crgaggtatg ggcggatgcg 4680 ttacaggtaa agaaacgggc aaaggctatg acgagcgttc aacgaaccgg ggcactaaga 4740 atcgcgtgcg cgtaccggac cgtatcygaa aacgcgatct tcgtaatagc gggcgtaatc 4800 ccgatagacc tcctcgcgcg tgagcggaaa cgcttatacg tacggagttc gaacgttggg 4860 agggcagacg ctgcgacaga agaacggact aggacgatgg aagaatggca gacgagatgg 4920 acgacaaacg gacaaggtcg ttggacgcac agactgatac cggaggtcgg accctggaaa 4980 gacaggaaac acggtgaggt ggacttctac ctcacccaat ttctgtcggg acacgggtac 5040 ttccgaaggt atctatacag aatgaacaga gtgaccacac cggaatgtac gtactgcgga 5100 caggcttggg acgacgcgga gcacacattt ttcaggtgca accgatggac cgagcagcga 5160 cagagactcg agacattact aggcacgacg attacaccag ataacgtgac gaaactgatg 5220 ataaccagcg aagacgcgtg gagaagcatt gcgacatacg ttggtagcat cttgcgaggt 5280 aagaaagagg atggatgttt ggataattra gcgagagcga gagtgggaga ataagaatat 5340 aagcgtagct atgaatgact ggaatagtaa ggaatgaatg tgcagaggaa gcgcaaagga 5400 catgggccca gttcgaagta atgtgataaa ccgtaccggg ctggtgtcgg gcgtacaggg 5460 agatatgtgt ttagtgggta ggggtggccg ccgccgccga gtcccacact tggagttctc 5520 gcaagggaac accagtcagt ccgccacaag gattcacagg ctcctatcaa aaaaaaaaaa 5580 aa 5582 // ID EcoRI_HA repbase; DNA; INV; 329 BP. XX AC D38086; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Hemitaxonus athyrii DNA, EcoRI family (pHAE family) of tandemly DE repetitive DNA sequence. XX KW EcoRI family; EcoRI_HA; tandem repeat. XX OS Hemitaxonus athyrii OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Tenthredinoidea; OC Tenthredinidae; Selandriinae; Hemitaxonus. XX RN [1] RA Sonoda S., Yamada T., Naito T. and Nakasuji F.; RT "Repetitive DNA sequence families in Hemitaxonus minomensis and RT H. athyrii (Hymenoptera; Tenthredinidae)."; RL Jpn. J. Genet 70(1), 7-16 (1995). XX DR Genbank; D38086; Positions 1 329. XX SQ Sequence 329 BP; 104 A; 51 C; 63 G; 111 T; 0 other; aattctatcg cctataggtg tttcttatgc agaattatct caggaatgca acaaaccgag 60 tctcattgtt gtgcgatact cagggaataa gatatcgaca attttcagga ttctcgattt 120 tcgcaaaaat cataaggggt tagccttact tttttttcga cgaaaaaaaa agttgactgg 180 aatttatttc tttccgggtg tttcttgtag agaataatct cacgaatcca aaaaacgcag 240 tctcgttgtt gtgcgataat taggagaaaa gatattcaag attcactgaa tttttttatt 300 tttctaaaaa tcggagaaat gtaggtcag 329 // ID Gypsy4-I_Dpse repbase; DNA; INV; 6028 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_Dpse; KW Gypsy4-LTR_Dpse; Gypsy4-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-6028 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1045-1045 (2009). XX DR Genome; Unknown_singleton_87; Positions 19727 13700. XX CC Positions [2915-3418] - Reverse transcriptase CC Positions [4490-4999] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 794..5368 FT /product="Gypsy4-I_Dpse_1p" FT /translation="MPVKRSPEKLINLKDIERPTTSNQRAPSISHSLTTRS FT KTKPVETATMPDPPPDQENLDASTPTTSNRAASDVLYTVMAGTISAEMAKQ FT TLAMSAAMAQLTEKLSQAISTGLQAASTASSRERSPIRIEQLSSGTHVEQH FT NIQPLLPNSHSQAASDQTRDYGQIFSQAPLPGEVRPDRISQIMANWKLRFN FT GKDSMSIDDFIYRIEALTHQTMNGNFELVARNSNNLFEGKASEWYWRYHKS FT VSQIRWHDLCVALRGQFKDSRTDRIIRTEIEDRKQLVNESFDDFYSVIASL FT ADRLTQPMSEQSLLETLRANLLTEVQREILYEPASTIAQLRHLVRTREIFM FT RTVYKPSVAIPKPKYRNISALGHQTDHISESDSSDIEEVEVAAIEASCWNC FT GAKGHRYQECLAERRVFCFGCGLADTYKPNCRRCSSNSTKKLSSACTTEKC FT TQTSCLEGDEYRLDETPKLLHNLLPAASYHPDLQQSINYSTPGKSSTRSRL FT RRIAFNRQTRAERKMLLETVIGKLHDFRPYAQVTLLGNTVVGLLDTGASVC FT CIGGHLAKDVMEKTEYKRLSATVKTADGKSQDIIGRLTTEVSYRGESKKIT FT FFVVPSLSGDLYLGIDFWKAFNLLPVSLLSEQLSVLSVSDPLSRSLTVHQQ FT KTLAAAILLFPSFAIKGLGRTTLISHSIDIADAKPIKQRHYPVSPAIEKLM FT YNELDRMLKLGVIEESNSSWSSPVVLVRKPGKVRLCIDSRKLNDSTVKDAY FT PMPLIDGILARLPKAEFITSLDLKDAFWQIPLDKKSRDKTAFTVPGRPLYQ FT FTVMPFGLTNSPQTMSRLMDKVIPANLRNEVFIYLDDLLIVSDTFERHLEV FT LGILATQFKIANLTLNVEKSHFCIKEVKYLGHVIGNGTINTDPDKIAAVTD FT FPVPRSIKQVRRFLGLTGWYHKFIRNYAALAAPLTDTLKQKRSFVWSEEAQ FT KSFELLKERMSTAPVLHSPNFNIPFSIHCDASDSGVGAVLMQTNEDKDEVP FT VAFMSRKLNKCQRNYTVTEKECLAAVLAVKKFRAYIEGQEFIIITDHASLK FT WLMTQSDLSSRLARWSLKLQGFPFKIQHRKGSQNVVPDALSRTNTEDLSAL FT VLSSIVDLQSEEFRSPEYVALRDKLVENASKTPDLKIVDGYIYRRAEHAAG FT DKVADDLCWKLWIPRGLVEETLKDAHENQLSAHCGINKTLEKLRRFYYWPN FT LVSDVRTYINNCHVCKITKHPNHTLRPPLGNIGTSDRFFQKLFVDFLGPYP FT RSRSGHIGIFIVVDHLSKFPFLKPVKKFTADVVARYLEEDLFHCFGVPETI FT VSDNGVQFRSEVFNALLGKYHIVHRYTAVYAPQANASERVNRSVIAAMKSY FT VRPDQKNWDEQLSNICCALRSSVHSAINTTPYRLAFGQHMIVDGASYRLLR FT NLQMLEDRTVQFGKEDSFDLIRHKAKDTMHAQHARNERTYNLRSRQVSFVV FT GQEIYRRNFQQSNFAKGFSAKLAPAFIKSRIRQKLGNCYYEVEDLQGKLIG FT KFHAKDLKQ" XX SQ Sequence 6028 BP; 1770 A; 1287 C; 1356 G; 1615 T; 0 other; gttctaactt atagatatat tataatctta tttggcgccc aacgtggggc ccgaagcagc 60 catcctggct gaccccaacc gatagtcttc attcttatca attacatctg ataatatagt 120 tcctaacact tgttcgattg gttctcggtt ccttgttctg ttaatggtta gagctctatg 180 gagatagcta aagcttaaaa ttgctttact acctggactc cactaaaaca gttaactcat 240 gtgctttatg gagagtttct ggaagcgtta taactttaaa tggttataac tccaaaattt 300 aatagtataa agtcattaga atccttgggt ggattgttca tactgcatag taggcgttgc 360 ttattggtta cacgtggagt caagacctac ggccatcccg tttaccttgc ttgaggacaa 420 tcatacctat cagttgcctt tggaatcgtg ataccttctc aggaaagttc actcctatgt 480 aacatatctc ccttccttct ttgcttcctt tgtgttttca cttatgtata cttcctaata 540 ttgtttgctg gagcttagtg agttacagcg tagccgccac tggcggaaaa taatatttta 600 tcgtgtactc ttttcgtttt ctattctaat ctgtactata ttgacaagct cacattagaa 660 cagctgatta agaccgtcac aaccatatga aacgatagca tggcaaccaa acagtttata 720 caaattcctt cttttataca attttttttc ttagcgttat acaaaattaa tttaattaag 780 cgagtcaaga gcaatgccgg ttaaacgaag ccctgaaaaa ttgattaacc taaaagacat 840 tgaacgtccc acaacttcga accaaagagc accgtctatt tctcactcac tgactactag 900 atcgaaaacc aaaccagtcg aaactgcaac catgcctgac cctccaccag atcaggagaa 960 tttagatgct tccacgccga ctacgagcaa tagagcggca agtgatgttt tgtacacggt 1020 gatggcaggc acaatttcag cagaaatggc aaagcaaact ctagccatga gtgcagcaat 1080 ggcacaacta acagaaaaac tttcccaagc tattagtaca ggtttacaag cagcttcgac 1140 agcttcctct agagagaggt ccccaatccg tatagaacag ttgtcatcag ggacacacgt 1200 tgagcagcat aatatacagc ctttattgcc caattcacac tcgcaggcag cttcggatca 1260 gacacgggat tatggtcaaa tctttagtca ggcgccactg ccgggtgagg tacgtccgga 1320 tcgtataagt caaataatgg caaattggaa gctgagattc aatggcaagg attctatgtc 1380 tatagatgac ttcatttata gaatcgaagc cttaacccac caaacgatga atgggaactt 1440 cgaactagta gcccgaaact cgaacaattt atttgagggt aaggctagtg agtggtattg 1500 gagataccac aaaagtgtgt cccaaattcg ctggcatgac ttatgtgtag cgttaagggg 1560 tcaattcaaa gattcacgca cggatcgcat aatccggacg gaaatcgaag atcgtaaaca 1620 gctcgtaaat gagtcgttcg atgattttta cagtgtgatc gcctctttgg cagatcggct 1680 cacacagccc atgtcggagc aatctttgtt agagactcta agagctaatc ttcttactga 1740 ggttcagcga gaaatattat atgagccagc ttccaccatt gctcagttaa gacatctggt 1800 gcgaacacgc gaaattttta tgcggacagt ctacaaacct tcagtggcga taccgaaacc 1860 caaatatcgc aacattagtg cgttgggaca tcaaacagac cacatttccg aatccgatag 1920 ttctgatatt gaagaggtag aggtagctgc cattgaagct tcctgttgga attgcggggc 1980 gaagggccac cgttatcagg agtgtctggc agaacggcga gtcttctgct ttggttgtgg 2040 attggctgac acgtataagc ccaactgccg ccgttgcagc tcgaattcca caaaaaaact 2100 tagcagcgcg tgcacaaccg aaaagtgcac gcaaacaagt tgtctcgagg gggacgaata 2160 cagattagat gagactccaa aattgctaca taacttatta ccggccgcgt catatcaccc 2220 tgacttacag caatccatta attattccac tcccggcaaa agtagcactc gttcgcgatt 2280 gcgtaggatc gccttcaacc gtcaaactag ggccgaacgc aagatgctat tagagacagt 2340 aattgggaaa ctccatgact tccgcccata tgcccaagtc actctactgg gtaacacggt 2400 agtcggcctt ttggacaccg gtgcgtccgt gtgctgtatt ggcggtcatc tggcgaaaga 2460 cgtaatggag aaaacggaat ataagcgttt atcggccacg gtgaagacgg ccgatggaaa 2520 atcgcaggat attattggtc gactgacgac agaggttagc tatcgaggcg aatccaagaa 2580 gataaccttt ttcgtagtcc catctttatc gggagacctt tatttgggta tagatttttg 2640 gaaggcgttt aacttgcttc cagtttctct tttaagtgag caactatcgg ttttgtcagt 2700 gagcgatccc ttaagtaggt cattaactgt acatcaacaa aaaacactag cagctgccat 2760 tcttctgttt ccctcctttg ctatcaaagg cttaggacgc accaccttga tatcgcattc 2820 gattgatata gctgacgcaa aaccaatcaa gcagcgtcac tatccagttt cacctgctat 2880 agagaagctg atgtacaatg aattggatcg aatgcttaag ctgggggtaa tcgaagaatc 2940 aaatagtagt tggtcctcac cagtagtatt agtgcgcaaa cctggcaaag ttcgactatg 3000 catcgatagc aggaaactca acgactctac agtaaaggac gcatacccta tgccgcttat 3060 cgatggtatt ttggcgagat tgcccaaggc cgagttcatt acaagcttgg atctaaaaga 3120 cgcattttgg cagataccgc tcgataagaa atcgcgggat aaaacggcgt ttactgttcc 3180 cgggaggcct ctttatcagt tcaccgtgat gccatttggg ctaactaact cacctcagac 3240 catgtcaagg ctcatggata aggttatccc cgcaaatctc cgaaatgaag ttttcatcta 3300 ccttgatgac ctcttgatcg tgtctgacac atttgagcgg catctggagg tattaggaat 3360 tctagctact caatttaaaa tagcaaattt aaccctaaat gttgaaaaga gtcatttttg 3420 tataaaggag gtcaaatatt taggtcacgt aattggtaat ggcacgatta ataccgatcc 3480 agacaaaata gcagctgtta ccgattttcc agtacctcgt tcaattaagc aagttcgacg 3540 tttccttggc ttaactggat ggtatcataa gttcatccgg aactatgctg cattggccgc 3600 ccctttaaca gatactctaa agcaaaagcg tagctttgtt tggagcgaag aagcacaaaa 3660 gtcgttcgaa ttattaaagg aacgtatgag cacagcccca gttttgcata gcccgaattt 3720 caatatacca tttagcatac attgtgatgc aagtgactca ggcgtggggg ccgtgttgat 3780 gcagaccaat gaggacaagg atgaagtacc tgtggctttc atgtcacgga agctaaataa 3840 atgccaacgg aattacacag tgaccgaaaa agagtgtttg gcagcagttt tggcggttaa 3900 gaagtttagg gcttacatcg aagggcagga gttcatcatc ataacagacc atgcatcttt 3960 aaaatggcta atgacccagt ctgatctcag ttcacggtta gctagatggt ccttaaaact 4020 tcaggggttc ccgtttaaaa tccagcatag aaaaggtagc caaaacgtag tcccagatgc 4080 tctttcgcgc acaaataccg aagacttatc ggcattagtt ttgtccagca ttgttgacct 4140 tcagtcagaa gaatttaggt cgccggagta tgtagcccta agagacaaac tcgttgaaaa 4200 cgcgtccaag actccagatt tgaaaatcgt agatgggtat atataccgac gcgccgagca 4260 cgcggcggga gacaaggtag ctgacgattt atgttggaag ctttggatac cacgaggatt 4320 agtcgaggaa accttaaaag acgctcacga aaaccagcta tcagctcact gtgggataaa 4380 caaaacatta gaaaaactta ggcgtttcta ctattggccg aatttggtgt ccgacgtaag 4440 gacatatata aataattgcc atgtatgcaa gatcacaaag catcccaatc atacccttcg 4500 accgccttta gggaacattg gtacatcgga taggtttttc cagaaactgt ttgtcgattt 4560 cctggggccg tatccccgaa gtcgttctgg gcatataggg atttttattg tggttgatca 4620 cttgtcgaag tttcctttct taaagccagt aaaaaagttt actgcggatg tcgtggcacg 4680 gtatctggaa gaagatttgt tccattgttt tggggttcct gaaacaattg tttcggataa 4740 tggtgtacag ttcaggtcgg aggtgtttaa tgcgctgcta gggaaatatc atattgtgca 4800 caggtatact gctgtttacg ctccgcaggc aaacgcatca gaacgagtaa accgatcggt 4860 gatagctgca atgaaatcat atgtccgccc tgatcaaaag aattgggatg agcagctcag 4920 taacatttgc tgtgcgctgc gttcctcggt ccattcagcc ataaatacga ctccttatcg 4980 cctcgctttt ggccagcaca tgattgtcga tggagccagc tacaggctgt taagaaattt 5040 acagatgctg gaagatcgca ctgtacagtt tggtaaagag gattcgtttg acctcatcag 5100 acataaggca aaggatacga tgcacgcaca acacgctagg aatgaaagaa catacaattt 5160 gcgcagtcgg caggtgtcat ttgttgtagg ccaggagata tatcggcgaa atttccagca 5220 aagcaatttc gcgaagggct ttagcgctaa actagcacca gcgtttataa aaagtcggat 5280 acgacagaag ctaggaaatt gctactatga ggtggaagat ctgcagggta aattaatagg 5340 gaagtttcac gccaaggatt taaaacaata acttctcttt taatagccgt tcaagcgtgg 5400 atcccacttt tggtgtggtt cccccaaagt gtgattttgg taggggggaa tttgtagtga 5460 cgcttgaacc agctacaaaa atattactac aacaacaaca aaaaataaaa aaacgggaaa 5520 aaaaaaaact aatcaaaaat tttaaatcca gggaatatcg taattgcgtt tagcgcccct 5580 attgctcgtg atcgacgggt actttttggg cagacggtcg gactcgggag agagtcggcc 5640 gcggttatgt taactcgagg gaaatggtgg ttggcacttt cgctgttagg agggttcgtg 5700 aatgcaagtc cagctacggt cgctcttttc gattttgggc cccacttcag aaggtagtgt 5760 ttgtggtgga aagaggaaat attagcggaa aaacataact aaacctcgcg ccagtgctaa 5820 atcaaagaaa aaaagacaaa caaaagaatc gtgttgcttt caaattatac cataaagaaa 5880 tcgaaaagaa atttgccacc ccagtaaggg ggcgttccat tctgacccca cgccggaggc 5940 aaggcatccg aaagtggtct cttttttttg tttcggacgg ccatttgcaa gttaatataa 6000 caagagtgcc aatcgccaca tagcgaat 6028 // ID Gypsy-84_AA-I repbase; DNA; INV; 5455 BP. XX AC supercont1.15; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-84_AA_; KW Gypsy-84_AA-LTR; Gypsy-84_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5455 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.15; Positions 1934765 1929311. XX CC Positions [4474-4941] - Integrase core CC 'GTGTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1392..3095 FT /product="Gypsy-84_AA-I_2p" FT /translation="MEEARPIPPFRCESMEKQKLSREWESWKGSLECYFDA FT YSITDQRMMKAKLLHLGGVELQRVFRSLPNHDKFPVVALDPKFYDLAIDIL FT DAYFQPGKQDVIERRKLRQLKQDVGEKFSHFLIRLRQQAANCGFEKYHADV FT GDVLMEIYLIDIIVENCRSEELRRSILKKDRKLTEIEEIAACIEGTEQQLK FT DLKESTSNTREASVYQVRDPRPSFRERNRTLGPRGAPLNSRSTISQPSWKN FT TNVSCFACGESGHISKSPNCPARGRTCRRCKRLGHYEATCRKRKAEASLVP FT KSKKVFSVDENRGSESAVEKPECATGESEKVYYAFYGGNESNVLPGVIGGV FT STELLVDSGADVNLIKLETWETMKEKQVKIIKSVKGCSKILKGYGSDKPLD FT IVGSFIAEIIVGTRAATAEFFVVKGGQKDLLGDSTAKRLGVLKIGLNINHL FT ESPLKPFNKIMDVQAHIRMNPHFKPVFQPLRRVPIPMEEAVNKKLDQLLTR FT DIIEVKTGPSTWVSPLVIVGKASGEPRICLDLRSGACRRFEGRYYVHHQPR FT PFFVLSVCLSDWSQLLKFSNV" FT CDS 3493..5430 FT /product="Gypsy-84_AA-I_1p" FT /translation="MNKYIPDLATLAEPLRKLTQKNVKFEWGEAQNNSFQA FT VKKALATATKLGFFDPKDHTAVMADASPTGLGAVLIQKNALGDSRIICNAS FT KSLTDTERRYCQTEKEALALVWSVERFRMYLYGREFDLITDCKALEFLFTP FT RSKACARIERWVLRLQTFDYRMVHIPGSQNIADALSRLDSSKPVAFDSEEE FT LFIRHIAGTAAASFALKWETIEEESKRDPEILELIEMIRNEDEESLPLSYK FT VIYNELCIVGEVVLRVDRIIIPKSLRDRVLNLAHEGHPGMRMMKSHLRTNV FT WWPQMDQHVEKFVKQCKGCALVAAPNPPEPLIRKQLPDQPWIDVAADFLGP FT LPDGQHLLVVIDYYSRFMEVSEMHSITASDTISELAIIFSRYGLPMTLRVD FT NGPQLNENCEEFREFCESSGIKLINTIPFWPAMNGEVERQNRSLLKRLRIA FT QQLGKDWRAEMRQYLLTYHATNHTVTGKSPAELMFGRRIRTKLPQVPPNRL FT EDEEVRDRDTIIKEKGRVYADTKRKAKESKILVGDRVLAKRMKKDNKLDTV FT FSPEEFEVIRKAGADTIIRSHVTGKEYRRNVAHLRKLESAGTPETADVSAG FT PCPSTSEFPETTGRTTLEKPSSDKEDPLTRRAARSRNEPKYFKDFIPH" XX SQ Sequence 5455 BP; 1746 A; 930 C; 1325 G; 1454 T; 0 other; acctggcgac gagaataaat cgaggtaagc gtgattcgat tcggtggata attagaatcg 60 gtcgcgaatg aaacgcatgt gaaaaaaaaa tggccgagtt tgtgcatttc aatttaccgg 120 tgaagagcat ggagagaaga atgaaagttt gggctttata aattcagaac atgggagcgt 180 tgtggttcta tatgcttatt gtgaaatgtg actttttact ttgaagctta atcgaaatga 240 tacgatgaaa aaaaaggaat gttatttctt aaatgtgctc taaaacaaag cggcacactt 300 tgcagtggtg tgagtgcggt atataagaac attggaagca gatgtgtcat cgaaatcagc 360 attactgatt ctttgatcat gagttctcca tacaacgtaa ggtggtatgc tcgtagcttt 420 ctctcttacc aatacatgca tatctttgtc attatgatac ataaaaataa cggtagtgta 480 aaaggtagtg gagataaatg aggaaaacaa aacagatgag ggagatgtga acgtattgaa 540 gaattcagat gatgtaggca taaaaaaaaa tggtttggaa tcagtgccat ccgaaatatg 600 catgtttcaa agttgggaat ggatcgaagt gataaatgaa tgaaccagta gcgttcgaat 660 tggttaatgg tgactgatat gtgtggaagt cgattatttc tatggaagta agagtatcct 720 ttggaccagt agcgtcaaag ggggtgaaga taaagggtga ataaaagttc gtttctgcaa 780 gtttatcctt ctgaccagca gcgtcggagg aggtgaggat tttgtttttt agatggaaga 840 aaggaaaaaa ctcttgtgtg tatctttggt ataagcgaag atgtggatat ggaagtatat 900 gtttttcaag ttttgtgaag atgtatcctt tagaccagta gcgtcaaagg aggtgaggat 960 gaagttagtg aagtccaata ttgcaaaatt atttgacacc agtagcgtgc aaaataaaga 1020 gaagtgcaag aaatatatga atacaagaaa tcaaccggca cggtttcagg aaaatcgtca 1080 aaaaaaatac aaacggaggg ttactaaact acaagagttt ctctcaggta agtgaatttt 1140 gaaactatat gaaacacata attcagaatg aggggtattg ataaaaaaaa aaaaaatttc 1200 aaaattatta ttgaatacat tcagataaat tctgctaatt ttatgaggta taaattgaat 1260 gagtgggcta caatgagtac tgtgtataat aacatgctta cagtaaaata catgctcttc 1320 ctgacaattg aaatgcacaa tcaacaaaaa tatgtaggca cgggtaaaaa aaaaaaaaaa 1380 atattttaca gatggaggaa gctagaccga tacccccatt cagatgtgaa tctatggaga 1440 agcagaaatt gtctcgtgaa tgggagtcgt ggaaaggatc tttggaatgc tacttcgatg 1500 cctattccat tacagaccaa cgaatgatga aggcaaagct tcttcatctg ggtggggtag 1560 aactacagag agtgttccga agtttaccaa accacgataa atttcctgtc gtggctctgg 1620 acccaaaatt ttatgatttg gctattgaca tcctggatgc atactttcaa cctgggaagc 1680 aagatgtgat tgaaagaaga aagcttcgtc aattaaaaca ggatgtgggt gaaaagtttt 1740 cacatttttt gattcggctt cgtcaacaag ctgcgaattg cgggtttgag aaatatcatg 1800 cagacgtcgg tgatgttttg atggaaatct accttatcga tatcattgtg gaaaattgtc 1860 gttctgaaga actgaggcga agtattttga agaaggaccg taagctcaca gaaatcgaag 1920 aaatcgctgc gtgtattgaa ggaaccgaac aacaattgaa ggacctgaaa gaatcaactt 1980 caaataccag agaagcgtca gtatatcaag tccgtgatcc gaggccgtct ttccgggaac 2040 gaaacaggac gcttggtcca cgaggtgcac ctctgaattc tcgatctact atcagtcagc 2100 caagctggaa gaacacgaat gtatcctgtt ttgcttgtgg agaatcagga cacatttcga 2160 agtcaccgaa ttgtccagcc agaggtcgta cttgtcgtcg ctgcaagaga ttgggccatt 2220 acgaagcgac ctgtcgaaaa cggaaagcag aagcatcatt ggtacctaag tcgaaaaagg 2280 tattcagtgt tgatgaaaac aggggcagcg aatcagcagt agagaagcct gaatgtgcca 2340 ccggagaatc agagaaagtt tattacgcgt tctatggggg aaatgaatcc aatgtactac 2400 ctggtgtgat tggaggagtt tctactgagc ttttggtcga ttccggagcg gatgttaatc 2460 tgattaagct agagacatgg gaaacaatga aagagaagca agttaagata atcaaatcag 2520 tcaaaggatg cagtaagatc ttgaaagggt atggtagtga caaaccgttg gatattgttg 2580 gatcctttat cgcagaaata atagttggta ctagagctgc aactgccgaa ttttttgtgg 2640 tgaaaggggg tcagaaggac cttttgggag attcaactgc aaagcgttta ggtgttctca 2700 aaattggtct gaacatcaac catttggaat cgccgctaaa acctttcaac aaaattatgg 2760 atgttcaggc ccatattcga atgaatccgc atttcaaacc ggtattccaa ccactccgta 2820 gagtcccaat cccaatggaa gaagcggtta acaagaagct agaccaactc ttgactcgtg 2880 acatcataga agtgaagaca ggtccgtcaa cgtgggtttc gcccttggtg attgtaggaa 2940 aagcttccgg tgaacctcgt atttgcttgg atctgcggag tggagcttgc cgaagattcg 3000 agggacgtta ctacgttcat caccagccga ggcctttttt cgttttaagc gtctgccttt 3060 cggattggtc acagctcctg aaattttcca acgtgtgatg gaagaaatgt tgtcgggatg 3120 cgaaggctgt tactggtatt tagatgacgt tattatcgaa ggagagacgc aggaagctca 3180 tgatgccaac ttggagaagg tactgtaaat tgattgtctt tcgtttaaaa gaaatatgta 3240 ttcaaataaa ggtatggttg ctattgtttt aatattttaa tttcaaaaca ggtgctaaat 3300 agactgaagg aaaaaggagt agaattgaac tgggacaaat gtaaaatcag cgtcaaagag 3360 ttggattttc tgggccatca catatcagaa aaaggtatca gtccgtctca aaccaagatt 3420 gattcgattc tgttatttcg acaaccagca acggagtctg aagtaaggag ttttctggga 3480 ttggccaatt atatgaacaa atacattcca gatttggcta ctttggcaga gcccttgcga 3540 aagcttacac aaaagaacgt aaaatttgaa tggggtgaag cgcagaacaa ttcatttcaa 3600 gcagttaaga aagctttagc cactgctaca aaactggggt tctttgatcc aaaagatcat 3660 acagctgtga tggctgatgc tagtccgact ggattgggcg ccgttcttat tcaaaagaac 3720 gcattgggtg attcaaggat aatttgcaac gcgtccaagt cgctgacgga tacagaacga 3780 cggtattgcc agacggaaaa agaggctctt gcccttgtat ggagtgttga gcgcttccgt 3840 atgtatttat atggcagaga gttcgatctt atcacggatt gcaaagcatt ggagtttctg 3900 tttaccccac gatcgaaggc ttgtgcacga atcgagcgat gggtgctaag actgcaaact 3960 ttcgattata ggatggtcca cattcctggt tcccagaata tagcagatgc tctttctcga 4020 ttggattcat ctaagcctgt tgcgttcgat tccgaagagg aattattcat tcgtcatata 4080 gctggtacgg cagccgcatc ttttgctttg aaatgggaaa caatagagga agaaagcaag 4140 agggatccag aaatccttga actgattgaa atgattcgta acgaagatga ggaaagcttg 4200 cctttatcat ataaggtgat ttacaatgaa ctttgtattg tgggagaagt tgttttgcgt 4260 gtcgatagga ttatcattcc gaaatccctc agagatcgtg ttttaaatct ggcgcatgag 4320 ggacatcctg gcatgcgtat gatgaagagt cacttgagga ctaacgtttg gtggccacag 4380 atggatcaac atgtcgaaaa atttgtcaaa caatgcaagg gttgcgcctt ggttgctgct 4440 ccaaaccctc cagaaccgtt gattcgtaaa caacttcctg atcaaccgtg gatcgatgtt 4500 gcagctgatt tccttggtcc tttgccagat ggacaacact tattggttgt gatagattat 4560 tacagccgct tcatggaggt tagcgaaatg cattcaatta ctgctagtga tacaataagt 4620 gaactagcga taattttcag ccgttatggt ctgccaatga cattgagagt tgacaatggc 4680 ccacaactga atgagaattg tgaggagttt cgcgaattct gcgaatctag tggcatcaag 4740 ttgatcaaca ctattccttt ttggccagct atgaacggcg aggttgaacg ccaaaacagg 4800 tctcttttga aaaggctcag aatcgcacaa caattgggaa aggattggag agctgaaatg 4860 cgtcaatatc ttttgactta ccatgccacg aatcacactg taacaggaaa gtcaccagct 4920 gagctcatgt ttggcagaag gataagaacc aagctaccac aagttccacc aaatcgtcta 4980 gaagatgaag aggttaggga tcgtgacacc atcataaaag agaaaggcag ggtttatgca 5040 gatacaaaga ggaaagctaa ggaaagcaaa atcttggtgg gtgatagagt tcttgccaag 5100 cgaatgaaaa aagataataa gttggatacc gtcttttcac cagaagaatt tgaagttata 5160 cggaaggctg gtgctgatac catcatccga tctcatgtaa ccggaaagga gtatagaaga 5220 aatgtagcgc accttagaaa acttgaatca gcaggcactc cagagaccgc agatgtttct 5280 gcaggtcctt gtccatctac gtcggaattt ccagagacaa ctggcagaac aacgttagaa 5340 aaaccatcat cagacaaaga agatccactt acgagaaggg cagcgagatc aagaaatgaa 5400 ccaaagtatt tcaaggactt tatcccacat tagcagtcta aaaggtgggg agggt 5455 // ID Gypsy-20_DWil-LTR repbase; DNA; INV; 265 BP. XX AC scaffold_180958; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_DWil_; KW Gypsy-20_DWil-I; Gypsy-20_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180958; Positions 33562 33826. XX SQ Sequence 265 BP; 79 A; 53 C; 64 G; 69 T; 0 other; tggacaaatg aatatagcta atactgcatt tggcagtggc cacgatcgct gggtcagcga 60 tcccacggtc tgggtaatat gagccggccc gcaagacgtt gaccgctgct agcggcacga 120 ggcgcagtgt cagcattccg atggccaatg cttagggttc ttgggtagtt tatttaataa 180 tgtttagtta gttataattc ggactccaag cgcacaaaac tgtggcttgt aataaatcaa 240 atataaataa acatataaat aaaca 265 // ID Copia-21_SI-I repbase; DNA; INV; 4197 BP. XX AC AEAQ01023582; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_SI_; KW Copia-21_SI-LTR; Copia-21_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023582; Positions 4580 384. XX CC Positions [1652-2188] - Integrase core CC 'ATTTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 233..2782 FT /product="Copia-21_SI-I_2p" FT /translation="MTENFYSKNITKFDGKNYQSWKFQMKAVLVANDLMEI FT VDGTKVEPEAVAERDAWKKSNAKAMFIISTSLEESQLETLITCNTATEMWA FT KLRSVHEQTSETNKLILTQRFHAYRMEHNMSISQNIAKIENMARHLKDLGE FT TISDVAIMAKILGSLPSKYSYLVTAWDSVDPTRQTLEYLRERLLKEEARLS FT STDEMTNALAAVSLKTNQSKSESNLQKKPEISNAIRGSTVKKVECFYCHKL FT GHMIKDCRNRKKKDQEKSGKLETVALVVTQSTADTTTSESVSAHSKEISEI FT LSYDKDNIWYADSGASKHISFRREWFSEFHPSSGEDIVLGNEAKCKVLGSG FT TIAAKRLVSGRWLDVKLEDVLYVPNFGRNLISVGYCTKKKLTVTFHDDVVT FT ISRDGEYIVGGIKQENQLYRLFMKPMLHEANVASFQTWHERLGHLNKNYIS FT EMAQKGLVAGLDLSNKSDFFCEDCQYGKMHRKPFRNTDRTQTMKIGDLIHS FT DVCGPFQVDSLGGARFYVLFKDDFSNYRVVHFIKHKADVFDKFKEFEKSIQ FT NRFGHSVKRLRVDNGREYCNKEMSNYLVKKGISLETTAPYSPEQNGKSERD FT NRTLVESARAMLHAKNLPMFLWAEAINTAVYVLNRSVCSKTKDSTPYEIWT FT NKKPNLSHVRIFGSNAFIHIPKVHRQKLDPKARKLMLVGYQGESSNYRLYD FT PSNKRITVSRDVIFNECPSREMASSKSVCLEIGLENLSKDLPTQDIPNEVQ FT ENNEQKQNQENGDRSNIEEPTRNEATQRILRDRQSIRKPDRYEANFLQFDE FT PKSYEDAIHGTHAEEWKCAINEELAAHKKNGTWEVITLPPGKKTIGAK" FT CDS 2782..4161 FT /product="Copia-21_SI-I_1p" FT /translation="MIFKVKQSSSGEIERFKARLCAKGFTQTKGVDFTETF FT SPVVRYDSIRTLLAIATEKNLKMQQFDVKTAFLNGDLSEEIFMDIPEGVQI FT SNIRGKVCKLVKSLYGLKQASRCWNLKFDAFLKDYNFTQSAADQCVYIGSI FT NGSLTYVVLYVDDGLVFSESKESLNSIIRDLSDKFEIKLCDGKMFAGVQID FT RDCKNRTTFIHQSLYAERLLYKFNMAEAKPVSTPIENGFDFNSSDDGNVSN FT FPYHEAVGSLMFLATVSRPDIAFAVNVASRYFNNYSNIHVSLVKRILKYLK FT GTIRKGIVYKSDGSVLELVGFSDADFAADCETRRSTTGYLFQLAGGPITWS FT SQRQKLVTLSTTESEYVAACAAAKECIWLRKLLHDINHKCNSATIIYVDNQ FT SAIRLVHNPEFHCRTKHIDVKHHFIREKFQNNEIEVSYVPSDSQKADLLTK FT AVPKMRFEYLVSFIS" XX SQ Sequence 4197 BP; 1370 A; 783 C; 922 G; 1122 T; 0 other; ggttatgggc ccagacatct cgtaattggg gtctgatcgt tctaaagaag aaaagcgtaa 60 gtttgtcgtt gtgtcgagcg tgctcgtggc aacctaacct aaggtaggaa ggaagcgtaa 120 cgaacgaccg cgtggtctcg taaagtcttt cacgtctcgg taacaagcca aaagaaacag 180 tttctttttc gagttgcaag gtttcagaat cgattgcgaa taacctgcaa aaatgacgga 240 aaacttttat tcgaagaaca taaccaaatt cgacgggaaa aattatcaat cctggaagtt 300 ccaaatgaag gcagtattag tagcgaatga tctaatggaa atcgtggatg gtacaaaggt 360 ggaacccgaa gccgttgctg agagagatgc ctggaagaag tcaaatgcaa aggccatgtt 420 cataatatct acttcgttgg aagaatctca actcgaaacg ttaattacgt gcaatactgc 480 gactgaaatg tgggctaaat taaggtcagt acacgaacaa acctcagaaa caaataaatt 540 aatcctcacg caaagattcc atgcatatag aatggaacat aacatgtcga tatcacaaaa 600 tattgctaaa atagaaaata tggcaagaca tctgaaagat cttggcgaaa caatctcaga 660 tgtagcaatc atggccaaga tattaggaag cttaccttca aaatatagtt atttagttac 720 ggcctgggat agtgttgatc cgactcgtca aactcttgaa tatttgcgtg agagattact 780 taaagaagaa gctcgtttat cctctacaga tgaaatgaca aatgcgttag ctgctgtttc 840 tttgaaaact aatcagtcaa aatctgaaag taatttgcag aaaaagcctg aaataagcaa 900 cgcaattaga ggttctacag taaagaaagt cgaatgcttc tattgtcata agttaggcca 960 catgattaag gactgtcgca acagaaagaa gaaagatcaa gagaaatctg gcaaattgga 1020 gactgttgca ctcgtagtaa ctcagagcac cgccgataca actacgtcag agtcagtaag 1080 tgctcactca aaggagatct ccgaaatcct atcgtacgac aaagataata tatggtatgc 1140 ggacagtgga gcctccaagc atatcagctt tcgaagagaa tggttctcgg aatttcatcc 1200 ctcgtcagga gaagatatag tattgggtaa cgaagccaaa tgtaaggttc taggatccgg 1260 gacgattgct gcaaagagat tggtgtcagg aagatggctg gacgtcaagt tggaggatgt 1320 tctttatgtt cctaatttcg gaaggaactt gatttccgtt ggctattgta ctaagaaaaa 1380 gttgacagtc acatttcacg atgatgtcgt cacaattagt cgagacggtg agtacattgt 1440 cggaggtatt aaacaagaaa atcaacttta caggttattt atgaagccga tgctccatga 1500 agctaacgtc gcgtcattcc aaacgtggca tgaaagatta gggcacctca acaaaaatta 1560 tatcagcgaa atggcacaaa agggattagt ggcaggcctt gatttgtcaa acaagtcgga 1620 cttcttctgt gaagattgtc aatatggcaa aatgcatcga aaaccctttc gaaacacgga 1680 ccgaactcaa acgatgaaga tcggcgacct cattcacagc gatgtatgtg gtccttttca 1740 agtggattca ctaggaggag ccagattcta cgtcttgttc aaggatgatt tttctaatta 1800 ccgtgtagtt cattttatca aacacaaggc tgacgtcttc gataaattta aggagtttga 1860 aaaatcaata cagaatagat ttggtcattc ggtgaagcgt ctgcgtgtgg ataacggaag 1920 agagtactgc aataaggaga tgtccaatta tctcgtcaaa aagggcattt ctctggaaac 1980 aacagccccg tactcacctg aacaaaatgg caaatccgaa agagacaatc gtactctcgt 2040 ggagagtgca agagcgatgc tacacgctaa aaatttgcca atgtttctct gggctgaggc 2100 cattaatacg gctgtctatg ttctcaatcg cagtgtctgc agcaagacga aagattctac 2160 gccttatgaa atctggacca acaagaaacc taacttgtct cacgtcagga tctttggaag 2220 taatgccttt attcatatcc caaaagttca cagacagaaa ctggatccga aggccaggaa 2280 gttaatgcta gtcggatatc aaggcgaatc gtccaattat cgcctttatg atccgtcaaa 2340 caaacgaatc accgtcagtc gagatgttat atttaatgaa tgtccttcga gagaaatggc 2400 atcgtcaaag tcagtttgcc tcgagattgg tctggagaat ctatcaaagg atttgccaac 2460 acaggacatc ccaaatgaag ttcaagaaaa taatgaacaa aagcagaatc aagaaaacgg 2520 tgatagaagc aacatcgaag aaccaacacg caacgaagcc acacaacgaa ttctacgaga 2580 tcgacagtct attcgcaaac ctgatagata cgaggcaaat tttcttcagt ttgacgaacc 2640 aaagtcgtat gaagatgcta tccatggcac acacgctgag gagtggaaat gtgccattaa 2700 cgaagaattg gcagcgcata agaagaatgg tacttgggaa gttattactt tgcctccagg 2760 gaagaaaacc atcggtgcca aatgattttc aaggttaaac agtcatcatc cggagaaatc 2820 gaacgcttca aggcacgcct ttgtgccaaa ggatttacgc aaacgaaggg cgtagatttt 2880 actgaaactt tttctccagt cgtccgctat gactcgatta gaactcttct ggccatagca 2940 acagagaaaa atctcaaaat gcagcaattt gacgtgaaga ccgccttttt gaatggcgat 3000 ttgagcgagg aaatattcat ggatattcct gagggtgttc agatctcaaa tattagaggt 3060 aaagtttgta aattagttaa atctttgtac ggattaaaac aagcctctcg ttgttggaat 3120 ttaaaatttg acgcgtttct taaagattac aactttactc aatctgctgc agatcagtgt 3180 gtttacattg gttcgattaa tggttctctt acctatgtag ttttgtatgt agatgatgga 3240 cttgtgttct cagaatcaaa ggaatcatta aactcaatta ttcgagattt atcagataag 3300 ttcgagatta aattgtgcga cggtaaaatg tttgctggag ttcagattga tcgcgattgt 3360 aaaaatagaa caacgttcat tcatcagagt ttgtatgctg aaagacttct ttacaaattt 3420 aatatggctg aggcgaaacc cgtcagtacg ccgatcgaaa atggatttga ctttaactcg 3480 agtgatgacg gaaacgtctc taattttccc tatcatgaag ctgttggctc gctgatgttt 3540 ctcgccacgg tcagtcgtcc agatattgcg ttcgcagtaa atgtcgcgag tcgttatttt 3600 aataattata gcaatattca tgttagtctc gtaaagcgaa ttttaaaata ccttaaaggt 3660 acgattagaa aaggaatagt ttataaaagc gacgggagtg tcttggaatt agtaggcttt 3720 tcagatgcag attttgctgc tgactgtgaa accagaagat caactactgg ttatttattc 3780 caactcgcag gtggtccaat cacttggtca tctcaacggc agaagcttgt aactcttagc 3840 acgacagaat cagaatacgt cgcagcctgc gccgcagcca aggagtgtat ttggcttcgc 3900 aaattattgc atgatattaa tcataaatgt aattccgcaa ctataatcta tgtagacaac 3960 cagagtgcga ttcgattggt gcataaccct gaatttcatt gccgaactaa gcacattgat 4020 gtgaaacatc attttatccg tgagaaattt caaaacaatg agattgaagt aagttatgtt 4080 ccgagcgatt cacagaaggc cgatttatta actaaagcgg taccaaaaat gcgtttcgaa 4140 taccttgtat ctttcataag ttagaatgta cgcgtgaaat cactcaaagg gagggag 4197 // ID CACTA-1_CapOwc repbase; DNA; INV; 7243 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-1_CapOwc. XX OS Capsaspora owczarzaki OC Eukaryota; Ichthyosporea; Capsaspora. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 7243 BP; 1729 A; 1891 C; 1695 G; 1828 T; 100 other; cacagtggaa aaattttttt cggatttcgg agcaaaattc ggtcaaaatt cggaaaaacc 60 gaatttatca ttttgtcggt caaaattcgg tgcctcagca gagcgagatt atggcgatag 120 ctcgacgtct gattggtcag aaattgcttt cttcgtctcc ttttatatta ttaaaaatat 180 tttgttttta atttttaata aaatttattt tattattggt ttccatattt gcacaaattt 240 acaaaaatct cagatcgccc ccgcctgcac ctcctggtgc tttttactgg ccttggaggg 300 ctcgactggc agcacgctgg gcacgcgtct gatgtcgccg agggccactt tgctcgcctt 360 ccgtgcagaa tcgcatctgt tcaagtagaa atcacacttt agtcactgtt ctgtcccgtg 420 cactgcatct gagcatctga cgagcagtca cttacgcgtt tgcttgaacg agacacgttg 480 tgggccggcc gataatcgcg gttggatcgc cttctcacca agccaccggt gcagttgttg 540 tcctgacgcg aatcatcttg atcatcgatt tcagcgcttg ttacacgatc gtttgcctcg 600 tggctttcgc tgtcgctgga tgctgcagcg agtccactat ccgacgaagc aggatgcgcg 660 tcatcgtcct cgctctcgtc tgccgccgcc aagttggagt ttgtgagcag gctttcgctc 720 gaactgacgt tgttgttgtt gttgctgtcg tcgctgctgt tgttgctgtc gtcgctgttg 780 ttgttgctgt cgtcgctgct gttgttgttg tcgtcgtcgc tactgggttc agtgtggaaa 840 ttgatgtcgt cagtggcgcc agctgaaagg tcgatgtcgg ggaggggctg tgacatggca 900 attgaatcgg caacctgctg tgtttccacc atgactgatt cttgtgcatc tggcgttcga 960 gctgctgcgc gctttttagc actattgttc agcccacgcg ccgagtcata cacctatcaa 1020 acatcagctc aaacttagca tttcggttgt agtgctttcg ccgatcgaac aacacgcgta 1080 ccttgtagag tacatcattg acttcatgat gcttgtgcga tgtgacgagt gccaacaacc 1140 tgttttggaa ccagagctct actccctcat cggtgtcggt tgtaccggcg gaaagctgct 1200 cttgtgtttt ggtttgggcg cttttgagtg cttgccgtag atgctttggt aaaaaccact 1260 tcggataccg agcatgaaga tcgccaaaca ttccccggac ctctctgacc catttaacct 1320 ccctatactc actcaagcca gcttagtaaa ttgacaaaca tcacactaga tcccatcata 1380 cacgactgct gtgcgtactt gctcgagctc acctgccggg accgagacaa caactcagat 1440 tgcagccgca ttgtttggga ttgcaattct acgacttgtg cgcgaagctg tgatatgaga 1500 ttgttgtttt ctgaaatgat ggccgcgtgc ctatcctctg cagcacggag ggcctgtttg 1560 acttgatcca actcaaccgt cgcccgctgc gccgactcaa gggcgaattt ttgggtatcc 1620 attatgtgct ggacaaaagg ggctacttat gatcatattc cacggagagg gcgggagtgc 1680 gagacaagaa gaagatcatg tgattttggg cgcgaacttt tgaaaaatgc attttcccgc 1740 caattcacgc cctacctgat ttaaaatttg cattttctcc cctttctatc gcgcttctgc 1800 agttccttca catctctctc gtgtcgttct tgtgtatcca tgtgagatga gtgttttcca 1860 cgcatccgcg cagtcggcat cgatctgcag gcagcagacg accgcatctt tcgtcatctt 1920 tttttttctt tttttttttc gcgccttttt attttttttt tcttttattt tttttacctc 1980 catcctcttg tcctcctgac atcagcatcg cttgcgtgcg ttggctgctg cacagtcaga 2040 gagaggtaaa gctcaatgac agacaagcac cacaagccgc gcgacggcag tgttcgcctt 2100 cttctttgct tttgtttttt tttatctttt tctttttgtt gtggcacact cacaaatgtc 2160 actgctggcc atctcgatct gcactcgatg tagcacaaaa aaaaataata aataaaataa 2220 taaataaatg gtagactgaa gcagtgccag tacaaaataa aattaaatta aattaaatta 2280 aaaaataaat acaataaaaa atgaacattg aaaataaaaa aaataaaaaa cagaaagcgc 2340 aaccaaacag gttcgaaccc agttgcaaaa agaaagaaaa accttcaaga cgcttggtct 2400 agacagctcg aagtcaacga tgggcgaaat gggattcgat cgcaggcatg aaactacact 2460 ttctcatcct gctccaatcg cgcgtatctc gattggacaa tatgtcatta tatctcagaa 2520 tactacaagt cgagaaaagg agcacgaaac accgcgaaaa gacacgcaaa cggtgagaac 2580 aagctcactc gtttcgtgtt atcgcctgtc ctgtcgctgc attggcccct tttgccccgc 2640 tcgctgcttt gaatgatact agaatgagcg gtcgtcctaa cgccatattg tattctggtg 2700 tgtggggcgc ggtttctgcc attttgaagc gaacattcgc gaaaaatgaa gccggccgca 2760 ggcgttcttc gaacactatg tttccgtgct gttattgcca ctggactgga actgctgcga 2820 catcacttga gtataaccac acgacaatga atactctcat caccaccctt cacaatcttt 2880 gtcacaggcc ggtatcaggc tcgcggaagt gacacccatc gcgacaaaca acgcaacaac 2940 cgatcaacgc tcgtttttcg gcgcacatcg cacgcttgac gtcagcgctc acttgtatta 3000 catttgaatg atttttgaca cgcattgatg tgcagcgagg accgagtgcc gcaagactgc 3060 cggcagtaca agtaagggaa gtgaccgatt tgagctcact tttgttttta tgtcagaaga 3120 ttggggttca tttcagcggt gcatctgcac catgtgcgca gccgcctttc aaacgaatcc 3180 atgcagttca ccacaagctt ggatgtcgga tgctgacaga aagcttcatc atcaaatcca 3240 caacctgccg ctgtcaggag tctgcccttg tctgaactgc accaagttcg ccatcaagca 3300 atggtctttc ggcaacggca atttcgagaa actttcgccg tcaagtttct acagacacgc 3360 gactcaatgc tcgtatgcaa ccgttgctga tgtcatccag gcgtatcccg aaatgcgaca 3420 acgctttgag acgcactttc gagaggagca aactgttcca atggcggtgg attctactcc 3480 ccctactccg ccactgcaca cgtctgagcc agctgccatg gcagtggatg accatggtga 3540 cgactacgat gatggcttcg gcgatgtcca cgtcgctgac cacggtgacg ctcctcaagc 3600 tcctctggga gctgatgacg acgaatacga aagcgacttc gaattggacg tcgaagcgac 3660 tgacgaagcg gacgataccc cgaacaatgc cacagtcaat gacccgagcg agacgtcaga 3720 ggaggcgact gcagcaccag gagcggccga accaggagcg gccaaaaaca aaggaccgac 3780 acctagagcg actcgcacag ctgctgcgta tctgcagctt ctccaattcc tcaatgacat 3840 caaaatcaaa ggcaaggtca cgaaagacat cttcactcaa cttctcacgg gtatccacaa 3900 gtactgctcc gagattaaag caatcttgga tgatcattcg acgagattgg tctctttgtt 3960 ccagacgttt ctttcggacg ttcaacagcg tacaaatgtc gacccaattg cgtctgcggg 4020 cgcgttctta gaccaatgtc aagagagtta tcctgtgatt caaggtaagc acgtcgaatt 4080 ggtggctggc gagggattgc tctctaactt tcgaatgttc cagttgttcc tccgaccatc 4140 catcgattta cgaaggctct cagactcgtc aatcttctcc cgccagcgtc acctgtcgac 4200 aagaacaatt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn acgcatgaaa 4320 aacctgtaca ctgatagcga ccagatcaag tggtaccgcg ttggctgtgc acactcactc 4380 agtttgaagg aagtgctggt atccgaaaac gtgtgtagcc aatgtggtgc attcggagcg 4440 aagtgttcac aatgcaccta ctccggttgc tttcgtggtc ctgtgaacca gcttctcgcg 4500 catcgatgca ccaacaccaa ttgccctgcc cgtttctcac gtgagaccat ctacccgtac 4560 atgccgctta ttccgcgttt gagaaacatg ttcgcggatg aaaatcaagt caagggcctc 4620 gtcgacaggt tcggcatcga cttcagcgat ccgcaacaag ttcaaagagc cttgaccgac 4680 gattggccgt caatcatcac caggcacaac gacaagcttg actcattgcg cagcttcagc 4740 cacccgctgg aagggctgca tttccgtaca ccaagtcgtc gcacgttcct gtcagagtct 4800 ccttggaaca tcagccttgc tttcttctat gacgggtaaa gcgcttgaaa gaccatgatg 4860 gcgggaattg accgagaaag gactatagac tcaccaactc tgctggttta gatttgaccc 4920 atattacacc tcttccgcca agatgcatgt cttgtgtttc aggcccctca acatgaacac 4980 agcaatagca catcgattca tctggcctgt ctcttttctg caggtacgct gctggcaagc 5040 aacgttatgc tcatgctgga ggtggatgtt actaagatgg tcttcttggt ggtggcgtag 5100 gggaaggagc cgaaaagtct aatgccatat tcaaggccca tgctgcaaga gctcaagacg 5160 ctctctaagg atggggtggt tgtttacaac gcattgctcc aaagggacgt gattgtgaag 5220 gctgagctgc acctactcac ggctgacagt ccagcgcgag ccaaagtgct gtcactcgtg 5280 gctcccactg cctacaacgg ttgctgctat tgcccaatgt cggcccttgc ctgcaactgc 5340 cggaagcacc gcgactggtc aaaagctgca gcttccgacc gcacccactt gtcagcaaac 5400 cagttcacca atttcaacta ttctttcgtg ggaaagctgc cctacattac gcctcccaac 5460 acatgctcca tcgagaccag cttttgctcg cttgacacat tgcacagctt tatgctctgc 5520 agtggcaagc gtttactcag cctgactcta gaatcgatga aacccttcgt tcacacgggc 5580 aaaggatccg tgcttaggaa caagagggct gcgctgcttg gcaacgacac ggtcaagatt 5640 cttgaagtag acctgcgctt ggccatcact gtatggcctg cctttgtcat gcgaaagcct 5700 cgactttgct ctgcacacca caagtcgtac actgccgagg agatgctcaa ttttgtcctg 5760 gtctgcggca agcacatctt cctgggtttg gtggatgaga gaaagcaagc tatctggtgg 5820 aacttttcag aaatgacgcg gatcgccatg tccaaggtgg ttaaggtcga ccccacgacg 5880 cttgactcgc tcaacaaact gacttccaat accatcggaa tcatcagtga gacgtttggg 5940 aactgcaacg tgcccatcac tacccaccat ctgtctcatc ttgcccgggc ttgccagcaa 6000 catggcccgc ctgcacatgt ctcgacattt ccgttcgagc gcgcgatcaa ccaatacggc 6060 cgccttatcc gccccggtac tcacgacgaa attgttccca gactggtggc cagaatcgca 6120 ctgcaggtca accctgtggt ttcctacgct ccaggtcgca agtttggggt gctgacgcgc 6180 ttgcgtgtcc ctgaagaaac gatggaagcc gtcgtggcga agctggccaa tcacgagcac 6240 gacgctcctt ctggctcggt ctacacggaa gcctcggtct ccaagcggat ggcggaattc 6300 aaggagaagg ctcaagcgtt ccagacactg accatgtctg tgtggggatc gcgacccgtt 6360 tcgctcgcca aggtatccga agcaacagtt cttcagtttg gggacgtggt tgtcaagcca 6420 gccaggtttt ggcgtatcga ccaatcaatc tgggttgagg ttgcaagtta ctaccgtcca 6480 tccgaacatg tgtacggcaa gaacccactg taccagccac tccgccctcg cttcggacac 6540 gagatcgtag ctctcgacag ctttacatgc agcgggtact gtcacgaatt gaccaacagc 6600 cagttcaagc aagtgaacga gcagtacaat gccaatcgac ggctaaagaa ggtgatccaa 6660 cagcagagtt tgcagcgcgt gaacgaagcg gaccaggtcg acgagcgtgc tcgatcttgt 6720 ccgcaaaacg acccaaaccc tccaactaaa ccgcaattca tcatgattct gtggcatttg 6780 aatatgcgtg atttgtgtta attgtatttg tagctagaat gagagatatc ctgagtgaag 6840 aattggtgca ctggtgctgt cattgagtgg tggctgcgaa gttgtgtcaa cagcgctttc 6900 ctgcgtctgt gcgaccgtcc ccgaccgccc ccgcatcaga aacaacgcgt gacaactgta 6960 tcacgtcact ggacatgcaa aggcgcgcgt tgacatggcc tggggagcag tagcaaggag 7020 gagtgccaat cgcaatggac cgccagccaa ttcccctcct catcgtttcg cgttccagga 7080 cagtccccgc cgtccatttt ccacaaatat gcactgcaaa caacatatat tgcatactcc 7140 gattgaaaga aacaagccaa acccaagcat tcgcaagttg aaaaacaccg aatttttgaa 7200 aaaattcggt cgttccgaaa ttcggaaaaa attttccact gtg 7243 // ID Gypsy-250_AA-LTR repbase; DNA; INV; 254 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-250_AA_; KW Gypsy-250_AA-I; Gypsy-250_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-254 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1102-1102 (2011). XX DR [1] (Consensus) XX SQ Sequence 254 BP; 75 A; 50 C; 48 G; 81 T; 0 other; tgtagtatat ctagatgagt tatggcaaat agtgttgcct taaacaatag ttaattgtac 60 tgcgagttgc tatagaatta attatagcag ttaggttgct cagcacatgc tgacttccgt 120 tcggtctctt tcctgaccga ccgtcgacca gtgcagacgc acaagaagaa atcgttagaa 180 taataaatta atcaattagc agttatcaaa gtacagtctc gcgtgtgttt tcattacttc 240 ctcccaacat ttca 254 // ID ULTR-1-LTR_NVi repbase; DNA; INV; 188 BP. XX AC . XX DT 10-APR-2009 (Rel. 14.04, Created) DT 10-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Nonautonomous LTR retrotransposon, long terminal portion, DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW ULTR-1-I_NVi; ULTR-1-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-188 RA Bao W. and Jurka J.; RT "Nonautonomous LTR-retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(4), 790-790 (2009). XX DR [1] (Consensus) XX SQ Sequence 188 BP; 39 A; 54 C; 54 G; 40 T; 1 other; tgtaacgagg aacaacgtcc ctcgccacgc cggtaatcga ggcaggatag agagtgacga 60 aatagggcgg cgttggctca agyggtagca cgcgcgccta gagatctcgg gagcgccgga 120 tcgattctcg gtcccggcgt ctccttttgt gattttctcc tctcacgtca aagtctcacc 180 ccgttaca 188 // ID Copia1-LTR_Dmoj repbase; DNA; INV; 137 BP. XX AC scaffold_6680; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1_Dmoj; KW Copia1-I_Dmoj; Copia1-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-137 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1020-1020 (2009). XX DR Genome; scaffold_6680; Positions 20014414 20014550. XX SQ Sequence 137 BP; 43 A; 33 C; 21 G; 40 T; 0 other; tgttgatatg gcaattctca tattagtggc aacgctatca tgtacttagc tattaagaag 60 cataataaat cagtctcact tgaacaacca aacgttgcgg ttcttcttat ccactcgcgc 120 acaaatactc gccaaca 137 // ID BEL-27_AA-I repbase; DNA; INV; 5600 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-27_AA_; KW BEL-27_AA-LTR; BEL-27_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5600 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 140251 145850. XX CC Positions [4439-5041] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 41..5383 FT /product="BEL-27_AA-I_1p" FT /translation="MEDLVKKRNVAFERLRRIHASVKQLAEREPPAFEVHD FT RLRKLADMEESFDRIQAEIEEEVPAEELSSVLSVRSDYEQLFYITKGMITK FT LLQIPDDHSIGANSEKTVVEPKSELKEAVRVLLETQRTLLSSQATASSNME FT ELAEQLRQQKTEPLDTQLPSYNLPVFRGDRKQWASFRDLFVSSVNNKNLTS FT ALKLQILMSHLEGDAKSLVSSYSITDANYTQVWDTLVEHFDKPKFTVAALV FT QEFCDQPAVKGSNLASLRKLVSTSDEVIRQLSALGPNFETRDPWLNYIVLK FT KLDEGLRSQWSQHIVDNDDPTFDDLLKFLKRKCEALETCAAFGGKPLDYGK FT KDFQRDDRKQNVIKKEIKSFNAVQQQSCPICAASHRIYECTVFKEATINER FT RERVQQAKLCFNCLRPNHCVKSCPSKSVCRTPNCQQHHHTMLCRASCESPT FT IAPQEKQVLSSPAQKPDSAAVEIKPALSCTTNVNNKSSSVVIGLLPTALVR FT VKGNNGQWNEVRVMIDSGSQASLITEHCVTSIGLQRSNANLIVTGIASCSS FT ETTRGAVELEISSRFCYNPVVRVKAYVLSKFPRIVPNQRLDRERLKCLEPL FT QLADPDFDKPGQVDLILGADVFLAILEDGKVKDETGLPVAINSSFGWIVAG FT QVFDASEVNCNTAIISLSMDMDIDKALQKFWEVEEVNRPKPLTQEEQQAVD FT FFNSTHQRDEHGRFNVRLPFVEGKPPLGESLPPAVQRLKAMERRFARDPDF FT KQLYGDFMAEYLSLGHMERVPDDEVNIEPEKRFYLPHHGVMRQDSVTTKLR FT VVFDGSCQSSSGISLNEKLLIGPKVTEDLPIVLTRFRSYAFAFMADAEKMY FT RQVRIHRGDVDYQRIVWRTDPDKPIEHYRLLTVTYGTSCAPFLAIESLRQA FT ARDSCAQYPIASGRVQINFYVDDFLSGAATLEGALTLKKDIVRITSEAGFH FT LRKWSANDPRLLEGNASAADTSVPVHLHPDADSVKALGIHWYPATDSFGYR FT LNVNIDKPNTKRQLLSDSARLFDPLGWISPIIVRIKILYQTLWLQDLQWDD FT PLPAAVNQEWNKIKQSLAAIEDIRIPRWIANHHGALQLHGIADASEAAYAA FT VVYARTVDDEGNVCISLVASKTKVAPIQQVSLPRLELNAAVLLVELMKQIL FT EAMDNLDVTCYAWTDSTVVLGWLTSHPKKWKTFVANRTSAILDFLPRSSWN FT HISSGENPADCASRGLLPTELVDHELWWTGPSLLHDHEDIWKVRAEAVQYE FT EDTTEQRKVAAAASVAVKPVSDYEIEECLLKRNTSLKFSQREVAWINRFKV FT NLLAKLRGTQKISGPLTTSEVHAANLQLARFAQHEVFGKDMELLRKGQPLS FT PKSQIKSLFPFIDGDGTLRVGGRLQNSDQPFHMKHPVVLPKSHRYTYLLIS FT MLHHQNLHAGPTLLIATLNQHYWVIGCQSVVRSVVNSCVRCTRWKAKTSNQ FT LMGSLPAVRTVSSRAFEIVGVDYAGPVTIKASLLRSTKTVKGYIAVFVCLA FT TKAVHLEAATDLSANTFISALKRFCARRGLPNQLWSDNGTNFVGADRQLKE FT LLASAAFNSEVDQYLNSLGIQWNFITPSAPHMGGLWEGAVKSMKKHLRVVL FT GTTVLNFEELTTVLTQIEACLNSRPLCTLSSSVDSCEALTPGHFIIGQPLN FT LVPEPDASGIPMNRLDRYRLLRKITEDFWERFRTEYIATLQPRTKWQNIES FT NLKDGNLVLVKNENTPPAYWELARVVATHPDRSGIVRNVSLKRGQTIYQRP FT IHKLVILPTD" XX SQ Sequence 5600 BP; 1535 A; 1324 C; 1429 G; 1312 T; 0 other; tattggtcct tcgccaccgg atcggcgtga atcggcgaca atggaggatc tcgtcaagaa 60 gagaaacgtg gccttcgaga ggctgagaag aatacatgcg agtgtgaagc agttggcgga 120 gcgagaaccg ccggcgttcg aagttcacga tcggctccgg aagctagccg acatggagga 180 aagtttcgac aggattcagg ctgaaatcga ggaagaggtt ccggcggagg aactttcatc 240 ggtactcagc gtacgttcgg actacgaaca actcttctac ataacgaagg ggatgatcac 300 gaagttgctg caaatcccgg acgaccattc aatcggtgca aacagtgaaa aaacagtggt 360 ggaacccaag agcgagctaa aggaggcggt ccgtgtactg ctcgaaacgc aacgaaccct 420 cctttccagc caggcgacag catcgagtaa catggaggaa ttggcagagc agcttcgtca 480 acagaaaaca gaacccttgg atacgcaatt gccatcgtac aacttgcccg tattccgagg 540 tgacaggaag cagtgggctt cttttagaga tctgttcgtc agcagtgtca acaacaagaa 600 cctaaccagt gctctaaagc tgcaaatact gatgtcccac ctggaaggag acgctaaaag 660 cctggtaagc agttactcta tcaccgatgc aaactatacg caggtttggg atacactcgt 720 tgagcatttt gacaaaccaa agttcacggt tgccgccttg gttcaagagt tctgcgacca 780 accagcggta aagggttcga atctcgccag tttgcgtaaa ctagtgtcca catcggatga 840 agtgattcgt caactcagcg ctttgggtcc aaacttcgaa acgagagatc cttggctgaa 900 ctacatcgtt ttgaaaaagt tggatgaagg tctacgatca cagtggtcac agcacattgt 960 ggataacgac gatccgacgt tcgacgatct gctcaagttc ttaaaaagga agtgtgaagc 1020 attggagact tgtgcagcgt tcggaggaaa acccttggat tacggtaaga aggattttca 1080 acgagatgat cggaagcaaa acgtcatcaa aaaggaaatc aagtcattca acgcagtaca 1140 acagcagtcg tgtccgattt gtgctgccag tcacaggatt tatgagtgca ccgttttcaa 1200 ggaggctacc atcaacgaaa ggcgagaaag agtgcaacaa gccaagctgt gtttcaattg 1260 tttgagaccg aatcattgcg tcaaatcctg cccgtcaaaa tcggtgtgcc gaacgccaaa 1320 ctgtcaacaa catcaccata cgatgctgtg cagagcaagt tgtgagtcgc cgacaatcgc 1380 accgcaggaa aaacaagttt tgtcttcgcc ggcacagaag ccggattctg cagcagttga 1440 aatcaagcca gctctgtcat gcacaacgaa tgtcaacaac aagagttctt cggttgtcat 1500 tgggctgcta ccgacggctt tggtgcgagt gaagggaaat aacggccagt ggaacgaagt 1560 gagagtgatg attgatagtg gttcccaagc ctccttaatc acggaacact gcgtgaccag 1620 catcggactg caacgaagca acgccaatct gattgtcacc ggcatcgcaa gttgttcgtc 1680 ggaaaccacc agaggagccg ttgagttaga aatttcttct cgtttttgct acaatccagt 1740 ggtcagagta aaagcatatg tgctctccaa atttcctcga attgttccaa accaacgatt 1800 ggatcgggaa cgcctgaagt gtttggaacc gctacagctg gcggaccctg atttcgataa 1860 gcccggtcaa gtggatttga ttcttggagc tgatgttttc ttggctatat tggaggacgg 1920 caaggtgaag gatgaaactg gacttcctgt ggcgatcaac tcttcttttg gatggatcgt 1980 cgccggtcaa gttttcgacg ctagcgaagt gaattgcaac acagccatca tcagtctcag 2040 catggatatg gatatcgaca aggcgctcca aaaattttgg gaagtagaag aggtcaaccg 2100 accgaaacct ttgacacagg aggagcagca agctgtggat ttcttcaact caacgcatca 2160 acgagatgaa cacgggcgtt tcaacgtacg tctaccgttc gtcgaaggca agcctcctct 2220 gggcgaatca ctgccacctg cagtgcaacg attgaaggca atggaacgaa gattcgcaag 2280 agatcctgat ttcaagcaat tgtatgggga cttcatggct gagtatttga gtcttggcca 2340 catggagcgg gttccagacg acgaggtgaa cattgaaccg gagaagcggt tctatcttcc 2400 acatcacgga gtcatgcggc aggatagtgt cacaaccaaa ttgagggtcg tgtttgacgg 2460 ctcttgccaa tcgtcgagtg gaatttcatt aaatgaaaaa ctattgatcg gaccgaaggt 2520 caccgaggac ctgccaatcg tactcacccg tttccgcagt tacgccttcg ccttcatggc 2580 ggatgcggaa aaaatgtacc gacaggtgag gattcatcgg ggcgacgtcg actaccagcg 2640 tattgtatgg agaacggacc cggacaagcc tatcgaacat tatcggttgc tgacggtaac 2700 ctacgggaca tcttgcgcac ccttcctggc catcgaatcg ctacgacaag cagctcggga 2760 ttcgtgtgcg cagtatccaa ttgcttccgg acgcgtacag ataaactttt atgtggatga 2820 ttttttatct ggcgctgcga cattggaagg agcacttacg ttgaagaagg acatcgttcg 2880 gattacatct gaggccggtt tccatctccg gaaatggtca gcaaatgatc cacgactgtt 2940 ggagggaaac gcaagcgctg ccgatacatc agttcccgtt catcttcatc cagatgcgga 3000 ttcggtaaaa gcactgggta tacattggta cccagccaca gattcgtttg gatatcggtt 3060 gaatgtcaac atcgacaagc ccaacacgaa gcgtcaactt ctatccgatt cggcccggct 3120 cttcgatccg ttgggttgga tttcaccaat aatagtgcgc atcaagattc tgtaccaaac 3180 attgtggttg caggacctgc aatgggacga cccactccca gcagctgtga atcaggagtg 3240 gaacaaaatc aagcaaagtt tggcagcaat cgaggatatc cgtattccaa ggtggatcgc 3300 gaatcatcac ggagcattac aactgcatgg aattgctgac gcgtctgaag cagcttatgc 3360 agcagtcgtg tacgctagaa ccgtggacga tgaaggtaat gtgtgcattt cgctggttgc 3420 aagcaaaacc aaagttgccc ctattcagca agtttcgctt ccccgtctcg agttgaacgc 3480 cgccgttctt ctagtggagc ttatgaaaca gattctcgag gctatggaca acttggatgt 3540 tacatgctat gcatggacag attctaccgt tgtgcttgga tggctgacat cgcacccgaa 3600 gaaatggaag acgtttgtcg ccaatcggac gtcggcgata ctggatttct tgccacgaag 3660 ctcttggaac cacatttcat cgggcgagaa tccagccgat tgtgcatcta gaggtttgct 3720 gccaacggag ttggttgacc acgaattgtg gtggactggt ccatcgcttt tacacgacca 3780 tgaagatatc tggaaggtaa gagcagaagc agtacaatat gaagaagaca ccacggagca 3840 gagaaaagtt gctgctgctg cttcggtagc tgtcaagccc gtatcggact acgaaatcga 3900 agaatgtctc ctgaaacgaa acacgtcgtt gaagttttcc caacgagaag tggcgtggat 3960 caacagattc aaggtcaact tactagcaaa gctgcgagga acccaaaaga tttctggacc 4020 actgacgaca tcagaagttc acgctgccaa tctacaattg gcacgttttg ctcaacacga 4080 ggtgttcgga aaggacatgg aactgcttcg aaaagggcaa ccactttcac cgaagagcca 4140 aatcaaatcg cttttcccgt ttatcgatgg agatggaaca ttgcgtgttg gaggtaggtt 4200 gcagaactcc gatcagcctt ttcacatgaa gcatccggtt gtacttccca agtcacatcg 4260 ctatacgtat ctgttgattt caatgttgca ccatcaaaat ctgcacgctg gtccaacttt 4320 gcttatcgcg acgctgaacc aacactactg ggtaattggc tgccaatccg tggtgcgatc 4380 ggtggtgaat agctgtgtcc gctgtacacg ttggaaagcc aaaacttcga atcagctgat 4440 ggggagtttg cctgcagtgc gaaccgttag tagccgagct tttgaaatcg ttggcgttga 4500 ctatgctgga cctgtcacaa ttaaagccag cttactgcga tccacaaaaa ccgtaaaagg 4560 ctacattgca gtttttgtgt gtctagcaac aaaagctgtc catttggagg cggcaacgga 4620 tctgtcagca aacacattca tctcagcttt gaaacgcttc tgtgcccgcc gtggtttgcc 4680 caatcaattg tggtctgaca acggcacaaa ttttgttggc gccgaccgcc aactcaagga 4740 gcttctagca tcagctgcgt tcaactccga agtggatcag tacctaaaca gtttgggaat 4800 acaatggaac tttattacgc catccgcccc ccacatgggc ggtctttggg aaggcgcggt 4860 aaagagcatg aagaaacatc tacgcgtggt gctaggaacc accgtgttga acttcgaaga 4920 actcacgact gttttaacgc aaattgaagc atgcttgaat tcacgaccgt tgtgcacact 4980 atcgagctct gtggattcct gcgaagcttt gacccccgga cacttcatca tcggccagcc 5040 gttgaatcta gtgccagaac cggatgcatc cggtatacct atgaatcgcc tcgatagata 5100 tcgcctgcta cggaagatca ccgaagattt ttgggaacgc ttcaggacgg aatacatcgc 5160 cacgcttcaa ccccggacaa aatggcaaaa catcgagagc aatcttaaag acggaaacct 5220 tgttctggtg aagaatgaga acacaccacc ggcatactgg gaacttgcac gagtggtagc 5280 tacgcatccc gatcgaagcg gaattgttcg gaacgtgagc cttaagcgag gccaaaccat 5340 ctaccaacga ccaatacaca agttggtgat tcttcctact gattgacgca tgcgcctcaa 5400 ggccgggagg atgttcggga ttatcgttgc accagagttc aaccctgtca ggaaaagttt 5460 caagaaaatt ggagcaagag agaaaatatc aagagcagga actttacctg tctccctaac 5520 atgtgtactc tttgacaatg tcagttttca ttattctttc gtctgtaata aagcgagtta 5580 gagcagacat tttgttcttg 5600 // ID DNA8-52_AP repbase; DNA; INV; 827 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-52_AP. XX NM DNA8-52_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-827 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1983-1983 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 827 BP; 332 A; 118 C; 95 G; 282 T; 0 other; gggctcggat tttgtagcaa aataaatcct taaaaaagca ttttaagtag ccaaaaaaag 60 catttattaa tttaattcta aaaaagcaaa tttcaattac acataaaatt aaatattaaa 120 aaaggaattt acaaccatat atttagtcaa gttactttca ttaaaactca tattaaacat 180 ataaaacgtt atatttttta tccgaattta ttttgacttc acacaattta caaaatataa 240 tatttccgtc agtcataagc acattttcac cgaattctct tacgattttt tttaatttta 300 aatttaaatt tttaactttt ggcattttga cgatgagaga aattatttag aacactatac 360 gtttatacct cactgattgt cgattgatat ttaccggcca actaaagtat tagaacgtgc 420 tataaaaata cgtaggtata attcagataa ttttcccttt ccttgagtcg taaatacaac 480 tatagataat tgtaaataaa actatttcgg cacaataacg acagttattt tatcagtcat 540 atatttatta ctggtttttt cactggtttt ataatttctg gtaataataa tatagtccaa 600 gacgtataat atacatagag ataagctaag ttttcaagaa aaatatgtaa aaatcaagaa 660 aaaataacca aataggggga aaagcaataa aaagcagatt ttgtggaaaa aagcaaaaaa 720 aagtttactt cattcaaatc agaacgattc aaaacgtatt tatatataag aattatcttt 780 ctatcacaat tccctcaaaa gaagcatttg ctacaaaatc cgagccc 827 // ID MSAT-7_CQ repbase; DNA; INV; 184 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A satellite repetitive sequence family from Culex DE quinquefasciatus - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-184 RA Kojima K.K. and Jurka J.; RT "Satellite sequences from the southern house mosquito."; RL Repbase Reports 11(1), 619-619 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >93% CC identity. XX SQ Sequence 184 BP; 67 A; 29 C; 23 G; 65 T; 0 other; acatttatta agaaattcca acaaattcct taagaaaatt gcaattttaa taaaatttca 60 aataaaagca gccctgctga caatctgtgg ctcattcaat gattggagaa tgttgttctg 120 atgtctaaga atgatattca gtcattctac caaattctac ctcattgtaa aattgttttt 180 aaat 184 // ID Gypsy-5_AC-I repbase; DNA; INV; 3805 BP. XX AC AASC02053826; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_AC_; KW Gypsy-5_AC-LTR; Gypsy-5_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3805 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02053826; Positions 18232 14428. XX CC Positions [2855-3220] - Integrase core CC 'AGTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 728..3697 FT /product="Gypsy-5_AC-I_1p" FT /translation="MQSEGKRCLSEETGKLASGGCCAGSYPFEMRAQSLPI FT IQVGVKGAKVKALVDTGCMVTVVSSQIVGNCVGENYMVAFDGSRVQCQGKK FT ELIIDVRGVSVSVNAIVSDSILSDIDVVLGMDAINNLGGVSVNHGTVEFGV FT KWCGAAEMETESDPPDKAKGNVKRAESAVYEHREVEVIDDSDFKAVFDGNT FT WTVEWKWKRDPPHLKNKIGCYGKQMDMKTTEGFEKEIQSWIEEGILLPWNE FT KVEIGIIPLMAVVQPTKNKVRPVLDYRELNKCISSHTGGDFTDVCGETLRE FT WRQTTDAATIVDLKGAYLQIRVSQDIWKYQLVNFKGKTYCLTRLGFGLTSA FT PRIMTKILKTVLNKDEKIDSATQSYIDDILLDESKVTSAALVKHLANYGLT FT TKEPEPLDGGAALGLKLKSDSTGDLIFSRANRIPHAVSKVTKRELFSICGK FT LTGHYPVAGWLRIACSYIKRRANGTNWEEIVNDEVEKMLKEMLSRLTNEDP FT VKGKWYVPKATKGTVWCDASSIGYGVLLEVNGNAVEDAAWLRKPSDFDHIN FT VAELEAVMKGVNMAIKWNISEIEVITDSATVAGWIKTVLSEERRVHTKGAA FT ELIVKRRLGILKSLINEFSLVLSVTLVPTRKNKADILTRVKKEWLSTGGYG FT MEEGEPEPCVCATGHLDLKDLHDMHHMGVDRTLYLARKVLPETSRADVKKV FT VQACEQCQSIDPAPVVHEAGNLDVATDWTRLAIDITHHHQVPYLSIVDCGP FT GRFAIWKKLRRETAGFVTEVVEELMLERGPVREILMDNGTVFRSDCFWQML FT QKWNIKPLFRAAYRPTGNGIVERHHRTVKSIAERGNIPPQEAVFWYNSTPR FT AGLDPASVPQRAVYRYDWRNHIVVNEEEDGPGRFEVGDDVWVKPPQSKCTT FT KWNRGTVTGVNSINNVAVDGMARHILDVRRVVRDVTSEEEDDEGGGGEQQT FT LQTLTRNEGAGEPPPRRYPDPVRKTPGWLVDYETQ" XX SQ Sequence 3805 BP; 1120 A; 676 C; 1085 G; 924 T; 0 other; agctatttcg gcgaggtgct cattctagag ccagatatta agtgctagtg atggcatcga 60 aaataacgac ggacatgatc aagcctttca gtggcaatag cgatgttgtg gcatggttgg 120 ctaaggttaa actcgtcgct ggtttgaaga acattcagga tcttgcgaaa tttgtaccgt 180 tgttcctgga gggtgatgca ttggcgttgt acttacagct tagtgagctg gaaagatcac 240 aatatgagac aattgagaag cgtcttctag aggcattcac agacagctca tttgttgcgt 300 tcagtaagtt gacacagagg aggtgggacg gtgagcatgt tgatgtattc gcgaacgaac 360 taaggagatt gggaggattg gctggatttt caggcagcga actggatagg ctggttagat 420 tgtcctttgt aaatggattc cctgacccga tatcgtgtga gctccaacag atgagccggg 480 tcttcgaagt tgaactgtca gaggtcatag caagagctag aatccttaca gcaagtaaga 540 ctggaatggt cagcgttgcg gcggcaggcg gagccatggc aaacgaaatg ggaagccaca 600 gtcaacgtgt gagggcattt aaaggttctt gttatcgatg tggtggacca catatggcta 660 agttttgtag tcatcgtgct gacattgttt gctaccactg caatgaagtt gggcacatag 720 ccagtcgatg cagtcagagg ggaaacggtg cctatcagaa gaaacaggaa aactagcttc 780 gggtggctgc tgcgccggca gctacccgtt tgaaatgagg gcgcagtctt taccgataat 840 tcaagttggc gtgaagggag ctaaggtcaa agcattggta gatacaggtt gtatggtcac 900 agtggtcagt tcacagatag ttggaaattg cgtgggcgag aactatatgg tagcttttga 960 tggttcacga gtgcagtgtc aggggaagaa agagctaatc attgacgtgc gaggtgtctc 1020 ggtctccgta aatgcaattg tctcagattc tattctgtct gatattgatg tggtactggg 1080 aatggatgct ataaacaact tgggaggagt ttcagtcaac catggtacag ttgagtttgg 1140 tgtcaaatgg tgtggggcag cggaaatgga aactgaatct gatcccccag acaaggcaaa 1200 ggggaacgtg aagcgagcag agagcgctgt ttacgaacac agggaggttg aggttattga 1260 cgacagtgat tttaaggcag tttttgatgg caatacatgg acagtagagt ggaaatggaa 1320 aagggatcct ccccacctga agaataagat tggatgttat ggtaaacaga tggatatgaa 1380 gactacagaa gggtttgaga aagagataca aagttggata gaggaaggaa tacttttacc 1440 atggaatgag aaagtagaaa taggcataat tccgctgatg gctgttgttc aacctacgaa 1500 gaacaaggta agacctgtcc ttgactacag agagctcaat aaatgtataa gtagtcacac 1560 ggggggtgac ttcacggatg tatgcggcga gacattgcga gagtggagac agacgacaga 1620 tgcagcaacg attgtcgacc tgaaaggtgc ttaccttcaa attcgggtgt cacaagatat 1680 atggaaatat caactagtga atttcaaagg taaaacatat tgcctgacaa gacttggatt 1740 tggtcttaca tcggcacccc gcatcatgac gaagatcttg aaaacggtat tgaataagga 1800 tgagaagata gactctgcca cgcagtccta cattgatgac attctactgg atgagagcaa 1860 ggtaacatct gctgcactgg tgaagcacct ggccaactat gggttgacaa cgaaagaacc 1920 agagccattg gatggtggcg ccgcattagg tttgaagttg aaatctgatt ccacaggaga 1980 tctcatattt tccagagcca acaggattcc ccatgcagtg tctaaagtca ccaaacggga 2040 gttgttctcg atctgcggaa aacttacagg tcattatccg gttgctggat ggttaaggat 2100 agcatgcagc tacataaaga gaagggcaaa tggtaccaat tgggaggaga ttgtaaacga 2160 tgaagtggag aaaatgttga aggaaatgtt gagccgtttg actaatgagg atcccgtcaa 2220 aggcaaatgg tacgtgccta aagcaaccaa aggaaccgta tggtgtgatg ctagcagtat 2280 cggatatggt gtgttacttg aggtcaatgg caacgcggta gaggatgctg catggcttcg 2340 gaaaccttca gattttgatc atatcaacgt agcagagctg gaggctgtga tgaaaggtgt 2400 aaacatggcc atcaagtgga acatctctga aatcgaagtt atcactgact cagcaacggt 2460 ggctggttgg atcaaaactg ttctatcgga agagaggaga gtgcacacca aaggggcagc 2520 ggagttgatt gtgaagagac gattagggat attgaaaagc cttattaatg agtttagtct 2580 ggttttgtcg gtcaccttgg ttccgaccag aaagaacaaa gcagacattc tgacaagagt 2640 gaaaaaggag tggttaagca ctggcgggta tggaatggag gaaggagaac cagaaccatg 2700 cgtttgtgct actggccatc ttgacttgaa agacttgcac gatatgcacc atatgggggt 2760 agaccgtact ttgtatttgg ctcggaaggt gctaccagaa acgagccgtg ctgatgttaa 2820 aaaagtcgta caagcttgtg agcaatgtca gtctatcgat cctgcaccag ttgtccatga 2880 ggcgggcaac ctagatgtcg ctacagactg gacaagattg gcaatagaca taactcacca 2940 tcatcaagtg ccatacctgt caattgttga ttgtggccca gggcgattcg caatttggaa 3000 gaaactgagg agagagacag cagggtttgt gacagaagta gttgaagaac tcatgttaga 3060 gagaggacca gtgcgcgaga tcttaatgga taatggcaca gtgtttcgtt cagattgttt 3120 ttggcagatg cttcagaaat ggaatatcaa accattgttc agagctgcat ataggcctac 3180 ggggaacggg attgttgaga gacaccatcg tacagtgaag tccattgcag aaagaggaaa 3240 cattcccccc caagaagctg tgttttggta caactcaaca ccaagggcag gtttggaccc 3300 agcatcagtg ccacagagag ctgtctaccg ttatgattgg agaaaccata tcgttgttaa 3360 tgaagaagaa gacgggcctg gaagatttga agttggagat gatgtttggg ttaaaccacc 3420 ccagagcaaa tgtacgacta agtggaatag aggtacggtt acaggagtaa actcgatcaa 3480 caatgttgct gtcgatggta tggctcgtca cattctcgat gttagaagag tggtacgaga 3540 tgtaacttca gaggaagagg atgatgaggg aggtggcggt gaacagcaaa ccttgcaaac 3600 cctgacgaga aacgaaggtg caggtgaacc ccctcccagg cggtatcctg atccagtcag 3660 gaaaaccccc ggttggttgg tggactatga aactcagtaa atgaagtaat caagtgtaca 3720 aagtgtagat agcgttgaac catgtcctgt tcgcacgatt gaagaactaa attttgtttt 3780 gtttgatctt gagatcacgg gggcg 3805 // ID BEL-1_TCa-LTR repbase; DNA; INV; 203 BP. XX AC singleUn_1341; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_TCa_; KW BEL-1_TCa-I; BEL-1_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-203 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; singleUn_1341; Positions 4646 4444. XX SQ Sequence 203 BP; 65 A; 40 C; 41 G; 57 T; 0 other; tgtcgggtat tctcggttat tcccgagtgg aaccgaaatt aaaaataaag ccgtcttaag 60 tcagaacgct acgatcagta tgacgaagtt gttgttttat ttattttaac ctacaattga 120 accagcttcc aattttcgaa gaaataaggg cagattgcaa acaggaagca tctacacctg 180 agcaatcgcc tgaggtttca aca 203 // ID hATm-2_HM repbase; DNA; INV; 3803 BP. XX AC . XX DT 16-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3803 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 206-206 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(596..1168,1350..1667,1726..3447) FT /product="hATm-2_HM_1p" FT /translation="MGRTAKKKKMIGQNKKKRKNTERLRYIYLLKYPLREL FT PQRQLPSNGDILRYYQFILRESQVKYKPTSTNVGCKLKSKSKDLVCENGVC FT TDPDNSCLVSRIKKIWDNAGFGKKFVICGYNIKTKIIKLNLKYKKLIKLSS FT LNWNSSLDKQSKLEKDFLTESYQLFDISTKNFKKDILADRLRTDDAKLEDI FT KIFLSFYEDQKLHRKGYMETKVDEDYSRRVLDANRRKNKLDKLREKELERQ FT KNLKDINDVLLSHDSMTEEIQFSSEEEKSNEDSEISEEDDFLGNGRKRRYS FT IILITVEKYYKMIPRYNSGDRINLAVNVEKLIECTAIPATRFNVGVRPHLA FT ILSEVFKAGGIEINDIPLSRSTLHRKKFKYVELEGDSEWISISNHLKGRRL FT ILHFDTKLLEQITTGLNIIQKIERLAVSVSSPDDDTMEDHLLGVVQCPSSK FT GVDQAEQVHNLLEYYGCKEQIIGICCDTTASNTGGVNGAVQILTDILKLPI FT LWIMCRRHIYEVHVSHYMAALTGEKTKGPRRALYVKLKNKWPEICEKVNEV FT KNLCKLDWQRDDLKIGSVLRTVAEEAKTFLTNATTDIVFSRNDYGIVCKLA FT SFFLGVEVQDFKFHQPGACHEARFLADAIYILTLYMTKDISNILTEKEMLQ FT LKDASLFIAICYVPWFLKSFLGFLAPYNDLKAIQTAYALRKVSKNIGNALL FT LSMGRHTWYLTPQLCVFSFGDKEVPSETKLRMLLALIEFEEPNEFQRCKPS FT NVQIGETTELPELISEQSYLLFKHFKISRASIKEWLNNSLNTIDVFNHPQL FT LDFIAWIKNISVVNDGAERNIKLIKDFISATRDEDHLQNVLFVVKKSRKNL FT TKTMTKKELGNCSKSCIF" XX SQ Sequence 3803 BP; 1342 A; 543 C; 690 G; 1227 T; 1 other; ttagggtgtc ccaaaaacac acttttttta aaatttcaat gtgctctcaa ctgatcccca 60 ttctataggt cctaaatagc taccaaaatc tttwttttta actttttaat gactagaagt 120 gatagatgta aattctgagt tttacggttt tgtacacata tgtgtattta ctaaaaaaca 180 gcgctatcta gtatttcaat atgttttaat atgatgtaat aattagtaca caaataattc 240 ccataggata ttattaaaga ataggatcat tccctatgaa cattgttagc gattgatatt 300 tttccctctg gacaatatta ttagttggta taattcccca tggattggtt cacaaataac 360 tccatttaga aatttttttt gaaggaaaaa atttctaaat ggaattattt gtgcacgacg 420 gaattctaat ttgtgcacga agaaattctc gtgtaaacgg ttataaaagc cgtaacggta 480 aacttagata aaatattata cgagagggaa taacgacttc atagattttt gatgcaagtg 540 taacatttag taaaaattta gttaaaaaac tgattatttc gaaattcata taaagatggg 600 tagaacagct aaaaaaaaga agatgatagg acagaataag aaaaaacgga aaaatacaga 660 aaggttgagg tatatttacc tattgaagta cccactgagg gagttacctc agagacagct 720 tccatctaat ggtgatattc ttagatacta tcaatttatc cttcgcgagt cacaagttaa 780 atacaaaccc acttctacta atgttggttg caaattgaaa tcgaaatcca aggatcttgt 840 ttgtgaaaat ggtgtttgta cagatcctga taattcttgc ttggtatcta ggattaaaaa 900 aatttgggat aatgctgggt ttggcaagaa atttgtgatt tgtgggtaca atataaagac 960 caaaatcatt aaacttaatt taaaatacaa gaaattgatt aaattgtctt ctctaaactg 1020 gaacagcagt cttgataagc aatcaaagtt agagaaagat tttctcacag aatcctatca 1080 gctttttgac atctcgacta agaattttaa aaaagatatt cttgctgata gattgagaac 1140 agatgatgcc aagctggaag atataaagta agttcattaa ataacacttt ttttagtagt 1200 gagatttttt ttccaattaa aagtaactta agtaatagca tattatttcc tttaaaatat 1260 ttacgtttaa ctctttacat ggtttggtca tcttccaaga aaaactagaa caattattaa 1320 tttatttata aatggcttaa tagatttaaa tatttcttag tttttatgaa gaccagaagt 1380 tacataggaa aggatatatg gaaacaaaag tcgatgaaga ctacagtaga agagtactgg 1440 atgcaaacag gagaaagaac aagctagata agttgagaga gaaagaatta gagagacaga 1500 aaaacttaaa agatatcaat gatgttttgt tgtcacatga ctccatgaca gaagagattc 1560 aattcagcag tgaggaagaa aagtcaaacg aggactctga aatcagcgag gaagatgatt 1620 tcctgggtaa cggtcggaaa cgcaggtata gtattatttt aataacatag ttgtaatttt 1680 cacaagctaa agctgtttta ttgaaaagta tataattata aataggtaga aaaatattat 1740 aaaatgatac ctaggtacaa ttctggagat aggataaatc tagcagtgaa tgttgaaaag 1800 ctcattgaat gtactgctat tccagcaaca aggttcaatg ttggtgtacg acctcatctg 1860 gcaattttaa gtgaagtatt taaagctgga ggtatagaaa ttaatgatat accgctttca 1920 cgaagcacac ttcatagaaa aaagttcaaa tatgtagaac ttgaaggaga ctcggagtgg 1980 atttcaatat ctaatcatct taaaggccga cggcttattc ttcattttga taccaaactg 2040 ttggaacaaa ttaccactgg gcttaacatt attcaaaaaa ttgagcgact tgcagtctca 2100 gtgagttccc cggatgacga taccatggaa gaccatcttt taggtgttgt tcaatgtccc 2160 tcttcaaagg gagttgacca agcagaacaa gtgcacaatc ttttagaata ttacggatgt 2220 aaagagcaga tcattggtat atgctgtgat acaactgcaa gcaatacagg cggagtaaat 2280 ggagcagtcc aaattcttac agatattttg aaattaccaa ttttgtggat aatgtgtagg 2340 agacacatat atgaagtaca tgtatctcac tatatggctg cccttactgg tgagaaaact 2400 aaaggtccaa gaagagctct ttatgtcaag ctaaagaaca agtggcctga aatctgtgag 2460 aaagtgaatg aagtgaaaaa tttatgtaaa ttggactggc aaagagatga tctgaagatt 2520 ggctctgttc ttcgcactgt tgcagaggaa gctaagacat ttttgactaa tgcaactact 2580 gatatagtgt tttcaaggaa tgattatggt atagtttgta agcttgcctc cttcttcctt 2640 ggagtggaag tacaagattt taagttccat caacctggcg cctgccatga ggcaaggttt 2700 ttagcagatg cgatctacat cctgactctt tatatgacta aagatatcag taatatcttg 2760 acagaaaaag aaatgctgca gttgaaggat gcaagtttgt tcattgcaat ttgctatgtt 2820 ccatggtttt taaaaagttt tttaggattt ctggctccct acaatgattt gaaagctatc 2880 cagacagcct atgcattgag aaaagttagt aagaatattg gaaatgcttt gcttctcagc 2940 atgggacgcc atacgtggta cttaactccg caactctgtg ttttttcctt tggggataaa 3000 gaagttcctt ctgaaacaaa gctaagaatg ctgttagctc tgattgagtt tgaagagcca 3060 aatgaatttc agcgttgtaa accatcaaat gtacaaattg gagagaccac agagctacct 3120 gagttgattt ccgagcaaag ctaccttctc ttcaaacatt ttaaaatatc aagagcaagt 3180 ataaaagaat ggcttaataa tagtcttaat actatagatg tttttaacca tcctcagttg 3240 ctagatttta ttgcatggat caaaaacata tctgttgtaa atgatggtgc agaaagaaac 3300 attaagctca ttaaagattt tatttcagct actagagatg aagatcacct tcaaaatgta 3360 ctttttgtag ttaaaaagtc tagaaaaaac ttgactaaga ctatgactaa aaaggagctt 3420 gggaactgtt caaaaagttg tattttttga gaaatgtttc accgcaataa aatcttaaat 3480 ttgctaataa atattagttt tcatgtattt tttttttttg tattttcata tttataaata 3540 taatctatca agttaaatag tagaaaacaa ttggaaatga tttataccca tctaattgtt 3600 ttgttttaat tttctaaaat gtaaaaaaat tgtgttttct attaatgtta agctagctta 3660 acattaaaac agtaaaactc agattttaca tctagcactt cctgtcaaaa taaaaccgtc 3720 aaattttttt tagtacttat ttaggtcaca tagaatggga atcagtagag agcttttttt 3780 tttttttttt tgggacaccc taa 3803 // ID Gypsy-196_AA-I repbase; DNA; INV; 6757 BP. XX AC supercont1.70; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-196_AA_; KW Gypsy-196_AA-LTR; Gypsy-196_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6757 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.70; Positions 441068 434312. XX CC Positions [3479-3940] - Reverse transcriptase CC Positions [4964-5440] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 366..2711 FT /product="Gypsy-196_AA-I_2p" FT /translation="MADEIKFTAENLANHCLGIRVDFLDEDELDQELFMRN FT MFLTGEQTMSRRRRALREALKQEKEEGVSVKHLPNSVENELAICEAKLHRL FT EASSFTAHRIAPKCKSTLLHLGQRLLVLIQFSGGNYQDSARDLLLRVISHL FT NHFYGGEDSISSSDENQNREEVDLTGQDTRIDQEIEQEEAIGPLPPLNEAG FT SLSSPIISLGQSLDNSPQTRTVELELLNFLKRLGLSDNSVTDIGSIRRALN FT GFEVELKSLRKGHDASEEGSSQRTSNDSIHTVVPGVVPVSLGSDTQNFSNP FT PVISTLATITCTGTIPKSTMIQSNFQTTSLYTRPIPSVGVASSTVFYSNVS FT HLNLTPTTTLGWPASTTTSVSNSPYQTTEAQNGLTSSYFNERRTHNRLGSG FT TPWAIPPIPPQSCFAGFPSRTIPNTNHGWVTDVGPVPHTTMAYPLPQPSSN FT LVAYRSTPSHRTLPVSQWSIEKYSGTDQGTKLNEFLALIDQLSLSEKVSAD FT ELFDSAFHLFSGPAANWYMSMRSSGRLQNWNHLIHELRKAFVHPELDTLIR FT SRIYQRRQQRNETFQEYYFDMDKMFRSMICPMNDQEQLDILRRNLRADYKK FT ALLWKPVHDLTQLLEAGHLIDASNFSMYQKVFGTEKTVNFVSQKNPPPQNQ FT NSRQIRPNPNQPEWKNQKSKPTSAQLEKSNLPRENIQNKTPAPPKTSSSPN FT TQAGPSKPRRTLDYLVEGFKPPSPDVCLNCRQPDHQIDQCRSIKGLICFVC FT GFKGFDSQNCPYCRKNGLQTAESRRPSEKSA" FT CDS 2606..5818 FT /product="Gypsy-196_AA-I_1p" FT /translation="MFRLWFQGIRLSKLPILPKKRTADGRKSPAVREVRIS FT APIAVPCSEYWEPDNDAYRTENQSHVLQIILPNLDDNRPYAWVKIHNIRVK FT GLLDSGSNRTLISLKLYNYLRGANLRAPSEQIVLRSANGGQLKIVGEIYAP FT FNFDGKIRIIPTLVVEGLLVDCLLGMTFWSRFNIYPKVEECALTLEEETEE FT TPQPILTPQEIQRLEEVKAKFLVAVPNQVTTTHLTTHKIEIMDEWKTRPPI FT RQYPYTMSPKVREKVSDELDRMLGAGIIERCHSDWSLNVVPVIKPTGKVRL FT CLDARKINERTIKDAYPLPHPGRILGQLPRAKYLSTLDLSEAFLQIPLDRE FT SRRYTAFSIQGKGMFQFTRLPFGLVNSPATLARLMDQVLGHGELEPFVFVY FT LDDVVLVSETFQEHLRLLHEVARRLTEANLAINLEKSRFGVNELPFLGYLL FT STEGLRPNPEKVRAIIDYERPNTLTKLRRFLGMANYYRRFIDDFSGITTPL FT TDLLKSKSKILPWDDKAEDAFCLIKEKLISAPILVSPNFALPFSIQTDASD FT IAVAGVLTQVQEDHERVIAYFSHKLTTPERNYHACEKETLAALMSIEAFRG FT YIEGGHFTLITDSSALTHILRTKWKTSSRCSRWSLTLQQYNMTIVHRKGKD FT NVVPDALSRCIAVVAAQTPSHWYSDLKEKVEQNPDDYVDFRIDEGVLYKFV FT ATPNNPYDHRFEWKKVIAPEDRPSILRECHDDAMHQGFERTLARAKSRYYW FT PKMAKDLKEYVQNCTVCKQTKAACVPVIPPMGEQRVTTHPWQIIATDYIGP FT LPRSKKGNQHILVTLDLFSKWVMLSPMRKIDSGSLCRILRDQWFARYSTPE FT VIISDNASVFLSREFKTLLERFHIKHWLNTKYHSQANPVERVNRTINAAIR FT TYVRQDQREWDSRLSEVEIILNTSKHSSTAMTPFFIVHGHEAFIKGTDHKW FT FGSEIDPPSEEQEEFRRKLFDRIYDLVKTNLQKTHENVKARYNLRHRQYSK FT AFQIGQLVYYRNMKQSNAAEAYNAKYGPMYLPAKIKARLGSSSYEIEDLNG FT KNLGNWSAAQLKPG" XX SQ Sequence 6757 BP; 2022 A; 1550 C; 1461 G; 1724 T; 0 other; attggcgatc ctccgtaaaa caaaacattc cagtcttcta ttggaatatt tttttcctac 60 ctcgggaaga ctcgaagaca atagtcactt gtctcgattc agttgtttgc tattgtagca 120 ccacatgacc ttggaaaatt ggatgactgt tagaacttgc tcatatatgt cgttggccgg 180 ttgagaatag aacaaaaaag ttcttcgggg aattattgtc tgaccaaagc ttcaagaagg 240 cagcgtaagg ggttgaagaa agggaattca tcgtcgttga ccgtgttacc acgttttcgg 300 ctccatcttg aatatcatcc catcagagat tagcaagaaa ccatacacgt gtagtggcag 360 taacaatggc cgacgaaatt aaattcacgg ctgaaaattt agccaatcac tgtttaggca 420 ttcgcgtaga ctttttagat gaagatgaac ttgatcagga gctgtttatg cgtaatatgt 480 ttctaactgg cgaacagacc atgagtcgac gtaggcgcgc tttacgtgag gcattaaaac 540 aagaaaaaga ggaaggtgtt tctgtgaaac atttacccaa ttccgttgaa aatgaactcg 600 ctatctgtga ggcgaagctc catagattag aagcttcatc atttacagca caccggattg 660 caccaaagtg caaatcgacg ttgctacact taggtcaacg gttgctagtg ttgatacagt 720 tctctggagg aaactaccag gacagtgctc gcgatctctt actaagagtt atttctcatc 780 tcaaccattt ttatggcgga gaagactcca tttcgtcctc cgatgaaaat cagaatagag 840 aggaagtcga tttgactggt caggatacgc gaattgatca ggagatagag caggaggaag 900 cgataggtcc cctcccacca ttaaacgaag cgggaagtct gtcatcacct ataataagtt 960 taggacagtc gttagacaac tctccacaga cgcgaacagt ggaacttgaa ttgcttaact 1020 ttctcaagcg cttaggatta tcagataact cggtaacaga tataggttcg attagaaggg 1080 cgttaaatgg attcgaggtc gagctgaaaa gcctacgtaa aggccatgat gcttcggagg 1140 agggatcttc ccagaggacc tctaatgact cgattcatac agttgtccca ggtgttgtgc 1200 cagtgtcgtt ggggagcgat acccagaatt tttcgaatcc accagttatt tcaacactgg 1260 caacaattac ctgcaccggc acgattccaa agtcgacaat gattcagtcg aattttcaaa 1320 cgactagcct gtacacacga ccaatcccct cagttggcgt tgcctcgtca acagtgtttt 1380 attcgaatgt atctcatttg aatttgacgc cgacgacaac gttaggctgg ccagctagta 1440 ccactacaag cgtttcaaat tcaccatacc aaaccacaga agcacaaaat ggtctcacat 1500 cgtcttattt caacgagaga agaactcaca accggttagg ttcaggaaca ccctgggcta 1560 tccctccaat accaccacaa tcgtgttttg ctggattccc atcgcgtacc atccctaata 1620 ccaatcacgg gtgggtcacc gatgtaggac cagtaccaca caccactatg gcttatccac 1680 taccgcagcc tagctcaaac ttggtagcct atcgatcgac cccttctcat cgaactctac 1740 ctgtctctca gtggagcatc gaaaaatatt cgggtacaga tcaaggcacg aagctgaacg 1800 aattcttagc tttaatcgat cagttgtcgc tctctgagaa agtctcggca gacgaactct 1860 tcgattcagc gttccatttg ttctcgggcc ctgcagctaa ttggtatatg tcgatgcgct 1920 ctagtggtcg gttacagaac tggaatcatc tgatacatga acttcggaaa gcatttgttc 1980 atccggagct cgatacgctg attcgttctc gtatctatca gagacgacaa caacgaaatg 2040 agacttttca agagtattat ttcgatatgg acaaaatgtt ccgctcaatg atttgtccga 2100 tgaatgacca agaacaacta gacattctaa ggcgtaatct tagggccgat tataaaaagg 2160 ctctgttatg gaagccggtc catgatctca ctcagcttct agaagcaggt catctgatcg 2220 atgcttcaaa tttctctatg taccaaaaag tcttcggtac cgagaaaacc gttaactttg 2280 tatctcaaaa gaatccaccc cctcaaaacc aaaattctcg tcaaattcgg ccaaatccaa 2340 atcaaccgga atggaaaaat caaaaatcaa aaccaacttc ggcacaactc gaaaagtcta 2400 acttacctcg tgaaaatatt cagaataaaa ccccggcacc tcccaaaaca tcgagttccc 2460 cgaacacgca ggcagggcct tcaaaacctc gcagaacgct ggactatctt gttgagggtt 2520 ttaaaccacc ctcacctgat gtctgtctaa attgtcgtca gcctgatcat caaatagatc 2580 agtgcagatc gattaagggg cttatatgtt tcgtttgtgg tttcaaggga ttcgactctc 2640 aaaattgccc atattgccga aaaaacggac tgcagacggc cgaaagtcgc cggccgtccg 2700 agaagtccgc ataagtgctc ccattgcagt accctgctcg gagtactggg aaccggacaa 2760 tgatgcgtac cgcaccgaaa atcagtccca cgtactccaa attatcctac ccaatttaga 2820 cgataaccgc ccatacgcat gggtgaaaat ccacaacatt cgcgtcaaag gtttattaga 2880 ctccggtagt aaccgaacgt taataagcct caagctttat aactacctga gaggagctaa 2940 tttacgagct cctagtgagc aaattgtctt acgatctgct aatggagggc agcttaaaat 3000 tgtaggtgaa atatatgccc ccttcaattt cgacggaaaa atcaggataa tccccacact 3060 cgttgtcgaa ggactgctgg ttgattgcct gctgggtatg accttttggt cccgtttcaa 3120 tatttatcct aaggttgaag agtgtgctct cactttagaa gaagaaacgg aagaaacgcc 3180 ccaacctata ctaacccctc aggaaattca acggctagag gaggtgaaag ccaaattttt 3240 ggtggcagta cccaatcaag ttacaaccac ccatctcacc actcataaaa ttgaaatcat 3300 ggatgaatgg aaaacgcggc ctcccatcag acagtatccg tacacgatgt ctcccaaagt 3360 gagggaaaaa gtatcggacg agctggaccg aatgttagga gcggggatta ttgagcgttg 3420 ccattcagac tggtcgctga acgtggtacc agtaatcaaa cctacgggca aagtaagact 3480 ctgtttggac gctcgaaaga taaatgagcg cacaataaaa gatgcctatc ctttacctca 3540 tccagggaga atactgggac aattgcctag ggcaaagtac ttaagtacct tggatttgtc 3600 cgaagcattt cttcaaattc cccttgatcg agaatcccgg cgctatactg catttagcat 3660 ccaaggaaaa ggaatgttcc aattcaccag gttgccgttc gggctggtaa acagtccagc 3720 aaccttggcc cggctgatgg atcaagtgct aggtcatggt gaattggaac cattcgtatt 3780 cgtttatctg gacgacgtcg tcctagtaag cgaaactttt caagagcatt tacggttgct 3840 tcatgaagtt gcccgtcgtt tgacagaagc caatctagcc atcaaccttg aaaagtcacg 3900 atttggtgta aatgagttgc cattcctggg atacctgctt tcgacagagg gtcttcgtcc 3960 taacccagaa aaagtccggg ctattattga ctatgaaaga cccaacaccc taacaaaatt 4020 acgaaggttc ctaggaatgg ccaactatta taggcgcttc attgatgact tcagcggaat 4080 aactactccg ttaacagatc ttctaaaatc taaaagcaaa attcttccat gggacgataa 4140 agccgaagac gccttttgtc ttataaaaga gaaactcatt agtgcaccca tattagtgag 4200 tccaaatttt gcgttgcctt tttccataca aactgatgcc agtgacatag ccgttgccgg 4260 cgtgctaacg caggttcagg aggatcatga gagagtcatt gcgtacttct cacacaagct 4320 gacaaccccc gaaaggaatt atcacgcttg tgaaaaagaa actttggctg ctctcatgtc 4380 cattgaagcc tttagagggt atatagaagg cggccacttt acccttataa cggattcatc 4440 tgccttaacc cacatactgc gaacgaagtg gaaaacttca tctcggtgta gtaggtggag 4500 cctgacccta cagcaataca acatgaccat cgtgcatcgt aaagggaagg acaatgttgt 4560 ccctgatgcg ttatcccgtt gcatcgctgt tgtggctgcg caaacaccct cacattggta 4620 ttcagaccta aaagagaagg tagagcagaa cccggatgac tacgtcgatt ttagaataga 4680 tgaaggggtg ctatacaaat ttgtcgccac cccaaataac ccatatgacc atagattcga 4740 gtggaagaag gtgatcgctc ctgaggatcg cccctccata ctccgggaat gtcacgatga 4800 cgccatgcac caagggttcg aaagaacctt agccagggca aaatcgcgct attattggcc 4860 aaaaatggcc aaagacctta aagaatacgt tcaaaactgc acagtttgca aacaaacaaa 4920 agcagcttgc gttccagtga ttcctcctat gggcgaacag agggtgacta cacacccatg 4980 gcaaatcatc gccaccgact acatcggtcc attaccaaga agcaagaagg gtaaccagca 5040 tatcctggta acccttgatc tcttcagcaa atgggtaatg ttgtccccta tgcgaaaaat 5100 agatagtgga tcactatgtc ggatccttag ggaccagtgg tttgcacgct actctactcc 5160 agaggtaatt atatccgaca atgcgtcggt atttttatca agagaattta aaactctctt 5220 agaacggttc cacatcaaac attggttgaa cacgaagtac cactctcagg ccaatccggt 5280 cgagagagtg aatagaacga tcaacgccgc catacgcacc tacgttcgac aagatcaacg 5340 agagtgggac agtcgtctct cggaagtcga aattattctg aacacgtcta agcattcttc 5400 cactgcaatg acaccgttct ttatcgtgca cggccatgaa gcctttatta aaggcactga 5460 tcacaagtgg ttcggttccg aaattgatcc tccctcggag gaacaagagg aattccgtag 5520 aaaactcttc gatcgaattt acgaccttgt aaaaacaaac ctccagaaaa cccatgaaaa 5580 cgttaaagct cgatacaatc tccgacatcg acaatactcc aaggcgtttc agatcggtca 5640 gttggtatac tatcgaaata tgaaacagtc taacgcggct gaggcataca acgcaaaata 5700 tggcccgatg tatctgcctg ccaagataaa ggcccgccta ggatcatctt cctacgaaat 5760 cgaggactta aacggcaaaa atttgggaaa ttggtcggct gcacaattaa aaccagggta 5820 gtccagctgg gcacctcaaa attgtttttt ttttcacagc caaaatttca atccagttag 5880 ccgcccaaat atcaagcaca aatccgacga aaacagttca ttccatctgg tagtcgctca 5940 tccttttcga gaaatcagca atccttcttt caggtccaat tccagctcga aaactaattc 6000 aattttctta aaccgaaaca agatctttta gtgaattaag aaaattttca cgaacgttct 6060 atcattgtct tgctgcttca atattatgaa ggtttgacgt ttgacagtaa acaaagtata 6120 gtcttttcca tgagcaatcc tgttcatgga tggcaggatt gatggttcac atctccgcaa 6180 agggggtgcg tttagtaaaa ctcttcttat gattaggttg tagaattata ctcccctaac 6240 acatgaaaca ataaaggctg gttgggagta tgcgttagta gaatggcaat cgtgccttca 6300 caggactgac aatttacaat atccacgtag acagtagaat agtttgtcac tgggttagag 6360 gtaatttaga ttaaaatagt cataaaactg gatgagtgtg agtgtgtgct tagtgtgatt 6420 ctgtaagggg gcgcgcccaa tatgaatgaa tgaggaatga atgttagaat gagtgttgaa 6480 aacgtaggtg atcttcagag aagagagaaa agtgtatacc acggtagggc gaggaatgta 6540 cgagtattct ccccgacaaa gctggactgg actagtagga aggatgacga ttctcagtct 6600 aggggcttaa ccactgtttt caactcggta tcggtttcat taacagtcca gtcatctata 6660 ctcaatggtc atgtcggtgc cgtgattaaa ggcacctgtt tgagataatt gatgaaaaat 6720 tgcacgaaca atttttcttc tcgcacgagg agataat 6757 // ID Copia-15_AA-I repbase; DNA; INV; 4123 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_AA_; KW Copia-15_AA-LTR; Copia-15_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4123 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 941-941 (2011). XX DR [1] (Consensus) XX CC Positions [1536-2024] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1650..4121 FT /product="Copia-15_AA-I_1p" FT /translation="MQSKSQVLECFKLYEGMVSAKFERKISRIRCDNGGEY FT RNEHFERFCRKKGIQMECTVPYTPEQNGVSERMNRTLVEKARAMLFGSNVD FT KRFWCEAVETAAYLVNRSPASALQDGKTPSELWEGRKPDVSGLRVWGSPSY FT CHVPKEHRKKLDSKAWKGIFLGYHCNGYRIWDPRWKKVVVLRDVIFDEAGT FT VDEPNEQGNLPDLVRIRRPPIDTELDGSEVEDVGDHQDDMDGTFDEFEDCE FT GATGGSDSAGASGRDTAETSGNHFNSENESRVADSNGHSVRKRAPPKWHDD FT YDMSCAYALSAMNFVESFPDTLDEMKKREDWPQWKTAVDEEMKSHQKNGTW FT TLCKLPEGKRAVSCKWVFRIKRGEDGGVDRYKARLVARGFSQRFGYDYTDT FT YAPVARLDTVRTVLAVANQERLEVHQMDVKTAFLNGRLREEIYMTLPESVE FT GRPDGVCRLNRALYGLKQASRAWNERFHELATKLGFRRSENDRCLYVRQEG FT ESKLFLLLYVDDILIIGRNLMDIVKVKKHLAAEFEMSDLGQVGNFLGMRIE FT RDIQNRVLRITQRTYLESLLRRFGMEECKPVATPMECRLKLKKGVETERTT FT QPYRELIGCLAYVAQSSRPDLCAAVNLLSQYQSCPTNEHWGYVKRILRYVK FT GTLDIGLEFRGGERVEALVAYSDADWGNDENDRRSISGSVLRVFGGTVMWL FT TRKQQTVALSSTEAEFVALCTAACEGIWLRRLLSDLGVTVEGAVTYYEDNQ FT SCIKVAEEPKDSRRLKHVDIKYNFIRELVQDGRIAISYIPTEKQLADIMTK FT GLPAGAFGRLREDLGLQRLSRG" XX SQ Sequence 4123 BP; 1148 A; 772 C; 1249 G; 954 T; 0 other; agaggttatt ggcccagagt tcgcgtgtgt cgggaagtat taatgagtgt attcgtagtc 60 cgtgtgtgat atttctaaga aaatggaacg ggaatcggaa cttgtgcgaa ttccgatgtt 120 tgatggcaca aattttccat cgtggaagtt ccgcatggtg atcgtgttag aggagcacga 180 tttgaaagaa tgtgtggaac aggaggtgga agaagtggag gcgctggcta tcaaagaaga 240 agacagcgct gccatcaaga agcagaagga agaagaggcg gtgaagcgga ggaagaaaga 300 ccgccggtgc agacgttcat tatttctcgc atttgtgaca gccagttaga gtatgtgcag 360 gatcagccaa caccaaagtc tatgtgggtg gctttgcaca aagtgttcga acgaaaaagt 420 atcgcgagtc ggttgcactt aaagaaaaag ttgctcaccc tgcgtcacga aagtggaacg 480 agtttgcagg agcacttttt gattttcgat cgtatcgttc gcgaatataa atcgacgggt 540 gcgatgatcg acgacatcga tgtagtgtgt catctcttgc taacgctggg accaaagttc 600 gcgacggtag ttacagcgtt agagacgatg ccggaggaca atttgacatt ggaatttgtc 660 aaatgccggc tactcgatga agagattaag tgtcggggaa aagatattgg tgcaagtgtc 720 tccaaacgtg agccagctgc tttcgtgggg aagccgaccg gaacgaagaa caagtggaag 780 tgtttcgtgt gccaaaaggt tggtcacaaa gcggcagagt gtccggagcg cgagaagaag 840 aaaagcgaca agaagaaaac aagtgcgaac gtaacggaag aaagcagtgg tggatttgtt 900 tgctgctgaa agtgactgcg gaagaagatc aagtgctagc agattccggt gcgtcagaac 960 acatgacgaa tgacaagagt gtattcgaaa cgttagtgca tggaagaacc gatcaagatc 1020 gcagttgcgt tgagtggaaa atccgctgtc gcgaagttcc gtggttcggt taaagtgatc 1080 gctgcgattg gaaacgagaa ccgcgagtgc acgttggaaa atgtactgta tgccccagac 1140 ctacggtgca atctattttc gatccgtaag gtggaaatgg cgggcatgac ggtagttttc 1200 aaggacggtg gcgtgaaagt tttcaaagac ggaaaagtgg ttgcttgtgg tcagcgtcgc 1260 ggagtgcagt atgaaatgga cttccgttcg aagaaagaag gtgtttcgtc gctgtactcg 1320 tgtgggaaga ttcagaagtg caatgaacta tggcaccgac gtttcggaca cttaagtgag 1380 aagaaccttg agaacatcat gaagaaaaag atggtgaaag gttttgagaa gaatacgagt 1440 gatgataaca gtgaagtgta ctgcgagtct tgcattgagg ccaagcaaac tcggaagccg 1500 ttttccgaaa gctcggagaa gagatcatct cgggttttgg aactgattca caccgatgtg 1560 tgtgggccgg taacgccggt ggctcacgat ggcagtcgat attacgtaag ttcatcgatg 1620 attggagcag attcaccata gtttaccgga tgcaatccaa gagccaggtg ctggaatgct 1680 tcaagctgta tgagggaatg gtttcagcaa aattcgagcg aaagatctcg cgcataagat 1740 gcgacaacgg cggtgagtac cgaaatgaac attttgagcg tttttgtcgt aagaaaggga 1800 ttcagatgga gtgtaccgtg ccgtatacgc cggaacagaa tggcgttagc gaaaggatga 1860 accggacgct ggtggaaaag gcacgggcga tgcttttcgg ttccaacgtc gacaagagat 1920 tctggtgcga ggccgtagaa acggccgctt acttggtaaa ccgtagtcca gcgagcgccc 1980 ttcaggatgg gaagactcct tctgagctgt gggaaggtcg caaacctgat gtgtccggat 2040 tgcgcgtctg gggtagtcct tcgtactgcc atgtaccgaa ggaacatcgt aaaaagttgg 2100 acagcaaggc ttggaaaggt atttttctcg gataccattg caatggctac cgtatctggg 2160 atccaaggtg gaaaaaggtg gtagtactac gcgatgtgat cttcgacgaa gctggtactg 2220 tagacgagcc aaacgaacaa gggaatctac cagacctggt gaggatcaga cgaccgccga 2280 tcgatactga gttggacggc agcgaggttg aagatgtggg agatcatcag gatgatatgg 2340 atggaacgtt cgatgagttt gaggactgtg aaggagcaac aggcggctca gattcagcag 2400 gggcatctgg tcgagataca gcagagacat caggaaatca tttcaattcg gagaatgaat 2460 caagagttgc tgattcgaat ggtcacagtg ttaggaagcg tgcaccaccg aaatggcatg 2520 atgattacga tatgagttgt gcttacgctc taagcgcaat gaattttgtt gaaagcttcc 2580 cggatacttt ggacgaaatg aagaagcgtg aagattggcc gcagtggaag actgcagtcg 2640 acgaggagat gaagtctcat cagaaaaatg gcacgtggac gttatgcaag ctgccggaag 2700 gaaagagagc cgtatcctgt aagtgggttt ttcgtatcaa gcgaggtgag gacggcggtg 2760 ttgaccgcta caaggctaga ttagtggcac gtggtttctc ccaaagattc ggatacgact 2820 atacagacac gtatgctccg gtagccagat tggatacagt tcgaacagtt cttgccgttg 2880 cgaatcagga acgtctcgaa gtccatcaga tggacgttaa aacggcattt ttaaatggga 2940 gacttcgtga agaaatctac atgacgctac cagagagtgt agaaggacga ccggacggtg 3000 tttgccgact gaatcgcgcg ttgtatggcc tgaagcaggc atccagggcc tggaatgaga 3060 gattccatga gttggcaact aaacttggat ttcgtcgtag cgagaacgat cgttgtctgt 3120 acgtgagaca agaaggcgaa tcgaagttgt ttctgttgtt atacgtcgac gatattctta 3180 ttattggtcg gaatctgatg gacatcgtta aggttaagaa gcatctagca gcagagttcg 3240 agatgtctga cctagggcag gtcggaaact ttctcggtat gcgcatcgag agagacatac 3300 agaaccgtgt tttacggatt acccagcgta cctacctgga gagccttctt cgcagattcg 3360 gaatggaaga atgtaagccg gtggcgacac cgatggaatg ccgactgaaa ttgaagaagg 3420 gtgtggagac ggaaaggaca actcaaccgt acagggagtt gattggctgc ctagcgtatg 3480 ttgcacaaag ttcccgaccg gatttgtgtg cagcggtgaa cctgttaagc caatatcaaa 3540 gttgtccaac caacgaacat tggggctacg tgaagcgtat cctgcgctat gtgaagggaa 3600 cacttgacat cggattagag tttcgaggag gcgaacgtgt ggaagctctc gtggcatact 3660 ccgacgcgga ttggggtaat gatgagaacg accggcgatc tatttccggc agtgtattgc 3720 gagtgttcgg cggcacagtg atgtggttaa cacggaagca acaaactgtt gcattgtcgt 3780 ccacagaggc tgagtttgta gcattgtgta cggcagcatg cgaaggcata tggttgcgtc 3840 gattgttatc tgacctgggc gtaactgtag aaggagctgt aacctactat gaggacaatc 3900 agtcctgcat caaagtggct gaagaaccga aggacagtcg acggctgaag cacgttgata 3960 ttaaatataa tttcattcgc gagttggtgc aagatggacg gattgcgatt tcctacattc 4020 cgacggagaa gcagctggca gacataatga caaaagggct gcctgccgga gcttttggac 4080 gtttacgtga agatctaggg ttgcagagat tgagcagggg tat 4123 // ID L1_Ele5 repbase; DNA; INV; 4851 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele5. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4851 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4851 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 23 CC sequences with >90% identity, and ~98% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 171..1193 FT /product="L1_Ele5_1p" FT /translation="MAAVRARENTFKVDLSNFPKRPSFEEIHSFVHETIGL FT SVDQVLRLQMNHAQNCAHVKCRDLKTAQDAVDHHNGRHELEVNKTKFKVRL FT MMDDGGVEVKIHDLSENVRNEDIAAFLKQYGEIISIKELVWGDNFAFKGVS FT SGVRVAKMILHRHIKSFVTIMGEESLISYRNQPQYCRHCTNPSHPGLTCVE FT NKKLLGQKTDLNNRLKAVQKKTSYASVLDRSKVVASLMPEFVATNLNELNA FT NARSSTSTAVVEERGASGVSPPAAPADDERMEEGEIQKNTSDDTDAATADA FT AAAAAAATVVENIAAAPVAGSSAVPLLPPLLPPLLPPLMPPLMPRLLML" FT CDS 1597..4791 FT /product="L1_Ele5_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSRRDTLSYNIGSINTNTVSSDNKIAALRSFVRLLDF FT DIVLLQEVECVSLCIPGYNIIFNVDESRRGTAIALKSHIPFSNVQRSLNSR FT ILTVKIGDSLTICNIYAHSGSQNYSARENLFKEQLPFYLQAASEHVILGGD FT FNCVISNKDATGTNNQSPALKQLVENMQLNDTWDCLKRNVIDYSFVRSNSA FT SRIDRIYVSRSLVPHLHSSELFATSFSDHKAYKIRCGLPNLGKPFGRGYWS FT MRAHVLNEENLAEFEQKWTRWLREKRNFNSWMSWWTEYAKPKIRSFFRWKT FT NEAFREFHATNEMLYRQLRNAYDHLYQNPNGVAEVNKIKAKMLQVQSNFSK FT AYERLNDRYVCGEKVSSFQIGDRYRRKTTISSIRHEDRCITTGTEIENHVH FT DFFRNLYSEEQLHSFEYSPTNRIIPADSNSNNAMMHEITTTELFFAIKSSA FT SRKSPGSDGIPKEFYLKAFEIIHPQLNLIINEALQGNVPENFVEGIVVLCK FT KKTNDQTIKAYRPISLLNYDYKLLSRILKERLEKVMVEQNLLNASQKCSNS FT DRTIFEAVHAIKDRIAELNCRRRHGKLISFDFDHAFDRVNHDFLKVVMRNI FT HLNGNFVQLLGKIMSSSKSRLLINGNLTPSFPIQRSVRQGDPLSMHLFVLY FT LHPLLEKLRAICNGPMDLVVAYADDISIIITDERKLAIIKQAFVNFGRCSG FT SLLNLNKTFTVNLGPHRDADDINWPMAKDSIKILGVTFFNSQKQLIDYNWA FT DVIRKTSRLMWMYKARNLTLHQKIIVLNTFITSRLWFLASIISVPNHAVAR FT ITSHIGSFIWERFPTRVPIEQLTLPKNMGGLNLHLPMHKCKSLLVTRFLKE FT IRETPFASSLAQHLENPPNLAAIPALYPCLKSLSKVLPYIPRRIRENPDAA FT LLHNHFREKLNKPKVMQENTNITWRRVWQNIRIKGLTSIEQSMYYLLVNGK FT IPHAKLLHRQNRVANPFCQLCPNTEEDLEHKFSSCPKTSALWNHLQSKLET FT TVGRRMTFTSLRFPELKNMNRTCRYQALKMFISYVKYVLDANDQLSVEALD FT FILNCVFV" XX SQ Sequence 4851 BP; 1490 A; 1102 C; 999 G; 1258 T; 2 other; tcacagttag gttcaagcta tcgctgcgac cagacgtatt ttcaaaccgc tccgtatcgc 60 tttgtttcga gtttcgttcg tttattctcg caccttgttg tgcagaattt caatcgcgtc 120 agttttctgt ctgacgacca tccaaacccg tgctattcaw aatagcaacc atggctgccg 180 tacgtgcccg tgagaacacg ttcaaggtgg acttatccaa cttcccgaag cggccttcct 240 ttgaggaaat ccactctttc gttcatgaga ccattggact gagtgtcgat caagtgctgc 300 gcctgcaaat gaaccacgcg caaaattgcg cgcacgtgaa atgccgtgat ctgaagactg 360 cccaggatgc tgttgatcat cacaacggtc gtcacgagtt ggaggtgaac aaaacgaagt 420 tcaaagttcg cttgatgatg gacgatggag gcgtcgaagt caagatccac gatctgtccg 480 aaaacgttcg gaacgaagac atcgctgctt ttctcaagca gtacggcgag attatctcga 540 tcaaggagct tgtttgggga gacaactttg cgttcaaagg cgtatcatca ggagtgcggg 600 tggctaagat gattttgcat cgccacatca agtcgttcgt tacaattatg ggtgaggaga 660 gcctaatctc ctacagaaac cagccgcaat actgtaggca ttgtaccaac ccatcgcacc 720 cgggattaac gtgcgtcgaa aataaaaagc tgttgggaca aaagaccgac ctcaacaaca 780 gactgaaggc agtccagaaa aaaacaagct acgccagcgt tctggaccgg agcaaagttg 840 tggcctcact catgcctgag tttgtagcta caaacctaaa tgaactgaat gcgaatgcac 900 gttcgtcaac atcaaccgcc gtagtcgaag aaaggggagc gtcgggcgtc tcgcctcccg 960 ctgcgcctgc tgatgacgaa cggatggaag aaggcgaaat ccaaaaaaac acctcggatg 1020 atacggacgc tgccactgct gatgctgctg ctgctgctgc tgctgccact gtcgtcgaga 1080 atattgctgc tgcccccgtt gccggctcct ctgccgtccc attgctgcca cctctgctgc 1140 caccgctgct gccaccgctg atgcccccac tgatgccacg gcttctgatg ctgtgactgc 1200 tcctgttgtt gatcgttcca ccctgcagac cgatccagac caactgaaaa acaatatcgg 1260 cgaaccgagc cacgtgagtg cattcaaaat tcctatcacc agataccctt ccaatccctt 1320 gtccatggag atttccgaga ctgaaagcaa cgagtcgtcc actgaaggcg gtccgttcct 1380 gaaagtgaag cggccaagag ggcgcccgaa gaaacttaaa actggctccc aataactagt 1440 ctttgacggt cgatctactt aacaccatta acgaaatttc caaaaacgat ccgcaatact 1500 aacacacgaa aactaaacat aaaacctact ccaatcctta aaattaatct cgagtcggcc 1560 gcgagtactt cggctcaaac cccataacta tatgtgatgt ccagaagaga taccctcagc 1620 tacaatattg ggtctatcaa taccaatacc gtttctagtg ataataaaat agcagctctt 1680 cgttcgtttg tgcgtttgtt agatttcgac atcgttcttc tccaggaagt tgaatgtgtt 1740 agtctttgca ttccgggtta caatattata tttaatgtag atgaatccag aagaggcacc 1800 gccatagctc tcaagtcaca catccctttc tcaaacgtac aaagaagttt gaatagcaga 1860 attctaacag tcaaaattgg cgattctctg actatttgca atatttatgc ccactctggt 1920 tcacaaaatt actctgcccg cgagaatttg ttcaaagaac aattgccctt ctacctacag 1980 gctgcttccg aacatgtcat ccttggcggg gattttaatt gtgttatctc caataaagat 2040 gcaactggga caaacaatca aagtcctgct ctaaaacaat tggtagagaa catgcaactc 2100 aatgatacat gggattgcct aaaaagaaat gtgattgatt atagtttcgt tcgttctaat 2160 tccgcttctc gcatagatcg tatttacgtc tctaggtcgc ttgttccgca tctccactct 2220 tcagaactst tcgctacgtc tttttcagac cacaaggctt acaaaatccg atgtggtctg 2280 ccaaatttag gtaaaccatt tggccgcggt tattggtcta tgcgtgctca tgttttaaat 2340 gaagaaaatc tcgcagaatt cgagcaaaag tggactcgtt ggttgcggga gaaacgtaat 2400 tttaatagct ggatgtcttg gtggactgaa tatgctaaac caaagatcag aagtttcttt 2460 aggtggaaaa ccaacgaagc attccgtgaa tttcacgcaa cgaacgaaat gttgtaccgt 2520 caactaagaa atgcatatga ccacttgtac caaaatccaa atggtgttgc cgaagttaat 2580 aaaatcaaag caaaaatgtt gcaagttcaa agcaatttct cgaaagcata cgaacgttta 2640 aacgatcgtt atgtgtgtgg agaaaaagtg tcctctttcc aaatagggga tcgttatcga 2700 cgaaaaacaa ctataagttc aattcgtcac gaagatcgtt gcatcactac agggacggag 2760 atagaaaatc acgttcacga tttcttccga aatctttaca gtgaagaaca attgcacagt 2820 tttgaatatt cgcctacgaa tcgtattatt ccagctgatt caaacagtaa caacgccatg 2880 atgcatgaga taacaacgac tgaactattc tttgcaatta agtcaagtgc atctcgaaag 2940 tcaccaggaa gcgatggaat tccaaaagaa ttctacctga aggctttcga aattattcat 3000 ccacagctca atttgataat taacgaagcg ttgcaaggaa atgtacccga aaattttgtt 3060 gagggaatag ttgtgttatg taaaaagaaa acaaacgacc agactatcaa agcgtacaga 3120 ccgatcagtc tgctaaatta tgactataag cttttgtcca gaatcctaaa agagcggttg 3180 gagaaagtta tggttgagca aaacttactg aacgcaagcc aaaaatgctc gaactcagat 3240 agaactatct tcgaagctgt gcatgctatc aaagatcgaa tagcagaatt gaactgtagg 3300 agaagacatg gaaaattaat ttcctttgat tttgatcatg ccttcgatcg tgtaaatcat 3360 gatttcctca aggtcgtgat gcggaacatc catttgaatg gaaactttgt gcaactttta 3420 gggaagatta tgtcctcctc aaaatctcgt ttgctaatca acggaaatct cactccaagc 3480 ttccccatcc aacgttctgt tagacagggc gatcctttaa gtatgcattt attcgtcctt 3540 tacctacatc cactcctaga aaaattacga gctatctgca atggtccaat ggatctcgtg 3600 gtagcatacg cagacgatat ttccatcata ataacagatg aacgcaagct agcaatcata 3660 aaacaggctt ttgtaaactt tgggcgctgc tccggatcac tactgaacct taacaaaacg 3720 ttcacggtaa acttaggacc acatcgggat gcagatgaca taaattggcc gatggccaaa 3780 gattcaatca aaattttggg agtaacgttt ttcaattccc aaaagcagtt aatcgactac 3840 aattgggcgg atgttatcag aaaaacatct cgattaatgt ggatgtataa ggcaaggaac 3900 ctcacactgc accagaaaat aattgtgcta aacacattca tcacatcgag gctctggttc 3960 ttagcctcga taatcagcgt accaaatcat gccgtagcac gaattacatc gcacattgga 4020 agttttatat gggagcgttt tccaaccaga gttccaatag aacagctcac tttgccgaaa 4080 aatatgggtg gtttgaactt acacctacca atgcataaat gcaaatcttt gttggtgaca 4140 agattcctga aagaaattag agaaacacca tttgcgagtt ctcttgcgca acatttggag 4200 aacccaccaa acctagctgc gattcctgct ttgtatccgt gcttgaagag cctctccaag 4260 gttctgccat atattccaag aagaataagg gaaaatccgg atgctgcatt actccacaat 4320 cacttccgcg agaaactgaa taaaccaaaa gtcatgcaag aaaataccaa cataacctgg 4380 aggagagtgt ggcaaaacat cagaatcaaa ggactaacat cgatagaaca atcaatgtac 4440 tatctccttg tgaatgggaa gattccacat gcaaagctac tccatcgaca gaaccgagtt 4500 gcaaacccat tctgccagct gtgcccaaat acagaggagg atctagagca caaattttcc 4560 agctgcccga aaaccagtgc tctttggaat catcttcaat ctaagttgga gacaaccgta 4620 ggaaggagga tgactttcac tagcctgaga tttcccgaat tgaagaacat gaacagaaca 4680 tgccgatacc aggctctaaa aatgtttatt agttatgtaa agtatgtttt agatgccaat 4740 gaccaactat cagtagaggc attagatttt attcttaact gtgtttttgt gtagaaagtt 4800 catgaatgta atcaaatagt tccaataaac gtgtttaaaa aaaaaaaaaa a 4851 // ID Jockey-N2_CQ repbase; DNA; INV; 1766 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1766 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 585-585 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. ~9 bp TSDs. This family encodes a protein similar to CC Jockey ORF1p but does not encode ORF2p. Thus it is a CC non-autonomous non-LTR retrotransposon derived from Jockey, CC like CC HeT-A. XX FH Key Location/Qualifiers FT CDS 114..1517 FT /product="Jockey-N2_CQ_1p" FT /translation="MPKAGRGRGSSSAASKNPRSSSTGRVQKGKQTNAVSA FT AIAQGSVSADGIDKKYLHPGYVPRSPVRTRSGTSGATTSGTSTSANIPIRN FT EFQMLSDDEENNNTDGSSTTDDDDDDGRARKKVSTPKKNNSPKERRPPPIF FT VLDTLADDVDELLEGLGYCLKIGKSSVQVYTFDEKNFDLVVEKLKRQNFKF FT YTFDPVQKTAVKVVLQGYQDRPISDLKKDLSGAGITPRDIKVLSRKTTVTG FT THTLYLLYFDRGTVKIQDLRRTKALDGFWVNWRFYSKNPSDAAQCHRCQKF FT GHGSRNCNLPPRCVKCGESHLSEACALPCKADLGDKAEQTKARIKCANCGG FT NHTSNYRGCSARKNYLEEQEKRKKKAAASHPPQRSTSETVPAAGHRTVPAD FT NSAFPPGWGRSFASVVAAGSGNTAQQEVTGEDLFTLPEFFALAGEMMTRFR FT TCRNKAEQFLALGELMIKHIYKG" XX SQ Sequence 1766 BP; 474 A; 460 C; 451 G; 381 T; 0 other; cacctccgtg tacgcacgag agcaagcagc agtcaagtgc tctgtgctct accgtttttt 60 cgaaattttt cgccgtagtt gatagtgatt acccgcgagt ttgcttctcc agcatgccaa 120 aggccggccg tggccgtggc agttcgagtg cggcctcgaa aaacccgcga agttcgtcta 180 ccggccgtgt gcaaaaaggt aaacaaacaa acgccgtgtc ggccgccatc gcgcagggga 240 gcgtgagcgc agatggaatc gacaaaaagt atctacatcc cggatacgtt ccgagatcac 300 cagtgcgaac ccgttccggt acttccggtg ccacgaccag tggcacatca acatccgcca 360 acatccccat caggaacgag ttccagatgc tgagcgacga cgaagaaaac aacaacaccg 420 acggtagcag cactaccgac gacgacgacg atgacgggcg tgcacggaag aaagtgtcga 480 cgccaaaaaa gaacaattct ccaaaggaac gtagaccacc tccaattttt gttttggaca 540 cgttggcgga cgatgttgac gagttgctgg aaggcctcgg atattgtctg aaaatcggta 600 agtcgtcagt gcaagtttac acatttgacg aaaagaactt cgacctggtt gtggagaaat 660 tgaagcgtca aaacttcaag ttttacacat tcgaccccgt gcagaagact gccgttaagg 720 tcgttttgca ggggtaccaa gaccgcccga tctccgacct caagaaggac ctctcgggtg 780 ctggaataac gccgcgtgac ataaaagtgc tctcgcggaa gactacagtc acaggtacac 840 acacactgta cctgttgtac ttcgaccgcg gcaccgtcaa aattcaagac ctgcggcgaa 900 ctaaggcgtt ggacgggttt tgggtaaact ggcggttcta ctcaaagaac ccgtcggacg 960 cagcacaatg ccaccgttgc cagaaattcg gccacggctc gcggaactgc aacctcccgc 1020 cccgctgtgt gaagtgcggt gaatcacacc tctctgaggc gtgtgcactg ccgtgcaaag 1080 cggacttggg ggacaaggca gagcaaacga aggcgcgcat caagtgcgcc aactgcggag 1140 gtaaccatac cagcaactac cgtggatgca gcgcacggaa aaactacctc gaggagcagg 1200 aaaagaggaa gaagaaagca gcagcgtccc accctcctca gcgaagtacg agcgaaaccg 1260 tgccggcagc tggtcatcgt acggttccag cggacaactc agcgttccct cctggatggg 1320 ggcgatcgtt cgccagcgtg gtcgctgccg gcagtggcaa tacggcccag caagaagtta 1380 ccggagaaga tctctttacc ctgccagagt tctttgctct cgcaggggag atgatgacgc 1440 ggtttcggac ctgccgtaac aaggcggagc aattcctggc ccttggggag ctgatgatca 1500 agcacatcta taaaggataa aaaactgtga tctagtttta agctttttct atttctatcc 1560 cctttccctt gcaattttag taagtttttt ttattttttt cttactcttg gtgacacctt 1620 tatttacaag caataactgt tccaaaatgg attatgatgt aacacacagc tgtaaggaac 1680 tccaaaactc tgttatgtac ctcaaaagaa cttattgtat cttgattatt caccaataaa 1740 accgaattga attgaattga attgaa 1766 // ID Gypsy-27_DPu-LTR repbase; DNA; INV; 110 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_DP_; KW Gypsy-27_DPu-I; Gypsy-27_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-110 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX SQ Sequence 110 BP; 30 A; 26 C; 21 G; 29 T; 4 other; tgtagcgtag ctatagcata tgcccccaat agatatcatg tatcccgtag taatatgata 60 taccmcgcgc tgmtatataa cgcgctgccc cttgtagtca awggamatca 110 // ID CR1-10_CQ repbase; DNA; INV; 2168 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-10_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2168 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 12-12 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 3..2105 FT /product="CR1-10_CQ_1p" FT /note="reverse transcriptase." FT /translation="RRADFESLSRALRGIEWSSVLDPVDIDLAVDTYTXIL FT GRLIEQFVPKIRKVVSHHLPWQTQELRXLKTQKRAAFKVSNGSVHLRDYYL FT RINHSYQRLSRRCFANYQRKMQKKLKTDPKKFWKFVNESRRESGLPTSMRL FT ADXEGDDADKICQLFAKKFAGVFCAEPITDEEVRAAAENVPRRDELLASIE FT IDEGTIQAAAAKLKHSCSPGPDGIPSTLLKRCSSSLSAPLLHLFRLSLSSG FT KFPVAWKQAYMFPVHKKGDRTNVENYRGISALCATSKLLELVVIDPVFSHC FT RQDLADEQHGFFPKRSTATNLLCFTESIINSFDAHSQTDAIYTDLSAAFDK FT INHRIAVAKLERLGFRGSLLAWFKSYLTGRSLRVRIGDALSDIIDATSGIA FT QGSHLGPVVFLFYFNDVNYTVTGPRLVYADDLKLFARIDNSNDAEALQRDL FT DRFAGWCXVNRMILNPGKCQXITFSRRHFPVTFDYRINDTSVERVTHVKDL FT GVILDTQLTFKQHLSYIIAKASRQLGLVIRMTRKFTDIYCLKTLYCSLVRS FT TLEYCSTVWSPHYNNAVCRLESIQRRFVRYALRQLPWTNPLXLPPYEQRCR FT LIDLDTLQLRRDLARAMTSADVLLGRIDCPDLRNQIQLRVPTRQLRHTPAL FT EVPFRRTNYSANGAINGLKRAFNKVSSVFSLDVSRTTLRFKFVSLLRSLLR FT M" XX SQ Sequence 2168 BP; 535 A; 586 C; 503 G; 533 T; 11 other; tccggcgcgc cgacttcgag agtttgtccc gcgccctacg tggtatcgag tggagcagcg 60 ttctggaccc tgtagacatc gatttagcag tcgacacgta cacatstatt ctgggmagac 120 tgattgaaca gtttgtgccg aagattcgga aggtggtctc ccatcacctt ccctggcaaa 180 cccaagaact wcgccwcctc aagacccaga aaagagctgc cttcaaggtt tccaacggat 240 ctgtacactt acgcgactac tatcttcgga ttaatcacag ctatcaacga ttgagtcgca 300 ggtgctttgc aaactaccag cgcaaaatgc agaagaaatt gaaaaccgac ccgaagaaat 360 tctggaaatt cgtcaacgag agccgtagag aatctggatt gccgacttca atgcggttgg 420 ctgacgawga aggagatgat gcagacaaaa tttgccagct tttcgcgaag aagtttgccg 480 gtgtattctg cgcggagcct attactgacg aggaagtccg agccgccgca gaaaacgttc 540 cacgtcggga tgaactactg gcttcgatcg agattgatga aggtaccatc caagccgctg 600 cagccaaact aaaacactct tgctcaccag gaccggacgg gataccgtct accctgctga 660 aacgttgttc ctccagccta tcggcaccac tcttacacct gtttcgtctt tcgttgagct 720 ctggaaagtt tcctgttgcc tggaagcaag cttacatgtt tccagtgcac aaaaaaggcg 780 atcgaactaa cgtggagaac taccgtggga tttccgcctt gtgtgccacc tcaaagttgc 840 tggaattggt tgtgattgat ccggttttct cgcattgccg ccaagacctt gctgatgaac 900 aacacgggtt ctttcctaag cgatcaacag caaccaatct cctctgcttc accgaatcga 960 tcatcaacag cttcgacgcc cactctcaaa ccgatgccat ctacacagat ctgtcagccg 1020 ccttcgacaa aattaaccat cgcatcgctg ttgctaagct tgagagattg ggtttccgag 1080 gaagcctatt ggcctggttc aagtcttatc taacgggacg ttcgctgagg gtgagaatcg 1140 gagacgcact ttccgatatc atcgacgcca catcgggtat cgcgcaagga agccacctcg 1200 gtccggtggt attcctgttt tactttaacg acgttaatta caccgtcact ggacctcgcc 1260 tcgtttacgc ggatgacctc aagctttttg ctcgaattga caactcgaac gacgccgaag 1320 cattgcagcg cgatcttgac aggtttgctg gttggtgcga kgtgaaccgc atgatcctca 1380 accctggaaa atgccaakcg atcacgttca gtcggagaca cttcccagta accttcgact 1440 accgtattaa tgatacttct gtcgaacggg tcactcacgt gaaggatttg ggagtcatct 1500 tggacacgca actcactttc aagcagcatc tgtcatacat aatcgccaaa gcctcccgac 1560 aactaggatt ggtaatccgc atgacgcgca agttcactga catttattgt ctaaagaccc 1620 tgtactgctc gctggtccgm tccactctag agtactgctc gacagtgtgg tccccgcact 1680 acaacaacgc tgtttgccgc ctcgaaagca tccaacgaag gtttgtgcga tatgccctca 1740 ggcagctgcc gtggacaaat cccttgcawc ttccgccgta cgaacaacgc tgccgtttga 1800 tcgacctcga tacactmcag ctacgacgag acctagcccg tgcgatgact tctgctgatg 1860 tcctactcgg ccggattgat tgccctgacc tgcgtaacca gattcagctc cgagtgccta 1920 cccgtcaact acgccacacg ccagctctcg aagttccgtt tcgtcgcacc aactacagcg 1980 cgaatggtgc catcaacgga ctaaagagag cgtttaacaa agtgtccagt gtgtttagct 2040 tagatgtgtc tcgtactacg ttgcgtttta agtttgtgtc cttgttaaga tctttgttac 2100 gcatgtaaat ttagttttaa gtkcatcatt ggggcaaaat aagtgcctgt tgagaagaaa 2160 taaacaac 2168 // ID Gypsy-1_SI-LTR repbase; DNA; INV; 378 BP. XX AC AEAQ01029017; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_SI_; KW Gypsy-1_SI-I; Gypsy-1_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-378 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01029017; Positions 524 147. XX SQ Sequence 378 BP; 71 A; 104 C; 88 G; 115 T; 0 other; tgtaagatat ctttaagcta gcgtcgaaac gactcaggct ggctgcagcg ttgcgcgcgt 60 tgcggctgct tccttaggca ttggccaaag ggatcgcgcc acacttttgc cgttgtgcgg 120 cgctgtgggc gtgatgcagc acgatggatc tcgggctttt cggttgggtg gtgagatcgc 180 ggccgtggtg atcgcacgtc tcacttaacc ttttgctgtg tatccgccaa gagaataaac 240 gagattacgg ttagcatcgt cttcttcctt gcacaggccc tcccttctca ttcgcctcct 300 tatttaccgt tccttaaacc tatttcctga cactaactaa tattctccct aactataatt 360 tttattcccc cctttaca 378 // ID Gypsy-9_DPu-I repbase; DNA; INV; 5282 BP. XX AC scaffold_14; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_DPu_; KW Gypsy-9_DPu-LTR; Gypsy-9_DPu-I. XX NM Gypsy-9_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5282 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 733-733 (2010). XX DR Genome; scaffold_14; Positions 742210 747491. XX CC Positions [4165-4662] - Integrase core CC 'TACCT' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1057..5100 FT /product="Gypsy-9_DPu-I_1p" FT /translation="MAAFRMALDPAMQQVVEVALGILPTTVTTPDLVLDRI FT ADYVRAKRNIALDRVAFEERRQGPSETFDDFYIALRRLAEAADLCATCFDT FT RLVTRIIAGTRDAETKKKLLAMSPFPALQSTVNICRSEESARANERSLSGH FT SGISSIQNKRNRTERPYANECKSCGHAAHSTGITCPAIGRTCHSCGQPDHF FT APKCPNRDRGKSGGGGGNGSSGGSSGGSSSGGSGGSKSKVGHITIGNVQAN FT HRQRRSPTIALEVLLDESGSSVATIDDAIPDPGAEVSVAGRDVLAALGMTE FT ADLLHSPFELVMADRSTPLLSIGQHEISVRYGGRCTRMTVVFCPEIRGMLI FT CRLDCVNLAIIHADYPQPLTPIQSVQISPTEDADQPTQPSEWEFLRHIHLP FT LDPSAEQIAEIEAAIAAEFDSVFDQEDGLRTMVGPDMMIQLRDDAVPYYVN FT GARPIPFSDRPEVKSKLDQLVSQGVIVPVTEATEWAAPLVVIRNAKTGKIR FT ICVDHTRLNKFVLRPTHPTRTPRDAVAEIDSECRFYTSFDAANGYYQIPLH FT PSCQHLTTFMTPWGRFKFLRASMGLSCSGDEYNRRADVAFAAVSNTVRVVD FT DLLRFDLTFPEHVRGVCEILQAARDAGITFSKEKFRFARNRVSWVGYEIKH FT GGVTIEEAKLKALSQFPRPTNISELRSFMGLVEQLAGFSTEVAAAKEPLRP FT LLSTRNPYVWTPDHDRAFAAVKLALISPPVVVHFEPDRETVIQVDASRKNG FT MGYALMQRHDGGWRLVDANSRWCSDVESRYAIVELELAAVEWAIRKCRLYL FT SGLPNFTLMVDHQALVAILDKYTLDAIDNPKIQRLKERLSPYSFTTVWRKG FT KERAIPDALSRAPVNDPAPEDERVVADLDFSVRNVVIQSIAAVCNSEDESA FT PSDHLPDALLADLRSTAVADADYTALVAAVESGFGTDRARVTNYIRQFWSI FT RHQLSTEDGLVLFGSRIVVPFSSRRNILQKLHAAHQGIVRTKRRAQQTVYW FT PGISNDVTMLVERCSKCQERRPSQCQEPLMSDPLPTFVFEDVSADLFQHGN FT LHVLVYADRLSGWPVVHQWRRDPTAREVVQAVVHNFVELGVPMRFRSDNGP FT QFDAGVFQETLKRWGVAWGNSSPHYPQSNGHAEAAVKAIKELVAKTGDLSS FT EEFLAGLLEFRNTPHESGASPAQILFGHQLRSIVPAHRSSYAPKWKDAMIA FT RERQAAAAAETKLRYDAHTRPLSPLRIGTQVRIQDPDSKLWTHVGVIVAVG FT RYRSYRIKFASGSVLWRNRRFIRPLVPAEEETTATIDRINPSADGSAPTVA FT DVIVPPASTQQPPIVPAPDGPRRGTRVRKQRVITSV" XX SQ Sequence 5282 BP; 1137 A; 1530 C; 1455 G; 1160 T; 0 other; tggcgcagtt ggtctcacgc acctacgatt gctaatcaca gtggtgtact tttttcgtga 60 ctctagtgaa ccctgcgttt ccctgtgttc ctcgccctca gtgttcgact ttcgcccatg 120 cggcccattt acgccgccat cttgccttga cgatcgaatc cccactcctg atagtgcagg 180 aggccactct gtcagaggcg tcacacatgt gccggtcctc gtactgttag ttgtgcggca 240 tttcagattg ctttgaattc tcttgattta tcgacatttt tttttgtggt gcgaacccga 300 ccaacgcacg cggcttatgg gcgccgccat cttggttttt gcagcccctc gttgtgtgtc 360 ggcatatcag attcacggca tatcaggccc caactacttt tctcgctgag ttaatttttc 420 aacccgactt tttctttctc tgtgcgacga gtgttattag tgccttattt gtgcctgttt 480 gttccgtatt cccacgaagt tcgctttcga ccaacaggtg gcgcaacaca cggtgcaaaa 540 tcgtcgctgt gccttttgta ccatcccgca tcgtatattc tgcgtgacac agcatcgctg 600 aacttagtgg ccagtgagcg gctgtgatta tactcgcgtt ccgtatatgt ccaaaaggaa 660 gcaacatatc gctacttcac ccatcatcac ccgagctcgt ctgcgagcca cacacccaat 720 ggcagcacca ccggcggcac caacggcggc ggatgcgtta gcggcggcgg cggcggccgg 780 aaaccggatc gacaccctgg aggcgctggt ggtggcggtg gcggtggcgc tggtggcggc 840 ggtggcggtg gcgctggtgg cggcggtggc ggtggcggcg gtggcggcgg cggcggtctt 900 gttactccgg caacgcctca acgacggcgg ctggacacct ctagcgtgga aaaacttcac 960 ggcgatgtca ccattcccct cctgcggtcc tggaggaacc gttggaacga cttcgccgaa 1020 ctcagccagc tggcaacata tcctgggtcg gagcagatgg cggcgttccg tatggccctg 1080 gacccagcca tgcaacaagt tgtcgaggtg gcgctgggta tactgcccac cacggtcacc 1140 actccggatt tggtactgga tcgcatcgcc gattacgtcc gcgcgaagcg taacatcgcc 1200 ctcgacagag tcgcctttga agaacggcga caaggtccat cggaaacgtt cgacgatttt 1260 tatatcgcac tccgccgctt agctgaagcg gcagatttat gtgcaacctg tttcgacacc 1320 cgcctagtca cgcgcatcat tgccggcacg agagatgctg aaaccaagaa aaagctgctg 1380 gcaatgagcc catttcccgc cctgcaatcg acagtcaaca tctgcaggag tgaagaatcc 1440 gcacgggcca atgaacgatc tttaagtggc cattccggca tatcatcgat ccagaacaaa 1500 cgcaatcgca cggagcgccc atatgcaaac gaatgcaaat catgcggtca cgcagcgcac 1560 tccactggga ttacgtgccc ggccatagga cgcacgtgcc attcatgcgg tcaaccggac 1620 cattttgccc caaagtgccc caatcgcgac agaggaaaga gcggcggggg tggcggcaac 1680 ggcagcagtg gtggcagcag cggtggcagc agcagtggcg gcagcggcgg atcgaaatca 1740 aaagtcggtc acattactat tggcaacgtg caagccaacc atcggcaacg cagatccccc 1800 accatcgcat tggaggtttt gttggacgag agcggctcgt cggtcgcgac aatcgatgac 1860 gcaatccccg accctggagc agaggtgagc gtggccgggc gcgatgtctt ggcggcttta 1920 ggtatgaccg aagcggacct attgcattcc cctttcgaac tggtcatggc tgaccggtcc 1980 actccactcc tatccatcgg gcagcacgaa atttctgtcc gttatggcgg ccggtgtacc 2040 cgtatgacgg tggtgttttg cccggaaata cgcggaatgc tcatctgccg tttggattgc 2100 gtgaatcttg caatcatcca tgcggattac ccacagccac tgacaccaat tcagtccgtg 2160 caaatttcgc cgacagagga tgcagatcag ccgactcaac catcagagtg ggaatttcta 2220 cggcacattc acttaccctt ggacccatca gcggaacaga tcgcagaaat cgaggcggct 2280 atcgccgcag aattcgattc tgtgtttgac caggaagatg gtttacggac aatggtgggg 2340 ccagatatga tgattcaact gcgggacgat gcagtgccat attatgtgaa cggcgcccga 2400 ccaatcccat ttagtgaccg tccggaagtg aagtccaagc tggaccaatt ggtgtcgcag 2460 ggagtgatcg tgccagtgac cgaagcgacg gaatgggccg cccccctggt ggtgattagg 2520 aatgccaaga ccggcaagat ccgcatctgt gtcgaccata cccgattaaa taagttcgtc 2580 ctgcggccaa ctcaccctac gcgtactcca cgtgatgcgg tcgccgaaat agacagcgag 2640 tgccgattct acaccagttt cgacgccgcc aacgggtact atcaaattcc cctccatccc 2700 tcctgtcaac atttaaccac ctttatgacg ccgtggggcc gttttaagtt cctccgcgcg 2760 tccatgggcc tcagttgctc gggtgacgaa tacaacagga gggcggatgt tgcgttcgcc 2820 gcagtgtcaa atacggtgcg tgtggtcgac gatctgcttc gcttcgatct taccttcccc 2880 gagcacgtga ggggagtgtg tgaaattctt caagcggcac gcgacgccgg gatcaccttc 2940 agcaaagaaa agtttcgttt cgcccgaaat cgtgtgtcgt gggttggata cgagatcaag 3000 cacggcggag tcaccatcga ggaggccaaa ttaaaggccc tgtctcaatt tccccggccg 3060 actaatattt cagaattacg gtcattcatg ggcttggtgg agcaactcgc cggattttcg 3120 acggaggtgg cagcggcgaa ggaacccctc cgacctcttc ttagcacacg caacccatac 3180 gtgtggacgc cggaccatga ccgagcattc gccgccgtta agttggcttt aatctccccg 3240 cccgtggtgg tccatttcga accagaccgt gaaacagtca tccaagtcga cgcctcgcgc 3300 aagaacggaa tggggtatgc gctaatgcag cgacacgacg gaggctggcg tctagtggat 3360 gccaattccc gttggtgctc cgatgtcgaa tcacgatacg ccatcgtgga gctggagctg 3420 gctgcagtcg agtgggcgat ccgaaagtgc cgcttatacc tatctggtct tccaaatttc 3480 accctgatgg tggatcatca ggccctcgta gcaatcttag acaagtatac cctggatgct 3540 atagacaatc ctaaaataca gcgtctgaaa gagcgactgt cgccctattc tttcacaacc 3600 gtttggagaa aggggaagga gcgcgccatt ccagacgcct tatcacgggc cccagtcaac 3660 gacccagcgc cggaagatga acgtgtggtt gcagaccttg atttttccgt tcggaacgtg 3720 gtgattcagt cgatcgcggc tgtatgcaac tctgaagacg aatcggcccc gtcggatcat 3780 ttgccggacg ctttgctggc cgatctgcga tccaccgccg tggcagacgc ggactacacg 3840 gccctagtgg cggcagtcga atcaggtttc ggcacggatc gtgcccgcgt gacgaactac 3900 atacgtcagt tttggtccat tcgtcaccaa ctatctacgg aagacggcct tgtcctcttt 3960 ggatcccgca tcgtggtgcc gttctcatct cgccgtaaca ttctccaaaa acttcacgcc 4020 gcccaccagg gcatcgtccg gacaaaacga cgggcccagc agacggtata ttggccggga 4080 atatcgaacg acgtcacgat gctcgtcgaa cgctgctcca agtgccagga gaggcgaccg 4140 agccagtgcc aggagccact gatgtcagac ccgctaccga cattcgtatt tgaagacgtg 4200 tcggccgact tattccagca cggaaacctc catgtgctgg tatacgcgga cagattatcg 4260 ggttggccgg tagtccacca gtggcgacgc gatcccaccg cgcgagaagt agtccaggca 4320 gtcgtgcaca acttcgttga gctgggcgtg cccatgcgat tccgctcgga taatggacct 4380 caattcgatg ccggagtttt ccaagagaca ctgaaacgtt ggggcgtggc atggggtaac 4440 tcttcccctc attaccccca gagcaacggc catgcggagg cagcagtgaa ggcgataaaa 4500 gaattagtgg cgaagaccgg agatttatca tcggaagaat ttctagcggg gctcctggag 4560 tttcggaaca ccccccacga gagcggtgca tcaccggcac aaattttatt cggccaccaa 4620 ctccgctcca tcgtcccggc gcatcgatca tcctacgctc ccaaatggaa ggacgccatg 4680 atagcacggg agcgacaggc ggcggcggct gctgaaacta aacttcggta cgacgctcat 4740 acccgcccat tgtcaccact tcggatcggc acgcaggtac ggatccaaga cccggattcc 4800 aagctttgga ctcatgtggg cgtcattgtg gcggtcggcc gctaccgatc ctaccgaatc 4860 aaatttgcta gtggcagcgt cctgtggcgc aatcggcggt tcatccgccc cttggtcccg 4920 gcggaagaag agacgacggc gacgatcgac cgaatcaacc catcggcaga cggcagcgcg 4980 cccacggtag cggatgtcat cgttcccccc gcatcaaccc aacagcctcc catcgttccg 5040 gcgccggacg gcccacgccg tggaactcgc gtccggaagc agagggtcat cacatctgtg 5100 taatttattc catgtcgtac aaccattccc gtttgatcat ttttgtgctc gtttaatatg 5160 ccaatgtccc ctatccaccc caaagtgtcg ccgacattcg cttcccattg gatgtattca 5220 tggaatcatc attctatttg tttgttctgt ttgtttgtac cgagtgaggg ctcgggagga 5280 gt 5282 // ID Gypsy2-LTR_AP repbase; DNA; INV; 407 BP. XX AC Contig39744; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2AP; KW Gypsy2-I_AP; Gypsy2-LTR_AP. XX NM Gypsy2-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-407 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 440-440 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 407 BP; 89 A; 122 C; 79 G; 117 T; 0 other; tgtggcgaac cgatgccaca tgacctccct ccgtcgtcgg taacattgta tacttttcgt 60 attgtcatga acggttggtc acattgcgct cgcaccgagc ggcggcctcc gcgtcgttgc 120 ggcggcctgc tctagctctc tctcctcgta accgcgaaca tatatagtcc attcttttac 180 cgctagtccg cgaaccgacg ccatcttttc tccgtttaaa aactgttccc ctcaggaatt 240 tcgcatcgcg aacgcacgtg gctagccgct caaaatctga tcatcgatcg cactttttgt 300 acagctacct ttcttacgta ctttcgtact tattgaaata aaaagcacat cgcgcacctg 360 cataatcaca tttggtgtac aatcatttat tcgtcaatag acgccca 407 // ID Dtripu1cons repbase; DNA; INV; 491 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dtripu1cons. XX OS Drosophila tripunctata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup IV. XX RN [1] RP 1-491 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with show less than eight percent divergence. CC Dtripu1cons. XX SQ Sequence 491 BP; 154 A; 109 C; 111 G; 115 T; 2 other; cttgggtgcc gcacgagtta tcaaanaaaa acatactgga ccgaatttcc atctgcgaat 60 cttggtcgaa acgtaatgaa atcgacccat ttcttaagcg gatggtgatt ggtgatgaaa 120 agtgggtcac ttacgacaat gttgttagaa aacgantcct ggtcaagacg cggtgaatca 180 gctcaaacgg tatccaaacc aggactaact gccaggaagg ctttgctgtg cgtttggtgg 240 gattggaagg gaatcatcta ccatgagctg cttccatatg gacaaaccct taactcagac 300 ctctattgtc aacaattaga ccttttgaag aaagcaattg acaaaaagcg accagaattg 360 gccaatagaa aaggaattgt ctttcaccag gacaaggcta gaccacacac atctataatg 420 acgctcaaca aacttaggga gctcggatgg gaagttttaa tgcatccacc gtactcaccg 480 gaccttgcac c 491 // ID BEL-77_CQ-I repbase; DNA; INV; 7137 BP. XX AC AAWU01021738; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-77_CQ_; KW BEL-77_CQ-LTR; BEL-77_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7137 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 295-295 (2011). XX DR GenBank; AAWU01021738; Positions 17435 10299. XX CC Positions [6082-6660] - Integrase core CC 'TCTGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2614..7038 FT /product="BEL-77_CQ-I_1p" FT /translation="MTDFLQTRIRILESLPKKKADSKVVRQNQVRSKPAVV FT KASYNTAQESGGPCCACSGTHFLHQCKSFQQRTVSDREALLRTNSLCRNCL FT KSGHLAKDCSSRFSCRNCNKRHHTLLCFKSGKDKGSAQYKRDNSKAATDES FT QASGHSSSSTSNQVTPEITVSNSTQLFSTQVLLATAIVIIEDDEGNRLPAR FT ALLDSGSESNFISEHLSQRLRVLRNKVDISVSGIGRSVSKVKQQIRATLCS FT RVSNFSRDMRFLVLPKVTVSLPTSNINTAGWTIPDNVVLADPTFSVSKGVD FT MVLGIESFFDFFETGRKISLGEELPALNESVFGWVVCGGIADSGESIRITC FT NVSARDKLEALVTRFWSCEEVESGSSFSPEEARCEALFAQTVQRGADGRYS FT VALPKNEAILSKLGESRNIALRRLHGTERRLARDAHLQDQYTEFMDEYLRL FT GHMHKVEETDTVKRCYLPHHPVVREASTTTKVRVVFDASCKTSSGVSLNDA FT LLCGPVIQQDLRSLIYRCRIKQIMLVSDVEKMFRQIGITPEDRALQCVLWR FT PTPSAEVCTYELNTVTYGTKSAPFLATRALKQLAMDEKHRFPLAAKAISED FT VYMDDVITGMDDEDAARNLRIQLDEMMISGGFRLRKWACNRAEVLRGVAEE FT NLAIPFPEGINLDKESSVKTLGLTWIPNTDEFKVQFDITPTVAEDELCKRV FT VLSKAASLFDPHGWFGATITTAKIFLQQLWTLVDADGKRLDWDTPLPPTVG FT ENWRKYEEQLPVLNSIRFARCVVIPNAEKVELHCFSDASKKAYGGCVYVRS FT ENAAGDVMVRLVASKSRVAPLKVQTIPRLELCGATLVAQLFKVLQEALDIP FT LSAHFWTDSTCVLSWLDAIPTTWATFVANRVSKIQTITEGHQWKHVPGVQN FT PADLISRGIMPNDIVDNLLWWNGPPWLALGREHWPNSTATVAHDEVETERR FT RTAVAIVASNLQEFTDYYLSKFETYPILIRRTAIWLRLMKNLPLPDEERSR FT GFITSAELRQAEYVLIRRVQKEVFAEEWKALSASQPLPKKSPLRWYHPYID FT RDQILRVGGRLTHSEEGEETKHPAVLPARHQLTRMILQHYHMRLLHAGPQL FT LLGAVRLRFWPLGGRSLARLIVHQCNRCYRCKPSPVQQFMGDLPAARVTVS FT RPFSKTGIDFFGPVYVRPGPRRTAVKAYVCLFICLCTKAVHLELVSDLSTD FT RYLQALRRFIGRRGRPAEIWSDNGTNFVGGKNRLQELFALMKDSEHKEKVA FT KEFADQGIRWIFSPPSAPHFGGLWEAAVRSAKHHLLRVLGEETLAIEDMTT FT LLVQVENCMNSRPITQLSDDPNDLEPLTPGHFLVTSSLQSLPDPSYLDVPT FT NRLNYWQQVQRKVQEFWRRWKRDYLSQLQSRTKRLYAAVPIEVGRLVVVVD FT DNLPPIRWKLGRIHTVHPGDDGVVRVVTLKTASGFLKRPVEKICLLPRPDE FT QPTTDSTEEK" XX SQ Sequence 7137 BP; 1839 A; 1715 C; 1822 G; 1761 T; 0 other; aacatataaa gtttgtctat ggagtcttga aactttatat tcactttata cgcatggagg 60 taaaattgag tggcgacatc tagagttgat ataaagtccc tgactgaaaa gtccgtcgga 120 cgtccgtcgt ttgtcgtccg tgcaatcgta cgacaatgtc gtgcgacttt cgcacgaatg 180 tcatacgttt ttgcaccaaa atgtcgtatt tgtcgtgcga tcgatcatcg aaattgttcg 240 acattgtcgt acgacattcg accgacgtcg cacggtttgt ttttagggat cgtcctcgca 300 cggcatcgta gatcgtcgcc tgctactgaa cttcagcgcg ggacactgct tgtccaggta 360 aatactttga gttcagtata tgtctggccg gcgcgccaca tacgtgaatc taacaccgcc 420 atttcgacct cttctttacg ttgtgacctg tgacctgtga gtgctactga tcggatcgat 480 catctggtgc cacctacgac tcgactggat gatcatcgcc aacccggatt aggccggtaa 540 cgtgagtcaa cgtacagcgc agcgttctgc gcgacgaccg aacgggacta catggtgtcg 600 gttgcctgct ggctgccatc ttcgactggg agacgaatgt actaattggt gcacaccgcg 660 aggatcgtca cgggagagat cgacgacacg tcgacggccc gagctggacg acgtcgttgc 720 ggatcaccgg ctggatagct ggaatacatc gcgagcatcg aagaggcagc atcatcggtt 780 gttttcccag gtaaatactt tagacgcagt atatgtctgc cgaggcctga acggaaatac 840 acgccattaa atcatatgtt cacctcttcg attgcctatt gtaccaccac atccctgtgc 900 atcacgcctc gagcgtgtcc accaacccca tttgaatttg catttgaatt ttggatcggc 960 tgttagtttc tccaggtgag gccaacagta tatgcctgcc gaggcagttt cttggtgata 1020 ctacatccat ggtttttcat cctcttcgcc ctacacacga tcgactgagc tgattattgg 1080 actggcttcg cacgattcgt agaactttgg tgggatcaga tcttcgtttt gttggctgct 1140 gtgttgctgt tggttcctac aggtaataca acacggtata tgtcagccga agcattactt 1200 tggatactac atgcccacac tttcgccctc ttctgccacg tgtactgtac gtaacattga 1260 gccgcttgag ccgaatcgaa cgatttgtga cgacaaatat ttgacgctgg tctgcatcgg 1320 ttgggaggta ccaacacctg tttgtccagg taacacattt tagtgtatgt ctagcgaagc 1380 taggcccaac cgcagaatac tacagccacg catctcgttt ctgatctctt cattgcaaca 1440 acaacacaac atgccgatca ccacccggag taacaaaacg ttgtcactca aacagttgac 1500 gacggttttc aagcaggtga aagcgtcaat gcaggacatt tttaactttt cggctgattt 1560 ggaaaagagc tgtacgattg ctgatgttga agtaagattg ggtgctttag atgagctgtg 1620 ggtagacttc aatgagactc tggttgagat tcaatctcat ggagagtacg tggcaacgga 1680 agacgcaact tatgaagctg accgtgcgaa gttcagtgag agttactttc tcaacaaatc 1740 tcgtttaatc accaaatcga gagaacttca agctcctcct gcggttctgg aacagaccat 1800 ccggtctgag gaaacttcgc atgggctcaa cgatcacatt cggctaccgc agattaagtt 1860 gcagaagttc accggaataa ttgaggaatg gctcagcttt agagatctgt ttgttttacg 1920 accccacaga cgaccggagg ttagccgaaa ctcggtcgtg gggtgtaatg attatcacac 1980 taattaacac tgattacagg taaacttcgt cagtataata aaacgcgtgt ggttagtctt 2040 ctatcaatca agttaataac attttatttc aaccacatga caacacacag taaacaatga 2100 cgcgctgtca atgacagtcg aggctcggca ttcgggccga gttcgagtcc agcgccgaat 2160 gcacagtggg cacattcgca attccagtcg tcacaatagt ttgtatcgct cattcactca 2220 aaggctgaac tctcagacgt cgaaaagttt tactacctga ggggttgcct tgatggaaaa 2280 ccggccggct tgatcgatca tctgaagatc actgcagaaa gctacactgt ggcctggact 2340 attctgctga atgagtacga caacgataag cttctgaaga agcggcagat tcaatcgcta 2400 tttgaactac ctgtaatcac cgaggaatct tcttcagact tgcacactct cgtggatgga 2460 ttcttgagaa tagtacaaac attggatcgt gttttggagg aagcgaactt caaggacttg 2520 ctactggtca acatcttatc atctcggctt gactcgatta ctcgtcgagc ttgggaagag 2580 tattcagcaa aggataatga tactttgaag gacatgacgg actttctcca gacgcgtatt 2640 agaattctcg aatcgctacc caagaagaag gcagacagca aggttgttag acagaaccaa 2700 gtccggtcga aaccagcggt ggtgaaagct agttacaaca ctgcacagga gtctggaggg 2760 ccttgttgtg catgttctgg aacgcacttt ctacaccagt gcaaatcgtt tcaacaaagg 2820 acggtgtcag acagagaggc actactccga accaattcgt tatgtcgaaa ctgtttgaaa 2880 tccggccact tggcaaagga ctgctcgtcg agattctcgt gtcgaaactg caacaagcga 2940 catcacacgc tgttgtgttt caaatcgggg aaggataaag gttctgccca atacaagcga 3000 gacaactcta aggcagctac ggacgaatct caagcatcag gacattccag ctcgagcacg 3060 tcaaatcaag tcactcccga gataacagtt tcgaactcaa ctcagctgtt ttcaactcag 3120 gtcctgttag caactgcaat tgtcatcatc gaggacgatg aaggcaaccg acttccggca 3180 cgagccctct tggattccgg atcagaaagc aatttcatct cggagcatct gagccaaagg 3240 cttcgcgtac tacggaataa ggtggacatt tccgtttcgg gcattggaag atcggtatcc 3300 aaggtcaaac agcagattcg agctacgctg tgctcacgtg tctccaactt ctcccgagac 3360 atgcggtttc tggtactacc caaggtgacg gttagcttac caacttcaaa catcaacact 3420 gctggttgga ccattcccga caacgtggtc ctggccgacc caaccttctc tgtctccaag 3480 ggcgtagaca tggtgttggg cattgaaagt ttcttcgatt tcttcgaaac tggtcgtaag 3540 atttcattgg gcgaggaact accagcactc aacgaatcag ttttcggttg ggtagtctgc 3600 ggaggaattg cagactcagg agaatctatt cgcatcacat gcaacgtgtc ggcgagggat 3660 aaactagaag ctttggtgac tcggttttgg tcctgcgagg aggttgagtc aggcagcagt 3720 ttttcgccgg aggaagcacg ttgcgaagct cttttcgcgc aaacggttca gcgcggagct 3780 gacggtcgct actcggtggc tttaccgaag aacgaggcca ttctctcgaa gctcggcgag 3840 tcgaggaaca tcgcactcag acgccttcac ggtacggagc gaaggctggc gcgggacgcg 3900 catcttcagg atcaatacac ggaatttatg gacgaatatc tgcgcctggg gcatatgcac 3960 aaggttgagg aaactgatac agtcaagcgg tgctatctcc cacaccatcc agtagtcagg 4020 gaagccagca cgaccaccaa ggttcgcgtg gtatttgacg caagttgcaa aacatcgtca 4080 ggagtttccc taaacgatgc gctgttatgc ggtccagtca tacagcaaga tttacggtcg 4140 ctaatctacc gctgccgtat caagcagatt atgctggtgt cggacgtcga gaagatgttc 4200 cgacaaattg gaatcacccc ggaagatcga gctctacagt gcgtactctg gcgaccaacg 4260 ccgtccgctg aggtgtgcac gtatgaactg aatacggtta cgtatggaac gaaatcggca 4320 ccgtttcttg ccactcgcgc tctgaaacag ttggcaatgg atgaaaagca tcgttttccc 4380 ctagcggcaa aggcaataag cgaggacgtc tacatggatg acgtcataac cgggatggac 4440 gatgaggacg cagcacgcaa tctgagaatt cagctggacg aaatgatgat cagcggggga 4500 tttcggctca gaaagtgggc gtgcaaccgt gcagaggtct tgcgcggtgt tgctgaggag 4560 aatttggcca ttccatttcc agaaggaatc aacctggaca aggaatcgtc cgtaaaaaca 4620 ttgggtttaa cttggatccc taacacggat gagttcaagg tacagttcga cataacacca 4680 actgttgccg aggacgaact ctgcaaacgt gtcgttttat caaaggctgc atctcttttc 4740 gacccgcacg gctggtttgg ggcaacaatc acaacagcga agatctttct ccagcagttg 4800 tggacactgg tagatgctga cggaaagcga ttagattggg acaccccgtt accacccacg 4860 gtgggtgaga attggaggaa gtatgaggaa caacttccag tgctcaattc aattcgtttt 4920 gctcgttgtg tggtcattcc aaatgcggaa aaggtggagt tacattgctt ttcggatgca 4980 tcaaagaaag cgtacggagg ttgtgtttat gtacgaagcg aaaatgcagc tggcgacgtc 5040 atggtacgac tggtggcttc caaatccaga gtggcgccgc tgaaggtaca aacgattcca 5100 cggctggagc tgtgtggcgc gacgctggta gcccaactct tcaaagttct gcaggaggca 5160 ctggacatcc cgttgagcgc gcatttttgg acggattcga cgtgtgttct cagctggctt 5220 gacgcgattc caacaacgtg ggccactttc gtcgcgaatc gagtttccaa gatccaaacg 5280 atcacggagg gacatcaatg gaagcacgtt ccaggtgtgc agaatccagc agatctgata 5340 tccagaggaa tcatgccaaa tgacatcgta gacaatcttc tctggtggaa cggaccacca 5400 tggttagcgt tgggacggga gcattggcca aactcaacag ctacagtggc gcacgatgaa 5460 gttgagacgg agcgtcgacg cacagcagtg gcgatcgttg catcaaacct gcaagagttt 5520 actgattatt atttgtccaa gtttgaaact taccctattt tgatacgaag aacagcaatc 5580 tggttgcgct tgatgaagaa tctgccttta ccggatgagg agcggtccag gggtttcatc 5640 acttctgcag agttgaggca ggctgagtat gtgctcattc gtcgtgtgca gaaagaagtg 5700 tttgctgaag aatggaaggc cctgtcagca agtcaaccgc tgccgaagaa atccccgctt 5760 cgctggtacc atccgtacat cgatcgtgat caaattctgc gggtcggtgg gcgattgacg 5820 cactcggagg aaggagaaga aacgaaacat ccagctgttc tgcctgctcg tcaccagctc 5880 acacgcatga ttcttcaaca ctatcacatg cggctgctcc acgctggccc acaactacta 5940 cttggtgcgg tgagacttcg tttttggccg ttgggaggca gaagcttggc tcggctcatc 6000 gtacaccaat gcaatcggtg ctacaggtgc aaaccatcac ctgtacaaca atttatggga 6060 gatctgcctg cagcgcgagt cacggtctct cgtccgttct ccaagactgg aatcgacttt 6120 tttggaccag tttacgtgcg accaggaccg aggcgtacag cagtgaaggc atacgtttgt 6180 ttattcattt gtctatgcac aaaagcagtt catttggaac ttgtgtcgga tttatctaca 6240 gatcggtacc tgcaagcgct gcggaggttc ataggtcgcc gtggcagacc cgcagagatc 6300 tggtctgata acggaacaaa ttttgttggt ggtaaaaatc ggcttcaaga actgttcgct 6360 ttgatgaaag actcggagca caaagagaag gtcgctaagg agtttgctga tcaaggcatt 6420 cgttggattt tcagcccacc aagcgcacca cactttggcg gtttgtggga ggccgctgta 6480 cgatcagcaa aacatcatct gctacgagtg ttgggggagg aaactttagc catcgaagac 6540 atgacaactt tgttggttca ggtcgaaaat tgcatgaatt ctcgcccaat cacacaattg 6600 tcagacgacc caaacgattt ggaaccccta acacctggac acttcctcgt aacttcttcg 6660 ctgcagtctt tgccggatcc aagctacctc gacgtgccca caaaccgact caactactgg 6720 cagcaggttc aacgcaaggt ccaagaattc tggagacgtt ggaaacgcga ctatttgtcg 6780 caacttcaga gcagaactaa acgtttgtat gcagcagtcc caattgaagt tggccggcta 6840 gtagtcgtgg tggacgacaa cctgccacca atacgttgga agctcggcag aatacacacg 6900 gtacatcccg gggacgatgg ggttgtacgg gtggtaacgc tgaagacggc ttcgggattc 6960 ctgaaacgcc cggtggaaaa aatatgttta ttaccacgtc cggatgaaca gccaacaaca 7020 gactctacag aggaaaaatg aaaacatcac tttcccatcc cctcccactc ccactccctc 7080 gaagaggatt tttttattta ttttcagaaa tgctggatat ttctgggtgg gtgagaa 7137 // ID BEL-651_AA-I repbase; DNA; INV; 6528 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-651_AA_; KW BEL-651_AA-LTR; Pao_Bel_Ele228; BEL-651_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6528 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5558-6118] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 50..1822 FT /product="BEL-651_AA-I_2p" FT /translation="MNPASHCMLCDRPDNADDLVQCDKCDGYVHFACAEVG FT DSIADEDRSFTCQRCIERLEAITVSSQRSSKRTSTSTRSSLSAARLTLRLQ FT QLETERQNRLKELEDAERFQRLRRQVEADFEKQKLEALESQLLDEEENRST FT RSRVSARVIHANTHKWIGSTSQSDDVVGRPSNRTEEANEPQIFESTCNVEI FT QSGNPEIPATSTSRPALKNIGVPKQKTSVPVSLAGVGSVSDPNSRTRDMST FT YEFHIDTSRGAGQSTSNQVRDSVSVSNVRLESIAESVQTSTPKPPTDNVSI FT PKVQIASKLRDKSLTVSSQIAQQRLPCKGSTTGEPPPKASVSQPISELDPM FT QYHKSGVPGKRLGPKVGQQLSEIKLSERLPPTQVPSTSTSQSHQQQCAEPV FT LSSTGHTATRSSNEFPSTQRSFQQYSQWNQSALLPPPGYNKLGSKPVIHYQ FT QQWPSGMAQWSQPQQPHPLNSTEFHQNQVPISSAPLQPLGQHPIPSSSLTE FT AVPVQISPSSNRRTAPTAEQLAARQVIPRELPTFSGDPQDWPLFFSSFNNS FT TEACGYNDSENLARLQRCLRGHALESVRSRLLIPESVPHVTNIR" FT CDS 2203..3252 FT /product="BEL-651_AA-I_1p" FT /translation="MSKEKLFVHQQTPSTSGSKHISSTRQKEPAEVVLKPC FT TYCNKTDHRIAECQEFKQLDLDGRWKVMRRKGLCRICLIPHRTWPCRSKQE FT CGVGDCRMRHHPLLHSRKAELSASQSPSSAVRNVTYQHHHATTSYALLRYL FT PVTLYGNGKSIELFAFLDDGSSSTMLEAEVANLLGIEGPNEPLALGWTGDV FT TRIEKESQRIEVTIAGGNNGKCFLLKAKTVGPLKLPSQTVDYEELCETHPQ FT LRHLPLKSYTSVSPRLIIGVDNARLISSLKNRESAAGDLIATKTRLGWSIY FT GTHLVNKTSVGYVNTHVKLSQEDTDFTDLFRQFLAVEEATIKQNPQSEEDK FT RALEILQ" FT CDS 5138..6172 FT /product="BEL-651_AA-I_3p" FT /translation="MHEYPDDYATLVYDRDDPDREQKLIVKSSPLYKMSPK FT IDEDGVIRMDSRIIAAPRITLDMKYPILLPRNNQVTKLLVESYHERFLHQN FT NETVCNELRQRFRIPRLRTVISKIARQCQHCRIGKAAPQVPIMAALPEFRL FT TGFVRPFTHTGIDYFGPVYVKQGRSQVKRWIALFTCLSVRAVHLEVVHCLS FT TQSCVMAIRRFVARRGSPITFCSDNGTNFVGANNLLKEQLQKISKDCAQSF FT TNSQTKWLFNPPLAPHMGGSWERMVRSVKVAMAAIAHHPRHPSDEVLETII FT VEAESIVNSRPLTYIPLDSVNQEALSPNHFLLYGTQGINQPSQELTVDHST FT LS" XX SQ Sequence 6528 BP; 1899 A; 1510 C; 1648 G; 1460 T; 11 other; atcttcaaga ttttgtggtt agatcctgcg tgcggcgagt atacagataa tgaatccagc 60 aagtcattgt atgctgtgcg atcgcccgga caatgccgac gatttagttc aatgcgataa 120 atgtgatggc tatgtacatt tcgcatgtgc agaagtagga gactcgattg ccgacgagga 180 tcggagtttc acctgtcaga ggtgcatcga gagattagaa gctatcacag tttcatcaca 240 acgttccagt aagcgaacaa gtacgtctac gcgcagtagc ctgagtgcgg cccggctaac 300 attacgtctc caacagttgg aaaccgaacg acagaatcgg ctgaaggaat tggaggatgc 360 tgaaaggttt cagcgactac gaagacaggt agaagctgat tttgaaaagc agaagctcga 420 agccctcgag tctcagctgc tagatgagga ggagaatcga agtactagaa gcagagtaag 480 cgccagagtg attcacgcaa acacacataa gtggattggt tcgacaagcc agagtgacga 540 tgttgttggt agaccctcga atcggacgga ggaagcgaat gagccacaaa tcttcgagtc 600 gacatgcaac gtagagattc aaagcggtaa tcccgagatt ccagccacgt cgacgtcaag 660 accggcgttg aagaacatag gtgtcccaaa gcagaagacg agtgtaccgg ttagtttggc 720 cggwgtggga tcagtgtccg atccgaactc gcgtaccaga gatatgtcca cttatgagtt 780 ccatatcgat acatcccgtg gtgcaggcca gtctactagc aaccaggtga gagatagtgt 840 tagcgtttcc aatgtaagac tagaatctat tgctgaatct gtccaaacaa gcacacccaa 900 gccaccaacc gataatgtaa gcattcctaa agttcaaata gcatctaaac ttagggataa 960 aagtctaaca gtatcaagtc agatagccca acaaagatta ccatgtaaag ggtcaactac 1020 aggggaacct cctccaaaag caagtgtttc acaaccgatt agcgagttag atccaatgca 1080 atatcacaag tcgggtgtac cggggaaaag gctggggccc aaagttggtc agcaactatc 1140 cgaaattaag ttgtctgaac gtttgcctcc aacacaagta ccatctacaa gtacatcaca 1200 atcccatcaa caacagtgtg ccgagccagt tttgtcatcg acggggcata cggcaactag 1260 aagttcgaac gagtttccat cgacacaacg aagtttccag cagtattcgc agtggaatca 1320 gtcagcattg ctaccacctc ctggatataa caagttaggg tcaaaaccag tgatccatta 1380 tcagcaacag tggccttcag gaatggcgca gtggagccaa ccgcagcagc cacacccttt 1440 gaacagtacc gaatttcacc agaatcaagt cccaatttcg tcagcgccgt tacaaccact 1500 agggcagcat ccgatcccat cttcgagttt aacggaggca gtaccagttc aaatttctcc 1560 ttcgtcaaac agacgaactg caccaactgc tgaacaatta gctgcgcgtc aagtcattcc 1620 gagggagttg cctacgtttt caggtgaccc acaggattgg ccactattct tcagctcgtt 1680 taataattcg accgaagcgt gtgggtataa tgattctgag aatctcgcaa gactccagag 1740 atgtctccgt ggccatgcac tagaaagtgt tcgcagtcgt ttgctgattc ctgagtctgt 1800 tccccacgtg acgaacatac gctagcccgg ctctatggca ggcccgaagt tataattcat 1860 tcactgttga agcgggtgcg tgaaattccc tcccctaaag gtgatgatct caaaaccttg 1920 atcaaatttg gaatgggagt agggaacctg gttgaacaca tgattctagc cgatcaacgg 1980 cagcatatca gtaaccctat gctgttgcag gagatggtgg acagactacc acctaacttg 2040 aaactacagt gggcgtccca caaamggatg tatcagackg ttgatttkgg agmatttaat 2100 ggttkcgtga tggacctggc gacgatggca agtgacgtta ccctacacat ggatcmagcg 2160 cagtcgagtt ccagcaggtt ggagaagccg aaaaaggaca aaatgtcaaa agagaagctg 2220 ttcgtgcatc aacaaactcc atcgacatct ggatcaaaac acatcagctc tacacgccaa 2280 aaggaaccag ctgaagtagt tctcaagcct tgtacatact gcaacaaaac cgatcatcgc 2340 attgcagaat gtcaagagtt caaacagcta gatctcgatg gacgctggaa agttatgagg 2400 cgaaaaggac tttgccgaat ctgtttgatc cctcatcgca cgtggccgtg tcgctcgaaa 2460 caggaatgtg gagtcggaga ttgtcgaatg cgtcatcacc ctttgctgca ctctagaaag 2520 gctgagctca gcgcgtctca gtcgccttct tcggccgtca gaaatgtgac ctaccagcat 2580 catcacgcca ctacatcgta tgccttactt cgctaccttc ccgtgacgct atatggaaat 2640 ggcaaaagca ttgaattgtt tgcctttcta gacgacggct cgtcgtcgac tatgctggag 2700 gcggaagtgg caaatcttct tggaattgaa ggacctaatg aacctctggc tttaggatgg 2760 acaggagatg tcacaaggat cgaaaaggaa tcgcaacgca ttgaagtcac tatagccggg 2820 ggaaataatg ggaaatgttt cctattaaaa gccaaaacag taggccccct caaacttcca 2880 tctcagacag tggactacga agagctgtgt gaaacacatc cgcaactaag gcatctcccc 2940 ctgaagagtt ataccagcgt ctctcctcgg ctcataattg gcgtagacaa cgctcgactg 3000 ataagctcgc ttaaaaatcg tgaaagtgct gcaggagatc tcatcgcaac aaaaacccgg 3060 ctgggatgga gtatttacgg cactcatctg gtaaataaga catctgtcgg atacgtgaac 3120 acacacgtga agctaagcca agaagacaca gatttcaccg atctgttcag acagtttttg 3180 gcggtagaag aggctacaat aaagcagaat ccacaatcgg aggaagacaa gcgggctttg 3240 gaaatattgc aggamacaac acgaagagtc ggaggaaggt ttgaagtagg tttgctgtgg 3300 aaatacgatg atccatgtct gcctgacagt tttcctatgg cggttcgtcg tatggaagca 3360 ttagagaaaa aattggagaa agatccttat ctggacggaa aggtgagaga acaaatatca 3420 gagtacgtcg aaaaaggata tgcgcatcga atcactgcag cggaattaga gtccacggag 3480 tcagggcgcg tatggtatct acccctaggg gtagtacgca acccgcggaa gccggaaaag 3540 gtccgattga tttgggacgc tgcagcacga gtagagggcg tgtcgttcaa tgataggatg 3600 ctgaaaggac cggaccttct gacagcacta ccaaccgtgc tgctacgctt ccgtcaaaaa 3660 tcaattgcct tcagtggaga tattagggaa atgttccacc aattccttat tcgtcagaga 3720 gatcaacaag cgcagaggtt cgtgtttcga gaacgcscag gacaagcgcc gcagattttt 3780 gtgatggatg tcgctacatt cggcgctgcc tgctctccat gtatcgcgca gtatctaaaa 3840 aaccggaatg cggaggagca cgcagaacag ttccccgacg cggcgaaggc aatcatcgag 3900 aaccattatg tagacgacta cctcgacagc gtagacacgg tagaggaagc agttcaacta 3960 atacaggagg tgaagcacgt tcatctttgg ctggaatgga aatccgcaac ttttcgtcsa 4020 actccgccaa agtgcttgaa cgcgtagaga ggtgaatggg aatcgtcata gcctaagtcc 4080 gatgcaatcg tagagcgagt gctcggtatg gtgtggaggc catctgaaga cgttttcagc 4140 ttcgaactaa tttaaaagag gagattcgga acatcatgaa aggcatgaag caccaaccaa 4200 acgtcaagtc cttcgtacaa ttatgtcact gttcgacccg ttaggactgg tagcgcactt 4260 cacctcacgg aaagatcacc atgcagcagg tttggagaac cggagcaggt tgggacgatg 4320 ccatttccgg taaaattctg gaagataaga gtggagcgtc aagatagaga tgctcgaatt 4380 ctaaggtgct tcttcgcaga cacgggagct catccgaaag atgcagagat ccacgtttcg 4440 agaatgcata tgcttgcgag actgcgttca ggagctccgc agtgcacatt gatagcggca 4500 aacacgaaag tggcaccgtt gaagccactc tcaataccac ggctagaata ataagcagct 4560 cgagttggtg accgtcttct ggatagaatc tgcaaagcgc tgacaatacc tgttgtcgct 4620 cgataccttt ggtcggactc gacgaccgca cttgcctggc tcagatccga aactcgacga 4680 taccaccaat ttgtcggctt tagagtggga gaaaattttt tgagcacaac cactatcaac 4740 gaatggcgca aagttccttc caagataaat gtcgcctgat cacttgatcg cttgatttgg 4800 tagctctacc aaatggagag atggacccag tttcgacccc gaagactggt ggtacgcagg 4860 accgacgttt ctaagggatc cggatgacag ttggctgaag caaaccgatc atatgtttga 4920 gacgtcggag gatttgcgaa acgtcttcct acacatccat cggactttac cagcaccgct 4980 gattgacctc agccgcttct ccaaatggtc gaggctcaac agggccgcgg gatacgttgt 5040 gcgggcagta aagctattcc tcggttggaa agcagaaggt ccattaacgc atgaagaatt 5100 gcagaaggca gaatctctga tctggcktca agcgcagatg cacgagtatc cggatgatta 5160 cgcaacgctc gtctacgaca gagatgatcc ggatagggaa cagaaactga ttgttaaatc 5220 aagcccttta tacaagatgt caccgaaaat tgatgaagat ggagtaataa ggatggatag 5280 tcgaatcatc gcagcaccaa ggattactct ggacatgaaa tatcctatct tactgccaag 5340 aaacaatcag gtcacgaaat tactggtgga gagctaccat gaacgttttt tgcatcagaa 5400 taatgaaacc gtttgtaacg agcttcggca acgttttcga attcctcggc ttcgcacagt 5460 aatctcgaaa atagccaggc agtgtcaaca ctgtcgaata ggaaaggcag caccacaagt 5520 ccccatcatg gcagcacttc ctgaatttcg tctaaccgga tttgtacgac cgtttacgca 5580 cactgggata gactattttg gtccagtgta cgtgaaacag ggacgcagtc aagtcaaacg 5640 ctggatagca ctgttcacgt gtctgtcagt aagggcagta catctagaag ttgtacattg 5700 tttgtcaaca caatcatgtg tcatggccat acggcgcttc gtggctcgaa gaggttcacc 5760 aataacgttt tgctcagaca atggaaccaa ttttgtcgga gctaacaatc tgttgaaaga 5820 gcagctccag aagatctcca aggactgcgc tcagagcttc acgaattcac aaacgaaatg 5880 gctgtttaac ccacccttgg cgccccacat gggaggttca tgggagcgca tggtccgttc 5940 ggtgaaggtt gcgatggcag caatcgcaca tcatccaaga catccaagtg acgaggttct 6000 cgaaaccata attgttgagg ccgaatcaat agtcaactca agaccgctaa cgtacattcc 6060 cctggatagt gtgaaccagg aagcactttc gccgaaccat ttcctacttt atggtaccca 6120 gggaatcaac caacctagtc aagagttgac cgtagatcac agcacattaa gcttaaggga 6180 tagctggaaa ttggcccagt atctggtaga tacattctgg cgacgctggg taaatgaata 6240 tttgcccacc ctgacaagac gcacgaagtg gtttcaaccg gtaaaaccaa tgatgccggg 6300 tgatttggtg gtggtagttg aggaaacgac tcgcaatggt tggttgcgag gacgaatcgt 6360 ggaagttgtg aaaggtaagg acggtcaagt ccgcagagct gtagtgaaga cgtcgagggg 6420 aatactcagc aggccagcga ccaagctagc aatcctagat gttcgaggta ctacaggggc 6480 tccagaaaat gaaaattcaa gcgaaccgga actacacggg agggggga 6528 // ID TDD4 repbase; DNA; INV; 3839 BP. XX AC U57081; XX DT 24-OCT-2005 (Rel. 10.1, Created) DT 03-FEB-2010 (Rel. 15.03, Last updated, Version 2) XX DE Dictyostelium discoideum Tdd-4 transposable element. XX KW Ginger2/TDD; DNA transposon; Transposable Element; TDD4; Ginger; KW Ginger2. XX NM TDD4. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-3839 RA Wells D.J.; RT "Tdd-4, a DNA transposon of Dictyostelium that encodes proteins RT similar to LTR retroelement integrases."; RL Nucleic Acids Res 27(11), 2408-2415 (1999). XX DR EMBL/GenBank/DDBJ; U57081; Positions 1 3839. XX SQ Sequence 3839 BP; 1668 A; 489 C; 449 G; 1233 T; 0 other; tgttgtaggg aattttcaat aaaatttcac tacagtagcg tagtgaattt tattatttaa 60 tttcactaca gtagcgtagt gaaaaaaaaa aatcactgac cctaaatttt ttttttttca 120 aaaaacaaat aaagtttatt aaattattat cttttatttg attattagtt ctatataagt 180 ttttttattt tttttttttt ttattttatt tttttttttt tttattttat tttttttttt 240 ttttttgcgc caaaactttt tttttttttc cccaacaaaa cttttttttt tttttttttt 300 ttttttattt tcattttccc aaaatgaaac aagataaaat aaatgaaatt ataaattatt 360 taaaatttat tgaagataat aattttgaaa atactaaaca taaaaattat aaatttttac 420 atgatgagta aatatttttc cttacaataa cacaatcatt ttttaaattc tgaccaataa 480 aaaacagaaa cagaaaagaa aaattattta gaattgaaaa aaaagtcatg aattgtggcg 540 gagatagaaa tatatcaaaa gaagtgtgtt tagaagtaat taatgattat agaatggtat 600 cactctggga agaaatgcat aaaggtcata taggaagaga tgcaacctat gggaactaca 660 agactaaata ttataatatg ggattgtatt cttttgtatc tgatgcagta gatacgtgtg 720 acatttgcca acgaaataga ataaaaggtt tggtatcaat catttttata tgatgttttg 780 taacttttat attaaaactc ttttaataat ttaaataaat taataggtat aacaaaggat 840 tttgctccaa ttgtagatac cgaggaatac tcaagattgg tttatgattt aacatcaatt 900 aaaggtgaac ataaagaaaa agttacatac gatgatgata atgaaaagat actaactaaa 960 ctagatgatc ttatccaata tgactctgta caaccgtacg acaccgatgt tgtttacatt 1020 attctatgca tagattcttt tacaaaattt gctactggaa ggtatttatt ttttatttat 1080 taatttatta gttttctttt tttattttaa ttatcattta ttttttagat gtttgacaac 1140 aaagagaaca gttcccatat acaatttttt ggctctcacg tactttggca aacctgtgaa 1200 agtatggcac tgcgataatg gacgtgaatt taaaaacaaa gtccaaaaag aattcctaaa 1260 actttttcca ggctccaagt cagcacatgg agctcctcgt acaccaacaa ctcaaggtat 1320 ggtagaaaga ttgaatcgaa ctatcaagga gaggatctca aaattaaaac aacaagagta 1380 tgatattgtg aacaatataa attatcgata tgtttattaa ttaatcattt ataaataaat 1440 tttagttttc ttgatggtac ttctaggtct ctttctgaac tattaaaaca agctttgtat 1500 gattacaata atacaaaaac aagaacaatt aaaatgaaac catctcaagc tgttggtatt 1560 gttcctttgt ttattaatgt tcaatcagaa caagactctc aatcaattgg tgtttcagat 1620 gtttcaaaag aagaaagaac agctattatt cttgaaaatc tcacaagtta ccaaaatcaa 1680 tggaattcaa aaccaccaaa gggattgaaa gttggtgata ctgtactctt tctagaaatt 1740 aaaaaaaata atgtaagtaa ttaaaacata aagattaata atatatgaaa cagtaataat 1800 taacaaattt tatatttaat tctaatagaa aatattgatt ttgtgtaaaa ttcataaggt 1860 aatacaggaa gacactaaac aactttataa acttgaattt ttggaagatg gaatcaatag 1920 ccttcaaaaa aaaggtttat actctggttt tgtaggaggt aacaagttgg ttctttataa 1980 acaatcaaca gttgatattt ttagaacatc gccaaacata caaaaagata ttgattcttt 2040 tacttctggt ctttatttaa ataatgatgg gaaagttatt gaatttctgg ttcagtttat 2100 taaaaatctt ggtaaggatt tatccaattt caacttgcat caattggtaa cacaaaatga 2160 tccatttagg gttttggaag atgctcttaa caatccatca aatattccac ctatcattaa 2220 cgataatcca tttcaacaca aactcaataa tgaaatggaa caagaagaaa tactcccatt 2280 tttaaataat aatccaacag cgccaaattt aatgaattgt ttgagaaagg atatgaattt 2340 aaaagaaata gtaccaccaa tacctcaagt gcaaataata ccatcgtata ctataccaca 2400 atctggtagt ggtatagtaa caactgttaa acgtctcaga ggccgtcctc ataagatacc 2460 aattgttaat aaacctcctt taaaatcaca ttctaaacca tccaaatcac ctctcaaatc 2520 accatccatt tcaccttcca attcaccatt taaatcacca tttaaatcac cttccaaatc 2580 accatttaaa tcaccttcca aatcaccatc catttcacct tccaattcac catttaaatc 2640 accttccaat tcacctttta aatcaccatt taaatcacct tccaaatcac catcaatagg 2700 aaaaatcgtt aatgtgatac cttcaaaaaa agttgcaaaa acaaagttaa caaggacaca 2760 aaaattagaa cttgatttgg ttgcaattaa tagaaggggg tatggggaaa ttagaaaata 2820 aaaacaagtg gccaaatcaa aaaaacaaaa aaaacacaaa tcaacacaaa aaattaaaaa 2880 aaaaaaaaaa aattaaaaat taaaaaaaaa aaaaaattga aaaaaataaa aaaaaattac 2940 atttgttaaa aaaaataaaa aaaaataaaa aaaaataaaa cttaaaaaaa attgaaaaaa 3000 caaaataaaa aaaaaaaatt gaaaaaacaa aataaaaaaa aatttcaaaa aaaaagaatg 3060 atcgaagaat taatagattc gataaaactc ttaagaaatt tagccattga atatggcaat 3120 gaagaatagt tagattcaga ggaaaatttg actaaattcc tcgaaaaagc agccaaagta 3180 ctatatgtaa atagtggtga tgctttaatc gttaaaaatc caaatatgat gttagaagca 3240 ttagaattaa aatcaaaaca atttcaccaa cattgtgtaa aaactgtaat agaaaacgaa 3300 ttatatttgg acttttttgt atacaaagga aaatgcaaaa acatcaaatg caccagaggt 3360 aacaaagcta gaaaaaattc aattgtgtgc agtgtttaca gtgatatggt aaatcaagca 3420 attgaacaac tcagggagtt aataaaaata aattaataaa aataaattaa aaaaaaaaaa 3480 attgaaaaaa aaaaaaaatt aacaataata ataaataaaa taataattaa aacaaaaaat 3540 aaaattacaa tttaataaac aaataaaaat tgaaaaaaat ttaaaaaaaa aaaattgaaa 3600 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaattaaca ataattataa taaaaataaa 3660 taaaataata attaaaacaa aaaataaaat ttcaatttaa taaactttat ttgttttttg 3720 aaaaaaaaaa aaatttaagg tcagtgattt ttttttttca ctacgctact gtagtgaaat 3780 taaataataa aattcactac gctactgtag tgaaatttta tggaaaattc cctacaaca 3839 // ID Academ-1_BF repbase; DNA; INV; 7324 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 7324 BP; 2229 A; 1655 C; 1558 G; 1882 T; 0 other; tagtccatgg gccagacccc caaattttac atatcagtac cacctgaggg ggcaaatttt 60 ccttatgatt tttaagtagt agggtatctg agggatatat tttgatatga tagccatcgc 120 agagtggtag ctttgacccg gctatctgcc cttgaccttt tgaccccgat ctggaccacg 180 ccctttgata tgattataca aatgggtgtg taaatcaatt aaacaataat attttcaaat 240 gaaacttgct gggctgatac agaatgaacc aaggattaca gaaaacttaa tttcatttca 300 atatcttgat cctatgaaga gttatgagtc tttgaattat tatttttgag caaattttgg 360 atgaaaaatg ttaatttcta cttttcaaag gctgacattt attccgaact cacatttgac 420 cttttttcat gcttttatct gctttgttgt atcattccct gcattctaaa gtttaaatca 480 tgaaaatcca ccatttctat ctagagatat gagtacctaa ataagtacac acctacaggc 540 cttgtgttgg ggaaaccgtt gctagggccg ttgctatgga ggcgctgggc tgcaatccca 600 agcaaaatgc agtacaaagc ctagtctttt gatatatttt tatgttctta taccttccaa 660 catatataga gaacatgttc aaggtgtcat atgtaaggta aagtataatt ctagtttaaa 720 acttacatgc tatatgtata agttaaatag cattatcatc ctacatgtat aagagacctt 780 gcacgagtgt gcatactatt tcccacaact atgaaataac acttgacagt gttggttgct 840 acataagaac aataaacatt gttgaccact catgttgtga ttgccattga cctacttttg 900 cagatgtcct ttaaatgacc cctctcccca caccctgagc cttaattact tccagtcgca 960 tcttccctgt acctctattt tggctgtagt gccacatcat catggcctgg ccatgtgcta 1020 ctctccttga tgctgtctgg ttgacgattg tgccttctta gaactatatt tagctagcat 1080 tagaacttct ccaaagtgtg agcctgcccg ttatatctac aatgttcggg acaatgttaa 1140 gttgagtgca caagctccgc gtggacaact tggcatcttg attcttgtgt tcggactgga 1200 tggattcact cctctaacat acctctgact gggaggcgac gttgccacta gcactggcca 1260 gtcctgtttc aacttcatac gcatctcaac agctccatgg aggctaagaa tcgtactaaa 1320 gattgtgacc gccaccccac cccaacctcc agcctaggcc gaccatacct ggagtcctca 1380 gagattcgta cagccataag gtactcagtc ttcctccttt tctctgtgtc tatctcctca 1440 cttttcaaca tggcaagtgc aaaagactcg tgcgttgcag cgaaacgtcc tcgtttagac 1500 tgtgtgatac actgttcgcc aaacgatgag actgaatcta agttgataag tccccagagt 1560 attgactcat ggaaaacact actaagagca gctgagatca gaaactacac tcccatcata 1620 gagcttgcta aaagcttgtc agaaggcgaa atcccagata tcaagtacca cagaacatgt 1680 cgtagtatat ttaccatgaa gaaagatctg gacagattgg cgaaacagaa tgcagatgag 1740 aaatctgaca aagacaattt tgaaaaaagt tgtaccggta agagaaagca atgttcaaca 1800 tcaagggttt acgagaatga atgcatgttc tgccagaaaa ctaaatatct gccaggaaaa 1860 aggaccagag aacctttggt acaatgcagt gagttcaggg ctgatgagaa gataagacag 1920 gctgcagaag ccaaaatgga ttttagaatg ataagtctgg cttctaggga ccttatagca 1980 gcagaaggtc aataccacag atcatgctac agagaataca ctaggagtac cacatgtgta 2040 gcaggcaccg atgacaacaa tgaagggagt atgaggcaag attcatatga gtctgctctg 2100 gatactgcgt atcaagagct gttccaattc atacggaatg atctgttcag caaccctagg 2160 gtactacgaa tgacagacct ctcctcaaga ttagtggagt ctatgaagtc tcttggtata 2220 gagcagatca aagactccac caagaaacac atgcggcgga gtttagagag cgagttttct 2280 agtaccctac acttcttttc tgatgacaac ctcagactgc ttgtctatcc tgacaatcta 2340 tctagatgcg accttgtcaa agcgaatcag gccttacaaa aggagctcaa cactcttaaa 2400 tcactcctta acaaagatgt tgttgccaag gcagccctgt tgcttagaga cgacataaga 2460 aagcaggatg ttcctcaagc ctggccaccg gtgatcacag aggaggacca aagtagagac 2520 atcatccctc cgtcagtcgt caaattcttc tgctacttgc tgacaggtgg tttcaacaat 2580 gccgatgcat cacagagggt ggaacgactt gtgacctcat tcggatatga cactgtgtat 2640 gctgtaagcc agggaaaagt gaaaacaccc aaacacatca cgttggcttt ttccatcaag 2700 tcactgacag gaaacgtgga gcttgtgcaa ctgctgaacc gacttggtca cagtgtttct 2760 tacagtacta tggaggaaat cagtacggcc ctttgccttc agaagctgtc aagtggtgat 2820 ggtgacataa cccttccaag taacttgcat aatcaagttt tcacaacact tgcctgggac 2880 aacatcgacc gactggaaga gaccataagc ggtggcggga catctcatcg tgtcaatggc 2940 attgctgtcc aacccaaggt aacagagtcc actgaagtag aagccctccc tccagtgact 3000 aaaacaaaaa gaagaagcat tgacgcccag cagcccggca gcattgccaa cctacaatgt 3060 tggccaaaga gagggtcctg aaccatgcca atctgttgac cttgagcaag aagatatcac 3120 tcacactaca cggctgaaga acataatatg gatgatagct cgtatactta aacaaaaaga 3180 gcaaagcatc agcagctgga caggcctcaa catcaaatca cgcagtgatg tccaagtaat 3240 ccctgacaac gtttgctaac ttcccaccat caatgcccct ccaacacaga tgtccaccgt 3300 gtacgaaatt ctcaggcagt ccttaatcat taaggataag cttcagttaa agaagatcgt 3360 ctgtgtcttt gatcaggcta tctatgcgaa ggcgacagag gtcatgtgga agcacaaaga 3420 tattttcaaa ccaatagtca tcagaatggg atctttccat acatcctgta cacttctggc 3480 aattatcgga aaacgcttcc aggatgcagg tctgagggac cttgccattg aatctggggt 3540 agtagcagag ggttcggtgt ctggagtgat ggacggacgc aagtacaaca gagcagtcag 3600 gttccacaag ctagtgtacg aggccttact caggcttgtg tggtcagggt ttctgtcctg 3660 gatgcgtcag aaccacaatg aggatctcac tgacctggaa gaagcaacgg aaaccatcga 3720 cacccacacc aatgatgtat cacaacaagc tttactaggc atcctacagt cagcaacaaa 3780 taacagaatc atgtgcatgt ttgacaacta tgtcaccacc ctccgaagtg agagcagccg 3840 cctcacacag tattggatgt cctacatcga cctcgtagag attttgctgg ggctccttcg 3900 agcaacgagg gagggtgatt ggctgctaca tcaagtgtcc atccgccagc tcatcccatg 3960 gtgctttgcc tatgacaggg taaactatgc aaggtacctg tcatactact atgcacagat 4020 gtcacaactg caatcccaac atccagacgt gtatgcagag tttatgagag gacacttttc 4080 tgttcagatt ggccatgcaa acccgtttgg acgtattccg gttgaccaga ctatcgagga 4140 gacggtgaac aaagacacac aaaccgccag ggggacaaag gggttcagcc ttaaccactc 4200 agccatatca aagtactacc tcacagcaga acacaggtga gtatctactt gaaacaacta 4260 agggacatga ctagtggtgc caagagtagt aagttttctc ataaagatct acaaactaca 4320 cgcattagaa gggatgaagc cgatgtcagc tcattggtcg atttaatgga gaacaactgg 4380 atcaacccaa tgtcctcaga agagcccgac cttgtcagct tgtccacggg cagtgtggca 4440 cctcccgaca tagtgaagca tatttccaac gcacaccaaa tgggagagga cgcctatctc 4500 tccttccgaa ggaatcgcct ggagcaagac ccaccagaag taaagttcca tgacaagctg 4560 acaaaactaa aactgaagac attctccaat attggcacta agaagaccag tgaaaagggc 4620 aaaaccaaag aagttgtcct gcaggcggac aggaacctgt tcaggcacat catactgatt 4680 gcagaaagca ggcagctccg cattaaagat gtcctgacac atcctctcgg tccccttcca 4740 tggtctcttg caaatgctga tggcacttta cgtaagacca acaaagctgc ccttgcaaga 4800 gagctggaga gtcgtgtttc accagcagaa gacaccccca gctccatcca cctgtctgat 4860 cgatggcatg agcatcctgc aaaagatcaa cggcaacaac ataacgtttt ctgggctagc 4920 aaacaccaca atgtccatag ttctaaagga aggtgtgaac agtcacagaa ttgatgttgt 4980 atttgacgtg tacagggacc agtccatcaa agatgctgag tggttcaagc gtgggtccag 5040 tacagcatta cagtacaaaa ccataactgg tggacatcaa gtcaaacaat ggagaaagtt 5100 tctgtccagt tcaaacaaca agtcatccct gatcaagttt ctgattgaag aatggaagca 5160 gccaatctac agaaagagag aaacttagga acaagacgct gtatgctacc tgcgaggatg 5220 cctgctacag gttcactaaa gatgaatggg tggatattgc ggagctacaa tgctcacaag 5280 aagaagctga cacgcgcctc ctacttcatg cactccacgc tgctgagtct ggttctgaag 5340 cagttgtcat cacagcagaa gacacagaca tcatggtcat cagcctcgcc ttcgctaaac 5400 gtatcccatg caaggtgtac caaaagtgtg gcacaaagaa cagaacacgc ttcattgaca 5460 ttgacaagtt ggcagacgca ttgggagaag aggtctgtaa agcattggtt ggactacatg 5520 cattcacagg tgggtaccag tattgcctta cctcacgaag atttacatgt acactatgca 5580 tgtacaccaa aaacattttg catgataagt cccacaatat ttttaaacac atactttgca 5640 tagttttgta acatgaattt tgcttaacag atacaatcaa ttgaatagac acacctcaca 5700 tctacaggag ctgatagtga atgattaaat gccactctag tctaaatgtt gtttcccagg 5760 atgcgatact gtcatcgcat tttctggccg tggaaagttg ggcgctttca agcttatgtt 5820 gaagaatacc agaatgcatt tcagcaactg ggagaaagct ggacggtacc agtgtctcct 5880 gatggcaccc tgttcaagag gattgagcgc ttcacctgtc agatgtacgt gtcgtctacc 5940 cctcgtagct gacgtaaacg agatgcggca ccacttgttc attgccaaaa agggaaatgt 6000 ggaatcaagt gcactaccac cgtgccgaga ctgccttcac ctacatgtta aacgagcaaa 6060 ctaccaagcc ggtatatgga gggggtgcct ccagaacgac cctcaggtac caagcccagt 6120 agatgcagga tggaagttag atgaagatgg taatctctcc atcacatggc tacagtcacc 6180 tcccgcgcca gctgctgtac tggagctgct gacatgtagc tgctcgcggt cgtgtactct 6240 gccctcctgc acctgtcttg caaatggttt gaactgcaca gatatgtgta aattgaaaga 6300 ctgtagtaac aggaaagagg aagaaaccga agaggatctg gacattgaac tctcctctag 6360 tgatgctgat gacacatcta gtgatactga tgatgaatag ataataatca aatgtaaagt 6420 tgacatgatc atgataactg tgcttgtaca tgccatattg tgtaaatact cagggcatat 6480 gggttgttca tctatataga caaaaaaaac aaatttgttg ctttgcagta tgtagttaca 6540 gcttacagct atggtttatg atatggcatt gccatatgtt atcaagtaga taaaaagtta 6600 tttattaact tgttattaga taaacataat actcattaac tcaatatctt attggtcagc 6660 ttattctaaa cctgattact ctcccaaaac aaaagtttct cgacaaaaaa gtggatttag 6720 cactgcattt tgcttggaat tgcagcccag cacctccata gcaacggccc tagcaacggt 6780 ttccccaaca caaggcctgt aggtgtgtac ttatttaggt actcatatct ctagatagaa 6840 atggtggatt ttcatgattt aaactttaga atgcagggaa tgatacaaca aagcagataa 6900 aagcatgaaa aaaggtcaaa tgtgagttcg gaataaatgt cagcctttga aaagtagaaa 6960 ttaacatttt tcatccaaaa tttgctcaaa aattataatt caaagactca taactcttca 7020 taggatcaag atattgaaat gaaattaagt tttctgtaat ccttggttca ttctgtatca 7080 gcccagcaag tttcatttga aaatattatt gtttaattga tttacatacc catttgtata 7140 atcatatcaa agggcgtggt ccagatcggg gtcaaaaggt caagggcaga tagccgggtc 7200 aaagctacca ctctgcgatg gctatcatat caaaatatat ccctcagata ccctactact 7260 taaaaatcat aaggaaaatt tgccccctca ggtggtacca atatctcctc tggcccatgg 7320 acta 7324 // ID DNA8-50_AP repbase; DNA; INV; 552 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-50_AP. XX NM DNA8-50_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-552 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1980-1980 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 552 BP; 198 A; 51 C; 64 G; 239 T; 0 other; tagggcagtg aagttgttgc atttgcatat tttttgtaaa tgcttatatt tattgctgag 60 tgagcttaaa atatgtttcg ttaaactctt aatttaaaca aaaaaaaatt atattgcaat 120 aaatgcaatt ttggcgattt ttttaatttt actgcaattt taactatttt tcataatttt 180 cgtttacatt taagattata taacttaata tataaaataa aagtaagtaa acgtagcaat 240 gaacaaattt ctatatttgt tgtaagtaat acttattatt ttattaatat ttctatagag 300 cgattatcgc attaaggtcg ctaagaaaac ttaaaaaaat catttgttat gcaatgcaat 360 aatgtaagca atattcatat ttaattttta ttattattta ttaatatgtt tgattttatt 420 gattataatt aaattttttc gatgcaattt ttaatatatt ttaatgcaat aagtgcaatt 480 aatttagtgt attttaatgc aataatgcaa tcaatttagt gagtttttta atgcaataac 540 ttcactgccc ta 552 // ID BEL-235_AA-I repbase; DNA; INV; 6012 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-235_AA_; KW BEL-235_AA-LTR; BEL-235_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6012 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 925-925 (2011). XX DR [1] (Consensus) XX CC 'CAAGG' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 50..1063 FT /product="BEL-235_AA-I_2p" FT /translation="MQSAKVCGKCSSSGITGGMVGCDMCDAWFHPGCVEVS FT ETNHNPDKTWRCTRCVNDDVREVASQHTSKSLGSSASSKARRELLLQQLEE FT QRALKLKQRAEEDEIRKKRAEEDEAFLQQKLDIILEDDIESRSSRQSSRAS FT RRKVEDWLNDGQGGQRAAQSKSIPQQPVAQQFASTSTAHDQGQVPYIFAPP FT TGVPTSTSTPQSGKVVTALEVDNPKLSVPQQLGAFSYLKDTSRDNYGISTL FT ASTQLVTAGGNIQSFPLHASPQVPQVCNNLESLRAKQSFADTKVTFSGQLP FT QTVSSKPHALFGPQIGYAPTSQILSNVPPGSQSAIPLSTANVNSMP" FT CDS 3363..4895 FT /product="BEL-235_AA-I_3p" FT /translation="MFHQIRIIPQDKQAQRFLFRESHKELPQIYVMDVATF FT GATCSPCSSQFVKNKNAQEFQPQFPRAAEAIIKAHYVDDYLDSVDTTEEAV FT QLVNEVKHVHALGGFEIRNFLSNSPEVLRQLGETDCLQKKSLDLDGSSHVE FT RILGMVWKSTEDLFTFDVTFKEDLNKVLVQGVTPTKRQVLRLVMSLFDPYG FT FIAHFVVHGKILMQHIWRSGTEWDDDIPLELQGMWNNWIGLLGRLQEVEVP FT RCFFGESDSRIPSSIQLHIFVDASEVAYACAAYLRIVQDGVVRCVLVAAKT FT KVAPLKPLSIPRLELQACVIGCRMMETVQSALDLTIEKRYFWTDSATALAW FT IKSDSRRYHPFVAFRVGEILNNSSLDEWYHVPSKQNVADDATKWGNGPNFE FT ANCRWYIGESFLYKPMSEWPIQVGKNWKTDEEMRVVFHTHQKLPPQTINVN FT RFSTGIGCCGRQPTCIERRRISPGSRSALDKSSGCIRISFVDDVGVIRMNS FT RISNSPSISFETKTR" FT CDS 1606..3564 FT /product="BEL-235_AA-I_1p" FT /translation="MAVRNLVDHMYVAQLSDHLRNPMLLHELVEKLPSQLK FT MQWSWFKRSQVDVNLATFGEFMTELVNTASDVTLPSDVQQSRPNLTGKDKQ FT KLYVHAEARGSQATTRTLDTIKCVPEIDKLKRSCCYCSNEGHEVAACPQFK FT ALDVDGRWKAIRSKGLCRICLIPHRKWPCRSGKECGVEGCRLRHNALLHSR FT AADVVTRQNAPESRTPSSGVNVVQQNHHSTLNYCLFRYLPVTLEVNGKQVD FT AFLDDGCQTTLMEAGLAADLDVTGPVEPLWLGWTSNISREEKGSQRITVKI FT SGTGHKNQYQLSNVRTVQKLQLQGQTFHYDELQKIYPHLRGLPLSSYDEAV FT PRLIIGIEHAQLLTALKVREGGSSEPVAVKTRLGWCVYGKQAGNSVTVERL FT YVHTVEEQADNRELHDLMKKYFAVEEAAVATPIESADDNRARRILEQTTRR FT TGEGFETGLLWKYDNPVFPDSYPLALCRLQSLEKRLDKDHELRTRVISLMR FT EYETKGYAHKITQEEMELTDANRTWYLPLGVVKNPKKPQKVRLIWDAAARV FT HGLSLNDMLLKGPDMLTSLFAVLIRFRQSQSRYVETSAKCFTKSGSYHRTN FT RHNDFYSVKVTRNSHKSMSWTLQHLVPPVHPVPPNLLKTKTHKSSSHSFRG FT RQKQ" XX SQ Sequence 6012 BP; 1663 A; 1395 C; 1550 G; 1400 T; 4 other; aaactttaaa gattttctgc gaagtaggca gccacctcgc aacttcatca tgcagtccgc 60 aaaagtatgc ggtaagtgct cgtccagcgg cataacagga ggaatggttg gctgtgacat 120 gtgtgatgcc tggtttcatc ctggatgcgt tgaagttagc gagaccaacc ataacccaga 180 taaaacatgg aggtgtacgc gttgcgtgaa tgacgatgta agagaagttg ctagtcagca 240 caccagcaag tcgctgggaa gttcggcaag cagtaaggcc cgaagagagc ttcttttgca 300 gcaactagag gagcaacgag cattgaagct caaacaacgt gcagaagaag acgaaattcg 360 taagaagcga gctgaggaag acgaagcatt tctccagcaa aagctggata tcatcctgga 420 agacgacatc gagagtagaa gcagcaggca aagtagtcgg gcaagtcgta ggaaagttga 480 agattggctc aacgatggtc aaggcggcca gagagcagcg caaagcaaaa gtattcctca 540 acaacctgtg gcccagcagt ttgcatcgac gtctacggct catgaccagg gacaagtacc 600 ttatatcttc gcaccaccaa ccggagttcc cacctcaaca tctacaccac agtcaggtaa 660 agtcgtaaca gccctagaag tagataaccc taagcttagc gtaccacaac aattaggagc 720 cttttcgtat cttaaagaca ctagtagaga caattatggg atcagtactc tagcatcgac 780 tcagttagta accgcgggtg ggaatataca gtcatttccc ctccatgcgt caccccaagt 840 cccacaagta tgtaacaatc ttgaaagttt gagggccaaa cagtcattcg ctgacacgaa 900 agttactttc tctggtcagc ttcctcaaac agtttcttca aaaccccatg ctttatttgg 960 tccacaaatt gggtatgcgc ctaccagtca aatactgtca aacgtgccac cgggctcgca 1020 aagtgcgatt cctttgtcga cagcaaatgt gaattcaatg ccggsagtac tgccgcttag 1080 ttscgctgta tcatacgcga gtgtgccttt accgtcatcg tcaacagctg ttccagtttt 1140 ggatgcgact caacgagtgc cgatcaccaa cgtgacgacg atttcatcgc atttcggaca 1200 aacacagcca agtttacaac agttttgttc atcaccsatg ggtacagtgt atcagcctgc 1260 acctagtagt gtacagctag cagcgagaca gataatgccc cgtgaactgc cgaacttctc 1320 tggtgatcca caagactggc cgttgttttc tagctccttc tacaactcaa cggcggcttg 1380 tgggttcaca gatgcggaga acctagcgag actccaacgc tgtctgaaag gacatgcttt 1440 agagtcagtt aaaagccgtt tgttgatacc ggaatcggtg ccgcatgtaa tggaaacgtt 1500 acgcatgctg tatggaagac cggaagtact catccacacc cttttaagga gattgcgaag 1560 cgtgccacca ccgagaacgg agaatttatc gtaatcacat tcggaatggc ggtacggaat 1620 ttggtcgatc atatgtacgt ggctcaactt tcggaccatc ttcgtaatcc aatgcttctt 1680 cacgagttgg tggaaaaact tccgtcacaa ttgaagatgc aatggtcgtg gttcaagcgt 1740 tcgcaagtgg atgtcaacct ggcgacattc ggagagttca tgactgagct agtcaataca 1800 gcgtcagacg tgactctccc ttcggacgtt caacagtcaa gaccaaacct aacagggaaa 1860 gacaagcaga agctgtatgt tcatgcagaa gcaagaggca gtcaggcgac cacgagaact 1920 ttggatacta tcaaatgtgt accggagatc gacaaattga aaagatcttg ttgctactgt 1980 tcgaatgaag gccatgaggt agccgcatgt ccgcagttca aggctttgga cgtggatgga 2040 aggtggaaag ccattcgttc gaagggtttg tgcagaatat gtttgattcc acatcgtaag 2100 tggccgtgtc gttcagggaa ggagtgcgga gtagagggct gtcgactacg tcataatgcg 2160 ctgctgcact cacgcgctgc agatgtcgtg actcgtcaga atgcacccga aagtaggaca 2220 ccgtcgagcg gtgtgaacgt cgttcaacag aatcatcatt caacgttgaa ctactgcctt 2280 ttccggtatc taccagtgac tcttgaagta aatgggaagc aggtggatgc ctttcttgac 2340 gatggctgtc aaactacact aatggaagct ggattggcgg ctgatctgga tgtcacggga 2400 ccagttgaac ccctttggct tgggtggacg agcaacatct cccgtgagga aaaggggtca 2460 caacgaataa cagtaaaaat atccgggact ggacacaaaa atcaatacca gttaagcaac 2520 gtgcgtacag tgcaaaaact tcagttacaa gggcagacat tccattacga tgaactacag 2580 aagatctacc cacatctacg gggcctcccg ctgagcagtt acgacgaagc agtcccaaga 2640 ctaatcatcg gcatcgaaca tgcgcagctg cttacggcat taaaagtgag ggagggcgga 2700 tcgagcgaac cagtggcagt taagactcga ctgggctggt gcgtgtatgg gaagcaagct 2760 ggtaactccg tgacagtcga aaggttatac gttcacacag tggaggaaca ggcagataat 2820 cgtgagttgc acgatctgat gaagaagtat tttgcggtcg aagaagcagc agtggctact 2880 ccaatcgagt ctgcagacga taatcgagcc cgtcggattc tggagcagac aacacgcaga 2940 actggagaag gctttgaaac cggattactc tggaaatacg acaatcccgt gtttccagac 3000 agctatccac tagccttgtg taggcttcag tcgttagaaa aacgactcga taaagaccac 3060 gagttaagga cgagagttat atctctgatg cgggaatacg agacgaaagg gtacgcacat 3120 aaaataaccc aggaggaaat ggaactaacc gatgcgaatc gtacatggta tctacccctg 3180 ggggttgtga aaaatccaaa aaagcctcag aaagtgcgat taatctggga cgctgctgca 3240 cgagtgcatg ggctgtctct gaacgacatg ctcctaaagg gaccggatat gttaacatcg 3300 cttttcgctg ttcttataag atttagacag agtcagtcgc gatatgtgga gacatccgcg 3360 aaatgtttca ccaaatccgg atcataccac aggacaaaca ggcacaacga tttttattcc 3420 gtgaaagtca caaggaactc ccacaaatct atgtcatgga cgttgcaaca tttggtgcca 3480 cctgttcacc ctgttcctcc caatttgtta aaaacaaaaa cgcacaagag ttccagccac 3540 agtttccgag ggcggcagaa gcaataatca aagcgcatta tgtggacgac tacctcgaca 3600 gcgtagacac caccgaagaa gcagtacagc tggtgaacga agtgaagcac gtgcatgcgt 3660 taggtgggtt cgaaatacgc aactttttgt cgaactcacc agaagtcctg aggcagttgg 3720 gcgaaacgga ttgcttgcag aagaagtcgc tcgatttgga tggttcatcg cacgtcgagc 3780 gtatacttgg gatggtttgg aagtccaccg aagatttgtt caccttcgat gttacgttta 3840 aagaagactt gaataaagtt ctggtgcagg gagttacgcc gaccaagcga caagttctgc 3900 gactggtgat gtcgcttttc gatccctacg gattcattgc acattttgtt gtgcatggta 3960 agattctaat gcagcatatc tggagatcgg gcacggagtg ggacgatgat attcctttag 4020 aactgcaggg gatgtggaat aattggatcg gccttttagg acgtctgcaa gaagtcgaag 4080 ttccacgttg tttcttcggg gaatcagaca gcagaatacc cagcagcata caattgcaca 4140 tatttgtaga cgcaagcgaa gtcgcttatg cgtgtgcggc ttatctacgt atagtacaag 4200 acggagttgt ccgctgtgtc ctcgttgctg cgaaaacaaa ggtagccccc ttgaagccgc 4260 tatctattcc tcgtcttgaa cttcaagctt gcgttattgg atgccgtatg atggaaactg 4320 ttcagtcagc gctggatcta actatcgaaa agcgctactt ttggacagat tccgctacag 4380 ctctcgcctg gatcaaatcc gatagccgac gatatcatcc cttcgtagca ttccgagtag 4440 gcgaaatcct aaacaactcc agtttagatg agtggtatca cgttccttcg aagcagaatg 4500 tggctgatga cgccacaaaa tggggaaatg gaccaaactt tgaagcgaac tgtcgttggt 4560 atattggtga atctttcctt tataagccga tgtcggaatg gccaatacag gtcgggaaga 4620 actggaagac tgacgaagaa atgcgcgtgg ttttccatac ccaccagaag ctacctccgc 4680 aaactatcaa cgttaacagg ttttccactg gtatcggttg ttgcggacga cagcctacgt 4740 gcatcgagcg aaggcggatt tctcccggct cacgaagtgc gctagataaa agtagtggct 4800 gtatcagaat ctccttcgtg gacgatgtag gagtgatcag aatgaatagc aggatctcca 4860 actctccatc gatttcattt gaaacaaaaa cccggtaatt ccgccaaaga atcatgtgat 4920 gattcgtccc tggttgacag ctatcatcga cgtttcttgc gtatgtataa cgacactgtc 4980 ttcaacgaac tcggcagcgc ttcaggattc cagtcgcgcc agttcatcaa gcgagtcgct 5040 agtgattgcc aacattgccg tgtcaaagtc gccgcacatc cgtgcattta gttataccgg 5100 cgttgactat taggcccaat catggtgaaa cagggtcgta gtttggtaaa gcgttgggtt 5160 gcatcgttca catgccttac tatccgtgcg gtgcatctag aaaccgtgca tagtctaacg 5220 actcagtctt gcgtgatggc agttcgtcga ttcgcgagtc gcagaggggc cccagcggct 5280 ttctactcag acaacgccac ttgcttcaaa ggagccagta atttgctatc ggaacagatt 5340 caggccatac acgagcattg tgcagtaacc tttacgaacg ccaagactac gtggatttca 5400 atccccattt gcactacatg ggcgggtgtt gggagcggat ggtccgctcc gtcaaggctg 5460 caatagcggc cgtggctgaa cctctcgtca cccaaacgat gttctggaga cagtattact 5520 agaagccgaa gcagtcgtca actctagatc atgacttaca ttccgttgat agtgcagacc 5580 aagaagcctg acgctgaacc atttcactct ttacggcaca gggattgttc aaccaaggac 5640 agcgtttaag tggaaggtcg actctaaggg acggctgaaa ctctcccagt atctatcgat 5700 caattctgga agagatgggt tcgcgagtac ttgcgcgttg agcggcggac taaatggttc 5760 cagccgacca agccactagc accgggagac gtagtagttg tagtcgacgt gaacaaacgg 5820 aatggatggc tgagaggccg catcatcgaa gtgttcaacg gaaaggacgg tcaggttagg 5880 agcgcaaggt tcagacgagt gaggctggag ttggctcatg cggccggccg taaagttgga 5940 ctktcggagc gtgcaggcca accgcaagga ccaagatatt gaaggttctc cggaacttca 6000 cgggcgggag aa 6012 // ID P-19_HM repbase; DNA; INV; 3198 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-19_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3198 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 365-365 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(393..896,900..2894) FT /product="P-19_HM_1p" FT /translation="MPKKCCVYGCKTNYQSTKNNKSFEKIPVYRFPKDKEQ FT RNIWIKSIPNSNLLVTDETVICQLHWPTFFEKVTVQGKQRPKIPPSVWPGI FT PLSQIPTPLPRLRTTQKSLSVVRNVRTDELSTFLFADIATYNDFKDVLLNN FT LRTFSFSLVTYMVGKTINIQSIIFNNGIPFLVKIFENLHFETFHCGIKCFI FT KSLCRNRINIVDSWSKFEEIIHFLNVMETDHKKNIIQQQISVMSAKLIGKK FT IYESDIIIRSFEYFSTSRVLYNKLRNDFQLPSVSTLTRITSKVSKIDDASF FT LNSVFNSIEEKKKICIIIHDEVYVKKMLLYHGGTLFGKSLDDPSLLAKTVL FT GIMVVCLKGGPKFLYKMIPISKLNSNFLFEQINSTIXLIKASSGDVKVVIC FT DGNRVNQAFFKLYRTIPEKPWKTEDGMYLLFDFVHLLKNIRNLWLTEKTGE FT LLYNDNGVQRTAKWEHLKCLYKLESEKLVKLSDLNEVAIAPKPIERQNVST FT CLKVFSDKTYHALLTHSQINSNSKDTAIFIKKVITWWKILNVKGHGADVRH FT NDPLEAPICSPDDYRLNTLLQFGDMALEMCCKKGKRVKQLSNDTAKSIHHT FT CYGIVELCKHLLETSHDYVLLGMFTSDHLEKDFGKLRQGSGGTYFINVQQI FT IEKLHINHTSLLLSLNXDIDYFDLSSGHNCPSCTYVLCEAGSEIFDNLEKL FT ESSISDSTKMSLIYIAGYVCRNEYNESNLFEQTMLYYEKFGDYLNSIDRGG FT LKIPLDNVCQWVFFCFILFHSIKDKVCRQSLSDIFLLVSEFYYFEIKQNHC FT NTLANIFLNNYCAQITPRSNKEPALKVLKLS" XX SQ Sequence 3198 BP; 1145 A; 410 C; 476 G; 1162 T; 5 other; catggcgatc tatactgaag gcctattcat cgcgcaaccg taagccgtaa ctgggggcct 60 ataatagtct gtataaaaat gcgtagatta aaagcatgca cgttgtaaag ctttctagag 120 aattttaaaa ttaatatttt ttattctttt atatcatcaa atatttgtgg tttttaattt 180 aatggttttt atgttataaa ttttattaaa taattttaat ctgtattttt gctctcaact 240 ttttaattct taacaatgtt aggagttgaa aagttagtaa aagagttaca ttatttatgt 300 cttcattttg gctgataatc tattatagtt gaatcacctt atgttattaa ttgcttattt 360 ttatawtctc ttcaataact tatactttat agatgcctaa aaagtgttgt gtatatggct 420 gcaaaactaa ttatcaaagy accaaaaata ataaaagctt tgaaaaaata ccagtttatc 480 gatttcctaa agataaagaa caaagaaata tttggatwaa aagtattcca aattctaatc 540 ttttagttac agatgaaact gttatttgcc aattacattg gccaacattt tttgaaaaag 600 ttactgttca aggaaaacaa agaccaaaaa ttccaccatc tgtatggcct ggaatacctt 660 taagtcaaat acctactcca ttaccacgtc ttcgcacaac acagaaatct ttaagtgttg 720 ttagaaatgt tagaactgat gaactcagca cttttttatt tgctgatata gcaacatata 780 atgattttaa agatgtttta ttaaataatc tcagaacatt ttcattttca ttagtaacat 840 atatggttgg aaaaacaatt aatattcaat ctattatatt taataatggt attccttaat 900 ttcttgttaa aatttttgag aatttacatt ttgaaacatt tcattgtgga ataaaatgtt 960 ttataaaaag tttatgtaga aatcgtataa atattgttga ttcctggtca aagtttgagg 1020 aaattataca ttttttaaat gtcatggaga cagatcataa gaaaaacatt attcaacaac 1080 aaatatcagt tatgtctgca aagcttattg gaaaaaaaat atatgaatca gatataatta 1140 tacgtagttt tgaatatttt tcaacatctc gtgttttata caataaatta agaaatgatt 1200 ttcaattgcc atcagttagt actttgacac gtattacatc aaaagtttcc aaaattgatg 1260 atgctagttt tttaaactca gtatttaata gcattgaaga aaaaaagaaa atttgtataa 1320 ttattcatga tgaagtatat gtgaaaaaaa tgttgctgta ccatggtgga acattgtttg 1380 gtaaatcttt agatgatcca tcattattag ctaaaactgt gttgggaatt atggtagttt 1440 gtcttaaagg tggtcctaaa tttctatata agatgatacc aatttcaaaa ttaaactcaa 1500 attttctgtt tgaacaaata aactccacaa taaracttat aaaagcatct tctggtgatg 1560 taaaagttgt tatatgtgat ggtaatcgtg taaatcaagc tttttttaaa ttatatcgta 1620 ctattccaga aaaaccttgg aaaacagaag atgggatgta tttactgttt gactttgttc 1680 atcttttgaa aaacattaga aatctatggc ttacagaaaa aactggggaa ctactgtaca 1740 atgataatgg tgttcaaaga acagctaaat gggaacacct taaatgctta tataaattag 1800 aatcagaaaa gttagtcaaa ttatcagact taaatgaagt tgcaatagct ccaaagccaa 1860 ttgaaaggca aaatgtttct acttgtctta aagttttttc agataaaaca tatcatgcct 1920 tgctgactca ttctcaaata aattcaaact caaaagatac tgcaattttt ataaaaaaag 1980 taataacttg gtggaaaatt ttgaatgtga aaggtcatgg agctgatgta agacacaatg 2040 atccactaga agcacctatt tgtagtcctg atgattaccg tcttaatact ttacttcagt 2100 ttggtgatat ggctcttgaa atgtgttgta aaaaaggcaa aagagtaaag cagttatcta 2160 atgatactgc aaaatcaatt caccatactt gctatggtat tgttgaacta tgtaaacatt 2220 tattagaaac tagtcatgat tatgttcttc ttggtatgtt tacaagtgac cacttagaaa 2280 aagattttgg gaaattgaga caaggttctg gaggtaccta ttttataaat gttcaacaaa 2340 ttatagaaaa attgcatata aatcacacat cactgttatt atctctaaat rttgacatag 2400 attattttga tttatcttca ggtcataatt gcccctcttg cacatatgtt ttatgtgagg 2460 caggtagtga aatatttgac aatttagaaa aactagaatc atcaatatct gattctacca 2520 aaatgtcttt gatatatatt gctggttatg tatgtagaaa tgaatacaat gaaagtaact 2580 tgttcgaaca aacaatgctt tattatgaaa agtttggtga ctaccttaat tctatagatc 2640 gtggtggtct taaaattcca ctagacaatg tatgccaatg ggtttttttt tgttttattt 2700 tgtttcactc tattaaagat aaagtatgtc ggcagtcctt gagtgatatc tttttgttag 2760 tttcagaatt ttactatttt gaaattaaac aaaatcattg taacacactt gctaatattt 2820 ttttaaacaa ttattgtgca caaattacac cgaggtctaa taaagaacca gcattaaaag 2880 tgttgaagtt gtcataaatt tatttagaaa aaaacaatgt gtatttgata ttgtttgaga 2940 taagactatt gagactattt tctttgtttt taacttttct ttttcattaa ataaaaaatg 3000 agaactcatt gttgcttttt tcatttataa gtattcttat attgactaat taaatttatt 3060 aaaaaatgat atttcgaatc aagattataa attattaaag cagagttgtt aaaaaacttt 3120 aaattggaaa aaaataatag gcccccagtt acggcttacg gttgcgcgat gaataggcct 3180 tcagtataga tcgccatg 3198 // ID Copia9-NVi_LTR repbase; DNA; INV; 387 BP. XX AC AAZX01000081; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia9-NVi; KW Copia9-NVi_I; Copia9-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-387 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1117-1117 (2007). XX DR Genome; AAZX01000081; Positions 4653 4267. XX SQ Sequence 387 BP; 110 A; 101 C; 65 G; 111 T; 0 other; tgttgagaaa tatactttct accaatgata cgcacacaca gccctcagct gatcgagacg 60 agcagagagg gtagctacct ggcttgggtt ttattgcacc cgagcggcca ccgacgctcc 120 ccagcagaga cgcgcacaga gggcctcact tccgaacgga ccgccgacca gcgtacacat 180 cagcccccgc aactatgtct actctacagt tttaccttgt tctctcttta tattagattc 240 agtgtacaac gaacatttta aattagtgtt acatatattg tatttacttc atttacctct 300 agtgctgttt aaaactctgt ataatctcgt atatttattt gaataaagtc taataaataa 360 atactaccac ttcacctaac tccaaca 387 // ID Gypsy-19_SI-I repbase; DNA; INV; 4963 BP. XX AC AEAQ01023421; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_SI_; KW Gypsy-19_SI-LTR; Gypsy-19_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4963 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023421; Positions 5371 409. XX CC Positions [2427-2933] - Reverse transcriptase CC Positions [3990-4466] - Integrase core CC 'TATATA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 108..4961 FT /product="Gypsy-19_SI-I_1p" FT /translation="MTETTTTLGEIWPNRLSCYTTKLKDINILHLLCYSQE FT GNPKTNRKNVRAFKGFRFANDEEYEVKRNLIVNTFSVDELVKICEWFQFDL FT GSKSGSKEKLADCLCTRLNKLNQLKPDDESEDEDDEDAHTNDGEVNDENDR FT REEERTMNPNLSGEPDNNVQVRGNSRRFSGDEGTSRQACQFVPLRSNSEHD FT ARCQKDDHRQREMERRQRKEVSRHSEEGTQYYEDEIRRLNDDIRRLNARRH FT IYDARRVNDDGYHDEEQRREDYVRRDYKQEYNEQRIPYYGETITRMNNNIR FT PRDVEALIPTFTADDDYPIEKWLSDFEDIAQMANLNELERAVFGRKAIKGT FT AAKFLRCQRHTERWTELQKSLLHEFRRKKTSAELHEELTKRKQKSNESLQQ FT YLYAVTEIASYGDISDESIIDYVIRGINDDDVNKMVLYGARTLDELKNKLR FT VYTKYVSSQSITTKDKTKIRKVNEVKPTKTTTRDTTDMKKERCYNCGRSGH FT KSDSCQDKEKGKKCFNCDQFGHISADCSQTRRTSAKQKADKSAETTTKNAY FT LAKQAEDFHEMKKHVSINGHNMIALIDTGSEVNMIKFEKYDELGKPKLNSR FT ATTITGLGQYEVKTMGGFHAKVNVDDVKTEMEMHVVPSSAIQYDVVLGQSL FT LQNNTIVINQHTIKLVTPAATDELIIEEEKMHTCELKDGQLSEASEKNPLC FT YITLNNDEINLDVGNQRYFETIVNLVNNYQPADVKETNVKMKIILKDDQPI FT YQRPRRLSRPEQQEVDRQMKQWLKDGIVRPSSSEYASPIVLVKKKTGDIRI FT CGDFRALNKKMVRDHYPLPIVDDQFDKLTGAIIYSTLDLKNGFHHVPVEEE FT SRKYTSIIVPNGQYEFLRCPFGICNSPSVFQRYVNEVFKELIQKNIVLTYM FT DDLIIPATNEEQALERLKMVLETASKYGLQFNWKKSQFLKNDIEYLGHVIK FT NGAIKPSPRKIAAVMHFPKPITKKQIQSFLGLSGYFRKFIRGYALLARPLS FT EILKKDVPFRFGPEEEASFNQLKRKLSTEPVLKIFDHQKETELHTDASSEG FT YGAVLLQKDDVDGQFHPIHFMSKKTTDAEKKKHSYELEVLAVVEAFKKFRH FT YLLGIPFKVVTDCQAFAKTMDKKDPIPKIARWALYMQDFQFTVEHRSGRRM FT QHCDALSRNPVALFTQNELIQRVARAQRDDEYLQTLATVLQDKPYKDFCMI FT RDVVYKERDGKKLLVIPNSMTQDIIRRAHDMGHFAVKKTTEILKNEYWISN FT LEAKVERYIRNCIPCILAKDKGGKKESLLHPINKGNKPLCTYHVDHVGPLE FT ITNKKYKYLLVVIDAFSKFTWIYPTKTMSSEEVISKLEEQSALFGNPSVIV FT SDRGGAFTSQLFEEYCTKQNISHHLITTGTPRGNGQVERINKVILSILTKL FT SIAKPDYWYRHVRKVQQAINGNHQRAIGRTPFEIMFGTKLRHADDPSLQQL FT LQQELLECFINERDELRAEARAQIEKIQTENVKQYNLRRKEARKYTIDELV FT AIQRTQFTQGSKLLPGFIGPYRVTEKIGYDRYAVEKAGQQLGPRTTTTSAD FT KMKPWSEPHAANDVLDEVPDEITNQISQGVQNDVPDKSISESGTDSSQEWP FT " XX SQ Sequence 4963 BP; 1776 A; 958 C; 1113 G; 1116 T; 0 other; tattgggggc tcgtccggga ttgagcgagc agttgataaa tataaagata cacgacgacg 60 acgacggtta acctaaccta taagaaaaaa ggggcacggt gagaaaaatg acggaaacga 120 cgacgacgct tggcgagata tggccaaata ggttgagctg ttacaccacg aagctgaagg 180 acattaatat actgcatctg ctgtgctaca gtcaagaagg taacccgaaa acgaaccgta 240 aaaacgtccg cgcgttcaaa ggattccggt tcgctaacga cgaggaatac gaagttaagc 300 gaaatttaat tgtcaatacg ttctctgtcg atgagcttgt aaaaatttgc gaatggtttc 360 agttcgatct aggatctaaa tcaggatcta aagaaaagtt agcagactgc ttatgcactc 420 gattaaacaa attaaatcag ctcaagccag acgacgaaag tgaggacgaa gatgacgaag 480 atgctcacac aaacgatggt gaggttaacg atgaaaatga ccgacgtgaa gaggaacgca 540 cgatgaatcc gaacttgtca ggagagccag acaataacgt tcaggtacga ggaaattcac 600 gacgattttc tggggacgaa gggacatcgc ggcaagcttg tcaattcgtt ccgttacgaa 660 gtaatagcga acacgacgca cgatgtcaga aagacgatca tcgtcaacga gaaatggaga 720 gacgacaacg aaaagaagtt tctcggcatt cagaagaggg tacgcagtac tacgaagacg 780 aaatacgacg actcaacgac gatataagac gtctcaatgc ccgacgtcat atttacgatg 840 cacgacgcgt gaatgacgat ggataccacg acgaagagca acgacgagaa gactatgtac 900 gacgagatta caagcaggag tataacgagc aacgaattcc ttattacggt gagacgatta 960 cacgaatgaa taataatatt cgaccacgag acgtagaggc cctgatacct acattcaccg 1020 ccgacgatga ttatccaatt gaaaagtggc tgtcagattt cgaggatatt gcgcagatgg 1080 ccaatcttaa cgagttagaa agagctgttt tcggtcgcaa agcgattaaa ggaacagctg 1140 ccaaatttct acgatgtcaa cgacacacgg aaagatggac ggaacttcaa aaatctttgc 1200 tacacgagtt tcgaagaaag aagacgagcg ctgaattaca cgaggaatta accaaaagaa 1260 aacaaaaatc gaatgaaagt ttacagcaat atctctatgc tgtaacagaa attgccagtt 1320 acggagatat ttccgacgaa tcaataattg attacgtgat tcgaggcata aacgacgatg 1380 atgttaacaa gatggtcctc tacggtgctc gtacattaga tgaactgaag aataaattac 1440 gagtgtatac caaatacgta tccagtcaat ctattacgac aaaagataaa acaaagatta 1500 gaaaagtcaa cgaggtaaag cccactaaaa caacgacgag ggacacgact gatatgaaga 1560 aagaacgatg ctacaattgc ggacgcagtg gacataagtc ggatagttgt caagacaaag 1620 aaaaaggcaa aaaatgcttc aattgtgatc aatttgggca tatttcagcg gattgctcac 1680 agaccagacg aactagtgct aaacagaaag cggataagtc agcggagact acgacgaaga 1740 atgcctatct ggctaaacaa gcagaggatt tccacgagat gaagaaacac gtttccatta 1800 atgggcataa catgattgct ttaatcgaca cgggcagcga agttaatatg ataaaattcg 1860 agaagtacga cgaattggga aagcccaagt taaattcaag agcgacaaca ataactggcc 1920 taggtcaata tgaagttaaa acgatgggcg gattccacgc aaaagtaaat gtcgacgatg 1980 tcaagaccga aatggagatg cacgtagttc caagcagtgc aatacagtac gacgtggttc 2040 tcggtcaatc attgttacaa aataacacga tagtgataaa tcaacatact attaaattgg 2100 tgacgccagc tgccacggat gagttaataa tcgaggaaga aaagatgcac acctgtgaac 2160 taaaggatgg tcaactatca gaagctagtg agaaaaatcc tttgtgctac ataactctaa 2220 acaacgatga gataaatttg gatgtaggca atcaacgata tttcgagact attgtcaatt 2280 tggtcaataa ttatcaacct gcagatgtca aggaaacaaa tgtcaaaatg aagattatac 2340 tcaaggatga ccaaccgatt taccaacgcc cacgaagatt atcacgacct gaacaacaag 2400 aagtcgatcg gcaaatgaaa caatggttaa aggatggcat agtgaggccg agctcgtcag 2460 aatacgcaag tccgattgtt ttggtcaaaa agaagacggg agatatcagg atatgtggcg 2520 acttcagggc attaaataag aaaatggtgc gagatcatta tcctttgcct atagtggacg 2580 atcaatttga taaattgacg ggagcaataa tctactcgac gcttgacctc aaaaatggat 2640 ttcaccacgt gccagtcgaa gaagaaagcc ggaaatacac gtctatcata gttccgaacg 2700 gccagtacga attcctacga tgtccatttg gcatatgtaa cagtccttca gtattccaac 2760 gctatgttaa tgaggtattc aaagagttaa tccagaagaa tattgtgctt acttatatgg 2820 acgacctgat tatacctgcc acaaacgagg aacaagcgct agaacgattg aagatggtat 2880 tggagactgc atcgaaatat ggactgcagt tcaattggaa gaaatctcag tttctcaaaa 2940 acgatatcga atatttaggt catgtgataa agaacggagc tatcaagcca tcaccaagga 3000 agatcgcggc tgtgatgcac ttcccaaaac ctattacgaa gaaacaaatc cagagttttt 3060 tgggactatc tggatacttc agaaaattca tcaggggtta tgccttacta gcacgacctc 3120 tctctgagat actcaaaaag gatgtgccgt ttcgattcgg accagaagaa gaagcatcgt 3180 ttaatcagtt aaagagaaaa ttatctacag aacctgtttt gaaaatcttt gatcaccaaa 3240 aagaaacaga attacatacc gatgcttcaa gtgagggcta tggagctgtc ttacttcaga 3300 aagacgacgt tgatggacaa tttcatccca tacatttcat gagcaagaaa acgactgatg 3360 cagaaaagaa aaaacatagc tacgaattgg aagttctagc tgtggtcgag gcctttaaaa 3420 aattccgaca ctatctatta ggcattccgt ttaaagtggt gactgactgc caagctttcg 3480 cgaagactat ggataaaaag gatccgatac cgaagatagc acgatgggct ctgtacatgc 3540 aagatttcca atttacagtg gaacatcgtt caggccgcag aatgcagcat tgcgacgcac 3600 ttagtcgtaa cccggtcgca ctattcacac aaaacgagtt aatacaacga gtggccagag 3660 cacaacggga tgacgaatat ttacaaacgc tagcgacagt actacaagac aagccgtaca 3720 aagatttctg tatgatccgg gacgtcgttt acaaggagcg cgatggtaag aagttactag 3780 ttatacctaa ttcgatgacg caagacataa ttagaagagc tcatgacatg ggtcactttg 3840 ccgttaagaa gactacggaa attttgaaga acgaatattg gatttccaat ttagaagcaa 3900 aagtcgaaag atacattcga aattgtattc catgtattct agccaaggat aaaggaggta 3960 aaaaagagag tttattacat ccaataaata aaggaaacaa accgctatgt acttatcatg 4020 tcgaccacgt tggtcctctg gaaattacca ataaaaaata caaatattta ttggtggtaa 4080 ttgacgcttt cagtaaattt acgtggattt atccaactaa aactatgagt agcgaagaag 4140 tgatatcgaa actagaagag caaagcgctt tattcggcaa tccttctgta atagtatcag 4200 atcgaggagg agccttcaca tctcagctgt ttgaagaata ctgcacaaaa cagaacatta 4260 gtcatcatct gattacgacg ggaacaccac ggggaaacgg ccaagttgag agaatcaaca 4320 aagtcatatt atctattcta acgaagttat caatagcaaa gcctgattat tggtatcggc 4380 atgtaagaaa agtacaacag gccatcaatg gtaaccatca acgagctata ggaagaacac 4440 ctttcgaaat tatgttcggc actaagctac gtcatgcaga cgatccatcg ttgcaacaat 4500 tgcttcaaca agaattgctt gaatgtttca taaacgaaag agacgaatta cgtgctgaag 4560 cccgagctca gattgagaag attcaaacgg aaaacgttaa acaatataat ctacgaagga 4620 aggaagcacg aaagtatacc atcgatgaat tggttgctat tcaaagaacc cagttcactc 4680 aaggatccaa actacttcca ggatttatag gaccctaccg tgttacggaa aaaattggtt 4740 atgaccgata cgctgtcgaa aaggctgggc agcagctagg accaaggacg acgacaacca 4800 gcgctgacaa aatgaagcca tggtcagaac ctcacgcagc aaatgatgtc ctagacgaag 4860 ttcctgacga aataacaaac cagatttccc aaggagttca gaatgacgtt cctgataaaa 4920 gcataagtga gtccgggacg gactctagtc aggaatggcc gaa 4963 // ID Gypsy-8_AC-LTR repbase; DNA; INV; 192 BP. XX AC AASC02007279; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_AC_; KW Gypsy-8_AC-I; Gypsy-8_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02007279; Positions 7962 8153. XX SQ Sequence 192 BP; 56 A; 39 C; 36 G; 61 T; 0 other; tgtcatgtgt gtacatcaat agcttggact taactgacca gctatgttat aacggctcaa 60 ggccgtaacg tgaccttctt gataataaag gacaacttcc gttactactc attctcacag 120 aggtcttgtg tttgtttgtg agcacgtttt ataattattc gtcaagctat aaacatagcc 180 aagaacgtaa ca 192 // ID Gypsy-33_DPu-I repbase; DNA; INV; 7376 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_DP_; KW Gypsy-33_DPu-LTR; Gypsy-33_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-7376 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [4236-4703] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 575..2215 FT /product="Gypsy-33_DPu-I_2p" FT /translation="MPPQIKFLSPKVFKATSEDDASDWLERYESTGAYNQW FT GDTELRANFGMYLDGAARKWYLCSTLPTEWRDLPVRPGVGLNAPDLPAVTG FT VRTLFLKEFQQQNYRLFQETRLRNRVQGIEEATTNYYYDVIDLCRVVDPTM FT AEATKVDYLFGGLRPSLVEKLYPLQPKTCEEFLEAAKRFTDAKLLANRRNW FT PDAVLGVAATRAADVPIDFIRTIPKPAPTTADTELWKVIKELQGAVESLKI FT QATPPPKRPGNEKTVTWGESERIYRTDDGVLKCYCCNGTGHMARNCWDNPQ FT SARFRQRPYPGPPSPQRGALGHPAPINIVSQLAEPSADSSTHDVLKEQPIL FT KLDYSKLIREEVTCGTRQVMAVVDTGAALTVISPELLKESQFVMRPWDGPR FT VVMANGEQATLMGAATISVNHKMGTATGEAVVFGMDGIDLILGNDFLKQYG FT KVQIDYREPKASITFGDQPLAAITSQRTDHTTRSVKLISNNDVTIPSFSVA FT NVGTVAPALQAENFCFDPSGKLLETKGVSVGHALLAAKVTFVPLATDLTVH FT " FT CDS 3774..4703 FT /product="Gypsy-33_DPu-I_3p" FT /translation="MPFTVTEEQEEDRCVAIGLVGSGRPVDFGPEEEFVAE FT QRRVPRWRRLMDQLEAGRATLKNFCLSNGRLCLQTIKNGKQYRRLCVPRSF FT RERVVKAYHDDLMSGHLGVRRTLTKISNRFFWERLAIDVTNYVQSCPNCQG FT RKGVNKRPAGFLQCIQVARPFQKVGIDLLGPFPLSNTGNKMIIVAVDYLTK FT WVELRAMPTGKADMVATFFVEQIVLRHGAPESIISDQGKCFIAALTQEVMK FT NLGTNHKTTSSYHPQANGLVERMNHTLAAMLSMYVSADHKDWDEALPYVCF FT AYNTARQESTGYSPALRT" FT CDS 5077..6189 FT /product="Gypsy-33_DPu-I_1p" FT /translation="MKLFHDLVERGPNLNTEATEAMFDTPLPNVPPPPVEI FT HRETGPDNGTHGEEAAVLHDSNVRPVPSPGGVRKQKTVNRPVRLPTRIQPA FT RRAGRPDRYLALLLPYTICVLAFLGPVKVDTLLVRDTVIFNEKPGVTFGES FT FWTVITDLDIRPAEAVVQTLKVRLKEYSEMAVKCRKTGNQQGASAAKKLEV FT KCRWFERELNQSEHRLATFRDAIGSPSKQRRAVIDGGGSALKWLLGVATQA FT DLAGLNKKITGLTRRENEIVHLMDQQATVVNESLWEIRTNTKLIQELEGQT FT AELTKNYNNLLGRMAERQAYMIEYFDFFINLDTAFESVEASLQWLRDLADA FT LDEGLALLANGRLAPQIFPPAQMASVLK" XX SQ Sequence 7376 BP; 1974 A; 1699 C; 1991 G; 1695 T; 17 other; aatggtggag aagcagcatt acaaaggaga caaggcaagc acgtttatct tatagtatta 60 ttcattgtca gaggcgaata tcgtttatgg gctctaccac aagcggttta accaacgggt 120 tgtgccaaat cttaaattgt tggatacgga gccacgtttt cggctgacag tggataggct 180 tatttggttt ctattagctt tctctcagag tgaggtttga tttttgtgwc aagatgttga 240 gatattttaa tgcagcaagc aattaaagcc ggaggtaaat gagaaagcaa tcgaatccac 300 gttggcctag ttgtccaagg gatccgagca gacgcttaca ccctgtgcct ccatctacgc 360 gcccaataga ggacgtagtg ggaagtaggc agcaggtgct ccaggcggaa aagaagttag 420 agagtactgc gtcagatctt gaagaatctg acgtggacga gattattgag gcagcacacg 480 aaagtagcaa cgaaagcgaa gaagaggaca acgttgaagc actggagaag aacagttttt 540 ggccgatatc gatagtgatt cagatacgga cataatgcct cctcagatta aatttttgag 600 ccctaaggtt tttaaggcaa cgtcggaaga tgacgcttcc gactggctgg aacggtacga 660 atccacgggg gcatacaacc aatggggcga cacggaactt cgggccaatt tcggcatgta 720 cctggatggg gcggcaagaa aatggtacct ctgctccaca cttccaacag aatggcgcga 780 tttacccgtt cgaccggggg ttggcctcaa cgcaccggac cttccggcag taaccggagt 840 cagaacactg tttttgaaag aatttcagca acaaaactac aggctgttcc aagagacacg 900 cttgcgcaac cgggttcaag gtatcgaaga agcgacaact aattactatt acgatgttat 960 tgatctttgt agggtggtgg atccaacaat ggcggaggcc accaaggtag actatctgtt 1020 tggaggatta cggccctcgt tggtagaaaa attgtacccg ttacaaccca aaacatgcga 1080 agaatttttg gaggcggcaa aacggtttac tgatgccaag ctcttggcca acagacgaaa 1140 ctggccagac gcggtgcttg gggtggcggc gaccagagcg gcggatgtcc caattgattt 1200 cattcggacc attccgaaac ctgctccaac cacggctgat acggaattgt ggaaagtaat 1260 taaagaattg caaggtgcgg tagaaagttt gaagattcaa gcaacccccc caccgaagag 1320 accaggaaac gagaagacag ttacgtgggg agagtcggag agaatctacc gaaccgatga 1380 cggagtcctg aaatgttact gttgtaacgg aacggggcat atggcccgga attgttggga 1440 caatccgcaa tctgctcgtt ttcgacaaag gccctacccc gggcctccta gcccacaaag 1500 gggggcgttg ggtcatcccg cgcccatcaa catcgtgtca cagctagccg agccgtcggc 1560 agactcatcg acccacgacg tgttaaaaga gcagccgatc cttaaactag attacagcaa 1620 gcttatcaga gaagaagtaa cttgtggaac acgacaggtg atggcggttg tcgacaccgg 1680 ggcggcactg acagtaatat caccagagct gctaaaggaa tcccagtttg tgatgcgtcc 1740 atgggatgga ccgcgggtag tgatggcgaa tggtgagcaa gctacgttga tgggagctgc 1800 caccatttcg gttaaccata aaatgggaac ggcgacgggc gaggcggtcg tctttgggat 1860 ggacggcatc gatttgattt tgggaaacga tttcctgaaa caatacggca aggtgcagat 1920 cgattatcgc gaaccgaagg cctccattac ttttggagac cagcctcttg cagccattac 1980 gtcgcagcgc acagatcaca cgacacggtc agttaaattg atctccaaca atgatgtaac 2040 aatcccttct ttctcagtgg ccaacgtggg tacagtagca ccagctttac aggcggagaa 2100 cttctgtttc gatccatcag gaaaactttt agaaaccaaa ggtgtgtcgg tgggacacgc 2160 gttactggca gccaaagtaa ctttcgtccc gttggccacg gatctaacgg tccactagcg 2220 tgtggattcc aaaggggacg acattgggag tcgtcagtgg atacgacgga gaagtgttga 2280 gctgtggcct aacgacaaaa gaagaggacc cctcaatgac cagaccacca acaaaggaag 2340 aagtggatcg gcggaggaca ctcatagacg acctaaaaaa gcaagtcaat cacggcattc 2400 cgaaagagaa acgagaaaaa gtgttccaat tgctacaaga gaatttggaa tgttttgcgg 2460 caaacgcttc agaagtaggc cgctgcaaca tatccgaaca caacattgaa acgggagacg 2520 ccccgcccgt tcaccaggcc ccgtatgcca gcgcgtggaa ggctagaaca atcgtgaatg 2580 gacaagtaaa aatcttggaa gatgcgggaa ttatcgaaaa tcggacagcc cgtggccgca 2640 ccggtggtgc tagtcagaaa gaaagatggg acatggagat tctgtgtgga ttatagaaag 2700 ctgaatgctg taacgatgag agatgttacc ctctaccaag gattgacaga tgtgctgagt 2760 cggctggaag gagcagaatt tttttccatc ctgacttaca agccgggtat cgkcagatcc 2820 cagtgagaga ggaggacmgc cacaagacag ccttcattac agcgacgggc tgtatcagtt 2880 caaggtgctg ccattcggcc tgtcaaacgg cccggtcatt ccagcggtca atggattcgt 2940 ttggcgggct tcgatggaca gcatgcctcg tctacttaga tgacgtcata gtgtactcgt 3000 cgacaattga ccccacgttg acggctacgg cggtttggat tttaaagaag gccggcctaa 3060 gttaaaagtg tccaaatgtc atttgcggaa actagkttaa aagtgttagg tcatgtggtg 3120 gatgcggatg gcattcgtcc agacccggac aaattagcag cggtaaggga gttcccaccg 3180 tgcaacgaag ggaagacggt ggcccttaaa gtgaagaaag tacagagtta tcttggcctg 3240 tgctcttatt atcgtccaca tgtcaaagat ttttcaatga ttgcacgccc gttgattttg 3300 ttaaccaaaa aggatgcagt gttcgattgg ggtcctgatc aggagtccag cttcaacatt 3360 ctgaagcaag cgctgcttac agccccagtc ctggctcatc cgaattatga cctaccaatg 3420 gagatcattc ccgatgcctg tggctacggc attggagcag tgctggcaca acgagtggat 3480 gggcaagagc atccgttggc ctatgccagc cgcctgttaa gcagctccga aatcaattac 3540 agcatcaccg agaaagaatg cctggccttg gtttgggcct taaagaaatt taaaggctat 3600 gtatggggat gcaagatcgt ggtagttacm gaccaccagg cgctgtgttg gttgttaacc 3660 aaacgagatc tcgctggacg gctagcccgt tggagcttgt cgatacagga gtatgacatc 3720 gaaattcggt atagaagcgg aaaacttcat gacaacgcgg attgcctctc gcgatgccct 3780 ttaccgtgac cgaagaacaa gaagaagatc gctgtgtggc cattggcctt gtggggagtg 3840 gcagaccggt ggattttgga ccggaggaag aattcgttgc ggagcagcgt cgggtcccgc 3900 ggtggcgtcg gctgatggac cagttggaag caggtcgtgc aacgctaaaa aacttttgtc 3960 tatccaatgg gcgactatgc ttacaaacca ttaaaaacgg taaacagtat cgtcgcctgt 4020 gcgtcccacg ttcctttcgg gaacgcgtcg taaaggcata ccatgatgac ttgatgtcag 4080 ggcatctggg agttcgacgt accctaacga aaatttctaa tcgattcttt tgggagcgtt 4140 tggcgattga tgttacaaat tatgtgcaat cgtgtcctaa ttgccagggt cgaaaaggcg 4200 tgaacaaacg gccagccggt ttcctgcagt gtattcaggt ggcacgccca tttcaaaagg 4260 tgggaatcga ccttttgggt ccgtttccct tatccaacac cggaaataag atgatcatcg 4320 ttgctgtgga ctacttaacg aagtgggtgg aactacgagc catgcccacc ggaaaggcgg 4380 atatggtggc cacatttttt gtcgagcaaa ttgtgctgcg tcacggagcc ccggagtcca 4440 tcatttcgga tcaagggaaa tgttttatag ctgcacttac ccaagaagtc atgaagaatc 4500 tcggaaccaa ccacaagaca acgtcaagtt accatccgca ggcaaacggt ttggttgagc 4560 ggatgaatca taccttggcg gccatgctgt caatgtatgt cagcgcggac cataaggatt 4620 gggacgaggc attaccttac gtgtgtttcg cctacaacac ggcgcggcaa gaatctacgg 4680 gatattcgcc agctttacgg acgtgaacca atgttaccta ttgatttgga attaggtgct 4740 gatgccaacc ctcggctggt cacgtcgaga acggctccgg cttatgcaac ccatgttgtg 4800 accgaattag cgaaggctcg agcattggtg cacacgcgat tgggagtcgc ccaggacaac 4860 caacgacgtc agtacgacgg tcgccacaga gagctcmaat ttaaagtagg cgaccaagtc 4920 ttggtgtaca aaccctttcg caaagtgaaa agggcagaaa aactgcttca tcgctggcag 4980 ggaccgttca acgtgatccg ccagaccact cctgtgaact acgaagtgaa gttgagttcc 5040 ggatcaagaa aatcagagat tgtgcacgtg ataaagatga agcttttcca tgacctggtc 5100 gaacgaggtc caaatttgaa cacggaagcc acagaggcga tgtttgatac acctctcccc 5160 aacgtccctc caccacctgt ggagattcat cgggagacag gtccggataa tgggacccat 5220 ggcgaggagg cggcagtttt gcacgacagc aacgttcgtc cagtgccttc ccckggagga 5280 gtkcgtaaac aaaaaacagt aaacaggccg gtccgtctcc ccactcgaat acagccggct 5340 cgtcgagctg gtcgaccgga tcgttacctc gctttgttac tgccttacac aatttgtgta 5400 ttggccttcc tcgggccagt caaggtagat actttgctcg tccgtgacac cgtcatcttc 5460 aatgaaaagc cgggtgtcac gtttggagaa tcgttctgga cagtcatcac agatctcgac 5520 atacgacccg cagaggcagt agttcaaaca ttgaaagtga ggctgaagga gtactccgag 5580 atggccgtta aatgccgtaa aacaggaaat caacaagggg catcagctgc gaaaaaactg 5640 gaggttaagt gtcgttggtt tgaacgcgaa ctgaatcagt ccgaacatcg actagcaact 5700 tttcgggacg ccatcggttc tccttcgaaa caacgtcgag ckgtgattga tggcggcgga 5760 tcagcgttaa agtggcttct cggagtagcc actcaggctg atctcgccgg tttgaataag 5820 aagatcaccg gccttacccg ccgggaaaac gaaatcgtcc atttaatgga ccagcaggcg 5880 acggtggtaa acgaatcact ttgggagatc cggacaaaca cgaagttgat ccaggagttg 5940 gaagggcaaa cagcggaact aaccaagaat tataacaatt tgcttggacg gatggcagag 6000 cgccaagctt acatgatcga gtatttcgat tttttcatca atctcgacac agcatttgag 6060 tcagtggaag ctagcctgca gtggcttaga gacttggcag atgctcttga cgaaggattg 6120 gccctcttgg caaacggccg attagcgcct cagatatttc cacctgctca aatggcttcg 6180 gtgttgaaag magttaatcg acaactgccc ttgggttggg cagtatcgag tgaagaatta 6240 tgggtaacct atcgggaagc catggtgacg gtggcggmgg tggaggggcg ctttcgtmtc 6300 tttatcaaaa ttmcaatcta cgatcacgca cagcagcata ccctttttga aatttccagt 6360 ttaccacgag caacagacaa tggcacccag gaaaaatggg taggtctccg atcaccacga 6420 tcggagacct acccattttc tggccgtttc cacagacttg gaaactttta ttgaactttc 6480 macagacgam attcggggtt gtaagmaact ggaacgatta atttgtaatt ttcatacagg 6540 tctcggaaac ggggagcasg aaaatcctgt gctatttcat tgttcaccaa tgacgccaac 6600 cgaaagctaa cgcaatgcag gcaacaattt aaggagtgga aaggatccga agtggcttat 6660 cttggggaaa atcgctgggc attctcagcc attacagctc atgaggtggt gttctcctgc 6720 ccgatcggta gcagccaagg gccaccacaa tccctgttgc tgcctccgtt tgggattttt 6780 gatgtccctc ctggatgtac ggctagaacg gaagactggg tcttccccgc cagtcttgat 6840 gggcgatcag aggcctcgtt ggatcctctg gtggcaccca ctctgggcgc agtaggattt 6900 aatgtgacca catttaaatc agtcgccgtc attgagctcc cgaaggcgaa tgccacatct 6960 attaacttca tcagcgacct actacgccga aacgacggcg ctcgtgcgtc gtcggagatg 7020 acagggttcc agattcaaga actcatgaag aagacgactg aagaatttca gctgacggag 7080 tccagatatc cgtttgaact tttgtccatg ctactattac tagtattcgc aacgacgtat 7140 ctcagttacc aaactgtttg gctgagaggc cgtatgaggg cccacgaaca attagacagt 7200 caagacgatc acttcctacc acacctcgaa caggtggaag tggtttaata cgaaattctt 7260 atattttcgg gtgtttggtt ttctgattta gggttttttg gttttggggt acccgttctg 7320 tgggttttat tttgctgtag aagcagtcgg ggcgaccgcc ttcacgaggg gggatc 7376 // ID BEL-173_AA-I repbase; DNA; INV; 5822 BP. XX AC AAGE02025208; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-173_AA_; KW BEL-173_AA-LTR; BEL-173_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5822 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025208; Positions 18488 12667. XX CC Positions [4860-5411] - Integrase core CC 'CACAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..4185 FT /product="BEL-173_AA-I_1p" FT /translation="METGRHDTTAGNYHCKKCNRRDSADTHMVACDQCHVW FT THFGCAGVDESIKDRKYVCNACEALEGVASGMQQLKPPGADNKSTKSSKRT FT KKNSNTPVPSMTSSVREALLEAQMKMIEERQLLEEQALREQEAIRVKQIEE FT ERRQLDDKRRLAEEENQLRERQLQEEKAFQAKQQLIRQQSLDKKHELLRQI FT AEASSRPGSVVDSRGKVSSWLAHQHAVVDEDAYDCGAGAPHIVNQQERNNG FT GGTALPAAPMSENRSHRPGLQAQSPVQEEETITPFRTSKSLNPTALSTQHF FT AARQVIGKELPTFGGSAEDWPIFISCFEQSTDACGYSDAENLIRLQRSLKG FT HALESVRSRLLLPSSVPHVISTLRTLYGRPELLVRSLIAKVHRVPAPRQDR FT LETIMEFGLAVQNLVDHYRAAHQKNHLANPTLMQELVEKLPGPLRLDWAVY FT KSQHPNVTLRTFGGFMSRLVTAASEVTYDLPVHSKPARNEKIKSKEWGIVQ FT AHTAVQYSTERPNRDSRKVSKPCALCEREGHRLHECPQFRTFDVDERWKFV FT QQKGLCRTCLNGHGNWPCKSWQGCAHQDCRLKHHTLLHPPTTAVYHSVNIS FT SNCSAGNEKLFPIFRIIPVVLRNRGRSEIVYALVDEGSSLTLLEKSIAEQL FT QVVGPTEALTLQWTGNITREENESQIIQLDVAGKGSGTYHKLISVHTVDGL FT LLPTQTFRYQELAKQFPHLRGLPIEDQEQVQPKLLIGLENLRLAVPIAIRE FT GAAGEPIAAKCRLGWGIYGCALAPSGNKPVLNFHVSVASDSDRQLNEQLQD FT YFALDVSGVSGAGEKPESDENKRAMRILKETTRRTLRGFETGLLWNTDNPN FT FPNSYLTALRRLQSLERRLSKEPTLRERVNELIHEYEAKGYAHRATQNEIQ FT SSDPSRVWYLPLGIVTNPKKPDKIRLIWDASAKSDGVSLNSKLLKGPDLLT FT PLPTVLSRFRQYPVAVSGDIREMFHQLTVIEPDQQAQRFLWREDPAGEPHI FT YVMDVLTFGSTSSPSSAQYVKNLNASEFASEYPRAAGAILENHYVDDYLDS FT FATVEDAIRTVNEVKHIHSKGGFQLRKFKSNDVAVLRGIGELANDECKDLL FT LERGESIESVLGMKWIPKEDVFIYAFAPRKEFHQILCPEHVPTKREVLKIV FT MSLFDPLGFASFFLVHGKVLIQEIWASGSHWDQPIKDEICERWHRWTEMFD FT QLVKLRIPRSYFSSAIPTEFNNLQIHIFVDASEAAYSCVGYFRLDSESGIQ FT VSLIGAKTKVAPLKTLSIPRLELKAAVLGARFMESIQANHTFHVSQKYYWS FT DSSTVIAWIRSDHRRYNKFVELRIGEILMLSDPCDWKWVPSKINVADHATK FT WNKGLSFQNDSEWGQTFYTIPN" FT CDS 4155..5801 FT /product="BEL-173_AA-I_2p" FT /translation="MGPNFLYDPELTWPQQRTISTTQEEIKRCHTHWTPSP FT LIDVSRFSQWTRLHRAMAYVLRFIDNLRRKRNRQPLVIGLLQHEELVRSEQ FT SLWKIAQLEAFPVEVESLSKTRGPPEERHPTVAKSSSIYRSWPFMDSDGVV FT RMRSRLGSAFYIPVEARYPAILPRQHRITTLIIEWYHRRYQHANRETIVNE FT IRQRFEISKLRALIGKIIKNCVKCRLTKATPEYPPMASLPEARLSTFERPF FT TFVGLDYFGPVLVRVGRSTAKRWIALFTCLSIRAIHLEVVHSISTESCIMA FT VKRFVARRGPPVEIYSDNATCFQGASNELQREINDALALTFTSTNTSWKFI FT PPAAPHMGGVWERLVRSVKVAIASINETCRKPDDETLETIVLDAEAMVNAR FT PLTYVPLESANDEALTPNHFLLGSSSGVKIAPLETPSKPAVLRSSWKLAQH FT ITNHLWSRWIKEYLPIITRRSKWFEEVRELKEGDLVLIVGGTSRNQWTRGR FT VEKVVYGRDGRVRQALVRTSSGVLRRPAVRLAILDVNGNGKPSSEHDDLVN FT HQ" XX SQ Sequence 5822 BP; 1768 A; 1299 C; 1395 G; 1360 T; 0 other; aattccttta agaattttgt acgtggagct atggaaactg gacgtcacga tacgactgct 60 ggaaactatc attgcaaaaa atgcaatcgg agagattcgg cagataccca tatggtggct 120 tgcgatcagt gtcatgtatg gacccacttc ggatgtgctg gagtcgacga gtccattaaa 180 gatcgtaaat acgtatgcaa tgcatgtgaa gctttggaag gagtcgcaag tggaatgcaa 240 caactcaagc cacccggtgc ggacaacaaa tccacgaaat catctaagcg aaccaagaaa 300 aattcgaaca cacccgtgcc cagcatgacc tccagtgtcc gagaagctct gttggaggca 360 cagatgaaga tgatagaaga acgacaactt ctagaagagc aagcgttaag ggaacaggaa 420 gcgataagag tcaagcagat agaggaagag cgccgtcaat tggatgacaa acgacgacta 480 gcagaggaag aaaatcaact acgagagcgt cagcttcaag aagagaaagc gttccaagca 540 aagcagcagc tcataaggca gcagtcactc gataagaaac atgaactact ccgtcagata 600 gcagaggcta gtagtcgacc cggatcagtt gtagattcac gcggaaaagt ttccagttgg 660 ctggcacatc agcacgccgt ggttgacgaa gacgcgtatg attgtggtgc tggtgccccc 720 cacatagtaa atcaacaaga gcgaaacaac ggtggaggaa cggcgttgcc tgcggctccg 780 atgtcggaga acagatcaca cagacctggg cttcaagctc aatcgcctgt acaagaagag 840 gaaacaataa caccgtttcg tacttctaag agtttaaatc cgactgcact gagcacacaa 900 catttcgctg cacgacaagt tataggcaaa gaattgccta cgtttggagg aagtgctgaa 960 gattggccga tattcataag ctgttttgaa caatcaacag acgcttgtgg atattcggat 1020 gccgaaaact tgatccgcct tcaaagaagt ctgaagggcc atgctttgga gtccgttcgg 1080 agccgcctgc tactcccatc gagtgtacct cacgttatta gcacgctgag gacactatac 1140 ggtcgacctg aattactggt tcgttccttg atagcgaaag tacaccgtgt tccagcaccg 1200 agacaagata gactggaaac tatcatggaa tttggtctcg cggtgcaaaa tttagtcgac 1260 cattacaggg cggcgcacca aaagaatcat cttgctaatc ctacactcat gcaagagtta 1320 gtggaaaagc ttccaggacc tctaaggcta gattgggcag tttacaaaag ccaacatcca 1380 aacgttacac tgaggacctt cggtggcttc atgtctagat tggttactgc agcgagcgag 1440 gtgacatacg atctaccggt acacagcaag cccgccagaa acgagaaaat caagtcgaaa 1500 gagtggggaa tagtgcaagc acatactgca gtgcagtaca gcacggaacg cccgaatcgt 1560 gatagtcgga aagtcagcaa accatgtgca ttatgtgaac gcgagggaca tcgactacat 1620 gaatgcccac aatttagaac gttcgatgtg gatgaacgat ggaaattcgt acaacaaaag 1680 gggctttgta ggacttgtct caatggccat ggtaattggc catgcaaatc ctggcaagga 1740 tgcgctcatc aggactgccg ccttaaacat catacattgc tccatccacc caccacagca 1800 gtatatcatt cggtaaatat ctcttctaac tgctctgctg gtaatgaaaa attgtttcca 1860 atatttcgaa ttatcccggt tgtgttgaga aacagaggtc gctccgaaat agtttacgcg 1920 ctagttgatg aagggtcatc tttgactctc ctggaaaagt ccattgcaga acaactacaa 1980 gtggtcggac caacagaggc acttacccta caatggactg gaaatattac ccgagaagaa 2040 aacgaatcgc aaataattca actagatgta gcaggaaagg gtagtggcac ctatcataag 2100 cttatcagtg tgcatacggt cgatggatta ttgctgccta cccaaacatt tcgatatcaa 2160 gaactggcta aacaatttcc tcatctacgg gggttaccaa tagaagatca agaacaggta 2220 caacctaagc tgctgatcgg tttggaaaac ctgcgacttg ctgtaccgat agccatacga 2280 gaaggcgcgg ctggggaacc aattgcagca aaatgtcgcc ttggttgggg aatttatgga 2340 tgcgccctag caccttctgg aaataaaccc gttctgaatt tccatgtgag tgtcgcatca 2400 gattccgatc gtcagttaaa cgagcaatta caggactatt tcgccttaga cgtgtcagga 2460 gtatcaggag caggtgaaaa accagaatca gacgaaaaca aaagagcgat gcgtatacta 2520 aaggaaacca cacggcggac gttgcgagga tttgaaacag ggctactttg gaacaccgac 2580 aatccaaatt ttccaaatag ttatttgaca gcactacgtc gtctacaatc actagaaaga 2640 agactatcga aagaaccgac tctcagagaa cgtgtaaatg aactgattca cgaatatgag 2700 gctaaggggt atgcccatag ggctacacaa aatgaaatcc aatcgtcgga tccgtctcgt 2760 gtctggtatc tcccgttggg aattgttacg aatcccaaga aaccagataa gattaggctc 2820 atctgggatg cctcagctaa atccgacggt gtgtcgctca actccaaact cctaaaaggc 2880 ccggacttgc tgacaccatt gccaacagtg ctctctcgtt tccgccagta ccccgtggca 2940 gtaagtggag acataagaga gatgtttcat caactcacag tcatcgaacc agaccaacaa 3000 gcacaaagat tcctttggcg tgaagatcct gcaggagaac cacacatcta cgtgatggac 3060 gttttaacgt ttggctcaac ttcctcaccc tcgtcagcac aatacgttaa gaaccttaac 3120 gcatcagagt tcgcatccga gtacccacgt gcagcaggag caattttaga aaaccattac 3180 gtggacgact acctcgatag ttttgccacg gtggaggatg cgatccgcac agtgaatgag 3240 gtgaaacata ttcattcaaa gggaggtttc cagttgagga aatttaaaag caatgatgtc 3300 gccgtactgc gcggaatagg agagctcgcg aacgatgaat gtaaagatct gttgctagaa 3360 agaggagaat caatcgagtc agtactcggg atgaagtgga ttcccaaaga agatgtcttc 3420 atttacgctt tcgctccacg taaagaattc caccaaattc tctgccccga acatgttcca 3480 acgaagcgag aggttttgaa aatagtaatg agcctgtttg atccattggg ctttgcatca 3540 ttttttcttg tacatggtaa agtgttgatt caagagattt gggcctcagg atcacattgg 3600 gatcaaccta taaaagatga aatttgcgaa cgttggcatc gatggacaga aatgttcgac 3660 cagctagtaa agttgaggat accaagaagc tacttctctt cagcgattcc gactgagttc 3720 aacaatctac aaattcatat attcgttgat gctagcgaag cagcctattc atgtgttggc 3780 tactttcgtt tggattccga atcaggaatt caagtgtcgt taattggcgc caaaactaag 3840 gtcgctccac tgaagacctt atcaatccct cgactagaac taaaagcagc agtactgggg 3900 gcgcgcttca tggaatcaat acaagccaat cacacctttc atgtatcgca gaaatattac 3960 tggagcgatt cgagtactgt tattgcatgg atcagatcag accacaggcg gtacaataaa 4020 tttgttgagc tacgaatcgg tgaaatcctg atgctgagtg atccttgtga ttggaaatgg 4080 gttccttcca aaatcaacgt agcggaccat gcaacaaagt ggaacaaagg tctaagtttt 4140 caaaatgaca gcgaatgggg ccaaactttc tatacgatcc cgaactaaca tggcctcaac 4200 aacgaacaat ttccacaacc caagaggaga ttaaacgttg tcacacacat tggactccga 4260 gtcctctaat cgacgtgagc agatttagtc agtggacaag attacaccgg gctatggctt 4320 acgttctccg gtttatagat aacctacgtc gaaagaggaa tcgtcaacca ttggttattg 4380 gactacttca acatgaggaa ttggttaggt cggaacaaag tttatggaaa attgcacaac 4440 tagaagcttt tccggtggaa gttgagtctc tctccaagac acgtggacca ccagaagaac 4500 gtcatccaac agttgcaaaa tcaagttcaa tctacagaag ctggcccttt atggacagtg 4560 atggtgtagt gcgcatgcga agcagacttg gttcagcatt ctacattcca gttgaagccc 4620 gataccctgc gattctccca aggcaacatc gtataactac actaattatc gagtggtatc 4680 accgacgata ccaacatgcc aatcgggaga caatagtaaa tgaaattcgc caacgatttg 4740 aaatttctaa gcttcgggct ctcattggta aaataataaa gaactgtgtc aaatgccgcc 4800 taacaaaggc tacacccgag tatcctccta tggcatctct tcctgaggct cggctgtcga 4860 cgttcgagcg accgtttacg ttcgttgggc tcgattattt cgggccagtt ctggtacggg 4920 ttggcaggag tactgcaaaa cgctggatag cactatttac gtgtctttcc attcgcgcca 4980 tacacctaga agtcgtacat agtatttcaa cagagtcgtg tataatggca gtcaaacgat 5040 ttgttgcacg acgtggcccg cccgtggaga tatactctga caacgcaacg tgcttccagg 5100 gagctagcaa tgagctgcaa agggagataa atgatgcact ggccttaacg ttcaccagca 5160 caaacacctc atggaaattt attccaccag ctgcgccaca tatgggaggc gtttgggaac 5220 gcctggtgcg ttctgttaaa gttgcgatcg cgtcgatcaa tgagacttgt cgcaaacctg 5280 acgacgagac acttgaaact atagttcttg atgcggaagc gatggttaat gctagacctc 5340 ttacatatgt gccacttgaa tcagcaaatg atgaagcact aacacccaac cattttcttc 5400 taggaagttc atcaggagta aagatagcac cattagaaac cccaagtaag cctgcggtac 5460 tccgaagtag ttggaagtta gcgcaacaca ttaccaatca tttatggagt aggtggatta 5520 aagaatattt acccattata accagaagaa gtaaatggtt cgaagaagta agggagttaa 5580 aggaggggga cttagtccta atagtaggcg gtacttcgag aaatcaatgg actcgtggcc 5640 gagtggagaa ggtggtatac gggcgtgacg gaagagtgcg tcaagctcta gttcgcacgt 5700 cgtccggtgt attgcgaaga ccagcagtgc gcttagctat tctcgacgtc aacggtaatg 5760 gtaaacccag cagtgaacac gatgacctgg tgaatcatca gtaggattta cgggaagggg 5820 ga 5822 // ID Gypsy-10_CQ-LTR repbase; DNA; INV; 139 BP. XX AC AAWU01008510; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_CQ_; KW Gypsy-10_CQ-I; Gypsy-10_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-139 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 400-400 (2011). XX DR GenBank; AAWU01008510; Positions 121185 121323. XX SQ Sequence 139 BP; 41 A; 26 C; 33 G; 39 T; 0 other; tgtgacgtag gtgcattgag cagcccctcg tatctgtcag tgtgggcgaa tgacatagca 60 tttctagatc gcgcgagcaa acgacacgct ttgcaataaa atgggaaagt taattaaaac 120 gttgttttaa tctattaca 139 // ID Transib-14_HM repbase; DNA; INV; 3498 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3498 RA Jurka J.; RT "Transib transposons from the hydra genome."; RL Repbase Reports 8(12), 2103-2103 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 778..2889 FT /product="Transib-14_HM_1p" FT /translation="MIIKYSEIVAIIKQNDKHYLAIEKCLLCCEEKLKRDL FT KEEEKQVITQKIKNIKKRYNELWRKQGCSSARFMKYHGSWLQTEYELPIEL FT KCDPQPSSSEKTKRGRPTMPFLEMSARSKRRVIANVVSNNEPETEMYLRVA FT ATVAAQNDKKDLSHLIKQVIQSQNLAAVLNKINSEVVRMPPEQALSFLFDT FT NLSKDQYNQIRFAAKTQGGDIFPPYKLVSAEKSQCCPSEILQTENCIEVEW FT KHLLKHTLERISKLQHELIEKEIPEINVVQCKFIVSYGFDGSTQQARFNQD FT LSENFSDSSLFVTSMIPLRLQTSCGKILWTNQTPQSTRFCRPQRIEFAKES FT KEYVIRLKEKYDKEIAELEEVVIILKSGKKLLVECNLCLTLIDGKIQAFIT FT KTSSFQRCTVCGCSSKDMNKLVNFTKGVFNVNPVALQYGLSILHMHIRCFE FT AMLHISYKLVLPENQWQVHGKENEAIIKARKIEIQSRFKNKLNIHIDMPKS FT SGGNSNTGNVAKRAFEDPKLLSEILDIDLELIQRLRILLLCISYRFPIDPV FT KFNDYCFSTASRYVKLYKKFPMTPTLHKLLIHGSQIISNSIVPVGMLSEEA FT SESRNKIFRSDRINHARKDSRQHNLFDVFTRAMETSDPILSSSRLQQRIQK FT YKKHKMPSEVRELLILNTNINNENCDDDSNSENDIDDFENSHMYETNVDSE FT IYQ*" XX SQ Sequence 3498 BP; 1289 A; 503 C; 554 G; 1152 T; 0 other; cacaatgggg cctaacaaag gaaaaaatgg ccaaaaatct aaaaatcaag ttttaaattt 60 ttgaaaattt taaatgctaa ttttaaatct tatttgtagc tgcatctgaa acagattagt 120 ttgtacttaa aaaaagttgg atgttgagta accattagac tcaaactcaa actgaccgaa 180 atttgtgtta aaattcctat tttgagttta ttacatttac tttgtgactg ctgttttaag 240 aatattaaat tatatatcta tgtaaagata atgattaagt ctatcaatat atgcaataaa 300 attaatttaa tttagtttat ttccaataaa aatatatata tttatagcgc ttttagtttg 360 tttataactt aaaagctaga gatataattt tgctgttgat ggccggtgga agtctaactt 420 gcaataaata cttgaaactt agttcaatca aatttgataa aaacttagtc tgtcttgtac 480 tgtttgattc attgtttttc cgtttaaaaa actttttttg tctttttttt ttgtcttttt 540 tttaaaaaaa atctaaaaat aaagacagat ggcatttcta tcagtatact gatagaaatg 600 ccatctgcag taaccaataa ttactttatt ggttactgca ataattatat aaaattatat 660 aaccaataat tactttgtaa ggattattgg ttactgcagt aacacataat ccttacgcaa 720 ttcaaaagca ttgttcaaag ataaataaaa ccgtgaaatg tgtaaataca tttaaaaatg 780 attattaagt atagtgaaat tgttgcaatt ataaaacaaa atgataaaca ttatctagcg 840 attgaaaaat gcttgttgtg ttgtgaagaa aaacttaaaa gagacctaaa ggaagaagaa 900 aaacaagtta taacacagaa aataaaaaat ataaaaaagc gatacaacga actttggaga 960 aagcaaggtt gttcttctgc aagattcatg aaataccatg gaagttggct tcaaactgaa 1020 tacgagcttc caatagagct taaatgtgac cctcagccgt ccagttctga aaaaacgaaa 1080 agaggaagac caactatgcc gtttttggag atgtctgcca gatccaaacg ccgagttatt 1140 gctaatgttg tatctaataa cgaacccgaa actgaaatgt acctcagagt tgctgcaacc 1200 gtcgctgcac aaaatgataa aaaagactta tctcatttaa tcaaacaagt tattcaaagt 1260 caaaatttgg ctgctgtatt gaataagatt aattctgaag ttgtgagaat gccaccggaa 1320 caagcgttat cgtttctttt tgatacgaat ttgtccaaag atcagtataa tcaaataaga 1380 tttgcagcta aaactcaagg tggcgatatt ttccctcctt ataagttggt atcagctgaa 1440 aaatctcaat gttgtccaag tgaaatattg caaacagaaa attgcattga agtagaatgg 1500 aagcatttat taaaacatac tctagaaaga attagtaaat tgcagcatga acttattgag 1560 aaagaaattc cagaaattaa tgttgttcaa tgcaaattta tagtttctta tggctttgac 1620 ggtagcacgc aacaagcaag gttcaatcaa gatctaagtg aaaattttag tgatagttca 1680 ttatttgtga catctatgat tcctttaaga ttgcaaacaa gctgtggaaa aattttatgg 1740 actaatcaaa ctcctcagtc tactagattt tgccgtccac agagaatcga atttgcaaag 1800 gaaagtaaag aatacgtaat tcgattaaag gagaaatatg ataaagaaat tgctgaactc 1860 gaagaggttg tgattatttt aaagtctgga aagaaactat tggttgaatg taatctgtgt 1920 ttgacgttaa tagatggaaa gattcaagca tttattacaa agacatcttc gtttcaacga 1980 tgtactgtgt gtggttgttc ttcaaaagat atgaataaac tagtaaattt tacaaaagga 2040 gtatttaatg ttaacccagt agctctacaa tatggtttaa gtattcttca catgcatata 2100 cgctgtttcg aagcaatgtt acacatttct tataaattag ttttgcctga aaatcaatgg 2160 caagtgcatg gaaaagaaaa cgaagcaata ataaaagcaa ggaagattga aattcaaagc 2220 cgattcaaaa acaaacttaa cattcatata gacatgccaa aatcaagcgg tggcaatagt 2280 aatacaggaa acgttgcaaa aagagcattc gaagatccga agctcttgtc tgaaatttta 2340 gacatcgact tggagcttat tcaaagatta agaatacttt tgttgtgtat ttcttatcga 2400 tttccaattg acccagtcaa atttaatgac tattgcttct caacagcatc tcggtatgtt 2460 aaattatata aaaaatttcc tatgactccc acattacata aacttttgat acatggctca 2520 caaatcatat caaattcaat tgtccctgtt ggcatgttat ctgaagaagc ttcagaaagt 2580 agaaataaaa tttttagatc agatagaatc aatcatgctc gaaaagatag tcgccaacac 2640 aacctctttg atgtgttcac acgtgcaatg gagacttcag atccaattct ttcaagttca 2700 agattacaac aaagaattca aaagtacaag aaacataaga tgccatccga agtaagagaa 2760 ctattaatct taaatacaaa tattaataat gaaaactgtg atgatgattc taacagtgaa 2820 aatgatatag atgattttga aaacagtcac atgtatgaaa caaatgtaga ttctgaaata 2880 tatcaataag atgatgttat gattttttat aataaaaata gcagacgtgg cgcagtggtt 2940 atagtgttga actctttacc caaggggttg caagttcgaa tcgagtatca ataaaattga 3000 acaaaatatt tcgtacaata acttttctct attatttgct tgcaacatag atggcagcat 3060 gagtttttat cttttgtata aaaaacttat tcatattttc atataatatt ttacacttta 3120 ttttaaaaat ttggaaaaag ttgtgggggg ggggggcact cagcctttca ctcctcacca 3180 ctcccctctg ttaccgcgat cccttttata aatacgcatg ttttactttt tttttccata 3240 agtttttttt ataagctata aatatatact taaataatat agtaaaaagt aaaaaaaatc 3300 actgaatgtc aatttattgc taaagaaatt cttatcagac tctatttcga aagaaatggt 3360 caaaattttt tcaaaaaaca tttaattatt ttttcattca atttagaata ctctttttta 3420 taattgtgca aaattttaaa tggattggca gatttttcga tttttggcca tttttccttt 3480 gttcggcccc attgtgca 3498 // ID NONAUT-5 repbase; DNA; INV; 4727 BP. XX AC BN000789; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Schistosoma mansoni Nonaut-5 LTR retrotransposon (EST). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW gag-pol polyprotein; NONAUT-5. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4727 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000789; Positions 1 4727. XX FH Key Location/Qualifiers FT CDS 349..4419 FT /product="NONAUT-5_1p" FT /translation="MSITFDQFEVFLERHEKRLEQFQMRILEKLTQQMNLN FT KNGLTDVSAKSHADCIIDSIHEFHFDGVAGVTFESWYKKYEDVFNVDPETF FT DDAAKVRVLLRKLGTIEHERYTNYILPKNPRDISFEETVKILSQIFGEQSS FT LFNIRYQCLKIVKAGNDDWVQHASIINRECERFKLSAMTEDQFKCLVFVCS FT LQSSEDADIRTRILSKIEQCPNITLQEVTTECQHLVNLKHDTSMVESNGRL FT PYVQSVSGKQKQNTTIKKPPSACWFCGELHYKRFCPYKNYRCTRCQLKGHK FT ETCCKKKMHRRSSSVHYRRRKNCSSTNKIMVAHTSTEHNRRKYVTLEINGR FT RARLQLDTASDISLISRKTWSHIGKPLVLPTAQLAHSASGGKLNIAGEICC FT PVKKGNVQKNVKVYLTESPGLDLLGLDLIETLKLADHSINSICRRVTIDNS FT SRCNQKSTVPQRHHNVFKEELGECTKAKALLILKPGATPVFRPKRPVPYAA FT LPMVEQEIERLQKLGVIEPVNFSEWTAPIVVVKKTNGSIRLCADYSTGLNE FT ALGTHQYPLPLPEDPFAKLNGGKLFAKLDLSEAYLQIPVADECKNLLTINT FT HKGLFRYNRLPFGVKTAPSIFQQVMDTMLQDVPGAAAYLDDIIIMGVDKID FT LEKKLDQVLSRIAEYGFRLRAEKCNFCMKKVSYLGFIIDQDGRRPXPENTQ FT AVKTMPRPTDVPTLRSFLGLVSHYGAFIPNLHHLRAPLNNLLAKNVKWDWS FT ADCQAAFEEIKKILVSDLLLTHYDPSLPIVVASDASNHGIGAVISHIMPDG FT SEKAISHAARSLTTAERSYSQIEKEALSIIFAIKRFHKMIFGRQFTLLTDH FT KPLLAIFGSKKGIPVYTANRLQRWGTTLLGYDFKIKYQPTTDFGQADALSR FT LIGSRVKHEEDTLVAAIETEAEVHRVLEDAVDGLPVTLKAIKEATINDKTL FT REVSGYLLTRWPNRRFQSEMLQYFRRRDSLMMVDSCIMFGDRIVVPKTLRH FT RVLKQFHSGHPGINKMKALARSYAYWPSMDHDIEQKCRSCSSCLEAAKNPN FT KAEPQPWPKPDGPWQRIHADFAGPIQGRNYLVVVDAFTKWPEVYDMPKMTS FT ESTIRKLTGLFACFGVPEILVTDNGTQFMSSVFKQFCTENGISYLQSPPYH FT PQSNGQAERFVDTIKRALVKGGGEAIPEQVISKFLLSYRVTPNPAVPEGKS FT PTECMFGRKIRTIFSSMLPPRKTRKPTNGNHMVTRSFKVGERVLVKSYQGG FT KRWEPGIIERRIGSVLYLIRRNVGTCIRHINQIRRDGRTWMPQSRLPFHLL FT MDSSPKCTQAHEGRKDKRRKSKATQPSRVMSRKRTAVTQFQVDPR" XX SQ Sequence 4727 BP; 1511 A; 995 C; 1030 G; 1190 T; 1 other; gccctaagaa gcccaggaag agccgcagat attccatcca atcacggcta ggggaatgtt 60 ctagaaatgc caacgtggag cttctccatt ggatggatta gtttgatacc tatgtgctta 120 ttggttgacc caaatgataa tttactttca gccaaaaact ataaatacac gagatttctg 180 tgctatattt tgacactgtt tggggcaata aacttcctgc ttaccagtct ttgtcttctc 240 tcttttgcga cgtttgcggg ttctatcgtt caagtagtca agtagttcgc ggcatattaa 300 gaatcaaggt tcgagatata acaactggcg acgggttaat ttatcaagat gtctattacc 360 tttgaccaat ttgaagtttt tctagagcgc catgaaaaac ggcttgaaca attccagatg 420 cgcatactcg aaaaactaac ccagcagatg aacctcaata agaatggact tacagatgtt 480 tctgccaaat cgcatgctga ttgtatcatt gattccattc acgaatttca ttttgatggt 540 gttgctggtg taacattcga atcttggtac aagaaatatg aagatgtatt taatgtcgac 600 ccagaaactt ttgatgatgc cgccaaggtc agggtgctac tacgaaagct gggtaccatt 660 gaacatgaac gatatactaa ttacattctg cccaagaacc caagagacat ttcattcgaa 720 gaaacagtta aaattttgtc gcaaattttc ggtgaacagt cttccttgtt caatattcga 780 tatcagtgct tgaaaatagt aaaagccggc aatgatgact gggtgcaaca tgctagcata 840 ataaatcgcg aatgtgagcg ctttaaacta tcagcaatga cagaagacca attcaaatgt 900 cttgttttcg tatgcagctt acaatcttca gaagacgccg acattagaac cagaattcta 960 agcaaaatcg aacagtgccc aaatatcacc cttcaagagg tgactactga atgtcagcat 1020 ctggtgaact tgaagcatga cacttcaatg gtagaaagta atggtcgact accttatgta 1080 cagtcagtaa gtgggaagca aaagcagaac acaaccatta agaagcctcc atccgcatgt 1140 tggttttgcg gagagctgca ttacaagagg ttctgcccat acaaaaacta ccggtgcact 1200 agatgtcagc tcaaaggaca taaagaaaca tgttgtaaga agaagatgca ccgtcgctca 1260 tcgagtgttc attaccgaag acgtaaaaac tgttcatcaa ccaataaaat aatggtagca 1320 cataccagca cagaacataa tcgaagaaaa tatgtgacct tagagattaa tggacgccgt 1380 gcccgtctac agcttgacac tgcttcagat atttcactta tcagccgcaa gacatggagt 1440 catattggca aacccttggt gctccctaca gctcagttag cacacagtgc atctggagga 1500 aagttaaata ttgctggaga gatatgctgc ccagttaaaa aagggaatgt acagaagaat 1560 gtgaaagtat atctcacgga aagcccagga ctagacttat tgggacttga tctaatagaa 1620 actcttaagc tagcagatca ttctattaat agcatctgca gacgagtgac tattgacaac 1680 tcctcaaggt gtaaccagaa aagtaccgtg ccgcaaagac atcacaacgt ttttaaagaa 1740 gaattgggag aatgtacgaa ggccaaagcc ttgttaatcc tcaaacctgg tgctactccc 1800 gtttttcgac ccaaaagacc tgtgccctac gcagctctac cgatggttga acaagagata 1860 gaacgacttc aaaaattggg tgtcattgag ccagtgaatt tctcagaatg gactgcacct 1920 atagtggtag taaagaagac taatggtagc atccgtttat gtgccgatta ctcaaccggg 1980 ctaaacgaag ccctaggaac tcatcagtat ccactacccc tacccgaaga tccttttgct 2040 aagctgaatg gtggtaagct ctttgcaaag ttagatttat cggaagctta tttacaaatt 2100 ccggttgctg atgaatgtaa aaatctgctc accataaaca cacacaaggg cctattcagg 2160 tataatcgac taccttttgg agtcaagacg gccccatcta tatttcagca agtgatggat 2220 actatgctac aggatgttcc aggtgctgca gcctatctgg atgatataat aattatggga 2280 gtagataaaa tagacttaga aaagaagctc gaccaggtgc tgtcgcgaat agccgaatat 2340 ggatttcgac tgcgtgcaga gaaatgtaac ttttgtatga aaaaggtaag ctacctcgga 2400 tttataatcg atcaagatgg cagaagacca ratccagaga atactcaggc agtgaaaact 2460 atgcctagac caacggacgt ccctacacta cgatcctttc tgggactagt tagtcattat 2520 ggagctttta ttcctaacct tcatcacctg cgtgccccct taaataacct tctggcgaag 2580 aatgtaaagt gggattggtc tgcggattgt caagccgctt tcgaggaaat caagaaaata 2640 ctagtctcgg atctgctgct cacccattat gacccatcct tacccattgt agtagcatca 2700 gatgcttcaa accatggcat tggagcagtt atttctcata tcatgccaga cggatctgaa 2760 aaagccatat cgcatgcagc tagatcatta actacagcag aacgtagcta cagtcagatc 2820 gaaaaggaag cattgtcgat tatctttgcc ataaaaaggt tccacaagat gatatttgga 2880 cgccagttca ccctactaac tgaccataaa ccattgttag ccatttttgg gtcaaagaaa 2940 ggtatcccgg tatacacagc aaacagactg caacggtggg gaactacact cttaggttat 3000 gactttaaga tcaagtacca gcccactact gatttcggac aagcagacgc tttgtctaga 3060 ctgattggtt cgcgagtaaa acatgaagaa gatacattgg tagcagcaat tgagacggaa 3120 gcagaagtac atcgagtatt ggaagacgca gtcgatggct taccagtcac tctcaaagct 3180 atcaaagaag ctacgataaa cgacaagaca ttaagagaag ttagtggtta tctcctaacc 3240 aggtggccga accgtcgttt ccaaagtgag atgctacaat atttccgccg gcgagactca 3300 ctcatgatgg tcgactcatg tattatgttt ggtgacagaa ttgttgttcc aaaaacatta 3360 cgccacaggg ttctgaaaca attccacagc ggccatcccg ggatcaataa gatgaaagcg 3420 ttagctcgaa gctacgcata ctggccatca atggaccatg atatagagca aaaatgtcgc 3480 agttgttcgt cctgtttaga ggccgcaaag aacccaaata aagctgaacc gcaaccatgg 3540 ccaaaaccag acggtccctg gcaaagaata catgctgatt ttgccggccc aatacaaggc 3600 agaaattact tggttgttgt cgacgccttc acgaaatggc cagaagtata cgacatgccg 3660 aaaatgacgt cagagagtac tataagaaaa ttgacaggtt tgttcgcttg ttttggggtt 3720 cctgagatct tagtgactga caacggcacc cagttcatgt catctgtctt taaacagttt 3780 tgcactgaaa atggcatatc atatctacag tctcccccgt accatcctca gtcgaatggc 3840 caagcagaaa gattcgtgga tactataaaa cgagctctag ttaaaggggg aggagaggcc 3900 atcccagaac aagtaatcag caagttcctt ttgtcttaca gagttacccc caatccggca 3960 gtacccgaag gtaagtcacc aaccgagtgt atgttcggaa ggaaaattcg aacaatcttt 4020 agcagtatgt tgccacctag aaagacaaga aaacccacga atggaaacca tatggtcact 4080 cgtagcttca aggtgggtga aagggtactc gttaaatcgt atcaaggtgg aaaaaggtgg 4140 gaaccaggta ttatagaacg tcgaattgga agtgtattgt atttaatccg aagaaatgtc 4200 ggtacttgta taagacacat caatcaaatt cgaagggatg gcagaacatg gatgccacaa 4260 agccggcttc catttcattt gctcatggat tcgtctccaa aatgcactca ggcgcacgaa 4320 ggaagaaaag ataaaagacg caagagcaag gcaacacaac cttcgagagt aatgagccgc 4380 aagagaaccg cggtaactca attccaggtg gatccgagat aaaggactta tacaacagac 4440 tttaaagggg ggaggtgtga ggatggtcca caggtgaagt cgaggaggct ctaggaagca 4500 caggaagagc cgcagacatt ccatccaatc agggctaggg gaatgttcta gaaatgccaa 4560 cgtggagctt ctccattgga tggattagtt tgatacctat gtgcttattg gttgacccaa 4620 atgataattt actttcagcc aaaaactata aatacacgag atttctgtgc tatattttga 4680 cattgttcgg ggcaataaac ttcctgctta ccaaaaaaaa aaaaaaa 4727 // ID Gypsy-132_AA-LTR repbase; DNA; INV; 1099 BP. XX AC AAGE02026533; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-132_AA_; KW Gypsy-132_AA-I; Gypsy-132_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1099 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026533; Positions 45937 44839. XX SQ Sequence 1099 BP; 286 A; 323 C; 269 G; 221 T; 0 other; tgtaaccgta aagcggttca ataaacccta gtccaagaga aatgtagata cttgtgcgat 60 tctcaccatc ttgctagttg tcctagatag gacgtaaagt ttagaaccct atccgcgggt 120 gaggcaatgg tagcaagcca aattacccct agatggcttg agcgggtgtg gttaccaaca 180 ggttttggag gaagtggtga ccctgcaaac ggaacatgtg gaaagaggaa tgagcgtgca 240 aaaaggggca gctggagcct cgaaaaaggg actcggggct cttaagttgt gaccgtggtc 300 gtgaacagtc gaagaagcct tgtgctaata agttttccga aaaagtgagc tagtgagccg 360 ttgggacggt ccaactgact acagtgtccc gatcaccacc cctagtggct ggttgaacgc 420 caaggatcgg taagagtaaa ccggtctcgc ccggaacggt ttcgaatcgt cacggctaac 480 cgtaaaagag acaagccggt gttcgtcgcg gatccctctc ggaagctgca gcgatccgac 540 acaccatccg ctaatcgcta aggacgaata gatctgcgga tctccatcct gccagaagga 600 aactcaccaa tcaacaatag tcaagcgtgc gcatccctgg ttggtcgcac agcaagccat 660 tggccacgtg cctcatcttc cacttctgcc actcgcctcc aatccaccgc cgccattgct 720 gagaacccga tatccccatc aaacctcccg ccgtcgagcc acccgccatc aagccacccg 780 ccgtcaagcc acccccgtcg tccactcgcc accatgccat caacccaaag cctagatctc 840 gcaacgtaac ctccaaccat caatccgtga gtaataaagt gtctaaattc acttccgaac 900 tttgaccact agcaaggtcc gaaaacctcc ccataacctc caacgaccct gggactcgag 960 gaccaaaaga ggccttgtgt gcccgccccg taggttgccg ttggctttag accagcaacc 1020 tggggtttag gacgttcccc cttctgggtg cgtcgatcat atcctcgagg gctaagaggc 1080 cgtaccagaa acggtaaca 1099 // ID Gypsy-16_AA-I repbase; DNA; INV; 2973 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_AA_; KW Gypsy-16_AA-LTR; Gypsy-16_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2973 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1001-1001 (2011). XX DR [2] (Consensus) XX CC Positions [2112-2588] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 1254..2810 FT /product="Gypsy-16_AA-I_1p" FT /translation="MLKKGSSFQFDDDAVRAFNRIKEILSSYPVLQIFRPE FT GDVEVHTDASKKALAGILQKADDDGKFHPCYYYSRLTNGSEKNYHSYELET FT LAVVESVREFRCYLLGRKFKIITDCMAFKDSMSKKKLNPRIARWVVDLAEF FT DYEVEHRRGEKMPHVDALSRANVMVVSTPIVAKIRAAQKDDEKAMKILTTM FT EQGDEANGYTVYNGVLYEGDGDARRIYVPESMEMEIVRSAHERGHFGVKKV FT KERINADYYIPCLDDKIKRCIATCVACIVGEKKRGKPEGELHPIPKGDVPL FT DTFHVDHLGPMPSSRKSYNYILTVIDAFTKFVWLFPTKSTTAEESVKKLRV FT ITDTFGNPKRIICDLGAAFTSGLFTKFCEDENIELHKIATGMPRGNGQVER FT VHRIIIPMITKMAVDKPEEWFKHVTDVQKCLNNSWQRAIEMTPFQLLTGVK FT MRTKEDHVLHEILQKEIQASFEEERSELRKIAQNNIQKMQEENRKHYNLRR FT KCATKYKVGDTENPVRCGPESQA" XX SQ Sequence 2973 BP; 888 A; 600 C; 819 G; 666 T; 0 other; actctaaacc ctttcgtctt acatgggggc tcgtccgggg atgagagttg ttgagacagt 60 gttcgcgatg tgataaaagt ttgattaaat caatcgcaaa gttgaacgac ggacgattaa 120 agatggcgga cgacgtggag attttgtgcg aaacgttgac caagttcggc attcaacgcg 180 agtgcgaaaa acgaggtttg ccgactaccg gcaataaaaa ggctatggcg aaacgaattg 240 tcgatcacga caaaacgacg acggctgaac aagatggcgt tcctgaaacc aacgacgagg 300 agcaggatcg cagtcagcat gacgacgatg tgaaagttga tgacgatgac cgacgcgcgc 360 gtgaaaaagt ttgcaccaaa aattgaagag tcggaaccgt gtgatgccaa atggatccgt 420 cgtgtggaag agtatatcgg cgaggaagaa attgtggttc cgtacaaata tcgcgacgaa 480 gttttgggaa tgattgagaa ttataagccg agtgaatcag taattgctgc gaacgaattg 540 tcgattaatt tgaaggataa tgacatcgtt tgtgaaaatc cacgtcgatt ggcgcagctt 600 gaaaaagaag tggtgaagaa gcaaatcggt gaatggatgg aaagtggcat agtgcaaccg 660 tcagaaagtg aatacgcgag tccggttgtg gttgttccga agaaggacgg aacgtaccgc 720 gtctgcatcg attaccgcga aataaacaaa aaagtagtgc gtgataaatt tccgatgcct 780 aacgtagagg atcaaatcga tcagctgtcg gaggcccgtg tgtatacgac gctcgacctg 840 aaaaattcgt actttcatgt gccggtcgaa gaaagcagcc gcaagtatac cgcgttcgta 900 acgagtgacg ggcaatatga gttcctacga gctccgttcg gactgtgcac cagtggtaac 960 gcgttcggac gatttatcac cacggtgttc aaagatttga ttaatgacgg gacgattttg 1020 gcattcgtgg atgatgtcat aattccgtcc gaagatgaag aaaaaggact gtcgaacatc 1080 tcggctatac catttacgat ggccgtgtgg aaccagcgcc tgtgaaggtc gagaagttgt 1140 tgcagttccc gcagccgtca acagtgaagc aactgcagcg gttttatgga ttggccagct 1200 acttcaggaa gtttgtccca gcgtttgcga agatcacccg accactttca ggtatgttga 1260 aaaaaggaag ttcttttcag tttgatgacg atgcggtaag ggcattcaac cgcatcaagg 1320 agattttgtc cagctaccca gtgctgcaaa ttttccgtcc tgaaggcgat gtcgaggttc 1380 acaccgatgc cagtaaaaaa gcactggcag gtatcctgca aaaggcggac gatgatggga 1440 aattccaccc ctgctattac tacagtcgcc taaccaatgg atccgagaag aactatcact 1500 cttacgagct ggagacgttg gccgttgtag aatctgtgcg ggagttccga tgttatctgc 1560 ttggtcgtaa attcaagatt atcacggact gcatggcctt caaagattca atgtcgaaga 1620 agaagctgaa ccctcggatt gcaaggtggg tcgtggatct cgctgagttt gactacgagg 1680 tggaacatcg ccgtggagag aagatgcctc atgtggatgc gctatccaga gcaaacgtga 1740 tggtcgtgtc cacaccgatt gtcgctaaaa ttcgtgcggc acagaaggat gacgaaaaag 1800 cgatgaagat tctaacgaca atggagcaag gagatgaagc caatggctac actgtgtaca 1860 atggtgtcct gtacgaagga gatggcgatg cgagaagaat ctacgttccg gaatcgatgg 1920 aaatggagat agttcgttct gcgcatgaga gaggacattt cggagtcaag aaggtaaagg 1980 agagaatcaa cgcggactac tacatcccat gtttggacga caagataaag cggtgcattg 2040 ctacgtgtgt cgcttgtata gttggtgaga agaaacgagg aaagcctgag ggtgaattac 2100 atcccatccc gaaaggtgat gttcctttgg atactttcca tgtcgaccac ctaggtccta 2160 tgccgtctag caggaagtca tataactata ttttgaccgt tattgacgcc tttaccaagt 2220 tcgtatggct gttcccaacg aagtcaacaa ctgcggagga gtccgtgaag aagttacgtg 2280 ttattaccga caccttcggt aatccgaaaa ggatcatctg cgatctaggc gctgcgttta 2340 catcgggatt gttcacgaag ttctgcgagg atgaaaacat cgagcttcac aagattgcca 2400 ctggaatgcc tcgtggtaat gggcaagtgg aacgagttca tcgaatcata attccaatga 2460 tcacgaagat ggccgtggac aaaccggaag agtggttcaa gcacgtgacg gatgttcaaa 2520 agtgtttaaa caacagttgg caaagagcga ttgaaatgac accctttcag ttgctaactg 2580 gagtaaaaat gcgaaccaaa gaagatcacg tcctacatga aatactgcag aaggagatac 2640 aagcaagttt cgaggaggaa agaagcgaac ttcggaagat tgcgcaaaac aacatccaga 2700 agatgcaaga ggagaatcgc aagcattaca acttacgacg gaagtgtgca acgaagtaca 2760 aagttggaga taccgaaaac ccagttcggt gtgggccaga aagtcaagcc taaattcttt 2820 ggaccgtatg aaatcatcga agctctaccc aacgatcgtt atgaggtgcg aaaggtcgac 2880 gaagaagacg aaggaccgaa gaagacgact acagctggtg acgctatcaa gccgtggaca 2940 cttccggggc ggaagtagtg tcaggaaagg ccg 2973 // ID BEL-650_AA-LTR repbase; DNA; INV; 448 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-650_AA_; KW Pao_Bel_Ele200; BEL-650_AA-I; BEL-650_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-448 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 448 BP; 166 A; 89 C; 86 G; 107 T; 0 other; tgttccgaac gacgataaac cagcacccag atgcgatagg gtcacccgtt cagcgcacag 60 caagaggcca cagtagaggc aagatacaat catgcggatg tattgccttc tgctggagct 120 gtcacgctgg atgattgtat tgaacttagt aactatagac acacacaaaa cagattaaaa 180 tctactaaag ctaggaagtg aaactgaatt gaattaaaac tatttgctta aattatcagc 240 taaaattttg cttatcatag ttaaagttgt tggacagtgg aaaaacacat ctaggactag 300 aaatgtgagt agcaagacat aacacaaaca tgaatttata aactaattaa cgaaataata 360 aaattacagc tataaagctg actcatgcac catgcaaaca cgagtcgtgc tataagaggt 420 ccgaacacct atcgctgctg tcccaaca 448 // ID CR1-39_BF repbase; DNA; INV; 1399 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-39_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-39_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1399 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1399 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1610-1610 (2009). XX DR [2] (Consensus) XX SQ Sequence 1399 BP; 414 A; 303 C; 378 G; 304 T; 0 other; ataacgccaa tccacaaaaa gggagctaga acccaagcag gaaactacag accggttagc 60 ctaacttcgg tggttgggaa gacgatggag gctatcatcc gggatgcact ggtggagcac 120 atgtctgctg gaaaccactt cacagacgct caacatggct ttgttccagg gaggtcgtgc 180 atgactcagc ttctggtggt actggaggag tggacccagt tactggaaaa gggggaagct 240 attgacgcag tctacttaga tttcaggaaa gcctttgacg ctgttcccca tcagcgtctc 300 ttatgtaagc tggggtcgta cggtgtgaca ggagacctgt acaactgggt aaaagacttc 360 ctagcctcga ggaaacagag ggtggtgctg aatggaacta gttcctcatg gacccctgtg 420 aggagtggca tcccgcaggg gagtgtactg ggcccggtat tgtttgtagt atacattaat 480 gacctaccag aggctgtcag cagcactgtc agaatatttg ccgacgacag taaactctac 540 cagggtgtga aggacaacga aggtcgggtt aacttacaga aggacttaga agctttgagg 600 gactggtctg cgtcctggca gttgcccttc aacgtgggca aatgtaaagt gttgcacatg 660 gggaacaaca acaacaggca gatttacacc ttaggggatc aggtattgga ggagaccacg 720 gcagagaaag acctgggagt gacagtggac aatcacctca aattccacac gcatacggca 780 ttagcgaaga acaagggaaa tcagatgctg gggttgatta aaaagtcctt tgccaacttg 840 gatgaacaaa ccatgccgat tctctttaaa acaatggtga gaccacacct ggaatacgga 900 aatgtaatct ggggacctca ctataagacg ggcaagcaag agttagaaaa ggtccagaga 960 agggccacaa aaatggtccc ttctctaaga gagcttcctt acagcactag acttcagcga 1020 ctcaaactcc caaccctgga gtataggcga ctgagaggtg atatgataca agtttttaag 1080 atgaatggaa tcgatagaat cccggtacaa agtttctttg ctccagtaga gcaatcagtc 1140 actagaggtc acagttttaa actacaagtt cccctggcca agaccagggc gaggagtcag 1200 gtcttcagtg ttagaaccgt gtcgagctgg aacgccctac ctgagtcagt ggtttcagcg 1260 aaaagtgtaa accagtttaa atccagactt gacaaacact gggattgcca aaagtatgtc 1320 acatgagagc gaagtcaaga tcaggacata acaggcggag gcctactttc ctggacacag 1380 tatcaaggta tcaaggtat 1399 // ID PENELOPE repbase; DNA; INV; 2780 BP. XX AC . XX DT 27-JAN-1997 (Rel. 2, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 2) XX DE Drosophila virilis Penelope transposable element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW transposon; LINE. XX NM PENELOPE. XX OS virilis group OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. XX RN [1] RP 1-2773 RA Evgen'ev B.M., Zelentsova H., Shostak N., Kozitsina M., RA Barskyi V., Lankenau H.D. and Corces G.V.; RT "Mobilization of multiple transposable elements during hybrid RT dysgenesis in Drosophila virilis."; RL Unpublished. XX RN [2] RP 1-2773 RA Evgen'ev B.M., Zelentsova H., Shostak N., Kozitsina M., RA Barskyi V., Lankenau H.D. and Corces G.V.; RT "PENELOPE."; RL Direct Submission to Genbank (13-FEB-1996)M.B. Evgen'ev, Biology, RL The Johns Hopkins University, 3400 N. Charles St., Baltimore, MD RL 21218, USA. XX RN [3] RP 1-2780 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [3] (Consensus) XX CC [3] Drosophila virilis Penelope transposable element. Also in D. CC americana, and D. montana of the virilis group. This is a new CC consensus of 10 copies in D. virilis which 100% matches the 2 CC copies that transformed D melanogaster (good for me). The CC original contained a copy of a Helitron DNA transposon at the 3' CC end and a fraction of a second Penelope at the 5' end. Most CC copies appear to lie in tandem repetition in the D. virilis CC genome. XX FH Key Location/Qualifiers FT CDS 60..2564 FT /product="PENELOPE_1p" FT /translation="MSYAKIKTKYKDSKRTINKFQLTLVKLTKLKSSLKFL FT LKCRKSNLIPNFIKNLTQHLTILTTDNKTHPDITRTLTRHTHFYHTKILNL FT LIKHKHNLLQEQTKHMEKAKTNIEQLMTTDDAKAFFESERNIENKITTTLK FT KRQETKHDKLRDQRNLALADNNTQREWFVNKTKIEFPPNVVALLAKGPKFA FT LPISKRDFPLLKYIADGEELVQTIKEKETQESARTKFSLLVKEHKTKNNQN FT SRDRAILDTVEQTRKLLKENINIKILSSDKGNKTVAMDEDEYKNKMTNILD FT DLCAYRTLRLDPTSRLQTKNNTFVAQLFKMGLISKDERNKMTTTTAVPPRI FT YGLPKIHKEGTPLRPICSSIGSPSYGLCKYIIQILKNLTMDSRYNIKNAVD FT FKDRVNNSQIREEETLVSFDVVSLFPSIPIELALDTIRQKWTKLEEHTNIP FT KQLFMDIVRFCIQENRYFKYEDKIYTQLKGMPMGSPASPVIADILMEELLD FT KITDKLKIKPRLLTKYVDDLFAITNKIDVENILKELNSFHKQIKFTMELEK FT DGKLPFLDSIVSRMDNTLKIKWYRKPIASGRILNFNSNHPKSMIINTALGC FT MNRMMKISDTIYHKEIEHEIKELLTKNDFPPNIIKTLLKRRQIERKKPTEP FT AKIYKSLIYVPRLSERLTNSDCYNKQDIKVAHKPTNTLQKFFNKIKSKIPM FT IEKSNVVYQIPCGGDNNNKCNSVYIGTTKSKLKTRISQHKSDFKLRHQNNI FT QKTALMTHCIRSNHTPNFDETTILQQEQHYNKRHTLEMLHIINTPTYKRLN FT YKTDTENCAHLYRHLLNSQTTSVTISTSKSADV" XX SQ Sequence 2780 BP; 1203 A; 541 C; 434 G; 602 T; 0 other; ggtcgccaga gccatcaata aatatcaacg gaaggcacgc cgtatgcaca gcaaccaaca 60 tgagctacgc aaaaataaaa actaaataca aggattcgaa aagaacaatt aataaattcc 120 aactaacact ggtaaaatta actaaactta aatctagttt aaaatttttg ttaaaatgta 180 gaaaatcaaa tttaatacct aacttcatca aaaacttgac acagcatttg accatactga 240 ccactgacaa taaaacccac cctgacataa caagaacatt gactagacac acacattttt 300 accataccaa aatattaaac ttacttataa aacacaaaca caacctatta caagaacaaa 360 caaaacatat ggaaaaagca aaaacaaaca tagaacaact gatgaccaca gatgacgcaa 420 aagcgttttt tgagagcgag agaaatatag aaaacaaaat aacaacaaca ctcaagaaaa 480 gacaagaaac gaaacacgat aagttacgag atcaacggaa cctagcctta gcggataaca 540 acacgcaaag agagtggttt gtaaacaaaa caaaaataga attcccgcca aacgtcgtag 600 cgttactcgc aaaagggccg aagttcgctc tcccaatcag caagagagat tttcctctct 660 tgaaatacat cgcagacggt gaggagctag tgcaaacaat aaaagaaaag gaaacacaag 720 agtcggcgcg cacaaaattc tctttgttag tcaaagagca taaaaccaag aacaaccaaa 780 acagtaggga tcgagcaata ctggacacag tggaacagac acgaaaatta ctgaaagaaa 840 atataaatat taaaattcta tcgtcggata agggcaacaa aaccgtagca atggatgagg 900 atgaatataa aaataaaatg acaaatattt tagacgactt atgcgcgtat agaacattga 960 gactggatcc gacatcaaga ctacagacaa agaataacac cttcgtagca caattattca 1020 agatgggtct tatttcaaag gacgaaagaa ataagatgac tacaacaaca gcggtacctc 1080 cgaggatata tggactacca aaaatacaca aggaaggaac tccactgaga ccaatatgtt 1140 cttccatagg atctccatct tacgggctgt gcaaatatat aatacaaata ttaaaaaatc 1200 tgacaatgga ctctaggtac aacatcaaga acgcggtaga ttttaaagac agagtcaaca 1260 actcccagat tagagaagag gaaacattag tatcttttga cgtagtatcc ttatttccca 1320 gcataccaat agaattagca cttgacacaa taagacaaaa atggaccaaa ttagaagagc 1380 acacgaatat accgaaacaa ctatttatgg acatagttag attttgcata caggaaaaca 1440 gatatttcaa atacgaagac aaaatataca cacaacttaa gggaatgcca atgggatcac 1500 cggcttcccc agtaatcgca gatatattaa tggaggaact gttggacaag attacagata 1560 aattaaaaat aaaaccaaga ctcttgacca aatatgtaga tgaccttttt gccataacga 1620 acaaaataga cgtggaaaat attctaaaag aattgaattc cttccacaaa cagataaaat 1680 ttacaatgga attagaaaag gacgggaaat taccattttt agactctatt gtaagcagaa 1740 tggacaacac actcaaaata aagtggtata ggaaacccat agcctccgga cgaatactca 1800 acttcaattc aaaccaccca aagagtatga taatcaatac agcactaggc tgtatgaata 1860 gaatgatgaa aatatcggac acaatatacc acaaagaaat tgaacatgaa atcaaagaac 1920 ttttgaccaa aaatgacttc cccccaaata taatcaaaac attattaaaa agacgacaaa 1980 tcgaaagaaa aaagccaaca gaacctgcta aaatatacaa atcactaata tatgtaccac 2040 gactatcaga acgcctcaca aactcagact gttataacaa acaagatata aaagtagcac 2100 acaaaccgac gaatacatta caaaaattct tcaacaagat aaagtcgaaa atcccgatga 2160 tcgaaaaaag caacgtcgtt taccaaatac catgtggcgg ggataacaac aacaagtgca 2220 atagtgtcta cataggtaca acaaaatcga agctaaaaac aaggattagt caacataaat 2280 cggacttcaa actaagacat caaaataata tacagaaaac agcacttatg acccattgta 2340 taagaagcaa ccacacacca aattttgatg aaacaacaat cttacaacaa gaacaacact 2400 ataacaagcg acacacattg gaaatgctac acataattaa cacaccaacc tacaaacgac 2460 taaactacaa gacagacaca gaaaattgcg ctcacttgta cagacacctc ttaaacagtc 2520 aaacaacctc agtaacaatc tccacgtcaa aaagcgcaga cgtgtaaaat aatgtatgta 2580 aaatgttcga aataatgttt aatttattgt attataattg ttaattgttt tttgtatctt 2640 ggtgttagtg ccctgaagac ggtttgccga tgtgcaaccg aaatatatcg gaagagaatt 2700 gaataaaatt gtttttcatt gtttgtttta acaaacacgg acctcgagcc agccaacaaa 2760 taaatattga aatatggaaa 2780 // ID piggyBac-10_SM repbase; DNA; INV; 2648 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-10_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2648 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 529-529 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-10_SM is a young family of piggyBac transposons, CC characterized by 14-bp TIRs and TTAA target-site duplications. CC The consensus sequence was reconstructed based on multiple CC alignment of 10 copies, which are ~97% identical to the consensus CC sequence. XX FH Key Location/Qualifiers FT CDS 784..2484 FT /product="piggyBac-10_SMp" FT /note="piggyBac transposase." FT /translation="MSRRLRSANVLLEKINDIDAECSEDDQEILDMEIDNT FT ENYDPLETDSNTDENEENISNVCRRRKRRRVLVSSDSENGEEVSEIAMDGT FT VWQEVKTGSNPGRAPSHNIFRETSGPTGYSKRNIMKGEVRTAFSVIIDKNI FT IELIRKCTETEAFRVLGYKWELSTAKLYAFFAILYARGAYEAKNIDIALLW FT NKKWGPPFFSNTMSRHDFTEIMRFIRFDDRNQRSQRLQTDKFAMISEVWYK FT FIDNSQNCYKPGPYITIDEQLFPTKARCRFTQYMPNKPDKFGIKFWLASDV FT RSKYIINGFPYLGKDESREPSVPLGEFVTMKLAKPYVGCGRNITTDNFFTS FT LPLATKLLAKKTTIVGTIRANRKELPRLAKLKNDDMALFSTKLYRSNNCML FT TVYKAKPRKKVLILSSMHNSVQVEENDTRTPETIQLYNSTKFGVDVTDQMA FT RNYSVKSKSRRWPLQVFFNILDLAGINAWVLYKETTGGEITRQEFLFQLAE FT ELAIEYQQELGKENQPIPIASTSTGSRERKTCQIRYCKDNKANKICLKCKK FT YVCGKCTVEKPICKKCDEKE" XX SQ Sequence 2648 BP; 913 A; 429 C; 524 G; 782 T; 0 other; cactattcct accgggaccg gtcaaatgac cgtttttaaa ttttcaattg gaatttccta 60 cattcaacgt aagcgaatcg cctttaaact ttatgacttt tcctcatata tatgtatgat 120 acatttttca atattcactt atttttttat tgtttatttt ctgagaaaaa tttgtagtct 180 gtcttaccta ccacgaccgg tcatttgacc ggtggtacga tttatacatt tttgtaatag 240 tttttctcta agactcaatg ccaaaactat aaagtcgtta caaacgaaag gtaatgtagt 300 tgtggcatga ttttatccta taagtgcctt ccagaaaatg gtttacggca cttccaagtg 360 tttgcagttc tcgctcgact atcgggttag cggtgtttat ttgcttgatt gatctgatac 420 ggtccacttc ctcgaatttt ctcacgcaat cttatctact atgacggaaa agaaatgaac 480 tcataactga aggaaatgta tagacgtctt ctcacgaatt caaatcagta gagtcttgca 540 acacgaacag agtgcatcga aacgtgaaag tcacaaagag catacttggc agtcttttat 600 tattatttta taatatttta tatcgtatta cgtgttgtaa gtgtcatttt caaaatagtg 660 aacaagacta ataatacgtc gcagtaagtt tcacatttta tcttttattg atttttttat 720 ctatattgat ttacattttc atattttagt ttttattttt tccgttgaat atttgtcccc 780 aaaatgtcaa gacgtttgag gagtgcgaat gtattattag agaaaataaa tgatatcgat 840 gcagaatgtt ctgaagatga tcaagaaatt ctggatatgg aaatcgacaa tacggaaaat 900 tatgatccac ttgaaaccga ctcgaacacc gatgagaacg aagagaatat ttcgaacgta 960 tgtagacgac gaaagagaag gagagtttta gttagttccg atagtgagaa tggcgaagag 1020 gtaagcgaaa ttgcgatgga cggaactgtt tggcaggaag taaaaacagg gtctaatcct 1080 ggaagagcac ctagtcataa tatttttagg gaaacatcag gcccgactgg gtatagtaaa 1140 cgtaatatta tgaaaggcga agttaggacg gcattttctg tgataattga taaaaatata 1200 atcgaactta taagaaagtg cacagagaca gaggcattta gagtgttggg atataaatgg 1260 gaactgtcta ctgcaaagtt atatgcattt tttgcaattt tgtatgcacg cggcgcatat 1320 gaggctaaga atatagatat agcacttcta tggaataaaa agtggggccc gccttttttc 1380 tcaaatacta tgagtagaca tgactttacg gagattatga gatttattcg atttgacgac 1440 agaaatcagc gaagccaacg cttacaaacc gataaattcg ctatgatttc ggaagtttgg 1500 tacaagttta ttgacaacag tcaaaattgt tataaaccag ggccatatat tactattgac 1560 gaacaattgt ttccaacgaa agcaaggtgt agatttactc aatacatgcc caataaaccg 1620 gacaaatttg gtataaaatt ctggttagca tccgatgtac gtagcaaata tataataaac 1680 ggctttccat atttgggaaa agatgaaagt agagaaccct cagttcctct tggagagttt 1740 gtcactatga aattagcaaa accgtatgta ggatgtggaa gaaatataac cacggacaac 1800 ttttttacaa gcctaccttt agcaacaaaa cttttagcaa aaaaaactac gattgttggg 1860 acaattagag caaatagaaa agagctacca agactggcaa aactgaaaaa cgacgacatg 1920 gcacttttct caaccaaact ctatcgctca aataattgta tgttgacagt ttacaaagcc 1980 aaaccaagaa agaaagttct aattctcagt tcaatgcaca attccgtaca agttgaagaa 2040 aatgacaccc gaacaccgga gacaattcag ttgtataata gcaccaaatt tggcgtagat 2100 gtgaccgacc aaatggccag aaattattcg gtaaaatcta aatctcggag atggcctttg 2160 caagtatttt ttaatatctt ggatttagcc gggataaatg catgggtgtt atataaagaa 2220 actacgggag gagaaattac aagacaggaa tttctattcc aattggcaga agaacttgcc 2280 atcgaatacc aacaagagct cggcaaagaa aatcaaccaa tacccatcgc aagtacaagt 2340 acaggttccc gtgaacggaa aacatgtcag ataaggtatt gtaaagataa taaggcgaat 2400 aaaatctgtc taaaatgtaa aaagtacgta tgcggaaaat gtacagttga gaagcccatt 2460 tgtaaaaagt gcgacgaaaa agaataaaat gtagtttatg tactcttgaa taaaagaaaa 2520 tatattttta tactggtaag ctggctattg gtctgcattc aatattgaaa aatataagtt 2580 gtgaaagacc ggtcatttga ccggccgcgg taggaatagg tatacgtgaa gtgtcggtag 2640 gaatagtg 2648 // ID Gypsy-227_AA-LTR repbase; DNA; INV; 269 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-227_AA_; KW Gypsy-227_AA-I; Gypsy-227_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-269 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1058-1058 (2011). XX DR [2] (Consensus) XX SQ Sequence 269 BP; 71 A; 59 C; 59 G; 80 T; 0 other; tgtgggaata acatcctatg cacccctacg tgttctccca gttacgcact gcaacccgag 60 gatcctccga tcaagcagag gcaaagcata cgaagcagtt agtgttcgaa tgtgagagca 120 accgttcggt cgcgtttaat ttgcttcaat atacttttaa attgattcgc cgtcttttaa 180 aaggtctcga tatccgaagt tgtcatctgg aacgcgtatc ctttctttta gtgcgggaaa 240 agttgagcta gtagttaaga tttactaca 269 // ID piggyBac-N4_BF repbase; DNA; INV; 942 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-N4_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW piggyBac-N4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-942 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-942 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-942 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-N4_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC Terminal parts are similar to piggyBac-1_BF. XX SQ Sequence 942 BP; 336 A; 140 C; 157 G; 309 T; 0 other; ccctcatccg accgtatggg gtcatttttg accccaggcg tatattttaa tctgccatag 60 caacattttt catcagagaa aaaattcctt ccatgaattt gtttgtatac atgtcttaca 120 acatctacca atttttcgcc gagttttgcc cattattgta taagctatac ttgaaagttt 180 gtgtcttgtt cggtggggtc aaaaatgacc ccagccaatt atgaatcatt aattacataa 240 atatcaagat atttcaataa aataacaggc attcctcatt attgctattg cagcaacagg 300 ttatagttgc ttagatgagt aaacaccgaa tattttgaaa gaaatttagt attttaagtg 360 taaaacacag tttttcgaca ttctgcataa attatgataa tacgcactaa ttacctatgc 420 taatgaaagt gaaaatgttt ctaattaatt aaaaaacaat atatggacag aatgtgcaga 480 aggaacctac aagaaagatc tttgcgggaa aatagacaag agttattgta tgtatgtgtt 540 tagatggcgg aatatggtgc ataattaagt aataaattcg ttacttttca caaatattgt 600 attttttctt gttaaataac attattatgc tatcagcatt attgcatcat cacaattatt 660 gtttagaaat tgataaaaca aaacattgtt taagtaagtt tagtatggta gattttcaga 720 gcaaatgtgg gttttcctca ttttctgcat aaattatgat aatgagtaat aactactgac 780 accaaaactt gtcaaaatat ttttcatagg ttagtgaaac agtaaataca tagaaatgac 840 aaaataagcc agcagaaata tgtctatgag caaaacaaaa tggggtcaaa aatgacccca 900 tacggtcaga ttcgtcgtaa aaatgtaacg gtcggatgag gg 942 // ID RTEX-14_BF repbase; DNA; INV; 5897 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-14_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-12_BF; KW RTEX-14_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5897 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5897 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1731-1731 (2009). XX DR [2] (Consensus) XX CC The complete RTEX-14_BF consensus sequence contains two ORFs. The CC RTEX-14_BF ORF1 protein contains the esterase domain. The 3' CC terminus is composed of the (TA)n microsatellite. XX FH Key Location/Qualifiers FT CDS 1..2412 FT /product="RTEX-14_BF_1p" FT /note="esterase." FT /translation="MGAESKTPKGKTPTTKPWNINLSKVPLDETLAFDYED FT VEKSRLCGVKLRTSQLDKWIQLHKDNFDKNFEDHKGTTVTWKSTQGTTSLS FT VNFSSATKTGKSLLSVHFYHKKNGLMVQGQRTKWWVDLLYPELKNNVTMLD FT KCTRASAKDGVQRVRDAVNSFQTISTEQPLPKTPCPSNVTETPTMPKSTAP FT RTLEGLETPTGSSFTEQLQQDFVNGAQSAGDAITGDSRDPPVTSGVTSETN FT ESSDTRNVTLTGTSNSPVEVYLERVISVEKSIKASDSAFNDLQKQYVDTIG FT KLNAFRDELIAKEKEREKAHMKSLNAISTQIEAAQKALESHIRKATAKLQT FT DLNNSRKDITLQKEKLMKEKSEAKQEISIERNLWKERLHSMSLKVDDLAAS FT NADLQDIITKQDSEIQVLVDRYKDLEQMFATCHCGVTHSLHDNKAPSSQNN FT DRSTSHDNGFTPVNRKHNKQTIVPDDQTPAVCTNMPQSVNNHVRSENAGVE FT IRIFSDSLFRDVDPDRAFKANHTKIHRSSTISAAVDNISNIKDSTTKTVIL FT HVGSNDLDNSKGHSDSVHKTLRNTDRLLEATKKSFPNARVAVSQVLQRGPN FT QRSTLNENIKSYNQEVLKLSRNSGFTYIKHRKLTQDRHLYLPDQIHLDPRS FT GTKLLVADVKRTLNASPPTPTAGPPQLPTPRPQRGYRSSDQHAHPRPQTAA FT TKRDNAIPTRDWFKNSVNAIPLGSRKPVSPASLTSNRNPTTGSRPGGTQQV FT VTSPITTGSRSNGTLQVPTSPIATLQLRPTKQTVLQWKQIGKRFRKAWDIL FT TS" FT CDS 2769..5762 FT /product="RTEX-14_BF_2p" FT /note="AP endonulease, RT." FT /translation="TYQVIVFFTSSRIRSKGAKRNSGGVAVIFKSLYRRGI FT SKLVSSTVDTVWCKFDKQFFGLENDLYLCNAYIPPESSSKSSNESPFEILS FT NDICKYSNLGDIVVLGDLNARTGQLIENHFDTFTFDNPITDGLCFTKNRQN FT RDIKVNNYGKTLIDLCSAADLTILNGRFAGDLKGDFTCYHYNGSSVVDYCI FT VRNSLLSDVQYVNVSPISEFSDHCHVSFSLSTXXXXXXXXXXXXXXXXXKT FT VTEYLNQLDDHTPNPLDFPFTPVEVLNGLKLLKTGKASGIDSISNEMLKYG FT AKHLCQPLVSLFNTILDKGTFPSNWNTSILTPIHKSGDKSNADNYRGIAIS FT SCLSKLFTLILNNRLQNFAERNKLLDDTQFGFRKRCRTSDNVFILKSLIDK FT YTKKKGGKLFVCFVDMRKAFDSVWRDGLFYKLHKCGIGGNFFNIIKSMYND FT VNYAVRLDNGLSNSFPSVCGVRQGCNLSPLLFDLFINDISECFDPTKCDPA FT VLTSKSLNCLSWADDLALISSSKQGLQHCIDCLENYCKKWKLYVNVSKTKV FT VVFGKGSSKNNKDQFHIYGKNIEITDSYTYLGIPFTFSGKFKTARKYLKTK FT AMRALFKLKALLLSNKDISINLGKNLFDKFVKPIFMYCSEITCFDRFSKSI FT RIIVSHTNNICEPSSKVFSTLLKKLNIQDNFPATIRKTTSSALNTTYVVCL FT ERNTDKEILLRHANSHILRDNGFQIINLDSNQQIPEFDIIDMRFQKFLLGI FT HAKASNDGVRGELGTFPARIDAEMQLVKYWHRLVNLPEDALLREAYNTVSF FT GEHDWIVHVKNILNYQGFGHIWLNPRSYHIDTITTQLRQRLQDIYVQNWHS FT SIRNNSKLSSLSTLNKHYKQENYISTIKSLDVRRAITQLRIGCHKLNIETG FT RFQKIPIDQRVCPFCPELIEDEYHFMVVCPKYSDIRNSLYRRLSFCTQGFI FT QMDLKCKFRYIMTCDNPCMADIGKYIKEGFDIRNSPNDCKSLQ" XX SQ Sequence 5897 BP; 1947 A; 1305 C; 1089 G; 1506 T; 50 other; atgggagccg agtccaagac gccaaagggt aagacgccca cgacgaaacc atggaatatc 60 aatctctcca aagtcccgct ggacgaaacc ctggccttcg actacgaaga cgtggaaaag 120 tctcgactgt gtggagtgaa actgcggaca agccagctag acaagtggat tcaacttcac 180 aaggacaact ttgacaagaa ctttgaagac cacaaaggaa caacagtgac ctggaaatcc 240 acgcaaggca caacaagctt gtccgtcaac ttctcatcgg caacaaagac aggtaaaagc 300 cttctatcag ttcactttta ccacaagaag aatggtctaa tggttcaggg tcaaagaacc 360 aagtggtggg tagatttact gtaccctgaa ctgaaaaaca atgtaacaat gcttgacaaa 420 tgtacaaggg caagcgccaa agatggcgtc caacgcgtga gggacgccgt caatagcttc 480 cagacaatct ctacagaaca gcccttgccc aagacaccat gtccatccaa tgtgactgag 540 acaccgacta tgcccaaatc caccgctcca cgaacattag agggtcttga aactccaact 600 ggtagcagtt tcacagaaca gctgcaacag gattttgtaa atggcgccca aagtgccggt 660 gacgccataa ccggtgactc acgcgacccc ccagtgacat ccggcgtaac ttctgaaaca 720 aacgagtcaa gtgacacacg taacgtaaca ctaacaggta catcgaactc tccagtagaa 780 gtgtatctag aacgtgtgat ttctgttgaa aagtccataa aagcgagtga cagtgccttc 840 aacgatctac aaaaacaata cgtagatacg atcggcaaac tcaacgcctt cagggacgag 900 ctaatcgcga aagaaaagga gcgagagaag gctcacatga aaagtctgaa tgccatatcc 960 acacagatcg aagccgctca gaaggcactt gaaagtcata tcagaaaggc taccgcaaaa 1020 ctccagacag atctgaacaa ttctcgcaaa gacattactc tacagaaaga aaagctcatg 1080 aaggagaaat cagaagcaaa gcaagagatc tcgatcgaaa ggaacctatg gaaagagaga 1140 cttcatagta tgtccctgaa agtagatgat ttggcagctt caaatgccga cctacaagac 1200 atcataacaa agcaagattc agagattcag gtactggtgg acaggtataa agatttggaa 1260 cagatgtttg ctacctgcca ttgtggtgtg acgcactctc tacatgacaa caaagcccca 1320 agttcccaaa acaatgacag gtctacttca catgacaatg gctttactcc cgtgaacagg 1380 aagcacaaca aacaaaccat tgttcccgac gaccagacac cagctgtctg taccaacatg 1440 ccccaatctg tgaacaacca tgtgcgaagt gagaatgccg gggttgagat aagaatcttt 1500 tcagattcac tttttcggga tgtggatcca gatcgcgctt ttaaagcaaa ccatacaaag 1560 atccacagga gcagcaccat aagtgcggcc gttgacaaca tcagcaacat caaggattcc 1620 acaaccaaga ctgttatttt acacgttgga tctaacgatc tagataacag caaaggacac 1680 tcagactcag ttcacaagac gctccgtaac actgacaggc tgctagaagc gacaaagaag 1740 tccttcccaa atgcgagggt ggctgtgtcc caagtactac aaagaggacc gaatcagaga 1800 tccaccctga atgagaatat caaatcatac aaccaagaag tactcaaact atccaggaat 1860 tctggcttca cgtacattaa acacaggaag ttaacacagg acagacatct ctacctgcca 1920 gaccaaatac acctagaccc cagaagtgga accaaactac tagttgccga cgtcaaacgc 1980 acactcaatg catcgccacc tacaccaaca gcgggcccgc cacagcttcc gacgccgcgc 2040 ccacaacgtg gataccgctc ctccgaccaa cacgctcacc caaggcctca aactgccgcc 2100 accaaacgtg acaatgccat cccgacacga gactggttca agaacagtgt gaatgccatt 2160 ccacttggat cgaggaagcc tgtcagtcct gctagtctga catccaaccg aaaccctaca 2220 acaggaagcc gacctggtgg aacccaacaa gtcgtgacct caccgatcac cacgggaagc 2280 cggtctaatg gaactctaca agtcccgact tcaccgatcg ccacgctgca gttacgaccc 2340 accaaacaga cagttctcca gtggaagcag attggaaagc gtttccgtaa agcatgggac 2400 attctcacat cttgatttct gggaccttaa atgaactttg ataccatcct tcacagacat 2460 gtaaagccgt agaggtaaag ttattcctta tcttcatact atgttcactc tcaattctag 2520 aggagtttat tctccttttt tttacaacta tttcgtatgc atatgtcata ataccgatcc 2580 catgtagata ttgtgtactc ctattcaaat acggaggatg ccaaaaacat ctgcactagt 2640 attttcttct tggaatatcc agggaagctg ttgcaagaaa ttcaaagatg acgaatttct 2700 ttcatatgta gaaaatagcg acatcatttg tgttcaagag acctggctag acagttatgt 2760 agatttagac ataccaggtt atagtttttt tcactagcag tagaattaga agtaaagggg 2820 ctaaacggaa ttcaggtggg gtagcagtta tttttaaaag tttgtacagg agagggattt 2880 ccaaattagt tagtagtacg gtagatactg tttggtgcaa gtttgacaag caattttttg 2940 gcttagaaaa tgacttatat ttatgtaatg cctatatccc ccccgaatca tcttctaaat 3000 cgtctaatga atctcccttt gaaatactct ctaatgacat atgtaaatac tctaatttag 3060 gagatatcgt cgtgttaggc gatttaaatg caagaactgg ccagttgata gaaaatcatt 3120 ttgacacttt cacctttgac aatcctataa cagacggtct ctgtttcacc aaaaacagac 3180 agaacagaga tatcaaagtt aacaactacg gtaaaacatt gatagatttg tgttcagcag 3240 ccgatttaac tattctaaat ggtagattcg cgggggacct aaaaggcgat tttacctgtt 3300 atcattacaa cggttctagt gttgtcgatt attgtattgt gcgcaattct ctattatccg 3360 atgtacaata tgtaaatgtt agcccaatct ccgaattttc tgatcattgt catgtttcct 3420 tctcattgtc aacannnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3480 nnnnaaaaac ggtcactgaa taccttaatc agttagatga tcatacccca aatcccttag 3540 atttcccatt cactccagtg gaagtcttaa atggtttaaa attattaaaa acgggaaaag 3600 cttccggtat cgactcaatc tcaaatgaaa tgttaaagta tggtgcaaaa catctatgtc 3660 aacctttagt gagtcttttc aatactatac ttgataaagg cacctttcct tccaactgga 3720 atacaagtat tcttacgcca atacataaat caggtgataa aagtaatgca gacaactatc 3780 gaggaatagc gatttctagt tgtctatcca aattattcac actcattcta aacaacaggt 3840 tacaaaactt tgcagaaaga aacaaacttc tagatgacac acagtttggg ttccggaaaa 3900 gatgtagaac ctccgacaat gtttttatct taaaatctct aatagacaaa tacaccaaga 3960 aaaaaggcgg aaaactattc gtatgctttg ttgacatgag gaaggcattt gacagtgttt 4020 ggcgagatgg gctattctat aagttgcata aatgtggtat cggaggtaat ttctttaata 4080 ttataaaatc catgtacaat gatgtgaatt atgcagttag attagataat ggattatcaa 4140 actcctttcc ttccgtatgc ggagtacggc agggatgcaa tcttagtcct ttattatttg 4200 atttattcat caatgatata tctgagtgct ttgaccccac aaaatgtgac cctgcagtcc 4260 taacctcaaa atctctaaac tgtttatcct gggcagacga tctcgctctg atttcatcgt 4320 caaaacaagg gctacagcat tgtattgact gcctagaaaa ctattgtaag aaatggaagc 4380 tatatgtcaa tgtttcaaaa accaaggtag ttgtttttgg caaaggttct agcaagaata 4440 acaaagacca attccatatt tatggcaaaa atatagaaat cactgacagc tatacatacc 4500 tcggcattcc tttcacattc tccggaaaat tcaaaaccgc caggaaatat ttgaaaacca 4560 aagcaatgag agcacttttc aaactcaagg ctcttttact ctctaacaag gacatttcaa 4620 tcaatctcgg aaaaaatcta tttgacaaat ttgtaaaacc catctttatg tactgttcag 4680 aaatcacttg ctttgaccgt ttttctaaaa gtataagaat cattgtatca cataccaata 4740 acatctgcga accttcctca aaggtttttt ctactttact taaaaaactt aatattcaag 4800 ataactttcc agctaccatt cgaaaaacca cctcttctgc tttgaatact acttatgttg 4860 tatgtttgga aagaaataca gataaagaaa tactattgcg tcatgccaat agccatattt 4920 tacgagataa cggtttccaa attataaacc tagactcaaa tcaacaaatt ccagaatttg 4980 acatcattga catgcgcttc caaaaatttc tgttaggcat tcatgccaaa gcctcaaacg 5040 atggtgtcag gggagaattg ggaactttcc cagctaggat tgatgcagaa atgcaacttg 5100 tgaaatattg gcatcgttta gttaatctgc ccgaagatgc ccttcttcgt gaggcataca 5160 atactgtttc attcggagaa catgattgga ttgtccacgt aaaaaatatc ctcaactacc 5220 aaggctttgg acacatatgg ctgaacccga ggtcatacca catagacaca atcaccactc 5280 agttacgaca acgtttacaa gatatatacg tgcaaaattg gcattcctct attcgaaaca 5340 attccaaact ctcatctctt tctacattga acaagcatta caagcaagaa aattacattt 5400 caaccataaa gagccttgac gttcgtaggg caatcacaca actcagaatt ggctgtcaca 5460 aattaaatat agaaaccgga agattccaaa aaatccctat tgaccagagg gtttgtccat 5520 tctgtccaga gctaatcgaa gacgaatatc actttatggt tgtatgtccc aaatattccg 5580 atatacgcaa ttctctatac agaaggctgt cattttgtac acaaggattt atccaaatgg 5640 acctgaagtg caaattcaga tacataatga catgtgacaa tccctgcatg gcagatatag 5700 gaaaatatat caaagaaggt tttgacatta gaaattcccc aaatgattgt aaaagcctcc 5760 aatgattgta agaaacttat cccatactac aactcctgta ttagttagat aagccctaag 5820 ttaagttata gcactgtagt tatgtagatg ccatactttg tacctcgtac aattgttgtg 5880 caataaagtt atatata 5897 // ID Gypsy-71_AA-LTR repbase; DNA; INV; 122 BP. XX AC supercont1.332; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-71_AA_; KW Gypsy-71_AA-I; Gypsy-71_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-122 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.332; Positions 367875 367996. XX SQ Sequence 122 BP; 41 A; 19 C; 20 G; 42 T; 0 other; tgttatgata ctgaatttta cccaaatatt attactttga atttgtattg tatgactgat 60 taaaaaatga ataaagctgg cagcattgcc attcgttcgc ggttcaccga caattgaaaa 120 ca 122 // ID hATm-10_HM repbase; DNA; INV; 3388 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-10_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3388 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 214-214 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(480..1742,1785..2321,2422..2982) FT /product="hATm-10_HM_1p" FT /translation="MAQNTRDKVKCKLTKLIGTGKTLLESELPTLRDVLRY FT GLFLRRIEENNQNLRLTSEKLWTSVKYLWQKANPLIPIISDKRGKERIIYD FT WKKAYNLALGRGTSKQLSDHKLILDKLFDIAKCKCSIKSCEKSDCYGCNQV FT FHIVCKCSQENKIPIIELKFLKSQRDKVGSKSGMQIGSVDKKEVSRQKKYH FT QRKQNKSKRVRIETVQYSDVLSQNDVESDNAENDYNDNLYLPEIKKVSQLN FT TLNVNHIALSALRFGTSNRATASIGTATLKAAKDAGFLKDHILDKHLALDH FT KKIERSKQRVMNVCKERHNNEVKKEGIQCIMFDGRKDLTKTVVYGNDGKSY FT PREIKEDHYSVCSEPGGKYLFHFVNNSEEREARTPAEHIAEQLYDWIIAHG FT IENQLVAIGGDSTNLNTGFIFISLNGSKFILGWKGGIIQFLEKKLGFKLIW FT IICALHTNELPLRHLMLALDGKTLSENRFSGNIGKLINIATNLPILTCIPQ FT LDFVVDLIHLEKEVVDSLSTDQKYLYRIHNAIASGVFPDDLKEIQIGPHNH FT SRWLNLANRLLRIWCSEHGLKNKDLQNLKLLTEFIMGVYSIMWFDLKVRNL FT VMTKHSWIQGPHHILKQLALVKKQNKFVRDIVLPHVCSSAWNAHSECILQT FT MLCSSDEGERRFAVQKIIDFRKNKDVGDRKCRNRRIPVINTESNNLIDLIN FT WETEEISEPVLTCCLMRDDIIQFINNPMVVPNFPIHGQSIERIVKEVTRAS FT MSVXGAERRDGFIRATMAHRELMPVCQSKKDLMKIL" XX SQ Sequence 3388 BP; 1256 A; 463 C; 582 G; 1086 T; 1 other; tagggtgggt cacttttgaa atatttttga atttaggttc ggctgggcat ttttcttatg 60 tctttgaccg tatgataatg agaaccaatt ctctttttat ttgcatttaa tgtttagtca 120 ggtaattttc agcttgaaaa ttttagtttt atcataaata tataatttcc ccagagtttt 180 gtcaattttt ttacttttgc catcttcaaa ccattttaaa tcaataaatg tcttaaacta 240 cacctatttt atcaataaaa atgtcaaaat atgatcctaa aatattgcaa ctaagggggg 300 tattttacca tagaaatggg taacagttaa ttttttaaaa tgtattgatg acgtatttct 360 gaaaattttg tttatgtacg ctaaagtcgg accactagta acagttttat tagttattaa 420 aaggaagaaa taataacgca tgtatttatt gtatagaaat aagtttacag ttataaagaa 480 tggctcaaaa cactcgtgat aaagttaaat gtaaattaac aaagctaatt ggtacaggaa 540 aaacattatt agagtcagaa ctaccaactt taagagatgt tttaagatat ggtctctttc 600 ttagaaggat tgaagaaaat aaccaaaatt taaggctaac ttctgaaaaa ttgtggacct 660 cagtaaaata tttatggcaa aaagctaatc cacttatacc cattatctct gataaaagag 720 gaaaagaaag aatcatatat gactggaaaa aagcatataa tctggcgtta ggtaggggaa 780 catctaagca actttctgat cataaattaa tcttagataa actgtttgac atagctaagt 840 gtaaatgttc aataaaatca tgcgaaaaat ccgattgtta tggctgcaac caagtttttc 900 atattgtttg caaatgttct caggaaaata aaattcctat tattgaactt aaatttttaa 960 agtcacagag ggataaagtt ggttctaaaa gtggaatgca aataggttca gttgataaaa 1020 aagaagtatc cagacaaaaa aaataccatc aaagaaagca gaacaaatca aagagagtta 1080 ggattgagac tgtacagtat tctgatgtat tatctcaaaa tgatgtggag tcagacaatg 1140 ctgagaatga ttataatgac aatctttatt taccagaaat aaaaaaagtt tctcagctaa 1200 ataccttaaa tgtaaatcat atcgctttgt cagcattacg atttggtaca agtaacagag 1260 caactgcatc aataggcact gcaacattaa aagcagccaa agatgcaggt tttttgaaag 1320 atcatatact tgataaacac ctagctctag atcataaaaa aattgagaga tcaaagcaaa 1380 gagtaatgaa tgtctgtaaa gagagacata ataatgaagt aaagaaagag ggcatacaat 1440 gtataatgtt tgatggtaga aaagacctta ctaaaacagt tgtatatggc aatgatggta 1500 aatcttaccc tagggagatc aaagaagatc attatagtgt ctgttctgaa cctggtggaa 1560 aatatttatt ccactttgta aataattcag aggagagaga agctcgaaca cctgctgaac 1620 atattgctga acaattatat gattggataa tagcacatgg cattgaaaac caattagttg 1680 caattggagg agattcaacc aaccttaaca caggttttat ttttatttct ttaaatggta 1740 gttaaaaatt ttatgtactt cttttttttt tttactgaaa ataaaaattt attttaggtt 1800 ggaagggtgg aattattcaa tttttagaga aaaaactagg attcaaactt atctggatta 1860 tttgtgcgct acataccaac gaattgccac ttaggcattt aatgttagct ttagatggaa 1920 aaactttatc tgaaaataga ttctcaggaa acatcggaaa acttattaac atagcaacaa 1980 atcttcctat tttgacatgc atccctcagc ttgattttgt tgtagaccta attcatctcg 2040 agaaggaagt tgtagattca ctgtctacag atcagaagta tttatacaga atacataatg 2100 caatagcatc aggtgttttt cctgatgact tgaaggagat tcaaattggt ccacacaatc 2160 attcaagatg gttaaatctc gcaaatagac tacttcgaat ctggtgctca gagcatggtt 2220 taaaaaacaa agatctacaa aacttgaaat tgttaacaga gttcatcatg ggtgtttatt 2280 cgattatgtg gtttgatctt aaggtaagaa acttagttat gtaaataaag atataataca 2340 ataatttact taaatactat aatttacttt aataattgta aaacaattgt gattatgcta 2400 aagaaaagta attgacttta gactaaacat agctggattc aaggtccaca tcatatcctt 2460 aagcaactag ccttggtaaa aaaacagaac aagtttgtga gggatattgt attaccgcat 2520 gtctgctcct ctgcctggaa tgcacacagt gaatgtattc ttcagacaat gctctgtagt 2580 tcagatgaag gagaaaggag atttgcagtg cagaaaataa ttgattttag aaaaaataag 2640 gatgttggag acagaaaatg caggaataga agaataccag tgataaacac tgaatcaaat 2700 aatctaatag atttaatcaa ttgggaaaca gaggagatca gtgaaccggt tcttacttgt 2760 tgtttgatga gagatgatat tatacagttt attaataatc caatggttgt accaaacttc 2820 cctattcacg gtcaatctat tgaacgcatt gtgaaagagg tcaccagggc ttcaatgtcg 2880 gtgtktggtg ccgagagaag agatgggttc ataagggcta caatggctca tagagagctg 2940 atgcctgtgt gccaatcaaa aaaagatctg atgaaaattt tgtaatcagt attttaactt 3000 tttattatga gtaaacaatt tattttagaa aaatatgcgg aaaatgctgt tatcaatatt 3060 ttaatgtctt attacgtgta tagacttaat ataccataaa attttaattt ttgatgtaac 3120 tttaatatat tgttgatgta actaggttaa ctaaggctca ggaatctttt tagactttat 3180 tttattgagt agtagcaaac catttttcat aacttcgcat aaacgatata tttatgataa 3240 aacttaaatt ttcagtctaa aaattacctg actaaacatt aaatgcaaat aaaaagaaaa 3300 ttgtttctca ttatcatacg gtcaaagaca taagaaaaat gcccagccga acctaaattc 3360 aaaaattttt caaaagtgac ccacccta 3388 // ID DNA8-24_AP repbase; DNA; INV; 738 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-24_AP. XX NM DNA8-24_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-738 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1766-1766 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 738 BP; 285 A; 108 C; 108 G; 235 T; 2 other; cagtgatggg aaagttcttt ctaaaaatga attagttcaa gttcaagttc aagacattaa 60 aaatgaacta gttcaagttc nagttcatag tttttagaat gaactagttc aagttctagt 120 tcagtagttt atagttttta aaatgatcta aaattataaa agttactaaa gaaaataaaa 180 taaaaaaata cttaagttag aaataaaaaa aaaaatacgt ggaaatacgt gcgactgccg 240 actatgcgtc gaaaccggcg atttttatta gtatacggca aacaataatt tgtatgagta 300 tgttanaata atttgtatga acatgttagt cttgggaatg ttatttataa cactttattg 360 tattatgtca agttacaaca gaattataat aatataccat aaatggtata ccaactattc 420 ttatcggcga tcaacggcgc ggcgttgtga tttcagggca aaaataaata tccaaactaa 480 aaataaacaa aattaaaaac aaattatatc ttttgattct tgagaaggaa cctgatcggc 540 tacaaaacat ttcaaaacgt tatcctatcg cctgttctat gaaaattaaa ttgtcaatga 600 acgaaaccaa aacgttcaaa atcataatta ataagaactt agttcacgtt caagttcatt 660 tcttgaaaat cgaactcgtt catatcaagt tcacacaatt tgaacgcgtt cttatgaact 720 cgttctttcc catcactg 738 // ID Sola1-5_AP repbase; DNA; INV; 3705 BP. XX AC ABLF01016303.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-5_AP. XX NM Sola1-5_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3705 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(588..839,843..1109,1113..1349,1500..3173) FT /product="Sola1-5_AP_1p" FT /translation="MKKARANKLVTLALNVTNRNILSHDHREINVSQSEYS FT VDLFNTSTSAVDFKPNENLEEKDMDNCGKRNIIFIPNSYTINKLILNLILF FT VDYDDSNELKHTGCSSQVMHYNIESEVTEINVSQSEYSVDLFNTSTSAVDF FT KPNENLEEKDIDDYGKRNIIFIPNSYTINKLILNLILFVDYDSNGSSLFEM FT DSDESYIPPGPLDTFSDSLSDQFDISTNLIESNVIQTDETEVIQKKKKTKK FT GKSRTLEKNYCLGNNNIFFFRYKFNDNFPEDVRSLRCQEYWNIGDYGKQKM FT YLSGLIQNVPVIRKKVITTKPKQVSRIYSITDVKGEKKRVCLNFFCTTFSI FT SHRVVNLCMKNIGDNGLFVGKDNRTNHKALNKTPISALNLVKEHINSFPKV FT ESHYCRRDSSKLYLSSDLNRALMYRLYTDHFCKERNISPVSFFIYKQIFNS FT FDPQLSFFLPKKDQCFHCNAYNNAIDKTDIENDWVQHKQREKVAMQMKSDD FT KKKSALDNGKSFRTISFDLQAILSLPFSGENQLYYRRKLNVYNFTIFDSYK FT NDGYCYVWDECNGKKGSAEIGTCLLNYLSQLPESVTHVSTFSDTCGGQNRN FT KNILAAMLYAVNRIDNIETIDVKFMESGHSYLEADSIHATIERHRKHKKVF FT TTREWALLFSSARIKPSPYKVTTMHYNEFYNFEKLVSSMIQNTTLNTEKEK FT VNWLKIKWMRFQKNEPYIMQYKYDQQETVFKKIDTSMKRGAGRKQEWTSVS FT LDIKYSKKIPISEAKKKDLKYLMNCKIIPNDYYQFYNDLPTTKKCQPQMSS FT DDEDTAD*" XX SQ Sequence 3705 BP; 1434 A; 468 C; 559 G; 1244 T; 0 other; cgatggcgta gcgcacaagt gtgataagtg acattttttt gaaaaatgag ttaagtgcat 60 aatcaattta atcatacaat tctgcacatt atgcacaaac gttgtaagtg aacagttcat 120 tatttactcc gctagatgtc gtgtttattt gaaataagta cgaatctcta aaattagaaa 180 attacaaatg ttgtaagtga ctttacaaca tttgtgattt tcatgaatgt ctttacgttg 240 cgttcaaatg tgttaagtga cgttaaaaca tttgtgaaca attttatgaa ataataaaga 300 tattattaat attaatattt gtcacgtttg gcgcgcattt tataatcata gagaatagaa 360 acatttctat aacctatcaa gcacacctat ctagcatatt attattatca atattagcac 420 cataatatta aaagtttatt ttcttgtgat aagttctacg cgtgacatcg tgtattgcat 480 ttaaagtaag ttaggtatta ttaattaaaa ctatttatga tgagtatgca gcttacaatt 540 attttactta tttaaatacc ctttattaat aattattcaa aataaaaatg aagaaagcaa 600 gagcaaataa attagttacc cttgctctta atgttacaaa tcgcaacata ttgtcacatg 660 atcatcgtga gattaatgta agtcaatcag aatatagtgt tgatttattt aatacaagta 720 cttcagctgt tgattttaaa ccaaatgaaa atttagaaga aaaagacatg gataattgtg 780 gtaagagaaa cattattttt attcctaata gttatacaat aaataaatta atactgaatt 840 gattaatatt atttgtagat tatgatgact caaatgaact gaaacatact ggatgttcta 900 gccaggttat gcattataat attgagagtg aagtaacaga aattaatgta agtcaatcag 960 aatatagtgt tgatttattt aatacaagta cttcagctgt tgattttaaa ccaaatgaaa 1020 atttagaaga aaaagacata gatgattatg gtaagagaaa cattattttt attcctaata 1080 gttatacaat aaataaatta atactgaatt gattaatatt atttgtagat tatgactcaa 1140 atggatcatc actgttcgag atggattctg atgaatcata tataccgcca ggaccacttg 1200 acactttttc tgatagttta agtgatcagt ttgatatttc tacgaattta atagagtcta 1260 atgtaattca aacagatgaa accgaagtta tacaaaaaaa aaagaaaaca aaaaaaggca 1320 aatccagaac tctggaaaag aactattgtt aaaaaaaaaa gattgagtgg agaaatgtac 1380 acaaatcgaa atgccaaaac atttgataaa aaaataccaa aaccagttaa ttgtatcaag 1440 tgtaggtagg tcaagtacaa catttttaaa taattgtaat aatcattatt gtatcttaat 1500 taggaaataa taatattttt tttttcaggt acaaattcaa tgataatttt ccagaggatg 1560 ttcgttcttt gagatgtcaa gaatattgga atattggtga ctatggaaaa caaaaaatgt 1620 atctatcagg cttaatacag aatgtgccag ttattcgtaa aaaagtaatt actactaaac 1680 caaagcaagt gtcaagaatc tattcaataa ctgatgtaaa aggggaaaaa aaacgtgtgt 1740 gtttgaattt tttttgtaca actttttcta ttagtcatag agttgtcaat ttatgtatga 1800 aaaatattgg agataacgga ctgtttgttg gaaaagataa taggactaat cataaagctc 1860 ttaataaaac tccaatatct gcattaaact tggtaaagga acacatcaat agtttcccta 1920 aagtcgagtc tcattattgt agacgtgact ctagtaagct atacctatca tcagacctta 1980 atagagctct tatgtaccgt ttatacacag atcatttttg taaagaaaga aatatttctc 2040 ctgtttcttt ttttatttat aaacaaatat ttaattcatt tgatcctcaa ctgtcgttct 2100 ttttaccaaa aaaggaccaa tgtttccatt gtaatgcata taataacgca attgacaaaa 2160 cagacataga aaatgattgg gtacagcata aacaaaggga aaaagtagcg atgcaaatga 2220 aaagtgatga caaaaaaaag tctgccttgg ataatggaaa atcattcaga acaattagct 2280 ttgatctaca agctattcta agcctcccat tttctggaga aaatcaatta tactatagaa 2340 gaaaattaaa tgtgtataac ttcacaattt ttgattctta taaaaatgac gggtattgtt 2400 atgtctggga cgaatgcaac gggaaaaaag gaagtgcaga aattggtaca tgtttattaa 2460 attatttgtc ccagttacca gaatcagtta ctcatgtttc aactttctct gacacttgtg 2520 gaggccaaaa cagaaataag aacattttgg cagcaatgtt atatgcagta aatcgcatag 2580 ataatataga aacaattgat gttaaattta tggaatctgg tcattcgtat cttgaggcag 2640 actctataca tgctactata gagcgacata ggaaacataa aaaagtattt accaccagag 2700 aatgggcatt attgttttca tctgctagga taaaacctag tccatacaaa gttacaacta 2760 tgcattataa tgaattttat aactttgaaa aattagtgag ttctatgata caaaatacaa 2820 ctttgaacac agaaaaagag aaagtaaatt ggttgaagat aaagtggatg agatttcaaa 2880 aaaatgagcc ttatatcatg caatacaaat atgatcaaca agaaacggta ttcaaaaaaa 2940 tagacacatc tatgaaaaga ggtgcaggta ggaaacaaga atggacttca gttagtttag 3000 atatcaagta ctcaaaaaaa attcctatca gcgaagcaaa gaaaaaagac ttgaagtatt 3060 taatgaattg taaaataata ccaaatgatt attatcaatt ttataatgac ttgccaacca 3120 ccaaaaaatg tcaaccacaa atgtcctcag atgatgagga tactgctgat tgaagttcct 3180 attatctata ttctatattt gtattattaa ttaataatgt tatattttga taagttgtca 3240 taaaataagt tgaatttctt tttagaaatt ttatatactt taatattaaa aggttgaatt 3300 taatttgtta tacaatatat taaagcagtt ataataacaa ttataatatt atttaagtaa 3360 ttgttgttct ttattgaaat tagaagaaaa atgtgatgtg tttaatcatt actttttacg 3420 tgtacaaaat gtgttaagta caatagcttt ttttttaaat ttttttttaa gaatacaaat 3480 gtgataagtg caggttttta aaaaaaaaaa ttttttctcg agatttgtct atagtagaac 3540 aataattttt aatcctcatt actcattaca tgttaaggaa aatttaacaa aaatttcagt 3600 acagttcgtt acatttttga attgttataa catctcaaac ttaaaaatcg ttctcctgaa 3660 caattacgaa aatgtcactt atcacacttg tgcgctacgc catcg 3705 // ID Gypsy-142_AA-LTR repbase; DNA; INV; 1807 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-142_AA_; KW Gypsy-142_AA-I; Gypsy-142_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1807 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1018-1018 (2011). XX DR [2] (Consensus) XX SQ Sequence 1807 BP; 486 A; 353 C; 399 G; 563 T; 6 other; tgtaacactt aaatgttttg tgccagtcta agaatattcc ctttttgttc ataaccaaaa 60 gatcagcaag ctttaagagt agcttttgtg cccaaaagag taaagcttaa gtaagtgttg 120 cgcacggtgc ataaccaaag gcgaatgaac tggacgctta tgaaacattc gataagttac 180 ggaattttcc actaattaat ttgtacctgg ccctttgaat ttgctaaaac caccaatcgg 240 gaaaagcgaa ccagtggaaa gtttttagat tcmttctgag tcctccagag ctgaccaata 300 aaaawsatcg tacaaaaaag agagtacacg ataaggtgga aattgcgtcg tagctttaag 360 gctcgaacaa gtttattgcg cgctttccat tttattctgg tggagtcagt taaagtgaaa 420 agatcatttt ccgaggtagt gaagctttct ccgaaagtga agtaaaagtt attaaagtga 480 gcttttcgtt aaatttgtga acttttaaat ccgaattcga agtaagttcg cgagttttca 540 gtaattactc gaataaattg attaatcgag tttttcagta tttttgaatt ttcagagttc 600 gaatagtttg accggtgaaa attgaataat tgtgaatgat ttattgtgac tattcaggta 660 agcctttggt ggagaataat taktagattc agttagtgag aatttatcac gttgaatagg 720 catcgtcaag cccatccaac gtgggcagct tgtccaggct ggtgcaaatt gtccctggat 780 tgtcctgagt cacttcgagt ccacgtgatc gggaatcgcc cacgtgttcc agtagggttc 840 cgactacgga gtcccttcct gtgtctagcc gaagactgtt ccgactagtg ggccattgtg 900 ggtggaaaat ctcgtccaac agtcaagctt tcccccgaat cgataagctc agacgtgagc 960 ttgtgcccaa gtgtgtgctc gagtcgccga tccgtaatcg agtagagtta tcgacgattg 1020 gactcgtttc gccaaagcgt gaggcgtaga gcaacaacaa cccgctctca ctccaccaac 1080 cgccagggac gcctctaccc tgaagatcgt gcccacgcgc gagtctaccg tatagcacac 1140 gttccaatta gaatccgacg aagctctcct gccgtcgagg gacgatcaag ccatcgccgt 1200 tggagcacca cgccgatgca gttaaggtca gatccacact agcttttgaa ctgattgata 1260 gtatccgaaa attgtatata tagaataagc taaatgagtt aaatgcgtga cgtcaccaga 1320 ctagatacta ggttttatwt gaatgttcac aataaattga gagcttgcat acgttttgaa 1380 taggtcctat gcaaataact ctctaaattc atttttctwa aataaattcg tttaaattga 1440 ctttgtttca taatatttat ttcgagttcg gattgaaaac gactctacta ttcttttacg 1500 tttatttttt gtaaatatct agatttgttg gttccttaag ttaagacttt ttctgccgct 1560 cagttctggt tgactttttc ttttcttttt ggtttaggtt tccggttttc agttggtttg 1620 tagttaaatt tccgttcatt tgtcctttcg tgtgtgccat tttggagagt ttttgcttct 1680 gcattgaact atttgcgacc tgccctgaga agggtggtct cataccgggc ttagacagag 1740 tgacggtcct tcggaagtcg aaggtggcgc taaagccgtt gcttgaggag cagagtagaa 1800 cgttaca 1807 // ID hAT-4_HM repbase; DNA; INV; 2421 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2421 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1993-1993 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 422..2191 FT /product="hAT-4_HM_1p" FT /translation="MSKSKKRSYLDEYMRYGFTYMLKETISYPQCVICYKV FT LGNDSMKPSKLAIHLNKCHPDLQTKDTDFFKRKFESLKRCKLDKTGIIYQT FT QQNIVEASYKVSYIIAQNKKAHTLAEEVILPCTKEIVRLLFGEEAAKKVDN FT ISLSNTTVKRRLTDISSNIKENVINEIKESPYFSIQLDESTDVSSMAQLIV FT YCRYIHNNKFKEEFLFSSCLETTTKAANIMEILKQFFNDNQLQWKYLFGIT FT TDGAPAMMGCKSGLQTRVKEIAPNVVGVHCFIHRQALATKTLPGSLKTVFN FT QLVKLVNYIKSSALNTRLFTKFCSDLNAEHNKLLFYTSVRWLSAGNFLERF FT FMLRNEVKEFLCQMKSGLVEYFEFDNFEITTAYLVDIVGHFNKLNLQLQGK FT NANVITHSDKLKAFIEKLKLYKTRINNGNLIMFQNLNNIIGVNMLPEVIKT FT EIVCHLDNLITEFGNYFPDMNLLSSEVIISPFSCDVKNVKEEAQEEFIELK FT NDTTAKDHFKVAPLNNFWLKMRNSYPLCSSIALKALIPFSTSYLCEAGFSA FT MLSLKTKKRNRLDIDADIRCALSKTKPNFQILIASKQCQNSH*" XX SQ Sequence 2421 BP; 880 A; 338 C; 391 G; 812 T; 0 other; cagtggttcc caaccggtgg tccatggacc cctgggggtc cataagaact ttttaagcgg 60 accacaagat aaacgtaaat tctaaatatt ttttggtagt gaaacttaat tttgattatt 120 tttttctgtt gaaagaaaca aattaaattt taaattaaaa atgtaattta tgcgagaaag 180 ctgttacaca aaatttgcgt taactaattc gacgaggact tgcggacctc tacgaagtaa 240 taaatgctat ataatatcat tttttatccg taaatttaaa atgttaaaat cttatctcat 300 ggtaaagata accttactga tttgtttgaa agaacgtaat ttgtgttatt ttctataaaa 360 attcgatact atactactct tatctataga tgctttttat attatttttt aaatacttta 420 gatgtctaaa agtaaaaaaa ggtcatactt agatgaatac atgagatatg gatttaccta 480 catgcttaaa gaaaccatta gctacccaca gtgtgttatt tgttataaag tactaggaaa 540 tgactctatg aaaccttcaa agcttgctat tcacttgaat aagtgccatc ctgatctcca 600 aacaaaagat actgattttt tcaaaagaaa atttgaatca ctgaagagat gcaaattgga 660 taaaactggg attatttacc aaactcaaca gaatatagta gaagcatcct ataaggtttc 720 atatataata gcacagaaca aaaaagctca cacattggct gaagaagtta tacttccatg 780 tacaaaagaa atcgttagat tattatttgg agaagaagct gcaaagaagg tagataatat 840 atctttatca aatactacag tcaaacgaag gctaacggat atttcatcaa acattaaaga 900 aaatgttatt aatgaaatta aagaatcccc atatttttct attcagttgg atgagtccac 960 agatgtaagt tcaatggcac aactaattgt atattgtagg tatattcaca ataacaaatt 1020 taaagaagaa tttttgtttt cttcttgcct tgaaacaaca acaaaagcag ctaatattat 1080 ggaaatttta aaacagtttt tcaatgacaa tcaattacaa tggaaatatt tgtttggaat 1140 tactactgat ggagctcctg ccatgatggg ttgtaagtct ggattacaga ctagggtgaa 1200 agagattgct ccaaatgtag ttggtgtaca ctgttttatt catagacagg ctttggcaac 1260 caaaacttta ccaggtagtt tgaaaactgt atttaatcaa ttagtcaagt tggtcaatta 1320 tataaaatct tctgctctta acactcgcct ttttacaaaa ttttgctctg acctaaatgc 1380 tgaacacaat aagttattgt tttatacttc tgtgcgatgg ctatctgctg gaaatttttt 1440 ggaaagattt ttcatgctta gaaatgaagt gaaagaattt ttatgtcaaa tgaaaagtgg 1500 actggtagaa tattttgaat ttgataattt tgaaataaca actgcttatc ttgtggacat 1560 cgtaggtcat tttaacaagt taaatttgca acttcaaggc aaaaatgcta atgttattac 1620 acattcagat aaactgaagg catttattga aaagctcaaa ctgtacaaga ctagaattaa 1680 taatggaaac ttgatcatgt ttcaaaattt aaataacata attggagtta acatgcttcc 1740 tgaagttata aagacagaaa tagtttgtca cctagataat ttgattactg aatttggtaa 1800 ctattttcca gatatgaatc ttttgagcag tgaagttatt atatcaccat tttcttgtga 1860 tgtgaaaaat gtgaaagagg aagcacaaga agagtttata gagttaaaga atgataccac 1920 ggctaaggat cacttcaaag tagcaccatt aaataacttc tggttgaaaa tgaggaattc 1980 atatccattg tgttcatcaa ttgcacttaa agcccttatt cctttttcaa catcatattt 2040 gtgtgaagcc gggttttcag caatgctatc tcttaaaaca aaaaaaagaa atagacttga 2100 tattgatgca gatattagat gtgctctatc aaaaactaaa ccaaattttc agatattaat 2160 tgctagtaag cagtgtcaga attcacactg atctgttctt gtatcataca taaactaaga 2220 taaaaaattt ttgtgtcatt gataaatatt gattttgtgt ctagtctatt caaaaatacc 2280 ttaaataaaa tattatcctt atgaataaaa gttacattat taaaatgttc accttgtaat 2340 aatttttaaa atcaaagggg tccatgaaga ttttaggata tttaatttgg ggaacgggtt 2400 agaaaaggtt gggaaccact g 2421 // ID SZ23_TC repbase; DNA; INV; 276 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Trypanosoma cruzi telomere-associated repeat, a consensus. XX KW SZ23_TC. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RA Chiurillo A.M., Cano I., Da Silveira F.J. and Ramirez L.J.; RT "Organization of telomeric and sub-telomeric regions of RT chromosomes from the protozoan parasite Trypanosoma cruzi."; RL Mol. Biochem. Parasitol 100(2), 173-183 (1999). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Trypanosoma cruzi telomere-associated repeat, a consensus."; RL Direct Submission to Repbase Update (JUL-2004). XX DR [2] (Consensus) XX CC Average similarity to consensus 90%. XX SQ Sequence 276 BP; 48 A; 60 C; 94 G; 74 T; 0 other; caaccacaca caaaatgcac attgcagtgg atttggatgc ttcttttgta gcgtgtgctg 60 gcggttgcgg gtggctgctg cttccttccg tgggcatgaa gggaaggagg agagagggag 120 cgcaccgcga gtcacttttg ctgctggcgt gatgcaccac ctatgaccgc tttgagttgg 180 gcaccgaatc ccattgtggg tggaatggct ggaaattttg ttgccgcgtc atggatgggt 240 gaggatccgc ttccgcgtgt ggctctttgg gtgatt 276 // ID BRP2_NV repbase; DNA; INV; 110 BP. XX AC X64090; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE N.vitripennis repetitive DNA from B chromosome. XX KW SAT; Satellite; Simple Repeat; BRP2_NV; Repetitive DNA; KW satellite DNA. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-110 RA Eickbaum C.D.; RT "BRP2_NV."; RL Direct Submission to Genbank (27-DEC-1991)D.C. Eickbaum, RL University of Rochester, Dept. of Biology, Hutchison Hall 334, RL Rochester, NY 14627, USA. XX RN [2] RP 1-110 RA Eickbush G.D., Eickbush H.T. and Werren H.J.; RT "Molecular characterization of repetitive DNA sequences from a B RT chromosome."; RL Chromosoma 101, 575-583 (1992). XX DR GenBank; X64090; Positions 1 110. XX SQ Sequence 110 BP; 23 A; 19 C; 29 G; 39 T; 0 other; ctgggtttaa gcacttggtt tttagagttt aacctcgctt tgctcgcctt acgccgatgt 60 gtcgaagagt tttaataaat aatttactga tctcgggggg catggtgtag 110 // ID Copia-44_AA-LTR repbase; DNA; INV; 176 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-44_AA_; KW Copia-44_AA-I; Copia-44_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-176 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 966-966 (2011). XX DR [2] (Consensus) XX SQ Sequence 176 BP; 43 A; 43 C; 33 G; 50 T; 7 other; tgtagaagaa cgatgattac ctccggccaa gtatacggtg catccaagca ccccccgtga 60 gtatgattac tagcgcgttg ttgtaccccc aggtacgttt gacatttatg ctcattccta 120 aaagcacatt tcccgcgtam agttgatwtt cattttmask ctaaktagmc cgttca 176 // ID BEL-29_CQ-LTR repbase; DNA; INV; 366 BP. XX AC AAWU01003415; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-29_CQ_; KW BEL-29_CQ-I; BEL-29_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-366 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 212-212 (2011). XX DR Genome; AAWU01003415; Positions 3434 3069. XX SQ Sequence 366 BP; 95 A; 91 C; 94 G; 86 T; 0 other; tgttgagtct tcaacgtctg gcagtaaatt ggcaacactt gctgtcaccg cagccagttt 60 gcttagatca acgaaccacc ctgtaaggaa aaacgcaacc gttgctgcga tcggcgacgg 120 aagaggaagg gagaaacgcg cgaaaccgag agttggagaa gcagcaactt ggctcgggat 180 cagtctcgcg ccaccagtcg ttggaagcag ttctaaatcg cttgttgaac taaaataaag 240 ttagtgatta gtattagaac cgcgtgtgta ggtttctttc caacgcccac tgccagttca 300 agtccaccat cttttcggaa acccaattcc gccgtttttt cgtggggcca gtctgtggcc 360 cgaaca 366 // ID Gypsy-9_OD-I repbase; DNA; INV; 10202 BP. XX AC CABV01000587; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_OD_; KW Gypsy-9_OD-LTR; Gypsy-9_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-10202 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000587; Positions 24049 13848. XX CC Positions [5655-6137] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1037..2104 FT /product="Gypsy-9_OD-I_1p" FT /translation="MPTTGITNRCIEVSSATLVYLKKVATLTAIDGAYKYT FT KKEELLNKIKEYGNGFWVTLVESRSGGVFLIPLTGAFNIDELQNKPTVFDL FT QGAKEAFARGFEKELYRFNEAEIEAKEFGTNDDNCKMTIEAFYMGDDTSGA FT FAHFKFDHVKVLDRARNKSSTPAPSEPEESDGPTVKDESEESAREAASEWA FT AKAQQVKEQMRVAHERFESITRKQAEAIKKMGDKMEAREKRLNEMDEAIEG FT LASMDEILDDTIRQARADVGVNNVNITRDLLDGNDFSGINKGSKIKLPIIF FT GDKDKGLGKPTDVTDWYENCCFILNLQGITNKKTRLAHLVGEHQREIPGAS FT NSVDQGETVANGG" FT CDS 2811..6626 FT /product="Gypsy-9_OD-I_3p" FT /translation="MSSLPTCQVKIIQFKGREKVISGNFKFLIDTGSENSF FT IKSDIVNQARLDKTPRKNAMIVGNAAGSTMARITDEIRCDINVTGPKNQIF FT VSNIELDIWGAQLSYDGILGYNILKYINIMGSDNKQILLLNNIESAQESFE FT KWPWVILGQPDYKLRRKCIVITEDVYVPPNGIAWGKIPTQKKQWAVDTDFE FT LLENGIFVNHKGKLKLGRTLIKLENHRDIGVLISAGFPIMEIVSGNQMNFL FT MEMGKIDKMERMIHEKELEDWKKRRDQLVETTNVDSEIKEALVNVPAEYSI FT GLEKVLRRYQWSFSRNSSDCGYSCNWVADLKLNDPEAPPTFVRPYDIDNSA FT MEQVGNKLKEMNDSGIIQESSSGWNAPLLMIAKADKSIRIVQNYSGARKNE FT KSLNERLIMPRWPIMPIRSILTTISRNITNLKRDYPGERIYFVGIDIRNAY FT YTLSLRNSTRELTSFIFSDKQYEYTKMSQGLSTSPSTFTAFISKLFAKLDD FT QGNKKYFYTTCYMDDILITGVGRHMNRAIDQVLGLFEKENIVVSLKKCSFY FT EKETKFLGFMISEIGFRGSQKKVDGILELDYPRTSKEAMRMLGCVNYFNRL FT IRGASELMGPLAKESGKGKHYQLTEQIKECGDALKKRIRENGVSLIHLKYY FT DGTHGKYCFIASDSSLYGTGACIGNCTIIDDQTSEIEIAAFCSKKLDDQEA FT MLSSRSRELIALSVALQSFRDIIPKSLPVLAFVDHQSLVNIRKNINLKTSG FT QTRVRKAFATVLDFPNLKIFYIPGNSEIINVVDYISRHAEFVIREIDPSIL FT SPKKIETEEIQTNQITDITEIGKTVKEIDVDVLKKEQKLEFGEIMNGPQGI FT FKIVGKKEFIVQNDLLFEKVRSGAVLLVIPDCLADEIIQFLHVSSLHASQE FT EIFRKIRAQKILIKNRCERVSRLLKHCVYCKIRCTTKHSFEKQEFPREPAL FT KSREKIHVDCLDIRTGNKENIFLTIVDEFSRKLFVHKLKNKKMDVLVPILT FT IQLAEIGGSYSILISDNGKEFLNRGVKEALLTLNILRTTNSPYNSRSNLCE FT RAHKNIREKLRCTELTASNVEFKTLLAVAFLNHCPLKELNFLTPNQVWANL FT DPPNYLPQFTREEVTLLPSEKMSFLEELQTQVMSEKIGRYLFKTDVLENPF FT SEGDIVMLRDPVGISANKIKGPFRVVKTGLKSSVTICDLNNGVLYRRNGRH FT LFKIYVSKEDMEKFMGKGRTVLENKRNGELEPGFVGSPLDNMSIKFPEPFE FT PGFKLRSGRTK" XX SQ Sequence 10202 BP; 3508 A; 1348 C; 2461 G; 2885 T; 0 other; ctataaattg gtgactgaga gcggacgacg atctacgtaa acgtaagtaa tgcttgatgc 60 ttgcttattt taattttata taagttttgc tggaatttga atgattttaa tttttgaacc 120 ccttgggtcg ataaaatttg ttgatttata aaaaatggaa aatgattttt gggggaggat 180 tggaaatttt tgatcaaaaa tgggaaaaga attttctttc ttttgcgtcc aaaattcgag 240 attttttttt tcattttgat aaggagataa gggccgtaat ggcttgggaa ttaaattaaa 300 agactaaggg gggataaatt aggttaaaga agtgtcgatc gtttttttcg aaaacatgta 360 aattttgacg cacttctgcc acgagatatt gggaaacttg tccttgttaa ttttccgaaa 420 taggaccgaa cggaaaaggg atcgagcggg gaaaaaggcg aatcgggaga aagaggttat 480 ttttgagcgc cacacttgcg tgcatttttt gaaccggtga cggtcagcta ccaaagattt 540 ggtcagaatt ttagttcgcg tttggtcaat tttgggaatt taaggattaa gaaaattatt 600 tgggtttaaa ttttgagagt aaagttattc aattttttaa gaaaattagt gaggattaaa 660 cttatataaa atataaaatt ttttttacaa attgaggtaa atgtgcacaa ttctatcttg 720 atttttttga aaatttttgg gactgattat attaaagagt gatatgagga gattgaggaa 780 tagggtgcac cagaggggtg accatgaaaa tttttgttga agaatttttg agaatctaga 840 actgataacg cgcgtgggag aggatctctg ataaccgaga tattaggaac cgtcactgac 900 tgataagtca gcttgtgtct cagggtcttg tagacgaggg gaataaatat gacgggttta 960 attaattaag tagactgagc gagtctgaga gactcattga acttgatgat gaaatattat 1020 acgagttgtt tttcagatgc caacaactgg gattaccaat aggtgcattg aggtatcatc 1080 ggcgacgttg gtgtacttga aaaaggtggc aacgctcacc gcgattgatg gggcctataa 1140 gtatacaaag aaagaggagc ttttgaataa aattaaagag tatggtaatg ggttctgggt 1200 tactttggtg gagtcgagat caggtggggt ctttctcatt ccattgacgg gtgctttcaa 1260 tattgatgag ttgcagaata agccaactgt gtttgactta caaggtgcaa aagaagcatt 1320 cgcaagggga ttcgagaaag agttgtatag gtttaatgaa gcggagattg aggctaagga 1380 gtttgggaca aatgatgata attgcaagat gaccattgag gctttctata tgggggatga 1440 tacttcggga gcgtttgctc atttcaagtt tgatcatgtt aaggttctgg atcgggcaag 1500 gaacaaaagc agcactcctg caccatccga gcctgaagag tctgatggac cgaccgttaa 1560 ggacgagtcc gaggaatcag ctcgagaagc tgccagtgaa tgggcagcaa aggcacaaca 1620 agttaaagag caaatgagag ttgctcatga aaggtttgag agtatcaccc ggaagcaggc 1680 tgaggctatt aaaaagatgg gtgataaaat ggaagctcgg gagaaaaggt taaatgaaat 1740 ggatgaagct attgaagggt tagccagtat ggatgagata ctagacgata ccatacgtca 1800 ggcaagagcg gacgttgggg ttaacaatgt taatataact agggacttat tagatggtaa 1860 tgatttttcg ggcataaata agggttcgaa aattaaacta ccaattattt tcggggataa 1920 agataaggga ctgggaaaac ctactgatgt tacagattgg tatgaaaact gttgtttcat 1980 tttgaattta caaggtataa caaacaagaa gacgcggtta gcgcatcttg tcggagagca 2040 tcaacgggaa attccagggg ctagtaattc ggtcgatcaa ggagaaacgg tcgccaacgg 2100 aggatgacgt tcttcaaacg cttctgaagc ttttgaatta caatcgaagc catttgaaat 2160 cagaaatctc caagtttcaa attgtcaggg atcgacaatt gatcgagcaa tttttagagc 2220 tcagaaatct gattgggtta caagatttga agtttccaag tccggaagct agggaaagta 2280 cgcttgactt gttagcactt cgggaatttg agcaaaaaat gccaaaggct attgcagggc 2340 aagtactgtt tcggatggag agtgaaggga aatcgccgag gcaaataatt cagctggcgc 2400 aaaaaatttt ggatagttcc cagccatctg aatcagaatt taataatttt aaaggggcag 2460 agattgaaaa tgacagtgag tttcagtcag atttgaatgc atttcgaaag aaatggggaa 2520 aagatgcgaa atttgggggt ggtgataaga agggtaaatc aaaagcagcg tgctggatct 2580 gtgataaacc gggtcacaag agtttcgatt gttataaaaa ggctaaagat gggtgtttta 2640 gatgtggaaa tgtttcacat agaattaagg attgtccgga gaaaaagtcg acaagttctg 2700 agaagaaggg taagaaacct gttcctgagg ggtacttgtg caagatatgt agcgtcagcg 2760 gtcattggat ctgggagtgt ccacaaaagg gaaaaggaaa gtaaagacat atgagctcgc 2820 tgcctacatg ccaagtaaaa atcatccaat ttaaaggtcg ggaaaaggtt atatcgggga 2880 attttaaatt tttaattgat acaggatcgg aaaatagttt tataaaaagt gatattgtaa 2940 atcaagcgag actggacaaa acacctcgta aaaatgcaat gatagtggga aacgcggctg 3000 gatcaactat ggccagaata actgatgaaa ttagatgtga tataaacgtt acgggtccaa 3060 aaaatcaaat ttttgtgtca aatatagaac ttgatatctg gggcgcacaa cttagttacg 3120 atgggattct aggatataat atactcaaat acattaatat tatgggatca gataataaac 3180 agattttact actgaataac attgaaagtg cacaagaaag ttttgagaaa tggccatggg 3240 tgattttggg gcagccggat tataaactca ggagaaaatg cattgttatt acagaggatg 3300 tttacgttcc gccaaatggg atcgcttggg gaaaaatacc gacacagaaa aaacagtggg 3360 cggttgatac tgattttgag ttactcgaaa atgggatttt tgtaaatcac aagggaaagc 3420 tgaagctggg ccgaactttg ataaaacttg aaaatcatcg agacattggg gtcttgattt 3480 cagcggggtt cccaatcatg gaaatagtat cgggtaatca aatgaatttt ttaatggaaa 3540 tgggtaaaat agacaaaatg gaacggatga ttcatgaaaa ggagcttgaa gactggaaga 3600 agcggcgaga tcaactcgtt gagacaacca acgttgatag tgaaattaag gaagcgctag 3660 taaatgttcc ggcggagtat tcaataggac ttgaaaaggt tcttagacgg tatcagtggt 3720 cgttttcaag gaacagttcg gattgtgggt attcatgcaa ttgggttgca gatttaaaat 3780 tgaatgatcc tgaagcgcca ccgacatttg tcagacctta tgatatagat aatagtgcta 3840 tggagcaagt tggtaataaa ttaaaagaaa tgaatgactc ggggataatt caagagagta 3900 gtagcggatg gaatgcgcca ttgttaatga tagctaaagc tgataaaagt atacggatag 3960 ttcaaaatta ctcgggggcc cgtaaaaacg agaaatcatt aaatgaaaga cttattatgc 4020 ccagatggcc tataatgcca ataagaagta tccttactac catatcaagg aatattacaa 4080 atttaaaaag ggattatcct ggggagcgaa tctatttcgt tggaattgat ataagaaatg 4140 cttattatac gctttcactg aggaactcga ccagggaatt gacgtcattt attttttctg 4200 ataaacaata tgaatacacg aaaatgagtc aggggttatc gacaagtccg agtactttca 4260 cggcgtttat aagtaaactt tttgcaaaac ttgatgacca gggtaataaa aagtattttt 4320 atacaacatg ttatatggat gatatactga tcacaggggt cggacgacat atgaatcggg 4380 cgatcgacca agtgttagga ctttttgaga aggaaaatat tgtcgtttct ctgaaaaaat 4440 gctcgttcta tgaaaaggaa acgaaatttt tgggatttat gatatcagaa ataggatttc 4500 gggggtctca gaaaaaagtt gacgggattt tggagttgga ttatccaaga acgagtaaag 4560 aggcaatgcg catgcttgga tgcgttaatt attttaatag attaataagg ggtgcctcag 4620 aactgatggg acctttagcc aaagaatctg gaaaagggaa acattatcaa ctaacagagc 4680 agatcaaaga atgtggcgat gctcttaaaa aacgcataag ggaaaatggg gttagtttga 4740 tacatttaaa atattatgat ggaactcacg ggaaatattg ctttatagca agtgattcct 4800 cactttatgg aactggggct tgcattggga actgtacaat aatcgacgat caaacgtccg 4860 agattgaaat tgctgcattc tgtagtaaaa aactagatga tcaggaagcg atgcttagtt 4920 caagaagcag agagttaatt gcactaagtg ttgcactgca gtcttttcga gatattattc 4980 ctaaatcatt accagtattg gcttttgtgg atcatcagag tttagtaaat ataaggaaaa 5040 acataaatct taaaaccagt gggcaaacaa gggttagaaa agcgtttgcg actgttttgg 5100 attttccgaa tttgaagatt ttttacatac cgggaaactc tgaaattatt aatgttgtag 5160 attacattag tcgccatgcg gaatttgtca taagggaaat tgatccgtcg atccttagtc 5220 ctaaaaagat tgagactgaa gagattcaga caaatcaaat aacggatata acggagattg 5280 ggaaaaccgt aaaagaaata gacgtggatg ttttgaaaaa ggaacagaaa ttagaattcg 5340 gggaaataat gaatggacca caagggattt ttaaaattgt ggggaaaaag gagttcatag 5400 tacaaaatga tttgcttttt gaaaaagttc gatcaggggc ggtactttta gttattcctg 5460 actgtcttgc ggacgagata attcaatttt tacatgtttc atctcttcac gctagtcaag 5520 aggaaatttt tagaaaaata agggctcaga aaatattgat taagaacagg tgcgagcgag 5580 taagcagatt attaaagcac tgtgtgtact gtaaaattag atgtactaca aaacactcgt 5640 tcgaaaaaca ggaatttcca agggagccag cgttaaaatc tagggaaaaa attcatgtgg 5700 actgtttgga tataagaacc gggaataagg aaaacatatt tttaaccatt gtggatgagt 5760 tttctcgaaa gctttttgtt cataaattaa aaaataaaaa gatggacgta ctggttccga 5820 ttctgaccat acaattagct gagatagggg gatcttacag tattttaata agtgacaatg 5880 ggaaggaatt tttgaatcgg ggagttaagg aggctcttct gacgcttaac atactaagga 5940 caactaacag tccgtacaat tcgcggtcaa atttgtgcga gagggctcac aaaaatataa 6000 gggaaaaatt acgttgcaca gaattgacgg cgtcaaatgt agagttcaaa acactactgg 6060 ccgttgcatt tctaaatcat tgtcctttga aagagctgaa ttttttaacg ccgaatcaag 6120 tttgggcaaa tttggatcca ccaaattatc ttccgcaatt tacaagggaa gaggttacac 6180 tgttaccatc ggagaaaatg agtttcttgg aagaattgca aactcaagtc atgagtgaga 6240 aaattgggag atatttgttt aaaacagacg tactcgaaaa tccattttct gagggagaca 6300 tcgttatgtt aagagatcca gtggggattt cagcaaataa gataaaaggt ccatttagag 6360 tggtcaaaac tggattaaaa agtagtgtga ctatatgcga tttgaataat ggggttctgt 6420 atagacgtaa tgggagacat ttatttaaaa tttacgtatc aaaagaggac atggaaaaat 6480 ttatgggtaa agggcgaaca gttttagaaa ataaaagaaa tggggagtta gagccgggtt 6540 ttgtaggttc acctttagat aacatgtcta ttaagtttcc cgaaccattt gaacctggat 6600 tcaagcttcg cagcggaagg acaaaatgaa tgtttgtttg ttagggctga tcttctggag 6660 ttccggagtc agttcgacga ttttggaaaa gggtgtattc gtggggtgca ctgccaatgt 6720 aaagcgacac tgccaatgta aagctctgca ttttctcgct ttacattgga agggtgtttt 6780 ctaaaaacac attgttttga ttaatatctc ttgttctata agaaaggttt gtgggccaca 6840 gattccaaat atgggtgaaa acgggttaaa aaaagttaga aattaataga catcagaatc 6900 agatattttg tgacattttg tgtatcgagg aaaaatcgag gtgcattttt gcgctttaca 6960 ttggcattgt aaaagtgcac gctttacatt ggcagtgtct cccgtattcg taactaaggg 7020 gaaaaagggc tacgtgattc cggcaattgg tcaaggggta ctggccgtag atttttatcc 7080 ggatgatcgg gtttttagaa atataaaaga cggtaaagaa agttgttttg gaaatacgat 7140 tcaccaaagt caaaaaaccg acaatttgga gtatatcaat aaaatgttaa agcttaaatg 7200 gggtttgctt aacaattttg ataggaaaaa gcgttctctc acatctaaaa tagcgctggg 7260 actaagtgct ttcgattttt tacagggaaa tatccgggac aatttcattt ctgaagaaat 7320 ttcgaaaact aggaaattaa tgcgggagga cgaaattcag acaaaaaaat tagcggaaat 7380 cgttaacgat gttagcatta ttttggaaaa tgataacggt gaactacaac ttatgaagga 7440 gaaaatatgt ttggatcgat tggatatgaa attgttcaaa gtcgaacagg aattattcaa 7500 cgtggtacag tcatatttgt tagaattaga agcattggag actaaaatag tcttaaaatt 7560 gccaaactcg aagatatcag aaatgctggt caaaatctgc gttgaaagca atgggaaaaa 7620 tttttcagaa gggtgtaaag attactattt atacgatgga tttacggttc aacgtaaaga 7680 aatcgtgtac aatgatatag gattaattgg gcaacgattt tatattttat atgaagtacc 7740 ggtcattgaa attattgaaa atacgtttga actgacaacg gtacctacac cattttcggt 7800 agaacaggga aaatatatct ttcaaaaatg ggatctgccg gaagtaattg gaattatgga 7860 aaaagggaaa attgtttttg atatgggaat tgtaaaattc gcgcgggaaa aacatatttc 7920 tgcgataaca gtatcatttt tgagaatttt cagactcaat gtcctgaagc aataattcaa 7980 ggggttgtga gtgcagaaac atgcggagtt tcgttgataa gctctgtgca tgactgtttc 8040 tttaatagaa atattcgagg aaactcggtt attgtcggac attttaatga gccgattgtt 8100 aaaaataggg ttaactcagg ggtaccaact caaggttata gacaaaaatt tcgggaaaac 8160 agaaatgttt caatgataga gttaaatgac caagtttcga caattgagtg tcgaaatacg 8220 atttttaaac atgctggagt cattaaaaat cctaaactta tcacggttaa tgtaacagat 8280 tttgtaaaaa ttgagtatcg ggatcaaaat ttacatttgg aaccatggaa gaaaactcca 8340 agtggaactc aaatcattca aaatgagctg gggaatattg aaaaaggatt tttggataga 8400 tcatcagagt ctattgatag accgtggtgg gaattttggg tattttcgga gaaaaataag 8460 aaaaagttag ttatttcaag tatttttgcc atgattattc tgagttttgg gttattcttg 8520 agcacaaaat ggggacgaaa acagattttt aacttgctaa aaacaatatt tttgaaatta 8580 tgggttaaat gtcgcaattc gcgcaagatt tcaaacaaat cagatggtat ggaaatgaaa 8640 aaactggatt caataacgga ggaaccagaa attaaaccga ggaaactatc aattacaatt 8700 taaaaacatg tattctgaca tgttgagtgg aatttttaga tcaaaaattc catatttagg 8760 gataattttg ccgataattc attttttacc agttttaagg gtaaaaatac catttttcgg 8820 taaaattatg cgatttttgg gttaaaatgg ttaaaaacga gttaaaacag gttaaaaatg 8880 ggttaaacag gttaaaaatg agttaaaaca ggttaaaatg ggttaaacag gttaaaaatg 8940 agttaaaaaa ggttaaaatg ggttaaacag gttaaaatgg gttaaacagg ttaaaaatga 9000 attaaaacag gttaaaatgg gttaaacagg ttaaaaacgg gttaaacagg ttaaaaatgg 9060 ataaggttaa aaaataggtt aattgacatt tttgggttaa aaacggaatt aaaaagtttt 9120 tcttcaagat ggctaagaag gtcacctgga acgacgagga cgttcttcac ctctatgatg 9180 acgatgcgga gatgtcaaac atgcttgacc tctggtctga agaaatcggg atgcgaacgg 9240 acatggggtc ccaaacggaa atggaaatgg aatcagctgc aactcaagcg acaatggaag 9300 tgatgacgga gaaaaaatca gtggaaacac aggccaggcc ggaattggta gaacgaggga 9360 tgcagaccga cgaaatggat gcccgtgaag ttaaagttgt tctttataaa agcgtaactt 9420 tgactgagga aaagaaggtg cagacggagc gaagcgaagc taaaagcgcg aagtcaactg 9480 gaaacagctt gtgcggaaaa tttctgaacg agggatcagt tatttcaatt tgcaacgatt 9540 tgggggagaa gataaatgaa aatatgcgat ggagggctga aattggagaa cgatttaatt 9600 tgggatattg gaaggagcca tttgagggat acagattgat tgcgcaagaa tggaaaacag 9660 gacggagcgt tgtgctggag aagaaagaag atgtggataa ggtacagcaa tttgtcaaaa 9720 tacttggaaa gtttctggtc aaaaatcttt ttcgaggggg tggatatata acttatatgg 9780 cgttggaaac gattgttaag ccaaatttat attttgcgaa tgttacgact tgggagtaca 9840 aattgggaga tacatttaag gcgaactgga caagaacatt acaggatcga tatatgctat 9900 tggagcttgg ggatcgtttg tttggacagg atcaaaagaa gattacaaca atggaaatgg 9960 aggggcagaa cagacgacaa tcaacaatgc gagataggta ctagatggat ggagcaatgg 10020 attctttatt atttctaaat ttggtggggt cattacattt cggttcatca attctttcaa 10080 tttggatcta aatttggact ggcgatagga ttttagttaa atggaaactg ttaggatagg 10140 ataaattggt attggttatt tacgacggaa gtcaacgaca acttcaaaga gggggaggag 10200 tt 10202 // ID CR1-44_BF repbase; DNA; INV; 2287 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-44_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-44_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2287 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2287 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1615-1615 (2009). XX DR [2] (Consensus) XX SQ Sequence 2287 BP; 780 A; 520 C; 542 G; 445 T; 0 other; tagactggac aaaagaacta gaaggcatga cgacacagga ggccttcaac acactgcaag 60 agaaagtaat gaaggtggta gaagagaatg taccgacgaa gagaacggga aatacaacaa 120 agaaaagcaa accgaaatgg atgacacatg aggctcaacg taaagccaga aagaaattcc 180 atgcatggtt aaggtacctc aacacgaaaa gcaaggaaga ctggaaaacg tacaaagaat 240 gtagaaatgc cgcaacacat gctgcaagac aagcacgtag agactttgaa cgatcactcg 300 ccaaggaagc taagacaaat aacaaagctt tttggagtta tgtgaactcc cgcagatcag 360 taaaatcaca ggtgggtgac ttgaaggaca cgaagggtaa ctttgctacc aaagacaagg 420 acaaagcaga aatactaaat gaccagtacc ataagacctt cacacgtgag agctacgaca 480 acatgccagt attcgatccc aaacctctac agactccggt cctggacaaa atagaagtca 540 ctgaggaaat ggttatggac caactatccc agctgcgtcc ggacaagtcc ccaggtaatg 600 acggacttca cccgagaatt cttagagaac tgtcacacga gatcgtggtc ccactaacca 660 aggtttatca actcagctta agtaatgcag tacttccaga gcagtggacg gaagcaagta 720 tcactcctat cttcaaaaag ggtgagaagt cagacccagc aaactaccgt cccgtctcgc 780 taacatcagt gccctgcaag atgctggaaa gaataatctc agataaggtg attgaacaca 840 tgcggataaa caatttcaca tgtgagcaac agcacggttt ttccaagggg aaatcaacag 900 tgactaacct actcgaggca cttgacgtgt ggaccgaagc cttaagtcac ggactacaag 960 tggatgttat atttttagac tatgctaagg cttttgacac ggtcccacac gaaaggctcc 1020 ttagaaaggt cgaatccttt ggcattagag gaaagctgtt agagtgggta cgaggcttcc 1080 taacccaacg aagacaaaga gtctcggtaa acggagctac atcgtcctgg aagagggtag 1140 aaagcggagt cccacaaggg agtgtactgg gtcccttact attcatgatt ttcgtgagcg 1200 atataccgga cacactgcag aactttgtat ctctctttgc ggatgacaca aagatctatg 1260 caaccgtcga ggactgtgca agagaaggtc atacgacctg ccttcaggcc gatctagacc 1320 aactacaaga atggtcgaga aaaatgcaga tgagattcca cccggataag tgtaaaccga 1380 tgcacctggg gaagggtaat cccggacaca aatacacaat gatcaagaca gacggatcag 1440 tacataccct tcagtgtaca aaggaagaga aagacctagg agtactcata gacagtaagc 1500 tgagcttcag cagtcatgta cagaaccagg tcaacaaagc caacaaggtg ctcggtgcta 1560 taagacacac tttcaaatat ctggacatag actcatttgt gttgctgtac aaaagcctga 1620 tacgtcccca cctagagtat gcaacggtca tttgggatcc aaaaacaaag agagacaaag 1680 acatggtgga gagggtacag agaagagcta caaagctggt tccttcaatc tcccaccttc 1740 cctactcaca gagactaagg gctctgggac tacccacact tctcttcaga aggaaaagag 1800 cagacataat tcttctctac aagataacac atgggttagt cacatgcaga accagtaacc 1860 actgcaacct ctgtagcaga gctatgctca cgcctagcct ggcaacctcc acaagaggcc 1920 actcgtacaa gtaccagata cagtgctcca agggtccaag ggccaatttt tacccagcaa 1980 gagtgatacc catgtggaat aagctctccg aacagacggt gaccagcaaa tcagtgaact 2040 tgttcaaatc aaggttgtcg gctgagtgga acacacacaa ggacctgtac gaatacgagt 2100 tctcctactg aaccagtagg ttgcagtgaa gaaagaacga aggaaggaag gaaggaagga 2160 aggtaggaga gaaggaagga aggaaggaag accagatgct acagccccgc agtttaacta 2220 tgtacagcct cattactcta acgggtacct actggctgtc gctcaaggtg aatactcaag 2280 gtgaata 2287 // ID Copia-2_DWil-LTR repbase; DNA; INV; 163 BP. XX AC scaffold_180723; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_DWil_; KW Copia-2_DWil-I; Copia-2_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180723; Positions 21887 21725. XX SQ Sequence 163 BP; 63 A; 34 C; 25 G; 41 T; 0 other; tgaaataatt ctctatcaac tcaatgcatg catgatcaca cacatacgaa tatatatatg 60 taactttgta tactctctct cgagatgcgt taaagagaac aatgactcga tgagagacaa 120 atagaagtta gtcgcgaata aattgcacaa ccaaccacac gca 163 // ID FEILAI_AA repbase; DNA; INV; 289 BP. XX AC . XX DT 29-OCT-2001 (Rel. 6.09, Created) DT 29-OCT-2001 (Rel. 6.09, Last updated, Version 1) XX DE Feilai family of SINE - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; FEILAI_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Tu Z.; RT "Genomic and evolutionary analysis of Feilai, a diverse family of RT highly reiterated SINEs in the yellow fever mosquito, Aedes RT aegypti."; RL Mol. Biol. Evol 16(6), 760-772 (1999). XX DR [1] (Consensus) XX SQ Sequence 289 BP; 80 A; 62 C; 76 G; 68 T; 3 other; ctggggcctt ccttagccga gtggttarag tccgcggcta caaagcaaag ccatgctgaa 60 ggtgtctggg ttcgattccc ggtcggtcca ggatcttttc gtaatggaaa ttttcttgac 120 ttcccngggc atagagtatc atcgtacctg ccacacgata tacgaatgcg aaaatggcaa 180 ttttggcara gaaagctctc agttaataac tgtggaagtg ctcataagaa cactaagctg 240 agaagcaggc tctgtcccag tggggacgta atgccaagaa gaagaagaa 289 // ID BEL-110_AA-I repbase; DNA; INV; 5533 BP. XX AC AAGE02021062; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-110_AA_; KW BEL-110_AA-LTR; BEL-110_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5533 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021062; Positions 59821 54289. XX CC Positions [4560-5144] - Integrase core CC 'ATATT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 210..5531 FT /product="BEL-110_AA-I_1p" FT /translation="MKRTPHKQSKMASDSEDVENDLSTFVQLRGMAQGNIT FT RIKTILNQAQAEQVELPLAQVKVYIKKVETSYGEYHHFHQQIVAILPSNKR FT EEQNEKFLHFETLHEEVSTMLETLLEILTAPLAMQPGHPTVNQPPLVIQQS FT LPRTIPTFDGRYENWERFKIMFRDAVDRTNEAPRIKLYHLEKALVGDAAGI FT IDVKTISDGNYDHAWEILEERFEDKRRMIDIHIAGLLGAKKLSKEDYGELR FT SLVESMSSHVENLKFLEQEFTGVSELIVVHLIARALDPTTKKLWESTIKRG FT ELPSYDNTVRFLKDRVSILERCHDTCEEAKSMRATGKPATKKPLLPKANAA FT TTSPSMDQRCSICDESHVTYKCSVFNSLNVSDRLSKVKEKNVCFNCLRSGH FT SVKSCSSKKSCTKCNKRHHTLLHFENSENSSKLLTENSAAKCTASKSTTPV FT EVPELSKSAASNMKQESQPAMTSAHSSNPVSPASQVVLLTALVNVMDHQQQ FT PRVCKALLDCGSQVSFISQSLVSALGIEVEEVSVPITGIGSVRSTIRQKCT FT LNVHSRCSDFTFTLNCLVSPKITGHVPSVKINLENWELPLGLSLADPSFHE FT PSQIDLLIGMDWFYDIMKPGCLKLNDYLPSLHDSKLGWLVGGKLLDSSSNS FT ALNSYSVRLDPVEELMQKFWEVEGVSSEVVLSSEEEQCETQFAATYRRDNS FT GRFIVQLPLRDIASQLSDSRNMALRRFYMLESKLERHADLRAQYDAFMDEY FT EALGHCQEICERDDSPGLLKWYLPHHAVLRPSNSTTKCRVVFDASAKVSGL FT SLNDVMMTGSNVQSDLLSIILRFRKHRYVMSADVAKMYRQIIVDPAFTPLQ FT RIFWRKSPQDRLRVLELTTVTYGTAAAPFLATRALLQLARDERERFPLASK FT VVEKNIYIDDALFGSDDFQETCELRDQLIGLLQCGGMHLHKWSSNTSRLLL FT SIPCEDRDSCVSFSDSGLNEIIKTLGLMWNPSTDEFLFRTTSRERIKKPTK FT RQVLSEIAKIYDPLGLISPVVVLAKIVMQKLWISKLNWDDQLDEGLMNEWE FT KFLTSLPTSEQIRIPRQILSSKAAYYELHGFADASQKAYGACVYVRSIFSD FT GTASMQLVSAKSKIAPISPLTIPRKELLAALLLTRLVSKVDAALEMNFNSI FT VLWSDSQVVLAWLRKPLGNLQVFVRHRVAEITSNNGYIWKYVRTEQNPADT FT VSRGQSAKELIDNKQWWNGPQFLCIVDYQEGTLEFLRDSEIPELRVEVASL FT PVMNYEEFPVLEKFTSFRKCQRIVAYMLRFISNSRRKNDDRVVARYLTIPE FT LRVASNLIVGAIQHQEFAKEIECVRAGEPNHRLNNLKPFLDKDLLRVGGRL FT GQSQLPFGVKHQLILPNKNPLVHGLILEIHRENHHAGCSMVQYLLRQHFWL FT INARSTIRRVLKGCVTCFRTNPTIIDQQMGNLPSYRITPAPVFERIGLDFA FT GPIYIKQSVRKAIPVKGYICVFICMVTKAMHLEAVEDLSTDSFIAAFQRFV FT SRRGYPKEVFSDNGTNFIGARSALRELYQLFKEETTQKRIFEYCQAKQIEW FT KTIPPNAPHFGGLWEAGVKSCKSVLKRVYQNTSLTLSGLSTLLCQIEAILN FT SRPLYSQSNDPTEPEALTPGHFIINRPLLAIPEPSVVGIPTNRLSHWQHIQ FT QLREHFWKRWSREYLSELQVRAKWTKQKVNVQPGIVVLLKDDNLPPQCWNL FT GRVVKVYPGADNLVRVVDVQTKFGTYKRPIHKLAPLPIIDNDHVAKTSTFC FT LGE" XX SQ Sequence 5533 BP; 1570 A; 1210 C; 1332 G; 1421 T; 0 other; tggaatttgg tccttcgaac cggatgctca agtttgagtg tccgaatgtg ttagtgattg 60 gtggttggat tggatgagtg gtggtggaga caaacacgga gaagtttgaa aagtaaaaca 120 agagactgag actacaagtg aaacaagaaa ctgaaactta aagtgaaaaa gttatgtgaa 180 aaaactgaaa aactataagt gaagtgaaaa tgaagcgtac accgcataag cagtctaaaa 240 tggccagtga tagtgaagac gtggaaaacg acctgagcac ttttgtacag ctacgtggta 300 tggcacaagg taacatcacg cgcatcaaaa ccatcttgaa ccaagcccaa gcggaacaag 360 ttgagcttcc tctagctcag gtcaaggtct acatcaagaa agtggaaact tcctacggcg 420 agtatcatca tttccatcag caaatagtgg caattctccc atcgaacaaa cgagaagagc 480 agaatgagaa gtttttgcat tttgaaacgt tgcacgaaga ggtttccacc atgctagaga 540 cccttctcga gattctcacc gcacctctcg cgatgcaacc gggacatcca acagtgaacc 600 aaccaccgtt ggttattcag cagtcgcttc cgcgtaccat ccccaccttt gacggtaggt 660 acgagaattg ggaacgcttc aagataatgt tccgcgatgc agtcgatcgc accaacgaag 720 cccctcgaat caaattgtat catcttgaga aggctctcgt gggtgatgct gcgggaatca 780 tagatgtaaa gacgatcagt gatggaaact acgaccacgc ctgggaaata ttggaggaga 840 gatttgagga caagcggaga atgatcgaca ttcatatcgc tggtctgctg ggtgcgaaga 900 agctatctaa ggaggactac ggtgagttga gatccttggt cgagtccatg agcagtcacg 960 tggaaaattt gaagtttctt gagcaagagt tcactggcgt ttcggaactc atcgtcgttc 1020 atctgatcgc ccgtgctttg gatccaacta cgaagaaact gtgggagtcg acaattaaac 1080 gaggagagct tcccagttac gacaacaccg taaggttcct gaaggatcgc gtctcaattc 1140 tggagagatg ccacgataca tgcgaagaag ccaagtctat gcgtgcaaca ggcaaaccag 1200 caacgaagaa gccattgctt cccaaggcaa acgcagctac aacctctccg tctatggatc 1260 agcggtgcag catatgtgat gaaagccatg tgacgtacaa gtgctcggtg ttcaacagtt 1320 taaatgtgag tgaccgactg tctaaggtca aagagaagaa tgtttgcttc aactgcctga 1380 gaagtgggca tagtgtgaag agctgttcgt cgaaaaagtc gtgtacaaag tgcaacaaaa 1440 ggcatcacac attactccat tttgagaata gtgaaaattc ctcgaagcta ttaaccgaaa 1500 atagtgcagc aaagtgtacg gcatcgaagt caaccacacc agttgaagta cccgaactaa 1560 gcaaatctgc agcatcgaac atgaaacagg agagccagcc agcaatgaca tccgcgcatt 1620 ccagcaaccc ggtgagtcca gcgagtcagg tggtgttgct cactgccctt gtcaacgtaa 1680 tggaccatca acaacagcct cgtgtgtgca aagccctgct agactgcgga tcacaggtga 1740 gttttatctc ccaatcattg gtgagtgctc tcggtattga ggtagaagag gttagtgtcc 1800 cgataacagg cataggtagt gtaaggtcga caatcaggca aaaatgtacc ttgaacgtcc 1860 actctagatg cagtgatttc acctttacac tgaattgttt ggtctcgcca aaaattactg 1920 gtcatgttcc gtcggtcaag ataaatttgg aaaattggga gcttccacta ggtctctcac 1980 tggctgatcc ttccttccat gagcctagcc aaattgatct gctcattgga atggattggt 2040 tctatgatat tatgaaacca gggtgtttga aattgaacga ttatctccct agccttcatg 2100 actccaaact tggttggcta gtcggtggaa agttgttgga ttcttcttcc aattctgctc 2160 tgaattctta ttccgttcga ctcgaccctg tcgaagagct gatgcagaaa ttctgggagg 2220 tagaaggtgt gtcgtctgag gtggttctgt ctagtgagga agagcagtgt gaaacgcagt 2280 ttgcagcaac ctaccgtcga gacaatagcg gtcgtttcat cgttcaactt ccattgaggg 2340 acatcgcttc gcaactatca gactctcgta acatggcctt gcgaagattt tatatgcttg 2400 agtcgaaact tgaacgtcat gcggatttga gagcccagta tgacgccttt atggacgagt 2460 atgaagccct tggacattgc caggaaatat gtgaacgtga tgattctcct ggtttgctga 2520 agtggtatct tcctcatcat gccgtcctcc gcccttcgaa ctctacaaca aaatgtcgtg 2580 tagtattcga cgcatctgca aaggtttccg ggttgtcact caacgatgtc atgatgaccg 2640 gctccaacgt gcagagtgat ttgttgtcca ttatcttacg atttcgaaaa catcgatatg 2700 ttatgagcgc agatgtagcc aaaatgtatc gccagattat cgtggatcct gctttcactc 2760 cattacaacg gattttctgg aggaagtcac cccaagatcg actgcgtgta cttgaactaa 2820 cgactgtaac ctatggtacg gcagcagccc cctttttggc tacgcgagct ctgctgcaac 2880 ttgcacgaga tgaacgagaa agatttccgc tcgcttccaa ggtagtcgag aaaaatatct 2940 atattgatga tgctcttttc ggttccgatg atttccaaga gacctgtgaa cttcgagatc 3000 agttgatagg attactgcag tgcggtggaa tgcaccttca caaatggtcg tccaatacta 3060 gtcggctatt gttatcgata ccatgcgaag atcgtgatag ctgcgtatca ttcagtgaca 3120 gtggtctcaa tgaaattatc aaaacattgg gattgatgtg gaatccttcc acggatgagt 3180 tcttgtttcg cacaacgtct cgagaaagga ttaaaaaacc caccaagcgt caggtcctgt 3240 ctgaaatagc aaaaatatac gatccactcg gcctgatttc tccggtggta gtgctggcta 3300 aaatagtcat gcagaagttg tggattagta agctgaactg ggatgatcag ttagacgaag 3360 gcttgatgaa cgaatgggag aaatttctaa catctctgcc gacttcagaa caaattcgca 3420 ttccacgtca gatattatca agcaaggcag cgtattatga gctccatgga tttgcggacg 3480 cctcacagaa ggcgtatgga gcctgtgtgt acgttcgttc aatattctcc gatggcacag 3540 catccatgca actggtcagt gctaaatcaa aaattgcacc aatttctcct ctgacgatcc 3600 cgagaaagga gttgctagca gcccttctgc tgacgagatt ggttagtaag gtagacgcgg 3660 cgttagagat gaacttcaat tctattgttc tctggtccga cagtcaagtt gtattggcat 3720 ggttgcgtaa gcctcttggc aatttacaag tttttgtccg ccacagagtt gcagagataa 3780 catccaacaa tggatacatt tggaaatatg tgcgaacgga acaaaaccca gcagacaccg 3840 tgtcacgggg ccagtctgcc aaggaattga ttgacaacaa acaatggtgg aatggtcctc 3900 agtttctgtg tatcgttgac taccaggagg gaacgctaga gtttctgcgt gattctgaaa 3960 tccctgagtt gagagtcgag gtagcctccc taccggtgat gaattacgaa gagttccctg 4020 tattggaaaa gtttacatca ttccggaaat gccaacggat agtagcttac atgcttcgct 4080 tcatttcgaa cagccgaaga aagaatgacg accgagttgt ggctcgatac ctcactattc 4140 ccgaactaag agtcgcatca aacttgattg ttggagccat tcagcatcag gagttcgcca 4200 aggaaatcga atgcgtaaga gcaggtgaac ccaatcatcg cttgaacaat ttgaaacctt 4260 tcctcgacaa ggatttgttg cgagttggag gtaggctagg ccaatcccag cttccattcg 4320 gagtaaaaca tcagttgata ttgcctaaca agaatcctct cgttcacgga ttgattcttg 4380 aaattcaccg agagaatcac catgctggat gttccatggt gcagtacctt ctgcgacaac 4440 atttttggtt aatcaacgcg agatcgacaa ttcgaagggt gttgaaaggc tgcgttactt 4500 gtttcaggac aaatcccacc atcattgatc agcagatggg aaatcttccc tcgtatcgga 4560 ttaccccagc accagttttc gagagaattg ggcttgattt tgcgggacca atttatatta 4620 agcagtcagt gcgaaaggcg attccggtga agggatacat ttgtgttttt atatgtatgg 4680 taaccaaggc catgcatttg gaagccgtgg aggatttgtc aaccgattcg ttcatagccg 4740 cttttcaacg atttgtgtcc agacgagggt accctaagga agtcttttct gataatggga 4800 ccaatttcat cggagcaagg tcagcattgc gagaactgta tcagctgttc aaggaagaga 4860 ccacccagaa aaggattttt gagtattgtc aagcgaagca gatcgaatgg aaaacgattc 4920 ccccaaacgc gcctcacttt ggagggcttt gggaggctgg agtgaaaagc tgcaaatccg 4980 tactcaaaag agtttatcag aacacctcac taacactttc cggtctttcc accctgttgt 5040 gccagataga agctatccta aattcgagac cattgtattc ccaatcaaat gacccgacag 5100 aaccagaagc tttgacccct ggtcatttta taatcaaccg tcctctcttg gcgattcccg 5160 aaccgtctgt tgtgggtatt ccgaccaatc gtttgtctca ctggcagcac atacagcaac 5220 ttcgagaaca cttctggaaa cgatggtcga gagagtatct ttctgaattg caagtgcgtg 5280 ctaaatggac caaacagaag gttaacgttc aaccgggaat tgttgttctg ttaaaagatg 5340 acaatttgcc accacaatgc tggaatttgg gacgagtagt aaaggtttat ccaggtgccg 5400 ataacctggt aagagtggta gacgttcaaa cgaaatttgg tacatataaa agacctatac 5460 acaaacttgc acctttgcca ataatagaca acgaccatgt tgccaaaact tccacttttt 5520 gcctggggga gaa 5533 // ID Copia-5_CQ-I repbase; DNA; INV; 3555 BP. XX AC AAWU01030339; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_CQ_; KW Copia-5_CQ-LTR; Copia-5_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3555 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 325-325 (2011). XX DR Genome; AAWU01030339; Positions 7089 10643. XX CC Positions [1472-2002] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 50..3541 FT /product="Copia-5_CQ-I_1p" FT /translation="MTDAKPAPVQVPMLDESNFNPWRLRMLAFLEEHELLE FT CVRTSVGDVAELKERAGEKPEEKTARMKLLDKRIRKDRHCRSQLMARISDA FT QMDHLEEDMSPKDIWDKLHNIHARKSVASRMRIQRQIVTMRYDGGKLQQHF FT LQFEKLIREYRSAVAKLEEIDVICHLFISLGSAFSGAVTALETMSQEDLKL FT DFVKNRLLDEETKKGIESPTPSQSDTAFHGAKPQKKQKCFGCNKEGHRVAD FT CPEKKKPQQSKRKQKSKANVADSGSNRKEVCFVSEANPQQARTRWFIDSGA FT TDHLVRDKELFCELHRLKKPIEIAVAKDGETIQAEHSGTVKVVAVVDGKRV FT NCTIKDALYVPKLRCNLFSVLRVEQAGMRVVFEDGKATVYSGSEIVACATV FT QNKLYEMNFLARHGTEDSSLLTCGRVRKGFVLWHRRFGHLNEQSLKCLVQK FT EMVSGLDLSTAGTDDTIVCESCVVGKQTRKPHPPREGRRSSRVLELVHTDV FT CGPVSPEGLDGMKYFVTFIDDWTHFVMVYLIRSKDEVMECFEEYEALVTAK FT FERPISRLKCDNGGEYRNKRFEGFCKKKGIRMEPTVPYTPEMNPLSERMNR FT TLVEKARAMLADAGVDKKFWGEAMLTAAYLTNRSPTAALEMKKTPYELWES FT KKPDVSNIRVFGSDVFVHVPKELRRKLDDKAWKGTLVGYSHNGYRVWDPKK FT RTVVTARDVDFVESVVEQQQVAQPNSQFVPLESDEESCVDDASCADDGDET FT SEDEFSSAADEPEAEEADGSGRPQRPKNPPAWHKDFEVEYASFALNAMNFI FT EDIPSSVPELKKRDDWHEWRAAMQDEMNSMKRNNAWTLVKLPAGRKTVTCK FT WVFKVKRGDAEKPDRFKARLVARGFTQKPRFDYTETYSPVAKLDTLRAVLA FT LANENRMFVHQMELKTAFLNGELAEEIYMTQPEEFQQGNGLVCKLNRSIYG FT LKQASRAWNDRFHGVISKLGFVRSANDMCLYTRGAGDKKLILVLYVDDILL FT AGTSLKVLEAVKQKLTEEFEMTDEGEIKQFLGMRFERDKLKGVLKISQRDY FT FDGLLKRFRMDDCKPISTPMEHRLRLVKGEEKQRTSKPYRERGFPRLQAPE FT ARGHQVPLPAGAGAAEDDRDRVRALCGATSGHHDQGSTGSCLQGSSRQAGI FT GGDVL" XX SQ Sequence 3555 BP; 839 A; 881 C; 1187 G; 648 T; 0 other; agtcgcgttt tacggtcaaa cggaaaagtg cgcggaatcg tcgggcgtca tgacggacgc 60 aaaacccgct ccggtgcagg tgccgatgtt ggacgagtct aacttcaacc cgtggcggct 120 ccgcatgcta gcattcctgg aggagcacga gctcttggag tgtgtgcgga cgagtgtggg 180 tgacgtggcg gaactaaagg aaagagcggg agaaaagccg gaggaaaaga cggcccggat 240 gaagctgctg gacaagcgca tcaggaagga ccgtcactgc cgttcgcagt tgatggcgcg 300 gatcagcgat gcgcagatgg accacctgga ggaggacatg tccccgaagg acatctggga 360 caagctgcac aacatccacg cgcggaagag cgtcgcgagc cggatgcgga tccagcggca 420 gatcgtgacg atgcgctacg atgggggaaa gctgcagcag cactttttgc agttcgaaaa 480 gttgatccgg gagtatcggt ccgccgtcgc gaagctggag gagatcgacg tgatttgcca 540 cctgttcatc tcgttgggat cggcgttctc cggtgctgtg acggcgctgg aaaccatgtc 600 gcaggaggac ctcaagctcg atttcgtcaa gaaccggctc ttggacgaag agacgaagaa 660 gggcatcgaa tcgcccaccc cgtcccagag cgatactgcg tttcacgggg cgaagccgca 720 gaagaagcaa aagtgtttcg gctgcaacaa ggagggccac cgggtggccg actgtcccga 780 gaagaagaag ccgcagcaga gcaagaggaa gcagaagtcg aaggcgaacg tggcggattc 840 cggcagcaac cggaaggaag tttgtttcgt gagcgaggcg aacccccagc aggcgcggac 900 gcgctggttc atcgattctg gtgcgacgga ccatctagtg cgggacaagg agctgttctg 960 tgagctgcat cgtctgaaga agcccatcga gatcgcggtg gccaaggacg gtgaaacgat 1020 ccaggcggag cactccggca cggtgaaggt ggtggcggtc gttgacggta agagagtgaa 1080 ctgtacgatc aaagacgcgt tgtacgtgcc aaagttgcgc tgcaatttgt tttcggtgct 1140 gcgcgttgaa caagccggga tgcgcgtcgt gttcgaagac gggaaagcga cagtgtacag 1200 tggttccgaa atcgtcgcgt gtgctactgt gcagaacaag ctgtacgaaa tgaactttct 1260 ggctcggcac ggtacggaag attcttcgct gttgacgtgc ggtcgcgtgc ggaaggggtt 1320 cgtgctctgg caccgtcgtt ttggacattt gaacgagcaa agcctcaagt gtctggtgca 1380 gaaggagatg gtatccggac tcgatctcag taccgccggg actgacgata ccatcgtgtg 1440 tgaatcgtgc gttgtcggca agcaaacgcg gaagcctcac ccaccgcgtg aaggtcgtcg 1500 atcgtcgcga gtgctcgaat tggtacacac cgacgtgtgt ggtcccgtgt cgccggaggg 1560 actcgacggg atgaaatatt ttgtgacgtt catcgacgac tggacccatt tcgtgatggt 1620 gtacctgatt cgctcgaagg acgaggtcat ggagtgtttc gaggagtacg aggctttggt 1680 gacggcgaag ttcgagcggc cgatctctcg gctgaagtgc gacaacggcg gcgagtaccg 1740 gaacaagcgt ttcgaggggt tctgcaagaa aaaggggatc cggatggagc ctactgtccc 1800 ctacacgccg gagatgaacc cgttgagcga gcgcatgaac cggacgctag tcgagaaggc 1860 gcgggcgatg cttgctgatg ctggcgtcga caagaagttc tggggggaag ccatgctgac 1920 ggcggcctac ctgacgaacc gaagcccgac cgctgctctc gagatgaaga aaacgccgta 1980 cgagttgtgg gagtccaaga agccggacgt gtcgaacatt cgtgtttttg gcagcgatgt 2040 gttcgtccac gtcccgaagg agttgcgccg taagctggac gacaaggcgt ggaaaggaac 2100 gctcgtcggg tactcgcaca acgggtatcg agtctgggac cccaagaaac ggacggtcgt 2160 cacggcgcgc gacgtggact tcgtggagtc tgtcgtggag cagcagcagg tcgcgcaacc 2220 caactcgcag tttgtgccac ttgaatctga tgaagaaagc tgcgtcgatg acgcgagctg 2280 tgccgacgat ggagacgaga cttcggaaga cgagttcagc agtgctgccg atgaacccga 2340 agctgaagaa gctgatggca gcggccgacc tcaaagaccg aagaaccctc ccgcttggca 2400 caaggatttc gaggttgagt acgccagttt tgcactcaac gcgatgaact tcatcgagga 2460 catcccgagt tcagtccctg agctgaaaaa gcgggacgat tggcacgagt ggagagctgc 2520 gatgcaagac gagatgaact ccatgaagcg caacaacgcc tggacgctgg tgaagttgcc 2580 tgcaggacgc aaaaccgtta cctgcaagtg ggtgttcaag gtgaagcggg gcgacgctga 2640 gaaaccggac cggttcaagg cgagactggt cgccagaggc ttcacgcaga agcccaggtt 2700 cgactacacc gagacctact ctccggtggc gaagttggac acgctgcggg cggtgttggc 2760 gctggccaac gagaatcgga tgttcgtgca ccagatggaa ttgaagaccg cgttcctgaa 2820 cggtgagtta gctgaagaga tctacatgac ccagccggaa gagtttcagc aggggaatgg 2880 tctggtctgc aagctgaacc gttccatcta cgggttgaag caagcgtccc gggcgtggaa 2940 cgaccgattc catggcgtca tcagcaaact cggattcgtc cggagcgcca acgacatgtg 3000 tctctacacc cgaggagctg gagacaagaa gctgatcctg gtactgtacg tcgacgacat 3060 cctcctcgcc ggaacgtcgt tgaaagtgct ggaggcggtc aagcagaagc taaccgagga 3120 gttcgagatg acggacgaag gtgagatcaa gcagttccta ggaatgcgct ttgaacgtga 3180 caagctgaaa ggagttctga agatcagtca gcgcgactac ttcgatggcc tcctgaagcg 3240 tttccggatg gacgactgca agccgatctc gacaccgatg gagcaccggc tgaggctggt 3300 gaaaggagag gagaagcaga gaacgtcgaa gccctaccgt gagcgaggat tcccgagact 3360 gcaagcgcct gaagcacgtg gacaccaagt tccacttcct gcgggagctg gtgcagcaga 3420 ggacgatcga gatcgagttc gtgcgctctg cggagcaaca agcggacatc atgaccaagg 3480 gtctaccggc agttgtcttc aaggatcttc gcgccaagct gggattggag gagacgtgct 3540 gtgattgagc agggg 3555 // ID hATm-35_HM repbase; DNA; INV; 4007 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-35_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4007 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1929-1929 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 628..2928 FT /product="hATm-35_HM_1p" FT /translation="MAASNKATRSTSNVWLVGHILSTIDGPSQLPTSGVAL FT CRVFYEMIVKKSTLSTSCNTVADEVMIFWLKANIPTIAKPHVVAKLKSIHK FT QHVQVSKHRSRKSATQHSYEEKFTAKMSTLFDIAHSEWDCLIKINEDKQFL FT IDQRGPRKMLMTSEDLMYKKAVVKYRKRKLEEKRRYEQHKESKVKSTETLM FT DDNSDYSLDENSDCDASIQIQKVFNRRKISKHSTMGTSQLSITVSKRCILD FT NQLFHAALDRTKITPRQAMMVVTPALAAIGIDVNKLTLSRSTLIEARTTSR FT ESLSTAVRKDFQPLVPLVAHFDGKMLSYVDRTRRECLPIVVSGLDIEKLLG FT IPEIPAGTGALMGQKVVEFIHEWPGVEEHLAGLCFDTTASNTGSYNGAITV FT IQKSMNRRVLFLACRHHILEICANAVFDTFFVSKGPNIELFGRVKSQWELI FT EKSKFEPLESDTSGTGCLTKLEKEWLASRKSDVVENLKKNLKDTQPRDDYK FT EFGQLALLLLGEINDTTSIRTPGAYHRARWMAKGIYCLKIFGFRHQLNVTD FT KEIGILRRICLFITTIYVCFWFSAPLTTEAPKNDLLLLKVIENYCQVDSKV FT ANVAKNKIKLQLWYLSEDLAALPLFSEEISFEEKMEIVNALQKEPLPGDNR FT RLASNKIYTFSKLSIANFVTRRSMNLFDSLNLPKEFLTSSPKSWANRDDYK FT AACKIVRAMKVVNDCAERAVKLATDFNEVLTKNDNQRQLLYQVVEYHRKKI FT STEATKRQLSHNHSK*" XX SQ Sequence 4007 BP; 1380 A; 647 C; 699 G; 1281 T; 0 other; gggtggtttt tttttatatg gaaaaatatt tttttttcaa attttaaatc gcacaacctc 60 ttatttggtg aggtctgtta taaaaagtat atagttaaaa tttttttgca aactgaaaat 120 gggaagtggt cccccaaacc ctattttgga accctaaatt gaccaaaatt tgtataaaaa 180 cttaatcttg taattctatt tttatttcat agaaaatact gctataaatt gatctaaaaa 240 caaataatat aaagtattat aatgttacaa catagaagtt cacagaagac tggaaagtac 300 aagcaaattt agattaaccc ccctcagcat tatttaccca aaatcgctcc taaatgccta 360 ttttgatttc aaaaacattt tatttgctat aatttcaagt aaactttttc taagtaaact 420 ttagctcaga aattaaattt gtataaattt aataatagta atgacatgtt tggccattgt 480 tgggtgggaa agggttagga atggggggcg ggtgtattat ctaagacttt taattttttt 540 attatgcttt gagcactttg taattgttaa atttctgtca tattctttct attatagtaa 600 aaaagtataa aaatattaca taagaagatg gctgcttcaa acaaagcaac acgctcaact 660 tcaaatgttt ggctagttgg acatattctc tccacaatag atggaccgag tcagttgcca 720 acaagtggtg ttgcactatg tagagttttt tatgaaatga tagtcaaaaa gtcaactctt 780 tcaacatcct gcaacacagt tgcagatgaa gttatgattt tctggcttaa ggctaatatt 840 ccaactatag caaagcctca tgtagttgct aaactaaaga gtattcacaa gcaacatgtt 900 caagtgtcta agcacagaag tagaaaatca gctactcaac acagctatga agaaaaattc 960 accgccaaga tgtctacact ttttgatatt gcccatagtg aatgggattg tctcatcaag 1020 ataaatgaag ataaacaatt tctgatagac cagagaggcc ccagaaagat gttaatgaca 1080 tcggaagacc taatgtataa gaaagcagtt gtaaaataca gaaaacgcaa gcttgaagaa 1140 aaaagacgct atgaacaaca taaagaaagt aaagttaaat ctacagaaac attaatggat 1200 gacaattcag attactcttt ggatgaaaat tctgactgtg atgcgtcaat tcaaatacaa 1260 aaagttttta atcgtcgaaa aatttcaaaa cattctacta tgggtactag tcaactctca 1320 attacagtat ccaaacgttg tattcttgat aatcaactct ttcatgcagc attagaccga 1380 acaaaaataa caccacgaca agcaatgatg gttgtcacac cagccctagc tgcaatcggt 1440 attgatgtaa ataaattaac attatcacgt tccactctaa tagaagctcg aactacatct 1500 cgcgaatctc tttcaacagc agtaagaaaa gattttcaac ccttggttcc gctagttgct 1560 cattttgatg gtaaaatgtt gtcatatgtt gatagaacca gacgtgaatg tctgccaatt 1620 gttgtatctg gcttggatat tgagaaatta ctagggatcc ctgagatacc tgctggtact 1680 ggtgctctga tgggacagaa ggtagttgag ttcattcatg aatggcctgg cgtcgaggag 1740 catcttgcag gtctttgctt tgacacaact gcaagtaata caggaagtta taatggggca 1800 ataactgtca tacagaagtc aatgaaccga agagttttgt ttttagcctg tcgacatcat 1860 attcttgaaa tctgtgcaaa tgcagtgttt gatacgtttt ttgtatcaaa aggaccaaac 1920 attgaactct ttggtagagt gaagtcccag tgggagttaa tagaaaaatc aaagtttgaa 1980 ccacttgaaa gtgatacaag cggaactgga tgccttacaa agcttgaaaa agagtggctt 2040 gcatctcgaa aatccgatgt ggtggaaaat ttaaaaaaaa acttaaagga cactcaacca 2100 cgtgatgatt ataaagagtt cgggcaacta gctcttctcc ttttgggaga aattaatgat 2160 acaacaagta ttcgcactcc tggggcttac catcgtgcaa gatggatggc taaaggaata 2220 tactgtttaa aaatttttgg ttttcgccat caattaaatg tcacagataa agaaatcggg 2280 attttaagaa gaatttgtct cttcataacc acaatttatg tatgtttctg gttttcagca 2340 cctctcacaa cagaagcacc aaaaaatgat ctccttcttt taaaagtaat cgagaactat 2400 tgtcaagttg acagcaaagt tgcaaatgtt gcaaaaaata aaattaaact tcaattatgg 2460 tatttaagtg aagatctagc tgctctccca ttattcagtg aagaaatttc ttttgaagag 2520 aaaatggaaa ttgtaaatgc cttacaaaaa gaaccattac caggtgacaa tagaagactt 2580 gcttcaaaca agatttatac attctctaaa ctttctatag ctaactttgt tacacgacgc 2640 tcaatgaatt tgtttgactc tctcaatcta ccaaaagaat tcttgacatc gtctcctaaa 2700 tcttgggcta atcgtgatga ctacaaggct gcttgtaaga ttgttcgtgc catgaaagtg 2760 gttaatgact gtgcagaacg ggcagttaaa ttagccactg actttaatga agttttgaca 2820 aaaaatgaca atcagcgtca gcttctttac caagtagttg agtatcatag aaaaaaaatt 2880 tcaactgaag ctacaaagag gcagttaagc cataatcatt ctaaataaca gcaatttaat 2940 ctattcaaat atgcatacat ataatatgaa ctaaatatat agttaagtac atatatggct 3000 attgtttttc tgtagtattt taatttagta atatgtcaag gtttgattaa attatgtatg 3060 ttattttatt agaataattt taaatgttgt tatttgtaat atgttttctt gatcgcataa 3120 aatcatattt ttcattcttt taaaattttt gtttgagtac attatttttt gttatttttt 3180 aatttataaa tttgaattag ggctgagtta gaaatgactg gtagaggggc atatggggat 3240 ctagtttggg gcatatgggc cactagtact cacaatttaa aaaattgaag aaatgttaaa 3300 aaataagaat cttttttaag ttgttataac agctttttgt atttttaccc cccttcccct 3360 tgccccaagt tacctctgct acatacattc atacatttgt cttcacccta ataagtaaaa 3420 ttattgcatg tagcactggc aatagtttat tctaatgcgt ttcagaaaat tatcattttc 3480 aacaaaaacg ggcctcaaaa gggcttaaaa aacatgtccg ggtcctagtg ccctgagggt 3540 ggggtctggg aatttgtcac tgctagatac attcttttta ttgttatcac cctaataagt 3600 aaaactgatg catgtaacac taatacaagc tgattttaat agatttcaga aatatattat 3660 tttagacaaa aatggatctc aaaagagctt tgaaaacagg tccgggacct cgcgttccta 3720 gggtggagtc tgggaatagg tcattgctat atacattctt ataatttttt taaccctaat 3780 aagcgaaact gatgcatgta acactgatac ttattgattt taatacattt tagaaatatg 3840 tcattttggt caaaaacggg gttctaaagg gcttacaaag caggtgtggg ggaccacttc 3900 ccatcatcag tttggaaaaa aattttatca gtgaactttt tttattgaac ctcaccaaat 3960 aagagggtgc gcgatatcaa attcaaaaaa aaaaaaaaaa accaccc 4007 // ID Gypsy-2_DFa-LTR repbase; DNA; INV; 341 BP. XX AC ADHC01000031; XX DT 21-APR-2011 (Rel. 16.04, Created) DT 21-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Dictyostelium fasciculatum genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DFa_; KW Gypsy-2_DFa-I; Gypsy-2_DFa-LTR. XX OS Dictyostelium fasciculatum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-341 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Dictyostelium fasciculatum RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; ADHC01000031; Positions 1287012 1286672. XX SQ Sequence 341 BP; 136 A; 49 C; 29 G; 127 T; 0 other; tgttagattc agattgtaaa ttaaagtaca gttcaatgat tagtcaatga tatttcatct 60 aattatcaac atacttaatc ctggtgccaa atatagaaag tatgctggat aaaatactaa 120 cgactaaact attgtcggtt agtcgtctcg aacaacaaac tataaaagac caatgaaatc 180 aataaacatt aattcatctt ttattattat aattaaataa tcatttacaa tctaatctaa 240 agacattatt actatctata tattcatata tctatatatt attattatta tatacctctc 300 aactaattat atatttacca agtatctata tagttattac a 341 // ID BEL-108_AA-LTR repbase; DNA; INV; 233 BP. XX AC AAGE02018277; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-108_AA_; KW BEL-108_AA-I; BEL-108_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018277; Positions 70829 71061. XX SQ Sequence 233 BP; 65 A; 59 C; 39 G; 70 T; 0 other; tgttcgccta cttgcgcatc cctaccacac accgtacgtg aacaacccta aaaagacaaa 60 cacacacaca cattattgct tttttgattt ttctgttttt ttttttgcga aataaagacc 120 agttgagttc tatagtcaag agtatcaaga cgcgttgttg cttggctacg gaaagaattt 180 tcccgtcgtt atccgacccc ccaaatttac agtccgctcg atctgaagga aca 233 // ID Penelope_Ele2 repbase; DNA; INV; 3068 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Penelope-like element family from Aedes aegypti. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope_Ele2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3068 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3068 RA Kojima K.K. and Jurka J.; RT "Penelope-like elements from the yellow fever mosquito."; RL Direct Submission to Repbase Update (27-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 5 CC sequences with >96% identity. Both terminal ~330 bp are PLTRs. XX FH Key Location/Qualifiers FT CDS 333..2723 FT /product="Penelope_Ele2_1p" FT /note="reverse transcriptase." FT /translation="MASTLSESNTFIKLKKSIAGLKKDIAFLSECRKLNLT FT PKSHKIKVRGQVPQDIIKKMESEYLRSSIKKLHSKLNSQTLKCYSLHLKLA FT KQYPWEFPLFLTKVIQAEECEARRKKDIHSKKLAILKNKQHNKVVTALPKV FT HSIEGFVVNRSTQQFSEEQLTLLNKGLAYAVATKPDIEQTIIDVETAITHT FT IPIEFQESARNDTEESLKGNKFQLRGQNSDEDKVLSELRDKSVYYVKADKG FT NAVVILDKDDYDNQMNEKITTGPYRHFRVDPLPTMVRHVEKTLKECKPLLG FT DNTSRLKVSNPVLPRIKGLPKIHKPGNEMREIISAEGSPTHKLAKWLVKEF FT QSMPKPFESRSVNNTQEFASQLQASGYIQDDEIMVSFDVKALFPSVPVKEA FT INLLEEWLLNQHNESNWKNKVRYYIKLTRLCMEENYFSFRGNFYKQTKGAP FT MGNPLSPFLSELFMANFESILKKKDLLPQRWWRYVDDVFSIIKKDYLPKIL FT EAINSIHKDIKFTHEEEKDNKLPFLDLLITRESSHFEFEIYRKPTNTMRVI FT PSTSNHTYQHKMAAFHHMIHRMQTLPLSETGKSKELDYIFETARLNGYRNS FT TIQAVVDKRARDKYRKSMTTLTAEKEDLKRVAVNFDTKITQPLAKKFRKYG FT VDLVFSSRSSQLKSRLGSTKDKINKLNRAGVYKISCPHCNKIYIGQTKRTL FT EIRFKEHIAEVKKADKELEKGLAYDFKSKVAEHVFQDGHQMTSDNIDIVRS FT VSSSWKLDVAESLEIYKQKPSLLLNRDQGNGQSRLFKFVPKKYKRA" XX SQ Sequence 3068 BP; 1097 A; 496 C; 613 G; 862 T; 0 other; agttagggat ttgtataatt attattatcc cccttttcaa tttttgacca agacatgaca 60 ggtatactcc tgtcgaaatt ttttcctagg tagtttgatt gtgtaggtac ttatctggtg 120 ttagtattta aggtagagag atgcgacgat ttcttcagtg gatacgagta actgacactg 180 aagaagactg caagtggtag tcgaaatacg cgtatctgtc aaagataagc atttaggagc 240 ggaattaaaa ggtacacggt actccatcta ctttaaagtt tctctatttg agattctgct 300 cagaggattc gaacattcat taaagagttg taatggcttc cacgctttcc gaatcaaata 360 cttttatcaa gctgaagaag tcgatcgcag ggttaaaaaa ggacatagca tttttaagtg 420 agtgtcgtaa gcttaatctc acaccaaaga gtcataaaat taaggttaga ggtcaggtcc 480 cccaggacat tattaagaaa atggaatcgg aatatttacg atccagtatc aagaagttgc 540 actcaaaact gaactcacaa acattgaagt gctatagttt acatttgaaa ttggccaaac 600 aatacccttg ggaatttcca cttttcctta ccaaggtgat acaagcagag gagtgcgaag 660 cacgtagaaa gaaagatatt cactcaaaaa aattagcaat tttaaagaat aagcagcaca 720 acaaagtggt aacagcattg ccgaaagtcc attctattga aggttttgtg gtcaatcgct 780 cgacgcagca gttctcagaa gaacaattga cacttctcaa taagggatta gcatatgcgg 840 ttgccacaaa accagatatt gaacagacaa taatcgatgt tgaaaccgcc attacacaca 900 caataccaat agagtttcaa gaatctgcta gaaatgacac tgaagagtca ttaaaaggta 960 acaaatttca attaagaggt caaaatagtg atgaggataa agttttgagt gaacttcggg 1020 ataagtcggt atactatgta aaagcggaca agggcaatgc agtggttatt ttggataaag 1080 atgattatga taatcaaatg aacgaaaaaa taaccactgg tccctatagg cattttagag 1140 tcgatccact tcccacaatg gtgagacatg tagaaaaaac tttaaaagaa tgtaagccac 1200 ttcttggaga taacactagc cgcctcaaag tttctaatcc agttctacca agaattaaag 1260 gattgccaaa aattcacaaa cctggcaatg aaatgagaga aattatatca gcagaagggt 1320 ctcctacaca caaacttgca aagtggctgg ttaaggaatt ccaatccatg cccaaacctt 1380 ttgaaagccg ttcagttaac aatactcagg agtttgcaag ccaacttcag gcttctgggt 1440 atattcaaga tgatgaaatt atggtttctt ttgatgttaa agcattattt cctagtgtgc 1500 cagttaaaga agccataaac cttttggaag aatggctact taaccaacac aatgagtcaa 1560 attggaaaaa caaagttcgg tattacatca aattaacccg actttgtatg gaagagaact 1620 attttagctt ccgtggtaat ttttataaac agacaaaagg tgcacctatg ggtaatccat 1680 tgtctccgtt tttgagcgag ttgtttatgg ccaattttga aagcattttg aaaaagaaag 1740 atttgttacc acagcggtgg tggagatatg tagatgatgt gtttagcatt atcaaaaagg 1800 attatttgcc aaaaatttta gaagcaatta acagtatcca caaagatatt aaatttactc 1860 atgaagagga aaaggataat aaactacctt ttttggattt gctaattact agggaatcgt 1920 ctcactttga atttgaaatt taccggaaac caactaatac tatgcgagtt attccgagta 1980 cttccaacca cacttatcag cataaaatgg cggcttttca tcatatgata catagaatgc 2040 aaacccttcc tttaagtgaa actggaaaat caaaagaact ggattatatt tttgagacgg 2100 ctaggcttaa cggatatagg aatagtacaa tacaagcagt agtagataag agggcaaggg 2160 ataaatatag gaaaagtatg acaacactta ctgccgaaaa agaggattta aaaagagtag 2220 cggtgaattt tgataccaag attacacagc ctttggctaa gaaatttcga aaatatggag 2280 tagatttggt tttcagcagt aggagtagtc agcttaaatc tagattagga tcaacaaagg 2340 ataaaattaa taaactgaat agagcaggag tatataaaat ttcatgcccc cattgtaata 2400 aaatctatat aggacaaaca aaacgaactc tagaaataag attcaaagaa catattgctg 2460 aggtaaaaaa agcggacaag gaattagaaa agggattagc gtatgatttt aaatcaaaag 2520 tggctgaaca tgttttccag gatggacatc aaatgacaag cgataacata gacatcgtac 2580 gaagcgtatc ttcatcttgg aaactagacg tcgctgaaag tttagaaatt tacaaacaaa 2640 agccttcact acttcttaat agggatcaag gaaacggaca atcaaggctg tttaagtttg 2700 tacctaagaa atataaacga gcttgaatac ataggtagat agttagggat ttgtataatt 2760 attattatcc cccttttcaa tttttgacca agacatgaca ggtatactcc tgtcgaaaat 2820 tttttcctag gtagtttgat tgtgtaggta cttatctggt gttagtattt aaggtagaga 2880 gatgcgacga tttcttcagt ggatacgagt aactgacact gaagaagact gcaagtggta 2940 gtcgaaatac gcgtatctgt caaagataag catttaggag cggaattaaa aggtacacgg 3000 tactccatct actttaaagt ttctctattc gagattctgc tcagaggatt cgaacattca 3060 ttaaagag 3068 // ID Jockey-N6C_CQ repbase; DNA; INV; 1398 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N6_CQ; Jockey-N6B_CQ; Jockey-N6C_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1398 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 590-590 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >89% CC identity. This family encodes a protein similar to Jockey ORF1p CC but does not encode ORF2p. Thus it is a non-autonomous non-LTR CC retrotransposon derived from Jockey, like HeT-A. The consensus CC is ~82% identical to that of Jockey-N6B_CQ. XX FH Key Location/Qualifiers FT CDS 68..1285 FT /product="Jockey-N6C_CQ_1p" FT /translation="MEVEESSDGDEFRPVDGTRKRRKTSDDGAVVGNGXEE FT HLLNNNKFSPLAENNNNNNNNGNPAGKGSVPPVPVVPVAKKPPPLVVKNTS FT XAKLRSVMSPCTTKPSYKLTPFGIKLLCSSEERFETARAHLIANKVEFYTH FT EKRSERQLRVVVRGLPPASPEFVKKXLKESXNLDAVEVHAIKRKGEFASSE FT ETPYIVTFPKGYTNLKQLSEIKXLRAFHIRWEAYRNKRPNVTQCRNCLQLG FT HGTRNCHLKGRCNNCGGPHKTDECEVQEAQPKRCANCSGAHEATDRSCPKR FT ADFIRRRQQASKPKPPARKAEKQSPAVPAFTPAEFPPLPGAVPDGKSKDRP FT RPAGSSQGGSRDGGATEEEAGEVLYSSAELWGIFSEYIGRFKTCKTRLDQV FT TLVSYMISKYGI" XX SQ Sequence 1398 BP; 367 A; 377 C; 411 G; 237 T; 6 other; cgtctgacag ccggaggcgg cggacgcgtt tttcckctgt tttcgctgcg cggcaaagtg 60 cttgcggatg gaggtggaag aatcctccga cggggacgag tttcgtcccg tagacggaac 120 gcggaagcgg aggaagacca gtgacgacgg agccgtcgtc gggaatggcc magaggagca 180 tctcctcaac aacaacaagt tcagcccgct ggcggagaac aacaacaaca acaataacaa 240 tggtaaccca gccgggaaag gaagcgtccc accggttcca gtggttccag tggccaagaa 300 accacctcct ctggtggtaa agaatacgag ctwcgccaag ctgaggagtg taatgtcgcc 360 gtgcacaacc aagccaagtt acaagctgac tccgtttggg atcaaattgc tgtgttcctc 420 cgaggagcgt tttgagaccg cgcgggccca cttgattgcg aacaaagtgg agttctacac 480 ccacgagaaa cgaagcgagc ggcaactccg cgtcgtcgtc agaggacttc caccggcttc 540 acctgaattc gtcaagaaga amctcaagga atcgcamaac ctggatgctg tggaggtgca 600 cgccatcaag agaaagggag agttcgcctc gtcggaggaa accccgtaca ttgttacgtt 660 ccccaagggg tacaccaacc tcaagcagct gagtgaaatc aagckcttga gagcatttca 720 catccggtgg gaggcctacc ggaacaagcg gccgaacgtg acccagtgca ggaactgctt 780 gcagctgggt catgggacca ggaactgcca cctcaagggg aggtgcaaca actgtggggg 840 tccccacaag acggacgagt gcgaagtcca ggaagcccag ccgaagcggt gtgccaactg 900 ctctggagcc cacgaagcca cggaccgcag ctgccccaag cgtgcggact tcatccggag 960 gcgccagcag gcgtcgaaac cgaaaccgcc ggcaaggaag gcggagaagc agagtccagc 1020 agttccggcg ttcacgccgg cggagttccc tccgctgccg ggcgcagttc cggacggaaa 1080 atcgaaggat cgccctcgac ccgcaggaag cagccaaggt ggctcccgag acggtggagc 1140 cacggaggag gaagccgggg aggtgctcta cagttcggct gagctgtggg gcattttctc 1200 cgagtacatc ggcaggttca agacctgcaa gacccgcttg gaccaagtaa ccctcgtcag 1260 ttacatgatc tccaagtatg gaatttaagg agtttttttt tgttattata tattgtttaa 1320 ccgatcctcg gtcccaacct ggtcgcagca cctaaaagga cctaataaaa ataagttaag 1380 aaaaaaaaaa aaaaaaaa 1398 // ID RTE-1_DYa repbase; DNA; INV; 2363 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.06, Created) DT 15-MAY-2009 (Rel. 14.06, Last updated, Version 2) XX DE RTE-type sequence: consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-2363 RA Jurka J.; RT "LINE-type retrotransposon families from fruit fly."; RL Repbase Reports 9(6), 1156-1156 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 301..1503 FT /product="RTE-1_DYa_1p" FT /translation="MTSTLREQLIPLEDHPWEHTCRQLRTAAEQIIGLQQR FT GRADWISEGTWDLIRRRNNLKPLADQLTQNRDEYRELCRAVKRAARKDKRA FT LLDRLATDAERAAGENNMRSLYQQIARIAGTHHRRNQPVRDLEGNLLSNDD FT AQIRRWRQHFMGISHTTSPNVATDYRSIAPPSGNTRIPSTPPHVREIVNAI FT KKLKRNRAAGEDNIPAELLQVDTQLMADTLHPHFARIWEGETVPDSWKSGI FT IVKLPKKGDLSDCNNWRGITLLNTSYKILATLLNERLQEKIEPTIRDEQAG FT FRPHRSCVDQANTLRIITEQAVEWRAPLYLLFIDFKKAFDSVERAAIWRSL FT ARKGVPANLIAIIKSMYDDANLAVLHNGKQVNLSRRTPEFGKDALYHLYSS FT TSWSTS*" FT CDS 1503..2345 FT /product="RTE-1_DYa_2p" FT /translation="MSEVCSSKRGITWSLTRHLEDLDYADDICLISHRLGD FT IQRKSDNLVRIASSVGLEINVAKTKAMRFNHSNQGNIIIHGNPIEFVNSFP FT YLGTIISNNGGVDDDVSSRLGKARSAFGRLYRIWRNREISRRTKLRIYNAC FT VKSILLYGSETWLATKKVTQKLQVFSNKCLRIICGIFWPNRIANTELWRVT FT SEPPIHTQIRKKKWMWIGHALRRNPSSIVRMALDWNPQGSRRVGRPRATWR FT RTAQQELAQINITWENAKQTAQNRVRWKALVEALSSREE*" XX SQ Sequence 2363 BP; 774 A; 600 C; 577 G; 411 T; 1 other; gcaaccctca gcaagacgcg gcaaggtgac attaaccagc tattgcrgct attcccacac 60 atccactact tggacatctc ctagtttcga ccacatttgc attagcagga aatggagaca 120 ctcactactg gacgtacgga acaaaagggg agcatctata gacagtgatc atgagttggt 180 cattggagaa cttgaaataa agttgaaccg caactactca aggaacactg ataacagcca 240 acaccgacga gcgcccccct gaacctgcac ctccttagcg actccactct gaggacgaga 300 atgacgtcca cattgcggga acaactgatc cccctagaag accacccatg ggagcacacc 360 tgcagacagc taaggaccgc agctgagcaa ataattgggc tgcaacaacg cggaagagca 420 gactggatat ctgaagggac ctgggacctg attagaagga ggaacaatct gaagcctctg 480 gcggaccagc tcacacagaa tcgtgatgaa tatcgcgaac tgtgcagagc tgtgaaaaga 540 gcggccagaa aggacaaacg agcgctactg gatagacttg cgacagacgc agaacgagca 600 gccggcgaga acaacatgcg ctcgttatac cagcagattg cacggattgc cggaacccac 660 cacagacgga accagccagt cagagaccta gaaggcaacc tcttatccaa tgatgatgca 720 cagatccgga ggtggcgcca acacttcatg ggaattagcc acaccacatc accaaacgtg 780 gctacggact atagaagcat tgcgccacca agtgggaata ccaggatacc ctctacaccc 840 ccgcatgtaa gagagattgt gaatgcaata aagaagctaa agcgcaacag ggcagccggc 900 gaggacaaca taccagccga actgcttcaa gttgacactc agctgatggc tgatacgctg 960 catccacatt tcgcacgcat atgggaaggc gagactgtac ctgactcctg gaaaagcgga 1020 atcatcgtga agctccccaa gaaaggcgac cttagcgact gcaacaactg gagagggata 1080 acactgctca acactagcta taaaatccta gctacactgc tgaacgaacg cctccaggaa 1140 aaaatcgagc caacaatccg agacgaacag gcaggcttca gaccacacag gagttgcgta 1200 gatcaggcaa acacactacg cataattacc gagcaagctg tggaatggag agccccgctt 1260 tacctcctgt tcatcgactt caagaaggcg ttcgactcag ttgaaagggc agcaatatgg 1320 cgatcacttg caaggaaagg agtacctgcg aacctcatcg ctatcatcaa atcgatgtat 1380 gacgatgcca acttggcggt actgcacaac ggaaaacaag tgaacctttc cagacgaaca 1440 ccggaattcg gcaaggatgc cctctatcac ctctactctt caacatcgtg gtcgacgagt 1500 taatgagcga agtatgctcg tccaagcgcg gaattacgtg gagcctcacc agacaccttg 1560 aggacttgga ctatgcagac gacatctgtc ttatctctca tagactcggc gatatacaga 1620 ggaaaagtga taacctcgtc aggatagcca gcagcgtcgg actcgagata aacgtagcaa 1680 aaacaaaagc tatgcgattc aaccactcca accaaggaaa catcatcata catgggaacc 1740 caatcgagtt cgtcaacagt ttcccatacc ttggcaccat aatatccaac aatggaggag 1800 tggatgatga tgtctcgagt cgtcttggca aagcccgctc agcatttggg agactatacc 1860 gcatttggag aaacagagaa ataagccggc gaacaaagct ccggatctac aatgcgtgtg 1920 tgaaatccat actgctatac ggaagtgaga catggctcgc aacgaagaaa gtaacgcaga 1980 aactgcaggt tttcagcaac aaatgcctga gaatcatatg tgggatattc tggccaaaca 2040 gaatagccaa cacagagctg tggcgcgtta cgagcgagcc cccgatacac acccaaataa 2100 ggaagaagaa atggatgtgg atagggcacg ctctcaggag aaaccctagt agcatagtaa 2160 ggatggcact ggactggaac ccccaaggaa gcaggagggt aggaaggcca agagcaacat 2220 ggagaagaac cgctcaacaa gaacttgccc aaataaacat cacatgggaa aatgctaaac 2280 agacagcaca aaacagagtt agatggaagg ctctcgtaga agccctaagt tcccgagagg 2340 aataagaagg aattaaaaaa aaa 2363 // ID Hoana4 repbase; DNA; INV; 2532 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.09, Created) DT 21-SEP-2009 (Rel. 14.09, Last updated, Version 1) XX DE Hoana4 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; Hoana4. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-2532 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1269..2270 FT /product="Hoana4_1p" FT /translation="MIKAFSSSDKIHCINHLLNNAVEKAINAIPEVENIVT FT SCSQLVRYFKKSGNNSSLGVSLKSFCPTRWNIVYYLLKSIEINWIELTAIL FT KENKQTCRVDDININHLGAIVRLLEVFENVSKKLEASKRPTIHLILPNLNK FT LKKTCNIEYYDAEIIKSLKFQLNSQILSTVTPNLSKYHNIALFLFPPTNKL FT IQFSQTEKETIINDCKIIMSQFLEDNSDLNNNAAQLVDDEFADYFDVPKFE FT TVQDKVEQEISGYSNIKVAYTPDFNALAWWHLNKQFFPLLHKTSCKIFCIP FT ASSAASERTFSNARNLITEKRCQIATNSENINKIMFLHSNLN" XX SQ Sequence 2532 BP; 863 A; 442 C; 450 G; 777 T; 0 other; tagacatctg catgattaca ttcataacat acttctaata gagtacattc gaatcgaata 60 gaatgaagtg agtggaatga tggaattgaa atgagattgt atgaatgtaa gcttaggttc 120 ctctcgtaat ttgagacgtc aaacgaaatg acagggaatg aacgaatcga aaagtgaaat 180 gagatatatg tcaatgtcat gtatacccga atccttcgag ttgtttctta tcattcgtgc 240 acatttcttt ctgagagctg ctacaaattt gtctacaaat ttttaacaat ttttaactta 300 atatcaaaaa aaaagagctg ccaaaaaaaa aaacggaaat atggacaacg aatttataaa 360 agaagccgac gagcctgtat gtatatattt gtagagtttt ttaaagttta gtgtttaggt 420 ttttaagagt taatttaatt tatttaaaat atataggctg tgcaaagcgc ggtgggtatc 480 aaagagaacc tgcaaagtgg aatgtacacc ttaacagaaa agagaggcag aagcgaagtc 540 tgggagtttt tccgcaaaat aaaaaccgaa aatggagaac aaattgatga ctttgtagct 600 tgtaagatgt gctatactgt tttaaagttt acaggaagca catctaacct ggtcaaacat 660 aagtgctaca tcctaaatgc cagcaaattc cacaccgctg ctcttgttga agttaacaaa 720 gctacaaaag acgagggctt atctgttgta accgaatggg ttcttaagaa ctgccgtcct 780 cttaacatta ttgacgactc tggcatcaaa cagtttgctt catttttaat taatgttgga 840 gctaaatatg gtgcaaatgt cgatgttaat aagctgttac cacatccaac taccgtatcc 900 cgaaacataa aatctatata tttgacccac tttgggccaa taaaaacgga aatagaaaag 960 tacaaagcct ttggatatgc tattaccagt gacatatgga ctgataactt cttaaaaaca 1020 gcatatttgt cgtgcactgt ccattacata agagagggag ttttggtcga tcgccttatg 1080 gccatgaagt caatgaaagg ctcgcccaac acaggtttgt taactctatc tttatgttac 1140 aagtataatt acaatttttt ctttgtcaag gtgccaatat aaagaaaaaa atagaggcga 1200 ttttgaagga ctttggatgt gaccttgaag tagacaaacc cgtaatcgta actgatcgtg 1260 ggtcgaacat gattaaagcc ttttcgagct cggataaaat tcactgcata aaccacttgc 1320 ttaacaacgc tgttgaaaaa gctattaacg ctattcccga ggtggaaaat attgtcacaa 1380 gttgcagcca gctagtcaga tattttaaaa agtcaggaaa taattcatcg ttaggggtgt 1440 ctttaaagag tttttgtcca actcggtgga atatcgtcta ctatttgctc aagtccattg 1500 aaataaattg gatagagcta acagcaattt taaaagaaaa taagcagacc tgcagagtgg 1560 atgacatcaa tattaatcat ttaggtgcaa ttgtacgatt gctggaagtt tttgagaatg 1620 tgtccaaaaa acttgaagct tctaaacgcc caacgataca tttaattttg ccaaatttaa 1680 acaagttaaa gaaaacgtgc aatatcgagt actatgacgc agaaataatt aaaagcctta 1740 aattccaact caatagccaa attttatcta cagttactcc aaatctgtca aaataccata 1800 acatagcttt gttccttttt cccccaacta acaaactaat tcaattctca caaactgaga 1860 aagagaccat tattaatgat tgcaaaataa ttatgagtca atttcttgaa gacaattctg 1920 acctcaacaa taatgcagcc cagttagtag atgatgaatt tgccgattat ttcgatgtcc 1980 ccaaatttga gacagtacag gacaaggtgg aacaagaaat ttctggatat tccaatatta 2040 aggttgcata cactcctgat tttaatgctt tagcttggtg gcatttaaat aagcaatttt 2100 ttccgctatt gcacaaaaca agctgcaaaa tattttgcat tccagcgagc agcgcggctt 2160 cggaacggac cttttcaaac gcaagaaatt tgataacaga gaagcgttgt caaatcgcca 2220 ctaactctga aaatattaac aaaataatgt ttttgcattc aaatttaaat tagatataaa 2280 ttcgctgcat ttaatttgtt tttatttcct tttatttaca tttacaactc gaatcatttg 2340 aatgaaacgt ctctgttttc aacgttctca attcactcaa ttcgtttcct tctatcaaat 2400 gattgtgcaa atgagtaggc gtaagagatc catccctctc acatcatgtt ggctttgctc 2460 tcaaattcac tcaactcatt tttagcattg tgcatttgaa tgtgggtttt ttgccactca 2520 tgcagatgtc ta 2532 // ID SAT-1_AAe repbase; DNA; INV; 152 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Satellite sequence: consensus. XX KW SAT; Satellite; Simple Repeat; nonautonomous; MSAT; SAT-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-152 RA Kojima K.K. and Jurka J.; RT "Tandemly repeated DNA from the yellow fever mosquito."; RL Repbase Reports 11(4), 1456-1456 (2011). XX DR [1] (Consensus) XX CC Tandem arrays include units corresponding to sequences 1-152, CC 1-114, or 39-114. XX SQ Sequence 152 BP; 24 A; 34 C; 51 G; 43 T; 0 other; agttgtgttt gatccttcga ccttgacgct tgctcctgcg ggggcgggcg atgagggcag 60 gatcaaacat gtcgcgcgtc ggtcgagtga agtggaaaag tggcgtgatc cgttagttgt 120 gtttgatcct tcgaccttga cgcttgctcc tg 152 // ID Gypsy-37_OD-I repbase; DNA; INV; 3365 BP. XX AC CABV01004218; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_OD_; KW Gypsy-37_OD-LTR; Gypsy-37_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-3365 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004218; Positions 4090 726. XX CC 'GCCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 124..1830 FT /product="Gypsy-37_OD-I_1p" FT /translation="MNNANEVIQQAKLFTSNINGSCRSNVILIPLLSLDVW FT IQEKFNENNKKTPEKTIVIDRALQLGFTYVNYRFASYQSRYFILPGSISDF FT PAGRIQSFHYFIRKFKEEEEFRKLAQPVKLELFEDLDLNNLDITTKLTLDF FT IQRTVQSGATAKNTGLEIMMSETESETEEMNNTHFEDANETLNEEINRSFN FT GNDDEAREETKFARTRPINSTRKNPYVNTNSIIKDEDDSEKTWKAHKIPKL FT KESGLGVKDWADKCVFLVDFGREKKLTDTQKCQLILQNVPFASFGPIMDEF FT QQENSKTFENLKQIIEDQVELDEMEASVTLQSTKFDENTDRDMRRFYERIK FT KLVRIKYPGLQKEGLNTTSMEHFERLIPDYIKNSESWGLDTYDAKDPAKRV FT LLANRIFLMSKERRHINALTDRKNNAKIKSSHSTSKKCDNCGQEGHIRPEC FT LLLPQCFTCGKRGHKSHECRSQRQNGQQIQERGNFQQSNGQGNYSQQNNQF FT QNEESGNFNSNGRNGNQCVQCNKKIGGCFICGSTNHWKRDCNQKQINTFGQ FT NGGRQNHHDDDTPFIRMRHSNE" XX SQ Sequence 3365 BP; 1361 A; 583 C; 621 G; 800 T; 0 other; tctggtgact gaagtgggat ccatgtgtta gaatgcataa ggcgacgcag gaccgaacct 60 gcatctggtg actgaagaaa tagaacaaac gctaacgaag agaaaggaaa gaggtgtaaa 120 ttaatgaaca atgcaaacga agtaatacaa caagctaaac tatttacgtc caacatcaac 180 ggatcttgca gatcaaacgt catcttgata ccgttgctgt cattggacgt ctggattcaa 240 gaaaaattta acgaaaacaa caagaaaacg cctgaaaaaa ccatcgtcat agacagagca 300 cttcaactag gcttcacata cgttaactac cggtttgcta gttaccaatc acggtacttc 360 attcttccgg gatctatttc tgattttcct gctggaagaa tacaatcgtt tcattacttc 420 atacgtaaat tcaaagaaga agaggaattt agaaaactcg ctcaaccagt aaaattagaa 480 ttatttgaag atctggatct aaataatctt gacatcacta caaaattaac gttagatttt 540 atacaacgaa cagtgcaatc aggcgcaaca gcaaagaaca caggacttga aataatgatg 600 tctgaaactg agtcagaaac agaagagatg aataacactc acttcgaaga tgctaacgaa 660 acactaaacg aagaaataaa cagaagtttc aacggcaacg atgatgaagc aagagaagaa 720 actaaatttg ctagaacaag accaattaat tcaacaagaa aaaatccata cgtgaacacg 780 aactcgataa tcaaagatga agatgattca gaaaaaacat ggaaagcgca caaaatccca 840 aaactgaagg aaagtggatt aggcgttaaa gactgggcgg acaaatgcgt gtttctggtc 900 gatttcggaa gagagaaaaa acttacagac acacaaaaat gtcagctcat acttcaaaac 960 gttccgtttg ctagctttgg accaattatg gatgaatttc aacaagaaaa cagtaaaaca 1020 tttgaaaacc ttaagcaaat catcgaagat caagttgaac ttgacgaaat ggaagcaagt 1080 gtcacacttc aaagtacaaa atttgatgaa aacaccgaca gagatatgcg tagattttat 1140 gaacgaataa aaaagctcgt aagaataaaa tatccgggat tacagaaaga aggactcaac 1200 acaacgtcga tggagcattt cgaacgattg attcctgatt acatcaagaa cagcgaaagt 1260 tggggattag atacttacga cgctaaagat ccggccaagc gagttcttct tgccaatcga 1320 atattcctaa tgtcaaagga gcgacgtcac ataaatgcac ttacagaccg taaaaataat 1380 gcaaaaatca aatcttctca ttcaactagt aaaaaatgcg acaactgtgg acaagaaggt 1440 cacataagac cagaatgcct actcttacca caatgcttca catgcgggaa acgaggacat 1500 aaaagtcatg aatgtagatc tcaaagacag aacggacaac agattcagga aagagggaac 1560 ttccaacagt caaatggaca ggggaattat tctcaacaga acaaccagtt ccaaaacgaa 1620 gagtccggta attttaactc aaatggaaga aacggcaatc aatgtgtaca atgtaacaaa 1680 aaaatcggag gttgcttcat ttgcggatcg acaaatcatt ggaaacggga ctgcaatcag 1740 aagcaaataa atactttcgg tcaaaatggg ggaagacaaa atcatcacga tgatgacaca 1800 ccatttataa gaatgagaca ctcaaatgaa taatgctata tagaaaaaca tatgaaaccg 1860 attccggtaa attttgaaaa tttggaccga caatcggcaa acatcgatat cagcttcgct 1920 tcaaccgcct tttaactaca ttgcagctta cgcttacaag ataaaacagc ttcaaatcgg 1980 aagaagcgag aacatttaac aaatcatcaa aaatcaacaa atcaacaaaa aatcaaaaaa 2040 aaaggagtgc tggaattcgg actcgcacga ataaacaacg gaaaaatttt ttgcggaaga 2100 tgttgacctg taccaaaatg aaacacatgg tacactcacg gatttggaca gcagcatact 2160 attcaaaaag tctctcagaa gaaaacttat tggaatcaaa acaacgacca tacgagtctt 2220 agtggaattg tgcacaatcc gggattgaag acaagcggat caacacgttg ccgaggagca 2280 tttgcggata ttggagaaac agaacagaaa atatattatg taccagctac gagcgatatt 2340 taaacgtggt agactgtttt agcaagttga gacattccat cctaaaagat ttcaaataaa 2400 gggaacccac atggaagaaa tccaagctag ttatttgaaa ctgaagaaga aagtaccaaa 2460 gatggagatt atcaaagcca agctgatcag cgaaaaattc tcgaaaattc atggaaaaat 2520 tcaacagatg aattgatcag catcaacaat gaaaagtacg tgaagagaaa gaacgggctt 2580 tttaagaaga cttgtaatga aaaaagaatt gattatgaaa cgtggtagag acagctcacg 2640 aaaaaacaac atataatgaa gtcaagggac tactcaagct gattgaaaca aaagaatccg 2700 aaaatccgaa aatccgaaaa ttccgaaaat ccgaaaaaaa aaaaaaaagc aagaaaattc 2760 ggcgttcatc atcgctaagg ggccgtctga gccgtcgaat gggctaaaag aagatttagt 2820 ggaatgcttc aagataactt tacaaatgaa atgaagaaaa gaaagaaaaa taattataat 2880 caaactatac tgcttatttt tttatgaata ttttaattat tattactaaa aacttattta 2940 attaattttc atataaataa taattatata ataatttata ttgagtaagt aaggattaaa 3000 agttaatacc gataaattca tctctaggtc taacggtcca cagttcttcg tgaaaactga 3060 ggtgagagac tcatttatcg ggtgttagaa ttctggttag ttataaaaaa tataaaaaat 3120 tgatgaaaaa aagtacaaat aaatttaaaa acttaccatt taaagaaaat ctcttccctt 3180 tgttccaaaa ccttcttatt gacagtaatt cttcgttaaa agtgctcttt caaatttctc 3240 gtatttaatt ttcactttta aaaaaattaa caggaaacga atcgctaact ccatagagga 3300 gtatgaaata tttcttatta tcttcaaccg gaaacggttc ctcatttttc acgagagggg 3360 aggga 3365 // ID EnSpm-17_HM repbase; DNA; INV; 6374 BP. XX AC . XX DT 21-JAN-2009 (Rel. 14.02, Created) DT 21-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6374 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 388-388 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(520..2052,2015..2560) FT /product="EnSpm-17_HM_1p" FT /translation="MSYWTKRRKIQKNVASIFDEIAEANSFVLKDSSLPLN FT MSCNNSKFDATPESFKPIDTSIFSLNSVFHDCVNSSDGIWETNDGKLNSSE FT SILSIDKSNVINVLVNDLTSWATRHKITHAALGDLLXILRKSFPGLPKSPK FT SILKTEQLKVEVFENSYSYLGIKNGLLSESTECFTDKTVTIQVNFDGLPLF FT KSSSMQIWPILGLVEKFDGILQCNKKPFVIALYCNNCKPKNITAFLSDFVL FT EVNELHKNGITLNSQHYSFRISAIICDTPARAFVKCTKGHNAYHGCDKCEQ FT HGVYNGRVTFPETAATLRTDASFANMSDRKHHICVSPLAQTSIGLISQVPG FT DYMHLSCLGIMRKFVYLLIKGPLKTRLGPRAVLDLSEKLLHLKKYICSEFA FT RKPRAFSDFERWKATEFRQLLLYTGMVCFRDTLPDALYNNFMLFCVGMTIL FT LSHSLCHKYVDYAHSLLVLYVEHSSSLFGPEVVTYNMHGLVHLAQDAKQYG FT PLDNISAFPFENYLSTIFQPSLLKIIYHNNLKKLVRKPHLPLQQVLRRLSE FT QHLAESINSETNSTLFQLKKEHFDGPLCNSENINVQTQHKELYTNKFCIKV FT SQGDNCFKIKTGSIVLVKNIIKSCCGQIYLVCSNFDSVENFFTYPCNSDTI FT DIFMVSYLAKHLSYVLLSEIKQKFVLLPYKAYFVTIPLLHNFS*" XX SQ Sequence 6374 BP; 2162 A; 896 C; 958 G; 2287 T; 71 other; cccagacagc aagtatatac tggtccgata atggctagcc aatgccagcg aattcattaa 60 aggaccggta aagttccgtt cgtggattgc cagttctggc aagccgtaac aggaccattg 120 ccgagccttt ataacattgc ctgggcaggt tagccgcgat agtaccaata gcaggccatt 180 aatggtttgc cggagcagat gaatagacwt tgggccaata tcggccgacc agtgccgaac 240 caatttgaat taatttgaaa aaactttakt tctcgcatcc atcttgtttt taacgtttca 300 tacaaatgta trttcctcag attctttatt tgttaaaaaa tatttgcagg cgagatactg 360 ttctagtata ttttctttta cttcattact ttaaaatctt ttgaagatta tattttatta 420 aacagagttt ttgtttttaa aaattttttc agaagaatta ttgcaaatta attatattgt 480 taaacgaaac aagtttttta ttgtattgtt agtttaaaaa tgagctactg gactaaacga 540 aggaagattc aaaaaaatgt tgcatctatt ttcgatgaaa ttgctgaagc taatagtttt 600 gttttgaaag atagttcatt acctttaaat atgagctgca ataactcaaa gtttgatgct 660 actccggaga gttttaaacc tattgacaca tctatttttt ctttaaattc agtttttcat 720 gattgtgtca acagctctga tggaatttgg gaaaccaatg atggaaaatt aaatagttca 780 gaatcaattt tatccataga caagtcaaat gttataaatg tcctagttaa tgatttgacc 840 agttgggcaa ctaggcacaa aattacacat gcagctttgg gtgatttatt grttatattg 900 cgaaaatcat ttcctggctt accaaaaagt cccaaatcta ttctgaaaac tgaacaatta 960 aaggttgaag tttttgaaaa ttcctattct tatttaggca taaaaaatgg acttttatct 1020 gaaagtacag aatgttttac agataagact gtaacaattc aagtcaattt tgatggtttg 1080 cctcttttta aaagttcaag catgcagata tggccaattc tcggtttagt ggaaaaattt 1140 gatggaatat tgcaatgtaa taaaaaacca tttgttattg cgctgtactg taacaattgc 1200 aaacctaaaa acattacagc tttcttatct gactttgttt tagaagttaa tgaactgcac 1260 aaaaatggta tcacacttaa cagtcaacat tattcattta gaatttctgc aattatttgt 1320 gacaccccag caagagcatt tgtaaaatgc actaaaggtc ataatgcata tcatggctgt 1380 gataaatgtg agcagcatgg tgtatataat ggaagagtaa cttttcctga aactgctgca 1440 acattaagaa cagatgcatc attcgctaat atgtcagatc gtaagcatca catttgtgta 1500 agccctctag ctcaaacctc tattggtttg atatcacaag tcccaggtga ttatatgcat 1560 ttgtcatgcc tgggcattat gcgcaaattt gtgtatttgt taattaaagg tcctttgaaa 1620 acacgtctcg gtccacgtgc agtattagat ttatcagaaa agttgcttca tttgaaaaaa 1680 tatatatgtt cagaatttgc tcgaaagcca agagcatttt ctgattttga acgctggaaa 1740 gctacagaat ttcgacaact tctgttgtat actggaatgg tttgctttcg agatacatta 1800 cctgatgctt tgtacaacaa ttttatgctt ttttgtgtcg gaatgactat tcttcttagt 1860 cattcattgt gtcacaaata tgttgattat gcacatagtt tgcttgtact ttatgtggaa 1920 cactcttcaa gtttatttgg acctgaagtt gtaacttata acatgcatgg actagtgcat 1980 ttagcacaag atgccaaaca atatggtcca ctagacaata tttcagcctt cccttttgaa 2040 aattatttat cataataatc ttaagaaact tgtaagaaaa ccacacttac ctttgcaaca 2100 agttttaaga aggctatcag aacaacatct tgcagagtca atcaatagtg aaactaatag 2160 cacacttttt cagttaaaaa aagagcattt tgatggacct ttgtgcaata gtgaaaatat 2220 aaatgtgcag actcaacaca aagagctcta tacaaacaaa ttttgtatta aagtttccca 2280 aggtgacaat tgttttaaaa taaaaactgg ctctatagtt ctagttaaaa atattattaa 2340 aagttgttgt ggtcaaattt atctagtgtg cagcaatttt gatagtgttg agaatttttt 2400 cacctaccca tgcaattcag ataccattga tatttttatg gtatcttatt tagctaagca 2460 tctttcatat gttttattat cagaaataaa acaaaaattt gttttactgc cttacaaagc 2520 ttactttgtt actatacctt tgctacacaa ttttagttga ataagtttat ttttaatctt 2580 atattgcaca tgctttgact atccttgact tggagacatg ttcaactttt tttcttaaca 2640 aattgtaata agtttgaatt gtaataagtt tattttaata agtttagtat tttaatagca 2700 atatatgaag acttagttat cgatatccag tgtaagcaat tttgatttta tatgccattc 2760 tgaatattat tttatttaaa atataattgg agctgaaaaa gatttttttt ttttcagatt 2820 atgtattatt catatctttt tcagattatt tawtaaaata taaaaatggt aagcaataat 2880 ttaacaaacc tttttaaggt ttgttaaatt attgcttact ttttatattt tattcttatt 2940 tattttttta ttaatttgta atttactact actgtacagt tgttgtacag tatttagcta 3000 ctactactac tactactact actactacta ctactactac tactactact actactacta 3060 ctactactac tactactact actactacta ctactactac tactactata tatatatata 3120 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 3180 tatatatata tatatatata tatatatata tatatatata caaaattata taaatttaca 3240 cgattacaca attcctgtta aagaatataa ttataatatc aatattgttg taaaataaca 3300 taatgattgt gtaatttaaa tagtgtaaca atcttagtct tattttttga gataagtcta 3360 tttttctaaa ctttcatttc agtaatttta atttttcttc aaaataaatt tttaatcttt 3420 gaaattatat tatataatgg aatcatttga aaatcttttg aaaattttat acatcagaaa 3480 aataaaataa aagtagtatt tcttatactg agctttattt tttttttttc ttttattgat 3540 gttatgattt ttatttaatt tagaataaac tttagaatgg atctccatca gtttcttatt 3600 gttgctttta aaaatgaaga agatgctgtt gcaattttgc cagaaacatt tatttaatta 3660 gtaatgaact ttgcttttgg ccaccatata cctcacaagg cagagtaaaa aaggctatca 3720 aagaatgtga aacccctaat gaaaattggc caaagcatgc cattcgagtt attaaaaaaa 3780 taggtcagtc acatatgtac ataatgtgct taagatataa cctgcattaa tgtcaaaaaa 3840 ttgtaattta ttgtacaaaa gaaatagtga taattgtata ctgtagttgt aacattgatg 3900 gtgttgttat aagagtgtaa cttaggggtt gtattttagg ttcctacaac aaggcacaat 3960 gtcaacttga aaaagctacc ataacatctg atctgcagag tgattcagaa tcaagtacta 4020 ggctacgtcc aaagcggtac ttaatagaag aaaatttata tttttttgta aatacaattt 4080 tatggtttta aagttnttaa gtcctaagta tttttatttt attttatgtg aannntattt 4140 tnatntccnn agngnnnnnn gnaaaagaat tttttnnnan annaannaat ncnaatttnt 4200 aaaaanntna antgnttacn ccaannncng tttccncnaa aanatttttc cnnngnacnn 4260 ccannnaanc tnnaaccnta aaaatnntaa nnccacnaat tntaancncn cctcctttaa 4320 tnaaatctaa aatacaatct tatagatgag gagcagagat cacaagaaat gttgcaatcg 4380 gctgtgacaa acaaaatagt ttcaatgtta gtggaactca aagaggatgt ggcaaatctg 4440 agaaaagagg ttgcatacaa cacagccatg cttcagaata ttgctaatgg tggagttcat 4500 gatgatgacc aaaacatttt tgatattctt aaacttcctt tatgcaatat ggctcagtta 4560 ttgcaacttg aaaaaacact tgaaaatgaa aaaataacaa tgaaaaagct tgtaagttgc 4620 tcactcattt aactgatctt aatttttaca ctttattttt ttattattta ttttgttatt 4680 taaaagaaaa tataaakttt ctaatggttt attgaagttc accacaatgt gtttagcatt 4740 aaaattttgc aactcaagaa aattttaatg ctaaacacat ttaatcttta tttctctttt 4800 ctaaaactaa attagcagta agaaagcaag agttggaaat taaaatctaa atttttaatt 4860 aaattaaatg tatgttgtca agttaatata ttttttgtgc taattgtcaa aagtttcatt 4920 tcgaagacat tttattttat ttgttttatt gtttatttta ttgcttattg tatctagata 4980 cgagcagttg cttctaaagg tgggcaaaac ttaaaagagg cagtgaagag aatgttgttg 5040 tcattattac ataatgatgt tctcaaaaaa ctaaattgga caggtcaggg tgataaagac 5100 aaacagtctt tccgtgattt gaattgcaga ttagttatcg aaggtagatc ttttttttta 5160 actttgaatt taaaatttaa aaaagtttta attcatttaa atcttttgtt aamactatca 5220 aggtggaaaa aagttcttat ataaaatctc tactaaaatg ttggtaagga tgatagggga 5280 ggagataata aacatttgaa ggtaatcttt attgcgatgg tgtcacaata ccatatccgg 5340 ttctgttaaa taggctaaca tttgtaaggg aatccagtag aaaaacattt tctgaaaagc 5400 gggacaatat aaaaagaatt gtatatttat tgttcttttc cttttatgta agtaaatgtc 5460 aattatacca aaatgtttat attttaaaat aggtttgact aaactttaaa ctatttaact 5520 taataatttt agttgactct ccaaataaat atacgagttg agaatatgac atcgtttgat 5580 tttatatgta tgttatctta tgctttttaa ttttacaggc gcattgaagc gaaatccctc 5640 gactgcacaa gccacagaaa gcgatattca aaaggygatt gtacattttt tggccggagc 5700 aatcgataga aatggtggaa ggaaggcgag agccacaaga ctactggctc taaaagctac 5760 tgcaaccgag aaagtatctt gtttttcgag agtgaaaaat aaaatttttt ctagtttacc 5820 atcaacttcc gattcagaag aatcaaatta aaatttgtta ttgtaatatg ttagattgta 5880 aatatgttag attgtaaata tgcttatatt actcattgaa tattatatta tataataata 5940 cccttgctta gtcaactatg tcgtaatgta agcgatgaaa tgtagttaat gcattatgaa 6000 taaaaaattt gaaatttatt ttacaaaaag acaagtttaa tctgatgcta aaagtaaagt 6060 ataattttat cccagtgaga atgttaacta ttagaataat taaaagcatt attgtgattt 6120 ctgttcgtga ctattgtcac gaaaagaaat ttcaatacgg aatttcaata actgtagtaa 6180 aggttagcca tttttctgcc agttcttttc ctgtagtggc tagccactgt tttgccatca 6240 ctggctagcc acggctggtc catgaaaaat ccttagagca aaacggagct tcgccagcct 6300 tggcagccga agccggtcca gcgcggcagc ctatacagga ccagtgcggg accactatac 6360 ccttgctgtc tggg 6374 // ID hAT-31_SM repbase; DNA; INV; 2520 BP. XX AC . XX DT 14-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-31_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2520 RA Bao W. and Jurka J.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 80-80 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1896..2348 FT /product="hAT-31_SM_2p" FT /translation="ESQTATNFNFGNQNLLNLLNKYSKIIPNYSEEVKKQI FT LSQYSNFKFLYQTKFNSNVVSSFADMVDLANKEEEEFQELSSLIDISATFQ FT ASSADCERGFSQMNAIKTKARNRLETFHLDHLMRIKLFLRRGNEIDVQKVF FT NFWKNNKDRRGN" FT CDS join(349..762,689..1435) FT /product="hAT-31_SM_1p" FT /translation="MSKRNSSSQPASTSKKVCPNFKPAWLLQLVKTKIPTS FT RELDHVKLDYIFEFNEEESTIKCKFCTEAKVDGDFSIGKKWDDWKLDYLKR FT HLNHKTHIEAVEILTARRCGGLVRLITETPKDREVRIRKCWKTKGKSKNGL FT LRLLKTVRYELENAGRQKANPKMVKVLIDNVILAIKLNTSMNSVQEINSHV FT SKYVQIPESWRSKNYAFEFVECINQVVQNEDFDAIRKSQYHCLIIDESTDI FT FVHKMLIIYIKFRTNFDYKTSFATIIQLSACDALSIVAAIKKFYSDFKLDM FT DKMVMMTSDGAAVMLGGYNGVAKILQRDIPYLSAQHCVAHREDLGIDDACE FT GNTFNENSRDFVKNGVHTMFSRSSVEKRQISRFSCSNGMRCFEF" XX SQ Sequence 2520 BP; 889 A; 387 C; 457 G; 787 T; 0 other; caacgttccc tttaagctgc gcgcatgcgc aatcgcgcat aaaaaataat ttgttgctca 60 cagcattcca tgctgcgcgc ataagaaaat tgctcgcgaa tttttttata aattaatgac 120 cgtttttttt aaaataattt ttaatatcga tccaagtagg cttattttta cattgctaat 180 tgctattata aagcaatata tttagtatat ttcgaacatc aaccgaataa agttaatatt 240 ttttttacga attttatact gtgctgtgca gaggaattgt aaatggtata tgtgttatta 300 tttatttttg ttgatttttt ttaaatacat tcaatcccag cttcaacgat gtcaaaaaga 360 aactcctcaa gtcaaccagc gtcaacatcc aagaaagtat gtccaaattt caaacctgct 420 tggcttttgc aacttgttaa gactaaaata ccaacttcac gtgaactcga ccatgtgaaa 480 ctagattata tattcgaatt caatgaagaa gaaagtacaa ttaagtgcaa attttgtacc 540 gaagcgaaag tagatggaga tttttccatt gggaaaaagt gggatgattg gaaattggat 600 tatctgaaac gtcatctaaa tcacaaaaca cacattgaag cggttgaaat tcttacagct 660 cgtagatgtg gtggattagt tcgactgatt actgagactc ctaaagaccg tgaggtacga 720 attagaaaat gctggaagac aaaaggcaaa tccaaaaatg gttaaagttt taattgacaa 780 cgtaattttg gcaataaaat tgaacacttc aatgaattct gtgcaggaaa taaatagcca 840 cgtatcaaaa tatgttcaaa ttcctgaaag ttggcgtagc aaaaattatg cctttgaatt 900 cgtggaatgc ataaaccaag tggttcagaa tgaagatttt gatgctatta gaaaatcaca 960 atatcattgc ttaataattg atgagagcac agatattttt gttcataaaa tgcttataat 1020 ttacataaaa ttcagaacaa actttgatta caaaacctct tttgcaacta taattcagtt 1080 aagtgcttgc gacgctttat caattgtggc agctatcaaa aaattctact ctgattttaa 1140 gttggatatg gataaaatgg taatgatgac atcggacgga gctgcggtta tgctgggtgg 1200 ttataatgga gtagcaaaga tactacaacg agatattccg tatttatcag ctcagcattg 1260 tgtggcacac agagaagacc ttggaattga cgatgcttgt gaggggaata cctttaatga 1320 aaactctaga gactttgtta agaacggtgt acacacaatg ttcagtagat catctgtgga 1380 aaaaaggcaa atttcaagat ttagctgcag taatggaatg cgatgttttg agttttagac 1440 ctctcaatga gggtaaggtg gctttcacgg cactttgcag tgattccatt tattcgaaat 1500 tatgatgttt tgattgagta ttgcaaaggt gaagttgaaa attcaaatga tccaatctct 1560 aaatattgtg ttaaggctct tactaattct ttgaaccgat tagctttaac agtgctaaat 1620 gatattttaa ctgaattagc taaaaatgag caaatttttt caaaaaagcg tgctttctcc 1680 tatggaagct catcaatacg taaaatccat tattaaaaag attcggagcc agtatcttgg 1740 agaaactatt ttttggagcg aagaagcaat gaaacttctc aacacaagcg ataatactgc 1800 tgaaattatt gaatttatca caaagctatg tgatcatcta gattgcagat ttccagaaga 1860 agaaatgaaa gactgggctg catttgatca tataggaatc gcaaacagca acaaatttta 1920 attttggaaa ccaaaacttg ctgaatcttc ttaacaaata ctcgaaaatt attccaaatt 1980 atagtgaaga agtgaagaag caaattctaa gccaatattc gaattttaaa tttttatacc 2040 aaaccaaatt taactctaac gtcgtaagca gtttcgcgga tatggtagac cttgccaaca 2100 aggaagagga ggagtttcaa gagctttcct ctttaataga catttctgca acctttcaag 2160 catccagtgc agattgcgag cgtgggttta gtcaaatgaa tgccattaaa acaaaggcca 2220 gaaacagatt ggagacattt catttggacc atctcatgcg tatcaaatta tttttaaggc 2280 gtggaaacga aattgatgta caaaaagttt tcaatttttg gaaaaataat aaagatcgac 2340 gtggtaatta atctaaaatg taatttcttt ttctaataat aaaaaaattc ttataaaaat 2400 aaaaagtctt aatttaccaa aacaaaattt tctcttgcgc acaatgtggt tacttttcaa 2460 aatatgggca cagaacgtta ttttttcgcg cagcgtgaaa aaaaataaga gggaacgttg 2520 // ID DNA2-8_AP repbase; DNA; INV; 765 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA2-8_AP. XX NM DNA2-8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-765 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1941-1941 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 2 bp TSD. Putative mariner. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 765 BP; 290 A; 120 C; 90 G; 259 T; 6 other; cagtagaaac cgtttataat gacgttcaag ggaccaagga aaataggtcg taataaccga 60 tngtcattac naccaaaatg actaaaatta aattataact tcaatanata ttattattat 120 tatttattta cgtaatgtat aatatagaaa aaaagtaaag ataaaaacat ttaattatgt 180 acatattatt taaattaatt aattcaatta ttatttttaa aaaaatctgt tatttcattt 240 tgaaactctt gttcgatttt gtctcaagct ttccatatgt tagaaatcgt atagagtgac 300 ctaccccaaa ttcattaggt atgtgggaat ttttttcacc ttttttttta aacgaaacat 360 gacatttgct ttttcttcaa ttgtggattg ctttcgtttg cacattttac aatttcaaaa 420 tatttaacac aagaacacaa aacacattta atctcacgac gattacccgt accctgatga 480 taaaataaaa taaaataaaa ctgacgacaa aacaaagacg aaattgtcgg tattatcggt 540 tgtaatagta ccttcaataa agtaaagtca ttatagataa aatatttttc caatataacc 600 aaaaataacc gtcataacat ccgtatagtc attatataga cgtcataacn accgatacaa 660 attagtacat acaacttacc tangcgtttt tcagggncct ttaatttaag gtcataaaac 720 ccgatatgtc actacaaccg gtgtcattat aaacgatttc tactg 765 // ID Gypsy6-LTR_Dya repbase; DNA; INV; 1356 BP. XX AC chrX; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy6_Dya; KW Gypsy6-I_Dya; Gypsy6-LTR_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-1356 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1061-1061 (2009). XX DR Genome; chrX; Positions 20659169 20657814. XX SQ Sequence 1356 BP; 426 A; 253 C; 247 G; 430 T; 0 other; tgattggaca ccccgaccta tctatatagg tatagatcgg tgcgcagtag aacacctaca 60 actaacttcg caccgcagga tgagcacaag ccgcaccagt tacttttccc agtcaggatc 120 atttatggaa acccccaaaa cctggatcgt cagcggataa gggattagga taaaagtagt 180 gcaggttgct agaccatacg gagtgtgttt ttgcgagagt gtgttttatc gtaaccgccg 240 tcatgtagcc ccgtaatttt tatacttcaa ttaatctccg ctagcatcca tcatttcacc 300 gctggaaggc ctttcaacgc tggaactgga cacatgtgag ttcgtaaagc taagagtaat 360 ttcgtatttg tcgtagagta tgccatacct acattcttac aagaacattt caaatgaata 420 taaagattag ggggtgactg aaaagaaaac aaacaaatta tatatctaaa ccttgaaatt 480 aaacgaaatt taaatactat agagcccgtt acgaaagaaa atttttttta atttttattt 540 ttttcataac gcgactccat atggcctagg attcttattc aaaaaaatat ttctaacctt 600 atgcatagtt cgtttgatag aaacctcaac atttaaccat aaatattaag ataagacatc 660 acaaaatgaa ttaaatgaaa tacttttaga tctaagaaaa tgtactgaaa ggaaatttcg 720 atagtagttt aagtaaagga aggagatttt tttagttttc ttactttgtt ttgattaagc 780 gctccatgaa gttgcataaa tgaatactat tactacatct tatttgttcg gccatccagc 840 ctcttttgta aataaattta ttatatgaac ttaaaacaaa tcatgaattg ctaaacctgt 900 aattatcctg atttgacatg gcagcttaaa taaaatcgga agtctctata gggaattttc 960 aaagatgcta tcgtgaagat tatggtcggg ttgggtagag aaagcgaggt tcaaattcca 1020 ccaaagacct tgctaccgtt tagtcttgct gtcgtctgta acgttcttct tttttttttt 1080 tcactttgcg agttccgttt tattttttag ttttcttctt ttagtatgga atgtacaagc 1140 ctagtaactt atagtttccg taggaatcct aacaattaaa gtatgtgtgg accgttatga 1200 aaggcaatcc gacccacccc atcgccgaaa gagctaaggc accagaggat tccgcctgcc 1260 attattcata atcgagtaga agagtgagag cacgaatcac cagtgtttgg gtactgttat 1320 acccctcggc ttctcatgct taagtcaacc attaca 1356 // ID hAT-2_AP repbase; DNA; INV; 3473 BP. XX AC Contig35933; XX DT 24-JUN-2009 (Rel. 14.07, Created) DT 24-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-2_AP. XX NM hAT-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3473 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1364-1364 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX SQ Sequence 3473 BP; 1283 A; 530 C; 550 G; 1110 T; 0 other; ggtacttttc ggttatcgta ctccaacggt cggccggcag gtggcggccg cggtcgggat 60 ataccgattc cgttatcgta caccattaaa tttatctcaa tattgtcacg tttcattgcc 120 aaaaatataa tttattttaa tagaataata taataatatg aaaaataata attttaaaat 180 aaaataataa taaaaaaaaa ttataataaa atatgaacac caaaaaaatt attgtaaata 240 caaattatta aataatatta tgaaggaggc atgaatttaa caatgtcgtt tataatttga 300 gtaaatatgc agatggtttt tttatgtttt tgttgcataa atcataataa taataataat 360 gattaattta aaaaaaatta ttataatata agtaatacaa ttaatataaa gaataacaat 420 ttatttaata taaacaaata ttttcttaat atatttaagt aataactcaa acacattgaa 480 taaaaatagt atttaagtac cactttagat acttttataa aatattttac ctaaaaaaat 540 gaacacaacc aaccttttaa cagttacaag tacagttaaa attttaaatt atgaataatt 600 aatatcaaat aggtaaataa taatacttct aaattatagg taaatgaatc aaattctgag 660 attattgatg atccagaacc ctcaacatct atggaccaaa atccgattag acaacaatct 720 acttcagtag caacaacatc aacagcattt ttaagaccac ataacaaaca gcaaagctct 780 attgcaaact tcatccataa gccgatacct atgagcaaat caaaaatgat cgatcaacag 840 ttaatcaaaa tgattgtgaa ggaataccat ccgtttagtg tggtggagga tgaagaattt 900 cgtaaattta ttaaaatgtt atgtccaact tacattattc catcaagaaa aaccgtaacc 960 caaagtttgt tacctcagat gtttgatatg actcttgaat gcgtgaagga tagattaaaa 1020 aatgtggaag ctgtgtgttt gaccacagat gggtggactt ccaggaccaa ccaaagtttt 1080 atatccgtaa cagcacattt tattgaccct aaaaacgata cagttgtttc atctgtatta 1140 ttaggctgta tcgagtttaa ttaaaaacat acaagtgaaa atttgtctcg ttttttaagg 1200 aacctagttg aggaacggaa tttattatat aaacttactg ctgtagtaac tgataatgct 1260 gctaatatta aatcagcaat tagaaactgt aactggagaa ggctatcttg ctttgcacat 1320 tctataaatt taattgtcca atctagcttg aaatgtatag attctactct gtcaaaggta 1380 aaaaacatag tacagtattt taaaaaaagt tcgcacgccc tggcaaaatt aaatgactat 1440 caaaaacaac ttggttcacc tattttaaaa cttaaacagg attgcccaac gcgatggaat 1500 tcgacaaacg acatgatcaa tagaataatt gcaattaaag attcgattat tgccactctt 1560 gcagttttag gtaattcaga gttgaactgt ctaagcccac aagactgggt tatattggaa 1620 aatgcccggg acattttgaa aatcttttat gaagtgactg tcgaaataag tgccgaaaaa 1680 tatgtaacta tatcaaaaga aatcattttt attaaaacat taaataagtt tgtttttaat 1740 tttaataata acaatacatt accaaaagaa attaattcca tgtgtcaggt tttgaaggat 1800 gagttgtatg caagatttgg taaatatgaa gagaacccgt taattagcaa gcaacattat 1860 tggatccgag gttcaagaaa tttgcacatt ctccaatgaa aatcattgta agaatgcagt 1920 taatttgtta aaggctaaag cacaaagtat tatattagaa tcagacgaac caatacatca 1980 acaagttgct acatctacct ctgctagtaa ttcaagttca atgctgtgga aagagtttga 2040 tgaaactgtt gtaaatttga ttggaggatc caactcgtca gtggcaggta taattgaagt 2100 tgaaaagtat ttgaatgaac cattgatcaa ccgtgcagaa aatcctttgg tttggtgggc 2160 tgagcgaaaa aatgtgtacc cacgattgta cagattaact aaaaggagac tgtgtattat 2220 ggcaacatca gtaccatgtg agagaatatt ttcaaaagca ggacaagttg taaatgaaag 2280 aagatctcga ctcaccacat caaaaatatc acaaatatta tttcttaacc acaacatgtt 2340 gtgatttata taattatatt tattgtatac taatagttat tagttgatag ttatattatt 2400 ataatactta ttattattat tttattattt tcattaaaac aatgtaaaga ggagcaaaac 2460 aaaaatgtgt aaatattaaa taataaatat atcattattt attataacct tgttatcaaa 2520 aacaaaaact cgcgtctcgt tcagtctcgt tcaatcaacg gttcacacgg ttcgcgtgag 2580 tatatgattc gggaatcggg gagatagtga cagtgtgaac caagaaccaa tacgaaaaaa 2640 aaaatcaggt gatccggatc acctcggttc ggcgatctgg atcatatcgt tcactacttt 2700 gacccgttcg ataagatccg ttcgttcgcg aacgacacat ctctactcag ttccttgatc 2760 gactatggct cgtttcaact acgatatgtt atgttcggtg tgcaatgacg aagaaagtgc 2820 tatacgtttt ttacaagacc acggagtgct tcatcgtaat cgtacttgta aaaacggtca 2880 tgatatgttg atacggtacg ggtctaaacc tctgtggcga tgctgccggc tctattataa 2940 tatcagattg ctggaagagc tacgattctt taaagcataa taatgcattt cagcatatgc 3000 aggtcaatca caaatataat tttgtagacc cagataccgg ggcacacaca caaaatattg 3060 agcgtttatg gcgatcggct aaagaacgaa ataaaaaaca ttcgggaacc catcggtcaa 3120 tgttggattc tcacataagt gaatttctta aaaaaaacca tctgcatatt tacccaaatt 3180 ataaacgaca ttgttaaatt catgcctcct tcataatatt atttaataat ttgtatttac 3240 aataattttt ttggtgttca tattttatta taattttttt ttattattat tttattttaa 3300 aattattatt tttcatatta ttatattatt ctattaaaat aaattatatt tttggcaatg 3360 aaacatgata atattgagat aaatttaatg gtgtacgata acggaatcgg tatatcccga 3420 ccgcggccgc cacctgccgg ccgaccgttg gagtacgata accgaaaagt acc 3473 // ID Gypsy-236_AA-I repbase; DNA; INV; 6281 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-236_AA_; KW Gypsy-236_AA-LTR; Gypsy-236_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6281 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1073-1073 (2011). XX DR [1] (Consensus) XX CC Positions [3525-4163] - Reverse transcriptase CC Positions [5241-5714] - Integrase core CC 'ATTGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1961..3502 FT /product="Gypsy-236_AA-I_1p" FT /translation="METFRPVKQFDAKVDQSQLATEWRKWKRSLEYYLAAS FT GIAGQREKRNQLLHLGGPDLQDIFDNLPGVHDVPHVTPDPPFYDVAVQQLD FT SHFQPCRRRTYERHVFRQISQLTGERFGDFVMRLRVQASRCDFDQEGNSVM FT ESMIIDQIAEKCISSALRKKILEKDRPLDEVVAIGKTIEDVEQQCKEMVQR FT EHDDKPLTVVNKVNQRGLPPAKTNFQPANNRSFQYQPRSQQGFRRPNFRDQ FT SRFRFEAKRNWTSSGNPPTSGAEGGHQSINDNRICFGCGRRGHLKGSASCI FT ARDAQCLKCRTVGHFAKWCTKRTNEDSLPNPSSAKRIKAVVDSDDNSKMYG FT NSDLPEKEQDICFIMGGNVFRFKIGGVETIMTIDSGAAANIIDLKTWETLK FT KQGAKVNFKPHVDRSFKAYGSSTPISMAGMFIAEIETGNNKVEATFYVAKN FT GLQSLLGDETAKKLKVLKIGYNVGSIQDVSKVFPKIRGVIVEIPVNPDVKP FT VQQPYRRAPFALEKQNCR" FT CDS 3927..6260 FT /product="Gypsy-236_AA-I_2p" FT /translation="MFGISCAPELFQKVMESVVAGLDGVVVYLDDIMVWGR FT NSEEHDQRLKALLGRLQDYGILLNKEKCVFGVCELEFLGHEMSSSGIRPTE FT SRVAAIKSFRLPRNVAELRSFLGLITYVGRFIPSLADKTEPLRRLLRIGEK FT FDWKEEHSQAFEEIKRAVNETNCLGYYDPNDLCIVVADASPSGLGAVLLQQ FT NANGNRRIISFASKSLTEIERKYFQTEREALSLVWAVDRFQLYLLGKKFKL FT ITDCKPLKFLFKERSKPSARIERWVLRLQTYNFDVVYEPGANNLADALSRL FT SVTNPVAYRSSDEECILQLALEGFPSAISVEEVEEETLKDDVLQSVQRGLE FT TGIWNDMSKNFKPFSSELCLARNLLLRGDRLVIPEKLRSRMLEIAHESHPG FT IVVMKRRLRQKVWWPEIDKQVELFVKKCKNCTLVSTLNPPEPLNRTKLPEE FT AWADLAIDFMGPLPSGHNLLVMVDYFSRFTEVIIMKQITANLTVKALHETF FT CRFGMPESIKTDNGPQFISIELQDFCKHFGIEHRRTTPYWPQANGEVERIN FT RSIGKRLKISQETTGADWQWDLRTFVLMHNSTPHSTTGVAPSILMFGRQLK FT DKLPGLLLKGSNILEEIHERDFLRKMKGAEYADLRRGAKPSDICQGDTVVT FT KRIHKENKLSTTFDPEEYLVIDRNGSDITIKSKESGKVIHRNVVHLKKLFA FT DTEHPQSLAGKSEAEMDAGELLVPGQKHTTTEGGETAKVRGNQKSLLDTGT FT EGNATIRPRREVKQPSYLKDYTIDKID" XX SQ Sequence 6281 BP; 1969 A; 1082 C; 1514 G; 1712 T; 4 other; gtggcgtacg aggagaataa tttcgggaac ggtgattgat tgtgaaatta ttccggatag 60 atccattgaa tttcggacgg tttgatttga tagctcaact tgagtgattt ttattttgat 120 cagtatctgg tgtttgagta agtatggtgc aaaaaaaata tctttggaag tgcattcggt 180 ttgttgaaga agcgaaaagg tcaaataggt tatggtatga atgtgaagga atatgatagt 240 gcaaatgcaa gaaatggcag ttgkttgcag cggaaaagaa aaacatggga aacatagcat 300 gcgaaaagwg tttggagtac tcaaaatggc cgtcgacagt gagcattgta taggatgctg 360 ggaaaawtca tgcatccaaa ttgggaaatg ttgttggaaa caagatggcc aattgtgtaa 420 agctataaag tagtgtatgt tggtaaaggt gtggcgctgg aaaaaaaaat taatcgtctc 480 aattgaaaaa tgtgtttgga acaagatggc caacttcgtg aagctgtaaa gtagtttggt 540 aatggtgtag cagtgagaaa atcatacagg gatgttcggg gctgtgtgtt tgaaagttta 600 ctactacgaa gaagttacag tttactgcta cgaagagaag aaaaattttt gtgattgtat 660 gacaagaaac ataaaatgta gtgatcacca atacatgtga atacttacgt tgttttattt 720 cttattgaaa atggaagtaa gagtgagaac agaaagtgtc gacgagccaa cactgctgca 780 cagatacatt tttgcacacc gttttgctca acttgtgaat gtgtgagtaa atatataaac 840 aaaattacta tatgcacaca ggcgcacact gctcgtttga tgaccagcgt agaattgtaa 900 gtaagatgaa acgagaaaga aatgagaatg caacggctcg gtaagtcaga aacaaatttt 960 ttttatcctt tttggaagac attttaatgg atcgtagcag ttgaaagcag tacaaattac 1020 aattatggaa gacttaattg gttgaggggt tatgcaattg tcaaatcctg agtcaagggc 1080 cgagagccgg attctcagtt gattgatttt cggtgaaatg ggccgtgctc cggattcacc 1140 tatttaaaag ctgttgcatt tccatatgca tgtgttgctt gggtggaata ctagaagtat 1200 acattctaca gcaatctaat aaattgttag atgtagtact gaaattgaca gacagaaaag 1260 tgatcggtaa ttctgaacag ctgtggggaa ttgacaaaaa aaaaaaaaca aaaatcctga 1320 gtcatgggcc gcgagccgga ttctcagtcs acaaattcac ggtgaaatgg gtcgcgtacc 1380 ggattcactg aaagatggtg aaatgaaaaa agaaatgggg acatcctgag tcatgggctg 1440 agtgccggat tctcattggt ttatgttttt ttggttatat tgaatgcgtt tccagtttgt 1500 tgaaaataat gctgatgata gtctatgcag tagtttggaa gcattagtta ttcctacgta 1560 gaagcgctat ttggttccaa tgaaactggg aatgtcgtat ttaaaaatgt tggaccgaaa 1620 ttcgaaatca tggtcgttta ttatttcatg aaatgtacga tgtatgtact agaagagaga 1680 tgacttaaaa tgatcgtcct gtaaaggcaa atacaaatgt gattgttata tatacattat 1740 tgttttttaa ttctgcataa acgattatcc gggtgtatag gaatggttgt ggtgaacgta 1800 tagttcgtgt tttgaatgga tgaagtgctc gtgtgatata gattatccga atgtaaaatt 1860 atttttttat tcttctggta cagtcgatgc actttctaag atattgttgg atgtgatatt 1920 ttttttattt gattgacaat gtaattaatt ttctcagacg atggagactt tcaggccagt 1980 aaagcaattt gatgcgaaag tagaccagtc tcaattagct acggagtgga gaaaatggaa 2040 gcgtagtttg gaatactact tggctgccag tggcatagca gggcaacgcg agaaaagaaa 2100 tcagctgctt catcttggcg gaccagatct gcaggacatt tttgataatt taccaggagt 2160 tcacgatgtt ccacacgtta ctccagatcc accattctat gacgttgctg tgcaacaact 2220 tgattcccat ttccaaccat gcaggcgtcg aacgtacgaa agacacgtat ttcgccaaat 2280 ttctcagctt actggagaac ggtttggcga tttcgttatg agattgcgtg tccaagctag 2340 tcggtgcgat tttgatcaag agggtaattc agtaatggag agtatgataa ttgatcagat 2400 cgcggaaaaa tgtatttcgt cagctcttcg caagaaaata ctagagaagg atcgaccact 2460 agatgaagta gtagccattg gaaaaactat tgaggacgtc gagcagcaat gcaaggaaat 2520 ggtacaaagg gagcatgatg ataaaccgtt gaccgtcgtg aataaagtaa accaacgggg 2580 tttaccaccg gcaaaaacta atttccagcc ggcaaacaac cgatcgttcc agtaccaacc 2640 tcggtctcaa cagggttttc gtcgaccgaa tttccgagac caatcgcgat tccggttcga 2700 agccaaaagg aattggacga gttctggaaa tcctcctact tctggagcgg aaggaggaca 2760 tcaatcgatt aatgacaaca gaatttgttt tggatgtgga cgtcgtggac atctcaaagg 2820 cagtgcatca tgtattgctc gagacgcaca gtgtctcaaa tgtagaactg ttggtcactt 2880 tgctaaatgg tgcacaaaac gaacgaatga agactctctt ccaaatcctt cctcagctaa 2940 gcgcataaag gctgtagttg actccgatga caactcgaaa atgtacggta actcagatct 3000 tccagaaaaa gagcaagaca tttgcttcat aatgggtgga aatgttttcc gtttcaaaat 3060 cggaggtgtt gaaaccatta tgaccattga ctcaggcgcc gcagccaata tcatcgacct 3120 taagacatgg gaaacactca agaaacaggg agctaaagtt aatttcaagc cacacgtaga 3180 tcggtcattc aaggcttatg gttcgtcgac ccctataagc atggcgggaa tgttcattgc 3240 tgaaattgag acgggtaaca acaaagttga agctactttc tatgtagcta aaaatggctt 3300 gcaaagtttg ttgggagacg aaacggcaaa aaagcttaaa gtcttgaaaa ttgggtacaa 3360 tgttggatcg attcaagatg tttcaaaggt atttcctaaa ataagaggtg ttattgtgga 3420 gattccagtt aaccctgatg ttaagcctgt gcaacagcca taccgtcgcg ctccatttgc 3480 gttggaaaaa caaaattgcc gataaacttc agtatctact tgaccgggat attattgaac 3540 gagtgcagca accgtcggct tgggtttctc caattgtacc cgtcgtgaag gataatggtg 3600 agattcgtct ttgtatcgac atgcgcagag caaatcaagc ggttgttagg gaaacacacc 3660 cattacctat tattgaggaa cttttcggag gaatcaacgg tgcagttcgg ttctccaagc 3720 tggacatcag agaagcttat caccaggtcg aaatttctga acgatcaagg gaaatcacaa 3780 ctttcattgc taagcaagga ctgttcaggt aaatggaact gtgtgctgga tggatcaatt 3840 gaaattaaat aaataaaatg aaaatatttg gttcttaagc gtttttattg agttcttatt 3900 tatatttata cagatttaaa aggctgatgt tcggaataag ctgcgctcct gaactatttc 3960 agaaggtgat ggagtcagtg gtagctggtt tagatggagt agtcgtatat ttggacgaca 4020 taatggtttg ggggcgaaat tcagaagagc atgaccaaag actgaaagca ctactcggtc 4080 gcctgcagga ttacggaatt ttacttaata aagagaaatg tgttttcggt gtttgtgagc 4140 tagaattcct tggccatgaa atgtctagct ccggtattcg accaacagaa agcagggtcg 4200 ctgctatcaa gtcatttcga cttccgcgta acgttgcaga gcttcgcagt ttccttgggc 4260 tcataacata tgtaggcaga tttattccgt ctctggcgga taaaactgag ccactacgaa 4320 gactgttgcg cattggagaa aaatttgatt ggaaggaaga acactctcaa gcgtttgaag 4380 agattaagcg tgctgtgaac gaaaccaact gtcttggtta ttatgacccc aatgacctct 4440 gtatcgtagt agctgacgca agtcccagcg gattaggtgc tgtattgcta caacagaatg 4500 caaatggaaa caggagaatc atttcttttg ccagcaaatc attgacagaa atcgaaagaa 4560 aatattttca aactgagagg gaggccctat ctctggtttg ggctgttgat aggtttcagc 4620 tttatttgct cgggaaaaag tttaagctaa taactgactg caaacccctg aagtttctat 4680 ttaaagaacg atcaaaacct tctgcgagaa ttgagcgatg ggtattacgc ctacaaacgt 4740 acaattttga tgttgtgtat gagccgggag cgaacaattt ggccgatgcg ctttctaggt 4800 tatcggtgac caatccagtg gcatatcgat cttccgatga agaatgcata ttacaactag 4860 ctttagaggg atttcctagc gctatatcgg tcgaggaagt cgaagaagag acattgaaag 4920 atgatgttct tcaaagcgtt caacgagggt tggaaactgg aatttggaac gatatgtcta 4980 aaaatttcaa accgttcagt tcagaattgt gtttggcgcg aaaccttctc ttaagaggtg 5040 ataggttagt gataccagaa aaactaagat ctcgaatgtt ggaaatcgcc cacgaatctc 5100 acccaggaat agtggttatg aagcgaaggc ttcgccaaaa agtatggtgg ccggaaattg 5160 ataagcaagt tgagttattt gttaaaaagt gtaagaactg tactctggtt tcaaccctaa 5220 atcctcctga accactgaac cgcaccaagt tacctgagga agcatgggca gatcttgcaa 5280 ttgatttcat gggtccgcta ccgtctggtc ataatttgct cgtgatggtg gattatttta 5340 gccgattcac agaggtgatt atcatgaagc aaatcacagc aaacttaaca gtgaaagccc 5400 ttcatgaaac gttttgtcgt ttcggtatgc cagaatcgat aaaaaccgat aacggtcccc 5460 agtttattag cattgaactc caagatttct gcaaacattt tggcatcgag cacaggagaa 5520 caacaccata ttggcctcaa gcgaatgggg aggtggaaag gatcaaccgt tctatcggaa 5580 aaaggctaaa aattagtcaa gaaacgaccg gtgcagattg gcagtgggac ctaagaacat 5640 ttgtcctcat gcataactca acgccgcatt ctactacagg tgtggcaccg tcaatcctga 5700 tgttcgggag acagcttaaa gacaagctac ctggtctgtt gttgaaaggt tccaacatct 5760 tggaggagat tcatgaacga gattttttgc ggaaaatgaa gggagccgaa tacgctgatc 5820 ttcggcgcgg ggcgaagccg agcgatatat gtcaagggga taccgttgtt acgaagcgta 5880 tacacaagga aaacaaactg tcaacaacat tcgatcctga agagtatttg gtgattgatc 5940 gaaatggttc cgacataaca ataaaatcga aggaaagcgg gaaggtcatc catcgtaatg 6000 tggttcacct aaaaaagcta tttgcagata ccgagcatcc acaatcacta gcaggtaaat 6060 cggaagcgga gatggacgcc ggagaattgt tggttccggg tcaaaaacat acaacaaccg 6120 agggaggaga aacagcaaaa gttaggggaa accagaagag ccttctggac actggtacag 6180 agggaaacgc aacgattcgc cctagaagag aggttaaaca accatcatat ttgaaagatt 6240 acactattga caaaatcgac tagttttgag aaagagaggg a 6281 // ID Crack-2_HM repbase; DNA; INV; 4599 BP. XX AC . XX DT 15-SEP-2009 (Rel. 14.09, Created) DT 15-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4599 RA Jurka J. and Kapitonov V.V.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1933-1933 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 292..1059 FT /product="Crack-2_HM_1p" FT /translation="MAVTLVEIKKMIKDLFTKYKVETEAALKQQENNFINI FT VSANTKILNERLDKVEKNILENAKKISTLATEVEEIKVSLNFHEELIENKI FT KTALDSFIKNKPTHNEIQNNNIELKKINSKLREIEDRSRRNNLRVEGVKED FT DNESWLESETKVKKIFDEYLGIKDVKIERAHRAGKEDIKKHRTIVVKLLDF FT KDKEAILKNSSKLKGKNIFINEDFCAETNRIRKDLREKMKIERQLGKFAYI FT SYDKLIVREWNAKKK" FT CDS 1216..4302 FT /product="Crack-2_HM_2p" FT /translation="METNNNDFEKIINLFETHALHLDDESDPDINYFNEIN FT SPEFETSYFYPSNVSKILKSENKSDYLNAVHINIRSLKKNFENLSNFISEA FT ESSFNLICLTETWCLHADFKNNSNLHLPGFDPVILERSTNKRGGGVLFFVK FT NDLLYKVRSEMCVSDENKEILTIELINKRSKNILISCCYRPPTGRIENFGN FT FLLNNIIKKTDLEKKKNYLIGDFNINSFDYYENQSSRKFFNGLFETGTIPL FT INRPTRITSHSSSLIDNILTTDFFNKSLKKGIIKTDVSDHFPIFFSICVNL FT KKEKQGKLLIKKRLINNNNLNSFREQLSHIDWGYINFNDNINIIYDNFFKT FT FYQVYDSNFPLIEIVLSDKEMSSPWITKGIRKSSKIKQKLYIKYLKSKSEK FT HKQNYKTYKNLFEKLRKNAKKNYYSNLLNKYKHNSKRVWQIMKEVSGKLKP FT NKNLLPNFIQIENKLINDPNDIAFEFNKFFSTVGTKLSDSISFIKDKSVDE FT FETSISLNLNFTELAFNEFEVAFKSLKRNKATGYDDINTNIVIDSYDVIKD FT ILYKIFRASISQGSFPDKLKIAKVTPIFKTGDRTNITNYRPISVLPVFSKI FT LERIMYNRIYSYLVENKILYENQFGFKKNNSTEHAILQLTRNITDSFKDSC FT FTLGVFIDLSKAFDTVNHQILLKKLISYGIKDKTLLWFESYLTNRKQFVYN FT KDSISHLLMNITCGVPQGSILGPLIFLIYINDLHKSSNLTTINFADDTNLF FT LSHKDINMLFTNMTNELKKVSVWFKQNRLSLNIEKTKWTLFYPPLKYQKLP FT HIMPDLLIDNIIIKREKVTKFLGVYIDENLSWRNHIDNTSNKIAKCIGILY FT KARNILSKHQLTQLYYSFIHCHINYANIIWGSTHRTKLKSLYLHQKHATRV FT INFKNRFFHTKQLFKEMNILNVYQLNVFNILCFMFKCRENLSPNVLQNFYC FT MKPKNKYNFRINNSLIAPFCRKSKDQFFVSYRGPYLWNKIVLPNFDFSTQM FT TFASFKQKLKKIIFLIENILIYF" XX SQ Sequence 4599 BP; 1855 A; 620 C; 612 G; 1512 T; 0 other; aggtttgagc attaaacgga gcacacgtgt ttacgaagaa agatattttt ttacatattt 60 gacttttatt ttttgattat ttttatggag attgaaagag tagtagctta actatcgtga 120 atattgtggt tggtgattat tgtggttgat tgtgaatatt gtggttggtg aaattgcgta 180 aagtaatata ataagaaatc aatctacatt tatacattta tacatttaaa atacaaaaat 240 acatttaaaa tacaaaaata aaaaaaatac atctacagat atacatttaa aatggcagta 300 acacttgtgg aaatcaaaaa aatgataaaa gacttgttta caaaatacaa agtggaaaca 360 gaagctgcgc taaagcaaca agaaaacaac tttataaaca tcgttagtgc aaacaccaaa 420 atcttaaacg agagattgga caaagtagaa aaaaatattt tagaaaacgc aaaaaaaata 480 tcaacacttg caacagaagt tgaagaaata aaagttagtt taaattttca tgaggaactc 540 attgaaaaca aaattaaaac tgcgttagat tctttcataa aaaataaacc tactcacaac 600 gaaattcaaa ataacaacat tgaactcaaa aaaataaaca gcaagctaag agaaatagaa 660 gatagatcaa gaaggaacaa tctaagagtt gaaggagtta aagaagatga taatgaaagc 720 tggctagaaa gcgaaacaaa agttaaaaaa atatttgatg agtacttggg cataaaagat 780 gtaaaaattg aaagagcgca tagagctggt aaagaagata taaaaaagca cagaacaatt 840 gttgtgaaat tattggattt taaagataaa gaagcaattt taaaaaactc ttcaaaatta 900 aaaggaaaga atatttttat caatgaagat ttttgcgcgg agacaaatcg aataagaaaa 960 gacttgcgag agaaaatgaa aattgaaaga caattgggaa aatttgcata tatttcctac 1020 gacaagctta ttgtacgcga gtggaatgca aaaaaaaagt aatttttttt ttaattttat 1080 ttatttgttt acattatatt taaatatacg ttatatgcta aagtggtatg caataaatta 1140 atcactaagt cttcctaaaa gactttcaac atgtcgcaca ctattattta ctgccaaatc 1200 taaattatag cgaaaatgga aacaaataat aatgattttg aaaaaataat taatttattt 1260 gaaacccatg cattacactt agacgacgaa tccgatccgg atattaatta ttttaatgaa 1320 attaattctc ctgaatttga aacctcttat ttttatccta gtaacgtgag caaaatttta 1380 aaaagcgaaa ataaaagtga ttatcttaat gcagtccaca ttaatatacg tagcttaaaa 1440 aagaattttg aaaacttatc aaattttatt agcgaagcgg aaagttcttt taatttaatt 1500 tgtttaactg aaacctggtg tttacatgca gattttaaaa ataactcaaa tctccatctc 1560 ccaggttttg atccagtaat tttagaacga agtacaaata agcgaggagg aggagtttta 1620 ttttttgtta aaaatgacct tctatataaa gttcgaagcg aaatgtgtgt ttctgatgaa 1680 aacaaagaga ttttaacaat agaacttata aataagcgtt caaaaaatat actaataagc 1740 tgttgttaca ggccaccaac tggaagaatt gaaaactttg gcaatttttt actaaataac 1800 ataataaaga aaactgatct tgaaaagaaa aaaaattact tgatcgggga tttcaacata 1860 aactcctttg attattatga aaatcaaagt tcgagaaaat tttttaatgg tttatttgaa 1920 actggaacaa tacctttaat taatcgcccg actagaatca caagtcattc atcatcatta 1980 attgataaca ttttaacaac cgattttttt aataagtcgc taaaaaaggg aattataaaa 2040 accgatgttt ccgatcactt cccgattttt ttttctatat gtgttaattt aaaaaaagag 2100 aagcagggaa agttattaat aaaaaaacgg cttatcaaca ataataattt aaattctttc 2160 agagaacaat tatcacatat agattggggt tatattaatt ttaatgataa tatcaacata 2220 atctacgaca attttttcaa aactttttac caagtgtatg attccaactt tcctttaatt 2280 gaaatagttt taagcgataa agaaatgtca tctccctgga ttactaaagg tatcagaaaa 2340 tcatcaaaaa ttaagcagaa gctatacatt aaatacctaa aatcaaaatc agaaaaacat 2400 aaacaaaatt ataaaactta taaaaactta tttgaaaagc ttcgcaaaaa tgcaaaaaaa 2460 aactactatt caaacttgct taataaatat aagcacaact caaaacgcgt ctggcaaatc 2520 atgaaggagg tgtctggaaa acttaaacca aataaaaatt tactcccaaa ttttatacaa 2580 attgaaaaca aattaataaa tgacccaaat gatattgcat ttgaatttaa caagtttttc 2640 tccactgtcg ggacaaaact aagcgatagt atttcattca ttaaagataa atctgttgac 2700 gagttcgaaa catcaataag tctaaatcta aattttactg agttagcttt taatgaattt 2760 gaggttgctt ttaagtcatt aaaacgaaat aaagctactg ggtatgatga tattaacaca 2820 aatattgtga ttgattcata cgatgttata aaagatattc tttataaaat cttcagagca 2880 tccatttcac aagggtcttt tccagataaa ctaaaaatag cgaaagtaac tccaatcttc 2940 aagacagggg atcgtacaaa tataacaaat tatcgcccta tttcagttct tccagttttt 3000 tcaaaaattt tagaaagaat aatgtacaac cgaatctact cttatcttgt tgaaaataaa 3060 attctatacg aaaaccaatt tggtttcaaa aaaaataatt caaccgagca tgccattctt 3120 caacttacgc gcaatataac cgactcattt aaagactctt gttttacact aggagtgttt 3180 attgatttgt ccaaggcatt tgataccgtc aatcatcaaa tcctgttaaa aaaactaata 3240 tcatatggta ttaaagataa aaccctatta tggtttgaaa gttatcttac taaccgtaaa 3300 cagtttgttt acaataaaga ttctatatct catctattaa tgaatataac atgtggtgtt 3360 ccacaaggat ccatacttgg tcctcttatt tttttaatat acataaatga tcttcataaa 3420 tcatcaaatt taacaacaat aaattttgca gatgacacaa atttgttttt gtctcataaa 3480 gatattaata tgctttttac aaatatgaca aatgaattaa aaaaagtttc tgtttggttt 3540 aaacaaaata gattgtctct aaatatagaa aaaactaaat ggacactttt ttatccacct 3600 ttaaaatacc agaaactgcc ccatatcatg ccagaccttc taattgataa tataataata 3660 aaaagagaaa aagttactaa atttcttgga gtatatattg atgaaaacct atcttggaga 3720 aatcatattg ataatacctc caacaaaatt gctaaatgca tcggaattct ctataaagca 3780 agaaatatat taagtaaaca ccagttaaca caattatatt attcatttat acattgtcat 3840 ataaactacg cgaatattat atggggaagc actcacagaa caaagctaaa atctctttac 3900 cttcatcaga aacacgcaac tcgcgtaata aattttaaaa atcggttttt tcacacgaaa 3960 caacttttta aagaaatgaa tattttaaat gtatatcaac taaatgtctt taatatttta 4020 tgtttcatgt ttaaatgtag agaaaatttg tcgccaaatg tattacaaaa tttttattgt 4080 atgaaaccta aaaacaaata taattttcga attaataaca gtcttattgc ccctttttgt 4140 cgtaaaagca aagaccagtt ttttgtttct taccgaggtc cttatctctg gaacaaaata 4200 gtcttaccaa attttgattt ttcaacccaa atgacttttg cttcattcaa acagaaactt 4260 aaaaaaatta tatttttaat tgaaaatatc ttgatttatt tttaattgcc tttttttttt 4320 gttttgtttt tgttgttgtt ttattttgta tattatttgg tttaaaattt tatcttgact 4380 ttttaacatt ttatcttggt ttttaaatat ttttgcatag aggtgtatta ttatttcact 4440 acagacattg tatttttatt aaaagttaac catatattgt aaagggcttc atgacaagat 4500 cctcatgatc ttctagaagt cctgtcgtta ctaatgtaaa aaattgtaaa atatatttag 4560 ttattgtaat attaaacggc aaaaataaaa aataaaaaa 4599 // ID BEL-39_AA-LTR repbase; DNA; INV; 662 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-39_AA_; KW BEL-39_AA-I; BEL-39_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-662 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 870-870 (2011). XX DR [1] (Consensus) XX SQ Sequence 662 BP; 243 A; 110 C; 126 G; 183 T; 0 other; tgttaacaat tagattagta tggagaaaca tttgaattgt tatgaattta aaaacctaaa 60 atcagatatt acaaaaagtg tattaaaact taaagctagg attaaatagg ttaaaaaaat 120 taaactgtta tacacatcac agacacaaaa cactgaattt ctgtattacc acacatgcaa 180 tattgataac acagataaca ttgatagcac acaacacaga tgtagaagcc gatgatttat 240 atacaagact gtatagggac aatagcttac cggcaaaaca aacggcagtt agtagccaat 300 cgatcgtcag gcaataaaga acatttttcc ttaccgtcgc gtattcaaaa taaaattata 360 ggttctagtt tcattactaa aaattaaagt gtagtttttt ttgcctaaat caaagtgcgt 420 gaaattattc ctgaagaaaa cgactttcaa cggtgattag tgcaaaaagg tgtcacgtgt 480 cgctattccc gtgcatgcta ttcgggttaa ggagagtgtg ctcgacagtg gtgaagaaat 540 tgcacttaaa aagaaagcaa caaactgtgg ttcggatcgt ggattgcacg gaatcgttca 600 gatccttatt tagagctttg tggtgggcct tcccaaccca cgaaccgtgg agaataagaa 660 ca 662 // ID Gypsy-38_OD-LTR repbase; DNA; INV; 223 BP. XX AC CABV01001283; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_OD_; KW Gypsy-38_OD-I; Gypsy-38_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-223 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01001283; Positions 1353 1131. XX SQ Sequence 223 BP; 60 A; 61 C; 28 G; 74 T; 0 other; tgtagtgtct ttgcataagc cactccataa tactgaccca aatgtgccat aacaatttcc 60 agcgcagcgc gcaccgctgc acatgaccta ccttttatcc ttatttaacc gctctgcctt 120 tccttttcac tattcagttg tacttgccca acaacttgat aaataaacta ccaacgagac 180 ttttgacttt gtctctttta ttatcttatg tagaaacact aca 223 // ID BM1 repbase; DNA; INV; 445 BP. XX AC X03547; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Bombyx mori Bm1 repetitive DNA element. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; BM1; KW Bm1 repetitive sequence; Repetitive sequence. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-445 RA Adams S.D., Eickbush H.T., Herrera J.R. and Lizardi M.P.; RT "A highly reiterated family of transcribed oligo(A)-terminated, RT interspersed DNA elements in the genome of Bombyx mori."; RL J. Mol. Biol 187(4), 465-478 (1986). XX DR GenBank; X03547; Positions 1 445. XX SQ Sequence 445 BP; 126 A; 96 C; 104 G; 119 T; 0 other; gaactcgtcg tggcctaaag gataagacgt ccggtgcatt cgtatatagc gatgcaccgg 60 tgttcgaatc ccgcaggcgg taccaatttt tctaatgaaa tacgtactca acaaatgttc 120 acgattgact tccacggtga aggaataaca tcgtgtaata aaaatcaaac ccgcaaaaat 180 tataatttgc gtaattactg gtggtaggac ctcttgtgag tccgcacggg taggtaccac 240 cgccccgcct atttctgccg tgaagcagta atgcgtttcg gtttgaaggg tggggcagcc 300 gttgtaacta tactgagacc ttagaactta tatctcaatg tctgtggcgc atttacgttg 360 tagatgtcta tgggttccag taactactta acaccaggtg ggctgtgagc tcgtccacac 420 atctaggcaa taaaaattaa aaaaa 445 // ID Gypsy-7_BM-LTR repbase; DNA; INV; 192 BP. XX AC nscaf2766; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_BM_; KW Gypsy-7_BM-I; Gypsy-7_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 990-990 (2010). XX DR Genome; nscaf2766; Positions 734507 734698. XX SQ Sequence 192 BP; 46 A; 45 C; 39 G; 62 T; 0 other; tgaaactacg aagtagcaac actgtggccg cctatttaag gagcagtgtt gactgtgctc 60 gctctcttat ctcgccgact gttgtagcgt gcggaagctc tgtccgaact ctctcttgtt 120 attactgtct aataaggcct gccggccaat aaattgtttt taaaaaccta aaactcgttt 180 ctgtttatta ca 192 // ID Gypsy-206_AA-LTR repbase; DNA; INV; 214 BP. XX AC supercont1.44; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-206_AA_; KW Gypsy-206_AA-I; Gypsy-206_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.44; Positions 1394999 1395212. XX SQ Sequence 214 BP; 86 A; 30 C; 42 G; 56 T; 0 other; tgtagtagtg acaaacgctt gagaactcgc tcagagaaaa caagagatag catgaagaga 60 gaaaagggaa tttagaggaa gttcagctat gtaaggaata ggtgaaatat acatgattaa 120 aagttttgtg actaccatcg aaacaagacg tgtttttact acaatttaat atagaacaga 180 aatctttaaa cctctcactt tacaagtcat taca 214 // ID Gypsy-53_AA-LTR repbase; DNA; INV; 242 BP. XX AC AAGE02021081; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-53_AA_; KW Gypsy-53_AA-I; Gypsy-53_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-242 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021081; Positions 22858 22617. XX SQ Sequence 242 BP; 81 A; 51 C; 41 G; 69 T; 0 other; tgtgtcctta tctgaattat cctattcacc aaatgctata gaaccccatg ttcgatgttt 60 cgttaggaga ataagccctt ttgggttttc ggtttaaacc tgttccacaa cggaacggtc 120 gaaaacagat ccaaaaagaa acaaaactaa aaatatacga ggcgaaggtc tcattcgatt 180 atcggaattc tacatctaca cttcactaaa gcaatcacgg tcttgctata aattagagta 240 ca 242 // ID L2-2_Cis repbase; DNA; INV; 4503 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE CR1 Non-LTR Retrotransposon from Ciona savignyi. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-2_Cis. XX NM L2-2_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-4503 RA Smit A.F.; RT "L2-2_Cis - CR1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000222; two ORFs (66-932, 955-4173); < 1% diverged copies. XX FH Key Location/Qualifiers FT CDS 100..921 FT /product="L2-2_Cis_1p" FT /translation="MSPATPKKPAATTTLIAKMDNVLEQVCANKQELNSKL FT NAVLTELQDIKQSQAFLSKKYDALLNQSNKTNEEMKNMKLENATLKNDIEE FT MKNEVKLCKKSINALEQYGRRDCLEIKGVQYNDQENTDEIVIAVAKSIGVN FT ITRSDISISHRLGPFRTATKNNQPTTIIAKFTSRKSRDNVYNNRSKLHKPY FT TPGSQQSRIYINESLTKENRRLFNNCLQFKKSNNYKFLWTRNGVILLKRDD FT ASKTITITDEEDLQRNEIPINITSNHPVTSNIV" FT CDS 965..4162 FT /product="L2-2_Cis_2p" FT /note="PHD zinc finger, endonuclease and reverse FT transcriptase." FT /translation="MADKIISQQCLMCQECCTDNQDSICCDQCNNWMHLSC FT TSLTKSQFESLGCSQDPYFCSLCFHNTQSQNVSPNNNVNKSFSESNSLHAC FT EDDDDNLSFNGNSLYYGQDRLNGLMMSCASHVSIFHFNIRSLAKNKYRIEE FT FITSLPTLPYFIGITETKISKLSLNNYEIDQYNFIHNDSLTHAGGVGLYIR FT NDISYLIRSDLELEIDHCESLWIEVKIKKISSLIIGVIYRHPCYNFQNFQN FT KLCETLSSTKVINKDYLLCGDFNINLLSSNRQESVANFEQSLKSIGCTNII FT TQATRFSYNSIPSLLDHIYTNILTELTGGICLFDASDHLPTFCIIYGIQIS FT YTNSSKYVKIRCMKNFVVEQFLNELQNIMSVATSEKNNTVDFLFERFVNNF FT QYVLNKHAPLCNLSRKQKKNLYKPWITKGILKSIKHKNCLFRACYKHNDIE FT QILYYKKYTNHLTRIRKLSKQQYYFNIFHENRYNPGKTWKTINEIIAHRNV FT SKHSISEILLDNGTKTDEPEKVCEVLNNHFASIGKTMASTISDPSVKFSSF FT VHSSVHSFYLEPTTSEEVSQCISNLKINNSCGPDNISSKFVKMANSIISPI FT LAELFNKSMDQGAYPKCLKISQIIPIPKCRSPKTVNDYRPISLLPTFSKIF FT EKIIHFRLYYYFNKYDLLTPCQYGFRTNHSTELAVTSIHEQMLSNLDSGKI FT TCSIFLDLQKAFDTIDHSILLQKLFLYGIRGVQNDLFKSYLSGRLQCTTVG FT NIRSLYKSVDCGVPQGSVLGPLLFSIYVNDLPTSSNFQTTLFADDTNLHMC FT DFSFSNLQYRVNLEVNKVDVWLRANRLSLNHSKTVFMLIHNHKNNYQFNVV FT INKKPVQQTNNIRYLGVILDHKLKWNMHIMNLRDKLSRVCGVFCKLRRYVK FT QNTLRCVYFSLFHSHLQYSIINWGRANTSAIHPLEMLQNRALRHILFRSNR FT DHASPLYNELKIMKXHDIFIHEIGKFVYKFSNKSLPSYFNHFFTPLDAMHK FT HATRQKDIGTFYHPRVRTNYGKSMLQYLGPCVWGEIPNDYKCMSFNLFKLR FT FKQYLIQMY" XX SQ Sequence 4503 BP; 1646 A; 715 C; 680 G; 1461 T; 1 other; attctatttg aataaggtct tagattaaga aaaagtacta tttgaataag ttgataatat 60 ttgaataagt acaacttaaa taagccttgc aaaaaaatca tgtcgccagc aacaccaaaa 120 aagcctgctg ccacaacaac actcattgcc aaaatggata atgttcttga gcaagtttgt 180 gcaaacaagc aagagctaaa ttctaagttg aatgctgtcc tcactgagct gcaggacatc 240 aaacaaagcc aagcatttct ctctaaaaaa tatgatgctc tattgaatca aagtaataaa 300 acaaatgagg aaatgaagaa tatgaagttg gaaaatgcaa ccctaaaaaa tgatatagaa 360 gaaatgaaaa atgaagtaaa attatgtaaa aaatcaatta acgcactaga gcaatatgga 420 agaagggatt gtttagaaat aaaaggagtc caatacaatg atcaggaaaa tactgatgaa 480 attgttatcg ctgtagcgaa aagtatcggg gttaacatca caagatcaga catcagcatt 540 tcacatagac ttggaccatt tcgaactgct accaaaaaca atcagccaac aaccatcatt 600 gcaaagttta ctagtagaaa atcccgtgat aacgtataca acaatagatc aaaacttcac 660 aaaccataca ctcctggatc ccaacaatct cggatatata taaatgaaag tctcacaaag 720 gaaaaccgaa gattattcaa taactgtctc caattcaaaa aaagcaacaa ctacaaattt 780 ttatggacaa ggaatggagt aattttattg aagcgggatg atgcaagcaa aacaatcacc 840 attacagatg aagaagacct tcaaagaaat gaaattccaa tcaacataac cagtaatcat 900 ccagtaacaa gtaatattgt ttaattaaat tattaaaact ctctaatata gtatttacaa 960 aataatggct gataaaataa tttcccaaca gtgtctgatg tgtcaggagt gttgcactga 1020 caatcaagac agtatatgct gtgaccaatg caataattgg atgcacttat catgcacttc 1080 cttaacaaaa tcccaatttg agtcacttgg ttgttcccaa gatccatatt tttgttcttt 1140 atgttttcat aatacccaat ctcaaaatgt aagtccaaac aataatgtta ataagtcatt 1200 cagtgaaagt aattcattac atgcatgcga agatgatgat gataatttaa gttttaatgg 1260 caatagctta tattatggac aagataggtt gaatggttta atgatgtctt gtgcttcaca 1320 tgtatctatt tttcacttta atatccgtag tctagcaaaa aataagtata ggattgaaga 1380 gtttattaca tcattgccta ctttgccata ttttattggg atcacagaaa ctaagataag 1440 taaattatct ttaaataact atgaaataga ccaatataat tttattcata atgactcttt 1500 aacacatgct gggggtgtag gactttatat cagaaatgat atttcatatt tgataagatc 1560 tgatctagaa cttgaaatag atcattgtga atcactgtgg atagaagtta aaataaagaa 1620 aatatcatct ctaataattg gggttatata tagacatcca tgttataatt ttcaaaattt 1680 ccagaataaa ttgtgtgaaa ccctatcatc cactaaagta attaacaagg attatctctt 1740 atgtggagat tttaacatta atttgcttag ctcaaataga caggaaagtg ttgcaaactt 1800 tgaacagagt ttaaaaagca ttggttgcac taatataatt actcaagcca ctagattttc 1860 atacaactca attccctcac ttttagatca tatatatacc aatattttaa cggaactaac 1920 agggggaatt tgtttatttg atgcatctga ccacttacct acattttgta ttatatatgg 1980 catacaaatt tcttacacaa attctagtaa gtatgttaaa attaggtgta tgaaaaattt 2040 tgtagttgag caatttttaa atgagttgca aaatattatg tcagttgcca cttctgaaaa 2100 aaataataca gttgatttct tatttgagag gtttgttaat aatttccaat atgtactaaa 2160 taagcatgcc ccattgtgta accttagccg aaaacagaaa aaaaacctgt ataaaccatg 2220 gattacaaaa ggaatactaa aatctataaa acataaaaat tgcttattta gagcatgtta 2280 taaacataat gacattgaac agatattgta ttataaaaaa tacacaaatc acttaactcg 2340 tattcgcaaa ttgtccaagc agcaatatta ttttaatata tttcatgaaa atagatacaa 2400 tcctggtaaa acatggaaaa ctattaatga gataattgct catcgtaatg taagcaaaca 2460 tagtatatcc gaaatattat tggataatgg tacaaaaacg gacgaaccag aaaaagtgtg 2520 tgaagtctta aataatcatt ttgcaagtat tggcaaaaca atggcatcta caatctcaga 2580 tccgtcggtg aaattttcat cctttgtgca ctcaagtgtg cattcttttt atttggaacc 2640 cactacttct gaagaagtat cacaatgcat tagtaattta aagataaata actcttgtgg 2700 tcctgataat atatcttcaa aatttgtaaa aatggctaat agtattatat cacccatctt 2760 ggctgaattg ttcaacaaaa gtatggatca gggagcctat cctaaatgcc tcaaaatatc 2820 tcaaatcata ccaattccga agtgtcgctc tccaaaaact gtaaatgatt atagaccaat 2880 atctttattg ccaaccttct ccaaaatatt tgaaaaaata atccatttta gattatatta 2940 ctactttaac aagtatgatc tattgacacc atgtcagtac ggctttcgta caaatcattc 3000 cactgaactt gcagtgacct ctatacacga acaaatgctt tctaatttag attcaggaaa 3060 aataacatgc tctatattcc tggatctaca aaaagctttt gatacaattg atcattccat 3120 actattgcaa aaattatttt tatatggtat taggggagtc cagaatgatt tgtttaagtc 3180 gtacttaagt ggtcgtctac agtgcaccac tgttggcaac atcagatcac tgtataaatc 3240 tgtagattgt ggtgtaccgc aaggttcagt attgggccca ctgctctttt ctatatatgt 3300 gaatgatcta cctacctctt ctaattttca aactacttta tttgctgatg acaccaatct 3360 acatatgtgt gatttttcat tttctaactt gcaatataga gtaaacttag aagttaataa 3420 ggttgatgtg tggctgaggg caaataggtt atcgttaaat cacagtaaaa ctgtttttat 3480 gctgatacat aatcataaaa acaattatca atttaatgta gtaataaata agaaacctgt 3540 ccaacaaacc aataatattc gttatcttgg tgttatactg gatcataagc taaaatggaa 3600 tatgcatatt atgaatcttc gagacaaatt gtctagagtt tgtggtgttt tctgtaaact 3660 aagaaggtat gtaaaacaaa atactctcag atgtgtatat ttctctttat ttcatagtca 3720 tttgcaatac tcaattatta attggggaag ggctaacaca tcagcaattc atccattaga 3780 aatgttgcaa aacagagctc ttcgacatat tttatttcga tccaacagag atcatgcttc 3840 tcccttatat aatgaattaa aaattatgaa aatncacgat atttttattc atgaaattgg 3900 aaaatttgtg tacaaatttt caaacaaaag cttaccaagt tatttcaatc attttttcac 3960 gccacttgat gccatgcata aacatgccac tagacaaaag gatattggta cattttatca 4020 tccgagagtg cgtaccaatt atggaaaaag tatgctacaa tatcttggac catgtgtgtg 4080 gggtgaaata ccgaatgact acaagtgcat gtcatttaac ttatttaaac ttcgcttcaa 4140 gcaatattta attcaaatgt attaattacg gacaatgcaa tagtctcaat tgttacacta 4200 tctgtcaatc aacctcttta taactttata ttttgtttca ctgtaacttg tttactgcaa 4260 agtactataa aatgatacgt ttattaaagg ttttggcctg tgcagcattg tatttcgtta 4320 ttttttttta tatttatttt ttttatttat ctaaataaac acaaagggtg cagctgactc 4380 ggtgactttt tggtctttcg ctgcatcctt tgctgtacat tactgtgcat tactgtatat 4440 attgttgtcg tttgtaaaaa tgtgatgtca gcataaatta aataaattga attgaattga 4500 atg 4503 // ID Copia-8_DPu-I repbase; DNA; INV; 4447 BP. XX AC scaffold_728; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_DPu_; KW Copia-8_DPu-LTR; Copia-8_DPu-I. XX NM Copia-8_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4447 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 679-679 (2010). XX DR Genome; scaffold_728; Positions 4927 9373. XX CC Positions [1642-2172] - Integrase core CC 'GGATT' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 337..4248 FT /product="Copia-8_DPu-I_1p" FT /translation="MRYCYGRFLIFNSCDETRKLALLNSRNSHEMWTRLET FT QYLQRAADNKHLLHRDFLNLRPTAGEDIMVHITALESMATELNDLGVNITQ FT HDLITTIMCSLPARFGFLASTWDNIPDNERSMDALRARIVSEQRRIELRRI FT EEQASLPPPQERNNSALQAKGIQRTPFIRNARGGRGRGNGSRDVVNRDTAQ FT CTYCGKSRHYEFECRLRIAQQGEKQPTPPKRPNNDDNGGSRVNFSLISVCC FT LPDGDLKGIVLDSGATRHMCGELSYFTDLCDIPPNSWPINGIGGKILYAVG FT VGTIKMTSFVNGVSVEGELKNVLYVPNLGVTLISIASLSINGYSVSFCNEN FT ASIQKGNTIVMTASRSGEGLYKVKAIVSQYATMGLNADTKSQVTLNIWHKR FT LGHVNNQTVRRMANNIGIGGMQITPGSKSMEECCHGCEVGKMHKLPFANST FT TKYSTVGECVVSDLVGPMQVNSVGGARYYVLFKDVYSKYKTVYFLKHKSET FT ANCFLEYTKMLTTDTGLHVKLLRCDGGTEFINSQMNTILTSLGIKLQTSAP FT YTPEQNGIAERDHRSTVESARSQIHDRGVPLNLWAEAVNHSVYVLNRTLAK FT SQNKTPFELWHGVIPNISNLRIFGSVAYMFIPDTLRQKLDPKATKGVYVGE FT SEEQKASRIFVVATGRTHITRHVKVYENLPYWPGISELPSSTPQTESTIQP FT LLHLETLENDVAPTTVPISIEEPLQERQLTIPLRKSSRGLIPKKLFPMETF FT GSYAAVETSTLPMSCCISIALKGSSLYYEPQTYKEAMGGAESALWKLATDR FT EIAAHMKNQTWTLTTLPNGRICIPSGWDFKVKTDKLGLPCRRKARFFAKGY FT RQVQGIDYQESFSPVVRYDSLRNIIAITAARNLELIQLDVTTVFLNGLIDE FT LVFIAQPEGYVIPGREQEVCRLNKGIYGLCQASRIWNKTLHDALIQYGLIQ FT STADPCIYYRITRTCFLIIAVWVDDGLVAGNSMDVIDDVIRYLNRKFKITA FT VPADLFVGIVIARDRPNQRIYLSIPQFIEKKLVKFRLSDAHPLSLPVLKGS FT PRLSSYSAPSTPTEVATMSTIPFREVVGCIMYAALTVRIDIAFIAGQLAQH FT CQNPGLDHWKAAQRVLKYLASTRNHGLCFGGNDVKDNILIGYSDADYAGDP FT DTRRSTSGYVFILNGGAVTWSSRRQPIVALSTMQSEYIATSDSTREAVWIR FT RLLDNLGSTQMNPTALRCDNESAIGLAYNPLAHKGSKHIDVRYHYIQVTNK FT TIELGYVNTCKQIADVLTKAVDGETFASCLNGFGLGKVPDIIE" XX SQ Sequence 4447 BP; 1241 A; 1044 C; 978 G; 1184 T; 0 other; ggttatgggc ccaggctaaa cgctaagcta tacgtttaac attccaagat ggctcaaaac 60 tattctcttg atgtgatcaa acacttaaaa agatttgatg gaaaagattt cattacgtgg 120 agacacaaca tggaaatgtc gttcacattg aagaatctga gacccatcgt tgaggtatgt 180 aacctaattt aatatattag aagagaatat gctaaattct aattcgctac tcactgttgc 240 aatcttacag ggtttaatca ttgaaccggt tcaacagttc gaggagggag aagataacca 300 gctagtaatc aatctgatga gatcgaagag tggagaatga gatactgcta cgggagattc 360 cttatcttca acagctgtga tgaaactaga aagttggcac tcctcaacag cagaaacagt 420 catgagatgt ggacaagact cgagacgcaa tatcttcagc gtgctgccga caacaaacat 480 ctcctccaca gagatttcct caacttgaga cccactgctg gtgaagacat catggtacac 540 atcactgctc tggagtcaat ggcaaccgaa ctgaacgacc ttggagtgaa catcactcag 600 cacgatttga ttactaccat catgtgtagt cttcctgcta gattcggttt tcttgcatcg 660 acgtgggata acatccctga taatgagaga tccatggatg cccttcgagc cagaattgtt 720 tcagagcaaa gaaggattga actacgtcgt attgaagagc aagcgtccct acctccacca 780 caggaacgta acaatagtgc acttcaggca aagggtatcc aaagaacccc ctttatacgt 840 aatgcacgtg gaggacgcgg acgtggaaat ggatcacgcg atgtagtaaa tcgagatact 900 gctcagtgta cctactgtgg caaatcacgc cactatgaat ttgaatgcag acttcggatt 960 gcacaacaag gggagaaaca accgacacca cccaaacgtc cgaacaacga cgacaatggt 1020 ggttcgagag tgaacttctc tctaatttct gtatgctgtc ttcctgatgg agatctgaag 1080 ggaattgtat tggattctgg agcgacccgc cacatgtgtg gagaactttc ctatttcacc 1140 gatctatgcg atattccacc aaacagctgg ccgatcaatg gaattggtgg gaaaatattg 1200 tacgctgtcg gtgtcggaac tatcaagatg acatcatttg ttaatggagt ctccgttgaa 1260 ggagaattga agaacgttct gtacgtgccc aatctggggg tgactctcat ttccattgca 1320 agtctatcta tcaatggata ctcagtatcg ttctgcaacg aaaatgcgag tatacagaaa 1380 ggaaacacca tcgtcatgac ggcttcaaga tccggcgaag gtctctacaa agtcaaagct 1440 atcgtatcac aatatgccac catgggtctc aatgcagaca cgaaaagtca agtcactctt 1500 aacatttggc acaagcgcct tggacatgtc aacaatcaaa ctgttcgacg catggccaat 1560 aatattggta ttggcggtat gcaaattact ccgggatcca aatcgatgga agaatgctgt 1620 catggctgtg aagtgggtaa gatgcacaaa ttgccattcg caaacagcac taccaaatac 1680 tctaccgttg gtgaatgtgt cgtgtcagac ctcgttggac caatgcaagt caactcagtg 1740 ggaggtgctc ggtactacgt tctcttcaag gacgtctaca gcaagtacaa gacggtttat 1800 tttctcaagc acaagtccga aactgctaac tgtttccttg agtatacgaa gatgcttact 1860 actgacactg gtcttcacgt taaactcttg cgttgtgatg gtggaactga attcattaac 1920 agtcaaatga ataccattct aacatcactt gggattaaac tacagaccag tgctccctac 1980 actccggagc agaatggaat agctgagaga gatcatcgat ccaccgtgga atctgcacgc 2040 agtcaaattc atgacagagg cgttccattg aatttgtggg cggaagcagt gaatcattct 2100 gtatacgttt tgaatagaac tcttgccaag tcccagaaca aaaccccttt cgaactttgg 2160 cacggggtga tcccgaacat ctcgaatctt cggatattcg gatctgtagc ctatatgttc 2220 atacctgata cgcttcgtca aaaactcgac cccaaagcca caaaaggcgt ttatgtcgga 2280 gagagtgagg agcaaaaggc gagtcggatc tttgttgttg cgaccggacg gacccacatc 2340 acgagacatg tcaaagtgta tgaaaatctc ccttactggc ctgggatttc ggaactacct 2400 tcgtctacgc cgcagactga atcgacaatt caacctttgc tacacctgga aactctggag 2460 aacgatgtag ctcctacaac agtgccgata agtatcgaag aaccacttca agaacgacag 2520 ctgaccatcc cacttcgtaa gtctagtcgt ggactcatac ctaagaaatt atttcccatg 2580 gaaacgttcg ggtcatacgc tgctgtcgag acgtccacct tgcccatgtc ttgttgcatc 2640 tccattgcac taaagggatc ctctttatac tacgagccac agacttacaa agaagccatg 2700 ggtggggctg aaagtgctct atggaaactt gccactgatc gtgaaatagc agcccatatg 2760 aagaatcaaa cttggactct aacaactctt ccaaacggac ggatttgcat acccagtgga 2820 tgggatttca aggtcaagac tgacaaactt ggactaccat gtcgccgaaa ggcacgtttc 2880 tttgccaagg ggtaccgtca agtacaaggt attgactatc aagagtcttt ttctcccgta 2940 gtccgctacg actctctacg caacatcatt gccatcacgg ccgcgcgcaa tcttgaactc 3000 atccagctgg atgtcactac cgtttttctc aacggactca ttgacgaact tgtttttatt 3060 gcacaaccgg aaggatatgt catccctggc cgagaacaag aagtgtgccg ccttaacaaa 3120 gggatctacg gtctctgcca agcttctcgc atttggaaca agactctaca cgatgccctc 3180 attcaatatg gtctcataca aagtactgcg gacccgtgta tatactaccg tattactcgc 3240 acgtgctttc tcattattgc agtatgggta gatgacgggc ttgtcgccgg caattcaatg 3300 gatgtcatcg acgacgtcat ccgttatctc aaccggaaat tcaagattac agccgtacct 3360 gccgaccttt tcgtcggtat cgtgatagct cgagatcgtc ccaaccaacg gatctatctg 3420 tctattccgc agttcatcga gaaaaagttg gtgaaatttc ggctgtcaga tgcgcacccg 3480 ctgtcgctac cggttttgaa aggctcgcct cgtttatctt cttactctgc cccttctacc 3540 ccaactgaag tggcaactat gtctaccatc cctttccgcg aagtcgtggg gtgtatcatg 3600 tatgctgcgc tcactgtacg gattgacatt gctttcatag ctggccagct agctcaacat 3660 tgccagaacc ctggtttgga tcattggaaa gccgcccagc gtgttctgaa gtacctagcc 3720 tcaacgcgca atcatggtct gtgcttcggc ggaaatgatg tcaaagacaa catcttaatc 3780 ggatactcgg atgctgacta cgcgggtgat ccggatactc gccgttccac ttccggctac 3840 gtcttcatcc tcaacggagg cgcagtcact tggtcgagcc gtagacaacc cattgttgca 3900 ttatcaacca tgcaatctga atatattgcc actagcgact cgacgcgtga agctgtatgg 3960 atacgtcgtc ttttagacaa tcttggatca actcagatga atcccaccgc tttacgctgt 4020 gataacgaaa gtgccattgg tttggcctac aacccactgg cacacaaagg atcgaaacat 4080 atcgatgtaa ggtaccatta tattcaggta accaataaaa cgattgaact tggctatgtt 4140 aacacatgca aacaaattgc tgatgtcctg actaaagctg ttgatggaga gactttcgct 4200 tcttgtctga acggatttgg tcttggaaaa gttccagata ttattgagta gtctcagtat 4260 ttgtttggat aatcactgtt gtgtttttac actatcattt gtctctttac tctctttgag 4320 atagtactaa ctactcactt aattcagaaa cactattaat taccatttgt gatttgctaa 4380 aattatgtcg ttgtttttct ggtccccatg tgttgtattt cttatgttgg ttttgaatga 4440 gagggtg 4447 // ID Chap3a_Cis repbase; DNA; INV; 258 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; Chap3a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-258 RA Smit A.F.; RT "Chap3a_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000008 16 bp terminal inverted repeats. Charlie-like GTCTAGAC CC target site duplications. XX SQ Sequence 258 BP; 64 A; 53 C; 52 G; 89 T; 0 other; caggggtcgg caacctgcgg ctctttcatc cttctgatgc ggctcttatt aattttattg 60 ctactgtagc gagcgcttaa taagtttaat acccatgcat aagtgaaaca aacagaagaa 120 cactcgcgtc tttatctcac acagtgttgt tttcctttta ttgcgaaaat attattagtt 180 ttgcggctcc cagtgatttt tattttgtgg aatatcggtt aaaatggctc tttaagtgca 240 aaaggttgcc gacccctg 258 // ID LARP2 repbase; DNA; INV; 217 BP. XX AC L42499; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Leishmania archibaldi DNA repeat. XX KW LARP2; Repetitive element. XX NM LARP2. XX OS Leishmania donovani archibaldi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmania; Leishmania donovani species complex; OC Leishmania donovani. XX RN [1] RP 1-217 RA Piarroux R., Fontes M., Perasso R., Gambarelli F., Joblet C., RA Dumon H. and Quilici M.; RT "Phylogenetic relationships between Old World Leishmania strains RT revealed by analysis of a repetitive DNA sequence."; RL Mol Biochem Parasitol 73(1-2), 249-252 (1995). XX DR GenBank; L42499; Positions 1 217. XX SQ Sequence 217 BP; 54 A; 71 C; 66 G; 26 T; 0 other; gcaagaatca agaggcggtg tcacagagac gggcgaaggg ggacggcggg agcgggagag 60 agagcgcggg cacacggcga cgtccgtgga aagaaaaaag gcagaagaca acgcgtattc 120 ccttctgcta atgtgtaccc gcctctctgc cacagatcac gaggccagct ccactccacc 180 ctaacgcctc ccccgcgcag ccctgtcaca cgctccc 217 // ID Gypsy-1-I_HM repbase; DNA; INV; 3706 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3706 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1968-1968 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 68..736 FT /product="Gypsy-1-I_HM_1p" FT /translation="MTSFFAINNITSDQHVAFLLAALPLKTYKTMEELTAP FT EVPEDVSYEELVTKLETLEIRKSSLISRYEFGKIYRGTEESIQTFAHRVIR FT AAQDCKFLPSERDGRLRDQFIIGINDGAILRAILRGDDALSFDDCVKTASV FT VAQTRLDAACLSVNEEVNAIHRTTGTTSYTRRKKEIRCYNCESLNHFAREC FT TSLCKKCLHKHAAKDCYKRQRNQENLNGTSSH*" FT CDS 904..3684 FT /product="Gypsy-1-I_HM_2p" FT /translation="MGGIIRCEEVSAKICHPTNGCLVDVVVLSVNEVINKY FT DMIMGMDLIKTCDGLQIGTGKLKLCNSLIELSSKLRVATIEIGTSKAYFDG FT KKWIVSWEWENEPKMKNRTSQYNVADQHCEAFNNEINEWIKEGILIKYDDK FT IMGKPKGLLPLMCVIQDNKKKIRPVLDYRELNTYVKASTRDADVINDAMRE FT WRLMSKNSNDSVVDLRRAYLQIHVHKRHWPYQTVIYKNQRYALTRLGFGLS FT LAPVMMSSILRYVLQQDSALAKTTRGYIDDTLATNGNGRKLVIHMKKWGLH FT AKAPEYFGENPVRALGLEVTVNPLTKEMFWKRGRELPIIKSVLTKRDIFSL FT CGNLIANLPVAGWLRPACSYIKRCTNEAGWDEPITNKYVIEMMQEVVDKVV FT KCDPAKGVWNVKPNASLTVWCDASSLAKGVVITQGDRKIEDATWLRPKSDA FT MHINVAELDACIEGLNMALKWNPSDVSIKTDSHSVFSWLNSELTRDKPVKC FT AGQCEMLIRRRLQIFEKLVEDYVINVTVTWVPTTKNIADELTRVPTKWIKN FT KVATQECNTIYKYDKQQIRKIHSLAHMGVSKSMELCMRAGIHVTRDTVKQV FT VAECDECNSIDPNVMLWDKGNLAVSKVWERVACDVTHVDSSLYLTCIDCGP FT SRYTLWTPLSDESATMVVQALETIWSTLGPSSEVLLDNSKSFRSQLMQTMA FT AKWEIRLIYRAVEKPSGNGIVERVHRTVKRTNARVKCGIPMSVFLINNVPS FT KHGTPFEIFCQHHKRVRIPGIDKIESTQKSMTECREWKVGDKVWVKPKHIT FT PCTQRWLKGSIVALHNGLTAEVDVNGHTLPRHFSHLQHRRVDTKEEDTTDD FT YDSTLFKLPETSSKHIVNDDPNLQIIPPDNGKNDNSTGSDQNYSTDGETES FT QPVQFTRYGRRVLPNRKYNDYV*" XX SQ Sequence 3706 BP; 1272 A; 642 C; 834 G; 958 T; 0 other; agctaaccgg gcatgtctat tatctccgca atatgaagat ggacagccaa agcgatttat 60 tcgacagatg acttccttct ttgcaataaa caatataact tcagatcaac acgtagcgtt 120 tttgcttgcc gctctgccgt taaagactta caagaccatg gaagaattaa cagcaccgga 180 agttccagag gatgtttcgt acgaagaatt ggtaacgaaa ttggaaacgt tggaaataag 240 aaaatcatcg ttaatctcac ggtatgagtt tggaaaaatt taccgaggta ctgaagaatc 300 tatacagacg ttcgcgcatc gagtgattcg agcagctcaa gactgcaagt tcttgccctc 360 agagagagac ggccgacttc gtgaccaatt cataattggc attaatgatg gtgccattct 420 gcgtgccatt cttcggggag acgatgcatt gagctttgac gattgtgtca agactgcatc 480 cgtggttgca caaacaagat tagatgcggc atgtttatct gtaaatgaag aagtaaatgc 540 gattcaccgg acgactggaa cgacaagtta tacacgaaga aagaaagaaa ttcgatgtta 600 caactgtgaa tccttaaatc actttgcacg tgaatgtacc tctttatgca aaaaatgctt 660 gcataaacat gctgcaaaag actgctacaa acgtcaaaga aaccaggaaa acttaaatgg 720 gacgagttcg cattaggtat ctttacaaga actcgtcccg aaacaagact acctactatt 780 gacttggtaa taaacggaag aaaagctaca gcgcttgtgg atactggttg ttcgcaatca 840 atcatcacca agaagttact gccgcctgac tataaagtga ttcgtgtagt aaatgttgca 900 acaatgggcg gaataattcg atgcgaagaa gttagtgcta aaatatgcca cccaacgaac 960 gggtgcttgg tggatgtggt ggttctgagt gtgaatgaag taataaacaa atacgacatg 1020 attatgggta tggacttaat aaagacgtgt gatggactac aaattggcac tggaaaactg 1080 aaattgtgta attctttgat tgagttgtca tcgaaacttc gtgttgcgac aattgaaatt 1140 ggaacaagca aagcttattt tgatggcaaa aaatggattg taagttggga gtgggaaaac 1200 gaacctaaaa tgaaaaatag aacaagtcag tataatgtag ctgaccaaca ttgtgaggcg 1260 tttaacaacg aaataaatga atggatcaaa gaaggtatac tcatcaaata tgacgataaa 1320 attatgggaa aacctaaagg tctactaccg ttaatgtgtg tcattcaaga taataaaaag 1380 aagataagac ctgtgcttga ttatcgtgag cttaacacgt atgtaaaagc ctctacaagg 1440 gatgctgatg tcatcaatga tgcaatgagg gagtggagac tcatgagtaa aaacagcaat 1500 gactcggtag tggatctaag aagagcatat ttacaaattc atgtgcataa aaggcattgg 1560 ccttaccaga cggtcatcta taaaaatcag cgatatgcac ttacacgttt gggctttggt 1620 ttaagtttgg caccagtgat gatgagctca atattgcggt atgttctcca acaagattct 1680 gcactggcaa aaacaacaag aggatacatt gatgacaccc tagcaaccaa tggaaatgga 1740 agaaagctgg tgatacatat gaagaaatgg ggtcttcatg caaaagcacc tgaatatttt 1800 ggcgaaaacc cagttcgtgc actaggatta gaagtcacag taaatccatt aacaaaggag 1860 atgttttgga agagaggacg cgagcttcct attattaaat ctgtgctcac aaagagggat 1920 atattctcat tatgtggtaa tctgattgct aatctaccag ttgctggatg gttgagaccg 1980 gcatgcagct atataaaaag atgtacaaac gaagcaggtt gggatgaacc tattacgaat 2040 aagtatgtta tcgaaatgat gcaggaagtt gtagataaag tcgtaaagtg cgatccggca 2100 aaaggtgtat ggaatgttaa accaaatgca tctctgactg tatggtgcga tgcaagttcg 2160 ctggcaaaag gagttgtgat aacgcaaggt gacagaaaga tagaagatgc gacatggctc 2220 cgaccaaagt ctgatgcaat gcatatcaat gtggccgaat tagatgcatg tattgaaggt 2280 ctaaatatgg cattgaaatg gaatccaagt gatgtctcca ttaaaactga ttctcatagt 2340 gtatttagtt ggttaaattc tgaattgact cgtgacaaac ctgtgaaatg tgctgggcaa 2400 tgcgaaatgt taattcgacg aagacttcaa atatttgaaa aattggtcga ggattatgtc 2460 ataaatgtaa ctgttacatg ggtaccaaca actaaaaata ttgcagatga actcacaaga 2520 gtgccaacaa aatggataaa aaacaaagtt gcaacacaag aatgtaatac catttataag 2580 tacgataaac agcagattcg aaaaattcat tcactagcac acatgggagt ttccaagagt 2640 atggagttat gtatgcgcgc tggtatccac gttaccagag acacagtgaa acaggttgtc 2700 gctgaatgcg atgaatgtaa ctcgattgac cctaatgtga tgttgtggga taaaggaaat 2760 cttgctgtta gcaaagtatg ggaaagagta gcctgcgatg ttactcatgt tgattcatcg 2820 ctatatctca cttgtattga ttgcggccca agcagatata cgttatggac accattaagc 2880 gacgaaagtg caactatggt tgttcaagcg ttagaaacaa tatggtcaac acttggacct 2940 tcttctgaag ttcttctaga taactctaag tcatttagat ctcagcttat gcaaactatg 3000 gctgcgaaat gggaaataag gctcatctat cgagctgttg aaaaaccatc gggaaatgga 3060 atagttgaaa gagttcatcg gacggtaaaa agaacaaatg caagagtaaa atgtggtatt 3120 cccatgtctg tatttttaat caataacgtt ccatcaaaac acggcacacc ttttgaaata 3180 ttttgccaac atcacaaacg agttcgaatt cctggaatcg ataaaatcga atcaacacaa 3240 aagtcgatga cagagtgtcg tgaatggaaa gttggcgaca aagtgtgggt gaaacccaag 3300 cacataacgc cgtgtacgca acgatggctg aaaggaagta tagtggcatt gcataatggt 3360 ttaacagctg aagttgatgt caacggacat acgttaccac gacatttttc tcatctgcaa 3420 catcgtcgtg tagacacaaa agaagaagat acaacagatg attatgattc aacactattt 3480 aaacttcccg aaacgagctc aaaacacatt gtcaatgatg atccaaatct tcaaataatt 3540 cctccggata atggtaaaaa tgacaatagt acgggcagcg atcaaaacta ttcaacagat 3600 ggtgagacag agtcacagcc tgttcaattc actcgctatg gaaggagagt acttcctaat 3660 cggaagtaca acgattacgt atgagctctt gttgagctaa ggggga 3706 // ID piggyBac-N7_BF repbase; DNA; INV; 1040 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-N7_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW piggyBac-N7_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1040 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1040 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-1040 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-N7_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX CC Its ~300-bp terminal portions are similar to these in CC piggyBac-2_BF. XX SQ Sequence 1040 BP; 303 A; 183 C; 237 G; 311 T; 6 other; cccttgtcct gcgggcccsg ttatataacc ggatgcacat gctgtgcctg tgtgcgggtc 60 cggttatwta accggctggg ggtattcccc acaggaacct ttgtggcttc tgtgtggtaa 120 ccaggaacag ctgggaagaa gatgggggct gcatgaatgg ggtcaagggt caatatcctg 180 gttgcgcaat caatcccaga gctctttgcg ggcataaatt atgttaatga gccaagatgg 240 ccgactgtcg tctgcagata tgttttttca tgtttttttc gtgttttggg gcataaattc 300 ttagtttagt cgctaaaatg tcaaagttaa cttggtaaca agtgcataag gagcctttcc 360 ccactgattc tgaggcagaa catccaattt tgggcaagaa tggtgaagaa atgctgccga 420 acatgagaaa agaagatatg cacgtgaact ttaccatttt tatggaaaat taccccccac 480 gcaacaggaa acgaattagg acagtctgtt aagctgcaaa ttcgaccttg gttacttttg 540 tggaggtcat tgagtacctt gatgtggcaa ataattagtg acagatggag ctgatcatat 600 gttacattat tctattgaac agccttwtca gcacccagga caatacctag ccataaactc 660 acctgcagtg atacctgggt aatttttttt gctgcatgtg tggctctgat gttgtcagta 720 tgtcatagga atgttgtagt gttgatacta agattattta tgaaattcag gtattattaa 780 tacaaaagtt tatgaaaata acttgtttta ttgatttatt acccttgcat atcattttta 840 tgactcaggg tmaaactttc aagacattac twaaaagtag agactctcag ctttctattg 900 atgtgtaaca ttatagggtt acttaaatgc aaagtaasta aaaaatagca aagtaaaagc 960 atttcccata ctttgaagag agtcaaaaat gcccagcaca gggagggtat atgggtaaaa 1020 aattgccagc aggacaaggg 1040 // ID DNA4-1_SM repbase; DNA; INV; 229 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Non-autonomous DNA transposon: consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW DNA4-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-229 RA Jurka J.; RT "DNA transposons from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1835-1835 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 229 BP; 76 A; 37 C; 46 G; 70 T; 0 other; ccccttaacg cacatgcccg agcgtaactc gggtaagtgt tactggcaaa tttgtgcatg 60 cccgagagag actcgggtat acgaaatata aagttttgtt taagattcac taacgattat 120 atctcgggca gtgtaataaa tttatcgaca taggtctata tattaattat caataataaa 180 aaaatgattt atttttgccg ttgtatcaaa aaaaacgtgc gttaagggg 229 // ID Gypsy-11_TCa-LTR repbase; DNA; INV; 301 BP. XX AC chrUn_5; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_TCa_; KW Gypsy-11_TCa-I; Gypsy-11_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-301 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_5; Positions 91555 91255. XX SQ Sequence 301 BP; 91 A; 54 C; 75 G; 81 T; 0 other; tgttagaata gttcccaggg ttggagtctg gagtcttgga tcaatctttt aacaccggtg 60 atacggcact taggattatc gcaaatgttt agaccaagag acagcccgga acgacttgct 120 cgggatacga agtaccaagg gggcagaggg acggtatccc agatagcgag agaaaaatca 180 gccgggggat atattgaatt aggccgtgta actgtacaca tgtgtccaat actaaattaa 240 ataaacattg gtctgagcgt cttgaatttt tttttcaata atctgaatag cgtttctaac 300 a 301 // ID DPSAT1 repbase; DNA; INV; 324 BP. XX AC M31307; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE D.pulchellus satellite DNA. XX KW SAT; Satellite; Simple Repeat; DPSAT1; KW Satellite repetitive element. XX OS Diadromus pulchellus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Ichneumonoidea; OC Ichneumonidae; Ichneumoninae; Diadromus. XX RN [1] RP 1-324 RA Bigot B.Y., Hamelin H.M. and Periquet G.; RT "Heterochromatin condensation and evolution of unique RT satellite-DNA families in two parasitic wasp species: Diadromus RT pulchellus and Eupelmus vuilleti (Hymenoptera)."; RL Mol. Biol. Evol 7(4), 351-364 (1990). XX DR GenBank; M31307; Positions 1 324. XX SQ Sequence 324 BP; 87 A; 75 C; 79 G; 83 T; 0 other; gataatttcg agccagctac tttcgtttag ccagaaaagt gcttttcgct tgaaacggca 60 aatcctgctt tctgagcgct gtagagatgc gggcctgccc aaaaccccct agctctcttc 120 atttggtgta actatgcatg aatcaaaaga gaaacccgat cccgagcttt ataattgctg 180 ggaaaacgcg ttttaggcga aaacgcgagt acggtcgatt tgcgtcgaaa agcaaaaaac 240 ctgctttctg agcgctgtta aggccgggga tcgctgtaaa ccctctgagg cgtttcaatt 300 agtcaaactg tacggtggaa cagt 324 // ID BEL-16_DWil-LTR repbase; DNA; INV; 380 BP. XX AC scaffold_181145; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_DWil_; KW BEL-16_DWil-I; BEL-16_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-380 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181145; Positions 1699889 1700268. XX SQ Sequence 380 BP; 126 A; 55 C; 88 G; 111 T; 0 other; tgttgtgaag aaatgacaat agcgttagaa tttaaattca aattgaaatg taatttgata 60 gtgttagact acacataatt tagctataag aacatatgat tgttagtgtt gtaagacaca 120 gtgttggaag acactagaag ggctagtgta caaaaacggt cacttgatat cgagccgtgg 180 agagaaaagc gtgagtaaaa ttcgctatta gtttttaagt tgatttaatt tgtaatattt 240 cttaggaaat aaaactctct atcacactct ctactcatca actcctcaac aacttggatg 300 aagttggaac caggggcgtt tggagggggt cagaagttat tataagccaa ggggtccacg 360 gcgcgtgggt cgagtcaaca 380 // ID BEL-67_CQ-I repbase; DNA; INV; 6488 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-67_CQ_; KW BEL-67_CQ-LTR; BEL-67_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6488 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 287-287 (2011). XX DR [2] (Consensus) XX CC Positions [4961-5521] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 48..2720 FT /product="BEL-67_CQ-I_2p" FT /translation="MSSHPSTKGGDAGECCIVCREEACNETVKCDRCRRRF FT HFKCVGVSQELDGWRCAICWNEEPVFKTNSDGAGTSGLKPIPKFEELLTPV FT GPGKKKPSADPAKKVTKPKKELKPRPKPTPKPNPKPDPKPNPEPDPKPAPE FT PIPTPVPDIPKLPPKKAKSLVSHISSASGSTSKAVLKLKLQKLEEQRLLIE FT TQMEKERTLLENERTLLERKYEVLEEKYQLLEEQVAGSDGVESRNGSVTES FT RFAPPDASSRHSDSSDEPKQSSYANSSSPGEKTGKHHGKTPQPDRTDFPKR FT DPCPLTTKQLAARQAVSKDLPCFNGNPADWPLFISTFDSTTAMCGFTDEEN FT IFRLVKCLKGRALERVQNRLTHPANLPGILRTLKLMYGQPEAIVHSLIGKI FT HALPVVSADDFEALVDFAVTVENFCATVDSCGLEEHLYNVTLLHQLVNKLP FT PTIKLNWAQYKLSLPGANLASFSNWIYSLAEAASTIIIPNNPASQPVRNES FT RSTTKKENAFLNAHSNDTSPEPSNVNKSSKPSEGCLVCKGSCKTIDKCKRF FT LELSRDSRWAAVREFSLCRSCLRRHAGDCQQKCGKDGCQFRHHKLLHNDQK FT PFSPGTTSSESPSQLPSTNNSEKPKEESSSTLHGCHTHQTKSSGTLFRYLP FT VMLSGKRRSVRAYAFLDDGSALTLLDRELADELGLEGEVNPLCLHWTGGTQ FT RQETNSRIVSLEISGTHGGAKSYPISGVRTVNELMLPFQTMDLDELSQLNT FT HLLGLPIASYHGVRPQILIGLKNLHLSMILNCREGKPDQPIAVKTRLGWTV FT CGGGNPELVPSPVHSVFHVASGDTDEDLHRTMKDYFALDSLGIMKPGGPLL FT STEDQRAQSLQESLTRFTGERYETGLLWRSDKTRLPDS" XX SQ Sequence 6488 BP; 1663 A; 1726 C; 1665 G; 1284 T; 150 other; acatcttata aaatacgttt acggataatt gcgtcagtca gcccgttatg tcgagtcatc 60 cgtcaactaa aggcggtgat gccggcgaat gctgtatagt ttgccgggag gaagcttgca 120 atgagacggt gaagtgcgac aggtgccgca gacggttcca cttcaaatgc gtcggagtct 180 cgcaagaact tgacggctgg agatgtgcga tatgctggaa cgaagagccc gttttcaaga 240 cgaattcgga tggcgctggg acatccggcc tcaaaccgat tcccaagttt gaagagctgc 300 tgacgccagt agggcccggg aagaaaaagc cgtcagcaga tccagccaaa aaggtgacta 360 agccgaaaaa ggaactcaag cccagaccta aaccgacgcc taagccaaat cccaaaccgg 420 accctaagcc gaaccccgaa ccagatccta agcccgcccc ggaacccatc cctacgccgg 480 ttccggacat tccaaaactt cccccgaaga aagcaaagtc actggtgtcg cacatttcga 540 gcgcatccgg ctcaacctct aaagccgtcc tgaagttgaa gctacagaaa ctggaggaac 600 aacgtttgct gatcgagaca caaatggaga aggaacggac tctgctggag aatgaaagga 660 cgctgctaga aaggaagtac gaggtattag aggaaaagta ccagttactt gaggagcagg 720 tagcgggctc tgatggcgta gaaagcagga acgggtcggt cacggagtcg aggtttgcac 780 cacccgatgc ctcaagtcgg cactcggact caagtgatga accaaaacaa tcgtcctacg 840 caaacagctc gtcaccggga gaaaaaactg gaaagcacca cgggaaaacg cctcaaccgg 900 atcgcactga ctttcctaaa cgagatccgt gcccgctgac caccaagcag ctggcagctc 960 gccaggccgt ttctaaggat cttccctgct ttaacggcaa ccctgcggac tggccgctct 1020 ttatatccac attcgacagc acgacggcta tgtgtggatt cacagacgaa gagaacattt 1080 ttcggctagt gaaatgtctg aaagggcgag cgctcgaaag ggttcaaaac cgtctaacgc 1140 acccagccaa ccttcccggg attcttagga cgctgaaact gatgtatggg caaccagagg 1200 caattgtaca ctcgctgatc gggaagatcc acgcgctacc ggttgtaagc gcagatgatt 1260 tcgaagccct tgtggatttc gcggtgaccg tcgaaaactt ctgtgccaca gtggattctt 1320 gtggcctcga agaacatctg tataacgtca cgcttttgca tcaactggta aacaaactgc 1380 caccgactat caaactaaac tgggcccagt acaagctctc actcccagga gccaacctgg 1440 cctccttcag caattggatc tactccctag cagaagcggc aagcaccatc attatcccta 1500 acaacccagc ttctcagcca gtgcgaaacg aatcaaggag cacaacaaag aaggaaaatg 1560 cctttctgaa tgcgcactcc aatgatacgt cgccagagcc gtcgaacgta aacaaatcga 1620 gcaagccctc cgaaggttgt ctggtgtgca aaggtagctg caaaacaatc gacaagtgca 1680 agcgattcct agagctctct cgcgattcta gatgggcggc ggtgcgcgag ttctcactct 1740 gcagaagctg tcttcgtcga cacgctggag attgccagca aaaatgcggg aaagacggat 1800 gccagttcag gcaccacaag cttctgcaca acgatcagaa gccgttctca cctgggacaa 1860 catcatccga gtctccaagc caactgccct caacgaacaa cagcgaaaag cccaaggaag 1920 agtcgagttc gacgttgcac ggctgtcaca cacaccaaac taagtctagt ggaactctat 1980 tccgctatct cccggtaatg ctgtccggga aacgtcgatc tgttcgagcg tacgctttcc 2040 tcgatgatgg ctcagcactg accctgcttg accgagagct agccgacgaa ttggggctgg 2100 aaggtgaagt caacccattg tgcctgcact ggacaggagg cacacagcgt caagaaacaa 2160 actcacggat tgtaagcctg gaaatttctg gaactcatgg cggggcgaaa agctacccca 2220 tcagcggcgt tcgtacagtt aacgagctta tgctcccgtt ccaaacaatg gaccttgatg 2280 agctctccca actcaacacc catctgctag gtctaccaat tgcgtcctat cacggtgttc 2340 gaccacaaat cttgatcggc ctcaagaatc tgcaccttag catgatactc aattgccgtg 2400 agggaaaacc agatcaacca attgcagtaa agacacggct gggatggacg gtatgtggtg 2460 gcggaaaccc agaattggtt cccagtccag tacattccgt tttccatgtg gcgtccggcg 2520 acacggacga agatctgcac agaaccatga aggactactt tgcactggat agcttgggta 2580 tcatgaagcc tggcggccca ttgctctcca ctgaggacca acgcgcgcaa tccctacaag 2640 aatcgctgac cagattcaca ggagaacgct acgaaactgg tttactttgg cgctcggaca 2700 agacgcgctt accagacagc maggmtatgg ccctgcgtcg wttcaagtgc ctkgaaaggc 2760 gcatggaaag agatsmagma ctggccgaag cgctgaagsa gaagatgacs gaccacctgc 2820 gcaaaggtta catccggaag ctcactcagc aagaattgaa ccagtcttgs caacgctctt 2880 ggtaccttcc ggtgtttcck gtcacgaacc ccaacaagcc cggtaaggtg cgaatggtst 2940 gggacgcggc ggccaaatct cacggagtgt ccctcaactc agcgcttctw aagggacccg 3000 atttgctgwc ttcgctctat accgtgctaa tcctcttccg wgagaaccct gtcgcgctgw 3060 ccggagacat cmgggagatg tttcatcagg ttgcaatccg ccaagaagac cagcastgcc 3120 agcggttctw ctggmgggat gaagacgggc agttagcagt gtacgcgatg tgtgtcatga 3180 ctttcggcgc ctgttgctct ccgagcagcg cccagtwcgt gawgaaccwt aacgcagawc 3240 gctttacgcg ggaatacccw acggcggcgg aagtcatacg gaatcaacac tacgtcgacg 3300 acatgctgtc tagcgtsgac accgaagmgg aagcgatcga gmtsgcgaag gacgtgaagt 3360 ttgtgcackc tgaaggcggg ttcgaaatcc gtaactgggt cwscaactcw cascgcgtcc 3420 tscmggcttt gcttggggat amcgccgaag agaagaactt ggatctgtcg ccggagatgg 3480 ctacggaaaa ggtgctcgga atgtggtggt gtactsaagc cgacktcttc acattcaagg 3540 tcgkctggga tcggtacgac sgwgacctgc tggaaggtcg tcgccgaccc acgaagcgkg 3600 aggtactsag sgttctcatg wcgatattcg atcccctggg gctcatcgca ccgttcctsg 3660 tkcatctgaa aatcctacts caagacgtct ggcgctcggg tgtccwatgg gacgacaaga 3720 tcaacgacaa gtmkttckma aagtggawaa tctggctgcg cgtcctccca caagttgaac 3780 gaatacaaat wccacgttgc ttccgattgg ggttccctga ctacgacgaa gttcaactgc 3840 acacgttcgt ggatgcgagc gagaacgcaa tgkctgccac gtgctatttg cgcttcgwga 3900 aggacggatc aattaggtgc tctcaagtwg ccggcaagac cmgagtcgct ccactgagat 3960 wccwctcgat ccccaaacta gagtgtgagt cagcggtggt cggcgtcmgg ctagcccgca 4020 ccgtaaccga atcgctctcg ttcacagtag acaagcgctt ctwtcacacg gactcccggg 4080 atgtcmtctg ttggctgaac tcmgatcatc gtcggtacws sccwttcgta gcmtcccggg 4140 taagtgaaat tctggagwat accgaggtta acsagtggmg atgggtgccg wccaaactga 4200 atgtcgccga tgacgcaaca aagtgggaaa agcttccaga catgacatcg agttcgcgat 4260 ggttcaacgg tccggaattc ctgtggcwgc cggaagaagc ctggccgcgs cagctggaga 4320 aaggctgcca aaccgacgtc gagctccgcc caagcctgtt agcacacttt acgaaaccag 4380 actctacaat tgaggtatcc agctggaagc tgatgcttaa agttgctgtg gttgtccatc 4440 gtttcccctc aactgtaggc gcaagaaagc taagcagccg gccatctgtg gaccacccac 4500 atcggaggaa ctccttgcag ctgagcgtta cgtgtttcgc caagctcaac gagaggtcta 4560 cgcagatgat gttgccattc ttctgagacg tacggaagaa gccataacta acctctctcc 4620 aaccagttca ctgtacaagc tgtcaccgtg gttggacaag catggtttgc tgcgaatgcg 4680 aggaagaata agcgcctgcg cgctagtcac cgaagacgcc aagaacccaa tcatccttcc 4740 tcgtgaccac cacgctacgc aactcgtaat cgaacacwat cacaacaaat accatcacca 4800 aaatcacgag accgtcataa acgagctgcg acaaaggttc agcatcccgc ggctgcgcgt 4860 tgtttatgct aacgtgcgta aaaactgtca acggtgcaag aacgaccgcg ccaccccaag 4920 gccacccatg atggcggatc taccaccaga gcggcttgat gcactwgctc gaccgtttac 4980 acacgtgggc atcgattatt tcgggccatt ggtggtctcc gtcggccgcc gcacggagaa 5040 gcgatgggga atgctgatca cctgcttgac cactcgggct atccacatcg aggtggtgca 5100 cagtctcagc accgattcgt gcatcatggg cctacgaaac ttctcggcac gtcgaggaac 5160 tccaagaacg atctacagtg accgtgggac ctgtttcatc ggagcaaaac gggagataca 5220 cgaagctacc gagaacataa aactagaaga tgtaatgaag gaatttgtta gtgtggagac 5280 aacctggaag ttcaaccctc ctctgtcacc acacatgggc ggtagctggg agaggctcat 5340 cggaattgtg aagcgcaatt tgatggcaat ccgtcccgtg agaaacccca gcgacgaagt 5400 actgcgaaat cttcttactg aaatcgagca cactgtcaac tcgcggccgc tgacacacgt 5460 tccagtagac gacgaatccg cccctgcgct tactccgaat cacttcctgt tgggtacatc 5520 cgacggttcc aagccaacct gcacactcga tgacgatgga atggtactgc gacggagctg 5580 gcgaacatct caggtgttgg ccaaccgatt ctggaaacgg tggcttaacg attacttgcc 5640 ggaaatcacc cgcaggacaa aatggttctg taactccaag ccgattgaaa ttggtgacat 5700 agtcgtaatc gttgatccac ttctacccag gaactgctgg cctaagggaa agatcatcgg 5760 aaccagcata agcaagaagg atggtcagac tagggcagcc mcggtgcgaa cgtcgactgg 5820 tgtctacgaa cgtcctgttg ccaagttagc agtgctggat gttcggcgcg attakgtttg 5880 agctgactgg cgagtcagct cacctggggg gagtgttacc tgcgccactg cgcagggtgc 5940 gcacctcatc caccaacccc ttgggaatca acacaggcca actgacacga cggcccgcgg 6000 ctgtcamcag cgtgaagtag caagtcacta cttagattca gttgcaaagt gmgttcgtwc 6060 gmagacagcm ggtksggmas tttagagtgc cgcaaaggta cgaagctcgt aaktgccagt 6120 ttkggwtcwa tmgaacgtca kttgtmagtg cscgtgkaam tmtcacgwga ctkmwagawt 6180 atcctttaga gtkcggcact agwkgwtcas ttccaakmwg gcsgcgatca atcctccwcg 6240 tgcggwwcat macwcgwgtg attcttaacc tcamaakgma wcaskctgwc aggtkmkgag 6300 tgcgacamtc wgtgtggaac cagccwtcaa atckaacgta ccattasmtt cagcttcaaa 6360 gcgagttcgc acgcwgaccg acaaacmggc acttgagaga gctttaagwk ccgcwmamgt 6420 amggagwcta wamktattag tttgaaatct atagactwat tgcactaaat ctataagagc 6480 acgcgaag 6488 // ID L1_Ele21 repbase; DNA; INV; 4574 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele21. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4574 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4574 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 20 CC sequences with >96% identity, and ~99% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 127..1242 FT /product="L1_Ele21_1p" FT /translation="MASSPYHLTSNGPRVNTLVIDLANLPAKKINMEMIAK FT FIHDKLRIQFSSIQSLQHNTGKSLVFIECQTEELAIAVTERNDGNYEISVD FT NVKYKVNVFMEDGATTVRVHDISPQTENSSIEKELERYGDIVWLKEEMWTE FT PAVLKGIPNGVRSVRIRLHTAIPSYININGEVTLVTYKHQQQTCRHCGKPV FT HWGRKCIEAGYMEMQLNGGNISDRLRASGVDYAGVLKTSNQTGTSQEIRNS FT VSSEAEKNRVDPNYTNLTQLFNKSSSQGQDRKKITTNTVESNEKLAIKSTN FT TNTNSKFQAPSENTESSAAKTATSKLARGSMAVNPIASGIEVSNNFDGLDI FT SDDDMDGTHASGCKSPSRKKIGLSVDSSE" FT CDS 1142..4444 FT /product="L1_Ele21_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MVSISRMMIWMVLMQVVASHLLERRLVYRWILLNNHK FT MDMLSYNIGTLNINGIANENKINALRSFIYLMDLDILMLQEVHEHITISGF FT DTFSNIDLNKRGTAILVKSHINTSHVQKSIDSRVISLKIENHTTICNIYAP FT SGTAQKSLRERFFTDTIHYYVNNGDGRLIVGGDFNSVIDKKDANHGSNFSQ FT SLKFVVQSLKLNDVWQIVHKNAVDFTFIRSGSGARLDRIYVSDDLKGYVST FT VRTNVCAFSDHKIVILKLKLPNLGRPFGRGLWRLNDLALTDENLAEFREKW FT IYWVRQKRYYDSWFSWWNEYAKIKTVSFFKWKYSTLKRRYSDTMEFFYSAL FT NILYNDYLTNPDVLSEINKVKAKMLALQKEMSNNFYGRSETFISGENSSIF FT QLGEQKQRREKTHIKTLRINGQDEADIEIVTEEITTFFKNLYTKSNVVIEN FT SFEPQKTIQDNFPANSRLMDPISEEELFQAIKTSQSRKCPGKDGLTKNFYI FT KCWEIIKTELLLVVNEVINGKTSTKFLEGVIVLIRKKKTNNDISGFRPITL FT LNFDYKLVSRILKNRLVKLNEALLSNTQKCSNGSRTIFEATCGIRNKIVEI FT NLKRKKGLLISFDLEKAFDRVNHEYLLATLSKMGINNRFVDFMRLCQNNTY FT SRILINGNMSETIRIERSVRQGDPLAMILFCFYLEPLLQKIQNICEDEMDL FT LVGYADDISLIIFDVRKLSVVKQIFDNFELVAGAKVNYNKTKAMRIGSNNN FT FILPDWLEHEEYLKILGIWFNNNSNAMIQKNWDEILQNLRYMLWNSQFRNL FT NLIQKIIFLNTYASSRMWYTASTVSVHKKTLTKIKSLFGNFLWYRCSLRIS FT FNQLCLPRKRGGLGLISPEHKCKSLLINRFLRLRQSSPFLHNYTDRIENPP FT NMKGLPCNAHFLKTVYLELAYISENIKNNPTSQSIYNHFCSNLPKPHITEK FT YPQFRWDIIWKNVFSRQISSDHQVTWFLLINEKIICAEKQFRFGSRVSPNC FT VYCPQEIEDLKHRFSTCPKIRNCWWFTVTQIKLLNRRKFRNISFDDFKFPV FT LRTYNQLERKIASKMFLEFLRYAIANDREDVSVEELKFIMNCNI" XX SQ Sequence 4574 BP; 1611 A; 724 C; 922 G; 1317 T; 0 other; cagttccgtt ttagctatcg ttcagagagg acgtctttct attgtgcgga gtgtgaaaca 60 aaagtgattt tttctccagt ttcggctggt acgacaaaca ataaaccaca gaacaaacac 120 gaaacaatgg cttcatcgcc gtatcatcta acgtcgaacg gaccacgtgt taacacgttg 180 gttatcgatc ttgcaaattt gccagccaag aagatcaaca tggaaatgat tgccaagttc 240 atccacgaca agttgagaat tcagttttcg agcattcaat cgctacagca taacacagga 300 aagtcattgg ttttcatcga gtgtcagacg gaagagctag ccatagccgt taccgagcgt 360 aatgacggaa actatgagat cagcgtggac aatgtaaaat acaaggtcaa cgtattcatg 420 gaggatggcg caactacagt tcgagtacac gatatttcac cgcaaaccga aaattcatcg 480 attgaaaagg aattggaaag atatggggac attgtttggt tgaaggagga gatgtggaca 540 gagccagccg ttctgaaggg tattccgaat ggtgtaagat cggttcgtat ccgactgcat 600 acagcgattc catcatacat caacatcaat ggagaagtca ccctcgtcac atacaaacac 660 caacaacaaa cgtgcaggca ctgcggaaaa ccagttcatt gggggcggaa atgcatcgaa 720 gctggctaca tggaaatgca attgaatgga ggtaatatca gtgataggct tagagcatca 780 ggtgtcgatt atgcaggagt attgaaaact tctaatcaga cagggactag tcaggaaatt 840 cgtaattctg ttagcagtga agcagagaag aatcgggtag acccgaacta caccaaccta 900 acacaattat tcaacaagag ttcaagtcaa ggtcaagata gaaagaaaat aactactaac 960 accgtagaat caaacgaaaa actagcgatc aagtctacta acactaacac taactcaaag 1020 tttcaagcgc ccagcgaaaa cacagaaagc tcggctgcca aaactgccac atcgaaacta 1080 gcacgtggtt caatggctgt gaacccaatc gcctctggta ttgaggtttc gaacaatttt 1140 gatggtctcg atatctcgga tgatgatatg gatggtactc atgcaagtgg ttgcaagtca 1200 ccttctagaa agaagattgg tctatcggtg gattcttctg aataatcata aaatggatat 1260 gctctcgtat aatattggta ctctaaacat caacggaatt gcaaatgaaa acaaaataaa 1320 tgctctgaga tcgtttatct atttgatgga tttggatatt ttgatgttgc aggaagtaca 1380 cgagcacatt acgatcagcg gttttgacac tttttcgaat attgatttaa ataaacgcgg 1440 cacagcaata ctagttaaat ctcatatcaa cacctcacat gtacagaaaa gtattgactc 1500 cagagtaata agcttaaaga tcgagaatca cacaaccata tgtaatattt atgcaccttc 1560 cggaacggct caaaaatctc tcagagagcg gttttttaca gacactattc actactacgt 1620 taataatgga gacggaaggt taattgttgg aggtgatttc aattcggtaa tagataaaaa 1680 agatgctaac cacggttcga attttagtca gtcgttgaag tttgttgtgc aatcattaaa 1740 attaaatgat gtttggcaaa tagtacacaa aaatgcagtt gatttcactt ttatcagaag 1800 cgggtcagga gcgaggcttg ataggattta tgtaagcgat gatttgaaag ggtacgtgag 1860 caccgtgcgg acaaatgttt gtgcattctc cgatcataaa attgtaattt tgaagcttaa 1920 gttgccaaat ttgggtagac catttggccg agggttgtgg cgtttgaacg atttggcttt 1980 gacggatgaa aatctagctg agtttagaga aaagtggata tattgggtaa ggcagaagcg 2040 atattacgat tcatggtttt cctggtggaa tgagtatgcg aaaattaaaa cagtgtcatt 2100 tttcaaatgg aaatatagta cattgaaacg acgttatagc gataccatgg agttttttta 2160 ttcggcgttg aacattttat acaacgatta tttgactaat ccagatgttt tgagtgaaat 2220 caacaaagtt aaagcgaaga tgctggcgct gcaaaaagaa atgtcaaata acttttacgg 2280 tcgttctgaa accttcatca gtggggaaaa ttcttcgatt tttcagttag gagaacaaaa 2340 acaacgtaga gaaaaaactc atattaaaac tcttcgtatc aatggtcaag atgaagcgga 2400 tattgagatt gtcactgaag aaatcaccac atttttcaaa aatttgtata ctaaaagtaa 2460 tgtagttatc gaaaattcgt tcgagccgca aaagactata caagataatt ttccagcaaa 2520 tagtaggtta atggatccca tttcagaaga agagttgttt caagctatta aaacgagcca 2580 gtcaagaaaa tgtcctggaa aagatggatt aacaaaaaac ttttatatta aatgttggga 2640 aatcatcaag acagagcttc tgttagtggt aaatgaagtg attaacggaa aaacatcaac 2700 caaattttta gaaggagtaa tagtattgat cagaaagaaa aaaacaaaca atgacattag 2760 cggatttaga ccaataacat tgttgaactt tgactacaag ttagtttcaa gaatcctcaa 2820 aaacagatta gttaaactga atgaagcttt actatcaaat acacagaagt gttccaatgg 2880 ttccagaact atctttgaag ccacttgtgg cataagaaat aaaattgtgg agataaattt 2940 gaaaagaaaa aaggggttgc tcatttcttt cgatcttgaa aaggcgtttg atagggtaaa 3000 tcacgagtat cttttagcga ctttatcaaa aatgggaata aataacagat ttgttgattt 3060 tatgaggtta tgccagaata acacgtattc aaggattctt attaacggta atatgtctga 3120 aacgattaga attgaaagat cagtacgaca gggagatcca ttggcaatga ttctgttttg 3180 cttttatctt gagcctcttt tgcaaaagat ccagaatatt tgtgaagatg aaatggactt 3240 actggtaggt tatgcagatg acataagtct gatcattttt gatgtgagga aactgagtgt 3300 agtgaaacaa attttcgata attttgaatt agttgctggg gcgaaggtta attacaataa 3360 aacgaaagca atgaggatag gatcaaataa taatttcatt ttacctgatt ggctggagca 3420 cgaagagtat ttgaagattt tgggaatttg gttcaataat aatagcaatg cgatgattca 3480 aaagaactgg gacgaaattt tacaaaattt gaggtatatg ctttggaata gtcaattcag 3540 aaatttaaat ttaattcaaa aaatcatttt cctgaacact tatgcgtcat cgcgtatgtg 3600 gtatacggca tcaacggttt cggtccataa gaaaactcta acaaaaataa aatcgctctt 3660 tggaaatttt ttgtggtatc gttgttcttt aagaatttct ttcaatcaac tgtgtttgcc 3720 aaggaaaaga ggaggtttag gtctcatttc accagaacat aagtgtaaat cgttgctaat 3780 caacagattt cttcgactgc gtcaatcttc acctttctta cataattata ctgatcgaat 3840 tgaaaatcct ccaaacatga aaggtttacc gtgcaatgca cattttctca aaactgttta 3900 tttagaacta gcgtatattt ccgaaaacat aaaaaataat ccaacatctc aaagcattta 3960 taatcatttc tgttctaact tacctaaacc acatataacg gagaaatatc cacaatttag 4020 atgggacatt atttggaaga atgtattcag tagacagatt tcttcagacc atcaagtaac 4080 atggtttctc ttaatcaatg agaaaatcat ttgtgctgaa aaacagtttc ggtttggcag 4140 tcgagtatca ccaaactgtg tttattgtcc tcaagaaatt gaggatttga agcataggtt 4200 ttctacatgc cccaaaatta gaaactgttg gtggtttact gtaacacaaa ttaaattgtt 4260 aaacaggcgt aaatttagaa atattagttt tgatgatttt aaatttccag ttttgagaac 4320 atacaatcag ttagaaagaa agatcgcatc gaaaatgttt ttagaatttt taaggtatgc 4380 tattgcaaat gatagagaag atgtatctgt agaagaattg aaattcatta tgaattgcaa 4440 tatataggtt aatatataaa tagaagaata cacgatcacg tgcttgtttt gtaaaacaga 4500 aaattgtaat catgtaacaa atcacaaaat aatggaaata aacacgtttt aagtaaggaa 4560 aaaaaaaaaa aaaa 4574 // ID piggyBac-14_SM repbase; DNA; INV; 2337 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-14_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2337 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 533-533 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-14_SM is a young family of piggyBac transposons, CC characterized by 14-bp TIRs (1 mismatch) and TTAA target-site CC duplications. The consensus sequence was reconstructed based on CC multiple alignment of 12 copies, which are ~98% identical to the CC consensus sequence. XX FH Key Location/Qualifiers FT CDS 379..2016 FT /product="piggyBac-14_SMp" FT /note="piggyBac transposase." FT /translation="MSDSESEYEYANLSESGSEDSDNYSSDSEDSSDEDIA FT VDWKQISDSQPNPPPPRFPFVGASGPTFFFDNNFDVLEYFKLFFDDSVMDI FT IVLETNRYAEQQPGGSTTQWKNVDKNEMMIFFAICILQGLIKKPEERQYWS FT TNEIWNTPIFPKLMNLRRYTSIKRNLHFCNNETFDRNTHPNPKLYKIWPIY FT EIINNKCSSLYIPERDITIDESLLLYKGRLGWVQYIPLKRARFGIKLFLLC FT ESKSGYLFSFIIYTGKGTVIDVKYKDMPVTSQIVVSLLDPLLDQGYCLTTD FT NYYTSPQLADFLVKHKTDTYGTVRKNRKDIPKFIQTKKLKKGEIVAAQRGK FT VMVMKWQDKRDICLLSTIHNTEKAATNKTDKDGNVVSKPKLVLDYNETMGG FT VDRLDQHLHDYPIIRKRGKKYYKKIFFHLLDISIWNGYVLYKKNGGGKSNL FT DFRSELVEKLIQNYHSEQNIKKSGRPKSTGILRLTERHFPSFVPPTPKKEA FT PTRYCAVCCSKRDSKGKRKRRESRYMCNPCNVGLCAAPCFEIYHTKENFE" XX SQ Sequence 2337 BP; 862 A; 350 C; 394 G; 731 T; 0 other; ccccttaacg cgcatactcg agttaattcg agcgcgctaa accggccaaa ttgtgcacac 60 tcgaattaac tcgagcggcg tcacatttta tgccgctaaa atatttacac tcgtaaataa 120 tcgagcattg aaaataactc aataaaaact agttaaaata ttcgtttttc tgatatccat 180 cattttaact ctatattgca agctataaag ttaataaaat atggcctttt aggacgatgt 240 ttataaacag ttgcattttg tttaccaaaa aaattgtgta aattggatat ttagtgtata 300 aaaggtatgt tttctaatga tcaagtaagg aatattattc tatttgacat tttattctag 360 taaaatatat attgaaaaat gagcgattcc gaatctgaat atgaatacgc aaatttgtca 420 gaaagtggtt cagaagattc tgataactat tccagtgatt ctgaagatag ttcagatgaa 480 gatatcgctg ttgattggaa gcagataagt gatagccaac caaatccacc tccaccaaga 540 tttccatttg ttggcgcttc aggaccgact ttcttttttg acaataattt tgatgtgttg 600 gagtatttca aattattttt tgatgattct gtgatggata ttatagtatt agaaacgaat 660 agatatgctg aacaacaacc tggcggatct accactcaat ggaaaaacgt cgataaaaat 720 gaaatgatga tattttttgc aatctgtata cttcaggggc ttatcaaaaa accagaggaa 780 agacagtatt ggtctactaa tgaaatatgg aatactccaa tttttccaaa attgatgaac 840 cttagaaggt atacaagcat aaaacggaac ttacattttt gtaacaatga aactttcgac 900 agaaatacac atcccaatcc aaagctttat aaaatttggc ccatttatga aataataaat 960 aataaatgtt ccagtttata cataccggaa agagatataa ccatagatga aagcctattg 1020 ctttacaaag gtcgtttagg atgggtgcaa tatatcccac taaaaagagc aaggtttgga 1080 ataaaattat ttttactgtg cgaatctaaa agtggctatc ttttctcatt tattatatac 1140 acaggaaaag gcacagtaat agatgtaaaa tataaagaca tgccggttac atctcaaatt 1200 gtcgtgtcgt tgctcgatcc acttctggac caaggctact gtctaaccac tgataactat 1260 tatacatcac cacaattggc agacttttta gtgaaacata aaacggacac ctatgggact 1320 gttagaaaaa atagaaaaga tatcccaaaa tttattcaaa caaaaaaatt gaaaaagggc 1380 gagatagtag ctgctcaacg aggaaaagtt atggttatga agtggcaaga caagagagac 1440 atttgtttgt tatctactat ccataataca gaaaaagcag caacaaacaa aacagataaa 1500 gatgggaacg ttgtatctaa accaaaattg gtgttagatt ataacgaaac aatgggaggt 1560 gttgatcgtt tagatcaaca tctacatgac tatcccatta ttaggaagag gggtaaaaaa 1620 tactacaaaa aaatcttttt ccatttacta gacattagta tttggaacgg ttacgttttg 1680 tataaaaaaa atggaggtgg aaaatctaac ttagatttta gaagcgaact ggttgagaag 1740 ctaatacaaa actaccatag cgaacaaaac ataaaaaaat caggtagacc aaaatcgaca 1800 ggaatcttac gactaactga acgtcatttt cccagttttg ttccgccgac accaaaaaaa 1860 gaagcaccca caagatattg cgctgtgtgt tgttcaaaaa gggacagtaa aggaaaaaga 1920 aaaaggagag aaagtcgtta tatgtgcaat ccctgcaatg ttgggctatg tgctgcacct 1980 tgttttgaaa tttatcatac caaggaaaat tttgaataat tttattttat tatttttcaa 2040 ataaataaat attattatta tattttgtta caaatttagg ttcaattaaa tattttcaca 2100 gttaaaaatt tgtttttttc ttcaaatagt aactgaaaaa taaaattaca atagtcgatc 2160 cgtgatgcaa ttgaaaatcc actacaatta tttatccgct atttctatta aattttgaac 2220 aattttgcat cttttttttg ggttttagtg aaaataaggt tcaaatatgt attttttcaa 2280 ataaaaaaaa ttgttttttt ttgccggttt ttttcaaaaa aagtgtgcgt taagggg 2337 // ID Crack-5_CQ repbase; DNA; INV; 4716 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4716 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 36-36 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 358..1440 FT /product="Crack-5_CQ_1p" FT /translation="MIGGSMETDDSVTCCVCSEEENDLTAVITCMYCFSSS FT HLKCRGITNKSAHRMREKPYFCSTKCADIYQKITEMQRNDKSLISSISSQL FT NATVTKVVETQMKQVKAEVKSVTTAVESSQNFLSEKFDKIVSDFNEMKSDN FT TRLNLEVDLLKRSHSDLAGTVYKLESSVDKSNKLAVSHNLIVLGLPSSANE FT NIMNIVNKTFACIGVDVASVSFTASRLYSEPKNSNIVVPIRVIFDEVTMKD FT YVLDKKREFGQLKSNVINRTLLLNGNATNIAIREEMTPLGLELLKEMRDNQ FT KSLNIKFVWPGRGGVVLVKKTDDSPTEKIANRDDLNRLIARYKNINRPCSD FT SSSXGQEQQEGKKRKVKK" FT CDS 1488..4376 FT /product="Crack-5_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MYNEIKNFSHDNIEDFNASYSLNENKFLKILQWNVRA FT INDLNKFDDILQTLDSSKTSVDIVVLGETWLKNDFCALYEIPNYNSFFSCR FT DQSNGGLAVFVHKNSTSKLLKNVCIDGFHHIQIELIIKGHFYHIHGVYRPP FT NFDYGRFADYLENILSCTPVNYSCIIVGDVNIPINRALHNISDRYLRLLES FT YNYICSNTVPTRPLSGNILDHVLCKVDDSCCLRNDTISHYLSDHSPIISSF FT KLPIDKEKIKLSKTIVNHRKLNNAFDNFINNFDHVTDVSNSLLLITSTYNS FT LLEKNTRTVEKSITVKGNCPWMTFDLWTLCKIKNNYIKRVKRNPTNQHLKD FT MLNHISQKVDDTKTKCKKLYYEKLLNVTNHSKLWSNIKVVFGVGKKNNKIC FT LVNNGTRTNNDAEVCEVFNNFFSNIGSQLAANIPRNLNSNPINNLSRIDHT FT IVLWPSNENEVITLIKSLKNKKSCGPDNFPVNILKRNCGIFAQILSQCFNL FT MLVTGEYPECLKIAKVIPVFKAGDPDDCNNYRPISTLSVFNKIFEKLLVNR FT LVKYLSKRKILYKFQYGFRQGSSTQTAILELVDDIIKEVDSKKSVGALFLD FT LKKAFDTLDHSILLKKLEMYGIRGLANDLIKSYLSNRKQFVAINDARSSNQ FT TIGVGVPQGSNIGPLLFLLYINDLGKLQLCGTPRLFADDTALFYPNKNINT FT IIESIEADLGVLKNFFDANLLSLNLSKTKYMIFHSIRKVITEHEHPRLGNQ FT EIEKVNCFKYLGIHLDPTLSWAYQVNHVEKKLAPLCGMLWRVRNFVPRHVL FT LKFYFAYFHSQLNHLVSTWGYASSTSLKKIRTLQNRCLKTIFYKPLLYPTL FT QLYSELSHTILPLNYLRDYQTLVFVHNLLNNPVMHHNITLPSAPHFHSTRQ FT NDHLRREQASTNLGQRRISFIGPTKYNTLPLDLKQTTSTSIFKIRLKQHLK FT HKLHELFQ" XX SQ Sequence 4716 BP; 1588 A; 816 C; 792 G; 1513 T; 7 other; gtcggaacga ttgacgtaac tgctaacaat ccctcataat tagtaattaa tacatggccc 60 ttggtctttt cattttgatg attcattatt ttaatgttgt ctgtgatgat ccttattaaa 120 taaattatat atattttttg ctcgactgct cattactcct tgcacacaca tctggcctgc 180 cttcacactt gtttatacag tcagtacatc catactccac acttgtctta gtgcggtgtg 240 tgcattgttc atcccatcag cttgacatcc cattaatgaa aattgwcttt ttggttttgt 300 ttgtatgtaa ctgtatgtgt tttgtttatt tttgcggctt tttaatctga tttcaaaatg 360 attggtggct cgatggaaac tgacgactca gttacttgct gtgtctgctc agaagaagaa 420 aatgatttaa ctgcagtcat aacgtgcatg tattgtttct caagtagtca cttgaaatgc 480 cgcggaataa caaacaaaag cgctcatcgt atgcgtgaaa aaccttattt ctgttcaacc 540 aaatgtgcgg atatttatca aaaaattaca gaaatgcaaa gaaatgacaa atcccttatt 600 tcttccataa gctcccagtt gaatgctact gtgaccaagg ttgttgagac gcaaatgaag 660 caggtaaaag ctgaggtgaa atctgttacc acagctgtgg aatcatcgca aaatttctta 720 tctgaaaaat ttgacaaaat tgtgtccgat tttaatgaaa tgaagtcgga taacacacgc 780 ctaaatcttg aagttgatct tttgaagcgt tctcattctg atcttgcagg gactgtttac 840 aaactcgaat cgagtgtaga caagtctaac aaattagctg tgtctcataa tttaattgtg 900 ctggggctgc cctcatcagc caacgaaaat attatgaata ttgtaaacaa aacattcgcg 960 tgtattggtg ttgatgttgc atctgtttcg tttacagcga gcagattgta ctctgaacct 1020 aaaaacagta atattgtggt accaattagg gtaatatttg atgaagtaac aatgaaagat 1080 tatgtactgg ataaaaaacg kgaatttgga caactaaaat ctaatgttat taacagaact 1140 ctattattga atggaaatgc gactaatatc gcgattcgtg aagaaatgac tccgctgggg 1200 ttagaattgt tgaaagaaat gcgtgataat caaaaatctt tgaacattaa atttgtttgg 1260 cccggtcgcg gaggagttgt tcttgtmaaa aaaactgatg atagtcctac tgaaaaaatt 1320 gccaatcggg atgatttgaa tcggctgatt gctcgctaca aaaacattaa tcgtccatgt 1380 tcggattcta gttcgsccgg acaggaacag caagagggta agaaacgtaa agtaaagaaa 1440 taaaacaaaa aatcttgact tagtactgat gcaaatttta tttaaaaatg tataatgaaa 1500 ttaaaaattt ttcgcacgat aacattgaag attttaatgc aagctactct ttgaatgaaa 1560 ataagttttt aaaaatacta cagtggaacg ttagagcaat caatgatctt aacaaatttg 1620 atgatatttt gcaaaccttg gacagcagta aaacttctgt tgatatagtw gtcttaggcg 1680 aaacctggct gaaaaatgat ttttgtgcac tttatgagat acccaattat aattcatttt 1740 tttcatgtag agatcaatct aatggaggct tagcagtttt tgtacataaa aattctacaa 1800 gtaaattatt gaaaaatgtt tgtatcgatg gctttcacca cattcaaatt gaactcataa 1860 taaaagggca cttttaccat attcatggag tttatcggcc acccaatttc gattatggga 1920 gatttgctga ttacctggaa aatattttaa gttgtacacc tgtaaattat tcatgtatta 1980 ttgtgggaga tgtcaatata cctatcaaca gggcactcca taacatttct gatagatatc 2040 taagattact ggaatcgtat aattatattt gctcgaatac agtccccaca agacctttaa 2100 gtggcaacat tttggatcac gtgctgtgta aagtagatga ttcatgttgt ttgcgaaatg 2160 atacaatttc tcattattta agtgaccatt caccaattat ttcctctttt aaattaccaa 2220 tcgataaaga aaaaattaaa ttgagtaaaa ctattgtaaa ccacagaaaa cttaataacg 2280 ctttcgataa ctttatcaac aattttgatc atgttacaga tgtcagtaat tccttgctgt 2340 tgatcacctc aacttataat tcactgttgg aaaaaaatac acgaactgtt gaaaaatcaa 2400 taaccgttaa gggtaactgc ccttggatga ctttcgattt atggacgctc tgcaaaataa 2460 aaaacaatta cataaaacga gtcaaacgaa acccaactaa tcaacacttg aaagacatgc 2520 ttaatcatat ttctcagaaa gtagatgaca cgaaaacgaa atgtaagaaa ttgtattatg 2580 aaaaattact gaatgttact aatcactcaa agctttggag taatattaag gttgtatttg 2640 gcgtagggaa gaaaaacaac aaaatctgct tggttaataa tgggacaaga acaaacaacg 2700 atgctgaagt ctgcgaagta ttcaacaact tcttttccaa tattggtagt caacttgcag 2760 cgaacatacc acggaacttg aattctaatc caataaacaa cttaagtcga attgatcaca 2820 caattgtact gtggccatcg aatgaaaatg aagtwattac tttaataaaa tctctcaaga 2880 acaaaaaaag ttgcgggcct gataattttc ctgtgaatat tcttaagaga aattgtggta 2940 tttttgctca aattctctca caatgtttta atttgatgtt ggtcactgga gaatatccag 3000 aatgtttgaa aattgccaaa gttataccgg ttttcaaagc tggtgatcct gacgactgta 3060 ataactatcg tccaatttcw actttatcag tttttaataa aatatttgaa aaacttcttg 3120 tcaataggtt agttaaatac ttgtcaaaaa gaaaaatatt gtataaattt cagtacggat 3180 ttaggcaggg atcaagcaca caaacagcga tacttgaatt ggttgatgat ataattaaag 3240 aagttgactc taaaaaatct gtaggcgcat tatttttgga tctcaagaaa gcgttcgata 3300 cgttagatca cagcattctt ttaaaaaaac ttgaaatgta tggcatccga ggccttgcca 3360 atgatttaat taaaagttat ttgtcaaatc gtaagcaatt tgttgcaata aatgacgccc 3420 gcagttctaa tcaaacaatt ggagttggtg tgcctcaagg cagcaacatt ggccctttgc 3480 tatttcttct ttatattaat gacttaggga aactacagtt atgcggtact cctaggttat 3540 ttgccgacga tacagctcta ttttatccaa ataaaaatat taacacaata attgaatcaa 3600 ttgaggctga tctgggagta ctcaaaaatt tctttgatgc taatcttttg tcgttaaacc 3660 ttagtaaaac caagtacatg atatttcatt caattcgtaa agtcataaca gaacacgaac 3720 atccaagact agggaatcaa gaaattgaaa aagttaattg ctttaaatat ttgggaatac 3780 acttagaccc gacactttcc tgggcatacc aagttaatca cgttgaaaag aaattggccc 3840 ctttatgtgg catgctttgg cgagtgcgaa attttgttcc tcggcatgtt ctattaaaat 3900 tttattttgc gtattttcat tcccaactga atcaccttgt gtcaacatgg ggctatgcct 3960 ccagtacttc attaaagaaa attcgtacac tccaaaatcg ctgtctcaaa acaatttttt 4020 acaaaccatt actgtaccca actttacaac tgtattccga attatctcat acaatacttc 4080 ccttgaatta cctaagggat tatcaaacat tagtatttgt gcacaatctt ctaaataatc 4140 ctgtaatgca ccataacata accttaccaa gtgcaccaca ttttcattca actagacaaa 4200 atgaccacct gcgacgtgag caagcttcaa ccaaccttgg tcaaagacgc atatcattca 4260 ttggtcctac taaatacaac acacttcccc ttgatttgaa acaaaccact agcacctcca 4320 ttttcaaaat cagattaaaa cagcatttaa aacataagtt acacgaactg ttccaataga 4380 atacacttaa caccgattac ataaaatagg ttatatttct ctttattagt cgagctattt 4440 tcatttattt tagttttgtg gatcccttaa aaggaaatct tttttccact gggatttcac 4500 cttagtatta gcagtagtta ttcttcattg cccaccaata ttagcatttc tttgtaaaac 4560 aatgttctca gcataaattt agtttgcttt cgtcctttac attaactttt gttagctgag 4620 catttaagga gaccactacc agggggctca attatgagct ttttggtgtg ggggtgtgat 4680 ggagggtcct taaaaaaaaa aaaaaaaaaa aaaaaa 4716 // ID SAT_NM repbase; DNA; INV; 349 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 14-SEP-2004 (Rel. 9.08, Last updated, Version 1) XX DE Nicrophorus marginatus satellite DNA. XX KW SAT; Satellite; Simple Repeat; SAT_NM; satellite repeat; KW tandem repeat. XX OS Nicrophorus marginatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Staphyliniformia; OC Silphidae; Nicrophorinae; Nicrophorus. XX RN [1] RA King M.L. and Cummings P.M.; RT "Satellite DNA repeat sequence variation is low in three species RT of burying beetles in the genus Nicrophorus (Coleoptera: RT Silphidae)."; RL Mol. Biol. Evol 14(11), 1088-1095 (1997). XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Nicrophorus marginatus satellite DNA - consensus sequence."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 349 BP; 125 A; 47 C; 55 G; 122 T; 0 other; ccagtagaac gggagatacg gtgattaaaa ctacgtaaaa atcgcgttta tttcgggctg 60 aaacgctcgt ttttagccga aattaaaaat taattctaac ttttgaatgc gtggagcgat 120 ttcgatgaaa ttttaaatgg ttatttacta gagcaaaccc aacattattg catgaataga 180 aatttttaaa ctcactcggt tcatgaaata ataaataaaa attgaatatt ttattgatta 240 aaaaaaataa ttatttctcg gtagataaat tttttattta gaatctgcat gcaccagctt 300 actttatttg cgctaaaaat taatatgagc ccttattttt tgaaatcgg 349 // ID Gypsy-25_DWil-I repbase; DNA; INV; 4312 BP. XX AC scaffold_181136; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_DWil_; KW Gypsy-25_DWil-LTR; Gypsy-25_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4312 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181136; Positions 1777045 1781356. XX CC Positions [1875-2378] - Reverse transcriptase CC Positions [3420-3896] - Integrase core CC 'TATA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 241..1647 FT /product="Gypsy-25_DWil-I_1p" FT /translation="MSCLNDLISGTLKTGKVDNIRALHKFIYESEGDRNNR FT KRIREFSGFDYDEANAKYKAKAEYVRRNLTNGDLVSICNVLGIKYSVDDLA FT LHIFTNLKMNNLLASQDEDELDDEVDDEIDDETDHESVATERNVATENNLR FT NIPHERASENNNDEYRRINEERDRMMTPRFAISFRDVEENIRIFDGTNTIA FT VEVWINEFNEQATLMCWNDFQKFLFAKKALKGIAKLFVLSERGINSWNSLE FT KELLSEFKSCVSSKDIHEQLMKMKRKNNECVYEYFYRMKDVASRGQIKDDS FT LIEYLIDGIDEKSENKSILYGAKNLIEFKEKLKQYKIMSEKESKKGNGKYK FT NDENSARVKNKMMKTNNKDDVVCYNCGDKGHYSKDCDDKGKGRKCYKCNEF FT GHIAKNCNKREMNGKFEPNTRSFKSKEEKTTCKVIKIDNIECKALFDTGSK FT FCIIREDFYKRLKQPELSSFDGLK" FT CDS 1788..4121 FT /product="Gypsy-25_DWil-I_2p" FT /translation="MRIILNDDSPIFARPRRLALKVNEKVEEWLAEGIIEE FT STSEYTSPVVLVKKHDDSYRLCVDFRKINKVCVRDHYPLPLIEDQLDRLQD FT ATIFSTIDFKNGFFHVAVAESSRKYTSFVTHCGQYQFKRVPFGFTNSPGVF FT QRHVKRGIALPYVDDVIILARDEEEAVKNLKEVIVTCEEYGLELNLKRCHF FT LKKRIQFLGHIVENKKIYPSPDKIEAVAKFKMPQIVKQVQSFLGLTGYFRK FT FIPNYACIAKPLTELTKNAQKFVCGPEEENAVQILKKLLTENPVLSIYNQS FT YQSEVHTDASIEGFGAVLLQRCPDNGELHPVYYMSRKTTDAQRKYSSYELE FT ILAVITALEKFRVYLLGMHFKLITDCDAFTKTLEKKSLCTRVARWVLFLQE FT YDFDVLHRSGTRMRHVDALSRYPIMQINDYGISVGIKKAQSQDEKIRAIIE FT VLKDNSKTYDDYFLKADILYKLVNDVDLIVVPNAMQKQIIGQAHNKGHFRA FT KKTKDLICREYFIPHLEEKIQKYIECCIPCNISSRKVGKQEGLLHPLQKDD FT IPLLTYHIDFLGPLESTHKDYKHILAVIDAFTKFCWLYPTKSTTSNEVITK FT LTIQSTIFGNPAFIISDRGSAFTSQDFLKYCDNEGIKHVKTTTALPRVNGQ FT VERLNAIIIPVLSKLSVDDPTKWYRHVSKVQQAINSTYTRSIGTSPFELLT FT GVKMRTNDDHKLRDLIDQETKAIFTDERASSVPRRRHTSPRSKTKTRSTTT FT YVVSKLDLIVLATWLQLRGRNLAVVSN" XX SQ Sequence 4312 BP; 1503 A; 731 C; 968 G; 1110 T; 0 other; tattttgggt gctcgtccgg gatagattcg agtaagagtg gcgctcatcc ggttacaaag 60 agttttcccc acatattttg tcgtgataga aagagtgcgg gatacaagag aacaccccac 120 atacattttt tgtgcgttcg tgcgggagaa gagtgaaaaa aaaagtgatt ttttttcttt 180 aacaatcgtg ttgtgttaag tcggtcctgc tctgctaatg gtgaaattaa aacattaact 240 atgtcgtgtc taaatgattt aataagtggt actctgaaaa cgggtaaagt agacaatatc 300 agagcattgc ataaatttat ctatgagagc gagggagata gaaataaccg caaaagaatt 360 cgagaatttt ctggtttcga ttacgatgaa gcgaacgcaa aatacaaagc aaaagcggaa 420 tatgtacgac gaaatttgac gaacggtgat cttgtttcga tttgcaatgt attgggaatc 480 aagtacagtg tcgacgacct tgcattgcat attttcacga atttgaagat gaacaatttg 540 ttggcatcac aagacgaaga cgaattagac gacgaagtag atgatgaaat agacgacgag 600 acagatcatg agtcggtagc tacagagagg aacgtagcaa cagagaataa tttgaggaac 660 ataccgcacg agagagcaag cgagaacaac aacgatgagt acagaagaat aaacgaagag 720 agagatagaa tgatgacgcc gcgttttgcc attagttttc gcgacgtgga agagaacatt 780 cgtatttttg acggcactaa cactattgca gtagaagttt ggattaacga attcaatgag 840 caagcaacgt tgatgtgttg gaacgatttc cagaagtttc tgttcgcgaa gaaggcgtta 900 aaaggtatcg cgaaactgtt tgtcttgagt gagagaggaa taaattcatg gaactctctc 960 gaaaaagaac ttttgagtga attcaaatcg tgcgtgagta gtaaagacat acatgagcaa 1020 ctgatgaaaa tgaagcgtaa aaacaatgag tgcgtttatg agtactttta cagaatgaaa 1080 gatgtagctt cacgtggaca aattaaagac gattcattaa tagaatattt aattgacggc 1140 attgatgaaa aaagtgagaa taagtcgatt ttatatggtg cgaaaaattt gattgagttt 1200 aaagagaaat taaagcagta taaaataatg agtgaaaaag aaagtaagaa aggcaatggt 1260 aaatacaaaa acgatgagaa cagtgcaaga gtgaaaaaca aaatgatgaa aactaataat 1320 aaagacgatg ttgtgtgtta caactgtggc gataaaggac attactcaaa agattgtgat 1380 gataaaggga aaggtagaaa gtgctacaaa tgtaatgaat ttggtcatat agctaaaaat 1440 tgtaataaac gagaaatgaa cggtaaattt gaaccaaaca caagaagttt taagtcgaaa 1500 gaagaaaaaa caacctgtaa agttattaaa attgacaata ttgagtgtaa agcattgttc 1560 gacaccggaa gtaaattttg tatcattcgc gaagatttct ataaacgttt gaaacaaccc 1620 gagttgagta gctttgacgg gttgaaataa aaattctaaa aattgatgtc gcgaataata 1680 aagatgaact taatatagac gaatgtgcaa gcacaagtgc taaaaaagag gtattcgatt 1740 tgatatctaa ttatcaaccg gtagcaataa agacaaccaa tgtagaaatg cgaataattt 1800 taaacgatga cagtccgata tttgcgcgac cccgtagact tgcacttaaa gtgaacgaga 1860 aagtcgaaga atggttggct gaaggcataa ttgaagagtc aacgtccgaa tacacaagtc 1920 ctgtggtact tgtgaagaag catgacgatt cctaccggct atgtgtcgat tttcgtaaaa 1980 taaacaaggt atgtgtgcgt gaccattatc cattgccgct catcgaggat cagttggatc 2040 gtttgcaaga cgctacaatt tttagtacca ttgattttaa gaacggattc tttcatgtgg 2100 cagttgcaga atctagtcga aagtatacgt cgtttgtgac ccactgtggt cagtaccaat 2160 tcaagagggt tccgtttgga tttactaatt cacctggagt attccaaaga catgtaaaga 2220 gaggcattgc gctgccatac gttgacgatg ttatcatctt ggccagagat gaagaggaag 2280 ctgttaagaa tctaaaagag gtaatcgtaa cctgcgaaga gtatggttta gagttaaacc 2340 tgaagaggtg tcatttttta aagaaaagaa tacaattttt ggggcacatt gtcgagaata 2400 agaagatata tccatctcca gataaaattg aggctgttgc aaagtttaaa atgcctcaaa 2460 tcgtaaagca agttcaaagt ttcttaggtt tgacgggtta ttttagaaag tttattccaa 2520 actatgcgtg tatagcgaag cctctgaccg aattgaccaa gaatgcacaa aaatttgtct 2580 gtggccctga agaagaaaat gcagttcaaa tcttgaagaa gttactaact gaaaatccag 2640 tcttaagtat ctacaaccag tcgtaccaat ctgaagtcca tactgatgca tccattgaag 2700 gctttggagc agttttacta caaagatgtc cagacaacgg tgaactccat cccgtatact 2760 acatgagtcg aaaaaccaca gacgcacagc gaaagtatag cagctacgaa ttagagattc 2820 tcgcagtcat aacagcgtta gaaaagttta gagtttatct tttgggcatg cattttaagc 2880 taatcacgga ttgcgacgca tttacaaaga cgctggagaa gaagagcctt tgtaccaggg 2940 tagctcgttg ggtacttttt cttcaagaat acgattttga cgttttacac cgttcaggta 3000 cccgtatgag acatgtcgac gccctcagca gatatccaat tatgcaaatt aacgactatg 3060 gaatcagcgt tggcataaag aaggcacagt cacaagatga gaaaataaga gccatcatag 3120 aagttctgaa agacaattcg aaaacttatg atgattattt tctgaaagcc gatatcctgt 3180 acaaattagt caatgacgtt gatttaattg ttgtcccaaa cgcgatgcag aagcagatta 3240 ttggtcaggc acataacaaa ggacatttca gggcaaagaa gacgaaagac ctgatttgca 3300 gagaatactt tataccgcat ctcgaggaaa aaatccaaaa atatatcgaa tgctgtattc 3360 cttgtaatat cagtagccgt aaagttggca agcaagaagg cttgttgcat ccgcttcaga 3420 aagatgatat tccactttta acttaccata ttgattttct aggtcctctc gagtctactc 3480 acaaagacta taagcatatt ctggcagtca tagatgcatt tacgaagttc tgctggttat 3540 acccaaccaa atcgacgact agcaatgaag ttattacgaa gctgacgatt caaagcacga 3600 tttttggcaa ccctgcgttt attatctcgg atcgggggtc cgcctttaca tcccaagatt 3660 ttctgaagta ctgcgacaac gagggtatca aacacgttaa gacgactaca gcacttccga 3720 gagtaaacgg ccaagtcgaa cgtttaaatg cgatcatcat tcctgttctg tcgaagttga 3780 gcgttgatga tccgacgaag tggtatcgac atgttagcaa agttcagcag gctatcaatt 3840 cgacgtacac cagaagtatt ggcacttctc catttgagtt actaactggt gtaaagatga 3900 ggaccaacga tgaccataaa ctgagagatc ttatcgacca agagacgaag gcaatattta 3960 cagacgaaag agcgagctcc gtgccaaggc gaaggcacac atctccaaga tccaagacga 4020 aaacaagaag tactacaacc tacgtcgtaa gcaagctcga tcttatagtg ttggcgacct 4080 ggttgcaatt aagaggacgc aatttggcag tggtctcaaa ctaaaactaa aatattttgg 4140 accctaccaa gttaagaagg tgaagcacaa caacgcatac gatgtcgaaa aggtaggcaa 4200 gtgcgaaggg cctaaagtga cgtccacatg tgcagagtac atgaagccct ggagcccaaa 4260 ccgagacgat gaagaagatg acccagcatt cgggtcgaat gcgtaatcag aa 4312 // ID SINE-4_CQ repbase; DNA; INV; 597 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-LTR retrotransposon from Culex quinquefasciatus - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-597 RA Jurka J.; RT "Non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 596-596 (2011). XX DR [2] (Consensus) XX CC >98% identity to consensus. Putative SINE. XX SQ Sequence 597 BP; 155 A; 140 C; 130 G; 172 T; 0 other; gaccggcccg tggcttaatg gctacggctc ccgcctcata agcggaaggt tccgggttcg 60 attcccgacc ggtctccttg aaattattcg actataattg aactttgaat atgaacaaaa 120 aacgcatgga atcaggtggg attcgaactc acacctttgg attggtagtc agatgctcta 180 gccactcggc caccgagggg ttgacacccc ttgagtgaat taatccgtag tggtgaaata 240 atgtcacaag gtcatttact atataataca acaatccctt tcccaacgat tcccctacac 300 acactacaca ccatattcta taaataactc gaagtggttg gtacggtatg tcctcacttc 360 ttcttcttcc cctgatgatt ccatgaaatg tgggttccgt tcatagaatc tgtctcttgc 420 ggttaatgca acgcagtagg ccgggccgct ttagtgagtg taatacattt cgggggttct 480 cgaaagtact gcaatcatta ataatgacaa gaaagaactt gaggcgtaat ctaaccataa 540 gttcttcccg gatgctttct tcggactggc tgcgcttgtg ttcgattaga ttagatt 597 // ID Transib-11_HM repbase; DNA; INV; 3376 BP. XX AC . XX DT 31-JAN-2008 (Rel. 13.01, Created) DT 31-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3376 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 11-11 (2008). XX DR [1] (Consensus) XX CC Transib-11_HM is a young family of autonomous Transib DNA CC transposons that were active in the hydra genome less than a few CC million years ago (copies are ~4% divergent from their consensus CC sequence). The consensus sequence was obtained based on multiple CC alignment of 10 copies; it codes for a 651-aa Transib CC transposase. Like other Transib transposons, Transib-11_HM is CC characterized by 5-bp target site duplications and short terminal CC inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1074..3026 FT /product="Transib-11_HMp" FT /note="transposase." FT /translation="MKRLDLIPNIRELGVDKLKLWKDLLNEIKNTEKIRKE FT NLNKVKLALVIFISKVKKKLKQCGRNYDTFIQKENRWLQRNLMNNKNSHFG FT SGRPHKQWSVLGERSKRKKIKELATEHHEALFLASAKSANAAGEKKKASVL FT KNNISNVLEIEKVDKTLDKTFPINMSVEDALALKMNTDLSDKQYQMIRNSA FT LVHNVHIYPTLHDIFVEKLKCYPENIHFSEISAKCTLQSMLDHTVSRITEM FT IDMPNFGQSDEIFGVLYGKIGFDGASSQSIYKQKYDTSNASIELKHEENLF FT STAFVPLLLKVEDGEIWKNENPSSCHFCRPLHLQYKKESPILSREEESDLK FT NQINTLKTFCKVFNIAGRILTFNISYRIDLTMFDNKVVSALTETKSTQSCN FT VCNAKPKEMNNIDLLKAKTENDSALGLGISALHCWIKCLEFILHLSYKIEI FT KKYKPKTIEEKLSVDLKKKEIQVLFRERLSLYVDMPKTGSGNTNDGNTARR FT AFKNSEVFSEITKVDCDIIKRLHNILITISCGYPININELDLYCTETAKRI FT VSLYNWYVMPPTVHKLLLHSSSISNKLPLPIGVYSEDALESLNKQIRNSRL FT KHTAKISRLNTMMNQMHYLLVRSDPKISSISFRKHKSYRGKPLPDEVFKLI FT VM" XX SQ Sequence 3376 BP; 1337 A; 435 C; 497 G; 1107 T; 0 other; cacagtgact caaaacaaca aaaaaatggc aataaacccc tcaccacttt tttataaatt 60 aagtgccctt tgttagataa tattaaaaga caatgatttt ttatttcaaa agtaaaaata 120 attttttcat aaagctttta ttttgtattt tataggtaaa agaagtgaaa aaatttgata 180 ttatttttaa agaaatctga aaaatgtgag gtttttattc aactaaatta gagaacttga 240 aatttaaata aaagtgtata tttattttta aaataatatt ttgactctat atatttcaaa 300 tatactaaaa agaaaaaaaa aagtctgatt ttttcgttta aatttaaaga cttctctaaa 360 aaaaagtaga aaataattgt attttttttg tgtttatata tcagctaaaa aaatgttcca 420 ccgagttcca caaatatata tatatatttt ccagttaatt aacatttcta ctttaggaat 480 gtaaaaggat tactgcggtc tcctgcggta ttggaagtta ttaaagaaaa acaaatgttg 540 atgattttat aaaaaccttg ttttccgcca ttttgctaag aactttatta tatcaaaaca 600 aatgaaattg gcgggaaact taaagttaag tttcttgctt gtagaacttg ttctaatata 660 gtttttaaat cataaatatt aaatatagtt ttaaaaaacg tgcttttgtc attatatact 720 atttaaaagt ttgttaaaat atttattgac aattttaaag aacgatttat tattcgataa 780 gtacaacatg tttgaaaaag cttaattgaa caactttggt atttcccgtt tcaagaaata 840 ccaaattgtt acatcctgta ataacacata ttacttagtt aactttatat tacatgtaat 900 ataaagttaa ctaagtaata tgtgttatta caggatgtaa caatgtgaaa attgttaaaa 960 gtcataacaa gataaaattt gaaacagtgt aaagtaaaaa aatatgtaat gtaatttaaa 1020 atgtgtctaa tatattagtt ttattatatt gatggttctt aaaaacgata acaatgaaaa 1080 gattagatct tatcccaaat atacgtgaat taggagttga taaattaaaa ttatggaagg 1140 atttacttaa tgaaataaaa aatactgaaa aaattcgcaa agaaaactta aataaagtta 1200 agcttgcttt agttatattt atttcaaagg tgaagaaaaa actaaaacaa tgtggacgaa 1260 actatgatac atttatacaa aaagaaaatc gttggcttca aagaaatctt atgaacaata 1320 aaaatagcca ttttggttct ggaaggcccc ataaacaatg gagtgttctt ggagagagaa 1380 gcaaaagaaa aaaaataaaa gaactagcca ctgaacatca tgaagctcta tttttagctt 1440 ctgcaaaaag tgcaaatgct gctggagaaa aaaaaaaagc ctctgtttta aaaaacaata 1500 tatctaatgt tttagagata gaaaaagttg ataaaacttt ggataaaacc tttcccatta 1560 acatgagtgt agaggatgct cttgcattaa aaatgaatac tgatctcagc gataaacaat 1620 atcaaatgat cagaaactca gctcttgtac acaacgtaca tatttatcct accttgcacg 1680 atattttcgt ggaaaaattg aaatgctatc ctgagaatat acatttttct gaaatttctg 1740 caaagtgtac tcttcaaagc atgttagatc acactgtgag tagaataaca gaaatgattg 1800 atatgcctaa ttttggtcaa agtgatgaaa tatttggagt tttatatgga aagattggtt 1860 ttgatggagc atcaagccaa agtatttata aacagaaata cgatacaagc aacgcaagta 1920 tagaacttaa gcatgaagaa aatcttttca gtacagcttt tgtgccttta ttgctcaaag 1980 tagaagatgg cgaaatttgg aaaaatgaaa atccttcaag ctgtcacttt tgtagacctc 2040 ttcatctaca atataaaaaa gagagtccaa ttttatccag agaagaagaa tcagatctaa 2100 aaaatcaaat aaatacctta aagacgtttt gcaaagtttt taatattgca ggaagaattt 2160 tgacatttaa tatcagctat aggattgatc ttacaatgtt tgataataaa gtagttagcg 2220 cacttacaga gacaaaatcc actcaatcct gcaatgtgtg caatgctaaa cccaaggaaa 2280 tgaacaatat tgatttatta aaagcaaaaa ctgagaatga ctcggccttg gggctcggaa 2340 tttcagcttt gcattgctgg ataaaatgct tggagttcat tcttcatttg agttataaaa 2400 ttgaaattaa aaagtataag ccaaaaacaa ttgaggaaaa actatctgta gacttaaaaa 2460 aaaaagagat tcaggtactc ttcagagaac ggttaagcct atacgttgat atgcctaaaa 2520 ctgggtctgg aaacacgaat gacggcaaca ctgccagaag agcatttaaa aatagtgaag 2580 ttttttctga aattactaaa gttgattgtg atataattaa aagattgcat aatattctca 2640 taaccatatc ttgtggatat cctattaata tcaatgaatt agatttatac tgcactgaaa 2700 cagctaaacg tatagtgagt ttatacaact ggtatgttat gcctcctact gtacataaac 2760 tgctactgca tagttcttct atatcaaata aattaccttt gccaatcggg gtatattcgg 2820 aagatgcact agagagttta aataaacaga taagaaattc tagacttaag cacacggcaa 2880 aaatatcaag gttaaacaca atgatgaatc agatgcacta ccttctagta agatcagatc 2940 caaaaatatc aagcatatct tttagaaaac ataagagcta tcgaggaaaa ccgttaccag 3000 atgaagtttt taaattaatt gtaatgtaaa taataaattt aatgtaaaaa aatacagtat 3060 tttttctcta tgtatcaaca tcctttaaaa accattttta taaaaatatc aagtagagta 3120 atatatcaaa attatgtgtt tgtcttgaag attctaaaaa ctaaaaaaaa aaatttttgt 3180 acgcgtatag tcacctttaa cacgcgtaag cgcgcatttt acgcgtcgaa gctgcgcgtt 3240 aaaaaaaaaa aattattaac aaatttgaaa tctttgactt aaaaattatt attttaatat 3300 ataacattat ttatttacga tttatgtcaa tttttattta atttttgccg ttttttgatt 3360 gtcctgagtc actgtg 3376 // ID DNA8-90_AP repbase; DNA; INV; 821 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-90_AP. XX NM DNA8-90_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-821 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2026-2026 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 821 BP; 237 A; 118 C; 147 G; 315 T; 4 other; gggctcggat gttgttgcat ttgcaacttt ttttctattg ttcggattcg ttgctaagta 60 ggttntaaat ncgtttcana caagtacgaa ttaanatcta aaaattttaa gtgcaacttt 120 ttgcgatttt tgaccttttt caattttatt gcaattttgt cgattttcgt cattttcaat 180 acaattttat gtattttttt gtgttttcac ttttttttgt tagattatac aggaataatc 240 tatggttatg tcgggctttg ttggttgtga ctcgtgacgg cttatttgac cgtcgtcgtc 300 ggcggtaact ggaaaaacca tttaattagt agagcgcggt cattacgcta taatatgact 360 acgggaagta tttttaggct tgcagtcaat aacaagataa gaataacgac gacttgattg 420 ttatacaata ataaaatgtt gggagaaatg catgtttcaa tttataatag ttcagtatcg 480 cactgtattt cgttaacact gcatttcgtt aacactgcac agtgcacact attttgtatt 540 tttttttctt atttgttctt caaaatgccg aaaagtaagc catcgatgaa gaacgtatta 600 aacaaatatg tcgatgagtt tggggaaaac atattttcga gtgacggttc agttttattt 660 tgcaagttat gtgaaactcg agtatctgct gaaagaagat acatagttac acaacaaaaa 720 aatatttttt tattgcaatt tttaagtgca attttgggtt tttttcacag caataatgca 780 atcaatttta taacattttt agtgcaacaa catccgagcc c 821 // ID CR1-10_BF repbase; DNA; INV; 3383 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-10_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-10_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3383 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3383 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1581-1581 (2009). XX DR [2] (Consensus) XX SQ Sequence 3383 BP; 1031 A; 940 C; 652 G; 760 T; 0 other; atgggaattg acttgttctt gtaccgactt acaattggat tgtttttatc aagatacaaa 60 caatgtagca agatggcggg taagtcaaac ccaagctcat gtaaactgtc tgtagtctta 120 ctttgcataa caatgatggc tgccctgact gtgttcacat gtttatgtag catcggcaca 180 gacaatgtat ctgctgaatt cacacaatgg aaaccagcta tagacctaac acaacactat 240 gatgtacacc caaatcctgg gccagactta tctgaactgt tacttagcac agactttcgc 300 tctaagggct tactcactat tgctcacctc aatgctagag gcctagtacc gaagatagac 360 gagttgagat gtctagcatc aacaatgtac attagtatca tatgtgtaac tgaaacctgg 420 ctaagcagtg acatttcaga tagtgacatg gaactcgaag gctataattt gtaccgcaga 480 gataggaaca ggcacggagg tggtgtatta atctatgtta atgaccagtt gtatgctaga 540 aggcgtagtg acttagaaaa cgatggcacc gaatccatat ggtgtgaagt aaccaagtcc 600 aacgtaacta tattaatcaa ttgtacttac cgaccaccgt catcagacga tacatttttc 660 gatatatatg aatcgcaagt acagaaagca acgagcaacc cacagactgt aaatctagta 720 ctaggagact tcaactccaa gaacagggac tggttacata gcaatgcaac tgataacacc 780 gggcgcctat tggacaatat cttcctaaac cacggacttg aacagatgct ccatgagcca 840 acccgcggtt caaacctgtt agacttgatc gctatgtcac acccatcaat gtgctatcaa 900 agtggtaccc tggcccctct aggagactcc gaccacttga caaccataac aaccctcaac 960 ctccaaacca cccggctctc tggaaagaag ctcgtgtggc tctacaagaa ggcaaacctt 1020 gatgccctgt gtaacgacct cagccaagcc ccctgggaca ccaacttcat ctttgacact 1080 atggacgata tctggaactc ttggtacatc atgtttatgg acatatgccg acagcacatt 1140 ccacacaaac taatctctgt caacagaaca gccaaaccat ggatccatca aaacaaagag 1200 atcaaagcgg ccatccgacg aaagcaccgg ctccactcca gagccaaaag caaaaacact 1260 cctgaagcct ggtctgccta caggaaacaa cgaaacctgg tcaccaccct caccagaaac 1320 gcagaggcgg catacatcca agaagtgacg gaagatgtag aagctggaaa cacaaggcgc 1380 ttcttctcct ttgccaaatc tgctttagga aaggcgtcaa ccggcatccc agcactcaaa 1440 gatgggacag atattattga aagacccgaa gacaaggcaa atgccctcaa caactttttc 1500 atacaacaga ctgacctaaa tgacaggaac gactcccctc ccaccttcca accaacgaca 1560 atccctgaaa caacactaag caatcttcaa ctgactgtag acgaagtgcg acagcaactg 1620 aaaacgctaa aggtgggcaa gtcctgcggt ccagacaaca tcactccaaa ccttctccga 1680 caggtagctg acaccatatg tgcacccctc acccggctat tcaacacatc actcagcctt 1740 ggtcaagttc cttctggttg gaaagaggcc aatgtaaccc ccatccacaa atccggcagt 1800 cgtcatctca ccaacaacta caggccgatt tccctgctca acatcgttcc gaaagtgcta 1860 gagtcactag tcaacaaaca tctaattaaa caaatcaatc cagtattgtc ccaccaccag 1920 agcggattca gagcccacga caacaccaca ctccaactag cccggcttgt agaggaatgg 1980 acccgagcta tggacagagg ggaagttgtt ggttgcgtgt ttctggacct gcgtaaagca 2040 ttcgataaag tgtggcacca agggctcttg accaaaatcc gagcacatgg aatacgtgga 2100 tccatgctga actggttcca cagctactta tcgaacagac gtcaaagagt agtcatccag 2160 ggcggaacat ccgaatggaa atctcctctc gcgggtgtcc cacaaggttc cgtactaggt 2220 cccaccctct tcatcctcta catcaacgat ctacccacct gctgcacaca gtctgacgca 2280 aacctgttcg ccgatgacac atcactctca accagcaacc gatccgtcca acatgtagtc 2340 aactcactca acacggacct acggtccgta tccaactggc taacaaactg gaagctagaa 2400 gccaatgagg acaaatgcaa ggtgatgttc atcacaaccc gcactctccc tagaccccct 2460 gctccagtca tactaggcgg ttgtacacta caagttgtta ctagctacaa gcatcttggt 2520 gtcaccctaa caaacacact ttcctggtct aaacacattg aaacaacctc cactaagtca 2580 agaagatctg ctggcctact gtgtgcactg agaaagaagg taccaaagga tatcctgctc 2640 agactgtaca aaaccatcac cagacctggt ctggagtacg ccgacgtcgt ttgggctgga 2700 ctcaccaagc gcgacgaaac aaaactggaa tcaattcaat accaaactgc cagactgatc 2760 agcggacaac atggactccc gtacccgtca taccaatctc tgtacaccca gctatccctt 2820 ccatcactgc agttccgacg acaattccac accgctgtca ccctgtacaa gctcctcaac 2880 ggtcactgcc ctccacacct tcaaaccctg atacccagaa cccgtgcgtc tgctactgag 2940 tcacgttact ccctcaggaa cagcgaccac ctgacatctg aactcactaa gtccaccaga 3000 ggccagaaaa cattcatcta cagagcaaca gcactctgga acactctccc tactgctact 3060 cgcacagcca cctccactgc ttctttcaaa agaaaactgt gggaatgcct aggcaaccct 3120 tacgcatggt aaattagtca tacacgtgta tatacagacc ctgacctctc attgcaaata 3180 ttgttgtcga tgatatgatg ttatatactc cattttagat gatatgtaag ttatcctaca 3240 ccatgtacat acaaatgttc tttttttaat ttgtattgtt tatgtgtttg tcagcagggc 3300 tagccctttg taatagccat aggctagttg ggcagccctg gctgtgtgtt cggtgcagcc 3360 aaaccaataa ataaataaat aaa 3383 // ID Gypsy4-I_Dmoj repbase; DNA; INV; 5106 BP. XX AC scaffold_6541; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_Dmoj; KW Gypsy4-LTR_Dmoj; Gypsy4-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-5106 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1044-1044 (2009). XX DR Genome; scaffold_6541; Positions 1443204 1448309. XX CC Positions [2067-2489] - Reverse transcriptase CC Positions [3537-3755] - Integrase core CC 'AGTT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1380..3929 FT /product="Gypsy4-I_Dmoj_1p" FT /translation="MKVVVALAEKVVQMTVLIMPTMLDHIILGMDFLCAVG FT ATLRCGDTELTMRMEETSKESSRVENEGSLARRSSYANVNPEAPSWTPPSK FT KRRARADAESQRAPDICARADRKGDEQLERKLGEHPTMRDATDPEEERDAR FT PALGAVVWPEDLDPSLKEFLETELALFEKLEGVSHIDKPLKQRYYPKNPAM FT QKIINEQVDELFRVGAIEPSRSPHSAHIVLVKKKTGEWRMCVDYWQLNAHS FT IPDAYPVPRINHILKRLRQARFISTLDLKSGYWQIPMAADSREYTAFTVPG FT RGLFQWRVMPFGLHSAGATFQRAPDTVIGPEMEPHAFAYLDDIVVIGASKE FT EHTANLREVFRRLRAANLKVNRKKCSFFREKLAYLGHVISGEGICTDPAKV FT EAIRDFPTPKNLKELRQCLGMASWYRRFVPNFASLVQRMTRLLKKGQKWVW FT TEEQEDALQKLKGSLTTAPVLACPDFSEKFVLQTDASDCELGAVLTQEIEG FT QERVIAYASRRLTKTEVNYSAIVWAIRKMRCYLEGYRFDVITDHLALKWLN FT SIESPTGRIARWALELQQFQLDVRYCRGSLNVVADALSRQPLENCQQAVEE FT CLPCKWIAKMRESIANDPEKFRDYVEENGQIYRNLGYRTDDEDYIPWKLCV FT SEGQRSRVLHECHDAPTAGHQGVRKTAARLAQRYYWPGMFRNAAKYVKRCE FT TCQKFKCVQQKPAGQKLTRQVAEPMAVLCADFVGPLPRSKRGNTMLLVFHD FT AFAKWVELVPLKKATTALLQLAFRERILSRFGVPRTFVCDNSIYKQKLQGV FT HRILGGDAPIYSALFAAGESDGDDKSHGKDDGSTIRRRPSKLVGRAAA" XX SQ Sequence 5106 BP; 1549 A; 1164 C; 1428 G; 965 T; 0 other; agtggcgccc gagcagggac caggggctga ggagagtgcg gcgagctagc ggcataaaga 60 accgcaaact tgcccaaatc caggtaatcc aggtatcgag catacccgcg tgtacccgac 120 ctgctcaacg aaataacgaa agaatagcgc tagcggcggc aaataacggc gtgagcgtaa 180 cggcgctaaa gtgcattgca ccgaaggcgc gtcaaaagca tagctaaact tatgtgtttg 240 tgtatacata tattcatata ttcatatata tatatataca tatttatacg aacatatata 300 cagaagagaa aattaatgta aaagggaaaa caaattttac aaatgttaat gaagtgtgcg 360 gcctcgtgag cggatatacc tcacggtacg ttgcaacagg gtaagctagc ggttctcaag 420 tcataaaagt gcggcgcaca ttgggtcgga agcaagagaa aacttagaca aaataagttg 480 cagtaaaact gcataaaacg aggaaatata aaggagaggt agaaagaaag aaaatatata 540 tatatatata agtggtaata ataataatga taataataag aatagtgatg agaataataa 600 tggaaataat aatgacagta atagtgaagt gaactgtaca ttcttattct tggtgcgcgc 660 ggactcgtga gcggatatac cccacggctc aacacgacag aatagattag cggtcactgg 720 gccataaaaa cggagcgcac agtgggccaa aggcgaaata aaatagaatg aagtaaaata 780 aaaaataaaa aataaaataa aataaattaa aatacaataa aataaaatat aataaaataa 840 aatgaaataa tatttataat aaaacaaaaa aaaaaaaaaa aaaaaaatgt gggaatttgg 900 tttattttat tcttggtgcg cgcggactcg tgagcggata taccccacgg ttcaacacga 960 cggaatagat tagcggtcac tgggccataa aagtggagcg cacagtgggc caaaggggag 1020 ataaaataaa ataaaataaa acaaaacaaa acaaaataaa ataaaataaa gtaaagtaaa 1080 atataataaa ataaaataaa aatttacagt ggggaatttt gttcattttt aattcttggt 1140 gcgcgcgggc tcgtgagcgg atataccccc acagctcagc ataataaatt cgaggtcatg 1200 gggccgtaaa gagaacagtg ggacaagggt gaaataatag cgaataaaat aaaattaaat 1260 taaattaaat taaaataaaa ttaaattaaa ataaaataaa accaaattga attaaatcgc 1320 aggagatacg atctcgaata aggttggccg acgggtcgtc gctggacgta acgcgaacaa 1380 tgaaggtagt cgtggccctc gcggagaagg tagtgcagat gacggtgctc atcatgccga 1440 ctatgttgga tcacattatt ttagggatgg atttcctgtg cgcggtgggg gcgacgctgc 1500 gctgtggcga cacggaactg acgatgagaa tggaagaaac gtcaaaagag agcagtcgcg 1560 ttgaaaacga ggggtcactc gctcgacgct cgagctacgc taatgtgaac cccgaggcgc 1620 caagctggac tccaccgtcg aagaaacgga gagctcgggc ggatgcagaa tcgcagaggg 1680 caccggacat ttgcgcaagg gccgacagga agggagacga gcagctcgaa cggaagctgg 1740 gagagcaccc cacgatgcga gacgcaacag atccggaaga agaacgcgac gcaaggccgg 1800 ccctcggagc cgtagtgtgg ccggaagatc tggatcctag cctaaaggag ttcctggaga 1860 ccgagctagc actcttcgag aagctcgaag gagtgtccca catagataaa cccctcaaac 1920 agagatacta cccgaagaac ccagcgatgc agaagataat caacgagcag gttgacgaac 1980 tgttcagggt tggagctatc gagccgtcga ggagcccgca cagcgcgcac atagtcctgg 2040 tgaagaagaa gaccggagaa tggcgcatgt gcgtggatta ctggcagctc aacgcccact 2100 cgattccgga cgcttatcct gttcccagga ttaaccacat cctgaaaaga ctgcggcaag 2160 cccggttcat atctacgctg gacctcaaaa gcgggtattg gcagatcccg atggcggcgg 2220 acagcaggga gtacacggct ttcaccgtcc cggggagagg tttgttccag tggcgggtaa 2280 tgcccttcgg actgcattcg gctggggcaa cgttccagag agcaccggat acggtgatcg 2340 gacccgagat ggagccccac gcgttcgcat acctggacga tatagtggtg atcggagcct 2400 caaaggaaga acatacggcc aatttgaggg aggttttccg gcgactgaga gcggctaacc 2460 tcaaggttaa tagaaaaaaa tgcagctttt tccgagaaaa gttggcgtat ctaggccacg 2520 tcattagtgg ggaagggatc tgcaccgacc ccgcgaaagt cgaagcgatt cgagacttcc 2580 ccacaccaaa gaacctgaaa gagctacgac aatgcttggg aatggcgtct tggtacaggc 2640 gatttgtgcc gaacttcgct tcattggtcc aacgaatgac caggctgctc aaaaaagggc 2700 agaagtgggt ctggaccgag gagcaagaag atgcgctgca aaagctgaag gggagcctaa 2760 cgacagcacc agtcctggcg tgccccgact tctcggaaaa atttgtgctg caaaccgacg 2820 ccagcgactg cgagctgggc gcagtgctaa ctcaggagat cgagggtcag gaacgagtca 2880 tagcctacgc cagccgaaga ctgactaaaa cggaggtgaa ttattcggcg atcgtctggg 2940 caatcaggaa gatgcgttgc tatctagaag gctacaggtt cgacgtaatc acagaccacc 3000 tggcgttgaa atggctgaac tcgatagaga gccccacggg caggatcgcc cgatgggcgt 3060 tagaactaca acaattccag ttagacgtcc gttattgccg gggaagccta aacgtggtgg 3120 ccgacgcact gtctcgccaa cccctggaga attgtcaaca agcggtggag gagtgcctcc 3180 cttgtaagtg gatagcgaag atgcgcgaga gtattgctaa cgacccagag aagttcaggg 3240 actacgtaga ggaaaatggc cagatatata gaaacttggg ctatcggacg gacgacgagg 3300 attacatccc ttggaagctg tgcgtctccg agggccaaag aagcagggtc ctccatgaat 3360 gccacgatgc ccccacagca ggacatcaag gggtgagaaa gacggcagct agactggcgc 3420 agagatacta ttggccgggc atgttccgaa atgccgccaa gtacgtaaag cgctgcgaga 3480 cgtgccagaa attcaagtgc gtacaacaaa agccggccgg gcagaagctg acaaggcagg 3540 tagcggagcc aatggcggtc ttgtgcgccg atttcgtggg acctttacct cgatccaagc 3600 gtgggaacac aatgttgttg gtgttccacg acgctttcgc taagtgggtg gagctagtgc 3660 cgttgaagaa ggccactacc gcgctgctgc agctagcgtt ccgggaacgt atactcagca 3720 gattcggagt acccaggaca ttcgtatgcg acaattcaat ttacaagcag aagcttcaag 3780 gcgttcatag aatccttggg ggtgacgctc caatatacag cgccttattc gccgcaggag 3840 aatccgacgg agacgacaaa tcgcacggta aagacgatgg tagcacaata cgtcgaaggc 3900 catcaaagct cgtgggacga gctgctgcct gagataacgc tggccgtcaa ctcgagtgtg 3960 gccgattcca caggcttcac tcctgcattc ctaatgctgg ggcgggagcc acgcctaccc 4020 gccgcactgt atgaagaagt cactccgggt tctgcgacga gagaaatcca gcccaaggcc 4080 aaagaagtta ggatgagaga aatcttcgac gtagtccgta acaacttaca gcgagcctcg 4140 aaggatcagg gacgatatta taatatgcgt aggcgcgaat ggaggccgaa gccaggttcg 4200 tcagtcctgt tgaggcagca cccgttgtcc aacgcggcgg atggcttcgc agcgaaattg 4260 gcaccgaagt atgacggccc gttcaggatc gttagctttg tctcccccaa catcgtacgc 4320 ctcaccaaac gcggcgagca caaaagaagg gtagctagca tctcccagct gaagcccttc 4380 catcacgacg gcgacgagag cgagaaagag cgggacgaga gccgagggac catcgacgtc 4440 tctgaacact gacggcaaca cacctcagtc ataccctttc cccacctcaa acctgttatc 4500 caacccccgt agtatctcgt gaaagttgat atactaactg gtagtttagg aggcagtatc 4560 ctaaaaagtg tgtcacttat acttgggttt tcagatacaa gaacaatgga gaacgtggtc 4620 atcataatca gcagcgagga cgagagggag gaagaggcgg ctcgcgtgga agatgtcagc 4680 gacgctgaca ccgagccatt cccgggtacg gggtaccaaa ggcgaggcga cccgagggtg 4740 ggccgggagt attacaggtc tcgtgaggag cgacccccac ccaagttcaa acacacgaac 4800 gaggccgaac tgctggccga tatggccgat atcttggacg gctgggagcc gaatttattc 4860 gagatcctca tgggggagag cccccctcca ttcgacccag cgccggtcat cccacccggc 4920 tggaagatcc ggccaccggg ggcctcagtg gacgaacaga ccatccgagc ggtacaatgc 4980 gagctaggga tgcggtagct accgcgcaga agggtgcgtt tccgtgttcg taccgagggc 5040 gccacggtca atgtgacggt tacaccctcg gggggcgtga ccctcgctct agtaaataag 5100 gggggg 5106 // ID PERERE-4 repbase; DNA; INV; 5111 BP. XX AC BN000795; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 02-JUN-2010 (Rel. 15.07, Last updated, Version 2) XX DE Schistosoma mansoni Perere-4 non-LTR retrotransposon (EST). XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Perere; KW endonuclease-reverse transcriptase; PERERE-4. XX NM PERERE-4. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-5111 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000795; Positions 1 5111. XX FH Key Location/Qualifiers FT CDS 1..1674 FT /product="PERERE-4_1p" FT /translation="PSISSCSMPTVRLVDIKSEMPTCIKSTQSTRPHALLR FT PRSNIPKTKESNSVSHVYENADRYSTDIRQLQNSLAEIYSRLNQLAPLTHI FT WGINTDKNLEQLTKAECILDFVAEEVTKRMMSSHNAVVYNLPDRIHPNKLR FT DICLKACDMLKVNCQVIRLRKKSDRLTCPLLFKFGCCNDAKQFIDLSKNIS FT SLVPYKSIKVTPDLTPCQRRIRRREAIESYDRVIVKNQLKTVCNEMIDIKR FT MDNKHDVIPQTTGPSVDLDESADLKSLNKIRSPSVTKRRDGISDPGCSLSI FT IPKCGPCNETHPTKHIHVKIGSPRDTPQEDSKEPKFISQRQRNPSSLTKKT FT GEIKSKIRNRGTRKIEANCSVGIPTTVNRFYPLISCNLPSTPRTMGSTLER FT NKPFYTHTHKNYQNTGHSTHLLKSKTVNNNSNYRFLTPHPMYIPQDHNCNY FT SQNENFPVSTHHMKTASIHNNSRNHFQLRASDPPHARTSANSYNRGYELNH FT QNCLANNSTLYSQHVPDFFGIRPLVAPLLKHIALVISNQVRNINSLPPYSS FT HQKVLLPFPPG" FT CDS 1860..4352 FT /product="PERERE-4_2p" FT /translation="MCLINARSICNKKTDLALFASIYQPSLIMISEAWTNS FT NISDLSISLDNYLLYRSDRLKGRGGGCLIYAHSSLNSLLCDMPELNKLKDS FT VWIVLKPSNSITLLFGCIYQQPNLAIGKIAVLSEVFTLASSLPFTGKLICG FT DFNMPEISWFPVKAPKRYESFIECLELGQWTQYVSSPTRHQNILDLVFTSG FT LTPNTVYIGKTFPGSDHNTVVCSLNIKNTNGRRKNVTLRRNYRKVDWNNFH FT NYLRSTDWGTYFTANNTDTLTNIFYHNIESIMNKIAPLEYWRPKFSKDLFI FT PAGTRRRLQRHCTRFHNSNDFSSLVTMTEILVQTDAISTQKAIEQEKRAVE FT LSNNSSALTTLFQNKLKRSKSRGNIFIKDKQVYEDPKLVTGLFNEHFCSSL FT TDEKPLNDSELITASACEITNVDFNVQNISKAISTLKHSYTDGPDGIPASM FT LKRGGVDMCVLLLKLFAISLSTAHYPTAWKTTHIIPKIKTGPEANVENYRP FT INITSVVSRVMEKVVKAALVQHLITNNLISVSQHGFLRSRSCDTCLVDYMN FT NITLKRDNGLLVSVIFLDFKKAFDKVPHERLLAKLRSFGINNPLYSWFVSF FT LKGRKQMVNYNDCLSLPRPITSGVIQGSVLGPLLFLMYINDICNIIKSGKP FT YLYADDLKIVYSYKPEVLSESVRLIQDDLNNLTIWSEKWQLPFNLHKCGIM FT HFGKHPYEPQLYLNKSKVQTLNSVHDLGINYTGSLNFKAHASFIISKARRL FT IGFITKNFFTTDAKLTLYKICVRPSLEYCSFIFSNMNTTDKIRVEDVQRRF FT TRQLLGSDTTLDYTSRCMHLSLEPLWHRR" XX SQ Sequence 5111 BP; 1697 A; 1168 C; 827 G; 1417 T; 2 other; ccctctattt cttcgtgcag tatgccaacg gtaagactag ttgacataaa aagcgaaatg 60 cccacgtgca tcaagtcaac acagtccacc agaccgcatg ctttgcttag gcctaggtcc 120 aatataccaa aaaccaagga gtctaacagt gtatctcacg tatacgagaa tgctgatcga 180 tatagtactg acattcgtca gttacaaaac tccctagccg aaatttacag tagacttaac 240 caactagcac ccctcacaca catatggggt attaataccg ataaaaatct cgaacaactc 300 actaaagctg agtgtatttt agattttgtg gctgaagaag tgacgaaaag aatgatgtct 360 tcccacaatg cagttgttta caatctccct gatcgcatac acccaaataa gttgagggac 420 atctgcctta aagcatgcga tatgctgaag gtcaactgcc aagtcatccg ccttagaaaa 480 aagtcggata gattgacttg ccctctacta tttaagtttg gctgttgtaa cgatgctaaa 540 cagtttattg atttgagcaa aaatatcagc tcattagttc catacaaaag cattaaagtc 600 actcctgacc taacaccgtg tcaacgtcgt ataagaagac gtgaagcaat agaatcgtat 660 gacagagtta tagtaaaaaa ccagctaaag acagtatgca acgaaatgat tgatataaag 720 cgcatggata acaagcatga tgttatacca caaacgactg gtccatccgt agatctagat 780 gagtctgcag atctgaaatc tttgaataag atacggtccc catctgtcac taaaaggcgt 840 gacggcatct ctgatcccgg gtgttcgctc agtataatac caaaatgtgg gccgtgtaac 900 gaaacacacc caactaaaca tattcatgta aaaattggtt ccccaagaga tactccacaa 960 gaggattcga aagaacctaa gttcatctca cagagacaaa gaaacccttc atctctgaca 1020 aaaaagactg gggagataaa atccaaaata cgaaataggg ggacacgtaa gattgaggcc 1080 aactgttctg tgggtatccc aacaacagtg aataggttct acccgctgat atcatgtaac 1140 ttgccatcaa cacctcgcac aatgggtagt acgcttgagc gtaacaagcc tttctacacg 1200 cacactcaca aaaactatca gaacactggt cactccactc atcttcttaa gtctaaaacc 1260 gttaataata atagtaatta caggttcctc accccacacc ctatgtacat tccccaggat 1320 cacaattgta attactctca gaatgaaaac tttccagtat caacccacca catgaagacg 1380 gcctccatac ataacaacag tcgtaaccac ttccagttac gagcatcaga cccccctcat 1440 gctcgcacta gcgcaaatag ttataacaga ggctacgagc ttaaccatca aaactgtctt 1500 gcaaataaca gtacacttta cagtcaacat gtaccagatt tttttggaat aaggccccta 1560 gttgcacccc tgttgaagca tatagcccta gtcatttcaa accaagtgcg gaacataaac 1620 tcgcttcccc cgtacagttc gcatcagaaa gtactcctac cctttcctcc tggataaact 1680 ccacaagcaa cttcactagt tcatttgata catgccccag taactctagc tatcaaaata 1740 cagagctatt gaaaaacgct caatccatta cttcaaacat acatcactca cgcgaggatc 1800 atgacatctt taatcgagat actctcactc actttaactt agtaggccac ttgtttaata 1860 tgtgtcttat caacgctagg tcgatctgca ataaaaagac tgatttagct ctgtttgctt 1920 ccatatatca accatcactt ataatgatat ctgaagcttg gactaacagt aatatttccg 1980 atctaagtat atctcttgat aactacctcc tatatagaag cgatagattg aaaggacgag 2040 gcggaggatg ccttatttat gctcactcta gcttaaactc actactatgc gatatgcctg 2100 aactaaacaa actgaaggac tcggtgtgga ttgttttaaa gccatcaaat tccattacct 2160 tactgttcgg ctgtatctac cagcaaccta acttggcaat aggtaaaata gcagtattat 2220 cagaggtatt cacactagca tcatcccttc cattcacagg caagctcatt tgtggtgact 2280 tcaacatgcc tgaaatttct tggtttccag tcaaagctcc gaaacgttat gaatcattta 2340 ttgaatgtct agaactagga cagtggactc agtatgttag tagccctacc agacatcaaa 2400 atatcttgga cttagtcttt accagtggcc tcacccccaa tactgtgtat attggaaaaa 2460 cgttcccagg tagtgatcat aacactgtcg tatgtagcct taatataaaa aacacaaacg 2520 gtagacgtaa aaatgtaacc cttcgtagaa actatcgcaa ggtagattgg aataattttc 2580 acaattacct aagaagcact gactggggaa catactttac tgcaaacaat accgacacac 2640 taaccaatat attttatcac aacatcgaaa gtatcatgaa caaaatcgca cctttagagt 2700 actggagacc taagttttct aaagatctgt ttataccggc cggtacacga agacgtctgc 2760 agagacattg tactcgcttc cacaactcta atgacttctc ctctttagta acgatgacag 2820 aaatacttgt ccagacagac gcaataagta cacaaaaagc aatcgaacag gaaaaacgtg 2880 ctgtcgaact tagtaataac tcctcagcac tcaccacact ctttcaaaat aaacttaaac 2940 gttctaagtc gagaggaaac atatttatta aagataaaca agtatacgaa gatcctaaac 3000 ttgttacggg actattcaat gaacactttt gttcctcatt aactgatgaa aaaccactta 3060 atgattcaga acttatcaca gcatctgcct gtgagattac taacgtagac ttcaatgtac 3120 agaatatctc taaggcaatt agtacactga aacactccta cactgatgga ccagatggta 3180 ttccggctag catgctaaaa cgtggaggcg ttgatatgtg tgttttatta ctcaaactat 3240 ttgcgatatc actctctaca gcgcattacc ctactgcctg gaaaactaca cacatcattc 3300 ccaaaattaa aacaggacct gaagcaaatg ttgaaaatta ccgacccata aacataactt 3360 cagtggtgtc tagagtgatg gagaaagtag ttaaagcggc gttagtccaa catctaatta 3420 ctaataatct catctcggtc tcccaacatg gatttcttag gtccagatca tgtgatacat 3480 gcctagtgga ctacatgaat aatataactt tgaaacgaga caatggactc cttgtatcag 3540 tcatattttt agactttaag aaagcttttg ataaggttcc acacgaacgg ctactagcca 3600 aactacggtc cttcggaatc aacaacccac tctattcatg gtttgtttca ttcttaaagg 3660 gacgtaaaca aatggtaaat tacaacgact gtctctcatt acctagacca ataactagtg 3720 gcgtcatcca gggaagtgta ttaggcccac ttttgtttct catgtatata aatgatatat 3780 gtaacatcat aaaatctggg aaaccatact tatatgctga tgatctcaaa atagtataca 3840 gctataagcc tgaggttcta agtgaaagtg tacgcttgat tcaagatgat ctcaataacc 3900 taactatctg gagcgaaaag tggcaattac catttaacct ccacaaatgc gggatcatgc 3960 actttggaaa gcacccctat gaaccccaac tttacttaaa taaaagtaaa gtccagacac 4020 ttaattcagt acacgatcta ggaataaatt acacaggaag cctaaacttt aaagcacacg 4080 cctcctttat tatttctaaa gccagacgtc taattggctt cataacaaaa aatttcttca 4140 cgacagatgc caagcttacc ctctacaaaa tatgtgtacg accttcctta gaatactgct 4200 cttttatctt ctctaatatg aataccactg acaaaataag agtagaagat gttcaaagac 4260 gttttacacg ccaactgcta ggaagtgaca ccactcttga ttacacaagt agatgtatgc 4320 acctgtcttt agaaccttta tggcaccgta ggtaaaggaa caatctgata tttctctacg 4380 aatcattaaa tggactgtca ttcctaacct cttgtccaac tatgatcagt gacccagcct 4440 atagactaag aaacaatgca tatacactat ttactgaaaa acaccaaaaa caaatgcgtt 4500 gtaggttttt tacagttcgg tatagtttat tgtggaacag gttgccaata cctatccgta 4560 accgtgacac ctttaccaaa tttaaaaggc taataaccct gttactaaac tctaaagaac 4620 ttcaccaact attccttgaa ctcccgccta ccaatgaggc actttattat ggaccgccca 4680 atatctagac ggaacaagaa tataacgaac tccattgtcg ttagtatgaa tatagcaaaa 4740 ctaaagaaaa catgctttat ccttccccat tatttatgga ttattaacca tggccttccg 4800 ctcctaaccc tcattttcag tttccgaatc tctctaatgg ataaatctaa attattcttt 4860 ctaagtgtca tgtcctttta tttttctcat tacaagtaga tagataactc actaatctta 4920 tggatgctaa ccaactaagg ccgatcaaaa gaagctcata cgtcctaaat tcttgtgagt 4980 tgttcaatgt ttgatttctt cttaccctga aaattttaca atatttacta tcatatcttc 5040 tggnttgtta ttagccctta ccatctctta accacatant atctgacttg tctcttctaa 5100 ccctttacat t 5111 // ID BEL-53_CQ-I repbase; DNA; INV; 2350 BP. XX AC AAWU01015731; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-53_CQ_; KW BEL-53_CQ-LTR; BEL-53_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2350 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 259-259 (2011). XX DR GenBank; AAWU01015731; Positions 17592 19941. XX CC 'AAGGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 343..2283 FT /product="BEL-53_CQ-I_1p" FT /translation="MMLRTPKGGRKKTPKKKTPAKSDKNGDAQKSDTQEVK FT VVPVTPISKQLEEQRRKEEARKMEEQQKVLMRQRDAVAEKLDRVKAAIKSS FT TEQLNQNLKNLHFLKLQLKTVEACYGEFNAFQNQIYGLALTPEQEKLHRAC FT YLKFETLHNILTVQLNELIERLSKPSVALVPAAPACAVVPQYLPPLSVPLP FT KFDGTYENWFSFKCMFRSVMDRYQGEAPSMKLYHLRNSLVGRVEGVIDQDI FT INNNDYEAAWALLVETYEDKRVIIDKHIEALFNLPKITNDDAVAFRKLIDT FT CVKHIEALKNLQLPVDGLGEQMLMNLLAARMDKETRLAWESQWKAGELPTY FT AATIAYLKEKCRVLEVVEQCSVTVEKVTPHRSVAMLIAGTQKCSVCNRQHG FT LNGCEQFKGKSVNEKYSHLRKCGLCFNCLRRGHRVAACTSTNTCKICGNRH FT HTMLHTGGVKKQMVVPISTAPVTSKPMDKRRTRPARKQTLLSTAVVLVDGG FT SSGPHLCRALIDSGSQNHFVTERFADKLAIKKERADYQVSGLHDFKTRISS FT LIRATVKSRVGDFSTELELLVTPSIIDDLPPESIYITSWNLPPNIKLADPG FT FNSASPIDMLLGAELFWNLIRSGRITLAETMPSLRETELGWVVGGVLKH" XX SQ Sequence 2350 BP; 603 A; 559 C; 739 G; 449 T; 0 other; ttctggtcct gcacgagccg gatgctgaag tagtggtccg ggagtgacgg ttgacggatt 60 tcgacgggaa atccgacttt gaacacggtc gacgtgtccg tgtgtgcgcg tgtgtgagac 120 gtcgaggggc gtagcaccca cgccgcgaaa aagtgcgcga tccgacagtc agtgctttgg 180 aagaccggga agtgctccgg aagatcggaa aaaggttccg aaagactgga ctagtggtcc 240 ggaagtgacg gagaaatccg attcgttcgc gtgattgacg ttagcgcgtg tgagagtgtt 300 tgtgggacgt cgaggcgtag caccacctag cgacgggcat ggatgatgtt gagaacgccc 360 aaaggtggta ggaagaagac gccaaagaag aaaacccccg cgaaaagtga caagaacggt 420 gacgctcaaa agagcgatac gcaagaagtg aaagtggttc cggttacgcc gatcagcaag 480 cagcttgagg aacagcgccg gaaggaggaa gctcggaaga tggaagagca gcagaaagtg 540 ctgatgcggc agcgggacgc ggtggcggaa aagctggacc gagtaaaagc agccatcaag 600 tcgagcaccg agcaactgaa ccagaacttg aagaacctgc acttcctgaa gttgcagttg 660 aagacggtag aggcctgcta cggtgagttc aatgcattcc agaaccagat ctacggactg 720 gcactcactc cggagcagga gaagctgcac cgggcgtgct acttaaaatt cgagacgctt 780 cacaacattt tgacggttca gctcaacgag ttgatcgaac ggctttcaaa gccaagcgtc 840 gcgctcgttc cggccgcccc cgcgtgtgca gttgtgccgc agtacttgcc gccgctcagt 900 gtgcccttgc cgaagttcga cggcacttac gagaactggt tttcgttcaa gtgcatgttc 960 aggagcgtga tggaccggta ccagggtgag gctccgtcga tgaagctgta ccatctacgg 1020 aactcgttgg tcggcagggt agaaggcgtg atcgaccaag acatcatcaa caacaacgac 1080 tacgaggcag cgtgggctct cttggtggaa acatacgagg acaagcgagt gataatcgac 1140 aagcacatcg aagctttgtt caatctgccg aagatcacaa atgacgacgc cgtcgctttc 1200 cggaaattga ttgacacgtg cgtcaagcac atcgaagcgt tgaagaatct tcaacttcct 1260 gtcgacggat tgggtgagca gatgctgatg aacctgcttg cagcacggat ggacaaggag 1320 acgcggttgg cgtgggaatc gcaatggaag gccggcgagc taccgacgta cgcggccacg 1380 atcgcgtacc taaaggagaa gtgccgagtt ctggaggtgg tggaacagtg cagtgtgacg 1440 gtggaaaaag tgacgccgca cagatcagta gcgatgctga tagctggtac gcagaagtgt 1500 tccgtatgca accggcaaca cggcttgaac gggtgcgagc agttcaaggg aaaatccgtc 1560 aacgagaagt acagccattt gcggaaatgt gggttgtgct tcaactgttt gaggagagga 1620 catcgcgtgg cagcatgtac gtctacgaac acctgcaaga tctgtggcaa tcggcatcac 1680 acaatgctgc acactggtgg agtgaagaag cagatggtcg ttccgatctc gacggcgcca 1740 gttacgtcga agccaatgga caaacgccgc actcgacctg ccaggaaaca gacccttctc 1800 tcgaccgccg tagtactcgt cgacggtgga agcagcggcc cgcacttgtg tcgagcgctg 1860 atagactcgg gatcgcaaaa ccacttcgtc acggaacggt ttgccgacaa actggcgatc 1920 aagaaggagc gcgccgacta ccaggtcagt ggtttgcacg actttaagac gaggatcagc 1980 agcctgatcc gagcgacggt taagtctcgc gttggcgatt tctcgaccga gctggagctg 2040 ttagtcactc cgagtataat tgatgacctg ccgccggaat cgatctacat cacaagctgg 2100 aacttaccac cgaacatcaa gctcgctgat ccggggttca attcggcaag cccaatcgat 2160 atgctgctgg gagcagaatt gttttggaac ctgataaggt caggtaggat cacgctggcg 2220 gagaccatgc cctcgctcag ggaaactgag ttgggatggg ttgtcggcgg tgtgttgaag 2280 cattgaagcg gcagaggagc aggaagcgag gatggaggtt tgaaatgcag tcatttcaac 2340 gtggggagga 2350 // ID BEL-48_CQ-I repbase; DNA; INV; 2812 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-48_CQ_; KW BEL-48_CQ-LTR; BEL-48_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2812 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 249-249 (2011). XX DR [2] (Consensus) XX CC 'GTACT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1249..2811 FT /product="BEL-48_CQ-I_1p" FT /translation="MELKSARSLTIKRNFVKGKISRVVDRLKAFEENHQLP FT SFAQIQVFVWNVEKYYGEFNKVCDEFKVSDADAGERFNHDIEWARVVSLIQ FT EANMRIVALNNALLSQPVHAAEKQPVLKEEANRTPLPTTEARGEVDSSGAG FT DLPDAANAERCHADDREPGAPDTKVGAHFELYSSPPIADSERNEPMCTLCN FT GCRSDIDCFPVRGDPVEQCQPLEESNPNLSSQKRLPGESHEPETQPAVHAI FT LPPEEPEVVPHESSSDDRGSAETKPGLCSPQRLPGDSREPENRQEESSKEP FT AVVPHESSAAERGSAVSDDPEAVPMPEQAQHVDVVPDVDTCQAVEGVSLLP FT AVPDDENNEEGAQCGCTEPARSSVQVVWTTNQNARKANKNIEIDQRTCTSH FT ESVTYLYSQRTELYERCCGDSIARSGADGADEPGRAVEKLHDPAIFSSTTK FT RPRFVWDPGVELNAAFAPYNRGRPPTDANRCKELLARLVAMNLEPSAINGI FT SQYERLRPAQRVGEAEAVTPQPGGR" XX SQ Sequence 2812 BP; 720 A; 731 C; 879 G; 482 T; 0 other; tttggtcctt acgatccgga tagacagttg gcctattgat ccggtaagtc cgtcggaatc 60 gccacgagtg tctcgaggag gaagaatgct cgtgtaagaa gaagttcgcg agcagtgatc 120 cggaagacgt tgattcccgg agggcgttcc acgcggcccg gaacagcgtt gggaaaaacc 180 gtgtctgtgt ggtgatccag cagatcgacg gtccggaacg cgcgtgtggt cgaaccagcg 240 acgcgtagtg cacggaagcg gtcccattgg ccgccggaag cacggtagtg agccggaagt 300 ggtcgttggc caccggcaga agtaaatgga cgttgtgaac tttgcgagga aggtgatccc 360 attggccatc tgagcaaaga acagtgaaat ggtggaagaa ggacgttcgc gccatcttgc 420 cacacaaaga aagtgaacaa agaaaaacag aagtgaccgt cgaaaagaga tgagcgtgcg 480 tcgccatctt gttggacgaa aaagacgtgt gtgtgttcgt cagtgagaaa tgggcgtgcg 540 ccgccatctt gctcaacgaa aagtgaaagc ctcggcggat gtggtcccat tggccgcccc 600 gagcagaagt gtgcgtggaa ggtggtccct ttggccaccc gagcagaaag tcagtgcttt 660 tgaagaagta atggagtgcc gcgcagctct gtgagcagag tagtgcgcag aaaagccctg 720 acaaagtggt cccattggcc acatcaggcg gaagttccga cgggtggacc cgatgtccac 780 gccgtgaaaa ggagctggaa gagtggtccc attggccaca cggtgcagtg ttgtgcgttg 840 ctctgtgagc agggcgagtg gtcccattgg ccacgaagaa gccctgcgtc tgtgtggtga 900 gccaacagag cagcgtgctg aagaaaccgg cggtagtggt cccatcggcc acgccgtacc 960 cgaagaaaag tgagcgcgag gcgcataaaa cccaaactag aaagtgacaa agagtgacct 1020 gctgcaagag cagcgaaaag tgcaagctgt gctgtgacgc agactgcaaa accccgagca 1080 gaaaagtgct ggtgagccgg cagctcgaag aagtgatcca gtgtgcccag ctggaaagaa 1140 gaagaagaac cagtgcaaca ggacgctgcc gctgtcgtgt cgttctgagc gcggccacag 1200 tggagcaaca gattcctgcg cgaatccaga gcaacagatc ccgccgagat ggagctgaag 1260 tcagcgcgtt ccttgaccat caagcgaaac ttcgtcaagg gtaagatttc gcgggtcgtg 1320 gacaggttga aggcgttcga ggagaaccac cagctgcctt cgtttgccca gatccaagtc 1380 ttcgtgtgga acgtcgagaa gtactacggc gagttcaaca aggtttgcga cgagtttaag 1440 gtaagcgacg ctgacgctgg tgaacgcttc aaccacgaca tcgagtgggc aagagtagtg 1500 agtctgatcc aggaagctaa catgcggatt gtggccctga acaacgcact gctgtctcag 1560 cccgtgcatg ccgctgagaa acaaccggtc ttgaaggaag aggccaaccg aactccgctt 1620 cccacgaccg aggctcgtgg cgaagtcgat tcttccggag ccggcgatct tccggatgct 1680 gcgaacgccg agagatgcca tgcagatgat cgagagccgg gtgcacctga caccaaggta 1740 ggtgcccatt ttgaactcta ttcgagtccg cccattgcgg acagcgagcg taacgagccg 1800 atgtgcacgt tgtgcaacgg ttgccgctcg gacatcgact gcttcccggt ccgtggagat 1860 ccggtcgagc agtgtcaacc cctcgaagag tcgaacccca acctgagcag ccagaagagg 1920 ttgccaggag agtctcacga gcctgagact caacccgcag tgcacgcgat actgccaccc 1980 gaggaacctg aagtggttcc gcacgagtcc tcgtcagatg accgaggtag tgccgagaca 2040 aaacccggcc tgtgcagccc ccagcggttg ccaggagatt ctcgcgaacc cgagaatcga 2100 caagaagagt cgtccaagga accggcagtg gttccgcatg agtcctcggc agcagaacga 2160 ggtagtgccg tgtcggacga cccagaagct gtccccatgc cggagcaagc tcagcacgtc 2220 gacgttgtgc cagacgttga cacgtgccaa gcagtggagg gagtttcact gctgccagcc 2280 gtacctgacg atgagaacaa cgaggaaggt gcacaatgcg gatgcaccga accagcaaga 2340 tctagcgttc aagtagtttg gacgacgaac cagaacgcga ggaaggccaa caagaacatc 2400 gaaatcgacc agcgaacgtg tacgagccac gagagtgtga cgtacctgta cagccaacgg 2460 accgagctgt acgaaagatg ctgcggtgac agcatcgcac ggtcaggagc cgacggtgca 2520 gacgaacccg gaagagctgt ggagaaattg cacgatcctg caatcttcag cagcaccacg 2580 aagagacccc gatttgtctg ggacccaggc gtggagttga acgcagcgtt tgctccatac 2640 aaccgaggaa gaccaccaac agatgccaac aggtgcaagg agttgcttgc tcgtctggtt 2700 gcaatgaacc tggaaccttc agcgatcaac ggaatctcgc agtacgaacg actgagaccg 2760 gcacagcgtg ttggtgaagc tgaagccgtg acgcctcaac ccggggggag aa 2812 // ID Jockey-1_DYa repbase; DNA; INV; 4785 BP. XX AC . XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version 2) XX DE Jockey-type non-LTR retrotransposon: consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-1_DYa. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4785 RA Jurka J.; RT "LINE-type retrotransposon families from fruit fly."; RL Repbase Reports 9(5), 966-966 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 396..1988 FT /product="Jockey-1_DYa_1p" FT /translation="MSSDQKNERRSSVNPRNACYSYFSTRDIDQNQKSTKS FT NSTLLTVDSKRASSCSPSLLTPTPASWSYQCQTPSPPASLNEAPTTSSAKI FT SAVNSPPIIKPTTNTASSVPTAAQQNKMATKTVSLDKQQAVPAIQTGMDRY FT IQIKRKLSPPNTVGNKPKINRTNKSTDQTRSSNENRFSILAEADNNQPELG FT QETQRKPKPPPIYIREKSSSALVNKIVALVGDENFHVVPLIKGNIHETKVQ FT TKSEEHFRAVSKYLEDTKKNFYTYQLKSSKGLQVVLKGIEPDVTPSEIQEA FT LKARGFCAKNVSNIVNRNKKPQPLFKVELEPDSKALKKNEVHPIYKLQLLL FT HRRITVEEPHKRNGPVQCTNCQEYGHTKTYCTLRTVCVVCGDFHNSVNCPA FT NKEDPQMKKCGNCGGNHTANYRGCAVYKELKSRMRRATATHQQHFAKAARS FT SFGQLDTTKGISYAEALRTGMENPPQPHSENAQQVPVQPQSKLESMMFTMQ FT QSMMELMSFMKTTMQTLVQNQNMVIQLLVAQQSK*" FT CDS 1992..4649 FT /product="Jockey-1_DYa_2p" FT /translation="MTNLRISMWNANGISRHKLEITQFLNVNHIDAMLLVE FT THLTGRYNFHIRGYTFYRTDHPDGKAHGGTGILIRERIKHHFHQRFATNYL FT QATSIKVISGNGNLCIAAVYCPPRFNISEGQFMDFYNTLGDRFIAAGDYNA FT KHTHWGSRLVTPKGRQLYNALIKVSNKLDYVSPGSPTYWPADPKKIPDLID FT FAVTKNIPRNLISAIALPDLSSDHSPLLISLLQSPEIADHPHMLTSHNTNW FT MKYRKYVSSHIELTPQLNIEADIDCSTAALEEVLVTAARISTPKRKDGEFK FT KFKTNRQIEQLVLEKRRLRRAWQTSRSPCSKQRLTEATRNLNRALKQEAEN FT EQLKYIGKLSPTSTKHPLWRAHPNLSSPIQTVTPIRNSSGCWARSDKDRAE FT SFALHLRGVFQPNPAANDYVPPQIHLETESTPTTFQPNEITKVIRELKPKK FT APGGDLITTKMIMELPYCAIKVICKLFNGIISLGYYPRKWKKSIIIMIPKP FT GKDHTIPSSYRPISLLSCLSKLFEKCLLTRIIPYMGAHNIIPAHQFGFREK FT HGTIEQVNRITSEIRTAFEHREYCSAIFLDVSQAFDRVWLKGLMHKIKTHL FT PGYTHKLLESYLYNRAFAVRCNTTISDSYTIEAGVPQGSALGPTLFVLYTA FT DIPTSDRLTTSTFADDTAILSRSRCPVRATTQLANHLVVVEKWLSDWRIKI FT NEQKCKHITFTLNRQTCPPLCMNSTQIPQVNEVTYLGIHLDRRLTWRRHIK FT CKKLHLKLKASSLHWIINSRSPLCLDYKVLLYNSTLKPIWTYGSQLWGNAS FT RTNIDIIQRAQSKILRTITGAPWYIRNQNIHRDLGILAVKDEIDKQKASYN FT EKLSVHPNPLARVLTRVSSRTRLQRTDLPTQP*" XX SQ Sequence 4785 BP; 1588 A; 1264 C; 946 G; 987 T; 0 other; aaaagtctac acattcgtgg aagcggacgt ctttcgatag cgctgcgcga aaatctcgtg 60 ttaacgtggt tgtgctgaac agtgttagcg gcaaatctgc aaaaacaaac ggtggacttg 120 tgccatagtt ctgcgcgtaa cctaggcgtg ctgaatgacc gcgtgttgca ggtggttagc 180 aaaatatata agtttgcaat tcttgcagct gataaaacag ctgacagcgc tgccaactta 240 tctcggtggg cgatcatgag cgagtggcgt tgccacatca accgtaagac cagtgctgag 300 gtgcgaacgt tgggcgcgta atttcaaaat taaattcaaa atttgattta atttaattta 360 aacaaataag tgccacgcat aacaagctcg aaaaaatgag cagcgatcaa aagaatgagc 420 gtagaagctc cgtcaaccca agaaatgctt gctactccta tttctcaact agagatatag 480 atcaaaatca gaagtcgaca aagagcaact cgactctact gaccgtcgac agcaagagag 540 ccagctcttg ctcaccctct ctgctcaccc caactcccgc atcatggagt taccaatgcc 600 aaaccccatc accgccggct tcgctgaatg aagctccaac aacaagcagt gctaaaattt 660 ctgcagtaaa ctcaccacca ataattaaac caactaccaa tactgcgagt tcggtcccaa 720 cggccgcaca acaaaataaa atggcaacaa aaacagttag cctcgataag caacaagcgg 780 tcccggctat acagaccggt atggaccgct acatccagat taagcgcaag cttagtccac 840 ccaacacggt tggcaataaa ccaaagatca atcgaaccaa caaaagcacc gatcaaacga 900 ggagctccaa tgaaaaccga ttctccatct tagcagaggc ggacaacaac caaccagagt 960 tggggcagga gacccaaagg aaacctaagc ccccacctat ttacataagg gaaaaaagtt 1020 caagcgccct ggtcaacaaa attgttgcgc tggtcgggga cgagaatttc catgttgttc 1080 cgcttattaa agggaacata cacgaaacaa aggttcaaac gaagtccgaa gaacacttcc 1140 gagctgtgtc taagtacctg gaagacacaa aaaagaactt ttacacctat caactaaaaa 1200 gcagcaaggg actgcaagtt gtgctaaaag gtatagaacc tgacgtcacc ccctccgaaa 1260 ttcaggaagc ccttaaggcc aggggcttct gtgccaaaaa tgttagcaat attgttaaca 1320 gaaacaaaaa accgcaacca cttttcaagg tagagctcga accagatagc aaagccttaa 1380 agaaaaacga agtgcacccg atctacaagc tgcagctctt actgcatcga agaatcaccg 1440 tggaagaacc ccacaaacgc aacggccctg ttcaatgcac aaactgccaa gagtatggac 1500 acaccaagac atactgcaca cttcggacgg tctgcgtagt ctgcggagac ttccacaact 1560 ccgtaaactg ccccgcaaac aaagaagacc cccaaatgaa aaaatgtggc aactgtggag 1620 gaaaccatac ggcaaactac agaggctgtg cggtctacaa ggagctgaag agtcgcatgc 1680 gacgagcgac agctacgcac caacaacatt tcgccaaggc agccagatca tcttttgggc 1740 agctagatac aaccaaggga atctcctacg ccgaagcact aagaacaggc atggaaaatc 1800 cgccccagcc tcactcagaa aatgctcagc aggttccagt gcagccgcaa agcaaattgg 1860 aatctatgat gttcaccatg caacaaagta tgatggaact tatgtcattc atgaaaacaa 1920 ccatgcaaac ccttgttcag aaccagaaca tggtaattca gttgcttgta gcacaacagt 1980 ccaaataata aatgactaac ctacgtatat ccatgtggaa tgccaacggc atttcacggc 2040 ataaacttga aataactcaa ttcctaaacg taaatcacat cgacgccatg ctgctggttg 2100 aaacgcacct cacagggaga tataacttcc atataagagg ttataccttt taccgcacag 2160 atcatccgga tggcaaggcc cacggcggaa cgggcatctt aatcagagaa cgcatcaaac 2220 accactttca tcaaaggttt gcaacaaatt acctgcaagc cacgtccata aaagtgatct 2280 caggaaacgg caacctttgc attgccgctg tctactgccc acctcgattt aatatctctg 2340 aaggtcaatt catggacttc tacaacactc tgggggatcg cttcatagcc gcaggagact 2400 acaacgccaa acacacgcac tggggatcac gcctcgtgac tcccaaggga aggcaactgt 2460 ataatgcact tataaaggtg agcaataaac tggactacgt ttcccctggt agccccacat 2520 attggcctgc agacccaaaa aagatcccag atctaattga tttcgcggtg acaaaaaaca 2580 tcccacgcaa tttaataagc gccatagcac taccggatct ctcatctgac cactcgcctt 2640 tgttgataag ccttcttcaa agcccagaaa ttgcggacca cccccacatg ctgacgtcgc 2700 acaacactaa ctggatgaaa tacagaaagt atgtgagttc ccacattgag ctaactcccc 2760 aactaaatat cgaagcggac atcgactgct ccaccgccgc actggaagag gtactcgtca 2820 cagcagctag gatctcaacc cctaaacgga aggatgggga atttaagaag ttcaaaacca 2880 accggcaaat tgaacagctg gttctggaaa agaggcgtct gcgacgagcg tggcaaacca 2940 gcagatcgcc atgttcaaag caacgtctta ctgaagccac ccgcaatcta aaccgggccc 3000 tgaagcaaga agctgaaaat gaacaactta aatatattgg aaaactctcg ccaactagca 3060 caaagcatcc cttgtggagg gctcacccaa atctaagctc tccaattcaa accgtcaccc 3120 ctataaggaa ttcttcaggg tgctgggccc gcagcgacaa agatagagcc gaatcatttg 3180 ccttacatct cagaggcgta ttccagccga accctgcagc aaatgattat gttcctccgc 3240 aaatccacct cgaaactgaa tccacgccaa ctacatttca gccgaacgag atcacaaaag 3300 tcatcaggga gctaaaacca aaaaaggctc caggcggtga cctaataact acaaaaatga 3360 taatggaact tccgtactgt gctattaagg tcatctgtaa actcttcaat ggaatcataa 3420 gtctcggcta ctatccaagg aaatggaaaa aatcaattat tataatgata ccgaagcccg 3480 gaaaagatca cacgattccg tcatcttaca gaccaataag tttactatca tgcctgtcca 3540 aattattcga aaaatgcctt ctaacgcgca taataccata tatgggagcc cacaacatta 3600 tcccagccca tcaatttggc ttcagagaaa aacacggaac tatagagcaa gtaaacagaa 3660 taacatcaga aattcgcaca gccttcgaac atagggaata ttgcagcgcg atatttctcg 3720 acgtatctca agcattcgac cgcgtctggc tcaaaggtct catgcataaa attaaaacac 3780 acttgcccgg gtacactcat aaactccttg aatcctacct ctacaataga gccttcgcag 3840 taagatgtaa tacaacaatt tccgacagct acactatcga agctggggtc ccacaaggta 3900 gcgcacttgg gccaacttta ttcgtccttt acacagcaga tattcccacg agtgaccgac 3960 taacaacatc caccttcgcc gacgacactg cgatccttag ccgctccaga tgcccagtcc 4020 gtgcaacaac ccaactcgcc aaccacctcg tggtagtcga gaagtggcta tctgactggc 4080 gtattaaaat aaacgaacaa aagtgtaaac acataacgtt tacccttaac agacaaactt 4140 gccctccgct ctgtatgaat agcacgcaga tcccccaagt taatgaagta acgtatctcg 4200 gcatccacct cgacagacgt ctaacatggc gtagacatat caaatgcaag aaactgcacc 4260 taaaactgaa agccagcagc ctccactgga ttataaactc acgatcgccc ttgtgtctgg 4320 actacaaggt cctgctctac aactccactc ttaagccaat ctggacgtat ggctcccagc 4380 tgtgggggaa cgcgagcaga accaacatag acatcataca gcgagcccaa tcaaagatcc 4440 tgcgaactat cactggggca ccatggtata ttcgtaacca aaacatccac agggacctcg 4500 gcattctcgc agtaaaagat gaaatagaca agcaaaaagc gtcctacaat gaaaaactat 4560 ctgtccaccc aaatcctcta gcaagggtct tgactcgggt ctccagccga acccgcctgc 4620 aacgtacaga ccttccaacc cagccataac ttgtcagggc cactgtctca gtctacatga 4680 ctcctagtta ggttatagta taagatttga tacacttatt gttagtctcg taaatgagaa 4740 gattcaataa ataaaagcaa agcatttaaa aaaaaaaaaa aaaaa 4785 // ID Mariner-32_HM repbase; DNA; INV; 3369 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-32_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3369 RA Bao W. and Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1966-1966 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1050..2876 FT /product="Mariner-32_HM_1p" FT /translation="MVRNYVRKTQRGATNDRIKSALEAMEMGCSLKNVASE FT FSINKKTLQRHRDGKVKVVGGLSLGGKSPVFSKDFELNIVTQVQIMERALF FT GLTTIDIRRLAYDFAKQMGIENPFNNESKMAGKDWLQGFMSRNPQLSIRTP FT QATSVSRAVGFNKPKLNQFFSVYKSLFEEHKFSAKQLWNMDETGITNVHKP FT GKIIATKGKRQVSKITSGERGATVTVVCAMSASGVYVPPLFIFPRKRMTDR FT LAFGAPSGSIVRVSSSGWTDSSLFIEWLTHFVTVTHASKNNEQLIVLDGHH FT SHKTLEAINFCRNNGIHLITLPPHCTHKMQPLDRTFFKPLKVGYNTAASNW FT MLSHQGRRISFFDMAGIFTTAYNFTANIDKAINGFRCSGLYPINELIFNDE FT AFDAALLTDEAIPLESCQQSSDLPSIPLVAPIQLPDVISTATVASKMKTVT FT KEPAQSNDAVLPVVDEHLGLKSAPNILEKISPRPKITVLRQRKRKTESAEN FT LTSSPFKKRLEEHENNVKEKQQEAILRKEKAALNKVIKQKKKSINLRTKKP FT KKSKKQKASQCFITDTTLCIICSCAYNVPPFDDWTQCTKCTSWYHSSCGPD FT DSDLCYYCES*" XX SQ Sequence 3369 BP; 1196 A; 480 C; 572 G; 1121 T; 0 other; gggtagggtg gggcaagatg accagtgggg caagaagtct ttttgctttt gtgagggtat 60 catagaatag taatgtttta ttcctctatt ataatattaa tatagttaaa acaatatcac 120 ggaacaaaaa atagtttaaa tatcgttatt agttaaggtt taaatagatt tttaactttt 180 ttgagtattt tgttagtaag ttgcttctaa cttcaaatgt ttattggact ggcaatactt 240 atgcgatgaa ggtaagaaaa tagttttttt tatacagaat ttgtcccaca ataatatata 300 gaaaaaaaag tttttaatta agcaacaaaa ttagttttat ttgtattaaa cttgcactca 360 ctacatatga ggcaagatgt cctcgtttac gtggggtaag atgcccacgg agggcttcta 420 gtctcgtggt tttacggaat ttctaccatt ttaaaagtgt gtttagaaca taataacgtt 480 tactaattta gttattgact ttgtttacat gttggtttat gtgttgataa ctattataca 540 tatatataaa taatttttta aaataatttt atatattttt aaaatgttat aatatgtatg 600 ctaaataaag tgaagtgtga agttgtatta catgcagtta ctgcaattca atccatactt 660 gtaaaacatt tataaattat ttataaatgt atatataaat gtattataaa tacattttta 720 aaatttgcat tgcatttggg gaaataaagt tgtattatat taaattaatg atcagcaatt 780 gaatgtaaat aataataata acaataataa attaaaaaaa taataatacc attgttatta 840 ataattcttg ggtaactttg tgcaaaaatt gtatgcatat aacttaagta agttcttatt 900 taacttaaaa agagttaatt tatgaatgta aggtaatgtt tgaaacaata gtatattaaa 960 taattatata atactttgca tgtgtttata tgtttatata atttaatttt aaaattttta 1020 tttagaaaca tatttaaaca atttaaataa tggtgagaaa ttatgtgcga aaaacacaga 1080 gaggtgcaac taatgatagg ataaagtcag cgttagaagc tatggagatg ggctgcagtc 1140 tgaaaaatgt tgcatcagaa tttagtatta acaaaaaaac attacaacgt catcgagatg 1200 gaaaagtaaa ggtagttggt ggtttatctc tgggtggaaa atctcctgta tttagtaaag 1260 attttgaatt aaacattgta acacaagttc agattatgga aagagcatta tttggcttga 1320 caacaataga tatacgtcgc ctagcatatg attttgctaa acaaatggga attgaaaatc 1380 cttttaataa tgaatcaaaa atggcaggaa aggattggct gcaaggattt atgtctcgca 1440 acccacaact ttctattcga accccacaag ccaccagtgt aagcagagct gttggcttca 1500 ataaaccaaa gttaaatcag tttttttcag tatacaagtc attatttgag gaacataagt 1560 tttcagccaa acagctctgg aatatggatg aaactggaat aacaaatgtc cataagccag 1620 gaaaaataat tgcaacaaag ggaaaacgac aagtttcaaa aataactagt ggagaaagag 1680 gtgctactgt gactgtcgtg tgtgcaatga gtgcaagtgg tgtttatgtc ccaccattat 1740 ttatattccc ccgtaagcga atgactgata ggctggcttt tggtgcacct tcagggtcaa 1800 ttgttagagt tagctccagt ggatggacag attcatcttt attcatcgaa tggttaacac 1860 attttgttac tgtgactcat gcatctaaaa ataatgagca attaattgta ctcgatggtc 1920 atcatagtca caaaacactt gaagccatta acttttgtcg caacaatggg attcacctca 1980 taactcttcc acctcattgt acacacaaaa tgcaaccttt agaccgtaca tttttcaaac 2040 cattgaaagt tggttacaat actgcagcaa gcaactggat gttatcgcat caggggcgta 2100 gaatttcatt ttttgacatg gcaggtattt ttacaactgc ctataacttt actgcaaaca 2160 ttgacaaagc aatcaatgga tttagatgct ctggtttata cccaataaat gagcttattt 2220 ttaacgatga agcctttgat gcagctttgc tcactgatga agctatacca ttggaaagct 2280 gtcagcaatc aagtgatttg cctagtatac ctcttgttgc acctatacag ttgccagatg 2340 tcataagcac agcaactgtt gcctcaaaaa tgaagactgt tacgaaagaa ccagcacaat 2400 caaatgatgc agttttgcca gttgttgatg agcatcttgg ccttaaatcc gcccctaata 2460 ttttggaaaa gatttccccg cgtcctaaaa ttactgtatt aagacagcgt aaaagaaaga 2520 cagaatcagc tgaaaatctt acttcttcgc ctttcaagaa gagattagaa gaacatgaaa 2580 acaatgtaaa agaaaagcaa caagaagcaa tattgcgtaa agaaaaagct gcacttaata 2640 aggtaatcaa gcaaaaaaag aaatcaataa atttgaggac aaaaaaacca aagaagtcta 2700 aaaagcagaa agcctctcaa tgttttatta ctgatactac tttatgcatc atatgctctt 2760 gtgcttataa tgtgccccca tttgatgatt ggacacaatg tactaaatgt acatcttggt 2820 accactcttc gtgtggtcca gatgattcag acttatgcta ctactgtgaa tcatagtcat 2880 gtatatacat cgttattaga acatttattt tagttattta ttgctaataa ttaaaaaaga 2940 agttatttat cttcaagaca attgtataga ataattgcat agtatttatt tgcagtgtta 3000 atatttcagt tgttttaata aaatgtaaag agtaaataaa aaaaactaat aagggctttt 3060 atttttaaga actttaagtt atttttaaca actgttttaa tggtttgggg aataacgcgt 3120 gaaaagcttt ctttctttaa atccatagag catcttgccc caccactggg gcaagatgtc 3180 ttctggcaca agtttttttt tttttgattt taaagtttat caacagtaac aaagtagtta 3240 ttttatctta ccaatgtctt ctcttataag taaactttaa tattaaaaag aaaaaacaca 3300 agaaaatgta aaagtattta ttatatacga taaagtttgt taaggggggc atcttgcccc 3360 accctaccc 3369 // ID RTE_Ele3 repbase; DNA; INV; 3365 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An RTE clade non-LTR retrotransposon family from Aedes aegypti. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE_Ele3. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3365 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3365 RA Kojima K.K. and Jurka J.; RT "RTE clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >97% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 483..3347 FT /product="RTE_Ele3_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="MKLEILGLSEVRWPNFGEHRIPSGQILLYSGLRGEHA FT PRHRGVGFLLSAQAYAALMKWEPINERIIVARFRTRVRNLTIVQCYAPTDA FT AELQDKENFYSQLNAVVDKIPKGDIKIYMGDFNAKIGSDNSSYEPVMGRHG FT LGEMSENGELLAEFCGNNDMVIGGSLFPHRPVHKVTWVSRDGFTENQIDHI FT CISRKWKRSLLDVRNKRSADIASDHHLLIGEVRLRIARIRRQEERVGRRFN FT TRRLEDATVKRSFVEELETXAANISEGGSVEEQWTAIKNAFITTSENNLGE FT LRTRRKQWITDETWRKIEERREAKAAIERAKTRGAKALSRQRYSALEREVK FT RSCRRDKRAWADSLADEGEKAAATGDIRLLYDISRRLSGAKMNTTMPVKDT FT TGQLLTDPTDQLKRWFEHFEQLFQVSTRPPTPQHDPPRVRRISRVNTEAPS FT LHEIETAIQSMKSNKAPGVDRISAEMLKADPAISAQLLHQLFRNIWDTATF FT PADWMQGVLVKVPKKGDLTVCDNWRGIMLLCIVLKVLCKVILNRIQEKIDA FT TLRRQQAGFRAGRSCVDHIVTLRIILEQVNEFQESLYMVFIDYEKAFDRLN FT HENMWGALRRKGVPEKIIGLIEAQYEAFTCRVLHNGVLSEPIRVVAGVRQG FT CILSPLLFLIVIDEILIGAIDREQNRGLLWQPITMEHLNDLDLADDIAILA FT QRRSDMQSKLDDLAERSSAAGLNINVNKTKSLDVNTAGPSSFTVAGQTVEN FT VENFQYLGSQLASDGGTKIDIGARIKKARAAFASLRNIWKNSQISQRTKIR FT IFNSNVKSVLLYASETWCVSAENTQRLQVFINRCLRYIIRAWWPHNWISNA FT DLHRRCHQQPIQTEIRQRKWGWVGHTLRRGGNEICKQALEWNPAGHRSRGR FT PRGSWRRSLNKEIKEVDGNLTWPQVKAIAGNRPGWKSFTSALCTRMGAQDP FT " XX SQ Sequence 3365 BP; 942 A; 842 C; 885 G; 695 T; 1 other; gggctgcaaa atggtgagtc gacggagtcc tcacaagttc ctacctcatg cttccacggg 60 tcgacggatg acaaatgacc gccagctaag agttgcgtgc ttagctggta gtgcagcctg 120 ggtcctgtag tccttctgac ttcagctaga ttgaggaggt acgtccaccg agcgtctgtt 180 caccaaggag gtgcggctca aacagcgtct gtctggcatc cagcggctga ataagaaatg 240 ctcttccccg gaagctacac ctaagatggc agccccatca cggcggatag ggaaccttag 300 gctaacaacc tactgtcccc gaaacatcaa aaaatgttca agaaaccgat agagaaagat 360 tacgaactga tttaacggca acgactttta gcgcgaaaca acggacacga attggaacat 420 ggaacgtact aaccctagcc cagcagggta aactggcaca actcgccaac gaggcgtgtc 480 gcatgaagct ggagatccta ggactgagcg aagtccgttg gccaaacttt ggagaacaca 540 gaataccatc gggacagatt ctgctatatt ctggcctgcg aggtgaacac gctcctcggc 600 accgtggggt tggctttttg ctcagcgccc aagcttacgc cgcgcttatg aagtgggagc 660 ctataaatga gagaataatc gtcgccagat ttagaacacg ggtccgaaac cttaccatag 720 tccaatgtta tgcgccaacc gatgctgccg aattgcagga caaagagaac ttttacagtc 780 aactcaatgc tgtcgtggat aagattccga agggtgatat caagatctac atgggcgact 840 tcaatgcgaa gatcggatcc gacaactcga gctatgagcc tgtcatggga cgccatggtc 900 tcggagaaat gagcgaaaac ggagaactgt tggcagagtt ttgtggcaat aacgacatgg 960 tgatcggggg atcgctcttc ccccatcgtc cggttcacaa ggtcacgtgg gtctcccgtg 1020 acggctttac cgaaaatcaa attgaccaca tctgtatcag ccgaaagtgg aaacggagcc 1080 ttcttgatgt gcggaataaa cgtagcgctg acatcgcatc cgaccatcac ctcctcatcg 1140 gcgaggtgcg tctgcgcatt gcgcggattc gtcgacagga ggaaagagtc ggacgtcgat 1200 ttaacacacg ccgactggaa gatgccacag tgaaaaggtc cttcgtcgaa gaactggaga 1260 ctcktgctgc taatatttcg gaaggtggca gcgtagaaga gcaatggacc gccatcaaga 1320 atgccttcat cactaccagc gaaaataatc tgggtgagct acgcactcgg agaaaacagt 1380 ggattaccga tgaaacctgg aggaagatag aggagcggag agaagccaaa gccgcgatag 1440 agcgggcgaa aaccagagga gccaaggcac tttcccgtca acgatactcg gctctagaaa 1500 gggaagttaa gcgctcatgt aggcgggaca aacgagcgtg ggcggactcc cttgctgacg 1560 aaggtgagaa agccgcagca accggcgaca ttcgtcttct ctacgatatc tcacgacgcc 1620 ttagcggggc aaagatgaat acgacgatgc ccgtgaaaga cacgactgga caactgttaa 1680 ccgaccccac tgaccagcta aaacgctggt ttgagcactt cgaacaactt ttccaagtgt 1740 caacccggcc accaacacct cagcatgatc cgcctagggt tcggcgtatt agtcgcgtta 1800 acaccgaagc tccatcactg catgagatag aaacagccat ccaaagtatg aaatccaaca 1860 aagcccctgg ggtcgatcgc atatcagccg agatgctcaa agcagacccc gcaatatccg 1920 ctcaactgtt gcatcaacta ttccgcaata tctgggacac cgcgactttt ccggccgact 1980 ggatgcaagg cgtgttagtg aaggtaccca aaaagggtga cctcactgta tgcgataatt 2040 ggcggggcat catgttgcta tgtattgttc tcaaagttct atgcaaagtg atcctaaacc 2100 ggatccagga gaagatcgat gcgactctcc gtcgacagca agcgggattc cgtgccggac 2160 gatcctgtgt ggaccatatt gtcacgctcc gtatcatctt ggagcaagtc aatgaattcc 2220 aagagtccct gtatatggta ttcattgact acgaaaaagc tttcgaccga ctcaatcacg 2280 aaaatatgtg gggtgccctg agacgcaagg gtgttcctga gaaaatcatc ggcctcatcg 2340 aagcgcagta cgaggccttt acgtgcagag tattgcacaa cggggtcttg tctgaaccca 2400 tccgggttgt agctggagtg aggcaaggat gtatactatc accactactg ttcctcatcg 2460 taattgacga gatcctgata ggcgccattg accgtgaaca aaaccgtggg ctgctgtggc 2520 agcctattac tatggagcac ttaaatgatc tcgatttggc tgatgatatt gctatactag 2580 ctcaacggcg ctctgatatg cagagtaagc tagacgatct tgccgaacgc tcctcagcgg 2640 caggtctcaa catcaatgtc aacaagacca aatcgttgga tgtaaacacg gccggccctt 2700 ccagcttcac agtagccggg caaacagtgg agaacgtcga aaacttccaa tatcttggta 2760 gccaactggc gtctgacggc ggtaccaaga tcgacatagg tgcacggatc aagaaggcga 2820 gggctgcttt tgcgagctta agaaatattt ggaagaacag tcagattagt caacgcacca 2880 aaattcgaat attcaattcc aatgtcaaat ctgtgctgtt atacgccagt gaaacgtggt 2940 gtgtatcagc ggagaacact caacggctgc aggtgttcat caacagatgc ctgcggtata 3000 taattcgggc atggtggcca cacaactgga tttctaatgc tgacctccat cgtcggtgcc 3060 accagcaacc gatacaaacc gaaattcgac aacgaaagtg ggggtgggtc ggccacactc 3120 tacgtcgggg cggaaacgaa atctgcaaac aagcgctgga atggaatcca gcaggacatc 3180 gcagcagagg cagacccaga ggatcatggc ggcgcagcct caacaaagaa ataaaagaag 3240 tcgacggcaa tttgacctgg ccgcaggtca aggcgatagc tggtaatcgc ccaggatgga 3300 aatctttcac gtcggcccta tgcaccagaa tgggtgccca ggacccataa gtaagtaagt 3360 aagta 3365 // ID hATm-33_HM repbase; DNA; INV; 3547 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-33_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3547 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1927-1927 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(719..1033,1180..1431,1471..3042) FT /product="hATm-33_HM_1p" FT /translation="MDLITCPHTILLCEHDLSGCSSPKNCKNKAHINCTCI FT YSKKIPTMELRWLYSQRNKTGEKSDMQMGHADKIETMKLRKKYKRKVSDEE FT AYKKRIKKKRRNKHSSRVAEEEDIELDKEHQKNIFPGFLDENYNLDNVNLT FT LKQDNEIKILVKRLLKEKLGVYAFLVTRFLEKKSRYNTMSIKNTAMASIRV FT NRNKYKPFLFYRFGISPAATAAIATSYLQDLIEAGFLTPESSYLACDPSKL FT TRERKKVMVVAKKNDNLKLPKADNIAIYFDGRKDSTRAMINDSNGQLHPII FT IKEQHITVTIEPGGRYLGHFTPNAPNHLDKPAKKIAEGVYNLLQQYNLTES FT CLVLGADSTSINTGWKGGAITHLEKLLGHKCHWAICMLHSNELFLRHLIEG FT IDGPTSCSTGFVGPVGKFLPDVNKMEYNPQFKALPNGESFVDIPETILKNM FT STDQKQCYKLCKAVKIGLLPPNLREIQCGPLSHARWLTTGMRIVFMWTRKH FT NLNDQNSQNLEILVVFCLQFYFKLFFDIKVKNRLEDASYHILSQLRILRMQ FT PDRVKEIVFPYIQLGAWYANHESILISLLSSNNAEDRNFAVDQILKLRGND FT ILGDMRIRPRKTPKLNFAATTLQELVKWENGVVEEPVFTCSLTRDKLLLLR FT LIPLITPSIKIHTQSTERAVKQVTVASMSVVGPYARDGYIRARAQHRELLP FT VFKTKKEILLNF*" XX SQ Sequence 3547 BP; 1322 A; 512 C; 574 G; 1138 T; 1 other; gggtacagtg aatttcatat gtttttgaaa aaaataaaat taaagcttag gaaaaggtgt 60 attcttatgt aaaattaaat tttaaggtag gtttgctatg gacaactttt tgattagaaa 120 acgttttgat acccaaattt tcatatttag acattattaa aatgaaatat actttaatta 180 ttataaagta aatttaatcc ggttttatgt ttagtttctc ataataagcg cagtgccaaa 240 tcaattttta aagaaattct tagctctcca ccgtaaaaaa catctaaaca agatcgtagt 300 ctgtaaatat tttatttgtt atattatcaa tttattcata aatgtcgaaa gttaaaacta 360 gatattcatc caccactaga atttctgagt ttgttggacc tggaaaagat tttattccta 420 gtgagctgcc tacaaacaga gcagttattc aaaaaggaat tttattaaaa gaacaaaaag 480 cataacatgc aatttacacc aaaataaata ttctaacaaa gatcttgcta gcgatctaat 540 tccacttgtt cttgctcaat ggaaaaaagc aagtatttta tttcaacaac cagttattat 600 tggggagaaa tcgctttatt atagaataaa aacactttgg gagaaagtag aaaacgttgc 660 atggggaaga actaaggtaa aagtaaaaga agagctaata ttgaagttag ataaactaat 720 ggacctcata acatgtcctc acacaatact tctatgcgag catgatctgt ctgggtgttc 780 ttcccccaaa aattgtaaaa ataaagcaca tataaattgc acctgtattt attctaaaaa 840 aattccaact atggagctca ggtggttata cagtcaaaga aataagacag gtgaaaagtc 900 tgacatgcag atgggacatg ctgataagat tgagacaatg aaactaagaa aaaagtataa 960 gagaaaagtc agcgatgaag aagcttacaa gaaaagaatt aaaaaaaaga ggaggaacaa 1020 gcattcttcg aggtaaatat aatttcttaa taataaatat tttctttttt catataataa 1080 taataataat aacaataata tatatataaa tatattatat atatatatat atatatatat 1140 atatatatat atattatata tattttaatt tgattttagg tcgcagaaga agaagatata 1200 gaattagata aagaacatca aaaaaatatt tttcctggct ttctcgatga aaactataat 1260 ctagataatg ttaacttaac tttaaaacaa gacaatgaaa ttaagatttt ggttaaaagg 1320 ctgctaaagg aaaagttagg tgtctatgct ttccttgtaa cacggttttt agaaaagaag 1380 tcaagatata acacaatgtc aataaaaaac acagctatgg ctagtataag gtaattatat 1440 tgacttttgt tttgttgagc agcaacatga gttaatagaa ataaatataa accattctta 1500 ttttataggt ttgggatttc acctgccgca actgcagcaa ttgctacaag ttatcttcaa 1560 gatttaattg aagcaggatt tctcacccct gagtcctctt accttgcttg tgatccttct 1620 aagttgacga gagagagaaa aaaggtgatg gtagttgcga agaaaaatga taatttaaaa 1680 cttccgaaag cagataatat tgccatttac tttgacggaa ggaaagattc aacaagagca 1740 atgataaatg attcaaatgg acagctacat cctattatta tcaaagagca gcacattact 1800 gtgactatag aacctggtgg tcgatatctt ggccatttta ccccaaatgc acctaatcat 1860 ctagataagc cagctaaaaa gattgctgaa ggtgtttata atttgctgca acaatataat 1920 cttactgaat catgtttagt acttggagct gacagtactt caattaacac aggctggaaa 1980 ggtggtgcta taacgcatct tgaaaagctg ttaggtcaca agtgccactg ggctatatgc 2040 atgcttcatt cgaatgaatt atttttgcgt caccttatag aaggcattga tggaccaaca 2100 tcatgtagta ctggatttgt aggtcctgtg ggaaaatttc ttcctgatgt taataaaatg 2160 gagtataatc ctcaatttaa agctcttcca aatggagaga gctttgtcga cattccagaa 2220 acaatattaa aaaatatgtc aacagatcag aaacaatgtt acaagctatg taaagcagtt 2280 aagattggtt tattgccacc taatcttaga gaaattcaat gtggcccctt aagtcatgcc 2340 agatggctca ctactggcat gagaatagtc tttatgtgga caaggaaaca caacttaaat 2400 gatcaaaatt cgcaaaatct agaaatactt gttgtatttt gtttacaatt ttactttaaa 2460 cttttttttg acataaaggt caaaaacaga cttgaagatg cttcttatca cattctgtct 2520 caattgagaa ttcttagaat gcaacctgat agagtaaaag aaattgtgtt tccctacatc 2580 cagttagggg cttggtatgc gaaccatgaa agtatactaa tctctttatt atctagtaat 2640 aatgcagaag atcgcaattt tgcagttgat caaattttga aactaagggg taatgatatc 2700 cttggagata tgcgaatcag accaagaaaa acacctaaac ttaactttgc agcaactact 2760 cttcaagagc tagttaaatg ggaaaatggt gttgtagaag aacctgtttt tacttgttcg 2820 ctcaccagag ataagttgtt attacttaga ttgatacctt taataactcc ttcgattaaa 2880 attcatactc aaagtactga acgtgctgta aaacaagtga cagtagcatc catgtcagta 2940 gttggaccat atgcaagaga tgggtatatt agagctagag ctcagcatag agagttgcta 3000 cctgttttta aaacaaagaa agaaatttta ttaaattttt aacttcaata gatttttatt 3060 ttatttgttt tttaatagtg ttttagtagt tctatcaaaa tattattggg gtaagatagg 3120 gggataaacc acctattttt tggtgaaagt ataaagaact gttatgatca aattttttaa 3180 tcattagaac ctcataaaaa attgaatacc ccctcctcct ccctctaaaa aatgcagaat 3240 ctgcactgca gataggatcc atgcaagata atgattttaa aaagtatcta aaaaactttc 3300 aataatctaa aaatattatt tgcatttaat attttatcgc aatgaaaatg aaaatataaa 3360 atacaatgaa ggacatgtga caaattagat tttactaatt tttgctattt ctaaaagtcc 3420 atatttgggt atcaaaactt tttccgatca aaattttttt ttaagaaacc ctctttaaaa 3480 tttgatttta cataaaagta caccttttcc taagctttta raaaaaaaaa aaaaattcgc 3540 tgtaccc 3547 // ID BEL-8_CQ-LTR repbase; DNA; INV; 398 BP. XX AC AAWU01032143; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_CQ_; KW BEL-8_CQ-I; BEL-8_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-398 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 170-170 (2011). XX DR Genome; AAWU01032143; Positions 7048 7445. XX SQ Sequence 398 BP; 98 A; 91 C; 83 G; 126 T; 0 other; tgttggagaa ctactgccga tgattgaatt tgttacccta gatataagaa ttttggactg 60 ttctaatgta atcatgaatt gttgcgttct ctcaatcaat ctcttatcgc tctctgcgat 120 cttttgcatt tccgccgacg atcgtcagca cgatcgatcg ggtagccgat agggagctgc 180 ggctcattag ttttagttcg tattcgacaa cgttacagag aatacatcga ggaggaacct 240 ctgccaactt tacttttgct ttttctttaa ataaatggcc caagactaag ccgtagtttt 300 cttcgaaagt tcgttttttt gattttggca agttccccac accgaaacaa accccacgga 360 gaagacgtcc tgccgagtag ctgcccagtt tgcgtaca 398 // ID Gypsy-603_AA-I repbase; DNA; INV; 6468 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-603_AA_; KW Gypsy-603_AA-LTR; Ty3_gypsy_Ele116; Gypsy-603_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6468 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4292-4768] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2101..3198 FT /product="Gypsy-603_AA-I_1p" FT /translation="MINSIIINPEDDERPHAEISILGKKLRGLLDSGASCS FT LLGGNAVKIIEELRLRKGEAKGQIKTVDGTAHNIRNFVYLPIAFNNRIQTI FT AVLLAPSLPECIVLGMNFWRTFGVKPVCCSLEEEGIEIDTNPETAKTPEEE FT PMKELTPEQRKALHRVIATFPTAEDGKLGRTSLYEHRIDVGEAKPKKQLNY FT PMSKYVLDEVNKEIDRMVALDVVEEAMFSPWNNPLVAVKKKNGNYRVRLDA FT RHLDSIMTNEGYPIPQISAIINNLGGCAYISSIDLKDAFWQLPLAAASRPL FT TAFTVPSRGHFQFKVVPFALCTASQALSRLMTHLFADLEPRVFHYLDDIII FT CSRTFEEHLEMLTEVAARYGVRT" FT CDS 3515..5104 FT /product="Gypsy-603_AA-I_2p" FT /translation="MPDYSKPFSVACDASDIAIGAVLTQEVDGEEHPASYF FT SQKLSSSERKYSVTERECLAVIRAVEKFRGYIEGVRFVVYCDHAALSYLLS FT MKNPTALMSRWILRLNAFDFEIKYRKGSINVVPDALSRVVAVTTFATGEDP FT EDPWYEALAHKIKTQGDDYPDFRLVDNEIYKNCLCREGGTTTHKWKRIVPL FT KNRAEVIKRCHDAPTAGHLGFHKTLDKVQSHFYWPKMRENIGQYVRSCEVC FT KASKAPNSAMMPDMGGLKPARIPWELVSVDFVGPLTRSKRGNTVLLVVVDW FT ITKYVIAHPMRSADSGKMVEFLEQHVFLKFSRPRIILSDNGRQFESIAFKS FT LLARHKITHMKTAFYSPMVNNAERVNRVLITCVRALLDENHAGWDENLAAI FT TAAINSAKHQVTGVSPHFANFGRDLILHTDLYVQANMNTPEDPKVAQDMRL FT SAMKRIQEFVLQRIRKNYEKTKQRYNLRTRQVDFNIGDVVWRRTFTQSSKA FT DHVNRKLDPKYVPALVKQILGKNIYMLEDVADGK" XX SQ Sequence 6468 BP; 1936 A; 1553 C; 1626 G; 1340 T; 13 other; ttggcgccca accaaaacga gcccgtagta ttattatctt atagaataat actcaaaaat 60 ttgtgagaaa ttttgttttg tccatttggc taacttttcg tgttgaagat actttagaca 120 gtatggatcc cttatacctc agtgatgagg agctaaatta cgagttggct ttgcgccgat 180 gcaaccaacc tacttctgcc acacgcagaa tcaaaggcac gcagctcaga gcgctgatgc 240 aaaaagagtt tgccgagaat gtaaaataca cctcatccga gcatgcaatg ccggcagawg 300 tgaacattca tcgttgtgaa cagcaagtca gactactgat tccaatgatt gagacggcac 360 tcaaaagacg taacacagat ttcctaatcc aatctmgatc kmgtatgatt cattatcgag 420 atcgtctgtc aatcgtsaaa cctccggsag acctgatcga tatgcacgcg tcggtgtcaa 480 tgcaagtgaa gttcttaata gaagacatca gtgacagact tggtgatccg ttgatcagcg 540 mgagtagtca acaagtktma cmaccagcgs aaggagcggk atcgctggaa tatcaacagc 600 actacaacac tcttctatcg ccggaaagag aacaaaacca gcatcgaatc aacgaaccac 660 agaaggcagc agcgcaaacg ggtgttgtga acaggcaaag caatgaaggg ctgaacaacg 720 acttcatccg tgcgacgagt tcgccgatta gagcaggaag aggaagggga cgtggagttg 780 cgatcaatct ctcactacct ggcaggaatt cttcgcaacc gagaggcaca cctcctccag 840 cttaccatcg agctggacca gaaatagctc acgacagtac aaccgatcgc ttgagagatg 900 aactgctcat acagctacta caacgagaac agcacacgca agatcggcaa gacgcaggga 960 gaggtctcaa agcagttcac aactggccat tcaagttcaa gggcgaaaag gacaccacat 1020 ctctcaacac ctttctcgac agagtggaaa cattcgcacg gtcagaaggc ctttccgatg 1080 atacgctgct gaagtccatc aaacatctac tactagacga tgcgttggac tggtatggac 1140 gagcgatatc gcagaacctg ctggtaacat ggacagcttt caaaaacgag atacggaagg 1200 aatatctgcc aagtagttac gagcagatcc tcaagttaga agccatcttc cgattccaag 1260 ctccggatga gtctttcgca aagtactacc gtgacatatc cgcattattc cgcttcatac 1320 aaccaccaat gagcgaacaa gagaaattct tcattgtgaa gaaaaacatg aatgcggact 1380 atgcaacgat tgttacggca gcgagaccac tctctctgca agacatggtg gaagtgtgta 1440 gcagctatga tgaaacgcgc atgctgttaa accgtcagaa acgagttcct tttccgcaga 1500 gcagtctgct agaaccgaac ttagcaacgc cagcgaatcg cccaacgccg ttaaatcgag 1560 gaaatccgcg gttcgggcga gtgcacatgc tgaactcaga ggaggatttt agtcctctag 1620 aagcttgcag ctgtcaacca caagcccacc caagcaaaaa cgcgaacgtc agcgagagtc 1680 aaaagaagtc aatggaaacg caatcggcag cagaaaagga caactggcag cagcgaattc 1740 tggaactgtc ggagcaagtg aatgcgctaa aacttcgaca atataacagg gaagaaagac 1800 cacatcgcag tcagcatcaa caggaacaag agtgcaagaa catcatcagc aacaacctgg 1860 cgatcaaagg agaactggtt tgatctgttg gaattgcgat gaagaaggac atcggttcat 1920 ggactgtcca aagcctcaag cggtcatgtt ttgctatcgc tgtgggcaca aaggctattc 1980 gctgcgtagc tgtccgactt gtcgaacagg tctgggaaac gctgtcgcgg ggaatcgcta 2040 atcgagggaa ggagatcctc gcaaaacttc tacaatccct cagacatccc gaaccaagcc 2100 atgatcaatt ccatcatcat caaccccgaa gacgacgaac gcccgcatgc agagatatca 2160 atcctgggga agaagttacg tggactgctg gacagtggtg caagctgttc ccttcttggc 2220 ggaaacgcgg tcaagatcat cgaagaactc cgactgcgga aaggagaagc aaaaggtcaa 2280 ataaaaacag ttgatggtac cgcgcacaac attcgcaact tcgtatatct tccaatagct 2340 ttcaacaacc gaattcagac aatcgcagta ctcctggccc catcgctacc agaatgcatt 2400 gtgctcggca tgaacttctg gagaacattc ggagtaaaac cggtgtgctg ttcgttggaa 2460 gaagaaggca tcgaaatcga cactaatccg gagacagcga aaacaccaga agaggagccg 2520 atgaaggagt tgactccaga gcagagaaag gcattacatc gagtcatcgc aacatttcca 2580 actgcggagg acggcaagtt aggtcgaacc agcctgtacg agcaccggat agatgtcggt 2640 gaagcaaaac ccaaaaagca actgaactac ccgatgtcca agtacgtgct ggacgaagtc 2700 aacaaggaga tcgaccggat ggtggcacta gacgtggtag aggaagcgat gttctcccca 2760 tggaacaacc cactcgtggc ggttaagaaa aagaacggga attacagggt acgcctcgat 2820 gcgcggcatc tggactcgat catgaccaac gaaggatacc cgattccaca aatctccgcc 2880 atcatcaaca accttggggg atgcgcgtac atttcgtcga tcgacttgaa agacgctttt 2940 tggcaactac cactagcagc ggcgtcccgg ccgttgacgg cgttcaccgt accctcacga 3000 ggacacttcc agtttaaggt agtcccgttc gcactctgca cggccagcca agctctatcg 3060 cggttgatga cgcatctgtt cgcagaccta gagcctcgag tcttccatta tctggatgac 3120 atcatcatct gctctcgtac tttcgaagag catttggaaa tgctgacgga ggtagcggcc 3180 cgctacggcg tgcgaacctg acaatttccc agaaaaatcg aagttctgca gagaggagat 3240 aagatatctt ggttatgtgc tgaatcaaga cggctggaag gggacgggat agcctgcatc 3300 gtaaagtacc cggcaccaac aaccagaaag gaggtgcaga gatttctggg cctgtaccag 3360 taactggtac agaaggttca tagcggagtg ctcacggatt gctgccccgc atagaccgag 3420 ttaacgaaga cgaagataaa gttccgctgg acggcgaagc agaggaggct tcttaaacct 3480 gaaggccgcg ctagtgtcag ctccggtgtt gctcatgccg gattacagca aaccattctc 3540 ggtagcatgc gacgccagtg acatcgctat tggtgcggtc ctaacgcagg aggtagacgg 3600 agaagaacat ccggcaagct acttctcgca aaagctgtcg tcgtcggagc gcaaatactc 3660 ggtaaccgaa cgggaatgcc tggcagtaat ccgcgccgta gaaaagttca gagggtacat 3720 agaaggagtg agattcgttg tgtattgtga ccatgcggca ctttcgtatc tactgtccat 3780 gaaaaacccg acagcgctga tgagcaggtg gatcctaaga ttgaatgcct tcgacttcga 3840 gatcaagtac cggaagggaa gcatcaacgt agttccggac gcgctctcgc gggttgtagc 3900 agtaaccacg ttcgctacag gcgaagatcc agaggaccca tggtatgaag cgttggcgca 3960 caaaataaaa actcaaggag acgattaccc ggacttcagg ctggtggaca acgagatcta 4020 caagaactgt ctctgcagag aaggaggaac aacaacgcat aaatggaagc ggattgtgcc 4080 gttgaagaat cgcgcggagg tcatcaagcg ctgccacgac gcgccaacgg ccggtcatct 4140 aggattccac aaaacgctag acaaggttca gtctcacttc tactggccga agatgagaga 4200 aaacataggg cagtacgtcc ggagttgcga agtttgcaaa gctagcaaag cgccaaactc 4260 cgccatgatg ccagacatgg gaggtctgaa acctgccaga ataccatggg agctggtgtc 4320 agtggacttc gtgggacccc ttactcgctc aaaacgcggc aacacggtac tgttagtcgt 4380 ggtggattgg atcacgaagt atgtcatcgc tcatcccatg cgcagcgcag attctggaaa 4440 gatggtggag tttctggaac agcatgtctt cctgaagttc tcacggccaa gaatcatcct 4500 ctcagacaat gggagacagt ttgagtccat cgcgttcaaa tcgcttctgg cacggcacaa 4560 gatcacgcac atgaagacgg ccttctattc gccgatggtg aataacgccg aacgcgtgaa 4620 tcgagtgctg atcacctgtg ttcgagcgct tctggatgag aaccatgcag gatgggacga 4680 gaatctggca gcgataacag cagccatcaa cagtgctaag catcaagtga cgggagtgag 4740 cccgcatttt gcaaatttcg gacgagacct gatactccat acggacctct acgtacaagc 4800 gaacatgaat acaccggaag atccgaaagt ggcacaggac atgcggttgt cagcgatgaa 4860 gagaatccaa gagttcgtgc tacagcggat taggaagaac tacgaaaaaa cgaagcagag 4920 gtacaatcta cgcacgcgac aggtggactt caacataggc gacgtcgtct ggaggcgaac 4980 cttcactcag tcctccaaag cagatcacgt aaacaggaaa ctggatccaa agtacgttcc 5040 ggcactagtg aaacagattt tggggaagaa catctacatg ctggaagacg tcgcggatgg 5100 gaagcwaggc cgctatcacg caaaagacat caaggcggac taaacgttat tggttcagct 5160 atgtccacag gcaaaccctg tcaacaatga gttctacgct ccgaaaatcc gtgccaaatc 5220 gggaaaatgg cagccaagtc aggtgtgatg agcaacaact tcttcatgac tacctctcca 5280 cctcgacaag accttcagct atgaaaacac gacgtccaga agcctacgga aggcatggga 5340 gattgggacc ctgaaaactt atagtgagta aggagactcc aaataccagc aactggagcg 5400 acaacacgac aagacgaagc ctacggaagg cacgggagat tgggaccctg aaaacttaaa 5460 gtgagtaagg aggctccaaa taccaacgac ggaaacaaca acaacacgac aggacaaagc 5520 ctacggaagg cacgggagat tgggaccctg aaagcctgta gtgagtaagg tggctccaaa 5580 taccaaacaa gatcagctat gaacaacacc agttacgagg agttcagctc gagaaagaca 5640 ccaagcgtgg aagcaatgaa gtcacaagcc ctggatggtg tactggcggc agcagcagct 5700 gcaaatatcg atcaggtcca aagtgttacg cattcctgta aatctgtcaa ataaaggtct 5760 tctcggcacc tcgtagtgtt cgggcgattg catggcatta tcatcccggg tatcacccaa 5820 aagcaacgcg accaggggcg cggatattcc aaggtaactt ggaattgtga accctattct 5880 tggacccatg cacatcgagg gcgaggaaat gcgctctttt ccgaaccaac tacacaagct 5940 atgcactcga cttcggtcag tgactaagcc attaggcacg aaaataggca attataagga 6000 atcataccat cgatgatgtc ctagccttaa gtaatttagc caatacaaca atttacctct 6060 atgtttcttg tgtagatgta agatttttct agttttagtt acaactagac ataatctatt 6120 tgaatgttcc actgtgggac agctagcata gagatgatgg acaagaactt ctctagagac 6180 catttgccaa gacttcgttg gcgtctgatc aattcatctg acaggactcg ctgtcgttaa 6240 tcagaaactt gaagctttgg attatgtgta ccctctctgg caagtggtgc gacttgcgta 6300 aatttagttt ggattttggt gcctctttgt atgtcttgaa gtctgttcaa gctaaactac 6360 acgatcagtt cagtctctag aaaacttctt aacggaagtg aaagtttaga atttttcatg 6420 aatcgctgtt accgattcat gaaaaattca atccccacag tgaagttg 6468 // ID BEL-66_CQ-I repbase; DNA; INV; 6791 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-66_CQ_; KW BEL-66_CQ-LTR; BEL-66_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6791 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 285-285 (2011). XX DR [2] (Consensus) XX CC Positions [5956-6537] - Integrase core CC 'GAATG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 227..1849 FT /product="BEL-66_CQ-I_1p" FT /translation="MAAEKKAAILSLQRTMKALHTRAGKLLIYVQSFDVDK FT EDSLQLETRRDAVFEMRQTFLEIEQKLFGLSKEEELDGISMQSEEFFDLVT FT EILYQIARKLKPLSNPPEAKPSLDSRIATVPSSAPTIRLPEISLPKFNGHY FT DQWIYFRNQFNHLVRRNEALNDHQRLHYLRSCLVGEAENFESSEESFSSLW FT KALEQEFENRRWPVDNHLAELFQIKRLEFESANDLHQLLNVVQRNLRGLSS FT LKLNLEPLSEAMLVHVVASRLDGDTHKAFESHVVGQSSVKWSEMVDFLLNR FT CRILENLEQERKQTRVHPKPVGARLQPKVLVSATRDEEKRNFGCFNCTGSH FT YINECRSFLALPVKQRFQRVKDLKLCINCFSNRHIVANCKSGTCKSCGQRH FT NTLLHFDPKMPGEGTSSQSSASSVRPRAIQNKPEQDRDRVGLCALTSPDAA FT REQPAPGPEARLVDPALDSKKTSGPVASEDRSVEPKSPSKKVQLISTSGQA FT LLHTAIVYVQGSNGEVQECRAVLDSCAMTSFMTTACASPVTFWPA" FT CDS join(1936..4413,5278..6789) FT /product="BEL-66_CQ-I_2p" FT /translation="ISSWVSTLAKNLSQLKTFPASVSIVGFGGAGHGITEA FT AVAHVFSKPSVIEHQEVVVEFLVAPCIVNRLPLRSFDISSWIIPEWIDLAD FT PHFNXPEKIDILFGIVEWDKMMLSQTHKLGDELPTLRRTIFGWVAGGPVNE FT EVNYPTLQALPVTNEQLDEQLTKFWEVESYSTERFLSPEEQAAEDHFVTTH FT CRDEKGRYVVALPFKEEEVKLGDSAHIAHRRFLMLEAKLIKDRKLYDDYRK FT FMQEYIDLGHMERVGTFSLAEPQELPYYFLPHHAVVRPESSTTKLRVVFDA FT SAKSSNGQSLNDQLLVGPTLQPRLIDNLLSFRTYKVAVTSDVAKMFRQIDV FT RKEDRRFQQILWRTSPDEDIGVYQLTTVTYGTACAPFLATRALLQTCTDEA FT EHFPLAAVFGKNSVYMDDVLFGAETVEEALEMQRQLTGMLGKAGFELXKWC FT SNRSELLAGIPEDKLEQKVLFTEEGKTKTLGITWQPDLDVFSFDIHSIAFS FT EGPPTKRKVLSDISKFFDPCGLAAPVVMTXKIYMQDLWRGKXKWDVALKED FT LAKLWIEYRDELSEMKTIRISRCVLPLRRTVFVELIGFSDASQKGYGANLY FT VRSVNHEGQTAIRLLCAKSHVAPLKEKRALIPRLEICGCVKLAQLVDKVKR FT SLPIEFDRITCYTDSMIALSWINTCPSKLEPFIANRVARILRLTSKIDWFH FT CRTEDNPADLLSRGIMPGKLATCGKWWKGPPFMIDHKRPTSVQLIPEDQLP FT GLLVVCAVETVERFSLFDNCEKYFTMVRAVARIQRWHQNACKRSPENRRHG FT RLFSPEEMRRATPRAWVNNKGNALWFYLRYFLPLAPHQHNHHRATKGTFWS FT LLFAQVIPKFVFPCYFPCSGLALVRLAQEEAFPEVFAQLRKKVTLQNHSLI FT PLSPFTDDRDVLRVGGRLSKSSCSYDTKHQMILPQKHPLTVAIIRDKHTQN FT MHAGIELTLAASRREFWIIRGRVAVKDVIGKCVVCFEDNPRPVRQYMGDLP FT ACRLERRYPFHQVGIDLCGPVFIKQRNKRSTVQHKAWIVLFVCLSTKAIHL FT ELVSELSTAAFLAAFDRFVSRRGXPAVVWTDNGTNFVGAANLLAEWDKFFA FT SVDTQDEIQRAHSNEITWRFNPPEAPHFGGIWESNIKQAKKLLVKHTAGAA FT LCFEEMSTVLARIEAILNSRPITPLSNDPEDFEPLTPGHFLVFRPLNAVAR FT PEVDRTKHARSRYEHITKIVQHFWDRWHFEYLTXLQQRYKWHHKIVVEVGQ FT LVLIKRDNMPVQKWLLGRVVELFPGVDGVTRVVKVKTKDGILKRAVAKLCF FT LPIDPENGVESAAFQRRE" XX SQ Sequence 6791 BP; 1711 A; 1774 C; 1709 G; 1588 T; 9 other; ttaggtcctt cgagccggat cgaagcggat tgcattgtcg gagaaagtgt tcgaaagggg 60 acctcgaacc tcaaggtaag gtagtgatcg attgagctag agcaagcgcg tgcgtaaata 120 gaatcttggc ctttttgtgc gaagcttttg ttctcggtgg cttccgtttc cgccatcttt 180 gatccacatt tgtcttgtgt gctcccatcc acagacaaac tttgcgatgg cggcggagaa 240 aaaggccgcc attttgtccc tgcaaaggac gatgaaggca cttcacacac gtgctggaaa 300 acttttgatc tacgtacaaa gctttgatgt cgacaaagag gattccctcc agctcgaaac 360 tcgcagagat gctgttttcg aaatgcgcca aacctttttg gaaattgaac agaagctctt 420 cggtctctcg aaggaagaag agttggatgg gatttcgatg cagagtgaag agttttttga 480 tctcgttact gaaattctct accagatcgc cagaaaactg aaaccccttt cgaacccccc 540 agaagcaaag ccctccctcg acagtagaat tgccaccgtt ccaagttccg ctccgaccat 600 taggcttccc gagatctcgt tgcccaagtt caacggccat tacgaccagt ggatttactt 660 ccgcaaccag ttcaaccatc ttgttcggcg caacgaggcc ctcaacgatc atcagcgtct 720 ccactatttg cgctcttgcc ttgtcgggga agctgagaac ttcgagtcgt cggaagagtc 780 gttctcgtcg ttgtggaaag ccctcgaaca ggagttcgag aaccggcgct ggccggtaga 840 caaccatctt gcggaactgt tccagatcaa gcggttggag ttcgagtcgg cgaacgatct 900 gcatcagctg ctgaacgtcg tccaaaggaa ccttcgtggc ttgtcctctt tgaagcttaa 960 cctcgaaccc ctttccgaag caatgctagt ccacgttgtc gcttcccgtc tcgatggtga 1020 cacgcacaag gctttcgagt cccacgtggt cggccagagt tctgtsaagt ggagtgaaat 1080 ggtcgacttc ctgttgaatc gctgtcgcat tttggagaac ctggagcaag aacgcaagca 1140 aacccgtgtc cacccgaagc cagtcggagc caggctgcaa ccgaaggtgt tggtgagcgc 1200 cacccgtgat gaggagaaga ggaatttcgg ttgtttcaac tgcacaggta gccattacat 1260 caacgagtgt cgcagcttcc tcgctctgcc tgtgaagcag cggttccaga gggtcaagga 1320 tttgaaattg tgcatcaact gcttctccaa tcgccacatt gtcgccaatt gcaagagtgg 1380 tacctgcaag tcttgtggcc aacgtcacaa cacgcttctc catttcgacc ccaagatgcc 1440 aggtgaagga acgtcgtcgc agtctagtgc atcgtcggtt cgccctagag caatccagaa 1500 caagcccgag caagatcggg atcgcgtcgg tctgtgtgct ctgacttctc cagacgctgc 1560 tcgtgaacaa cccgcgcccg gaccagaggc gcgccttgtc gaccccgctc tagattcgaa 1620 gaagacgtcc ggaccagtgg cgtcagagga tcgatcagtt gaaccaaagt cgccctccaa 1680 gaaggttcag ttgatctcga cgagcggaca agcactgctg cacactgcga tcgtgtacgt 1740 tcaaggatca aacggggaag tccaagagtg tcgtgcggtc ctggactcgt gtgcaatgac 1800 ttcgttcatg acgactgcgt gtgcttcacc agtaactttt tggcctgctt gatgttcgat 1860 tcccagatcc cgccaaagtg tggggcctcc ggggggttga atctccaggt gatctcgttg 1920 ctgtgggcgc gttgaatctc gtcttgggta tcgacactcg cgaaaaactt gtcccaactc 1980 aaaacgttcc ctgctagtgt ctcgattgtc ggattcggtg gagcaggcca tggaatcacc 2040 gaagcagctg tcgctcacgt tttctcgaaa ccgtctgtga tcgaacacca agaagtcgtt 2100 gttgagtttc tcgtcgctcc gtgcatcgtt aaccggttgc ctttgcgctc gttcgacatc 2160 tcaagctgga tcataccgga gtggattgac ctcgcagacc cccacttcaa ccawccggag 2220 aagatcgaca tcctgtttgg catcgtcgag tgggataaga tgatgttgag ccagacccat 2280 aagctcggcg acgagttgcc tacgctgcga cgaacaatct tcggatgggt cgcaggaggw 2340 ccggtgaacg aggaggtcaa ctatcctaca ctgcaagctc tgccggtaac gaacgaacag 2400 ctggacgaac aactgaccaa gttctgggaa gtcgaaagct acagcacgga acgcttcctt 2460 tcccccgaag aacaagctgc tgaggatcac tttgtgacca cccactgtcg agatgaaaaa 2520 ggtcgttacg tcgtcgcctt gccgttcaag gaggaggagg tcaagctcgg agactcggca 2580 cacattgctc accgtcgttt tttgatgctg gaagcgaagc tgatcaagga ccggaagctg 2640 tacgacgact acaggaagtt catgcaggag tacatcgacc tggggcacat ggaaagagtt 2700 ggcactttct ctctcgctga accacaagaa ctgccctact actttttgcc gcatcacgcc 2760 gtcgtccgac ccgaaagcag taccaccaaa ctacgggttg tttttgatgc ctcggcgaaa 2820 tccagcaacg gacagtccct gaacgaccaa ctcttggtag gaccgaccct ccaaccccgc 2880 ctgatcgaca atctgctcag ttttcgaacg tacaaagtcg cagtcacctc ggacgtcgca 2940 aaaatgttcc ggcaaatcga tgtccggaaa gaagatcgtc gattccagca gatcctgtgg 3000 cgaaccagtc ccgacgaaga catcggcgtc tatcaactga cgacagtcac ctatggcacc 3060 gcttgcgctc cctttttggc aacccgggca ctgctgcaaa cctgtaccga cgaagctgag 3120 catttccccc tcgcagcggt cttcggcaag aactcggtgt acatggacga cgtgcttttt 3180 ggagccgaga cagtcgaaga agccctcgaa atgcagcggc agttgaccgg aatgctcgga 3240 aaggcgggat tcgagttgca maagtggtgt tctaaccgca gcgaactatt ggccggcata 3300 ccagaggaca aactggagca gaaagtgcta ttcaccgaag aaggcaaaac caagaccctc 3360 gggatwacct ggcaacccga tctcgacgtc ttcagtttcg acatccactc gattgcgttt 3420 tcggaaggac ctccaactaa gcgcaaagtt ctctcggaca tctccaagtt cttcgatccg 3480 tgtggactag cagctccagt ggtcatgacg gscaaaatct acatgcaaga cctgtggcga 3540 ggaaagaaka agtgggacgt agcactcaag gaagatctgg ccaagctgtg gatcgagtac 3600 agggacgagt tgtcggagat gaaaaccatc aggatcagtc gctgtgtgct tccccttcgt 3660 agaacggtct ttgtcgagtt gattgggttc agcgatgcgt cgcaaaaggg ctatggggca 3720 aatttgtacg tacgatcagt aaaccacgaa ggtcaaaccg caattcgttt gttgtgcgcc 3780 aagtcgcacg tcgcaccgct taaagaaaaa agagccttga ttccgcgctt agaaatttgt 3840 ggatgtgtga agctcgctca actagtcgac aaagtaaaac gatcactacc aatcgaattc 3900 gaccgcataa cttgctacac ggattccatg atcgccttga gttggatcaa cacctgtcca 3960 agcaaacttg aaccgttcat cgccaaccga gttgctcgga ttttgcgatt gaccagcaag 4020 atcgactggt tccactgccg gaccgaggac aatccggcgg atctgctctc gcgtgggatc 4080 atgcctggaa aactagcaac gtgcggaaag tggtggaagg gtccgccatt catgatcgac 4140 cacaagagac caaccagtgt gcagctcatc cctgaagatc aacttccggg actactcgtc 4200 gtctgcgccg tggaaacggt ggagagattt tcgctgttcg acaactgcga gaagtacttc 4260 accatggttc gagccgtcgc acggatccag cgctggcacc agaacgcgtg caagcggagt 4320 ccagagaatc gtcgccacgg tcgtctgttc agcccggagg agatgcgtcg agccacgccc 4380 cgagcatggg taaataacaa gggaaatgcg ttataatacc tttctatggt atcaatacca 4440 aattttggta ttcctggacc ctttaaaaat tgtcttaagg attatgcaag agcgaaatgt 4500 gctatcatac caattaaagg tattgttttg atattgcaaa attgcagaat ttgtcaatgg 4560 tcgaacacca cattttggta ttcttttggt attacagtgt tttatcaaat tcctctgaaa 4620 ctatgggaaa attttgacaa agtaattaat tttgcgctaa aatgatgtct caaagcgaat 4680 gctttcattg aaaaccggaa atttcaaact tgatttcaaa catttaagaa atgacttgac 4740 attgtggaaa attttctaag tcaggatgcc actttgtccg aagctaaaaa acgtcgattt 4800 ctctaatttg ggataaccga atacttgaaa aacgagtagc gacatttgct gttttaattt 4860 aagttttctt taaactattt ttttcagtgt tttttttaga tttcaagaac ttcaagcaca 4920 ggccgttcat caatgacaaa tgcattcaat gtggtgaaga atatcactaa gaacaacatg 4980 gaatcatcag gtgagaagat ttggctgttg catatggatc atttccgttc tttcactaac 5040 tgcaactgtt gcactaaaaa tggtatgaaa atactagatt tggtattctt gacaaatacc 5100 agcgaccaaa atgtgctctt ggttgcccag taatagatgg taaaatacca tgaaatcata 5160 ccaaaatctg tatgggacac aggaatacca actgtttcca agggcgcggg aataccaaat 5220 aaaccattgc aataccaagt tagtttcgca ttcgatccag acccaacttg gtattgatgg 5280 ttttatttga ggtatttttt acccttggca ccccaccaac acaaccacca ccgagcaacc 5340 aagggcacat tttggtccct gttatttgcc caagtaatac caaaatttgt attcccgtgt 5400 tatttcccct gctcggggct ggccctcgtt cgtctcgcac aggaggaagc ctttccggaa 5460 gtattcgccc aattgcgcaa aaaggtaacc ctccagaacc actcgctaat cccgctttcc 5520 ccattcacag atgaccgcga cgtcctcagg gtcggaggac gcctctccaa gtcgtcctgc 5580 tcgtacgaca cgaaacacca gatgatcctg ccacagaaac atcccttaac tgttgccatc 5640 atccgcgata agcatacgca aaacatgcac gctggaatcg agctgacctt agctgccagt 5700 cgaagggagt tctggatcat ccgtgggaga gtcgcggtca aagatgtgat tgggaagtgc 5760 gtcgtgtgct tcgaagacaa ccctcgccca gttcgccagt acatgggaga cctgcctgcg 5820 tgccgcctgg aaagacggta cccattccat caagtcggaa tcgatctctg tggaccagtt 5880 ttcatcaagc agcgcaacaa acggtcgact gttcaacaca aggcctggat cgtgctgttc 5940 gtctgcctgt cgacaaaggc aatccatctc gagctcgtca gcgagttgtc aacggcggcc 6000 ttcctagccg ctttcgatcg tttcgtcagt cgtcgcggtc kccctgctgt tgtttggacc 6060 gacaacggca ccaactttgt cggcgctgct aacctgcttg cggagtggga caagttcttc 6120 gcgagtgtcg atacccaaga cgagattcaa cgcgcccaca gcaacgagat cacctggaga 6180 ttcaaccccc cggaggcccc acactttggc gggatctggg aatcgaacat caagcaggcc 6240 aaaaagttac tggtgaagca cacggcggga gcagccctat gtttcgaaga gatgtcgaca 6300 gtactcgcac gtatcgaagc aatcctcaac tctcgtccga tcacgccact ctcgaacgac 6360 ccggaggatt tcgagccact aacacccgga catttcttgg tgtttcgtcc gctgaacgct 6420 gtcgcacgtc ccgaggtcga ccgaaccaag cacgcacgat cacgatacga gcacatcacc 6480 aaaatcgtgc aacacttctg ggatcgctgg cacttcgagt acctgacgwc gttgcagcag 6540 aggtacaagt ggcaccacaa aatcgtcgtt gaagttggac agctggtgct gatcaagcgg 6600 gacaacatgc cggtgcagaa gtggctactc ggtcgagtcg ttgagctgtt ccctggcgtt 6660 gacggagtca cgcgtgtcgt caaggtcaag accaaggatg ggatccttaa gcgagcggtt 6720 gcaaagctgt gcttccttcc aatcgacccc gagaacggtg ttgaaagcgc agcctttcaa 6780 aggcgggagg a 6791 // ID BEL-37_CQ-I repbase; DNA; INV; 5926 BP. XX AC AAWU01044402; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-37_CQ_; KW BEL-37_CQ-LTR; BEL-37_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5926 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 227-227 (2011). XX DR Genome; AAWU01044402; Positions 10784 4859. XX CC Positions [4973-5557] - Integrase core CC 'GGATG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 530..5926 FT /product="BEL-37_CQ-I_1p" FT /translation="MSDQQTPTGVSSGITRRNILDALTGSKGSVVKTVAQQ FT KADQEAQQQKRRMEKNLEALEDRRDMLVEKLERMKGSLEEVTVSVHLLKLY FT EETLRRSADDYDKNYSDIIVLLPKEKRAEQRGKYAAFEKLHSTVYEKLQAR FT IAESELTTKLLELPKAPAAAVAPPQPVYIQAPAPVPHLQAPFPTFNGKPEG FT WYSFKNLFQSIMSRYTNDPPAMKILYLRNALVDDAKDKIDQDVVNNNDYDA FT AWRILEDEYEDKRLILDTHIDAILDCPKVAKESRGKSISKLVEVCAKHSEA FT LAGHGYPVEGLAELVMVNVIYKKLDKETQEQWELKIGSGALPEFEEFMEFL FT RERGRVLQRTNRSQQQPAQQSTVAPSKQRQAFGQKAQPLLKSFVQATTEVC FT GHCKEDHPIYRCSTFRDLSVPERKSVATKANLCYNCLKARHRVSDCESKTL FT CKVQGCGRKHHSLLHDSGTRSNQGQVKKEESSQCLPVAPSVPAGGPATAPA FT QQSSAVSLCANVGGAKHQVLLSTAEVMVIGQADSTVKCRALLDSGSDSNII FT TEKLANKLKLKMDRVDLPVSGLNDIQTRVKYLLSTKIESCVNAFASPVLDF FT LVVPRITSNLPVVEIDVAAWPLPPGLQLADPKFHTPGEIDLIIGNEIFFDL FT VKSGRVKIGRNAPTLAETELGWVLGGAVPTRKAKPCPRVCQLNHFEEELNK FT TMVKFWEIECVHPETDLTVAEVAVEEHFQRTHSRDAEGRFVVRLPFNNQKS FT QLGDSYNNARKRLDKLMISLARNPAKRVEYSAFLTEYLTLGHMKEVQHVND FT GGYYIPHHAVYKASSSTTKTRVVFDASAKTTSGLSLNDTLDVGPTVQNDLL FT SIILKFCTHPVVLTADIPKMYRQVKLHQENCKYQRILWLDDDRQLKVYELQ FT TVTYGLASSPHHATRALVQLATDEGKDLPLAARVVKRDSYVDDFLTGGSSV FT AEVVQIYKELTELLRRGGFGVHKFCSNRTEVLRAIPSELHEKQVDFGESDV FT NNTIKTLGLIWNPLEDYFVFRVPRPINEEWTKRIVLSEISSLFDPCGFLGP FT IITLCKLVMQDVWVRALTWDQPLPVELSLRWRLIRVQLPAISKMRKKRCLI FT APDAVNVQLHGFSDASKRAYGGVLYIRSVNKRGEISVCRAASKSRVAPLKE FT QTIPRLEACGAKLLAELTQKVMAALEVPFDDVVLHCDSKIVLCWLKKPPSA FT LKQFVSNRVAAIQKLTREFTWRFIRSEHNPADVLSRGALPEDLLEHELWED FT GAPELWEHQLPEDLSEPFDESQLPELKPAIVLTIVNPKPAIDFTRCSNYRR FT LQRSWVYVERYINLAVHKRPETSGISADEVARAEKTILAVVQKQAFGDLLK FT LLRSNSNKQHSWSNLAPFLGEDGLIRVGGRLKYSAIPYDGKHQVLLPDGHH FT VTKILVRKLHEENFHVGQNGLLAIVRERYWPLRAGLLIKQVVTKCQICRRQ FT NPKPGSQFMGNLPGYRINPAPPFSKVGIDYAGPFMLKTGGRGSRLQKAYVV FT VFVCMVVKAIHFELVTNLSTDNFIAALQRFASRRGLPSDIHSDNATCFAGG FT NNELAALRELFNSQLHKNKLEEFCSLKGIRWHFIPPRSPHFGGIWEAGVKS FT MKHHMKRVVGETKLTYEEMTTFLAQTEAILNSRPLCPMSEDPNDLTVLTPS FT HFLISRSGVALPEPSYADEKVGRLSRWQHIQYMQQHFWKRWSGEYLHQLQG FT RPKWNAGVKEFKVGELVLLIDENLPPQQWRRGRIVATHPGEDGAVRVVTVK FT TATSEFKRAITKVALMPSVEPDASTGGE" XX SQ Sequence 5926 BP; 1432 A; 1634 C; 1740 G; 1120 T; 0 other; ttttttggtc cgttctcccg gatagtattc ggagaatttg gagctggttc ccggttcgtg 60 agtgattagc ccggtgctaa gttcggctgc ggccaaagtg aagtcggttc ttgtgtccgt 120 ggcacaagtt ccggaaaacc gccatcttgg gcgaagtgga ctcttgggtt ggtgctacgc 180 caccaagaaa gtgcgtgccc gtggcacgga agttttagaa ccagttgaga acgctcgact 240 gtacggaagt ttggggtcgg cgctacgcca ccaagtgtgt gtgtgtgtgc ccgtggcaca 300 aatggaaaaa gtggtcgcca ttttggcgtg gaaaactgag tggtcgccat tttggcgtgg 360 aaaactgacg ccggttgcga tcggctgcag tagtgtgtgt gcgtgtgtgc ccgtggcacg 420 tcgaaaagga cgccatcttg gcaagtgcta acaaagtccg tctctgcgcg agaccagtga 480 gtgcgagaaa acaaaaagtg gccattgaag cggcaaaaag tgtgcacgga tgagcgacca 540 gcagacgccc acgggagtgt cgtccggaat tacgaggagg aatattctcg acgctctgac 600 gggatccaaa ggaagcgtgg tgaaaaccgt cgcacagcag aaagccgacc aagaagccca 660 gcagcagaag cgcaggatgg agaagaactt ggaagcgttg gaggatcggc gggacatgct 720 cgtcgagaag ttggagcgga tgaaaggttc gctggaggaa gtgacggtga gtgtccacct 780 gctcaagctc tacgaggaga cgctgcggcg tagcgcggac gattacgaca aaaactactc 840 cgacatcatc gtcctgctgc caaaggaaaa gagagccgag cagcgtggga agtacgcagc 900 gtttgaaaag ctgcacagta ccgtgtacga aaagttgcaa gcgaggatcg ccgagtcgga 960 gctcaccacg aagctgttgg aactgccgaa ggcacccgcc gccgcagttg cccctccgca 1020 acctgtctac atccaagccc cagcgccagt cccgcacctg caagctccgt tccctacttt 1080 caacggcaaa ccggagggct ggtacagttt caagaacctg ttccagagca ttatgtctcg 1140 ttacacgaac gaccccccag ccatgaagat tctgtacctc cgaaatgcgc tcgtcgacga 1200 cgccaaggac aagatcgacc aggacgtcgt aaataataac gattacgacg cggcctggag 1260 gattttggag gacgaatacg aggacaaacg cctcattctg gacacgcaca tcgacgccat 1320 tctggattgc ccaaaagtcg ccaaggagag tcgcggcaag tccatctcca agctggtcga 1380 ggtgtgtgcg aagcacagtg aagctttggc tggtcacggg tatccagttg aagggctagc 1440 cgagctagtg atggtgaacg tgatctacaa gaagctcgac aaggagacgc aagagcagtg 1500 ggagctgaag atcggcagcg gagcgttgcc ggaattcgag gagttcatgg agttcctgcg 1560 ggaacgtggg cgtgttctgc agcgcacaaa tcgctcccag cagcaaccgg cgcagcagtc 1620 aacagtggcc ccaagcaagc aacgccaagc gttcggacag aaggcccagc ccctgctcaa 1680 gtcgttcgtg caggccacaa cggaagtgtg tggccactgc aaagaggatc acccgatcta 1740 ccggtgctcg acgttcagag acctgagtgt accggagcgc aagtccgttg ccaccaaggc 1800 aaacttgtgt tacaactgct tgaaggcgag acaccgggtg agtgattgcg agtccaagac 1860 tctgtgcaag gtgcaaggct gcggccgcaa gcaccacagt ctcctgcacg acagtggtac 1920 acgaagcaac caaggtcaag ttaagaagga agaatcgagc cagtgcttgc cagtggcacc 1980 ctcggttcca gcgggcggtc cagcaaccgc tccagcgcag cagagcagcg cggtgtccct 2040 ctgtgcaaac gtcggcggtg cgaagcacca agtgctgctg tcgacggcgg aagtgatggt 2100 gatcggccaa gccgattcca ctgtcaagtg tcgtgcgctg ttagactcgg ggtctgacag 2160 caacattatc accgagaagt tggcgaacaa gcttaagctg aagatggatc gcgtggacct 2220 accggtgagc ggtttgaacg acatccagac ccgagtgaag tacttgctgt ccacgaagat 2280 cgagtcctgc gtcaacgcct tcgcctcacc cgtgttggat ttcctggtcg taccacgcat 2340 cacgtcgaac ttacccgtgg ttgagatcga cgtcgcagcc tggcccctgc cgccaggact 2400 gcaactagct gacccaaagt tccacacacc tggagagatt gaccttatta tcggcaacga 2460 aatcttcttt gatttggtca agagcggtcg cgtcaagatc ggccgcaacg cgccaacatt 2520 ggccgagacg gaactcggat gggtcctagg aggagcggta ccgaccagaa aggccaaacc 2580 atgcccgcgt gtgtgccaac tgaaccactt cgaggaggag ctgaacaaga cgatggtgaa 2640 gttttgggag atcgagtgcg tccacccaga gaccgacctg accgtagctg aggttgccgt 2700 cgaggaacac ttccagcgca cccactcccg agatgccgaa ggccggttcg tcgtccggct 2760 tcctttcaac aaccagaaga gtcaactggg cgactcgtac aacaacgcgc ggaagcgctt 2820 ggacaagttg atgatctcgt tggccaggaa cccagccaag cgtgtcgagt attccgcgtt 2880 cttaaccgaa tatctcacac tgggccacat gaaggaagtg caacacgtga acgacggagg 2940 ttactacatc ccacaccacg cggtgtacaa ggcctcaagt tcgaccacca agacaagagt 3000 cgttttcgac gcttcggcga agaccacgtc tggtctgtcg ctgaacgata ctctggatgt 3060 tgggccaaca gtgcaaaacg acctcctgtc gatcatcctg aagttctgca cccacccggt 3120 cgtgctgact gccgacatcc caaaaatgta ccggcaggtc aagctgcacc aagaaaactg 3180 caagtatcaa cggattttgt ggttggatga cgacagacag ctgaaggtct acgagttgca 3240 gacggtgacc tacggcctcg ccagctcacc tcaccacgca acaagggcat tggttcaact 3300 ggccacggat gaaggcaagg atctaccgct cgcggcgcgg gtggtcaagc gggacagcta 3360 cgtcgatgac ttcctgaccg gtggttcgtc cgtcgccgaa gttgtccaaa tctacaaaga 3420 actgacggag ctgctgcgcc gaggaggttt cggcgtccac aagttctgct cgaacagaac 3480 cgaggtacta cgagcgattc catctgagct gcacgagaaa caggttgatt tcggggagtc 3540 cgatgtcaac aacacaatta aaactctcgg actcatttgg aacccgctcg aggactactt 3600 tgtctttcga gtgccccgac ccatcaacga ggaatggaca aaacgaatcg tgctgtcgga 3660 gatctccagc ctgttcgacc cctgcggctt ccttgggccg ataatcacgc tgtgcaagct 3720 ggtgatgcaa gacgtctggg ttcgtgcgct aacctgggac caaccacttc cggtcgagct 3780 ttcactgcgg tggaggctaa ttcgagtgca gctgccagct atcagcaaga tgcggaagaa 3840 gcgatgcctg atcgccccgg atgcggtgaa cgtgcaactc cacggctttt cggacgcctc 3900 gaagcgagca tacggtggtg tgctgtacat ccggagcgtc aacaagcgcg gcgagatcag 3960 cgtgtgccga gctgccagca aatcacgagt ggctccactc aaggagcaaa caatcccccg 4020 gctggaagcg tgcggcgcaa aactgttggc ggagttgaca caaaaggtca tggccgccct 4080 ggaagttcca ttcgacgacg tggtgcttca ctgcgactct aagatcgttc tgtgctggct 4140 caagaaacct ccatcggcac tgaagcagtt tgtgtcgaac cgtgtagccg caatacagaa 4200 gttgactcga gagttcacgt ggcgtttcat ccgatcggaa cacaatccag ccgatgtgct 4260 ctcccgggga gcgctgccgg aagacctgct ggagcacgag ctttgggagg acggcgcacc 4320 tgagttgtgg gaacaccagc tacccgagga tctgtccgag ccgttcgacg agtcccaact 4380 accagaactc aagccggcga ttgtcctgac gatcgtcaac ccgaaaccag cgatcgactt 4440 tacgagatgc agcaactacc gacgactgca gcggtcgtgg gtgtacgtcg agcgctacat 4500 caacctggcg gtgcacaagc ggcccgaaac atctggcatc tcggcggacg aggtggcccg 4560 agcagagaaa accatccttg cggtggtcca aaagcaagca ttcggtgacc tgttgaagct 4620 gctgcggtcg aactccaaca agcaacactc gtggtccaat ctggcgccgt tccttggaga 4680 ggacggtctc atccgagtcg gtggccgttt gaagtattca gcgataccct acgacggcaa 4740 gcatcaggtg ctcctacctg acggccacca cgtcacgaag atcctggttc gcaagctgca 4800 cgaagagaac ttccacgttg gccaaaacgg actgttggcc atcgttcgcg aacgttactg 4860 gcccttgcga gcaggtctgc tgatcaaaca agtcgttacg aagtgccaaa tctgtcgcag 4920 acagaaccca aagcctggta gtcagtttat gggcaaccta ccaggatacc ggatcaaccc 4980 cgcgccacct ttctccaagg tgggcatcga ctacgctggt ccgttcatgc tgaagaccgg 5040 aggacgagga tcaaggctgc agaaggcgta cgtggtcgtc ttcgtatgca tggtggtgaa 5100 ggccatccat ttcgagctcg tgaccaacct ctcaaccgac aacttcatcg ccgcgctgca 5160 gcgattcgct agccgacgtg ggctcccaag cgacatccat tcggacaacg ccacttgctt 5220 cgccggtggc aacaacgagc ttgcagccct tcgagaactg ttcaacagtc aactgcacaa 5280 gaacaagctg gaggaatttt gcagtctcaa gggcatccgc tggcacttca ttccaccgag 5340 aagcccccac ttcggaggga tttgggaggc aggagtgaag tccatgaaac accacatgaa 5400 gcgggtcgtc ggcgagacca agctcacgta cgaggagatg acgacgttcc tggcacagac 5460 ggaggccatc ctcaactccc ggcccctgtg cccgatgtct gaagacccga acgatttgac 5520 ggtgctgacg ccgtcgcact tcctgatcag ccggtctgga gtggctctac cggaaccgtc 5580 gtacgcggat gagaaagtcg gacgtcttag cagatggcaa cacatccagt acatgcagca 5640 acatttttgg aaaagatggt ctggcgagta cctccatcag ctgcaaggac gaccgaagtg 5700 gaacgctgga gtcaaggagt tcaaggttgg tgagctggtc ctgctgatcg acgagaactt 5760 gcctccgcag cagtggcggc gcggacggat cgtcgctaca caccccggag aagacggcgc 5820 tgtcagagtg gtgacagtga aaactgcaac gagcgagttc aagcgagcaa tcaccaaggt 5880 tgctttgatg ccttcggttg agcctgacgc ctcaacgggg ggagaa 5926 // ID Nimb-2_AAe repbase; DNA; INV; 6134 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Nimb clade non-LTR retrotransposon family from Aedes aegypti. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; I_Ele6; KW Nimb-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6134 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6134 RA Kojima K.K. and Jurka J.; RT "Nimb clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as I_Ele6. CC [2] Consensus update and re-classification. This consensus is CC generated from 15 sequences with >98% identity, and ~100% CC identical to the original sequence in [1]. It is classified as CC Nimb in the RTclass1, thus renamed. XX FH Key Location/Qualifiers FT CDS 630..2009 FT /product="Nimb-2_AAe_1p" FT /translation="MATAISSLDPPWGGHSKHGADRNDRTIPVWMDRHGMH FT GQRIILSLRAAEGENLPENPFILSKSINQVCGGNVEAANTEEKDTKYVLTT FT RSVNQAQKLLKMKQLIDGTKVEVVLHPKLNVRNCVITCRAAIKMTEENLLE FT ELAPQGISKVRRITRYENGQKVNTATLVLTVNGTVVPEYINVGLLRVPTRL FT FYPAPMLCFNCYSYGHTKMRCQNTPVCKVCSGTHTLGKDVSCTQQPACKNC FT KDSHSPASKKCPLYMKEEEIIKIKVESGVSFAEARKEYQRMHGDRSYSAVT FT GAQARIQDIRKENEKDVEIRKLKEELANLKEVMSQKSEKEKEDETVKLREE FT VIQLRKIIDDFKNGPAKQQMDTEERNEFSISEEEMSNAESQSVNNDNESEG FT YAPVAKAVRRKSKRTLAKSVKTVREKEGDDTDRSRSSRSGHEPINKEKSNS FT SQKKRGRPPKHRDDQ" FT CDS 1897..6060 FT /product="Nimb-2_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTPTEAEVAEADMSQSTRKSQTLAKRSEEGHPNIEMT FT SERNMLNDGNNDMDRSNGNGTENDICDEGRSETVSWLGLDEEKVVSPLFRR FT GENSPRNESEDVTLPPVLCQLPRSSANYAATNEAQLRNGTTQASRIRNQAT FT KLQPAILSWNIQGIQTGRQDLDLIIKDRKPLVICLQETMCSQRTQPRITGY FT TIYNRPRPNCQRASGGVLIAVKNGIDNVEIEMETTLEAIKVNIGPPLNFSI FT INVYFPPGANLNISEVEALFAGPNPVLLVGDINAHHPVWGSSFSNKRGKEL FT EEVFDKNDLVILNSGEPTYYNQTTGKGSCIDVAVCSTHLAGNLEWEVLGDS FT YGSDHMPTLICNPRETQENPTRPKWKIQEADWEQYQRIVDFPLIGDPEEQM FT KGITSGIISAAEQSIPRTKGAPSRRVVPWWNDEVHVAVKKRRTALRRMKSA FT PSDERREELARDFKTARREARQVIKSAKEESWKQFVSSFNVHTSVKDMWNK FT YRSLQGRCRQACIGGIKYEGREETTSVGIAEAIAEAFCSVSSDTSYSEHFR FT DVKQQTESNPFLMFDAPEEEYNKDFSMYELDEALESLKDSAPGEDNVHYGM FT LSNLPLDCKYKLLETYNKLWQQSVYPVEWTKSIVIPIYKGKGDRTNPLNYR FT PIYLNSCVAKILEKMVNSRLIFILETKKLLHKYQYGFRKAKSTTDHLVELE FT GTIREHLNKGHYAQAVFLDISKAYDTTWRRLILEKLKFWKIGGHLLRFIER FT SLACRTFRVLANGTLSSEKKMENGLCQGSVLSVTLFLVAINTIVEKLPGEV FT KCLIYADDVVIIYGGNDFEKNEEILQNALNTISDWQQRTGFNVSAEKCATV FT TFKTARARKVPLTSSLTINGSKIPKEKTFKCLGVWFDQSLRFREHVENIRA FT SCQQRMKILKCVASSSWGGDRETICKLYKATILEKMLYAAPLISSVSKDIL FT KKLEVVHNAGMRAISGAFRSSPSESIRAEAGMPSFNTYVDQRTALYAVKLK FT ALDPQIIEETCETTSSEDESQETTTYESSSGEEWGSVKPVPELTIRERGGH FT LLQELEVQVPLLNIFTLPSCAPWKRKKILVDKTLYTAMRRKQPEIMLRKLF FT NERRVTKYRVHKTLYTDGSKMSNKCGYSVVSDNMSIRKRIYDESSIYGAES FT LAAKEAIMLVANEEGAGAYLICTDSLSVVTCLESSKVKSLWKDQMLEIYTQ FT CLQNRKEVTFFWVPSHIGIEGNEKADREAKLALEQRIDTSIALDYKEMKKI FT LNNKIIIKWQYEWGKIRENKLREVKDSVIPYKTRGKTRRENVVLTRIRIGH FT TVLTHKHLFERMAAPMCQWCNELLTVKHLLVECISLDEERKKVGLPISLRE FT ILADDEERIESVICYLKNINVYNSL" XX SQ Sequence 6134 BP; 2175 A; 1110 C; 1443 G; 1406 T; 0 other; gagtttggtt ggtaacccta ttacagacaa gtttgtttgc atatgaagtg aattttctaa 60 gaactttttc gaagcgttat aagtgacaga aaattaagct tccaagaata ccatttgtac 120 gagagttcag agagtcagta gagcgcggag aatattacta gcattctgta gtttgcttat 180 ttcgactgtt tcaatagtgt caataaaagc aaaacaccag tccagtttgc tgtagacaat 240 tgtacacaag tcgactagca agcgcacgct tgcgtgctac atagcacgaa gaagcgctgc 300 ccagtgcatt gccacctgct aggcgcaatc ggcagcttca ccagaaacaa atctctccgt 360 ggtttgtgga gaaaccactg tttgcgtggt ggagatcgta gtgactaaga actaccaagc 420 gtccccatcg aaattgattt tctgtgaacg cgcgcttaaa ctattcacct ggaccttcaa 480 cccttgggtg atatagtagt tctaggtaaa agggtagtcg ttacttgcta cggcaagaac 540 gcagttggga gtgtagaata ttcaatttga gtgaccagca aacataagct gacagaagga 600 agctagctaa ctcttcgctg ttggccaata tggccacagc tatttcgtct ttggacccac 660 cctggggtgg ccattcgaag catggagcgg atagaaatga tcgaaccata ccagtatgga 720 tggaccgaca cggaatgcac gggcaacgca ttattttgtc tctacgagct gctgaaggag 780 aaaacttgcc agaaaatcct ttcattctca gtaagtctat caatcaagtc tgtggaggaa 840 acgttgaagc tgccaacacc gaagaaaagg atacaaagta tgtactaacg accaggagtg 900 tgaatcaagc gcaaaagctg ctgaagatga aacaattgat cgacggaact aaggttgagg 960 tagtcttaca cccaaagctg aacgttagaa actgcgttat cacctgcaga gcggccatca 1020 aaatgactga agagaaccta ctggaggagt tggcaccaca aggtattagc aaggtgagaa 1080 gaattacccg ttacgaaaat ggacaaaaag tgaatactgc gacactagta ctgaccgtca 1140 atggaacggt agtaccggag tacatcaacg tcggactttt gagagtgcca acacgattgt 1200 tctatccggc accaatgtta tgcttcaact gctactccta cggacacaca aaaatgagat 1260 gccagaacac acccgtgtgc aaagtatgct ccggtactca tacacttgga aaagatgtaa 1320 gttgtaccca acagcctgca tgcaaaaact gcaaggatag tcattcaccg gcgagtaaga 1380 aatgtccatt gtacatgaag gaggaggaaa ttataaaaat aaaagttgaa agtggtgtgt 1440 cctttgccga agccaggaaa gaatatcaac gaatgcatgg agatcgttcg tattctgctg 1500 tgactggcgc gcaggcgagg attcaagaca ttaggaaaga aaacgaaaag gacgtggaaa 1560 ttcgaaaact caaggaagaa ttagcaaacc tcaaggaagt gatgtcacaa aaatctgaaa 1620 aggaaaagga ggatgaaaca gttaaactgc gcgaagaagt aattcaactc agaaagataa 1680 ttgacgactt caagaacggt ccagcaaagc aacaaatgga tactgaggag cgaaatgaat 1740 tttccatcag tgaagaagaa atgtctaatg cagagtcgca atctgtgaac aacgataacg 1800 agagtgaagg atatgcacca gttgctaaag ctgttcgtag gaaatcaaaa agaacgttag 1860 caaaaagtgt gaaaactgta agagaaaagg aaggagatga caccgaccga agcagaagta 1920 gcagaagcgg acatgagcca atcaacaagg aaaagtcaaa ctctagccaa aagaagcgag 1980 gaaggccacc caaacataga gatgaccagt gaaaggaata tgctaaatga cggaaataac 2040 gatatggacc gaagcaacgg aaatggcacc gagaacgaca tctgtgatga aggaaggagc 2100 gaaacggtaa gttggttggg gttagatgaa gagaaagtag tctccccact cttcagaaga 2160 ggagagaata gtccgaggaa cgagagcgag gacgttacac tcccacctgt gctctgtcag 2220 ttgcctcgga gttctgccaa ctacgccgca acgaatgaag cacaattaag aaatggcacc 2280 acccaagcaa gcaggattcg aaatcaagca actaagctac aacccgcaat attatcatgg 2340 aacatccaag gaatacagac aggaagacaa gatttggacc ttataataaa ggatagaaaa 2400 ccactagtga tatgtttgca agaaacaatg tgctctcaac gcactcagcc tcggattaca 2460 ggatacacca tttataaccg accacggcct aactgtcaac gagcgtctgg aggagtactg 2520 atcgcagtta agaacggcat tgacaacgta gaaatcgaaa tggagaccac gttagaagcc 2580 attaaagtaa atattggtcc tcctttgaat ttctccatca taaatgtcta cttcccaccc 2640 ggagcaaacc ttaatatatc ggaagttgaa gcactttttg cgggcccaaa tcctgtatta 2700 ctagtgggag acataaatgc tcatcatcca gtatggggtt ccagttttag caacaaaaga 2760 gggaaagaat tagaagaagt atttgacaaa aacgacttag tgattcttaa cagcggagaa 2820 ccaacatact acaatcaaac aacaggtaaa ggatcatgca tcgacgtagc agtctgttct 2880 acacatctgg ccggaaattt ggagtgggaa gtcctaggag atagctacgg tagtgatcat 2940 atgccaactc taatctgtaa cccaagagaa acgcaagaga atccaaccag acctaaatgg 3000 aaaattcagg aagcggactg ggagcagtat caacgtatcg ttgatttccc tttaataggg 3060 gacccagaag aacaaatgaa gggaattact tcgggaataa tatctgcagc tgagcaatcg 3120 ataccacgaa cgaaaggagc accgtcgcgt agagtagtcc catggtggaa cgacgaagtc 3180 catgttgcgg taaagaaacg acgaacggca ttaagaagaa tgaaatcagc accaagcgac 3240 gaacgtagag aagagcttgc cagagacttc aaaacagcta gacgtgaagc acgccaagtg 3300 atcaaaagtg ctaaagaaga gtcgtggaaa cagtttgtgt cttctttcaa tgtgcatact 3360 tcagttaaag atatgtggaa caaatacagg tcgctccagg gaagatgtag gcaagcttgc 3420 attggaggaa tcaaatatga aggtcgcgaa gaaacgacaa gtgtgggaat agctgaagca 3480 attgcagaag ccttttgcag tgtctctagt gatacctcgt acagtgaaca tttccgtgat 3540 gttaaacagc aaactgaaag taaccctttt cttatgttcg atgcaccgga agaagagtac 3600 aataaagact tttcaatgta tgaattggat gaagcactgg agagtctgaa ggactctgcc 3660 ccaggtgaag ataatgtgca ttatggaatg ttgagtaact taccgctaga ttgcaaatac 3720 aagttgctgg agacttataa caagttatgg caacaatcgg tgtaccctgt agaatggact 3780 aagtcaattg tgatcccaat atacaagggt aaaggggata gaacaaatcc actgaactat 3840 aggccaattt atttaaatag ttgtgttgca aaaattttgg aaaaaatggt gaacagtcgt 3900 ttaattttca ttttggaaac caaaaagtta ttgcacaaat atcagtatgg attcagaaag 3960 gcgaaatcca ccacagacca tttagtggaa ttggaaggaa caattagaga acatcttaat 4020 aaaggacact atgctcaggc tgttttcctg gatatatcga aagcctatga cacaacgtgg 4080 aggaggctca tactggaaaa gttgaagttc tggaaaatag gtggacatct gctcagattc 4140 atagaacgat ctctggcatg tcgaaccttt cgtgtgctcg caaacggtac gctttctagt 4200 gaaaagaaaa tggaaaacgg attgtgccag gggtctgtgc tcagcgttac actctttctg 4260 gtggccatta acactattgt agaaaaacta cctggtgaag taaagtgttt aatctacgct 4320 gatgatgtcg ttatcattta cggtggcaat gactttgaga aaaatgaaga aatactacaa 4380 aatgcgttga acacaataag tgactggcaa caaagaactg gattcaacgt gtccgcggaa 4440 aaatgtgcta cagtgacttt taaaacagct agagctagaa aagtgcctct gacaagctcc 4500 ctgacaataa atggatcgaa aataccgaag gaaaaaacat tcaagtgttt gggtgtttgg 4560 tttgatcaaa gtttaaggtt cagagaacat gttgagaata ttagagcatc gtgtcagcag 4620 agaatgaaaa tattaaaatg tgtagctagt agctcatggg gtggtgatcg agaaacaata 4680 tgtaagctgt acaaggcgac gattctcgag aagatgttgt acgctgcacc tctgatctct 4740 tcagtgtcca aagatatttt gaaaaaactt gaagtggtac acaatgctgg aatgagggct 4800 ataagcggtg catttcgttc aagtccatcg gaaagcataa gggctgaagc aggaatgcca 4860 agcttcaata catacgtgga tcaacggaca gccttgtatg cagtgaaact gaaagctctg 4920 gatccccaga taatcgaaga aacatgtgaa acaacaagca gcgaagatga atcacaagaa 4980 acaacaacat atgaaagcag ttcaggagaa gagtggggat cagtgaaacc ggtaccagaa 5040 ctgacaatca gagaaagagg aggccatctg ctacaggagc ttgaagttca ggttccttta 5100 cttaacattt ttactctacc atcttgtgca ccatggaaac gtaagaaaat tctcgtggac 5160 aagaccctgt atacagcaat gcgtcgtaaa caacctgaaa tcatgttgcg gaagctcttc 5220 aatgaaagaa gagtaacaaa atacagggtt cataaaacct tatacacaga tggttcaaaa 5280 atgtccaata aatgtgggta tagtgtagtc agtgataata tgtcaattcg taaaaggata 5340 tatgatgaga gtagtattta tggagcagaa agtctagctg caaaagaggc aataatgttg 5400 gtagcaaatg aagaaggggc gggtgcatat ttaatttgta ctgattcact aagtgttgta 5460 acatgtttgg agagtagtaa agttaaaagt ctatggaaag atcaaatgct tgagatttac 5520 actcaatgtt tacaaaatag aaaagaggtc acattctttt gggtaccgag tcatataggt 5580 attgaaggta acgagaaagc ggatcgagaa gcgaagttag ccttagaaca aagaatagac 5640 accagcatag cattggacta taaagaaatg aagaaaattt taaacaataa aataataatt 5700 aaatggcaat acgaatgggg aaagattagg gaaaacaaat tacgagaagt aaaagactcg 5760 gtcatcccgt acaaaactag aggcaagact cggagagaaa atgtagtcct tactagaata 5820 cgtatcggtc atacggtcct aactcacaaa catctattcg aacgaatggc agcaccaatg 5880 tgccagtggt gtaacgaatt gctcactgtt aaacacttat tagttgaatg tataagttta 5940 gacgaagaaa ggaaaaaagt aggattgccg ataagtttga gggagatttt agcagatgat 6000 gaagaaagaa ttgaatcagt tatatgttat ctgaaaaata tcaatgtata caatagttta 6060 tagaaggttc caccatacac aagacctgaa tgacaataaa gttaaagggt ctataataaa 6120 aaaaaaaaaa aaaa 6134 // ID Gypsy-18_OD-LTR repbase; DNA; INV; 215 BP. XX AC CABV01002725; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_OD_; KW Gypsy-18_OD-I; Gypsy-18_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002725; Positions 8400 8186. XX SQ Sequence 215 BP; 57 A; 49 C; 35 G; 74 T; 0 other; tgatatgaga ttctcatatc tctagagtta tttccgccat ggttaatcaa atagcctaaa 60 ttaactcatc cacttccgcg ttaaccaact gtctcacact atctgcgtca actgtttact 120 tatttcgtgt gctgtgcccg atccgcttgg atcattcaat aaagttgtgg taattgagtg 180 gacatctttt acagactttg agtcaaacca tatca 215 // ID CR1-47_BF repbase; DNA; INV; 1673 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-47_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-47_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-1673 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-1673 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1618-1618 (2009). XX DR [2] (Consensus) XX SQ Sequence 1673 BP; 449 A; 355 C; 346 G; 523 T; 0 other; gacctatatc tctgttgtcg atagccagca aagtcatgga acggtgtctg tataacgcag 60 tgtttcccaa gttgtatgac aagatacact ctctccaaca tggctttgtc aaaggtcgct 120 ctaccaccac ccagctgctt gaggtgtatc accagattgg ctctgttcta gataaggggg 180 gacaagtgga tatgcttttt cttgatttct ccaaagcatt tgactccgtt ccacatatac 240 ttttaaccca caagttacag atgtttggct ttggtggtga actgctgcaa tggcttaatt 300 catatttgac taaccggagg caaagggtcg tggtagaagg cagtctgtcc gagtggcgac 360 cagtgacctc tggagttccc caggggtcca ttttgggacc attattgttc ctcctatata 420 ttaatgacct tcctagttct gccaccaatt ccacagttgc actattcgct gacgactcca 480 agtgttttaa agaaatccgt aatactgacg actgtgttaa acttcagtat gacattacgt 540 caatgtataa ctggagtact acttggagat tatactttaa tctttctaag tgtaaggtgc 600 tgcggcttac acggtctagg aaaccagtca tgtacagtta tgacatgaat ggtacaagtc 660 tagagctagt gagtcgcatg cctgatcttg gtgtcactgc acaattcgac ctcaaatgga 720 acgatcatgt tgtgaacact acagctcgag caaactctat gctaggtttt gtaaaacgca 780 cagttggcta tacgaccagt gtagacagta ggaaatccct ctacataact ctggtgagaa 840 gtatgctaga gtacagctcg ccagtttggt gccccgggtc tcgcaagctg atcagtctta 900 tagagggtgt acagcgtcgt gcatctaagt atatactggg tgctggtcat gaacatcctg 960 actacaaatc caggttgacg tctctgggta tgctaccact ctcatatagg agagaagctg 1020 ccgacgttct actgtttgta aagtctattg ggaacatata tgactcagac atagtgaaat 1080 atgtttcttt ctcatctaga gggactagaa gctttagggc caacatggca acacctttta 1140 ggtctaaaac tcaacagttt gccacctcat acatcccaag actcattagt atttggaatg 1200 ggctggatgg gaaccttaga gccatgggcc gctatgtctc tcagtcatca cacataacta 1260 catttaaacg tcagttgtta aaatacctac acaaccgatt cacacacctt tataatacag 1320 acaccccatg tacttggacg tcgttctgct cctgttcgac gtgcttcgca acaagagcac 1380 gttaaccgtt caactttgta tataaacatt gtgcattttg tatataagtt atattttgtt 1440 tccgattatc actgtgcata ccatttttga ttattcttat tatttaattg aatgtgtatt 1500 gtatataagt tattgtttgt ttccgattat tcctgtatat accattgttg actcttatta 1560 tttacttgta ttgtgtattt tgtttcgggg agacaacttg cagaggtggt atcacctgtt 1620 ttgtctcccc accatttatg ggaagtgtaa taaacaaata aacaaataaa caa 1673 // ID LOA_Ele3B_AAe repbase; DNA; INV; 5698 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Aedes aegypti. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; KW LOA_Ele3B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5698 RA Kojima K.K. and Jurka J.; RT "LOA clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1422-1422 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. The consensus is ~80% identical to LOA_Ele3 and ~76% CC to LOA_Ele3C_AAe. XX FH Key Location/Qualifiers FT CDS 305..1876 FT /product="LOA_Ele3B_AAe_1p" FT /translation="MMQRTVIPQRLVGTNMSTNQNNNNLEGVDCLSQSEPM FT VVEDVQSNTELTEEQLLASSQEDMLTDATEVSNKNSTNHPYPDTQLDMDDD FT NNDGINVIINIPESQPSGSESKTDPSCHPNNANDQQTTKINLTRGQRKKFK FT ALMQSGVSRSEALIQLGKGKDELASSKRGRTDLDNSATSEDVPKQKRTKKR FT LDPRDRAEQSNHDGPMAQQSSDGHQNSGENQTAGHSYSDMTNRRKVGVVPK FT HFPTSHLSTTQLDALQEALLLKVEKQRNEPMKPKFCNLLYKSGYMVLVCKD FT LETADWVKEITPSLIPWEGAELEAMDEEKIQRPEPIRAFFPQSAKYCDERI FT KTLIESQNKITTSSWRILQRSTPNDIHVEWIFTVDGPSMANLTKSNFILNY FT RFGEIQLRKIKGKTTQSNENSTNRVPQEKSKAASSKNPQSNPIKKASLPVS FT KDSSSNQTPSSSGGRLVPSKSSGSGKRLNSTTETKKAGLGKKIDKHEHPKH FT HPKQIQDDPQHPKKNDLRPGNCGTPKND" FT CDS 1812..5531 FT /product="LOA_Ele3B_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MIRSIQRRTTSVLETAAHLKMTKIIQVNLHHARSATD FT VLCRRFTKELFTVALIQEPWVNQSRIQGIHVNSCKLVYDDSQLSPRAAILI FT RNDTKCFPITEFIKRDIVAVRVEVPTARGKADIVMVSAYFPGDAEDVPPPE FT IAALTSYCETKNIPLVIGCDANAHHTVWGSTNINPRGEYLLQFLSSNNIDI FT CNTGDKPTYENAIRQEVLDLTLCSQSISHKITNWHVSDEISMSDHKHIVFE FT WEGGLTIAKSFKDPKKTDWETYSATLRSEDYIINPNIKTVLELEMASDSIQ FT TKILNAYQESCPTKTVKSSRDVPWWNNHLEKLRKTARREFNRAKRSTDWSL FT YRKALTDYNKEIRLAKRKSWVLMCESIEKTPVAARLHKTLSKDHSNGLGTL FT RKTDGSLTVDPTETLSEMLRTHFPDSIPEAIVNANGNRHGVSGQQELLSWD FT SSTKRDALKVAKEAFTRARVERAVRSFEPFKSAGMDGIFPALIQKEEQTLI FT PPMIEIFKASLVLGHIPDGWRQVRVVFIPKPGKKDKTNPKAYRPISLSSVM FT LKIMEKVLGEFINSKFMEAMPLSKYQFAYQSGKSTISALHMLVNKIEKTFH FT AKEIAIVAFLDIEGAFDNASYLSIVSAMHRRKFDPCIATWVHAMLANRRIS FT SELSGSCATVMATKGCPQGGVLSPLLWSLVVDDLLNSLEKRGFEVVGYADD FT VVIIVRGKFESVVLARMQSALDYTLSWCLKEKLGINPSKTTIVPFTKRRKV FT QLQPLFLNQTQLVYSNEVKYLGITLDAKLNWNTHLQNMVNKGLNSLWVCSK FT TCGKKWGLKPNMIMWIYKTIVRPRIAYASLVWWPKTSEVTARAKLNKIQRA FT ACVAITGAVRSTPSFALDAMLNLPRLDQFIKLDAEKSALRLKRSINLLPGD FT LTGHLSILNEFSINPIVEKCSDWMEMVVNYDISYTVFIPSRQEWQEGGPSI FT PPGSIKFYTDGSRMNNRSGSGVHGPRTKISVPLGQWPTVFQAEVYAIIECV FT QLSIKRNYRNATICIFSDSQAALHALKAYTFNSKLVWECALALKTLAIRNR FT VKLYWIPGHTGLEGNEIADELARNGSANTFIGPEPFFGISNCSLNIELNSW FT LSRQIISNWKTVSNANQSKRFVTINSKQTQKLIGLNKRDLSTYTGLITGHC FT PSRYHLQKIGAIQNPNCRFCSETCETSQHLLCSCSAHIQRRSKIFGKPFLE FT PADIWNASPREVVGFIRLIVPDWGIHNAAT" XX SQ Sequence 5698 BP; 1792 A; 1335 C; 1255 G; 1316 T; 0 other; cctggtggct gtcttggcca aataagttgt tgaacatcaa tttcagcaac ttttccaagc 60 agcctagaga gccgtgtagt tatcggtagc ggttgactca actggctaag aataatacta 120 cggatcgcct gatccggtgg tagaaatcca ccagtcaggt aaccccaatt ccaaggtgtc 180 atgcgacccg tgctgatgga tgaatggttg agggggttta aaagatgctc aatcgcgaac 240 ggagcctggt gtgtaccagg gcgaactccc cagtatgtag ctcttactgc attacggcgg 300 ggcaatgatg cagcggaccg ttattcccca gcgactcgtg ggaacaaata tgagtacaaa 360 tcaaaacaac aacaatcttg aaggtgtcga ctgcctctct cagtcggagc caatggtagt 420 ggaggacgtg cagagcaata ctgagctaac cgaagagcag ttactcgcaa gttcgcagga 480 ggatatgctt accgacgcaa ccgaagtaag caacaaaaat tcaacaaacc atccgtatcc 540 ggacacccaa ctggacatgg acgacgataa taacgatggt ataaatgtta tcatcaacat 600 cccagaatct caaccatcag gatctgaatc caaaacggat ccttcatgtc atccgaacaa 660 cgcgaatgac caacagacga caaaaataaa cctcacgaga ggacaaagga agaagtttaa 720 agcactgatg cagagcggcg tgagtcgatc tgaagccctc attcaactcg ggaagggcaa 780 agatgagcta gcttcctcta aacgtggcag gacggaccta gacaactcgg ctaccagtga 840 ggacgtcccc aagcagaaac gaaccaagaa acgcctggat cccagagatc gagctgagca 900 atccaaccac gatggcccaa tggctcagca gtcaagtgac ggtcaccaaa atagcggaga 960 aaaccaaacg gctggtcata gctacagtga catgaccaat cgtaggaagg tcggagttgt 1020 tccgaaacac tttcccacat cccatctttc cacgactcag cttgatgctc ttcaggaggc 1080 cctgctgcta aaagtggaga aacagcgtaa tgagccgatg aaaccaaaat tctgcaacct 1140 gctctacaag tctggataca tggtccttgt ctgcaaggat ctcgagactg cagactgggt 1200 aaaagaaata actccttcac tcatcccttg ggaaggtgcc gagttggaag caatggacga 1260 ggagaaaatt cagcgtccag aaccaattcg tgcgttcttt ccccaaagtg cgaaatactg 1320 tgatgagcgc ataaaaacgc tcatcgagag tcaaaacaag atcacaacct ctagttggcg 1380 tattctccaa agaagcacac caaacgatat ccatgtcgag tggatattta cggttgatgg 1440 gccgtctatg gcgaacctaa caaaatccaa ctttatcctc aactatcgct ttggcgaaat 1500 acagctgaga aaaataaagg gcaaaaccac tcagtcgaac gaaaattcaa ctaatagggt 1560 gccccaagag aaatctaagg cagcctctag caaaaaccca caatcaaacc ccattaaaaa 1620 ggccagtctg cctgtctcta aagattcaag ctccaaccaa actcctagct ctagcggtgg 1680 cagattggtg ccatctaaga gcagtggatc tggtaaaaga ttgaattcga ctacagagac 1740 aaaaaaggct ggcctaggga agaagattga caaacacgaa cacccgaaac atcatccaaa 1800 acaaatacaa gatgatccgc agcatccaaa gaagaacgac ctccgtcctg gaaactgcgg 1860 cacacctaaa aatgactaag atcattcaag tgaatctcca tcatgctcga agcgcaacgg 1920 atgtgctttg ccggagattc acaaaagaac tgttcactgt ggctcttatc caggagccat 1980 gggtcaacca atctcgaata cagggtattc atgtaaactc atgcaagttg gtatatgatg 2040 acagccagct ctctccaaga gcagctattc taatacgcaa tgacactaaa tgctttccaa 2100 ttacagaatt cattaaacgc gacatcgtag cagtcagggt ggaggttcct accgctagag 2160 ggaaggctga tatcgtcatg gtctcggcgt attttccagg cgacgcggaa gatgttcctc 2220 ctccagagat tgctgcgcta acctcttact gtgaaacaaa aaacattccc cttgtcatcg 2280 gatgtgacgc gaatgcgcat cacactgtat ggggaagtac aaatataaat cctcgaggtg 2340 agtacctttt acagtttcta tcctcaaaca acatagacat atgtaacaca ggtgacaagc 2400 ctacatatga aaacgccata cgacaggaag tgctggatct gactttatgt agtcagtcca 2460 tctcccacaa aataacaaac tggcatgttt ctgatgaaat atctatgtca gaccacaaac 2520 acatagtctt tgaatgggaa gggggtctaa ctattgcaaa atcgtttaaa gatcctaaga 2580 agactgattg ggaaacctat tcagccactc tccgatccga agactacatc ataaatccta 2640 atatcaaaac agttctagag ttagaaatgg cttctgactc tatacaaact aaaattctta 2700 atgcatatca agaaagctgt ccgactaaaa ctgttaagtc gagcagagat gttccgtggt 2760 ggaacaacca tcttgaaaag cttaggaaaa cggcacggag ggagttcaac cgtgccaaac 2820 gctctactga ttggagccta taccgaaagg ctctgacaga ctacaacaag gaaataaggc 2880 tagctaagcg gaaatcatgg gtcctcatgt gtgaaagcat tgagaagact cccgtagctg 2940 ccagacttca taaaactcta tcgaaagacc actccaatgg tctgggaact ctccgaaaga 3000 ctgacggttc gctcactgtg gatcccacag aaacactgag tgaaatgctg aggactcact 3060 tccctgattc aatccctgaa gcgatcgtga acgctaatgg caacagacat ggcgtctctg 3120 gtcaacaaga gctcctgtca tgggactcaa gtaccaaaag agacgcattg aaggttgcca 3180 aagaagcctt tacgcgagca agggtggaaa gggcagtgag atcctttgag ccatttaaat 3240 ctgctggcat ggatggaatt ttcccagcgc ttatccagaa agaggagcaa acgcttattc 3300 ctcccatgat agagattttt aaggccagtt tagtcttagg acatattcca gatggctggc 3360 gtcaagttcg agttgtcttt atcccgaagc cgggaaaaaa agacaaaacc aatcccaaag 3420 catacagacc gataagtctg tcgtcagtga tgcttaaaat aatggaaaag gtcttaggag 3480 agttcataaa ttcaaaattt atggaagcaa tgcctctttc caaataccaa ttcgcttacc 3540 aaagtggaaa atctacgatc tcagcactac acatgctagt caacaagatc gagaaaacat 3600 ttcatgcaaa ggaaatcgcc atcgtggcat ttcttgacat tgagggtgca ttcgataacg 3660 cttcctattt gtctatagtg tcagcaatgc ataggagaaa atttgaccca tgcattgcta 3720 cctgggtaca tgctatgcta gcaaatcgcc gaatctcatc cgagttgagc ggatcgtgcg 3780 ccactgtcat ggccacaaag gggtgtcctc aaggaggggt actttcacct ttgctatggt 3840 ctctagtggt ggatgatcta cttaatagcc tggaaaaaag aggattcgag gttgttggct 3900 acgctgatga tgttgtcatc attgtacgcg gcaaatttga aagcgttgtc cttgcaagaa 3960 tgcaatcagc tctcgattac acgctctcct ggtgtctgaa agagaaactg gggataaatc 4020 cttcaaaaac tacgattgtt cctttcacaa agcgcagaaa ggtacagctg caacctcttt 4080 tccttaatca aacacaattg gtttactcaa atgaagtcaa ataccttggc attacgcttg 4140 atgctaaact caattggaac acacatcttc aaaacatggt aaataaaggt ctcaattctc 4200 tgtgggtctg ctcaaagacc tgtggaaaaa agtggggcct aaaacctaat atgatcatgt 4260 ggatatataa aaccatcgtt cggcctagaa tagcctacgc ttcccttgtt tggtggccaa 4320 aaacaagcga ggttacggct agagccaagc tgaacaaaat ccaacgcgcc gcgtgcgtcg 4380 ccattactgg tgcagttcgc agcacccctt cattcgccct agatgcaatg cttaatctgc 4440 cccggctaga tcaattcata aagctggatg ctgagaaaag cgctctgagg ctaaaacgat 4500 caataaacct tctgccaggt gatttaaccg gtcacctaag catactaaac gaattttcta 4560 taaatccaat tgtagaaaaa tgcagtgact ggatggaaat ggttgtgaat tatgacatat 4620 catatacggt gttcattcct tctcgccaag agtggcaaga aggtggaccg agtattcctc 4680 caggctcaat caaattctac actgatggat cgagaatgaa caatcgttct ggatctggag 4740 tgcacggacc aagaaccaaa atctctgtcc ctctcggaca gtggcctaca gtttttcagg 4800 ctgaagtgta tgctatcatt gaatgtgtgc agctcagcat aaagaggaat tacagaaatg 4860 ccacaatctg tattttctcc gacagccaag cagctcttca tgctctgaaa gcttacactt 4920 ttaactcaaa attagtgtgg gaatgtgctc ttgctctaaa aaccctagcc atccgcaatc 4980 gagttaaact atattggatc cccgggcata cgggtctaga gggtaatgaa attgccgatg 5040 agctagccag gaatggatcg gccaatacat tcattggtcc tgagccattc tttggcatat 5100 caaactgctc actaaatatt gaactgaaca gctggttgtc caggcaaatt atatccaatt 5160 ggaaaacagt ttccaatgcg aatcaatcta aaagattcgt cacaattaat tcgaaacaaa 5220 cacaaaaact cattggtctc aacaaaaggg acctcagcac ctacaccggt cttataactg 5280 gacactgccc cagcagatac catttacaaa agatcggagc catccaaaac ccaaattgcc 5340 gtttctgtag tgaaacgtgc gaaacctcac aacaccttct ctgctcctgc agtgcacata 5400 tacaacgaag atccaaaata tttggcaagc cctttttgga gccggccgat atttggaacg 5460 catctcccag ggaagtggtc ggctttatca ggctgatcgt accagattgg gggattcaca 5520 atgctgcaac ttaggacttc tgcccatcaa tggcagatgg ctaaaagttc agtcaccgcg 5580 caaagtattc cggtggcgtt cccgccactg agggtctttg taaaagcaaa agggtatatc 5640 acaatagttc taaaaaaaat ggacgcagtg atctcacacc cgacagaagg aggaggaa 5698 // ID CR1-61_AAe repbase; DNA; INV; 2450 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-61_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2450 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1148-1148 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 18 sequences with >98% CC identity. Closely related to T1 and Q. This consensus is CC 5'-truncated. XX FH Key Location/Qualifiers FT CDS 3..2327 FT /product="CR1-61_AAe_1p" FT /translation="LDNFAFRGLKQRNLLRNHLDRTLDLIFTNDAITDCSS FT VDEACQPIVDIDPHHPAFVLTVRSPTIVPFVEVSDEEVRDFGKVDYETLHV FT LLSRVDWSTVYQCNDVDNAVSIFNGKLHECFTHCVPLAMPRRKPAWSNARL FT LYLRRNQSKMQRRFSKRKNDCNKLAFNRASRSYRYYNRYLYGKYVRRVQDN FT LRRHPKQFWSFVNSKRNETGLPSFLSLNGHSASSAEGKCELFASWFASVFD FT TSSQSQADPDAASQSLPTDIIDVDIFEVSHMMVEEALKKLKPSISPGPDGI FT PACILKHCKAVLIAPLKAIFNLSLRCQQFPKAWKMSYMRPIFKKGSRTEVS FT NYRGITSLPAGSKCFEIIMNDVLLQACKTYISQNQHGFFPQRSVISNLCEY FT TSFCIKEMDKGGQVDSIYTDIKAAFDTVDHSILLAKLNRLGASSRICSWLY FT SYLTDRQLLVKIGKAISAPFTPHCGVPQGSNLGPLLFVLFFNDVSHVLSNC FT YLLFYADDLKIYLIVRSINDCRQLQEQLDLFVSWCTLNHLSISVSKCCIIS FT FRRMKSPVLYNYAINGTILNRVDNVKDLGVLLDHQLTFKMHYSETVDKANR FT QLGFIFKIASEFDDPLCLKSLYCSLVRSILEFASVIWSPYDAVWIARIESI FT QRRFVRCALRSLPWSDPLRLPPYEHRCALLGIETLEERRKMNQAVFGAKVL FT RSEIDSPALLRQLNIYAPQRVLRSRQLLYQPPRRTMYGSNNPIDTISRRFQ FT DNYNLFDFNISTNTFKNRLRNRQV" XX SQ Sequence 2450 BP; 703 A; 517 C; 518 G; 712 T; 0 other; ttttggacaa tttcgcattc aggggactga aacagagaaa tttattgcga aaccatttag 60 acaggacgtt agatctaata ttcacaaatg atgctatcac agattgcagt tcagtcgatg 120 aagcctgtca gccgatagtg gatattgacc cgcatcaccc agcctttgta ctcacagtaa 180 gatctccgac aatagttcct ttcgtagaag ttagtgatga agaagttcgc gactttggta 240 aagtggacta cgaaacactc cacgttttgc tttcgagagt ggactggtca actgtctatc 300 aatgcaacga tgtggataat gcggtgagca tcttcaatgg gaaactacac gaatgtttca 360 cgcactgtgt accgttagca atgccgcgac gcaaacctgc gtggtctaat gcacgcttgt 420 tgtatctccg acggaatcaa tccaagatgc agagacgatt ttcaaaaaga aaaaacgatt 480 gcaacaagct ggcatttaat cgagctagcc gcagttatcg ttactacaat cgatacctct 540 acggtaaata tgtaaggcgt gttcaggaca atttacgaag acatccaaag cagttctgga 600 gttttgtcaa ttcaaagcga aatgaaactg ggctgccatc atttttgagc ttgaatggcc 660 actctgcttc ctcggccgag ggcaaatgcg agctttttgc atcatggttt gcgagtgtct 720 tcgacacatc gtctcaatct caagcggatc ccgatgctgc gtcacaatcg ttacctactg 780 atattatcga tgttgacatc ttcgaagtta gccatatgat ggtggaggag gcattgaaga 840 agctaaaacc atccatatcg ccaggccctg atggtattcc tgcatgcatt ttgaaacatt 900 gcaaggcggt gttaattgct ccgctaaagg ctatattcaa cctctctctc cgatgtcagc 960 agttccctaa agcatggaaa atgtcttata tgcgaccgat cttcaagaaa ggcagtagaa 1020 cggaagtatc gaactatcgt ggaataacct cccttccagc tggatcgaaa tgcttcgaga 1080 taataatgaa cgatgtttta ttgcaagcct gcaaaacata catttcacag aatcagcatg 1140 gattcttccc acaacggtca gtaatctcaa atctatgcga gtatacgtca ttctgcatca 1200 aagaaatgga taaaggggga caggtcgata gtatttatac agatataaag gccgcatttg 1260 atacagttga tcacagtatt ctcttggcaa agttgaaccg tttgggtgct tcttcgagaa 1320 tttgcagttg gctgtattcc tatctaactg atcgtcaact actcgttaag attggaaaag 1380 ccatatcagc tccattcact ccgcattgtg gtgtacccca agggagtaat ttggggccac 1440 tgttgtttgt cctgtttttc aatgatgtgt cccatgtact ttcgaactgt tacttgttgt 1500 tctatgcaga tgacttgaaa atctatctca tagtaagaag catcaatgat tgccgccagt 1560 tgcaagaaca gttggacctc tttgtgagtt ggtgtacgtt gaatcatttg agtattagtg 1620 tgtctaagtg ctgcatcata tcctttcgta gaatgaaaag tcctgttctg tacaattatg 1680 ccattaatgg taccatactg aatcgtgtgg ataatgttaa agatttgggt gttttactag 1740 accatcaatt gacattcaaa atgcactatt cagaaacagt cgacaaagcc aatcgtcaac 1800 ttggattcat cttcaaaata gcaagtgaat tcgatgaccc tctgtgtcta aaatcattgt 1860 actgctcgtt agtcagatca attctagagt ttgcttctgt aatctggtcg ccatatgatg 1920 cagtgtggat tgctagaatc gaatcgattc agcgtagatt cgtgagatgc gcattacgta 1980 gcttaccctg gtctgaccca ttacggctac cgccctacga gcatcgatgc gcactgctcg 2040 gtatcgaaac gctagaggag aggcgaaaga tgaaccaagc cgtctttggt gctaaagttc 2100 ttcgctccga aatagacagc ccagcattgc ttcgccagct taatatatac gctcctcaac 2160 gtgtgctgcg ttctagacaa ctactatatc aaccaccgag gcgaactatg tatggatcca 2220 ataatcccat tgacacgata agccggagat tccaagataa ttacaatttg ttcgatttta 2280 acatctcaac taataccttc aaaaatcggt tgcgcaatcg ccaggtttaa tgtatcctgt 2340 taagtgttgt ccaagagttg atgcttagtt tagtttagtt ttaattttat tcattaagac 2400 catgatgtcg gatgaatctt caatacaaat acaaatacaa atacaaatac 2450 // ID hAT-3N1_BF repbase; DNA; INV; 952 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-3N1_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-3N1_BF; hAT-3_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-952 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-952 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 922-922 (2008). XX DR [2] (Consensus) XX CC It shares 90- and 210-bp 5' and 3' termini with the autonomous CC hAT-3_BF. XX SQ Sequence 952 BP; 258 A; 230 C; 219 G; 245 T; 0 other; cagtggcggc ggcaccgtgg gggcagtggg ggcggtcgcc cccacaaaaa ataggtcgtg 60 ggggcgtcgc ccccacgata aaataagctg aaacacacaa aatcaagctg aaactcttgt 120 gtatattttc actaatggaa atttcatcaa atctcgctag tctactggca aaatttggcc 180 ctgaaaatgc aggaaatgac gtttcagagg gtccagattt caaaatttca aaggacctcc 240 cttgtgagga ttcgcgcctt cggcactcag gacatggtgg gaaaatgcta ccggagcgtc 300 gtcccccaca acctttctct agtagaaatt tcattatcaa actttagatt tgtttacaag 360 caaaatgttc ccctcaaatg caggaaacgg cattcaaagt gtcctgattt aaaaatttct 420 ccagaactcc cttgcggcgg cttgcgtctt cggggctcac gacgacgtag tgcaaaatat 480 gttggtgcgt gggtcgtcgc cctcaaaaat cgttttctct actagaagtt taattaaatt 540 cagcttagtc tgtctgcaaa atttgtccct caaaatacag gaaatggcgt ttcagagggt 600 ctagatttta aaaatttcca gaacctaccc tgcaacggct cgtaccttcg ccgctcgaga 660 tagtgaaaat atgttggggg tgtcgccctc aataactaaa gttaaaacta tacatgtatt 720 tttgatagaa atgccattga agttagctta atctggcagc aaaatttgcc cctcaaaatg 780 caggaaatag cgtttcagag ggtttagatt tgaaattttt ccgggggtgc atgcccccgg 840 acccccctag aagggtcgcg cctccggcgc gacgcctcgg gccttcggcc ctcgatttag 900 tgatatgaca aatattcgcc cccacaactg aaaaacggtg ccgccgcctc tg 952 // ID HHA1_BT repbase; DNA; INV; 326 BP. XX AC . XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Brugia timori HhaI repeat region - consensus. XX KW HHA1_BT; Hha1 repeat; tandem repeat. XX OS Brugia timori OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; OC Onchocercidae; Brugia. XX RN [1] RA Fischer P., Wibowo H., Pischke S., Rueckert P., Liebau E., RA Ismid S.I. and Supali T.; RT "PCR-based detection and identification of the filarial parasite RT Brugia timori from Alor island, Indonesia."; RL Unpublished. XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX SQ Sequence 326 BP; 134 A; 38 C; 34 G; 120 T; 0 other; gcgcaaaact taattacaaa agcgtatttt aatatttaaa ctataaaatg acgacgcaat 60 atacgaccag cactggtaca attcacgtaa catagtcaat taatttcaaa ataagctttt 120 tttagtagtt ttggcactta attaaaatta gaattgataa attagaacca atttccttgt 180 ctaatgtaag aaaactacaa caaattttaa aatttttaat aaattcaaaa cttaaatttg 240 aatttaaatt taatttaaat tcttaaattg aattaaaatc atgattaatt gaaagtttta 300 ttaattttgc tgatgaattt atgcgc 326 // ID POT_Cis repbase; DNA; INV; 2976 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Mariner DNA transposon from Ciona savignyi. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW POT_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-2976 RA Smit A.F.; RT "POT_Cis - Mariner DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000344. TA target site duplications; 24 bp TIRs. The ORF from CC bp 541 to 2409 encodes a transposase up to 24% identical (46% CC similar) to the Pogo-like transposases of POT2, Flipper, and CC Fot1 in fungi (POT = POgo-like Transposon). XX SQ Sequence 2976 BP; 1023 A; 482 C; 558 G; 912 T; 1 other; ccgtgaaatg gggtttcttt gcccttcatg gcaactttgc ccgaaatttc aaaaattttt 60 gaaagtgaaa atattccagc ttttacatac aaactaagtg aatttttatt ttatagtata 120 aattcatcat atgattacca tacttccata aaaacaagtg aaaaaattat ctttaaacca 180 ttattttttg tatttttgca ggtcctgttt tttttcaaaa ataatgttgc ttagtacttt 240 ctcttacagc cttattatta ataatgatat aaatatgata agtattgtga gtgaagatat 300 tctcattagc aacttattgc atatgatttt tatttggttc catataaaaa caaaaaatgt 360 tggtgaactt tttaaacttc atgcttattt tggtgtatac ggtacaactt tgcccacttt 420 gagatgccta atcatgttac atgagtacgg taattcatta gtcattccta cattggttga 480 tgcttaaaat gcatttacag attttaattt aactttctag gaaaatttca tgtatagtgt 540 aaataatagt gtaaatgcaa tgagaagata caggaaaaga aaggggtcac ggccatataa 600 aaattactca gaagagtgtt tgaaaaatgc actatcagct gttcaagggg ggatgtcagt 660 cagaaaagca gagaaagagt acagggttcc caggagaact ttatctaata aattaaagaa 720 actccactca aaatctgtag gtaggcccac tgcattgaca gaggaagaag agagagcaat 780 ggttcatcat cttctaattt gcgcggaatg gggaatgcct ctatcctgtc ttgatgtccg 840 aatgattgta aaaaatcata ttgattctaa aggaaaagta atagagtgct ttactaataa 900 tatgcctggc tatgattggg tgaaaggatt ccttcatcgt cacaaatctt cacttactga 960 acgcttgtgt agaaatatca aaagagcaag agctcagaca agtcatgatg atataaatgt 1020 gtactttaga caccttcaat ccacacttca agatattcca agcacacaca tagttaacta 1080 tgatgaaaca aatttcagtg atgaccctgg agtaaagaag atgatttttt gtcgtggtgt 1140 taaatatcca gaacgaatta aaaacttctc aaagggtgct gttagtgtaa tgttttcagc 1200 aacagctaac ggcgagtttc tgccattgta cgttgtgtat aaatcaataa agctgtggca 1260 cagttggtgt gagggtggtc caccaagtac tcgctatgga aggaccacat ctggttggtt 1320 tgatgcatgc actttcgaag actggtttac cagtgttatt gttccttggg caaaaaaact 1380 tcctgggaca aaagtcatta ttggagataa tttaagttct catttgaaca gtagagtcat 1440 tcaaaagtgt gaagaactcc acattaattt tgcttttttg ccaccaaaca ctacccacct 1500 aactcaacca cttgatgtgt ctgttttccg tcctgtaaaa acacattgga gaaacattgt 1560 cagtgaatac aaagagcaat acccgagcag cacttcattg gataagcaca tgttccctaa 1620 ccttctaagt aagctgatga aaatattgca aaaggatgga aatgacacaa acatcataaa 1680 atctggattc aaagcaacag ggatatatcc atataaccct aatgaagttc tcaagcgact 1740 tccaagtaca gacttagatg atagtgtatc aagtagtata tctgatagtg ttcttactta 1800 cctacagcag aatcgtataa agaagagaca aggggcaaaa agaaagaaat tggaagtgcc 1860 accaggaaaa tcagtcagtg ttgatgattt gaagacttgc caaccatcta ctagtggatt 1920 gccaataaca aaaacgaaaa actcaacgaa aggaaatata gtaagagcag atgaggtgtc 1980 tacaaaggca aaaaagaaaa agacaggtgg tgcacagcag catgcttact ggagcagcag 2040 tgatgagagt gattgcagtt tacactacaa tgatacatca gatgatctag aatcaagtga 2100 agaagaaaat tggccacaag aaagcagaaa taagtgcgga gattttgtgg ttataaaata 2160 tattgtcaag tcaagtgtga ttcattatgt tggtgagatt gtcaatttcg atgcaactgc 2220 ggcttttgca gaaattcatt tcatgaagca tcttggaaaa gagctctttg cttggataaa 2280 tgaatattgt gaagtcgact taaaccaaat agttttgagc ttgccaccac cacaaccaaa 2340 cagaagagag caatacactt tccgtgctga attgagtggt attaccaatc tccgataaca 2400 atgtatttgt tgcaattttt catgatgtgt atgtttcatg atgttcttca ttcgtttctt 2460 ggtttatttt gtttcgcaat tgatttgcca cttatttaaa gtaaatataa aacaatggga 2520 taatttttag ctctggttta tagtttttac aattgttctg tattagtaac aatgtgacaa 2580 aaaacaatta taagaactat gtactctacc atacaataac tgatgctatt gtaccagctt 2640 tataataaat attggttcca gtagtttttt ttgtcttcag agtgggcaaa gttgtctcta 2700 gaagtttaaa attgcatctt accatggcag attgtgcatt caaatgtaca aatacatatg 2760 tgggcaaagt tgccctatac ggggctactt tgcccacctg aggntttttt taaaaaaata 2820 acagtatgaa aaaaattaat ctcataattt cacataagaa atgtagtaca acatgagaca 2880 tgttgactga actacaaaaa aaacatttaa cttcactatg tctaaagtta tggccataaa 2940 tgcacaaaaa gtgggcaaag taaccccatt tcacgg 2976 // ID TTAA18_AP repbase; DNA; INV; 560 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA18_AP. XX NM TTAA18_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-560 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2085-2085 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 560 BP; 175 A; 92 C; 114 G; 179 T; 0 other; ggggattgca aacggacagt ttttaaggag ttcaaaatag gcaattttca aattgtatgc 60 atgcgggtct tgtaaatgtg tgcgtagtaa gtgatcatta gtgtgtgtga ctgtgtgtaa 120 ataaaatagc ggagcgctcc cgcacgcgca cgtacaagtc accgcctcgg cgttgcgcgc 180 gtgaacggta aacaatttat gctcatttgt atgtactttt tgtcaatcct accgatggac 240 ctgataacgc gataagactt ctgaaaactg atttgaattc gttaatatgt ctacttgaga 300 tcgatcaggc cacattttaa gatatttaat ttaattttcg aataatcaac atttaaaatt 360 tcgtaaattt taaatattca aacaaaatgt ataggagttt aggaacaaaa ttttaaaaaa 420 gtggcctgat cgatctcaag tagacatatt aacgaattca aatcagtttt cagaagtctt 480 atcgcgttat caggtccatc ggtatgattg acaaaagatt ggtacatttt ggttgaaata 540 attgtccgtt tgcaatcccc 560 // ID Gypsy-589_AA-I repbase; DNA; INV; 4185 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-589_AA_; KW Gypsy-589_AA-LTR; Ty3_gypsy_Ele15; Gypsy-589_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4185 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3275-3757] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 140..4185 FT /product="Gypsy-589_AA-I_1p" FT /translation="MNATTFSSQTGQECSSGGRGDGLSANLRQENNVHNRQ FT AVSNCVCQEQRNRLGPEFMVSNNNFRFEEVAAFLPFFAGHSDEDINHFIAT FT VENTKRALGINCEIMKLVVIKQLKGNARDWLHTRADFMLKTYEETLDDLRQ FT MYGSAVNGFELRKRLERIRWFGREPFADYCQSKKLIAQKLNLGEKELVEYI FT VEGISDHNLKNQARIQNFQTVTEINQAFRMVKMEEQKSFRKPTVCYSCNQP FT GHLAVNCKKALDQKQSKSTSSFWKSNEGAKKQQAIQRPIINAVMNDHQLED FT EVTEEGYDGLVEFKLSNFDYTFKALFDTGSPISLIRRGLVEKQNITKNNES FT KRYRGIGGKRLKTLGESINIVHIQNISFRINFVVVPDKVLGSHDAIIGRDV FT IMRSRIRTIINQNIRLYSNIDRSSRPETSTKTVNIFNINSDVSELKKENIG FT YITNDQLAQFLEIFKTNYSCYKKPLIPETKYEMEIVLKKGESFHFKPRRLS FT FDQKQKVELKIKELLEAGIIKESNSPFASPIVLIPKKNNEIRMCVDFRKLN FT KDTVRDNYPLPVIEDIFDSLRNKTYFTILDLKSGFHQIKLASASTKFTSFV FT TPSGQYEYLRVPFGLCNAPAVFQRYINKIFKHLIEKGKMVVYLDDILIASE FT TFDEHLEILQQVFETLSKNLLELNLEKCKFMFEEIDYLGYTLNEKGRKLNN FT THIESISKFPVPKNVKDVQRFLGLTSYFRKFVQNFASIARPLYNLLKKDSK FT FLFSKEHSESFEKLKQKLITAPILAIYNPSAETQLHCDASSYGFGAILLQK FT QGDGNFHPVCYFSKITDNYESKLHSFELETLAVVHALKRFHVYLAGIKFKV FT FTDCNALVQTLSKKELNPKISRWALLLENYNFDLEYRDESKMKHVDALSRQ FT NMSNKFVDKQLSDVDTINVLNESEIESNIILAQEQDSKIMNIKNLLERSNF FT PNFLLINGVLFRKDGEKNLLVVPKLMIDNVIRLHHDNCGHIGIEKTIHEIK FT KYFWFSSMRKIVKKYIQNCLTCIFYNPCEKKQGFLKLIEKGNQPFNTIHID FT HYGPIQLPSSNRQKYVLVIVDAFTKFTKLYSTKSTSVDEVIKHLSDYMNYY FT SRPLRIITDRGSCFTSSKFETFVNVHCINHVKVAVRTPESNGQVERLNKTL FT TPMLAKLCDESQQKWDKILNKIEYVFNNTFNRSVNNYPSILLFGIQQNSIE FT NENYNIESFVKSKQFDLGKETIQEIRRKAQIKNKETQEYNKKMTDKKRIKC FT TDYKEGDFVVLKANESHKLDAKFKGPYQIKKVLPNDRFVITDIDGFQVSSI FT PFNSICSPNNMRKWMCNDVPDDGSIDEDIGFVRMPEV" XX SQ Sequence 4185 BP; 1548 A; 617 C; 767 G; 1253 T; 0 other; tatcagaagt gggatattgc tgcgcagtca gaagacttcg gatgaattgg agcgttctga 60 acaccggaga actaaggctg accggatact gcctgaggta gatgcaagag aatttgtagg 120 aatcgttgca gaaggcacca tgaacgctac aactttttct tcgcaaactg gtcaagaatg 180 ttcaagcggt ggacgcggtg atggattgtc tgcaaatttg cgccaggaga ataacgttca 240 caatcgtcaa gcggtcagta attgcgtttg tcaagaacag aggaatcgtc tgggtcctga 300 atttatggtc tcaaataaca attttcgttt tgaagaagtg gctgcttttc tgccgttttt 360 tgcgggacat agcgatgaag acatcaacca ttttatcgct acagtggaaa acacgaagcg 420 cgctcttgga ataaactgtg aaattatgaa gcttgtagta atcaaacagt tgaaaggtaa 480 cgcaagagat tggcttcaca ctagagctga ttttatgctg aaaacatatg aagagactct 540 ggacgatcta cgtcaaatgt atggatcagc tgtcaacgga tttgagctac gaaaacgttt 600 ggaacgaatc agatggtttg gacgagagcc gttcgcagat tactgccagt cgaagaagtt 660 gattgcccag aagcttaacc taggagaaaa agaattagtg gagtatattg ttgaaggaat 720 ttctgatcac aatttgaaaa atcaagcaag aattcaaaat tttcaaacgg ttaccgaaat 780 caatcaagca ttccggatgg tcaaaatgga ggagcagaag agttttcgaa aaccaaccgt 840 gtgttattcc tgcaatcaac ctgggcatct agctgtaaat tgcaagaagg cgttagatca 900 aaagcagtcg aagtcaacat cgtcattttg gaaatcaaat gaaggagcta agaaacaaca 960 agcaatacaa agacctatta tcaatgcagt tatgaacgac catcagctag aagacgaagt 1020 tacagaggaa ggatatgacg gactggtaga attcaaactt tccaattttg attatacttt 1080 taaagcttta tttgatacag gcagccctat ttcacttata cgccggggat tagtggaaaa 1140 acaaaacata acaaaaaaca atgaatctaa acgttaccgg ggtataggtg gaaaaaggct 1200 aaagacgtta ggtgaaagca ttaatattgt tcatattcaa aacatttctt tcagaattaa 1260 tttcgttgta gtaccggata aggtgcttgg ttctcacgat gctatcattg gacgggacgt 1320 tattatgcgt tcaagaatcc ggactatcat taatcaaaat attcgacttt attcaaatat 1380 tgatagaagc tcgagacctg aaactagcac taaaacggta aatattttta atatcaattc 1440 tgatgtttcc gaattaaaaa aggaaaacat aggttatata acaaacgatc agttggcaca 1500 atttttagaa attttcaaaa caaattattc atgttataag aaacccttaa ttcctgaaac 1560 taaatatgaa atggaaatag ttttgaagaa aggcgaatca tttcatttta aacctcgtag 1620 attatctttt gatcaaaaac aaaaagttga acttaaaatt aaagaacttt tggaagctgg 1680 aataataaag gaaagtaatt caccatttgc cagcccaatt gtgttgattc caaagaaaaa 1740 caacgagata agaatgtgcg tggattttag aaaactaaat aaagacacag ttcgcgacaa 1800 ttatccttta cctgttattg aagacatatt cgacagttta agaaacaaaa catattttac 1860 aattttggac cttaaatctg gatttcatca aattaaattg gcatcggctt caactaaatt 1920 cacatcattt gtaactcctt ctggccaata cgaatatttg agggtccctt tcggtttgtg 1980 taatgctcct gcagttttcc agaggtatat caataaaatt ttcaaacatt tgattgaaaa 2040 aggaaaaatg gtagtttatt tggatgacat attgattgca agtgaaacat tcgatgaaca 2100 tttagaaatt ttacaacagg ttttcgaaac attaagcaaa aatttacttg aattgaattt 2160 ggaaaaatgc aaatttatgt ttgaagaaat cgattattta ggatacacat tgaacgaaaa 2220 aggtagaaaa ctcaacaata cacatattga aagtatttcc aaattccctg ttccaaaaaa 2280 tgtgaaagat gttcaaagat ttttaggact cacaagttac tttcggaagt tcgtacaaaa 2340 ttttgcttca attgcgcgtc ctctctacaa tcttttgaag aaagattcca aattcctttt 2400 tagtaaagaa cattcggaat cattcgaaaa gcttaaacaa aaattaatta cagctcctat 2460 tttagcgatt tataatccat ctgctgaaac acaactgcat tgtgatgcta gttcttatgg 2520 ttttggagca attttacttc agaaacaagg tgatggtaat ttccatcctg tttgttattt 2580 tagtaaaata accgacaatt atgaatcgaa attacatagt tttgaactag aaacgttagc 2640 agtagttcat gcgttgaaac gttttcatgt atatttggca ggtattaaat tcaaagtgtt 2700 tactgattgc aatgccttag tgcaaacatt gtccaaaaaa gagctgaatc ctaaaataag 2760 tcggtgggcg ttgttattag aaaattataa ttttgatctg gaatataggg atgaatctaa 2820 aatgaaacat gttgatgctc ttagtcgcca aaatatgtcc aacaaatttg tagataaaca 2880 attatcagat gttgatacga taaatgttct caatgaatca gaaatagaaa gtaatataat 2940 tttagcccaa gagcaagata gtaaaatcat gaacattaaa aacttactag aacgatcaaa 3000 ttttccaaat tttttgctaa taaatggtgt attgtttagg aaagatggag agaaaaactt 3060 attagttgtt ccaaaattaa tgattgacaa tgttatcaga ttacatcatg ataactgtgg 3120 tcacattgga attgaaaaaa caatccatga aattaaaaaa tatttttggt tcagtagtat 3180 gaggaaaata gtaaaaaaat atattcaaaa ttgtcttacg tgtatatttt ataatccatg 3240 tgagaaaaaa cagggttttc tgaaattaat tgaaaaagga aatcaacctt ttaatactat 3300 tcatattgat cattacggtc caattcaatt gccttcatca aatcgacaaa aatatgttct 3360 agttatagtt gatgctttta ctaaatttac caaattatat tcaaccaaat ctactagtgt 3420 agatgaagtg ataaaacatt taagtgatta tatgaattat tatagtagac ctttacgaat 3480 tataactgac cgaggttcat gttttacaag tagtaaattt gaaacatttg ttaatgttca 3540 ttgcataaat catgtgaagg tagcagtacg aacacctgaa tcaaatggtc aagtggaacg 3600 attaaataaa acattaactc caatgcttgc caaattatgt gacgaatctc agcaaaaatg 3660 ggacaaaatt cttaataaaa tagaatatgt tttcaataat acgttcaata gatctgttaa 3720 taattatcca agtattttgt tatttggaat acaacaaaat tctatagaaa atgaaaatta 3780 taacatagag tcttttgtaa aaagtaaaca gtttgattta ggaaaagaaa ctatacaaga 3840 aattcgtaga aaagctcaaa ttaaaaataa agaaactcaa gaatataaca aaaagatgac 3900 agataaaaag agaatcaaat gcactgatta caaagaaggg gatttcgttg ttttgaaagc 3960 aaatgagtca cataaattag atgcaaaatt taaaggacct tatcaaataa aaaaagttct 4020 gccaaatgat cgatttgtga taactgatat agacgggttt caagtttcga gtattccttt 4080 caattcaatt tgctctccga acaatatgcg aaaatggatg tgtaatgacg taccagacga 4140 tggaagtatc gatgaggaca tcggttttgt caggatgccc gaagt 4185 // ID Gypsy-36_NVi-I repbase; DNA; INV; 9512 BP. XX AC . XX DT 01-JUL-2009 (Rel. 14.07, Created) DT 01-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Gypsy LTR-retrotransposon from Nasonia vitripennis, interanl DE region. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-9512 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Nasonia vitripennis."; RL Repbase Reports 9(7), 1389-1389 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 72..1430 FT /product="Gypsy-36_NVi-I_1p" FT /translation="MSAHHERDRFVNTQRNTKFTLDFEDSLLTKLKDKEDK FT LKIAIANSSPETPKLIQEVDSLTKEVRNSCERFNIDTSANQDLPRNKKEYY FT QYRLNKSNQSEDSDDEVSIHEIKTEQNQTITQNKNILLKPPTLNRNAESSC FT LGTSTPVTRGLAQKNNLTTIFGSLGIQETPIDKNAINNIKQHLSIEIKPEV FT LDAFFNDENNAAEIDKTINNPGFGFFQAIKLVRQRLSTFTTGIPQLENIKI FT KTSPNHTDSVEKSELFLSFDRSFNALKIESPQKSKIEKKPKMAETIQVRGM FT SISNVLDLIPRFNGQNVSLSLFLDGCREAKELLPSTLEGELAKFIRMRLYG FT DALNSTRGQTFATVNEVMEFFENIYGSAKTYHDWSGDLAKMKQKSNESVIV FT FLNRIREIEKEITAAAQREGRISIKKKHLKMISKKTASNFLKGDYVGRSGG FT AWVRLPL*" FT CDS join(1265..2125,2008..6714) FT /product="Gypsy-36_NVi-I_2p" FT /translation="NSGNRERDHSRGSAGGPDKYKKKTFEDDLEKDCIKFF FT KRGLRWEIRGRMGTPTTLKEAREQAIEIEREYAGCAIDDDELPDAKFKNKE FT TRKVHVVDYDQSSMKCGFCGKIGHVTLNCRQFALQHLQRHSDVRDYQRDRG FT PPPRGNQNFRNETRPNNTNFGGWNAENRQRNGFLPNQMNNNYNNNNNFNRR FT ITENPRYNNSENLRSSNNSNRNNTNNQSTFRPTCYYCGIQGHTRNACFKLQ FT QDMRSGNVAQGNAESLSQEGARREMTSGTRPNDSNERNTPTSAPKLRSAGK FT RRVSFSGGRAERDDIWHAPERFQREEYSHVRTETINLSRASTVPTIRVDIG FT EFNKPLEFMLDTGASVSLVRKSEVSPYTNINTNEIIRLKGITTQYISTLGK FT VTVDFNNVSTDFHLVDDDFPITEAGILGNDFFRKNEATINYFENYIEVNGQ FT IIPFEKYPTEIEPIDTKPRVTSESKNTRVRSDLSEGGETDRGYDDSYVDDT FT RNIFDQGNDKLIEENSELIKENKELIEENCEFIDEKDNENFCIRDENIYVN FT QNLENIYVHFDTNDRDVIESDEFNIQEINDESSENIYTIEYVENLFKEIER FT LKKLLQVNTSQGNESEHTDYVNIEKNVVMNKIKSSAEDTEMKNKVNVAEII FT DDEVIKVDTKNVFVDEVTENIFEENKNNDNFEIIANDIDYEIDSELENCLL FT KEGSVNAETFEKLYGDKQENKLNNCHFVENYIENSHMDDLCYNYMEISDKD FT YFANGMPNNATTIPIFQIKEAENAEMPRIDRLKSLLRYTHLETELRQPIER FT LIEEYADCFHLKGEKLGTTNFIKHRILTVSDVPICTKPYRFPHALKQELDN FT QIQEMLEGGVIQPSNSNYSSPVFLVAKSPDSKGNKRYRVVVDFRKLNDITI FT KDKYPLPNILDIIDQVGGAEYFSVFDLAQGFFQIPLHEADRHKTAFCSSNG FT LYEFTKMPMGLSNSPATFQRCVDKIFKDMQGKEIFLFLDDAVIYGKTIEEH FT NTKFEKFLKRLREANLKLQPDKCEFLKKEVIYLGHLLSKEGVKPDPKKLEA FT VRNFPLPKTQKNIREFLGLAGYYRRFVHNFSGIAKPLSNLLKKDVELVWNE FT QTQEAFDILKEKLCSKPVLQIADFSKPFIVTCDASLHAIGGILSQGEIGKD FT RPIGYVSRVLSDTESRYDSYSREALAILYSVSQFRPYLLGRHFTIVTDCKP FT LIWFKNSTDLTSKVSRWRIKLNAYDYDIVYKCGKANVNADALSRNPVERSE FT ETQQEEPVLVVETRARTGKIQIPNYRENRPKRAKISHDKNQNNDTPENYDA FT EKVETTNEPDTNLTNSQDNQLKFPIESNTENENVQEVEPVINKERKYGKIF FT VECKEQLFMRKDNYIYFITSDGTPSDEGARQLHTRNELPRFSELEAGEIKI FT TNRNNKLHFAIVTRGKQALSTTEILNNITVAFKVLKSLIVKENICSISIAK FT SKDIENVLWDDVLIKLRHLLKGIPVKIIICLGIIRYIAEKDRVRIMEELHN FT TAIGGHKGVNKTYKRIKQKYYWENIKEDVQNFIRKCLTCQLKKLVRVKTKQ FT PMQITDTPQSPMEKISLDIVGPFPVSKNGYNNILTIQCNFSKYCLAIPIKD FT ATAESVADALIKRFICIFGAPLTILTDQGSNFMSSLMRRVAKKFRIKQVKT FT TAYHPQSNGSLERAHHSLTEYIKTALDKETWDECLELATFCYNTVPHEGHQ FT FTPYELVFGHLARVPSNEPLEPEDLIPTYNDYVVNLSKRLNELQRIAREKL FT IESKIRSKYYYDKKINPKEFRIGDTVWLLKGGKQYKLADQYEGPYTVVDVF FT NNQRNVKIRMKNKKIKTVNANRLRVSYIEPEEV*" XX SQ Sequence 9512 BP; 3643 A; 1712 C; 1758 G; 2399 T; 0 other; cattaatata tttattattt taaaatggtg catagcggcc gggaaatgcg gaaacgaagt 60 cacttggtag aatgagtgcc catcatgaac gcgatcggtt tgttaataca caacgcaata 120 ctaaattcac actagatttt gaggactcct tacttaccaa gcttaaggat aaagaagata 180 aactaaaaat cgcgatcgca aatagcagcc ccgaaacacc gaaattaata caagaagtag 240 actcgttaac taaggaagta agaaactcgt gcgaaagatt taatatcgac acgagtgcaa 300 atcaggactt acctcggaat aaaaaagaat actatcaata tcgtttaaat aaatcaaatc 360 agtcggagga tagtgacgac gaagtctcta ttcacgagat caaaacggaa caaaaccaaa 420 caatcaccca gaataaaaac attcttttaa aaccacccac actaaataga aacgcagagt 480 cttcgtgcct cggcacaagt acccccgtta cccgaggact cgctcaaaaa aataacttaa 540 cgactatctt tggatccctg ggaatacagg agactcctat agacaaaaac gcaataaata 600 atatcaaaca acacctaagt atcgaaatta aaccagaagt attagacgca ttttttaacg 660 acgaaaataa cgcggcagaa atcgataaaa ctataaacaa tccgggattc ggtttttttc 720 aagcaattaa acttgtgcga cagcgcctta gcactttcac aacgggaata cctcagctcg 780 aaaatataaa gataaaaaca tcaccaaatc acaccgattc agttgaaaaa agcgaactct 840 tcttatcctt tgacagaagt tttaacgcac tcaaaataga atcgccccaa aaatcgaaaa 900 tagagaaaaa accaaaaatg gccgagacaa ttcaagtccg tggaatgtcc atatccaacg 960 tgttggattt aattccacga ttcaacggac aaaacgtatc gctatcactc tttcttgatg 1020 gatgcaggga agcaaaagaa ttgctaccca gcactctcga aggagaacta gcaaaattca 1080 tccggatgcg cttatacggg gacgcgctaa atagcacccg cgggcagact tttgcgacgg 1140 taaacgaagt tatggaattt tttgaaaata tatacggttc agcaaaaacg tatcatgatt 1200 ggtctggcga cttagcgaaa atgaaacaaa aaagcaacga atccgtaatt gtatttttaa 1260 atagaattcg ggaaatcgag aaagagatca cagccgcggc tcagcgggag ggccggataa 1320 gtataaaaaa aaaacatttg aagatgatct cgaaaaagac tgcatcaaat tttttaaaag 1380 gggactacgt tgggagatcc gggggcgcat gggtacgcct accactttaa aagaggctcg 1440 cgaacaggcc attgagatag aacgcgaata tgcgggttgc gcgatcgacg acgatgagtt 1500 gccggacgca aaatttaaaa ataaagaaac gcgcaaggtg cacgtagtcg actatgacca 1560 aagttcgatg aagtgcggtt tctgtggaaa gatcggacac gtgacactaa actgtcgaca 1620 gttcgccctt caacatctgc agcgacatag cgacgtgcgt gactaccaac gagaccgagg 1680 acccccaccg cgcggcaatc aaaattttag aaacgaaacg cggcctaata atactaactt 1740 cggaggatgg aatgccgaaa atagacaacg caacggattc cttccgaatc aaatgaataa 1800 taattacaat aacaataata atttcaatcg aaggattact gaaaatccac gatataacaa 1860 cagcgagaat ttgagatctt cgaataattc aaatcgtaac aatactaaca atcaaagcac 1920 atttagaccg acgtgttact attgcggtat tcaagggcat acgcgtaatg cctgtttcaa 1980 gctgcagcag gacatgcgct cgggtaacgt agcgcaggga aacgccgagt ctctttctca 2040 ggagggcgcg cggagagaga tgacatctgg cacgcgcccg aacgattcca acgagaggaa 2100 tactcccacg tccgcaccga aactataaat ttgagtcgcg cgagcacggt gccgaccatc 2160 agagtagaca tcggagaatt taacaagcca ttagagttta tgcttgatac cggagcctct 2220 gttagtttag tgcgtaagtc ggaggtatcg ccctatacta acataaacac gaatgagatt 2280 ataagattaa aaggcataac aacccaatat atttccactc ttggcaaagt gacagtagat 2340 tttaacaacg tgtcaacaga ttttcattta gttgacgacg attttcccat cacggaagct 2400 ggcatattag gtaatgattt ctttcgtaaa aatgaagcga caattaatta ttttgaaaat 2460 tatattgaag ttaatggaca aataattcct tttgaaaagt atcctacgga aatagaaccg 2520 atagatacga aaccaagggt aacaagtgag tctaaaaata cacgagtgag aagcgatctg 2580 agtgaaggag gggagacaga ccgcggttat gatgattctt atgtcgatga tacacgaaat 2640 atatttgatc aaggaaatga taaattaatt gaagaaaata gtgaattaat taaagaaaat 2700 aaggaattaa ttgaagaaaa ttgtgaattt attgatgaaa aagataatga aaatttttgt 2760 atacgagatg aaaatattta tgttaatcag aatttggaaa atatttatgt gcatttcgac 2820 actaatgatc gtgacgttat agaaagtgat gagtttaata ttcaagaaat aaatgacgaa 2880 tcaagtgaaa atatatatac aattgaatac gttgagaatt tatttaaaga aattgagcgt 2940 ttaaagaaac ttttacaagt caacacttct caaggcaacg agagtgagca tacagattat 3000 gtgaatattg agaaaaatgt cgttatgaat aaaattaaat ccagtgcaga agacactgag 3060 atgaaaaata aagttaatgt tgctgaaatt attgatgatg aagtgattaa ggttgatacg 3120 aaaaatgtat ttgttgatga agttacggag aatatatttg aggagaataa aaataacgat 3180 aattttgaga taattgctaa tgatattgat tatgaaatcg attcagaact tgagaattgt 3240 ttactaaaag aaggatctgt aaatgctgaa acctttgaaa aattatatgg agataaacaa 3300 gaaaataaat taaataattg tcattttgtt gaaaattata tcgaaaatag tcacatggat 3360 gacttatgtt acaattatat ggaaatttca gataaagact attttgccaa cggaatgcca 3420 aataacgcaa cgacaattcc catcttccaa atcaaggaag ctgaaaatgc cgaaatgcca 3480 agaatagatc gtttgaaatc attgctaaga tatactcatt tagaaaccga actgcgccag 3540 ccaatcgaac gccttataga ggaatatgcc gattgctttc atttaaaagg ggaaaagttg 3600 gggactacta attttattaa acatcgaata ctaactgtta gtgatgtgcc tatttgtact 3660 aaaccttata gatttcctca cgctcttaaa caggaactcg ataatcaaat acaagaaatg 3720 cttgagggag gtgtgattca gccttctaat tctaattatt catcgcctgt atttttagta 3780 gcgaaatctc cggacagtaa aggtaataaa cgatatcgag tggttgttga ttttcgaaaa 3840 ttaaatgata taactataaa agacaaatat ccactaccaa acatattaga tattatcgac 3900 caagtaggtg gagcggaata tttttcggtt tttgatttag ctcagggatt tttccaaatt 3960 ccgttgcatg aggctgatag gcataaaact gctttttgct cttcgaacgg tttgtatgag 4020 tttactaaaa tgccaatggg tctttcaaat tcgcccgcaa catttcagcg ttgtgttgat 4080 aaaattttta aagatatgca aggtaaagag atatttttat ttttggacga tgccgtcatt 4140 tacgggaaaa caattgagga acacaatact aaattcgaaa aatttttgaa aagattacgc 4200 gaagctaatt taaaattgca accggataaa tgcgagtttt taaagaaaga agttatttat 4260 ttgggacatt tgttaagtaa agaaggggtt aaaccggatc ctaagaaatt agaagcagta 4320 cgaaattttc cattacctaa aactcagaaa aatattcgcg aatttttggg attagcgggt 4380 tattatcgcc gatttgtaca taatttttcg ggcattgcta aacctttgag taacttatta 4440 aagaaagacg tcgaacttgt atggaatgaa cagactcagg aagcatttga tatattaaaa 4500 gaaaaactct gtagcaaacc tgtattacaa atcgcagatt tttcaaagcc ttttatagta 4560 acgtgtgatg ctagtttaca tgctataggg ggtatattaa gtcaagggga aataggcaaa 4620 gaccgaccga taggatacgt gtcgcgagta ctttccgaca ccgaaagccg atacgactca 4680 tactctcgag aagctttagc aatattatac tctgtgtctc agtttagacc gtatttactc 4740 ggaagacact ttacaattgt tacggactgc aaacctttaa tctggttcaa aaattcaacc 4800 gatttaacgt caaaagtaag ccgatggaga attaaattaa acgcgtacga ctacgacatc 4860 gtgtacaaat gcgggaaggc taacgtgaac gcagatgctc tctcgagaaa ccccgtcgag 4920 cgatccgagg agacacagca ggaggaacct gtgctagtcg tagagacacg tgccagaacg 4980 ggaaagatac aaataccgaa ctatcgagag aaccgaccga aacgcgctaa aatctctcat 5040 gataaaaacc aaaataacga cacgcctgaa aactatgacg ccgaaaaagt cgagacgacc 5100 aatgagccgg acactaactt aacaaactcg caagataacc aattgaaatt cccaatagag 5160 tcaaatacgg aaaacgaaaa cgttcaagaa gtcgagcctg ttataaataa agaaagaaaa 5220 tatggaaaaa tttttgtaga atgtaaagaa caattgttta tgagaaaaga taattacatt 5280 tatttcataa cttcagacgg aacgccgagt gacgaaggag cccgacagtt gcacacgcgt 5340 aatgaattac ccagattttc tgagttagaa gccggggaaa ttaaaatcac gaatagaaat 5400 aataaactac atttcgctat tgttacacgc ggcaaacaag ccttaagcac tactgagatt 5460 ctcaataata taactgttgc attcaaagtt ttaaagtcat tgattgtgaa agaaaatatt 5520 tgttctatta gcattgctaa gagcaaggat attgaaaacg tattatggga cgatgtgtta 5580 attaaattga gacatttatt gaagggaata ccagtaaaaa ttataatttg cctaggaata 5640 attagatata tagctgaaaa agatagagtt agaatcatgg aagaattaca caatacggct 5700 ataggaggac acaagggagt caataaaact tataaacgga ttaaacaaaa atattattgg 5760 gaaaatatta aagaagatgt acagaatttc atacgtaaat gcctcacatg tcaattgaaa 5820 aagctagttc gagtcaaaac caagcaacct atgcaaatta cggatacgcc acaatcaccc 5880 atggagaaaa ttagtctcga tatcgtcgga cccttccccg tatcgaaaaa tggttataat 5940 aatatattaa caattcaatg taacttttca aaatattgtc tcgcgatacc tatcaaagac 6000 gccaccgcgg aatctgtcgc cgacgcatta ataaaacgtt ttatttgtat ttttggcgca 6060 cctcttacga ttctcacaga tcaaggaagt aatttcatgt caagtttaat gagacgagta 6120 gcaaagaaat tccgtattaa acaagtaaaa accactgcgt atcatcctca atctaacgga 6180 tccttggaaa gagcacatca ttcattaacc gaatatatta aaaccgcttt agataaagaa 6240 acttgggacg aatgcttaga attagcgact ttttgttaca ataccgtgcc tcacgagggt 6300 catcagttca cgccatacga gttagttttc ggacatttag cacgcgttcc ttcgaacgag 6360 ccactcgagc cggaagatct cattcccacg tataatgatt acgttgtaaa ccttagtaaa 6420 cgcttgaacg aactccaaag aattgctcgc gaaaaattaa ttgaatcgaa aattcgctca 6480 aaatattatt atgacaaaaa aataaaccca aaagaattta gaataggtga tacagtttgg 6540 ttacttaaag ggggaaaaca atataaatta gctgaccaat atgaagggcc gtatactgtt 6600 gtcgatgttt ttaacaacca gcgaaacgtt aaaatccgaa tgaaaaataa gaaaattaaa 6660 accgtaaatg caaatagact tcgagtttca tatatagagc ctgaggaggt ctaaaaatat 6720 gcgttatttt aagattaata tatgtatatg tataaatata tctgcgtaca cacatactta 6780 tatgcgcaca cacgtcgttt gtactcattt caaaagtatt aagtaaaaaa aaagcatgaa 6840 tacgtgtatc cactaaacaa acgccactaa ccaaatcaag agtgaagagt tcaagtataa 6900 ataaagaaag cgatatctcc ctctaggaac ataacacgag acgagacaat aacgccgttg 6960 attatccagc cgacgaagca aaatggcaac acctacaatt caagacaata ctgttaaaca 7020 aatattgtgt tacgggctga aaatgaggca agagtcccga caaggcatca ctcctgcata 7080 ccaccgttcc ctagtcgaca acatagaatt tataacgtca attatgtaca acaataatga 7140 aagtttccgc gaagctatag aatcaacttt aaacccagat aaccaacaaa agataaccta 7200 ccgaaaaata aataaagcag catgcgaact catgcaacat cccgatttcc gaagtaaaaa 7260 gaaggatgac aataatccat ggcgaaaaga atatgccact atggagagga gaacatttcg 7320 atttgaaata ttacggagag aaagcttcga ggggtggtct gtaaaagaag tgtctccgaa 7380 attattggca gccgaaggct tttactatac gcaacgcaga gacgcagttc gctgcttttc 7440 ttgtagtgtc gaaattagtg gctggggaaa cagaggcaaa tcttaaccgg caaacagcac 7500 atcggtgaat agagccagga aattcgtcca ccttgacatg gcataatatg aaacagtccc 7560 gaattaggtc cagactcaca gcacattgaa gcacagggta acgatactag aaaaataaca 7620 taaaaacgcg tcaaaataaa cactagtcaa actaagaaaa gaaaacaaca atatttacga 7680 accgagatga caccataatt agtacgctta accatgatcc gagaagggct cgtctcgtaa 7740 tgaaaacaaa caccacacca gccccagccc aaattatgac aaggctccca aggtaaatca 7800 cttacttcaa tcggtttaat aacatcaatc acctaaagaa aaatcaataa caataaacaa 7860 aataatgaaa taaaacgtag tagagcacaa tataaaaata ataaaaatat taaaattctc 7920 ttacctcaca cattttgttg aaagtcggaa taaataactg taacagacac gaacaaacta 7980 actgcacact actaaagcac tcgacagact gatgacagat cagcctgcat ctcgaggacg 8040 agccagaaaa atcagagaaa tcgagcacct tagactagga aagaaagtgg ggaaaaacac 8100 atgtccttaa gaaaaaggag cttgggaaag aaagtaataa aaattggaaa ccgacgaaag 8160 cgcatgatgg gacctcctgc acacgccagg aagaagtata aatatatctg cgtacacaca 8220 tacatgcgca cacacgtcgt ttgtactcat ttcaaaagta ttaagtaaaa aaaagcatga 8280 atacgtgtat ccactaaaca aacaccacac cagccccagc cgaaattatg acaaggctcc 8340 caaggtaaat cacttacttc aatcggttta ataacatcaa ccacctaaag aaaaatcaat 8400 aacaataaac aaaataatga aataaaacgt agtagagcac aatatacaaa taataaaaat 8460 attaaaattc tcttacctca cacattttgt tgaaagtcgg aataaataac tgtaacagac 8520 acgaacaaac taactgcaca ctactaaagc acttgacaga ctgatgacag atcagcctgc 8580 atctcgagga cgagccagaa aaatcagaga aatcgagcac cttagactag gaaagaaagt 8640 ggggaaaaac acatggcctt aagaaaaagg agcttgggaa agaaagtaat aaaaattgga 8700 aaccgacgaa agcgcatgat gggacctcct gcacacgcca gaaagaatgt tgaaatttca 8760 ttttcaacat tctttttctg gtggggtgga attttattgc ataaaatcaa ttgaaaatat 8820 aaattgtaaa ataaaaagag aaaagtactc ttagggcata acactaactc taataaatat 8880 ctgcatatac taattaacat tctttatgtt acagttaact atcattcata tacaaacaaa 8940 aattaaataa attatacacc tatcaacata tccctatcac actaacaaac attttttttc 9000 cacatattgt aaataaaaaa tcacctttat tagtaatagt atacgaaaaa tacaaaataa 9060 accattattg taaacatcct tctacatata tgtaaaatac gaaaaatatc cattattgta 9120 aaaatttctt tttgtttcac ctatatgtaa aatacgaaaa gtatccatta ttgtaaaaaa 9180 attttttcct tccttctata tgaaatctta tatgtaaaat gcgaagaata tccattgtct 9240 accagtaatc acatatgtaa gatacgaaaa atattcatta ctgtaaatta acatttgcgc 9300 aacgaaaaat catttctgta cacatcggaa actatgaaaa gaaaaacttt catcaaaatt 9360 tatgtatcaa agaaaagtac cttgcaacaa attatgtaaa taaataatga tgtgcggaga 9420 atgaaaatac ccatcattac ttataacaca cacacatact aaattttacg aacgtcgagt 9480 ttcactcgtc cgttcttttt cgggcgggga gg 9512 // ID Kiri-34_AAe repbase; DNA; INV; 3914 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-34_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3914 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 729-729 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 311..688 FT /product="Kiri-34_AAe_1p" FT /translation="MMLLLSQSSTTINWLPMGIVSNDLSFNFLLPPLVIHD FT SHPSSPLIVPLLKVYNRSALPKNILENPSRMYPMNPFLSISMISSILKVNR FT GGTQEMPWITAVTVVVNAVVVAAASLDHTAAAVAVFFC" FT CDS 856..3708 FT /product="Kiri-34_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLCFPMMSVSQISDVAHSNTSIANIPRAVMNVALRND FT CINLCHMNVQSLCARQLSKFDEFKNCFINSKLDLICVTESWLHENIPDSSI FT AVEGYNFLRNDRGHSRGGGICVYYKNDLFCKEIAASQPSGDGNRTEYLFIE FT VRVNQDAFLLGVIYCPPGVDCAGVLEQKLAELSCSYTKIILIGDFNTNLKK FT SCIRTNRFCSVMDSFGIFCINNEPTHFYSGGCSLIDLLLTNDENFVLNFNQ FT VSAPGFSQHDIIFSSLNISRSRMQHSVPYRDYNRINISELQDALNAIDWSL FT LYSITDSDIALDFFNNCITELYENYVPLRYPRPNRNAPWFNDTVLNAIVIR FT DIAYRQWLRSKNASDRAQFKRLRNRVTDIINQAKSDYLCMNLQSTTSSKVL FT WKKLKKLNVTGNNDNSKSNFTNNEINDYFGENFTRDASLSPIPPSNIDGFN FT FSPCTEMEVSAAIYSISSDAIGLDGIPLRFVKMILPMIICPLTFLFNLSIT FT TSKFPRAWKSAKIIPIRKKPTGDNLDNLRPISILCSLSKVLEKILKNQIQN FT YIQRHTLLSRYQSGFRSAHSTTTALLKVHDDIHQAIDKKGIAFLLLIDFSK FT AFDRVSHVKLLQKLSQNFGFSRKAVYMIKSYLDCRSQVVELNGNRSSPVSI FT LSGVPQGSVLGPLLFSLFINDLPSILKTCFVHMFADDVQLYFFSTNMDISS FT MAHLINADLRNVALWSANNLLPINASKTKAMFICRRLIRPVLPDLFINGNK FT IDYVNKTTNLGIIFQNNLEWDSQVNLQCGRIYAGLRQLKLTADMLPIPIKV FT KLFKSLILPHLSYGSELILNASAAAYNRLRIALNNCIRWVFHLSRYSSVTH FT LQRRLLGCSFYDFFKLRSCLSLFRIIHHGPKYLADKLHIFRSSRIQHFILP FT QHYSSYYGNTFFVRALVTWNQLPSSIKSIHSAVRFQRACVSWLNEGR" XX SQ Sequence 3914 BP; 1115 A; 743 C; 741 G; 1308 T; 7 other; ggctagcagt tgcgcttggg tastgcgaat ctgacatccc gttggtgtat accaaacgac 60 tgtccsgtgc tccaataawt gctggkgcaa caccgcctag cttcgaagga tgaattctkc 120 gtacggtatc tttctaggaa gaatttgagc ttgagtcatc tcggtgtcgg tgctgaaaaa 180 cgagtttatt tgaacgaaag tttgactgaa gctgcacgga wcatcaaagg gatcgctctc 240 aagctgaagc gttcgggcca tatwaggaat gttttcacga agggaggagt gatatttgtg 300 aagccgttgg atgatgttgc tgctcagcca atcttcgacg accatcaact ggttgcctat 360 gggcatcgta tctaatgacc tttccttcaa ttttcttctt cctcccctag ttatccatga 420 ttcccatcct tcttctcctc tgattgttcc tctcctgaaa gtttacaatc gttctgccct 480 tcctaagaat attcttgaaa atccttccag aatgtatcct atgaatccat tcctttcaat 540 ttccatgata tcttccatcc taaaagtcaa ccgtggaggg acacaggaga tgccctggat 600 tactgctgtt actgtggttg ttaatgctgt tgttgttgct gctgcctcgt tggaccacac 660 cgctgctgct gttgctgttt ttttttgtta gctcttaatt atgtccaata gttgaattta 720 cgatatcgga ccacttgata attatgctag gtttatttag agattttgca atgattagta 780 cctacgccat tgacctgttg cgctatccat tcacttctat ccttcatttt tgcttgctca 840 ttacgtcaca cttgaatgct ttgcttccca atgatgtcgg ttagtcaaat aagtgatgtt 900 gcgcattcaa atacaagtat tgccaatatt ccacgagctg tgatgaatgt tgctcttcgt 960 aatgactgta ttaatttatg tcacatgaat gttcaaagcc tgtgtgctcg tcagcttagt 1020 aaatttgatg aattcaaaaa ttgcttcatt aacagcaaac tagatttgat ttgtgtaact 1080 gaatcttggt tgcatgaaaa tatacctgat agctcaattg cagtcgaggg ctacaacttt 1140 ttaagaaatg accgtgggca tagtcggggt ggaggaattt gcgtgtatta taagaatgat 1200 ctgttttgta aagaaatagc cgcctctcag ccgagtggtg atggcaatag aaccgaatat 1260 cttttcatcg aagttcgagt aaatcaagac gcatttttac ttggtgtgat ctactgtcct 1320 ccaggagtgg attgtgctgg tgttttagaa cagaaacttg ccgaactgtc ttgtagttac 1380 actaaaatta tattgattgg cgattttaat acaaacttga agaagtcttg catcagaaca 1440 aaccgtttct gtagtgtgat ggacagtttt ggtattttct gtataaataa cgaaccgact 1500 catttttact ccggtggatg ttcacttatc gatttgttgc taactaatga tgaaaatttt 1560 gtgttaaatt tcaatcaggt ttctgctcca ggtttctctc agcatgatat aattttttca 1620 tcattgaata tttcacgatc tagaatgcag cattcagtac cctacagaga ttacaatagg 1680 ataaatattt cagagcttca ggatgcttta aacgcaatcg attggtcgtt gttatatagt 1740 ataactgatt ctgatatagc gcttgacttt ttcaacaact gcatcactga actgtatgaa 1800 aactacgttc cactgcgata tccacgtcca aatcgaaatg ctccgtggtt caatgatact 1860 gttctgaatg ctattgtcat tagagacata gcttaccgtc aatggctccg cagtaaaaac 1920 gcttcagatc gtgctcagtt caaaagactc agaaatagag tgaccgatat aataaatcaa 1980 gctaaatctg attacctgtg tatgaacttg caatcaacaa cctcgagtaa agtattatgg 2040 aaaaaactca aaaaattgaa cgttactgga aataatgata attcaaaatc taactttact 2100 aataatgaga tcaatgatta ttttggggaa aattttaccc gggatgcatc attgtctcca 2160 attcctccat caaacataga tggattcaat ttttcaccct gcactgaaat ggaagtgtca 2220 gcagctattt attcgatatc ctctgatgcg attggtttgg atggaattcc tttgcggttc 2280 gtaaagatga tacttcctat gattatctgt ccattaacat ttttatttaa tctatcgatc 2340 acaacttcaa agttcccacg tgcatggaaa tctgcaaaaa ttattccaat tcgaaagaaa 2400 cccacaggag ataatctaga taaccttcgc cctatcagta tcctttgctc tttatctaaa 2460 gtacttgaga agattttaaa aaatcagatt cagaactaca ttcagcggca tactctatta 2520 agtcgttatc agtctggatt tcgttctgct catagtacaa ctactgctct tctcaaggta 2580 catgatgata ttcatcaagc tattgataag aaaggtattg cgtttttgct tctgatcgat 2640 ttttcaaagg catttgacag agtttcccat gtcaaattgt tacaaaaact atcgcaaaat 2700 tttggtttca gtcgcaaagc agtttatatg atcaaatctt accttgactg tcgctctcag 2760 gtagtggaac tgaatggaaa tcgatctagc cctgtcagta ttttatcagg ggtacctcaa 2820 ggatccgtat tggggcccct cttgttttca ctattcataa acgatcttcc atctattttg 2880 aaaacgtgtt ttgttcatat gtttgctgat gacgtacagc tatatttctt ttcaactaat 2940 atggatataa gtagtatggc acatcttatc aatgctgatc ttaggaatgt agccctatgg 3000 tccgcaaata atttgctacc aattaacgca tctaaaacaa aagcaatgtt tatctgcagg 3060 cggctcatac gcccagttct gcctgacctg tttattaacg gtaacaagat agattatgta 3120 aacaaaacga caaacttggg tattattttt caaaataatc tggaatggga ctctcaagtg 3180 aatttgcagt gcggaagaat ttatgcaggc ttaaggcagc ttaaactaac agcagatatg 3240 ctaccaatcc ccataaaagt aaaacttttc aaatcgctca tattacctca cctgtcttat 3300 ggatctgaat taattttaaa tgcttctgcc gctgcataca atcgcttgag aattgctttg 3360 aataactgca ttcgatgggt ttttcattta tcaagatact caagtgtcac tcacttacaa 3420 cgtcgattat tgggatgctc tttttatgac ttttttaaac tgcgtagttg tttatcctta 3480 ttcagaatta tacatcatgg tcctaaatat cttgctgaca agctacacat ctttcgcagc 3540 agtcgtattc aacattttat tttgcctcaa cattattctt cgtactatgg caatactttc 3600 tttgttcgcg cacttgttac atggaatcag cttccatcta gtataaaatc aatacattca 3660 gccgtaagat tccagcgggc gtgtgtgagc tggcttaatg agggtaggta aatggaatgg 3720 atgttaaata gcaagaggtc aatgagtttg tttgtttttt ttttaagtta aattgaattg 3780 aaagtgtatt ggattgtatt agatttcgaa tgagtggttc aggcaacgaa agtaccgatt 3840 gtagagcgtt ataagggtta cccttaatct acaagtatat ggatgaataa atgaaatgaa 3900 atgaaatgaa atga 3914 // ID Gypsy-69_AA-I repbase; DNA; INV; 4243 BP. XX AC supercont1.165; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-69_AA_; KW Gypsy-69_AA-LTR; Gypsy-69_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4243 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.165; Positions 1823162 1827404. XX CC Positions [3286-3813] - Integrase core CC 'ACATG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 136..1212 FT /product="Gypsy-69_AA-I_1p" FT /translation="MAQNGQGQQQPLPEALAGAQAGPAMAAAPIINSGIPF FT PTPLELSGNLRENIVNWKDTFENYLIASGVEFLEERVKVATFKSALGAEAR FT KIFNLWPLQEHERNTVAACMESLSRNMIPQRNVKLARYEFYQCHQETGDDS FT KPPESMTQFINRARELIKDCNFGAMEDEMLRDKIITGISDVPLKKRFIEQV FT DLTATRVIQQCQTEEVTREEMMRNQWLDTQVHSINKVSGVKPKQKQCSFCG FT RKYHSDLSDCPARGATCNFCGLKNHFAAACKNKKEQGSSGNSGTRMKKKVN FT TLEAENNQAFGSENTESDEESVEMLQYLYEVKQEENDNLLALVLHSSMGTH FT ASIRYDACWTQEHHAT" FT CDS 1233..3083 FT /product="Gypsy-69_AA-I_3p" FT /translation="MNLLGTKHVRLDNDKAVFKVFGGERCKSIGRTEVDCV FT HKGTPYKMVFHVVDFDQPPILSWKTCLNLKLLQVCFALSEEQKSTSKKILE FT QFPDVFTGLGKLEGTIHLEVDENAKPTIQQPRRIAVTLVEELHQVLAEMEK FT DGIIAPEKHHSDWVSNLILVKRNGKLRVCIDPIMLNKALKRPRYQMPTLDE FT VLPELANAKLFTTVDARAGFWQIQLDDDSSRLTTCWTPFGRYKWLRMPMGI FT TSAPEIFQLKAHEAIQGLRNVKALMDDFLIFGCGNTVAEAKADHDRNLEAF FT LRRMREKNLKLNPDKIKLCQNSVRFFGHILTAEGVKPDSEKTSSITMMPIP FT DDVAAVQRFLGMITYLSRYLPSLSTVAEPIRRLTWKDETWTWSRQQQDAFE FT RLKKMISTAPVLRYFDATKNTVIQCDSSSVGLGAVLLQEGQPVVYASKTLS FT ATERRYAQIEKETLAILLACRKFEMYILGRPKTIQTDHQPLVKIFHKPLSE FT APLRIQRMILALQRYNITLQFTPGKEVIIADMLSRAAVESNDETSREIYDI FT YGIESSNAVDFGRQSRADPQSITGGSRDAGHHAVYHQRLAGEMRRDSSKSE FT SFLEIQIGAQHPQWIGIAM" FT CDS 2842..4200 FT /product="Gypsy-69_AA-I_2p" FT /translation="MTKPVEKFTTSMESSQVMPWISDDKVEQIRKASQEDP FT EMQAIMQYIINGWPARCDAIPANLKVFWKYKSELSTHNGLVLRCDRILVPH FT SLRRDILERLHQSHSGIEATIQLAKDTVFWPGLRDQIRQRVQSCDVCAKYS FT ASQSSQSMQSHLIPSYAYQKVSMDLCECDFESQKRIYLITVDHFSDYFDVD FT ELRTPNTAAVIQACKKNFARFGVPQEVSSDGGPQFMSQEFIRFATLWDFKH FT SVSAPFHQQANGKAESAVKIAKMLMKKAHESKQDFWKLLLQWRNTPNKVGC FT SPVQRLLSRRTRFDVPMSEAKYSPKLQEGIKEKIVKCRQQAKLYYDRKTRR FT LPDLEFGQPVFVKLKPSDKEWQRGTVVDPITDRSTIISVGEREYRRDNTCI FT KPALPSYVSSKPKEPSSGPLEHPKTLEQSTPVAEQPMADIGRPKRIIRLPK FT RFEDFEVS" XX SQ Sequence 4243 BP; 1299 A; 947 C; 1024 G; 973 T; 0 other; tggtgtcagt acgggtgatt tttacttcaa aagaaatgtt tgctcttgcg aatcttgagt 60 aacaagctct agtaagcaaa ccgcgagaac cgaactaata aacagttatt gtgaacagct 120 cagcaacgaa tagtaatggc tcaaaacggc caaggtcagc aacaaccact gcccgaagcc 180 ctagcagggg cacaagcagg accagcaatg gcggcagctc ccatcatcaa cagcggcatc 240 ccattcccaa cgccactgga gctaagtggc aatctgagag agaacatcgt caattggaag 300 gacacgtttg agaactatct cattgcttcc ggcgtggaat ttttggagga acgtgtcaaa 360 gtggcgactt tcaagtcggc tttaggagca gaagcgagaa agattttcaa cttgtggcct 420 ctccaggaac atgagagaaa tacggtagca gcgtgtatgg aatctctctc tcgaaatatg 480 attccgcagc gtaacgtgaa gcttgctcgt tatgaatttt atcagtgtca tcaggaaact 540 ggcgatgatt ccaaaccacc cgaatcgatg acgcagttca tcaaccgtgc acgagaactc 600 attaaggact gcaactttgg tgccatggag gacgagatgc tccgggacaa gataattacc 660 ggtatcagcg acgttcctct gaaaaaacgc ttcatcgaac aagtggatct tacagcaaca 720 cgagtgatcc agcaatgcca aacagaagaa gtaacacgcg aagaaatgat gcggaaccag 780 tggctcgata cacaggtaca ctcgatcaac aaagtatccg gtgtcaagcc taaacaaaag 840 cagtgttcgt tttgtggaag gaaataccac agcgatctgt ctgactgtcc agcccgaggt 900 gccacctgca acttctgtgg actgaaaaac cattttgcgg cagcgtgtaa gaataagaag 960 gaacaaggaa gctccggaaa ttcaggaacg aggatgaaga agaaagttaa caccttggag 1020 gcggaaaaca atcaggcttt tggatctgaa aatacggaaa gcgatgaaga atctgtggaa 1080 atgttgcagt atctctacga agtgaagcag gaagaaaacg acaaccttct tgccctggta 1140 ttacattcat cgatgggaac gcacgcaagc atcaggtacg atgcgtgctg gactcaggag 1200 catcatgcaa cgtaatcgga aaaagcaacg tgatgaacct actcgggacg aagcatgtcc 1260 ggctagacaa cgacaaagcg gttttcaagg tctttggtgg tgaaagatgt aaatctattg 1320 gaagaaccga agtggactgc gtacacaaag gaactccata caagatggtg tttcacgtcg 1380 tcgacttcga ccagccaccc attctgtcat ggaagacatg ccttaatctg aagcttcttc 1440 aagtatgttt tgcactttcg gaggagcaaa aatcgacgtc gaagaaaatc cttgaacaat 1500 tcccggacgt gttcaccggt ctgggaaagc tcgaaggcac tattcatttg gaggttgacg 1560 aaaatgccaa gcccacaatt cagcagccac gccggattgc agtgacgctt gtggaggagc 1620 ttcatcaagt tttagcggaa atggaaaagg acggtatcat tgcaccggaa aagcaccact 1680 cagactgggt cagtaacctg atcctggtca agcggaacgg aaagttgagg gtatgtatcg 1740 atcctattat gctaaataaa gctttgaaaa ggccgcgcta tcagatgcca acattagacg 1800 aggttctgcc agaactggcc aatgcgaagt tgtttacgac tgtcgacgcc cgtgccggtt 1860 tttggcaaat ccaactggat gatgatagtt caagactcac tacgtgttgg accccgtttg 1920 gtcgttacaa gtggctacgg atgcctatgg ggatcacgtc tgcacccgaa attttccaac 1980 ttaaagcaca cgaagccata caaggactgc ggaatgtcaa ggcgctgatg gacgattttt 2040 tgattttcgg ttgcggtaat acagttgcag aagcaaaagc tgaccacgat agaaaccttg 2100 aagcttttct ccggaggatg cgtgagaaga acctgaagct gaatccagat aaaataaagc 2160 tttgccagaa cagtgttaga ttctttggcc acatcctaac ggcagaagga gtaaaaccgg 2220 actcggaaaa aaccagcagt ataacgatga tgccaattcc agatgacgtc gcagctgttc 2280 aaagattttt ggggatgatc acatatctgt ctcgttattt acctagcctc tccactgtag 2340 ctgaaccaat ccgacgactt acctggaagg acgaaacatg gacatggtct cgtcagcaac 2400 aagatgcttt tgagcgtcta aagaaaatga taagcacagc accagtactg cgatactttg 2460 atgccacaaa aaataccgtg atccaatgcg acagtagcag cgtaggacta ggcgctgttc 2520 ttctgcaaga aggtcagcca gtcgtctatg cgtcgaagac cttaagtgct actgaacgcc 2580 gatatgcaca gattgaaaag gagactttgg caatccttct cgcatgtcga aaatttgaga 2640 tgtatatctt gggacgtccg aagaccattc aaactgatca tcaaccgttg gtcaaaatat 2700 tccacaaacc gttatcggaa gcacctttgc gtattcagcg aatgatccta gccctacagc 2760 gatataatat aaccttgcaa tttactcccg gtaaagaagt aatcatagct gatatgttgt 2820 cccgagcagc agtagagagc aatgacgaaa ccagtcgaga aatttacgac atctatggaa 2880 tcgagtcaag taatgccgtg gatttcggac gacaaagtag agcagatccg caaagcatca 2940 caggaggatc cagagatgca ggccatcatg cagtatatca tcaacggctg gccggcgaga 3000 tgcgacgcga ttccagcaaa tctgaaagtt ttttggaaat acaaatcgga gctcagcacc 3060 cacaatggat tggtattgcg atgtgatcga atactagtac cacatagtct acgaagagat 3120 atactggaac gtctccatca gtcccactct ggcatcgaag caactattca actggcgaag 3180 gacacggtat tttggccagg cctgcgggat caaatcaggc agcgcgttca gagctgtgac 3240 gtttgtgcta agtactcagc cagccaaagt tcgcagtcca tgcaaagtca cctaattcca 3300 tcgtacgcat accaaaaggt ttccatggat ttgtgcgaat gtgatttcga aagtcaaaag 3360 agaatttacc ttatcaccgt ggaccatttc tccgactact tcgatgttga tgaactgaga 3420 actccgaata cagcagcagt aattcaggcg tgcaaaaaga attttgctag attcggagta 3480 ccgcaagaag tatcaagtga tggtggtcca cagttcatga gccaggagtt tattcgattt 3540 gctaccttgt gggacttcaa acacagtgtc tcggcccctt ttcaccaaca ggcgaacgga 3600 aaggccgagt cagctgtaaa aatcgccaag atgctcatga agaaggccca tgaatcgaaa 3660 caagatttct ggaaattgct gttacaatgg cggaatactc ccaacaaagt tggctgctct 3720 ccagttcaac gtcttcttag cagaagaact cggtttgatg tgccaatgtc cgaagcaaag 3780 tactccccga agcttcaaga aggaataaag gaaaagatcg tcaagtgtag gcagcaagct 3840 aagttgtact atgatcgaaa aacacgacgt ctcccggatt tggaatttgg ccaaccagtt 3900 tttgttaagc tgaagccttc ggacaaagag tggcaacgag gaacagttgt tgatccaata 3960 accgatcgtt caacaataat ttcggttggt gaacgggaat acagacgaga caacacttgc 4020 atcaaaccag cactgccctc atatgtatct tcaaaaccta aagagccgtc ctcgggtccg 4080 ttggagcatc caaagacgtt agaacagtca acacctgtag ccgaacaacc catggcagac 4140 atcggtagac caaagcgtat tattcgttta cccaagcgtt tcgaagattt tgaggtttcg 4200 taagttatgg gctatttaag ttttctttct ataaaagagg gga 4243 // ID MuDR3_SM repbase; DNA; INV; 2543 BP. XX AC . XX DT 07-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2543 RA Jurka J.; RT "MuDR-type DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 157-157 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 460..2181 FT /product="MuDR3_SM_1p" FT /translation="MTYTNKDKAIDFIKSENTWSYHYENRTADSKKVYYRC FT NKAKLRGKQCPAGILLQIHLNNECVDLYRTGADHMHDNLLDNNRLTEEMKE FT EIKTLYDLKIKPKAILQKLREKGLHPKNKSQVSSYLKKLNKLKYGSTKISL FT GELEQWCIDHSNIPEDENESFVAAFSIRYGNEENEQEMETEDEEEEDEYNF FT RVLISTKRLLKLAKKTKKIHADATYKLNWEGMPVLIIGTTDMDRHFHTFGI FT SVCSNEKTKDFIFIFESLVNSLKNLGENIHFDTLISDASNSIRNAFKKVFG FT EDSNLIMCWAHMRRNVVKHLHLVEQEFREDILNDIDTLQLASNKIIFEKAK FT QLFIKKWLQKKQNDFIDYMKEMWLTTHQNWYEGALSNTPSTNNALESFNLV FT IKKEDTLRERFPLSRYFKLCLDSVKKWSNLYENNDKVFVESPTIDLKKWTD FT GYHWAKSSKIITSKTLSNCVEYYCPAGNESKVSEDQIQNVKDMRWNTFDQF FT RKRSSVVWIVNISLDGAKWKEGTCTCPSFLKEYMCKHVIGVSIRLKYVKPP FT PAAKQIPIGEKRQRGRPKKTTKALLVD" XX SQ Sequence 2543 BP; 1027 A; 338 C; 367 G; 810 T; 1 other; taatgatcag tattttaata aattatcaat attttttaat atgatcagtt ttaataaaat 60 atcagtatca cttgatcagt aattaattca atgtttatga tcagtatttt attctaatga 120 tcagtaattt attagaatga caagcaaatt attttttatg acnattattt aattttattt 180 gaccagtatt ctaattaatt tgattagtag tcaattagtt attgttttaa aaataattat 240 ttgactattt aaaacgttaa catttttaaa tcttaacaat tttaaatttt aaaaaaattt 300 aaaaactaaa taaaaaataa taaccatggc ttctcaaatc ccattttacg aatcttcaga 360 atcagaagat gatttcgaaa atgaaatata taaagataag gaaaatatta acagaggtac 420 caatcgaatg aaaaaagatc acgtgattgg aagtttgaaa tgacatatac aaataaagat 480 aaagcgatcg atttcattaa atccgaaaat acttggagtt atcattatga aaatagaact 540 gctgattcta aaaaagttta ttataggtgt aacaaagcaa aattaagagg aaaacaatgt 600 ccagctggca ttttacttca gatacacctt aacaatgagt gcgttgattt atatcgaact 660 ggagcagatc acatgcacga caatctatta gataataata gattaactga agagatgaaa 720 gaagaaatta agacgcttta cgatttaaaa ataaaaccaa aagcaatatt gcaaaaactt 780 agagaaaaag gcttacatcc aaaaaacaaa tcacaagtat ccagctattt gaaaaaatta 840 aataaattga agtatggatc aacaaaaatc agtcttggag aacttgaaca gtggtgtatc 900 gatcatagta atatacctga agacgaaaat gaaagttttg ttgctgcatt cagcattcga 960 tatggcaatg aagaaaacga acaagaaatg gaaactgaag atgaagagga agaggatgaa 1020 tacaacttta gagttctaat ttctacgaaa agacttttga aattagcaaa aaaaactaaa 1080 aaaattcacg ctgatgctac atataagtta aattgggaag gcatgccagt tttaataatt 1140 ggtacaaccg atatggaccg ccattttcat accttcggta tctccgtgtg ctctaatgaa 1200 aaaacaaaag atttcatttt tatatttgag tcacttgtta attcgttaaa aaatctcggt 1260 gagaacatcc attttgatac tttgatttca gacgcttcaa attcaatccg aaatgctttc 1320 aaaaaggttt ttggagaaga ttcaaactta attatgtgtt gggctcatat gcgtcgtaat 1380 gtagtcaaac atttgcattt agttgaacaa gaattcagag aggatatatt aaacgatatt 1440 gatactcttc agttggcttc aaataagatt atttttgaaa aagccaagca attatttatt 1500 aaaaaatggc ttcaaaagaa acaaaatgat tttatcgact atatgaagga aatgtggtta 1560 acaactcatc agaattggta tgaaggagca ctcagcaata ctccatcaac caacaatgca 1620 ctagaatcat ttaatttagt tattaaaaaa gaagatactc ttagagaaag atttccatta 1680 tcgcgttatt ttaaactttg tctcgattct gttaaaaaat ggtcaaatct ttatgaaaac 1740 aatgacaaag tattcgtcga atctccaacg attgatctaa aaaaatggac agatggttac 1800 cattgggcca aaagtagtaa aataataaca tctaaaactc tttccaattg tgtagagtac 1860 tactgtccag caggaaacga atcaaaagta tcagaggatc agatacaaaa tgtgaaagac 1920 atgcgttgga atacattcga tcaatttaga aaaagatctt cggttgtgtg gatcgtaaat 1980 atttcgttag atggtgccaa atggaaggaa ggtacatgta cttgtccatc ttttttaaaa 2040 gaatacatgt gtaagcatgt aattggtgta tctattcgtc taaaatatgt caagccacca 2100 cctgctgcca aacaaatacc aatcggagag aaacgtcagc gaggaagacc aaagaaaaca 2160 acaaaagcat tactagtaga ttaaatcaaa aatatatttt tttaaaagaa aattaatttt 2220 aaatttaaat aaaataatcg ataaaactaa aaaacaactt ttaatactga tcatttttct 2280 ttactgatca ccttttacca atcattattt gaataaatac tgatcacttg ttaatttttg 2340 tttgatcaaa ttccatttta atactaatca tttattaatt tttaactgat taaatgacat 2400 taaatactga tcatttaatt aaaatattga tcattttaat aaaatactga tcaaaaaaaa 2460 attttactga tcatttttta cttaaaatac tgatcaaaaa cattaaaaaa tactgatcat 2520 ttttactaaa atattgatca tta 2543 // ID MuDR4_SM repbase; DNA; INV; 1981 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; KW Autonomous DNA transposon; MuDR4_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1981 RA Jurka J.; RT "MuDR-type elements from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1902-1902 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 170..1789 FT /product="MuDR4_SM_1p" FT /translation="MNTLLRTNEIISNNSEQLSLEFRVGSMFSNIDEFLQL FT FSKSAILKRVPYIVHRRNKKHVTLSCKSLSCDFKVSAYKKIVLDQCIVSVM FT HPFHTTDCSYSKLSTCIAIKSVISESTPSSAKASEIIELVRQKTGSQLKYI FT TAYKALESYKSKAQIVLENSYSFINSLISSINPLDTFTNLELENSGFKRFF FT MSWRATKHFYSLNRKCITIDGTFLAGVNKGTLLVAVAQDGKDQLVLLAYAI FT VESENRSSWLYFLENLNRHFDINDSSTIIMSDRDVGLLSAVNSVAPRAIKC FT NCVRHIAMNLKSRFHNILLMEKYWYLVYTYDEIAFERGMRELQVLNEEFYH FT ELVNMGISSWANSKCPIAKYGKNTSNAAESMNSAIKKFIKQDITNLIISLN FT NYSMKIFSQRRERTIGNSIFAKQLASVETNTTIGRLYTIAESNANVFLVER FT EFIVDFSSKKCSCNRSFEFGYPCSHLCAVIVYLRQDPKAFVEIYFTSSNYH FT GSYVDSIAPLSVISLQRDNTLPPEARRSRGRPRVSRIRSAREN*" XX SQ Sequence 1981 BP; 695 A; 298 C; 327 G; 661 T; 0 other; tagcatattt tttgtcgata atttaaaaaa tggcatatca atttggtaaa tttaaaaaat 60 ggcatccaat aaaacaaatt ttgaaaaata ttatttaatc gtaaaaatta taaaaaatat 120 catttaatta taaaaaacta taaaaaatat tatttcgata ccattcttca tgaatacatt 180 actcagaaca aatgaaataa tctccaataa ttcggaacaa ttatcattag aatttagagt 240 tggtagtatg ttttcgaaca tagatgagtt cttgcaacta ttttcaaaat cagccatact 300 taaaagagta ccatatattg ttcaccggcg caacaagaaa catgtaacac ttagttgtaa 360 atcactttct tgcgatttta aagtatccgc atacaaaaag atagttcttg atcagtgtat 420 agtcagcgtt atgcatccat ttcatacaac tgattgcagt tattcaaaat tgtcgacgtg 480 cattgctatt aaaagtgtta tttctgaatc tactccttca tcagcaaagg cttcggagat 540 tattgaattg gtgagacaga aaactggaag ccaattgaaa tatattacgg cgtacaaggc 600 actcgaaagt tacaaatcca aagcccaaat agtgctcgaa aattcatact cgtttattaa 660 tagcttaata tcttcaataa atccattaga cacattcact aatttagagc tggagaattc 720 aggatttaaa cggtttttca tgagttggcg agccactaaa catttttatt cgctaaatag 780 gaaatgtatt acaattgacg gaacttttct ggctggtgtt aataagggaa cattattagt 840 cgctgttgca caagatggca aagatcaact tgttctatta gcctatgcta tagttgaatc 900 tgaaaatcga agcagctggt tatatttttt agaaaaccta aataggcatt ttgacataaa 960 tgattcttct acgataataa tgtctgatag ggatgttggg ctgttatcag cggtcaattc 1020 agtggcccca cgcgcaatca aatgcaattg cgttagacat attgcaatga atttaaaatc 1080 gaggtttcat aacatattgt taatggaaaa atattggtat cttgtctaca catatgatga 1140 aattgctttt gaaagaggta tgcgagaact acaagttcta aatgaagaat tttatcatga 1200 acttgttaat atggggattt ctagttgggc taattcaaaa tgtccaattg ctaagtatgg 1260 gaaaaacact tcaaacgccg cggagtcaat gaattcagcc atcaaaaaat tcattaagca 1320 agatataaca aaccttatta tatctctgaa taactattcc atgaaaatat ttagccaaag 1380 aagagaaaga actataggga attcgatttt tgctaaacaa ttagctagcg ttgaaactaa 1440 cactactatt gggcggttat atacgattgc agaatctaac gcaaatgtat ttctagtgga 1500 aagagaattt atagtagatt ttagttctaa aaaatgcagt tgcaatagat cttttgagtt 1560 cggatatcca tgttcgcatt tatgtgcagt aatcgtatat ctaagacaag atcctaaagc 1620 atttgttgag atatacttta catcttctaa ttaccatggc tcgtacgtag attcaattgc 1680 gcctctttcg gtcattagct tgcaaagaga taacactcta ccaccggaag cgagaagatc 1740 tcgtggaaga ccgagagttt caagaatacg gtcagcgaga gagaattaaa tattcgcttt 1800 taactttgta ttgcttttta tgtctaatat attttttcta ggcaaagttt ttttatgatg 1860 ctatttttta aattttattt ttttaagatg ccatttttta aattagcgaa attgatatgc 1920 tattttttaa atttatcaaa ttgatatgcc attttttaaa ttaccgacaa aaaatatgct 1980 a 1981 // ID L2-8_AAe repbase; DNA; INV; 5015 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2 non-LTR retrotransposon from Aedes aegypti. XX KW L2; Non-LTR Retrotransposon; Transposable Element; L2-8_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5015 RA Kojima K.K. and Jurka J.; RT "L2 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1404-1404 (2011). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 795..1979 FT /product="L2-8_AAe_1p" FT /translation="MDACKACARNFVRSDRSVLCSGSCGRVFHPGCVGLNA FT TNFKSWTANVGLLWFCENCRVNFNPSIMDREAIIMKTLRDLLLRLDSMDLR FT VGQYGENLKVLNSLLTXSTSISRESNFSSRLQQSGMSGPTTYDPTDFHKTI FT DRFNLDLSLRSNVANNSIVDANEGLDASLDNDDDCNDHQSGINSTIATTAT FT TTVSAVAAAVVAGAAASATPCHVTTAATTNSADTNVVSSYASIVSRPSVAV FT DTATNRNSISNNNMTHAIATHSVASTSKNVIVQDAVQDCRLRVVNRNRTLR FT QRQPTSDVTMKSFYVTPFDIEQTEEDVIEYLRETMNVDNSTLNCVKLVPRN FT RNVNELTFVSFKLSVSEDLVSVISDPFYWPEGVEVREFQPKNGSFPNRVVS FT I" FT CDS 1934..4897 FT /product="L2-8_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="ISAKKREFSQPRSLNLINVSNSNTNVPVLSEPSLNAA FT LLAPSLPESIYESKSKLHYFDYQGLKCLLKKHSTNFISIGVHNARSLICNV FT ENYRELIGNSNLSVLAIVETWLKPSHTNKSVELEGYTLTRSDRNSKTKSRG FT GGVAFYLKKNVKFSVIAKPENDSEIDYLFIKLKHVHLVCGVVYKPPDVNDS FT KLEIVFKLLSEISSIEPNVMLMGDLNINLEAIKTSKTLNFFENLNALSFKL FT LPTYPTCHKSDSSSTIDLFSGNCTHNVNNVYQSSAGGISDHDFICLDYKFK FT SSKAKSEVYYTREYHKIDTDLFISDLRGVAFDQVYNCTNVNEKLRCFTKLF FT VSILNSHAPLRRKLVTNPTSPWINGNISRLFKNRSDAYECWKRDKSDSQKW FT KNFTRLRNLTNREVERTKKEFFASKLSADLPAKQLWKNIKRLGLKQTNTTI FT GGGDVCADSLNTFFVSHNVPISFNPNNNSSELEPNFSFRCVNNDEVLKEFI FT SASCDAIGVDNIPLKILKMSLCVTLPYITDIFNFCIITCEFPIDWKVAKVI FT PIGKIENPVSVNDYRPISILPALSKVFESILTNQLNEYLSQENLLCPLQSG FT YRKTCSTVTALIKIENDIKEALDKKMITIMALLDFSKAFDTIDHRLLCKKL FT KNIFLIDNLSINLLRSYLQNRVQYVEFNFQQSETINVESGVPQGSILGPIL FT FSMYINDLPSVLSYCKFHLYADDCQLYISGNHKNISDIVNLINMDIRRILI FT WCEQNGLQLNAKKTQTIIFRTKRTVLTDIPLIRVNDNLINYTDVVKNLGLL FT MDCNLNWNAQVNSLCKKVYNALHSLVILRHCTPLHIRVHLARSLLVPLFDY FT GDVLFGLVSKKNLNKLNLVFNAVTRYAYNLKKYDHISNFRKAILGCEFTTH FT LKFRMCIQTYKILNNPPPYLQNLFTYARSSRVSLLSVPHCSTNYLKESFRH FT RAVKIWNELPRDCRTELGFFSFRRKTKIYLNSV" XX SQ Sequence 5015 BP; 1517 A; 974 C; 939 G; 1583 T; 2 other; acaacaaaca accgagttca caagcaacgt tttccgaggt gttaaattat cgacgatttt 60 ccgtgctatt ttcaatctag taattgaacc gaacgtggtt wactttgaag tattacctgc 120 taaatagaat acgactaaag gtcaagctgt tccggcactt accggagtgc aaaagtgatt 180 tctattttct gtaattcgct ggaggatcca attaggaaac aaaagattgt accgtctcag 240 tcgattgtga taacgttgga tgcgttatat caacacattc acaacgtcag tgattcagtc 300 actcagttgt ttagtactga ttccaccgtt tgttttgctc agtgttttca cacgcttctt 360 caggagtggt gctgctctgt tggatagagt agcgtatctg ggtgcgttcg gtggtctcaa 420 aagccgaaac actttaatcg aggcgcacgc tctggttcga tatcggcttt ggtgagaaca 480 catttggggt acgtttgcac gctgctgagc ggtaccgtct agttggatag aggagtgtgt 540 ctgggtgcgt ttgatagtct caaaaatcaa aacatcttca ttgacgcgca tgatctggtt 600 caactgcggt tttggcagat gtactttaaa gagcgagacg cttggtttgc tttgggaaca 660 cgctgtgcct gcttgaacac tgctttatag tgtctttgtt tttatatcgt tttcagaata 720 ttacttaaat cgcactgctt aagggaagtc cattttccta actgttttct tctagtgtta 780 gaaatttagg aatcatggat gcatgtaagg cttgcgctag gaattttgtg aggtccgata 840 gaagtgtttt gtgcagtggg tcgtgcggaa gagttttcca ccccggttgt gtcggcttaa 900 acgcgacaaa ctttaagtca tggactgcca atgtcggttt gctttggttt tgtgaaaact 960 gccgagtaaa tttcaacccc agcatcatgg accgggaggc aatcattatg aaaacattgc 1020 gtgatctcct tttacgactc gactcaatgg atcttagagt gggtcaatat ggagaaaacc 1080 tgaaggtgct caacagcttg ctcacaatst ccacatcaat cagtcgcgaa tcaaactttt 1140 catcaagact gcaacaatct ggaatgtctg ggcctaccac ctacgatcca acagattttc 1200 acaaaaccat cgatcgtttc aatctcgacc tttcgctccg atcaaatgtt gccaataata 1260 gcattgtcga tgcaaacgaa ggacttgatg cttctttgga taatgatgac gactgcaacg 1320 atcatcaaag tggtatcaac tccacaattg ctaccaccgc tactaccacc gtttctgctg 1380 ttgctgctgc tgttgttgcc ggtgctgctg cttctgccac tccctgtcat gtcaccaccg 1440 ctgcgacgac taactccgct gataccaacg ttgtctccag ttatgccagt attgtgtcaa 1500 gaccctctgt tgctgttgat accgctacca ataggaactc gatctccaac aacaatatga 1560 cgcatgccat cgccacgcac agtgtcgcat cgacatcgaa aaacgtcatc gttcaggatg 1620 ccgtacagga ttgtcgttta agagttgtaa accgaaaccg aactctgcgc cagcgccaac 1680 cgacgagtga tgttacgatg aaatcgtttt atgtaactcc attcgatatt gaacagactg 1740 aagaagatgt tatcgaatat ttacgcgaaa ctatgaacgt tgataattcc actcttaatt 1800 gtgttaaact tgtgccgcgg aacagaaatg ttaatgaact tacgttcgta tcgtttaaat 1860 tgtcggtttc agaagatctt gtatcagtaa tcagtgatcc attttattgg cctgaaggcg 1920 ttgaagttcg tgaatttcag ccaaaaaacg ggagttttcc caaccgcgta gtctcaattt 1980 gataaatgta agtaactcta acaccaatgt accagttctg tctgaaccct ccttgaatgc 2040 tgctcttttg gctccctctc tacctgaatc aatttatgaa tcaaagagca agttgcatta 2100 ctttgattat cagggtttga aatgtttact caaaaagcat tcaacaaatt tcatcagtat 2160 tggggttcac aatgctcgga gtctgatttg taatgttgag aattatcgtg aattgatagg 2220 aaattcaaat ttaagtgttt tggcaattgt tgaaacctgg cttaaaccat ctcacactaa 2280 taaatcagtg gaacttgaag gttacacact gacaagatct gatcgaaata gcaaaacaaa 2340 atctagaggt ggaggtgttg cgttctattt gaaaaagaat gttaaattct cagtcattgc 2400 caaacctgaa aatgatagtg aaattgatta tttattcata aaattaaagc acgttcattt 2460 ggtttgtgga gttgtttaca aacctcctga tgtgaatgac tctaagttgg aaatagtatt 2520 taagcttctg tctgaaattt cctccattga gcctaatgta atgttgatgg gagatctgaa 2580 cataaacctg gaagcgatca aaacatctaa aactttgaat ttctttgaaa atttgaatgc 2640 tttgtctttt aagttgctcc cgacgtaccc tacatgtcac aaatctgatt cttcatcgac 2700 tattgatctt ttttccggca attgcacaca caacgtgaat aatgtttatc aatcgtcggc 2760 agggggaatc agtgaccacg attttatctg tcttgattat aaattcaaaa gttcaaaagc 2820 aaaatcggaa gtttattata cacgagaata ccacaaaata gatactgatt tgttcatatc 2880 agatctccgt ggtgttgcat ttgaccaagt gtataattgc accaacgtta atgaaaaact 2940 ccgttgtttt accaaactat ttgtttcaat tcttaattcg cacgctcctc ttagaagaaa 3000 attagttacg aatccgacta gtccttggat caatggtaac ataagccgac tattcaaaaa 3060 tcgatcagac gcatatgaat gctggaagcg ggataaaagt gattctcaaa aatggaaaaa 3120 tttcacacgc cttcgcaatc ttaccaatcg tgaagttgaa cgcaccaaaa aagaattttt 3180 cgcctcaaag cttagtgctg atcttcccgc gaaacagctt tggaaaaata tcaaacgact 3240 ggggttgaaa caaacaaaca ctactattgg aggtggtgat gtatgtgccg attcattgaa 3300 cacatttttc gtatcccata atgttcctat ctcgttcaat ccgaataaca attcttctga 3360 actcgaacca aacttttcct tccgatgcgt aaacaatgat gaagtcctta aggagttcat 3420 ttccgcttca tgtgatgcta ttggtgttga taacattccg ttaaaaattc tgaagatgtc 3480 cctctgcgtt acattacctt acattacaga tattttcaac ttttgcatta tcacttgtga 3540 gtttcctatt gactggaaag tggctaaagt cataccaatt ggtaagattg aaaatcctgt 3600 atcagtaaat gactaccgtc ccataagcat tcttcctgct ctctctaaag tttttgaatc 3660 gattcttacg aaccagctca acgagtactt aagccaggaa aatctgttat gtcctttgca 3720 atcaggttat aggaaaacat gcagtactgt gactgccttg ataaaaattg aaaatgacat 3780 taaagaagct ctagacaaaa aaatgattac cataatggcc cttctggatt ttagtaaggc 3840 atttgacacc atagatcatc gactattatg caaaaagctg aaaaatattt tcctgataga 3900 taatctttcc attaatctgt taaggtccta cttacagaat cgtgtccagt atgttgaatt 3960 taacttccaa caatctgaaa cgattaatgt agaaagtgga gtccctcaag ggtcaattct 4020 tggtccgata cttttttcaa tgtatataaa cgatttgcct tctgtactat cctactgtaa 4080 atttcatttg tatgcagacg attgtcaatt atacatttca ggaaatcata aaaatatttc 4140 tgacattgta aacctcatta acatggatat acgtcgcatc ttgatatggt gtgagcaaaa 4200 cggattacaa ctaaatgcaa aaaagacgca aactataatt ttccgcacca agcgaacagt 4260 tctaacagat attccactga ttagagttaa tgataatttg attaactata cagatgtggt 4320 aaaaaacctg ggtcttctta tggattgtaa tttaaactgg aacgcacaag ttaattcctt 4380 gtgcaaaaaa gtttataatg ctttgcactc tctagttatt ttgagacatt gcactccatt 4440 gcatataaga gttcatcttg cgagatcact tttagtgccc ttgttcgact atggtgatgt 4500 tctatttggt cttgtatcta agaaaaattt aaataaacta aatctagttt ttaatgcggt 4560 caccagatat gcatacaacc tgaaaaaata cgatcatata tcaaacttta ggaaggcaat 4620 acttggatgt gaattcacaa ctcatctaaa gttcagaatg tgcattcaaa cttacaaaat 4680 attgaataat cctccacctt atttacaaaa cttatttact tatgctcgat catcacgtgt 4740 atctttactg tcagtccctc attgttccac caattatttg aaagaatctt ttcgtcatcg 4800 cgctgtaaaa atttggaatg aattgccaag agattgtaga actgaattag gattcttttc 4860 atttagacga aaaactaaaa tttacttgaa ttctgtttaa tattacattg gtattagagg 4920 ttacttatat taaaatacag gcatatactt tgaaaatcaa tagatgatta tttgtgctac 4980 tgtagtttga ataaataaat aaataaataa ataaa 5015 // ID Transib3_AA repbase; DNA; INV; 933 BP. XX AC CC120136; XX DT 13-JUN-2005 (Rel. 10.05, Created) DT 13-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Transib3_AA is a DNA transposon, a partial fossilized copy. XX KW Transib; DNA transposon; Transposable Element; KW Interspersed repeat; DDE-class; TRANSIB superfamily; KW Transib3_AAp transposase; Transib3_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-933 RA Kapitonov V.V. and Jurka J.; RT "RAG1 core and V(D)J recombination signal sequences were derived RT from Transib transposons."; RL PLoS Biol 3(6), (2005). XX DR GenBank; CC120136; Positions 933 1. XX CC Transib3_AA belongs to the Transib superfamily of DNA CC transposons. CC The consensus sequence is not complete; termini are not known. CC Transib3_AA encodes remnants of the Transib3_AAp transposase. CC The transposase is not perfectly recovered due to available CC sequence CC data. CC Conceptually translated Transib3_AAp transposase: CC NFHVAMKSTPVKFKSYCFETAELYVSLYPWYYMPTALHKLLIHGHKIVERSVLPVGRMSEEAQERSNKTI CC KSNRNNFSRKNSSENTNRDIVNRMLIISDPLINTDLKPPRNHHKDLPEDVLQLLKNENDVDNELDDSDDA CC SFGTQSDCED. XX SQ Sequence 933 BP; 297 A; 167 C; 207 G; 262 T; 0 other; gcaattttca tgtggccatg aaatcgaccc cggtgaagtt caaatcttac tgttttgaaa 60 ctgcggaact atatgtgtct ctttacccgt ggtattacat gccaactgct ttgcacaagc 120 tcctaattca tggccacaaa attgttgagc gatctgttct tcccgtcgga agaatgtccg 180 aagaagcgca agagcgaagt aacaagacga taaaatcaaa tcgcaacaat ttttcgagga 240 aaaatagtag tgagaacacc aatcgtgata tagtgaaccg aatgttgata atatccgatc 300 cactgatcaa cacagacctg aaaccacccc gtaatcatca caaggaccta ccagaagatg 360 tactgcagtt attgaaaaat gagaatgatg ttgacaacga attagacgac tcagatgatg 420 cttcatttgg gacacaatca gattgcgaag attaggttca gaaacaaaag ttttgttctt 480 aatggcataa cattttgaat tggataaatt aattttagtt ttgaaattgt tcatttgaat 540 taaataaata taagtcagta aaacacatga actctattct gatcaccagt aaaatacaca 600 tgacagtaat cgaaatgaca atcgcaatgc ctgaatacgt caagaaatac ttaaaatatt 660 tataatatgt aataatagag cacaatttat cattttaaaa aagtttgagg ttttgtccgg 720 tatttgttct agggcggacc actgtgcaat ggcgtggtgg cgagtgggcg atggcggcag 780 tgaattggag gcgagtggca gttgtggaag atgaggcacg ttggcttgct gtgcggccaa 840 ccagggatgc gcactgttga ctgttgtcga ttagtgggtt tccttttgga acacttctgg 900 caggatggag atccggagat ctcttcatcc tta 933 // ID CR1-87_AAe repbase; DNA; INV; 4457 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-87_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4457 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1175-1175 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 142..1074 FT /product="CR1-87_AAe_1p" FT /translation="MLATRAVIVTTPDPEEGGTLRSGKRRKIANSETDRSS FT ASRLPSKNPPCSITPQPKTKTVIKMEQTVSFKPKQAQSAESTKADLCSKLD FT PVTFSVTNVHMKVTGEVTVRCETKDHAEKLSKTASEMFADKYVVEIQRPLK FT PRVKVIGFSKEVSEEEFLPKLMKQNRGLECFSMKVVRIIKNETRSSNQVSA FT ILETDARGFEELIDRQRLYIGWERCRVIEATDVLRCFNCSEYGHKAVSCTK FT TACCPKCAGDHQVSECQSDFAKCINCHLMNTNRTSPYDKLLDVSHSSWSLD FT CPIFAKRLNTARQRIDFSA" FT CDS 1078..4377 FT /product="CR1-87_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QLDETAQLYASRLNPSSMDATNSLQRENLAGPGNYKN FT SICPVEARKDSTLVVSNLQLFQSSLISLPTLAPEDVGHPDVLLLPGYYPTA FT MDLICDAQPNELGGPGNAMNSICPSEAPRDTAPALRGSQNFQNRTCYSSEE FT ENNNNFSIYYQNVRGLRTKIDDFYIMTQNGDYDVIVLTETWLNNEINSTQL FT FGSEYTVYRKDRDPNISGKLRGGGVLVAVRRGFSSSCNYVGIDDDLEHIFV FT NIDGGVRTVCVGVVYIPPDQSRNSDVIERHIRSISSVSYSLKLHDLHLLFG FT DFNLPDVSWNFASTGYAYPEASESISSSLNFLDGMSLLNMKQLNVTRNCLN FT RILDLFFVNDEALPMCTLMEPHEALVPVDPMHPPLLTSLQCTAPIKFSDQD FT VVRMFDFAKTDFISLNEAMREIDWSTLSLATDVNVAVSIFNHTLLQLLPXF FT VPASRPRQSPRWANNHLRHLKRKRSKALRKFSSNRCAVNKRRFNEASRRYK FT KYSRELYSRYVNRTQSDLKNHPKRFWSFVNEKRKETGLPSTMFLGDEFADT FT TMDICNLFAKHFSSAFDDGTTTAQQIENALLDVPTDIININIDRFDEADIL FT AGIAKLKPSSAPGPDGIPSIIFKKCADTLSGPLKTLFDLSIAQSVFPTDWK FT RSIMFPAFKKGDKRNVTNYRGVTSLSAGSKLFEILVNDRIFRAVKSYISQD FT QHGFFAGRSTTTNLLQFSSFCINTIEKGSQIDTIYTDLKSAFDKVNHDILL FT AKLERLGMARRLVDWLRSYLANRQLRVKLGSTESDAFNSSTGVPQGSNLGP FT VLFIIFFNDVCLTLPSGCKLVYADDLKLYLTIESYRDCQKLQKLLDVFVDW FT CRRNHLTVSIGKCSVISFTRKKHPINWTYYIDNHPLDRTNAVKDLGVLLDT FT ALNFKDHYSQIISKANRNLGFIFRISNEFRDPYCLRSLYYSLVRSVLESSA FT VVWCPYHNVWIERIEKVQSKFIKYALRFLPWRNRDQLPPYEDRCRLLGMDT FT LERRRKISQGHFVAKILLGEIDASRILAQVNINTGQRILRQRSFLRLPYHR FT TEYGQHDPIRTMCEAFNNFFTMFDFNISPSLFRERLTRSTFMF" XX SQ Sequence 4457 BP; 1275 A; 1012 C; 953 G; 1216 T; 1 other; cacagttcaa ccccaatttc cctctatgca actactgaat cgtcaagtgg aattgcaccg 60 tcgtttgcgc aagtagttag ctctaagggc ggcgctctca ctaaccgggc aaaacgtaaa 120 gctccagcag tctctgacgg catgttagca acccgcgctg ttatcgttac cacacctgat 180 cctgaggagg gcggcaccct tcgttcaggg aaacgccgca aaattgccaa cagtgaaaca 240 gacagatcat ctgcctctcg actaccatct aagaaccctc cttgctccat cacaccgcag 300 ccaaagacta agacagtcat taaaatggaa caaacagtct catttaaacc gaagcaagcg 360 cagtccgctg agtctacgaa agccgatctt tgttccaaat tggatcccgt gactttttca 420 gtaacaaatg ttcatatgaa ggtaactggt gaagttacag tgaggtgtga gaccaaagac 480 catgctgaga agttgtcaaa aacggcctct gagatgttcg ctgacaagta tgtcgtggaa 540 attcaaagac ccttgaagcc aagagttaag gtcatcggat tttccaagga agtgagcgag 600 gaagagtttc ttcccaaatt gatgaagcaa aatcgtggtc tggaatgctt tagtatgaaa 660 gtagtgcgaa tcatcaaaaa cgaaacgcga agtagcaatc aagtgtcagc aattctagaa 720 actgacgccc gcggatttga agaactcatc gaccgacaac gcctttacat cggatgggag 780 cgatgcaggg ttatcgaagc taccgacgta ttgcggtgtt ttaactgctc cgagtatggc 840 cataaagctg tttcttgtac aaaaactgct tgctgtccca aatgtgccgg tgatcaccag 900 gtcagcgagt gtcaaagtga tttcgcaaaa tgcattaatt gtcatctaat gaacacaaac 960 cgcacctccc cctacgataa gctgctcgac gtttctcact cgtcatggag tctagactgt 1020 ccgatctttg cgaagcgcct aaatactgca cgacaaagga ttgacttctc ggcatagcaa 1080 ttagacgaaa ctgctcagct gtacgcatcc cgactcaacc cttcttcgat ggatgctacg 1140 aattccctcc agcgtgaaaa cctggctggg ccaggtaact ataagaatag tatatgtcct 1200 gtcgaggcta ggaaggattc tacacttgta gtcagcaatc tccaactatt tcagtcgtcc 1260 ttgatctcat tgccaacatt ggcccctgaa gatgttggcc atcccgacgt actgctacta 1320 cctggatact atccgacggc gatggatttg atatgtgatg cccaacctaa cgaactaggt 1380 ggcccaggta atgccatgaa cagtatatgt ccttccgaag caccaagaga tactgcacca 1440 gccttaagag gatcccaaaa ttttcagaac cgaacgtgtt actcaagcga ggaggaaaac 1500 aacaacaact tcagcattta ttatcagaat gtcagaggtt tacggacgaa gatcgacgat 1560 ttttatatta tgacgcaaaa cggagactat gacgtaattg ttctaacgga aacctggctg 1620 aataacgaaa tcaactctac tcagcttttt ggcagtgaat acacggtata taggaaagat 1680 cgtgatccaa acatttccgg aaaattacga ggaggcggag ttttagtagc tgttaggaga 1740 ggttttagtt catcctgcaa ttacgtagga attgatgacg accttgaaca cattttcgtc 1800 aacatagatg gtggggtgcg tactgtgtgt gtaggtgtcg tatacatccc gcctgaccaa 1860 tctagaaatt ctgacgtcat cgaacgacat atcaggtcta tctcgtcggt atcttattca 1920 ctcaagttgc atgatttgca ccttctcttc ggggatttca atctaccgga tgtgagctgg 1980 aactttgcgt caacgggata cgcctatcct gaggcttcgg aatccatctc ttccagtctg 2040 aatttccttg acggcatgtc attgctcaat atgaaacagc tcaatgtcac aaggaactgc 2100 ttgaatcgga tactcgattt gtttttcgtt aacgatgaag ccttgccgat gtgtaccctt 2160 atggaaccac acgaagcact agttccagtt gacccaatgc atcctcctct gctaacatcc 2220 ttgcaatgca ccgctccaat caagttcagc gatcaagatg tcgttcggat gttcgatttt 2280 gcgaaaacgg atttcatttc attaaatgag gccatgcgag aaatcgactg gtcaacattg 2340 agcctcgcca ccgatgttaa cgtagctgtt tctattttca atcatactct gctgcaactg 2400 ttgccawcat ttgttcctgc ttctcgcccc cgtcagagtc ctcgatgggc caataatcat 2460 ctgcgacatc tcaagcgaaa gcgctccaag gcattgagaa aattctcttc caatcggtgt 2520 gctgttaata agagaagatt caatgaagcc agccgaagat acaaaaaata tagtcgtgaa 2580 ttgtacagca ggtatgtgaa ccgcacacag agtgatttga agaaccatcc taaacgtttc 2640 tggtcatttg ttaatgagaa aaggaaagaa actgggttac catccaccat gttccttggc 2700 gatgaatttg cggatactac tatggacatc tgtaatctgt tcgcaaagca cttttccagt 2760 gctttcgatg atggaacgac aacagcccaa cagatcgaaa atgctctact cgatgtacca 2820 actgatatca tcaacattaa tatcgataga tttgacgaag cagatatcct agctggaatt 2880 gcgaagctga aaccatcatc agctcctggt ccagacggaa tcccgtcaat tatatttaaa 2940 aaatgcgcag atactctaag tggtcctctt aaaacattat tcgatttatc gattgcacag 3000 tcagtgtttc caacagactg gaaaagatcg attatgttcc ctgccttcaa aaagggagat 3060 aagcgcaacg ttacaaacta ccgtggggta acatcgctaa gtgcgggatc aaaattgttt 3120 gagatcctgg ttaatgatcg tattttccga gcagttaagt cgtatatttc gcaagatcag 3180 catggatttt ttgccggacg ttcaaccacc acgaatttgc tacagttttc gtcgttctgc 3240 attaacacaa tagaaaaagg ttcccaaatc gacactatat acactgattt aaaatcggct 3300 tttgataaag tgaaccatga tatcttgttg gcaaaattag agcgactagg catggctcgc 3360 aggctcgtgg attggctaag atcttactta gccaatagac agttacgcgt taagctgggc 3420 tccacagaat ctgatgcctt taacagtagc actggggtcc ctcaaggcag caatctgggc 3480 ccagtactat tcatcatatt tttcaacgat gtctgcttaa cgctaccctc cggatgcaaa 3540 ttagtttacg ccgatgatct caaattatac ttgacaattg aatcataccg agattgtcag 3600 aaattacaaa agttgctcga tgttttcgtc gactggtgca gacggaatca tctgaccgtg 3660 agtataggaa aatgctctgt aatatcgttt acgaggaaga aacatcccat taactggacc 3720 tactacattg acaaccaccc actggacaga actaatgcgg taaaggatct tggagtattg 3780 ctcgatacag cgcttaattt caaggaccac tacagccaga tcatttccaa agccaaccgc 3840 aatttgggct ttattttcag aatttccaac gagttccgcg acccatattg cttgcgttcg 3900 ttgtactata gcttagttcg ctccgttctt gaatcctcag ctgttgtgtg gtgcccatat 3960 cacaatgttt ggatcgagcg gattgaaaaa gtgcagtcca aatttattaa gtatgcactt 4020 cgatttctac catggcggaa tcgggatcag ctaccaccct acgaggacag gtgccgactt 4080 cttggaatgg acacccttga aagacgtcgg aagatatccc aagggcactt tgttgcaaag 4140 attctcttgg gtgaaattga tgcttcgcgg attttagctc aggtcaacat aaatactgga 4200 cagagaatac tacgacagcg gagtttcctg aggttgcctt accatcggac agaatacgga 4260 cagcatgatc ctattaggac tatgtgtgaa gcttttaata atttttttac catgtttgat 4320 tttaatattt ctcctagttt gttccgtgaa cgtttgacta gatcaacttt tatgttttga 4380 aactagcttt taatctttgt tagtatgttt ttttttttca tgtagaccaa tgagtccgat 4440 gaattgtacc aaataaa 4457 // ID hAT-20_SM repbase; DNA; INV; 3573 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-20_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3573 RA Jurka J., Bao W. and Tempel S.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 69-69 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 649..3381 FT /product="hAT-20_SM_1p" FT /translation="MAESAGDTNKRKSVTKLSGSSRRKLAKEASLKADASK FT CIKLTSFITKQSKLPVVQSAEVSAKNEHVFDDQHPSTHSSSCTEPAETETS FT SEVLVEDEQVLSAVDEQHPAIHSCAENTETVTLPAQGLKHNQGDSEFDTVE FT KQLASDILQEPLLPQSDFPTDPGLYINTSISSKLIRKLFEIGPCQPGLNEP FT FQFPTDELGRKFQISWYTKCFGKGSMKEERNWLVYSPTNQKMHCQACWLFA FT DCKTENYSKEWSDPNFGVYNWKKGMEKIVKHETSNQHQSAICQYLLAKYRI FT SNDKTVISGLISQERYQVEKNREVLKRMVDATFFLAKQGLPFRGHREHSGS FT DRCSNTGNFRELLFLLAKYDSTLDNHLKFEKKNELYLCHDVQNDLIQSLAS FT EISANIDKEVMSAQFFSLIVDSTIDISRVDQMSVSLRLVLKSGQVVERFIG FT FYTLENSNAAAFSNIILTELQKRNIDITLCRGQAYDGASVMSGIKNGLQTK FT IKTLSPNAIFVHCCSHALNLVTIAAMSLNSDVQLFFGTTEKLYTFLTSSLP FT RLHILEEHQKSRYESTVDTLKRLSDTRWASRKHAVDSVVGSFSAIISTLED FT INFGESKEHKGSVRAEAMGLSILITKYSFVFLMLFLQKLLDNIFVLSNYLQ FT RKDIDIAFAKQLIYVARNKFADMRSDKAFESLNVAVKSFIIKNCSLLDVET FT EFKEKRIAKKKRMAGELNRDERVDDPTTRFKCETYFTVLDTLVIQLDERFN FT DFHNTVALFSCLDPSQISEENKESFQNLCDIYKNDINIEEAILEYDTFKYV FT YASIRPLLSCELQLKEVLPFLVEKQMAPGLPNLAILYKIYLTLPVTSATAE FT RSFSRLKIIKNYLRSTMTNERLSGLALISIERELAENIDFESTINRFASMK FT SRRKQFK" XX SQ Sequence 3573 BP; 1210 A; 619 C; 640 G; 1104 T; 0 other; cagtggcgga tttagctata ggcgtggggg ggcgggtcgc ccctccacgc ccctgttgaa 60 tcctaattgg cgcccccaaa gataattttt atgaatgtgt acaagcatta agattaatcc 120 gatcactgac actgacaatt taatatatct ttgatttgga agatctaaat ggattgaaat 180 gtattctact atcactgcag catacagctg ctcagagtca ccgtttatag attattatta 240 ttattattat tgatagtaca tattactctg ctatgttgtt aaacatagtt gatgcagcac 300 agcagcaggc aaattcttca gtagtgcaga ttactgctta cttagcagtg ctgagactgc 360 cactagccag acggcctgac ctttctctta ttacagtttt gtctgtgtac aaagttcaac 420 tggatttaag caactccagc tgttatgctg catcatatta ataatattac atagctccag 480 tgaagagcga cagtttgaag tttttaagct tgtatagtgt ttgtacataa atctaaccat 540 aataattatt gtcaatttgg acttttttga caaacaaatt attgcgttac tattgaaata 600 ttaaaataat attcaatatt gaattcaaat aaaatttaaa ttagagccat ggctgagtca 660 gctggtgaca ctaataaaag aaaatcagtt accaaattaa gtggatcatc taggcgtaaa 720 ttggcaaagg aagccagtct caaggcagat gctagcaaat gtattaagtt aacatcattt 780 ataactaaac agtctaaact gccagtagtt caatcagctg aagtttctgc taaaaatgag 840 catgtatttg atgatcaaca cccatctacg cattcgagtt cctgtactga gcctgctgag 900 acagaaacat catctgaagt tcttgttgaa gatgagcaag tactatctgc agttgatgaa 960 caacacccag ctatacattc atgtgcagag aatactgaga cggtaacatt acctgctcag 1020 ggcttgaaac ataatcaggg tgattcagaa tttgatacag tggaaaaaca acttgcttca 1080 gatattttac aagaaccact tcttcctcaa tccgacttcc ctacagatcc aggactctac 1140 attaatacga gcataagctc gaaactgata cgaaaattgt ttgaaatcgg accatgtcaa 1200 cctggactca atgaaccatt tcaatttccg acagatgaat tggggcgcaa gtttcaaatt 1260 tcatggtaca caaaatgttt tggaaaaggt agtatgaagg aggaaagaaa ttggcttgtc 1320 tattcaccaa caaatcaaaa aatgcattgc caagcttgct ggttattcgc tgattgtaaa 1380 actgaaaatt atagcaagga atggtccgac ccaaattttg gagtttataa ctggaaaaaa 1440 ggcatggaaa aaatcgttaa acatgaaact tccaatcaac atcaaagtgc aatatgccaa 1500 tatttgctgg ctaaatatcg catttcaaat gataaaactg tgatatctgg tttgatcagt 1560 caggagcgtt atcaagtgga aaaaaataga gaagttttga aacgaatggt agatgcaacg 1620 ttttttttag caaaacaagg tttgccattt agaggacaca gagaacattc tggctctgac 1680 aggtgtagta atactggaaa ttttcgtgaa ttgctatttt tattggctaa atatgattcc 1740 actttggaca atcatctaaa gtttgaaaag aaaaatgaac tttatttatg tcatgatgta 1800 caaaatgacc taattcaatc acttgcctct gaaatatccg caaatattga taaggaagta 1860 atgtctgcac aattcttttc tctaattgtt gattcaacaa ttgatatatc cagagttgat 1920 cagatgtcag tttctttaag attggtgtta aaatcaggac aagttgttga acgatttata 1980 ggcttctaca cgttggaaaa cagtaatgct gcagcatttt ctaatataat attaacggag 2040 ttacaaaagc gaaacattga cattacactt tgtagaggac aggcatatga tggtgcctct 2100 gtgatgtctg ggattaaaaa tggtcttcaa actaagataa agacactttc accaaatgcc 2160 atctttgttc actgttgttc tcatgcattg aatttggtga ctattgcagc aatgtctctt 2220 aacagtgatg ttcagttgtt ttttggtaca acagaaaagt tgtatacttt ccttacatca 2280 agtctaccac gactacatat tctcgaagaa catcagaagt ctcgatatga atcgacagta 2340 gatactctaa aacgactttc tgatacaaga tgggcaagcc gaaagcatgc agttgattca 2400 gttgttggat cattctctgc tattatatct actctagaag atataaattt tggggaatca 2460 aaagaacaca aaggaagtgt cagagcagaa gctatgggac tatctatttt gataacaaaa 2520 tattcttttg tttttctgat gctatttcta caaaagctgt tagataatat ctttgtgtta 2580 tcaaactatt tgcaacgaaa ggacattgat attgcgtttg caaaacaatt gatttatgtt 2640 gccagaaata agtttgctga tatgagaagt gataaagctt ttgaaagtct aaatgttgct 2700 gtaaaatcat ttattatcaa aaattgttcc ttactagatg ttgaaacaga atttaaagaa 2760 aagcgaattg caaaaaagaa acgcatggca ggtgaactta acagagatga aagagttgat 2820 gatccaacta cacgcttcaa atgtgaaacg tatttcactg tcttggatac acttgtaata 2880 caactagatg aacgttttaa tgattttcat aacactgttg ctttattctc ctgtcttgat 2940 ccatcacaaa tatcagaaga aaataaagaa tcttttcaaa atttatgcga catttacaaa 3000 aacgatatta atattgaaga agcaatccta gaatatgata ctttcaaata cgtgtatgca 3060 tcaatacgcc cattactatc ttgtgaacta caacttaaag aggtgctgcc tttcttggtt 3120 gaaaaacaga tggcgccagg acttcccaat ttggcaattc tctataaaat atacttgact 3180 cttcctgtta catctgctac tgctgaaaga agtttcagta gactgaaaat catcaaaaat 3240 tatttgcgat caacaatgac aaacgaacgt ctgtctggat tagcattaat ttcaatagaa 3300 cgcgaactgg ctgagaacat tgactttgaa tcgactataa atcgttttgc ttcaatgaaa 3360 tcacgcagga agcaatttaa ataataaata aagtggtatg tttgttacat caataaattt 3420 actagaataa atttactgaa tctaatgctt tataataata ttggtattca ctcaatactc 3480 gggaaaatga gtagtaccat cttcataatt tgtggcgccc cctgaaaatt ttcctcgccc 3540 ccccccacgc accaaaatct aaatccgcca ctg 3573 // ID BEL-208_AA-I repbase; DNA; INV; 5528 BP. XX AC AAGE02026623; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-208_AA_; KW BEL-208_AA-LTR; BEL-208_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5528 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02026623; Positions 33548 39075. XX CC Positions [4562-5146] - Integrase core CC 'TCAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 236..5281 FT /product="BEL-208_AA-I_1p" FT /translation="MPPKRTPVKNMEGGGGTAAAGGEDEVEEFDALVHMRG FT LAKGKLTRVKNIMEQIVQEEITLPQIKVHQKKIEAAYKEFSDYHERIMAVC FT PPSKRRDQDCKYLEFETLYDDISLALETWIEMLNQTQRNQQPLVVQPSLPR FT AIPHFDGKYEQWEKFKVMFRDVVDRSNEPPRIKLYHLEKALIGDAAGIIDA FT KTISDGNYDHAWEILAERFEDQRRMVDHHISGLLNMKKIAQESHSELRELV FT DSCVGHVENLKYLGQEFTGVSDQIVTHLLANALDDETKKLWESSITHGELP FT EYDETVKFLKGRISVLERCQKSTNDSKVKQSARGSSKGAQVSGKQPSNVKV FT NVVSSSNSCEFCAGDHLAFKCSVFNGLSVEDRLKMVREKLVCFNCLRRGHR FT GAECKSNKTCSKCKRKHHTLLHAEGGSVQSSGSSNATPTEQQQAPRQVTQP FT ENVSVAGSRPPTTVASNVVDKPMTQVLLLTAMVNILDKHGNPQACRALLDS FT GSQVNFISEAMLEKLKVDREEINIPITGVNSVRSSVRQRAKVEVLSKNNGF FT RLNLDCLVSPKVTGNLPTFEVDVASWNIPTGIQLADPVFFRPGQVDLLIGM FT EWYDDVIKPGRLKLSEDLPTLQDSQFGWLVGGKCASAFSGISVTSHAATPS FT NDSLNELMQKFWEVESVASEHCEKSDAEKCELHFQSTFRRSEDGRYIVQFP FT LRESVAQLGDSKPMALRRFHALEAKLQKNPDIKKQYDEFMDEYEQLGHCKE FT VQEADDPPGLQKWYLPHHAVLKPSNSTTKCRVVFDASAKVGGMALNDAMMI FT GPTIQPDLLTIILKFRMFRYVLSADIAKMYRQVLKDKCHTPLQRIFYRKHP FT KEPLRVLELQTVTYGTASAPFLAIRALYQLAIDEEDRYPRAAELVKKMFYV FT DNVLFGGNNMEEVRSLQRELIELLSRGGFHLHKWAANNEQLLDSIPLEDRD FT KLVKIEDSGANEVIKTLGLMWDPNGDDFLFRAQYDCDNPAPTKRQVLSVIA FT RLFDPLGLLSPVIVLAKVLMQNLWKHKMDWDDKLDGDLLKDWKYFLKALPL FT AEDFRVPRQVISSDAASIDIHGFADASNTAYGACVYLRSVNHDGSASLKLV FT TSKSKVCSITPMSIPRKELMAALLLHRLVKKVINAMELEHVPVTLWSDSQV FT VIAWLNKPLDSLQIFVRNRVAEIKGDNDQRVWKYVRSANNPADIVSRGMPA FT TALSTNNLWWNGPTFLCDAAYEEVVPEALDDEAIPELKPLKSVNVATIIEE FT LPILTKFESFRKTQRIIAYVLRFINNCRKSGERITAMKPMVPELREATKRI FT IQAIQRTELQDEIDRLAAGETCKRIGNLNPFLEDGLLRVGGRIRHSKIPYE FT AKHQWIIPNKHPVTRRIIAAVHRENLHAGPSSVMAALRERYWILNAKSTIR FT SVTRNCITCFKSNPKLAEQFMGDLPSYRITAAPTFLRVGVDFAGPIYIKQT FT ARKAAPVKGYICVFVCMVSKAMHLEVVENLSTEAFLAALQRFVSRRGVPEE FT VYSDNGTNFMGARSELHELYELFKSDLTNGKLSQFCQVKEIKWTTIPPNAP FT HFGGLWEAGVKSVKSVLKKVYRSASLTIMEFATLLCQIEAILNSRPLFAHS FT SDPNDPNVLTPGHFIINRPLTKPTIKVAAHSTSSTAFLETLVERVFGRASM FT SFKMDEEARQCSP" XX SQ Sequence 5528 BP; 1605 A; 1199 C; 1382 G; 1342 T; 0 other; tgttttggtc cgatcgaacc ggatagctcg gtttatgtgc atcgcttgct ggaacgtaat 60 gcggctggtt ggaccgtgga gtggctggct cggcgacaaa acacacacac gccgcgaagt 120 gaaaagtgga aaagcaaaac gtaaatcaaa aactgagtga acattggaca tttgaaaaaa 180 gtgaagtgat taactgctaa agaacattgg aataaaaagt gattagtgaa gaaacatgcc 240 gccgaagcgt acacctgtga agaatatgga aggtggtggt ggcacggccg ccgccggtgg 300 cgaagatgaa gtggaagaat tcgacgcgct ggtccacatg cgtggattgg caaaaggaaa 360 gctcacgcgt gtgaaaaaca tcatggagca aatcgtgcaa gaggaaatta cgctgccaca 420 aatcaaggtg caccagaaaa aaatcgaagc tgcctacaag gagttttcgg actaccatga 480 acgcatcatg gcggtctgcc caccaagtaa gcgaagggat caagactgta agtaccttga 540 atttgaaaca ttgtacgacg atatttccct cgctctcgaa acatggatcg agatgctgaa 600 tcaaacgcaa cgcaatcagc aaccgcttgt ggtgcaaccg tcgcttcctc gtgccattcc 660 gcatttcgat ggcaaatacg agcagtggga aaaattcaaa gttatgttta gggacgtggt 720 ggacagatct aatgagccac cccgaattaa gctctaccat ttggaaaagg cactcattgg 780 ggacgctgct ggcataattg atgcaaaaac catctccgat ggcaattatg accatgcatg 840 ggaaatttta gcagaaaggt ttgaagacca gagacgaatg gttgatcatc acatctctgg 900 attactaaac atgaagaaaa tagcgcaaga aagccactct gagttacgag agttagttga 960 ttcatgtgtt ggccatgtcg aaaatttgaa atatttgggt caagaattta ctggggtttc 1020 agatcaaatc gtcacacatc tcctcgcaaa tgcgcttgat gacgaaacga aaaagttatg 1080 ggaatcttct atcactcatg gggaattacc ggaatatgac gaaactgtca aatttctgaa 1140 aggtcgcatt tccgtactcg aacgttgtca aaagtccact aatgatagca aggtcaagca 1200 atccgcgcgt ggttcaagca aaggagctca agtgtccggt aagcagccat cgaacgtgaa 1260 agtgaatgtg gtgtcatcgt cgaacagttg tgagttttgt gcaggagatc atctggcgtt 1320 caagtgctct gtgttcaatg gattatcagt cgaagatcga ttgaaaatgg tgcgtgaaaa 1380 actggtgtgc ttcaactgtt tgcgacgtgg tcatcgtggt gcagaatgca agtcgaacaa 1440 gacgtgttca aagtgtaaga gaaagcacca tacgttactc catgcagaag gtggttcagt 1500 gcaaagttcc ggttcaagta atgccacgcc gacagagcag caacaagcac cgagacaagt 1560 gactcagcct gaaaacgtaa gtgtagctgg tagccgccct cctactaccg tagcttcgaa 1620 tgtggtcgac aaaccaatga cgcaggtgct gttgttgact gcaatggtta acattttgga 1680 taagcatgga aacccgcaag catgccgtgc tcttttggac agcggatcgc aggtcaactt 1740 tatttcggag gcgatgttgg agaaactgaa ggtggatcgg gaagaaatca atattcctat 1800 caccggagtc aacagcgtca gatcaagtgt tcgtcagcgg gcaaaggttg aagttctttc 1860 gaagaataat ggcttcaggc ttaacttgga ttgcttggtg tcgccgaagg taacgggaaa 1920 tttacccacc tttgaggttg atgtcgcttc atggaatatc ccaaccggta ttcagctagc 1980 agatccagtg ttctttcgcc cagggcaagt tgatttgctg attggaatgg aatggtatga 2040 cgacgttatc aagcctggtc gtctaaaact atctgaagac cttcccacac ttcaggattc 2100 gcagtttggc tggttggtcg gtggaaagtg tgctagtgcg tttagtggaa tcagtgtaac 2160 ttctcatgca gcaacaccgt caaatgattc gctgaatgaa ctgatgcaga agttttggga 2220 agttgaaagc gtggccagcg aacattgtga gaagtccgat gctgaaaagt gtgaactgca 2280 tttccagtct acgtttcgtc ggagtgagga tggtcgttac atcgtacagt ttccattaag 2340 ggaatctgtt gctcagcttg gagattcgaa accaatggcc cttagaagat tccatgctct 2400 ggaagcgaaa ctgcagaaga accccgacat caagaaacaa tatgatgagt tcatggacga 2460 atacgagcag cttggacatt gcaaagaggt tcaagaagcg gatgatcctc ctggtttgca 2520 gaaatggtat ttaccgcatc acgcggtgct caagccgtcg aattcgacta ccaaatgtcg 2580 tgtagtgttc gacgcgtcag cgaaggttgg cggaatggcc ttgaacgatg cgatgatgat 2640 tggccctaca attcaacctg atttgctgac gatcattttg aagtttcgca tgtttcgata 2700 tgtgctaagc gccgacattg ctaaaatgta tcggcaagtg ctgaaagaca aatgccatac 2760 tccgctacag cgcatattct acaggaaaca tcccaaagag ccgttacgag ttttggaact 2820 tcagacggta acatacggga cagcgtcagc cccgtttctg gctattcgcg cgctctacca 2880 actggccatt gacgaggaag atagatatcc acgagcagcg gaattggtga agaaaatgtt 2940 ctatgtggac aacgtgctat ttggtggtaa caacatggaa gaagtgagaa gcctccaacg 3000 agaattgatc gagcttctga gccgcggggg tttccaccta cacaaatggg cagcaaacaa 3060 cgagcaactc ttggactcta ttccattgga ggatcgagat aaattggtga agattgaaga 3120 ttctggtgca aacgaggtaa tcaagaccct tggattgatg tgggatccaa atggagatga 3180 ttttctgttc cgagcacaat atgactgcga caaccctgct ccaacgaagc gacaagttct 3240 gtctgtgatt gcaagactat ttgatccact aggcttgcta tcgcctgtca tcgtccttgc 3300 caaggtcttg atgcagaacc tatggaagca taagatggac tgggacgata aactggatgg 3360 agacctactc aaggattgga aatacttctt gaaagcactg ccacttgccg aagacttcag 3420 agttcctcga caagtaatat caagtgacgc tgcaagcatt gacatccacg gatttgctga 3480 tgcatcgaac acggcttacg gagcatgtgt ctacctacga tcggtgaatc acgatggcag 3540 tgcgagtttg aaactggtta cgagcaagtc aaaagtgtgt tccattacgc ccatgtctat 3600 tccgcggaag gagctgatgg cagccctact tctgcatcgc ttagtaaaga aggtgatcaa 3660 cgcaatggag ttagaacatg tcccagttac tctctggtct gacagccagg tggttattgc 3720 gtggttaaat aagccgttgg attctttgca aatattcgta agaaatcgag ttgccgaaat 3780 caaaggtgac aacgatcaac gtgtttggaa gtacgtccga tctgcaaata atcccgcaga 3840 cattgtgtct cgtggtatgc cagctacagc tctttcaaca aacaacctgt ggtggaacgg 3900 tcctactttt ctgtgcgatg cagcatatga agaagttgtt cctgaagcgt tggacgatga 3960 agcaatccct gaattgaagc cactgaaatc tgtcaacgta gcaactataa ttgaagagct 4020 tccgatcttg accaaatttg aatcatttag gaagacgcag aggatcattg cctacgtttt 4080 gaggttcatc aacaactgcc ggaagagtgg tgaacgaatc actgcaatga agccaatggt 4140 tcctgagtta agagaagcta caaaacgtat catccaagcc atccaaagaa ccgaacttca 4200 agatgaaatc gatcgccttg cagccggcga aacatgtaag cgaatcggca atctgaaccc 4260 cttcttggag gacggattac ttcgagttgg aggtagaatt cgtcacagca aaattccata 4320 cgaagcgaag caccaatgga tcatcccgaa caagcaccct gtaacaagaa gaataattgc 4380 tgctgtacac agggagaact tacatgcagg accgagtagc gtgatggcgg ccttacgaga 4440 acgctattgg attttgaacg ccaaatcaac gatacgaagt gtaaccagaa actgcatcac 4500 ctgctttaag tcgaacccga agcttgcaga acaattcatg ggagatttgc cctcgtatcg 4560 catcaccgct gcacctacgt tcttgagagt tggagtagat tttgctggac ccatctacat 4620 taagcaaacc gctaggaaag cagctccagt gaaaggctac atttgtgtat ttgtctgtat 4680 ggtatcgaag gcaatgcact tggaggtggt ggaaaaccta tccactgagg catttttggc 4740 tgcactacaa cgattcgttt cgagaagagg agttccagag gaggtttaca gcgacaatgg 4800 aacgaatttc atgggcgcga gatccgagct ccacgagtta tacgagctct tcaagtctga 4860 cctaaccaat ggaaagctat cacaattctg tcaagtcaag gagattaagt ggaccacgat 4920 tccccctaac gcacctcatt ttggaggatt atgggaggca ggtgttaaga gtgttaaatc 4980 tgtgctcaag aaagtctatc gttccgcctc tctgacaatt atggaatttg cgaccttact 5040 ttgccaaatt gaggccattc tcaattcgag accccttttt gcgcattcct ctgaccccaa 5100 tgacccaaat gttctgaccc caggccattt cattatcaat agacctttga ctaaaccgac 5160 tatcaaagtg gcagcacatt caacttcttc gacagcattt ttggaaacgc tggtcgaaag 5220 agtatttggt agagcttcaa tgtcgttcaa aatggacgaa gaagcacgtc aatgtagtcc 5280 ctaacaccgt cgtgctattg aaagaagaca acgtgccacc ccaacagtgg aaattgggta 5340 aaatcgtcaa cacttaccca ggtcccgatg atcttactcg agtcgttgat gtccgtgtcg 5400 gtagcagtat cttcaagcgg ccaatccaca aattggcgcc cctacctaca ctagagggag 5460 atccagcaca ttcatcagaa gcgaatccca aggaatcaac agtgtcactc ctttcccggg 5520 tggcagca 5528 // ID Chapaev-2_HM repbase; DNA; INV; 3501 BP. XX AC . XX DT 26-FEB-2008 (Rel. 13.02, Created) DT 26-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Autonomous Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3501 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Chapaev transposons from the hydra genome."; RL Repbase Reports 8(2), 28-28 (2008). XX DR [1] (Consensus) XX CC Chapaev-2_HM is a young family of autonomous Chapaev DNA CC transposons that were active in the hydra genome less than a few CC million years ago (they are ~1% divergent from their consensus CC sequence). The consensus sequence was obtained based on a CC multiple alignment of 17 copies; it codes for a 679-aa Chapaev CC transposase (two exons). Chapaev-2_HM is characterized by 4-bp CC target site duplications and 216-bp terminal inverted repeats. CC Based on the TPase identities, Chapaev-2_HM forms a distinctive CC group together with Chapaev-1_HM and Chapaev-5_HM. The N-terminal CC portion of this group TPase contains a Chapa-like zinc finger CC (H-X7-C-X2-C-X35-C-X2-C-x36/38-C-X2-C) but is free of the RING CC finger motif. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(773..1415,1523..2916) FT /product="Chapaev-2_HMp" FT /note="Transposase." FT /translation="MPNKAKNHEDCRKSVCFLCMRKGDRELTDFIIGRIHK FT LLKTDIDFEDERVPQAICNTCRTLLQKRDAGDMSVSLPAIYNFSAVVVKPL FT TRTSGPCDCLICQIAKLKLNQKHPLAPKEQTRNLDQGESSKPGPSPNSRLK FT SSEKRCTQCLSILARGFRHSCTIGTRHQNLQALVQSDPVGAEQIASAIISS FT KDASPGGTVRLSQARGGQLFPLTPGPASAHKSTSSTLTMQSLLNVQLNTGL FT SNTGMRQLASTLNKSTETRLVEPFFQQKFAAVGKSLNNFFQLSVVDISAEK FT GKTEEKRAIVHCKNLEDLTNKVLLSREYGYNHFVKLGIDGGGSFLKMSLSV FT IKVTDEDDETRSPQPKSLRLLTSSTARDTGVKRQLIVAIAENVSETYENVK FT VMMNIIQLNDVSFCISCDLKMANIICGIQSHSSKHPCCWCNIDSDNLSECG FT TSRSFGSMKEKFEAFVAAGSDTKSAKDFDNVIHPPLISMDDSTLILDVIPP FT MELHLLLGIVNHLYKNLSDLWPGAKEWPTLLHIQLQPYHGGHFAGNECHKL FT LQNLDILQMLAEKACAYQAFGFIETLRHFKEVVTSCFGTTLQEDYKEKIQK FT FKTSFLFLPISVTPKVHAVFYHVSEFVEKHQTSLGIFSEQATEAMHSRFKF FT HWERYKRIPTHPDYGKQLFNCVVDYNSKHL" XX SQ Sequence 3501 BP; 1241 A; 615 C; 612 G; 1033 T; 0 other; cacaccctgt aaatacccat gataacatca cccgaaagga ccgaacaaat ccttttaggt 60 gggcattata tagaccttgt gtgaacattt agagccattt ttgaaatctt ttaactacgt 120 ccgagccatt tttttcattt taaagttata tagtgaaaat aaccaaaaat ccaatatttt 180 ccaaaaaaat aaacagctta aactcataat ttacagtgac tttttaagct gttggtatta 240 ggagatcatt aaacaataac tttaaggcat ttcaaacatt caaaatgtca aataattatt 300 ggaaatttaa aaaaaaatga atttcaatag aattatatta ctttgtaatt aaaaagtaaa 360 acagagtaag tgtaaaaagt aaaacagagt aagtgtaaaa agtaaaacag agtaagtgta 420 aaaagtaaaa cagagtaagt gtaaaaagta aaacagagta agtgtaaaaa gtaaaacaga 480 gtaagtgtaa aaagtaaaac agagtaagtg taaaaagtaa aacagagtaa gtgtaaaaag 540 taaaacagag taagtgtaaa aagtaaaaca gagtaagtgt aaaaagtaaa acagagtaag 600 tgtaaaaagc acaataagat taatcttatt gtgtacttta ctcttactct gtttaatcga 660 ttgtgtacac aaaaagatta tcatgaaatt tatgaaataa aaaaacattt ataactattt 720 ttccttgctg catgcagaac atttttttaa atggagacat aaagcttcta aaatgccaaa 780 taaagcaaaa aaccatgaag actgcaggaa gtctgtttgt tttctttgca tgaggaaagg 840 agaccgagag ttaactgact ttatcattgg aagaattcac aaattgttga aaacggacat 900 tgactttgaa gatgaaaggg tcccacaggc aatttgcaac acttgccgaa ctctcttgca 960 aaaaagagat gctggggaca tgagtgtttc acttccggca atctacaact tctctgcagt 1020 tgttgtcaag cctttgacac gaacatcagg tccatgtgac tgcctcattt gccagatagc 1080 aaagctcaag ctcaaccaga aacatccctt ggcaccaaaa gaacagacca gaaaccttga 1140 ccaaggtgaa tcttccaagc ctggcccttc accaaattct cgcttaaagt catcagaaaa 1200 aagatgcaca cagtgtttgt ccattcttgc aagaggtttc agacactcat gcactatcgg 1260 aacaagacac caaaacttgc aagcattggt ccagtcagat cctgttggtg ctgagcaaat 1320 tgcatcagcc atcatcagtt caaaagatgc atctccggga ggaactgttc gccttagtca 1380 agcaagagga gggcagttat ttccattaac accaggtaat aatgaacttc agtgtgaatt 1440 actttttttc ataactgttg ttgttgtaat catttacaag taatcaaaat aagttgttaa 1500 atttatctca tttgcttttt aggcccagca tctgcacaca aatcaaccag ttcaacactt 1560 acaatgcaaa gcttgttaaa tgttcagttg aacacaggac tatccaacac aggaatgcga 1620 caacttgcct caacacttaa caaatcaaca gaaactcgac ttgttgagcc atttttccag 1680 caaaagtttg ctgctgtagg aaagtcgttg aacaattttt ttcaactctc agttgttgac 1740 atttcagcag aaaaaggaaa aactgaagag aagagagcaa tcgttcactg caaaaatctt 1800 gaggacttaa ccaacaaggt gctgctgtca agggaatacg gctacaatca ttttgtcaag 1860 cttggcattg atggaggagg atcattcttg aaaatgagct tatcagttat caaggtaact 1920 gatgaagatg acgaaacaag aagtcctcaa ccaaaaagtc ttcgcttgct gacaagctca 1980 actgccagag acacaggtgt caagcgtcaa ctcatagttg ccatcgctga aaatgtttct 2040 gaaacatacg agaacgttaa agtgatgatg aacatcattc aattaaatga tgtttcattc 2100 tgcatttcat gcgatctcaa aatggccaac atcatttgtg gaattcaatc ccattcaagc 2160 aaacaccctt gttgctggtg caacattgat tctgacaatc tgtcagaatg tggaacttcc 2220 agaagttttg gatcaatgaa ggaaaaattt gaagcctttg ttgcagctgg atcagatacg 2280 aaatcagcaa aagattttga caatgttatc catccaccac tgattagcat ggatgactca 2340 acactcattt tggatgtgat tccaccaatg gaactccatt tacttcttgg aattgtcaat 2400 catctttaca agaatctttc tgatttatgg cctggtgcca aagaatggcc aactttgttg 2460 cacattcagc tccagccata ccatggcgga cactttgctg gcaatgaatg tcataagctt 2520 ttgcaaaacc ttgacattct ccaaatgctg gcagaaaaag cttgtgccta tcaagcattt 2580 ggtttcattg aaactctcag acatttcaaa gaagttgtta cttcatgctt tggaacaact 2640 ctgcaggaag actataaaga aaagattcag aagtttaaaa cttctttcct attcttgcca 2700 atctccgtga ctcctaaggt acatgctgtt ttctaccatg tttcagaatt tgtagaaaaa 2760 catcaaacct ccctcggaat cttcagcgag caagcaaccg aagcaatgca ttcaaggttc 2820 aagtttcatt gggagcggta caagcgcatc ccaactcatc cagattatgg gaaacaactc 2880 ttcaattgtg ttgttgacta taacagcaag catctgtaac aaacaacacc tccaaactca 2940 acttttttca gaaatatatg tcttggaata aaaaatcaga aattgcacat tttagaattt 3000 tttttaagaa attaaaaaac tgtttttgaa cttttgtatt ttataaagaa agcttttagt 3060 tctagtctct aacttgcaat atgtaagaat ctgtttattt tttatcataa atattgttca 3120 gatgaagatg attgtgaaaa tatactatat tgaattttac actttgtgtt tattttacaa 3180 tagagcttta aaataagaaa aatggctctg acaaaatttt gaatgtttga aatgatttaa 3240 agttattgta tagtgatctc taaataccaa cagcttaaaa aatcactgta aattatgagt 3300 ttaagcagtt tatttttttg gaaaatattg gatttttggt tattttcact atataacttt 3360 aaaatgaaaa aaatggctcg gacgtagtta aaagatttca aaaatggctc taaatgttca 3420 cacaaggtct atataatgcc cacctaaaag gatttgttcg gtcctttcgg gtgatgttat 3480 catgggtatt tacagggtgt g 3501 // ID I-4_AC repbase; DNA; INV; 6189 BP. XX AC . XX DT 27-JUL-2009 (Rel. 14.07, Created) DT 27-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE A family of Nimb non-LTR retrotransposons from a sea slug - DE consensus sequence. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; I group; KW I-4_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-6189 RA Kapitonov V.V. and Jurka J.; RT "Nimb - a novel clade of animal non-LTR retrotransposons."; RL Repbase Reports 9(7), 1539-1539 (2009). XX DR [1] (Consensus) XX CC Nimb is novel clade of I-like non-LTR retrotransposons. It CC includes families of retrotransposons present in fish, molluscs, CC sea squirts, sea urchins and insects: I-1_DR, I-3_DR, I-5_DR, CC nimbus, I-3_AC, I-4_AC, I-1_CI, I-1_SP, I-1_AA, I-1_BM. I-1_CI is CC a family of tunicate Nimb non-LTR retrotransposon. The consensus CC sequence was derived from multiple alignment of 14 copies ~98% CC identical to each other. The 3' terminus is composed of the CC (AATC)n microsatellite. XX FH Key Location/Qualifiers FT CDS 652..2208 FT /product="I-4_AC_1p" FT /note="ORF1." FT /translation="MDFRAVSPCWTRWRVGSPGDEIKDKIHMDSTKNISPT FT PIPNKSQTGEAGATPGLGKRKKGLSGEDADNVSIWDIPKRTSLGWSPFLVI FT SPKDEKQDTNLTTLSVFKIHKSLKHFGIYKPKYIKNLGSTDLLIQVSSSIE FT SGKLLACTSFGGIPVCVQPHRSLNTSKGVIKSRELYGCTEEEMVQEVDGIV FT HARRIKIRRDGKEICTNTWILTFNTPTPPTKLDIAYLELEVRPYIPKPMRC FT FNCQRFGHTQLRCRRGAACPRCGKEGHSGETCSADPWCPNCRQGGHSASST FT ECPRWCQEKAILTHRAQYGGTFAQARATLYPKGSTPKNTTKTYSEVLKTVP FT GSVNPTRKVLQTNRKQNTCKNQHSSLETEEDMESTSSTPISPSLHTPSPHT FT PSSSPPSQPSSSSSSPSSSSPSSFSSSPPSSSPTTSSPSSSSHKSPHPTPS FT LLQPPSHHPSSNKAPPPTPRTRSPSSTRIQDSGPPPPPHSPSKKAAVHRDR FT SKTKQRQRSPTTKKKCIEPVPSK" FT CDS 2218..5889 FT /product="I-4_AC_2p" FT /note="ORF2: AP endonuclease, RT, RNase H." FT /translation="MAVLQWNIRGLRANQAELSYLLTSHTPNVICLQETKL FT PTPTFTLKNYQTYNQIYTDNQIASGGNSILINNQTLHRQIQLNTTLQAVAV FT RVTLHRPITICSIYIPPNYPLSLRELINLRDQLPQPFIMLGDFNAHSPLWG FT NEPTDTKGKIIEDFLIKTSTSIFNDNSPTFQNSNNLNVSSIDLTLCDPDLI FT PDFQWSVLDDPHGSDHFPITLIPDIPHRTPIPDRFNLKRADWDGFHVECLE FT KLNPSQTQTQTFERFHETLLEIVQKHIPKLSTKPRRNKCWFTEDCQKAVAA FT KKSALRKFVNSKTNEDLINFKIARAQARRKIRTAKKTSFRNYVSKINNNTP FT MHKVWKMIKKLKGTKKDTIKHILKPNNSFVETEKTVADCIASTLSGNSSTH FT NYNPEFKKVKMSSEKTTVDFTSSNEEHYNCSFSLNELKCNIAELSNTAPGP FT DQIHNEIIKRLPDETLLLLLDIYNELWHSQQFPNSWRLATVIPIPKPGKDH FT SNPSNYRPIALTNCFCKLMEKLINKRLMWFLETSNSLSNFQCGFRKNRSTI FT DHLVRLETFIREAFIRKEHLVAIFFDLEKAFDTTWKHGILKDLHDLGLRGN FT LPEFIRNFLKDRSFQVKVGSTLSNLHTQEEGVPQGSILSPILFEIKINSIT FT KTLQNNIDCSLYVDDFLICYKSKSKMDHIERQLQLQLHKLETWANGHGFKF FT STVKTNCVHFCLSHKCVRQPDLYLFGEKIPVKKQARFLGVIFDNKLSFLPH FT LKDLKTRCQSALNALKVLSNPEWGGDTEILLQLYRSLVRSRLDYASPVYGS FT ARKSYLKILDPVQNQGLRLALGAFRTSPVESLQAEAHEPSLELRRKKLSLQ FT YALKLSSVPQNPAHKCVFGLPEDLKTLSSLGNKVSPFGIRIQNDLKDIDFS FT EDIIEPFAMSSIPPWHLTKPNIDTSLEKHNKKSTDKTTYLNSYKQLINRYP FT EAEKLFTDGSKNDSSVGAAALASPHSLSLRKLHPEASVYTAEATALEMALN FT IIDQSTKSTFLILSDSLSCLKSLERSDTSDNRILKLIIKLHFLHQSGKTIT FT IAWLPSHVGIDGNEAVDQFAKESLKLPENKNSKIPFSDMKPKVKEHITSSW FT SQSWNQQIHNKLHYIQPKLTPRKPFSLPRREKVIFTRLKIGHTHVTHKHLL FT EGEEAPFCVACNTNFTVRHILMECVDFNYVRDKYFKCKDIKTLFNIIEPRR FT ILSFIKEIGLFNKM" XX SQ Sequence 6189 BP; 1956 A; 1637 C; 1065 G; 1531 T; 0 other; acccccagtg ggtcgggggg atataaatag acccactgca ccgccagcag gaggtgcaac 60 cccttcactg gcgatttgtc gcccatggac atctacgctc aggtgcacat gggcgacact 120 tgctttggtg tatcgtctgt cttctatcgt ctttcttcct tcttctttcg tccttccccg 180 agctgaccgg ggaataaaat aaaaatagtt ggctgtccca tcaacagggg gtgtaacccc 240 cccgctggtg ggatgccact cgtacgtcac tagctctctg acgtaggggt ggcatctttt 300 gtccgctctt gctatgtctt ctatccatct atctttcttt ccgttctgtc gaccgaaggt 360 ccggtccagc aaagaataga caccaaaccc ccgcaacgcg aggcaccccg ttgaccgtgg 420 ggtgcgaccc gtacgaccgg agaaggggac cctggtggct gaggtgtggc tttcctcacc 480 aacacgccac tttggccctg gattcggtct agacgggcgg gtgcagtggc ctactgcact 540 caatcggttg cgaccggttc atagaaccgg ttttggccaa tcttggttag aaggcaagat 600 ttcttgccgt ctgattggga gtatacctcc agggtagccc tgacaccagc tatggacttc 660 cgagcggtgt ccccttgttg gactcggtgg cgggtgggga gccctggaga tgaaataaaa 720 gacaaaatac acatggattc aaccaaaaac atttctccca ccccaattcc caacaagtcc 780 cagacaggag aagcaggtgc aactcctggc ctgggcaaaa gaaagaaggg cctttcgggc 840 gaagatgctg acaatgtcag catctgggac atacccaaga gaacatcgtt ggggtggtca 900 ccatttttgg tgatctcccc aaaagatgaa aaacaagaca caaatctgac tacattaagc 960 gtcttcaaaa ttcacaaatc actcaaacat ttcggtattt acaagccaaa atatataaaa 1020 aatctaggat ccaccgacct cctgatacaa gtcagttcca gcattgaatc aggcaaactg 1080 ttggcctgca cctcgtttgg tggcatccct gtatgtgttc agcctcacag gagtctgaac 1140 acatccaagg gtgtcatcaa aagccgagag ctctacggct gtaccgaaga agaaatggtt 1200 caggaggtcg atggaattgt ccatgctcgt cgtatcaaaa tacggcggga tggcaaggaa 1260 atctgtacca acacgtggat ccttaccttc aacacaccca caccaccaac aaaattagac 1320 atcgcatact tagaactcga agttagacca tatataccga aaccgatgcg ctgtttcaac 1380 tgtcagcgct tcggacacac acagctacgc tgtcgtcgtg gcgcagcgtg tcctcgctgc 1440 ggcaaggagg gacactctgg ggagacctgc tctgcagatc cctggtgccc aaattgtcga 1500 caggggggtc actctgcatc ttcgacggag tgtccgaggt ggtgtcagga gaaagccatc 1560 ctcacacata gggcccagta cggcggaacg ttcgctcaag caagagcaac gctataccca 1620 aaaggatcga cgccaaagaa tacaacaaaa acatattcag aagtcctgaa gactgtccca 1680 gggtcagtaa acccaacccg aaaggtcctt cagacaaata gaaagcagaa cacctgcaaa 1740 aatcaacact ccagtctgga gacggaggag gacatggagt ccacttcctc cactcctatc 1800 tctccctctc tccatacgcc atctccccac accccctctt catcaccacc ctctcagcca 1860 tcttcttcat cctcatcccc ttcatcatcc tcaccctcat cattctcatc atcaccccca 1920 tcatcttcac ccacaacatc ctccccatca tcatcctcac acaaatcacc acatcctaca 1980 ccctcattgc ttcaacctcc atcccatcat ccttcatcca ataaagcccc tccacccact 2040 ccccgcaccc gctccccctc ctctacccgc atccaggact ccggtcctcc tccccctcca 2100 cactctccct caaaaaaggc tgcggtccac cgtgaccgca gcaaaacaaa acaaagacaa 2160 aggtcaccaa cgaccaaaaa aaaatgtatc gaacccgtgc cttcaaaata aattaaaatg 2220 gcagttcttc agtggaacat tcgcggtttg cgtgcaaacc aggcagaact ctcttatctt 2280 ttgacctcac acaccccaaa tgtaatctgt ctacaagaaa ctaaattgcc aacacccaca 2340 ttcacattaa aaaactatca aacttacaac caaatataca ccgacaacca aatcgcctct 2400 gggggcaact ccattttaat aaataaccaa acactacaca gacaaataca actaaacaca 2460 acacttcaag cagttgcggt cagggtcacc ctacaccgtc ctataactat ctgttcaata 2520 tacatccccc caaattatcc tctatcttta agagaactaa taaacctcag agaccagctc 2580 ccacaaccct tcatcatgtt gggagatttt aatgcccata gccctctttg gggaaatgaa 2640 ccaactgata ctaaaggtaa aatcatagaa gatttcttaa taaaaaccag tactagtatt 2700 tttaacgaca attccccaac atttcaaaat tcaaataacc taaacgtttc atccatagat 2760 ttaacccttt gtgatcctga cctaattcca gactttcagt ggtcggtcct ggacgaccca 2820 cacggatcgg accacttccc catcacccta attccagaca ttccacaccg aactcctatc 2880 ccagaccgtt tcaatcttaa aagagcggac tgggatgggt ttcacgtgga atgtctggag 2940 aaactcaatc catctcaaac acaaacacaa acttttgaac gatttcatga aacccttctc 3000 gagatcgttc aaaaacacat accaaaatta tccaccaaac cccgaagaaa taaatgttgg 3060 ttcactgaag actgccaaaa agctgttgcg gccaaaaaat cagctctcag gaagttcgtc 3120 aactctaaaa ctaacgaaga cctcattaat ttcaaaattg cgcgagccca agcacgcaga 3180 aaaatcagaa cagccaagaa aacatctttt cgaaattatg tctcaaaaat caacaacaac 3240 acaccaatgc acaaagtctg gaagatgata aagaaattaa aaggtactaa aaaagacaca 3300 atcaaacaca tccttaaacc gaataactcc ttcgtagaaa ccgaaaaaac agttgctgac 3360 tgtatagcat caactctctc cggaaattcc tctacccaca attacaaccc tgaattcaaa 3420 aaagtaaaaa tgtcttctga aaagacaact gttgatttta cctcttcaaa tgaagaacac 3480 tataattgca gcttttctct taacgaatta aaatgtaata ttgcagaact atcaaacaca 3540 gctccaggcc ctgatcaaat acacaacgaa atcatcaaac gtcttccaga cgaaacactc 3600 ctattactcc ttgatattta caatgaactc tggcattctc aacaatttcc gaacagttgg 3660 cgtcttgcca ctgttatacc aatcccaaaa ccaggaaaag atcactctaa cccttcaaac 3720 taccgtccca tagcccttac aaactgtttc tgtaaactta tggaaaaact tattaacaaa 3780 cgtctaatgt ggtttctcga aacctcaaac tccttatcca attttcaatg cggttttcgc 3840 aaaaatcgct ctactattga ccatctcgtc cgacttgaaa ctttcattcg ggaagctttc 3900 attcgaaaag agcaccttgt ggctattttt ttcgacttag aaaaagcatt cgacaccaca 3960 tggaaacatg gtattctgaa agacctgcat gacttgggct tacgcggaaa tcttccagaa 4020 tttatacgaa actttttgaa agatcgatca tttcaagtaa aagtaggttc aacattatca 4080 aatttacaca cccaagaaga aggcgtcccc cagggaagca tcttatcgcc cattcttttc 4140 gaaatcaaaa taaattccat cacaaaaacc cttcaaaaca atattgactg ctcactttat 4200 gttgacgact tcctgatttg ctataaatcc aaatccaaaa tggatcacat tgaacgtcag 4260 cttcagcttc aactccataa actggaaact tgggcaaacg ggcacgggtt caaattttcc 4320 acagtaaaaa caaactgtgt acatttttgt ctgtcgcaca agtgtgtgcg acaaccagat 4380 ttatatttgt ttggagagaa aattcctgtt aaaaaacaag ctcgtttcct gggcgtaatc 4440 ttcgacaata aactttcttt cctaccacac ttgaaagacc taaagacgag gtgccagagt 4500 gctctcaacg ccttaaaagt attatccaat cctgaatggg gaggagatac tgaaatactc 4560 cttcaacttt accgttctct tgtccgttcc aggctggact atgccagtcc tgtttacgga 4620 tcagccagaa agtcttacct taaaatttta gaccctgttc agaaccaagg ccttcgcctt 4680 gccctaggcg ctttccgtac ttcacctgta gaaagtttac aggcagaagc gcatgaacca 4740 tctttggaac ttagaaggaa aaaactctct ttgcagtacg ctttgaaact tagctcagta 4800 ccccaaaacc cagcccacaa atgtgtcttt ggccttcccg aagacctaaa aactctatcc 4860 tccttaggaa ataaagtaag ccctttcgga ataagaatcc agaacgatct caaagacatt 4920 gatttttctg aagacataat cgagcccttt gccatgtcca gcatacctcc atggcattta 4980 accaaaccaa acatagatac tagtcttgaa aaacacaaca aaaagtccac agacaaaact 5040 acatacttaa actcatacaa acagctgata aatagatacc ctgaagcaga gaaactcttt 5100 acagacggat ccaaaaatga ctcctcagta ggagctgctg cccttgcatc gccacacagc 5160 ttaagcttaa ggaaactcca tccagaagct tcagtctaca ctgctgaggc tacagctctt 5220 gaaatggccc taaatattat agatcaatca accaagtcta ccttcctaat cctttcagac 5280 tctctctctt gtctgaaatc tttagagcgc tctgacacct ccgacaacag aattttgaaa 5340 cttatcatta aactgcattt tctacatcag tctggaaaaa ctattacaat agcttggctt 5400 ccaagccatg taggcattga tggcaacgaa gccgttgacc aatttgccaa agaatctcta 5460 aaactacctg aaaacaaaaa tagtaaaatc cctttctctg acatgaaacc gaaagtaaaa 5520 gaacacatca catcctcctg gtcccaaagt tggaatcaac aaatacacaa caaactccat 5580 tacattcaac cgaaacttac gcccaggaaa ccattttctc taccacgaag agaaaaagta 5640 atttttaccc gactaaaaat aggacacacg cacgtcacgc acaaacattt gctcgaagga 5700 gaggaagctc ctttttgcgt ggcgtgcaac acaaatttca cagttcggca catcctaatg 5760 gaatgtgtcg actttaatta tgtccgtgat aaatatttta aatgtaaaga tattaagacc 5820 ctttttaaca tcatagaacc tagaagaatt ttaagtttta taaaagaaat tggactgttt 5880 aataaaatgt gaaaatatat agttgtcgtt gtcggtgtat atatacatac ggtgtatata 5940 tacatacact gagatttgta ttgtatatac aaatgtaaat aagccttttc catatttaca 6000 aaaaaatttt ttataaatgt ttgttcgtct ttttagcata atagaaatag tgtgatcgtt 6060 tgcggttgcg agctccgaaa gactctttat taaaaacacg tctgattttg tagcggtgaa 6120 acctgatggc gcttaaatga ccaaaattgt gtcgatagtg ccgtaaaaca tctacccaat 6180 caatcaatc 6189 // ID CR1-65_AAe repbase; DNA; INV; 3202 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-65_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3202 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1153-1153 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >95% CC identity. This consensus is likely 5'-truncated. XX FH Key Location/Qualifiers FT CDS 2..3109 FT /product="CR1-65_AAe_1p" FT /note="endonuclease and reverse transcriptase." FT /translation="NDHLSANMHRTLSHSGRTTSSTMEAPTPPSTVESLQP FT ALISRPDPVVGRGGGVFQPVLSGKYNKITDSLLPDISPPSSLRIYYQNVRG FT LRSKIDSFFLAISELDYDVIVLSETWLDDCILSSQLFGSNYSVFRTDRSAL FT NSDKGRGGGVLIAVSSRISCSVDSAPVSNQLEQLWVKIVLPRNNVSIGVIY FT LPPDKKNDIDTIQRHIESIGAVHSHLHENDMAMLFGDYNLSDVHWTSDAND FT NVFVDLTRTRLNPASSTLLDGFCFHGLSQVNTIVNSGGRTLDLVLLNDVVL FT PNHSIHEAVEALTTVDVHHPPIESVVSCPLPVVYETVSDNTGFDFKKANFE FT SLNVALRSMDWDRLDSIEDVNEAVAFFTNSVMQAVAEHVPPRRPPSKPPWS FT NSRLRLLKRRRSAALRFYRSHRSQQSKLHFNRSSSEYRSYNAYLFAQYKSR FT TEQNLRSNPKQCWSFVNTKRNEDGLPTSMYLDELSADSASDKCELFAMQFK FT RAFNDYASPASQIDRALQDTPQDVFSYDMFHVSEEEVIGAIGKLKFSYNPG FT PDGIPPALLKRCSSTLLAPLTKLLNHSLRQQVFPACWKKSFLFPIHKKGDK FT RCVSNYRGITSLCACSKVFEIIVNDSLFESCKRYISTDQHGFFPKRSVSSN FT LVEFSTLCIRAIDAGKQVDAVYLDLKAAFDRVDHRILLQKLRKCGVAINFV FT DWFRSYLTERSLCVRIGESESASFTNISGVPQGSNLGPLLFSLFINDVSLI FT LPTGNRIFYADDAKIYMIVESLDDCYRLQHLLNLFEAWCARNCLTLSIEKC FT QVISYGRKRSPIRFPYELSGTTLERVKCVRDLGVTLDEKLTFNCHQNDVIS FT RANKQLGFVLKVSSGFKDPLTLKALYCALVRPILEFAAIVWCPFQNYWISR FT IESVQRKFVRHALKNLPWRDPENLPPYHERCRLLGIDTLEDRRHVAQTMFV FT AKVLRNEIDSPSLLADVNIYAPERTLRRRQFISLGSRNTRYGQHDPVRFMA FT LKFNEAYHLFDFNVTITTLRNHFIQYFRRN" XX SQ Sequence 3202 BP; 868 A; 740 C; 678 G; 916 T; 0 other; caatgatcat ctttcggcca acatgcatcg cactctttct cattcaggac gcacaacatc 60 tagcaccatg gaagccccaa cgcccccctc aacagtcgag tccctccagc cagcgctcat 120 cagtcgtcct gatcctgttg tagggagagg tggaggggtc ttccagcctg tgctgtcagg 180 caagtacaac aaaatcactg atagtttgct tcctgacatt tctccgcctt ccagtctacg 240 catctattat cagaacgttc gtggcttacg ttcaaaaatc gactcgttct tccttgccat 300 cagtgaattg gattatgacg tcatagtact ttcggagacc tggttagatg attgtatcct 360 ctcatcgcag ctgtttggaa gcaactattc agtattccgg acggaccgta gtgctttgaa 420 tagtgataaa ggaaggggag gtggtgtttt aatagccgtg tcatcgcgca taagctgctc 480 tgtagattct gcgcctgtta gtaaccagct tgaacagctc tgggttaaaa tcgtcttacc 540 ccgcaacaat gttagtattg gagtgatata cttgccaccc gacaagaaga acgacattga 600 taccatccag cgccacattg agtcaattgg agctgtccac agtcacctcc atgaaaatga 660 tatggcaatg ctgttcgggg actacaatct ctccgatgta cattggactt ccgatgcgaa 720 cgataatgtt ttcgttgatc tgacacgtac acgcctgaat ccggcaagtt ccactctatt 780 agatggtttc tgcttccatg gtctctcgca agttaataca atagttaact caggtgggcg 840 tactctcgat ctcgtccttc tcaatgatgt cgttctgcca aatcactcaa tacacgaagc 900 tgtcgaagca cttaccacgg ttgatgttca tcacccgcca attgaatcgg ttgtatcttg 960 tccgttacct gttgtctacg aaacggtttc tgacaacaca ggatttgact ttaaaaaagc 1020 aaacttcgag tcactgaatg ttgctttacg ctcaatggat tgggatcgtc ttgactctat 1080 tgaagatgta aacgaagctg ttgcattttt cactaattcg gtgatgcaag ctgttgcgga 1140 acacgttccg ccacgccgac cacctagtaa accgccttgg tcgaatagcc gactgcgtct 1200 tctaaagcgc cgtcgatctg ctgctctccg tttttaccgg agtcatcggt cacaacagtc 1260 aaagctgcac ttcaaccgat caagcagtga gtacaggagc tacaacgctt acttgtttgc 1320 tcaatataaa tctcgtactg aacagaacct ccgatcaaac ccaaaacagt gttggtcatt 1380 cgtaaacact aaacgaaatg aagatgggct tccaacatca atgtatcttg atgaactatc 1440 ggctgattcc gcaagtgata aatgtgagct gttcgctatg caattcaaac gtgccttcaa 1500 tgactatgct tctccagcaa gtcaaattga tcgtgcttta caagatactc cacaagacgt 1560 gtttagttac gatatgttcc acgtttcgga agaagaagtg attggagcga tcggaaaact 1620 gaagttttca tacaatccag gtccggatgg gataccacca gcgttactga agagatgcag 1680 ctccacactg ctggctcctt tgactaagct tctaaaccat tcactccgac aacaagtgtt 1740 ccctgcgtgc tggaaaaaat cgttcctatt cccaattcac aagaagggtg acaaacgatg 1800 tgtaagcaac taccgtggca taacttcact atgtgcttgt tctaaggtat tcgaaatcat 1860 agtcaacgat tccttatttg aaagttgcaa acgatacatc tccaccgatc aacatgggtt 1920 ttttccaaaa agatccgtct cgtccaatct tgttgaattt tcaacgcttt gtattagagc 1980 tattgatgcc ggcaaacaag tagacgctgt atatctggac ttgaaggcgg cattcgaccg 2040 tgtcgatcat cgtatccttc ttcagaaact cagaaaatgt ggtgttgcca taaatttcgt 2100 tgattggttc cgttcttatc tgactgaacg atcgttgtgt gtgagaatcg gggaaagtga 2160 atcagcatcg ttcacgaaca tatccggagt accgcagggt agcaaccttg gacctttgct 2220 gttctcacta tttatcaacg atgtatcact gattttgccg actggaaaca ggatcttcta 2280 tgctgacgac gcaaaaattt atatgattgt agaaagcttg gacgactgtt accgtttgca 2340 acatttactc aacttattcg aagcatggtg tgcacggaac tgtttaacat tgagcatcga 2400 gaaatgtcaa gtcatctcgt acggaagaaa gcgcagccca attcgcttcc catatgaact 2460 atcagggacg acgcttgaac gtgttaaatg tgttcgtgat ttaggtgtta cattggacga 2520 gaagctcact ttcaactgtc accagaacga tgtcatctcc agagcaaata aacaacttgg 2580 tttcgttttg aaagtttcaa gtggattcaa ggatccgcta accctgaaag ctttgtactg 2640 cgccttagtg cgtccaatat tggagtttgc tgctatcgtg tggtgtcctt ttcaaaatta 2700 ctggatctca cgaattgaat ccgtgcaaag gaaatttgta cgccacgctt taaaaaatct 2760 tccttggcgg gacccagaaa acctgccacc gtaccatgag cgttgccgtt tactgggaat 2820 cgataccttg gaggatagac gtcatgtggc ccaaacaatg tttgttgcta aggttttgag 2880 gaatgaaatt gattcaccgt ccctgttagc cgacgtaaat atttatgcac ctgaacggac 2940 actgcgtaga cgtcaattca tcagtcttgg tagtcgcaat acacgctatg gacagcatga 3000 tcctgtgagg ttcatggctt tgaagttcaa cgaagcctac cacttgtttg acttcaacgt 3060 cactataact acactgcgga accattttat tcagtatttt agaagaaatt gactgtatgt 3120 ttagcttaag ttatttattt atacattcat taagacaact tcgtgtcaga tggattaaat 3180 cagaaataca aatacaaata ca 3202 // ID L1-38_AAe repbase; DNA; INV; 5187 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-38_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5187 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1391-1391 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 1952..5119 FT /product="L1-38_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDSLSFNLATINIANIYSPTKIDALNSFLRLNEIDIV FT FLQEVENTSIQLTGYSIVFNIDNRNRGVAIAHKPSIEYTAVQNSLDGRLIT FT ITLKNGVTLCNIYAPTGAQNRASREDFFNFTCANYLRNCTGPIIFGGDFNS FT VVNPRDATGATSRSNMTSRLMSSLDLIDVWRVLHRNETDYTFVRAGSSSRL FT DRFLVSRSSTSWLRTINHSVTCFSDHKAVLMRMVLPITGPIIGRGLWRLNP FT HVLEDEDTQNRFQARWNYLVRQKQNYRSWSSWWLDYAKPKVASFFRWRTSL FT QIQDFRNTMNLLHGELTRAYQLYANDATQLTRINRIKSIMLLKQRESSSNW FT RQLKETYLKGEPTSLFHLSEKNYKRSKTLITELDVGGDMINDSGQIEQCVV FT TYFEDLFQENRQQPTGSTFLPTLRIPDQNVANEILMDPIEVNEIHRCIKNS FT SARKSPGADGLPKEFYLKCWNTISREFTLILNEMLHGPNIDPRMMNGIVVL FT VKKKGVCKTMNGYRPITLLNFDYKILTRILKQRMAPLLEFVLSKHQKCANG FT KRNIFQATSKILDTLSALKHSKKSSLLVSFDLDHAFDRVKHSFLLQTMQRM FT NFNAQLVELLKRIMENSSSRILINGKLSRAFQIHRSVRQGDPLSMFLFVIY FT MQPLIDKIVESGDAVEGLNVYADDISIFVPNMQTLQRVVQVIREFQTTSGA FT ILNLQKTVALKIGNISSGDQLPWLNFSNFVNILGIRFCCSVKQTMESTWIN FT VIKNMKWRLWMSRSRSLNLIQKIILINTFVSSKMWYTASTLPLPKKFEQQI FT LKEYRNFLWIGGKNHIRLETLFLTKNRGGLNLHCPGLKSVSLLTNRILENL FT YDLPFLRGLLNTDQILQIPAAYPQVKVFALEVATLANKTLEHHSSSNIYSE FT LLLAENDPEVLSVNRRWKVIFKYIADHRIHSQYRSVWYTAIHGKINHNELF FT YKQRRRDSPLCDRCPGVIDTVEHKLFSCLVSKPVWIFVKSIISRMKPALQR FT KSPNYFLFPELRGIPSREALKINQLFAKFVYFICCTPEEQINVSSFKLEL" FT CDS join(143..1096,1042..1941) FT /product="L1-38_AAe_1p" FT /translation="MAGFRKNSIVIDFSVMPVRPKLNVVQEFLFKQMSLDM FT SLVKNLQTSITKSHVIIELDSAATAENVVMLHNMKHTLEHDKQLYHIPVHP FT TDNAIEVKVYDLPPHMPNNLISAQFSAYGKVLSVKNDVWKDYFPGIPNGVR FT IIRMEIQTPIPSYVPVAGELSYVLYSNQVKTCRHCTQKVHIGKTCSAARKE FT SSQNAGEKSTLAEIVSNSQASTNESPIELMEATDSESDDESVASKEATTTQ FT LSTADSLATTMKKTSKSPETPNFSQPVASTSNSSKIVTSETETDPLGGRKR FT TLSPKQAEQESQRPQRRTKSQRHTSKARITTTSTPNKISATHFKVRITFFT FT IEAKHSVPCYLHVHLRKPSLEHLNFSKVKIHCICLAVAPSDTAVLKISINS FT SELHLQNRVINCNSKHHHRPKSISPLEQHFPSKVKTKSICPVEVYLDATVL FT KSTTISVEAVPSFICPWDCDDCSCRLEGFPSLEHRPFRKVIKHSICPAVAF FT LDTAAPVKQISTSVGQITTAALSSAKSSARVCRQKDTISETQLPRSRCLTN FT ISNNQIVVYGRTPTTQVSDNRDKPADNRQLVIRVTIIHTANDLTANDKSQR FT PGTLLRAHFTARVHFVYLTL" XX SQ Sequence 5187 BP; 1649 A; 1153 C; 1014 G; 1371 T; 0 other; cagttgacat ttaggcttca tacgaaacag acgtgtattg tttgccaagt gaaaacaccg 60 catatatttt cgctaatcgg actttgtccg aaaggcttat tcattaagtt ttcgtgtaaa 120 cgactagtta tcatccttaa ggatggccgg attccgcaaa aactcaatcg taattgactt 180 tagcgtcatg ccagtacgac caaagctgaa cgttgtgcaa gagttcctgt tcaagcagat 240 gtccctagac atgtcgctag tcaaaaacct gcagacaagc attactaagt cccatgtcat 300 catcgaactg gactccgcag caacagctga gaacgtcgtt atgctgcaca acatgaagca 360 tacacttgag catgacaaac agctctacca tatcccagtg cacccgacag acaacgccat 420 tgaggttaag gtgtatgact tgcctccaca tatgccaaac aatctcatct cagcgcagtt 480 ctcagcttat gggaaggttc tctccgtgaa gaacgatgtc tggaaggact atttcccagg 540 catccctaac ggagtgcgaa taattcgtat ggaaatccaa acgccgattc cttcgtatgt 600 ccccgtggcc ggagagctgt catacgtcct gtacagtaac caggttaaga cctgtaggca 660 ttgtacccag aaggtccata tcggaaagac atgttcggct gctcgaaagg agtcttccca 720 gaacgcagga gagaaatcaa cattagccga aattgtcagt aactcgcaag ctagtaccaa 780 cgagtccccg atagaactca tggaggccac agatagtgaa tctgatgatg aatcagtcgc 840 gtcaaaagaa gctactacca cccagctctc cacggccgat tcactggcca caacgatgaa 900 aaaaaccagc aaatctcctg aaacgcccaa cttcagtcaa cccgtcgcct ctacatcgaa 960 ttcgtcgaaa attgtcacct cggaaaccga aacagatcct cttggaggtc gcaaacgaac 1020 tctgtcgccg aagcaagctg agcaagaatc acaacgacct caacgccgaa caaaatctca 1080 gcgacacact tcaaagtaag gataactttc ttcaccatcg aagccaaaca ttccgtacca 1140 tgctatctac atgtccatct acgcaaacca tcgttggagc accttaattt ctcaaaggta 1200 aaaatacatt gtatatgtct agccgtagcg ccatcagata ctgccgtcct gaaaatttca 1260 ataaactcct cagaactcca cctgcaaaat cgtgtcatca actgtaattc aaaacatcac 1320 catcgaccaa aatctatttc accgttggag caacactttc cttccaaggt aaaaacaaaa 1380 agtatatgtc ctgtcgaagt gtatttggat gctaccgtcc tgaaaagtac aacaatctcc 1440 gtagaagccg taccgagttt tatctgccct tgggattgcg atgactgttc atgccggctg 1500 gaaggtttcc catcgttgga gcatcgccct ttccgtaagg taataaaaca tagtatatgt 1560 cctgccgtcg cgtttttgga tactgccgca cctgtcaagc aaatttcaac atccgtaggc 1620 caaatcacaa cagcggctct ctcttcagca aaatccagcg cacgagtatg tcgacagaag 1680 gatacaatat cggagacaca actaccacga tctcgatgct tgacgaacat cagcaataac 1740 caaattgttg tttacggtag gacaccaaca acacaggtga gcgataacag agataaaccg 1800 gctgacaata gacaacttgt cattagagtc actattatac acactgcaaa tgatttaaca 1860 gcaaacgata agtctcagcg accgggaacg cttttgagag ctcacttcac ggctagggta 1920 catttcgtat acttgactct ttaaaataca aatggattca ttgagcttca atttagcaac 1980 cataaacatt gctaatatat acagtccaac taaaatagat gctctcaatt ccttcctacg 2040 gttaaacgaa attgatatag tttttctgca ggaagtcgaa aatacgtcta tacaactgac 2100 aggttactca attgttttca acattgataa tagaaataga ggtgtagcaa tcgctcataa 2160 gcccagtatt gaatacacag cagttcaaaa ttctctggat ggtaggctaa taacgataac 2220 actgaaaaat ggggtcacgc tgtgcaatat ttatgcccct accggagcgc aaaatagggc 2280 atcacgagaa gactttttca acttcacttg tgcaaattat ctcagaaatt gtacaggacc 2340 gataatcttt gggggtgatt tcaattccgt tgtaaatccc agagacgcga ctggtgctac 2400 ctcacggagt aatatgacgt cacggctaat gagttcctta gacctgatcg atgtttggcg 2460 tgttctacat cgaaatgaga ccgattatac tttcgtgcgt gctggatcgt cctcgaggct 2520 agataggttc ctagtttcta gatcgtccac tagctggcta cgaaccatca atcattcagt 2580 tacgtgcttc agcgatcata aagcagtgtt aatgcgaatg gttttaccaa taactggtcc 2640 catcatcggg agaggcctat ggcgactcaa ccctcatgtt ttagaggatg aagatacgca 2700 aaaccgtttt caagctagat ggaactactt agtccgacaa aagcaaaatt atcgttcctg 2760 gtcatcatgg tggttagatt acgcaaaacc taaagttgca tccttttttc gttggcggac 2820 ttctcttcag attcaagact tccgtaatac tatgaatctc ttacacggtg aactaactag 2880 ggcttatcaa ctgtatgcca atgatgcaac acaattgaca cgaattaata ggatcaaaag 2940 cattatgcta ttgaagcagc gtgagtcatc aagcaactgg cgacaactta aggaaactta 3000 tctgaaagga gagccaactt ccttatttca tttgtcggaa aagaactaca agcgttcaaa 3060 aaccttgatt acggaattgg atgttggagg tgatatgatc aacgacagtg gtcaaataga 3120 gcaatgcgtt gtcacatatt tcgaagattt gttccaagaa aatagacagc aaccgacagg 3180 aagtaccttc ctacccactc ttcgcatacc cgatcaaaat gtagcaaacg aaatactaat 3240 ggacccaatt gaagtcaatg aaattcatcg ctgcattaaa aatagttctg ctagaaaatc 3300 accaggagct gatggactcc caaaagagtt ttacctgaaa tgttggaaca cgataagtcg 3360 tgaatttacc ttgatcttga acgaaatgct acatggtcct aatatagatc cacgcatgat 3420 gaatggaatt gtcgttctag tgaagaaaaa gggagtctgc aaaacaatga atggctatcg 3480 tccaattaca ctgctgaatt tcgactacaa aattctaact aggattttga agcagagaat 3540 ggctccatta ttggaatttg tcttatccaa acatcagaaa tgcgcaaatg ggaaacgcaa 3600 catttttcaa gcaactagca aaattctaga tacgctttcc gctctaaaac actccaaaaa 3660 atcatcactg ttagtgtcct tcgaccttga tcatgctttt gacagggtga agcatagctt 3720 tcttctacaa acaatgcagc ggatgaactt caatgcacaa cttgtagaac ttctgaaaag 3780 aataatggaa aactcttcat cgcgaatact catcaatgga aaactgtcaa gggcgttcca 3840 aatacaccgg tcggtgaggc aaggagatcc gctcagcatg ttcctctttg ttatatatat 3900 gcaaccattg atcgacaaga ttgtagaatc aggggacgct gttgaaggat tgaacgttta 3960 cgcggacgac atcagcatat ttgttcctaa tatgcagaca ttacaacgtg tagtgcaggt 4020 aatccgggag tttcaaacca catctggcgc aatcctaaat ctgcagaaaa cggttgctct 4080 gaaaatagga aatatttcgt caggagatca gctgccttgg ctaaacttct ctaatttcgt 4140 aaacatccta ggcattagat tctgctgcag tgtcaaacaa actatggagt caacatggat 4200 caacgtaatc aaaaacatga aatggcgttt gtggatgagc agatcgcgct cgttgaattt 4260 aatacaaaag ataatcttga taaacacctt cgtatcctcc aaaatgtggt acacagcgtc 4320 aacgctccca ttaccaaaaa agtttgaaca acaaatcctt aaggaataca ggaatttcct 4380 ttggattggt ggcaaaaatc acatacgcct tgaaactctt tttctgacga agaaccgtgg 4440 tggcttaaac ctgcattgtc cggggttgaa atctgtgtcg ttattaacga acagaattct 4500 tgagaatttg tacgatctgc cgtttttacg tggactttta aatacggacc agattcttca 4560 gattccagct gcttaccctc aagtcaaagt atttgctttg gaagtggcaa cactggccaa 4620 taaaactctg gagcaccata gttcttccaa catatattca gaactacttt tagcggaaaa 4680 tgatcctgaa gtgctgagcg ttaatcgtcg ttggaaggtg attttcaagt atatagcaga 4740 tcatcgaatt cattcacaat atcgctcagt ttggtatacc gcaatccatg ggaagattaa 4800 tcataacgaa ttattctaca aacaaagacg ccgagactct cctttatgcg accgttgtcc 4860 aggtgtcata gataccgttg agcataagct tttttcgtgt cttgtgtcaa aacctgtttg 4920 gatttttgtg aaatcaataa tttcaagaat gaagccagcc cttcaacgga agtcaccgaa 4980 ttattttctt tttcctgagt taagaggcat accatccaga gaagcactca aaattaatca 5040 gctttttgca aaatttgtat acttcatatg ttgtactcca gaagaacaaa ttaatgttag 5100 tagttttaag cttgaattgt aaatctaaaa gttttataag tcagtactct aaataaattc 5160 agaccaagat aaaaaaaaaa aaaaaaa 5187 // ID Waldo-4_AAe repbase; DNA; INV; 6514 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Waldo non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele3; KW Waldo-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6514 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6514 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (05-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as R1_Ele3. CC [2] Consensus update and characterization of target sequences. CC This consensus is generated from 30 sequences with >98% identity, CC and ~98% identical to the original sequence in [1]. CC Sequence-specificity is not so tight compared to other Waldo CC families in Aedes aegypti, but their 5' ends are frequently CC flanked by (AC)n microsatellites. Renamed as Waldo. XX FH Key Location/Qualifiers FT CDS 891..2285 FT /product="Waldo-4_AAe_1p" FT /translation="MEQIIDLDGESSGGNEDAIVFARSGKMQRSPVIPQTV FT AASTSRCSTPSELPGNQRDAGFLSQVLTPKSGSSTAQDDRQHDKSELSEVK FT RKVNELYEFVKTKNNVHLRIKQLVTSIKSAVTAAERGQKELRSRAEKAEKA FT LTSVNATAVEQETPRSNPTTRTEKRTRDTPGEEEDPKKQRSGNECVSEQNG FT SGWRTVKGQKEKNTARKERKRQKKEPKKKEKQRSPRERFRGDALVIEASDK FT TSYAAILRRVREDPELKELGENVVRTRRTQKGDMLFELKKDPSIKSSACRE FT LVAKSLGNEASVRALSQEAVIECRELDEITTEDDVRGALLSQCNLEEAPQS FT IRLRRAYGGMQIATIRLPVAAAKKLVEAGKIKVGWSVCPLKLIPRETNPVE FT RCFKCMDFGHRAINCKGPDRSEHCRKCGEKGHVGRDCLKSPRCMLCKVEEG FT NAHTTGGFKCPKYQKAKAGQ" FT CDS 2288..5314 FT /product="Waldo-4_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MMMEITQVNLNHCDAAQQLLWQSTTETKCDIAIIAEP FT YRVPPDNGNWVADRAETTAIQVMGRFPIQEVVERSYEGFVIVKINGIFVCS FT CYAPPRWTLEQYTQMLDVLTDKLIGRTPVIIAGDFNAWAVEWGSRLTNARG FT YSLLEALAKLDVRLCNDGLASTFRKDGRESIIDVTFCSPSLVTNMNWRVSE FT EYTHSDHQAIHYCVGQRSCAVWQRRNGERRWKTKHFDKDLFVEALRAVSDA FT PNMDAGELTEALARACDATMPRKLEPRNQRRPAYWWNDRLSTLRAACLRAR FT RRVQRARSEASMEERKVTYRLARAAFKREISTSKTNCYKELCREADANPWG FT NAYRVVMAKIKGPVTPAEMCPDKLKAIVNGLFPTHAPTAWPPTPYADVDEE FT NAGIQVSNAELITVAKGLKLNKAPGPDGIPNVALRAAILAFPDIFRTALQK FT CLAEGCFPDRWKVQKLVLLPKPGKPPGDPASYRPICLLDTLGKLLEKVILG FT RLTIYTEGENGLSKRQFGFRKKTSTVDAVRAVIACAEKAAKQKRRGNRYCA FT VVTIDVKNAFNSASWEAIAAALHRMRVPGYLCKVLQSYFQNRVLVYDTNQG FT RMSIEVTAGVPQGSILGPTLWNMMYDGVLTLSLPNGVEIVGFADDVVLAIT FT GETAEEVEMLTAESLDTIESWMTGVKLQLAHHKTEVVLVSNCKAVQQLEIN FT VGGHAIPSKRSLKHLGVMVDDRLNFNSHVDYACEKAAKAANTVARIMPNIG FT GPRSSTRRLLATVSSSILRYGVPAWAAALGTKRNRDKLAGTFRLMAIRVAS FT AYRTISSEAVCVIAGMIPICITLAEDIACYQQRDTRNVRNTIRLDTMARWQ FT QEWDNSEKGRWTHRLIPNVSVWTTRKHGEVNFFLTQFLSGHGCFRKYLHRF FT GHAESPHCPACPNIEETPEHVIFDCPRFRDIRSEMMPETASVLNPDNIVQN FT MCQHESTWNAVNRGITAIMSLLQRRWREDQRAPGRDRSRLDPPSGTRPSRA FT GVA" XX SQ Sequence 6514 BP; 1836 A; 1590 C; 1902 G; 1186 T; 0 other; gggttgcaaa atggtgagtc gacaaccagg aaggagcgtc caacatagct ctggtcctca 60 caagccccta cctcacgctt ccacgggtct aacgatgaca aagaccgcca gcaaagggtt 120 gcgtactttg ctggtagtgc aacctgggca ctgttgtcct tctgacatca gctagagtga 180 ggaggtgcca ggtgggagct tgggattttt accctttcca agcgaaactc gtacatacag 240 ccgtatggaa taccaccttt agtacttcct tcgggaggtg accgagcgtc tggtccatct 300 gaccaggggc tcaagcatgg tctacctagg atgtggcggg gtttcatcag tgggctctgg 360 tgaatctcta caaaaaacca catatctgca agtaaccctg aacaagcgac ctggtaccgc 420 tttcaaagta ccttagccct ctggagtgcc aaccggcaca tcaggatgga tgccagtaaa 480 atcctgatta tggtatactg gtcacaacgc agaacaacgg acaggacacg gattacggat 540 tggatacgtc gactaacagt cgtgggctgg cagcgtactc tactgttccc tgagtaagga 600 agggtgacac caacttgaaa cggcgaatag gactatgagg aaactgcctc cgtcctgata 660 aaaccttggc aggccttcgt gtacgttcga aactcgtcac caacctcggt gagtagtagg 720 ctcagatgta acattcctga cttacgccga tccggctctg aacacaaccc ccatgaagga 780 ttgtgtcaac cctagcatgg cctccctgct tgcatagaca accatgggat cacgagaggc 840 gactatttac aacgagtaga gatagcggct cgaagggggg acccaatcga atggagcaaa 900 taatcgattt ggatggagaa tctagtggag ggaacgaaga cgcgatagta ttcgcaagga 960 gcggaaagat gcagagatcg ccagtgatac cacagacggt agcagccagc actagcagat 1020 gcagcacacc gagtgagcta ccgggaaacc agcgggatgc aggcttcctg agtcaagtgc 1080 tcacaccaaa gtcggggagt agcaccgctc aagacgaccg gcagcatgat aagtcggagc 1140 tgtcggaagt aaagaggaag gtcaatgaac tctatgagtt cgtcaagacg aaaaacaacg 1200 tccacctcag aatcaagcag ctggtcacga gcatcaagtc tgccgttaca gccgcagaac 1260 gcgggcagaa ggaattgcgg agtagagcgg aaaaagccga aaaagcatta acttcggtga 1320 atgcaacagc cgttgaacag gagacgccta ggagcaatcc gaccacccgc acggaaaaaa 1380 gaacgaggga tacaccagga gaggaggaag atcccaaaaa gcagagaagc ggcaatgagt 1440 gcgtcagtga gcagaacggc agtggatggc gcaccgtgaa aggtcaaaag gagaagaata 1500 cagctcgtaa ggagaggaag cggcaaaaaa aggaaccgaa gaagaaagaa aagcaacggt 1560 cacctcgcga gaggttcagg ggcgatgctc ttgtcattga ggcaagcgat aagacaagct 1620 acgcggcgat tctcaggaga gtgagggagg acccagagct taaggagctg ggtgaaaacg 1680 tggtgagaac caggcgcacc cagaaaggtg acatgctctt tgagcttaaa aaagatccgt 1740 cgatcaagag ttcggcctgc cgagagctcg tcgcgaagtc gctgggcaac gaggcaagcg 1800 tgagagcctt atcccaggag gcagtgatcg aatgtagaga actggacgag atcacaacag 1860 aagacgacgt gaggggtgca ctgctatctc aatgcaacct ggaagaagca ccacagtcca 1920 tccggctgag gagagcttac ggaggtatgc aaatagcgac gatccgatta ccagtcgcag 1980 cagccaaaaa actcgtcgag gctggtaaga tcaaggtggg atggtcggtt tgcccgctga 2040 aactgatccc tcgggagaca aatccggtgg agagatgctt caaatgcatg gatttcggcc 2100 accgggcaat aaactgcaaa ggcccggaca ggtccgaaca ctgcaggaaa tgtggtgaaa 2160 aaggtcacgt tggcagggac tgcctgaagt cacctaggtg catgctgtgt aaggtggagg 2220 aaggtaacgc ccatacgacg ggtggcttca agtgccccaa atatcagaag gcgaaggcag 2280 ggcagtaatg atgatggaga taacgcaggt gaacctcaat cattgcgacg ccgcacaaca 2340 actgttgtgg cagtcgacaa cagaaacaaa gtgcgacatt gcgatcattg cagaaccgta 2400 tcgggttcct cccgataacg gcaattgggt agcagacaga gcagagacga cggcgattca 2460 agtaatgggc cgattcccta tccaagaagt tgtcgaacgc tcatacgagg gcttcgtgat 2520 cgttaaaatt aacggaatct tcgtgtgtag ctgttacgcc cccccgagat ggacactgga 2580 gcagtacacc cagatgttgg atgtgctaac cgacaagctg atcggtcgaa cgccggtaat 2640 aatagcagga gacttcaatg cttgggccgt ggagtggggt agcagactga ccaacgcgag 2700 aggatacagt ttgctggaag ctctagcgaa gctcgacgta aggctgtgca acgacggtct 2760 cgctagcaca tttcgcaaag acggtaggga gtccatcatc gacgtgacct tctgcagccc 2820 gtcgctggtc accaatatga actggagagt tagcgaagaa tacacccaca gtgaccatca 2880 agcgatccac tactgcgttg gccagagaag ttgtgcagta tggcagagga ggaatggaga 2940 acgaaggtgg aagacgaagc atttcgataa ggatcttttc gtcgaagcac tccgagcagt 3000 aagcgacgct ccaaacatgg atgcaggaga gctgacagaa gcgttggcga gagcgtgtga 3060 cgcaactatg ccgagaaaat tggagccaag gaatcaacgg cgtcctgcat actggtggaa 3120 cgacagactc agcacccttc gcgcagcctg cttaagagcc agaagacgcg tgcagagagc 3180 tagatccgaa gccagcatgg aagagcgtaa agtgacctac cgactggcca gagccgcgtt 3240 caaacgggaa atatcgacga gtaaaacgaa ctgctacaag gagttgtgcc gggaagctga 3300 cgcaaatccc tggggcaacg cctatcgagt agtgatggcg aagatcaagg gcccagttac 3360 cccagctgaa atgtgtcccg acaaactgaa ggcgatagtg aacggtctct ttcctacgca 3420 tgctccaaca gcgtggccgc ccacaccgta cgctgacgta gatgaggaaa atgccggtat 3480 acaagtgtct aacgccgagc tgataacggt tgcgaaagga ttgaagctga acaaagctcc 3540 cggaccagat ggtatcccta atgttgcgct aagggcagca atcttggcgt tcccagacat 3600 attcaggaca gcgttgcaga agtgtctagc ggaaggctgc ttcccagaca gatggaaggt 3660 gcagaagctg gtgttactgc caaaaccggg aaaaccgcca ggagatcctg cttcatatag 3720 accaatatgc ttgctggata ctcttggcaa acttttggag aaggtcatcc tcggcaggct 3780 gacgatctac acggaaggcg agaacggatt gtcgaaaagg cagttcggat tccgtaaaaa 3840 gacttcgacg gtggatgctg ttcgggcagt catcgcgtgc gcggagaaag cggccaagca 3900 gaagaggaga ggcaatcgat actgcgcagt ggtaacaata gacgtcaaga acgcgttcaa 3960 cagtgccagt tgggaggcta tcgccgcagc cctacacaga atgcgggttc ccggatatct 4020 gtgtaaagtt ctgcaaagtt acttccagaa ccgggtactg gtctacgaca cgaaccaggg 4080 acggatgtca attgaagtca cggcgggagt tccacaagga tccatacttg gtccaacatt 4140 gtggaacatg atgtacgatg gagtgctaac cttgtcactg ccgaatggag tcgagatcgt 4200 cgggttcgca gacgacgtcg ttcttgcgat aactggtgag actgcggagg aggtggagat 4260 gctgacggcg gaatcgctag acaccatcga atcgtggatg actggagtca agctgcagtt 4320 ggctcatcat aaaacggagg tggtgctggt cagcaactgc aaggcggtac agcagttgga 4380 aatcaacgtc ggagggcacg caatcccatc gaagcgctct ctaaagcatc tcggagtaat 4440 ggtcgacgac aggctaaact tcaacagcca cgtagactat gcctgcgaaa aggcagcgaa 4500 agctgctaac accgtagcca ggatcatgcc caacattgga ggaccaagaa gcagcacgag 4560 gcgtcttcta gcgaccgtat catcgtctat actgaggtat ggagtcccgg cttgggctgc 4620 agcgttggga accaaacgta atcgagataa gctggcaggt acgttccggc tcatggctat 4680 acgggttgcg agtgcctacc gcactatatc atcggaggca gtgtgcgtta tcgccggaat 4740 gatccccatc tgcatcactc tggctgagga catcgcatgt taccagcaaa gggacacacg 4800 gaatgtgagg aataccatta ggttggacac aatggccagg tggcagcagg agtgggataa 4860 ctcggagaaa ggaaggtgga cccacaggct cattcctaac gtgtcggtgt ggacgaccag 4920 aaaacatgga gaagttaact tcttcctgac ccagtttctg tcaggccatg gatgcttccg 4980 gaaatatctg cacagattcg gacacgcaga atctccacat tgtccggcct gtccaaatat 5040 cgaggagaca ccggagcacg ttatattcga ctgccctcga ttcagggata tacgaagcga 5100 gatgatgcca gaaaccgcaa gcgtcctaaa tccggacaac atcgtgcaga acatgtgcca 5160 gcatgaaagc acctggaatg cggtgaacag gggaataacg gcgattatgt cgttgctgca 5220 aaggagatgg cgtgaagacc agagagctcc gggtcgcgat cggagtaggc tagatcctcc 5280 gtcggggact agaccgagta gagcgggcgt agcgtagtat cggttaatag tcgtcggggc 5340 gcctgcaaac cggaagccat cctccaaccg gaattgcaga accgaccctg gcacttggcc 5400 gaccaacgtc gagtcggagt aggctgagtc caccgccggg gtccagctga gtaattcgcg 5460 aaatagcacc ggcaaatggt cgtcggggcg cctgtgaacc ggaagtttcc ctccaccgga 5520 atcgcaggac cgacctcggc atctgcccgg ccagcttcgg agtaggctaa gtccaccgcc 5580 ggggactagt tgagtagatc gcgttgttgc accggcaaat gatcgtcggg gcgcctgtga 5640 accggaagct tccctccacc ggaatcgcag gaccgacctc ggcatctgcc cggatagcat 5700 cggttcggga gatcttccgc cgccggggaa atcttcgtcg gagtaggata gatccaccgc 5760 cggggactac tctgagtagt acgtagcgta tcaccggctt gagtcgtcgg agcgccagcg 5820 aaccggaagc catcctccaa ccggaatcgc tggaccgact tcggcactcc accggcctgt 5880 atcgaatcat gaagaagcgc gtagccggag taggctagct ccaccgtcgg ggactaagcc 5940 gagtagcatc gatcgtcaca aaccagtttt gggtcgtctg ggcgccagtg aaccggaagt 6000 catcctccaa ccggaatcgc tggaccgacc tcggcatcca actggtcaac aaggagagct 6060 cgaacagcag cagcggagaa cgagtcgtcg tagagcaacg aagtgcacat gagcgtccag 6120 tagaagatca acagagctct gacgcgaggc cagcccaagg gaagtatgcg tcgtcgcaca 6180 gaaagctgtc caaagtcccg gcgagatgtt caaccgccaa ttgggcaaaa cgctccaaca 6240 gcgaactagc agaacagtta gaggctgaga ttgaagaagt gcatgagcac agaagtgcat 6300 gagcacagcc ctcccccgat gaagttgcct gatgtagttc cgggggggat cgaggcacag 6360 gagcagtagg gaaagttttt tagtggttaa gcacgcctaa cgtgagtccc acaccgtgcc 6420 aacacacaac aggcctggct tttgaagctt ttttgtaccc tactataaaa aaaaaaaaaa 6480 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 6514 // ID BEL-195_AA-LTR repbase; DNA; INV; 598 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-195_AA_; KW BEL-195_AA-I; BEL-195_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-598 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 882-882 (2011). XX DR [2] (Consensus) XX SQ Sequence 598 BP; 203 A; 86 C; 126 G; 183 T; 0 other; tgttacgctg ggagataaag ggtttgttgc atggacgcga tccacatttt gcaaagtatg 60 gctcaagatt gaaagaagga agaaatcaca ataacagcat tgtaaacaca gggcgctgac 120 atgttgaatt cggattgcta tgaatagact attgtgattg ctatcggagt tgaattatta 180 actatttttt gaattagatg ctattgagag tttactaaaa ttattgcatt ggaaatagta 240 agtaaaaaga aaattattat caattgttta ttacataaat cacttacaat aggcgcagaa 300 ttgccagcag agtgaagaca ataacgatag tgggtgttga ctgatagaac ttgccgtgag 360 taattatgag taatattgaa tactggttgc taatggcgaa atgttgtaca ggattacgtt 420 gtaggaaatt tggaatgttc gtgggctgaa tagctgaatt gaagactaga tttgtaagta 480 acaacattat aaaaccccat atgaacctaa tcaaatataa ttttagtttt gagtgtgccc 540 aaacaccgtt acatacttca aaagccgttt ttcaatatgc tgccccgaac tcctccca 598 // ID Sola3-1_BF repbase; DNA; INV; 8912 BP. XX AC ABEP01036107.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola3 DNA transposons from Branchiostoma floridae. XX KW Sola; DNA transposon; Transposable Element; Sola3; TTAA TSD; KW Sola3-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-8912 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(3637..4563,4768..5259,5231..7153) FT /product="Sola3-1_BF_1p" FT /translation="FSAQLIFQEETQDSPSSSLSMGSPSAERSLYTVSSQS FT TGNGGSSVPEPTHLLEQLLQASGSHVDKVQVLTVPWAEASERSQRLHLQQA FT SEAVSTVLQVLAPGDAYSLWKYLRESKVTDVALGMGGSAIEMELLNSLVEC FT YQCASQSFTRRQILSIMADKYSFPDLEKLLPGLTRYQVTSARQHAMQYGRG FT APIPPSVGTRMRVDPGKLDHFIAFATSAHVVQDLPFGEKVLKLSSGESLNV FT PNTIRAMIPERIIAQYQQYCTETHFLPMGKRTLQRVLSACSASVRTSLQGL FT DYFTAEGNSTRLLLNVHFIHPNSTKENVNDKNTPWPSWLAGIAILNIAFNL FT RLFPFHCLGSRAFEDIDGIVGKLPEQLGQDWAKKQTAILRSSKRYLKGDYK FT VVQYWHQNIGNNLLFHVLCSNEFKHLSRAAKLYKSWYFINTILYQNMFMQM FT FKCACFHSVTNTSPHVSVGTCFTGGISSVHVSQEASVAEHCRQYALSDGTD FT NHTTTCSHQHVEACEACDSLQQLFDSLEESFTDYTFPTSEEHDDMRFRLDQ FT AKQDINAWQCHQLRSINQDEAWHVLLDNMDDTSVLLVMDWAMKFLPRKYRE FT SQTDWFGKRGIPWHITVAHQQGTSSIQVETFIHLFQTCKQDSNSVVSILQH FT IAQQLKKELPDLQSIYLRSDNAGCYHNTLILQAARHINETAGVTIRRVDYC FT DPQGGKGSCDRQAATVKSHIKTWINEGHNVETAEEFKIAVESRGGIPGVRI FT FLCEVGAAVAATSMKLDGITKLNNFEFSDAGLRVWRAYNIGKGNLIPWPRL FT PAIEPPLLKIVSQPSNPGSAFRPIKSRQKKVQAQQEVESDSDEDAASHETM FT QPSLGSLFPCPEESCVKVYQTCKGLEAHVAVGKHKRRLERETLLDKAKLKY FT AEKLAQGPSEVPHVEPSLEVNRTTPCPQQGWALRSPRKAVRFSHKQKDYLD FT KRFRLGQSTGRKSDPITVSKEMRHARDTQGQRLFQVQDFLTGQQVSSYFSR FT LAAQRRKPSEDSEEEEEQQDPAVIESSNSRIQEEIISNVQLCHPVLYDTYN FT LCDLAKIGGLYEIGIALLKSICELFDVDVSSITDRRRKAAFVAKIKEYLSC FT CPCSK*" XX SQ Sequence 8912 BP; 2678 A; 1969 C; 1826 G; 2439 T; 0 other; gagcaattgt ctgtaaggat caaacaatgg tggcaggggt caaaaaatcg atttggctca 60 tattttgcac agtgatagta cttggtccac aacccctcaa aatagctttc ttttagatca 120 aaactccttg ttgccatggt aatgggcaaa caaagttttc tggccatttt ccccaatttt 180 cgcatctcaa aacctcagaa atcagcatat tttacggcac tgtctcggca agatctttta 240 atctatcatc aaaacaaaac aagatctgaa tagaccaaca ttttggcttt ttcagaattg 300 ttgtgaaatt tccaaagagg tacatgtaag ttcttgggca gcctgaaaac agtgtgatgt 360 ttcaagcttc aagaaaatct aatttttggc caaactttgc aacgctataa gtgccgctct 420 ggtatttttc tttacttgaa atttgtacca tatatagatg attgcaacat ctttcaaatg 480 atatctttca aaatttggta acttgtttgg ttctcacgtg gttgtccctt aaagtaggtc 540 acattttgcg tgtagccaaa attgttgatg ttagctatgt tcaaagtact gtacaggaca 600 aagtaaaata gatttctcac tcaacttctc atcaaatata gatctatgta ttgcctatca 660 agtaattgct tgtagagttt cgggtcgtaa tgtttaatca cgtggttgtc cctaaaagta 720 ggccattttt tgcatttggc aaaaaattgg aatttaagcc atattcaaag tactgtacag 780 gacaaagtaa aatagatttc taactcaact tctcatcaaa tatagatcta tgtattgcct 840 atcaagtaaa tgcttgtaaa gtttcgggtc gtaatgttta gtcacgtggt tttccctgaa 900 agtaggccat ttttcgcatt aggcaaaaat ttggaatttt gaaaataatt gtgacttgta 960 gcttacaatg ttcaaagtac tttttgtgac aacgtgcaag tagaattctg attcaacttc 1020 aaataaaggc ctgttgtata gtttcgggtt gtgacgttta gtcacttggt tgtcaaaata 1080 agtggatcac attttcccgt gtctagacac agtttggaac ttcaagaaac tatcattata 1140 gcgctagcct tccttaatga cgtgaaatac ttaaatacaa aacaggttca tcttccgacc 1200 atatcagcta tattgtagcc ccagacatgg tttttcaaag gagtacaagt cagagatttg 1260 cacaacctga attctgtcaa ggtggggttc tcttacaaag gccgaattca ataatggttt 1320 gtcttttaaa ctaggtcagg tatcacttac aaataactgt gaggatcagg tgttataact 1380 gcgtttggaa aggttggcca tgttattcca tgtgctgaag ttatcttggc gtcctggtgc 1440 aacttatcca tatcagttcc ccttgaaaat ataatgaacc tccccttgga tccttgctcc 1500 gagggcgaga aacctaagaa cttcttatct ttctgaccca gacagggtga acttcttatc 1560 ttttctatct agacagggta gaattgttgc agtctgtctg ggcctcaaag aaacagcacg 1620 tgttgttacg gggagttcat ttaccaacag tttttgcagg attttgaacc aaattcgttg 1680 tatttcaagt tatcggcatg tatttggaaa gccgtagcta agtcaagtgt catcaatgga 1740 ctgttggacg acttacagac acagggggca tgttggtgga ccgctgctgc aactggtaag 1800 ctacgaatta cctcttatag ttttgaaaac atcatatact aggttatcaa tgcgttatca 1860 gattgtttca aaagtctcag tgcatgtgtt cggaggattt tttttagttt attgtctaat 1920 accttattga tggtacatgt atgatttagg ctgttaaagg ccatgagttt aaagggtata 1980 tacaatgtgt gaaacaagtt gatgtcctac aaagtataat ggaacaatgt ttttctcatg 2040 attccttcat tcatttgtac ctccaggcca tctgaatcat gtcgtgttcc cttgctgaat 2100 atgccggagg gaggtgtgga gcaaacccaa aggctcgaaa ggagggtgaa agcattatac 2160 cacttattca gtgtacaaaa gacatttcgt atcatttgaa aacctttagc gttccgagga 2220 cagttcagaa tgaacatgag ttgattttgg ctagagccgg aatattcaca cttccagaca 2280 atgtcgctgt gatgacggca tgcccaaagc acagaggggt tttgggtatc tattggtgca 2340 ggtcaaccga aaaatgcgga atgccagatc agctttctaa tcacagtcga tggagggcaa 2400 ggcgcaaagg taccagaggc atcggcttct ttcagtcacg gaagatacaa gagctgacca 2460 agcagttggt tccagttggc acaggtattt ctgaagttgt atggaatgtt tgatcgttgt 2520 atttgtacat tgtattcgtc acattcaaag tttacagttt aagagcattc atttcaacca 2580 ttgcatctta ctaagtcttt gctatcgttt attcacagac aacaccatac atataagcta 2640 caaacaaaag ttgtctatag ctttatactg caccgtaacc ccttacctgg ttaatgtgcc 2700 caacatgtga ttgtaataca cacacacact ctttctctct ctctctctca cacacacaca 2760 cacacacaca cacacacaca cacactgtca cacacacaca ctcactcact ctctttttct 2820 ctctctctct ctcacacaca cacacactca cacacacaca cacacacaca cacacacaca 2880 ctcaccgtca cgcacacact cacacgctca cacacacggc acacaagcac actcacacac 2940 acacacggca cacacgcaca cacacacaaa catgcgcacg cacacatgca cagtcacaca 3000 cgcagtctaa cacgcacgca cacacacaca cacacactca cacacatgtg cacacatggc 3060 acacaagcac actctcacac acactcagtc acacacacaa acaagcacac aaacacaaac 3120 atcacgtatg cacacacaca aacatgcgca cgcacacatg cagtcacacg cacacagtct 3180 cacagacaca cacgcacaca gtcacacaca cacacacaca cacacacaca catacataca 3240 tgagcgcttg cacacacgag cgcacacaga cacacacgca cacacacata cacactacat 3300 ctacaattta tggtattcat tacataattt ttttgctccc cagccatatg ttccgaatgt 3360 cggaactcga tcggagggga agaggagcta tcagacgtga aagaagaaac tactcctctg 3420 gatgtacagg aacaccgtag cagcggcctt ggcgtccagg agccagttcc aagtacgagt 3480 ggcggccatg gcgtccagga gccagtgccc agtgccagtg gccttgacaa agctcttcag 3540 cgattgtcac tggtatgcaa tattattcaa ttaaaactgt tctgatcaaa ctttttgttg 3600 gtttctaatg ttacatagtt aggctacaat gaataattca gtgcacaatt aatttttcag 3660 gaggaaacac aagatagtcc ttcatcctcg ctgtccatgg gaagtccctc tgccgagaga 3720 agtctataca ccgtcagttc acaatctact ggcaatggcg gctcatccgt cccggaacca 3780 acgcacctgc tggaacaact tttgcaggca agtggtagtc acgttgacaa agtgcaagtt 3840 ctcactgtgc cgtgggctga ggctagcgag agaagccagc gattacatct gcagcaggca 3900 agcgaagcag tcagcaccgt gctccaggtg ttagccccag gcgatgcata cagcctttgg 3960 aagtacctgc gagaatcgaa ggtaacagat gtagcccttg ggatgggtgg aagtgccata 4020 gaaatggaac ttctaaactc tctcgtggag tgttaccagt gcgccagtca gtcgtttacg 4080 cgcaggcaaa ttctgtccat aatggctgac aagtacagct ttcctgacct cgaaaaactc 4140 ctcccgggct taacaagata ccaggttaca tccgcaagac aacacgccat gcaatatgga 4200 aggggagcac cgataccgcc atccgtgggc acacgtatga gggtagaccc gggaaagctt 4260 gaccatttca tcgcattcgc caccagtgcc catgtcgtgc aagaccttcc gttcggagag 4320 aaagttctga agttgtcatc tggagaatct ctaaacgtgc caaataccat ccgggctatg 4380 attccagaga gaatcattgc tcaataccaa cagtactgca cggagaccca tttccttcca 4440 atggggaaga gaacactgca gcgagtactt tccgcctgca gtgcttctgt aaggacatct 4500 ctgcaaggtc tggactactt cacagctgaa ggtaacagta cacgtctgct cttaaacgta 4560 cattaaatga tcccctgtac catattattg catagcagtt gcttgcttct acaatagcat 4620 ttgaatacat ctaaaactat cagaaatgta tgtgtcatct tgacatattt gccatactca 4680 acttctatgg gcaaaatata tattgagctg ttgtccattt ggaaaatcta gatatttcaa 4740 cgatcttcat ttggatgaaa aaactagttt atacatccca attcaacaaa ggaaaatgtg 4800 aatgataaaa acactccctg gccatcttgg ttagctggta ttgctatctt gaatattgca 4860 tttaatttac gtttatttcc ttttcactgt ttaggttcaa gggcatttga agacattgat 4920 ggtattgtgg gaaagcttcc tgagcagcta ggacaggact gggcaaagaa acagactgcc 4980 atcctaaggt catcgaagag atatctgaaa ggggactaca aggtagttca atattggcat 5040 caaaacatcg gaaataatct tcttttccat gtgttatgct caaatgaatt taaacatcta 5100 tcaagggcag cgaagttgta taaatcttgg tactttatca acactatttt gtatcaaaat 5160 atgttcatgc agatgtttaa gtgcgcttgt tttcattcag taaccaatac ctctccacat 5220 gtttccgtag gtacatgttt cacaggaggc atcagtagct gaacactgtc ggcagtacgc 5280 cctcagcgat ggcacagaca accacaccac aacgtgttcc caccagcatg ttgaagcgtg 5340 tgaggcctgc gacagtctac aacagctttt tgacagcctg gaagagtcgt tcacagacta 5400 cacatttcca acatcagagg aacatgacga catgcgcttc agactggacc aagcaaagca 5460 agacatcaat gcttggcagt gccatcagct gcgcagtatt aatcaggatg aggcctggca 5520 cgtacttctg gataacatgg acgatacaag tgtgcttctg gtaatggact gggccatgaa 5580 gttcctgcca agaaagtaca gggaaagcca aacagactgg tttggaaagc ggggcatacc 5640 ttggcacatc acggtagcac accagcaagg gacaagttct atccaagttg agacattcat 5700 ccaccttttc cagacatgca aacaggatag taattcagta gtctccatct tgcaacacat 5760 agcacagcaa ttgaagaaag agcttccaga cctccagtcc atctacctcc gaagtgacaa 5820 tgctgggtgt taccacaaca cactcatact tcaagcagct agacatatca acgaaacggc 5880 aggcgtcacc atcagacgag tggactattg tgacccacag gggggaaaag gatcatgcga 5940 cagacaggcc gcaacagtga agtcccacat caagacctgg atcaatgagg gccacaatgt 6000 tgagacggcc gaggagttta agattgccgt agagtctcgt ggtggaatcc cgggagtgag 6060 gatcttcctg tgtgaagtgg gtgcagctgt tgcagcgaca tccatgaaac tggatggaat 6120 caccaaactc aacaactttg agtttagtga cgcaggtcta cgtgtctggc gagcttacaa 6180 catcgggaaa gggaatctta tcccatggcc gcgattgcct gccatagaac caccgctgct 6240 gaagattgtc agtcagccgt caaatcctgg gtctgcattt cgacccatca agtccaggca 6300 gaagaaggtg caagctcagc aggaagtgga gtcggattct gatgaagacg cggcaagtca 6360 tgagactatg caacccagcc ttggcagcct tttcccttgt ccagaagaga gctgcgtcaa 6420 ggtttatcaa acatgcaaag gtctggaggc tcatgtagct gtcggaaagc acaaacgtcg 6480 ccttgaaaga gagactttgc tggacaaggc aaagttgaag tacgcagaaa agcttgcaca 6540 aggcccgtca gaagtgccac atgtagagcc atccttggaa gtaaatagaa caacaccttg 6600 cccacagcaa ggatgggcac ttcgcagtcc gaggaaggct gtcaggttta gccacaagca 6660 aaaggactac cttgataagc ggtttcggct tggacaatct actggacgaa agtcagaccc 6720 catcactgtg tccaaagaga tgagacatgc ccgcgataca caaggccagc ggctattcca 6780 ggtgcaggac tttttgactg gccagcaagt ctctagctat ttctcccgac tggcagctca 6840 gcgccgtaaa ccttcagaag attctgagga ggaggaggag caacaggacc ctgctgtaat 6900 cgaatcctct aacagtcgaa tacaggagga aatcatctcc aacgttcaac tatgccatcc 6960 tgtattgtat gatacatata atctttgtga cttggcaaag atcggaggcc tgtatgagat 7020 tggtatagca ctattaaagt ccatttgcga gctttttgat gtcgacgtat cctccattac 7080 tgaccgtcga cgcaaggctg catttgttgc aaaaattaag gaatacttga gttgttgccc 7140 ttgctcaaag tagacatcat atggtttgtg tgaaactaga gtgtactaag acgtagaggc 7200 ttttgtgtgt accgtttgat atttagcaca ttttccaatg tttatatgct ttgagaaatt 7260 attcatttaa ttttttaaga tctatatagg ctgtactttt ggtgttctta tgtatgcttg 7320 gttagataaa tgctgttagg acgttctgtt actttgaaga agatgcctct tgaacagttt 7380 gtttctttta actttgtcaa tgtccaatgc ttatcatatt ggttatacaa tgactgataa 7440 tgcacttaca gtgttaaatt aaagttaagt taaacaccga ctctgttata agtgtaaata 7500 tcaatttgta atgaggaaac taagatattg ttttgaattc agctatgttc tataatccgt 7560 atcaaaagga acactaacac gggaacaaaa gttcctaaca ttcagttgat agacaatgtc 7620 tgaataaggc ctacctccag agactttccg ggactaaacg tcactttgac cctaaaactc 7680 tacaagcatt tacttgatag gcaacacata tgcctacctt aatatgaagt tgagttagaa 7740 tcctgatttt cttagcgtag ttttacttag tcacgtagcg tactttgaac atggctaaaa 7800 ttcaaaattt ttgtcgggca caaaaagtga cctaatttca gggacaacca cgtgactaaa 7860 cgtcatgact cgaaactcta caagcatcta tttgatgaag aatacaaaga tgtacatttg 7920 atcagaagtt gaatcagaat tcttcttcat ttcgtcacgt agcgttcttt gaaaattcac 7980 cattttcttc aaatgaaaaa agtggcctac tttcagggac agccacgtga cttaaattca 8040 tgaccttatt atctacaagt attcatttga taggcaacac ctaaatttac ctttgaccag 8100 aagttgaatc agtatgttat tttgctgcgg cgtgtagcgt atgttgaaca tggctaaaat 8160 tcacaatttt cgtcaaatgc aaaaaaatgg cctactttca gggacaacca cgtgactgaa 8220 cgttatgacc agaaactata caaacattta tttcataggc aatacataga tccatctttg 8280 accagaagtt gaattagtat tctattttgc tttgtcatgt agcgtacttt gttcatggcc 8340 aaagttcaca attttcgcca aacacgaaaa gcggcctact ttaagggaca gtcaagtgac 8400 aaccaaacaa gttaccaaat tttgaaagat atcatttgaa agatgttgca atcatcaata 8460 tatggtacaa atttcaagta aaaaaaaata ccagagcagc acttatagcg ttacaaagtt 8520 tggccaaaaa ttagattttc ttgaagcttg aaacatcaca ctgttttcag gctgcccaag 8580 aacttacatg tacctctttg gaaatttcac aacaattctg aaaaagccaa aatgttggtc 8640 tattcagatc ttgttttgtt ttgatgatag attaaaagat cttgccgaga cagtgccgta 8700 aaatatgctg atttctgagg ttttgagatg cgaaaattgg ggaaaatggc cagaaaactt 8760 tgtttgcccg ttaccatggc aacaaggagt tttgatctaa aagaaagcta ttttgagggg 8820 ttgtggacca agtaccatca ctgtgcaaaa tatgagccaa atcgattttt tgacccctgc 8880 caccattgtt tgatccttac agacatttgc tc 8912 // ID L1-3_CQ repbase; DNA; INV; 4517 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4517 RA Kojima K.K. and Jurka J.; RT "L1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 133-133 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 156..1226 FT /product="L1-3_CQ_1p" FT /translation="MSSTIRRVNTFKIDFASLPKKPRFEEIHHFVKTTLGI FT PKEKIERLQVNHYFWCVFVKCVDLATAQQTVLQHNNKHAFEIDGKKYTIKI FT QMEDGAVDVRLHDLPEEITNDQVKKFMSAYGEILSVRELVWEEKYELAGLK FT TGVREVKMILKHQIKSFISIEGQHTYVTYIGQQTTCRHCGEYSHSGIPCTQ FT NKKLLLQKVSVNERLKDAKKPTGKSSYADALMTKPSPRFVPAVLVEDNTGM FT DCDGDNTHTDAENSEITTEGNGSSGGGSGFAAEPGIIIQDGGSSGGASGLA FT DMLQKTTPILQETIIRDGDQLCVFKTPLTLPQDGDGAEESDGSSTSSASGK FT RPRGRPPKKPKTST" FT CDS 1263..4457 FT /product="L1-3_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MDYQSFNIGTLNINTISNQNKINALHNFTRTHDLDIL FT FLQEVYSDELIIPGYTVICNVDHNRRGTAVALKHHIKYTNVEKSVDSRVIA FT LRLQGSVTLVNVYAPSGSQQRTHRETFFNTTLAHFLRHQTNYTILAGDFNS FT VINARDATGEGNHSLALKNTVQQTRLKDVWEALHGTRVEHTYITHNSASRI FT DRVYVSSNLRDQLRTAYVSACSFTNHMSLTTRLCLPNQGRAHGMGFWSLRP FT YVLTDETIEEFQQKWNYWTRQRRSYNSWMQWWLRLAKPKMKSFFRWKSKEA FT FDDFTRKQQHLYAQLHQAYRAYLHNPDMLTSINRIKSQLLLLQKQFTQLFV FT RINDTHVAGEPLSTFQLGERTRKRTIIESMENDAGASLESSDEIQEHVVNY FT YRNLYAEGETRANPEFDCARVVPPDSISNHACMEEITTAEIYEAIKFSSSK FT KSPGSDGLPKEFYVRTFDIIHRELNLLLNEALRGDIPEEFVNGVIVLVKKK FT QAGNSITAYRPISLLNYDYKLLARILKARVDRVLREHDIISPVQKCSNGDR FT NIYQATLSLKDRIAQLRSQRLSGKLVSFDLSSAFDRVDRGFLYHTMNSLGL FT NPDLVNLLERIGERSTSRILINGHLSAAFPIQRSVRQGDPLAMHLFAIYLH FT PLVARMERICDGPHDLVVAYADDISMITTNADKAEQVRGLFRDFEGCSGAM FT LNLRKTTAIDVGHINDRNRLALPWIQSSESVKVLGVIYLNSLRQMINKNWD FT TLITKMGQLIWLHRMRSLTLHQKVTLLNVFITSKMWYLSSVLAPYKVHIAK FT ITSIMGSYLWHGHQARVPMHQLALPIEGGGLKLHLPYFKCRALLTNRHLQE FT SNSTPFYKTFMQRIQNPPDLHRIPTDCPCLKVVCQEIAYLPEQLINNATAT FT GVHTNLLQQVATPKVMSSDPTLDWKLIWRNINSKKLKSAERSWYFLLVNKK FT TTHGELLHRMNIVNTPNCAHCGAQREDLQHKFSTCSRVAGAWRTLQRHLAA FT AGMNGTTSFNSLAQPELRRLRRDVKHKVMKLFINYINFINQCNNQIDVDSL FT NFHLENEI" XX SQ Sequence 4517 BP; 1285 A; 1253 C; 1052 G; 927 T; 0 other; agtcagcgtt cacctctcag agagagcaga cgcatagtgc agtctttaga aaaatcgtgc 60 gatccgaaag actgtaatcg gcttggtttt ttgctgtccc gtacctacgg gctaagtttt 120 tttgcaaaaa gcaccgacct gctacatcta gcaagatgag ctcgacaata cgccgcgtaa 180 acacgttcaa aatcgacttc gctagtcttc caaaaaaacc gcgttttgaa gaaatccatc 240 acttcgtcaa gacgaccttg gggattccca aagaaaagat cgaacgactc caagtcaacc 300 actacttctg gtgtgtgttt gtgaagtgtg tggatctcgc cacagcccag caaacagtcc 360 tgcagcacaa caacaagcac gcgttcgaga ttgatgggaa gaagtacacc atcaaaatcc 420 agatggaaga tggggccgtt gacgtccgcc tacacgacct acccgaagag atcaccaacg 480 atcaagtcaa gaaattcatg tctgcgtatg gcgagatcct gtccgtgcgc gagctggtgt 540 gggaggagaa atatgagctc gcaggactca agacgggcgt tcgcgaggtg aagatgatct 600 tgaagcacca aatcaagtca ttcatctcga tcgagggcca acacacatac gtcacataca 660 tcgggcaaca gaccacctgt cgtcactgcg gcgaatacag tcacagtggc atcccctgta 720 cgcagaacaa gaagctgttg ttgcagaagg tgagcgtcaa cgagcgcctc aaagatgcca 780 agaagccaac cggcaagtcg tcctacgctg atgccctcat gaccaaacct tcaccccgtt 840 tcgtgccggc agttctcgtt gaggacaaca ccggcatgga ctgtgatggc gacaacacac 900 acaccgatgc agagaattcc gagataacca ccgaagggaa tggtagcagt ggcggaggca 960 gcggttttgc tgcggagccc gggatcatca ttcaagacgg cgggagcagc ggcggcgcca 1020 gcggcctcgc tgatatgctc cagaagacga caccgatcct gcaggagacc ataatccgcg 1080 acggcgacca actatgcgtg ttcaagacac cactcacact tccacaagac ggtgacggcg 1140 cagaggaatc cgatggatct tctacgtcat ctgcatcggg taagcgtccg agaggccgtc 1200 ccccgaagaa accaaagacc agcacgtaaa caccaacaca cccctaattt ccccttaaca 1260 caatggatta tcaaagcttt aatattggta cacttaacat caacactatc tccaatcaaa 1320 acaaaattaa cgccctacac aacttcacac gcacacacga tctcgacata ctcttcctgc 1380 aagaagtgta cagcgacgag cttattatcc ctgggtacac agtcatatgc aatgtcgatc 1440 acaatcggag gggcacagca gtggctctca aacaccacat caagtacacg aacgtcgaaa 1500 agagcgtgga ctccagagta atcgccctac gcctgcaagg gtcggttaca ctggtgaatg 1560 tgtacgcgcc ctctggaagc caacaacgta cacacaggga aaccttcttt aacaccacac 1620 tagctcactt cttgcgacac caaaccaact acactatctt agccggggac tttaactcgg 1680 tgatcaacgc aagagacgct acgggagagg gaaaccacag ccttgctcta aaaaacactg 1740 tccaacagac tcgactgaag gatgtatggg aagcactaca tgggactcgg gtcgaacaca 1800 cttacataac acacaactcg gcctctcgca ttgatcgtgt gtacgtgagc tcgaacctgc 1860 gggatcagct gcgtacagcc tacgtgagtg cctgctcatt cacaaaccac atgtccctga 1920 caacccgcct gtgcttaccg aaccaaggtc gagcgcatgg tatgggtttc tggtcactcc 1980 ggccgtacgt cttgacagac gaaacgatcg aggagtttca acaaaaatgg aactattgga 2040 cgcgacaacg ccgtagctac aactcctgga tgcagtggtg gctgcgcctc gcgaaaccaa 2100 aaatgaagtc cttctttaga tggaagtcta aggaggcgtt tgatgacttc actcgaaaac 2160 agcagcatct ctacgcgcag cttcaccagg cgtatcgcgc gtacctgcac aatccggaca 2220 tgctgacctc catcaaccgc atcaagtccc aattgcttct actacagaag cagttcactc 2280 agctgtttgt gcgtatcaac gatactcacg tagctggcga gccgttgtca accttccagc 2340 ttggagagag aacgaggaaa cgcacgatca tcgagagcat ggagaacgac gctggcgcct 2400 ctctggagag ttcagatgag atccaagagc acgtggtcaa ctactaccga aacctgtacg 2460 cagaaggaga gacgcgagca aatcccgaat tcgactgcgc tagggtggtt ccaccagaca 2520 gcatctctaa tcatgcctgc atggaggaaa tcacaactgc ggaaatttac gaggcgatca 2580 aattcagctc gtcgaagaaa tccccagggt ccgacggtct tcccaaagag ttctacgtac 2640 gcacattcga catcattcac cgagagctga atctgctgct gaacgaggcg cttcgtggtg 2700 acatccccga agagttcgtg aacggtgtga tcgtcctggt gaagaaaaaa caagctggaa 2760 actccatcac tgcataccgc cctatatcgc tgctcaacta cgactacaag ttgctggcca 2820 gaatcctgaa agcgagagtc gatcgtgttt tgcgcgaaca cgacataatc agccctgtcc 2880 agaaatgctc gaacggagac cgaaacatct accaagcaac tctctcgctc aaagaccgca 2940 tcgcacaact acgcagccaa cgactctctg ggaagttggt atcgttcgac ctctccagtg 3000 ccttcgaccg tgttgataga ggtttcctct atcacacaat gaactcgctt ggtctcaacc 3060 ccgatttggt gaatctgcta gagaggatcg gagagcgctc cacctctcgg attctcatca 3120 acggacacct ctctgctgcc tttcctatcc agagatcggt tagacagggt gaccctctcg 3180 caatgcactt gtttgctatc tacctgcatc cactggtagc gagaatggag cgtatctgtg 3240 acggacctca cgatcttgtc gtcgcctacg ccgacgacat cagcatgatc accaccaacg 3300 ctgacaaagc tgagcaggtt aggggtctct ttcgtgactt cgaaggatgc tccggtgcga 3360 tgctcaacct acggaagacg acggccatcg atgtgggtca cataaacgat cgaaacagac 3420 ttgcgctgcc gtggatacaa tcgagcgaat cggttaaggt cctcggagtg atctacctga 3480 actctctacg ccagatgatc aacaagaact gggacacctt gatcacaaaa atgggacagc 3540 tgatctggct tcacaggatg cgcagtttaa cactgcatca aaaagtcacg ctactgaacg 3600 ttttcatcac atcgaagatg tggtatctct cctcggtact tgcaccctac aaagttcaca 3660 tcgccaagat tacctccatt atgggcagct acctgtggca cggccaccag gcaagggtgc 3720 caatgcatca actggcgctc cccatcgagg gaggcggtct caagctccac ttgccttact 3780 tcaaatgtcg agcactgctg accaacagac acctacagga gagcaacagc acaccgttct 3840 acaagacctt catgcagcgc atccagaacc ctccggacct gcacaggata ccaactgact 3900 gcccatgtct caaggtagtc tgccaggaga ttgcctacct gccggaacag ctgatcaaca 3960 acgccactgc gacgggtgtt cacacgaacc tccttcagca agtggcaaca ccaaaagtca 4020 tgtcgagcga tcctaccctg gactggaagc tgatatggcg caacatcaac tccaagaaat 4080 taaaatcagc tgagagaagc tggtacttcc tactggtgaa caagaaaacc actcacgggg 4140 agctgctcca tcggatgaac atcgtcaaca caccgaactg cgcacactgt ggtgcgcaac 4200 gagaagacct acaacacaaa ttcagcacct gttccagagt ggccggtgcg tggagaacgc 4260 tccaacgcca ccttgcagcg gcggggatga atggcaccac cagcttcaac agccttgctc 4320 aaccagaact acgaagatta cgacgagatg tgaaacacaa agtgatgaaa cttttcatca 4380 attacatcaa ctttattaat caatgtaata atcagattga tgtagattcc cttaattttc 4440 acttggaaaa tgaaatctag aattcatcat gtaattagct tttacaaatg actgaataaa 4500 tattttataa aaaaaaa 4517 // ID Gypsy-178_AA-LTR repbase; DNA; INV; 1286 BP. XX AC AAGE02025061; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-178_AA_; KW Gypsy-178_AA-I; Gypsy-178_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1286 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025061; Positions 52359 51074. XX SQ Sequence 1286 BP; 378 A; 308 C; 275 G; 325 T; 0 other; tgtaaccgtt tgttacaaat tgtcagtctt tttttaattc gtgctatttt ttatttgtat 60 actgtcacgt ttctagtgtt tagtggtgtt ggatttgtat agggttagta aataaaaggt 120 tagtaaatgt tatcaatgtc aaattgtata aaatatgatt tttagtaagt tagaagaatt 180 gttaaatttg aattacaaat atgaaaaagt tgcatgcaag ctgtgggtac aagacaaaca 240 taaacgacaa gtgggccagc aggcgtatta cacgcaacag atggtaacaa ccccccaacc 300 cacaattgag gaaattgggg aggggaggaa aaacagacaa gagggtcaag ccaaattcta 360 gaatcgccat cacttcactc tgaaaaaggc gtgcgtgtgc tcaagtgcat ttttgaaagt 420 gtcctcaaag ttataattaa agtgagtggt aaccttacaa attgggagag aaagtgtatt 480 tcatcaggta tgattgaacc ttaaatctat tctaatctaa tgcctataac caatggctag 540 aaagttagta cagctagcta actactccca ttaccctcac aggacctttt ttatcttacc 600 gtaaaaattt cgccatcttg gctggtgtgg cgtggcctac cagcagcaca cgtttcggcc 660 tagtagacgg aagtctgcta aacctaaccc gtcaaggacc atcgctgact tctggtcatt 720 tcggtcataa tcgtaaagga ggtccacagc tcggggtaat cacccctaac cgtataggag 780 ctttcccctc tcggcacggt cactccggta ccgtcctggg ccgtgtcgat tgccgtcaaa 840 gagggaggac gcctaataaa gccaaacccc tacgacacta cgtctacggt gtcgtcgtta 900 ccacctgtgc agccgcagaa tatcccagcc tgttagcagc atcagcgagc gctcgcagga 960 catcaggaga tcgtcggcag caaccagaag cagcaagtcg taccggtacg tgaaaaccca 1020 ttaacaccca cacccccaca agaacaatag cttgcaataa accactagaa gataacccta 1080 aatcccacaa tagctcagtt tgcccaataa aaccctagag attcccacca agttagaatt 1140 ctttcattcc acacactaca atcgccctag tcaggtgacg gtacgctgca cgccttggtc 1200 gcgtagcccc tcagcttacc cagtagcaag ccgaccctga gacccagctc gaggggtctc 1260 ttgttactga gagattgccc ggttca 1286 // ID Academ-1_CSa repbase; DNA; INV; 7186 BP. XX AC . XX DT 27-JUL-2010 (Rel. 15.08, Created) DT 27-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE Academ-type DNA transposon from Ciona savignyi. XX KW Academ; DNA transposon; Transposable Element; Academ-1_CSa. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-7186 RA Kojima K.K. and Jurka J.; RT "Academ-type DNA transposon from sea squirt."; RL Repbase Reports 10(8), 1069-1069 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX FH Key Location/Qualifiers FT CDS 1272..5408 FT /product="Academ-1_CSa_1p" FT /note="transposase." FT /translation="MSQHQTLFIVQNTDAEDVSKCYICKESTEVESQLQKV FT NVKTWKTIRNAAALRKNLKSDKYFDITRNILASDDSSTLSYHPSCHKCYTA FT VKRPKEPTSLNDEPAAKITCITTRCSNVFPKCDRQGMLKGSCIFCGKSRKK FT KNRKEEPRLKIATVAGCESLCERASVSKNERIKSIIRSGVDLIAKEAEYHK FT SCRLAFLRESDTKSKPTEEKSSLSFHKKAFKCLCSFIETEIIERRRCILVA FT DLLAMYKDEYNSCGGDRADIRTYTTQNFIRKVKDFFKEEIKIILADQRKGN FT FIHSSSLSKEDARSRLHKDAKRYEEDKKLRCAALYLRSLIMQLPKTKNPNP FT ATVQNLKECSSEIPEQVDLFFRSLLGGVTPSFSGTRRETIDRKVTSMASDV FT IFNVTNGTVKPWKHTAMGLGLASLTGSKLVMQILNRSGHCISYSEVKGLET FT ELAYSVEGDTHDAPDGICLDPNLATACVWDNNDANIETLDGKGTLHATVGH FT TYQNVIQGDQHTNTTIVAFREVRHRRSFVGNEREIPPFGKSIHKANFLSLT FT TEAHVDTDVSSTQLESDDGYNIRLNLLDLYWFWSSREDSTPLHAGFMSNYI FT QDKLPLQRICYMDPISKSPTNNDVVRETMIRTMNVAKETGQDFAVVTYDLA FT VAIKAYSIQAIQTPIFDKLLIMLGSFHIELAFYGAVGTLINESGIEYVLTE FT ADVLAEGSMVGFIKGRFYNRCTRIHELLANVLEQKMYKRFLLEISQDDFDS FT FQQVMSTVPLDPILAKEHLSHPVVTQHLRIYENYFQTILDGNLGSTAQYWA FT IYIYLINRLHRELQRCVKTNDVDGYVNVFPQMLDVFFALNRPNYARWGTLF FT LRKLKSLDPKVREILQNGAFSIRRTSKDYSRSAVDLSLEQTVNRDSASTMK FT GIVSFRNSVNAMRRWSLTMTQRSMAVTELRRLTGLEFGESAAAQCRPSRIK FT KDNCHMAALSAKVDLFGNPFAEDAPISLVNIATGHVASKDTESYLTNTLRR FT GQDKKEKFQHEWNSNSNRFLQPVKRTPVQNFASQNLKQKAKVPSSQKAKTT FT AEGLRDMFLRMIVVVAEKTAFDLRNVISYPITTYPLSLAHCDGAPVKTDKS FT ALLKKLESLQTEIFTEADLPTTYLQVYDGGLLLHSVLSQTNIGASYASIAH FT NILSVVFSGRANEAHVCLDKYLECSIKESERRLRGAVGSAYVITGPNQTIR FT QSGQKLLTNGKFKNELAKFLLQEWGKDYNWHLFSGKTLIASYGGECFQYVP FT DRQHHINVTNPVHLQADHEEADTLIAFHIAKTTITGHVIVRASDTDVLVIL FT IGALGAQRQEVRCRANIVMDCGMGKSRRFINVTNIVHVLDERQPGLSRALP FT GYHAFTGCDFTSAFYR" XX SQ Sequence 7186 BP; 2191 A; 1495 C; 1565 G; 1930 T; 5 other; tagtmcggga gcccgtaggg gtcaacatag ctggaaaagt aaccaatttt cttctatata 60 gcaatgatct atgtatattt ccatkaaaga aaatggacgt ctgccgggca cattgtggtg 120 gatgtggcca ggggggcgta aagaggagac ccacaaattg agctagatta gctccaaagg 180 taacccgacg aaacatattg gaaaatgctc agcatatccc atggtatcat gaatggctat 240 tgccagacat ttcaagtggt gggaagtgcc cgccagaccc acataactag accccaaaat 300 tgagctagat tagctcatga ggtaacccga tgaaactaaa tgaaaaatgt tcgtcatgat 360 ctacggtatc gtaaatcgcc aatgccagac gatttaagtg gtggactgtg cctgccagac 420 gccctcaaac gccccatgtt ccaaaaattg agctagatta gcttttgagg taacctcacg 480 aaacatattg gaaaatgttc ggcatatccc agggtatcat gaatggctat tgccagacat 540 ttcaagtggt gggaagtgcc cgccagaccc atataactag accccaaaat tgagctagat 600 tagctcatga ggtaacccga tgaaactaaa tgaaaaatgt tcgtcatggt ctacggtatc 660 gtaaatcgcc aacgccagac gatttaagtg gtggactctg cctgccagac gcccttaaat 720 gtcccacgtt tcaaaaattg agctaaatta gcacaaaagg taacctcacg aaacatatta 780 gaaaatgtct gacatagccc agggtataat aaatgaacta ttgcaagaca tttcaggtgg 840 ttgggtgtgc ccgacagacc cacataccta gtagcctatc ccaaagttgt tgtagaaccc 900 gatgaaacca aataaaaaat gttgagcagg atataaagca tcataaatct ccagacattt 960 ttagtgatgt aatgtgcccg gcatacatat cctaacataa catgaaccaa gaacagcttg 1020 ggggtcaact tgtataatta ctaaaaacgc tcccctatgt aaacgattta cttatctgca 1080 tcggacaatt tcccgcccgt agtcatccgc cattatgaca taataactgc actatttttt 1140 gttatcgatt tcttgtctca gctttagcgg tagtaagatg tgtttttcaa ctagttaagt 1200 ttgtcgtaat tgttgtagct cacagtaagg ctaagctttg tatcaaatgg tcgtagaaac 1260 agttgattgt tatgtcacag catcagactc tatttatagt ccagaacact gacgccgaag 1320 acgtatcaaa gtgctacatt tgtaaggagt caacagaagt ggaaagtcag cttcaaaagg 1380 taaatgtgaa aacatggaaa accattagaa atgctgctgc tttgcgtaaa aatttaaaat 1440 cggacaagta ctttgatatc acacgaaaca ttttggcaag tgatgatagt tctactctta 1500 gttatcatcc ttcttgtcac aagtgctaca cagcagtaaa acgtccgaaa gaacccactt 1560 cacttaacga tgaaccagct gcaaaaatca cttgcatcac gacacggtgt agtaatgtct 1620 ttccaaagtg cgataggcag ggcatgttga aagggtcttg tattttctgt ggcaaaagcc 1680 gcaaaaagaa aaataggaaa gaagaacctc gactaaaaat agcgacggtt gccggttgtg 1740 aatcactttg tgaacgggcg agtgtctcta aaaatgaacg aatcaagagt ataattcgaa 1800 gtggagtaga cctaatcgcc aaagaagcag aataccacaa gtcatgtcgc cttgcgttcc 1860 taagagaatc tgacactaaa tctaagccta ctgaagaaaa atcatctctc tcgttccaca 1920 agaaagcatt taaatgcctg tgttcattta tcgagactga gatcatagaa aggcgacggt 1980 gtatattagt ggctgattta cttgccatgt acaaagatga gtataatagt tgtggaggag 2040 atagggcaga tattcgaacc tacacaactc aaaattttat caggaaagta aaagattttt 2100 ttaaagaaga aatcaaaata atattagcgg atcaacggaa aggcaacttt atccacagct 2160 cttctctttc aaaagaggat gccaggtctc gtttacataa ggatgcaaaa cggtatgaag 2220 aggataaaaa gttaagatgt gccgctctct accttcgatc gctaattatg caacttccaa 2280 agaccaagaa tcctaatcct gcaactgtcc aaaacttgaa agaatgctct tcagaaatac 2340 cagaacaagt tgacttgttt ttcagaagtc ttcttggtgg tgtgacacca agttttagtg 2400 gcacacgtag ggaaacgatc gacagaaaag tcacttcaat ggcatcggat gtcattttta 2460 acgtcactaa cggcaccgta aagccatgga agcacacagc tatgggcctc gggcttgcat 2520 ctctcacagg ttcaaagctg gtgatgcaaa ttctgaacag atcaggacac tgcataagct 2580 acagtgaggt aaagggcctt gaaactgaat tggcgtattc agttgaaggt gacacacatg 2640 atgcgccaga tgggatttgc cttgatccaa acttagcaac agcctgtgtt tgggataaca 2700 atgacgcaaa catagaaact ttagatggaa aaggaacatt gcacgcgact gttggccaca 2760 cgtaccagaa tgtcattcaa ggtgaccaac ataccaatac taccatagtt gcgtttcgag 2820 aggttagaca ccgacgcagt tttgtaggaa atgaacgcga gatacccccg ttcgggaaat 2880 ctatacacaa agccaatttc cttagtctta ccacagaagc tcatgttgat accgatgtat 2940 ccagtacaca actagaaagc gatgacggct acaacattcg gttaaattta ttggacttat 3000 attggttctg gagttcgagg gaagacagta cacctttaca tgctggattc atgagcaatt 3060 acattcagga taaattaccc cttcagcgaa tttgttatat ggatccgata tcgaaatcac 3120 caaccaacaa tgatgtagtg agggagacca tgatccggac catgaacgtt gcaaaggaga 3180 ctggccagga ttttgcagtt gttacttacg acttggccgt tgcaataaaa gcctactcca 3240 tacaagccat ccaaaccccc atattcgaca aactgttgat aatgttaggg agctttcaca 3300 ttgaattagc tttctacgga gcagttggta cacttataaa tgaaagtgga attgaatatg 3360 ttctcaccga agccgacgtt ttggcggaag gttcaatggt aggatttatc aaggggaggt 3420 tttacaacag atgcacgcgt attcacgagc tgcttgctaa tgtcttagaa cagaaaatgt 3480 acaaacgatt tcttcttgaa atatcccaag acgactttga ttccttccaa caagttatgt 3540 ctactgtacc tttggaccca atactagcaa aagaacatct atcgcatcca gttgttaccc 3600 agcatctgag gatttatgag aattactttc aaacgattct cgacggaaat cttggatcta 3660 cagcacaata ctgggccatt tacatctatc tgatcaatcg ccttcacaga gaactgcaga 3720 gatgtgtaaa gaccaatgac gtcgacggat atgtaaatgt atttccacag atgttggatg 3780 tgttcttcgc cctgaaccgt ccaaactacg ccagatgggg cacacttttt ctccggaagt 3840 tgaagagttt agacccaaag gtgcgcgaga tattacaaaa tggtgcgttt tccatcagac 3900 gcacatcaaa agattatagc agatcagctg ttgatctctc tttggagcag acagttaatc 3960 gtgattctgc atccacaatg aaaggaatcg tttcctttcg aaattctgtg aacgcaatgc 4020 gccgttggtc attgactatg acacagcggt ctatggcagt cacagagctc agaagattaa 4080 ccggacttga atttggggaa agtgcagcag ctcaatgccg tccttcaaga ataaaaaagg 4140 acaactgtca tatggcggca ctgagtgcaa aagttgacct attcggaaat ccatttgcag 4200 aagatgctcc aatttcttta gtcaatattg ctaccggtca tgtagcatct aaagataccg 4260 aatcatatct aaccaatacg ctgagaagag gacaggataa aaaagagaaa tttcaacatg 4320 aatggaactc caacagcaat cgatttcttc aacctgtgaa acgcacgcct gtacaaaact 4380 tcgcttctca gaacttgaag caaaaggcca aagtaccatc ctcgcagaag gcaaaaacaa 4440 ctgctgaggg actgagagac atgtttctgc gtatgattgt agttgtagca gagaagacag 4500 catttgacct acggaatgtt atatcctatc caattaccac atatccattg tcccttgcac 4560 actgtgacgg ggcgcctgtg aaaaccgaca aatcagcctt gctgaaaaag ctcgaatcac 4620 ttcagacaga aatattcacc gaagcggatt taccgacgac ctatttacag gtatatgacg 4680 gaggacttct tttgcattct gttctttcgc aaacgaacat tggagcatca tatgcatcaa 4740 ttgcgcacaa tattctctca gtggtgtttt ccgggagggc caatgaggca catgtttgtc 4800 tcgacaagta ccttgagtgc tccattaaag aaagtgagag aaggttgcga ggtgcagttg 4860 gttcagcata tgtgattacc ggaccaaatc agactatcag gcagagcgga caaaagcttc 4920 tcactaatgg aaaattcaaa aacgaactag ccaagtttct tttacaagag tgggggaaag 4980 attacaactg gcacctgttt agtgggaaga ccttgattgc ttcctatggt ggcgagtgct 5040 tccaatacgt tcccgacaga caacatcata tcaatgtaac taatccagta caccttcaag 5100 ccgaccatga agaggcggat acgctgattg catttcatat tgcgaaaaca acaataacag 5160 gacatgttat agtgcgggca tccgacacag acgtcttggt gatattaatt ggtgctcttg 5220 gagcgcaacg ccaagaagtc cggtgtagag ccaacattgt aatggattgt ggaatgggaa 5280 aaagcaggag gtttatcaac gtgactaaca ttgttcatgt tcttgatgag cgccaacctg 5340 gactttcaag agcactccct ggataccatg catttactgg atgtgatttc acttcggcct 5400 tttacaggta agcacaaaat gtttgtatga gagcatggct taatttacaa tttgttcttt 5460 ttgtttagga aaggcaaaac taaaccattt gatctccttg agagtgacaa aagtggttgt 5520 tacgttaatc tttttatcgg tatgggagag gtgcagcgaa ttgattttga tgtcgcatcc 5580 gagtttgttt gtcgcatgta tggacagagt gaaattcgtg atgtcaatga agcacgatac 5640 aacaaacttc tacaaatgaa cggcaaactt gatcaggtaa ttgcggtaca aaatgcgtta 5700 atttcatatc accggttctt aaatatcgta cgacaaatgt aacaacacgg gttcattgac 5760 ttttagggaa atccactggc aaacattaaa cgagtcgact gtgctttact accaccctgc 5820 atccgaactc tcgagatgaa gatccaacgg acgaagtaca ttgcaggatt gtggatgcgt 5880 gctgcgacag catctcccac gaatgggtta gttccaaccg actatggatg gtgtgtggag 5940 aacggtattt ttgtgccaat ctggtttcag ggtcctagta tacccgacag gttgtttgaa 6000 gatagaaatg cagaagatat tgcagaaacc gacagtcagt taataataga aataactgac 6060 gaagcgtgga gcgaagactc ggactcggat gcggatgagg aagaagaaca ggaatgattt 6120 agtcaatgac ggaacattag cctacgttgt tttgtgaaca tcaataaata cttcgttacg 6180 ctgtttggct atcaaattat taaattcaca tatcactttt gcgatttacg atgccttaaa 6240 acctgcccaa catttttatt tggtttcatc gggttaccgc atgtgcaaat ctagctcaat 6300 tttggggtag tctactagtt atagggatct ggcgggcact acccaccacc agatatatct 6360 ggctatatcc attcatgata ccctcggctg tgccgatcat tttctagtat gtttcgtgag 6420 gttacctttc aagctaatct agctcaattt ttggaacgtg gggcatttgg gggcgtctgg 6480 caggcacagt ccaccactta aatcgtctgg cattggcgat ttacgatacc gtagaccatg 6540 acgaacattt ttcatttagt ttcatcgggt tacctcatga gctaatctag ctcaattttg 6600 gggtactagt tatgtgggtc tggcgggcac ttcccaccac ttgaaatgtc tggcaatagc 6660 cattcatgat atcttgggat atgccgaaca ttttccaata tgtttcgtga ggttacctta 6720 caagctaatc tagctcaatt tttggaacat ggggcatttg agggcgtctg gcaggcacgg 6780 tccaccactt aaatcgtctg gcattggcga tttacgatac cgtagaccat gacgaacatt 6840 tttcawttag tttcatcggg ttacctcatg agctaatcta gctcaatttt ggggtactag 6900 ttatgtgggt ctggcgggca cttcccacca cttgaaatgt ctggcaatag ccattcatga 6960 taccatggga tatgccgagc attttccaat atgtttcgtc gggttacctt tggagctaat 7020 ctagctcaat ttgtgggtct cctctttacg cccccctgac cacatccacc acaatgtgcc 7080 cggcagacgt ccattttctt tmatggaaat atacatagat cattgctata tagaagaaaa 7140 ttggttactt ttccagctat gttgacccct acgggctccc gkacta 7186 // ID R2B_TM repbase; DNA; INV; 1467 BP. XX AC AF015822; XX DT 26-AUG-1999 (Rel. 4.07, Created) DT 26-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Tenebrio molitor retrotransposon R2 reverse transcriptase gene, DE partial cds. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2B_TM; R2_TM. XX OS Tenebrio molitor OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tenebrio. XX RN [1] RP 1-1467 RA Burke D.W., Malik S.H. and Eickbush H.T.; RT "R1 and R2 Provide an Estimate of the Age and Stability of RT Retrotransposons."; RL Unpublished. XX RN [2] RP 1-1467 RA Burke D.W. and Eickbush H.T.; RT "R2B_TM."; RL Direct Submission to Genbank (24-JUL-1997)Biology, Univ. of RL Rochester, 135 Superior Road, Rochester, NY 14625, USA. XX DR GenBank; AF015822; Positions 1 1467. XX SQ Sequence 1467 BP; 377 A; 337 C; 373 G; 380 T; 0 other; gccttcgccg acgatttggt gctccttgct gacagtgcca aggacgccaa taggctgtta 60 tcggaggcct cgcagttctt tgcggatcgg ggactcgaat tgaacgttgc caagtgctgt 120 gccttgtcaa caggtgtagt tccgtccaaa aagaagtgtt cagctcgact tagatcttat 180 tccaggctga acggtgccca catccgtcaa gtggcgattg gcgatgtctt caagtatctg 240 ggcaatcact tcgctttcga tggcgttagt gacataagct tggaggaact aaactctcaa 300 attaataatt tgatgaaagc cccgttaaac tctcaaatta ataatttgat gaaagccccg 360 ttaaagcctt ggcagaagtt caaaattctg aggcaaacat ctgataccat cgggtggata 420 cactgccttc agagcccttc tgtttcgaat aaagtcctaa gggaggctga caggaaaatt 480 aagagagcgg ttaaatcgat tcttcactta ccggttacta ttgccgactc gtcgatctat 540 gcaagccagc gcgatggtgg tctcggggtc ttctgcttct caaggaagat tcctgtcatt 600 cttcgctcgc ggtgggagtc gctacaggag atggctgatc ccgcgcttgg cgctgttctc 660 ctgcaatcgg agagatgcat ggatagagtc tcgagactaa ttaaagagga ctggagatct 720 gacagcgaaa tcaagaacag ttttaggcgg agccttgaga attcttggtg cgggggtgga 780 atccatcaag tagggataat caagcttgca agtaaatatc tgctgcaccc ccctgccttt 840 tggacaggaa gggactatgt gtccacaatt cagctgaggc tgaacgcgct tccttccaga 900 ggtttaccat ccaacccccc cgaccagcgt atgtgtagga ctggatgtgg cagatctgag 960 tccctatcac acatcttaca gagatgtgga ttcgttcagg gccatcgaat atcacgccac 1020 aaccatgtcg caaggaaaat taggaggttg gccgagtcta aggggtggga agttagggag 1080 gagcctacta tccgcaccaa tgaacaggtt ttcaaaccgg atctacttct gattcgcgat 1140 aatgaattaa tagtgtgcga tgtctccatt aactgggaag gtccgtccca ctttccacac 1200 actatcagga taaatttaat aagtatgcca ctccaggtgt tatccaatgg ctgcaacaac 1260 gattttcccc gacaaactgt cagattcctt ccctttatat tgggtgctag gggatcttgg 1320 tgtggcgaca acaggagttt agttgacgcg cttgaactca cacagcgcaa tatagaagat 1380 ctaatactta catgtctcac tgggagcctt atcatctata agagcttcat gaagtcggtc 1440 tggaggagga ctgtcgcggg tgactaa 1467 // ID Gypsy-2_DVir-I repbase; DNA; INV; 7177 BP. XX AC scaffold_7526; XX DT 07-MAR-2011 (Rel. 16.03, Created) DT 07-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DVir_; KW Gypsy-2_DVir-LTR; Gypsy-2_DVir-I. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-7177 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (07-MAR-2011). XX DR Genome; scaffold_7526; Positions 7811 635. XX CC Positions [4653-5129] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3036..5504 FT /product="Gypsy-2_DVir-I_1p" FT /translation="MIKLGVIEEAPYSAWSSPAILIVKPGKVRFCLDCRKV FT NEVTSKDAYPIPNLDGLISRLPPVHCVSKIDLKDAFWQICLDESSRAKTAF FT TIPNRPLYQFKRMPFGLCNAPQTMSRLMDQVIPYDMKGHVMVYLDDLLVLS FT QSFDEHLIHLAEIATRLRRAGLTINIRKSNFCLTKVEYLGYIVGEGTLQPN FT PAKVQAVADFPVPSTKRQLRRFLGMTGWYQRFIPQYSSIIFHLTELLKGTS FT FAWNDNAQISFEEIKRKLSSAPFLINANYEKPFIVQCDASMHGVGGLLAQC FT DESGIERPIAYMSKKLNKAQRNYSVTELECLAVVLAVRKFRMYIEGHDFKV FT VTDHATLRWLMNQKDLSGRLARWSFKLQGYNFSIEHRKGSENVVADTLSRA FT FEPADEVAAVDLEVYPEIDLSSEAFRSNEYCRLKNNIIRSNLPDFRVVDDY FT IYFRNEPSASSVDENMEGWKLFVPSELRLNVMKAAHDQPTSAHCGMAKCLE FT CIRRHLYWPNMVVDVREYIRHCEFCQTSKVPSRCLKPTMGNQVVSERPFQR FT LYVDFVGPYPPSKKRNIGIFVILDHYSKFTFLKPIKRLNAKVVVDILQEDI FT FDCFGVPEIIVSDNGTQFKSLEFSSLLSKYGIQHMCTGVYSPQANAVERVN FT RSINAALRTYIRSDQSLWDVYISSINSSLRNTIHQSIGISPYFVVFGQNMI FT KHGNDYKLLKSLNMLNEGQLKLDRADEFPLIRSNIQKHMDKAYEKNLRTYN FT LRTRPRSFEVGQIVIKRNFVLSNLAKHFNAKLAPVGTKARIKRKIGNCLYL FT LEDLKGKELGHYHAKDIWECK" XX SQ Sequence 7177 BP; 2208 A; 1441 C; 1487 G; 2041 T; 0 other; tgataaaata ataagaaaaa tggcgcccaa cgtggggcct gaacttctgc tcttccgctt 60 ctcaccaccg agatccggaa ttggcccaca agaacttagg agagagccga tagcattgtt 120 cgttacatga tttgtcacga gccaatgtgg acgatcctaa gttaaatcca tgctagcttt 180 cacccttctt cctatcctta tcacaatata ccatccaaat aattttagtt tccgtgacga 240 catttatcct ctcttggaac tgctggtatt ctgatgtctt ttgttggagc cagaccagga 300 agtaaatagt cgagcgattt atttagtgga tcagttatac tgttccacta gaaatcttat 360 tatgttttcg ccacttttga ttcatttagg acttttagat ctgatttcga gtcggttctc 420 taccaagtga tagaaggttg gtagggcata ataacattcc gtttgttcta gttgaattga 480 gaattgttct ggttgggtta tttcgactcg tccctcttca actttgttca gacaactcaa 540 cactgattta gtaaaatagt ttgcttgctt ttcgtaagaa gatatatcaa acatagagat 600 ttagattatg cgagaaattt acctaaggat aagtcccatg ttataaaatg ttataatatt 660 gtattagcat cgaagtgcct taaacctttt cattctgcct taaaaccgtt tccgattttc 720 ataattatcg ctcattagcg cttctcgtat atatcttaaa attaaaatcc tttcgggagc 780 aattcccatt taactctttg accaattata atcgatttat tgatgatggc tgtcgttcgt 840 agtcaggaaa atttaactct cagcgaagca cctgcagtgt gtactatctg caatgaatca 900 tcgattggtg atgcagacac tgtagaaact ccatgtgctc ataaatttca tcgaaattgc 960 ctaaatcaat ggttaagaag caatgagaca tgtcccactt gcaaacgagt ttgtagtcga 1020 gaattatacg aagatagcag tgacttgaat gcagtagctt cacaaagacc tacaaacacg 1080 gtcccgaact attacaccca accctctaag gaatcgcttg gtttgtcaga acagcgtgtg 1140 cagttgctta tcagtaagac attggaagcc cataaggctg aaatggcctc ctatttgtcc 1200 gaagaattaa atgccgcagt tagaaattta aacttgtatc aggagccacg tgaggaaccc 1260 caattagaat gggaaaatga ttttccacgt gatgcaccgc gatcgaacaa cgcttcgaga 1320 ttaccgaatg ctagtgttca gtctgatgcg agacgtgaat acgcagtcga taggcccgac 1380 aaaatatcga gtgtcatatc caattgggat gtcagatttt caggcgtggc tagtgatatc 1440 ccagtcgacg actttatata tcgcataaat tgattgactt cacaatgttt aaatggcaat 1500 tttaggttgt tatatcagtt tgccaatttg ctgtttaccg gaccagcttt gacatttttt 1560 tggagatatc accgtacgac tgaccaaaca ggctggttca atttgtgcac atgcttgcga 1620 gaaagatata aagaccacag atcagacgaa gacataaaag acatggtgag aagacgcaaa 1680 caacgagggt ccgaaagctt tgatgaattt ttagatacga tgcttgcctt gtcagatgcg 1740 ttacgggaac caatgtcaga ctcagagtta gtgatcaatg tcaagcgcaa cctaaaacca 1800 gaattgagac acgaactact tcatgtaaat actccgagtc ttgcgacatt acgcaaagaa 1860 tgtcacaagc atgaagcgtt ctattcgaat tacctttcca agccaatttc tcgaccaatt 1920 ataccacaac gagtagtcaa cgaggtagtc catgaagata ccacaatgac ggatcttacg 1980 cagaatctgg tagaaactgg ggacttatat gctatacatt ctccggataa attgatttgc 2040 tggaactgcg aggaaaaggg acacagatat caagaatgtt taaaagcgcg tcgtgtattt 2100 tgttacggct gtggtgccgt tgacacttac aaacccactt gtagaaagtg taaccctgtt 2160 tcggaaaact ctccggagga ctttcgtcgt ctcttaaacg cgaatgtccg tcgtcaatag 2220 cctcggatcc agactttaat aaagatacct taacacctga ttcgaccttc acaccagact 2280 tatcaaaacc gtgtcagaca aaacttaaac cttaccacgt ccgtttacac gaatatttag 2340 accgtagagc aactattttc gcttcagaac ccaaagttga atttaagcct cgttcgtcag 2400 agagactaag aaacctatgg actcagcgac gctcgctaaa taagcatcgt atttcagcga 2460 ttgatcatcg atcaaacgat gtgcgaccgt tcacgaattt agaaatattc aatcagcatt 2520 acttagcctt gttagatagt ggggctacga aaagcgtgat tggagggagc ttagcgaata 2580 aaattctcga ttcgaaaatt gactttaaca aaataaaagg cgatgtgaga accgccgacg 2640 gacaaaaaca acatgttgta ggttccattt caattcctat cgtttataac tccattgcaa 2700 ataatttcga attcttaatt gtcccgtcta tcaaggaaaa cgttctctgt gggatagatt 2760 tctggagagc gtttggcatt tcagtacatc attcgaatac attgaactca gtggatttcg 2820 aagaagtgga gtcgaatcag ttgtcgttaa cactagccca aaagaaacgc ttacaatcag 2880 taatcacttc ctttccttcg tttgaaattg atggtctagg aaagactact cttgtagaac 2940 atatcatcga tactggggac gccaagccca taaaacagag gttttatccg ttatcaccag 3000 caagggaaaa attgttgtgc gaagaagtag ataggatgat caagcttgga gtcattgaag 3060 aagctccata ttccgcatgg tcatctcccg cgatattgat tgttaagcct ggaaaagtca 3120 gattttgttt agactgccgc aaggtaaatg aagtgaccag taaggatgct tatcctatac 3180 caaatttaga tggcttgata agtagacttc ctcctgtaca ttgtgtgtcg aaaatcgatt 3240 taaaagatgc cttttggcaa atctgcttgg atgaatcatc gcgggcaaaa accgcattca 3300 ctatcccgaa caggccgcta tatcaattta agcggatgcc attcggattg tgcaatgcac 3360 cgcagaccat gagccgattg atggatcaag taattcctta cgacatgaaa ggtcatgtta 3420 tggtttattt ggatgattta cttgtactat cgcaaagttt tgatgaacat ttaattcatt 3480 tggctgagat cgcaacacga ttgcgtaggg caggtttgac cattaatatc cgtaaaagta 3540 atttttgtct aacaaaagtc gaatatcttg gttacatcgt aggtgaaggc accttgcagc 3600 ctaatccggc taaagttcaa gcagttgcag attttccagt gccgtcaaca aaaagacagc 3660 tgagacgatt cctaggaatg accggttggt atcaacggtt tatccctcag tattcgtcga 3720 ttatctttca cctaacagaa ttgctaaagg gcacgtcatt tgcttggaat gataatgctc 3780 aaatttcgtt tgaagaaatc aaaagaaaat tatcctcagc accgttcttg ataaacgcaa 3840 actatgagaa gccgtttata gtccaatgcg acgcttccat gcatggcgtg ggcggtctgt 3900 tagcccaatg tgatgagtca ggaatcgagc gacccatcgc atacatgtcg aaaaaactaa 3960 ataaagcgca gcgaaattat tccgttacgg aactcgaatg tcttgccgtt gtattggctg 4020 taagaaaatt tcgcatgtac atcgaaggac acgattttaa agtcgtgact gatcatgcaa 4080 ctttacgttg gctaatgaac caaaaagatc taagtggtag actagctaga tggtcattca 4140 aattgcaagg atataatttt tctatcgaac atcgcaaggg tagtgagaat gtggtcgcag 4200 acacgctttc ccgtgccttt gagccagcag acgaagtagc tgcggtggac ctagaggtat 4260 accctgaaat cgacttgtct tcggaggcat ttcgatccaa cgaatactgc cggctaaaaa 4320 ataatattat tcgttccaac ttgccagatt ttcgagttgt agatgattat atatatttcc 4380 gaaatgaacc ctcagctagt agtgtagatg agaacatgga gggttggaaa ttatttgtgc 4440 catctgaatt acgcttaaat gttatgaaag cagcacatga tcaaccaaca tccgcgcatt 4500 gtgggatggc caaatgtttg gaatgtattc gcagacattt gtactggccg aacatggtgg 4560 ttgacgtacg cgagtacata cgccactgcg agttctgtca aacgtcgaaa gttcctagtc 4620 gatgcctcaa accaaccatg ggaaaccaag tggtcagcga aaggcctttc caaaggttat 4680 atgtcgactt tgttggtccg tacccaccat caaaaaaacg aaacattggt atattcgtca 4740 ttctggatca ctactcaaag ttcacatttc tcaagcccat aaaaaggctc aatgccaaag 4800 tagtggtaga tatactgcaa gaagatattt ttgactgttt tggagtaccc gagataatag 4860 tgagtgacaa tggcactcaa tttaaaagcc tggaattctc ctcactcttg tcgaagtacg 4920 gcatacagca tatgtgtacc ggtgtatact cgccacaggc aaacgcggtg gaaagagtta 4980 atcgttcgat taatgctgct ttgcgcacat atattcggtc cgaccaaagc ctatgggatg 5040 tatacattag cagtataaat agttcgttac gaaacaccat acaccaatct attgggattt 5100 ctccatactt cgttgtgttt gggcaaaata tgattaagca cggcaatgat tacaagttgc 5160 tcaaaagcct gaacatgctg aatgaagggc aattaaaact cgatcgagca gacgaatttc 5220 ccttaatacg ttctaatatt caaaaacaca tggataaggc atatgaaaag aatttgcgaa 5280 cgtataactt acgcacacgc cctcgttctt ttgaagtcgg tcaaatagta ataaaacgca 5340 acttcgtttt aagcaatttg gctaagcatt tcaacgcaaa acttgcacca gttggaacta 5400 aggcacgtat caaacgaaaa attggtaatt gtttatattt gttagaagac ctcaagggaa 5460 aagaacttgg tcattaccat gctaaagaca tttgggaatg taagtaatca cccctttttc 5520 agcttttatc tcagtattag ttaatccttt cgattaaacc tgcgttgtaa tggggtgatg 5580 agttttattt ctgtaaacat aaacatacat tgccattggc atgatcagct gatcgcaatc 5640 agcttgaaga ggcagcactg caccataaca tttatgcaat tacatatgat cagcggaggt 5700 taataacata accagagtga ccatgaccta tgttaatgac gccgttgcga atgaaaatat 5760 cgtgtcttgg gtgaatgcta aattgcttct tttagttttg gcgaggcgtc attcttttcc 5820 attttaatcg ccaacgagta cagtcggttt agattgcttt aaaaattata aaattcaatt 5880 tttcacgtgg gcgtaataaa cccacgtgta tcggcactca agctcaaaaa aaaaaaaaaa 5940 aaaaattaat aaataaaagg gaaggaaaaa aggtccaagg tgcccagaaa gaaaaggggt 6000 aatatagaca tcatgcgctc ccaagagctt ggctataaca tttcaccgga ataaagaaaa 6060 ttttcagaat tcggacactg acaaatattc actaaaggat cgttactgga gtcgtggggt 6120 gcctttcctg tgtgtggtag tcgtaggaag gtgcaggcta aagcgcgggg ctgcgttatt 6180 gcgtgccaaa aagaaaaatt acgcataacg cgtagtcctt atcagagcca aagccaaggc 6240 cccaaagctc caaaaagcga caatctacac tggtagcagg gagcgaattt cctaagggcg 6300 gaccaagatt tcagcatcgt ctcattttct gcagataagc tttgaacctc gctaaccaat 6360 agttcgtgct gcacgcacag gcagaccaac gtgaccacgt gttcgaccca ttgattcccg 6420 gacattcctg agcgcagatc tggtggatag ctaaccagtg cgcaccctca ctcggattta 6480 caacatctga cataattgtt aaacggcata ctggccgtag gtataagcga agggcataag 6540 tacacatgaa atttaaattg caattctttc cgttttatat cgctgacatt gtctttcgat 6600 cctgatttag catgtatatg tcgctgacta ctatatttct acgttacata gttcaaatta 6660 agagcccaca gcacatcatc gtcgtcaata ggcctgctga catcgttgga tcatactaaa 6720 gtagtgagta ctgtgtgaaa gaacaaaaga taataagcca acatatcgaa ttgtacgtat 6780 tgtttgaatc gcggttatag tcagcgggtg tactcctaat cagccttcgt aattacaaat 6840 aacttttgtg gatcattcag atcagcagtg atttaacttc atattagttt cgtgtaatct 6900 cattcatgtg tgtttttgcc cagattgagt ttagttttat aaaaaaaaat agaaaaacgt 6960 tgttcttctc acgtaatgta taatgaagtc cgatcatgag cttaaatctt ctttgaatac 7020 acttatatga taaaatgaaa tcttaattcc tttttgggca ccttaggtaa atagtatcgc 7080 atacaaaagg gtagttgcaa cgacaatcgc agatgcagct acaggcaggg tgggtgaaga 7140 ccaagggtag ccaacatcta agctcctcgg ctctata 7177 // ID L1_TC repbase; DNA; INV; 4831 BP. XX AC . XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 15-JUL-2010 (Rel. 2.02, Last updated, Version 4) XX DE T.cruzi mRNA for Non-LTR retrotransposon. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; gag; endonuclease; L1TC; L1_TC. XX NM L1_TC. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-4831 RA Martin F., Maranon C., Olivares M., Alonso C. and Lopez C.M.; RT "Characterization of a non-long terminal repeat retrotransposon RT cDNA (L1Tc) from Trypanosoma cruzi: homology of the first ORF RT with the ape family of DNA repair enzymes."; RL J. Mol. Biol 247(1), 49-59 (1995). XX RN [2] RP 1-4831 RA Lopez M.; RT "L1_TC."; RL Direct Submission to Genbank (30-NOV-1994)M. Lopez, Instituto de RL Parasitologia y Biomedicina, C.S.I.C., Calle Ventanilla, 11, RL 18001, Granada, SPAIN. XX RN [3] RP 1-4831 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (15-JUL-2010). XX DR [3] (Consensus) XX CC [1,2] This element is present in a high-copy number, and is found CC dispersed throughout the T. cruzi genome. Northern analysis shows CC an abundant expression of L1Tc-related sequences with a major CC band of about 5 kb. The transcript has at its 3' end a fragment CC of a highly repetitive DNA sequence (E12A), at its 5' end a CC ribosomal mobile element-like sequence and three putative open CC reading frames (ORF) in different frames. CC The ORF2 codes for a protein which has significant homology with CC the retrotranscriptase-related sequences from non-LTR CC retrotransposons containing the seven domains present in all the CC retrotranscriptase and retrotranscriptase-related proteins. CC The ORF3 codes for a gag-like protein showing unusual cysteine CC motifs present in all non-LTR trypanosomatid elements, similar to CC the C2H2 zinc finger family of transcription factors. CC Interestingly, ORF1 codes for a protein with significant homology CC to the major human AP endonuclease protein, and maintains in CC similar positions most of the amino acid domains described for CC all the Ape family of proteins. The presence of Ape-related CC sequences, described for the first time in a non-LTR CC retrotransposon (L1Tc), may have functional relevance for these CC types of elements. CC [3] Consensus sequence is updated. Only one long ORF is CC reconstructed. It is likely that previously characterized three CC ORFs are desrupted products of a single protein-coding region. XX FH Key Location/Qualifiers FT CDS 138..4787 FT /product="L1_TC_1p" FT /note="includes AP-like endonuclease, reverse FT transcriptase, and ribonuclease H domains." FT /translation="MEPFTWLPAERYFYPLLNSIGAYQRYTYRLRAVCDAQ FT RQKLLLSGDIEQNPGPIAVLQMNVSCLTPSKLATLMAQGADIIAIQETWKS FT SEQIASMHTGDYVLYAQSRIGKGGGVAVLVRKTLRSKRIPLTIPQHDTSLE FT VVVVQVALDQNRDLIVASAYMRPPPQVTQSFRRLVNCLPASSPLLLCGDFN FT MHHPQWEPFLETSPSEVAAEFLELCTDAGLTLVNTPGEITYARGTRERSCI FT DLTWSKHLTVSDWSASVSPLSDHYMLTFTLHQAFKDTIPSAPPSAPKFFYS FT WGKCKWDLFIKDFDAQLPAYDYKKQSTGIKAFTRALITSYRRHCPRGMHKD FT GPRLWDDTLMEAERIATDSKARYLQLPTPDREAEMQRTRSQFFLLLRERLR FT NTYLRRISKLNPGEPLAWKYISGRKKASLPSPTSMLLGDGQHTYKTARRAA FT NALNRIFLPFHPSHKAVRFSKGINRQSASLNINASFLFGQQNNSESAASFT FT SFSSTSSSSEPQNNNESAATFTSGSSLSSSSEPQDKNEAATTSGLVAHLHS FT PLDAPFNRTELLAALRNTPYGKAPGPDEVYSEALRHISSKGLRFLLRCINH FT SWTTGTIPVEWRRATIVPLLKPGKSPELLESYRPISLTSIVSKVAEKMVLK FT RLLWVWTPHPHQYAYRSMRTTTMQLAHLIHEVEHNRNHYFQVSLPKKSGIG FT NQLHYRPHRTLLVLVDFSKAFDSIDHRVLSRLLANIPGVNCRRWLRNFLCG FT RYAKTRVGHRHSDRRPMLRGVPQGSVLGPYLFSLYVHPLLNLLNSFAGVTA FT DMYADDLSIIVKGQSREDAIPTANMVLQKLHAWSQENGLAINPSKCEAAWF FT TLSTHTESDYDREGRWPLVVAGCQIPVMTMGASRTTKLLGMDLDPRLTLNV FT AATKQCAATSQRISQLRCIAHKEAGPSPHDLRTFVIGYGASKLRYGSELIW FT AVATDSAKNEMQKTYATLARIVSGVPSTVDPESALLEANMPPLHVLCLRAR FT LSIFENTRACQMDWMRRPPPEPPPRAGFRISPLSRDELYAFVDAYTKDYGI FT TESSPREERFFRSSIPPWFAASAHRVTIGVELPIDHSITDEEELIREKRRV FT SEEALALHSHRSWILATDGGVDVPKSAGVGILLSSLNSSEIIEKASINCGA FT RPCSYRTESRALLLALEKLMIPRIRHRRKTLLVVTDSQSLLAALNKGPLSQ FT TDWTEDQIWQRLLTLTRAGWSVHLQFCYGHCGVHANELADQYATQTMESGQ FT YTEQGIAPLWHTDLLTCFTTQLTNKWRSTIRQDTHRYLLCGTRPSDLSGKD FT LITQEVLHRQELVHLARARCGESELWGRLYWAVRDCTNQCRFCNISPEQSA FT YMRSNNDPTAPGTDTVPPSAREEDVSPVRRRTLTRRRKEKCPHCDSTLTGF FT SGLVSHCRSFHPEHPPPLPELKCDFCDMVFPTRRSTAQHRSRCAHNPDATR FT HRNSSARRRSLLPQDQPASTSTPIGPQETLHHLLLECPGTLAVRQRLGIEQ FT DLRLGKFSQWQLLHSRKLLSLLDHLFGTQMALYS" XX SQ Sequence 4831 BP; 1184 A; 1361 C; 1221 G; 1065 T; 0 other; ccctggctca gccggccacc tcaacgtggt gccagggtct agtactcttt gctagagagg 60 aagctaagcg cctgctgccc atccgctgcc cgcggagagg caggaggcgc cgcacaaacg 120 ggtcggaagg gcaccagatg gagccattta catggctgcc cgcggagcgg tatttctatc 180 cgctgctgaa ttccatcggc gcttatcagc gctatacata ccgtttgcgt gccgtatgtg 240 acgcgcaacg acaaaagcta ctgctaagcg gagacattga gcagaaccca ggccccatag 300 cagtactcca gatgaacgtt tcttgcctca cgccgtcaaa actcgcaaca ttaatggcgc 360 aaggagcaga cataatagcc attcaggaga cttggaagtc gtcagagcag atcgccagca 420 tgcacactgg agattatgtg ctctatgcac agtcgcgcat cggcaaggga ggcggtgtgg 480 cggtgctggt gcggaaaact ctccgctcca agcgtatacc tctcaccatc ccccagcatg 540 acaccagcct tgaagtggtg gtggtccagg ttgctctgga ccagaaccgt gatcttattg 600 tagcgagtgc ctatatgaga ccaccaccgc aagtaacgca atccttcagg cggttagtaa 660 actgccttcc agcctcgtcg ccgctcctgc tgtgcgggga tttcaacatg catcacccac 720 agtgggagcc attcttggag acttctccaa gcgaggttgc tgcagaattt ttagaactgt 780 gcacggatgc gggactcacc ttggttaaca cccctggtga gatcacgtat gcccgtggca 840 caagagaacg atcctgtatc gatctgacat ggtcaaagca tttgactgtg tcggattggt 900 cagcttccgt gtcgccgctt agtgatcatt atatgctgac atttacgctg catcaggcat 960 ttaaggatac cataccttcg gcaccccctt cggcacctaa gtttttctac agttggggga 1020 agtgcaagtg ggatttattc atcaaggact tcgacgcaca acttccggca tacgactata 1080 aaaagcagtc caccggcatt aaggctttca cgagagcgct tataacttcg tatcgacgac 1140 attgcccccg cggcatgcac aaggacggtc ccaggctttg ggacgacact ctcatggagg 1200 cagagcggat tgctaccgac agcaaggccc gctatctaca gttaccgacg cccgaccgtg 1260 aggcagaaat gcaacggaca aggagtcaat tcttcctcct gctccgagag cgcttgcgca 1320 acacgtatct acgccgcatc agcaagttaa atccaggcga gccactcgca tggaaataca 1380 tttccggacg aaaaaaggca tcacttccat ctcccacatc aatgttatta ggagatggtc 1440 aacacactta taaaacagca aggagagcag cgaatgctct caatcgcatc tttcttccat 1500 ttcacccctc tcacaaggca gttaggtttt ccaaaggcat caacagacag agtgcatcct 1560 tgaacattaa tgcttctttt ctttttgggc agcagaacaa cagcgagtct gctgcctcat 1620 ttacttcatt ttcttctact tcttctagct ccgagccgca gaacaacaac gagtctgccg 1680 ctacatttac ttcaggttct tcactctcat ctagttctga gccacaggat aaaaacgaag 1740 ctgccaccac atctggttta gttgctcatc ttcactcccc tcttgatgca ccttttaatc 1800 gtacggaact gcttgctgcg ctacgtaata cgccgtatgg caaggccccc ggaccggatg 1860 aagtctacag tgaggcactg cgacatattt cgtcaaaggg cctccgattc cttcttcgtt 1920 gcattaacca cagttggacg accggtacga ttccggttga gtggagacgc gctaccatcg 1980 ttccactctt aaaacccggt aagtcgccgg aactgcttga gtcatatcga cccatcagcc 2040 ttacctccat tgtgagtaag gttgctgaga aaatggtact gaagagattg ctttgggtgt 2100 ggacgccgca cccccaccag tatgcatatc gtagtatgcg taccacgacg atgcagctgg 2160 cacacctgat acacgaagtg gagcataata gaaatcacta tttccaagtg agccttccca 2220 agaagagcgg tattggcaat caactccact acagacccca tcggaccctg ctggtgctgg 2280 ttgatttcag caaggctttt gactccatag atcatcgagt cctcagtcgc ttgctggcta 2340 atattccggg ggtgaattgt agaaggtggc ttagaaactt tctatgtggt cgctacgcga 2400 agacacgagt tggccacaga cacagcgatc ggcgtcccat gctgcgagga gttcctcagg 2460 ggtccgtgct gggaccgtat ttgttctccc tttacgtaca cccacttctc aatctgctga 2520 acagctttgc gggtgtcaca gcagacatgt atgcggacga cctctctatt atcgttaagg 2580 ggcagtcccg ggaagacgcc attcccactg ccaacatggt tcttcaaaaa ctgcatgcgt 2640 ggagtcagga aaatggcctg gccatcaacc cgtcaaagtg tgaagccgct tggttcacac 2700 tatccacgca cacggagtca gattatgatc gtgaaggaag gtggcccctg gtagtggctg 2760 gatgtcaaat cccagtcatg accatggggg catcgcgaac tacgaagctt ctcggcatgg 2820 atctcgatcc acgactgacg ctaaatgtgg cggccaccaa gcaatgcgct gccacttcgc 2880 aacggatatc gcagctacgc tgcatagcgc acaaagaggc gggaccatct ccacatgacc 2940 tacgcacgtt cgtcattgga tacggtgctt ccaaattacg ctatggcagc gagctcatat 3000 gggcagtagc gacggattca gcgaagaatg agatgcagaa gacgtacgca actctagcac 3060 gcattgtcag cggagttccg agcactgtcg acccggaatc cgcactactg gaggctaata 3120 tgccgccgct ccacgtcctt tgcctgcgcg cgcggctctc aatatttgag aacacacgcg 3180 catgtcagat ggactggatg cggagacccc cgcctgagcc accgcctcgc gccggtttcc 3240 gcatctcgcc attatctcgg gacgagctat atgccttcgt agacgcatac acaaaggact 3300 atggcatcac cgagagctca ccacgcgaag agcggttctt tcgcagctcc attcctccct 3360 ggtttgcggc ctccgctcac cgggtcacca tcggtgtgga acttccgata gaccactcaa 3420 taactgacga agaagagctg ataagggaaa agcgcagagt cagcgaagag gctctggcgc 3480 tgcacagcca tcgttcgtgg atacttgcga ccgatggcgg tgtcgacgtt cccaagtcag 3540 caggggttgg aatactgctt tcatccctca actcatcgga gataatagaa aaggccagca 3600 taaactgcgg tgcacgccca tgcagctaca ggacggaatc ccgtgcgctg cttctagccc 3660 tagagaagct gatgattcct cgtatccgcc acaggcgtaa aaccctgctt gtggttacgg 3720 acagtcagtc tcttctagcg gctctaaaca agggcccgct cagtcagaca gactggacgg 3780 aggatcagat ctggcagcgt ctcttgacac tgacgcgtgc tggctggtcc gtgcacctgc 3840 agttttgtta cggacattgc ggagtacatg ctaacgagct tgcagatcag tatgcgacgc 3900 agactatgga aagtggacaa tacacggagc aaggaatcgc acctttatgg catacggatc 3960 tgctgacatg ttttactacc cagctcacca acaagtggcg tagtaccatt cgtcaagaca 4020 ctcatcgcta cttgctttgc ggcacaaggc catcagatct cagcggtaag gacctgatca 4080 ctcaggaagt tctacaccgt caggaactgg ttcacctcgc aagggcaagg tgcggggaat 4140 ctgagctctg gggccgacta tactgggccg tgagagattg cacgaaccaa tgccgattct 4200 gcaacatctc accggaacag tctgcatata tgcgctctaa caacgatcca actgcaccgg 4260 ggacggacac tgttcccccg tcggcgaggg aggaagacgt ctctccagta aggagacgga 4320 ccctcacacg ccgtcggaag gagaaatgtc cgcactgtga ttccacattg acgggattct 4380 cgggtctcgt cagtcactgt cggtcatttc atccggaaca tcctccaccg cttcccgagc 4440 tcaaatgtga tttctgtgac atggttttcc ccacacggag aagcaccgca cagcacagaa 4500 gtcgctgcgc acacaaccca gacgccacac ggcatcgaaa cagcagtgcc aggaggcgat 4560 ctctgctgcc gcaggatcag ccagcttcca caagcacgcc aatcggcccg caggaaacct 4620 tgcaccatct gcttctagaa tgtccaggca ccttggctgt gcgtcaacgg ctgggcattg 4680 aacaggacct tcgcctcgga aagttctctc aatggcagtt gcttcatagc aggaaacttt 4740 tgtcgttgct cgaccacctc ttcggcactc agatggcact gtatagctag acgcgctggt 4800 aagagtagaa aaaaaaaaaa aaaaaaaaaa a 4831 // ID EnSpm-8_HM repbase; DNA; INV; 10212 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-type family from Hydra magnipapillata - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-10212 RA Bao W. and Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 9(2), 379-379 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(7112..8149,8070..8966) FT /product="EnSpm-8_HM_1p" FT /translation="MTSRSTIWRKTNSLIASILENNNYQSSTGEFFSSPIH FT NSIDTFESDSTLPITELSSTSDDFSNSFESFHSKHCPDLAHDLAQWAVKHA FT VTHFSLNDLLILLQKHGHEDLPKNSKTLLKTPNCVLTENMCSGDYIYLGLA FT TGIKNMLQVDEASINNNIHLIVNLDGLPLFKSSNTQLWPLLCQFGSKPPFP FT VAFFCGKQKPNSSMEFLRQFLEEFKMLSENGLVYKDNFFNVSLKFWTCDAP FT ARAFIKCIKPHNAYHGCERCIDKGEWQGRVVFNQVNSVLRSDEQFSKMYYK FT DHQVSRSPLIDFNLPCVSTFVLDYMHLVCLGVVSTTNNILDRGTKHMSFVI FT CPNFVLGLFRRLIIFWTEGPNICRLSYVQINRITEKLNDLSGKMPSDFVRQ FT PRSLGEFKRWKATEFRSFLMYTGPYVLKNVLKHDLYIHFLCLSIAIRVLDD FT NNSSSNAIGYASMLLNWFVSKSVDYYGPTFTTYNVHNLIHLSEDVIKFQMT FT LLDMSVFSFENYLQRLKRLVRGKNMPLAQIVKRLGEIDCLDNKFLRKKHKS FT KVVPGGKDSWFFLKSGQLAQVIEINFDSLLCEIIPCRLFNDFFVVPCSSRY FT NKILSVPKNASVKRKTVQKTELKRKVISFKGRTRNVVFPLCHDCVF*" XX SQ Sequence 10212 BP; 3513 A; 1267 C; 1383 G; 4024 T; 25 other; cccagtaaga aaaagacgtc gtgccgttgt tgtaatttgg ttggattttg gtcgcgacgc 60 gagttaacta aattccaacg atgttacaac gtcggcataa cggtgttatt ctaacgttgt 120 ctaacaaacg ttggaacaac ttcggaaaaa cgttatcatc taacgtctta aatgttaaca 180 ttaaaacgac gttggaacaa cgttggttat ttaacattaa accaacgtta gcctgattaa 240 cgttagaacg tctttggaac aacgttgctt tatttacgtt tttacttttg tcatcaaagc 300 aacgtaaaca caactaaaaa caaaaaaaac ttgcacattt taaaatggtt graaaaccgt 360 taatgaagaa aatgttattt ttaatctggt ttttaggcat tatgatctgt aatactaatg 420 tttattacat tattttttat attctctagt cgaattatat atgtagtttt taaacattgt 480 catgagaaaa taaagttaca aaaactttat ttgactttca aaactattca aaaacgtttt 540 tgttatgctt ttgtatattt aacattatat agtaaaattt tctgatatat aaagtattac 600 acataatatt taatttaaaa ataactaatg caataatggt attctaaata cctattatta 660 aactacttat tagtttaatt aaatgtcaat aaaaaataaa gagttcattt tttatgacat 720 ttaattcagc tgttttatat attatcttat gttattttta gtttcttgtg gaactaattc 780 tagactagaa cttacctaat aacctttctt ataacttgtt taatgtgttt cttgtatgaa 840 tttttatagt agtatttttt tatataraga ttagttgttt tttttgtaac ttgttataaa 900 aagcaacaaa aaaaaaaaaa aacaagatta tagaattagt attgttatca cttttgtatt 960 tttcttaaac attaaaaaat gcttttagta aatacattta ttaagctatt taaagattat 1020 tttatttttt gaaaagaata catagttata taataaatag ttatattaaa cttaaatttt 1080 tcattttttt gaagtaaact gatacagatt tataatcaat ttaaaattwm cttttattgg 1140 tttaattytg aaaaaawtta tttatttttt ctttggtacc ctataatgaa ttgctattat 1200 atctaaatrg tgtcatttaa agtgcaagac aaggtctcct ttgacatcat ctcttkaggt 1260 tgagtcaaaa agaaaatttt gagatggcca aagaaaggaa cggaaaaaaa agttaagtga 1320 ttcttaagtt aaatttactg ataaaagctt tatcacatgc tactcctata atattataac 1380 tgatactgac tcacaagatt ttgagtctca agtttcataa aaaggtcaaa attagttttt 1440 tagttgttta catattaggt tatttttatt gatttatttt ttattgatat ttacctcccc 1500 atggctgaga gggtctctac agttaaggat gcttgattgt tgttgcatct aggggtgtgg 1560 agttgtaaaa aaaagtaaca actctgacta caattgttga aactttcaga gaacaactcc 1620 aaattcaaag caaaattttt gaaaaattct ttgaattcct aaaaaaatat tacgcgataa 1680 tataaaamtt ttttttgtgc caaatatatg aaaaattctt tgaaggttaa atgttttata 1740 tatgcacgtg caatacccag aagttttact attgragttg gagtcacaat tttgtttagt 1800 gactctgcta caaatgtcac aactctgayt ctcccactcc gacttcacaa ccctggttgc 1860 aaccctctct caactccaaa acacagtcct tgacatacaa gactactgag cagtaaaata 1920 agttaagagc cttactacca gggatgtktt ggggatcrat ctccaaactt ctcacttatg 1980 aagcaagcac tttgccacta cactactacc acatattaag gtattcattt tgttttttca 2040 attttttact taataatttg atattagttt ttttcagcac ttggcaaagy atttactcta 2100 aaacctaaaa ttaaatctka ttttaggaat agaaaagcat atcatacaga aaaagyaaca 2160 raacacctat rttttaaagc agaacttagt catgataaat aaatattggt ttgaaccaat 2220 taaaatctca tatttctctt atttcagact taaaacaaag agctaatttt aacctttcca 2280 ataaattaca tgcaatgtta aaaaatattt tttgttgaat actgttrttt gttatttatt 2340 attatgtgtt aagagttcca gtacaaagtt attaagttgt gtgaaattaa cacaagtcaa 2400 cccaaaaatw matcaccaca tcaattgctg taagatttga agttttgtta aggtttatga 2460 gttatatttt aatatgttta acaagatwta aaaaacaaaa gctggataat taattgactg 2520 taaattattt gctgttacaa cttatgccag cctatttatt attttttagt gttttaggtt 2580 gggtgcagca taactaaatt gttattgaat aatttaaatg ttttaaaatt ttgttttatt 2640 aaaaaattaa ttgcataaaa atatttttgt acttatggtt ttgccatcat tcaatcagaa 2700 ttacttattc aattctaact gtttgttaaa aaagtttggt ttttttaaat gttgtccttg 2760 gtgcagtaaa aaaaaaaaaa aatgttttaa gatgttatgt gtaaaacatt ttaatgtgcc 2820 ttataatttt tttccttttt taggtatttt ttttggtawt ttttagatat gcaaaggtaa 2880 cttttatatt ttgtcaacta tatttttatg tattacattt acgttcagaa cattacaggc 2940 attatgacca actgcaaaga ttttgttaat aaaattaaca aaatctttgc agtttttctg 3000 acgtttaaca aaatgttaat tttgttaaac gtcagaaaaa tgttgtcaac taacatctgc 3060 aatgtcaaaa ataaaacaac atttgaacaa tgctggttgt tcaaacaacc agcattgttc 3120 agatgttgtt ttatttttga cattgcagat gttattatga aaatcaaaaa aacatataga 3180 tagatattaa aamtatatat atatatatat atatatatat atatatatat atatatatat 3240 atatatatat atatatatat atatatatat atatatatac attttatttt tgattaaaaa 3300 tttaatttaa atcaacttag tctcagaaaa ttctgcatgt agaatgttct taggctgaga 3360 tgatttaaaa taaaattcaa atttttattt cctggtgcac aaggaattaa aaagaacatt 3420 aagttttatt aacaaataaa taacaaaata atgggtatat aaatagttta attaggcttt 3480 ttttcaagaa ataaaatgct aattatttga cattttagtt agttggagtt actattttag 3540 ttagatggag tatttatttg atatttttac tatttctttg acattttagt tcaattatgt 3600 taatacatga taaattattt atttgtttat ggaaatttta tgctacttta agttccttgt 3660 ggaattagtt ccagacaagg taattatctt ccttataact tagacagacc ttaaaaacca 3720 taatgtgtgt ctattaatat atgtacatat acatatatag tggtactttt tatttgaaga 3780 ttagcatttt atttttttat aacttgctgg acgaagaaaa caaaatcata gtgtttgtat 3840 tttcaacatt attgttttct cttttacatt aaaaacaatt tcataaaaag tacattataa 3900 cattatttag gttaataaat ctttgtaaag gtaatatttt gttttttcta tataataatt 3960 tattacaaaa cttcatattg ataaataatt ttaaattatt cattttttta actttactga 4020 tataaatatg atagtagttt aaaattattt ttttattgat attattctga aaaaatttat 4080 ttattttttc tgttttacca ttatgtagtg tgctattatg tctgaatggt gtcgtgtaaa 4140 gtggcttgaa gaagacaaag tttactctga cactgttcct ttaagttgga ttaatggtaa 4200 agttttgaga tggccaaaaa aaggggctga aaaaaagtta agggaccagg ttgaacctca 4260 agatgattgg tttaaatttc aagttctcaa aattaaatta actgataaag attttatgac 4320 ttgccattca tataatatta caactgatac tgactcttgt gattttgatt ttaaaaaaga 4380 tatgtctcaa gttgcaaaaa aargtaaaaa gtaatttttt gtgtttgcat attaagttat 4440 atagtattct gatatttatt ttttaaattt ttttgtttat ttaattattt tttggaatta 4500 atttttttga gttaattttt ttaaatcaaa tttttttttt tattgaatta ttcatttttt 4560 aatttttttt ttaacagttt gcagtacact tcctccaaag cctgaaatca aatcaggttt 4620 tagaaataaa aaagcattta ccccagaatt agaaaaagaa cacctaaatc tttcacaaga 4680 agctagtcag aataaagtaa atattgattt gaatcaacct acccagtcag tatttagttt 4740 tcaaagttca tcaaaatccc ataattcaag gaaatcacgc tctcgatctt ccagtagttc 4800 ttgttcacta tctcctcaga gtaaaagtaa aagaagacca gattattgtt ctcttttaca 4860 atcagagaaa attgaaacac agagtgaact ctcttgtgga ttaccaaaaa gtaaagaaca 4920 ttctactctg gctattaaca gaaataactt tccaatgagt gatcaaagtg tgtagatttt 4980 tttttcagta tttatacata ctgaaaaaat attattcata atatattttt attttatttt 5040 ttatataagt cagttattat aattatttta tggaggcata ttatatttaa aaagtattac 5100 aattaaagat aaatttttat aaaaatttac cctgtagttt aataaattat acttttgttt 5160 ttttttcttt tatttttgtg tgtagaattc cagtatgaag tcatcaagtt actgtgtgaa 5220 cttaaaacat gtttacaatg tcaacccaaa agtacattgt cacataattt aagtgcggaa 5280 attccaactc aattcttgac agtagaagat ttgaatgaat ttgaacttaa aatgcatgat 5340 aatagttttg cacaagaatt tgtgagttat attttaatat ttttaaaagc aatttagaga 5400 aaaaaagatg gacgataaaa tgattattaa tcaattgctt ttgcaactaa tgtcacctta 5460 tttataacct atttattttt aaactattta ggttgtagca ttactataac ttaagtatta 5520 ttaactaatt aaattgttta aaaatttatt acacttatat aaaaattttt atgtgaaata 5580 tttcaattgg gctgatattt tttttctatt taaggttaaa caactctcac gtgttggtgg 5640 aaagtcacct aaatatatgg ttaaggacct aattcaaagg taatttttgt attttcaaaa 5700 tttgattttt atatgttaca tttacttcat attactatta agttagaaac agcataaatt 5760 actaaatgat atcttttagt tgttttattt attttcatct aagattgatg tcaaatgatc 5820 ttcagacaaa attcagcatg attggtttaa aaggaaagct taagtttgaa aaaactttgt 5880 tgtataatgt aataaatagt gagtttgttt ataggttgtt ttttttttta aataacattt 5940 tcctaaatag cattattgct gttacaatat ctggcagacg tgtattttag acacccccct 6000 ttagtaagtt cgtaaaaaac gagtcatgtt taacatataa catttattga aaattatact 6060 tttaccctga agccaacaat aaagtttatt ttattgaaag ccatgcccta atcaaacgct 6120 gagcctaatc ctaagagtca tggctagggt ttggttaggg tatggttttc aacaatattt 6180 ttttcatata aaactttttt acagacttag taatgaaggg tgcctgaaat acacgtctgc 6240 ccaatatatt atactaatct atcttaatac tgcatcctaa tacttttttt gtataattgt 6300 ttatcttaga agcagtatta aatgtctttc cggatgctac cagctcagaa gtacgaatag 6360 aagtgatgaa ccatttaaaa tatgctgcag atcgtaaaaa ccggaaaaca cttcggacaa 6420 gcaccagaaa tgatgtcatg tctatcagtt cgaattttgc agaagaagcc agtaaaacat 6480 aatttttata taattttttc actgagtgtt tttttttttt tttttttttt ttacttcaat 6540 ataataaagt ctttattttt caacatttct tttgaatttc ttaaacacaa tattaaataa 6600 atattaagaa gaaaaaactt tttttgttta tgtgaatttt aaaacttaag ttgttaaact 6660 ttaaagatct tttaaattag tttaccatat attattaaat catgtgtcat tttttgatca 6720 ttgttttgaa taacataaat atagttctgt ttttaaaaga tttactattt gttttattat 6780 ttgtgatgga ttataaataa ataataatat taatcaatat aaaaattaaa tcaatggtta 6840 aataatgctt caaaatcatt ttttacttac aatttttttt aagattctta tgatgaaagt 6900 ttatttctaa aaagttacaa aagctttatt taaattttaa cagtttgtgt aatattttgt 6960 gtttttatac taaatagcaa catttaatga taaaatatta attatacaaa attatataat 7020 ttttatataa attaattgtt atatatatat atatatatat atatatactt ttttttataa 7080 gtattataga taagaaatac aaagatatac aatgacatca aggtctacaa tatggagaaa 7140 gactaattct ttgattgcca gtatattaga aaataataat tatcaaagtt caactggtga 7200 gtttttttct agtccaattc ataatagtat tgatactttt gaaagtgatt ctacattacc 7260 aataactgaa ctctcttcaa caagtgatga ttttagtaac tcttttgagt cttttcattc 7320 taaacattgt cctgatttgg cacatgatct tgctcagtgg gcggttaagc atgcagttac 7380 acatttttca ttaaatgatt tgcttatttt gttgcaaaaa cacggacacg aggacttacc 7440 aaaaaattct aaaactttac ttaaaacgcc taattgtgtg cttactgaaa atatgtgctc 7500 tggtgactat atctatctag gtttagctac tggtattaaa aatatgctcc aagttgatga 7560 agcaagtatc aataacaaca tacatttaat agtaaattta gatggtctgc cgctttttaa 7620 atctagcaat actcagcttt ggcctttact ttgtcaattt ggttctaaac ctccgtttcc 7680 tgttgctttt ttctgtggca aacaaaaacc taactcttct atggagtttc tcaggcagtt 7740 tttggaggaa ttcaaaatgc tttcagaaaa tggtcttgtt tacaaagata atttttttaa 7800 tgtcagttta aagttttgga catgtgatgc tccggcacgt gcttttatta aatgtataaa 7860 gccacacaat gcttaccatg gctgtgaacg atgtattgac aagggtgaat ggcaaggaag 7920 agttgttttt aatcaggtta attctgttct tcgttcagac gaacaatttt ctaaaatgta 7980 ttataaagat catcaagtat ctagaagtcc tttgattgat tttaatttac cttgtgtttc 8040 tacctttgtt ttagactata tgcatttagt ttgtcttggg gttgtttcga cgactaataa 8100 tattttggac agagggacca aacatatgtc gtttgtcata tgtccaaatt aaccgcataa 8160 ctgaaaaatt aaatgactta tctggtaaaa tgccatctga ttttgtcaga cagcctagat 8220 cactaggaga gtttaaaaga tggaaagcta cagagtttcg ttcatttctt atgtatactg 8280 gtccctatgt attgaaaaat gtattaaaac atgatcttta tatacatttt ctttgtctta 8340 gtattgctat tcgtgtgtta gatgacaata attctagttc taatgccatt ggctatgcat 8400 caatgctgtt aaattggttt gtatctaagt ctgtggatta ttatggtcca acatttacta 8460 catataatgt tcataacctg attcatttgt ctgaagatgt tataaaattt caaatgaccc 8520 tacttgatat gtctgtcttt tcgtttgaaa attatttgca gcgtctgaaa agattggttc 8580 gtggtaagaa catgccactt gctcaaattg tcaagcgttt aggagaaata gattgtttag 8640 ataataaatt tttgagaaaa aaacacaaat caaaagtagt ccctggtggt aaagatagct 8700 ggtttttttt gaaaagtggt cagttggcac aggttattga aatcaatttc gattctttac 8760 tttgtgaaat aattccatgt cggcttttta atgacttctt tgttgtacca tgcagttcaa 8820 gatataacaa aatattatct gtgccaaaaa atgcttctgt aaaaagaaaa actgttcaaa 8880 aaactgaact caaacgcaaa gttattagtt ttaagggtag aacaagaaat gttgtttttc 8940 cattatgtca tgattgtgtg ttttgaataa attttgtttg ttttatctat ttaaaatttt 9000 ttttattgtc agttgcatta gtttttgaat gcagtttatt gagaatcata tatggtaaat 9060 atattcccta ctatttcaac atttatgttg aatatatatt taataatgac attttgttaa 9120 attttattta attttagata tttttcaatc gaaaattaat tttcatccta acatttcatt 9180 aaaatcctaa ttgaaacatt caaattttgc aaatttgaat gtttcaatta ggatgttaga 9240 aggtgtttta taaaagtttg ttattttaaa aaattttatt ttgatagtta taatagaaat 9300 taagttattt ttttctattt caaaggattt tgctcaaaga ttttctgtca ataatttttc 9360 aaaaaaagga aaactcaggt gagcatcaaa taacaagtta gatcaatttc tggtatttat 9420 gttagttttt tctatttgtt ctttctgaat gtttttcgtt tgttgataat gcttttaatt 9480 aaatttgttt ataactgttt atttatatga atttatctct tagtgtcaaa caaagaatac 9540 acgaattaat tgagatatgc tagaagtttg gaaagaatgt aaaaatcaaa ctagtttcac 9600 tctaagttat tgtatataat tttaaaattt tgaaatagaa gtattgtaaa gtttattttt 9660 cttttttagt cttttaattt ccatattcaa ttagttgcct ggactaatat aattgccatg 9720 tatgaactaa ataaataaaa ggaaaaaaaa catcacagca cgatgttata cagtgcaaca 9780 tagtgcttta ttgctctttg cagtgtagaa ataacatcgt gaaaatgtta gaaatacatt 9840 ggtgattgtt tttcttataa agtttggcca acgtcatcaa aataacgtta gcctatgttt 9900 tatttcagcc attattccaa cgtctaaaca acgatagaat agacattgtc tcaacgtcat 9960 ttcgctcaaa atacaacgtt tgcccaacgt cttcataacg tcttttttct taaaatttaa 10020 cgttgttata acgcctttcc gacgtcattt agctcgaatg taacgttgtc acaacgtcat 10080 tttacatcgt tgcaacaacg ttagcaaaat gacgtctttc cgctgtttca tttataacat 10140 cataccaacg tctttccaac gatacggaag acgttggaat aacgttgtcc caacgtcatt 10200 tttcttattg gg 10212 // ID L2B-3C_AAe repbase; DNA; INV; 4561 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-3C_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4561 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1408-1408 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >97% CC identity. The consensus is ~89% identical to L2B-3B_AAE and CC ~77% CC identical to L2B-3_AAe. XX FH Key Location/Qualifiers FT CDS 719..1681 FT /product="L2B-3C_AAe_1p" FT /translation="MDGDSYPSAGSFVEVVRRKKRVHETDSVLRSGRVRNK FT SVATSNSNDNRNVSAMAQQVTNAKVSEVNGNGNDSNKKFGCTVRVKPNVTQ FT SNHQTKKEVRSKINPTQVAIKSVRNGMNGSIIVECDNENEAKGFAKIVNEK FT LGDGYAADIEQPKRPRIKIFGVESNYNSNELINILRDQNDIEHVQYLNVLR FT CIPSKRNPENEFTLICEIDAITFERVMRKGKLNIDFERCRVMESIEVFRCF FT KCCGYGHKSSECKNNLHCAKCAERHDVKECSSDQEFCVNCIISNRERKTQF FT DVNHSSWSVDCPIYLRKISISRSYINYNA" FT CDS 1685..4486 FT /product="L2B-3C_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSKRATDVVLLNIAGITSHFAELEVLVNIKKPKLIML FT TETHLTSDIGINEYSIRNYKMXCCFSVSRHTGGVIMYIHESIKYHVVDNST FT CGLNWFVAIKVVKGLRVGVYGLLYRSPSGNEQQFLAHLEQNWLEKVLDDKG FT MNLIAGDFNINWQQSSDSRNLRSIMEFFDLDQKVKSTTRCTLRSQTIIDLV FT FCNDDRLKVVTAHDSKISDHETIEINFDEFSQPLENYITIKCWKKYSKSAF FT LSLLRNSMPRNSERQLDEKADVLSTVLKENINRLVVVRNIKCNSRKKWYTA FT ELKILQEARDDAYKKASASWDEADWQRYKVLRNEYSYSIRTAKAEFTQKKI FT EQNRGNSKQLWRTLKSLYKSKEKPASRISFNGVDVDEDQVICEKFNSYFVD FT SVQQINESIENVSDCFGNNECQGTHSWNVFNRVSYDALKKAIAKISCSSGI FT DNVNLQVLKDSLEVTGEYLLGIINDSLEQGKFPSGWKQSTVVPIPKVSGTT FT NSEEYRPINMLPIYEKVLEIIVKDQLLEYLNEHKIIINEQSGFRQNHSCES FT ALNLLLYKWKRMIEEKKTIVVLFLDLKRAFETISRPEMLKTLNKYGMGGNV FT LKWFESYLSDRTQVCQYGNCISSPKSVPLGVPQGSVLGPILFILYINDMKK FT AIKHCDINLFADDTVIFIAERDEKVAIRKIRDDIKSLNKWLKVKKLKLNVQ FT KTKSMVISNKKQLDYTELKIHIEGVEVEKVDVFKYLGVIIDQKLTFSAHID FT SVIKKVAKKYGMLIRLKSQLTFWSKIFLYKTLVAPHIDYCSSVLFLASETH FT LKRLQRLQNKFMRYILNCDRYTPIKNMLEVLQWLSVKERIIFNVMTVIFKL FT TNDLLPEYLTNIILRGRNIHDHRTRRSDDLRVVPFTMTSTQKSMYYNGIRI FT FNELPAEIKNARCVTEFKKNCATWVKFKYR" XX SQ Sequence 4561 BP; 1667 A; 636 C; 969 G; 1284 T; 5 other; tgtgagtgac tcatgcgtga aataaggtgt taacatcgag tctgaamtta aatacgtttt 60 taacagtgmt aatcaatttt aatcgcgact atacatgtat cattagacgc gtaagaagat 120 tccccgtcta acggtacata gtgcgtttta gtgaatatcg gcgttcttag tctacgtcga 180 aattaatgat ccaaaagtag tgctttcgcg aaagtaacag tgaaatcaat taattgaaca 240 aagcggccta cggccgtgca aagacaatac atctgaaagt agcggcaaat atgtctgtgc 300 agtgtgatgt ttgcttcaag gcaattaatg tggcaaagga aagggtgttt tgtttcgggg 360 gctgcggtca ggtactgcat gctaaatgtg ccgacctaac taatgcaggg aaacggcttt 420 gcgcgagaat ctatcaatca agtacttgtg ccatgattgt agaaaaaagc aagtgggtct 480 taacgaagtg ataggcaaat gcgataacat tctcgttgcg ataaatgaga tcaaaactcg 540 tttggataaa attgaagcga aatttgaacg aaacggttgc gatgaagcag tgaaactatg 600 tgaacaaaat gtcaagttag tggtggaaga atctgcaaag ttacatggtg aacaattaaa 660 aaacttggaa acacaaattg cgaattcggc atacagtcct gctgcgggca aaggcccaat 720 ggatggcgat agttatccat ctgcgggcag ctttgttgaa gtggtgagga ggaagaagcg 780 agtgcatgaa actgattctg ttttacgctc tgggcgagtt agaaataaga gtgttgctac 840 ttcaaattcg aacgacaata gaaacgtgag tgcgatggca cagcaagtca ctaatgcaaa 900 agtgagtgaa gtgaatggga atggaaatga ctccaataag aaatttggat gtacagtgcg 960 cgtcaaacca aatgtgacac aatcaaatca tcagacgaaa aaagaagtta gaagtaaaat 1020 taatccaaca caagttgcaa taaaaagtgt tcggaatggc atgaatgggt caataattgt 1080 tgaatgtgat aatgaaaatg aagccaaagg gtttgcgaag atcgtcaatg agaagttagg 1140 tgatggctac gcagctgata ttgagcaacc aaagagaccg agaataaaaa tctttggcgt 1200 ggaaagtaat tacaattcaa atgagttgat taacattttg cgtgatcaga atgatattga 1260 acatgttcag tacctcaatg ttttaagatg tattccgtcg aaaagaaacc cagagaacga 1320 attcacactc atatgcgaaa ttgatgcaat tacgtttgag agggtgatgc gtaaaggtaa 1380 gctcaatatt gattttgaga gatgccgagt tatggaaagc attgaagtat tccggtgttt 1440 caaatgttgt ggatacggac ataagtcgag tgaatgtaaa aataaccttc attgtgctaa 1500 gtgtgctgaa agacatgatg ttaaagaatg ttcatcagat caagaatttt gcgttaattg 1560 tattatttct aacagagaaa gaaaaactca atttgatgtt aatcattcat cgtggagcgt 1620 agattgtcca atttatttga gaaaaatatc aatttcgaga agctacataa attataatgc 1680 atagcaatca aaaagagcga cagatgtcgt tcttttaaat attgctggaa ttacstcgca 1740 tttcgctgaa ttggaagtgt tagtgaatat aaaaaagccc aaacttatta tgttaacsga 1800 aactcatttg acttcagata ttggaataaa tgaatacagc ataagaaatt ataaaatgwt 1860 atgttgcttc tctgtatcta ggcacacagg tggcgtgata atgtatattc atgaatcgat 1920 aaaatatcat gttgttgaca attcaacttg tggactaaat tggttcgttg ccataaaagt 1980 agtcaaaggg cttagagttg gcgtatacgg cttgttatat cgttcaccaa gtggcaatga 2040 acaacagttt ctagcacatt tggaacaaaa ctggcttgag aaagtacttg atgacaaagg 2100 aatgaatctt attgctggtg attttaatat caattggcaa caaagcagcg atagtagaaa 2160 cctacgcagt ataatggagt tttttgattt agaccaaaaa gtcaagagca caactagatg 2220 tactttgaga tcacagacca ttattgattt agttttttgt aacgatgata ggctgaaagt 2280 tgttacggca catgacagta aaatctcaga tcatgaaaca attgaaatta atttcgacga 2340 attttcacaa ccattagaga attatatcac aattaaatgt tggaagaagt attctaaaag 2400 tgcatttctt tcactgctga ggaatagtat gcccagaaac agtgaaaggc aattagatga 2460 aaaagctgat gtgttgagta ctgtgttgaa agagaacata aacaggcttg tagttgtaag 2520 gaatataaaa tgcaatagca gaaaaaaatg gtacacagct gaattaaaaa tattacaaga 2580 agcaagagat gacgcgtaca agaaagctag tgctagttgg gacgaagctg attggcagcg 2640 atacaaagtt ttacggaatg aatattcata ttctataaga acagcaaaag cagaattcac 2700 acagaaaaaa attgaacaaa atagaggaaa tagcaaacaa ctttggagaa cattgaaatc 2760 actgtataag agtaaagaga aaccagcaag tcgaataagc tttaatggtg tcgatgttga 2820 tgaagatcaa gttatttgtg aaaagtttaa cagttatttc gttgacagtg ttcaacaaat 2880 taatgagagc atagaaaatg tttccgactg ctttgggaac aatgaatgtc aaggaacaca 2940 cagctggaac gtttttaata gagtctcgta tgacgcattg aaaaaagcta ttgcaaaaat 3000 tagttgttca tcaggtatcg ataatgtaaa tttacaagtt cttaaggatt cattggaagt 3060 caccggagag tatctacttg gcataattaa tgattctctt gagcagggca aattccctag 3120 tggttggaag caatcaacgg tggttccgat accaaaagta tccggaacaa caaattctga 3180 agaatatagg ccaattaata tgttgccaat atacgagaaa gtattagaaa tcattgtcaa 3240 agaccaattg ctggagtact tgaacgagca taaaattata ataaacgaac aatcaggatt 3300 taggcaaaat cattcgtgtg aatcagcttt aaacttatta ttgtacaaat ggaagcgaat 3360 gattgaagag aaaaaaacta ttgttgttct gtttttagat ctaaagcgtg catttgaaac 3420 gatatcgcgt ccggaaatgt taaagacttt gaataaatat ggtatgggag gaaatgttct 3480 caaatggttt gagtcgtatt tatctgaccg aacacaagtg tgtcaatacg gaaattgtat 3540 atcttcgcca aaatcagtgc cgctcggagt tccacagggt agcgttttag gaccgattct 3600 atttatttta tatataaatg atatgaagaa agccattaag cattgtgata ttaatttatt 3660 tgcagacgat acggttatat ttatagcaga gagagatgaa aaggttgcaa ttagaaaaat 3720 tagagatgat ataaaatcat taaacaagtg gttgaaagtg aaaaagctca aattgaacgt 3780 ccagaaaact aaatcgatgg ttataagtaa caagaagcaa ttggattaca cagaactgaa 3840 aattcatatt gaaggagttg aagtagaaaa agtagatgtc tttaaatacc ttggagtcat 3900 aattgaccag aaattgacat tcagtgcaca tatcgatagc gtaataaaaa aagtagcaaa 3960 aaagtatggc atgctaattc gtttgaaaag tcaactgacg ttttggagca aaatattttt 4020 gtataaaaca ttagtggcac cacatattga ctattgctct tcagtgttat tcttagcaag 4080 tgaaactcac ctaaaacgat tgcaaagatt gcaaaataaa ttcatgagat acattttgaa 4140 ctgcgacaga tatacaccaa ttaaaaacat gttagaggtg ctacaatggc tttctgtgaa 4200 agagcgtatt attttcaatg tgatgactgt gatttttaaa ctgacaaatg atctcttgcc 4260 ggaatacttg acaaatatta ttttacgagg acgaaacatt catgaccata gaactagacg 4320 aagtgatgat ttacgtgttg tgccgtttac aatgactagt actcaaaaat ctatgtatta 4380 taatggaata agaattttta atgaattacc agctgaaatc aaaaatgcaa gatgcgtcac 4440 agaattcaaa aaaaactgcg caacgtgggt taaatttaag tatagataag gaaaatcata 4500 aaattgtatg tatgaattac tgtatattat tatcagaaga taaataaatg gattattatt 4560 a 4561 // ID Copia-10_CQ-I repbase; DNA; INV; 4039 BP. XX AC AAWU01001056; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_CQ_; KW Copia-10_CQ-LTR; Copia-10_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4039 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 335-335 (2011). XX DR GenBank; AAWU01001056; Positions 14172 18210. XX CC Positions [1522-2052] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 104..1639 FT /product="Copia-10_CQ-I_2p" FT /translation="MADEEVRVPGAGRGVAAVQPGGPRLPGSSVSLPVKEV FT LRGSENYNSWAFATKMVLIKERTWRAVRDPVEGEPEVDADTSEQALATICL FT SMDRGLRGMVQNAATAREAWDILRNTFEDSGSTRRIGLLRKLTGIRLMGAE FT KVEEIICQSTQEYVSEIMVTCNQLAEVGFKVDDDWVSSLLLKGLPKKYAPM FT ILGLESSGVKLTAEKVKATIMQEVKVEAGSSSSESEGAFYSKQRQQKFKKP FT VEKSDVKCFRCKKLGHYASECTNGTSAKSKEKSGNHKGNATSFFAAFNAVE FT SPAGASGDWIIDSGATTHMCRNEAILEKPSSVSAMINVANNAEVAVVAKGS FT VKLEAECDGEIRDLTMTDVMCVPDLAINLLSVSKVCQKGFRVEFTKDACAV FT LDASDTVVAIGRETDGLYKLTETKRSRANLVQADDQFELWHRRMGHLSASG FT MKRLREMTTGAQFDVKDGCSVYRASRASTSVNRSTREGRERSESWNWFTRI FT CAARWRRSRSEADATF" FT CDS 1603..4029 FT /product="Copia-10_CQ-I_1p" FT /translation="METESIGGRRYFLTFIDDASRKTVVCFLKSKTEVLEA FT FQNFKAFAENQTGERIKRIRTDNGREYVNREMEEFLRKSGIHHEKTVPYNP FT EQNGLAERRNRTYVERARSMLFDAGLDKRFWAEAVATASHVINRSPAKGLS FT VTPEEKFTGKRPDLSHLRVFGTKCMAHVPKERRRKWDKKSTGTDVALTEPE FT VGDEEMELIVPSIPIEVGPAAEEEAEPEEDESDFSDYESVEGDADSLVEAE FT EEPLVLPPQPEDPQVLVRRSGREHVPPGKYKDYVCSTLPEKDSQPRQNRSI FT PMMVEPDTYREAIARDDSDQWKRAMAAEFGALVENRTWELTDLPKGRKALR FT CKWVYKLKTNSDGSVERYKARLVIKGYSQIKGVDYSETFSPVVRYATVRYL FT MSLAAKLDLKVHQMDAVTAFLQGDLGEEEIYMEQPEGFTESGAPTKVCKLR FT KALYGLKQASRVWNQKLDAELKRSAYDTCVYFKLEGNTVLIVAVYVDDLLV FT FSNNQQWVTKLKKDLMARFRMKDLGMASKVLGMRVTRGDDKVMLDQEQYIN FT ELLQRFNVAKCNAVSTPVDPTQKLTKEMCPSSVEEREQMAKVPFRELVGGL FT QYLALSTRPDICYAVNAVSQFSSNPGKAHWIAAKRVLRYLKGTKMMKLTYS FT KHRDQSFVGYSDADWGNDPETRRSITGYVYMEAGGSISWSCRKQTTVALST FT MEAEYMAVSAATQEALWWRGLLKELFGVAQPVTIYCDNMSAVHLAEKEIGY FT SPRSKHIDIRHHFVRENVEEKLIKLEHVATEAQKADVFTKPVAVSKFLEAR FT KALGIHG" XX SQ Sequence 4039 BP; 997 A; 963 C; 1326 G; 753 T; 0 other; gacagtccgt agcaacggct gtttggagaa aataacagtt tggaagaaaa ttttcactca 60 agaagttaac tgaagaattt ggaagaagtt ttgtcctgaa acgatggccg atgaagaagt 120 ccgagttcct ggcgctggac gaggcgtggc ggcggttcaa cctggcggtc caaggctacc 180 tggaagttcc gtttcgctgc cagtgaagga ggtgctgcgt ggttcggaga actacaactc 240 ctgggcgttt gccacgaaga tggtcctgat caaggaacgg acctggcgcg ccgtacggga 300 tcctgtagaa ggagaaccgg aagtggacgc ggatacctcg gagcaagcac tggcgactat 360 ctgcctcagc atggaccggg gcctacgtgg catggtccag aatgcggcaa cggcaaggga 420 agcttgggac atcttgcgca acacgttcga ggacagcggt tccacccgga ggattggtct 480 cctgagaaaa ctgaccggga ttcggttgat gggagctgag aaggtggaag agatcatctg 540 ccagtcgacc caggagtacg tcagcgagat catggtgacc tgcaaccagc tggcggaagt 600 cggcttcaag gtcgacgatg attgggtgtc gagtttgctg ctgaagggcc tgcccaagaa 660 gtacgcgccg atgattctcg gactggagtc gtccggggtc aagctaacgg cggagaaggt 720 gaaggccacg atcatgcagg aggtgaaggt cgaggctgga tcgagctcga gcgagagcga 780 gggcgctttc tacagcaagc agcggcagca aaagttcaag aagcccgtcg agaagtctga 840 cgtcaagtgt ttccgttgca agaagttggg acactacgcg tcggagtgca cgaacgggac 900 gagcgcgaag agcaaggaga agtctggtaa ccacaagggg aatgcgactt cgttctttgc 960 tgcgttcaac gcggtggagt caccggcagg agccagtggt gactggatca tcgattccgg 1020 agcgacaacc cacatgtgcc ggaacgaagc gatcctggag aaaccatcga gtgtgtcggc 1080 catgatcaat gtggccaaca acgcggaggt cgccgtcgtc gcaaagggtt ccgtcaagct 1140 ggaagctgaa tgtgatggag aaatccggga cttgacgatg accgacgtga tgtgcgttcc 1200 ggacctggcg attaacctgc tttctgtaag taaagtgtgc cagaaaggtt ttcgtgtgga 1260 gttcacaaag gacgcgtgtg cggtgctgga cgcaagtgac acggtggtag cgattggtcg 1320 agagacggac ggtctctaca agctcaccga aaccaagcga tctcgtgcga atctggtgca 1380 agcagacgac cagttcgagc tgtggcatcg ccggatggga catctttctg ctagcggaat 1440 gaagcgatta cgagagatga cgaccggcgc ccagttcgac gtgaaggacg gctgcagtgt 1500 gtaccgtgca tcgagggcaa gcaccagcgt caaccgttca acgcgcgagg gcagagagcg 1560 gagcgagtcc tggaactggt tcactcggat ctgtgcggcc cgatggagac ggagtcgatc 1620 ggaggcagac gctactttct gaccttcatt gacgatgcca gccgcaagac cgtcgtgtgc 1680 tttctgaagt cgaagactga ggtgctggaa gcatttcaaa acttcaaagc gtttgcggag 1740 aaccagacag gtgagcggat caagcggatt cgtaccgaca acggtcgtga gtacgtgaac 1800 cgcgagatgg aagagttcct gcgcaagtct ggtatccacc acgagaagac cgttccgtac 1860 aatccggaac agaacggtct ggctgaaaga cgcaaccgta cgtacgtcga gagagcgaga 1920 agcatgctgt tcgacgctgg cctggacaag agattttggg cggaagcagt ggcaactgcg 1980 tcgcacgtga tcaaccgatc accggcgaag ggactgtcgg tgacaccgga ggagaagttc 2040 acgggcaagc gaccggatct gtcgcatctc cgcgtgttcg gaaccaagtg tatggcgcac 2100 gtaccgaagg agcgacgacg aaagtgggac aagaagtcga ccgggactga cgttgctttg 2160 acggagcctg aggtcggaga tgaagaaatg gagctgatcg tgccaagtat tccgatcgag 2220 gtcggtccag ctgctgaaga ggaagctgag cccgaagaag acgagtcgga cttcagcgat 2280 tacgagtcgg tcgaaggcga tgccgattcg ctggttgagg cggaggaaga accgttggtg 2340 ctccctccac aaccagagga cccacaagtg ttggtaaggc gcagtggtag ggagcacgtg 2400 cccccaggca agtataaaga ttatgtgtgc agtaccctac ctgaaaaaga ttcccaaccc 2460 agacagaacc gttccatccc gatgatggtc gagcctgaca cgtaccgtga ggcgattgct 2520 cgagacgaca gcgaccagtg gaaacgtgcg atggctgcgg agttcggagc gctggtggag 2580 aatcggacgt gggaactaac cgatctgccg aaggggagga aagctctacg ctgcaaatgg 2640 gtctacaaac tcaagacgaa ctcggatgga tctgtcgagc gctacaaagc ccgcttggtg 2700 atcaagggct attcgcagat caagggcgtc gactacagtg agacgttttc acctgtagtc 2760 cggtacgcta ccgtccgata cctgatgtcg ttggctgcca agttggatct aaaggtgcac 2820 cagatggatg cagtgacggc gttcctgcag ggtgatctgg gagaagagga gatctacatg 2880 gagcagccgg aaggcttcac ggaatcaggt gcccccacga aagtctgtaa gctgcggaag 2940 gcgctgtacg gtctgaaaca agccagccgg gtatggaacc agaagctgga tgccgagctg 3000 aagcgttcgg cgtacgacac gtgcgtctac ttcaagctgg agggcaacac ggtccttatc 3060 gttgctgttt acgtggacga tttgttggtg ttctcgaaca accagcagtg ggtgaccaag 3120 ctgaagaagg acctgatggc gcgattccgg atgaaagact tgggcatggc gagcaaagtt 3180 cttggcatgc gagtgaccag aggcgacgac aaggtgatgt tggaccagga gcagtacatc 3240 aacgagttgc tgcagcgatt caacgtggcc aagtgtaatg ccgtctcgac gcctgttgat 3300 ccgactcaga agctgaccaa ggagatgtgc ccgagctctg tcgaggaacg ggagcagatg 3360 gcaaaagttc cgttccgtga actggtaggc ggactgcagt atttggcgtt atcaacccgg 3420 ccagatatct gctacgcggt gaacgcggtc agccagttca gcagcaaccc agggaaggca 3480 cactggattg cggccaagcg agtgctgcgg tacctgaagg ggaccaagat gatgaagctg 3540 acctactcga agcacagaga ccagagcttc gtcggataca gcgatgcgga ctggggaaac 3600 gatcccgaga cccgtcgatc gatcactggc tacgtctaca tggaggctgg tggttcgatc 3660 tcatggagct gcaggaagca gaccacggtg gcgctatcga cgatggaagc ggagtacatg 3720 gccgtctcag ctgctactca agaggcactg tggtggcgcg gactgctgaa ggagctgttt 3780 ggtgtcgctc aaccggtcac gatctactgt gacaacatga gcgcggtgca cctggcggag 3840 aaggagatcg gctactcgcc ccgaagcaag cacatcgata ttcggcacca ctttgtgcgc 3900 gagaacgtcg aggagaagct gatcaagctg gagcacgtcg caacggaggc gcagaaggcg 3960 gacgtgttca cgaaaccggt agctgtttca aaatttcttg aagcacggaa ggctctcgga 4020 attcacggtt gagagggga 4039 // ID LOA-5_CQ repbase; DNA; INV; 5898 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A LOA non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Loa; Non-LTR Retrotransposon; Transposable Element; LOA-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5898 RA Kojima K.K. and Jurka J.; RT "LOA non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 152-152 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 436..2106 FT /product="LOA-5_CQ_1p" FT /translation="MMNELDPKRSANCSQQLEKTEEKEEVKKGKESAVIEE FT KIENMECGGVDDGDGDDLEDQLGDLGLDGGSSVSSSVLDSPQRSMEVDAPE FT DQSAGPGIPPGDNDSLHSSGDTSGSVTPTPVDPAGSNRKRRKPGKPKLTNA FT QLRRMDYFVRNGHPPKEARELALVPVEQNYLNKRQRSKDEPSPNSSGQGKG FT PDKKRIRNRGSEGPAPPPAAQAGMISYKEVAEAIHVAVIPLGYPAKTLTTE FT ELKAIRTQLLDLIIKLETPVVRPKYRNCTFKYGYLILACADEATAGWTKEV FT VNKLQPWEGAELVAVNLADLPKQVLFLGYFQDSLQYTSEDIIRLVQNQNDD FT FRTEQWKVARRTEHSKTIELLIEMDEKSAEQVAAQHFQLNYTFGKARLRKV FT TGSTANSDQKDSATLTGKVQTEHVPQPPAQDSQITERTVLSGAQPSCSQPL FT DRIDVTAKPGTVLQKSESESKQQPLGSSRKTKPRKVPQGASSNKKLQPPGS FT SKNKQSNLQGVGRGPVTGRIQPLESAIAGASKKHGSTAAGNCRPKPGSLAG FT PATGSSSSQQ" FT CDS 2106..5783 FT /product="LOA-5_CQ_2p" FT /note="apurinic-like endonuclease, reverse FT transcriptase and ribonuclease H." FT /translation="MSSIRCLQVNLHHAKGASSVLSRRFTKEQITIALLQE FT PWVNNTKILGLPTQNSKLIYCNTQSKPRTAILLRCGIKYTPITEFIQRDIV FT AITVEVPTATGKQELIIASAYFPGDQDDVPPLETTAFVRYCRDVNKPFLIG FT CDANAHHTIWSSTDINKRGECLLEYLSSNDVNVCNEGDKPTFINAIRQEVL FT DLTLCSSSISEKVKNWHVSDEPSLSDHRHIVFNIEARALETEEYRIPAKTD FT WNAFKEHLQGTRMPINPKIRTPMELETEAENLQGRIIEAFNASCPLKKRRV FT CRDVPWWSEKLADLRKEARRLFNRAKLSGNWDAHRAALTKYNAELRNAKKE FT SKKRFCESIATMPVATRLQKALSKDHTNGLGQLRTEDGSLTKTNKDTLDVL FT MSTHFPGSSTKDEAVTNRPADFNVRRSVPVEAIHLARRMFTSTSIKWALHA FT FEPLKSPGPDGILPIFLQKAGDIILPEMITIFRSSYILGYIPESWCKVKVI FT FLPKAGKPDKTLPKAFRPISLTSTLLKLMEKITDNYIRAEFLKNSPLHRHQ FT HAYQSGKSTESALHHLVTLVEKSLRHQETVLGAFLDIEGAFDNTSYGSIKT FT ALLNRGIDATTTNWIETMLISREISASLGDTTVVVKAAKGCPQGGVLSPLL FT WSLVVDTLLQNLSELGYEVIGYADDVTLIIRGKCDATISCRLQSGLNYIIH FT WCTQEGLSINPNKTVLIPFTRRRKHNISPPVIKGVRLDFSSEVKYLGVFLD FT RKMNWNSHLDYAVKKAISAIWTCSKLFGKTWGLSPQLALWSYTTIARPRLT FT YASLVWWPKVNERTAQAKLNKVQRLACLSITSAMKTTPTAAMEALLHILPL FT HLYVKKEAENGALRQRRHNINYEGDLTGHLRIMKEFDITPLVTTVTDCMEA FT RPNIDTPYDTVETTRRLWCAGGPELPPGTICFYTDGSKIGDLTGAGIFGPG FT IKLSISMGRWPTVFQAEVYAIYSCAVHCLKKNYRHAKIGIFSDSQAALLAL FT KAARVESKLVWDCIDVLRELSRRNKVSLFWVPGHCGIMGNEFADQLARQGS FT SAKFVGPEPFLGTSKCAVKYELKKWEDNQISSTWFATQGCRQAKNFITPKP FT AITNKLIGLKRSELRTVTGLLTGHCPAKYHLKKLGIVDNDICRFCNTELES FT SAHLLCTCGAIATRRLNSLGSRLLSPYAVWRTHPKKMISFMNCIIPEWDKS FT TGQPTTNSLPPVNGYE" XX SQ Sequence 5898 BP; 1755 A; 1470 C; 1428 G; 1244 T; 1 other; ccgcctaggt acagaacacg gatgacaaac ggagcaagct acaatcttac gcgattgtat 60 cttaggaagg gaaatcattg agaactgtaa cttgscggtc gtaagcccgg gtaaaggagg 120 aacgttgaga tggaggctgt tctctgcctc catcttgtaa aaaaatagta cggagaacct 180 actggaggtt atcttagggt agcgtgcaaa attggcatgc tatccaagta aactacctcc 240 tatattagac cccaatctac agtgtcacgc gacccgtgcc aagagatgta tgcctggggg 300 ggtctttaac attgattcca ggtcttaacg gagcctgtgc ggcaccgggg caccccgcac 360 agtatcttgc tccatctgtt ccatgcaggc tctagaacgg atgtttaatt ttccttgcag 420 tttagtgaaa tcagtatgat gaatgaatta gacccaaaac ggagtgccaa ctgctcccaa 480 cagttggaaa agactgagga gaaggaagaa gtgaagaaag gaaaagagag tgcagtaata 540 gaggagaaga tcgagaacat ggagtgtggg ggggttgatg acggggatgg agacgatctg 600 gaggaccagc tcggcgactt gggccttgac ggaggctcgt cggtttcttc atccgttctt 660 gactcacccc aacgatctat ggaggtggat gcacctgagg accaaagtgc tggtcctggg 720 atcccaccag gtgacaacga ttcgttgcat tcttcggggg acacttcggg gagcgtgacg 780 cctactccag ttgatccagc cggaagcaac aggaagcgca ggaagccagg caaaccgaag 840 ttgaccaacg cacaacttcg ccgaatggac tactttgtcc ggaacggaca tccgcccaag 900 gaagcgcgtg aacttgcact agtgccggta gaacaaaact acctcaacaa gcgccaacga 960 tcgaaggatg aaccttcgcc gaacagcagc ggtcaaggta agggacccga taagaaacgg 1020 attagaaacc gtggatctga aggacccgca ccaccaccgg ctgcccaagc cgggatgatc 1080 tcctacaagg aagtggccga agcgatccat gtcgctgtca ttccactagg ataccccgcc 1140 aaaacgttaa ctacggaaga gctgaaagct atccgcacgc aactgctgga cctgattatc 1200 aagctggaga cacccgtcgt cagaccgaaa tacaggaact gcactttcaa atacggttat 1260 ctcatcctcg cttgcgctga tgaggcgacc gctggctgga caaaagaggt ggtgaacaag 1320 cttcaaccat gggaaggtgc ggagctcgtt gctgtaaacc ttgctgacct gccaaaacaa 1380 gtcctattct tgggatattt ccaagacagt cttcagtaca ccagtgagga cattataaga 1440 cttgtccaga accaaaacga tgacttccgc actgagcagt ggaaggtcgc tcgcagaacg 1500 gaacactcga agaccatcga actgctgatc gagatggatg agaaatccgc ggaacaagtc 1560 gcagcgcagc acttccaact gaactacacg tttggtaagg cacgtctgcg gaaagttacg 1620 ggatcaacgg ccaactccga ccagaaagat tctgctacct tgacagggaa ggttcaaacg 1680 gaacatgttc cgcaaccacc tgctcaggac agtcaaatta cggagaggac ggttctatcg 1740 ggcgctcaac caagctgctc gcaaccactt gaccggattg acgttaccgc gaagccaggg 1800 acggttctgc aaaagtctga gtctgaatcc aagcaacaac cacttggaag tagtcgcaag 1860 acgaaaccaa ggaaggttcc gcaaggtgca agctcaaaca aaaagctcca accacctggt 1920 tctagcaaaa acaaacagag caatctgcaa ggtgttggtc gtggtcccgt aactggccgc 1980 attcagccac tggaatctgc catcgcggga gcatccaaga aacacggctc tacagctgcg 2040 ggtaactgcc gcccaaaacc cgggtcacta gcgggtccgg caactggcag ttcctcatcc 2100 caacaatgag cagcattcgc tgcttgcaag tgaatctcca tcacgcgaag ggcgcctcca 2160 gtgtcctaag ccggagattc acgaaagagc aaataactat tgctctacta caagaaccgt 2220 gggtaaacaa cacgaagatt cttggattac caacacaaaa cagtaagtta atctactgca 2280 acactcagtc taagcccagg actgccattc tgttgcgctg tggtattaaa tatactccca 2340 ttacagaatt catccagcga gacatcgttg caatcacggt agaagtcccc acagccacgg 2400 gaaaacagga gctgataatt gcgtcggctt acttcccagg tgaccaggac gacgtaccgc 2460 cacttgaaac cacagctttc gtgagatact gcagagacgt caacaaacca tttctaattg 2520 gatgcgacgc caacgcacat cacacgatct ggagcagcac ggacataaat aaaagaggtg 2580 agtgccttct tgaatacctt tcgtcaaacg atgtcaatgt atgcaacgag ggagacaaac 2640 ctacttttat caacgctatt cgacaagagg tactggatct caccttgtgt agttcctcaa 2700 tctccgaaaa agttaaaaac tggcacgttt cggacgagcc cagcctatct gaccacaggc 2760 acattgtctt caacatcgaa gccagggcgc tggaaactga ggaatataga attcccgcta 2820 aaacggactg gaatgccttc aaggaacacc tgcaaggtac gcgaatgccg attaatccaa 2880 aaattcggac tcctatggaa ctggaaactg aagcggaaaa tcttcaggga aggatcatag 2940 aggcgttcaa cgcaagctgt ccgttgaaaa aaaggagggt gtgccgtgac gttccctggt 3000 ggagcgaaaa gctggcagat cttcgcaaag aagcaagacg cttgttcaac agggctaaac 3060 tctcgggaaa ctgggatgcc catagagcag ctctcaccaa atacaacgct gagttgcgga 3120 atgctaagaa agaatcgaaa aaaagattct gtgaaagcat tgcgacaatg cccgtagcta 3180 cccgactaca aaaggcgctg tccaaagatc acactaatgg tcttgggcaa ttaagaacgg 3240 aagacggtag tctcacgaaa acaaacaagg atactctgga cgtgcttatg tccactcatt 3300 ttcctggatc atcaactaag gatgaagcag ttaccaacag accagcggac ttcaacgtgc 3360 gaagaagcgt tccggtagag gcaatccatc ttgctcgacg tatgttcaca agtacgtcga 3420 tcaagtgggc tctccatgct tttgagcccc tgaaatctcc tggacccgac ggtatacttc 3480 ccatcttctt acaaaaagcg ggcgacatca tcttgcccga aatgatcaca atatttcgct 3540 ctagctatat tctaggatac atcccggaaa gctggtgcaa ggttaaggtg atcttcttac 3600 caaaggctgg caaacctgac aagactttgc ctaaagcctt cagacccata agtctgacat 3660 ccacgctgtt aaaacttatg gaaaaaatca cggataatta tatccgcgcg gagttcctga 3720 aaaactctcc gctgcacagg catcaacacg cttaccaatc gggtaaatct acggaatccg 3780 cgttacatca tctggtgacg cttgttgaaa aatcactcag gcatcaagaa acggttctag 3840 gggcgtttct ggacattgag ggagctttcg acaacacatc atacggatca atcaaaacgg 3900 cacttctcaa cagaggaata gacgctacaa ccacgaactg gatagaaacc atgcttataa 3960 gcagggaaat ttctgcatcg ctaggagata caactgtagt cgtgaaagcg gccaaaggat 4020 gtccgcaagg aggggtactc tcacccttgc tttggtcgct agttgtagat acacttctac 4080 aaaatctctc cgaactaggg tacgaagtca taggatatgc tgacgatgtt accctcatca 4140 tcaggggcaa atgtgacgcg acgatctcgt gcaggttgca gtcaggtctc aactacatca 4200 ttcactggtg cacacaagag ggactatcca tcaaccccaa caaaaccgtt ttaattcctt 4260 tcactaggcg gagaaaacat aacatttctc cgccggtgat taaaggtgtt cggctcgact 4320 tcagttctga agtcaagtat ctaggagttt tcctggatcg caaaatgaac tggaactcac 4380 atctggatta cgccgttaaa aaagcaatat cggccatctg gacctgcagt aaactgttcg 4440 gaaagacctg gggcctctcg cctcaactgg ctctctggtc ttacaccacc atagcacgac 4500 cgaggctaac atacgcgtct ctcgtatggt ggccaaaagt gaacgagagg acggcgcaag 4560 caaagttgaa caaggttcag cggttggcct gcctctccat cacaagtgct atgaaaacaa 4620 caccgaccgc tgcaatggag gcactgctcc acattctacc actacatctt tacgtcaaga 4680 aggaagcgga gaatggagca ctacgacagc gaaggcataa catcaactac gaaggtgatc 4740 taacaggtca cctgcgcatc atgaaggagt tcgacataac tcctctagta acaacagtta 4800 cagactgcat ggaagcgaga cccaacatag acactccata cgatacagtt gaaacaactc 4860 gtagactctg gtgtgctggg gggccagaac ttccaccagg aactatctgc ttctatacag 4920 atggctccaa aataggtgat ctcactggag ctgggatctt cggaccaggg atcaaactat 4980 ccatctcgat gggtagatgg cccaccgttt ttcaagcaga agtctatgct atatactcct 5040 gcgctgtgca ttgccttaaa aagaactata ggcatgcgaa aatcggtata ttctcggaca 5100 gtcaggcagc actgctagct ctgaaagccg caagggtgga gtccaaactc gtttgggact 5160 gcatcgacgt actacgggag ctgtcccgcc ggaacaaagt atctttgttc tgggtacctg 5220 ggcactgtgg gatcatgggg aacgagtttg ccgatcagct cgctaggcag gggtcatcgg 5280 ctaaatttgt tggtcctgaa ccatttctgg gtacatcaaa atgtgctgta aagtacgagt 5340 taaaaaaatg ggaggataac caaatttctt ccacatggtt tgcaacacag gggtgtagac 5400 aagccaaaaa cttcattaca ccaaaacctg caattaccaa taagcttatt ggtctaaaac 5460 gtagtgaact gcgcactgta acggggctgc ttacagggca ctgtcctgca aagtatcacc 5520 tcaaaaaact cggtatagtt gacaacgaca tatgccgctt ctgcaacaca gagctggaaa 5580 gctctgcgca tttgctctgc acctgtggag caattgctac tcgaaggctg aattccctag 5640 gaagtcgcct tctctcgcca tacgccgtat ggcgcactca tcccaaaaag atgatcagtt 5700 tcatgaactg tatcatacca gaatgggata aaagcacagg tcagccaaca accaactcgc 5760 tacccccagt caatgggtac gagtaaccgg ctaactgtgt agagtaaaca gggatacatc 5820 acaaaagtta gtctcaagga cggacgaggt gatatcaaac ccaaaagccc aatttgggca 5880 caaaaaaaaa aaataaaa 5898 // ID BEL-176_AA-LTR repbase; DNA; INV; 496 BP. XX AC supercont1.6; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-176_AA_; KW BEL-176_AA-I; BEL-176_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-496 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.6; Positions 2938818 2939313. XX SQ Sequence 496 BP; 154 A; 96 C; 110 G; 136 T; 0 other; tgttaccggc aacactgttg acggcgaaac ggtggtgaaa ccactgtccg ttccctcgca 60 actgagcggt tctcgcggag tggaatgaca gttcgcgccg ttcccgatac aacagagaag 120 acaaattcat tgatacagat gcatagcata cgtctataca gcacaattag tcagcatttt 180 gaccgagcag agtgtgaaat ttgtgaagaa tataccgatt tttggatgct cagtaagatc 240 agtttgctgc tacagtcaac gccgtaaagt tagtgaagga atcagtcagt tgaggtacgt 300 ttagaaattg tacggtctac gttttccccg gataccgata tacctgtaca gtaggacaca 360 gcaaaatact gaaattgtaa gaacccattt gtatacttaa ttaatttaaa ctaaaatgaa 420 tttaaaataa atttcagctt tgtagctgct aagtctgcaa ctagaagaac ggtgtttgtt 480 ctgacattcg ggaaca 496 // ID Gypsy-6_AA-LTR repbase; DNA; INV; 387 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_AA_; KW Gypsy-6_AA-I; Gypsy-6_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-387 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 982-982 (2011). XX DR [2] (Consensus) XX SQ Sequence 387 BP; 96 A; 99 C; 99 G; 90 T; 3 other; tgtagatagc tcagtgtgaa ctaacaggtt taaatattac aggatagtta gatagtttcc 60 tccgagtgct gaaaagttag actgtagtga taccaaagtg cagtcagaaa ggtattaaag 120 tcgtgcgttg cgtgaagtgt gtgctctctt taaccccatt gtcgaacagc cgttcgacgg 180 ctttatttct cggatttagg aacccgcagc agtgaccgac catccggcaa accaccggcc 240 cacgcgttac tcgtggccga aggggccgcg ccgccaacat cgcgcccatc gctctgtgac 300 amcgtcgaag aacatccgtc cgatcacgtg tccagaaaac cgacgctaas gaagttcgtc 360 cmgtctggta ggtggattga tcccgca 387 // ID Gypsy-28_AA-I repbase; DNA; INV; 4848 BP. XX AC supercont1.6; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_AA_; KW Gypsy-28_AA-LTR; Gypsy-28_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4848 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.6; Positions 749281 754128. XX CC Positions [3860-4321] - Integrase core CC 'TACCT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 803..2314 FT /product="Gypsy-28_AA-I_1p" FT /translation="MMKNGKKDLLLHYAGSNVQQLFDTLPEVPGTEMRGPL FT LNIEHYTPNMTSYEEARAKLNEFFLPKENSTYERHLLRQLKQQAGENIDAF FT TIRLPVQAERCGFGDRVEENIKDQIIQNCQSAMLRCDLLKRGDASLEEVLS FT IAKIFETVAQQEKSFVSGSESKPDVDVNRIDTTPYGKRRRISDLKQLECHR FT CGYFGHIAKDDRCPAKRKVYNKCGGKDHFAKKCRTKYPISQHARTDGNEKI FT QPHGIGRSNKIDRKDDETHTVKHIVDEHVEYVFHLTTPDGNGEVQCKIGGV FT STYAVIDSGSKYNLLSQSNWEQLKDKKVIVSNQRREAQMVFKAYGGQSLPL FT IGVFTATVKLGNARNSAEFIVVKGNGKILIGRDTATAMGVLKIDIPVNEVE FT FGAKSKKLGTIKDILVDIPIKADAVPVLHQGVIEPVNEPAKWISPVVVVPK FT GDKDVRICVDMRRANEAVERENHPLPTFEDFLPHLAKAKVFSRLDVRNAFH FT QVCHPCFE" FT CDS 3719..4804 FT /product="Gypsy-28_AA-I_2p" FT /translation="MTIMKQRLRAKVWWPKLDKQVERYVRSCRGCMLVAAP FT SAPEPMKRRALPSGPWQHVAIDFLGPLPSGHYLFVVVDYFSRYIEVEIMTK FT TDSGETIKRLNSIFARFGLPMSITADNGPQFSSEEFRIFCASSNIKLISTT FT PYWPQQNGEVERQNRSLLKRLTISQATNADWIEELNKYLLMYRSSPHSTTK FT KTPSEMLSGYNIRDRLPSIYQPKDDDEETADRDKIAKEKGKLYADERRSTK FT PSPISEGDNVLLKKMAKTNKLTPNFEPNVFKVLKRKGGDVIVSSEESGAKY FT RRHVSHLQQILSDDDTLSIPEEADDTPTPDSSLSKESRSEDLTRKQVRGLK FT RVIRKPAYMRDYVQAVKTD" XX SQ Sequence 4848 BP; 1496 A; 908 C; 1184 G; 1260 T; 0 other; ttttggcggc gagaataaaa aaaaatggcc agctcatttg gtaagtaatt gatattcaaa 60 attcttatat ttggaattta ctcggaagag gaaaaaaaga ctttgaagag taaacagcaa 120 aatcaaaatt ggcttaaatg cggaaacatt atgtcggatt atgagttaat atgcgttgaa 180 gcagataatt ggttatggta cagatgctta aatgcagtga gtcgaaacct gttttgcgaa 240 agagcaagga atgtttgcaa gatggtcgct tcaatgcgga gataaaaaaa aaaacatctg 300 attgatctat aaagaaaagg gcaatgattc tcaaagtgag cagcctatat ggtagtattg 360 gtataagacg acaaaggtac gatgcacatg atctgccagt gagcgagccg tatattttgc 420 actggggcaa acgtaaccaa tactgatgca cacgaaaact gacagatgat ttgacaagaa 480 gacgacgagg ttgaaagttg gacagaaagt gttattttga agaaagactt gtttacagtt 540 ccttttctgt tctgcaattg caaatcgtaa aaagtatgta aacagtgtga ttaattaata 600 tttattgaga cctaaaaagt tgaaaactga taagtttaag gaaatgctaa agttgccgtg 660 tttggatctt tttcagacat gatacgctca ctcggattgc agctgcaacc gtttgattat 720 gccacccaca cggacaatgt gggcattgag tggcgaaagt ggttaagatc gtttgagaca 780 atgatccgcg cgagtcggat tgatgatgaa gaatggaaaa aaggacttgt tgttgcatta 840 cgctggatcg aacgtccagc agctatttga tacgctacca gaagtacctg gaaccgagat 900 gcgtggtccg ttgctgaaca tcgaacatta tacaccgaac atgacgagct acgaagaggc 960 tagagcgaag ctgaacgagt ttttcttacc gaaagagaat tcgacctacg agcggcattt 1020 gctacggcag ttgaaacaac aagcgggtga gaatattgac gcctttacta tcagactgcc 1080 agtccaagca gagcgttgcg gctttggcga tagggtagag gagaacatta aagaccagat 1140 cattcagaac tgccagtcag ctatgctacg ctgcgatttg ctgaaacgag gagatgccag 1200 tcttgaggaa gtgttgagca tcgcaaaaat ttttgagact gttgctcagc aagaaaagtc 1260 ttttgttagt ggaagtgaat caaaaccaga cgtcgatgtt aacaggattg acactacacc 1320 ctatggaaaa aggaggagaa tcagtgattt aaagcagctt gaatgccatc gctgcgggta 1380 cttcggacat atcgctaaag atgacagatg cccagcaaaa cggaaggtgt acaataagtg 1440 cggcggcaag gaccattttg ctaaaaagtg tcgcaccaag tatccgatca gtcaacacgc 1500 aagaacggat ggaaacgaga aaattcaacc acatggaatc ggtcgtagca ataaaattga 1560 tcgcaaagat gacgaaactc acacggtgaa gcacatagtc gatgagcatg tggaatacgt 1620 cttccattta accacacctg atggtaacgg agaagtgcag tgtaagattg gtggtgtaag 1680 cacatatgct gtaatagact caggatccaa gtacaatctg ctcagtcagt cgaactggga 1740 gcaattaaaa gataagaaag tgattgtatc gaaccaacga cgggaagcgc agatggtgtt 1800 taaagcttat ggagggcagt cgttgccact gattggtgta tttaccgcta cagttaaact 1860 gggcaatgcg cgtaattcag ctgaattcat cgttgttaaa ggaaacggta aaatcttgat 1920 tggtcgcgat actgccacgg caatgggtgt tctaaagatc gacatacctg tgaacgaagt 1980 ggaatttggt gcgaaatcca agaaactagg aaccataaag gacatcttgg tggatattcc 2040 gatcaaagcg gacgctgtac cagtgcttca ccagggagtt atcgagccgg tgaacgagcc 2100 ggcaaagtgg atttctccgg tagttgtggt gccgaaggga gataaagatg tacgcatttg 2160 cgtggatatg cgacgcgcga atgaagccgt ggaaagagag aatcaccccc ttccaacgtt 2220 tgaagatttt ctaccgcatt tggcgaaagc taaggttttc tctcgtttgg atgtgagaaa 2280 tgctttccat caggtatgtc atccttgttt tgagtaattt cagtctgaat tgacacgttt 2340 acgtttaccg atgtgcaaac aattgatcat aattgaattc gatttatttt tttatttttt 2400 tctcgatgca ctacctgaga tttgttttac aataaaactg tcagttgttg acttcaataa 2460 aagagatttc aaactttgga aaattgagct agtttgtttt attttaggtt gaaatttcta 2520 aacgatcgcg tgaaattaca acatttatca cgcgcagagg cctgtttcgc tataccagac 2580 taatgtttgg tataaactgc gcgcctgaac tattccaaaa gaccatggaa caagttctta 2640 gtggttgtga aggttgttta attttcattg acgatgtcat agtgcatggc tctgacaagg 2700 aagcgcacga catgaggctt aaaatgattc ttcggcggct agatgactgg aatgtgacga 2760 tgtgactcta aatgatgata aatgtatgta tgggtgtctg aaatgaagat tttgggacat 2820 attctctcgg cggatggaat taaacccgat tcagataaac tggaatccat tcggcgcttc 2880 cgcgaaccaa aatccggaga agaggttaga agctttttgg gtctggtgaa ttaccttagt 2940 aagtttattc ccgatcttgc gacactaacg tatccactac ggcaactaac ggttcagaag 3000 caatgttttg tctggggagt ggagcaacaa acagcatttg aaaaactgaa ggaacatatg 3060 actcgtccga cgacgcttgg ttactttgat gtttcggatc gcacacaatt ggtggctgat 3120 gctagcccag ttggtttagg aggtccttat tcagataaat aaaacaggcc agcaaaagtc 3180 tttcagacta gctctcgtgt gggcggttga gcgctttcac ttttatttgt acggtcgttc 3240 gtttgaactg ataaccgatc acaaaccgtt ggaaactatt ttcggatcaa gatcaaaacc 3300 atgcgctaga atcgagaggt gggtagttcg gctgcaatca tataaggcaa cagttgttta 3360 ccgcccggga aaatcgaata ttgctgaccc gttgtcccga ttagcagtta cgggtagcac 3420 gactggaaaa actttcgacg aatatgctga gcattatatt gcatgggttg cttcaaatgc 3480 attaccggta gcgattaaaa tttcagaaat agagaaagct tcagattccg acaaaataat 3540 tcagtcgatt cgagttggta ttgaccaagg tgtttggtca gaagatgcgg atccatttaa 3600 aatatttgct acggaactat gctttgctga taagattttg ttgcgaggaa ctaggatagt 3660 gataccagaa accttgaggg agcgaacgtt aggcttggct cacgagggtc atcctggcat 3720 gaccatcatg aagcaacggt tgagagcgaa ggtatggtgg ccaaaactgg ataagcaggt 3780 tgagcggtat gtcaggagct gtcgtggatg catgctggtc gcagccccat ctgcaccgga 3840 accaatgaaa cggagagcgt tgccatcggg tccatggcaa catgtagcca ttgattttct 3900 cggcccgctt ccttcagggc actatttgtt tgtagttgtt gattatttca gccgctacat 3960 agaagtcgag attatgacaa agactgattc tggcgagact atcaagcgtt tgaattcgat 4020 tttcgcacgg ttcggcctcc ctatgtccat tactgctgat aacggtcccc agttctccag 4080 cgaagaattc cgtatttttt gtgcctctag caatatcaaa ctgatcagta cgaccccgta 4140 ttggccacag cagaatggcg aggttgagcg acagaaccga tctctgttga aaaggctgac 4200 gatcagtcag gcaacaaatg ctgattggat cgaagaatta aacaaatatc tgcttatgta 4260 tcgctcatcc cctcactcaa ctacgaagaa aacgccatcg gaaatgcttt ccggctacaa 4320 catacgggat cggctgccat ctatctacca accgaaagat gatgatgaag aaactgctga 4380 tcgagacaag attgcgaagg agaaagggaa attgtatgcc gacgaacgcc ggagtaccaa 4440 accaagtcca atctccgaag gtgacaatgt tctgttaaag aaaatggcaa agacaaataa 4500 gcttacgcca aattttgaac caaatgtttt caaagtctta aaaagaaaag gcggagatgt 4560 cattgtatct tcagaagaat ccggcgcaaa ataccgcaga catgtttctc atctacagca 4620 gatcctcagt gacgatgata cactttcgat tcctgaagaa gcagatgaca caccaacccc 4680 agacagttcg ttgagcaagg agagccgttc agaggatttg actaggaaac aagtacgtgg 4740 acttaaacgg gtcatccgaa agcctgctta tatgcgagac tatgttcaag cggttaaaac 4800 tgattagagt gccgagaaac taaatttata taaaaaagag aaagtgaa 4848 // ID Kiri-18_AAe repbase; DNA; INV; 4597 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-18_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4597 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 713-713 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >98% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 293..1144 FT /product="Kiri-18_AAe_1p" FT /translation="MNTNRALTRAAHTKMSKDRPTLAVPLGSTSNDNQNKK FT RSREDLDRTENDVESFDQLLVRIKQMIDEGNAKIENKIESSNAALVTEIST FT LRDEVNQLKVDYARDFNSLCESHEKTAEQVRRSKDNAGKILRSNDLILTGV FT PYRPTEKTDEILREIATTLGYNDSDVPPVFTKRLARIPIAAGSTPPILLQF FT AFRASKDEFFHRYFASKNLSLLHLGFDTDKRIFINENLTESTRNIKGAALK FT LKRNGHLQNVFTKDGTVYVKPLEDVPAQPVFSLDQLKIFGGRN" FT CDS 1548..4397 FT /product="Kiri-18_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MMPVSHTTNIARTQSSIADIPRAVMNVALRDDCLNLC FT HLNAQSLCARQLSKLDEFKRCFANSKVDLICVTETWLNENITDSTVAVEGY FT SILRNDRMTGRGGGICIYYKSGLNCRVLDNSDMLSGHASVDRTEHLFIEVR FT VSEDKLLLGVIYSPPDVDCSYLIDQKLSELSLDYEKIVLIGDFNTNLRKNC FT PKTTRFCEVLDNFGMFCVNTEPTHFYPGGSSLIDLLITNDINFVLNFNQVS FT APTFSHHDIIFSSLNVLRYRNDNPRMIRDYSRIDYPSLQHYLNGIDWSLLY FT SITDSDTALDFFNSVIIQLFDNLVPLRAPRHKNNAPWFNNDILNAMIARDV FT AYREWIRSKNLLDHHQFKRLRNKVTQLINTAKSNYMSSNLESAISSKELWV FT KLKRLNVTGSSNSSAKFHHSKDEINTFFGENFTRDSTHPSIPPSNSSGFTF FT TPCSELEVANSIFSISSDAIGLDGIPLKFIKIILPFIITPLTYLFNLFISS FT TKFPRAWKAAKVIPIRKKPAGSDLNNLRPISILCSLSKVFEKIVKTQIQEY FT IQRFSLLSPYQSGFRSGHSTTSALLKVHDDIHQCIDKKGVAFLLLIDFSKA FT FDRVSHAKLLHKLSHQFNFSRDAALLIKSYLCQRTQVVELDGSLSNTIFIL FT SGVPQGSVLGPLLFSLFINDLPSILKNCSIHMFADDVQLYICSTDYNTYDL FT AQLINYDLERLSKWSSSNLLPINSTKTKAMFISRRQIRATLPDLVINGDKI FT DYVDKACNLGVIFQSNLEWDCQINAQCGKIYAGLRHLRLTANMLPVSTKLM FT LFKSLLLPHFSYGSELVLNASAADFGRLRVSLNHCVRWVFNLSRYSNVTEF FT QRQLLGCSFYRFFRLRCYITLFKIINHGPQYLVDKLQSFRSARVRNYILIH FT HNSSHYSNTFFVRAINLWNQLPTDIKSISSLSRFRAECMSWLNEGN" XX SQ Sequence 4597 BP; 1346 A; 902 C; 839 G; 1510 T; 0 other; agtgttctga gtgatggtaa gcggtagaac gtgagttgtt cgtggtcgca gttcagttga 60 agtttttggc atgtaatgtg attccatgct acgtttttca ccgatgtagt gcttctttac 120 tcgaactcct ttctatacga caagatagct gattccatcc tgcactagta atgtgagaaa 180 agtgaaagtt attatcacaa ttcctggcat aaacttttca tcttccatat tgagactctc 240 atattgtgtt gtgtactaaa ctacgtgtac taaaccatat ccactcactt ggatgaacac 300 aaatagagca ctcactcgtg cggctcacac taaaatgagt aaagacagac caacacttgc 360 cgtcccactg gggtcaacca gtaatgataa tcaaaacaaa aagcgctcac gagaggattt 420 agaccgtact gaaaacgatg tggagagctt tgatcaacta ttggttcgca ttaagcagat 480 gatcgacgaa gggaatgcga aaattgagaa caagatcgag tccagcaatg cagcccttgt 540 tactgaaatc tccactctgc gtgacgaggt aaaccaactt aaggtcgatt atgcacggga 600 cttcaatagt ttgtgtgaat cacacgaaaa aactgcagag caagtgcgac ggtccaagga 660 taacgccggt aaaatactga gatcaaatga tctgattctc actggtgtac cgtacaggcc 720 cacggaaaaa acagatgaaa tcctgcggga aatagctaca actcttggct acaatgactc 780 ggatgttcct cctgttttca ccaaacgtct ggctcgtatt cctattgctg ctggttctac 840 gccgccgatc ctgttacaat ttgcgttcag agcatccaaa gatgaatttt tccatcgtta 900 tttcgcatca aaaaatctaa gcctacttca tcttggcttc gatactgata agcgtatttt 960 tatcaacgaa aacctcactg aatctactcg caacatcaag ggtgctgcac tgaaactcaa 1020 acgtaatggt catcttcaga acgtgttcac taaggacggt accgtttatg tgaagccact 1080 cgaggatgtt cctgctcagc cggttttcag cttggatcag ttgaagattt ttggaggacg 1140 aaactaaccc tatcctttat gaattctttg tattcctgtc aatgttacat ccttgactcc 1200 tttcctaccg attccatgct gccttccttc ctaaaagtat ttcttcatct gccggtgatc 1260 atcggtgctg acctgatgct gctgttgttg ctgctgtggg tgttgttgct gctgtggatg 1320 ttgttgctgt tgtcaccacg ttttgcacac ctgtatcact gtacctgcgg tagcttttag 1380 tagttataca actgaatttg cgatcaaaaa ccacattcta ataagttgaa ataagattag 1440 tatgaatata aatgaatttg aaatgcatgt tatcccatta ttagatttag tgttcccttc 1500 agttccccag cttatgctct cacgatcgtt cttgttatta cccttcgatg atgccggtta 1560 gtcatactac caatattgca cgtacgcaaa gtagtattgc cgatattcca cgtgccgtta 1620 tgaatgttgc cctacgcgat gattgcctaa atctttgcca tttgaatgct caaagtctct 1680 gtgctcgtca attaagtaag ctcgatgaat ttaaacgttg ttttgctaac agtaaagttg 1740 atttaatatg cgtaacggaa acatggctaa atgaaaacat aactgattca acagttgccg 1800 tcgaaggtta cagtatatta agaaatgacc gtatgactgg tcgaggtggc gggatctgta 1860 tttattataa atctggtctt aactgcagag ttcttgataa ctcagatatg ctgtctggtc 1920 atgctagtgt tgatcgtact gaacatttat ttattgaggt acgagtcagt gaggataaac 1980 ttctgcttgg tgtaatttat tctcctccgg atgttgactg ctcttatcta attgatcaga 2040 aactctccga actatcgctt gattatgaga aaattgttct aataggtgat ttcaacacaa 2100 acttaaggaa aaattgtccc aaaacaactc gtttctgtga agtactagat aactttggaa 2160 tgttttgcgt taatacagaa cctactcact tttatcctgg gggcagctca ttaattgatt 2220 tgcttataac aaatgacata aattttgttc tcaattttaa tcaagtgtca gccccaactt 2280 tctctcatca tgatataatt ttctcatctc tcaatgtatt gcgttacaga aatgataacc 2340 ctagaatgat cagagactat agtcgaatag attatccctc gttacaacac tatttaaacg 2400 gcatagattg gtccttgctg tatagtatta cagattctga tacggctctt gatttcttca 2460 acagtgttat aattcagctt tttgacaatt tagttccatt gcgggctcct cgtcataaaa 2520 ataatgcacc atggttcaat aacgatattt taaatgctat gatcgctaga gatgttgctt 2580 accgtgaatg gattcgtagc aaaaatttac tagaccatca tcagtttaaa agactacgca 2640 ataaagtaac ccaactaatt aatacagcaa agtcgaacta catgtcttcg aacttggaat 2700 ccgcgatttc aagtaaagaa ctctgggtga aactcaagcg cctcaatgtc actggaagtt 2760 ctaacagtag cgctaaattt catcattcca aagatgaaat aaatactttt ttcggtgaaa 2820 acttcacacg tgattctaca catccttcaa tacctccttc caattctagt ggctttactt 2880 tcacaccttg cagtgaattg gaggttgcga attctatatt ctcaatttct tccgatgcaa 2940 ttggtttaga tgggatcccg ttaaaattca taaagattat tttacctttc atcatcactc 3000 ctttaacgta tctttttaat ttatttattt caagtaccaa gtttcctcgc gcttggaagg 3060 ctgcaaaagt aattccgata cgcaagaagc ctgccggatc tgatctgaat aatcttcgcc 3120 caataagtat tttgtgttcg ctgtccaaag tttttgagaa aatcgttaag actcaaattc 3180 aagaatatat tcaacggttc agtctgctca gtccttacca gtctgggttc aggtctgggc 3240 acagtaccac atcagcgctc ctaaaggttc atgatgatat tcatcaatgt attgacaaaa 3300 aaggtgttgc ttttctgctt ttaatagact tctctaaagc ctttgataga gtctcacatg 3360 ctaaactcct gcataaatta tcacatcaat tcaactttag tcgcgatgct gctttactga 3420 taaaatcata tttatgtcaa cgcactcaag tagtagaact agatggaagt ctgtctaaca 3480 caatttttat tttgtcaggt gttcctcaag gatccgtttt gggtcctctg ttgttttcgt 3540 tgtttataaa cgatttacca tcaattctga agaactgctc aattcacatg tttgcagacg 3600 atgtgcagct ttacatttgt tccactgatt ataacacata tgatttagca caactcatta 3660 actatgatct ggagagactt tctaaatggt cttcaagtaa tttacttcct ataaactcaa 3720 caaaaactaa agctatgttc atctctcgtc gacaaattcg tgcaacttta cctgatttag 3780 tcattaacgg tgacaaaata gattatgttg ataaagcttg caatcttggc gtgatttttc 3840 agtctaatct tgagtgggat tgtcaaataa atgcacaatg tgggaaaatc tatgcaggct 3900 tacgacactt acgactcact gcaaatatgc tacctgtttc aaccaaatta atgttattca 3960 aatcactttt gttaccgcat ttttcatatg gttcagaatt agttttgaat gcatcagcag 4020 cagactttgg tcgactgaga gtttcactaa accattgcgt tagatgggta ttcaatttat 4080 ctagatactc taatgtaact gaatttcaac ggcaattgtt gggttgttcg ttttatagat 4140 tcttcagatt acgatgctat atcaccttat ttaaaattat taatcatggt ccacaatact 4200 tagttgacaa gttacagtct ttcagaagtg ctagagttcg aaactatata cttatacatc 4260 ataactcctc tcattatagc aatacttttt tcgtacgtgc aattaatctt tggaatcagc 4320 ttccaacaga tattaaatct atttcttcac tatcaagatt ccgagctgag tgcatgagct 4380 ggctaaatga agggaattag tttaagttag atgaattgtt taacggggat atcatgtttt 4440 tgattgttgt cttttgtaat taaatcatat tacgatatag atttgtatga attggataga 4500 atgtggtgtg agcaacgaaa aatccgattg tagaattttt ataagggtga tcccttactc 4560 tacaagtata attgaataaa taaataaata aataaat 4597 // ID hAT-N12_AP repbase; DNA; INV; 492 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N12_AP. XX NM hAT-N12_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-492 RA Jurka J.; RT "hAT-type DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2112-2112 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 492 BP; 162 A; 50 C; 62 G; 218 T; 0 other; cagggctagg atttatatgc aacagcatgt tttttttgct acttctgatt attgattttg 60 tcagtaatgt ccacgaatca cctcatgatg agataaatgg tggtattttt attttgcatg 120 tttttgcata ttttggataa aatgcatatt attgcatata ttgaataaaa ttaatattat 180 tgcatatttc gattataaga atgtaaaaaa tgcatgtttt atagtatatt tcgtaaaact 240 tggttttttt gtcaaatata aattttattt catgtaacac gcacgtagtt aatgtgaatt 300 atacatttta tttctaaatt atattatttt aagacaaaca gcctatcatt gatgtatttt 360 aataaataaa tatttttttt gtaactatat tttagtgttt atcttataaa aaattaaatg 420 catatttttg catattttgg tatataaaat gcatatttta caatttttta ttgcatataa 480 atcctagccc tg 492 // ID P-1_Lgigantea repbase; DNA; INV; 4102 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW P; DNA transposon; Transposable Element; P-1_Lgigantea. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4102 BP; 1395 A; 717 C; 678 G; 1312 T; 0 other; catagatatc ctataaataa ctagatggaa cgtcatagcg aagagccaat gagagcagtg 60 aagaaccgga agtcagccat taagctacca acgtctcgcg ttaggcctat tgacgtcgtg 120 tttttctaca agttttacta ttaaatatgc ccggctgttc ggcgataaat tgttcaatta 180 gatctggaag agagaacatc ccacagggaa ttacatttca caggtaaaag ctgttttttt 240 ttatgatgaa aaaacttttg aaaggaaaaa aaaaccgcaa ttgaagcaag gtcaaataac 300 ttgaatgtat gaccttttca ataggggtga aaccttcacg gtacgacttt tgttccgtat 360 cggttcattt gggtccacgg cacttatgta ttatcagctt ctggagtaga acaaaattat 420 acacgtgtaa tccaccgatt ttgtcaggtt tctaccgatt ttgaatgtcc tcaaccgaca 480 taccggcggg agatctaccg attcccgtgt ccgcatttta accatgaatt aataaaaata 540 aatagtaaat gaaagtgttt tgattctatc aatcacaaaa ttctactgga ttaactgaaa 600 aaatacttcc cctcttttac tgccatgttt actgccctga catcatgtgc tcaatgtatt 660 ttactactgc cccagaaatg ccattttagc caacattact tttttaaaaa cccacctttc 720 cttcatcttt agtaacttac aacttataga actaattatg cttatttgta tttcagatat 780 cctttaaaag atccagaaag actcaagaag tggctggtaa atttgaagag agttgatttt 840 gaaccaacta agaacactat tttgtgttcc agacattttg aggaacaatg ttttttgaaa 900 actcttgagc gtacctacct taaagatgat gctgtaccaa caatatttga ttttcctgat 960 cacttaatga aggtgagcat tatcaaaacc ttcactaaat tactaagata ctttactcac 1020 cttctttact cttaatcggt ttcattaatt ctaatttcaa aacaggacag cttgtaccta 1080 aataagcctc aaattgcctg catatgctta ttaaagtgat tgttcaccaa atacaaatca 1140 aatcattttc ctgacaaaat catccacaat taatcttgga taatacatta agcatggatt 1200 aattacattt cttaagaaca aattctagtt actatgcata catgcagaat actggtaaaa 1260 ctacccatca tcactctcct ttcctcctta caaatatatt aatacctacc atattgttga 1320 cattgcagaa acaagtagag agaaagccac cagctctcag acaagaagaa aatctacaaa 1380 cctgtagtga tattgatacc atccctgttg aacaaaaaca taattattac aaaagcggtt 1440 ctcccagaaa attaaaaaga aagatcaatg aagctgaaga tagaattata attcttaaaa 1500 agaaacttaa atcaagtaac caaactagag acagagtaaa gaagaaagtg tcaagtttac 1560 aatctgtagt tgatgcccta agaaaagaga aactaatatc agatacttgt tctgttattt 1620 tagaggacat attttctgga acatcactag aagttatgaa acagattgtt agtggcaaac 1680 catcaaaaaa gtactcgtca gagttgaaat catttgcaat gactatgttc aacagacttt 1740 ccagttatcc ctgcctcatc ctaaacaaat cagaaagtgg tattcttccg tctctgctga 1800 cccaggcttc acaaaaccat catttgatgc cctgaaagta aaggtagaag aaggtgcagc 1860 taatggtgca ggtaatatgc gctctgacat tggatgaaat ggcaataaag aagcatgtag 1920 aatggaatgg caagtcattt tctggttatg ttgatattgg gaatggcact tctgattcag 1980 attcatctcc tacagctaaa gatgctttag tcttcatggt tgtctctcta aacgacagat 2040 ggaaggtacc aatagcctat tctttattga tggtttgact ggagaagaaa aagccaatat 2100 tgtaagagaa tcgctgtcca gacttgcaga tataggtgtc aaaataacat ctgttacttg 2160 tgatggtcct tcatcccatt ttaccatgtt tcgatccttg ggaacgaaga tagaaagttt 2220 ttcaatagac acacattttc cctaccctac atttcctgaa cagcaagtgt atgtgatgtt 2280 tgatatttgt catatgctta aattggtaag aaacactttg gctgatggtg gcatcatagt 2340 aaatagcaag ggagagaaaa tagaatggcg ttttatagaa gaacttcata aacttcagga 2400 aaatgaaggt ctgcgattgg ccaataaaat taaaaaagcc cacattacaa ttggaagcag 2460 cagaaaatga aggtcaacct tgcagctcaa gtatttagtt catctgtagc agatgcaatc 2520 caatattgcc atgaagtgct aaaacttact gaaggtacaa tcgaatttat tcgcataatt 2580 gacagactct ttgacttact taattcgaga acaccatacc aaaaaggttt taaatcgccc 2640 ttgagagtta gtaatataga cttttggcaa ccatttttga atgaagcatt cacctatatt 2700 tacaacttaa cagatactaa taaaaacaag atgatccatt caagaaggaa aactggtttc 2760 attggttttt tagttggaat taaaagcatc caaggtttag taaatgatct tttgatgggc 2820 ccaactcctg aactgaatta tatattaacc tacaagctaa gccaggatca tattgaattg 2880 atttttggtg ccataagagc tgcaggtgga tttaataaca atccaactac catacagttt 2940 acagcagcat ataaaagact tctatttaga ggaacacaaa tgattggagg aaaaggaaat 3000 tgtgttaacg attcacctac aaatatttta tcatcaattt ccgacatgta cagtaataat 3060 ggtatcaccc gcacaaaacc agacattgcc ttcctaaaaa aatacggatt gaacgctgat 3120 gatccagctc ttccagatga aattccacca aacttattta catgttctga atataaggag 3180 gctgccattt cgtatattgc tggctatgtt gtcaaaatgg ttcgcaggga aattaaatgt 3240 gttgattgtt ctttagcact tgaagatgca cgtacaaata gggaaatttc agtaaacagc 3300 tttgttaaat ttaaagatag gggtggatta tgtattcctt catccagtgt tttaaaagta 3360 tgttatgcta cagaaaaatg tttgtcttca atactttctt ctaataattt accacaagga 3420 caatttataa gtgctgtatc aacatttgtt ctagctaacc ttaatataaa attaatcttt 3480 acttgtttgg atgaacataa tttgcaatat tcttttgaag aaaatcacat tttaagattg 3540 ataaaatgca tttgtaataa atattctaaa ataaaattgc atcatgctgc aaaaaccctt 3600 aatgaaaaaa acagtgaacc caaaataagg aagaaattat caaaactcat attatttaag 3660 catgaataga ttaatgagat ttgattttgt atcattgaaa caattttatt attccctatt 3720 ttttctgttt gatttttaat gactttaatt ctacctaaca ggaaattgtc cgaaataaac 3780 tttcttctac tttggagtgt gattttagtt acattcagta gcctattcag taatgtcaga 3840 aagtggaacg atcgtatcta tcttgatttt tttttactaa aatttctgaa cgttgatcat 3900 tcaatgtatc ttatttcata tttaattttt ttttttttga aatgtgaaat aaattgcata 3960 aaagtgtgtt tttcttgtat tattatcaaa attattgcac ttccctattg acgttttgtt 4020 atggcttggt agcttaatgg ctgacttccg gttcttcaca gcaccgacgc aagcttctct 4080 ccagtctata tatagctcag tg 4102 // ID Copia-3_ACA-I repbase; DNA; INV; 3799 BP. XX AC AEYA01000017; XX DT 23-MAR-2011 (Rel. 16.03, Created) DT 23-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the canthamoeba castellanii genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_ACA_; KW Copia-3_ACA-LTR; Copia-3_ACA-I. XX OS Acanthamoeba castellanii OC Eukaryota; Amoebozoa; Centramoebida; Acanthamoebidae; OC Acanthamoeba. XX RN [1] RP 1-3799 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Acanthamoeba castellanii genome."; RL Direct Submission to RU (23-MAR-2011). XX DR Genome; AEYA01000017; Positions 5297 1499. XX CC Positions [2032-2532] - Integrase core CC 'AGTTC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 694..3432 FT /product="Copia-3_ACA-I_1p" FT /translation="MLAIETGGHKMSERSRLMILATALKSINPAFEAQFNA FT LADDTATEAVLKALDHVEASHHASPSSDTILYAGQGQHHGRPAFQHRQAPY FT GAHAQTPPNSLPPRHPTGPRLPRQQRRDNWGSDRGRWCSLHNSSGHSLEEC FT REAAARGLAGATPRATPQTAHTAALAAMTAAAVAAASSFLLHTPQSSILAA FT MTAATAAAHPSDFAIVSESAAGAALSSEPTPNKPFQNYSVFSLVAPIHLPT FT IADTSATQHTTNQRDLLHNFIPLALPSRLVCADGGHIECLGHGTLRSTTIV FT DGQASNIMMCDVAYIPGASHTLISPQQLLDAGCRVAFDQQCGFLFYLDDKL FT HLFSYWSGNLYYFNITFTPADIATPHAALVAATSPLCLDLIHHRLGHVSER FT RCHEFVRQSADLSEREKRAALSSSLSPICDVCLAGKQTVRGVSRVPCTNHS FT VRGAYPGDLLSLDLIGPMRHQSAGGHAYLLTVTDSYSHMHFVRPLPNKTST FT AIHAALQAIAASFPPAVRIHQVHLDNARELNALMGAWILDIGAKREPTPPY FT TSEYNSVVEQFNREVMTRVQCLLFNARLPSEWWAEAARYACNVINLTPTQA FT NLDSASPYLLWNGVAPPSRHIHVFGAPGRMRLHAHERHKISKQSVPVRFMS FT VVDYSLSTYWVYIVSQRRIVDTCNVVFNERCASCPPDDRPPPTHICVPLLL FT DNDDDTLATLSSPILGGDAIANTPADSTETASTHEPPLPDLASPDCPTPMA FT SGAAASVSPPAPLDVSPAGPASHTATAPATATPPECTPTSTQIIGSARGST FT TTSSKQIERAAAASAKYSGETVAVTRSGRNVRQPARLNLLSSLAHEAPTHA FT AHTMLQLRRMDHALNLAQHTTTVPQTFREGEGSQVEVWECTCHQSQSQPSA FT RSHMRQSGL" XX SQ Sequence 3799 BP; 790 A; 1319 C; 939 G; 751 T; 0 other; ggttatgggc ccatgctaca tctagtgcca ctctcaacaa caatcaccac atctctgtgt 60 acaaattctt tcaatctggt tcaccattac accatgagta acaacaccaa caccaacatc 120 aacggtcttc aagtcacacc gcttcgagac caatcgtcct ttgctgagtg gatttgaact 180 gtatgccttg ctctccatga acaaggacac tcgaaatatg tcaccaccga ctacactgtg 240 atcgtccagg cggtggatga ctatgacgct gagcatatgg cacatctgtt tgaggctctt 300 gacatcacat caacaatctc catcgcgttg gaatccactt ctggcaagaa gtcgttggcc 360 gcgaaggcga agtcggaaga cgtcaagaag aagcagattg ccgccttcgt tggagatgaa 420 ctggccctca aggccgctcg cgagctgtgt aagggacgtt gcaaaacctt tgtcttcatc 480 tgcagtacta ttgcccctga gctacttgag gacatatccg gcatgagtga cggctgccct 540 ctagctctct ggatggcgat caataattga tatggcaaca acgacgtcgc cctccaagcc 600 gcgcggcgcg accttggatg gcctgacttc tacctggttt gccccggtga cactgtgcgc 660 tccatctggg cacggatcca ggccctcatc aacatgctcg ccatcgagac cggaggacac 720 aagatgtcgg aacggagccg cctcatgatt ctcgcaacag ccctcaaaag catcaacccg 780 gctttcgagg ctcaattcaa tgcactcgcc gacgacacgg ccacagaggc cgttctcaag 840 gcactcgacc acgtcgaggc ctcccatcat gccagcccca gctctgacac catcctctac 900 gccggccaag gccagcacca cgggcgtccg gcattccaac accgtcaagc gccgtacggc 960 gcgcacgcgc agacgccgcc gaactctctg ccaccgcggc acccgactgg gcctcgcttg 1020 ccacgccagc aacgtcgtga caactggggc agcgacagag gccgctggtg ctccctgcac 1080 aactccagtg gtcactcgct ggaggaatgt cgcgaggctg ccgctcgtgg tctcgccggc 1140 gccacgcctc gagctactcc ccagaccgcg cacaccgccg ctcttgctgc gatgactgcc 1200 gccgccgtcg cggctgcgtc ttctttcctg ctacacacgc cgcagtcgtc cattctcgcc 1260 gccatgaccg ccgccaccgc tgcagcccac ccaagcgatt tcgccattgt cagtgagtct 1320 gctgccggtg ctgctctctc ctctgagccg acacccaata aaccattcca gaactactca 1380 gtcttctccc tggtcgcgcc catccatcta cctaccatcg ccgacaccag cgccacgcaa 1440 cacaccacca accaaaggga cctgctccac aacttcattc ctctcgcgct cccgtcgagg 1500 ctggtgtgcg ccgacggcgg acacatcgag tgtctggggc acggcaccct gcgcagcacc 1560 accattgtcg acggccaggc cagcaacatc atgatgtgtg atgtcgcata catacctggc 1620 gcttcacaca ctctgatctc acctcagcag ctactggatg ccgggtgccg ggtcgcgttc 1680 gaccagcaat gcggattcct attctacctg gacgacaaac tacacctgtt cagctactgg 1740 agtgggaacc tctactattt caacatcacc ttcactccag ctgacattgc cacaccacat 1800 gccgcgcttg tggcggccac ctcaccactg tgccttgatc tcattcacca tcggcttggc 1860 cacgtcagtg aacggcgatg ccacgagttc gtccggcagt ccgccgacct cagcgagcgt 1920 gagaagcgcg cggctctgtc atcctcgctg tcgcccatct gtgacgtctg ccttgctggg 1980 aagcagaccg tgcgtggcgt cagcagggta ccatgcacga accactccgt gcgaggcgcc 2040 taccccggcg atctcctctc acttgacctg attgggccga tgaggcacca atcagctggt 2100 gggcacgcct acctactcac cgtcaccgat tcgtactcgc acatgcattt cgttcgcccg 2160 ctgcccaaca agacctccac cgccatccat gcggctcttc aggcgattgc tgcctcgttc 2220 cctccggcgg tgcgcatcca tcaggtccat ctcgacaacg cacgcgagct caacgcgctc 2280 atgggtgcat ggatccttga catcggcgcc aaaagggagc ctacaccacc ctacacgtct 2340 gaatacaaca gtgtcgtcga gcagtttaac cgcgaggtta tgacgcgcgt tcagtgcctg 2400 ctcttcaacg ctcgcctccc cagcgagtgg tgggctgagg ctgcacgcta cgcatgcaac 2460 gtcatcaacc tgacgcccac tcaggccaac ctggacagtg catccccgta tctgctttgg 2520 aatggcgtcg caccgccgtc cagacacatt cacgtgtttg gcgctccagg tcggatgcgc 2580 ctgcacgcac acgagcggca caagatctcc aagcagtctg tgcctgtgcg cttcatgagc 2640 gtggtggact actcactgtc cacctactgg gtctacattg tcagccaacg ccgcattgtc 2700 gacacgtgca atgttgtctt caacgaacgc tgcgcgtcgt gtcctccaga tgaccgccca 2760 ccgcccacgc acatctgcgt ccctctgcta ctggacaatg atgacgacac gctggctaca 2820 ctgtcttccc cgatcctggg cggtgatgca attgccaaca cgcctgccga ctccacagag 2880 accgcgtcga cccatgaacc gccattgcct gacctggcta gccctgactg tcccacgcca 2940 atggcctcag gcgctgcggc cagtgtctct ccgccagcgc ccttggacgt cagtccagca 3000 ggtcctgcgt ctcacacggc aactgctccg gccacggcaa cgcctccgga gtgcaccccg 3060 acaagcacgc agatcatcgg ctcagcacgc ggctccacca caaccagcag caagcagatc 3120 gagcgggcag ccgccgcaag cgcgaagtac agcggcgaaa ctgtggcggt cacccgatcc 3180 ggccgcaacg tgcgccaacc tgcgaggctg aacctgctct cctctcttgc ccacgaggcg 3240 cccacacacg cggcccacac aatgctccaa ctacggcgaa tggaccacgc gctgaacctc 3300 gcccagcaca ccaccactgt tccacaaacc ttcagggaag gcgaaggttc tcaagttgaa 3360 gtatgggagt gtacatgtca tcaaagtcaa tcccagcctt ctgcgagaag ccacatgcga 3420 caaagtgggc tttgaagatc ttgagggcct tcagcttgag aacccagcat gtgctgatgg 3480 cctgtcaacc agacaagcag gcccctctgg tgtggaactg cactctcgac accttcctcg 3540 tccagattgg cttcacagtg gcatccatgg acctgtgcct ctacgccttg cacaatcact 3600 gtgctagcag tgatggtgac cagggaccct atgacccaga gcttcactgc accttcatcc 3660 actccagctc cagtggcaga ccgctcgtca tcctatccgt ctacgttgac aatctcctga 3720 ttgtcagttc gcccgtcgac gttgatgcct ccgttagaaa ctgaagtgaa cgtgcaatct 3780 ctttcaactc aagggggag 3799 // ID Waldo-3_AAe repbase; DNA; INV; 6674 BP. XX AC . XX DT 05-OCT-2010 (Rel. 15.1, Created) DT 05-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Waldo non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele6; KW Waldo-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6674 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6674 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (05-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as R1_Ele6. CC [2] Consensus update and characterization of target sequences. CC This consensus is generated from 30 sequences with >98% identity, CC and ~100% identical to the original sequence in [1]. Both sides CC are (AC)n microsatellites. Renamed as Waldo. XX FH Key Location/Qualifiers FT CDS 891..2339 FT /product="Waldo-3_AAe_1p" FT /translation="MEESIIEIEEGAANPFAKGGLVRSPPSQLGQQQRNQE FT EQQVQRQLQQQQQQQPREQQQQLQQQQQAQVSAWSKVPKLPRVVAAKKLVD FT ELHEFVDKRSNVHKDIKTLVLKIQGTLGLAVKEWAAVAQKVESTEKELSAA FT KTALEIRQVPIQQTAMPITFARETNAKRDKSQRTESVPFTPKRPRASPGDA FT RPGGSKKHKDTRVNESGPANAATTKEGNDTPWKIVKAKNRKTKTKSRSEKQ FT SLFKGRKRGEALIVKASDDSYEKVLRAMRTNPELEQLGADVRKVRRTRTGD FT MILELKRDPMASSSSYKELAEKAMGNTVEVRAVCPEAVLECKNLDAISTDD FT DVRVAMKEQCMLGEVQMQIRIRKGPSGMLIASIRLPIEAAVKALKTEKIKV FT GWSVCPLSVSQKPEACYRCHEYGHLARFCKGPDRGNLCRRCGEEGHKAQAC FT RKPPKCMICANGDNNNHVTGGLRCPAFMKATATKPQWR" FT CDS 2294..5389 FT /product="Waldo-3_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPGLHEGDCYQTTVEVIQINLNHCDTAQHLLRQSVAE FT YKCDVAIISEPYRVPAGDGNWIADNAKSVAIWTVGRYPFQEVVHRADEGFV FT IAKINGVFFCSCYAPPRWSIDQFNEMLDKLTEELTDRRPVVIAGDFNAWAI FT EWGSRFTNSRGSSLLEALARLNVDVANDGTTSTYRRDGRESIIDVTFCSPG FT LSGSLNWRVSEEYTHSDHQALRYNIGSGRQLEARAIPSKERRWKTSQFDKE FT VFVEALRLERNAQHATANELTAALARACDATMQREGKPRHGRRPAYWWNST FT IADLRTSCFRARRMMQRARNDAERADRRPAYKAAKTALNKEIRLSKKACLE FT ELCRNANSNPWGDAYRVAMAKLKGPAVPPDRCPEKMKTIIEALFPRHEPTS FT WPSTPYGAQDNDEEEAQVTNDELIAVAKALKTRKAPGPDGIPNVALQAAIQ FT ENPDMFRIVLQKCIEEGNFPDIWKRQKLVLLPKPGKPPGDPSAYRPICLLD FT TVGKVLERVILNRLTKYTEGENGLCDMQFGFRKGRSTVDAIRTVVEAMDTA FT RKQKRRGNRYCGVVTLDVKNAFNSASWAAIAESLHRLEVPEYLCKILRSYF FT QNRTLIYETDAGKRCTTVTAGVPQGSILGPTLWNAMYDGVLKLSFPRGVKI FT VGFADDVVLVVIGESLEEVEILATEAVDAVEEWMRGKKLVLAHHKTEVVMI FT SNRKAVQKARISVGQCTIDSKREVRHLGVMIDDRLSFNSHVDYACERAVKV FT ISALSRIMPNNSAISSSKKRLLASVSTSVLRYAGPVWVTALQTKRNRSRLN FT STFRLMAMRVASAYRTISSEAACVIAGLIPISLQLEEDSECYRDRRTRGIR FT KRARDETMRKWQQQWDSAENGRWTHRLIPCLSTWVNRKHGEVNFHLTQFLS FT GHGCFKKYLNRFGHARSPLCTGCGDVDETPEHVVFECPRFEHERAEMTSVV FT GNDVNVHNIVQRMCADEEKWDAVNRTIVQMMSLLQRRWREEQRHSTQRTAH FT GSESLGQPESSDRPWRLTSQPVEESI" XX SQ Sequence 6674 BP; 1837 A; 1652 C; 1957 G; 1228 T; 0 other; gggttgcaaa atggtgagtc gacaaccagg aaggagcgtc caacatagct ctggtcctca 60 caagccccta cctcacgctt ccacgggtct aacgatgaca aagaccgcca gctaagggtt 120 gcgtacttag ctggtagtgc aacctgggca ctgttgtcct tctgacatca gctagagtga 180 gggggtgcca ggtgggagct tgggattttt accctttcca agcgaaactc atacatacag 240 ccgtatggaa taccaccttt ggtacttcct tcgggaggtg gccgagcgtc tggtccatct 300 gaccaggggc tcaagcatgg tctacctagg atgtggcggg ggttcatcag tgggctctgg 360 tgaatctcta caaaaaacca cagatctgca agtagccctg aacaagcgac ctggtaccgc 420 tttcaaagta ccttagccct ctggagtgcc aaccggcaca tcaggatgga tgccagtaaa 480 atcctgatta tggtatactg gtcacggcgc agaacaacgg acaggacacg gattacggat 540 tacacaccac attgcgacac agagctgaca gtcgtgttat tctattcccc gagtaaggat 600 gggtgatacc aacttgaaac ggcgagtagg ctaatggtgc gattgccccc gtcccggtaa 660 aaccttggca ggcctctgga tacgttcgga tcccgtccat cttaagtgat gagtactagg 720 ctcggatgac taaaccccga tctacgccga tccggctctg aacacgattc tcataaggga 780 tcgtgtcaac ccttgcatgg cctccctgct tgcaaagata accatgggat catgaaaggc 840 gactacgtac ggcgggaaat gatagcggcc cagagggggg accctgaaga atggaagaga 900 gtataattga gatcgaagaa ggcgcggcga accctttcgc taaaggtggc ttagtacggt 960 caccaccatc gcagctaggg caacagcagc gcaatcagga ggagcagcag gtacagcgac 1020 agcttcaaca gcagcagcag cagcagcctc gagagcagca gcaacagctt cagcagcagc 1080 aacaggctca ggtcagtgca tggtcgaagg tgccaaagct accgagagtt gtagcagcca 1140 agaaactggt ggacgaattg cacgagttcg tcgataaaag aagtaatgtg cacaaagaca 1200 tcaagacttt ggtgttgaag atccaaggaa cccttggact agctgtcaag gaatgggcgg 1260 ccgttgcaca gaaagtggaa tcgactgaga aggagttgtc ggcggccaag actgccttag 1320 aaataaggca agtgccgatt caacaaacgg cgatgccaat tacttttgcg agggaaacca 1380 acgcaaagag ggacaaatcc cagaggacgg agagcgtgcc cttcacaccg aagaggccaa 1440 gagcatcacc aggagatgca agaccgggtg gttccaagaa acacaaggac acccgcgtca 1500 acgagtcagg cccagcgaac gcagcaacca ccaaggaagg caacgatacc ccatggaaaa 1560 tcgtcaaggc aaaaaatcgg aagacgaaaa ccaagagcag gtccgaaaaa caaagcctct 1620 tcaagggccg gaagagaggc gaggcgctta tagtcaaggc aagcgatgac tcgtacgaga 1680 aggttctccg tgcaatgcgg acaaacccgg agctcgaaca gctgggtgca gacgtgcgga 1740 aagtcaggcg cacccgcacc ggagatatga tcctggagct aaaacgagac ccgatggcca 1800 gcagctcttc ctacaaagag ctcgccgaga aagccatggg taatacggta gaagtgagag 1860 ccgtgtgtcc ggaggcagtc cttgaatgca agaacttgga cgcaatctct acggacgacg 1920 atgtaagggt tgccatgaaa gagcagtgca tgctaggaga agtgcagatg cagatccgca 1980 taaggaaggg gccatccgga atgctaatag catcaattag gcttccgatt gaagcggccg 2040 tcaaggcact taaaacggag aagattaaag taggctggtc ggtatgtcca ctgagcgtct 2100 ctcagaaacc ggaagcatgc tatagatgcc acgagtacgg ccatctggct agattctgca 2160 aggggccgga cagaggtaat ttatgcagaa ggtgcggaga agaaggccac aaggcgcaag 2220 cctgccggaa gcctccaaag tgcatgatct gcgctaacgg agataataac aaccacgtta 2280 caggaggcct gcgatgcccg gccttcatga aggcgactgc taccaaacca cagtggaggt 2340 aattcaaatt aacctcaacc actgtgacac agcacaacac ttgctgaggc aatctgtggc 2400 ggagtacaag tgcgatgtgg ctatcatctc ggagccatac cgtgtcccgg ccggcgatgg 2460 gaactggata gcagacaacg caaaatcagt ggcgatatgg acggtgggaa gatatccttt 2520 ccaggaagtg gtacatcgtg cagatgaagg ctttgttata gccaaaatta acggagtctt 2580 cttctgtagc tgctacgcac ccccgcgatg gtctattgac cagttcaacg agatgctcga 2640 caagctgact gaagagctca cagaccgcag accagtcgtc atagccggcg atttcaacgc 2700 gtgggccata gaatggggca gtcgcttcac caactcaaga gggagcagtc tactggaggc 2760 cttggcaagg ctaaacgtcg atgttgccaa cgatggtacc accagcacat accgtcggga 2820 tggtcgagag tcaattattg acgtaacttt ctgtagccct gggctgtcag gaagtttgaa 2880 ctggcgagtg agtgaggagt atacgcacag tgaccaccaa gcgcttcggt acaatattgg 2940 tagcggacgg cagttggaag cacgtgcgat cccttcgaaa gagcggaggt ggaaaacgtc 3000 gcagtttgac aaggaggtgt ttgttgaggc gttgagactg gagcgcaatg ctcaacacgc 3060 tactgcgaat gagttgacag cggcattagc acgagcgtgc gatgcaacca tgcagaggga 3120 aggtaaaccg cgacatggtc gccgcccagc ttactggtgg aactcgacga ttgccgacct 3180 gcgtacaagc tgctttcggg ctaggagaat gatgcagaga gcccgcaacg acgctgaaag 3240 agcagatcga agaccagcgt acaaggcggc gaaaaccgct ctcaacaagg aaattcggct 3300 cagcaagaaa gcctgtctgg aggagctctg tcgcaacgcc aattcaaacc cgtggggtga 3360 cgcctacaga gttgcaatgg cgaagttgaa gggcccagct gtaccacccg ataggtgtcc 3420 ggagaagatg aagaccatca ttgaagcgct attcccccgg cacgaaccga cgagctggcc 3480 gtccacacct tacggagctc aggataacga cgaagaagaa gcccaggtaa cgaacgatga 3540 gctgatcgcg gtggcgaaag ccttaaaaac caggaaggct ccgggaccag acggtatccc 3600 caacgtagct ctgcaagcag caatccaaga aaacccagat atgttcagga tcgtactgca 3660 aaaatgtatc gaggagggta acttccccga catttggaag cgacaaaagc tggtgctgct 3720 gcctaaacca ggcaagcctc ctggagatcc ttcagcatat agaccgatat gtctgctgga 3780 cacagttggc aaagtgctgg agcgagtaat cctaaacagg cttacgaaat acacggaggg 3840 agaaaacggc ctgtgtgaca tgcagttcgg ctttcggaaa ggtagatcaa ccgttgatgc 3900 catccgaacg gtcgtggaag ctatggatac ggcgcggaag caaaagagga gaggaaatcg 3960 gtactgcggt gtggtcactc tagacgtaaa aaatgccttc aacagtgcca gttgggctgc 4020 gatcgccgaa tcgctgcaca gattggaggt ccccgagtat ctttgtaaga ttttgaggag 4080 ctactttcaa aatcgtacac tgatctacga aacagacgct gggaaaaggt gcacgacagt 4140 cacagcgggc gtcccacagg gttctattct tggcccgact ctctggaacg ctatgtatga 4200 tggagtgttg aagctgagct tcccccgggg tgtgaagatc gtcggcttcg cggatgacgt 4260 ggtgctcgta gtcatcggcg aatcactaga ggaagtagag atacttgcta ccgaagcagt 4320 agacgccgtg gaagaatgga tgcgagggaa gaagcttgtg ttagcccatc acaaaaccga 4380 agttgtgatg ataagcaacc gaaaggcagt gcagaaggcg agaatctcgg ttggacagtg 4440 taccatcgac tcaaagcggg aagtcagaca cctgggagtg atgatcgatg atcggctgag 4500 cttcaacagc catgtcgatt atgcatgtga gagggccgtg aaggtgatat cagctctatc 4560 ccggatcatg cccaacaact cagcgatcag cagcagcaag aagcgactac tggcgagcgt 4620 gtcgacgtcg gtactcagat acgctggccc cgtgtgggtg acagcactgc agacgaagag 4680 gaaccgctcc cggctgaaca gtacgttcag actgatggcc atgcgtgtgg cgagcgcgta 4740 tcgtacgata tcatcggagg cagcctgtgt gattgcagga ttgatcccta taagccttca 4800 actcgaagag gacagcgagt gctacaggga ccgacgcacg cggggaattc ggaaaagagc 4860 ccgggatgaa accatgagga aatggcaaca gcaatgggat agcgctgaga acggtagatg 4920 gacccaccgt ctaattccgt gcctgtcgac gtgggtaaac agaaaacacg gagaagtgaa 4980 tttccacttg acgcagtttt tgtctggcca tggctgtttc aagaaatacc tcaataggtt 5040 tggacacgca agatcgccgt tatgcaccgg gtgcggcgat gttgacgaaa cccctgagca 5100 cgtggtcttc gaatgtccgc gcttcgagca tgagcgagcc gagatgacat ccgtcgtcgg 5160 caatgacgtt aatgtgcaca atatcgtcca acgaatgtgt gccgacgaag agaaatggga 5220 cgcggtaaac agaactatcg ttcagatgat gtctctttta caacgaagat ggcgagagga 5280 gcagcggcac tctacacagc ggacagcaca tggctctgaa tcgttggggc agccggaatc 5340 gtcggaccga ccttggcgct tgacaagtca gcctgttgaa gagagcatat gaaggagacc 5400 aacgggcgcg accggagtag gctagatcct ccgccgggga ctagaccgag tagagcgggc 5460 gtagcgtaat atcggtaata agtcgtcaag gcgcctgcaa accggaagtc atcctccaac 5520 cggaattgca ggaccgacct cggcacttac cgaccaacat cgcgtcggag taggctatgt 5580 ccttccgccg gggactagcc gagtagatcg cgaaacagct ccggcaaatg gtagtcgggg 5640 cgcctgtgaa ccggaagttt cccaccaccg gaatcgcagg accgacctcg gcatcagccc 5700 ggtcagcttc ggagtaggtt aagttcaccg tcggggacta actgagtaga acgcgtagtt 5760 gcaccggcaa atggtcgtcg gggcgcctgt gaaccggaag tttccctcca ccggaatcgc 5820 agaaccgacc tcggcatcag cccggaaagc atcggttcgg gagatcttcc gccgtcgggg 5880 aaatcttcgt cggagtagga tagatccacc gtcggggact attctgagta gtccgtagcg 5940 agtcaccggc ttaagtcgtc ggggcaccag tgaaccggaa gccatcctcc aaccggaatc 6000 gctggatcga cctcggcact tcaccggtca gcagcagtga tgcgggaaca aacgcgtaac 6060 gttatcggtt tcgggagaca ttcctacgtt aggaactctc cgctggagta ggctagctcc 6120 accgtcgggg actatgccga gtagcgatcg tgacaaccac cagttttggg tcgtcggggc 6180 gccagtgaac cggaagctac cctccaaccg gaatcgctgg accgacctcg gcatccaact 6240 ggtccaacct ggagaactcg agaatcggcg gattaacagc aggcgcaaca gttgcgcagg 6300 gctctggcga gatgtcagcc ccaaacacgt cgcagcagag caacgaagcg tagcaatgac 6360 agaagcggta gtagaaacat ggccctggcg cgatgccagc agtacgaaga gagcaaggga 6420 agtacaggag cgtaagaagc aaggagcagt gcagtgctta gcacaatagc ctcccccatg 6480 aagtaatgcc aagaggcagt tccgggggga atggctcgtg ggcgaaggtg gactttagtc 6540 ggtataaagg agtccgacac tctggcgcac gatggacgaa ttcgatatga ctagcctcca 6600 tatcgaacgt gaaacgctgg gttagttcgt aaatgtaatt taccaccttc taaaactaaa 6660 aaaaaaaaaa aaaa 6674 // ID BEL-80_AA-I repbase; DNA; INV; 5964 BP. XX AC supercont1.324; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-80_AA_; KW BEL-80_AA-LTR; BEL-80_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5964 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.324; Positions 208251 202288. XX CC Positions [4994-5578] - Integrase core CC 'AAGTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 606..4541 FT /product="BEL-80_AA-I_2p" FT /translation="MPENSESKQQQIGKSGASSVLGQSTLDSSTPSSKKQK FT KKFRLTEKLATMENMQRLVRRRGAAKGKVSRILNTIRPNEEEVVQLTEAEI FT KVYMRKLEAAHKDYNDAHDQITNVVSIDDYDQHEQQYEEFDILHDSVAILL FT EDQLNRINAAAAANANARNQQVPVVIHQPLRMPVPTFDGRYESWPKFKAMF FT KDLVDKGPDPPAVKLYHLDKALVGSAAGLIDAKTINEGNYAHAWQILEERF FT ENKRHAIDSHIHGLLNLKRMTKKSHLELRSLVDECNKHVEGLKFLERDFDG FT VGEDFVIHLLAAALHNDVRHMWESTIKHGELPDYDEMLAFLKEQTFILERV FT EASSQKSSSAPIKSVSAANKPPMQKVYAAVSSSETEMKCDFCGKAHANHNC FT AEFKALHVPQRLLKVRERNVCFNCLRRGHRGVDCSSDKSCLKCKRRHHSLL FT HAEEKTKQVPEPSSQPLPKPAVEQEPTPSTSTTATCSSLAPRSQQVVLLTA FT VVDVMDKNYQPHPCRILLDSGSQVNLISRAMANMLGLKLNSSNVTMFGVNS FT TQTRSSNCGVVHLSSRFKDFHAKVKCLVTDKVTSDLPSSVINISALDLPAG FT VQLADPKFFQPSKVDMLLGNEWFMKLMMPGEITLADNLPVLRETQFGWVIG FT GVFEEGALADEAVYSHTVTMDELSQSIERFWEVEDVVGADKHGSEEEECEE FT HFNATHRRDASGRYIVELPLKESVSELGDSRPLALRRFHALERKLSQHPDL FT KKQYQDFMDEYESLGHCKEVDISKDPAGIIKWYLPHHAVLRPSNTTTKCRV FT VFDASAKVSGRSLNDVMKIGANIQSDLQSICLRFRLPLHVMATDVAKMYRQ FT VLVNQRHTPLSRVFWRKSSSDSLRVLELTTVTYGTASAPFLATRALLQLAL FT DEGSKYPVAADIVKNNFYVDNALFGFDDLNEAKEAQVQLIQLLKAGGFHLH FT KWASNNPVLMERIPEGDRDELVSIDESGSSEVIKTLGLMWNPKMDALQFVS FT LPTMSENRATKRQVLSLISRMFDPLGLVAPVIVIGKLLMKSIWKEELEWDE FT ELTGNLKKRCDKFLSALRSVTNLRIPRHVVVTGAVAFELHGFGDATLEAYG FT ACVYIRSIVPGQAPVVQLLCAKSKIVPKTVLTIPRKELLAALLLHRLVKKV FT LAALALPFQDISLWSDNQVVLAWLAKNPEHLEVFVRNRVSEINSTGHQFKW FT KYVNTLENPADIVSRGQSTHALEKNDLWWNGPLFLRSEVYQVVAPEPLSDE FT DVRELRQVAATSAVTVLERLPVFTKFESSGSCNEFWHTFCVFAGMQRRRWP FT " FT CDS 4541..5962 FT /product="BEL-80_AA-I_1p" FT /translation="MKRIVARFPTVLEMRHSLKAITRVVQLQHFGKEVAMI FT ESGEYCKRFSALNPFLDDGMIRVGGRLRHSNLPYGVKHQWVLPKNDETIQR FT LIQAIHRENLHIGPSALLAQLRRQFWILGARSAVRKVTRNCVRCFRLKPPC FT ASQFMGDLPVARCDKAPAFVKVGVDFAGPILIKQTGRKVAPVKGYVSVFVC FT MVTKAIHLEVVEDLSADAFIAALQRFVSRRGVPEQIFSDNGTNFVGARNEL FT NELYRLFKGQVTELKINEFCQPRQIEWKMIPPNAPHMGGIWEAGVKSVKTI FT LKKKCQSSLLTMSEFSTLLCQIEAQLNSRPLYAPSEDPSELEPLTPGHFII FT DRPLTAIPEPSYDGIPANRLSRWQYVQRLRQEFWKRWSEEYLLELQVRQKW FT SKKKENILPGTVVVIKDDNLPPQQWKLGKVESTYTGSDGLVRVVDVRTKSG FT VLKRPIHKLAPLPILDNQAVNENKVPAGE" XX SQ Sequence 5964 BP; 1539 A; 1319 C; 1625 G; 1481 T; 0 other; taattttggt ccagtcgaac cggatcgcgc gtggtaaagt gtctttccga ctgttccgga 60 accggatgca gaaccagtga agtgggattc cgaaaaagtc ggaaaattta aactataatg 120 gtgtgagcaa aatggccgcc attgaagaag cgaagtagta gagcgacacc gccattttgt 180 tgaagagcag tgtggtgtgt tgtcgccatc ttggaaatgt gctgatggtt taagccatcg 240 aagaaaagtg gtgttacgag caattaagtg ctcaactcct ggggaataca cccccgcaaa 300 gcgaagaaac aagttgtaaa gctggtgttc cggattggcc actgtgacca ttaattagtg 360 gaataggttg cgtgagcata gacttggctg tgttcccaac caacgtcgag cagtgggctt 420 ggcttgaagt ggtgtccgcg gtggcattcc ggaaaatagc gttgagtaaa gtgggttgtt 480 tcaaaatccg tgtttcggtt cggtcacttg cgccgattag tgtccggttg cgtgagcata 540 gacttggctg tgttcccaac catcgtcgag catagggctt gacttgatgc ggtatccacg 600 gtggcatgcc ggaaaatagt gaatcgaagc agcagcaaat cggaaaatcc ggagctagtt 660 ctgtacttgg acagtctact ctggattcct ccactccatc aagtaagaag cagaagaaga 720 agttcaggtt gactgaaaag ttggccacca tggagaacat gcagagactc gtccgccgtc 780 gtggcgcagc caaaggtaag gtttctcgta tccttaatac tattcgccca aatgaagagg 840 aagtggtcca gctcacggag gctgagatca aagtgtacat gaggaagttg gaagctgccc 900 ataaggacta caacgatgca cacgatcaaa tcactaacgt ggtttcgatt gatgactatg 960 atcagcatga gcagcagtac gaggagttcg acatccttca cgattccgtt gcaattttgt 1020 tggaggatca gctcaacagg attaacgcag cagctgcagc caatgcaaac gcgaggaacc 1080 agcaggtacc ggttgtaatc catcaaccac ttcgcatgcc ggttcccaca tttgatggcc 1140 gttatgagag ctggcctaag ttcaaggcca tgttcaagga tctcgtggac aagggtccag 1200 atccgccggc tgttaagttg taccatctgg acaaggccct agttggtagc gcagcaggtc 1260 tcattgatgc caagactatc aacgaaggca attacgctca cgcgtggcaa attctggaag 1320 agagattcga gaataagcgt catgctatcg actctcacat tcatggtctg ctgaacctca 1380 aacgcatgac gaagaagagt catttggagc ttcgtagtct ggtggacgag tgcaacaagc 1440 atgtggaggg tctcaaattc ctggagcgag acttcgatgg tgtaggagag gatttcgtaa 1500 tccatctgtt ggctgcagcg ttgcacaacg atgtgcgcca catgtgggag tcaaccatta 1560 agcacggtga gcttcccgat tacgacgaga tgctggcgtt cttgaaggag cagactttca 1620 tcttggagag ggtcgaagcc agcagtcaga aatcatcatc agcacctatt aagtctgtgt 1680 ctgcagctaa caagccgccg atgcagaaag tctatgcagc cgtttcttca agtgagacgg 1740 aaatgaagtg tgatttttgc ggaaaagcac atgccaatca taattgcgct gagttcaagg 1800 ctcttcatgt tccgcagaga ttgcttaaag tgagagagcg aaacgtatgt ttcaactgtt 1860 taagaagagg tcatcgtgga gtcgactgtt cttcggacaa atcctgtctg aagtgtaaac 1920 gtcggcatca tagtctcctt catgccgaag agaaaaccaa gcaagttcca gaaccatcaa 1980 gccagccgct cccaaagccg gctgtagagc aagaaccaac accatcaact tcgacgacag 2040 ccacatgttc cagtctggct ccccgctcgc agcaagtggt gctgcttaca gccgtagtgg 2100 acgtcatgga caaaaactac cagccacatc catgcagaat attgttggac agtggttctc 2160 aagtgaattt gatttcacga gcgatggcaa acatgttggg cttgaagttg aactcctcga 2220 acgtcacgat gttcggggtg aatagcacac agactcgatc gtctaattgt ggtgtcgtac 2280 atctttcgtc gaggttcaag gatttccacg ccaaggtcaa gtgtttggta actgataagg 2340 tgacatcaga tcttccttca tcggttatca atatcagcgc attggacctt cccgctggcg 2400 tgcagctagc cgatccgaag tttttccaac cgagtaaagt agacatgctc ctgggcaatg 2460 aatggttcat gaagttgatg atgcccggag aaattacgtt ggctgataat cttcccgttt 2520 tgcgtgaaac ccaatttggt tgggttattg gtggagtgtt tgaagaaggt gcgttagctg 2580 atgaagcagt ttactcgcac acagtcacga tggacgaatt aagccagtcc attgaacgtt 2640 tttgggaagt agaagatgta gttggtgccg acaagcatgg tagtgaagag gaggagtgtg 2700 aagagcactt caatgcgact catcgtagag atgcttccgg tcggtacatc gtcgagctgc 2760 ctttgaagga gtccgttagc gagttaggtg attcccgacc actcgcattg cgaaggttcc 2820 atgcgctgga gcgtaagctt tcgcagcatc cagatttgaa gaagcagtat caggacttca 2880 tggacgagta tgagagtctt ggtcactgta aggaagtgga cataagtaag gatccagcag 2940 gtatcatcaa gtggtactta cctcatcatg cagtgctgcg gccatcgaat accactacta 3000 agtgccgtgt ggtgtttgat gcctctgcga aggtatctgg ccggtcgctc aacgacgtta 3060 tgaagattgg agccaacata caaagcgatt tgcagtccat ctgtctgcgg ttccgccttc 3120 cattacatgt gatggctact gacgtggcca agatgtacag gcaggttctg gttaatcagc 3180 gacatacgcc gttatctcga gtgttttgga gaaagagttc atctgattcg ctacgtgtcc 3240 tagagctgac aacggtcacc tacggcacag cttccgcccc gtttttggca acgcgggcac 3300 tcttgcagtt ggctctcgat gaaggctcca aatatcctgt tgctgcagac attgtgaaga 3360 ataacttcta tgtggacaac gctttgttcg ggttcgatga tttgaacgaa gcgaaagagg 3420 ctcaagtgca gttgatccaa ctactcaagg ctggtggatt tcatttgcac aagtgggctt 3480 ccaacaatcc tgtgttgatg gagcggattc cggaaggcga tcgggacgag cttgtcagca 3540 tcgatgaaag tggttcaagt gaagtaatta aaacgttggg actaatgtgg aacccaaaaa 3600 tggacgcttt gcagtttgta tcccttccaa caatgagtga aaatagagca accaagcggc 3660 aagtgctgtc gctgatatcg agaatgttcg acccattggg tcttgtggcg ccggttatcg 3720 ttatcggcaa acttttaatg aagagtattt ggaaagaaga gttggaatgg gatgaagagc 3780 ttactggtaa tctgaagaag aggtgtgata aattcctgag tgcattaaga agtgttacca 3840 atcttcgaat ccctcgtcac gtggttgtta ctggggccgt tgcatttgag ttgcacggtt 3900 ttggtgacgc aactttggaa gcctatggtg cctgtgtgta catcaggtca attgtacctg 3960 gccaagctcc tgtggtgcag ttattatgtg ccaaatcgaa gatcgttccg aaaacagtgt 4020 tgaccattcc aagaaaggaa cttttagcag cgcttttgct gcatagattg gtgaagaaag 4080 tgttggcagc attagcgctg ccctttcaag acatttcgtt gtggtcggac aatcaagtgg 4140 tgctagcatg gctagcgaag aacccggagc atttagaagt gtttgtgcgg aaccgggtta 4200 gtgagattaa ctctaccgga catcagttta agtggaagta cgtaaatacg ctggaaaatc 4260 cagctgacat tgtgtcgcgt ggccagtcaa cccacgcact tgagaagaac gatctttggt 4320 ggaatggacc attgttcctg cgtagtgaag tgtatcaggt ggtggctcca gaaccactgt 4380 ctgatgaaga tgttcgtgag ttgcggcagg tcgctgcaac atctgcggtg acagttttgg 4440 agagattgcc agtattcacc aagttcgagt cttcaggaag ctgcaacgag ttctggcata 4500 cgttttgcgt ttttgccgga atgcaaagga gaaggtggcc atgaagcgaa tcgtcgcgcg 4560 gtttccaacc gtgcttgaaa tgcgccactc tttgaaagcc atcaccaggg tggttcagtt 4620 gcaacatttt ggtaaggagg tagccatgat cgagtccggt gagtactgca aaagattttc 4680 tgcgttaaat ccctttttgg acgatggaat gattcgcgtc ggcggacgac tgcgacattc 4740 aaatttacca tacggcgtta agcatcagtg ggtcttacct aagaatgatg aaaccatcca 4800 gcggttgatt caagcgatac atcgtgagaa cctacatatt ggaccgtcgg cgctgctggc 4860 ccagcttcgt cggcaatttt ggattctggg agcccgttct gctgtacgca aggtaacacg 4920 aaactgcgtg cggtgtttca ggctcaaacc cccgtgcgcc agccaattca tgggagatct 4980 tcccgtcgca aggtgcgaca aggctcccgc ttttgtgaag gttggtgttg actttgcggg 5040 ccccatactt atcaagcaga ctggaaggaa ggtggccccc gttaaaggat acgtgagcgt 5100 gttcgtctgt atggtgacca aggccattca tctggaggtg gtagaagatc tatcggccga 5160 tgcgtttatt gctgctctgc agcgattcgt ttctcgtcgt ggtgtgccag aacagatatt 5220 ttccgacaat gggacaaatt ttgtcggtgc acgcaacgaa ttgaacgagc tgtaccgtct 5280 cttcaaggga caagtaaccg agctcaagat caatgagttc tgccaacccc ggcagatcga 5340 atggaagatg attcccccaa atgcgccaca catgggcgga atctgggagg caggagtaaa 5400 gagcgtcaag accattctca agaagaaatg ccaatcaagc ctcctaacga tgtccgagtt 5460 ttcgactttg ctctgccaga tcgaagctca actaaattcc cggcctttat acgctccctc 5520 tgaagaccct tcggagctcg agccgttaac cccaggtcat ttcattattg atcgtccact 5580 gactgccatt ccggagccta gctatgatgg aatcccagct aaccgactct ccagatggca 5640 atacgtccag cgcctgcgtc aagagttctg gaagcgctgg tccgaagaat acctgttgga 5700 gttgcaggta cgacagaagt ggagcaagaa gaaggagaac attctacctg gaacagtggt 5760 ggtaatcaag gatgacaact taccgcctca gcaatggaag ttggggaagg tcgagtcaac 5820 ctacacagga tcagatggac tggtccgcgt ggtagatgtg cgtaccaagt caggagtgct 5880 caagcgtcca atccataagc tggctcctct acctatattg gacaatcagg ctgtcaacga 5940 gaacaaggtt cccgcggggg agga 5964 // ID TRAS3_BM repbase; DNA; INV; 8005 BP. XX AC AB046668; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 01-JUL-2010 (Rel. 15.08, Last updated, Version 4) XX DE Bombyx mori TRAS3 gene, non-LTR retrotransposon, complete cds. XX KW R1; Non-LTR Retrotransposon; Transposable Element; TRAS3 gene; KW TRAS3_BM. XX NM TRAS3_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RA Kubo Y., Okazaki S., Anzai T. and Fujiwara H.; RT "Structural and phylogenetic analysis of TRAS, telomeric RT repeat-specific non-LTR retrotransposon families in Lepidopteran RT insects."; RL Mol. Biol. Evol 18(5), 848-857 (2001). XX DR Genbank; AB046668; Positions 1 8005. XX FH Key Location/Qualifiers FT CDS 2309..3670 FT /product="TRAS3_BM_1p" FT /translation="MTSDKSPRNPKSPEGGERRSIGLRIDEWETSKGKTIP FT SSTKLAAQTVVAAKPTTSQAGKTTSPGQCQDRSAEAKACLLKAKLHLSNSG FT NIKREIKIEVTAALDKLYQLVKDAEMDRKKGKSKEEKTGDVGQRATDRTFT FT VSPMDANIYVAKMEEHAKLLQESKKMMGDLKEAMEKATVTAATYASVAAIQ FT PAATNEKPEVMRQTLHSVVITSKDECETGEKVLDRVRKAVDAKEGWIEVKN FT VRKAKDRKVIIGLGTKAERDKLKNRLEKAETQLHVEEVENRDPLMMLRSVL FT TIHSDEDILKALRNQNRDIFRDLCEGEDRVVIRYRRRARNPHTNHVVVSVS FT PTVWQRATGKGSVHIDLRRIKVEDQSPLVQCTRCLGYGHSKRFCVESVDLC FT SHCGGPHLKTECSDWLAKVPPKCRNCTKADIDNAEHNAFDSNCQVRKRWDD FT LARSTVAYC" FT CDS 3663..7346 FT /product="TRAS3_BM_2p" FT /note="endonuclease, reverse transcriptase, and FT ribonuclease H." FT /translation="RIAKGSQEDQVPYRVVQANLQRNKLATNEVLVEAARL FT KIAVGLLQEPYVGGAKEMKTQRGMRVFQNADVSGGTVKAAIVVFDHNINVV FT QYPKLTTNNICVVGINTSAWSITLVSFYFEPDHPIEPYLEHLGKIKEEIGR FT SKIIYGGDSNAKSTWWGSPSIDNRGTSMLGTLEELELNILNTGEIPTFDTI FT RGGKRYKSYVDVTACSTDLMDLVSDWRVDEGLTTSDHNAILFNIHTKRAIG FT IKIQRTTRIYNTKKANWSNFHEKMRQLIQEKQLTIENIKQINTIAEIEIAE FT NKYTNIIKTVCNQTIPKKKTQEKFTLPWWSDELAAMKREVATRKRRIRCAA FT PIRRSRVVEEYLKLKQEYELKAASAQIESWKNYCHRQDKEGVWEGIYRVIG FT RVTKREEDLPLEKDGNILDAKQSVKLLSETFYPKDSTDGDNDYHRQIREEA FT EKVNCGKQNNNIFEPQFTMSELKWASNSFNPKKAPGADGFTADICHHAINS FT SPHVFLTLLNKCLEQSYFPKAWKEATVVVLRKPGKESYTNHKSYRPIGLHT FT ILGKIYEKMLISRVKYHLIPRTSTRQFGFMPQRSTEDSLYTMMQHISNKRK FT EKKIVTLVSLDIEGAFDCAWWPAIRVRLAQENCPLNLRKVMDSYLTDRKVR FT VRYAGEEHSVNTSKGCVQGSIGGPVLWNLLLDPLLKSLDTQKVYCQAFADD FT VVLVFDGDTALEIENRANAALEHVQEWGINNKLKFAPQKTKAMVITRRLKY FT DIPRLNMGGTVIPMSEDIKILGVTVDNKLTFNAHVSNVCRRAIEVYKQLAR FT AARASWGLHPEVIKLIYTATIEPIVLYAASVWVSAVAKLGVIKQLAAVQRG FT IAQKVCKAYRTVSLNSALILAGMLPLDLRVREAASLYEAKKGQLLPGLADA FT EIEQMTPFAEMPHPVERADLQIVCLEDQEQVDGNSDYDECIFTDGSKIGGK FT VGAALSIWKGDTETKTRKLALSNYCTVYQAELLALCVATTEVRKSKSKSFG FT VYSDSMSALQTITNYDSPHPLAVEARQNIKASLLQGKAVTLHWIKAHAGLK FT GNERADGLAKEAAENSRKRPDYDRCPISFVKRSLRMTTLEEWNRRYTTGET FT ASVTKLFFPDALVAYRIVRKIQPSNILTQIMTGHGGFSEYLCRFKCKESPS FT CICDPAVKETVPHVLVECPIFAQARHDIEQKLDVKIGLDTLHEIIIDTNRN FT QFLKYCIAIIGIVIKRNK" XX SQ Sequence 8005 BP; 2785 A; 1569 C; 1779 G; 1868 T; 4 other; cccagtctgc cgtggccctc cgcagcgaac acgcgcgtcc caagttgctc cgcagtgatt 60 tacgctcgaa aaatcattaa aaattccgaa aagtggtgaa aagtgtacca aaataacagt 120 gcgatacacc aaaatttgtg gaaaagtgcg gcgaacctgt tggtgaaact ttattttgga 180 tcgcagtgaa atttggacga ttttccgtgg aaaaactcgg aaattgcaga attcgaaact 240 ttgaccgcgt gcgccgcggg ttttatcgct gtgacataca gtttttctac tgctgtcctg 300 acgggcttgt tgaaccctac aatccaatac caaatttgga atttttcggc cagagacggc 360 cgagaaatca ccgaaaaaaa caaaaaccac gtggcacgtg gtcagtgatc ggcggccatt 420 ttggaaattt tgaaaatagt gactggagga ggcagacaag agaagcccag aaccgtttca 480 agacctgaaa aaagttaagg cgtttttaag ttattgaatt ttgaaagttt ggcaatattc 540 gaaattttct atctttttat aacgatcgat aacatatagt tttttcatag tcatattaaa 600 gcttacatac acctgcaaca ttgttgaaaa aatcagaaat ttctatcaga aaattccgga 660 gaaaataatt tttcaaaatt tttcaaattt tcaattgcat ttaaaaaata attattaaaa 720 atctagaaca tattaattta aatattattg tagagttaaa aatttagcat aactttgacg 780 ccttatttga aattattgga ccgaaattgt acttttataa tcactttaag gaaaaaattt 840 attactttta agaatattaa atcggacgtc ggttggtttt tattgttaaa taaatattcg 900 tgaccctgag agaagccctt acaaaacgaa cccaaactcg actagtttcg gaccactttt 960 tgaatttttt aatttttggc ttattttatt tgtgtttgta ttattaagta ggaactttaa 1020 aaatttagaa ctgtacatac tttggctgtg cgtgttgtgt atgtcttaga ataggttatg 1080 aatttgattg attcgtgaat cacttgtatt tttgacccgc atcgaaagcg gaacaatata 1140 aaacatttat cttaagccgt aagaaaccga gcgaacttgt cgaccaatag aacagcgcgg 1200 cggtgacgtg atatcgatgt tgtttacatg caggtgagcg gatgagtcac gcgccgcacc 1260 atctggtccc gtcttttgtt tctttatgtt ttacaacatt taaatttgaa gtaaaataac 1320 atcgaacata acacagccag cgtattgaat agcgaataga acgaaataga acgaaattga 1380 atgcttggtt tggttacttg ctgtcactga cttagctgtg catatttagt gcataataat 1440 attagtttta gtgtgaatgt gaaatacatc actgtattac ttagaaattg tggacagtgt 1500 gtaactgagc tgacgttatt gttttgtttc gtcttgtttt ttctctaatt tttattttgt 1560 atcactattt tattttttat atatttcata ttataattca gaactgcgaa taatttaatt 1620 gtgtctgtta tgttttaaaa ttttttaaca gtttgactgc ttattttagt tagttggtca 1680 atatcgaaag cnnaatnant taaattaaat cattatatat aatatacaca cacacacaca 1740 cacacacata cacacacaca cgcgcacaca cacacacaca cacacaagta aataaataaa 1800 taatttctta ccgacatccg acgagtccag tactcgtgta gtctgacgaa atctgcaaca 1860 aaaaacagag acataaactt tcatatgcag agtacagctc ctgtaaagta aaataaaata 1920 tttaaaaaaa tagaaagtta catatacatt tcaataagac atatctatct gggtagtata 1980 gacagcacgg gtttatattc gactgaccct attgggcaaa taaacaagcc ttttttgggc 2040 ggaataagtg tgttgcgacg acttaattat tattaagcta agtaagacat tgacaattat 2100 ttttacaata tatacatttt acacatttac acctatacat ttatttaagc cataggacaa 2160 ctttccccga tatactcgca taacacatta cattgcatca tactaactaa ctgggctgaa 2220 tcactatcgg actagacagt cgttcgctag aaatacttca acacctctga caacatgttc 2280 ggaatacgat cccccgtcca gaaaagcgat gacgagtgat aaatcacccc gaaaccctaa 2340 atccccagaa ggcggcgaga gacgcagcat aggcttgaga atagatgaat gggaaacctc 2400 caaaggtaaa accatcccat catccactaa gttagccgcg cagactgtcg tagccgccaa 2460 accaacaact tcgcaagccg gcaaaacaac aagtccaggc caatgccagg acagatctgc 2520 ggaagcaaaa gcctgccttt taaaggccaa gctgcactta agcaactccg ggaatattaa 2580 aagagaaata aaaatcgagg ttacagcggc acttgacaaa ctttaccaat tggtgaagga 2640 tgccgaaatg gaccggaaaa aagggaaatc gaaagaagag aagacgggtg atgtggggca 2700 aagagcgact gataggacct ttacggtaag cccaatggac gcgaatattt atgtagccaa 2760 aatggaggag cacgccaagc tcttgcaaga gagtaaaaaa atgatgggag acctgaagga 2820 agcaatggag aaagcaacag tgacagcagc aacgtacgcc agcgtagccg cgatacagcc 2880 agctgctaca aatgagaagc ctgaagttat gaggcaaaca ctgcactccg tcgtgatcac 2940 ctcaaaggac gaatgtgaga ctggggaaaa ggtgctcgac agagtaagaa aggcagtaga 3000 tgcaaaagag ggatggatag aggtaaaaaa cgttaggaag gcaaaagata gaaaagttat 3060 cataggccta ggtaccaagg cggaaaggga caaattaaaa aataggttgg agaaggcgga 3120 gactcagctc cacgtcgagg aagtggaaaa ccgggatccc ttaatgatgc ttaggagtgt 3180 tcttaccata cattcggatg aggacatcct taaagctctg agaaaccaaa atcgagatat 3240 tttccgcgac ctctgcgagg gagaggacag agtggtgatc cggtacagac ggagggcgag 3300 aaacccacac acaaaccacg tggtggtcag cgtctcgccc accgtctggc aaagagcaac 3360 cggaaaagga agcgtgcaca tagatctgcg aaggattaaa gtagaagacc aatctcctct 3420 ggtgcaatgc acgcgctgcc taggctatgg acacagcaag agattttgcg tcgaatctgt 3480 agacctgtgt agccattgcg ggggtccgca tttgaaaact gaatgttctg actggttggc 3540 taaggtacca cccaaatgta ggaattgcac aaaggcagat atagataacg cagagcacaa 3600 cgcttttgac tcgaactgtc aggtgaggaa aagatgggat gatttggccc gatcgactgt 3660 agcgtattgc taagggcagc caggaggatc aagtccctta tcgggtagta caagcaaacc 3720 tccaaagaaa taaactagcg acaaacgagg ttcttgtgga ggcggcaagg ctcaaaatcg 3780 ccgtgggcct tctacaggaa ccatatgtgg gtggggcgaa agaaatgaaa actcaaaggg 3840 gaatgcgtgt gttccaaaac gctgatgtga gtggtgggac tgtaaaagca gcgatagttg 3900 tattcgacca taacatcaac gtagtgcagt acccgaaact caccaccaac aacatctgcg 3960 tggtggggat caacaccagc gcgtggagca tcacgctagt ctccttttat ttcgagccag 4020 accatcccat agagccctat cttgaacatc tagggaaaat caaagaagaa ataggaagaa 4080 gcaaaataat ctacggagga gactcgaacg caaagagcac ctggtgggga agccctagca 4140 tagataacag gggtacaagt atgttgggaa cactggagga actggaactg aacatattaa 4200 atacagggga aattccgacc ttcgacacga ttaggggagg aaagcgctac aaaagttatg 4260 tcgacgttac agcctgctca acagacttga tggatctggt gagcgactgg agagttgatg 4320 aaggactgac gacctcagac cacaacgcca tcctatttaa tattcataca aaacgagcaa 4380 taggaataaa aatacaaaga accacaagaa tatacaacac aaagaaagcc aactggtcaa 4440 attttcatga gaaaatgcga cagttgatac aagaaaaaca attgaccatt gaaaatataa 4500 aacaaataaa tacaatagca gaaattgaaa tagcagaaaa caaatacaca aacataatta 4560 aaacagtatg taaccaaacc atacctaaga aaaaaacaca agaaaaattt accttgccgt 4620 ggtggtctga tgagctagcc gcaatgaaac gtgaagtcgc caccagaaag cgcagaatcc 4680 gatgcgctgc gccaatccga aggtcacggg tcgtcgaaga gtacctgaaa ctaaaacaag 4740 aatatgagtt aaaagcagct agtgcccaga tagaaagttg gaagaactat tgtcatagac 4800 aagataagga aggagtgtgg gagggaatct atagggttat tggaagagtg actaaacggg 4860 aagaagactt gccactggaa aaagacggaa acattctaga tgctaagcag tcagtcaaat 4920 tgttgtcgga gacattctat ccaaaggatt ctaccgacgg cgataacgac taccatcgcc 4980 aaatcaggga agaagccgaa aaagtgaatt gtggcaagca aaataataat attttcgaac 5040 cgcaattcac catgtcagaa ttgaaatggg caagtaactc cttcaacccc aaaaaagcac 5100 ccggagcaga tggctttaca gcggatatat gtcatcatgc cataaacagt agccctcatg 5160 tatttctcac gctcctcaac aaatgtctgg aacaaagcta cttcccaaag gcctggaagg 5220 aagctaccgt ggtggtgttg cggaagccgg gtaaagagtc atacacaaac cacaagtcgt 5280 atagaccaat cggtttgcac acaatactgg gcaaaatata cgaaaaaatg ctgatttcac 5340 gcgtaaaata ccatttgatc ccaaggacaa gcacaaggca gttcgggttc atgccacaaa 5400 ggagcaccga ggactccctc tataccatga tgcaacatat ttctaacaaa aggaaggaaa 5460 agaaaatagt aacgttggtg tcattagata tagagggagc ctttgattgc gcctggtggc 5520 ctgccatcag agtccgatta gcccaggaaa actgtccact gaacctgcgg aaagtaatgg 5580 acagctatct cacggatcga aaagtccgag tcagatacgc aggggaagag cacagcgtga 5640 ataccagcaa aggctgtgtg cagggctcaa tcggtggccc tgtgctgtgg aacctcctgt 5700 tggacccact cctgaaaagt ctggacaccc aaaaagtgta ctgtcaggca ttcgcagacg 5760 atgttgtcct tgttttcgac ggagacacgg cgttggaaat tgaaaaccgg gccaatgcgg 5820 ctctcgaaca tgttcaggaa tggggtatca ataacaaact gaagttcgca ccacaaaaaa 5880 ctaaagctat ggtcattaca aggagattga aatatgatat cccacggctg aacatgggcg 5940 ggacagtcat tcccatgtct gaagacatta agattctagg ggtaaccgtc gacaacaagt 6000 tgacatttaa cgcgcacgtc tcgaatgttt gcagaagagc gattgaggtg tataaacaac 6060 tagccagagc agccagggcc agttggggtc tacaccccga ggtcattaaa ttaatatata 6120 ccgccaccat agagcccata gtcttgtacg ccgccagtgt atgggtatcg gcagtcgcca 6180 aactgggcgt aattaaacaa ttagccgctg tgcagagggg aattgcacaa aaggtatgca 6240 aagcgtatcg caccgtatct cttaactcag ctctgatcct agcgggtatg ctccccctag 6300 acctccgagt tcgtgaggcg gcctcattat acgaagccaa gaagggacaa ctgctgccgg 6360 gactggctga cgcggagatt gagcaaatga caccttttgc agagatgcca caccccgtgg 6420 aacgtgcgga tctgcagata gtctgcttgg aggaccaaga acaagtcgac ggtaacagcg 6480 actacgacga atgtattttt acagacggaa gtaaaatcgg aggcaaagtg ggggccgcgc 6540 tgtcgatttg gaaaggggac acagagacta agacccgcaa acttgccctg tcaaactact 6600 gcacggtcta ccaagcagag ctgctggcac tgtgtgtggc gacgacggaa gtcaggaaga 6660 gtaaaagcaa atcttttgga gtttatagcg attccatgtc ggccctccaa accataacaa 6720 actatgatag cccccatcca ctggcagtcg aagctagaca aaatattaaa gcctcgttac 6780 tccaaggcaa ggctgtcacc ttgcattgga taaaagctca cgcagggctg aagggcaatg 6840 agagggccga cggacttgca aaggaagccg ctgaaaactc caggaaaaga ccagactacg 6900 atcgctgccc gatctcattc gtcaagcgaa gcctacgaat gaccacgctt gaggaatgga 6960 accggcgcta tacaactggc gagacggcat ccgtcactaa gttgtttttc ccagatgcat 7020 tggtggcgta cagaatagtg agaaagatac agcccagtaa catactcaca caaatcatga 7080 cggggcatgg cgggttctcg gaatacttat gtcggtttaa gtgtaaagag agcccgtcat 7140 gcatttgcga cccagcagtg aaagaaaccg ttcctcatgt gctggtggaa tgtcccatct 7200 ttgcacaggc tagacatgac atcgagcaaa agctggacgt aaaaattgga cttgacacgc 7260 tgcatgaaat aataatagac acaaatagaa atcagttttt gaaatactgc atagctatca 7320 ttggaatagt aataaaaaga aataaataga aaataaagta tgtttacaat aatattaaga 7380 tatattagat ataagcatac aatatattaa gcaaaaacaa aagtatatat atacgcagta 7440 aaaataagaa aacaaaggaa atgggcttga cataaattaa aacttacctt cctcttctcc 7500 tgtccagctc cctggaaaat aaaattaaaa ttgaaaagta ttgttagaat agaactagaa 7560 gagaaatgta acaaataaca tagaaaaaag tagaataagc acgtaatata ataagcaata 7620 gaataagaag ctaaaatgta aaccatagga ataagataat aatattgaaa ttgttaaaat 7680 tatataaaga taagatccaa atagtataag cttcaaaata gacataagtt tcaaacaata 7740 tttttgtaat tgatttatga ataaatgagc aaaatgctat ttaccccgaa aaataaaaca 7800 atagtaggtt aaagtagcgg gagtcccacc cggctaagta tgacagatga tgaatgcaag 7860 aaagaaacag tatgaagcat gaaagtgcga gaagcataaa agagagaact agaatgaatg 7920 agtaacttag actaaaaaag acccggtgat ctcacgatcg gggaaaggca ttaaaaaaaa 7980 ataaaaaaaa aaaaaaaaaa aaaaa 8005 // ID CR1-28_HM repbase; DNA; INV; 4016 BP. XX AC . XX DT 16-DEC-2008 (Rel. 13.12, Created) DT 16-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-28_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4016 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1856-1856 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(181..837,841..3858) FT /product="CR1-28_HM_1p" FT /translation="MVNAQEFNAYKAQQEQIIGDLKLQIEKLITDNMNLTK FT RVDDLEMNKSSTSHINNNVTWANMVKKSPEQMLMINSVTSETKERERRENN FT VVIFGIKPSENEEKSKEKEENRRAIINVFKKLQVTVNVQNFMKLKPRNTNN FT QAPFVVILKDKRERNSILKKAKELRDSKEYENVYINPDLTESERFKSKFLR FT EECRKKNKENIENEFYYGIRNEKVTKIKKLYDLKDKNSNHCSYSHVNVLGN FT AVFKFKGGPTSIPSLEKSRNNNPHKNHIKLWYTNATSLNNKMQQLKAYLSC FT DKPDIILVSETWFNDKSVINIDGYNCYNKNRPLGENGGGVVIYAKSTFITT FT SVSHLALCDNGIEQIWCSLTNGPDHLLIGCIYRPPSSPESVNTKINSVLGI FT AKSLVDQKKYTGLIITGDFNYPHIKWYKDGGTKLKQNCDISIKFIDNLNEN FT FIEQCIFGSTFQISDXEPTNTLDLLMTDNCKRIRDVEIGAPLGPIEKAHYV FT IQFKYDMQGADTIPIQFNSKDINYQKGDYEKINLGFNLINWYELFKDLNVD FT HCYSAFVQKYNELIELNIPKKRARFDQKVKKDPCITIEVKALIRKKKNLWN FT QLRAGKFKYLNKLNEYKAVKKLVKSKLKESTIDYEKNLINNAKKNPRLFYK FT YAKKNQKNIGKIQSMKNYANVLTSDRHEIVNILNKSFQSVFVKEHGEIPEF FT VPRITKPFEWDWKNVVTNEAVQQKLYSLDVNKACGSDGISPYVLKNCSSSA FT ATPLALIYRKSIETGKIPSKWKEANITPIFKNGNTTDPLNYRPVSLTSVPG FT KVMEAFIKEAIMTHLMENKLLSKNQHGFVKNKACVTNLLETLDILTASMSD FT SLAVDMIYMDFAKAFDKVPHKRLLKKCEAYGIKGVINDWISSFLKERRQRV FT VMGVDVSPWLEIFSGVPQGSVLGPLLFLIFVNDLPSVINSNCKLYADDSKL FT IKVIKNVEDHESLQNDINEFVKWTDIWLTKLNIEKCKVMHLGKKNPRMSYC FT MNHDGKEYKLKTTVSERDLGVIISSDLKWHDQVTSATAKAQRTLGLIKRTF FT TYFDVEMVKSLYTTFVRPLLEFAIPAWQPYLQRDIDELEKVQRRATKLVPQ FT LKKISYEKRLKAMGLTTLEKRRARGDLIQQYRFKQNIDQINWYKNPKPASS FT VTSSGPAFSVRGNKYRLEREYVPSCLPRYNFFTNRVCKNWNLLPNEIIEAT FT SLNVFKARIDKHFS*" XX SQ Sequence 4016 BP; 1629 A; 569 C; 690 G; 1127 T; 1 other; tttttttttt tttttttttt ttttttttgg cggaattaaa gatggcggac atgcttttga 60 aataaaaaac aataacaacc attatattaa tgaaatatta tatatacgtc ttgttattaa 120 cgaaaaaaca acagagtttg aagaactagt gaataatata attttaaatt tttcaagaaa 180 atggtaaatg ctcaagagtt taatgcctac aaggctcaac aggaacaaat tattggagat 240 ctgaaactac aaattgaaaa attaataact gataatatga atttaacaaa aagagtagat 300 gatttagaaa tgaataaatc aagtaccagt catataaata acaatgtaac gtgggcaaac 360 atggtaaaaa agtcaccgga acaaatgctt atgataaatt ccgttactag tgagacaaaa 420 gaacgagaaa gaagagaaaa caatgtagtt attttcggaa ttaaaccatc ggaaaacgaa 480 gaaaagagta aagagaaaga agaaaataga cgagctataa ttaatgtttt taaaaaattg 540 caagtaaccg ttaatgttca aaattttatg aaactaaaac caagaaacac caataatcaa 600 gccccatttg ttgtcatact gaaagataaa agagaaagga attctatctt aaaaaaagcc 660 aaagaattac gagattcgaa agaatatgag aatgtctata taaatcccga tttaacggaa 720 agtgaaaggt ttaaatcaaa gtttttaaga gaagaatgca gaaaaaaaaa caaagagaac 780 atcgaaaacg agttctacta tggaattcga aatgaaaaag taacaaaaat caaaaaatag 840 ctatatgatc taaaggacaa aaactcaaat cattgttcat attcacatgt aaatgtatta 900 ggaaatgctg tgtttaaatt taagggtgga ccaacatcaa tacctagcct cgaaaaatca 960 agaaacaaca acccacacaa aaatcacatc aaactttggt acaccaatgc aacttcactt 1020 aataataaaa tgcaacaact taaagcgtac ttgtcctgtg ataaaccaga tattatatta 1080 gtaagtgaaa catggttcaa tgataaatca gttatcaata ttgatgggta caactgctac 1140 aacaaaaaca gaccgcttgg tgaaaatggt gggggagtag taatttatgc aaaatcaaca 1200 tttattacaa caagtgtaag tcatttagca ttatgtgaca atggaataga gcagatatgg 1260 tgtagtttga ccaacggacc agaccattta ttaattggat gtatttacag accgccaagt 1320 agtccagaaa gcgtaaacac taaaatcaat tcagtgcttg gaatagccaa aagtctagta 1380 gaccaaaaaa aatataccgg attaattata actggtgatt ttaattatcc acacataaaa 1440 tggtacaaag atggtggaac aaaactgaaa caaaactgtg atattagtat aaaatttatt 1500 gataatttaa atgaaaattt tattgaacaa tgtatttttg gatcaacatt tcaaatatcc 1560 gacgawgaac ctacaaacac acttgattta ctaatgacag ataactgcaa aaggattaga 1620 gacgtagaaa taggcgcacc acttggacca attgaaaaag cacattatgt aattcaattc 1680 aaatatgaca tgcaaggtgc tgacacaatc cctatacaat tcaatagtaa agatataaat 1740 taccaaaaag gagactatga aaaaataaat cttggtttta atttaataaa ttggtatgaa 1800 ttattcaagg atttaaacgt ggaccattgc tacagtgctt tcgtacaaaa atataatgag 1860 ttgattgaat tgaatattcc aaagaaaaga gccaggtttg atcaaaaagt taaaaaagac 1920 ccatgcataa caattgaagt taaagcttta atacgtaaaa agaaaaattt atggaaccaa 1980 ttaagagctg gtaaatttaa atatcttaat aaattaaatg agtacaaagc agtaaaaaaa 2040 ttagtcaaaa gcaagttaaa agaatccact attgattatg agaaaaatct tataaataat 2100 gcaaaaaaaa atccgagact gttctacaaa tacgcgaaaa agaaccaaaa aaacatcggt 2160 aaaatacaat caatgaaaaa ctatgcaaat gtactcacat ctgacaggca tgagattgtt 2220 aatatattga acaaaagttt tcaatcggtt tttgttaaag agcatggtga aattccggag 2280 tttgtaccac gtattacaaa accttttgaa tgggattgga aaaatgttgt tacaaatgag 2340 gcagttcagc aaaaactgta ttctttggac gtcaacaaag catgtggtag tgatggtatt 2400 agtccttacg tacttaaaaa ttgttcaagt agtgcagcaa ctccattagc tttaatttac 2460 cgaaaatcaa ttgaaacagg aaaaattcct agcaaatgga aagaagcaaa tattactcca 2520 atattcaaga atggaaatac aacagacccg cttaactaca gacccgtatc attaacatct 2580 gtaccaggca aagttatgga agcattcata aaagaagcaa taatgactca tctgatggaa 2640 aataaattgc tatcaaaaaa tcaacatggc ttcgtaaaaa ataaagcctg tgtaacaaat 2700 ttgctagaaa cattagacat tttaacagca agcatgagtg acagtttagc ggtagatatg 2760 atttatatgg attttgccaa agcctttgat aaggttccac ataaaagact gttgaaaaag 2820 tgtgaagcat atggtatcaa gggtgtaata aatgactgga tatcgtcttt tttaaaagaa 2880 agaaggcaaa gagttgtgat gggtgtagac gtgtctccat ggcttgagat ttttagtggt 2940 gtgcctcaag gctctgtgct tggtcctttg ttattcttga tattcgtaaa tgatctacca 3000 tcagttatta attccaattg caaattatat gcagatgaca gtaaacttat caaagttata 3060 aaaaatgttg aagatcatga gtctttgcag aatgacatta acgaatttgt aaaatggacg 3120 gatatctggt tgacaaagtt gaacatcgaa aagtgcaagg tgatgcattt aggtaaaaag 3180 aatccacgta tgagttattg tatgaaccac gacggcaaag agtataaatt aaaaacaaca 3240 gtaagtgaaa gagatttagg agtgattatt tcatccgacc ttaaatggca tgatcaagtg 3300 acttcagcga cagcaaaagc acaaagaact ttgggtctaa ttaaacgtac gtttacatac 3360 ttcgatgtgg aaatggtgaa atcactatat acgacatttg tgagaccgct tcttgagttc 3420 gctatacctg catggcagcc ttatttacaa cgggatattg atgagttaga gaaagtgcaa 3480 agaagagcaa cgaagttagt tccacaactt aaaaagattt catatgaaaa aagactgaaa 3540 gcaatgggtt taacaacatt agagaaaaga cgagctagag gagatcttat acaacaatac 3600 agattcaagc aaaatataga ccaaataaat tggtacaaaa atccaaaacc ggcatcatcc 3660 gtgactagtt caggcccggc tttttctgta aggggcaata aatacaggct agaaagagaa 3720 tacgtaccat catgtttacc gagatataat ttttttacca atcgagtttg taaaaattgg 3780 aatttgttac caaatgaaat cattgaagca acatcgttga atgttttcaa agcgcgaata 3840 gataaacatt tttcataact tgtgtttttt taatttgtgt gggctgtcta taacccgatg 3900 cttgcatcgc gttccacaat ttattttatt tttaaatatt tatatttaaa caaatatgta 3960 attgttacag caataaatat tactattact aattactatt actaatcttt ccaatc 4016 // ID Copia-20_DPu-I repbase; DNA; INV; 4876 BP. XX AC scaffold_175; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_DPu_; KW Copia-20_DPu-LTR; Copia-20_DPu-I. XX NM Copia-20_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4876 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 703-703 (2010). XX DR Genome; scaffold_175; Positions 143716 148591. XX CC Positions [2396-2638] - Integrase core CC 'TACTT' target site duplication CC LTRs are 93% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS join(2492..3724,3728..4876) FT /product="Copia-20_DPu-I_1p" FT /translation="MESAQSMYLANNLPNELWAEAVAYATYIQNRVVTTKR FT KITPFEVIHGRKPDIHHIRMFGSTAFIHTSDALRRKLDPKAEEGIFVGCCE FT SKRTYRIWIPPKRRIVISRNVIVDEQNQSLKDEKLSSTTPFDSFFRQDSTE FT HPKTDDCTSADRQTADDKQEQSESSTTMAEDTAVEVPHKEEDATQAEKTSN FT NEEPAAQAEDTSNNEEPDPTEEIADQPTAEIPTTKRKSSRTPVFSGKFLEW FT KRSLSKNKENKSKSFGMEVTSTILTEKPEPRTYKEAMESEDAEKWKSATED FT EYKSLMANLTWILVNRPNGRNIVGCKWVYKVKPGYTGVAERYKARLVAKGF FT TQQYGTDYNETFSPVLKYNSLRTILAITAQHNLNISLLDVKTAFLNGELNE FT EVFMEQPEGFVESGKEDVCLLKKSLYGLKQAPRMWNIKFNQFLIRFGLKRS FT ELDSCVYFRRNGTYLLIIAIFVDDGLVCASNIQLAESVIDSLSEEFDMRSL FT PATRFLGLDLHLRNHKILINQPEFIKKVLNKFNMGSCKPTSIPMDSSDVFD FT DANIRRGDFPYGKSTVQRGGRMSTLHLNHVQTRYIVRCKSSSQVLSKPRSG FT TLERVKKILAYLSGTTELGIMFEKNETSPISGYTDADYGGDLDNRCSTSGS FT VFFVYGNLVSWSSKRQKCISQSTTEAEYVAASDACKEAVWLSSLLTELGET FT EEKSVPMYCDNQSAIQSIRNPTFHQRTKHIDIRYHFIRSLQENGIIVVSYV FT PSKEQKADMLTNPFQNQNSKECVNSWAYVISLCLLKIPWLERR" XX SQ Sequence 4876 BP; 1759 A; 1058 C; 912 G; 1147 T; 0 other; ggttatgggc ccagttgtca catagacctc acatagagaa tctgtataga agaatggaaa 60 caccaacgct gaaagatatt ggacacctaa ccaagtttaa tggtagcaac ttccaacgct 120 ggaaatgtgg tttaagactc atccttgaac accatcaact tctcgacatc attgatggaa 180 cagaaactaa accagctgag gtagttctta cgacaatcac ttaaaaccaa aaacagtaaa 240 accaatctag tcacgcttta aaaactacta ctctgaatac gagtaaaaat agagatcagt 300 caaatagaat tttttttttc tctaaaaaca attgagattc tgtacacgag taaaaactag 360 aaatcagtca aacaagtaaa ataatttgca acacagtcga ctaagagact ttttataacc 420 taaaaactta tactgtcatt cgagatcaac caatatagat ctcatgaaac aaaatcttta 480 aactcttatt ctaattcgag ataatgacat tttctatctc atgcacctat taaataggca 540 agaaacaatg acaacgtcgt cacaaatgct gcacaaattc aaagatggaa agaaagagat 600 gttgcagctc gaaattatct atttgccgct actgaagacg atctacgaga tacactctgc 660 acagcaacaa cagcagccga tatgtatgag agaataagaa accaacacgc tcgcactgca 720 gctgataatc gacttatcct tctccaacaa ttcaccgagt acaagtttca acaaggtcat 780 aacgtcacaa gccatgtgac agctctcgaa cttctgtggt caagattaac agaaattgga 840 gaacatatac ttgaaagtca agttatagcc gaaattctct ccacacttcc atcaacctat 900 cgtcacttct acaccacctg gaacaattca ccagaagcag gaagaactgt caaacttctg 960 ctcactaaac tacaagaaga agaggagata acacaggcat tcaacagaaa caacccttca 1020 acggaaggag cctatgctgc accaaaccac ttccaaaatc ctgctcaacg aaatcaaaat 1080 caatctctat aacaacctta cagcagacct catccttact caatgtctag aggaggtcac 1140 caaggaggaa ggataccagt caggaagagg aggctatcca ggtgtcagag gaggatatcg 1200 tggaggtttt cctggaggct ttcgtggatt cagaagtggc ccacaagttc aaagagaact 1260 ctgctcctat tgtggattag gtcctcacaa agccacaaac tgtagaaata gaatgagaga 1320 tgaatctgaa gccctccaac agcaacagtc aaatgcaaaa ttttcacatg cagaacaaac 1380 atcagaaaat tttgaatttg gctacacctc acagaatatt caagactttg agacatttgg 1440 ttttgtagca gactctggag cgtctgaaca catgacggat aaaagatcga ttatcatcaa 1500 ctttagaccg attcaaacag gaactcactc tgtccatggc attggaaaca catgtttaga 1560 agcaaaagga agaggtgatg ttgaagtagt aaacgccgca ggaacaactc tactacttaa 1620 agatgtccta tttgtccctg gcctgggaat aaacttattt tcgattagtg caaccacatc 1680 aaaaggagca gaagcaatat tcttcaaaga tatggtacaa caaacccatt aaaaacattt 1740 ttttttctca ttttttaaat aaacatttta tttctcatag gtcaatatat acagagatgg 1800 acaactagaa atgactggtc aaagagcaag tgagaagctt tactatattg atattatagt 1860 aaaggtccag gaatcctgtc tcagtgccac tagtcgtcca ctaccaatct caatctggca 1920 tcaaagatta ggtcacgtca acaacaaaac gatcctgaaa atgtcaaaag aaaacgctgc 1980 aaatggactc aaaatagaag aaaccagtaa gccaccaaca ctatgccacg gatgcgtttt 2040 agggaagatg cataggtcat catttccaaa cgaaagaaca agaaagaacc acgtcggcga 2100 tctaattcac tcagacgtat gcggttcgat gcaaaccatg tcacctggca agtcacgcta 2160 ctatgttcta ttcaaggatg acttcagcgg ttggtgcgaa gttcagttca tgaaaaacaa 2220 gtccgaggtc ttccaccact ttcaaaactt tgtagcagca ttcaaaacac agtacaacca 2280 tatcgtccgg atccttcgct ccaacggtgg tggagaatat attagtcaag aattcgaaga 2340 atggcttaaa aagaacggta caaatcttca taaaactcat gtaaactatg aataatctaa 2400 caaatgtgta attataggaa ttaaacatga gcgaagcgct ccctatacac ccgagcagaa 2460 tggcgtatca gagagaacaa acagaacaat aatggagtcg gcccaaagca tgtacctagc 2520 caacaatctc ccaaacgaac tttgggcaga agcagttgca tatgcaacgt acattcaaaa 2580 tagagtagta acaaccaaaa gaaaaatcac accatttgaa gtcattcatg gaagaaaacc 2640 agatatccat catattcgta tgtttggatc gacggcgttc attcacactt cagatgcact 2700 acgtagaaaa ttagatccaa aagccgaaga aggaattttt gttggatgct gcgaatctaa 2760 aagaacttac agaatctgga tcccaccaaa aagaagaatt gtgataagca gaaatgtcat 2820 cgtggacgaa caaaatcagt cactgaaaga tgaaaagtta tcctctacaa caccgtttga 2880 ttcatttttt cgtcaagact cgacagaaca ccctaagaca gatgactgca caagcgcaga 2940 cagacaaact gctgacgaca aacaagaaca aagtgaatcc tcaactacaa tggcagaaga 3000 tacggcagta gaagttccac acaaagaaga agatgcaact caagcagaaa agacatctaa 3060 taacgaagaa cctgcagctc aagcagagga tacatctaat aacgaagaac ctgatccaac 3120 tgaagaaata gccgatcaac ctacagcaga aataccaaca actaaacgaa aatcatctcg 3180 cactccagtc ttcagtggaa agttcctaga atggaaaaga agcctttcga aaaacaaaga 3240 aaacaagtca aaatcctttg ggatggaagt cacatcaact atccttacag aaaaacctga 3300 accaagaacc tacaaagaag caatggaatc cgaagatgcg gaaaaatgga aaagtgcaac 3360 cgaagatgag tacaagtcac tcatggcgaa tctcacctgg atactagtca atcgacctaa 3420 tggacggaac attgttggat gcaaatgggt ctataaagta aaacctggct acactggggt 3480 agctgagcgc tacaaagcta gactagtagc aaaaggtttc acccaacaat acggtactga 3540 ctataatgaa actttttctc ctgttctaaa gtacaactca ctccgcacca ttcttgccat 3600 aacagctcaa cacaatctaa acatctctct cttagatgtg aaaactgcct tcctaaacgg 3660 agagttaaac gaagaagtct tcatggaaca acccgaagga tttgtcgaat caggaaaaga 3720 agactaagtc tgcctactca agaaaagtct ctacggactg aaacaggctc cgagaatgtg 3780 gaatataaaa ttcaatcaat ttctgatcag atttggacta aaaagaagtg aattggactc 3840 gtgtgtatac ttccgtcgaa atggaaccta cctactcatc atagctatct ttgtagatga 3900 cggacttgtg tgtgcatcaa atattcaact tgccgaaagt gtaattgact ccctttccga 3960 agaattcgac atgagatctc taccagctac aaggtttcta ggcttagatc ttcacctaag 4020 gaatcacaag attctaatta atcaacccga gtttatcaag aaagtgctaa acaaattcaa 4080 tatgggatcg tgcaaaccta catctatacc tatggactcg tctgacgtct tcgatgacgc 4140 caacatccga agaggagatt ttccatatgg caaaagtacc gtacagagag gcggtcggat 4200 gtctactcta catctcaatc acgtgcagac cagatatatc gtacgttgta agtcaagtag 4260 ccaagttctg tcaaaaccca ggtcgggcac attggagcgc gtaaagaaga ttctagccta 4320 cctctccgga acaactgaac tcggaattat gtttgaaaag aatgaaacaa gtccaatttc 4380 cggctacacc gatgctgatt acggtggtga tctcgacaac aggtgctcca catctggctc 4440 agttttcttt gtatacggga atcttgtctc gtggagcagt aagagacaaa aatgtatatc 4500 tcagtcaact accgaagctg aatacgtagc ggcctctgat gcgtgtaagg aagcagtctg 4560 gctctccagt ctactcacag agttaggaga aacagaagaa aaatcagtac cgatgtactg 4620 cgacaatcaa agtgcaatcc agtccatccg caatccaaca tttcatcaga gaactaaaca 4680 catagacatc cgttatcact tcatcagatc cctgcaagaa aatggcatta ttgttgtgtc 4740 ttatgtccca tcaaaagaac agaaagctga catgctaaca aaccccttcc aaaaccagaa 4800 ttcaaaagaa tgtgtcaaca gttgggctta tgtgatatca ctatgtctat tgaagattcc 4860 ttggcttgag aggaga 4876 // ID Gypsy-6_RP-I repbase; DNA; INV; 4368 BP. XX AC ACPB02021416; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_RP_; KW Gypsy-6_RP-LTR; Gypsy-6_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-4368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02021416; Positions 1846 6213. XX CC Positions [3327-3836] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1242..4259 FT /product="Gypsy-6_RP-I_1p" FT /translation="MTYLIDTGAAMSVIPPSIAGRRTPQKFPLYAANGTVI FT HTYGEKQMRLDLGLCKAFTWTFIVAAVPKPIIDADLMFHYGLLVDLKGRQI FT IDQQTGLTSKGYVVRVISPSISAIPEGQPFANILKDFPEITRPSPIIEHQA FT PVKHCILHHIETSGPPVVAKARRLAPERYRAAKEEFRRLMEEGICRPSKSP FT WASPLHVVVKKDGSLRTCGDYRRLNAQTLPDRYGVPNILDFNVHLHDKKIF FT SKIDLIRAYYQIPVAAEDIPKTALITPFGLYEFTRMCFGLRNAAQTFQRFM FT DAIFRDLDYCFVYLDDILVASENVQQHEQHLREVFHRLKEHGIVINPNKCL FT FGKDEIEFLGYKVTPCGIKPLETRVQAIAQYPLPNTVSELRRFLGMVNFYR FT PCLRKAAETQRPLNRYLKNSKKNDKRVIEWTDETRTAFERCINDLQEATYL FT GHPAIDAYVTLACDASDTSVGAVLQQREGDVWVPLGFYSKTLNPAQVKYST FT YDRELLAIMLAIKHFRQTLEGRNFAFLTDHKPLIYAFRQKKSPDSPMRTRW FT LNFISEFTTDLRHVEGETNRVADALSRIEDVSLEPDWANLSHAQGNDPKLH FT SLRNQSNLSWNHLIIPGCEEHIWCEMTTGNCRPYLPAAFRRSTFDRMHSFS FT HPGPRASRKLIGSKYFWPRMNSDIGKWAKSCLSCQKAKITRHTVAPLGSFP FT ACKRFEYVHIDFIGPLPPSKGFLYCVTMIDRFTRWPEVVPVQDICAETVAT FT VFFQSWVSRFGTPRIITSDQGRQFESALFRSLMNILGTNRIRTTAYHPQAN FT GCVERMLKSALIAHTNPASWADALPSVLLGLRVAVRKDTNASVADAVYGQA FT LRIPGEFLTPATGSENINHIQQLRAFVSQLRPPAEPTSQRTFIHPQLSTCT FT HVFLRDTTPKKSLQPTYRGPYQVLSRGSKTFNIDLDGKETTVSIDRLKPAF FT ILADPGDSTPTPTPPAKTSVPSQQTPNYTTTRSGRVVKPSVRFLDAVTGGE FT " XX SQ Sequence 4368 BP; 1275 A; 1000 C; 999 G; 1094 T; 0 other; ttggtgaccc cgacgtgata ctgtggtaac aaaaacatta gggattataa gaatcatttc 60 tttgtagtat ttgtgttgtt gcaaattggt gttaattttt ttttcttttt ctagttttat 120 atgtattcta aataattttg tttttaagaa cttttcgttg tttgtttacc tttgatatgt 180 gtactattgc catgttaata ctatattagt tattgttccg aatgattggt tttggttttt 240 attagacagt tgaattactt tatattctca agccattttt tttttttttt ttttataact 300 atttattagt gtaagttaac gattttattc tttctaatta ttagctatgg gttctaaaga 360 taaaagtgtc tccaccgaaa tagggctaac agtatctcca gacataggag cagtttcagc 420 tactaacaga attccgccat tctggaagag cgacccggaa ttgtggtttg ggcaggtaga 480 atccgtgttc ttaaaggcaa atataaccga cagtgaaact aaattccata caattgtgcc 540 aactttagag ttcgatgttc tcaagcaggt cgccgacctc gtcaagtcga ggcctgccaa 600 caacccctat gaagctttaa aggaaagact aatagacacc tttgcagagt cggaaaacaa 660 gagaataact caattgctcg aaggtaaaca gcttcccact tgttacgtca gatgcagtta 720 ttagcggggg atacggtagc aaaagatatg gtgaaaatgt tgtgaatccg atcattgcct 780 tcaaacatgc aagcgatttt gcaagcaaca ggacaaacta aaccgctcgc attggcggac 840 gtagctgata aaattcacga agtggcagtc cccgcggata tctgtactgt ccataaacca 900 agcccaacag aggatttgat tcacgaagtg cagcgcctta cccaggaagt ggctgaacta 960 aaattacaag caaaacataa cgaaatgaga ggaaaaagag agagatctcg ggccagttcc 1020 cgcggtagat ccgggcaacg agggctgcag cagtcatcct ggttgtgctt ttaccattat 1080 aggtttgggg acaaggcaca aaaatgtgag aaaccctgta actggcaaaa taaggaagga 1140 caaggagaag acccgggaaa ctagaagagg tgctgggtgt ggcggaaccc agtaccgcac 1200 aaacgcaacc ccgcctgttc attaaagata ggcgatctgg aatgacttac ttaatagata 1260 cgggtgcagc aatgtcagta atccctccat cgatagcggg tagacggacg ccgcagaaat 1320 ttccgttgta cgccgccaac ggcaccgtga tacacactta tggcgagaaa cagatgaggc 1380 tggatttagg actgtgcaag gcattcacat ggacatttat tgtagcggcg gttcccaagc 1440 ctataattga cgcagattta atgttccatt acggactctt ggtggacctt aaaggtcgtc 1500 aaattatcga ccagcagacg ggactgacat cgaaaggata tgtcgtaagg gtaatatcac 1560 ccagtatatc agcgattcca gaaggccagc catttgcgaa cattcttaag gattttcctg 1620 agataactcg cccctcaccc atcatagagc atcaggcacc agttaaacac tgcattctac 1680 accatataga aacttcaggg cctccggtgg tcgctaaggc aaggcggtta gcgccggaaa 1740 ggtatcgcgc agccaaagaa gagtttagaa ggttgatgga ggaaggaatt tgtaggccct 1800 cgaagagccc atgggccagt cctcttcacg tggttgtaaa aaaggatggg tcgcttagga 1860 cttgtggaga ctaccgacgc ctaaacgccc aaactctacc tgataggtat ggggtaccga 1920 atatactgga tttcaacgta cacttacacg ataaaaaaat cttttcaaag atcgatttaa 1980 ttagagcgta ctatcaaatt ccagttgcag ccgaggatat ccccaagacg gcgctgatta 2040 ccccgtttgg gctctacgaa tttactcgta tgtgttttgg ccttcgcaat gccgcgcaga 2100 cattccagcg ttttatggat gcaattttta gagacctgga ctactgtttt gtctacctcg 2160 acgacatcct agtggcatca gagaacgtac aacaacacga acaacactta agagaggttt 2220 ttcacagatt aaaggaacac ggtattgtca taaacccgaa caagtgctta ttcgggaagg 2280 atgaaataga atttttagga tataaggtaa caccttgcgg cattaaaccg ctcgaaacta 2340 gagtccaagc tatagcgcaa tatccactgc caaataccgt gtctgagcta agacgttttt 2400 tgggcatggt taatttttat agaccttgcc ttagaaaggc ggccgagacc cagcgccctt 2460 taaataggta ccttaaaaac agtaagaaaa acgataagag agtcatcgaa tggaccgatg 2520 aaacacgaac cgcttttgaa cgttgcatca acgacctcca agaggccaca taccttgggc 2580 accccgcaat agacgcatac gtcacactgg cttgcgatgc ctccgacacc tccgttggag 2640 cggttttaca acagcgagag ggcgacgttt gggtaccttt ggggttctac tccaaaacgt 2700 taaacccagc gcaagtcaaa tatagcacct atgatagaga attactggcc ataatgctag 2760 cgatcaaaca tttcagacaa actttagaag gcagaaattt tgctttctta actgatcaca 2820 agcccctgat ctacgctttc cgacagaaaa aatcgccaga ttcgcctatg cgcactaggt 2880 ggcttaattt cataagcgaa tttaccaccg atttgcggca cgtagaggga gaaacaaatc 2940 gagttgcaga cgccttatca cggatcgaag acgtatccct agagccggac tgggcgaact 3000 tatcacacgc acagggaaat gaccccaaac tccattcact tcgtaaccag tcaaacctct 3060 cttggaacca tctcattatt ccggggtgtg aagagcacat ttggtgcgaa atgaccactg 3120 gcaattgccg tccatattta cctgctgcgt ttaggagatc cacgtttgac agaatgcact 3180 cattcagcca ccccggacca agagcaagcc gcaagttgat cgggagcaaa tatttctggc 3240 cccggatgaa ctctgatatt ggcaaatggg ctaagagttg cttatcctgc caaaaggcaa 3300 agataacgag acacaccgtg gcacccctgg gaagtttccc tgcttgtaag aggttcgaat 3360 atgtgcacat cgatttcatc ggcccactac ctccatcgaa aggttttttg tactgtgtca 3420 ctatgataga tcgctttact cgatggccag aagtagtgcc tgtacaggat atatgcgcgg 3480 aaactgtagc aactgttttt tttcagtcct gggtgtccag gttcggaaca ccccgcataa 3540 taacatccga ccaaggaagg caatttgagt cggcgctatt ccgtagtctt atgaatattc 3600 tggggacaaa tcgcataaga actacggcgt atcaccccca agccaacggc tgcgtcgagc 3660 gaatgctgaa aagcgctctt attgcgcata caaatccagc aagttgggca gatgcacttc 3720 catcagtact gctgggatta cgggttgcag ttagaaaaga cacaaatgcc agcgtggcag 3780 acgcagtgta tggtcaggcc ttgaggatac caggagagtt tctcaccccg gcaaccggat 3840 ccgaaaatat taatcacata cagcagttga gggcgtttgt gagtcaactc cggccgccag 3900 cagagccgac atcgcaaaga actttcatac acccccaact ctctacttgt acacatgtat 3960 tcctcaggga cacgacccca aagaaatcct tacaaccaac ttatagaggg ccatatcaag 4020 tcctgtctag gggtagtaag acgttcaata ttgatttgga cggaaaggag actactgtgt 4080 caatcgaccg tctcaagcct gcttttatac tggctgaccc aggtgattcg acacccactc 4140 caacgccacc agcgaagaca tccgtaccct cacaacaaac accaaactac accacgaccc 4200 gttccggcag agtggtcaag ccctccgttc gattcctaga tgccgttact gggggggagt 4260 gatgtggggg agccccataa ccagtaacgt catctaggat ctgtatcgaa cataagatcg 4320 atggataacc tagggcaggg ttacactaaa accgtgtgca gatatata 4368 // ID Gypsy-4_TCa-I repbase; DNA; INV; 4707 BP. XX AC ChLG6; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_TCa_; KW Gypsy-4_TCa-LTR; Gypsy-4_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4707 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG6; Positions 4067553 4062847. XX CC Positions [1718-2221] - Reverse transcriptase CC Positions [3296-3781] - Integrase core CC 'AGTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(77..1105,1109..4138) FT /product="Gypsy-4_TCa-I_1p" FT /translation="MAEENNLPEDISNLEDENVTEEEQQDNHPPAPKRAKV FT VWQETLIALVAAQQKQLSEMTEITRHSKIATGPTPSTSATGEGSAQALANT FT SFRLSEFNPENSDYAIEEWLDIATKLKAELHIGDVLMIAKAGEALKGSAHR FT YYCDWRPVHRTWDEFCKDLIVAFPDRETPGARAFTAATLRSRDCESLSDYG FT IRKLRSINRFHRDLPWNTILSMVEYGLDHGEAQATIRMHQPTGDRELLKIL FT SEFDARRRKQRVMQNQSSRSTDVSSMPRRREKFVKGSCFRCGQSGHHKNNC FT NVDIDKDTGTKVAVETKDSPPTCTHCKKMGHTEPNCWLKHGRPKKAFVVKK FT RHLNTTPMASLLTKNNKFRFVYLIDSGADVSIIKHSVVKALDARIRNTSES FT CAFAGVGTQTVYAVGVSELIIFLPKVTLEVAFLVVPDSAIPGNIDVIIGWD FT VISRPCLRIEKTSEGLELHHDHRNLPKVLTTHCLKINRSGLTDDINQRLET FT MLETYRNNTPDHITTGKMQIRLKDTAPVAYHPRRLAYEERRQVKQIVSKLL FT QDGIIRESHSDYASPIVLVKKKNGELRMCVDYRDVNKRVFKERYPLPHIQD FT QINSLCYAKFFTTLDMKSGFYQMEIEEESKHITAFITPDGHYEFNRMPFGY FT VNAPSIYQRAIDKALGDLKGSKAFVYLDDVLVPSTTIEEGLDTLQEVLCAL FT TAGGFSLNYEKCVFFATETEYLGVVLSEGTIKPSPRKVKALTETPVPSNIK FT SVRQFMGLAGYFRRFIQGFSKITAPITALLRKDQVFNWTPECESARQLIIS FT KLTEPPILRIYNPELPCQLHTDASSVGIGAALLQVENGAAYPVAYYSRRTT FT DYESRYPAYDLETLAIVEAVEHFRVYLYGVHFTVFTDCNSVRATALKKNLH FT RRVAHWWMKLQDFDFSIEYRPGKQMAHVDYLSRNPVDEESDLKVCVLKTIS FT ANKISDVQTLREFQNNDSFCREILNDPDCSQDFTVINNVVVTKTKPQKCFV FT PIAARLLAMKLYHDYSSHIGWDKCIQKMREDLFWPKMGQCLKKYIKNCRSC FT VLGKSHTGPRSGLWQHGEQPSDILETWHIDHAGPIMKSNGCTQILVIIDAF FT SKYCRLQPIPKKTSEDSICALLAVFEELGRPKRIIADRGTAFTSTMFQNFL FT SEQGVKLHHIATGIPRGNGKVERLMRTVFNLLRATLTAKKENTWTVAIAAI FT EDNLNSTVHSATGYVPAVLQLGINPRLVATQQFLGDAPTSDHFVDPDKAVA FT DARVRMQENTKKHAQRFDATRFRSSLFLEGDKVAVEDSQLAGGGKLKAKYK FT GPYTVSKRLPNERYLLTKKGQRTTVAAHEQLRSWPSTETSD" XX SQ Sequence 4707 BP; 1471 A; 964 C; 1072 G; 1200 T; 0 other; tcagaagtgg gattcgaaaa tcctaccgcg aagttcccac tagtgtctgt tgtgaaatag 60 tttaagacaa aggacaatgg cggaggaaaa caatcttcct gaagacatct cgaatcttga 120 agacgagaat gtgacagaag aagaacaaca agacaatcat ccccccgccc caaaacgagc 180 caaggttgtg tggcaggaaa cgctaattgc gcttgtagct gctcaacaaa aacagttatc 240 tgagatgacc gaaataacac gtcattccaa gattgcaact gggccaacgc ctagcacgtc 300 agcaaccgga gaaggttctg cgcaagcttt agcaaacaca tctttccgct tgtcggaatt 360 taatcccgaa aatagcgatt acgcaatcga ggaatggttg gacatcgcaa ctaaacttaa 420 ggcagaatta catattgggg atgtcctcat gattgccaaa gcgggcgaag ctcttaaggg 480 gagtgcccat cgatactatt gtgactggag accagtacat cgcacctggg atgaattttg 540 caaagacctt atagttgcgt tcccagatcg cgaaacacca ggtgcccgag catttactgc 600 tgctactctc cgcagtcgcg attgtgagtc acttagcgac tacggaatcc gaaagcttcg 660 atccatcaat cgattccatc gcgacttgcc ttggaacacc atccttagta tggtagagta 720 tggcttggac cacggagaag ctcaggctac catccgcatg catcaaccca ctggtgatcg 780 ggagctcctg aagatcttga gcgagtttga tgctcgccgc aggaaacagc gagtcatgca 840 aaaccagagt tctcggtcga ctgatgtgtc tagtatgccg cgacgtcgtg aaaaatttgt 900 gaaaggatct tgctttagat gtggtcaaag tggacaccat aaaaacaatt gtaatgtgga 960 cattgacaaa gatactggta ctaaggtggc agtagagaca aaagatagcc caccaacgtg 1020 tacccactgt aagaagatgg gacacacgga gccaaattgt tggttgaaac acggcaggcc 1080 gaagaaagcg tttgtggtga aaaagtgacg ccacttaaat acaacgccaa tggcttcgtt 1140 gttaacgaaa aacaataagt ttaggttcgt ttacttaatt gatagtggtg ctgacgtttc 1200 catcataaaa cattctgtag ttaaagcgtt ggacgctcga attcgtaaca ctagcgaatc 1260 gtgtgctttc gctggtgttg gaactcaaac cgtatacgct gtgggtgttt ccgaactaat 1320 tatctttttg ccaaaagtga ctctcgaagt agcctttttg gttgtgcccg attccgcaat 1380 tcctggtaat attgacgtga ttattggctg ggatgtgata agtcgacctt gcttacgcat 1440 cgagaaaaca agcgagggtc tcgagttaca ccatgaccat cgcaatcttc cgaaagtatt 1500 aacaacgcat tgcctaaaaa taaaccgaag tggtctcacc gatgacatca accaacgatt 1560 agaaacaatg ttggaaactt atcgaaataa tacaccggat catattacta ccgggaaaat 1620 gcagattcgt ttgaaagaca ctgctccagt agcgtaccat ccaaggcgtt tagcttacga 1680 ggagaggcgt caagtgaaac aaatagttag taagcttttg caagacggta ttattcgtga 1740 aagtcactcg gactatgcta gcccgatcgt acttgtgaaa aagaaaaatg gtgaattaag 1800 aatgtgtgtg gattaccggg atgttaataa acgcgtcttt aaggaacgat accctttgcc 1860 acacattcaa gatcaaatta actctttgtg ctatgccaaa tttttcacaa cactagatat 1920 gaaatctggt ttctaccaaa tggaaattga ggaagagtca aaacatatca cagcgttcat 1980 cacgcctgat ggccattatg aatttaatcg aatgcctttc ggctacgtga atgctccttc 2040 gatttatcaa cgagccatag acaaggctct aggagattta aaaggcagca aagcttttgt 2100 ttacctcgat gatgttttag ttccatccac gacaatagaa gaaggactag acactctaca 2160 agaagtgctt tgtgcgttga ctgctggtgg tttctccctg aactatgaaa aatgtgtgtt 2220 tttcgctact gagactgaat atttaggtgt tgttttgagt gaaggtacca ttaaacccag 2280 tccgcgaaaa gttaaagctt tgactgaaac acccgttcca tcaaacatta aaagtgtacg 2340 tcagtttatg ggacttgccg gatattttcg ccgctttatc caaggatttt cgaagataac 2400 tgctcccata acagcattgt tgcgtaaaga ccaagtgttc aactggacgc cagagtgtga 2460 aagtgctaga caactaatca tcagcaaatt aactgaacct ccaattttgc gaatttataa 2520 tcctgagtta ccgtgtcagt tacataccga cgctagttct gttggaattg gagcagcgct 2580 tcttcaagtt gaaaatggtg cggcttatcc agtggcgtac tatagtcgtc gtacaactga 2640 ttatgagtcc agataccccg catatgattt ggaaacactg gctatcgtcg aagcggtaga 2700 acatttccga gtttatttgt acggtgtgca tttcacagtg tttacagact gtaattcggt 2760 gcgagcaacc gctttgaaaa agaatcttca tcgtcgtgta gcccactggt ggatgaagtt 2820 acaagatttt gatttttcaa tcgagtatcg acccggtaaa caaatggccc acgtagacta 2880 tttaagccga aatcccgtcg acgaagaaag tgacttaaaa gtgtgtgtgt tgaaaacgat 2940 aagcgcaaat aaaatatcag atgtccaaac attacgggaa ttccaaaaca atgattcgtt 3000 ttgccgcgaa atactgaatg acccggattg tagtcaagac ttcactgtaa taaacaatgt 3060 agttgtgaca aaaacaaaac ctcagaagtg ctttgtacct attgctgctc gtttattggc 3120 gatgaaattg tatcacgatt attcgtcaca cattggctgg gacaaatgca tacagaagat 3180 gagagaagat ttattctggc caaagatggg acaatgtcta aaaaaataca tcaaaaactg 3240 cagatcatgt gtccttggta aatcacatac tggtcctcgt tcagggttat ggcaacatgg 3300 agagcaaccc agtgacatct tggagacctg gcatatcgat cacgctggac ctattatgaa 3360 atccaacggc tgtacacaaa ttcttgtaat aattgatgcc ttttcgaaat attgccggct 3420 gcaaccgata cccaagaaaa cttccgaaga ctctatatgt gcattattag cagtgtttga 3480 agaattggga aggccaaaac gcatcattgc ggatcgagga actgctttca catcaacgat 3540 gttccaaaat tttttgagtg aacaaggtgt aaaactgcat cacattgcta ctggtattcc 3600 aagaggaaac gggaaagttg aacgtcttat gcgaactgtt ttcaaccttc tgcgagcaac 3660 attaactgcc aaaaaagaaa atacctggac cgttgcaata gcggcaattg aagacaattt 3720 aaattctact gttcattctg caactggtta cgtacccgca gtactacaat tgggaataaa 3780 tccgagattg gtcgcaactc aacaattttt gggagacgcg ccgacaagcg atcatttcgt 3840 cgaccctgac aaggccgtcg ccgatgcacg tgtccgtatg caggaaaaca cgaaaaagca 3900 cgctcagcga ttcgatgcga ctcgatttcg ttcaagttta tttttggaag gagataaggt 3960 agccgttgaa gattctcaac tagctggtgg tggaaagtta aaagcgaagt acaaaggccc 4020 ttatactgtg tcaaaacgac ttccgaacga gagatatctg ttaacgaaaa agggacagag 4080 gaccactgta gctgcacacg aacagttgag atcttggccc tcaacggaga cttctgatta 4140 agttactaag taagtaaagt aaacaaaacg gatcgaatgc aaataaagtt gattacaaac 4200 aataaaaagt gaatggctat atcactcaaa gtgcaattaa tcggctagtg gcagtcctac 4260 taggcctgga aggtggcagt accacctgat gtgtgtaaat gtgaagagtg gcagtaccac 4320 tggcaaatta gggtggcagc accacctggt atgtctggta tgtactggta tgtacagagt 4380 agtggcagaa ccactaggta aaaagggaag gatggcagta ccacctgatg acaacagtcc 4440 aagtgagttg gttagtggca gtcccactta ccgtacaaaa aaaaaaaaaa caaacacatt 4500 agtggcgatc ccacaaattg taacagaaat aaagttgtga cattcctttc gtgaaatcag 4560 agacaaaaca gagtaagtct gatagtcctg aacaaggata tcaaagacaa atcaaagtag 4620 tgaagacagg agtactgtct tgtgatttct gttttcagaa tttcccttga gttgcgttgc 4680 gaggacgcaa caggaacagg atggccg 4707 // ID Gypsy-66_CQ-I repbase; DNA; INV; 6893 BP. XX AC AAWU01038782; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-66_CQ_; KW Gypsy-66_CQ-LTR; Gypsy-66_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6893 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 511-511 (2011). XX DR Genome; AAWU01038782; Positions 4788 11680. XX CC Positions [4662-5138] - Integrase core CC 'AAAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 472..2400 FT /product="Gypsy-66_CQ-I_1p" FT /translation="MSHELDLIYRSMNVKHLLDDEVEHELMIRQVNFSSGD FT SRESKVRLLRVTLRNQRDSNSFKVILLDENMLEVEFCQIKEKVGNIRKQLE FT SGKVKKEDLPPYKTRLVHLIFRLIRLKRVKFNTCGLDQNAVRLYNEYFTVW FT SNNPEKAREAARRLELEISKIGAIDELITESDDTEDSGEGIEAVSAEGESS FT ENEDDDLPAKKKKTSTPKKILKNKQTGPIDTQKIVEIVVDQVSKHFQEQFR FT NLMLDDRFMGGIVSGKKSGKTKDRKGKSQQKANGKSRVEKPRKGERRSKKK FT QQSTTESEETVSSEGDDSLDSSDDSEPEPRRERKLRRPRPVSEWNLRYDGK FT DDGKRLNKFIAEVEGMAEAENLSKRALFKEAIHLFSGEVRTWYMEGKKNRD FT FTNWKELLVELKVEFQPPDLDYHYEQQATLRRQRRSEKFSEYYNAMKEIFS FT FMARQPSEDRKFDIVYRNLRADYRSALLVKEIKTLRLLKVWGRKLDAANWF FT LYRGKETGSAQKSSQVNEVYQSSSQKFDKPQPKPDWKPKGTGNKPGWSDNR FT KQSDQPRKPTENKNEKSSQPGPTKTQQQPGNSRATLEERVVRHRVPDSTVC FT YNCRGKYHHFKACLKERELFCAVCGFYDFTHENCPFCAKNGRKSA" FT CDS 2526..5537 FT /product="Gypsy-66_CQ-I_2p" FT /translation="MGLKVTGLLDSGAQVSILGAEGDHLLRKLNLREFGSE FT RKLFSAGGDELVVRGYVHLPITFNGTTRLVTTLISPTLKRKMILGMNFWRQ FT FGIEPTVTDAQVEEIEEAQLEEEPLSAAQQAQLAKIKKDFKAFEEGQTLET FT TPLITHKIEFLEEYEHADPVRLNPYPWSPEVQKHVNSELDKWIEAGVVERS FT TSDWALLIVPVVKKSEDDTGETQMKVRMCLDARKLNERTRRDAYPLPHQDR FT ILGRLGKSRFLSTIDLSKAFWQVPLDPESRKYTAFRVFGRGLFQFTRLPFG FT LVNSPATLSRLMDRVLGYGELEPNVFVYLDDIVIANDTFEDHLRCLREVAK FT RLKAANLSINVEKSKFCVPELPYLGFILSKEGVRPNPDKVEAIINFERPSS FT VRSLRRFLGMVNYYRRFIEGFSDVTAPLTDLLKGKPKVVQWNAAAEGAFIE FT LKQRLISAPILANPNFELPFTVQTDASDSAIAGVLTQEHDGVEHVIAYFSR FT KLTTPQRSWKAAEKEGLAALEAIEKFRPYIEGTQFTLITDSSALSFIMNTK FT WKSSSKLSRWSMLLQQYAMTVRHRKGSENIVPDALSRSVEAVDVGSREDWY FT ADLFRKVAESPDDYVDYKIENGKLYRFISSPSDVMDYTFEWKLCVPTELRK FT SVLKEEHDESMHPGYEKTIQRLKMRYLWPRMAVQCKKYIKACSVCKQSKPS FT TVGSVPVMGNQRITNKPFQILALDFIQNLPRSKNGKCHLLVLLDMFSKWTV FT LVPLRKIEAKEVCGIVEDQWMRRFGTPEIIISDNATTFLGKEFQALLRRRG FT IQHWPNARHHSQANPVERANRTINACLRTYMKEDQRVWDSRIAEVEEMMNT FT TVHSSTGLTPYRILYGHEKATQGEQHRIERDERELSMDERDESRKRMNEKV FT FKIVEDNLKKSYEKNVKNYNLRHKRFAPTYSVGQRVLKRNFKLSSAADRYN FT AKYGPVYVPCVVVARRGTSSYELADETGKNIGVFSAADLRPDDTDVHSQ" XX SQ Sequence 6893 BP; 2071 A; 1328 C; 1841 G; 1653 T; 0 other; attggcgccc aaacaaaaat cgcaaaaacc aacgaggatc gattagagat tttggcaatt 60 tttcaacccg agtgggagga gaataccaaa caaatttgcg acttgttttg gtaggatccg 120 ttgctagact ttaccttagt gggcctagtg agaaataatt gtgaatttgt aagttgctag 180 tggtggtatt tgaactcgat aaaaatattt ctagcagtta gaggtggatt aaacgagcat 240 catagacatc tggttaaaat ttgagaatgt aaagccgaag aaacttaaaa taaaaggtcc 300 tgagcgatat taaaaataaa actacaagtc aacagaattt caatttaatt tatttctttt 360 gtatttgtta gtgtgaacat attttgtatt ttaatttagt tgatttatac tgaattacat 420 tcagtttaat tgattttatt atttattttg tgattgttta aaattatcaa gatgagtcac 480 gaacttgatc taatttatcg ttccatgaac gtaaaacatc ttcttgatga tgaggttgag 540 catgaattga tgattcgtca ggtcaatttc tcaagtggtg attcgagaga aagcaaggtg 600 cgtttgttac gcgtcacttt gaggaatcag agagattcaa acagttttaa agtgattttg 660 ttagacgaaa acatgctaga ggtcgagttt tgtcaaatca aagagaaggt tgggaatatt 720 cgaaaacaac tggaatcagg caaagtaaaa aaagaagatc ttccaccgta caagactagg 780 ctggtacact tgattttccg gttaatacgc ttaaaacgag taaaattcaa tacgtgtggc 840 ctggatcaaa acgccgtgag gttgtacaac gagtacttca cggtgtggtc taacaatcca 900 gaaaaggcca gggaagctgc aaggcgattg gagttggaga ttagtaagat cggtgccata 960 gatgagttga ttacggagtc agacgacaca gaggattcag gagagggaat agaggcagtg 1020 tcagcagaag gcgaatcgtc agagaatgag gacgatgatt tgccagcaaa gaagaagaaa 1080 acgagtacac cgaagaagat tttgaaaaat aagcaaactg gtccaataga cacccaaaag 1140 attgtggaaa ttgtcgtgga ccaggttagc aaacatttcc aggagcagtt cagaaatctg 1200 atgttggatg atcgctttat gggtggcatt gtaagcggaa agaagtctgg gaaaacaaag 1260 gataggaagg gaaaatccca gcaaaaggca aacggaaagt caagggtgga aaagccgcgt 1320 aaaggtgaaa gaagatcaaa gaagaagcag caatcgacta ctgaatctga ggagacggta 1380 tctagtgaag gagatgactc gctagacagt tcggatgatt cggagcccga accaaggcgt 1440 gaaaggaagt tgcgtagacc acgtccggta tcagagtgga atcttcgcta cgatggcaag 1500 gatgatggaa aacgtttgaa caagttcatt gctgaggttg aaggtatggc agaagcggaa 1560 aacctaagca agagagcgtt gtttaaagag gccattcacc tgttctcagg cgaagttcgc 1620 acctggtata tggaaggcaa gaagaacagg gatttcacga attggaagga gttattggta 1680 gaactaaaag tcgagtttca acctccagac ctagattacc attacgaaca gcaagccaca 1740 ttgagaagac aacgcagatc agagaagttt tcggagtatt acaacgcgat gaaagagatt 1800 ttcagcttca tggcaaggca gccatcagaa gatcggaagt tcgacatcgt gtaccggaac 1860 ttacgcgcag attatcgtag cgctcttctt gtgaaagaaa taaaaacgtt acgattgttg 1920 aaggtttggg gacgcaagct ggacgctgcc aactggttcc tgtatcgcgg taaagagacg 1980 ggatcggcgc aaaagtcgtc acaggtgaac gaagtctacc aaagctcgtc acagaagttc 2040 gacaaaccgc agccgaagcc ggattggaag ccgaagggta ccggaaacaa accaggttgg 2100 agtgacaaca ggaagcagtc agaccaaccg aggaaaccga cagagaacaa gaatgagaag 2160 tcttcacaac ctggacccac taaaacccaa caacaaccag gaaatagtag agcaacactg 2220 gaagagcgtg tggttcgaca tcgagtaccg gacagtacgg tgtgttacaa ctgtcgcggg 2280 aagtatcatc acttcaaagc gtgcttgaag gagcgggaac tgttttgtgc ggtgtgtgga 2340 ttttatgatt ttacacacga gaactgcccg ttttgtgcaa aaaacggtcg caagtcggcg 2400 taggaggtcg tcgccgagtg cgattcagaa gacctcagac agtttttatc ccagctaacg 2460 cagaagtaga agaattgata gtagaggtgg agggtgacaa cagaccgttt gtgaccgttg 2520 atgtgatggg gttgaaggtg accggactat tagatagtgg agctcaagtt tcgatacttg 2580 gagcggaagg agaccacctg ctgaggaagc tgaatcttcg ggaatttggt tccgagagga 2640 agcttttctc tgcaggcggc gatgagttag tagtacgagg atatgtacac ctaccaatca 2700 cattcaacgg cacgacgagg ctagtcacca ctcttatttc acccactctc aaaagaaaaa 2760 tgattttggg aatgaatttc tggcgtcagt ttgggataga accaacagtt actgatgctc 2820 aagtagaaga gatcgaagaa gcacagctgg aagaagaacc gttgtcggcc gcacagcagg 2880 cccaactggc aaaaatcaaa aaggatttta aggcttttga agaaggacaa actcttgaga 2940 ctacaccgct gattactcac aagatagagt tcctggagga gtacgagcat gctgacccgg 3000 ttcggctcaa cccgtatccg tggtcaccgg aagtgcagaa gcatgttaac agtgaactag 3060 ataagtggat tgaagcgggg gtggtggaga gatcgaccag cgactgggca ctactcatcg 3120 tgccggtcgt gaagaagagc gaggatgaca ctggagagac acagatgaaa gtgaggatgt 3180 gtctggacgc gcgcaagctg aacgaaagga ctcggaggga tgcctaccct ctacctcacc 3240 aggatcgcat acttggccga ttagggaagt cacggttctt atccactatc gacctttcaa 3300 aggcattttg gcaagttccg ttagacccgg agtcgcgcaa gtatacagct ttccgggtgt 3360 tcggcagagg gttgttccag tttacccgac ttccatttgg tcttgtcaat agtcctgcga 3420 cgttgtccag gcttatggac agggtcttgg ggtacggtga actggaaccg aacgtgttcg 3480 tctatttaga cgatatcgtc atagcaaacg acacgttcga ggatcacctc cgatgcctca 3540 gagaagtggc aaaacggttg aaggcagcga atctgtccat aaacgtagaa aaatcgaagt 3600 tttgcgttcc agagcttcct tatttaggct tcattttgtc taaggaagga gtccggccga 3660 atcccgacaa agttgaggcg ataataaact ttgaacggcc gtcatctgtg cggtcattgc 3720 gaaggttttt aggcatggtg aattattacc ggcggttcat agaagggttc agcgatgtaa 3780 ctgcgccctt aacagatctg cttaaaggga aaccaaaggt ggttcaatgg aatgcggcgg 3840 cagaaggcgc gttcatcgag ttgaagcagc ggctaatctc tgcgccgatc ttggccaacc 3900 cgaacttcga gctacctttc accgtgcaga ccgacgccag tgacagcgcg atagcaggtg 3960 tattgacgca ggagcacgac ggagttgagc acgtgatagc gtacttctcc cggaagctga 4020 ccactccgca acggtcgtgg aaagcagccg agaaggaggg cctcgcggca ctggaagcga 4080 ttgagaagtt ccggccgtac atcgagggaa cgcagttcac gctgattaca gactcgtcgg 4140 cgttgtcatt cataatgaac acaaaatgga aatcgtcatc gaagttgagc aggtggagta 4200 tgctgttgca gcagtacgca atgaccgttc gtcaccgcaa aggttcagag aacatagttc 4260 cggacgccct ttcgcggtcg gtcgaagctg ttgacgtagg tagccgggaa gattggtacg 4320 ctgacttgtt tcgcaaagtg gcggaatctc cggatgacta cgtggactac aagatcgaga 4380 atgggaagct gtatcggttt atctcatcac catcagacgt tatggattat acgttcgagt 4440 ggaaactgtg tgtgcccacc gaactgcgta agtcggttct taaagaggaa cacgatgaga 4500 gtatgcatcc aggctacgaa aagactatcc agcggctgaa gatgagatat ttgtggccga 4560 gaatggcggt tcagtgcaag aagtacatca aagcgtgttc ggtttgtaaa cagagtaaac 4620 cgtccacagt aggctccgta ccggtaatgg gaaaccagcg catcacaaac aaaccattcc 4680 agatattagc acttgatttt atccaaaatc tgcctcggag caagaacggg aaatgccatt 4740 tgttagtcct tttggacatg ttttcaaaat ggaccgttct tgtcccattg aggaagattg 4800 aggcgaagga ggtgtgcggt attgtggagg atcagtggat gagacggttt ggaaccccag 4860 agattatcat ctcagacaat gcgacgactt tcttgggaaa agagttccaa gcactgttgc 4920 gtaggagagg gattcagcac tggccaaatg cgaggcatca tagccaggcc aacccggtcg 4980 aacgagcgaa tcggacgatc aacgcgtgct tgaggacgta catgaaagaa gatcagcggg 5040 tctgggacag cagaatagca gaagtagagg agatgatgaa cacgacggtt cattcctcca 5100 ccggtttaac gccctaccgc atcctgtacg gccatgagaa ggcgacgcaa ggagagcagc 5160 atcggataga aagggacgaa agggaacttt ctatggacga gcgggatgaa agccgaaaga 5220 ggatgaacga gaaggtgttc aaaatcgtcg aggataattt aaagaagagc tacgagaaga 5280 atgtgaagaa ctacaatctc aggcataaac ggtttgcacc aacttacagt gtaggccaac 5340 gcgtgcttaa gcggaatttc aagctgtcct cggctgcgga ccggtataac gcgaagtacg 5400 gtcccgtcta cgttccgtgt gttgtcgtcg cacggcgtgg aaccagttcc tacgagttgg 5460 ctgatgaaac cggcaagaac atcggggtgt tttccgcggc ggatcttcgt cccgatgaca 5520 cagatgtcca ttcacagtaa tcaactcact aaaaaggaaa gaaaaggtgt aaggacctaa 5580 ctcaatgaca cagcaacgaa ctcaaggtac cagagtatcg gtgtccccta taacatgttc 5640 atggcagatg cgtgggcgtg ttggtcatca ttgcgacaaa gccgaacgaa cacttgatgg 5700 agcgaattaa gatcctcagg tggtgatcac gttgcggcga tgggcgtagc gaccaggctg 5760 gtgtacgatc ttaatccacc ggtcatgtaa acagcactag tggtgaagag tggtcatctc 5820 aaaacagcag agacgctgga gatcgagtga tcgttcgtca cgagccacag tgtggtgaat 5880 tcagcgttca gcgtagagtg ataaggccag aatgatcggg tgttgccata acagcgagaa 5940 gtgagaagat tcctccttcg ctggtgagat ggtgatcgcg agatctggaa gatgagaaga 6000 ttgatgagta attgcgcatg tatacagtcc ggactgatga agaatagaga tctctgtaga 6060 gtagtatttg tagtagtgat agattagaat aataaattgt aggtttttag aaagaagaag 6120 tagaagatcg tggaacgcgt cgtacttacc atccatacag ttgaaatttc ttcttttcac 6180 gttgagctga tgatcgtaac tttggtagga gagtcccgta ggtgtaaatg gaagaaaaag 6240 cagctttcaa ggcgccgaaa gcttaacttc gcgaacagga gctcacttgt agctcacacc 6300 tttgtagatc aggcggccac ctggtcggac aacctgaaaa ggaaaccgtt cactaagctt 6360 aacgtgatag ttgatacata atgatctact tatcttgaac acttcttgag aaagaccacc 6420 tattttaatc cgtttagtag aaattttcac agtttttttt attttttttt ttacactttt 6480 gtaaattaat cggcaacgta ttaccgttcg tgaccgcgtg tgagaagcga ctgaaagttg 6540 gctctttctc ttactgtgag taagccagaa atgagattta acgaaagttg atgtagtaga 6600 tttaaattca ccattggtgt aattcggcat gcatatactc aggctaaaaa tcacacgtgg 6660 ccaattatgg catatttcgt gaagaaatgt cgcttttagt tgccagattt taatgtgttt 6720 atgatgtcta tatgttgtaa tttgttttaa ctgctgcgcg agacgaatta tgtgtataca 6780 tgtttgtaaa tgttctactt aaccacaaaa tagcttccaa tttgcaaata cttttccaaa 6840 ggcttacata attgaacctg accagttcaa ttattaacta aggtggggga tag 6893 // ID BEL-96_AA-I repbase; DNA; INV; 5576 BP. XX AC supercont1.22; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-96_AA_; KW BEL-96_AA-LTR; BEL-96_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5576 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.22; Positions 2753059 2758634. XX CC Positions [4531-5139] - Integrase core CC 'ATAGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 261..1601 FT /product="BEL-96_AA-I_2p" FT /translation="MVDEKRLKVKEVKRRNAIDVFKRMDIFLASYVPEQHQ FT REIAPRLDRLEKVWEIFEAVQDECEELDEVEESVQKNLELRGQVEALYFRV FT KAGLVAKLPPAPVPAVPGPSSAPITPSPLANVKLPTISLPEFDGDFNTWLT FT FHDTFLSMIHSSTEISQVQKFHYLRAALKGEAASSIQSITITANNYAVAWD FT TLVNRYSNKAILRKKHIRALLKYPKIPNNNVEALHKVVDEFQRHTKVLEQL FT GEPVDHFSSILIELLEDKLDDASLTAWEESIANDAHPTYDKMVDFLLKRAR FT ILETIAINRPQHSVAKPAAHNSASKKPNQPRISTMAAAEIPPKTFPICPAC FT DKHKHSIFDCSVFNGLDTKGRLKVVTDKKALQQLLPQRPFCPQLSLQIFLQ FT TLLQATPFHDPPGTIRDCFRRKFFAQYRRYAAQHFQCYHRSGSSTRTRDCD FT HG" FT CDS 1468..5331 FT /product="BEL-96_AA-I_1p" FT /translation="MIHPGPFEIASEGNFSPNIDATLPSTSSVTTAVVAAP FT VPEIVTTAKSSSTNVLLTTVVLIIVDVYGQEHIARALLDTGSQPNAISERL FT CQQLHLPRKVVNVPIAGVDSIVTNAKHEVRAEIRSRIANFNESLDFLVLRK FT VTHDSPSMSFSTSQWRLPDNLPLADHDFHTSRKVDMIIGAGHFYSFLRDGK FT FRLHNNGPLLVETVFGWIVSGKFEAISDSNSQPTVTCHAATVTSISDQLER FT FWQVEELYGSNYSIDEQHCEDYYRETVSRDPTGRYIVRMPKHPDHDQMIGN FT SKLTALRRFLLLERRLSKDNTLRAQYHDFIREYESLGHMKPVPLHAKDDPK FT ACYLPHHPVLKDSSSTTKVRVVFDGSAKTSSGHSLNDTLLVGPVVQDDLFS FT LVIRFRKFAIALVADIEKMYRQIVMHEQDRPLQRILWCLDSTLPVQEFELS FT TVTYGLAPSSFLATRTLLQLVEDEGTPFPHAGTAIKKNTYIDDLLAGADSI FT DDTIQLREELSNLLQKGGFRLRKWCSNSLPVLSGLPSELLGTQSSVRFAAG FT EIIKTLGIRWDLEADVFRFDMSPMVKNQPATKRNILSAIAQLFDPLGIIAP FT VVVQAKILMQHLWLLALDWDDEVSPDLQRKWHQFCEQLPHLSNFSIDRFAF FT VHGYCSAELHCFADASEVGYGACMYIRSEDHEGRVHVSLLASKSKVAPLKP FT LSIPRLELCAALLAARLYEKVISALDMKFSRSFFWSDSTIVIQWLKAPPRT FT WQTFVANRVAEIQALSHGSHWNHISGNENPADVISRGMSADDLVNSELWNC FT GPKWLREERSQWPVPDIQEKRFTVEELEMKKNRILTTQILQPNPLFERFSS FT YQTLLGVVGYCFRFCNYARNKTKKCSSKVLSVIELQNAKLALSKLVQREVF FT SEDLQRLEKGQMVSNKSCLRLLNPFIDADGLIRVGGRLRLSNESFNVKHQI FT VIPGFHPFTQLLLNHHHCKLIHGGVTMTLSVVRDEFWPLNGRRAVRSAIRK FT CYRCSRSNPQPIQQPIGQLPVARVTANEAFACTGVDYCGPIFLKPVHRKAA FT ARKSYICVFVCLSTKAVHLELVSDLSTAAFLMALDRFVWRRNKPQHLYSDN FT GTNFVGAKNELHALYTMLQSGPDNDKIAKHLAEDNIQWHMIPPRAPNFGGL FT WEAAVKVAKTHLVRQLGSALLSFEELCTVLIKIEGCMNSRPLLPLSSDPND FT LGALTPALLGEEYDPPSPRSRRSRCAIQPSRSISGSSEVFPTFLASVAKRI FT FEAAELTIQDQSERLPVKRRRPRYRER" XX SQ Sequence 5576 BP; 1381 A; 1457 C; 1310 G; 1428 T; 0 other; tctggtgccg tgaccaggat ttctggtcct cccgttcccg aacacgtgca caacataacc 60 tcctttcgat ttcgctctcc gtcataaatc tctccggagg ctgtagctgg ttcctgatta 120 taaatataca aggcctctta tagagaggta ccaggtgagt acctctccaa cctgtttacc 180 cccagttgtg cggtacttcc tggatctttt tgatctttca gcccgtctat cgatcagcga 240 cgtcggaacg aggctcagca atggtcgacg aaaagcgctt gaaggtcaag gaggttaagc 300 ggcggaacgc gatcgatgtc ttcaagcgga tggacatatt tctcgccagc tatgttcctg 360 agcaacacca gcgtgagatt gctcctcgtt tggaccgctt ggaaaaggta tgggagattt 420 ttgaagccgt gcaggacgaa tgtgaagagc tggatgaagt ggaagaatcc gtgcagaaga 480 atttggaatt acgtggtcaa gtagaggcat tatactttcg cgttaaagct ggtcttgttg 540 caaaactgcc accagcacct gttccagcag taccaggacc ttcatcggcg ccgattactc 600 cgtcgccact cgccaacgtc aagttaccta ccatatcgct ccccgaattc gatggcgatt 660 tcaatacctg gctgactttc cacgacacct ttttgtcaat gattcattca tcgacagaaa 720 tttcccaagt gcagaaattt cattatctgc gtgctgctct aaagggtgaa gcggccagtt 780 cgatccagtc gatcacgatc acagccaaca attacgccgt tgcttgggac acgttggtca 840 accggtactc caacaaggcc attctacgaa agaaacatat aagggccttg ctcaagtacc 900 ccaagatccc aaacaacaat gtggaggcgc ttcacaaggt cgtcgatgag tttcagcgcc 960 acaccaaagt gctagagcag ctaggtgagc cagtggacca tttcagctct attttgatcg 1020 agttgctgga ggataaattg gatgatgctt cgctcaccgc ttgggaggag tcgattgcca 1080 atgatgcaca tcccacctat gacaagatgg tcgactttct gctaaaacga gcccgcattt 1140 tggagacaat cgccataaat cgtccgcagc attctgtcgc caagccggcg gctcataatt 1200 ccgcatcgaa aaagccaaat caaccccgta taagcactat ggcagcggct gaaatcccac 1260 cgaagacgtt cccgatttgc ccagcatgtg acaagcacaa gcactcaata ttcgactgct 1320 ctgtcttcaa cggcttggac accaaaggcc gtttaaaggt ggttaccgac aaaaaagctt 1380 tgcagcaatt gcttccgcag cgaccatttt gcccgcaact gtcgctccaa atattcctgc 1440 aaacactgct ccaagcgaca ccattccatg atccacccgg gaccattcga gattgcttcc 1500 gaaggaaatt tttcgcccaa tatcgacgct acgctgccca gcacttccag tgttaccacc 1560 gcagtggtag cagcacccgt accagagatt gtgaccacgg ctaaatcatc cagcactaat 1620 gttctcctca cgaccgtcgt gctgatcatc gttgatgtct acggccaaga gcatatcgct 1680 cgcgctttgt tggacacagg ctcgcaaccc aatgcaatca gtgagcgatt gtgtcagcag 1740 cttcatctcc cccgaaaggt tgtcaacgtg ccgattgctg gagtggacag tatcgtcact 1800 aatgcgaaac acgaagttcg ggcagaaatt cgctctcgaa ttgctaattt caacgaatct 1860 ctagatttcc ttgttctgcg aaaagtgacc catgattcgc catccatgtc gttttccacg 1920 tcacaatgga gacttcccga taatcttccg ctagctgatc acgactttca tacttcccga 1980 aaggtggaca tgattattgg cgccggtcat ttttactcgt tcctgcggga tggaaagttc 2040 cgcttgcaca acaatggtcc gttactcgtc gaaacggtat ttggttggat agtatccggg 2100 aaatttgagg ctattagtga tagcaattct caacctacag ttacgtgtca tgcggcgacc 2160 gtaacttcta taagcgatca gttagaaagg ttttggcaag tggaggagct gtacggatca 2220 aattactcca tcgacgaaca acactgcgaa gattattacc gagagaccgt ttctcgcgat 2280 ccaaccggtc ggtacatcgt gcgcatgcca aaacaccccg accacgacca aatgattggt 2340 aactccaagc tgactgcgct ccgccgtttc ctgctattag agcgaagact ctccaaggat 2400 aatactctga gggcgcaata ccacgacttc atcagggagt acgaatccct tgggcacatg 2460 aaaccagttc cgctgcacgc caaggacgat ccgaaggctt gctaccttcc gcaccatccg 2520 gttctgaaag attccagctc cactacgaag gtgagagtcg tttttgatgg gtctgcgaag 2580 acgagttcag gacactctct gaacgacacc ttgcttgtcg gacccgtcgt ccaagacgat 2640 ttgtttagtc tggtcatccg atttcggaaa tttgcgatcg ctttggtggc tgacatcgag 2700 aagatgtacc gccagattgt gatgcacgaa caagatcgtc cgttgcaaag aattttgtgg 2760 tgcctcgaca gtacgcttcc ggttcaagag tttgaattga gcaccgttac ttacggtttg 2820 gctccatctt cgttcctcgc tacacgaacg ctcctgcaac ttgtcgagga cgaaggcacc 2880 ccgttcccgc atgcaggcac agccatcaag aaaaacactt acatcgacga tctactcgct 2940 ggagccgaca gtattgatga cacaattcag ctccgtgaag aactctctaa tctgctacaa 3000 aagggaggtt ttcgtctacg taaatggtgt tcgaattcgc tcccggtcct atccggactt 3060 ccctctgagc tgcttgggac acaatcgtct gtgaggttcg ctgccggtga aataatcaag 3120 accttgggaa tacgctggga ccttgaagca gatgtatttc gtttcgatat gtctcccatg 3180 gtgaagaacc aacctgctac caagcgcaat attttatcag caattgcgca attatttgac 3240 ccgcttggta tcatcgctcc cgttgtagtg caagccaaaa tactaatgca gcatttgtgg 3300 ttactggctt tagattggga cgacgaagta tcgcctgact tgcaacgaaa atggcaccaa 3360 ttctgcgagc aacttccaca tctctccaac ttcagcatcg acagatttgc gtttgttcac 3420 ggctattgct ctgcggaact gcactgtttt gctgatgcgt cggaagtcgg ttatggcgcc 3480 tgcatgtaca tccgctccga ggaccatgaa ggacgtgtac atgtgagctt gctcgcttcc 3540 aaatcgaagg tagcccctct aaaaccgttg agcatccctc gtttggaact gtgtgccgct 3600 ctgctcgccg cccgcttata tgaaaaggtc atctctgcgt tagatatgaa gttctctagg 3660 agcttcttct ggtcagactc aacaatcgtt attcagtggt tgaaagcccc cccgcggacg 3720 tggcagacgt ttgtggctaa tcgtgtcgcc gaaattcagg ctctttctca cggttcacat 3780 tggaaccata tttctggaaa cgaaaaccct gcggatgtaa tatcacgagg aatgtccgcc 3840 gatgatctgg tgaacagtga actgtggaat tgtgggccca agtggctacg cgaagaacgg 3900 tcgcagtggc ccgttccaga tattcaggaa aagcgtttca ctgttgaaga attggagatg 3960 aaaaagaatc gcatcttgac cacccaaata ctgcagccaa atccgctatt tgaaagattt 4020 tcgtcatacc aaactcttct tggtgttgtc ggatactgct tccgattctg caactatgct 4080 cgtaataaaa ctaagaagtg ttccagcaag gtcctctccg tcatcgagtt gcagaatgcc 4140 aaactagcgc tttcaaagct agttcaacgt gaggttttct ccgaggatct acaacggtta 4200 gaaaagggac aaatggtttc aaacaaatcc tgccttcgcc tgctgaaccc tttcattgac 4260 gccgatggat tgattcgcgt cggtggccga ttgaggctgt ccaacgaatc tttcaatgtg 4320 aaacatcaga ttgtcattcc tggattccac cccttcacgc aactactttt gaaccaccac 4380 cattgcaagc tcattcatgg aggagttacg atgactttgt cggtcgttcg tgacgagttc 4440 tggccgctaa atggccggag agcagttcgg agtgccatcc gaaaatgcta caggtgcagc 4500 agatcgaatc cccaacccat ccagcaaccg attggccagc tacccgttgc ccgtgtcacc 4560 gccaacgaag cctttgcgtg caccggtgtt gattactgcg gtcccatttt cctcaagcca 4620 gttcaccgca aggccgctgc tcgcaaatct tacatttgcg tcttcgtttg tttgagtaca 4680 aaagcggtcc atttggagtt ggtaagtgac ctaagtacgg ctgcattcct aatggcactt 4740 gaccgtttcg tttggcggcg gaacaagcct cagcatttgt actcagacaa cggtacgaat 4800 tttgttgggg cgaaaaatga actgcacgca ctttacacta tgcttcaatc cggtcccgac 4860 aacgataaga tcgccaagca tctcgccgaa gacaacatcc aatggcacat gatacctcct 4920 cgcgctccta attttggtgg cctctgggaa gcggcagtca aggttgccaa gactcacctc 4980 gtgcgtcagc ttggctctgc tctgctgtct tttgaagaac tgtgtacggt gctcatcaaa 5040 attgaaggat gcatgaactc tcgaccgctg ttgccgctat cgagtgaccc gaatgatcta 5100 ggtgctctta ccccagcact tcttggtgaa gaatatgatc cgccctctcc ccgaagtcga 5160 cgttcgagat gtgccattca accgtctcgg tcaatatcag gctcttcaga agttttccca 5220 acttttttgg catcggtggc gaaacgaata tttgaagcag ctgaactcac tatacaggac 5280 caatccgaaa ggctaccagt taaacgtcgg agacctcgtt atcgtgaaag atgaatgcta 5340 tccccctgcc cgctggccgt tggcccgcat cattgaactt catccaggac ccgatggagt 5400 gacacgggtt gtgagtcttc gtacgccgtc cggagttctt aagagagccg tttgcaaaat 5460 ttgcccaatg gaatgtgcta tggaagagtg agatgtacat tatattttca tggacaatat 5520 gtttgagtta cgtttagttt aatttcgcaa atagtttgca aaggtggccg gtatta 5576 // ID P-23_HM repbase; DNA; INV; 3134 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 21-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-23_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3134 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 369-369 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 180..2768 FT /product="P-23_HM_1p" FT /translation="MPKKCCVVACRSGYKKKKNDDTVEGFITKTVFNFPSD FT INLCSRWIKFINRINWKPSKFSGICIDHFEEKFIKRGKRNTLDFTKNPIPT FT IHTNKDCFLKPSLLPTPITLRKTPTKRPYSHSLLDETVVFNKLDKICSIND FT LTEQVCPQNFLFKKIYGGVIYYRVIIENITSSTNIESINVDNNLHVKLYHK FT GNLIPLPQWFRQNNKCTLTSVSMLENLVSYMRNRVEEKEKNILSELYSFMY FT YESHGRELYSNELIRFALVLRYTSRQAYNLLLDEFPFPSFSYLKALTQGGI FT EPIKALKYLLKEEKISSDSVLLIDEMYLQKSVQYHGGKFVGKDENGELYSG FT VVVFMLVGIKRSIPFVVKACPETSISGNWLMTEIEECLTNLFDANFKICAI FT ITDNHSTNTLAFKKLIKKYNSTENFFFYYGQNKIYTLFDSVHILKNIRNNL FT LHSKRFIFPKFNFDKFEDSINVEAGDIRWQLFHQVNEKDNLLPANLKKAYK FT LTPTGLHPGKNKQSVKLALQIFHESTYSAIKSYFPKELSAANFLYLINTWW FT TISNSKNIYNTNNKIGNAVVANDQKPEFLNAFAQYVSEWYNTNLQGCHKLS FT LSAQTSNALIVTLKGTASLIKDLLSEGYDYILTSRFQTDPLEKHFGKLRQM FT SGGRFLVSLREVESSEKIISIKSLIKETLALNTWFNNEVPVNNDAKELETA FT LLLLQNEIEEYDLCESSKEVVIHVAGYITKKIKSKFKCSLCQDLLQTNKIS FT SPYMEILSRGGLTEPSENMVEYVCSLFCILDITKELLLTKYSNKLKNSALA FT ALELFKNNICFMCSDHLETGRQYIHTTVINIFYNNEQSIQNGKIRKDEVVA FT FKNRHNKKQT" XX SQ Sequence 3134 BP; 1141 A; 421 C; 463 G; 1108 T; 1 other; catggcctac ttaaatacac ggccggactt gttgcttaca agtccgataa ttagtaggcc 60 agtttttagt caagttcttt atttgcttta agtggttctc aaaagaatta caactattgt 120 tattagaata aaaacaaact ttcattttat ctttttacat tatttatttg tcttataaaa 180 tgcctaaaaa gtgctgtgta gttgcttgca gaagcggtta caaaaagaaa aaaaatgacg 240 atactgtaga gggttttatt actaaaacag tatttaattt tccctcagac ataaatttat 300 gctcccggtg gatcaagttt ataaacagaa ttaactggaa accatctaaa ttctcaggaa 360 tatgtattga tcatttcgaa gaaaaattta ttaaacgagg caaaagaaat acactagact 420 ttacaaagaa tcctattcca acaattcata caaataaaga ttgtttttta aagccatctt 480 tacttcctac acctataact ttaagaaaga ctcctacaaa aagaccttat tctcattcat 540 tgctagatga aacagtggtt ttcaacaaac tagataagat ttgttcaatc aatgacttga 600 cagaacaagt ttgtcctcaa aattttttat ttaagaaaat atatggtggc gttatttatt 660 atcgtgttat cattgaaaat ataacatcat caactaacat tgaaagcatt aatgttgata 720 acaatctgca tgtaaaattg taccataagg gaaatttgat acctcttccc cagtggtttc 780 gtcaaaacaa taagtgtaca ctaacaagtg tcagcatgtt agaaaattta gtttcttata 840 tgagaaatcg ggttgaagaa aaagaaaaaa atattttaag cgaactttat agttttatgt 900 attatgaatc acatggcaga gagttatata gtaatgaact tattcgattt gctttggttc 960 tacgatatac ctctagacaa gcctataact tgctattaga tgaatttcct tttccttcat 1020 tttcatattt aaaagcatta actcaaggtg gaatcgaacc aattaaagca ttaaaatatt 1080 tattaaaaga agaaaaaata agctcagatt cagttttgtt aattgatgaa atgtacttgc 1140 aaaaaagtgt tcaatatcat ggtggtaaat ttgttggaaa agatgagaat ggagaacttt 1200 attcaggagt tgtagttttt atgttagttg gtattaaacg ttctatacct tttgttgtta 1260 aagcatgtcc tgaaacaagt atttctggaa attggttgat gactgaaatt gaagagtgtc 1320 taacaaatct ttttgatgct aattttaaaa tttgtgctat catcacagac aaccattcaa 1380 ctaatacact tgcatttaaa aaactcataa aaaagtataa ttctactgaa aacttttttt 1440 tttattatgg acaaaataaa atttatacat tatttgatag tgtacatatt ttaaaaaaca 1500 tcagaaataa tttattacat tcaaagaggt ttatttttcc aaaattcaat tttgataaat 1560 ttgaggattc aattaatgtt gaagctggag atattcgttg gcagttattt caccaagtta 1620 atgaaaaaga taacttactg cctgctaatt taaagaaagc ttacaaactt actccaactg 1680 gtttacatcc aggaaaaaat aaacaaagtg ttaaactggc tttgcaaatt ttccaygagt 1740 caacatattc tgccattaaa agttattttc cgaaagaatt aagtgctgct aattttctat 1800 atttgataaa tacttggtgg acaatttcaa attcaaagaa tatatataat actaataata 1860 aaataggaaa tgctgttgtt gcaaatgatc aaaaacctga atttttaaat gcttttgctc 1920 aatatgtaag tgaatggtat aatacaaatt tacagggatg tcacaaattg tcactttcag 1980 ctcagacatc aaatgcacta atagtcacat taaaaggaac tgcatcttta atcaaagatt 2040 tactttcaga aggatatgat tacattttaa ccagtagatt tcaaactgat cctcttgaaa 2100 aacattttgg gaaacttagg cagatgagtg gcggaaggtt tctggttagt ttgcgtgaag 2160 ttgaaagttc tgaaaaaata atcagcataa aatctctaat taaagaaact ctggcattga 2220 atacatggtt taataatgaa gtaccagtca ataatgatgc aaaagaactt gaaacagctt 2280 tacttctatt acaaaatgaa atagaagaat atgatttatg tgaaagttca aaggaagttg 2340 ttattcatgt tgctgggtat ataacaaaaa aaattaaatc caaatttaaa tgttctttgt 2400 gtcaagatct tcttcaaact aacaaaatat cttcacctta tatggaaata ctatcaaggg 2460 gagggttaac tgaacctagt gaaaacatgg tagaatatgt ttgctcctta ttttgcatat 2520 tagacatcac taaagaattg ttattaacta aatattccaa caaattgaaa aattcagctc 2580 tagctgcact tgaactgttc aaaaataaca tttgttttat gtgttctgac catttagaaa 2640 caggaagaca gtatattcac acaactgtta ttaatatatt ttataataat gaacaatcaa 2700 ttcaaaatgg aaagattcga aaagatgaag tagtagcctt taaaaatcga cacaacaaaa 2760 aacaaactta atttattatc cttatttttg ctatagatat acttttgttt ggactctgta 2820 tatttaattt ttgttttgat ttgcttatct atagcttcat cgtcatcttt ttctaaagct 2880 aatttatggt ggctaagttt ggcttttgta tttttttgtt aaaactattt tattttttga 2940 taagttatgt ttttctatag ctgtattttt ttctcaagtt atatattttg ttgaaatata 3000 tttgatattt gaaatatatt tttagatatt agtttaataa ataataaatt ataagtttaa 3060 taaatcgtta aaactggcct actatttttc ggacttgtag caacaagtcc ggccgtgtat 3120 ttaagtaggc catg 3134 // ID CR1-63_HM repbase; DNA; INV; 4408 BP. XX AC . XX DT 23-DEC-2008 (Rel. 13.12, Created) DT 23-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-63_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4408 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1890-1890 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 72..905 FT /product="CR1-63_HM_1p" FT /translation="MDFTMASIELMVNVLLDKQKNTIIEETQKLLKEQEKS FT FTAITTANIKIITGRFDKLESDILNNSAKIATITNALETSNQKILLLETEL FT NEIKKFIHVNDEIVASKFEIVKEKTKKVSSQVKDKSTVNKINNKLREIEDR FT SRRNNLRINGIKESEHETWEESELKVLKLFEETLEIKDVKIERAHRTGPRG FT ENKNRTIILKLLNYKDKTDIMKRSVKLKGMNIYINEDFCFETVQIRKDLKN FT EMFKQRELGKYAFISYDKLIVREWSEKKIDAKDCQV*" FT CDS join(1056..1682,1642..2325,2291..4138) FT /product="CR1-63_HM_2p" FT /translation="MNSKLLNNFELNSFNFFQTNKFVINKDTDPGSIYFSE FT AGALLNNCSYFYNNELKEFLERDHLNVIHFNIRSLKKNFESFRNNIEEALN FT IFNIICVTETWCSSDEAKSNSNLHLPGFYLVPLARKTKKRGGGVLFYVSEE FT MRFINRPDMSISDADKEVLTIEILAKKNKNIIISCCYRPPSGEINSFKSFL FT ITDIIKKNYNRKEIKLHNWKKTTIEKKLNYIIGDINLNAFNYHVNYKINGF FT YNDLFENGAIPLINKPTRVTSNSASLIDNIMTTDTFNESIKKGLIKSDVSD FT HFPIFFSINIDIELVPITNQVFFKRCFSEANLMSFNEQLSLLHWKHINFSS FT EANLVYNSFFKTFFEVYDANFPQYEVSLTKKSIKSPWITKELRKSSKVKQK FT LYIKYLKSKTDKNKEIYKTYVKHFECQRKNLKKKILLKLTREVKKKYYSNL FT LEKFKNNSKRTWQILNEITGNYKSKTSNLPKVIKNNDVFLNNPKDIANQIN FT NYFITVGPNLAKNIPIITNSTNYFTLPIISVLNNFEISMKEFECAYKMLKT FT NKAKGPDGINSNILIASYENIKTILFQVFSCLIKQGIFPDQLKIAKVSPIF FT KGGELSNITNYRPISVLSGFSKVFERILYNKIYDHLSKNKILNTSQYGFKN FT NNSTEHAILHLTRNIADSFEKSQFTLGVFIDLSKAFDTIDHEILIQKLKNY FT GISGNFLNLLKSYLSNRKQFVQIDVSSSTNLLDITCGVPQGSILGPLLFLV FT YVNDLHNASNLTTVMFADDTNLFLSSSDIVTLFNDINIELVKISNWFKTNK FT LSINLEKTQWTLFHPKSKKKLLPNDMPQIYIDNVQIKQSKVIHFLGVFIDE FT NLTWKNHIETLCNKVSKNIGVLYKARNFVNRHALIQLYYSLIQCHVNYANI FT AWGSANNSQLEPLYRQQKHLARLINFKDRYTHAKPLLIDMNILNIYQLNVF FT NVLCFMYKCKINLTPIFFQNLYAIKPRNKYELRNNNLIHQHYSHTNFGRSL FT ISYRGAFLWNQIVLKNFDFSKNCSFLTFKNKLKKVILTIDNIFEFF*" XX SQ Sequence 4408 BP; 1755 A; 605 C; 618 G; 1430 T; 0 other; atacggaagc gatcggacgt tttattttct tgaaaaagat tatctttaaa tttacataac 60 ttaaattaaa aatggatttt actatggcga gtatagaact aatggtaaat gtactattgg 120 ataaacaaaa aaacacaata atagaagaaa ctcaaaagtt gctaaaagaa caagaaaaaa 180 gttttactgc aattacgact gcgaatataa aaataataac cggaagattt gataaacttg 240 aatcagacat tttaaacaac tcggctaaaa ttgctacgat tactaatgcg ttagaaacgt 300 ctaatcaaaa gatattgctt ttagaaactg aattaaatga aattaaaaaa tttatccatg 360 ttaatgatga aatagttgca tcgaaatttg aaatagttaa agagaaaaca aagaaagtta 420 gcagccaagt aaaagataag agcactgtga acaaaataaa taacaagtta agggagatag 480 aggatagatc taggagaaac aatttaagaa taaacggaat aaaagaaagc gagcatgaaa 540 catgggaaga aagtgagtta aaagtgctta aattatttga agagacgctt gaaatcaaag 600 atgtgaaaat tgaacgggcg cacagaaccg gaccaagagg tgaaaataaa aatagaacaa 660 taatattaaa actcttgaac tataaagata aaacagatat tatgaagaga tcagtgaaat 720 taaaaggtat gaatatttac ataaatgagg acttttgttt tgaaactgtt cagataagaa 780 aagacttgaa gaatgaaatg tttaagcagc gagaactggg aaaatatgcg tttatctctt 840 acgacaagct aatcgtgcgc gaatggtctg aaaagaaaat tgacgcaaaa gattgccaag 900 tataaattta ttttgatttt ttgattttta atatttgacg agaattttct ttcattgtat 960 aaatttattt gcataaaatg aatgtctatt atacattcat ttcatgctta aattcacaat 1020 gaaattcttt aagctctata tcttaaaata caaaaatgaa ctcaaaacta ctaaacaatt 1080 ttgagttgaa ttcttttaat ttttttcaaa caaataaatt tgttataaac aaagatactg 1140 atcctggttc aatttatttt agtgaagcag gtgctttgtt aaacaattgc tcctattttt 1200 ataacaatga gctaaaggaa tttcttgaac gagatcactt aaatgtaata cactttaaca 1260 taagaagctt aaaaaagaat tttgaatctt ttaggaataa tattgaggaa gctttaaaca 1320 tttttaacat tatatgcgta acggaaactt ggtgtagctc cgatgaggca aaatctaact 1380 caaacctcca tcttccgggt ttttatcttg tacctttagc acgcaagact aaaaaacgag 1440 gaggtggagt acttttttat gtaagcgaag aaatgcggtt tataaatagg cctgacatga 1500 gtatttctga tgccgataaa gaggttttaa caattgaaat cttagccaaa aaaaacaaaa 1560 atataattat aagttgctgt tatcgcccac catctggtga aataaatagt tttaagtcat 1620 ttttaattac cgatataata aaaaaaaact acaatagaaa agaaattaaa ctacataatt 1680 ggtgacataa atttaaacgc attcaattat catgtaaatt acaaaataaa tgggttttat 1740 aacgatttat ttgaaaacgg cgcaattccg ttaatcaata aaccaacaag ggtaacttca 1800 aattctgctt ccttaataga caacattatg acaacggata cttttaatga atccatcaaa 1860 aagggtttaa ttaaaagtga tgtttctgat cattttccga tttttttctc aataaatatt 1920 gacattgagt tagtaccgat aacaaatcaa gttttcttta aacgatgttt cagtgaagca 1980 aatttaatgt cgtttaatga acaactatca ttactacact ggaaacatat aaatttctca 2040 tctgaagcaa atctagttta taactcgttc tttaaaacat tttttgaagt ttatgacgca 2100 aactttcctc aatacgaagt aagtcttaca aaaaaaagta ttaagtcacc atggattaca 2160 aaagaactta gaaaatcgtc caaagttaaa caaaaattat atataaagta tttaaagtcc 2220 aaaactgata aaaataagga gatttataaa acttacgtga agcattttga atgtcaaaga 2280 aaaaacttaa aaaaaaaaat actactcaaa cttactagag aagtttaaaa ataactcaaa 2340 gcgcacgtgg caaattttaa atgaaattac tggcaactat aagagtaaaa caagcaactt 2400 gccaaaagtt attaaaaaca atgatgtttt tttaaataat ccaaaagata tagcaaatca 2460 gataaataac tattttataa ctgttggtcc aaacttagct aaaaatattc caattataac 2520 taactcaaca aattatttta cccttcccat aatttccgta ctaaataatt ttgaaatttc 2580 catgaaagaa tttgaatgtg cttacaaaat gctaaaaacc aataaagcaa aaggcccgga 2640 tggaataaat agtaacattt taattgcatc ttatgagaac ataaaaacta ttcttttcca 2700 agtgtttagt tgtttaataa aacaggggat atttcctgac caactaaaga tcgccaaagt 2760 ttcaccaatt tttaaaggag gggaactatc aaatataact aattatcgtc caatttctgt 2820 actctctggt ttttcaaaag tttttgaaag gattctttac aataaaatat atgatcatct 2880 ttctaaaaat aaaatattaa ataccagtca atatggattt aaaaataata attccactga 2940 acacgcaata cttcatttaa ctagaaatat tgcagattcg tttgaaaaat cacaatttac 3000 tcttggcgtt ttcattgatc tatcaaaagc ttttgatacg atagatcatg agattcttat 3060 tcaaaaactt aaaaactatg gaatcagcgg aaattttcta aatttgctaa aaagctattt 3120 aagcaatcgt aaacagtttg tgcaaattga tgtatcctcc tctacaaatt tgctagatat 3180 aacatgtggc gttccacagg ggtctatact ggggccactc ctttttctcg tctatgttaa 3240 tgatctgcat aatgcctcaa atttaacgac agtgatgttt gccgatgaca ctaacctttt 3300 tctgtccagt agtgatattg ttacactttt taatgacata aacatagagt tagttaaaat 3360 ttcaaattgg tttaagacaa ataaattatc aattaactta gaaaaaactc aatggactct 3420 atttcatcct aaatccaaaa aaaaactttt accaaatgat atgcctcaaa tatatattga 3480 caatgttcaa ataaagcaat caaaagttat acatttcctt ggtgttttca ttgatgaaaa 3540 cttaacatgg aaaaatcata tcgaaacatt atgcaataaa gtctcaaaaa atattggagt 3600 gttgtataaa gcgagaaatt ttgtaaatag acatgcatta attcaacttt attactcgct 3660 aatccaatgt catgtaaact atgctaacat tgcgtggggt agtgctaata atagtcaatt 3720 agaacctctt tatcggcaac agaagcatct agcacgtcta ataaatttta aagatcgtta 3780 tactcatgca aaacctcttt taatagacat gaatattctt aatatatatc aactaaatgt 3840 ttttaatgtt ctttgcttta tgtacaaatg taagataaac ttaaccccaa tattttttca 3900 aaatttatat gcaataaagc caagaaacaa atatgaatta agaaataaca atcttattca 3960 tcaacattat tcgcacacaa attttggaag atctcttata tcatatcgcg gagcttttct 4020 gtggaaccaa atagtgttaa aaaactttga tttttctaaa aattgtagtt ttcttacttt 4080 caaaaataaa ctgaaaaaag ttatcttgac aattgataac atcttcgagt ttttttgaat 4140 attttattta aatatttagt tttattacat attttgacat tttttaacgg tattttatgt 4200 ttggttttgg cttattaaaa atcttgtatt gtaaatatcg gtattttgtt tttatttatt 4260 ttgtatcgca ttgttaaacg gttctcggtg acaggatctt gccatcctct tcgagtttcc 4320 gtgttctttt tattttatgt aaaacgatat tataccaaat aaaattgtaa taagaacaaa 4380 aaaaaaaaaa aaaaaaaaaa aaaaaaaa 4408 // ID DNA8-60_AP repbase; DNA; INV; 911 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-60_AP. XX NM DNA8-60_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-911 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1994-1994 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 911 BP; 327 A; 167 C; 129 G; 288 T; 0 other; cagtggcgca acttgataat aaggggccca tgtgaaaatt tctagaccgg ggcccttacc 60 tgctagcaag aattaaacac agtacagcac ttggtataaa tcgaacgtgg ttcaaagtac 120 tctttaaact ttcctaatat ctccaagaac aatttgaaat tttcacaaaa tcggtcaagt 180 agtttttgag aacctatata atagccacag cttacaaata tacatttaac tctcatgttt 240 gttttaagat ataatttctt tattcgaaac cataagcgtg cgacgattta atggatcctt 300 ttacctcatc taccagagta actaatggca aacatttaaa atactatttt ttactaatta 360 tttaaaaaaa aaaaataaac aaaaatgtct atcgtaaatc tcaaaatacc tctccgggtt 420 gaaccaactg ggtctaatag tgaatctaaa ccattcagag actctctcaa aaacacacaa 480 aaaatgtcca gtggtttagg aggagatcaa tgacatacag acaaacattt atttgtacta 540 tatatataga catagattgt tattattttc ggccacccgg cacccaattc attttaccaa 600 acaaaattat tatacattaa aaccttcccc gtgactcgac cgatttacta gtgaaaaccg 660 tattaaaata agtagagttc ttttcgagat atgctcgtac atacaaaaaa ggggcttcta 720 ttttataata tatatatata tatatagatt aatagattat attatattaa ttgtattaat 780 atttaattat tattatatta ttttgtatat tacacattat agtaggaata tcgtatatga 840 gatctcgggg ccccaaaaat cgggggcccc tgtgaattgc acacccaaca ctcccgataa 900 ttgcgccact g 911 // ID Sola1-2_AP repbase; DNA; INV; 4812 BP. XX AC AC202215.4; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-2_AP. XX NM Sola1-2_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4812 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 1581..3674 FT /product="Sola1-2_AP_1p" FT /translation="MDVEIITDVQPLKYIEEKIVNNTIMIEISNETNTSYM FT NTELFQNNNNITTKDAETDFRPLNLYTERILTNQILIEPNTSTVDANLILE FT NTDFLPLMESVSNVSINNDPEYIDPITETNGGRKRKIQNKRKEKQLQRNII FT PRELELKCNHTEMIKKNVCRVSELSEEDLTDFKHNLCKLTTKIEQDKFLIT FT MMTVSDVKRTDRKKTNRSHRMVVKYFIPQIRGNLIPVCLDAFSSVTSITRR FT RLNIISKTFKQNHSSPTEKRGGIRVDLIQDEITDSIKEHIKAFKCRKSHHT FT RKDTGRSYLPPTYSIKYMWENWYEKRKLEKKQTASLSKYQKIFTTKFNISF FT GHPRQDTCSYCMEQKIKIQTEGDIDKKSELMLELTLHKRKAKRFFELLKTE FT TPGEISCSFDMMQTQPLPKLSVTDVFYSRQVWLYNLTFVISDSNQKPENCI FT LYSWNETESGRGPNEVCSALIHFLESLEKRLKTSESPPTILNLFSDSCSGQ FT NKNQFTMITLLYYINYKATVFTQINHIFPVRGHSYMPPDRVFGRIEQVLRK FT KETILSPNQYFEIFKNFCTVNAYGKDFKIYDIKSAVKTVVKTKIDFKSTEQ FT KVYSYVKGKNSSIRVSKTYAGAYEEFEVIKRHSDISKLFNNIKILPKTNYV FT KLPKQNDVKKLLKFFTIPEDAQEFYEDIFNNNDDPSDEDIHYYEEDNE*" XX SQ Sequence 4812 BP; 1910 A; 552 C; 623 G; 1727 T; 0 other; gaggggcgac aatgcaaaac gtgctattgc caaaaatata aaaattgagt tatacatcat 60 ttctgatagt aaaaaccacg cagaatcgaa tggcgtgttt agatttggcc aaaacgtact 120 atttccagtg catatttgat cataaagttt catgatttct gcataacgca ccatttccat 180 cgctaaaatc gtttcaaaaa gaactatttc caaattacat caaaacgtac tatttcccat 240 gctatttagt gtaaatttaa taattattaa agcaaaacgt actatttttt aaattatgtg 300 cagaacgtac tatttccaga ttgaaacaaa acgttatatt tcccatgtta aaatgtagct 360 atactaaatt aattatttgc aaaacgtact atttccaaga gtaaaaggtg aagttgacat 420 ttcaatttga attaattatt ctttaattat ttcatggtga tataatatgg gactacatta 480 ttttatttta tttactagtt ggtgcaactg aaaaaattca gtataagtct tggtgttgaa 540 tgttatcaca aataatttcc ctccaaaagt ttgtatgatg actatcatag tattatgata 600 atacaattca tctactgtaa gttctctcat tctctgtact ggttcattgt tatttgtttt 660 aattacctac taataattgc atgtacattt tacgagacat tttattaatg ttttattagt 720 aggtatttaa aatgttcaac tctagaacca ttaaaatggc cgcatgtggt ttaaacaaaa 780 ataaaaatga tactcaaact aaagaagaaa tggatattaa tgaatccatt aaaaaagttt 840 caaggtggtt agaggtatga aaaaatgaat ttatttgttt acaactttat ttatgtaaga 900 cttttaatta ttatattgtt aaatcattga acagtaggta aatactgtat acttatgtat 960 tactgtgata cctacctaat aatttgtagg aatacctatt ttaaaaaata aaataatttt 1020 ataatatttt aattattcta ttaaattgat ttcatagtag atgaattatg taatagaata 1080 ttatgcagaa taaattgaaa aatagttaaa ttcataataa atgtatgctg ggtattactt 1140 taaatactat ttaataataa aaatacctag ttcttttgat tttgtatcat agctgttatg 1200 ttttatttta tttatattat tatttactat aacttactat ggttggtcca atttatcaaa 1260 taattaagat tatgctataa aatttaactt aaatctattg aattctatat atttttatta 1320 ttagggtaga caatttaaat tatataatat attgttattg gtaaacctaa atatctaata 1380 gattatattt acaattaaaa tctttaagta ttataataat aattctaata caatattatt 1440 tgtattaatt ttttttttta cacattaaat attaagttaa tatattatat ttaaagaaat 1500 atatttaata ttataataat aactaaattt aggaaactgc tgatttgaac tattctgtac 1560 caaacctaaa tattgaagac atggatgttg aaataattac agatgtacag cctctaaaat 1620 atatcgaaga aaaaatagta aataatacaa taatgattga gatctctaat gaaacaaata 1680 catcatacat gaatacagaa ctgtttcaaa ataataataa cattaccact aaagatgctg 1740 agactgattt tagaccatta aatttatata cagaaagaat attaactaat caaatattaa 1800 ttgaaccaaa tacatcaaca gttgatgcta atttgatttt agaaaatact gattttcttc 1860 ctttgatgga atcagtatca aatgtatcaa taaataatga tccagaatat attgatccta 1920 taactgaaac taatggaggt agaaagagaa aaattcaaaa taaaagaaaa gaaaaacaat 1980 tacaaagaaa tattatacca agagaattgg aattaaaatg taatcatact gaaatgataa 2040 agaaaaatgt atgtagagtt agtgaactat cagaagaaga cttgactgat tttaaacata 2100 atttgtgtaa attaactaca aaaattgaac aggataaatt tctaattaca atgatgactg 2160 taagtgatgt caaacgaact gaccgtaaaa aaactaatag atctcatcgt atggttgtga 2220 aatactttat accacaaata agaggtaatt taataccagt ttgtcttgat gcattttctt 2280 ctgttacatc aataactaga agacgactaa atataataag taaaacattc aaacaaaatc 2340 attctagtcc tactgaaaaa agaggtggta ttcgtgttga tttaattcaa gatgaaataa 2400 cagattcaat aaaagaacat attaaagcat tcaaatgcag aaaaagtcat catacacgta 2460 aggatactgg ccgaagctat ttaccaccta catattctat taagtatatg tgggaaaatt 2520 ggtatgagaa aagaaaatta gaaaaaaaac aaactgcttc tttgagcaaa tatcagaaaa 2580 tatttactac aaagtttaac ataagttttg gacatccacg ccaagacacc tgcagttatt 2640 gtatggaaca aaaaattaaa attcaaactg aaggagatat tgacaaaaaa agtgagttaa 2700 tgttagaatt aaccttacat aagagaaaag ctaagcgttt ttttgaacta ttaaaaacag 2760 aaactcctgg ggaaatatct tgctcatttg atatgatgca aacccaacct ttaccaaagt 2820 tgtctgtgac tgatgtattt tacagtcgcc aagtctggct ttataatttg acatttgtta 2880 tatctgattc aaatcaaaaa ccagaaaact gcattttgta ctcatggaat gaaactgaaa 2940 gtggacgtgg cccaaatgaa gtttgttcag cactgataca ttttttggaa tcattggaaa 3000 aacgattaaa aacaagtgaa tcaccaccta ctatattaaa tttattttca gattcttgct 3060 caggccaaaa taaaaaccag tttacaatga ttacattgtt gtattacata aattacaaag 3120 caactgtatt tactcaaatc aatcatatat ttcctgtaag gggacacagt tatatgccac 3180 ccgaccgagt ctttggtcga atagaacagg ttttaagaaa aaaggaaact attctgtcac 3240 caaatcagta ttttgaaatt tttaagaact tttgcactgt taatgcatat ggtaaagatt 3300 ttaaaatata tgacattaaa agtgcagtca aaacagttgt taagactaaa atagatttta 3360 aatctactga gcaaaaggtg tattcttatg taaaaggcaa aaattcaagt attagggttt 3420 ccaaaaccta tgctggagct tatgaagaat ttgaagttat taaaagacat tctgatataa 3480 gtaaattatt taataatata aaaatcttac caaaaacaaa ttatgtcaaa ctcccaaagc 3540 aaaatgatgt caaaaaactt ttaaaatttt tcacaatacc agaggatgct caagaatttt 3600 atgaagacat ttttaataat aatgatgatc cttcagatga agatattcat tattatgaag 3660 aggataatga ataaattcaa aataatgtgt cgattctccc atactatgtt attttattca 3720 aattcatttg gtgttattat taaagagaat atactatgct taataataag ttgtttttat 3780 tttattaatt tattatttgt ttttatttgt ataatgtact atgaattttt ttttaatatg 3840 tataaaataa gttaaattta ttgttatttg ttttaatttc atatatttaa taacaaattt 3900 aattttacct atagtaaaac catatactat gttattttat tcaaactcat tagatgttat 3960 tatttaaggg aaaatactat gcttaataat aagttgtttt tattttatta aattattatt 4020 tgtttttcta atatacctat ataaaataag ctttttttga ttatattttg tgataaaata 4080 atactagaaa aagactagtt tttaagaaac ataaagatct ataagtagag caaagactgc 4140 aaactaaaaa attattatga ataaaaatgt tattttgttt taaaaaaagg tgttattgtt 4200 aataaaaaag taaataaact atgttaaaat taaagtatct aatacctatg tgttattttt 4260 atttttattt gtatgattat ggactttgtt tttaatgatt ataaaataag gttttattat 4320 ttataaatat aatacaaaac attgtagcaa tttaccagta aaaaaaaaag tatggagtat 4380 atttggtttt ttcatgaaat agttgattat gaaaaataaa attctaggtc gttttgttaa 4440 aaaaaatagc taataaaaaa tatatctaac tacctttaat ctttaaatta tatcaaaatg 4500 tactatttaa tggaaatagt acgttatgat gaaatatggc caaaacgtac tatttccatt 4560 gaattttgac aaaacgaact aagtggtgga aatagtacat tttggcacga ttcactttat 4620 gaaattacgt ttaaatttac caaaacttaa atttaaatat aaaaaaatat cataagtaag 4680 ttcagttttt ttttatgatt tttttgaata cttaaggttt ttactatgat tttttgataa 4740 gctactatag aaaatcaatg tttctggaaa aaacaaaaaa tggaaatagt acgttttgca 4800 ttgtcgcccc tc 4812 // ID hAT-13_SM repbase; DNA; INV; 1437 BP. XX AC . XX DT 23-JAN-2008 (Rel. 13.01, Created) DT 23-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-13_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1437 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 8(1), 14-14 (2008). XX DR [1] (Consensus) XX CC Present in ~1000 copies in the genome. The youngest elements a CC >8% divergent from consensus. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS join(238..855,859..1281) FT /product="hAT-13_SM_1p" FT /translation="MCNEKNIPQPMFLHCIIHQQALCAKYMDIGGVLNPVV FT KMVNLIRSHGLNHRQFRDMLKDIDTELQDLPYYTAVRWLSCEKVLSRVFKL FT RKKISDFLESKGKLQPLLSHEEWVWKFAFTADITSHLTFLNLKLQEEKNLI FT SDLYTHLKAFRSKLVLFLEQIQANNLTHFQQCKVFMAETTTEFPTSFACEI FT IKDLQIQFQERFSDLDQTEEVRLFQDSFEADATSCPNELQLEVIELQANDL FT LRDKFKMGLVGFYQFLPKEDFPNVKTFASRYLSIFGTTYLCKQTFSRMKYV FT KNHLRTNLSDDSLRSLLMLGTSNLNPEISAYFGIQETISPFALIQVFMCCV FT SLYY" XX SQ Sequence 1437 BP; 460 A; 274 C; 282 G; 421 T; 0 other; caggggtctc caaactacgg cccgcgaggc ttcttatccg gcccgcggaa agctttgtat 60 ctacgaaaaa aattatggaa atttacctat atcctgccaa gattagccgc cgcttacgtg 120 gacgttatat tttccaaccg atcactacat tgcgaaacct tattccctac ttcctttttc 180 tccttgcgtt tctccttgag tggaataaag aaagagttgg ttggacgagt gaagcagatg 240 tgtaatgaaa aaaacattcc acaaccgatg ttcctgcact gtattattca ccagcaagct 300 ttgtgtgcta aatatatgga tattggcggt gtattgaatc ctgtggtgaa gatggtgaat 360 ttaataagat cacatgggct caaccataga cagtttagag acatgttgaa agatatagac 420 acagaattgc aagatttacc atattataca gcagtaagat ggctaagttg tgagaaagtt 480 ctgagcagag tatttaagtt aagaaagaaa ataagtgact tcctggaaag taaaggcaag 540 cttcaaccat tactgtctca tgaagagtgg gtatggaaat ttgcctttac tgcagacata 600 actagtcatt taactttttt gaacctaaaa ttacaagaag agaaaaatct aatttctgat 660 ctatacaccc acttaaaggc cttcagatcc aaacttgtct tgtttttgga acaaatacaa 720 gccaacaatt tgacgcactt tcagcagtgc aaagtcttca tggctgaaac aacaactgag 780 ttccccacat catttgcatg tgagatcatc aaagacctcc agatacagtt tcaggagcga 840 ttttccgact tggattgaca gacagaagaa gtgagactgt ttcaagactc atttgaagca 900 gatgcgacca gttgtccaaa cgaattgcag ctggaagtca ttgagttgca agcaaatgac 960 ctccttaggg acaaattcaa aatgggactg gtgggatttt accagtttct tcccaaggaa 1020 gactttccta atgtcaaaac ttttgcatca aggtaccttt caatctttgg aacaacatac 1080 ctgtgtaaac aaacattttc aagaatgaaa tacgtgaaga accatttgag aacgaacttg 1140 tccgatgata gtctaaggtc actgttgatg ttagggacat caaatctaaa cccagaaatt 1200 tctgcttatt ttggcatcca ggaaacaatt tcaccattcg cactaataca ggtgttcatg 1260 tgttgtgtgt cattgtatta ttaaataata gctgaagaaa ataatattaa ataaatagtt 1320 aaaaacaata tccatctttt gattcttttt tatttggact tcgagtatgt agtcggcccc 1380 tgaactgctg tttgatagtt aatccggccc tcagaatgaa aagtttggag acccctg 1437 // ID AeTango2 repbase; DNA; INV; 1662 BP. XX AC . XX DT 12-OCT-2010 (Rel. 15.1, Created) DT 12-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Mariner/Tc1 DNA transposon family from Aedes aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1_Ele2; KW AeTango2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1662 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1662 RA Kojima K.K. and Jurka J.; RT "Mariner/Tc1-type DNA transposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (27-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. ~90% identical to consensus. This consensus CC is ~99% identical to the original sequence in [1]. TA TSDs. CC 225-bp TIRs. XX FH Key Location/Qualifiers FT CDS 381..1388 FT /product="AeTango2_1p" FT /translation="MGGGKISMQIRNLILRDAKRNLSQRKIAEKFGVSRGA FT VQKIILKHQKFGSVADRTGRGSKRKSSARTDDLIIRQIKKDPRTTVRAIKE FT RLDLTISERTIRRRMAERELKSYFAKKRPMISKVNKHKRLLFARNHINKPL FT EFWKHVVWSDESKFELFNKKRRLRVWRKSDEGLQDRHLQPTMKHGGGNVMV FT WGCFSWFGVGNLAQINGIMTAEGYIDILCENLEESMLKMGLENNNTFQQDN FT DPKHTAKKTRAFFRSTRIKSMEWPPQSPDLNPIENLWAILDNKVDKTGVTN FT KQAYFAALQKAWDDLDQQHLRNLVESMPKRLQLVIEAKGGHIEY" XX SQ Sequence 1662 BP; 550 A; 315 C; 354 G; 443 T; 0 other; cagtgaccgg cacaaaaaaa gatccacccc aagcatatat cgaaatttgc acaaacttgt 60 gatttttttt aaacttatca taatttcaat taatgtaaaa gtcacatata cctccaaact 120 tcatttcaga ataataacat gattgatttt gttttcgaaa aataaaaaat aataagttta 180 ctctttactt agtttttttt agtatgacag aaaaaaagat ccactaccgt aaaattaatt 240 tttcgtgtaa tttacgttgg tgagcccccg gttttgacac ttcattgttt ttgtttgtcg 300 acaacataaa ttgtagagct cttaagtcaa ctgcaagtgt tcaaaagaat tggttttgaa 360 gctgtggaag ctgtggaaaa atgggcggag gcaaaatttc aatgcagatc cgaaacctta 420 ttttgcgtga tgctaagcgc aacctatcac aacgaaaaat agcggagaag tttggagtga 480 gccgtggcgc agtgcaaaag ataatactga agcaccaaaa gttcggaagt gttgctgatc 540 gaacgggaag aggatcaaaa cgcaagagca gtgcgcgaac cgacgacctg atcatccggc 600 aaatcaagaa agatccacgc acaaccgtgc gtgcgattaa ggaaaggcta gatttgacga 660 tttccgaacg tactattcga cgacgcatgg cagaacggga attgaagagc tatttcgcta 720 aaaagcgacc aatgatcagc aaggtcaaca aacataaacg actcctcttc gccagaaacc 780 acatcaacaa gcccctggag ttctggaagc atgtcgtctg gtcagacgag tcaaagtttg 840 agctcttcaa caagaaacgg cgactaagag tttggcggaa gagcgatgag gggctccagg 900 atagacatct ccaaccaaca atgaagcatg gcggtggaaa cgttatggtt tggggctgtt 960 tctcgtggtt tggcgtggga aacttggccc aaatcaacgg cattatgacg gcagaagggt 1020 acatcgacat attgtgtgag aatctggagg aatcgatgct taaaatggga ctggagaata 1080 ataacacctt ccagcaagac aacgacccca aacacacggc gaaaaaaaca cgagcattct 1140 tccggtcaac ccgcatcaaa tccatggaat ggccacccca gagccccgat ttgaatccca 1200 ttgaaaatct ctgggcaata cttgacaaca aggtggataa aactggtgtt acaaataaac 1260 aggcgtattt tgcagccctg caaaaggctt gggacgattt agaccaacag cacctccgaa 1320 acctcgtcga aagcatgcca aagcgtcttc agttggtgat tgaggccaaa ggaggtcata 1380 ttgagtacta atgatttgtt ttgatttttt ttgttattgt tttgtgaaaa aacatgaagt 1440 ggatcttttt ttctgtcata ctcaaaaacc taagtataaa gcgatatgat tttttaaatt 1500 ttctttttaa taaaaatcaa tgatgtttct atgacaaaat gaagtttgga agtttacttt 1560 tctgttacat aaactgaaat aaaaataagt ttcaaaaaaa tcacaagttt gaagcaattt 1620 tcttataaac tgagagtgga tctttttttg tgccggtcac tg 1662 // ID Chapaev-N7_AAe repbase; DNA; INV; 742 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 23-DEC-2010 (Rel. 16.03, Last updated, Version -1) XX DE A non-autonomous Chapaev-type DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; nonautonomous; KW Chapaev-N7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-742 RA Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 837-837 (2011). XX DR [2] (Consensus) XX CC >95% identical to consensus. 4-bp TSDs (TTAA). XX SQ Sequence 742 BP; 240 A; 135 C; 131 G; 236 T; 0 other; cccctctacc ggcagcttca ttttttaccg ctaaaaaaat attcaaatcg cgataacttt 60 tttgtttctc gatatttttg caccattttt tcacaagctc tcaaaaaact cttctagttt 120 tagaatatgt gtcgatattg ataattggtc atctggatcc ggagatattc caaaattcct 180 tgggggaccg acgcgtagcc ataacccacg taaatatctc aggctacaga atttttatcg 240 tattcggata ttcactcctc ggaatgatac attaatgcga gtatgttgtg aaaaaatgaa 300 gcaatttggt gcagccgtct ttgagtaatg agcatttatg tttctggtac cacgctggac 360 aaataaagat cttgaaaact ctaaaaaacc tcatatcgta attttcagat ttctccaaga 420 atactgaacc gatttgtatg attttttcag agtagctcct tattacctgg cattgtatag 480 tacacatttt atttttttga taaattgacc aaaaacaaaa tggccgccaa agacatttta 540 tatggagaat gtcggtcccc caaggaacat cggaatatct tcaaaaccag atcaccaatg 600 attaatatcg acacggattc ttaaactaga agagtttttt gagatcttgt gaaaaaatgg 660 tgcaaaaata tcgagaaaca aaaaagttat cgcgatttga atattttttt agcggtaaaa 720 aatgaagctg ccggtagagg gg 742 // ID R4-1_BM repbase; DNA; INV; 1875 BP. XX AC . XX DT 29-APR-2010 (Rel. 15.07, Created) DT 29-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Non-LTR retrotransposon - a consensus. XX KW R4; Non-LTR Retrotransposon; Transposable Element; R4-1_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1875 RA Jurka J.; RT "Non-LTR retrotransposons from Bombyx mori."; RL Repbase Reports 10(7), 1052-1052 (2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. XX FH Key Location/Qualifiers FT CDS 196..1776 FT /product="R4-1_BM_1p" FT /translation="MVTFWRNLWSRPVEHVEGSWMQVIEDSCAGIPPMSPV FT TISKGDIKAAVCSSSHWKSPGCDGLQLYWLKSFRACHETLARQFQEALDTK FT VLPAFLTTGITHLIPKSESTADPAQYRPITCLPTTYKTLTSVLETKISHHI FT DSCRVLSGAQNGCRRGSRGTKELLLIDAVAGQQVKRNRRNFSAAWIDYKKA FT FDSVPHTWLKRVLELYKIDDTVRDFLGACMGQWSTMLSLSGVRLSAVVMEC FT DKGLTPLALARDGWQSPAVLSTSDREAIWKGKELHGRFFQALHEPHVDKEA FT SVHWLRFGDLFGETEGFVCAIQDQVIRTNNYRKHILRDGTADICRLCRRPG FT ESLRHVTSGCSMLANTEYLHRHNQAARILHQELALKYGLIEQRLPYYKYQP FT DAVLENDRAKLYWDRPIITDRTILANKPDIVLMDRTESRVFLVDITIPYDE FT NLVRAEADKKTKYLDLAHEVTDMWRVVSTEIIPVVVSVNGLVPKSLSKHLE FT RLGLNKKSVVAQMQKAVLLDNARIVRRFLSQ" XX SQ Sequence 1875 BP; 463 A; 497 C; 507 G; 408 T; 0 other; gcacaacagc tgggagatcg catagacctc ctgaagcaga aggtggctgc ttggagcaag 60 cgcgtccggc ggtactcgga acgagttcaa aggtatcgcc agaatcgcct cttcgtgagt 120 gatcagagga agttctacag gtccctagaa caggccaatg ttagcgccgt caccgaacgt 180 ccagctggac aggagatggt cacattctgg cgcaacttgt ggtcaagacc tgtggagcat 240 gtggagggct cctggatgca ggtcatcgag gactcgtgtg cgggcatacc gccgatgagc 300 ccggttacta tcagcaaagg tgatattaag gctgcagtct gtagttcttc gcactggaag 360 tctccaggct gcgacggact gcagctttac tggctaaaaa gtttccgggc ttgtcacgaa 420 accctcgcca gacagttcca ggaagcccta gatacgaaag tgctccccgc cttcctcact 480 actgggatca ctcacctcat tccaaaatcg gaaagtaccg cggatccggc gcagtaccgg 540 ccaataacgt gtcttcctac cacctataag acactgacat ccgttctgga aactaagatc 600 tcgcaccaca tagacagctg tcgagtactg tctggtgccc agaacgggtg taggcgtggt 660 agccgcggta ctaaggaact cctcttgatc gacgcggtag ctggccagca ggtcaaacgc 720 aatcgacgaa atttctctgc cgcctggata gattataaaa aggcatttga ctcggtcccc 780 catacatggc tgaagagggt cctcgagctg tataagatag acgatacagt tcgagacttc 840 ctcggtgcct gtatggggca atggagtaca atgcttagtc tatccggtgt gcggctgtcg 900 gctgtagtca tggaatgcga caaaggactg accccgctag ccctggccag agacgggtgg 960 caatcaccgg cggtactcag tacctctgac cgtgaggcca tatggaaagg gaaagagctg 1020 cacggacgct tctttcaggc actgcatgag ccccacgtgg acaaagaagc ctccgtgcac 1080 tggctgcgat tcggtgacct ctttggggaa accgagggtt ttgtctgtgc aatacaggat 1140 caggttatca ggacgaacaa ctataggaag cacattctga gggatgggac agctgacatc 1200 tgtcgattat gccgccgacc gggtgaatcc ctcagacatg tcacctctgg ttgttctatg 1260 cttgctaaca ctgagtactt gcacagacat aaccaagcag ccagaatcct ccaccaagag 1320 ctcgccctta agtatggcct catcgaacag aggctaccgt attataagta ccagccggac 1380 gccgtactcg aaaacgaccg cgccaagctc tactgggacc ggcccatcat tacggacagg 1440 actattcttg cgaataagcc tgatatcgtg ctgatggacc ggacggagtc tcgggtattt 1500 ctggtggata tcaccatccc ctacgacgag aacctcgtgc gggccgaggc agataaaaag 1560 accaaatatt tggacctggc gcacgaggtg accgacatgt ggagggtggt atctacagaa 1620 ataatcccgg tagttgtgtc ggtgaatggt ttggtcccta aaagcctctc aaaacatctc 1680 gagaggcttg gtctcaacaa aaagtcggtg gtggcccaaa tgcaaaaagc agtcttgctc 1740 gacaatgccc gtatagttcg ccggtttctc tcccaatagt ccctaacgct ccggccgatt 1800 gttgcccacc tcgccggagt gtgtcctgcc gtcttcccag gcgtggcagt gtttaataca 1860 ctataataat aataa 1875 // ID PERERE-2 repbase; DNA; INV; 4544 BP. XX AC BN000793; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 27-JUN-2007 (Rel. 12.07, Last updated, Version 2) XX DE Schistosoma mansoni Perere-2 non-LTR retrotransposon (EST). XX KW CR1; Non-LTR Retrotransposon; Transposable Element; SR1; KW PERERE-2. XX NM PERERE-2. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-4544 RA DeMarco R., Machado A.A., Bisson-Filho A.W. RA and Verjovski-Almeida S.; RT "Identification of 18 new transcribed retrotransposons in RT Schistosoma mansoni."; RL Biochem Biophys Res Commun 333(1), 230-240 (2005). XX DR EMBL/GenBank/DDBJ; BN000793; Positions 1 4544. XX FH Key Location/Qualifiers FT CDS 367..1491 FT /product="BN000793_1p" FT /translation="MTGRCNKSICHRPGCRYPVDSGMQCDECKGWYHDVCT FT NLTPAAFKRFSKNGCVWLCQQCCLDANSLLTEAISIVNAAKKCLGKRGRDT FT SVRTQSVDTQIDTRQAQVSPKTADPHPTKEEEHPPKRTVTTRNSRKKVSSV FT PRIKNGTQKGTSGTQNRKARSVSSVKDDPTSQVVLNLPESHSTPDATVVTT FT PKNVGDWVRVVRKKRQNDVKGNPTPTKRVEKPRSDHSDRSVIFHRVKESDS FT SEPKARFEHDIVLIKQLLNQVMPKNIPGVTLLKVYRLGNLAHLKPNQSRLL FT KVVFKSPNERDLILENGHKLKGSGVFLRKDLPLADRVKRREAEKELQLRLD FT AGEKDLKIVNFRVVRLRQRMMPKPLWVKHEAT" FT CDS 1929..4289 FT /product="BN000793_2p" FT /translation="MVDWGNLRTESSANSFEQELVDAVITCALVQHVKEAT FT RYDPGSASSLLDLILTHYEDDVANLDYMPPLGKSDHAVLSFDFHITVDHEY FT ASAQSRPNVWKADIPDIMKSASSVDWKIDPESSIETAWDLFRNLYLKVTAP FT HIPWTTPKRPRNSPPWFSREVRILLRKRRKMWDRFRLLGTDEAKSQYQKAR FT NTCASTLRKARKLYEEKIVRESIECPKRLYSYINQRTKRRRNVPSLWGDST FT ATSLVEDDFGKAQVFSKYFSDVYTIETPFSPVHENPPTQALDSVTIKELDV FT FGLLIKLDIGKSTGPDELHPKLLKELANFVVNPLSVCFNLSVTQGRLPKDW FT KNAIVSPVFKTGTKHKPENYRPISLTSVVVKILEKIIRKELLKYLDENRIL FT SKKQNSFRTGYSCLTNLLVARESWCALKDQKLPIDVVYIDFSKAFDKVPHN FT RLLYKLRNVGIGGNLLMWIKDFLVGRQQRVRVNSKLSSWETVLSGVPQGTV FT LGPVLFLLYVNDLPRLLSSSVLLYADDVKIWRAIQSKGDSLELQNDLERLS FT EWSQTWQLPINTSKCIVMHIGHQGTDTYTMNNTELPIVQAHNDLGVIVSQD FT LKTTAHCRAIDAKGFRTLWSIRRAFRHLDAKTFLTLYTVFVRPKLEYCIQA FT ASPCLKKDSELLERVQRTATRLIPGIAKLPYGTRLTKLNLLPLSYRRIRGD FT LITVFKLLNDKFAPDMPSFFLSSKTENLRGHSKKVHKPRRNYLSADYRLSH FT RIINEWNSLPQHVVEAPSVDSFKRKLDQLRDHHCQD" XX SQ Sequence 4544 BP; 1412 A; 1007 C; 961 G; 1164 T; 0 other; tcatatttga atgtattgtt tacaaattcg tcgttccaca ctataactag tttgtttgac 60 gtattatttg aagttgctgt cctttttatc agcgcgctat ttaaaaagga tagtggtatt 120 tttcctcgtt gagtaaattc tattgttttg atcgttatta acgtatcaag actactgttt 180 ttatactttg attgaatttt tttcagttaa agtattctat tagagtatca tttttggtat 240 cccaaatagt tttatcgtcg gtactccaga gtacctttgt ttttctattg tgtctgttag 300 tactttagtc ctaactattt atagtttttt tctacttata gtcaacactc tctttcgcat 360 cttaaaatga ctggaagatg caacaaaagc atttgccacc gcccaggatg tcgatatcct 420 gttgatagcg gcatgcagtg cgatgagtgc aaaggttggt accacgacgt ttgcacgaat 480 ctaacacctg cagctttcaa gcggtttagt aaaaatggat gtgtatggct atgtcaacag 540 tgctgtttgg atgcaaatag cctgctaacc gaggccattt caatagtaaa tgccgcaaaa 600 aagtgtctcg gcaagcgcgg tcgggataca tcagtcagaa ctcaatctgt cgacacgcag 660 attgacacaa ggcaggccca agtgagtcca aaaactgccg acccacaccc tactaaggag 720 gaagaacatc ctccaaagag gactgttaca accagaaact cgcgcaaaaa agtctcttct 780 gtccctcgta tcaaaaatgg tacccagaag ggaacttccg gaacccaaaa tcgcaaggct 840 cgatcagttt ccagcgtgaa agatgaccca acctcccaag tcgtcctgaa ccttccagaa 900 tcccacagta cgcctgacgc gactgtggtt actactccaa agaacgttgg cgattgggta 960 cgggtcgtaa ggaaaaaacg gcaaaatgac gtaaaaggga atcctacccc caccaaaagg 1020 gtcgaaaagc cgaggtctga tcacagcgac agatcagtca ttttccatag agtcaaggag 1080 agtgacagct cagaaccgaa agctcgtttt gagcatgaca ttgtgttgat aaaacaacta 1140 ctcaaccaag ttatgcccaa aaacatcccc ggagtcacct tgctaaaggt gtatagacta 1200 ggaaacctag cgcatctgaa accaaatcag tctagactac tcaaagtcgt tttcaaatcc 1260 ccaaacgaac gcgacttaat cttagaaaat ggacacaaat taaagggttc aggagttttt 1320 ctccgtaagg acttaccgtt ggcggaccgt gttaaaagac gggaagccga aaaggaacta 1380 cagctcagat tagacgctgg cgaaaaagac ctgaaaattg taaattttcg ggttgtgagg 1440 cttcgacaga ggatgatgcc gaagccactc tgggtgaagc acgaggccac ctaaataggc 1500 ttcggatctg ttataccaac gcccagagct tactaaataa gctaccggaa ctaggtgtac 1560 agattgactc aactaggcca gacataatcg cagtcacaga aacatggctg acgccgtcta 1620 tagatagtag ggaacttgat ttcgagggtt ttacattagt aagggccgac agaacgcaaa 1680 cgcgtaaagg agggggagta gctctattca ttaggaatgc tatcccattc accattatcg 1740 acagtgtatc ccatgagagt gggacgtatg aattagttag ctgccgcctg aaatgcaggg 1800 gacaagagtt gctacttagt ttgatctatc gcagtccaag ctgtgaggca aacgaggtcc 1860 tgctaaacag tctcaacact ttatcacgaa gtgatcgatg tctaatccta ggggacttta 1920 atgcacccat ggtggactgg ggaaatctgc ggactgaatc gtcagcaaat tccttcgaac 1980 aggaactagt tgatgcggta atcacatgtg ccctagtgca acacgtgaag gaagcaacta 2040 ggtacgaccc gggttctgca tcatccttac tagatcttat attgactcat tatgaggatg 2100 atgttgcaaa cctggattac atgccacccc taggcaaaag tgatcatgca gttttaagct 2160 ttgacttcca tataactgtc gatcacgagt acgcttcagc tcaatccaga cctaacgtct 2220 ggaaagcaga cataccagac atcatgaaat cagcatcatc agtagattgg aaaatagacc 2280 cagagtcatc aatcgaaacg gcttgggact tattccggaa tttatactta aaagttaccg 2340 ccccccacat cccttggact acacctaaga gaccgagaaa ctccccaccg tggttcagta 2400 gggaggttcg catccttctc cgtaaaagaa ggaaaatgtg ggatagattt aggttactgg 2460 ggactgacga ggcaaaatct cagtatcaaa aggctcgaaa tacctgtgcc tcgaccctcc 2520 gtaaggccag aaagctgtac gaagagaaaa tcgttaggga atccatagaa tgccctaaac 2580 gcttgtattc gtatataaac caaaggacaa aaagaagaag aaatgttcct tcactatggg 2640 gagacagtac tgccacatca ctagtggagg acgactttgg caaagctcaa gtattctcta 2700 aatactttag cgatgtatac accatagaaa cacccttctc accagtccat gaaaatcccc 2760 ccacacaagc actggacagc gtaaccatta aagaactcga tgtctttggt ctgctaatta 2820 agcttgacat aggtaaatcc actggacccg atgaactgca tcctaaatta ctaaaggaat 2880 tagctaactt tgttgtgaac cctttaagtg tatgttttaa tctatccgta acccagggtc 2940 gtctaccaaa agactggaag aacgccatag taagtccagt cttcaaaaca ggtacaaaac 3000 ataagcctga gaattaccga ccaattagcc taactagtgt ggttgttaag atcttagaaa 3060 agattattcg gaaggagctg ttaaagtatc tcgatgaaaa ccggatcctc tccaaaaaac 3120 agaatagttt tagaacaggt tactcttgtc tcacaaactt attagtcgcc cgtgaaagct 3180 ggtgcgctct taaggaccaa aagttaccta tagacgtagt ttacatcgat ttcagcaaag 3240 ctttcgacaa agttccgcat aaccggctgt tatataagct aaggaatgtc gggattggag 3300 gcaatctatt gatgtggata aaagacttcc tagttgggcg tcaacaaaga gtaagggtga 3360 actccaagtt gtctagctgg gaaactgtgc ttagtggagt cccccaaggt acagttttgg 3420 ggccagtgtt attcctcctg tacgtaaatg atctccctcg tctactatcg tcatcggtct 3480 tactctatgc tgatgatgtc aagatatgga gagcgataca aagcaagggc gatagcttag 3540 aacttcaaaa tgacctagag agattatctg aatggtccca aacctggcaa ttgccgataa 3600 acacttccaa gtgtattgtg atgcatattg gccaccaggg tacagataca tacacgatga 3660 ataacactga gttacctatt gttcaggcac acaatgactt aggcgtcatc gttagtcaag 3720 acttaaagac tactgcacac tgccgtgcaa tagacgccaa aggttttagg actttatggt 3780 ccatacgtag ggcttttagg catcttgacg ctaaaacgtt tctgactttg tatacagtgt 3840 ttgtacgccc taaacttgag tactgcatac aagcagctag cccctgccta aaaaaagaca 3900 gtgaactctt ggaaagggtt cagagaacag caactaggct gattcccgga atagcgaagc 3960 tcccgtatgg tactagactg accaagctaa acctactccc gctgtcatat agaagaatca 4020 gaggcgactt gattacagtt ttcaaattgc ttaatgacaa atttgcacct gatatgccct 4080 catttttctt gtcttccaaa acagaaaatc tacgaggaca ctccaaaaaa gttcacaagc 4140 ccagaaggaa ttacttgtca gctgactacc gactttccca tcgaataatc aacgagtgga 4200 attcattacc tcagcacgtg gttgaggctc catccgtcga ctccttcaaa agaaagttgg 4260 atcagctgag agaccatcat tgccaggact aacacaggcc accaagcctc ctgtcctttc 4320 caaactgaaa ctgaaactga tttaccttga ctccgaagtc ttgaccttca gcaaatcagt 4380 gaaagatttg aaaatcttcg tcaccagaaa gcaaggctca aatttcagtt caaagacagt 4440 tggagttagc caatccttgt tcacacaatg gtttaccatt aacaaatgaa aaacaactga 4500 tggaattaca aggaaaagaa cagacttttt atataaatac acta 4544 // ID BEL-11_CQ-I repbase; DNA; INV; 5946 BP. XX AC AAWU01008510; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_CQ_; KW BEL-11_CQ-LTR; BEL-11_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5946 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 175-175 (2011). XX DR GenBank; AAWU01008510; Positions 114966 120911. XX CC 'GGGGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 424..5916 FT /product="BEL-11_CQ-I_1p" FT /translation="MFLRSGRVKKVLFPSPSTPTGVLSAPLSICEQKEAAE FT RRRREREMAESIDALDQCCQAAKAKVSRIRKAILDAKQDHKKFAYNALKLY FT AKQVDAAYDEYNSFTNRIYLVDPKKKQEFETKFIDFEELYEFVRIALSEMM FT QQYEDAEKAAAEVAAFEREKQLLALKFRNDTVAVTNVPANPVPYQIPPSLL FT LQQTPLPTFDGRYDQWHKFKARFCDIVDRCTTDSPATKLHYLDKALVGDAQ FT GVIDEQTLNDNNYEGAWRILIERYENIPMVVHGHVTKLLNLKPMARESSLE FT LRRLLDDCTKRVESLEFHKLKMDKMAEAIVITLLTSKLDPSTRRSWEASCD FT YGKLPVFKDTVAFLRKHSHVLERCEQSSTVTKPKVAALRTQTPTVTSKAHS FT VTVSKTNEGCAVCGATHSVEACGAFKKLIVEERYSKAKQFGLCFLCLNRGH FT RTNTCKKTDLCTACKKKHHALLHPDEKKLDTPEAGRRSAEEQETSSSCPKP FT PLTAAKCSIPNERTKSLQTKQVLLATAVVLAYDSSGAAHKCRVLLDSGAMA FT NFMSTRMAELLRLRKESANVPIDGVNGMKTVVKYKVSAKVVSRTTSFETTL FT EYLIVPRVTGALPAMKIDPKGWHIPESMLLADPKFYEPARVDMLVGAELFF FT DVLREGKLKLAPNLPLLQESQLGWLVSGPVAETADIGTVKVCHIGALDDPE FT EQLTELLRRFWTIDELGGDASPKPEDTCELNFLETHRRTNDGRYVVKLPFR FT ENVGELGESRQQAMRRFVALERRLEKYPVMKQMYVDFINEYLALGHCRVVT FT SAEPTNADAYYLPHHCVIKPDSSTTKLRVVFDASAKSSTDLSLNDVMEIGP FT TVQDSLFNILLRFRLHKFVFTADVPKMYRQVIVDEAHRKYQRILWRDNAGQ FT PIKELELNTVTYGTAAAPFLATRAIVQLARDEQKDFPVASKVVEECFYVDD FT VLTGADSLPEAKELQRDLIALLERGGFQLHKWCANDESLLEDIPVDAQEQQ FT FNLDDCNASGVVKALGILWDPVGDEFMYHVQPFNGCSEVPTKQMVLSEMSK FT LFDPHGLLAPTILIAKLLMQQLWQANVGWKDPIPQEQLRTWNRFRSELSCL FT NSLRINRRITVDDAVDVELHGYADASKVAYGCSIYLKSFKRDGTTEMQLIC FT GKSRVAPMKELQRNEKQDATPDEQTIPRLELCAALLLAEILEKVRETLAIP FT IEKVKLWSDSKIVLCWLKRMKPGTPVFVQNRVKKILKLFPAHMWHYISTLH FT NPADLVSRGVFPAELIHCDGWWFGPVEHAEPWQQNCEAELEPEEHGEPLQV FT VAAVLPNEPKKQLDPYDVILEQSDYRRLQRVFAHLTRFIFNCRSKQNKVAR FT RTGRLGGRDFGEAQRAMVSVVQQFVYKDEIDCIQKNRAVKGKLRNLNPIYD FT KDDKLLRVGGRIRNSDLPKDQKHPMILPENNHFTEILVKALHKEHLHVGLN FT GLLAVIRRRFWPVNAKRTIHRVLKGCVDCFRVNPTDVQQYMGDLPSCRVTA FT SQPFARTGMDYAGPFYVKVGRMRSKVKVYVCLFVCMAIKAVHLELVGSLTT FT EGFLAALQRFVNRRGTPSELYSDNGTNFRGGNRELSELVELLRSQILQKKV FT DAFCQPRGITWNFNPPKAPHQGGIWEANVKCMKSHLCKTLTESYLTFEELN FT TLLIQVEGILNSRPLVQLTDDPFDYEALSPGHFLVGRELTAVAEPLYDDWK FT ESSLSRYQLVQKRMQHFWKRWSSEYVTGLQKRGKWYKEPALLRKGMLVLLK FT EDNVPPKTWKLGRIIETHPGKDGIVRVITVRTGNGVYKRPTTQIAVLPIDD FT NRVPAESQE" XX SQ Sequence 5946 BP; 1528 A; 1455 C; 1769 G; 1194 T; 0 other; tttggtccac gtcgagccgg atgtgggatc agtggaattg aaggacagtt tctcggttcg 60 tgcgcgagtc gggaaagaat cgacgtgttc cgcgacggtt gcgacggtcg tggcgcgaga 120 ttccgcttgc acgcgctgca gcggatagtg gttccggaga ctgcggttac cggacgggag 180 tttctgcgcg gtcgagtccg accgaaagaa gattccgggt gcttgcgagt gccggggaag 240 tgaaggtcgg tgtgcagaaa cgtgaaagag gctggacggg cgaaaagtga aagtggaggt 300 ctaacctcaa aataggaaga gccggaactt gcgagttcct gaaaagtgga aagacctcgc 360 tggaagattc cgatacggat tgaagcgaga aaaaagtgag aaaaagaagt gcgatttcgg 420 gtgatgtttt tacgctctgg gcgcgtaaag aaagtgctgt tcccttcacc ttcgacgcca 480 acgggtgtgc tgagtgcgcc gcttagtatt tgtgagcaga aagaagcggc ggaaagaaga 540 agaagggagc gagaaatggc cgaaagcatc gatgcgttgg accagtgctg ccaggcagcc 600 aaggcaaaag tgagtcgtat tcggaaggcg attttggatg caaaacagga ccacaaaaag 660 tttgcgtaca atgctctcaa attgtatgcg aagcaagtcg acgcggccta cgatgagtac 720 aacagcttca cgaatcgcat ctacctcgtc gatccaaaga agaagcaaga atttgagacg 780 aagttcatcg acttcgagga actctacgag tttgtccgga tcgctctgag cgagatgatg 840 cagcagtacg aggacgccga aaaggctgcg gccgaggtgg ctgcctttga gagggagaaa 900 caactgctcg cgctgaagtt ccgcaacgat actgtcgcag ttactaacgt cccagcgaac 960 cctgttccgt accaaattcc accttcgttg ctgttgcagc agacgccttt gccgaccttt 1020 gacggtcggt acgaccagtg gcacaagttc aaagcaaggt tttgcgacat tgtggatcgg 1080 tgcacgacgg attctccagc gacaaaactc cattatctgg acaaggcgtt ggtcggagat 1140 gcacagggag tgattgacga gcaaaccctc aacgacaata actacgaggg cgcctggaga 1200 attctaatcg agcggtacga gaacatccca atggtggtcc atggtcacgt cacgaagctt 1260 ctgaacctga agccgatggc tagggagtca tcgctggaac tacggaggct gctggacgac 1320 tgcacgaagc gtgtggaatc cctagagttc cacaaactaa agatggacaa gatggctgag 1380 gccatcgtga tcacgctgct gacctcgaag ctggacccga gtacccgacg cagctgggag 1440 gcatcgtgtg actacggaaa gcttccggtg ttcaaggaca cggtggcatt tttgcgaaag 1500 cactcacacg tgttggagcg gtgtgagcaa agcagtacgg tgacaaaacc caaagtggcg 1560 gcactgagga cccagactcc aaccgtcaca agtaaggcgc actcggtgac ggtctcgaag 1620 acgaatgaag gttgtgccgt gtgcggagct actcactcag tcgaagcttg tggcgcattc 1680 aaaaagctga tcgtcgaaga acgatactcg aaggcaaagc agtttggatt gtgcttccta 1740 tgcctgaatc gtggccatcg caccaacact tgcaagaaga ccgatctgtg cacagcttgc 1800 aagaagaagc accacgcgct gctgcatcct gacgagaaga aactcgatac gcccgaagca 1860 ggacggcgat cggcggagga gcaggagacg agttccagtt gcccgaaacc accgctgacg 1920 gcggcgaagt gttcgatccc gaacgaacga acgaagagtc tgcagacgaa gcaggtgttg 1980 ctggcgacgg cagttgtgtt ggcgtacgac tccagtggcg ccgcgcacaa gtgtcgagtg 2040 ctactcgact ccggagcgat ggcgaacttc atgtcaacac ggatggcgga actgctgcga 2100 ctgcgaaagg agagtgctaa cgtcccgata gacggagtga acgggatgaa gaccgtagtg 2160 aagtacaaag tgagtgctaa agtggtatcc agaacaacca gtttcgagac gacgttggag 2220 tacttgattg tcccccgagt gaccggcgct ttaccagcga tgaagatcga tccgaagggc 2280 tggcacattc ccgagtcgat gctgctggcc gaccccaagt tctacgagcc agcgcgggtc 2340 gatatgctcg tcggcgccga gctatttttt gacgtgctcc gagaaggtaa actaaaactg 2400 gcccccaatc tacctctgct tcaagaaagc caactcggtt ggcttgtctc tggacctgtg 2460 gccgaaactg cggatatcgg cactgtgaag gtgtgtcaca ttggggcact ggacgatcct 2520 gaggagcaac tgaccgaact gctgcggcgg ttctggacca tcgacgagct cggaggtgac 2580 gcgtcaccta agccagaaga cacgtgtgag ttgaacttcc tggaaactca ccgcagaacg 2640 aacgatggtc gctacgtggt gaaattgccg ttccgcgaga acgttgggga gctcggggaa 2700 tcccgacagc aagcgatgcg gcggttcgtg gcgctagagc gccgcttgga gaaatacccg 2760 gtgatgaaac aaatgtacgt cgacttcatc aacgagtatc tggcactggg tcactgccga 2820 gtggtgacaa gtgccgagcc aacgaacgca gacgcctact accttccgca ccattgcgtg 2880 atcaagccgg acagctccac aacaaaattg cgagtggtat ttgacgcctc agcgaagagc 2940 agcacagatt tgtcgttgaa cgatgtgatg gagatcggac ccacggtgca ggactcgttg 3000 ttcaacattc tcttgagatt tcgcttgcac aagttcgtgt tcaccgccga tgtcccgaaa 3060 atgtaccggc aggtaatagt cgacgaagct caccggaagt atcaacggat cctgtggagg 3120 gacaacgctg gtcagccgat caaggaattg gagttgaaca cagtcactta cgggacagca 3180 gcagcaccgt tcttggcgac tcgagcgatt gtacagctgg cacgagatga gcagaaagat 3240 tttccggttg cgagcaaggt cgtcgaggag tgcttctacg ttgatgatgt gctaactggg 3300 gctgactcac ttccggaagc aaaggagctt caaagagatt tgatcgcgct gttggagaga 3360 ggagggtttc agctccataa atggtgtgct aacgacgaat cgctgcttga ggatatccca 3420 gttgacgctc aagagcaaca gttcaacctc gacgattgca acgccagcgg cgttgtaaaa 3480 gcgctcggaa tcttgtggga tccagtcgga gatgaattta tgtaccacgt tcaacccttc 3540 aacggctgct ctgaagttcc gacgaagcaa atggttttgt ctgaaatgtc aaaacttttc 3600 gatccccacg gcctgcttgc accaaccatt ctgatcgcga aactcctgat gcagcaactt 3660 tggcaagcga atgtgggatg gaaggacccg attccgcagg agcagcttcg aacctggaac 3720 cgatttcgat ctgagctaag ctgcctgaac tcgctgcgga tcaatcggcg aatcactgtt 3780 gacgatgcgg ttgatgtgga gctgcacggg tacgccgacg cgtccaaggt ggcgtacggg 3840 tgctcgatat acctgaaaag cttcaagcga gacggaacga cggagatgca gctgatctgc 3900 gggaagtcgc gtgtcgcgcc aatgaaggag ctgcagcgga acgagaagca ggatgcaacg 3960 cctgacgagc agacgattcc gcggcttgag ctgtgtgctg cgttgttgct agccgaaatc 4020 cttgaaaaag tgcgcgaaac cttagcaata cctatcgaaa aggttaagct ctggtccgac 4080 tcgaagatag tgctgtgttg gttgaaacga atgaagcctg gtacgccagt gttcgtgcaa 4140 aatcgtgtga agaagatcct gaagctgttt cctgctcaca tgtggcacta catctcgacg 4200 ctgcacaacc ctgcggattt ggtgtcgcgt ggggtctttc cggcagaatt gatccactgc 4260 gatggttggt ggttcggtcc cgttgagcac gctgaaccgt ggcagcagaa ctgcgaagcg 4320 gaattggaac cggaagaaca cggggaacct ctccaggttg tcgctgcagt tctacccaac 4380 gagccgaaga aacagctgga cccatacgac gtgattttgg aacaaagcga ctatcgcaga 4440 ctgcagcgtg tttttgcgca cttgactcgc ttcatcttca actgccgatc gaagcagaac 4500 aaggttgcgc gtagaacggg acgacttggc ggccgagact tcggtgaggc gcaacgagca 4560 atggtgagtg ttgtgcagca atttgtctac aaggatgaga tcgattgcat tcagaagaac 4620 cgtgccgtca agggaaagct gcgaaacctg aacccgatat acgacaaaga cgacaagctg 4680 ctccgtgtcg gtggtcgtat aagaaattcg gatctgccca aggaccagaa acacccaatg 4740 atcctgccgg aaaacaatca cttcacggag atccttgtca aagctctgca taaggaacat 4800 ctccacgttg gactcaacgg actgcttgcg gtcatccgac gacggttctg gccagtaaac 4860 gctaaacgaa ccatccaccg ggtcctgaaa gggtgcgtag actgcttccg ggtgaaccca 4920 acggacgtcc agcagtacat gggtgacctg ccgagctgcc gagtgaccgc ttcccaaccg 4980 tttgccagga ctggaatgga ttatgccggg ccgttctacg tcaaagtggg acgtatgaga 5040 tcgaaggtga aggtgtacgt gtgtcttttt gtatgcatgg caatcaaggc cgtacaccta 5100 gagttggtgg gctccctgac gacggaagga ttcctggcag ctctacagcg atttgtgaac 5160 cgacgaggga cgccgtccga actatattct gataacggaa caaactttcg aggaggaaat 5220 cgagagctgt cagaattagt cgagttgctg cgatcccaga ttctgcagaa gaaggtggat 5280 gcattttgcc agccgagggg aatcacctgg aactttaacc cccccaaggc accccaccaa 5340 ggaggcattt gggaagcgaa cgtgaaatgc atgaagtcgc acctgtgcaa gaccttgact 5400 gagagctacc tgacctttga agaactgaac accctgctca tccaggtaga aggaatcctc 5460 aactcaaggc cccttgtgca gctgacggac gacccctttg actacgaagc actgagtccc 5520 ggccattttt tggttggacg agaattgacc gcagtggcgg agccactgta cgatgattgg 5580 aaggagtcga gtttgtcccg ataccaactg gttcagaaac gtatgcagca cttttggaag 5640 cgatggtcaa gcgaatacgt cactggactg cagaagcggg gcaaatggta caaggaacct 5700 gcgctacttc ggaagggaat gctagtgctc ttgaaggaag acaacgtgcc accgaagact 5760 tggaagctag gccgcatcat cgaaacacat ccagggaagg acgggatcgt acgtgtgatc 5820 acggtgcgaa caggcaacgg agtctacaag cggccgacga cgcagattgc ggttctaccc 5880 atagacgaca acagagttcc agcagagtcc caggagtagt gaggccttgc cccaccgggg 5940 ggagga 5946 // ID Mariner-4_HM repbase; DNA; INV; 2992 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2992 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 221-221 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 500..2350 FT /product="Mariner-4_HM_1p" FT /translation="MARQKYKTKVCSSKRRLQYTDSQLRDAINAVKKGMVV FT YKASKTFGIPYQTLRDKISGKTSLKIQHCGYESVLGEHIENKLVEWLLTCT FT RMGFAMSVVQLLDTVQKYLNSNNIKTQFTNNRPAKGWFYAFLCRHEQLSQK FT RGEYLNRARGGVTEKAIRDWFTEIPKLLGENAQYLNDSMRVFNMDESGFQL FT SPKTNLLIGERGKNTYEESARSNKKSITTLFAVNASGTFAPSLTIFKYVRL FT PNRIINTAPSGWGIGKSENGWMTAECFFEYITNIFHPFLIKEEIPLPVIIF FT FDGHASHFSIELSEFCSKNGIILVALFPNATHILQPLDVAVFGPMKAKWKS FT FCRQWRIDHEGQEINNENVPEALNSFIIDPSMANNIKSGFKNTGIFPFDAN FT SVDYKKIVQKLSESTVAPTPYLEENQSSVTLSASSPINFIENCIDPSILEQ FT FKIADEHLGWKGDLEYLQLYHFWKRASSETYKTTILPDEIQQGNKNLSTTC FT NLENVQVITDMEMPDCNIAAVIEPKPACSSLKIAAEIINPIESVLIWPKQP FT VQTSKRKIERLPSVVTSETWQQIQKTKLQEKSQINDEKLRKKQLAADKKSK FT NRGRKIKKETNGYGEKVIK" XX SQ Sequence 2992 BP; 1156 A; 435 C; 465 G; 935 T; 1 other; gggtcgtttc cacacctacc gtgagtcgac acttatcgta ggttttaaat actaacataa 60 attttgcact aaaaagtaat tatttttttt atatatatat tttgtttatc aacaacatcg 120 tcatgcactt ctattacaat acatagaaac cgcaatacat agttacttat ataaactcag 180 tgggtttatt acgtcagcct atgaaaatac ttttttttta agtaccagct gttagaagcg 240 tttcaaataa aacaagataa tttaagaaga taacataaaa ttttatttct aaattgattt 300 aataaggtaa ataaaacatg gatcattatt aatattatta aaaccactca tcattatagt 360 atcaatgtag tatttttgaa cgtacgataa gtatgaacta aaaaaatgtc attccataca 420 tatcgtatgt ccaataaata atcaatttgt ttctttttta ctataaaaat aaatattatt 480 ttacttagat tcttttacaa tggcaagaca aaaatataag acaaaagttt gtagctcaaa 540 acgaagactt caatatacag atagtcaatt aagagatgct attaatgctg ttaaaaaagg 600 tatggttgtg tacaaagcaa gcaagacgtt tggaattcct tatcagacgc ttcgtgataa 660 aatatctgga aaaacttctc tgaagataca acactgcggt tatgaatcgg tactaggtga 720 acacattgaa aataaattag tcgagtggtt actaacatgc accagaatgg gatttgccat 780 gtctgtagta cagctattag atactgtgca aaaatatttg aattccaaca atataaaaac 840 acaatttact aacaaccgtc ctgcaaaagg ttggttttat gcatttcttt gccgtcacga 900 acagttatct caaaaacgag gtgaatactt aaaccgagct agaggaggag taactgaaaa 960 agccataaga gattggttta ctgaaattcc aaagctttta ggagaaaacg ctcaatattt 1020 aaatgattcg atgcgcgttt tcaatatgga tgagagtggt tttcagttgt caccaaaaac 1080 taatttatta attggtgaac gaggaaaaaa tacatatgaa gaaagtgctc gaagtaataa 1140 aaaaagcatc accactttat ttgcagtaaa tgccagcgga acttttgctc catcattaac 1200 aatttttaaa tatgttcgtc ttccaaatag aattataaat actgctccat caggatgggg 1260 tattggaaaa agtgaaaatg gttggatgac cgcagaatgt ttttttgagt atattacaaa 1320 catttttcat ccatttttga taaaagaaga aattccttta ccagttatta ttttttttga 1380 tggtcatgcg tcacattttt ctatcgaact tagtgagttt tgctctaaaa atggaataat 1440 tcttgttgct ttgttcccaa atgcaacaca tattttgcag ccgctcgatg tggcagtttt 1500 tgggccaatg aaagcaaagt ggaaatcttt ttgccgacaa tggcgcattg accatgaagg 1560 acaagaaata aacaatgaaa atgtaccaga agcattaaat tcttttataa ttgatccttc 1620 tatggcaaat aatattaaaa gtggttttaa aaataccgga atttttccat ttgatgctaa 1680 tagtgttgac tacaaaaaaa ttgttcaaaa attatctgag tcaactgtkg ctccgactcc 1740 ttatttagaa gagaatcaat caagtgtgac tttatcagct tcctctccca taaactttat 1800 tgaaaactgc attgatccaa gtattttaga gcagtttaaa atagctgacg aacatttagg 1860 ttggaaaggt gatcttgaat acctccagct gtatcatttt tggaaaagag cctcatctga 1920 aacatataag accacaattt tacctgatga aatacaacag ggtaataaaa atctatcaac 1980 aacttgtaat ttagaaaatg tgcaagttat aaccgatatg gaaatgcctg attgcaatat 2040 tgcagcggta atagaaccaa aaccagcttg ttctagttta aaaatagcag cagaaatcat 2100 aaatccaatt gaaagtgttt taatatggcc aaaacaacca gtccaaacgt ccaagcgcaa 2160 aatagaacgt ttaccatcgg ttgtaacatc tgaaacgtgg cagcaaattc aaaagacaaa 2220 attgcaagaa aagtcacaaa taaatgatga aaaattaaga aagaaacaat tagctgcaga 2280 taaaaagtca aaaaatagag gaagaaagat taagaaagaa acaaatggct atggagaaaa 2340 agttattaaa tgaaaaaact aaaaaagaaa gagaaaacat gaaaattttg aaaaagaaaa 2400 ataaagtcgc aaaagaaaaa gtaagccttg aaactgagtc agaaaacgag ttccatttaa 2460 atgttaacaa taattataaa ataagactca gataattatg atgaaactct atcctttcca 2520 gtttaaaaga tttgtactaa agtaatggaa atgctttaaa aatctttagt tttaattcaa 2580 aataatattt tcattactat ataattaatg tttatttaat agtctttttt tttaatttat 2640 atacctaaaa acgtttaata aaatgttact aatgcattaa attttcattt gaaaccgcga 2700 gcgcattata aaccagtcac taaagcagat tttaagatat acgataagtg tcaacaatat 2760 atacgataag tgtgaaacac ttacgataag tatggaaaac tgcgctttaa aagtttgatc 2820 aaaataagta cattattgat ttaaatttca gagtgagaat aattttaaac taacccttat 2880 agtactacat tgatggctgc aaaaattcat ttctaataaa tttctacttt ttgagtaatc 2940 taacatatta taataacttt tgttaaactc acggtaagtg tggaaacgac cc 2992 // ID Mariner-17_HM repbase; DNA; INV; 2961 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2961 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1951-1951 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 680..2314 FT /product="Mariner-17_HM_1p" FT /translation="MVRYRKKKTDRVAISCDTMKSAVNEVLEGRPVNVVAR FT KFSINRMTLKRYCNKKKLNPNESFKPNYNNKQVFTAEDEKSLSSYLLLASK FT MNYGLSTKSTRLLAYEFAVKNNKICPSSWIKNKIAGIDWLQGFMKRQPELS FT LRTPEATSFARSTAFNRHTVGEFFQNLKTVRNRYKFDPYCIYNVDETGLTT FT VQKPVKVLAGKGSKQVGRITSAERGTLVTVCCASNAIGNSIPPLFVFPRVK FT FHDYMIKEGPPGCVGFANPSGWMNSDIFIEWIKHFVRYSNCSHESPVLLLL FT DSHKSHISVRALDLAIQHGITMLSFPPHCTHKLQPLDRTVFGPLKRFYNAA FT CDNWMVTNPRPMTIYDIVSIVREPYTKAFSPSNIQRGFQVAGIEPFNPEIF FT KDDEYLPSSVTDRAAPNVVNTILVEPEMTVAHVDGMEPEMPVVHHETSFSN FT KVSPSIASLLSPEVLKPYPKASARKRSVKSRQLKTRILTDTPIRNEIRLSE FT EKKTLKQINQQKKLIAKDKKRGKVKNYLLRNKKQCNVKAASRLDFDKG*" XX SQ Sequence 2961 BP; 1040 A; 432 C; 474 G; 1014 T; 1 other; ggggagatgg gggcgcattg atcgccgggg cagtatgatc ttttagcttt tactcgttta 60 cttttgcctt aatttaacag agattgcaat ttaattaacc attatactac ccttcattgc 120 ttaatcgtaa taatttttga taagatcaac ttcagcgtta agaaaattga aaagactgtt 180 tttcatcatg aaaagtgatt tttgaaagtt gtaaacaaat tcaaataaat ttttaattag 240 atttccgtat gactttacaa ccaatttatt gtaaactcat atgtgttaag tttataataa 300 atttgttttt ttgagttggt tactgtattt atagaagtta ttggcatcaa tatatcaact 360 gtacctaatg gggcacattg atcgttgact atgcggggct cattgatcgg aggatcaaag 420 tgccccatat aacttatatt ataagactta aattttttgt ttaaatttta tacatttaaa 480 cttatgacga ataaaaaatc tttatataaa cttttaaaat ttcattttta aaatgtaata 540 gtgttaaaaa ctaatattaa taacattagt ttttagtttt acataagttt tacaattagt 600 ttacataata ttattatata aaatattttt tttaaacaat cattttattt tacttttttt 660 taaattttca gatctttaaa tggttaggta tagaaaaaaa aaaacagaca gagttgctat 720 atcatgtgat acaatgaaat cagctgttaa tgaagtttta gagggcagac cagtgaatgt 780 tgtagctcgc aagtttagca taaatcgaat gaccttaaaa agatattgta ataaaaagaa 840 gctcaaccca aatgaatctt ttaagcctaa ttacaacaat aaacaagttt ttactgcaga 900 agatgaaaaa agtttgtcaa gttatttact acttgcatca aaaatgaact atggactttc 960 aactaagtca acacgattat tggcatacga gtttgctgta aaaaataata agatttgccc 1020 ttcatcatgg atcaagaata aaattgcagg gattgattgg ttgcaaggtt ttatgaaaag 1080 acaaccagag ttgtctttac gaacacctga agcaactagc tttgctcgat caactgcctt 1140 taaccgacac actgttggag aatttttcca aaatctaaaa acagtaagaa atcgatataa 1200 gtttgatcct tattgcatat ataatgttga tgaaactggt ttaacaacag tacaaaagcc 1260 agtaaaagtg ttagcaggta aaggaagtaa acaagtagga agaatcacat ctgcagaacg 1320 aggaacatta gtaactgtat gttgtgcttc aaatgcaatt ggaaactcta ttcctccact 1380 atttgttttt cctagggtca aatttcatga ttacatgata aaggaaggtc ctcctggatg 1440 cgtgggattt gcaaatcctt ctggttggat gaattcagac atttttatag aatggataaa 1500 acattttgtg aggtattcaa actgttctca tgaatctcca gttttgttac ttctggacag 1560 ccataaaagt catatttctg ttagagcttt ggatcttgca attcaacatg gaattacaat 1620 gctaagtttt cctccccatt gtacccataa attgcaacca ttggatagaa ctgtttttgg 1680 accattgaaa aggttttaca atgctgcgtg tgataattgg atggtcacaa atccaagacc 1740 tatgaccatt tatgatattg tttcaatagt tcgcgaacca tatacaaaag ctttctcacc 1800 atctaatata cagagaggat ttcaagtagc tggcattgag ccatttaatc cagaaatttt 1860 taaagatgat gaatatttac catcatcagt tacagatcgt gctgctccaa atgtagtaaa 1920 cacaattctt gtggaacctg aaatgactgt agcacatgtt gatggcatgg agcctgaaat 1980 gcctgtagta catcatgaaa caagtttttc gaacaaagtg tcaccaagca ttgcttcatt 2040 actctcacct gaagttttga aaccttaccc gaaagcatct gctagaaaaa gaagtgttaa 2100 aagtaggcaa ttaaaaacaa ggattttgac tgatactcca atcagaaatg aaattcgttt 2160 gtcagaagaa aagaaaaccc taaaacaaat caaccaacaa aagaaactca tcgcaaaaga 2220 taaaaaaagg ggaaaagtta aaaactactt gcttaggaac aaaaaacaat gtaatgtaaa 2280 agcagcatca cgacttgact ttgacaaagg atgataatat aaaaaaataa tcttgtaaac 2340 tttagttcta ataaaagtct tgtatgaatc ttgtattttc tttgtaaact taaaatactg 2400 tattcttata gactttataa gcttcaagtt ttgtaaaact taacacatat atatgttagt 2460 agtatataac gggcttattt tcatagatag aattgtggtt attcttctaa tgtaaacttt 2520 atacactggt gtgtattagg ttgtaatgat atgtatatat atatatatat atatatataa 2580 tatwtatata tatatgtttt aaattttact ttttttttta gatgtattat ataatataca 2640 ctaataccct tcaactaatg catttgtagc acttaggtat agcagtatta aacatgtaaa 2700 taatgttact catataagat caaagcgccc catatcggag atcatagtac cccattgcat 2760 ggggtacatt gatcttttgc aagggttgtt gtaatgttga aatataccat gttcattaat 2820 gttttttatt aaaacattta tttttactta aagacagatt gttctagttt agtttagtaa 2880 ttaataagtt tttatatcta gatcgggttg tttgcagttt tcaaaatact gcaacggcga 2940 tcaatgcgcc cccatctccc c 2961 // ID Crack-25_AAe repbase; DNA; INV; 4533 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-25_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4533 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1241-1241 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 389..1435 FT /product="Crack-25_AAe_1p" FT /translation="MSENGDTEDTIVCSICKKQELDSTRIIECSVCFNSMH FT FRCKRIFGTGITRARQQSSFVCSPNCAEIYGRMNGNPNFEKLVAELNESVR FT ASIKQEMEVINAKIDTSQQSMMKELHSFANRFDELKIENDNLKKTVASLTE FT KYDSLMDSMITLETEVNRSSCSAMEKNAVILGLPMTENEDTKDVFQQLCSS FT LSFDLPENAVLSAKRMVSKNAKGGNPPIKIVFTDSSIKERFFATKKQHGKL FT LSTVVNGMPVNGKPGNVLVRDELSSLGLSMLREVRGMQDALGIKYVWPGRG FT GVILLKRNDGAKVEMIRNRQDIDRLGQKLNKRTLSDSSPRSLNNSAVNEPS FT SKRTCP" FT CDS 1489..4383 FT /product="Crack-25_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MYFSNSYYDSFSSYNASKNGSKNGNXFKILQINIRGL FT NNTTKFDYLLECLDSCKERIDVIVISETWLKKHKTALFEISGYNSVFVCRA FT ESSGGGLAVFVRKEHRTQIRTVQNIPDFHHIHLEITNMSRLLNLHAFYRPP FT SQSHEPFLKKLEDIISSCPPSNDCIIVGDINIPVNKPDVPMVKHYQSLITS FT YNYAITNEHSTRSASGSILDHSLCSNSLLQRIHNDTVFHDVSDHNFVITTI FT SCTFKSFRTKLEKKIVNHRKVNNEFSEFLQNIPQELGPVEKLSLIINEYKS FT IKERNSKTIFVNAQLKIPLCPWMNVDVWKLRKIKDNLLTKSKQQPNNSNVK FT ELLKHVSNKLVNAKRIAKNQYYHNLLNTPNQKLLWSRLNELLGRNVKQDNL FT VEIEKNGELVVGDVEKSSAFNEYFCSIGSDLASRIDTDRDITKFNSIKVNN FT ASIFLRPASSQEVTILINNLNSKKARGPDDFPADLVKCHHLFFAQLLRDVF FT NYSVETGVYPDSLKIARVVPVFKKGDRKNIENYRPIAVLSVFNKIIEQLLA FT SRLNDFFDRHDILYHMQYGFRSGSSTSNVLCELVDSLYASFEKRNMVGAVF FT IDLKKAFDTLDHEVLLAKLAAYGVRGQANNLLRSYLSERKQFVVVNQISSG FT SKCIPIGVPQGSNLGPLLFLIYMNDLPHINLRGVVRLFADDTSIFYEGKTQ FT AEIKSDAEEDLRTLKSYFQTNMLSLNISKTKFMLFRSTHKMISNPTGLTID FT DETVPYVDSFKYLGLIMDSSLTWQHHIDQLSITLSRMCGLIRKLTDFLPFK FT ALEKLYFAFFHSHLQHLAIIWGSASNERLKRIQVLQNRCLKLIHKLPQLYS FT SALLFTDSKIKTLPIRALYHQQLMIHTYSVVNRQDLLTNMRFSRRTHSHLT FT RRANHLPIPRVRTNYGKRSFGYLGPVLFNRLPQTLKNYTPSSKFKKELKMY FT LKSNVTLYLR" XX SQ Sequence 4533 BP; 1471 A; 892 C; 838 G; 1331 T; 1 other; agtctggcaa ctctggaaac taattggtgg ctattttttt gtgcttttaa ttttcatatt 60 accagttgaa aaaaacgctt ttaattcgtt gttttggtga aatagtgaag aatatacatc 120 gaaccacatc ctaaaagtcg atgaggagga aaactgcacg catctcccag gtttttcatc 180 acaaaagtta tggtgaaatt cgctaattat actttgtagc gcattgtact tgaataattt 240 cgagcagttt catgcaccaa attaccacag tattctcttc gtctctgtat ccctattgtc 300 tatagtcgtc tgtgtgtagg gtttcatcca ttgtgtgaca tgatatccat attctagttg 360 ctatagcatg taaacatctt tcgtcaaaat gtctgaaaat ggtgatactg aggatacgat 420 cgtctgttcc atctgcaaga aacaggaact ggattcaact agaatcatag agtgctcggt 480 atgcttcaac agtatgcatt ttagatgcaa gcgaatcttc ggcaccggca tcaccagagc 540 gcggcagcaa tcttcatttg tttgctcacc taactgtgct gagatctacg gtcgtatgaa 600 tggaaatcct aatttcgaaa agctggtagc tgaactgaat gaatccgtac gtgcttcgat 660 caagcaagaa atggaagtca tcaatgcgaa gatcgataca tcccaacagt ctatgatgaa 720 agagttacat tccttcgcaa accgtttcga tgagctgaag atcgaaaacg ataatctcaa 780 aaagaccgtg gcatcgttga ccgaaaaata tgactcgcta atggacagca tgatcacact 840 tgaaacagaa gtcaacagat catcatgtag tgctatggaa aagaatgctg taatcttggg 900 cttaccaatg actgaaaacg aggacactaa ggatgttttc caacaactgt gtagttctct 960 atcgtttgac ttgcctgaaa atgccgtcct gtcagctaaa cgaatggttt cgaaaaatgc 1020 gaagggggga aatcccccta ttaaaatcgt cttcaccgat tcgagcatca aagaacgttt 1080 cttcgcgact aaaaagcaac atggtaagct attatccact gttgtgaatg gaatgccagt 1140 taatggtaag cccggtaacg tactggtacg cgacgagcta tcttcccttg gactttccat 1200 gctgagagaa gtacgtggta tgcaggacgc gttgggaata aaatatgtat ggcctggtcg 1260 cggaggagtt attcttttga agcggaacga tggggcaaag gtggaaatga ttcgtaatcg 1320 tcaagatatt gatcgtttag gacagaagct caacaaaagg accttgtctg attcttctcc 1380 gagaagcctg aacaattcag ctgtcaatga accctcgtca aagcgcacat gcccataaaa 1440 tatccttagt atcaccaatt ttctcaaatt atatttgtaa atcttataat gtatttttca 1500 aattcttatt atgattcgtt ttcgtcttat aatgcgtcca aaaacgggag taaaaatgga 1560 aatwgtttca aaatattaca aataaatata agaggtttga acaacactac aaaattcgat 1620 tatctcttag aatgcctcga ttcgtgcaaa gaaagaatcg acgttattgt tattagcgaa 1680 acatggctaa aaaaacataa aacagctctt tttgaaatca gcggttacaa ttctgtgttt 1740 gtctgtcgtg ctgaatctag tggaggggga ttggcagtat tcgttcgcaa agagcatcga 1800 acacagattc gaactgtaca aaatattccg gattttcacc acatccacct tgaaattaca 1860 aacatgtctc gattgttgaa tttgcatgca ttctacaggc ctccgtctca aagtcatgaa 1920 ccgtttctta agaagttaga ggacataatt tcaagttgtc ctccgagcaa tgattgcata 1980 atagtcggtg acattaacat tcctgttaat aaaccagatg ttccgatggt taaacactat 2040 caatctctta ttacttcata caactatgct attaccaatg agcactcgac aagatctgcg 2100 tctggatcaa tactcgatca ttcactatgc tctaatagtc tgctacaacg catacacaat 2160 gatacagttt ttcacgatgt gagtgaccat aactttgtta tcactacgat cagctgtaca 2220 ttcaaatcat ttcgcacaaa attggaaaaa aaaattgtaa atcaccgtaa ggtcaacaat 2280 gagttctctg agtttctgca aaatattccg caggaactag gacccgtgga aaagctttca 2340 ttgattatca acgaatataa atccataaaa gaacgtaact ccaagacaat cttcgtgaat 2400 gctcaactta aaattccctt gtgtccctgg atgaacgtag atgtttggaa attgcgtaaa 2460 ataaaagaca acttgttaac caaatctaaa cagcaaccga ataattcgaa tgtaaaagag 2520 cttctaaaac acgtttcaaa taaactcgtt aatgccaaac gtattgcgaa aaatcaatat 2580 taccacaatt tactgaacac accaaaccaa aaattgcttt ggagtcgttt gaacgaattg 2640 cttggtcgaa atgtcaagca agacaatctt gtcgagatag aaaagaacgg ggaattagtt 2700 gtaggcgatg ttgaaaaaag ttcagcattc aatgaatatt tctgctcaat aggatccgat 2760 ttagcttcta gaattgatac cgacagagac atcactaaat tcaattctat caaagttaat 2820 aatgcatcaa tttttttgag accagccagt tcgcaagagg ttacaatcct aataaacaat 2880 ctcaattcaa aaaaggcacg aggcccagat gactttccag ctgatttagt aaaatgtcac 2940 catctattct ttgcccaact actacgtgat gtgtttaatt actctgttga gacaggcgtt 3000 tatcctgatt ccttgaagat cgccagagtg gttccagtgt ttaaaaaagg agacagaaaa 3060 aatattgaga actatcgtcc gattgcagta ttgtcagttt tcaacaaaat aatagagcag 3120 ctattggctt cccgcttaaa tgattttttt gatcggcacg atattttgta tcatatgcaa 3180 tatggattta ggtctgggtc cagtacgtca aatgtgctat gtgagttagt cgattctctg 3240 tacgcatcgt ttgaaaaacg gaatatggtt ggtgctgttt tcatcgatct taaaaaggct 3300 ttcgacactc tggaccacga agttttacta gccaagctag ctgcttacgg tgtacgtggt 3360 caagcaaata atcttcttcg aagctattta tcggaaagga agcagttcgt agttgttaat 3420 caaatatcca gcggttcaaa atgcattcca atcggtgtac cacagggcag caacttgggg 3480 ccactgttgt tcttaatata catgaacgac ctgccccaca tcaaccttag aggagttgtc 3540 agactatttg cggatgatac atctatcttt tatgaaggta aaactcaagc agagattaaa 3600 tcagatgctg aagaagactt gcgtaccctc aaaagctatt ttcagacaaa catgctttca 3660 ttaaacattt ctaaaaccaa attcatgtta tttagatcaa ctcacaaaat gatatcaaac 3720 cctacaggac tgactataga cgatgaaaca gtgccatatg ttgatagttt caagtacctt 3780 ggacttataa tggattcttc cttaacgtgg caacaccata ttgatcagct atcaattaca 3840 ctttcaagaa tgtgtggatt gatacgaaaa ctaactgatt tcttgccttt caaagcacta 3900 gaaaaactat attttgcatt cttccactct caccttcagc atctcgcaat catttggggc 3960 tctgctagta acgaacggct aaagcggatc caagtcctac agaatcgctg tctaaaattg 4020 atacataagc ttcctcagtt gtattcgtct gctcttcttt ttacagatag taaaataaaa 4080 actttaccga ttcgtgctct ataccaccaa cagttgatga tccatacgta cagtgtagta 4140 aatcgacaag atttactcac taacatgcgt ttttcaagaa gaacacacag ccatttaacc 4200 cggcgagcca atcacctacc gattccaaga gtgcggacta actacggaaa acgaagtttt 4260 ggttacctag gacctgtttt gttcaacaga ctacctcaga cgctaaaaaa ttatacaccg 4320 tcatcaaaat ttaaaaaaga attaaaaatg tacttaaaat ctaatgttac tctatactta 4380 agataaagcc atcaatgttt cacatcagtg taaaataatc ccccttcaaa gagcttaaag 4440 ctcactgggg gctacaaatc actgtaatta accagttgaa tgcttgttat gtcaattatg 4500 tgtaaaattt atatgttaat aaatgttaat aaa 4533 // ID Kiri-7_AAe repbase; DNA; INV; 4703 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4703 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 702-702 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >96% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 274..1089 FT /product="Kiri-7_AAe_1p" FT /translation="MSQNGIVTRSGSTSSLVGSIAKRSREDLTKQDGDSEI FT VSLDDLWNKMQTMLSNSCGRIETKIEKCNAALEKRMTNIEEKLNAVREECS FT EKIVKLEEDVVGVRADVDFAVEAVHRMGRDRELVFSGIPFHSQENLDEVFR FT KIALTIGYHENNLPIVDLQRMARAPIALGTSPLILCEFALRNKRNEFYRSY FT LSKRSLCLRDIGLDSGNRIYMNENLTNNARQIRAEAIKLKKLGYVENVSTR FT NGIVHVKRKGTDKPSAIYSLQQIVQRKTPIQ" FT CDS 1728..4556 FT /product="Kiri-7_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDRIDASSNSASDQGVIAKAVVNAVLLKDKLNISHIN FT VQSLCARRFSKFEEXKRLVTESKIDIACFTETWMDSSVTDSMIHIQGFNVV FT RNDRNRHGGGICIYIRKGLAYRLVKKSFICVNDALSKTEFLILEFQIDNDR FT IVLAVYYNPPDTDCSELLRNHFEEFSVRYSSTFFIGDFNTDPFKQNRKSSC FT FNDVVNSMSFSRINSEPTFFYNSGCSLLDLFLTDTPDVVLRFSQISMPGIS FT KHDLLFASLNYSNKIENQGYWYRDYFNYNHDSLYDDFFRFNWNRYFCVDDP FT DILTSILNNQLTVLHDTFFPLRFHKFRKNPWFNNDIETAMINRDLAYRNWK FT RCKTFENKMQYKRLRNLVSSMISNAKLVHDRQKLNLNLPSKQLWNNVKSLG FT VSTKESFPVVSQHSANDINEYFSSNFTADDNETIRFSSNGNGFCFRQIVDY FT EIVNAIFSVKSNAVGFDGIPVNFLKIVCSFALPVFEHLFNSIIVSNKFPSQ FT WKSAKVIPINKKPNVLSISNLRPISLLSTVSKVFEKLIKIQILEYVNRMNY FT INIFQSGFRNNHSTETALIKVHDDIAKSIDKNGVTILLLIDFAKAFDRVAH FT NKLINKLITLFNFSNCAARLIASYLSHRDQAVICDGKLSDFKPTISGVPQG FT SVLGPLLFSLFINDLPTVLDFCSVHMFADDVQIYICTDENTDLSSAGSLMN FT HDLHKVMKWSKDNLLPINPDKTKAMLISKQKTPPQPPQILFGGLEVQFVDR FT VNNLGVIFNKNLEWDSHVNMQCSKIYGSLKRLNLTTRHCDTGTKLKLFKSL FT ILPHFIFGDFIYTNALAGSTDKLRIALNACVRYVYKLSRFSHVSHLQKXLI FT GCPFVNFYKYRSCLNLFRIIKTSSPPYLFENLVPFRGTRTKNYRVPQHSSS FT YYSQSFFVRSIVSWNSLPTSIKTINSFPSFKKNLLQNFQ" XX SQ Sequence 4703 BP; 1417 A; 855 C; 875 G; 1551 T; 5 other; tctgagtttc tgaagggatg agcgacagta cttaagctca aaagtggttg cagttaaatc 60 cttctttttg catccgtaaa gtaattccgt tgcataataa actggtgttt ctgtatatat 120 agaaacgttc tggagtagtg acacataaag ttcaacataa acgtgcgtta attgtcgctt 180 aaacgtgaaa aaatcaatca ctgtgcaaca aagcaaaagt tgtggccctt cagcattatt 240 tacgcatctg tcatccgttg gcactctccc aaaatgtctc aaaatggaat tgttactcgg 300 tccggatcaa catcttctct tgttggatca attgccaaac gatctcggga agatttaacc 360 aagcaagatg gcgacagtga gattgttagc ctcgacgacc tgtggaataa aatgcaaaca 420 atgctctcaa actcctgcgg acgaatcgag acgaagattg agaaatgtaa tgcagcgttg 480 gagaaacgaa tgactaatat cgaggagaag ctgaatgctg tgcgtgagga gtgttccgaa 540 aaaatcgtga agcttgagga ggacgtggtt ggagttcgtg ccgatgtgga tttcgctgtt 600 gaggctgtgc atagaatggg cagagataga gagctcgtgt tttccggtat tcctttccac 660 agccaggaga atctggacga agtttttcgt aaaattgctc taacgatcgg ctaccatgaa 720 aataatctac ctattgtcga tttgcaacgt atggctcgtg ctccgattgc attaggaacg 780 tcccctctca ttctctgtga gttcgctctg cgtaataaga gaaatgagtt ctaccgatca 840 tatctctcga aacgctcgct atgccttcgt gatattgggc tcgatagtgg gaatcgtatc 900 tatatgaacg aaaatctcac caacaacgct agacaaattc gagctgaggc aatcaagctg 960 aaaaagttgg gatacgtaga gaatgttagc actcggaatg gtatcgtcca cgtcaaacgt 1020 aaaggaacgg ataagccgtc tgcgatctat tcacttcaac agatcgttca acgcaaaacc 1080 cctatccaat aaagctattc atttccttcc atgatatcca tgtttccaat cctttacatt 1140 ctttgaatcc atccactcct gaaagttaaa aactatattc atactacgac aaccctagcc 1200 tagtaatctc acaatcctag ttctttgcat atccatgact cctctcccaa tttccatgat 1260 tctttccctc ctaaaagttt gttgctgcta ccgaccactg gtgttatgtt gctgttgggg 1320 taccgatgta cgttgctgct tctgtatggc tggtgctgtt gaacgttgct gctgctgtat 1380 ggctggtgct gwttgctgtg caatactgat gttgttgaat ggctggtgct gctgaatggt 1440 ttgatgcaag catcgatgaa gaattgttat attggtttta tgaactcgaa ctcagatcat 1500 ttgcaatgtg ataaattgat tattatgtat caacgtggct tataattatc ttttctttgt 1560 gctttttaaa cggatttgct gcaacgaagt tgcgaaaatt agtagtcgtt tgagtatgcg 1620 gtctatagtt agttttaaga tgttgttctc tttagctgta aattacttgt ttgctgtctg 1680 ctttgtggat tggtttttct tgaagcgttc aatcacgttt gttgattatg gatagaatcg 1740 atgcttccag caacagtgct tctgatcagg gtgtaattgc taaagctgtt gtaaacgcgg 1800 ttttgctgaa agataagcta aatatttcgc acatcaacgt gcaaagctta tgtgcacgca 1860 gatttagtaa atttgaagag wtgaaaagat tggtgacaga gagtaagata gacatcgcct 1920 gtttcacaga aacttggatg gatagttcag taactgactc gatgatacat attcaaggct 1980 ttaatgtagt cagaaatgac cgcaataggc atggaggagg aatatgtatc tacataagaa 2040 aaggtctagc ttatcgatta gtcaagaaat cttttatttg tgttaatgat gctctatcga 2100 aaactgaatt cctaatttta gagtttcaaa ttgacaacga tcgcattgtt ttagctgttt 2160 attacaatcc tcctgatact gattgctctg aattgctaag aaatcatttt gaagagtttt 2220 ctgttcgata ttcgtctact ttctttatcg gtgatttcaa tactgatccc tttaaacaaa 2280 atagaaaatc ttcgtgtttc aatgatgtcg taaacagtat gtctttttca aggatcaatt 2340 cagaacctac gtttttctac aattcgggct gctcgctttt agatttattt ttgacagata 2400 ctcctgatgt cgtgttaaga tttagtcaaa tatcaatgcc cggaatatct aaacatgatc 2460 tcttatttgc ttctctaaac tattctaata aaatagaaaa ccaaggttac tggtaccgag 2520 actatttcaa ttataatcat gattctcttt acgatgattt ttttagattc aactggaata 2580 gatatttttg cgttgatgat ccggatatac tgacaagcat tctgaataat cagttaacag 2640 ttcttcatga tacttttttt cctttacgtt ttcataagtt tcgcaagaat ccttggttca 2700 ataatgatat cgagacagct atgattaaca gagatcttgc ttatcgcaac tggaaacgct 2760 gtaagacttt tgaaaataaa atgcaatata aaagattgag aaacttagtc agttctatga 2820 tatcaaatgc aaaactagtg catgatcgcc aaaagttaaa tttgaatctt ccttcaaagc 2880 agctgtggaa caatgtgaaa agccttggtg tttcaactaa agaatccttt ccggtagtga 2940 gccaacattc tgcgaacgac attaatgaat atttctcttc aaattttaca gcagatgata 3000 atgaaactat tcgtttttca tcaaatggca atggcttctg ttttagacag atagtcgact 3060 atgaaatcgt taatgcgatt ttcagtgtca agtccaatgc tgttgggttt gatggcattc 3120 ctgtcaattt tttaaagatt gtttgttctt ttgcgttacc agtttttgaa cacctattca 3180 actcaatcat agtttcgaat aaatttccat ctcagtggaa atccgcaaag gttattccaa 3240 tcaataagaa acctaacgtt ctctcaattt ctaacctaag accgataagc ctattgtcta 3300 cagtctcaaa agttttcgaa aagcttatca aaattcaaat tttagagtat gtaaaccgta 3360 tgaactatat caacattttc caatctgggt tcagaaacaa tcatagcact gaaactgcat 3420 taattaaggt gcatgatgac attgcaaaat caattgacaa aaatggtgtc actattcttt 3480 tgttgattga ttttgcaaag gccttcgacc gtgttgcgca caataaacta ataaataaac 3540 taatcacttt gttcaatttt tcaaattgtg ctgcaagatt aatcgcaagt tacctaagcc 3600 atagggatca agcagtgatt tgcgatggta aattgtctga ttttaaacca acgatttccg 3660 gtgttcctca gggttcggtc cttggtccgc tccttttttc actttttatc aatgatctac 3720 caactgtgtt agatttctgt tcagttcata tgtttgcaga tgacgtacaa atttatattt 3780 gtactgatga gaacaccgat ctgtcttctg caggatcttt gatgaatcat gatcttcaca 3840 aggttatgaa atggtccaaa gataacctac ttccaatcaa cccagataaa actaaagcga 3900 tgttgatatc taaacaaaaa actcctcctc agccaccaca aattctattt ggtggcttgg 3960 aagtccaatt tgttgatcgt gttaataatt taggtgttat attcaacaaa aatttggaat 4020 gggactctca cgttaatatg cagtgctcta aaatttacgg atcattgaaa agattaaacc 4080 taacaactag acactgtgat acaggcacca aattaaaatt atttaaatct ctgattttac 4140 ctcattttat tttcggcgac ttcatataca ctaatgcatt ggctggatca acagataagt 4200 taagaattgc tcttaatgct tgtgttcgct atgtttacaa actttctaga ttttcacacg 4260 tgtcacattt gcaaaaamat ctcattggat gcccatttgt aaatttttat aagtacaggt 4320 cgtgtttgaa tttattcagg attattaaaa cttcttctcc accatatctt tttgaaaacc 4380 ttgtkccctt ccgagggacg cgtacaaaaa actatagagt acctcagcat agctcttctt 4440 actatagcca atcattcttt gttaggagca ttgttagctg gaacagttta ccaacttcaa 4500 taaaaaccat caattcattt cctagcttca agaagaacct cttgcagaac tttcagtaga 4560 tgggtgttgt taaagagaac atattgttaa gagattatta agtgcttttc tagtttaack 4620 ccatattgta acattaaaaa agatgaatgt cttaagttac atgaatttaa taaatgaaat 4680 aaataaataa ataaataaat aaa 4703 // ID Dong repbase; DNA; INV; 4134 BP. XX AC . XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 05-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE Bombyx mori non-LTR retrotransposable element. XX KW R4; Non-LTR Retrotransposon; Transposable Element; KW Repetitive element; Non-long terminal repeat; BMRP2; Dong. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4134 RA Xiong Y. and Eickbush T.H.; RT "Dong, a new non-long terminal repeat (non-LTR) retrotransposable RT element from Bombyx mori."; RL Nucleic Acids Res 21, 1319-1319 (1993). XX RN [2] RP 1-4134 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-NOV-2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 419..4006 FT /product="Dong_1p" FT /note="includes a reverse transcriptase and a FT restriction-like endonuclease." FT /translation="MLRRGRIFLPASTKAGKTRGRMKWSREVNLFIMRTYY FT YVTKLETDLTIYRKKLHEHFSLKYPNVIISQQRISDQKRAIERNKLLSQET FT LDRLKEEVRKQLEDEQTNNVENEKLNSETYSHEYTTLTPQTILTKKTQQHT FT NIISSTQTSHSSTQTESITLLLENEVDILNTNPTEGATQTQEVKDKFETNL FT TMYSGMDPKARPPLPKLKYSSKLNELIRLFNNDILVDYISPDTQLSDVHTL FT TYCTAVTISEQLKYKIIAIEGNARHKKNFKPPWQQRLEKDIAKLRADIGKL FT TQYINNNRSKKVVQSVEQIFKNTKIHTSHENGNKKSQEFLDTLKQKLALKA FT HRLKRYNNSQKRKNENTIFLTNEKLFYRNLIKPKTDRDNSNIDIPTAEQLE FT MYWARLWENSAKHNDKANWITEEKERWDTIEEMQFDDVTEEEITTITARLH FT NWKSPGIDKIHNFWFKKLICLHKTIAKNLTDIISGNQSIPEFIATGITYMI FT PKGDFSIEASQYRPITCLPTIYKILTTVITKKINSHIEHNNILAEEQKGCR FT RGHMGCKEQLIIDSTIMKHATTKNRNLHCTYIDYKKAFDSIPHSWLIQVLE FT IYKINPIIISFLRNIMTHWQTTLKLKNPPNFVTTRQIAIKKGIYQGDSLSP FT LWFCLALNPLSHQLHNDRAGYRIKQQDNTETIISHLIYMDDIKLYAKNDKE FT MKKLIDTTTIFSNDISMQFGLDKCKTVHIIKGKVQPGDYTIDDTNTITAME FT PSDLYKYLGFQQLKGLDHITIKQSLTSEYKKRINAICKTKLSGKHLIKALN FT TYAIPILTYSFGIIKWSKTDIEQIERITRTTLTKHNNLHPKSAIERLTIKR FT QDGGRGMIDIWHLWRKQIHSLKTFFYIKSDLSEIHRAIAQNDNNYTPLNLK FT QKELIDNTENLRNRNPQKDMEENWKKKALHGRHPHDLSQSHIDSKASNMWL FT KTGSLFPETEGFLIAIQDQVINTKNYRKYIIKDPTIRDDKCRKCNTQPETI FT QHITGACSTLTQTDYTHRHNQLANIIHQQLALKHKLIQNTNTPYYNYKPQT FT VLENDSCKLYYDRAILTDRTIHYNRPDITLQDKNNKVTYIIDIAVPNTHNI FT QKTFTEKMTKYTELKEEIVRIWKQKKAYIVPIIISTTGVVPNHIHNSLKLL FT DLKDNIFISLQKAAILNTCRIVRKFMQLEENQTYYTQ" XX SQ Sequence 4134 BP; 1718 A; 863 C; 649 G; 904 T; 0 other; gctagctccc taaaatccta ccttacgtcc gaggcgaaca tctgtccacg tggggagcgg 60 aaacgcgtac tatcgaaact tacgcggcta acaaggtaaa ggtaacccat taatatggag 120 acaagactaa aaagaaaatt gagagggccg cttcccgggg gcgatcgccg gggcacacct 180 ggagctggcg ctgggtgttc cagcatagga tccgtcagcg gcggagagtt gagcaggcgg 240 gctctcgccg gggatttaca acccgaaaat gctacagccg ccaacagaaa tcacgatgta 300 gaaagtagga gcaatagccc atgtgaacct tacagcccga gtaccggttc atacaacccc 360 tcggtacaat catcaccatc atcctcgggt catagaggct caccaacgtc gactatggat 420 gttgcgcagg gggcgaatct ttttacctgc gtccaccaaa gctggcaaaa caaggggccg 480 catgaaatgg agcagagaag taaacctatt tatcatgcgc acctactact acgtcactaa 540 attagaaact gatctgacta tttacagaaa aaagctgcat gaacattttt cactaaaata 600 ccctaatgta ataatttcac agcaaagaat atctgaccaa aaaagggcaa tagaaagaaa 660 taaactacta tctcaagaaa ccctagaccg actaaaagag gaagtgagaa aacagctcga 720 agacgaacaa accaataatg ttgaaaatga aaaattaaat tcagaaacat attcacacga 780 atatacgaca cttactccac aaactatttt aacgaagaaa acacagcaac atacaaatat 840 tatctctagc acacagacct ctcattcatc cactcaaacc gaatcaataa ctctgctgtt 900 agaaaatgaa gtcgatattt taaatactaa ccccacagaa ggggcaacac agacacaaga 960 agttaaagat aaatttgaaa cgaatttaac aatgtactca ggcatggacc ctaaagcaag 1020 gccgccattg cctaaactta aatatagctc taagctaaat gaactgatac gtctatttaa 1080 taatgacata cttgtagatt acatctcacc agacacacaa ctatcagatg tacatacatt 1140 aacatattgc accgccgtaa ctatttcgga acaactaaaa tataaaatta tagcaataga 1200 aggaaacgcg agacataaaa aaaacttcaa accaccgtgg caacaaagat tggagaagga 1260 tatagcaaaa ttgagagcag atattggtaa actgacccaa tacataaata ataatagatc 1320 taaaaaagta gtccaaagcg ttgaacaaat atttaaaaac actaaaatac acacatcaca 1380 cgaaaatggc aataaaaaat ctcaagaatt tttagacaca ctgaaacaaa aattagctct 1440 aaaagcccac agactaaaaa gatataacaa ctcacagaaa cgaaagaatg agaacactat 1500 atttctaaca aatgaaaaac tattctacag aaacctaata aagccgaaaa ctgatcgaga 1560 caatagtaat atagatatac caacagcaga acaattggaa atgtattggg ctaggttatg 1620 ggaaaatagt gcaaaacata atgacaaagc aaattggatt actgaagaaa aagaaagatg 1680 ggatacaata gaagaaatgc aatttgatga cgtgacagag gaagaaataa ctactataac 1740 agctagacta cataactgga aatccccagg tatagataaa atccataatt tttggtttaa 1800 aaaactaatt tgcttacaca aaacaatagc caaaaatcta acagatatta tctctggaaa 1860 tcaaagtatt cccgaattca tagcgacagg aatcacttat atgataccaa aaggtgactt 1920 ctctatagaa gcatcccaat atcgaccaat tacatgcctt ccgactattt acaaaatttt 1980 aacaacagtt attacaaaga aaataaattc acatatagaa cacaataata tcttagctga 2040 agaacagaaa gggtgtagac gaggccacat gggctgcaag gaacagctaa ttatagactc 2100 aaccatcatg aaacacgcca ccacaaaaaa tagaaattta cactgtacat atattgacta 2160 caaaaaagct tttgatagca tcccacattc atggctgatc caagtcctag aaatctacaa 2220 aattaaccct ataataataa gcttcctacg caatatcatg acacattggc aaaccacact 2280 taaattaaaa aaccctccta attttgtaac aacacgacaa atagccataa aaaagggtat 2340 ttaccaaggt gattctctca gccctttgtg gttttgcctc gccttgaacc cactatccca 2400 tcaattgcat aatgaccggg cgggataccg cattaaacaa caagataaca ccgaaacaat 2460 aatatcacac ctgatttata tggacgacat taaattatac gcaaaaaatg acaaagaaat 2520 gaaaaagtta atagatacta ccacgatatt cagcaacgac atcagtatgc aatttggact 2580 tgataaatgt aaaaccgtac atataataaa aggaaaagtc caacccggtg attatacaat 2640 agatgacaca aacacaataa cggcgatgga accaagtgac ctttataaat atctgggctt 2700 tcagcagctc aaaggactcg atcatataac aataaagcaa tcattaactt cagagtacaa 2760 aaaacgtatc aatgccattt gcaaaacgaa attatctgga aaacatctta taaaagcact 2820 gaacacctat gcaataccca ttctaaccta ctcatttgga ataataaaat ggagcaaaac 2880 tgacatagaa caaatagaac gcataacaag gactacatta acaaaacaca ataatcttca 2940 tccaaaatct gcaatagaaa gattgacaat taaaagacaa gacgggggta gaggcatgat 3000 agatatttgg catctatggc gtaaacaaat acacagctta aaaacatttt tctacataaa 3060 atcagattta agtgaaattc acagagccat agcacaaaat gataacaact acacaccgct 3120 aaatctcaaa caaaaagaac taatagataa tacagaaaac ctaagaaata gaaacccaca 3180 aaaagacatg gaagaaaact ggaagaaaaa agcgctacat ggacgacacc ctcatgacct 3240 aagccaatct cacatagaca gcaaggcatc aaacatgtgg ctcaaaacag gaagtctgtt 3300 ccccgaaacg gaaggatttt taattgccat acaggaccaa gtaataaaca caaaaaatta 3360 cagaaaatat attattaaag atcccactat tagagacgat aaatgccgca aatgcaacac 3420 ccagccagaa accatacagc acataactgg agcatgttca acccttacac agacagatta 3480 cactcacaga cacaaccaac ttgctaatat tatccatcaa caactggctc tcaaacataa 3540 attgatacaa aatacaaaca caccgtacta caattataaa ccacaaaccg ttcttgaaaa 3600 tgactcctgt aaactttatt atgatcgcgc tattcttacc gataggacga ttcactacaa 3660 tagaccggat atcactttac aagataaaaa caataaagtc acctacatta ttgacattgc 3720 agtcccgaat acccacaaca ttcaaaaaac gtttacagaa aagatgacaa aatacacaga 3780 acttaaagaa gaaatagtta gaatttggaa acagaaaaaa gcatacatag tcccaataat 3840 aatctcaacc actggagttg tcccaaacca catccacaac agcttaaagc ttctagattt 3900 aaaagataac atatttattt cactacaaaa ggcagctatc ctaaatacat gcagaatagt 3960 gagaaagttc atgcagcttg aagaaaacca aacttactac acgcaataaa aactagcata 4020 attattaact cataactaat gtatattact tggccaaaag cccgtatata cagttccacc 4080 ggctctgtcg acagactgaa ctgagaaagg ggaaacatat ggaaataata ataa 4134 // ID Merlin9_SM repbase; DNA; INV; 1060 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; Merlin9_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1060 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1899-1899 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 77..886 FT /product="Merlin9_SM_1p" FT /translation="MRIWDLPKTEQDAILFFQERNILPKKRKCTNSHDMKL FT YFGKRKFWKCNLRQCDQQVGLRKDNWFEGSRIPFCSALRFIYCWCEELTSI FT KFCEKQLDLSDKTVIDWNNYMRELCVLDMENKPNKKVGGPDCIVEIDESLF FT TKRKNNCGRVLPEQWVFGGICRETKDSFVVTVPNRTGSTLLDKIIENIAEG FT STIYSDSWKGYQTNRIEIEGFLNAKVNHKYNFIDPDTGVHTQTVERMWGSA FT KWRNKRHRETSFGILFIRVHMATASGERK" XX SQ Sequence 1060 BP; 358 A; 149 C; 227 G; 326 T; 0 other; ggtacttacg tattaacgcc accgcagcag aagttgttgt ttgctgattt ttgacgccat 60 ttttagtgct acgaaaatgc gtatttggga tttgccaaaa acagaacaag atgcaatttt 120 attttttcag gagcgaaata ttttaccaaa aaaacgaaaa tgtaccaatt cccatgacat 180 gaaactttat tttggtaaac gtaagttttg gaaatgtaat ttaagacagt gtgatcagca 240 agtgggactt agaaaagaca actggtttga gggaagcaga attccatttt gttcagcttt 300 gagattcata tactgctggt gcgaggagtt gacgtcgatt aaattttgtg aaaagcaact 360 tgatttatcg gataaaactg ttatagattg gaacaattac atgcgtgaat tatgtgtctt 420 ggatatggag aataaaccaa acaagaaggt aggtggacca gattgcattg tggaaattga 480 tgaaagtttg tttactaaac gtaaaaataa ttgtggtcga gtattacccg aacagtgggt 540 gtttggtggt atatgtagag aaacaaaaga ttcgtttgtt gtcactgttc ctaatagaac 600 tggctctacc cttcttgata aaattattga aaacattgca gaaggtagca caatttattc 660 tgacagttgg aaaggttacc aaacaaatag aatagaaata gaaggattcc taaatgctaa 720 agttaatcat aaatacaatt ttattgatcc tgatacagga gttcatacac aaacagttga 780 gagaatgtgg ggcagtgcta aatggaggaa caagagacat agggagacat catttggaat 840 cttatttatc agagttcata tggcgacagc atcaggtgaa agaaaataga gactgttttg 900 aatctatgtt aaactctata tcggcccatt ttccaccgaa atccgattga ctactttgtt 960 tttaaataaa gtgttttatt ttatgatttg tcgcaaaaat tttaggcgta gattaaaacc 1020 gccaaatctg tagctgcggt ggcgttaata cgtaagtacc 1060 // ID Gypsy-31_CQ-I repbase; DNA; INV; 5436 BP. XX AC AAWU01031725; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_CQ_; KW Gypsy-31_CQ-LTR; Gypsy-31_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5436 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 441-441 (2011). XX DR Genome; AAWU01031725; Positions 18641 24076. XX CC Positions [4430-4891] - Integrase core CC 'CCCAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 167..1813 FT /product="Gypsy-31_CQ-I_1p" FT /translation="MMNHNIQPFVVGETATVGKRWKSWRRSFDLYMACCGT FT VTDEKKKAMFLHLAGPEVQEIFYDDPDHDAALPNGSDVFKEAVNLLDKHFE FT PVVCIPHERVLFRRMKQEGSDGIEQFARKLRHQGNLCDYGGALDMRITEQI FT FDGSTSDALREAILKKRLVALKDILEEGRILETIELNKQEAVASTSNGPAA FT VNKVDSGKCFRCGSLEHFANSKKCPALKKKCDKCGLVGHFKLQCRTKSHTA FT KTKKKKVRQVDSSDEEDSDEDKVSDADSENSEVLHIFSTTMKQNEVVRESS FT KYNTKISCSVGGVKLKWTVDSGASVNVIDEGTWELLKKKGCKVSYENNETR FT KKLMAYGNNRLDVKGVFKADISHGPKTVHREIYVIRGHGANLLGKTTSIDL FT GVLQIKSEVSQVQESKTPAFGKAKGVSIVVELDQQVRPVQQPCRRLPIPLQ FT AVVEAELQKLLDQDVIEYAPNKVTWSSPLVVTPKDGGRKVRLCVDMRRANT FT AIVPQRYPLPTFDEIMPYLDLKDRPHASISSAGTTSGFSGNNHIRHAEGLL FT SL" FT CDS 3797..5434 FT /product="Gypsy-31_CQ-I_2p" FT /translation="MYLRGKFFVVLTDHKPLVKIFATESDPNERQQRWVLN FT LMGYRFLVRHVPGKANIADPLSRLAQSEEAQNCDKDCEKDLCAVVESALPA FT TLTMTELIQAAEEDAETFMLRKAIRTGTWGDDMKRYRPFQNELCYANQVVL FT RRNKIVVPQSLRERVLDLAHGGHPGSSKMKRRLRAAVWWPGIDQDVDKRCK FT QCLECQAVRSGPSPEPLCIQEMPSKPRSHLSADFLGPLPNGKYLFVLIDAY FT SRYCVVEVMTKITSVLVIQRLERIFTRLGLPDILKTDNATNFCSQEFKDYC FT VDSGIKLTHTTPYWPAANGEVERQNRSILKALKIGQLNGENLEKALQDYLY FT MYTVTPHAVTGVSPAELMFGRRFKDRFPHLSGEVVVPEEVQERDRIAKYQA FT KVYRDIKVHAKPSDIQIGDHVLMKNQHRDNKLAPNFNPEPVVVVQKAGNCV FT TVRTPRGDLFRRNSSHLKPIRSSIGPAEQANEPDVDGETLAEADFDAEYPV FT AANDRAATPGIGQGGVRFNVLSGLSNPQLDSRTTSWEKSRESNLTVTKGE" XX SQ Sequence 5436 BP; 1599 A; 1150 C; 1446 G; 1241 T; 0 other; ttttggcgac gattcgggaa atttaccggg agaacgcgcg gaaaagatcg cgaaagtacc 60 gtttttgtga aaaaccgtgc gaagggaagt agttcggcta gcagaaaacg ggaaattgaa 120 cgaattttga gcgtcgcgtg tgtgagcatc gaagtgtgcg agggagatga tgaatcacaa 180 tattcaacct tttgtggttg gagaaaccgc gacagtggga aagagatgga aatcttggcg 240 aaggtcattt gatctttaca tggcgtgctg cggcacggtg actgatgaga agaagaaggc 300 gatgtttttg cacctcgccg ggccggaagt gcaagaaatt ttctacgatg acccggacca 360 tgatgctgca cttccgaatg gatcggatgt tttcaaagag gcggtcaatt tgctggataa 420 gcatttcgaa ccagtggttt gcatccctca tgaaagagta ttgttccgac gcatgaaaca 480 agaaggcagt gacgggattg agcagtttgc gagaaagttg cgtcatcaag ggaacttgtg 540 tgactacgga ggtgctctgg acatgcgaat cacggaacag atcttcgacg gctcgacgtc 600 ggatgctctt cgagaggcga ttttgaagaa gcggctggtg gcgctgaagg acatcctgga 660 ggaaggaagg atcttggaaa cgattgagct gaataagcag gaggctgtcg cttcgacgtc 720 gaacggacct gcagcagtga acaaagtgga tagtggaaag tgtttccgat gtggctctct 780 ggaacatttt gcgaactcca agaagtgccc agctctcaag aagaaatgcg acaaatgtgg 840 acttgttgga catttcaagc tacaatgtcg cacgaagagc cacaccgcga aaacgaagaa 900 gaagaaagtg cgacaagtgg atagttccga cgaagaagat tcggacgaag acaaagtcag 960 cgatgccgac agtgagaaca gtgaagtgct ccatattttc tctacaacga tgaagcaaaa 1020 tgaagtagta agggagagct cgaagtacaa cacgaagatt tcgtgttcgg tgggaggagt 1080 aaagctgaag tggacggtgg actccggagc gtcagtcaac gtaatcgacg aaggcacttg 1140 ggaattgctg aagaagaaag gttgtaaagt gagctacgag aacaacgaaa cccgcaagaa 1200 gctgatggcg tacggcaaca atcggctgga cgtaaagggt gttttcaagg cggacatctc 1260 gcacgggcct aagactgtac atcgggaaat ctacgtaatt cgtggtcacg gagctaatct 1320 gttgggaaaa acgacctcga tagatttggg cgtgttgcag atcaagtctg aagtttcgca 1380 ggtgcaagaa agcaaaacac cagctttcgg aaaagctaaa ggcgtgtcca tcgtcgtcga 1440 gttggatcag caagttcgcc ctgtacagca gccctgccgc agattgccga ttcctttgca 1500 agccgtcgtc gaagcggagc tgcagaagct cctggaccag gatgtgatcg agtatgcgcc 1560 gaacaaggtc acttggtcat ctccgcttgt cgtgacgccg aaggacggag gtcgtaaagt 1620 aaggctctgc gtggacatgc gacgcgctaa cactgctatc gtcccgcaaa gatatcctct 1680 cccgacattc gatgaaatca tgccctacct cgatctcaaa gatcgacctc acgcaagcat 1740 ttcatcagct ggaactacat ccggattctc gggaaataac cacattcgtc acgccgaagg 1800 cttactttcg ctttaagcga ctgatgtttg ggatgagcat cgcatcggag gttttccaac 1860 gagagctcgg aaacatgctg aaagggctcg agggggtaaa gcacttcatc gacgatattt 1920 tggtctttgg ccgaactcga gaagagcacg accgccggct ggcggcgttg atgaagcgga 1980 tcgaggagtg cggactaaca gtgaacaagc tgaagtgtca gattggtcag acgaaggtct 2040 cattcatggg acatctgctg tcgagtgacg gaatactgcc aatggaagag aaggttagtg 2100 caatcaagtc attccggcga ccggagtcag ccgaagagat tcggagtttc cttggtttgg 2160 cgaactacgt cggaaaattt atcccgaatc tttcctcgat cagcactccg ctgagggaca 2220 tgacggtgaa gggggccaac ccaagtagca agaaaaacat cgctttttat ttttttgttg 2280 aacaattgct gcataagcgt tttaatacgc taattattga acaagtgtca taacaattgg 2340 taaattgttc aaattttgat caaaaaacca ttgttaaaca tggttgtgta acataaatgc 2400 taaatctagc aacggattac gaaaagttca tttttgaagt ttattctgac ttaaatgaaa 2460 catgcttgta gaactcggag agcaaaatga catttgtaaa catgatgttt catttcagtt 2520 tgctaaacat gataacatag atgaattgac ctttggcttc ggctgggatt tctcgtaaac 2580 aagttgttta tgtatagttt gccttcgtgt tttaaaaacc gagaaagaac atgttgacga 2640 aacaagcaca gatagttcta aaaatagaaa cagcttatgt tcaaaggaag ttttagcaaa 2700 ctgttagtga acttggtaaa agctagatga ccgcagactt ctaattgacg tttaggcctg 2760 ctcatccaat ataacccctc gtggaaagat gcagacgcgc gcccggactt gacatttgga 2820 gtgatcgagg gttcaatact tgccgtgagg attcttcatc aaaaaggaaa actgaaaatg 2880 cagccaccgg gggaaccagc agcagagtag cagcaagcga tgttgattag tactcaaaaa 2940 ttgaaaaaaa ttatctccaa aagcaaaata atgaaattga atgatttttt tttttgttga 3000 tatttttatt attcatcgta tccctgtaaa aactgataaa tcacaaaact agtaattaaa 3060 gtattgtaaa gtgacattac tcacaaataa agtttaaaat aaagaaaaat atatttaaag 3120 cgaaaacttg ggacgtatat tatgaaaagc ctctttaaat tttttgtctg ccacgacatg 3180 ttcttggacg gttctcaaaa ggttttccta actcgcttct gaatcttaaa ggcaggatgt 3240 tgttaaacgg ttctaacaac gttcacctga cccaattcag aatcttatag tcaagagttc 3300 ttatgacatg tttaacaaac gttctctatg caaagatgat ttgcatttgc aaaactaact 3360 taaaccacgc acatttctga cggatgaaga gatgttcaac aacagaaaaa cgtgcgcttt 3420 ttccagcgtt caagagtcct tagggacgtt gggaaaacat agtaaacgag ttcaacaagc 3480 agttcggaaa tctcttggcg aataaagtgt gctacttggg aagttcaagt ggacgaaaca 3540 agccggaaaa gcgttcacgt ccattaaaag ttctctgtca aacccgcagc acctgggatt 3600 ctacagtccg gaaagtgaaa caaccctggt ggtagatgca agtgccacgg gactgggtgc 3660 tgttctcctg cagacggaca acggtcagcg aagggtgata agttacgcga gcaaaagctt 3720 gtcgaagacc gagcgcaggt actctgtttt ggataaggaa gcgttggcga tttactgggc 3780 gatcggcaga ttcgacatgt acctgcgggg aaagtttttc gtagtgctga ctgatcacaa 3840 gccgctggtc aagattttcg ctaccgaatc agatccgaac gaacgacagc agcgctgggt 3900 gttgaacctg atggggtacc gtttcctagt acgacacgtt cctggaaaag cgaacatcgc 3960 tgaccctttg tcgcgactcg ctcaaagcga ggaggcacaa aactgcgaca aggactgcga 4020 aaaggatctg tgcgccgtgg tcgaaagtgc gttgccagca actctgacaa tgacggagtt 4080 gatccaagcg gcggaagaag atgctgaaac gttcatgctg cgtaaggcga taagaacagg 4140 aacctggggt gacgacatga agcggtaccg tccattccaa aacgaactct gctacgccaa 4200 ccaggtggtt ctacggcgca acaagatcgt cgttccgcaa agtcttcggg aaagagtttt 4260 ggatctagca catggtggtc acccgggtag cagcaaaatg aaacgtcgac tgagagctgc 4320 ggtttggtgg cctgggatcg atcaagacgt tgacaaacgt tgcaagcagt gcctggagtg 4380 tcaagcagtt cggagcggac caagtcctga accgttgtgt atccaagaaa tgccatcaaa 4440 gccgcggagt catttgagcg ctgacttcct cggtccacta ccaaacggca agtacctgtt 4500 cgtcctgatt gatgcctaca gtcgctactg tgtggtggaa gtaatgacca agatcacttc 4560 agttttggtc atccagcgcc ttgagcgaat tttcactcgg ctcgggctgc cggacatttt 4620 gaagacggac aacgcaacca acttttgcag ccaggagttc aaggattact gtgtggacag 4680 cggtatcaaa ctcacccaca caacaccgta ctggccagct gcgaacggtg aagtggagag 4740 acagaatcga tccatactga aggcgctgaa aatcggacaa ttgaacggcg agaacctgga 4800 aaaggcgcta caagattacc tgtacatgta cacggtaact ccacacgctg tgaccggcgt 4860 ttccccagcg gagttgatgt ttggaaggcg cttcaaggac agattcccgc acctttcggg 4920 agaagtcgta gttccggagg aggttcaaga acgcgatcgg atcgcaaagt accaggccaa 4980 ggtgtaccgg gacatcaagg tgcacgctaa accatcggac atccaaattg gcgaccacgt 5040 cttgatgaag aaccagcacc gagacaacaa gttagcccca aatttcaacc cggaaccggt 5100 ggtagtggta caaaaggcgg gaaattgcgt gacggttaga acaccccgag gtgatctgtt 5160 ccgacgaaac tcttcgcacc tgaaacccat cagaagcagc attggaccag ctgaacaagc 5220 gaatgaaccg gacgtcgatg gtgaaacgtt ggcagaagcc gacttcgacg cagagtaccc 5280 ggtagcagcg aacgatcgag cggcaacccc agggattggt caaggaggag tgcgtttcaa 5340 cgtcctaagc ggattatcaa acccccagct agattccagg actacgagtt gggagaagag 5400 tagagaaagt aacttaactg tgacgaaggg agaaga 5436 // ID Jockey_Ele7 repbase; DNA; INV; 4980 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Jockey clade non-LTR retrotransposon family from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey_Ele7. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4980 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4980 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 8 CC sequences with >95% identity, and ~97% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 529..1785 FT /product="Jockey_Ele7_1p" FT /translation="MGRNRKQKADSASIPAPLADSGQTSAPKRARNEDANP FT AAYNSRLLASNQFASLPVDQVPQKAKVAPLFAATTDIAALRTELAALNIKP FT LFKLCHTGTKVMCDSRNDYEKAGKLLKAKGLEFYTHDAPGSKPLKVIIRGL FT PEYTPEAIIEEMVSAGLKPTSVFPIRRAQGGRHRDQLYLAHLEKGSTTMAG FT LTRVRALFHIVVEWERYRPKKRDVTQCGNCLALGHGTRNCHMKPRCGKCAG FT AHATTTCQPMEGTEPKCANCGANHEGSSRNCPKRAEFLAIRQQASAKKLGR FT QRQRQPPPPLTEEHFPTPRYQVPNLPPLQPTHRQASRQSAPSVQQRLAAAA FT AAPSVRNAPPPGWGNPGPSASGTPPPSDDDGSLYTPEQMLEYTRNLFQRLR FT ACRSKSEQIDAANSVVFAFLSKYGP" FT CDS 1577..4447 FT /product="Jockey_Ele7_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MRPLLDGGILDPVLPALLLPPTTTAPCTRRSKCWNTQ FT GTFSNGYEPAVPSRNRSTPPTRWYLRSYPNMAREAITRIANWNACSLRNKA FT IELVEFLEEKTIDIAIITETHLKPELSVSIPNYRLVRLDRTDSEGGGVAIA FT LRHNMNCRLLPSLQLKLIEAVGVEVQASDGPITIIAAYYPTQASLNDGSAT FT ALRQDITKLTRRHGRYIIAGDLNAKHEAWGNPRRNRNGVILQQGLEEGHFN FT ILGPDTPTRLSRSGAHAFIDLFITNMVSISHPVVYQELSSDHYPVVVEVGS FT SVNRHLKTRRNYRRVNWDQFRQCVDTHVDYEVRPESTDDVDRQLQAIEEAI FT SQAREQHVPATSQVSNNIPIDRLTKDLIRLRNTIRRQYQRTGLPALKSDVN FT RMNKIIKARMLDLRNQNFSNKIRALPDCAQPFWKLTKILKTKPRPIPPLVP FT QDNNSPVDRLITPAEKAAEIGRHFVSSHNLGHNIISPHEAAVNEHANNHHL FT IPNDFSEELEITADELSTYLKHSKNMKAPGFDNIMNLELKHLSVQFYDHLA FT VIFNQCLRLSYFPSSWKSAKVVPIRKPGKDPSSPKSYRPISLLSALSKLFE FT RAVYRRLLASVEQNNILLDEQFGFRRGRSTVHQLSRVTNILRRNKSLSKTS FT AMALLDVEKAFDNVWHDGLVFKLQRYNLPSYLVKIVHNYLSARTFRVSLNG FT AISDAHDIIAGVPQGSILGPLLFNLFTSDMPALPGGGALSLFADDTSIVYS FT GRRIRALTAKLQRGLDVLTEYLTSWKICINAAKTQVILFPHSRSPRLVPAE FT DCKITLNGTAVEWSNEADYLGLTLDSKLIFRQQVDKTVTRCSTMLRMLYPL FT TNRRSTLSQKNKLAVYKQIIRPVIEYGIPVWESCAKSHHLKLQRVQNKFLR FT MALNAPRRMRNSEVLRLAEIQTLEDRFSESIGRYRARCRTSDQQVVRAIVA FT PP" XX SQ Sequence 4980 BP; 1282 A; 1476 C; 1197 G; 1020 T; 5 other; ttaaagagtt gtttgtgtaa tattccctaa tgtgtttacc acatacttgc tataatctct 60 taatwcatct catagcatca tacaataatt gatttattac gaataattga caatacggma 120 atttgaggat gtttccgagg acgctgctgc agtgccgtcg ggatgcaata acagcatgat 180 gctwatcttg tacatccctt acccagacat cagtttcata tagcctattt cccagataag 240 aaaacgaaga atttgaaagg aggagcatca cctgctattc taagggtata aagcatccct 300 ccttcgttga attcggtttc atcaacattc cgaatcgaac tgccgttgtg aacggttgcg 360 tgttcttttt gagttgcgag acacaagttc gaattctccc ccgttcgtgt gcgaagtgca 420 cttttcgcga tttcccgctc ccgcgactgc tctccggcgc taacctacct ccttggggtg 480 gctagccaga aggcatctct gcaagcagag caaccgcggc ccatagcgat gggtcgcaac 540 agaaagcaaa aggcggactc cgcctcgatc cccgccccgc tggccgacag cgggcagacc 600 agcgccccga agcgwgcccg caatgaagat gccaacccgg cggcttacaa cagcaggttg 660 ctggcaagca atcagtttgc atccctgccg gtggaccaag tcccccagaa ggcgaaggtt 720 gcccccctgt tcgcggcaac gacggacatc gcggcactgc ggacggaact ggcggccctc 780 aacatcaagc cgctattcaa gctgtgccat accggcacga aggtcatgtg cgactcccgc 840 aatgactacg agaaggccgg caagctgctg aaggcaaagg ggttggagtt ctacactcac 900 gatgcccccg gtagcaagcc gctgaaggtc atcattcgtg ggcttccgga atacaccccg 960 gaggctatca tcgaggagat ggtgtcggcc ggactcaagc cgacgagtgt gttccccatc 1020 cggcgagcgc aaggaggacg acaccgggac cagctctacc tggcccattt ggagaaaggg 1080 tccaccacca tggcgggcct gacgagggtg agagcgctct tccacatcgt ggtggaatgg 1140 gagcgctacc gcccgaagaa gcgagacgtg acgcagtgtg gcaactgcct cgcgctcggg 1200 catgggacca ggaattgcca tatgaagccc cgctgtggca aatgtgccgg tgcgcacgcc 1260 actacgacct gccagccgat ggagggcacc gaaccgaagt gtgccaactg cggtgcaaac 1320 cacgagggca gcagccgcaa ctgccccaaa cgtgcggagt ttctggcaat ccgccagcaa 1380 gcgtccgcca agaagctggg acggcaacgc cagcgccaac cacccccacc gctgactgag 1440 gagcacttcc cgacgccccg ctaccaagtg cccaatctgc caccgcttca accaacccac 1500 cggcaggcat cccgccagtc ggccccctcc gttcagcaac gcctcgcggc cgccgccgct 1560 gctccatcgg tccggaatgc gccccctcct ggatggggga atcctggacc cagtgcttcc 1620 ggcactcctc ctccctccga cgacgacggc tccctgtaca cgccggagca aatgttggaa 1680 tacacaagga accttttcca acggctacga gcctgccgtt ccaagtcgga acagatcgac 1740 gccgccaact cggtggtatt tgcgttccta tccaaatatg gcccgtgagg ccatcaccag 1800 gatcgcaaat tggaacgctt gttccctccg gaacaaagct atcgagctcg tcgaatttct 1860 cgaagagaaa accatcgaca tcgccatcat cacggagacg cacctcaagc ctgagctgag 1920 cgtctccatt cccaactatc gacttgtgcg gctagatcgg accgactctg aaggaggggg 1980 agtcgccatc gctttgcgcc acaacatgaa ctgccgcctg ctgccgagcc tgcagctcaa 2040 gctcatcgag gccgtaggag tagaagtgca agcttcggac ggcccgatca ccatcatcgc 2100 tgcatactac ccaacgcaag cgagcctcaa cgacgggtcg gcaacggccc tgcgacaaga 2160 catcaccaag ctcactcggc ggcacgggcg gtacatcatc gctggcgacc tcaacgcgaa 2220 gcacgaagcc tgggggaacc cccgaagaaa tcggaacgga gtcatcctgc agcaaggctt 2280 ggaggagggc cacttcaaca tcttgggccc ggatactccc acccggttga gcagatcggg 2340 agcccacgcc ttcatcgacc tgtttatcac caacatggta agcatctctc atccggtcgt 2400 ctaccaggaa ctcagctcgg accactatcc ggtggtggtg gaagttgggt cctcggtcaa 2460 ccggcatctc aagacccggc gcaactatcg acgcgtcaac tgggaccagt tccggcagtg 2520 tgtcgacacc cacgtcgact acgaggtgcg acccgagtca accgatgacg tcgatcggca 2580 gctgcaggcc atcgaagaag cgatctccca ggccagggag caacacgtgc cggcaaccag 2640 tcaggtgagc aataacatac caattgatag acttactaaa gatttgatca gacttcgcaa 2700 caccatccgc aggcagtacc agcgaactgg gctgcctgcc cttaaatccg atgtcaatcg 2760 gatgaacaaa attatcaagg ccagaatgtt ggacctcaga aatcaaaatt tttcaaacaa 2820 aatccgcgct ctcccagact gtgctcagcc gttctggaag ctcacaaaaa ttttaaaaac 2880 caagcctaga ccaattccac cgctggttcc acaagataac aatagccccg tagatcgctt 2940 gataacgcct gctgagaagg cagccgaaat aggtcgacat ttcgtcagct cacataatct 3000 tggacacaat atcatcagtc cgcatgaagc tgccgtgaat gagcatgcaa acaaccacca 3060 tctgattccc aacgacttct cggaggagtt ggaaatcaca gctgacgaac tctcgaccta 3120 tttaaaacac tccaaaaaca tgaaggcccc aggtttcgac aacatcatga acttggaact 3180 caagcactta agcgtccagt tctatgatca cctcgcagtg atcttcaacc agtgcctccg 3240 gcttagctac ttcccatcgt cctggaagtc agccaaagtc gtccccatca ggaaacctgg 3300 gaaagatcct tcctcgccta aaagctatcg ccccatcagc cttctctcag cgttatcgaa 3360 gcttttcgaa agggcagttt atcgccgact tcttgcttcc gtagagcaaa acaacatcct 3420 gctcgatgaa cagtttggtt ttcgacgcgg tcgttcaacc gtgcaccaac tgtcccgagt 3480 caccaacatc ctcagacgga acaagtctct atccaaaacc tctgccatgg ccctgctcga 3540 cgtcgaaaaa gcgttcgaca atgtctggca tgatggcctg gtgttcaagc ttcaacgata 3600 taatcttccc agctacctgg tgaaaatagt acacaactat ctgtcggcga ggacattccg 3660 ggtctcacta aacggagcaa tttccgatgc gcacgacatc atcgcaggcg tcccccaggg 3720 cagtatcctc gggcccctgc tattcaacct gttcacctcc gatatgcccg cactcccagg 3780 aggcggcgca ctgtctctgt tcgcagatga cacctccatc gtctacagtg gtagaagaat 3840 cagagcgctc acggcaaaac tccaacgggg cctggatgtc ctgacagaat acctcaccag 3900 ctggaagata tgtattaacg cggcgaagac ccaggtcatc ctcttccccc actccagatc 3960 ccccagactt gttccggctg aggattgcaa aatcacccta aacggcacag cggtggaatg 4020 gtccaacgag gccgattatc taggcttgac tttggacagc aagctgatct ttcgacaaca 4080 agtcgacaaa acggtcacga ggtgcagcac gatgctgcga atgttgtatc ccctcacaaa 4140 ccgtaggtcg acactgtccc aaaagaacaa gcttgctgtc tacaaacaaa ttatccgtcc 4200 cgtgatcgag tacggcatac cggtctggga gagctgcgct aaatcacacc atctcaaact 4260 ccagagggtt cagaacaagt tcctkagaat ggcactgaac gccccaagga gaatgcgaaa 4320 ctctgaggtc ctccgtctgg ccgagataca aacccttgaa gatcgcttta gtgagtctat 4380 cggaaggtac agggctcgtt gccgcacgtc agaccagcag gtcgtccgcg ctatagtcgc 4440 acccccctag gttatcaaat tttgttttgc ttgtgtatat agtagttagg ttatcaaatt 4500 tttaataatc accagagctc caaagagcca aattgtcaaa ctagaataat caaatttaaa 4560 ttccacaaca ccatatcaaa acgaccaaga ttgaaaggcc cttggccgac caatttagat 4620 gtaagaataa ttgtaccccc cccaaatcaa acatgaaaat aaatatgaat taaatcgaaa 4680 aatgaattcg gtttcattct gttttcgctt tcgagctggt aagagctcct ccaaacctgc 4740 tcagctcctc gctaccagag gcaagctgtt tgtacgagat ggctgaagaa aatcgaccaa 4800 agccgcttgt tgaagcgcac atcgtcgtca gcaccgcaaa tctcatcggg cgaggaggat 4860 tgcaaccagt gcgccgtcaa aatgtgtccg tgttacgcaa attcaacaat tcaatcggcc 4920 cttttcagag ccaacaaatg tgttttaaag agttgattgt gtaatattcc ctaatttgtt 4980 // ID Mariner-4_SM repbase; DNA; INV; 2508 BP. XX AC . XX DT 08-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA-transposon from Schmidtea mediterranea. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-4_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2508 RA Jurka J.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 148-148 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 352..2244 FT /product="Mariner-4_SM_1p" FT /translation="MIKKFLKLLFLNWGKVDTLTLFWIFLVKLNLKLFVFC FT LEMPRKYTRKTDMTNNYSEESLLQAVNEIQNGMPLLTAAKEFGIPRSTLRR FT KCEGNMEKWGKKSILTLSQEQVLLERVMHLCSRGFPLTIDKFRQAAYHFAK FT VLHRRKMLETLPETWTRDKTASYEWWYSYKQRFPQLALRVAENLSSSRAEA FT FNKHRVTSFYEEASQVLNNAGVSSSPHLIYNCDETGLSSVPNKSRKVLAEK FT GKRIIQQIQTGERGTLTTFLPCCNANGEYIPPFLIFKGQHIPQSSDYPPNT FT RLMASRSGYIDMDIFLSFLQHFECHRNHEIGKKAILFLDGHKSHVSAQAVE FT YCSQVGIELVCLPPHSTHRLQPLDTHFNKGLKSKWSANLAEFLRNNSKVQL FT CRQEFYKVFNPTWAAFTERRSLLVDAFQYCGLFPCRDPTTNNDFQMALNFV FT TDTKQSSTLHESHDLPAQTAIRIIAPSPRKTPNPVHLKQHIAHITSPTMMM FT KLSQRNNSHSSGGSGQAPQSSFNEANAESPMHGSAQGHLRTLPQNVPSTSS FT GVNHGSHGYLPQPPKKRAKRCIPSQHHSKKTKQVSVKESRTSNNCVVCDEE FT YSNSSVDWFCCPGCHNWTCEMCFASDLCYNCSD" XX SQ Sequence 2508 BP; 801 A; 529 C; 436 G; 742 T; 0 other; ctgggggaaa gtggacactc tttctgcgaa ttttgttttg ataaagaaat ctatgctacc 60 gatgaaattt tttagcctac aactatttat gcgctaaaaa tatacttata gatacccttt 120 tacttaaaac taatcaatat tgaattaaca aataactttt aagtcgtgtt tttccacatt 180 aaagtcccaa aaaaagtatc gtgaaatttc aagtgttttt aaacgctgac atttttagca 240 gacaatattg tgtaattcat cattggatca gcaagaaaag gtattatttg ataatttgct 300 atataaagtt gtaatgaaaa tgttcagatt tacaaacttt tactgtttca tatgataaaa 360 aagtttttaa agttgttgtt tttgaattgg ggtaaagtgg acactttgac actattttgg 420 atatttttgg tgaaattaaa tctaaaatta tttgtttttt gtttagaaat gcctcgaaaa 480 tacacaagaa aaacggatat gacaaacaat tattccgaag aatcactcct tcaagcagtc 540 aatgaaatcc aaaatggaat gccattatta acagcagcaa aagagtttgg aattccacgc 600 tcgacgctgc gaaggaaatg cgaaggaaac atggaaaaat ggggtaagaa gtctatcctg 660 acactaagcc aagaacaagt gttattggaa agagtgatgc atttatgctc tcgtggtttc 720 cctctaacaa tagataaatt tcgacaagct gcctatcatt ttgcaaaggt cctccatcgg 780 agaaaaatgc ttgaaacatt gccagaaaca tggactcgag ataaaactgc atcctatgaa 840 tggtggtatt catacaagca acgttttcca cagcttgcct tgcgtgtagc cgaaaatctg 900 tcatcctctc gtgcggaagc atttaacaaa catcgagtga catcttttta cgaagaagca 960 agtcaagttc tcaacaatgc aggagtctct tcctcacctc atctcatata taactgtgat 1020 gaaactgggc tctcatctgt cccaaataaa tcacgaaagg tcttggcaga aaaaggtaag 1080 cgcatcatcc aacaaattca gacgggtgaa cgtgggacgt taactacatt ccttccttgt 1140 tgtaatgcca atggtgaata tataccacca tttttaatct tcaaaggaca acacattccc 1200 cagtcctcag actatccacc caatactcgc ttaatggctt caagatctgg atatatcgat 1260 atggatattt tcctctcatt cctgcagcat tttgaatgtc atcgcaatca cgaaatcgga 1320 aagaaagcta tcctcttctt agatggccac aagtcgcatg tatctgcaca agccgttgag 1380 tattgcagtc aagtcggaat cgaattagtg tgccttcctc ctcacagtac ccaccgcctt 1440 cagccacttg acactcattt caataaaggt ttgaagtcca agtggtctgc taatttagct 1500 gaattcctca gaaataatag caaagtgcaa ctttgtcgac aagaatttta taaagtattc 1560 aatccgacat gggccgcttt tactgaacga aggtcactcc ttgttgatgc ttttcagtat 1620 tgtgggctat ttccatgtcg tgatccaaca acgaataatg actttcagat ggctttaaac 1680 ttcgtcacgg atacaaagca atcgtcaacg cttcacgaat ctcatgacct tccagctcag 1740 actgcaattc gtatcatcgc tccttctcct cgtaaaactc ccaatccagt tcatttaaag 1800 caacacatcg ctcacatcac atccccaact atgatgatga agttatccca acgcaataac 1860 tcacattctt ctggtggaag tggacaagct ccacagtcct catttaatga agcgaatgct 1920 gaatctccaa tgcacggatc tgcccaaggt catcttcgca cattacctca aaatgtgcca 1980 tcaacgtctt caggagtgaa ccatggctca catggctacc ttccccaacc acccaaaaag 2040 cgtgcaaaac gatgtattcc aagtcaacat cattctaaaa aaacaaagca agtctcagtg 2100 aaagaatcaa ggacaagtaa caattgcgta gtttgtgatg aagaatattc caattcatca 2160 gttgactggt tttgttgtcc tggctgtcac aattggacat gtgaaatgtg ctttgcatca 2220 gatttatgct ataattgcag tgactgacaa acggtttctc attagtgctt actaattatc 2280 ttttctctag gactgtccac tttaccccat gatgtgtcca ctttacccca ttccattttt 2340 tggctttgac aaataaattc tatgagacag gtttggagta aactataaaa tacttcagaa 2400 aataagttac taatagatag atattgttgc caaaaaaatc agtaagccta agtaataaca 2460 tacgctaaca ccggctgatt cgcagaaagt gtgtccactt tcccccag 2508 // ID BEL-599_AA-LTR repbase; DNA; INV; 269 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-599_AA_; KW Pao_Bel_Ele12; BEL-599_AA-I; BEL-599_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-269 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 269 BP; 74 A; 74 C; 53 G; 68 T; 0 other; tgttcgcgtg cagaccatcc ctacacgctg tgtagacttc gtgcacatcc ctcgcagcgc 60 aaacgtcatc atcgcagccg cgcgctgcca acattactcc acttcctttt ccgctaagag 120 aaggcagaaa aagtacagtc gtctgaccaa tcggtcgtaa aaagtgtgct cactaaaagt 180 aaaattaaaa taagttaata cagtccattt tgagttcgaa aaagtagtcc gccgtgtttt 240 attacgtccc gttagctccc tcgcgtaca 269 // ID W1 repbase; DNA; INV; 482 BP. XX AC J04665; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE S.mansoni female-specific DNA repeat W1. XX KW Repetitive sequence; W1. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-482 RA Webster P., Mansour E.T. and Bieber D.; RT "Isolation of a female-specific, highly repeated Schistosoma RT mansoni DNA probe and its use in an assay of cercarial sex."; RL Mol. Biochem. Parasitol 36(3), 217-222 (1989). XX DR GenBank; J04665; Positions 1 482. XX SQ Sequence 482 BP; 130 A; 86 C; 112 G; 154 T; 0 other; gaattcgttc aacacagtga aattcttcct tcacacatat ctaccatcca atgtcttcgc 60 aatattttgg agtgaaattt gcttttctca ttatattgtg catgatgact gatgtgacag 120 gaatgaggat tatgttgata tcgtctgagt caatgagaat tgtgaatcgg atgtgcagat 180 gagaggttgt gcatacttgt tccttgtgac acaaaggagt ggtgatgcca gttcgagtgt 240 ttgtggatgc gatggtgttc acacgtggat tgaataagcg atgaacaaat gcgatgatgc 300 attagggtgt gtggttgtgc tggaccaatg tgcataatgg aatcgttgct tgtgcacatg 360 gaccaccaca aataacacac tcaattcata ctccgtccat ttaaccatgc attgctttct 420 catcaacacc acagtttgca ttatcatttc gaacattgag ttgaatgtcg agtggtgaat 480 tc 482 // ID BEL-58_AA-I repbase; DNA; INV; 5783 BP. XX AC supercont1.17; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-58_AA_; KW BEL-58_AA-LTR; BEL-58_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5783 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.17; Positions 2157483 2151701. XX CC Positions [4535-4801] - Integrase core CC 'TATGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 251..4801 FT /product="BEL-58_AA-I_1p" FT /translation="MSTERRIKGLKTRLKSLLTSFNLIKSFVEEYQEDRDA FT HQVSVRLEHLVALWKDVNTTQVELETLDEESLDQYLKVRTDFESSFFHVKG FT FLLSVNKSAVPDSPSASRTTAQVPPTSSVRLPDVKLPVFSGHLDGWLNFHD FT LFVSLVHSSHELSSIQKFYYLRSSLSGDALKLIQTIAISANNYQVARNLLV FT DHYQNPTILKQSYVDSLFEFPSLKRESASELHSLVEKFEANVRVLKQLGEK FT TEFWGILLIKMLSSRLDPTTRRDWEEYATSKDNVSFNDLTTFIQRRVNVLQ FT TINKSTEVLSQGAPKKSVSRSVASHGASQPNYRKCLVCSDHHPLYQCVVFS FT KMTLDDKEKDVRRHQLCRNCLRKGYQARECPSSSTCRKCRSRHHTQLCPGE FT THPASANSRTVESAPPKPTSAPIVEDPPRNSASAISEPVSCASSGQKRKSV FT LLATAVVTLIDDNGTEHFARALLDSGSECCFMVESLAQQIKAKRTKIQVPI FT TGIGQSSTYARHKLQSTIRSRVSGYSTAVEFLVLPKVTVNLPSTSIDVSSW FT EIPDGIQLADPSFYDTRPVQLVLGAEIFFDLFKVSGRIQLGESLPNLVNSV FT LGWVVSGVTSNCQSRTSVTANVATTADLHRLMEKFWSIEEGNDKNHYSVEE FT AACEAHFRRTVSRSPEGRYIVRLPLKQDALMNLGDNRRSALRRFHLVENRL FT SRNPELGTQYREFMREYEELGHMQQVHNYENPPSRCYHLPHHAVIREDSTT FT TKVRVVFDASCRTPEGSSLNDALMVGPIVQEDLRSIIMRSRTHPIMLIADI FT KQMFRQIRVDEPLQRIVWRPSSDAPMSTYELKTVTYGTASAPFLATRVLRQ FT LSEDEQDRFPEAAEVLRMDFYVDDLFSGGNTVEETIALRRQLDSLLSAGGF FT ELRKWASNVDAVLEDVPLDNKALKTSVDLDRDQCIKSLGLHWEPATDHLRY FT KIGLPPINDTPLTKRIALSQIARLFDPLGLVGPVITSAKLFMQALWTLKSN FT EGHIWGWDQELPPSYKERWVTYTSELPLLNDLRIERCILCSNPISIQLHFF FT SDASENAYGACCYIRSCNSVGETKTALLTAKSKIAPLKQQSIPRLELCGAL FT LAAELYEKVAASLKITTETYFWVDSTIVICWLSSTPSAWTTFGANRVSKIQ FT LSTQNCRWNHVSGHQNPADHISRGLPAENLLNNDLWWKGPPWLSLDKEFWP FT LQQPMNDSEYNSIPEARKPPTTAITATAEPSFIDLFVSKYSSYNKMLRITA FT YCRRFILHCQRNSRPSTTIITSEEMTGAETTLIRLVQQQTFAAEWKQLQQS FT MPVSSKSRIRWFHPFLSTDQLIRIGGRLKQTQQPFDTKHQILLPSTHSIST FT LLIRSLHETHLHAAPQLLLNILRLKYWITGARSLARKITQHCVVCVRARPK FT LVQQFMAELPVERVTATRPFTITGIDYWGPISLKPIHRRAAPGKAYVAVFV FT CFTTKAVHLELVGDLSTAKFIQALRRFVARRGLCAEIFSDNGRNFIGAANE FT L" XX SQ Sequence 5783 BP; 1588 A; 1625 C; 1193 G; 1377 T; 0 other; ttgtccttcg agccggatag cgtgaactcc gctagtccgc cagcccggaa tctacgccga 60 acccgccaga tattcgccgc cacgtgctga agaggttagc cgtctggaac aatctgcaat 120 acaaggcatt cgaattgcct tgagaaggta attatctttc caacacctcc tatccctact 180 cgtttgagct gcgcggtctt ccgccttcca gatcctttca tttcctggtc atcttcgtca 240 tctccccacg atgtcaacgg agcgacgcat caaaggcctt aagacgcgcc taaagagctt 300 gttgacttca ttcaacctca tcaaatcatt cgtggaagaa taccaggaag accgggatgc 360 tcaccaagtc tcggttcgtc tggagcacct agtagccctc tggaaggacg tgaacaccac 420 tcaagtcgaa ctcgaaacgc tggatgaaga gagcctggac caatatctga aggtccgcac 480 agactttgag tccagctttt tccacgtaaa gggtttccta ctttccgtta ataaatctgc 540 tgtcccagac tcaccgtctg cttctcgaac taccgcacaa gtccctccaa catcgtcagt 600 acgactacct gacgttaagc tgcccgtatt ctccggccat cttgatggat ggctaaactt 660 ccatgatttg ttcgtctcgc tcgtccactc gtcgcatgaa ctttcgagca tccaaaaatt 720 ctattatttg cggtcttccc tttccggaga tgccctcaag ctgatccaaa ccattgctat 780 cagtgccaac aattaccagg tggcgaggaa cttactggta gatcactacc aaaaccctac 840 tattttgaag cagtcttacg tagactcgct tttcgagttc ccttcgctta aaagggaatc 900 tgcgtcggag ttacactctc tagttgaaaa attcgaagcg aatgtccgag tccttaaaca 960 gttaggagag aagaccgagt tctggggtat actcctgatc aaaatgctga gcagcaggct 1020 tgacccgaca accagacgtg attgggagga gtatgcaaca tcgaaggaca acgttagctt 1080 caacgaccta acaacgttca tccagcgcag agtcaacgtt ctgcaaacaa tcaacaagtc 1140 aaccgaagtt ttgtcccaag gtgccccaaa gaagtcagtt tctcgttcag tcgccagcca 1200 tggggccagt cagcccaact atcgcaagtg tctcgtctgc tccgaccatc atcccttgta 1260 ccagtgcgtt gtcttttcaa aaatgaccct tgatgacaaa gagaaggatg ttcgacgtca 1320 ccaattatgc cgcaactgtc tccggaaagg ttatcaagct cgtgaatgcc cttcgtcaag 1380 cacctgtcgt aaatgcagaa gtcgccacca tactcaactc tgtcccgggg aaacccatcc 1440 agcttctgcg aactccagaa ctgtggaatc tgccccaccg aaacctactt ctgccccaat 1500 cgtagaagat ccaccaagaa actccgcttc tgccatctcc gaacccgtaa gctgcgcttc 1560 ttccggccag aaacggaaaa gtgttctact ggccacagcc gtggtcacac tgatcgatga 1620 caatggaacc gaacactttg ctagggcgct cctagactcc ggcagcgaat gctgcttcat 1680 ggttgaatcg ctagctcaac aaataaaggc taaacgcacc aagattcagg ttccgatcac 1740 tggtatcgga cagtcctcaa cctacgctcg tcacaagctg caatccacca tccgctctcg 1800 agtcagcgga tattctacgg ccgttgaatt cttggtgctt ccaaaagtca ctgttaacct 1860 accatcaaca tcgatagacg tctcatcctg ggaaattccc gacggaatcc agcttgctga 1920 tccgtcattt tacgatacaa gacctgtcca acttgtattg ggagcagaaa tctttttcga 1980 cctcttcaag gtttctggtc gaattcaact cggagagtct ctgccaaacc tagtcaattc 2040 cgttctaggt tgggtcgtat ctggggttac gtcgaactgt caatcgagaa catccgtcac 2100 tgctaacgta gccacaaccg ctgatctcca tcgtctaatg gagaagttct ggtccatcga 2160 agaaggcaac gacaaaaatc actactccgt cgaagaagct gcttgtgaag ctcacttccg 2220 tcgcacggtt tcccgctcgc cagaaggccg ctacattgta cgacttccct tgaaacaaga 2280 tgctctaatg aacctcggtg acaatcgccg ttctgcgctt cgccgtttcc acctcgttga 2340 aaatcgcctg tcacgcaacc cagagctagg tacccaatat cgagaattca tgcgagaata 2400 tgaagaattg gggcacatgc agcaagtaca caactacgaa aatcccccgt ctcgttgcta 2460 tcacctgccg catcacgcag tcatccgtga agacagcacg accaccaaag tgcgtgtggt 2520 cttcgacgca tcctgtcgta ctcccgaagg atcatcccta aacgacgccc taatggtcgg 2580 accgattgtt caagaagacc ttcggtccat aatcatgcga tccagaactc accccataat 2640 gctgatcgcc gatatcaagc aaatgttccg tcagatccga gttgacgaac cactgcagcg 2700 aatcgtttgg cgaccatcgt cggacgcgcc aatgagtacc tacgaattaa agaccgtcac 2760 ctatggaaca gcaagtgcac cttttctggc tacaagggta ctacgccaac tctccgaaga 2820 cgagcaagac cgattcccgg aagccgctga agtactacga atggacttct acgtggacga 2880 tttgttctct ggaggaaaca ccgtagagga aaccatcgca ctccgaaggc aactggattc 2940 cttactctct gctgggggtt tcgaactacg aaagtgggct tctaacgtcg atgctgtctt 3000 agaagacgta cccttggaca ataaagctct caaaacttca gtggacttag atcgagatca 3060 atgcatcaaa agtcttggtc ttcactggga accagcaacc gatcatcttc gctacaaaat 3120 tggcctacct ccgatcaacg acaccccact aacaaaacgc attgctctct cccaaattgc 3180 tcgcctattt gacccccttg gtctagtagg gcccgtcatt acatcggcca agctatttat 3240 gcaagccttg tggacactaa aatccaacga agggcacata tggggttggg atcaagagct 3300 tcctccatcc tacaaggaac gttgggtaac ctacacctct gagctaccac ttttgaacga 3360 tttacgtatc gaacgttgca tcctctgttc aaatcctatt tcaatccaac tccacttttt 3420 ctccgatgca tcggaaaatg catatggagc ctgttgctat ataaggtcat gcaatagcgt 3480 tggggaaact aaaactgctc tgctgactgc aaagtcaaaa atcgcccccc tgaagcaaca 3540 aagcatccca cgactcgagc tctgtggagc tctactcgca gctgaactgt atgaaaaggt 3600 agcagcttcc ctgaaaatca ctaccgaaac ttatttctgg gtagactcaa ccatagtcat 3660 ttgttggctc agctctacac catcagcatg gaccaccttc ggggcaaacc gcgtttccaa 3720 aatacaatta tctacccaaa attgccgttg gaaccatgtt tcgggacacc aaaacccagc 3780 tgaccatatt tctcgaggac tgcctgcaga aaatctcctc aacaacgatc tctggtggaa 3840 gggtccacca tggttgtcac ttgacaagga attttggcct ctccagcaac ctatgaacga 3900 ctccgagtac aacagcatcc cagaagcacg aaaaccaccc acaacagcaa taacggctac 3960 cgcagaacca tcattcattg atttgttcgt cagcaaatac tccagctaca acaaaatgtt 4020 gcgaattact gcctactgtc gccgtttcat cctacactgt cagcgcaatt caaggccaag 4080 cacaacaatc attaccagcg aagaaatgac aggagcagaa acgacactaa tccgcctagt 4140 tcaacaacaa acattcgccg ctgaatggaa acagctgcag caatctatgc cagtctcgtc 4200 taaatcacgc attcgttggt ttcatccttt cctttcaacc gatcaactga ttcgcattgg 4260 tggaaggcta aagcaaacac aacaaccctt cgacactaag caccaaattc ttctgccatc 4320 cacccactca atctccacac tcctcattcg ctctctacat gaaacccatt tacatgccgc 4380 acctcaattg ctgctcaaca ttcttcgtct gaagtattgg attaccggag ccagaagttt 4440 ggctagaaag atcacccaac actgtgtagt ctgcgtaaga gcccgaccca aactcgtcca 4500 gcagtttatg gccgaactcc cagtcgaaag agtaacagcg actcgtccat tcacaataac 4560 tgggatcgat tattggggtc ccatttccct caaaccaatc catcgtcgag cagcacccgg 4620 taaggcttat gtagcagttt tcgtttgctt cacaacgaaa gctgtacatc tcgagttggt 4680 tggagacctg agcactgcca aattcattca ggctctacga cgtttcgtcg ctcgccgagg 4740 gttgtgcgcg gaaattttca gcgacaatgg cagaaacttc attggtgctg caaacgaact 4800 ttgacaaatg atccaaagta atcagcatca gcaagcaatc atcgaagaat gtgcgtcaaa 4860 tggcatacgt tggcgcttta atcctcctaa ggcctctcat tttggaggac tatgggaagc 4920 cgcaatccaa tctgcccaaa aacactttgt tcgagctctg gggactcaaa cattgtgcat 4980 agaagatatg caaacacttc tcactcagat cgaatgttgc ctgaactcca ggccgatcgt 5040 tccgctaagt gacgaccctt ccgacttcga gcctctcagt ccaggacact tcttgacagg 5100 ttcgtccttg aaggctgttc ctgacgtaaa tgtcaccact agtccaatga atcgactgag 5160 cgattatcag caaatccaga agctacttca acatatttgg cagaggtggc acaccgagta 5220 cttatgtacg cttcagtctc gaaccaaatg gatcaaccat cccgtaaata tccaacgggg 5280 ccagctcgtc gtattgaagg aggagaacgc accaccgctg cattggccta cagcacgggt 5340 agtggatttg catccaggaa ccgatggcat cactcgtgtc gtcacaataa aaacatcaac 5400 aggagaatac aagcgaccgg tttcgaaaat ctgcattctt ccagttgcag catccgatga 5460 aaacaaccag taagccggtc cacagaatat catcagcctt ggcatctctt caatgtttcc 5520 agcaaagaac acctaaaaat aaagaaatac aaggtccttt taaccaaaag gacaaggtaa 5580 gaacctgtcc tattagttat gccgcccttg aacccgctac cgggtcttat gttttgattc 5640 cagttttcgc aattcattcg tcgagcaata tcaattgacg ttttcaccat aaaatcgaga 5700 taatcaacca tcgagcagag aataacatca tcacgttgaa catcgacgca actgtcagaa 5760 tccgatcctc gaaggggcca gga 5783 // ID DNA8-7_AP repbase; DNA; INV; 309 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-7_AP. XX NM DNA8-7_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-309 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1749-1749 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 309 BP; 100 A; 55 C; 57 G; 97 T; 0 other; cacggctata gatagtatgg ttgcccatcg acgtatgatt ctcatacttg aaaagtgtct 60 accgaaatga atctctataa tcagtattaa agttaaatta aacagctgga aacttaagca 120 ctagcgctta tatgtcatag tggctatcac ggaactacaa tttacatttg aaaggtaatt 180 gcatttagag tcactacaac aaataagcgc tagagcgtaa gtttccagct gtttaatact 240 gataatggag attcatttcg gtagacactt ttctagattc gttgggcaac catactatct 300 ttaatcgtg 309 // ID Tx1-5_CQ repbase; DNA; INV; 4748 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4748 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 637-637 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 130..1275 FT /product="Tx1-5_CQ_1p" FT /translation="MEARVVRENTVKLKFGAQNRGPTNSEMFAFFRKQNWN FT EEMLSAMYKEDYSLLVKFRTKQLMLDVLAKFGNHVEFGYEDGSNVQLAVSA FT ACGVFKYVRVFGLPPEVDDKAIADVFGKFGNIHQMVRERFPAETGFPIWNG FT VRGIHMEVTAELPAQVYVQHIRARVYYDGFQNKCFSCGSTEHLKAVCPKKQ FT TVQSRLESYAHPPLDRGAKGSGQKAGLGAGQRAGQGAGQGAGQGSGSKNPS FT WNDMIKNFPDLPTNVQTCLTPEQATLNKPGQEQRKEGEQGEGWVTIPIKEK FT LRGRGLRQRKATDADSDPEDLFKVPDAPKLARLQGTRSRSLNSRKQKENSN FT AQVPVVLIEGSPSPSNQKTDTETSDGKASEEDAAMVDLK" FT CDS 1375..4629 FT /product="Tx1-5_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MQYSYKIATVNLNSTSTDVNKGLLRDFVYNHDIDIVM FT LQEVAYDNFLFIPTHFPLVNISEDNKGTAVLIRKSFNFKDVLLDPNGRIVS FT FVLNNINYVNIYAHSGSDKRRERNDLFTNRLAVHLNKPGTRFSVLCGDFNC FT ILDGDDTRSNVKNFCAGLKNIIDLFSFKDIAKQLKKAAFTFIRGDSASRLD FT RFYGPIEFVNEVIDLQNIPVAFSDHHAVIIKVNSLKDHICAKGRGYWKINP FT SLLDSEEISKRFQDEYIKLKSRSTYNTDLNSWWHFILKPKIKQFYKTESWK FT INSATQLQKSNYLSELLKMAERQHKGENLTAEIAACKTKLMEIEYNRLNNY FT SKKLSNLNISEGEKISVFQVSKQINQGHDSQYLRLQGSDGIITEKNELDEL FT IFQHFSKQFDIDNTSTNLRNEENPLDSISKSLDDDDREMMIDPITEEEILN FT ILNACNKKKSPGIDGLTYEFYITHYDLIKKDLTKLLNGYLTATYSPPKEFT FT DGIVILIPKKGDNKQLENYRPITLLNTDHKLLTKIIAARIQEKMDKLIGPG FT QSACVKNLSCSDNLKDIRKMMTKAAETKRFKGFLLSLDLEKAFDRVNHSFL FT WDVLMKFGFPEQIINCVKKLYVHASSRVLYNGFLSKEFKIKASVRQGCPLS FT MLLFVLYIEPLIRRIYANTCGILAYDKILKVTAYADDLNVYVRNCEEFDLI FT IQIISSFSKFAKIRLNQRKSAFLRINNCISGPFLIPEADDLKILGVVFKNS FT WNSSIDQNYNKLVNDIKYRLTLNSFRKLSLVEKCWFVNSFVLSKLWYIAKI FT FPPKNIHLAKIKASVGNFIWNNQIFKVDRRQLYLSAENGGLGLQDPESKCK FT AIFIRNVLQSDCPSNDEYLLRYKYTSKLSRNSREWLETANMIKSNYNLHSC FT KLIYLYFIDEFKFKPKVEIEYPQMSWETAWLNLSSXFLQPEDRSSVYNFLN FT DLIPNRFKLFQYGIRKVTDNKCEICFEIDNNLHRLKSCPNVMEIRNWTEQI FT LRTRMNVRCNDLEDILFSHIDKNSIQQKAALWITVHFVGFVFRKFPKCSLY FT VFKKSIRELRWNKRKQFVLHFGTLLNIC" XX SQ Sequence 4748 BP; 1652 A; 753 C; 999 G; 1342 T; 2 other; cagtttactg tcggcttcca ctcgcttcag acgtgtgcag ttatacgcta ccagaagtat 60 cgagtattga gctattgtga aaataatctc ttgaccttga actgagtttg atccggaaag 120 tgtaagaaaa tggaggcgag agttgtgcgc gagaacactg tgaagctaaa atttggtgcc 180 caaaaccgtg gaccaactaa ctcggaaatg ttcgcgtttt tccggaaaca gaactggaat 240 gaggaaatgc tttcggctat gtacaaggag gactacagcc ttcttgtgaa attccggacg 300 aagcagctta tgctagacgt tcttgctaag tttgggaacc acgtggaatt cgggtatgag 360 gatggatcga atgttcagtt ggccgtctcg gccgcttgtg gagtgttcaa gtacgtgaga 420 gtatttggtc taccgccaga agtagatgac aaagctatcg cagatgtgtt cgggaaattc 480 gggaacatcc accagatggt tcgagaacgg tttccggctg agaccgggtt ccccatatgg 540 aacggtgtcc gtggtatcca tatggaggtg accgcggagc tccccgctca ggtgtacgtc 600 caacacatca gagcgcgggt gtattatgac ggcttccaga acaagtgctt ctcttgcggg 660 tcgacagagc acttgaaagc ggtttgtccg aagaagcaaa cagtgcaaag tcggttggaa 720 tcttatgctc acccaccgct ggaccgcggt gcgaaaggat cgggtcaaaa agcgggccta 780 ggagcgggcc aacgagcggg tcaaggagcg ggccaaggag cgggtcaagg aagcggatca 840 aaaaatccta gctggaacga catgatcaaa aacttcccgg atctcccgac gaacgttcaa 900 acatgtttga caccggaaca ggctacacta aacaagccgg ggcaggagca gcgcaaggaa 960 ggggaacaag gggaagggtg ggtaaccatc ccgatcaagg aaaagcttcg cggccggggg 1020 ttgcggcaaa ggaaagcgac cgacgcggat tcggatccag aggacctgtt caaggttcca 1080 gacgcgccga aacttgcgag gctgcaagga acgaggtctc gatcgctgaa cagccggaag 1140 caaaaggaga actcgaacgc tcaggtgccg gtcgttctga tcgaaggctc accttctccg 1200 agcaaccaga agacggacac ggaaacgagc gatggaaagg cgtccgaaga agacgcagca 1260 atggtggatc taaagtgagg ttaggtgagt tcacctgacc aattgttatt gagagagtgt 1320 gtgagtgtgt gagtgtgttt gtgtgaatgt gtgtgtgtag tataaaacga aaagatgcaa 1380 tattcctata aaattgccac agttaatctt aacagtacca gtacagatgt caacaagggt 1440 ttgcttaggg actttgtata caatcatgat atcgatattg tgatgctaca ggaggttgcg 1500 tatgataatt tcttgtttat accaacacac tttccgttgg tcaatattag tgaggataac 1560 aagggtaccg ctgtattaat aagaaagagt ttcaatttta aagatgtttt gctcgatcca 1620 aatgggagaa tagtatcgtt tgttttgaat aatataaact atgttaatat ttacgcgcat 1680 tcgggctctg acaaaagacg ggaacgcaat gatttgttca caaatcgttt agcagtacac 1740 ttaaacaaac ctgggacacg attttctgtt ttatgcggtg attttaattg tattttagat 1800 ggcgatgaca ctcgcagcaa cgttaaaaac ttttgtgccg gacttaaaaa cataattgac 1860 ctattttcat tcaaagacat agctaaacaa ttgaaaaaag cagcttttac attcattcgt 1920 ggcgattcgg cctctcggtt ggatagattt tatggaccga ttgagtttgt aaatgaagtg 1980 atcgatttgc agaacatccc agtggcattt tcggaccatc atgcagtcat tattaaagtt 2040 aattcattaa aagatcatat ttgtgctaag ggcagagggt actggaaaat taatcccagc 2100 ttacttgatt cggaagagat ttctaaaaga tttcaagatg aatacattaa gcttaaatcg 2160 cgatcaactt ataatacaga tttgaacagc tggtggcatt ttattttaaa acctaagatt 2220 aaacaatttt acaaaacaga aagctggaaa attaatagcg caacccaatt gcagaaatca 2280 aattatttat cagaattatt aaaaatggct gaaaggcaac acaaggggga aaatcttact 2340 gctgaaattg ctgcatgtaa aacgaagctt atggaaattg agtacaacag actgaataac 2400 tactcaaaaa aattatcaaa ccttaatatt tcagagggag aaaaaattag tgtttttcaa 2460 gtttccaaac aaattaatca aggtcatgac agtcaatatt tacgacttca aggatctgat 2520 ggaattatta cagaaaaaaa tgaattagat gaacttattt tccaacattt ttcaaaacaa 2580 tttgatattg ataacactag tacaaattta agaaatgaag agaatccttt agactctata 2640 tcaaaatcat tggatgatga tgatcgggag atgatgattg atccaataac tgaagaagaa 2700 attttaaata tacttaatgc ttgtaataag aaaaaatcac caggaattga cggccttaca 2760 tacgaatttt atataactca ttatgattta attaaaaaag atttgacaaa attgttgaat 2820 ggatatttaa ccgcaacata ttcaccccca aaagagttta cggacggtat tgtgattttg 2880 attcccaaga aaggtgacaa taaacaattg gaaaattata gacctattac tcttttaaat 2940 actgatcata aattactaac aaaaatcata gcagcgcgaa tacaagaaaa aatggataag 3000 ttaattggcc ctggtcaaag tgcgtgtgtt aaaaatttat catgttccga taatcttaaa 3060 gatataagaa agatgatgac taaagcagca gagacaaaac gatttaaagg atttttgttg 3120 agcctggacc tagaaaaggc gttcgatcgt gtaaatcata gttttttatg ggatgtcttg 3180 atgaaatttg gatttcctga gcaaattatt aattgtgtga agaaattgta cgttcatgcc 3240 agttctcgag ttttatataa tggtttttta agcaaagaat ttaaaattaa agcttcggtg 3300 cgccaaggat gcccattaag tatgttatta tttgtattat acatagaacc gttaatcaga 3360 agaatatatg caaatacttg cggtattctt gcttatgaca agatccttaa agttacagca 3420 tatgctgatg acttaaatgt ttacgtacgc aactgtgaag aatttgattt aataatacaa 3480 atcatttcat cattctcaaa gtttgctaaa attcgattga atcaaaggaa atcggcattt 3540 ttgagaataa ataattgcat ttcaggaccg tttcttattc ctgaagcaga tgatttaaaa 3600 atattaggtg tagtttttaa aaatagttgg aacagtagca ttgatcaaaa ttataataaa 3660 cttgtgaatg atataaaata tagattaacg ttaaactctt tcagaaagct gagtttagta 3720 gaaaaatgtt ggttcgtcaa ttcgtttgta ctttcaaaac tatggtatat agcaaaaatt 3780 tttcctccaa aaaatattca tttagcaaaa attaaggcaa gtgttggaaa ttttatatgg 3840 aacaatcaaa tttttaaagt agatagacgg caattatatc tttcagcaga aaatggcggc 3900 ttaggacttc aagaccctga aagtaagtgc aaggcaatat ttataagaaa tgtgttacaa 3960 tctgattgtc caagtaatga tgaatattta cttcgataca aatatacatc taaattatcg 4020 cgtaactcga gagaatggct ggaaactgca aatatgataa aaagcaacta caatcttcac 4080 tcgtgcaaac ttatctattt atattttatt gatgaattta aatttaaacc taaagttgaa 4140 atagaatatc cgcaaatgag ctgggaaact gcatggctga acctaagttc aawttttttg 4200 cagccagaag atcgatctag tgtttataat ttcttaaatg acttaattcc taatagattt 4260 aaattatttc aatacgggat aagaaaagta accgataata aatgtgaaat ctgctttgaa 4320 attgataaca atctacatcg ccttaaatcg tgtcccaacg ttatggaaat tagaaattgg 4380 actgaacaga ttttgagaac ccgaatgaat gtaaggtgta atgatttaga agacatatta 4440 ttctcccata ttgataaaaa tagtattcaa caaaaagcag cattatggat tacggtacat 4500 tttgtaggat ttgtctttag aaaatttcca aagtgtagtt tgtatgtatt taaaaaaagt 4560 attagggagc ttaggtggaa taaaagaaaa caatttgtat tacatttcgg aaccttgttg 4620 aacatctgtt gaaattctca tgtattgtaa aagcttaacg taaactggta aataaatgtt 4680 tttttgaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaacaaagw aaaaaaaaaa 4740 aaaaaaaa 4748 // ID Copia-5_DPu-I repbase; DNA; INV; 4681 BP. XX AC scaffold_118; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_DPu_; KW Copia-5_DPu-LTR; Copia-5_DPu-I. XX NM Copia-5_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4681 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 673-673 (2010). XX DR Genome; scaffold_118; Positions 82266 86946. XX CC Positions [1889-2419] - Integrase core CC 'ATTGT' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 442..1536 FT /product="Copia-5_DPu-I_1p" FT /translation="MSNQASMYNHIHMLTKNQTYLLQIRDDPDNPQRITNV FT AAIDAWILRDVTSRNYIFATLTKPMKDSLYSCETAAIMWTRLDTQYRLRAA FT ENLHLLWQSFYDFTHHPGILNIITHCLNIPLIVTFIYTDDYMTTHIRKLSS FT IADKLRELGQPLDEMQLVTKALATLPEQFRIVRSVWANVPLNERTIDNLLQ FT HLRSEENVLRSYERPDGSNQAFAAYGQTRGRGSRGNRRGGRSLHGVNRQPP FT RPDVRCGYCFIPYHETKDCRKKKRAEREEQANKDQALLSSTSVNPKSVLAF FT FADRLWSNSTHVRPEDFIRRFHTDSTWILVNCWNRRHPPPSTRKRPHQSYY FT QCQWRNVNQTHRRCPLCTRSWH" FT CDS 983..4657 FT /product="Copia-5_DPu-I_2p" FT /translation="MKERLTIFYNISVQKKMFLDPTKDLMALIKLLQHTVR FT QEDEEAVATEEAEDLCMESIDNHHDRTSDVVIASYHIMRQKIAERRRERKE FT RSKPTKTRHYCHQLLSTRRVFSPSSQTDSGATQHMSDQKTLFEDFIPIQPG FT SWSIAGIGDTHLQVLGKGHIRATINVNGETSTRLIEDVLYVPGLGTNLFSI FT GAATSSGLETRFSEDQVFFYRGNQLVLTGRRTGNTLYHLDLQPQTTRNKTT FT CHIDLAQQAGLQASLIVWHQRLGHMSHQTILKMVSQDLISGLHLTNEKIPK FT TLCTACELGKFHRQPLKSGRTRATRVGELIHSDVEGPMPSPSVGNARYYVL FT FTDDFSGWRVIYFMKCKSEVPALFRLFFASLLNETGNTVRTLRSDNGGEYT FT GTEFNKYLAEKGIRHETSAAYTPAQNGVAERGNRTLLDGARSMLLASNLPP FT TLWAEAVGYLVYIRNRVLSSTIEVTPFETWSGRKPDISNIRIFGSRAFVRC FT PNVKKLDARCLEGAFVGVSNTQKASRIYVTSPSPRIIVSYDVKVDETVMYS FT TTKKQNGPQWTEPTDRTEPIICDPTDNAVEADQKLTTTAQPSNSDPVNEAI FT QVDQVFPEEMIHLEAMPEEVPHRDEPTHDIIQEEAVQNPIILENDNNVADI FT PNTDDRNETTSIRRSSRLPHYSERYLAYRKSLGRQAVCFAMPASTEDTNAV FT PVEPSSYTEATTCPDADKWIPAIFDEYESLIQNNTWTLCPLPPDRSAIEGK FT WVFAFKPGYQQVAPRYKARFVAKGYSQVYGLDYIDTFSPVVKHYSLRTVLA FT IAAAKDLEMIQLDIKTAFLNGDLQEEIYMKQPEGFVIPGKETQVCKLLKSL FT YGLKQASRAWNQKFHAFIVKFGLTQSKADPCVYFRHQREGEVEEFTVLIIY FT IDDGIIFSSRKQTLTDILKHLETAFEIRSLPAHRFVGVDITRDRSKLMMYI FT SQPDYITKIAGRFNMTTCTPLAVPADPCCRLSPDMSPQNEEEEAEMKTIPF FT REALRSLMHIMVMTRPDIAYAVGQVAQYAQKPGKQHWRAVKRILAYLTKTK FT NFGLCFGRSSDQLIGFCDADYAGDLQTRRSTSGFLFLYLGGPVSWASRRQP FT CVALSTTEAEFVAAAEATKEAVWFQQLLSELGIDGRSTTLYCDNQSAIALV FT NNPTFHQRTKHIDVRLFYIRELQEKKTINVVYINTEQQLADILTKPLAVPR FT FEKLRDALGVVPIKI" XX SQ Sequence 4681 BP; 1472 A; 1121 C; 924 G; 1164 T; 0 other; ggttatgggc ccagtatctt gttcccttta acgaaataag atggccaacc agatgattga 60 aaatcatctt agagatgtta accatgtacc taagttcgaa ggtaccaatt tccgtgagtg 120 gaattttgaa ttgagaatga ttttccaaca acttgggcta cttggacttg tcgaaggaag 180 agaagggcac acattgcccg aagaggtaac caaagcattc tgaaattttt tacacatgtc 240 ttagtcacac tcatatacgt taagcacata tgaccctgtg atacacatgt acatttctca 300 tgtataattc aaacacatgt tcacatgtaa ctatacacat gttcacatgt atgactcgta 360 gacatggtca catgtacaac ttactctcat gttcacacac gagtgtagat tagacacatg 420 tagcactatt ttaacttaca catgtctaat caagcttcca tgtataacca tatacacatg 480 cttactaaaa accaaaccta tcttttacag ataagggatg accctgataa tccacaacgt 540 attacaaatg ttgctgcgat tgatgcatgg attctcagag atgtaacatc cagaaactac 600 atctttgcga cactaaccaa gccaatgaaa gacagtttgt actcctgtga aacagctgca 660 atcatgtgga caagattgga cacacaatac cggcttagag cagctgagaa tcttcatctg 720 ttgtggcaat cattctatga tttcactcat caccctggta tacttaacat aatcacacac 780 tgcttaaaca taccactaat cgtcacattt atttatacag atgattatat gactactcac 840 attcgaaaac tctcatcgat tgctgacaaa cttagagaac tcggacaacc tcttgatgag 900 atgcagcttg taaccaaagc tcttgccaca ctccctgaac agttcagaat tgtaagatct 960 gtctgggcaa atgttcctct gaatgaaaga acgattgaca atcttctaca acatctccgt 1020 tcagaagaaa atgttcttag atcctacgaa agacctgatg gctctaatca agcttttgca 1080 gcatacggtc agacaagagg acgaggaagc cgtggcaaca gaagaggcgg aagatctttg 1140 catggagtca atcgacaacc accacgaccg gacgtccgat gtggttattg cttcatacca 1200 tatcatgaga caaaagattg cagaaagaag aagagagcgg aaagagagga gcaagccaac 1260 aaagaccagg cattattgtc atcaacttct gtcaacccga agagtgttct cgccttcttc 1320 gcagacagac tctggagcaa ctcaacacat gtcagaccag aagactttat tcgaagattt 1380 cataccgatt caacctggat cctggtcaat tgctggaata ggagacaccc acctccaagt 1440 actaggaaaa ggccacatca gagctactat caatgtcaat ggcgaaacgt caaccagact 1500 catcgaagat gtcctctatg taccaggtct tggcactaat ctattctcaa ttggtgcagc 1560 cacaagttcc ggattagaaa caagattttc tgaagatcaa gtattcttct atcgtggaaa 1620 ccaactcgta ctaacaggaa gacgcactgg aaacaccctc tatcacctag atctacaacc 1680 ccagacaact cgcaacaaaa caacatgcca catcgactta gcccaacagg ctggtctaca 1740 agcctccctt atcgtctggc atcaacggct gggtcatatg agtcatcaga ccatactgaa 1800 gatggtttct caagatctta tatccggact tcatctaaca aatgaaaaaa ttcctaaaac 1860 actttgtact gcatgtgaat taggaaaatt tcatcgacaa ccattaaaat ctggaagaac 1920 gagagctacg cgtgttggag agttaatcca ttcggatgtt gaaggtccaa tgccatcccc 1980 cagcgttggt aacgcacgtt attacgtact atttacagat gatttttcag gctggagagt 2040 tatttacttc atgaaatgta aatctgaagt tcccgcattg tttcgactct tcttcgcttc 2100 ccttctaaat gagacgggca atactgtgcg cactctacgt tcggataatg gaggcgaata 2160 cactggaact gaattcaaca aataccttgc agagaagggc atccgccacg aaaccagtgc 2220 tgcgtataca ccagctcaaa acggtgttgc tgagagagga aacaggacgc tcctagatgg 2280 tgctcgtagc atgcttctcg ccagtaacct accacctaca ctctgggcgg aagctgttgg 2340 ttaccttgtc tacatccgca accgtgtact atccagtacc attgaggtga caccatttga 2400 aacatggagc ggaagaaaac cggatatctc caacatccgc atctttggat ctagagcatt 2460 tgtaagatgc ccaaatgtca aaaagctgga cgcaaggtgt ttagaaggag catttgtcgg 2520 agttagcaac actcaaaaag cctcccgcat ctacgtaacc tctccatcac caagaataat 2580 cgtgagctac gatgtcaaag tggatgaaac agtcatgtac tcaacgacga aaaagcaaaa 2640 tggacctcaa tggacggaac ctactgacag aacagaaccc attatttgtg atcctactga 2700 taatgcagtt gaagctgacc aaaaacttac caccacagca cagccaagca actctgatcc 2760 agttaatgaa gcaattcaag ttgatcaagt cttccctgaa gaaatgattc accttgaagc 2820 catgcctgaa gaagttccgc atcgagatga acctacccat gacatcattc aagaagaggc 2880 agtacaaaat cccatcatct tagaaaacga caacaacgtt gctgatattc cgaataccga 2940 tgaccgcaat gagacgacta gtatccgcag atcatcacgt cttccccact acagtgaacg 3000 ctacctggct tataggaaat cattaggacg tcaagctgtt tgcttcgcaa tgcctgcgag 3060 tactgaagac accaatgcag tacctgttga accatccagc tacactgaag caacaacgtg 3120 tccagatgca gataagtgga ttccggccat ctttgacgaa tacgaatccc ttattcaaaa 3180 caatacatgg actctctgcc cacttccacc tgacaggtca gcaattgaag gcaaatgggt 3240 ttttgcgttc aaacctggat accagcaagt cgctccacga tacaaagccc gttttgtagc 3300 caaaggctac tctcaagtct atggcctcga ctatatcgac actttttcgc cagtagtgaa 3360 acattattca ctacgcaccg tacttgctat tgctgcagcc aaggatctgg aaatgataca 3420 actcgacatc aagacggcgt ttctcaatgg tgaccttcaa gaggaaattt atatgaagca 3480 gccagaaggt ttcgtcatcc caggcaagga aactcaagta tgcaaattgt tgaagagctt 3540 atacggccta aagcaagcat cacgggcatg gaatcaaaaa tttcacgcat tcattgtcaa 3600 gtttggccta actcaaagta aagctgaccc gtgcgtatat ttccgacatc aacgtgaagg 3660 ggaagtagag gaattcactg tgctcatcat ctacatcgat gacggaatca ttttcagcag 3720 cagaaaacaa actcttacgg atatcctgaa acatctggaa acagcctttg agatccgctc 3780 tcttcctgcg caccggtttg tcggcgttga catcactcga gatcgctcta aactcatgat 3840 gtacatctca cagccggact acatcacgaa gatagctgga agattcaata tgaccacgtg 3900 tacacccctt gctgtcccgg ctgatccttg ctgtagacta tctcctgaca tgtcacccca 3960 gaatgaagaa gaagaagcgg aaatgaaaac tatccccttc agagaagcct taagatccct 4020 tatgcatatc atggtcatga ctagaccaga tatcgcctat gctgtaggac aagtggcaca 4080 atacgctcag aaaccaggaa aacaacactg gcgtgcagtc aaaaggattc tagcctacct 4140 caccaaaacc aaaaactttg gtttgtgctt tggaagatct agcgaccaac taatcggatt 4200 ttgtgatgca gattatgccg gagatttaca gacccgtcgc tctacatctg gtttcctgtt 4260 tctttatctt ggaggaccag tttcgtgggc cagccggcgc caaccgtgtg tagcactttc 4320 caccacagaa gccgaatttg ttgcagctgc agaagccacc aaagaagctg tatggttcca 4380 acaactactg tctgaactag gaatagacgg tcgctcaaca actctttatt gtgataatca 4440 aagtgccatt gccttagtga ataaccctac ctttcatcaa cgcacaaaac atatcgatgt 4500 acgactcttt tacatcagag agctgcaaga gaagaaaaca atcaatgttg tttacatcaa 4560 cacggagcaa caacttgctg atatactcac aaagccattg gctgtcccaa gatttgaaaa 4620 attacgagat gccctaggag ttgttccaat caaaatttag agcttaatat ttgaggggaa 4680 g 4681 // ID BEL-625_AA-I repbase; DNA; INV; 7190 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-625_AA_; KW BEL-625_AA-LTR; Pao_Bel_Ele108; BEL-625_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7190 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5103-5684] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 1156..2823 FT /product="BEL-625_AA-I_1p" FT /translation="MFRDTFRSLIHLNGQLNSMDKFTYLRTSLTGDALKEI FT NTIELSAANYEVAWKVLLERYENKKLIVKAYLDTLFSLEPLKKESYDGLNH FT LVSEFEKNLQMLDKVGEQTAQWSTILAYMLCARIDTATLRLWEAHHNSKEV FT PKYQDLVTFLRNQCAVLQSIAPSKSTSSDQRPTKFAVCHAVVKMSNKCPFC FT GEFWHSPFHCNKFQKMKVTERNEAVMRSKLCRNCLHPGHISRTCDKGVCHH FT CQQKHHTMLHFNPVRSSVPPAQTRSQEASQPQRQPQSNPQQPNQQAHTQSN FT NAPTHTANSQINRPQPSTSQASTSHNYVALPKTPTHNILLSTALVRVKDRF FT GNVLLARALLDSCSQHCLMTKEFSNKLKFRVSPTFLSVQGIGSAQSVSTKR FT VSAEVSPRSRMISPFEEEMQFFVLPKLTVSLPTSSFSPSEWSLPETALLAD FT PDFHESGPVDVIIGAEHYLDLLADGRLKATENGPTLQNTVFGWIVSGRVPD FT CPISVSRSIVNVCSTAELQDQLTRFWEIETCRSISTHSVEESACEENLRRQ FT LYATIREDS" FT CDS 2976..6044 FT /product="BEL-625_AA-I_2p" FT /translation="MGHMVEVTEDCEGVAYYMPHHAVFKPESTTTKLRVVF FT DASCKTSTGVSLNDALMVGPVVQDELININLRFRLHRYAVVADVAKMYRMI FT AVSEHDRKLQRIVWRGSSDEEIRTFELTTVTYGTASAPYLATRCLKQLAEE FT GEESHPMAAAVLKEDFYVDDMLSGVDEIDEGKQFVEEMVDLMQSGGFTLRK FT WNSNCEEILRHVPEHLHDDRTILTLDTSDSTVKTLGLQWEPRSDVYRFSTP FT KWNESDAVTKRTVLSDISRLFDPLGLIGPVIVQAKLFVQELWKRECSWDSP FT LSEDMRERWLEYRRNMIGLDGIRIPRWIGVTTSRESVELHGFCDASRKAYG FT ACIYIRTVAADGTVDVRLLTAKSRVAPLENLKKKKKSLSTPRLELSSALLL FT AHLYEKVIDSIHMTVESHFWTDSTIVKYWLSSVPSRWKEFVSNRVSEIQHI FT TVGGTWNHVPGMENPADIISRGMAPPQLQYESLWWRGPHWLLQERANWPTA FT EPDTSELHPSLLEEKGVISALLPSTEPSEIFGLRSSLTELVKLVAFLRRFC FT FNARSANRDNRKQGQLTLVERDEALRLLVRLAQQESFPQEYTELSRGRSVQ FT DSSRIASLNPEMVDGIICVGGRLRNAKVPSSRKHPYIIGHHHPFAMLVMTE FT YHRDLFHAGQQLLVSTVRGKFWPTNARNLARKVIHDCVRCFRVKPKIHEQL FT MADLPSERVTPCSPFQRVGIDLCGPFQVKYPQRSARPVKCFVVVFVCLVTK FT AVHLELVADQTSQAFLASLRRFSSRRGKPSLIMCDNGRNFVGAKRELAELR FT QLFVNQQFQNAVVQESSKDHIEFRFIPARSPNFGGLWESAVKSFKTLLKRT FT IGSRTLQYDEFQTLLTQIEAVLNSRPLTPVSNDPSDYEALTPGHFLIQRPL FT AAIPEPNLDGLPENRLSAWQTVQHFVQLLWRNWSTEYLSNLHNRTKWTKKR FT DNVKIGTMVVLREENQPPLSWLLGRITEVHTGADGNIRVVTVRTKDGSYKR FT AISKICILPISDNVQSA" XX SQ Sequence 7190 BP; 1893 A; 1732 C; 1880 G; 1670 T; 15 other; aacatttttg gtccttcgag ccggatcgag gacggcagga tcggtttgcg gatacgaagc 60 ccgcttgtgt gaatcagtga aaaaaggaaa gattgtgagc ttgctactgt gtgggcgatt 120 tgcattgcga aagtcgcaaa gaaaaagtaa acagtgtcga tcgtgaacgg cggtgagatt 180 tcgtgcgaag tgaagaagaa agtttgccta ccgcctgtga acaaaggtcg gcgtgaagat 240 ttcgtgccat tttggacgcg gtaaatcgcc atcgcaggag gaattcgagc gccatcagga 300 ggaacaaagc tgctaacgac cgggcgccat cggagttgca gttgcagtcg cagggaagta 360 cwgtgtaacg gtggcttggt gcatgtggcc aaccgaaata aggcagaata aatccaktga 420 atgcatgtga gttttttccg tttggcaagg tgcatgagtt gtgtccgctt tctgttggtc 480 tcgaagttcc gcttgggtct gttggttgcc ttttcttctg tggttgatct cccctcgctg 540 ctgtcgttgg gttgtacagt tctaaatctg tctactttaa acgcgatcgt ggaacgtcga 600 gtagatctgt cacagtgaaa gtgtaccggt gatagtgatt mgtgcttggt gattgaattt 660 gatttgacca gtgcagtgat ggccagtgat ttgcgtgctt tgttgaagcg tgaacgcttg 720 cttcagcaaa aggtgacaat cgtggagaag tttgtggccg ctttcgacaa ggatcgtgat 780 gagtgtgaag ttgaagtgcg actcaagggc ttgaacgaag tgtatttgga gtttttcgtg 840 ctacgtgaga agatagagct gctaatggaa gacgaagagg aagaagagga ggatgaagac 900 gagagcgatc tgttgaagaa gggttcttct ggcggaaaag atgcgcacac ccgtgagcag 960 gaaaacttcc gtgtagccga agaattcgag aaccgctact gtaaggtgaa atcggccctg 1020 ctmaaactgc ttccacccaa kgatccggtc aggcgcgact gtggaatacc agccgatcaa 1080 accgcagcgc attccaaggt gaagttgccc gaaattagtt tgcccacgtt tagtggcaaa 1140 ttaggagaat gggtgatgtt tcgcgacacc ttccgcagct tgattcatct gaacggacag 1200 ttgaactcga tggacaaatt cacctacctk cggacgtcgc ttacgggaga tgctctgaag 1260 gaaatcaaca ccatcgagct ttcggctgcc aattacgaag ttgcgtggaa ggttctcctg 1320 gagcgctacg agaataagaa actcatcgtc aaggcgtatt tggatactct gttttcgctg 1380 gaaccactca agaaggagtc gtatgacgga ctgaatcatc tcgttagcga gttcgagaag 1440 aatttgcaga tgctggacaa ggtaggcgag caaacagccc agtggagtac cattctcgcc 1500 tacatgcttt gcgctaggat cgatactgcg acgttgcgtt tatgggaagc tcatcataac 1560 tccaaagaag tgccgaaata tcaggatttg gtgacattcc tgcgcaacca gtgcgcagtg 1620 ttgcagtcaa tagctccgtc gaagtctacc agctccgatc aacgtccgac gaagtttgct 1680 gtgtgtcatg ctgtggtgaa gatgtccaac aaatgtccat tttgcggcga gttttggcat 1740 tctccgttcc attgcaacaa gttccagaag atgaaggtca cggaacgcaa cgaagcggtg 1800 atgagaagca agctgtgtcg aaactgcctg catcctggtc atatctcaag aacatgcgat 1860 aaaggtgtct gccaccactg tcagcagaaa caccacacca tgctccactt caacccggta 1920 agatcctccg ttccacccgc gcaaacgcga agtcaagaag ccagccaacc ccaacgtcaa 1980 ccacagtcca acccacagca accgaaccaa caagcacaca ctcaatcaaa caatgcacct 2040 acacacactg ccaactcaca aatcaatagg ccacagccta gcacaagtca agccagcaca 2100 agccacaact acgttgcact acccaagaca ccaacacaca acatcctctt gtcaacggca 2160 ctcgtacgag ttaaagaccg ttttggaaat gtcctgcttg ctcgagcttt gttggattct 2220 tgctcccaac actgtctcat gactaaggag ttctcgaaca agctcaaatt tcgagtttca 2280 cccacctttt tgtccgtgca aggtattggg tctgcacaaa gcgtatccac caagcgagtt 2340 agtgctgaag tttctccaag gtcgcgaatg atttctccgt tcgaagaaga gatgcagttt 2400 ttcgtcttgc cgaagctgac tgtgtcgttg cctacctcca gttttagccc atcggaatgg 2460 agtctgccgg aaacagcact cttggcggat ccagacttcc atgaatccgg accggtggat 2520 gtaattatcg gagcagaaca ttatttggat ttgctggcag atggacgact gaaggcgacg 2580 gagaatggac cgactttgca gaataccgtt tttggatgga tcgtctctgg ccgagtgccg 2640 gattgcccaa tcagcgtatc ccgttccata gtcaacgtct gctcaactgc ggagctccaa 2700 gaccagctga ctagattctg ggaaattgaa acttgtcgtt ccatcagtac acattccgtg 2760 gaagaatctg cttgcgagga gaacttgcga agacaactgt acgcgacgat acgggaagat 2820 tcgtagttgc tctgccgaag aaggatttcc tcatcgaaaa attgggtgaa tcaaggtcta 2880 ctgcgatcag acggttgaac agtttggagc gaagattgtc agcggactcc gaactgcagc 2940 accagtactc cgagttcatc caagaatatc tggacatggg acacatggta gaagtcaccg 3000 aggactgcga gggagttgca tactatatgc ctcatcacgc tgtcttcaag ccggagagca 3060 caacaaccaa acttcgagtg gtgtttgatg cctcctgcaa aacctcaaca ggagtttcgt 3120 tgaacgatgc gttgatggtc ggtcccgtcg tacaagacga gttgatcaat attaatctcc 3180 gattccgtct ccatcgttac gccgtcgttg ctgatgttgc aaagatgtat cgcatgattg 3240 cagtttccga acatgatcga aagctccaga ggatcgtgtg gagaggtagc agcgacgaag 3300 aaattcgaac gtttgagttg acaaccgtca catacggtac tgcgtcagct ccgtatctcg 3360 ccacgaggtg cctgaaacaa ttggcggaag aaggagagga gtcccatccc atggcagccg 3420 ccgtgctaaa agaagacttc tacgtcgacg atatgctgtc cggcgtggac gagattgatg 3480 aaggcaagca gttcgtcgaa gaaatggtgg atttgatgca atccggaggt ttcacgctga 3540 gaaagtggaa ctctaattgc gaagaaattc tgcggcacgt tccggagcat ctacatgacg 3600 atcggaccat tctcacgctg gatacaagcg attctactgt gaagacacta ggcctacagt 3660 gggaaccacg atcggatgtc tatcgcttca gcactccgaa gtggaatgaa tcagatgcag 3720 taacgaagcg gacagttctg tccgatatct cccgtttatt cgatcccttg ggattgatcg 3780 gcccagtcat cgttcaagca aagttgttcg ttcaggaact ctggaagcgg gaatgtagct 3840 gggacagtcc gttgagcgaa gatatgagag aaagatggct tgaatatcgc agaaatatga 3900 tcggcttgga cggaataagg attccacgtt ggattggcgt tacaacttct cgggagtctg 3960 tagaattgca tggtttctgc gatgcatcca ggaaagcgta cggtgcgtgc atctatattc 4020 gaaccgtcgc tgcagacggt actgtagacg tacgattgct gacagcaaag tctcgagttg 4080 ctcctctcga gaacctcaag aagaagaaga aatcgctttc cacaccacgg cttgaacttt 4140 cctcagcgtt actactcgca catctctacg agaaggtgat cgacagcatc cacatgaccg 4200 tggagtcgca tttttggacg gattccacga ttgtgaagta ctggctatcg tcagttccct 4260 ccagatggaa ggagttcgtc agcaaccgag tatcggagat ccagcacatc accgttggcg 4320 gcacctggaa tcacgtccct ggaatggaaa acccggcaga tatcatttct cggggaatgg 4380 cacccccgca gctgcagtac gagtctctct ggtggcgtgg gccgcactgg cttctccaag 4440 aacgagcaaa ctggccgaca gctgagccag atacaagcga actgcatccg tcgctgttgg 4500 aagaaaaggg agtcatttcc gcattactgc ccagtacgga acctagcgaa attttcggtt 4560 tacggtcttc gctgacagag cttgtaaagc ttgtagcttt cctacgacgg ttctgcttca 4620 atgcaaggag tgcgaatcgt gataatcgca agcaaggtca actaacgctc gtcgaacgcg 4680 acgaggctct tcgattgcta gttcgcctag cacagcaaga atcctttccg caagagtata 4740 ccgagctgtc gcgtggtcgg agtgttcaag attcctcgcg aatcgcttcg ctaaacccag 4800 aaatggtgga cggcattatt tgcgttggcg gccggttacg gaacgccaag gttccgtcga 4860 gtaggaaaca tccgtatatc atcggtcatc accatccatt cgccatgctt gtcatgaccg 4920 aatatcatcg ggatctgttc catgcgggac agcaactact agtttcgact gttcgcggca 4980 agttctggcc gacaaacgcc cgaaatctag ctcgaaaggt gatccatgat tgtgtacgct 5040 gcttccgtgt caagccgaag attcacgagc aattgatggc cgatctacca tcggagagag 5100 ttacgccttg tagtcccttc caacgcgtcg gcatcgatct ttgtggaccg tttcaagtca 5160 aataccccca gcgttctgcc cgtcccgtga agtgttttgt cgtcgttttc gtttgcctgg 5220 taacgaaggc agtccacctg gagttagtgg ctgaccaaac atcgcaggca ttcctggctt 5280 ccctcagacg gttctcgtct agacgtggaa aaccatcgct aatcatgtgc gacaacggca 5340 gaaatttcgt cggagctaaa cgcgagttgg cagaattacg gcagcttttc gtcaaccaac 5400 agttccagaa cgccgtggtc caagaatcgt cgaaagacca cattgaattc cgcttcatac 5460 cagcccgctc cccaaatttt ggaggattgt gggagagcgc tgtaaaatcg ttcaaaacgt 5520 tgctgaaacg aacgataggt tcaaggacgt tacagtacga cgagttccag acgctgctca 5580 cgcagatcga ggcagttttg aattctcgtc cgcttacacc agtcagcaac gatcccagtg 5640 attatgaggc actaacaccg ggccattttc tcatccaacg tccgttggca gcaattccgg 5700 agccgaacct tgatggtcta ccggaaaacc gtttgtctgc atggcaaacc gtgcagcact 5760 ttgtgcaact tttgtggagg aattggtcga ccgaatattt gtccaacctg cataaccgca 5820 cgaagtggac caagaagcgt gacaacgtga agataggaac gatggtggta ttgagggagg 5880 agaatcagcc accgttgagt tggcttctgg gtcggataac cgaggtacac actggagcag 5940 acggaaacat tcgggtagtg acagtccgaa cgaaggatgg cagctacaaa cgagcaatct 6000 cgaagatctg catcctwcca atcagcgaca atgtkcaatc mgcatmaggg gagaactagg 6060 actcctccac cggcggaggt cgaaagacct ccgcagacca gttaagttaa gtattgttta 6120 gaatttcaaa aagttaatcg gctcattttt cattcgtagc cacattccgt cctcgtttca 6180 agcccaagaa acgatggaag tcactattcc gatccggtct ctccaattat cgaggtaatc 6240 tgttttgttc tggtggtggt acgttctgca tgaacggtgc atgaagcagt cgtgctgaac 6300 ttagttcggc aatattattt tgttccgtga agtcagtcat ctcggcagtc gtagactgca 6360 caattcgaaa ccctgctaac cacaatcaca cgcttggcta tttgaggtac gagtgtgcga 6420 actcgcaacg gaggtagaag agcgctattc gtcctcccgg aatcaacgaa gaggtcgtca 6480 amgagtatgg ccaactcgaa caacggtctc gaatcggaaa tggttccacc agcacatttg 6540 tagttcgcta gcaagtaccg acgatcgcca ctgtgacgtg taggggtatc atcagcgcct 6600 ggaggcgccg cggaggccag gaggcctccg ttccaggcaa gtctctgttt tgtccagtaa 6660 ttcagtttaa aaagcacccg ttcatatgat tcatagatct accgcggcga ggcgagggag 6720 cccgtcttac cgacgacgaa tcaccaccga cgagaacaag tcacccagtc aagcaagtca 6780 gaagtccaac cgcatccgtt tatccgccaa gagtcgwaga gcagcagatc cgtcagccta 6840 gagagaagaa gaagtsaagt ccgcagcagc aaggatcgtc cagaatccag atcgaagtcc 6900 agcaaccatc cgaaaccgtc atcgtctctg caagcaacca aagcgtcaac agcaacagca 6960 agccaatgtt taccaccgag tatgggtcat tcggcatcga agaagcagca acacaatgcg 7020 tccagcacca gtagcagtag gagtacggtc aacccggcga ccgagagtgt aaatagtaga 7080 ttaagagtaa aaagtgaatt cgtatcaaaa caaagcagta magagtagat agaagttagt 7140 gtaggagtag gtagttgwtt gaaatcgccg agatttcaag gcggccggca 7190 // ID hATx-20_SM repbase; DNA; INV; 3856 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-20_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3856 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1855-1855 (2009). XX DR [1] (Consensus) XX CC >85% identical to hATx-2_HM. Possible horizontal transfer. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1660..3639 FT /product="hATx-20_SM_1p" FT /translation="MFQKIFKEFSTGARVDENSVWFYFLRDKENQLSKCKK FT CGKEIKSHGGSTSGLHTHLRTNHEIDLLKRKSNQDAASSSALFSSAPLSSA FT KKPKPLITDFLQNQHDKSLPAVFSRMTALDGLPFSVFATSAELRTSLGARG FT FIVPKSAATIRNMVVKYANTIRETVISNFTRLRTQGVRFSLTFDEWTSIKN FT RRYLNVNVHTEDEFWSLGLARVVGSLPAEKCIELIQKVLTQFMLNYDEDIV FT CITTDGASVMQKVGRLSDCDQQFCIAHGIQLGVLDVLYKKPSTSAKTILKG FT SSNDERSRENEDDDENLTNDDEEDFSDGFQVVMSEKDDTIQLTDELQPLIA FT KVRTVVKLFRCSPTKNDKTLEKYTLDEFGEALPLIFDSKTRWCSLHTMLAR FT FMKLKNCIRKSLIDLQSKIEISDKEFDTISSVVACLEPVKLAVEALCRNDA FT TLLSADTTLLFMVNYLGDTELAVKLKAALVRRINERRTPFSSLLHYLHKGH FT QRYENLDPALSFEHLSKSTIVNAIVQLNERLNQRADEPLPSTSVSFDSDSV FT DSTANLSLKEKLDMAILNDKKCKNKERLNVSKDLSKTIRKEIAIFEEEGTR FT GTYLQNTYEYLKTIKPTSVESERVFSASGNFVTKLRSSLEDDTLNALCFLR FT AYFTKEKNSAK" XX SQ Sequence 3856 BP; 1322 A; 637 C; 659 G; 1237 T; 1 other; tagtgcattc aataccggta taccggtatc ggtactaccg gtaataccgg acttttttct 60 ggtaccgaaa taccggtatt gagttggatc aataccggta tttccggtat tggatatata 120 aaaaatattt aaaattattt ctcaatttta cttaaaaaat attataataa taaggcattg 180 ctcttttaac caaaaataaa ttagactttc ataactaaat tgacgaatgc aggactatta 240 atggttttcc tgactgttta tgcagaacta aaaacttcaa aattcatgac cagttacaat 300 ttttcctgac aattttttgt ttcctgacta tttattattt aactagtatc taaaaagaca 360 aattctctat tatatcttca gtactcatgc aaaaacttta tgcctaatag attattaggc 420 ataaagtttt ataagtaatt atgataacgt taaaaaaaaa actttatgca ttctgaattg 480 tattttaccc gcgcaaataa accttccctt acagaaaaaa aaaactttcg agtactcgac 540 aaactaaaag cttagatttc ctaaccactg gtaaaaatat ggtggcttgc ttagcgtgtt 600 tttttttcag aaatttgaac tgatcaaata ataatataac aaggtttttt gttattaatg 660 tatttttatt tatttattta cttttcaatt taaaatcatt gcaaatattt tttgacaaca 720 caaaattatt ttaatctatc cacgaatata acagtgccaa atatgctcat tatttaactt 780 attattttaa ttgagtaaat tgttcattta ttttcaaaat aagcgctttt ttattacaat 840 agctacttgt agtgaaattt tctggttttt tttaactcat gcgttgtttt ttagtaaaaa 900 aaatattttt tcaattaatg ttgtacaaat tacaacagtt tcttgtaaca tttgacaatt 960 tattatctag atcgtcatat aattggaagt ctaaaaaaca taaaaataat caagattaaa 1020 aataagtaaa atttttattt aagaatgatt taaaacttaa ttttgatttg gatatttaaa 1080 cataaatccc attaaccaaa tatttctagt atgcgatgtg ctagatttaa ttattttatt 1140 gacccgaaat ttaagaaacc tactttcaaa ttcaataata acaataacgt taaataataa 1200 ttcaatagtt aaattgaaaa aaaatggcca catgtttcca tgtgcagtct ttccgtcaaa 1260 tcgttataat taggtattcg agttcgaaaa cttaacctgt tgtatcttgt tgttaaaagt 1320 tcttaaaatt tttatcatag cgcaagaatt cacttttatt tacaattccg attgatattt 1380 atttaaataa atacgtctac gtcagcaaat ttgcatttaa atagtaagta ttttatttct 1440 aaacatatat ttgggtactt taaatgaatt aggttcgctt tttatttgca gatttttgtt 1500 tttttttgaa gccaacaaga aatagacaaa aaaagggatt gtcagaaaat aatatcagat 1560 aattttagtt tttttttggt aaggaattga gtatcaataa tcaatatttt atctctaatg 1620 gatagtttta tattaattta atattcagaa accccaaaaa tgtttcaaaa aatattcaaa 1680 gagttcagca ccggcgcacg agtagacgaa aattctgtgt ggttttattt tctgcgtgat 1740 aaagaaaacc aactttcaaa gtgtaagaaa tgcggcaaag agataaaatc acatggtggc 1800 agtaccagtg ggctccatac acatctaaga acaaaccatg aaattgactt gctaaagaga 1860 aaatcgaacc aagatgcagc atcgtcatcg gctttgtttt catcggctcc gttgtcatcg 1920 gccaaaaaac ccaaacctct gatcaccgat ttcttacaaa atcaacatga caagtcgttg 1980 cctgcagttt tttctagaat gaccgccctt gacggactgc cctttagcgt atttgccaca 2040 tcggctgaac tgagaacatc gctaggagct cgtgggttta ttgttccgaa atcggcggcg 2100 acaattcgaa atatggtagt aaaatacgca aacacaatca gagagacggt catatccaat 2160 ttcactcgtc tccgaacgca aggggtgcga ttcagcttga cgtttgatga gtggacgtca 2220 attaaaaatc gaagatattt aaatgtaaac gttcacaccg aggacgaatt ttggagcctg 2280 ggnttagcta gagttgttgg ttcactgcct gccgaaaaat gcatcgagtt gattcaaaaa 2340 gtgttgacgc agtttatgct gaattacgac gaagacattg tgtgcattac tactgatgga 2400 gcttccgtaa tgcaaaaagt gggacggtta agcgattgtg atcaacagtt ttgcatcgcg 2460 cacggcattc aactgggcgt gctagacgtt ttgtacaaaa aaccgtcaac ttctgctaaa 2520 acaatcttga aaggaagctc gaatgacgag agaagtagag aaaacgagga tgacgacgaa 2580 aacctaacca atgatgatga agaagacttt tcggacgggt ttcaagttgt gatgagtgaa 2640 aaggacgaca ctatccaact gaccgatgag ttgcagccgt taatcgctaa ggtgcgcacc 2700 gttgtaaaat tgtttcgctg ttcaccgaca aaaaatgata aaacgctcga aaaatacaca 2760 ttagacgaat ttggtgaagc tcttccacta atattcgact caaaaactcg ttggtgtagc 2820 ttgcacacaa tgctcgcacg tttcatgaaa ttgaaaaatt gcattcgcaa gtcgctgatt 2880 gaccttcagt caaaaattga gataagtgat aaggagttcg acacaatttc atcggttgtg 2940 gcatgtctcg agccagtaaa attagccgtg gaagcattgt gtcgcaatga tgcaacactg 3000 ttgtcagccg atacaacact attattcatg gtaaattacc ttggagacac ggaattagct 3060 gttaaattga aagctgcatt ggtgcgacga ataaacgagc gacggacacc tttttctagc 3120 ttgctacact atctgcacaa gggccatcaa cgatatgaga atttggatcc agcgctgtct 3180 tttgagcatc ttagcaaatc aaccattgta aatgccatag ttcaactgaa tgagcggctc 3240 aatcagcgag cagacgagcc tctcccttcg accagcgttt catttgatag cgattctgtt 3300 gacagcactg caaacttgtc attgaaagaa aagcttgaca tggcaatttt gaatgataaa 3360 aagtgtaaaa ataaggaaag attgaacgtt tcgaaagatt tatcaaaaac aatccgcaag 3420 gaaattgcaa ttttcgaaga agaaggaact cgtggaacat atttgcaaaa cacctacgaa 3480 tacttgaaaa ctataaagcc aacaagtgtc gaatcggagc gtgtattttc agcaagtggc 3540 aactttgtta ccaaattacg atcatctctt gaggatgaca ctctgaatgc cctttgtttc 3600 ttacgggcgt attttacaaa ggagaaaaac agcgcaaaat gaaaaaaaat tgttattttg 3660 atacaaaata aaaaagttcg ttgaatttgt aaatttttac tatttatttg atattttaat 3720 caaattttaa ttttttttct agcttcaata ccggtattat accggtatta ccggtattga 3780 aatatttcga taccgcaata ccggtattga aaaataatac cggtattaaa aaataatacc 3840 ggtattgaat gcacta 3856 // ID Gypsy-189_AA-LTR repbase; DNA; INV; 1052 BP. XX AC supercont1.90; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-189_AA_; KW Gypsy-189_AA-I; Gypsy-189_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1052 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.90; Positions 167460 168511. XX SQ Sequence 1052 BP; 336 A; 262 C; 201 G; 253 T; 0 other; tgttaccgct gtgatataaa attctttatt tgcctaggct aaaattaccc tttatatgtt 60 tattagcgta ttgttaaatt aatacataaa ctatactcaa atatccctaa tgttaaaaat 120 tcatcatata aaattgttaa actagttcaa aaagcatgca cactcaaaaa tgttatatat 180 tactaagtta ggtaacacaa tttgttaaaa cacatctaaa caacatccaa cccctcgtac 240 gcactgctga atagcaaaag gtgatgccac atgcgagaaa aaggaccaag caaggaaaaa 300 aagggaatat gcgaaaagga aaggcaacta gggtcagaga cagctatcga tttgttctat 360 tccatttcac atctcagcca agacggtcta ttgtaacgaa agtggtggaa atataaatat 420 agtgagactc gaagttaatt gtctccgaca gacacaggag tctattgccg gatatattcg 480 gatcaggacc caaaggattc tgggaaagta accttgagac cttcctaagc catacggtac 540 ccagaaagcg atcagtatcc cgaaacgcag ccccggattc ccatcgacct tgaccacacg 600 gtccaagagt gctggcgagt tcagacccgg aagaggccta gcatatccgt tgggaagcct 660 gcaatcgcca gatagagacc ccagatagag cccttccatg tgttcggacc tccgcttcaa 720 aggcctatca caagcgctcc gaataccagg aacgaagcgt cccgttaatc agcgaagccc 780 tagaagtttg tcgccaaccc acgtggtata ccttcaccag cagacccaat ccgccttgcg 840 tagccggccg gcccttccct cctccttata cacccacaca gctgtaagtc ccccaataaa 900 tcgtttaaat gttaaatagt ggttttatag tttttccact tgaaggacag aacccttgaa 960 agccatttaa gccgacccta ggatatttaa gacccatgct ctagctcgcc cacattccga 1020 gattcaacgg gtcgatctga ccccaaataa ca 1052 // ID Gypsy-33_AA-LTR repbase; DNA; INV; 1045 BP. XX AC supercont1.283; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_AA_; KW Gypsy-33_AA-I; Gypsy-33_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1045 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.283; Positions 19882 20926. XX SQ Sequence 1045 BP; 295 A; 236 C; 204 G; 310 T; 0 other; tgtaaccttc aaaaggttac attatcttta tatatatttt tgtacatata ttttcctttt 60 ttttattatt actatttttt ttactatgtt tattattatg atctcacggt tttgttaata 120 gaaggtattt aaatgaatta atgaattaaa atgaaataga actaaaatga aattaaaaaa 180 aaaaaaatat cgaattttcc attcctcctg aacatcccta ttatccaata ccctatccaa 240 ccgtaacgca cccacacaac gacaccaacg atcggtgtcg gcaatacgtt cgtcgtctga 300 cacacgaaga tgacgacgac gaatcaacaa aactggttgt catccagatt tcagcagcta 360 ttaggtgaaa caccatcatc accgcaactg tgcgcgaatt ttaatccgat tttaatttct 420 ttctcagtga acgccttgtg ggactttgct agtccggaat agtttgaagt tcgtgtgtga 480 agtctagtcg gtgtgccaga agtcgattgg cccggctgtg acgagtgaaa gacattccgt 540 ggacagtgag aaggtgaagt tgtcggtgtg ccaaaccgtc cgatcgaaat ccagtgaagt 600 gacaactaga cggagaagcc agcatccggc cccggcgcaa cagcggattg gtgtgaagcc 660 ctggtcgatc cagtcccgaa gacgatcaac gccccgagac gaaaggctgg cctgtccggt 720 tcttagactg gtccagacgg atggcgcaac actcgttctg agtatcccgg taaattttat 780 tccttttcat tcctccccct caaacgatct cgaataatga atttgaatat atagagacaa 840 gatgtaattt aaatgccctt tattgttgcc tttcatttta taaccgctga gccttgtttt 900 ctaccatctt ttgtgcgtag tccacctcgc tcttttacgg gaaggttttc cgccatcacc 960 caatttcctc accggagtca ttttgtacga ccctgagtct ctaagaaggc cacattaggc 1020 tggcccaaaa aaagtaattc taaca 1045 // ID BEL-225_AA-I repbase; DNA; INV; 6043 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-225_AA_; KW BEL-225_AA-LTR; BEL-225_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6043 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 907-907 (2011). XX DR [1] (Consensus) XX CC Positions [5105-5656] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS join(101..1042,1046..3208) FT /product="BEL-225_AA-I_1p" FT /translation="MVKAGNFTLTPCGVCSETSETNEGMVGCDACDLWFHF FT RCVEVTEESLDGLEKWFCPSELCQKVAEGMLRKPDDRKKKGKSKGAKTLAT FT EESDKSSVKSDNQSGSSLEKRLQALEREQKAKEQEMEIERILREKRIEMDR FT VLKEKQLKIENELREKEMQVEKEMLEKALRDEKAHADRMQRMRESYQNAIM FT GVKNMAMKSRNSVDPDTKFEKGIVEETIPREGEGSSWHHKKMLDTPLRNPN FT VSQRTIPEKQRKSVDKYAQKTEFENSSEDDNDSVNETSDEESEEDCSNSEI FT SSEEREEASECEIEVPKSLGYQKKKKPVSHGLGRQLAGPSKAQLAARNGLS FT KKLPIFTGKPEEWPLFIGSYEASNTACGFNDVENLVRLQESLKGNALESVR FT GQLLLPKSVPKVINKLRQLYGRPEQLLYCHLERVKRLDPPKADKLESYVPF FT GNVVEQMCDHLEAAGLKEHLINPILIQDLVDKLPAGDKREWVRFRSKKKTV FT TLRTFSKFLSKIVAEACAANVNLENQQEFKTKPAPAGKGRGMDKGAVYMHG FT ANEAFAAGEEQKLKPCRACKRTDHRLRFCQDFKAMNFVDRMKIVEKGKLCK FT ICLNDHGNAPCKFKIRCNVEDCRDRDRHHSLLHPVDNLVVINTHIRSSSSI FT MFRMIPVTLHYGEKQVTTLAFLDEGSSITLVERSLTDRLSVEGVKWPLTIK FT WTADITREEPNSKMMNLWISALGNEERLPLQAAQTVEKLLLPKQSLNAGTL FT AAEYSHLRDLPLTSYSDQHPGLLIGLNNLHTIAPIEAKVRKIGEPIAVRSK FT LGWSVYGPTSRKTNDTVIVGHHDAISNNDLYNLLKEHYALEESVVKVVQET FT KDEKRAREILERTTIRVGSHFETGLLWKTDDWIFPDSLPMARRRLKQLEQR FT LERYPELYEKIRQQIEDYQLKGYAHLATEKELAETVPHKTWYLPLNYVQNP FT KKPSKLRLVWDAAATVRGVSLNSQLLKGPDMLTPLTSVLSLFRERRVAFGG FT DIREMYHQLRIREADKHAQRFYSGRTPET" XX SQ Sequence 6043 BP; 1763 A; 1372 C; 1559 G; 1325 T; 24 other; gttttagctg ctacaatagg ctgctcaaag aaagtggtgt atttctaagg taaatccgaa 60 caatctaaaa aatagataat cggagctaac ctcaaacacg atggtgaagg caggaaactt 120 cacattaacc ccctgtggtg tttgctcgga gacgtcggaa accaatgagg gcatggttgg 180 atgtgacgca tgtgatcttt ggttccactt tcgatgcgtg gaagtgacgg aagaatctct 240 tgacggcctg gagaaatggt tctgcccatc tgagctttgc caaaaggttg cagagggaat 300 gctcagaaaa ccagatgacc gtaagaaaaa aggcaagtcg aaaggagcaa agacgctggc 360 taccgaagaa tcggataagt ctagcgtaaa atccgataac cagtcaggct cgtcattgga 420 gaaaagactc caagcgttag agagagagca aaaggccaag gaacaagaga tggagattga 480 gcgaatctta agggaaaagc ggatcgaaat ggatcgagtc ctgaaagaga agcaactgaa 540 aattgagaac gagctaagag agaaggaaat gcaggttgaa aaggagatgc tggagaaagc 600 gctccgtgat gagaaagctc acgccgatcg tatgcaacgg atgcgagagt cataccaaaa 660 tgcgatcatg ggcgtgaaaa atatggcgat gaagtcacgc aactccgtcg accctgacac 720 caagtttgag aagggaattg tggaagaaac catcccacgc gaaggcgaag gttcttcatg 780 gcaccacaag aaaatgttgg atacgccgtt gcgtaatcca aacgtttcgc aacgaacgat 840 tccggaaaag caacggaagt ctgttgataa atacgcacag aaaacagagt tcgagaacag 900 ttcagaggac gataacgatt cggttaacga aaccagtgac gaagaatctg aggaagactg 960 cagcaactcg gagatatcat cagaagaacg ggaagaggca tcggagtgcg agatagaggt 1020 cccaaagtca ctcggatacc aamataaaaa gaagaagcca gtatcacacg ggctggggcg 1080 gcaactggca ggaccatcaa aggcgcagtt agccgcgcgt aatggactct ccaagaaact 1140 ccctattttc actggaaagc ccgaggaatg gccgcttttt attggcagct acgaagcttc 1200 taatacagcc tgtgggttca acgatgtaga aaacctggtt cgtttgcaag aaagtttaaa 1260 aggtaatgct ctagagagtg ttcggggaca attattatta cccaagtccg tgccgaaagt 1320 cataaacaaa ttacgtcaac tttacggtcg tccggagcag ttattatact gccatctaga 1380 aagggtaaag cgcctagatc ccccaaaggc agacaaacta gagtcttatg taccttttgg 1440 aaacgttgtg gaacaaatgt gtgaccactt ggaggctgcc ggattgaaag aacatttaat 1500 caatccgatc ctcatacagg atctggtaga caaacttccg gccggtgata agcgagagtg 1560 ggttcgattt cgcagcaaga agaaaacagt cactttgaga accttttcta aatttctctc 1620 caaaatcgtt gcggaggcat gtgcagcaaa tgtaaatctc gaaaaccaac aagaattcaa 1680 aactaaacct gcgcccgctg gaaaaggaag aggtatggat aaaggtgcag tttacatgca 1740 cggtgcgaat gaagctttcg cggccggcga agagcagaaa ttaaaaccat gcagagcctg 1800 taagagaaca gaccatcgtc ttcgtttttg ccaggatttt aaagcaatga atttcgtcga 1860 ccgcatgaaa attgtagaga aaggcaaact ctgcaaaatt tgcctgaacg accatggtaa 1920 tgctccatgt aagtttaaaa ttcgatgcaa tgttgaagat tgtcgagatc gggataggca 1980 tcattcgttg ctccatccag ttgataattt ggttgtaatc aacacgcata tccgctcatc 2040 tagttctatt atgttccgca tgattcccgt aacattgcac tacggagaga agcaagtcac 2100 gacgctagca ttccttgatg agggttcgtc gatcaccttg gtcgaacgct cgctcacgga 2160 tcgattaagc gtcgaagggg ttaaatggcc actaactatc aaatggactg ctgatattac 2220 cagagaagaa cctaattcca aaatgatgaa tttatggatc tccgctctgg gtaatgaaga 2280 gaggttgccg ctacaggcag cgcaaacggt ggaaaagctg cttctcccga aacaatcgct 2340 caacgccggt acacttgctg ctgagtatag ccatctacga gatctgcctt taacttcgta 2400 ttcggaccaa catcctggcc tgcttattgg tttgaataat ctgcacacca tcgcaccaat 2460 tgaagccaag gtccggaaaa ttggagaacc gatagctgta cggtcgaaac taggttggtc 2520 agtttacggt cctacctcac gaaaaaccaa tgacactgtc attgttggtc atcacgacgc 2580 cattagcaat aacgacctgt acaatttgct gaaagagcat tacgccttag aagagtccgt 2640 tgtaaaagta gttcaagaaa cgaaggatga aaaaagggca agagaaatct tagaacgaac 2700 taccatccgt gtcggcagcc acttcgaaac cggcttgttg tggaaaacgg atgattggat 2760 ttttccagac agcctcccga tggcccgtcg aaggttgaag cagttagaac agagactgga 2820 aaggtatccc gaactgtacg agaagattag acaacaaatc gaagattatc agcttaaagg 2880 ctatgctcat ttggcaacgg agaaggagct agcggaaacc gttccgcata agacatggta 2940 tcttccgctg aattacgttc agaacccgaa gaagccctct aaacttcgtc tggtatggga 3000 cgctgcagct acggtaagag gtgtgtcact caacagccag ttgctcaaag gccctgacat 3060 gctcacgccg ttgacatccg tgcttagtct wtttcgtgag cgccgtgtcg cctttggagg 3120 ggacatacgc gagatgtatc accaactacg cattmgagag gcggacaagc acgcacaacg 3180 gttctattca ggaagaaccc cggagacgtg agccgagcgt gtacgtaatg gacgtagcca 3240 catttggatc cgccagctca ccgtgttcag cgcaatacgt gaaaaaccgc aacgcatcgg 3300 agttttcagc gacgtatcca gaggcggcgg ccgctatcgt tsasaagcat tatgtcgacg 3360 actattacga tagcgtcgac accgtcgagg aagctattcg tcgtgccaaa gaagtacgtt 3420 tcatacactc caaggcaggk tttgagataa gaaactgggt ttcgaactct gctgaagtgc 3480 ttcacagcct gggagaatca aatccgacta aagacgtcca cttcaacgtc gataagacaa 3540 cggaaaatga acgagttctt ggaataatct ggagcccaaa ccaagactca ttttctttcg 3600 ccaccgatca tcggccggat ttgcagcctt tcgttgacgg tattcgacga ccaacgaaac 3660 ggttggtgtt aagctgcgtm atggggtttt tcgacccact tggtctcctg gcgccgttca 3720 cgatccacgg caaaacactc gtccaggatc tgtggcgaac gggatgcggc tgggacgaag 3780 aaatcgacga cgawgcmctg cagaaatgga awcgctggac aagtttgttg aagcaggtgg 3840 cagacatccg aatccctcgc tgttacwtcg gcgatkccca ctcgwctgaa atcgattctc 3900 ttcagctgca catcttcatg gatgcgagcg agcatgctta cggctgcgtc gcttacttcm 3960 gggcagtggt ggatggccag gtgaggacwt ctctagtgac ttcccgcacg aaggtggcgc 4020 cgctgaaacg gcaatcgatt cctcgtctta gagttgatgg ctgcggtact agggtccgwt 4080 tgctacgcac ggttcaacct cagcactccc ttccaattaa taaatatttc ctctggtcag 4140 actcccagac ggttctcagc tggatccgct ccgatcagct gaagtataaa caatttgtgg 4200 cattccgcat cggtgaaatt cgagaacaca caaacatctc tgactggcgt tggattccaa 4260 cgaaaacgaa catcgctgac gttttaacca aatggggtca agggcctccc tagagtctag 4320 cagcgmgtgg ttcagcggac cacaatttct ttcggacccc gaagatcaat ggccagtagg 4380 acctctgccg gccgttgata caagcgaaga gatgcgtgct tgcgttctgc atcatgaagc 4440 aatacagtcc gwgccagtaa ttaatgttga ctcgacgaat cgcttaggat gtctgctccg 4500 aagtactgct aatgtgatcc ggttcattgc caattgtcgc cgcaaggtcg caggcaatcc 4560 aatagtagtg tctagggcta ctgctggtca aatacgtcta ctgaaagcag aaacaatatc 4620 aatccagcaa cccctgcagc aggaagaact acgtgcagcc gaattagtcc tgtggaggca 4680 agctcaacgt gagggattcc cggaagaagt cagaatattg gagaaaaacc wgaggcgtga 4740 tgcatctacg gagcgaatca aaaagtcaag caccctgtac aagatgaccc ccgttatgga 4800 caccgaagga gtaatgcgcg taggaggaag gttacagcaa gccgagttcg ctmckttcga 4860 tatgaagcat ccmattattc tacccaaaga acatgcgata accaaaatgt tgattctgca 4920 gtaccacgaa aaatttgctc atgcgaatag agaaacagtt tgtaacgaac ttcgccagcg 4980 attccacatc cccaaattac gacaagcgat tcgacaagct gtgaaggatt gtatgtggtg 5040 tcgagtcaat cgctgcctgc cacaaactcc gatgatggcg cctctaccgg tacaacgagt 5100 aactccacag cttcgcccgt tcagctctgt aggcgtagac tacttgggtc ctgtcgaagt 5160 gttggttggc agaaagaagg agaaaagatg ggtagcactc tttacctgcc tagcagtacg 5220 agcaatacac ctagaagttg tgcacggctt gaccactcag gcatgtttaa tggcaattcg 5280 acggttcatg tgcaaacgcg gagcacccga agagtttttt tccgacaatg ggactaattt 5340 caagggtgca tgcggcgagt tggctaggct gaaacagctc aaccaagaat gtgccgagag 5400 cgtgacgggt accactctaa agtggacttt cataccccca ggtacaccgc acatgggcgg 5460 catctgggaa cgtatggttc gagcggtaaa ggaagcactg aaagcactta atgacggccg 5520 aaaacttacg gatgaaattc tgctcactac gttatcagaa gcagaggatg ctatcaacac 5580 ccgcccgttg gtctacctcc caggattcag cggaaacgga ggcaattaca ccaaaccatt 5640 ttctccgtgg gacggttagg aatgctgatt taacggtcga cgatactgca gacttcgcgg 5700 aggctttgag ggaccctaca aacggtcgca atacctagcg atcaaatgtg gcagcgttgg 5760 tgcaaggagt atctgccgac cataaacatg gatccaaatg ggtcgaagat cgaggcmagt 5820 actgtaggtg atctggtttc atcgttaacg atggacagcg gaagamctgg actcgaggca 5880 tcgtcgaaga ggtgttcgaa ggcaacgatg gtaggattcg ccaggtgaat gttcggacga 5940 caaaaggtgt atctagaagg gctgttgcca acctagcgtt atagaagttc ctggtaaatc 6000 cggaatctcc gaagggaacc ggaaccggag ttacgggctg ggg 6043 // ID BEL1-LTR_DV repbase; DNA; INV; 426 BP. XX AC scaffold_13049; XX DT 15-OCT-2009 (Rel. 14.12, Created) DT 15-OCT-2009 (Rel. 14.12, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_DV; KW BEL1-I_DV; BEL1-LTR_DV. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-426 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(12), 3093-3093 (2009). XX DR Genome; scaffold_13049; Positions 223347 222922. XX SQ Sequence 426 BP; 124 A; 96 C; 78 G; 128 T; 0 other; tgtttggtca agtaacagca gagataacac cgacggctgc ggacatcatc caacagcaga 60 aaataacaat cataattaca gcctaagagc aaatagtttt tttgcgcgct ctttgtaagc 120 tgctgtccac tctccttctg aattgctttg cttgcattgc tgcccttcgc tctaaccggc 180 gagcgtctta ttggagtttc gtttcgattc gctttacata ttgtcataac cagtcagtct 240 ttcgttttga tcgcaaccta ggcacaacta attcgagttg gctgccaagc aacaatttaa 300 aacttataaa ttctaataag tgccctgcgc ggcaaataaa tccatttgaa aactaatcta 360 agtgtttgaa tttcgctgca caagcagtta gaattaattg gttgcaaaaa agcttgcaac 420 tacaca 426 // ID Copia-3_SI-I repbase; DNA; INV; 3943 BP. XX AC AEAQ01006153; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_SI_; KW Copia-3_SI-LTR; Copia-3_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-3943 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01006153; Positions 382 4324. XX CC Positions [1393-1920] - Integrase core CC 'ATTTC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 40..3921 FT /product="Copia-3_SI-I_1p" FT /translation="MMADNTRVTITKLNNDNYQVWKYKVELLLIKEDLWNV FT VSQEPPAEVDAAWTRKDGQARATIGLLVEDNQFVHIRDKTTARDTWNALKT FT YHQKATLTSKVYLLKRICSLKVTDDGNMEDHINNMLDLVNKLAALGEQLMD FT HLIVAMMLSSLPDSYNTLITALESRAEEDLTLDLVKGKLIDEYKRRKATGT FT LGESNESALKTSKESKEKGRGNQDCFFCKKYGHQKKDCYKYKKWKQNKEKA FT NQAAENSCKDGSKSGEVCFHVNTKGKTLNVWFIDSGATSHMTNNKDFFEEL FT DPTKKIEIRLADGESRKACGIGMGKLKCLNDKNQIIEVKVTDVLYVPSLEG FT NLLSVKRLTERGLQVKFKSDKCYITKNGKIVAIADESSNLYKLRVSQQACV FT VIQKHNKHCQHMWHKRFGHRDPEAVKQLANNALAMELKITDCGIRTTCKTC FT IKGKITKKSFPKKSENRTSSTLDLIHTDLCGPMQTQTPGKKRYILTIIDDY FT SRYTEVFLLQSKDETASYIKDYIQKVKTQFNRKPKIIRSDRGKEYINAELK FT TFLRQEGIQTQYTAAYSPQQNGVAERKNRSLMEMARCMIIDAEMPNKYWGE FT AVVTANYLQNRLPTKATEKTPHELWFTKKPNVNQLRIFGCTAFAHIPKEQR FT RKLDVKAKELRFVGYAEDSKAFRLLDMTTDRIIISRDVIFIEDVEDGFEIT FT ENENEIQIGLHKAPSEEIKEESTIEEPTNNEGFIEKPEEKFQDKNADALRR FT SDRKNKGVPPHRYEETASIAMELQDPKTIQEAMSRTDRDKWKAAMDDEMNS FT LNMNNTWELTELPNDRRPIGCKWIFKIKQDAVGNPSRYKARLVAQGFSQKY FT GTDYDEVFAPVVRPITFRTLLVISGRENFIVKHIDAKTAFLNGELKEVIYM FT KQPIGYEVPNKEHMVCKLNKSLYGLKQAARVWNEKIHKVLEEHEFRQSKTD FT PCLYMKVIDNVWIFIIIYVDDIIIGGKEKKLVNDAIDMLRRRFDIVNLGNL FT SNYLGMSIERDDKGIFYLSQPKYIKKIIDSVGLQEAKISSYPLDPGYENIE FT NSEDVMENSAKYQKLIGELLYVAVHSRPDIAAAVSILSQRIKCAKYSDWTE FT AKRTVRYLNGTIKWKLKLGGDTKEDEESPLIGYADANWAQDKADRKSNSGF FT IFMLNGGTISWACRKQPCVALSTTEAEYIALAEACQEGIWIGRLLEEFGIT FT SKTPLKIYEDNQSCLKLLCAKGFNNRTKHIDTKYHFVKQLKEDNVMDFVYC FT QTSDMIADMLTKPLHGIRLRRLAELSGLKNWSG" XX SQ Sequence 3943 BP; 1489 A; 635 C; 903 G; 916 T; 0 other; ggttatgggc ccagatacta gcgcatcaaa attgaggtta tgatggcaga caatacacgt 60 gtaacgatta caaagctcaa caacgacaac taccaagttt ggaagtacaa agtcgagctt 120 ctgttaatta aagaagattt atggaacgta gtgagtcagg aacctcctgc agaggtagat 180 gcggcttgga cacggaagga tggacaggct cgcgcaacga tcggattgct cgttgaagat 240 aatcaattcg tgcatattcg ggataaaact actgcacgtg atacatggaa tgcgcttaag 300 acttatcatc agaaagctac attgacaagt aaggtatatc ttctaaagag aatttgcagc 360 ttaaaggtaa ctgatgatgg aaatatggag gatcacatta ataatatgct tgatcttgtg 420 aataaattag cggcattagg agagcaatta atggatcatt taatcgttgc aatgatgtta 480 agtagtttgc cagattcgta caacacgctc attacggcac ttgaaagcag agcagaagaa 540 gacttgacgc tggatctcgt aaagggcaag ttaatcgatg aatacaaaag gcgaaaagcg 600 acaggaactt taggtgaatc aaacgaatct gcactaaaga catcaaagga aagcaaagaa 660 aaaggtcgag gaaatcaaga ttgttttttc tgcaagaagt atggacatca aaaaaaggac 720 tgttacaagt acaagaaatg gaaacaaaat aaagaaaagg caaatcaagc agcggagaat 780 tcttgtaaag acggtagcaa atcaggtgag gtttgttttc atgtcaatac aaaaggaaag 840 acattaaacg tgtggtttat cgattctggg gctacgagcc atatgacaaa taataaagat 900 tttttcgaag agctcgatcc gactaagaaa attgaaatcc gattagcaga cggtgagtcg 960 cgaaaggcat gcggaatcgg tatgggtaag ttaaaatgtc ttaatgataa gaatcaaatt 1020 attgaagtga aggttactga cgttttatat gtacctagtc tagaaggcaa tttgctatcc 1080 gtaaagagac taacagaaag aggtttacag gttaaattta agagcgacaa atgttatata 1140 actaaaaatg gaaagatcgt cgccattgca gatgaatcat ctaacttgta taaactacga 1200 gtatcacaac aggcttgtgt agtaattcaa aagcacaata aacattgtca gcacatgtgg 1260 cataaacgtt tcggacatcg agaccctgag gcagtcaagc agctagcaaa taacgcactt 1320 gcaatggaac tgaagatcac tgactgtggg ataaggacaa cttgtaaaac atgcattaaa 1380 ggaaaaataa caaagaaatc ttttccaaag aaatcagaaa acagaacttc atcaacattg 1440 gatcttattc acacggattt atgcgggcca atgcaaactc agacgcctgg aaagaagcga 1500 tacatcctta ccattattga tgattatagt cggtatacgg aggtatttct attgcaaagt 1560 aaggacgaaa cagccagtta cattaaggat tacattcaaa aggtaaaaac gcagtttaac 1620 agaaagccaa aaatcataag atcagaccga ggaaaggaat atattaatgc ggaattaaag 1680 acatttttaa gacaagaagg cattcaaacg caatatacag cggcttattc accccagcaa 1740 aatggcgtcg ctgagagaaa aaatcgctcg ctgatggaaa tggcgcgatg catgataata 1800 gacgctgaaa tgccaaataa atattgggga gaggcagttg tcacggcaaa ttatttacag 1860 aatagattgc caacgaaggc tacagaaaag acacctcatg agttgtggtt cactaagaaa 1920 ccaaatgtta atcaactcag aatattcgga tgtaccgcgt tcgcacatat tccaaaagag 1980 caacgaagaa aactggatgt gaaggcaaaa gaactgagat tcgttggata cgcagaagac 2040 tcaaaagctt ttcgtctact cgacatgact acggatagga tcatcataag cagagacgta 2100 atattcatag aagatgttga agatggtttc gaaataactg aaaatgagaa tgaaattcaa 2160 atcggtttac acaaagcacc atcggaagag attaaggagg agtcaacaat agaagagccg 2220 acgaataatg aaggattcat tgaaaagcct gaagagaagt ttcaagataa aaatgcagat 2280 gcactacgaa ggtcagacag aaaaaataaa ggagttccac cacacaggta tgaagaaact 2340 gcaagtatag caatggaact tcaagatcca aaaaccattc aagaagcgat gtcaaggaca 2400 gacagagaca aatggaaggc tgcaatggac gatgaaatga attcccttaa catgaataat 2460 acgtgggaat taacggaact accaaacgac agaagaccaa taggatgtaa gtggatcttc 2520 aaaatcaagc aagatgcagt tggaaatccg agcagatata aagcaagatt agttgctcaa 2580 gggttttcac agaaatacgg aacggactat gatgaggttt tcgcacccgt ggtaagacca 2640 atcacattcc ggacactact agtcatatct ggaagagaga actttattgt caagcatatc 2700 gatgcaaaaa ctgcattttt aaacggagaa ctaaaggagg taatatatat gaagcagccg 2760 ataggctacg aagtaccaaa taaagaacat atggtatgca aactaaacaa gagtttgtac 2820 ggactaaagc aggcagcaag ggtatggaat gaaaagatac ataaggtact tgaagaacat 2880 gagttcagac aaagtaaaac tgacccatgt ttgtatatga aagtgattga taatgtatgg 2940 atttttataa ttatttacgt agatgatatc attattggag gtaaggaaaa gaaacttgta 3000 aatgacgcca tagacatgtt gaggaggagg tttgatattg tcaatctcgg aaaccttagt 3060 aattacttgg gaatgtcaat agaacgagat gataaaggaa tattttatct aagccaacca 3120 aaatatatta agaagataat tgattcagtc gggcttcaag aagcgaagat ttccagttac 3180 ccattggatc cagggtacga aaatatagag aactcagaag atgtgatgga gaacagtgca 3240 aagtatcaaa agctgattgg agaactactc tatgttgctg tacattcacg accagacata 3300 gcagccgccg tttcaatttt aagccaaaga ataaaatgtg cgaagtactc agactggaca 3360 gaagctaaga ggacagtaag atatttaaat ggcactatta aatggaaact caaactcgga 3420 ggcgacacta aggaagatga agaatcacca ctgattggat atgcggatgc aaactgggca 3480 caagacaagg cagatcgtaa atctaatagc ggatttatct tcatgttaaa tggaggcaca 3540 ataagttggg cgtgtcgcaa acaaccgtgc gtggcactct ctacaacaga agcagaatat 3600 atagcgcttg cagaggcttg tcaagaagga atctggattg gaaggttgct ggaagagttc 3660 ggaataacat ctaaaacacc tttgaagatt tacgaggata atcagagttg cctgaagcta 3720 ttgtgcgcaa aaggtttcaa caacaggaca aaacacatcg atacgaagta tcattttgta 3780 aaacaattga aggaagataa cgtaatggat ttcgtgtact gtcaaacatc agatatgata 3840 gctgatatgc tgacgaaacc attacatgga atcagattga ggaggcttgc agagcttagc 3900 ggacttaaga actggagcgg gtgatgtcat cggtgaggag gag 3943 // ID Gypsy-605_AA-LTR repbase; DNA; INV; 462 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-605_AA_; KW Ty3_gypsy_Ele44; Gypsy-605_AA-I; Gypsy-605_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-462 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 462 BP; 157 A; 101 C; 95 G; 108 T; 1 other; tgtagaaaga gaggaataaa tataaattat agttgacact aattacacac cttcattaag 60 aaaaacacga aactgaaaat atttcagaat gaaagaaatt gccgatacat taagtgataa 120 tctattccta agttgattgc atcatctttc tccaccaacg ttgtgaaaat tcccacatgt 180 ttgcttgtct gccattcgtt aaattcattg cccaatagca tgcttgttat gcaaaaggtc 240 tgccatttgt cgaactcagc aaccaatagg aagcgaaaac ggattccacg cgaatataaa 300 tamgggcgaa ctttgagcag aggctcattc ttgaaacgga tttctcgagc gacacatcgc 360 cttccgctag cagcagccgc agcaccgact acccgcagca gacactatcc gagcagcagc 420 cgatgagaga aatggagaac gtggagaggg aagacgacga ca 462 // ID TTAA14_AP repbase; DNA; INV; 552 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 0) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA14_AP. XX NM TTAA14_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-552 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2080-2080 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 552 BP; 176 A; 87 C; 90 G; 199 T; 0 other; ggggcctcgc tacggtcatt atttaagggg caacatttag gcaatttcta atcttgtcat 60 agccgcccgg gatctatgtg ttagttgtaa atgattatta gtgtgagtga aattaaatgc 120 gggtatcctc tctcgtacgc gcatgcacat gtcaataact cgataaccat atcaatttca 180 ctacacgaat tttacaaaaa catacatttg ttttgacaaa attaaagtta gttaacgttg 240 tctgattttg attctgaaaa aatatttgaa ttcagcagaa aaatgtctgt agatcgtacg 300 aaacattttc cgtttttaat attaatttta attttaatta tgattcgaaa aagttcatat 360 tttcaagttt tcaattacag ttcacaataa tataaaattt agtttttaca aagtgcttcg 420 taggatctat agacaatttt ctggtgagtt caaatttttt ttcagaatca aaatcggaca 480 acgttaacta actttaattt tgtcaatact ttaggtcttt tcagctaaaa ttgacggcag 540 tagcgaggcc cc 552 // ID R2_BM repbase; DNA; INV; 4212 BP. XX AC M16558; XX DT 01-MAY-1996 (Rel. 1.04, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Bombyx mori rDNA insertion element R2 (type II), complete cds. XX KW R2; Non-LTR Retrotransposon; Transposable Element; BMR2; KW insertion sequence; R2_BM; retrotransposon; reverse transcriptase; KW transposon. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-4212 RA Burke D.W., Calalang C.C. and Eickbush H.T.; RT "The site-specific ribosomal insertion element type II of Bombyx RT mori (R2Bm) contains the coding sequence for a reverse RT transcriptase-like enzyme."; RL Mol. Cell. Biol 7(6), 2221-2230 (1987). XX DR GenBank; M16558; Positions 64 4275. XX SQ Sequence 4212 BP; 925 A; 1083 C; 1325 G; 879 T; 0 other; cgggagtaac tatgactctc ttaggggcga tacgcataat tttaattttt cgattcaaat 60 ccagtcgtct taatctggtg accagtggcg cggtcaccag tatagtgcac aggacgtgaa 120 tggctccgag gctggcggag tcactcacta taagtgtgag agacgatgtc ctgtgccaag 180 tatacgtcca accctaacgg gttaagtgaa attagttgct cataacaggg acggtgtacc 240 tgtttgctcg tggctggcta tcgaatggac gggaccaata cacccccctg ttagtaatgg 300 ggtaagagag agcggtctga aactatggcc gagatcacga cgccccactc ctacccataa 360 cctgcacgtg gtaccgccgc acattgaccg atacgggagg aggggcagca cttgaatcac 420 gtagtcttgg tgtagccatt gcgggactac agccctcgta agtgccgcct tagaacgcaa 480 cggggcaata ggtgggccgg ggcgctagcg ggggggagta atctcccctg ttggcgtgca 540 ccgcactgct ccctctgggg gcagtgtcat ccggaaacag gtgggccggg gcgccaccag 600 gggggagcaa tccctcctga tgatggcgag caccgcactg tcccttatgg gacggtgtaa 660 cccggatggc tgtacacgtg gtaaacacgt gacagcagcc ccgatggacg gaccgcgagg 720 accgtcaagc ctagcaggta ccttcgggtg gggccttgcg atacctgcgg gcgaaccctg 780 tggtcgggtt tgcagcccgg ccacagtggg tttttttcct gttgcaaaaa agtcaaataa 840 agaaaataga cctgaagcct ctggcctccc gctggagtca gagaggacag gcgataaccc 900 gactgtgcgg ggttccgccg gcgcagatcc tgtgggtcag gatgcgcctg gttggacctg 960 ccagttctgc gaacgaacct tttcgaccaa caggggtttg ggtgtccaca agcgtagagc 1020 ccaccctgtt gagaccaata cggatgccgc tccgatgatg gtgaagcggc ggtggcatgg 1080 cgaggaaatc gacctcctcg ctcgcaccga ggccaggttg ctcgctgagc ggggtcagtg 1140 ctcgggtgga gacctctttg gcgcgcttcc agggtttgga agaactctgg aagcgattaa 1200 gggacaacgg cggagggagc cttatcgggc attggtgcaa gcgcaccttg cccgatttgg 1260 ttcccagccg ggtccctcgt cgggggggtg ctcggccgag cctgacttcc ggcgggcttc 1320 tggagctgag gaagcgggcg aggaacgatg cgccgaagac gccgctgcct atgatccatc 1380 cgcagtcggt cagatgtcgc ccgatgccgc tcgggttctc tccgaactcc ttgagggtgc 1440 ggggagaaga cgagcgtgca gggctatgag acccaagact gcagggcggc gaaacgattt 1500 gcacgatgat cggacagcta gtgcccacaa aaccagtaga caaaagcgca gggcagagta 1560 cgcgcgtgtg caggaactgt acaagaagtg tcgcagcaga gcagcagctg aggtgatcga 1620 tggcgcgtgt gggggtgtcg gacactcgct cgaggagatg gagacctatt ggcgacctat 1680 cctcgagaga gtgtccgatg cacctgggcc tacaccggaa gctcttcacg ccctagggcg 1740 tgcggagtgg cacgggggca atcgcgacta cacccagctg tggaagccga tctcggtgga 1800 agagatcaag gcctcccgct ttgactggcg aacttcgccg ggcccggacg gtatacgttc 1860 gggtcagtgg cgtgcggttc ctgtgcactt gaaggcggaa atgttcaatg catggatggc 1920 acgaggcgaa atacccgaaa ttctacggca gtgccgaacc gtctttgtac ctaaggtgga 1980 gagaccaggt ggaccggggg aatatcgacc gatctcgatc gcgtcgattc ccctgagaca 2040 ctttcactcc atcttggccc ggaggctgtt ggcttgctgc ccccctgatg cacgacagcg 2100 cggatttatc tgcgccgacg gtacgctgga gaattccgca gtactggacg cggtgcttgg 2160 ggatagcagg aagaagctgc gggaatgtca cgtggcggtg ctagacttcg ccaaggcatt 2220 tgacacagtg tctcacgagg cacttgtcga attgctgagg ttgaggggca tgcccgaaca 2280 gttctgcggc tacattgctc acctatacga tacggcgtcc accaccttag ccgtgaacaa 2340 tgaaatgagc agccctgtaa aagtgggacg aggggttcgt caaggggacc ctctgtcgcc 2400 gatactcttc aacgtggtga tggacctcat cctggcttcc ctgccggaga gggtcgggta 2460 taggttggag atggaactcg tgtccgctct ggcctatgct gacgacctag tcctgcttgc 2520 ggggtcgaag gtagggatgc aggagtccat ctctgctgtg gactgtgtcg gtaggcagat 2580 gggcctacgc ctgaattgca ggaaaagcgc ggttctgtct atgataccgg atggccaccg 2640 caagaagcat cactacctga ctgagcgaac cttcaatatt ggaggtaagc cgctcaggca 2700 ggtgagttgt gttgagcggt ggcgatatct tggtgtcgat tttgaggcct ctggatgcgt 2760 gacattagag catagtatca gtagtgctct gaataacatc tcaagggcac ctctcaaacc 2820 ccaacagagg ttggagattt tgagagctca tctgattccg agattccagc acggttttgt 2880 gcttggaaac atctcggatg accgattgag aatgctcgat gtccaaatcc ggaaagcagt 2940 cggacagtgg ctaaggctac cggcggatgt gcccaaggca tattatcacg ccgcagttca 3000 ggacggcggc ttagcgatcc catcggtgcg agcgaccatc ccggacctca ttgtgaggcg 3060 tttcgggggg ctcgactcgt caccatggtc agtggcaaga gccgccgcca aatctgataa 3120 gattcgtaag aaactgcggt gggcctggaa acagctccgc aggttcagcc gtgttgactc 3180 cacaacgcaa cgaccatctg tgcgcttgtt ttggcgagaa catctgcatg catctgttga 3240 tggacgcgaa cttcgcgaat ccacacgcac cccgacatcc acaaagtgga ttagggagcg 3300 atgcgcgcag ataaccggac gggacttcgt gcagttcgtg cacactcata tcaacgccct 3360 cccatcccgc attcgcggat cgagagggcg tagaggtggg ggtgagtctt cgttgacctg 3420 ccgtgctggt tgcaaggtta gggagacgac ggctcacatc ctacaacagt gtcacagaac 3480 acacggcggc cggattctac gacacaacaa gattgtatct ttcgtggcga aagccatgga 3540 agagaacaag tggacggttg agctggagcc gaggctacga acatcggttg gtctccgtaa 3600 gccggatatt atcgcctcca gggatggtgt cggagtgatc gtggacgtgc aggtggtctc 3660 gggccagcga tcgcttgacg agctccaccg tgagaaacgt aataaatacg ggaatcacgg 3720 ggagctggtt gagttggtcg caggtagact aggacttccg aaagctgagt gcgtgcgagc 3780 cacttcgtgc acgatatctt ggaggggagt atggagcctg acttcttata aggagttaag 3840 gtccataatc gggcttcggg aaccgacact acaaatcgtt ccgatactgg cgttgagagg 3900 ttcacacatg aactggacca ggttcaatca gatgacgtcc gtcatggggg gcggcgttgg 3960 ttgagccttg cacagtagtc cagcggtaag ggtgtagatc aggcccgtct gtttctcccc 4020 cggagctcgc tcccttggct tcccttatat attttaacat cagaaacaga cattaaacat 4080 ctactgatcc aatttcgccg gcgtacggcc acgatcggga gggtgggaat ctcgggggtc 4140 ttccgatcct aatccatgat gattacgacc tgagtcacta aagacgatgg catgatgatc 4200 cggcgatgaa aa 4212 // ID DIRS-2_DPu repbase; DNA; INV; 5503 BP. XX AC scaffold_16; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE DIRS retrotransposon from Daphnia. XX KW DIRS; LTR Retrotransposon; Transposable Element; nonautonomous; KW DIRS-2_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5503 RA Jurka J.; RT "DIRS retrotransposons from Daphnia."; RL Direct Submission to RU (08-JUN-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_16; Positions 1511038 1505536. XX FH Key Location/Qualifiers FT CDS 136..4008 FT /product="DIRS-2_DPu_1p" FT /translation="MSRYRPSPSEFFYGQFFDPRVGFFDQPRLERERMRDR FT ELRSRPHHQHPNYQVRDRSPVDFPRFAPRWDDYSRDAPRPEGFGFVAPYDD FT RFVRDDQPSSSRRGSGKGYRCGETGFYFYCDDGFCYDEAGYSYQFEQDGFY FT YDTETGYRHDSGGFLCDENGVRLNRAELESYEQSVRFAHPIASVIGPIPPR FT TEDVPIDIDIEDQEEQDDSRRGRRPWLPRRSSSPKKRAGGSAAAPRPAAYT FT PNSTAAKSSETTVGPSATEAASSATAAVSSATATESSATAAASSATAATTV FT ANVIHELVDLVPGSVGVQAVTAPLEGLSLPGSSGAQPLSVPKSISDEILLF FT MTSGISTDSSKAISKEFCLDYVDEDFSLKPPKLDGWISRRVLLKSDKNLIK FT SINAKEETLIKAQLKIMDIGQPLVDLYTRLSSLPDNETIKRPVQAALQQWG FT RAYFSITRERRSAVVALAEPSADYLLKEPDAFSTGKEARAFLLTDKFLQVM FT LNNANQDNTLAQASKAAAAAAAANATRRPATRRVRAEPPSSSSHPPPRYES FT DVIVRGGRGGRSRRAGHGRGQRSVTWFPGSRRYVTSKSNSLTPCKIFPSPN FT SKPFSQPEGPRATVVDKDGSITVASRLTKFADRWALVTSDRWVLKTIREGL FT SIEFENLPVQKSWPPQIVMSKEMAEVCDKEVKDLLAKRAIAEVTDGSAGFV FT CSFFCIKKKQAGQFRPIVNLKPLNKFIRYQHFKMENLESVRFLVRKGDWLA FT KVDLKDAYFTVAVKKEHRKYLRFRWGKRVFEFNCMAFGLAPRVFTKILKTV FT MAFLRRKGIRLVIYLDDILVLNESKEGLVADVNTVLELLQSLGFLINWEKS FT IIAPTQVIEYLGLIVDSNDPSFSLPCAKAAAVRKMCETALSEGKVSLRTIA FT SIQGNFAWAIPAIPFAQSHYRSLQRFYISNAQRVDFNLEAKVRLSPSAALD FT LGWWVANIEKANGKMFFPREPDLEIFSDASLTGWGAVCNGVTTRGPWTVQD FT MNKHINELELLGAFFAIQTFSAQTSNIAIRIFLDNSTAVSYVNKCGGTKSA FT ALTNTAKAISAWCEEKSISVEAVHLAGELNVIADRESRAEADTSDWRLDAT FT IFSRISEIWEMDVDLFASSWNSQLPRFIAWGPQPGAFAANAFSIRWENIYG FT YAFPPFSLIFRCIEKIRREKASIILICPVWTGQPWFPVLLEHACDIPRLLR FT PSPELLTSARGEPHPLIQSGALSLAAWKLSGDRTTCKAFRSRLLNFSWPEA FT AATPIPHMNQPGAVGSIGVWNGISIPCVAI" FT CDS 3699..4808 FT /product="DIRS-2_DPu_2p" FT /note="tyrosine recombinase." FT /translation="MDRPTMVSGFVGTRLRHSSAPTTVTRAVNISTGRTPS FT ANPIRRVEFSRLEALRRSYNLQGFSQQAIELFVAGSRSNTNSAYESAWGSW FT FNWCVERNIDPLRSDLAVMSDYLARLHASGKSYSSINLHRSMLSTTLPSIE FT GIPVGQHPLVIKLLKGCYNRNPPKPRYNSTWDPSLVLRFMASLGNDEFLPL FT PTLSGKLVTLLALATLLRVSELASITFSSVVLSENTIKFSLSKPRKAQRSG FT PLQSFTLSACPDANTCPVSALRSYVNRTSVNRPPVQNGMLFIALVAPFRAV FT TGNTVGRWIKTFLKMAGVDTDIYSAHSTRSAASSLAVARGLPIDSILQAGN FT WANQTTFSRFYNRGATATFAASVLTDG" XX SQ Sequence 5503 BP; 1362 A; 1375 C; 1385 G; 1381 T; 0 other; accctaaggg tgtaaactta tttattttag cacttcaaag caaaaaactg aaggccgtag 60 ccttcctcgg ccgcggcacg cccacttctc atcagactta tttttgctct gttgctagca 120 acacgcgtcg acagaatgtc tagatatcgc ccatcgccta gtgagttttt ttatggccag 180 tttttcgacc cacgtgttgg ttttttcgat caacccaggc tcgagcgtga gagaatgcgt 240 gatcgtgaat tgcgtagccg tcctcatcac cagcatccta attatcaagt aagagaccgc 300 tcacccgtcg attttccccg ttttgctccc cgttgggacg actacagtcg tgatgctccg 360 agaccagaag gatttggttt tgtggcccct tacgacgacc gtttcgtacg agacgatcag 420 cccagtagta gccggcgtgg cagcgggaaa ggatacagat gcggtgagac gggtttctat 480 ttttattgcg atgacggttt ctgctacgat gaggcgggtt atagctatca gtttgaacaa 540 gacggttttt attacgacac ggagactggc tatcgtcatg actcaggcgg atttctctgc 600 gatgagaatg gcgttcgttt aaatcgcgca gaactcgaat catacgaaca aagtgttaga 660 ttcgcgcacc ctatcgcatc ggttattgga ccgatcccac ctcgtacgga agatgtgcca 720 attgacatcg acatcgagga tcaagaggaa caagacgatt ctcggagagg taggaggccg 780 tggctcccaa gaagatctag tagtccaaaa aaaagagcag gagggtcggc agcagcacct 840 cgtccggcgg cgtacacacc gaattcgaca gcggccaaat cgagtgagac aacggtcgga 900 cccagtgcga cagaggccgc atcgagtgcg acagcggccg tatcaagtgc gacagcaacc 960 gaatcgagtg cgacagcagc cgcatcgagt gcgacagcag ccacaaccgt cgcgaacgta 1020 attcatgagt tggtggatct agtcccgggg tctgtagggg tccaagccgt cacagcacca 1080 ttggaagggt taagcctgcc cggtagttca ggtgcgcagc ctctttcagt tcccaagtcc 1140 atttccgatg agattttgct ttttatgacg agcgggattt ccactgacag ttcaaaagca 1200 atttcgaaag aattttgcct tgattacgta gacgaagact tttctttaaa acctcccaag 1260 ctagacggat ggatttcgcg ccgcgttctc ttgaaatccg acaagaactt gattaaatcg 1320 attaatgcga aagaggagac tttgattaaa gcccagctta agattatgga tatcgggcag 1380 ccacttgtgg acctgtacac ccgtctgagt tccttgccgg acaatgaaac gatcaagcgc 1440 ccagtgcaag ctgctctcca gcaatggggg cgcgcctatt tttccataac tagagagcgt 1500 cgaagtgcgg tggttgcgct cgccgagccg tcggccgatt atttgttgaa agagcccgac 1560 gcgttcagca ctggtaaaga agcgcgcgcc ttccttctta ccgacaaatt cttgcaagta 1620 atgctgaaca acgccaacca ggataacacc ctggcccaag catcaaaagc ggctgcagcg 1680 gcagccgccg ccaatgcaac gaggcgtccg gctactaggc gggtgcgtgc ggagccacct 1740 agtagttcgt cccatccacc gccacgttac gagtcagatg tgatcgtccg aggcggaaga 1800 ggggggcgga gtagaagagc cggccacggc cgtggtcaac gctcggtcac ttggtttcca 1860 gggtcccgaa ggtacgttac gtcaaaatcc aattctctga ccccatgtaa aatatttcca 1920 agtcccaact ccaaaccttt ctctcaacca gagggcccga gagctacggt tgttgataaa 1980 gatggctcaa taacggttgc gtcaagactt acaaaattcg cggacagatg ggctttagtc 2040 acaagcgatc gttgggttct taaaacgatt agagaggggc tgtctattga atttgagaat 2100 ttaccggttc aaaaatcctg gcccccacag atagtcatgt cgaaagagat ggccgaagtg 2160 tgcgacaagg aggttaaaga tttactagcg aagcgcgcga tagcggaggt cacggacggc 2220 tccgccggat ttgtctgctc gttcttttgt ataaagaaaa agcaggcagg tcaattcagg 2280 cccatagtta atctaaaacc gctgaataaa ttcattcgat accagcattt caagatggaa 2340 aatcttgagt cggtacgttt tctagtcagg aaaggggatt ggttggccaa ggtcgacctt 2400 aaagatgcct acttcactgt agcagtgaag aaggagcacc gcaaatactt gcgttttcgc 2460 tgggggaagc gcgtttttga atttaactgt atggcttttg gcctcgctcc cagggtcttt 2520 actaaaatcc ttaagacagt catggctttt ttgcgtcgaa agggcatccg actggtcatc 2580 tacctggacg acattctggt attgaacgag tccaaagagg gactggtagc cgacgttaat 2640 accgttctcg aattgctcca gtcgctaggc tttttaatta attgggaaaa gtcgattatc 2700 gccccaactc aggtgatcga gtatttgggc ttgatcgtcg actcgaatga tccatccttt 2760 tctctcccgt gtgccaaagc ggcagcggtt aggaaaatgt gcgagacggc tctatccgaa 2820 ggcaaagttt cattgcggac gatagcctca attcaaggaa actttgcttg ggcgatccca 2880 gcaatcccat ttgcacagtc acactatcgc agcctccaac gattctatat ttcaaatgcg 2940 cagcgggttg actttaatct ggaagccaaa gttcgtttgt cgccaagtgc cgcgctcgac 3000 ctcggatggt gggtggccaa tatcgaaaaa gcgaacggga aaatgttttt tccgcgtgaa 3060 ccggacctcg aaatcttctc agatgcgtcg ctgacaggat ggggggcggt gtgtaatggc 3120 gttacgacgc ggggcccctg gaccgtgcaa gatatgaaca aacatatcaa cgaactcgag 3180 ctactaggcg cgttctttgc aatccagact ttctcagccc aaacgtcgaa catcgcgatt 3240 cggattttcc tcgataattc gacggctgtt agctacgtaa ataaatgcgg aggaacgaaa 3300 tcagccgctc tcacaaacac ggccaaggcc atttcggctt ggtgtgagga gaagagcatt 3360 tcggtcgaag cagttcacct ggcgggtgaa ctcaacgtta ttgcggatcg cgaatcaagg 3420 gcagaagctg atactagcga ttggcggtta gacgcgacga ttttttcacg aatttcggaa 3480 atttgggaga tggacgtgga tctgttcgca tcttcttgga acagtcaact accccggttt 3540 atcgcgtggg gaccccaacc gggagccttc gcggccaacg cgttctcaat tcgttgggag 3600 aacatttacg gctacgcgtt tccccccttt tccttgattt ttagatgtat tgaaaaaatc 3660 cgacgggaaa aagcgtcaat tatattaatt tgccccgtat ggacaggcca accatggttt 3720 ccggttttgt tggaacacgc ttgcgacatt cctcggctcc tacgaccgtc acccgagctg 3780 ttaacatcag cacggggcga accccatccg ctaatccaat ccggcgcgtt gagtttagcc 3840 gcctggaagc tctcaggaga tcgtacaacc tgcaaggctt ttcgcagcag gctattgaac 3900 ttttcgtggc cggaagccgc agcaacacca attccgcata tgaatcagcc tggggcagtt 3960 ggttcaattg gtgtgtggaa cggaatatcg atcccctgcg tagcgatcta gcggtaatgt 4020 cagactatct cgcccgtcta catgcgtccg gtaaatcgta cagttcaatt aatttacatc 4080 gatcaatgct ctctacaact ctgccgtcca ttgaaggtat accggtaggc cagcacccgc 4140 tggtaatcaa attattaaaa ggatgttaca atcgcaaccc gccgaaaccc cgatacaatt 4200 ccacttggga cccgagcctt gtgttgcgct ttatggcctc gctgggcaac gatgagttcc 4260 ttcccctgcc caccttgtcc gggaaattag ttacccttct ggccctcgct acgctgctaa 4320 gagtgtccga gttagcctca attacctttt cgtcggtcgt tttatcggaa aatacaatta 4380 aattctccct ttctaagcct cgcaaggcac agcgaagcgg accgttgcaa tctttcacgt 4440 tgtcagcatg tccggatgcc aatacttgtc cggtatctgc attacgctcc tatgtaaatc 4500 gcacgagtgt taataggccg ccagttcaga atggcatgtt atttatcgcc ctcgtcgccc 4560 catttcgcgc cgtgacgggc aacacagtcg gcagatggat taagacgttt cttaaaatgg 4620 ccggagtaga tacggatatt tacagcgctc actcgacaag aagtgcggct tcgtcattgg 4680 ccgtcgctag gggcctccct atcgacagta tattgcaagc ggggaattgg gcaaaccaaa 4740 cgacattcag caggttttac aatcgtgggg caactgcaac attcgcagca tcggttttga 4800 ccgacggcta ggctttaaag tcacccttag ggttgagcgg aatgtattcc gctgtacaat 4860 tgagaattac cagagtgatc gctcggcgat cacgatgggt aattagaatt gtacaaggaa 4920 ggaatgagag agaccctaag ggtcatatat cccacccgca atcctccctt ctcattccgt 4980 tccatttgtt ttaattgctc aatgttgcac ctgccatatc cagggtccag ccggaaagac 5040 agtccaaaag aggccagaga agtgggctcg aatagtcaac gagaaacgca acccaccctg 5100 tcaggccggc ccgtcatggt ccattatgtc atgaagatgc caaatttgtt tattctgtta 5160 ctctcacaca ccggcttttt ctggctagac agccagtttt ttatgcctca gtttcccttg 5220 gttacgttat atgcttgcca atttgttcga aattgttcat tctgttcatt cacacaacgg 5280 ctttttctgg ctaagcagcc agttttttat gccacaatat cccttgatta tgttacattc 5340 ttgtcaattt gttctatgaa atgcattgat aggcacgcgc accccccttt cgattgatca 5400 taataagtct gatgagaagt gggcgtgccg cggccgagga aggctacggc cttcagtttt 5460 ttgctttgaa gtgctaaaat aaataagttt acacccttag ggt 5503 // ID SMAR2 repbase; DNA; INV; 2266 BP. XX AC . XX DT 24-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR2. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2266 RA Jurka J.; RT "SMAR2: Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(9), 991-991 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 225..1958 FT /product="SMAR2_1p" FT /translation="MAGTSKTNKSKRSVIDLETKHKIILQFESSKKVKDIA FT YDLKLSHSTISTILKDKKRILEAVKESTSMNSTIITKKRQGPIHDMEKLLV FT IWMEDLIQKRIPLSLAIIQTKACSLFNMLKSKEGESYDQTFTASHGWFQRF FT KKRANYQNLKISGEAASADHIAAEEFVVKFNEFLEENNYTKEQIFNVDETG FT LYWKRMPERTYIHKEEKSMPGFKAFKDRITVLLGGNIAGYKLKPFVVHRSK FT NPRAFKGVKRNMLPVHYNSNKSAWMTQLLFEDWFMNVFATEVKEYCNQHKI FT PFRILLLLDNAPGHPPHLGDLHPNIQIMYLPPNTTSILQPMDQGAIATFKA FT HYLKITFTQAIMAIEEGITLKEFWSKFNILQGIINLASGWKEVKASCMSGI FT WKKLTKIFEESTSSSIEKEVDIEEITDQIVDLGRQLNLEVDSEDINQLIAH FT QLDDLANEDLLEFEAQCVGENVDKGKSDEEESPQKKFLVKEMSDAFHQINK FT GMRKLQQMDVNEERFSEVNKLIQKNLACYYEIYREMKPQTVQTTLDRFVKK FT TRIGYSVSEDNAMDCSLVISKHDDDTKITLS" XX SQ Sequence 2266 BP; 839 A; 324 C; 404 G; 699 T; 0 other; tacaggtagt ccccgactta cgaccgctcg acttacgacc atccgcactt acgaccaatt 60 ttttttcaat tttttcatag aaaatgatta aaatagtttc gtagtcttcc ctataatatg 120 tatatgagaa aaatttgtta aaaaaacacc caatgtacaa gattcgattt atttaagcat 180 aaataatacg tttgttttag tgaataattt taatcatccc aaaaatggct ggaactagta 240 aaactaataa aagcaaaaga agtgttatag acttggaaac taaacataaa ataatattac 300 aatttgaaag tagtaaaaaa gtaaaagata tagcatatga tcttaaactg tctcattcga 360 caatatctac aattttaaaa gataaaaaac gaattttgga agctgtcaaa gaatcgacat 420 ctatgaattc tacaataata acaaaaaaaa gacaaggtcc catacatgat atggaaaaac 480 tgttagtaat ttggatggag gatttgattc aaaaacgaat tcctttaagt ctcgcaatta 540 tacaaaccaa agcatgcagt cttttcaata tgttaaaatc aaaagaagga gaaagctatg 600 atcagacttt tactgcaagc catggttggt ttcaaagatt caagaagaga gctaattatc 660 aaaatctaaa aattagtgga gaggcagcta gtgctgatca tatagctgct gaggaatttg 720 ttgttaaatt caacgagttt ttagaagaaa acaattatac aaaggaacaa atatttaatg 780 tggacgagac cggtctgtat tggaagagga tgcctgaacg tacgtatatt cataaggagg 840 aaaaaagtat gccagggttt aaggctttta aggaccgtat aacagtttta ttaggcggaa 900 atattgctgg ttataaattg aaaccttttg ttgtacatcg ctcgaaaaat ccaagagcat 960 ttaaaggagt aaaacgtaat atgttaccag tgcattacaa ctctaataaa agtgcttgga 1020 tgacacaatt actttttgaa gactggttta tgaacgtatt tgcaactgaa gtaaaagaat 1080 attgtaatca gcataaaatt ccatttagaa tacttttgct actggataat gctcctggac 1140 atccgccaca cctaggcgac ttacacccga acatccaaat tatgtatctg cctccgaata 1200 ctacgagcat tttgcagccc atggatcagg gtgctatagc tacatttaag gcccattatt 1260 taaaaattac attcactcaa gcaattatgg ctattgaaga ggggattact ttaaaagaat 1320 tttggtcgaa gtttaatatt cttcaaggaa ttataaattt agcaagcgga tggaaagaag 1380 tcaaggcttc ttgtatgagc ggaatatgga aaaaactaac aaaaatattt gaagaatcta 1440 caagttcgtc gatcgaaaaa gaagttgaca ttgaggaaat tacggatcag attgtagatc 1500 ttggtcggca gctcaatctt gaagttgaca gtgaagatat taaccaactt attgcacacc 1560 aattggatga cctagcgaat gaagatttat tggaatttga agcacaatgt gttggagaaa 1620 acgttgataa aggcaaatcg gatgaagaag aaagtccaca aaaaaaattt ttagtaaaag 1680 aaatgagtga tgcttttcac caaattaata agggaatgcg taaattacaa caaatggatg 1740 taaatgaaga acgattctca gaagtaaata agctgattca gaaaaatttg gcttgttatt 1800 atgaaattta ccgggaaatg aaacctcaaa ctgttcagac gacattagac agatttgtta 1860 agaaaactag gattggatat tctgtttctg aagacaacgc aatggattgc tctctagtaa 1920 taagtaaaca tgatgacgac acgaaaataa cgctaagttg aataaaatgt ttttgtttaa 1980 ttatatatat ttcttattat gtatttttgt tttgttacat atataatatg tgttattagt 2040 acacatttaa tatatatttt tgggttatta tttcatttta taaattttaa taaagttttc 2100 attatatgaa ttttcggtag tttcatttct cctgatccta aaagttagta gtacagtatc 2160 taatattttt ctttagttat tcatgatgaa actcgactta cgacttaatc gacttacgac 2220 cgatgactag gaacctatct cggtcgtaag tcggggacta cctgta 2266 // ID BEL-28_AA-LTR repbase; DNA; INV; 601 BP. XX AC supercont1.281; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-28_AA_; KW BEL-28_AA-I; BEL-28_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-601 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.281; Positions 494960 495560. XX SQ Sequence 601 BP; 186 A; 105 C; 122 G; 188 T; 0 other; tgttctggca gcacgggttc agatgttgtg cagaatccaa tcgagcgtgg gacgcacaac 60 ggggactgtt aggggaacaa tagctattgt tgaaactgcg aaagtgaaat cacgatcgta 120 tactttattg aaagtgtcga atattatttg ctatcttatt aatctattgc tctgaaattc 180 tagttattta cggcattgtt aacttgtcgg ctacatttac tatcattgtg aattcattaa 240 ctaaagtata tcggtaagaa ttctttgaaa acgtcacgag tcattatcat gaatcccctt 300 atgttacact aggcttgtaa tctacagata cggaaatcac gcaatatcta atcatcaatt 360 tagtggcgaa aggtatgagt actgttctac ttacgattac aataattaag tgaagtgttc 420 tttaaaatag ggattagaag ccacagacca gaagcgtcat cgggtagagt tctaagccgg 480 taggcaatac caaatgtaag ggaaaatact ttgtttaaat ggatttaaat aaaactacga 540 tcttttcagc tttgagctgc gccaaactaa cgctgctgcg agattttttc ctgcccttac 600 a 601 // ID Zator-N3_AAe repbase; DNA; INV; 1503 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Zator DNA transposon family from Aedes aegypti. XX KW Zator; DNA transposon; Transposable Element; nonautonomous; KW Zator-N3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1503 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 659-659 (2011). XX DR [2] (Consensus) XX CC >93% identical to consensus. 3-bp TSDs; usually TWA. TIRs are 29 CC bp long. Both termini are >80% identical to those of Zator-1_AA. XX SQ Sequence 1503 BP; 509 A; 209 C; 257 G; 527 T; 1 other; ggggtcgtcc ataaatgacg tagcatttta ggggggaggg gggtcttcac gaatttgtga 60 cgatgtgtga cgagggggag ggaggggtcc caactagtgg acgtagcatt ttgaatmatg 120 ttggtaaaaa gagtagcgct gaaaaaattg acaaacagtt ttttttatca aacttctttt 180 ttgtttgact tataaacttg ggttatgaaa tgagcttcaa acattaatat gataagattg 240 aaaaaatttc taagaaattt taaattttcc tgataaatgc aatcaaacag attttatgaa 300 gctttaaata gttatgtgaa tattatattg tattcgtgtc taaatttata aataaacaaa 360 acaaaaagtt taataagtta agtaaaaatc aatgctttat cgacttggaa aacattgttt 420 tttcattttt aacaagacat agaatcaacc gttttagttt tattcatatt ttctgttgga 480 gctttatttc ttcaaaacaa tatgctcatt tggcaaatat tacattaaaa caagatattt 540 attccgtgaa ttatgcgttc aatgttgcag gagatctcca cccagtcaac tcatcgattt 600 gttttgatat atcaatgctc ttgatggaac agcctgaata ataataggac aacagattcg 660 agtgttatcg aaattgtcct atcattcagt tttctaaatc actcggtaat tcgaagaaag 720 agcattttta ttatttcatt gtttcattat gagttgtgta agaatatttc agtatgggga 780 acgataatga agtttagctg aaatattaat aaagatgaat aattgtataa aaatcgctaa 840 gattttctaa acattccaaa tattacaatt gttatctttt ctctatctgg ccactgtttg 900 ctgaatggat ttgaatttgg ataataatga cgaagatcca ttccaaatta atattttgtt 960 gatcatattc actgaaatct gtttcttaac cacgaataat aaaaaaaaat cctctaatat 1020 aacaaaattt ctgttcaata tattataaaa tatttgaccc taactcgtta ctgttagctt 1080 catcaatcca acaagattta cttttttttc tacattttca gctgtaaact tgtttttcat 1140 ggtaacaaca gcaaaagtaa tacaatttaa cctatattaa actggaaacc aaaattatgt 1200 aacttaaaca ggaaataagt ttgtttaaaa ctgttttcca aattactttt gagcgctacg 1260 gtatatcaaa ctattattcg tcagattatc ctgacgtttc agtcagtaat taatatcaaa 1320 ctttgttttg aatttctgga ttttaatgtt tcgttcaacg ccaggaaaac aaaaatatag 1380 gggggggggg gtgcttgata tgctacgtat tttccaaggg gggtatcagc atttgtgacg 1440 aaatgctacg aggggggtgt aaaaaatcag tgaaaaaatg ctacgtcatt tatggacggc 1500 ccc 1503 // ID DNA8-17_AP repbase; DNA; INV; 919 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-17_AP. XX NM DNA8-17_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-919 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1759-1759 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 919 BP; 340 A; 127 C; 98 G; 354 T; 0 other; cagtgttggg aaatatctag ataaatttat ctagataagt tatctagata aaaattattt 60 atcttttatc ttatctagat aaatttctac tcagttatct ttatcttcat ctagataaaa 120 ttctagttat ctatgagtac ttatctagat aatttattta aatgaactaa aaaaaaattt 180 gaaaatttaa aatattttta ttattttatt tttacatttt tctatttttc atctattaca 240 tagaaaacac gtttagtaat atgcattata ataccatgat tttaatattt attaatattc 300 attgttagtc tggagtttta catcgtaaac aatgagaaaa cccaaaaata gaaatatgta 360 atgggcatgt agtaggtaag attgtaagaa caactaataa tagcaaaagc aaaaacatct 420 atccctgcta cccgccggcc accaccgaat gatgatggtc tcattagctg atacccgttt 480 catagaattg tgtacaaagt atgaaaaata attattgttt taaaccgaca aaaataaacc 540 aacattcatg gatattggac tcacaagcat aataatatta taataattgt aatttataac 600 aatttaatat ttataaaacc ctattagaat tcccggtatt tcagttatcg cttaaccgat 660 ctgtacgacc tctgtcctct gctttaaaaa attttaaatt atgtattttt ttttacctgt 720 agactcaagt aggtatcctg aatattttca aaattaaaca tgaaattatc tcatatttat 780 ctagataaaa atgtactaat ttttatcttt atctagataa aatttagtat aattatcttt 840 atctgtatct agataaaaat tattacagtc atctttatct ttatctagat aaattattag 900 ttatctattc ccaacactg 919 // ID Gypsy-6_BM-I repbase; DNA; INV; 5848 BP. XX AC nscaf3093; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_BM_; KW Gypsy-6_BM-LTR; Gypsy-6_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5848 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 987-987 (2010). XX DR Genome; nscaf3093; Positions 987761 993608. XX CC Positions [4881-5360] - Integrase core CC 'GTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 631..1596 FT /product="Gypsy-6_BM-I_1p" FT /translation="MSKSDSKKDVREQRVDEEVALNVILKFIKPFNGDREK FT LTAFLRNCDSAISLTSPQQEDIVFKYILSQLEGKAESACAIKDFESWSSLK FT EFLKNQFGERKHYAHLLTDLQDCKQRNQETINQFSLRIETCLQKLLTEITV FT SNPKKSEIAGRIAAMEDLALHTFLLGLIPRIADMVRCRDYKTLNEAINAAI FT SEEKIQQFTFRNNYSKPRFVENSNKRTETVNGPPSRPFNNNNNNQSNSFYR FT NPSICKYCKKPGHILENCRLREYNNKRFSNGSRPFTPSQPQASTSRETNFK FT PSGPRVHFTNDYEDEGRDEIDTPSNNADLN" FT CDS 1959..5411 FT /product="Gypsy-6_BM-I_2p" FT /translation="MKHKVYEILTQKRYLPYVQLQTPINKNPLTFLMDSGA FT SICVLKKKSIINSQELKENIVKVKGIDSKNDTLKSVGCFDLPLQFNDDLKI FT SHTFHVFDNVDLPYDGIIGNDLFMKYQCRINYQTNCLEINNINVKLSFDEP FT SYIIHARTETVIECSVKNPELKEGIIVDQHICDSLLVSNCVVKVKNNNRIN FT LTILNVSDEQVYLNANLMLTLAPIDLTEYNTVQHDQSTNSIERCNQLSDLL FT RTSHLNQEELYYLNDICFQYADIFHLPGDQLTCTDVMQHEIKTSSSQPINV FT KSYRFPEIHKEEVNSQINKMLDQQIIKPSKSPWSSPIWIVPKREDASGKKK FT WRIVIDYRKLNDITIGENYPIPQINEILDQLGQSKYFTTLDLASGFHQIKM FT SPADASKTAFSVPQGHFQFNRMPFGLKNAPATFQRTMNTVLSGLQGVHCFV FT YLDDIVVYSSDLPLHIEKLTLVFEKLRKFQLKLQPDKCEFLRKEVAYLGHV FT ISNDGVKPNPDNIKAVMQFPVPKSAKDIKSFLGLASYYRRFIPEFSKYAKS FT LTVLLKKDVPFNWTNTQQLAFEQLKEKLVTAPVLIYPDFTKPFILTCDASN FT YAISAILSQGDIGKDHPIAFASRTLNKAETNYSVTEKECLAIIYGTKIFRP FT YLFGRQFKIITDHKPLNWLFNCKDPGSRLIRWRLKLEEFDYEIQYRRGKTN FT TNADALSRYPMNPLQQNDDISMPIPENIDINPNQVRNEEISPVKPFPNLDS FT INNDLPELDLPDNFVLPDSLELPEDFDIPTPQSVEDTDDDTYSKCLKSINN FT KSVIFDTKILEHNENLLKTKLKHIVIPTSIDLDESNRYVPEIIGNIPDPKE FT LLAAERTLYSPLISETNDILHYFLFVKVYHQPIKYTNITIHIYKNTIMYPA FT MTEITKILRENHDIPISGHLGSNRMLKRIQEKYYWRNMKRDVENYVHKCES FT CQANKALRQTNRAPMQITSTSTQPFERVALDIVGPLPEVGINKLRFILTLQ FT DDLTKFSAAYPISNCTAEETSECLIHFITQFGIPKIIITDQGSNFTAELFK FT QTCSFLKIKQLWSTPYHPQTQGALERSHSTLKEYLKSFVDENQSNWPRYVY FT TAMLTYNTTVHCTTNFTPYELVFGHKAIIPSSIYDSSLESTYN" XX SQ Sequence 5848 BP; 2072 A; 1204 C; 940 G; 1622 T; 10 other; tttggtggca gcggtgggat tcgaatatcc cagtgtcctt aacccggaag gaactttata 60 tcaagaaacc tgtgaagtga aatttataaa agtactaaag ttgtatccag aactgaaaga 120 aagaagttcc cggacccgaa ctcattatag cgagatatat atatactctc tattcctcaa 180 attcctgtaa caccgtgaag cggaatccaa tagtcaggaa tcccttttgc acacctgact 240 ctcatatcca gcaccgaatc gacggtgatc tccagttcct gatctaatcg tccatggtaa 300 acgtatgcca tgcatcttga aaccatcacg aagagttcct gtaagctgtt cctgtaagtg 360 ttattttcat caccaatgaa cctttgtagt tttgtgcaat taatattgtt cagtgttata 420 actttagacg tttaaaaaat agaactatat caattttaat tcataagagg cagaaacttg 480 atccgaggtc gttcttatca atcgtactta gtctaaacaa aacgcgctgc aggtatcgcc 540 cgcgtattcc tggccattgc aaataaaata tacaactaat atcccgtggg ttggtttatt 600 tatttaatat ttttataaaa ttagtgcaaa atgtcaaaat cagattcgaa gaaagatgtc 660 agagagcaaa gggttgatga ggaggttgcg ttaaatgtga ttctgaaatt cataaaacct 720 tttaatggtg atagagaaaa attaacagca tttttaagaa actgcgatag cgctatatct 780 ttaacatcgc ctcagcagga agatatagtt tttaaataca tcttaagcca attagaaggc 840 aaagccgaaa gcgcctgtgc aattaaagat ttcgagtcat ggtcttctct taaagagttt 900 ctgaaaaatc aattcggtga gcggaaacat tacgcacacc tgctcaccga cttgcaagat 960 tgtaagcaaa gaaaccagga gactatcaat caattctccc tcagaataga aacctgccta 1020 caaaaacttt taacggaaat aactgtttct aatccgaaaa agagtgaaat tgcgggtaga 1080 atagcagcta tggaagacct tgccctccat acattcctat tagggctaat tcccagaatt 1140 gcagacatgg tgagatgtcg cgactacaaa acattaaatg aagccataaa tgccgcgata 1200 tctgaagaaa aaattcagca gttcacattc cgtaataatt actctaaacc aagatttgtt 1260 gagaattcaa ataagagaac agaaacagta aatgggcccc cgtcaagacc ttttaataat 1320 aacaacaata atcaatcaaa ctcattctat aggaaccctt caatctgtaa atactgtaaa 1380 aagccagggc atattcttga aaattgcaga cttcgtgaat ataacaataa gagattttca 1440 aacgggtccc gaccttttac tccttctcaa ccgcaggcgt ccacttctag agaaacaaat 1500 ttcaaaccta gcggtccccg tgtgcatttc actaatgatt acgaagatga aggacgtgat 1560 gaaatagaca ctccaagcaa taacgctgat ttaaactaat agatgtcccg tataggtgtc 1620 ggacgggaca agtctcttcc gataaaaacg atgccgttta tattaaaagt aagaccacgc 1680 cacagggtaa ccagccccac aaggaaggtc tgaatgctat taaaagtaag accacgccac 1740 agggtaacca gccccacaag gaaggtctga accttatcaa tagtaagact acgccgcagg 1800 gtaaccagcc ccataaggaa agtctgaatc cctttaataa taaggctgcg caacagggta 1860 gccagtccca taaagaagat ccgaattatt ccagtcataa agacaatttg aagggtgacc 1920 aaacccctaa caaaatatgc gatattaaca ataccgagat gaaacataaa gtttacgaaa 1980 tcctaacgca aaaacgatac ttaccttacg ttcaattgca aacacctata aacaaaaacc 2040 cattaacatt tttaatggat tctggagcat ccatctgtgt cctgaaaaag aaaagcataa 2100 ttaattccca agaattaaaa gaaaacatcg tgaaagtgaa aggaattgat tccaaaaatg 2160 atactttaaa gagtgtaggt tgttttgact tacccttaca attcaacgat gatttaaaaa 2220 tcagtcatac tttccacgta tttgataacg tagacttgcc ttatgacggc ataattggaa 2280 atgatttatt catgaaatat cagtgtcgta ttaactatca aacgaattgc ctagaaataa 2340 ataatattaa cgtcaaactg tccttcgatg agccatccta tattattcat gctcgcactg 2400 aaaccgtaat agagtgctct gtgaaaaatc cagaactgaa agaaggaatc atagtagacc 2460 aacacatttg cgattccctc cttgtttcca attgtgtagt taaagtaaaa aacaataata 2520 gaataaattt gacaattctc aatgtgtctg atgaacaagt gtatttaaac gctaatctaa 2580 tgcttacttt agcaccgatt gatctaaccg agtacaatac agtccaacat gaccaatcta 2640 ctaacagtat cgaaagatgt aaccagttat ccgatctttt aagaacatct catttgaacc 2700 aagaagaatt gtactacctt aacgacatat gttttcaata cgctgatata ttccatttac 2760 caggagatca attaacatgt acagatgtta tgcaacacga gattaaaact agttcctccc 2820 aacctataaa tgtaaaatca tatcgtttcc cagaaataca taaagaagaa gttaactctc 2880 aaataaacaa gatgctagac caacaaatta tcaaaccatc taaatctcca tggtcttccc 2940 caatatggat agtaccaaaa agggaagatg ctagtggtaa aaagaaatgg aggatagtaa 3000 tagattacag aaagctgaac gatatcacta tcggtgaaaa ttatcccatt ccccaaatta 3060 atgagatatt agatcaattg ggtcaaagta aatatttcac aacattagac ctcgcttccg 3120 gttttcacca aatcaaaatg tcccccgcag acgcatcgaa aacagcgttc agtgttcccc 3180 aaggtcactt tcagtttaac cggatgccgt ttggtttgaa aaatgcccct gcaacattcc 3240 agagaacaat gaatacagtc ttatccggct tacaaggcgt acactgcttt gtttatctcg 3300 acgatatagt cgtgtactcg tcagatctac ccttacacat agaaaaacta acccttgtct 3360 ttgaaaaact aagaaaattc cagctcaaac ttcagcccga taaatgtgag tttttacgta 3420 aagaagttgc ttacctgggt catgtcatta gcaatgacgg cgttaaacca aatccagata 3480 acataaaagc agtaatgcaa ttcccagtac caaagtcggc taaagacata aaatctttcc 3540 tgggcttagc ctcatactac cgaagattca tacctgaatt ttccaaatac gcaaagtctc 3600 tcactgttct cctgaaaaag gatgtccctt tcaactggac caatacacaa cagctagctt 3660 tcgaacaact gaaagaaaaa ctagttacag caccagtcct tatctatccc gatttcacaa 3720 aaccattcat cttaacttgt gatgcttcta attacgcgat ttctgctatt ttgtcacagg 3780 gtgatattgg aaaagatcac ccaatcgcct tcgcgtcaag gacactaaac aaagccgaaa 3840 ctaactacag tgtcactgag aaagaatgtt tggctataat ttacggaact aaaatttttc 3900 gtccttacct ctttggtcgt caattcaaaa taataacaga ccacaagcct ctcaactggc 3960 tttttaattg caaagatcca gggtcccggc ttatccgttg gcgtctcaag ttagaagaat 4020 ttgactatga aattcaatat agaagaggta agacaaatac taatgcagac gctttatcac 4080 gctatccaat gaatcctcta caacaaaatg atgacatctc aatgccgatt cccgaaaata 4140 tagatattaa tcctaatcaa gtacgtaacg aggaaatctc tcctgttaaa ccctttccaa 4200 acttagactc tataaacaat gacttaccag aattagattt acccgataat ttcgtcctac 4260 cagatagcct agagcttccg gaagattttg atatccctac tccacaatct gtggaagata 4320 ccgatgacga cacatactcc aaatgtctga aatcaataaa caacaaatca gttatatttg 4380 acactaaaat actcgaacat aatgaaaatc ttttaaaaac taaattaaaa catatcgtaa 4440 tacctacatc cattgatctc gatgaatcaa accgctatgt gcctgaaatt atcggtaata 4500 tacctgatcc taaagaatta cttgcagcag aaagaacttt atactcacct ttgatatcag 4560 aaacaaatga cattcttcac tattttctct ttgttaaagt ataccaccaa ccaataaagt 4620 ataccaacat aacaatacat atatataaga acacaatcat gtatccagca atgactgaaa 4680 tcacaaaaat attgagagaa aatcatgata ttcccatatc aggccactta ggttctaaca 4740 ggatgttaaa aaggattcag gagaaatact actggcgaaa tatgaagcga gatgttgaaa 4800 attacgttca caaatgtgaa tcctgtcaag ccaataaagc attgcgtcaa accaaccgag 4860 ccccgatgca aatcacatca acatcaacac aaccatttga acgggttgct ctcgatatcg 4920 tgggccccct acctgaggtc ggaattaaca agttaaggtt cattctcaca ttacaagatg 4980 accttacaaa gttttcagcg gcctatccaa tttctaattg cactgcggaa gaaacatccg 5040 agtgccttat tcattttatc actcaattcg gaattcctaa aataataatt actgatcaag 5100 gatcgaattt cacagcggaa ttatttaaac agacttgtag tttcttaaaa attaaacaac 5160 tttggtcaac accatatcat ccgcagacac agggagctct cgaaaggagc cattcaaccc 5220 ttaaggagta cctcaaatcc tttgtagatg aaaaccaaag taattggcct agatatgttt 5280 ataccgccat gttaacatac aacacaactg tccattgtac aacaaatttt actccttatg 5340 aattagtatt tggccataaa gctatcatac cttcttccat ctatgattcc tctctggaat 5400 ccacttataa tagnnnnnnn nnnctaagtc cgtaattctg atttaatctt atcaatgacc 5460 ctgttggtga tttaaagaaa tatatgtcaa tcaaacacaa gttagaacct ttaaaattgt 5520 acgatatgca tttagaaaaa cttgtaagtg tttcaaatca tcaattaatg acgaatattg 5580 ataatattgt gaatacgcct gacccaatta tagaatacga ttcgcattgt cctattatta 5640 cctattgttt acttatttta ttcatgtttt ttgtatttta taagttatgt aaaagattcg 5700 gtaaatgtcc tactttccta aacaaaacta agcccgataa tgtattagat gaagtccaag 5760 aaatgaaaga aattccaatt ccgagaattc ggatatctac ttgaaactta ttttctaaaa 5820 cctataagtt tcatcttaaa cggggggt 5848 // ID R2C_NGi repbase; DNA; INV; 3646 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Nasonia giraulti. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2C_NGi. XX OS Nasonia giraulti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-3646 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 190..3534 FT /product="R2C_NGi_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="WVTSPRRPRYVGPQKKKASDGNDGRAAARAEPTNPGG FT PDRADDDEGDVKFWCEFPGCDRFFMTRSGRGLHHKKGHPDWNDQRNLAGKQ FT HRKEIWSEEERLLLAKKEAELAISGARFINVELRDFTARSLDAIKGQRKRP FT DYKILVEKFVRELRVRGIRQGVASRSQQARAMAVAGAPAATSSGAPPVATQ FT PPPSGRVLRSQVVEAPAMEIPVAESEGDSSGDELFEDVEPVRLSDLPPDRF FT TIYFAGLEIPGTEDIYAHRLHTICLMTTWRTKEEVRLELGLFLKDLFPSKG FT SQERPERTNLPDPRNRIERRRGEYKKCQDLWRRNKSTCVQRILKEDLSQGE FT CLPRELMEPFWNATFTQNPGTAPVLPPPTEVYSSVWEPIRPENIKGNYPPQ FT NTAAGIDGLTVGDLKGVSREMLARIFNLFMWCGKLPEHLCASRTILLPKKP FT GAKVPGEFRPITVTSVLIRTFHKVLAERLKVVPLDPRQRGFRESDGCAENV FT MLLDMTIRYHHERRRKMFLALLDMAKAFDSVSFESMREVLTTKGIPTPFIE FT YFMTHLEDSFTVLQHGNWQSGKIHPTCGVKQGDPLSPPIFNFIMDEMLKRL FT PKEIGVNLDGLFVNAMAFADDLSLVANTEQGLQILIDEATSFLGLCGLRAN FT PNKCVTLAIKTIPKEKKTAIDPSSHFRIGNAVIPSLKRTDEWVYLGIKFNS FT NGRLISDAKPKLIKDLELLTKAPLKPQQRLWALKVIVIPGILYRGTLGSST FT AGYLRSLDCVIRAYVRRWLRLPGDCPNGYFHAAVADGGLGVHPIRYKAMVD FT RLARLRKLEKSAYITGPEAARYLQRQVSIAENRLRDGANRIMSDASMLREF FT LRELLYKSFDGRPLENSSKVPGQHRWVEEPTRFLSGADYMNCIRARIAALP FT TAARCARGRLKDKHCRAGCGNVETLNHVLQFCHRTHGTRIGRHDAVVKYVV FT GGLKKRGYAVKEEPKIVLQDVVYKPDMVATKEGKTLILDAQVLGDQRDMRL FT AHEDKLRKYGAPEFKRKIRSETGSATIKSLSVTLSWRGLWGPDSVKGLLEE FT GVILKKDLKILSTRVLIGALAGWRRFNERTSMATSGRREEVTTRMVRRWKR FT RERVGVG" XX SQ Sequence 3646 BP; 950 A; 854 C; 1037 G; 805 T; 0 other; cgggttcccc cgacttcggc ttgccgtggt ctggggctca ctgctttttg tggagtcatg 60 gttacatggt gaccctggtt cctcgcaccc ccgctggaaa ctatctgggg aggccatgat 120 tgggtaacga taaaggtcct ggtcgtgtcc tcctgagata ggctgaatgg gtcactaagt 180 ggcacctaat gggttaccag ccctaggcgg ccgagatacg ttggtccaca gaaaaagaaa 240 gcctcggatg gaaatgacgg acgagccgct gctcgtgccg aaccaacgaa tccgggcgga 300 ccagaccgcg ctgacgacga cgaaggggat gttaaattct ggtgtgaatt tccaggttgt 360 gatcgcttct ttatgaccag gagcggtaga ggcctccatc acaagaaagg ccaccctgat 420 tggaatgatc agagaaacct ggccggaaag caacaccgaa aagagatatg gtcggaggaa 480 gaacgtctcc tgcttgccaa aaaagaggcg gagcttgcca tcagtggagc taggtttatt 540 aacgtagagt tgcgtgattt tacagcgcgc tccctagacg ctatcaaggg ccagcgaaag 600 agacccgact ataagatctt agtcgagaaa tttgtcaggg agttaagggt tagaggcatt 660 cgtcaaggag tggcctcgcg gagtcaacaa gctcgcgcga tggcggtggc aggagctcct 720 gcagcgacgt cctcgggggc accacctgtc gcgactcaac caccaccatc aggtcgcgta 780 ctaagatctc aggtcgttga agcaccagcg atggagatcc ccgtggcaga gtcggaaggt 840 gactcctcgg gggacgagct gtttgaggat gtcgagcccg tgcgattgtc cgacctaccc 900 cctgacaggt ttacgatata ctttgctggg cttgaaatac ccggcaccga agatatatat 960 gcccacaggc tccataccat ctgcctgatg acaacgtggc gaaccaaaga agaggtaaga 1020 ttagaacttg gcctcttttt gaaagatttg ttcccgagta agggcagtca agaacgcccg 1080 gagagaacca acctgccgga cccgagaaat cggatcgaga ggcggagggg ggagtacaaa 1140 aaatgccagg atctatggcg acgaaataag tcaacctgtg ttcagcggat ccttaaagag 1200 gatctttcgc agggtgaatg tttgcctcga gagctgatgg agcccttttg gaatgcgact 1260 ttcacccaga atcctggcac ggctccggtg ctccctcctc ccacggaggt ttattctagt 1320 gtttgggagc ccattcggcc cgagaatatc aagggcaact atccgccgca gaacaccgcg 1380 gcagggatag acggactgac agtgggtgac ctgaaagggg tgtcgcggga gatgctggcc 1440 agaattttta acttattcat gtggtgcggc aaactgccag agcacctttg tgcctcacgc 1500 acaattctcc tgcccaagaa acctggggcg aaagtccccg gcgaattcag gcctatcacc 1560 gtgacatccg tcctcatccg gacctttcac aaggttctgg ccgaaagact gaaggttgtc 1620 cctcttgacc cccgccaaag aggcttcaga gagtccgatg gatgtgcaga gaacgtgatg 1680 ctactggaca tgaccatccg gtaccaccac gagcggcgca gaaagatgtt cttggccctg 1740 ctagacatgg ctaaggcatt tgactcggtc tctttcgagt ccatgcggga ggttttgact 1800 actaaaggca taccaacgcc atttattgag tattttatga cgcacttgga ggatagtttt 1860 actgttcttc agcatggtaa ctggcaatcg gggaaaatcc acccaacatg tggtgtgaag 1920 caaggcgatc cactgtctcc gcctatcttc aacttcatca tggatgaaat gttgaagagg 1980 ttgcctaagg aaatcggggt taacttggac gggttatttg ttaatgctat ggcatttgcg 2040 gatgacctga gccttgttgc caataccgaa caaggtctgc agatcctcat agatgaagct 2100 acttcctttc tggggctctg tggactccgc gccaatccca ataagtgcgt caccctagca 2160 attaagacca tcccgaagga gaaaaagacg gccattgacc cctcatcaca ttttaggata 2220 ggtaatgcgg tgatcccctc gttgaagagg acagatgagt gggtgtactt agggatcaaa 2280 tttaattcaa atggtcgcct tatctctgac gcaaaaccca agctcataaa agatcttgag 2340 ctactaacta aggcaccact caaaccacag caaagactgt gggcgcttaa ggtgattgtc 2400 attccgggca tcctttacag aggtaccctg gggagcagca ccgcaggcta cctacgctct 2460 cttgactgtg taataagggc ctatgttcgg cgatggctac gtctccctgg agattgcccg 2520 aatgggtatt ttcatgcagc ggttgcggat ggagggctgg gagttcaccc catacgatac 2580 aaggcgatgg tagatcgcct tgcccggctc cgaaaattag agaaatccgc gtacatcacg 2640 gggcctgaag ccgcacgtta tcttcaaaga caagtttcta tcgccgaaaa taggctccga 2700 gatggggcca accgcattat gagtgatgcg agtatgctaa gggagttcct tcgggagctt 2760 ctgtacaagt cctttgatgg tcgtcccctg gaaaattcca gcaaagtacc aggtcagcac 2820 cgctgggtcg aggagccaac ccgtttccta tccggggcgg actatatgaa ttgtatacgt 2880 gcgaggatcg cagctcttcc gactgcagcc aggtgtgcta ggggacgtct caaagacaag 2940 cattgccggg caggttgcgg aaatgtggag acgcttaacc acgtcttgca attctgccac 3000 cgtacccatg gcactcgcat tggacgccat gatgcggttg taaagtatgt tgtaggagga 3060 ctcaagaaga gggggtacgc agtgaaagaa gagccgaaaa tcgtcttaca ggatgtggtg 3120 tacaaacctg atatggttgc gaccaaggaa gggaaaacac tcattctgga cgctcaggtt 3180 ctaggcgacc agcgtgatat gagactggca catgaagata agctccgtaa gtatggggcc 3240 ccagaattta aacgaaagat caggagtgag acggggtcgg caaccattaa gtccttgtcg 3300 gttacattga gctggcgagg gttgtgggga cctgactcag taaaggggct cctcgaagag 3360 ggagtgattc taaagaagga ccttaagatc ttgtccacaa gagtacttat aggagctttg 3420 gcaggctgga gaaggtttaa tgaaaggacg agtatggcaa catctggaag aagagaagag 3480 gttacaacaa ggatggtgag aaggtggaag agaagagaga gggtcggtgt tggttagcgg 3540 actggactgt ctggaggagt gtttaactcg ggttctcatg ggaacccgac aacgttgtta 3600 tcttgtatga caattcataa aaaaaaaaaa aaaaaaaaaa aaaaaa 3646 // ID BEL-214_AA-I repbase; DNA; INV; 6109 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-214_AA_; KW BEL-214_AA-LTR; BEL-214_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6109 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 885-885 (2011). XX DR [2] (Consensus) XX CC Positions [5144-5704] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 36..950 FT /product="BEL-214_AA-I_2p" FT /translation="MEGHRKPAGYSCQSCTRPDTDDEKWVACDGCSLWEHF FT YCAGVDESVKNQPYICRKCKAKSAVSGTSSSRINKPPPSEGRSLRSSTAKS FT GLKPVNKVTQLASKFSRSVNSTTSSVRAARLQAQMKLVEEEQLLKEQELEA FT QEAMRKKEMEEEERQLEEKKALLQAETKLRQRKLQEEKEYQKKQQMIRKES FT LEKKNTIARQLSECSSRGGSIPDSEQQVARWLQQCSPEQLENEDYISRTGA FT GEASPLIRPNPPFTNKAEHQDSDRQSHVEQQFSRILIDASDNDPPAAIHAC FT VNHDPPQERTATK" FT CDS 3632..6109 FT /product="BEL-214_AA-I_1p" FT /translation="MLERNSATESVLGMTWNPVDDCFSYSFNLRNDLHSIL FT DETHVPTKREVLKVVMSLFDPLGLVSHFLVHGKVIIQGTWAAGTGWDEPLN FT EELLKKWRQWVSLFPKLNDLKISRCYFSPYPSNIGDIQMHTFVDASDIAYS FT CVIYFRLVHLGHVQVVLVGAKSKVAPLKTLSTPRLELKAAVLGVRYAAAIS FT EYHSIPLTRRFFWSDSTTALAWIRSDHRKFHKFVSVRVGEILTLSEPQEWR FT WVQSKSNVSDEATKWNDGPNLDSHGPWFNGPKFLLLEEASWPEQRVMSTTL FT EEIRRVQIHREHEPAIEYFRFNKWTKLQRTTAYVVRFIDSLRRRNVGQNLQ FT REILTQEELARAEVLLWKMAQMEAFPEEQIVLKKTMGPPNVKHANVSKSSS FT IYKTWPYMDAQGILRMRGRIGAAYYLPFQARYPVILPRQHWITTLIVDWFH FT HRYRHANRETVINELRQRFEVAKLRSLVQRVSKNCVWCRINKAFPKPPVMA FT PLPEERLEPFVRPFTYVGVDYFGPLYVKVGRSQAKRWVALFTCLSIRAVHM FT EIVYSLSTESCVMAIRRFVSRRGSPAVFFSDNGTCFQGASKQLKEEVNTRI FT EAIADTFTDAETKWKFIPPAAPHMGGVWERLVRSVKTALGTALEHPRKPND FT ETLGTIIQEAEALVNSRPLTYIPLESADQESLTPNHFLLGSSSGNKIEPAV FT VTEHATLRSSWKMSQYIVGEFWKRWVKEYLPVITRRCKWFKEAKDLAEGDL FT VLVVNGTERAKWVRGRIEKTFAGCDGRVRQALVRTANGVLRRPAVRLAVLD FT VEERREPKELQEDPGNHLGSRAGV" XX SQ Sequence 6109 BP; 1698 A; 1447 C; 1567 G; 1372 T; 25 other; aatcttcaag aattttatct acacggggtt gtggaatgga aggacatcgt aagcctgccg 60 gttacagttg ccagtcttgt acaaggccag ataccgacga cgaaaagtgg gttgcgtgtg 120 acggatgtag tctttgggaa catttctatt gcgcaggagt ggacgagtcg gtgaaaaacc 180 aaccgtatat ttgcaggaag tgcaaggcaa agtcagctgt ctccggaacg tcttcatctc 240 gtatcaacaa accaccgccg tcagaaggaa gatctttacg atcgtcgacg gccaaatctg 300 gcctaaagcc ggtaaataag gtaacccaac tggcatctaa gttctcgcga agtgtaaaca 360 gcacgacttc gagtgtgcgc gctgcacgtc tccaggcaca gatgaagctc gtcgaggagg 420 agcaattgct gaaggagcag gaattggagg cgcaggaagc aatgaggaag aaagagatgg 480 aggaagagga acgtcagcta gaggaaaaga aggcgctctt acaagctgag actaaactgc 540 gacagcgtaa actgcaggag gagaaagaat atcagaagaa gcaacagatg ataaggaagg 600 aatcgctgga gaagaaaaac accatcgcac ggcagctgag cgagtgcagc agtagaggtg 660 gatcgattcc ggattctgaa caacaggtag cccgttggct tcaacaatgt tcccccgagc 720 aactcgaaaa cgaagattat atttcgagaa cgggtgctgg agaagcctcg ccgttgatca 780 gaccgaaccc accattcacc aacaaagccg aacaccaaga ctcggaccgg caatcgcacg 840 tggaacagca attttckcga atcttgattg atgcctctga caacgacccg cctgcagcta 900 ttcacgcatg cgtgaaccat gatccgccac aagaacgaac cgcaaccaag gawccaacga 960 agattcagct agccaccgat tcgacagaga ttttcgtgca aacttctccg atcacacgat 1020 gctccctcgt ccgtatgctg atcgacgaat tccagkgaca caaatggagc agccacagaa 1080 ggcgtacaaa ccatctcaac aggtaccaca ctcgtcgcgt cagccataca tggatccgga 1140 aacaatgacg ttttctgcgg tacttagtgc gaatcaacta gctgccagac aggtaatggg 1200 gaaggaattg ccgaatttct ccggtaatcc tgaggagtgg ccgatcttca tttgcagctt 1260 cgagcaatct acagcggcct gtggctacac cgatactgaa aacctgattc gactacaaag 1320 atgcttgaaa gggcatgctc tcgaatctgt gcgaagccga cttctccttc cttcgagcgt 1380 tccacaagtt atcaacacgc ttcgcacact ttacggcaga ccagagcttc tcatacgaac 1440 actgattgaa aaagttcgcc gcactcctgc ccctagacat gatcgtttgg agactgtggt 1500 tgaattcggc ttagtcgtgc aaaacctsgt agatcacctg aaggcggcga agcagtacac 1560 gcacttagcc aatccgattc tgatgcaaga gttggtggaa aaacttccag gwkcagtgaa 1620 gatggactgg gcagtctata aaagccggca gccctatgca actttascca cttttgggga 1680 cttcatgacc gggctactag atgcagcaag ccaggtgaca ttcgaactgc cgaaccacat 1740 tggaagctcc aagttcgggg aacagcgagc aagagaaaga ggtctgcttc cgcmcattcg 1800 tcgatsscgc ttctgcacgg agggtataaa tkstmaatca actacagaga aagcaggtaa 1860 gccatgcgca gcgtgcgaac gtcgagaggt catcgmgtta gcggattgcc accagttcaa 1920 atcactggac gtcgkcgaac gstgggaggt cgtccaccaa aagggactgt gttgtacgtg 1980 cctgaatggc cacggaaagt ggccgtgtaa gtcctggcaa gggtgtggga ttgaggggtg 2040 tctccaaaaa caccacacac tccttcactc ctcttccgcg tcgccacctc ggaatgtctc 2100 cgtcaaccat cccatggaga acaacggaat ckcaccctat gtttcgggtg ctcccagtgg 2160 tactatatgc gggcatcgca gaaggaagtc gtatttgcgt tcgtcgatga gggatcgtcg 2220 acaacattct tggaggaaac catcgctgac cgcctgggcg tatctggtca cgtagaaccg 2280 ctgaccctac agtggacggg taacataaaa cgggaagaac gaaattccca acgaattcag 2340 ctggatgtca ccggtgagga tggtcgaaca ccgtcataag ctgtgcgagg ttaggacagt 2400 cagctgtttg gtcttgcctt ctcaaacgat gaattacaac gatctctgca accgctaccc 2460 acacttgcgg ggactcccac tgagaggtta tgagttgatc cagccaaagc tactaattgg 2520 tttggataat cttcgccttg ccgtgccgtt aaaggtccgc gaaggtggmc cwagagaccc 2580 gatagccgct aagtgccgwt tgggttggag tgtctacggg agtttagcgg gaacatctcc 2640 accgcgagct gtcgtccact ttcatgtggc tggaccagca aatagtgact gtcaattgaa 2700 cgatcagctc cgagactatt ttgcactgga ggacgccgga ataagggggc taagcgagac 2760 gctagagtcc gatgaagaga gacgtgcgat tgaaatatta caacgaacca ckcggcgsac 2820 agggcgtgga tttgaagccg gacttctgtg gagagaagac gatccgaatt tcccagacag 2880 ctatccaatg gcagttcggc gsttgaagtc tttggagcgt aagttggaga agaacccact 2940 gctgaagaat cgtgtcctgg agcaaattgc cgagtactta cgaaaaggat acgcccacat 3000 tgcactgatt cggagctkga aagkgcagac cgcaaacgta cgtggttcct gccgcttgga 3060 gtggtcgttc attccaagaa acctagcaaa ataagattaa tatgggatgc cgcagccatg 3120 gtagatggag tctccttcaa ttccaagctt ttgaagggtc ccgatctttt aacaccatta 3180 ccagcagtgc ttagtggctt tcgtcaattt cccattgcgg tctgcggtga catcaaagag 3240 atgtttcacc agctctcaat tcgcgaagag gaccgtttag cacaatgttt cctatggcgg 3300 aacaactcta cggaacctat tcaactctac atcatggacg tagcgacgtt cggcgcgact 3360 tgttctcccg ctgccgcgca atatgtaaaa aaccttaacg cgcaggagtt ctctgaagtt 3420 tatccacgtg ccgccaccgc aatagtagaa aaaacattac gtggatgact acttggacag 3480 cttcgcgacc cttgaagatg ctgcggaagt agtgaaggag gtaactcaag tacactcgct 3540 gggaggattc gagattcgag gctttcgctc taactcaaca gaacttctcc gggaaatagg 3600 tcaaccaacc gatgatgagc cgaagaactt gatgctagaa aggaattccg ctactgagtc 3660 cgtgcttgga atgacctgga atcctgttga tgattgtttc tcctattcct tcaatctgcg 3720 aaacgatctc cactcaattc tcgacgaaac acatgttcct accaagcgtg aggtcctgaa 3780 ggtagttatg agcctcttcg atcccctcgg gcttgtctca cactttctag ttcatggaaa 3840 agtgatcatt caaggaacgt gggcagctgg aacaggatgg gatgagccgc ttaacgaaga 3900 actcttaaaa aaatggcgcc aatgggtgtc acttttccca aagctcaacg atttgaagat 3960 ttcgcgatgt tatttttcgc catacccttc caacattggc gatatacaaa tgcacacatt 4020 tgtagacgcc agtgacatcg cttattcctg tgtgatctac tttcgactag ttcatcttgg 4080 ccatgtacaa gtggtactgg tgggcgctaa aagcaaggtg gctcccctga agacattgtc 4140 taccccacga cttgagctga aggccgctgt actaggagta cgatatgctg cggcaattag 4200 tgagtatcac tcgattcctc taacaagacg gttcttctgg agcgattcca ctactgctct 4260 agcatggatt cggtctgatc atcgtaagtt tcacaaattc gtgtcggtgc gagttggaga 4320 aattctaaca ttgagcgaac cacaagaatg gagatgggtg caatctaaat ctaacgtttc 4380 cgacgaggca actaagtgga atgatggacc gaatcttgat tctcacggtc cctggttcaa 4440 cggacccaaa tttcttcttc tcgaggaagc gtcatggcca gaacaacgag tcatgagtac 4500 gacactggaa gaaatccggc gagttcagat tcaccgggaa catgaaccgg cgatagagta 4560 tttccgtttt aacaagtgga ctaaattaca acgcacaaca gcctatgttg ttcgtttcat 4620 cgatagcttg cgccgacgta atgttggcca aaacctgcaa cgagaaattc tcacccaaga 4680 agagttggca cgagctgaag ttctcctttg gaagatggcg caaatggagg cgtttccgga 4740 ggaacaaatc gtgctgaaga aaacgatggg tcccccaaat gtaaagcatg ctaatgtttc 4800 caaatctagc tcaatataca aaacctggcc gtacatggat gcacaaggaa tcttgagaat 4860 gcgaggacgc attggagcgg catattacct gccatttcag gcaaggtatc cagtaattct 4920 tccaagacaa cattggatta caactttgat tgttgactgg tttcatcacc gctatcgcca 4980 tgctaaccgt gagaccgtga tcaacgagtt gcgtcaacgt tttgaagttg caaagttgag 5040 atctcttgtg caaagagtgt cgaagaactg tgtatggtgt cggattaaca aggcctttcc 5100 aaaacccccg gtaatggccc ctcttcccga ggaaagacta gaaccatttg tgcgcccttt 5160 cacttacgtc ggcgtggact attttggacc attatacgtg aaagtaggac gatctcaagc 5220 aaaacgatgg gttgcactgt tcacctgcct atcgatccga gcagttcata tggaaattgt 5280 gtatagtctg tctacagagt catgcgttat ggctatcaga agatttgtat cacgccgagg 5340 ttcacctgct gttttcttct cggacaacgg tacctgtttc caaggtgcca gcaagcagct 5400 gaaggaagaa gttaatacca gaattgaagc aatcgcggac accttcacag atgccgaaac 5460 taagtggaaa tttatcccac cagccgcccc acatatgggt ggcgtatggg aacgtttagt 5520 ccggtcggtg aaaacagcgc taggaacagc attggaacat cctcgaaagc ctaacgacga 5580 aacgctgggg acaattattc aagaggcaga agcactggta aactccagac cactcacata 5640 catcccgtta gaatcggcag accaagagtc gttgacacca aaccatttcc ttctgggtag 5700 ttcaagtgga aataagatag aaccagctgt cgtaacagaa cacgctacac tccgtagcag 5760 ttggaagatg tcgcaataca tagtaggaga gttctggaag aggtgggtaa aggagtactt 5820 accggtaata actcggaggt gcaagtggtt taaggaggcc aaggacctag cagaaggaga 5880 tttggttctc gttgtaaacg ggacagaaag agctaaatgg gtaagaggac gaatcgagaa 5940 aacatttgcc ggttgcgacg gacgagttcg ccaagctcta gtgcgtacag caaatggagt 6000 attgcggcga cctgctgtta gactagccgt gttggacgtc gaagaaagac gtgaacctaa 6060 ggaacttcag gaagatccag gaaatcacct aggttcacgg gcgggggta 6109 // ID CR1-1_NVi repbase; DNA; INV; 4642 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4642 RA Bao W. and Jurka J.; RT "CR1 families from Nasonia vitripennis."; RL Repbase Reports 9(4), 748-748 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(15..437,406..1518) FT /product="CR1-1_NVi_1p" FT /translation="MAGVRMPPDKCVSNFECHSNKTSTTLFCIICDAVYHF FT SCLAQKTTDIKIISGNLIVCPEHSDTNLTSKIEEEGLDDATRLLIAQIKLN FT KTEEVRRDLLLEVSTAEQDELDINIEGKYKLIKNENLLLKQLVSELQEKKH FT TTSVNCRKKNILLREKLLNKQNESAAASFSYAEVTSFPKLLQKKVPKITVK FT NVSNKAVDVKKKVIECLVTEKKIQTKNLYVNRKEELVINCLNSESAMLTES FT VLKHNLEGICEVRKDTLKNPKIKIVGIDNYSNMEMEDIEDDINMRNFGSFT FT NKGQVLHMYKSNFSSLTTVLMEVPADIYKFIKENKSRIFVGYQNCKVYDSI FT NINPCYNCGRFGHNGNKCNNSSVCLKCAEKHKTSTCTNNAIKCLNCVFSNS FT KYNTKLDVNHAANDTNQCNILKKKISIYIDSTDYPIRPTLPAVHQVYRERD FT NTTDTHNPTTDNTMKQPLQELPKSTKQKHERSEAAKQQQLQQLKMARTPQQ FT QAAPAKNTRLRSSKT*" FT CDS join(1775..2731,2706..3575,3517..3882,3875..4393) FT /product="CR1-1_NVi_2p" FT /translation="LIGSNYEYESYYNDSRINRNDGVIVFVNRNLVQSTEI FT VQQGRIRIVNTKINLSNNSNIEISSMYRSHGIPKTEFILDLNKYLIKKRNI FT KNHLVIGDFNIDILDCDHVSQEFLCNFFDKGYIPGFVGITRPPVGIAKGSC FT IDNIFIKTDNLNTITYKLKLSITDHYPIFIALNKMKIVHNKPTIVLNYKKL FT INSAETRNWNEIITMQDPNLATDKLLYEIKQCIELSKMNQKRIKQVGRKNW FT ITDAIIKSCETKEFLYNLWKLDINNEQLKTEYKTYSKILEKVIKDAKFRYD FT KKLVEKNANNPKKLWEIINNKLGKKRVTINLAKKESNETISCIKINDIKVT FT EKVEIARNMNKFFCEIGQKLSNDIVQPNNAEFRLPDINANSIFLKPTNMNE FT VKNIINTLKIKNGGVDKIKAKSLKVICNYIAVPLTHIFNLCIEKAIWPDAL FT KKAEIIPIYKAGEKHNITNYRPISLISNVAKIFERIIYNRLYDFIQKNNII FT SKHQFGFMKNIGSKNALNYITQILYNNVDKTIPTIVTFLDLAKAFDTVDHK FT LLLNKLYCIGIRGQALDLLRSYLNNRYQTVKIDGVESDSLLISMGVPQGTI FT LGPLLFIYIYKVWVFHRVRYWVHSYLFIYINDVLKEIPLESILSYADDTAI FT IATGNTWIEAQKSMNEFLTVIYKWLAVNKLSLNVDKTVCMTFGSYCDSVPK FT XIDVRIQNXKIARVEXYKYLGIVIDXNLKWVNGYKHIQYIYNKKXKVSSLY FT FSQTITXYVNRNFKNDILCXLPXHNXLWNSSLGNGAYRNNRDLLQSLQRKI FT LKIINKNTFSIQDSPLNLEQVFNFESLKIHYQDLKTQFQMSTXITRKKSLI FT IPKSKKRVSDKNSYNNAIIIYNXLPNELKELDISKTAAKXKIKYWIKTXT* FT " XX SQ Sequence 4642 BP; 1877 A; 670 C; 764 G; 1298 T; 33 other; aaaaagccag caacatggcg ggagtgcgga tgcctcccga taagtgtgtc tcaaattttg 60 agtgtcatag caataaaact tcaacaacat tattttgcat catatgtgat gcagtatatc 120 attttagttg tttagcacaa aaaacaaccg acatcaaaat cataagtggt aacttaatag 180 tgtgcccaga acatagtgac acaaacctaa cctcaaaaat agaagaggaa ggtttagacg 240 atgcaacaag attactgatt gcacaaataa aactgaataa aacggaagaa gtaaggagag 300 atctactact tgaagtttca acagcagaac aggatgaact tgacatcaat attgaaggaa 360 agtacaagct aataaaaaat gaaaacttac ttttgaaaca gctagtcagt gaattgcagg 420 aaaaaaaaca tactacttag agaaaaattg ctaaataagc aaaatgaaag tgctgcggct 480 agtttttcgt atgcagaagt tacaagtttt cctaaactat tacaaaagaa agtgcctaag 540 ataactgtaa aaaatgtgag taataaagca gtggacgtca aaaagaaagt tatagagtgc 600 ttagttactg aaaagaagat tcaaacaaaa aacttgtacg taaacagaaa agaagaactt 660 gttatcaatt gcttgaatag tgaaagtgct atgttaactg aatctgtttt gaaacataac 720 cttgaaggta tatgtgaggt taggaaagat acgttgaaga atcctaaaat caaaattgtc 780 ggaatcgata actacagtaa catggaaatg gaggacatag aggatgatat aaacatgaga 840 aattttggaa gttttactaa caaggggcaa gtactacata tgtacaaaag caatttcagt 900 agcttaacta cagtactaat ggaggttcct gctgacatat acaaattcat caaggagaac 960 aaaagcagga tttttgttgg ctaccaaaac tgcaaagtat atgactctat aaatataaat 1020 ccatgttaca attgtggaag atttgggcac aacggtaaca aatgcaacaa cagttctgta 1080 tgcctcaagt gcgcagaaaa gcacaagact agtacctgca caaacaacgc cattaagtgt 1140 ctgaattgtg ttttttcaaa cagcaaatac aatacaaagc ttgacgtcaa ccacgctgct 1200 aacgatacaa accagtgtaa catcctgaag aaaaaaatca gcatatacat agactctact 1260 gactatccta ttaggcctac cttaccagcg gtacatcaag tctatcgaga acgtgataat 1320 actacagata cacacaaccc aaccacagac aacacaatga aacagccttt acaagaacta 1380 cctaaatcta caaaacaaaa acatgaaagg agtgaagcag ccaagcagca acaacttcaa 1440 caattgaaga tggctagaac accacaacaa caagctgctc cagcaaaaaa tacaaggtta 1500 agaagcagta aaacttaaat taataaaatg gatacttttg attatctaga tataatacaa 1560 aaggattcaa tgcagaaaga aaaaaacata cagtggtatt ggcgatctaa acaagacaat 1620 tagcaacaaa aatgatataa ttatgcactt aaatgtaaga agcatgaatg caaattttga 1680 caaggtaaaa atattttttg aaagtctgat cgtcaaacca tctgttgttg ttctttctga 1740 aacatttgag caggtaaatc atacattttt ttaactgatc ggaagtaact atgaatacga 1800 gtcttactac aacgatagta ggattaatag gaatgacgga gttatagtat tcgtaaatag 1860 aaacttggtt cagagcacag aaattgttca acagggtagg attagaattg taaatacaaa 1920 aattaactta agtaataata gcaatattga aatctcatcc atgtacagat ctcatggcat 1980 acctaagact gagtttatat tagatttaaa taaatatttg ataaaaaaga ggaacattaa 2040 aaatcattta gtaataggag atttcaatat tgatatattg gattgcgatc atgtgagcca 2100 agaattcctg tgtaatttct ttgataaagg atatattcct ggatttgttg gaattacgag 2160 gccacctgtg ggcattgcaa agggatcctg cattgataat attttcataa aaactgataa 2220 cttaaataca attacttata aactcaaact atctataaca gaccattacc ctatatttat 2280 agctctcaat aaaatgaaaa tagtacataa caaaccaaca atagttttaa attataaaaa 2340 attaataaat agtgctgaaa caagaaactg gaatgaaatt atcacaatgc aggatcctaa 2400 tttagccaca gataagttgt tgtatgagat taaacaatgt atagagttat ccaaaatgaa 2460 tcaaaaaagg attaaacagg tggggaggaa aaattggatc actgatgcga taattaaatc 2520 gtgcgaaacc aaagaatttt tgtataactt atggaagttg gatataaata atgagcaact 2580 taaaactgaa tacaaaacct attccaaaat tttagagaaa gttattaaag acgccaaatt 2640 tagatatgat aaaaaactag tcgaaaaaaa tgcaaataac cccaaaaaac tatgggaaat 2700 cataaacaat aaacttggca aaaaaagagt ctaatgaaac cataagttgt ataaagataa 2760 atgatattaa agttacagag aaagttgaga tagctagaaa catgaataag tttttctgtg 2820 agatcggcca aaaattgagt aatgatatcg tgcaacccaa taatgctgaa tttagactac 2880 cggatattaa tgcaaactca atttttctaa aacctacaaa tatgaatgaa gtaaaaaaca 2940 taatcaatac attgaaaata aaaaatggtg gtgtggacaa aataaaagct aagtccttaa 3000 aagtgatttg taattatatt gcggtaccct taacgcatat ttttaatttg tgcattgaaa 3060 aagctatatg gccagatgca ctaaaaaagg ccgaaatcat tccaatatat aaagcagggg 3120 aaaaacataa cattaccaat tataggccca tctcactcat atctaatgtt gctaaaattt 3180 ttgaaagaat tatctataac aggctatatg attttatcca aaaaaataac ataatatcaa 3240 aacaccaatt tggttttatg aaaaatattg gctcaaaaaa tgctttaaac tatatcacac 3300 aaattttgta caataatgta gataaaacga taccaacaat agtcactttt ctcgacttgg 3360 ctaaagcttt tgatacagtt gaccayaagt tgttattaaa taaactgtat tgtattggta 3420 ttagaggtca ggcattggat ctccttagaa gctatttgaa taacagatat caaactgtta 3480 aaattgacgg tgtagaaagt gatagyttac taataagtat gggtgttcca cagggtacga 3540 tactgggtcc actcctattt atttatatat ataaatgatg tkctaaagga gattccacta 3600 gaatcaattc tatcgtacgc agatgatact gcaatwatag caacaggcaa tacrtggata 3660 gaggcgcaaa aaagtatgaa tgagtttctt actgtgatat acaaatggtt rgcggtaaat 3720 aagttatcty traaygtgga taaaacagtt tgtatgacat ttggaagcta ttgcgayagt 3780 gtaccgaaac wtattgatgt aagaattcaa aataraaaga tagctagagt tgagwattac 3840 aaatacttgg gaattgtcat tgaytwcaat ctraaatggg tataaacaca tccaatatat 3900 atataataaa aaaracaaag tatctagtct atatttttca caaactatca cayactatgt 3960 caacagaaac tttaagaatg atatattatg cwttcttcca yagcataaty agctatggaa 4020 tagtagcctg gggaatggtg cmtatagaaa taaccgtgat ttattacaaa gtttacagag 4080 aaaaattctc aagattatca acaaaaatac ttttagtata caagatagtc cattaaacct 4140 agagcaagta tttaattttg aatcccttaa aatccattat caagatctca aaactcaatt 4200 ccaaatgtct acaartatta ctcgaaagaa aagtttaatt atacctaaga gcaaaaaacg 4260 agtcagtgat aaaaacagtt ataataatgc tattataatt tacaatkcac taccaaatga 4320 gctcaaggaa ttagatataa gcaaaaccgc ggcaaaamgt aaaattaagt attggatcaa 4380 aaccaawact tagttttcca gatagtttta aattgttaaa tttgttaaat tytaagaaaa 4440 gaaaatgtaa tgagtgttta ttgttgttga aatkrttttg ataatgtaay tatctttagt 4500 yttaagtact agttttaagt yttttattct tatgttatat accaagcttg tacacaggta 4560 acttagttta cctctacaag gcttgctggt atgcgtattc ttaaggtata ctatgtaagc 4620 tattatcttg gtttaatawa ta 4642 // ID BEL-66_AA-LTR repbase; DNA; INV; 596 BP. XX AC supercont1.276; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-66_AA_; KW BEL-66_AA-I; BEL-66_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-596 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.276; Positions 199541 198946. XX SQ Sequence 596 BP; 210 A; 103 C; 112 G; 171 T; 0 other; tgttacgggg aatattgcca tcgctttgaa tacccctcga tccaacataa acgcggcgcg 60 ctagattcct cagctgtcat cacatctgcg aatgacactg caaagcgtgt gacctaaatg 120 tctaatggtt tagtttggat aagattgata tctcagacag agcaagagag tgataaaaga 180 ttaatttgca atttagattg attgagcgtt tacgaaatta ttgtaagtag tatgttgaaa 240 aatatgaaat atctgaacta aactccgaat ttaatacaga tttgataaac cctagtgcag 300 ggttgtatga ttgagcaaat attggaactt ggataaccta taaaaggatg taattgcaat 360 tctgtaaaaa tttgtaaagc taaaaggttg aatttcaggg caaacccaat ttgtaagatt 420 acaaactaga ttatttgttg gccacaaaag ggactgcgga cgaaatatgt aagttcccct 480 tagaaatgta cactcaaaac ccaaaattct aataaaattg atttagcttt tagcgcattt 540 atcaccaaac tatccggtgt agctgctcaa aagatcccga ataccctaaa ccaaca 596 // ID hATm-27_HM repbase; DNA; INV; 3974 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-27_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3974 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1921-1921 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1888..2622 FT /product="hATm-27_HM_1p" FT /translation="MKQTAQEMAAIAKEDVVLSIQASPYPCIVHFDGKTLF FT ETNNGKKLKNDRLAVLATIDKESHLLGVTPLASSSGEDQYNGVMKLLKEYN FT LESKIGLLCFDTTSSNTGIHKGSLVRISTALNKYLILLACRHHVSELRITH FT FCEAVTNEKTTAPDNLLFKQFKNMFEQPNFEYKPSLLVKFSWDEVKGTVVE FT KAAIESLDYCRFYLSRNDIAREDRKELAELVVSYLSLLLTLKLKRLELFTM FT RDF*" XX SQ Sequence 3974 BP; 1500 A; 587 C; 621 G; 1264 T; 2 other; ggggtgttca aaaaaaaaaa atttcaaaac gcgaatacac aataagccaa atttggggca 60 aacataaaaa aaatacgatt aaaaaaaaaa attcaaaaat attgacattt accttgtgct 120 caagcggcaa aataccccga aaaatgcccc aaatagttcc caagtaattt ggctttgagc 180 gctaggtaaa tctcaatatt ttgtaatttt ttttttatat tttgctcata atggtccaaa 240 ctttttttta gtttgttgtg tctcctgttt ttttcataat tattgtttaa attaaaaaaa 300 gattttgcaa aaaaatacgc aaaaaaatag ttttaacgcg catttaacaa aagaaataaa 360 cttttactta atgtcaaaag cgtgaaaaca ctttagtgct atttaaaatt tacttttgtt 420 tgcttgtttt gcttcactgc atttttctat tgctaatttt tcttaaataa taaacatttt 480 aaaaagtcat aaaaagaaaa aactgacgac agtaaatagt gttgaatata atctaaaaat 540 attgttccag ttataatgga aagtttaata gaaaacaaca agagtcacaa aaattgatag 600 ttgtaagtgc aaatagcagt gagaaactat taaaaaatgg tagagaaaag cgaaataaaa 660 aaatgaaaaa aacacatttt attattgagg aagtcaaacc agcaatacca ggtaatgaat 720 tttttctgag ttctttttta gttttaaata ttatttcaat aagatttcac gttgcaaagg 780 atcaacctta ctgaattttt tagttttttt tttagttgat caactaccaa cwggattgga 840 tattttacag cacatagctt ttaatcagtc tatgcttcct aaaacatcaa aaaaatcttt 900 aataattgct tgttcagtaa aaaaaaacag gaaatatgaa agatgtgttg aacgacattg 960 taagtgtatt cttagccaaa taaagggttc gtggcttaaa gctggatttc ctcttcaatc 1020 agataacagt ttgattcaaa agctgaacaa gttgaacaaa aaatatgtta atttaaaaaa 1080 aaaacatgta ccggatgtcg tctttagaat tgaaaaagca acaagacttc agaaaagaac 1140 tgggaaaaat cttttgggca ggaatacctg acttacaaag ctggataaaa aaggacaaaa 1200 atagatccga cctagataag aaagaggacc tagaatttat atacgatcaa gaacacgaca 1260 gaaagttatt tctagggcca gaagataaga aatacaggaa gaaggtaatt catttaatga 1320 cctttttttt cctgaattta aaaactacct tttgctactt tttcattatt gagctttatt 1380 tttgctttaa tactttttac attattatct tttttaattt tacaaaattt tgataaagaa 1440 atttaaaaag aataatatca aataataata ataacaataa taataataac aacaataata 1500 ataataataa taataacaat aatattaata atggtgcatg ttgtttagat tttagagaat 1560 acaagaaaaa gaaattcatc taaaaaagat gctgcaaaat tactggaagt aaggaattca 1620 gagttggaat ttagtccaac acgagacccg agcgaagatt catcgtttga ttcagattat 1680 gacccaggat attggcgttc ttcggagtct aatagcataa taaaagctga gattccaaaa 1740 gatattcttt ctggtaatgt tgccctaata gctacagtta ctgatatttc tcccaatgtt 1800 ttgacaaaag ttacggctgc aatccttgat agttgtagcg ttgatcttac aaatgtaagt 1860 tgcagcagtt cgacagcatc tagaaagatg aaacaaacag ctcaagaaat ggcagctatt 1920 gctaaagaag atgtcgtact tagcatccaa gcgtctccct acccatgcat tgttcatttt 1980 gatggaaaaa cacttttcga aaccaataat ggaaaaaaac ttaagaatga cagactagct 2040 gttctggcaa ccatcgataa ggagtcacat cttctcggtg ttacgccatt agcctcttca 2100 tctggggaag atcagtacaa tggagtgatg aaactactaa aagagtacaa tcttgaatca 2160 aagattggat tactttgctt tgatacaaca tcaagcaata caggtataca taaaggatct 2220 ctagtaagaa tatctactgc gctaaataag tatctaattc tcttagcttg cagacatcat 2280 gtttctgagt tgagaatcac acacttctgt gaagctgtaa caaatgaaaa aactacggcg 2340 cctgataatc ttctgtttaa acaattcaaa aacatgttcg agcagcctaa tttcgagtat 2400 aaaccatctt tgctagttaa attttcttgg gatgaagtta aaggaactgt tgttgaaaaa 2460 gctgcaatag aatctcttga ctattgtagg ttttacttat caagaaatga tattgcaagg 2520 gaagatagaa aagagctagc agagctagtg gtcagctatc tttctctgct tctaacatta 2580 aaattaaaaa gactggagct attcaccatg cgagattttt agggaaatct atatactatt 2640 taaaaatgca aattctttca aatcagattg attttcttct aaaaagtaaa gaacatatta 2700 aaataataac agaatttata gcttgctttt atgcaaagtg gtatctgcaa tcaaatgata 2760 caatcaaagc tccttacatg gatgttactg ccattcatca aatgcatcaa tataaaacag 2820 tatgtgcaga accagatgcc gtaaatgctg tgttgaattc tttattcaaa catacatggt 2880 atctagattc aactctgatt cctttagctt tgctggatga tgacgttaca cttgaagaga 2940 agaaaaaaat tgctgctgca atattatcat atccaaaacc aatgccatgc tattttaaaa 3000 cgaaaaataa acagaacaaa gacataaaaa aaatgttaac acttgaacta gacattcatc 3060 agcagcctcc tagcttagct cctttggttg acgaattctc ctggttaatg tttgagatgg 3120 ttggaataga tgaacagcga attgaggact ggctgacctt gccacctcag tattggcaca 3180 cgcaatcatc gtttagatta tttctaaaat ttgcaaaaag tattgtttgt gtaaatgacc 3240 atgccgagag ggctattgga atgatgcaac aatttgttca ccgctacaga gatgaggaag 3300 aaaaacaaaa tagacttatt actgtagaca aagttcgttc aattctcaag gcatcagaca 3360 gcgaatcgtt atcaaccaat aaaataaata agcaagtatt aagcaagaaa agaatcacag 3420 atggattact taacatacgc tccaaaatac agaaaataaa ttgaaacaag ttacaaaaaa 3480 ttaaaaaagt ttgtatcatt caatcaactt ttattttatt ctcaaaagat taattaaaat 3540 aaaattaaat ttctttttga gacgttgaac aatgtgatta tatataagtc atataaatgc 3600 agaattatta tgcattattt tgttaaaagc gcgttaaaac tttttttttk tgcgtatttt 3660 tttgcaaaat ctttttttta tttaaacaac aattatgaaa aaaccagtat acacaacaaa 3720 ctaaaaaaaa agtttggacc attatgagca aaatataaaa aaaaaattac aaaatattga 3780 gatttaccta gcgctcaaag ccaaattact tgggaactat ttggggcatt tttcggggta 3840 ttttgccgct tgagcacaag gtaaatgtca atatttttga attttttttt tttaatcgta 3900 ttttttttat gtttgcccca aatttggctt attgtgtatt cgcgttttga aatttttttt 3960 tttttgaaca cccc 3974 // ID BEL-646_AA-LTR repbase; DNA; INV; 462 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-646_AA_; KW Pao_Bel_Ele195; BEL-646_AA-I; BEL-646_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-462 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 462 BP; 168 A; 78 C; 80 G; 134 T; 2 other; tgttaccggt accgtcgagt tgcgcaccct atctcaaccg gttccctcac tggattgata 60 aggtgccact ggtcgatgtc aaagagatca cgataacaas aaaacgacga ttatatgcat 120 agagcgtgga gtgaattaca ttaaagctat ttttctacta tctaaaagtt atmttacatt 180 ataaataatc tattgtgctt aaaactagtt tcttagccta aatacgaact agtggaggta 240 agaagattga attaacattg aatttatatc ctaatatgag attcattgta cacctaccca 300 ggagctaaaa ctaacctaaa aggaagtcag ttggacagtg aaatttgagg aaaattataa 360 tttgaagggc aactgaaaag taagttaact tataaaccta atagaacgct taactaaatt 420 aattcacgtt acagttttga agctacaata aatcagctac ca 462 // ID BEL-64_AA-LTR repbase; DNA; INV; 661 BP. XX AC supercont1.274; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-64_AA_; KW BEL-64_AA-I; BEL-64_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-661 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.274; Positions 1231874 1231214. XX SQ Sequence 661 BP; 214 A; 112 C; 130 G; 205 T; 0 other; tgttgcggca cgcagcaagc ccctcggctg gctttggatc acactggcta ccacagtgca 60 ttggtacgga atagatgttg gtagcaggta agaaacaagg atgacataat ttggaaacgt 120 tccgtcaatc gtgatgataa gcaaaattga tcacttcaca gttatagttg ctattgtggg 180 aaatttatta ttgattatta ctgaactaat ttacattgaa gttgaaaatt aaaagttgaa 240 ttactattac tattactatt gaatacggta agaaagtgta gaatatcctt atcaaagacc 300 aatggtttaa tattagattt gttttgcgtt tagaatctgg catttcgaga tagcgtcaaa 360 ttatacccaa tcaggatcgt tccattgcat tggtaattcg aatacaggaa ataagtatgc 420 attaatttct tctcataaga taggtgaatg atcgaatgtc ttttgtatag ccgaaatatt 480 agcaagccac tttagcagtc ggaacaacgt agtccccatc acacggtaga acaccaaaat 540 tgtaagtgta ggttgttacg aatgtgaatg ctaattgtta aatcaataaa cttattccag 600 cttaaagcga tcaatcacac atcccgtgtt tagctacaaa gatttggtgc ctctccgtac 660 a 661 // ID Chapaev-18_HM repbase; DNA; INV; 3105 BP. XX AC . XX DT 11-MAR-2009 (Rel. 14.03, Created) DT 11-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Chapaev DNA transposon -a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-18_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3105 RA Jurka J.; RT "Chapaev transposons from the hydra genome."; RL Repbase Reports 9(3), 652-652 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(450..1076,1262..2662) FT /product="Chapaev-18_HM_1p" FT /translation="MPTFAKNHEECRKSVCVICMKKGDQELTENFKAKILQ FT QIQKEINFNDDRVPLATCISCRSHIGKLCDGKTNVVPQIYDFESISIKPLT FT RFSTVCECLICKIAKLKGKERHPLSKVQDSAPKVPQQSSQNAEKHCTKCLS FT VIGRGLPHYCTPGTRHENLRRMAASDPIGAEQVAATVLASKSASPHGTIRL FT SQPCGKLLPLRKGNSAFELLGPSSAQQLFKEPLTTQNMVQIQQNIGLSNNG FT MRKLGSALNQISPIRLVEPSFQQKFAQAGQKLCDHFTVSSVTLAENNQISQ FT VVHCQNLSSLSDVFLASRKMTDSAILKLGIDGGGSFLKVSFTHIIVDEGET FT PPHSPLQKTIKLMSPQTTKSTSVKQQLLVAIAQNTPENYHNVKAIFDLIQV FT QEKCQMDSLVISCDLKLANILCGIQSHSSKHPCCWCDVSSLNLQNCGSPRT FT FGEIRRQHFEFINAGGDTRKSKEFENVVHVPLVDFPDHKLVLEAIPPMELH FT LLLGVVNHLYKNLCKIWPGAEKWPASLHIPIQPYHGGHFNGNDCMKLLRGL FT DKLQLVTGYNNFSQAHDFMQTLALFKDVVISCFGNNLDPDYESKISKFKQS FT YIRLPISVTPKVHAVFYHVPQFIKIKKTGLGLFSEQATEALHSNFKVHWER FT YKRDSSHPDYASQLLKCVIEYNSKKI*" XX SQ Sequence 3105 BP; 1042 A; 554 C; 556 G; 953 T; 0 other; cacggtttgt aaatttggga ccctaatatc aatgaaatac cctatgaaaa cccttttaag 60 gggggatagt acatgccaat gggagtctat attgagttgt tttgaaatcg gggattaaca 120 tcaaaaagaa ttatttgatc atgaaggttt tttaaaaggg ttgatttatg agccaaatta 180 agctgcctat taatggtact aaacttgctc ccttaatttt ttttataaaa taaagattgt 240 agcccatgaa acgtaaaatg tttattattt tattcatttt ttttgaaata attctttatt 300 gagcgttttg atcagtgcaa gcgtaaaatc gaactttcga aactaacgaa aagagtaaac 360 aaatatattc tgggaaaagg cagtacactt taagatagtt tttgaaggaa aacatatttt 420 tttgtgaaca aatatttaag aatttaaaga tgcctacatt tgccaaaaac catgaagaat 480 gtagaaaatc tgtttgtgtt atctgtatga agaaaggtga tcaagaattg actgaaaact 540 tcaaagctaa aattcttcaa caaattcaga aagaaatcaa cttcaatgat gacagagtac 600 cattggcgac ctgcatctcc tgcagatctc acattggaaa gctgtgtgat ggtaaaacaa 660 atgttgtgcc tcagatttat gactttgaat caatttcgat caagcctctt actcgttttt 720 ctacagtctg tgagtgcttg atttgtaaga ttgccaaact caagggaaaa gaaaggcatc 780 cattaagcaa ggttcaagat tcagcaccaa aagttcccca acaatcttcg caaaatgcag 840 aaaagcattg tactaagtgc ttgtcagtta ttggtcgtgg gcttcctcac tactgcactc 900 ctggaacacg ccatgaaaat ttgagaagaa tggctgcctc tgatcccatt ggtgcagagc 960 aagttgcagc aacagttttg gcatcaaaaa gtgcatctcc tcatggcacc atcagactca 1020 gtcaaccttg tggaaagtta ttgccattga gaaaaggtaa tagtgctttt gaactttaaa 1080 tgattaaatt aaaatataaa tattatctat ttatttttca aaaacttact taatttaaaa 1140 ataaaatttt ctctgcaaat atatttgcag agaaaatttt atttttaaat taagtaagtt 1200 tttgaaaaat attaatattt tttgaaggtc atgaactatt ttttaaatgc aatttttata 1260 attaggacct tcatcagcac agcaactttt caaagagcct ctcacaacgc agaatatggt 1320 tcaaattcag cagaatatag gactttcaaa caatgggatg agaaaacttg gatcggcact 1380 aaatcaaatc agtcctatca gattggtgga gccaagtttt cagcaaaaat tcgctcaagc 1440 aggccaaaaa ctttgtgatc acttcactgt gagcagcgtc actcttgctg aaaacaatca 1500 aatatcccaa gttgtgcatt gtcaaaacct cagtagtctt agtgatgtct tcttagcatc 1560 aagaaagatg acggattctg caattttgaa actagggatt gatggtggag gatctttttt 1620 gaaagtcagc ttcacacaca tcattgtgga tgaaggtgag acaccgccac atagcccact 1680 gcagaaaaca atcaaattga tgtctccaca aacaactaaa tcaacaagtg tcaagcagca 1740 actactggta gcaattgcac aaaacactcc agagaactac cataatgtga aggcgatttt 1800 tgacctcatc caagtacaag agaaatgtca gatggattct ctggtcatat catgtgacct 1860 aaagttggca aatattctgt gtggcattca atcccatagc agcaaacacc catgttgttg 1920 gtgcgacgtt tcgtctttaa acctccaaaa ctgtggaagt cctcgaactt ttggcgagat 1980 taggagacaa cattttgaat tcatcaacgc aggaggagat acaagaaaat caaaggagtt 2040 cgaaaatgtc gttcacgtac cattggttga ttttcctgat cacaagcttg tgcttgaagc 2100 tatccctcca atggagctcc atttgttact tggagtagta aaccatttgt ataaaaatct 2160 ttgcaaaatt tggccagggg ccgagaaatg gccggcttca cttcatattc ctattcagcc 2220 ctaccatggt ggacatttca atgggaatga ctgcatgaag ctcttgagag gtttagacaa 2280 gttgcaattg gtcacaggat acaacaattt tagtcaagct catgacttca tgcaaacact 2340 tgcactattt aaggatgtgg tcatttcttg ctttggcaat aatttggatc ctgattatga 2400 atcgaagatc tccaagttca aacaaagcta catccgccta cccatttcag ttactccaaa 2460 agtacatgca gtgttttacc atgtacctca gttcatcaaa atcaagaaaa caggattggg 2520 tctcttcagt gagcaggcaa cagaagctct gcactccaat ttcaaagttc attgggagag 2580 atacaaacga gattcctcac atccagacta tgccagccaa ctcctgaagt gtgtgatcga 2640 atacaacagc aagaaaatct agatctaaag aacagtgtta aaaaaatcct aagaatactc 2700 ttaacagatt tgtgatattt tagtgccttt attttgttcc agatgattcg tctctagaga 2760 gcattataat ttttaatgga attatagaaa ttgttgttta tcttctaaat gcgatcatta 2820 agaaaataaa tgaataaaat aaaaaagctc ccaatttatg tttttaacat tttttgtttt 2880 atgataaaaa atcagaaatg tatttttgta ctacaaatag gctgccaaat gttgcttaaa 2940 attcttattt tcttcaaaac ttcaaggtaa aaaatctcca ttggatgtta atgcaggatt 3000 tcaaaatcac tctgtatggc catccattgt tatgtactat ccccccttaa aagggttttc 3060 atagggtatt tcattgatat tagggtccca aatttacaaa ccgtg 3105 // ID SAT-6_NVi repbase; DNA; INV; 142 BP. XX AC . XX DT 13-MAY-2009 (Rel. 14.06, Created) DT 13-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Nasonia vitripennis satellite repeat. XX KW SAT; Satellite; Simple Repeat; Nonautonomous; SAT-6_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-142 RA Bao W. and Jurka J.; RT "Satellite repeats from Nasonia vitripennis."; RL Repbase Reports 9(6), 1162-1162 (2009). XX DR [1] (Consensus) XX SQ Sequence 142 BP; 40 A; 33 C; 25 G; 44 T; 0 other; cacctctcga aaataagtcg acattgcatg aaatcaggac gattttggcc ctactttagg 60 ggctgttttt ttatttactg catttccgac atcggattcg gattcagcgt caaaaaatac 120 accgaaaacc cgtatcttta ct 142 // ID Gypsy-30_CQ-I repbase; DNA; INV; 4939 BP. XX AC AAWU01003903; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_CQ_; KW Gypsy-30_CQ-LTR; Gypsy-30_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4939 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 439-439 (2011). XX DR GenBank; AAWU01003903; Positions 8762 13700. XX CC Positions [3822-4289] - Integrase core CC 'GCGGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 511..1488 FT /product="Gypsy-30_CQ-I_1p" FT /translation="MKFVYSVQLQMDTSSKLRCSLAPFDATVDSQDLRREW FT EEWLRAFELEMEMQNVFTQHEKFVRLLSFGGRGLQRIFYNLKPVPEEIIPE FT VVPVPMRPPEDPEYDNAIKRLDKFFVGKVNDRVELEIFRSLRQRPDESFKH FT YLLRLRTQAARCEFNEREDKEILQQISMAAKDEKVRDKGLENIMDLDALTN FT YANNREILIQQKEKTKPFVAEEVLATSKHRGSGWSAGPSRSRGDSKPRGRF FT GTGRPRVVCTRCGSWNHDSNSNSCTARGLRCRQCGKIGHYARKCPGRGQNE FT WRRPENDANALDEAEAWKEEVPRRPKPEDISKVE" FT CDS 2502..4787 FT /product="Gypsy-30_CQ-I_2p" FT /translation="MFGITCAPEIYQRIMTEMLAGIEGVIIYIDDIVVAGR FT TLQEHDERLREVLAVLERNNTMLNRGKCIIRVQELEILGFKVSAHGISPSA FT EKVQAIKNFRKPETKEEVRSFLGLMNFVGHFIPHLSTRSEPLRKYLRGEIA FT TFGLEQAQAFDDLRNELSNHVRKLGFFDPRETTELYVDASPVGLGAVLVQR FT DSNNTPRIVSFASKGLTVSERVYPQTQREALAVVWAVEKFFFYLFGLHFTL FT YTDHKTLEYIYGGKHQIGKRACSRAEGWALRLQPYDFGVEYLPGTDNISDP FT LSRLVNQDDPAFDDNAEHYLLAVGEGPAAITFNEIRRETSNDDNLSAVMKA FT LETGQWPPELFRYQAFDKELGVVDGMLIRGERIILPVKLRARALEIAHRGH FT PGVVSMRRNLRERVWWPCMDRDVKDNIQKCRGCAVVSKQNPPEPMQRKQMP FT ERAWQEIAVDFFTAKECATFLVVIDYYSRFLKVIEMKTTTAAKTIEALESV FT FGEQTYPEVIRSDNGPPFASEEFRNYCLSKNVRLNHTIPYWPQMNGLVERS FT NQGILRTLRIAKASGTDWRTALQEYVHVYNRTPHSVTEKAPLELLLGRPVK FT DLLPSLRTGPGSYRDEGVRDMDAIKKAKGKLYADKHRHAKPSDIKVGDAVM FT VKNYDSGKLEPNFRSERFTVIKRSENDVVIESENGVRYRRCVTHLRKWPLE FT TSPDNPNPEDEPTEGDKNSESTDIEEQTVKRPKHSCDDHVKGTVLARDRPT FT RAKKLPARYNS" XX SQ Sequence 4939 BP; 1375 A; 1081 C; 1333 G; 1150 T; 0 other; attggcgcag ccggtacgat cgagggtagg tatttttttc atctcgagat ggtatttctt 60 gaaatgaatt ggtttgacgt gtgtgcgggc ggagaatggt tgcggaaaga ttttttttca 120 ttcgatggct tagaggggcc tgggatggag aaggaggaaa aatcaaaaga gcgggaccgg 180 ttgaagcccg aagcgccaac acaaaagagc ggggccggtt gaagcccgaa gcgccactaa 240 tacaaaagag cgggaccgga tgaagcccga agcgccacca atacaaaaga gcgggaccgg 300 ttgaagcccg aagcgccacc aatacaaata gcggtaccgg atgaagcccg gaggaagaga 360 gagaaagagg gagaaactga cagtacagcc acagacggta cagatgaagc acgatcaagt 420 ttgaagacac agatgtaaag atgaatcatc tgagagttgt aattgaaatt gacaaaatag 480 ttttcgcacg gatgagagaa attggttatt atgaaatttg tttattccgt tcaattacag 540 atggatacca gcagtaagct gcgttgctcg ctagcaccgt tcgacgcaac ggttgattca 600 caagatttac gacgggaatg ggaggagtgg ttacgagcct tcgagctgga gatggagatg 660 cagaacgtgt ttacgcagca cgaaaagttc gtgaggttgc tctcattcgg agggcgaggt 720 ttgcaacgta tcttttacaa cctgaagccg gtaccggagg agataatccc ggaagtcgtt 780 ccggttccaa tgaggccacc agaagatcct gagtacgaca atgccattaa gagattggac 840 aagtttttcg tcggcaaggt gaacgaccgg gtggagctcg aaattttccg ctcgctgagg 900 cagaggccag acgagtcctt caaacactat ttactacgac tacgcacgca agccgctcga 960 tgtgagttca atgaacgaga ggataaagaa attctgcagc aaatctcaat ggcagccaag 1020 gacgagaaag tgcgtgacaa aggattggag aacatcatgg acctggacgc actgacgaat 1080 tacgcgaaca atcgtgagat tcttatccag caaaaggaga aaacgaagcc atttgtggca 1140 gaggaggtgc tcgctacgtc aaaacatcgt ggaagtggct ggtctgctgg acccagtcgc 1200 tctcgtggtg attcaaagcc aagaggacgg tttggaacgg gtcgacctcg cgttgtgtgc 1260 acccggtgcg gttcctggaa ccacgacagc aactcgaaca gctgcacagc acgtggtttg 1320 cgttgtcggc agtgcgggaa aatcgggcac tacgctcgaa agtgtcccgg tcgtggacag 1380 aacgaatgga ggcgtccgga aaacgatgcc aatgctttgg acgaggcaga agcttggaag 1440 gaggaggtcc cgcggcgccc gaaacccgag gatatctcta aggtagaata ggagttttgt 1500 taaactgttt gttttgttgg aagagagtga ttaaaagctc gacgtaacgg tttttgttgt 1560 tggcgattaa taaaatgaat tgaagccgag actgatgcgt gtgttttcgt ttattttgct 1620 taacaggtca atgattctgg cagcaacgga agtgatgggc acatcttctg catgatcgat 1680 tcctatccag tggaatttct aatcgattcc ggttcatcaa tcaacacgat cacagaggat 1740 gtttgggata aactgactga agccaaggtt aagctgtaca atgagaggtc acactgtagc 1800 cggagcttca cggcatacgc aagccgggac tcgttgtgcg tgctgacaat gttcgaagct 1860 cacgtgtcag tcaacccatg gaagccacac acttatgcag agtttttcgt catcaaggga 1920 gccacgaaat gccttctgag taaacgaaca tccgaggaac tccatgtgct caaggttgga 1980 atcagtgttg ataacattac cgcgaaggtt gaaccgtttc cgaagttccc taacgttcaa 2040 gtcaagttgt ccattgacaa aagcgtccct cccagactga tctcgtacct ccgtgtgccg 2100 gtggctatgg aagagaaggt ggacgccaag atcatggaaa tgctgcaaac cgacattatc 2160 gaaaaagttg aaggcccgcc tgtatggatc tccccgatgg tagttattcc caagggctcg 2220 ggggacgtac gaatctgcat caacatgaaa tatcccaacg aggcgataaa gcgtgaacac 2280 tatccgcttc cggtcatcga tacgttcctg aataaactac gaggcgcggt gtacttctca 2340 agattggaca ttacgtctgc gttctaccac gttgaattac accctgaatc cagagcaata 2400 acaacgttca tgactgcgag aggtttgatg cggttcaaaa ggttagaatg gttgaatttt 2460 tcaggtgaca aattaaaaca aattggttgt attttaggtt aatgttcggg ataacgtgcg 2520 cgccagaaat ctatcaacgc atcatgaccg agatgttagc tggaatagaa ggagtgataa 2580 tctatattga cgatatagtg gtggcgggaa gaacccttca ggagcacgac gagcgtttgc 2640 gggaagtcct cgcagtgctg gaacgcaaca acacgatgct gaatcggggc aagtgcatca 2700 tccgggtgca ggaactcgaa attctcggtt tcaaagtgag tgcccatggc atcagcccgt 2760 cggcggaaaa ggttcaggct ataaaaaact tccggaaacc agaaaccaag gaagaagtcc 2820 gtagttttct ggggcttatg aactttgttg gccattttat cccacatctt tcaacaagat 2880 ccgagccgtt acggaagtat ctccgaggag aaatcgccac tttcggattg gaacaagcac 2940 aggccttcga cgacttgcgt aacgaactgt ccaaccatgt tcgcaagctt ggatttttcg 3000 accccaggga aacgaccgag ctctacgttg atgcttcgcc tgtgggactg ggagcagtgc 3060 tcgtccagcg ggactccaac aacaccccca gaatcgtcag ttttgcttcc aagggcttga 3120 cggtgtctga acgagtctac ccgcaaacac aacgagaagc attggctgta gtttgggcag 3180 tggagaaatt tttcttctac ttgtttgggc tacacttcac gttgtatact gatcacaaga 3240 cgctagagta catctacggt ggtaaacatc agataggaaa acgcgcgtgc tcgagggctg 3300 aaggatgggc tctacggttg cagccgtacg attttggggt agagtattta ccggggaccg 3360 acaacatctc agatccactc tcgcgacttg tgaatcaaga tgatccagca ttcgacgaca 3420 acgctgagca ttatctgttg gcagtgggtg aaggtccagc agccataacg ttcaatgaaa 3480 tcagacggga aacatccaac gacgacaact tgtccgcggt catgaaggca ctcgaaactg 3540 gtcaatggcc gccggagctc ttccgttatc aagctttcga caaggaactc ggagttgtcg 3600 atgggatgtt gatccgtggc gagaggatta tacttccagt aaagctgaga gccagagccc 3660 tggagatcgc ccatcgcggt catcctggag tcgtgtctat gcgtagaaat ctgcgggaaa 3720 gagtttggtg gccctgcatg gatcgggacg tgaaggataa tatccagaaa tgtcgtggtt 3780 gtgctgttgt aagtaaacag aatcctccag aaccaatgca gcgtaagcag atgccggaac 3840 gggcctggca ggaaattgcc gtagattttt ttacggctaa agagtgcgct acttttctcg 3900 tcgttatcga ctattacagt cgatttctca aagtcatcga aatgaagact acaacggctg 3960 ccaagaccat cgaagctctg gaaagtgtgt tcggcgagca aacctatcct gaagtaatcc 4020 gcagcgataa cggcccccct ttcgccagtg aggaattccg caactactgc ctgagcaaga 4080 atgtacgctt gaaccacact attccttatt ggccgcagat gaacggtttg gtggagaggt 4140 caaatcaagg gattcttcga acgctacgaa tcgctaaggc atcggggact gattggagga 4200 cagcgcttca ggaatatgtg cacgtgtaca acagaacacc acacagtgtc actgagaagg 4260 ccccgctcga gctgctcttg ggtcgtcccg tcaaagacct actcccgtcc cttagaaccg 4320 gaccaggatc ataccgcgat gaaggcgtac gtgacatgga tgctataaag aaagcgaaag 4380 gaaagctgta tgcggacaaa catcgccacg caaagccgtc ggacatcaag gttggcgatg 4440 cagttatggt caagaactat gacagcggca agcttgagcc aaatttccgt tctgaacgat 4500 tcacagtcat taagcgaagt gagaacgatg tggtgataga aagcgagaat ggagttagat 4560 atcgtcggtg cgtaacccac ctgaggaagt ggcctttgga aacctcgcct gataatccga 4620 atccggaaga tgaaccaacc gagggcgaca agaattctga atcaaccgac atagaagagc 4680 aaacagtcaa gcgacccaaa cattcgtgtg atgatcatgt aaaaggtact gtgctggcac 4740 gtgaccgtcc caccagggcg aagaaattgc ccgcacgcta caattcttag caaactttgt 4800 ctgtttcaac cgttctgagt atttttttct ttttttttct ctcttaaatt tgaatgaaat 4860 aaacattgtt tttacattga atggtttttt ttatttttag ttttttattt ttagtttttt 4920 ttctggacta ggagagaga 4939 // ID CR1-126_AAe repbase; DNA; INV; 3976 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-126_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3976 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1214-1214 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 18 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 40..708 FT /product="CR1-126_AAe_1p" FT /translation="MEKKQCGACQLEVNEIEPLRCGFCDVYFHIGPQCCGF FT NLSRPSRDLFSQGKALFMCPSCREELNGRSIKSYITDMQESNIPPPDDLQI FT QVQKLTKLVDTVCQKVDRCLNNTVPADRTVRTEEIWPRLGVKRRCGNDDQP FT IPAAPDRGTNSIDLSDLSVPCLTPEAPPPKFWLYLSGLHPQVTADDVQKIA FT SRCLKLSAPADVVRLVPRGADVTKLSFVSYKLG" FT CDS 741..3902 FT /product="CR1-126_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="CFHLAQWITVPGIHRYVKKPDFDHGLRHGHGEFKLML FT FPVDGGDSRVVSMNENCITRSHVDGENPRVSTRGEYLNHSPVSSAPNASPL FT QNLNVDNRHLWVYYQNVRGLRIKIDTLFCNTIECNYDVIILIETGLNESIN FT STQLFGDNYNVYRCDRSARNSNKSTFGGVLIAVHRRFNSTIIDTINGVFLE FT QVCASVAVGSKRLQLCAVYIPPDKSKDVSAFNDHVESIGELCSLCSVHDHV FT LVCGDYNQPKLLWTRNDDELRPANSITLTTTSTVLIDGMDFLNLCQANTER FT NYLGRTLDLVFGSPDLLPVVLVPASPLVPVDLHHPPLEISLPSINVSLNDS FT TGGNVTSDLSLDFNKIDFDALSVFLAELDWSNILAELNADEMAESFCTILV FT NWMNTNVPIRRPTPSPAWSTPLLRTLKRKRNALQRRLRRDHCSDNKRLFHQ FT ACNEYRNLDSSLYKSYVLRMQSNLRRNPKSFWNFVNSKWKSTPIPTSVYLD FT GVEASNSTEKCELFAAHFSSVFSTESVTPSEVDAAITDVPSNLLDLDTFRI FT TPNMITEAAKKLKCSFSPGPDGIPTAVFCRCTDALAEPLCKIFNQSFDDAR FT FPRIWKQSFMVPVHKKGDKRDVKNYRGITNLSAASKLFEMIISHVILEQAK FT CYISADQHGFMPGRSVTTNLLDFTTTCFEQLENGAQIDVIYTDLKAAFDSI FT NHDILLAKLAKLGASSRLSSWFSSYLRERSLRVKLDSNTSSCFMSSSGVPQ FT GSNLGPLLFLLFFNDVMVLLGSGCGLSYADDLKLYATIHSIDDCSRLQSIL FT DIFVDWCHRNKLTVSVPKCMVMSYFRTTRPLIQDYVIDGSVLNRTDEFNDL FT GVLMDPKLTFNLHRSNVIAKANRQLGFISKISRDFTDPYCLKALYCSLVRP FT ILECAALVWAPHQLSWSLRMESVQRRFIRVALRNLPWRDPRNLPPYANRCK FT LLDLDSLDRRRKTQQAAFIAKLLNGEVDCSRLLAVLNIRVAPRMLRGTALL FT QPRFHRTAFGYHEPMTEMIRIFSSVEGLHEFGESSLRFTSRIRRQNDF" XX SQ Sequence 3976 BP; 1020 A; 940 C; 886 G; 1130 T; 0 other; tgtttattgc tgcgagtggc tgttttgttc tacatcagta tggagaagaa acaatgcggc 60 gcatgtcagc tggaagtgaa tgagatcgag ccgttgcgct gtggattttg tgatgtttac 120 ttccacattg gtccgcagtg ctgtggattc aacctcagtc gaccgagtag agatcttttc 180 tcacaaggaa aggcattatt tatgtgtccg agctgccggg aggaactaaa cgggaggagt 240 ataaaatcgt acatcaccga tatgcaagag tctaacatac ctccgcctga tgatctgcaa 300 attcaagtgc aaaagttgac taagttggtg gatactgttt gccaaaaagt cgatcgttgc 360 ttaaataata ctgttcctgc tgatcgtaca gttcgcactg aggaaatctg gccgcgtcta 420 ggtgtgaagc gccgttgtgg caacgatgat caaccaatac ccgcggcgcc tgatcgtgga 480 accaattcta tcgatctcag tgacttgtct gtgccgtgtt taacaccaga agctcctccg 540 ccaaagttct ggctgtacct gtctggatta catccgcaag taaccgctga cgatgtccag 600 aagattgcat ctcgttgcct caagttgtct gctcctgccg atgttgtacg tttagttccc 660 aggggtgcgg atgtgaccaa attgtctttc gtgtcgtaca aacttggttg ataggtccca 720 acaaaggacc gtgcgcttga tgcttccacc tggcccagtg gattactgtt ccgggaattc 780 atcgatatgt caaaaaaccg gatttcgacc acggattgcg acatggtcac ggcgaattca 840 aattgatgct ttttcctgtg gatggtggtg actctagggt tgtctccatg aacgaaaact 900 gtattacgcg atctcatgtg gatggtgaaa accctagggt ttccaccaga ggcgagtatc 960 ttaaccattc tccagtttct tccgcaccta atgcctctcc attgcagaat ctgaacgtcg 1020 ataaccgcca tctctgggtg tattaccaga atgttcgagg attgcgcatc aaaattgaca 1080 cattgttttg caataccatc gagtgcaact atgacgtcat tattctcatc gaaactggac 1140 tcaacgaaag tatcaactca acacagctgt tcggggacaa ctacaatgtg tatcgctgcg 1200 accgaagtgc gagaaacagc aataaatcta cttttggagg cgttctcatt gctgtccatc 1260 gccgtttcaa cagcacgatt atcgatacta tcaacggcgt attcctggag caagtttgcg 1320 cgtcggtagc agttggctcc aagcggcttc aattgtgcgc cgtttatatc ccccccgaca 1380 aaagtaagga tgtgagtgca ttcaacgatc acgtggaatc catcggtgag ctatgtagct 1440 tatgctctgt ccacgatcat gtactggtat gtggggatta taatcaaccg aaattactat 1500 ggactcgaaa tgacgacgaa ctacgaccag cgaactctat cacactgacg accactagta 1560 cggtgctgat cgacggtatg gattttttga atctttgtca agcgaatact gaacgcaact 1620 atttgggacg cacacttgac ctagtattcg gatcaccgga cttgttaccc gtggttttgg 1680 tccccgcttc accgcttgtt cctgtagatc ttcatcaccc gccgctcgag atatctctac 1740 cgtcaatcaa cgtttctctg aacgactcaa ctggcggaaa cgtgaccagc gatttatcat 1800 tagatttcaa caagattgac tttgacgccc tttcagtttt tctcgccgag ctggactgga 1860 gcaacatttt ggctgagtta aacgccgatg aaatggctga atcgttttgt accattttag 1920 taaactggat gaacacgaac gtaccgatcc gtcgccctac tccttcacct gcgtggagca 1980 caccgttgct aagaacattg aagcgtaaac ggaacgcact ccaacgcaga ttgcgtcgtg 2040 atcattgctc tgataataag cgtctctttc accaagcatg taacgaatat cgtaacctcg 2100 attcatcgtt gtacaagtcg tatgtcttgc gtatgcaatc taacttacgg cgtaacccaa 2160 aaagtttctg gaacttcgtt aattccaaat ggaagagtac gcccattcct acgagtgtgt 2220 atcttgacgg tgtggaagct tctaattcaa cggaaaaatg cgaacttttt gcggctcatt 2280 tctcatccgt gttttctact gaatctgtca cgccttctga ggtggatgct gcaattacgg 2340 acgtaccaag caacctgttg gatttggata cattccgcat tacaccgaat atgattacgg 2400 aagctgccaa aaagttgaaa tgctctttct cgccgggccc tgacggaatc cctactgctg 2460 tgttttgtcg ttgtactgat gcgttagctg aaccactgtg caaaatattc aaccaatcgt 2520 ttgacgatgc acgttttcct aggatttgga agcagtcgtt catggttcca gttcacaaaa 2580 agggcgataa gcgtgatgtt aagaattatc gaggaattac taatctctct gccgcatcaa 2640 aattgtttga aatgatcatc agtcacgtta ttctagagca ggccaaatgc tacatttctg 2700 ccgatcaaca tggctttatg ccggggcgtt cggtaaccac gaacttgcta gacttcacca 2760 cgacctgttt tgaacagttg gagaacggag cccaaatcga cgttatttac actgacctca 2820 aagcggcatt cgactccatt aaccacgata tcctactggc aaagcttgcc aagcttggtg 2880 cttcgagtcg gttatcatca tggttttcct cctacctgag agaaagatcg cttcgcgtaa 2940 agctcgattc aaacacttct tcgtgcttca tgagctcatc cggagttcct caaggaagca 3000 acctgggtcc gctgcttttc cttctattct tcaacgatgt gatggttctt ctgggatccg 3060 gctgtggact ttcctatgct gacgatttga aattgtacgc caccatacac tccatcgatg 3120 attgctcccg actacagtct atattggata tatttgttga ttggtgtcat cgtaacaaac 3180 taactgtaag tgttcctaaa tgtatggtga tgtcctactt ccgcactact aggccactca 3240 tccaagacta cgtcatcgat ggttcggttc taaatagaac ggatgaattc aacgatttgg 3300 gtgtgttgat ggatcctaag ttgacattca acctacatcg ctcgaatgta attgccaaag 3360 caaatcgcca gcttggattt atatctaaga tttcccgaga cttcacagac ccgtattgcc 3420 taaaagcact ctattgctcc ttagtacggc caattttgga atgtgctgct ttggtgtggg 3480 ctccccatca gctatcgtgg agtcttagaa tggaaagcgt ccaacgaaga ttcattcgtg 3540 tggcattaag gaacctaccc tggcgtgatc cacggaacct gccaccatat gctaatcgat 3600 gtaaactgct tgatttggac tctctagacc ggcggcgcaa gactcaacaa gctgctttta 3660 tcgcgaaact actgaacgga gaagtggact gttccagact tcttgctgtt ttgaatattc 3720 gtgtagctcc aagaatgtta cgcggcactg ctttactcca accaaggttc catcgaacag 3780 ctttcggtta ccacgagcct atgactgaaa tgatccgcat cttctctagt gttgaaggct 3840 tacacgagtt tggtgaatct tccctgcgat ttacgtcacg catcaggcgt caaaacgatt 3900 tttagttaag tcccaatttc atgtagacaa tagtcagatg aaaataaatc aataataata 3960 ataataataa taataa 3976 // ID Gypsy14-I_Dpse repbase; DNA; INV; 3604 BP. XX AC Unknown_singleton_87; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14_Dpse; KW Gypsy14-LTR_Dpse; Gypsy14-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-3604 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1092-1092 (2009). XX DR Genome; Unknown_singleton_87; Positions 9901 6298. XX CC Positions [1265-1771] - Reverse transcriptase CC Positions [2836-3312] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 203..1990 FT /product="Gypsy14-I_Dpse_2p" FT /translation="MKDIASRGSIADDTFIQYLIDGIDGKGVNKTILYGAT FT NINQFKEKLKCYKTIMEKENQMEKETSKRNEINIKETDKKSQSMNKDDKKE FT VKCYNCGGKGHLSSNCQNKDKGRKCFKCNQFGHISKDCKIEKFGATSVNTR FT RLQPKPDITHKEVAFENKKCIALFDTGSKFNVMREDFYREIGKPKLESCKI FT VLIGFGDCGSNRIKPVGHFKHAVLIDDGEFLLDFYVVLLCRLIDVKLIIGE FT ELCLLAVIVFNRNGLQVRNMSSNMSSNMSSGDELNIMKICAEIDNDVIDIN FT EAASESAKKEVRELIDNYVPNKSKTTNVEMRIVLKDESPIFSRPRRMAFTE FT TRIVDEQIEEWLKNDIIEESTSEFTSPIVLVKKRDGSARVCVDYRRINKVT FT VKDHFPLPLIEDQLDRLQEAKIFSTIDLQNAFFHVPVSESRRKYTSFVTKR FT GQYQFLRVPFGFLNSPGVFQRHVNAIFRHLSSKGIALPYVDDVIIPAQTEQ FT EARKNLKEVIETCKDFGLELNIKKCHFLKTRIQFLGHLTENSKIYLSTEKI FT DAVSKFKMPQSLKQVESFLGLTGYFRKFIPNYAGIARPLTELTENKKKF" FT CDS 2179..3603 FT /product="Gypsy14-I_Dpse_1p" FT /translation="MSKKTTDAQRKYSSYELEILAVIEALTKFRVYLLGIH FT FKLITDCNAFTKTLEKQNLCTRVARWILFLQEYDYAVEHRSGTRMKHVDAL FT SRYPIMMITEDSISIALKDAQSKDEKIKAIKQILNDEQKTYDDYFLKSGML FT YRLVQDEELMVVPNDMQKQIISRAHDRGHFAAKKTKDLICREFYIQNVEEK FT IKKYIECCIPCILMNRKRGKQEGLLHPLQKEDVPLYTYHIDFLGPLDSTHK FT DYKHIFAVIDSFTKYCWLYPTKSTTANEVIAKLSSQSITFGNPAFIISDKG FT SAFTSQDFVKYCDDEGIKLVKTTTGLPRVNGQVERLNAIIISVLSKLSIDD FT PTKWYRHVSKVQQAVNSTFTRSIDTTPFEMLVGVKMRTKEDLQIRQLINKE FT AATIFNDERSDLRAKAKAQIVKLQNENKKTYNLRRKPARRYNIGDLVAIKR FT TQFGSGLKLKAKFLGPYQVVGIKHNNVFGF" XX SQ Sequence 3604 BP; 1277 A; 609 C; 774 G; 944 T; 0 other; aaggtttttg ttcgccaaaa aatctctaaa aggaatcgcg aaattgtttg tattgagtga 60 aagaggtttg aattcatgga attccctcaa gcaggcatta ttgtcagaat ttaagtcgaa 120 tgtaaccagc aagcaaattc atgagcaact cgcgcaaacg aaacgaggca gcaatgagtg 180 tgtattcgaa tatttctaca aaatgaaaga cattgcatca cgaggaagta ttgcagacga 240 cacatttatc caatatttga ttgacggcat agatggaaaa ggtgtgaata agacaatttt 300 gtacggcgca acaaatataa atcagtttaa agaaaaatta aagtgctata aaacaattat 360 ggaaaaagaa aatcaaatgg aaaaagagac ctcgaagcga aatgaaatca atataaaaga 420 aaccgacaaa aaaagtcaaa gcatgaacaa agatgataag aaagaagtga aatgttataa 480 ttgcggtggt aaaggtcact tatcgagtaa ctgtcaaaat aaagataaag gtcgaaagtg 540 tttcaaatgc aatcaattcg ggcatatttc taaagattgt aaaattgaaa agtttggtgc 600 aacgtcagta aatacgcgta gattgcagcc aaaacccgat atcacacaca aagaagtcgc 660 atttgaaaat aaaaagtgta tcgcgctttt tgatacgggc agcaagttca acgttatgcg 720 ggaagacttt tatcgcgaga ttggcaagcc gaagttggaa agctgtaaaa ttgttcttat 780 tggatttggc gattgtggct ctaataggat aaaaccagtt ggacatttta aacacgctgt 840 tctaattgac gatggtgaat tcttgttaga tttttatgta gtacttttat gtagacttat 900 cgacgtgaag ctaattattg gagaagaact gtgtttactg gcggtaatcg tttttaatag 960 aaacggactt caagtgcgca atatgtcgag caatatgtcg agcaatatgt cgagcgggga 1020 cgaactaaat ataatgaaaa tttgtgcgga aattgacaat gacgtgattg acattaatga 1080 agcagcaagt gaatctgcca aaaaagaagt acgcgaacta atagacaatt atgtgccaaa 1140 caaatcaaaa acaacaaatg tagagatgcg tatcgtatta aaagatgagt ctcctatatt 1200 ttcgcgtccg cgcagaatgg cgtttacgga aacgcgtatc gtcgatgaac agattgaaga 1260 gtggcttaaa aatgacatta tcgaagaatc tacatcggag tttacaagtc caatagtcct 1320 tgttaaaaag cgagacggtt cggcaagggt ttgtgttgat tacaggagga tcaataaggt 1380 aactgtcaaa gaccattttc ctttgccgct tattgaagat caactagaca gattgcaaga 1440 ggccaagata ttcagcacga ttgatcttca gaatgcgttt tttcatgtgc cggtttctga 1500 atctcgtcgg aagtacacgt cgtttgtaac taagcgaggc caatatcaat ttttgagagt 1560 tccgtttgga tttctaaatt caccaggagt ttttcagcgg cacgtcaatg cgatatttcg 1620 acatctgtcc agcaaaggta ttgctttacc ctacgttgat gacgtgatta ttccagcaca 1680 gacagaacaa gaagctagga agaatctaaa ggaagttatc gaaacatgca aagattttgg 1740 cctagaattg aacataaaaa aatgccactt tctaaagacc cgtattcaat ttcttggtca 1800 tttaactgaa aacagtaaga tttatctttc aacagaaaaa atagatgctg tgtcaaagtt 1860 taaaatgcca caatcactca aacaagtaga gagtttcttg ggactgactg ggtatttcag 1920 aaagtttatc ccgaattatg caggcattgc tagaccttta acggagttga cagagaacaa 1980 aaaaaaattt taatttgatg tatatgaaga gaatgccgtt aatatcttaa aaaaaaactg 2040 ctgacggaga acccagtatt gaatatatat aatcagactt acgatacaga agtacacaca 2100 gatgcatcaa ttgatggttt tggtgcagtt ctattacaga aatcgccaga cgacggacag 2160 ctacacccgg tctattatat gtccaaaaag actactgatg cacagcggaa atatagcagt 2220 tacgaattag agattttggc agtaatagaa gcgttaacca agtttagagt gtatctctta 2280 ggcattcatt ttaaactgat aacagactgc aatgctttta caaagacgct ggagaaacaa 2340 aacttatgca ctagggtggc gcgatggatt ttgttcctgc aggagtacga ttatgctgtg 2400 gaacatcggt caggaactag aatgaagcat gtcgatgctc tcagccgcta cccaatcatg 2460 atgataactg aagacagcat aagcatcgct ttaaaagatg cacagtcgaa agatgaaaag 2520 attaaggcta ttaaacaaat tctgaacgac gaacagaaga cttacgacga ttatttcctg 2580 aaatctggta tgctatatag acttgtacag gacgaggaac ttatggtggt acccaatgac 2640 atgcagaaac aaattatcag ccgtgcacat gatagaggtc attttgcagc aaagaagacg 2700 aaggatttga tttgtagaga attctacata caaaatgtcg aagagaagat caagaaatat 2760 attgagtgct gtattccgtg tattttgatg aaccgaaaga gaggcaagca ggaagggcta 2820 ctacacccac tccagaaaga agacgttcca ttatacactt atcatattga ctttttggga 2880 cctctggatt cgacgcacaa ggactacaaa catatctttg cagttataga ctcattcacg 2940 aagtattgct ggctgtatcc cacgaaatcg actaccgcaa atgaggttat tgcgaagctt 3000 agcagtcaaa gtattacatt tggtaatcca gcgtttataa tctcagataa gggctccgct 3060 ttcacctcgc aagactttgt taagtattgt gacgacgaag gaattaagct ggtgaagacg 3120 acgacaggac taccgagagt aaatggtcaa gttgagagac tgaatgcaat cataatttca 3180 gtgctctcaa aactgagtat cgacgacccc acgaaatggt acaggcatgt cagcaaagtt 3240 caacaagctg taaattcaac atttacgaga agcattgata cgacaccctt tgaaatgcta 3300 gttggagtta agatgcggac aaaagaagac ctccaaatac ggcaattgat caacaaagaa 3360 gcagcaacta tatttaacga cgaacgcagc gacctaagag caaaggcaaa ggctcaaatt 3420 gtgaagcttc agaacgaaaa taagaagacg tataatctac gacgaaaacc tgcaagacga 3480 tataatattg gtgacttggt tgccatcaaa cgtacacagt ttggtagtgg cctgaaatta 3540 aaagcaaagt ttttgggacc ctaccaagtt gttggaatca aacataacaa tgtttttggg 3600 tttt 3604 // ID TTAA10_AP repbase; DNA; INV; 168 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 0) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA10_AP. XX NM TTAA10_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-168 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2075-2075 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 168 BP; 42 A; 27 C; 42 G; 57 T; 0 other; cacgttgatg gacacgccat gccatagcat tctatgtgta ttggtcttgc cggtgcatga 60 ctgctatagc atttgttttt gtaaatgttg atttagggtt aaaaatttag tgatgctaaa 120 aggtcagaaa tgtggtgcca tgctatagca gttgtgtcca tcaacgtg 168 // ID Howilli5 repbase; DNA; INV; 2401 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.1, Created) DT 18-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Howilli5 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Howilli5. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-2401 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1188..2066 FT /product="Howilli5_1p" FT /translation="MILINFSITRCKYYNIITLKSFCPTRWNTVYYLLKSV FT EINWIELTTILKDKNQTNRIEGINVNHLGSVVQILKQFETVSKKFEATKSP FT TTHLIIPNLNKLKKICLSDSNDIVIIQALKSALYTQILSTIEPNLSKYHKI FT ALFLFPPTNKLVQFSPHEKETVIDECKLIMERFLKNSGSNTLQIPIANDEF FT ADFVEVQQVDPNNNQIDQEIKGYSNINVPYSTNFDALAWWDMHQKFFPLLH FT KASCNIFCIPASSAASERTFSNARNVITGKRCLIAANADNINKIMFLNSNL FT N" XX SQ Sequence 2401 BP; 843 A; 405 C; 416 G; 737 T; 0 other; tagacagctg catgacttca ttcacttaat aagtaaaata gagtatacac ggatagaatt 60 gaatagagtg agtgacagac gcacaatgag tgagtgtgta tgagtattag tatattttgg 120 tactcataca aagcaaattt aatgacactg agtactgact gagtactcag taaaacggag 180 tgtcaaattt gacacatcta aaaacattcg agttatccaa gtatcagggt acatattttg 240 tgtatttatt ttctttgcat cgtttaaaca cagcagacgc aaaaatggga ggaggaaaac 300 tacccaaagt gaaaaaagta tgtaatttaa agcgtatttt gttgtttttg ttttcgtttg 360 ttttgttatt gctgtagtca aatattgcat atatctttct gcttagcttg tgacctgtcc 420 ataaatgttt tttgagtaat ttctttcaga tttaatttag ttcaattaaa atatacaggt 480 tgttaaaaga gctgcgagca caagagagat ccttgcacag gaaataaaga ggaatctgca 540 ttctggattg tatagcttaa tagaaaagag gggtcgaagc gaagtgtggc aatttttctc 600 aagaataaaa aatgaaaatg gagatgagct agttgaactt gtagcctgca agatgtgttt 660 gtcagtcttg aaattcacag ctagcacttc aaatttaacg aagcataaat gctacatagt 720 caatgccagc aaaactcaaa aatcttcccc ggttgatgtg agttcagaaa ctaaacagga 780 aggcgtagct gttgccactg aatgggttgt caagaactgc cgcccattaa aaattatagc 840 agattccgga ctaaacaaat ttgcatcatt tttaattaat gttggtgcaa cttatggtcc 900 aaatgtagac gtggataaat tgttcccaca tccaacaaca gtttcccgga acatagcagc 960 gatatttgac tcgcactttg ttccaataaa agaggaaatt gttaaataca aagtttttgg 1020 atacgcaatc actagcgata tatggacaga taactttttt aaagattcat atttatcgtg 1080 cactgtgcat tatgtaaagg aaggagtact tgttgacaga ctcatggcca tgaaatcaat 1140 gaaaggaatg tcctgcacag gtgtgttaac ttaatgaacg agtgtacatg attttaataa 1200 atttttccat aactaggtgc aaatattata atattattac gcttaaaagt ttttgcccca 1260 ctcgttggaa cacagtttac tacctgctta aatcagttga aattaattgg atagaactga 1320 caacaatatt aaaagataag aaccagacca atagaatcga aggtattaac gtcaaccact 1380 taggttccgt tgttcaaatt ctaaaacagt tcgaaaccgt ctcaaaaaaa tttgaggcaa 1440 ccaaaagtcc aacaacacat ttaattattc caaatttaaa caaattaaag aaaatctgtc 1500 tgtccgattc caatgatatt gtaattattc aagctctaaa gtccgcattg tatactcaaa 1560 ttttatcgac aattgaacca aatttatcaa aatatcacaa aatagcccta tttctatttc 1620 cacccacaaa caaattggta caattttctc cgcatgaaaa ggaaaccgta atagacgagt 1680 gcaaactaat tatggaacgt tttttaaaaa acagtgggtc taacacttta caaataccaa 1740 tagcaaatga cgaatttgct gattttgtag aggttcaaca agttgacccg aataataacc 1800 aaattgacca agaaataaaa gggtactcca atataaatgt accgtacagc acaaatttcg 1860 atgcactggc ttggtgggac atgcaccaaa aattttttcc tttattgcac aaagcaagtt 1920 gcaatatttt ttgtataccg gcaagcagcg cggcgtctga aagaaccttt tcaaacgcaa 1980 gaaatgtaat tactggaaaa cgttgcttaa ttgctgcaaa tgcagataat attaataaaa 2040 taatgttctt gaactcaaac ttgaattaaa tgtagtatat atgttaataa gtatatgaaa 2100 tgaatattaa aatgtaatct atctatgtat ttgctttcgt ttttaggggt tttgaagtac 2160 agattctgta cgcacaacaa aaaaacaaca aaaatgctgt gtcggcacct gtcattcgga 2220 tacattcgaa acatacttac ttatactcac ttatactctc tgttttaaca ttgtacggcc 2280 gagtataaag caagcctatc cactcactca tacttattcc gttttgcact catattcact 2340 caactcaatt tgagagtagt atatccgaat gtcacttttt tgacattcac gcagatgtct 2400 a 2401 // ID DNA-ATAT-2_CQ repbase; DNA; INV; 1079 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA-ATAT-2_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1079 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 50-50 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >92% CC identity. TSDs are 4bp; usually ATAT. ~490-bp TIRs. XX SQ Sequence 1079 BP; 359 A; 182 C; 176 G; 360 T; 2 other; gggtcattcc aggtcaactg agtacacttt tggactcgac cttcaccgat ttggaccaaa 60 cttggaggga acgtttatct atcgatagtt aacagaaatc ccaagtttgg tgctgattgg 120 accatccctc tatttttggc accgccctct tttttggcga ttttctaaaa aacttttttt 180 tcttttaatc ataactttgc aactatttga gcaaaagact ttctacaggt tgcattttat 240 agaaaattgt ccaaggaatt cgataaaaat aaaattttaa cccttaaatg cccactaata 300 ttatatttta acgttttaag tataaaaatt cagttttgac caattgmtta tatttttatt 360 tgtttttatt ttatcgccat cgtgttcccc ggacaatttt acataataat caatgtagac 420 ttcaaactaa aatgaactat tggcgagata cagcgatttt actgaaaaaa gtttgatttt 480 gcgcwgcact ctgtcctatg aattcaaatc cgaaataagt ttctcacata taaaaaccga 540 attatgcgag attcattcct ttttgttgaa aagtacacca tgaacttccg agtgctgcgc 600 aaaatcaaac ttttttcagt aaaatcgctg tatctcgcca atagttcatt ttagtttgaa 660 gtctacattg attattatgt aaaattgtcc ggggaacacg atggtgataa aataaaacaa 720 ataaaaatat aagcaattgg tcaaaactga atttttatac ttaaaacgtt aaaatataat 780 attagtgggc atttaagggt taaaatttta tttttatcga attccttgga caattttcta 840 taaaatgcaa cctgtagaaa gtcttttgct caaatagttg caaagttatg attaaaagaa 900 aaaaaagttt tttagaaaat cgccaaaaaa gagggcggtg ccaaaaatag agggatggtc 960 caatcagcac caaacttggg atttctgtta actatcgata gataaacgtt ccctccaagt 1020 ttggtccaaa tcggtgaagg tcgagtccaa aagtgtactc agttgacctg gaatgaccc 1079 // ID TTAA22_AP repbase; DNA; INV; 435 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA22_AP. XX NM TTAA22_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-435 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2090-2090 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 435 BP; 146 A; 77 C; 74 G; 136 T; 2 other; ggggattggt ggcggtgatt tcctgtcttt gtctaacaca cgcgcaacat aagaatttag 60 acgtgttttt tgtccaacgt accgattgat atagtgaagc gataaaaatt ctgaaaacag 120 atttgaattt gtcgtcaagt ctacttgaga tcggtcagtc cacatatttt atattatccc 180 ccaaaatcct aaaattattg tattattaaa gagtaattaa aaaaatttca tatttcattt 240 tttggagaag tgtaaatcaa aaaatcaaaa atgtagactg accgatctca agtagacttg 300 acgacaaatt caaatctgtt ttsagaatty ttatcgcttc actatatcaa tcggtacgtt 360 ggacaaaaaa cacgtctaaa ttcttatgtt gcgcgtgtgt tagacaaaga caggaaatca 420 ccgccaccaa tcccc 435 // ID BEL-202_AA-LTR repbase; DNA; INV; 455 BP. XX AC AAGE02028271; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-202_AA_; KW BEL-202_AA-I; BEL-202_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-455 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028271; Positions 14200 13746. XX SQ Sequence 455 BP; 147 A; 80 C; 83 G; 145 T; 0 other; tgttcgatta taatatttga tttggaaatt gtcttagcgt ttctaccaca cattcttatc 60 agtcaccttt acgatcactg ctcatgtttg tctgcctaac acaatgctgc ttcaagtcaa 120 cactgtgggg ctgagcaaga acgatttatc acttcatgcg atgagagcac aaaaggctat 180 acatcagaca gcaagagata cgattcgatc tatcgtgcgg gacggcaaaa tttgtctttt 240 atgtgttatt tatttgaaac ttagattaag caaacattaa atagtttaag aataaaagtt 300 atatacagtg catgcaacaa ataagtgtct atcaataaac ttaaaggtct aaaatctatg 360 tgctaagtgt ttaaaatgat tgattgagtg ctgaaagaac cgtaaatgtt ttgtgtacaa 420 aaattgctcc gctgacctcc ctggctattc gaaca 455 // ID Gypsy-24_AA-I repbase; DNA; INV; 4289 BP. XX AC supercont1.380; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_AA_; KW Gypsy-24_AA-LTR; Gypsy-24_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.380; Positions 569267 564979. XX CC Positions [3378-3662] - Integrase core CC 'GTTAT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..2250 FT /product="Gypsy-24_AA-I_1p" FT /translation="MMNPEFQKSLLDILQNQQKILDQLMQNQYSASSSSQQ FT AVVSTQRNHNQEFLIESLSSGITEFTYDPENGVTFAAWFSRYEDLFIEDAK FT NLDDAAKVRLLLRNISTVAHQKYISYILPKKPKDQNFEQTVSTLKIIFGRQ FT ISLFNARYQCLQLSKSSGDDFYTYAGVVNQKCEEFKLNEVTSDQFKNMQQT FT VCAINRSNPSKSTNSEECNDVPRSPCWRCGNMHYSKYCSFINHECRTCQKT FT GHKEGYCACFEKVGKKQKENGKQRFNHRMQAQAQGVFSINQIGLMDRRKYI FT TVQINGVSTKLQLDCASDITIIPEDVYVQLGSPAGRAPSIDAVNASGEDVG FT LSREVECNVSFNGVTKKRRYFVTTVPALSLFGIEWIEMFNLWNVPINAVCN FT NVKLQSLPQTEGFIEELKKEFASIFSDELGLCKKKVSLTVKSGIKPIFRLK FT RPVPYASTAKIEAELERLQKLGIINPVPYSDWAAPIVAVNKPNGRVRICAD FT YSTGLNSALEPHQHPLPLPQDIFAKLANKKVFSQIDLSDAYLQVEVEPSSR FT KLLTINTHKGLFEFTRLSPGVKTAPGAFQEIVDNMIAGLEGVSGYLDDLIV FT ASDSVEEHIHHLECLFARIREFGFTLKIEKCNFFMSEIKYLGFIVDCQGIR FT PDPEKVRAISEMPAPHDVTSLRSFLGAVNYYSKFVRGMHELRQPMDALLKK FT DVKWNWSQACQQSFNQFKQILQSNLLLTHYDPRLEIIVAR" XX SQ Sequence 4289 BP; 1378 A; 892 C; 918 G; 1101 T; 0 other; gtttttggcg acgaggatta cggaattttc atgatgaatc cagaatttca aaaatcgcta 60 ttggacattt tacaaaacca acaaaagata ctcgatcaac ttatgcaaaa ccagtattct 120 gcaagttcat caagccaaca agcagtcgtt tcaacgcaga ggaatcataa ccaggaattt 180 ctaatcgagt ctctgtccag cggtattacg gagttcacgt acgatccaga gaacggcgta 240 acatttgcag cgtggttctc gagatacgag gatttattca ttgaagacgc gaagaatttg 300 gatgatgcag cgaaagttag actcttgctt cggaatatta gcacggtggc tcatcaaaag 360 tatatcagct acattcttcc aaaaaaacct aaggatcaaa actttgaaca gacggtttct 420 acactgaaga tcatttttgg aaggcagata tccttattca acgccagata ccagtgcttg 480 cagttgtcga agagttccgg agacgatttt tacacatatg caggagttgt aaaccaaaaa 540 tgtgaggagt tcaagctgaa cgaagttaca tcggatcaat ttaaaaacat gcagcaaaca 600 gtttgcgcaa tcaatcgttc aaatccatcg aaatcaacaa atagtgaaga atgcaacgac 660 gttcctagat caccatgttg gcgatgtggg aatatgcact actcaaaata ctgttcgttc 720 atcaaccacg agtgtcgaac gtgccagaaa acaggacata aggagggtta ttgtgcttgt 780 tttgagaagg taggcaagaa acagaaggaa aacggtaagc aacgattcaa ccacagaatg 840 caagcgcaag cacaaggagt gttctccatc aatcagatcg ggctaatgga cagacggaag 900 tacatcacag ttcaaatcaa cggagtgtca accaaacttc aactggactg tgcctctgac 960 atcactatca ttccagagga cgtctacgtt caattaggtt caccagccgg aagagcacca 1020 tccattgacg cggttaatgc aagtggcgag gatgttggtc tcagtcgaga agtagaatgc 1080 aatgtttcgt tcaacggggt aactaaaaaa agacgatatt tcgtaactac agtaccagca 1140 ttgtctttat ttggtatcga gtggatagag atgttcaacc tgtggaatgt gccgatcaat 1200 gctgtgtgca ataatgtgaa gctgcagagt ttacctcaga cggaaggatt catcgaagag 1260 cttaaaaaag agtttgcaag cattttcagt gatgaactcg ggctatgtaa gaagaaagta 1320 tcgctcacgg ttaaatctgg catcaaaccc atctttcgtt tgaaacgacc tgttccatac 1380 gcatcgacag ccaagattga agcagaatta gaacgtcttc agaaactcgg aatcatcaat 1440 ccggtgccat attcggattg ggcggctcca atagtagcag tgaacaagcc aaatggacga 1500 gtgagaatat gtgctgatta ctcaactgga ttaaattctg cattggaacc gcatcaacat 1560 cctttaccgc tacctcaaga tatttttgcg aagctagcta ataagaaagt tttcagtcaa 1620 atagaccttt ccgacgctta tctacaggtt gaggtcgagc cgtcatcgcg taagctactt 1680 acgatcaata ctcacaaagg acttttcgag ttcacaagac tttctcctgg cgtcaaaact 1740 gctccagggg catttcaaga aattgtggat aacatgatag ctggtctcga aggagtaagt 1800 ggatacttgg acgatctaat tgtcgcaagc gattctgttg aagaacacat tcatcatctc 1860 gagtgcctat ttgctcgtat tcgtgagttc gggttcacgc tcaagattga aaagtgtaac 1920 ttcttcatga gtgaaatcaa gtacctaggc ttcatcgttg attgtcaagg aattcgtccg 1980 gatccggaga aagttcgtgc catcagtgag atgccagcac cacatgatgt gacttcattg 2040 agatcatttc tgggagccgt gaattattac agcaagtttg tacgaggaat gcatgaacta 2100 agacaaccca tggatgcatt actcaagaaa gacgttaagt ggaactggtc acaagcttgt 2160 caacagtcat tcaaccaatt caagcaaatt ttacagtcaa atctattgct cacacattac 2220 gatccaaggc tcgaaataat tgttgcccgc tgatgcctcc atgagtggcg tgcgagctgt 2280 tttgtttcat cgatatccga atggaaagca tcaaagcagt ttgtcacgct tcacgaatct 2340 catgattcaa gtttaatagg atacatccta gtgatataaa agaaccttta gccctcatct 2400 tttgctgtta cgaaattcca tcggatgcta tttggtcgtc aattcacatt acagacggac 2460 cataagccgc tgatttccat ttttggctca aaaaagggta tcccggttca cacagccaac 2520 cggctacagc gttgggcttt aactatgctg ctgtataact tcaaaattga attcaagtca 2580 acagaaagtt ttggttatgc tgatttttta tcaaggcttc agcattcaag acctgaagag 2640 gaatacgtta ttgcaagtac tcggatggaa actagcatta gaaatattca agccgaatct 2700 ctttcaacgc ttccaatcac acatccaatc agcattcaag acctgaagag gaatacgtta 2760 ttgcaagtac tcggatggaa actagcatta gaaatattca agccgaatct ctttcaacgc 2820 ttccaatcac acacgaaatg gttgtagctg ctaccaagaa agataaaaca cttcaaaatg 2880 tacttcaaca gataaacatt ggatggtcaa caaacaattt aacacaagag gtcaaatcat 2940 tcaaaaaccg tcaagaatcg ttatacagct caggcgattg tataatgttg tccgatagag 3000 tggtaattcc cggagtcctt cgtactgcag ttctgaagca actacacgcc gggcatcctg 3060 gtatggaaag gatgaaagga atagcacgta gctatgtatt ctggccaaac atcgatgtgg 3120 atattgaaaa ttacgtccgt acttgtaccc gttgtgcagc agtggccaaa tgtccagtaa 3180 aaacaacgct ttcatcatgg cctattccag cgcagccttg gtcccggatt catatggatt 3240 acgctggtcc attcaagggc aagtattttc tggttatcgt cgacgcactt acaaagtggc 3300 cagaaatata ctgtacaaat tccatgactg caaccgtaac tgtgaacaaa cttcgagaat 3360 ctacagcacg tttctgacta ccagatgcaa ttatcaccga caatggtaca caatttgatt 3420 ccagctcgtt cgagacgttt tgcaagaaga atggtattga acacatcaag attcctccac 3480 accatcctca attaaatgga caagccgagc gcttcgtaga tacattgaaa agagcactga 3540 aaaaaatgga tgaagacgga ctggaagaag cactccaaac gtttttgttc acatatagat 3600 acactccaaa caaatcgatc aaagatatga agtcacctgc agaggccatg ttaggaagaa 3660 aactcaagac gaatttggac ctgttgaaga aaccagcaca taaatcgttt cagatcaatc 3720 atcagcaaaa ctttcaattc aatcgagcgc acggtgctaa agaaagaacg tttgtagctg 3780 gtgacgaggt atatgcagag gtttacattc acaataagcg atactgggct tgcggaaaaa 3840 tcattgaaaa gaagggtaac gtgacgtaca acgtgctatt ggatgatgag agaagaagcg 3900 gattgattag gtcacacgca aatcaattac gacgtagata tggtggtaat ccagccaccc 3960 cggaggaaga acaattacca gtccttatgc tgcttgaaga gtttggtatg tatgacgaaa 4020 cacccgaaat tgaccaattg gatccagaat atgaacaacc attggaagtt cccgagttac 4080 aggataatcg attggaaaac ccgccgttag atgattccga agtactcatc aatagtccag 4140 ttgaattagc agagtatggc gacgcaatag aagcatcacc aacattaaca aatattgttc 4200 tcccgaatcc atcacctgta gttagagagt cccgtatacg aaggtttcct gcatttttca 4260 aagactacag tcttttctaa ggagggaga 4289 // ID Gypsy-261_AA-I repbase; DNA; INV; 4280 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-261_AA_; KW Gypsy-261_AA-LTR; Gypsy-261_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4280 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX CC Positions [4114-4632] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 480..4241 FT /product="Gypsy-261_AA-I_1p" FT /translation="MIPQKNIKLARYEFYQCQQEPGDDSREPESMSKFINR FT TRMLVKDCNFGDLEDEMLRDKIITGIHDVRLKKKLIETAELTSAKVIELCQ FT AEEATRAEMERNRWLESRQHGVNRIVTASRDSRQCGYCGRSYHKNLSECPA FT RGATCRFCGFKNHFEAVCKKKKEQSKQTGIPGPSRRLAKKTVHTVDCETDP FT PTVTQSDNEQESVNVCQYLYSINQDNDGLLKATLTFLNVNQRERKVPCILD FT SGASCNVVGKRNAMEILGTKQLSLDNESAVLNVFGGGKLRSLGRTEIDCVH FT NGSRFKAVFHVVDFEQPPLLSFRTCLQLKLLQVCFSVTEDQREAARRIVNR FT YPEVFSGLGKLEGKVHLDVDDSVKPVVQHPRRVAVTLRENLRQELSNLVQQ FT GVITEEKRHTNWVSNLVLVKRNDKLRVCIDPIMLNKALERPHYQMPTLDEL FT LPELANAKVFTTVDAKSGFWQLELDEESSKLTTCWTPFGRYRWLRMPMGIS FT PAPEVFQLRANEAIQGLKNVRALMDDFMIFGCGNTIEEALIDHNRNLQAFL FT ERMKEKNLKLNPDKISLCKDSVKYFGHILTASGVQPDPEKVNSILNMEKPK FT DVPAAQRFLGMVTYLSHYLPSLSTLAEPLRRLTYHDQPWTWSHEQDEAFDK FT VKQAVTSAPVLRYFDATKHTTIQCDSSSVGLGAVLLQEGQPVVYASKTLSA FT TERRYAQIEKETLAILFACRKFETYILGRPVTVQTDHQPLIKIFQKPLTEA FT PLRIQRMLLALQRYNITLTFKPGKEIVIADMLSRAAHIDNDITSRDIYDIF FT VTDIENICALDYLPISDARVDEIRFRSREDPDIQSIIRCIIDGWPRRYDEL FT PPAIQIYWKYKNELSTHNGLVLRNDRILIPKSLRGAIMERLHQSHCGVEAT FT TKLARDTVFWPGITDQIRQRVQHCHVCAKFSSNQQPQPMQSHQIPSYPYQK FT VSMDICECTLGNRSSVYLVSVDHYSDYFDVDELTTQSAAAVVKICKRNFAR FT LGKPQEISSDGGPQFMSEEFSTFCHSWGIKHSVSAPYHQQGNGKAESAVKI FT TKMLLRKAYESKQDFWELLLQWRNTPNKTGSSPAQRLFNRRTRCGIPMYEK FT KYLPKIEENVKEKIALNHRRAKSYYDRKARTLPELEIGQPVFVKMKPTDKE FT WKRGTVTNPITDRSSVVAVEDRTYRRDNTMIKVRRSIEEPPSEALPIKEEK FT DTAPSEVPETADYSSKIGSSSTVAVERPKRAIHVPKRFQDFEMY" XX SQ Sequence 4280 BP; 1314 A; 947 C; 1025 G; 993 T; 1 other; acttgtttat ttaatatggt gtcagaattc gcgaataaca gttttacgat cccacgttgt 60 aagttagttt tgttattttg aaaccgtcgc ggatatttcg agtcgtttaa ggcaaattga 120 tgtgtcggcc ggtcggccgt gtctatcgag aagtgagcgt cgaacagaat gacggaaaat 180 ccatcccaag cacaggctgt acgacagcca ccggcggctc cagaaatcaa cagcagtatt 240 ccctttccgg acccgttgtg cagctggatg gcaacgtacg ggagaacgtg gaactgtgga 300 aagatgcttt tgatacctac atcatagcat caggagtaga acagctggaa gagagagtga 360 aaattgcgac gttcaaatct gcgctaggca cagaagctag gcggattttc aatctgtggc 420 cccttcgaga cgaggaaaag aatacggtcg ctgcttgctt gcaatcattg tcctcgtata 480 tgataccaca gaaaaacatc aagctggcac gatacgagtt ttatcaatgc caacaagagc 540 ctggagatga cagccgtgaa ccagaaagca tgtcgaagtt catcaatcgg acccgtatgc 600 tggtgaaaga ttgcaatttt ggtgatctgg aggacgaaat gctccgggat aaaatcatca 660 ccggaattca tgatgttcgt ctgaagaaga aattgatcga aaccgcagaa ttgacttctg 720 ccaaagtgat cgaactttgt caagctgaag aagctacccg agctgaaatg gaacgtaatc 780 gatggctaga aagtcgacaa catggtgtaa acagaattgt tacagcgagc agggacagcc 840 gacagtgcgg gtactgtgga cgttcttatc acaaaaatct atcggaatgt ccagcaagag 900 gagcaacgtg tcgcttttgt ggatttaaga atcatttcga agcagtttgc aagaaaaaga 960 aggagcagag caaacaaacc ggtatcccag gaccatcgag aagattagcg aagaaaacgg 1020 tacacaccgt cgactgtgag acagatccgc caaccgttac tcagtccgac aacgaacaag 1080 aatccgtaaa cgtctgccag tatctgtaca gcatcaatca ggataacgat ggattgctga 1140 aagcaacact aacgtttctc aacgtaaatc agcgtgaacg taaagtacca tgcatccttg 1200 attcgggagc ttcctgtaac gttgtaggaa agaggaatgc aatggagatt cttggtacga 1260 agcagttatc ccttgacaat gaaagtgctg ttctgaatgt attcggagga ggcaagttgc 1320 gctcgttagg ccgcacagaa atcgattgcg tacacaacgg atcacgtttc aaagctgttt 1380 tccatgtggt ggatttcgaa caaccgccgc tgctatcgtt cagaacgtgt ctccagttga 1440 agctgctaca agtatgtttt tctgtcacgg aggatcagag ggaagctgca agacgaatcg 1500 tgaaccgcta cccagaggtc ttctccgggc ttggaaagct tgaaggaaaa gttcatctag 1560 acgtcgacga cagtgttaag ccagtcgtgc agcacccacg gcgcgtagct gttacgcttc 1620 gtgaaaacct tcgtcaggag ttgtcaaact tggtacagca aggagtaatt acagaagaaa 1680 agcgacacac taactgggta agtaacctag tattagtaaa acggaatgac aaattgcgag 1740 tatgcattga tccgatcatg ctaaataagg ctctggaaag accacactac cagatgccta 1800 cactcgacga acttctgccg gagctcgcaa acgcgaaggt ttttacgact gtcgatgcta 1860 aatccggttt ttggcagctc gagttggacg aagaaagctc gaaacttact acttgttgga 1920 cacccttcgg tagatacagg tggcttcgaa tgccgatggg catttctccg gctccagaag 1980 tgtttcaact gagagcgaat gaggccattc aaggcttgaa gaacgtacgt gcactgatgg 2040 acgatttcat gatcttcgga tgcgggaata cgatcgagga ggcattgatc gatcacaata 2100 gaaatctgca agcttttttg gagagaatga aggagaaaaa tctaaaactt aatccggata 2160 aaatcagttt gtgcaaagac agtgttaaat actttggaca tattctaacg gccagcggtg 2220 ttcaaccgga tccggaaaag gtgaacagca ttttgaatat ggaaaaaccg aaagacgttc 2280 ctgccgccca aaggttcttg ggtatggtaa cctatttgtc acattatctg ccgagcctct 2340 cgacactggc ggagcctcta cgtcgcttaa cataccatga tcaaccctgg acatggtccc 2400 atgagcagga tgaagcattc gacaaggtga agcaagctgt aacatcggca ccagtgctaa 2460 gatactttga cgcaacaaag catactacaa tacagtgcga cagtagtagt gtgggacttg 2520 gcgcggtact cctccaggag ggacaaccag tcgtgtatgc gtcgaagacc ctcagcgcaa 2580 ccgagcgtcg ttacgcacag atcgaaaaag aaacattggc tattcttttc gcctgtagaa 2640 agtttgaaac gtacattctt ggacggcctg taactgtcca gactgatcac caaccgttga 2700 tcaagatctt ccaaaagcct ctgacggaag cccctctacg aatccagcgt atgcttctag 2760 cattgcaacg atacaacatc acgctgacgt tcaaaccagg taaagaaata gtaatagctg 2820 atatgttgtc aagagctgcc catattgaca acgatattac cagccgcgac atttacgata 2880 tattcgtaac tgatatagaa aacatttgtg cattggacta tcttcccatt tcagacgctc 2940 gtgttgatga aattcgattc agatctcgag aagatccaga catccaatcc atcatcaggt 3000 gcatcatcga tggatggccc cgtaggtacg atgaattgcc accagcaatc caaatatatt 3060 ggaaatacaa aaacgagctc agcactcata atggcctcgt attgcgcaac gatcgaatcc 3120 tgatcccgaa aagcctacga ggcgcgatta tggaaaggct tcatcaatca cattgcggcg 3180 tagaagcgac taccaaacta gccagggata cggtattctg gcccggaatc accgatcaaa 3240 ttcgacagcg tgtgcaacat tgtcatgtgt gtgcaaagtt ttcgtccaac caacaacccc 3300 aacccatgca gagccatcaa attccttcgt atccgtatca gaaagtttca atggatattt 3360 gcgaatgtac actgggcaac aggagttckg tttaccttgt tagcgtcgat cactactcgg 3420 attattttga cgtggacgaa cttacaacgc agtcagcggc agctgtagtg aagatttgta 3480 aacgcaactt tgccaggctg ggtaaaccac aggagattag cagcgatgga ggtcctcaat 3540 tcatgagtga agaattcagc acattttgcc actcctgggg aatcaaacat agcgtatcag 3600 ctccttacca tcagcaagga aatggtaagg ctgaatctgc ggtcaaaatc accaagatgc 3660 ttttgagaaa agcgtacgag tccaagcaag acttttggga attgttacta cagtggcgaa 3720 atacaccgaa caaaacagga agttctccag cccaacgact tttcaatcga cgaacacgat 3780 gtggcattcc aatgtatgaa aaaaagtatt tgcccaaaat cgaagaaaat gtcaaggaaa 3840 aaattgcatt gaaccaccgg agagccaaat cctattatga ccgtaaagct cgtacactac 3900 cagaattgga aattggtcaa ccggtcttcg tcaaaatgaa gccaaccgat aaggagtgga 3960 agagaggaac agtaaccaat cctatcacag accgttcatc agtggtagcg gttgaagatc 4020 gaacatatcg tcgagacaac acaatgatca aggtaaggcg ttcaattgag gaacctccga 4080 gcgaagcatt gccaatcaaa gaagagaagg ataccgcacc atctgaggtc cccgaaactg 4140 ctgactacag ttcgaaaatt ggttcgtcat ctacagtagc agttgagcgt cctaagcgtg 4200 cgatccatgt tcccaagcgt ttccaggatt ttgaaatgta ctgacaagga aaggcagctt 4260 aaatttaatt gaaaaggaga 4280 // ID CR1-16_BF repbase; DNA; INV; 3701 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-16_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-16_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3701 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3701 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1587-1587 (2009). XX DR [2] (Consensus) XX SQ Sequence 3701 BP; 982 A; 850 C; 777 G; 1092 T; 0 other; gagtggaggc tgcgcagcag gtccgcaccc cttactcgtt gggattaaag cttggttccc 60 tgtcattctc aaaggtttgt tcaatttgat ttgtgactta tctgttgcaa gattgacagg 120 ggccgttctt ccggccctct tcttattcgg ggaaagggac agtgctgtgc agtaccactc 180 aatttttttt tttttttttt ttttccttca tggtagatag tcgaattgtt actgcaatct 240 cccactgttg tattattatg ccttttcatg tcattttagt agtgtaattt gtacgattga 300 tgagcgatgt gattgatatg caaatgaggg cgattgctga actctgtatt ttatatgtaa 360 attagattag ttagtgcgtg gcagcacctt ttgaaggtaa tcttaggatt ttttaggtaa 420 ccttttacaa ttagagcttt gtattgtgat aatgggcttc cttcaacttg cattatcctt 480 tgtatctttt atggctacaa tgtcgctatc acctaatctt aatgtcatct atcttaatgc 540 aagaagcgtt aaatctgtaa accagtatag aaacaaattg gtgcaactcc agaacttgat 600 caacctcgaa gcgccagacc tcctagctat cactgagtct tggttaactc cggatgtaca 660 tgaccaagag gttattccgc cggactttgt cacttatcgg aaggaccgtt acgacaccca 720 caacaacaag gctggtggag ggatcttact tgctgtcaga tccacagtct gcagtaggcg 780 cagacccgac ttggagcccc aggatgaaat cctagtttgc gaactccatc cccctaccgt 840 gggaaggatt gctgtggtac tatgttacag gcctccttcc ggcgacctgg caccatttac 900 acgcaacctg tgctcggttt tggagagtgt acataacgaa tacagcatgt gctgcgtact 960 cggcgatttc aacctgcctc gtattgactg gtccagctgt gttgtgacaa acgacggcaa 1020 ggaagccgac ttctgcaact tgatcaacaa ccattttctt caacaactca atcaagtgcc 1080 ttcaaactcg tgtcacaact tactcgatct ggtgttcacg gactttcctg agaggttttc 1140 tgagatcagg gagttacccg ctgtttttga caccgaccat acagtacttg agttcacact 1200 ccaatgtcgt attcaacaca agcgagggct tcctcggaaa gtatacaact tcaaacgagc 1260 ggactggggc ggactcaatg cacacctgga gtcgtcgggt ttagctgagt ctgtcactaa 1320 ccaccctgat attgactctg cctgggaggc gtggtcttcg gctgtgcagt ctgctgtcga 1380 cacgtttgtg ccctctcgca aacttaaggt gtccactact cctccctgga ttgacagcga 1440 agttcgtaac ctacagaaca gaaagcgtac tgcatggaga agagcgaagc gaactgactc 1500 tccctctcac tggacaaagt ttcgcaaact tcggaacagg ctgaaaaacg tcctatcagc 1560 caaatacaag aagtacctgg agagcctgtc gtccactctt catgagtcgc ccaagagttt 1620 ttgggctttt gttcgggcaa agtccaaatc caagtccctt ccatcagtgg tacaactaga 1680 aggcattatt gcacagtctc caatagataa ggcatctatg ttcaatgatt acttcttctc 1740 cacatttaca aagccagata tagacactgt aaaccctgtt atagacattg taagcgatga 1800 tgctctgtgt aacttacagt tttctgtaga ttccgttcac aaggtgcttg ccactcttga 1860 tccaaataag gctgtaggtc cagatgctat gtctcctcat gtcttgaaga gatgtgcgga 1920 cgttatcgca ccatccctga ctctactatt taataaatct ttggccttag gtaagttccc 1980 ctcacactgg aaagacgctc acgttactcc cattcataag aagggagaca aagaggtcgt 2040 ttctaactac aggcctgtgt cattactttc cactgtaagt aaagtcatgg aaaggtgcgt 2100 tcacgacagc ataattccta ggcttcatgc gtcaatacac aatctacaac atggtttcat 2160 gaaaggccgg tcaaccacca cccagctttt ggaagtgtac catcaggtcg ggtcaattct 2220 tgataaagga ggccaagtag acatgttgtt tttagatttt gccaaggcat ttgattcagt 2280 cccacatgct ctactagtcc ataaattaca aatgtacggt tttagtggta gtctcttgtc 2340 atggattgag tcctacctga caaaccgccg ccaaagagtg gtggttgaag gaagtcggtc 2400 tgattggcgt accgtcactt ccggagtccc acagggctcc atcctaggcc cactgttgtt 2460 cgtactttac ataaatgatc tacctgactc ggctagaaac tctatgtcag cactttttgc 2520 agatgatagc aaatgtttca gagagatccg tagcacaaat gactgttcaa aactacaaca 2580 ggatatctgt tcactacatg actggagcat aatgtggaag ttgactttca acctatccaa 2640 atgcatagtg ttaaggttta cacgatccaa gtaccccatc agatatgatt atcacatgtc 2700 aagttcaaac ctgactgtag tggacagcat gtccgacctt ggcatcacag taaaatccaa 2760 cctcacatgg aatagccatg tcataagaac agtggccaaa gctaatcaaa tgcttgggtt 2820 tatcaagcgt tccataggtt acaagtccaa cactgatata cgtaagacat tgtatttgtc 2880 tcttgttaga agtgttttgg aatactgttc atcagtatgg agccccacat cacgtaatct 2940 cacagccctg atcgagggtg tacagcgaag ggcaacgaaa tacatcctcg ggccctccgc 3000 tgaacacctg gactacaagt ctagactatc tagactaggc ttgttaccac tgtcttacca 3060 aagagagatg tctgatgttg tggtatttgt taagtccctt gcaaaggtat atgattcaga 3120 tctagcgagg ttagctgcat tccctacaag aacgccacgt agttcaaggg cacacatgtt 3180 agtccctcag agagtaagat cttcttcatt tgcctcctca tttactccac gacttatcac 3240 aatttggaac aaacttcccg ttgaaatcag agggctggga gcatcagcca caagcccgtc 3300 cgatgtgtct gcctttaaac gaaaactatc tacatttatg caaaaccgtt tccgacaaaa 3360 cttcaagcta gagcaggctt gcacatggtc actagcatgc tgttgcgcct cttgcattgc 3420 cacaaggcca cgataatttt accttatatt attactacat gtacatttgt atatctattt 3480 catagatcta gactccataa tgtatgtgtg ttgttaactt gatttatctt gacttgaatg 3540 ttacattgat tatgatatct gttcttgtat ttatgtatat tatgatttga tgtatcattg 3600 ttatgtttaa ggtatggcgg cttcgaaaag gtgtcttttg acacctgttc cgccataccc 3660 tccagtgtgg agaatccaaa taaataaata aataaataaa t 3701 // ID Gypsy-212_AA-LTR repbase; DNA; INV; 196 BP. XX AC supercont1.5; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-212_AA_; KW Gypsy-212_AA-I; Gypsy-212_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-196 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.5; Positions 732255 732450. XX SQ Sequence 196 BP; 68 A; 45 C; 28 G; 55 T; 0 other; tgtaaagtat atatcaaatg ggaagaaccc acagtgaatt tattatacca cttacacaat 60 caaacaatat ccaaaggtac ccttttataa tttgaaacct acaatacacg aggcagtctc 120 aaaaacgtat cccgtgcaga cagttctgtt ggctatccga aaagcctgtg acccctatct 180 ctatcgtttt attaca 196 // ID Gypsy-23_OD-LTR repbase; DNA; INV; 263 BP. XX AC CABV01004651; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_OD_; KW Gypsy-23_OD-I; Gypsy-23_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-263 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004651; Positions 14 276. XX SQ Sequence 263 BP; 76 A; 59 C; 56 G; 72 T; 0 other; tgattaggcg taggattaga actagaatta ggattagcgt ttagaagtag attagaactc 60 ccagcacgtt gagtattaga ctttaggtta aaacgtgttc aggtttattt atcgcgaccg 120 atcccctcaa gatcggatcc cagatcagca acgaagacga ccggcatact gtaccctccc 180 ctctgactgt acctgtaccc gatcgccagc acgatctgaa taaattagat taagagtcta 240 agtcgagttc attattttcc gca 263 // ID Gypsy-210_AA-LTR repbase; DNA; INV; 209 BP. XX AC supercont1.2264; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-210_AA_; KW Gypsy-210_AA-I; Gypsy-210_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.2264; Positions 3379 3171. XX SQ Sequence 209 BP; 83 A; 26 C; 52 G; 48 T; 0 other; tgtaggatct tacaccctat aaaaaccaag agcgatgaat caaagtaggc actggaaaaa 60 agggagagtt gagtaaggaa ccgatcggtg tggatgcgga aagaaagagc ttgtgaaaaa 120 gtagtgaaaa tataggaaaa gtgttaagag aagaattgaa gttattattt caaatagtgt 180 cctgaaactt gctatccgaa tatcctaca 209 // ID Gypsy-43_AA-I repbase; DNA; INV; 4131 BP. XX AC supercont1.107; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-43_AA_; KW Gypsy-43_AA-LTR; Gypsy-43_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4131 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.107; Positions 2200593 2204723. XX CC Positions [1776-2237] - Reverse transcriptase CC Positions [3205-3678] - Integrase core CC 'ATAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 624..2780 FT /product="Gypsy-43_AA-I_2p" FT /translation="MRRKKKPGETIETYFYDQVALGRKGKFDDKTIIKYVI FT AGVEDAHRAKGVTISMPRSLPELLEQLKWLDGLGGMRSGTSEGGRATKGEK FT TFYRCSKSGHIADQCSENQKSRKCFRCGSDWHLAKNCTVGRNGEPSRAVKV FT IAVDDGFEKIVSIDDKQMMALIDSGSRVTTMKSSFSQKFGNRQPVDLILKG FT FGGRRIRVTEKVVAILKVDDIEENVEFVVVPNFAQDLPVIVGKDVLKRDTV FT LLTKKNGKVWLAKGECCEQDAGDLQSERSVRDVCSIEAYEPISVEDLNMDG FT SDMSKEDLLECVTDNRDCFSKNYRELGKAKCVELEIVLQDQKPVYEKPYRM FT EFSREAELNMIVKELLEAGIIEPTSSAYSSRASMVPKKEGEFRMVVDYRSL FT NQKTVKDRFPMPTIEYCLSKLVGGEIFITVDLFSGYYQIPVAVNSRKLTAF FT STMDSHYQFVRIPFGLVNGCAVFQRAMNQMVAELKDDNVVVYIDDVILSGK FT TEEEVFEKFKRLLNALRVHGFTVNLKKSQFSKRSVDFLGFEVSKKGVKPGD FT RKIEAVEKLPEPKSKQEVQQFLGLTGYFRRFVKGYSFRASSLVELLKEEVE FT FKWREEQVKAFEDLKRALVSKPVLALYDPASEMELHTDASAIGLGGILLSL FT TAEGWRPVGYFSRKTSEQEAKYFSYELEVLAVVSSVERFRQYLLGRPFKVV FT TDCSAVTQTFASQPLWRSC" FT CDS 2797..3804 FT /product="Gypsy-43_AA-I_1p" FT /translation="MFHVMQLEIDEDDFLVTMQRQDPKLLAIMESLVQPIQ FT DNYDRQVRHDYELVQNRLMRKIGDEQKWVVPERVRWRILKAYHDDMGHMAE FT EKVLENMKKKFWFKRMRHYVKEFIKACPKCAYNKAKGGQTEGKLYPIPKIA FT VPFQTIHLDHIGPFPKSANGNVHVLVLVDGYTKFTFLKAVRSTKSELVVKT FT LVDLTAIFGTPQRVITDRGTAFTAGVFRKFCGEHHINHVFVAVGSPRANGQ FT VERVNRALLTAIRSTLDEDRKWDNCLPSIQWAINNTLNATTRVSPSDLVFT FT FKPRDIIRNEIVLVIHDETDNRIDNMDELQKPAEENIREKQQSQ" XX SQ Sequence 4131 BP; 1202 A; 776 C; 1200 G; 953 T; 0 other; tttctgtaga caggataacc gtgtgcgaag taaactcacg gagtctggag agcgaatcgc 60 gagttttcgg taatcgtgtt gtaggccaag gatcaaaatc cttcgccggt gatagagcca 120 cgtgcaaaag acaattggaa attgtggctt tgtgatatcc gctattgtgt ggctggacaa 180 agaaaagccg ggtctgagta acaatggacg agttgcgcgc gaaaaatctg gagttggtgg 240 ctaggttgga ggagctaaaa gcgaagcttg cagagggccc aaagatcgag cccgcagcta 300 gtgcatcgca gacagcgagc atggtcccgg atgtgtgtag gccaggccgc tttgaactag 360 aaacgctagt cagcgatttc aatccggaca tacccacttg tgagtcggct gaaagctggt 420 tggagaacgt tgatgcaacg gctgatgcgt atggatggcc tgatctcacc aggttttact 480 gcgcgagaat gcacctgcaa ggagcggcga ggctttagtg gaatggggtt caaacgactg 540 tgagaacgtg ggctgtgttt aaagtgaaac ttctagaagc gttcccggaa gccaacgatc 600 cagtttccat tcatgagcag ctaatgcgaa gaaagaagaa gcctggtgaa acgatcgaga 660 cctacttcta cgatcaagtg gctttgggca ggaaaggcaa atttgacgat aaaaccatca 720 tcaaatacgt tattgctgga gtggaagatg cgcaccgggc aaaaggagtg acgatctcga 780 tgcctcgtag tctccctgag ctgcttgaac aattgaagtg gttagatggt ctggggggta 840 tgcggtcagg cactagtgaa ggggggagag caacaaaagg tgaaaaaacg ttctatcgtt 900 gttctaagtc cggacacatt gccgatcagt gtagtgagaa tcagaaatcc cggaaatgtt 960 tccgttgtgg atcagattgg catctagcga aaaattgtac cgttggccgg aatggtgagc 1020 cgtctcgtgc agttaaagtg attgccgtcg atgacgggtt cgagaagata gtgtcaatcg 1080 atgataagca gatgatggcg ttgatcgata gtggaagccg agttactacg atgaaaagct 1140 ccttttcgca aaagttcgga aaccgacaac cagttgacct aatactgaaa ggttttggcg 1200 gtcgtcgaat ccgagtgacg gagaaggttg ttgccatatt gaaagtggac gacatagaag 1260 aaaatgtgga atttgtggta gtgccgaatt ttgcccagga tttgccagtg atagtgggta 1320 aagatgtatt gaagagagac accgtgctgc tgactaagaa aaacggaaaa gtgtggttgg 1380 cgaaaggcga gtgctgcgag caggatgctg gagatctgca gtccgagaga tcagtgcgag 1440 atgtatgttc gatcgaggcg tacgaaccca tatcggttga agatttgaat atggacggat 1500 cggatatgag caaagaagat ttgttagaat gcgtgacgga caatcgtgac tgcttttcga 1560 agaactaccg tgagcttgga aaagctaagt gtgtagaact agaaatagtg ctacaagacc 1620 aaaagccagt gtatgagaag ccgtatagaa tggaattctc gagagaagcg gagttgaaca 1680 tgatcgtgaa agaactatta gaagctggca ttatagagcc aacatcatcg gcatatagta 1740 gtcgggcctc gatggtgcca aagaaagagg gtgaatttag gatggtagtt gactaccgtt 1800 cgctaaatca aaagacggta aaagataggt ttcccatgcc gactatcgag tattgcctga 1860 gtaaattggt gggcggagag atcttcatta ccgtggactt gttcagtggc tactaccaaa 1920 taccggtagc tgtaaacagc agaaaattga cggcgttttc aacgatggat agtcattacc 1980 agtttgtgag aatcccgttt gggctcgtga atggctgtgc tgtttttcaa cgagcaatga 2040 atcagatggt ggcggaatta aaggacgata atgtggtggt ctatattgat gatgtgatcc 2100 tgagtggcaa aacggaagaa gaagtgttcg agaagttcaa aagactactg aatgctctgc 2160 gagtgcacgg atttacagta aatctaaaga agagccagtt ttccaaacgt tcagtggatt 2220 ttcttgggtt cgaagtgtca aaaaagggag tgaagcccgg agatcggaaa atagaagccg 2280 ttgaaaagtt gccagagcca aaatccaagc aggaagtaca gcaattcttg ggactgaccg 2340 ggtatttccg gagattcgtg aagggttaca gctttcgagc gagttctttg gtggaattgc 2400 taaaagaaga ggtggaattc aagtggcgag aagaacaagt gaaggctttc gaagatttga 2460 aacgggcatt agtctcaaag cccgtattgg ccttgtatga tccagcgagt gaaatggagc 2520 tgcacactga tgcgtctgcc atcggattgg gcggtattct actgtccctt acggcggaag 2580 gatggagacc agtgggctat tttagccgaa aaaccagtga acaagaagca aaatacttca 2640 gctacgagct ggaagtactg gcagtagtct caagtgttga acgttttcga cagtatttac 2700 tgggacgacc gtttaaagtg gtgaccgatt gcagtgcagt gactcaaacg tttgcgtcgc 2760 aacccctttg gcgaagctgt tgaagtagag ccagtgatgt ttcatgtgat gcaactagaa 2820 attgacgaag atgacttcct ggtgacaatg cagcgacagg atccaaaact actggcgata 2880 atggagtcgc tagtacaacc gattcaggac aactacgatc ggcaggtgcg acatgattat 2940 gagttagttc aaaatcgatt gatgcgaaag atcggcgatg aacagaagtg ggtggtaccc 3000 gagcgagtac gttggcgcat actgaaggca tatcacgacg acatggggca tatggctgaa 3060 gaaaaagtgc ttgaaaatat gaagaagaag ttttggttca aacgaatgcg ccactacgtg 3120 aaggagttta tcaaagcatg cccaaagtgt gcttacaata aggcgaaggg tggacagacg 3180 gaagggaagc tgtacccaat accgaagata gcagtgccgt tccaaacgat tcatctcgac 3240 cacattgggc cgtttcccaa atcagctaat ggcaacgtgc acgtgttagt gttggtagac 3300 ggctacacca agttcacgtt cctgaaagcg gtacgttcaa ctaaatctga gctggtagtg 3360 aaaactttag ttgacctgac ggcgatcttc ggtactccgc agagagtgat caccgatcgc 3420 ggcacagcat ttacggcagg agtatttcga aaattctgcg gagagcatca tatcaatcac 3480 gtgtttgtgg ctgttggttc tccgagagca aatggacaag tagagcgggt gaatcgtgca 3540 ttgcttacag cgatccgctc aacgctcgac gaggatcgaa aatgggacaa ttgcctaccc 3600 tctatacagt gggcgatcaa caacaccctg aacgctacaa caagggtctc acccagtgat 3660 ttggtgttta cgttcaagcc gagagatata attcgcaacg aaatagtgtt ggtgattcac 3720 gatgagacgg acaaccggat cgacaacatg gatgagctgc agaagcccgc cgaggaaaat 3780 atacgcgaga agcaacaatc gcagtagaga tattatgatg ccagaaggag agaagccagt 3840 aaatattgcg aaggtgacat ggtgttagta gagaaggatg ctgtggtgat tggaggtatc 3900 cgaaagctcg aaccgaagtt caagggccca tacatcgtag ctgaggttct tgggcatgat 3960 cgttaccgaa tccgtgatgt tccgggagcg caacgaaaga cggcagcgtt ggatacggtg 4020 tacgcagccg accgtatgaa acgttggtgt gtgatgggaa atctagacgg cgacgaacac 4080 ctcgatgaca tcggtgattg atggggcatc atctacgcag tggtgtcaga t 4131 // ID Gyp2_Cis_LTR repbase; DNA; INV; 127 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gyp2_Cis_LTR. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-127 RA Smit A.F.; RT "Gyp2_Cis_LTR - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000114 4 b target site duplications; 0.5% diverged copies. XX SQ Sequence 127 BP; 31 A; 28 C; 29 G; 39 T; 0 other; tgtggcgacg tgttcagcga agcgttaata atgccttttt cttgttaccg tcttttcgtg 60 tgttgtatta tcgcagagag ttcattacaa agtcgcccgc acaaagaaca cagtgcagta 120 cgctaca 127 // ID Gypsy-224_AA-I repbase; DNA; INV; 4473 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-224_AA_; KW Gypsy-224_AA-LTR; Gypsy-224_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4473 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1051-1051 (2011). XX DR [2] (Consensus) XX CC Positions [3389-3898] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1832..4336 FT /product="Gypsy-224_AA-I_1p" FT /translation="MDGSKTVNPHIITHHYPLPVIEELISNKSGAKKFALI FT DLRGAYQQLIVSEATKKLLVINTHKGLFTYKRLPFGVKPAATIFQSVMDRI FT LQGISDVQTYIDDILIWAESDEELLSKIKIVLRRLAQHNVKINAEKCEWFV FT SQVKYLGHILSEAGVLPNPEKVKAITAVPVPKSKTQLKAFLGMITFYTKFV FT PKLCILLSPLYNLLTKDSKWEWNYRCDEAFEKSKNAICSAKILTHYDPSKP FT ITVTCDASDDGISGVLSHKINNNDMPVFFVSRRLTKAEQKYPILHREALAI FT VFAMEKFYKYVLGQKVTIVTDHKPLLGIFNSRKGGPPIIATRLQRYFLRLS FT IFDFTITHMAGKENLVADCLSRLPVNQDLSRADLLENDYSSFNQLNYLVDD FT RKVNLNSILISKESESDPLLSIVIKYVRKGWPSHVKDQKLKNFYSKRHELD FT VESGCLIFGERIVVPNSLKLPSLQLLHSNHRGIQKMKQIARRFLYWDGFST FT DIENFVKSCKKCQILGIDRTPKVYGNWPAASTAFERVHIDFFQKFNRTFLI FT LVDAYSRWIEIHKMSKTNAENVAQVLDNIFAIFGFAAIIVSDNGPPFNSFY FT FKKYCQSRNIEHILSPPYHPASNGLAERAVQTTKAVLNKLIEKDSTSSLQI FT ENEINKFLHHHHQTPTTEDKIIPNDRIFSFSPRTELTSMKYNKCIFSDQNK FT SKTKATFNINDNVIYTYKSNGRAYSSEAVIVKQLSNLTYTIDVDGDKRTAH FT KNQLKSVPQKQFILKNHVNTRSLRNELEHDVVDNDQVSIKRKSSQRTATKP FT KPKLYQTLRRSSRINKSKYRTVNVKSIVKKKKM" XX SQ Sequence 4473 BP; 1599 A; 785 C; 914 G; 1173 T; 2 other; gttctggcga cgaagttgaa acggcgcgaa ttgcgtgcag agtgcgtgaa aaatttgtga 60 aattttagtg ctaggtgcac ggatcgggaa agatctagac cgattaggat ttcaacgtgg 120 tggaagttgt ttgaaggaca atgcttggtg ggtctgcgac cgaagacaat aagccagcta 180 caatcatggt ggacaaaaga acgaaaggaa taaggtttcc tttgtattcg ccgggtctta 240 acgtgaattc gtaccttcaa acggttgaga tttatttcac gttaaacaag acaccaaacg 300 atgaaaaagc gttggaattt attacaagtg tgggtcaaga gactgccaat aggattatcg 360 gcagtttcaa gccagacaaa attgttaata aaatctatga agaaatcatt aagcaagttt 420 aaagtgctac acgaagagaa caaaaatgtg ttcgccgagc gccatcgtct tattacgcgc 480 cggcaagaag atggtgaatc gttagacgac tttgcgattg atctgcagaa tattgtcaga 540 acactgcagc gtgagtgtag aaacagaagc tacgtcggtg caatcggtgt ttgtgcgctg 600 ggttacgaaa cgataaaacg cgtgaatcaa tgmtcgcgtg atgcgccgtg atgagaagct 660 caatctggct cagttgttag aaaaagcaaa gaccatcgaa attgccagcc ctagaatctc 720 gaaaaatggc taacacgatg gtaacagctt gaattatgtg ggaaacatgt aattgggagt 780 acagcaggac agtgcttaaa cccggaaaaa tgaatgagca agcgtatgga aagcaatcga 840 atagtgatat ctccacccat cgtggtagtg ctaaagtaag cgatgatact gtgtgctaca 900 attgttatag aaaagggcat ttgtcgtacc atgtgtacgt tgcccaaagc caaaaaacca 960 cggcagggag tgccgtttgc gaagagaaat gtgagcagga aaagagcata cgaagaaaga 1020 ataaaccaac cgtatcggct gccatggaag acctgaaaat gagtgttggc caagaagacg 1080 atgagatctc cgacgatgaa acaactgtag aacaggatag ccgaagagga tgcttctaat 1140 tgggtaaaca acattatgtt gggtaagtcc amcaacaata tccaacacct gccttttgtg 1200 gaactgatgc gttaatggaa tcagcaaact gattaatgga atgtgatacg ggtgcttgtg 1260 caacaatttg ctctctaaac acatacaatg attgttttag catgtgtaga attttacccg 1320 accagagaaa atttttcgtc ttatcaggag aatcagtggg agttctaggt aaaataaaag 1380 ttaaagtgag agtgagtggc gaactcttaa atctctggtt gctaatagtc aaatctccca 1440 aaaattttgc gcctctcttg ggtcgtgact ggcttaatat catatggccc aaatggagaa 1500 atacatttaa actaaattct ctaaaggaag ttaagaggga gttgtgggtt aaaaatacag 1560 ttaaagatct taagagagaa ttttcaaaag cattcgatga tgatctgaca gagcctatta 1620 aagatgtagt ggtagatata aagatagatc cacaggcaaa acccgttgta cataaaccct 1680 atacagtagc gttcaaacac agggaaacag ttgcaaaaca cctagaagat cttcaagcga 1740 aaggtaggta cttgaaaaag tcgaatacgc cgagtgggct tctcccatag tcgtggtagt 1800 gaaaccgaat aaaaaagaca taagaatatg catggatggg tctaaaacag taaatccaca 1860 cattataact catcattacc ctctcccagt aattgaagaa ttaatatcaa ataagagtgg 1920 agcgaaaaag ttcgctctta tcgaccttag aggagcatac caacaattga tagtttcaga 1980 agccacgaaa aagcttttag ttatcaatac acataaaggt ttgtttactt ataaaagatt 2040 gccattcgga gtaaaaccag cggccactat attccagtct gtaatggata gaattttaca 2100 aggtatttct gatgttcaga catacataga tgacatcctc atttgggcag agtcagatga 2160 agaactttta tccaaaatca aaatagtgtt gagaagatta gcgcaacata acgttaagat 2220 caatgccgag aaatgtgaat ggtttgtatc ccaagtgaaa tatttgggtc acattctgtc 2280 ggaggcagga gtgttgccaa atccggagaa agtgaaagcg ataacggccg tgccagtgcc 2340 taaatcaaaa acgcaactta aagcattcct cggcatgata acattttaca ccaaatttgt 2400 tccaaagctt tgcatacttc tatctcctct ttataattta ttaaccaaag acagtaagtg 2460 ggagtggaac tatagatgtg acgaagcatt cgaaaaaagc aaaaatgcta tctgtagcgc 2520 gaagatactt acacactatg atccatccaa acccatcaca gtaacctgcg acgcaagtga 2580 tgatggcatc tcaggagttt taagccataa aattaacaat aatgatatgc ccgtgttttt 2640 tgtttctcgt cgtcttacaa aagctgaaca gaaataccct atattacata gagaagcttt 2700 ggccatagtt ttcgcaatgg aaaaatttta taagtacgtg cttggacaaa aggtgactat 2760 tgttaccgac cacaaacctt tactgggtat tttcaatagc aggaaaggag ggcctcctat 2820 aattgctaca aggctgcaac gatatttctt aagattatcc atattcgatt tcactatcac 2880 tcatatggca ggcaaagaaa acctagtagc tgactgtctt tcaaggctac ctgtaaatca 2940 ggatctcagt agagcagatt tgttagaaaa tgattatagt tcgttcaatc aattgaatta 3000 tctagtagat gacaggaagg ttaacttgaa ttcaatatta ataagcaaag agtctgaaag 3060 cgatcccttg ctgtcaatcg ttataaaata tgttcgtaaa ggctggccaa gtcacgtgaa 3120 ggatcaaaaa cttaaaaact tttactctaa aagacatgag ttagatgtag aatccggatg 3180 cctaatattc ggtgaaagaa ttgtcgtccc aaactcgcta aaattacctt ctctccaatt 3240 acttcattca aaccatcgag gaatacaaaa aatgaagcaa attgctagaa gattcctgta 3300 ctgggatggt tttagtacag acatagaaaa ctttgtgaag tcatgcaaaa aatgtcaaat 3360 tttagggatt gatagaacac ccaaagttta tgggaattgg ccagcagctt cgacagcctt 3420 tgaaagagtt catatagatt tcttccaaaa gtttaatagg acttttttaa tattagtaga 3480 tgcatattca agatggatag aaatacacaa aatgagtaaa acaaacgctg aaaatgttgc 3540 acaagtactc gacaatattt ttgctatttt tggcttcgca gccataatcg taagcgacaa 3600 tggaccacca ttcaatagtt tctattttaa aaaatattgc cagtcacgga acattgagca 3660 tattctttcc ccaccgtacc atcctgctag taatggccta gcagaaaggg cagttcaaac 3720 cacaaaagcg gtgttaaata aactaattga gaaagattct acttcgtcat tacagatcga 3780 gaatgaaata aacaaatttt tgcatcacca tcatcaaact cctaccacag aagacaaaat 3840 aatcccaaat gatcgaattt tctcattctc tcctcgcaca gaattaacta gcatgaaata 3900 taacaaatgt atattcagcg atcagaacaa atccaaaact aaagcaacct ttaatattaa 3960 tgataatgta atctatacgt acaaatcaaa tggtagggca tacagttctg aagcagtaat 4020 tgttaaacaa ctttctaatc taacatacac aattgacgta gatggtgata aacgaacagc 4080 tcataaaaat cagttaaaaa gtgttccgca aaaacagttc attctgaaaa atcacgtcaa 4140 taccaggtct ttgcgtaacg aactagaaca tgacgtcgtt gataatgatc aagtgtcaat 4200 caaaaggaaa agttcccaaa gaacagcaac aaagcctaaa ccaaaattgt accagacatt 4260 acgtcgctct agtagaatca acaaatcaaa atatagaact gtaaatgtga aatcaattgt 4320 taagaaaaag aaaatgtagt tatacattgc atactgtaag gaaaaattgt aggttaagta 4380 tctgaattga atttaagaat gaagtgaaat actattgatg aaataaaaaa aaatgttaaa 4440 ttaccttaac atagatttta aaggggggag atg 4473 // ID hATm-1_HR repbase; DNA; INV; 3434 BP. XX AC . XX DT 30-OCT-2007 (Rel. 12.1, Created) DT 19-JAN-2011 (Rel. 16.02, Last updated, Version 2) XX DE hATm-1_HR, a family of autonomous hATm DNA transposons - a a DE fossilized copy. XX KW hAT; DNA transposon; Transposable Element; hAT superfamily; KW Autonomous DNA transposon; hATm group; hATm-1_HR. XX NM hATm-1_HR. XX OS Helobdella robusta OC Eukaryota; Metazoa; Annelida; Clitellata; Hirudinida; Hirudinea; OC Rhynchobdellida; Glossiphoniidae; Helobdella. XX RN [1] RP 1-3434 RA Kapitonov V.V. and Jurka J.; RT "hATm, a distinct group of hAT DNA transposons in animals."; RL Repbase Reports 7(10), 1049-1049 (2007). XX RN [2] RP 1-3434 RA Kapitonov V.V. and Jurka J.; RT "Consensus sequence of hATm-1_HR."; RL Direct Submission to Repbase Update (19-JAN-2011). XX DR [2] (Consensus) XX CC hATm is a separate group of hAT DNA transposons. Significant CC identity between hATm and hAT transposases appears after PSI CC BLAST iterations. hATm transposons exist in genomes of different CC animals, including segmented worms (Helobdella robusta, leech), CC flatworms (Schmidtea mediterranea, freshwater planarian), insects CC (Aedes aegyptus, mosquito), and tunicate (Ciona savignyi sea CC squirt). All identified hATm transposons are characterized by CC 8-bp target site duplications and conserved termini CC (5'-TAGGgtGgyccnnA). The planarian hAT-7_SM, hAT-8_SM and CC hAT-10_SM transposons also belong to the hATm group. Their CC putative classification as hAT transposons was not supported by CC significant similarity (BLAST and PSI-BLAST) to known hAT CC transposase proteins. CC hATm-1_HR is a young family of hATm autonomous DNA transposons CC identified in the leech genome. TIRs are 14-bp long (one CC mismatch). The TPases is encoded by a single ORF that CC contains a stop-codon. XX FH Key Location/Qualifiers FT CDS 549..2921 FT /product="hATm-1_HR_1p" FT /translation="MASFLSTSATRSSRQKDKIYLIGFVTHQITGGKLPSN FT RQVLRSLFYNIRQVKLNIKDAARLTIKKVFIFWEKARIQTKHLKDSVAKLE FT KLHEEWRKLQKNSNRTGPAQTEKEKLFKAILDDLFDIAHQDALQTATEEDK FT LFLLKQREKGRPGVMGGVDLQCVKAEERWQKRNASEMARLTKFRTSSGELI FT KSRLLFLFNYNDVFYILELEELKSLGSSTSDEDEIEDNDQRTEKVDMVTLN FT YGPEKKMKKYTRGTEEIVNEKLFLILDRCAISDRDAARIISATIESLGHDS FT QQYIVSRSVIRLRRQEFRKERAKLIQQRFTNSELEGAVVHWDGKLLPDMLN FT KENVERVAVLISCGEEEQLIGVPLLENGAGSTIAKSVYSELGKWGALDKIQ FT AMSFDTTAVNTGRIKGACVLLEQLMEKKLLYLPCRHHILEVVLRSVFDAAL FT GKTTGPQSDIFKKFKTEWNSIDKKKFKSGVKDKSVASKLINPGQISSYLLE FT QLTVHHPRSDYKELISLCLIFLERFPSENIVFAAPGAFHHARWMAKAIYSL FT KIYLFREQFQLTIFEKQGFHDICLFIINIYVKVWLDAPKPALAPNQDLQLL FT KSLVNYNKVNKFISDISVSKFINHIWYLNPEQAVFSLFDDSLSNCVKKRMA FT TKLISQANEDEELDDYCDMKPLIKINEVSEILDKNADHFISSQSINFSKRF FT NINDQFLHTNPELWSNNEEYLKSKKIVDGLKVTNDTAERGVKLITDYNSCI FT TKEEDQKQYLLQVIAECRTKFPGCSKASLSEPLPFEQTMS" XX SQ Sequence 3434 BP; 1233 A; 494 C; 617 G; 1090 T; 0 other; tagggtggtc acaagaaatt tttttttgaa atcttatcgg gggaaccccc taaattgtgc 60 cacttgacta aaaaatgaca tttgtaaagt tttagctcaa ttggataatg ctaacccgtg 120 cctcatcgag gttgaaattt cgaaaaattg gaaaattttg aaaaaaaaac gcagtttttt 180 gtttgtaaaa taaaaacggt tcattctaga aacatgattt aagttttatt ttgttcattg 240 atttgttttc ttcaaatgtt actatgatca ctatttagct acaagttata gtttctgaca 300 tatttgcatt gctacagata cacattgaaa aaacgttgaa attctccata acacaaaaac 360 acacaataac ttatttacta ttactaaccc gggtgctata tgttgttttt atgtattaat 420 attaagatct tttggtttat ccacgttttt tcatctttgg tttatattct ttgtttttcc 480 ggtttttaaa acacacgata ttaaatagtt taatcaccac gtttattgtt ttggtattct 540 cagttgtgat ggcttccttt ttatctacat ctgctacgag aagttcaaga caaaaggata 600 aaatttattt aattggtttt gtgacccacc aaattactgg tggcaaatta ccatcaaatc 660 gtcaagtgtt acgtagtttg ttttataaca tccgtcaagt aaagctgaat ataaaagatg 720 cggctaggtt aacgataaaa aaagttttta ttttctggga gaaagcaagg atacaaacaa 780 aacatttgaa agactctgtt gctaagctag aaaaacttca tgaagaatgg agaaaactac 840 aaaaaaattc taacagaact ggaccagctc agacagagaa agaaaaactt tttaaggcca 900 tacttgatga cctatttgac atagctcatc aagatgcact ccagactgca acagaggaag 960 acaaattgtt tttgcttaag cagagagaaa aggggcgccc aggtgtcatg ggaggagttg 1020 atttacagtg tgtaaaagct gaagagcgtt ggcaaaaaag aaatgcttcg gaaatggcga 1080 ggttaacaaa atttaggaca agttcaggtg agttaattaa aagtagattg ttattcttgt 1140 ttaattataa tgatgtgttt tatattttag aattagaaga gcttaaatcg ttaggcagca 1200 gcacgtcaga tgaagatgaa attgaagata atgatcaacg cacagagaag gtagatatgg 1260 tgacactaaa ttatgggcct gaaaaaaaga tgaaaaaata cactagaggt actgaagaaa 1320 ttgttaatga aaagttattt ctaatattgg ataggtgtgc tatttcagac agagacgcag 1380 ctagaataat atcagctaca attgaatctc ttggccatga ttctcagcaa tatattgtaa 1440 gtaggagtgt catccgcttg cgccggcaag aatttagaaa agaaagagca aaactcattc 1500 aacaaagatt tacaaattca gaattagaag gagcagttgt acattgggat gggaaattac 1560 ttccagatat gttaaataaa gaaaatgttg aaagagttgc agtcttaatt agctgtggag 1620 aagaagaaca gttaattgga gttcccctat tagaaaatgg tgctggcagt acaattgcaa 1680 aatctgttta ttcagaactt ggaaagtggg gtgcactaga caagattcaa gcaatgtcct 1740 ttgacacaac tgccgtgaat actggccgca ttaaaggtgc ttgcgtttta ttagagcaat 1800 tgatggagaa aaaattattg tatcttccat gtcgccacca cattttggag gtagtgttga 1860 gatcagtttt tgatgctgca ttgggaaaga ctacaggccc tcaatcggat attttcaaaa 1920 aatttaaaac tgaatggaac agcattgata aaaaaaaatt taaatcagga gttaaagata 1980 aaagtgtagc cagcaaattg attaatccgg gtcaaatttc aagttatctt ctagaacaat 2040 taacagttca tcatccaagg agcgattaca aagaattgat tagtctttgt cttatatttt 2100 tagaacgctt tccgtcagaa aacatcgtgt ttgcggctcc tggtgctttt catcacgcaa 2160 ggtggatggc gaaagcgata tattcgttga aaatatacct tttcagagaa cagtttcaac 2220 tgacaatttt tgaaaagcaa ggattccatg atatttgcct atttattata aatatttatg 2280 tgaaagtgtg gttggatgcc ccgaaacctg cacttgctcc aaaccaagat ttacaattgt 2340 taaaatcttt ggtaaattac aataaagtca ataaatttat ctcagatata tcagtttcaa 2400 aatttatcaa ccacatatgg tacctcaatc cagagcaggc tgttttttca ttattcgatg 2460 attcattaag taattgtgta aaaaagagaa tggcaacaaa attaatatct caagcaaatg 2520 aggatgaaga acttgatgac tactgtgata tgaaaccact aataaaaata aatgaagtat 2580 cagaaatatt ggacaaaaat gcggaccatt ttatttcatc acaatcaatc aatttttcaa 2640 aaagatttaa tattaatgat caatttttgc ataccaatcc agaattatgg agtaataacg 2700 aagaatattt aaagagtaaa aagattgtgg acggattaaa agttacaaat gatacagcag 2760 aaaggggtgt taaattaata acagattaca actcatgtat tacaaaagag gaagatcaaa 2820 aacaatatct tttgcaagta attgctgaat gtcggacaaa atttcctggt tgctcaaaag 2880 cctccttatc cgagcctcta ccttttgaac aaacaatgag ttaaagtaga aatttatttg 2940 atgtataaca actataaaca atataatatg aataaaaaat gtgtttattt gttaataata 3000 gtgatttaga tacaacggtg ttattttaaa aacatttctt tataaacaaa ttaagttatt 3060 gtgtgttttt gtgttatgga gaaattcaac gttttttcaa tgtgtatctg taacaatgca 3120 aatatgtcag aaactataac ttgtagctaa atagtgatca tagtaacatt tgaagaaaac 3180 taatcaatga acaaaataaa acttaaatca tgtttctaga gtgaaccgtt tttattttac 3240 aaaccaaaaa ctgcgttttt tttcaaaatt ttccaatttt tcgaaatttc aacctcgatg 3300 aggcacgggt tagcattatc caattgagct aaaactttac aaatgtcatt ttttagtcaa 3360 gtggcacaat ttagggggtt cccccgataa gatttcgaaa tttttttttt acatatagcg 3420 ttgtaaccac ccta 3434 // ID BEL-51_CQ-LTR repbase; DNA; INV; 306 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-51_CQ_; KW BEL-51_CQ-I; BEL-51_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-306 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 256-256 (2011). XX DR [2] (Consensus) XX SQ Sequence 306 BP; 90 A; 89 C; 71 G; 56 T; 0 other; tgttcggtac caccgaaggc tgaaacccca ccgcgtttcc gacccgaaga ctcaccgtgt 60 tacgccaatc tgctttaccg acgagcgaga aacgtcactc attgcgagag gccaaaccgg 120 ccaaacaagt ccacgtccaa ccttcgatcg agaaggacac acagcagcaa caggaacagt 180 tttttagagt agcaggagga ggaagaggaa aaataaaggt agaaaattaa gatctccggt 240 gttttttcct cctgcgtgac tacgctccat acagtccacc actcccatgt cggccaggcc 300 cgaaca 306 // ID Gypsy-12-I_HM repbase; DNA; INV; 3777 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-12-I_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3777 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 396-396 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(27..806,810..3725) FT /product="Gypsy-12-I_HM_1p" FT /translation="MDRALRPSRLDTLPNSPSATKLFKHWLRTFEYYLDEL FT PQELNKLKILTNFVVPDVYEIISECPNYEAAIQALQSVYIKPSNEVYARHL FT LATRKQQPGESFDEYLHALKVLAKECNFKQVSAIEYRDEYIRDAFITGINS FT QVVRQRLLENSTISLDEIFSKARTLESAQKNAENYLQYNTSDTTVAAITKK FT PKQFNSNSVSNCWNCGNQRHAKAVCPARESICFKCEKVGHFAKLCKSSKLT FT TAGMNLVDQPLNSYNPSLTLDYSEHKTNLKNISASISPLTKSTIGMLVNGS FT FVNALIDSGSTDSFIHPRLVQKLNLTVHRQNQKQVSMASSSLSSTVCGVVF FT VNISIKNEHYKLVKLNILNELCMDVILGLDFQKQHKAVTLKFEGKRPPLLI FT CGLSKLNVSPPSLFGNLSPNCKPIASKSRRYSKEDQLFIEMEVQRMLKEDI FT IEHSNSPWRAQVVVTKGERSKKRLVIDYSQTINKYTQLDAYPLPRIDDLVN FT KIAQYKIFSTVDLKSAYHQIPITKNDRKFTAFEANGRLFQFKRMPFGIENG FT VACFQRTINEFIEKEGLLDTYAYLDNVTICGKTQEEHDLNLEKFLKSAKNI FT NLTYNHDKCSFSQDKICILGYLIEEGKLKPDPSRLQPMKEMPPPHDAKSMK FT RIIGLFSYYSKWIPNFSDKIASLTKNTKYPLDHQCVNDFNQLKQDIANAVT FT DYVDENVPFEVETDASDIAIAAVLNQNGRPVAFFSRLLQKSELKHPSIEKE FT ACAIIEAVRHWKHYLTGRHFKLITDQQPVSYIFEKQHKSKIKNDKIYRWKI FT ELSCYNYDIVYREGKLNIPADTFSRIHCSAINTNNLIQLHQSLCHPGVTRF FT YAFIKNRNLPFSIDDVKRITNQCKLCCECKPRYYKPLKSNLIKATQPLERL FT NLDFKGPLPSETNNKYFLTIIDEYSRFPFAIPCPDISAQTVIKCLSQVFSI FT FGLPSYIHSDRGSAFISKELKQYLLEKGIATSHTTAYNPQGNGQAEKYNGT FT IMKSIEIATRQHNLPIKAWEKVLPDVLHSIRSLVNTKTMETPHERMFKHLR FT KTASGCTLPSWLSHPGPILMKRHLRNNKYEPLVDEVQLIDANPQYAHIKYP FT DGRETTVSLRHLAPTSSAPIIDQTNHIEPSPENAFNNIKPSPENAFNKPSY FT PYKPLTPEKYGLTNTNPENFPNQPPLYQPPLAENLVNDACPINPLLLITPG FT KDTRRSSRAIRRPLRLIEEDT*" XX SQ Sequence 3777 BP; 1346 A; 725 C; 591 G; 1115 T; 0 other; attttctact cttatcaaaa taaaatatgg atagagctct aagaccttct cgtttggata 60 cattaccgaa ctcaccttcg gcaacaaaat tatttaaaca ttggctcaga acatttgaat 120 attatcttga tgaactccct caagaactca ataagttaaa aatcctgact aattttgttg 180 tgcccgatgt ttatgaaata ataagcgaat gtccaaatta tgaagcagca atacaagctt 240 tacaatctgt gtacatcaaa ccatcaaatg aggtttatgc acgacacctc ttggcaactc 300 gtaagcaaca gccaggagaa agttttgatg agtatttaca cgctcttaaa gttttagcaa 360 aagaatgtaa ctttaaacag gtgtctgcta ttgaatatcg cgatgaatat attagagatg 420 catttataac agggataaat tcacaggttg tcagacagcg actactagaa aacagtacaa 480 taagtctgga tgaaatattt tcaaaagcta gaaccttgga atcagcgcag aaaaatgctg 540 aaaattacct ccagtataat acatcagata caacagtagc tgcaataaca aaaaagccaa 600 agcaatttaa ctcaaattct gtttcaaatt gctggaactg tgggaatcaa agacatgcta 660 aagctgtctg tccagctaga gagtcgattt gttttaaatg tgagaaagtt ggtcattttg 720 ccaagttatg taaatcttct aaacttacaa cagcagggat gaatcttgtt gatcaacctt 780 taaactcata caatccttca ctcacttgac tagactactc tgaacataaa actaatttaa 840 agaacatttc tgcttccatt tcacctttga ctaaatcaac aattggcatg ttggttaatg 900 gttcattcgt aaatgctctc attgacagtg gtagcactga tagttttatt caccctcgat 960 tggtacaaaa actaaaccta acagttcata gacaaaacca aaaacaggta tctatggctt 1020 catcatcttt atctagtaca gtttgtggag tagtttttgt caatatttca ataaaaaatg 1080 agcattataa actagtaaaa cttaacatct taaatgagtt atgcatggat gttattctgg 1140 gtctcgattt tcaaaaacaa cacaaagctg taacccttaa gtttgaagga aaaagacccc 1200 cactattaat ttgtggtttg tcgaagttga atgtatctcc ccctagttta tttggcaatt 1260 tatcgcctaa ctgtaaacca atagcttcaa aatctcgtcg atatagcaaa gaagaccaat 1320 tattcattga aatggaagta caacgtatgc ttaaagagga cattatcgaa cattcaaatt 1380 ctccctggag agctcaggta gttgtaacta aaggagaacg ctctaagaaa agattagtta 1440 ttgactacag tcaaacaata aataagtata ctcaactaga cgcttaccca ttaccaagga 1500 ttgatgatct agtaaacaaa attgctcaat acaaaatatt tagtacagtt gatttgaaat 1560 cagcctacca tcaaattcca attaccaaaa atgatagaaa atttacagct tttgaagcaa 1620 atgggaggtt gttccaattt aagcgaatgc cttttggaat tgaaaatggg gttgcttgtt 1680 ttcaacgtac cataaacgaa tttatagaaa aagaaggact tttagatact tatgcttatc 1740 tagataatgt gacaatatgt ggtaaaaccc aagaggagca tgatcttaat ctggaaaaat 1800 ttctcaagtc agctaaaaat ataaatctta cttataatca tgataaatgt agcttttccc 1860 aagataaaat ttgcattctt ggttatctta tagaagaagg aaaactcaaa ccagatccca 1920 gtcgtcttca acctatgaaa gaaatgcctc ctcctcatga tgctaaatcc atgaagcgta 1980 tcatcggact attctcatac tactctaaat ggattccaaa cttttccgat aagatagctt 2040 ctcttaccaa aaacaccaag tacccattag atcatcagtg tgtaaatgat ttcaatcaac 2100 ttaaacaaga cattgcaaat gctgtgacag attatgtgga cgaaaatgtt ccatttgaag 2160 tagaaacgga cgcatctgac attgctatag ctgcagttct taatcaaaat ggccgccctg 2220 tagctttctt ttcaagatta cttcaaaaat ctgaactcaa acatccctct attgagaaag 2280 aagcttgcgc aataatagaa gcagtcaggc attggaaaca ctaccttact ggacgccatt 2340 ttaaacttat tactgatcag caacctgtat cctatatctt cgaaaagcaa cataaatcaa 2400 agattaaaaa tgataaaatt tatcgatgga aaattgaact atcgtgctat aattatgaca 2460 ttgtttacag ggaagggaaa ctaaatattc cagctgacac attttctaga atacattgct 2520 cagcaataaa cactaacaac ttaattcagc ttcatcaatc actctgtcat cctggagtaa 2580 ctaggtttta tgcttttatt aaaaaccgta atttaccttt ctcaattgat gatgttaaac 2640 gaataactaa tcagtgcaaa ctatgttgtg aatgtaagcc tagatactac aaaccactaa 2700 aatctaattt gataaaggcg acacaaccac tggaacgttt gaacttagat tttaaaggac 2760 ctcttccttc ggaaactaat aacaagtatt ttttaaccat aattgatgag tattctcgtt 2820 ttccttttgc aatcccgtgc cctgatattt cagctcagac tgtaattaag tgtttgtcac 2880 aagtattctc gatttttggt ttgccaagtt acatccattc cgatagaggt tcagccttca 2940 tcagtaaaga attaaaacaa tacttactag agaaaggtat tgcgacaagt catactacag 3000 catataatcc acaagggaat ggacaagccg aaaagtataa tggaactata atgaagtcta 3060 ttgagattgc tactagacaa cacaatttac ccatcaaagc ttgggaaaaa gtattgccag 3120 acgttctcca ttctataaga tccttagtaa atactaaaac aatggagaca cctcatgaaa 3180 gaatgtttaa gcaccttcgt aaaacagcct ctggatgcac tttaccttca tggttaagtc 3240 atcctggacc aatactaatg aaaagacatt tacgtaataa taagtatgaa ccactggtag 3300 acgaagtgca actcatagat gctaatcctc aatatgctca cataaagtat ccagatggaa 3360 gagaaaccac agtttcccta agacacttag ctccaacaag ttctgctccg atcattgatc 3420 aaacaaacca tattgaacct tcaccagaga atgcctttaa taatattaaa ccttcaccag 3480 agaatgcttt taataaacct tcttacccct ataaaccatt aacacctgag aaatatggac 3540 ttaccaatac caatcctgaa aactttccta accagcctcc cctttatcaa cctccgctag 3600 ctgaaaatct tgtcaatgat gcttgtccga tcaatccatt gttattaata acacctggaa 3660 aagacacacg tagatcctca agagcaattc gtcgaccatt acgacttata gaagaagata 3720 catgaaaaac cttgtttgta ttatagtata ttctgtttgt attatggctg cagagaa 3777 // ID Kiri-11_CQ repbase; DNA; INV; 4318 BP. XX AC AAWU01038353; XX DT 25-DEC-2010 (Rel. 16.01, Created) DT 25-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Kiri non-LTR retrotransposon from Culex quinquefasciatus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-11_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4318 RA Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 130-130 (2011). XX DR EMBL/GenBank/DDBJ; AAWU01038353; Positions 14531 10214. XX FH Key Location/Qualifiers FT CDS 346..1434 FT /product="Kiri-11_CQ_1p" FT /translation="MEEIKCNQCNKVDPPGKRGRKSAKNKNVNKPKIDWIE FT CDGCSAWYHSVCVRVNEALLGNMNEYYYFCEKCPLRGSLVSKPAHPAPASP FT AAPAGPSIQEIEQLKQTISDLAAQLVKIQAEVDSVRTTSKKQHDRLQSKLN FT SADQRDGQCAAQSALINNIEQKLEIIEAGARLASTCSQSVNSCRLAINKIP FT AREGENLRGIVESVINFLGVPQEMSHITSCFRLEVKPSKWSDRSLTPTIVA FT VFDNRESRERVLRRYFEKHKDAKLCHLKHAPDLEYRFTINEMLSVHAFRIR FT NLALRYKQRKAIRSVFVRNDSISVLLPGKQRYTPVNSPEHLLELLGHDHQA FT DESSVFFDARSLDVSSSSRC" FT CDS 1470..4310 FT /product="Kiri-11_CQ_2p" FT /translation="MSRTNLPNLRIGHLNVRGLEHSIDGVKLLLDKTQYHF FT VGVTESKLKQSSPAGPIRVPAYNFIKHSLPTGRGRGARTCGGVGLYVQKGI FT KATPILRSLHDPTIPISQRFEFLAVQAKINDLNICVVVLYNPVCTNPHFAQ FT AYEKLLIDLLDFGFDRLFLVGDFNINVAAPQQSVNLVALSRINAAFNLTVL FT PTGPTRITETTSTTIDLLITDSPGSVIKSKTCTGNAISDHEVVYLLSNVRM FT PRSAPQTVRTRNLRAIDPMRLQMDFQARNLQPILEAEDTSAKAVLLNGVLT FT DLLQQHAPERTIVVRDKRTPWISETIKQAVALRDLAFALYSRNPNRVRGDN FT QWQDYIQKRDRANSLIFAAKKRYADQHFDANLPAKTLWCNLRREGIHNNAK FT KNQATEGTDADELNKFFSDGHRQLLGARQENPAEPAHRTEIDHGADEFTFR FT ATSGEEICRKIYEIQTNATGPDDIPISFIKLLCPFVLPILTHLFNEIIMTK FT VFPECWKKAIITPIPKQSNPAQPKDFRPISVLPAVSKVLEKILLSQIAEHL FT DNPNAPLLAGQQSGYRKGHSTTTALAKVTHDLYENLDNGRCTVMVLVDFSL FT AFNCVKHQKLRTKLQSEFRFSRAACDLISSFLARRKQSVRLADTLSGERDV FT PDGTPQGSCLSALLFSLYINSLPSTLKCSYHLYADDLQVYTSGPSADVDTL FT VRAINEDLEAIARWADANWLAPNPKKTQAIVFCKTGIVEPQTNITFCGQTV FT PLSETVVNLGLKLDRNLTWKPQVNDVVMKTYNVLRTFRRFGSVLSTQTRLK FT LVQAVVMPIFSYGDIIYTPGLSAALKGQLHRCFKSTVRFVYGLRRRDTTAA FT VRDNILGVDLPSNQRLRICCFMRQAYHGSLPEYIQQHVQRGQLERARCFII FT PRHTTSSGKSVLVYGATCWNGLPIEAKTKTTLFSFKSCVKQLV" XX SQ Sequence 4318 BP; 1155 A; 1215 C; 998 G; 950 T; 0 other; cagtgtgtta gtgacatttg acagataaac aaacactttg gcaacactgc taggccacag 60 cagcaaagaa aaccgcgctc cctcaaaccg cttttttacg ctgtttttta tcaaatctgc 120 aaacaaaata aagtgcaaaa ctcaaactag tggtcctaaa acagtttttt ggtgtttgcc 180 aaagctttcc tactttccaa cctgtgaaaa tccgtttcca aatttccgaa aaacacaagg 240 aacttacact ccaagtgact taccgtttta tcgtttgtga cgtgtgttgt tgtttaaatg 300 tgtttgtaaa ttggaactgt cagcgtcgtt gtggtcgtat caccgatgga ggaaataaag 360 tgtaatcagt gcaacaaggt ggacccacct ggaaaaaggg gaaggaaaag tgccaaaaac 420 aaaaatgtaa acaaaccaaa aatcgactgg atcgaatgcg atggatgttc agcatggtac 480 cactccgtgt gtgtccgtgt taacgaagcg ctgttgggta acatgaacga gtattactac 540 ttctgcgaaa aatgccctct gcgcggaagc ctcgtatcga aacctgcaca tcctgctcct 600 gctagccctg ccgctcctgc tggccccagt atccaagaga tcgagcaact gaagcagacg 660 attagcgacc ttgctgccca gctagtgaag attcaagctg aagtggattc cgtccggacg 720 acaagcaaga aacaacatga ccgactacag agcaagctca acagtgctga tcaacgcgat 780 ggccagtgcg ctgctcaaag cgcactaatc aacaacatcg agcaaaagct ggagatcatc 840 gaagcaggtg cgaggctcgc aagtacgtgc tcgcaatcgg tcaatagctg ccgcctcgcc 900 atcaacaaga tccccgcacg ggaaggtgag aatctgcgcg gtatcgtcga aagtgtgatc 960 aacttccttg gcgtccctca agagatgtca cacatcacga gctgtttccg gctcgaagtg 1020 aaaccgtcga agtggtcgga ccgttcactt actccgacga tagtagcagt gtttgacaac 1080 cgggagtcac gtgaacgagt gctcagaagg tacttcgaaa agcacaaaga tgccaagttg 1140 tgtcatctca agcacgctcc cgacctcgag taccggttca caatcaacga gatgttgtca 1200 gtgcatgctt tccgcattcg aaacctggcc ctgcggtaca agcaacggaa agccatccgc 1260 tcagtcttcg tccggaacga cagtatctct gttctgctcc caggtaaaca aaggtacacc 1320 cctgtcaaca gtccagagca tctcctcgag ctcctgggac acgaccacca agcagatgag 1380 tcatctgtgt tctttgacgc tcgctcgttg gacgtgtcat cttcctcccg ttgctgattt 1440 tccactcgac gctgacgatc ccccgaacaa tgtctcgaac aaatttgcct aatctccgta 1500 tcggtcacct aaacgtccgt gggcttgagc acagcatcga tggagtaaag ctgcttctgg 1560 acaagacgca gtaccacttc gtcggtgtca cggaatccaa acttaaacaa tcttcaccag 1620 ctggtccaat ccgtgtccca gcctataatt ttatcaaaca ctcactgccg actggccgtg 1680 ggcgtggtgc gcgtacgtgt ggtggtgttg ggctgtacgt gcagaaaggg atcaaagcca 1740 ctccaatcct tagatcctta catgacccaa ctattcccat cagccagaga tttgagtttc 1800 ttgccgtaca agctaaaatt aacgacctca acatctgcgt ggtggtgttg tacaaccccg 1860 tctgcacgaa cccacacttt gctcaagcct acgaaaaact tcttatcgac cttctcgact 1920 ttggatttga ccgactattc ttagtcggcg atttcaatat caatgtggct gcacctcaac 1980 aaagcgttaa tctggttgcg ctttcccgga tcaacgccgc attcaacctc acagtcctcc 2040 caaccggacc gacacgaatc accgagacca catcaaccac tatcgacttg ttgataaccg 2100 actctcccgg atcggtgatc aaatcgaaga cctgcaccgg caatgctatc tcggaccacg 2160 aggttgtgta cctgctttcg aacgtcagga tgccgcgatc tgccccgcag actgtgcgca 2220 ctcgtaacct acgtgccatc gacccgatgc ggctgcaaat ggactttcaa gccagaaatc 2280 tccaacctat cctcgaagcc gaggacacgt ccgcgaaagc agttctgctg aacggcgtac 2340 taacggatct gctgcagcag cacgccccag agcgaaccat cgttgttcgc gataaacgaa 2400 ccccctggat ctccgagacc atcaagcaag ctgtggcact gcgagacctg gcgtttgcgc 2460 tctactcccg taaccccaac cgagtcagag gtgataacca gtggcaagac tacatccaga 2520 aacgtgatcg tgcgaactcc ctcatctttg ccgccaagaa acgctacgct gatcaacatt 2580 tcgacgccaa cctgccagct aaaacattgt ggtgtaatct gcggagagaa ggtattcaca 2640 acaacgccaa gaagaaccaa gctaccgaag gaactgacgc cgacgaactg aacaagttct 2700 tcagtgacgg tcatcgtcag ctgctaggag cacgccaaga gaatccagcc gaacccgcac 2760 acagaacaga aatcgaccac ggtgcagatg aattcacttt tcgagccaca tccggcgaag 2820 aaatctgtcg gaaaatctac gaaatccaga caaacgctac cggcccggac gatatcccga 2880 tttctttcat caagctgttg tgtccatttg tgcttcccat ccttacgcat ctgttcaacg 2940 aaattataat gaccaaagtg ttcccggagt gctggaaaaa ggccatcatc acccccattc 3000 cgaagcagtc gaacccagcc cagccgaagg acttccggcc gatcagcgtt ctgccagcgg 3060 tttcaaaagt gttggagaaa atattgctgt cgcagatcgc tgaacacctc gacaacccca 3120 acgctcctct gttggctgga caacaatctg gttaccggaa aggacacagc acaaccaccg 3180 cactagccaa ggtaacccac gacctgtacg agaatcttga caacggtcgc tgcaccgtaa 3240 tggttctcgt ggatttctcc ctcgcgttca attgcgttaa gcaccaaaaa ttaagaacga 3300 agctgcaaag cgaattccga ttctctcgag cggcgtgcga tctgatatcg tccttccttg 3360 cgcgaaggaa acaatcggtt cggctcgcag acacattgtc aggagagcgc gatgtccccg 3420 acggcacacc gcagggttcc tgcctcagcg cgttgctgtt tagcttatac ataaacagcc 3480 tcccatcgac cctaaaatgc agctatcatc tttacgccga tgacctacaa gtttacacct 3540 ccggtccatc tgctgatgtc gacacacttg tacgggccat caacgaagat cttgaggcga 3600 tagcgcgctg ggcagacgca aactggcttg cgcccaaccc gaaaaaaacc caggcgatcg 3660 ttttttgcaa gaccgggatc gtcgaaccac aaacaaacat caccttctgt ggacaaactg 3720 ttccgctgtc ggagaccgta gtcaaccttg ggctcaaatt ggatcggaac ctgacgtgga 3780 agcctcaagt gaacgatgtg gtgatgaaaa cgtacaatgt gctacgaact ttccgccggt 3840 ttggctcggt actgtcaacg cagactagac tcaagttggt gcaagcggtg gtgatgccga 3900 tcttctccta cggcgatatc atctacactc cgggcctgtc agctgcactg aaaggacagc 3960 tgcatcgctg cttcaagtcg accgtacggt tcgtctacgg cctccgacga cgagacacaa 4020 cagcggctgt tcgcgacaac atcctgggag tggatttgcc gtccaatcaa cgcctaagga 4080 tctgctgctt catgcggcaa gcgtaccacg gcagccttcc cgagtacatc cagcagcacg 4140 tgcagcgagg acaactggaa cgagcgcgct gtttcataat ccccagacac actacttcta 4200 gtgggaaaag tgtactggtg tacggagcca cttgctggaa cggattgccg atcgaggcaa 4260 agacaaaaac aaccctattt tcgtttaaaa gctgtgttaa acaactagtt taagcctt 4318 // ID DNA6-1_CQ repbase; DNA; INV; 1877 BP. XX AC . XX DT 28-DEC-2010 (Rel. 16.01, Created) DT 28-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Culex quinquefasciatus - DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA6-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1877 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 75-75 (2011). XX DR [2] (Consensus) XX CC ~92% identical to consensus. 6-bp TSD. 15-bp TIRs. The central CC region is very AT-rich. XX SQ Sequence 1877 BP; 717 A; 183 C; 218 G; 759 T; 0 other; cagggttgtt acggacggcg cggatcgcgc ggatggcgcg gatggcgcgt atcgcgcgga 60 tctggcgcgg atttgctggc taattttgct caggcgcgga tttcgcgcgg atagcaattt 120 tgataaacaa aataattgag aacgatagaa tttctttcaa attacaaagg aaaatattat 180 taggaattaa ataaattcaa tctttttcgt tgagtgcgaa aacatcaatt tctaagaagt 240 tgttaatgaa gaaaattttc ttctaaaaat tattgatttt tttttcttaa tacgaaactt 300 cgaaataatc ttcaaggagc gattttttta atgtattgag atttgttata taaatttaac 360 ataacattac aactcattat ttttaaaaca tctgcttaac aattcaaata aataccaatt 420 ttattaaaat caaaataaca aaattcgaaa aattacagaa tttaaaaatt tgaagctatg 480 aaacttttta gattttctat gttttttcaa tataaaaatt taaattttta aatttttggt 540 ttattgaaaa aataaaaatt ctgttgccac tctgattcag gtagaattga ttctactccc 600 aattatcaca acatgacgaa atccacttag aaatctgcta aaatctccgt ctaaaagcga 660 aatgtaatgt taaaattaaa aaataagaaa tctggaattc aacaatttga aatttcagaa 720 tttacatatt caaaaaataa taaaaaaaat agaattcata atttgaaatt gtcaaaactt 780 acagactcaa aaaacgtaaa aaagtacgaa ttattaaatt gaaaaattac gaaaatatta 840 caattgattt tttgttaatt tttatttttt taagaaagtt cttaagttat caaattatta 900 aattattaaa ttattaaatt attaaattat taaattatta aattattaaa ttattaaatt 960 attaaattat taaattatta aattattaaa ttattaaatt attaaattat taaattatta 1020 aattattaaa ttattaaatt attaaattat taaattatta aattattaaa ttattaaatt 1080 attaaattat taaattatta aattattaaa ttattaaatt attaaattat taaattatta 1140 aattattaaa ttattaaatt attaaattat taaattatta aattattaaa ttattaaatt 1200 attaaattat taaattatta aattattaaa ttattaaatt attaaattat taaattatta 1260 aattattaaa ttattaaatt attaaattat taaattatta aattattaaa ttattaaatt 1320 attaaattat taaattatta aattattaat tattaattaa attattattt ttttgccatt 1380 ttcttagttc ggatcatttt cggagtcata ttttagttta gagaaattta gagaagaaaa 1440 taatagaaat ttcgtgtttc gattttccag agtaaagttt tggcgagaat tatcaagtac 1500 taaatcccta gattttataa tttggtgatt tattttatgg ttttaatttt ttaaattttc 1560 gaatttaatt ttttttcctg aatttgtttt taaagcaacg atatttaaag tgttgcaaaa 1620 aatgctaata ttgtttcaga aatggcaaaa aaggttccat ttggtacagg tgggatatga 1680 atccacaact gatcaacggt actgatccaa atgccttctg tattattctt ttttcgttta 1740 aaatttcatt cattatgtta ctctcttcta tgtttgtcgc gatttttttt atatggcgcg 1800 gatttcgcgc ggattgggtt ttggggtcgg cgcggatctg gcgcggattt tttctcgact 1860 tttccgtaac aaccctg 1877 // ID Gypsy-222_AA-I repbase; DNA; INV; 7285 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-222_AA_; KW Gypsy-222_AA-LTR; Gypsy-222_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7285 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1047-1047 (2011). XX DR [2] (Consensus) XX CC Positions [4850-5326] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS 510..2546 FT /product="Gypsy-222_AA-I_2p" FT /translation="MELARLQSLYYALNAAHLADDELDHELQIRDVNMIGE FT SRSKQERSLRSLLKAEKEGKAISFKPYWVTVVDELKWCDEKLLGVKKTLEE FT RKSRKAPDQIFKTKLLHIFFRMERLKSHTKEEEDLNSIAIIAGVCTALLNT FT FFSITSPLLEVREAETAILNETIRQMREEALENRAHSDRVVGASGIDHGNV FT VGIGAGESGVDDHRRETGENENDEHDDNREQSRENEDDDGDDDAENVVGKL FT SDENERLKVAVNQLLQRIQALESANATKQREIERIKQSTPVDDRSRQNVDE FT QEQGQGQQNFYEWMKARCGSLESMVDVNKIISSSRTEKPNVRQRNEVDLPK FT GSRLPVHKWSVRYDGTDNGRRLNEFLKEVEFNARSEGFTEAELFVSAHHLF FT TRKARSWFMEVNGNNELGTWQNLVRELKNEFLPADIDYQYERLANSRRQGP FT REKFQDFYLDMVRIFRSMSRQWDDARKFDVLFRNTRADCRTAMLAANVTSI FT PKMREFGKRFDSINWQVYAKRENRFSRPNSQVEEINQQRQNWRGTESNVRN FT QSDYKGGYQGRNFNPNYNPPNKQFPPGKPGFQGRNSEQKFPQKPKPRPESN FT QQQNRQSDPKPSTSGTSALQRIVNAYIPIKRGVCFNCHDEGHNSKNCEQEK FT HVFCENCGFPGFQTNNCPFCQSKNAKKTAQ" FT CDS 3623..5698 FT /product="Gypsy-222_AA-I_1p" FT /translation="MDRVLGHGALEPSIFVYLDDIVIASQTFDEHLQKLED FT LARRLREANLSINLSKSKFCCQELPYLGYILSQDGLRPNPDRVKAILGYQV FT PKSVRQLRRFLGMINYYRRFIANYSEITAPLTDLLKNKPKRVPWNSAADTA FT FEVIKEKLIAAPVMANPNFLLPFTVQTDASDNAIAGVLTQVQNGEEKVIAY FT HSEKLKGAELHYHAAEKEGLAALRCIEKFRGYIEGTKFTLVTDSSALTFIM FT RAKWRSSSRLSRWSIELQQYDMEIRHRKGKENIVPDALSRSIEALDEEPED FT GWYKKLYAAVEDDPEKYLDFKIKDGVLYKFVAAKSDMLDYQFEWKQCVPTS FT NRSQILKQEHDENLHIGFSKCVEKIKQRFYWPRMSADIKKYIGSCESCKEN FT KHSTVSTAPEMGKQRVANRPFQIICMDYIQSLPRSKQGNAHLLVVMDLFSK FT YCLLAPVKKISTVSLCKILEEQWFRRLSVPQIVISDNATTFMSRDFQNLLE FT KYEVQHWANARHRSQANPTERLNRTINSMIRSYVKQDQKLWDSKISEIEFV FT LNNTIHSTTKFSPHCVIYGHEIICKGPEHKQDCSDQLSDEQRVERLRGVNK FT KVGELVREHLKKAHEDTKKRYDLRHKRYSPTFDVGQRVYKRSFRQSSAGDQ FT FNAKLGPQYMPCTIVRKVGTSSYEVSDQQGKSLGIFSAADLKA" XX SQ Sequence 7285 BP; 2259 A; 1350 C; 1719 G; 1939 T; 18 other; ctttggcgcc caacataaaa ttgacttacc atcagtttgt tgtaggctca gtttggaaat 60 tcgcacttga gatagctggt ttctaccgaa gtctctggga aaacatccag gccttcacag 120 gacagttgcc attttgggct tactcctttt atctgacaca gagcgggaaa ttatttcatt 180 ttgctaccga ggggttctga ttgtccattg atcagtaaca aagaacgatt gtttaaaaag 240 tgaatcggag tgcttttgaa ttgaaaggag cgaagtgtgg aatcgtgtgg tgtgattcgg 300 aaggagtaat tgaaaacacg agaaagaatc gtaatttatt tgcggattcg tgaactagga 360 ttttcaaatt taggcttttc aattttagga tttcaaattt agaattttca attttttttt 420 cagcgtgcaa aaccgcttct tacaaatcta ggtagttctt tttaaatttt taattttcta 480 attttgctgg aagtacgcaa acgtgagcga tggagttggc acgattacag tcgctgtatt 540 acgcgttgaa tgctgcccat ctggcggatg acgaactcga ccatgaactg cagatccgtg 600 acgtgaacat gattggtgaa agccgaagta agcaggagag atccttacgt agtttgttga 660 aggctgaaaa agagggaaaa gcgattagtt tcaaaccgta ttgggttaca gtagtagacg 720 aactgaaatg gtgtgacgaa aagcttttag gagtgaagaa aacgttagag gaaaggaaat 780 cgcgaaaagc acctgatcaa attttcaaaa ctaagcttct tcatattttc tttagaatgg 840 aacgactgaa gtcccatacc aaagaagaag aagacttaaa tagcatcgca atcattgcag 900 gagtttgtac cgcgttgtta aacacgtttt tctcaatcac ttctcccctg ttagaagtga 960 gagaagcgga aacggcgata ttgaatgaaa ctattcgaca gatgcgtgag gaagcactag 1020 agaatagagc acactctgat agagttgtag gagctagtgg aatcgatcac ggtaatgtag 1080 taggaatcgg tgcgggtgag tcaggagtcg atgatcatag gagagaaacg ggtgaaaacg 1140 aaaatgatga gcatgatgat aaccgagagc agagtagaga aaatgaggat gatgatggtg 1200 atgatgacgc ggagaatgta gtaggaaagc tttcggatga gaatgaacgt ttgaaggtag 1260 ccgttaatca gttacttcaa cgaatacagg cactcgaatc tgcgaatgcg acaaagcaga 1320 gagagattga gcgaattaag cagagtacgc cggtagacga tagatcgcga cagaatgtag 1380 atgagcaaga gcagggtcaa ggtcaacaaa atttttatga atggatgaaa gctagatgcg 1440 gttcgttaga gagtatggtg gacgtaaata agattatttc atcgtcgcgt actgaaaaac 1500 cgaacgttag acagagaaat gaggtcgatt tgccgaaagg tagtcgattg ccggtacaca 1560 aatggtcggt gaggtacgac ggaactgaca atggtaggag actgaatgaa tttcttaagg 1620 aagtagagtt caatgcaaga tcggagggtt tcacagaggc tgaattgttc gtgtctgcgc 1680 accacctctt tacgcgaaaa gcaagatcgt ggtttatgga agtcaatggg aacaatgagt 1740 taggtacgtg gcaaaatcta gtcagggagt tgaaaaatga attcttacct gcagatattg 1800 actaccagta tgaacggttg gctaattcgc gaaggcaggg tccaagagag aaatttcaag 1860 atttctactt ggacatggtg agaattttcc ggagcatgtc caggcaatgg gatgacgcca 1920 gaaaattcga cgtactgttc cgaaacacca gggcagattg taggacggca atgttggctg 1980 ctaatgtgac atccattcct aaaatgaggg aatttggtaa acgatttgat tccataaact 2040 ggcaggtata tgctaagagg gaaaacagat ttagtagacc aaattctcaa gtagaggaaa 2100 ttaatcagca aagacagaac tggagaggta cggaatccaa cgtcagaaat cagtccgatt 2160 ataaaggggg ttaccaggga cggaatttta atccgaacta taatccccct aataaacaat 2220 ttcctccggg aaagcccgga tttcagggaa gaaattcaga acaaaaattc ccacaaaaac 2280 ctaaaccaag accagagagc aatcaacaac aaaatcgtca gtccgaccca aaaccttcta 2340 cgtcaggtac aagcgcactt cagaggattg tgaatgccta cattccaatc aaacgtggtg 2400 tgtgctttaa ttgccatgat gaaggccaca attcaaaaaa ttgtgagcaa gaaaaacacg 2460 tcttttgtga gaactgtggt tttccaggct ttcaaacaaa taattgccct ttctgccaat 2520 caaaaaacgc caaaaagact gctcaatgag gcagagcagt cctattgatt ccgtaagtcc 2580 tcagattcac gtagaggatc cagaagctat tcttcatcag ttggggtatt cgaaagtggt 2640 tgaacaaccc aaaaagagag acgagatcgc aactttgctc gtcaaactta gtggagatgc 2700 aaggccattt acgaagatcg aactgatggg gattgagctg attggtcttc tggacagtgg 2760 agctgcacgg acggttttag gaatcggagc gagaaaaatc attcaaaaac ttaatctgtg 2820 tgtcaggcca gctgtagtta atttgacaac tgctgctgga gaagatttag aggtccttgg 2880 atgtgcggac attccgatca cattcaacgg gaacacaaaa attttacccg tgctaatcgc 2940 tccgaaactt aaccggagat gcgtgctagg gtatgatttc tggctaaaat ttggaattac 3000 gccagccata cggagtcaac ctatcgagct tttagatgat ggtgttagtg aggagctaga 3060 aaaggaagaa gagttaacgg atgagcagaa agagaaatta gagaatgtga aaaagctatt 3120 tttagtcgca gagccaggaa agctaggaat gactgattta atcgtccata aaatcgagat 3180 gaaagaggag tacaaggatg ccgacccggt taggaagaat ccttatccgt ggagtccgga 3240 aattcaacgg aaaatacatg gagcggttga caatatgwtt caggaaggkg taattgaaca 3300 gtctgattcg gactgggctt taccggtagt accagttgca aaacgggata gtcaggaagt 3360 taggctgtgc ttagacgcca gaaaactgaa tgaaaggacg aagagggatg catatccgtt 3420 accgcaccaa aaccgaatct tgagtcattt gggacaggta aaatacctaa cgacaattga 3480 cctatcacag gcgtttctcc agattccatt gagccctgag tcgcgaagat acactgcctt 3540 ttctatacca ggtagagggc tgttccaatt tacacgactt ccgtttggat tggtgaacag 3600 tcctgctacg ctgagcaagc taatggaccg tgttttaggc catggcgctt tagaaccgtc 3660 aattttcgtt tatttagacg atattgttat cgccagccaa acttttgacg agcatcttca 3720 aaagcttgaa gatcttgcaa ggcgtttgcg tgaagcaaat ttaagcatta atttaagcaa 3780 atccaagttc tgttgccaag aattgccgta tcttggatat attttatcgc aagatggatt 3840 acgacctaat ccagatcgtg tcaaagcgat cttagggtat caggttccga aatcggtgag 3900 acagttaagg cgtttcctcg gtatgataaa ttactaccga aggtttatcg cgaactatag 3960 tgagattaca gcacctctca ctgacttatt gaaaaataag ccaaagaggg tgccatggaa 4020 ctcagcagca gacacggcat ttgaggtcat caaggaaaag cttattgcag ctcctgtgat 4080 ggccaaccca aactttttgc ttccattcac ggtccaaaca gacgcaagcg ataacgcaat 4140 tgcaggtgta ttgacgcagg tgcaaaatgg ggaagagaaa gttatcgcct accattcgga 4200 gaaactgaaa ggtgcagaac tacactacca cgcggcagag aaggaaggac ttgctgcgct 4260 tcgttgcatc gaaaagttca ggggctatat tgagggaaca aagttcacct tggtgactga 4320 ctcatcagcc ctaactttta taatgcgagc aaagtggcgg tcctcttctc gactcagtcg 4380 ctggagtata gagctgcaac aatacgatat ggaaattcgg catcgtaagg gcaaggaaaa 4440 cattgttccc gatgcactct cccgttccat cgaggctctc gatgaagaac ctgaggacgg 4500 gtggtacaag aagttgtacg cagctgtaga ggatgaccct gagaagtatt tggacttcaa 4560 aatcaaggac ggtgttcttt ataagttcgt agctgccaaa tcagatatgt tggactatca 4620 gttcgagtgg aaacagtgtg ttcctacttc gaaccgtagt caaatactga agcaggaaca 4680 cgacgaaaac ctccacattg gattctccaa atgtgttgag aaaattaaac aacgctttta 4740 ttggcctcgc atgagcgcag atattaaaaa gtacatcggt tcatgtgagt cctgtaaaga 4800 gaacaaacac tctaccgtgt caaccgcacc tgaaatggga aagcagagag tcgcaaatcg 4860 cccattccaa attatttgca tggattatat tcaatctctt cccagaagca agcagggcaa 4920 cgctcactta ttagtggtca tggacctgtt ctctaagtat tgcctgcttg ctcctgtaaa 4980 gaagatatcg actgtatccc tatgcaaaat tttggaagag cagtggttca ggcgtctctc 5040 cgtaccacaa attgtaatct ccgacaatgc gaccactttt atgtcgagag attttcaaaa 5100 tctgctcgaa aagtacgaag tacaacattg ggcaaatgcg cgccacagaa gccaagccaa 5160 tccaactgaa cgtctcaatc gaaccattaa ctcaatgatt cgatcgtacg tgaagcagga 5220 tcagaagctt tgggactcga aaatatckga gattgaattc gtgcttaaca acacgattca 5280 ttcaacgacc aaattcagcc cacactgcgt gatttatggt cacgagatca tttgcaaagg 5340 tccagagcac aagcaggact gcagtgatca actgagtgac gaacaacgtg tggagagatt 5400 gagaggagtt aacaagaagg tcggagagct agtgagagaa catctcaaaa aagctcacga 5460 agacacgaaa aagcggtacg atcttcgcca taaaaggtac tcacctacgt ttgatgtggg 5520 tcaacgcgtt tataaacgaa gcttccgtca gtcgtccgct ggggatcagt ttaacgcgaa 5580 actgggtccg cagtacatgc cttgtaccat cgtcagaaag gtcggcacga gctcttacga 5640 ggtgtccgac caacagggaa agtcactcgg catattttct gctgctgact taaaagcatg 5700 aagaacctaa aaacataaat tttcatcaga gctagcataa aaattttaac gtcttattag 5760 tcggtgttag atagtgagtt aggtagtatt tttgttcaaa gttcgtctca aagtacttat 5820 cgtctcctac taagtcatac cgatgaattt tcccgggaaa ccattgcgtg ggtgtattga 5880 atgagttcgt ccagcaaatc tgtagatttt gtatatagat acggccgtat agtcaagaaa 5940 tttgcatgtt tacctagaat tttaaaagta gttgttgtta tcaattggtc aagttttatt 6000 ttgtttcttt taattcgtta tttgtatgag taattatgag tcgaagtcga aaaagcataa 6060 atgcatcgta tgtgttttat gtatgtccma cttgtcttgt acgcaaagtt gcaaacaagt 6120 ttttaacttc gtcatagtca agtagtgaac aagtcgttga tgtgtaggag taattggatg 6180 tcgtgtgtag tcattagggt caagttcaag gtcaagatca tctgattgag aatccacamt 6240 tattcatatc aaaaatttac attaattcat ctcacactat aaagcattcg tttcacccat 6300 acttacaccc tgagaagaac taagaatcaa cgttaaatga gcttttagaa aggaatcccc 6360 agatggaaaa atacgaaaat tagcacaaaa attgatcaaa tggccatwta agaatccgcg 6420 gatttaggaa aaagatattt aaagttttcg ttgagctcac tttccagcaa ttcctttata 6480 aaaccacatt atcaccgaaa cccacatgtg atcaccaaat ctgcagcatc actttgacag 6540 acatttgaca gattttccga tcttcaccga agccacacac accgataagg cactgatcaa 6600 accgttatcg ttacacgaag tagtagaacg ttcgcgactc cgacaaatcg cgtgaaatgt 6660 ttcatgcacc attcgcatga ccttactatt gtgtatgtat gctccagttt tttttttttt 6720 attcaatttt attttgtatt tttaattcgt tttgcatgtt ttaaattata tatagtaaat 6780 ttcttaaatg tctttttaaa caatcgtgtg gaataaaatg cttawtgtwt mgaggttacc 6840 wggcctagtg aaaccctgwg tatttgatcg ctggagaccg gagtcagwag acgttgatgg 6900 tcagtacwga cttaatgtca ctaaatagcg aggagttggt twagtccgac gatgaaaacc 6960 tatgaccccg atactcgatw acmatamtwg ccaatttcac atctttttac tgtcatgtgg 7020 gtctgtctta atctagtacg ttgcaggtga agcgtttcgc gagctaccgt cagtgagcgt 7080 gattttgtca ctgcgtggtg tgctaattga aatgtgaagt gcaggttcag agttaagtgt 7140 gaccggtcgt gtgctagtgt gaattgtaga caaaaaatat ccacgttgtg aatatttttt 7200 tacccgagct gggggagagt gtaagtgttc ggtttgttga ttgatagtgt tgaggagatt 7260 ttgtagtttt gttggtccgt aaaaa 7285 // ID Gypsy-2_CQ-I repbase; DNA; INV; 2263 BP. XX AC AAWU01034370; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_CQ_; KW Gypsy-2_CQ-LTR; Gypsy-2_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2263 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 383-383 (2011). XX DR Genome; AAWU01034370; Positions 3738 1476. XX CC 'CTTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 116..2113 FT /product="Gypsy-2_CQ-I_1p" FT /translation="MSDSLARLPYLGQLHPRQEQFQQTSLRQGPLSHQRLL FT PNPDLMIQMLTMFEQLVGTVQQTAQQQMEFMERFSTDAVQPPPNPAQNVDT FT LPGNTKAFQTDSVRSHAVVINEHSVQQWRRCASVNFGGTPIRLPLDAASGG FT PMLSSPSVRVKTASETGLSLDDEFRCDITTAGSTRNEPTIVTELHVSSFPD FT SSPVPESVLHKVFNEELGKEVNLELKENRRPTVCPKRPVAYAVVENVDHEL FT DKQQQQQLNIITPIVYSELAAPIVGVRKAIGSKTVLNTALQPRLYPLLAEN FT AHGAYQQSIATMLARNPGASCHLEDVDVGGATQKEHVRNHFQDYALETRHD FT NAPQLRIVRFKADRKHFAPSFELYLLLYDFASQYVSTDQFGHADVQRLINQ FT HVRPEEDFAIARLRLEDISTQPDPLLRKIRRYVLEGWPTTKAADPKIERYH FT TRADFDVGKACQMCTSVARSPPHSLPVPWPKPAGPWQRSCMDFAEPLNDDY FT VVCTHRISVLATIAILHQMFPQLATVPYQPHGRAKRVVGILKRAVKKISEG FT RGTIARTLNIFLSAYRCAPEVLDPEPHQKEFHRRGIAMRDGNQRDTRSHIN FT QLRSCFKAGPAASPSVKPKALPLDVLLGEWKLASVPELPPLSPMQLTPRPP FT SSLPAIPAEPNTRATYT" XX SQ Sequence 2263 BP; 511 A; 762 C; 581 G; 409 T; 0 other; gtggcgacga gaatctgtcg aattcgtcgg agttgtcgcg gtttacgaag gtgggcgcca 60 cctaacctgc aattccgaca cccacgtgca gctaaaggag gagcccccta gcggcatgag 120 tgattcactt gcccggctcc cctacctggg gcagcttcac ccacgtcagg agcagttcca 180 gcaaacatca ctccgccaag gcccactctc tcatcagcgc ctgctgccga atcctgacct 240 gatgatccag atgctgacga tgttcgagca gttggtcggc acggtgcagc agactgccca 300 gcagcaaatg gagttcatgg agcggttctc aacagacgca gtccagccgc ctccaaaccc 360 ggcgcagaac gtcgacacgc tccccgggaa cacgaaggct ttccagacgg actcggtccg 420 cagccacgcg gtggtgatca acgagcactc ggtgcagcaa tggcggagat gcgcgtcggt 480 caatttcggt gggacgccga tccgacttcc actagacgcc gcatccggtg gcccaatgtt 540 atcatcaccc tctgttcgcg ttaaaacggc ttccgaaacg ggattgtcac tcgacgacga 600 gttccgatgt gacatcacca ccgccggaag cacccgcaac gagccgacca tcgttaccga 660 gctccacgta tccagcttcc ccgactcgtc gccggtgccc gagtcagtcc tccacaaggt 720 gttcaacgag gaactcggca aagaggtgaa cttggagctg aaagaaaacc gccgaccgac 780 cgtctgcccg aaacgtccgg tggcctacgc ggtggttgag aacgtcgacc acgaactgga 840 caagcagcag cagcagcagc tcaacatcat cactccgatc gtctactcgg agttggccgc 900 cccgatcgtc ggggtgcgga aagccatcgg atccaaaacc gtcctcaaca ctgctctaca 960 gccgcgcctg tatccgctgc tggccgaaaa tgcacacggt gcctaccagc agagcatcgc 1020 caccatgctg gcccgaaatc ctggagcgtc ctgccacctc gaagacgtcg acgtcggtgg 1080 cgcaacccag aaggaacacg tccgcaacca ttttcaagac tacgccctcg agacgcgtca 1140 cgacaacgca ccgcagcttc gcatcgtccg gttcaaggca gaccggaaac attttgcacc 1200 aagcttcgaa ctgtacctgc tcctctacga cttcgcctcc cagtacgtct ccaccgacca 1260 gtttgggcac gccgacgtgc aacgtttgat caaccagcac gtacgaccag aggaggactt 1320 cgctatcgcc agacttcgcc tagaggacat cagcacccag cccgatccac tacttcgcaa 1380 gattcgccgg tacgtactcg agggttggcc aacaacgaaa gccgctgatc cgaagatcga 1440 gcgttaccac acccgtgccg acttcgacgt cggaaaagca tgccagatgt gcacctccgt 1500 agcacggtcc ccaccgcact cacttcctgt gccatggccg aaacctgctg gcccgtggca 1560 gcgcagctgc atggattttg cggaaccact caacgacgac tacgtagtct gcacgcatcg 1620 catctctgtc ctcgcaacca tcgccatcct ccaccagatg tttcctcaac tcgcgactgt 1680 cccgtaccag ccgcacggac gagcgaaacg ggtcgtaggc attctcaagc gtgccgtcaa 1740 gaaaatttcg gaggggagag gcactatcgc aagaactttg aacatctttc tgtcggcgta 1800 ccgctgcgca cctgaagtgc ttgaccccga accgcaccag aaggaattcc atcgtcgagg 1860 catcgccatg cgagatggca accaacgcga cacacgctcg cacatcaacc aactgcgaag 1920 ttgcttcaag gccggacctg ctgcttcacc atcagtcaaa cccaaagcgc tcccgttgga 1980 tgtcctgctc ggtgagtgga agcttgcgtc ggtacccgag ctgccaccat tgagcccgat 2040 gcagttgacg ccgcgtccgc cctcttctct acctgcgatt ccagctgaac caaacacgcg 2100 cgcaacttac acctgatctg gatcgctgct tctacaagaa cccggtgcaa caccatctcc 2160 aatccctgcg tcgtcgtcct caacgagttc gtcgcagctg gcactcttct ccggcgcacc 2220 gacagcgctt gcaagagccg accagctgtt ttaagagggg aga 2263 // ID BEL-13_AA-I repbase; DNA; INV; 5782 BP. XX AC AAGE02023429; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-13_AA_; KW BEL-13_AA-LTR; BEL-13_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5782 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023429; Positions 7225 1444. XX CC Positions [4739-5347] - Integrase core CC 'AAGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1307..5698 FT /product="BEL-13_AA-I_1p" FT /translation="MEAQPTYTMMVEFLQKRARVLENISINRPQHSSSKPT FT SQIPAQKKHNLPRLSSNAATEAPTKSFPTCPACEKQKHSLFDCSVFNGLDA FT KGRMKVVTDKKLCSNCFRSDHFARNCRSKYNCKHCSRRHHSMIHPGPSEMD FT KPTPSAEIGPSSVVTAVAATPIPEVVSTAKSPNTSVLLTTVVLIVVDVYGQ FT EHIARALLDTGSQPNAISERLCQLLHLPRKIVNVPIAGVDSTLTNAKHEVL FT AEIRSRVADFSESLEFLVLRKVTSDTPAASFSTTRWSIPENFPLADPDFNT FT SRRVDMIIGAAHFYSFLRDGRFRLPEQGPLLVESVFGWIVAGKFEISGESV FT RQPAVTCHVATVTSLSEQLERFWRIEELQCQNYSVDEQHCENFYRETVSRD FT PTGRYVVRMPKHPEHDQMVGASKPSAIRRLKWLEQRLLKDANVRTQYHDFL FT REYATLGHMYPVQEDEDDSLKACYLPHHPIIKESSTTTKVRVVFDGFAKTS FT SGYSLNDSLLVGPVVQDELLNLVIRFRTFPIALVADIEKMYRQVSMNPADR FT PLQRILWRFDTSQPIQTYELSTVTYGLAPSSFLATRTLLQLVDDEGTSFPK FT ASTAIKKNVYVDDLISGDKSIEETIQLREELTSLLQKGGFRFRKWCSNSLP FT VLAGLPPDLLGTQSLLKFDPEESIKTLGILWEPEADVFRFDVSVIVKEGPP FT TKRTILSAIAQLYDPLGIISPVVVQAKILMQHLWLIALDWDDIVTPDLQRK FT WAEFCKQLPGLANFRIERFAFAPGFHVAELHCFADSSVVAYGACIYVRSQA FT ADGHVQVSLLASKSKVAPLKPLSIPRLELCAALLAARLFEKIVGSLEMKFS FT ESFFWSDSTIVLQWMKSPPRTWKTFVANRIGEIQTATAGSHWQHVSGKENP FT ADMISRGVSVEELMISELWKYGPTWLREDKSTWPSQHIPESKFTVEELELK FT KNVILATQTLSPDSLFTRFSSYRTLLNVTGFILRFCHNSRSKEQRNSSRVL FT SVSELQKAKFALVKLVQAEAFPDDLRRLKKGFTVSNKSSLSLLSPFLDTEE FT LIRVGGRLQLADASYDVKHQIVIPGFHPFTRLLLQHYHRKYVHGGIAMTLS FT VVRDEFWPLNGRKAVRSSIRNCYECSKTNPQPIQQPIGQLPIARVTANEAF FT LCTGVDYCGPVFLKPVHRKAAARKCYICVFICLSTKAVHLELVSDLSTAAF FT LMALDRFMWRRNKPHHLYSDNGTNFIGAKNALHKVYQMLQPGPDNERISKR FT LSEDGIQWHLIPPRAPNFGGLWEAAVKVAKTHLVRQLGSSLLSFEELTTVL FT IKIEGCMNSRPLVPLSTDPNDLGALTPAHFLVRNMIRPLPEADVRNTPLNR FT LTQYEKLQKYSQNFWYRWRNEYLKQLNQQYCSNPKRYQINVGDIIILKDES FT LAPARWPLARVIDTHPGPDGVTRVVTLRTPSGTLKRPVSRICPLECATES" XX SQ Sequence 5782 BP; 1424 A; 1568 C; 1346 G; 1444 T; 0 other; tttggtgccg tgaccaggat tcctggtttt cccgtccccg acacacacgt gctgtgtata 60 acctcaacat ccctgctgtt agagctattg atcgcgatca tagggtgatt gccgccgatt 120 tgatccatac aaagcctcct gagattggag gtaccaggtg agtacctttc caatcaattc 180 actgccgttg ttgcggtact gctgggtctt ttttcgaccg ttttagttcg tcctcgggat 240 caacccgcat cttcattcga agtacgtgcc tacctgcacc cgaaattcaa ttccgttctc 300 atccgaagct ccatcaaacg attgccgctg atcatttata ttgcatttca tacaaggcct 360 cctgaaattg gaggtaccag gcgagtacca ttccaattac gctaccctcc gttttgcggt 420 acttcctgtg tcctttctgc ccgttttagc actgcagttt tcggagtgag ctgctaccat 480 ggcggatgat aagcgacgca aggcgaaaga gttgaagctg aagcacgcca tcgagtcgct 540 caatcgcctg gacaagtttc tcgaaggcta cgttcccgat cagcaccaac atgaaattgt 600 tcatcgcatg gaccgtctag aaaaggtttg gagcgcctac gaaacgattc aagaagagta 660 cgaggaaatg gacgacgccg aggaatttat tcagaagaat ctggaattac gtggaagggt 720 cgaagaggtt tacttccgtg taaaagccgg attagtttcg aagatgccgg ttcctattgc 780 ctccgttgct tctgcctctg cggtacccac tgttgtgcct gctggtccat cgactctcgc 840 caatgtgaag ctgccgacga tttccctgcc agaattcgac ggagacttca ataactggct 900 cacgttccac gatacgttcg tgtcgatgat ccattcgtcg acgtagatct cccacgtcca 960 gaaatttcac tatcttcgag cggccttgaa gggtgaggct gccaccctga tccagtcgat 1020 cacgattacg gcgcaaaatt actccgtcgc ttggactact cttgtcaatc gctactctaa 1080 caaggctatt ctccggaaga agcacattag agccttgctg aagcatccga agattgcgaa 1140 caacaatgtt gacgccctgc atcggattgt ggatgagttc caacgccata ccaaggtcct 1200 cgaacaacta ggcgagcccg tcgaccagtt tagctccatc ttaatcgaat tgctggagga 1260 caagttggac gacgcttctc tcactgcttg ggaggaattc atcgcgatgg aagctcagcc 1320 aacttacacc atgatggtcg aatttctgca gaagcgagcc cgtgttttgg aaaatatttc 1380 gatcaaccgt cctcaacatt catcttccaa gccaaccagc caaatccccg cccagaagaa 1440 gcacaacttg ccgcgattga gctcgaatgc cgccacggaa gccccaacga agtccttccc 1500 aacgtgtcct gcttgcgaga agcagaaaca ctcgcttttc gattgctccg tgttcaatgg 1560 actggacgcg aaagggcgta tgaaggtggt aacagacaag aagctctgca gtaactgttt 1620 ccgcagtgac cactttgctc gcaattgtcg ttccaagtac aactgcaagc actgctcgag 1680 acgacatcat tcgatgatcc acccaggacc gtctgaaatg gataagccca cccccagcgc 1740 tgaaattgga ccttccagtg tagtaaccgc agttgctgct accccgatac cggaagttgt 1800 ttcaactgcc aagtcgccca acaccagcgt gctcttgaca acagttgtgc tcatcgtcgt 1860 cgacgtttac ggccaagaac acatcgcccg tgccttgctg gacacgggat cccagccaaa 1920 cgcaatcagc gagcgattgt gccagttact ccatcttcct cgcaaaattg tcaatgtccc 1980 gatagcaggc gtcgatagca cacttacgaa cgcaaagcat gaagtcttag ctgaaatacg 2040 ttcccgagtg gcggacttta gtgaatcctt ggagtttctg gtactgcgca aagtaaccag 2100 tgacacaccc gcagcgtcgt tctccaccac ccgatggagc attcccgaaa acttccctct 2160 cgccgatccc gacttcaaca cttctagaag agttgacatg attatcggag ctgctcactt 2220 ttattcgttc ctcagagatg gacgattccg tttgcccgaa caaggtccgt tgcttgttga 2280 aagcgtattc ggctggattg tagccggaaa gttcgaaata tcaggagaat cagtcaggca 2340 gcccgccgta acttgtcatg tggcgacagt tacatcgtta tctgagcagc tagagcgatt 2400 ttggcgcatc gaggaacttc aatgccagaa ttactccgta gatgagcagc attgcgagaa 2460 cttttatcgc gaaacagttt ctcgtgaccc gaccggtcga tacgtcgttc ggatgcccaa 2520 gcaccctgag cacgatcaaa tggtaggagc atcgaaaccg tctgctattc gcaggctgaa 2580 gtggctggag cagagattat tgaaagatgc caacgtgagg acccaatatc acgattttct 2640 cagggagtat gccacactgg gccacatgta tcctgtccag gaagatgagg atgacagtct 2700 gaaagcctgt tatcttccac accaccctat catcaaagag tcgagcacca cgacgaaggt 2760 gagagtagtg ttcgacgggt ttgcaaagac cagctctggt tactcgctta acgattccct 2820 acttgttgga ccagtggtgc aggacgaact cctcaatctc gttatccgtt ttcgaacatt 2880 cccgatagcg ctggtggcgg atattgagaa gatgtaccgc caagtctcga tgaatccagc 2940 tgatcgccca cttcagagaa tactgtggcg ttttgatacg tcacaaccca tccagaccta 3000 tgagttgagc acggtaacct atggccttgc cccctcttca tttctcgcca cccgcacgct 3060 cctacagctc gtagatgacg aaggcacctc attcccgaaa gcgagcacgg ccataaagaa 3120 gaacgtgtat gtagacgatc tcatttctgg ggacaagagt atcgaggaaa ccatccaact 3180 tcgtgaggaa ttgaccagcc ttctacagaa gggtggattt cgtttccgca agtggtgttc 3240 aaattctctt ccagtactcg ctggtcttcc tcccgattta cttgggacgc aatcattact 3300 gaagttcgat cccgaagaga gcatcaaaac cctcggaata ctatgggaac ccgaggctga 3360 tgtctttcgt ttcgacgttt ccgtcatcgt gaaagaagga cctccgacca aacgcaccat 3420 tctctccgcc atagctcagc tttacgaccc tcttggaata atatctcccg tcgtcgtaca 3480 ggcgaaaatt ctaatgcagc acctctggtt aattgctttg gattgggacg atatagtcac 3540 acccgatctt cagcgcaagt gggccgaatt ctgcaaacaa ctgcccggcc ttgccaactt 3600 tcgcatcgaa cgattcgcat ttgccccagg tttccacgtc gcggagttgc attgctttgc 3660 cgattcttct gtagtcgcat acggcgcatg catatacgta cgctcccagg ccgccgatgg 3720 acacgttcaa gtgagtctcc tggcctcgaa atccaaagtg gcccctctga aaccacttag 3780 tatcccacgt cttgagctgt gcgcggcact ccttgccgct cgcttatttg aaaagatcgt 3840 tggctccctt gaaatgaaat tctccgaaag ttttttctgg tctgactcga caatcgtcct 3900 tcagtggatg aaatccccgc cgcgaacttg gaaaacgttt gtagcgaata ggatcggtga 3960 aatccaaact gcgaccgctg gttcgcactg gcagcatgtt tctgggaaag aaaaccctgc 4020 agacatgatt tctcgtggag tttccgtcga agaactaatg atcagtgaac tgtggaagta 4080 cggccccaca tggttgcgtg aggataagtc aacgtggccg tcgcaacaca tccccgaaag 4140 taaattcaca gtagaagagc tggaactcaa gaaaaatgtg attctagcca cccaaactct 4200 tagtcctgat tcactgttca cgaggttctc atcgtacagg actcttctga acgtaactgg 4260 attcattctc cgtttttgtc acaattctcg tagtaaggag cagcgaaatt ctagccgagt 4320 gctctctgtg tcggaactcc aaaaggcaaa atttgcgttg gtgaaactcg tccaagctga 4380 agcgtttcct gacgatcttc gtcgtttgaa gaaaggattc actgtatcaa ataaatcgtc 4440 tctcagcttg ttaagcccat tcttggacac tgaagaactg atccgtgttg gtggccggtt 4500 gcaattagcg gacgcttcct atgacgtcaa gcaccagatc gtgattcccg gatttcatcc 4560 cttcacccgg cttcttctcc agcattatca tcgcaaatac gtccatggtg gaatcgcgat 4620 gactctttcg gttgtccgcg atgagttctg gccattgaac ggccggaaag ctgtccgaag 4680 ttctatacgg aattgctacg agtgcagcaa aacgaaccct caaccaattc agcagccaat 4740 cggccaactg ccgatcgcca gagttacagc aaatgaagcc tttctctgta ccggagttga 4800 ttactgcggt cctgtctttc tcaagcccgt tcatcgcaaa gctgctgcac gaaagtgcta 4860 catctgtgtc ttcatctgcc tgagtacgaa ggcagttcac ctcgagctcg tcagtgattt 4920 gagcaccgct gcttttctga tggctctcga ccggttcatg tggagaagga acaagcctca 4980 tcatctgtac tcagataacg gtactaattt catcggggcg aagaatgccc tgcacaaggt 5040 gtaccagatg cttcagcctg gaccggataa cgaacgaatc agcaagcgtc tctccgaaga 5100 tggcattcag tggcacctga tccccccacg tgcccccaac tttggtggcc tatgggaggc 5160 tgctgtcaag gttgccaaga cgcacctcgt tcgtcagctc ggatcatctc tgctctcctt 5220 cgaagaattg acgacagtac tcatcaaaat cgaaggttgc atgaactctc gtcctttggt 5280 gccgctctca actgacccga atgatttggg ggctcttact cctgcacatt tcctcgtgcg 5340 gaatatgatc cgccctcttc ctgaagccga cgtgcggaac acacccctga atcgcctgac 5400 ccagtacgaa aagctccaga aatattccca gaacttctgg tacagatggc gcaacgaata 5460 cttgaaacaa ttgaaccaac agtactgctc caaccctaag cgttaccaaa tcaacgtcgg 5520 agacatcatt atcctcaagg atgagtccct tgcaccagca cgctggcctc tggcccgcgt 5580 tattgatacg catcctggcc ctgacggtgt aacccgagtt gtcacgcttc gtacaccctc 5640 tggaaccctg aagagacctg tgtcaaggat ctgcccattg gaatgtgcaa cagaatcata 5700 agctaagtat ttgtagttta aacataaagg tataattttt ccatactttg tttgaaaatt 5760 gcttttcaaa ggtggccggt aa 5782 // ID Gypsy-124_AA-I repbase; DNA; INV; 5652 BP. XX AC AAGE02024975; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-124_AA_; KW Gypsy-124_AA-LTR; Gypsy-124_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5652 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024975; Positions 16833 11182. XX CC Positions [4589-5065] - Integrase core CC 'GAACG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 828..2096 FT /product="Gypsy-124_AA-I_2p" FT /translation="MSWESDFENDDEYDTMSRPPTPGSNILTDDDMEAGRY FT VRMPLNHRMDWFDENGPSSAGGDAADGRSEACVPAQCSEELVAATPSNGSI FT GAEKAEANDRLASLEKRIVELSRLLAEKQSSEKPTTADQSWAAPSREAEAA FT GPSSSIRMDHIPPFPKDVPSNGMWEAFNRYLEKFEIALAIYNISDPVKRAQ FT CLYLAIGDDLQSIVRGAGLRPSLQDPDCYKKMVANIAEYFRAMTDPTAELE FT AFTKLKQERGEPTVTFHARLVSKAQQCGRSPKEQEHSVRIQLLRGMANQEL FT ARAARTFDYNTSFVVQSAARSEAFEAEAKPGASSDQIFAVKEFPPRVRKRN FT HHGQQNGVADKRRRFAAPLHKRDRCSRCDRAQHKGKLCPALDRDCRQCGRR FT GHFAATCRMKKIERVEKVEPLPFSEDEKV" FT CDS 2526..4343 FT /product="Gypsy-124_AA-I_3p" FT /translation="MNNPSKTTNLRAYAAKNPMYVECSFSSTINALGTGNP FT SITAQFLVVPSGHRSLLGRATSGDMRLLKVGASINSCGEVIAVDVFPKMPG FT VLVKFSINKSVAPVRNAYYNVPAAYREGARQRLREMEQRGIIERVTTAPEW FT ISGMSAVPKGKNDFRLVVNMRAANKAINREYFRLPLLHEMKTKLHGARYFT FT KLDLTSAYYHLELAKDSRDVTTFLTEEGMFRFTRLMFGVNCAPEIFQREMT FT RVLEGIPNIIIYIDDILVFAGTIEELHATVNKILELLRTNNLTLNESKCEY FT DKERVNFLGHELDKEGFHIDSVKTKHIEKFRPPETPSELRSFLGLAAYVSP FT YIQNFSDLTNPLWTVSTTRTWSWGPQQSAAFEATKKAIADCTIALGYFSEE FT DKTVLYTDASPVALGAVLVQENTKSIPRIISFASKALTPTEQKYAQNQREA FT LGAVWAVEYFSYYLLGRDFTLRTDAKGVAFILNRSRETSKRVLTRADGWAL FT RLSPYSYKVEYVEGKFNIADPSSRLYDGKDEPFDESTSPWEIAKIEANAIS FT LLTDEEVRSATREDDVLSQVTEALESGEWPKNLHRYKQISEDLCVDDGIIT FT KNGMNLQK" FT CDS 4505..5524 FT /product="Gypsy-124_AA-I_1p" FT /translation="MPKDAEEWVKSCEICITNGMPEKPTPMERIMAPKTVW FT ETIALDFNGPYAKFGGIYILVLVDYRSRFIIASPVKSTNFEETRKFLCNVF FT EREGLPKNVKCDNGPPFNSDDFRNFCIERGIQNIYSTPHFPQQNGLVESYM FT KLINKAMASAVSSGSRYEDELSSAVKAHNAAAHSITKVAPEEMMYGRKFRR FT GLPLMNRGTVALNDDEITKRDREAKLKGKSYEDRRRGARKCQIIPGDVVIV FT ERNVRSKGDSRFDPKRYTVVQQKNGNLLLSSADGQQLRRHVTQTKRVKNDA FT PRVRQPPVEQVNKHTEEHQRPVREKKIPSYLKQYVRVVEEEKLYLNQR" XX SQ Sequence 5652 BP; 1732 A; 1170 C; 1375 G; 1375 T; 0 other; atggcgatct ctgccaggta taatatcaat taggatatta tttaaattcg aatagagtat 60 tgcgcaagcg agaatcaccc aactggttgc aaaaaaacag caattaaatg tgtgaccata 120 ttctgtgcga aaagtgaaat tctgaattgt gaaaagcaaa ttcgaaatct acgtgatcca 180 gtgaccgcca tttttttttt tctcgtttgt gacgattgtg aaaattcgac gagcaaggaa 240 gggattcctt attccgagac ttatttacgg gacgtaaatg acagaaaagt gacctgcccg 300 tgatcaagtg ggacgcttga cgggaataac tagtctggtt attttaattt ttgcctagtt 360 tgtgggacgc aaattagaag gaaagcgtag tggagttttc gggacgaaaa catacgtgaa 420 agtaccacaa gcgggacgct tgatgagaaa tgcaagtggg acgcttgatg agattttttt 480 attgatttgt gggacgcaaa taataaaaaa aaaatgcgtg atagcctttc gggacgaaag 540 ctagcacaca attatgattt acgtcattga gcgggacgct tggtgaggga attccgcatg 600 gtccgttgat tccttggttt gtgagacgca aatcggaata cggaaggcgt tgtagagttt 660 tcaggaaaac taacgtggaa ggaccacaag cgggatgctt gatgatgaaa tatgctcgaa 720 atagtttgca cacttacgta cagatagtac tgtatggaga gcgtaaaggt agtaaccgat 780 tgtacttgtt tcatccgtag gaaacagata acgagaactt ttcgagaatg tcgtgggaaa 840 gcgattttga gaacgatgat gaatacgaca ccatgagccg cccaccgact cctggatcga 900 acattctaac cgatgatgat atggaagcag gacgatacgt aaggatgcca ctaaaccata 960 gaatggattg gttcgacgaa aacggaccgt cgtcggcagg aggagacgca gccgatggac 1020 gttcggaagc gtgcgtgcct gcacaatgtt cagaggaact agttgccgca acgcctagca 1080 atggttcaat tggggctgaa aaagcggaag ccaacgacag attagcctca ttggagaaga 1140 ggattgttga attgtcgcgc cttttggccg aaaaacaatc aagcgaaaag cccactacag 1200 cagatcaatc ttgggcggca ccatctcgag aggcagaggc tgctggacca tcatccagta 1260 ttcgtatgga tcatatcccg ccgtttccga aggatgtgcc atccaatggt atgtgggaag 1320 ccttcaacag gtatctagag aaatttgaaa tagccctggc catatacaac attagtgacc 1380 cagtcaaacg agctcagtgt ctgtacctcg ctatcggaga cgacctccaa agtatagttc 1440 ggggtgcagg cttaagacca agccttcaag atcctgattg ctacaaaaag atggttgcaa 1500 atattgccga gtattttcgg gctatgaccg atccaacggc tgagctagag gcttttacta 1560 aactgaagca ggaacgaggc gagcctacag tcacatttca cgctcggctg gtatcaaaag 1620 cgcaacagtg tggtcgcagc ccaaaagaac aggagcactc tgtcagaatc caacttttga 1680 gaggtatggc gaaccaagaa ctcgcccgtg ccgcaagaac attcgactac aatacgtctt 1740 tcgtcgttca atcggccgcc cgaagtgaag cttttgaagc tgaagctaag ccaggagcat 1800 ccagtgacca gatctttgcc gtcaaggaat tcccaccaag ggtccgtaag cgaaaccatc 1860 atggacaaca gaatggtgtt gctgacaaga gaagacgatt tgctgcgccg ctgcataaaa 1920 gagatcgttg ctcccggtgc gatcgtgcac agcataaagg aaagttatgt cctgcccttg 1980 accgagattg tcgtcaatgc ggtcgtcgtg gccattttgc ggcaacctgt cgtatgaaga 2040 aaattgaacg ggttgagaag gtggaacctc tgccgttcag cgaggatgag aaggtatgat 2100 tctttgatct ttaagatttt actgtattgt tttctttctc caaattattg aagaagtttt 2160 gataaacgaa taaagaagtt gaattagttg agaataacaa acataaataa acaaaaaata 2220 tccaaaaata acgccgataa attgtttttg cttacaactc ctccaatcca ttttcgatct 2280 tggttttcga tttaagttca cttctggaaa tttcgcagat tttgattggc ccatttattt 2340 ctattcgatg ttatttatag ctcatcaata ccctatcgct aaaggatgtt ctggtattgt 2400 gcagtattgg aaaatctgaa ccaattgagt tcctaattga ctcgggagcc gatgccaaca 2460 tcataggggg taatgattgg aataatctac aaaagcaggt ccagaaagga ctagtggata 2520 taacaatgaa caaccctagc aaaactacca atctgcgagc ttacgcggcg aagaacccaa 2580 tgtacgtcga atgctcattt tcttccacca taaacgcctt aggaacaggg aatccatcca 2640 taactgcgca attcctagtt gttcctagtg ggcatcgctc attattaggc agagcaactt 2700 ccggtgatat gaggttgttg aaagtaggag catctattaa cagctgcgga gaagttatag 2760 ctgttgatgt atttcccaaa atgcccggcg tcctagttaa attcagcatc aataaatctg 2820 tcgcgccggt ccgcaacgca tactacaatg tccctgccgc ataccgagaa ggggcaagac 2880 agcgattgcg cgagatggaa cagaggggca taatagaaag agttacaacc gctcctgagt 2940 ggatcagtgg gatgtcggcg gtccctaagg gaaaaaatga ctttaggctg gtggtaaaca 3000 tgcgcgccgc caacaaggca ataaaccgtg agtattttag actaccgctg cttcacgaaa 3060 tgaagacaaa actccacggt gcacggtatt tcacaaaatt ggacctcaca agcgcctatt 3120 atcatttgga gcttgcgaaa gattcccgtg atgtaactac attcctcact gaggagggta 3180 tgtttcgttt cactcgcctc atgttcgggg taaattgtgc tccggagatt ttccagcggg 3240 aaatgactcg ggtgctagag ggcataccga acatcattat ctatatagac gacatcctgg 3300 tgtttgctgg aacaatcgag gaactccacg caacagtcaa caagattctt gagttattgc 3360 ggactaacaa cctcacgttg aacgagagta aatgtgagta cgacaaggaa cgtgttaact 3420 tccttggaca cgaattggac aaagaagggt tccatatcga ttcggtgaag accaaacaca 3480 tcgaaaaatt tcgacctcca gaaacaccat ccgaactgag aagctttctt ggacttgcgg 3540 cgtacgtaag cccatacatt caaaattttt cagacttgac gaaccccctc tggacagtat 3600 caaccacgag aacatggtcg tggggtcctc aacagtcagc agcatttgaa gccacaaaga 3660 aagccattgc tgactgtaca atagcattag ggtatttctc ggaagaggac aaaacagtgc 3720 tatatactga tgcttctcct gttgcactcg gcgccgttct agtacaagaa aacaccaaaa 3780 gcattcctcg tatcataagc tttgcgtcta aagcgttgac gcctaccgaa caaaaatatg 3840 cccaaaatca gcgggaggca cttggcgcag tttgggcagt tgagtacttt tcgtactact 3900 tacttggtcg agattttacg ttgcgtacag atgcgaaggg agtcgcgttc atattgaaca 3960 ggtctcggga aacctcaaag cgagtattga cgagagctga cggttgggct ctacggctaa 4020 gtccttacag ctacaaagtg gagtacgtgg agggaaaatt caatatagca gatccttcct 4080 cccgcctgta tgatggcaag gacgagccgt ttgatgagtc caccagtcct tgggagattg 4140 ccaaaatcga agcaaatgca atcagtctac taacggatga agaagttaga tctgcaacgc 4200 gtgaagatga tgtgctgtcc caggttacag aagccctcga gtctggagaa tggcccaaaa 4260 atctgcacag atataaacag atatctgaag acctttgcgt tgatgatggt atcattacaa 4320 agaacggtat gaatttacaa aaataaaaaa aaaacagttc catacagctg tattcgtttt 4380 ctttccaggt tgtgcaataa ttccagaatc gctgcgcaag aaaacactcg atgttgctca 4440 cgcaggacac ccatctgtag cgaagttaaa gagcatctta cgagagcgcg tttggtggcc 4500 aggcatgccg aaggatgcag aggagtgggt gaaatcatgc gagatctgca tcaccaatgg 4560 tatgccggag aagcccactc ctatggaacg tattatggca ccgaagacgg tctgggagac 4620 aatagcgctc gattttaacg ggccttacgc gaagtttgga ggcatttata ttttggtgct 4680 ggtcgattat cgttcccgct tcataattgc aagtcctgtg aagtcgacta actttgaaga 4740 aacgaggaaa ttcctatgca atgtgttcga gagagagggt ttgccgaaaa acgtcaaatg 4800 tgacaatgga ccacctttca acagtgatga tttccgtaac ttttgcattg aacgtggaat 4860 ccaaaatatt tattccacac cgcacttccc acagcagaat ggccttgttg aaagctacat 4920 gaagctgatc aacaaggcca tggcttcggc agtatcctct gggagcaggt acgaagatga 4980 actttcaagt gccgtgaaag cgcacaatgc agccgcccac tctattacta aagtggcccc 5040 tgaggagatg atgtacggaa gaaaattcag aagaggattg ccgctgatga accgtggtac 5100 cgtagcctta aatgacgacg aaatcaccaa gagagatcga gaagcgaaac tgaaggggaa 5160 aagttacgaa gacagacgta gaggtgcccg caagtgccaa atcataccag gggatgtagt 5220 gatcgtagag cgaaacgtgc gttcaaaagg ggacagtaga tttgatccca agaggtatac 5280 ggtggttcaa cagaaaaatg gaaacctttt gctctcgagt gctgatgggc aacagcttag 5340 acgtcatgtc actcaaacga agcgagtgaa gaatgatgct ccgagagttc gacaacctcc 5400 agtggagcaa gtgaacaaac acactgaaga acaccaacgg ccagtgagag agaagaagat 5460 tccttcatat ttaaaacaat atgttcgtgt agttgaagag gaaaaactgt atttgaatca 5520 acggtaaaac aaataaataa attgaaacat gtactacaca ttgaactttt tgacaaattt 5580 cataagatga gaataataaa agctaacaaa aattcaaact aacagctctt tttttttaaa 5640 gctgaagtgg ga 5652 // ID Gypsy-62_CQ-LTR repbase; DNA; INV; 856 BP. XX AC AAWU01038191; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-62_CQ_; KW Gypsy-62_CQ-I; Gypsy-62_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-856 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 504-504 (2011). XX DR Genome; AAWU01038191; Positions 8126 8981. XX SQ Sequence 856 BP; 249 A; 168 C; 204 G; 235 T; 0 other; tgtagcgacc cacatctttt tttcacctta gatgaaatcc ccgacgcaat aatgtgcgat 60 tggactttga tccatgatga ggagatatca tccagtggtg aaattaaaga gaaccgtgaa 120 gtcgctcaaa tcataattag tctacagcca aattaccccc aggggaactt ggcccagtta 180 gcataacttc acagttctct tttcaagatc aaacgaaggg ccggctgagg cgatgtcgtt 240 tgtgcgttcg ggttttcaca cgcgtcaaaa acagatggcc catgcgtgag ccgataggcg 300 atcgaatcta attcacttgc ttatgccaga gggagagaga gagatagaga aagggagaag 360 aaggacaata tgcgacactt cgggttttga ctcgttgacc aatcaattag gacatggagg 420 ggataattac ctcacccgct aattgattgg tcagaaggtc tcggcaagaa atccgaaccc 480 aaaaaagatt ccaggaatta tggggatctt caaagtcctc aaatgaacga agaggtcact 540 ataaaaggcg atcgaaaatc tcgatcgggg ctcagtttgt tcagtcagcg cagttcagta 600 ttcagtatgt agttcagttc ggagttcagt tcggtgttag tcacagtatg tagttcaaat 660 tttgccctca gttaattctc ggtaatccca gtgttctcag ttcagtagtt tagtgtcagt 720 tttggtgttt gactttttgt aactcgttaa actttctcaa taaaaagtta ggaaaagtga 780 tcgtgcccgc gtttttcact ctaaatcaaa aagaaattcc agtgtttgaa atatatgggg 840 tggacggcgc gataca 856 // ID Saci-3_LTR repbase; DNA; INV; 249 BP. XX AC BK004070; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version 2) XX DE Schistosoma mansoni Saci-3 LTR retrotransposon: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Saci-3_LTR. XX NM Saci-3_LTR. XX OS Schistosoma mansoni OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Saci-1, -2, and -3 and Perere, Four Novel Retrotransposons with RT High Transcriptional Activities from the Human Parasite RT Schistosoma mansoni."; RL J. Virol 78(6), 2967-2978 (2004). XX RN [2] RA DeMarco R., Kowaltowski T.A., Machado A.A., Soares B.M., RA Gargioni C., Kawano T., Rodrigues V., Madeira M.A. et al.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (03-DEC-2003)Departamento de RL Bioquimica, Instituto de Quimica, Universidade de Sao Paulo, Av. RL Prof. Lineu Prestes, 748, Sao Paulo, SP 05508-900, Brazil. XX DR Genbank; BK004070; Positions 4969 5217. XX SQ Sequence 249 BP; 62 A; 50 C; 45 G; 92 T; 0 other; tgtagtgatc ggccagtttc gataagaaca cgattggaat aatagaaaca attattatta 60 ttgtaaccaa ttgtaccaga gattgtttta ctgtatttgt ttaactgtat ccgtttcact 120 tcttgttcca tataccgcct tgtttggttc gagcttgctc acgcactgct tattcgcttg 180 tactctttgg cacgtctcgg tattatttcg ataccacgag atcgccaata aatatacttg 240 atctggact 249 // ID Nematis_C4 repbase; DNA; INV; 2212 BP. XX AC . XX DT 15-DEC-2006 (Rel. 11.12, Created) DT 15-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Nematis_C4 is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; retrotransposon; Penelope-like element; KW reverse transcriptase; GIY-YIG endonuclease; Nematis_C4. XX OS Caenorhabditis brenneri OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2212 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Nematis_C4 is a Penelope-like element (PLE) from the sequenced CC genome of the nematode Caenorhabditis_sp4. It belongs to the CC Nematis group of PLEs. Its 5' truncated ORF contains regions CC homologous to reverse transcriptases and to GIY-YIG CC endonucleases. The element is very low-copy, incomplete and CC apparently inactive. Consensus sequence was assembled from trace CC archives. XX FH Key Location/Qualifiers FT CDS 118..1908 FT /product="Nematis_C4_1p" FT /translation="CQKIKKAPRGSGILEEQRRTALSISDKGGEFVVSTRD FT LYRETVEKHLQDRSVYREIRQLEYENAVTLLKHAQEEVYRSWNRWTVDRLM FT DPHPSINTFYGLYKTHKFVERGDKANSKNIKIRPIISGSGGPTDRPSWVVC FT TIITQLLQFVPCHLKSTSEFLEELGRINSPNELHYESFDVESLYTNIDNRM FT AMEAVVRKLRKHSTEIEWFGMSVREVTHLLNACLNFNMFQFNNKLYAQKRG FT LAMGSRLAPVLAVLYMDIIECPSKIHPTRLFKRYIDDFIIITESQETLDQV FT FKSLNAQAETIKLTREKPSEGWLPFLNCQIRQRGNGFETKWYKKPSNKNIL FT IRIESCHPIRQKRNIVKVTRKTAERTTSEGNRDEAEKLAAKILLKNGYGTE FT WEKFRNTKGHRWKQPSRKTDPGKPVTTILPIPFISDQLSDLVRKTLNDVGI FT DATLVELKGKSLKDQLVHNRLFDIRCDKTKCRVCPTLGVGACGKKGGIYQL FT TCECGEAYIGETGRPLVMRIDEHVRAANNPQRPSYATNAFSKHAKAKHGGL FT PIRLTLKLLTIKRNTTRRKALEALYITTQSPSLNGKEELTDLVAKFGKM*" XX SQ Sequence 2212 BP; 775 A; 469 C; 492 G; 476 T; 0 other; cgttcaggct cttaaaaact tgatccagcg tttcttatga ctccgtgatg atgatgaaat 60 catcgatgta tcttttgaac agtctcgttg ggtggatctt actggggcat tcgatgatgt 120 caaaaaatca aaaaagcacc gcgtggctct ggaattcttg aagagcaacg aagaactgca 180 ctctccatct ccgataaggg aggcgagttc gttgtctcaa ccagagatct ataccgagaa 240 acggtggaga aacatctaca ggatagaagt gtttacagag agatcagaca attagaatac 300 gagaacgcag tcacgttact gaagcatgct caagaagagg tatacaggtc atggaaccga 360 tggacagtgg acagattaat ggatccccac ccctccataa acactttcta cgggctatac 420 aaaacccata aatttgttga aagaggagac aaagccaaca gcaaaaacat caaaatacgg 480 ccaatcataa gtggtagtgg gggaccaaca gaccgcccct cgtgggtagt ttgcaccatc 540 atcacacaac tgttgcaatt tgttccatgc cacctgaaaa gtacgtctga attcctcgag 600 gagctcggaa gaatcaacag cccaaatgaa ctacactacg aaagttttga cgtagaaagt 660 ctctatacaa atatagacaa cagaatggca atggaagcgg tagtcagaaa actgaggaag 720 cacagcacgg aaatagaatg gttcgggatg tcagtcagag aagtgaccca ccttctcaat 780 gcctgtctca attttaatat gttccagttc aacaacaaat tgtacgcaca aaaacgagga 840 cttgcaatgg gcagccgtct agctccggta ctagcggttc tatatatgga catcatcgaa 900 tgccccagta agatccaccc aacgagactg ttcaaaagat acatcgatga tttcatcatc 960 atcacggagt cacaagaaac gctggatcaa gtttttaaga gcctgaacgc acaagcagag 1020 actattaagc tgaccaggga aaagccaagc gaagggtggc ttccttttct gaattgccaa 1080 atcagacaaa gaggaaacgg ttttgagacg aaatggtaca aaaagccttc taacaaaaac 1140 atcctcataa gaatagaaag ctgccaccca ataagacaaa agagaaacat cgtgaaagta 1200 acaagaaaaa cagcagaaag aacgacctcc gaaggtaaca gagacgaggc ggaaaagttg 1260 gcggcgaaaa ttctactgaa aaatggatat ggaacggaat gggaaaagtt cagaaacaca 1320 aaaggacacc ggtggaagca gccgagccgg aaaacagatc ccggaaaacc cgtcactaca 1380 atcctaccga taccgttcat atcggatcag ttatcagatc tcgtgaggaa aacactgaat 1440 gatgtgggga tagacgcgac gctggtggag ttgaaaggaa agtcattgaa agatcaattg 1500 gtgcataatc gactgtttga catcaggtgc gacaagacca aatgcagagt ttgcccaact 1560 ctgggagtgg gagcatgcgg aaagaaaggg ggtatctatc aattgacttg tgagtgcggg 1620 gaagcatata ttggagaaac ggggagaccg ttagtcatga gaattgatga acacgttcga 1680 gcagccaata atccacaacg accatcatac gcaacaaacg ccttctcaaa acacgcaaag 1740 gcaaaacatg gaggattgcc gatccgactc accttgaagt tactaacgat aaagaggaac 1800 accaccagaa ggaaagctct tgaagcattg tacataacca cgcaaagccc tagtcttaac 1860 ggaaaagaag agttaactga tctggttgcg aaattcggaa aaatgtgata gggcctaaag 1920 actcaaaaca taaaaatcaa aaactccaaa aacactacct cccctttcat atccataacg 1980 tgttctactt tgtgaccctt tctctaagct ttctctctca ttttcgaatt agcattctct 2040 gtgtaagtcc tatctttttc cttctgcata tgtaatttct agttttagat agtttgtccc 2100 tgatgatggc gtaaagccga aacgttggac cttgaataaa gagataaaag tgcaaaacaa 2160 gaacaaaaac aagggtccag aaaagaaggc caccttcaat ccaaaaatga at 2212 // ID BEL-122_AA-I repbase; DNA; INV; 6209 BP. XX AC AAGE02028819; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-122_AA_; KW BEL-122_AA-LTR; BEL-122_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028819; Positions 15666 9458. XX CC Positions [4911-5492] - Integrase core CC 'CAGTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(495..4178,4182..5936) FT /product="BEL-122_AA-I_1p" FT /translation="MTLRELTKAERGYFDSLHNVESFVANFDPQRDQSRIA FT SRLEYLETLFKEFRDNRAKVEARQEQDLAASRSGEAVKEVQLTKDMIASNR FT KTRLDFENRYFDVKDFLVSQRINTTAVASSSSSHSSPPGQIPTSRIHLPVF FT KIPSFDGSVKDWLSFRDAFQNTIGKDITLSPLDKFNYLLSVLTKEARTLVE FT SIEVTANNFDVAWQMLEQRFENKKMISRALMNSFLDSEPIKRESYDALVNL FT IDSYERNLLQLRKIGLPTQGWSHLLAHILYTRLDAETQRHWERAHNSREVP FT TYEDLLKFLRDHLATLQPLSLSKPRPTEFRPDQKAQKSKISSTLTTTAAPK FT NVCPHCQKPFHSPFKCNSFINMNPSQRLESVKKAGLCLNCLSSSHLVRACP FT SSACRVCGQRHHTMLHLRPSNNGSSQGQEYSQPQPNKPTLPQGQPQIAPAS FT NNSSTSQSKAFATTPSTSFESSASSHPSVALPASAPSNSSIVLLSTAVVKI FT EDQKGNVQFARALLDCCSERNLLSESLAQKLDLRRQHDPLSLQGVGPSAAT FT SKQSTMATIRSRCTDYVVDLKFHILSEFKPVLPSNRLQTDRWKIPPFVQLA FT DPRFFDPNRIDIVIGAEVYYRLLLEGFVDLGPELPHLKETVFGWIVSGKCD FT ALETSRSAVTLVCCNADLERQLARFWEIESCHADESLSVEERKCETHFTET FT TTRDPSGRFMVSLPKKLDVLENLGESRSIAIRRFMSLERQLHSNPQLMEEY FT EAFAQEYVQLGHMAPINPSNTVLLPSEKIYYMPHHCIVRPDSATTKLRVVF FT DASCATDSGVSLNDALMVGPVVQDDLFSILLRFRIPRFVIVADLQKMYRQV FT LVYPSDRSLQRIVFRSSPVEPIQTYELRTVTYGTASAPYLATRCLQQLASD FT GESTHPKAAKVLSKSFYIDDLLSGVESEEERIELCRQLIDLLRSAGFKLHK FT WASNSPNILRNIPAELREDRSLLELDSSSSPVKTLGLLWQPTEDAFRFKIP FT CWSQEGPITKRLVLSESARLFDPLGLLGPIVLRSKLFMQELWKAKVSWDLP FT LSESQQQFWIDFRSDLNVLDEFSVPRWAASAKEPVQAELHGFSDASESAYG FT ACIYLRMVSNGESVSVHLITAKSRVAPKGTEKLDQLIRLPRLELCGALLLS FT HLFEKVESSLQIQASPFFWTDSTIVVHWLAASPSRWKKFVGNRVAEIQQIT FT ASGTWRHVPGIENPADLIRGMQANELVEHSLWWQGPIWLQQPNRFWPDPVR FT TADDHFEREQLQEKQTVALPAVAQSSIFSLKSSLTSLVRLVAYMQRFCNNA FT KKHNPETRSSGALSTVELDEALVRLVKLAQQESFAQDLHSIRTTGQVKSSS FT KLKALSPVIIDGVLRTRGRLNNAGISFAQKQPMILDNKHPFTLLVVRYYHL FT NRLHAGPTLHTAVIRSKFWPLRLRDLVRKVTHECINCFRNRPTLSEQLMAD FT LPPVRVSPTLPFLNSGVDFCGPFYLRPHSRKAAPLKVFVAVFVCLSTKAIH FT LELVGNLSSESFIASLKRFAARRGVPKTIYCDNGTNFVGAQRALNEFLQLF FT RSQQSRLDITRQCSEEGIQFSFIPPRSPHFGGIWEAAVKSLKTHLRRTLAN FT ALVTAEQFHTLLTQIEALLNSRPLTQLSNSPEDLDVLTPGHFLVHRPLTAI FT PEPSYEELPSNRLSQWQQIQKYLRRLWKRWSTEYLSGLQQRTRWTRARDNI FT RIGTMVLIREDNLPPQKWRFGRVVEVFPGNDGLIRVVSIKTKDGIYKRVVT FT RVCVLPIPDNQPGHDGAAFQTFQVPATQATSDGRGPEGPLTS" XX SQ Sequence 6209 BP; 1546 A; 1653 C; 1457 G; 1553 T; 0 other; ttaatggtcc ttcgaggagc cggatttgtg aattttcagt agccaccacg tggcatacaa 60 agtgatctat tgtgttcagt gataatagca catacagcaa ccagcgtgtt gctttgttct 120 cccttggtcc atcggcgtcg tttgcgtagc gacgagtgcg gattcgatcg agctatcgaa 180 tcatcttctg cggacccgtt tcgaaccaag gcacggttca tttgccggac acgaagcact 240 atcggcgtca cgggacgcgc ttctccccct tctttctgtc tgcggcaaag cggcgtggct 300 tggattggcg aacaatagct ggtagcgtga ctacaaaaga ctgaggtacg aaaatacact 360 gtgcatgcga gtttcatttt agtttgtaag gaaattgttt gcgatttacg gattttttgg 420 gcaattgacc tggctttgtt tttgggattt gagagggttt gcgatttgga tcaatacgaa 480 taagattcgt cagcatgaca ttgcgcgaat taaccaaagc cgagcgtggg tacttcgatt 540 ccctacacaa tgtggaatca tttgtggcta acttcgaccc gcagcgcgat caatcaagaa 600 tcgcttcgcg cctggagtat ttggaaacgc ttttcaaaga gttccgcgat aatcgggcta 660 aagtggaagc acggcaggag caagatttgg cagcatcgcg ctcgggagag gcagtgaagg 720 aagtgcagct gacgaaagac atgattgcgt cgaatcggaa gacacggttg gattttgaaa 780 atcgttattt cgatgtgaaa gattttttgg tttcgcaacg catcaacacg acagctgtag 840 cgtcgtcctc ctcctcccac tcctctccgc ccggccaaat tcctacatcc cgcattcacc 900 tcccggtttt caaaatccct tcctttgacg gtagtgtgaa agactggtta agctttcgcg 960 atgctttcca gaatacgatc ggcaaggaca tcacgctgtc gccgctcgac aaatttaatt 1020 accttctctc ggtgctcacc aaggaagcta gaactttggt tgagtcaatc gaagttacag 1080 ccaacaactt cgatgttgcg tggcaaatgc tcgaacaacg gttcgagaat aagaaaatga 1140 tctctcgcgc gctaatgaac agttttctgg actctgagcc gattaagcgg gaatcgtacg 1200 acgctctagt caatctcatc gattcgtacg agcgcaatct gcttcagcta aggaaaatcg 1260 gattacccac ccaaggttgg tcccacttgt tggcccacat actctacaca cgtctcgatg 1320 ccgaaacaca acggcactgg gagcgggcac acaactctcg agaagtacca acctacgagg 1380 acttgctgaa gtttcttcgt gaccatttgg caacactgca accgttgtca ctctcgaagc 1440 ctcgtccaac agagttccgt ccagaccaga aggcgcagaa gtccaagatt agttcgacac 1500 tcacaacaac tgccgcgcct aagaatgttt gtccacactg tcaaaaacct ttccattctc 1560 ctttcaaatg caactccttc atcaacatga atccctccca aaggcttgaa tcggtcaaaa 1620 aagctggttt gtgtctaaat tgtctttcct cttctcattt ggtacgggct tgtcctagct 1680 cagcttgtcg tgtttgcggc caaaggcatc atacgatgct gcatctgcgc ccttctaaca 1740 atggatccag ccaaggacaa gagtattcgc aaccgcaacc caataaaccc actctcccac 1800 aaggtcagcc ccaaatcgca ccagcttcaa acaattcctc tacctcgcaa tcgaaagcat 1860 tcgccaccac cccgtctacc tcgttcgagt cttccgcctc cagccatccg tcagtcgctc 1920 ttcctgctag cgcaccgtca aatagtagca tcgttctcct ctcaactgct gtcgtcaaga 1980 tagaagatca aaaaggaaac gttcagttcg cgagggcact tctagactgc tgctccgaac 2040 gcaacttact cagcgaaagc ttggcgcaaa aactggatct tcgccggcag cacgatcccc 2100 tttctctgca aggcgttggg cctagtgctg caacatcgaa gcaatcgacg atggcaacca 2160 tccgttcacg ctgtacggat tacgtagtcg atctcaaatt ccatatactg tcggaattca 2220 aacccgtctt gccatcaaac cgtttgcaga cagatcgttg gaaaattcca ccattcgttc 2280 aactcgccga tcctcgcttc tttgatccca atcggatcga cattgtcatt ggagctgagg 2340 tttactatcg cctactacta gaaggatttg ttgatcttgg accagaactt ccccatctaa 2400 aagaaaccgt ttttgggtgg atagtttccg gtaaatgcga cgcgttggag acaagtcgtt 2460 ccgccgtcac tctcgtctgc tgcaacgcag atttggaaag gcagcttgct cgcttctggg 2520 agatcgagtc ttgtcacgcg gacgagtccc tctcagtgga agaacggaaa tgcgaaaccc 2580 actttaccga aactacaact cgagatcctt caggacggtt tatggtttcg ctaccaaaga 2640 aactcgatgt tctcgagaac cttggagagt ctcgaagcat tgccattcgt cgtttcatgt 2700 cgcttgaacg tcaacttcac tcgaatcccc agctgatgga agaatacgaa gcgttcgcgc 2760 aagagtatgt tcaactaggg catatggcac cgatcaaccc tagcaataca gtcttgctgc 2820 ctagcgagaa aatttactac atgccgcatc attgtatcgt tcgtccggac agcgccacga 2880 cgaaacttcg cgttgtattt gacgcctcat gcgccaccga ttctggtgtc tcattaaacg 2940 acgctctaat ggtgggcccc gtcgttcaag acgatctgtt tagtattctc ctgcgcttca 3000 gaatccctcg tttcgtcatt gttgctgatc tgcagaaaat gtaccgacag gttttggtgt 3060 atccctcgga tcgctcactc cagcgcatag tttttcgttc ctcaccagtt gaaccgatcc 3120 aaacctacga gttgcggacg gttacttatg gtactgcatc ggcaccgtac cttgcgaccc 3180 gttgcctcca gcaactggca tccgacggtg aatcaactca tcccaaagcc gccaaggtgc 3240 tgtcgaaaag tttctacatc gacgatctgc tttctggggt agagtctgaa gaagaaagaa 3300 tcgaattgtg caggcagttg atcgatctcc tacggtcggc tggattcaaa ctgcataaat 3360 gggcgtcaaa cagccccaac atactccgaa atattcctgc tgagcttcga gaagatcgca 3420 gtcttctaga actcgattcg tcgtcgtccc cggttaaaac cttgggcttg ctatggcaac 3480 caaccgaaga cgcttttcgc ttcaaaatcc cttgttggtc gcaagagggt cccatcacca 3540 aaaggttggt gctttccgaa tcggctcgct tgttcgatcc actgggccta ttggggccaa 3600 tcgttcttcg ctccaagctc ttcatgcagg agctgtggaa agcgaaagtt tcctgggatc 3660 ttccattgag tgaatcacaa cagcagtttt ggatagattt ccgaagcgac ctcaatgtcc 3720 ttgacgaatt ctccgtccca cggtgggcgg catctgcaaa ggaacccgtg caagctgaac 3780 tccatggctt cagcgatgcg tccgaaagcg cgtacggcgc atgcatttat cttcgcatgg 3840 tttcaaatgg tgaaagcgtc tctgtgcact taattactgc aaagtctaga gtggcgccta 3900 agggaaccga aaaattggac cagctcatac ggttgccgcg tctcgagctc tgtggagctc 3960 ttctcttgag ccacctattc gagaaggtgg aaagcagttt acaaatccaa gcaagtccat 4020 tcttttggac tgattctacg attgtcgtgc actggttagc ggcatcaccg tcccgctgga 4080 agaaatttgt tggaaatcga gtagcagaaa tccagcagat cacagcttct ggtacatgga 4140 gacacgtacc gggcattgag aaccccgcag accttatcta gcggggaatg caagcgaatg 4200 agcttgtaga acactcgctg tggtggcagg gacccatttg gctacaacag ccgaatagat 4260 tttggcccga ccctgttaga acagcggatg atcatttcga acgagagcaa cttcaagaga 4320 aacaaactgt tgcccttcct gcagttgctc aaagcagcat attttctttg aagtcgtcac 4380 taacgagttt ggttcgtctg gtcgcctata tgcaaagatt ctgcaacaat gcgaaaaagc 4440 acaacccgga aactagaagc agcggagccc tatcgacagt tgaacttgat gaagcgttag 4500 tgagactggt aaagctcgcc cagcaagaat cgtttgcgca agatctgcac tcgattcgca 4560 ccaccggcca agtgaagtct tcatccaaac tgaaagcttt gtcgcccgtg attatcgatg 4620 gcgttcttcg tacaagaggt cgcctcaaca acgcaggcat atcgtttgct caaaagcagc 4680 caatgattct ggataacaag cacccattca cccttctggt tgtgcggtac tatcacctga 4740 atcggctgca tgcgggccct acacttcaca ccgccgtcat tcggtcaaaa ttttggcctc 4800 tccgactacg agatctagtg cgcaaggtca cccacgaatg cattaattgt ttccgcaacc 4860 gaccaacctt gagcgagcaa ttgatggcgg acctaccacc cgttcgagtg tcaccgacac 4920 tgccattctt gaactccgga gtagacttct gcggtccttt ctacctccga ccacattcaa 4980 gaaaggctgc gccactgaag gtgttcgtag cagtttttgt ttgtctctca accaaggcaa 5040 ttcacctgga attggtaggc aatttatcat ccgagtcgtt catcgcttca ttgaagagat 5100 tcgctgcgcg tcgaggtgtt ccaaagacca tatattgcga caacggcacc aattttgtag 5160 gtgctcaacg agcgctaaat gaattcctgc aactgttccg atctcaacag agccgattgg 5220 acatcactcg acaatgctcc gaagagggca tacaattttc ctttatacca cccagatcac 5280 cacacttcgg tggcatctgg gaagccgcgg tcaagtcgct aaaaacgcac cttcgccgca 5340 cactggccaa cgccttggtc accgccgagc aatttcacac gttgcttacg caaatcgagg 5400 cactgctcaa ctctcggccg ctgacgcagc tgagtaactc tccggaggac ttggacgtcc 5460 tcacgccagg tcatttctta gtgcatagac ctttgaccgc aatccccgag ccctcctacg 5520 aggagctacc cagtaaccgg ctttcacagt ggcagcaaat ccaaaaatat ctccgtcgtc 5580 tttggaagcg ctggtcaaca gagtatctgt cgggactaca gcaacgaacc cgttggacgc 5640 gagcacgcga caacatccgt atcggcacga tggtgttgat ccgtgaagat aatctgccgc 5700 cgcagaagtg gcgtttcggc cgcgttgttg aagtatttcc cggcaacgat gggctcatcc 5760 gagtcgtcag catcaaaact aaggacggca tctacaagag ggtagtcact agagtctgcg 5820 tcctgcctat cccggacaac caaccgggcc acgatggggc agcgttccag acgttccagg 5880 tgccagcaac acaagcgacg tcggatggga ggggacctga aggtcccctc acaagttaag 5940 tatccctata tttttgctgg attttgtctg ccccgaccta aagtctaccg ttttgacagg 6000 tttcctggag tttggcaact caaccgtagg tcgacgtgcg agttccgatc gaggaaccaa 6060 caaccgttgg tcaaggagtc gtagtcaaat attgcagctg agttaaattt ttgaattgta 6120 ttttgatcat acttcattaa tacgttagta agttaagtat tcacagttag gtagtaagat 6180 tgaaatccag atttcaatgg tgggcggta 6209 // ID MSAT-3_CQ repbase; DNA; INV; 49 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A satellite repetitive sequence family from Culex DE quinquefasciatus - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-49 RA Kojima K.K. and Jurka J.; RT "Satellite sequences from the southern house mosquito."; RL Repbase Reports 11(1), 615-615 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >89% CC identity. XX SQ Sequence 49 BP; 12 A; 7 C; 15 G; 15 T; 0 other; tgtccctgta gttgaacagg tggtgaacat gtagttgttc acgggtaaa 49 // ID TELSAT_PF repbase; DNA; INV; 142 BP. XX AC . XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Plasmodium falciparum subtelomeric satellite repeat. XX KW SAT; Satellite; Simple Repeat; Subtelomeric repetitive sequence; KW satellite repeat; TELSAT_PF. XX OS Plasmodium falciparum OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodium; Plasmodium (Laverania). XX RN [1] RA Gardner M.J., Hall N., Fung E., White O., Berriman M., Hyman R.W., RA Carlton J.M., Pain A., Nelson K.E. et al.; RT "Genome sequence of the human malaria parasite Plasmodium RT falciparum."; RL Nature 419(6906), 498-511 (2002). XX RN [2] RP 1-142 RA Gentles A., Kohany O. and Jurka J.; RT "Subtelomeric repeat."; RL Direct Submission to Repbase Update (16-MAY-2005). XX DR [2] (Consensus) XX SQ Sequence 142 BP; 51 A; 26 C; 21 G; 43 T; 1 other; ctttatgttc ttagaacaga taaagaggta tcaaagtact tcctccttta tgytccataa 60 aagaacatac aaaggaggat tttattacct cccctcctta atgttcctta gaagaacata 120 taaagaggga tatttaaagc ac 142 // ID EnSpm-N4_BF repbase; DNA; INV; 3212 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-N4_BF autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; EnSpm-N4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3212 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3212 RA Kapitonov V. and Jurka J.; RT "EnSpm-N_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 795-795 (2008). XX DR [2] (Consensus) XX CC This non-autonomous En/Spm transposon is characterized by 3-bp CC TSDs and imperfect 32-bp TIRs. XX SQ Sequence 3212 BP; 893 A; 704 C; 638 G; 976 T; 1 other; cacagcacga aatctggtat ttaagcaagc ttaaactcaa cgtaaacgcg acgtttctgc 60 accctttccg caacttaacg acatctatcc gtgaacgtaa aggttccctt taatgttacc 120 tttacgttgg cacttaaatg tgcgtttcgg cactcctcac gtgcctttcc gcaactatta 180 gacgcttaaa ggcgtcgtta atgacagttt ttcgtaacct atacgggagg aaagtatgcg 240 tttattaaga cctttctgca ccctttccgc aacttaacga catctatccg tgaacgtaaa 300 gattcccttt aatgcactta tatgtacgtt tcggcactcc ttcacgtgcc tttccgcaac 360 tattagacgc ttaaaggcgt cgttaatgac agtttttcgt aacctatacg ggcggaaagc 420 acgcgtttct taagaccttt ccgtgatctt ttgaccacat ttacagtcga tttccgtcac 480 gatttcccca cgtttatgtg tcgtttacac accctgaaag gcacgcgaag aagtgcagaa 540 acgtacttat ccgaacgctt tccgtgatct ttccttggac tatagttgtc atttattttt 600 agtaaaggcg gcgtaagttt aaacctttcc gcatctttag gtaggcttta tgtgttttgt 660 gtggcttaaa ttggaacttt tgcatcgggt atgcgtgtct tatttggtgt gaatcatacc 720 tacaccggta tctatggctt tgccgtctaa actatcttgt gagtccaatt gtggacttgg 780 attggtcgac cgagggaaca atcggtgtac aatatccatg cacgaatctc cgctacattt 840 aagtccacaa tttggagtcc agtgtcgggc gatacgcctg gaaagacatc atttcaagca 900 gcatagttcc atctggccat cacattagga aaaatccgct gagattcccg aagacgggta 960 ctgttttgtg atgaacaggc cgtgctctcc atatctccgc cagcttatgc tggtcaagat 1020 gacacagtct tgaattcacc ctgagttgtg cccctttgaa gactctccaa catacgtact 1080 ccgtttctgg caagccttga gtgtcctatg gttatgtccg ttccgtcctg tagttccgtt 1140 cgaatccatt tgcggcaaag ccacgccgat aatccgtgtg ttgaaacgga tccaaggctc 1200 cttcttgact tgtcccgaaa cagacgccct gcaaactggt ttcttctata aattatggta 1260 tattcgttag gtttaaaaaa cccacctttg tctgtcattg gataacgccc tatcacatga 1320 ctggcctctt ctgggagctg ctaattagaa attcaatgcc gcgttgttcg ttattcatag 1380 ttgacggcga ataaatttga atttgaaaaa caccatcccc tttgtacgct gtttagccaa 1440 ggttaagcta aacacaactg ttaccacact agaaaatacc agcctttctg ggagaacgtg 1500 attattgctt gtagataata aatgttcttc cagatgataa gaaagttttt ttgtacaatt 1560 atataaacag caaacccgct atattcccat gatacgttac cgcgtagtga ggacgctatg 1620 tcgtcgggat gccaagctgc aaggcacagc cacccgcgag cgacatactg cacaggccgc 1680 ccagtactag tcctcttacc ctgaacggga acatgccgta attatgacca ccaatcttcg 1740 ggggcatgta cccagtcctc ccggcgactg ttgtttcctt tcaacaagga ggttaaatat 1800 ttaatttcct tgctttcaac gctacctctc caacatcttt ttctgtaact atcaatctcc 1860 acctctgtaa agaaagttca gagagttttc tgaaaagtaa gctcaaaggc aaaggtaacg 1920 gaacgatatc ggtaacatac gatactactg gaaaaaataa acgctacctt tccgacatct 1980 ttcttgcacc tgtacatagc atctttttgt gtaaagtgag ctcaatggta acgtaaagac 2040 attggaaaga taccatatgt tatatttatg tttcacccat gaggatattt tgattttcct 2100 ggtttcgtca aacacgctaa cacagaatga gttattcagg cgaagtcgct actttagtga 2160 acggcgcatg tggtatattg agcggaaaat actcgtaatt tggtaaaagc aatgcctctt 2220 ttaagaataa taattaacgc tacctttccg acatcttttt tgcacctata cattccaaaa 2280 attgtatctg ctatatgtaa gttttaacca ggagtaaatg ttgcttatct tggttaaact 2340 caacacgtta acacagcagg ggctattcag gcgaagccgc tgcgttagtg agggcgcatg 2400 tggtatattg agcggaaaat aatgcctctt cttcaaataa tgattaacgc tacctttccg 2460 acatcttttt tccacctata cataaaaggg gggaggtttg cttaaacctc ccccctttta 2520 tgtaaagtca gctcaaatgt aacggaaaga catcggaaag attgtatctg ttatatgtag 2580 ctttcacaca ggatgagatt trcttttctt ggttaaactt aacacgcaaa cacagcagga 2640 gctattcatt tgaagccgct gctttattga acggcgcatg tggtatattg agcggaaaat 2700 actcttaatt tggtaaaagc aatgcctctt cttagaataa tgataaacgc taccattccg 2760 acatcttttt ttgcacctat acataaaagg gggaggtctc catagaccta tactgataag 2820 agtatgtaaa atgagatcaa ttgtaacgga agggcaatga aaatatatca tctcctattg 2880 gtaatttcca ccttagagaa ccacgaaaat aaaaaaaagg cgaaggttaa ctgattgtat 2940 gtgaacgtat ggtcggttca aagcctgtag atgttttctg tacataagca atacccgttt 3000 catatacggc gcatataagg atgcttaaag gttatgtaaa gatattacat gtatttaatc 3060 tgccttaacg aatacggaaa ggttacggaa atgtaaaacc atggtttccg tggtctttac 3120 gataccgtaa actctgcgga aatattttcc cacctttagg ctacatttaa gtatctcgga 3180 aagtatgctt aaatattcgt tttcgtgctg tg 3212 // ID L1-46_AAe repbase; DNA; INV; 4530 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-46_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4530 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1399-1399 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 136..1185 FT /product="L1-46_AAe_1p" FT /translation="MAVPVQTTRINTFMVNFSEVTRKPKVEVITDLIFNKI FT KIPSQRVKSFQFNGSRGVTYVECDSLDYALEIVENHDRKHEVAYDNVKVFV FT PLLMDDGATVVRLHDLPPQMDNDYIKNEMTKYGEVLSIKSEVWETPEQLKG FT IPNGVRKLRIRLVASIPSYIFMAGHSTHVTYKNQQITCRHCGRQVHYGAKC FT AEAVQIINSQRSVNTRLQRGSSGYADVLQGQGQKKRAASDDWIGPNMVNLN FT SLNKKWASRSPITTTDQPSTSSTDNVTEQKPDSEFKIPSKTARVSNIIGGS FT DEMANASEGPSDNTNINQFDVLMSEDDDKSPYFSDSSMVSDSSITGKRKSR FT KRNSKTK" FT CDS 1185..4412 FT /product="L1-46_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNTNTTYNVNNGLISYSIATINLNNISNTNKINSLKT FT FVNLMDVDVILFQEVESENIELYGFEIVYNVNHDRRGTAIAYKAHIQCTAV FT ERSIDSRVIKIKLNNNIVICNIYAPSGSSNRSSREYFFNEIVPYYLRGSNE FT FMILGGDFNCVETAKDSSNDYTNNFSPMLKRLRQALQLEDAWEVKFGSNKE FT YSFIRGGFGSRIDRIYVSQSLSLNVVDAKYQVTSFTDHKAHIVKVKLPSLG FT KPFGRGVWHYKSLVLEDDNNKEEFKQKWRYWVSQRQYYRNWVVWWAEYAKP FT KVVSFLRWKSRNMYNDFNSNMEFWYYALRRAYNEYLTDPLQISEINRIKGK FT MLKLQHDFSAFLSRNNEIHLNHEPLSIFHLEQKHQKKNKTRIKSFVIDGRL FT EENQENIKSHVVEQIENLFKSEEVATNNDFKPGRSIPNDFLENEELMEEID FT EDEVFCAIKSSASRKSPGMDGLTKEFYMDCWNIIKVEITAVLNCFLRGNIE FT KKMCDGMIVLIRKKNSGNSIDGYRPITLLNFDYKIVSRILKFRLNRLMPFL FT VSCNQKCSNPRRNIFEATCSIRDRIVELQHKKRKALLVSFDMEKAFDRVEP FT RFLFKTLREMNVSDRFVSFLEKVRDVSFSKILINGTLSREVKIERSVRQGD FT PLSMHLFVLYIEPLLQKIVASCSDSLDLINVYADDLSLVVNNLNDLEIVKT FT HIINFGMISGARLNVSKSKAVEIGFGSTPLPTVPWLKVEERIKVLGIYFSN FT SLKKMTVDNWTNAISSVTFSININQARKLNLMQKVFFLNTFALSKVWYVAS FT TIKINNKFCAKLKSLIGTFLWRGYGARVAFNQLILPKDRGGLSLHDPESKC FT NALIVKNFIKLNYDNSFIVNQFLNQLSNDANIPANYDYIKLLANEIPNLPP FT AIRNNISSKAVYDIQITKKPNPEITSKYANKNWSKVWRNLFTPKIMSSEAR FT SAYYLLINEKIPNMESFHRHGRVSTNLCRSCNVAIETIPHIFAECDSSKSF FT WNLCXSKLKGINANEIQRMNFNNFKVPELGRFSRAERNKVLEMFSKFVMYI FT NKTLANNRSLVELNDILTN" XX SQ Sequence 4530 BP; 1627 A; 742 C; 921 G; 1238 T; 2 other; cagtctcgat ttggcttttg atgcgtgcag acgtgctttc gaaacgttca atagtgagtg 60 aataatacaa atattgcctt cggaagaggg acaaatagtt accaacacat aacacacacg 120 aacaagtgaa caaacatggc ggtaccggta cagacaaccc gaatcaacac gttcatggtc 180 aacttttcgg aagtgactag gaaacccaag gtcgaggtaa tcacggattt gattttcaac 240 aagataaaga taccatcgca acgagtgaag tctttccaat tcaacggatc acgtggagta 300 acgtacgtcg agtgtgatag tttggattat gcgctagaaa ttgttgagaa tcatgacaga 360 aaacacgagg tggcttacga caacgtgaaa gtctttgtcc ctctacttat ggatgacggt 420 gcaacagtgg tcagactaca tgatttgcca ccgcagatgg acaacgatta cataaagaac 480 gagatgacga aatatggaga ggtgttgtca atcaagagtg aggtatggga aacgccggaa 540 cagctgaaag ggataccgaa tggggttaga aagctgcgga ttcgtctggt cgcttcgatt 600 ccatcctaca tttttatggc tggtcattcg actcacgtca cgtacaagaa ccagcaaata 660 acgtgccggc attgtggacg acaggttcat tacggagcga agtgtgcgga agcggtgcaa 720 attatcaatt cacagaggag cgtcaatact cggcttcaga gaggttcgtc gggctacgcg 780 gatgtattgc aaggtcaagg tcaaaagaaa cgagcagcgt cagatgattg gattggaccg 840 aacatggtta acctgaactc gttaaacaag aagtgggcct cccgatcccc catcacgacc 900 actgatcaac cgtcaacatc atcaacagac aacgtaacag aacagaaacc agactcagaa 960 ttcaaaattc ccagcaaaac agcacgcgtt tccaacatca tcggtgggag tgatgaaatg 1020 gccaacgcaa gcgaaggacc gtcagacaac accaacatca atcagtttga tgtgttgatg 1080 tccgaagacg acgacaaatc accatatttc agtgatagtt cgatggtatc ggattcatcg 1140 attacaggaa agcggaaatc cagaaagcga aacagtaaga cgaaatgaat actaacacta 1200 cttataacgt aaacaatgga ttgatatcat acagtattgc gacaatcaat ttgaacaaca 1260 tttcaaacac taataagatc aactcgttga agaccttcgt taatttgatg gatgtggatg 1320 taattttatt tcaagaggtg gagagcgaaa atattgagtt gtatggtttt gaaattgtct 1380 acaatgtcaa tcatgataga cgaggaactg caatagcata caaggctcat atccaatgta 1440 cagcggtcga acgaagcata gattccagag tgattaagat caaattgaat aacaacatcg 1500 tcatttgtaa catttacgca ccttctggtt caagcaatcg atcaagcaga gaatattttt 1560 tcaatgaaat cgttccatat tatctcagag gttcaaatga attcatgatc ttgggaggtg 1620 atttcaattg cgttgaaaca gccaaagata gtagtaatga ctatactaat aatttcagcc 1680 ctatgctaaa aaggctcaga caagcattac agcttgaaga tgcgtgggaa gtaaaattcg 1740 gaagtaacaa ggaatattcc ttcattcgag gagggttcgg atcgcgaatc gacaggattt 1800 atgtttctca atcgttatcg ttaaatgtgg ttgatgcaaa gtaccaagtt acttctttca 1860 cagatcacaa agcccatatt gtcaaggtaa agctaccttc tctgggaaaa ccttttggca 1920 gaggtgtttg gcattacaag tccctagttc ttgaagatga caataataaa gaagagttca 1980 agcaaaaatg gcgttactgg gtatcacagc gacaatatta tcgtaactgg gtagtttggt 2040 gggcagaata tgcaaaaccg aaggttgtgt cgtttcttag atggaaatct cgaaatatgt 2100 acaacgattt taacagcaac atggaatttt ggtattatgc tctacggcgc gcctataatg 2160 agtatctaac tgatccgctt caaatatctg agataaatcg tatcaaagga aaaatgttaa 2220 aattgcaaca tgattttagt gcgttcttgt caagaaataa tgagattcat ctaaaccacg 2280 aaccgttgtc aatttttcat ttagaacaaa aacaccaaaa aaagaataaa acacgtataa 2340 aatcatttgt aattgatgga agattagaag aaaatcaaga aaatataaaa tcccatgttg 2400 ttgaacaaat tgaaaattta tttaaatcag aagaagttgc tacgaacaat gattttaagc 2460 cggggagaag tattccgaat gactttttgg agaacgagga attaatggaa gagatagatg 2520 aagacgaagt attctgcgcg ataaaaagta gtgcttctag gaaatcacca ggcatggatg 2580 gactcaccaa agaattttac atggactgtt ggaatattat taaagttgaa ataacggctg 2640 ttttgaattg ctttctacgt ggaaatattg aaaagaaaat gtgcgatggc atgatagtgc 2700 taatcagaaa aaagaattct ggaaatagca tcgatggata tcgaccaata acattgttaa 2760 acttcgatta taaaatagta tcgcggattt tgaaattcag attaaatcgt ttaatgccat 2820 ttctggtgag ctgcaaccaa aaatgttcaa atcctagaag gaatattttt gaggcaacat 2880 gttcgattcg agatcgcatm gttgaacttc agcataaaaa aaggaaagca ttattagttt 2940 cgtttgatat ggaaaaggcg ttcgatagag tggaaccaag atttttattt aaaactttgc 3000 gagaaatgaa tgttagtgat agatttgtta gttttctcga aaaagtcagg gatgtttctt 3060 tctcgaaaat tctgattaac ggtacactat cccgagaggt gaaaatcgaa aggtcagtaa 3120 gacagggtga tcctttgtcg atgcacctat ttgttttata catagagcct ttattgcaaa 3180 agattgtagc ttcctgctca gatagtttag atttaatcaa cgtgtatgcg gatgatttat 3240 ctttagttgt aaataaccta aatgatttgg aaattgtaaa gacacacata ataaattttg 3300 gtatgatttc aggagcaaga ttgaatgtta gtaaatcaaa agctgtagaa attggttttg 3360 gtagcacgcc actgccaact gtgccatggc tcaaagtcga agaaaggatt aaggtgttag 3420 gtatatattt ttcaaattca ttgaaaaaga tgacagttga taattggaca aatgcaatta 3480 gtagtgttac attctctatc aatataaatc aagccagaaa gttaaacctt atgcagaagg 3540 tatttttcct taacaccttt gccttatcga aagtttggta cgttgcatca acgatcaaaa 3600 taaataacaa attttgtgca aaattgaaaa gtctaatagg aacttttttg tggcgaggat 3660 atggagcaag agtcgctttc aatcagttaa ttttacccaa agaccgagga ggcctgtcat 3720 tgcacgatcc tgaaagtaaa tgtaatgctc ttatcgttaa aaatttcatc aagttgaatt 3780 atgataattc tttcatagtc aatcagtttc ttaatcaact ctcaaatgat gcgaatattc 3840 cagccaatta tgattacatc aaattgcttg caaatgagat cccgaattta cctccagcga 3900 ttagaaataa tatttcatct aaagctgttt atgatattca aattacaaaa aagccaaatc 3960 cggaaattac ttctaaatat gcaaacaaaa attggagcaa agtttggcgc aatcttttca 4020 ctccgaagat tatgagttct gaagcaagat cagcgtacta tttattaata aacgaaaaga 4080 tacccaatat ggaatcattt cacagacacg gtagagtatc aacaaacttg tgcagaagtt 4140 gtaatgttgc gattgagaca attccgcata tttttgcgga gtgtgatagt tcaaagtcat 4200 tctggaattt atgctwttca aaactaaagg gtattaatgc caatgagatt caacgaatga 4260 actttaataa ttttaaagtt cccgaattag gaagattcag tagagcagaa aggaataagg 4320 ttttagaaat gttcagtaaa ttcgtaatgt acataaacaa aacacttgcc aataatcgaa 4380 gccttgtaga attaaatgat atactaacaa actgaaatag ttaagcacta acacaatgaa 4440 gactgtaaaa tagttataag tatcaataaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4500 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4530 // ID GEMINI1_EI repbase; DNA; INV; 2428 BP. XX AC GEMINI1_EI; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Gemini-Ei1 (GEMINI1_EI), a Fot1-like member of the Tc1/mariner DE DNA transposon superfamily from the single-celled eukaryotic DE reptilian parasite Entamoeba invadens. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Tc1/mariner superfamily; GEMINI1_EI. XX OS Entamoeba invadens OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RP 1-2428 RA Pritham E.J., Feschotte C. and Wessler S.R.; RG Department of Biology, University of Texas at Arlington, RG Arlington, TX 76019, USA; RT "Unexpected diversity and differential success of DNA transposons RT in four species of Entamoeba protozoans. Mol. Biol. Evol. (2005) RT In press."; RL Direct Submission to Genbank (16-MAY-2005). XX DR Repbase; GEMINI1_EI; Positions 1 2428. XX CC The TIRs of Gemini-Ei1 are 43-bp long, have some similarities CC with Fot1 elements from other species and are flanked by TA TSD. CC The element contains a large ORF, which can potentially encode a CC 617-aa protein; 41% similar to the Cirt1 transposase from Candida CC albicans. There are numerous elements related to Gemini-Ei1 in CC the E. invadens, E. moshkovskii and the E. terripinae genome, CC including ~800-bp internal deletion derivatives and elements CC encoding more distantly related transposases. XX FH Key Location/Qualifiers FT CDS 153..2202 FT /product="GEMINI1_EI_ORF" FT /translation="MENENMPVNPVMDNDIHVVVPLKLVDIMSPKFIENHP FT ESKIQQDSSKIIQLIEGLNNKEEMFKHINEAEKSIKEQHKIYFSQNDIENA FT RNISNFLCDDEYFFQLPSRREQIGVIFAIFEKCQGASNRSVGILFGIHHNS FT VRKESLEYKKDCKMGRRQTGRPSLFDERLTIITKAFVNYCSTQKLALTVGS FT FASVAKMVTGYNFKRDTLRKWVIRSFELKSVTGSPEELGRLFIEPKDILEN FT YLNLTQFNGFDMRFVFNVDEIGFQDFADSTSVKVFVPKTADVSKIKIGVGR FT SGKRSSVICMICLDGSLVCPGMVITNQKIHIDVLRRFPDVSFFQSQTGFVT FT KQIFSDWLLLVVVPFIETKRFLFHLSRDTWALITLDGFSGHDFETLKDILL FT DHYIILHKYTAHCSHLIQPLDLCFFGNWKKKCSTITPIYRTSVSSSFLPSH FT YQQHPEQQNSEEIGSQSLVSAMKDLVETLTNNNNISADALRDKGKRLQSGE FT RLKNSIIKTLNALDAANTHEAVISSFERAGMTTRYIGGKFKFFVNIQSATR FT LINEFQTYFEQNVFSNTSLISKLTAPNNDLKMIPIAQLSPKHKKMTNEEIL FT GHLDTISNMITEEGLDECGYSMIKSSLKTLCTYKPVYHNQTGYQFMPPLPL FT IHPRIQIPNIDEFDLENDDWIDLEYGSHYNTEDEX" XX SQ Sequence 2428 BP; 857 A; 394 C; 407 G; 770 T; 0 other; cagatagaag aattattggt tgtcatagaa aaagtggttg tcatgaaaat aatgttatta 60 taaacaaaaa tgtttttata ttcattatta ttaataatat agatgataaa tgcctaataa 120 ataataaaaa aaactgataa taatatttct tcatggaaaa tgaaaatatg cctgttaacc 180 cagttatgga caatgatatt cacgttgttg ttccattgaa attggtcgat ataatgagtc 240 caaaattcat tgaaaatcat ccagaaagca aaattcaaca agatagcagt aaaataattc 300 agttaatcga agggttaaat aacaaagaag aaatgttcaa acacataaac gaagcagaaa 360 aatcaataaa agaacaacat aaaatatatt tctctcaaaa tgatatagag aatgcccgca 420 acatttcaaa ttttctttgt gatgatgaat attttttcca acttccctca agacgcgaac 480 aaattggtgt catatttgcg atatttgaaa aatgccaagg agcttcaaac cgatctgttg 540 gtattctgtt tggtattcat cacaattcag tcagaaaaga aagtcttgaa tacaaaaaag 600 attgcaaaat ggggagacga caaactggaa gaccttctct attcgacgaa cgattaacta 660 tcataaccaa agcttttgtg aattattgtt caactcagaa actggcatta actgttgggt 720 catttgcttc tgttgctaaa atggtaacgg gatacaattt taaacgagat acattgcgta 780 aatgggttat tagatcattt gaacttaaat ctgtaacagg atcaccagaa gaacttggac 840 gcttgtttat cgagcccaaa gatatcttgg agaattactt aaatcttaca caatttaatg 900 ggtttgatat gcgttttgta tttaatgtgg atgagattgg gtttcaagat tttgcagata 960 gtacatcagt taaggtgttt gttccaaaaa ctgcggacgt ttcaaaaatt aagattggtg 1020 ttggaaggtc agggaaacgg tcgagtgtca tatgtatgat ttgtttggat ggaagtctgg 1080 tttgtcctgg tatggttatt acaaaccaaa aaattcatat tgatgtattg agaaggtttc 1140 cggatgttag tttcttccag agtcaaactg gattcgtcac caaacaaata ttcagtgatt 1200 ggctgctttt ggttgttgtt ccttttatag agacaaaaag atttcttttc catttatctc 1260 gagatacctg ggcgttgata acattggatg ggttcagtgg acacgacttc gaaacattga 1320 aggatatact ccttgaccat tacatcattc ttcacaaata cactgcccat tgttcacatc 1380 tcatccagcc cttggatttg tgtttctttg gtaactggaa gaagaaatgt agtacaataa 1440 cgcccattta cagaacaagt gtttcttctt cttttttgcc ttcgcactat caacaacatc 1500 cagaacaaca gaatagcgaa gaaattggat cacaaagttt agtatcagca atgaaagatc 1560 ttgttgaaac actgacaaat aataacaata tttcggcaga cgctttgcga gacaaaggaa 1620 aacgacttca atctggtgaa cgtttgaaaa attctataat aaaaacactt aatgcgttag 1680 atgctgcaaa cacacatgaa gcagtaataa gttcttttga aagagcggga atgacaacac 1740 gatacatagg aggaaagttt aagtttttcg tcaatattca gtctgcaaca agacttataa 1800 acgaatttca gacttatttc gaacaaaacg tgttttctaa cacttcttta attagcaaac 1860 tcactgcccc aaacaatgat ttgaagatga tcccaattgc acaattgtcc ccgaaacata 1920 agaaaatgac aaatgaagaa atattgggtc atctggatac aatatcaaat atgatcactg 1980 aagagggttt ggatgaatgt ggctactcaa tgattaaatc ttctctcaaa actttatgca 2040 cctacaaacc tgtttatcac aatcaaactg gttatcaatt tatgccccct cttccactta 2100 ttcacccaag aatacaaatc cccaacattg atgagtttga tctcgaaaat gatgattgga 2160 ttgaccttga atatgggtct cattacaata ctgaggatga ataacgataa attattaatt 2220 ttgagttttt gtttttaata aattttattt tgctttttaa gtttgacaac cactttctaa 2280 aaaatcaatc caaactacca caggcttccc gtaaatttgc ttgttaaaca actaacaatt 2340 tataattaat aaaaataaat atataaaaat acttttttat actcttgacg actacttttt 2400 ctatgacaac cactttttct tgcatctg 2428 // ID CR1-1_HM repbase; DNA; INV; 4692 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 2) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_HM. XX NM CR1-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4692 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(3), 180-180 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 20..535 FT /product="CR1-1_HM_1p" FT /translation="MVFMIILLLVTCNPVQPVPCSAALQDLLSKQRLEGVF FT FLSSVLSYISHAYSNFFFFFFFFFPKYSWRCLSYCISSPVIYVICMFVFCL FT NLWRGVRSPVSYFMFFMYVCVLSEPLARCSLTRKLLYVYVYVFYVFLCMFV FT FCLNLWRGVRSPVSYFMFFMYVCVLSEPLARC*" FT CDS 1717..4596 FT /product="CR1-1_HM_2p" FT /translation="MTKVSKNLKPKKPSPPPKYFNISFTNIRGLRSNFPSV FT ESYLLQNSPDLLALSETNLSSAIPSSDVSVDGYYPLIRKDSNSHMLGLGVY FT IRINSPICREIRFESFDHSFMCFRLAPLHSITFLFVLYRSPSSHDCTLLDV FT ISDQIDHALSLYPSANIVVVGDFNAHHTEWLGSNTTDPAGTKAFNFCVSQS FT LTQIVNFVTRFPDNPNHLPSLLDLCLVSDPSLCSVSPFSPLGGSDHAMISI FT NLSSRTSFLGSPYHRTTYYYPKADWDSFRDFLRDGPWADVFSLSADKCASY FT VTSWIQAGMEAFIPSRRFQVKPHSTPWFSPSCAAAISNRNHFFHLFQKNNS FT LENKRLFIIARNRCKKVLSDAKLHYSQFTKSRILSQKLGSKDFWKIFNNIV FT NKGRSNIPSLIHGTDLVTSPRDKAELFAKNFSSNSTLESYGHSLPSIPVKQ FT VDPLLDIQITPASVAKVISQLNSSTACGPDNIPVTVLQNCSPELSSILSKL FT FNKCLTESCFPACWKMASVVPIFKNSGENSDPSNYRPISLLSVISKVFESL FT INKFLTSHLESNKLLSDNQYGFRSSRSTADLLTAVTERFYRALDGGGEARA FT IALDISKAFDKVWHAGLLHKLASYGVSGKVFEIIKSFLSDRFIKVILEGQH FT SSLFPVTSGVPQGSILGPVLFLIYINDLPDNLSSKVALFADDSTLYSCLDK FT KSSLFDRLEQAADLESDLTSVIDWGSQWLVNFNSKKTQLLTANNYRSMVNI FT PILMNGNPLTESSSLRLLGLSLTTDLSWKPYIQSIAKLASAKVASLYRARH FT FLTSDSILYLYKSLIRPCMEYCCHIWAGSSNDSLSLLDKVQKRIVNVVGPG FT LSAKLEPLSHRRKVASLSLFYRYYHGHCSKELSSLVPSTKIHSRFTRHSAK FT SHSFTVSVPACSKNFYSSSFFPRTSILWNSLPSSCFPDSYNLQLFKSSVNR FT FLAL*" XX SQ Sequence 4692 BP; 1115 A; 1094 C; 818 G; 1662 T; 3 other; ttagtgtcag gtcaggagta tggtcttcat gatcatcctg ctactggtaa catgtaaccc 60 agtgcaacct gttccatgtt ctgctgcctt gcaggatttg ctatctaagc aaaggctgga 120 aggagtgttc ttcctgtcat cagtactatc ttatatcagt catgcctata gtaatttttt 180 tttttttttt tttttttttt ttcccaaata tagttggagg tgccttagtt actgtattag 240 ctcacctgta atctacgtta tatgtatgtt tgtgttctgt ctgaacctct ggcgcggtgt 300 tcgctcaccc gtaagttact ttatgttctt tatgtatgtc tgtgttctgt ctgaacctct 360 ggcgcggtgt tcgctcaccc gtaagttact ttatgtttat gtttatgttt tttatgtttt 420 tttatgtatg tttgtgttct gtctgaacct ctggcgcggt gttcgctcac ccgtaagtta 480 ctttatgttt tttatgtatg tttgtgttct gtctgaacct ctggcgcggt gttgactcac 540 ccgtaagttg atttatgtct atgtttgttt gtgttctgtt tatactctaa tgcagcgtga 600 gctcagtaac taactactta ctccaactac aaaacgtaca ctcaccttag actcctgtat 660 tttgttagat gcagtgtttt tcctaatgca cttccraaat gtcatgcctg gttttctcaa 720 agtactgttg cgggaagtaa ctccttttaa ctactccagg tcaggtcagg ataatcgaga 780 tcaatcttgc tactggtatc acgtaaccca gtaccacctg ccacaagccc tgctgccttg 840 cagagtgtgc tcttcttatg caaaggctag gagaaataaa ctccgactaa aaattcccct 900 gccttggggc tcttggttga gtaaaggcta gagatggtgt ctcgataaaa acactcatct 960 tgggcagatg ttaactgcat ccagctaaac atcccctgcc ttggggctct tggttgagta 1020 aaggctagag atggtgtctc gataaaaata ctcatcttgg gcagatgtta actgcatcca 1080 gctaaacatc ccctgccttg gggctcttgg ttgagtaaag gctagagatg gtgtctcgat 1140 aaaaatactc atcttgggca gatgttaact gcatcctgct actgtcttgt agaaggcctc 1200 ctaggcaaag actttagggg taagcagaaa aattctgcta accagcctcg aaccccttct 1260 tcatctatta ggctggcgta gatgtaacct gtgttacatt gtttcctgcy taggatgatg 1320 aatgctggat ctccttgact ctactcatag aattttgctt atgcctcctt gataatggct 1380 atgcaactct tcctgttatc tccatatgag ggtacagctc taaaactcag tttaatggtt 1440 ctgaggccgg ctggtagtaa ggtttcccga actctgtggt agctctcaga gaggctgatt 1500 ccatcaacag ctgcaaaata tcagagtatt aacagtgcca tgttgcgcat ggatggtgtc 1560 cctgttaatg cttttggtgt gcattgtcaa ggccacataa ggagccctat gtcacggctt 1620 agggttaatt aacaggactg agacaactgc atagctcatt gattaggagg ctgtgctcta 1680 tctatgaatg agtcaagttc tcattaaacc caaatcatga ccaaagtaag caaaaattta 1740 aaacctaaaa aaccatcccc accacctaaa tattttaata tatcttttac aaatattcgc 1800 ggtctacgaa gcaactttcc atcagttgag tcttacctcc tgcaaaattc accagacttg 1860 cttgcactca gtgaaacaaa tttaagttct gcaattcctt cttcagatgt cagtgttgat 1920 gggtactatc ctttgattcg taaagactcc aatagtcaca tgctaggttt aggggtwtac 1980 atacgcatta attcccctat ttgtcgtgaa atcaggttcg aatcctttga ccattctttt 2040 atgtgctttc gcctagcacc tcttcactct atcacctttc tctttgttct ttatcgttct 2100 ccttcttccc atgattgcac tcttttagat gttatctctg atcaaattga ccatgccctc 2160 tctctttacc cctctgccaa tattgttgtt gttggtgatt ttaatgccca tcacactgaa 2220 tggcttggct ctaacaccac tgatcctgct ggcactaaag cctttaactt ctgcgtttct 2280 caatctctta ctcagatagt taactttgtg actcgttttc ctgacaaccc taatcattta 2340 ccttcactcc ttgacttatg tcttgtctct gaccctagct tatgttcagt ttctcccttt 2400 tctcccttag gtggttctga ccatgcaatg atctctataa atctttcttc tcgtacttct 2460 tttttgggct caccttatca tcgcaccacc tactactacc ctaaagctga ctgggattcc 2520 tttcgtgatt ttcttcgtga tggtccttgg gctgatgttt tttctctctc agctgacaaa 2580 tgcgcctcct acgtaacctc ctggattcag gctggaatgg aagcttttat tccttctcgt 2640 cggtttcaag tcaagcctca ttctactcca tggttttctc cttcttgtgc ggctgctata 2700 tccaatcgta accatttttt tcatcttttt caaaagaaca actctcttga gaacaaacgg 2760 ctatttatta ttgcaagaaa tcgatgtaaa aaggtgcttt ctgatgctaa actccattat 2820 tctcagttca ctaaatctcg catcttatcg cagaagttag gctctaaaga cttttggaaa 2880 atcttcaaca acattgttaa caagggtaga tctaacattc catctctcat tcatgggact 2940 gatcttgtta cctctcccag ggataaggca gaactgtttg caaagaactt ttcttctaat 3000 tccactcttg aatcttatgg acattctctt ccttccattc cagtcaaaca ggttgaccca 3060 ttgttagaca ttcaaatcac tccggcttct gttgctaaag tcatatctca attaaactcg 3120 tctacagctt gtggtccaga caacattcct gtcacagtct tgcagaattg ttctccagaa 3180 ctttcttcaa ttctctctaa actatttaac aagtgtttga ctgagtcttg ttttcctgct 3240 tgttggaaaa tggcatctgt ggttccaata ttcaaaaact ctggagaaaa ttctgacccc 3300 tccaattatc gtccgatcag tcttctttct gttattagca aggtctttga gtctttgatc 3360 aacaaattcc tcacatctca tcttgagtca aataaactgc tgtcagacaa tcaatacggt 3420 tttcgatcct ctcgctctac ggccgacttg ctaactgctg taactgaaag attttatcgt 3480 gcattagatg gaggcggtga ggctagggct attgctctcg acatatctaa agctttcgac 3540 aaagtttggc atgctggtct tctccataag cttgcttcat atggtgtttc tgggaaagtt 3600 tttgagatta tcaaatcatt tctttctgac cggtttatta aagtcatcct tgaaggccaa 3660 cactcttctt tatttccagt aacttctggg gttccccaag gttccatcct gggccctgtt 3720 ttgtttctta tctacataaa cgatcttcct gacaaccttt catctaaagt agctcttttt 3780 gctgatgact caactttata ctcctgtctt gacaaaaagt cttctctttt cgatcgccta 3840 gaacaagcag ctgatcttga atctgatctc acttcagtaa tagattgggg ttcacagtgg 3900 cttgtgaatt tcaactctaa aaaaacccaa ttgcttactg caaacaacta ccgtagtatg 3960 gtcaacattc ctatattaat gaatggcaat cctcttactg agtcgtcctc tttacgtctt 4020 cttggattat cgcttactac tgacctttca tggaaaccat atatacaatc gattgctaaa 4080 ttagcttctg ctaaggttgc ttctctttat cgtgctcgcc atttccttac ttctgattcc 4140 attctctacc tctacaaatc tcttattcgt ccctgtatgg aatactgttg tcatatttgg 4200 gctggttctt ctaacgattc actttctctt cttgacaagg tccaaaaacg cattgtaaat 4260 gttgttggac ctggactatc tgctaagctt gaacctcttt cccatcgtcg taaagttgca 4320 tctctttctc ttttctacag atactatcat ggtcactgct caaaggagtt atcatctcta 4380 gttccatcaa ctaaaattca ttctcgtttt actcgtcatt cagctaagtc tcattcgttt 4440 actgtatctg tccctgcatg ctctaaaaac ttttattcat ctagtttttt tccccgcact 4500 tcaatccttt ggaactctct cccatcttca tgttttcctg actcctacaa ccttcaactt 4560 ttcaagtctt ctgtcaaccg tttccttgct ctataactct gttctttttt tttcctagta 4620 actcccaact taatagtggt tgcttgcagc cttgttggga gtgaatgcaa ataaaaaaaa 4680 aaaaaaaaaa aa 4692 // ID Crack-10_AAe repbase; DNA; INV; 4616 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-10_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4616 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1226-1226 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 730..1491 FT /product="Crack-10_AAe_1p" FT /translation="MTHISLKELPDLLSKALTEHANRLLANIEHSQQFLSD FT KLDDFGDQILKLKAEISKLKNENDCLRKDLALISSKTDTVSNVVEKQETAL FT DLRRRLELSSNAILLGIPRVPNENTRTLVTMTCNTLGIDEANSIVSCTRLP FT SSKTEISPIRIVFKDVHAKERLMEKKQKFGPLTASMISGIRWPNGWTNKIF FT IRDDLSPLAMEIFRELKTLQPLHKFRYVWPGRDGVIFVKFHEDSYPVKVRS FT RDDLRKLILSAQN" FT CDS 1540..4431 FT /product="Crack-10_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNSNNLAKNHRTNSLDAIFTPATLNGLNTLQINIRGI FT NRMEKLDSLCLFLQNLPVVIDILIIGETWIRSDRSKYYNIPGFKSTFSSRS FT SSSGGLAIFIRDGISFEVISNMEDNGLHHIEIVILETGIKVHGIYRPPGFE FT TDRFISILERVISSADLDTPCLIFGDMNLAINNTESRGVQKYLQLLASYNM FT VVTNTHSTRPISNNVLDHVVCPSDVSGRITNFTMDCELSDHCYILTHFETA FT TMKVSQKLTKSIVHHQQIDLHFRMFLEATDFSCLQPNERLLAVTDRFMQLK FT QSFTTQISVNVKVKRNVCPWLNLDIWKLGRISNNLFQRWKRNRQDEHVKDL FT LTHANKKLADAKRRAKSAYYQQLFSTNNPKQLWNRINNMLGNQLDSTKQQT FT LIVDDTEITKPEDLGDAFNNFFSSIGDSTASCLNSDGNINKFNTMDTCGQS FT IFLRPASPTEVSNIIRVLNSSKATGVDGFPVSALKKYSDVLSPIICTSFND FT SLTSGIYPDCLKKALVYPIFKGGDPKNPSNYRPISVLPAINKVFEKLLSAR FT LQSFSDSTALLNPKQFGFRHGSSTEVAVLELVDDIANCMDKKSAAGVVFLD FT LSKAFDTINHSILLKKLDAYGIRGVANDLLRSYLLNRQQKVRVSGISSEYR FT SSNCGVPQGSNLGPLLFLIYINDIAKLQIKGHPRLFADDTAIMYKSNSVTE FT LFADMSNDLRLVTAYLENNLLSLNLHKTKLMVFGARENHAAPHPTLTVNGV FT TIEEVSYYKYLGIYIDNKLRWDCHIRNTVDNCASLCGILRKLSKYVPQHVL FT LKIYFAFIHSRYQYGITTWGTTFNTYLKDIQVQQNRCVKAIFKLDYLHPTN FT QLYSTTEHNILPIQGLYIMRTATIMFKILNNLNLHHNWNFNAAAHHHQTRY FT AHLLQRTGFRTEVGRRRFQNLGPATYNRLPEEIKSARTIQQFRRNLILYIK FT SNIDQFIVR" XX SQ Sequence 4616 BP; 1434 A; 989 C; 864 G; 1327 T; 2 other; actgctgtga aatagctgtt gaactttgtg ctctgctttt tctgcaaaaa ttaaaccgaa 60 accaaaacag aattgctgac tgctgtatat taaagaaaac tcgctgttgc tgctttgtca 120 ttggacatat atgtacctcg atgtaagtag aaacatctga gaaaatcagg cttgaaaatt 180 gctgtgatac tgctgctgta aattctgcct gccatatcta ttgctgctgt tgctgacccg 240 ctgcctcaaa cctccaccct tgcgccattg tacaaacaag cttcatcaaa tctgctgttg 300 tattctaccg atcatatgct gctgttgctg taaaagtgct tctaatgtgc tgcaattaat 360 ctccaccatt aatcgctcat atctacttgt gactgtatcc aatctgcgag aataaaggat 420 cttttctgct ggttgttttc atttcgccca tcaccgccac ccgcccgcca tcgcctataa 480 tcaagtcctg ctgtcaatcc ttttccctat cggaggatct gctgtatgaa agatcaatac 540 caaacatcca tcaacacatc acgaccacca ccaccaggat aatccagcgc tgctcgttgc 600 tattggcgtt ccggctcagt gtgagtaata attagctata ttgctgatag ggcacgtcta 660 agccattcga tacaaccgca cgcacgtata catttagtat agcgccatct tttggccgac 720 actaaaacta tgactcatat cagtctgaaa gagttgccag acttgctttc gaaagctcta 780 accgaacatg ctaatcgtct gctagcgaat atcgagcaca gccaacaatt tttgtccgat 840 aagttggacg atttcggcga tcagattttg aagcttaagg cagaaattag caagctaaaa 900 aatgaaaatg attgtttgag gaaagactta gcgttaattt cgtcaaaaac agatacagtt 960 tcgaatgtgg tcgaaaaaca ggaaactgca ttagacctcc ggcgaagatt agaattgtcg 1020 tcaaacgcca ttctgttagg gatacctcgg gtgccaaacg agaataccag aactcttgtt 1080 accatgacat gcaatactct tggtatcgat gaggccaatt ctattgtttc gtgtactaga 1140 ttaccttcat ctaaaactga gatcagccca atcagaatcg tattcaaaga cgtgcatgct 1200 aaggaacgtc taatggaaaa gaagcaaaaa tttggtcccc tgactgcatc gatgatatcc 1260 ggaattcggt ggcccaatgg ttggaccaat aagattttca tcagagacga tctttcccca 1320 ctcgcgatgg aaatctttcg tgagctcaaa acgctacaac ctttgcataa attccgttac 1380 gtctggcctg gacgtgatgg cgtaatattc gtgaagtttc atgaggattc ctacccggtt 1440 aaagttcgat cacgtgatga cctgaggaaa cttattttaa gtgcccaaaa ttgaagctgc 1500 tgataacact aacagtttca tataaaaaat attcctacaa tgaattcgaa caaccttgca 1560 aaaaatcatc gcactaattc tttggatgca atttttacac ctgccactct caatggcctc 1620 aatacgttgc aaattaacat tcgcggaatc aaccgaatgg aaaagctgga ttctctttgc 1680 ctttttctgc aaaatcttcc cgttgtcatt gatatcctta taattggaga gacctggata 1740 agaagtgaca gaagcaagta ctataacatt cctggattta agagtacttt ttcttctcga 1800 agcagctcat ctggagggct tgctatcttc atcagagatg gaatcagctt cgaagtgata 1860 tccaacatgg aggataacgg attgcatcat attgaaattg ttattttgga gactggcatc 1920 aaggttcatg gaatctaccg tcctcctgga tttgaaaccg atagatttat ctctattctt 1980 gaacgtgtga tatcgtctgc agacctggat actccttgtt tgatattcgg agatatgaat 2040 cttgccataa ataacacaga atcacgtgga gttcaaaaat atttgcagct attggcttct 2100 tacaacatgg ttgtaactaa tactcacagt actcgcccaa taagtaacaa cgtgcttgat 2160 catgtggttt gtccttcwga tgtatctgga cgaataacaa atttcactat ggattgcgaa 2220 ctcagtgatc attgttatat tttgactcat tttgaaactg caactatgaa agtcagtcaa 2280 aaacttacca agtctatcgt acaccaccag caaatagacc tgcacttccg catgtttctg 2340 gaagccactg acttctcatg cttgcaacct aacgaacgac tgttggcagt tacggatcgt 2400 ttcatgcagc tgaagcaatc gtttactacg cagatatcgg taaatgtcaa agtcaagagg 2460 aatgtatgcc cttggcttaa tctagacatt tggaagttgg gtagaatctc caacaacctt 2520 tttcaaaggt ggaagagaaa tcgccaagat gagcacgtca aagatctgtt aactcatgca 2580 aataaaaaac tggcagatgc taaacgtcgt gccaaatctg catactacca acaacttttt 2640 tcgaccaaca atcccaagca gttatggaac cggattaaca atatgctagg aaatcaactt 2700 gattcaacta agcaacaaac gcttatagtg gatgatactg agattactaa gcccgaagat 2760 cttggagatg cgttcaacaa cttcttttca tctattggag atagtactgc tagctgcctt 2820 aattcagacg gtaatattaa caaattcaat actatggaca cttgcggtca atctatattc 2880 ctgcgacctg cctcaccaac cgaagtatcc aacatcatac gagtgttaaa cagctctaaa 2940 gcaactggag ttgatggatt cccagtatca gcgcttaaga aatacagtga tgtcctctcc 3000 ccaattatat gcaccagctt caacgatagc cttacttctg gaatctatcc cgactgtttg 3060 aaaaaggcgc tcgtttaccc tattttcaaa ggtggtgacc ctaaaaatcc cagcaactat 3120 cgacctatat ctgttcttcc tgctatcaac aaagtattcg agaagcttct gtctgcgcgt 3180 ctccaaagct tttcggatag tactgctcta ctcaatccca aacaatttgg attcagacat 3240 ggatcttcaa cagaagtagc tgttttggag ttagtcgacg atattgcgaa ttgtatggat 3300 aaaaaatcgg cagctggggt agtgttcttg gatctctcaa aggccttcga tacgattaac 3360 cattcgattc ttctaaaaaa actagacgca tatggcatac gtggcgttgc taatgatctg 3420 ttgcgaagtt atctgctgaa tcgtcaacaa aaagtaagag tctccgggat tagtagtgag 3480 taccgtagca gtaactgtgg agtgcctcag ggcagcaatc ttggacctct gctgtttctt 3540 atctacatca acgatattgc aaagctccaa attaaaggac accctagact gttcgccgat 3600 gatactgcaa tcatgtataa gagcaactcc gtaacagaac tatttgcaga catgtcaaac 3660 gatttgcgtc tggtaacggc gtatctagag aacaatttac tgtcattaaa tctgcacaaa 3720 acaaaattaa tggtctttgg agctagagag aaccatgctg cacctcatcc aacattgaca 3780 gtaaatggcg taacgattga agaagtttcg tattataaat acttgggcat ctacattgac 3840 aacaagctac gttgggattg ccatatacga aatactgttg ataactgcgc atcgttgtgt 3900 ggaattctaa gaaagctgtc gaaatatgtg ccacaacatg ttctactaaa aatatatttc 3960 gcttttatcc atagtcggta tcaatatggt ataacgactt ggggtacaac cttcaacaca 4020 taccttaaag acatccaagt tcagcaaaac agatgtgtta aagcaatatt caaattggat 4080 tacctgcacc caacaaatca gctttacagc accactgaac ataatattct gccaatacaa 4140 ggactgtaca tcatgcgaac agccacgata atgtttaaaa tcctaaacaa tctcaatctg 4200 catcataact ggaactttaa tgctgctgca catcaccatc agactcgata tgctcatttg 4260 ctgcagagaa ctggctttag aacggaagtt ggaagaagaa gatttcaaaa tttagggccg 4320 gcaacataca atcgattacc agaagaaatc aagagtgctc gaacgattca acaattcagg 4380 cgtaacctta tactttatat aaaatctaat attgatcaat tcattgttcg atagtattta 4440 tgtagaactt tctctaaact gaattcaata atgtaactcg actaatttaa atagctattt 4500 caataagctc taacgtccct tttaaggaac actagttcga tagggattgt gagtattatt 4560 cttattgctt tgtagmatga tcataaataa taaataaata atataataaa aaaaaa 4616 // ID Ginger1-4_HM repbase; DNA; INV; 5882 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.01, Created) DT 02-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5882 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 70-bp long. XX FH Key Location/Qualifiers FT CDS 374..2764 FT /product="Ginger1-4_HM_1p" FT /translation="MAKERKHFDFDNIEDYLRNKKYPSTISTHDYGIKSNF FT RRAAKRFEVKDGHLFYNKRMVIKDKELQMEVIRDVHRGIGDSEHSKAMASH FT RGKNTTYDNIAQRFFWYNIAADVSKYIRSCEQCQKQGDLKSPKVELKSIPV FT PSSVMEQVGVDICNLPVVDGYCHIIVLIDYFSKWSEAKPIKEKSAQTVSQF FT LYEIMCRHGCFKIQINDQGREFVNEVCKQLHELTGVEQRVTSAYHPQANGL FT VERQNRTIKNSLVKVLEGNPKMWPQIIEGILFAHRVSRHSSTNYSPFMLMY FT NREPILPIDVKHSLVKDESNKQEHREENIDVEQPFDFNFFDAVFSLTSKVR FT ATIMSEASENIKVAQKKQKRDYDSRHLSKSEIKVDDIVLLKNNKRFDRKGG FT KFSQKWLGPFTVVNISEKGVATLKNALGLTLKKTYNIVQLKTYIQGADDKL FT KPILVEGSAYFWNHAPDEIVEMILMYAVQQSENSFPGHKCETYASIKSTCS FT KWARLIERKGTTLLPKIYIDSWKPLGKPSDHNNESNTVSTQMLIKTFGKSS FT GLASQLSNCIGDKKWRSSWLILTPQKHSWYTIDRIFWKTKVCQPNQSSLQV FT SSLWLKNELYELTEADREILESKDSWLNDNLMDAGQKLICKALGSHETYQS FT VLNCQKKKSTYFPVSGDHLQILHDGSSHWLLAFASNGRVQVCDSLRTNLTS FT ISKKCLKSLFQPLVKNGKLEVTFLPVDKQIDSFNCGVFALGYASILLDGKS FT PLDVRFVVNEMRTHYIKCLTDAHLYPFPTLEKAVDVLCNKPKLFMI" XX SQ Sequence 5882 BP; 2045 A; 972 C; 929 G; 1936 T; 0 other; tgtagcggta aattaagact ccaggaaatt aggactttga agtcttgatt tcccgggaaa 60 tcaagacttt gaaggaaaat ttgacgggaa aatttgatgt tatgagtcat aatttcccgg 120 gaaattatga ttggagcggt aaattaagac tccaagaact tgcgattcaa aaagaaaatt 180 tacaaagtca aaatttacaa gtcgttctat tctgcagtaa aaaaaatttt ttttttaaaa 240 aaaactgtaa atagaattta gaatagaatc aaagttgatt cgaacgcaaa agcatgttta 300 aataatgcgc tataattgtt tcatacgtaa acaaataaat aaattttaca aataaacaca 360 agtgtgctgc ttcatggcga aggaaagaaa acattttgat tttgataaca tagaagatta 420 tcttaggaat aaaaaatacc cttctactat atctacacat gattatggga ttaaatcaaa 480 ttttcgaaga gcagcaaaac ggtttgaagt gaaagacgga catttatttt ataataaaag 540 aatggttatt aaagacaaag aactccaaat ggaagttatt agagacgtac accgaggtat 600 tggagattct gaacactcta aggcaatggc ctcacataga ggaaaaaaca ctacgtatga 660 caatattgcg caaagatttt tttggtataa tattgctgct gatgttagta agtatattag 720 gagctgcgaa caatgtcaaa aacaaggtga cctaaagtct ccaaaagtag aattgaagtc 780 tataccagta ccatcaagtg taatggaaca agttggagtg gacatttgta atcttccagt 840 agtcgatggg tattgtcata tcattgtctt gatcgattat ttttcaaaat ggtccgaggc 900 taaacctatc aaggaaaaat cagctcaaac tgtctctcaa tttttgtacg agattatgtg 960 tcggcatggg tgctttaaaa tccaaatcaa tgaccagggt cgggagtttg taaacgaagt 1020 atgtaaacaa ctacacgaac taactggagt agagcagaga gttacgtctg cttatcaccc 1080 tcaggcaaat ggattggtag aacgtcaaaa ccgaacaata aagaactctt tggtaaaggt 1140 tttggaaggt aatcctaaaa tgtggccgca aattattgag ggtattcttt ttgctcatcg 1200 cgttagtcga cattcttcta ctaactactc tccattcatg ctgatgtata atcgcgaacc 1260 aattttgcca attgatgtga agcatagcct tgtcaaagac gaaagcaata aacaagaaca 1320 ccgagaagaa aatatagacg tagaacaacc attcgatttc aacttttttg atgccgtctt 1380 ttcattaaca agtaaagtca gagcaacaat catgtctgaa gcaagcgaaa atataaaagt 1440 tgcgcagaaa aagcaaaaac gagattatga tagtagacat ttgtcaaagt cagaaattaa 1500 ggtagatgac attgtattat tgaaaaacaa caaacgattt gatcggaagg gtgggaagtt 1560 ttcacaaaaa tggctcggtc ctttcactgt agtgaatatc tctgaaaaag gagttgcaac 1620 tttaaaaaac gcattggggt taactcttaa aaaaacatac aacattgttc aacttaaaac 1680 ttatattcaa ggagcagacg acaaattaaa accaatatta gttgaaggat ctgcatattt 1740 ttggaatcat gcaccagacg aaattgttga gatgattttg atgtatgccg tgcaacaatc 1800 ggagaactca ttccctggac acaagtgtga gacctatgca agtatcaaat caacgtgtag 1860 taagtgggcc cgtttaattg aaagaaaagg tactacctta cttccaaaga tatatattga 1920 ttcgtggaag cctcttggaa agccctccga tcataataat gaaagtaata ctgtaagtac 1980 acaaatgtta ataaaaactt ttggaaaatc tagtggacta gctagccaac tatcaaattg 2040 cataggtgat aaaaagtggc gatcttcttg gcttatcttg acacctcaga aacactcttg 2100 gtataccata gatcgaatct tctggaaaac aaaggtatgt cagccaaatc aatcatcttt 2160 gcaagtttcg tctttatggc tcaagaatga actgtatgag ctcacagagg cagataggga 2220 aattttggaa agtaaagata gctggttgaa cgacaatctt atggacgcag ggcaaaagtt 2280 aatttgtaag gctcttggta gccatgaaac ttaccaatca gtattaaact gccagaaaaa 2340 gaagtccaca tatttcccag tttctggcga ccaccttcaa atattgcacg acggtagttc 2400 tcattggcta ttagcatttg cttcaaatgg cagggttcaa gtgtgcgata gcctgcgcac 2460 taacctaaca tcgatttcga aaaaatgttt aaaatccctc tttcaaccac ttgtgaagaa 2520 tggaaagctt gaagtaacat ttcttcccgt tgataagcaa attgatagtt tcaattgtgg 2580 tgtctttgct ctgggttatg caagtatatt gctggatgga aaatcccctc ttgatgttcg 2640 atttgttgtg aatgaaatgc gtactcacta tataaaatgt ttgacagacg ctcacttata 2700 tccttttcca acacttgaaa aggccgtcga tgttttatgc aacaaaccta aacttttcat 2760 gatttgattc aaaactcaaa attgactttt cacttgtatg tgattttata aatttgttta 2820 taaccattac aaagcaaatc ggtaaccaat actattttcg caatatttaa acatttttct 2880 atgtagcgtc tatcattcta ttaaaattac tctagtatat ttcttagagt tttattttaa 2940 gtatttacga aagaaaatat tgcgttaaat acaaaatatg aaaaatacct tagccgtcca 3000 tagaggacgc catccttttt taggcaattt cactcccccc aggcgatgac tacctattta 3060 aaaaataatg aaaaaactta tttgatctca ttccatattt tatggaggac ttctttttga 3120 caatcgtctt ttttataagt tggatgaagg ttattataag accttattag cgaattaaaa 3180 aaacaatttt tatccatctt gtgacatgca gggccgacga caagggaggg ggggaggtag 3240 ggccagggtg caagtgcccc tcctcactta ctttttattt attaaagttg acaattttta 3300 taaaaaatcg aggatttttt ttttaaataa ggtagtgtgc cccccccccc cccacacaca 3360 ctgcaaaaac cgtgtcgtcg gccctgatgg acattgaaag tatgacgttc aaataaaagt 3420 tgcagcagac aacactgata ttttacactt tagcaacatt tttgaaaaaa aatatatata 3480 tatatatata tatactctaa cgatatattc ataaaagcac gtttacatag ttttccatgt 3540 tatttaatat aaaaatagaa aaaaaatctt taaagtgaag tccttttctt gtaaaccccc 3600 tcactcccac tttgtctctt tttatccttc ttatcccaac cccctccctc cttttttgca 3660 atatgttctt tatggttgac cccttgagca attcatttat ctttttttaa agaaaaacaa 3720 cgaataaaaa aaagtaaaac ttcattccat taaacttatt catttctact aatttctata 3780 ttctaaatat tttatatttc tactaattat caaaagttta gttatttctc tatcatcaat 3840 tgaaaataaa atcaataaaa aaataaaatc aataaaaaaa aatttgattt catcaaacaa 3900 gatattattg gagtcttaat ttaccattat tttactttca tcaataattt atttgtgtcc 3960 atagttatac ttttaattct ctttcattaa atctacgttt actttttatt tcataaaact 4020 aattaaacca ttgaactgta aattgttgac tgatgatatg ggatctcgta agatgtcttt 4080 tgtaaaaatg ttatccttca aatagacttt ttatgtggac tatttttact tgaccacctt 4140 agctaaaaaa ataaaacttc catctatatc attattttta ataccttggt gataatttaa 4200 tgcatttata gtctcttcac aaaatacttt taacatgtat caaccttttg tctaacaatc 4260 agtcttgggt tagcagcaac ataagtcagc ttagacattt taaccagctg actaactcaa 4320 tttcagcaga cttttcaatt atctaattgt ttcgtataca ctttaataca tgcataaaat 4380 caaaaagtaa aaaaaatgcc atcagttgtt aaccatagag ataagcaatc aaatcgtttt 4440 aaaaaaatgc ttgatttact ctattataat cattaattaa agcgataact tttccatttg 4500 tcttcttaat ttggttgatt aggatattaa tttgattgta tacgaaatcc aaattattta 4560 ctggtaacat tcttactata aattttgttc cctcataaaa acatacaacc atgaaagcca 4620 atatagttgt tgcaaaactt gattcattgt ttactgcttt acggttcaca tcaaaaaagc 4680 ttcaaaagaa ggtgatctgg ttaaaatatt tgctttacca aacaaatgtc cccgtgataa 4740 gttagcatag gtttaacata aacttcatct agtaataaca tacagttttt ttgactattt 4800 tgttgagttt taaaaaaaca ctacaaataa aagtgtgaaa cttttgaagt tataagtgtc 4860 aggaagataa aaatcatttc ttaattgatc atataaacta tgtgaccgtg caaaatgttc 4920 taaagctcaa acaataactt caggtgaata tttttgctaa aaaatgttac ttgagttcaa 4980 cttatgcatt caacctgctg acgtaaaatt gttttctttt ctgacttctt aaaagaaatc 5040 aaatatccaa atgatgtgac caacttttga acttataaat acaattttta actaatgaag 5100 taatattgaa tacaactcca gcatgatagg tttcatatga taaattatta aatactatta 5160 gacacaattt actaatacca gtattgcttt aatgctcttt tgattgcaaa caacatctat 5220 tcaaaaaata aataaataga taagctaatg aatcattaag tcttccttga gataaataac 5280 tgcaaaatat tctaaatgaa tcaacttcgt caaactctac gaattttgaa agttcatctt 5340 cttttactga tctgagagac tctctaagct ctctaagaga gcttttacct attgttgttg 5400 ttctttttct agtggaagag atggcactaa actctaaatt aattcaaata cactaggtgg 5460 atctcgagga ttttcttttc cttttacttt aataactagg aagtttgatg gaaaatgttt 5520 tgcacaaact acagtgtaat gtttatatgg tatattgtct attgggaaaa cgttaatcca 5580 tctctggcgc tctttaagat ctcttagaaa ttctagaagt atttgtctta tcttttactg 5640 agtcatagtt atagttacaa ccagttttct gtgcattttt aacataattt tctaagcgtt 5700 gcgcgatgtt atttcagttc accctgtagc gaaatgatct ttcaatttca agagattgtt 5760 ggagtcctaa tttaccgttc cactcttaat ttcctggtaa atctaaactc gaaaggttct 5820 aatttaccgg gaaattaaga ctttcgaagt cataatttcc tgagtcctaa tttaccgcga 5880 ca 5882 // ID hATm-58_HM repbase; DNA; INV; 4172 BP. XX AC . XX DT 16-SEP-2009 (Rel. 14.09, Created) DT 16-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE hAT-type DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; hATm-58_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4172 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1924-1924 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(835..1311,1619..3589) FT /product="hATm-58_HM_1p" FT /translation="MDILQHFKYCQSNLPNGITKKLDILYCPPKSNLKVDR FT CQLVECECIMAKIKHPWIKAGFPVISDVNIRKSLKNLEQKYSNLLKNVKRN FT SKQDIKNQIGFENFVKRIFWIGIPELKDFIQSDKLRSNESKIKDLQFLEDQ FT EGPRLYHLGPLDTKYTKKVILINLFLYKVKSLNIKKCKNAMKNGSPSKILK FT FYEEDVSSNDELSSDNFSDSDFEPGEWIKRKNKPSSTKIRANIPKDIYSGN FT VALMATVNNISPMVLQRITTAVVQAAGVDMNAIKSSLSTAARRMKRENKEI FT SNIAKEDIKTKVRLSTFPCIIHFDGKTIFETIKGKKLKCERLAVLVNINGE FT SHLLGVPSLPSSSGDDQYKGIMQLLDEYGIKSNVGGLCFDTTATNTGITKG FT TVNRITYDLGKYILQLACRHHVTELRISHFQKHVTHEEKVGPENLFFKRLR FT NKFDNPEFHYNPENILRFDWSLKAGTVVEKAATDALEYCKKYIKKKNIARE FT DRKELAELVIIYLSKSGIVKIKKPGAVHHARFLSSALYYTKLHLLSKQLKL FT HLLSKQLDFLIKEEDVKAQVEQIVEFICCFYAPWYLQSHNPIKAPYLDITA FT LHQMTLYKDVCQVKEAVDKVIDSILKHSWYLDSTLIPLSLLDYKVPDKDKT FT LIAKSILSYKMPSLKASDYKIENKPKVDIKEIINLDSTTASNPPNLAVLVN FT EFSYFIFAVNGFTEERIRDWLSLPPSFWHTQSYYNQFLAYAKTLIIVNDHA FT ERNVGMMQEFIHRYISEEEKQMRLVTIDKVRYAMKKPEKSSSKMTKRNMEI FT GLNNINNFKKFKSD" XX SQ Sequence 4172 BP; 1549 A; 615 C; 661 G; 1346 T; 1 other; taatataaac caatttggcc gaattaaaaa aaagggccta cactacatat attttgcatt 60 taaaatcata cgtatatttg tttttgtttt attaccgagc atttattatt caaattatcc 120 ataaattcgc aaaatttatg acttttgttg tgatattaag aaagcatttg gagtatacaa 180 atacattttt tcttatataa taaaaatgtt ttttggttaa aaatattgta cantgcttgc 240 tactttaatt agttacgtat ttgaaggtaa gtttccttga cattcttcca tcccccttcg 300 cctgccattt cccaaaataa aataaatagc aaaccattac acttcatgtt cttttactaa 360 agtttcatat gaatttattt ttaaataaaa aacaataata tgaagccact acattgattt 420 aaaatttgaa atttacatat ttgactttga attttaaaat tcaaaataaa aatttaactg 480 aacaagtatt aatatgagtt atcagaaaca caagattttg cgcaataaca gtttgagatc 540 ttctaaaaag atttttcata ttatcgataa agctacgcct gctatttcag gtatgtgttg 600 catagaaata aaaataatgt acagttatcg attcggtttt atgatatttt gtatttacct 660 gaaaagaagt cacggtcact aaaacacatt ttcatctcaa tatcgacacc aatccatcac 720 actattatac tctgtaagat aaataatgcg cgcttattta tactatttat atagtattgg 780 taatactatt ttaataaaaa cctatatggt ttaggaacgc aacttccaac gggtatggat 840 atcttgcaac actttaaata ttgtcaatca aatttgccaa atggaattac aaaaaagttg 900 gatattctat attgtcctcc aaagtcaaac ttaaaggttg ataggtgtca actagttgaa 960 tgtgaatgta tcatggcgaa aataaaacat ccatggatta aagcagggtt tccagttatt 1020 agcgacgtaa atattcgaaa aagtttaaaa aatcttgagc aaaaatattc taatttatta 1080 aagaatgtca aaagaaactc aaaacaagat attaaaaatc aaataggatt tgaaaatttt 1140 gttaaacgta ttttttggat aggaattcca gaactaaaag attttattca aagtgacaag 1200 ctaagatcca atgaatctaa aatcaaagat ttacaattct tagaagatca agaaggtcca 1260 agattatatc atttgggtcc tctagacact aaatatacta agaaagtaat ttgattatgt 1320 tttggttgtg agaatttgca tagtgtaagt tttatttttt taaatgttta ggcagttttg 1380 cgtcagatct taattaaatg atgctctcaa ggtagatgaa ctattgagtt actttactat 1440 atggagtgaa gcattagtat ataaaatatt gatttaacta actttatttt gtttttggac 1500 ttacactatg ttgattctca tccacacatt tagtttttaa ataatatact ctttatagtt 1560 tgataaattt attaaatatg ctctaacttt gtgtttttgt cactctaaca acttataact 1620 aataaattta tttttatata aggtaaaatc attaaatata aagaagtgca aaaatgcaat 1680 gaaaaatgga tccccgagta aaatcttaaa attttacgaa gaagacgtta gcagtaatga 1740 tgaactaagt tcagataact tttctgattc tgacttcgag ccaggcgaat ggataaaaag 1800 aaaaaataaa ccaagcagta caaaaattcg ggccaacatt cctaaagata tttattccgg 1860 aaatgttgct ttaatggcaa ctgttaataa catctctcct atggttctgc agagaattac 1920 aacggcggtt gttcaagctg caggagtaga catgaacgcc atcaaatcaa gtctctcaac 1980 tgcagctaga agaatgaaaa gagaaaataa agaaatttca aatattgcta aagaagatat 2040 taaaacaaaa gtacgtttat ctacattccc atgtataatt cattttgatg gaaaaactat 2100 atttgagaca attaaaggaa agaaactaaa atgcgagcga ctagctgtac tagtaaatat 2160 taatggcgaa tcacacttgt tgggtgtgcc ttctttacca tcctcttcag gagacgatca 2220 atataaagga ataatgcagt tgctagatga atatggtatt aaatcaaatg ttggtggatt 2280 gtgctttgac acaacagcaa caaatactgg cattacgaaa gggactgtta atagaattac 2340 ctatgattta ggtaaataca ttctccagct tgcttgcagg caccatgtta cagagcttag 2400 aataagtcat tttcagaaac atgttactca tgaagaaaag gtaggaccag aaaatctttt 2460 ttttaaaaga ttaagaaata agtttgacaa tccagagttt cattacaatc ctgaaaacat 2520 attaaggttt gattggtcat taaaggctgg tactgtagtc gaaaaagcag caacagatgc 2580 actcgaatat tgcaaaaaat acattaaaaa gaaaaatatt gctagggaag acagaaaaga 2640 actagcagag cttgtaatta tttatttatc gaaatcggga attgttaaaa taaaaaaacc 2700 aggtgcagtt catcatgcaa gatttcttag ctcagctctt tattatacaa agttacatct 2760 gctatctaaa caactgaagt tacatctgct atctaaacaa ctagattttt tgataaaaga 2820 ggaagacgta aaagctcaag tagaacaaat tgtggagttt atttgttgtt tttatgctcc 2880 atggtacttg caaagccaca atcctattaa agccccatac cttgatatta cagctctaca 2940 tcaaatgact ctgtataaag atgtttgcca agtaaaagaa gctgtagata aagtcattga 3000 ctcaatatta aaacactctt ggtacttaga ctcaactttg atacctcttt cgcttctaga 3060 ttataaagtt cctgataaag ataaaacact aatcgccaaa tcaattttat cttacaaaat 3120 gccatcttta aaggcatctg attacaaaat tgaaaataag ccgaaagtgg atatcaaaga 3180 gataatcaac ctagatagca caacggctag taatccacca aatttggcgg tgttagttaa 3240 cgagttttcc tactttattt ttgcagtaaa tgggtttaca gaagagagaa tacgcgattg 3300 gttgtcactt ccaccatctt tctggcacac tcaatcatat tacaaccagt ttttagccta 3360 tgcaaaaact ttgattatag taaacgatca tgcagaaaga aatgtaggca tgatgcagga 3420 gtttatccat agatacatta gtgaagaaga aaaacaaatg agactggtaa ccatagacaa 3480 agttcgttat gccatgaaaa aacctgaaaa aagctccagc aagatgacaa aaaggaatat 3540 ggaaataggt ttaaataaca ttaataattt caagaaattc aaaagtgatt agaaaacttt 3600 tttttttctt ttgtaggatt ttttgtaaga taaatagaaa atctagtttt tgtcataatt 3660 ggcgtgagta acttaaaaat tggttttggc gtaaaaactt gaatcgcttt ttaattagta 3720 aaaaaatgag gagggacccg gggtaggtaa gggtgaggga aaggtaaggc aaatacatta 3780 ctaattgaag taacaagcaa ttaacaatat tatcaaccaa aaaatatttt attacacatg 3840 gccgtcgaca ccggggaagg agggggtagg gggcttgccc cttccccttt tttttcgagg 3900 accttttttt atatataaaa ttgaaaaatg cccccccccc ccttactttg aaggccggtt 3960 caatggtcct gttatattag aaaaaatgta tttgtatact ccaaatgctt tcttagtatc 4020 acaacaaact cactaatttt gcgaatttat ggataatttg aataaaaaat gctcggtaat 4080 aaaaccaaaa caaaaatacg tatgatttta aatgcaaaat atatgtagtc taggcccttt 4140 ttttttaaat tcagccaact tggtttatat ta 4172 // ID Gypsy-10_AA-LTR repbase; DNA; INV; 192 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_AA_; KW Gypsy-10_AA-I; Gypsy-10_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-192 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 990-990 (2011). XX DR [2] (Consensus) XX SQ Sequence 192 BP; 56 A; 34 C; 42 G; 60 T; 0 other; tggtaatgta gtaaattata ttgttattca atagatttgt agacaatgaa cattgtactg 60 tgtaaacttc cgctagcttt agtatataag gcgggagcac cccgatgggg ctctcttttg 120 tatcccgacc gtagactaat aaagaagtct aagtgaaccc cgtgcgtttg tgagtatagt 180 ccgaacatat ca 192 // ID MARINER_SI repbase; DNA; INV; 1381 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Red fire ant mariner transposon sequence - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINER_SI; KW Inverted repeat; mariner transposase; mariner transposon. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RA Krieger J.M. and Ross G.K.; RT "Molecular evolutionary analyses of mariners and other RT transposable elements in fire ants (Hymenoptera: Formicidae)."; RL Insect Mol. Biol 12(2), 155-165 (2003). XX RN [2] RP 1-1381 RA Gentles A. and Jurka J.; RT "Fire ant mariner sequence."; RL Direct Submission to Repbase Update (OCT-2004). XX DR [2] (Consensus) XX CC Average similarity to consensus 98%. XX SQ Sequence 1381 BP; 434 A; 261 C; 288 G; 397 T; 1 other; caattatgtc aaatccaact caatatagaa ctgcaaagaa ttgatatacg tttccgtaac 60 tagcgaaatg ttatatcgtt atattaggtg ttaaacttaa ttcctgccgc tcgctactag 120 aggatgctgg cttcaacgct ggcgagtttt gactatgaag tcatatgtct tttgtagttt 180 agacatcgct gaaagtgatt taattctcat agattagatt tctcgacgcc aaagacattt 240 ttcaatcgac tgaaaatgaa tcgcgataag ctgtttttcc gacgtgtgtt tcttcactac 300 ttcgacctca agaaaaccgc agctgaagcg catcgcttat tatccaaagt gtatggtaat 360 aaaactttat tggaaaaaac atgtagagtt tggtttgaac gcttcaaaaa cggtaatttt 420 gatgtgagag acaaagaacg tccaggacag ccgaaaaaat ttgaagatgt caagctgcaa 480 gaattgctcg atgagaatcc agcccaaacg cttttagagt tgtcggaaca acttaaatgt 540 tactccaata gtcgtctcaa aacgcttgcc tgccatagga aagattcata agaaagggaa 600 atgggctacc acatgagttg tcagaaaatg ccattttgaa tcgtttgact attgcaattt 660 ctttgtttgt caggcaaaga aaaaagagtt ttttgtagcg cttcgtgact ggcgatgaaa 720 aatggattta ttttgataat cccaagcgga aaaaatcatg agtggacccc ggccaaccrt 780 ctgtttcaac gccgaagaga aatattcacg agcataagac cttgctatgc gtttggtggg 840 accagaaagg tgtgctgtat tacaaactct tgggtccaaa caaaactgtt acagctgatt 900 gttaccatca gcaattatac caattgagtg acgtattgat gcaaaaaaat catctgtagt 960 caacaatcga tgcaaagtta ttttgttgca tgataacgct cgaccgcata tcgcgaaaag 1020 cgtgaagcag gtacttgtga gcttgaatgg gaagttctgc cgcacccagc ctactcttca 1080 gacttggcgt cggattacca tctcttccga tcgatgcaac ttacggacgc acacttctcc 1140 ggttacaaag aagttcaaaa atgggtggat gaatggttcc ctcgaaagac accgcgttct 1200 accgtcgtgg gattgccctg ttgccggaga aatggggaaa aagtaataga aaatgaagaa 1260 aattactttg attaaggtat tcattcatct ttccttcgaa acaaatcgat tttatagacg 1320 aaaaaacggc aggaattaag ttaccttcaa tttagctttt atgtatttgt attgttcaaa 1380 a 1381 // ID Gypsy-32_AA-I repbase; DNA; INV; 6824 BP. XX AC supercont1.22; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_AA_; KW Gypsy-32_AA-LTR; Gypsy-32_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6824 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.22; Positions 633237 640060. XX CC Positions [5139-5603] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 425..2902 FT /product="Gypsy-32_AA-I_2p" FT /translation="MTMDFEKAYLEDMVTTFLTEEELKFELEIRNLIDNRD FT RSITVNRRKLKNALREESEGVQRVYLYHRRPREELNICQRRLERVRSEIVL FT DDVLKDTSKTGLLHLYNRLKLFKKTYATRDYINEANWIFSNVVQMYLANFA FT ERVFLPTPPGSPNLVQLEGAASHEAQAATAGDFISLGHPVATSIEISLPAS FT IFQSQSVQTVADLFSSVSINPASDITSRAGDDGQSIVSATLGAVPRVRNRD FT ESVSISMSFPTVVTSSSTFTSVSSLGSMPPIWSTPRYRVSTSVQQSRQVRF FT DSARSFADLPVSYVASEPCSSVASVIPTSVGTGDAVYSQSRSFVHFPEATA FT NLDRNRHLPQSVRYTDGPHPTSHWSDPWQVTSNPFSNYRNSLNFPPPNPNI FT PFDFPVSVQGGNPVSYSHFGSFVSQGNYVPINSYASQHPFEMAPGNATPAR FT PPTQPNVIGGSRIPTSDWLGGSFRAPTAPAPSINPFEDFDFDPLQNRVKTV FT PVVKWPMKYAGEDRGMGLNDFLWEVSDWTKSEQISENELLRSFGNLLTGRA FT KMWFTSNKHRFATYSELIENLKLTFRHPDLDHFVLMDIYQKRQQKNETFLE FT FFLDVEKKFKSLTVQIPEIEIVQAVKRNLRPKYKRALIGRELHDLYSLQIA FT GQEIDATNTYLFVKPQAPSQSHAVQVAENRGGTNSNQSRDNRPKFGGKWQA FT NNPKFIPRSNQNQVGPGGKQMNPKEPDSGPQQPKKNGKPKPDEKPCPPSDE FT SEASHAEGKIQDFVPLNDKFICFNCRSQEQLTYQCTRPYKVHCQVCGFKGY FT PTHRCPFCAKNSARRRESGQPKSN" FT CDS 2779..4803 FT /product="Gypsy-32_AA-I_1p" FT /translation="MYSALQGSLSGLRLQRVSHSPMPLLCKKLCAPEGVRP FT TEKQLSDVTDILFALGYEPRLNSVNLSKSDINQTSVLSVLTDNRPFIDVSI FT FGRTIKALLDSGSQKTLLSNTTSPIWRVNNTKVFQSNLNLTSASGDPLQVS FT GRVYLPFTVQGKSRVIEATIVDDLPVECIAGIDFFDAFDVHISMDNAYLFQ FT INTEECTTSSPELVELSSEQDREINRVKRLFKPAMSDQLEVTSLVQHTIEL FT KDEFKSAAPIRLTPFPYSPAIHRALNEEIDRLLALGIIEESNSDWALNAVP FT IKKPNGSIRLCLDARKLNARTKRDAYPLAHVGRILGRLGKTRYLSTIDLKD FT AFLQIPLSPESKPLTAFCVQGRGMFQYTRLPFGLTNSPATLSRLMDKILGA FT GAVEPQIFVYLDDIIVASETFEEHIRLLEELASRLRKANLSINIQKSQFCH FT SEVPFLGYLLSNEGLKPDPSKVQAILDFEAPKTVRQIRRFLGMVNYYRRFI FT GDFSTITAPISDLLAGKPKVVRWTEAAEQAFRTIKERLITAPILSNPDFDK FT EFTVQTDASDRAVAGVLTQLQDGSERVISFFSQKLNSAQQNYSATEKEALA FT ALLAIDKFRGYIEGSHFTLITDASALQYIRNNKWRPSSRLSRWSLDLQHLD FT MTIVHRRGSDNIVPDALSTPRLPTRN" XX SQ Sequence 6824 BP; 1972 A; 1522 C; 1535 G; 1795 T; 0 other; ttctggcgat ccaacgaaaa aacattgctt gtgagagtta ttttttttta ttggtcgttg 60 ggggagttta gaggtgttcg actggagaag ataaaaaaaa tcaactctcc aaccgaaaag 120 ttaaattggg ttcgcacagt ttctttttcg ttgcgagtgc tacgttccac acactgctga 180 tagatcaata ggcattgttc cagatctggc gcgtgctcag ttagatagcc aaatttgttt 240 ttctgcacgt tgcggattgt atcactttag aataaccaat tttgtttttg caagccgagc 300 caaaattcct aggttttttt caattccgtt ttgcatgcct tttgcatgtg ttttttttgt 360 tgtagataga gatacgcaat cgagagtcca cataataaat tttgcttgag aatacggcag 420 agagatgacg atggattttg aaaaagccta cctagaggat atggtaacta cattcctcac 480 tgaagaagag ctcaaatttg agctggaaat cagaaacttg attgataata gagatcgctc 540 gatcacggtc aatcgtcgaa aattgaaaaa tgcgttgcgt gaggaatcag agggggtaca 600 gcgtgtatat ttgtaccacc gtagaccgag ggaagaattg aatatatgcc aacgcagatt 660 agagagggtg cgatccgaaa tcgtactaga tgatgtcttg aaagacacat ctaagacagg 720 attactgcat ctgtacaatc gtcttaaatt gtttaaaaaa acgtacgcta cccgtgacta 780 cataaacgag gcaaactgga ttttttcaaa tgtagtccaa atgtatttgg ccaatttcgc 840 cgagcgagta tttttaccaa ctccgcctgg ctctcccaat ttggtgcaat tagagggtgc 900 tgccagtcat gaggctcaag cggctacagc aggtgatttc ataagcttag ggcatccagt 960 tgctacgagc attgaaattt cgttaccggc gagtatcttt caaagtcaat cggtgcaaac 1020 ggtagctgat ctgttttcca gtgtatcgat taatccagct agtgatataa catcgcgtgc 1080 gggtgatgat ggacaatcga tagtgtccgc aaccctagga gcagttccgc gagtgcgtaa 1140 ccgtgatgag tcagtgtcga tatcaatgag ttttcccacg gtagtgacgt catcgagtac 1200 gttcaccagt gtttcttcat taggatcgat gcctcccatc tggtccactc cgcgataccg 1260 agtttctacg agtgtgcagc aatctcgaca agtgcgtttc gattcggcgc gatcgtttgc 1320 agatttgcca gtgagttatg ttgcatcaga accgtgttca agtgtagcgt cggtgatacc 1380 aacctctgta ggaactggtg atgcagtgta tagccaatca aggagttttg ttcactttcc 1440 agaggcaaca gcaaatctcg acaggaatcg ccacttaccc caaagtgttc gctatactga 1500 cggacctcat ccaacgtctc attggtcaga tccgtggcag gtcacttcga atcctttctc 1560 aaattatcga aattcattga actttccacc accaaatccg aatattcctt tcgattttcc 1620 ggtttccgtt caaggaggaa atccggtgag ttattcacat ttcggctctt ttgtgtcgca 1680 agggaattat gtgccaatta attcttatgc ttcacaacac ccctttgaaa tggctcctgg 1740 aaatgccacc cctgcgaggc ctcccactca gccaaatgta attgggggaa gcagaatacc 1800 gacatcggac tggctaggag ggagctttcg agcaccgaca gctccagctc ctagcataaa 1860 tccattcgag gattttgatt tcgatcctct ccaaaatcgg gtgaaaactg tcccagtggt 1920 aaagtggcca atgaaatacg ccggtgagga tcgaggtatg ggtttgaatg acttcttatg 1980 ggaagtgagc gactggacca aatctgaaca gatttccgaa aatgaacttc tacgttcatt 2040 cggcaatttg ttgactggcc gagctaaaat gtggtttacc agcaataaac acaggtttgc 2100 cacatactcg gaattgatag aaaacttgaa gctgaccttt agacatcctg atctagatca 2160 ctttgtgttg atggatatct atcaaaaaag gcagcaaaag aacgagacct tccttgaatt 2220 tttcctcgac gttgagaaaa aattcaaaag tttaacggta cagattccgg aaatagagat 2280 tgtgcaagcc gtgaaaagga acctacgtcc gaaatacaag cgggctttga ttggaaggga 2340 gctgcatgat ttgtactcac tacaaatcgc aggacaagag atagacgcca ccaatactta 2400 tctatttgtc aagccccaag caccctcaca aagccatgct gtccaagtag ccgagaaccg 2460 tgggggaact aacagcaacc aatctcgaga caaccgaccg aagtttggag gaaaatggca 2520 agccaacaac cccaaattta ttcctaggtc caatcagaat caggtgggcc cgggaggcaa 2580 gcagatgaat ccaaaagaac cggattcagg gcctcagcaa ccgaagaaaa atggtaaacc 2640 taagccggat gagaaaccct gtccaccttc tgacgaaagt gaagcttcac acgcggaggg 2700 aaaaatacag gattttgttc ccctaaatga taagtttatc tgcttcaact gtcgtagtca 2760 ggagcaatta acctatcaat gtactcggcc ttacaaggtt cactgtcagg tttgcggctt 2820 caaagggtat cccactcacc gatgcccctt ttgtgcaaaa aactctgcgc gccggaggga 2880 gtccggccaa ccgaaaagca attaagcgat gttacagata ttttgttcgc acttggctac 2940 gaaccaaggt tgaatagcgt caacttatct aaatccgata tcaatcaaac ttccgtctta 3000 tccgttctca cggataatag acctttcatt gatgtctcca ttttcggtcg gacaattaaa 3060 gccttactgg atagtggcag tcaaaaaacc ttgctttcga acaccacctc tcctatctgg 3120 agagtgaaca acacgaaagt tttccaatcc aatttaaatc ttaccagcgc atctggtgat 3180 ccattacagg tgtctggtag agtatacctt cccttcacag tacagggaaa atcaagggtc 3240 atagaagcaa cgatcgtcga cgatctacca gtcgaatgta tcgcggggat cgactttttc 3300 gatgctttcg atgtacacat ttccatggat aatgcctatc tgttccaaat aaacacagaa 3360 gagtgtacaa catcctctcc tgaactcgtt gagctcagct ccgagcagga tcgagagata 3420 aaccgggtta agagactctt caagccagct atgtccgacc agctagaggt aacctcactg 3480 gtccaacaca caatagaact gaaagacgaa ttcaagagcg ccgctccaat tcgtctaaca 3540 cccttccctt attcacccgc cattcataga gcgttgaatg aagagatcga tcggcttctt 3600 gctctcggta ttattgaaga gtcgaattcc gattgggccc tgaatgcggt accgatcaag 3660 aaaccgaacg gttcgattcg cttatgcctt gacgcgcgta agttgaacgc ccgaacgaaa 3720 cgtgatgcgt atcctctcgc gcacgtaggg agaatattgg gtcggctggg gaaaacgcgt 3780 tacctgagca ccattgacct caaagacgca ttccttcaaa ttcctcttag cccagagtca 3840 aaaccgttga ccgcattttg cgttcaggga cgaggtatgt tccaatacac tcgtctaccc 3900 tttggcctaa ccaacagtcc tgcgactctt tcaaggctca tggacaagat cttgggagcg 3960 ggtgcagtgg agcctcaaat tttcgtctat ttggacgata taatcgtcgc tagcgaaacc 4020 ttcgaagagc acatcaggct cctcgaagaa ttagctagta ggttacggaa ggcgaattta 4080 tctataaata tacagaagtc ccagttttgt cactcagaag taccgttttt agggtacctg 4140 ttgtcaaacg aaggtctcaa gcccgaccca tccaaggttc aggccattct agacttcgaa 4200 gcccctaaaa ctgttcgaca aatacgtcgt tttttaggga tggtaaatta ttaccgtcgc 4260 ttcatcggcg acttcagtac catcactgcg ccgatttcag accttctggc cgggaaaccg 4320 aaggttgtaa gatggacaga agcggctgag caagccttcc gaaccataaa agaacgatta 4380 ataacagccc ccattctctc gaaccctgac ttcgacaagg agttcactgt tcaaaccgat 4440 gccagcgatc gcgcagtggc tggagttttg actcagctac aagacgggtc ggagagagtt 4500 atcagctttt tctcacaaaa gctaaattcc gctcagcaaa actattccgc tacggaaaaa 4560 gaagcgctgg cggctcttct ggcgatcgac aaatttcgtg gatatattga aggatcacat 4620 ttcacgttga tcactgacgc gtcagcgctt caatatatcc gaaacaataa atggcgtcca 4680 tcatcaaggt tgagcagatg gagcttagac cttcaacacc tagatatgac aatagtacat 4740 cggcgaggat cagataatat agtgcctgat gcactatcta ctccaagact tcctacgagg 4800 aactagtgca aaacgttcaa gaagatcccg aacaatattc cgatttccgg ttcgaggata 4860 accagctgtg gaaatatgtc gctgtcgatg atgaaccatt cgacgtgaga tttgagtgga 4920 agctggttcc tcccccagaa aatcggaaca aaattatcga gaaggagcac ctggatagtt 4980 tccatctcgg agtagaaaaa actttgtccc ggttgaggct acgatactat tggccatacc 5040 tagcatccga caccaggaaa tggatccaaa agtgtgcagt ttgcaaggag tgcaaaccgg 5100 catacgtacc aactgtaccc gtcatgggga aacagaagtt ggccgaccat ccgtggcaaa 5160 tcattgccat ggattacgtg ggaccactac ccaaaagcag aaccggatat atgcacattc 5220 tcgtaatcca ggacctgttt agtaagtggt gtcagataca tccaatgcgc cggatagaat 5280 ccgggtctct atgcaagact ttgcgagagg ggtggttttt gcggaattcg atctcggaga 5340 tagtgctgac cgacaatgcc tctacattcc tctccaaaga gtttgaagcc ctgctgactc 5400 aatttgaaat taaacactgg accacggcca gacatcatag ccagggaaac ccggtggaac 5460 gtttgaatgc ggccgttaga acgtactgta agcaggatca gcgcggatgg gacgttaaaa 5520 ttccggacat cgaacatgtt ttcaataata cggtgcacgc agcaacagga tttacacctt 5580 ttttcatcac gcacaatcat gagatcaccc tgtctggcga tgatcaccaa cggatgcgta 5640 gaaaggagaa ttattccgat gaattgcgtg ccgcttatca aaaacaaatc agcggagaaa 5700 tctatgatct agtcaagaaa aatcttctga aggcgtacga aacaaatgcg aatcgttaca 5760 atttgcgtaa acgatctcgc ccagatgatt tcaaaccagg acagtcaatt tacaggcgaa 5820 atttcaaagc ctccaatgcc ggagagtact ataatgcgaa gctcgcccct atgtatttgc 5880 cctgtcgcgt agtcgttaag cacggaagta gttcgtacga gttagaagac gaggatggca 5940 aaaacatagg cgtgtggcct gccgagcatc tcaaaccata atcacgtggc tctcttgcgt 6000 attctcacgt ttccgtttgc gtatcactgt gtaaatttct ctctcagtgc acggagaggg 6060 taaattgcat gatgtggttg tgttgactct atcttgtagc gtccagtgtc cctctcacgt 6120 ggggtagggt gaataggagc aagtaggaga atcactttca ttgagtagaa tactacacca 6180 ttcatgaagc attgctatgc attagaaaac aggccggccg ataaaaaagt atctcaaaat 6240 tgagatgaat aaaggtaaat agaaactagg tgtatctttc aaaaggaaag atgtttaact 6300 agggtaacct attgaccttc agtagtgttc agtggaatga atagcaatta tcttcgaaag 6360 actatctgtg ggaaccaatc gcgaagaaaa aaaaatgtcg tagtccacct aacaaactcc 6420 gtaagaaata gttgaatggg agtcatgtgg cttacacgag tcgtttagat gggacgaaaa 6480 taattaatta cccagcacag ctctagtgaa acacagagcg taacgagtgt tcgacgagtc 6540 cgagaaaact tttataggac ttaccgtgat ggcgtgccag atcacgggag tccgcccaat 6600 tgcgcgaacc cgactcgagc acttaataaa tcctaaaatt tgttctttaa aatttgttta 6660 atttgtagta tttattttct tgtttttttt tctgtaaatt ttcactttac tttttcttta 6720 tatttattct atatgattat tttcttgtac ctaaatattc gttacttgtt tattgttaaa 6780 aaaaatatat ttcatacatt ttttttattg taccccgggg gaaa 6824 // ID BEL-4_CQ-LTR repbase; DNA; INV; 774 BP. XX AC AAWU01030743; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_CQ_; KW BEL-4_CQ-I; BEL-4_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-774 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 162-162 (2011). XX DR Genome; AAWU01030743; Positions 14223 13450. XX SQ Sequence 774 BP; 287 A; 146 C; 185 G; 156 T; 0 other; tgttgaggcc gctcactgcc gcgacggggt cgacaccatg ctgctgtgtg ggaacgaacg 60 ccgcgtcgac atctcacgcg cgtgagacgt gaggaagaag gaagaaacaa tgacgaaaca 120 cacgagcaga gggacgacaa atcaaaacaa agtcacaacc caaatttatg agcacaagtt 180 ggtcagggaa aagaaagtgg agtgattttc ggataattag gaggaagttt gaggattagg 240 attttgattt agaatcgagg attcgggaga ggagtttgtg ggccgggtga acaacaagta 300 gacggacaca aattcgtaag ttgttattta atttgaaaca aacagtgaaa taaataacac 360 aaagttcatg caacacgaaa caggaaccac acgcggataa atccgggtgg tgattcacgg 420 agggaggaca agacggttta ggacggaaca ggacagaacc gctgtctttg tacgagtaag 480 ttcgaaacag ataaacatga aaaatataac aaacaaaaac atgcaaaatc tgtactaaca 540 tgtaaaaccg aacacgctgt taaacataac ctaaaattac agctacaaca caaacgtaga 600 acacaaatcc ggatcgggtt ggacggaaaa caggaaacaa acaaattgta agcacaatta 660 cacacacact acacacacat tgtactaaac ttaaatctaa tttgcagctt tgatctgcct 720 taaataaatt gggctgattc gcaaggaggt ttatttccgt tggcgtccag aaca 774 // ID BEL3_Cis_I repbase; DNA; INV; 6114 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of BEL LTR Retrotransposon from Ciona savignyi. XX KW BEL; LTR Retrotransposon; Transposable Element; internal portion; KW BEL3_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6114 RA Smit A.F.; RT "BEL3_Cis_I - BEL LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000445, Ci000162, Ci000712. ORF from bp 27 to 5954 encodes a CC protein 34 % identical (52% similar to Catch3 in Drosophila). CC Most elements are internal deletion products, missing bp 2755 to CC 5150 or 2972 to 5979. XX SQ Sequence 6114 BP; 1939 A; 1246 C; 1399 G; 1502 T; 28 other; gtaactgagt gggattggca atttaagtgc acatnatgga gacgaaattg cccaaagaga 60 ccagcacgtc gtttgagatt gaaacgaatc agcaaacgcg cgactcgcga gcaagtgccc 120 gaagtgatca gacagacgtc gccatgcaga acttngctgg tgaagaaaat gtaacaacaa 180 ggaaacatcg acatctggac aacgaacttg ntgatgggga taacttcgaa agctggaanc 240 taatggaagg tgaactaaac gataaattag acgtactgga tatggccata attggaaatg 300 attttgtcgg agtgcaaaat gcgagttata atttatcatc gataaggtca aacatggaaa 360 cgatcataga gaaattcgaa tcaaccgaac gcgaaaggtt gtgttcgagg atcaccaaat 420 gcaaggaact tctagctaaa gctaaacaag aggcgaaaat aactgtgatt cagcttcgac 480 aagaaagaat ctcctcgaaa tcnaactcct acaaaagttc aaccagcaaa cgttctagcc 540 gaaagggtga gtccgcggaa gaaatgataa aaaaggagcg ttacaaacaa aatatcgctg 600 ctgctggttt aaaagcaaaa ctcgcattta ctaggaaaca aaatgaactt gaattagcaa 660 aaatgcaagc cgataaagat ttatctttaa atcgcatttc ctgtgaaatc gcagaaaacg 720 acgcccgagc atcggcttgt ctggaagcgg aaagaattga aaatcctttt ggaataagac 780 cagaggatat tttagagttg ccacctgtga acaaatgtac gtcgtttcgc gaacaaatga 840 accatgccga acacacttca attcctcagg cggcagagag cgtccttgca acgacaacgg 900 agcgacacga acgaatcgct ccgaaaagca cctttaatta caatacacca cttccaataa 960 ccactttacc ctctgtttcg tatagattac cggatataaa agtagatgta ttcgatggca 1020 ccgcaatcaa gtttccatcg tgggaaacgg cattcgatgc actgatagag gcgagatcca 1080 gttgcgttgc acagaaactt aatctcctac aacaacactt aagaggcgaa ccnaaggaag 1140 tagttgatgg cttttttctg ttgcaatctg aagaagcgta tgaggccgca agatcaactc 1200 ttaaaaaaag gtatggaaac gaatgcgtta ttagcaaagc gtttaccgat aagctatatt 1260 cctggccaca aattcaccat tctgattcat tgggtctcag acgattttcg gactttttaa 1320 ttcaagttct tactgctaag aacaaaattg caagcttaaa cattttggat tatccgaccg 1380 aaagttcaaa aataatttca cgacttccac tcttcatagc atcgcgatgg aaagacgcag 1440 taatagattg gaaggagagg aacgctggtt catacccaca gttcaaacag ctagtagagt 1500 tcatagaacg ccaatccgag aaggaaaaca taccggagct tcaaaagacg aaggcaattg 1560 anaatcggag aacagcatca ttcgctacaa ggtcgaatga ctctcaagat ggcccaagga 1620 aaatagacgg taaatgtttt tattgcggtg aacgacatca catagatgag tgtgaactgt 1680 tcttaagaat acctagaagg gaaagaaagg atttcctaag aattgaacga ctatgctact 1740 cttgcggttc aagtcagaac cacattgccc gacagtgtcg ncaaagagta aaatgcgaaa 1800 tttgcaagaa gacgcatctg acttgtttac atatggattg ggaaatcgaa aaaacagaat 1860 gtaaatgcac tacgctttgt ggcaacgcna ttcagggaga cgactcttcg atgattcttc 1920 cagtttgggt tagaagtacg aaggccccnt cgactgaagt gctgacgtac tgcattttgg 1980 atccacaatc gaacagttcc tttatttccg aagagcttca agaatgtatg ggagttgccg 2040 gcacagatac aaatatcaac ctatgcacaa tgatgggtag gaataacgtt agtgccacta 2100 tacgaattcg caatctagag atttccagtt ttgaccgatc ggtgcgaatc cctctacccg 2160 ccatttacac acgaccagag atacctgcag ctcggagtca gatacctagg cctgaatacg 2220 ccttgcgatt caaacatcta gcaaaaattt cgcagcagat ctcgccgtac aaaccgcatg 2280 tacctgtagg tatgctcatt ggaacaaacg ttccgtgcgt cattcgaccc cgggatatag 2340 tctgtgggaa agaaaatgaa ccctatgctc aaaaatcgat ccttggctgg ggtatcgtgg 2400 gcatcgtctg taaaaatcac ccaggcgttg gcgacttgat aactcactgc tgcatcgcga 2460 gtgaggaatg tgctaatggc gcgatgatta taaacgggtg taaggagant atatcaccac 2520 agcaagttcg caaagggatg gagctagact tccacgagaa agagtgcaac gatggaagtg 2580 gaagtaaact ttcattacaa gacaaaaagt tcgagaatat aatcggcgag aacattgcac 2640 aattactaga tggtggatac caaagaccac tacgtttaaa gcacaacaac atgcgacctc 2700 cgaaaagtcg accaatagcg aaanggagac ttggccattt aaagcggcgc ttacagcgaa 2760 atggtcaata taagagcgat tacaaagagt ttatgnacaa tgtaattaaa ctttttgata 2820 aatcagtgga caagccctca atgtatgaaa atacgaaccg gataaactac ataccacaca 2880 ccggaaaaca ttattccaag aagcnagaaa atctacgggt tctatttgat tacgcagcgg 2940 aataccctgg tataagctta aacgattacc ttttacaagg gccagatatg ctaaacagcc 3000 ttgtgggcat tctttgccga tttagactgc atagaactgc aatcatcgca gatattgagg 3060 caatgtttca tcaatttaaa gtggaaaagc catatcgtga ccttttgcgc ttcttgtggt 3120 ggaaaaatgg cgacctcgag aaacctattg tggagtatcg catgaaggtc cacatttttg 3180 gtgctgttag ttctccaggt tgcgctaatt atggtcttag gaaagcggca gatgatggtg 3240 aagccgaatt tggaattgag gcggcaaact accttcgaaa aaacttttac gtcgatgatg 3300 gtgttatttc tgttccaact acagagaatg caattgctct aatcaaagac agcaaagctt 3360 tgtgcgcaaa ggcaggatta cggctacana agttcgcctc gaacgatcgg gncgttttaa 3420 aacaaatccc tttaagcgat cgatccaaat ccctacgtga cattcaaatt ggaatccatc 3480 gtcttcccag ggaaagagtt ttggggatca catggtgctt ggaaaacgat tcattgtgtt 3540 ttcgtatcga attaaaggac acgcctctga cacgaagagg natttcagcc actgtaagtt 3600 ctgtttacga cccgttagca tttacagctc ctgtgatgct ggaaccgaag ctgatattgc 3660 agcagttatg ccgcgaaaag gctgactggg atgatgatgt tcccgaaaat attcgatccc 3720 gatgggaaaa gtggagatat aanctnaagg gctttcggga agttgaagtn caacggtgct 3780 acgagccacc agaatttgga acagtagtaa cccgtgagtt gcattatttt tctgacgcaa 3840 gtacagttgg atacggtcaa tgcacctacc taagaatagt gaacagccag caacaagtgc 3900 actgtgcttt tgtatctggt aaatcaagag tagcgccctt gaagcaaatt acaattccga 3960 ggttggagtt aacagccgct gtaatttcag cgcaggtgag cgcttatcta aagcaacagc 4020 tttactttga aaattgcaga gaatacttct gggttgacag caaggtcgtg ctcggttatc 4080 ttgctaacga tgtcaagcgt tttcaaattt ttgtggctaa tagaattcag cggattaaag 4140 actataccaa tggtggcgac tggttacatg taaaaaccgc cgaaaaccct gctgatctag 4200 caagtagagg aatgctncca aaccagctaa cggctttatc gccatggctg cgaggaccaa 4260 acttcttctg gagaacagat ttccttcaac cagacgtatc aaacatgtta gaagaagagg 4320 atgttcaaaa ggagttacaa ctcgaaatga agcgagtcgc tgtgataaat acgaccgtga 4380 gacccctaga tcgttttgac attgacagac ttctccacat ctcctcttgg ttcagagcaa 4440 agagggcagt cgcaaattgt ctgcgtttta ctcgacgtct caagaacaaa acattggttc 4500 ctgtgacacg acaactgatg cacgaagcag aagtcattat actgaaaggt acgcaattaa 4560 gttattttcc agaggagtat gcatcgttga tgagtcgcaa ccgcggaaaa ggttgcaata 4620 gttcgtcgaa atcgaagtta aacagcagtc tgctaagact gagtccattc gtagatgact 4680 gnggcgtcat tagagttgga ggacgacttc agtatgcaaa ttggggcttt gaagtgaagc 4740 atcccgtnat tctaccaaaa gttggtcata ttgtggaact gttaatcaga cattcacatg 4800 agaaggtaca acataaaggt anaactggta cacagaactg tatacgacaa antggatatt 4860 ggattattaa cggttcatct agagtcgcgc attatataag taaatgcgtc atttgcaaac 4920 ggctgcgtgg tgganggcaa acacagaaaa tggcgaattt gccaacggac agactgcatg 4980 aatgtccacc attttcttac tgtggcgtgg actactttgg tccgttttat attaaggaga 5040 agcggtcaat gataaaacga tacggtgtac tattcacttg tctcgtgtct cgagcgatac 5100 atttggaaac ttctaacact ttgaacacag attccttcat aaatgcattg cggcgttttc 5160 ttgcccgtag gtgccccgtg ttgcagatga gatccgactg cgggacaaac tttgttggcg 5220 catttaacga attgaagagc gaagcaggtt caatcaatca agagagcatt aaacaatatt 5280 tacttcgaca ggactgtgat tgggttcctt tcgaatttaa tccccctcat gcaagccaca 5340 tgggaggggt ttgggaaagg caaattcgaa ctgtaagaag ctcactggag ctgatgttga 5400 gacgtttcgg ggaacaactc gacgacgaga cttttagaac cctaatgacg gaggtggaaa 5460 atatcgtcaa ctcgcgcccc ttaactcaca gcggtttaaa cgaagcagga gaacctgagc 5520 ctcttacacc gaaccatttg ctgacaacaa agttaaaacc tttgcttcca ccacctggca 5580 actttcaaca nacggatatg tacgttagac ggagatggag aagagtccaa tacttggcga 5640 acgttttttg gtctcgctgg agaaaagaat acctccagac cctacaagtt aggacgaaat 5700 ggcgcaccat tcaggacaat attcaagtcg gcgatatcgt attgttggtc gatgaaaacg 5760 caacccgaaa tatttggaag atgggaaaag tgttgaaggt gtacccaagt gaagatggtc 5820 ttgttcgcaa agtgaacatt ctcgtcggtg atggccgaag agatgaaagg ggcaaacgca 5880 tatcaccctc aacagtccta gaccgcccca tccataagtt gatcctacta cttcgaactg 5940 gtgaagagtc gtaagagaga ggcaaccgtt aatgttaaac gattacccga cgaggagcca 6000 accaaaccta taaacttgtc ttcgttatcg ccatttactg tattgtttaa cgccatttac 6060 tatgttgntt aacgtttttc aaagggaaaa atttgtgaaa tttttgggaa gcca 6114 // ID MuDRx-1N_SM repbase; DNA; INV; 374 BP. XX AC . XX DT 16-AUG-2009 (Rel. 14.08, Created) DT 16-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE MuDR-type DNA transposon element - a consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW MuDRx-1N_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-374 RA Jurka J.; RT "MuDR-type elements from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1905-1905 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 374 BP; 139 A; 56 C; 58 G; 121 T; 0 other; ggacaaaatg gcacggacaa aattgtaggg gaaaatgata ataaattgta ctttttagat 60 ttaaaattct cacgaatgag ataaaaactg caatacaatt aaagcatttt tccgtgaaat 120 aacttatgac aactattgaa acagtaacaa tgaaccaaat attcgttttg gcttcaaatt 180 aaaaacatag ttatacaaat taaaattaac tttgtccagg taattatttg tttcgcttaa 240 taaaattaat tgtcagcact atcacttgag catgagctgg atttcaaatt tttgggattg 300 ccccgtgaat atcggtgcct atatatataa aaaacttaaa gaaagtgcca ttttgtcctc 360 gcgacatttt gtcc 374 // ID Gypsy-157_AA-I repbase; DNA; INV; 5300 BP. XX AC AAGE02018588; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-157_AA_; KW Gypsy-157_AA-LTR; Gypsy-157_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5300 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018588; Positions 47766 42467. XX CC Positions [2321-2821] - Reverse transcriptase CC Positions [3956-4426] - Integrase core CC 'GTAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 735..1841 FT /product="Gypsy-157_AA-I_2p" FT /translation="MSSLIGTVDHFVRGSSFAQYMERMDILYQLNNVADNM FT KKSLFITMSGPVVFEEVKLIYPGKNVKDIDYADMIDKLKNRFDKIEPNMMH FT RHRLHMRKQGIDEPAENYVLGIKLIAAQCGFGAHKDEAVKDAIIFGLRDQE FT LKQKLLMKDDISLDEVEQIVIRTELAKLRVKALSGEEDGARCINSVKYRLG FT HNHSEHNWNSSRNSRQESRRFVSRSHSLDSNRSRSPRHEYQIDRSHYGNRN FT RFDNRNYRPSYQQQQRRRSENEDMHANIICNFCKKRGHVKRNCFKLKNRRA FT VNFVQENDVHSVQERDLEKDESYDFKRLSIRDSEDSEDFACMMIKSGISDG FT GPCMVEVVVEGRKFNMEVDCGAAVSV" FT CDS 2021..4930 FT /product="Gypsy-157_AA-I_1p" FT /translation="MPLLDRNWLEVFFPQWRNAFVSAKAINSLQSENTNLN FT STGKFEMDIKAKFPKSFDGDFSKPIIGHEADLVLKDTSPIFKKAYEVPFKL FT REKVLEHLDSLERQNVITPIQISEWASPVIIVLKKDDDIRMVIDCKVSINK FT VIIPNTYPLPLVQDIFASLAGCKWFCCLDLAGAYTQLQLSKRARKFMVINT FT IKGLYTYNRLPQGASSSASLFQKIMDQILAGLKYVQVNLNDVLIAGRTKEE FT CYKKLMLVMEKLQAANIKINLKKCRFFVNSLPYLGHLITEKGLLPSPEKLL FT TIEEAKVPKDVAELKASLGLINYYGKFVPRLSAKLRPLYQLLKKDSKFVWN FT ETSQLSFEECKKALLTAGILEFYDPSKPVVVISDACTYGLGGVIAHIVDGQ FT EKPISFTSFSLNSAQQKYPILHLEALALVCTVKKFHKFLFGQKFSIFTDHK FT PLLGIFGKEGKNQLCVTRLQRYVMDMAIYDYEIQYRPSAQMGNADFCSRFP FT LEVEVPRCVDSGSVKSINFFNDFPIDYSLIAKETRVDDFLSKIVLYVTNGW FT PQRIENEWKKAYSLRFDLEVVEGSLLYQDRVFVPFKLHEPILKLLHSNHIG FT MVKMKQMARRCLFWLGINEDIEWHVKQCQACLQMSVIPKSTNSTSLTQTNR FT PFSRIHADFFHFDHRTFLLVVDSYSKWLEIDFMKTGTDATKVIRKFVAIFA FT RFGLPDVLVTDGGPPFNAFQFTSFMERQGIKVLKSPPYNPSSNGQAERMVR FT VAKDIFKKFLLDPFTKTLDIEDRILYFLFNYRNTCSGEDERFPTEKVFCFK FT PKTLTDLLHPKRTYKDHLIIPDTKEKLDDEVDNSCSTPSQDPFLKLKLGDQ FT ILYKNYDKTALEKWIAAQFVTRVSTNVFRISIGRNTLNAHRSQLKIMEKRP FT TGSQMRVTMQRVKRPRTASDSEEEFLGFPDVPMVPGQIDDDRRRFKQLKRS FT PIVTRSKSRSKPNLN" XX SQ Sequence 5300 BP; 1740 A; 839 C; 1155 G; 1566 T; 0 other; gttgcgacga gaaaaaaaaa ttagtggatt acctacttgg aaaactgtga tttgttagca 60 atctcagtat agagtggtaa gataaaaaaa atcatagcgg caatagaagg agcaattgac 120 cgtgtgtgag caatagaact gaagcttgtg tcccatgaaa aattaatcac tgttaattat 180 tgggtgcagt gattaattgt ttcaagaggt gttggaatca gttcatttca tcgtagctgt 240 tgaattgtgt aaaaatagaa ctttcaaaag aaaatttgtg aaatttacgg tgcacgtgtt 300 cggtacagta caatagattt aaatctacga gacaccattg tgaggccgaa acaataagct 360 tcaatatata ggccagcaat agctttgtgc tcttttgtta atttatgcag tgtgattctg 420 gagaaaagtt tgtatcggat agtgatcgtt gatttcatgt gtccgtgagc aaatagttaa 480 ttctgcatag acatctaagc aacagtgtgc accagtgact agaaagtgta tattggattc 540 aacttgattg gaggcgacca gcaccggcta ttggattacc acgtgaaatc agcatcagca 600 gcatcagcgg tgtgaaaacc agacagaagg ttctaccaga ccagcgtcgt aagaaaaggt 660 cacgtaagtc gttatttctt attactcata ttctattctt ttagtgtcta ttgtttgtga 720 atacttacgc cattatgtct tccttgattg gcacggtcga ccattttgtt agagggtcca 780 gtttcgcaca atatatggaa cgtatggaca tactctatca attgaataat gtggcagaca 840 atatgaagaa aagtcttttt ataacaatga gtggcccagt ggtttttgag gaagtaaaat 900 tgatttatcc agggaaaaat gtaaaagata ttgactacgc cgatatgatt gataaactta 960 aaaatcggtt tgataaaatt gagcctaata tgatgcacag acataggctg cacatgagaa 1020 aacagggtat tgacgaacca gcagaaaatt atgtgctagg aattaaactt attgcggctc 1080 aatgtgggtt tggcgcgcat aaagatgaag cagttaagga tgcaataata tttggtttga 1140 gagatcagga attaaaacaa aaattgctca tgaaggatga tataagttta gacgaagtgg 1200 aacagatcgt cattagaaca gagttggcga aattgcgggt taaagctcta agtggagaag 1260 aagatggtgc taggtgtata aattccgtga aataccgttt gggtcataat catagtgagc 1320 acaattggaa ctcgagcaga aatagtaggc aggagtcacg taggtttgta agccgcagtc 1380 atagtctgga tagtaatcga agccgttcac caagacatga atatcaaata gatagaagcc 1440 attacggaaa ccgaaatcgt tttgataata gaaattatcg tccatcatac caacagcaac 1500 aacgtcgtag atctgaaaat gaagacatgc atgctaacat catttgtaac ttttgtaaga 1560 aacgcggtca cgttaagcgt aactgtttta aactaaaaaa ccgtcgggct gttaattttg 1620 ttcaggaaaa tgatgtccat tctgttcagg agagggattt ggagaaagat gagagctatg 1680 atttcaagcg tttgagcatt cgtgattcgg aggattcaga ggattttgca tgtatgatga 1740 ttaaatcagg gattagcgat ggtggaccgt gtatggtaga agtggttgta gagggcagaa 1800 aattcaacat ggaggttgat tgcggggcag cagtgtccgt ataagtttga tcacatacaa 1860 aaggtatttt gaaaacgtta aagtggccga ttgtgctagt cgattagtcg tggtgaatgg 1920 ccagcggttg aacattcatg gaaaaattca agttagagtc agtgttaaca acaacgaaaa 1980 acatgtcagc ttgatcattc tggattgtgc aaacaacttt atgccactct tagatagaaa 2040 ttggttggaa gtgtttttcc cgcaatggag aaatgctttt gtaagcgcta aggctatcaa 2100 ctctcttcaa tcagaaaaca ccaacttgaa tagtacgggt aaatttgaaa tggatattaa 2160 ggccaaattt ccgaaaagtt ttgatggtga tttttccaaa ccaattatag gacacgaagc 2220 agatttagtt ctgaaggata catcaccaat ctttaagaaa gcatacgaag tcccattcaa 2280 actcagagag aaggtgttgg aacacttgga ctctcttgaa agacagaatg tgatcacccc 2340 catccagatt agcgagtggg cgtccccggt aattatcgtt ttgaaaaagg acgatgatat 2400 acgcatggtc atcgattgta aggtatctat taataaagtc attataccta acacgtatcc 2460 tcttcctttg gtacaggata tatttgcttc tttagctggg tgtaaatggt tttgttgcct 2520 agatttggcg ggggcgtaca cgcaattgca actttcgaag agggccagga aattcatggt 2580 gataaacact ataaaaggtt tatacaccta caatcgttta cctcagggcg catcctcaag 2640 tgcttccctt tttcaaaaaa ttatggacca gattctagca ggattaaaat acgtacaagt 2700 taacttgaac gacgttctta tagcaggaag aacaaaagaa gaatgttaca aaaaattgat 2760 gttagtgatg gagaaacttc aagctgcaaa cattaaaata aatttaaaaa agtgtaggtt 2820 cttcgtgaat tctcttcctt atttaggcca ccttattacc gaaaagggtt tactaccttc 2880 acctgagaag ttgttgacca ttgaagaagc gaaagttcct aaagatgtag cggaattgaa 2940 agcttccctt gggttaatta attattatgg taaatttgtt ccaagattat cagccaaact 3000 tagaccgcta tatcaattgt taaaaaaaga ttctaaattt gtttggaatg aaacaagtca 3060 gctatccttt gaagaatgta aaaaagcatt gctgacagcg ggtattttgg aattttatga 3120 tccaagtaaa ccagtagttg tgatttcgga tgcatgtacc tacggattag ggggagtaat 3180 tgcacacata gtagatggcc aagaaaaacc tataagcttc acatcctttt ctctgaatag 3240 cgcacaacaa aaatacccta ttttgcatct agaagcatta gctttggtgt gcacagtgaa 3300 aaagtttcac aaattcttgt tcggacaaaa gttttcaata ttcaccgatc acaaaccgtt 3360 gctgggtatc tttggaaaag aaggaaaaaa tcaactttgt gttactagac tgcagagata 3420 tgtaatggat atggcaattt atgattacga aatacaatac cgaccatctg cacaaatggg 3480 taacgcggat ttctgctcta ggtttccttt ggaagttgaa gttccaaggt gtgtggatag 3540 tggtagtgtg aaaagtataa atttcttcaa tgattttcct attgattact ctcttattgc 3600 caaagaaacc agagtagatg attttctttc taaaattgta ctctatgtta ctaatggttg 3660 gcctcaaaga attgagaatg aatggaagaa agcttattca ttaaggtttg atcttgaggt 3720 ggtagaaggt agccttttat atcaagaccg agtatttgta ccgttcaaac tccatgaacc 3780 tatactaaag ttattacatt caaatcacat aggaatggtt aaaatgaaac aaatggctag 3840 acggtgttta ttttggctgg gaattaatga ggatatcgaa tggcatgtta aacaatgcca 3900 ggcatgttta cagatgtcag tgatacctaa atcaactaat agtacatctt tgactcaaac 3960 aaatcggcca ttcagtcgca tccatgccga tttctttcat ttcgatcaca gaactttttt 4020 attggtagtg gatagctata gtaaatggct ggaaatagat ttcatgaaaa cgggaacgga 4080 tgctaccaaa gtaattcgta aatttgtggc gatatttgct cgatttggac tgccagatgt 4140 attggttacc gatggaggtc ctccgtttaa cgcttttcaa tttacgtctt tcatggaaag 4200 acaaggcatt aaagttttaa aaagcccacc atataatcca agcagcaatg gtcaggcaga 4260 gagaatggtg agagtagcta aggacatttt caaaaaattc ttgttagatc ctttcactaa 4320 aacgcttgat attgaagacc gtattttgta ctttttattt aactaccgta atacttgttc 4380 tggtgaagat gagaggttcc caactgaaaa ggttttttgt tttaaaccaa agactttgac 4440 tgacctttta catccaaaaa ggacgtataa ggaccatttg ataattcctg acaccaaaga 4500 aaaacttgac gacgaagttg ataattcatg cagtacgccc tcccaagatc cttttctcaa 4560 gctgaaatta ggagatcaaa tactctataa aaattatgat aagacggcgc tagagaagtg 4620 gattgctgct cagtttgtga ctagagtatc gactaatgtt tttcgaattt ctattggcag 4680 aaacaccctg aacgcacacc gatcgcagct aaagataatg gaaaaaagac caactggttc 4740 gcaaatgaga gtgactatgc agcgagtcaa gaggccgaga acggccagtg attctgagga 4800 ggaattcctt ggttttccgg atgtgccaat ggtgccagga caaatagacg atgatcgaag 4860 gcggttcaag cagctaaagc gtagtccgat cgtcacgaga agtaaatcac gctcaaagcc 4920 gaatttgaat taattgtgtc cgaacaggtc gagcattaca ttgattgttc aaaggcaagt 4980 tgtgcttgcg gtctgaattg aaataagcta cagaaattgt gttttttcga agttgtgttt 5040 gagacctgaa ttaaaataag cagtcaaaat gaatttgtac tcgaaatagt tgtaaaacca 5100 atattattta ttagagcatg ctaagtgtta aattttggta tcgatttaaa gttttgtcaa 5160 aagcaatcat ggtttattga agcatgttca ctgatttgaa ttaggattag ctatttgatc 5220 gaatattgta ttgaaagaca tatttgtaaa tagaatcaag tttgttttta gataaagttt 5280 ttagctaaag ggaaaagaat 5300 // ID Gypsy-130_AA-LTR repbase; DNA; INV; 255 BP. XX AC supercont1.8; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-130_AA_; KW Gypsy-130_AA-I; Gypsy-130_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-255 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.8; Positions 429060 429314. XX SQ Sequence 255 BP; 76 A; 53 C; 35 G; 91 T; 0 other; tgaaatatat tagatcactc catttagatt ccataattat agcaaataga aattagctat 60 agatgtcagt gtctcctatc tcctatcttg attgtaaaca attatcttgc tttattcaaa 120 tgatatccgg tcctactcct atataaggcc tgcacaattt agactaagcc tcttttgtat 180 ttcgactgta taataaacca cattggtaag tgtatagagc ggtctcgtct ttcattcctc 240 cgtccataca gatca 255 // ID Gypsy10-LTR_AP repbase; DNA; INV; 329 BP. XX AC Contig4460; XX DT 21-APR-2008 (Rel. 13.04, Created) DT 21-APR-2008 (Rel. 15.12, Last updated, Version 0) XX DE LTR retrotransposon from pea aphid: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy10AP; KW Gypsy10-I_AP; Gypsy10-LTR_AP. XX NM Gypsy10-LTR_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-329 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from pea aphid."; RL Repbase Reports 8(4), 456-456 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 329 BP; 94 A; 53 C; 66 G; 116 T; 0 other; tgtagtaatt aaattatata ttgtcatttt atcgatagcg ataagcgcgg gccggacgat 60 cgatacgaac gtcgatcgta ctggtggcga attagtgtag tcgcgaccat cgaccgtacc 120 gtgtgacgat cggaaattat tacggaatat aatattgttt tagtttcatg tgtgtttgtt 180 ataacttata ataaattggt gatcaatccg tgacccatta ataaaagggg ttgagattat 240 atacttacgt aattctttta ttcatttctc accttttatt aatttgtacg ctcaggtaca 300 atagtgcgca ttataacgtt tgtaccaca 329 // ID hAT-49_HM repbase; DNA; INV; 3131 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-49_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3131 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2037-2037 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(543..1988,2078..2512) FT /product="hAT-49_HM_1p" FT /translation="MVTSTIYYFEGDGQCLRGTSKPTTRQETELWLIGQMS FT EILSFTKLPSKKEVMALFFYYKEAAKQTVREASHSTTNDVIEVWAKARIPT FT QLKKHVVEKVECMFHEYDKLKKNKENKAKRSESLLKKEEEWKDGLESLFDV FT AHADAMKMISIQEDKEFLLAQREAGRRGKMGSVDKALAKRERDVHNKEENF FT KRRKEREEQDRVARKKKAILQTSESEQESGNDDEAFGEPSSSTTSKRAKRI FT RGRQKILNDKLAANLDMAKXSDRXAALVLTPALQHLGHDPTEYNVNPASIR FT RERIKRRKKIAEXLKEEFKPKVPLXIHWDGKLLADISSTEIVDRLPILVSG FT VGVQQLLSVPKLPSGTGENAASAVHDASVAWGINDQVKCMCFDTTAANTGP FT RNGACILLEQKFGKDMLWLACRHHILEIILEAVVLLCLGPSSGPDIPLFKR FT FQKSWVLIDSTKYHTAQSDKNASNALTDIADERIVFCGACSFRAPAGLHRA FT RWMAKAIYALKIWMFRDQFRLTKREEKGIRDICLFTVRLYVTAWYRSPEAT FT SAPRLDLQLLKDLDAYKTQHPEISKIAVKKLQGHLWYLSEELVALAFFDDE FT VCPETKRQDGQCXASTECRSDTFEXGYC*" XX SQ Sequence 3131 BP; 1016 A; 560 C; 663 G; 871 T; 21 other; tagggtggtc cttaattttc aaagatgatt tttgtatggg gcacccctga attttgttta 60 taagcatgca tacaaaattc acaaaaaaac tcagctaaat aaaatattta gaggtcccta 120 ccacatatac tagttaagta tcatttgggg ttccgtggct gcatttcaca agttgttatg 180 ccggtgaact ctttcactat acacaccata ctgcctgcat attttatcaa tttgatgtta 240 cttatttatt gttaggtttc ccagaatgca tttcaccagc tgtcattttt aatcagacat 300 taacggacaa tttaaaaata atatttcatt aatctgaatt agtgcctagt attgactact 360 gactttctca gggaaagttc cataacaacg gacctcgtgg atactgaaca aaagtaagtg 420 ttttctttgt ttatttttgc ttgcattgtt tgcattattg cataacataa acattgtcag 480 ttggattact gggttctgtt agttttggta gaaattataa ttaatttgta taattaaatt 540 ggatggtaac aagtacaatt tattattttg aaggagatgg ccaatgcttg cgtgggactt 600 cgaaaccaac tacaagacaa gaaactgagt tgtggctgat aggccaaatg tctgaaattc 660 tcagctttac aaagttaccg tcaaaraagg aagtcatggc actattcttc tattacaaag 720 aggccgccaa acagaccgtt cgtgaagcat cacattcaac aactaatgac gttattgaag 780 tatgggctaa agctcgaatc ccgacgcaat tgaagaaaca tgttgttgaa aaagttgaat 840 gtatgttcca tgaatatgac aaactgaaga aaaataagga aaataaagcg aagcgctctg 900 aaagtctact gaagaaggaa gaagaatgga aggatggctt agagagtctt tttgacgttg 960 ctcatgcgga tgccatgaag atgattagca ttcaggagga caaggagttc ctgttagctc 1020 agcgtgaggc gggacgacgt ggtaagatgg gaagtgttga caaagccttg gccaaaagag 1080 aaagagatgt tcacaacaaa gaagaaaact tcaagaggag aaaggagaga gaagaacagg 1140 acagagtggc cagaaagaag aaagctattc ttcagacatc tgaaagcgaa caagaatctg 1200 gtaatgatga tgaagcattt ggtgagccat catcaagcac aacttcaaag agagctaaac 1260 gtatacgtgg tagacaaaaa atactgaatg acaagttggc agcwaatytg gatatggcaa 1320 agktcagtga tagaaakgct gcactwgtgt taactccagc attacaacat cttggtcatg 1380 atccaacaga atacaatgtt aatccagctt ccattcgaag agaacgaata aaacgtcgca 1440 agaagatagc agaargtttg aaagaagaat tcaaaccaaa agttccttta wcaatacatt 1500 gggatggaaa attactcgca gatatcagta gcacagaaat tgtggaccgg ctcccaatyc 1560 tagtatctgg agtaggagtt cagcagcttc ttagtgtccc aaaactacca tcaggaacag 1620 gtgaaaatgc ggcatctgca gtgcatgacg cgtcagtggc atggggaatt aacgatcaag 1680 taaagtgcat gtgctttgat actactgcag ccaatactgg gccaagaaat ggggcttgca 1740 ttcttctaga gcagaagttt gggaaagata tgttgtggtt ggcatgtcgt caccacattc 1800 tggaaattat tttggaagca gtagttttgc tgtgtctcgg cccttcaagt ggacctgaca 1860 taccgctctt taaaagattt cagaaaagtt gggtcttaat tgattccaca aagtaccaca 1920 cagcacagtc tgataagaac gcttcaaatg ctcttaccga tatcgctgat gaaagaatcg 1980 tcttttgcta aagatcaact acaaatcttt caacctcgtg acgactaccg agaacttttg 2040 gagttgtcta ttgttttttt gggaggagta cctttgagga gcatgttcat ttagagctcc 2100 tgccggtcta caccgtgctc gctggatggc gaaagctatt tatgcgttga agatttggat 2160 gtttagagac caattcaggt tgacaaaaag agaggagaag ggaataaggg atatttgyct 2220 tttcacggtt cgactgtacg ttacagcctg gtacagatct cctgaagcaa cctcagctcc 2280 tagacttgat ctgcagctct tgaaagactt ggacgcctac aagacccagc atccwgaaat 2340 ctcgaaaatt gctgtgaaga agttacaagg gcatctctgg tacctatcag aggaacttgt 2400 tgctttagct ttttttgatg atgaggtttg ccctgagaca aagcgtcaag atggtcartg 2460 cmttgcaagc actgagtgca gatcagatac ctttgaaamg ggctactgtt gatccttcgc 2520 tggtgagctc caagaacttg saggactttg tcacctttaa tactcaaaga tttttcagca 2580 tcacaggact tccatcacac tttcttcaac aacagtgtga gccaatggta acatgatgat 2640 gasttcgaga ctgtaaagtm cactgttcga agcatgaagg ttgtcaacga cattgcagaa 2700 cgtggcgtag cactaatgga cgaatacaat aaattgcata caaataatga agagcaaaag 2760 cagttcctgt tgttggtagt aaaaaartac agacagaaat atcctgaycg aaaaaagaat 2820 accctagcaa tggactaact agacggtttt tctcacttcg acagtctcaa cgtagaarga 2880 caataataat tacaaatgta acagtttttt aacgtgcttc aataaagcaa tgtattctct 2940 gtgacagttt aaatttgtcc tgatcattca tattgtgctg agcaaataag ccttcacttt 3000 tattgatacg aactaaaatg ttttattcaa ataaaatttt catacaatca ttttttttaa 3060 tagtcgacca tttgagaggg tgccccatca raaaacaaaa aaaaattttt ttataaacaa 3120 ggaccaccct a 3131 // ID Copia-2_Cfl-LTR repbase; DNA; INV; 258 BP. XX AC AEAB01006442; XX DT 26-JAN-2011 (Rel. 16.02, Created) DT 26-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Florida carpenter ant genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_Cfl_; KW Copia-2_Cfl-I; Copia-2_Cfl-LTR. XX OS Camponotus floridanus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. XX RN [1] RP 1-258 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Florida carpenter ant genome."; RL Direct Submission to RU (26-JAN-2011). XX DR Genome; AEAB01006442; Positions 440 183. XX SQ Sequence 258 BP; 44 A; 67 C; 53 G; 94 T; 0 other; tgttggaact cgcttgaatt cgagtggact aagttccgac tcgcactcgt ggagcggcgc 60 tacattctgc agtcaatcac tgctatctac ttccttttta ctcttctctg tgtgctctcg 120 atttcgcggc gtgtcgccga tatacttgag ctgtcttttt tgcagtaaag tcgtattaaa 180 tccttattgt gcataaatcc tgttgttctt ttctggctct tcctttgcgc tacgaaaggc 240 tctgcctcct gtacctca 258 // ID Gypsy-17-LTR_NVi repbase; DNA; INV; 1626 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-17-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-1626 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 772-772 (2009). XX DR [1] (Consensus) XX SQ Sequence 1626 BP; 351 A; 401 C; 496 G; 377 T; 1 other; tgtaacacgg caaaatttcg ccgtgtaaca acgacgttcg acgcggtcgg ggtagagtgt 60 ctcaaggcta cctgtgtgaa tgagagagtg aacgtgtcta agtgcgtcgt ttattttttg 120 agtgaacgac gagagaggag cagtggctgc gacggacgaa aaggacaacg cacgtcgccg 180 actcgtgagt gagagatgcg aggagacgac gagcaacgtt ggcgcagcag cagcgacctt 240 tcacctcgac agggcgttca agtcggcccc gcgcaccgta agacagccga tgcaggtggc 300 ggagcccttc gtggggtcta cctacgtgag cgtctgatgc agcaggacca gcgtgatggc 360 agctcgcccg cacgaacgcc aggaacaaaa actacagcgt ctcgcgttcg tagtgacaga 420 tgccgcccga gtcaagggcc ctgacgacgg ggcattggga gtcactacgg gttccgcagt 480 caagagttat cttggttgtc tgcgggtgaa aggctgtctg tgtgctacga gaattcgaat 540 tgaatttcgc ggctatctgc ttcgacatcg tgtccgcggg gctatctgcg cgcatgcgcg 600 accgcgcgac tcggcaacgg cgcgtgaacc gatagtgttg cgagcgagcg ggccgtcggg 660 ccgcgacgcg ttgaccaatc agcgcgcgcg cggtacggcg gctagggaga gcagagtggc 720 ggccgcgagc cgcgagcgct cgagccggca ctcgcctgcc ccgactgctc ccgaacggac 780 gtgcctcaac ttcaacttca cccggtgttc cagctagaag gatttaaggt cagtgaatcg 840 ttttccgttt tctccggtcg tatccgtctc gcgggtgaac ggcagttcca cgttcggttt 900 cgcgaacgaa aaggttatct cgcgtacgcg ttagttacct ccggatcgct gtctttatct 960 ctttatctct ttgtattgtg ttccgtcgaa taaagggtta tctcgtttgt acgggttatc 1020 ttctcgagta ataagcgaag tgtagaatta aggattttga tgatagaagt gtagagttat 1080 tttgtgaaaa cgtaggttaa tacttggcgc cgacacctgg acgcacgagg cgtaaggtgg 1140 tcgggcaatt ctaggataag tcgttcggcg tttgcggaat tcgcgtgcga cgtaattgtg 1200 tgagtgagtg agagacgaaa agtgcgagcg agtgtgtgtg atctggttgc ttcttgagcc 1260 ttgagacgcg cgcgagtcac gcgggatatc tttcgtgtac gggtatttgc ttgtcggagt 1320 tgcctttgtt cacgggttat cttagtaatt tgtacgagtc gattataaat tgtaagaatt 1380 tawtaagaat atacagttcc gtctgataaa ccacgtgtct ctattatccc gaaccgccct 1440 ctctccttac cgttttcggg gctgcagcta gccgagcaaa ccacgtgcaa aaaaccctcg 1500 cgccgggaaa caaagagtcg cgagactctg gcgctcacga ggagcgcctg gcgccaccgg 1560 gttattttcc aagccccggg cgagggattg caccataact ggggtagcac gaaaacccca 1620 gttaca 1626 // ID BEL-643_AA-I repbase; DNA; INV; 5946 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-643_AA_; KW BEL-643_AA-LTR; Pao_Bel_Ele220; BEL-643_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5946 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5000-5548] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 14..5946 FT /product="BEL-643_AA-I_1p" FT /translation="MSQSDLTSNLSDPEDLDLTTTPCAACHQLSTTNEPMV FT GCDACSRWFHYRCVGVTEAVKKEKRWFCSESACQDAAKKSKRSTGKRNTGR FT SSTKSDQAQGITPEQKLKSMEEEFAKKMEELEYERVFREKEMEFQRALKEK FT KMQMENELREKELAQEKVLLDRALEDKTEHFEKMKTMRMSYQKCMDGLDEE FT MLHMRKPKPELFGNEARKTIQPGTSNEKQHPVRKLNMQGGGQNADSKGIQP FT SKMQNQANPSNGGEECYDEDDEDEEIEEEAADGEAVERADEEDYQGEEVAD FT VIGAVGGHNVDPYGLGQQRIGPTKSQLAARSAVSKRLPVFTGKSEEWPLFY FT GTYVASSQACGFSDVENLVRLQDSLKGPALESVRGQLILPKSVPKIIEKLR FT QLYGRPELILQSHLERIRKLEPPKPEKLASFVPFGNAIEQLCEHLEAAGLR FT QHMINPILIQDLVDKLPATDKREWVRFKRTKRQVTLRTFTDFVSRIVEEAC FT EANVCMDQKQEPKIMRGNRLGSGTTKGRNMEKGMLLNHDASNSTTGAPAER FT TTLKACKACQRNDHRLRFCEDFRRMTYADRMKIVTRWKLCNVCLNDHGNAT FT CKFKIRCDVSGCQQRHNSLLHPVGGVIGTSAHIRTTSSILFRILPVRLFCG FT EKTITVLAFLDEGASVTLIERQFVDQLGIVGVPERLTITWTADISREEKGA FT KRVCVWTSGIGNNEQLLLNNVYTVENLRLPMQSLDAAVLSEQYKHLKNLPI FT TSYSDARPGMMIGLNNLHSFAPIEVKHGAPGEPIAVRCKLGWTVYGPRKEG FT ASSQNAVLGFHVGVSNEDLHDLIKTHYALEESVMGVKMESPEDERALDILR FT RTTKRSGNRFETGLLWKSDQVSFPNSYPMAVKRLIQLERKLGQKPLLYENV FT RKQILEYQQKGYAHLASEEELNATDPNKVWYLPLNVVLNPRKPDKIRLVWD FT AAATVNGVSLNSQLLTGPDLLTPLISVVTRFREHRIAFGADIREMYHQLRI FT IEADKQAQRFVFRMSKEGPINIYVMDVATFGSTCSPCSAQYVKNQNALEYA FT EEYPAAAAAIIDGHYVDDYFDSVDTIDEAIERAKEVSFVHAQGGFELRNWV FT SNSPDVLRSLGEVTETKPIHLGRVKETGSERVLGIIWNPDHDTFSFSSEHR FT EHLQVYLSGIKMPTKRIVLSCVMGFFDPLGLLAVFTVHGKILVQDLWRTGC FT AWDDTVNEDCWKKWKRWIGLLPEVEAIRIQRCYFGEVPWSSVESVELHIFT FT DASELAYGCVGYLRTVVEGFVRCCLIMSRSKVAPLKRQSIPRLELMGAVMG FT ARMLHSIISTHSIKFQRYVLWSDSQTVLSWISSDQHKFKQFVAFRIGEVLE FT LTRTTDWRWIPSKLNAADALTKWGSGPPLESDGSWFTGPRFLLDPEEKWPS FT QAEPAEKTEEETKEETRAFILYHRASLVEEVIDSERFSQWTRLLRSTATVL FT RFIDNCKRKKTGQPLLTSKATVNQLSMIRAKRLTICQPLKQEELVQAENVL FT WRLVQSQSFADEVAILLKNNARSANQPPISLEKSSMLYKLTPVLDANGVIR FT MGGRMEASRVLPFDMKYPIILSRTHDVTRKLILHYHERFGHAYQETVVNEL FT RQRFHIPNLRATIGHVVRGCIKCKVMRSRPRVPMMGPLPVQRISPQLRPFC FT AVGVDYLGPIEVTVGRRSEKRWIALFTCLTVRAVHLEIAYSLSSQSCLMAI FT RRFICRRGRPEEFFSDNGTNFTGASKELLKQIDGDCAEAVTSSATKWNFNP FT PGAPHMGGIWERMVKSVKEAMTVLNDGRKLTDEILTTTMSEAEDMINTRPL FT TYRSLEPAETEAITPNHFLRGAVRGVDVNQDGPTELAEALRNMYKRSQYLA FT DQMWERWYKEYLPTINQRTKWFDDPKALQIGDLVFVMDGKHRKNWTRGVVE FT EVFAGKDGRIRQANVRTSKGVYRRATANLAVLEIQDGKFGSSVMPEPMFTG FT WGT" XX SQ Sequence 5946 BP; 1735 A; 1275 C; 1601 G; 1335 T; 0 other; aactcaaaaa agtatgagtc aatcggatct aacctcaaat ctctccgatc ctgaagattt 60 ggatctcacg acgactcctt gtgcggcctg ccaccagttg tcaacaacaa atgaacccat 120 ggtcggatgc gatgcgtgta gccggtggtt ccactatcgg tgtgttggag tgactgaggc 180 cgtaaaaaag gagaaacgat ggttctgttc ggaatctgcc tgccaagacg cagcaaagaa 240 atcaaaaaga tccaccggca aaaggaatac cgggcgttct tctacaaaat cagaccaggc 300 ccaagggatc accccggagc aaaagctgaa atccatggag gaggaattcg cgaagaaaat 360 ggaagagctg gagtacgaga gagtattccg tgagaaggaa atggagttcc agcgcgcgct 420 taaggagaaa aagatgcaga tggaaaatga actgcgcgaa aaggaattgg cccaggaaaa 480 ggtcttactc gaccgagctc tggaggataa aacggaacac ttcgagaaaa tgaaaacgat 540 gcggatgtcc taccagaagt gcatggatgg attggacgaa gaaatgttgc atatgcgaaa 600 gcccaagcca gaactgtttg gcaatgaagc tagaaagacg attcaacccg gaacttctaa 660 cgaaaagcaa cacccggtac gtaagctgaa tatgcaaggt ggtggacaga atgctgattc 720 gaagggcata caaccgtcga aaatgcagaa ccaggctaat ccgtcgaacg gcggagaaga 780 gtgttacgat gaggacgacg aagatgaaga aatcgaagaa gaagcggccg atggtgaagc 840 agtggaaagg gccgacgaag aagactacca aggtgaagaa gtagcagatg tgataggtgc 900 cgtaggaggc cataatgtag atccatacgg gctggggcag cagcgtattg gccctactaa 960 aagtcagctg gcagcacgaa gtgctgtttc gaagagactg ccggtgttca ccggaaaatc 1020 ggaagagtgg ccacttttct atggaacgta tgtagcatca agtcaagcat gtggattcag 1080 cgacgtggag aatcttgtga gacttcaaga cagtttgaag ggaccagctc tcgaaagcgt 1140 gcgcggtcaa ttgatacttc cgaaatctgt gccaaagatt atagaaaagc ttcgacaact 1200 ctatggacgt ccggaactga tcctccaaag tcatctcgag cggattcgaa agttagaacc 1260 tccgaagccg gagaagctgg catcgtttgt tcccttcgga aacgccatag aacaactatg 1320 cgagcaccta gaagctgcgg gtttgcggca gcatatgatc aatccgatcc tgatacaaga 1380 tttggttgat aagctcccag caaccgataa acgggagtgg gttcggttca aaagaacaaa 1440 gagacaggtg acgttgagaa cgtttaccga tttcgtttcg cgcatcgtag aagaagcatg 1500 tgaagcaaat gtttgcatgg atcagaagca agagccgaag ataatgcgag gaaaccgcct 1560 tggcagcgga actacaaaag gcagaaacat ggaaaagggc atgttgctga atcacgacgc 1620 ttcaaatagt accaccgggg ctccagccga gaggacgacg ttaaaagcat gtaaggcatg 1680 ccagcgcaac gatcaccgcc ttagattttg tgaggatttt cgcagaatga cctatgcaga 1740 tcgaatgaaa atcgtgacta ggtggaaatt gtgcaacgtc tgtttaaatg accatggaaa 1800 cgcaacctgc aaattcaaga tccgctgtga cgtcagcggc tgccaacaac gccacaactc 1860 gctgctccac cctgtcggcg gcgttatcgg gacgagtgcg cacattcgaa caacaagctc 1920 aattctcttc cgaatccttc cagtgcgact gttttgcgga gaaaaaacga tcactgtcct 1980 agcctttctg gacgagggtg cctcagttac tttaatcgag cggcagttcg tggatcaatt 2040 aggcattgtg ggagttcctg agcggttgac aattacttgg acggccgata tatctcgcga 2100 ggaaaaaggt gcaaaacgcg tttgtgtatg gacatcgggg attggtaaca acgagcaact 2160 tttgctgaac aatgtgtaca ccgtagaaaa cctacgtctc ccaatgcaat cgctggatgc 2220 agcggtatta tcggagcagt acaagcattt gaagaatctt cctatcactt cgtacagcga 2280 cgctagaccc ggaatgatga tcggtctcaa caacctccat tcattcgcgc cgattgaagt 2340 gaaacatggt gctcctggag aaccgattgc ggtacgttgc aaacttggat ggacggtcta 2400 tggaccaagg aaggaaggcg catccagtca gaacgcagtt ctaggtttcc atgtaggggt 2460 cagcaatgaa gacttgcatg atttaattaa gacacactac gcgttagaag agtctgtgat 2520 gggtgtgaaa atggagtcac cagaagacga aagagccctt gatattctgc gacgaaccac 2580 caaacgaagc gggaatcggt tcgaaactgg tctactttgg aaaagcgatc aagtgagttt 2640 tcctaacagc tatccaatgg cagtcaaaag gctgatacag ctggaacgga agttgggtca 2700 gaagccattg ctttatgaga acgtacgtaa gcaaatactg gaataccaac aaaaaggcta 2760 cgctcactta gcatcggagg aggagcttaa tgctactgat cccaacaaag tttggtatct 2820 accgctcaac gttgtcctaa atccacgtaa gccggacaaa attcgtttgg tctgggatgc 2880 tgccgcgact gtcaatggag tatccctaaa ctcgcagttg ttaacaggac cggatctgtt 2940 gactccgttg atttccgtgg ttactaggtt ccgagaacac cggatcgctt tcggcgcaga 3000 catccgcgaa atgtatcacc agttgcgaat cattgaagct gacaaacaag cacagcgttt 3060 tgtgttccgg atgagtaaag aaggccccat taacatctat gttatggacg tggccacttt 3120 tgggtcgaca tgctcgccat gttcggccca atacgtcaaa aaccagaatg cactggagta 3180 tgccgaagaa tatccagcag cagcagcggc aattattgat ggacattatg tggacgacta 3240 tttcgatagc gttgacacca tcgacgaggc gattgaacga gcgaaagaag tcagcttcgt 3300 ccatgcacaa ggtgggtttg aactcagaaa ctgggtttca aattcgccag atgtacttcg 3360 cagccttgga gaagtgacgg aaactaaacc aatacacctt ggtcgagtga aggaaaccgg 3420 tagcgaaaga gttcttggga tcatatggaa tccagaccac gatacgtttt cattttcgtc 3480 ggagcatcgt gaacatcttc aagtgtactt gagcggtata aaaatgccaa cgaaaagaat 3540 agtcctcagt tgcgtcatgg gattttttga tcctcttggt ctgctggcag ttttcacagt 3600 ccatggcaaa atcctcgtac aggacctatg gcgaacgggt tgtgcttggg acgatacggt 3660 gaatgaagac tgttggaaaa aatggaaacg ctggatcggg cttcttccgg aagtggaagc 3720 aatacgcatc caacgctgct acttcggaga ggtaccctgg tcgtctgtcg aatcagtaga 3780 gctgcatata ttcacggacg ccagtgaact tgcgtatggc tgtgtgggat acctgcgaac 3840 ggtcgtcgaa ggatttgttc gatgttgctt gataatgtct cgctctaaag tagcgccgct 3900 taaacgacaa tcaattcccc ggctggagct aatgggtgcc gtgatgggcg cgagaatgct 3960 gcactcgatt attagcactc actccataaa atttcaacgg tacgtcctat ggtctgactc 4020 gcaaaccgtg ctaagttgga tatcctctga tcaacacaaa tttaagcaat tcgttgcttt 4080 tcgtatcggg gaagttttag aattgacaag aactactgac tggcgttgga tcccgtcgaa 4140 attgaacgca gcggatgcct tgacgaagtg gggaagtggt ccgcctttgg aaagcgacgg 4200 ttcctggttc accggtccta gatttctgct cgatcctgaa gaaaagtggc cctctcaagc 4260 agaaccagcc gagaagactg aggaagagac aaaagaagaa acgcgcgctt tcatactata 4320 ccatcgagcg tcgttggtgg aggaagttat cgacagtgaa agattttctc aatggacacg 4380 acttctgcgg agtacggcaa ctgtgctgcg gttcatagac aactgcaaac ggaagaagac 4440 cggccaaccg ttgttgacgt cgaaggctac ggtgaatcaa ctgtccatga ttagagcgaa 4500 gagattgacg atttgccaac cactgaaaca agaagaattg gtgcaggctg aaaatgttct 4560 gtggaggctg gtacagagcc agagctttgc agacgaggtg gcgatactac tgaaaaataa 4620 tgcaaggagc gccaaccagc caccgatttc actcgaaaaa tcaagtatgt tgtacaaact 4680 gacacctgta ttagatgcaa atggagtcat taggatgggt ggtaggatgg aagcttcaag 4740 ggtcctgcca tttgacatga agtacccaat cattctctct agaacccacg acgtcacacg 4800 taagttgata ctacattatc atgaaaggtt tgggcacgct tatcaggaaa ctgtcgtaaa 4860 tgaattgcgg caaaggtttc acatccctaa tttacgcgct acgatcggtc atgtcgtacg 4920 agggtgcatc aaatgtaaag ttatgcgcag tcgtcctcgt gttcctatga tgggccctct 4980 gcctgttcaa cgaatttcac cgcaattacg tccattctgc gccgtaggag tggattatct 5040 aggccctatt gaggtaactg ttggccgaag atctgaaaag aggtggatag cactttttac 5100 gtgtttgact gtcagagcag ttcatttaga aattgcttat tctctttcgt cgcaatcctg 5160 tctgatggca attagaagat tcatctgtag acgaggacgg cccgaggaat ttttttctga 5220 caacgggacg aattttacgg gagcgagcaa ggagttgctg aagcaaatcg atggtgattg 5280 cgctgaagca gtgaccagtt ccgcaacgaa gtggaatttc aatccgcccg gtgcgccaca 5340 tatgggtgga atctgggaga ggatggtaaa gtcggtcaag gaagcgatga ctgtgctgaa 5400 tgacggaaga aagcttaccg acgaaatcct aacgacaacg atgagtgaag ctgaagacat 5460 gataaacacg cgcccattaa cctacagatc actagaacct gcggagacgg aagccataac 5520 tcctaatcat tttctacggg gggctgtacg tggcgtagac gttaatcagg atggtccgac 5580 ggagctagct gaagcgttga ggaacatgta taagcggtca caatatttag cagatcaaat 5640 gtgggagcga tggtataaag agtacctgcc tactataaat caacgcacca agtggttcga 5700 cgatccgaag gctcttcaaa ttggtgacct cgtattcgta atggacggga aacaccggaa 5760 gaattggact agaggagttg ttgaagaagt gtttgctggg aaagacggac gaatcaggca 5820 ggcgaatgta agaacgtcca aaggagtgta tcgtcgagcg acagcgaact tggcggtgct 5880 ggaaatacag gacggtaaat tcgggtcgtc agttatgccg gaaccgatgt ttacgggctg 5940 gggaac 5946 // ID MuDr-1x_AP repbase; DNA; INV; 3643 BP. XX AC Contig21440; XX DT 10-MAR-2008 (Rel. 13.03, Created) DT 10-MAR-2008 (Rel. 15.12, Last updated, Version 2) XX DE A distinct, diverged MuDr-type family. XX KW MuDR; DNA transposon; Transposable Element; MuDr-1x_AP. XX NM MuDr-1x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3643 RA Jurka J. and Bao W.; RT "Highly diverged MuDR-type families."; RL Repbase Reports 8(3), 237-237 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Related families are present in other invertebrates. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS 628..2874 FT /product="MuDr-1x_AP_1p" FT /translation="MSENQEFICTECGKKFKFSKTLRTHLKKAHPLIDVYQ FT IAPTKKLKNNELYIYTCDLCDKSYCHKRNLDEHKKVAHCEKKETVVALVNC FT SMCSFTTLCTSLMADHYSSVHNIVMQSNKFNFNSLDDFKTWKHDIEQKTNA FT YYVKNCGLAKNFNENYIYIYYKCHSNGYFSSKSTGIRHIKSQGSNKINGYC FT PASMNVILSECTKQCTVTFIDTHVGHLNDLGNLPLDKVTKDIIASKISEHI FT PFEHILDEVRDNISNNVLERTHLLTKKDLYNIEASYNLNNESVLHKNDALS FT VESWVQTVRSDDRFSLVYYKPQDNIDPLFPNLKKEDFVLIIMNNYQKSMLE FT KFGNDVICIDGMNSYNFNLTTVLILDDMREGFPCAFMISNRVDEGVLKILF FT SQIRALTGLIKPKVFMSDMAECFFNAWLVEMKQPKFRIYCTWHVDRAWSKN FT LTKVKLKEKQAEVYKIIRTLLHEQDAKAFENIFESAINQMSADEQTCEFAN FT YFVSQYGNYVQSWAYCHRIHAGVNTNMHIERMHCTLKHIYLQGKKVKRLDK FT SLYALMKFIRDRSIDRLIVIHKGKITSKIKELRKRHKHSLEMSHEMVMKAT FT EDSWDIISKKYSEMYTVNRLKISCDCQIQCQDCLFCIHCYTSSCIDSAIKW FT NMCKHIHLVCQFQYSISNQKNNGMLTNCLVVNNNLNNMPNTDGEHNNETSN FT ILSQLNNSAITSQLSLELEKRKLQESFNKVLNEISTIEELNVLKKLFYL" XX SQ Sequence 3643 BP; 1392 A; 479 C; 516 G; 1256 T; 0 other; acaagcgtct aggtacagag aaccgaatgg aatcaaaatg gcgggttttt atcattaatt 60 tacttttaga taagaaaaat acacaatctg ataacagacc ttatcataat aattattaca 120 ggctacataa ttaatatagg tacctaatta aataattaat attataactt attttttaat 180 ttacattatt tggattgtta aggtaattta agatattgct ataccatata attgttacct 240 gttggtgatt tatgtatttt ttaattaaat gtttatacat taatatttat aaactaataa 300 atttgttgtt attgttttgt tttatacttg tatgttaaat aatttaatat aatttatcca 360 aaattatttt cctttgaata tgattcttct actatgtctg aaaatgttga gtaaatgctg 420 caatgttgtg attcagttgt tgttttgttc ggtttcatgt taaataaatt aacataaatt 480 attcaaaatt atttttctac catgtctgaa aatgttgagt aactgcaatg ttgtgattca 540 gttgttgttg ttttgttctg tttcatgtta aataaattaa cataatttat ccaaaatact 600 tttttttttt aataggatct aggtactatg tctgaaaatc aagaatttat atgtactgaa 660 tgcggaaaaa agtttaaatt ttcaaaaacc cttagaaccc atttaaaaaa agcgcatcct 720 ctaatagatg tgtatcaaat cgccccaaca aaaaagttga aaaataatga attatatata 780 tatacatgtg acttgtgtga taaatcatat tgccataaga gaaatctcga tgagcataaa 840 aaagtggcac attgtgaaaa aaaagaaacg gttgtagcct tggtgaattg ttcaatgtgt 900 tcttttacta cattatgcac atcattaatg gctgatcact attcatcagt acacaatata 960 gtaatgcaga gtaataaatt taattttaac agtttggatg attttaaaac atggaaacat 1020 gatattgagc agaagacaaa tgcatattat gtaaaaaatt gtggtttagc taaaaatttt 1080 aatgaaaatt acatttatat ttactataag tgccatagca acggttattt tagttcaaaa 1140 agtactggta tacgacacat aaaatcccaa ggctcgaata agataaatgg atactgccca 1200 gctagtatga atgttatttt gtcagaatgc acaaaacaat gtacagttac atttatagat 1260 acacatgttg gtcatctaaa tgatttagga aacttaccac tagataaagt aactaaggat 1320 attatagcta gtaaaatttc tgaacacatt ccgtttgaac acatacttga tgaagttcgt 1380 gacaatattt ccaacaacgt gttagaaaga acacatctat taacaaaaaa agatctttat 1440 aacattgaag catcatacaa tttgaataat gaatcagtct tacacaaaaa cgatgcattg 1500 agtgttgaat cttgggttca aactgtgaga agtgatgata gattttcctt agtatactat 1560 aagccacaag acaatataga tccactattt ccaaacctaa aaaaagaaga ttttgtctta 1620 ataataatga ataattatca aaaatcaatg ttggaaaaat ttggtaatga tgtaatttgc 1680 attgatggca tgaattcata taatttcaat cttaccacag ttttgattct tgatgacatg 1740 agagaaggtt ttccttgtgc tttcatgata agtaataggg ttgatgaagg tgtattaaaa 1800 attttatttt ctcagattag agctcttaca ggactaatta aaccaaaagt atttatgtct 1860 gatatggcgg aatgtttttt taatgcatgg ttggttgaaa tgaaacaacc aaaatttaga 1920 atatattgta catggcacgt agatagagca tggagtaaaa atttgactaa agttaaattg 1980 aaggaaaaac aagccgaagt ttacaaaatt ataagaacat tattgcatga acaagacgct 2040 aaagcatttg aaaacatttt tgaaagtgct attaatcaaa tgtcagctga tgaacaaaca 2100 tgtgaatttg ctaactattt cgtaagccaa tatggaaatt acgttcaatc atgggcatac 2160 tgccatcgaa ttcatgctgg agttaatacg aatatgcata tagaacgaat gcattgtaca 2220 ctaaagcata tttacttaca gggaaaaaag gtaaagcgcc ttgacaaatc tttatatgct 2280 ttgatgaagt ttattcgaga tcgttctata gatagattaa ttgtaatcca taaaggaaaa 2340 atcacatcga aaataaaaga actgcgtaaa aggcacaaac acagtctaga aatgtcacat 2400 gaaatggtta tgaaagcaac agaagatagt tgggacatca tatctaaaaa atatagtgaa 2460 atgtatacag tgaacagact taaaatttcc tgtgattgtc aaatacaatg ccaagactgt 2520 cttttttgca ttcattgtta tacaagctct tgtatagata gtgcaattaa gtggaatatg 2580 tgcaaacata ttcatctagt ttgccaattt caatatagta taagcaatca aaaaaataat 2640 ggaatgttaa caaactgttt agtcgtgaac aacaacttaa acaacatgcc taacacagac 2700 ggtgagcata acaacgaaac ttccaatatt ttaagtcaat taaataactc agccattaca 2760 agccagctat cattagaatt ggaaaaaagg aaactgcaag agtcattcaa taaagtgttg 2820 aatgaaatat ctaccattga agaactaaat gtcttgaaaa aactttttta cctataatac 2880 caacattaac agcaataaga aataattcaa atacaccaac cttaaaaaga aaaacttctc 2940 agacatcacc aacaaataaa aaaatcatac cacaacgccg tttatactcg accaagaaaa 3000 caaggaaagt attagaaaag gtattagaaa caccaagcag aacagaacaa aatcatatca 3060 gtgcatcatt aatattcaaa aataatactg cttaatagtt aatgattatt acctatttag 3120 tatttatata ttactttata cacagcgtca aactgacaaa atcacaaatt aattaattta 3180 tacatgacta tattattatg atctgacgaa taatcgattt ttatataata tttttgttta 3240 ttactttaat tgatttttat ataataattt ttgtttatta ttttaattga gttttacata 3300 atattatttg tttattttat acactgagtc aaactgacaa aatcacatat taattaattt 3360 atacatgact atatcattat gatctgacga ataatcgatt tttatataat atttttgttt 3420 attactttaa ttgattttta tataatatta tttgtttata ctttatacac cgagtcaaac 3480 tattattata ttaattatta tgtatttttc ataattttgt ataatgatat tctatcagct 3540 gtagcttgta ataaatattg tatatatttc ttatctaaat gtaaattaat gataaaaacc 3600 cgccattttg attccattcg gttctctgtc cctagacgct tgt 3643 // ID Kolobok-13_HM repbase; DNA; INV; 2751 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2751 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 422-422 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 394..2193 FT /product="Kolobok-13_HM_1p" FT /translation="MSTSRNNIKVSNKQNFGRKKKRFFHGNRYTIAAERQQ FT KSNLNIVPASSCASARKLLPLINRNQVVNDKVNKDYFILINFSILSDIVED FT VRCPDCYSLVTIVDNLSKRMGFAHHLEIKCSLCEWKKVLYSSKQISKFNGN FT TAQGRKTFEANTRMIIGFREIGKGYQSMKNFSRCMNLHCLTATPFRKLNIE FT ISKAYDKTAKESMKRATTEMKDNYTDDNKPIMKHVKIDGAWQRRGHTSLNG FT FVSAIVGDKCVDYEAMSKFCMGCRIWKNKKASPRYNAWKAQHACHINHKQS FT SGSMEAAGAVKIFARSIKNNNIIYTHYLGDGDSSSFKEVIKSDPYKDYKII FT PEKLECVAHVQKRMGTRLRNLIKSYKGKPTPLGGKNKLTENVINSMQNFYG FT LAIRSNTNNIYAMKKAIWAILFHCTGYSNQEFRHCFCPRTETSWCQYQYDK FT LHNTSNYKEHINLPLWISNLLKPVFKDLSNEELLLKCVHGQTQNANESLNA FT LXWSRCPKNIFVCKQTFEMSINSAILHYNDGTEGVKNVLLXFGLSGVVTNS FT KSYEQNVSRVKSMEIKSSEKVKNRRKSLRAIKKRYSDKQSQNKTSFYVSGG FT F*" XX SQ Sequence 2751 BP; 1027 A; 347 C; 428 G; 947 T; 2 other; ggtggtagag ctagaaaaaa tatgtaaaaa tttttaaaaa atcgtttttt tttttatttt 60 ttatttttga aggaaaatta cttagctttc atgtactgaa aaaatttttt taaaaaatta 120 tttcttattt agtttttaaa gcttttatgt aactacatac aacgactagt ccttagttac 180 gcccttagca acggtaaaca gtttttttac cttgatacct ttccttaaag aacaacaaga 240 tttctatcaa acactgataa acgtgcaatt gttaaaggtc attaacttaa ggaacattgt 300 aaattttggc agttttttga agcaacttta gtgcatgctg tttaattttg atgttataaa 360 ccttttcaga tttcacctca agtattataa ataatgagta cttcaagaaa taacattaaa 420 gtatctaata agcaaaattt tggacgaaaa aagaaaagat tttttcatgg aaatcgttat 480 acaattgcag cagaaagaca acagaaaagc aatctgaata ttgttccagc ttcaagttgt 540 gcaagtgcaa gaaaattact gccactgata aatagaaatc aagttgtaaa tgacaaagtg 600 aataaagatt atttcatact tatcaatttt tcaattttaa gtgatattgt ggaagatgtt 660 agatgtccag attgttacag ccttgttacc attgttgata atctttctaa gcgcatgggt 720 tttgcccatc accttgaaat caagtgtagt ttatgtgaat ggaaaaaggt tttgtattct 780 tctaagcaga ttagtaaatt taatggaaat acagcccagg gtaggaaaac atttgaagca 840 aacacaagaa tgataattgg atttagagaa ataggtaaag ggtatcaatc tatgaaaaac 900 tttagtcgct gtatgaattt acactgtttg acagctactc catttagaaa actaaatata 960 gaaatatcaa aagcttatga taaaactgct aaagaaagta tgaaacgtgc cacaactgaa 1020 atgaaagata actatactga tgataataaa ccaataatga agcatgttaa gattgatggt 1080 gcatggcaac gacgcggcca tacctccctt aatggatttg taagtgcaat tgtgggtgat 1140 aagtgcgttg attatgaagc aatgtcaaaa ttttgcatgg gctgtagaat atggaaaaat 1200 aaaaaagctt caccacgata caatgcttgg aaagctcaac atgcctgtca tataaatcat 1260 aaacaatctt ctgggtcaat ggaagcagct ggtgctgtaa aaatatttgc acgttcaatt 1320 aaaaataaca atattattta tactcattat cttggagatg gagatagctc atcttttaaa 1380 gaagtaataa agtctgatcc ctataaagac tataaaataa ttcctgaaaa gttagagtgt 1440 gttgcacatg tacaaaaaag aatgggtaca cggttacgaa atttaatcaa aagttacaaa 1500 ggaaagccaa ctccactagg tggaaaaaat aaactgacag aaaatgtgat taatagtatg 1560 caaaattttt atggtttagc tataagaagt aatacaaata acatttatgc catgaaaaag 1620 gcaatatggg ctattttgtt ccactgtact ggatattcaa atcaagagtt tcgccactgc 1680 ttttgcccaa gaactgaaac tagttggtgt caatatcaat atgacaaatt acataatact 1740 tcaaattata aagaacatat taatttacca ttatggatta gtaacctttt gaaaccagtg 1800 tttaaagatc tatcaaatga agaattattg ttaaaatgtg tacatgggca aacccaaaat 1860 gcaaatgaaa gtcttaatgc attgrtttgg tctcgttgcc ctaaaaacat atttgtttgt 1920 aaacaaactt ttgaaatgag tattaattca gcaattttac actacaatga tggtactgaa 1980 ggtgttaaaa atgttttact ttmatttgga ctatcaggcg ttgttacaaa ttctaaatct 2040 tatgaacaaa atgtttcacg tgtaaagagc atggagatca agtcaagtga aaaagtaaaa 2100 aatagaagaa aaagtctcag agcaataaaa aaaagatact ctgataaaca aagtcaaaat 2160 aaaacatcat tttatgtatc aggagggttt taaacatatt gtatatatat ttttaaacat 2220 attacaactt tattttgcgt tttattgaac tttgattttt tgcaagattt tgtaacttca 2280 aaaccaaata tcttaagtta gaaaaattca atttacttga aattttcaca gtatcttcag 2340 tatacatata aagttcattt gaactaaaat cttataataa ttaaatgtat ttattgctta 2400 ttatttgttt tttattctca atttggtatt ttttttttat gcatacataa taaattatgt 2460 atatttttta ttctattaag attcaatgtc attttagttc aaatgaatta tctattgttg 2520 aagaacaaac ctacaaaaat tcattgtgat ttgttgatta tattttgtga tattgtgttt 2580 taaaatttcc ttattttttg gtttattttt ttatataagt atgggtaggc gaggcaaatt 2640 aataaaaaaa aattgtcaaa tgatttttaa cataaattca aactttttta gaagttcttg 2700 ggaaataata atagttttta aaagaattag gttttttcta gtcctaccac c 2751 // ID Mariner-N1_AAe repbase; DNA; INV; 1158 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Mariner/Tc1 DNA transposon family from Aedes DE aegypti. XX KW Mariner/Tc1; DNA transposon; Transposable Element; nonautonomous; KW Mariner-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1158 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1287-1287 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. TA TSDs. 405-bp TIRs. XX SQ Sequence 1158 BP; 352 A; 225 C; 208 G; 373 T; 0 other; tacactgata ggcaaaataa agtgcccacc ttaccagttt tcgaatttct ctcattgatt 60 tggttcaaat taaagttaac acacttaaat cttttttgat attttatttt tgatattctt 120 ttaaagttgc acttacgaaa atttgataaa aaaaagaatt ttactcaaag aaaaagaaaa 180 tcaatttgta ttgaaaaaaa aatagtgaca aaataaagtg cccacttcct tcttggcccc 240 agaaaagatg atttaagaaa aataaaaagc aatttaatag ttaatgtgtc ctcctttggc 300 cttaaggact tgctggaggc gcttcggcat gcttttcacc aggttttgta ggtgttgtgg 360 ttctagttct tcccaggcgc gctccaaggc ttcaaaataa ttatttttgt tggtaacacc 420 agttttgtca accctggcat cgagaatcgc ccacaaattc tcgatggggt tgaggtctgg 480 gctttgtgga ggccattcca gcggtttaat ccgacaagac cggaagaaag acttggtctt 540 cttggcagta tgcttcgggt cgttgttcta gagaaatatg aatttctctt caaggcccgt 600 ctggatcagc gaaacctcca gattttcccg caagatgttg atgtaggaat ctgccgtcat 660 tattccgtcg attttcacga ggcttcctac tccactccat gaaaaacacc cccagaccat 720 cacatttcct cctccatgct tcaccgttcc ttgaataatt attttgaagc cttggagcgc 780 gcctgggaag aactagaacc acaacaccta caaaacctgg tgaaaagcat gccgaagcgc 840 ctccagcaag tccttaaggc caaaggagga cacattaact attaaattgc tttttatttt 900 tcttaaatca tcttttctgg ggccaagaag gaagtgggca ctttattttg tcactatttt 960 tttttcaata caaattgatt ttctttttct ttgagtaaaa ttcttttttt tatcaaaatt 1020 tcgtaagtgc aactttaaaa gaatatcaaa aataaaatat caaaaaagat ttaagtgtgt 1080 taactttaat ttgaaccaaa tcaatgagag aaattcgaaa actggtaagg tgggcacttt 1140 attttgccta tcagtgta 1158 // ID Gypsy-173_AA-LTR repbase; DNA; INV; 212 BP. XX AC supercont1.174; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-173_AA_; KW Gypsy-173_AA-I; Gypsy-173_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.174; Positions 873246 873035. XX SQ Sequence 212 BP; 87 A; 45 C; 27 G; 53 T; 0 other; tgtagtgtac acccatttct caacattata aattagagtg taccatcaaa tgacaacttg 60 tcagtcgaat tatacattgt aatttatata cctaccaata catacaaagc tataaaagcc 120 tcgaaaatac aaatgaagag tcacttcgaa tcagactagc agcaagtaaa caagtgtctt 180 aaatagttaa acccgaaacc ccaaacacag ca 212 // ID DSAT repbase; DNA; INV; 182 BP. XX AC . XX DT 09-AUG-1999 (Rel. 4.07, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Drosophila tandemly repeated satellite DNA (a consensus). XX KW SAT; Satellite; Simple Repeat; DSAT; tandem repeat. XX NM DSAT. XX OS Drosophila OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae. XX RN [1] RA Bachmann L.; RT "DSAT."; RL Direct Submission to Repbase Update (05-NOV-1991)L. Bachmann, RL Institut fuer Biologie, Lehrstuhl Fuer Populationsgenetik, RL Universitaet Tuebingen, Auf der Morgenstelle 28, D-7400 RL Tuebingen, FRG. XX RN [2] RA Bachmann L. and Sperlich D.; RT "Gradual evolution of a specific satellite DNA family in RT Drosophila ambigua, D. tristis, and D. obscura."; RL Mol. Biol. Evol 10(3), 647-659 (1993). XX RN [3] RP 1-182 RA Jurka J.; RT "DSAT."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [3] (Consensus) XX SQ Sequence 182 BP; 54 A; 35 C; 44 G; 49 T; 0 other; ctgcagagtt agagtgaaac agggatattt actggcagat tacgttatag tggctatatc 60 ctttatgtcc tgcaatccaa attgattgca aaagatcctg cacattctcc agaaggcatt 120 gacacattca ggacgttgta cagatggcta ggaggcgtgg catgtcctga tccgataaga 180 aa 182 // ID Harbinger2-1_HM repbase; DNA; INV; 2559 BP. XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 09-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE A family of autonomous Harbinger transposons - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-1_NV; KW Harbinger2-1_HM. XX NM Harbinger2-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2559 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1215-1215 (2010). XX DR [1] (Consensus) XX CC Harbinger2-1_HM belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-1_HM is a consensus sequence of a family of autonomous CC Harbinger transposons that were active in the hydra genome in a CC last few million years.The consensus sequences was derived from CC several copies of that are ~98% identical to it. The CC Harbinger2-1_HM transposon is characterized by 34-bp imperfect (2 CC mismatches) terminal inverted repeats, 3-bp target site CC duplications (TNA), and it encodes two proteins: (i) the 358-aa CC Harbinger2-1_HM1p transposase and (ii) the 230-aa CC Harbinger2-1_HM2p Myb-like DNA-binding protein. XX FH Key Location/Qualifiers FT CDS 270..1343 FT /product="Harbinger2-1_HM_1p" FT /note="Harbinger TPase." FT /translation="MIKLSQVRNQLLIFNSDNTINDEELLLLYDVNKSNNL FT NLSYQSYPRFNLDNMSVDEAKSEFRFLSKDIYEMIDLLNIPEKITCYNGIT FT ISADEAFCIFLKRFAYPCRYQDMIPRFSRPVPQLCMISNHIMNLLFAQWGH FT LLTNLNQGWLDTQHLEMFAAAIHAKGAPLTNCWGFIDGTLRPISRPREHQR FT ILYNGHKRCHGIKFQSLVAPNGLIANLYGPVEGKRHDSGMLADSNLLNKLA FT VCSFNSNREPLCVYGDPAYPHRVNLQCGFKGANISPEQLIWNKNMSKVRVA FT VEWVFGDIVNYFKFLDFKKNLKVGLSPVSKMYLVCALMHNARVCLYGSTTT FT TYFDCQPPSLSDYFK" FT CDS 2077..1388 FT /product="Harbinger2-1_HM_2p" FT /note="Myb-like DNA binding protein." FT /translation="MRWTQLHDYTLLMEVLTFEPWQFKRGSKERGDSWEKI FT SNSLNGLPEPYFKVTGRSTRDHINLLIEKFKKKDTKEKNASGITCDLTDYD FT VAIADVYERFQQSESIFKEQLEHNNGKIDVDNVQAIEMRKRSLETHNETEK FT RNKGQELLKKSRNNSNETIAYLKEKNEIEVNIRKEELEIKKREVEAQANTM FT QHILQQQNTLMQQQNNMMQAMMQQQASFLEILSKFLPEKK" XX SQ Sequence 2559 BP; 817 A; 435 C; 388 G; 919 T; 0 other; accgacttta cgaaatcaaa acgcctactg caacgagaac gtagaaacta tgattttaca 60 agtcgtcgtt gtcatagccg ttttaattaa aatatcgtta cgaaaattac tatgaaaacg 120 tcactttatg taattttgta tagttctgta tatttttctg tatgattttt ttttgtacaa 180 aaatccttaa tgttttgaag taaaaatctc attcggtgca acgatacaac aaagaaatga 240 agttgaaaac ctttattttt taaataaata tgataaaatt atcccaggtt cgtaaccagt 300 tacttatttt taattctgat aacacaataa acgatgaaga gctgttatta ctttacgacg 360 ttaataaatc taataactta aatttgtctt accaaagtta tccaagattt aatttagaca 420 acatgtcagt ggatgaagca aaaagcgaat ttcgtttttt atcaaaagat atttacgaaa 480 tgattgatct tcttaacatt cccgagaaga taacgtgtta taatggcatc accatttcag 540 ctgacgaagc cttttgtatt tttttaaaac gttttgctta cccctgcagg tatcaagata 600 tgattccacg attttcaaga ccagtaccac aattatgtat gatttcaaac catattatga 660 atttattgtt tgcacaatgg ggccatttac ttaccaactt aaatcaggga tggttagata 720 ctcagcatct agaaatgttt gccgctgcta ttcatgctaa aggagctcct cttacaaact 780 gctgggggtt tattgatgga actttaagac caataagtcg tcctagagag caccaaagaa 840 tactttacaa tggacataaa aggtgtcacg gaataaagtt ccaatcacta gttgcaccca 900 atggcttaat tgccaatcta tatggtcctg tggaagggaa aagacatgat agcggtatgc 960 ttgccgattc aaatttacta aataaattag cagtttgttc tttcaatagc aatagagaac 1020 ctttgtgtgt ttatggagac cctgcatacc cccatagagt taatttacaa tgtgggttta 1080 aaggagctaa catatcccct gagcagctta tttggaacaa aaatatgagt aaagttcgtg 1140 ttgctgttga gtgggtattt ggagatattg ttaactattt taaattttta gactttaaaa 1200 agaatcttaa agtgggactt agtccagttt ctaaaatgta tttggtttgt gcattaatgc 1260 acaatgcacg tgtttgccta tatggttcaa caacaacaac atattttgat tgccaaccac 1320 catcgctttc agactacttt aaataatttt ttcatgaaat ataaaaaagt gtacacaaat 1380 aactttattt tttttctggt aaaaatttgc ttaaaatttc taaaaaagat gcttgctgtt 1440 gcatcatagc ctgcatcatg ttattttgtt gctgcatcaa ggtattttgc tgctgcagaa 1500 tgtgttgcat tgtgtttgct tgtgcttcaa cttctctttt tttaatttcc agttcttctt 1560 ttctaatatt tacttctatt tcattcttct cttttaaata agcaatagtt tcattgctat 1620 tatttcttga cttttttaag agttcttgtc ctttgtttcg cttttcagtt tcattatgtg 1680 tctctaatga ccttttacgc atttctattg cttgaacatt atccacatca atctttccat 1740 tattatgctc taattgctct ttaaaaatac tttcagattg ctgaaatctc tcatagacat 1800 cagcaatagc cacatcataa tctgtcaagt cacatgtaat tccacttgca tttttttctt 1860 ttgtatcttt ttttttaaac ttttcaatca aaaggttaat gtgatcacga gttgatctac 1920 cagtaacttt aaaataaggt tctggtaatc cattaagaga gttggatatt ttttcccatg 1980 aatctcctct ttcttttgaa ccacgtttaa actgccatgg ctcgaaggtc aaaacctcca 2040 ttaataatgt gtaatcatgc agctgagtcc atcgcattga actaaaaaat aaaacatttt 2100 atagttaaat tgttatttta ataaacatat caattaaatt attaaaaatc attacttata 2160 caatataaat cacagatttg tatgacagtc agattttaca tatagttata ttaccttgct 2220 gaagatgttg taattgatcc acatgaagat tccataacaa agataaactg cgctataaga 2280 gcattagagc acacacacat atgtaaagtt agtaattcag atttaaaata aaaataatta 2340 gccgacttaa cgatataaat ttaaaaacca caaaaacttt gtgacgttat cataactatt 2400 tcacgttttc actgcaacgc ttaaacttga atttttacgt ccggcctaga actttaaaac 2460 gcgactaggg tattaccctg agctcacgca tgcgcattat caacattttg tcctcagaag 2520 ttgcggttgc agttggcgtt tagatttcgt aaagtcggt 2559 // ID Gypsy-22_DPu-LTR repbase; DNA; INV; 1081 BP. XX AC scaffold_601; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_DP_; KW Gypsy-22_DPu-I; Gypsy-22_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-1081 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_601; Positions 3642 4722. XX SQ Sequence 1081 BP; 365 A; 125 C; 189 G; 402 T; 0 other; tggtgagtgt aaacttttat tatttcagac ttcgatcaca tttacatttt tctttttatg 60 taagaaaaat tggtatgaaa atgaattatt attatttttt taatcgtggt taattttata 120 agataaatct agaagaaact agaaggaagt aaaaaatttt gtagttattt tatcatttgt 180 tttgtctgat tagaattgct tttaagcagg cagaaattat ggatactaaa ttgttatgga 240 gaaatatcga tattttgttg atttaatgtg aaacaaaacc tgaataaaaa cgaataatag 300 catactcctt gggtgtaatg gttagcgctt cgggtgaaaa aagctcagta tgcaagccct 360 cgacctagca taaggtggac atgccagagt gtaccggaat gtactttcac agccgaggac 420 attttttggt tttgaatgat tgatatttta aatagagtac tttcaagaaa tattgactca 480 aataaataat atttaatttt ttgtacccat accaattgaa tttctgctag tggttcaaat 540 ctttggtgag tgtaaacttt tattatttca gacttcgatc acatttacat ttttcttttt 600 atgtaagaaa aattggtatg aaaatgaatt attattattt ttttaatcgt ggttaatatt 660 ataagataaa tctagatgaa actagaagga agtaaaaaat tttgtagtta tttaatcatt 720 tgttttgtct gtttagaatt gcttttaagc aggcagaaat tatggattct aaattgttat 780 tgagaaatat cgatattttg ttgatttaat gtgaaacaaa acctgaataa aaacaaataa 840 tagcatactc cttgggtgta atggttagcg cttcgggtga aaaaaagctc agtatgcatg 900 ccctcgacct agcataaggt ggacatgcca gagtgttccg gaatttactt tcacagccga 960 ggacattttt tggttttgaa tgattgataa tttaaataga gtactttcaa gaaatattga 1020 ctcaaataaa taatatttaa tttattgtac ccataccaat tgaatttctg ctagtggttc 1080 a 1081 // ID BEL-228_AA-I repbase; DNA; INV; 6913 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-228_AA_; KW BEL-228_AA-LTR; BEL-228_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6913 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 911-911 (2011). XX DR [1] (Consensus) XX CC Positions [5945-6505] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 2228..3712 FT /product="BEL-228_AA-I_2p" FT /translation="MYNVSLLHQLVSRLPPSIKLDWAKYRYTLPRVNLATF FT GNWIYSLAEAASTVTIPAIQEFKPSRNDSRPTKKSAGFLNAHLESNETELK FT PDIDNVRNDTNGCLVCKSSCKELWKCKQFLELSRDSRWAVVRDFDLCRRCL FT QKHNGVCKARACGKNGCTRKHHELLHNENQKENESNPKSDGCERHQPSTSS FT ARHECNSHRSSTNNSLFRYLPVVLYGKSGCIQTFAFLDEGSKLTLMDEELA FT NELELDGVESPLYLRWTGGTERCEKDSRAISVSIAGVYTGAKRFKLDDVRT FT VQGLQLPRQSLDAVKMMERYPYLRGLPIEPYEAARPRILIGLKHAHVSLVL FT QCREGKVEQPIAIKTRLGWTVCGGSDNENVPNMVHYSFHVGSREDHLDEDL FT HQAMKEYFALDSMGVMKPSHALLSLEDQRSNKMLESLTNLNGNRYETGLLW FT RYDDFRLPDSRPMALRRYHLLEKRMAKNPELGKALDQKIEDTLLKDTFGS" FT CDS 5897..6856 FT /product="BEL-228_AA-I_3p" FT /translation="MPSAPIMADLPDARLAAYSRPFTHVGIDYFGPMEVAV FT GRRVEKRWGVLATCMTTRAVHIEIAHSLSTDSCVMAIRNMIARRGIPRYIY FT CDRGTNFVGTSRELSRVEGELDLETMMKEFTTSETAWCFNPPMSPHMGGCW FT ERLIRSVKNSLKSLNLPRRPSDEVLRNALTEIENAINSRPLTHVPIEDNAA FT PALTPNHVLLGTSNGSKPLTMLSDSGTIVRQCWRTSQIIANQFWRRWIAEY FT LPEITRRTKWYSSRSPSVKIGDVVVIVDSTFPRNCWPKGRVIATRPGRDGE FT VRSATVRTATGVYERPVVKLAVLDVMRE" FT CDS 4075..5628 FT /product="BEL-228_AA-I_1p" FT /translation="MFHQIQIREQDRSCQRFFWRDKFGKAAVFEMDVMTFG FT ACCSPSSAQFVKNLNAGRFVGKYPQAVEAIIKRHYVDDMLVSVKTEKEAVD FT LAHQVKYIHAQGGFEIRNWISNSQLVLQSLGEDDVKSKNLDLSAEVIAEKV FT LGLWWCTDKDTFTYKVGWTRHDEMLLQGQRCPTKREVLRVLMSIFDPLGLI FT AHFLIYLKMLLQEIWRSGTQWDEEISDTLFAKWQQWLRVLPEVEGVEIPRC FT YHIRKFTDCELHTFVDASENGFAAVSYMRFSNGKEVECALVTAKTRVAPLK FT FQSIPRLELQAAVLGARLARTVAEALSIKATRRLFWTDSRDVLCWINSDHR FT RFTQFVAHRVSELLDSTEASEWRWVSTKENVADDATKWQGRPNLTTSSRWF FT KGPEFLWRSESCWPQQIEISKTTQEELRPHLVAHITTMAPAICLTNYSSWK FT RLVNVVGFLLRFSANCRRKLQHHPIHTGPLCMEELSAAEKILIRQAQRDNF FT AGEIASLTRGQQISRSSSLFKS" XX SQ Sequence 6913 BP; 2002 A; 1586 C; 1675 G; 1638 T; 12 other; aaactaagac taatagacta tgtatgtatt taactattgt taccattcag gaaaattata 60 acctaccgca taattacgtc aagtcactgc gtttcattca acggtggtta tagttgcgga 120 aattaaccct tgcagtacca gtgttggttg gttcagacaa ccaacatatt atattataaa 180 cctttgtgga cggctccata taacatttta taattttcgt ttattattgc gaaaatgtcg 240 agtcgcgaag cacacggttc cgggaagaat gtcaagttgt cgaagaaggg caagagttca 300 aaagcaaaaa tcgatgaagt ggtcaatagt ggtgatgtgt cgatagtggt cacgggagtt 360 cctgaggaga catcggcgac ggatcggaca ttagctggtc gaacttgcaa gtcatgcaaa 420 ggtccggata gtgacgagat ggtgcaatgt gacaattgcg acaagtggca tcactttggc 480 tgtgttggag ttacggagga agtagcagac catagttgga gctgtccaaa gtgcgtatct 540 gcaaaatggg ctcaacgatc gggatctact tcagaatttg gtggaggaga aaaactggcc 600 aaataatttc tcgaacgcaa ttatgacttg cttgaagaag cagcaaacaa aaggaattcc 660 aagactagtt ccagagcatg ttccagagct agtasgatgc taaggacagc acgcgattgg 720 gtacatggca acaagaaccg agcagacgtc aatgggatcc taataccaaa ctacgatatt 780 caagaccagc gaagttccac agcttggatg ccgtcggtgg tgcgaatgag ccctgcataa 840 gagktagaat acctaaaata atatctcaag catattccac gggcaccaaa ctgataatta 900 catcgggtat tgtaataccc caacaattct tgatgtattt tcgtataaaa gtacaacctt 960 tcaacaatgg ttcgatacct acaacagacc tcgataactg tctaatacct caaggaagtg 1020 tttttatcat caaaatatgg tatgcgcggt gcgtctaaac cagacgatta taaaaatcaa 1080 atgtgccttg gattttcgtc ttcctcgagt tcactatgcg ggtcatattt ctagaaccgg 1140 taagcttcta aagcttcaaa tgcagaaatt ttggtttttt tcacttttag gctctttttc 1200 atactaaatc ccaggaaaat cacgcccata caaaacgtat ggccaaagaa tttctcgaac 1260 gaaagtttta atgccttgct aatatgaaga tatagagcaa atactgatcg caagcgagag 1320 gagctccaag acacattcta gttccagagc tagtaggttc agcctaagcc gagtacgcgt 1380 aagttattgg gagttatgcg tggcaactcg aataagcaag gagaagaaaa tatgccgagc 1440 taccccgaag tgttcgatcc agagaggcac tctacacaaa acccgtgtgc tatagcgatg 1500 ctgaatcagt tggctcagta ccgccagagc caatacagag agcagtcatt ttcttcacga 1560 caggatcact ggacaaacgg gagaacctat tcgataagag cgatgtccag tggagttgca 1620 aatccaacca gcacgccggc tgttgaaatc tgcagtcaac gtacatcggt cgacagggca 1680 atgtttgcaa cgcagcgagg agaggaagat ttaacacgtc cacacgtacc aatatcaagc 1740 gtcgttggag atccgtatcg accgtcatgt tctagaagag agccatgtag agttgaagaa 1800 gatttcgatg acccatgccc tctttcgaaa aagcaactgg ccgcgcgtca agctatatcg 1860 aaagacctgc caagtttttc agggaatcca gaagattggc ccatattttt ctcgtcatat 1920 accaatacaa ctgccatgtg tgggtttacg gacgcagaaa atacagtccg gttacagaaa 1980 agtctaacag gaaaagcgta tgatgcggtg aagagccgtt taatgcatcc gtctaacgtg 2040 aaaggagtta tagacacgct gcgcatgcgt tttgggcagc ccgaagccat cgtccactcg 2100 ctcatagcaa aaattacagc tctaccgtca ctgaaggaag acaaacttga aacgattatg 2160 gattttgcgg ttgaagttca aaacttttgt gccatatata gtcgatgcgt gcgaattgga 2220 agagcacatg tataatgtat cactgttgca tcaactcgtg tccagattgc cgccatcaat 2280 aaaactagat tgggccaaat atcgttatac actaccaaga gtcaacttgg caacctttgg 2340 gaactggatc tattccttgg cggaagcggc aagtactgtg actattccag cgattcagga 2400 attcaaaccc tcgcgcaacg actcaagacc tactaagaaa agtgccggat tcttgaacgc 2460 tcacctggaa tcaaacgaaa cggaattaaa gccagacatc gacaacgttc ggaatgatac 2520 gaacgggtgc ctagtttgca agtctagttg caaagaactg tggaaatgca aacaattttt 2580 ggagctgtcc agagactctc ggtgggccgt agtacgtgac ttcgatttgt gtcgacgatg 2640 cctgcaaaaa cacaatggtg tatgtaaggc cagagcttgc ggaaagaatg gctgtacgcg 2700 taaacaccac gagctgcttc ataatgaaaa tcaaaaggaa aacgaatcaa atccgaagag 2760 tgatggatgt gaacggcacc aaccgtctac gagttcggca cggcatgagt gcaatagcca 2820 ccgatccagc accaacaatt ccctgtttcg ttaccttcca gtcgtactct atggaaaaag 2880 cggttgtatt cagacgtttg ccttcttgga cgaaggctca aagcttactc tcatggatga 2940 ggaacttgcc aatgaacttg agctggatgg agttgaaagt ccattgtatc ttcgatggac 3000 tggtggtacc gaacgatgcg aaaaggattc tcgcgctatt tctgtttcga ttgctggcgt 3060 atacaccgga gcgaaaaggt tcaaattaga tgatgtgcgg accgtacagg gactacaatt 3120 gccgcgacaa tccctagatg ctgtgaaaat gatggaacgg tatccgtatt tgcggggcct 3180 acccatagag ccctatgaag cagcccgtcc acgaattttg attggactaa agcacgccca 3240 cgtcagccta gtgctgcagt gtcgagaagg caaagtggaa caaccaatag ctataaagac 3300 gcgactaggc tggaccgtat gcggtggcag cgataacgaa aacgtaccaa atatggtgca 3360 ctattcgttt catgtaggat cgcgcgaaga tcatttggat gaggacctgc accaggcaat 3420 gaaagaatat ttcgctctag acagtatggg agtaatgaag cctagtcatg ctctactttc 3480 attagaagat caacgcagta acaaaatgct ggaatcgtta actaacttga acggcaatcg 3540 ttatgagaca ggtttgctct ggcgctatga tgacttccgt cttccggata gtaggcctat 3600 ggcgcttcgc cgatatcatc tactagagaa gcgtatggcc aagaatccag aacttgggaa 3660 agccctggac cagaagattg aggatacatt gctaaaggat acattcggca gttaaccaaa 3720 gaggaagaag agcaaccggt ttcacgtgtc cggtatctgc cagtattccc ggtgttcaat 3780 ccgaacaaac cggggaaaat acggatagtc tgggacgcag cagccaccat tttcggtgta 3840 tccctaaact ctgtcctcct gaaaggccca gatcagctgt gttcactgtt ttccatactt 3900 ctacaattca gggaacatcc aattgggctt accggtgaca tacgcgaaat gttccaccaa 3960 atacaaatcc gggaacagga tcggtcctgt cagcgctttt tctggaggga taaatttgga 4020 aaagccgccg ttttcgaaat ggatgtcatg accttcggag cgtgctgttc gcctagcagt 4080 gcgcagttcg tcaaaaatct gaacgccgga cggttcgttg gtaaataccc acaagcggta 4140 gaagctataa tcaaacggca ctacgtagac gatatgttag tgagcgtaaa gacagagaaa 4200 gaggcggtcg atttagcgca tcaggtcaaa tatatccacg ctcaaggcgg tttcgagatc 4260 cgcaactgga taagtaattc ccaattggtt ttgcagtcgt taggagagga tgacgtgaaa 4320 tcgaagaatc tcgatctatc agccgaggtg atagcagaaa aagttctagg actgtggtgg 4380 tgcactgaca aggatacgtt cacgtataaa gtaggctgga cacgccacga cgaaatgctt 4440 ctacaaggac aacgttgtcc cacgaaaaga gaagttctga gggtactcat gtcgattttc 4500 gatcccctcg gcttgatagc tcattttctg atttatctca agatgctatt gcaagaaatt 4560 tggcgttctg gaacccaatg ggatgaagaa atcagcgata ctctttttgc gaaatggcaa 4620 cagtggttgc gagtgctacc tgaagtagaa ggcgtcgaaa taccacgatg ctaccatatt 4680 cgaaaattta ccgattgtga actacacacg tttgttgacg ccagtgaaaa cggattcgct 4740 gctgtatcgt atatgaggtt ttctaacggc aaggaagttg aatgtgcact agtcacagct 4800 aaaactagag tcgcgccatt aaagttccag tccattccca gacttgagtt acaggccgca 4860 gttctcggag cgagacttgc acgaacagta gcagaggcac tatccattaa agcaactcga 4920 cgcttgttct ggacagattc tcgagatgta ctgtgttgga tcaactcaga ccaccgtcga 4980 ttcacacaat tcgtagcgca tcgagttagc gagcttcttg attccacgga ggcatccgag 5040 tggcgctggg tatcaacgaa ggaaaacgtg gctgacgacg ccacgaagtg gcagggacga 5100 cccaacctta ccactagcag cagatggttc aagggccccg aattcctctg gaggagtgaa 5160 tcatgttggc cacagcaaat tgaaatcagc aagactacac aagaagagct ccgtccacat 5220 ctggttgcac acattacaac gatggcgcct gcgatttgtt tgaccaatta ttccagctgg 5280 aaacgtctag tcaacgttgt agggtttttg cttcggtttt cagccaactg tcgacgcaaa 5340 ttgcagcatc atccaatcca cactggtcca ttgtgtatgg aggagctaag tgcagccgaa 5400 aagattctta tacgccaagc tcaacgggat aacttcgccg gggagatcgc cagcctgacc 5460 cgaggtcaac aaatctcgmg aagcagttct ctctttaaat cawctccttt catcgatgat 5520 agcggtatct tgcgcatgcg tggtcgtacc accgctgtcc ttcattaccg aagaagcgaa 5580 aaaccccatc atcctacccc gtgaccacca cctaaccact ttaatcatca gccactwcca 5640 taacaaatat catcatctca accaagacac cgtagtcaat gaactkcgtc agaagttttc 5700 aattcctcga gtacgcgttt catgtgccaa ggtgagaaga aactgtcagc gctgcaaaaa 5760 cgatcatgcc atgccgagcg ctccaataat ggccgatctg cccgatgcaa gattagctgc 5820 ctactctcga cckttcactc acgtaggaat mgactatttt gggccaatgg aggtcgcagt 5880 cggtcgaaga gttgagaaac gctggggagt kctggctacc tgtatgacta cgcgagcagt 5940 gcacattgag atagcccatt cgctcagcac agactcktgc gtaatggcca ttcgtaacat 6000 gatcgcccgc cggggaattc ctcgttacat atactgcgac cgagggacca atttcgttgg 6060 tacgagcaga gagttgagcc gcgttgaagg ggagctcgac cttgagacca tgatgaagga 6120 gttcaccact tcagaaaccg cctggtgctt caatccacct atgtcgccgc atatgggtgg 6180 gtgctgggag cggcttatcc gcagtgtgaa gaacagtttg aagtctctaa acctgccacg 6240 tcgaccatca gatgaagttc tgcgcaacgc cctgaccgaa atcgagaacg ctattaattc 6300 gagaccactc acacacgttc cgatcgaaga caacgctgcc cctgctctca cccctaacca 6360 cgttttgcta ggcacctcta atgggtcaaa accactcacc atgctcagcg acagtggtac 6420 catcgtacgt caatgctggc gtacgtctca gattatcgcc aaccagttct ggagaagatg 6480 gattgctgag tatctaccgg agattactag gaggacgaaa tggtacagct ccagatcgcc 6540 atcagtgaag atcggcgacg tagtcgtaat tgtggactca acattccctc gcaactgttg 6600 gcccaagggc agggtgatcg caactcgccc tggtcgagat ggggaagtam gatcagcgac 6660 ggtgaggact gctaccggtg tctacgaaag gccggtggtt aagctagctg tattagacgt 6720 tatgcgcgaa gakaagtagt cgatcagcag gtcgacgtac ccggggggag tgttggcaag 6780 ccccttgttg ccagcgcgca ccttccacac agcaacctta aacccctaca cacaccaact 6840 tacgccgaga tgacaggtaa ttccaacagc tgtcgatcgg tgagtaaaga taaagaaatc 6900 aagtagaata act 6913 // ID BEL-1_BM-LTR repbase; DNA; INV; 378 BP. XX AC nscaf2210; XX DT 19-MAR-2010 (Rel. 15.04, Created) DT 19-MAR-2010 (Rel. 15.04, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_BM_; KW BEL-1_BM-I; BEL-1_BM-LTR. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-378 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(4), 582-582 (2010). XX DR Genome; nscaf2210; Positions 4188804 4189181. XX SQ Sequence 378 BP; 140 A; 74 C; 65 G; 99 T; 0 other; tgtttacgct atctgcaatg gataattagc attgatgttt ttgaacgtaa ctgagagcgc 60 atcgatgtag tcccgcccct ttcccctcct atatcaatga cacttttaaa attgaatttg 120 aatcgtgcgc atcaaaattg tgcgatcgcg aaaaagtggt gaatattgag tgcagctaag 180 taagtactta aaaaactgaa agcttttcat caagaacaaa gaaagaagac aggagtggcc 240 actccatgaa gatagtgcta caaatattat cactacaaag acacagactt tcggtgagtt 300 cctaaaaata ttacctacta ccaaatatca tatcgattgc gctcaaaaat ataaaaataa 360 aagaaccaag cccaaaca 378 // ID Polinton1N_TCa repbase; DNA; INV; 4078 BP. XX AC ChLG8; XX DT 09-MAR-2008 (Rel. 13.03, Created) DT 09-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Putative non-autonomous Polinton-type element. XX KW Polinton; DNA transposon; Transposable Element; Nonautonomous; KW Polinton1N_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4078 RA Jurka J.; RT "Putative non-autonomous Polinton-type element from Tribolium RT castaneum."; RL Repbase Reports 8(3), 373-373 (2008). XX DR EMBL/GenBank/DDBJ; ChLG8; Positions 12143371 12147448. XX CC CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX FH Key Location/Qualifiers FT CDS 1792..2631 FT /product="Polinton1N_TCa_1p" FT /translation="MESDVDNLWRIDEMPVTDESTKSYTYNEIRESNINMK FT ELEQYTLETNISGWKHLSGAYIFVKSKIKNNPNNYATISNNGINDFDYVRL FT FYEEVLIEETNYVGVATLISNLIEFSGDVSDTSASQLCWYPDTADSADTSQ FT YKYEAQDKSVKFKDLNVDIKTIISKITNNSAFNKGYLERWLLTKDEKTITK FT FIPLSKIYRFVRDCNKVLNGKIRIELRKNRVKIDKTLEKNRYIRFGDISRI FT SNPETTSIMLELECGETVKLIFGAAQAKEDFYEAITVIR" XX SQ Sequence 4078 BP; 1441 A; 673 C; 685 G; 1279 T; 0 other; actactaagt gagccccttg aggacacagc actttttaag gtaaaacacc tcgattttca 60 aggtagatca aaattctaaa aggtagatca aaaagtctaa aatttgaagg tacgaagaca 120 gccccttcag aacactcaat aatttcaagc aaaacaccct gcttttcaag ggtaaacaaa 180 aaagcaaaaa aaaaaaacgt taaaacacca ccgattttcc aggtagttaa aaaaacatca 240 aataatcata aaaacacccc gattttcacg ggtgaacaga aaagcaaaga aaatcgttat 300 aatatcccga ttttcaaggg taagcagaaa agcaaaaaaa atcgtgaaaa caccccgatt 360 ttccaggtat ttaaaaaagt aaaaaatatc aagcaaaata acccgatttg caagagtaaa 420 cagaaaagca aaaaaatcgt gaaaacaccc ccgattttca ggggttaaca gaaaagcaaa 480 gaaaatcata aaacctcctc ggttttactc cgttatagta ataagaaagg attaggataa 540 ttatataact acttatatca aaaatagata aagaagataa gagttatctc aatcagttgt 600 gcgaattaga aaaaaataat gaaaattcgg tgtataccca cacggtatac acccctgaaa 660 ttggtgaata tatctattaa aatcgaggaa aagtttacca ctcgttcttg acgaataaaa 720 aacgcaatta taagaattcg ctctgctcat gagtggacta cgtccacgga ttgtcttttt 780 tttctaatta gtcgcagact tgagtcacct ccagctgacg ctaatatcca aacggtattg 840 tatcaaagtt ttctaaaaca accctcttat cataaacaaa accaaattct tttgtttcta 900 ttttatttaa gagttcctta gttttagtat ctctggtaat tcgattatac tgcctttctt 960 ttctaaaata ttatttatac tattggaatc gatctttttg gatgtctcat agtttaacgt 1020 aaatcctttt attttacaaa caactttatc cgtattagtc ttataacaat aactttttgg 1080 acctgttgat aaccagtcaa caatccacac atccttacct aattcatcgg tccactctcc 1140 caataaacaa ccggttttga tagtattttt accatcgtca atatatacaa cgctatcagt 1200 atcataatat acaaccgcct ctcccctctc agtaagtctt cgccttcggc tctgggcagg 1260 cttcgccagt ctttcttgta gaataaatgg cacttgtcgt tcatagaaaa caattatttc 1320 cgaaacaata tgggtttggt cttaaatatc atagagatat tagaggtgga aatatttttc 1380 aatctataaa aaaatgctat acctgcaata ggctttttat atcgtaatat attaaaacct 1440 ttttggaaaa ataataaaga ggatatcatt caaggtgtga cttcagggat taaaaatata 1500 gcgtctacat acgtaaaatc gccgccgttt tctacttata aagaagaaat tataaatcca 1560 attctaaaca aaatgaaaga aaaatcgatg aatgaacatt tacgaggcga cggattgaaa 1620 aaggctaaga caggcagggc gaatcgatta actaaaaaat caaaggagat tattaaaaat 1680 ctgctgaatg cttaagtttc ttaaaagact tcgcctttcg cctgcggagc cgtagtccct 1740 ttcagggacg ctctggggct gatttcatca gacttctttt tatttaaata aatggaaagc 1800 gatgtggaca atttatggcg tattgatgaa atgcctgtga ccgatgaatc aacaaaaagt 1860 tatacttaca atgaaataag agaaagtaat atcaatatga aagaacttga acagtatacg 1920 ctcgaaacta atataagtgg atggaaacat ttatcaggtg cttatatttt tgtcaaatct 1980 aaaattaaaa acaatccaaa taattatgca actatatcaa ataatgggat aaatgatttt 2040 gactatgtga gactattcta tgaagaggta ttgatagaag aaacaaatta tgtgggcgtt 2100 gctacgttaa tttcaaatct gattgaattt tctggagatg tttctgatac ttcagcatca 2160 caattgtgtt ggtatccaga cacagcagat tcagcagata cgtctcaata caaatatgag 2220 gcgcaagata aatcggtaaa atttaaagat ttaaatgttg atataaaaac tattatttca 2280 aaaatcacaa ataattctgc attcaacaaa ggatatttag agagatggtt gttaacaaaa 2340 gatgaaaaaa ctattacaaa atttattcct ttatctaaaa tttaccggtt tgtacgagac 2400 tgtaacaaag ttttgaatgg aaaaattcgc attgaattac gtaaaaatag ggtgaaaatt 2460 gataagactc ttgaaaaaaa tcgatacatc cgatttggtg atatctcccg aattagcaat 2520 ccggaaacga ccagcataat gttggagttg gaatgtggtg aaacggtgaa actaatattt 2580 ggagccgcac aggcgaaaga agatttttat gaagcaatta ctgtcattag gtaaaatcct 2640 tatcaacagt tactaatttt tacaagtggc tataaagaaa aatatacata aactttgtat 2700 gtaaatgaat ttatgaaatt ttttatgtat aattttttat ctgcgtactt gttttttttc 2760 agctccgatt ctggaaattc tgagtaataa taaatttatt ttgttactta cttgatataa 2820 taaattaatg tgtaaattat aatatatttt tttggcctcc tgtactgtaa aacgtgaatg 2880 acagttgccc atttataaaa ggtaacttga cctttaaaca cgatgctgta aactggtagc 2940 aaacaattag gtaaagtaat cacaaaacac tggacgatta gacatgttta agatacaaac 3000 cgtgcttacc ataattaagt tcgtataacg aaatacttgt tacctataat tatttaattc 3060 aatcgaccat ggtgtatatt taattagtga aaatataaat ccccccggaa cagcgccacc 3120 tatacatacg tttgggaacc acaaatgtca ttgattcaac caaaccaatt cgaaatttta 3180 cacaagttta tcattcggcg tattattaaa atatttggat aaaatgtctg taacatggta 3240 tgtgagcctt gctaggatat taccgcattc gagtgtaaat agctatttta atactgtcac 3300 gtagtgactg ggggtttgct cagaggtagg gcgtcttaat aaaaataccg agaggtctaa 3360 tcaacaagtt tacttacaga ctagttaaca agtaaaaata aatgtcaata cagcatgcct 3420 acctttttgg ggctaggtct ttcaggacga agtcccagct cttggcggcc cacacgcgaa 3480 tccccgataa ttaatattag tttaatagcg agccagcaaa gtgctgactc gtcttaacag 3540 cgacacacat tttcgtaaca atacactgtt gaaaaccaaa tttgtttgga tatatacagg 3600 tgtcccaaaa ttcgcggaac agctgagtgg taggttaata actgatgccc tgaagtgggc 3660 caatagaaaa atagggcaaa atctttgaaa aatgttaatt ttaaaatata cctttaaaaa 3720 tgggggtgtt ttcacgattt ttttttgctt ttctgcttac ccttgaaaat cgggatatta 3780 taacgatttt ctttgctttt ctgttcaccc gtgaaaatcg gggtgttttt atgattattt 3840 gatgtttttt taactacctg gaaaatcggt ggtgttttaa cgtttttttt tttgcttttt 3900 tgtttaccct tgaaaagcag ggtgttttgc ttgaaattat tgagtgttct gaaagggctg 3960 tcttcgtacc ttcaaatttt agactttttg atctaccttt tagaattttg atctaccttg 4020 aaaatcgagg tgttttacct taaaaagtgc tgtgtcctca aggggctcac ttagtagt 4078 // ID CR1-78_HM repbase; DNA; INV; 4693 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 26-OCT-2010 (Rel. 15.11, Last updated, Version 3) XX DE CR1-type family - consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; CR1-78_HM. XX NM CR1-78_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4693 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 9(2), 365-365 (2009). XX DR [1] (Consensus) XX CC Re-classified as Crack due to new clade assignment. This sequence CC was derived from sequence data generated by TIGR, J Craig Venter CC Institute. XX FH Key Location/Qualifiers FT CDS 915..4181 FT /product="CR1-78_HM_1p" FT /translation="MLIRNLCKFCNKTVVDSHKGILCNYCSSWLHPKCNYI FT NKHTYSLIEKDSTDWYCSDCINKNLPFCSLSDFELSLTLSSQTIPTNNLKS FT LKSPPHFKKLFNNINKISNKSINCKYYDILELNKVLNNNCNLYLHLNIASL FT PYHIDGLRSLISSLTTLPIVIGISESRLNINDSSKTDVNINGFTIEHSPTE FT AKKGGALLYLRSNLNYFIRSDLIVYSPKYLESIFVEIVNPLKQNIIVGSIY FT RHPSMNSNEFITDHFNPLLERLSLENKQVVLMGDFNMDLLNYNESKVISSY FT LDTLCSYSFFPSIIQPTRITATSKTLIDNIFLNFQTPDLISGNLTISISDH FT MAQFICIPCLSPKKTKEVIFRRTFQNFDNKKFIKDISNADWQIQITNNDNV FT NESLHYFFKTFNKILDQHAPYKKLTAKQIKLKSKPWITNGILKSISIKNKL FT YKQYIKAKNHNIKIHIFNKFKWYRNNICNLLKLSKKSYYINFFNNNLXNVK FT NTWKGIKEIINIKPSSFSSKSINLKFNGNTISDSKAVSNIFNNYFVNTQQT FT LLTKIVSPKCNFSDYLNNPNINSFFIDPVTENEVSNLIKSLQSKKSLGINS FT IPTFLLKLVSHIISKPLCTMINSSFENGLFPDVFKIAKVIPLYKKGSRTDH FT TNYRPISLLSNISKIFEKAMHNRLYNFLDKYQCLYKHQYGFRSKHSTTHAL FT IEITEVIRKALDDKLFACGVFVDLQKAFDTVDHSILLSKLEYYGIRGISLQ FT WFTSYLYNRFQFVSINGNTSSLVKCTTGVPQGSVLGPLLFLVFINDLNLSL FT KYTTSYHFADDTNLLLINKSLKKINKYINHDLSNLIQWLRSNKLSLNTSKT FT EIIIFKSKQSKISKHLNFRLSGQKINPVMYLKYLGIKIDSNLTFSQHLKDL FT SIKLCRSNGMLAKIRHYVNLETLINIYHAIFGSHLRYACQVWGQIRTQEIL FT RLSNLQNKALKIIYFQHYQSNSDHLYFLSKILKIDDLVQLLNCHFVFNYKQ FT KCLPSTFTNFFTVRESCRYHLRSANSYNLHVPNHSSVKYGLKSIKYQSVKS FT WNSLPSPLKSIDSITNFKNCLFNYFLARYYT*" XX SQ Sequence 4693 BP; 1704 A; 733 C; 475 G; 1778 T; 3 other; tggtaaagat ggcggagtta tagatagaat tgtaaattta tcaaataaag taatatatct 60 ttacatgaca gcaattaaga ggaaaaatat ttatagaaaa aaagactaaa aagtaaaaaa 120 aaatatatcg gaaaattaat tagaaaacca aagtacttaa agtttagaga tgtgtaagaa 180 cttcatagac tcaaataaat ataattaaag aacaagatac attacaaaat taataaacta 240 agagaaagta aagaaagaat caagttaatg aaagraaaag atcaagagtt ttttacattt 300 gttttgatta ttttttgctt taaatattat ttgtaaattt aagtattttc caggattcgt 360 ttttattatt gactttttga ttattattat tagcagttat cattgttatt attattataa 420 ttaaatattt atcattatta ttaaataata tagcattatt tatcctatat ttatccatta 480 tttacctctt attacaattt acatcataat ttatcttata taaatgttat ttacctaata 540 ttactactta tctccttctt tacttattat tacttacctt attatcatca aattatttaa 600 ataacatata ttatttatat tatactatgt ctaaagctac tagaggtatt aatatatcta 660 aatttatatt ctaaatcaag ttaataatat tctaatttta tctagctgta atacttattc 720 tcctgtgtaa agtatttaaa tctcttatat cttactatca tttctttata actgatcatt 780 cctcaatatt taaaatagca accatttcta gattaacatt tcttagtttt aatctctcag 840 tttctttata aatatatagt agatatcttt atttaattat tttttctcta tactttgttt 900 aagcataata agtcatgtta ataagaaatc tttgtaaatt ctgcaataaa acagtggttg 960 atagccataa aggaattcta tgtaattact gctcttcttg gttgcaccct aaatgtaatt 1020 atattaacaa acacacctat agccttattg aaaaagattc tactgactgg tattgcagtg 1080 actgcatcaa caaaaatttg ccattttgtt cactttctga ttttgagctc tctttaactt 1140 tatcgtctca gactatacca accaataatt taaaatctct aaaatctcca cctcacttca 1200 aaaaattatt caataatata aacaaaatct ccaacaaaag tattaactgt aaatattatg 1260 acattcttga acttaataaa gttcttaata ataattgtaa tttataccta catctaaaca 1320 ttgcttcgct cccttatcac attgacggtc ttcgcagtct aataagctct cttaccactc 1380 ttccaatagt aattggtatc tctgaatcaa gactaaatat taacgattcc agtaaaactg 1440 atgttaatat aaatggtttt actattgagc atagtcctac tgaggccaaa aaaggtggtg 1500 ctcttctata tttacgctca aacttaaact attttattcg tagtgatcta atagtatact 1560 ctccgaaata cctggaatct atctttgttg aaatcgtcaa tcccctaaaa caaaatatta 1620 ttgttggcag tatatatcga catccatcaa tgaactctaa tgaattcatt actgatcatt 1680 ttaatccctt gcttgagagg cttagtttag aaaataaaca agttgtcctt atgggtgatt 1740 ttaatatgga cttgttaaat tataatgaat cgaaagttat ctctagttat cttgatacat 1800 tatgttctta ttcttttttc ccttctataa ttcaacctac acgtataact gctacatcta 1860 agacactcat tgataatatc tttttaaatt tccaaactcc tgatttaatt tcaggcaatc 1920 ttacaatatc gatttctgat catatggcac agtttatctg cattccttgt ctttctccca 1980 aaaaaactaa ggaggtaata ttcagacgaa ccttccaaaa tttcgataat aaaaaattta 2040 ttaaggatat ctctaatgca gactggcaaa tacaaattac aaataacgat aatgtaaatg 2100 aatcattgca ctattttttt aaaactttta acaaaattct tgatcaacat gcaccctata 2160 aaaaactaac cgctaaacaa attaaactta aatcaaaacc ttggatcacc aatggaattc 2220 taaaatcaat atctattaaa aacaaactat ataaacaata tataaaagca aaaaatcata 2280 acattaaaat tcatattttt aataaattta aatggtacag aaataacatt tgcaatttac 2340 taaaactttc taaaaaatca tattacatca acttttttaa caataatctt wacaatgtaa 2400 aaaatacttg gaaaggtata aaagaaatta ttaatatcaa accttcttcc ttttccagta 2460 aatctattaa tctcaaattt aatggcaata caatttcaga cagcaaagct gtatcaaaca 2520 tatttaataa ttactttgta aatactcaac aaacactgtt aacaaaaatc gtttcaccta 2580 aatgtaattt ttcagactat ttaaataacc caaatattaa ctcctttttt atagatccag 2640 tcactgaaaa tgaagtatca aatcttataa aaagtttaca aagtaaaaaa agtttaggta 2700 tcaatagtat tcccacgttt cttctcaaac ttgtttccca tatcatctca aagcctctct 2760 gtacaatgat aaatagctct tttgaaaatg gactttttcc agatgtcttt aaaatagcta 2820 aagttattcc actttataaa aaaggctcta gaactgacca cactaactat cgcccaattt 2880 cgctactgtc aaacatcagt aaaatatttg aaaaagctat gcacaacaga ctttacaact 2940 ttttagataa atatcaatgt ctttacaaac accaatatgg gtttcgaagc aagcattcca 3000 caactcatgc acttattgaa atcactgagg ttatcagaaa agctcttgac gataagcttt 3060 ttgcttgtgg tgtgtttgtt gatctacaga aagcatttga cactgttgat cattctattc 3120 ttctaagtaa actggaatac tatggtatta gaggaatttc tctccaatgg tttacctcat 3180 atctttataa taggtttcaa tttgtatcta ttaacggtaa cacatcttct ttagtaaaat 3240 gcactacagg tgtacctcaa ggctctgtat taggaccatt gcttttcctt gttttcatta 3300 atgaccttaa tttatcatta aaatatacaa cttcttacca ttttgctgat gacactaact 3360 tacttttaat taacaaatca ttaaaaaaaa taaacaaata tattaatcat gatctatcta 3420 atctaattca atggcttcga tctaacaaac tgtctctgaa tactagtaaa actgaaatta 3480 tcatctttaa atcaaaacaa tctaaaatca gtaaacacct aaattttagg ctgagtggtc 3540 aaaaaatcaa tccagttatg tacctaaagt atcttggaat caaaattgac tcaaatttaa 3600 ctttttctca acatcttaag gacttgtcaa taaaactatg tagatcgaat gggatgttag 3660 ctaaaattcg tcattatgtc aatcttgaaa ctcttatcaa tatataccat gccatttttg 3720 gatcacatct taggtatgcg tgtcaagttt ggggacaaat ccgcacccaa gaaattttaa 3780 gattatccaa cttacaaaac aaagcattaa aaataatata ttttcagcat tatcaatcta 3840 actccgatca tctttatttt ttatccaaaa tactcaaaat tgatgatcta gttcaactat 3900 taaattgcca ttttgtcttt aattataaac aaaaatgtct acccagcaca tttacaaatt 3960 tttttactgt tagagaaagc tgtcgctatc atctacgatc tgctaatagc tataatcttc 4020 atgtaccaaa tcattcttct gtaaaatatg gtctaaaatc cataaaatac caaagtgtaa 4080 aatcatggaa cagtctacct tcaccattaa aatcaattga ctcaattact aatttcaaaa 4140 actgtctttt taactatttt ttagcaagat attatacctg aagactctga aatatgttgt 4200 tatgctttta ttattactac tytcttgata cctactagtt ttattattaa ctttttgttc 4260 tcatattttt attgttatta ttagtattga tattaataat attcttctat ctttattagt 4320 aaattgttat ttttatttat aacataataa gtattattat tgctttttta attttttctt 4380 aagtttattt actacttata ctatgtaatt gttattacca ctattattgc tattatctaa 4440 tattgattgt ataatatctt tattattatt gttagtatta ttactaataa tattcctttt 4500 catcgtcatc ttattattat tgttaatatt atttctataa ttattatcaa tactattatt 4560 attgttatta taatcacttt tattgtaatg ttgcaggttt ttactttaga ttagtacttg 4620 tactatttgt aaaagtcctt gaaattttac aacttatttc agaatatata ttgaattgaa 4680 ttgaatatat ata 4693 // ID hATm-14_HM repbase; DNA; INV; 3658 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-14_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3658 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1908-1908 (2008). XX DR [1] (Consensus) XX CC All elements in this series are flanked by (AT)n. Typically, the CC 5'-end is preceded by 5' TTA and the 3' end is followed by 3' TAA CC which could be extensions of the terminal inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 729..3098 FT /product="hATm-14_HM_1p" FT /translation="MAASSRVAISTRLQTEIFLIGQPEPNLPDKVLPLTSD FT VLKTFFYHLNSTKKSVPESLKTTVDELIIIWNKARIPTAFHPNVVLKLKTL FT VQEFKLIKKKKSRLSDSQKSREKSFTDTIGLLFDIAHKDAETMIRIEEDKE FT FLLDQRCQRKMVMFGEDKELSKLEARNEARKQAERERKLKEEQRQSGASAI FT VPVIEPLHDESYSDDNDNDAVDNDKMVDRDFEIEIPVYYRQQVSKASSSGS FT AESSLASTPKRQCILDTILDSPDVSSTLDRINLSDTKFTILAAAIARAGGQ FT NLDDGSLSRSTVRRKRTYHRSNIEATVRTEFCDLDKPPLIVHWDGKLMIDR FT TNSADPKANVDRLSVGVTGHNVDKILGIAKLSAGTGEAQAKAVIHLLNFWD FT IIGDVIGMSFDTTASNTGSENGACVVLEKHIGRNLLYFACRHHVHEIIVAG FT VFGSLFGPSSGPNIPLFQRFQQYWPKVNQGNFKPLDDIRMKIPLVQELQNE FT VMAFLKENLHVKMPRDDYKEIMDLCLLILGKLPDQEEKNYHFKIPGAYHMA FT RWMAKVIYCFKIYLFREEFKLTTKEEKNLCEFCLFASLVYVKSWISCPNAS FT DAPVNDLFLFQQLKQFAVVNKTISEAAVKKFQNHLWYLSPELVPLALFSNK FT LKTEEKRKMISNMKFHGENWSERLIKLKNIESLEKKSLDMLVTSVSASALR FT SMKVDIDFLFNNDPATWNDSPEYQEGKNLVYSLKVVNDAAERSVALMSMFN FT ESITRNESEMQRLIQVVEDHRKRVPDARKCTLKSYSPR*" XX SQ Sequence 3658 BP; 1264 A; 577 C; 662 G; 1155 T; 0 other; ttagggctgt tcatgtgaac acaggaaaaa aaattttttc tcggatttga atgcatagtt 60 gtttattttg tgccatttga catagtaatt gactgtgtaa aaaatctttc caatcagata 120 atgtttaggg ggtgctcaat gaccataaag ttttacgaaa aacgtcaaaa acgtggaaaa 180 agttccaaag ttccaaaaat ttctagaacc tggttagtta gattttttcc actgaaatat 240 tgtctgattg cgtgctatat agattaatta aagtgcagca gattaactgt catcagggca 300 ccagactaaa aaatgtctgc tagtgtttaa aacaactgtg tttatatcat tttgttaaaa 360 tttgtattca taattttaat actattattt aactcaattt agaatttgtt caaacagaat 420 aaaaaggttt acaagtataa atattatttt tatttggaat ttcagtttga caacaaatgt 480 attaatattt tgatagtacc taaccttagt aacctattac agaaaactag catggtagtc 540 tggtagtgta ttgtttgtta ttaagtgcag ataataataa tttctaattt gaaaatattt 600 atttaatttc tttagtaata attattatta tatattgctg cagctgcata aaccaacaac 660 tgaatgataa gtttgtataa tatggtattg agttttgatt ttctcataat ttttcagtta 720 tcattgaaat ggcagcatca tctcgtgttg ctatctctac acgacttcaa accgaaatct 780 ttttaattgg acaaccagag ccaaatcttc cagataaagt tcttccctta acatctgatg 840 ttttaaaaac atttttttat catcttaatt caaccaaaaa gtcagtgcct gaaagtctta 900 aaacaactgt tgatgaactg attattattt ggaataaagc ccgtattccg acagcatttc 960 atccgaatgt agtcttaaag ttgaaaactt tagtccagga atttaaatta ataaaaaaaa 1020 aaaaatccag attgtcagat tctcagaaat caagagagaa atcatttaca gacaccattg 1080 gcctactttt tgacattgcc cacaaggatg ctgaaaccat gatcagaatt gaagaagata 1140 aagaattttt attggatcag agatgtcaga gaaaaatggt gatgtttgga gaggacaaag 1200 aactttcaaa attagaagca agaaatgaag caaggaaaca agcagagaga gaacgaaagc 1260 taaaagaaga gcaaagacaa tctggtgcga gtgcaatagt tcctgttatt gaacctttac 1320 atgacgaaag ttatagtgat gacaacgata acgatgctgt tgataatgac aaaatggttg 1380 atagagattt tgagatagaa atccctgttt attacagaca acaagtcagt aaagcaagtt 1440 cttcagggtc agctgaatca agtttagcaa gtactccaaa acgacaatgt atcttagata 1500 caattcttga ctcgcctgat gtttcatcaa cattagacag aattaatcta tctgacacaa 1560 aatttactat actagcagca gcaattgcaa gggctggtgg acagaacctc gatgatggat 1620 cattgtcacg ttctacagtc cgtagaaaac gaacctatca tcgatcaaac attgaagcca 1680 ctgttcgaac tgaattctgc gatttggaca aaccaccatt aatcgtccac tgggatggca 1740 aactgatgat agatcgtaca aattctgcag accctaaagc aaatgttgac cggctatctg 1800 taggtgtgac aggtcacaat gttgacaaaa tacttggaat agcaaaacta tcagctggaa 1860 ctggagaagc ccaagcaaaa gcagttattc atctgcttaa tttctgggac ataattggtg 1920 atgttattgg aatgagtttt gatactacag cctctaacac tggctccgaa aatggtgcat 1980 gtgtagttct agagaaacat ataggtagaa atttgttata ttttgcttgc agacaccatg 2040 ttcatgagat catagtagca ggagtgtttg gatcactatt tggaccatca tctggtccca 2100 acatacctct tttccagaga tttcagcaat attggcccaa agttaatcaa ggaaatttta 2160 aacctctgga tgacattcga atgaaaatac ccctggtgca agaactacaa aatgaagtga 2220 tggcattcct caaagaaaac ctgcatgtta aaatgccaag ggatgactac aaagaaatca 2280 tggatctctg tctgttaata cttggaaaac tgcctgatca agaagagaaa aattatcatt 2340 tcaagatccc tggtgcttat cacatggctc gctggatggc caaggttatt tattgtttta 2400 agatttattt atttcgtgaa gagttcaagt tgaccacaaa agaggaaaaa aatctctgtg 2460 aattttgctt atttgctagt ctggtttatg taaaatcttg gatatcatgc ccaaatgcca 2520 gcgatgctcc agttaatgac ttgtttctgt ttcagcaact taagcagttt gcagttgtga 2580 ataaaacgat ttccgaagca gcagtcaaga aatttcaaaa ccatttgtgg tacctttcac 2640 cagaattggt cccactagct ttgttttcaa acaaacttaa aacggaagag aagcgaaaaa 2700 tgatttccaa catgaaattt cacggtgaga actggtctga aaggttgatc aaattaaaga 2760 acattgaaag tctagagaag aaatctctgg acatgctggt gacatcagtt tcagcaagtg 2820 cgttgcggtc aatgaaggtt gatattgatt ttctgttcaa caacgatcca gctacctgga 2880 atgattcccc agaataccaa gaaggaaaaa atttagtata ttcattgaaa gtagtaaatg 2940 atgctgccga acgatctgtg gctttaatgt cgatgtttaa tgaatcgatt acaaggaatg 3000 aatctgaaat gcagaggtta attcaggtgg ttgaggatca cagaaagcga gtgccagatg 3060 ccagaaagtg tactctgaag agttatagtc ctcgctagca ttaacattgc acgaactata 3120 acattacgat aattaaatgt taattgctca tatgtttttg agttgaccta cttagtagtt 3180 gataacggtt tgagaaactg ttactccttt acaataattt tttttgttta taattcgtat 3240 tttaatcaat gacaaagagt caaaaatctt agtgttttat tttggaaaaa ttctgatata 3300 acaaaaaccg cagaacttta gtcgtattgc agtattttat ttcattaatt tctctgctaa 3360 taaaccagta taaagtggat gtggattgta atttgtattc agaatttaat atttagagat 3420 aattaaaact gatcaaatct gaaaaatcta attaaccggg ttttagaaat ttgtggaact 3480 ttgttacttt ttccatgttt ttcacatttt tcgtaaaact ttatggtcat tgagcacccc 3540 ctaaacatta tctgatcgga aagatttttt gcacagttaa ttaatatgtc aaatggcaca 3600 aaataaacaa ctatgcaatc aaatctgaaa aaaaaaattt ttttttggac agccctaa 3658 // ID Gypsy-33_OD-LTR repbase; DNA; INV; 905 BP. XX AC CABV01002370; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_OD_; KW Gypsy-33_OD-I; Gypsy-33_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-905 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01002370; Positions 4486 3582. XX SQ Sequence 905 BP; 302 A; 194 C; 170 G; 239 T; 0 other; tgatacctac agagagtaag agccaaacaa aaatcgttag aaatcacggc ctttaaaatc 60 gccataagga aattcacaaa aaatgcttat cagttgcttg ctcattattc aaactttcta 120 aaacttattt caatccgaat tgttcaaatc atcaaatcca tctcttcctc ggtaaaaaat 180 tattctcttc catgatatca ctaacgttct ttcttttcta aattacttct acgacctcgt 240 ttacgcggat gacatcagca acaacaacga taacaagacg tctgacggcc ctgcgatacg 300 gatttcttcg aggttccgtt cttgtaacgg cttgtcggaa agtagccaac gattttcgat 360 gcgaaacccg acagtaactg cctcaaggta attgcggttg gcaactctat tcctgcctga 420 ttaatttgaa gcctgactaa ctttgaattt taatcctgcg cgaagcacga taacgactgc 480 agcttcaacg acgacttcaa aatcgacaaa atggacaacg acactgcaac ggacaagacg 540 acgatcctgg acacggactt gtgatttcaa tttgatttgg tggtgacata gtcagaagcc 600 aatttcagat cgaaattatt aaaatcggct gggcggcaga aatgagcaga aagcgccaac 660 aagcgagtaa caaaacgaga atggcgctga atgaaaataa caaatggtgc gaaatgcact 720 aacgacaaaa tcaaagatga aaacaatggc gtaagcgcca cgtcagcaga acgcttttat 780 gcagaatttc aaagtttgaa aatcattctc atcataaata caaccgccgt cagtcgattt 840 tgaaattagt gtttcttgtc tttgcattct agagctagtg tagtaaaaat tgcagaatcg 900 tatca 905 // ID Proto2-8_CS1 repbase; DNA; INV; 4284 BP. XX AC . XX DT 15-JUL-2009 (Rel. 14.07, Created) DT 15-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Annelida Proto2-8_CS1 autonomous Non-LTR Retrotransposon - DE consensus. XX KW Proto2; Non-LTR Retrotransposon; Transposable Element; KW Proto2-8_CS1. XX OS Capitella sp. 1 OC Eukaryota; Metazoa; Annelida; Polychaeta; Scolecida; Capitellida; OC Capitellidae; Capitella. XX RN [1] RP 1-4284 RA Kapitonov V. and Jurka J.; RT "Proto2, a novel clade of metazoan non-LTR retrotransposons."; RL Repbase Reports 9(7), 1563-1563 (2009). XX DR [1] (Consensus) XX CC Proto2-8_CS1 is a very young family of non-LTR retrotransposons CC present in the annelid genome. It belongs to a novel clade of CC metazoan non-LTR retrotransposons called Proto2. This clade CC includes families of non-LTR retrotransposons present in the CC hydra (from Proto2-1_HM to Proto2-5_HM), annelid (from CC Proto2-1_CS1 to Proto2-8_CS1), hemichordate (Proto2-1_SK) and CC amphioxus (Proto2-1_BF) genomes. A model Proto2 non-LTR CC retrotransposon is ~4.3 kb long, contains two ORFs; its 3' CC terminus is composed of microsatellites; usually there is no CC target site duplications generated upon retrotransposition of CC Proto2 elements. ORF1 codes for a protein conserved in Proto2 CC elements from all species mentioned above. ORF2 codes for a CC protein composed from the AP endonuclease and reverse CC transcriptase domains. It appears that the Proto2 clade is a CC clade ancestral to the RTE and RTEX clades. XX FH Key Location/Qualifiers FT CDS 115..1236 FT /product="Proto2-8_CS1_1p" FT /note="ORF1." FT /translation="MAGDIVVCELLCFAVNRIDNTSNDIIIKLCTDFYSPK FT EIEHAKELLHNACSQSRSDLERLRPRKGPKKAQSDMTDILQLVHEMGTDVP FT CFVAVNLAKLPVLGMENINLAAMVADIVSLKKQMAELKCFVQRVDTVESNR FT SPLVATVAAPHIMNPPLDESPTLNDSDSETDETFPKTIDSCVIQPNADTDI FT AVNESHREALSAPASQTISSGQQQQQQAADQLTSYVDAVNRDEPFSVVSPK FT RQYKKKAFLSGQHAASNPQRGRRTNSTHRASHEDVVIGSGTQTRSLKAAKH FT NSSTSNSDGIFVSRLRAGTTFNDMRDHLFRHAGLRLKCVPIRSRNDHLYAS FT FRIVCNAGEMKRLLVPEMWPKGVIARKFVSK" FT CDS 1280..4165 FT /product="Proto2-8_CS1_2p" FT /note="ORF2, AP endonuclease, RT." FT /translation="MLVVILIMNLHPQQKSTVRVTSFNCKNVKSSIPDVYT FT MCCNSDRVFLQETWLSPSEIVILKALHPDFYADGVSAFQDDEQLIGRPHGG FT IGILWRKTLGAHVNCLKYDGERRLMALSVKHGSHNLLLLNVYLPYFTGSDN FT DDNYHEFMAVIGKAIGIIGTSPTLNACIIGDFNACTTKNTLFGKEIRLMCD FT DNGLLLSDVTLLPPDSFTFYSEVHQSTSWLDHCVSTLQAHLSITDINIAYD FT VLSSDHFPLAMSFELPTSELSGPEPVRSTTIRDNTASWERATKSDIEAYHR FT RCSPLLSSIAIPHVALLCSDAQCDNISHREAISKLYRDITNAMLTSSAASI FT PTKRQQRHKPVPGWNEFVKEHHAAARLSFTLWVSAGKPRQGSLHEQMSRDR FT ARFKFALRRCKRDEAQIKADKLAESFLPLDNTAFWKEVQAQSLRSTPLVAS FT VNGTSGEDAIANMWAKHYGEILNASDTDTSRKRVIKELHAAYESAVSEGSL FT NNSLFSPREVEKSIAKMKRGKAAANDGLSAEHLIYAPSQIHVLLMMSFNLS FT LTHNYLPPFVTESTIIPIIKDKSGNGSDIGNYRPIALSSAITKVLESLILC FT KAASFLSSSDHQFGFKSHHSTDMCVYALKEIVRYYLSKSTPVFACFMDASK FT AFDKLNYFTLFEKLLKRKMPVLIIRILFYWYCTQQIRVKWGSAMSNYFNVS FT NGVRQGGVLSPVLFNIYVDDLSFMLRDSRIGCSVSESICNHLFYADDLVLL FT SPSEKGLQHLLNLCYNYSISHDITFNPKKTVCMLFKPRTSTFKKHCKVSLG FT GNVLSVTPSCKYLGVFISNDCSDNTDLARQLRSFYVRSNYLSRYFSACTPA FT VKCSLFTTFCGNIYSGHCWSIFKKSAMRKLTIAFNNSFRRFMFYPRFCSAS FT GMFAFNHVKSLNEILRHAIFNFQKRILNSSNNLICSILSCTVFNSSLWHHW FT KSVLFMN" XX SQ Sequence 4284 BP; 1147 A; 955 C; 892 G; 1290 T; 0 other; tatatcggtc agttaaagac agccgtcaat ttttaacctt ttggggctgt taaagacagg 60 ttatccactg acctcggctg ctttgttgac aaaacaaagc agcgtgcatg cgccatggct 120 ggcgatatcg tcgtttgtga actcttgtgt ttcgctgtga acagaatcga caacacatcg 180 aacgatataa taattaaatt atgtaccgat ttttattctc caaaagaaat tgagcatgca 240 aaggagctgt tacataacgc atgcagtcaa tctcgatcag atctcgagcg cttgcgtccc 300 cgcaaaggac ctaaaaaagc ccagtctgac atgacggata ttttacaatt ggtgcatgag 360 atgggcactg atgtcccttg ttttgtcgct gtcaatctgg cgaaattgcc agtattgggc 420 atggaaaaca ttaatcttgc tgcgatggtt gcagacatcg tttctctcaa aaagcaaatg 480 gcagagctga aatgcttcgt tcagcgggtc gacaccgttg agagtaatcg ttcccctctt 540 gttgctactg ttgctgcacc tcacattatg aatcccccac tcgacgagtc acccaccttg 600 aacgacagtg actcagaaac cgacgaaact tttccaaaaa ccattgattc atgcgtcatt 660 caaccgaatg cagacactga catcgccgtg aatgaatctc acagggaagc actctctgct 720 cctgcctctc aaactatctc ctctggacag cagcaacagc agcaagcagc tgatcaattg 780 accagttatg tggacgcggt gaatcgagat gagcctttct ctgtagtgtc gccaaagagg 840 cagtataaga agaaagcatt tttatctggt caacacgctg cctccaaccc acagcgtggt 900 cgtcgcacca atagcactca ccgtgcttcc catgaagacg ttgtaattgg cagtggaacg 960 cagaccagat ctctcaaagc agcaaaacat aattcttcaa ctagtaactc tgatggcata 1020 ttcgtgtccc gcttacgagc tgggaccacg tttaatgaca tgagggatca cctgtttcgc 1080 catgctggtc tccgtttgaa atgtgtgcca atccgctcca gaaacgacca tctatatgcc 1140 tcgttcagaa ttgtatgcaa tgcaggtgaa atgaaacgcc ttctcgtgcc tgaaatgtgg 1200 cctaagggtg tcatcgcgag gaagtttgtg tcaaaataat ttcaataccg aattgtactt 1260 tgttgcatgt ccttatgtga tgctagttgt aattctcatt atgaatctac atccccaaca 1320 aaaatcaact gttcgtgtaa ccagttttaa ttgcaagaat gtcaaaagct caattcctga 1380 cgtatatact atgtgttgca acagcgacag agtatttcta caggagacct ggctctcacc 1440 ctcagaaatc gttattctaa aggccctgca ccctgacttc tacgctgatg gcgtgtccgc 1500 tttccaggat gatgagcagc tcatcgggag gcctcacggt ggaattggta ttctgtggag 1560 aaagactctt ggcgctcatg tgaactgtct aaagtatgat ggcgaacggc gtttgatggc 1620 gctttcggtg aagcatggtt ctcacaacct cctcctcctc aatgtgtatc tcccctactt 1680 caccggctca gacaacgatg acaactacca cgaattcatg gctgtgattg gcaaagcgat 1740 tggtatcatt ggcactagtc caaccttgaa cgcctgtatc attggtgact tcaacgcttg 1800 tacgacaaaa aacactttat tcggaaagga aattcgactg atgtgcgatg acaatggcct 1860 gctgttaagt gatgttacgc tgttaccacc ggactctttc actttttata gtgaagtaca 1920 tcaatcaacg tcttggcttg atcactgtgt gagcacgctc caggcacact tgtcaattac 1980 agacattaac attgcatatg atgtgttatc ctcagaccac ttcccgcttg cgatgtcctt 2040 tgagctgcca accagtgagc tgtctgggcc tgagcctgtg cggagcacca caattcgtga 2100 caataccgct tcctgggaac gtgccactaa gtcggacatc gaggcatatc atcgacgatg 2160 ctcgcctttg ctctcgtcca ttgcgatccc gcatgtggct ctcctgtgct ctgatgccca 2220 gtgcgacaac atcagccata gagaggccat cagcaaattg tacagagata taacgaatgc 2280 catgctgacg tcatcagctg caagtattcc gaccaaacgt caacaaagac acaaaccggt 2340 gcctgggtgg aacgagtttg taaaggaaca ccatgcagca gcgagacttt catttaccct 2400 gtgggttagt gctggaaagc cgcgccaggg gtcactgcac gaacagatgt cacgggatcg 2460 agctcgcttt aagtttgctc ttcgccgttg taagcgcgat gaagcacaaa ttaaggcgga 2520 caagctggcc gaatcattcc ttccactgga caataccgca ttctggaaag aggtgcaagc 2580 ccaatcgctg aggagtactc ctttggtggc gtcagtaaac ggtacatctg gtgaggacgc 2640 catcgccaac atgtgggcaa aacactacgg cgagattctc aatgcgagcg atacggacac 2700 aagccgaaaa cgagtaatta aagagttaca tgcagcttat gagagcgccg tcagtgaagg 2760 atcattgaat aactcgctct tcagcccaag agaggtggag aaaagtatag caaaaatgaa 2820 gagagggaaa gcagcagcta acgatggtct ctctgcagaa cacctcattt atgctccctc 2880 acaaatacat gttttattaa tgatgtcttt taatttatct cttacgcata attacttgcc 2940 tccatttgtt actgaatcta ctattattcc tattataaaa gataaatcgg gaaatggctc 3000 ggatatcggt aattataggc ccattgccct gtcttctgca attaccaagg tcttagaaag 3060 tttgatttta tgtaaagccg catctttttt gtcttcttct gaccaccaat ttggttttaa 3120 aagtcaccac tctactgata tgtgcgttta tgcccttaaa gaaattgtcc gatattacct 3180 ttcgaaatct actcccgttt ttgcttgctt tatggatgca tctaaggctt ttgataagct 3240 caattatttt acattgtttg aaaaactttt aaaacgcaaa atgcctgttc ttattattcg 3300 tatacttttt tactggtatt gtacacagca aatccgggtg aaatggggca gcgcgatgtc 3360 aaactatttc aatgtttcga atggtgtacg gcaaggcgga gtgctgtctc ctgttctttt 3420 taatatttat gtggatgatc tgagttttat gctacgtgac agtcgtattg gttgttctgt 3480 tagtgagtct atctgtaatc atctttttta cgcagatgac cttgttttat tgagcccttc 3540 cgaaaaaggc ctacaacatc tgttaaacct ttgttataat tatagcatat cccatgacat 3600 tacttttaat cctaagaaaa ctgtatgcat gctttttaaa cctcgaacct ccacatttaa 3660 gaagcattgc aaagtttccc tcggtggtaa tgttttaagt gttacaccgt catgtaaata 3720 cctgggagtt tttatttcaa atgactgttc agacaatact gacctcgcaa ggcagttaag 3780 gtctttttat gtacggagca attatttatc cagatatttt agtgcctgca ctccagctgt 3840 taagtgttca ctttttacaa ccttctgtgg taacatttat tctggccact gctggtctat 3900 ttttaagaaa tcggccatga ggaaattgac gattgctttt aacaactcat ttcgtcgctt 3960 catgttttac cctcggtttt gtagtgccag tggaatgttt gcatttaacc atgttaaatc 4020 tcttaatgaa attcttagac atgctatttt taacttccag aagcgaattt taaactcaag 4080 caataatctt atttgttcaa ttttatcgtg tacggtcttt aattcatctt tgtggcatca 4140 ttggaagtct gttttattta tgaattgatg tgatgcattg cttatttatt attgtttatt 4200 tttattttca tcttatttat ttcatttatc atatgcatgt atatggattt cagtctgaaa 4260 taaactatta ttattattat tatt 4284 // ID Gypsy-9_RP-I repbase; DNA; INV; 5651 BP. XX AC ACPB02035853; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_RP_; KW Gypsy-9_RP-LTR; Gypsy-9_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-5651 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02035853; Positions 107255 112905. XX CC Positions [3728-4204] - Integrase core CC 'TGAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1378..3255 FT /product="Gypsy-9_RP-I_1p" FT /translation="MGWDLMRKLGLVVDAGRGQVTIKPRNPSVALVENGCK FT EKAVSYGTDNQENRERHPNRSREGSASEIGVKVARCQKIPGHSECLVEGKI FT ARDLEGDILLEPLMPIEGGLRIARSLNRAEGRRVWVKVINLTESEVNVGKG FT QWVATAERVDLGGESSHSKVVGAVGQEVVPEGEVEVECIGKLSHLAEGDRG FT KLMKVLRMYLQVFVEPGPEGCKLPVYHRIKTTEEGPVIKRPYRVPYNQRPI FT VEEHLREMLDKGVIAPSSSPWSAPIVLVPKKSRDGTCKYRFCTDFRGLNKI FT TKVDPYPLPLITETLENLGRSRYFSTIDLASGYHQIPIHPEDCEKTAFSTI FT GGHFEYKRMAFGLVNAPATFQRLMDQLLSEIKGEECLVYMDDIIIYSPTLE FT EHCRRLSNVLHRLERAGLKANLGKCAFAQREVQYLGHIVSSEGVKPDPNKL FT VAINSYPAPNSAKEVKSFLGLAGYYRRFIKGFADLAQPLTSLLKKDVAFTW FT GKEEQSAFESLKGALCSSSVLVYPDFRDSFILATDASGIALGAVLSQMCAG FT QERPVAYASRQLTPAEQKYSTTERELLAVVWATTQFRCYLLGRKFKLVTDH FT AALKWMLSVKDPSAKIDTLVPKIVRIRV" FT CDS 3512..5008 FT /product="Gypsy-9_RP-I_2p" FT /translation="MVPAKQRIRLIKAHHDTPWSGHPGIERTLTRIKENFF FT WPEMARNVELYVKSCQSCNERKTPSGLAVPLGRVNEATYPFELVSLDVVEC FT PLSSRGNRYLLTMVDHFTRYAEVIPMRTQTAEETPRAFVHNIILRHGAPRQ FT VLTDRGTNFLSNLFKAVCHLLHIKKLQTTAYHPQSNGIVERMHRTLIDSLS FT HFVRKDGRNWDRWVPFVVMAYRSTVHSSTGFTPHYLLYGRDFEMPFDFPYT FT WQQRAASVEDYVMQLKERLSAAYAQAKGSEAKARESWTRFYDAGKKDRVFR FT VGDKVYLLEPAVPQGQAKKFYKPWTGPHEISAQLPPCDYELALNTGGRYVV FT HANRLKPAYQAREDHGSGQANVEAHPSRVAEEPTTSGSNSPADNEQGWWED FT LSETPPRVSLAPSTPGEAVEEGEEDGPESSESTLAEGDSSWHPTPGSGIQQ FT KTQGLLICYVGRRASRNLTPFLGRLGSICGRGIVESAWRMNNGELPDSVLI FT PTD" XX SQ Sequence 5651 BP; 1544 A; 1205 C; 1646 G; 1256 T; 0 other; ttggtgtcag gtgtggggta tagtcggaaa cccgtaacgt ttccccgtcc ttagtattag 60 tgctctggaa ttatggaaga ggtgaaggca ctaaaggagg aggtagacag gttgaaagag 120 gaggtgcgct ggatgaactc ccaaagggaa gcagcaggcg tatcctctaa tgaagggggg 180 gttaagcgag atctttccct tcaatcttta atcatgcctt ggagtgggga catatcagat 240 aggcctattg accaatttct gcaaaatttt tccttagtgg ccgaaagagg cgggtggtca 300 gaggcggata aaattatgat atgtcgtctt aaactaaaag gtgctgcagc agattgtgtg 360 gcttgtcgac cggagctatt gctacccacg gcgacctttg cggatatggt tgatgtgatt 420 agaactcgct ttgtggggga ggctacgcct gagcaaaagc tgttggagct taactctctt 480 gaacaactgt cgggggaaga tgcgcgtcag tttgcagatc ggtgtcggca agtcggggag 540 gaaacattgc ctatctcggc atcagtagct gaagcggggt gggctagggc tcagcttgaa 600 agaattctta tggctgccta tataaaaggg ctcaaaggag aaccaagccg acaactacaa 660 tacgatcccc gaaacgtttc acgaagcgtg gcacgggctt cgcgtattga gcaagtgagg 720 tgaagtcaac ctctcacggg aaatatgggc tgtgcaagga agttccggtc tagaagggaa 780 gacaacagtg gagccccaac gatcagggag catgtgcttt cgttgtggga gaccgggaca 840 ttttgcaaag tcatgtgatc tgagaaggag gaagagcctt gggtggggta accaacgcag 900 tgaggctgct ccctcaaaaa gggagacagc ctgttttgtt tgtggccaaa taggacacta 960 tgcgcgagag tgcgctcgac gagctacctt agcaaaagtc aggacttgtt atgagggaac 1020 aggcccgcac ccaaaagcgg agggctcgac cgatggcccg acgtcgaatg cccgttaaaa 1080 ccgagggccc ctgccctggt gccaagtgta gcaacgggga agtctaaggg tttaacaata 1140 cctgtgttta caggaaagag tcggttggaa ttacttgttg acagcgggtc agaaatttcc 1200 attttaaaac aagcaattcc tggagtggta gtcaagcagt cccgtattaa agcaacagga 1260 gttacgggga gttcggtacc cattagaggt gaacaggaac tggagttctt tgtagagggg 1320 agccgggcga accacacctt tgtgatcgct gcggtagcca ctcgcgggga tggtctcatg 1380 ggttgggacc taatgaggaa actagggttg gttgtagacg ctggccgagg tcaggtcaca 1440 attaaaccca gaaacccgtc agtagcactg gtggagaatg gctgcaagga gaaagcagtg 1500 tcctatggga cggacaatca ggagaacagg gaaaggcacc ctaacaggtc tcgggaaggc 1560 agtgcaagcg agatcggcgt gaaggtagca cgctgtcaga agataccagg tcacagcgaa 1620 tgcttagtgg aggggaagat agcccgtgac ctggaagggg acattctttt agaacccctg 1680 atgccgattg aagggggctt gaggatagcg aggagtttaa accgggctga aggccgccgg 1740 gtttgggtga aagtgataaa cctaacggag tcagaagtga atgttgggaa gggtcagtgg 1800 gtagcaacag cggagagggt ggacttagga ggcgaatcgt ctcactcgaa ggtggtaggt 1860 gccgtgggtc aggaagtggt cccagaggga gaagtggaag tcgaatgtat tgggaagtta 1920 agccacttgg ccgagggtga tcgcggtaaa ttgatgaaag tgctacgaat gtatctacaa 1980 gtctttgtgg agccagggcc agaagggtgc aagttacctg tgtatcatag aattaagaca 2040 acagaggaag gccccgttat caagcgtccc taccgggtac cctataatca gagaccaatt 2100 gtagaagaac atctccgcga aatgctagat aagggggtaa ttgcaccatc aagcagccca 2160 tggtccgccc cgatcgtatt agttccaaag aagagcaggg atggcacgtg taagtaccgt 2220 ttctgtactg attttagagg gcttaataaa atcacgaaag tagaccccta tcctttacca 2280 ctgataacgg aaacgttgga gaacttgggt agaagtcggt acttttccac tatcgatttg 2340 gccagtggtt accaccaaat ccccatccac ccagaggatt gcgaaaaaac cgctttttca 2400 actatagggg gccattttga gtacaaacgg atggcttttg gtctggttaa tgcgccggct 2460 acgtttcaac gattaatgga ccaattattg tcggaaatca aaggggagga gtgcctggtt 2520 tacatggatg atataattat ctacagtcct actctcgaag aacattgccg acgattaagt 2580 aatgtactac atcgcttgga gcgtgcgggg ttgaaagcaa atcttgggaa gtgtgctttt 2640 gcacagcgag aggtgcagta cttggggcat attgtatctt cggaaggcgt aaagccagat 2700 ccgaacaaat tggtcgccat caactcgtat cccgccccaa actctgcgaa agaagtaaag 2760 agtttcttgg gtcttgcggg gtattaccgc cgcttcataa aggggtttgc tgaccttgcg 2820 caaccactta cctcgttact aaagaaagat gtggcattca cttggggcaa ggaggaacag 2880 tctgcgttcg agtctttgaa gggtgcactg tgttcatcat cagtgttagt atacccagat 2940 tttcgggaca gttttatatt agcgaccgac gcttcgggga tagcgctagg agcagtactc 3000 tcccaaatgt gtgcggggca agaaagacca gtggcttatg ctagccgtca actaacaccg 3060 gctgagcaaa agtattccac cactgaaaga gaattgctag cagtggtctg ggccacaacc 3120 cagttccgtt gctatctttt aggtcggaaa tttaaactcg tcacagacca tgctgctttg 3180 aaatggatgc taagcgtgaa agacccctcc gcgaaaattg acacgttggt ccctaagatt 3240 gtccgaattc gtgtatgatg tggaacatag accagggatt aaacacctta acgctgacgg 3300 gttaagtcga agggtcgctt cagcttccaa gccagagttg ttcggtgaag tgcagtgggc 3360 cacggggaaa agcgaccagt tcgggcaaga gcagtctaaa gatcgctggt gtagccaaac 3420 caggaagtcc tggccggaga aaacctatgt tcacgaatcc gggctcttgt actggacgga 3480 tggtaaagcc caggaagacc agtggcgaat aatggtacca gctaaacaga gaattcgttt 3540 aatcaaagcc catcatgaca ccccctggtc cggccatccc ggtatagaga ggacgttaac 3600 tcgtattaaa gaaaactttt tttggcccga gatggccaga aatgtggaac tgtacgtcaa 3660 gagctgccag tcgtgtaacg aacgtaaaac accgagtgga ttagcagtac cattgggccg 3720 ggtaaatgaa gccacttatc ctttcgagct ggtctccctc gacgtggtgg agtgcccgct 3780 aagtagtagg ggaaaccgct acctccttac tatggtcgac catttcacaa gatatgcaga 3840 ggttatcccc atgagaacgc aaacggctga agagactcct cgggcatttg ttcataatat 3900 tattttaagg catggggccc ctcgtcaggt tctgaccgat agaggtacga atttcctgtc 3960 caatttgttt aaagctgtct gccacttgct gcacataaag aaactgcaga ccaccgccta 4020 ccatccccag agcaatggga tagtggagcg tatgcatcgg acactaattg actctctttc 4080 ccactttgtt cgaaaggacg gccggaattg ggatcgatgg gtgccatttg ttgtgatggc 4140 gtaccgctct actgtgcatt cgtcaacagg gtttactcct cattatttgt tatatgggcg 4200 agactttgaa atgccatttg attttccgta tacttggcag caacgggcgg cgtcagtgga 4260 ggactatgtg atgcagttaa aagagaggct ctctgcggcg tatgcccagg ccaaagggag 4320 tgaggctaaa gcccgagagt cgtggacccg tttctatgac gcggggaaga aggaccgggt 4380 atttcgcgta ggcgacaaag tgtatttatt ggaaccggcc gtaccacaag gtcaggcgaa 4440 aaaattttat aaaccgtgga ccgggccaca tgagataagt gcacaacttc ctccctgtga 4500 ctatgaattg gctctgaata cagggggtcg atatgtggta catgccaacc gcctcaaacc 4560 tgcctatcag gccagagagg accatggtag tgggcaagca aatgtagagg ctcacccctc 4620 aagggttgca gaggaaccaa ccacctcagg tagtaattcc ccggcggata atgaacaagg 4680 gtggtgggaa gacttgagtg aaacccctcc cagagtctcc ctagcaccat ccacaccggg 4740 tgaggcggtg gaagaggggg aagaagacgg ccccgaaagt tccgaatcta cgttggcaga 4800 gggagattca tcgtggcatc ccactccggg gagtgggatc caacagaaaa cccaaggtct 4860 ccttatttgt tacgtgggca ggagagcgag caggaatcta actccattcc tagggaggtt 4920 aggatcaatc tgcggccgag gcatcgtcga gagtgcatgg aggatgaata acggcgaact 4980 accagattcg gttctcatac ctactgattg acggcaaaac tattggaata taagcttatg 5040 aaacaaaacc aaagtattag tattgataac tcgggctcag atgagcagta tatggatggg 5100 catgtgcggc ggtggggatg tggacggaga aatggccgtg ggaaaatgag aggatctaca 5160 gagtaagctc aaaatggatg ttccacaagc tgaataccag caaagagaga ttttggtagg 5220 ccaactcaga aatccagtca gccatctaag agcggagtat ctttgagcag gggagatcat 5280 agggctgaag attgccttac gtaagccaca aacataggca atgagatcgt cgaggcacct 5340 aggaagaaag aaactcaccc tgaaaacctg acccgaaggg aggagtagcg ttggaagctg 5400 tgcgggaaaa atggtagggc aagaggagcc ctcggagcga tagctttcca gatgttagag 5460 ttcgagaagt ggcaatatcc tcgccaatgg gatacaccta catcgcatca actgcgaaga 5520 gagctaccag gaagattcca ggttcagcgg cccatgtgga ggggcactag gagtgaacca 5580 gaagctgcgg agtcaactac ctcccctgag cttgaaaagt ggagtccact tttcttcaaa 5640 gggggggata a 5651 // ID Gypsy-621_AA-LTR repbase; DNA; INV; 928 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-621_AA_; KW Ty3_gypsy_Ele43; Gypsy-621_AA-I; Gypsy-621_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-928 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 928 BP; 291 A; 172 C; 220 G; 245 T; 0 other; tgtagagaca ggataaacaa aatataaatg ttggaataat attaataatt aaaaaaaata 60 tgtaatatga ttattctaat tcagaacatt cgtttgaata tatgctctag gaaataataa 120 taatgcataa attttcaaat aaatgaatat ggtaacttta tatatatata ttaaaaaaaa 180 aaagaaaaca gaaaacaaaa attgaatcga atttcaattt tgtcgacgaa agagaaaaca 240 atgaacatcc ctaattccat tgacttctac catccactca ttggttcagg gtcgcgcggt 300 gtttattggt taaaaaaaag gagaagggga gatgatgcaa tcgcgacttc actactagta 360 taaatagagc accgcagaaa gatttcagtt cactttttcg gcggacgtgt tacaagtcac 420 atcagcagca gcagtaacca gttaaagaag cagctccgac agtaagacga atagctttcg 480 cttgttggca tcgatccgga agaagcagca gtcaccgttt gagagtgcgg tcgactgttt 540 tgcttgcggc atccgacaat aagacgctag cgagtggcat cgctttcgcc tgttggccac 600 gatccggctg taagacggga gtgagtggca tcgctttcgc ttgttggcaa cgatccggaa 660 gaagcagcag tcaccgtttg agagtgcggt cgactgtttt gcttgcggca tccgacaata 720 agacgctagc gagtggcatc gctttcgctt gttggcaacg atccgactgt aagacgggag 780 tgagtggaat cgctttcgct ggtcggcaac gattttgtca ggtggaaact atcggaagtg 840 gattccgtta gctactttgc ttatcagcaa ctgcctggaa gtaacatggc agatctacca 900 ggttttgaga atttggccgg tcgcgaca 928 // ID MosquI_Aa2 repbase; DNA; INV; 7202 BP. XX AC AF134900; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 06-OCT-2010 (Rel. 15.11, Last updated, Version 4) XX DE Aedes aegypti non-LTR retrotransposon MosquI-Aa2, complete DE sequence. XX KW I; Non-LTR Retrotransposon; Transposable Element; endonuclease; KW reverse transciptase; gag domain; pol domain; Mosqul_Aa2; KW MosquI_Aa2. XX NM Mosqul_Aa2. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Tu Z. and Hill J.J.; RT "MosquI, a novel family of mosquito retrotransposons distantly RT related to the Drosophila I factors, may consist of elements of RT more than one origin."; RL Mol. Biol. Evol 16(12), 1675-1686 (1999). XX DR Genbank; AF134900; Positions 1 7202. XX CC Related to Drosophila I factors. Contains putative CC transcription initiator, and putative downstream promoter. XX SQ Sequence 7202 BP; 2099 A; 1856 C; 1503 G; 1744 T; 0 other; tattgaggta agttttttct aaatgaaaat ttgcgtatgg tattattcca actctaccaa 60 cggctagtca tcgaacccag actgttataa accaaccatt tttttccaat acatgtggtc 120 gactttttcc cttcaaattt tgacctattt ttggaagcac ttctcgacaa caagagttga 180 atttttcaat taggttcaat tgaattaggt accctaacta tagttcaaag tcgaacagtt 240 cgacagctca tgtcatagcg agcagacgcg tgtatgctat cgctcccgcg tttctaactt 300 cgcttcgacc gtttcagcaa acagtgcatt atctcaattt gactacctcg tcggacctgg 360 ccaggcccag gagtataata gaaaacatcg aagtaggccg gccagtaagt ggccgaaagt 420 gcgaaatctg gtgccaaact accagtgccg ggaccgtgat ataaagcgtt cttgactgaa 480 tagaacaaaa aaagcttttc tctttgttct tgctcagacg aacaaccgct tagtgagtgg 540 catctcgtgc tgtgcggttc gttttgccta ttgtttgtga agaatatctc cgcaatagtt 600 caaattaatt agtgcttgct gctgcccccc tcgacatgcc gccggggcta gaccccttcc 660 ccctagggga ccctggcccg tctaattgtc gtgatggtga attcgctggc cctcgttttc 720 ctgaatgggg agaccctcaa ggcaattacg gtcaactggt aatgatgcga atggaagccg 780 tctcggggaa tcttccacag gctcctttcc ttctccgaaa atcggtggag agctatctgg 840 gagcgaaggt tgatggagca tatcccgaaa aaggaggtac aacgtatgtt ctgaagctga 900 ggaacaaaaa tcacgctgaa aagctgaaaa ggatgtcaaa attgactgat ggcttcccga 960 tcaagatcgt ggagcatcca gtactaaacg tgtcaaaatg tgttatcagc tgcagcgata 1020 cgtgtgtcta ttctgatacg gagctggtag aggaattgaa agatcaaggc gttaaggaag 1080 tacgaagaat caccaggcgt gacggaaacc agcgaatcaa cactccaacc atcatattga 1140 cactccaagg taccgtcatc ccagaggaca tctatattgg atggatcaga tgccgcaccc 1200 gtccattcta tcccacgcct atgctgtgtt attgctgttg ggatttcgga cacacgcgag 1260 ctcgttgtca gcatcagaac aaccctacat gcggtaactg ctctggtaag caccaaacag 1320 acgtggaaaa cccatgcctt ctcgcggcgt tctgcaaacg gtgcaatacc aatgaccatc 1380 cactctcgag ccgaaaatgt cccacctatg tcaaagaaaa cgaaatccaa cacctacggg 1440 tcgatttagg catttcctat ccagcagcaa aacggcagta tgagcagaga cacggttcaa 1500 aatccatggc atccatcgta gcagatagta acgatcggcg ttatgccgaa ctttcggcga 1560 aactcgataa cgtcctaaag gtagtaaaaa gcaaggacgc tgagatcgag actctactta 1620 ccgaactaaa aaacaaagac gcacacatcg agaaactgga atccactatg aagctcacac 1680 cccaagaacg gctgaataca gtgaaagaac acggcactat caaggacctt gttaacaaga 1740 tccactctct ggagtcagag ctagctagga aagacagaga agtcgctatg atccgtgaaa 1800 tctacatccc caagaaatcc tcaaccagca ctggtaacaa aattaccaag cccgacgtat 1860 caaacgggtg taatacggga cacgatgatc gcaattcttc acacactaat cagtctgagg 1920 catcgcaaat caagaaaaca ggcaagaaaa agaaagctaa caaaaccgac agctacgaga 1980 ggcttaacaa acgtaccaaa aatcgccttt cgccaagcac aactcccgaa tgtacattta 2040 caccggaaac ctcacctgct gacatcgaaa tttcttcggg tgatgaacca atgttgaatg 2100 tatccgggta catatccgac tcctaaacgt cacatcacgc atctagtccc ctcctgaact 2160 tatgtttctt gcctgaactg atcaacaaaa ttaacactgc acttcacctc tcaacaaacc 2220 gctaccgaaa ctgattcaaa aaaggagcgg actagccaag caacagcgac ttacagaaaa 2280 cacatcccat catcgaagcg caaaccaact ccaattgcat ctatcaacga tctggcaacc 2340 tttactggcc aatgcgacta cccggcggag ccaccgagca gtcgaggcac cgccagtgcg 2400 gacgtatatg ccacaccgga actgacggac aaccctcgac actccgtggc agtcggaaga 2460 gggacggaca agggagacaa ctcccatccc cctgaggaca acgagggatt taattcacaa 2520 ccaatacaaa ctaagcaaca aaaatccacg ccggagatgt acttggaacg acagtcttgc 2580 atcggtgaag ctctatcgac gttaagattt cgaacacaac ctcgaattag tcctgaagat 2640 tcccgttggc caacgctgcc tcaaagtata aggtccaacg cagcgctgta tgattcatac 2700 caattcaaac atagagggac ctctattcca aaagcaggct cttctggttg gtgtcttccc 2760 attgtagagt acactggtga tgactgcagt gccccaagtc gacgcgagat cacagaaaca 2820 gtatttctta gtagaccacc caccccaagc acaagatcac tgtcagaaca accaactctc 2880 cgccatccac gaaatatatc cagtatgccc tcggttagca ccttgttcgc catacaatgg 2940 aacatgaatg gcaatatcaa caatctgccg gaccttgagc ttcttgtcag agaccagcaa 3000 ccaatcatat tggccattca agagacgcat cggattaata ccaaccggtt gaaccattcc 3060 ttacagaaac ggtacagctg gaccctcaaa tgtaatgaaa acatctacca ctcggctgct 3120 attggagttt taccaactgt tcctttttct ccgcttcctc tcaacacgga cttaccaatt 3180 gtggcagtta aacttgagta tccgtttcct atgaccgtga tatcgggata cctgccaaat 3240 gggaacattc cggatctcaa acttcgcctc tttaatgctc ttcagagtct aagtagccca 3300 atactagtta tgtgggacgt gaacggtcat catccggaat ggggcagtgt ttcacctaac 3360 gataggggtt cactcattat ggacgtagca gaaaagcttg acctagttat tctcaatgat 3420 ggagccacta ccttcaccag aggccaacgg aattccgcaa tagatgtcag tttatgcagt 3480 ccaagcatag tcaaccgact cctctggact gccaaagagg atcccatggg tagtgaccac 3540 catcccatct tcatacacat cgacgaacaa cctcaagcta cttctcgtcg tccgcgctgg 3600 aaatatgacc aagcagattg gtctcacttt caaacgctga ttgacaccga agtttctgac 3660 aaacccccgg ataacatcga aggtttcctt agtgttttgc atcaagctgc ctctacatct 3720 attcctcgca ctacaccaaa cccaggtcgg agatcattgc catggtggtc tccagacata 3780 aaaaaagtca tcaaagagag acgaaaggcc ctccgtgccg caaagcgtct ggctgatgac 3840 catccagaaa aatcaaaatt aacggaaatc taccgctcca aacgcaacga atgtcgccag 3900 cgtattcgag atgctaagaa aaaaacatgg gaagacttct tggaaggaat caatgctaac 3960 caaacggcat ccgaattgtg gaatcgagtc aacgcattga atggcaaacg acgagcaact 4020 ggcatgacac tacggttacc aggaggccta actcgagacc cactcttaat agcaaacgct 4080 ctagctgatc acttcgcttc tctatcctct ttagaccgat atggtagaaa ttttatctta 4140 aagaatcaag cttctatcga tagcataacc aatctagtta tcccagaaga taacacacct 4200 ctccgcataa actctccttt ccgtatggaa gaactaaact ttgccctacg ccattgcaaa 4260 agtaagtctg caggtccaga tgacctcgga tacccattgt ttcagcacct ttcaatagtt 4320 tcgaaggcaa cccttcttga tctgctgaac aaaacatgga tcgagaatac tctaccgaag 4380 tcctggacac acagcctagt tgttccaata cctaaacacg gaaaagccgc cacttcccct 4440 ggtgacttta gaccgatctc actaacatgt tgcgccagta aaatacttga acgtatggta 4500 aatcgacggt tgatccgttt cctggaagat aatcagtttc tcgaccatcg gcaacacgct 4560 ttccgtcctg gtcacggagc agagacatat ttcaccggtc tgggtgacgt ccttcaagac 4620 gcgatgggca aaggtttaca tgctgacatg gcctcattgg atctcgctaa ggcctacaac 4680 cgcgcgtgga cacctcacgc tattcgtcgg cttgctgatt ggggcctgtg tggacacatt 4740 ctccattttc ttaaaaactt cctgaaaggc agaacattcc aagttattat cggcaacaac 4800 cactcctcaa tccgtgcgga agaaactggg gttccgcagg gatctgttat agcggttact 4860 atcttccttg ttttgatgaa caacattttc gaagcattgc cgaaagaaat ctacattttc 4920 gtctacgcgg acgatatact cttagttgtc atcggccgta ccctaaaatt catcagacga 4980 aaactacagg cagcagtatc tgcggtagcc agatgggctg ctaactccgg ttttgacctc 5040 tcggcggaga aaagtgttat ctctcatgta tgccgctccc gtcatcgtgt tcttcaaact 5100 cccgttatgg taaacggctg cccgattcca tgtaggaaaa caatggttat acttggagtg 5160 cagcttgacc gtgaactacg tttcgatgct cacctaaatg cgatcaagag gaactaccaa 5220 acaagaatca atcttctccg caccttatcc aaaccgcata agagcagtaa cagagacatc 5280 cttgtaagaa tagcaaaatc tataatcaac agccggctct tttatggtat tgaactcttc 5340 ggtctggcag gtgacacttt aatcacacgc cttgccccta catataatca atccattaga 5400 ataatagctg gcttactccc atccactcca gcagatgcgg cctgtgtgga acttggagtt 5460 ctcccattcc gataccaagc tacagaaact ttgtgctgtc gaacgatcgc ctatttagaa 5520 aagaccactg gagatcatga ggtctttctc ctcagggagg ggaacagagc tctagacagt 5580 ttggcccatc aggagctccc cccggttgaa caggtccact gggtcggagc cagaaggtgg 5640 gacgctccag atttcctcgt agacacctcg gtttcgaaac gatttcgagc aggggataac 5700 tctcctgcta tgcgttctca tgtcacggag ttgttagcta gcaagtaccg aaactaccat 5760 caccgtttca ccgatggctc caagtatttg gacagaactg gcttcggcgt taccgacatt 5820 gataaaagct atttttatag actacccgat cagtgctcgg ttttctcggc cgaggctgct 5880 gcaattcttc tggcctctac aactcctgca cccaaaccaa tatgtgttat ctccgactct 5940 gctagcgtac tcgctaccat caactcatcg tcaactcgtc acccatggat ccaagctgtg 6000 cagaagaact cgccctctca aaccgttttt ctatgggtac ccggtcattg cggcattcga 6060 ggcaatgtgg aggctgacca tcttgcatcg aaaggtcgat ccggtcgtct gttcaccaga 6120 ttaacgccag ggatggattt gaaaaactgg accaaatctc aaatccgttc atcttgggcc 6180 ctagaatggg tgaatttaag agataagttc atacgaaaaa tcaaaggaga aacaaaacgc 6240 tggattgata ctaacaatcg tcgtgaccaa caagtgttat ctcgtctgcg taccggtcac 6300 acccacgcta ctcacaatat gggtaacgaa cggccgtttc gcaaaaagtg cattgtctgc 6360 aacactacga tgtctgtcga acacatgata atcaattgtc cttgctttca agcccctcga 6420 gaacgccaca atatcccaga tagcatcaga gatgcgcttt cgaatgaagc ttccagcgaa 6480 gcagcaataa tatctttttt caaggatgcg ggactttaca acaaaatttg acaaatgtta 6540 tcacaacaag attcaaaact attgaacgat atacgatgac catgctgaca cactctgaca 6600 atggatttgg accaatgaag gactttctaa acgatactcg atgacgacta tgctgacccg 6660 gaatggctac caaattggac taccgaatgg acttcgacgc ttgaaatatt taaatctata 6720 atctgccaaa tgtgtacatt aactttgttt gacagggggg cctctcaata cgaagccctc 6780 tttttcccaa cgatggagac gaaccagcct ccggctgaaa gtctcgataa taaagataat 6840 aataataata ataatagttc aaagttgcta aatattattt ttgtgatgaa taaacatgga 6900 tggatgttgg accgttctaa taatcatgta ccatttcagg aaataagcac ccagctgggc 6960 cagaattggg tcccactgga tgccaaaacc ctcccgcctg ccctcggatt ctacccggta 7020 tcggatgacg ttcgtgacca atatatcctc cggtacattc gcaacggcgt tgctgcttaa 7080 gcttcccggt aagtactaca tgctccaaac catttcaatg tcctatattg aatgtttccg 7140 ttcaatgtgc gtaaggttca cggtgaccgc ttcaaacagg cacggaatca agatccgaat 7200 tc 7202 // ID BEL-592_AA-LTR repbase; DNA; INV; 512 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-592_AA_; KW Pao_Bel_Ele30; BEL-592_AA-I; BEL-592_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-512 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 512 BP; 142 A; 107 C; 125 G; 138 T; 0 other; tgttggttca ttatcaaatc tggcggcact gtgtgagaat catcatctca ctcagttgat 60 ggtgctatcg gtccgtggtt ttagacgtgc ttgtctggtt attctggcaa cacagttaac 120 ccaccgatga tgacgatagc taaccagcat actcagatca gccaccccac ccagttagag 180 ttaaagcgtt gatcagagaa cacctgctaa ttccaataag aatcttggcc ggagaataaa 240 tttaaggaac tataaagttg agtctacttg tgaagtgtcc aataaaatta caagtcgcat 300 gtatctagag cggtgttata cgtagctgaa gagtgaagaa agtgtcttct aaaggatagc 360 cgattcgtcc ccgtcgtgct atcgtgttgc tgctgcctgt gaacgttaag atcgagaaat 420 agggtgctgt cgcctagttt ggacccttgc caaccgacga attttggtgt aaattggaaa 480 gcacaacgta gggtaactgg cgagcgccaa ca 512 // ID BEL-85_AA-LTR repbase; DNA; INV; 689 BP. XX AC supercont1.324; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-85_AA_; KW BEL-85_AA-I; BEL-85_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-689 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.324; Positions 160013 159325. XX SQ Sequence 689 BP; 226 A; 135 C; 117 G; 211 T; 0 other; tgttgcagac actggcaaca cggacgatac agatataccg attcgcaaag cccctcgaac 60 ggattctgta tacgcaacaa atgatacaat gcagaataga ctaacagtcg aacaggttga 120 cagctcgaca gagatgaaaa gctcttaaga tgcaagtcat tctagtgcac tattggcatt 180 tcctaattct ctgctaaata ctatagtgat aatactacaa tcgtgaatta cttctgatag 240 aaactggatt agttgatcgc acttatttgt aggtttgttt gagtgatagt aagtttccct 300 caatacttaa taagtcttcc gaattcgata ggttgataca gtcgagtagg acaagcacat 360 tttatttgag ctggctttca gatattcgat atcatatcac accaattgta agtcctgttt 420 attcctctct gataatcact actgttatta tactctaatt aaatattatt gaattcgtta 480 gcgccgtatt acagtccagt ccccagtcag acacaaattg catttagttg gatttcgcta 540 ctaaggaaga ccaatatgta agcaacttaa ccttgaattt agaactattc ttcatattaa 600 aaaactaata aaatatctct agcttcaaga ataccgtcaa caaaactagg cgtttctctg 660 gagaggtcat ctgaacgaat aatccaaca 689 // ID Gypsy-149_AA-I repbase; DNA; INV; 4316 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-149_AA_; KW Gypsy-149_AA-LTR; Gypsy-149_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4316 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1023-1023 (2011). XX DR [2] (Consensus) XX CC Positions [3341-3721] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 189..3506 FT /product="Gypsy-149_AA-I_2p" FT /translation="MRWVQRLENAFAIYAIDDEEMRKNLLLHHMGPESYDI FT ICDKIAPDTPRARTYQQIVDILETFFNPRPLEISENFRFKCRRQGDKDAAS FT PDESVDEYLVALRRIAVTCNFGQYLETALRNQLVFGIRRNDIRSRLLERRE FT LTLQDARDIAVSMELSKKGGAEIEGNQGKLDVHKVQHQPGKRNKSKTKINE FT GEKRFNQKGAGGSDISCYRCGEKSHLANACKHKQTECKFCGLKGHLERVCM FT KKSGSKPSSARPSGKQNTVQTNFVDNSGNDCKDACAVVREVCSVNSISDNA FT KIWMKVRVNGVQLRFEVDTGSPVTIVSANCWKELFPAAQLRKCDTNLVSYC FT NTNIEVLGIMDARVEFDGQNSQLPLYVVKSEKHPLIGREWLSRLSVDWNIL FT LRNKSAVNEIRRGVSGSNAAASNDTAAAVKMVLQKFPRVFEDSIGKISRVQ FT ANLPLKKDVHPVFLKARKIPFNLQNVVEAELDKLVAEGVLTKVNQSNWATP FT IVPVKKSHNRVRICGDYKQTVNPNLVVDRHPLPTVDELFASLAGGKKFSKI FT DLVQAYLQMEVAPEDREILTLSARRGLYRPNRLMYGVASAPAIWQRQMESL FT LQGIEGVSVFLDDIKVTGPDDETHLRRLEEVLRRLDENGIRVNRDKCDFFA FT EKIEYCGYLIDEEGIHKIQKKMEAIQEMRRPKNKDEVRSFVGLINYYGRFF FT RDLSTVLYPLNNLLKNEVSFEWSTQCEKSFQAVKEQMQAETCLVHYSPELP FT LVLATDASPYGVGAVLSHIYPDGSERPIQFASQTLNRTQQAYLHVDKEAYA FT IIFGVKKFFQFLYGRRFVLVTDNQAVTKIFGEHKGLPVMSALRMQHYATYL FT QSFDYEIRFRKSASHANADALSRIPLKLADPDNVIDESDLVEMHQIETLPL FT TAAELAQATAEDSTVKKLIQGIKYGVLVEGKDRFGIDQSEFAMQKGCLLRG FT IRVYVPAPLRKRVLEELHSTHFGATRTKSLARGYCWWNGMDNAIEEMIANC FT VECQSVRREPAKMSLHCWETPTAPFQRVHVDFAGPFLDTYFFILVMHLRNG FT RRLECVVPSLLTVQFGCVVRFSAPLEYRQCSSAITVFSLRPSNFNSSCG" XX SQ Sequence 4316 BP; 1099 A; 1012 C; 1213 G; 992 T; 0 other; gttctggcga cgaggatggc gaatccggac gatgctgctg ctgctcttgc ggctcctgct 60 gctgctgctg ctgaacgtgc tgtgcctgct gctcctgctg tcctgctgct gcccgtgctg 120 tgcctgctgc tggtgttccg ccacccagtt ttgcgataga tcccttcgac aagcgaaagc 180 tgaagtggat gcggtgggtg caacgattgg aaaatgcgtt cgccatttac gcaatcgacg 240 acgaagagat gcggaaaaat ttgctgctcc accatatggg ccccgaaagc tacgacataa 300 tttgcgataa aatcgcaccg gacacgcccc gtgcgagaac atatcaacaa attgttgaca 360 tcctggagac gtttttcaat ccccgcccgc tcgagataag tgaaaacttt cgttttaagt 420 gccgtcgcca aggtgacaaa gatgcggctt cgccggatga gtccgtcgat gaatatttgg 480 ttgcattgcg acggattgcg gtaacctgca actttggcca gtacctggaa actgcgctcc 540 ggaaccagtt ggtgtttggc atcaggagga atgacatccg aagccggctg ctggaaagac 600 gagagctcac cttgcaggac gctcgcgaca tcgctgtgag catggagctg tcgaagaaag 660 gtggagcaga aatcgagggc aaccaaggta agctggatgt ccacaaagtg cagcaccaac 720 caggtaagcg gaacaaatcg aagaccaaaa taaacgaagg ggaaaagcgt ttcaatcaga 780 agggggcagg tgggagtgat atcagctgct accgttgtgg cgagaagtct cacctggcta 840 atgcttgcaa acacaagcaa accgaatgca agttttgcgg cctgaaggga caccttgaaa 900 gagtgtgcat gaaaaaatct ggctcgaagc ccagtagtgc gcgaccgtcg ggcaaacaaa 960 atacggtgca aacgaatttc gtggacaatt ccggaaatga ctgtaaagat gcgtgtgccg 1020 ttgtgcgtga agtgtgttcg gtaaactcca tatcggacaa tgcaaaaata tggatgaaag 1080 tgcgcgtgaa tggagtccag ttgcggtttg aggtagatac gggatcgccc gtcaccatcg 1140 tgagtgcgaa ctgttggaag gaactgttcc cggcagctca attgcggaag tgcgatacaa 1200 atctcgtgag ctattgcaat acgaacattg aagtgctcgg gataatggac gcgcgtgtcg 1260 aattcgatgg acaaaactcg caattaccgt tgtacgtcgt gaaatcggag aaacatccgt 1320 tgatcgggcg cgaatggttg agtcggttgt cggtcgactg gaacattttg ctgcggaaca 1380 aaagtgcggt taatgagata cggcgaggtg tttcgggaag caatgctgct gcttcgaatg 1440 atactgctgc tgctgtgaag atggtactgc aaaagtttcc gagggtattc gaggattcca 1500 tcggaaaaat ctcccgtgtt caagctaatt tgccgctgaa gaaagatgtc catccagtct 1560 tcctcaaggc acgcaaaatt ccgtttaatc ttcagaatgt ggtcgaagct gagttggaca 1620 aactagttgc tgaaggagtt ctcaccaagg tcaaccagag caactgggcg acgccgattg 1680 tgccggtgaa gaaatcccac aaccgggtgc gtatttgtgg cgattacaag cagactgtca 1740 accctaacct ggtggtggac aggcatccgc tccctacggt ggacgaattg ttcgcctcgc 1800 ttgcaggtgg gaagaagttt agcaaaattg acctggtgca agcgtacttg cagatggaag 1860 tcgctccaga agaccgcgaa attcttaccc tcagcgctcg tcgtggcctg taccggccga 1920 accgcctcat gtacggggtg gcatctgcgc ctgccatctg gcagagacaa atggagtcat 1980 tgctgcaggg aattgagggt gtcagtgtgt ttttggacga cattaaggtg acaggacccg 2040 atgacgagac ccacttgcgc cgattggagg aagtgctacg ccggctggac gagaacggta 2100 tccgggttaa ccgggacaag tgcgattttt tcgcggagaa gatagagtac tgcggatacc 2160 tcatcgatga agaggggatc cataaaatcc agaagaagat ggaggccatc caagagatgc 2220 ggaggccgaa aaacaaggac gaagtgcgct ccttcgtagg tcttatcaac tactacggta 2280 ggttcttccg ggacctaagc accgttcttt atcctctcaa caatctgctg aagaacgagg 2340 tgtcgtttga atggagcacg caatgtgaga agtcttttca agcggtgaag gagcaaatgc 2400 aagctgaaac ctgtctcgta cactattccc cggaattgcc tttggtgctg gctaccgatg 2460 cttcgcctta cggggtggga gccgttctga gtcacatcta cccggatggc tcggagcgtc 2520 ctatccaatt cgcctcccaa acgctgaatc gaacccagca ggcgtaccta cacgtggaca 2580 aagaggcgta tgcgataatt ttcggcgtca agaaattctt tcaattcttg tacggccgga 2640 gattcgtttt ggttacagac aatcaagcag tgaccaaaat tttcggggaa cataagggat 2700 tgcctgttat gtctgctctt aggatgcaac actacgcaac ctacctgcag tccttcgatt 2760 acgaaattcg atttcgtaag tcggcaagtc atgctaacgc cgacgcttta tcccgaattc 2820 cactgaagtt agcggatccc gacaacgtca ttgatgagtc ggacttagtg gaaatgcacc 2880 aaatcgagac acttcccctt actgctgcgg agttggccca ggcaactgcg gaagattcaa 2940 cggtgaagaa attgatccaa ggcatcaagt acggtgtact cgtcgaaggc aaggaccgat 3000 ttggaattga ccagagcgag tttgcgatgc aaaaaggttg cctgctgcgc ggaatccggg 3060 tgtatgtgcc tgcgccccta cggaagcgag tcctcgaaga gctgcactca acgcacttcg 3120 gagcaactag gaccaagtca ctggcaagag gttattgctg gtggaacgga atggacaacg 3180 ctattgaaga gatgatcgcc aattgtgtcg agtgccagtc ggtcagacgt gaaccggcga 3240 agatgtcgtt acattgctgg gaaaccccta ctgcgccgtt ccaaagggtc catgtggact 3300 ttgcgggacc attcttggac acatactttt tcattctggt gatgcattta cgaaatggcc 3360 ggagattaga gtgtgtagtt ccatcactgc tgacagtaca attcggatgt gtcgtgagat 3420 tttcagcacc tttggaatac cgtcagtgct cgtcagcgat cacggtgttc agtttacgtc 3480 cgagcaattt caacagttcc tgcggatgaa cggcatcgta cataagatgg gtgcgcccta 3540 ccatccagca acgaacgggc aagctgagcg gtatgtacag actatgaagc agaaactgaa 3600 gtcgctgaag tgtacgaagg cccagttgaa cgtcgagctc tgcaacatac tgctgaccta 3660 ccgaaagatg atacatcctg ccaccggtca atcacctgcg atgatgatgt ttggccggca 3720 gttaagatcg agaatcgacc tgatgttgcc gaagaacgaa gtcgttgatg cgaagaatta 3780 tacagtgcga gaattcaaag acggtgatcg tgtacgtgtc cgggactttt tatctgccga 3840 caagtggaag ttcggcagga ttgctgagaa ggttgggaaa cttcgttacg ctgtccgctt 3900 ggatgacggg cggtgctggg agcgtcacat tgaccatatc gttggtgtgg gcgcttgtct 3960 tccggatact gcatcgaaca atgctagaat cgagaatcgg gatcaccaca gtccgggagt 4020 tgctccatcg gttggagtag cgactcctga acgacctgac aatgcatcat cagctacgtc 4080 aagtgctccg gtctgtgctc cggagattca agctggtcga cgactagcgc cacctgatcc 4140 tgttccagaa ccagaggaag gtcctccatc tgccggggct acaccagttc cgacccaaga 4200 agctacacaa cccttgagac gttctaccag ggtagtgaag gctcccacaa ggttgaattt 4260 gtgatttttt tttttgcagc gaactctaag aactttcttt ttacaagggg gagaga 4316 // ID Kolobok-21_HMa repbase; DNA; INV; 2743 BP. XX AC . XX DT 21-JUN-2010 (Rel. 15.06, Created) DT 21-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE Kolobok-type DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-21_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2743 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 792-792 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 413..2158 FT /product="Kolobok-21_HMa_1p" FT /translation="MGRKDRVKQRTRVTGKKRVFCGNQHTLSSPTSMTNSK FT KYSDIDTSSSRSATKIEYIKNDIPNKNDEKINGYRIIDVEILNTIFESLCC FT PQCINDNSLVLKENFMKKKGYASNLICCCSVCGYSKDFYTSKVCNRTHDIN FT LRIVYSMRSIGQGYSGLEKFSALMNLPSPMSKKNYNGSVKVITDAVTTVAK FT ETMLEAAKEIKVNSNNIVDTGVSTDGTWQRRGYSSLNGVVTTLSMDNGKVL FT DVEPMSRLCKQCQLREDLKSKNSESYKNWYKSHSCNVNCIGSAGSMEVTGA FT KRIFSRSVKEYGLQYTKFYGDGDSKSYPAVKFTYPGVEVEKLECVGHVQKR FT VGTRLRTLKKNMKNLCGRGKLTNSVIDKLQNYYGIAVRSNKNNLQGMKKSI FT HATLFHVASSKENNWHTHCPIGENSWCRYQKDKATGKSTYKPGAGLPLSII FT KHLKPIYADLSEESLLRKCLHGQTQNQNESLNAMIWDRIPKTKYVSLIQLK FT FGTYDAVANFNIGRKSSLLIYKQLNMTPGIYTSSLCDNQNRKRIYLAGYKN FT LESSKKRRKILRGLSKASKDKYENFEGSQYAPGSF" XX SQ Sequence 2743 BP; 981 A; 384 C; 482 G; 896 T; 0 other; gggggaagta ccccataaat taagtaaaaa ttgaaatttt tttttttgat tgttttggaa 60 gcaaaaatta atgaagaaca caatgaaata aagatatttt gcaaacaatg tcaggaaagt 120 ggagtaattt caattttttg aaccatactc atcaagttta accatagcaa cgcacatagc 180 aacgataaag cttctgctta tctgcttatt tcagaggctt ataaaaaagg atttatctac 240 tttgtcttta ctttaattat ctactctgct taaattaaat ttgactcgat tgcctctaat 300 tatttcagtt tttaaacatt ccttaaatgc ttaatttact ttgtttatat gagttgtgaa 360 gtgtaataag tacaagtatt ttatcaattt taaataaaaa ttattcttaa ttatgggaag 420 aaaagataga gtcaagcaaa gaactagagt tactggaaag aaaagagtgt tttgtggtaa 480 ccaacatact ttatcatctc caacatcaat gacaaactca aaaaaatatt cagacattga 540 tactagttct tcaagatcag ccaccaagat tgaatatatt aaaaatgata ttccaaataa 600 aaatgatgaa aaaataaatg gttatagaat aattgatgtt gaaattttaa acactatatt 660 tgaatcttta tgttgtccac agtgtattaa cgataatagc ttggtattga aagaaaactt 720 tatgaaaaaa aaagggtatg cttcgaattt aatatgttgc tgcagtgttt gtggatatag 780 taaagatttc tatacatcta aagtttgtaa ccgcactcat gatattaatc ttcgtattgt 840 atatagcatg agatcaattg gacaagggta ttcaggactt gaaaagtttt ctgcattaat 900 gaacctccca agtccgatga gtaagaaaaa ttacaatggc tctgtaaagg ttattactga 960 tgctgttact acagttgcta aagaaacaat gcttgaggct gcaaaagaaa taaaagttaa 1020 tagcaacaac attgttgata ctggagtttc aactgatgga acttggcaac gtagaggata 1080 cagttcactg aatggagttg tgacaacatt gtcgatggat aatggaaaag tgttagatgt 1140 tgagccaatg agtagacttt gtaagcaatg tcaactacgt gaagatttaa aatcaaagaa 1200 ttcagagagc tacaaaaatt ggtacaaatc tcatagttgt aacgtaaatt gtattggctc 1260 agcaggaagt atggaagtga ctggtgccaa acgcatattt agtagatctg tcaaagaata 1320 cggattgcag tatactaaat tttatggaga tggtgacagt aaaagttatc cagctgtgaa 1380 atttacttat cctggtgttg aagttgaaaa gttagaatgt gttgggcatg ttcagaaacg 1440 agttgggaca cgtctaagaa cattaaaaaa aaacatgaaa aacctctgtg gccgtggaaa 1500 gttaacaaac agtgttattg ataagcttca gaattattat gggattgcgg tgaggagtaa 1560 taaaaacaat ttacaaggca tgaagaaatc cattcatgct acactttttc atgttgcctc 1620 atcaaaggaa aataactggc acactcactg tccaattggt gaaaacagtt ggtgcagata 1680 tcagaaagat aaagcaacag gcaaatcaac gtacaaacca ggtgcaggac ttccactatc 1740 aattataaaa catttaaaac caatatatgc agatttaagt gaggagtctt tgttacgtaa 1800 atgtttgcat gggcagacac aaaatcaaaa cgaaagtcta aatgcaatga tttgggatcg 1860 aattccaaaa acaaagtatg tcagcctaat acagttaaaa tttggaacat acgatgctgt 1920 agcaaacttt aatattggga gaaaaagttc tcttttaatt tataaacagt taaacatgac 1980 acctgggata tatacatcaa gtttgtgcga taatcaaaat cgaaaacgta tttatttagc 2040 tggttataaa aatctagagt catccaagaa gcggcgaaag attctacgag ggttgtcaaa 2100 agcatcaaag gacaaatatg aaaactttga aggaagtcag tatgcccctg ggtcattctg 2160 aaacattatt ttaaacataa tttagttata tttcatattt tttgtgaaat tttaaatttt 2220 ttttttctat ttttgcgttt ttctcaaaat aaggttttta aatgccccgg ctgtgataac 2280 tttggaaccg cttggtgttt aatgatgaaa ttttcagtaa ttatcctatt atataccttc 2340 tgtgatttga acctccactt ttatgaaata cttgacagaa cacgttctat gcctatttca 2400 gttgcctaat tttgacccaa attcattaaa cgataatata tttgccccct gaagaactaa 2460 ctgtcatatt ttttataaaa ttttaggttc agatcacagt tcaaagtatt agttaatggg 2520 aattgcagta tgacagtttg aatatatgta gttcttcgga taagttagcc ggggcattaa 2580 ctgttttttg gttatttttc caatattttc ttggttacca tggcaacaaa gcacaatttt 2640 taaaaatttt cattctgaaa tataaaattt ttacattata tactgtttaa aatcaatttt 2700 actgtttgaa catgagtttt aacatttttg gggtacttcc ccc 2743 // ID Chapaev-22_HM repbase; DNA; INV; 2803 BP. XX AC . XX DT 17-SEP-2009 (Rel. 14.1, Created) DT 17-SEP-2009 (Rel. 14.1, Last updated, Version 2) XX DE Chapaev-type DNA transposon: consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-22_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2803 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(10), 2143-2143 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 964..1752 FT /product="Chapaev-22_HM_1p" FT /translation="MLMKEIILILGSSRQRELFPPTSTLTAQQLLQVQNQT FT GLSNRGMNKLVSTLNQVTPHRLVEKNFREKFESLGKHLSDLFVTCDIMDTT FT KTDSHESKRLLVHCKDLIALRNTVVQIRGPQSNELIKIGIDGGGGFLKVSL FT GVIALEAQNASPPSKKPLFKDSGVKRQLLVAVSENLPEKYGNVKQILDKLC FT LERMSFALSCDMKLANIICGLQSHSSAHPCTWCDIESKNLHQSGELRTFGS FT IQNDYTGFQAAGGQKKEHGTF*" XX SQ Sequence 2803 BP; 902 A; 531 C; 522 G; 846 T; 2 other; agaaaatcct tttggggtat gatagtaggg atatatagga gtgcattaga agtgatttta 60 aaatctttcg tcatcgtccg tggaagaaaa atcgatttga agtttgagca tttttcgaag 120 tttttttgtg aagtcaaagc aagtaaaaag cttattttaa ggcaagaatt tgtatatttt 180 gtcataaagg ttgcaaatag agatattgtt tacattttta catgagttat atgcattgtt 240 tatttatttt aaaaatttcg ggaaagggct aaatgcaaaa gcatttgttt agaaaatttt 300 aggaagagca aaagtgcaat caacatttct aaaatgccta atcaagcaaa atgtcatgat 360 gaaaacagga aaactgtatg ttttctttgt ttaaataaat cctctagaca actcactgct 420 gctatgattg aaagctgcca caaagtcttt ggccgaccaa tcaactttga agaccaaaga 480 gttccagttg gagttcgcac tattcttaga agaaaagagg caggtcatga tgtccaactt 540 ccctctttgc ctgattatca aacaattaaa gttcggccaa caacaagaga atcagtttgt 600 gattgcttga tctgttgtgt tgggaaagca aggttcgaaa gcccagaacc ggtcgggcca 660 actaacccaa ccaatccgcc aaaagaactg ccagctgttg aaaaacgctg ttctgcatgc 720 ttttcactga ttggaagagg tattccccat gtctgcgcca aaggaatgct tagacaaaac 780 ttacttgaag ttgcaacaaa gacggaaaag ccgctgagag ggtggctgct acagtgattg 840 ccaacaagac acaatctcct catggcacng ttcgccttag tcaaggattg ggaagaaatc 900 ttccagtgac tccaggtata aatactctga gtaaaaatca atcttttagc attttttaaa 960 tgaatgttga tgaaagaaat aattttaatt ttaggatctt caaggcaaag agaacttttt 1020 ccaccaacat ccactttgac tgctcaacag ctcttacaag ttcaaaatca gactggctta 1080 tctaacagag gaatgaacaa gcttgtttca actctcaatc aagttactcc acatcgccta 1140 gttgagaaaa actttaggga aaagtttgaa tcacttggaa agcatctttc tgatttgttt 1200 gtcacctgcg acattatgga cacaaccaaa actgacagcc atgagtccaa gagactactt 1260 gtgcactgca aagatcttat tgctctgaga aacactgtcg tccaaattcg aggtccacag 1320 tcaaatgaac ttataaagat tggaatagat ggaggaggag gtttcttgaa agtttctcta 1380 ggagtaattg ccttagaagc gcagaatgcc tctccaccat ccaaaaaacc tcttttcaaa 1440 gattcaggtg tgaaacgcca attgttggta gctgtttcag aaaatctacc tgaaaagtat 1500 ggcaatgtga aacaaattct tgacaaattg tgccttgaaa ggatgtcttt tgcgttgtcc 1560 tgtgacatga aacttgctaa catcatctgt ggtttacagt ctcactccag tgcacaccct 1620 tgtacttggt gtgacattga atctaagaat cttcatcaga gtggtgagct ccgaactttt 1680 ggatctattc agaatgacta cacaggcttc caagctgcag gtggccaaaa aaaagagcac 1740 gggactttct aaatgttgtg catcagcctg ctatcaaatt gccagatgat acactcatct 1800 tagacttcat tccaccaatg gaacttcacc ttctacttgg cgttgtcaac cacctcttca 1860 agaatctctg tttgttatgg caaaatgcaa cagagtggcc aaagatgatg aacattcaac 1920 aacagccatt tcacggagga cagtttgctg gaaatgattg cagaaagctt ttgagaaaag 1980 ttgacatgct tcaacaactt gcggaggcaa attcttgttt cttggcattg ccattcattg 2040 atacattccg aaagtttgat gcagctgttc atgcctgttt tggaaataca cttcaacaga 2100 actactgcgn actaattgag gaattcaggg actcttacct caagcttcca aacacaagcg 2160 tcactcccaa agtacatgca gtcttctttc atgttcccca gtttattaac cggcataacc 2220 gatctttggg tctgtattca gaacaagcaa cagagtcgct gcatcacaac ttcaacaatc 2280 attggcaacg tttcaaaagg cctagcaacc acccagacta ccccaagaat cttcttagct 2340 gtttgatcga ttacaacagc aaacattcat tttgatgtca ttacttgaaa gaaagagaga 2400 gaattatatt ttcttatgtg tgctatatgt cagaatctat atataaatct agatctttct 2460 tgaattaaga tagtaattta ggtcattcag aaaaaatatt gaagtttttg atgaaagtca 2520 gactttcata gctaaaaatc aatttattgc tgtatttcct aaagaaattg taatgtttat 2580 gtttttgttt gtaaatattt tgttgcaaat agagtgtttt gatgcacaga atgaataaat 2640 aaatgtcact tttttttcta tgaaatttaa agtcattatt tagagcagta ataatagaaa 2700 actttgattc gatttttctt ccacggacga tgacgaaaga tttcaaaacc acttttaata 2760 cactcctata tatccccact atcatacccc aaaaggattt tct 2803 // ID Zator-3_AAe repbase; DNA; INV; 4089 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 23-DEC-2010 (Rel. 16.02, Last updated, Version -1) XX DE A Zator DNA transposon family from Aedes aegypti. XX KW Zator; DNA transposon; Transposable Element; Zator-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4089 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 656-656 (2011). XX DR [2] (Consensus) XX CC 3-bp TSDs. XX FH Key Location/Qualifiers FT CDS join(855..1761,1821..2312,2411..3912) FT /product="Zator-3_AAe_1p" FT /translation="MQRASITTFPDKKVENDGRDALYNKLVDYLQFRGAGF FT KVFQKSDMEFFMSSVVSILWRLDGQWAKLLGAANVSKPPENLHFKPVDQES FT FRVLSHGAHKKKTLPLLKKEDLIGDFEKLDELCTKSFLKSDGWRQVSADVM FT QLKASLADYIHHLDKADEQYKAASASIMDDNRSLKTVPPKQNRTSLVQLLY FT KALEDRLNSVPAYTEQNLIFFAPIERRAKYKYITGLVFDFPIQIYKTFKPN FT ASFVWKITQGDESSSAVIISRIERNIHERISTDKSERLSQAYGSVSTWTKE FT KVGEIVNAITEDEDLPPEQLQDSKEGFQFLLQEGFSVSEAVHMEQQAQAAK FT LNKFANFWAAVSKVLANNDYTVAEERRHGETTWISPLCVSMRDLMSKCETK FT MEEMFPGSNDNFIPSYEYFRLQFVPRNSHTQVSKRYYGRFDVKFGLQKRTL FT HKAHVDQHYGAKQFEFLKIMADDKASIPVGFENAPVSATRRQRHVLMAGLD FT GRGLNAMDHDNIPQHLIPSVAIKLYPPKELSESWYRGKPMIILKDAIFEAS FT SAFRHVADLIKNIEQPDTQKPIMFIGTDGGPDHNVTSIQVMLSYVALFLEL FT DLDFLCAVRTPPNFSVINPAERFMSTANIALIGVALARNDLGKNEKKVRSL FT LSKKQWRDAQEKHPETNYRQLAIDGTKDARSLLSARFESLSYKGEQVRMGC FT PATDDEIEACKRRVSTALGGIDFNAKIVKSQLLKENRVKDFFEKHVKATSY FT AFQVKKCADKSCKYHKKVRSSRTTFDQIVWLPTPAPDANMKYTSFDSVYGN FT EPSDEHVPSKIGTAITESVEKMXPKPTFPLAFTRARMIVTCTECXFPRLLY FT TKYALDKKEYTKLGQFFEDKLYVCGSHLESFPEVFQNTKXSCFDPVSLHYY FT QASTHLPGYKDLCAKCLDPTVTRTEGKLLFCAKCCSKSSQPINTPTTEPKA FT ERRPRGRPKKT*" XX SQ Sequence 4089 BP; 1263 A; 833 C; 867 G; 1121 T; 5 other; ggccgataca aattttattt ccactttttg tctccccccc ccttcataaa gcggaaaaaa 60 ctgcaggggg caaaaaaaaa atatttttca ttcgcgttga aaaacaccag tgggcatatg 120 aaaatttggc tcacgaatgt atctgagcca aagtatacat actcctcgca ctcactagaa 180 atcttatatg ggtattgtca aaaccgagcc agtatctatc tctcctattt actcgcatga 240 ttaaagtaaa caaggcaaca ttctatcgca tcaacgtcaa cgtatgaaag agcgagagag 300 ccacctcttt cgagtgcacg ctatgcaaac gaatttgaca tttccatatt tagctagcag 360 tgttgagttg tctgttgttt ttcacagaat cgcaaatttg tatttagagc aagtacagtt 420 tacctcataa tcaccagttc caccaaaatc atctgcagca gcaacctaca acaatggtat 480 gttttaagat ttgttgttaa aaactcttgt atagtgtctt tcttatcatc caggactcta 540 aaactcctat cgccgtgaga atcgagcgcg aaggaaaact tctcttccaa tccgtgacat 600 cagtagctgt tgaatcttct ttgctggaaa caattgagca ggtcttcgga ccgatagtaa 660 atgggacgga cgcggtcgaa gctatttacg ctggatgtaa atcattcgaa aggcatgatw 720 tgttcataat atctccctcg tgtactatcc gagatgtcgc cacagcaatt ggcacatgta 780 cgaaagttct gtgtatacta aaggctgaaa ttacttcggg aaaggagaag acaagaaatg 840 ctttcgatct gcttatgcag cgagcatcta ttacgacgtt tcctgataaa aaggtagaga 900 atgatgggcg tgacgcgtta tacaacaagc tggtggatta tttgcagttt cgaggagcag 960 gattcaaagt tttccagaag tcagacatgg aattcttcat gtcgtctgtg gtatccattc 1020 tgtggcgttt ggatggacag tgggctaaac tgctaggggc ggcaaatgta tcgaagcctc 1080 cagaaaatct ccatttcaaa ccagttgacc aagagtcttt tcgtgtgttg tcccatggcg 1140 cgcacaaaaa aaagaccctt ccactattga agaaagaaga cttgattgga gatttcgaga 1200 agctggacga actatgtacc aaatcgtttc ttaaatcgga tggatggaga caagtgtcag 1260 cagatgtaat gcaactgaaa gcttcgttgg cggactacat acatcacctg gacaaggctg 1320 atgagcaata taaggcagca tcggcatcta ttatggacga taaccgttca ttgaagacag 1380 ttccgccgaa acaaaatcgt acaagcctag tacaattgct gtataaagct cttgaagacc 1440 gattgaattc cgtcccggcc tacaccgagc aaaatctgat cttttttgct ccaattgaaa 1500 gacgggccaa atataaatat ataactgggc tggttttcga cttccctata caaatttata 1560 agactttcaa accgaatgca tcatttgtgt ggaagatcac ccagggagat gaatcctctt 1620 cagccgtgat catatcacgt atcgaacgaa atatacacga gcgaatttcc accgacaaat 1680 cagaaagatt gtcgcaggcc tatggaagtg tttctacttg gacaaaagaa aaagttggtg 1740 aaattgtaaa tgccatcact ggtaagtttt ttcatttatc gtttttattt tcaaattcca 1800 gttgtgtatt tatatttcag aagatgaaga tctccctcca gaacaacttc aggacagcaa 1860 agaaggattt caatttttgt tgcaggaagg attttctgtt tcagaagcag tgcatatgga 1920 acaacaagcg caggccgcaa aactgaacaa gtttgccaac ttctgggcag cagtttcgaa 1980 ggtccttgcc aacaacgatt atactgtcgc tgaagaaagg cgacatggtg aaacgacttg 2040 gatatcacca ctgtgcgttt cgatgcgtga tttgatgtcc aagtgcgaaa ctaaaatgga 2100 agaaatgttc ccgggatcaa acgacaattt cataccatca tacgaatatt ttcgtctgca 2160 attcgtccca agaaacagcc acactcaggt atcgaaacgc tattatggac gctttgatgt 2220 gaaattcggt cttcaaaaac gtacgttaca taaggcacat gtagaccagc actacggagc 2280 aaaacagttt gaattcttga agatcatggc cggtaagact ttcaaaaagt ttgttcctac 2340 ttgtgtgaaa ctgatttgct ttttattctt taggaaaatt ttctgacgat tccatcgttt 2400 tttttcctag acgataaagc gtctatccct gttggattcg aaaatgcacc agtaagtgca 2460 actcgaaggc agcggcatgt cttaatggca ggacttgacg gaagagggct taatgccatg 2520 gatcacgata atatccccca gcatttgatt ccatcggttg cgataaaact gtatcctcct 2580 aaagagctgt ctgagtcgtg gtatagaggt aaaccgatga ttatactgaa ggacgcaata 2640 ttcgaggcat cttcggcatt caggcatgta gcggatttga tcaaaaacat tgagcaaccc 2700 gatacacaga agccgataat gtttattgga actgatggag gacccgatca caacgttaca 2760 tcgattcaag ttatgctaag ttatgtggct ttgtttttgg aactggatct cgatttttta 2820 tgtgcggttc gcactcctcc aaacttttcg gtgataaatc ccgcagaaag attcatgagt 2880 actgcaaaca ttgccctgat tggagtagct ctcgcaagaa atgatcttgg taaaaatgaa 2940 aagaaagttc gctcgttgct ttctaagaag caatggcgcg atgctcaaga aaagcatcca 3000 gagacaaact atcggcagtt ggcgattgat ggaacgaaag acgcccgcag tcttttgagt 3060 gctcgattcg aaagcctttc ttacaaagga gagcaagtga gaatgggctg ccctgccaca 3120 gatgatgaga ttgaagcgtg caaacgtaga gtttccactg cattgggtgg aattgatttc 3180 aacgccaaga ttgtcaaatc tcaactactg aaagaaaatc gtgttaaaga tttctttgaa 3240 aagcacgtca aggcaacatc ttatgcgttc caggttaaga aatgtgctga caaatcttgt 3300 aagtaccaca agaaagtacg atcgtcgagg actacgtttg atcagattgt gtggcttccg 3360 acaccagcgc ctgatgcgaa tatgaagtac acaagcttcg attctgtata cggcaacgag 3420 ccttcagatg aacacgttcc aagcaaaata ggaaccgcga ttactgaaag cgtggagaaa 3480 atgmtgccca aaccaacatt ccctttagcc ttcacgaggg ctcgtatgat mgttacttgc 3540 actgagtgcw atttcccgag actgttgtat accaaatatg ctttggataa gaaggaatat 3600 actaagttgg gccaattttt tgaagataaa ttgtatgttt gcggttctca tctggaatca 3660 tttccggaag tttttcagaa cactaaastg tcctgtttcg atcctgtcag tcttcattat 3720 taccaagcta gcacacattt gccaggttac aaagacttgt gtgctaaatg cctggatcca 3780 acagtaacaa gaacagaagg aaagctgctc ttctgcgcaa aatgttgttc aaaatcaagt 3840 cagccgataa atactccaac tactgagcca aaagctgagc gacgaccgcg aggcaggcca 3900 aagaaaacat agatacattg aaaaccgtta aaagactcga tggatataca atttttttat 3960 caaaaggaat ttcaataaat aaaaataaat attttcgaaa aataaatgtt tattattttt 4020 ttgccccccc ccccctcgat taaaattggt ccgatgggac aaaatgtcaa aaataaattt 4080 gtatcggcc 4089 // ID Rehavkus-1_CS repbase; DNA; INV; 16648 BP. XX AC AACT01024047; XX DT 30-APR-2006 (Rel. 11.04, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed copy of the Rehavkus-1_CS DNA DE transposon - a fossilized copy. XX KW MuDR; DNA transposon; Transposable Element; Rehavkus group; KW Interspersed repeat; Rehavkus-1_CS. XX NM Rehavkus-1_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-16648 RA Kapitonov V.V., Gentles A.J. and Jurka J.; RT "Rehavkus-1_CS, a family of Rehavkus DNA transposons from the sea RT squirt Ciona savignyi genome."; RL Repbase Reports 6(4), 190-190 (2006). XX DR EMBL/GenBank/DDBJ; AACT01024047; Positions 21979 5332. XX CC Rehavkus-1_CS belongs to the Rehavkus group of MuDR superfamily CC of "cut and paste" DNA transposons. Members of this group are CC widespread in different metazoa, including insects, sea squirts, CC sea urchin and fish. The genome harbors two copies of this CC transposon. Its ~5-kb inverted termini are composed of a 406-bp CC terminal inverted repeat and 190-bp subterminal CC minisatellite-like unit. The transposon is flanked by a 9-bp CC target site duplication and encodes an incomplete 325-aa CC Rehavkus-1_CS transposase. XX FH Key Location/Qualifiers FT CDS 6388..7362 FT /product="Rehavkus-1_CSp" FT /translation="SLGTNIALLHLNTHYVCDHIRKKRSRIFYCKGYCTFS FT NCPVTFEISIPDLSLQAEINMKGEVRHLQHEKRARYVQGNDREALRKLLEF FT KNPRREFLDGIANLEEDTYQSGNRDGVHSMGVLKQVAHEGKKIERCDMNQW FT LALEKLKTRDGDKSLIRQIGLDPPYIMLYSSEAMRIFYKKCNHDIVYIDAT FT GSLMIGKEKKKFYIYELVVRNIVGKTPFAVASFITARHTIPSITNFLLTFS FT NDVQRASKRKRFPKLVMCDGSVALMKSVCLGLFECSLNAYTDKCLDIAQGN FT HKEMFHQNGFSALLCKSPHVQCKEIVQENVRFY" XX SQ Sequence 16648 BP; 5397 A; 3049 C; 2980 G; 5222 T; 0 other; acctaggtca agcaagcagg caattctcag caggcgattc gaaaagtcaa gcaggcaggc 60 aatagccgat agcagggatc gaaatgaggc taccagtgtt tccaaataaa aaattataag 120 tattttgtga aaattcgcga cgacgtcatt ttaattacag tataattaca gtataggatt 180 cgtgtagggg aaatggtttt aaatttacga aaagcagtga aaataataaa gatgtgacgt 240 caatagtggt ctgattcgaa gctggtcgct aactcgacgg ttgtatactt gggcaaggaa 300 cattacggat attgcctaaa cccggcgcaa taatgggtta gggtaatata atattggagg 360 caaggtgatc tgatgattgc ccttatgtta accccctagg tttattacca atatgtgggg 420 gcaaatcgtg taaatttacg aaaaccagcg aacagtaata catatatgac gtcattttgc 480 atatactgtg tatgttcaat tagtcgtaag tcctcttata cttccgactg aattcacttt 540 gaggtatgta tgagcatcta catttcaccc tctaaacgat ggtgcaaata gatattacca 600 aaatgtgggg gcaaatggtt taaattaacg aaaaccagcg aacagtaata catatatgac 660 gtcattttgc atataccgtg tatgttcaat tagtcgtaag tcctcttata cttccgactg 720 aattcacttt gaggtatgta tgaacatcta cctttcacac tctaaacgat ggtaagttca 780 gtttttacca acatgcgggg gcaaatggtt taaattaacg aaaaccagcg aaagtaaaaa 840 aaagacgtta ggtctttttg gatataccgt gtatgttcat aagtcctaac tcatcttata 900 ctttcgactg aattcacttt gaggtatgtc tgaacatcta cagttcaccc tctaaacgat 960 ggtgcaaata gatattacca atatgtgggg gcaaatggtt taaatttacg aaaaccagcg 1020 aaaagtaata catatatgac gtcattttgc atataccgtg tatgttcaat tagtcgtaag 1080 tcctcttata cttccgactg aattcacttt gaggtatgta tgaacatcta catttcaccc 1140 tctaaatgat ggtgtaaata gatattacca aaatgtgggg gcaaatggtt taaatgaacg 1200 aaaaccagcg aacagtaata catatatgac gtcattttgc atataccgtg tatgttcaat 1260 tagtcgtaag tcctcttata cttccgactg aattcacttt gaggtatgta ttaacatcta 1320 catttcaccc tctaaacgat ggtaagttca gtttttacca acatgtgggg gcaaatggtt 1380 taaattaacg aaaaccagcg aaaataaaaa aatacgttag gtctttttgg atataccgtg 1440 tatgttcaag aagtcctaac tcctcttata ctttcgactg aattcacttt gaggtatgta 1500 tgaacatcta cctttcaccc tctaaacgat agcaagttca gtttttacca acatatgggg 1560 gcaaatggtt taaattaacg gaaaccagcg aaaataaaaa agaagttagg tctttttgga 1620 tatacagtgt atgttcaaga agtcctaact cctcttatac tttcgactga attcactttg 1680 aggtatgtat gaacatttac agtttaccct ctaaacgatg gtgtaaatag atattaccaa 1740 tatgtggggg caaatggttt aaatttacga aaaccagcga aaagtaatac atatatgacg 1800 tcattttgca tataccgtgt atgttcaatt agtcgtaagt cctcttaaac ttccgactga 1860 attcactttg aggtatgtat gatcatctac atttcaccct ctaaacgatg gtgcaaatag 1920 atattaccaa aatgtagggg caaatggttt aaattaacga aaaccagcga acagtaatac 1980 atatatgacg tcattttgca tataccgtgt atgttcaatt agtcgtaagt cctcttatac 2040 ttccgactga attcactttg aggtatgtat gagcatctac atttcaccct ctaaacgatg 2100 gtgcaaatag atattaccaa aatgtggggg caaatggttt aaatgaacga aaaccagcga 2160 acagtaatac atatatgacg tcattttgca tataccgtgt atgttcaatt agtcgtaagt 2220 cctcttatac ttccgactga attcactttg aggtatgtat aaacatctac atttcaccct 2280 ctaaacgatg gtaagttcag tttttaccaa catgtggggg caaatggttt aaattaacga 2340 aaaccagcga aaataaaaaa agacattagg tctttttgga tataccgtgt atgttcaaga 2400 agtcctaact cctcttatac tttcgactga attcactttg aggtatgtat gaacatctac 2460 ctttcacact ctaaacgatg gtaagttcag tttttaccaa catgtggggg caaatggttt 2520 aaattaacaa aaaccagcga aaataaaaaa agacgttagg tctttttgga tataccgtgt 2580 atgttcaatt agtcgtaagt cctcttatac ttccgactga attcactttg aggtatgtac 2640 taacatctac agttcaccct ctaaacgatg gtgtaaattt attttaccaa tatgtggggg 2700 caaatggttt aaatttacga aaaccagcga acagtaatgc atatatgacg tcattttgca 2760 tataccgtgt atgttcaatt agtcgtaagt cctcttatac ttccgactga attcactttg 2820 atgtatgtat gaacatctac agttcaccct ctaaacgatg gtgcaaatag atattaccaa 2880 tatgtggggg caaatggttt aaatttacga aaaccagcga acagtaatac atatatgacg 2940 tcattttgca tataccgtgt atgttcaatt agtcgtaagt cctcttatac ttccgacgtg 3000 aattcacttt gatgtatgta tgaacatcta cagttcaccc tctaaacgat ggtgcaaata 3060 tatattacca ttatgggggg gcaaatggtt taaatttacg aaaaccagcg aacagtaata 3120 catatatgac gtcattttgc atataccgtg tatgttcaat tagtcgtaag tcctcttata 3180 cttccgactg aattcacttt gaggtatgta tgaacatcta cctttcacac tctaaacgat 3240 ggtaagttca gtttttacca acatgcgggg gcaaatggtt taaattaacg aaaaccagcg 3300 aaagtaaaaa aaaagacgtt aggtcttttt ggatataccg tgtatgttca taagtcctaa 3360 cttatcttat actttcgact gaatttactt tgaggtatgt ctgaacatct acagttcacc 3420 ctctaaacga tggtgcaaat agatattacc attatgttgg ggcaaatggt ttaaatttac 3480 gaaaaccagc gaacagtaat acatatatga cgtcattttg catataccgt gtatgttcaa 3540 ttagtcgtaa gtcctcttat acttccgact gaattcactt tgaggtatgt atgagcatct 3600 acattttatt ctctaaacga tggtatgttc agtttttacc atcatgtggg ggcaaatggt 3660 ttaaattaac gaaaaccagc gaaaataaaa aagacgttag gtctttttgg atataccgtg 3720 tatgttcaag aagtcctaac tactcttata ctttcgactg aattcacttt gaggtatgtg 3780 tgaacatcta cagtttaccc tctaaacgat ggtgcaaata gatattacca atatgtgggg 3840 gcaaatggtt taaatttacg aaaaccagcg aacagtaata catatatgac gtcattttgc 3900 atataccgtg tatgttcaat tagtcgtaag tcctcttata cttccgactg aattcacttt 3960 gaggtatgta tgagcatcta cattttattc tctaaacgat ggtaaattca gtttttacca 4020 acatgtgggg gcaaatggtt taaattaaca aaaaccagcg aaaataaaaa aagacgttag 4080 gtctttttgg atataccgtg tatgttcaat tagtcgtaag tcctcttata cttccgactg 4140 aattcacttt gaggtatgta tgaacatcta cagttcaccc tctaaacgat ggtgtaaatt 4200 tattttacca atatgtgggg gcaaatggtt taaatttacg aaaaccagcg aacagtaata 4260 catatatgac gtcattttgc atataccgtg tatgttcaat tagtcgtaac tcctcttaaa 4320 cttcggactg aattcacttt gacgtatgta agaatatcta tattttatat tttatttaat 4380 agtgcaattc gatatttaat gtattggtcg tagctttttt ttgtacaatg tctatgtttt 4440 gcctgagtcg ttataatttt acgttttctt accatgtggt ctagtacaat ttgaaaacga 4500 tttaatatct gaagtaatcg taaaatatgt ttgatatttg ttctttaatt attaatagtt 4560 ctgacctaat tgcaaaattc ttaactttaa ccttttgtat atagtttatt gttggtttaa 4620 ggattttaaa tgcaaaacga gtttttaaaa ctacacctaa taatttgaaa gtacagtctt 4680 aatttaacgc aaacatattt gtatataaca ggttatcaat atatagcata ttatgggtag 4740 acatgggtca ggcaacaatt tttcaaattg tttgaggaaa gtcaaaaacg aaagaaaaaa 4800 acgcagcatt aaagaaataa acaagattgc aaggcgccag gcctatcatt gcaacaaagt 4860 tatttcagta caggaaaagg gtccatttgc aaaggtttca aggtggctga gttcgtgccc 4920 tgaaacatct tctttctgtc tcagtagtcc gtcaatatat aatattgatg atgtacacca 4980 ccgctccgtt agtgataaca gtttatcaac ctctgaagag atatcagatc aagatcgaaa 5040 ccaatccccc acttattcac acaaggattc agttaaatca gatcaaactc gagaccaatc 5100 ccccacttat tcattcaagg attcagttga atcagatcaa aatcgagacc aatcccccac 5160 ttattcaatc aaggattcag ttgaatcaga tcaacatcga gaccaatccc ccacttattc 5220 aatcaaggat tcagttgaat cagatcaaaa tcgagaccaa tcccccactt attcaatcaa 5280 ggattcagtt gaatcagatc aaaatcgaga ccaatccccc acttattcaa tcaaggattc 5340 agttgaatca gatcaacatc gagaccaatc ccccacttat tcaatcaagg attcagttga 5400 atcagatcaa tatcgagacc aatcccccac ttattcaatc aaggattcag ttgaatcaga 5460 tcaaaatcga gaccaatccc ccacttattc aatcaaggat tcagttaaat catatcaaca 5520 tcgagaccaa tcccccactt attcaatcaa ggattcagtt aaatcagatc aaaatcgaga 5580 ccaatccccc acttattcaa tcaaggattc agttaaatca gatcaaaatc gagaccaatc 5640 ccccacttat tcacacaagg attcagttga atcagatcaa aatcgagacc aatcccccac 5700 ttattcacac aaggattcag ttgaatcaga tcaaaatcga gaccaatccc ccacttattt 5760 aatcaaggat tcagttgaat cagatcaaca tcgagaccaa tccccccccc cacttattca 5820 atcaaggatt cagttgaatc agatcaacat cgagaccaat cccccactta ttcaatcaag 5880 gattcagttg aatcagatca aaatcgagac caatccccca cttatttaat caaggattca 5940 gttgaatcag atcaacatcg agaccaatcc accacttatt cacacaagga ttcagttaaa 6000 tcagatcaaa atcgagacca atcccccact tattcaatca aggattcagt taaatcagat 6060 caacatcgag accaatccac cacttattca atcaaggatt cagttaaatc agatcaaaat 6120 cgagaccaat cccccactta cgcaataaaa aacccggtta tatcagatca atttagttgt 6180 gccagcttag ccgcctcaaa caacaaatcg cctgtaatat ctgatacact gatccctaat 6240 ttagacagaa gcatttggac caaatttacc aaaaagacat actctgtgtt tatagataga 6300 gagaattgga accacttaaa aaaaaaccac actgggggcg ttgttttacc aacccagatt 6360 ggagaaggat aatggtcgat gcattaaagc ttgggaacaa atattgctct tttgcattta 6420 aatacccact atgtttgtga tcacataagg aaaaaaagaa gcagaatatt ttattgcaaa 6480 ggttattgca cgttttcgaa ttgtccggtg acctttgaaa tctcgattcc agacttatct 6540 ctccaagctg aaattaatat gaagggagaa gttcgtcatc tgcaacacga gaaaagagca 6600 cgttatgttc agggaaacga cagggaggct ctgagaaagt tactcgagtt caaaaatcca 6660 agaagagaat ttcttgatgg tatcgcaaat ctggaggagg acacgtatca gagtgggaat 6720 cgagacggcg ttcattcaat gggtgtgtta aaacaagtgg cacacgaagg taaaaaaatc 6780 gaaaggtgtg atatgaatca gtggctggct ttggaaaagt taaaaacacg agacggggat 6840 aaatctctga ttcgtcaaat cgggttggat ccaccttaca taatgcttta ttcatcggag 6900 gctatgcgta tcttctataa aaagtgcaac cacgacatcg tatacatcga tgcaacaggg 6960 agtttaatga ttggaaaaga aaagaaaaag ttttacattt atgaattggt ggtgcgcaac 7020 atagtcggaa aaacaccgtt tgccgttgca tcttttatta cagcacgaca cacaatacca 7080 tcaatcacga attttttgct gaccttttca aatgatgtgc aaagagctag caagcgtaaa 7140 agatttccaa aattggttat gtgtgacgga agcgtagcct taatgaaatc tgtttgtttg 7200 ggtttgtttg aatgctccct aaatgcttac accgacaagt gtttagatat tgcgcaagga 7260 aatcataaag agatgttcca ccaaaacggg ttttctgcat tgctgtgcaa gtcaccacat 7320 gtgcaatgca aagaaattgt gcaagaaaac gtaagatttt attaaatact tattaatgat 7380 gacctctata atgacccttt attattttta gttacatgag atataacaat tttgaactaa 7440 aattgatagt gtcgcataat ttaatatcat gtagcctaaa aaaattgtcc tgaaaatttg 7500 tcacattgtt ttatttcaga tgtccaaaat tttacaagtt tgcaatgtat gtgattggag 7560 cacttttcac atgcgacaca atggccaagg ttgatgatgt attcattaat ttatgcatct 7620 tgtgctgttc ggagagtgcc attgatataa ctaatagaag ttataaatac ctctgccaaa 7680 tcatcaacag ggatgtggaa gttgctcctg atcatgttga aactgcggaa aatattgagg 7740 tacgtaacca ctgctacgca cgcttatata catgcatgca aatggtaggt ctagctgtgt 7800 gttttggagc ttttcattaa gaataccaaa caaagttcat ttaattagca tgcatagact 7860 ttgacatcct tgagcactaa ctttacgatt aatgtccata ctactacaac caatatatag 7920 ttttgttgat cgtaatttcc tttttgtttt caaaggaaac aacggaggtg gcttcttcga 7980 ggctgagatc ttatttcagt tcggttcggg attccgctat gcggcaaatc tatgcggaaa 8040 aagcgatttc ccataacgcc gtggtaaatc gcttttatgc cccgatgttc attgattcac 8100 ttgtcaagta ttcaatgcca agtgttctcc tgtggtcttc aattatgctt ggtaaggaaa 8160 atcacattac gttatatttc cattcaacca tttttatgca gtgcagcatg cataaaaagg 8220 gtaaacgtat gcaatacacc aacctcgtta attatttcgt attattctat gtttcaaggc 8280 gatttgggta gacatggcat tggagagagc tacagcgaat attcaagaat gtgggaatcc 8340 ctgctaagaa taaaaaacac gcaggtatta tattatattt tgtacatttt gttagaataa 8400 atgtattaga aaatacattc aaacatgatt gtgcagacaa ttgctgttaa caatcgtacc 8460 caaggagtaa tggagaagtc acagcaggag ctcaaacgca ctcgcctgcg gtcgaaacgg 8520 tacaaacata tcgacgaact tgttaccgtc tttcaaacag accacattgg gcttcttaga 8580 gaatacacag ataactgtgc gcccaaactg acaaaggtgt gctgttttta tttaataaaa 8640 tgttttgttt catttaaata ggtcaaaata tattgatatg agagaaaatt cacgtgtatc 8700 cgcgtgaatg ggaaaaccag gggattacca catgggcaat cgtcaggtca tctcgcgcaa 8760 accgcagggt taccataact attgtagtgt gatgtgtatt attcacacgt cttcaattta 8820 gtaaaagcag gattcaggca atatgtgtta agtgccttga ccaaggacac aactaaattg 8880 ctatcgagca gtttcgaagc tgcaaccctt tcggtttaat ggcgacgcac ttatccacca 8940 gactaatcag taggttttac ctctcctatt tacagaccaa catttcgaag ccgccaaaaa 9000 tgggagttta tcttaaaact ccaacaattg ctgacaaaaa gaggttagca ttaaacagca 9060 attaatgtcc gttgtaaaaa gaaatttact atagcaaatc aatttacaca gaattcagtt 9120 gtgcattccc agcacaaaca ataaggttaa aaaggaaaaa tcattggacc tggaaaaagg 9180 agatttggat gaaaagaaaa ccaaaacctg ttgttgttga ccctttgttg ggtcagtttc 9240 tacagtcgcc tcctaaaacg ttttcaatag aaaagaaaag tatgccatcc agaaaaagaa 9300 aatgaagggc taaaggtgag gcttggatgt tctgaaaaga atcgtatttt atttgtttat 9360 ttacatctct atacaaagaa aattaaaact atatgtaaaa ttttatgcca aacactgtcc 9420 taactgatag gttaggtggt taggtaagta ggatattgat gtttatccct gtaaatggga 9480 aaaccaaggg ggttaacaca caggcagtcg tcatatcacc tcgcgcccac cgttgggtta 9540 ccggagctat tgcagtgtgt tgtacatgat tcacgcgttc ctttaatttg gtagaaccca 9600 ttaattttat cggttcaggg aatatttatt aagtgtcttg cctaaggaca caaccgacta 9660 gcaagcgacc agtttcgaat cttaaacctt tcggttgaaa ggcgtcgcac taaaccacca 9720 gactaaccga taggaaaatg aggttacctc aaccgacccc aaaaccttct tgccataact 9780 ttgaagtttt aaatattatg ttttacagga cgaacgcatt tgcaaattgt tcgacctaaa 9840 aaacaaaaac actgttgttg cttcaataaa cggcggccag cataggttac ttgccgattc 9900 tttttcctct cttcaaaatc ggaaatggct ccgacgggcg aaagtacgtt tattagttta 9960 gtaatatgtt acagaaactt agttgcagcg tgtttaagaa aaacaggaaa ccgtgtcaat 10020 agggtgttta actttttggt agcttttgtc taaatttata ttttctggta ggttatcacg 10080 tattatttga gcttgatgtg catcagtact gcatttgtga tggactcatt cacagccaca 10140 gctatagtgg atggaaatct agcggttgcc agacgccatt tttacaaaaa tgtaagaaac 10200 tatggattaa cagtcgcaat aaatcattat tagttgtctt gtttttactg catgctgctt 10260 ttttaagtag attttgtcat ataaatcatt atattacgcg taggtcaatt ttgatgctta 10320 ctcgtttgtt ttgggagcag ttcacgacat caagttgaat cactggaaac taatggtagg 10380 atataaaata acattatatt ctaaattgtg tagcaatcta ttttttgttt atatcataaa 10440 ctatattttg gtttagtttg tgaacaacaa tactaatgag ttctgcatat ttgactctct 10500 gggcagttca tcgagttcaa gcaaaatgga gttaacacac tggaggtttg ccttgttgat 10560 ccatattagc ttttataaat attttaacta gtttttgact ttcctaatag agaatacaat 10620 attgcccggt ccgcatttgt cgccacttca aaagaaagaa cttggacaag accgtcattt 10680 cgtcattcag tacaagttga cggccatagc tgtggagtat tcgtcattaa ggttgcctat 10740 ggacgatata acttatatta tgtttaatag cactaatttg caagtgaact atactttata 10800 acatcttgta attaaataag ctatttatta tatatcgata ttcgaataag actgggtgct 10860 caaatattta taatttcaga tggcgatgca actgatacaa aataagtctc tgttctttga 10920 tgcatcagag accagtatgg aaaacgaaag aatgtgcata gccaaaaata tcgtggaatt 10980 atcaggtata tattttgatt taaactgaaa atgattataa aaacactaaa taactgctac 11040 aaattttaga cgatatgaaa gagttgtgta gtttttgtgg caaagacaaa aatttcaatg 11100 acgacgaatg ggtgagttta tgtttgttta tgacatttta caatgtgata tttagaattc 11160 gttgtcaata cattttgtgc aggttggatg tgacaacgat gaatgtggaa ggtggtttca 11220 catgcgatgc atgaacatga cgagaaaggc atttgaagat gccaaacttt cactgtggac 11280 atgtccggca tgttcctaat gtcacgtttt ttaatgatag ttagttttta tcaaaatgta 11340 acgtcaaata tagacgtctc tctaaagtaa aaaattatac aaaattacct ctttttttca 11400 tgttcatttt catttcattt tcagcctttc aagaataaaa ttgtatactg catacatact 11460 tcaaaattaa ttaagtcaca agtataagga aagtgttggt aaaatctgat tgtaacatcg 11520 ttaagagggt aaaataaaga tgttcataca tacctcaaag tgagttcagt cggaagtata 11580 agagaagtta cgactaattg aacatacacg gtatttgcta aatgatgtca tatatgtatt 11640 actgttcgct ggttttcgtt aattcaaacc atttgccccc acatattggt aatatctatt 11700 agcaccatcg cttagagggt gaactgtaga tgttcataca tacctcaaag tgaattcagt 11760 cgaaagtata agaggagtta ggacttcttg aacatacacg gtatatccaa aaagacctaa 11820 tgtctttttt tatattcgct ggttttcgtt ataatttaaa ccatttgccc ccacatgttg 11880 gtaaaaactg aacgtaccat cgtttagagt gtgaaatgta gatgttcata catacctcaa 11940 agtgaattca gtcgaaagta taagagcagt taggacttct tgaacataca cggtatatcc 12000 aaaaagacct aacgtctttt ttattttcgc tggttttcgt tcatttaaac catttgcccc 12060 cacattttgg taatatctat atgcaccatc gtttagaggg tgaaatgtag atgttcatac 12120 atacctcaaa gtgaattcag tcggaagtat aagaggactt acgactaatt gaacatacac 12180 ggtatatgca aaatgacgtc atatatgtat tactttttgc tggttttcgt tcatttaaac 12240 catttgcccc cacatgttgg taatatctat ttgcaccatc gtttagagtg tgaaaggtag 12300 atgttcatac atacctcaaa gtgaatttag tcgaaagtat aagaggagtt aggacttctt 12360 gaacatacac ggtatatcca aaaagaccta acgtcttttt ttattttcgc tggttttcgt 12420 taatttaaac catttgcccc cacatgttgg taaaaactga acgtaccttc gtttagaggg 12480 taaaatgtag attttcatac ataccgcaaa gtgaattcag tcgaaagtat aagaggagtt 12540 aggacttctt gaacatacac ggtatatcca aaaagaccta acgtcttttt ttattttcgc 12600 tggttttcgt tcatttaaac caattgcccc caaatattgg taaaaactga acttaccatc 12660 gtttagaagg tgaaatgtag atgctcatac atacctcaaa gtgaattcag tcggaagtat 12720 aagaggactt acgactaatt gaacatacac ggtatatcca aaaagacata acgtcttttt 12780 tattttcgct ggttttcgtt catttaaacc atttgccccc atatattggt aatatccatt 12840 tgcaccatcg tttagagggt aacctgtaga tgttcataca tacctcaaag tgaattcagt 12900 cggaagtata agaggactta cgactaattg aacatacacg gtatatccaa aaagacctaa 12960 cgtctttttt actttcgctg gttttcgttc atttaaacca tttgcccaca catgatggta 13020 aaaactgaac ttaccatcgt ttagagtgtg aaatgtagat gctcatacat acctcaaagt 13080 gaattcagtc ggaagtataa gaggacttac gactaattga acatacacgg tatatccaaa 13140 aagacctaac gtctttttta ttttcgctgg ttttcgttca tttaaaccat ttgcccccaa 13200 atgttggtaa aaactgaact taccatcgtt tagagtgtga aaggtagatg ttcatacata 13260 cctcaaagtg aattcagtcg aaagtataag aggagttagg acttcttgaa catacacggt 13320 atatgcaaaa tgacgtcata tatgtattac tgttcgctgg ttttcgtaaa tttaaaccat 13380 ttgcccccac atattggtaa tatctatttg caccatcgtt tagagggtaa actgtagatg 13440 ttcacacata cctcaaagtg aattcagtcg aaagtataag agtagttagg acttcttgaa 13500 catacacggt atatccaaaa agacctaacg tcttttttta ttttcgctgg ttttcgttca 13560 tttaaaccat ttgcccccac atgatggtaa aaactgaact taccatcgtt tagaaggtga 13620 catctagatg ctcatacata cctcaaagtg aattcagtcg gaagtataag aggacttacg 13680 actaattgaa catacacggt atatgcaaaa tgacgtcata tgtgtattac tgttcgctgg 13740 ttttcgtaaa tttaaaccat ttgcccccac atattgttaa tatctattta caccatcgtt 13800 tagagggtga aatgtagatg ttcatacata cctcaaagtg aattcagtcg aaagtataag 13860 aggagttagg acttcttgaa catacacggt atatccaaaa agacctaacg tcttttttta 13920 ttttcgctgg ttttcgttca tttaaaccaa ttgcccccac atttttgtaa tatctatttg 13980 caccatcgtt tagagggtga actgtagatg ttcaaacata cctcaaagtg aattcagtcg 14040 gaagtataag aggacttacg actaattgaa catacacggt atatgcaaaa tgacgtcata 14100 tatgtattac ttttcgctga ttttcgtaaa tttaaaccat ttgcccccat atattggtaa 14160 tatctatttg caccatcgtt tagagggtga actgtagatg ttcatacata cctcaaagtg 14220 aattcagtcg gaagtataag aggacttacg actaattgaa catacacggt atatccaaaa 14280 agacctaacg tcttttttac tttcgctggt tttcgttcat ttaaaccatt tgcccccaca 14340 tgatggtaaa aactgaactt accatcgttt agagtgtgaa aggtagatgt tcatacatac 14400 ctcaaagtga attcagtcga aagtataaga ggagttagga cttcttgaac atacacggta 14460 tatccaaaaa gacctaacgt cttttttatt ttcgctggtt ttcgttcatt taaaccattt 14520 gcccccacat gatggtaaaa actgaactta ccatcgttta gaaggtgaaa tgtagatgtt 14580 catacatacc tcaaagtgaa ttcagtcgga agtataagag gacttacgac taattgaaca 14640 tacacggtat atccaaaaag acctaacgtc ttttttattt tcgctggttt tcgttcattt 14700 aaaccatttg cccccaaatg ttggtaaaaa ctgaacttac cattgtttag aaggtgaaat 14760 gtagatgctc atacatacct caaagtgaat tcagtcgaaa gtataagagg agttaggact 14820 tcttgaacat acacggtata tccaaaaaga cctaacgtct ttttttattt tcgctggttt 14880 tcgttcattt aaaccatttg cccccacatg atggtaaaaa ctgaacttac catcgtttag 14940 aaggtgacat ctagatgctc atacatacct caaagtgaat tcagtcggaa gtataagagg 15000 acttacgact aattgaacat acacggtata tgcaaaatga cgtcatatgt gtattactgt 15060 tcgctggttt tcgtaaattt aaaccatttg cccccacata ttgttaatat ctatttacac 15120 catcgtttag agggtaaaat gtagatgttc atacatacct caaagtgaat tcagtcgaaa 15180 gtataagagg agttaggact tcttgaacat acacggtata tccaaaaaaa cctaacgtct 15240 ttttttattt tcgctggttt tcgttcattt aaaccaattg cccccacatt ttggtaatat 15300 ctatttgcac catcgtttag agggtgaact gtagatgttc aaacatacct caaagtgaat 15360 tcagtcggaa gtataagagg acttacgact aattgaacat acacggtata tgcaaaatga 15420 cgtcatatat gtattacttt tcgctggttt tcgtaaattt aaaccatttg cccccatata 15480 ttggtaatat ctatttgcac catcgtttag agggtgaact gtagatgttc atacatacct 15540 caaagtgaat tcagtcggaa gtataagagg acttacgact aattgaacat acacggtata 15600 tccaaaaaga cctaacgtat tttttatttt cgctggtttt cgttcattta aaccatttgc 15660 ccccacatgt tggtaaaaac tgaacttacc atcgtttaga gtgtgaaatg tagatgttca 15720 tacatacctc aaagtgaatt cagtcgaaag tataagagga gttaggactt cttgaacata 15780 catggtatat ccaaaaagac cgaacgtctt ttttattttc gctggttttc gttaatttaa 15840 accatttgcc cccacatgtt ggtaaaaact gaacttacca tcgtttagag tgtgaaaggt 15900 agatgttcat acatacctca aagtgaattc agtcgaaagt ataagaggag ttaggacttc 15960 ttgaacatac acggtatatc caaaaagacc taacgtcttt tttattttcg ctgggtttcg 16020 ttcatttaaa ccatttgccc ccacatgatg gtaaaaactg aacttaccat cgtttagaag 16080 gtgaaatgta gatgctcata catacctcaa agtgaattcc gccggaagta taagaggact 16140 tacgactaat tgaacataca cggtatatgc aaaatgacgt catatatgta ttactgttcg 16200 ctggttttcg taaatttaaa ccatttgccc ccacatattg gttaataatc tatggggtta 16260 tcataagggc aatcatcagt tcaccttgcc tccaatatta gattacccta acccattatt 16320 gcgccgggtt taggcaatat ccgtaatgtt ccttgcccaa gtatacaacc gtcgagttag 16380 cgaccagctt ccaatcagac cactgttgac gtcacatctt tattattttc actgcttttc 16440 gtaaatttaa aaccatttcc cctacacgaa tcctatactg taattatact gtaattaaaa 16500 tgacgtcgtc gcgaattttc acaaaatact tataattttt tatttggaaa cactggtagc 16560 ctcatttcga tccctgctat cggctattgc ctgcctgctt gacttttcga atcgcctgct 16620 gagaattgcc tgcttgcttg acctaggt 16648 // ID BEL-34_AA-LTR repbase; DNA; INV; 406 BP. XX AC AAGE02019757; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-34_AA_; KW BEL-34_AA-I; BEL-34_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-406 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019757; Positions 28197 27792. XX SQ Sequence 406 BP; 143 A; 58 C; 89 G; 116 T; 0 other; tgttgcggac agtgaggacc cctcgtcgat gatgcgatgt gatcccgtgt gtgaagaaag 60 tggatggatg tcaaagtgac aaaaggtcag atgaagcgtc agttgaatga taaacattat 120 tgaaagttgt ggacaacaag attattataa agtaaattga atttttcgta attaagatta 180 ttattacact gttattaccc tattataggt caatcaagca aagaattcag tagtagtagt 240 ctgtgggtgg ttgaatttat ttcggaaact aagctgtaag tagtatctgt taaacaagaa 300 ttcgaagtaa ctgaataaat aaaacttatt gcagctttag cgatactctg cggaaacagc 360 gagttgctta aagaaacccg aaaggataat accattacca ccaaca 406 // ID Gypsy-39_CQ-LTR repbase; DNA; INV; 271 BP. XX AC AAWU01034372; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_CQ_; KW Gypsy-39_CQ-I; Gypsy-39_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-271 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 458-458 (2011). XX DR Genome; AAWU01034372; Positions 47518 47248. XX SQ Sequence 271 BP; 52 A; 73 C; 69 G; 77 T; 0 other; tgttgaggac agagaaagtg cgcggagtgc ccccaccttc cgcgcgccgc tggctcacaa 60 gtggctgcca gtctgccact gtgtgacgca gcacagcgcg ttcgatgttt tataagctgt 120 cccgttttgt gttgtgccat cgtgccattc tacttccgtc tgcggacaag aataaacgtt 180 ttaacttgta ctaagtctat tttatttgcg tacgttcgct agtctatttg cccgtcgcgg 240 gtaatccaac gtccagcgtg cgtatacgac a 271 // ID Gypsy-89_CQ-LTR repbase; DNA; INV; 178 BP. XX AC AAWU01006656; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-89_CQ_; KW Gypsy-89_CQ-I; Gypsy-89_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-178 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 558-558 (2011). XX DR GenBank; AAWU01006656; Positions 9735 9912. XX SQ Sequence 178 BP; 40 A; 51 C; 35 G; 52 T; 0 other; tggtggttcc cacgcgtcgt gtagtgccca ccctcattgt tgctacactt ggtagcaaaa 60 ccaacaaacg tcactctagc acttcctatc gcgctattcg gatcgttgca aaataaacca 120 agtttagtta agttaattcc ggtcgtttta ttcgtccgcc gttccgaggt tccccaca 178 // ID Gypsy_DG_I repbase; DNA; INV; 3090 BP. XX AC . XX DT 10-MAY-2009 (Rel. 14.05, Created) DT 10-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE Gypsy_DG: A Gypsy-like family of LTR retrotransposons in DE Drosophila grimshawi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; gag-pol; KW Gypsy_DG_I. XX OS Drosophila grimshawi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila; OC grimshawi group; grimshawi subgroup. XX RN [1] RP 1-3090 RA Styles P.; RT "Gypsy_DG: A Gypsy-like family of LTR retrotransposons in RT Drosophila grimshawi."; RL Repbase Reports 9(5), 962-962 (2009). XX DR [1] (Consensus) XX CC Gypsy_DG is a Gypsy-like family in Drosophila grimshawi, with a CC copy number of 6 elements. The consensus sequence is 3090bp in CC length, and contains a single open reading frame encoding a CC gag-pol polyprotein between positions 1082 and 2743. Maximum CC identity between flanking LTRs is 98.4%, suggesting recent CC activity. XX FH Key Location/Qualifiers FT CDS 1082..2743 FT /product="gag-pol" FT /translation="MSVSRSIDNLNSLANLESVSCCVCTLEVEHPAQLLAT FT SCNHYFHHTCFNNRSGNKRVCPICRTSLSTSTQNLAVELAASQPAKVALKP FT QSLITRSKSKVVKVITEAMPKVATTGKMQTSKAGISSVSAVRQGNESCTVP FT QSNPVEVIATTSTELSANQEARTSQQSHPVVDQNNAHNLLATNLQQTVVSA FT MATAIAQQTQLLSQAIETGFQRMQNPVGNPADVQPSARPSVHSRERQTFEQ FT LFQLPGPGDSENVPSNGVPPTSSFTPIGSQQCESLRSDRISQIISNWKLRF FT SGRSSLSVEEFIYRVEALTAQSLDGNFELLSRYASNLFEDSGSEWYWRYHK FT SVPCVCWPDLCRALKAHFKDGRTDLDIRTAISLRKQKYHEPFDTYHEAIMF FT LADRLTQPMSEQSLLEVLQANLLPEVQHEILYVPIYNLEQLRHVVRRRERF FT MKAVTKTNVPSQRLIPGRQVHEIAMQTDCEQELENPEIAAMNVSCWNCGRH FT GHRYQECEAERKVFCYGCGRRDTYKPSCTKCNASKNEVSRAPTTSARKLVS FT KGTNTE*" XX SQ Sequence 3090 BP; 920 A; 621 C; 657 G; 892 T; 0 other; gttatttata attagttata atatttattt tggcgcccaa cgtatggggc ctgtttcaag 60 taaaaatatt agtgttcaat aagctaatag gaggttaatc gaaaggccat cggttaggct 120 tccgatctac agtagtattg gaaaaaaaaa aagaagagaa accatgctag tattttgcac 180 ctgtacaggc tcacaaattt cgaggtcagc tacttctgat gtagcagcta tctgcttaca 240 caaatgactt catatcccca tacctttgtc cttgttttga taatagagat gtagtaatat 300 tattcgattc caaaatattg ataatgaccg tctcagtaaa caaaacaata ttgcgctcgt 360 gtaagttcta ggaacaatta caatttcaaa ataattgtgt tgccagactc acaatgcaca 420 agcgcaaatt cctcctagtg gtgctacgat tggtgtaacg ctttagctgg acgaggattc 480 atggataatt aagctatgat ccgccgctga gtggaacatt aaaagtagtt actccctacg 540 tgtctgtgga catacctaca aaccaaaata ttgggccgga ttcgggtgtc gctggattca 600 actcaggtgg ttggtcgcgc catttggatc gggtttaata tcgcggatca cagtctattt 660 tcttagtagt tgagcatttg attctattcc ggatgcgaca gtcttttgcc ggaggagtat 720 agttcctatt ggctggtgct tctttttctt aaactatact ctcgcctgga acaataaccg 780 ttttatttgc aaacgaaaat gggatcattg atgcaattac acttttcttt tacctctatc 840 ctatcgtttc tcttttccaa atctttttgg gggtcaaata aatcaagtaa aggtcggcat 900 acctactctc aagaaggaat tcgcattgtt gtccacactc aaataatgaa ttttgaatat 960 atcgctcggc gtaaattagt tttttttctc ttatcttttt tttagtaatc tttagcacat 1020 ggtttgcgtt tgtttctcat ttatatgaat cattttaata gtttgctatg ttagtatagc 1080 aatgtcagta agtcgaagta ttgacaattt aaattctctt gctaatttgg aaagtgtgag 1140 ttgttgcgtg tgtacattag aggtggaaca cccggcacag cttctagcca ccagttgtaa 1200 ccattatttt caccatactt gttttaacaa tagatcaggt aacaaacgag tttgtcctat 1260 ttgtagaact tcacttagta cttcaactca gaatttggca gttgagttag ctgcatcgca 1320 gcctgctaag gtggcattga agccccaaag ccttataaca cgatctaagt ctaaagtcgt 1380 taaagttata acggaagcta tgcctaaagt cgctacaaca ggaaaaatgc agacttcaaa 1440 ggcgggaatt agctcggtat cagcggtaag gcagggaaat gagtcgtgta cagttcctca 1500 gtccaacccc gtcgaagtaa tagctactac tagcacagag ctaagtgcta atcaagaggc 1560 aagaacgtct cagcagtcgc acccagttgt ggaccaaaat aacgctcaca acctgcttgc 1620 taccaacttg caacaaactg ttgtcagcgc catggcgaca gcaattgctc aacagaccca 1680 acttttgtcc caagccattg agacagggtt tcaaagaatg cagaatcctg taggcaaccc 1740 tgcagatgta cagccatctg ctaggccatc tgttcatagt agagaacgac aaacgtttga 1800 acaactattt caactccctg gtccaggaga ttcggaaaat gtccctagta acggtgtacc 1860 gccaacaagt agtttcacgc caataggatc gcagcagtgc gaaagtttac gttctgatag 1920 aattagccag attatttcta attggaagtt gagattttcg ggacgttcat cgttatcggt 1980 agaagaattt atatatcgcg tagaagctct tacagcacag tcgttggatg gcaactttga 2040 gctgctatct cgctatgcta gtaacttgtt tgaagacagc ggtagcgaat ggtattggag 2100 ataccataaa agtgttccgt gcgtttgttg gccagacttg tgtcgagcgt taaaggccca 2160 ttttaaagat ggtcgaacgg acttagacat tcgcacagcc atttcgttga gaaaacaaaa 2220 ataccacgaa ccttttgaca cctatcatga ggctattatg ttcttagccg ataggcttac 2280 acagcctatg tccgagcagt ctttgcttga ggtgctgcaa gcaaatttat taccggaagt 2340 gcagcatgag attttgtacg taccgattta taatctagag cagttgcgac atgtcgtacg 2400 cagaagagaa cggttcatga aagcagttac aaaaactaat gttccttctc agcgcctaat 2460 tccaggacgg caggttcacg aaattgcaat gcaaacagat tgcgagcagg agttagaaaa 2520 tccagagatc gctgctatga atgtttcgtg ttggaattgt gggagacatg gtcatcgata 2580 ccaagaatgt gaagcagaac gtaaagtgtt ctgttacgga tgcggaaggc gtgacacata 2640 caaaccgtca tgtaccaagt gcaatgcttc aaaaaacgaa gtgtcgcgtg caccgacgac 2700 cagtgcacgc aaactggttt cgaagggaac gaacaccgag taactgatga aatgatcagt 2760 tcactcgatc aactaccgga cttgacttta ctcgttaggc taagagacag caaaattcaa 2820 aaaacttgaa cgaaaaatta cctgatgtta ctaaagtctt tgccaaaggt ttcagtgcaa 2880 aattggcacc agtcttcgta aaagctagaa tcaaatccaa agtcggtcat agctattacg 2940 aattagaaga tttacaaggc cgcccaattg gcaaatatca cgcaaaggac attaaacaat 3000 gataacctta tctacattac tttgtgctct ccaagcggtt atcactcttt gatgttttca 3060 acaccaaagt gtgattttgg cggaagggtg 3090 // ID R1_DVi repbase; DNA; INV; 5248 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Drosophila DE virilis. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_DVi. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-5248 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 248..1648 FT /product="R1_DVi_1p" FT /translation="ISAHLRTMPPRKKKPAEKGAFENCRMTSEDEVTLSES FT SSNSEVRRRSKQRLRKATSESAVSSVGAREGSAVPAGAADGSPGAAGASVE FT ERVFAVPHAVSVLDVSEPAVEGPSTSAAALRKRKGTAVEQMKGITGALLQL FT ALKERATGSLICKLVDYTGQYEQLLFALVAQNERLQGRLEAVCGGAGVSHE FT LHVPAGQARAPSVNAARGGPRSAVPSTPEMPRPVETWSLVVRSKAAGKTAK FT DVVEQVVKEVGPSLGVRIHEVKPLRDGGALIRTPSVAEREKIVENTKFNEV FT GLEVCVNDKLGPKVVVQGVHSQITPDEFMGDLYEMNLKDKMSLEAFKKGVR FT MTSKPWAAGGNVAVNIVLEGAVVAMQSLLEIGRCYIKWFSFRVRSFDLVPG FT CYRCLGFDHKVAECRAKEDVCRRCGQMGHRVAQCSNALNCRNCSFKGKPSG FT HLMMSLACPIYGAIVARANARH" FT CDS 1602..4724 FT /product="R1_DVi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="PVPYTGRLLRGRMLDIKMSSRFLQLNCQRSYAAMCDV FT GAMMCERGSVVALLQEPYATGGCVRGLPACMRVFPDSRANSAVIVNDVSIE FT CTLVSSTDWGVCVSLSGSFGRIFVVSMYCKFGDPLEPYIAYLDEVLLLVSS FT VPVILGLDANASSPMWFSKVSRHSSGYSNHTRGEMLAEWAVSQDVRVVNEP FT SEWYTFAGPVGQSDIDVTLVNAVAMSMYAFHWCVLGGLGVSDHNPIEIVVT FT HTITRNESVGGNRWRTCSVNWSLHGLIVQEAATQVPLNAFNDLNVDEQVVC FT VNGWITGANDRMFERYRKVNLKHVKWWTNQLSTKRRMVRSLRKRFQRARAT FT NADGAAQLRIEYSRCMNEYKQMLVKVKEDEWRSFLERNKDDPWGRVYKIVR FT GRGRETDVSSLRVGNVQLTAWRDCMNVLLNEFFPRADHQDSPPSVVRPVDP FT LLDSELEVAFSRLKSRKSPGMDGFTGEMCKSVWKSIPEYMNAMYAKCLNEG FT YFPSEWKCARVIVLLKSPDRIRSNPRSFRGISLLPVLGKVLERIMVERLQE FT RMXDQMSDRQFGFRQGRCVEDAWRYVSDSVEASSSTYVLGIFVDFKGAFDY FT LSWASVLRRLDECGCRELALWKSYFSGRRACAVGRCECVSVNVVRGCPQGS FT ICGPFIWNLMMDPLLVQLEQMCKCCAYADDLLILIEGRSRLEIETNAGMCL FT RTVYEWGESVGVSLAMDKTVSMLLKGRLAASRPPIVRLNGVNVRYASEVKY FT LGITFGERMCFTPHIAGLKARLLGLVGQVRRILRSDWGLSRRAVRTIYEGL FT FVACAAYGSSVWCKAVTTVVGRKKVLACQRVLLLGCLPVCRTVSTEAMQVL FT LGVAPLDLEIRRRALMYRIKRRLPLLQNDWLAGRDVESLGLSECKRLVKEC FT VLSDWQVRWDTSVNGRVTHRFIREVTFAGSRPDFGFGLSLGFLLTGHGSLN FT AFLHSRNLYDSPECRCGSAAETWEHVLCECTRYADFRDLRGMGISEVSGGF FT DVSQSLSTSDRVRRMSEFARAAFSRRRLQLQEGIV" XX SQ Sequence 5248 BP; 1167 A; 1015 C; 1679 G; 1386 T; 1 other; cagttcgttt cagacagtcg ttgggaacag acgtgtttat ttcatcgctc gcttcgcaaa 60 caattttaag ataacggtgc tttcaagcaa cggtatcttg tttacatttt gtgcgcttat 120 cagcaataaa gttttattgc tttgtttacg tcgttgctgc gtgagcaaca ggcgttcgtg 180 tgtgtgtgtg aaaccgccag cttctacgtg tgtgtgtgtg tgtgcgtgct atttatatat 240 taaatagata agcgcacacc tgcggacgat gccgccacgc aaaaagaagc ccgctgagaa 300 gggggctttt gaaaattgta ggatgacgtc ggaagatgag gtgaccttgt cggagtcgtc 360 atcgaacagc gaggtaaggc ggaggagcaa gcagcgcctg aggaaggcaa cttccgaaag 420 cgctgtcagc tcagttggtg cgcgtgaggg gagcgctgtt cccgcgggag ccgcagatgg 480 ctcgcccgga gcagcgggtg caagtgtgga ggagcgggtt tttgccgttc ctcatgccgt 540 gagcgttttg gatgtgtctg aacctgccgt ggaggggccg tcgacaagtg cggcggcttt 600 gaggaagcgg aaaggaacag ctgtcgagca gatgaagggg ataacgggag ctttattgca 660 gcttgcactc aaggagagag ccacggggtc gctcatttgc aagttagtcg attatacagg 720 gcagtatgag cagctcctgt tcgcgcttgt ggcgcagaat gagcggctgc aggggcgctt 780 ggaggccgtt tgcgggggtg caggcgtttc gcatgaattg catgttccag ccggccaagc 840 gcgggctccg tcggtgaacg ccgccagagg gggtcccagg tcggcagtgc cgtctacccc 900 ggagatgcct cggccagttg agacatggtc cttggttgtg cgcagcaagg ctgctggcaa 960 aaccgctaag gacgtggtgg agcaggtggt gaaggaggta ggtccctctc ttggtgtgcg 1020 gatacatgag gtgaaacctc taagggatgg aggtgcgtta attcggactc catcggtcgc 1080 cgagcgtgag aaaattgtgg agaacacaaa gttcaacgag gtgggattgg aagtgtgtgt 1140 gaatgataaa ttagggccga aagttgtggt tcagggcgtt cactcgcaga tcacccctga 1200 cgagttcatg ggtgaccttt acgagatgaa cttaaaggac aaaatgtcac ttgaagcctt 1260 taagaaaggt gtccggatga caagtaagcc atgggcggcg ggaggtaatg tagcagttaa 1320 tatcgtcctc gagggtgctg tggtggccat gcagtccctt ttagaaatcg gacgctgcta 1380 cataaaatgg ttttctttta gagtgagaag ttttgacctg gtaccgggat gctaccgctg 1440 ccttggcttc gaccataagg tggcggagtg cagagccaaa gaggacgttt gccgtcgctg 1500 cggccagatg ggtcaccgcg tggctcagtg cagcaatgca ctgaattgcc gcaattgctc 1560 ctttaagggg aagccgtcgg gacatctgat gatgtctcta gcctgtccca tatacggggc 1620 gattgttgcg cgggcgaatg ctagacatta aaatgtctag cagatttctt cagctgaatt 1680 gtcaaaggtc gtatgcagct atgtgtgatg tgggggctat gatgtgtgaa aggggcagcg 1740 tcgttgccct gctgcaggaa ccctacgcga ccggtggttg cgtaagggga ttgcccgcat 1800 gtatgcgagt attccctgac agcagggcta actccgctgt cattgtgaat gatgtcagta 1860 ttgaatgcac tttggtgagt tcaactgact ggggggtgtg tgtgagccta agcggcagtt 1920 ttggcaggat ttttgtagtg agcatgtatt gtaagttcgg ggatcccctc gaaccataca 1980 ttgcttactt ggatgaggtg ctactactgg ttagtagcgt accagtcatc cttggtcttg 2040 atgcgaatgc atcatccccc atgtggttca gcaaggtatc cagacattcg tctgggtatt 2100 cgaaccacac acggggtgag atgctagccg agtgggccgt gtcccaggat gttcgggtcg 2160 ttaacgaacc cagtgagtgg tatacgtttg cgggtccggt gggtcagagt gacattgatg 2220 ttactctagt gaatgcggtg gcaatgagta tgtatgcttt tcattggtgt gtactaggtg 2280 ggcttggtgt gagtgaccac aatccgattg agattgttgt cacacacact attacgagga 2340 acgaaagtgt tgggggtaat cgctggcgca cttgtagtgt gaattggtcc cttcatgggc 2400 tcattgtgca ggaggcggca acgcaagttc cgcttaatgc atttaatgat ttgaatgtgg 2460 atgagcaggt cgtatgtgtg aatgggtgga taactggtgc gaatgatcgc atgtttgaga 2520 ggtaccgtaa ggtcaacctt aagcatgtga agtggtggac gaatcagtta agcaccaagc 2580 ggcggatggt ccggtctctg cggaagcgat tccaaagggc cagagctacc aatgcggatg 2640 gtgcagccca actcaggatt gaatatagtc ggtgtatgaa tgagtacaag caaatgcttg 2700 taaaggtgaa agaagatgaa tggcgctctt tcctggagcg caataaggat gacccctggg 2760 gtcgtgttta taaaatagtt cgaggtagag gcagggaaac ggatgtaagt agcctccgtg 2820 ttggtaacgt tcagttaacg gcatggaggg attgcatgaa tgtcttgttg aatgaattct 2880 tccccagagc ggatcatcag gactcaccac ctagtgtggt taggcctgtt gatccgcttc 2940 tggatagtga gttggaggtg gctttctcga ggttaaagtc gaggaagtca cctggtatgg 3000 atggattcac tggagaaatg tgtaaaagtg tctggaagtc aattccagag tatatgaatg 3060 cgatgtatgc gaagtgtctg aatgagggtt atttccccag tgaatggaag tgtgcgagag 3120 tgatagtgct cctgaagtcg cctgacagga tcaggagcaa tcctagatct tttcggggca 3180 tcagtcttct tccggttctt ggaaaagttc tggaaaggat tatggtagaa aggcttcagg 3240 agagaatgar tgaccaaatg tctgacaggc aatttggttt taggcagggc agatgtgttg 3300 aggatgcttg gaggtatgtt agtgactctg ttgaggctag cagctccacg tatgtcttgg 3360 gcatctttgt tgattttaaa ggtgctttcg actacctgag ttgggcgagt gttttgagaa 3420 ggttggatga atgcgggtgc cgagaattag ctctctggaa gagctatttc tctggcagac 3480 gtgcgtgtgc tgtagggcgg tgtgaatgtg tgagcgtaaa tgtggttcgt ggctgtccgc 3540 agggatctat ctgtggccca ttcatttgga acctcatgat ggaccccttg ctggtgcagc 3600 ttgagcagat gtgtaaatgt tgtgcgtatg cggacgatct gctcatcttg attgagggtc 3660 ggtcgcgtct tgagatcgag acgaatgcgg gtatgtgctt gcgcactgtg tatgagtggg 3720 gtgaaagtgt tggggtcagt cttgcaatgg acaagacagt gtcaatgctg ctcaagggca 3780 gattggcagc tagtcggcca cccattgtca gactgaatgg agtgaatgtg aggtatgcgt 3840 ctgaggtgaa atatctcggc ataaccttcg gcgagaggat gtgtttcact cctcatatcg 3900 ctggtctcaa ggcccggcta cttggtttgg tggggcaagt gcgtcgtatt ttgaggtctg 3960 actggggcct aagcagacgt gctgtccgca ccatctatga gggtctgttt gttgcatgtg 4020 cagcatatgg atcgtctgta tggtgcaagg cggtcacgac tgtggtcggc agaaagaaag 4080 tgctggcttg ccagagagtg cttctgttag gttgtttgcc tgtgtgccgc actgtctcta 4140 cggaggcaat gcaggtactg ttaggagtag cccctcttga cttggagatc aggcgtcgag 4200 ccttgatgta caggatcaag aggcggctgc cattgctgca gaatgattgg ctagcgggta 4260 gggatgtgga gagtttaggg cttagtgaat gcaagagatt ggtgaaagag tgtgttttgt 4320 ctgactggca agtcagatgg gacactagcg tgaatgggcg tgtcactcat cggtttatac 4380 gggaggttac atttgccggc agccgaccag actttgggtt cggcctgagt cttggattcc 4440 tgttgactgg tcacggttcc ctcaatgcat tcttgcattc gaggaacctt tatgacagtc 4500 cagaatgccg ttgtggctcg gctgctgaga catgggagca tgttctctgt gagtgcacaa 4560 ggtatgcaga ttttcgagat ctgaggggga tgggtataag tgaggttagt ggcgggtttg 4620 acgtaagtca atcactctcc actagtgata gggttagaag aatgagtgag tttgctagag 4680 ctgcattttc caggcgacgt ttgcaattgc aggaaggaat tgtatgaatg atgatggatg 4740 gtagatgtga gaatgtgggg gtacgaatgt gggggtgttt ctgaatggta atttttgttg 4800 aacattgtgt tgtgatgggg ctaccaaccc ttaactgttt tggagtttaa attggaagtg 4860 cccttctggt acgaggcctg accggaggct tttaatctgg taccacgggt aaccaggagc 4920 ccacggaact tgttccgtcc tggttagttg gtgcggccct tcggggagta tcgtggtggt 4980 tgtggtttaa cacccaaatg cgggtagagc atcgctcgac gtggagttgc gtcatacaac 5040 cgggttccgt gacccagatt acggaagagg cttagatagt cctcgtgcca aaccaaggta 5100 gaagtcacaa ccaaacagtc gtgtctttaa ttggtacctg cggaattgtt ccaagggggc 5160 ggtgattgac gcttgaatta atcctatact aggaaccgtg agattaagcc atcgcggcag 5220 gtgctcacgt taagcccact gactttca 5248 // ID BEL-237_AA-I repbase; DNA; INV; 5810 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-237_AA_; KW BEL-237_AA-LTR; BEL-237_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5810 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 929-929 (2011). XX DR [1] (Consensus) XX CC Positions [4857-5441] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2236..3192 FT /product="BEL-237_AA-I_1p" FT /translation="MVEVDIRSWPIPAGLSLADPSFHVPDEVQLIIGAELF FT YDLLMQGRMKIGDGCPTLVETRSGLDRQRSGFHLQQEDPTQRLQSQSATSE FT DLNRTLSKFWELEACGEASPLTAREHAVEMHFERTVSRDASGRYIVRLPFN FT ELKGKLGDSYETARRRFEKLLRTFVDDSKRIRYTAFMTEYLSLGHMVEVTD FT DPLDGHFLPHHAVYKEASSTTKLRVVFDASSKTTSGLSLNDALNVGPTVQS FT DLLSIILRFCSHQVVLTADIPKMYRQVRVHEADRYYQRILWLNDKNEIATF FT ELATVTYGCSSAPYLATKVLLQLAKDE" FT CDS 4050..5810 FT /product="BEL-237_AA-I_3p" FT /translation="MNQFVSNRVASIVELTQDYEWHYVRSENNPADLISRG FT LMPDALEEHNLWWKGSPILNEIEPRTDEYDDSIELPEQRKVTTVSATKKVS FT PIDFNRLSNFRRLQRAWVYVLRFIANIRTKTRNISEPTAVEMDEALLVIVK FT LVQQEAFLDLFKLLSCGSQKRNNYSGLSPFVDSDGLIRVGGRLKYSSIPYD FT GKHQLLLPDKHQVTLALIRKLHEEHLHVGQRGLLSIVRERFWPINAKSLIK FT KTIATCYVCCRNNPRPSSQYMGNLPDYRITPSPVFANTGVDYAGPIVLKEA FT GRKTVPYKAYIAVFVCLATKAIHLEVVSNLTTDNFIAALQRFVSRRGMISN FT IYSDNGTTFVGANHELAALRTLFEDQIHQRKLNDFCVTKGIQWHFIPPRSP FT HFGGIWEAGVKSAKYHLKRVVGETKLTYEEMATFLAQTEAILNSRPLIPVS FT DDPNDVEVLTPSHFLIGRSAVCLPEPSYDQEKIGRLSRWQHVNLMKEHFWK FT RWSAEYLHHLQSRPKWHNGITKFEVGSLVVLKDDNTPPHQWRIGRIQTTHP FT GKDGIVRVVTIKTSTGEYRRAVTKVCVLPLIDPEESTAGE" XX SQ Sequence 5810 BP; 1602 A; 1344 C; 1478 G; 1361 T; 25 other; tttggtccta tcgaacctga tttgattaag tatggttttt cgagttgatt tgggccgaat 60 ttcagtgaat agttgtgatt ttgcgcgaat ttgcaccgtg tgcaaatagt ttggcgttcc 120 gccatttgcc gcttcgttcc tgcaatcatt gcttgtgtgc gagtgcgaat tatgacgtgt 180 ccggaacacg aaaacaaaat ggagtacttt gaaagagttg ctcgtggcaa ctgaactaag 240 acgagtacaa gtgaaaattc agatgctgcc atgcagcaag tgcaaatgaa cttgtaatcg 300 aaaacaaact aaaacattcg gtgctgctcg tagcagctaa gtggtatcga acagagaaaa 360 agtgcttaca tgagtgataa aaataccaga tatgaaaccc caaaagttca gcgaaacata 420 accccgaaga atttgttgga tgcgctgctt ggttcggtgc agaatagttc tccatcggaa 480 acggagtcga atcttcctcc aacgaatccg gtatcgaatc gtcagcctac agaaactgta 540 gcacagcaga aagcagctca agcaaaacgt ttggaaaagm aaaaagccat ggagcagaaa 600 ctcgagaagg ctattagatc ggcgcgactt gkctaaggag aagcttmttc aaatccacga 660 agagacgctg gaagtgcaag tgaacaacat tcactggctg aatttacagc tggagagcat 720 acggagatgc tacgacgaca acgagaagac ccattcggag ctttgcgata tcgtttctcg 780 tgagcaacgg aaggctttta cggaggttta aaacttgcag tttgaccggt actacgacaa 840 tctgttcgtg caaatccaga ctgcaatagc aaaactgcaa agcgaagaga aagctaagca 900 agctaggtgg agaagcgctc acattcggcc gtgccgcagc aatcgtccac ggcgaatccc 960 acggctccgc acctgcaagt cccgttgccg accttcgatg gtagtctcga gaactggtac 1020 acgttcaagt gcatgttcca gaccatcatg ggtagatacc ccaacgagtc tcctgcgatc 1080 aagctgtacc atctgaagaa ctcacttacc ggaagcgcat ccggtaagat cgatcaggat 1140 gtgatcaaca acaatgacta caaagcggca tggaaagtgc tcgaagacgc gtacgaggat 1200 gaaaggctaa tcatcgatac ccacatcgat gccctggaga tgctgcccag gatgacgcgc 1260 gagaacgggg aagaattgcg gaagctgatc gaaagctgct cgaaacatgt cgacgccctc 1320 aaaaacctcc aactaccggc ggaaggattg ggtgagatga ttctcatcaa cacggtggcc 1380 aagcgtttgg acaaggtgac gcggacgctg tgggagtcgc agttggacca agaggagtgc 1440 ccgtctttcg gcgagatgat ggatttccta cgcgaacgat gcagaatact ccagaaggtg 1500 aaaggctacc cggaacatcg agcgtcgagc taccgcagtc aagcagaagg gaaaaccaga 1560 gcagaggatt mmttgccggc gaggaacttc gtscaagtat ccaaggaatg tatgccmstg 1620 ttgcagcggt gaccacscka tctacaaatg tgagtsagtt ccgggagcta tgtgtgtccg 1680 gccgctactg caaggtgaaa gcaagcgggt ctatgcttca actgcctgcg tcgcggtcat 1740 cgtacggtag attgcaagtc cgagcagtta tgcaagacgt gccggaggaa gcaccatagc 1800 ctcctgcacg acgacaaggc gtgtagcgaa sataagwscg acskaktccc gttccacccg 1860 ccgctggtac ctgaagatcg ccgatgaatt acctgctgtm caagcccaag ggtctgttaa 1920 ctgcgctcaa gtcgccgaac atgaagaaac aagtgctgct ctctactgca gacgtgctgg 1980 tttgcgggtc aggaggattg agattggcgt gccgtgccct tctggattct gggtctgatt 2040 ctaatatcat gtccgaggag ctcgcgagga gtctggatgt tgcctgggag cggatcgact 2100 tgccgatcac agggctgaac aactctgaaa cccgggtgaa gtacaagttg cgaacgaaga 2160 tcatctctcg cgtaaatcag ttcagcgcta ttctggactt cctggtggta tccaaagatc 2220 accaccaacc tgccgatggt cgaggtcgac atacgctcgt ggccgattcc ggctggtctg 2280 tctttggccg acccgtcttt ccacgtccct gacgaggtcc agctcatcat cggcgccgag 2340 ctgttctacg acctgctgat gcaaggaaga atgaagatag gcgacgggtg ccctacgttg 2400 gtggagacgc gatctgggct ggatcgtcag cggtccggtt ttcacctgca gcaagaagat 2460 ccaacgcagc gtctgcaatc tcaatctgcc accagtgaag atttgaaccg aaccctgtcg 2520 aagttctggg aattggaagc ttgtggggag gcttctccac tcaccgcacg ggagcacgcc 2580 gttgagatgc atttcgagcg aacggtttca cgagacgcta gcggtaggta cattgtgaga 2640 cttccattca acgaactsaa gggcaaactc ggcgactctt atgaaactgc tcgtcgtcgg 2700 ttcgagaagt tgttgcgcac atttgttgac gactcgaaac gaatccgcta cacagcattt 2760 atgaccgagt acctatcctt aggtcatatg gtggaggtga cggacgaccc tttggatgga 2820 cattttctgc cgcatcacgc ggtttataag gaagcaagct cgacgacaaa gctccgcgta 2880 gttttcgacg cttcttcgaa aacgacgtcc ggattgtcat tgaacgatgc gctgaacgta 2940 ggaccgaccg tgcaaagtga tttgctatca attatactac ggttctgttc tcatcaagtt 3000 gttttaacgg cggacattcc aaaaatgtac cgtcaggtac gagttcatga agctgatcgg 3060 tattaccaaa gaatactgtg gctcaacgat aagaatgaaa tcgcaacctt tgaattggcc 3120 actgttactt atggttgctc cagtgcaccg tatcttgcaa caaaagtact tcttcaactg 3180 gccaaggacg aagmacmsga kctgccsttg cggcgaaggt ggtsgagaag acagctatat 3240 tgatgacttt cttactggag gaaagacagc agaagaagtg atccagatct accaccagtt 3300 gtcggatatg ctgaagcgag gaggcttcgg tgtccataaa ttttgctcca acgatgtcac 3360 cgtactcagc aatattccag aagaactcca ggagacccga atggattttg aaaatgccga 3420 cgttaacacc gctatcaaga cccttggaat tatttggaat ccggatcaag attacttcag 3480 cttctacgtg aaaactttcg attcccaaaa ggtttttcca ccaacgaagc gaagcgtact 3540 atcagatatt ggccaactgt tcgatccttt gggttttctc gggccaatca tcacaacggc 3600 aaagctaatt atgcaagacc tttggcgcct tggcttgtcc tgggatcaac cattaccaca 3660 acagcagatg gaggagtggc aagaatttcg acagcaactt ccgatggtta atgaaatgaa 3720 gaagaagcga tgcgtggttg ctgaaacagg aaacgatgtg gaactccatg gattctctga 3780 tgcttctaca agggcctacg gtgcagtcct gtataccaga tgcgtttctg cagatggatc 3840 gatatacacc gaattagttt gcagcaagtc acgggttgct ccactcaagc caaccactat 3900 tccacggctg gaactatgcg gagccttgct tctggctcac ctggtgacga aaacagtggc 3960 ggcaatgaag atacctttca aaagtgtgac acttcggtgt gactcacggg caagtggtgt 4020 tatgttggtt gaaaaaatca ccgcttgcta tgaatcagtt tgtttccaac cgtgtagcaa 4080 gtattgtcga gttgacgcag gactacgaat ggcattatgt gcggtctgag aacaatccgg 4140 ctgatttaat ttcacgaggg ttgatgcctg atgctctcga ggagcataat ctctggtgga 4200 aaggatcacc tatcctaaat gaaatcgaac cccgtacaga cgagtatgac gattctatcg 4260 agctgccaga acaacgaaag gtcacaacag tcagtgcaac aaaaaaggta tcaccgatcg 4320 attttaatcg actcagtaac tttcgacgtt tgcaaagagc gtgggtgtac gttctgcgat 4380 tcatcgcaaa catccgcacg aaaacaagaa acatttcaga gccgaccgca gtagaaatgg 4440 acgaagcatt gctcgttatc gtgaaactag ttcaacaaga agcatttttg gatttgttca 4500 agctactatc ttgtggttcg cagaaacgaa acaactatag tggattgtct ccatttgtgg 4560 attcggatgg gctgatcagg gtaggaggtc ggctcaaata ttcatcgatc ccgtacgatg 4620 gaaaacacca gctcttgtta cctgataagc atcaagtcac tctagcttta atacggaagc 4680 tacacgaaga acatttgcac gttgggcaac gcggtttgtt gtctattgta cgtgagaggt 4740 tctggccaat caacgcaaaa tcgttgatca agaaaacgat tgctacctgt tacgtttgct 4800 gcagaaacaa tccacgacca tcgagtcagt atatgggaaa tctaccagac tatcgcatca 4860 caccgtcgcc agtgtttgcg aacacaggag tggactacgc cggtccaatc gtattgaagg 4920 aagctggaag aaaaacggtt ccgtacaagg catatattgc ggtgttcgtt tgtctggcga 4980 ctaaggccat ccatcttgag gtggtttcta atttaaccac tgacaacttt atcgcggcct 5040 tacagcgttt cgtaagtaga cgtggtatga taagcaacat ttactcggac aatggaacca 5100 cgtttgttgg agcaaaccat gaattggctg cactgcgaac attatttgaa gatcagatcc 5160 atcaaagaaa gctaaacgat ttttgcgtaa ccaaaggtat acagtggcac tttattcccc 5220 cccggagtcc ccatttcggg ggcatctggg aagcaggtgt caaatcagca aaataccact 5280 tgaaacgtgt ggtcggcgag actaagttga cgtacgaaga aatggctaca ttcctggctc 5340 aaactgaagc tattttgaat agtcggccgc tcatcccggt atcagatgac ccaaacgacg 5400 ttgaagtttt aacaccttcc cattttttaa ttggacgatc cgctgtgtgt cttccggaac 5460 catcctacga ccaagagaaa atcggccggc tcagccgctg gcaacacgtc aacctgatga 5520 aggagcattt ttggaaacgt tggtcagccg aatatttgca tcatcttcaa tcacgtccaa 5580 agtggcacaa cggtatcacc aaatttgaag taggatcgct ggttgtgctc aaggatgaca 5640 acaccccgcc ccaccaatgg cgtattggtc gcatacagac cactcatcca gggaaggatg 5700 ggattgtgcg agtggttact ataaaaacat cgacaggaga gtatcgaaga gcagttacca 5760 aggtgtgtgt acttccattg attgatccag aggaatcaac ggcgggagaa 5810 // ID CR1-31_BF repbase; DNA; INV; 3575 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-31_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-31_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3575 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3575 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1602-1602 (2009). XX DR [2] (Consensus) XX SQ Sequence 3575 BP; 1110 A; 855 C; 788 G; 822 T; 0 other; ggtgggacga gccgctagag gcgcaagcca cttcatcggc taaatcaacc gagcaaaatg 60 cccagaaaac taagagagga aaaaacaaga aactgtacct tatgtcaaga cagagactag 120 ttcttttata cttgactatg ctcctcatca gtaaatcgta ttcaccagag cctaaccctg 180 gtccactact tgatcagtgc ccaaaccaca catgtacaaa tgatagctct tcttcccagt 240 cccactcatc ctgggcatgt ggtacgtgtg acctccgtgt ttcgtggaac gaccgaggca 300 tagagtgtga acagtgtggt caatggtttc acggacagtg tcagagtgtg gatacacaat 360 gttacgagca gttagcggat tcaaatatcc attggtattg tgcaatatgt ggcagcccaa 420 acagcaaaac ggtcttcgat cttcatggag tagactggac tgactcatcg ctgcacgact 480 ccactctaga ctcctgtaca actcccactg atctacattt caatcctcaa cacgtatcca 540 ctccatccaa aagcagtcaa caagataagt ggaaaaatag gcccttacgt gtattgaaca 600 taaactttca atctgctagt gccaaaagag cggagcttcc ctatcttttg gaaagcctaa 660 aaccagatat agtgttagga actgagacat ggctagattc aacagtggcc actgcagaaa 720 tctttccaga tttatacaga gtttacagga gagatagaga aggacgaggt gggggagtct 780 tagttgccgt gagaaacaac atagacagtt atacagcccc cgaattagag gtagatgagt 840 gtgagctgac atgggtgcgt atcaaattga aaggcagaaa gacactatac gtagctgcct 900 tctacagacc cgatgttagt gacgaggaca gtttgatacg cttgagaaca tctctacaga 960 gaacatctca actacagaat gcacttctcc tcataggagg tgacttcaat ctcccgggat 1020 gggactggtc cacaaacacc ttaaaaccaa agtcaccata cccccgtcta catcaagatt 1080 ttctcgacat gttgtacgac aacggattcg aacaattagt caaccagcca actagggatt 1140 gtaacacact tgacttgttt ctaaccaact gtccagacct gattccacgt gtggaaatta 1200 tcccaggtct ctcggaccac aacattccat attgtgagat taacaccagc tttagaggaa 1260 agtgtcagat acagcgacaa atccccctgt atgcaagggc tgactgggac agtttaagag 1320 cagtggccga agatcttagc acggaccttc aggtcaagaa gtccagtcat tcaactgacg 1380 agctatggtc tactttcaag gacacgcttt tggctgcaat caagaagttc atacctcaca 1440 agactgctag gcccaaaaac aacctcccct ggattactcc aagtattcgg cgtctgatca 1500 acaagagaga caggaaatac agacgtatga agaagaccgg ctcagcaaaa ttaagagagg 1560 agtacaagtc cctacgacgt accatccagc gacaaattcg cagaagctac tggagttact 1620 taaactccat cttcacggaa gactcaaaca cctgtcaagt gaacaacaaa aagttttggt 1680 cttacatcaa gaaccagcgc tcaagtaaca ctggggtggc accactgaag aaaaacggac 1740 gtcttacgtc taacccacag gagcaagctg ttatactaaa tgaccacttt cagtcggtgt 1800 ttggagatgg acgccagtac tcagaagaag aatttgcatc gaagactggt atgacgaaca 1860 ttcagacatc tgaaatggac gacattacca ttacttgtga aggggtcaag aagcttctga 1920 agaaacttga cccacacaaa gcgggggggc ccgatggtat taactccaga gttttacggg 1980 aacttgcaga agaactagcg ccagccttga caaccatttt ccagtcttca ctgtcgtcat 2040 gtgttgttcc agacgattgg aagtgcgcgt acgttactcc gcttttcaaa aaaggggagc 2100 aatacaaccc agccaattat aggccaatat ccttgacatg catttcatgc aaactcatgg 2160 agcatatagt ggtcggtgct gtcatgcaac atctcgaatc caactccatt cttactgaga 2220 accaacatgg attcagaaag ggtagatcat gtgagaccca actattagaa cttaccgaag 2280 aagtgattaa caacttagag ggagggaaac aaactgactt aatagtaatg gatttcgcta 2340 aggcgtttga tcgtgtgaat cacagcttat taactcacaa gcttcgctgt tatggtatcc 2400 aaggacccac gcttgcctgg ataaccagtt tccttcaaaa ccgccgccaa gctgtagtgg 2460 tatccggtca ttactctccg tttgtcagtg tcagatcagg agtcccccag ggctcggtct 2520 tgggtccttg cttattccta gtgtacatta atgatctacc agacaagctg tcatcaatgt 2580 cacgtctgtt cgcagacgac actgcagtct acagaatcat taccagcagt caagaacaag 2640 accagcttca acttgacctt cacaagcttg aacagtggga gaacagttgg gacatggagt 2700 ttcatcctgc caaatgtgtt catctcccaa tcactaggag tcggaagcca tttcaacgct 2760 cctacgtgtt acatggtcac accctggaga ctgtttcaac agtgaagtac ctgggaacaa 2820 caatgagtga gaatgctacc tgggataccc atataaacac catggtcaca aaagctaaca 2880 aaactctggg cttcctcaga agaaacctaa agatcagctc cgtcagaatt aaagagaaag 2940 cctataaagc ctttgtgaga cctgtgcttg aatacgcatc gtcggtatgg gacccacaca 3000 ctaagaagaa catcgacaag atagaggccg ttcagagaag agcagcaaga tttgtactta 3060 acaagttcca caatacatct agtgtgagca gtatgttgag tacattaggt tggcagtcac 3120 tgaaacaacg caggaaaaca gcacgtttgg gtacattatt caaaatccat cacgggatag 3180 tccagtgccc agtcatcagc agtaaactag taccgccccc aacacgacaa cgccgcaagc 3240 acaactgcca gttcaagcaa atcaccacca gaacattata cagagatggg tcattcttgc 3300 caagaacaat caaggactgg aatggtttac cagcagagac agtcgaggcc gccacagtag 3360 acacgtttgt gtctcgggcc tctgcttcat aacaacttac cttgacaacg gacggattca 3420 agggtatcga ccatggactg tttaatggac taacgagtaa tgcgtctaca agtgcgagta 3480 ttgcgtctgc aagtgcgagc gtcaccaccc ccaaacatgc cagtataatc ttcgccaaga 3540 ttgtgggcat taacgggaag aagaagaaga agaag 3575 // ID Crack-10_CQ repbase; DNA; INV; 1468 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-10_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1468 RA Kojima K.K. and Jurka J.; RT "Crack non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 41-41 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >92% CC identity. XX FH Key Location/Qualifiers FT CDS 2..1153 FT /product="Crack-10_CQ_1p" FT /note="reverse transcriptase." FT /translation="LDNRDDVLGLFLDLSKAFDTVDRKILIDKLRYAGVRG FT IALDLFTSYLSNRAQFVCIDGIHSLLTLVNVGVPQGSVLGPLFFIIYLNDF FT SLLPLKGDLRLFADDSSLFYNNKSTNMNDLNLRDDLTIVIEYFRLNKLTLN FT INKTNIINIKNSSRSIPNSLSLTTSKFPDLEVVSDTKYLGIILDNRLNWSA FT HINALILKLHKITGIIFKIKHKLPQKVLFLIYHSLFHSILSYVTAVWGNAC FT GLLINKLQVAQNRILKIILNLPIRSHTVDLYVKNNILPVRGIYVFQVCCFI FT YLCLNNKTHSNTKFVRSNHQYGTRYHDLLNRPNVTTVPGERSINFNGAQLY FT NYFNNRFGKSSSLNIFKNQLKQCLSRPDIIEKLLKSVNIFA" XX SQ Sequence 1468 BP; 459 A; 260 C; 206 G; 541 T; 2 other; cttagataat cgcgatgacg ttcttggttt atttttagat ttatcgaaag catttgatac 60 agttgatcga aaaatattaa tagataaact tagatatgct ggcgttcgtg gtattgcttt 120 ggatttattt actagttatc tttctaatcg tgctcagttt gtttgtattg atggaataca 180 tagcttgcta actttagtaa acgtgggagt acctcaaggc tctgtgcttg gtccattgtt 240 ttttattata tacttaaatg atttttcgtt acttcctctt aaaggggact tgaggctctt 300 tgcagatgat tcttctctct tttataacaa caagtctact aacatgaatg atcttaattt 360 gcgagacgat ctaacaatag taatcgaata ctttagattg aataaattaa cattgaatat 420 taacaaaacg aacattataa atataaaaaa ttcttcacgt tcaataccca atagtttatc 480 tctcaccaca agtaaatttc cagatttaga agtagtctca gacactaaat atttaggtat 540 cattctagat aatcgtctta attggtcagc acatatcaac gctcttatac taaaacttca 600 taaaatcact ggtatcatct ttaaaataaa gcacaagctt cctcaaaaag ttctttttct 660 catctatcat tctctatttc attctatact ttcctatgta acagctgttt ggggtaatgc 720 atgtggtctt cttatcaata aacttcaagt agctcaaaat agaatcctaa aaatcatctt 780 aaatcttcct attcgtagtc acacagtcga tttgtatgta aagaataata ttttacctgt 840 caggggaata tatgtgtttc aagtttgttg tttcatttac ctttgcttaa ataataaaac 900 ccatagtaat actaaatttg ttcgttctaa ccatcaatat ggtacaagat atcacgatct 960 gttgaatcgt cctaacgtaa ccacagtacc tggtgagcgc agtataaatt tcaatggagc 1020 tcagctttat aattatttta acaatagatt tggtaaaagt tcatccttaa atatcttcaa 1080 aaatcaacta aagcagtgtt tgtcccgacc tgacatcata gaaaaacttc taaagtccgt 1140 taatattttc gcttaattat tttgtttagt taattctccg attcawtccc tcgtttgtat 1200 ttgtatccat cttgaacaca tttatcttgg tagttttctt ttatttattt tcagccgcta 1260 atcgccagaa accgaactcc ttaaaaggtc tacgccgatg gagttctgga gccagccgca 1320 tcaactttgt cgttacattt wtatttaatt ttcaatgaca tgtaatctac ttaatttgta 1380 ttgcacattg aaaagaaaca gttttttcag tcagcacttt gctggctttt tctgtcaaat 1440 aaaaacaatc aatcaatcaa tcaatcaa 1468 // ID Gypsy-2_DPer-LTR repbase; DNA; INV; 110 BP. XX AC super_2; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DPer_; KW Gypsy-2_DPer-I; Gypsy-2_DPer-LTR. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-110 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_2; Positions 5380636 5380527. XX SQ Sequence 110 BP; 42 A; 19 C; 19 G; 30 T; 0 other; tgttaggtat tatattaata tccatatacg gtcacaccat ccaagggata tagaataaaa 60 gagaatcttg aagacaacgc gtgcttaaat ctgagtgtca atacactaca 110 // ID Gypsy-12_CQ-I repbase; DNA; INV; 5013 BP. XX AC AAWU01009192; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_CQ_; KW Gypsy-12_CQ-LTR; Gypsy-12_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5013 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 403-403 (2011). XX DR GenBank; AAWU01009192; Positions 63038 68050. XX CC Positions [3740-4210] - Integrase core CC 'CAATAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 656..4885 FT /product="Gypsy-12_CQ-I_1p" FT /translation="MSDSNSKMPFSIEPFIPGAVSFLQFLEQLEWVFEHHK FT VTSDEDKRTSFMATCNREVYSEIKRLFPGKDLKKLTFKEITEALQKRYDKS FT VAGFIQRFNFYNRVQGSNESAEDFILDVKQQAELCDFGDFKNTAIRDKLIC FT GMADSVLQESLFDEEDLSLSRVEKLILNREANNARKKIIAGDRRASVLNRL FT GRRDGAPAYKGRSRSRGRDLRGRSRSRSGSFDRRKRDSKKQFFCTFCRRNG FT HTRPYCYDLPKNKKSVKFVDEQPSVQQSSDQPKSSRNKFDRSVTDDECDMQ FT CLSISSVNKVNEPCIRKVFVASLLISMEVDCGAAVSVVCWSTYNKKFSHIP FT LEKCNKKLAVINGSSLKVEGQLSVVVELNDIRQKVSLIVLRSFNEFIPLLG FT RDWLDVFFPGWRNTFGETSRVNQVANTADQVLAEIESKFSNVFDKSLATPI FT KGFEADLVLRDDTPIFKRAYDVPLRLRDQVYQHLESLEKDGVITPVDASEW FT ASPVIIVVKKDGGIRMVIDCKVSINKVIIPNTYPLPLPQDLFASLAGAKLF FT CSLDLTGAYTQLLLSKKSRKLMVINTIKGLFLYNRLPQGASSSAAIFQKIM FT EQILKGIVGVYCYLDDVLIAGKDFEDCKSKLYLVLDRLSKANVKVNFKKCK FT WFVSSLLFLGHVLTNDGLLPCPDKVETIRRAKIPSNVSELNAFLGLVNYYG FT KFIPHLSSRLSCLYRLLKKDVKFVWTGDCSRVFEDCKQALLTSKMLELFDP FT EKPVVVVTDACGYGLGGVIAHEINGEERPISFTSFSLTEAQKSYPILHLEA FT LAVVCTIKKFHKFLFGKEFTVYTDHKPLVGIFGKEGRNSLFVTRLQRYVLE FT LNIYKFEIVYRPSSKMGNADFCSRFPLPEEVPLSLQNEFIKTLNVYNEFPL FT DHILIAKETKEDKLLQKILNYMKHGWPQNLDRCLANVYAQHQDLEFVDGCL FT LYQDRVVIPFSLQKQILKLLHRNHSGITKIKQLARRTVYWFGMNGDIESFV FT KSCHVCCQMNVVPKQVPHSPWIPTTKPFARIHADFFHFDKKVFLVIVDSFS FT KWIEVEYMKFGTDARIVKSKFICFFARFGLPDVVVTDGGPPFNSKDLIDFF FT EKNKVVVMKSPPYNPSSNGQAERMVRLAKDGLKKLLLDPEMNKLTTEDLVS FT CFLFSYRNTCLEDGSSFPSERLFNYKPKTLLDLIHPKNSFSKHLTPRDVEK FT DVVVVGDRHVKNRKFDWVDRLSPGDSVYFKNFKPTEIRRWLLSSFIRRVSP FT NTFQVSLGGRTYLAHRNQLKEAPKEDYQRKVVVTARSERRGTKRGRERYGD FT AAYEDGEDSFYGFAADSFVFAEDSNDGVDVDMEVDMEREGVSRLSSVDRES FT QIRDDPVEGSSTQRLPDFSVPSIDNRAVVRRSTRSKRYKKDKDFVYY" XX SQ Sequence 5013 BP; 1300 A; 827 C; 1261 G; 1625 T; 0 other; gtggcgacga ggaaaagtag aagtttttgg gttttttttt cgcacaagtt ggtgcagcgt 60 gacgttttgt tgctggagta aggacgtgcc gttccggatt gcagtttttt tcaacgtttt 120 tatcggagtt tcattcccgg cggaacgagc atcagcatcg gacggaggac gcgcatttga 180 cggacttcgc agtttagagt ttttttctgg tggagttttt tcccgcttgg ttttgccaag 240 gaggtgtcgt ttacgggtga agttggtttt tccagaaaac tgcgtaagta gagtgcaaat 300 ttgctgcgtc taggaggtcg cattgttgag aattgtgcgt gtgttcattc gccattttct 360 ttgcctagca agcgaggcat ttggcgtcag tattcttttg tgttgattga tttcacacat 420 actaccacac aacacttaag atacacacac ttttagtggt gatacatttt tgtgtcgcac 480 aggttgttgc atcattgtag catcgttgtt gcatcattat tcaatttggt gtgcaattga 540 ttgctgacag ttttgttgcg tttgatgtta gcaacgtttt agtttgacga gtttttattt 600 tgagaacgat ttcttatcat tgtttgttta cgttttgttt tatttatttt ttattatgtc 660 agattcgaac tcaaaaatgc cgtttagtat cgagccattc atccccggcg ccgtttcgtt 720 tttgcaattc ctagagcaat tggaatgggt tttcgagcat cataaagtga cgagtgatga 780 agacaaaaga acgtcgttta tggcgacgtg taatcgcgag gtttattcgg agattaagcg 840 tttgttcccc ggtaaagatc ttaagaagct cacgtttaag gaaattacgg aagcgttgca 900 aaagcgttac gacaagtcgg ttgctggttt tattcagcga ttcaacttct acaatcgtgt 960 tcaagggtca aacgagtcgg cagaggattt tatcttggac gtgaagcagc aagcggagtt 1020 gtgtgatttc ggcgacttta agaacacagc gatccgcgac aagctgattt gcgggatggc 1080 cgattccgtt ttgcaagaat cgttgtttga tgaagaagat ctgtccttgt cccgagttga 1140 aaagttgatc ctgaatcgtg aagccaacaa cgcgaggaag aagataatag ctggtgacag 1200 acgtgccagt gtgttgaatc gtcttggcag acgtgacgga gcacctgctt acaagggacg 1260 gtctcgtagc agagggcgcg atctgcgagg aaggagcagg agtcgttctg gttcgtttga 1320 ccgcaggaag cgtgattcta aaaagcagtt tttctgcacg ttttgtcgtc gtaatggcca 1380 taccagacct tactgctacg atttgcctaa aaacaagaag tcagtgaagt ttgttgacga 1440 gcagccatca gtgcaacagt catcagatca accaaagtcg tcgcgcaaca agtttgatcg 1500 gtcggttact gacgatgaat gtgacatgca atgcttgagc atctcgtcag ttaacaaagt 1560 taatgagccg tgtatcagaa aagtttttgt tgcaagtttg ttgatctcga tggaagttga 1620 ctgtggtgca gctgtttctg ttgtttgctg gagcacttac aacaagaagt ttagtcacat 1680 tcctttagaa aagtgcaaca agaagctcgc ggtgatcaat ggcagcagtt taaaggtcga 1740 ggggcaactc tcagttgtgg ttgagcttaa cgacatacgt cagaaagttt ctctcattgt 1800 tttgagaagc ttcaacgagt ttatcccttt gctaggacga gattggctgg acgttttctt 1860 tccgggttgg cgcaacacgt ttggagagac cagcagggtg aatcaggtgg caaatacggc 1920 ggatcaagtt ttagctgaga ttgagagtaa gttttcaaac gtttttgata aatctttagc 1980 tacgccgatt aaaggttttg aggcagattt agtcctgaga gatgatacac cgattttcaa 2040 acgggcttat gacgttccat taaggcttag agatcaagtt tatcaacatt tggaaagttt 2100 ggaaaaagat ggcgtcataa ctccggttga cgcaagtgag tgggcgtctc cggtgatcat 2160 tgttgtcaag aaggatggtg gtattaggat ggtgattgac tgcaaagttt cgatcaataa 2220 ggtcatcatt cctaacactt atccacttcc gttgcctcag gatttgtttg cttctttagc 2280 tggagctaag ttgttttgct cgctggattt aactggagcg tatacacagt tgcttttgtc 2340 aaagaaatct agaaagttga tggtgattaa cactataaag ggtttgtttt tgtacaatcg 2400 tttgccgcaa ggtgcttctt caagtgcagc aatttttcag aagatcatgg agcaaatttt 2460 gaagggcatt gttggagttt attgttattt agacgatgtg ttgatagcag gcaaagactt 2520 tgaggactgt aagagtaagc tttacttggt tttagaccgt ctttctaagg ccaatgtaaa 2580 agtcaatttt aagaaatgca aatggtttgt ttcaagtttg ctgtttctgg gacatgtgtt 2640 gacaaatgat ggtttgttgc cgtgtccaga taaggttgag acgattcgga gagcaaaaat 2700 tccgagtaat gtttcagagc ttaacgcatt tttggggtta gtaaattact acggtaagtt 2760 tattccccat ttgtcttctc gcctcagttg tttgtatcgt ttattaaaga aagacgttaa 2820 gtttgtttgg actggtgatt gtagtcgtgt gtttgaagat tgtaaacaag ccttgctcac 2880 ttcaaagatg ttggagttgt ttgatcctga aaagccagtt gttgttgtta ctgacgcttg 2940 tggttacggg ttaggtggag taatagcgca tgaaatcaat ggtgaggaaa gaccaataag 3000 ctttacttcg tttagtttga cagaagccca gaaatcctac ccaattttac acttggaggc 3060 acttgctgtt gtttgcacca taaaaaagtt tcataagttt ttgtttggca aagagtttac 3120 tgtgtatacg gatcataaac ctttagttgg tattttcggc aaggaaggca ggaacagttt 3180 gtttgtgacg cgtttgcaga gatatgtttt ggaactgaat atctataagt ttgaaatagt 3240 ttacagaccg tcatcaaaaa tgggcaacgc agatttctgc tcaagatttc cattgccaga 3300 agaagttcca ctttcacttc agaacgagtt tattaagaca ctgaatgttt acaacgagtt 3360 tccattggac catattttga ttgcaaaaga aacaaaagag gacaagttgt tgcaaaagat 3420 cttgaactac atgaagcatg gttggccaca aaatttggat cgttgtttgg caaatgttta 3480 tgctcagcat caggatctgg agtttgttga tggttgtttg ttgtatcaag atcgtgttgt 3540 tattccgttt agtttgcaga aacagattct aaagttgttg catcggaatc attctggtat 3600 tacaaagatt aaacagttgg cacgccggac agtttattgg ttcggtatga acggtgatat 3660 tgaaagtttt gttaaatctt gccatgtttg ttgccagatg aatgttgttc cgaaacaagt 3720 tcctcattct ccttggattc caaccacaaa gccgtttgct agaattcatg ctgatttctt 3780 tcattttgat aagaaagttt ttctagttat tgttgacagt ttttcaaaat ggattgaagt 3840 tgagtacatg aagtttggta cagatgctag aatagtaaag tcaaagttta tctgtttctt 3900 tgctaggttt ggattgccag atgttgtcgt gacagatggt ggaccaccgt ttaattcgaa 3960 ggatcttatt gatttcttcg aaaagaacaa agttgttgtc atgaagtctc caccgtacaa 4020 tccatccagc aatggacagg cagagcgcat ggttagacta gcaaaagatg gtttgaaaaa 4080 gttgttgttg gatccagaga tgaacaagtt gactaccgag gatttggttt cgtgttttct 4140 gtttagctat cggaatactt gtttggaaga tggttcttcg tttccttccg agaggttgtt 4200 taattataag ccaaagacgc tgttggactt gatccatcca aagaatagtt ttagtaaaca 4260 cttgacgcca cgtgacgtgg agaaagatgt tgttgttgtt ggtgatcgtc atgtgaagaa 4320 tcgaaagttt gattgggttg accgtttaag tcctggtgat tctgtttact tcaaaaattt 4380 taaacctaca gagatcaggc gttggttact atcaagtttt ataagacgtg tttctcccaa 4440 tactttccag gtttccctgg gtggtcggac gtatctcgct catcgcaatc agctgaaaga 4500 agcgccgaag gaggactatc agcggaaggt cgtggtcacg gcacggagcg agcgtcgagg 4560 tactaaaagg gggagagagc ggtacggtga tgctgcttac gaggatggag aggacagttt 4620 ttacgggttt gctgcggatt cgttcgtctt cgctgaagat tcgaatgatg gtgttgatgt 4680 cgacatggag gttgacatgg aacgagaagg agtttcaagg ctgtcgagcg ttgatcgtga 4740 atcgcagatt cgcgacgatc cagtcgaagg aagttccaca caaaggctcc cggatttttc 4800 tgtcccatcg atcgataatc gtgcagtagt gcgtcgttcg acgcgatcga aacgttacaa 4860 gaaagataaa gatttcgttt attattaggt tttttttcct ttgttgaatt gtttagtttg 4920 tttaaaggtt tttcctaata gctaagttgt ttaacattct gaattagtta aataatgttg 4980 attatattta gttgtttact taagggtgag aac 5013 // ID BEL-604_AA-LTR repbase; DNA; INV; 615 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-604_AA_; KW Pao_Bel_Ele53; BEL-604_AA-I; BEL-604_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-615 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 615 BP; 202 A; 97 C; 126 G; 190 T; 0 other; tgaacggtac ggcggtgaat cccccgacag attgttaaca atacgaatgc aaagagatag 60 acgtcaatcg tgatgacaac catacctaag gcgagatgtt atagacgaaa aaacgtgtac 120 accatatgtg atgcagatgt gctatatgat tgcgaatttt tatattgaat tgtgtcgaga 180 aattactgaa tttaaaggct gataggatta gattaaagtt ggataaacca tatgaactgt 240 aaagtgagtg aaacctgtca gaataagagt tctttgcgat attccctaaa tttatattct 300 ttaggttagg cattactgta acccattgtt ggctgatcat aattccacgg ttattttgtc 360 gcttaaggcg taagtttcta taattatatg atttattcct gatctaatgt gctattatgt 420 gtagtaggac tttcgggcag aaagcgttgt gaggtcgtac gtgggattgt cagtaaggag 480 aaaaattaat gtaagatacc ttgaaacgta accttgaact atccagaatt ctaaaatata 540 catatatttt tagctttgag cgaaacacca aaaatcgctt cacagaagtt tttctaccca 600 ctcaaagtcc gaaca 615 // ID Copia-101_AA-I repbase; DNA; INV; 4342 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-101_AA_; KW Copia-101_AA-LTR; Ty1_copia_Ele55; Copia-101_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4342 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1661-2188] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 317..4105 FT /product="Copia-101_AA-I_1p" FT /translation="MSEKFLFARLNNQNFPVWKTRMEMYLKREELWKVIRD FT AAPEPVSDDWTKSDEKALAMIVLGVEDSQLNLVRGVATAVEAWNKLKDFHE FT KTSMTSRVSLLRRICSLNMCEGDNIEKHLYELEELFDKLACAGQEMEMPLK FT IAMIYRSLPESYGSLVTALEGRPDADQTMELVKQKLIDEHQRRQERSGDAG FT ENAMKMYSRKKKSDKVCFQCGKPGHFRRNCPQPKSGPSGSDVKRDSKPKSG FT SAKAKQAEESDSAICFVAGNKCSGPWIVDSGCSSHMTNDKSFFDKLDTSVK FT VKVILADGSETMSGGIGEGTVKCVSGENSLVDIPFKDVLYVPALDSGLISV FT RKLVKRGLKVEFSGSRCEILSAQGKIVAVAEPRGNLFALKVAEEARLGKEA FT RHLQNCQHTWHRRFGHRDPAALDEIQRKELSDDFKIQDCGIRQVCQPCLEG FT KSCRIPFPKTASNRTQRVLDVVHTDVCGPMANVTPGGARYLMTIIDDHSRY FT TVVRLLKRKSGVADCIKKYVAHVKNMFGRAPCVIRSDGGGEYTGHDLKRYY FT EQEGIQAQFTAAYSPQQNGVAERKNRSLQEMATCMLLDAGLDKKYWGEAVA FT TAAYLQNRLPSRAVDTTPYERWFGRKPSLSHMKIFGCPAFVHIPDVKRSKL FT DSKAKKLLFVGYCSDRKAYRFLNTDTNEITISRDARFVELQEEKTPALGGL FT QSAGEEFVEVETTSSRPKKRVDEENSEQPRDQEESEEDTFFDWDDEQEQSD FT QPEATRKEQKRSTRGVLPKRLEDYVVNAAMLAQEEPATYEEAVAGPEQGLW FT QEAMAKEYQSLMENRTWKLVELPAGRTPISCKWVYKKKHDSVGNVSRFKAR FT LVARGFSQKFGVDYDAVFAPVATQTTLRILLTVAGQKKMHVHHLDVKCAYL FT HGKLKEEVYMQQPKGFVIPGKEKFVCKLERSLYGLKQAARVWNETISVLLK FT ELGFTQSVADPCLYVKRLPDGGVVYLLIYVDDMIVASTVEEEIRLIEEQLR FT KKISLTSLGEVKQFLGVRVTRDDEGCFSLDQSVFIKVLAQRFGLQDAKGSS FT IPLDPGYYRSRVNSQPLDSNERYHSLVGALLYIAVNTRPDVAASVSILSRQ FT ISSPTETDWVELKRVVRYLLKTACYKLKLSAAKLDLVGFCDADWSGDPSDR FT KSNSGYLFQLGKATICWASRKQTSVSLSSMEAEYMALSEACRELVWLRLRN FT WRPQQKPTTVLESSLWRLTVNQEDPNISTLGCTTPRIWSIKVWWLSNIVRR FT RK" XX SQ Sequence 4342 BP; 1168 A; 901 C; 1309 G; 963 T; 1 other; ggttcatggg cccagagtca gtgaattggg aaaagtgttt tccgggacag aaagtgcttc 60 gcggaagcca ttagtgtgcg aaaaagtttt cggcgtgggt acgattggta cagttctgtc 120 ggcggttgca tttggctcga gctgcagagc agcgcgcggt tgtgatagtg tgcgtgtgat 180 ttggagcact cgagcagtgc gcgttttcgg ttgctggtgt gttcgaggct ggaagaaatt 240 tctagcgtaa caggtgaaag aggcctgatt ggaaaagttt tcgttctcgt cgcgtgtgtg 300 tcgatcggag aagaaaatga gtgagaaatt tctgttcgct cgtctgaaca accagaattt 360 ccccgtgtgg aaaacgcgta tggaaatgta tttgaagcgc gaagaactgt ggaaagtgat 420 aagagacgca gcaccagagc cggtcagtga tgactggacg aagagtgacg aaaaggcgct 480 ggcgatgatt gtcctaggtg ttgaggacag ccagttgaat ttagtgcgtg gtgttgcgac 540 ggcagtggaa gcgtggaata aattaaaaga ttttcacgaa aagacgtcga tgacgtcaag 600 agtgtcattg cttcgtcgca tatgcagctt gaatatgtgc gaaggggaca atatcgagaa 660 gcacttgtat gagttggaag agcttttcga taagttggct tgcgccggtc aggaaatgga 720 gatgccgctt aaaatcgcga tgatctatcg cagcttgcca gagtcttacg gcagtttggt 780 taccgctctg gaaggtagac cggatgccga tcagacaatg gagctggtca agcaaaagct 840 gattgacgag catcagcgcc ggcaagaacg ttcgggcgat gccggtgaaa acgcgatgaa 900 gatgtatagc aggaagaaga agagcgataa agtgtgtttt cagtgcggca aaccgggaca 960 tttccgccga aattgtccgc agccgaaaag cggcccgagc ggcagtgatg tgaagcgcga 1020 ttcaaaaccg aaatccggta gtgcgaaagc gaagcaagcg gaagaaagtg attcggcgat 1080 ctgttttgtg gctggaaata aatgcagtgg accttggatt gtcgacagtg gatgttccag 1140 ccacatgacg aacgataagt cgttctttga caaactggac acgagtgtaa aggtgaaagt 1200 gatcctggcc gatgggtcag agacaatgtc cggwggaatc ggagaaggca cagtgaaatg 1260 tgtgagcggt gaaaacagtt tggtggacat tccgttcaaa gacgtgttat acgtgcctgc 1320 actggacagt ggactcattt cggtgagaaa actcgtgaag agaggactga aagtggaatt 1380 cagtggttcg agatgtgaga tcttgtccgc gcaaggtaaa atagtggctg tggcggaacc 1440 ccgcggcaat ctattcgcgt tgaaggttgc agaagaagct cggctgggca aggaggctcg 1500 gcacctgcag aattgccagc atacctggca ccgccggttt ggtcaccggg atccggcggc 1560 attggatgag atccagagga aggagctttc tgacgacttc aagattcagg attgcgggat 1620 ccggcaggtt tgtcaacctt gccttgaagg taaatcatgt agaattccat ttcctaaaac 1680 cgcgagcaat cggacccaac gtgttctcga cgtcgttcat actgatgtat gtggtccgat 1740 ggcaaacgta acgccaggag gtgcccgcta tctaatgaca atcattgatg atcatagtag 1800 atacaccgtc gtgcgtctac tgaagcgtaa gagcggtgta gctgattgca ttaagaaata 1860 tgtcgcgcac gtcaagaaca tgtttggtcg ggctccatgt gtaataagat cagacggcgg 1920 gggtgaatat accggtcatg acttgaagcg gtattatgag caggaaggca ttcaggcgca 1980 gttcaccgcg gcctactccc ctcagcagaa cggagtggcc gagcggaaga atcgctcact 2040 ccaagaaatg gccacatgta tgcttcttga cgccggcctt gataaaaagt actgggggga 2100 ggcggtggct actgcagcgt atttacagaa taggctacca tcccgcgccg tcgatactac 2160 accttacgaa aggtggttcg gcaggaaacc gtctctttca catatgaaga tattcggttg 2220 ccctgcattt gttcacatcc ccgatgttaa acggtcgaag ctagatagta aagctaagaa 2280 gctgctgttc gtgggttatt gtagtgacag gaaggcctac cggttcttga atacggatac 2340 aaacgagatc actatcagtc gcgatgctcg gtttgtggaa cttcaggagg agaagacacc 2400 tgcgctaggt ggtctgcaat cggctggaga ggaattcgtt gaggtagaga ccactagcag 2460 ccgtccgaag aagcgagtag atgaagaaaa ctcagaacag ccacgggacc aggaggagtc 2520 cgaggaagat actttcttcg actgggatga tgaacaggag caatctgacc aaccggaagc 2580 gacaagaaaa gagcagaagc ggagcacccg aggcgtactg ccaaagcggt tggaagacta 2640 cgtcgtcaac gcggcgatgt tggcacagga ggagccagcg acctacgagg aggcagtggc 2700 cggacccgag caaggactgt ggcaggaagc tatggcgaag gagtatcagt cgctgatgga 2760 gaaccggaca tggaagcttg ttgaactacc cgccggacgg accccaatca gttgcaaatg 2820 ggtctataag aagaagcatg atagtgtcgg caacgtttcc cggttcaaag cacggctcgt 2880 cgcgagagga ttctcacaga aatttggagt agattatgat gccgtctttg ctccagtggc 2940 gactcaaaca accttgagga ttttgctaac ggtcgccggg cagaagaaaa tgcacgtcca 3000 tcacctcgac gtgaagtgcg cctaccttca cggtaagcta aaggaggagg tgtatatgca 3060 acaaccgaaa gggttcgtta ttcctgggaa agagaagttc gtgtgtaagc ttgaacgcag 3120 cctgtacggc ctcaagcagg ccgcacgagt atggaacgag acgataagtg tgctgttgaa 3180 ggagctcggt ttcacacagt cggtggcaga tccatgcctt tacgtgaaga gattgcccga 3240 tggtggtgtc gtatatctcc ttatttatgt cgacgacatg attgtggcaa gcacggttga 3300 ggaggagatc aggttgatcg aagaacagtt gcgaaagaag atatcgttga cgtcgctggg 3360 agaggtgaaa caattcctcg gcgttagagt gactagagac gatgaaggat gtttcagcct 3420 ggatcaaagt gtgtttatca aggttcttgc gcaacggttc ggtttacaag acgcaaaagg 3480 ttcaagcatt ccactcgatc cgggatatta ccgcagccgg gtgaatagtc aaccactgga 3540 ttcgaacgag cgctaccata gtttggtagg agcgcttcta tatatcgcag tgaatactag 3600 gccggatgtt gccgcaagcg tgtccatcct aagcaggcag atcagcagcc ctacggagac 3660 cgattgggtt gaattaaaac gtgtcgtacg atacttgttg aagaccgcct gctacaagtt 3720 gaaactgtct gctgcgaaat tagatctcgt tggattctgc gacgcggatt ggagcggaga 3780 tccatctgac cgcaaatcga attctggcta tttgttccag cttggcaaag caacaatttg 3840 ttgggcaagc cggaaacaaa cgagcgtatc tttgtccagc atggaggccg aatatatggc 3900 cttatcggaa gcgtgccgtg aattagtttg gctgaggctg agaaattgga gaccgcaaca 3960 gaagccaact actgtactgg aatcgagttt gtggagactg accgtaaatc aagaagatcc 4020 aaacatatcg acactagggt gcactacacc aaggatctgg tcgataaagg tgtggtggct 4080 ctccaatatt gttcgacgga ggaaatgact gcggacatcc taacaaaacc gttgggagcg 4140 gtgaaacaac agcgattcgt ggaagcgatg gggctggtca gtaccgatgg ccagactgta 4200 ccggttccgt gacgacgagg aggagtgtca gctataaagt gggggcagcg tttatggtaa 4260 cgccgtcact gcaccgccac gctggcaaca caagagagag agaagcgaaa gtgaacgatc 4320 attcttttca tacccattag tc 4342 // ID Gypsy-10_IS-I repbase; DNA; INV; 2556 BP. XX AC ABJB010104551; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_IS_; KW Gypsy-10_IS-LTR; Gypsy-10_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-2556 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010104551; Positions 14737 12182. XX CC Positions [1457-1834] - Integrase core CC 'GAAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 635..2446 FT /product="Gypsy-10_IS-I_1p" FT /translation="MLTSAPVLALFDPSRPTKLSADASADGLGAALLQLYD FT KSWRPVAYASRSLSDSESRYSQIEKEALGLVFGCESLHNLTYGIHILLETD FT HSPLVALSKKKRLGDVPPRLQRFFLRLMRYDFTLQFLPGRKLVLADALSRA FT GQHSPHHKQGLEDVEVHAVGMLAALVSETTQDRLRTETQRDPELQHVLHCL FT ENGYPLQGNFSSLDPELSVVNGIILKGTKVVIPSSLRRVMLAPVHQGHLGV FT NKCKERGRQLMFWPKMAQDIHTFVAECATCKKFTYQQPREPLMMRDTLPYA FT WYRVGIDLFAYGGYSYLVAFDAYSNYPEVERLQDTRSQTIIHTLSGWFARH FT GIPIEVCSDNGPQFSSSEFKRFSRVYDFHHVTSSPHFPQSNGLAEKGVQIA FT KRILKKCDDAGDDFFLGLLNYRCSPLIGGQSPGELLYGRKLRAQLPDFSSH FT PRKPVCKRQQNKPGTSLKDLKSGDVVRMRGKGGWFEKARVTNCAGPRSYTV FT QTERNRIYRRNRRHLLKTSEAFSDCLDEDAASPQQHSDAHDNPQPSQTSQS FT LQLPQPHCNPDTDHRSTHPSPASGETLTDEAPALRRSSRSRHAPRRLTYDK FT TFEQFP" XX SQ Sequence 2556 BP; 681 A; 615 C; 618 G; 642 T; 0 other; tggtgccgaa acccgggaag ggaagatata caagctgagc ttgctagcgc cacagtgttt 60 tcaaagcttg atgcgcgggc aggctttcac caaatcccgt tgactgacga aacatcgcga 120 gtttgcacat tcggcacccc tcttggacga taccgatttt tgaggttgcc atttgaattg 180 gcttctgcgc cagaggtatt ccagcgtgcg atgtctgaaa tttttgaagg actgaagggg 240 gtacgtgttt acattgatgg cgtgctggtc tggggagcta cgcaacagga gcatgatgat 300 cggctgcgat cagctctgca agctgcacac agcgcaggcc tgacgctaaa ctcagagaaa 360 tgccagtttg gaattccaga agtaattttc cttggggatg tagtgagcaa agaaggtatc 420 gggcctgatc cgagtctttt agaaaatatc gtgaagatgg atgcacccac aaacaagaaa 480 gctctccagc gcatgcttgg cgttataaat tactttggaa aatacttaaa agagtttatc 540 cgaacgtaca gcaaatctta gacaacttct gcgcgaagat gtactttttg agtggacagc 600 ggctcatgac aatgagtgga agcaattaaa actcatgctg accagtgctc cagtcctcgc 660 actttttgac cctagcagac ccacaaaact gtcggctgac gcgtctgcag atggactcgg 720 agcggcctta ttgcagctgt acgacaagtc gtggcgccct gtagcgtatg cctcccgatc 780 actctcggac agcgaatcac gatactctca gattgagaag gaggcgcttg ggctggtatt 840 tggatgtgag agtcttcaca acctcacata tggtatacac atccttctcg agactgatca 900 cagcccgttg gtcgctttat caaaaaaaaa aagactgggt gatgttcccc ctcgcctaca 960 acgtttcttt ttgagactga tgaggtacga cttcacgcta cagtttctac ctggcagaaa 1020 gcttgtgcta gctgacgctt tgtcaagagc tgggcagcat agtcctcatc acaagcaagg 1080 cttggaggat gttgaagtac acgctgtggg aatgcttgcc gcacttgtga gcgagacaac 1140 acaggacaga ctcagaactg aaacacagag ggaccctgag ttacaacacg tccttcactg 1200 tctagagaac ggatacccac tgcaaggcaa cttcagctca ctagaccctg aactgtcggt 1260 ggtcaatgga atcattctta aaggaacgaa ggtggtgatc ccttcgtcac ttcggcgagt 1320 tatgttagct ccggtccacc aagggcactt aggtgtaaac aaatgtaaag aaaggggacg 1380 ccaactcatg ttttggccca aaatggcgca ggacattcac acctttgttg cggaatgcgc 1440 gacatgtaag aagtttacat atcagcaacc ccgcgaacca ctaatgatgc gcgataccct 1500 accttacgca tggtatcgtg ttggcattga cttgtttgcc tatggaggtt actcttattt 1560 agttgcgttt gatgcgtatt caaactaccc cgaggttgaa agactccagg acacccgttc 1620 gcaaaccatc atccatacat tgtcaggctg gttcgcaagg cacggcattc caatagaagt 1680 ttgctcagac aatgggccac aattttcatc ttccgagttt aaacggtttt ccagggttta 1740 tgacttccat cacgtgacat caagccccca ctttccgcaa tccaatggtc ttgctgaaaa 1800 aggggtacaa attgcaaaga gaattttaaa gaaatgtgat gatgcagggg atgatttttt 1860 tctcggtctc ttgaactatc ggtgttcccc tttgatagga ggacagtctc cgggagagtt 1920 gctgtatgga agaaaactgc gagcacaact tcctgatttc tcatcgcacc cgaggaaacc 1980 tgtgtgcaag aggcaacaaa acaagcctgg aacgtcgctg aaggacctga agagcggtga 2040 cgtcgtccgc atgcgaggaa aaggcggttg gtttgagaaa gctagggtaa caaattgtgc 2100 cgggccacgc tcctacactg tccaaaccga gcggaacagg atctacagga gaaaccgtcg 2160 tcatttgttg aaaacgtcgg aggcattcag tgattgcctg gatgaagatg ccgcttcacc 2220 acagcagcac tccgacgccc acgacaatcc gcagccttca cagacttcgc agtctctaca 2280 gcttccacaa ccacactgta acccagatac agaccatcgt tccacacatc catccccagc 2340 atccggtgaa actttgacag atgaggctcc agcactgaga agatcatcac gaagcagaca 2400 cgcaccacgt cgactaacct acgacaaaac atttgaacag tttccatagt ttcttttgtg 2460 tgtttatttt ttatttaatg ctatccgcat tctttgtatg ttatgttatt ttcgttctta 2520 ttttctttgt atttcttttc tttaaagagc ggaaga 2556 // ID BEL-48_AA-I repbase; DNA; INV; 6791 BP. XX AC supercont1.380; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-48_AA_; KW BEL-48_AA-LTR; BEL-48_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6791 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.380; Positions 372507 379297. XX CC Positions [5010-5591] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 3717..5999 FT /product="BEL-48_AA-I_3p" FT /translation="MQELWSLSCDWDEPVPHTIRLKWENFRNELPKIVTYR FT IDRYAFLPEARIELHTFADASTSAYGACTYARCENALGTVKIRLLASKSKV FT APLKRLTIARLELCACVLAAHLHHRIKDSINVHVSTSYFWTDSAVCLYWLR FT APPSSWKTFVANRVSEVQHYTHGGIWRHISGTDNPADLVSRGMSVEEFIES FT AMWKHGPSWLAHPQQSWPISSPPDVVEDILETKNVVASIRTTPSVNPWFLQ FT WSSYNRLLHVIAYCLRFAAKTRSKARTQPLATPTEIEFHQTSLTVKELANA FT NAVLVRFAQEDVFQQEIKDLKRGNAVSKNSPVRRLSPFIDPDGVLRVGGRL FT NLSQLPYQSKHPALLPKNHPFTWLIGEHYHRKLVHGGGRLLLSMIREEYWP FT LNGRRLVHSIVRKCFRCSRQHPIPAQQQIGQLPASRVSPSRPFSNTGVDYA FT GPLYIKPIHKRAAPAKAYLCIFVCFVTKAVHLELVGDLSTQGFLAALRRFI FT SRRGIPAHLYSDNGKNFEGAKRELAELFARFRSNIEQSVIASACSEQGITW FT HLTPPKAPHFGGLWEAAVKTAKRHLFRQLGSTRLSFEDYYTILHQIEAAMN FT SRPLLPMSDDPNDLAALTPAHFLIGTSLHALPDPDFQHTHMSALDHLQKLQ FT QHVQRFWSHWRTEYLQELMKDTKLAARNDEIQPGRMVIIVDDNLPTTRWPL FT ARITELHPGRDNLTRVVSLRTEKGVITRPITRICLLPLPCLNPDEELPIDT FT ISSSCAIEEK" FT CDS join(667..2349,2353..3447) FT /product="BEL-48_AA-I_1p" FT /translation="MALQLQKATLLSRRATMIAALGRAEAFVNSFDPQRDQ FT GQVALRLEYLNSMWTTLEEVQGQLEDIEDTEEGRLAHANVRADFEPRLFSI FT KADLIAKLPVITQQARIPETPSHTSALSGLKLPTISLPEFDGDYMQWLGFH FT DTFLALIHSNVDVPAIQKFHYLKAALKGEASQLIESIAISSANYDLAWQTL FT VDRYANDYLLKKRHLQALFDIQSAKRETAASLHSLVDEFQRHTKVLGQLGE FT PTGSWSSILEHLLCTKLPDDTLKAWEDHASTADDPNYDCLIEFLQRRMRVL FT ESMLVNHHQSASASNGSAHVSKRQSHFRVSSCASTSASNSKCPACNQEHPL FT IRCAKFYHFSNTERQQLVSSKRLCHNCLKGDHFAKYCPSNYRCKHCSKRHH FT SMLHSGENSRQNSEPSSSRTSMPTQAIQSSVSSVAEPTAQSSVAASEGVPR FT VVVSASAPQPREDVFLLTVLVKVVDAYGQDHFARALLDSASQPNLITERLA FT RRLHLKRSSVNVTIQGAGNLSKKVRESIFARIKSRNDGFECGVEFLLMDTV FT TADLPAQDISVKEWIPENLALADPTFNKSQQIDMVLWAKHFHAFFPSTARL FT QLAENLPILVDSVFGWVVTGSASMNYSVQQKPVTSSVVAVSMLTLEESMER FT FWKTEELKINDGFSIEECRCEDLYQSTTCRDETGRYIVRLPRKPDFDAMLG FT ESKTCALRRFDQLERRLDRDQKLKEEYHDFMKEYLSLGHMRLVETDDGNHS FT HTYYLPHHPVIKEESTTTKVRVVFDGSARTSTGFSLNEALCVGPVVQDDLL FT AIILRFLTYPVALVGDVAKMYRQVLLHPEDCPLQRILFRFSKDMPVQTYEL FT RTVTYGLSPSSFLATRTLQQIAKDEGKAYPLAGPSVPKNFYVDDYVGGANS FT IEEAIQLRGEAIRNALKKAVSF" XX SQ Sequence 6791 BP; 1727 A; 1758 C; 1579 G; 1727 T; 0 other; taatttggtg ccgtgaccag gatcctcgac gccatcatgg aacgtaggaa tcacgtgtga 60 cgatccactg ggcgctgcca tcttgcgcgc gccataccac tggaggaatt taattgtttc 120 aaggctcgaa tcaatcgggc atcaggtaat tacctgtccc aagcgctttg gaggctcaca 180 ggtgcccttt agatattgtt tcatctcttt gcatgatctt cctgtcgacg ggcggagatt 240 ttcgattgca gccacacagt tttggggcac gctcttcgtc cagatacgcc atcgtgagga 300 cattccgtcc ggactattca cgtcgtccga ctaccgccat cacgtttcgg acattgtagt 360 ggaataatag tcgcttccca agccacgcca tcccgccatt gtttcctgat agttcggtgg 420 ctttgcggag gcatccacgt cttttcaacc accttggtga cgccattccg gcgtcgtgat 480 tgcggtgaat tccttcgaag gatttttgga ttttttggaa atacaaggca catacttgcc 540 ttggaactgg tgagtaccat tccaagtggc tttttctttt ctctgcgggc ctaccgtggt 600 cttttcgttt ggcagatccg ttcgtcgcca tcatcaatca ccagtcctat cgcttctgct 660 ttcggaatgg cgcttcagct gcagaaagcg actctactgt cccggagagc cacgatgatc 720 gccgcgctgg gtcgggcaga ggctttcgtg aattccttcg acccacaacg tgaccaaggg 780 caggttgcgc ttcggctgga gtacctcaac agtatgtgga ccacgctgga ggaggtgcag 840 gggcagctgg aggacatcga ggacaccgaa gaaggtaggc tagctcacgc aaatgttcgt 900 gccgattttg agcctcggct gttttcaata aaagccgact tgattgctaa acttcctgtt 960 atcactcaac aagctcgcat ccctgaaact ccctcgcata cttccgctct ttctgggttg 1020 aaacttccga cgatctctct tcctgaattc gatggggatt atatgcagtg gctagggttt 1080 cacgatacgt tcttggcttt gattcattcg aacgtagatg taccagcgat tcagaaattt 1140 cactatttga aagcagcttt aaaaggcgaa gcgtcgcagc tcattgagtc tatcgctatt 1200 agttctgcca attatgatct ggcgtggcaa acgttggtgg acaggtacgc taatgactac 1260 ttattgaaga agcgacatct ccaggcactt ttcgacatcc aatcggcgaa gagagaaaca 1320 gctgcatcat tgcactcgct agtggacgaa ttccagcgtc atacgaaggt tttgggtcag 1380 ctaggagaac cgacaggctc ttggagcagc atcctggagc atctcttgtg cacgaagctt 1440 cctgacgata cactgaaggc ctgggaagac catgcgtcaa ccgccgacga tccgaactac 1500 gattgcctta tcgaatttct tcagcgtcgt atgcgagtgt tggaatctat gttggtgaat 1560 catcatcagt cagcgtctgc atcaaatggg tctgcacatg tctccaaacg acagtcccac 1620 ttccgtgttt cttcgtgcgc ttccacttct gcttctaaca gcaaatgtcc ggcatgcaat 1680 caggagcatc cgctgataag atgtgcgaaa ttctaccact tttcgaacac agaaaggcag 1740 caactggttt cgagcaagcg tctatgccac aattgtctca agggagacca ctttgctaaa 1800 tattgtccgt caaactacag gtgcaagcac tgcagcaagc gtcaccattc catgctccac 1860 tctggcgaga attctcggca aaatagcgaa ccatcttcct ctcgtacatc catgccaaca 1920 caagctattc agtccagcgt ttcctccgtg gcagaaccga ctgctcagtc cagcgttgca 1980 gcatccgaag gtgttcccag ggtcgttgtc agtgcttccg ctccacagcc tcgtgaagat 2040 gttttcctgt taacagtatt ggtgaaggtt gtcgacgcct atggtcaaga tcattttgct 2100 cgtgcattgc tcgatagcgc ttctcagccg aatctcatca ccgaacgctt ggctcgccgg 2160 ttacatttga aaagaagttc tgtaaacgtg actatccagg gggctggaaa tctgtccaag 2220 aaggtgcgtg aatctatttt cgctcgtatt aagtcaagaa atgatggttt cgaatgcggc 2280 gttgagttcc tactaatgga tacagtgact gctgatttac cagcacagga catttccgtc 2340 aaagaatggt gaattcctga gaatttagcg ttggctgatc ccacattcaa caaaagtcaa 2400 caaatcgaca tggttttgtg ggctaagcat ttccatgcat tctttcccag taccgctcgt 2460 ctccagctgg ccgaaaatct tccgattctg gtggatagtg tgttcggatg ggtcgttacg 2520 ggatcagcaa gcatgaacta ttccgttcaa caaaaaccag tcacttctag cgtcgtcgca 2580 gtctccatgc tgacactaga agaaagtatg gaacgatttt ggaagaccga agagcttaaa 2640 atcaatgacg gcttctccat cgaagaatgc cgctgcgaag atttgtatca atctaccacg 2700 tgccgggatg aaaccggccg gtatatcgtc cgactgccac gaaagcccga ttttgatgcg 2760 atgttgggag aatcaaagac ctgtgctctg cgacgtttcg atcaactcga gcggcgactc 2820 gatcgggacc agaaattgaa ggaggaatac cacgatttca tgaaggagta cctctccctc 2880 ggccatatgc gactggtaga gacggatgac ggtaatcact cccacacata ctatctaccc 2940 caccaccccg tgataaagga ggagagtacg actacgaagg tgcgagtcgt gttcgacggc 3000 tcggcccgta cttcaaccgg cttctcgctc aatgaagccc tctgcgtggg tccggtggtc 3060 caagatgacc ttctcgccat tatcctccgc tttctgacct accccgtagc tctagtcggc 3120 gatgtagcga aaatgtaccg acaagtcttg ctacacccag aagattgtcc tctgcaacga 3180 atcctttttc gattttctaa ggacatgccc gtccaaacat acgagctgcg aactgtgacc 3240 tacggtctat caccctcttc gtttctagcg acgcgtacgc ttcagcagat tgcgaaagat 3300 gaaggcaaag catatccgct agctggccct tcggtcccca aaaatttcta tgttgacgac 3360 tacgttggtg gagcaaactc tattgaggaa gctatccagc ttcggggaga agctatccga 3420 aatgctttaa aaaaggcggt ttcgttctga ggaaatgggc ctcgaatcgc ctggaagtgt 3480 tgcagggact tgaaaatgac cagattgcga ctcagtcaag tttgcaattc tgtccggacg 3540 aatccattaa agcgctggga atccgttggg aaccagaaac cgatcaactt cgttttgatt 3600 cccaggttca gccacgagat gatcctccga cgaagcgttc tatactatcg gatatagcta 3660 gactctttga tccactcggt ctgattgctc ctgttgtagt tacagcaaag atcctaatgc 3720 aagaactgtg gtcactctcc tgcgactggg acgagcctgt accgcatacc attcggctga 3780 agtgggaaaa ctttcgaaac gagctaccga agattgtaac ctaccgaatc gaccgatacg 3840 ccttcttgcc ggaggccaga atagaactgc acacctttgc cgatgcctca acgtccgctt 3900 atggagcatg cacatacgcc cgctgtgaaa acgccctggg aaccgtcaag atacgattgc 3960 tagcatcgaa gagcaaagtc gcaccgctta aaagactgac catagcccgt ctcgaactgt 4020 gcgcttgtgt cctggctgca catttgcacc ataggataaa ggactcaatc aacgtccacg 4080 tgtccacctc ctacttctgg acagactcag ctgtctgttt gtactggctt cgcgctcctc 4140 ctagttcctg gaagaccttc gttgccaaca gggtttcgga ggtacagcac tacacccatg 4200 gtggcatttg gcgccatata tccggtactg ataatccagc agatttggta tcacgaggaa 4260 tgtcggtgga agaattcatc gaaagcgcca tgtggaagca tggtccgtca tggttggctc 4320 acccgcagca gtcctggccg atctctagtc cacctgatgt cgttgaggac atcctggaga 4380 cgaaaaatgt agttgcatcc atccgaacaa cacccagcgt caacccatgg ttccttcaat 4440 ggtcatcata caaccggtta ctccacgtta tcgcatactg cttacgcttc gctgccaaaa 4500 ctcgttctaa agccagaact cagccacttg caacccctac agagatcgaa ttccaccaaa 4560 cgtcgcttac cgtcaaagaa ctcgccaatg ccaacgccgt tttagtccgt ttcgcccaag 4620 aagatgtatt ccaacaagag atcaaggacc tgaagcgagg aaatgcggtg tcaaaaaact 4680 cgcctgtgcg tcgtctatcg cctttcatcg acccagatgg agtattgcga gttggaggtc 4740 ggctcaacct atctcagctt ccctaccaat cgaaacaccc tgccctcctc cctaagaacc 4800 atccattcac ctggttgatt ggcgagcatt accatcgaaa acttgttcat ggcggcgggc 4860 gtcttcttct gtccatgatc cgtgaagaat actggcctct caatggtcgc cgattagtcc 4920 acagcattgt ccgcaaatgt ttccgttgct cacgtcagca tccgattccg gcgcaacaac 4980 agattggcca gctaccagct tctcgagtgt caccaagtcg tccgttctcc aatacaggtg 5040 tcgattatgc tggtccgcta tacattaaac ccattcataa acgtgccgcc cccgctaaag 5100 cttacctgtg catattcgta tgctttgtaa caaaagccgt tcacctagaa ctcgttggag 5160 acctatctac ccagggattt ctagcggccc tacgccggtt catatcaaga aggggaattc 5220 cagcccactt gtactcagat aatgggaaaa atttcgaagg tgcgaagcgt gagctggctg 5280 agttgtttgc cagattcagg agcaacattg aacagagcgt aatcgcttct gcctgctccg 5340 agcaaggaat tacatggcac ttgactcccc ccaaggcacc acatttcggt ggcctatggg 5400 aagcggcggt caagactgcc aaacgacatt tatttcgcca gttgggaagt acaagattat 5460 cttttgaaga ttattatact attcttcatc aaatagaagc tgcgatgaac tctcgtcctc 5520 tgctgccaat gtcggatgat ccgaatgatc tcgctgcgtt gacgccagcg cacttcctca 5580 tcggaacctc gctgcatgcc ctgcccgacc cagacttcca gcacacacat atgagcgccc 5640 tcgatcatct ccaaaagctt caacaacacg tccagagatt ttggagccac tggaggacgg 5700 agtatctgca ggagctgatg aaggatacga agcttgctgc acggaacgac gaaatccagc 5760 ccggtcgaat ggtcatcatc gtagacgaca atttgccaac aacccgttgg ccactagccc 5820 gcatcaccga attgcatcct ggaagagaca atttaacacg agtggttagc ctccgcactg 5880 agaaaggtgt catcacaaga ccaattaccc gaatctgttt gctgccgctt ccatgtttga 5940 atcccgacga agaactacca attgatacaa tcagcagttc ctgtgccatt gaggagaagt 6000 aattttgatt attatctgaa aagatattag aaaacacaaa ttagttattg aacactcacc 6060 tgtatctacc tgagacgaga ctcgcgaaga ggaaaaaaag caacgtcata tgcagcccag 6120 atttcgctgt cagatgcagc cagttgattt tgtgagcgaa tatgtggaga gtattggaat 6180 cattcataaa aaaaataatt cagcgaaata ttccatttga tttgcgctga cgaaactaaa 6240 tcgaaacaag caaaacattt gttattgcaa tgtttactgc gtttgcggat atgaaattta 6300 tcttgaaaag gttttctgaa cttggaacga tgcaatcgca attcatcttt gccattgagt 6360 tcatgtaaaa gcttgaacat cttgctcaca aatttgtgta aacacagaat cgaagaagaa 6420 aatcttagca accgtagaaa aagaagaccg acttcgcgag tctcgtctca ggtatctact 6480 acacttagat taggacacag tttgtttttg tttacctttt tttatgttga acacaatcgt 6540 tcaaggcggc gggtatgttg aatcattggc cacactgcgt gagaaatcat cttctcactg 6600 gagtgacggt gctaacgttc gtccaattcg acgagatttt cagtgcctct ggcagctcgg 6660 ccaacacagc tgatcgcgtc gagaggtgaa aaccagccta ctcgcccgtc agtaatcatc 6720 agtctcgttc aactgccgta gcgtaaagat cacctagaaa ggaaaataaa tcgttagtta 6780 aagtgtaaaa g 6791 // ID Gyp3_Cis_I repbase; DNA; INV; 5033 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of Gypsy LTR Retrotransposon from Ciona DE savignyi. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gyp3_Cis_I. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5033 RA Smit A.F.; RT "Gyp3_Cis_I - Gypsy LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000389, Ci000390, Ci000031, Ci000391 ORF from bp 111 to 4988; CC closest to the transposase of Osvaldo in Drosophila. 4% diverged CC copies. XX SQ Sequence 5033 BP; 1544 A; 1160 C; 1186 G; 1128 T; 15 other; aattggcgat cctgccagga cgctgttacc gcaacatcta tactagggct tgcagcagag 60 agagaactcc cccgattcta tatantttgt cagtttttta gnkcttttaa atggctttta 120 attttttaat gtcgccgcct gtttatacgc ctggacaaaa tatcggcgat tttattgaca 180 tattttcttc gtactgcgat ggcataaacg ccaacgaagg tgtccggcgn cacatgtttn 240 taacggctat agatgctgat ttgaaatggc agattcaaga aaacgataat tctatctttg 300 aattatcgct atcaaccata ttacaacgag ccagacaatg caaaaatggt accgtgcaat 360 tagaaaaaca ccgcaataac ctgttttcca gacgacaggg tatggatgaa agaacgaaag 420 atttcatata tgcggtgaaa tcaatgggcg atttggcgta ccccgagccg aatgaagcta 480 aagacgaggt aatgtacctc gtattagtca atggccttca agacaggcag cttgcccgaa 540 ccatacccgc gagtataaat taccggcgag agttttgggc cgcctctgaa gtgatactca 600 gagaaggaga acaagcccca cccgaccaag taatgccgct ggaaatggcc aatgcgtcnc 660 atcgtgatga cgaacggctt agaaacatca tggacaaatt agaaaacatg tgtatcgaaa 720 tggataacat gaagaaacag atggtccgac aggagaggcc atccgaaaat catactcacc 780 agcatcaacg gtcaacacgg aaccgcccag aagttcgacc cccattccga gcaagcggaa 840 gaaatcgtaa gtgtttcgct tgcggcacct ttggccatat acgttcccaa tgtaggcgaa 900 caaacattat ggatggtcag gccgtggtaa ctaatccacc cgttcctcgc ggaaagattt 960 taaatagacg ggagcagaaa attgatatgc ctaaaacctt acatagtagc cgacgaggac 1020 cttcgaccga accgaatcga ttcgcgacaa ttatggttgg tgaaacagta atgtgtgcgc 1080 tggtggatac tggggccgat atcacgttga taaaaaagag ttgcttcgac aaatgcatta 1140 aaggcactgg agaaattaac atcctaccga atagttatcg tatggttgga gccaacggca 1200 attgtttaaa tgtggccggt acgaccgagt tgaagataac cgtaggatcc gtgtccacaa 1260 ccatgaacgt ctacatcgta gaggacgtaa cncacgattt cattttaggc ctcgactttt 1320 tgcagaaaca caaatgcaaa atatgtttta gttcaaacat gttggccatc ggagactctc 1380 gcgtaccaat gttgtcaaag cnattcgtac cccgaagaac tgaggtctct ttgattaacg 1440 actacaatgt tccagcaaac atggaggtat gtgtgactgg gaaaatctgg catgaatatc 1500 aaaacgaatc tattcctgat atgacggtgt ttatagaacg actagaaagt ttcacagaaa 1560 agtacggctt actgactgcg aactgcatca ccaattgcga gggtaatcgt gttaaggtaa 1620 gaattgcaaa cctttccgat gaggctattc agttgtgcag cggcgtgcgc attgcgtcct 1680 tgactcctgt atcgggacaa ggaaatgcag aatgcaacat ccccgaaaca tgtgttctac 1740 ttggcaattc aaaaaccgca gataaatgca catgctgcaa gcggtacgcc gacgtgacct 1800 caccgaagga aattttatcc gtgcaaagca aggaagnaaa ttaccggaga gacacggacg 1860 aacactggca ggccattacc gaaggtatca atatcagctc cgcattaacc cacgaacaga 1920 aaatgcgcgc attatcaatt attaaacatt attcagctgc cttctccacc agcagatacg 1980 acctcggtcg cacggacata gttgaacacc acatcgatac aggaaatgta aagcctgtac 2040 gacaagcacc taggcggatg ggtcaccacg caaagaagaa ggtcgaggat atcgtccaag 2100 aaatgctgga tcagaagctc atccgaccct ccacgtcgcc atggtcaagt tcaatcgtcc 2160 tcgttaaaaa gaaatccggt gagactcggt tctgtgtcga ttatcgccag gtaaacgagg 2220 tcactaaaag agatagttat ccaatcccca ggattgacga aagcttggac gagatgacng 2280 gttccgccta cttttcaacg ttggatttaa aatctggtta ttggcagata ccgatgtcac 2340 cggagtctcg cgaaaaaact gccttcgcta gtcacatggg gctatttgag tttaacgtaa 2400 tgcctatggg gctgtgcaat gcagcaggca gtttccaacg gttgatgcga atcgtcctaa 2460 atggagtgga atggagaggt gttttggcgt accttgacga cataatcgtc tatgctcgaa 2520 ccttcgaaga gcacctagat agacttggcg acgtacttnc acggattgtt ggcgcaggac 2580 taacactaaa gccaacaaag tgttctctgt ttcagcaaca agtcaacttt ttggggcaca 2640 ttgtgtccag tgaaggcatc tcatgcgatc ccaagaaagt tcaatcggta gctgagtggc 2700 ccgttccaca gtcagtgaag caagtgaggc agttcgttgg actcacttct tactaccgga 2760 aattcgttaa agattatgcg gccgtggctg cgcctcttca caagttaacc gagaaaaaca 2820 aacggttcac ntggaacgac gaatgccagg catcattcac aacactgaaa cgacgattga 2880 ccacatcacc agttttgcaa tacccagatt ttaagaaacc attcatttta gatactgatg 2940 cttcagactc tgcggtgggc tgcgtcctcg gacagattgt tgatggaaga gaacacgtcg 3000 tagcatatgg cagtagaact cttagcaaag ctgaaagaca ctactctacg acccgaaagg 3060 agcttttggc tgtaatacac gcaacgaaat tattccgttg ctacttgctt ggttcgaaat 3120 ttctcctgcg gaccgatcac gcgtcgcttc ggtggctctg gaagtcgaga gaaatgtacg 3180 gacagtgtgc gagatgggtg gaatacctct cggaatacaa ctttgagctg cttcaccgac 3240 caggcaagaa tcacgcaaac gcggatgcgc tgagtcgaat ttacgaaggg gaagatccgt 3300 tcccatacaa agtgaatcga gaagaagagg ggcatgagga aaatgctgca atcaacagct 3360 tgcacttgca acaaagcata ggacacacgg cggaggacat gcgccaggcg cagcaaaacg 3420 atcccgatat agaaccagtc tttgagtggc ttcaacgcgg agtaagacca gcttaccgaa 3480 aagtgaaatc atctacgcct caaacccgac attactggag tttattcccc aagctacggc 3540 tgcaccaaga aatcatttat ataaaaacag atacactaac tcaacctgga gtagaagtgg 3600 aacgagtgct ggtaccggaa gctttggtac acgaaactat agcctcacta cacgaccaac 3660 aacatggtgg ggctcattta ggtgtgcaga aaactacctc taaagtggca gaacgatttt 3720 attggcccaa ttggagaaag tccgtcgaag acttctgcag gaaatgttat caatgcattc 3780 ttaacaagaa accaacgacc actattcgtg caccactgaa accatccaac gaaatgaaac 3840 caatgcaaag aatcgaaatc gacgttttgg gacctttacc cctaacacga aatggtcatc 3900 aatacatatt ggtggcctgt gacatcttta ccaaatacgt gagagcttgg tcaatgcgaa 3960 accaatcagc agctgaaact gcgagtttac tctttcaccg ttggtttact gtgcatggng 4020 tgccagacgt gatacactcg gaccgtggtg gaaatttcga aagcaaactt ttcgaagaat 4080 tgctccggct aatggggagc aagaaatccc gcacaacagc ttatcatcca gccgggaacg 4140 gcggagttga acggaacaac agaacaatta tgtcgatgtt aaggaactac gttcaacgtg 4200 acgaaagaag gtgggatgaa gccctacccg cggtaattgc atcttacaat gccagtcgcc 4260 atganagtac cggtatctcc ccacactacc ttttaacggg tcgccagctc cgactaccag 4320 cagatttatt gaccaaggat ccaacagtga aagaaaacac atcccattgg gtgcagttgg 4380 aaggcttgcg caatcgactc ttgttggcag aagaggtcgt cagagaaact ctccgtgaac 4440 aacgagctag aatggagaaa acttataaca caaaacaatt tggtccgcca atcgaggtcg 4500 gtgataccgt tgtattaaca aatcccgtaa ttaaagaggg aaagtgtagg aagtttaacc 4560 taccgtacaa gggcccctac accgtggtgg aaaaacaggg agacttaaat tatatcattc 4620 acgacaagtc tggtaacgcc caactggtcc attataatcg tctaaaactt tgcagggata 4680 tcaacatagg ttccaacgag aaacagaaca acaccccatc agaggacgac cacattacac 4740 ccgtggtgac acaaagatat tcaccggtac ggatcgtacc agcaagcaca gcaccgatga 4800 tgacgatgga agaaccgcca caagcgaacc cattagaaat agatggggcg aatntgttaa 4860 accccgatgt ggcggtagga aatacataca gaggacgtct ccggagcagg acgagacgcc 4920 caacttatta ccccgatgag gagagagcac tgagccccgt tacccccctc gagaacactc 4980 gaaactgaca cgcggaatac ccacaggagt gggtcagaat ttggtcgggg gag 5033 // ID Gypsy-1_CQ-LTR repbase; DNA; INV; 214 BP. XX AC AAWU01023480; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CQ_; KW Gypsy-1_CQ-I; Gypsy-1_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 382-382 (2011). XX DR Genome; AAWU01023480; Positions 7080 7293. XX SQ Sequence 214 BP; 65 A; 47 C; 43 G; 59 T; 0 other; tgctatggca acactgtcca tagcgagacg ccgcaccaac acacttaacg ttgtgcggtt 60 ggcacacctt gcgaggtgac agctttgtct taagcgagca acgaaaacag caagccacgt 120 tttgtaatcg ctcatcttaa aattagtttt agacttaaat aaagtttcta atgttagtcg 180 taattcgcgt acattatttg cgaaagacat aaca 214 // ID Gypsy-18_AA-LTR repbase; DNA; INV; 225 BP. XX AC supercont1.213; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_AA_; KW Gypsy-18_AA-I; Gypsy-18_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.213; Positions 1456527 1456303. XX SQ Sequence 225 BP; 64 A; 44 C; 38 G; 79 T; 0 other; tgttatgtat tcattattat ctctaattga ttttgaaagt tacttttctc tcggtgtttt 60 tgcattatgt cagttgaatt aaaagctaaa caaaaccgcg taccgcgttt gtaaagttgc 120 gataataaaa cgtttttttc taaagtatcc cggttcaata tcgcgttctt cggatccgaa 180 acgttccgga atggttcctt aaccggaatc acccacagaa acaca 225 // ID Gypsy-16_OD-I repbase; DNA; INV; 9220 BP. XX AC CABV01003934; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_OD_; KW Gypsy-16_OD-LTR; Gypsy-16_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-9220 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003934; Positions 19179 9960. XX CC Positions [8120-8593] - Integrase core CC 'CTTTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 250..1314 FT /product="Gypsy-16_OD-I_1p" FT /translation="MSSAETLKFFTEYAGQLEKEIADGDAKNADIKMKIKL FT RNEMMAKIQECVGAFKADKTLSRDQANSLKSAETALTEISYTGRDFDETAT FT FCGRVDQLYDAYVNGDSELEESFCAKVKLRFKSSTYSRVKSAEEELKTWTQ FT LRDWLNKNFDSGLSSIQLLQRSMETHWDSNQGWKRYAQEIEIRMEPARHAI FT YAQIRKSKAAKSGKPEKDKVNEPKADDIFSFISTSIVAGRLKVVKPHLHAL FT LANEWQGITDSSTLATKIEFLLSQTSGGGSVFFAQSNYKKKGGGKGDKKKE FT SGAGKDGKKAMPVCLDFMKGKCDGTWRGKPCRFSHEIPADTKSRTLVAKAE FT ELPEEKFSTVFH" FT CDS 3426..4433 FT /product="Gypsy-16_OD-I_3p" FT /translation="MNDEEFSEALIDEAAPIEHEVADVVTAKVEISGDHIT FT DMFKKTGVLAQAKGGPRQTWSPKFIQLAVELLSGGETAASAFNFFSMQAKY FT YPELLGKGKDVPKINWFERLRDCLPYLNLMHTKDVITKAGKLFLAADGAAM FT NDCSKSMAVGVLQENGKLHLLDIQKSEGGTGEAIASQMMSIIDGTGLARIL FT GSKIECLMTDQEAAQRKANFIVAEQLCREDNDERPKMIGCNMHSVANCCKN FT SREALQRVSPDAFTLLEDIKCVFGKPPKGGFVQQDGRKELQLLLKELDGTK FT TRIFSHDLGKISVVLLEFQVFIKENDLEMTVVTVALLYGTLKPS" FT CDS 6557..9082 FT /product="Gypsy-16_OD-I_2p" FT /translation="MYKRMPFGLTHSGDSFNKMIAKLLSTVKSQGNFVHYI FT DDILCYSKDEKTHLEVLAQIFDAVESHGARLGGQKCNFGKRSTRFMGREIS FT PEGIGIPKDCLDGLQALKPPTNRKELMSILGSLCWWKSWISANIGEKIVEN FT CFSQVIKEMSALNKAHKEFKWTKSAQVAFDNAKKMLGSGKVFSLPDFREPI FT CVICDASAHAVGAALMQKIGGKQKIVAVFSKTLTETQSRWSATEREGYGCL FT LAIEKFSYYLMGRGFLVLTDHKALCALDRKIIANDKLSRWQARLRKYSFTV FT QYIQGAQNNLADMLSRPWAKIREKEKPSEDLAGEFYNPVGDKNLVIYIPSW FT CCGDKFPRKMLLERTDIAAELFVLKSTITDVWAPNMPIVELRIIECAQTED FT QVIGRVKNFIEKGTEPEKWTIPDSVYGVKRYKRFAKFLGIHAESGCLTINW FT GKRTCLVLPKSLVPKYLQSAHGNGHGGVDRTAQLLSWCWWPDMAENLREFV FT ATCEGCLRRKGYDMQKGKPDRQTLFRATRPWQILYIDFINMPKSRTGKAYC FT LTVMDGYSRFLSVYPTARCRAQDAATALMRHILLYDFPSILSSDQGTHFKN FT ELMEELCGLLGIQRNIHVAFRPESTGCLERSHKVLKNALYGMALDKNTCWE FT MVLPAVTNMMNSCKNRATGVSPFECIYGRVPSFKGIQMVENPSADKPATYA FT YEIAETLKRTHKFVDLCQEEADLASKQEGKSLIKPQILNKGDKVLLNRSMS FT AEAKAQKNPWIGPFEVINTNGVIVQLEIDGKVTWVHRYHCMLYKDRPRDLD FT PDFVDDLYDNDDATAASKEGKPALRRSTRARKPVERFTNS" XX SQ Sequence 9220 BP; 2875 A; 1842 C; 2204 G; 2299 T; 0 other; taactggtac ccggaagcta cgaactcact cgaaggatct ggaactattc tcgcgctgga 60 ttgagaaatc cctgtggtga atttcatcac tctacaaaat cgtctctaca ttacgctttt 120 gtgaaagtcg gatcgcaaca aaaccctcac agaggtcact cagcacatcg gtaggaaatc 180 cgatagtaac ctgagtgcgg ataaacggta aatcgagctc aaaagtcaat aactgcctat 240 tttttcgtca tgtcatcggc ggaaacgctc aaattcttca cggaatacgc aggacagctg 300 gaaaaggaaa tcgctgatgg agatgcgaag aatgcggata taaaaatgaa gatcaagctg 360 agaaacgaaa tgatggctaa aattcaggaa tgcgtaggcg cgttcaaagc ggacaaaact 420 ctttcgagag accaagcgaa ttctctaaaa agcgctgaaa cggcactgac ggaaatcagt 480 tacacgggaa gagatttcga cgaaacagca accttctgcg ggcgcgtcga ccaattgtat 540 gacgcatatg taaacggcga ttcagaattg gaggaatcat tctgcgcgaa agtaaaactg 600 cgcttcaaat cgtcaactta cagtcgcgta aaaagtgcag aggaagaact gaaaacatgg 660 acgcagctaa gagattggct gaacaagaat tttgatagcg gactgagcag cattcagctg 720 ttacagagaa gtatggaaac tcactgggac tcgaaccaag gatggaaaag gtacgcgcag 780 gaaatcgaaa tacgaatgga accggctcgt cacgcaatct acgcgcagat acggaaatcg 840 aaagcggcga aaagtggcaa accggagaag gacaaagtta atgaaccaaa agcggacgat 900 atcttttcgt tcattagtac ttcgatagtc gctggtcgcc tgaaagtggt gaaacctcac 960 ctgcatgctc tgctcgccaa tgagtggcaa ggaattactg actcatcaac tctcgctacc 1020 aaaatcgagt ttttgttaag tcaaacgtcc ggcggcggat cagtcttctt cgcgcaaagc 1080 aactacaaga agaaaggcgg gggaaaagga gataaaaaga aggaaagcgg cgctggaaag 1140 gacggcaaga aagcgatgcc agtttgcctc gactttatga aaggtaaatg cgatggaaca 1200 tggcgcggaa aaccatgtcg attttcacat gaaattccgg cggatacaaa atcgagaact 1260 ttggtggcaa aagcggagga gctgccagaa gagaaattct cgacggtttt tcattagaag 1320 ctggtgaaat gctcgcttcg tcagctaaat taaattatgt cgagcaaggc cgcacttatt 1380 tttacgcaaa aataaatatt tctctcgctg gattctcact ttacacaact gctttattcg 1440 acagtgggag cgacaagaat ataatcgcct tgcatcacat accggagtcg ctgaaaaaac 1500 acatcgcacc tacagaaatg acatttaccg gagtaggtaa aataaaagca ctaggaacta 1560 ttgaggtaat acttaaaccg cacacatcga acttcaaatt tcaccatgtc aaattttacg 1620 tggtgcgcga acagctcccc accattattg gaaaggcctt tatcgtcgag cacgagactt 1680 tatcgcctgg aaaattcaat cttaatggta atggactgga attatttctt aaatctggac 1740 gcaaagttat atttccttgg acggacgagt cgcgcattct ttttgctcga caaaatggaa 1800 gtaagtatga aaattattct acgcagcaga aattggaaat tctgcgcaaa gaaaaaggca 1860 tcgaaatttc cgatgaggtt ttcaaaggag aaaactacaa aaaattggtg gaccttatat 1920 ggacaaagcg caatgttttc aaaggagaaa acgaccctct tggagaattc tcggaacacg 1980 cacacatacc aacccttcca ggactcacga aatcggcaag accacgtccg attcccaagc 2040 atctgcaagc acaagtacgc acggagatac aaaagatgct cgatagcgac gtgatcgagg 2100 aatgcccaga tggttaggca ccttgagaac tggaggataa tggcccaata taatagtaaa 2160 atcattttgt attgaccgga catacctgac tgagtccatt ttagaatctt cagacccttc 2220 ttttttttat aaacactaaa taaaaatgta gatcggcgag gtttttatgt ccggtgactt 2280 taaaaattct cattttggga aacaagcggc ccgaagtgaa cggggcgcgc tgattggctg 2340 aggcgtagct cgcttcaaaa gtgctcctgg aagagttctc gtggaagaac aatagcattt 2400 ttttgtgaaa atttatttga agcgtagcga agctaaaatt tttaataaaa ataattaaaa 2460 ttaaattaaa atatgtataa ggtaaacaaa acaaaaaaac taaaacatcg catatgccca 2520 caaatgtctc caaatgtcat caaatgtcta caaatgtctt caaatgtctt caaatgtctc 2580 caaatgtcat ctaatgtctt caaatgtcca caaatgtccg caaatgtctt caaaaatttc 2640 tcaaaaattt ccacagaaac tcacaagtat ctcttttcta gagtttaatt tgctcaaaat 2700 tgcgcaaaaa acctcaaatt tttctagacg tgttttttga gaaattgcgc aaatttagtt 2760 cttttctttt tctagacgtg tttttctttt ccgaaattgc gcaaattttg ttcgtttctt 2820 tttctagata tgttttattt ctctagattt cttcgttatg ccgggaaaat cctcgaattt 2880 tttcattatt taatacgaaa atcaagcgga atgttcgttt tttcaaaaaa actttttgtg 2940 aagctgatta aagccccgct tcgggtcgag aatactttga agaacagtag attctgtgga 3000 ttgactgagc agagaatcag agggatcttt gaagacgaga aagagtggaa gaagttgacg 3060 cagttagccc gaggaaaatt ttttatgggc gttgttccgt atttgacagg tattttggtt 3120 cttttttgct cgtttttaat agttggttac aaggaatagt tcattataca cccgagcatg 3180 tgtttgaaga tttttcaaaa gcgcgtgagc ttgctgaaaa caagatccag atccagagga 3240 aaatcgttca ctggctgcgt agtgagaact tcaaattctg cgaagaatca tcagttgctt 3300 ttgaggagaa agtgacagaa agagcagcta gaactggact ttcgcagacc ttgatagatg 3360 ctcatcttga cgatgaagac actgatttat ccccggagac gctagaggat gagcctgagc 3420 ctgagatgaa tgatgaagag ttttctgaag ctcttatcga cgaagctgca ccgattgagc 3480 acgaggttgc tgacgttgtg accgcgaagg tcgaaataag cggtgatcac atcaccgaca 3540 tgttcaagaa gaccggcgtt cttgcgcaag ccaagggagg accgcgtcaa acctggtctc 3600 caaaatttat tcagctggcg gttgagctgt tgagtggcgg tgagacagct gcgtctgcat 3660 tcaatttctt ctcaatgcag gcgaaatact acccggagct tcttggaaag ggaaaagatg 3720 ttccgaagat caattggttc gagcgtcttc gagattgcct gccctatctt aacttgatgc 3780 atactaaaga tgtgataaca aaggctggca aactgtttct agcagcagac ggggcagcaa 3840 tgaatgattg cagcaaaagc atggccgtcg gtgtgcttca agaaaacggc aagcttcatc 3900 ttctggacat tcagaaatcg gaaggaggca ctggagaagc gatagcgtcg cagatgatga 3960 gcatcatcga cggaactggg ctggcgcgga ttctcggctc aaagattgaa tgcttgatga 4020 ccgaccaaga agctgctcag cgaaaagcca atttcatcgt cgctgagcag ctttgccgtg 4080 aagacaacga tgaacggccg aaaatgattg ggtgcaacat gcattctgtt gccaactgct 4140 gcaagaattc ccgggaagct ctgcagcgtg ttagtccaga tgcgttcact cttttggaag 4200 acatcaaatg cgtttttggc aagccgccca agggcggctt cgttcagcaa gacgggcgaa 4260 aggagctgca gttattgttg aaggagcttg atggaacgaa aacgcgtatt ttcagtcatg 4320 atctaggtaa aatttcagtc gttttattgg aatttcaagt ttttattaag gaaaacgatt 4380 tggaaatgac agtcgtaaca gtggcgctct tgtacggcac tttaaaaccg tcatgaatgt 4440 gacaaaattg aagaaactgc agtttgatca gagaaggaag aaggatgacc aggaatcaaa 4500 attccacaga atagatagac tgtgctctgg ccagcaagac ggagtcattt tggatgccgc 4560 ggctctcttc ttgaactacg ccgggaatct taacgacttc cacaagaaga tcagccgcgt 4620 tggagatgag gccctgactc ttcagaggct gaaggatctc ttcaaaagca cactggagag 4680 aatgcagaga attctctctg atcagaatcc ctacgaagcc ttgatcgacg cagctcgact 4740 catgtccgaa agcgaagaga gccgtcaaac agttgatgag atcgattcac gatttctaag 4800 agcgacagcc aaggcgcgac gcaacaggct gaatgaagtg gcacgagctg tggttgaggg 4860 tagccgaaaa aagtaccaga aagactttga tcttataaaa gatctggagc tgcccgaaga 4920 ccgacgaata acttgctcaa atagagcatt gggtatgtct aaattcttat ttactattcc 4980 taactatatc ttagaatctt cattcagcac cttgaagggc ttttctgaaa aggtgaagtc 5040 gctgggccaa gacaagctat ttgcagttgg ccagagcaag ttcaacaagc tgagtcactg 5100 gcttgaacag cagccactga aggaacgcga aatgatgatc atggaagcca tccgtgacag 5160 aatgaatcaa gaaaaagaaa gaaaacggcg cactctggat gacgatcgac gcagcttggc 5220 tagtattccg aagagattca aataatttgt ttgatctgca tcaaataaaa acttaaattt 5280 aaaaatggat ttctgtctta tttctttcac tttgtcatta gccaatcagg ataaggagaa 5340 aagcgccacc aagaaacagc gccggtagcg gccgataaat attatgaaaa aaaattattg 5400 tgttcatttc ttgaaatttt aataaaaata attcaatttt attatcatat aaataaaaat 5460 catttaataa ataagttttg gagaaaattt tacaaaaaaa atttttttcc gtgctcgaga 5520 ttaaaatttc tctctaaagc caattttttt gaaatttggt caaaaaattt ttttacgtta 5580 gaaatttgaa aaatgagcat ataccgatga aatcaacagt tctggtagtt tttttccaac 5640 tgttctccat atatggagaa ctctttgccc actaggcgga tttttacatt tgaaattggc 5700 gctgttttaa atttactacc agagaaatta tcatgatcgg taagtatcga ttttttatcg 5760 aggatattat ttatttgacc acgttgcgtg atttcggcgg gctgagatta aaacgatgtg 5820 gctgactggt gagaattctc gactcgaagt cgaggggcag ttggttcgaa acctgagagg 5880 aaaaagagct ttccataaaa aggaaaagaa atttttgcag cccttactca ggttttcata 5940 ccaaaggcac atcggccaac ggcagaaaac ggagcgcctt tatgggcttc gagctcgctg 6000 ccccccgact cagagtcgag agcgctcacc acttagccac aagaggggac aataaacgca 6060 aacagcgctc gagcaataaa gcgagtgcgc gggacgggcc aatcagagcg cgccagtgcc 6120 gctcttcaaa tatttttttt cagccctgaa aagggcttat ttattcattt ttggggaaaa 6180 cggggatgag catcgtactc aggagggtca atttggtaaa ttcgggaaaa atcagctatt 6240 ttaagcctaa aaaagcaatt gcaaggaggt tctcaaggtg cctacagatg gtggtggctt 6300 tcactcgcca ttgcacatcg tgccaaaaaa gaacggaaag atccggattt gttcggattt 6360 caaacaaagt ttgaacaaat gcctgtcgga agaaagggac atttggaaca tcccggttat 6420 cgattgtcta ttcagcgatg ttggtgctgg acatcacatt ttctcggatt tggatatttc 6480 tagcgcgtat tggaacgtta agattgcgga ggaggataga tacaaaacta attttctgtt 6540 cgaaaataag ttatacatgt ataaaaggat gccgtttgga ttaacgcaca gcggagacag 6600 cttcaacaag atgatcgcga aactgctctc cacagtaaaa tcgcaaggaa actttgtaca 6660 ctatattgat gatatactct gttacagtaa ggacgaaaaa acacacttgg aggtacttgc 6720 gcaaattttc gacgcggtgg aatctcatgg cgcaagactt ggaggacaaa aatgcaattt 6780 tggaaaacgc tcaacgagat tcatgggacg tgaaatatcg ccggagggca ttggaattcc 6840 taaggactgc ttggatggac tgcaggcgct aaaaccgcct acgaatagaa aggagcttat 6900 gtctattctc ggatctctgt gttggtggaa atcatggatt tccgcaaata tcggcgaaaa 6960 aattgtcgaa aattgttttt cgcaggtaat taaggaaatg tctgcgctga acaaggcgca 7020 caaggaattc aaatggacga aatccgcaca agtcgcattt gacaatgcaa agaaaatgct 7080 tggatctgga aaagtattca gcttaccaga ctttcgcgag cccatttgcg tgatttgtga 7140 tgcgtcagcg catgcggtcg gagccgcgct tatgcaaaaa ataggaggaa aacagaagat 7200 tgtcgcagtc ttcagcaaaa cgctcacgga aacgcaatcg cgatggtcag cgacggaaag 7260 ggaaggatac ggatgcttac tggcgatcga gaaatttagc tactacttaa tgggccgcgg 7320 attcttggtc cttactgacc acaaggcgtt atgtgcactg gatcgcaaaa ttatcgctaa 7380 tgataagctg agcagatggc aagctcggct tcggaaatat tcatttacgg tacagtatat 7440 acaaggcgcg caaaataatt tggccgacat gttgagcagg ccatgggcga aaatacgcga 7500 aaaggagaag ccaagtgagg acttggcagg tgaattttat aatcccgttg gcgacaagaa 7560 cttggtgatt tatatccctt cttggtgctg tggcgacaaa tttccgcgca agatgctttt 7620 agaaagaacg gatatcgcgg cggagctatt tgttctaaag tcgacaatta cggacgtgtg 7680 ggctccaaac atgcccatag tggagctgcg gattattgaa tgcgcacaga cggaagacca 7740 agttattggt cgggttaaaa atttcatcga aaagggcact gagccggaaa aatggacgat 7800 tccggacagc gtctacggag tgaaaaggta taagaggttt gcaaaattcc tcggaattca 7860 cgcagaaagt ggctgcttaa cgatcaactg ggggaaaagg acatgcctgg tgctgccgaa 7920 gtcgttggtg cccaaatact tgcagagcgc gcacggaaat ggtcatggag gagttgatcg 7980 aacggcgcag ctactcagct ggtgctggtg gccggatatg gcggaaaatt tgcgcgaatt 8040 tgtcgcgact tgcgaaggat gcctgagacg caaaggatac gacatgcaaa aaggtaaacc 8100 agatcgacaa acattattcc gcgcaactcg accgtggcaa atattatata tcgattttat 8160 aaatatgcca aaatcacgaa caggaaaagc gtactgcctc acagttatgg acggatattc 8220 tcgcttttta tcagtttatc ctacggcgag gtgcagagct caggacgccg caactgcact 8280 tatgcggcat attctacttt acgatttccc atctattctg agcagtgatc aaggaactca 8340 cttcaagaac gagctcatgg aagagctttg cggactgctg ggtattcaga ggaatataca 8400 tgttgcattt cgaccggaga gtacaggctg cctggaacgc agtcacaaag tactcaagaa 8460 cgcgctttat ggaatggcgc tggacaaaaa tacttgctgg gaaatggtgc tccccgcagt 8520 tactaatatg atgaacagct gcaagaatcg ggcgaccgga gtatcgccat tcgaatgcat 8580 atatggaaga gtaccgtcat ttaaaggaat acagatggtt gaaaacccat cagcggacaa 8640 accggcaact tacgcgtacg aaattgccga aactctcaaa cggactcata aatttgtgga 8700 tttgtgccaa gaggaagcag accttgcctc taagcaagaa ggaaaatcgc tcataaaacc 8760 gcaaattttg aataaaggcg acaaggttct gctaaataga tcaatgtcgg cggaagcaaa 8820 ggcacagaag aacccatgga tcgggccatt cgaagtaatc aacaccaatg gagtcatcgt 8880 tcagctggaa attgacggaa aagtcacctg ggtgcatcgc tatcactgca tgctgtataa 8940 ggaccgccct cgcgacttgg atccggactt cgtggacgac ttgtacgaca acgacgatgc 9000 tacagccgcg agcaaggagg ggaaaccggc gcttcgtcga tctacgcgcg ctagaaaacc 9060 cgtggaacgc ttcacaaact cttaaggaat cgcttaagtt aatttagtat agattatgcg 9120 atcattaaag ctcgacaagg atcttctgga cgatcgctga ctcttcaact tccttcgttg 9180 gcgtgaaaac tcatttggat gctagaatcg actgggggaa 9220 // ID I-10_AAe repbase; DNA; INV; 5749 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A I non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-10_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5749 RA Kojima K.K. and Jurka J.; RT "I non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1365-1365 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 492..1778 FT /product="I-10_AAe_1p" FT /translation="METDGGEGDSNDNSHSKESPPSKPFRIKIYPSTFSGP FT FVVYFRKKERPINVLLISSEVYKIYKSVKEIKKISLDKLRVVFGSREDANA FT LLESKLFVNSYRVYAPCDSCEINGVIYDEDLNCDDIKNHGLGIFRNKSIAP FT VQILECNRLSKLFFNGSESKYVYSNCIKITFSGSVLPDYVMIDNVIFHVRL FT YYPKIMHCERCLLFGHTASFCSNKLKCSKCGDGHPTSDCKENSGVCIHCRK FT KHISFEQCTVYIENQSKLNQKIKNKNLLSYADIMKSSDNINSPNTYEILPD FT NDDNDFGKQQNFTYNPPIKRKRTHNNSSKTNSFSFEPQPSSSYNTNFPVLK FT NCSQQKNIPGFKKNNTSFDEQFKQTDKINDNENSDNSILKILEELVDFLEL FT SEFWKKIIKMILPFLASILNKLNAVGPIISSLFSL" FT CDS 1781..5470 FT /product="I-10_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASKDSKILKILQWNCRSITNKIDRLKVLITNFNIDI FT FCLNETWLERTKFFRIPQFKIIRKDRTVSYGGVLIGIRDNIEFKYLDLSVE FT SQIEYVAVSITKNGFTFCIICFYIPPNASFSVSQIKNILDLIPSPFYILGD FT FNAHNIAWGSHKSDGRGSLIMDLIDELNLNILNDGSLTRIAVPPNNPSCID FT LSLCSNSLSLMSSWKTINDPNNSDHLPILIEFQNTRCDKFTQETFVFDLYK FT NVDWIKFADLMSISIIRSDTSLSAIEKYNLFSKLLFECLHKSQKKKVITGN FT LKKNRPSFWWDDECSNALKNKSVAFKLFRRSGRREDYFLYCKAEALFTRIT FT KFKKRNFWRHFVESLDKETSLAKLWSVARNLRNYDSSDPNVLEYSEAWIEQ FT FASKICPDFVPTLVNFKNQTVNYFPELCVPFSLEELHLALSLTKNTAPGID FT NIKFIVLKKLPEDCKEFLLSIYNLFLSKNIIPFEWRFVKVVSILKPGKDPS FT LADSRRPISLLSCLRKLMERMLLNRLELWAEKNNIFSSSQYGFRKGRGTRD FT CTALLASQIQLSFNKKQDVVSTFLDVSGAYDSVLIDLLFKKMNNLKIPNLI FT SNFLYNLFSLKIMHFFHNGSSKIIRHSCFGLPQGSSLSPFLYNLFTSDMMN FT IIPNGCFLIQFADDNVLSISGKNREIIRHFMQSALDNIDTWAFDNGFTFSV FT QKTKFMIFSRKHSTISISLYLNGYEIEQVFDYKYLGIWFDSKLNWASHISY FT IQKVCSKRINFLRSITGTWWGAHPSDLLTLYKTTIRSVMEYGCFTFGNAVQ FT THFSKLEKLQFRCLRICLKLLNSTHTQSIEVLAGIEPLKIRFQKLNCKFLL FT QCVSYKHPIIDNLKSLYEINPTSKILSSFTYCLNENLTPISSPGFQNYNIN FT IHTFRPLIDLSLYEELKQIPKIEHPRFANYLFERKFEGISRDQFYFTDGSL FT IQNVAGFGVFNNFLAHFFKLQNPCSIFIAEITALYFACNLIRLFTPNLFII FT CSDSLSSLHALNSINFNSKTHHILLELKNLLYDLYTQGFLIKFLWVPAHCN FT IYGNEQADSLAKLGVTRGIIYNREICASEYHSKLNKYCMDNWQHFWNSSDK FT GRWCHSICPFVDKFPWFKNISVNRNFICTFSRIMSNHYICNSYLYRFDMKD FT SNLCDCGEFYEDIDHIVFVCTKFMIPRNKFIKNLKIFYQTPPTSVRDILGS FT KFYPTLKLLYNYLNEISYLV" XX SQ Sequence 5749 BP; 1877 A; 790 C; 864 G; 2218 T; 0 other; catactcggt aagtaggcct tgaccaggca acacggtttt tttcttcgct tgtcattttt 60 ttctccaatt tcagattcga gattttccga ggatttcaat tttggacgtt gatgttggtt 120 acgttgtttc ctgttttcaa cttactttca agagtttgtg ttcagatatt gggattcaaa 180 aacttcaaga atttgtcgat ttgtggaatt attcatcgga ggaccgtctt caaaattcta 240 agtatcattc gagatttccc aaggatttca atttgggacg ttgatgttgg atacgttttt 300 cctgttttca acttactctc aagagttggt gttcagaaac cgggattcaa acatttcaag 360 aatttgttca tttgtggaat tattcatcgg agggccgtct ttaaaaatct aagtatcgtt 420 tgtatttgtt ttgttcctta cgttatttct tattctattc tgttcattac tgtttacatt 480 tagtttatat tatggaaact gacggagggg aaggagattc gaatgataat tctcattcta 540 aagagtctcc tccttcaaaa ccttttcgga taaaaattta tccttctact ttctcgggtc 600 catttgttgt ttactttcgt aaaaaagaaa gacccattaa tgttttattg atttcttcag 660 aggtttataa aatatataaa tctgtgaaag aaatcaaaaa aatttcttta gataaattaa 720 gagttgtttt tggatctaga gaggatgcca atgccctctt agaatctaaa ttgtttgtaa 780 attcatatcg tgtatatgct ccatgtgatt cttgtgaaat caatggagtt atatacgatg 840 aagatttaaa ttgtgatgac attaaaaatc acggtttagg tattttcaga aacaaatcca 900 ttgcaccagt tcagattttg gaatgcaatc gattatccaa attatttttt aatggtagtg 960 aatctaaata tgtttattca aattgcataa aaattacatt ttctggttct gtgttaccag 1020 attatgtaat gattgataat gttatttttc atgttaggct ttactatcca aagataatgc 1080 actgtgagcg ttgtctccta tttggtcata ctgctagttt ttgttcaaac aaattaaaat 1140 gttctaaatg cggagatggg catccgacct cagattgtaa ggaaaattct ggtgtttgca 1200 ttcattgtag aaaaaaacat atttcttttg aacaatgtac agtttatatt gaaaaccaat 1260 ccaaattaaa tcaaaaaatt aaaaacaaaa atctcttatc atatgcagat attatgaaat 1320 catcagacaa tattaattct ccaaatactt atgaaatttt accagataat gatgataatg 1380 attttgggaa acaacagaat ttcacttaca atcctcccat taaaagaaaa agaacacata 1440 ataattcatc aaaaactaat tcatttagtt ttgaacctca accatcatct tcttataaca 1500 caaattttcc tgtacttaaa aattgtagtc agcaaaaaaa tattcctggt tttaaaaaaa 1560 ataatacttc ttttgatgaa caatttaaac aaactgataa gatcaatgat aatgaaaatt 1620 ctgacaattc aattttgaaa atattagaag aattagttga ttttttggaa cttagtgaat 1680 tctggaaaaa aataattaaa atgattctac ctttcttggc ttctatttta aataaactga 1740 atgcagttgg acccatcatt tcatctcttt tttcattgta atggcttcga aggattcaaa 1800 aattttgaaa atattacagt ggaattgtcg cagcataact aacaaaattg acagacttaa 1860 agtattgata acaaacttca atattgatat attttgttta aatgaaactt ggttggaaag 1920 aaccaaattt ttcagaattc cacaattcaa aattatacgt aaagatagaa cagtttcata 1980 tggaggtgtt ttaattggta ttcgagacaa tattgaattt aaatatttag atttatcggt 2040 ggaatcccaa attgaatatg tagctgtttc aattacaaaa aatggtttta ctttttgtat 2100 aatttgtttt tatattcctc ctaatgcttc tttttcagta tcccaaatta aaaacatttt 2160 agatttgatt ccttcccctt tctatatttt gggagatttt aatgctcata atatagcatg 2220 gggtagtcac aaatctgatg gtagaggttc attgataatg gatttgattg atgaattaaa 2280 cttgaatatt ttgaatgatg gatctttgac tagaattgca gttcctccta ataatccttc 2340 atgtattgac ttatcccttt gttcaaatag tttatccttg atgtcttctt ggaaaactat 2400 taatgatcca aataatagtg atcatttacc tattttgatt gaattccaaa atactagatg 2460 tgacaaattt acacaagaga cttttgtatt tgatctatat aaaaatgtag actggattaa 2520 atttgcggat ttgatgtcaa tttcaattat aaggagtgat acttctcttt ctgcaattga 2580 aaaatataat cttttttcaa aattgttgtt tgaatgctta cataaatctc agaaaaagaa 2640 ggttattaca ggaaacttaa aaaaaaatcg gccttctttc tggtgggatg atgaatgttc 2700 aaatgctttg aaaaacaaat ctgttgcatt taaattattt cgtagatcag gtcgaagaga 2760 agattatttt ttgtattgta aagctgaagc tttatttaca agaattacca aatttaagaa 2820 aagaaatttt tggagacatt ttgttgaaag tcttgataaa gaaacttctt tagctaaatt 2880 atggtctgtg gcaagaaatt tgagaaatta tgattcttca gatccgaatg tattggaata 2940 ctctgaagct tggattgaac aatttgcatc taaaatatgt cctgattttg ttcctacact 3000 tgttaatttt aaaaatcaaa ctgttaatta ttttcctgag ctttgtgttc ctttttcttt 3060 agaggaatta catttggctt tatcactcac taaaaatact gccccaggta ttgataatat 3120 taaattcata gttcttaaaa aattaccaga ggattgtaag gaatttttac tttcaatata 3180 taatttattc ctttctaaaa atattattcc ttttgaatgg cgtttcgtaa aagtagtaag 3240 tattcttaaa cctggtaaag atccttcatt agctgatagt cgaagaccta tcagtttatt 3300 atcatgttta cgtaaactta tggaacgtat gcttttaaat cggttagaat tatgggcaga 3360 gaaaaataat attttttcat cttcccaata tggattcaga aaaggtcgag gtactcgtga 3420 ttgtaccgct cttttagctt ctcaaataca actttcattt aataaaaaac aagatgtagt 3480 ttctactttt cttgatgttt ctggtgcata cgattctgtt ttgattgatt tgcttttcaa 3540 gaaaatgaac aatttaaaaa ttcctaattt aatttcaaat ttcttgtata acttattttc 3600 tttgaaaata atgcattttt tccataatgg gtcatctaaa attattcgtc atagttgttt 3660 tgggttacct caaggatcaa gtttgagtcc atttttgtat aatttattta catcagatat 3720 gatgaatatt attcctaatg gatgtttttt aatacagttt gctgatgata atgttttatc 3780 tattagtggt aaaaatagag aaattatcag acatttcatg caatctgcgt tagataatat 3840 tgatacctgg gcatttgata atggttttac attttcagtt caaaaaacta aatttatgat 3900 tttttcaaga aaacattcaa caattagtat tagtttatat ctaaatggat atgaaattga 3960 acaagtgttt gattataaat accttggtat atggtttgat tctaaattaa attgggcaag 4020 tcatatttca tatattcaaa aagtttgttc gaaaagaata aacttccttc gatcaattac 4080 tggtacttgg tggggtgctc atccgtctga tttattaacg ctttacaaaa caactattcg 4140 ctctgtgatg gaatatggtt gttttacatt tggtaatgct gttcaaactc atttttcaaa 4200 acttgaaaaa ttgcaatttc gttgtttaag aatttgtttg aagcttctga attcaactca 4260 tactcaatct attgaagtat tggctggaat tgaaccactc aaaattcgat ttcaaaaatt 4320 aaattgtaaa tttttgttgc aatgtgtttc atacaaacat ccgattattg ataatttaaa 4380 atcgttatat gaaatcaacc cgacaagtaa aatattaagc tcctttacat attgtttaaa 4440 tgagaattta acgccaattt cttcccctgg attccaaaat tacaatatta atattcatac 4500 ttttcgaccc cttattgatt tatctttata tgaagaattg aaacaaattc caaaaattga 4560 acatccgcgt tttgcaaatt atttatttga acgtaaattt gaaggtatca gtcgtgatca 4620 attttatttc actgatggat ctttaattca gaatgttgct ggatttggag tatttaataa 4680 ttttttggct cactttttta aattacaaaa tccttgttct atattcatag ctgagataac 4740 tgctttatat tttgcatgta acttaattag actttttact ccaaacttat ttataatatg 4800 ttctgatagt ttgagttctc ttcatgcttt gaactccata aactttaatt caaaaactca 4860 tcacattctt ttggagctta aaaatctttt gtatgattta tatactcaag ggtttttaat 4920 taaatttttg tgggttcctg ctcattgtaa tatttatggc aatgaacaag ctgattcttt 4980 ggccaaatta ggggttactc gtggaattat ttacaataga gaaatttgtg cttcagaata 5040 tcattcaaaa cttaataaat attgtatgga taattggcaa catttttgga attcgagtga 5100 taaaggacgt tggtgtcatt ccatttgtcc gtttgttgat aaattcccat ggtttaaaaa 5160 catatctgta aatagaaatt ttatttgcac tttttcacga ataatgtcga atcattacat 5220 atgcaacagt tatttatatc gctttgatat gaaggattca aatttatgcg attgtgggga 5280 attttatgaa gatattgatc atattgtgtt tgtatgcacc aaatttatga tacctagaaa 5340 taaattcatt aaaaatttga aaatctttta ccaaactcct cctacatctg tccgtgatat 5400 acttggaagt aaattttatc ccactttgaa actattatat aattatctaa atgagatatc 5460 atatcttgtt tgatttgtat ctgccttaat tttctttgtt tttttttcag gattgaaata 5520 tgaaacacct accatgagag attatttccc atttttctga ttgaacgttt cagaagatca 5580 agaatctggc tctgttatgg ttctatccga gtgagccttt agtttataag atatttatta 5640 taacgtttta aaaaagatga agaggttttg tgcctttttg agaagatcta attaagtgat 5700 cactcaaagg ggtttttccc tctttcaaaa ttttagttaa aataaataa 5749 // ID BEL-234_AA-LTR repbase; DNA; INV; 335 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-234_AA_; KW BEL-234_AA-I; BEL-234_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-335 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 924-924 (2011). XX DR [1] (Consensus) XX SQ Sequence 335 BP; 100 A; 74 C; 81 G; 78 T; 2 other; tgttggagag cagtaggcca tgaaaaagaa sgaagaaaaa aaaaattcat tacttgtcga 60 accgcgtgtt agattagtgc aataaagcga aaacctccaa attagaaacc gtttaaattg 120 twatttcatt ctccattgcg gagatttccc gaagaaaaag gaagaagtgt tccgtcggat 180 tgtttccgac catttgcagg tgagcgaaac gtgccagcct ctgtgcagaa gcaagaagag 240 ctcccgagaa gacgttcccg tcgcatttgg tccttctccg tttcccaggt gagcccgtcc 300 ctgtcggtga atccgagcga gcgaatcctg aaaca 335 // ID BEL-157_AA-LTR repbase; DNA; INV; 207 BP. XX AC AAGE02018937; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-157_AA_; KW BEL-157_AA-I; BEL-157_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018937; Positions 18322 18116. XX SQ Sequence 207 BP; 70 A; 45 C; 36 G; 56 T; 0 other; tgattgcgta gaaataaaaa tacagtccat ttggtttcag tgtagagaaa tacattaaca 60 gaatttgcgt tacctttcat atcaagaaac cttgtcagat gaatcagtag taaaataaat 120 cttgtaccga gcagacgcgt ccgactttag ttctttccga ccatcaagta tcgcaagtcc 180 gaaccacgtg aaaattcccc ccgaaca 207 // ID BEL-620_AA-I repbase; DNA; INV; 5996 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-620_AA_; KW BEL-620_AA-LTR; Pao_Bel_Ele68; BEL-620_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5996 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5006-5584] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 671..5965 FT /product="BEL-620_AA-I_1p" FT /translation="MSKRKIPLRTLQARLRGLQTTFTNLYTFMENYTANTK FT PLEVSVRLNKLDDLWDQMNEAIDEIVAHDESPDDPEAFVKDRLDFENRFFT FT LKTFLLEKGQPTQNTPSTSTAGTSSASTPHVRLPQITLPKFNGKIDEWLTF FT RDLYTSLIHWQVELPAVEKFHYLRSQLEGEALAVIDSLPLTAANYSVAWDL FT LKQRYTNSKFLRKRQVQALFELPTVKKETASELHGLLDSFEKIVRSLDQVV FT VNKADYKDLMLIHVLTSRLDSTTRRSWEEHSSNQQTDTLKDLTDFLQRRLR FT ILEALPGKAPEQKVEPGHSKLPKRVSAVKSCNATFQSSSPSKCIVCPENHL FT LYQCPQFLKMAVSDRDGILRSNSLCRNCFRRGHQAKDCSSKFSCRHCRARH FT HSLVCFKGKVTEVKTSDKPEPTTTTSESSAESNTSRVVNLATTGVKICNSA FT TSGQQHQAPTTGVALLTAVVELEDDKGNKLHARALLDSAAECNLISRRLRK FT MLTVKEENSTVEVIGIQGMASKVQGKFTVLVKSRVTNFSQPMEVYVLPKLA FT VQVSTATFDTSVWDIPKGIELADPDFLKGERIDLLLGAESFFEFFVSDRRI FT RLGENLPLLVDSVFGWVVTGRYSVGGPIQSVLCKVAISSRLDEVLERFWRC FT EEIGLENNYSPEEARCEAHFAQTIKRNASGRYVVSLPKNEEVLVKLGSSKS FT IAERRFFQLERRLTRNEKLREEYCAFMAEYEALGHMRLVAETVEEEGRCYL FT PHHPVVKEESTTTKVRVVFDASAQTSTGFSLNDGLLAGPVIQDDLRSIILR FT SRTRQIILVADIEKMFRQIDVCPEDCRLQSILWRSSPDKPLATYELSTVTY FT GTKPAPFLATRSLSQLAMDEAEEFPLAAKAVMEDFYMDDAITGAEDPNVAK FT QLRIQLQELLRRGGFVLRKFASNCEAVLEDLPIENRSIQTSDGIHLDPDSS FT VKTLGLVWMPNSDTFRFKFRFSPLTEDAILTKRKVLSEIATLFDPLGLVGA FT VITKAKIIMQLLWRLQDENNRQSAWDAKLPAKVEEEWVRFYQRLPILNELR FT IARLVTLTKPVSAQLHLFSDASEKAFGVCAYLRTQDEKGEVKVALLSSRSR FT VAPLKTQSIPRLELCGALLASELYLKIKASIRFSGECFFWVDSTTVLRWLH FT APPLTWATFVANRVSKVQASTENCHWRHVPGEQNPADQISRGIWPQEIVDN FT QLWWKGPSWLQLSPENWPSDKAFTSEGHEEERRRAAFVVAAPVEANFFAEY FT LARFSSFTTLIRTTAYLLRYFHNLRAKRELRRSSGFLTTEELQQAENFIVL FT RVQRETFHREVSALTKGECVPRSSPLRWYHPFVAENGLLRVGGRIGQANES FT EYTIHPIVLPARHLVTKLLMRYYHQRLLHAGPQLMLSTVRLRYWCLGGRNL FT ARETYHQCVRCYRTKPKAIRQFMAELPAPRITPTRPFATTGVDYFGPVYVR FT TGYRQRAVKAYVSVFVCFSTKAVHLELVTDLSTARFLQALRRFVSRRGRCA FT SLYSDNGTNFVGAKNQMMELINRLKSKDHHDVVAKECAEDGMSWHFIPPGG FT PHFGGLWEAAVRSAKVHLLRVLGDTVVSYEDMVTLLTQVECCLNSRPLTQL FT SDDPEDLQALTPGHFLVGSALQALPDEDYMQTAVGKLQDLAATQRRLQDFW FT KRWRSEYLTQLQARTKWWQPPVDVTVGSLVVIREDNVPPIRWKMGRITNTH FT PGPDGVVRVVTLRTMNGTSTRPVSRICVLPVPASTEVKSAEGEEN" XX SQ Sequence 5996 BP; 1579 A; 1468 C; 1502 G; 1445 T; 2 other; agtggtcctt cgaaccggat ggttgaagga aatccatccg gtaggcaatc tacggtattc 60 atttcaccaa cgcagtcgca caaccccacc tctattggca gccgagaaac gccatcttat 120 ttcacggcgt cggttgcaat tggcgccatc ataggagtat tgaataatta aatccaaggc 180 acattacttg ccttgaatag gtgagtgaga ttccaacaaa attattctcc atctcccagt 240 actgctcgtg gtctttcttt cggatcaggt ggaaatttcg atcatcgtaa ttcgacgccg 300 gcgagactat caccattgct cggtgactca ctgcaatttc aaccggactt cgtttgtgga 360 aacgaagcat ttcccaattc ctggttcacg cttgctggta atcactggtg atttaccaca 420 catcgggtta aaagtgaatg gacataacct accggtgctg agggagcatc ctaaccccat 480 cggttcagct acgttgtttc aaccgctact acagtagcac cctactagtt cgtcgttgct 540 gattgtacgc ggagcatctt taaccaataa tctcaaggct caatttatga gctaataagg 600 taattagctc tccaaacaca ttactctccc caagcttcgc tacttggtct tcttttttcg 660 agggaacatc atgtcgaaac gtaaaatacc gctgcggaca ctacaggcgc gactccgcgg 720 actccagacg acgttcacga acttgtacac gttcatggag aattacacgg ccaacacgaa 780 accgttggaa gtcagcgtgc gactgaataa actggacgat ctgtgggatc agatgaacga 840 agctatcgac gagatagttg cgcacgacga atcacccgac gaccctgaag ccttcgtgaa 900 agatcggctt gacttcgaaa accgattttt cacactgaaa acgtttctgt tggagaaagg 960 tcaaccaact caaaacaccc cgtccacttc taccgctggt acctcctctg cgtctacccc 1020 acacgttcga ttaccgcaga tcacccttcc taagtttaat ggcaagatag atgaatggct 1080 cacgtttcgg gacctttata cttccctcat tcactggcag gtggaacttc ctgccgtgga 1140 gaagtttcat tacctccgca gccagttgga aggagaagcc ttagccgtga ttgattctct 1200 accccttact gcggcaaact atagtgtggc atgggacctt ctcaaacagc ggtatacaaa 1260 ctccaagttt ctgcgcaagc ggcaagtgca ggcgcttttc gagctaccga cggtcaaaaa 1320 ggaaactgcs tctgaactac acggtctttt ggactccttc gagaagatcg tgagatccct 1380 ggatcaagta gtcgtaaata aggctgacta taaggatctg atgctgatac acgtcctcac 1440 gtcccggttg gatagcacca cgcgaagaag ttgggaggaa cactcttcca atcaacaaac 1500 ggacacccta aaggacctca ccgattttct tcagcgtcgt ttgagaattc ttgaagctct 1560 ccccggcaag gctccggaac aaaaggtgga acccggtcac tccaaactgc cgaaaagggt 1620 ttcggcagtg aaatcctgta acgctacgtt ccaatcctct tcacctagca aatgcattgt 1680 ctgccctgaa aaccatttgc tatatcaatg cccccagttt ctgaaaatgg ctgtttcgga 1740 tagggacggt atactacgaa gcaactcatt gtgccgaaat tgctttcgta gaggtcatca 1800 agcgaaagat tgctcgtcta aattttcatg tcggcactgt agagcgagac atcactcact 1860 cgtttgtttt aaggggaagg taaccgaagt taagacgagt gataaacccg aaccaaccac 1920 tacaacgtca gagtcgagtg cagaatcgaa tacatcgagg gtcgttaacc tggcaacgac 1980 aggagtgaag atatgcaact cagccacatc gggtcaacaa catcaagcac caacaacggg 2040 cgtcgctctg cttactgcag tcgtagaatt ggaagacgac aagggcaaca agctgcacgc 2100 tagggcgttg ctggatagtg cagcggaatg caacctgatc agcaggcggt tgaggaaaat 2160 gctgactgtt aaggaggaaa acagtacggt agaagtcatt ggaattcaag ggatggcctc 2220 caaggttcaa gggaaattta cggtcctagt aaaatcccgg gtgacgaatt tcagtcaacc 2280 aatggaagtt tatgttttgc caaaacttgc ggtacaggtg tccactgcaa cgtttgatac 2340 cagtgtgtgg gacataccca aggggatcga acttgcggat ccagatttcc ttaagggtga 2400 acggattgat cttctgctgg gagcggaatc atttttcgaa ttctttgtct ctgaccgtcg 2460 catacggttg ggggaaaatt taccattatt agtcgactcc gtttttggat gggtggtaac 2520 aggcagatat tcagttggcg gtccaatcca atctgtcctg tgcaaggttg caatttccag 2580 tcgacttgat gaagttctcg aacggttttg gaggtgcgaa gaaattggac tggagaacaa 2640 ttattctccc gaagaagcaa ggtgcgaagc ccattttgcg caaacgataa agcgaaatgc 2700 gtctggtcgg tatgtggtgt cattgcccaa gaacgaagag gtgttagtca agctgggtag 2760 ctctaaatcc atcgccgaaa ggaggttttt ccaattggag cggcggttaa ccaggaacga 2820 gaagttgcgg gaggaatatt gcgcctttat ggccgaatac gaggctctgg gacatatgcg 2880 attggttgcg gaaacggttg aagaggaagg tcggtgctac ctcccacacc atccagttgt 2940 gaaggaggaa agcacaacca caaaagtgag ggttgttttt gacgcttcgg ctcaaacctc 3000 aacaggattt tccctcaatg atgggttgct cgcaggtcct gttattcagg atgatttgag 3060 atccattata ctgcgaagca ggactcgtca gatcattcta gtggccgaca ttgaaaaaat 3120 gttccggcag atagatgtct gtccggagga ttgccgtctt caatcgatct tatggaggtc 3180 aagcccagac aaaccattgg ctacttatga gctatccact gtaacctacg gcaccaaacc 3240 ggctcctttc cttgcaacca gatccctcag tcaacttgcc atggatgaag cagaagaatt 3300 ccctttggcg gccaaggccg taatggagga tttttatatg gacgacgcca ttaccggagc 3360 agaggacccg aatgttgcca agcagctgag aatccaacta caggaactac tacggagagg 3420 agggtttgtc cttcgaaaat ttgcctcgaa ttgtgaagcg gttttggaag acctgccgat 3480 cgaaaatcgg tcaatccaaa cgtcggatgg aatccatttg gacccagatt catcggtcaa 3540 gacactaggc ttggtttgga tgcccaacag cgatactttc cggttcaaat ttcgattttc 3600 gccattgaca gaagacgcca ttctaaccaa acggaaggtt ttatccgaaa tcgcaaccct 3660 ttttgacccc ttagggcttg ttggagcggt cattactaag gctaaaatca tcatgcaact 3720 tctttggcgc ttgcaggatg agaacaaccg tcaatcggca tgggatgcta aattacctgc 3780 gaaggtggag gaagaatggg ttcgcttcta tcaacggctt cctattttga acgaactgcg 3840 tatagcgagg ttggtcaccc taacgaaacc agtcagcgca caactacacc tattctcaga 3900 tgcttccgag aaggcgttcg gcgtctgtgc ttatctacga acccaagacg aaaaaggaga 3960 ggtaaaggtt gccttgctgt catcgagatc gagagttgcc cccttgaaaa cccaatcaat 4020 ccccaggtta gagctctgcg gggccctact cgcctcagaa ttgtatctga aaataaaagc 4080 ttcgattcgc ttcagcggag aatgcttctt ttgggtggat tctacgaccg tgttgcgatg 4140 gttacacgcg ccacccttga catgggcaac gtttgttgca aatcgcgtat caaaagtgca 4200 ggcatccact gaaaactgtc attggcggca cgtacccggg gaacaaaatc cggcagacca 4260 aatttcacgg ggaatatggc ctcaggaaat cgtcgacaac caactatggt ggaaggggcc 4320 ctcctggctt caattaagcc cggaaaactg gccatcagat aaagcgttca catcggaagg 4380 acacgaggaa gaacggaggc gagcagcttt cgtcgtagct gcgccggtag aagcaaattt 4440 cttcgctgag tatcttgctc ggttctccag cttcaccacc ctcattagga caacggccta 4500 tctcttacga tatttccata atctacgagc aaaaagggaa ttgcgacgat catctggttt 4560 cctaacgacg gaagaattgc agcaagcgga aaatttcatc gttcttcgag ttcaacgaga 4620 aaccttccat cgcgaggtca gtgcactaac taaaggagaa tgtgtaccac gatcatcacc 4680 tctacgttgg tatcatccgt tcgtggccga aaatggtttg ctacgagtcg gtggaagaat 4740 aggtcaagcg aacgagtccg agtacactat ccatccaata gtactacctg ctcgtcattt 4800 ggtcaccaag ctgttaatgc gctattatca tcaacgtcta ttacacgctg gtccacagtt 4860 gatgctgagc actgttcgtc ttcggtattg gtgcctcggt ggacgaaatt tggcgagaga 4920 gacctatcat cagtgcgtgc gatgttatcg cacgaaacca aaggctattc gacaatttat 4980 ggcggagctg cccgcacctc gaatcacccc aacgagacca tttgcaacaa ccggagtgga 5040 ctacttcggt ccagtatacg ttcggacggg gtatcggcag cgagcggtca aggcatacgt 5100 ktcagttttc gtgtgctttt cgaccaaagc ggttcattta gaactggtca ccgatttgtc 5160 gaccgcgcga tttcttcagg cattacggag gtttgtgtcg cgccgcggga ggtgtgccag 5220 cctctattct gataacggca ccaactttgt tggggccaaa aaccaaatga tggagcttat 5280 caaccggttg aaatccaagg atcaccacga cgttgtagcg aaggagtgtg cggaggatgg 5340 gatgtcatgg cattttatcc cacccggcgg gccccacttt ggcgggctgt gggaagccgc 5400 cgtgcgatca gcaaaggtgc atctgttacg tgtgcttgga gataccgtgg tttcgtacga 5460 ggacatggtg acgcttctta cccaggtgga gtgctgcctc aattcaaggc cgctaactca 5520 gctttccgat gaccccgagg acctacaagc attaacccct ggccattttc tggtaggatc 5580 agccttgcaa gccttgccag acgaggatta tatgcaaacg gcggtcggaa aactgcagga 5640 cctcgcggcc acccaacgca gacttcaaga tttctggaag aggtggaggt ctgagtactt 5700 gacccaactc caggcccgaa ccaaatggtg gcaaccacct gtcgacgtca ctgttggcag 5760 cttggttgtt atacgagagg acaacgttcc accaatccgg tggaaaatgg gccgcattac 5820 caatacacat cctggaccag acggcgttgt aagggtagtc acgttgcgga cgatgaacgg 5880 taccagtacg cgcccagttt ccagaatctg tgttctaccc gtcccggcct ctacggaggt 5940 gaaatcagct gaaggagaag aaaactgaaa ctccagtcgt ttcagggagg cgagga 5996 // ID BEL-221_AA-LTR repbase; DNA; INV; 694 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-221_AA_; KW BEL-221_AA-I; BEL-221_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-694 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 900-900 (2011). XX DR [2] (Consensus) XX SQ Sequence 694 BP; 243 A; 115 C; 124 G; 212 T; 0 other; tgtcgcggga ggcgttacaa cctttggcgc ggcagtagaa attcccaaca gggacgctgt 60 ggatttgaca gacggcttgt cacccaatga cttcaacgtt gtgagtggag ataaacaaat 120 agcttatgag aagatcgttg ataaagtgtg attcatttgg aaagctatcg tagctaaaat 180 ccagttaaaa tcctttaatt tagacaatta tttacatgcg attactcaca atgaattcca 240 atggactcga gtaattccca aaagtgagtt atatatgaaa gacagttgta ttaattacaa 300 atattagtta atttgctaat gttgtggttt caggttaggt tccatacact atatcctgct 360 taatctattt acatgaatta aaactgaatt tgaccttaaa ctatttcata aaaccatgca 420 attgtaagta aacatacatc ctaaaacatg acttaagact aaaaccttaa aatcgactta 480 aataggtaaa ttgactagac tgacatgacg gtaacgaaga cgttgactga aaatactaga 540 gtcactgaga tgtgagtatt gtttactgtt aaaaaagatt aatcgaacat aaaatgaaat 600 ttatattata gttttaaagc cctttgccga ataaatatgg aaatcgctac aaaatcagtg 660 tcccgttttt tcaacgtctt gcgccgttcc aaca 694 // ID Gypsy-3_OD-I repbase; DNA; INV; 7279 BP. XX AC CABV01000151; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_OD_; KW Gypsy-3_OD-LTR; Gypsy-3_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-7279 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000151; Positions 84152 91430. XX CC Positions [4300-4773] - Integrase core CC LTRs are 91% similar to each other. XX FH Key Location/Qualifiers FT CDS join(493..1848,1852..5364) FT /product="Gypsy-3_OD-I_1p" FT /translation="MSNEIYFKNVADLSRDELLKIYDSLPEKEYPCRIPVT FT DKKSLVLNILQFDQNFVSHEKKHVFEKSSDGNFLTYHPNHDGSCEQKFAKS FT HHKINTEQILAASKAENLSALMASLSCDESDSDGLPGLQMSDMSSETDSDA FT EETIIPNTPKQKPEPRAPNIKREEKENKRKPRKAKKATSSKIPTPMAPVSH FT VSKLRVKPPRYDATIPIPSWLKNLEIYAHCSKLNDNEIITVALTSLLNESE FT GSHVIQSLEAEDLNDWSRFKIRLTTMLGKTNDFYKNEFKNYSRGSDSFGMC FT LSKLISLYKQGYDQKYLGNNDKMLIIERFCESQNSKIRELLMREKSTLTID FT NIATRAGEIEAAIPTRDLNFNISAEEKADSMRELTFQIKQMLKKNVESSNA FT TSFRKTKTGERQKIDTSKLQGHCLSYQKRKKCRYNKSCKYLHSDNAPKDVI FT EYVKSIELALDNSDQIFMCPNYDSAPTTLKFINVIIENLQYPAMLDSGCTR FT TCIRSDLPFIRKTRNSDVELLCANNEVISPKLLCSKIKFQINTKDGPITME FT CSPLVVSNLSVPIILGLDCLQNFGFNAKEDFVTLDYRQIKTVKPEISSKLV FT KISAIIDIDMQKNKEEADSLKIFKSTRESTAEKTNFKPAIGNYGDATDDQK FT LILVKLVNKNRLSFSMSSDDLGKLHGFRFSLPTHDESKSSHQPPRPIPIHV FT RDQVETEINEWKSLGIISETQSDYNIPLIILKKPDKTIRISLDARGLNSLL FT IKDRYPLPHMLTLFNKIGEKLTTGDACFISSLDFHRGYWQVQCEANDAHKL FT AFSFANKHYQANRMLYGTATAPSAFSRIMSKLMDHPSIIIYLDDLICIDST FT FEGHLQTLEFIFKQCSDHGLLLSAKKCNLCMRETEFLGHQILRTGIQPSDK FT HIAAILKINVPTSRKELKRYLGMVTFNAKYVKDASIILAPLYELTSLKNDF FT NWTETHQKAFETMNQELTKKPTLGHFKLNSKLLLVTDSSGKTVGGTLYQNQ FT NGDLVVLGYFSKALTGPDLKRSMRVKELFAMTWAIKHFEFYLLNTEFACYV FT DHKSLLYLFREQQQSKLDIKLTNIHCYLNQFDFTIIHKPGNSQIMASADYF FT SRLPTSKSSDLDENSLKFEELPDIVFMFHTQQKANNDVIFSFGSRKLTKSE FT INILQNSCNLTKNKIIKLEKNKRSKFILKDRVLFSKNRLVLPDLLADEFIQ FT YLHCITGHAGAKQLNHMVRKFYISNVQEKIRATTSSCPACIQIKPVKKLKP FT SMIKNRHFESVPFERTFMDLVDYGRADSSNKRYLLTCCDALTGFLDGHPLN FT NKTDKAVAAGILTLILRHGICEILVTDNGREFGPLCKQIFDRFCIRHVTTT FT AYRSCSNGKIERQHREIHIQLKQMNANDRNWSQKWELSKYFLNNLPKTSLD FT MLSSNEALYGRAFHVPYKIGSQQEDGAKEPFIKALNNYIKELHPSIQAFQV FT QRYQKLLDKDRNSCPVLELGTKVLAWKPNIADGKLGTNWSGPYFVHKRISK FT DSYILKCTETSRIYRRHISLIRPLKTRIGNTTEFKKLQCLQNDDSENKNDS FT HVEANDAENNEVNTSSENCELVKNQFNEISPLHDLQAPDEIKSPESQWSTR FT LRPRK" XX SQ Sequence 7279 BP; 2522 A; 1622 C; 1309 G; 1826 T; 0 other; gtaccactta aatccatcca actgactgac atccatccaa ctgactgaaa tgcctttata 60 accgcgccgc ctgtgcacta tttttcatta ttacacactt actctttcgc acagcaatac 120 ttgaataaat tgccagagct acaaaatata ttttactttt tgtcttttgt aattgcaatt 180 gtagaaagca ttacaaaata ttggtgactg aataaaacgg cgataaagat ccgaagagaa 240 caatacgact gcagctgggg acaactgctg cagcgcttcg acttgcggta aacttcagaa 300 gagcgataca aaaggttaca ggtaagtaaa aagaaacaaa tttttgggta cataaatatt 360 ttttaaatcc agggcgagcc aaagtaaacg gagtgtcaac catacgttgc gccaccggta 420 ggtgaacctt taggataata taaaattaat ttatctgatt tttcttctaa ttcttttttc 480 cgtaattgaa caatgtctaa cgagatttat ttcaaaaacg tggctgatct ttcacgtgat 540 gaacttctca agatctatga ttcacttccg gaaaaagagt acccttgtcg aataccggtt 600 acggacaaga aaagtctcgt cttgaacatt cttcaatttg accaaaattt tgtttctcac 660 gagaaaaagc atgtttttga gaagagttca gacggaaatt tcttgaccta tcaccctaat 720 catgatggtt cttgtgaaca aaaatttgca aaatcacatc ataaaatcaa cactgagcag 780 attctcgctg caagtaaagc ggaaaatcta agtgccctta tggcctcgct aagctgcgat 840 gaaagtgact ctgatggctt accaggactt caaatgtccg atatgtcgtc agaaacagat 900 tctgatgcag aggagacaat tattccaaac actccaaagc aaaaaccaga gccaagagct 960 ccaaatataa agagggagga aaaggaaaat aagcgaaaac caagaaaagc aaaaaaggct 1020 acctcttcta aaattccaac accaatggcc ccggtttctc acgtttcaaa actgcgagtt 1080 aaaccgccaa gatatgatgc cacaattcca atcccaagct ggctgaaaaa tcttgagata 1140 tatgcgcact gctccaaatt gaacgacaat gaaatcatta cagtggcgct gacaagcctt 1200 ctgaatgagt cagagggatc tcatgtaatc cagagcttgg aagctgaaga cctgaacgac 1260 tggtctcgat tcaaaattcg acttactaca atgcttggga aaacaaatga tttctacaag 1320 aacgaattca agaattacag tcgcggatct gacagctttg gaatgtgtct cagcaagttg 1380 atcagcctct acaaacaggg ttacgatcaa aaatacctcg gaaataatga caaaatgctg 1440 atcatcgaac gattctgtga atctcaaaac tcaaaaattc gtgagctgct gatgcgagaa 1500 aaatcaacgc ttacaatcga caatatcgcg actcgcgctg gtgagatcga agcagcgata 1560 ccgacacgcg atctgaactt caacatttct gccgaggaaa aagctgactc aatgcgtgag 1620 ctgacttttc aaataaagca gatgttaaag aaaaatgttg aatcttcaaa tgcaacttct 1680 ttccggaaaa caaagactgg tgaacgacaa aagattgata caagcaaatt acagggtcat 1740 tgtctcagct accaaaaacg caagaaatgc agatacaaca aaagctgcaa atatcttcac 1800 tctgacaatg cgccaaaaga tgtcatagaa tacgtcaaat caatcgaatg actagcactt 1860 gacaacagtg accaaatttt catgtgccca aactatgact ccgctccgac aacccttaag 1920 tttataaatg taataatcga gaacctacag taccctgcaa tgctcgatag cgggtgtact 1980 agaacatgta tacgctcaga cttaccattt atccgaaaga ccagaaactc agatgtcgaa 2040 cttctatgcg caaataatga agtaatatcc ccaaaacttc tttgctcaaa aatcaaattt 2100 caaatcaaca caaaagacgg accaatcacg atggaatgtt cgccacttgt tgtctcaaac 2160 ttaagcgttc cgatcattct tggcctcgac tgtctgcaaa actttggctt taacgcaaaa 2220 gaagattttg ttactttaga ctaccggcaa attaaaacag taaagcctga gatttcgtct 2280 aaattggtca aaatttctgc aatcatcgat attgacatgc aaaaaaacaa ggaggaagct 2340 gattcactca aaattttcaa atcaactcga gaatcaactg ccgagaaaac aaacttcaag 2400 ccagctatcg gaaattatgg cgatgcaact gacgatcaaa aactaatcct tgtcaaactc 2460 gtcaacaaaa atcggttgtc attttcgatg tcaagcgacg atcttggcaa acttcatggt 2520 ttcagattct cacttccaac tcatgatgaa tcaaagtcaa gccaccaacc acctaggccg 2580 atacctattc atgtccgaga tcaagttgaa acagaaatca atgaatggaa atcactcggg 2640 attatatctg agacgcaatc agactacaat attccactta ttattctgaa gaagcctgac 2700 aaaacgatca gaatctcact tgacgcgcgt ggtcttaact cccttctgat aaaagatcga 2760 tatcctctcc cacacatgct gacactgttc aacaagattg gtgaaaaact gactaccggc 2820 gacgcatgct tcattagcag tcttgacttc cacagagggt actggcaagt tcagtgtgag 2880 gcgaatgacg cgcacaaatt ggcgttcagc ttcgcaaaca agcattatca ggctaatcgg 2940 atgctctatg gaacagcaac cgctccttca gctttcagca gaataatgtc aaaacttatg 3000 gatcacccca gtataattat ctaccttgac gacctcatct gcatcgacag cactttcgaa 3060 gggcaccttc aaaccctcga atttattttc aagcaatgct cagatcatgg tctgcttcta 3120 agcgcaaaga aatgtaatct gtgcatgcga gaaacagaat ttctggggca ccaaatttta 3180 aggactggca ttcaaccgtc tgacaaacac atcgcagcaa tactgaaaat caatgttccg 3240 acttcacgaa aggagttaaa acgatactta ggaatggtaa cgttcaacgc aaaatatgtc 3300 aaagacgcat caatcattct ggcgccttta tatgaactca caagtctcaa aaatgacttt 3360 aattggaccg agactcacca aaaggctttt gaaacaatga atcaagagtt aacaaagaaa 3420 ccgacacttg gacacttcaa attaaattcc aaattgctgc ttgttactga cagctctgga 3480 aaaactgtcg gcgggactct ctatcagaat caaaatggtg atcttgtcgt tctgggttac 3540 ttttcaaaag ccctaactgg tcctgatcta aaacgaagta tgagagttaa agaactcttt 3600 gcgatgactt gggcaatcaa gcactttgaa ttctatcttc ttaacaccga gttcgcttgc 3660 tatgtcgatc acaagtctct cctctaccta ttcagagaac agcaacaatc caaactggac 3720 ataaagctaa cgaacatcca ctgctacctt aatcaattcg actttacaat catccacaag 3780 ccgggtaaca gtcaaataat ggcaagtgcg gactacttct cccgccttcc gacatcgaaa 3840 agttcggatc tggatgaaaa ttctctcaaa ttcgaagagc ttccagatat cgtctttatg 3900 ttccacactc aacaaaaagc taacaatgat gtaattttct ctttcggttc gagaaaatta 3960 acaaaaagtg agataaacat ccttcaaaat tcgtgcaatc tcaccaaaaa taagattatc 4020 aaattggaga aaaacaagag atcgaaattc attctaaaag atagagtgtt attctccaaa 4080 aatcgactag ttctaccaga cttgcttgct gatgaattta tacaatatct tcactgtata 4140 actggccatg ctggtgcaaa gcaactaaat cacatggtaa gaaagttcta catctcaaat 4200 gtgcaggaga agatccgagc aacaacgagc tcctgtcctg cttgcataca aattaagcct 4260 gtcaagaagc ttaaaccatc gatgatcaaa aacagacact tcgagtctgt cccgttcgaa 4320 agaacattca tggatctcgt cgattatggg cgcgcagact cgtcaaataa acggtaccta 4380 cttacatgct gcgacgccct tactggtttt cttgatggac acccacttaa caacaagacc 4440 gacaaggctg ttgctgcagg gatcctcaca cttattctac gacacggaat ttgtgagatc 4500 ctagtgacgg acaatggtcg cgaatttggc cctttgtgca agcaaatatt cgacagattt 4560 tgtatccgcc atgttacaac aactgcatac agaagttgct caaatggaaa aatcgaacgg 4620 caacatcgcg aaatacatat ccaacttaaa cagatgaatg caaatgatcg aaattggtcg 4680 caaaaatggg agctgagcaa atacttcctc aataacctcc caaagacatc acttgacatg 4740 ctaagcagca acgaagcact ctacggtaga gcatttcatg ttccatacaa aattggatca 4800 caacaagagg acggtgcaaa agagccattc atcaaggcac ttaacaatta tataaaagag 4860 ttgcatccgt caatacaagc atttcaagtc cagcgttatc aaaaactgct tgacaaagac 4920 cgtaacagtt gccctgttct agaattagga acaaaagttc ttgcctggaa accaaatatc 4980 gctgatggca aacttggcac aaattggtct ggaccctatt tcgtccacaa gcgaatatcc 5040 aaagacagct atattctgaa atgcacggaa acaagcagaa tttatcgacg gcacatcagt 5100 cttattagac cacttaaaac gagaattggt aacacgactg aattcaaaaa acttcagtgc 5160 cttcaaaacg acgattctga aaacaaaaat gacagtcacg ttgaggcaaa tgatgcagag 5220 aacaatgagg ttaacacaag ttctgaaaat tgcgagcttg ttaaaaatca atttaacgaa 5280 atatccccac tacatgatct tcaagctcca gatgaaatca aatctcctga atctcaatgg 5340 tccacacgat tacgaccgag aaaataattt caaataaaag ggggaaaaga agagttataa 5400 tgagaactct taattcaaaa tgtgaaatct gcttactgag ttgcattgtt ggttcgcact 5460 caaatacaaa aattcagagg ctatcctgag ccctctaaag tatagttcct gagaggaaag 5520 atgtattcca aaactcttat aagagcctac tcaacaaaga agcctgatcc gagcttcacg 5580 atgcaattaa tttctgagaa gaacctgttc gtgttcttca aaaaacaatc accgaagaga 5640 agttagctgc atctcaaaac actgacatga atctttttaa acaaggagcc agttccgagc 5700 ttcaatgtaa cgattgcaga gagggagaag ttatttctac gagaaaatct gaacatattc 5760 gaatcaacac cgtactcatg aaatgtaaaa acggctaaac ttctggccct aaacgtctca 5820 aaagaagaca actgtatgac gaaaagtcca tcaaccaaaa gcttaagcaa ccgaggagac 5880 caaatctcaa atttatgtgc aatggatcaa gttcaatcca ttagatgata tggcgagtca 5940 gcttcaacgt agaaggaacc tactttaaaa aaccccgccg aggcctgcac agccctgtga 6000 tgatggctac cgaccactgt tcaacctcta atcagtgcga tatttatgtc cacagcgcgc 6060 aaccggacaa aaatactcaa agctgttacg tttagcaaat gttggtagaa gcaaaaagcg 6120 cctgcttccc atacagctcc gtaatcatat aagagctcgt ataaatcgag catatttcaa 6180 ctctcgtata aatcgagaaa atctactctc tctgaaattg agaaaactcg cataaaatca 6240 aaagcgagaa atcacggcga aatttcatac aattcaaata tgacgtcttg atcgcgcagt 6300 cgtaaaaaaa gggaaaaaag agaaagaagg caattgggat ggccgatttc gcgattttca 6360 atttcttgcc gagctgcgca tttcaaaaaa taaaaataac gaaaaaatgc tgccgctgaa 6420 aaaacgatcg ccaaccgaaa tcgacttcaa cctcttcagc gatgatatct gcgcaaagca 6480 cgtcttcaac aaatatattt taccagtagt attcgggggt accttcgcag atcgcttttt 6540 ggatcaaaaa ttcacaacga gaatccgcga atttaacgat ttcgttggct ttgttaaggg 6600 aaccacagat cagtttttgc ttgaaaacaa attcaacaag caaaatgccc ttcaagtttt 6660 cgaattctgg aaaacgaaat ccccaaaaca agaatctctt atgcagtgga ttgcaagaac 6720 agtagtcttg attcaaattt tggaccctta tctgacaaaa aaaattggta aaatgattgt 6780 atctgactac gaacacaatt tcctaaaacg tcgcacaatg tgggacaagc ttctcacttt 6840 cgatctgcaa gcagaaaagc gtgtgactat gatgttcctc aaaatgttct acgaagatat 6900 atcacagttc aaaaatacga ctctcccaga tcctcaagtt atccagccgc aacaatcgca 6960 gtcggctgca ttcgaccaag ttttcccatc tactcagccc cgaatggcaa ctacttttcc 7020 agaagagctt aacagctcaa atcacatccc ttattcttac gaaaatccag ttccagtctc 7080 aaatgcagct ccagactacg ctgatttggt tccttacgtt ccatactcag tccaaaatca 7140 gccttcatgt ccagacaatc cagctcaaac ccaaaatcag gcgcaatttc ctgaaaatgc 7200 atatatccaa agcgacgctc tcctccaact gaccagatat taaatacgtg caaaagtcat 7260 tctttaagaa gggaagatt 7279 // ID Gypsy-15_DWil-LTR repbase; DNA; INV; 275 BP. XX AC scaffold_180708; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_DWil_; KW Gypsy-15_DWil-I; Gypsy-15_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-275 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_180708; Positions 8567665 8567939. XX SQ Sequence 275 BP; 78 A; 56 C; 67 G; 74 T; 0 other; tgacgagcca agcaaaatag tgagcggaag caattgcaac atagtcacag tagctgtgtc 60 agctatttag tcggtctcac ggtcgtccga aatatgatcc tccagctgag gcgcattgac 120 cgcggccacc agcgaggcac gaggcgctga ttcagcaatt cttggttggc tgtgggcttc 180 tttgggtctt tataagatat tataagttta gtttctaaat atatttcgag tcggcaagtt 240 agccgctaat aaagagaaca ataattaaat atcca 275 // ID Sola1-3_AC repbase; DNA; INV; 3148 BP. XX AC . XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Sola1 DNA transposons from Aplysia californica. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-3_AC. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3148 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(344..721,693..1505,1683..2585) FT /product="Sola1-3_AC_1p" FT /translation="TCQWAIWHEAVEVHAVITHVVLKLVYRNCDFLLKLFK FT NVCLLAEVISKCQGSTAEESFEFDNSDTDPSFQLGLSEARRCKEEVFAACD FT RCDALLCYVHFLEDTNVCDHGNRKKTALTKKKRKRKQNPRKRGKENRIDQE FT KEEKKTELTKKKKENKAVKENRKVPESSEQPEHFVLEGSRKEHTVEKQPRI FT NKQKAAKRKLASGEDYFSPYTKKVMPAKKMGKACLGVTCKNRALECDTITE FT SLRKDIFQEFYSLASLQLQREFILRHVVNSDKKKESSHQSRRQKTNVYYLT FT AQGSLVKVCKKLFLNTLAISDRTVRTAFKKLTPSGTIEKERRGGRQSTKIV FT ERDTKIRNDIEEHLRRFPRMESHYCRASTAKEYLHPDLNLRKNVLHVSRAV FT PWDVKEECKQRASKDETVLSATFDLQQVIYLPISNESALFYKRRFSNYNLT FT FYNLGRTDCHCFLWNESQSKRGSSEISTAVFEALQHYDSCGKKIAYLFSDG FT CPGQNKNTIIPTMLLYSVNTLPNLREISQRFFVTNYGQMKGDSVHSTISSA FT MSAAGNVYVPTELTPIFRLARPKQPYTVMPLEFSDFWNFKQLSLDLRVLGV FT QRDSESDTINWNKMMEFRVLKECPTTIFFKTSHIESDYRTIQLKRHTKQTT FT ESFTLFKLNTEPTKVPMAKYNDLVSLCSGVTPVVRQADHKAFYFSLPH*" XX SQ Sequence 3148 BP; 1022 A; 658 C; 660 G; 808 T; 0 other; cgtcggtcat ctcacgaagt gtcgtaactg catttttggg cattttgagt tatttatacc 60 atgccgatca gcttgacgag atctttcttc tctcaaagcg aaaaggtgct atctctacta 120 gttttcgagt tataaaatat ccgagttacg tcgtatcgta aaatcaggac gtattcaatc 180 tcacgaagag tcgtatcata tcatacgtca cttcgtgaga tacaataaca atatactgct 240 ctctcagtat cagtcattgc caatctcacg tagggacgta tcgctttcaa cagtatattg 300 acactctcat aggcattggc atttggcctc tcgcagccct taaacttgcc aatgggcaat 360 ctggcatgag gcagtggaag ttcatgccgt cattacgcac gtagtcttga aacttgttta 420 cagaaactgc gattttctgc tgaagctctt taaaaacgtg tgccttttgg ctgaagttat 480 ctctaaatgc cagggttcta ctgcggagga gtcttttgag tttgacaatt ctgacactga 540 tccatcattt caactggggc tgagtgaagc taggcgttgt aaagaggaag tttttgcagc 600 atgtgatagg tgtgatgctc ttttatgcta cgttcatttt ttggaggaca caaatgtgtg 660 tgatcatggc aataggaaga aaacagcatt gaccaagaaa aagaggaaaa gaaaacagaa 720 ttgaccaaga aaaagaggaa aagaaaacag aattgaccaa gaaaaagaag gaaaacaaag 780 cagtgaaaga aaatagaaag gtgcctgaat ccagtgaaca accagaacat ttcgttcttg 840 aaggctcacg aaaagaacat acagttgaaa aacagccacg aatcaataag caaaaggcag 900 ccaagagaaa attggcttca ggtgaagatt atttcagccc ttatacaaag aaagtcatgc 960 ctgcaaagaa aatggggaaa gcctgcttag gtgttacgtg caagaataga gcactagagt 1020 gtgatacaat cacagagtca ctgaggaaag acattttcca agaattctat tcacttgcaa 1080 gcctccaact ccaacgtgag ttcattctgc gccatgtagt gaacagtgac aagaaaaagg 1140 aaagctcaca ccaaagtcgg cgccagaaga ccaatgtata ttacctgact gctcaaggaa 1200 gccttgttaa agtttgcaaa aaactctttt tgaacacttt agccatatca gacagaacag 1260 tgagaacagc cttcaagaaa ttgacaccca gtggaacaat agagaaagaa agaagaggtg 1320 gtcggcagtc aactaaaatt gttgaaagag acacaaaaat tcgaaatgac attgaagagc 1380 acctgcggcg gttcccaagg atggagtcgc attattgccg tgcaagcact gcaaaggagt 1440 atctgcaccc agacctgaac cttagaaaaa atgtactcca tgtttctaga gcagtaccat 1500 gggactgacc cctccaagtt tcagtacgta cagtcgagtg ttcaaaaaag caaatctggc 1560 gttccacacc cccaaaaaag accaatgctc actgtgcaca acgttctttg aaggggatga 1620 aacaacaaag gctgaactca aggacgcatt cgagaaacat gtctctgaaa agaaaagagt 1680 aagtaaagga ggaatgcaag caacgggcat caaaggatga aacagttctg agtgctacat 1740 tcgatctgca gcaagtaatc tatcttccca tctcgaatga aagtgctttg ttttacaagc 1800 ggcgcttctc caactacaac ctgacttttt ataaccttgg acgtactgac tgccactgct 1860 tcctgtggaa tgaaagccaa agcaagcgtg gaagctcaga aatatccaca gctgtgttcg 1920 aagcgcttca gcactatgac agttgtggta aaaagatagc atacctcttt tcggatggct 1980 gtccggggca gaacaaaaac actatcattc caacaatgtt gctatactct gtgaacacac 2040 ttccaaatct cagagagatc tcccagcggt tctttgtgac aaactacggc caaatgaagg 2100 gggactctgt tcatagcact atctcctctg cgatgtctgc agctgggaac gtctatgtac 2160 caacagagct gacgcccata ttcagacttg ccagaccaaa gcagccttac accgtgatgc 2220 cgcttgagtt ctctgacttt tggaatttca agcaattgtc actcgacctg cgtgttctgg 2280 gagttcaaag agattcagag agtgacacta tcaactggaa caaaatgatg gagttcagag 2340 ttctgaagga gtgcccaaca acaatattct tcaaaaccag ccatattgag tcagattaca 2400 gaacaatcca attgaagaga cacacaaagc aaaccacaga gagtttcact ctcttcaaac 2460 tcaacacaga accaaccaaa gttccaatgg ctaagtacaa tgacttagtg tcgctgtgct 2520 ctggcgtaac gcctgtggta agacaagctg atcacaaggc tttctatttt tcacttcccc 2580 attaaatttg actgagatat tcttggatat tcatagtgtt acttactgat acatctacat 2640 tatgtgaatt gctgtccaac atggttgcag ttcaaatgag ctaaaacgtc tccaaaatat 2700 gtctttattt gttcaaattt tcttcaataa acaattggaa atcgtagtga gatctcatgt 2760 ttttttatgc cattgtgatt tggttttctc attttgcgta ttatgaaaca aaaattagaa 2820 acacggtgtt tttccttcat ataatacaaa gaaaatataa taatacgaca cttggtgaga 2880 cgtccattga tggtgggacc gatgaaggta tatctcacga agtgacgtat cggattttct 2940 gccgattttc tcacaggaat gatacaatac tgccgaaaac agcatgtgga gtgaaagtgt 3000 accgctcaaa gaagctttga tccaaaccgc atagattcca gtcttccggt tttgacgcca 3060 gagcaaaaag aaaagtgcca aaaactttaa agctctctcc tgcaaaactc taattttgca 3120 gatacgacac ttcgtgagat gaccgacg 3148 // ID BEL-161_AA-LTR repbase; DNA; INV; 485 BP. XX AC AAGE02024744; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-161_AA_; KW BEL-161_AA-I; BEL-161_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-485 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024744; Positions 46470 45986. XX SQ Sequence 485 BP; 143 A; 76 C; 102 G; 164 T; 0 other; tgttgcggta acttgcgccg taccctatga gtggtgaaag agagcatcaa ccatatccat 60 atctctgaca agtcgcattg aactgttgta gagatagccg tgtattgtgt tgcttccatt 120 gctgcatcta ttagcgttgt aattatttgc ctaaatctgc cggttaacag gaattatttg 180 gtccaccata ggttgtaggg taataaagat cgtgcttggt gcctgtaaag taagaactaa 240 aaaatactga aaaacaagta aaatatgtaa ttcatcaaaa tatttacagt ttgattcaca 300 cattgaattt ggtgtcagtt ggttggagtt ttaaggagta ggatctgacc agtgtgagta 360 actttattgt tgttgtgatt gcaaatactt attattaaat gattaaatat agcttttagc 420 gtttctggat tcaacatttg gtgtagtttc ttgctgaaaa gagtcccgaa tactcctacc 480 caaca 485 // ID RTE-3_BF repbase; DNA; INV; 4202 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-3_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4202 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4202 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1701-1701 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 765..4148 FT /product="RTE-3_BF_1p" FT /translation="SGTPRVAFDSGKDLRNPIGQSPPALSRAAPGQLGTDP FT SRSACFIGCLELRVLLDKWIVCRAPDKQESKEKRRQKTQPIRIGSWNVRTM FT RTGLSDDLTVIEDIRKTAAIDRELYRLNIDIVALQETRLPDSGSLKEDSYT FT FFWQGKGMEETREHGVGFAVRNTLLHMIEPPTGGTERIITLRLSTHEGPVN FT LLCVYAPTLQATSEVKDQFYGQLDSAIKKIPVSEHIFILGDFNARVGTDQE FT SWQTVLGHHGIGKMNENGQRLLELCCYHNLCVTNTFFQNKAIHKASWRHPR FT SQRWHQLDLVITRRTSLNSVCNTRAYHSADCDTDHSLIAARIKLRPKKLHH FT MKKKGQPKIDVSKTMLPDRNQKFLECLEGTLNNIQPQDAEHRWETLSKTIY FT SAAAQSYGKKERKNTDWFEAYISELEPVMDTKRKALVSYKQNPSSQNLQAL FT KAARQEAQRASRRCANNYWLLLSERIQLASATGDIRRMYEGIKQATGKPIK FT KSAPLKAKSGEIITDKDKQMARWVEHYLDIYSTENSVSQDALDNIEDFSVL FT AELDADPTIEELSKAIDSMSNGKAPGEDNIPAEIIKSGKSVLLEPLHELLR FT LCWKEGKVPQSMRNSKIVTLYKNKGDRTDCNSYRGISLLSIVGKVFAKVVL FT TRLQVLADRVYPESQCGFRAERSTTDMIFSVRQLQEKCREQQRPLYIAFID FT LTKAFDLVSRRGLFQLLRKIGCPPQLLDIIISFHEDMKGVVSFDGETSEPF FT AIRSGVKQGCVLAPTLFGIFFSLLLKSAFGHSTQGVHLHTRSDGKLFNLAR FT LRAKTKVRSVLIRDMLFADDAALVAHVEDELQQLLNQFAHACSEFALTISI FT KKTVVMGQDVPQPPVVTIGSEVLEVTDHFTYLGSTVTSNLSLDKEIDRRIA FT RAAGVMTKLGTRVWNNSHLTLNTKLEVYRSCVLSTLLYGSETWTTYAKQEN FT RLESFHLRCLRRILGISWRDRVPNTTVLERSCSLSIHLLLCQRRLRWLGHV FT SRMKDGRIPKDILFGELATGKRPVGRPALRFRDVCKRDLKLTDIDPASWEQ FT IAADRNRWRHTVKDGLAKGQERRTEHLESRRRKRKEKPQQGNPSAFICPNC FT GRDCHARIGLQSHSRRCQPP" XX SQ Sequence 4202 BP; 1157 A; 1086 C; 1075 G; 884 T; 0 other; tctgtaaatg gctgtgtgat gcgtctagtg tgtaggcgta gcggtagtgt gcttgccact 60 tctcgctttc agccctcacc gctggctagc ggagctgcat gcagcactgc gaaaagagag 120 ccgagtgcgt aagtctctcc tgccagtaca tagcctctcc acagcaagtc cccattgcag 180 tgcctcctcg tggctacaga cggaaactgg gcaccgtaag gccccaggct aaactgcagg 240 gagctcggag gaggtctggc ccccagacgc acggcatgca tggcccaccg gcgtgtggac 300 acgccctgat gcctgcgaac cagaccccca gctatgggca aatagcacgg gtagacggag 360 ctcgtcagcc ttggatggca gttcgtctag gggaaggaaa accctgattc aaaaacctcc 420 gctgccttgc ggctataccc agtcctggga aaggctacgg gagttaaccc agagagaaaa 480 tccggagtgg agtacgtgag gcggttggct gtcaaactct gtcatccttc cggcaactcc 540 tgcagccaaa ccaacgccaa gtgtcacgcc tcgcgttccc ttggaccacg tcggtgaggt 600 cgagaggggg gtcctgttgt gtttttgggc agcgcaggtc ctccataaac ctgcccaggc 660 tagcgctctg gagaggccac tccagtcgcc cccatcactg ggggtgagaa acaaaccggg 720 agacagcagt ttacgggtta taagtccttg ctaaattgac gtaaagcggg acgccacggg 780 ttgccttcga cagtgggaag gatcttcgta atcctattgg tcagtcaccg cccgctttaa 840 gccgggcagc ccccggccag ttgggtactg acccgtcacg atctgcctgc ttcatcgggt 900 gcttggagct tagagtctta ctcgacaagt ggatcgtatg tcgcgcacca gacaaacaag 960 aaagtaagga aaagagaaga caaaagaccc aacccattcg catcggtagt tggaatgttc 1020 gtaccatgcg taccggcctg tcagatgact taacggtcat agaggatatc cgtaagactg 1080 cagcgattga tcgcgagctc tacagactga acatagatat agttgccttg caagaaacac 1140 gtctccctga cagtggatcc cttaaggagg actcctacac cttcttctgg caagggaaag 1200 gtatggaaga aactagggaa cacggagtcg gctttgccgt caggaatacc ttgctgcaca 1260 tgatagaacc acccacaggg ggcacagaaa ggatcattac cctacgtctc tccacgcacg 1320 agggtcctgt caaccttcta tgtgtttatg ctccaacgtt acaggctact tctgaagtca 1380 aggaccagtt ctatgggcaa cttgacagtg ccattaagaa aatcccagta tcagagcaca 1440 ttttcatcct aggtgacttt aatgctaggg tgggcacgga ccaagagtca tggcagactg 1500 tgctaggaca tcatgggatt ggaaaaatga acgaaaatgg acaaagactc cttgagctat 1560 gctgctatca caatctctgt gtaacaaata ccttcttcca gaataaagct attcacaaag 1620 catcctggcg acaccccagg tcgcaacgct ggcaccagct ggaccttgtc attacacgac 1680 gtacttccct aaacagtgtg tgcaatacaa gagcatacca cagcgcagac tgtgatactg 1740 accactctct tattgccgca aggataaagc taagacctaa aaaattgcac cacatgaaga 1800 agaaaggcca gcctaaaatt gacgtcagta agactatgct acctgacaga aaccagaaat 1860 ttctagaatg ccttgaagga actttgaaca acatccaacc acaggatgca gagcacaggt 1920 gggaaacact gagcaaaacc atatacagtg cagcagccca gtcgtatgga aagaaggagc 1980 gaaagaatac agactggttt gaagcataca tatcagagtt ggaacctgtc atggatacaa 2040 aacgcaaagc cctggtttcc tacaagcaaa atcccagcag tcaaaactta caagcattaa 2100 aagctgcccg acaggaagcc cagcgggcct cacgtcgttg tgcaaacaac tactggctcc 2160 tgttgtcaga gcgcatccag cttgcttcag caactggaga catcaggagg atgtacgagg 2220 gcatcaaaca agccacgggc aaaccaatta aaaagagtgc acccctcaaa gcaaagtcag 2280 gagagatcat cactgacaaa gacaaacaaa tggcacgttg ggtagaacat tacctggaca 2340 tttactcaac agaaaactca gtctcacaag atgcacttga caatattgaa gacttctctg 2400 tcctcgcaga gctggacgct gaccctacca ttgaggagtt gagtaaagcc attgattcca 2460 tgtcaaatgg gaaagcgcct ggtgaagaca atatcccagc tgagatcata aagagtggga 2520 aatctgtctt gctggaacca ctacatgagc tactacgtct atgctggaaa gagggcaaag 2580 tacctcagtc aatgcgaaac tccaagatag tcactctgta taagaataag ggagaccgta 2640 cggattgcaa cagctataga ggcatctctc tgctaagcat tgtgggaaaa gtctttgcca 2700 aagtggtcct taccaggctc caggttctgg cagaccgggt ctacccggaa tcccagtgcg 2760 gcttccgagc cgagaggtcc acaactgaca tgatattctc cgtacgtcag cttcaggaga 2820 agtgccgcga acagcagagg ccactataca tcgcctttat cgacttgacg aaggcgttcg 2880 accttgttag cagaagaggt ttattccaac tgctgaggaa gattggttgt cccccacagc 2940 tgctagacat catcatctcc ttccatgagg acatgaaggg tgtggtcagt ttcgacgggg 3000 aaacctctga gccctttgca attcggtcag gtgtgaaaca agggtgtgtg ctggccccga 3060 ccctgttcgg gattttcttt tctctgctgt tgaagtctgc cttcggtcac tcaactcaag 3120 gtgtccacct acacacccgc agtgatggga agctgttcaa cctggcccga ctgagagcca 3180 agacaaaggt ccgctcagta ctgatcaggg acatgttgtt tgccgacgat gcagcactgg 3240 tcgcacatgt tgaagatgag ctgcaacagt tgctcaacca gtttgcccat gcgtgcagtg 3300 aatttgcgct gaccatcagc ataaagaaaa cagtagtcat gggccaggat gttccacagc 3360 cacccgtcgt caccataggc tcggaagtgc ttgaagtgac agatcacttc acgtatcttg 3420 gatcgacggt tactagcaac ctgtctttag acaaggaaat tgacagacgg atagctaggg 3480 cggcaggcgt gatgaccaag ctcggaacga gagtgtggaa caatagccac ctgacgctta 3540 acacaaaact tgaagtatac cgttcttgtg tactcagtac gcttctgtat ggtagcgaga 3600 cctggactac atatgccaaa caggaaaacc gccttgaaag cttccatctt cgttgcttaa 3660 ggcgcattct tgggatttct tggagggaca gagtacccaa caccactgtt ctggaacgct 3720 cctgctccct gagcatccat cttctcctct gtcagcgacg tctacgctgg ctgggacacg 3780 tatcccgaat gaaagatgga cgcatcccga aggacatcct gtttggtgaa ctagcgacag 3840 gaaaacggcc agtaggacga ccggcactgc gctttagaga tgtctgtaag cgtgatctga 3900 agctaactga cattgaccca gctagctggg aacagattgc agcagaccgc aacagatggc 3960 gtcacacggt caaggatggt cttgcaaagg gccaggaaag acgtacagag catcttgagt 4020 caaggagacg caaacgcaaa gagaaaccgc aacaggggaa cccctctgcc ttcatatgtc 4080 caaactgtgg ccgcgactgc catgccagga tcggtctaca gagccacagt agacgctgtc 4140 aaccaccgtg acctgataca gagcgctacc atcatctgga aagatggaag gatgcctact 4200 ac 4202 // ID Gypsy-15_CQ-I repbase; DNA; INV; 5256 BP. XX AC AAWU01024898; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_CQ_; KW Gypsy-15_CQ-LTR; Gypsy-15_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5256 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 409-409 (2011). XX DR Genome; AAWU01024898; Positions 15206 20461. XX CC Positions [2488-3021] - Reverse transcriptase CC Positions [4174-4644] - Integrase core CC 'CCCCG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 838..1755 FT /product="Gypsy-15_CQ-I_1p" FT /translation="MTGPPLDGAGQVPPVPSPVFSPASYNLPNFKYIHLPP FT SEVRNAWISWIRWFESIMAAAGVIDSLTKKMQLMAMGGAELQSAYYGLPNV FT EPVGPTVTPYEDAKEKLDQHFSPKHHDSFERFLFWSMHPAEDESIEKFCLR FT VQQKAEKCYFGKTDTESRHIAILDKIIQYSSEELRQKLLEKEKLTLDDAMK FT TINAHQSVRYQAEKMSSNVTNKAPTPTVVNRMYDGNRKDGENSGSQRGCRQ FT CGYPSHRNGESCPAADRKCLRCNHVGHFRSVCTTRFPSNDVSNLNLIILEK FT RLTKFVLGVKRSEA" FT CDS 2002..5136 FT /product="Gypsy-15_CQ-I_2p" FT /translation="MEADETQRCAHIQREVIFIVLLFLKVGLQTNYFYRID FT SHKRFLAYGRVPLKLITAFDANLEIDDNGHTLETKTSFFVIEKGQQPLMGK FT ETAQKLGVLKIGLPSTHQHSVHRVKTNSNAFPKMKGITVTLPIDRSVPPVI FT QPLRRCPIPLLDKVKSKIDELLEMDIIERVTRPTSWVSPLVPILKENGELR FT LCIDMRRANQAIQRLNHPLPVFEDFLPKFRNAKLFTTLDIRQAFHQVELSH FT DCRDITTFVTNWGLFRYTRLLFGVNAAPELFQNLMESILADCVNVAVFIDD FT IVIFGATEGEHDAAVKGVLLVLKRYGILLNDHKCKFKQHEIRFLGHKLSSN FT GVSPADEKVKSILQFRPPKTKEELRSFLGLVTYVSRFIPNLANQNALLRAL FT LKQESPFAWKVEHQGEFDRLKRVIGSAQHLGFYDPQDRTLLVTDASGEGLG FT AVLIQFKGSTPRIISYASKSLTDCEKTYPAIEKEALGIVWGVEKFKMYLLG FT INFELETDHRPLETLFTATSRPTARIERWLLRIQAFKFRVVYRRGSANLAD FT CLSRLGSHVADQWTDETEVYIRRIVAVSLSTFADCDDCCTFDVETEIFIRT FT IQESAAIDIEEVIQATMSDNEMRKLKECVQSGKWDNAEMKQYALFRTEYTV FT ANNLIMRGNKLVIPAGLRHRMCQLAHEGHPGESKMKTRLRDRCWWPGIDTD FT AVQTCKSCEGCRLVQVPDAPEPMSRRSLPEKAWVDVALDFLGPMPGGEYVL FT VVIDYYSRYTELAVMNKITARETINQLKRMFRVWGPPRTITLDNAKQFVST FT ELHEYCQLNGIHLNHTTPYWPQANGEVERQNRSLLKRMKIANALYGEWKNE FT LDRYLEMYNNTPHSVTGKSPNELLQNRRLRSKMPHVDDLATTPPSTEFRDK FT DHEKKIMGKEREDAKRMAKPSNIEIGDNVLMKNLLPANKLSTNFLKEKFMV FT IGRNGSNVTIESVDSGRTYDRNTSHLKKVVDAPSNTGSEPELIVEDGEVLQ FT RADAEIIDDRADPDQNEPLPEEGLSRRSSRTRQIPQRYR" XX SQ Sequence 5256 BP; 1556 A; 1112 C; 1283 G; 1305 T; 0 other; caaatggcga cgaggcttcg agcccgaaaa cgcgtgcgac ggtaaatcag tgcaaaagtg 60 attatttttt tcggaaaatt tgttctggta cggtttgtgt tgaaaaaaat aagttatgat 120 aagaaaaaag gaattattaa gatgaaaatc agcagtgtcg agtaaaagtg gggtgcacag 180 gaaaaaatag ttcaaatcaa tattaaatgg ttttcccgat ctctgggttc gatttagtta 240 ggtttaataa tgaaaaaaaa aaaccaagga aaactgtgaa aaaaaaagag ttatttcacc 300 cgaaaaaaag ggaaagtagt ttaaaattaa ttgaagtgac gccactggga cgccattttg 360 caaaagcgaa aattgagtga gattctccgc cacaggcaca gagaatgaga gtgtgaagaa 420 acagtgaaaa aaaacgagtt ccccaagcaa cgttccgaaa atgagaagcg ctctcaatct 480 ctcttgcata ctgctgttct taaactacct atgtattggt tgaattttcc ccggttcagc 540 tggatatttt ctggttggat tgtgtgtgcg tgcgcgtggt gccgattaaa ttctggctgt 600 tgaatgaatg gtgccgatta aattcaggct gataattatt tggtgccgat tgaattctgg 660 ctgaggaatg ttttgtgccg attgaattct ggccgtcgat ttaatgatgc cgattaaatt 720 caggcaatta gtgatttgat aaataagaaa aaagatagct taaatgtgtg ttagagtgga 780 acggtctctt ttagagattt tgttttgatt tcttttaaca gtgaataaat tttcaaaatg 840 acgggtcctc cgctagatgg agctggacag gttcctccgg tgccgtcacc ggtcttcagt 900 ccagcctcgt acaacctgcc caacttcaaa tacatacatc ttccgccgtc tgaagttcgt 960 aatgcgtgga tttcatggat acgatggttc gagagtatta tggccgcagc tggggtcatc 1020 gacagtctaa ccaagaagat gcaactcatg gcaatgggtg gtgcagaact gcagtcggca 1080 tattacggct tgccgaacgt agaaccggtt ggaccgaccg taacgccgta cgaagacgcc 1140 aaggagaagt tagatcaaca tttttcaccg aaacatcacg acagttttga gcggttctta 1200 ttttggtcga tgcatcctgc ggaagacgag tccattgaaa agttttgtct tcgggtccag 1260 caaaaagccg agaagtgcta ttttggcaaa accgacaccg aaagtcgcca cattgcgatt 1320 ctggacaaaa tcatccagta ctcttcagag gaattgagac aaaagctgct ggaaaaggaa 1380 aaactgaccc tcgacgatgc catgaaaacc ataaacgctc atcagtcggt tcgttaccag 1440 gccgagaaga tgtcaagcaa cgtgacaaac aaagcaccga ctccaacggt cgtcaaccgt 1500 atgtatgacg gcaaccggaa agacggagag aactctggtt cacaacgcgg gtgccggcag 1560 tgcggctacc catcgcaccg aaacggagaa tcgtgtccag ctgccgaccg aaagtgcctc 1620 cgctgcaatc atgtaggtca tttccgatct gtctgtacaa cccgttttcc atcaaacgat 1680 gtaagtaact taaatcttat tattcttgaa aagcgactga cgaaatttgt attaggtgtc 1740 aaacgatcgg aagcgtaaac cggttttcca aaaccggaat caacaagctc cagcaagagc 1800 tccagcaaaa cggtatcgtt ccgattcgag ccgtaacgca tttcgtgtcg aagaatactc 1860 ctccgattgc gaaaccgaag atctcccttg ctataatgtt ggggaagccg atgatgaact 1920 gattaagtgt cgcgtcggag gggttgatgt tgtcatgttg attgactctg gctcgaaaca 1980 taatcttatc gacgatacga catggaagct gatgaaactc agagatgtgc gcatatccaa 2040 cgagaggtca tttttattgt tttattgttt ttaaaagtag gtcttcaaac aaattatttt 2100 tacaggattg actcccacaa acgattttta gcttacggaa gagttccgtt gaaactaata 2160 acagcatttg atgcgaatct tgagatagat gataacggac acaccctgga aactaaaact 2220 tcgttcttcg ttatcgagaa gggtcaacaa ccgttgatgg gcaaagagac tgctcaaaaa 2280 ttgggcgttt tgaagatcgg gttgcccagc acccaccaac attctgtgca cagggtcaag 2340 acaaacagca atgcatttcc taaaatgaaa ggcatcacgg ttacgcttcc gatcgaccgc 2400 agcgttcccc cggtcattca accattgcga agatgtccta ttccgctact ggacaaggtc 2460 aaatcaaaaa ttgacgaact cttggagatg gacattatcg agagagtgac ccggccgacc 2520 tcgtgggttt ctccgttagt tccgatcctc aaagagaacg gcgagcttcg gctttgcatt 2580 gatatgcgac gggctaacca ggcaatccag cggttgaatc acccattgcc ggtgttcgag 2640 gattttttgc ccaaattccg gaacgccaag cttttcacta ctcttgacat tcgtcaagca 2700 ttccatcaag ttgagttgag ccatgactgt cgggacatca caaccttcgt cacaaattgg 2760 ggactgtttc gttatactcg gttgcttttt ggagtaaatg cggccccgga attgttccaa 2820 aatctcatgg aaagcatatt ggctgactgc gttaacgttg ccgtctttat tgacgatatc 2880 gttatctttg gcgctactga aggagaacac gacgctgcag ttaagggagt tttattggtc 2940 ctgaagcgct atggaatctt gctcaacgac cataagtgca agttcaaaca acatgaaatc 3000 cgatttcttg ggcacaagct atcatctaac ggggtgtcac cagctgacga gaaagtcaag 3060 tcaattctcc agttcagacc accaaagacg aaagaggaac tacgcagctt cttggggctt 3120 gttacgtatg tttccaggtt catacctaat ctggctaatc agaacgctct gctacgagcc 3180 ttgctgaagc aagaatcgcc cttcgcgtgg aaggttgagc accaaggaga attcgatcgg 3240 ctgaagagag ttattggatc tgcacaacat cttggtttct acgaccctca ggatagaact 3300 ttacttgtca cggatgcatc tggagaagga cttggagctg tcctcattca attcaagggt 3360 agtacgcctc gtatcatcag ttacgcatct aaaagtttga cggattgcga aaaaacgtac 3420 ccagcgattg aaaaagaggc gttgggcatt gtttggggcg tggaaaagtt caaaatgtat 3480 ctattgggaa tcaactttga gcttgaaaca gaccatcgac cacttgaaac actctttacg 3540 gctacatccc ggcctactgc caggatagag cgatggctgc taaggataca agcattcaag 3600 ttcagggttg tgtaccgcag aggatcagcg aacttagccg actgcctctc cagattggga 3660 tcacatgtcg ccgaccagtg gacggatgaa acggaagttt acattaggcg tattgttgca 3720 gtatcgctgt cgacgttcgc agactgtgac gattgctgca ctttcgacgt cgaaacggag 3780 atatttatca ggaccattca ggaaagcgcg gccattgata ttgaagaggt gattcaagcc 3840 acgatgtctg ataacgagat gaggaaactc aaggagtgtg tacaatctgg aaaatgggac 3900 aacgcagaaa tgaagcagta cgctttgttt cgcaccgaat acactgttgc taacaatctc 3960 atcatgcgag gaaataaatt ggtgattcca gcaggcctac gacatcggat gtgtcaactt 4020 gctcacgagg gccatcccgg ggagagcaag atgaaaacgc gacttcgcga tcgttgttgg 4080 tggccgggaa ttgatacgga tgcggttcaa acgtgtaagt catgtgaagg ttgtagatta 4140 gttcaagttc ccgatgcacc tgagccgatg tcgcgccgct cgttacctga gaaggcttgg 4200 gtcgatgtag ctttggactt cctggggcct atgcctggtg gagagtacgt gctcgtggtt 4260 atcgactact atagcagata tacagaactg gcggttatga acaaaattac ggcaagggaa 4320 acgatcaacc agctcaaacg aatgtttcgc gtatggggtc cgcccagaac tatcacattg 4380 gataatgcga aacaattcgt ttcgacggag ctccatgagt attgtcaact gaacgggatt 4440 caccttaacc acaccactcc ttactggcca caggccaacg gcgaagtcga aagacaaaat 4500 cgatcgctat tgaagagaat gaagatcgcc aatgcgttgt acggagagtg gaagaatgag 4560 ctcgatcgat atctggaaat gtacaacaac acacctcact ccgtgacagg aaaaagccca 4620 aatgagctgc ttcagaaccg gcgtttgcgc tcaaaaatgc ctcatgttga tgatctggcg 4680 accacaccac cgagtacaga gttccgcgac aaagatcacg agaaaaagat catgggaaaa 4740 gagagggaag acgctaagcg tatggctaag cctagcaaca tcgagatcgg cgataacgtt 4800 ttaatgaaaa atcttctccc agcaaataag ctatcaacta acttcttgaa ggagaagttt 4860 atggtgattg gcagaaacgg atcaaatgtc acgatcgaat cagttgattc tggtcgtacc 4920 tacgacagga acacatcgca cttgaaaaag gtcgtcgatg ctccatcgaa cactggcagc 4980 gagccggagt tgattgtgga ggatggtgag gttttgcagc gagctgatgc tgaaataatc 5040 gacgatcgag cagatccaga tcaaaatgaa ccacttccgg aagaaggatt atctaggcga 5100 tcgagccgaa ctcgccaaat cccgcagaga taccgttgag ttttgggaac cccatgatgt 5160 aactatgtta taataaatag actagaaccg tattagttgt tcattaattt aaataaagaa 5220 aagaaagaaa aaacttaaac tttacaaaag agggga 5256 // ID BEL-16_DPu-I repbase; DNA; INV; 9042 BP. XX AC ACJG01007345; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-16_DPu_; KW BEL-16_DPu-LTR; BEL-16_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-9042 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01007345; Positions 11256 2215. XX CC Positions [7600-8115] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1579..3591 FT /product="BEL-16_DPu-I_1p" FT /translation="MLHSDKKRSKRKGSKDDTPNRLALPKVSKNLLGSERV FT VTDSEEIMNRVLAESEPETGARSRRAPKKNPDGTLNLRGSRPSSRASDRNP FT SGSPPVINLEGNTNEPQSPKIGLTVDDIQRQAGAQPSPEIPPPNLDGFGAT FT PERKTLSEWMREEFARFNGEMATITGRLTKTDARVTQVATRLEYHKEQIIG FT LNTELPKIRQLVTGLRSSITEDTTARITAVETNIATTMLAQRNATETLQTR FT MVAHIERAVGQLQNSLNELRQTGFDDTELREDVRQLVEAQGEEITRDLAQS FT LGQHSEEVGQAIVELKVRMSRMDSRLRSIYRGQQTAAQGPQPERTRQRAQG FT QQPERERTQRQRSSKRSLWDEGSEEEASEENEPSVRRPTSRRGATFHNQPP FT PSYRSSFTASGRRPAPGPEPAPAPNPAEAPAPAPAEAPAPAPAPAPAPAPA FT EAPAHAPAPAPAPAPAPAPAPAPAPAEPAPPYQAAGRPYVPPQAPDDVKRE FT LTLELEQLREELTQAYAQYGRARSDEARQHERIAFMTTYTYYKTAVENLMP FT MLTTAEKAELRLQQRSVEAERLSIAPAQAPMPIQTAKVRLSPPKFTGEVLR FT YHGWRQTWNTYDNNPIYTATEKSQMLEQALEGEAANATASFTFTAETYNTI FT LQVLADRFGDRNQAIGEREAKL" FT CDS join(4780..5433,5437..8976) FT /product="BEL-16_DPu-I_2p" FT /translation="MYEFMLLEAPRRHRKIVAHNTIFGWTISGSLGDPDCE FT SVANVAKVEIIPEEHEISSVDKHFAAFWGLESLGIPNKEQTDAEFIQTYID FT SIEADEEGRAVVRYPFTQERPDYDSCRSIADIRFENLWRSDSFTKEKRVRY FT HEVMMSYLREGFIEIADPNYDGPMAFLPNRPVEREDAETTKIRPVFDGSVH FT HKNRRSFNANLEIGPNLNPDIMGILMRFRYCIAWTADIEKAFLQIKIHEDH FT GQIVRFLWVEDPEAAIPKVVVYRWKKLTFGLASSPFILRAVLTWHLQKYET FT EHPGITARTLNQIYVDDWMGGAETPEKAAEEIRLINRILGEIKMELCKWST FT NSAELSNILRGEFEFSSRPSKLGIDEVFEHADKKALGILWDPVTDQFKFNA FT DKIISEARRLGEGLTKRQLFSLALTLYDPLGFVNPAIFSAKRAMQISWLAN FT GKWDQIVNSCTNENWEEFIKGLEDLQLIRIPRWTGIDVKEPSELHIFCDAS FT EVGYGAVAYEVQGKTIALIAAKSKVAPLPKKAMTIPRLELLSNLMASILGK FT YIRDQHRVEHRAHIWTDSQIATCWITASNKTPEIWVQNRVSKIREFGAEIH FT FCEGTQNPADLLSRGGSAKEIQKSNWWKGPEWLPEYDKYKPIEVKANLIQT FT DMVRPKHPDWRKEYSDWDRMVRAQAMIQRSVDGFKGKMYRPSLISDQVLFE FT IKDLKEIKLKKRPGAKTQASFKRLTYEELLTARYALIRYAQMESYPEDYDR FT IERGKVPRDKMFRKLRPVFDTTNRIIRCRGRAQALLARYHLDALILLPPDH FT YITTVLLRKLHRRVGHMGMNAMTIEVRNEFWLPKHCQVIKDILNRCVRCQK FT VNATAFSEKPAHLPIERLKIADPFSVTGIDMAGPYNVLVNEPSKAVVVQNR FT TESGIPEVESESSDESEEELVPERVDALTKKKKKKKKEKKQPKFAIDGRQI FT VKVYIVMFTCATTRAIHLEVTRGKSAAAFINAFRRFCALKYTPRTVYSDNA FT LEFDSTARYLRRLWQCKSVYDFMANRNIAWRFSASVAPWWGGFWERMIKTV FT KQALHKTFNPKQMDFDMFHTVITEISDIVNNRPLGYIANDETALTPNQLIK FT GGFVNKFDTEPPEEEELRGADSLFLTNRETARRKLVAEWWTEFVPTYLKDL FT NRFHQDQTPSKSVKLGQVVLIHQDYVKRINWNIGRVIELIKGRDHLVRKVK FT LVMVDSKGKTSIVDRPVQNLYPLEVEPGIIDADFAKKPSRFGDHVTSSGQI FT CKRTYMGGVWKTTALDDEATDCRPHVVQPRETTIASDFNGGATNRPSDVAS FT TTGARPRHGPPKMPVINVPASADLERSNPIEWIDDEDLTPKQLRAEVRRRD FT RAQAAWRPKPSAKPDGVIRTSARCWTDKPATSPAK" XX SQ Sequence 9042 BP; 2771 A; 2300 C; 2424 G; 1547 T; 0 other; tggtcttcga accggttaaa acggaccttt agttgcgact gaaactaatc tctgtcttat 60 ttttgcagtg tcgagaatca cgagcaagat cgtcccaagc cgacaattcg tagccgaccc 120 gcctaggtcg ctgaacgaat ctatcgctaa catcaaagcg tacgccacga agcttgtgga 180 cgacaaccgt aacggagaca tcaccgacgg aaagagacag ggctggaggc tactcgaagt 240 cctgtttcca attgagttct acgaactcac aggggccact ccacccttgg ctttgcaaca 300 cacaaagcct atccaagtgg gagatagcaa aagcggagaa attggaaaga gaggccttgg 360 cactcttgac tcccgaacag agggctcagt tcgacgagga cgaagcgcaa attcggagag 420 acaaagaggc gcagaacgct ctacgacaag aggttcgcgc ggccaaaaag aggaagcgag 480 acgcggtgga ggaagcgaaa gttcgcctgg cagagaaggt agcgatagaa aagcgactgg 540 cgtacgagcg tactaaacgc cgtagggtta acccagaagc tgatcagggt aactcggtag 600 ccccagttaa ggtcccgttt gtgacaccaa acggagaccc ggtcctacca ggagactcct 660 taaccgatct gtggacggcc atcgccacag ggaaattcgc catcccgggt acggcaagca 720 acccggtgac ctctgagatt cgggcagtgc tggctaagca cctgaatcag gcacaaactc 780 cgcagccatc ggcctcgacg agccaggcag agactccgca gccatcggcc tcgacgagcc 840 aggcaaaagg agctaagaca cctagaaggc aaccggcagc tggaaggaaa aaagcggcag 900 cttcgaaaga ggcccaagct caaatcctga accctccggt ccttctgagt ccaattagtc 960 ccaagcccaa gaaaggcaag gcacgagtcc ggccgacgac cgagacaccc gatcacctgt 1020 cagacgccga cgccctgaaa agggtcacgg tccttctgag aaggtgtgac gaggtagaca 1080 ccggagccaa gccggtgaaa accccgaagc ccagaaagaa gtcactgttc aaccctccgt 1140 caaaggagtt ggtagaggca gcggcagaac aaacggctcg tcgaaagagc ctccaagcta 1200 cgaaagagat cgaggtgaat ctagcggact cggccaagaa tctctacgga gaggatttgg 1260 aagagttacc tagcgagagg gaggacgaac tcctcaaaga gccggcccag gtaccggcaa 1320 caatccaagc ggcggtagag cagacgacgc gcccggtctc caaacgatcg ggcctacgga 1380 tcgactcagg cgacgagtcg gagccagagc tacagctcac agccgacgac gaggaggcgg 1440 cactcaagga agcggcactc aagaccctcc ccaaagcggc cgaatagccg tcttaatttg 1500 tgtaaattcc tgtgtttgtc gtcaataata caacacttgt aaaaactaaa aattgggtct 1560 gtctttcatt tcgattaaat gttgcactcg gacaaaaaac gttcaaaacg gaaggggagc 1620 aaagacgata cccctaaccg gctagcgctt ccgaaagtca gtaaaaatct gttaggctcg 1680 gagcgggttg tgacagatag cgaagagata atgaatcggg tactagcgga gagcgagccc 1740 gagactggag cgagatccag acgagcccca aaaaagaatc ccgacgggac gctcaaccta 1800 agagggagta ggccgtcgag cagggcctca gaccgaaacc cgagtggctc tccaccggtt 1860 ataaacttag aggggaacac gaacgaaccc cagtcaccga aaattgggtt aacggtagac 1920 gacattcaaa gacaggcagg agcccagcct tcgccggaaa taccgcctcc gaatctagat 1980 ggatttgggg ctaccccgga acggaaaact ctatccgaat ggatgagaga agagttcgcg 2040 aggtttaacg gtgagatggc aacgatcact gggcgactaa ctaagacaga cgcacgggta 2100 acccaggtag ccacgaggct agaatatcat aaagaacaaa tcatagggct taacacagag 2160 ttaccgaaga taaggcaatt agtaacgggt ctcagaagct cgataacgga agacacaacc 2220 gcacgaatta cggctgtaga aacgaacatt gcaacaacga tgctggcgca gagaaacgcg 2280 acagagaccc tgcagacccg aatggtcgcg catattgaac gagcagtagg gcagctccaa 2340 aacagcctga acgagctaag gcaaacaggg ttcgacgaca cagagttaag agaggatgta 2400 agacaactgg tcgaagccca aggtgaggaa atcacgagag acctagcaca atcgctgggc 2460 cagcactcag aggaggtggg tcaggctatc gtggagctca aagtaaggat gtcaaggatg 2520 gacagcagac taaggtcgat atatagaggc cagcagacgg cggctcaagg acctcagcca 2580 gagagaactc ggcaaagggc tcagggccag cagccggaga gagagagaac gcaacgacag 2640 cgaagctcta aaagatcgct ttgggatgag ggctcagagg aagaggcaag tgaggagaat 2700 gagccttctg taagacgacc gacgtctagg aggggagcga catttcataa tcagccccct 2760 ccatcatacc gcagttcctt tacggcaagc ggccgaagac cggcaccagg cccagagcca 2820 gccccggcac caaacccggc ggaggctccg gcacccgccc cggcggaggc cccggcaccc 2880 gccccggcac ccgccccggc acccgccccg gcggaggccc cggcacatgc cccggcaccc 2940 gccccggcac cagccccggc accagccccg gcaccggctc cagcgccagc ggaaccggca 3000 cctccgtatc aggcggcagg tagaccatac gtgccaccac aagcaccaga tgacgtcaaa 3060 cgagagttga cgctagagct cgagcagctc agggaagagc taacacaggc ttatgcccag 3120 tacggtcgtg caaggtctga cgaagccaga caacacgaaa gaattgcgtt tatgacgacc 3180 tatacatatt acaaaacggc agttgagaat ttaatgccaa tgctaacgac agcagagaaa 3240 gccgaactgc ggctgcaaca gcgcagcgta gaggcggaaa gactgagcat agccccagca 3300 caggcgccga tgcctataca aacggctaaa gtacgattat caccgcccaa gttcacggga 3360 gaggtactta gatatcacgg ttggcggcaa acgtggaaca cgtacgacaa caacccaata 3420 tatacagcca ccgaaaaaag ccaaatgttg gagcaggctc tcgaagggga agctgccaat 3480 gcgacggcga gtttcacgtt cacggcggaa acttataaca ccattctcca agtactggct 3540 gacaggtttg gcgaccgaaa tcaggccatt ggagagagag aagccaagct atgagaggcg 3600 gcaagtaaac cggtgggatc gaacccaacg tccgcccagc tacgagaaaa acatatcgag 3660 atatgtaatc aaaggcaagg cctcctgatc caaggcgtcc agccggcgac attcgattca 3720 catacggcaa aagagattct gaactccctg ccaggggtaa taacagaaaa atggagaatg 3780 gattgaaggc cagaaagggc accgacctta gaccaagtcc tacatcaact cgataggttg 3840 atccgaatta aatcgtcgga agaagcgtcg agacgggtcg actataccca gacaactcag 3900 gccaacgcag cgagcatagc cacaaaaacg tttaagatcc gagacaaaac tccgacaagg 3960 cttgtctgcg tgatatgcaa aagtgagacg catgtttcgc gagaatgcac gtacggcaat 4020 gcatccgaga gacggaaggt ggtgatggac gcagacagat gcttcaggtg ctttggccaa 4080 ggccataggt atatagattg ttcggagagg cgggcatgtc ggcactgtta ctccacctcg 4140 catgcgtcgg tcctatgtcc gcacgagaga gagagtcggg atagaacaaa aggcaagcag 4200 gcaccgagag ccgctagcag gtcaccgagt aggcctcgat caggcaaacc gttaatgcga 4260 tcgcaatcac cggcgccatc gaggacacca tcctcggaac ggccagtcaa tgagcaatcg 4320 aatgtaggag taatagaacc ccacagcaag gaaaaaattt ccgccggggt attgctgaac 4380 tttggagcga aagccaaagc agaaaaaaaa caaggattgg aagaataggt ctcactgttc 4440 ggaatattcg atagcggttg cggtaattcc tttatcacag cagccctagt acgaaaactg 4500 aatgcgaaaa tcatcggcag acaacgtatc agagtcttga cgtttgggtc tcatacaccc 4560 ttcgacgagt attgcgatat tgttcaagtc acgatagccg gcatatctag aacagtaacg 4620 gaaaatttca tagtccgaga ccacattcag acgctaacac cgtacaccga gacgaaactt 4680 ggtgaaagat tgatcaaaga aggtgaaacc ttggccgacc cgaggcaccg aagaccgtca 4740 caggtcaaaa cgatagatat cctaatagga aacagcaaca tgtacgagtt catgttactg 4800 gaagccccgc gcaggcatcg gaaaatagtg gcgcacaaca ccatcttcgg atggactatc 4860 tcggggtccc taggagaccc ggattgtgag agcgtagcga acgtggccaa ggtagaaatc 4920 atacccgaag agcacgagat ctcgagtgtc gacaaacact tcgccgcttt ctggggcctg 4980 gaaagcttag gtatcccgaa caaggaacaa acggatgccg aattcatcca aacttatata 5040 gacagcatcg aagccgatga agaagggcga gccgtcgtac ggtacccctt tacccaagaa 5100 cgaccagact acgactcatg tagatccata gccgacataa gattcgaaaa tctgtggcga 5160 agcgactcct tcactaaaga gaagcgagta cgatatcacg aagtaatgat gtcatactta 5220 cgcgaagggt tcatcgaaat agccgaccca aattacgatg ggccaatggc atttctaccg 5280 aacagaccgg tcgaacgaga agacgcggag accacaaaga tccgaccagt ctttgacgga 5340 tcagtccatc acaagaatcg gcgcagtttt aacgccaatc ttgagatagg accaaacctc 5400 aacccagaca tcatggggat cctaatgaga ttttgacggt attgcatagc atggacggcc 5460 gacatcgaga aggcgttctt acaaattaaa atccacgaag accacggaca aatcgtgaga 5520 tttctttggg tcgaagaccc cgaagctgct atacccaaag tggtagtcta ccgatggaag 5580 aaacttactt ttggactcgc gtcaagcccg ttcatactga gggccgttct aacatggcac 5640 ctacagaaat acgaaactga gcatccaggt atcacagcca ggactcttaa ccaaatctat 5700 gtcgacgact ggatgggggg agccgaaact cccgagaaag cggccgaaga gatacggcta 5760 ataaaccgaa tcttgggcga gatcaagatg gagctctgta aatggtcgac gaattcggcc 5820 gagctatcaa acatactacg gggggagttc gaattctcct cccggccatc gaagttggga 5880 atcgacgagg tttttgaaca tgcagacaaa aaggcactgg gaatcctttg ggacccggta 5940 accgaccagt tcaagttcaa cgccgacaaa ataatatccg aagcgcgaag gctaggagaa 6000 gggctaacaa aacgacaact atttagccta gcactaaccc tatacgaccc gttgggcttc 6060 gtgaacccgg cgatattttc cgcgaaacgg gcgatgcaga tctcatggct ggccaacggc 6120 aaatgggatc agatagtgaa cagctgcacc aatgaaaatt gggaggagtt catcaaaggt 6180 cttgaagacc tgcagctcat acgaattccg cgctggacgg gcatagatgt taaggagcct 6240 agtgaactac acatcttctg cgatgccagt gaggtaggat atggggctgt agcatacgaa 6300 gtacaaggga agacgatcgc cctcatagca gcgaaatcaa aggtggcgcc gttacctaag 6360 aaggcaatga ccatccctcg gctcgagctg ctgagtaacc tgatggcgtc tatacttggc 6420 aagtatatac gggaccaaca tcgggtggag catagagccc acatctggac cgattctcag 6480 atcgcgacat gttggatcac agcctccaat aaaaccccag aaatctgggt acaaaaccga 6540 gtctcgaaga tcagagaatt cggcgcagaa atccacttct gcgaagggac tcagaaccct 6600 gctgacctcc tctcacgggg aggctcagca aaggaaatcc agaaatcaaa ctggtggaag 6660 ggccccgaat ggcttccgga atacgacaaa tataagccta tcgaagtcaa ggcaaatctc 6720 attcagacgg atatggtccg cccgaaacac ccagactggc gcaaagaata cagcgactgg 6780 gacagaatgg tcagggccca ggctatgatt cagagatcag tggacggctt taaaggcaaa 6840 atgtaccgac cgtctctcat cagcgaccaa gtgctgtttg agataaaaga cttgaaagag 6900 atcaagctca aaaaaagacc gggagcaaaa acccaggctt cattcaagcg actaacatac 6960 gaagagttgc tcacggcgcg atacgccttg ataagatatg cgcagatgga gtcatacccc 7020 gaggattacg acagaatcga gcggggaaaa gtccctagag acaagatgtt tagaaagtta 7080 agaccggtgt tcgacacaac gaatcgaatc atccggtgca gaggtcgagc tcaggcactg 7140 ctagcaagat atcatctcga cgcattgata ctgcttccgc ccgatcacta tataacaaca 7200 gtcttgctgc gcaagttgca tagacgggtc ggacatatgg gaatgaacgc catgaccata 7260 gaagtcagaa acgaattctg gctgcccaaa cactgtcaag tgataaaaga catcctcaac 7320 cgatgcgtga gatgtcaaaa ggtcaacgca acagccttca gcgagaaacc ggctcacttg 7380 ccaattgaac ggctaaaaat agccgatcct ttctcggtca ccggcatcga catggctgga 7440 ccgtataatg tactcgtaaa cgagccaagt aaagcggtgg tggtccagaa ccgaaccgaa 7500 tcaggaatcc ctgaagtgga gtctgaatcc tccgatgaat cagaggaaga attggtacct 7560 gagagagtcg atgccctcac caagaaaaag aaaaagaaaa agaaagaaaa gaagcaaccc 7620 aagttcgcga ttgacgggag acagatagtc aaagtgtaca tagttatgtt cacttgtgcc 7680 acgacaaggg ccatacatct agaagtgact agaggaaaga gtgcggcggc gttcattaac 7740 gctttccgtc ggttctgcgc gttgaaatac acgccaagga cagtctattc agacaacgcg 7800 ctggaattcg atagtacggc aagatacctc agaaggctct ggcaatgcaa gtcagtctat 7860 gacttcatgg caaatagaaa catcgcttgg cgattctcag caagcgtagc tccctggtgg 7920 ggtggcttct gggagcgtat gataaagacc gtgaagcaag cgttgcataa gacctttaac 7980 ccgaaacaga tggacttcga catgttccac acggtgatca cagagatcag tgacatcgta 8040 aacaaccgtc cgttgggata catcgctaac gatgagacag ctctaactcc gaaccagtta 8100 attaaaggag gcttcgtaaa caagttcgat accgaacctc cggaagagga agaactgcgg 8160 ggcgcagact ccttgtttct cacgaacaga gaaaccgcta ggcgaaagct agtagctgag 8220 tggtggacag aattcgtccc aacctatctg aaagacctaa ataggttcca ccaagaccaa 8280 acgccgtcga aatcggtcaa actaggacaa gtagtcctga ttcaccaaga ctacgtcaaa 8340 cgaatcaatt ggaacattgg tcgggtgatt gaactcatca aaggtcgaga tcacttagtc 8400 cggaaagtca agttagtcat ggtcgactca aaaggcaaga cgtcaatcgt agacagaccg 8460 gttcagaacc tataccctct cgaggtagaa ccaggcatca ttgacgcaga ctttgccaaa 8520 aaaccaagca gatttggaga tcacgtaacc tcatctgggc aaatatgtaa gagaacttac 8580 atggggggag tgtggaaaac cacggccctc gacgacgagg ctacggattg ccggccacat 8640 gtggtacagc cgcgagagac aacgattgcc tcggacttta acggcggagc tacgaatcgc 8700 ccgtcagatg tggcatcgac cacgggagca agaccacgac acggtccacc aaagatgccg 8760 gtgataaacg taccggcgtc ggccgaccta gagaggtcta acccgataga gtggatagat 8820 gacgaggatc taacacccaa gcagctcaga gctgaagttc gtagacgcga cagagcacag 8880 gcagcatggc gaccgaagcc atctgctaaa ccagacgggg tgatccggac cagcgcgagg 8940 tgctggacgg ataaaccagc tacaagtccc gccaaataat gcatgtgaac accgatacaa 9000 atcgacggcc ctcggaaacc gtcggacggc aggtcaggcc ca 9042 // ID Gypsy-12_DPu-I repbase; DNA; INV; 4862 BP. XX AC scaffold_221; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_DPu_; KW Gypsy-12_DPu-LTR; Gypsy-12_DPu-I. XX NM Gypsy-12_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4862 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 739-739 (2010). XX DR Genome; scaffold_221; Positions 128863 133724. XX CC Positions [3708-4169] - Integrase core CC 'AAAAG' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 2934..4679 FT /product="Gypsy-12_DPu-I_1p" FT /translation="MVHGYRVPLRYSGARAGSGGMGDEKVSSVSQALVTIL FT DKYTLDAVDNPKLQRLKERLSPFIFTTTWRKGRHHAIPDALSRAPVNDPTP FT EDEGTSSEIQTFVRHVIVHHVNAMTRTDDDVVEPDAVVELPHLPDPMLDDL FT RVAAASDADYAELIAAIIDGFTVPRHQTHQSVRQFWKIREDLSVDDGLVLF FT GQRIVIPKSARRDLLRKLHAAHQGIVRMKRRARQTVFWPGISNDITTLVES FT CQNCQERLPRQQKEPLFRDPLPTRVFEDVSADLFQLGPLHVLVYADRLSGW FT PVVHEWRHDPSACEVTQAVIENFVDLGVPVRFRSDNGPQFEAHSFQTKLRQ FT WGVVWGNSTPNYPQSNGQAEAAVAAMKDLMEKISPAGVASDEFSQGMLEFR FT NTPRENGLSPAEMVLGHSLRSIIPAHHTSYATSWQSVMEARERQAAMDAFV FT KFRYDENARSLAPLSIGTNVRIRDPKSKFWDRVGVVVGIGRYQSYRIKFAN FT GSVLWRNRRLLRPMVAVAGPSKTSSARKRAAALGIPEATTACQPTRAQPPR FT ILIAEAQSTPPRRLCFGGSASENVLSCSMSERISA" XX SQ Sequence 4862 BP; 1019 A; 1347 C; 1411 G; 1085 T; 0 other; tggcgcagtt ggacttacct ctcgttctca tcgtgtagag gtatttgtgt tacatttttc 60 ggaattgagt acgcccgtga gaaaaagtac agtgtagttt gggcgcgtcc tttgtctccc 120 acacgcgttg gggcggcggc catcttgctt cctttgttct cgagccaagg cgagtgcctc 180 gtcggcagcc atcttggctt cctttgttcg aactgttccg agttgtattg tggaaagtgt 240 gcttgctttc tcgactacca cgtacctttc tctgcgtgtg atttcttcat cgactgtctt 300 cttttgtgcg tgacgcttta ctttactcaa ggcattctta tttttccgtg gacggcggcg 360 gccgccttgt gatcttcttt atgtcttcgg tggccaagtg ttcctgcacg ggccgcgctc 420 attgtgttta tatcgtggcc gtgtgaacta gcttcacgtg acagtgggtt cgacgtgtgg 480 tgtggttatt ttatgttcgt cccgggcatc accacaactt acgatcgtcg tctcgcacga 540 agccggagtc cattcagcca gggccgtttc aatcttcgca gaacatcgcc atgtcgacgg 600 tagcggacgc actggacgcg gcagtggccg ccacggcagc agccacagcg gccgcgggtc 660 gcatcgacgc ggttgaacaa aatcaaatcc agatgacggc acagctcacc ggcattgccc 720 agcaacttca agcattactc ggcggtggcg ctggtggagg cggtggcgct ggaggaggcg 780 gtggcgctgg cggcggtggc gctggaggag gcggcggcgc tggaggaggc ggcggcgctg 840 gcggtggtgg tgccacgcag agacgccgga tcgatccgtc cggcttggag aaactgcatg 900 cagacatctc tcttccgcaa ttacgcacgt ggagaaatcg ctggggtgat ttttgccagc 960 taaaccaatt ggctacttat ccggtggccg agcagatggc cgctttcagg atggttctcg 1020 acccagcgat gcagcagatc gtggaagtgg ctcttggaat tttaccaaca tcggcgctat 1080 ctccgacaga tgtactcgat cgcatcagca cttacgtacg ttcgaagcga aatatcgcgt 1140 tagattgagt cgcgttcgaa gaatgtgagc agagtgcggc ggaaacattc gatgatttct 1200 acatccgttt acggaactta gctgaggcgg ccgatttatg cgggacgtgt ctggacacgc 1260 ggatgacgac ccgcgtcatg gcgggaattc gggacacaga cgcgaaaagg aaattgttgg 1320 ccctgagtcc gtttcctaca gcccaacagg ccatcaacat atgtcgagcg aggagtcagc 1380 aagagcgaac gagaaatcct taagtaatcc accgacggtg tcgtatgtgc catcgaaggc 1440 ccagcgttcg ttcaaaccga cagataccag caggtgcggg tcgtgcggtc gctcggcgca 1500 tcgcaatgga gagccgtgcc cagccgttgg aaaacagtgc cacaattgtg gcgaaaacaa 1560 ccatttctcc ccgtgttgcc caaaaaagcc gaagacagat gcaacggccg gcggcggcgc 1620 tagtggcggt ggtggcggcc aaagaggacc catacgggga taccgaccca ccaagaggat 1680 accgaccgct cacgggtgca tatgaagcgc atcgtggtgg gcaacgtccg cgcacacagg 1740 cggcatcggc ccgctcctac catcccgtta ctgcttcgtg atgcggctgg caaggtactg 1800 accacagtca acaaaactat tccggacggc ggggccgaag ccaccgttgg gggcatggac 1860 gtgttacgtg cgcttgggtt atccgaaaag gacctcacat cttccacctt caacttggta 1920 atggcggaca agtcaacccc cctgctggtg gtgggagaaa aggaatacct ggccgactat 1980 gaaggtgtga ccgccaatat cacaatcatg ttcagcccag acgtaaaggt attattgttc 2040 cggtgacgga agcgtcggat tgggccgcgc cactagtcgt tacccggaaa gcagacaact 2100 cactgcgact tgttgtagat cacaccaggt tgaacagaca tgtgcgcaga ccgacgcatc 2160 ccactcgtac gccacgcgat gcagtcgccg agatctccgg tgacgcccgg ttttttacaa 2220 ccttcgatgc agcaaatggg tactatcaga ttcccctcca cccatcgtca caacacctaa 2280 ccatatttat gacgccgtgg ggcagataca agttcctgcg ggcgtcgatg ggcctttgta 2340 gttccggtga cgaatacaac cgccgtgctg accaggcgtt tcaaggcgtc aacaataccg 2400 tccgggtggt cgacgaccta ctccgtttcg acagctcctt tccggagcac gtggccgggg 2460 tttgtgcggt gctgtcggcg gccaggagcg ctgggattac cttcagcctc aagaaattcc 2520 attttgcccg cagccaggtc ctttgttgga tttcaaattc agcagggcgg cgtttcggcc 2580 gacccggata agattcgagc catctctgat ttcccacagc caactaacat cacagagctc 2640 cgctccttta tgggattggt ggagcagcgt gctggattct ccacggacgt cgcggcggcg 2700 aaggcaccct tgcggcctct cctcagcact aagacgcctt ttttgtggac cactgatcac 2760 gacagtgcgt tctcagccgc caaaaaagct ctcgtggcgc caccgatcct ggctcatttc 2820 gatcccacat tggagacgtc cctacaggtg gacgcgtctc gcaagaacgg tatgggctac 2880 gtcctccttc agctacacgg gtctacgtgg aagcttgtgg acgccaattc ccgatggtgc 2940 acggataccg agtcccgcta cgctatagtg gagctcgagc tggcagcggt ggaatgggcg 3000 atgagaaagt gtcgtctgta tctcaggcac tcgtcacgat attggataaa tacacgctgg 3060 acgcggtgga caatccgaag ctgcagcgcc tgaaagagcg cctttcaccc ttcattttca 3120 cgacgacgtg gcgtaaaggg cgccatcacg caattccaga cgctctatcc cgagccccgg 3180 tgaacgatcc gacaccggag gatgaaggta caagttccga aattcaaaca tttgtgagac 3240 acgtcatcgt ccaccatgtc aatgcgatga ctcgaacgga cgacgacgtg gtggagcctg 3300 acgccgtggt ggagcttccc catttgccgg accccatgct ggacgacctt agagtggcag 3360 ccgcatccga cgccgactac gcggaactca tcgcggcaat tatagacgga tttacagtcc 3420 cgcgacacca gacccaccag agcgtgcgcc agttttggaa gatacgcgag gatctgtctg 3480 tggacgacgg tctggtgtta ttcgggcagc gtatcgtcat cccaaaatcg gcccgtcgcg 3540 atcttttgcg gaaactacac gccgcccacc agggcattgt acggatgaaa cgacgggctc 3600 ggcagacggt gttctggccg ggcatttcca acgacattac taccctggtg gagagctgcc 3660 agaattgtca ggagcgactc ccgcgccagc agaaagagcc cctatttcgc gaccctttac 3720 cgacccgagt gtttgaagac gtatcggccg atttgttcca gctcggcccc ctacatgtcc 3780 tggtgtatgc tgatcgcctc tccggatggc cagtggtcca tgaatggcgc cacgatccgt 3840 ccgcctgcga agtcacccag gctgtcattg agaatttcgt cgacctggga gtgccggtca 3900 ggttccgttc agacaacggc cctcaatttg aagcgcacag tttccagacc aagttacggc 3960 aatggggcgt cgtgtgggga aattcaacgc ccaactaccc gcaaagtaac ggccaagcgg 4020 aggcggccgt ggccgcaatg aaggatctga tggaaaaaat atcaccagcc ggtgtggcat 4080 cggacgagtt ttcccagggt atgctggagt ttcgaaatac gcccagggag aacggcctat 4140 cacccgcgga aatggttttg ggtcattccc tgcgctcaat catcccagct catcacacgt 4200 cttatgcaac gagttggcag tcggtgatgg aggcgcggga gcggcaggcg gccatggacg 4260 cattcgtaaa attccggtac gatgagaacg ctcgctcact tgctcctctt tccatcggta 4320 ctaacgtccg catccgcgat cctaagtcga aattttggga cagagtcggc gttgtcgtcg 4380 gcatcgggcg ctaccagagt tatcgcatca agttcgccaa tggcagtgtc ttgtggcgca 4440 atcgtcggct gctgcgacca atggtggctg tggccggccc ctcgaagacg tcatcagcga 4500 ggaagcgagc agcggcgttg gggatcccgg aagcgacgac agcgtgccag ccgacacgag 4560 ctcagccccc ccgcatcctc atagcggaag cgcaatcaac accgccccgc cgcctttgct 4620 tcggggggag cgcgtccgaa aacgtactgt cgtgttcgat gtctgaaagg atttcagcat 4680 aatagtgtct gtgtattgtc gtataccccg tcagagttac gcatttagct tttgtatcga 4740 ggggctcgtg tatttgttct tgttccgtct ttgtgttcgt gtgttgtctt tgagcaggat 4800 tcttttgatg tgtgtaagct gttaatttcc gacgtacggt gaattaacag cttgggaggg 4860 gt 4862 // ID BEL-26_CQ-LTR repbase; DNA; INV; 629 BP. XX AC AAWU01010404; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-26_CQ_; KW BEL-26_CQ-I; BEL-26_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-629 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 206-206 (2011). XX DR GenBank; AAWU01010404; Positions 23380 22752. XX SQ Sequence 629 BP; 205 A; 129 C; 119 G; 176 T; 0 other; tgttaccacg cgaacagata gatgttttac ttttcacctt tcaaagaatg gtagcaaatc 60 agataggaaa ataacagaaa atgctcactt atttgcatac aaatgttttg tgctcttact 120 tgagaatcat aacgttagaa ttaggacagg taaatttctc atagtttaat gggacttata 180 agaatcacgg gtagctacag tgagtcttcg tagaagcaaa catatgcgaa agaattatga 240 gtatccgtat cacccatgtg tatgaaggat gactgtgcag gtccaggtag tatgtgaaag 300 agagatacaa ctagatttgc ctacctatct cgttctgcct aacatcaagg attagtttta 360 aatcatatac atcaataatg caacacgtag ctttaagttt atattgtaca ttaaattcca 420 aataaaccaa gttcggagat cactcccaaa aacaactgtt ttacttcagt caatttgaga 480 actcctcatc ctcgtattgg aacgagcagc ctccagcagc cagcagtctt cagcagtcac 540 ccgggatcag caccagcagt tctacttgga gtcgtcggca gccagagcgt aagttccgca 600 aattaaaaac gtacccacgc gtttttaca 629 // ID hATm-19_HM repbase; DNA; INV; 3590 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-19_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3590 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1913-1913 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 564..2876 FT /product="hATm-19_HM_1p" FT /translation="MKKISVIYQCKKFFLMLLMKLVAERWLISNTEFKEPI FT VCGTKGITLRLSRSWSAFSKIARNKSQKSYIKHWEPKLDKLLDIALCKCRI FT VYCSENDPPCKESCDSGAHCLCKCDLNLRIPQKELIWIKNQREKQGSISSM FT QEFGLDVKETAKLRLKHLRKSIDQARLARQREEASKSKLESLETFYPPAVR FT FDVENVDVTIEMHNNSNDVIVSDSEEVSDIDKVSDSDIVSDSKEDSTIEKQ FT DNYVKRNYLDISNAAVASVRFGVSSTATAAIINGLLKDIMKAGKLVEDKKN FT LICDSMKVFRAKERAMKYSRIEENFDCERNIITGIFVDGRKDKTLILIHDK FT TTGTFRKKIIKENHITVTEEPRGRYLTHYTPKPKSETSKPAKQCAIGLFNW FT LMERGKDLNLQLIGSDTTNEMSGWKGGMIHFVEELLNRKLFRSFCWLHINE FT LPFRHIVEKLDGPTSSDKGWCGAVGKLFSKVDNLERITQFTPISLLEPLVN FT ISEDFLKQMSTDSLVAWKYLNAILKGKLDPEVEALKCGRLSHSRWLTCGMR FT CLLLYMSKHDLEPVNAEILRLLATWVTQVYLPMFFEIKVKHDIKYGSCHLL FT KLFRLWQKQDDRIKEASKLYLRNESWWAHPENILVSLLCSDNSEQRLFAVD FT TILAVRKGSELGSNDVRPFKVPQTINLDACDIKELIDWKKEVVTEPIFTTN FT MSLVQLNALIVSPLQLPPYSLHTQSCERAVKLVTEAAESVCGWDKRDGFIR FT TQLRNRELMPLFKSKKDFQ*" XX SQ Sequence 3590 BP; 1248 A; 533 C; 632 G; 1177 T; 0 other; ttagggtata tcatactttt gcatgttatg aatttcattc agtacatcac ttttcctatg 60 tattcatata ccctgagttg aatggtataa ttttttttag tgaaaaaaca agtcttgcta 120 ccgggaaaac gtttttaatt ttccgaaaat attgtgtttt tttctaattg atgactattt 180 atgctataaa tgtacataat aattatttaa aatactgttt agccctatgc ttaattttga 240 taatatgttt tgagtttttt taaaaattat tttatgtata agtatcagtt aatatacaaa 300 tataccttat aaatattata attactattt aaatttaatt tcaaattaat tgctgttatt 360 gctgccttta tatctttcag actcataata tattaaaaaa agaaaaatgg cgttattagt 420 tagcaagaaa accagaaaat cgattaacac caaactaagt gagtatcttg gaggtcctag 480 aaagttcctc caaactgaat ttccaacctt acgtgactgt ttacagcgtt gtctggatct 540 gcagcgagac agaatactag tctatgaaaa aaatcagcgt aatttatcaa tgcaagaaat 600 tttttctaat gttgcttatg aaattggttg cagaacgatg gcttatttcc aacacagaat 660 ttaaagaacc catagtttgt ggaacaaaag gaataacact aagactcagt agaagttggt 720 ctgcattctc taagatagct cgaaacaaat ctcaaaaatc ttacataaaa cactgggagc 780 ctaagcttga taaactttta gatattgctc tttgcaaatg cagaattgtt tactgcagtg 840 agaatgatcc cccttgcaaa gaatcttgtg atagcggtgc tcattgcctt tgcaaatgtg 900 acttaaattt aagaattcct caaaaagagc ttatttggat aaaaaaccag cgagaaaaac 960 aaggcagtat ttcatctatg caagaatttg gtttggatgt taaagagact gccaaactta 1020 ggttaaaaca cttacgcaag tcaatcgatc aagcaagact cgcacgacaa cgagaagaag 1080 cttcaaaatc taagttagag tcgttagaaa cattttatcc accggctgtt agatttgatg 1140 ttgaaaatgt ggatgttaca atagaaatgc ataataattc aaatgatgtg atagtcagtg 1200 atagtgaaga agtcagtgat atcgataaag ttagtgatag tgatatagtc agtgatagca 1260 aagaggactc gaccatagaa aagcaagata attatgttaa gcgaaactac cttgatatta 1320 gtaatgctgc cgttgcttct gttagatttg gtgtatcatc gactgctact gctgcaatca 1380 ttaatggtct actgaaagat attatgaaag cagggaaact tgttgaagat aaaaaaaatc 1440 ttatatgtga tagcatgaaa gtttttcgtg ccaaagagcg tgctatgaag tattccagaa 1500 tagaagaaaa tttcgactgt gaaagaaata taatcaccgg aatttttgtt gatggtagaa 1560 aagacaaaac tttaattctt atccatgata aaaccacagg caccttcaga aaaaaaatta 1620 tcaaagaaaa tcatataact gtgacagaag aacctagagg tcgttacctt actcattata 1680 caccaaagcc taagtcagaa acatctaaac cagccaagca atgtgctatt ggattattca 1740 actggttaat ggaaagaggt aaagatctta atctccaact aattggcagt gatactacaa 1800 atgaaatgtc tggatggaaa ggtggaatga ttcattttgt ggaagaactt ctcaatcgaa 1860 aactttttag atctttttgc tggcttcaca ttaatgagct tccattccgt catattgttg 1920 aaaaacttga tggaccaact tcttcagata aaggatggtg tggagcagta ggtaaactat 1980 tttccaaagt tgataatctt gaaagaatta ctcaatttac tcctatttcg cttctggagc 2040 ctcttgttaa tatcagtgaa gatttcttaa agcaaatgag cactgacagt ttagttgcat 2100 ggaagtactt gaatgcaatt ctgaaaggaa agcttgatcc tgaagtagaa gctcttaaat 2160 gtggtagatt gtctcacagt cgatggctta cttgtggcat gagatgcttg cttctctata 2220 tgagtaagca tgatcttgaa cctgttaatg cagagatctt aagactgctt gcaacttggg 2280 tgactcaggt ttatcttcca atgttctttg agattaaagt aaagcatgac attaaatacg 2340 gttcttgtca tcttttaaag ctgtttcgac tttggcagaa gcaagatgat aggattaaag 2400 aagcatctaa actttatctc agaaatgaat cttggtgggc tcatcctgag aatatacttg 2460 tctcgcttct ctgttctgat aactcagaac aaaggttatt tgcagtcgac acaatattgg 2520 ccgttaggaa gggaagtgag ctgggctcaa atgatgttag accttttaaa gttccacaaa 2580 caataaacct tgatgcatgt gacattaaag aacttattga ttggaaaaaa gaggtcgtta 2640 ctgaaccaat tttcactaca aacatgtcgt tagtccaact caatgcattg atagtttctc 2700 ccctgcagct acctccatac tcattacata ctcaaagttg cgagagagca gtcaaattag 2760 tcacagaagc agctgagtca gtctgtggtt gggataaaag agatggtttt ataagaacac 2820 aattgcgaaa tcgtgaacta atgcctttgt ttaagtcaaa aaaagacttt cagtagctta 2880 tttaattact aatttggtat tttttgattt tatttaaact ttgttgtgca taacattaaa 2940 tgtttgaaaa aattcttgtt ttttatgttg atcttaaaag aatgttaggg ggctgaaaag 3000 aaaatttgta gcaaaaaaac aacagcaaga tttttaattt aaacgggtat tttgagtaag 3060 tttgagggga gggggctttc gcccaatcca gtgtgtgggt atatatatat atatatatat 3120 atatatatat atatatatat atatatatat atatatatat atatatatat atatatataa 3180 taattgtaat ataccttgat cttataatca taataaaact atcaaatatt gaaaaaaaaa 3240 aaaggggggg gggggagccg tagccctttt tacgcactct ggatctagaa tcctggaagt 3300 taaataataa ttatatctgt tttatttcac gtagtttact ttttacctaa tataattgta 3360 atttttagta ttaattgact aaaacctttc caaaattaag ctttagttaa aatagactaa 3420 tttttgaaaa aaaaccctaa ttttaaaacc cgtttcccgg tagctagaca tcaatatttt 3480 gtaaaaatat tattatattt gaattcccta tacccaaata cataggaatc caagcgtgct 3540 agtcaaaatt ctaaaaattt aatttttggc tctagtatga tataccctaa 3590 // ID Gypsy-604_AA-I repbase; DNA; INV; 4356 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-604_AA_; KW Gypsy-604_AA-LTR; Ty3_gypsy_Ele17; Gypsy-604_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4356 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3278-3754] - Integrase core CC 'CCCA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 104..4315 FT /product="Gypsy-604_AA-I_1p" FT /translation="MPVTRKMAAKTKREQLEDEPFYDVASDGMDDDQTDGS FT KGDTQAQLPTQTHSFALHPHDVGKLIPEYSEGDGVVKWLRRIDHFRVLYGW FT TEQVCLLYASTRLSGAAANWYRRQEEQIFNWPQFKDQLILAFPETFDEADI FT HRQLEAVKLEKNESYESYVYRVDALAQKGDFSTSATLKYIIKGLRYDKVFA FT SLLSVQYKSTLELLRHIKWVASNLNMIPQTSTRFEKTVAFNASESASPLTC FT FNCRESGHKSIDCPKPPRKERCGRCLKVGHATDKCSAAAYHPKQKQPFQLG FT TNATGRSATNAVFPSDQRNAVSVDESSQAMVELDVAEDRIRALALVDSGSY FT ASLIKYSLLPPTIPIRNTREEIVGINSIKVNVVGEWKTFVYIDHIKYNVRF FT LVVSEDTMQVGLILGRSFLKDNEIQFIRFQMSDIPESSDNLSKLSEWSMFE FT DMHFSYNLEEDYLDLDVGDNSETYSFRRKLEDILKVSYFDRQRPETPSVKY FT QAKIRLKENKIFSATPQRLSVFEKNELDKIIVDLLEQGVIRESDSPYTSRV FT VLTKKKNNSYRMCVNYKPLNRIVERNRFPMPIIEDQILKLQGKRYFSTLDL FT KNGFYHVELTEESKQYTSFVTNSGQYEFNRLPFGYANSPAIFVKYLSKVLE FT PFIQDGRIVVFIDDMMIASENAEEHLQTLSEVLEVLSDNHLELQFSKCRFL FT KTHTEYLGYDVRFNCIQPSDRHIQSIRDFPVPIDRKSLQRFLGLVNYFRKF FT ISGFNILAAPLYDLLKEDREFVLTSDHLKSVEKLKQALISKPVLRIYSPTA FT ETELHTDASSAGYGGVLLQRQTDDGLMHPVMYFSKKTSPTESKLHSFELET FT LAIVYCLQRFRVYLFGLHFKIVTDCNSLKQTLEKRDINPKIARWAMYLEQF FT DFEICHRSGSKMQHADALSRVNVHLIEEENRIESSVFENALYVAQLQDSGV FT TSLKQAVNLGSMKDYEVRDEVLYKRIGNKSLLYVPEQMVPSVINKFHNEMG FT HFGVDKVCNLIKRTYWFPRMREQVQAHTKSCITCIAYNPRNKRYDGVLFNV FT EKPKAPFEVLHIDHLGPLERGKGKNEYILAVVDACTKFIKLYPTKTTKTTE FT VMKSLRNYFHAYSTPKVLISDRGTAFTSLAFNKFTEDHGIRHIKVATACPR FT ANGQIERYNRTMMPLLSKLVEETGNSWDSVITDAEYLLNNTVNRATATTPA FT ILLFGVEQRRRIDYDLTQYLLELNEEADMRELQKIREVASKQNQRQQEYNK FT RMYDKHCRRNTSYSAGDLVMLRRVNVPGERSKLKKKFRGPYMVKKVLDKNR FT YIVGDLDNFQVTRTRFEGVFDPLNMRLYQKAKGKENEGNVLNDEQHQGVEY FT LEDDKERSLMNEEEYQDVEYLEEEDTDVEYEDVEYLEDEYE" XX SQ Sequence 4356 BP; 1353 A; 820 C; 1046 G; 1137 T; 0 other; gattcagaag tgggatacca ccggatgtac cggcagtgta tagcggaaga tcggaaacgt 60 aactccgaat gttttgaggt gcgacgaagg atcttttttc acgatgccag tgacgaggaa 120 aatggcggcg aaaacgaagc gtgagcagtt ggaagacgag ccgttctacg atgttgcaag 180 cgatgggatg gacgacgacc agaccgacgg cagcaaagga gacacacaag cgcaattacc 240 aacacaaacg cacagctttg ctcttcaccc acacgacgtt ggcaaattaa ttccggaata 300 ttctgaaggt gacggggtgg taaaatggtt acgtcgaatc gatcattttc gtgttctgta 360 cggttggacg gaacaagtgt gtttgctgta tgcttcaact cgtctgagcg gcgcagcggc 420 taattggtac agaaggcaag aggagcaaat tttcaactgg ccgcagttca aggatcagct 480 gatcttggca tttccggaaa cctttgacga agcagacatc catcgacaac tggaagctgt 540 caagcttgag aagaacgagt catacgaatc gtatgtatat cgtgttgacg ctctggcaca 600 aaagggtgac ttcagcactt cagcgactct gaagtatatc atcaaaggac tccgctatga 660 taaagtgttt gcaagtttat tgtcagtgca atacaagtcg acgttggaat tgttacgtca 720 tatcaagtgg gttgcatcga atttgaacat gattcctcaa acgtcaacac gattcgagaa 780 aaccgttgct ttcaacgcat ctgaatcagc aagcccttta acctgcttta actgtcgtga 840 gtctggtcac aaatcaatcg attgtccgaa gcctccacgt aaggaacgat gcggtcggtg 900 tttgaaggtt ggacacgcga ctgacaagtg ttcggcagcg gcgtatcatc ctaagcagaa 960 gcaaccattt cagctgggta cgaatgcgac tggacggagc gcaacgaacg cggtattccc 1020 gagtgatcag agaaatgcag tatcggtgga cgaatcttca caagcgatgg tagaattgga 1080 tgtcgccgag gaccgcattc gagccttggc cctcgtggac tcaggtagtt atgctagtct 1140 tataaaatat agtttgttac cccctactat tccaatcaga aatactagag aagagatagt 1200 cggtataaat agtatcaagg taaatgttgt tggtgagtgg aaaacctttg tttatatcga 1260 ccacatcaaa tataatgttc ggtttttggt cgtctcagaa gacacaatgc aggtagggtt 1320 gatattaggt agaagtttcc tcaaagataa tgagattcag tttatacgat ttcaaatgag 1380 cgatattccc gaatcatctg ataatttgtc gaagctttct gagtggtcca tgtttgagga 1440 tatgcatttt tcgtataatt tggaagaaga ttatttggat ttggatgtgg gtgataacag 1500 tgaaacgtat tctttccgac gtaagctcga agatattttg aaagtaagct atttcgatcg 1560 acagcgcccg gaaactcctt cagtgaagta ccaagcgaaa atacgactaa aggaaaacaa 1620 aatcttctcg gcaacacctc aaaggctaag tgtatttgaa aagaacgagc tcgataaaat 1680 tatcgtggat ctgttggaac agggtgtcat acgcgaaagt gattcgccgt atacttctcg 1740 ggtggtctta acgaagaaga agaacaatag ttatcgtatg tgtgtgaatt acaaaccatt 1800 gaacagaata gttgagagga atcgctttcc catgcctata atcgaagatc aaattttgaa 1860 acttcaaggg aagcgatact tttcaacgct ggatctaaaa aatggattct accatgttga 1920 actaaccgag gaaagtaaac agtacacttc ttttgtcact aacagtgggc aatacgaatt 1980 taaccgtctg ccatttgggt acgcaaattc tccagcaatt ttcgtaaagt atttatcaaa 2040 ggtgctcgaa ccttttattc aggacgggag aatagtggtg ttcatagatg acatgatgat 2100 tgcttcggaa aatgctgaag aacatcttca aacgttaagt gaggttttgg aagttctttc 2160 agacaaccat ttagagcttc agttttccaa gtgtcggttc ctgaaaaccc atacggaata 2220 ccttgggtac gacgtgaggt tcaactgtat ccaacctagc gataggcaca ttcaatccat 2280 ccgtgatttc ccggtcccca ttgatcgaaa aagtcttcaa aggtttctcg gactcgttaa 2340 ttacttcaga aagtttataa gtgggtttaa tattctggca gctcctttgt atgacttact 2400 gaaagaagat cgagaatttg tattgacgtc agatcacctc aagtcggtgg aaaagttgaa 2460 gcaagcactt atttctaaac cagttcttcg aatctattca cctactgcgg agacggagct 2520 gcatacagat gcttcttcgg cgggatacgg tggtgtactg ctacaacggc agacggacga 2580 tggattaatg catccggtta tgtatttcag caagaaaacg tcgccaactg agtcgaaact 2640 acacagtttc gagctagaga ccttggcgat agtatattgt cttcaaagat ttcgtgttta 2700 cttatttggt ttgcacttta aaattgtgac tgactgcaac tcactgaaac aaacgttgga 2760 aaaaagggat attaatccta aaatcgctcg ttgggccatg tacctcgaac aatttgattt 2820 tgaaatctgt catcggtccg gatcaaaaat gcagcatgca gacgctctgt cacgagtgaa 2880 tgtgcatctc atcgaagagg agaaccgaat tgaatcttcg gttttcgaga atgctcttta 2940 tgtagctcag ttgcaagatt caggggtaac tagtttgaaa caggctgtga acctgggttc 3000 tatgaaggat tatgaggttc gagatgaggt tctgtataag agaattggaa acaaatcctt 3060 gctatatgta ccagagcaaa tggtaccttc agtaatcaat aagttccaca acgaaatggg 3120 acactttggt gtagataaag tttgtaacct gatcaagagg acttattggt ttccccgaat 3180 gcgagaacag gtacaagctc atacgaagtc ttgtattaca tgcattgctt acaatccacg 3240 aaacaaacgc tacgacggag tactgttcaa tgtagagaaa ccaaaagcac cttttgaagt 3300 tctgcatatt gatcatctcg gtccattgga gagaggaaaa ggaaaaaatg agtacatact 3360 agctgtagta gatgcctgca cgaaattcat taagctctac cctacaaaga caacaaaaac 3420 gacagaggtg atgaaatctc ttcggaacta ttttcacgct tattcaacac ctaaggtttt 3480 gatatcagat aggggaactg cgtttacttc actcgcgttc aataaattta cggaggatca 3540 tggcattcgt catataaagg tagcgaccgc atgtcctagg gctaacggac agattgagag 3600 atataatcga acgatgatgc cactactcag caaactcgta gaagagacag gaaatagctg 3660 ggattcggtc atcaccgatg cagagtactt gctcaacaat actgtcaatc gggcaaccgc 3720 gaccacccct gccatattgc tattcggagt agagcagcgt aggcgaattg attatgacct 3780 gacacagtat cttttggaac ttaacgagga agccgatatg cgagaacttc aaaaaattcg 3840 ggaggtggct tcaaagcaaa accagcgtca gcaggaatac aataagcgga tgtatgataa 3900 gcactgccga agaaacacaa gctattctgc aggagatttg gtgatgctga gaagagtcaa 3960 cgtaccggga gagcggagca agttgaaaaa aaaattccga ggaccttaca tggtgaaaaa 4020 ggttctcgat aaaaaccgat atattgtagg tgatttggac aatttccaag tgactcgtac 4080 cagatttgag ggggtgtttg atccgttaaa catgcgacta tatcagaaag ctaagggcaa 4140 agaaaatgaa ggaaatgtat taaacgacga acagcatcaa ggtgtggaat atctagaaga 4200 tgataaggaa agaagtttga tgaacgaaga agagtatcag gatgtggaat atttggaaga 4260 agaggataca gatgttgaat atgaagatgt tgaatattta gaggatgaat acgaataatc 4320 tctttcagat gaatgatgtg tgcaggatgg ccgagc 4356 // ID BEL-117_AA-I repbase; DNA; INV; 5511 BP. XX AC supercont1.311; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-117_AA_; KW BEL-117_AA-LTR; BEL-117_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5511 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.311; Positions 488463 482953. XX CC Positions [4472-5065] - Integrase core CC 'GATAT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 576..1640 FT /product="BEL-117_AA-I_3p" FT /translation="MADARTKQQLINRRTTLIASLGRTEQFVENYAAERDQ FT GQVKLRLDNLDTVWMGLEDVQTQLEDLETTNEGMAQNLYYRSHNETRYFAI FT KAALQSYLPIVPSSVSAIPQSAVSGLSSIKLPTITLPEFDGDYNQWLTFHD FT TFVALIHSNPEVHDTQKFHYLRAALKAEAAQLVESIGISSANYTIAWQTLV FT SRYANDYLLKKRHLQALLDCPRMKKESASALHSLVDEYERHTKTLRQLGEP FT IDSWSTMLEHLLCLRLDDGTLKAWEDFATTAETPDYRCLIEFLQRRLRVLE FT SMSVNHQAQPASNSQPPAFRRPPFYKTVSHAAAEAVPRKCYSCDQQHPLFQ FT CPQFDKMSSPIA" FT CDS 2239..3315 FT /product="BEL-117_AA-I_1p" FT /translation="MVRLPRHPDFDHMLGESKQAALRRFHHLENRLNKEPA FT LKEEYHSFMLEYLTLGHMRLVPPSNLESHQSYYLPHHAVVKEASTTTKVRV FT VYDGSAKTTTGHSLNDALLVGPIIQDELLTLIIRFRKYPIALVAEIAKMYR FT QVLLHHEDVPLVRILWRFNSSEPISVYELLTVTYGLSPSSFLATRTLQQLA FT ADEGESYPLGGPALKKGFYVDDYIGGAETEEEAICLRNELDDLLEKGGFQL FT RKWASNNPTVLEGLEPSRIGTQRALKLGNEESIKTLGVSWEPRTDQLRFDS FT MQVTTRERSTKRSILSAVSKHFNPLGLTAPVIIRAKMLLQELWLQPCGRRS FT KRHYSSEMGHLLRRTS" FT CDS 3767..5422 FT /product="BEL-117_AA-I_2p" FT /translation="MTVDAFLGSNLWHHGPEWLTLSEEDWPVGAQQQKAPE FT DILERRETVAAVQNKQDVNPIFCTSSSYSRLIRVTAYCFRFIKACLRQTSS FT GPAGKAAFVTVDEMEMIRTKLVRLAQADVFEEEIRYLGKGKAVPKRSHLRL FT LSPFIDSEGIVRVGGRLRLAEQPYLTKHPILLPSSHPFTRKIAHHFHLKLL FT HGGGRVTLAAIRQEYWPIQGRRLVNSVLRTCFRCARASHTPTKQQTGQLPL FT DRITPITGIDYAGPVYMKPIHKRASPTKAYISVFVCFATKAVHLELVSDLS FT TPAFLTALRRFIARRGCPLHIHSDNGKNFQGAQNELHQLYELMQKEKSSGK FT IATYCSSQWIQWHMAPPKAPHFGGLWEAAVKIAKKHMHRQLGNAMLSFEDM FT STVLAQIEAAMNSRPLTPLNEDPNDLAVLTPAHFLIGTTMTALPDVDVSNI FT DISRLDHYQRLQHRAQQFWHQWKTEYLQELQKENRHNALNTAIQPGRMVVV FT VDEFLAPVKWPLARIINTIPGPDGLIRVVDLKTCKGIIRRPVMKICLLPLD FT ETIDI" XX SQ Sequence 5511 BP; 1484 A; 1479 C; 1166 G; 1382 T; 0 other; tattttggtg ccgtgaccag gatcaggttt cctcccagat tttccgccat cacaatacgg 60 actgcgacgc tattcccgcc atagttctgt accattgtgt gccccatcgg cattcataag 120 atcgccatct tccccgggtc cttattgtga ctctccgcca tcgagaatta ataggattaa 180 tacaaggcct agtaccgcta ggcacacggt acgtagaatt ccaagttgtg caggaagtct 240 cgcgggtgtg tttagatttt tttcgatttt ctcccgattc cgtgataccg cttcaaccga 300 cccgtttatc cggatacgct acacgctatt gaggtacccc accggcttat attagccacc 360 aattccggcc gaagtaacca ccactgaatc ctgccgtgct ctgtgtctcc gaatcatcgt 420 tttaaccgcc agttccagga gttttggaat agatatacaa ggctcaatac tagccttgag 480 aaggttagta ccgctccaac ttctctaccc tatttgtccg gctctaccgg gtctttcttt 540 gtgtcatccc atctagagca tatatccaag tcaagatggc ggatgcacgg acgaagcaac 600 aactcatcaa ccggcggacc acgctcattg cttcgttggg cagaacggag cagttcgttg 660 agaactacgc tgctgaacga gatcaaggtc aggtgaaact gcgtctcgat aatttggata 720 ccgtgtggat gggactggag gatgtacaaa cccagctaga agacctcgaa acgaccaacg 780 aaggtatggc tcaaaacctt tattaccgat cccacaacga aacccgctat ttcgcaataa 840 aagctgctct ccaatcgtac ctccccattg taccttcttc cgtaagcgca atccctcagt 900 ctgctgtctc tggcctctcg agtatcaagt tgccgaccat tactctgccg gaattcgatg 960 gtgattacaa ccagtggctg accttccacg atacgttcgt ggccctgatt cattctaacc 1020 ccgaagtgca cgacactcag aaattccatt atcttcgcgc ggcgctcaag gcagaagccg 1080 ctcaactagt agaatccatt ggtatcagct ccgccaatta cacaatcgcc tggcaaactc 1140 ttgtttctag atacgccaac gactatctcc tgaaaaaacg ccacttgcaa gctctactcg 1200 attgcccccg catgaaaaaa gaatccgcct ccgctttgca ttctcttgtc gatgagtatg 1260 agcgtcacac caaaacccta cgccagcttg gagagcctat cgattcttgg agcactatgc 1320 tcgagcatct actatgcctt cgccttgacg acggtacact taaagcatgg gaagacttcg 1380 caacgactgc cgaaacaccc gattaccgat gtctaattga gtttcttcag cgtcgccttc 1440 gtgtgttaga atcgatgtct gttaatcatc aagcacagcc agcatctaat tcgcaaccac 1500 ccgcttttcg ccgcccaccg ttctacaaaa ccgtttctca tgccgccgca gaagccgttc 1560 ctcgaaaatg ctattcatgc gatcagcaac atccactgtt ccaatgccct cagttcgata 1620 aaatgtcctc gccgatcgcc taaatctggt gaacaacaat cgcctttgcc ataactgctt 1680 ttgccaaaat catattgctc gcaactgcca gtgaaaatta tcttgccgat tttgtagaaa 1740 acgtcaccat tcattactcc accctggata ccacccaacc gatgctgtgc agcatccctc 1800 tacatcgcgc ataacaacat caacatcgaa gcccaccgaa aagcgagata ccataacgtc 1860 aaaccagcga acaccaacat ctacctacgc tgtcaccgcc caatcgagca acccatcaca 1920 acaatccaat gcgaatgttt tgctgtctac ggtcgtctta attgtttacg actctgatgg 1980 atcggcgcac cctgctcgag caacaatcct gtccttattg ataccatttt tggctggata 2040 gtcaccggta ggaaccaaaa tgcatacgaa actctcccag tagcttgcaa tattaccctt 2100 gccgatcccc tccacaaagc ccttgaacgt ttctggagca ttgaggaaat agatggtaat 2160 cgcaaatatt ctttggaaga acaacagtgt gaaactcact tcactgctaa tgtttcccga 2220 actccagaag gccggtatat ggttcgcttg ccccgtcatc cagactttga ccatatgctg 2280 ggagaatcaa aacaagccgc acttcgtcgc ttccaccatc tcgaaaatcg tctcaacaag 2340 gaacccgccc tgaaagaaga gtaccactct tttatgctgg agtatctaac cctaggtcat 2400 atgaggctgg ttcctccgtc caatctcgaa tctcaccagt cttattacct acctcatcat 2460 gccgttgtaa aagaggccag tacgaccaca aaggttcgag tagtatacga cggatccgcc 2520 aaaacaacta ctggtcattc cctaaatgat gccctgctcg taggaccaat catccaagat 2580 gaattactca cccttatcat ccgttttcgg aaatacccaa tagcccttgt cgcggaaatt 2640 gctaaaatgt accgccaagt tttgctacac catgaagatg ttccactagt tcgaatcctg 2700 tggcgcttca attctagcga acccatttcc gtttatgagc tcctaacggt aacctatgga 2760 ttgagtccat catcgttcct tgctactcgt accctccagc aacttgctgc cgatgaaggg 2820 gaatcatacc cactaggtgg tcccgcattg aagaaaggtt tttatgtgga tgattatata 2880 gggggagcag agacagagga ggaagcgatt tgcctacgaa acgagttaga tgacctgcta 2940 gaaaaaggag gatttcagct aagaaagtgg gcgtcaaaca acccaacagt actagaaggc 3000 cttgaaccat cacggattgg aacccaacgc gcattgaaat taggtaacga agaatccatc 3060 aagacccttg gagtcagttg ggaacccagg accgatcaac tgagattcga ttccatgcaa 3120 gttaccacta gagagcgatc aacgaaacga tccattctgt cggctgtatc aaaacacttc 3180 aacccgctcg gtcttacagc acctgtcatc attcgagcaa agatgttgtt gcaggagctt 3240 tggttgcaac cctgtggacg acgaagtaag cgacactatt caagcgaaat gggacaccta 3300 ctgcgccgaa cttcctaagc tttcaacatt ccgggttggt cgatacgcat ttttgccgaa 3360 ttctacgatt catttgcaca ccttcgcaga tgcatcccag catacatacg gcgcatgcat 3420 ttatgctcgt tccactaatg tcgaaggtaa aatacacgtt cagctgattg cttcgaagtc 3480 gaaagtagcc ccgcttaaac gattgtccat tcctcggctg gaactcttgg cagctgttct 3540 tgcagtgaga ctgcatcaaa aggttctcct ggcactggat attcctatct cagcctccta 3600 tttctggtca gattcaaccg ttgttctcga atggttgcgt gcacccccat acacttggca 3660 cactttcgtt gcgaatcgcg tttccgaagt ccaaaccact gttcctgaat cgcgttggca 3720 tcacgtcgct ggaaagcaaa atcctgcgga tctcgtatca agggggatga cagttgatgc 3780 ttttcttgga agcaatcttt ggcaccatgg acctgagtgg ttgacacttt ccgaggaaga 3840 ttggcctgtt ggagcgcagc agcagaaggc acctgaggac atcctagaac ggcgagaaac 3900 ggtggcggct gttcaaaaca aacaagatgt taacccaata ttttgcacgt cctcctccta 3960 cagcagattg attcgtgtga cagcatattg ttttcgtttc atcaaagcct gtcttcgaca 4020 aacgtcaagt ggccctgcag gaaaggcagc attcgtcacc gtggatgaaa tggaaatgat 4080 tcgcacaaaa ttagttagac tggcccaagc agatgttttt gaggaagaaa taaggtactt 4140 ggggaaaggt aaagcggttc ctaagcggtc acacttacgc cttctaagcc cattcatcga 4200 ttctgaaggt attgttagag ttggaggtag attgcgctta gctgagcagc cttacttgac 4260 caaacatccc atcctcctac ccagttccca tccatttaca cggaaaattg cccatcattt 4320 ccatctgaaa ttgttgcacg gtggcggccg cgtgacatta gcagcaattc ggcaggaata 4380 ctggcctatc caaggacgcc gtttggtcaa tagcgttttg agaacctgtt ttcgttgcgc 4440 tcgtgcttca catacaccca ccaaacagca aacaggacaa ctaccattag atcgaattac 4500 tccgatcacc ggaattgatt atgccggacc ggtctacatg aagccaatac acaagcgagc 4560 ttcaccaact aaggcctaca taagtgtttt tgtatgcttc gctaccaagg ccgtgcatct 4620 agaactcgtg agcgacctct ctaccccagc attcctcact gccctgcgga gattcatagc 4680 tcgtcgcggt tgccctctac acatccattc cgataacggg aaaaattttc agggtgccca 4740 aaatgaactt caccaattat acgaattgat gcagaaagag aagtcgtccg gaaaaatagc 4800 cacctactgt tcaagccaat ggattcagtg gcatatggcg ccacctaaag ccccacattt 4860 tggtggattg tgggaggcag cagttaaaat agctaaaaaa cacatgcatc gacaacttgg 4920 caacgcaatg ctctcgttcg aggacatgtc tactgtgttg gcacaaattg aggcagcaat 4980 gaattcccgg ccgttaactc cactaaacga ggatccaaac gacttggctg tgctcactcc 5040 agcacacttc cttattggca ctacaatgac cgcactccca gatgtcgacg tcagcaatat 5100 agatatcagc cgattggacc actatcagcg tctccagcat agagcccagc aattttggca 5160 tcagtggaag acagaatact tgcaggaact ccagaaggaa aatcggcaca acgcactaaa 5220 cactgcaatc caacccggaa gaatggtcgt cgttgttgat gaatttctgg ctccggtgaa 5280 atggccgctc gcaagaatca tcaataccat acccggacca gatggactca tccgtgtcgt 5340 cgatctgaaa acctgcaaag gaatcatccg tcgtcctgtt atgaaaattt gtctgttgcc 5400 gttagatgaa accatagata tatagaatgt aaacattttg atttgtttat gcttgaattg 5460 ttatgacagg agttaggcaa agtgaaattg aataatttca ggtggcggct a 5511 // ID Gypsy23-I_Dpse repbase; DNA; INV; 8600 BP. XX AC Unknown_group_816; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy23_Dpse; KW Gypsy23-LTR_Dpse; Gypsy23-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-8600 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1124-1124 (2009). XX DR Genome; Unknown_group_816; Positions 2314 10913. XX CC Positions [741-1166] - Reverse transcriptase CC Positions [2316-2825] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 717..3194 FT /product="Gypsy23-I_Dpse_2p" FT /translation="MYEELDRMLALGVIEESNSAWSCPVVLVRKPEKVRLC FT IDSLKVNEVSRKDAYHMPLIDGILSRLPKAEYITSLDLKDAYWQIPLEEGS FT RDKTSFTVPGRPLYQFKVMPFGLTNAPSTMSKLMDRIIPANLRNEVFIYLD FT DLLIVSDTFERHLSVLKVLAEKLSAAGLTINIQKSKFCVPEVRYLGHVVGH FT GIIGMDLDKIEAIKEYPAPKSVKQLRRFLGMTGWYHKFIKNFAAIAAPLTN FT ALKQKRQFIWSEEAEAAFTALKNSMCEAPVLHTPNFEKPFFIHCDASHSGV FT GGVLMQINEEGDEVPIAFMSRKLNKCQRANSVTEKECLAAILSIKKFRAYV FT EGHQFTVITDHASLKWLMSQSDLSSRLARWALKLQGYSFNISHRKGSRNIV FT PDALSRVFSPDLSEIEPEERIDLESEHFSSTDYVSLRAKVASSQAQMPDVK FT VVGRHIYRRNEHATGERVADELCWKIWIPKGLIDSVLKMAHEHALSAHGGI FT NKTLEKVPRYYYWPTLLRDVKEFINRCEICKCTKHPNRPLRPPLAKTGETE FT RFFQKLYVDFLGPYPRSRSGNVGIFVVLDHFSKFPFIKPVRKFTADAIVPY FT IEEQLFHCFGVPEKLVSDNGVQFKSHAFNSLLQQYGIEHAYTAVYAPQANA FT SERVNRSILAAIKAYINTDQGNWDEQLSSVACALRSAVHSAVKASPYQLAF FT GQQMITNGTTYQLLRQLEMLEDRWLHFSRDDSFDLMRGTAKEAMRAQHERN FT EKTYNLRSKEVSFRVGQEVYHRNFQQSNFVKGFNAKLAPVFVKSRVRRQLG FT QAYYELEDLQGRLIGKFHAKDIKQ" FT CDS 4600..6282 FT /product="Gypsy23-I_Dpse_1p" FT /translation="MCEAPVLHTPNFEKPFFIHCDASHSGVGGVLMQINEE FT GDEVPIAFMSRKLNKCQRAYSVTEKESLAAILSIKKFRAYVEGHQFTVITD FT HASLKWLMSQSDLSSRLARWALKLQGYSFNISHRKGSRNIVPDALSRVFSP FT HLSEIEPEERIDLESEHFSSTDYVSLRAKVASSQAQMPDVKVVGRHIYRRT FT EHATGERVADELCWKIWIPKGLIDSVLKMAHEHALSAHGGINKTLEKVRRY FT YYWPTLLKDVKEFINRCEICKCTKHPSIPLRPPLAKTGETERFFQKLFVDF FT LGPYPRSRSGNVGIFVVLDHFSKFPFVKPVRKFTADAIVPYIEEQLFHCFG FT VPEKLVSDNGVQFKSHAFNSLLQKYGIEHAYTAVYAPQANASERVNRSILA FT AIKAYINTDQGNWDEQLSSVACALRSAVHSAVKASPYQLAFEQQMITNGTT FT YQLLRQLEMLEDRWLHFSRDDSFDLMRGTAKEAMRAQHERNEKTYNLRSKE FT VSFRVGQEVYHRNFQQSNFVKGFNAKLAPVFVKSRVRRQLGQAYYELEDLQ FT GRLIGKFHAKDIKQ" XX SQ Sequence 8600 BP; 2473 A; 1751 C; 2120 G; 2256 T; 0 other; atttttgaca atttaaccat agttaataag aaaataacac aggaagaaaa agaggatagg 60 gagtctaaac ggaaggcacg agctaaacta cgtcgggaag aatttgccag aaatccaagg 120 ttagagaaaa aagctctggt atcgtcaata atatcagaag atctcaagcc atatgcagag 180 gtcacattat tagacagaaa gcttgttgga ttaatagata caggggcatc tattagttgc 240 attggtggag atgtggccag cgagctgctt gaaagccaat ttcaatttaa gcccatgaca 300 tccacagttc ggacagcggg tgggcagtct cagagagtgg ttgggaagct taaagcggaa 360 gtgggctata aaggtgttac cgagctgtta ccctatacat cgttccaacc ctaacgcagc 420 ctctgtatct gggcattaac tcttggacca aatttggctt attgccagcc gatttgtatc 480 cacaaggaag tgctaatgac acggtgaacc attgcgatga ggtgtggaag ccagaaaacc 540 aacgtccttt atcggaggtg caacgagaac aactgaatag aaccattcag tgtcttccgt 600 ctttcgcgga taatggttta ggaaaaacca cgttgttgtc ccatgtaata gatgtaggtg 660 aggcaaagcc cacaaaacaa cgacactatg cagtgtcccc agctatcgag aaattaatgt 720 atgaggagtt ggatcgcatg ttagcgctag gggtcatcga agaatccaac agtgcgtggt 780 cttgtccggt agtactcgtc cgaaagccgg agaaggttcg tctttgtata gacagcctca 840 aggtgaacga ggtgtctcgg aaggacgcgt atcacatgcc gctcatagat gggatcttaa 900 gccggttacc aaaggcagaa tacataacta gcctcgacct gaaagatgcc tactggcaaa 960 taccgctaga agaaggttct agggataaaa catcattcac agtgcctgga cggccgcttt 1020 accagtttaa agttatgccg tttgggctta ccaatgcgcc gtctactatg tccaaactta 1080 tggaccgcat tattccagcc aatcttcgaa acgaagtgtt tatatacttg gatgacctgc 1140 ttatagtgtc agacaccttt gaaaggcatc tgagtgtgtt aaaagtatta gcagagaagc 1200 tgtctgcagc ggggcttaca attaatatac agaagagcaa gttctgtgtg cctgaggtcc 1260 gctatctggg gcatgtagtg ggtcacggta ttataggcat ggacttggac aaaatagagg 1320 ccattaagga atatcctgct ccgaagtcgg tgaaacagtt gcgtcgattc ttgggtatga 1380 cgggatggta ccacaaattt ataaagaatt ttgcggctat tgcggcgcct ctgacgaacg 1440 ccctgaagca gaaacgccag ttcatttggt ccgaagaagc agaagctgca ttcacagctc 1500 tgaagaactc aatgtgtgaa gcgcccgtct tacacacacc gaattttgaa aagcctttct 1560 ttatccactg cgacgctagt cactcaggag taggaggggt actgatgcag ataaacgaag 1620 agggcgatga agtaccaatt gcgtttatgt cacgaaaatt gaataagtgc caacgtgcta 1680 actcggtgac ggaaaaggaa tgcttagctg ccattctgag catcaagaaa tttcgcgctt 1740 atgtagaggg acatcaattt acggtgataa ccgaccatgc ctccttgaag tggctgatgt 1800 cccaatcaga tctgagtagc agactggcca gatgggcttt gaagcttcag gggtattcct 1860 ttaatattag ccaccgaaaa gggtctcgaa acatcgttcc agacgccttg tctcgggtct 1920 tttccccaga tttgtcagag atcgaaccag aagaaaggat cgatttggag tcggagcact 1980 ttagttcgac ggattacgtt tcccttaggg ctaaagtagc gagtagtcag gctcagatgc 2040 cagatgtaaa ggtggtaggc aggcatatat atcgtcgcaa cgagcacgca acaggcgaaa 2100 gggtagcgga tgaactctgt tggaaaattt ggatacccaa ggggctcata gattcggtcc 2160 tgaaaatggc ccacgagcac gccctatctg cacacggagg cataaataaa acgctcgaaa 2220 aggttccgcg ctactattat tggccgacgt tgttaagaga tgtcaaagag ttcataaacc 2280 ggtgtgaaat ttgtaagtgt actaagcatc caaatagacc tttaagaccg ccactagcaa 2340 aaacggggga aacagaacgg ttctttcaga agctctacgt agattttctg ggaccgtatc 2400 ctagaagtcg tagtgggaat gtcgggatat ttgtggtgtt agatcatttt tccaaatttc 2460 cgtttattaa acccgtaaga aaatttaccg ccgacgccat cgtaccatac attgaggaac 2520 aattatttca ctgctttgga gtgccggaaa agctagtctc ggacaacggc gtccaattta 2580 agtcccatgc attcaactcc ttgcttcagc agtatggcat agagcacgct tacaccgccg 2640 tttacgctcc gcaggctaac gcgtccgagc gagttaacag atctatcttg gcagcaatca 2700 aagcctacat aaatactgac caaggtaact gggatgagca gctcagcagt gtagcctgtg 2760 cgttaagatc ggctgtgcac agtgccgtaa aggcgagtcc ataccagttg gcgtttgggc 2820 aacagatgat cacgaatggt accacgtatc agttgttgag gcagctagag atgttagaag 2880 atcgatggtt gcatttttct agggacgact catttgacct gatgcgtggc acggcaaagg 2940 aggcaatgag ggctcagcac gaacgaaatg aaaagacgta taaccttcga agtaaggagg 3000 tatcattcag agtgggacag gaggtttacc atcgaaattt ccagcaaagt aatttcgtga 3060 aaggatttaa tgccaagttg gctcccgttt ttgtgaagtc ccgggtaagg cgacagttgg 3120 gtcaagccta ttacgagcta gaggatctgc aaggacgtct gatcggcaaa ttccatgcca 3180 aagatatcaa acagtagcac catggttaaa aatcgctcta agtgggaatc acccttttgg 3240 tccttcttcc ccaaagtgtg attttggtag gggggatttg aagtggtttg aaagagtaac 3300 gtataaaaaa aggtataaat actgggatcc acaaaaggga acatttggaa aagtaccaag 3360 aagggatatt ggtctcccta tcttactcaa ctttgccgcc tcaattcaag ttaaaaggta 3420 aacgaaggaa agcggttttt tttctgtttt tttgagttga aacttgtaaa ataattacgg 3480 cgtcaattcg aaaatggagg attctcaagt ttaacgctgt attgtggggt ctcagactgt 3540 gcagatctag cctcggtctc ttggaagaaa aactgttggc gaagacagac gtgctatttt 3600 tgtccagtcc aagtataaag aaaaaaaaag aggaaaaaaa accgggaaaa gatcttaaca 3660 gctgaaacca gcaaaaagaa agaaatattg tgcgtttata aagtaaaagt gtgctaagcg 3720 ccaatttcaa ttggtatgta tttcagcgtc gctaatatac accacataca ctcgtggcgt 3780 agggtagtgc tttttttatt gtagaaaatt aatttttcag attggcggtt agtaccggcc 3840 taaacttcgg gcaatcgttt gggtacctgt ctgaacccca ataagaaagc tcgcaacaac 3900 aatttccccg atattggggc ctccgcccgc gacaggggac ttatgctcct cgcccaacta 3960 cagctccgta ccaaaaattc ccagcgctag gccgagcagc acaaagcgag aggcgtaaaa 4020 gatcggctca tcagtcgctt tgcgggcgcg agccacttat gtgtcacgga cccgcgatat 4080 attggaggcc gtccaagtct gcggaaattg ttaaagctat gtaaacggtc aaccaatttc 4140 tacccgaaat atagatctgg gtctagacct attttcgttt ccccatacga agtgccgagg 4200 agtgcccaga tgtgaagccg gcggcaacga ctgtcgtttt ttttttatat ccatcagtgt 4260 aaccgctgat tcctaattta tgttatagag tcattagtgt aattagctta gtaaatttct 4320 tttggttgtg ttttaaattc tgttcggaag aaatagcaag tggaatgtgt gacagggcga 4380 aagtgaacaa cggttaaccg acagaggaga gcgtgagttt ttttttgttg taaaattatt 4440 tctccagctt tcgttttagg aacgttgccg gaatggtacc acaaatttat aaagaatttt 4500 gcggctattg cggcgcctct gacggacgcc ctgaagcaga aacgccagtt catttggtcc 4560 gaagaagcag aagctgcatt cacagctttg aagaactcaa tgtgtgaagc gcccgtctta 4620 cacacaccga attttgaaaa gccttttttt atccactgcg acgctagtca ctcaggagta 4680 ggaggggtac tgatgcagat aaacgaagag ggcgatgaag taccaattgc gtttatgtca 4740 cgaaaattga ataagtgcca acgtgcttac tcggtgacgg aaaaggaatc tttagctgcc 4800 attctgagca tcaagaaatt tcgcgcttat gtagagggac atcaatttac ggtgataacc 4860 gaccatgcct ccttgaagtg gctgatgtcc caatcagatc tgagtagcag actggccaga 4920 tgggctttga agcttcaggg gtattccttt aatattagcc accgaaaagg gtctcgaaac 4980 atcgttccag acgccttgtc tcgggtcttt tccccacatt tgtcagagat cgaaccagaa 5040 gaaaggatcg atttggagtc ggagcacttt agttcgacgg attacgtttc ccttagggct 5100 aaagtagcga gtagtcaggc tcagatgcca gatgtaaagg tggtaggcag gcatatatat 5160 cgtcgcaccg agcacgcaac aggcgaaagg gtagcggatg aactctgttg gaaaatttgg 5220 attcccaagg ggctcataga ttcggtcctg aaaatggccc acgagcacgc cctatctgca 5280 cacggaggca taaataaaac gctcgaaaag gttcggcgct actattattg gccgacgttg 5340 ttaaaagatg tcaaagagtt cataaaccgg tgtgaaattt gtaagtgtac taagcatcca 5400 agtatacctt taagaccgcc attagcaaaa acgggggaaa cagaacggtt ctttcagaag 5460 ctcttcgtag attttctggg accgtatcct agaagtcgta gtgggaatgt cgggatattt 5520 gtggtgttag atcatttttc caaatttccg tttgttaaac ccgtaagaaa atttaccgcc 5580 gacgccatcg taccatacat tgaggaacaa ttatttcact gctttggagt gccggaaaag 5640 ctagtatcgg acaacggcgt ccaatttaag tcccatgcat tcaactcctt gcttcagaag 5700 tatggcatag agcacgctta caccgccgtt tacgctccgc aggctaacgc gtccgagcga 5760 gttaacagat ctatcttggc agcaatcaaa gcctacataa atactgacca aggtaactgg 5820 gatgagcagc tcagcagtgt agcctgtgcg ttaagatcgg ctgtgcacag tgccgtaaag 5880 gcgagtccat accagttggc gtttgagcaa cagatgatca cgaatggtac cacgtatcag 5940 ttgttgaggc agctagagat gttagaagat cgttggttgc atttttctag ggacgactca 6000 tttgacctga tgcgtggcac ggcaaaggag gcaatgaggg ctcagcacga acgaaatgaa 6060 aagacgtata accttcgaag taaggaggta tcattcagag tgggacagga ggtttaccat 6120 cgaaatttcc agcaaagtaa tttcgtgaaa ggatttaatg ccaagttggc tcccgttttt 6180 gtgaagtccc gggtaaggcg acagttgggt caagcctatt acgagctaga ggatctgcaa 6240 ggacgtctga tcggcaaatt ccatgccaaa gatatcaaac agtagcacca tggttaaaaa 6300 tcgctctaag tgggaatcac ccttttggtc cttcttcccc aaagtgtgat tttggtaggg 6360 gggatttgaa gtggtttgaa agagtaacgt ataaaaaaag gtataaatac tgggatccac 6420 aaaagggaac atttggaaaa gtaccaagaa gggatattgg tctccctatg ttactcaacc 6480 ttgccgcctc aattcaagtt aaaagggaaa cgaaggagag cggttttttt tctgtttttt 6540 tgagttggaa cttgtaaaat aattacggcg tcaattcgaa aatggaggat tctcaagttt 6600 aacgctgtac tggggggtct cagactgtgc agatctagcc tcggtctctt ggaagaaaaa 6660 ctgttggcga agacaaacgt gctatttttg tccagaccaa gtataaagaa aaaaaagagg 6720 actaaaaacc gggaaaagat cttaacagct gaatacagca aaaagaaaga aatattgtgc 6780 gtttataaat taaaagtgtg ttaagtgcca atttcaattg gtatgtattt cagcgtcgct 6840 aatatacacc acatacactc gtggcgtagg gtagtgcttt ttttattgta ggaaattaat 6900 ttttcagatt ggcggttagt accggcctaa acttcgggca atcgtttggg tacctgtctg 6960 aaccccaata cgaaagctcg caacaacaat ttcgccgata ttggggcctc cgcccgcgac 7020 aagggactta tgctcctcgc ccaactacag ctccgtacca aaaattccca gcgctaggcc 7080 gagcagcaca aagcgggagg cgtaaaagat cggctcatca gtcgctttgc gggcgcgagc 7140 cacttatgtg tcacggaccc gcgatatatt gaaggccgtc caagtctgcg gaaattgtta 7200 aagctacgta aaccgtcaac caatttctac ccgaaatata gatctgggtc tagacctatt 7260 ttcgtttccc ccatacgaag tgccgaggag tgcccagatg tgaagccggc ggcaacgact 7320 gtcgtttttt tttttatatc catcagtgta accgctgatt cctaatttat gttatagagt 7380 cattagtgta attagcttag taaatttctt ttggttgtgt tttaaattct gttcggaaga 7440 aatagcaagt ggaatgtgtg acagggcgaa agtgaacaac ggttaaccga cagaggagag 7500 cgtgagtttt tttttgttgt aaaattattt ctccagcttt cgttttagga acgttgccgc 7560 agtccatgga agcgctttca tcgctgtttc aacgcattta ttaaaacaaa tcaaggaagg 7620 tcgtaagtga tcccaaaaac ataaagataa aatattggag tttgtttgtt attagcaact 7680 atttattgta taaccgaaaa agaaaaaaat ctcacgtagt caggggcttc ttcgcttaag 7740 agaggctggg cgctctccct tatgccttgc tggcaaccat aagcgcttac gccgcgcata 7800 agagacagcc ctcttaagaa caaggcacaa acgggaagat gcgaggctgg tcgctctgac 7860 gttcttttat gctttctact tgcggttgct ttcgctaatg cacttgaata actgacctgc 7920 cataacgatc gctgcgcaca gtttttttta cattcccttt tgcatattta tttattatta 7980 ttatctattt gcctaacata aaaactagag atatgtaccc ttcccttcca tcaatttgta 8040 agttaaattt caatctaaat tgtattatta agagttaagc atttattagt ttacgagttg 8100 cgatgcatgt gctattgctc aatatgaaat ataggattta acatgaatcg gggctagagt 8160 tctaatatgc ggggaccatg ctgggcaaat gagggtaatg ggggatggga tcaccggatg 8220 aaccgggagc ggtgatcatg gtagggtggg catgaggggt cacaacaaca ccgtggcacg 8280 acttgtatgt gtgtatgtgt ccgtgtgcgt cgttaggtgt atttgctctt gtacatattt 8340 gtgccgatac ttcgcttgca cgtgaaattt taatgagcgt tactgtgtca gtattatgat 8400 tgttggagta gagctgtgtc ccgtccgttc tgggaaataa aaaaaacccc cccatttctt 8460 gccgatgaag ccggaagccg gaacaagcac gtgcccggac gggagacgaa tcatcacctg 8520 ctcagccttc cgtgaaggtt tggtaatatc caatatgcta cagttagttt attatttggc 8580 gcccaacgtg gggcccgagt 8600 // ID Gypsy-16_DPu-I repbase; DNA; INV; 7024 BP. XX AC scaffold_847; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_DPu_; KW Gypsy-16_DPu-LTR; Gypsy-16_DPu-I. XX NM Gypsy-16_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-7024 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 747-747 (2010). XX DR Genome; scaffold_847; Positions 4317 11340. XX CC Positions [5636-6127] - Integrase core CC 'GTAC' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 2984..6760 FT /product="Gypsy-16_DPu-I_1p" FT /translation="MEAMVDSGAKNSAVSEEWLVNRSCFEIRPPSNYRSLD FT GMPINNVVGETALTVRYQGVVVDLPRVAVVKKMLYPLVLGIEWIVQSGAVI FT KGVEGKAEVIMPDRSPVSSVHPKKEEEDTLKTEENLSYEEATKELKNLCLM FT AKEEEMPSSEDDKEQSVVLKTLHSSIVPANSAGYFKCRVPNDSSKFWMVST FT AGAVGAKGGWVTPNCVVEVEDGVLIIPVVNVNPHAISRKSAKGVLKAVPVR FT ESEIYPFESEDTEASVVANFQSDDGLPAYTDTLDDVKIDGSLTPEQVKELR FT ELLNGNRRCFLSKKGETHLAKHYIHTGDAKPIYSLPYRVSVAERKLISDHV FT EKMIAEKIIQPSFSSWSSPVVLVRKKNGEVRFCIDYRRLNAVTERDGYPLP FT RIEDVLGRLSGAKYFSSLDLESGFWQMAMAEEHREKTAFVTPDGLFEFLRL FT PFGLCGSPPSFQRLMDRVLHGLKWSECLCYMDDILVFGATFEEHQDRLNKV FT LRALGDAGLVLNTKKCVFGAKRIVHLGHVVDSHGISPDPAKTEAVVKFPRP FT RNVTELRAFLGLASFYRSFIPGFAETARPLHSLLKKDADVKEDWADVHEKA FT MAQLKEKLVTAPVLVCDDGTSELELQTDASVKGIGAVLILNKDGKANPITF FT ISRKLSKAEENYHANELECLALIWALGKLRHFVYGRPLLVKTDSSALCWLF FT KKKEVNGKFARWMLILQEYLLDIQHLRGAANVVADVLSRAPVDAANGDVMA FT VIQAGGYTSQEVGILQHADDDIRLIVLALQGFSKDPSNHQYPEFALHKGVL FT YKKNTRTGRPLLLAVPSIMRRDIVEECHDSADGGHRGVEKTLARIRQRFWW FT EGMASTVKSYVKSCHFCQTFKPRVGLPVGKLRPIPPPREMFHTLGIDHLGP FT FKSTIRGNRHLIVCIDYLSRWMEARPVASTGVDEVLPFLEEALILHHGTPT FT RIISDKGPCFTSLAFSAFCEKWNIKHVQASAEHPETNGLVERINSSIASTL FT AAFVNFKHSDWDEKIARAVFSINTSKQSTTEITPFELVFGRPAVMSLESAF FT PWPPSTPLTHEERVEVVSRWRRIARRLIIIRQKKSKLNYDRFRKGDPTFQI FT GELVLIARRRKTKKTTKKFIPRFIGPYQVYRKVSPTCYAVEDLPCFRKKRL FT WRRFNAHVSQIRRYSVRRETEWCPNSDEYESCDEETDQNGIMSNESEHVEN FT QPDESSIENQPSEPSIENQEFVPKHSTPFFSRVGRAARPINNDNNFIYY" XX SQ Sequence 7024 BP; 1797 A; 1635 C; 1726 G; 1866 T; 0 other; tttggtggca gcggcgagat atcggtcccc agaagcccgt ttcggaactt ctggtactgg 60 taaagaactc gcgtggagac gccattttgg gtgtgcgaga gcggagcgtc tcgacgactt 120 gatttcgtag agaactactt gaattcgtga tttcgtgctg aacatttgtc tgaagtgatc 180 gtacactttt tgtattgtct ttctgaaaga acgctaacga acgaacaaaa aaaaagagag 240 aagagtgttg aagtgttgag tcatctgatc gcgttggcta gccgccattt tgtttcatca 300 tcgtcctttt tgttctagaa tcgtcgttgt cgcggtcctt atttctcatg gtcgtttgtt 360 ctttgaacat ttcggtcgga aacttattgg gctcggcgac cggttcgttc atttcattcg 420 ccatatcatt tcatcttcgc cccatcgttt cgtcgttatt tctcagtggt tcttcatttt 480 tgtcgcgcgg gtgagagccg ccatttttat gtcatcatca ttcctttgtc tcgtcgcgtt 540 ggctagccgc cattttgttt catcatcgtc ctttttgttc tagaatcgtc gttgtcgcgg 600 tccttatttc tcatggtcgt ttgttcttcg aacatttcgg tcggaaactt attgggctcg 660 gcgaccggtt cgttcatttc attcgccata tcatttcatc ttcgcctcat cgtttcgtcg 720 ttatttctca gtggttcttc attttagtcg cgcgggtgag agccgccatt tttgtgtcat 780 catcattcct ttgtctcgtc gcgttggcta gccgccattt tgtttcatca tcgtcctttt 840 tgttctagaa tcgtcgttgt cgcggtcctt atttctcatg gtcgtttgtt cttcgaacat 900 ttcggtcggg aacttattgg gctcggcgac cggttcgttc atttcattcg ccatatcact 960 tcatcatcac ctcatcgttt cgtcgttatt actcattggc tctttatttg aaaattgttc 1020 gcttcatctg ccttatcatc ttgccgaatc gccgtattat ttccttgtta ttttcgtcga 1080 ccgttattcc ccttcatgcc ccgtcgaaga aacagtaagg ccagtcgggc gattcgggct 1140 gctgttttca aagaaaataa agttcgtcgg caagaagcta acgctcttca acgtcaacaa 1200 gcggcagtag cattggcact acaggcagca cccgttgtcg cgcctgaacg actagccgcg 1260 gatatccagc tgggagagga gattctacga gccgttcaag cccaaaggcg catcataccc 1320 cagcatctgc taccgccgct tggggttttg gtccaacaag ccgaggctat agctcggaac 1380 gcgcaactag cacacatccc actccaaccg gaaccaattg acgacgttct tcaagtagct 1440 ctagtcgaaa acgaagaagt tgattaacgc atttgtggtc ctattctcgg tcgttggtcc 1500 aatcggatcg ttcggtcaga cagttgttgg cccttttgta ctataatctt ctttggttca 1560 taatataatt cgtcatgagt catcatcgca tttcattgta acgtcgtcat actttttttt 1620 cccttaccgc taccttccgg ttggtgaatc gtcagaactt gacagatgct tcgctgtcgg 1680 tgtggagtag gacggtgtgt gaggtgtgtg tgtgttagag aaggtcagca gtgcgacgaa 1740 aactgccttt gtttcccaga acgttgcaat aacagaaggc tggaggattt tcaagacgcc 1800 gaaggagagg cagaaatcga agtcgctgcc atggatccaa acgcattgaa tgctgcaatg 1860 gctgcgctgg ctgctaacca acagcaacaa caacaacagt tccaactgca acaacaacaa 1920 atggctgccg accagcagca acaacaacag cgatttcaac tgcagcaaca acagatgcaa 1980 cagcaacaga atttgttagc ggccttaacg aatcggctct tagccgcccc tattccacct 2040 gtcgttccac cagcgccggc tgcagtccga gcttttcttg atgcagacgt aaaatttagt 2100 ggatctgcga acgattgttt tcaagattgg ctgcagatgg tgaatcgcaa ggccttggca 2160 gaaaattggg gagatgacga taagcgtcga gcggccataa gttccttgtt tggcaaggcg 2220 ctcacgtggc aggaggaaat tgggagccat ctgttgctgt ggaatgattg gatcgacgga 2280 cttcgaggag ccttcgaggt ccagctgacg gaaagtcaat ggcaagcact tgtcgaagga 2340 agaaaacaac ttccgaacga accgggatcg acttacgttt tggacaagtt aaactatgtc 2400 gtaaacgctc tatacctctc acggatgcgg aactgatccc tttccttatt cgaggacttc 2460 ttcatcccgg gatacagtcg gttatgatgg gtaacccccc catttcggtc aacgcttttc 2520 ttatcgaaat tcgccgtcta gagaacatca gtgattcgcc ggtcggctca agtgcttcaa 2580 aagagaccga gaaaatcgac gaaaagacgc ccgaaaagac ccgatctgct gattcgttat 2640 ttcaggctat ggaggcattg acgagccaag tggctgttct gacacgtaca gtaaaccgcc 2700 catcttcccc aggtacactt agaccagcaa cccggcaagt gacattcgaa cagcgcaacc 2760 cgactccccg aaatgaggtc caatgctaca attgcggcga ttttggccac atttcacgtg 2820 attgtccaaa accgaatcct cgctacccga aatcttcgac gaacgcggaa aacgggtcag 2880 ccggcccaac ggggcagggt cggcaataaa tgtaatcgtc cccactagtg ctccgcctat 2940 aaatagtccc ttcatcacag ctgatgtctt tcgcgtaggc gaaatggaag cgatggtaga 3000 ttcgggggcc aaaaacagcg cggtatccga ggaatggtta gttaaccgtt cgtgtttcga 3060 aattcgcccc ccctcaaact atcgatctct ggacgggatg ccaataaaca acgtcgtggg 3120 agaaaccgcc ctcacggtcc ggtatcaagg agtcgtagtg gatttgccga gagtcgctgt 3180 ggtgaaaaaa atgctatatc cgctggtatt aggaatagag tggattgttc aaagtggagc 3240 cgtaattaaa ggagttgagg gaaaagcgga ggtaatcatg ccagatcgaa gcccggtatc 3300 aagtgttcac cccaaaaaag aagaggaaga tacattgaaa acggaggaaa atctgagcta 3360 cgaagaagct acgaaggagc tgaaaaatct gtgtttaatg gcgaaagaag aggaaatgcc 3420 gagttccgaa gatgacaaag aacaaagtgt tgtgcttaaa actctacatt cgtccatagt 3480 gcccgccaac agtgccggat atttcaagtg tcgtgtgccg aacgactcta gcaagttctg 3540 gatggtgtca acagcggggg cagttggcgc taaaggaggc tgggtcaccc ccaattgcgt 3600 ggtggaagta gaagacggcg tgttaattat tcctgttgtt aacgtcaacc cacacgcaat 3660 cagccgaaaa tctgcaaaag gtgtcttgaa agcggtgccg gtgagagaaa gcgaaattta 3720 tcccttcgag agtgaagata ctgaagctag tgtcgttgca aattttcaaa gtgatgatgg 3780 tctgccggcc tataccgaca ctctagacga cgtcaagatt gatgggagtc ttactcctga 3840 acaagtgaaa gaactcaggg agttattgaa tgggaatcgt cggtgttttc tctcaaaaaa 3900 aggagaaact cacctcgcca aacactacat ccatacagga gatgcgaaac ctatttattc 3960 cttaccttat cgcgtatcgg tggcagaacg caaacttatc agtgaccacg tggaaaagat 4020 gatcgccgaa aagatcattc aaccctcctt cagttcttgg agctctccag tggtgcttgt 4080 tcgaaagaag aatggggagg tgcgcttttg tatcgattat aggcgactca atgcagttac 4140 cgagcgagat ggataccctc tacctcggat tgaagatgtg ttaggccggc tatctggagc 4200 gaaatatttc agcagtctgg atctcgaaag tggcttctgg caaatggcga tggctgaaga 4260 gcatcgtgaa aagacggctt ttgttacgcc ggacggcctg ttcgaattct tgcgtctccc 4320 ttttggcctt tgtggatcgc ccccgagctt ccaacgactt atggatcgag tcctgcatgg 4380 gcttaagtgg tcggagtgcc tctgttacat ggatgacatt ttggtgtttg gcgccacgtt 4440 tgaagaacac caagaccgac tgaacaaagt gttaagagcc ctcggagatg ccggactagt 4500 ccttaataca aaaaagtgcg tttttggtgc gaaaaggatt gtccatctag gacatgtagt 4560 tgatagtcat ggtatcagtc ccgatccagc taagacggaa gccgttgtga agtttcccag 4620 accccgtaac gtgaccgaat tacgagcctt tttaggttta gcgtcgtttt atcgttcctt 4680 tattcctggg tttgcggaga cggcgcggcc actgcattct ttgttgaaaa aagacgctga 4740 tgtcaaggaa gactgggcgg acgttcatga aaaagctatg gcgcaattaa aggagaaact 4800 tgtcacagcg ccagtgcttg tatgcgacga cgggacttct gaattggaac tacagactga 4860 tgcgagtgta aaagggatcg gagcagtgtt aattctcaac aaagacggga aggctaaccc 4920 gataacattt atcagtcgca aactgagcaa agctgaagaa aattatcatg ccaacgagct 4980 cgaatgttta gccttgattt gggcattagg caagttacgc catttcgtct atggaaggcc 5040 tctcttggtg aaaactgaca gcagtgcatt gtgttggctg tttaagaaga aggaagtgaa 5100 tggaaaattt gcacggtgga tgttgattct acaagaatat ctcctggata tacagcattt 5160 acgaggggcg gctaacgtag tggcagatgt attgtcacgg gcaccagtgg acgcggcaaa 5220 cggagatgtc atggcagtta tccaagctgg tggttatact tctcaagaag ttggaattct 5280 ccaacatgcc gacgacgaca ttcgcctcat agtgttagcg ttgcaaggat tttcaaaaga 5340 cccgtcgaat catcagtatc cggaatttgc cctgcataaa ggtgttttat acaaaaaaaa 5400 cacaaggact ggaaggccgc tcctgttggc ggtcccttcg ataatgcgga gagacatcgt 5460 cgaggagtgt cacgactcag cggatggagg tcacagggga gtagagaaaa ctttagctcg 5520 aattcgacaa cgattttggt gggaaggaat ggcatcgact gttaaaagtt acgtaaagtc 5580 ctgtcatttc tgccagacgt ttaaacctcg cgtcggatta ccagtaggaa aactccggcc 5640 tattccgccg ccccgtgaaa tgttccatac attgggcata gatcatcttg gcccgttcaa 5700 atctactatt cgtggaaatc gacacctcat agtctgtatc gactatcttt ctagatggat 5760 ggaagcccgc cccgtggcta gtacaggagt cgacgaagta ttgcctttcc tagaggaagc 5820 tcttatccta caccatggaa cgccaacccg aataatatcc gacaagggac cgtgcttcac 5880 ctcccttgct ttttccgctt tttgtgagaa atggaatatt aagcatgtac aggcttcggc 5940 agagcatccg gaaaccaatg gtctggtcga gaggataaat agttccatcg cttcaacact 6000 cgctgcattc gtcaacttta aacattctga ttgggacgaa aagatcgcga gagccgtatt 6060 ctcaatcaac acatcaaaac aatcgacaac ggaaatcact ccattcgagc tggtcttcgg 6120 tagaccagct gttatgtccc tggagtcggc atttccatgg ccgccatcta cacctcttac 6180 gcatgaagaa cgagtggagg tggtatcgcg ttggagaaga atcgctcgcc gactaattat 6240 tatccgccag aaaaagagca agctgaatta tgatcgtttc cgaaaaggtg acccaacctt 6300 ccagatcgga gaactagtcc tcattgcccg gcgtcgaaaa acaaaaaaaa ccacgaagaa 6360 attcatcccg cgattcattg ggccatacca agtctaccgt aaagtatcac ctacgtgcta 6420 cgcagttgaa gatttgccgt gtttccgaaa aaaacgactc tggcgccgtt tcaatgccca 6480 cgtcagtcaa atcagacgat attccgtacg acgcgagacg gaatggtgcc cgaatagcga 6540 cgaatatgag agctgtgacg aggaaaccga ccaaaatgga atcatgtcta atgaatccga 6600 gcacgtcgag aatcaaccgg acgagtccag catcgagaat caaccgtccg agcccagcat 6660 cgagaatcag gagtttgtgc cgaaacactc aactccattt ttctctcgag tgggaagagc 6720 ggctcgaccg attaataatg acaataattt catttattat taatacccgt tgtgccggcc 6780 tttttttttg ttaatttcaa ctgtcgtgcc gtttacttct ggtcgtccgt tttaatctaa 6840 tggtcgataa catgtttgaa tttcaaaatt ttgcttattc attgctccat ttgtccatct 6900 tattggttcg tcttatttgt ccgtcctaat cgtccgtctt aattgtccgt ctgactgtgg 6960 cgagttgtat gcgtcgggcc gtccttaatt aagatcccag atctttctgt caggaagggc 7020 cgaa 7024 // ID Gypsy-13_SI-I repbase; DNA; INV; 4217 BP. XX AC AEAQ01024017; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_SI_; KW Gypsy-13_SI-LTR; Gypsy-13_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01024017; Positions 4607 391. XX CC Positions [1611-2111] - Reverse transcriptase CC Positions [3247-3717] - Integrase core CC 'TTGTA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1071..2405 FT /product="Gypsy-13_SI-I_2p" FT /translation="MYNLSNVLASEEPLIVKVEIEGKSIAMELDTGAGKSI FT LPEKIFKEKFSHCKLENTKIRLRMYNGSILIPEGQISVNIKCKETVIQAQL FT IVVKKGNRILMGRDLMKLLNIKMEQINFIKEEDNLKALLQEYKELFNNELG FT KYKFEKIDLKLSKEANPIFIKSRPIPLAFKEKISKQLEELEKKGVIEPIDT FT SAWGTPLVPVIKKDGSIRICADYKITVNKFLEDVKHPLPRIEELFTALSGG FT ESFTKLDLTAAYNQLEVTERTSKLLAWSTHKGIYSLKRLPFGTKPACSIFQ FT RTIEKVLQGVKNVINFIDDIVVTGANKEEHLKNLREVFKRLSEAGFKVNLK FT KSVFFQPEIKYLGHVINKEGLHKDPEKVAAMLEAPKPKNVSEVKAFVGMVN FT YYGKFVPNLSQVLIPLYELQRSTTFKWTQQCEEAFNNVKKRISVREKFGSF FT " FT CDS 2302..4170 FT /product="Gypsy-13_SI-I_1p" FT /translation="MNSSVVQHLNGRSNAKKRLIMLKSELASERNLVHFNK FT KWKLKLVCDASKVGIGAVLLHVLPDSTEKPISFASRVLHDAEKNYSVIHKE FT ALAIYWAICKFYQYLMGNEFILCSDHKPLMALFGEHKGIPQMAAGRLQRWA FT LFLSGFQYKFEHIKGIQNGGADGLSRLPRPIKIKEAEVEDYFHFVTAERTP FT IDATQIKKELRKDNILSKVYLYTRDGWPNSVSEEIKVFASKANEIGIENDI FT LMWGYRVIVPFKFRKALLEEIHGAHLGMAKMKMIARQYFWWPNIDKEIEEY FT VKDCEACRTTANNPNKSPLIKFQEAEFPFDRIHIDFAGPFKGKTYLIVVDA FT FTKWPEVFEMSNTNTESTIEKLRQCFARFGLPRMIFSDNGRQFVSEEFENF FT CKNNGIKHRTSAPYHPSTNGLAENAVGSFKKGLSKALADKRNASLSTTTLI FT NRYLASYRNAPHTSVGESPSKLMFGREIRTRLSLLNRPERDKAREKQVQYF FT KGNREILLKPGDIVYVRDYKIPTKPTWRKAIIKSKIGNRTYVCKTLDSEEL FT IWKRHIEQIIEAGKFYSEEEDCMSKSDKDNKKSDSKELEVVQEPEKDNVCE FT SPEETKTDINVKPKRIIKPVQRLNL" XX SQ Sequence 4217 BP; 1587 A; 552 C; 934 G; 1144 T; 0 other; attggcgacg aggatattct ggcaaagcgg gtgcatcgga aattgaacga gaagtacaat 60 caaggcggat gtaacaattg agatcagaga cgtcaggagc aggagcaatt aaaagccaag 120 ttgcatcaag agcagatgca gctagcagag tggcaaaggc aactgcagca gcagcaaaat 180 ttaatgcagt agcaagcaca acaagtaaag actcaggtac ctaaagaagc gatgccgaac 240 attacttttc tttcaacgat tggagcgctg tccgaattta atattggtga agactggaat 300 ctctatcaag agcggctagg acagtatttt gtagcgaatc aagtttcgca ggaacgtaag 360 gtagcagtat taattacatt agtaggacaa gaggcttata aaattttgaa ggacctctgt 420 gatccaacgc ttccagaatg taaatcgtat gaagaattgt gcgaaatttt aaagaaacag 480 tttgccccaa gagtctcggt tttcaaggag agaatcgaat tttacgagtt aaagcagaag 540 gaaaaggagt cagtaaatga gtggtttgct cgtattaaga gcaaagctat caactgcaaa 600 tttggagcgc aactggacga caaaatcaag gacagatttg ttactgggtt aagtaaaggt 660 cgaattttgg atagagtatg cgaggaagag catacaacaa cgctacagtc gattctcgag 720 gtggcaagaa agaaggaagc agctttggct tcgtcgtcta aggctagtct ggtggacgtg 780 cacaatttga agtcgggaaa agcaaaatcg gctcagcagg tgacgttcca gaaaaagaag 840 aaagaggaga agacaggaag ccagcaccaa gggtccaaag aagaacgaaa tgcgtacatt 900 gcggaggtac aaggcatatt tttgcaaagt gcaagtataa gacatataaa tgtaaaattt 960 gtagcaagga aggtcatttg gctaaaattt gtaaaaataa taagagcacg gtagctaata 1020 caaattattt agaatctggt tcaaatgata aggatacaga gattgtggat atgtataatt 1080 taagcaatgt tttagcaagc gaagaaccgt taatagttaa agttgagatt gaaggtaaat 1140 ctattgccat ggaattagat accggagcgg gtaaatcaat attaccggaa aagattttta 1200 aagaaaaatt tagtcattgc aaattagaaa acacaaagat taggttaagg atgtataacg 1260 gaagtatttt aattccggaa gggcaaattt cagttaatat taaatgcaaa gagacagtga 1320 ttcaagccca gttaatagtg gtaaagaaag gaaacagaat attaatgggt agagatttaa 1380 tgaaactgtt aaatattaaa atggaacaga tcaattttat taaagaagaa gataatttaa 1440 aagcattact gcaagagtat aaagaattat ttaataacga gttgggtaaa tataaatttg 1500 aaaagataga tttaaaatta tcaaaggaag ctaatcctat ttttattaaa tcaagaccga 1560 taccattagc atttaaagaa aaaataagta aacaattaga ggaattagaa aagaaaggag 1620 taattgagcc tatagatacg tcagcttggg gtacgcccct ggtaccggtt attaagaaag 1680 atggaagtat tagaatttgc gccgattata aaattacagt caataaattt ttagaagacg 1740 ttaagcatcc tttacctaga attgaagaat tatttacagc gttaagcggt ggagaatcat 1800 ttactaagtt agatttaaca gcggcgtata atcaattaga ggttacagaa agaactagta 1860 agttacttgc gtggagtact cataaaggta tttattcgct taaaaggtta ccgtttggaa 1920 ccaagccagc gtgttcaatt tttcaaagaa cgatagaaaa agtcttgcaa ggtgtaaaaa 1980 acgtaattaa tttcattgat gatattgtag ttacaggtgc taacaaggaa gagcatttaa 2040 agaatttaag agaagtattc aaacgtcttt cagaggcggg atttaaagtc aatttaaaga 2100 aatctgtatt ttttcaaccc gaaattaaat atcttggaca tgtaattaac aaagaagggt 2160 tgcataagga tccagaaaaa gtagctgcaa tgttagaagc acctaaacca aaaaatgtgt 2220 cagaagtaaa ggcctttgtt ggtatggtta attattatgg caaattcgta ccgaatttat 2280 cacaagtgtt aataccttta tatgaactcc agcgtagtac aacatttaaa tggacgcagc 2340 aatgcgaaga agcgtttaat aatgttaaaa agcgaattag cgtcagagag aaatttggtt 2400 cattttaaca aaaagtggaa attgaaatta gtttgtgatg catccaaggt aggcattgga 2460 gctgtcttat tacatgtttt gccagatagc acagaaaagc caatttcatt tgcatctaga 2520 gttcttcacg atgcagaaaa gaattattca gtaattcata aagaagcatt agctatttat 2580 tgggctatat gcaaatttta tcaatattta atgggtaacg agtttatttt atgttctgat 2640 cataaaccat taatggcttt gtttggggag cataaaggca ttccacaaat ggcagcaggc 2700 agattacaaa gatgggcctt atttttaagc ggatttcaat ataagtttga acatattaaa 2760 ggcattcaaa atggaggagc agatggctta tctaggttgc caagacccat taaaattaaa 2820 gaagcagaag ttgaagatta ttttcatttt gttacggcag agcgtacacc tatagatgca 2880 acacaaatta agaaagaatt gcgaaaagat aatatattaa gtaaggtata tctgtataca 2940 agagacggat ggcctaattc agttagcgag gaaataaaag tttttgcaag taaagcgaat 3000 gaaattggta ttgaaaatga tattcttatg tggggatata gagtaatcgt tccgtttaaa 3060 tttagaaaag ctctattgga agagattcat ggagcacatt taggaatggc caaaatgaaa 3120 atgatagcca gacagtattt ctggtggcct aatattgata aggaaataga agaatacgtg 3180 aaagactgtg aagcttgcag aactacggct aataatccta ataagtcacc attaattaaa 3240 tttcaggaag ctgaatttcc gtttgataga attcatattg actttgcagg accttttaaa 3300 ggtaaaacat atttaattgt agtggatgca tttactaagt ggccggaagt gtttgagatg 3360 tctaatacga acacagaaag cactattgaa aaattaagac agtgttttgc tcgttttggt 3420 cttccacgta tgattttctc agataacggc agacagtttg tctcagaaga gtttgaaaat 3480 ttttgtaaaa ataacggcat taagcataga acgtcagcac cgtatcaccc atcgactaat 3540 ggtttagcag aaaatgcggt aggttctttt aaaaaaggtt tgtcaaaagc gttagcggat 3600 aagcgaaatg cgtcattaag tacaactaca ttaataaata gatatttagc ttcgtacaga 3660 aatgcgccac atactagcgt aggagagagt ccatcaaaat taatgtttgg gcgtgaaatc 3720 cgaactagat taagtttgtt aaataggcct gagagagata aggctagaga gaaacaagta 3780 cagtatttta aaggaaatcg agaaatttta ttaaagccag gagatatagt gtatgttaga 3840 gattataaga ttccaacaaa accgacgtgg cgtaaggcga ttattaaaag taaaatcgga 3900 aatagaactt atgtgtgtaa aacgttagac tcggaagagt taatttggaa aagacatata 3960 gagcagatta ttgaagcagg aaaattttat agtgaagagg aagattgtat gagtaaaagt 4020 gacaaagaca ataagaaatc ggattcaaaa gagttagaag tggtacaaga gccggaaaag 4080 gacaatgtat gtgaaagtcc ggaggaaacg aagacggata ttaacgttaa gcctaagaga 4140 attattaagc cagttcaaag actcaattta taaaatgtat taaacattta agtttttcat 4200 aattagtggg agaggag 4217 // ID Gypsy-240_AA-I repbase; DNA; INV; 4821 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-240_AA_; KW Gypsy-240_AA-LTR; Gypsy-240_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4821 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1081-1081 (2011). XX DR [1] (Consensus) XX CC Positions [3765-4223] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2019..4751 FT /product="Gypsy-240_AA-I_1p" FT /translation="MSASIKIDKTVPPVRQSYRRIPIPLEEVTILKLKELE FT RQDVIERVYEASEWISPMLVKRKSLNDVRIIIDLREANKAVVREVHPLPTM FT EQMMAKLRGSTVFGKLDIKQAFHQLLLREDCRYITTFISPLGLMRYKRLVF FT GLSAAPEIFQKCMETILADFPWLVIFIDDVLVHAPNKESLKLRMSLVKNRF FT KQYGIVLNENKTVECATEIDFLGYQLSAEGIKISPGKLEAIKTFRMPKSAE FT EVRSFLGLATFVSRHIPNFSTISSPLNALTRKGTPFEWKDSHQEAFETLKD FT LMSKSETLAFFNPNLETYVETDASPVGLGAVLYQKSVNGTFNIICYASKTL FT SEVERRYAQTEREALALVWAPEKFHYYLYGMSFWIVTDHKPLVSIFGDRIK FT PSARIERWKIRLMSYDYKVLYRPGKTNIADTLSRLCQGESGDAESYDEECE FT RVIRLVSTDVCPVAISLDEIIEATQHDIELTELMRWLPYPPRRWPSTLTRY FT RPIADDLSSDGDIVLKNKKIIIPRKLYDKILKLGHEPHLGNTAMKKRLREK FT VWWVRMDSMVEEYVSSCNGCLLVSEPATEPMVRSPLPDKPWKKLALDFTEV FT TKGIHLLVIVDYYSRYPEVEIITSMTAKSTIFKLRTIFARFGYPHEIVCDN FT GPPFTSDEFVHFCTAFGIVLKHSIPYAPFQNGLVERHNRTLLKTVKISVAM FT GRDWKNDLYDFLLAYRNTPHTVTNISPAKLMFGRNLKDKIPDMLTEESEEK FT DNVTSIDDKNKEKGKKYGDRKRKAKHSTVDIGDHVLIKNVVKQNKLTPTFN FT PQPHIVVKKTGTRLTLRNLDTGVELDRHVNHVKILISNSPFRSPELSDPPM FT NNTVQSPILEKSCTKPKTTVESDSAKMSNQNQLSNSGPIRNRSTRNHRKPG FT YLKNYVT" XX SQ Sequence 4821 BP; 1599 A; 867 C; 1063 G; 1290 T; 2 other; ttggcatacg agtgattggg attttttttt ctggagcatc ggcggaagtc aaacgaggac 60 atcagtacca ttttggaaaa atgacatcgg taaagtatag aattcaattt tgtgtgatgg 120 ttaaatgaga aaaaagtgca ttcgtgaacg caagaagtga actgaaaatg ggatatttat 180 tcctatcgtc aaatccatcg agaaaatgga aaatctattt ttcataggaa agacgatagg 240 aaaatattcc agttagtttc catgagcaac acaaacggtg tgggaacctc gagttccagc 300 ccaccggaag gaacaagcga aggggattcc agttcgctgt ttcaaaggaa ttataggagg 360 ttcaaagaat ctcagtctta ttcctggaag aaacgtaggg aactttccgg aagttctagt 420 ccgtttctgc attcaagtat agggaataaa gttattccag tctgcttgga gagggggaag 480 aatgaaaaga aaaaccggaa aaaatgttgt ttggaaacag cctatgacta ctagcttgcg 540 ctcacgttat gaaaaaaaga aaacaaaaat gaatttgaaa atcgtcagtc tgacagatac 600 gtasatgttg ctmtagtaac taggaagcat tatttttgtt gacaacatat acgttatgtt 660 tcagttgaat tacaacgtta cacctttttg tgaatctgga gccggcctcg ttagttctcg 720 ttgggaagaa tggagagacc agtttattgc atacctcgac ctgaaaggaa tatgtgacgt 780 agacgaaaag cataaagcac taatatgttt tggtggacct gacgtgagga agattgccaa 840 ggatgtgact gtaagcggaa atattttgga cgatgcttat cgtgctgtac tggaagcact 900 cgacaattat tattcacctc gaatgtcgtt gcgttatgaa cgattcaagt ttcggcaaat 960 gactttcaac cctaaagaga agctggatca gtttgttata cgattaaggg ctcaagctgc 1020 cgcttgcagt tatggtgacc aaattgaggg aatgataatg gaccaaataa tttacgcaac 1080 tcaaacggat gacaaactgc gatctaagta tttggaggtc gacactactt tagacgaaat 1140 gttgaaaatt ggtcgcacgc atgaaaccgt caataagcag gtatattaat aatttggtta 1200 tttttcacag attttaagaa aaaaaaatat taggttcaag aatttcgcat caatccaacc 1260 gacaattccg agctaaacgt gatagacgat tccgtaaaac aagcgaaacc caggctgtcc 1320 tgcagtcgtt gtttggggaa tcacctgcct actgattcaa aatgtccagc tcgtggatcg 1380 cattgtaacc gatgcaataa aatgggacat tatgcgcgct gttgcagagg tatgaagaaa 1440 ccgtttctac aacaccattc gaaatcaatc agcagaaagc gtcaagcata cgaacaaatg 1500 caggaagatc gtcgtgaaca aaagccgaag tttgtacgag aagttgatga cgtcaccaag 1560 acagcggaaa tacgcgagtt attccatttg aatggtaagc gcacaacagt cgtctccgta 1620 ggtggggtcc agctccggtt tatcgtggat actggggcag acgaggacgt tttgagtatt 1680 gaagattgga acacattaaa aaggactgga tttgaggcgt ttgcgattag aaagggaagc 1740 gctaaggttt ttcaagctta cggatcaaga aagccgttga ccgttctagg agaagtcgac 1800 gcgatgttga aaacagcaga tgaatcttta agaaccacat tgtttgtgat tcaggatggt 1860 aagcattcgc tactttccgg acgtagcgca gaaaagctcg gtgtagttaa atttctgaga 1920 tcggtgagtg aagagatttt cccggttatc aaaggtaaga atctgtaatt gatttaaaaa 1980 taggatagat tattatattg taaattcatt atgcagatat gagtgcaagc attaaaatag 2040 ataaaaccgt accccctgtt cgtcagagtt atcgccgtat cccaattcct ctagaagaag 2100 ttactattct gaaactaaag gagttggaac gtcaagatgt aattgaacga gtttatgagg 2160 catcggaatg gatatcgcca atgctcgtca agcggaaaag tttgaatgat gtgcgaatta 2220 ttattgacct tcgtgaagcc aacaaagccg ttgtaagaga agttcatccc cttcctacaa 2280 tggaacaaat gatggctaaa ttaaggggca gtacagtttt tggaaaactt gatattaagc 2340 aagctttcca ccaacttcta ttacgagaag attgtcgata tatcacaact tttatatcgc 2400 ctttgggcct aatgcgttac aaacgattag tatttggatt gtcggcagca ccagaaattt 2460 tccagaaatg tatggaaacc atacttgcag attttccatg gttggtaatt ttcattgacg 2520 acgttctggt tcatgctcca aataaggaat cactgaaact aagaatgagt ttagtgaaga 2580 atcgtttcaa gcagtatggt attgttttga acgaaaataa gacggtagaa tgtgctacag 2640 agatagattt tcttggatat caattatctg ctgaaggtat caaaatctca cctggaaaat 2700 tagaagccat aaagacattt cgaatgccaa aatctgccga agaagttaga agttttttgg 2760 ggttagctac cttcgtaagt cggcatatac caaacttctc aacaataagt agtcctttga 2820 atgcattaac aaggaaaggt acaccatttg aatggaaaga ttcacatcaa gaagcttttg 2880 aaacattgaa ggatttgatg agcaaatctg aaacattggc gtttttcaac ccaaatctag 2940 agacgtatgt agaaaccgat gcgagtccag tagggttggg tgctgtcctg taccagaaat 3000 ctgtaaacgg aacgtttaac attatatgtt acgcctcaaa gaccctatct gaggtagaaa 3060 gacgctacgc tcaaacagag agagaagctt tagcattagt atgggctcca gagaagttcc 3120 attactatct ctatggaatg tcattttgga tagtcactga tcacaaacct ttggtttcaa 3180 tatttggcga tagaatcaaa ccttcagcaa gaatcgaacg gtggaagatt cgattaatgt 3240 cctacgacta caaagttctc tatcgaccag gcaaaacgaa cattgcagac actctttcgc 3300 gattgtgtca aggagaatct ggtgatgcag aaagttacga tgaagagtgc gagagagtaa 3360 ttcgtttagt ctcgaccgat gtatgtccgg ttgctatttc acttgatgaa attatcgaag 3420 ccactcagca tgatatagag ctgactgaat tgatgcgttg gctgccatat ccgccaagac 3480 gttggccaag cactcttaca cgatatagac caatagctga tgatctttcc agtgacggtg 3540 atattgttct gaaaaataag aagattatca ttccacgaaa actatatgat aagatactta 3600 aactaggtca tgagccacat ttgggaaaca cagctatgaa aaagcggttg agagaaaaag 3660 tctggtgggt acgaatggac tctatggtag aagaatacgt cagttcatgc aatggatgcc 3720 ttctagtttc cgaaccggct actgaaccga tggtaaggtc cccattacct gacaaaccat 3780 ggaaaaagct ggcgttggat ttcacagaag tgactaaagg aattcatttg ttggtaattg 3840 ttgattacta ttcaaggtat cccgaagttg aaatcatcac ttctatgact gccaaatcta 3900 cgatattcaa actccgtaca atatttgcaa gatttgggta ccctcacgaa attgtgtgtg 3960 acaatgggcc accttttaca tcagacgagt tcgtacattt ttgcaccgct tttggtatag 4020 ttcttaagca ttcgatccca tacgccccgt ttcaaaatgg actagtagag agacataaca 4080 gaactcttct gaaaacagtc aagataagcg tagcaatggg aagagactgg aagaacgacc 4140 tttatgattt tcttctggct taccggaata caccgcatac ggtcacaaac atttctccag 4200 caaaactcat gtttggtaga aatttgaaag acaaaatacc tgacatgctg acagaggaat 4260 cggaggaaaa ggataatgta accagcatag atgacaagaa taaagaaaaa gggaagaaat 4320 atggcgatcg gaagcggaaa gcaaagcact ctacagtcga tatcggtgat cacgtgttga 4380 ttaaaaatgt ggttaaacaa aataagctca ctccgacttt caaccctcag cctcacatcg 4440 tggtgaagaa aaccggaacc cgccttacgt taaggaatct cgatacagga gttgagctag 4500 ataggcatgt taatcatgta aagatcctta tttctaacag tccattccgc agtccggaac 4560 tttctgaccc tccaatgaac aacacagtac agtctccgat tttggaaaaa tcttgtacta 4620 aaccgaagac taccgttgaa agcgacagcg cgaaaatgtc taatcagaac cagttatcaa 4680 attcaggtcc aatacgaaac agatcaacaa gaaatcatcg aaaaccaggt tatcttaaaa 4740 attatgtaac gtaatcaaaa atacaaaatg taataattca taatatagaa atagattctt 4800 taatataaac taagggagag a 4821 // ID R1 repbase; DNA; INV; 5092 BP. XX AC M19755; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 08-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE R1 repeat family. XX KW transposon; Repetitive element R1; R1. XX NM R1. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-5092 RA Xiong Y. and Eickbush H.T.; RT "The site-specific ribosomal DNA insertion element R1Bm belongs RT to a class of non-long-terminal-repeat retrotransposons."; RL Mol. Cell. Biol 8(1), 114-123 (1988). XX DR GenBank; M19755; Positions 44 5135. XX FH Key Location/Qualifiers FT CDS 467..1849 FT /product="R1_1p" FT /translation="FGTMSEEERELFSPRVSLARSPPRGPTAPLPAPMPPP FT APRGVRAGAAMSAKGRRTGLHIPVASGLSSSAPVTPVDPVAGVPSFPIPVA FT SGAPTGPLDVAAQGRLELLERANRAVRGIMSVATAASKLNKSEVNLISELG FT RDILAVVGALGIQLSDKELVVERLRSAEASARRDSVAPSMAAGAAAGSGTG FT PATFATVLRTGPGGVPRSIGASQGPSLAFYPSEGNAELKTAEDTKKEVKKA FT IDPKSMAIGIQSVRKVGNAGVVVQTTSPGAAVKLRNAAPPSLRVTEPRRRQ FT PLVAVNGVEGDPSFEEVIECLASQNLDPEEWPLTRVRAELTGAFKKGRRQS FT NNTTVVFNASPRIRDALVKIGRVYVGWVACEVTDFVRVTCCNKCQQYGHPE FT KFCRAKEATCGRCGEDGHRMEACKAASACCATCRRFRREAMHPTASRDCPA FT RRHAEERFLNQVEYGY" FT CDS 1830..4982 FT /product="R1_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="IRSSMDIRPRLRIGQINLGGAEDATRELPSIARDLGL FT DIVLVQEQYSMVGFLAQCGAHPKAGVYIRNRVLPCAVLHHLSSTHITVVHI FT GGWDLYMVSAYFQYSDPIDPYLHRLGNILDRLRGARVVICADTNAHSPLWH FT SLPRHYVGRGQEVADRRAKMEDFIGARRLVVHNADGHLPTFSTANGESYVD FT VTLSTRGVRVSEWRVTNESSSDHRLIVFGVGGGTTGERDEDEEARSDLRPG FT EPCALRRYRDRGVDWDLFRSRIHERMGSLDLEEPVAALCEKFTGVITRTAE FT ECLGSLKADRTDRGYEWWTPVLDKLRVAQGRARRRWQKARRTGGEEEEQSG FT RVFRDRRREYRRAMHDAETAFYREIAEGGNRDPWGLAYRTASGRRRAPTNV FT VNGVEYAGRCSDDVSGAMRTLMWALCPDDYMSRDTPYHARVRIMAALPPSG FT RDADPLSKDSLRAIIGSLKNTAPGIDGLTARIIKKALPAAEAEFVAVYARC FT VVEGTFPPVWKDGRLLVLPKGNGRPLTDPKAYRPVTLLPVLGKILEKVLLQ FT CAPGLTHSISPRQHGFSPGRSTVTALRTLLDVSRASEQRYVMAIFLDISGA FT FDNAWWPMIMVKAKRNCPPNIYRMLTDYFRGRRIAVVAGECAEWKVSTMGC FT PQGSVLGPTLWNVLMDDLLALPQGIEGTEMVAYADDVTVLVRGDSRAQLER FT RAHAVLGLAEGWASRNKLDFAPAKSRCIMLRGKFQRPPIVRYGSHVIRFEN FT QVTVLGVSSTIASLSRHAAAIGERASRCFGKMSRVSASAWGLRYRALRVLY FT MGTYVTTLTYAAAVWYLRAAVHVVRSVLLRTQRPSLTLLTKAYRSCSTAAL FT PVLAGVLPADLEVTRAGRRMRECEGLARELAAERRRRIDGDVLVVWQNRWV FT SEGKGRELYKFFPDVADRKKATWMEPDYQTSQILTGHGIFNKRLADMRLRE FT GHACDCGAVEEDRDHVLWECPLYDEIRGRMLDGISRSEVGPVYHADLVRDE FT KNFRLLREFAIHGTRRALRANHEDCLATKNVGDVSSPRTGRMLVE" XX SQ Sequence 5092 BP; 967 A; 1281 C; 1751 G; 1093 T; 0 other; tgacttcgcc gtcggccttg gtcgaggaca gacgtgcgtt ccgttatttc tttattttcc 60 gtcatttaag tgtattgtgt ttctattggt gtatcggacc ctctcgtttc ggcttgaggt 120 ttaagtcata agacgccgcg gccatcttgc tgtgtgagcg gtgtgacgag tgcgaaggcg 180 gagtttagct cgacgtggag tcggcccctc tcgcttcctc ttgggtgccg gtccatatag 240 gtcggtgtcc atattggatt gcgtgtgaga cggccgattt gcgtgagggc ggacccgcca 300 tttaggtctg tgacagtgac actagtgtgc gatagtgacg ttttataatt tgctgtgggc 360 ggtagccgcc attttgtgat agtgacactc gaggatgcga cagtgacgtt ggtttgtgtt 420 tgtgttcgtg tgcgtgtgtg aatatctttg cgtgatagta atataatttg gaacaatgtc 480 ggaggaggag agggagctat tttcccctcg ggtctctttg gcgcgctcgc cgcctagggg 540 accgactgca cctctgccgg cgcctatgcc gccgccggcc ccccgggggg ttagggcggg 600 agcggctatg agtgccaaag gccggaggac cggcctgcat atcccggttg cttctggact 660 gtcgtcctcg gcgcctgtca ccccggttga cccagtggct ggtgtcccta gcttccccat 720 tcctgttgcg tcgggggctc cgactggacc ccttgatgta gcagcgcagg ggaggctgga 780 gctgctcgag cgggcaaacc gagcggtgcg cgggatcatg tcggtggcga cggcggcatc 840 taagctcaat aagtcggagg tcaatctaat ctctgaactc ggtcgcgata tcctggccgt 900 agttggggcc ctcgggatcc agctctctga taaggagctg gtggtcgaaa ggctccgctc 960 ggcagaggcc tctgcacggc gtgactccgt ggcgccttcg atggctgctg gtgcggcggc 1020 cgggtctggg actgggcctg cgaccttcgc cactgtcctc aggacgggcc ctggtggggt 1080 cccgaggtcc attggggcct ctcaggggcc ttccttggca ttctacccgt ccgagggcaa 1140 tgcggaactg aagacagctg aggacacaaa gaaagaggta aaaaaggcca ttgaccctaa 1200 gtcaatggct attggaatac agagcgtaag gaaggttggg aatgcggggg tagtggtgca 1260 gaccacgtcc cccggggccg cagtcaagct tagaaatgct gccccgccat cactgagagt 1320 caccgaaccc aggcgccgac agcccttagt agctgtcaat ggcgtggagg gcgacccctc 1380 cttcgaggag gtaatcgagt gcctggctag ccagaacctc gacccggagg agtggcccct 1440 cactagggta cgagcggagc tcacgggggc gttcaaaaag gggaggcgac aatccaacaa 1500 cactactgtg gtgtttaacg cctcccctcg catcagggac gccctcgtga agattggcag 1560 agtgtatgtg gggtgggtcg cctgtgaggt cacggacttt gtccgggtga cctgctgcaa 1620 taagtgtcag caatatggtc acccggagaa attttgccgg gccaaggagg ccacctgtgg 1680 ccgatgtgga gaggatggcc accgaatgga ggcctgtaag gcagcctccg cgtgctgtgc 1740 gacctgccgg cgattccgtc gcgaggctat gcacccgacg gcctcgcgcg actgtccggc 1800 gcgccggcat gcggaggagc gcttcctaaa tcaggtcgag tatggatatt aggccccgac 1860 ttcgtattgg ccaaatcaat ctgggtggtg cagaggatgc gacgagggag ctaccctcca 1920 ttgcacggga tctcggcctg gatattgttc ttgtacagga acaatattcc atggtcgggt 1980 tcctagccca atgtggagca caccccaagg cgggtgtgta tatccgcaat agggtgctcc 2040 cctgcgcggt tctgcaccac cttagcagca cacatataac ggtagtgcac attggggggt 2100 gggacttata tatggtgtct gcgtacttcc agtatagtga ccctattgac ccatacctgc 2160 accggctcgg gaatattctt gaccggctgc ggggggctcg ggtcgttatc tgcgcagaca 2220 ctaatgccca ctcgccattg tggcactcgc tgcccaggca ctacgtcggt cggggtcagg 2280 aagtggctga ccgccgcgcc aagatggagg atttcattgg ggcgaggcgg ttggtcgtcc 2340 ataacgcgga tggccacctg ccgaccttca gtacggcgaa cggagaatct tatgtcgatg 2400 tcacgctgtc tacgcgggga gtacgcgtgt ctgaatggcg tgtaactaat gaatcatcga 2460 gcgatcaccg gctcattgtg tttggggtgg ggggcggtac aacaggggag cgggacgagg 2520 acgaggaggc gcggagcgat ttgaggccgg gcgagccgtg cgcactgcgt cggtaccggg 2580 accgtggggt ggattgggac ctcttcagat cgcgtatcca cgagcgaatg gggagtctgg 2640 acctcgagga acctgtggct gccctttgcg aaaaatttac cggggttata actcgcacag 2700 ctgaagaatg cttaggatca ctgaaagcag atagaactga caggggttat gagtggtgga 2760 ccccagtact cgataagctt agggtagccc agggtagggc caggcgtcga tggcaaaagg 2820 cccgccgaac ggggggtgag gaagaggagc agtctgggag agtcttccgc gaccgcaggc 2880 gcgagtatcg aagagcgatg catgacgctg agaccgcctt ttaccgggag atcgctgaag 2940 ggggaaaccg tgacccgtgg ggactagcgt atcggacggc gagtggtagg cgacgcgcac 3000 caactaatgt ggttaacggg gtggagtatg cggggcggtg ctcggatgat gtgagtgggg 3060 ccatgcgcac cttgatgtgg gcgctgtgtc cggatgacta tatgtctcgg gacactccgt 3120 accatgcgcg ggtgcgtatc atggccgcgc tccccccatc cgggcgggac gcagacccgc 3180 tgagcaaaga ctcccttcgt gccataattg gctcactgaa gaataccgca ccgggcatcg 3240 acggtttaac ggcgcgcatt atcaaaaagg cacttccggc tgctgaggcc gagttcgtgg 3300 ccgtatacgc acggtgcgtt gtggagggga ccttcccgcc ggtgtggaag gatggccgcc 3360 tacttgttct gccaaagggg aatggcaggc ccttaacgga ccctaaggcg tatcgcccgg 3420 tcaccttgct gccggtcctg ggaaagatct tggagaaggt attattgcag tgtgctcctg 3480 gcctcaccca tagtattagt ccgcgccagc acgggttctc tcctggacgc tcaacggtga 3540 cggcgctgcg aactctgctg gacgtgtcgc gcgcctcgga gcagaggtac gtaatggcca 3600 tattcttgga catcagtgga gctttcgata acgcgtggtg gcccatgata atggtgaagg 3660 ccaagcggaa ctgtccgccc aacatctatc ggatgctgac ggactatttc cgcggacgcc 3720 gtattgccgt tgtcgcgggg gaatgtgcgg aatggaaggt gtccacgatg ggctgtccgc 3780 agggctcagt gctcgggccg acgctctgga acgttctgat ggatgacctg ctcgccttgc 3840 cgcaggggat agagggaaca gagatggtcg cctatgccga tgacgtgacg gtactggtta 3900 ggggtgactc acgggcgcag cttgagagga gagcgcacgc cgtgctagga ctcgcagagg 3960 ggtgggcgag caggaataag ctcgattttg ccccggcgaa gtcccgatgc ataatgctga 4020 ggggaaagtt tcagcgtccc cctatagtcc ggtacggcag tcatgtcatt cggttcgaga 4080 accaggtgac ggtgttgggc gtctcttcga cgattgcctc tctttcgcgg catgcggcgg 4140 ccattggcga gagggcgagc aggtgcttcg gcaagatgtc tagagtttcg gcttcggctt 4200 gggggctgcg atatagggct ttgcgtgtct tgtacatggg cacttatgtt acaaccctta 4260 cctatgcggc ggccgtatgg tatttgcggg ctgctgtgca cgtcgtgcgc agcgtgctgc 4320 ttaggacgca gcgcccgtcg ttgacgctgc taacgaaggc ctaccgttcg tgcagcacgg 4380 ctgctttgcc ggtgttggcg ggcgtcctgc cggcggacct ggaggtgact cgtgctggac 4440 ggaggatgcg ggagtgcgaa ggattggcgc gggagttggc ggcggagaga cgacgacgga 4500 tcgacggcga tgtcttggta gtttggcaga acaggtgggt gtctgagggt aaggggaggg 4560 aactgtacaa gttctttccc gatgttgcgg acaggaagaa ggcaacgtgg atggagccgg 4620 actatcagac ctcgcagatc ctcacgggtc atgggatctt taataagcgg ttggcggata 4680 tgcgactgag ggaggggcat gcttgcgact gcggggcggt tgaggaggat agggaccatg 4740 tcctgtggga gtgtcctctc tatgacgaaa tccggggcag gatgctcgat ggaatctcgc 4800 ggtctgaggt gggcccagtt taccacgcgg acctggtcag ggacgagaaa aattttcggc 4860 tcttgcgcga gttcgcgata catggcacac ggcgcgcact gcgcgcgaat cacgaggact 4920 gcctggcgac gaagaatgtg ggtgatgtta gcagccccag aacggggaga atgctagtgg 4980 agtgaagggg acggtgttag tttgtagaac cgatcggtac cttggtgccg tgaagttcat 5040 gcttcggtcc taataaccgc aaggttggtg ggaccatggg aggtggtggg aa 5092 // ID BR1_CT repbase; DNA; INV; 242 BP. XX AC J01053; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Chironomus balbiani repetitive sequence. XX KW Satellite; Simple Repeat; BR1_CT; Repetitive sequence. XX OS Chironomus thummi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RP 1-242 RA Baeumlein H., Wobus U., Gerbi A.S. and Kafatos C.F.; RT "the basic repeat unit of a chironomus balbiani ring gene."; RL Nucleic Acids Res 10(13), 3893-3904 (1982). XX DR GenBank; J01053; Positions 1 242. XX SQ Sequence 242 BP; 100 A; 62 C; 51 G; 29 T; 0 other; ccaagatcaa gaccaagcaa gggatctaag cctagcaagg gatccaaacc agaaggacca 60 tctaaaccaa aatccagacc cgaaaaacca tctaaaccaa gcaagggaac taagccacgt 120 ccatctaagc ctagcaaggg aactaaacca cgtccatcta agcctagcaa gggatcaaaa 180 ccaaaaccag aaagatgcgg tagtgcaatg agaagagttg aaagcgagaa atgcgctgca 240 ac 242 // ID R2-1_TSP repbase; DNA; INV; 3690 BP. XX AC . XX DT 29-MAY-2009 (Rel. 14.06, Created) DT 21-JUL-2009 (Rel. 14.06, Last updated, Version 2) XX DE A family of R2 non-LTR retrotransposons in the Trichinella DE spiralis genome - a consensus. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2-1_TS; KW R2-1_TSP. XX NM R2-1_TS. XX OS Trichinella spiralis OC Eukaryota; Metazoa; Nematoda; Enoplea; Trichocephalida; OC Trichinellidae; Trichinella. XX RN [1] RP 1-3690 RA Kapitonov V.V. and Jurka J.; RT "A family of R2 non-LTR retrotransposons in the non-segmented RT roundworm genome."; RL Repbase Reports 9(6), 1150-1150 (2009). XX DR [1] (Consensus) XX CC R2-1_TSP is a very young family of R2 non-LTR retrotransposons. CC The consensus sequence was derived from thee copies 99% identical CC to each other. R2-1_TSP elements are inserted at the same target CC site in 28S rRNAs. XX FH Key Location/Qualifiers FT CDS 132..3386 FT /product="R2-1_TSP_1p" FT /note="contains the reverse transcriptase and FT restriction enzyme-like endonuclease domains." FT /translation="MSNRLANTAAAGGVPEKTSGTLDIPGQPSSSGEKRAI FT SYPGPFGCNSCSFTSTTWLSLELHFKSVHNIRDFVFLCSKCKKSWPSINSV FT ASHYPRCKGSVKAAVVPTSLANTCTTCGSSFGTFSGLQLHRKRAHPDVFAA FT SCSKKTKARWSNDEFTLLARLEAGLDPACKNINQVLAERLMEYNITRGVEM FT IKGQRRKDQYKALVRQLRSNSETQQCVGLAGSMDSNVPANDTSSSVASEVS FT ITYPEYGAVMSCDLIKEATGMAIVDINELQSNLRKAFLSGRKLPMKFHGAR FT ETAQKKMANPRVAKFKRFQRLFRSNRRKLASHIFDKASLEQFGGSIDEASD FT HLEKFLSRPRLESDSYSVISGDKSIGVAHPILAEEVELELKASRPTAVGPD FT GIALEDIKKLNTYDIASLFNLWLKAGDLPASVKASRTIFLPKSDGTTDISN FT CRPITIASAMYRLFSRIITRRLAARLELNVRQKAFRPEMNGVFENSAILYA FT LIKDAKVRSREICVTTLDLAKAFDTVPHSRILRALRKNNVDPESVDLISKM FT LTGTTYAEIKGLQGKLIPIRNGVRQGDPLSPLLFSLFIDEIIGRLQACGPA FT YDFHGEKICILAFADDLTLVADSAAGMKILLKAACDFLEESGMSLNAEKCR FT TLCITRSPRSRKTFVNPAAKFIISDWKTGISSEIPSLCATDTFRFLGHTFD FT GEGKIHIDTEEIRSMLKSVKSAPLKPEQKVALIRSHLLPRLQFLFSTAEAD FT SRKAWLIDSIIRGCVKEILHSVKAGMCTDIFYIPSRDGGMGFTSLGEFSLF FT SRQKALAKMAGSSDPLSKRVAEFFIERWNIARDPKVIEAARRVYQKKRYQR FT FFQTYQSGGWNEFSGNTIGNAWLTNGRARGRNFIMAVKFRSNTAATRAENL FT RGRPGTKECRFCKSATETLAHICQRCPANHGLVIQRHDAVVTFLGEVARKE FT GYQVMIEPKVSTPVGALKPDLLLIKADTAFIVDVGIAWEGGRPLKLVNKMK FT CDKYKTAIPAILETFHVGHAETYGVILGSRGCWLKSNDKALASIGLNITRK FT MKEHLSWLTFEIIFITQISRIYNSFMKK" XX SQ Sequence 3690 BP; 959 A; 918 C; 870 G; 943 T; 0 other; ctcctgacta acctgatttc gtccgtgcgg cggcgttttc ttttcgctct ccgctcgtcg 60 aaatttgctg tagttgattc gcttttcttt gcgttttctt ctactttcgc agttttttct 120 gcattgccac gatgtcaaac cgccttgcca atactgctgc ggctggtggg gttccagaga 180 aaacctcggg aactttagac attcctggcc aaccctcttc atccggtgaa aagcgtgcga 240 tctcttaccc tggtccattc ggttgcaatt cgtgttcgtt tacgagtacg acttggctct 300 cattggaatt gcattttaaa agcgtccata atattcgtga cttcgtcttc ctctgctcta 360 aatgtaaaaa aagctggcca tcgatcaact ccgtagctag ccattaccct cggtgcaaag 420 gtagcgtcaa ggctgcagtt gttcctacat ctttggcgaa tacgtgcacc acgtgcggct 480 caagcttcgg tactttcagt ggtcttcaac tccatcggaa aagagcacat ccggacgttt 540 ttgctgcttc ttgtagcaaa aaaacgaagg cgcgttggtc taacgacgaa tttacccttc 600 tggcgagact cgaagcaggt ctggatccag cctgtaaaaa cattaaccaa gtactagcgg 660 aaaggttaat ggagtataac atcaccagag gcgtagaaat gataaaaggc caacgtagaa 720 aagatcagta caaagcgctc gttcgtcaac tccggtcaaa ttctgaaaca cagcaatgtg 780 taggtttagc cggaagtatg gattcgaacg taccggccaa cgatacatcg tcttccgttg 840 catcagaggt cagcattacg taccctgagt acggggccgt gatgtcgtgc gacctaatta 900 aagaagcgac tggtatggcc atagttgaca tcaacgagtt gcaaagcaac ttacgaaaag 960 ccttcttgtc cggccgcaag cttcccatga agttccatgg agcgcgtgaa accgcccaga 1020 agaaaatggc caacccccgt gttgcgaaat tcaagcgttt ccaacggttg tttcgaagca 1080 acaggaggaa actggccagc cacatcttcg acaaagcctc actggagcaa ttcggtggca 1140 gcatcgatga ggcatctgac catttagaaa agttcctctc ccggccaaga ttggagtccg 1200 attcttattc cgtgataagc ggtgataagt caatcggagt tgcacatcca attttggccg 1260 aggaggtgga attggaatta aaagcctccc gaccaaccgc tgttggtccg gatggaattg 1320 cactggaaga cattaaaaaa ctcaatactt acgacatagc cagtcttttc aacctctggc 1380 taaaagctgg cgacctaccc gcatcggtga aagccagtag aaccatcttt ttgcccaaaa 1440 gcgacggcac caccgacata tcgaactgtc ggccaatcac aatcgcatcc gccatgtatc 1500 ggctgttcag cagaataata acgcgacgtc tggcagccag gttggaattg aacgtgcggc 1560 aaaaagcgtt ccggcctgaa atgaacggcg tattcgagaa ctccgccatt ttatacgccc 1620 tcatcaagga tgctaaggtc aggtcaaggg aaatttgcgt aactacgctc gaccttgcca 1680 aggcctttga cacggtgccc cactcacgca ttttacgagc cctgaggaaa aataatgtcg 1740 acccggaatc cgtcgacctg atttcgaaaa tgttaacggg tacgacttat gcagaaataa 1800 aagggctcca gggcaaactt atacccattc gcaatggagt caggcaaggt gaccccttgt 1860 cgcccctatt atttagtcta tttatagacg agataatagg tcgcctacaa gcctgcggcc 1920 ctgcctacga tttccatggc gaaaaaattt gcatcctggc tttcgccgat gatctgacgc 1980 tggtggctga cagcgcagct ggtatgaaga tccttctaaa agcggcttgt gacttcctgg 2040 aggaatctgg aatgtcactt aatgcagaga aatgccgcac tctctgtatt acaagatctc 2100 cccgaagccg caagactttc gtcaacccag ctgccaaatt catcatcagc gattggaaaa 2160 cgggtatcag ctcagaaatc ccctccctgt gtgcgacgga cacctttcgt ttcctggggc 2220 acaccttcga tggagaagga aagatccaca tcgatacgga ggaaattcga tccatgctca 2280 aatcggtgaa gtcagctcca ctgaaaccgg aacagaaggt ggctttgata cggtcacacc 2340 ttcttccccg ccttcagttc ctgttttcta cagctgaagc tgacagccgg aaagcctggt 2400 tgatcgattc catcatcagg gggtgtgtga aggagatctt gcactcagtg aaagctggta 2460 tgtgcactga tatcttttac ataccctcta gagacggtgg aatgggattt acttccctcg 2520 gggagttttc tcttttcagc aggcagaagg cactcgccaa gatggctgga tcgtcggacc 2580 ccctctcgaa acgggttgct gaattcttca tcgaaaggtg gaacatcgcc cgtgacccga 2640 aagtcattga agctgctcgg cgcgtctacc agaaaaaacg gtaccaacgc tttttccaga 2700 cgtaccagag cggtggatgg aatgaatttt cgggaaacac tattgggaac gcctggttga 2760 caaacggccg tgcccgcgga agaaatttca taatggctgt gaaattccgt tccaacaccg 2820 cagccacccg ggccgaaaac ctacgaggcc gccccggcac gaaagaatgc cggttttgca 2880 agagtgccac cgaaactttg gcacacattt gccagaggtg tccggcaaat cacggcttgg 2940 ttatccagcg ccatgacgca gtcgtaacat tcctggggga agtggcgcgg aaggaaggtt 3000 accaggtcat gatagagcct aaggtgtcaa ccccggtcgg cgcgctcaag cccgacctcc 3060 tactcatcaa agccgacact gcattcattg tggatgtagg cattgcatgg gaaggtggac 3120 gcccactaaa gctggtcaac aaaatgaaat gtgacaagta caaaactgcc atcccggcaa 3180 ttttggaaac atttcacgtt ggccatgctg agacgtacgg cgttattctg ggcagccgcg 3240 gatgctggct caagagcaac gacaaggcgt tggcatcaat tgggctcaat atcacacgga 3300 agatgaaaga acacctgagc tggttgacgt ttgaaattat atttataact caaataagcc 3360 ggatttataa ctcattcatg aaaaaatgag gtttttgttt tcttttttcc ttttaccatt 3420 cttgttccat tgttgttatt tgctttaatc ctgtatttta ccgccggcaa ttccattgtt 3480 attattactg ttactgttat tattgttact attgttttta cttttactta ctactgttat 3540 tatactttaa ttcgttaact tacgttattg ttaccactac ttactttgct ctctcgcaaa 3600 cgttcgttgt tgtttctttt ggaccaggtt tagagaaatc gcacgcacag cggaactgga 3660 ccgcttaagc cagaaatagt aaagtaacaa 3690 // ID Rehavkus-2_CS repbase; DNA; INV; 8581 BP. XX AC . XX DT 30-APR-2006 (Rel. 11.04, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed Rehavkus-2_CS DNA transposon - a DE consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus group; Rehavkus-2_CS. XX NM Rehavkus-2_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-8581 RA Kapitonov V.V., Gentles A.J. and Jurka J.; RT "Rehavkus-2_CS, a family of Rehavkus DNA transposons from the sea RT squirt Ciona savignyi genome."; RL Repbase Reports 6(4), 194-194 (2006). XX DR [1] (Consensus) XX CC Rehavkus-2_CS belongs to the Rehavkus group of MuDR "cut and CC paste" DNA transposons, which are widespread in different CC metazoa, including insects, sea squirts, sea urchin and fish. The CC genome harbors several copies of this transposon. Its ~2-kb CC inverted termini are composed of a 343-bp terminal inverted CC repeat and 185-bp subterminal minisatellite-like unit. The CC transposon is flanked by a 9-bp target site duplication and CC encodes a 1071-aa Rehavkus-2_CS transposase. The C-terminus of CC the transposase is composed of the Ulp1 cysteine protease domain CC (pos. 830-990) and PHD zinc finger (pos. 1023-1070). XX FH Key Location/Qualifiers FT CDS 2930..6142 FT /product="Rehavkus-1_CSp" FT /translation="MFLIGVGDALITLSNLIIESCIYSVVFMTLFDSESVL FT QAIHSMEHFDSSSFHSWELLSDKLCKRNLSKIERKNDIKSLKYLYQKYKDE FT NNLHSSTVSAYPMDFSNDRDKNKTTNTSNSSNLISSSNSSASQSPEHVSSS FT EVSDIDDEMEECELWRPSPPLVRSSVFTLDENDWNEVRASVSGGRVKTKNW FT GDIFRRSLKSSNPFCVFSFMSHHWNEKRTSSRVFNCKGVCKFKTCDIKIEI FT NLYSEMKKEFHVTYTGLLQHGKLEVRSSHLKGQRRQEMRKELANVNTRAVY FT LNNLRQLPADVFESGNRDCAPSKSVLKKVRFQARKIEVASENDTIALETLK FT NAQSIDKDHSQTLQVISSTPPAIMLWSDETMRIFHDRSQTDIVYIDATGSI FT ITSEKGKRPYYAYEIVVRHPIKGKCPLAVATYVAEQHNIASICNFMSMFRS FT AENRVVGKTLAKLFICDGSMALIRSILRCFMDDSLEAYLTRCFSLVTGKAT FT DGTFSKPFLHLCGSHTMKNMKKLSGLVRFRRQETMYILGLIMCADTMSDIT FT DMLRHIVILFSTKNESLSNFSWNFLSQKIKRETKMDESFFSVDDDEPDDDL FT TPEFTNCNQFTKYFSSVISQARNEAEFSEVSGTNPFYCPELITKFIENWLP FT KLPLWTALTIGNLGRHGSSDVYKANCKQFIKLSKLNEQNLTRDNRTQGIME FT KSQQELKRTRCHGRRFKGLGEFAKIYHDEHNGLIREFQDGLKSLRKRKQTN FT SQPEQEDQERWGKRSKRGDNNVGKYLQPPTKSLRGPSNSNKKGPSLVLDHL FT SKEECAPTGLPQRRSASKIEPIFAVDEAALKMLTKRRKWLDDRHIDATMFL FT LRQHFPNIDGLQSCLAYPNSFKSLTYTPRNKFVQIMNIHGDHWVTVSNKFS FT TDNSTINIYDSLPSELSKTDKYNISTLLNVDSTKLTLRFPKVPKQVIGSGN FT CGVYAISFAVALCMGLNPENLVYPINRLREDIKGMLEKRQITRLFSETSCK FT TKQLKAQNSSLKVDLFCHCRLAGDDNGYIQCSRCDGWYHFSCTRTSIPENV FT IGSNVKWYCDNCVL" XX SQ Sequence 8581 BP; 2721 A; 1555 C; 1654 G; 2650 T; 1 other; accaaagttc ttttacctag aaatataaaa cgcggaaaaa aacggaagac ggaagtgaaa 60 agttaaaccc tttaacaagg ggctacggac ctaaaaatta cagaatttgt taattgaacc 120 gtgctctttc gaatgataag ttttgattta aaaaaaatgt aaaatattaa tatcgaatat 180 gaatattatt aatattcacg catttataga gttttcgcca gttattcctt tggatatgtt 240 gtagcccttg ccaatacctt caaattgata tataaattat atacaatatc cgcggggggc 300 tatatgcacg aagcgtctga atattcataa ctcgttattt agggatgctg cggttttcat 360 atccagatga acatttatgg agtttttgcc aaaaaatcct ttgtatatgt tgtagccctc 420 tataaatacc tttaatttaa tgtatagttt atatagaata acccgcaagt aaactatatg 480 cacgaagtgt ctgaatattc ataactcttc atcaagggat gtagcggaag atgctgcagt 540 tttcttatct agatggatat ttatagagtt ttcgccaatt attcctttgg atatgttgta 600 gcccttataa ataccttcaa tttaatgtaa agtttatata ctataaccgc gggaagctat 660 atgcacgaag cgtctggata ttcataactc gctttacaga ggagtgcatt ggaagatgct 720 gaagttttct tatccagatg gacatttata gagttttcgc caattattcc tttggatatg 780 ttgtagccct tgccaatacc ttcaaattga tatataaatt atatacaata accgcgggaa 840 agtatatgca cgaagcgtct gaacattcat aactcgtcat caagagatgc attggaagat 900 gctgcggttt tcttatctag atggatattt atagagtttt cgccaattat tactttggat 960 atgttgtagc cctttccaat accttcaaat tgatatatag attatataca ataaccgcgg 1020 gaaactatat gcacgaagcg tctgaatatt cataactcgc tttacagggg gtgcactgga 1080 agatgctgcg gttttcttat ccagatggac atttatagag ttttcgccaa ttattccttt 1140 ggatatgttg tagcccttgc caataccttc aaattgatat ataaattata tacaataacc 1200 gcgggaaact atatgcacga agcgtctgaa tattcataac tcgtcatcaa gggatgcatt 1260 ggaagatgct gcggttttct tatctagatg gatatttata gagttttcgc caattattcc 1320 tttggatatg ttgtagccct trccaatacc ttcaaattga tatatagatt atatacaata 1380 accgcgggaa gctatatgca cgaagcgtct gaatattcat aactcgcttt acagggggtg 1440 cactggaaga tgctgcggtt ttcttatcca gatggacatt tatagagttt tcgccaatta 1500 ttcctttgga tatgttgtag cccttgccaa taccttcaaa ttaatgtata gtttatatac 1560 aataaccgcg ggaaagtata tgcacgaagc gtctgaatat tcataactcg tcatcaaggg 1620 atgcattgga agatgctgcg gttttcttat ctagatggac atttatagag ttttcgccaa 1680 ttattccgtt tggatatgtt gtagcccttt ccaatacctt caaattgata tatagattat 1740 atacaataac cgcgggaagc tatatgcacg aagcgtctga atattcataa ctcgctttac 1800 agggggtgca ctggaagatg ctgcggtttt catatccaga tgaacattta tagagttttt 1860 gccaattatt cctttggata tgttgtagcc cttaccaata ccttcaaatt gatatataga 1920 ttatatacaa taaccgcggg aaactatatg cacgaagcgt ctgaatattc ataactcttc 1980 atcaagggat gcattggaag atgctgcggt tttcttatct agatggatat ttatagagtt 2040 ttcgccaatt attcctttgg atatgttgta cccttttaaa taccttcaat ttaatgtata 2100 gtttatatac gataaccgcg ggaagctata tgcacgaagc gtctgaatat tcataactcg 2160 ctttacagag gagtgcattg gaagatgctg cggttttctt atccagatgg acatttatag 2220 agttttcgcc aattattcct ttggatatgt tgtagccctt tccaatacct tcaaattgat 2280 atatagatta tatacaataa ccgcgggaaa ctatatgcac gaagcgtctg aatattcata 2340 actcgttatc agggggtgca ctggaagatg ctgcggtttt cttatccaga tggacattta 2400 taaagttttc gccaattatt cctttggata tgttgtagcc ctttccaata ccttcaaatt 2460 gatatataga ttatatacaa taaccgcggg aaactatatg cacgaagcgt ctgaatattc 2520 ataactcgct ttacaggggg gtgcactgga agatgctgcg gttttcttat ccagatggac 2580 atttatagag ttttcgccaa ttattccttt ggatatgttg tagccctttt aaataccttc 2640 aatttaaggt atagtttata tacgataacc gcgggaagct atatgcacaa agcgtctgaa 2700 tattcataac ccgttataca gaagctgatt tgatgagaga tgttaggatt ttttttattt 2760 aaatggatgg ctttttgcca acttatgcta taaatttgtg cttgtatttt gttgttgtaa 2820 atttgtttgc acaaacatgt tgttgtttac ttgcaattga cttgtttggc tcgagcgata 2880 tctatatggt gcgaaagaag aagttatcgg ccccggttga tggtgcggca tgtttttaat 2940 cggcgtcggt gacgctttaa tcacgttatc aaacctcatt atcgaaagtt gtatatattc 3000 tgtagttttc atgacattgt ttgattcgga atcggtttta caggcaattc attcgatgga 3060 gcattttgac agttcgtcct ttcatagttg ggaacttctg tcagacaagc tgtgcaaacg 3120 aaacctctcc aaaattgaga gaaagaatga cattaaaagt cttaaatatt tatatcaaaa 3180 gtataaagat gaaaataatt tacactcttc cacagtttcg gcatatccga tggatttctc 3240 aaatgatcgg gataagaaca aaacaacaaa tacaagtaat agtagtaatc ttatttcgag 3300 ttcaaattca tcagcgtcac agagccccga acacgtgagt tcaagtgagg taagtgatat 3360 cgacgacgag atggaagaat gcgaattatg gcgcccatcc ccaccattgg taaggtcttc 3420 tgtatttacg ttggatgaaa atgattggaa tgaggtgaga gcatccgtgt ctggtggacg 3480 tgtgaaaacg aaaaattggg gtgacatttt tcgacgaagt ttaaaaagta gtaatccttt 3540 ttgtgttttc agttttatgt cccaccattg gaacgaaaag cgtacgagta gtcgagtgtt 3600 taattgtaaa ggtgtttgca agtttaaaac atgtgacata aagatagaaa ttaatttgta 3660 cagcgaaatg aaaaaagagt ttcacgtgac ttacactggg ttgctacaac acggtaaact 3720 tgaggtccgg tcctcacatt tgaagggaca gcgtcgacag gaaatgcgca aggaacttgc 3780 caatgtaaat actagggcag tttatcttaa taatttacgg caattaccag cagatgtctt 3840 tgagtctgga aatcgtgact gtgctccatc aaaaagtgta ctgaagaagg ttcgatttca 3900 ggcgagaaag atagaagtcg cgtcggagaa tgatacaatt gcgctcgaaa ctttaaaaaa 3960 tgcacaaagt attgacaaag atcacagtca aacccttcag gtgatttctt ccacacctcc 4020 agcgataatg ttgtggtctg acgaaacgat gagaattttc cacgatcgca gtcaaactga 4080 tatagtatat atcgatgcta ccggtagcat aattacttca gagaagggta aacgtccgta 4140 ctatgcctac gaaatagttg ttcgccaccc aataaaagga aaatgtccat tggctgttgc 4200 aacttacgtt gcagagcaac ataacatagc ttcaatatgt aatttcatga gcatgtttcg 4260 ttcggcagaa aatcgtgttg ttggtaaaac tttagcaaaa ctttttattt gtgatggtag 4320 tatggcttta attcggtcaa tccttcgctg ttttatggat gactcacttg aagcttattt 4380 aacacgatgt ttttctttgg tgacaggtaa agcaacggat ggtacatttt ccaaaccatt 4440 tttacatttg tgtgggtccc acacgatgaa aaatatgaaa aaacttagtg ggttggtaag 4500 atttcggcgt caagagacca tgtatatact tggtttaatt atgtgtgccg atactatgtc 4560 ggatataaca gacatgttac gacacatagt tattttattt agtaccaaaa atgaatcttt 4620 atctaatttt agttggaact ttttgtctca aaaaattaaa agagaaacga agatggacga 4680 gtccttcttt tccgttgacg atgatgaacc agacgatgat ttaacccccg aatttacaaa 4740 ttgtaatcaa tttacaaaat atttttcaag tgttatatct caggcccgaa atgaggctga 4800 attttcagaa gtaagcggta caaacccatt ttactgtcca gaattaatca ccaaatttat 4860 tgaaaattgg ctacctaaac ttccgttatg gacggctctg acgatcggta atcttggccg 4920 gcacggtagt tcggacgttt acaaggctaa ctgcaaacag ttcattaaat tgtcaaaact 4980 aaacgaacaa aatttaacac gagacaatcg aacacagggg ataatggaaa aaagtcagca 5040 ggaattaaaa cgcacacgtt gtcacggtag acggtttaaa ggcttagggg agtttgcaaa 5100 aatataccat gacgaacaca acggcttaat acgtgaattt caggacgggc ttaaatcgct 5160 aagaaaaaga aaacaaacca actctcaacc cgagcaagag gaccaagagc gatggggtaa 5220 gagatcaaaa cgcggagata ataatgtagg aaaatatctt caacctccca ctaaatctct 5280 tcgtggtcca agtaattcga ataaaaaagg tccttctctg gttttggatc atttgtctaa 5340 ggaggaatgc gcccccacgg ggctaccgca aagacgcagc gccagcaaaa tagaacctat 5400 ttttgctgtt gacgaagcag ctctaaaaat gctgacgaag cgaaggaagt ggttggacga 5460 cagacatata gatgccacca tgtttcttct tcggcagcat tttccaaata ttgatgggtt 5520 gcaaagctgt ttggcttatc caaacagttt taaatctctt acttatacgc ctcgaaacaa 5580 atttgttcaa attatgaata tccatgggga tcattgggtt accgtctcca acaaattttc 5640 cacagataac agtacaatca atatatatga cagtttacca tctgaattga gtaaaacgga 5700 caaatataat atttctactc ttttaaatgt agactcaacc aaattaacgc ttcgttttcc 5760 aaaggtgccc aaacaagtaa ttggttcagg aaattgcggc gtctatgcga tttcgtttgc 5820 ggttgccttg tgcatggggc taaatccaga aaatctagta tatccaatta atcgtctccg 5880 agaggatatt aaaggtatgc tagaaaaaag gcagattact cgcttgtttt cggaaacttc 5940 atgtaaaaca aaacaactta aagcacaaaa cagttctctt aaagtggatc tcttctgtca 6000 ttgcaggctt gctggtgatg acaacggtta catccaatgc tccagatgcg acggatggta 6060 ccatttttct tgtaccagga cgtctattcc agaaaatgta ataggatcca acgtaaagtg 6120 gtactgtgat aactgtgtgc tttaagcggg tagtcgaaac gcacacttcc aaacaatgct 6180 aaggtctttt ctaccaaact tgtgatggac ttttctccac acacctgaac tgttttttta 6240 cgttactgtg ttactcgtag aacgcaaaaa aacaaaaagc tgtgtggaga acatgcactt 6300 acatctcttt atctgtcgca tctcctatgt taaaatacgt gaaatggtac aatatagtaa 6360 gacattataa ataaggtgcg ttcaaataaa atagaaaaaa tgcgattatc tatctttgta 6420 cagccaaagt ctgcagtggt gcaagttagt ttatcgtatt tttttcctaa tggcatttct 6480 aaacatcggc gaccaagaag gtgttaaaaa gatgaacttg tttgtcgaaa tgagataacg 6540 caaataccta tttatctcaa ttagacaact gcagcatctt ccaatgcacc ccctctgtat 6600 aacgggttat gaatattcag acgcttcgtg catatagctt cccgcggtta tcgtatataa 6660 actatacctt aaattgaagg tatttgaaag ggctacaaca tattcaaagg aataattggc 6720 gaaaactcta taaatgtcca tctggataag aaaaccgcag catcttccag tgcacccccc 6780 tgtaaagcga gttatgaata ttcagacgct tcgtgcatat agtttcccgc ggttattgta 6840 tataatctat atatcaattt gaaggtattg gaaagggcta caacatatcc aaaggaataa 6900 ttggcgaaaa ctctataaat gtccatctgg ataagaaaac cgcagcatct tccagtgcac 6960 ccccctgtaa agcgagttat gaatattcag acgcttcgtg catatagttt cccgcggtta 7020 ttgtatataa tctatatatc aatttgaagg tattggaaag ggctacaaca tatccaaagg 7080 aataattggc gaaaactcta taaatgtcca tttcgataag aaaaccgcag catcttccag 7140 tgcacccccc tgtaaagcga gttatgaata ttcagacgct tcgtgcatat agccccccgc 7200 ggatattgta tataatttat atatcaattt gaaggtattg gaaagggcta caacatatcc 7260 aaaggaataa ttggcgaaaa ctctataaat gcccatctgg ataagaaaac cgcagcatct 7320 tccagtgcac ccccctgtaa agcgagttat gaatattcag acgcttcgtg catatagctt 7380 cccgcggtta ttgtatataa tctatatatc aatttgaagg tattggcaag ggctacaaca 7440 tatccaaagg aataattggc gaaaactcta taaatgtcca tctggataag aaaaccgcag 7500 catcttccag tgcaccccct gtaaagcgag ttatgaatat tcagacgctt cgtgcatata 7560 gctccccgcg gatattgtat ataatttata tatcaatttg aaggtattgg caagggctac 7620 aacatatcca aaggaataat tggcgaaaac tctataaatg tccatctaga taagaaaacc 7680 gcagcatctt ccaatgcatc tcttgatgac gagttatgaa tgttcagacg cttcgtgcat 7740 atactttccc gcggttattg tatataattt atatatcaat ttgaaggtat tggaaagggc 7800 tacaacatat ccaaaggaat aattggcgaa aactctataa atgtccatct ggataagaaa 7860 accgcagcat cttccagtgc acccccctgt ataacgggtt atgaatattc agacgcttcg 7920 tgcatatagc ttcccgcggt tatcgtatat aaactatacc ttaaattgaa ggtatttata 7980 agggctacaa catatccaaa ggaataattg gcgaaaactc tataaatatc catctagata 8040 agaaaactgc agcatcttcc gatacatccc ttgatgaaga gttatgaata ttcagacact 8100 tcgtgcatat agtttcttgc ggttattcta tataaactat acattaaatt aaaggtattt 8160 ataagggcta caacatatac aaaggatttt ttggcaaaaa ctccataaat gttcatctgg 8220 atatgaaaac cgcagcatcc ctaaataacg agttatgaat attcagacgc ttcgtgcata 8280 tagccccccg cggatattgt atataattta tatatcaatt tgaaggtatt ggcaagggct 8340 acaacatatc caaaggaata actggcgaaa actctataaa tgcgtgaata ttaataatat 8400 tcatattcga tattaatatt ttacattttt tttcaaatca aaaattatca ttcgaaagag 8460 cacggttcaa ttaacaaatt ctgtaatttt taggtccgta gccccttgtt aaggggttta 8520 acttttcact tccgtcttcc gtttttccgc gttttatatt tctaggtaaa agaactttgg 8580 t 8581 // ID Gypsy-16_RP-LTR repbase; DNA; INV; 265 BP. XX AC ACPB02041007; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_RP_; KW Gypsy-16_RP-I; Gypsy-16_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02041007; Positions 31062 31326. XX SQ Sequence 265 BP; 85 A; 49 C; 78 G; 53 T; 0 other; tgtaacgaag ttgacctggg cggagaagaa aggcctagat tgttctagag gttgaacgac 60 caaagtcagt aacaggccgc cactgcagaa acaaaagggg cggtatctac ttgtggcaga 120 aaagaaaggc ctagattgtt ctaggggtta agcgactaaa gccggtaaca ggccaccact 180 gtgtagacaa aagggccggt ctttacttga actagaaagg caagggctag attgttcaag 240 gagttagtga cctgtgtaaa agcca 265 // ID Kiri-26_AAe repbase; DNA; INV; 3434 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-26_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3434 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 721-721 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 129..353 FT /product="Kiri-26_AAe_1p" FT /translation="MLSVKSXRVRNLALRMKQKKLIQSVYVRNDSISIRLP FT CQKKYIPIEDTGHLLKLMNASSCTNESSIFYDAESSHF" FT CDS 398..3238 FT /product="Kiri-26_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPNNCTNLRIGHLNVRGLECHADEMKILLDKNQYHFF FT AVTETKMKASSSMGPVRVPSYNFXKHCLPAGRGRGSKSCGGIGLYFLKGLK FT VSPVMKSTFNPDXPVGQRFEYLVVQTKINELTVGIAIIYNPIVTNPNFTAE FT YEKLLFDIQDLGVDRLYLLGDFNINVTAPSPTGNHEALLRIHQTFNLTILP FT TTPTRITDRSATTIDLMVTDCPQSIVTAKAVSNSISDHETIYLISNVRCRK FT PTAHRLTVRDFRAIDTTRLQADFQGIRFENILDTRDVNVKTQSLTSELCNL FT MDRHAPEKIITVKDKRTPWMTTEIEQAIVVRNLAHELYSRNPHRRRDDAQW FT LEYTQKRDRAAILVSQAKKRYGERYFAPHLPAKKLWNNLRREGVHNSTKKE FT LPEEQIDVEQLNSFFCEGHRQLQIQVPATSTRNELHRAIRDEGVREQFNFC FT HTNAGEVASKMNQIQSNATGADGIPISFVKLLCPFIYPVLVDLYNSVIDSR FT TFPVMWKKGIVTPIPKVPNPLXPKDFRPITVLPAISKVLEKVLLDQIVEYI FT DAPGSHLLSRCQSGYRKKYSTTTALAKITHDIYNSFDGDHCTLMVLVDFSL FT AFNCVNHLKLKSKLSEEFCFSQDACALVTSFLGHRSQAVKVGSTVSAVHPL FT TDGTPQGSCLSALLFSLYVNSLPQGLKCSYQLYADDLQIYVSGPKQDIERL FT VALVNTDLQHIERWAGINSLFPNPKKTQAIIFCREGTITPITDITFCAEAI FT PLSDSVTNLGLQMDRNLRWLTQVNGITKKVFGTLRTFRRFSPVLTTQTRKK FT LVQAVVMPFFTYCDVVYHPGLSVAQKEQLNRCFKAAVRFVYRLRRRESTEE FT VRNTILGHDLATNYRNRICSFMRQGYAKDLPDYLLQHLQRGTQERTRCFTI FT PAHTTSQRKSILIAGALSWNGVPLQIKLEPTINSFKAAVKRWQ" XX SQ Sequence 3434 BP; 1023 A; 872 C; 717 G; 817 T; 5 other; tggtagtatt cagtagcgct gaagtacgct cggaagtttt gaagaaatac ttccagttac 60 acaaagaagc caagttgtgt aatttggaaa aggggcttcc tctacagtat cgcttcaccg 120 ttaatgagat gctctctgtt aaaagctwcc gtgttcgaaa ccttgcgcta agaatgaagc 180 agaaaaagct cattcaatcg gtgtacgtca gaaacgacag catatccata cgtctaccgt 240 gccagaaaaa gtacattccc atcgaagata ccggacatct gcttaaattg atgaacgcaa 300 gctcttgcac taacgagtcc tcgattttct acgacgcaga atcctcacat ttctaaatat 360 ttaatcaagc gctaacgatt cctatcacga cgaaaatatg cccaacaatt gcacgaacct 420 ccgtataggc caccttaacg tccgtggact agaatgtcac gcggatgaaa tgaaaattct 480 actcgacaag aaccagtacc actttttcgc agtcaccgaa accaaaatga aagcatccag 540 ttcgatgggg cccgttcgag tcccgtctta caatttcwtg aaacactgcc taccagctgg 600 aagaggcaga ggcagcaaat catgcggtgg aataggactg tacttcctca aagggctgaa 660 ggtttcacca gtgatgaagt caactttcaa ccccgacktc cctgttggac aaaggttcga 720 ataccttgtg gtccaaacca aaatcaacga gctcactgta ggtatcgcga tcatctacaa 780 tccaatcgtt actaatccga atttcaccgc tgagtatgaa aagttacttt ttgatataca 840 agacctaggc gtcgacaggc tgtacctcct aggagatttc aatatcaatg taactgcacc 900 ttcgcctacc ggaaaccatg aagcactttt gcgaatacac caaactttta atttaacaat 960 tctgccaaca actccaacta ggattacaga caggagtgca acgactatcg acctgatggt 1020 taccgactgc cctcaatcaa ttgtgacggc caaagcagtc tcgaactcaa tttcggatca 1080 cgagacaatt tacctaatct cgaacgttcg atgcaggaaa ccaacagccc accgcctaac 1140 ggtaagagat ttccgtgcca tcgacacaac cagattgcaa gcagacttcc agggaataag 1200 attcgaaaac attttggata cgagggacgt caatgtcaaa acacaatccc tcacgtctga 1260 attgtgcaac ctgatggatc gccatgcacc tgagaagatc atcacagtca aggataaacg 1320 cacgccatgg atgacaacag aaattgaaca agcaatcgtc gtcagaaacc ttgcccacga 1380 actctattcg cgaaatcctc atcgaaggcg ggatgacgca caatggctgg agtataccca 1440 aaaacgtgac cgagctgcaa tcctggtttc acaagccaaa aagcgttacg gcgaacgtta 1500 cttcgcacca catctaccmg cgaaaaaact ctggaacaac ctaaggaggg agggcgttca 1560 caacagcacc aagaaagagc taccagagga acagatcgac gttgagcagc taaacagctt 1620 cttttgcgaa ggtcatcgcc aactacaaat tcaagtgcct gcaacctcta cgaggaatga 1680 actacatcgt gcaatcagag acgagggcgt cagggaacag tttaacttct gccataccaa 1740 cgctggtgaa gttgcctcta aaatgaacca gattcaatca aacgccaccg gtgcggatgg 1800 catcccaatc tcgtttgtaa agctgctttg tccattcatc tatcccgtac tggtagacct 1860 gtacaactcg gtgattgata gcagaacgtt tccagtcatg tggaagaagg gaatcgtcac 1920 tccgattcca aaagtgccga acccgctaca kcccaaagac ttcaggccga ttacagtgct 1980 tcccgcaata tccaaagtac tggaaaaggt tttactcgac cagattgttg agtacatcga 2040 tgctccaggc tcgcatttgc tttcgaggtg ccaatcgggt tacaggaaga aatacagcac 2100 aacaacagct ctcgccaaaa ttacacacga catctacaac agcttcgatg gtgaccactg 2160 tactctcatg gtgttagtgg acttctctct cgcattcaat tgcgtcaacc acctgaaact 2220 gaagtccaag ctcagcgaag agttttgctt ttctcaagat gcctgtgcgc tggttacatc 2280 cttccttggg catcgcagcc aagctgtgaa agttggatca acggtctctg cagtgcaccc 2340 tctcacagat ggtacgcccc agggttcctg cctaagcgct cttcttttta gcctgtacgt 2400 aaatagtctg ccgcagggac tcaagtgcag ctaccagctg tacgcagacg acctacaaat 2460 atatgtatcc ggaccaaagc aagacatcga aaggcttgta gcgttggtca acaccgatct 2520 tcagcacatt gaacgttggg ccgggatcaa ctcacttttc ccaaatccaa aaaaaactca 2580 agctatcatc ttttgcagag aaggaacaat tacaccgatt acagacatca ccttttgcgc 2640 cgaagccatc ccactatccg atagcgtcac taacttgggt ctacaaatgg acaggaatct 2700 tcgatggctc actcaggtta atggaatcac aaaaaaagtt tttggcactc tgcgtacctt 2760 tcgtcgtttc tcgcctgtgc tgactaccca aacccggaaa aaactagtgc aagccgtggt 2820 gatgcctttc ttcacctatt gcgatgtagt gtatcacccc ggcctgtccg tagcacaaaa 2880 ggagcagctg aacagatgtt ttaaggcagc ggtccgtttc gtataccggc ttcgtcggag 2940 agagtccacc gaagaagtga ggaacaccat cttgggacac gatctagcaa cgaactatcg 3000 gaaccgtatt tgcagcttta tgcggcaggg atatgccaag gatctgcccg actatttgct 3060 gcagcacctt cagcgaggaa cgcaagagcg cactcggtgc tttaccatcc cagcacacac 3120 gacgtcccaa aggaaaagta tcctcattgc aggagctctt agctggaacg gcgttcctct 3180 tcaaattaag cttgaaccaa caattaattc attcaaggca gccgtcaaac gatggcagta 3240 gataataggt acctagatta aatttaatgt ggtcgttatt gacactgaca gtcactgtaa 3300 cttgtaaaac cgtgatttaa ttactagttt ccaatgttta atatccaaat ttacttcgtt 3360 cgtttccttg acctgagata tagtttgtaa actgacaggg tatcgtaacg ttaataaatt 3420 acaattacaa ttac 3434 // ID Gypsy-19_DWil-LTR repbase; DNA; INV; 286 BP. XX AC scaffold_181074; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_DWil_; KW Gypsy-19_DWil-I; Gypsy-19_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-286 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181074; Positions 55063 54778. XX SQ Sequence 286 BP; 85 A; 51 C; 69 G; 81 T; 0 other; tgtgaagagg tgttgtccgc atcagaatgg ttatgcccgt ccctggtaat tgggagagcg 60 tctacaagag cattgagaga gcgaattgga gagtcagtcg tcagtcggca gtcgtcagtc 120 gggagttgca gttgaaagtt gaaagttgtc agtcgaaagt tgtcagtcga aagtcggtcg 180 cgatgcccaa tcaaagttgt tctcagttgt cttttatata acctaaccac ccagaatgtc 240 tataatactt taattaataa acataatcaa tataataaac tttaca 286 // ID Chapaev3-1_SM repbase; DNA; INV; 2407 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 06-JAN-2010 (Rel. 15.02, Last updated, Version 2) XX DE Chapaev3-1_SM is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-1_SM. XX NM Chapaev3-1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2407 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 50-50 (2008). XX DR [1] (Consensus) XX CC Chapaev3-1_SM belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-1_SM is a family of relatively old planarian Chapaev3 CC transposons: genomic copies of Chapaev3-1_SM elements are ~91% CC identical to their consensus sequence, which was derived from CC multiple alignment of fifty Chapaev3-1_SM elements. Chapaev3-1_SM CC contains 14-bp terminal inverted repeats and encodes a 562-aa CC transposase. CC This sequence was derived from sequence data generated by DOE CC Joint Genome Institute. XX FH Key Location/Qualifiers FT CDS 385..2070 FT /product="Chapaev3-1_SMp" FT /note="transposase." FT /translation="MPRECKNHPDKFCYVCGNFTTKLQRRTITTNLKKIYN FT LYFGCHLGDQDKPWAPHQICSACSNGLRDWVNKKKASMPFAIPMIWREPRD FT HHEDCYFCSVNMTGFSTKNKNKIVYPVMDSARRPVQHGEELPIPIPPDDGV FT DSIEDDADGDEGATGGVPGPSADPDYTLEGRNFEPKLLTQGQLNDLVRDLS FT LSKEKAELLASRLQENNLLDKNVRISYYRKRNINLATCFTVDGPLCYCHDI FT HGLFTELSQTYAASDWRLFIDSSHRSLKAVLLHNGNVKPSIPIAHSVHLKE FT TYDNMDILLKAIKYGSHQWNICGDLKVLGMLMGMQAGFTKYCCFLCLWDSR FT ATTEHYSKRDWALRSTYVPGTSSVQSVPLVDPQKIFLPPLHIKLGLMKNFV FT KAMGKVNSQGFEYLAKKFPKVSAAKLKEGIFVGPQIREVLLDIEFERALSP FT LELKAWLAFKWICANFLGNNKSHAYKDGVEKLIDAYKEMDCRMSLKMHFLH FT SHLDFFPENLGAVSDEQGERFHQDIKAMEVRYQGFWNESMMADHCWMLYRE FT IPDQKFKRKAYSQRF" XX SQ Sequence 2407 BP; 755 A; 440 C; 480 G; 732 T; 0 other; cactggtcaa caaagaattt tcatcagaga taattttgag ttgcctggat atcatttttg 60 acgctgattc caaaactgca atcagatttt ctctagcacg tatagttttt atgatgaaag 120 catatatgta aatacacaga tttaacaatt ttatataggc aaagtactac ctttttcaac 180 attttattgg gtttgtctta tgtatttcgt agttttacgt gttcatttgt ttgtttcaga 240 catcttatcg tacgtctggc aacatgcaca attgatacct gccatcgctg agaacagcga 300 gctacattct gccagtggct actgtcgtgt gaagtgttgt gctggtgtta attcgtttca 360 aatcatttgc ttggatcaat cagcatgcct cgagaatgta agaatcaccc agacaagttt 420 tgttatgtat gtggcaattt cacaactaaa ctacaacgac gaactattac aaccaacctt 480 aagaaaatat acaatcttta ctttggttgc cacctaggtg accaagataa accatgggcg 540 ccacatcaaa tctgttctgc ttgttcaaac ggattacgcg attgggtgaa caaaaagaaa 600 gcttcaatgc cgtttgccat acccatgata tggagggagc cgagagatca tcatgaagat 660 tgttattttt gtagtgtaaa catgacaggg ttttctacaa agaacaaaaa caagattgtt 720 tatccagtca tggattcagc acgtagacca gtgcaacatg gtgaagaatt gcccattcca 780 attcccccag atgatggtgt agattctatc gaagatgacg cggatggtga tgaaggtgca 840 actggtggcg ttcccggacc gtctgcagat ccagattaca cattggaagg tagaaatttc 900 gagccaaagc tgttaaccca aggacaattg aatgatcttg ttagagacct atcactgtct 960 aaggagaaag cagagcttct cgcctcaaga ctacaggaga acaatttact tgacaaaaat 1020 gttcgtatca gttactaccg aaagcgaaat attaatttag caacatgctt tacagtagat 1080 ggtcctctct gttattgcca tgatatacat ggactattca cagaattgtc tcagacgtat 1140 gcggcatctg actggcgtct cttcattgat tcatcgcata gaagcctgaa agcagtactt 1200 ttgcataatg gaaatgtcaa gccatccatc cctattgccc attctgtcca tctaaaagag 1260 acatatgaca atatggatat actacttaag gcaatcaaat acggcagtca tcaatggaac 1320 atctgcggag atcttaaagt tttaggtatg ttaatgggta tgcaagcagg atttacaaaa 1380 tattgttgtt tcctttgcct gtgggacagt cgtgccacaa cggaacacta cagcaaacgt 1440 gactgggcac taaggtctac atatgtccca ggaacaagca gtgtccagtc tgtcccttta 1500 gttgatcccc aaaagatctt cctaccacct ctccacatta aacttggttt aatgaagaat 1560 tttgttaaag caatgggtaa agtcaattca caagggttcg aataccttgc taagaaattt 1620 ccgaaagtta gtgcagcaaa attgaaggag ggaatatttg ttgggccaca aatcagggag 1680 gttttattgg atattgaatt tgaacgggcc ctcagtccac tggaattgaa agcttggctg 1740 gctttcaaat ggatatgcgc aaactttctg gggaataaca agtcccatgc atataaggat 1800 ggagttgaga aactcatcga tgcttataaa gaaatggatt gtcgaatgtc tttgaaaatg 1860 cacttcctgc attcccattt agatttcttt cccgaaaacc ttggtgcagt gagtgacgag 1920 caaggggaac gctttcacca agacattaaa gctatggaag ttcgttacca aggattttgg 1980 aacgaaagca tgatggctga ccattgttgg atgttgtatc gtgaaattcc agaccaaaag 2040 tttaaaagaa aagcgtactc tcaacgcttt taagatcttg taatgtgtat attcttattt 2100 tacaagtatg ttcaataata tctttaatgt aaattagttg cttgcacttc attgcgttgt 2160 attaatttag tgtatgatgt atattttact tcttacaaat aaatgatttg cataaaatgt 2220 gatttattac accctagata ttcataaaat tacgtatatt ttcataaaat acaaaaaagt 2280 aaataatcga aaaactttac gtgctagaac aaaactgtaa ccagatttgg attgagcatc 2340 agaaatacat tcagaaaagc ttatttttgt ctgtgatgag aaaaaagttt cattttgttg 2400 accagtg 2407 // ID Gypsy-5_PPc-I repbase; DNA; INV; 4682 BP. XX AC chrUn; XX DT 06-JUL-2010 (Rel. 15.07, Created) DT 06-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_PPc_; KW Gypsy-5_PPc-LTR; Gypsy-5_PPc-I. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-4682 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1002-1002 (2010). XX DR Genome; chrUn; Positions 154091304 154086623. XX CC Positions [3409-3870] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 595..4590 FT /product="Gypsy-5_PPc-I_1p" FT /translation="MTLDQVKKKMLTLFGDNTSIFDRRRKMLDLKMSKENI FT DDVRVLAARVNLTVENAQVSEATIDEWKVLTFLHSLDLSRYSDIHMKMMQT FT AKHKGKECTLDDLLSDYNDLSQLKKDSRNITDSRREVNYVNERGGKNNPQK FT KSENTRPLQGHNQPCSSCGKKGHSRPKCYHRNSEWTNCGRKGHIAVACRSK FT GKNAHCVSVDTVATTDYHIPLKINGQRASMKIDTGADITIVSEMTWKAIGE FT PKCSSADCTATCANGNTLQLQGKFRAKAEYGGVQAEDYMYVTHKNINLIGK FT NFIEMLNLVEIREPGPTINEVTTPPSASAEYTEWVKTAFPEVTASGLGRCT FT EMSASLQLKPDAKPVFVRARPVPYALTERVETELDRLEKSGVIEKVEYSTW FT AAPILTVSKPNGSIRMCADFSTGLNAAIDLPAHPLPVPEDIFASLNGASIF FT SQIDLAEAYLQVPLDEEAQKLLVINTHKGLFRYKRLPFGVKAAPGMFQRLM FT DTMLSGIKHAVPYLDDIIIGGRTKKEHDETLIQVMLKLKKFGLRTRAEKCS FT FGMKEVSFLGFIINKDGRHTDPKKTEAIRTMPEPENAVMLRSFLGMANYYG FT QFINGMHKLRSPLDHLLKDNVKWKWTKECAKAFSDIKEILNNQLSLVHYDP FT KKEIVVAAGACEDGIGAVILHRFPDGSLRAISHASRKLKPAERNYGQIEKE FT ALALIFAVGKFHKYVFGRRFTLQTDHKPLLSIFGSPKEVPAYTAKRIYRWA FT ETLLMYDFHIEYINTDSFFYADALSRLISECQSEQEVNIALVQTEIDVQDT FT FSTAIRRMPVTARIIQSETEKDTLLQEVIKKHLRGWSKKDAKVESLLPFYT FT RRLDLTLMKGCLLYGNRVIVPKTLRQRLIKDLHSEHPGIVRMKSLARSICF FT WPGLDQEIEHTVLKCDRCSKAAKAPVKVPLQPWPTAERPWERIHVDYAGPI FT RGEYYLVIVDAYSKWPEVYCTQKITASVTVDFMKDAISRYGIPEVIVSDNG FT TQFTSELFNQMCLSYGMKHITIAPYHPQSNGQAERFVDTLKRSLKKMNGEA FT PNKEIIRHFLMTYRRIPNPNVPEGKSPAQVFIGRSIRSKIDLIRPTKRSDK FT VDERMKDQFDKRNGARDRWYSVGDHVYYRAPDGPNRTKWLPAIITAKKGKV FT MFEIEINGKKQRAHANQLRKNAGMTTLTDESDTEVPLQLLLDTFNLDRPVQ FT DNEDPVMINQRDEFRNEPMENLMDPREDMPLRFDPVDEFHVELNLNDENDR FT VSVSTASSRFASAQSSPLPSPVKQPTASPVKQHVASPVKQTVNTRPRREHK FT PIVRLDPDPSKKTYTAETKQ" XX SQ Sequence 4682 BP; 1424 A; 1115 C; 1105 G; 1038 T; 0 other; tctggcgttc aggactattc gaactcaaag taagtcattt tcgaatattt tcgaaaacct 60 tttctttgaa tttcttgtga ggtagctttg ttctactgac tcactgtttt ccttgtgagg 120 tagctttgat ctacttactc acttttatcc ttgtgaggta gccaagttct acttactcac 180 tgaacaattt catcttttgc accggaggtg ctgccaaaat ttttgatttt gaatcaaaat 240 aagtagaatt tctgagtgca atgggtgagg aaaacgaaag tatggtagag gaaatccacg 300 acctgaaggc actggtagcc accatggccc agttgctcaa gaatcagaat cagaaccaga 360 aacccgataa ctcgacggaa atgtcgagcc tctctctcag cgctgtacga attcacgtac 420 tgtccagaag aaggaagtac attcgagaga tggtggggca gacatgagga tatattcctc 480 atcgatctga aggactggga agatctcaag aagatcagac ttctgatccg gcatgtatca 540 acaacagttg aacgaacttt cactgaatcg attgctccga ccaaatgggc tgacatgaca 600 cttgatcaag tcaagaagaa gatgttgact ctatttggag acaatacgtc catctttgat 660 agaagacgga agatgttgga tctgaaaatg agcaaggaaa acatcgatga tgtacgggta 720 ctcgctgcaa gagtgaatct gaccgtggag aacgctcaag tgagcgaagc aacgatcgat 780 gagtggaagg tcctgacgtt cctacactca ctggatctct cccgatactc ggatattcat 840 atgaagatga tgcagacagc caaacacaaa ggcaaagagt gtacactgga tgatctgtta 900 tctgactaca atgatctgtc acagctcaag aaggattcac gaaacatcac tgattctcgt 960 cgagaagtga attacgtgaa cgagagagga ggaaagaaca atcctcagaa gaaatctgag 1020 aatacgagac ccttacaagg acacaatcag ccatgttcta gttgtggaaa gaagggacac 1080 tcacgaccaa agtgctacca cagaaattct gagtggacta attgtggccg gaaaggtcac 1140 attgcagtag catgtcgctc aaagggcaag aatgcacact gcgtgagtgt ggatacggta 1200 gctacaactg actaccacat ccctctgaag atcaacggac aacgagcgtc catgaagatt 1260 gataccggcg cggacataac gatcgtctcc gagatgacgt ggaaagcgat cggtgagccc 1320 aagtgctcaa gtgctgactg tacagcgacg tgtgctaatg gaaacacact acaattacaa 1380 gggaagttca gagcgaaggc cgagtacggt ggtgtgcagg ctgaggacta catgtatgtc 1440 actcacaaga atatcaacct aatcggaaag aacttcatcg aaatgctgaa tctcgttgaa 1500 attcgagagc ccggcccgac tatcaatgag gtcacgacac ctccgtcggc cagtgcagag 1560 tacactgaat gggtgaaaac agcattcccc gaggtaaccg caagcggtct cggacgctgc 1620 acagagatgt cagcatctct tcagctgaag cctgacgcca agcccgtctt cgtcagagct 1680 cgaccagtac catacgctct cacagaacga gtcgagacgg aactcgatcg tctggagaag 1740 agtggagtga ttgagaaggt ggagtacagt acatgggcag cccccattct gactgtgagc 1800 aaaccgaatg gatcgatcag aatgtgtgca gatttcagca caggactgaa tgcagcaatc 1860 gatctaccag ctcatcctct tcctgtaccg gaagacatct tcgcatctct gaatggtgcc 1920 tcgatcttct cacagatcga tctcgctgaa gcttatctgc aggtacctct cgatgaggaa 1980 gcccagaagc tactggtcat taacactcac aagggactat tccgatacaa gagattaccc 2040 ttcggagtga aggctgcccc cggcatgttt caaagactaa tggatacaat gctgagtggc 2100 atcaagcatg cagttcccta tctcgatgac atcatcatcg gaggtcgaac gaagaaggag 2160 cacgatgaga cactgatcca agtaatgctc aaactgaaga agttcggact aagaactcga 2220 gctgaaaaat gctcattcgg aatgaaggaa gtcagctttc tcggcttcat catcaacaag 2280 gacggtagac ataccgatcc aaagaagaca gaagcaatca gaacgatgcc ggaaccggag 2340 aatgcagtaa tgctccgaag tttcctcgga atggccaact actatggtca attcatcaac 2400 ggcatgcaca aactgagatc tccactcgat cacttactga aggataatgt gaagtggaag 2460 tggaccaaag aatgcgcaaa agcattctcg gatatcaaag agatactgaa taatcagctc 2520 agtctcgttc actacgatcc caagaaggag atcgtggtcg cagcaggtgc atgcgaggac 2580 ggtatcggtg cagtcatcct gcacagattc cctgatggaa gtctgcgtgc aatcagtcat 2640 gcttccagaa agctgaaacc agcagagaga aactacggac agatcgagaa agaagcactc 2700 gctctgatat tcgctgtggg caagttccac aagtatgtat tcggacgtcg gttcacactg 2760 caaaccgatc acaaaccact gttatccatc tttggctcac ccaaagaagt gccagcctat 2820 actgcaaagc ggatatacag atgggccgaa actctgctga tgtacgattt ccacatcgaa 2880 tacatcaata ctgattcatt cttctatgct gatgctctgt caagattgat aagtgaatgt 2940 cagtcggaac aagaagtgaa tatcgcactc gtacagactg aaattgatgt ccaggacaca 3000 ttcagtactg ccattcgtcg aatgcctgtc acggcacgaa tcatacaatc agaaactgag 3060 aaggatactc ttcttcaaga agtgatcaag aagcatctca gaggatggtc gaagaaggac 3120 gcgaaagtgg agtccctcct tccattctac acaagaagac tcgatctgac tctcatgaag 3180 ggctgtttgc tctatggtaa tcgagtgatt gtgcccaaga cactgcgtca aaggctaatc 3240 aaggacttac attccgaaca tccaggaatt gtaagaatga aatcactcgc aagaagcatt 3300 tgcttctggc ccggtctcga tcaagaaatc gagcatactg tactcaaatg cgatcgttgc 3360 tccaaagcag cgaaagctcc ggtcaaagta cctctacaac catggccaac ggctgagaga 3420 ccgtgggaac gcatccacgt cgattatgcc ggcccgatca gaggcgaata ctatctcgtg 3480 atagtcgatg catacagtaa atggccagaa gtctattgca ctcagaagat cacagcatcc 3540 gttactgttg atttcatgaa agatgccatc tcgcgatatg gtatcccaga agtgattgtg 3600 tccgataacg gcacacagtt cacttccgag ctgttcaatc aaatgtgtct ctcttacggc 3660 atgaaacaca taacaatcgc tccgtatcat cctcaatcca atggacaagc agagagattc 3720 gtcgatactc tcaagagaag tttgaagaag atgaatggag aagctccaaa caaggagata 3780 attcgtcact ttctgatgac gtacagacga attcccaacc cgaatgtacc cgaaggcaag 3840 agccccgctc aagtctttat cggaagatca attcgatcaa agatcgacct catccgtcct 3900 accaagagat ctgataaggt ggacgagaga atgaaagatc aatttgataa gagaaatggc 3960 gccagagaca gatggtactc tgtgggcgac catgtgtact atcgagcacc agatggtccg 4020 aatcgaacta aatggcttcc agcgataata actgcaaaga agggaaaagt catgttcgaa 4080 atcgagatca atggaaagaa acagagagca cacgccaacc agcttcgaaa gaatgccggc 4140 atgactactc taactgatga gagtgacact gaagtccctc tacaactgtt gctcgacaca 4200 ttcaatctgg acagacccgt tcaagataac gaggatcctg ttatgatcaa tcagagagat 4260 gagttcagaa acgaacccat ggagaatctg atggacccac gagaggacat gcctctaagg 4320 tttgatcctg tggacgagtt ccatgttgaa ctcaatctga acgatgagaa cgacagagta 4380 tctgtctcta ctgcctcgtc gagattcgct tccgcccaat cgtcaccact gccgagcccc 4440 gtcaagcagc ctacagcgag ccccgtcaag cagcacgtcg cgagccccgt caagcagacc 4500 gtcaacacga gaccaaggcg cgaacacaag cccatcgtca gactcgatcc cgatccatcc 4560 aagaagacgt acactgcgga gacgaagcag taatcccgtc cccccgtgct acgccctcac 4620 ctcacctctc tcccttttct ttctgtattt taagatcggc attgccaatc ttgaagggga 4680 gg 4682 // ID Gypsy-185_AA-LTR repbase; DNA; INV; 272 BP. XX AC supercont1.145; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-185_AA_; KW Gypsy-185_AA-I; Gypsy-185_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.145; Positions 1013695 1013966. XX SQ Sequence 272 BP; 116 A; 24 C; 57 G; 75 T; 0 other; tgttgcatac aagttcaata caatgttcgg taaaatagga agataccgat ttcgaatata 60 ggaagtaaaa taaggaattg tgcgagagag atgaaagaga gtaaaagaga taggtttaac 120 gaaaagagaa tatagaagtg taatgatgaa caggagaaat aaatctagtc agaaagtgtt 180 tgagatcgaa ctgaacaagt cgtgttttat tctagagaag aaaagttaat atctttaaat 240 tatattataa aaacccttta caatattttg ca 272 // ID Gypsy-6_CQ-LTR repbase; DNA; INV; 508 BP. XX AC AAWU01000668; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_CQ_; KW Gypsy-6_CQ-I; Gypsy-6_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-508 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 392-392 (2011). XX DR GenBank; AAWU01000668; Positions 30715 30208. XX SQ Sequence 508 BP; 123 A; 105 C; 117 G; 163 T; 0 other; tgtagcgacc ctaagtcgcc tacctactat acatagacta cagttattgt ttgtcatgaa 60 caatgtcaag ttcacgacaa cggtaaatgg tctagcacac gcggtgtgtt atacaaaaca 120 ttcaggtagc ggtaacgctt gacacctcat gttttgtata acatttttcg accacgacct 180 agggcactgg tagatcaccg ttcagtgatc ttagtattta acggcctgag cttgttcagt 240 cgcgtgcgtc ttgtgttgtt cattcaagag gctgtaagtc gggggcctct tttggtattg 300 tgcagtcatt agtcagtgtt ccttatgctt gtgctctgtt gttagttgtg cagccaatag 360 tcttgggctt gggtttgtgc ttgtgcttgt gtcagtcagt actcatgtca atgtaataaa 420 taaaagcagt taaaatcaaa gtgcaagtgt tatcttaagc cagtgcataa tatcacagtt 480 atcccgctcg gacactcggt ccgataca 508 // ID Helitron-3N1_NVi repbase; DNA; INV; 4960 BP. XX AC . XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE Helitron DNA transposon - consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW Helitron-3_NVi; Helitron-3N1_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4960 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Nasonia vitripennis."; RL Repbase Reports 9(5), 964-964 (2009). XX DR [1] (Consensus) XX CC The ~1-kb homology at the 3'-end with Helitron-3_NVi suggest CC this element is a nonautonomous helitron. XX SQ Sequence 4960 BP; 1702 A; 891 C; 709 G; 1654 T; 4 other; ctagcaaaaa acgctcgctt cgctcgcgat taacaaatta attgtgatga tatttaaggg 60 gatgtcgtcc caaaaaacac tgtttttcta ttacataaaa ctttttgaat aaagtttgct 120 aatgaaaaac cgagtctgta gcgccgtagt gttaaggtag ttcacttggc ccagtggtgc 180 tcaattcgag gttgaattag aaaattaatt tttttaaata ttatgcagtg ttatcaagtg 240 tgaaaactct agaattttta gttttgtata tcaaaatatt gatagaaagt gtttagagca 300 tcaaatatcc tcaaaactta gaaaagtgag aaacactgcg cgcatatata gatatactga 360 tagcatagaa ttgaaacttt cgattgctgt aacttttcaa aaagcaccgc ttgccctatc 420 aaatttttat ttccttgatt tgtattatga ttttacatca tgacctagtt tttgaaaaag 480 atgtaagatt tattttcttc gtaattaaac aaaatgtaca ctaaatttag acttaaaaaa 540 tcataccgct tggrgggtaa aaaattttac cacgttatag agggcactta ataggatata 600 tttacaaaat tgcaagaata ttaatttttt gagtgaacta ccttaagttt aatgaatcgt 660 tctaccgaaa gaaacttcgt cgtactcatg cgattgcagc gccaacctta aaatctaatt 720 ttacataaaa ctttttaaat aaagcttgca aatgaaaaac cgagtctgta gcgccgtagt 780 gttaagttta atgaatcgtt ctaccgaaag aaaaattcga tcggactttc tgtttttcta 840 gaagtttgaa aatacaaaat actccgtggt ggatttgcga agcttttgcg ccgacatccc 900 cttaattatt tgacgtgtca ttattaaata tactgttaaa tattagatgt acactcgatg 960 gtagttaatt cttgtggaac gtttttgaaa tgctccaata agattgccag attgataaat 1020 tctaaccaat caaaagcacg acttcggaaa cgtttcataa gaattaatta ccgtcgagta 1080 taaatgaata tattcaatac agccaacatt ctataaaagt agttatagct actagtacac 1140 aggaccggct ttaaaattat agaataaaat agaaataaat tttgaatatt aaaaaatgtt 1200 tttaaactct ttgtaattat ttgatgacag tattaattat attaacaaaa tctaacaaaa 1260 aaataatatt aattaaacat tttttttatt tacatttata aaataatgta aattgtgaga 1320 ttataacaat cataacattt atattctgta ctaaaatgta atacataata cgatatttac 1380 ttttatatta aatgataacg atatcttcaa tccctaacga aaaagaaaaa tgaaaacaat 1440 tgaaaaaaaa cgttatgaac tgcagatgat agcagaaaaa tgtcggaatt gagacacaga 1500 taacagtaat cttttgttgt cttcagtttt atttagtttg ctagatactt cagttggatt 1560 tctttttttt tcaggtgtaa aattttacaa tttagaaaat aaataaaaaa atgaatacaa 1620 cagacaataa ctattaataa gtgaagacaa cagttttttc atagttcaaa gcagttttct 1680 tcattttata attgctgtct tctacgttca ctagtttgtt cccaatcaaa acagctatct 1740 tcggatttct acacattttt cttaactttt ttaaaaacac gcctttaata acataaaaaa 1800 gaaaattaac ttttaaaaaa gtaaaaggtt aggcgagttt gatatattcc gcaacgaaat 1860 aacagataaa aaagaactga gatcaatcaa ccaaagtcaa gtttattata tttaatcata 1920 atgaagcaac aacatttttt ttggtcaata aagggcttac ttaacttaaa gttaaattat 1980 atactgatca attaggaaga attctcaaga tacttactac gtaacttgcc atgaccgttt 2040 cattgtactt atggtgtttt tatgacgtta taataaaata cgtaaagaga aaattttagt 2100 tgttcaccaa taaacaaaat gaaacggtca tggcaagtta cgtagtaagt atcttgagag 2160 ttttgaagtg aatacttcta taggctcgcc gggtacacat tttactgggg agtgaatcgg 2220 cgagcggcga ccgtcacggc ggcgcgacag accgatcgtc acgaaagcag cggcagctct 2280 ttccccccac cacgcactct tcgccgcgca tgttagcatg ctcgcacagc tcagctcttc 2340 ccttacatat cagccgcact tatttacaca cgctgcaagc ggctatcgga ttttctcgga 2400 tttcggacgc gctaatatag cggtaccttt tcatggtact tcaatttttt tgcgtcaact 2460 agacgacgtg acttttttaa tatgcgcaaa acgtttgcag gtaaacatta ttgacttatt 2520 gcggtaaaca ttatcactgt atgacacgcc ctggcgtaaa cctcgattta ccgcacgctc 2580 gtgcaaaaat gcactctcta aatctcggat tacgcacgtt gtgtcataaa atatcgtacg 2640 atactattat tcttattata ttttctttat tcgtattata ttttgcctat cctaaattta 2700 ctagttgatc aatgattact tcacctaaga ttttttaact ctctgtcaaa gttgcgatta 2760 agttggaaaa ataagaaaat ggcggcgatt tacgcatcaa aatcactaaa tcaaaaaatt 2820 ataaaaattt atcaaaaaag tcacaattaa atgtatcctt gtagattagt actttgcact 2880 caacgaagac gcatttgaat ttgcgtcgaa gactaattca tttcagtttt gccgcgaaga 2940 ctatagtata tatattttga actcttgcgt cgaagactat aaagtacatt tgcagtcccg 3000 ctgcgaagac tagtacattt ctcacttcgt atacgtttga ttctctggca acgaaactat 3060 tcaaaatttt ccaaaattcc acaccatcta tttaagtatt cacttctatc ggctgtcccc 3120 gagacagcca catttttcta attgatcagt atataactta acttaagtta agtaagccct 3180 ttattgacca aaaaaatgtt gttgcttcat tatgattaaa tataataaac tggactttgg 3240 ttgattgatc tcagttcttt tttatctgtt atttcgttgc ggaatatatc aaactcgcct 3300 aaccttttac ttttttaaaa gttaattttt tgagatttta gtccttttta gtaaattttt 3360 ttaggtattt catatttctc ctaaaaataa aatataagca tctttagcac cttttgacca 3420 tggttttatt gtaacatttt tatcactaac atgaaatcac tgattgttct gatacactac 3480 cctcgaatga tttctacacc tagacaccaa aaccaagtgc gaactaccta gacctcgtca 3540 tcggccaccc tgtacaccaa gacgccgtcc acgtatgatg tatgtaccta gaaataaaaa 3600 aaccacggag cgcaccacct agacctcgtc atcggtcacc ctgtacacct agacaccaty 3660 ctcgaattat ttttatayct agacacaaaa aaccacgaag cgtaccacct agacctcgtc 3720 atcggccacc ctgtacacct agacaccgtc ctcgaatgat ttatttacct agacaccaaa 3780 aaccacggag cgcaccacct agacctcgtc atcagccacc ctgtacaccg agacaccgcc 3840 ctcgaatgat gtatttacct agacataaaa aattacggag cgcaccacct agacctcgtc 3900 atcggccacc ccgtacacct agacaccgtc ttcgaatatc catttatcgt ctccatttat 3960 ctcgaccaaa attgcagatt tcatcttgtg atatgcatgt gcgtcatttc atattcacaa 4020 acacatagaa ttaaaaaaag attatacctg cacaaataac tatttgcagc atccctttga 4080 tctcgaccaa aactgcggat tttatcttgt gatatgaatg aatacttaca gcattcgtca 4140 ttatatgtca cgaacacaaa actttaaaaa aacgattata tctgcacgag taattatttg 4200 tagcatccct tttatgtaaa tcgaatctgc agataacatt ttgtgatatg catgaataca 4260 taaatcgtca tttcatattc acaaacaaat aactttaaaa aaaattatat ctgcacaaat 4320 atttaatcgc agcatccctt ttatctggac caaatttact gattacatct tatgatatgc 4380 acgaatacat aaagcattcg tcatctcata ttcacaaata gatggcttta aaaaaaacag 4440 catatctgca caaatattta atcgcagcat ccctttcatc tggaccaatt ttaaaataat 4500 ctttattcat ccatgtatat atatatatat atatatatat atatatatat atatgtcaaa 4560 aatataaaaa aaaattaatt aattgcgtct atttttttta attgttttaa cagcttccaa 4620 tctacctgga ccaaatcttg aaaatgtttt ttagtaggaa tgtatatttt tatcatatgt 4680 acctgtctat accaataaat agttttttta tttataaaat acatcaagtt ttgtggaata 4740 aatgcaaaaa aaaatccgga cggacagacg gacatttttt aagttctttt actgctaaaa 4800 actaattgaa atttgataca tttrattttt ttttcggtaa cttgaatgat ttaagttcaa 4860 agaacatttc taccgtaaaa tagatttaat tttttttttt acgagttttg gagggttttt 4920 acaaaattga ccaaatataa aattttaata tagtgtagat 4960 // ID DNA-TA-1_AAe repbase; DNA; INV; 4074 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW DNA-TA-1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4074 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1270-1270 (2011). XX DR [2] (Consensus) XX CC ~95% identical to consensus. TSDs are usually TA. TIRs are CC ~1200 CC bp long. XX SQ Sequence 4074 BP; 1254 A; 820 C; 820 G; 1178 T; 2 other; cactacccgt cataaatacg gactcactca aaaaagtatt gcaacacaca gctgaataat 60 gaatcaccct gaaaagttta gcatcacctc aaaacaggtt aattcacatt gctatatctc 120 gagatcctta cgacttacaa agaagcgatc ttcagcaaag ttgttcaggg gatcaaggac 180 atccggaaag cgaacagttt agttcgcgat tttgccgcta ggtggcgcta gtgagcattt 240 aaaattgagc attttgaact agttctagct tgtgatccat aagagataga aagttcgggt 300 cttcggcaaa gttgctcagg ggttcaagga catccggaaa gcgaacagtt tagttcgcga 360 ttttgccgct aggtggcgct agtgagcatg taaaattgag catttcgaac tagttctagc 420 ttgtgatcca taagagatag aaagttcggg tcttcggcaa agttgctcag gggttcaagg 480 acatccggaa agggawcagt ttagttcgcg attttgccgc taggtggcgc tagtgagcat 540 gtaaaattga gcattttgaa ctagttctag cttgtgatcc ataagagata gaaagttcgg 600 gtcttcggca aagttgctca ggggttcaag gacatccgga aagcgaacag tttagttcgc 660 gattttgccg ctaggtggcg ctagtgagca tgtaaaattg agcatttcaa actagttcta 720 gcttgtgatc cataagagat agaaagttcg ggtcttcggc aaagttgctc aggggttcaa 780 ggacatccgg aaagcgaaca gtttagttcg cgattttgcc gctaggtggc gctagtgaga 840 atttcacatt gctatatctc gagatcctta cgacttacaa agaagcgatc ttcagcaaag 900 ttgttcaggg gatcaaggac atccggaaag cgaacagttt agttcgcgat tttgccgcta 960 ggtggcgcta gtgagcatga ctcaaaactg agcatttcga actagttcta gcttgtgatc 1020 cataagagat agaaagttcg ggtcttcggc aaagttgctc aggggatcaa ggacatccgg 1080 aaagcgaaca gtttagttcg cgattttgcc gctaggtggc gctagtgagc atgtaaaatt 1140 gagcatttcg aactagttct agcttgtgat ccataagaga tagaaagttc gggtcttcgg 1200 caaagttgct caggggttca aggacatccg gaaagcgaac agtttagttc gcgattttgc 1260 cgctaggtgg cgctagtgag catgcaaaat tgagcatttc gaactagttc tagcttgtga 1320 tccataagag atagaaagtt cgggtcttcg gcaaagttgc tcaggggttc aaggacatcc 1380 ggaaagcaaa cagtttagtt ckcgattttg ccgctaggtg gcgcaagtga gcatgtaaaa 1440 ttgagcattt caaactaggc actttcagac ctgcagcact tgaatataaa tgcgaatcaa 1500 acctgaatca cttgaatatg gtgcgaatcc gacatgaatc acttggatat gaagcgaatc 1560 aaacataaat cactggaaat aaatcctaaa tctcttgaat atgaaatcaa tcagacctga 1620 atcacttgta cataaagtga atcaaacttg aaacatttga aaatgaagtg aatcacttga 1680 aatcatttga acatcaatga atcaaaccta aatcagtcga atatgagtga atcagataag 1740 aatcactagg atatggaaca aactgaatca cttgaaaatg aaatgaatca attctgaata 1800 acttaataat aaagtgaatc tgacgtgaaa cacttgcggg tatggagaga actaaacttg 1860 aatctattga atataaaatg aatcaatcct gagtcacttg aacatggagt gaatttgaca 1920 tgatttttgt ttactccata ttcaagtcaa acctgaattt cttgaatata cagtcgactc 1980 tccacaactt gatattctat aactcgatat actctataac tcgatcgatt tttcggtccc 2040 ctcaaattcc catatatcgt gctctccata agtcgatatt tctataactc gatatctcaa 2100 ttagtcgatg tcacgtgaga ggaaaatttc cctccataac tcgatatcca ttttaaaatc 2160 cttcttatac tgggaaactt ggacttattc tggaagtgaa taaatacttg caagttgtaa 2220 taaatgttta aaacattaaa ataaaaaaaa aattacgtaa ggaaaaacat gggacatttg 2280 tttgtaaaat aatatcaaca aaatattatt gctttcttac agaagtaatc atatatgtat 2340 tggaaatact gaaatttgtt tcttcgattt tgggattttt tgcgtcggta tattctataa 2400 ctcgataatt ctttaagtcg atggtccctt gaatatcgag ttatggagag tcgactgtag 2460 agtgaatcaa acccgaaaga catgaatata aagagaataa gacatcaatc acttggatat 2520 gaagtcaatc aaacttgaac catgaagaga ttctgacttg aataatttga atgtgatgaa 2580 tatgagtcct gactgagtca ctcgcacatg gagtaagtca taccagaatc actagcatat 2640 gaatcagata tgatacactt gaatatacat ttgatcagac ctgaatcact ttaatttaat 2700 atgattccat ccctaatcac ttcaacttcc aaatgtcaaa gaatatctga acagcattct 2760 ggaaattcta tcttatcaaa ttcataatgt taaggtacag tcatccctcc agagtaaact 2820 gtttcctgga tgcccttgac tttccgaaca actttgccga agattcaaac ttcctatctc 2880 atctggctcg aaaaatagaa tttgtttcat atatttaatt aaatatgcgt actagtgcaa 2940 cctagtggca aaattaggaa ccaaaaaatt tgatatttgg atgtccttta ctttatttct 3000 gttctttact taccgaagac tcgaactttc tatctcttgt ggatcgcaag ctagaactag 3060 atcaaaatgc tcagttttac atgctcacta gcgccaccta gcggcaaaat ctcgaaccaa 3120 actatctgtc ttccggatgt ccttgacttt ctgaacaact ttgccgaaga ctcgaacttt 3180 ctatctcatg tggatcacaa gctagaacta gttcgaaatg ctcaatttta catgcccact 3240 agcgccacct agcggcgaaa tctacaacct aactgtttgt cttccggatg tacttggcct 3300 tcttaacaac ttcgccgaag tctcgaactt tctatctctt atggatcaca gtctagaaat 3360 ggttcaaaat gctcaatttt acatgctcac tagcgccacc tagcggcaaa atcgcgaact 3420 aaactgttcg ctttccggat gtccttgaac ccctgagcaa ctttgccgaa gacccgaact 3480 ttctatctct tatggatcac aagctagaac tagttcgaaa tgctcagttt tacatgctca 3540 ctagcgccac ctagcggcaa aatcgcgaac taaactgatc gctttccgga tgtccttgac 3600 tccctgagca actttgccga agacccgaac tttctatctc ttatggatca caagctagaa 3660 ctagttcgaa atgctcaatt ttacatgctc actagcgcca cctaacggca aaatcgcgaa 3720 ctaaactgat cgctttccgg atgtccttga acccctgagc aactttgccg aagacccgaa 3780 ctttctatct cttatggatc acaagctaga actagttcga aatgctcaat tttacatgct 3840 cactagcgcc acctagcggc aaaatcgcga actaaactgt tcgctttccg gatgtccttg 3900 atcccctgaa caactttgct gaagatcgct tctttgcaag tcgtaaggat ctcgagatat 3960 agcaatgtga attaacctgt tttgaggtga tgctaaactt ttcgggggtg attcattatt 4020 cagctgagtg ttgcaatact tatttcagtg agtccgtatt tatgacgggt agtg 4074 // ID Penelope-9_HM repbase; DNA; INV; 2187 BP. XX AC . XX DT 13-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2187 RA Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 2099-2099 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(426..773,620..1423,1366..1755) FT /product="Penelope-9_HM_1p" FT /translation="MECLAMNTAFITIKDHKDDFANNTKCRLINPSKPELG FT KVSKILLENINNKVRKATLVNQWHNTDDVINWFQSILKTNKTVLSSSLILK FT SSXLLFLKNYFKIHLITPSNLQKFQKKHRNKLVPINIKDKQNCTFIQFDIE FT EFYPSISKELLQNSLNHAKQFTEISEKTLDIINHSRKSLLFTEHNSIWVKK FT LGDPNFDVTMGSYDGAEVCELVGLFILHTISKDYGLKTIGLYRDDGLCCFH FT NINGPQSERIKKNIVNLFKEKFNLKITIKTNLKIVNFLDVTFNLSENTFQP FT YRKPGDHPLYINVNSNHPPSIIKSIPTMISNRISNISSSKEVFDRATPFYN FT NALKXSGFKEKINFTPNIQKYKNQNXDQETLFGLTLRFHKMKPKSRSRNII FT WFNPPFSQNVKTNVAKTFLNLIEKRFPKTHKFHKIFNRNNLKVSYSCLPNI FT NNIITSHNKKILSNVSHENANQTCNCRQTTLCPLKGKCLTKNIIYICNVKT FT SPRRRRILLYWSNRXHF*" XX SQ Sequence 2187 BP; 872 A; 350 C; 278 G; 672 T; 15 other; cttaacagaa aaagtcgaat tagtnatcaa acgtatgaga tggaaagcct taatttgcga 60 gaaaaatctt aaccaagaaa cagattacca ctttaatttt aaaacaagaa aatgtccacc 120 gcaacatccg gatctagtaa attttgaaaa cgatctactt aaaatgatac aaaatatttc 180 atttactaat tcacaaccag tttcaaaaac agctaaataa tgacatttgc aaaattaaat 240 catcagaaaa cgtttttgtt tttgcggata aatcacataa tatttatgag cttgaaaaat 300 cttcttataa taaacttttt aatgaaaacg taacaagaac ttacaaaaaa tctgataatt 360 cgcaatataa taacattaat aatgcggcta aaataattgc taaaaactta ggtattgacg 420 acaggatgga atgtcttgca atgaataccg cgtttataac tataaaagat cacaaagatg 480 attttgccaa caatactaaa tgtcgattaa ttaatccatc taaaccagaa cttggaaagg 540 ttagtaaaat tttactggaa aacatcaaca ataaagtgag aaaagccacc cttgtaaayc 600 aatggcataa tacrgatgac gtaataaatt ggttccaatc aatattaaag acaaacaaaa 660 ctgtactttc atccagtttg atattgaaga gttctaycct tctatttcta aagaattact 720 tcaaaattca cttaatcacg ccaagcaatt tacagaaatt tcagaaaaaa cattagatat 780 tattaatcac tcaagaaaat ctttactgtt tactgaacac aatagtatct gggtcaaaaa 840 acttggtgat cctaattttg acgtcactat gggaagttat gatggggctg aagtatgtga 900 attagttggc ctctttattt tacatactat aagtaaagac tatgggttga aaaccatygg 960 attatataga gatgatggtc tttgctgttt tcataatatt aatggtccac aatccgagag 1020 aataaagaaa aacatcgtta atctttttaa agagaaattt aatctaaaaa ttactataaa 1080 aaccaactta aaaattgtaa actttcttga tgttaccttc aacctgtcag aaaatacttt 1140 tcagccatac agaaaaccag gtgaccaccc attatacatc aacgtaaatt ctaaccaccc 1200 wcctagtatt attaaatcca taccaactat gatatctaac cgtatcagca atatttcatc 1260 tagcaaagaa gtttttgata gagcaacacc attttataac aatgcgctaa aakctagtgg 1320 attcaaagaa aagattaatt tcactccaaa tatccagaaa tataaaaacc aaaatcamga 1380 tcaagaaaca ttatttggtt taaccctccg ttttcacaaa atgtaaaaac aaacgtagca 1440 aaaacttttt taaatctaat tgaaaagcgt tttcctaaaa cccataaatt tcacaaaatt 1500 tttaatagaa acaatttaaa agttagttat agctgtcttc caaacatcaa caatattatt 1560 acttcacaca ataaaaaaat attatctaat gtttctcacg aaaatgccaa tcaaacgtgt 1620 aattgtcggc aaactacctt atgtccactg aaaggaaagt gtcttacaaa aaatatwatt 1680 tacatctgca atgtaaaaac atcaccaaga agaagaagga tattattata ttggtctaac 1740 agagmacact tttaaagatc gctggtacaa gcataaaaac tcctttcaat acgaaagtaa 1800 agctaactcc acagaacttt ctaaatatat ctgggaattg aaaaataaag ggattttcaa 1860 atcctatttt tacgtggaaa attattgata gagcggaacc atttaaacct ggaggaaart 1920 tatgtaacct ttgtttaaca gaaaaatacc atatcattac atcacctttg aaactattaa 1980 ataaacgaaa tgaaytaata tcaaaatgtc gccacgaaaa caaatttgtt atatacaact 2040 ttaacgtcat ctaagctaac actgtattgt atttttgtat ataatttttg ttgttgttwt 2100 tattgtttgt aacgtctgat gatcgctatt gcgtgaaact ytkagttatg taatttaaat 2160 atatcttaat aattatttag ttttcac 2187 // ID Kiri-37_AAe repbase; DNA; INV; 4104 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-37_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4104 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 732-732 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >95% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 222..737 FT /product="Kiri-37_AAe_1p" FT /translation="MNPDFLSKIFRVLREGHEQLQKXIDRSMADLKRNLQA FT SSDTSAASSFDHSSVVFDQASGLADEERVESFQRSPHCGYEDCAVYISVDT FT RENLQRHRLCFSPSIPFTHQIVATKTRKPSRISHANRKHYHGYQTLTIGNT FT TVLWIKLSDRRLKDRWKPDRVNFICGRAVVEAI" FT CDS 1082..3934 FT /product="Kiri-37_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLDNGNVNSVRTDGVFSCIPKVVMRTALLSDKLNICH FT TNVQSLCARQLNKLDEFKICFENSNIDIICVTETWLTSNISDEMVAVEGYR FT VIRNDRSYARGGGISIYCKNDLRCRVIAASDLSDADEGDVTEYVFAEVSFN FT QSKFLLGVVYNPPRNDCSEIIEQKMAELSLHYQNVVLVGDFNIDMNVINAR FT TSRFQSVMDNFDMHLINREPTHFHCGGCSLIDLFITNIPDFVVKFNQVAAA FT GFSKHDMIFASLNISRSNVTSYRTFRNYAQIDYASLNDALYSIDWDHLYSI FT TDSDTALDFFNFHLTQLHNSFVPLKIHKPRNQNGWFNNEIQFAIIERDLAY FT RLWVSERSQYNHDQYKRLRNRANHKIKLAKIRYVSQTVNSSVSSKDLWRRL FT KQLNVTSSSKSAVQPINTCDEINDYFASNFSEASSTVTVFPENSHGLRFHE FT VTTFEVINAIDSITSNAVGFDEVPLKFVKMILPHIITQLTYIFNLIISTSK FT FPNAWKMAKVIPIPKKTGSADLSNLRPISILSSLSKAFEKIIKSQIQTFLD FT NNDLLYSRQSGFRSNHSTTTTLLSVHDDIHQEIDKNGTGFLLLLDFSKAFD FT RVSHSKLLQKLSNQFCFSRSAVSLIRSYLHNRKQAVAVDDKYSEFVNIVSG FT VPQGSVLGPILFSLFINDLPNVIKFCKMHIFADDVQIYLCSNRLSISEIAR FT LINLDLRNIYRWSERNLLPLNPEKSQVLFMTRSRNNTVSPPSISLNNVPMK FT YVDKISNLGVIFQSNLEWDSHINSQCRKVYNGLRHLKLTANFLPTATKLKL FT FKTLLLPHLVYGSEFLLNASSRSLGRLRVAVNCCIRWIFNLSSYSSVSHLQ FT PQLLGCSFSNFVKFRCFITLYKIINTAKPLYLHEKLQPFRSIRERNFRISR FT VRTSYYRNTLFVRGIVYWNQIPAQIKISRSINEFRRGCISWLNTSNQ" XX SQ Sequence 4104 BP; 1228 A; 785 C; 740 G; 1350 T; 1 other; gcttaaacga gcttggaaaa cataaatttt aatggttaat tctggtgttt ttgcgtaatc 60 taactggtac gtgactacag ttaggaacag cctcatgagt gaacgccttt tcgaacaata 120 gcaattgtac ctatacaaac aattggtact cagaactctc gccgacatta atcgttatta 180 atccgtgctt attacgagat cttcatcaaa cttttgtcga gatgaatcca gattttctgt 240 cgaaaatatt tcgagtgctt cgtgaagggc atgagcagtt gcagaaaasa attgacagat 300 caatggctga tttaaaaagg aatctgcagg catcgagtga tacatcagca gcttcatcgt 360 tcgatcattc atctgttgtc tttgatcaag caagcggact ggcagatgaa gagcgtgttg 420 aatcttttca aagaagtcca cactgtgggt atgaggattg tgcggtttac atttcggtgg 480 atactcgaga gaaccttcaa agacaccggc tgtgcttttc accttctatc ccattcactc 540 accaaatcgt cgcgacaaag acacgcaaac cgtcaagaat atctcatgca aaccgaaagc 600 attaccacgg ctaccaaaca ctcacgattg gtaacacaac tgttctatgg atcaagctgt 660 cggataggcg tctaaaggat cgctggaagc cggatcgtgt gaattttatt tgcggccggg 720 cagtcgtcga agcaatctga tctaccgaaa acaaacatcc taactcttct tttttttttt 780 ttttttgcta atccatgttt acaacccaac ttatccgttg ttcttatcct cgctttcccg 840 tgttatcttc cttctttaag cttaattgga gcaaactgct gatggctgtg tgctaatagt 900 tgatcattca atcgttctct ttgaatcaag tgcaattttg ttcatttgag ttaggctttt 960 ggtgcagaaa tgattttgta ttactcctta gtgtatcttt cacttctagt tactacaact 1020 ggttactttg tgttgattgg gatgcattct ttctcgagct cgtacataac atttttcatt 1080 gatgttggat aacgggaacg taaacagtgt acgcacggat ggagtattct cttgcattcc 1140 caaagtcgtt atgcgtactg ctctgctttc cgataaactt aatatatgcc ataccaatgt 1200 gcaaagcctt tgtgcacggc aacttaacaa gcttgatgaa ttcaaaatat gctttgaaaa 1260 cagtaatatt gacatcatat gtgtgacaga aacgtggtta acttcgaata tatcggatga 1320 aatggttgct gttgaaggtt acagggtaat tagaaatgat cggtcgtatg ctcgtggtgg 1380 aggcatatct atctactgta aaaacgattt gagatgccgt gttatagcag catctgatct 1440 atctgatgcc gatgaaggtg atgtcacaga atacgtgttt gcagaagttt cttttaacca 1500 aagcaaattt ttattaggtg ttgtttataa tccaccaaga aatgactgtt cagaaattat 1560 tgaacagaaa atggcagaac tctcccttca ttatcaaaac gtcgtgcttg tgggtgactt 1620 taatattgac atgaatgtta ttaatgcaag aactagtaga tttcaaagcg ttatggacaa 1680 ctttgacatg catttgatca acagagaacc aacccacttt cattgtggtg gctgctcatt 1740 gatagatctg tttattacaa atatccctga ctttgtcgtc aaattcaatc aagtagctgc 1800 agctggtttt tcaaaacacg atatgatttt cgcttcttta aacatctctc gttcaaatgt 1860 gacaagctac cggacattta gaaattatgc tcaaatagat tacgcatcgt tgaatgatgc 1920 tttgtattct attgattggg atcatttata ttccattacg gattccgata cagcgttaga 1980 ctttttcaat ttccatttga ctcagcttca taactctttt gttccactaa aaattcataa 2040 acccagaaac caaaatggtt ggtttaacaa tgaaatccaa ttcgccatca ttgaaagaga 2100 tttagcctat cgtttatggg tgtctgaaag aagtcaatat aatcacgatc agtacaagcg 2160 cttacgcaat cgagccaacc ataaaataaa attagctaaa ataagatatg tttcacaaac 2220 ggttaacagt tctgtctcca gcaaagacct ttggcgtaga ctcaagcaac tcaatgttac 2280 atcaagttct aagtcagcag tgcagcctat aaatacttgt gatgaaatca atgattattt 2340 tgctagtaat ttttcagaag cctcgtcaac tgttacagtt tttcctgaaa attctcatgg 2400 attacgcttt catgaagtaa caacatttga ggtcataaac gcaattgatt ctattacatc 2460 taatgccgtg ggtttcgatg aggttccatt gaaattcgtc aaaatgatcc ttccccatat 2520 aataacgcag ttaacgtata ttttcaatct gattattagt acatcgaagt ttcctaatgc 2580 ttggaaaatg gctaaggtta ttcctattcc gaaaaagact ggtagtgctg atttaagtaa 2640 tttacgaccg attagtattt tgagttcgtt gtccaaagct ttcgaaaaaa tcattaaatc 2700 tcagattcaa accttcctag ataacaatga cctactttac tcgcggcaat cgggttttcg 2760 cagtaaccat agcactacaa caactctact atctgttcat gatgacatac atcaagaaat 2820 cgacaaaaat ggaactgggt ttttgttgtt actagacttc tcgaaggcgt ttgatagagt 2880 atcccattct aagttgctac agaaattatc taaccaattt tgtttttctc gatcagcagt 2940 ttcactcatt cgttcatatc tccataatcg taaacaagca gttgctgttg acgataagta 3000 ttctgaattt gttaatattg tctcaggagt tccgcagggg tccgttttag gtccaatatt 3060 attttccctt tttataaatg accttccaaa tgtaataaaa ttctgtaaaa tgcatatttt 3120 tgcagatgac gtgcaaattt atttatgttc caacagattg tcaatatctg aaattgctcg 3180 cttgataaat ctagatttga ggaatattta ccggtggtct gagcgtaacc ttttaccctt 3240 gaatccagaa aaatcacaag ttctgttcat gactcgatct cgcaacaata cggtttcacc 3300 accctccata tccttgaaca atgttcctat gaagtacgtt gacaaaatat ccaacttggg 3360 tgtaattttc caaagcaact tagaatggga cagccatatc aattctcagt gtagaaaagt 3420 ctataacgga cttcgtcatc ttaaactgac agctaatttt ttaccaacag caactaaact 3480 taaattattc aaaaccttat tactgcctca tttggtgtat gggagcgagt ttttgcttaa 3540 cgcctcttct cggtctctcg gcagactccg cgttgctgtg aattgctgta ttagatggat 3600 attcaatttg tcgtcgtact ctagtgtttc acaccttcaa ccgcaattgc ttggatgctc 3660 tttttctaat tttgtaaaat ttcgttgttt cataacattg tacaaaatca tcaacacagc 3720 aaaacctcta tatttgcacg aaaaactcca accttttcgt agtattcgtg agaggaattt 3780 ccgtatatcg cgcgttagaa cttcatacta cagaaatact ttattcgtgc gtggcattgt 3840 ttattggaac cagattccgg ctcaaataaa aatttcgcgg tctatcaacg agtttcgcag 3900 ggggtgcatc tcatggttga acacaagtaa ccagtagttt aaattgtatc tagaactaat 3960 aacttgtttt tgaaatagtt gtaagttaag aaatgaattc gtcacataca tattgttttt 4020 tttttctgaa tgtagaatta aaaaaggtaa taaccttact ctacatgtat tggtatggaa 4080 ataaataaat aaataaataa ataa 4104 // ID CR1-55_BF repbase; DNA; INV; 558 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-55_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-55_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-558 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-558 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1626-1626 (2009). XX DR [2] (Consensus) XX SQ Sequence 558 BP; 166 A; 108 C; 118 G; 166 T; 0 other; tcgacagaaa tgtatgctga tgatacttca ctgtacaagg cagccaagtc agtcagtgct 60 attgtggcag ctttgcaacc tgatcttata gaattgtcta actgggttag tgtcaatcga 120 ctagttatga atgtactgaa aacaaaatgt atgctatttg gcactgctcg caagcttgct 180 ctgttacccg ataaatcttt gaaccttgta atcggctctg agaaagttga acaagtaatc 240 gaagcagtcc tattgggctt acatatggac ccgtcactaa cttggaatct tcataccaaa 300 tatcttgtaa caaaactctc aggaaactta gcctatgtta gaagatatgc ttgttatata 360 cctgatgatg tgtgtaaact ggtacttcaa gccctggtgc tatcagtagt acagtattgt 420 tcaccactgc tagccagcat gtcagattcg aatatgagaa aactgcagat agtacaaaat 480 agagcgtgta gactgttatt aaaatgtaca tcggacacat ctgttaggcg gatgcatctt 540 gagcttggtt ggcctaca 558 // ID Gypsy-2-I_DP repbase; DNA; INV; 5175 BP. XX AC . XX DT 17-MAR-2009 (Rel. 14.03, Created) DT 17-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; 4-bp TSD; Gypsy-2-I_DP. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5175 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Daphnia pulex."; RL Repbase Reports 9(3), 657-657 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 112..1155 FT /product="Gypsy-2-I_DP_1p" FT /translation="MAERGARAARRAAVAAVNPDRPEDVEHHHHQGPAQRI FT GGIAREENRVRRKEREPPLFRGEAHEDAVDWLSRYEEIAHYNEWNVDEQLH FT NFGIHLEGVARRWYLSLIPAPLTFQDLRARFLVAFKPPNYDLDLETKLRSR FT FQEEHEPVMTYCHDVIYLCSRVDRNMGEALKVQHLLRGLKRPLVQKVYPFL FT DPEVHTSQDFMRLVQIQCQADLLANQPPLVSSCTPPAVLMIPPQPPILPQA FT SPFVTEDRLKAFKKELATEFRCELSSTLSSMRKEISNDVRDVLRESRNPKP FT FTSKRSRDGQPICRQCHQPGHIARECDQRKCFKCNLPGHLAKSCPTPSSSV FT KPSN*" FT CDS 1092..4934 FT /product="Gypsy-2-I_DP_2p" FT /translation="PPRSPCKKLPYSFILSEAVKLAAVVERGNTTDSVFSI FT PQLDRTKLVLKNVLFSNRNVEAIVDTGSGVTVISPNLCTALKIPIRRWTGQ FT DVVLADGKRTRPKGMIDAEFTIDHRPVTVSALVFDINGYDLLLGNDTLRQL FT QSIQVDYFPDMASLTIGDPDFMCNNKDNYVFCNKSCMVPAHTMIVIPIKHA FT RSPKTGAEKLEMIEFSPKVMNDKGLSIGRFCYSSRAPPETVQLINFSDSPQ FT WIQEGVALGKIFEVQEASDTSAAKTETEALDFDRSINQDLPLVDRVKFKNL FT LGRYSYCFSAHDDDLGSSNLVQHRIDIGDHLPTHQAPYPSSWKQREIIGTQ FT VQRMRRAGIIEPSQSPFAAPVVLVRKPDGSWRFCVDYRKLNAITVKDVYPL FT PRIDDALSRLEGSKYFSIIDLQSGYWQVQMEPSDREKTAFITADGLYQFRV FT MPFGLTNAPSTFQRMMDVMLAGLKWNTCLVYLDDIVIFSDSISQHLQRLEV FT ILQRLAQANLKLKLNKCSFAATQLKILGYIVSGNGLSPDPSKVNAVQAFPT FT PTTVKNVQSFVGLCSYYRRFIRDFAVIARPLTELTRKNQIFRWTAEHETSF FT RSLQAALTSPPVLGHPDYRLPMEIHCDASDYGIGAVLVQQQTGGERVLAYA FT SRLLSHAECNYSITEKECLALVWSTQKFKVFIWGIKLKVVTDHHALCWLMR FT KRDLAGRLARWSLQLQDLDIEVVYRSGRLHTDADALSRHPIDPPEPEGEIP FT MLLISSATHSKPAVIKTLQLECDWCNPIIQGLKDEHPCHRVHRLIRHFVLK FT KDLLYHRIIRNGRAYLRLCLPPALQKQVLLACHDDVTAGHLGVTRTLAKIN FT QRFYWPKMIQSVSQYVRSCEDCQTKKKPKERPAGYLEPVQARLPFEKIGID FT LIGPFPLSTLGNRYVIIAVDYLTKWVIAKAIPTATSKEVVDFFVRRIVLQH FT GAPINVISDRGKCLTSNFTNELFRALQSNHLVTTAYHPQCNGLVERFNHTF FT AEMISMFVNSKHSNWDDVIDHVVFAYNTSKQESTGKTPFLLLYGREALLPI FT DAALGNNPNPDANHIQLLQQLPALREKVIRRLAWIQHRQKKRYDKHRRLKS FT YSVGDLVLVYRPLRKKGRSEKLLHSYHGPFPIVKKLSNVTYVIKCKNSRKN FT NLDRVHVCSLKPFHPRLSNQNITFPSCDPVLPTEKGKSRTLPKSRERSVVK FT GKSRTLPEIREETIVNGRNLSDNFTDRGEKDYIPNQPQIPSIATKPPRVIY FT DNLRSRRELKTVERLGITK*" XX SQ Sequence 5175 BP; 1411 A; 1247 C; 1134 G; 1383 T; 0 other; tttggtggag atgcagggta ttctctaaca ctgttcggtt caacactgta acttgtgcct 60 ttgcgttccg ttttcctgtg tttgtcgggg tctgtttccc ttggtgaaag aatggccgaa 120 cgaggagctc gcgctgcgcg acgtgctgcc gttgccgcag taaacccaga tcgccccgaa 180 gacgttgaac atcatcatca ccaaggacct gcacaacgaa ttggtggtat tgcccgtgag 240 gaaaatcggg taagaagaaa agaacgcgaa ccacctcttt ttcggggaga agcgcatgaa 300 gatgcggtgg attggctgtc gagatacgag gagattgctc attataacga gtggaacgtt 360 gacgagcagc tgcacaattt tgggatccat ttggaaggag tagcgcgacg atggtatttg 420 agtctaatac cagccccact aacgtttcaa gatcttcggg ctcgattctt ggtggctttt 480 aagcccccta attatgattt ggacttggaa accaaacttc gctcccgctt tcaagaagaa 540 cacgaacctg tcatgactta ttgtcacgat gtgatttact tgtgttcccg cgtcgataga 600 aatatgggcg aggcccttaa agtacagcat cttttacgcg gtttgaaacg ccctcttgtc 660 caaaaggttt atcccttcct cgaccccgaa gtccatacct cccaagattt tatgcggttg 720 gtacaaattc aatgccaagc tgacttgtta gccaatcaac caccgttagt gtcgtcatgt 780 acacctccag cagtattaat gataccacca caaccaccca ttctaccaca agcctccccg 840 ttcgttactg aagatagatt gaaggcattc aaaaaagaat tggcaactga atttcgatgc 900 gaactcagtt caacattgag tagcatgagg aaggaaatca gtaatgacgt tcgggacgtt 960 ttacgagaga gccggaatcc aaaacctttt acgagcaaac gttctcggga cggacaaccg 1020 atatgtcggc agtgccacca gcccggccat attgcgcgtg aatgtgacca acgcaagtgt 1080 tttaagtgta acctccccgg tcaccttgca aaaagctgcc ctactccttc atcctcagtg 1140 aagccgtcaa actagctgcg gtggttgagc ggggtaacac caccgattcc gtcttctcca 1200 ttccccaact ggatcgcact aaattagtat tgaagaatgt cttattttca aatcgtaacg 1260 tggaggccat cgtggatact ggttctggcg ttactgttat ttcccccaac ctttgtacag 1320 ctttaaaaat cccaataaga agatggactg gacaggacgt ggttttggct gatggtaaaa 1380 gaacgcgccc aaagggtatg attgatgcag aatttacaat tgaccaccgt cctgttactg 1440 tctcggctct agtgtttgac atcaacggat atgatctgtt actgggaaat gatactcttc 1500 gtcaacttca gtcaatccaa gttgattatt ttcccgatat ggcttcctta acaataggag 1560 atccagactt catgtgtaat aataaagaca attatgtttt ttgtaacaaa tcctgtatgg 1620 tgcctgctca tactatgatc gtcatcccaa tcaaacatgc acgtagtcct aaaacgggtg 1680 ctgaaaaact tgaaatgata gaattttctc caaaagtcat gaatgacaag ggattgtcaa 1740 ttggaagatt ttgttactca tctagagccc cacctgaaac tgttcaactt ataaatttct 1800 ccgactctcc ccagtggatc caagaaggag tggccttggg aaagattttc gaagtacagg 1860 aagcgtccga tacgtcagcg gcaaaaactg aaacagaagc cctcgatttc gatcgaagta 1920 ttaatcaaga tttaccccta gtcgatcgag tgaaattcaa gaatttgctg ggccgatact 1980 cttattgttt ttccgcccat gatgatgatc tcggatcttc taatttggtc caacatcgga 2040 ttgatatcgg cgatcaccta ccgacccatc aagcaccgta tcccagctca tggaaacaac 2100 gagaaataat aggtacccaa gtgcagcgta tgcgtcgagc gggaattatc gagccctccc 2160 agagtccctt cgctgctcct gtggttttgg tacgaaaacc tgatgggtca tggcgattct 2220 gtgtcgacta ccgaaaactg aatgcgatca cagtcaaaga tgtgtacccc cttcctcgaa 2280 ttgatgatgc cctgagccgg ctggaggggt ccaagtattt ttcgatcata gatctccaga 2340 gcggctattg gcaagtacag atggaaccaa gcgatcgaga aaagaccgcc ttcatcacag 2400 ccgacggtct ttatcagttt cgagtcatgc cttttggcct tacaaatgcc ccaagtacgt 2460 ttcagagaat gatggatgtt atgttggccg gtctgaaatg gaacacttgt ttggtttatc 2520 tcgatgatat agtgatattc tctgactcaa tctcacagca tttacaacgt ctagaagtta 2580 ttctgcaacg tttggcgcaa gccaacctaa aactgaaact gaataagtgt tctttcgccg 2640 ccacgcagtt aaaaattctt ggctatattg tgagcggaaa tggtctttcc cctgaccctt 2700 caaaagttaa tgctgtccaa gcttttccca ccccaaccac cgtcaaaaat gtacagagtt 2760 ttgtcgggct atgctcttat tacaggcggt tcataagaga ttttgccgtc atcgccaggc 2820 cgctcactga actaaccagg aaaaatcaaa tttttcgatg gacggccgaa cacgagacca 2880 gcttccgctc ccttcaagct gccttgacgt cgcctcctgt tttaggacat cctgattacc 2940 gattaccaat ggagatccac tgcgatgcat ccgattacgg cattggcgct gttctcgtgc 3000 aacaacagac cggcggagag agagtattgg cgtacgccag tcgccttttg agtcacgctg 3060 agtgcaacta ttccatcacg gagaaggagt gtttagccct cgtttggtcc acccaaaaat 3120 ttaaggtctt catttggggc atcaaactca aggtagtgac ggaccatcac gccttgtgtt 3180 ggctaatgcg aaagcgtgat ttagctgggc ggctagcccg ctggagcctc caattacaag 3240 atttggacat tgaagtcgtt tatcgaagtg gccgacttca tacggacgct gatgcacttt 3300 ctcgtcaccc gatcgaccca cctgaacctg aaggcgaaat tcctatgttg ttgattagtt 3360 cggccactca ttccaaacca gctgtgatca aaacgttgca acttgagtgt gattggtgta 3420 atccgattat acaggggctg aaagatgaac atccttgtca ccgagttcac cgactaattc 3480 gacattttgt tctaaaaaaa gatctcctat accatcggat tatccgcaat gggcgcgctt 3540 acctccgatt gtgtctacca cccgccctcc aaaaacaagt cttattagcg tgccatgatg 3600 atgtcaccgc tggacatctt ggtgtgactc ggaccctagc caaaattaat caacgctttt 3660 attggccgaa aatgatccag agtgtcagcc aatatgttcg atcctgtgaa gattgccaga 3720 caaagaagaa gcccaaggaa cgcccggcgg gatatctaga acccgtccaa gccaggctgc 3780 cattcgagaa aatcggaatt gatttaatcg gaccgtttcc actctctact cttgggaatc 3840 gctacgtcat tatcgccgtg gattatttaa ccaaatgggt gatcgcaaag gccattccaa 3900 cggccactag taaggaggtg gttgactttt ttgtacggcg aattgttctt caacatggcg 3960 cccccatcaa tgttatctcc gaccgcggta aatgtctaac ctccaacttc acgaatgagc 4020 ttttccgagc attgcagtca aatcatctcg ttactacggc ttaccacccc caatgcaatg 4080 gacttgttga aaggtttaat cacacctttg ccgagatgat ttcaatgttt gtgaattcaa 4140 agcattccaa ttgggacgac gtgatcgacc atgtcgtctt cgcgtacaac accagcaaac 4200 aagaatcaac tggaaaaacc cctttccttc tgttgtatgg aagagaagct ctcctaccca 4260 tcgacgcagc tctcggaaat aatccaaacc ctgacgccaa tcatatccag cttttacaac 4320 aacttcccgc tcttcgagaa aaagtgataa ggcgtctagc gtggattcag catcgtcaaa 4380 agaagaggta tgacaagcac aggcgtctga agtcctattc cgttggagat ctggtattag 4440 tgtatcgacc tctgcgaaaa aaaggacgtt cagaaaaatt gttgcactcc taccatggtc 4500 ctttccctat tgttaaaaag ctgtcgaacg ttacttatgt gatcaaatgt aaaaattccc 4560 gtaaaaataa ccttgaccga gtccatgtat gcagtctcaa gcctttccat ccccgcctat 4620 caaatcaaaa tataactttc ccatcctgtg accctgtttt acctaccgag aaaggaaagt 4680 cgaggacttt gccgaaatct cgagagcggt cggtagtgaa aggaaagtcg aggactttgc 4740 cggaaatccg ggaggagaca attgtgaacg gtcggaacct ctcggacaat ttcactgaca 4800 gaggcgaaaa agattatatc ccaaaccagc ctcaaatccc ttctatagct actaaacccc 4860 cccgtgttat ttatgataac ttgcgatcca gacgagaatt aaaaactgtt gaacgacttg 4920 gaattacaaa gtaatcatta tgtaaacgta ttttttttaa tgtgtgatat tcgtgtgttt 4980 agatcaagag cctcgtggat gtattcacct taacctggaa aacgtcgaaa tttaatacat 5040 gtcagttact aattatttat tctacccgcc ttttccttac tctgatatag ataccaagct 5100 agccatctga gagctctccg tcattgatgt caataatgcc atgatcggga cgatcattta 5160 cgtgggcggg gaaga 5175 // ID BEL-69_CQ-I repbase; DNA; INV; 6386 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-69_CQ_; KW BEL-69_CQ-LTR; BEL-69_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6386 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Direct Submission to RU (07-JAN-2011). XX DR [2] (Consensus) XX CC Positions [5433-5993] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 3909..6041 FT /product="BEL-69_CQ-I_2p" FT /translation="MDKDCGFERLLGMVWVPDEDVFTFRLNFRRDHAKLLT FT GEEIPTKQQALSIVMSIYDPTGFLAAFIVHGKIILQDVWRSRVDWKQKIPD FT ELFQRWRQWIALLPKIEDLKIPRCYFPGYHPDSLRSLELHVFVDASESAYS FT AVAHFRLVDGKRVRCAFVSSKTKVAPLEPLSIPRLEANAGVLGVRLRKSIV FT TGHSLPITRTRFWTDSKTVLQWIRSMDLRRYRPYVAFRVNEILSMSAVEEW FT GYCPSRLNVADLATKWGKQGPPLDIASPFYQSQEFVYDDPSEWPEPCDEMV FT ELAPEELRSAFVFAHFVLKPLVKWERFSKWERLLRAMGYVHRFIDRKLNRA FT KKPWTTDLTREELQQAERSLCRIAQSDEYPDEVATLNQNQRVPAEQRRPLE FT KSSNIVKLCPMLDEAGVLRVKGRIQAADFVPYDARHPVILPRDHPVTALLL FT DYYHRRFQHANNETVVNEVRQKFSISKLRVQVRLTQNNCEWCKVYNSAPIA FT PQMAPLPRARLSPFLRPFTFTGVDYFGPYLIKVGRSVAKRWIALFTCLTIR FT AVHLEVVAGLSTDSFKKAMRRFIARRGAPQEIYSDRGTNFIGASGELAKEI FT VLNINQELSSTFTDAHTQWRFNPPAAPHMGGSWERMVRSVKVALGAIPTQR FT KLDEESLVTMLTEAEYMVNSRPLTFIPLENADQESLTPNHFLLMSSSGVQQ FT PAKDPVSEGAA" XX SQ Sequence 6386 BP; 1609 A; 1654 C; 1736 G; 1285 T; 102 other; gaaaactcga agatttaccg caaatcatac atttgatcgg aataaatcat gaatcagcgt 60 ccmaacaccc gtcaaactcg ttcgcaaacg agagccgctc agcaattaca gcagcaagca 120 ggaattaacc tccaaagtgt tgacgctgca ggcgagcccg cagctcggcc cgcagtcttt 180 gaagcctcaa cagttgacat cgagggcgga gatgatattg attgtgggag ttgcgtccgt 240 ccgaacaacg ccgagatgta tatggtgcaa tgcggscagt gccaaatcta ctaccacttt 300 tcgtgcgcag gagtcacaat agatacggtg tacctgaagc cgttcgtttg caagacctgc 360 gcccagttac gcccggcaag aaatacccca tcagtgcgtt ctgcatcgag tgtgagcgtg 420 cgcagctcgc agatttccgc ggaactgcgg agagtagaag aagagttaca actagaggat 480 cagatgtcac kgcttttaga acgcaagaaa cakctgatgg ccaaaagaca tgaactgttg 540 agccagaagg gagacgccaa gtcccgtagc agcggtcgtw gtagccggac gaccacggac 600 agagtccaag attggataat ggaccaagga atggccgaga attccggcca tggcatcgac 660 aaaccgatga ccgaccctac cgagtctatt aacmaactgt atggaaatcc ggtgaaccag 720 cttaacgcag ggcgaacttc aacgcccctg accaaggaac tcggcgccgt cccgaaagat 780 cgcaaacagt catcggaggc tgctttcggc gtttctaatt ttccgaaact acccgggata 840 ccgccagtgg acaatcaatc tgtwctgggt accaagaaaa cgctcctgga taaaccattc 900 ccgaaaggtg ctattggtaa actcttttta cctgcgtcgg cttacgaaac ctggcgaagt 960 gaaacgaaac ggtacgagaa acaggagctt atcaagcaga aaaaccgtgc agcagctgaa 1020 aacgaatacc aacagaagtg ttttgaggag gtccagcagg agctgaacga gcagattgag 1080 gcgttaaaac ttcagcagca cgggaacgag agcgatcaga atcctccggg aaattggtac 1140 gaccgtgaac tgttgaagct gcagaaatcg cgtgacgaca acgccacgaa acaatttccw 1200 ccgcccggtg cccacaacag cttgccgtct atttcgggaa ggagtggtga cccgatagat 1260 ctcaacatca gaagcccctg cggcggacgc tcaagattat ctgtgtcaac tgcggaacca 1320 cctagcttct ctacgcggcg atcgaatccg gtgccaccaa atctgtcacg aacggcgtct 1380 ccgccgcccg cgcctgtgat cacagctcaa cagctggctg cgcgtcaagt agtgtccaaa 1440 gacttgccag acttcttcgg ggatccggca gagtggccac tgttcatcag tagatacaac 1500 cactcaactc aaacttgtgg ctttactgat tccgagaatc tgctccgtct ggaccgggca 1560 atcaagggtt atgcgagaga gatggtaagc agtttgttat tggatccgtc aacggtacct 1620 gagctgttgt cgtcgttgcg actcctttac ggtcgaccgg aacagatagt gcacaacctg 1680 atcgctaaag ttcgagctac accggccccg aaagcaaaca aactcgattc tctggtaacg 1740 tttggattgg ttgtacagaa cctgtgcgga cacctgagag ctatcggcat ggaaaaacac 1800 ctttccaatc cgacccttct gtgcgagctg gttgaaaagc tccccgacaa cgtaaagttt 1860 aactgggcct tgtatcagca tcagctgccg gaagttgact tgaatgcctt tggtatttac 1920 atgtcacaaa ttgcgacggc tacgagcgga gttacactct tgtctacacc ccctagagct 1980 gtacgagacg accgtccaaa atcgaaagaa aaggcgttcg tcaacgcgca cgccgaacaa 2040 gaaggcgagw cmagcgacgg tgagggcaga gascmkccga aggagcgstt gaacgttgtc 2100 aacaagatcg aaacccatag ggcgtgcccg gcwtgcggmg tcgamggtca ccckgcgmgg 2160 gactgcgkwc awttcmaggg kctgagcttg gacgatcgct ggaaggtggt kaaagmgaag 2220 aagttktgcc gacgctgttt gactcckcac tcccgttggc cgtgcaamgc wgaagcctgc 2280 ggkgtgaacg gttgtcaaaa aagacaccac cgactwctgc actacgaccm sccgccggtg 2340 gagcacaaag cggctsagcs taccgacgct acgacgmasg ccacggtcac gattcaccgc 2400 cagccggmcm cgacgaccct tttccgmats ttgccagtca cgctgtacgg ggcgmgsggg 2460 cgggttgaca cstttgcctt cctggacgac ggatcctckg tgacsctkgt ggagagatcg 2520 attgcgaacm agctgggwat tgcgggagag gackcctctc tctgcmtcca ctggaccggc 2580 ggcatmaaga aaaacatctc gaacacgcaa ctggtmgatc tggagatctc cggggccagc 2640 macacgaakc gactcawggc gaaggcggtt tacaccgtcg amgggctsgg mttgccggam 2700 cagtcgatgg atttcggggc gatggcggcc aagtacgatt acctgcgaag tctccccgta 2760 cagaacmtga wgtccgcggt tcctggcgta ctgatcggkc tgaacaacct mcacctgatg 2820 gccccsctga agctacgwga aggcagagaa ggcgaaccaa tcgccacgaa aacccgcctt 2880 ggctgggcmg tctacggakc katsccsggg mgwgaggcgc ccttcctaca ccggcagatg 2940 cacatcsatg gaagctcgcc aggtgwcgat cttcacgaat atgttcggag ctttttcgct 3000 gttgaawgca tcgggatwgk agccaccgga agtgccgawa gtccagacga acaacgagcg 3060 aaaaaaatct tggcggagac caccaagcgg acttcaaccg gccggtttga aaccggactt 3120 ctgtggaagc aggattacat cgagtttccg gacagtagag caatggccga aaaacggatg 3180 aggtgcctcg aaaaacggtt gcttcgggac caaacgctgt acgacaacgt tcgaaagcag 3240 atcgcmgatt tccagcasaa aggwtttgcc cacaaagcka ctgccgaaga gttgagaagg 3300 tttgacccgc gccgaacctg gtacctacca cttggggtgg tcatcaaccc aacaaaccgg 3360 gaaaagttcg tcttatctgg gacgcagccg caaaggttga mggtgtttcc ctaaactcga 3420 tgctsctgaa gggtccmgac ttgctttcwc ctctkmtgtc cgtgctsttc aagttccggg 3480 aamggcaagt ggcgatcggm ggtgacatcg aggccatgtt ccaccaggtc cksatmcgcg 3540 aagcagatcg aagcgcccag cttttctact ggcgggactc cccagagaaa ccgctggaaa 3600 ctatggtgac cgacgtcgcc attttcggcg ctagttgctc ccccgctcac tcccagtatg 3660 ttaagaatct caacgccacc gagcacgaag ccgagttccc cagggcagct gacgcaatcc 3720 ggaccaagca ttacgtcgas gattacctgg atagtgtcga caccgccgaa gaagcagcgg 3780 caattgcgct tgaggttgct gatgttcacg ctaaagctgg gtttcacatc cggaactggg 3840 tctccaacga cgactcggtg ttgcagaaaa tcggtacagt caacccaact acagttaagc 3900 ggttcgtgat ggacaaagat tgcggkttcg agcggttact gggtatggtt tgggtgccgg 3960 atgaggacgt gtttacgttt cgcttgaact tccggagaga ccacgcaaag ttgctgactg 4020 gagaggagat ccctacgaag caacaggcac tgagcatcgt aatgagcatt tacgacccga 4080 ccggattctt agcagcattc atcgtacacg gcaagattat cctccaggac gtctggcgat 4140 cacgcgtgga ctggaagcaa aaaattccag acgaactgtt ccagcgttgg agacagtgga 4200 tcgcgctcct tccgaagata gaagatctga aaataccccg ttgctacttc cctggttacc 4260 accctgacag cctgcggtca ctggaacttc acgtcttcgt cgatgcaagt gaatcggcat 4320 actcggccgt cgcgcacttc cgactcgtgg acggaaagcg cgtacgctgt gcattcgtgt 4380 cgagcaaaac gaaagttgca ccattagagc cgctctccat cccgcggctc gaggctaacg 4440 caggtgtact gggtgttcgt ttgcgcaagt ccatagtgac cggccattcc ctacccatca 4500 cacggactcg tttttggacc gactcgaaga cggttctcca atggattagg tccatggacc 4560 ttcggcgcta ccgtccgtat gtggcctttc gagtaaacga aatactgtcc atgtcagcgg 4620 tcgaagagtg ggggtactgc ccatcccgtt tgaatgtggc agatctcgcc acaaagtggg 4680 gtaagcaagg cccaccgttg gatatcgcca gtccgtttta ccaaagccaa gaattcgtat 4740 acgatgaccc ttctgaatgg ccagaaccct gcgacgagat ggtggagctt gcgccagaag 4800 aactaaggtc ggcatttgta ttcgcgcact tcgttttgaa gccattggtc aaatgggagc 4860 gattctcaaa gtgggagcgg cttcttcgtg ccatggggta cgtgcatcgc tttattgatc 4920 gcaaactgaa ccgagcgaag aagccgtgga caaccgattt aacgcgagaa gagctgcagc 4980 aagcagaaag gagcctgtgt cgaatagcac aatctgatga atacccggac gaagttgcta 5040 ctttgaacca gaatcaacgt gttccagccg aacaacggcg acccttggag aaatcaagca 5100 acatcgtgaa gctctgcccg atgctggacg aagcaggagt tctacgagta aaagggcgca 5160 ttcaagccgc cgacttcgtt ccctacgacg ccaggcaccc ggtcatcctt ccccgagatc 5220 atccagtgac agcgttgctt ttggattact accatcggcg gttccaacac gcaaacaatg 5280 agaccgttgt caacgaagtg aggcaaaaat tcagcatctc gaaactgcgg gttcaagtgc 5340 gtctaacaca gaacaactgt gaatggtgca aggtgtacaa ttcagctcca atcgctccgc 5400 aaatggcccc actgccccgg gctaggctgt ctccgttttt gcgtcctttc acattcaccg 5460 gagtggatta ctttggacca tatttgatca aagttggaag aagcgttgcs aagagatgga 5520 tagcactttt tacgtgtttg acaataaggg ccgttcatct ggaggtcgtt gcgggtttgt 5580 caactgactc gttcaagaaa gctatgcgga gattcatcgc gcgcagggga gctcctcaag 5640 aaatctactc ggaccgcggc acgaacttta tcggagcaag tggtgagtta gcaaaggaaa 5700 tcgttcttaa catcaaccag gagctgagca gcacgttcac ggatgctcac acgcagtggc 5760 gcttcaatcc cccggcagca ccgcacatgg gaggctcgtg ggaacggatg gtgcgttctg 5820 tgaaagtcgc tcttggagcc ataccaacgc aacgaaagct agacgaagaa tcgttggtaa 5880 ccatgctgac agaggcggag tacatggtaa attcccgccc tctcacgttc atccctctag 5940 agaatgctga tcaagaatcc cttaccccaa atcatttcct gctgatgagc tcgagcggcg 6000 tgcaacagcc ggcgaaggat cccgtaagcg aaggtgctgc twtgaagaac agctggaacc 6060 tgattcagca cgcattggac gagttttggc gtcgatggat taaggagtac atgccgacac 6120 tttcgagacg ggagaaatgg tttggtgaaa cgcggccgat taaagaagga gatcttgttt 6180 ttgtagttga cgagggaacg agaaatcgct ggcagcgagg acgagtagtg cggacgcatc 6240 cgggcaagga cgggcaggtt agacgmgtag atgtgcgcac agtcaacgga gttctaccga 6300 atcgtgcagt cgttcgattg gccctamtag acgtagctgt tgatggtgac gccgaagaaa 6360 cacttcaggc gacacgtggg ggagaa 6386 // ID Gypsy-260_AA-LTR repbase; DNA; INV; 1405 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-260_AA_; KW Gypsy-260_AA-I; Gypsy-260_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1405 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1122-1122 (2011). XX DR [1] (Consensus) XX SQ Sequence 1405 BP; 354 A; 346 C; 363 G; 342 T; 0 other; tgtagcagta ccatgaattt gtaattataa tttgtaaata aatagcatta gtagcaaatg 60 taatgaactt caccattgaa ctaccccttg aaatcgaccg ctcagtttga gtgaatatta 120 gcatagaact caatgaactc actgagctca tttactccac accgcagtcc ataaactaaa 180 tgagttccaa ctagccttct ggcgccaaac ctcagcagcc atacgaactt tgttgaagcg 240 accatgtgac gatcctcgga catcgacctg tcgagatcgt attgaagaat cgttgacccg 300 gcctttccta ttcataatct tcgggagggc agcgatttgg aagatctatc ttggggaaag 360 gggatgcttt cacccgacga atttggaacg ttggtgtgcg tgaaagcgaa gtcttgacct 420 tcgcgaaatt aacttcgtta atttcgcgta atatccggtg acgaagacct tcacatctcg 480 cgccttatct gaggatcttc agacttagct cgttcacttc gttcaggtcc gcggaagagg 540 cgagaacggt ccgaattagt tatcgcgccg tggtgtgttc cgaaggtgaa ctaaaaagtg 600 tgaaagtgct tcgagtttac tattagattt tctttcgttt ccctcgaata gattaggtag 660 agttgctagc agaaagtgcg tagtgcgttt attgcggttt aagtgcgagt gtagacttgc 720 ggattaagcg gtaagacctt tgtgcaaatg gtgagcgtca gtgttcaaca cccgccatca 780 acctttttta tagatttgtg gcacgttgcg agcccgtcgc accaccccca tatcgttaga 840 gaccgttttt gaccccgctt tcctgctgcg aaggaagtcc cggcggtaca cgtggccgga 900 gagggcagcg tacgttactc ccgcgccctt ggcagggcgc caagcaacga ggccggtttc 960 taccggcgag gttctcaact ccgacgacgt tacgtccgag tgagaaacag cgaaggaaaa 1020 aagttccccg accggattcg atccggtgac caccagcgtg agaggcgagg accctatcca 1080 ctagaccacc gtcagtgtcg atcgttggtg agacggccgt accccgtcgt tgaagaccac 1140 aaatctgcgt gggatgcgcc gttgagaacg gcaggcaaga ctatcatcga gcgagcgagc 1200 gagcaagtag catccagagt cgagcaccgg agaggagacc gcgtgtagta cacgtggtac 1260 taatagttcg agtcagcaaa ccgcagatgc ccccacgaca gtagacagcg tccaaatcgt 1320 aagtagcatg catgcttttt gattcccatg cgcatggtga gatttgcccc tagccttagc 1380 caaatcagct agtgtggccg ctgca 1405 // ID Gypsy-13_AA-I repbase; DNA; INV; 4263 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_AA_; KW Gypsy-13_AA-LTR; Gypsy-13_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4263 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 995-995 (2011). XX DR [2] (Consensus) XX CC Positions [3316-3792] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 181..4239 FT /product="Gypsy-13_AA-I_1p" FT /translation="MTSDSKSNPFQAASVQKMATVGRMDPFIPGEDFDVYI FT KRLRMYFIANGIPENLKAAVLMTVMGNETFQILDSLFSPEDPCTKTFDQIV FT EKLKNQFKPMVIVSAERYKLYTRKQKVNEPIAEYVVALKHLAQSCSFGNFL FT ADALRDAFIIGIHDQKIRKRLLSEELDFDKAYKIAAGMELADAEEMNLKIE FT SGLNHVVSGKKNVSVRSVKRGSAISRETIDEHRAKYCTRCGRSNHAVNQCP FT AANVLCYKCRLRGHFANQCRTRNVHLIQDNDEEEFVDETVDLISWVAEGNE FT PLLVPVHIEGKEVIMEVDSGACKTIISENEYREKFYHVTLKQEPKSFQVIT FT GQKFRPLGVGEFKVNMPNSNCQITLEMSVIRSDFKFKPLLGRSWLNVLYPG FT WKNRIISQERNSVHTVENAQTDKVMTNGKREIVDYQGLVRREQVLGENKVN FT FLKALQEHFPRAFSEDNSEPIEEFEVDIVLKEHTPVFHKAYELPFKMREPV FT EKELERLVADGILKPVKHSRYASPIVAVPKSDGKSIRVCVDCKRTVNPFVE FT TEHYPLPRIDDILASFPGCKVFCVLDLKGAYQQLALSEESKKYLTINTHKG FT LFQFQRLCYGVSSAPSIFQQVIDQILRGLKFVKSYIDDTLIGGKNFAECFE FT NLKRVLDRLNKYNVHINLEKCEFFLPQVDYLGHTLSEKGISPNSDKIKAIV FT EAPSPSNCAQLQSYLGLLNYYSRFLPNASSLLRPLYDLLKKDTTFHWTEIH FT ERAFKDSKRLLLSNNLLEMYDPDKKIIVSCDASPYGVGAILSHEINGEQKP FT IIFASSTLSAAEQNYSQLHREALAIMFAINRFHKYIYGYKFELHTDHEPLQ FT AILNPKKCKSAIAVARLQRWAIQLSMYNYVVKYKPSSKMRHVDALSRLPLT FT DDTKVDFVKSLNVSNDIPIDLEMVRKSSETDPLIKMLMKFCRSGWARTKQF FT DPELKYYHKLQQCFAVENDCLVYRDRVVIPKEMQIKILELLHEGHIGIVRT FT KMLARQYVWWRNIDNDIEAFINHCSVCQQTQKKASSVTIPWPTPKTVLERV FT HLDLFYFGKYSFLIIVDAFSRWIEVCLLKTSNARTIIDNMRKFIVTFGLPK FT EIVSDNGPPFNSANFIDFCIRQGIKTTKSPPYHPESNGLAERGVQTTKAAL FT KKFLLDHKYRKLTIDEQVDNFLFKYRNTPNAVSGISPAEMMFSFKPKTLID FT KLSIKSDKLENVQCKREKEKVTSTKIKHFVKGEKIMYMNHFKNFCKWIPGR FT VVDKLSQALYRIEISGNVRIVHVSSIRKSNLHDKYHPNVSVARSTQVQARP FT HRVVLKKNIGSTKHSSQTKWSDFSKLRRSQRKRKPPVFYRC" XX SQ Sequence 4263 BP; 1455 A; 701 C; 905 G; 1202 T; 0 other; agctctataa aagcgcaggt caatcttgta caggtctctt ttaagttctg gacatctatc 60 aataaacact aattgatagc agcgcatctg tgttttgtaa gtataaatcc gaaacatatt 120 agttggcgac gaggatagtg caaccattcg tgtgcaaata cacgtgtgtc gtgtgaagca 180 atgacaagtg atagtaagtc gaatccattt caagcagcaa gtgttcaaaa aatggctaca 240 gtgggccgga tggatccgtt tattcccggt gaagattttg atgtgtatat taagagatta 300 agaatgtatt tcatcgccaa cggtattccg gagaatttaa aagcggctgt gttgatgaca 360 gtgatgggaa atgagacgtt ccaaattttg gactcattgt tcagtccgga agatccgtgt 420 actaaaacgt ttgatcagat tgtggagaaa ttgaaaaacc agtttaaacc aatggtgata 480 gtgtccgcgg aaagatataa gttatacact cgaaaacaaa aagtgaacga accgattgct 540 gaatatgtcg tggcgttaaa acatctggca caatcttgta gtttcggaaa tttcctggca 600 gatgcgctcc gcgatgcttt tattatcggt attcatgacc agaaaattcg aaaacgactg 660 ttatctgaag aattagactt tgataaagcg tacaaaatag cagccggtat ggagttggct 720 gatgcagaag agatgaattt gaagatcgag agcgggttaa accacgtggt gtccgggaag 780 aaaaacgttt cggttagaag tgtaaagcgt ggtagtgcaa tatcaagaga aacaatagat 840 gaacatcgtg caaaatattg tactcgatgt ggaaggtcta accatgcagt gaatcagtgt 900 ccagcagcta acgtattgtg ctacaaatgt agattgcggg gtcatttcgc aaaccagtgc 960 agaacaagga atgttcattt gattcaggat aacgatgaag aggaattcgt agatgaaacc 1020 gtcgatttga tcagttgggt agccgaaggt aatgagccgt tacttgttcc ggtccatata 1080 gaaggtaagg aagtcatcat ggaagtggac agcggtgcgt gtaaaacgat aattagtgaa 1140 aatgaatatc gagaaaaatt ttatcatgtt accttgaaac aagagccaaa atcgttccaa 1200 gtcattacag ggcagaaatt taggccacta ggagtgggtg aatttaaagt aaacatgccg 1260 aacagtaatt gccagatcac tttggagatg agtgtcatac gttcagattt caaatttaag 1320 ccacttttgg gcagatcatg gttaaatgtg ttgtatccgg ggtggaaaaa cagaattatc 1380 agtcaggaac gaaatagtgt acatacggtt gaaaatgctc aaacagataa agtgatgacg 1440 aacggaaagc gcgaaatagt cgattatcag ggattagtac gaagagagca agtgttgggt 1500 gaaaataaag tcaattttct aaaagcgctt caagaacatt ttccacgagc attcagtgag 1560 gacaatagtg aaccgataga agaatttgag gtggacatcg ttttgaagga gcatactcca 1620 gtgtttcata aggcatatga actgccattt aaaatgcggg aacctgtaga aaaagaattg 1680 gaacgactgg tagcagatgg aattttgaaa cccgtcaaac atagcaggta tgccagtccg 1740 atcgtagctg tcccgaaatc cgatggtaag tcgatacgag tatgtgttga ttgtaaacga 1800 actgttaacc catttgttga aacagaacat tatccccttc cacgaatcga tgatatctta 1860 gctagctttc ccggttgtaa agtattttgt gttctcgatc ttaaaggagc ctatcaacag 1920 ttagctctgt ctgaagaatc caagaagtat ttaacgatta atactcacaa gggtttattt 1980 caatttcaaa ggctttgcta tggtgtaagt agtgcacctt ccatttttca acaagttatt 2040 gatcaaattt tgaggggatt gaagtttgtc aaatcttaca ttgatgatac tttgatcgga 2100 ggaaaaaatt ttgcggaatg ttttgaaaat ttaaagcgcg tactggatag actgaataaa 2160 tacaacgtgc atataaatct agaaaaatgt gaatttttct taccacaagt ggattatctt 2220 ggtcatactt taagtgaaaa aggcataagt ccaaatagcg ataaaattaa agcgattgtg 2280 gaagcaccta gtccttcaaa ttgtgctcaa ttgcaatcat acctaggatt gctcaattat 2340 tattcacgtt tccttcccaa tgctagtagt ttgttaagac cactttatga tcttttgaaa 2400 aaggacacaa ctttccattg gacggaaatt cacgaaagag cgtttaaaga tagcaaacgt 2460 ctcttattgt cgaataattt attggaaatg tatgaccctg ataaaaaaat aattgtctcc 2520 tgtgatgcat ctccttatgg agttggagca attttatccc atgaaataaa tggagagcag 2580 aaaccgatca tttttgcttc cagcaccctg tctgcagcag agcaaaatta ctcacagttg 2640 catagagaag cactggccat aatgtttgcg attaatcggt tccataaata tatatacggt 2700 tacaaatttg aattacatac agatcatgaa ccacttcagg ctattctaaa tccaaaaaaa 2760 tgtaagagtg cgatagcagt agctaggtta caacgatggg ctattcaatt gtcaatgtat 2820 aattatgtag taaaatacaa accgtcttcg aaaatgcgac acgtggatgc cttatcgaga 2880 ttacctctga ccgacgacac caaagttgat tttgtgaaat cgttgaatgt ttcaaatgat 2940 attccaattg acttggaaat ggttcgaaaa tcttccgaaa cagatccact gataaaaatg 3000 ttgatgaaat tttgtcggtc aggctgggca agaactaaac agttcgaccc tgaattaaaa 3060 tattatcata aattacaaca atgtttcgcg gtggaaaatg attgtttagt atatcgtgat 3120 cgcgtagtga ttccaaaaga gatgcagatt aaaattttgg aactattgca tgaaggtcat 3180 atcggcatag tgcgtacaaa aatgctagct agacaatatg tatggtggag aaatattgat 3240 aacgacatcg aagccttcat aaatcactgt tcagtttgcc aacaaactca gaaaaaggca 3300 tcatctgtta cgataccttg gcccactccg aaaactgtcc ttgaacgagt acatttagat 3360 cttttctatt tcggaaaata ttcgtttttg ataatagtgg atgcttttag cagatggatt 3420 gaagtttgcc tcctcaaaac atccaacgca agaacgatta ttgataatat gcgaaaattt 3480 attgtaactt tcgggttgcc aaaagaaata gtgtcagata atgggccacc gtttaattct 3540 gctaatttca tcgatttttg cattcgacag ggaataaaga ctaccaaatc gccaccatac 3600 cacccggaat cgaatggttt agcagaacgt ggagtgcaga caactaaagc cgcgttgaaa 3660 aaatttttgc tggatcataa gtatcgtaaa ttaacaatcg atgaacaagt tgataatttt 3720 ttgtttaaat acaggaacac accaaacgct gtttcaggaa tatctcctgc agagatgatg 3780 tttagcttta aaccaaaaac actgatagat aaattgtcca ttaaatcaga taaacttgaa 3840 aatgttcaat gtaagagaga gaaggaaaaa gtaacaagca caaagattaa acattttgtt 3900 aaaggggaga agatcatgta catgaatcat tttaaaaact tttgtaagtg gattcccggt 3960 agagttgtag acaaactttc tcaagcttta tatagaattg aaataagtgg taatgtaaga 4020 attgtacacg tttcttccat aagaaaatct aaccttcatg acaaatatca cccaaatgta 4080 tctgttgcaa gaagtactca agttcaagcc aggccacaca gagtcgtttt gaagaaaaat 4140 atagggtcta ccaagcatag ttctcagact aaatggtcag atttttcaaa actaagacga 4200 tcacaaagaa aaagaaaacc gcctgtattc tatagatgtt agaaaaatct aaaaggggag 4260 aag 4263 // ID DNA2-3_CQ repbase; DNA; INV; 447 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA2-3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-447 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 70-70 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >96% CC identity. 2-bp TSDs. XX SQ Sequence 447 BP; 176 A; 61 C; 65 G; 145 T; 0 other; cccgggcaga cggtaataac aaaattcatg ccatttcaat aacaaattct gttaaaataa 60 cagaaagtgt tatggaatct tcctgaaaaa tccatttttg cataagggtt taataacagt 120 ttatgttatc ataacaaaat ttgttattgg tctgatattg gttgaaagcc aaaacaactt 180 tggaataaca ttttttgtta tggaagaata actccaactg ttattgggat gatcggatta 240 gttgttaaaa taacaaaaaa taataacaaa gatttgttcg aagaataact aaaaatgtta 300 ttagtctgtt attacaataa caatccaata acaaaaaaat cataacgacg aataacaaat 360 cttgttataa ataacgtaaa atgttattgg cctagtattt tcaaatatca aaaaatgtta 420 ttcccaagtt atttccgtct gctcggg 447 // ID DNA8-23_AP repbase; DNA; INV; 207 BP. XX AC . XX DT 22-AUG-2009 (Rel. 14.08, Created) DT 22-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-23_AP. XX NM DNA8-23_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-207 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1765-1765 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 207 BP; 74 A; 43 C; 32 G; 58 T; 0 other; catagataat aaaagaatgg ataggccata ctatcttatc acacgtgcat gaacgttaca 60 aggggtctca taataaaata atgataagac cccttgctag agtgagaaac accatataaa 120 aactggctgt ctctcatcta cattccagca cgcaaacccc tcgtatatta atataggaat 180 ggcctatcca ttattttatt atctatg 207 // ID DNA3-8_AP repbase; DNA; INV; 350 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-8_AP. XX NM DNA3-8_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-350 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1949-1949 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 350 BP; 94 A; 80 C; 68 G; 108 T; 0 other; gaggaaattc aggagcggca aacgttgcaa acgtttgata acggagaaat tcggaacgtc 60 cggttgctaa acgaagctgt aaacgtttgc aaacattgca aacgccatgt tccagcgttt 120 ggtctgtttc gttctagtaa ccagaaccgt ggttgttttt gtttatacta ttgccgttat 180 cgttatcaca attcacgact tatcacgatt aatttttttt tatgtcttac accacgacca 240 cggttaaaca aaataccgca ctaaaccaag cgctcgcgaa tttctacgtt ttcaacgttt 300 caaacgttgg cgttcgttct gcaaaacgtt tgccgctcca gaatttcctc 350 // ID Gypsy-4_SI-LTR repbase; DNA; INV; 164 BP. XX AC AEAQ01007762; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_SI_; KW Gypsy-4_SI-I; Gypsy-4_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01007762; Positions 285 122. XX SQ Sequence 164 BP; 67 A; 20 C; 37 G; 40 T; 0 other; tgtaataagt agatggcgac cgtaatgaat atagagggcg ttagagtagc gagcgcgtgc 60 gtgcgttaga catagagata gaacatacac gattgtagac atacacaata aatatagagt 120 taaaagagaa caatagttat taataagcat ctaataatat taca 164 // ID DNA8-32_AP repbase; DNA; INV; 240 BP. XX AC . XX DT 23-AUG-2009 (Rel. 14.08, Created) DT 23-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-32_AP. XX NM DNA8-32_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-240 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1774-1774 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. Putative hAT element. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 240 BP; 57 A; 49 C; 48 G; 86 T; 0 other; cagggatggg caactggcgg cccgcgtacc ttttttatct ggcccgcgct acattttcaa 60 atttcgtgtt ttcgtgtttt gtagtaatta ttttaaaaat gtccgctaaa aagcgtaaaa 120 aataaatgtt agtaaatttg aaggaccttt ttttttttgc tttctcctat gcactgcggc 180 ccgctagatt ttaaagtact taaaagtggc ccgctagtta ctttgagttg cccatccctg 240 // ID Gypsy-134_AA-I repbase; DNA; INV; 7131 BP. XX AC AAGE02019934; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-134_AA_; KW Gypsy-134_AA-LTR; Gypsy-134_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7131 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019934; Positions 10968 18098. XX CC Positions [5024-5506] - Integrase core CC 'GCAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 420..2627 FT /product="Gypsy-134_AA-I_2p" FT /translation="MEFFKMNPNHLDESEVNYELTVRNYSIEKSVESRRRD FT LRVVFRDPESEKNVRVTDVMMMEDIRVVPSKLKEIADLLQQGPNPACFSRL FT VHYYQRIRRYSPRTMQQQEELRCLLEMISKISANYFATGLDESVGHPPMAE FT STTQAHPLMNTLTVDPSTISANEDRSNQANGQRVSGSPWLMNWDETEGAHG FT GEIDPSESHLTKVIPALVDDLLNFKRGTPTEAAPRQVETAEAWKSNPWNYA FT VPGGHNGKAQQINPAIGVPEFCLETSAEPNGPQVTNSVLDDLRSHLPRVRP FT NAENSRGRESLQSSGFVHVSEIETYVKKYFDQLIRQGSFQISVPNQTMDNL FT SHQIANVSIYPPVNLGYTAAPDSRGTPTLRGTMPDTSPPLQLSGHAPPTSP FT LVHTADNVPRISRNVAYGEPNLNRTGYSATRPDPAENLGYPQNFTRNSFPL FT TMSGHSRRLPHQQCSIIEKWPKFTGDSNAVPVTDFLRQINILCRSYDITKN FT ELRIHAHLLFKESAYVWFTTYEEKFTSWEILEAYLKMRYDNPNRDRIIREE FT MRARKQRPTELFSAYLTEMEMLAQRMIKKMSESEKFEIIVENMKLSYKRRL FT ALEPIHSIEHLAQLCFKFDALESNLYSGSPQPRPTVHQVTGEDDVVDEQFI FT GDELDEICALKARMGKSFSRANTNPENIGEKAKQAICWNCQRVGHMWKDCD FT KRKTIFCHICGLTDTTAYRCPNKHDFGQKGELPKNE" FT CDS 3200..5875 FT /product="Gypsy-134_AA-I_1p" FT /translation="MPTIELPERSFESSEGLITEHTLTELERAALYEAVKQ FT LPETREGQLGRTQLIEHRIDLLPDAKPKKVSYYRWSPSVEKVIDAEIQRMK FT DLGVIEECDGPVDFLNPLLPIKKANGKWRICLDSRRLNQCTKKDDFPFPNM FT MGIIQRLPKSKFFSVIDLSESYYQVPLELSARDKTAFRTNKGLFRFVVMPF FT GLTNAPATMARLMSRVLSHDLEPYVYVYLDDIIIVSNSLDEHIRLIKIVTD FT RLKNAGLTINLQKSKFCQKAIKYLGYVLTKEGLSMDVSKIQPVLDYPIPKT FT VKDVRRLLGLAGFYQKFLPNYAEITTPITNLLKKGVKKFAWTEDADIALEK FT LKAALISAPILTNADFTLPFIIETDSSDLAVGAVLVQVQNGERKPIAYYSK FT KLSSTQRRYSATERECLAVLLSIENFKHFIEGSQFIISTDAMSLTFLKTMS FT IESKSARIARWALKLSKYDMLLKYKKGSENIPADALSRAVNQLDVLDPYIS FT QLKLMIEKYPERYSDFQVSDGKVYKHIHHAANAEDPSYRWKYVAPVIERRE FT IIQKTHEEAHLGFVKTLAKVRERHYWPRLAADVKRFCSQCEVCRESKIPNL FT NVQPMCGKPKLCSRPWELISMDFLGPYPRTKRGNVWLLVVSDFFSKFVLIQ FT CLRSATAVATCAFVENTVFNVFGAPRICITDNATVFKSELFQKLLNNYSVT FT HWPLAVYHPSPNPAERVNRVIVTAIRCSLNQERDHRNWDQNVHQIARAIRT FT SVHESTGYTPYFLNFGRNMISNGREYEDLRESEPRKNPTESAEDIKQLHMK FT VQQNLLKAYQKYSHPYNLRANKRHQFHKGDVVYKKMVHLSDKSRNFVGKFA FT NKFEKVRITEVLGTNTYALERLNGQKIAGSYHGSFLKRA" XX SQ Sequence 7131 BP; 2138 A; 1442 C; 1529 G; 2022 T; 0 other; ttttggcgcc caacgtgggg ccggaaatag tgtttccgat aagttggcat ttggtatttg 60 aataagaagt tatattgaat tgcctattct gaactcaatt gtatttttag tcgataaaat 120 caacctttgg tcattattat tccgcgttta agtaattgaa ttcgtgaaat cgtgtcgttc 180 ggtaagttag ttcctatatt cggttattta tttcatatat tcggttagtt agttaatata 240 ttcggttgat aggttcgaaa ttcctattat ttagtttgtg cttttgcttt atcagtattt 300 ttagtttttt tttcggtttt ggtttggaat tagatttttt ttttcttttc tttgtaaata 360 ttatttaatg aataagtgct aatctatcca atattgaatt acaaagtggt ttgtttgaca 420 tggagttttt taaaatgaat cccaatcatt tagatgagag tgaagttaac tatgagctca 480 cagtacggaa ctattcgatt gaaaaatcgg tagaaagccg tcgaagagat ctcagagtag 540 tctttcgtga cccggaatcg gagaaaaatg ttcgggtgac agatgtgatg atgatggaag 600 acatccgtgt agtgccaagc aagttgaagg agatagcgga cctgttacaa caggggccga 660 atcctgcctg tttctctcgc ttggtccact attatcaacg gattcgtcga tacagtcctc 720 gtacgatgca gcaacaggaa gaattgcgtt gtttgctgga aatgatttcc aaaatatccg 780 cgaattactt cgcgactggg ttagatgagt cggtaggaca tccaccgatg gctgagtcca 840 ccacccaagc ccatccgttg atgaatacgc tgactgtgga tccgtccacg atatcagcta 900 atgaggatag atcgaatcaa gctaatggac agcgcgtttc cggatcccca tggttgatga 960 attgggatga aacggaaggc gcccatggcg gtgaaatcga cccttcggag agccatttga 1020 cgaaagttat accagcgttg gtagatgatc ttttgaattt caagcgaggt acacccactg 1080 aggcggctcc acgccaggtt gaaactgccg aggcttggaa gtcgaatccg tggaattatg 1140 cagttccagg agggcataac gggaaggctc agcagatcaa tcctgccatt ggtgttccag 1200 aattttgcct ggagacgtct gccgaaccca acggacctca ggtgaccaat tctgttttag 1260 acgacttgcg ttcccacctg ccacgagtcc gtcccaatgc ggagaattct cgtggtcgtg 1320 agtcccttca aagcagtgga ttcgttcacg tgtctgaaat tgaaacatat gtaaaaaaat 1380 actttgatca actgattcgt caagggtcgt ttcagatctc agttccaaat cagacaatgg 1440 acaatctttc gcatcagatt gccaatgtca gcatttatcc tccggtaaac ttggggtata 1500 ctgctgctcc agattcgaga ggtacgccta ctctccgcgg tactatgcct gacacttcgc 1560 caccgttgca actgtcagga catgctccac cgacttcgcc attggtacat accgctgaca 1620 atgtgccaag aatttcgaga aatgtggctt atggtgagcc aaacttgaat cgaacgggat 1680 attctgcaac tcggccagac ccagctgaaa atcttgggta tccccaaaat ttcacacgaa 1740 attcgttccc gttaacgatg agtggacatt cgagacgatt gccgcaccaa caatgtagta 1800 tcattgagaa gtggccgaaa tttacaggcg actccaatgc ggttcctgta acggattttc 1860 tacgacagat caacattctt tgtagatctt atgatattac caaaaatgaa ttacggatac 1920 atgcgcattt gcttttcaaa gagagcgctt atgtttggtt tactacttac gaggagaaat 1980 tcacctcttg ggaaatattg gaagcctact tgaagatgcg gtacgataat cccaatcgtg 2040 accgtatcat ccgagaagaa atgcgagctc ggaagcaacg acctactgag ttatttagcg 2100 cgtatctcac tgaaatggag atgcttgcgc agagaatgat taaaaagatg tcggagagcg 2160 aaaaatttga aatcattgta gaaaacatga agctttctta caagcgtcga cttgccctgg 2220 aacctattca ctcgatcgag catctagctc aactctgttt caaatttgat gctctcgagt 2280 caaatttgta ctccggttct ccacaaccac ggccaacggt ccatcaagta acaggcgagg 2340 atgatgtggt tgatgagcag ttcatcggtg atgagttgga tgagatttgt gcgctgaagg 2400 ctagaatggg aaagagcttc agcagagcaa acactaaccc cgaaaacata ggtgagaagg 2460 ccaaacaagc aatttgttgg aattgccaac gggtcggaca catgtggaaa gattgtgata 2520 agcgtaagac gatcttttgt catatctgcg gtttgacgga caccactgcg tacagatgtc 2580 cgaacaaaca tgactttggt cagaagggag aattgccaaa aaacgaataa acgaggtcaa 2640 ttctgggaat ccttgtcctc gtgaaaataa tcaagcgatt cctcttccca gctacagcat 2700 tttcaacagc atccatcaaa ttaataccaa atttcatcgc tgcccccact tgaaagtgag 2760 aattctttcc gaagaagtgg aagggttagc cgataccgga gcaagtttat caataatcag 2820 ctcagtggaa ttaatcaata agttaggctt gaaaatccat ccaatcccaa taaaaatctc 2880 taccgcagat ggaacagcat atcgatgcct aggttatgcc aacgtgccct tctcatatca 2940 acaaaagact cacgttattc cgacgataat cgtgccagaa gtaactaagc gtttaattct 3000 gggagtggat tttttaaaca aattcgggtt tcagctgata cctcccaaca acgatactac 3060 cgaggaatca cagatagaac tatcatgtgc agaagattac ttcggagatc ggactggtga 3120 aatatgtttt caaattgagc cctgctctaa ctcaaaattg tctgaaagca tagacgaaac 3180 ttctgatgag agtttagaga tgccgacaat tgaactacca gaaagatcat ttgaatcttc 3240 cgaaggtttg attacggaac ataccttaac tgaactcgaa agagcagccc tttatgaagc 3300 agttaagcag ttaccagaaa ccagagaggg tcagcttggt agaacacaac tgatagaaca 3360 tcggattgac cttctaccgg acgccaaacc caaaaaggtc tcgtactatc gatggtctcc 3420 aagtgttgaa aaggttatcg acgccgagat acaaagaatg aaagaccttg gagtcattga 3480 agagtgtgat gggccagtgg acttcctcaa cccccttcta ccaattaaaa aagcaaatgg 3540 gaaatggaga atatgccttg actctcggag gctcaaccag tgcactaaaa aggatgactt 3600 cccattccct aatatgatgg gaatcattca gcgactgccc aaatccaaat ttttctcagt 3660 aattgacttg tctgaatctt actatcaggt gccactagaa ttatctgcta gagacaagac 3720 cgcttttcgt acaaataagg gcttatttcg attcgtagtt atgcccttcg ggttaactaa 3780 cgcccccgcc actatggccc gattgatgtc acgcgtttta agccatgatc tcgagccata 3840 tgtgtatgta tatcttgacg atattataat cgtttctaat agccttgatg agcatatccg 3900 cttgattaaa attgtcacag accgacttaa aaatgcaggt cttacaataa atctgcaaaa 3960 gagcaaattt tgccagaaag caattaagta tttgggatat gtccttacga aggaaggctt 4020 atcgatggat gtgagtaaaa tacagccagt acttgactat ccaattccca agacagtgaa 4080 agatgttcgc cgtcttcttg ggctcgcagg gttctatcag aaattcctgc ctaattatgc 4140 agaaatcacc acgcccataa ccaatcttct aaaaaaggga gttaaaaagt tcgcgtggac 4200 cgaagatgcg gatattgcac ttgagaaatt gaaagcagcc ttaatctcag ctccaatttt 4260 gacaaatgct gattttactt taccatttat tatcgagact gatagctcgg acttagcagt 4320 gggtgccgtg cttgttcaag tccagaacgg tgaaagaaaa cccatagctt actattctaa 4380 aaagctgtcg agtacgcaga ggcgttacag cgctactgag cgcgagtgtc tcgcggtttt 4440 attaagcatc gagaatttca agcacttcat cgaaggttcc cagttcatca tctcgacaga 4500 tgccatgagc ttaacctttt tgaagaccat gtcaattgag agcaaatcag ctagaattgc 4560 caggtgggct cttaagctct ctaaatatga catgctttta aaatacaaga aaggctcgga 4620 aaatatccca gccgatgcct tgtcacgagc tgtaaatcaa ctcgatgttt tggatccgta 4680 catttcgcag ctaaaattaa tgattgaaaa atatccagag aggtattcag acttccaagt 4740 tagcgatggt aaagtctaca agcatataca ccatgccgca aatgccgaag acccatccta 4800 ccggtggaaa tatgttgctc cagtaattga gagacgagaa atcattcaga aaacccatga 4860 agaagcacat cttggattcg tcaaaacctt ggcaaaggtt cgcgagagac attattggcc 4920 acgcttagca gctgatgtta aacgattttg tagtcagtgt gaagtatgcc gagaatctaa 4980 aataccgaat ttaaacgtcc aaccaatgtg tggcaaaccg aagctatgtt ctcgtccttg 5040 ggagttaatc tccatggatt ttttggggcc gtaccctaga actaagagag gcaacgtatg 5100 gttacttgtc gtaagtgact tcttctccaa gttcgtatta attcaatgtt tgcgaagtgc 5160 aacagccgta gcgacctgtg catttgtaga aaacaccgta ttcaatgtgt tcggtgcccc 5220 gcgcatatgc attacagata atgctactgt gttcaagtca gaactcttcc agaagttgct 5280 caataattat tctgttactc actggccatt ggctgtctat catcctagcc ccaacccagc 5340 cgaacgcgtt aatagggtaa ttgtcacagc cattcgatgt tccctaaacc aagaacggga 5400 ccaccgtaat tgggaccaaa acgttcacca aattgctagg gcgatacgta cgagcgtcca 5460 tgaaagcacg ggatatacgc cttacttttt aaacttcggg cgaaatatga tcagcaacgg 5520 tcgtgagtac gaagatctaa gagaaagtga accccggaaa aatcctactg aatcggctga 5580 ggacataaaa cagttgcata tgaaggttca acaaaatttg ctaaaagcct atcaaaaata 5640 tagtcacccc tacaatctac gggccaataa acgtcaccaa tttcacaagg gagatgtggt 5700 gtacaaaaag atggtccatt tatccgataa atcacgaaac ttcgttggga aattcgccaa 5760 caagtttgag aaagtgcgca taaccgaggt tctgggcaca aatacctacg ctctggaacg 5820 cttgaacggc cagaagatag ctggaagcta tcatggctcg ttcctaaaaa gagcataatg 5880 accacaagct atgtcggcca tcgatcagat gtgcataaac cagcgtaaat acaaaatact 5940 caaagaggtc gattcctacg tccaatgttt gagatgtcag ctagtctgtt tcctcacatt 6000 gtagaccaaa aatgctttat gtcaattctc agctatgtcg gtgcatcata agaatgtgct 6060 taactagtag aatccgacca aaacactctc agaggtattc aatttggatg tccaatgctc 6120 gagatgactc ttagtagatt cctcgcattg aaagcaatga gaattctgtc aaacaaacct 6180 aaaatttcag aaatatcaat cacatttcta ttccgttagt agttgtttta aggtgtgatg 6240 catcagttgc attcctggcg gttggttgat cgacgaacgt tgaccttgac ctactgcaac 6300 cccttcaata gcccagattt gtccatcaat agaccggttt tgaagtatta aattcactat 6360 tttcccaatc tgctaccata aactttgcat acacttaata cttttacctt acttaatcca 6420 ttttgcctaa attccatgcc aaattagttg aaatagtctt atttttgaag aaattccatc 6480 gcacgatgca cacattctgt tttgtcgcgt tgttttgatt agattgacgt ttcggtatga 6540 agaagctttt agaatcgaaa aagctttcgc gcgtacccgg tactgaaata tcggaatagt 6600 atttggaagc gtcgatcgtc ggttttgttt ttgcgtattt tggctatgtc attttgatcg 6660 atatgaatga gtagttatga tttcgcggta gctgataata tgctagttta accaaatggt 6720 tggaaaacaa tatgctcaga tatttctgag gttaattcgg ttatgattag attgaaggtt 6780 ttatgctcat atatttctga ggttaattcg attatgatta gttttggaga tctgttgctc 6840 agatatttct gaggttaact cggtataaaa atctcaagag ttttctgagg tttcatttta 6900 gttgagttcg ttaagtgaga aaaaggttga atgagtttgg atgaatgagt tgagttcata 6960 atggcatagt taaaatcaga taaatgttct atgtgtagaa gctttagaaa aatataaatt 7020 atttttgaga aaatcagtaa ctgagtcaat aatattatat aactacatta aagcaataaa 7080 aattttgaaa ttttatttca aaatttttat tgaaatttgt atgggcgaga a 7131 // ID BEL-58_CQ-LTR repbase; DNA; INV; 360 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-58_CQ_; KW BEL-58_CQ-I; BEL-58_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-360 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 270-270 (2011). XX DR [2] (Consensus) XX SQ Sequence 360 BP; 83 A; 87 C; 89 G; 101 T; 0 other; tgtttgaaat cgagaatagt gtaagttgcg ccctttgccc gatatttaat agcgccgatt 60 agaccgcttt ggtgatccgt acgggagaac ccgacgggtt tgatttgttt atatgttcta 120 ataaattgaa gtcttcagcc aacaaccaac tggctcggtt catcagaacc ccccgcgtat 180 tttgtactgc ccgatcgtcg cgaataaagt tctgtgcgcg ctacaagccg tgtttttgct 240 tcgaccgtgt tttgaggaag aggtcgttga cctcccgaaa gtgtgtgtgc tctggtcgag 300 ctgaagattc cctcaacccc caatacagtc cactgaagtt gaagggctcg atccggaaca 360 // ID BEL-187_AA-LTR repbase; DNA; INV; 639 BP. XX AC supercont1.91; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-187_AA_; KW BEL-187_AA-I; BEL-187_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-639 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.91; Positions 1033944 1033306. XX SQ Sequence 639 BP; 203 A; 120 C; 132 G; 184 T; 0 other; tgttgccgac actggcagca ctgggaattt gaattgcgaa tcgacccaca aaggcccccg 60 gttggactta acacgcagcg actgacaatg accgtgcgca aggctcgatg cgtgcaaaat 120 gtcaagataa ggggagagtg ataatttgcg agtcatgtaa acagcagcag ttgaacggaa 180 gaaatttgtg ctatattatt gaatttttga atatcctatc gggtcgattg attgtactta 240 attactagtt tgctctgtaa gttgaacata ttttcgctgc acaatttgaa tattgaatta 300 tcataggttt gtttcttaaa tagattagct tagtatcctc aggagttacc atccttttgt 360 ctgcggtcgc aagagaacga tattacaatc gtaggttcta acctattcat atagatatat 420 gcttaagatt aaaacatgat cccccccaat ttagaagact tctctataag gaaacacgta 480 ttaagattgt tggagtagaa ataactaagg agaccagttg tgagtagcaa tgaattcgat 540 aaagttacac catttaataa acgctatttc cagcttaaag aactcggcac agaacaacac 600 cagtgcagct tctgttaaga ggtccgaaat ctcccaaca 639 // ID Copia-12_CQ-I repbase; DNA; INV; 5115 BP. XX AC AAWU01014235; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_CQ_; KW Copia-12_CQ-LTR; Copia-12_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5115 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 339-339 (2011). XX DR GenBank; AAWU01014235; Positions 33573 28459. XX CC Positions [1427-1963] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 80..3421 FT /product="Copia-12_CQ-I_1p" FT /translation="MVLPNASGSGDGAQTNPPGSSSSLAGLPPIQRLIGRE FT NWSTWKFAVQMYLEHEELWEIVQPTPNADGTLPAVNAKLDRKTRAKIAMLV FT DPENFVHIQEAKTAREAWMKLEETFGDLGPERKIGLLCKLINTKLVNCSSM FT ADYVNRIVTTSMQLRGIGFNVDEEWVGALLLAGLPPEYKPMVMAISNSGLP FT INGDVIKTRLLQEVEEPPVTVALATKDKRYPSKQAAGQSKGPKCRSCHKFG FT HIAKDCQNKNRLGGKGQSALSTVLSTIGAAEEDEWYFDSGATSHLTKDRGL FT MENLQDGSGTVTAANNGAMKVVATGDAKLKASCCPGSPALDVRNVQLIPEL FT TTNLLSVSQIVEKGNTVIFDKRGCQVVNPDGDVFATGTRVNNLFRFDAEKQ FT SRAMACRESGQAELWHKRMGHLNVHGLKQLKNGLVTGIKFSDDGIGDCKVC FT ALGKQTRLPFGKAGSRAEELLELVHSDICGPMEETSMGGKRYYITFIDDKS FT RRIHFYLLKTKSAEEVLAVFREYHCLVERQTGRKLKTIRTDNGKEYINRMF FT QDYLKTNGIRHQTTVDYTPEQNGMAERVNRSIVERSRCMLFEANLPKRFWA FT EAASTAVYLLNRSPTKGHAKTPEEVWSGKKPDLSHLRIFGTSVMAHVPKQK FT RRKWDAKAEECVLVGYEEDTKGYRIFNKTTRSVTVSRDVRFISEGETSHAE FT VAATEQGAVYVKLDLLESLAQPGTDGTRTTPTGQQPELCQPGPSREPDIPS FT EQEDQFEDTSDDETSGESSYETVEEDPADVTVTLSPALPPRTSSNPPVSQV FT LRRSGRERALPGKLKDFDLGNKGVPRSNVSDNPGSGPAPADDRQSLIAAGR FT PRKRLNKASSCTDDPCTVGEALGGGQSAQWTAAMQEEYDSLMKNGTWDLVD FT PPPNLKIVGCKWIFKTKRDSEGAVVRHKARLVAQGFTQKYGYDYDEVFAPV FT VRQTTFRTLLAVAAKRNMVVKQYDVKTAFLNGDLEEEIYMRQPPAFVTDGN FT ERKVCRLKKSLYGLKQAARSWNNKLHTVLAKQGFIRCKADPACTRSEGEQV FT LLRTGVRRRLIVASDASRPDREAFSGAPSTHIFKSTRVPRVEAKWSRARVT FT RQVERF" FT CDS 3294..5114 FT /product="Copia-12_CQ-I_2p" FT /translation="MRADLTVKLSPALPPRTSSNPPVSHVLRRSGRERALP FT GKLKDFDLGNKGVPRSNVSDNPGSGPAPADDRQSLIAAGRPRKRLNKASSC FT TDDPCTVGEALGGGQSAQWTAAMQEEYDSLMKNGTWDLVDPPPNLKIVGCK FT WIFKTKRDSEGAVVRHKARLVAQGFTQKYGYDYDEVFAPVVRQTTFRTLLA FT VAAKRNMVVKQYDVKTAFLNGDLEEEIYMRQPPAFVTDGNERKVCRLKKSL FT YGLKQAARSWNNKLHTVLAKQGFIRCKADPCLYKKVKANKFCYVLVYVDDL FT IVASDASGLHQELVAALEKNFEISQLGDIHYYLGIEVERDDRGDFFINQAK FT YINQVVVSNGLSDAKVSKMPLDPGYLKQEAEGEPLPDNFEYHKVVGQLLYL FT SVNTRPDISAAISILSRKTAAPTQTDWTELKRIIRYLKGTSHYRLQVSKTD FT KQAGLIGYSDADWAESTADRKSNSGQIFLFNSGTISWSCRKQTCVALSTAE FT AEFIALAEATQEALWLKRLLEDLDEDGAVTVNEDNQSCLKMLEKEKFSNRT FT KHIATKYHFIRDVKEKDEVKYIYCPTEEMVADILTKPLPGVRMSKLAGACG FT LAPAPRGGV" XX SQ Sequence 5115 BP; 1337 A; 1291 C; 1579 G; 908 T; 0 other; agtggttatg ggcctagagt aggaagtttg gaagattcta gaagttgaaa aattttctca 60 agaagttttg ctgatcaaga tggttctacc gaatgcttcc ggaagcggcg atggtgcgca 120 aacgaatccc ccaggctcca gttcaagtct agccgggtta cccccgattc aacggctgat 180 tggacgcgaa aactggtcga cctggaagtt cgcggtccag atgtacctgg aacacgaaga 240 actgtgggaa atcgtgcagc caaccccgaa cgccgacgga acgttgccag cagtgaacgc 300 caagctcgac cggaaaacgc gagcgaagat cgcgatgttg gtggatccgg agaatttcgt 360 ccacatccag gaggcgaaaa cggcgcgcga ggcgtggatg aagctggaag agacgttcgg 420 agatctcgga ccggagcgga agattggcct gctgtgcaag ctcatcaaca cgaagcttgt 480 gaactgcagc tcgatggccg attacgtgaa ccggatcgtg acgacgtcga tgcagctgcg 540 agggatcgga ttcaacgtcg acgaggagtg ggttggagct ctacttctcg ccggattgcc 600 gccggagtac aagccgatgg tgatggcgat ctccaactcc ggattgccga tcaacgggga 660 cgtgatcaag acgagactgt tgcaagaggt cgaggaaccg ccagtcacgg tagccctagc 720 aacgaaggac aagcggtatc ccagcaaaca agcggcaggt cagtcgaaag gaccgaagtg 780 tcggtcttgc cacaagttcg ggcacatcgc caaggactgc cagaacaaga accgactagg 840 aggaaaagga cagagcgccc tcagcaccgt tctgtcaacg attggcgcag ctgaagaaga 900 cgagtggtat ttcgactctg gagcgacctc gcatctgacg aaggacagag gtctgatgga 960 gaatctgcaa gacggtagcg gaacggtgac ggcagccaac aatggtgcca tgaaggtggt 1020 ggccacaggg gacgcgaagc tgaaagcaag ctgttgccct ggtagcccag cccttgatgt 1080 ccgcaacgtg cagctgattc cggagttaac cacgaacctg ctttccgtaa gtcaaatagt 1140 tgagaaggga aacacggtga tcttcgacaa gcgtggctgc caagtagtga acccggatgg 1200 agatgtgttc gcaaccggaa ccagggtgaa caatctcttc cgattcgacg ctgagaagca 1260 atcccgagcg atggcgtgtc gagaatctgg acaagcggag ctgtggcaca aacggatggg 1320 tcatctgaac gtgcacggac tgaagcagct caagaatggt ttggtaaccg gcatcaagtt 1380 cagcgacgac ggcatcggcg attgtaaggt atgtgccctc gggaagcaga cgagactacc 1440 gttcgggaag gccgggtctc gagctgaaga gctgttggaa ttggtccatt ccgacatctg 1500 cggacccatg gaggagacgt cgatgggagg caagcgatat tacatcacct tcattgatga 1560 caagtcgcgg cgcatccact tctacctcct gaagaccaag tcagcagaag aagtcctggc 1620 cgtgttccgg gaataccact gtctggtcga gaggcaaact ggacgaaagc tgaagacaat 1680 ccggacggac aacggtaagg agtatattaa ccgaatgttc caggattatc tgaagacgaa 1740 tgggattcgg caccaaacga cggttgacta tacgcccgag caaaacggga tggccgagcg 1800 agtcaacaga tccatcgtcg agcgctccag gtgcatgctc tttgaagcga acctaccgaa 1860 gcggttctgg gcagaggcag catccacagc ggtgtacctg ctgaaccgat caccgacgaa 1920 gggacacgcg aagactccgg aggaagtctg gtctggcaag aagccggatt tgtctcacct 1980 gcgcatcttt ggaacatcgg tcatggctca tgtaccgaaa cagaagcggc gaaagtggga 2040 tgccaaggca gaggagtgcg ttctagtcgg ttacgaggag gacaccaaag gttaccggat 2100 cttcaacaag acgacccggt cagtcacggt cagccgggac gtgaggttca tcagcgaagg 2160 agaaacgagt cacgcggaag ttgcggccac cgagcaaggt gccgtgtacg tgaagctgga 2220 tttgctggaa tcgctggcac aacctggtac ggacggtact aggacgacgc caaccggaca 2280 gcagccggag ctgtgtcaac ccgggccaag tcgtgagccg gacattccgt cggagcaaga 2340 ggatcagttc gaggacacat cggatgacga aacgtctgga gaatcgagct acgagacagt 2400 cgaagaagat ccagcagacg tgaccgtgac gctttctccg gcgctccctc cacgcacatc 2460 ttcaaatcca cccgtgtcac aggtgttgag gcgaagtggt cgggagcgcg cgttaccagg 2520 caagttgaaa gattttgatt tgggcaacaa aggtgtaccc cgatccaatg tttcagacaa 2580 ccccggctct ggaccggcgc cggcagacga tcgacaaagc ctgattgcag ctggacgtcc 2640 aaggaagcgg ctgaacaaag cgagcagctg tactgacgat ccatgtaccg tgggcgaggc 2700 gctcggcggt gggcaatctg cgcagtggac ggctgcgatg caggaggagt acgactcact 2760 gatgaagaac ggcacgtggg acctggtgga cccacccccg aacctgaaga tcgtcggctg 2820 caaatggatc ttcaaaacga agcgtgactc ggaaggtgca gtggttcgtc acaaggcccg 2880 attggtcgca cagggcttca cgcagaagta cgggtacgac tacgacgaag tatttgcgcc 2940 cgtggtgcga cagacgactt tccggactct cctggccgta gctgcgaagc ggaacatggt 3000 ggtgaagcag tatgatgtca agaccgcgtt tctcaacggc gatcttgagg aggagatcta 3060 catgcgccaa ccaccagcgt ttgttacgga cggcaacgag cggaaagtgt gcagactcaa 3120 gaaaagtctt tacggcctca agcaagcagc caggtcgtgg aacaacaagc tgcacaccgt 3180 actcgccaaa caaggattca tcagatgcaa ggcggaccct gcctgtacaa gaagtgaagg 3240 cgaacaagtt ctgctacgta ctggtgtacg tcgacgattg atcgtggcca gcgatgcgag 3300 cagacctgac cgtgaagctt tctccggcgc tccctccacg cacatcttca aatccacccg 3360 tgtcccacgt gttgaggcga agtggtcgcg agcgcgcgtt accaggcaag ttgaaagatt 3420 ttgatttggg caacaaaggt gtaccccgat ccaatgtttc agacaacccc ggctctggac 3480 cggcgccggc agacgatcga caaagcctga ttgcagctgg acgtccaagg aagcggctga 3540 acaaagcgag cagctgtact gacgatccat gtaccgtggg cgaggcgctc ggcggtgggc 3600 aatctgcgca gtggacggct gcgatgcagg aggagtacga ctcactgatg aagaacggca 3660 cgtgggacct ggtggaccca cccccgaacc tgaagatcgt cggctgcaaa tggatcttca 3720 aaacgaagcg tgactcggaa ggtgcagtgg ttcgtcacaa ggcccgattg gtcgcacagg 3780 gcttcacgca gaagtacggg tacgactacg acgaagtatt tgcgcccgtg gtgcgacaga 3840 cgactttccg gactctcctg gccgtagctg cgaagcggaa catggtggtg aagcagtatg 3900 atgtcaagac cgcgtttctc aacggcgatc ttgaggagga gatctacatg cgccaaccac 3960 cagcgtttgt tacggacggc aacgagcgga aagtgtgcag actcaagaaa agtctttacg 4020 gcctcaagca agcagccagg tcgtggaaca acaagctgca caccgtactc gccaaacaag 4080 gattcatcag atgcaaggcg gacccctgcc tgtacaagaa ggtgaaggcg aacaagttct 4140 gctacgtact ggtgtacgtc gacgatttga tcgtggccag cgatgcgagt ggactccatc 4200 aagagctggt agctgcactg gagaagaatt tcgagataag tcaactgggc gacatccatt 4260 actatctcgg aattgaggtc gagcgcgatg accgaggtga tttcttcatc aaccaagcga 4320 agtacatcaa ccaggtcgtc gtgagcaacg gtctgagcga tgcgaaggtc tccaagatgc 4380 ccctggaccc tgggtatctg aagcaagaag cggaaggaga gccattgccg gacaactttg 4440 aataccataa ggtggtcgga cagctgctgt acctgtcggt gaacacgaga ccggacattt 4500 ccgcagcgat ttcaatactg agcaggaaga cagctgcacc gactcaaacg gactggaccg 4560 agctcaagcg gatcatccgg tacctgaagg ggaccagcca ctatcgtctg caggtgagca 4620 agaccgacaa acaagctgga ctgatcggat actcggatgc agactgggcg gagagcaccg 4680 cggacaggaa gtcgaacagc ggacagatat tcttgttcaa cagtgggacg atcagctgga 4740 gctgtcgcaa gcagacgtgc gtcgcgctgt caacggcgga agcagagttc atcgcgttgg 4800 ctgaagcgac acaagaagct ttgtggctga agcgattgct ggaggatctg gacgaggacg 4860 gcgctgtgac ggtcaacgag gacaaccaga gctgcctcaa gatgttggag aaggagaagt 4920 tcagcaaccg caccaagcac atcgcgacga agtaccactt catccgggac gtgaaggaga 4980 aggacgaggt gaagtacatc tactgcccaa cagaggagat ggtagcagac atcttgacga 5040 agccactacc aggagtcagg atgtccaagc tggcaggagc gtgcggatta gcaccagcac 5100 ctcgaggagg agtgt 5115 // ID Copia-96_AA-I repbase; DNA; INV; 4249 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-96_AA_; KW Copia-96_AA-LTR; Ty1_copia_Ele181; Copia-96_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4249 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1571-2104] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1571..4249 FT /product="Copia-96_AA-I_1p" FT /translation="MARRPFPKKVRRKSKAVLDLIHSDLCGPMKTATPGGR FT RYFLSLIDDYSRFTTLYILRKKSDTVDVIRDFVRLMKTQFGRPPKIIRSDQ FT GGEYRSAELVKFLKQEGIQQQFTTAYTPQQNGVAERKNRSLVEMARCMIID FT SGLHYRYWAEAVNTANHLQNMLPTKPVQQTPYEIWHGTKPDMSMIQIFGSE FT AYVHIPKEKRTKLQSKATKLTFVGYSQQHKAWRFIDLKTNEIVFSRDARFL FT PSSDREIPDAVDDEIFVVPPVKPSSNMEHEESDESDEDSEDEPDADSDDEE FT EFRGFEEESCDNLDLGFENPADRSLYEDANGSVVSRQDGELVSIQGESTYL FT QEEQFAVTPRRSGRSTKGIPPERYVADGKLARSQQCEPRSYQEAVSDPENG FT HWKAAMNDELKSLRECKAWDLTSLPPGSKTIGCRWIYKKKQDEQGKLVRYK FT ARLVAQGFTQRYGLDYDEVFAPVAKQVTLRTLLTIAGRDEMQVRHVDVKTA FT YLNGDLKETIYMRVPPGLRVEKGQVCRLRKSLYGLKQSARVWNQKLNDVMK FT RLGFRQADADPCLYVRKTSGGTVYILVYVDDMLIATSRDEDYEGVVKALAS FT EFQITTLGEVKHFLGIRVTRKNNAYCLDQKAYIDKLVAQFGLADAKGSRIP FT MDVGYLQQKEELESLPNNEKFQSLVGGLLYLSVNTRPDIAISTSILGRQVS FT KPTNADWTEAKRVLRYLKSTSDLKLELAVNRQELRGYADADWAGNVKDRKS FT NSGYLFQLGGGPISWCARKQSCVALSSTEAEYISLAESCRELLWLKKLLKD FT FGEPVQEPVQIFEDNQSCIKMLEQNAGLKRSKHVDTKYHFVKDLAENDNVN FT VTYCPSADMLADIFTKPLNRVKLEDIRERIGLRSLRDEEE" XX SQ Sequence 4249 BP; 1165 A; 851 C; 1261 G; 969 T; 3 other; ataggttatg ggcctgtgaa gtagccattt tgtttttgtg caaagtgaac acgttcagac 60 aagtcgtggc gcgtttgttg gaatttttcc ggaagtgtta aattcaacaa aaatggagaa 120 aattggtatc gcgaagttaa cgaacgataa ttacgattcg tggaaattgg aagtcgagtt 180 tttgttagtg agagaaggac tttggaagta tgtctcgccg ggagtgaagc cggaagtggc 240 tgcttcgggt tcgaacgtcg cggaactggc tgcctgggac gagggagacc agagagcgcg 300 cgcgacgatc ggtttattgc tgacaaaaag tcaacacggg catatccgga atacgaattc 360 cgcgaaagca gtgtgggaca atttggaaaa gcagcaccag aagaagacgc ttacgaccaa 420 agtgcatttg ctcaagcgaa tctgcgacat ggagtaccac gacggggaca acatcgaaga 480 acacctgatg gagtttgagg atttgttcga gaagctggct aatgctggaa caaaactaga 540 cgaagatctc caggtggttt tagtgtttcg aagtttgcct ggttcgttcg atgccttgac 600 gacgacgcta gagaaccgag ccgatgatga gctaacgatg catcttgtga aagggaagat 660 catwaacgaa gtgcacaagc gtcgtgatcc gkcggtagtg gattcggttg ttctcaaagc 720 ggaagacaag aagaaaaact cagtgtgttt tttctgcgag aagcccggtc ataagaaagc 780 aaactgcagc ttgtggctga ggcagaagaa agaagaaggt gacagtagct cagcagtgag 840 caggcacagc gaagtgagaa aagtgtcgaa aaagctcgcg aaggcgaagc ttgcgacgtc 900 gcgatcggaa gaatttactt ttcttgccag agagtgcgtg acgaatggtt gtgtcgtcaa 960 ttccgatgtg aacgtagaac cgagaaatag tgcgtttgtt gccagtgagt gcgcggcgaa 1020 agactggatc gtcgattccg gtgcgagttc gcatatgtgc gcaaatcgga agtttttcgt 1080 gtcgttcaat gaaccgcgga tggacactcc gaaatcggtt acggttgccg acggtaaaca 1140 tgcagcggtg aaaggcgtcg gattgtgcca gattttttgc atcggaggtg gtggcgcaga 1200 aacgcgaatt ttgttgagtg aagtgctttt cgttcctgat ctcgacatga atctcgtgtc 1260 cgttggcagg ctcgttcaga agggagcgag cgtaatcttc gacaagaacg gatgtagaat 1320 tgcgaacggt gataagattg ctgctattgc gccgtcaagc aagggtctgt tccatcttca 1380 aatggttgaa cgtgctagtg cagtttcggg tcagtgtcat acgaagaact gcatacacga 1440 gtggcacagg aagctgggac atcgtgatac tcaagctatc ctggatctgg agcggaaagc 1500 tttggcaaca ggcatcaagg tacgtmactg ctctgttcgt tattcttgtg aaccctgttt 1560 ggagggaaag atggcaaggc gaccatttcc taagaaagtg aggcgtaagt cgaaagcggt 1620 gttggatttg atccacagtg acttgtgtgg gccgatgaaa acggccacac ctggaggtcg 1680 tcggtatttt ctttcgctga tcgacgacta cagccgcttt accactctat acattcttag 1740 gaagaagtcg gataccgtgg acgttatccg agatttcgta cgtttgatga aaacgcagtt 1800 tggtagaccc ccgaagatca tcagatcgga tcaaggagga gagtaccgca gcgcagagtt 1860 ggtcaagttt ctcaagcagg aaggaattca gcagcagttt acgacagcat acacgccaca 1920 gcagaacgga gttgcggagc gcaaaaatcg ttcgctggta gagatggcac ggtgtatgat 1980 catcgattct gggttgcatt atcgttactg ggccgaggcg gttaacacgg ccaaccatct 2040 gcagaatatg ttgccaacga aaccggtgca gcagacgccc tacgagattt ggcacggtac 2100 gaagccggac atgagcatga tccagatttt tggttcggaa gcttatgtgc acatcccgaa 2160 ggaaaagcgg acgaagctgc aatcgaaggc gacaaaattg acgtttgttg ggtactccca 2220 gcagcataaa gcgtggcgtt tcatcgactt gaagacaaac gaaatcgttt tcagtcgtga 2280 tgctcgtttc ttgccgtcaa gtgatcgaga gattccggat gcagtcgatg acgaaatttt 2340 cgtggtgcct ccggtgaaac cgtccagcaa catggagcat gaagagtcgg acgagtccga 2400 tgaagattcc gaagatgagc cggacgcaga tagtgacgat gaggaagaat tccgtgggtt 2460 tgaagaagaa tcttgcgaca atttggatct tggttttgaa aatccagcag atcggagttt 2520 atatgaagat gccaatggca gtgttgtctc tcgacaggac ggcgaactag tttcgatcca 2580 gggggagtcg acttatttgc aggaggagca attcgccgtt acacctcgtc ggtctggaag 2640 gtcaacgaaa ggtattccac ctgaacgtta cgtggcggat ggcaaactgg ctcggagtca 2700 gcagtgtgaa ccccgaagct accaagaggc agtgagtgat cctgagaatg gccattggaa 2760 ggccgccatg aacgacgagc tgaagtcatt gcgggagtgc aaggcctggg acttgacgtc 2820 gttgccgcca ggaagcaaga ccatcggatg ccgctggatc tataagaaga agcaggatga 2880 gcaaggtaaa ctagtacgat ataaagcgag attagtagcg caaggtttca cgcaacgata 2940 tggcctggac tacgacgagg tattcgcccc agtcgcgaag caagtgacgc ttcgtacact 3000 gctgaccatc gctggacgag atgagatgca ggtcaggcac gttgacgtga aaactgcata 3060 tttgaatggc gatctcaagg agaccattta tatgcgggta ccacccggac tacgagttga 3120 aaaaggacag gtgtgtcgtt tgcgtaagag cctatatggt ttgaagcaat cagccagggt 3180 ttggaaccaa aagctgaatg atgttatgaa gcgactggga ttccggcagg cggatgctga 3240 cccgtgctta tatgtgcgga agacgagcgg cggcacagtg tatatcctgg tgtatgtgga 3300 cgatatgcta atcgccacat cacgtgatga agactacgaa ggagttgtga aggctttggc 3360 tagcgagttc cagattacaa cacttggtga agtgaagcat tttttgggaa taagagtgac 3420 gcggaagaac aacgcctact gcttggacca aaaggcttat atcgacaagt tggttgcaca 3480 gttcggactt gcggatgcta agggatcacg gattcctatg gacgtcggat acctacagca 3540 aaaggaggag ctggagagtc taccgaacaa tgagaagttc cagagcctcg tcggtggttt 3600 gctgtatcta tcagtaaata cccggccaga catcgctatt agtacatcca ttcttggaag 3660 gcaggtaagc aagccaacga atgcggattg gacggaagca aaacgggttc tgcgctacct 3720 gaagtctacg agtgatttga agttggaatt ggctgtaaac agacaagaac ttcgaggata 3780 cgcggacgca gactgggccg gaaatgtgaa ggaccgaaaa tcgaattcgg gctacctgtt 3840 tcaactgggt ggtggcccga tttcatggtg tgccagaaag caatcgtgtg tggcactttc 3900 atcgacggaa gcggagtaca tttcgttagc tgaaagttgt cgggagctgc tttggctgaa 3960 aaaactattg aaggacttcg gggagcctgt acaggaacca gtacagatct tcgaagacaa 4020 ccaaagttgc atcaagatgt tggagcagaa tgcaggactg aagcgttcga agcacgtgga 4080 caccaaatac cacttcgtga aggatttggc tgaaaacgac aacgtaaatg taacttattg 4140 cccatcggcg gatatgttgg cggacatttt tacgaaaccg ctgaacaggg taaaattgga 4200 ggacatacgc gagaggattg gattgcgatc tctgcgcgat gaggaggag 4249 // ID LIN13_SM repbase; DNA; INV; 6135 BP. XX AC . XX DT 11-AUG-2009 (Rel. 14.08, Created) DT 11-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Non-LTR retrotransposon; consensus. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; LIN13_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-6135 RA Jurka J.; RT "Non-LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1907-1907 (2009). XX DR [1] (Consensus) XX CC The 5' and 3' termini are approximate. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 52..2670 FT /product="LIN13_SM_1p" FT /translation="MKSKEGNPDLIQEMNRLFHSLIDQIMDGNIVKSNKKE FT NRSDSISKPNINSIKDQKVKKTNFKNNFKIEDHLRNDKNFNILKENYYQHQ FT KFYKNLENCDFNGLIFKIFLLKEKINKLKFKITDFKNLNIKRNETTEKEKI FT MNEDEIKLKKVKSYKDVLVNNKKDFKKNEDEKIIKNNGKNLKITIRNENAI FT INEKTPKNLNLKGIRNWTDDFFQNLTFYFKKNMDDPSTKILNMDILKYIEI FT KFTLMNTYDNTPSRLITEIYKKIPENDIKKLIDSYFNTLNSEIFLKFKNRL FT KNLKYEAFIKDFLFLINXKFKSLLNVFEYLQLTILRNINAKKIFRAHQARE FT NSERSSSTFGRLADEIIQNGDANITERSSSTSGRLTKNLIQLTDSKVKLTK FT SSSLKSLNKNNRYNLLVDLDDNEGIMDXAVSDSTKKXDKSVTEKVVNHRRK FT LTINKMETIYISGELENTISYSPLASDVQLSNVSEPKKQPDNSDNADLSQF FT PAPPFDRNLDEILNGLTSADKETCENNLKNVEETNIFTDSEHEEFEPTSIP FT GSDRNSDMSLFSQTLPQNNEPSENKEIMVNXTCEITPPKNYFDLMCKQSDN FT TTNRKLLNIEKVITAVMPNIYNENGPGQAIVNAKPEERKHNILYVLDNKGK FT FKCNAKEKKPQCGEIPHYDYEGLVEHASIIHNATFNDQSYIECKPCDPKKN FT KDNATNIIKXAEIFTHIEAHNHGTSCAQSDTMKMYLHLTKENVFYCKYRGK FT SKKNLCKKYFNLDSSMEDILEHMKTHSGYIFNENSKIACYCGIWEHFPKLX FT NHIKNQHLNEFINTVSITEDNLTISTXVSPPVLAGILASNETQSNLEEDQI FT KPRNLPENLAFNRDAEKELS" FT CDS 3133..5970 FT /product="LIN13_SM_2p" FT /translation="MKEAMRPIIRKYNCAKFPESDIRNYRVLIEDLIYQVN FT LDTITCEEILNEIERINGRLNPKKYFKENRPKSDLIHLQKKKAAELLCVKR FT LKFQINQKIEIEKIWESSDVDHRPPMAKFLKTFANQDCPISSTSLITLPFY FT KDEDSDVFTDCKTMSHIMMNLDSSAPGMDLITGGDWKKISPKHELITAICN FT CVLRNKICPQRWKLYRTVMILKPGKLPESFRANSWRPLAIMDTAYRIFTTL FT LNNRLLHWIRNGSLISQNQKAIGVPDGCAEHNATIHIAIDRAKRCKSELHI FT VWLDITDAFGSLPHDLIWYTLAGMGLKSETLDLIKELYMDVKTMFDCQGAL FT SEPVNITKGVKQGCPLSMTLFCLSIDYILKSILNKYPFFLHDLNISVLAYA FT DDLVLLSDSYSGIKKSLESTVXLAAFANLKFKPSKSGYLSFNDINVDINRL FT YLYNEEIPMISENNKYRYLGVDFSNKRNQDIDGRLDSALALTKSLFGSYLH FT PSQKLSAYKIFIHSKLIFSLRNCVIGHRILDCDRNRVTQGREKQLGFDQKI FT KGLLKTMIGDKFQAMNNNFFYTHCKLGGLGVTAAIDEYLIQSITGITRLFN FT SSNISFRKMLIXELAQSRGNGNFEAGLSWMNCEFNKIFPKSSFFVKFQKSA FT QSLKRKFCIYVYLKFIDDSFTLEITYKKRISYINHQNLNTLSKELHNFVGL FT HYAEQWCKMRVQGHIAASIGDSITAKYLIASDTLNDAQYYFLVRARNNLLN FT LNYGAYRLKYNLNTKCRLCNLDEETQAHVFNHCRAKPNARRVKHENVLLGI FT VSFLEKIGFEIDVEKSPKYVSIPTKLKPDMVIRSKRNKDIHVLDLKVPYDS FT VEGFEKAREDNYVKYKDLAMQIGKAFNQTATISAVVIGCLGTWDKKNNAAL FT SKIGLTKTEIIALAKIACADAVIACYHIYREHISFTKSIPPV" XX SQ Sequence 6135 BP; 2305 A; 1007 C; 1069 G; 1741 T; 13 other; cgaaaaaaat tgacattcaa tagcaccggg cagaaattgg aaactgaaga tatgaaatca 60 aaggaaggca atcccgacct gatacaggaa atgaacagac tatttcattc actcattgac 120 caaattatgg atggaaatat tgttaaaagt aataaaaagg aaaatagatc agattcgatt 180 tcaaagccaa atatcaattc aataaaagat caaaaagtta aaaagaccaa ttttaaaaat 240 aattttaaaa ttgaagatca tttacgtaat gacaaaaatt ttaatatttt aaaagaaaat 300 tactatcaac atcaaaagtt ttataaaaat ttggaaaatt gtgattttaa tgggctaatt 360 tttaaaattt ttttacttaa agaaaaaatt aataagctca aatttaaaat tacagatttt 420 aaaaatttaa atattaaaag aaatgaaact acagaaaaag aaaaaattat gaatgaagat 480 gaaataaagt tgaaaaaagt taaaagttat aaagatgttc tagtgaataa taagaaagat 540 tttaaaaaaa atgaagatga aaaaattatt aaaaataatg gaaaaaattt aaaaattaca 600 attagaaacg aaaatgcaat aataaatgaa aaaactccaa agaatttaaa tttaaaagga 660 ataagaaatt ggacagatga tttctttcag aacttaacat tctacttcaa aaaaaatatg 720 gacgatcctt ctactaaaat attgaatatg gacatcctaa aatacatcga gattaaattc 780 actctaatga acacttatga caatactcca tcaagactga tcacggagat ttataaaaaa 840 atacctgaaa atgatattaa aaaactaatt gatagttatt ttaacacttt gaatagtgaa 900 atatttttga aatttaaaaa tagattaaaa aatttaaaat atgaagcctt tataaaggat 960 tttttatttt taatcaattt naaatttaaa tcattgttaa atgtgtttga atatctgcaa 1020 ttaacaattc tcagaaatat caatgcgaaa aaaatttttc gcgcgcatca ggcgcgcgaa 1080 aatagtgaac gcagctcgtc gaccttcggt cgactcgcgg atgaaataat ccaaaatggg 1140 gatgcgaata ttactgaacg cagctcgtcg acctccggtc gactcacaaa aaatctnatt 1200 caactcacgg atagtaaagt taaactaaca aaaagtagct cattaaaatc actcaacaaa 1260 aataaccgat acaatttatt ggttgatctg gacgataatg aagggataat ggatgangcg 1320 gtttctgata gcactaagaa agntgataaa tcagtaacag aaaaagtagt taatcataga 1380 aggaagctaa caatcaataa aatggaaacc atttatatat ctggagaact tgaaaataca 1440 atatcttatt cacctctagc atcggatgta caactatcaa atgtttcgga acctaaaaaa 1500 caacccgata attctgacaa tgcagacctg tctcaatttc ctgcacctcc tttcgataga 1560 aatcttgatg agattttgaa tggattgaca tctgctgata aagaaacctg tgaaaataat 1620 ttaaaaaacg ttgaagaaac aaatattttt actgattccg aacatgagga atttgagcca 1680 acgagcattc ctggatcgga tagaaactcg gatatgagcc tattctctca aacacttcct 1740 caaaacaatg aacctagcga aaataaagaa attatggtga attntacatg tgagatcact 1800 ccaccaaaaa attatttcga tctcatgtgc aaacaatccg ataacacaac caacagaaaa 1860 ctgctgaata ttgaaaaagt tataactgct gtaatgccca atatctacaa tgaaaatggt 1920 ccaggacagg caatcgtcaa tgcaaaacct gaagaaagaa agcataacat tctatatgta 1980 cttgataata aaggaaagtt taaatgcaat gctaaggaaa aaaaaccaca atgtggggag 2040 attccgcatt atgactatga aggactagtt gaacatgcta gtatcataca taacgctact 2100 ttcaatgatc aaagttacat agagtgtaaa ccctgtgatc caaagaaaaa caaggataat 2160 gctacaaata taataaagtn cgcagaaata tttactcata ttgaggccca taatcatgga 2220 acatcatgtg cccaaagtga tacaatgaaa atgtaccttc atttaactaa ggaaaatgtt 2280 ttctattgca aataccgagg taagagtaag aaaaacctnt gcaaaaaata ttttaacctt 2340 gactcatcaa tggaagatat tttggagcat atgaaaacac attctgggta tatttttaat 2400 gagaatagca aaatagcatg ctactgtggt atctgggaac attttcccaa acttatnaac 2460 cacatcaaaa atcagcatct gaatgagttt atcaacactg tatcgatcac tgaagataat 2520 ctcactattt ctacagnggt gtcacctcca gtgcttgctg gaatacttgc atctaatgaa 2580 acacaaagta atttggagga agatcaaatt aaaccaagaa acttgcctga gaatcttgcc 2640 ttcaacaggg atgctgaaaa agaattgagt tgatggtcgc agcacttggt caaagcatat 2700 gtatattcat ttgctataaa aacatcatca atattcgtca atccttatac ttgcaatgct 2760 ttgatccagt gcaactacaa aactttcttt gaaacattcc ctttcaaaga ttttgcaaaa 2820 tggaatgaga ttatattgcc attccacaat agctcatcat cgtggtcttt tttctttcta 2880 aacaagagaa aaagaattgc attgattatt gacccaactt ctgatgatag ccatacttta 2940 cattttgaat tggcaactga tatcctcaag acaatactta atgtgcagaa tatatttggg 3000 gatcttaaat tccctcttac tgaaattgaa taccctgtat gccatgaggt aagtctatcc 3060 gcattttatg tatgtcattt cataaaatgc ctaatatcgg acacatcaat taccatacca 3120 gatataaata ttatgaaaga ggcaatgaga ccgattatca gaaaatataa ttgtgcaaaa 3180 tttcctgaaa gtgatatcag gaattatcga gtactcatag aagacctcat ataccaagta 3240 aaccttgaca ccattacttg tgaagaaata ctgaatgaaa ttgaaaggat aaatggaagg 3300 ttaaatccta aaaagtattt caaggagaat agaccaaaaa gcgatttaat acatctacaa 3360 aagaaaaaag ctgcggaact cctatgtgtt aaaaggttaa agtttcaaat taatcagaaa 3420 atagaaattg aaaaaatatg ggaaagcagc gatgtagatc acagaccacc tatggccaag 3480 tttttaaaaa catttgcaaa tcaggattgc ccaatatcaa gtacgtcctt gataaccctg 3540 cctttctaca aggatgagga ttctgatgtt tttactgatt gcaaaaccat gtcccatatc 3600 atgatgaact tggatagctc tgcccctgga atggatctca ttacaggtgg tgattggaag 3660 aaaattagtc ctaaacatga actaataacg gcaatatgca attgcgtatt gcgtaataag 3720 atatgcccac aaagatggaa gctatataga acagtcatga ttttaaagcc nggaaaatta 3780 cccgaaagct ttagagctaa ctcttggaga ccactggcga tcatggacac agcctataga 3840 atttttacaa cattgctgaa taaccgccta ctacattgga tcaggaatgg cagccttatt 3900 agccagaacc aaaaggcaat aggtgttccg gatggatgtg ctgaacataa tgcaactata 3960 catatcgcaa tagacagggc taaacgatgt aaatcggagc ttcatattgt ctggcttgat 4020 atcactgatg catttggttc tctaccacat gacctgatct ggtatacact ggctggtatg 4080 ggtttgaaga gtgaaacatt ggacttgata aaagagctat atatggatgt gaagactatg 4140 tttgactgtc aaggagcctt atctgaacct gtaaatataa ctaagggagt caaacagggt 4200 tgtccattat caatgacact cttctgcctg tctattgact acatcctgaa gtccatttta 4260 aataaatatc ctttctttct acatgacttg aatatcagtg tcttggcata tgctgatgac 4320 ctggttctnc tttcggactc ttactcggga atcaagaaat cattagaaag cactgttgan 4380 ttggcagcct ttgcaaatct aaaattcaaa ccatctaaat ctggatattt gtcttttaat 4440 gatattaacg tggatataaa taggctatat ctatataatg aagagatacc aatgatatct 4500 gagaataaca aatatagata tcttggggta gatttctcca acaaacgaaa tcaagatata 4560 gatggacgac ttgattcggc actggcattg accaaatctc tttttggatc atatctgcat 4620 ccatcacaaa aattaagtgc gtacaagatc ttcattcatt cgaagcttat cttctctttg 4680 cgtaactgtg taataggtca tagaatcctt gactgtgatc ggaacagagt tacgcaggga 4740 cgtgaaaagc aattaggttt tgatcaaaag atcaagggtc tcctgaagac catgattgga 4800 gataagtttc aggcaatgaa taacaatttc ttctatacgc attgcaagct gggaggtctt 4860 ggtgtcactg ctgctatcga tgagtatctg atacagagca ttactggtat aacgagactt 4920 tttaactcat ccaacatcag ctttcgaaaa atgctaatan cagagctagc acaatctaga 4980 ggaaatggaa actttgaagc tggactcagt tggatgaact gcgaattcaa taagatattc 5040 cccaaatcct cattctttgt aaagtttcaa aagtcggcgc aatctcttaa gagaaagttt 5100 tgtatatatg tgtatttaaa gtttatagat gatagtttca cacttgagat aacatataag 5160 aagaggatct cttacattaa tcatcaaaat ctcaacactc tttcgaaaga acttcataat 5220 tttgtgggtc tccattatgc tgaacaatgg tgcaaaatga gagtacaggg acacattgct 5280 gcttctatcg gagatagtat aactgctaag tatctgatag caagtgatac acttaatgat 5340 gcgcagtact atttcttggt acgtgcgaga aataacctgc taaatctcaa ctacggagca 5400 tatcgcctta aatacaatct taacacaaaa tgcaggctat gcaatcttga tgaggagacc 5460 caggcacatg tgtttaacca ctgccgagcc aaaccgaatg ctagaagagt taagcatgag 5520 aatgtattac taggcatagt tagtttcctg gaaaaaattg gatttgagat agatgttgaa 5580 aaatctccga agtatgtctc aattccaaca aagctgaaac ctgacatggt gattaggtct 5640 aagagaaata aagacataca tgtccttgac ctaaaggtac cttatgactc tgttgaaggt 5700 ttcgaaaaag cacgggaaga taactatgtt aagtataaag atttggctat gcagattgga 5760 aaggcattta atcaaacagc cactatatct gctgtggtta ttggatgcct gggcacatgg 5820 gacaagaaga acaatgccgc tctctcaaag atcggattaa caaagactga aatcatagca 5880 ttggccaaga tagcatgtgc agatgctgtg attgcatgct atcatatata tcgggaacac 5940 atttccttca caaaaagtat ccctcccgtt tagtctcgta ggaaaatttt acgaggcaat 6000 gctggtgata gatcggcgtt gcagttttgt gtatgcagaa taaaaacaga agagtaatta 6060 gtgctgagca tcgctcgcat atttagccga aaggccgttt tttttgttaa aataaaattt 6120 tgaaaaaaaa aaaaa 6135 // ID DNA8-47_AP repbase; DNA; INV; 307 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-47_AP. XX NM DNA8-47_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-307 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1977-1977 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 307 BP; 103 A; 35 C; 59 G; 110 T; 0 other; cagtgtttct caacctgggg gtcgcgaccc ccccgggggt cgccaactta ttatttaggg 60 tcgcgtatga attaaaattg tatgtattat atgtatttgt aaatatttgg tatacgtttt 120 tttaattaaa aataattata atattatatg tattttatgt tgtgtacaat gtacagtaat 180 tttagtattg attattaata ataatctata ttctatgcga gtatttaaaa aaaatatatg 240 tagggggtcg ctaaaaaatt gaaattcaaa aagggggtcg ccacagtaaa aaggttgaga 300 aacactg 307 // ID BEL-131_AA-I repbase; DNA; INV; 5993 BP. XX AC supercont1.255; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-131_AA_; KW BEL-131_AA-LTR; BEL-131_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5993 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.255; Positions 802251 808243. XX CC 'GGGGA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 35..5992 FT /product="BEL-131_AA-I_1p" FT /translation="MPADDLTMTACGMCQTTSDVDEGMVGCDSCNRWFHYR FT CVGVNENVAEAKNWFCSESSCQEYAKKLTKRQEPKTGKQRKKATSGNESDK FT SSVTPLDQKLKALEKERKAKEREMEEERILKEKRMEMDRFLKEREMEMQEE FT LREKEMQQERELLEVALERKRNYLARVKAMRESYQVQMESIQQELIDGSNQ FT LKRPNELCPNGVQSIKPVQEQKTAVVSKSDHPKSSSELYSQTPVRSSAIKG FT KISRFAMVPAVGSKSKSFDEDAEEKNPDELEDKVQGSENEEDDEEESSDDE FT DEQDTEDDEQSEVEQEKAVKKDNQTRGKTTAPLAPHGRGPTKAQLSARNGI FT TRKLPVFSGKPEEWPLFIGSYNASNAACGFSDVENLVRLQECLKGPALESV FT RSQLLLPQSVTKVINKLRQLFGRPEQLLHHHLDKIRKLDCPKPDKLSTYIP FT FGNAVEQLCDHLEAAELTQHLVNPLLVQDLVDKLPAQDKRDWVRYKQKKKH FT VTLRTFTDFVSKIVAEACEANVNMEFKITQGPSGPLHRMPREKGAVFNHSL FT SSDMQSKPKDQMKLKPCKICQRTDHRLRFCEDFKCLSPSERIKLVEQWKLC FT KVCLNDHGKAQCKFQIRCKVDGCTARHHTLLHPAGRPLAINTHMSCGEAIL FT FRMIPVMLHCGEHIVETLAFLDEGASVTLVEKTLADKLGVQGTKNPLTITW FT TADIKRTEKDSRQMNLWISTKDTEGKILLRTVQTVPELMLPQQSLNAPSLM FT QKYKFLRNLPVESYYDKRPGILIGINNVHAIAPIESRLGSIGEPIAVRSKL FT GWTIYGPSRRTTCAQEEFVGCHNVISNQELHEVLKSHYALEESVAAVAKES FT KDDQRARKILHETTMRVGDRFETGLLWKTDNPEFPDSLPMALRRLKHLEHR FT LEKMPELYDNVRLQMIQYEQKGYAHRATAEELSEAESSGSWYLPINVVLNP FT KKPGKVRLVWDAAATVNGVSLNTQLLTGPDMLAALPSVINRFREHPVAFGG FT DIREMYHQIMIRKADKKVQMFLFRNSSRDKPIVYIMDVATFGSKCSPCQAQ FT YIKNRNALEFATQYPEAAEAIIDKHYVDDYFDSTDTVEEAVRRAKEVKLVH FT SKGGFEIRNWVSNSEEVLHNLGWKKTDKAVHFNQDKQTGMERVLGIIWDPK FT EDVFSFSTDHREDLKPYLRGEKTPTKRIVLSSVMGLFDPLGLLTTFTIHGR FT ILIQNLWRTGCDWDERIDSASELEWRRWIGCLPDVEQVRIPRAYFGGWKST FT EIETLQLHVFTDASLQAYGSAAYLRATVGDEVRSALVMARAKVAPLKRQSV FT PRLELMGAVLGARLLRSVMTNHTLPIQKCYLWTDSQTVLSWLRSDQHNYKQ FT FVAFRVGEIQELTSLADWRKIPSRFNVADLLTKWGNGPDLQAGGPWFSGPK FT FLQQNEEYWPAESLPEANIAEERRALVMVHGTLSAEPVIPVESFSRWTRLL FT RVTATVVRFISNCRRKMAKLPILGTEASDKHRQLIKAKLNVLKRPLLQAEL FT HSAECILWKQAQREGYAEEVRMLKKGEERELGQPVVHLQRSSALYKLTPFL FT DEKGVMRMGGRMQNSLSATYDEMYPIILPKEHEITNKLLQDFHERFGHGNR FT ETVFHEVRQKFKIPKLRSRIALVMKNCVWCKVHRCKPTVPLMAPLPVQRIT FT PGLRPFSAVGLDYLGPVDVSIGRRTEKRWICLFTCLAVRAIHLEIVESLTT FT QSCLMAIRRFICKRGAPDEFFSDNGTNFRGAYNELKKIVEKINNECAEGIM FT NAKIKWNFNPPATPHMGGIWERLVRSVKEALKVLTDGRKLTDEILRTALAE FT TEDIVNSRPLTYMPDYAEESALTPNHFLRGTVKESDTMLQYQADLGGALRD FT SYKRSQYLAEEIWKRWCKEYLPTINQRTKWYTEQRPVSIGDLVFVVDGTNR FT KSWIRGIVECVYVGSDGRIRQADVRTSKGVFRRAIANLAIMEVRDGKSGTS FT EQEVPELRAGD" XX SQ Sequence 5993 BP; 1875 A; 1213 C; 1551 G; 1354 T; 0 other; aactcaaaaa atttactcgg ataacctcaa aacgatgcct gctgacgact taaccatgac 60 tgcctgcgga atgtgccaaa cgacttcaga tgtagatgaa ggaatggtgg gatgtgattc 120 ctgtaaccgg tggttccatt accggtgcgt aggagtaaat gaaaatgtgg cagaggcgaa 180 gaactggttt tgctccgaat ccagctgcca agagtacgct aagaagttga caaaaagaca 240 ggaacctaaa actggaaagc aacgaaagaa ggctacgtcg ggtaatgagt ctgataagtc 300 cagtgtgacg ccgctggatc aaaaactaaa ggctcttgag aaggaacgaa aagcgaagga 360 aagagaaatg gaagaagagc gaatcctgaa agaaaagagg atggaaatgg atcgttttct 420 caaggaaagg gagatggaaa tgcaagagga gcttcgggaa aaggaaatgc agcaggaaag 480 ggagttattg gaggtcgcct tagaaaggaa gcggaactac ctagcaagag tgaaggctat 540 gcgagagtcc tatcaagttc aaatggagag tatccagcag gaacttattg atggtagcaa 600 tcaactcaag cgacctaacg agttgtgccc aaatggtgtt cagtccatca agccggtgca 660 ggagcaaaag actgcagtcg tctcaaaatc cgaccatcca aagtcttcca gtgaattata 720 ttcgcagaca ccggttcgaa gttctgcaat taaagggaaa atcagcaggt tcgccatggt 780 tcctgcggtc ggctcgaaaa gtaaatcatt cgacgaagac gccgaagaaa aaaaccccga 840 cgagttggaa gataaagttc agggtagcga aaatgaagaa gatgacgaag aagaatcgtc 900 tgacgatgaa gatgagcagg atacggaaga cgatgagcaa tccgaagtgg agcaggaaaa 960 ggctgtgaaa aaagacaacc agacgagagg gaagactacc gcgccattag ctccccacgg 1020 gcgggggcca acaaaagcac agctgtcggc acgcaacgga atcactagaa aacttcctgt 1080 tttctcggga aaacccgagg aatggccatt atttattgga agctataatg cctctaatgc 1140 agcctgcggt tttagtgatg tagagaactt ggttcgcctc caagagtgct tgaaaggccc 1200 tgcattagaa agcgtcagaa gccaactgct actgccacaa tcagttacca aggtgattaa 1260 caagctccgt caactttttg ggcgacccga gcaattgcta caccaccatc tagacaagat 1320 caggaagctg gattgtccca agccagataa actctctacc tacataccat ttggtaatgc 1380 cgttgaacag ctgtgtgacc atttggaggc cgcggagttg acacagcatc tagtaaatcc 1440 actcttagtt caagatttgg tggataaact tccagcgcag gataaacgag attgggtccg 1500 ctacaagcag aagaagaagc acgtgacgtt gagaacattc accgattttg tatcgaagat 1560 agtagcagaa gcatgcgaag caaatgttaa tatggaattc aaaatcaccc agggtccgtc 1620 cggaccactg catcgtatgc cacgagaaaa gggtgcagtg tttaatcaca gcttgtcttc 1680 agatatgcaa tcgaagccaa aagatcaaat gaaactgaaa ccttgcaaaa tttgtcaacg 1740 cacagatcat cgtcttcgct tctgtgaaga ttttaagtgc ttgtctccat cagaacgcat 1800 aaagctggtt gaacagtgga agctttgtaa agtctgcttg aatgatcatg gcaaagccca 1860 atgcaagttc cagatccgtt gcaaagtaga cggatgcact gcccgtcatc acactttgct 1920 gcaccctgct ggtagaccgc tagcgattaa cacacacatg tcatgtggcg aggcgatctt 1980 atttcggatg attccagtga tgctgcactg cggtgaacac atagtggaaa ctcttgcgtt 2040 cctggacgaa ggagcctccg ttacactcgt tgaaaaaaca ctagctgaca agctcggtgt 2100 acaaggaact aagaatccac ttacgatcac ctggactgcc gacatcaagc gcacggagaa 2160 agactctcgc caaatgaatt tgtggatctc cacaaaagat acggagggaa agattttgct 2220 acgaaccgtg caaacagttc cagagttgat gctgccacag caatccttga atgcaccgtc 2280 attgatgcaa aagtacaagt tcctgagaaa cttaccagtc gagtcatatt acgacaagcg 2340 tccagggatt ttgatcggca taaacaacgt acatgcgatt gcgccgatag aatccaggtt 2400 ggggtcaatt ggagaaccta ttgcagtgcg ctccaagtta ggatggacca tatatggacc 2460 aagcaggagg acaacatgtg cgcaagaaga gtttgttggt tgccataatg ttattagcaa 2520 tcaagaacta cacgaggtcc tgaaatcaca ctatgcactt gaggaatccg ttgcagcagt 2580 agcgaaggaa tccaaagacg accaaagagc cagaaagatc ctccatgaaa ctacaatgcg 2640 agtcggcgat agatttgaga ccgggttgct gtggaaaacg gacaacccag aatttccaga 2700 cagtctaccg atggcactgc gacggttgaa acatttggaa catcggttgg agaaaatgcc 2760 cgaattatat gataacgttc gattgcaaat gattcagtat gagcaaaagg gatatgcaca 2820 tcgtgctaca gcggaagaac tatcggaggc tgaatcgtct ggttcttggt acctgccaat 2880 caatgtggtc ctcaatccga agaaacccgg aaaggtacgc ctggtctggg acgcagcagc 2940 cacggttaac ggagtttcac taaacaccca gttgttaacc ggtcctgata tgctcgcggc 3000 cttaccttcc gtgataaatc gttttcggga acatccagtg gcatttggag gggacattcg 3060 tgagatgtac caccaaatta tgatccgcaa agctgacaag aaggtacaaa tgtttctatt 3120 ccgtaattca agtcgggata agccaatagt ctacattatg gatgtagcca cttttggttc 3180 caagtgctcc ccatgtcaag cacaatacat aaagaataga aatgctctcg aatttgcaac 3240 tcaatatccg gaggcggcag aagcaattat cgacaagcat tatgtggacg actatttcga 3300 cagcactgac actgtcgaag aagctgtacg gcgagcgaaa gaagtaaagt tggtgcattc 3360 gaagggaggt tttgagattc gaaattgggt ttctaactca gaagaagtcc ttcataactt 3420 aggctggaag aagacagaca aagctgtcca cttcaaccaa gataagcaga ctggaatgga 3480 acgagtgctt ggaatcatat gggatcccaa ggaagatgtc ttttcatttt cgacagatca 3540 tcgagaagat ttaaaaccgt atttgcgggg agaaaaaaca ccaaccaaaa gaattgtgct 3600 aagcagtgtc atgggattgt ttgacccact gggtctattg acaacgttta ctattcatgg 3660 tcgcatactg atccagaatt tatggcgaac gggatgtgac tgggatgagc ggattgattc 3720 agcaagcgag ttggaatgga gaaggtggat cggatgttta cccgacgtag aacaagtccg 3780 aattcctcgg gcatactttg gaggatggaa atcaactgaa attgaaacac tacagctaca 3840 tgtgtttact gatgctagcc tgcaagccta tggaagtgca gcgtatcttc gagctacagt 3900 tggagacgaa gttcgtagtg cactagtaat ggcgagagcg aaagtggccc cattaaagcg 3960 gcaatcagtt cctcgtttgg agcttatggg agcagtactc ggagcgagac tgttacggtc 4020 agttatgacg aatcacacgc tgccaattca aaaatgctat ttgtggacgg actcccagac 4080 tgtactcagt tggctacgct cggatcagca caactataaa caattcgtgg cgttccgagt 4140 aggagagata caagagctga caagtttggc tgactggcga aaaattccat caaggttcaa 4200 cgttgcggac ctgttaacca agtggggaaa tggaccagat ctccaggccg gtggaccatg 4260 gttcagcgga ccgaagttcc ttcaacaaaa cgaagaatac tggccagccg aatcactgcc 4320 tgaagcaaat atcgcggaag agaggagagc gttagtaatg gttcatggaa cactgtctgc 4380 agaaccagta attcctgtag aatcgttttc aagatggacc agacttctca gagtaacggc 4440 aaccgtcgtt cgcttcataa gcaattgtcg gagaaagatg gctaaactac caatactagg 4500 aacggaggca tcggataaac atcgtcaact aatcaaagca aagctaaatg ttttgaagcg 4560 gcctttgtta caagcagagc tgcacagcgc agaatgtatt ttgtggaagc aagcacaacg 4620 tgaaggttat gctgaggaag taagaatgct gaaaaaaggc gaggaacgtg aattgggaca 4680 accagttgta catctgcagc gatcgagtgc gttatacaaa ctcacaccgt ttttggatga 4740 gaaaggagtg atgcgcatgg gaggtaggat gcagaattcg ttatcggcga cgtatgatga 4800 aatgtaccca ataatattac caaaagagca cgaaattaca aataagttgc tgcaggattt 4860 ccatgaaagg tttggacatg gtaacagaga aacggtattt cacgaagttc gccaaaaatt 4920 caaaatacca aaattgagat caaggattgc tctagttatg aagaattgtg tttggtgtaa 4980 agtacatcgc tgtaaaccga cggttccact aatggctcca ttgcctgttc aaaggattac 5040 accaggtttg agaccgttca gcgcagttgg tctggactac cttgggccag tggacgtatc 5100 aattggtaga cgaacggaga aaaggtggat ctgtttgttc acttgtcttg cggtgcgagc 5160 aatccatctg gaaatcgtgg aaagtctaac aacacagtcg tgtttaatgg cgattcgcag 5220 atttatctgc aaacgtggtg ctccagatga gttcttttcc gataatggaa cgaattttag 5280 aggggcttac aacgagttga agaaaattgt agaaaaaatc aacaatgaat gtgctgaagg 5340 aatcatgaac gcaaaaatca aatggaactt taatccaccg gctacgcccc acatgggcgg 5400 tatatgggaa aggctagtgc ggtctgtcaa agaagcccta aaagtactga ctgacggaag 5460 aaaattgacg gatgagattt tgaggacagc gttggcggaa acagaagata ttgtcaattc 5520 gaggccttta acgtacatgc cggattatgc ggaagaaagt gcgcttactc ccaaccactt 5580 cttacgtggc acggtcaagg agtcggacac gatgttgcag taccaggcag acttgggcgg 5640 agcattgagg gattcgtaca aacgttccca atatctagcg gaagaaattt ggaagcgatg 5700 gtgtaaggaa tatcttccaa ctataaatca acggacaaaa tggtacactg aacagaggcc 5760 ggtgagtatt ggtgatctgg ttttcgttgt agatggaacc aaccggaagt cgtggattcg 5820 aggaattgta gaatgcgtgt atgttgggtc tgatggtaga atacgacaag cagatgtccg 5880 cacttctaaa ggtgttttta ggcgagcgat tgcaaatctg gctattatgg aagttcgtga 5940 cggtaaatcc ggaacatcgg aacaggaagt accggagtta cgggcagggg atg 5993 // ID BEL-100_AA-LTR repbase; DNA; INV; 474 BP. XX AC supercont1.40; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-100_AA_; KW BEL-100_AA-I; BEL-100_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-474 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.40; Positions 1559514 1559041. XX SQ Sequence 474 BP; 141 A; 97 C; 102 G; 134 T; 0 other; tgttcgtgta accacgaaaa gagaaataaa attaattcgt gcgccatcta tctgtgagca 60 aatgcatcat aaacggttcg tcgatttcga taatgaccca aaaggagaaa gaaggcgatg 120 tagcatggct gcacaccttc ttccggctaa ccttgctata taaggaactt cgtgacatgt 180 gtcatcctct tttttttggt tcaacatcca aacgtcatcg agtggacatc aacaacggaa 240 tttatatacg aagtgaaagt ttagtcattt gaatttattg aaatagagta aattaagaaa 300 aataaagttc gtttttgtta agttaacgcg tgtgagtcag tgaattactt accgtggccg 360 cctgccgtct gagaaacctg gaagaaggtc agtgttactt acctgggtct ggaccgtccg 420 aaacctactc cggcataagc tgaactccgc tgttactcgc tgtgccattc gaca 474 // ID Tx1-4_AAe repbase; DNA; INV; 5066 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Aedes aegypti. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-4_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5066 RA Kojima K.K. and Jurka J.; RT "Tx1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1460-1460 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 13 sequences with >98% CC identity. It is positioned at the deepest branch of the Tx1 CC clade, and does not show sequence specificity. XX FH Key Location/Qualifiers FT CDS 158..1342 FT /product="Tx1-4_AAe_1p" FT /translation="MADTELRANSLRIRFERGVPEPADSEIFKFMKSKMGL FT NSEKLLSMYKDKSETSIIVKFKKEEDMRNTLSDLPGTMVFEYSKYESTEVK FT LSSANAIVRYIRLFNLPPEVDDREISVVMQRYGKVVRLVREKYGEETGFPI FT WTSVRGVYVELKEGIEVPATVFIRNLRARVSYEGIINKCYLCGSKDHFKAE FT CPEKKSVNERLSNQQSSSYSGILRGGGKWLKQQVKTSSNEKDGMIKLGQGL FT PKRSNTTQAQLDSCQDRDEGAAQRNEDNHIEDVQTSTVNQNELCVSETEHE FT DNCGNVEQSVMDVADNAFIKVTGKRGRKHQKSAEKEGTSDSDESPSEKLHR FT DPTTQAGVKVGALSESLDRITRSRSKQKKMSDDVSSAVAGCEPTVREKSND FT D" FT CDS 1705..4962 FT /product="Tx1-4_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNHVMRIASVNLNSSTNLVNKNLLKDFIFNNDLDVIF FT LQEVCYSNFSFVPSHIPYINVNERESGTAVLIRNSFETSQPLFSLNGRITS FT IVIQNINFVNVYAFSGSNRKKERDELFLNDLTTHLAKANVSTNIVGGDFNC FT IINSSDCLGDSKSYCVGLRQLIEAFGWKDVAWELKKNQFTFHRANSASRLD FT RFYVPQNFISNILDFKTEPVVFSDHCGVVMKIKVDQTMIVSRGRGFWKINS FT TVLKDDRVSELFAEACDSWKLRQVYDLNKNKWWNEVFKIKCKQFYKSKSWE FT LVSRITNEKQYFYGKMKEYMEKINRGDDTYLDIKVIKSRIMEIETERLNHL FT GNTFSIQNLLQGEIINVFQIAAQIKRQERNNILKLEENNVIITDSIQLRKH FT VYDYYSNQFKKETVVSNQREIEDSLNHLTNCLDDEDRAQLVSPISLEELKD FT ALELAKKKKSPGPDGLTYEFYLQNFEVIKYDMLEIFNAYLSGTFLPPKGFS FT DGVITLVPKDHRKKLSLDNYRPISMLNCDYKIFTKILAERIQSKMHKLLGT FT GQTACLQKTSCIDNLKNLRRVLTKSCENKRFKACLVSIDLNKAFDRVDHEY FT LWNVLEKFGFPPTLINCLKNLYDIAFSRVLVNGFLTDAFRITRSVRQGCPM FT SMILFVLYLEPLIRKISESLQGFWVYDKFIKVIVFADDFNVIVRSEVEFDL FT LLTIFECYSRYAKIKINFEKSSYMRLNKAMIGPQRIKEAENMKILGVTFYK FT SWYLTVVANYDSVISHIKSLLQKHKIRNMNLYERCMVLNTFILSKLWYIAQ FT VFPPLNRHIAEIRKICGNFIWNGLIFRVERNQLYLDYFKGGLRLIDPEAKM FT KALFIKNLLYNEDMNGDFLEENYLLQLRTPQRITKNAREWIANAIEVKSRP FT QLNSTKLLYDYFVGMNNSVPKIEEKIDIDWPTLWENLNHNFISLNDRSKVF FT CFVNNLIATGEKLYAYNVRKAAVCDKCGGVDNITHRLKVCPNAKIVWQWCS FT QIIRTRMGCNVDDLEEFLSQTICLRSQKQKAALWLTLRVISFNLAGSVPNL FT FVFKKQLRELRWNNRKLFVKHFGNNLNIC" XX SQ Sequence 5066 BP; 1771 A; 720 C; 1119 G; 1456 T; 0 other; cagttcgtgt ttggatttcg tgccgtgcag acgtataata gagagtacga cagaaaaagt 60 tctgatttgt gagcgcccgg atctcagcaa ggctgattcc aaaaaaaaaa gtgaaagcgt 120 tagtttttga aagaagaagt gaggagtagt gaagaaaatg gctgatacag agttgagagc 180 gaattctctg cgcattcgtt tcgaacgcgg ggtgccggag ccggcggata gtgagatttt 240 caaattcatg aaatcgaaga tgggtctcaa cagcgaaaag ttgctttcga tgtacaagga 300 caagagtgaa acgtcgatta tcgtcaagtt caaaaaggag gaagatatgc gaaacacttt 360 atcagatcta ccgggaacga tggtttttga gtatagcaaa tatgagagta cggaggttaa 420 attgtcatcg gcaaatgcta tcgtgcgata cataagactg ttcaatttgc ctccagaggt 480 cgatgatcgc gaaatttcag tagtgatgca acgttatgga aaagttgtgc ggttagtgcg 540 ggaaaagtat ggagaagaaa cggggttccc aatatggact tcggttcgtg gtgtatatgt 600 ggagttgaag gagggtattg aagtgccggc aacggttttc atcaggaatc ttcgcgcgag 660 agtttcctat gagggcataa tcaacaagtg ctatttatgt gggagtaaag atcatttcaa 720 agcggagtgt ccggagaaaa agtcggttaa cgagcgattg agcaatcagc aatcatcgtc 780 gtacagcgga attttgaggg gtggtggaaa atggttgaaa caacaagtga aaacaagcag 840 caatgaaaaa gacggaatga tcaaactcgg tcaaggtctt ccaaagcgat ccaatacaac 900 tcaagcacaa ttggattcgt gtcaggacag ggatgaaggg gcagcgcagc gaaacgagga 960 taaccatatt gaggacgtcc aaacatctac agtgaaccag aacgagctct gcgtaagcga 1020 aacagagcac gaggacaact gtggcaacgt ggaacaaagc gttatggatg tagcggataa 1080 tgccttcatc aaggtcaccg gtaaaagagg acggaaacat cagaagtcgg cagaaaaaga 1140 aggaacttcg gattcggatg agtcaccatc cgaaaagctg catcgtgacc cgacgactca 1200 agcaggggtt aaggtaggtg cgttgagtga atcgctcgat cgtataactc gtagcagatc 1260 aaaacagaaa aaaatgagtg acgatgtgag tagcgctgtt gcaggatgtg aaccaacagt 1320 gagggaaaaa tcaaacgatg attagaacaa agttaattag caagagaata gttgtatcta 1380 cgatacatag gaatatttgt gagttccggt atgagtatgg atgttttatg ggagagtttt 1440 atattgagca acgatagaga ttatggatgt attgatatgc agaaatgtag agttattgtt 1500 ttgcagattc gcaattagca gtttagtaga taagtagatt ggtagattta ttagacatga 1560 agatttttta atttgtacat ttgtagactt gttgaattgt agatttgttg atttgtagat 1620 ttgtagattt gtaaatttga agatttgttg atttgtaatt gataattttg tagattttca 1680 aaccttttca aaaaataatt caagatgaat catgttatga gaatagcaag tgttaattta 1740 aatagcagta cgaacttagt caacaaaaac ctgttgaagg atttcatttt taacaatgat 1800 ttagatgtaa tatttttgca agaagtctgt tattcaaatt tttcgtttgt tccatcacat 1860 atcccgtata tcaatgtaaa tgagcgcgaa tccggaactg ctgttttgat acggaattca 1920 ttcgagacaa gtcaaccttt attcagtttg aatggtagaa taacttcaat cgtgattcag 1980 aacataaatt ttgtgaacgt atatgcattt tcgggatcta accgcaaaaa agagagggat 2040 gagttgttcc tgaatgatct gacgacacat ctagcgaagg caaatgtatc tacgaacatt 2100 gtgggaggag atttcaactg tatcatcaac agttctgatt gcttgggaga ttctaaaagc 2160 tactgtgtgg gtctgcgaca attaattgag gcatttggat ggaaagatgt agcgtgggaa 2220 ttgaaaaaga atcagtttac ttttcataga gcgaactctg cgtcccgatt ggaccgtttc 2280 tatgtaccac aaaacttcat tagtaacatt cttgatttca aaacagagcc ggttgttttt 2340 tctgatcatt gcggggtagt catgaaaata aaagtagatc aaacgatgat agttagccga 2400 ggacgaggat tttggaaaat aaattcaact gttctaaaag atgatagagt ttctgaacta 2460 tttgcagagg cttgcgattc atggaagttg aggcaagtgt acgatttaaa taaaaacaaa 2520 tggtggaatg aagtattcaa aattaaatgt aaacagtttt acaaatcaaa aagttgggaa 2580 ttggtaagta ggataacaaa cgaaaaacaa tatttttatg gtaaaatgaa agagtatatg 2640 gaaaaaataa ataggggtga tgatacatat ttggatatta aagtgattaa atcaaggata 2700 atggaaatcg aaactgaaag actgaaccat ttggggaaca ctttcagtat ccaaaacctg 2760 cttcaaggtg aaataatcaa cgtttttcag attgctgccc agattaagcg acaagaaaga 2820 aataacattt taaaactcga agaaaataat gtgataatta ctgattcaat acagctcagg 2880 aaacacgttt acgattacta ttcaaatcaa tttaaaaagg aaactgttgt ttcaaaccag 2940 agagagattg aagattcttt gaatcatcta acgaattgtt tggatgatga ggatagagca 3000 cagttggtat cacctataag tttagaagag ttgaaagatg ctttagagtt agccaagaag 3060 aaaaaatccc ccggtccaga tggactaacc tatgagtttt atttgcaaaa ctttgaggtt 3120 ataaagtatg atatgttaga aatcttcaac gcttatctct cgggaacctt tttaccaccg 3180 aaaggcttct cagatggagt gatcacgctg gtaccgaaag atcacaggaa aaaattatcc 3240 ttggataact atcgaccgat ttccatgttg aactgcgact ataaaatatt tacgaaaatt 3300 ctagcagaaa ggattcagag caagatgcat aagttactag gaaccggtca aacagcgtgc 3360 ctccagaaga catcttgtat agataaccta aaaaatttga gaagagttct tacgaaatcc 3420 tgcgaaaata aacggttcaa agcatgtttg gtgagcattg acttgaacaa agcatttgat 3480 agagtagacc atgaatactt atggaatgta ttagagaaat ttgggtttcc tccaacacta 3540 ataaattgct taaaaaatct atatgatatt gctttctctc gggttttagt aaacggattt 3600 cttacggacg cctttaggat cacaagatcg gttagacagg gctgtcctat gagtatgatt 3660 ttattcgttc tgtatttaga gccgttgatt aggaaaattt cagaaagctt acaaggattt 3720 tgggtgtatg acaagttcat taaagttatt gtctttgctg atgatttcaa cgttatagtg 3780 cggagcgaag tagaatttga cttactgtta accatttttg agtgctactc aaggtatgca 3840 aagatcaaaa taaattttga gaagtcatca tatatgcgtt tgaataaggc tatgatagga 3900 ccacaaagaa taaaagaagc ggaaaatatg aaaattctag gagttacatt ctataaaagc 3960 tggtatctca cagtagttgc aaattatgat tcggtaataa gtcatataaa atcactcttg 4020 cagaaacata aaataagaaa catgaacctt tatgaaagat gtatggtttt aaataccttc 4080 atcttatcaa agctttggta tattgctcag gtttttccgc cactcaatcg acatattgct 4140 gaaattagga aaatttgtgg aaactttatc tggaatggtt tgatatttag agtggaaagg 4200 aatcaacttt atcttgatta tttcaaggga ggattaagat taattgatcc cgaggcaaaa 4260 atgaaagcgc ttttcataaa aaatttactc tataatgaag acatgaatgg agacttctta 4320 gaggaaaact atttacttca attaagaaca cctcaaagaa taacgaaaaa tgctcgagaa 4380 tggatagcaa acgctattga agttaaatct agacctcagt taaattcaac aaagttatta 4440 tacgactatt ttgttggtat gaacaattct gtaccaaaaa ttgaagaaaa aatcgatatt 4500 gattggccga cattatggga aaatttaaat cacaatttta tttctttgaa tgatcgttcc 4560 aaagtgttct gttttgtaaa caatttgata gctacaggag aaaaactcta tgcctacaat 4620 gttcgtaaag cagcagtttg tgataaatgt ggaggagtag ataatataac tcacaggctt 4680 aaggtttgtc caaatgccaa aattgtatgg caatggtgta gtcagattat tcggacacga 4740 atgggttgta acgtagatga tttagaagag tttctttcac aaacgatatg tttgcgatca 4800 caaaaacaga aagctgcatt atggttgact ttacgcgtga tttcgttcaa tttagcagga 4860 tccgttccta atttgtttgt ttttaaaaag caattaagag agctaagatg gaacaatcga 4920 aaactttttg taaaacactt tggaaataat ttgaatatat gttgaaaagt acatagaaat 4980 aagatttaga taaggaaaat gaacatgtaa tccaagtatg aactgtaccc ttttctaata 5040 tcaataaaaa aaaaaaaaaa aaaaaa 5066 // ID Copia-105_AA-LTR repbase; DNA; INV; 240 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-105_AA_; KW Ty1_copia_Ele38; Copia-105_AA-I; Copia-105_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-240 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 240 BP; 61 A; 56 C; 38 G; 85 T; 0 other; tgttgagtgt aaccaccgta ctctagttgc ccccactttg ttctgctcct atgagcgtac 60 ttatgttttg ccccctttta tttctacacc tttgaagttc tttcggtcgc tatgaaagtg 120 acatctgatg taaacaaatg ttaaagtgaa aaacacgttt cgaataaaac catcttttag 180 ttaaagttaa tcgcggtttt actattgtaa tttcctcccg gcctatccga aaccacttca 240 // ID I_Ele20 repbase; DNA; INV; 6214 BP. XX AC . XX DT 14-OCT-2010 (Rel. 15.1, Created) DT 14-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele20. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6214 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6214 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 14 CC sequences with 94-98% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 407..1705 FT /product="I_Ele20_1p" FT /translation="MAASSSGDPGGTAARRLPTYMDPTNKFGELTFLQLTG FT KDGVQLPXNPYIVGKSVEMCAGGPIEDGKSEAQGTRYTLKVRDPAQVAKLL FT KMTHLIDGTKVEVIAHPNLNVCRCVISCFELIHMDEAEILREMGGDGVIRV FT QRIMRLEGGKKVNTPALILTFCKTTFPGHIKVGLLHVPTRPYFPNPMLCYN FT CFSYGHTRTRCPGPQRCFNCSGNSHGEEECGEAPHCRNCNDGHRPTNRQCP FT VYKQEIEVIKIKVRDNLSFPEARKRAELQAQGSYAQVAAQQNEFDKKLKEL FT EATMKQKDELIAKLLEDNRQKDEKMEQMLAQIWQLKQQISSQEKPHSSREI FT KSAQPPSGVVTRSRNNSPALTRSRNNSPAIQETKRSRASKQHTDSLTKPAS FT PDRQSPPPKKTATTIPTQRSTYSDDDISISEIPPNHRLR" FT CDS 1659..6110 FT /product="I_Ele20_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTIFRYLRSHPITAFDSSNAKHRLNTHDDDPRLTTDN FT EINRFESEECAVLLSTQAQQVEKTVRDALPQPVDSDLXSVADGTADNTTMR FT VFEHSRRIQMIVEETEDINNETTQSLSLMPTTVGIHQRSSVVRSTVVGIGG FT QDVPQVSPFNNHDLAGTAASVPITDPRTSLTQPGYQNTHLQGDTYSQTHRQ FT DIVSCSQTTQSLSPVPVAVGDFGSSVVFSTAAGTVGQDVPQVSTSQQRPLM FT EFTAETIPIPATILNQQAPSRGASVTSSCSSDILPAVNNNDKCFALQWNMR FT GLRANISELKQLIGKYSPCIVALQETKVRQTSVPPDFVGKNYSLLLEPKKD FT HYWQHGVGLAIKEGIPFERMIIDTTLHLIAVRIQKPFQATVVSIYIPPSAQ FT QCETALNNLFDQLDGPVIFLGDFNAHHLAWGSQSSNTLGRFIAETTLARGL FT VILNNGSPTRIDPATGNTSSIDISFCSESLAQKLNWRTLSDTCNSDHFPMI FT IGLPGWSDTATTRKKWFFDRADWSVYERITAELIRSDEVWDVDRFTAKLIE FT AATKCIPQSSGQAGPKSVPWWCPEVKDAIKRRRKCLRSLRRMDASDCERPA FT ALKRFQEARATARKAIKTAKQQSWEDFVGKLSPSSTTTELWRTVNTLRGKR FT QHRTPILKLANRYSDDPSEIAEELAKHYSERSATSSYSPLFQKAKEKAEQQ FT RIDPLLNTNDRYNMDITLNELLWALDKGQSGSTGHDMIGYPMLKHLPLSVK FT LILLEIFNGIWRSGVFPSSWKKAIIVPIPKPKCNTPEPDTFRPISLTSCMA FT KVLERIINRRLIAELESSGRLDERQHAFRSGRGADTYLAELEKSLPMTDEH FT SLIASLDLSKAYDTTWRHGILRTLKSWRIRGRMMKLIQSFLSERMFQVCVG FT GQLSRPHRLENGVPQGSVLSVTLFLVAMQPIFRVIPKGVEILLYADDILLV FT VRSRKTEGLHRKLQSAVKAVEKWTKSVGFTISATKSSTFYCSPNARRNPSQ FT NITINRVTVPRTNRLRILGITLDRTLNFKAHCKMVKESCASRLRIMRTIGA FT QLPRGNRNTMLQVGSAIITSKLLYGIGLVSRGGEAAFQNLAPVYNQMIRYA FT SGAFVTSPITSIMVEAGTLPFELLATQSVIRIAIRLLGIHGPNRNLPLVQR FT ASTRVGELTGAALPMIAQLVKRCSREWHASKPNVVWDVKNAAKAGDPPNKV FT RPIVQQLLSNRFKTSTVYYTDGSKGNDTVGAGLFGVNLAEKYSLPKQCSIF FT SAEAYAIRMALLLSNESDIAILTDSASCLLALEAGRSKHPWIQEIESLARG FT KNVQFCWIPGHTGIRGNNEADRLANDARCQPIINIALPAEDAFKDIKIGVR FT LYWERLWLTYRDVKLREIKFDTHKWKDRENNAEQRVLTRLRIGHTRLTHSF FT LLSKESPPNCECCGTTLDVRHIVLHCQKYDDVRQKYEINTNSLRSALKNDD FT EGETKLIKFLRDTKLFEKL" XX SQ Sequence 6214 BP; 1905 A; 1499 C; 1388 G; 1418 T; 4 other; tcatcgcgtt tcgacacgcc gttccgtacg gtagctacag cgcgcgctct cgtagtaaac 60 aaggtaattc tccctagttt tccgcgaccg gcgcggtcga gacgttactg tatcaataag 120 tgtagtgact gagaaacagt gaatcgtgaa ggatctgtga aaatcgctac gtaaacggtc 180 gaaattagtg aaatttctgc cactcacgtg cagtgtatac gaaaacaaac ggctgctgtg 240 tagttaccct ctcaacacca ttcattcttt cgtgttcgcs stgacggtgt gccagaagag 300 cgttaaattg gaataaaaaa agaagctcaa aagtttacaa ggaaccaaaa ttggggtagg 360 taacgggtga ccgttaccac tgtagaagtt tttgccgtgt aacaacatgg ctgcgtctag 420 cagcggagac cctggcggaa cagccgcgcg aaggttgccg acttatatgg acccgacaaa 480 caaatttgga gagctaacat tccttcaact cacgggaaag gatggtgtcc aacttccgam 540 aaatccgtat attgtcggga aatctgtaga gatgtgcgcc ggtggaccaa tcgaagatgg 600 aaaaagcgaa gcccaaggaa cacgatacac tctgaaagta agagatccgg ctcaagtagc 660 taagctgttg aagatgaccc acctaatcga cgggacaaag gtagaggtga ttgcacaccc 720 gaatctgaac gtctgtcgat gcgttatttc atgcttcgaa ctcatccata tggatgaagc 780 agaaatatta agagaaatgg gcggtgatgg agtaattcga gttcagagaa tcatgcgact 840 tgaaggcggt aaaaaggtta acactccggc gctgattcta acattttgca agactacgtt 900 cccaggtcat atcaaagtcg gtctacttca tgttcccact cgtccatact ttccaaatcc 960 tatgctgtgc tataactgtt tcagctatgg acacacacgt actcgatgtc ctggaccaca 1020 acgttgcttc aactgctcag gaaactcaca cggggaagag gaatgcggcg aggctcctca 1080 ctgccggaac tgcaacgatg gccaccgtcc tactaaccga cagtgtccag tatacaaaca 1140 agagattgaa gttataaaaa tcaaagtgcg tgataattta tccttcccag aagcaagaaa 1200 acgagccgaa cttcaagcac aaggcagtta tgctcaggtg gctgcacagc agaatgagtt 1260 tgataagaaa ctcaaggaac tagaagcaac aatgaaacag aaggacgaac tcatcgctaa 1320 actgttagaa gataaccggc agaaggatga aaaaatggaa caaatgttag cgcaaatttg 1380 gcagctaaaa cagcaaatct ctagtcaaga aaaaccgcac tccagcagag aaattaaatc 1440 agcccaaccg ccatcaggtg tcgtgacacg ctcaagaaat aactctccag ccctaacacg 1500 ctcaagaaat aattcgccgg ctatccaaga aacgaaacgc agccgagcct ctaaacagca 1560 tacagactcg ctcaccaaac cagcatcgcc agatagacaa agcccacccc cgaagaaaac 1620 tgcaaccact attcccaccc aacgatcgac gtattctgat gacgatattt cgatatctga 1680 gatcccaccc aatcaccgcc ttcgatagct ccaatgccaa acaccgtttg aatacacacg 1740 atgatgaccc acgccttacc acggataatg aaataaatcg gtttgaaagt gaggaatgtg 1800 cagtactcct ttcaacgcaa gcacagcagg tagagaaaac tgtccgggac gccctacccc 1860 aacccgttga tagtgatctt kcctctgtgg ccgatggaac tgctgataat accacgatga 1920 gagtttttga acatagtagg agaattcaaa tgattgtaga agaaacggaa gatatcaaca 1980 acgaaactac acaaagtctt tcccttatgc cgaccactgt cggtattcac caacgtagta 2040 gtgtagttcg ttcgacagtg gttggcattg ggggacaaga cgtccctcag gtcagtccct 2100 ttaacaacca tgacttagca ggtactgctg cgtctgttcc aattacagat ccgagaactt 2160 cacttaccca accggggtac caaaatactc acttgcaagg agacacgtat agccaaaccc 2220 acagacaaga tatagtcagc tgctctcaaa ctacacaaag cctttcccca gtgccggtcg 2280 ctgtcggcga tttcggaagt agtgtagttt tttcgacagc ggctggtact gtgggacaag 2340 acgtcccaca ggtcagtacc tcgcaacaac gtccgttaat ggaatttact gcggaaacca 2400 ttcctattcc agccacaatc ttgaatcaac aagcaccatc acgcggtgca tcggtgacgt 2460 catcatgttc gtcagatatc ctgccagctg tgaacaacaa cgataaatgc ttcgcgctac 2520 aatggaacat gcgtggacta cgagcaaata tcagtgaatt gaaacaactc attggaaagt 2580 atagcccctg catcgttgct ctccaagaaa ctaaggtaag gcaaacatct gtaccgccgg 2640 actttgtcgg caaaaactat tcactgctat tggaaccaaa aaaagaccac tactggcagc 2700 acggggttgg ccttgcaata aaagaaggca ttccgttcga gcgcatgatt atcgacacta 2760 cgttacatct catcgcagtg cgcatccaga aacctttcca agctacggtc gtatcaatat 2820 acatcccacc aagcgctcag caatgtgaga cagctctgaa taacctcttc gaccaattag 2880 atgggccggt aatctttcta ggtgatttta acgcacacca cttagcgtgg gggtcgcaat 2940 cctcgaacac actcggtcgt ttcatcgcag agactacgct agccaggggg ctggtaatac 3000 tcaataacgg ttctcccaca cgtatcgacc cagcgacagg caacacttcg tctatcgaca 3060 tctccttctg ctcggagagt ctagcgcaaa aactcaactg gcgaacgctt tcggatacgt 3120 gtaacagcga ccatttccca atgataattg gactcccggg gtggtcagat acagctacta 3180 ctcggaagaa atggttcttc gaccgtgcag attggtctgt atatgaacgt ataactgcag 3240 aattaatccg atcagacgaa gtgtgggatg ttgatcggtt tacggcgaaa ctcatcgaag 3300 cagcgaccaa atgtattcca cagtcaagtg gacaagctgg cccaaaatca gttccatggt 3360 ggtgccccga agttaaagat gctatcaaac gacgacgaaa atgcttacga tccctaaggc 3420 gtatggatgc ttcggattgt gaaagaccag ctgctcttaa aaggttccag gaagcgagag 3480 caacagcaag aaaggccata aaaacagcaa aacaacaatc ttgggaagac tttgtaggta 3540 aattatcacc cagcagcacg accactgagc tctggaggac agtgaataca cttcgcggca 3600 agcgacagca ccgaaccccc attcttaagc tagcgaacag gtactcggat gatccaagcg 3660 aaattgcaga agaattagca aaacattata gcgaaaggtc agcgacatcc agctattcgc 3720 cgctttttca aaaagcgaaa gaaaaagctg aacaacaacg catagacccc ttgctgaata 3780 ccaacgatcg ttacaacatg gatataaccc tgaacgaact cttgtgggca cttgacaaag 3840 ggcagagtgg ttcaacaggc catgacatga tagggtatcc catgcttaaa caccttccgc 3900 tgtccgtcaa gctaattctc ctagagattt ttaatggtat ttggcgcagt ggtgtctttc 3960 cctcctcctg gaaaaaagcg attattgtgc ccataccaaa accgaagtgc aacacccccg 4020 aacctgatac gtttcgaccg atttcgctta ctagctgcat ggcgaaagta cttgaacgta 4080 tcatcaacag acgtctgatc gccgaactgg agtcctccgg cagacttgac gagcgacaac 4140 atgcctttcg ctcggggcgc ggtgcggata catatctagc cgaacttgaa aaatcactcc 4200 caatgacgga tgagcatagc ctaatagctt ctttggacct atcgaaggcc tatgatacta 4260 catggcgaca tggaattctg cgcaccctta aatcttggag aatacggggt cgtatgatga 4320 aacttattca aagcttcctc tccgagcgga tgtttcaagt ttgcgtaggt ggacaactgt 4380 ctcgcccaca tcgattggaa aacggtgtcc cacaaggctc cgtactatca gtaacactct 4440 tccttgtggc catgcaacct attttcagag tgataccgaa aggagtggaa atcttgcttt 4500 atgcagacga tatccttctc gttgtgagaa gccgaaaaac ggaggggctg catcgcaaat 4560 tacagtcagc cgtcaaagcc gtagagaagt ggacaaaaag tgttggcttc acgatctctg 4620 ctactaagtc ttcaactttt tattgtagcc ccaatgcgcg ccgtaaccct tctcagaata 4680 tcaccataaa ccgtgtaact gtaccccgaa caaatcgatt gcgaatcctc ggcattactc 4740 ttgatcgcac attgaacttc aaagcccact gtaagatggt caaagagtct tgtgcttcta 4800 ggctccggat aatgagaacg attggagccc agcttcctcg tggcaaccga aatacgatgc 4860 tgcaagtcgg ctcagcaatt ataacatcca aactgctgta cggtataggc cttgttagca 4920 gaggaggtga agccgcattc cagaatcttg caccggtata caatcagatg atccgttatg 4980 catccggcgc ttttgttacc agccccatta catcaatcat ggtggaagcg ggtactttgc 5040 ccttcgaatt gttggcaacg caatcagtca tacgaatcgc tatacgccta ctgggaatac 5100 atggacctaa tagaaacctg ccgctggtac aacgagcgtc tactcgtgta ggagaattaa 5160 ctggcgcagc attaccgatg attgcacagc tagtaaagcg gtgtagtcgt gaatggcacg 5220 cttcgaagcc gaatgttgta tgggatgtga aaaacgctgc gaaagccggc gaccctccca 5280 ataaagtgcg accaatagtt caacagttgc tttcaaaccg attcaaaacg tcaaccgttt 5340 attatacgga tggctctaaa ggcaacgata cagtcggtgc tggtctattc ggagtcaatc 5400 tagcagaaaa atatagcctt ccaaagcagt gcagcatttt ctctgctgaa gcatacgcga 5460 ttcgcatggc tcttctgcta tcgaacgaga gcgacattgc gatcctcaca gattcagcta 5520 gttgtctact ggctttagaa gctggaagat caaaacaccc ttggatccag gaaatagaat 5580 ctctcgctcg tggaaaaaac gtacagttct gctggattcc cggccatacc ggtattcggg 5640 gcaacaacga ggctgaccga ctagcaaatg atgcaagatg tcaaccaata atcaatattg 5700 cactgccagc ggaagatgcc ttcaaagata taaaaatagg agtacggtta tattgggaaa 5760 ggctatggtt aacataccgg gacgtaaaac tccgagaaat caaatttgat acccacaagt 5820 ggaaagatcg tgaaaacaat gccgaacagc gagtgctaac acgacttcga ataggtcata 5880 cacgattaac gcacagtttt cttttgtcaa aagaatcacc tcctaactgc gaatgctgtg 5940 gcactactct agacgtacgc cacattgtgc tacattgcca gaagtatgac gatgtcagac 6000 agaaatatga aatcaacact aacagcttaa gaagtgcatt gaaaaacgat gatgagggag 6060 aaacaaagct gataaaattc ctacgagaca ctaagttgtt tgaaaagttg taaagttgct 6120 ttttgtagta cataatgtta atttaattca ttctgatacg aatgcaccat ctaaatggtg 6180 taaagtatcg ttaaataaaa aaaaaaaaaa aaaa 6214 // ID SINE-1_CQ repbase; DNA; INV; 1373 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A SINE family from Culex quinquefasciatus - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1373 RA Kojima K.K. and Jurka J.; RT "SINEs from the southern house mosquito."; RL Repbase Reports 11(1), 620-620 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >98% CC identity. ~19bp TSDs. This family is longer than usual SINEs, CC but it cannot be extended further. XX SQ Sequence 1373 BP; 394 A; 261 C; 257 G; 461 T; 0 other; tccttgggca tgagtaagga cctcgacgcc gttagcaatg ttggctttaa aataaaattt 60 cgtgataata tcacttcccc ccagccaaca tttttgccat gcgaagcggg cctcgacgct 120 gtgtctaggt ttaattccta gctaggagcc atcagtgtct taagaggtgg tcccaaggcc 180 gagtaggagt tcatgaagca aattagtcgt ctcccacccc ttttaaattg aaaaatgaac 240 tctgaaacca gcgcttactc acaacagaaa cagaggcctc ttctaggcat tgtaggatag 300 aatagatttt gaaatttatt taaaaaaaaa tattattacg tagcattttg atatgtcaaa 360 atttccatag agtctcggtc aatcaatagt gcgatgatat cggacaaaat tcgttaatct 420 gttgaactaa cggttttcgg attatggttg tatcgaccat tgttgcgaat cgagggtcta 480 ctggagcctt gacgatcaaa tggagcctaa catttcaaaa gggcgcatgg gttttgtgaa 540 caggagtgtg tctccttcac tcaaatgaca gtttacataa ggtataatag agatgcactc 600 ttgtttgcaa agctctatgc cctaatgaaa tgaatggcat gaaatagtcg tcttaattac 660 tagtgttcaa cttatgaact gttcatttat gtttattggt ttttggaagc ggcaagactg 720 gtttaaaaac aagatccttt accttatttg gcgttataat aaaacattcg gggatttgag 780 attaaattta ctttcagaca gtttagttta ggttgttgat taaagttaga tagttttccg 840 tagatgaata attggactta aatgattagt tgatttgaac gaagtgtaac atttcattgg 900 cagatctgtt tctagttgta atgtacgaca agactattga cggacgtgac aggtcggcag 960 ttagttttca tctttttttc tctagtattt tttatttcct ttaaatttgg aatacatcat 1020 aggtttcttt tgtatagatc tccgattagt tacattttct tatttaatta actaataatt 1080 taatttattc cgggccttct tactttgaat cacaatttca gattttcttt attcagtgtt 1140 aaacttagat taccgtaact gctcaaatca tccaagttct tactactcac atcaacacta 1200 attttctttc acctcccccc tcccgatccc ctatatcttc cggtggtgtt catttggtat 1260 gcaatactag ttcggccact accctttttt ccacaagaaa ccggacttgg actgacttga 1320 ggccgcgtcc cccaccttgc tccgtaaaac gaaaacgaaa acgaaaacga aaa 1373 // ID Polinton-3_TC repbase; DNA; INV; 17681 BP. XX AC AC154128; XX DT 15-MAR-2006 (Rel. 11.02, Created) DT 15-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a fossilized DE genomic copy. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Maverick; Tlr; Polinton-3_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-17681 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR GenBank; AC154128; Positions 37011 19331. XX CC This transposon is characterized by 981-bp terminal inverted CC repeats and a 6-bp target site duplication. It encodes a family B CC DNA polymerase (POLB-3_TC), retroviral integrase (INT-3_TC), CC ATPase (ATP-3_TC), cysteine protease (PRO-3_TC, corrupted by a CC stop codon) and one unclassified protein (PY-3_TC) conserved in CC Polintons from different species. XX FH Key Location/Qualifiers FT CDS 5444..4224 FT /product="PY-3_TCp" FT /translation="MSASKLQVYSSPEIDDSISKEEEHTYSPQVRSFDNND FT EIEIIINQRDIWISLFESFIQVDGEFIPDPVPETGGGNVTLTNNAAAYLFE FT NVSYELNNVELDSVREVGTVSTIKTFLCYGKDEVRALTLAGWNDQESQQLK FT TFNETDNTFSFRIPLSYLLNLPFDYHRIVSGHQKLRLIRSRNDANCFISTG FT TRKATLKINNIELKAKHVYPNDEIKLTILEGINKDRVISLPFRKWEIHELP FT SVRQTNNDIWRVKTSTQLERPRFIIVCFQTNRKNNPKSDVTLFDHCDVRSV FT RLWLNSNVYPYETWKLQFAKNKYLEAYQAYVDFYKEFNGKDRAEPILSYTD FT YTKRPIFVLDCSKQNEAIRSSTIDISLEFESDNNFPADTRAYCIIIHDRIM FT QYYPLTGIIRQLI" FT CDS 7541..8233 FT /product="ATP-3_TCp" FT /translation="MKVLKQKLQLPIINHDYNNVDNKHKMSHRHGTLLPES FT IRCIICGPSGGGKTNILLSLLFDPNGLKFENIYIYCKSLYQPKYKLLSDVL FT QKIKGMEYNQYEHNEDVLDPKDAKNNSIFIFDDVIMCNQAKIRDYFCMGRH FT KNIDSFYLCQTYTKIPKHLIRDNANLLVILKQDLLNLKHVYEDHVGADMKF FT EQFCEICQECWKNQYGFLVICKDNELNKGRYRKGFDHFIQV" FT CDS 10386..11330 FT /product="INT-3_TCp" FT /translation="MVRLKQVSDVKQKLVNEIFKPARRNFKRRRTIIKGLD FT DLWQVDLAEMQNLSYHNNGFKYILVVIDCFSKFLWTRPLKSKSANEVSRAM FT ESILKNEKERIPKNLQSDQGKEFYNTTFQDLMRRYNINHYSTYSVKKAAIA FT ERVIRTLKSKIYRYFTLFATHKWINKLQEITKNYNTTIHRTTGRKPTEINA FT SNQGDITAYDHLKIVDKNRKKFKVGDFVRISKEKMLFEKGYTPNWSTEIFK FT ISKINLTNPVTFLLEDKEGQPIKGAFYIWELSKTQYPDVYLVEKVIRRKGT FT KLFVKWLGLPDNQNSWIEMKDIT" FT CDS 6692..6123 FT /product="PRO-3_TCp" FT /translation="KKLPIKLPNRALTDNDLRRYAKLLKLPKFKGVFMRDE FT LIHMKPDKTECAIINLDSKKGSGTHWVCYRKTNKQVDYFDSYGNLKPPLEL FT VRYLGNCNIYFNHDRYQFLSYNCGHLCLMFLHNKR*DNHVFYSVMYKTFSL FT TGVESTLSEHYEPPIDLEENSNYSIALIGFYTNNNIPNIEEGTNKFYYFAP FT " FT CDS 13367..16549 FT /product="POLB-3_TCp" FT /translation="MEILFLELNINKFNPLRASTYIPLPKDIAKRRAVINI FT HNDDNECFKWSVLAHLHSTEVNVNRHRVDAYREFEEELDFTDISFPVKLTD FT IFKFEALNNISVNVYGTEQRFDHEKGKWVSDIIGPFYLTSKKRDIHVNLLL FT IVDDERMIKHYCLISNISRLISSQLSKHNHAKFICNGCLLFFSSEQKLVKH FT RKYACNEVVTTLPTSDLIINKFGKEVPGNELTFQNFDKSLKVPFVFYFDFE FT SILKPLRNDNSADNIRTIVTHLHEPYSFGYYIKCAYNDNLSIYRTYRGKDA FT PKIFVKWLQDDVNRLYHDHLKHIVPLEMTPLDELVFQISTHCHLCKEPFKT FT DYGSLNCDRVRNHCHLTGEFLGPAHSICNLNYKIPKYIPVFCHNLTNYDSH FT LFIKALATEKTIINCIAQTKEKYITFSKKILVDKVMNTTTNTVDNIFITLR FT FVDSFRFLSFSLDKLSQTLTSNECLTIRKHFPDDNQFNLIRQKGVFPYSYL FT DCFNKLDDTSLPLIQQFYDNLNKTDISEHDYERAQEVWNLFDCKTLGDYSD FT IYLKSDVLLLTDIFENFRKVCLKIYELDPAHYLTAPSLGWDAMLKMTGIKF FT ELLTDIDMVHFFRKSIRGGLCQCSKRKAVANNKFLSNHDPSKPTSFIMYLD FT ATNLYGAAMSEYLPYSGFRWEDPNEFNDNLILNLTDNADVGYIFEVDLEYP FT SELHNLHNDLPFCPEKFIPPSSKTEKLCATLNNKVHYVLHYRNLRQAIQGG FT LKLINIHRVLSFNQSPWLKTYIDLNTEMRNKSKNALEKDVFKLMNNSVFGK FT TMESIDKRVNVKLVTSWNNIGRKPGAESLIARPNFKNSSIFTPNFVAIQLE FT NEKVFYNKPLYVGFTILDISKTIMYDFYYNFIKLKYGNNATLLYTDTDSLV FT IEIFCENIYDDIKSNIERFDTSNYKAENIHGIPVSSSVIGKMKDEYKGRVI FT SEFLGTGAKAYCVDVEGELSKKAKGIKHNVITSELHKIDYQNAVTVPNTEI FT IKEMNIFRSKLHNIYTELKKKIALSFKDDKRYILNNSEGRTLSWGHKNILP FT QTQE" XX SQ Sequence 17681 BP; 6215 A; 2711 C; 2788 G; 5967 T; 0 other; agtagataga gatagccggt tcaaaaacca cctccacctc cctctactgc gcatgtgtga 60 aaattgtgtc agtcagccgc agacgcaagc gcgtaatcgc agctcgctgg cctgcgacca 120 gctggtcagc cggcctgcgg ccgactgacg ccatttgtga tgatgcaaat atatttatgt 180 agtataataa taataataat aataataatc gtcgccatct tggaagtttg aaagtcggcg 240 gcggtcattt ttgtttaatg gtaaattggc gccaaattta aattatttca aaattaatat 300 ttgaaattag tcggccatgt ttgtttgtta aaaaaaatgg cgccaaaatt ttaatttatt 360 taaaaattaa tattgggtgt aatgttggta gcgccgatgt cgtcacaaat tccgcctaaa 420 ttctcacgtt agatggcgcc acggtctgca aacttttcgc tagatggcgc tgccgtcagc 480 gcagccgtca atcaaagatg gcgctggcct ccattttttt tttttaaaca caccccccgc 540 ccgtttgcga atatgcaacc aagaaaacaa gtagtagtag tagtagtagt agtagctttt 600 tggggttgtg aaatttttgg tttttggggt tgtgaaattt tgaatatatg atttttgaaa 660 tgtgcaattg tgaatacatg aagtatatat acaatacatg ctgtattgag ttaagcaaaa 720 tatcagaaaa tcatcgcgct tgtcactaat aactaataaa ggaaagtcat tattttatcg 780 ataaggaaaa aaaacctagt atatgtagtt ttttattttt gtaatgttgt ttgcctaact 840 tgtctgttgc gcaccaagtt tattttgtta tttgtttaaa atataatgca aaaaaagttg 900 ttgttgttgt tgttccttcc atggggagtc gaacctcagg taggtagagt ggatagtgct 960 gtctattttg gaaacctgaa aaaaatcata aattagtttt ttaaatagag gtgaacatga 1020 cacttaccaa aaaattttac attaatcttc gtgaacaata cgttcgaaat taaaaatgtg 1080 ggcatcatgt tcctgatttc cattagcttg cttacgtccc atataaatca aataagatcc 1140 gagtccttcc attagtgtcg taattatggt atcattcata actgctgtta gtcgtttagg 1200 taaaaaaact ttaccgacgt tgtccagata tgctaccatc ataaccccat gttgtgtgtt 1260 tgccttggtt aaattgatga ttttatattt cttgttcact tccaaatcgt tggcactgac 1320 atatgtatca gtatattcac aggcagcaat tttattcaga tttaatagtt gatcagccat 1380 tacagaatcc tttgaaattt aaatgcaaaa atacacagcg gagcacatac aacttctcac 1440 aaagatagtt ttaataaaac tggtgaaatg tgtaaaaaac ggaactttta taatcaattt 1500 ttgcggttaa cagataacac gctttcctaa aaaataaaaa tgaaataaaa tatgatttaa 1560 tactaatttt gtggtttacc ttatcgtagg aattttgtga cgcacttttt gtttgataaa 1620 aatttgataa aaattttcct aaaaaattaa acacattttt agcaatcaaa tgtggtaaac 1680 aaatttaaaa aattaaagaa tttaccaaaa aaagattatt tatgttggtt ttagagaaga 1740 attcagtagt ctgaatcgga agaatttgtg gatgaagaac ttgtgccgaa gtcaatattt 1800 tcgataaact tacgatgcgt ctttttggca caacttttgt aggtctgaag ctgaattaga 1860 tgtttaaatt agatgtttag aaagaacgtt ttattaagtt gtaatataaa cttactgtaa 1920 acaaacactg atgaatacac caactacacc aactatcttc ttcacgagta taatctgtta 1980 aattttcaac caggactgca ttgtgtgtaa taatcttttt aaaagataat tttttcaatt 2040 gaattaagat gggttctcgc ggacaattaa tgttaaatat agataaagtt ttaggtcccg 2100 cttgtatgac actcttagta aactcatcat attcaaagta attattttca ttaaaaaagt 2160 acagctttcc gtttaaatat ctaaatgtcc ttgttatttt attaggaaca ccttcaaata 2220 attcgtttgt taaaccagga acattatatc cagagatgca actagataat gtaatatatg 2280 tattatctct gagaaaaaga taaataccac ctgagtaagt tgttatcgct gccgaaattg 2340 gacttgatat tttaaattct tcgcttatag atttcggata attagtttta aatctaaaag 2400 tagataatga aaaaaatata aacttcagct tttacaataa ttactatatc atctgttgag 2460 ggtttatgat aaattgttgt aaattcttca aaatcttttg gtaaaaaatg ttaaccgccc 2520 agtaattaat taaggttttg gatatgaatt tttacctatt tcaattaaac aaatccactt 2580 tttttataaa agatgtataa atgtgtttta actattagaa atgtatatat tgtttgtaaa 2640 ctattggata taaatttttt ggggaaaatg attgtaacga taaattaata atcaaataca 2700 actggatatt tataaaattt ttgattcata aatttacaag aaaaaaacgt tagacaggaa 2760 ataagatcac aaaaataatc agaatgaata ttatttttta ttatacataa aacaaactac 2820 atacttaaac taaaaaagtc atacacctaa ataaattttt aattgcagtt tttaagccaa 2880 acctaacgct atgaacatca taaaaaacgc tatgaacatc ataaaacgat ccaacatgta 2940 tgaggtttaa tgtacccata cccaaatgtc aattataccc cgctcctcat cggtgtctat 3000 cgcggcggtc aagcgcccca cggtaaaaat atcaaaataa cctccaaggg atgggggcgg 3060 ttccagtcgc agatagatgt taataaggga aacccactcg aaggtgcccg cgggaatttg 3120 ggtgaggagg tggtccacct cctcttcggt taacccagcg tggttaacaa ccaaatctgg 3180 gttatagtcc ccatagtaac ctgcgggatg ttctgtaaca aaagaaagag aaatttaata 3240 aatagattac attaatttat taagactgat gatgaagtgt tgtaattaaa tttcaaacaa 3300 ttgattcgga aaaaatttaa aataaaataa tgtaaaaatt atttctactt actagacatg 3360 atggttgtgt tgactgttga tatttcaata aaaattattt cacacactaa aagttgatcc 3420 ctttatactt tttgtaatct taactagagt ggagagggca ggttaatgaa gtaggagaag 3480 gaaacgagat taaaacttta aacgtttatt tcggtaaatt ctgaattaca taattataaa 3540 taaaccgtac attattaaga ctacagatac atattttagt ttttaagtaa tgatgcatac 3600 ataacggaag ttgtggttgc aattttggac aagagtttgc ttcctccaaa ttgatgatcc 3660 tagggtaaaa gtcctttgaa tcgaaaaatt gtcttaaaaa atcctcttta acgcttcctt 3720 ttacgtaaat aatatcgata acatcatcga taatattttc ttctagtatg ttataaatat 3780 tttcatagtc gatatttcca ctgttgtaac gtaatagatg atgatttaat tctaaatatt 3840 tcacttgttt ttttacatct gcacttagtt gacgaaacgg caccggtgat ttaaacaaat 3900 agtgaccagt tcgatgtcca tcaaatattg ttaattcttt aacgatgaat tcaggtatat 3960 aaaatccttg aacatcaatc actactacct tcatgttgaa cacttaaatc tagactgaaa 4020 aatttgttca cgatgagcag aatttaagat ctcctttaaa atatttttca tttgattaat 4080 tctaaactta aacctttctc tgtctgcagc gcattgtatc caatagtttc cttttctagc 4140 atcttgataa gcatatgacc aaactattag attatgtact tccggagcac tatcacttat 4200 agtaacactt tttttcatcc ttatatcagt tgtctaatta ttcctgttaa cggatagtac 4260 tgcattattc tatcatgaat gataatacaa taagctcttg tatcggctgg gaaattatta 4320 tcgctttcaa attctaagct gatatcaatg gtggaacttc gaattgcttc gttttgcttg 4380 gaacagtcta atacaaatat cggtcttttt gtataatctg tatagcttaa tataggttca 4440 gcacgatctt ttccattaaa ttccttataa aaatcaacgt aagcttgata tgcttctaaa 4500 tatttatttt tagcaaattg tagtttccat gtttcatatg gataaacatt tgaattcaac 4560 cataaacgta cgcttcttac atcacaatga tcaaatagag taacatcaga ctttggatta 4620 ttttttcggt ttgtttgaaa gcaaacgata ataaatcttg gtctttctaa ttgagtagat 4680 gttttaactc tccagatgtc gttattagtt tgtcgaactg atggaagttc atgaatttcc 4740 cattttctga acggtaacga tataacacga tctttgttga tcccttctaa tatagttaat 4800 ttaatttcat catttgggta aacatgtttt gcttttaatt caatattgtt aatcttaagt 4860 gtcgcttttc tcgttccggt agaaatgaaa caattagcat cgttacgact tcgtattaat 4920 cgaagttttt gatgaccgct aactatccga tgataatcaa atggtaaatt taacagataa 4980 gataaaggga ttctaaatga aaacgtatta tcagtttcat taaatgtctt cagttgttga 5040 ctttcttgat cattccatcc agcgagagtt aatgctctta cttcatcttt gccataacat 5100 agaaaagttt tgattgtaga tactgttccg acttcacgta cactatctaa ctcaacatta 5160 ttcaattcat agcttacatt ctcaaataaa tatgctgctg cattattagt caaggttaca 5220 tttccaccgc cagtttccgg taccggatct ggtatgaact caccgtccac ttgtataaat 5280 gattcaaaga gagaaatcca aatatctctc tgattaatta taatttcgat ttcatcattg 5340 ttatcgaaag atcttacttg cggactatat gtatgttctt cctctttact tatgctgtcg 5400 tctatttccg gtgaactgta aacctgaagt ttgctggcgg acattatatc ctaacgattt 5460 aagaatctgc acgctctgat gtgataatgt cgactgatga caagatactg actgataatt 5520 tgatggattg tatgattttt tatattgatt ttgaaacact aaacccatgt ttatgcttct 5580 tttttcagag caagacgaat attgattgtt tgaccgttaa aatctatagg acaatggtct 5640 tgatcaacaa tatcaatagt taggttaggt atattggatc tattaatgat aggtaaataa 5700 ataggattta atggttccac aaaaatctgt tcggctggac taacacttaa agcagtttca 5760 taaagagtat gagattcttt gtttttgtag aaagaacctc ttacaatatt acactcgata 5820 cgtattgtat taaccttaat tatgttaaca ggtaaatcac tttcatgtaa ttgtcttgct 5880 ttaagtttcc ttttagaaaa acctaacaat ttaccaatac tatcttcgtg agcgaaatta 5940 atgtcgtaaa tactaaatat ctcacacttt aatgtgtttt cgttagcctt caatgaaaat 6000 gtattctgtt tttgagcatc agttgcagat cgtccactaa gaataatttg tatatatttt 6060 tctatagcag ctatttcata agcacctttc ggaaatgtta tcactttttc ttcagtagtg 6120 ttcggagcaa aataataaaa tttatttgta ccctcttcta tatttggtat attattgtta 6180 gtataaaaac cgattaaagc tatagaataa ttcgaatttt cttcaaggtc aatgggtggc 6240 tcatagtgtt cagacagcgt actttcaaca cctgttaggg agaatgtctt atacatgact 6300 gaataaaata catgattatc tcatctttta ttatgtaaaa acatcaaaca taaatgtccg 6360 caattatatg ataaaaattg ataacgatca tgattaaagt aaatattaca atttcctaag 6420 tatcgtacta attctagagg tggctttaga tttccataac tatcaaaata atcaacctgt 6480 ttatttgttt ttctataaca aacccaatga gttccggatc cttttttaga gtccaaatta 6540 atgatagcac attctgtctt atcaggtttc atatgaatta attcatctct cataaaaacg 6600 cctttaaatt tgggtagttt taataattta gcatatcttc ttaaatcatt atcagttagt 6660 gccctattag gcagttttat tggtagtttt ttggaaattt acgtaaatac aatccataac 6720 ctcgtggttg ctttcgaaga tataaaccac ttcccgcctt cccgatagcg atatcttcca 6780 gtactttatt atgacgttta ttctcatcta atttctgttt cgctgcttta gcatcattta 6840 cagctttagt tatagcagca gcacctccgc ccatagcccc taaggcacta agtcctccaa 6900 aaattaatgg taacaatggt aaaaatcctc ctgtttttat aggtataata cgaggaattt 6960 ttatacgctt acgacctcca gattttctaa ctgctatacg agcagctgat aatgcagatt 7020 ttatattatc atatccctta ttttctaaat ttgataaaag agctttatga aaaggttgtt 7080 gttttttctg tgtcctcttt ctagatttct ttgtcttcct tttcgatttt ttcctttgga 7140 ttcccatacc taatttcctt ttgactttca ttgcgttagt tacaaaccag gcacttgctt 7200 tttcaccaaa tgaagcgtct ttagacttaa cacgtgacca tgcactatct tctaaaattc 7260 gatccgcttg atgtctatct tttaaatcgt tggaattagc gtaagctata tcgtgcgttt 7320 tacaaaagct gtcaagcttg tttattccag gatcacctcg ttctaaccgt ttcttaagtt 7380 ttgtaccagg tccgcaatat tgatatcctg gtaaatgtaa ttcaaaaggt aatttattga 7440 ttgcagtatt taatagtcct tcccctttga aagacaacat ttaactgaac ttttttccct 7500 atataaacct tcttataaat gtttttaatt tagtatgaaa atgaaagtgt taaagcaaaa 7560 attacaatta cctatcataa atcatgatta taataatgtt gataataaac ataaaatgtc 7620 tcatcgacat ggtacacttt tacccgaatc aatacgttgt ataatttgtg gaccttctgg 7680 tggaggaaaa actaacatat tattgtcttt attgtttgat cccaacggat tgaaatttga 7740 aaacatatat atttattgta aatcattata tcaaccgaag tataaactgc ttagtgatgt 7800 gctacaaaaa attaaaggaa tggaatataa ccagtatgaa cataatgaag acgtgctaga 7860 tccaaaagat gcaaaaaata actctatttt tatatttgat gatgtcataa tgtgtaatca 7920 agctaagata agggattact tttgcatggg ccggcataaa aatattgata gtttctatct 7980 ctgtcaaaca tatacaaaaa taccgaaaca tttaattcga gataatgcaa atttattggt 8040 aatattaaaa caagatctgt taaatttaaa acatgtatat gaagatcatg ttggtgctga 8100 tatgaaattc gaacaattct gtgaaatatg tcaggaatgt tggaaaaacc aatatggttt 8160 tttggttata tgtaaagata atgaattaaa taagggtcga tatagaaaag ggtttgatca 8220 ttttattcaa gtgtagagtg atatctattt gagaacgttt ccagaccggt taatatcagt 8280 ttcacagcgg ttagtatcgg tgcgatatcg gttaataccg gttccaaagc ggttagtatc 8340 ggttagatac cggttagaga ccggttaata tcggtctcag accggttagt atcggtgtga 8400 tattggttaa taccggttcc agagcggtta ttatcggata gagaccggtt aatatcggtt 8460 tcagaccggt taatgtctgt taaaaagcag ttcctgatcg gtttgagatc ggttaatatc 8520 agtttgagac ccgttataat agcagtcgaa taagaagtta tataaacaat aattctgatt 8580 tgttcggtta gtttataaca ttaagcggta atgatgggta ccaaaccaga tcaaaacctt 8640 aaaagacata tattaaaagc agctgattca attcgaaaga agtataaagc cataaagtta 8700 aacagcagtg aagcggatga gtcaattaaa aagctttttc agcctgttat aaatccttta 8760 gaagaaattt caataaaatt ggagaagcaa tctccaccac cacagcaaca acaacaaaaa 8820 caacacaaaa gaccaaaaca tcatattcaa aagcaaagga aacaatttct taaactacca 8880 aaaaaagaaa agattaaaca agaacttgaa ctaccagttg aagaaacata tgaacctagt 8940 acatcactgc gacaaccaga agaaatatat gagtcattac cagaaataaa agaagaagat 9000 gaagaaacca aatctgacac tatggatgat ttaattaata gcagacatat tttagatgaa 9060 tttttagaac aatatcctcc agttgcgaga aatgcaatag aagcagtttt agagaaaacc 9120 agtgatcaga catttggtcc tagatataat tcaaaagaga ataaactatt tatgggtaaa 9180 catgaattaa taatcagtca aaatggtgat ttaattttag ataatgaatc atttcctggt 9240 actcctggtt tatacagact aatatttttt aaaaaccctg aagataaata cactcaaatt 9300 acaaaagagg atcggtcagc atacaaaaaa atacttgaaa tttctaatac acatagaaaa 9360 aataatgatc ccacaaaaca actaaaaggt catagaggtg taaaatatac tcaaattgtg 9420 caaccaatgt ttacttcatt tcctaaatct ccgtttattg gtgaaggatt attcactaat 9480 aaacgtgtag aatatgttta ctgggatgac ataaatgaat tagtgtcacg tctagcatta 9540 ttacatgcag cgacaaaagc gggtaacaat tcccatctga atgaaatttt atcaatagaa 9600 gaagaattaa aggaatgtgg tgtaatatat taggcttaag tttttttctg gttaatcctc 9660 agtgttgaaa tgcccattga tctattcgga cgtaaagtgt tatcagcatc aagtgcattt 9720 tttaaatcta caccatcacc ttttgttttc actgctgatg gtgaccttga tttaaaacat 9780 catagaatct gtaacattaa tgatccaaca gaagacaaag attgtgcgaa taaggtatat 9840 attgatttag taagtaaaca aatataccaa aagatggaag gagtaatagc tgctttaacg 9900 gatttaaata ataatgttca taaagcagat gctatatgga tggataaatt taatcagtta 9960 aacttaacca tgaatgttaa gtacaaaata cttgatgaaa aaattgataa tataattgct 10020 acaattcctc acacgattga atcagaattg cataacaaac atttaagtat aaattatgaa 10080 atcgataaca agtggaaaaa atttatgaat aattcaaatg agataaccag acgaataaaa 10140 aaacttgaag aacgctttga aggaatcgat agtattaaga taaaaccaga aaccatttcc 10200 aaacgttcta aagcaggtga tacagatcaa cttgaaaatg tacctatatc aaagaaaagg 10260 aaagtaacag attcatatga aaatagtgaa agatagaacc catgcctcat aataaaaaca 10320 agtgtcctcc aggtcaaatg tcagatcgtc atggaaattg tagagaagtg taccgtaaat 10380 aagcaatggt tcgactaaag caagttagtg atgtaaagca aaaattggta aacgagatat 10440 ttaaacctgc acgcaggaat tttaaacgaa gacgtactat aattaaagga ttggatgatt 10500 tatggcaagt agatttagca gaaatgcaaa atttatctta tcataataat ggatttaaat 10560 atattttagt cgtaatcgac tgtttttcta aatttttatg gaccagacca ctgaaatcga 10620 aatcagctaa tgaggttagc agagctatgg aatcaatact gaagaatgaa aaagaacgaa 10680 taccaaagaa tttacagtca gatcaaggaa aggaatttta taataccact tttcaggatt 10740 tgatgagacg ttataatatt aatcactata gtacgtacag cgttaaaaaa gcagcaattg 10800 ctgaacgagt tatacgtact ttaaaatcca aaatttatcg atattttacg ttatttgcga 10860 cacataaatg gataaataaa ttgcaagaga taaccaaaaa ttataatact acaatacatc 10920 gaacaacagg aaggaaaccc acagagataa acgcttcgaa tcaaggagac ataacagcat 10980 atgatcatct aaagattgtt gataaaaaca gaaaaaaatt taaagtcgga gatttcgtaa 11040 gaattagcaa agaaaagatg ttatttgaaa aaggatatac acccaattgg tcaacggaaa 11100 tttttaaaat cagtaagatt aatctaacca atcctgtaac atttctgcta gaagataaag 11160 aaggtcaacc aatcaaagga gctttttata tttgggaatt atcaaaaact caatatcctg 11220 acgtctattt agttgagaaa gtgattcgac gtaaaggaac taaattattt gtgaaatggt 11280 taggtttacc agataatcag aatagttgga ttgaaatgaa agatattacc taataaataa 11340 gaaatatata tacgatataa ttatttaacg tctaatttag ttccatttaa gatggcaacg 11400 agtgtagaac ctgttataaa aaaagtgctg cattttgaaa aaaccataag cgatattcga 11460 ttactaatga aactggaact actgaaatta ggaccgtgcc agttgatgga tatatttcat 11520 aaattaccgg aaacatatca acaagatcat gatataaagc tgaatctacc atgtttacaa 11580 cattattata acagttttga cgaagatcag tttgatggtc ctccttcatc aataaataat 11640 tgtgtttcgt gtattttaga tgatacagga agccaacaac aacaacaaca acaacaacaa 11700 caacaacaac aacccactgc gtttcaagtg taatgcctct tttgaactca tcactcatct 11760 tgtttgttac tcttatacaa aatgtgttgt acgtgcaaaa accgttagat gttgtagata 11820 tgtacatcgt cttcactagt gaaagtccta acaacacaag atggtctcct ctctggaaaa 11880 gtttattttt caacgatgtt gattaccaat ggaagaagat gaatctgggt tatttgagca 11940 atactgcaat agaaatattt aaatatgaaa atgtttctat agcaaactct gtaatatatt 12000 ttaaattaag actagttttt accaatagta attatattga tactgactgg aagagactta 12060 tacttaatga tcatgatgta gtttcaataa aaaattgttt taatcatgtt cctacctttt 12120 tagcaatgtt tggaattgtt tgtacatgtt ggatcgtttt tatgatgttc atagcgttag 12180 gtttggctta aaaactgcaa ttaaaaattt atttaggtgt atgactttgt tagtttaagt 12240 atgtagtttg ttttatgtct aataaaaaat aatattcatt ctgattattt ttgttatcat 12300 atttcctgtc taacgttttt ttccttaaaa acgttaaatt tttatcagtt atattttgaa 12360 ttttttgaat ttttacgtaa ggaaccacaa tatccggttg caaagtttat ggaatataaa 12420 atcatatcgc atcaatgatt aattcatttc tacttcgact gtccgtgtat gaaagtaatc 12480 atgttcaagt gctctcaatg cgataacaag tttactcgca aagatgcttt aacacgtcat 12540 gtaaaaaaac atagtaagta aatacatttt acgttataga aattgtttta attaagagca 12600 tttttaaggt gataaattat ctaaactcac ttgcaccgga tgtaaaacgt catttacgag 12660 atttgataac tataggcgtc atattacaaa ttcatcattg tgttcaatgg attcggatga 12720 tatgactcct aaaaaagccc gcctgacaac tagaacacct ttacaaggta tttttgtaaa 12780 tattctttat attattatta ttataacttg tttttgtttt tagaagttac tactaacgct 12840 ccatcatgtt catttcatca attttgtgaa atatgcaata gtcactacag aggatccaga 12900 ttaaaccatt tacgttcttt aaaacataaa agtgtcatct ccgccaaaat taatgatgat 12960 gatgatttcg ttgaagaata tcaagttgcc ttcaaatcaa gaattacctc ataccgattg 13020 aaaaattcac aactagatga cttggaagtg tgcaattttt ttgaaagaaa tcaagataaa 13080 cttatcactt taataactag aaaactgaga gaatttaacc aacttaaaat taatttcgaa 13140 ctttttgcta aatatatttt accttctaaa gacattatcg aaatcaaatc attcaacact 13200 aaaaatatca tcatttcaat atcgacaaat attaaggtag aactggaaga aatatgtggc 13260 gtcataaaaa acaaaatgag tgaattcttg gaaaaagata gtggtaagta atatcaattt 13320 tatttataaa catattattg tattaaattt tttaggttgg attttaatgg aaattttgtt 13380 tttggaactt aatatcaaca aattcaatcc actacgtgca tcaacatata taccattacc 13440 gaaggacata gcgaagcgga gagcggttat caatattcat aatgatgata atgaatgttt 13500 caaatggtct gttttggcac accttcattc aactgaagta aatgttaatc gacatcgagt 13560 tgatgcttat cgtgagtttg aagaggaatt agattttaca gatatttcat ttccggttaa 13620 attaactgat atttttaaat ttgaagcact taataatatt agtgtaaatg tctatggaac 13680 agagcagcgt tttgatcatg aaaaaggtaa atgggtatca gatattatag gtccctttta 13740 tttaacttct aagaaacgag acattcatgt aaatttatta cttatcgtag acgatgaaag 13800 aatgatcaaa cattactgtc tgataagcaa catttctcga ttaataagtt cacaactttc 13860 caaacataat catgctaaat ttatatgtaa cggttgtcta ttattttttt cgagtgaaca 13920 aaaattagtt aaacatcgta agtatgcttg taatgaagtt gtcactacat taccaacttc 13980 tgatctgatt ataaataaat ttggtaaaga agtgcctggt aatgaattaa catttcaaaa 14040 ttttgataaa tctcttaaag ttccttttgt attttatttc gattttgaat ctattttaaa 14100 acctttacgc aatgataatt cagccgataa tatccgtaca attgttacac atttacatga 14160 accatatagt ttcggatatt acataaaatg tgcatataat gataatttat caatttatcg 14220 gacttatcga ggaaaagatg ctcctaaaat ttttgtgaaa tggttgcaag atgatgtaaa 14280 tagactttat cacgatcacc tcaaacatat cgtaccactt gaaatgactc cgttagatga 14340 attggttttt caaatatcta cccattgtca tttatgtaaa gaacctttca aaacagatta 14400 cggaagtcta aattgtgata gagttcgtaa tcattgtcac ttaacaggtg aatttttagg 14460 acctgcacat tctatttgta atttaaatta taagattcca aaatatatcc ctgttttttg 14520 tcataattta actaattatg atagtcattt atttattaaa gctttagcaa cagaaaagac 14580 catcattaat tgcattgctc aaactaaaga aaagtatatt acattttcaa aaaaaatttt 14640 agttgataag gtcatgaaca cgactacaaa tacagttgac aatattttta taacgcttag 14700 attcgtagat tcgttccgtt tcctttcttt ttcattagat aaattatctc agactttaac 14760 ttctaatgaa tgtctaacca ttcggaaaca ctttccggac gataaccaat ttaatttgat 14820 aagacaaaaa ggtgtttttc catattcata tttggattgt tttaataagt tagatgatac 14880 cagtttacca ttgattcaac aattttatga taatcttaat aaaactgata tcagtgaaca 14940 tgattatgag cgagcacagg aagtctggaa tctatttgat tgcaaaacgt taggtgatta 15000 ttccgacatt tatttaaagt ctgatgtttt gttactaaca gacatttttg aaaatttcag 15060 aaaggtttgt ctcaaaattt atgaactaga tccagctcat tacttaacag caccttcatt 15120 aggctgggat gctatgttaa aaatgactgg aattaaattc gaattgttaa ctgatataga 15180 catggtacat ttctttcgaa aatctattag aggaggactt tgtcaatgtt caaaacgtaa 15240 agcagtagca aataataagt ttttatcaaa tcatgatcct tctaaaccaa cttcttttat 15300 aatgtacctt gatgcaacca acttgtatgg agcagcaatg tctgaatatt taccttatag 15360 cggttttcga tgggaagatc ctaatgaatt caatgataac ctaattctca atttgacaga 15420 taacgctgat gtaggttata tattcgaagt tgatttagaa tatccttctg aattacataa 15480 tcttcacaac gatttaccat tttgtcctga aaaatttatt cctccctcta gtaaaactga 15540 gaaactttgc gcaaccttga acaataaagt acactatgtt ttacattatc gaaatttaag 15600 acaagctata caaggtggtt taaaattaat taatattcac cgtgttttaa gttttaatca 15660 atcaccatgg ttgaaaacat atattgattt aaatacagaa atgcgtaaca aatcaaaaaa 15720 cgcgttagaa aaagatgtat ttaaattaat gaataattca gtttttggta aaaccatgga 15780 atcaattgat aaaagagtaa acgtgaaatt ggtaacatca tggaataata taggtagaaa 15840 accaggtgct gaaagtttaa ttgcacgacc aaattttaaa aattcatcta tctttacacc 15900 aaattttgta gctattcaat tagagaacga aaaagttttc tacaataaac ctctgtatgt 15960 aggttttact attctagata tcagtaaaac tattatgtat gatttttatt ataatttcat 16020 aaaacttaaa tacggtaaca atgcaacgct tttgtatacc gatactgata gtttagtaat 16080 tgaaattttt tgtgaaaata tttatgatga tattaaatct aatatagagc gttttgatac 16140 ttcaaattat aaagcagaaa atattcatgg aatacctgta tcgtcttctg tcatcgggaa 16200 gatgaaagat gaatataaag gacgtgtaat atcagaattt ttaggaacag gtgctaaagc 16260 atattgtgtg gatgttgaag gtgaattatc taaaaaagct aagggtatta aacacaatgt 16320 aattacaagt gaattacata aaattgacta ccaaaacgca gttactgttc caaacactga 16380 aattattaaa gaaatgaata tctttcgttc gaagcttcat aacatttaca cagaattaaa 16440 gaaaaaaatt gctttatctt tcaaagatga taaacgttac atcttaaaca attctgaagg 16500 tagaacatta tcttggggtc ataaaaatat tcttcctcaa actcaggaat aatatttaat 16560 ttgtattgta gttattgtat tataatgttt atgtattata atatgtatgt atgtatgtac 16620 ttatatttag atgtaataaa caattaaagt aatttattgt ttttattatt atccattaac 16680 caaatagctt atgtgaattt gtttcaggtt tccaaaatag acagcactat ccactctacc 16740 tacctgaggt tcgactcccc atggaaggaa caacaacaac aacaactttt tttgcattat 16800 attttaaaca aataacaaaa taaacttggt gcgcaacaga caagttaggc aaacaacatt 16860 acaaaaataa aaaactacat atactaggtt ttttttcctt atcgataaaa taatgacttt 16920 cctttattag ttattagtga caagcgcgat gattttctga tattttgctt aactcaatac 16980 agcatgtatt gtatatatac ttcatgtatt cacaattgca catttcaaaa atcatatatt 17040 caaaatttca caaccccaaa aaccaaaaat ttcacaaccc caaaaagcta ctactactac 17100 tactactact acttgttttc ttggttgcat attcgcaaac gggcgggggg tgtgtttaaa 17160 aaaaaaaatg gaggccagcg ccatctttga ttgacggctg cgctgacggc agcgccatct 17220 agcgaaaagt ttgcagaccg tggcgccatc taacgtgaga atttaggcgg aatttgtgac 17280 gacatcggcg ctaccaacat tacacccaat attaattttt aaataaatta aaattttggc 17340 gccatttttt ttaacaaaca aacatggccg actaatttca aatattaatt ttgaaataat 17400 ttaaatttgg cgccaattta ccattaaaca aaaatgaccg ccgccgactt tcaaacttcc 17460 aagatggcga cgattattat tattattatt attattatac tacataaata tatttgcatc 17520 atcacaaatg gcgtcagtcg gccgcaggcc ggctgaccag ctggtcgcag gccagcgagc 17580 tgcgattacg cgcttgcgtc tgcggctgac tgacacaatt ttcacacatg cgcagtagag 17640 ggaggtggag gtggtttttg aaccggctat ctctatctac t 17681 // ID Gypsy-90_AA-I repbase; DNA; INV; 7220 BP. XX AC supercont1.249; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-90_AA_; KW Gypsy-90_AA-LTR; Gypsy-90_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7220 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.249; Positions 152520 159739. XX CC Positions [3537-4082] - Reverse transcriptase CC Positions [5169-5645] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3234..6023 FT /product="Gypsy-90_AA-I_3p" FT /translation="MDFWQKFQIWPTVKDCAMIEANIPPIPESQQDSFTES FT ELQQLVEVRKLFVGAQPDKLTITPLIEHRIEISEEWRGKPPVRQYPYTLSP FT KVQQKVAEELERMLSIGIIERANSDWCSNVVPVIKPTGKVRLYLDARKINE FT RTVRDAYPLPHPGRILGQLPRAKYLSTIDLSEAFLQIPLEKASRKYTAFSI FT QGKGMFQFTRLPFGLVNSPATLSRLMDRVLGHGELEPNVFVYLDDIVIVSE FT TFEHHVQLLREVAKRLTEANLSINIDKSKFGVSELPFLGYLLSTEGLRANP FT EKVRAIVNYERPTTVTKLRRFLGMANYYRRFIEDFSGITSPLTDLLKTKSK FT VLGWSEAAENSFNLIKEKLISAPVLACPDFTEEFTLQTDASDVAVAGILTQ FT IQDGFERVIAFFSHKLTTPQRNYHACEKEALAVLLSIEAFRGYIEGSHFTV FT ITDSAALTHIMSAKWKTASRCSRWCLELQHHDMTIRHRRGKENVVADALSR FT SVATLDAKTNSDTPLVSPAISNTNPNPTDSALTTYEDLLEKVSEHPDEHVD FT FQLRDGVLYKYVANTSEPHDDRFEWKIVPHPNERTDIIFECHDNSMHPGVD FT RTLSRIRLRYFWPRMVLDVREYVGKCTTCKETKAPNIALAPPMGERRITSH FT PWQIIALDFIGPLPRSRTQNQYILSVVDLFSKWIMLIPFRKIDSKNLCKAL FT RDQWFYRNSVPEVLITDNASCFLSHEFRSLCSRFDIRHWLNSKYHSQANPV FT ERVNRTVNAAIRTYVKSDQKLWDTRLSEVEAVLNTSEHSATNFTPFFATHG FT HEMFLKGSDHCFGADDPQISHAERGKNQAELFGDIKNLIQEKLEKAHQESL FT KRYDLRHRAYGKLLQAGQMVYRRNMKQSSALDDYNAKYGPQYLPSKVIRRI FT GSSSYEIADLDGKSVGVWPAIHLKPA" FT CDS join(412..1557,1561..2844) FT /product="Gypsy-90_AA-I_1p" FT /translation="MAVSNQKLVVMKLKQMFYDIRVDHLAEDELDFELEVR FT KIVFNDNEAIARKRRALREALKAEKTVENPFLVLKRDPAGEFKVCVDKFTE FT IDGSIRVSAKAVPPRYESRLLHLGTRLILLENHLSTEEKIEVTRMQEAVLA FT RLNEYFYDKRAVAQVEQEDAGIDRLFDEIENPVGQASEIGPTGLPPITPVV FT QSEGSSLPGDDADILESLRRLGLMKGDLQQVESIDVKEALLSLEHEIYLLR FT KFKELHSTLPASSVASVTQAATFTQPIHTTYTGTIPKQSVPVVVTTVWNPP FT IGSIPPVCSHATIPGLSNPYFQSLTNSQYSWSNYVPPNTQPNYSTAMTGAP FT GYRPDVSVAYPMINFLTQTTTASSIPRITTSVYPNYCVPSHISYASPSNSC FT SNTAPTTLYERGQTNQVITSGVDPWRLPIPSGGSSNPLLGSVSNPLIIPSS FT GNTYVPQTVPGWPLSGPAIPSHQQAGVADPPRSHYGHKSLPVSKWKLEKYA FT GTDQGLKLNEFLVLVSQLALSERTSEAELFDSAFHLFTGPALNWYMTMRSS FT GRLVSWSHLVTELRKTFAHPELDSLVRTRVYQRRQQRNETFQEYYYDMESM FT FRSMIVPMSDHEQLDILKRNMRADYKKTLLWKPIHSLPDLLEAGHMIDASN FT FSLYAKVFGNEKSANAISETRTDRGESKGQNHRPPYKQGQYQNTKFNPKST FT ESNLKKKSDQDNKKPTLTTQAGQSKEPDPKEGPSKPTRTLEMLIESHRPPR FT SYECLYCRQTNHSLEQCRNYRGSLCMVCGFKGFETQNCPYCQKNGLQTVQK FT RRPSSPSA" XX SQ Sequence 7220 BP; 2168 A; 1573 C; 1596 G; 1883 T; 0 other; attggcgatc caactaaaaa aaatatcctt gttccagaac atttttctgc ttgctccggg 60 agactcacga ttgttccttc gttcatcgtg attgtttcgt tgtgtttcgt cgaaatagat 120 gctaaaaggt aagatttact atctgtttga tccattttca taaatcaaac caattatata 180 tgctacattt tagattatcg ggtttatttt ccatagttgt actatttttc tattgaattg 240 aattgattag gttaaacaag tgaattgaat tagaatttaa gaatcgaatt gaattgaaat 300 ttaggattaa ttgaattgaa tttgaattta ggattaagta aattgaattg agaagtgaat 360 ttatgctatt gctaaggatt tgagattaaa aatttggaaa agaaactgaa gatggcggtc 420 agtaatcaga agttagtagt gatgaagctg aagcagatgt tttacgacat tagggtggat 480 cacttagccg aagacgagtt agattttgaa ctggaagtga ggaaaattgt tttcaacgat 540 aatgaggcta ttgctaggaa aagaagggct cttagagaag ctttgaaagc ggaaaagaca 600 gttgagaatc cttttttggt tttgaaacgc gaccctgccg gagaatttaa agtttgtgtg 660 gacaaattca cggaaataga cggttctatt agagtttcgg ccaaagcagt gccacccagg 720 tatgaatcga ggttgttgca tctaggaact cgattgatcc ttctagaaaa tcatttatcg 780 acggaagaga agattgaggt taccagaatg caggaagcag tcctggctcg actgaatgaa 840 tatttttatg ataagcgagc agtagcacag gttgaacaag aggatgcagg gattgataga 900 ttgtttgatg agattgaaaa tccagttggt caagcgtcag agataggccc cactggacta 960 ccaccgatca caccagtagt acagagcgag ggaagctcac tgcctggtga cgatgcggat 1020 attttggagt ccttacgaag attgggatta atgaaagggg atttgcaaca agtggagtcg 1080 atagatgtca aagaagctct gttgtcacta gaacacgaaa tttacttgtt gagaaagttt 1140 aaggaactac attccacgct tccagccagt tccgttgcta gtgtgaccca agcagctact 1200 ttcacacagc cgattcacac gacttatacg ggtacaatcc ctaagcaatc ggtgccagta 1260 gtagtaacca ctgtctggaa tccaccgata gggtcaattc ctcccgtgtg tagtcatgca 1320 accattcctg gtctttcgaa tccctatttt cagtctctaa ccaacagcca atactcatgg 1380 tcaaattatg tcccacctaa tacgcaaccg aattattcga cagctatgac aggtgctcct 1440 ggataccgcc cagatgtttc agttgcgtat ccgatgatca attttctcac acagactacg 1500 acggccagtt cgattccacg tatcacaacc tccgtgtatc ctaattattg cgtgccgtag 1560 tcacacatat cttatgcttc accgagcaat tcttgttcga ataccgctcc tactacactg 1620 tatgagagag gacaaacgaa tcaagtcata acctcgggcg tcgacccgtg gagattaccg 1680 ataccgtcag gagggagttc caatcctctg ttaggatcag tctcaaaccc tttgattata 1740 ccgagctctg ggaacactta cgttcctcag actgtcccgg gttggccatt atcaggccca 1800 gcaataccgt cgcatcagca ggcaggtgtt gcagatcctc ctcgcagtca ttacggccac 1860 aaatcgttgc ccgtatcaaa atggaagcta gaaaagtacg ccggaactga tcaaggactc 1920 aaactgaatg agttccttgt tctggtgtcc cagttggcac tttcggaaag aacttcggag 1980 gccgagcttt ttgattctgc ttttcatctg ttcacagggc cagctctgaa ctggtacatg 2040 acgatgcgct cttcgggacg tcttgtcagc tggtcacacc tggtcactga gctccgtaaa 2100 acgtttgccc accctgagct tgattcgctg gttcgcacta gggtttacca aaggcgtcag 2160 caaaggaatg aaacatttca agaatactac tacgacatgg agagcatgtt tcgatcaatg 2220 attgttccca tgagcgacca cgaacagctc gatatcctga aaaggaatat gcgagcagac 2280 tataagaaaa cgcttctttg gaagccgata catagtctac cagatctgct cgaagccggt 2340 cacatgattg acgcttcgaa tttctcgtta tacgcgaaag ttttcggtaa tgagaaatca 2400 gcgaatgcta tttcggaaac gaggactgac cgaggcgaat cgaagggtca aaatcataga 2460 cctccttata aacaaggtca gtatcagaac acaaagttca atccaaaatc aactgaatca 2520 aacctaaaga agaaaagcga tcaggataac aagaaaccaa ctctgactac acaagccggt 2580 cagtcaaaag aaccagatcc aaaagagggt ccctccaaac ccacgaggac gttagaaatg 2640 ctcatcgaaa gccatagacc tcctcggagc tacgagtgtt tgtactgccg acaaaccaat 2700 cattccttag agcagtgtag gaattacagg ggttccctgt gcatggtctg cggttttaaa 2760 gggtttgaaa cccagaactg tccttactgc caaaaaaacg gcttgcagac ggtccaaaag 2820 cgccgaccgt caagcccaag cgcgtaaatc ctgtccctac agggattacc cttgccgaat 2880 tttgggaacc agttcgggaa gagttttatt cgagtgatga gtcccaggtt ctcaacattt 2940 gtattggaga cacgcacgac aaccgacctt acgcgaaaat taaaatctat ggccgaccct 3000 ccaaaggtct cttagattct gggagtcagc tcactttaat tagcgagcat gtgttccaca 3060 agctaaatgg acaaaaactg cgtccagtca gacaacctgt agtggtgcga tccgcgaatg 3120 ggtctgagct agaggttctg ggccaactct cgatcccttt caattttgga gggtgtataa 3180 agataattcc caccctcgtg gtgaaaactc tttccgcaga atgcatacta ggtatggatt 3240 tctggcagaa gtttcagatc tggcccactg ttaaggactg tgcaatgatt gaagccaata 3300 tcccaccaat acctgagtct cagcaagatt cttttacaga atctgagcta cagcagcttg 3360 ttgaggttag gaaactattc gtaggtgccc aaccagataa gctaacaatt acacctctaa 3420 ttgagcaccg aatagagatt tccgaggaat ggagaggcaa acctccggtc cgccaatatc 3480 catatactct atctccaaag gtgcaacaaa aagtggcaga agaattggag agaatgctat 3540 cgattggtat tattgagaga gctaactccg actggtgctc caatgtagtg ccggttataa 3600 aaccaaccgg aaaggttcgt ctctacctag acgctcgcaa aataaacgaa aggactgtac 3660 gtgacgcata tcccttgccc catcctggcc ggatactcgg tcagttgccc agggccaagt 3720 acctgagtac catagattta tcggaagcgt ttctccaaat cccattggag aaagcttcca 3780 gaaaatatac agcgtttagt attcagggca agggaatgtt ccaattcacc aggttgccgt 3840 tcggattagt gaatagtcca gcaaccctgt ccagactcat ggatagagtc ctcggacatg 3900 gtgaattgga accgaacgtc ttcgtatacc ttgacgatat tgtcattgta tcggagacgt 3960 tcgaacatca cgtgcagctc cttcgtgaag tggcgaaacg tctgacagaa gcgaatcttt 4020 ccattaatat cgacaaatct aagttcggcg ttagtgaatt gccgttccta ggataccttt 4080 tgtcaacaga gggtctaaga gctaaccccg aaaaggttag agccattgta aactacgaac 4140 gacctacaac ggtcacgaaa ttacgaaggt tcctaggaat ggcgaattac tatcgccgtt 4200 tcatagagga cttcagcggt atcacgtcac ctttgacaga cctgctaaaa acaaaatcga 4260 aggtgttagg atggagtgag gcagctgaga attcgttcaa cctaattaag gagaaactga 4320 tctcggcccc agtccttgcc tgtcctgatt tcaccgagga attcacactg caaaccgacg 4380 caagcgacgt tgcagtcgcg gggatcctaa cccagatcca agatggattt gaaagggtga 4440 ttgccttctt ttcgcacaag ctgacaactc cacaacggaa ttatcatgct tgcgaaaagg 4500 aagcgcttgc cgtgctattg tccattgaag cctttcgagg atacatagag ggatcccatt 4560 ttacggttat taccgactcc gccgcactaa cgcatatcat gagcgcgaaa tggaaaacag 4620 catctcggtg tagccgctgg tgcctcgagt tacagcatca cgatatgacg atacgccatc 4680 gtcgagggaa agaaaacgta gttgccgatg ccctctcgcg aagcgtagcc actctggatg 4740 ctaagactaa ttcagacaca ccactggttt cccccgctat cagcaacacg aacccaaacc 4800 ctactgactc ggctttgacc acttatgagg atcttttgga gaaagtctcc gaacatccag 4860 acgaacatgt tgattttcaa cttcgcgatg gtgttttata caaatacgtg gctaacacca 4920 gtgaaccaca tgacgaccga tttgagtgga aaattgtgcc tcatcccaac gaacgcactg 4980 acataatttt cgaatgccat gataattcaa tgcatccagg ggttgatcga accctgagcc 5040 ggatccgttt gcgatatttc tggccaagga tggtcctaga cgttcgtgag tatgtaggta 5100 aatgtacgac ctgcaaagaa actaaggccc ctaatatagc ccttgcacca ccaatggggg 5160 agagacgtat cacgtctcac ccttggcaga tcatcgcctt ggacttcata gggccattgc 5220 ccagaagtcg gacccaaaat cagtacatat tgtccgttgt cgacctgttc agcaaatgga 5280 taatgctgat tccttttcgc aaaatagata gcaagaatct atgcaaagcc ttacgagatc 5340 agtggtttta ccgtaattca gtacccgaag tactaattac ggataatgcc tcctgttttt 5400 tgtcccatga gtttcgttca ctttgtagta gattcgatat tcggcattgg ctgaattcta 5460 aatatcactc acaggcaaac cctgtagagc gagtaaatag aacagtcaat gccgctattc 5520 gaacctatgt aaaaagtgat caaaaacttt gggatacccg attgtccgag gtggaagctg 5580 tcttgaacac ttcagaacac tcagccacaa atttcacccc gttcttcgca actcatggac 5640 acgaaatgtt tttgaaagga tcggaccact gtttcggtgc tgatgatccg caaatatcgc 5700 acgctgagcg tgggaagaat caagccgaat tgttcggcga catcaaaaat ttgatccagg 5760 aaaagcttga aaaagcccac caagaaagcc tcaagcgcta cgatctgcgt catcgcgcct 5820 atggaaaact attacaggcc ggacaaatgg tttatagacg aaacatgaag cagtctagtg 5880 ccttagacga ctataatgcc aaatacgggc cacaatatct accgtcaaag gtaatacgaa 5940 gaataggatc ttcatcttac gaaatcgctg atttggatgg aaaatcagta ggcgtatggc 6000 cggctatcca tctcaaacca gcatgaaaac tcacgaccaa ctatatgtta tgatggtcat 6060 tgacctgcta agaactacac gcgaaccaaa tatttacaca gacgttcatt gacctcagct 6120 tctgtctggg aaggttgatg atcgctagct atgaactcca atcgaatgtt tacaagtcgg 6180 ccgctgaact ccatctcgtc ccgggaaatt cagcatcagc aacaaacacc tcagtgaccc 6240 gattttttgc cttcggttct gataattctg gttgagcggg aataatcagc tctccagttc 6300 atcggtcctg caagcagcaa ttctccatcc tctaagctaa gctcttgtcg aagaaattaa 6360 ttagcaaaat catttccata ctacgtaaag tttagtcttc tcgtctcaag ctacgatgat 6420 gctttgaatg aagtttgttt ttatgtcgtt ttgtgttgct ccggttttac aaccgtgcgt 6480 tcacaaggca aactgtcaac cagtggaatg gttcacatcc ttcccaaaaa aaaaaggaag 6540 tacgagggac ccgtcgtttt gaataaatcg acgaggtcac agatcaaatt agagggaaaa 6600 cttcctcttt ctgtgtgaaa gaggagagat ttttgcctac tgctgaaaag ttagaaacaa 6660 gagttataaa ttaaatagcc tcttcgttgt attagaagag atacgtgttc ttctaatcga 6720 taatatctat ggagataccg cttgtctcct aggtgattta gatttacatt tattataaga 6780 catcgtggaa tgaatggtct attggatgat tggtgaatga atgtgttttg agggtaccca 6840 tatgaatgag agtgtaaatg gaaacaatat aaaaatgaga atgaatgaaa gcaaaataaa 6900 attgaaaatg aaaatgaaaa gaaaattact agaattaagg aacagaaatt aggaattggt 6960 aatgttgata attttttttc acagcaacct gaacaatcag tttaatttaa acatacgtac 7020 aacacggtaa aagtaaattc cactaatcta caaatagata gtatccataa ttgatagtaa 7080 taaaattaag gatccggaga attgcttact tggggcatac taattcacga ggacttcgtt 7140 tcgagaaaag atgaatccca gaaaaattac caagctgcaa tttccagcct ggcaattttt 7200 ctctggggag aagggatgat 7220 // ID Transib-19_HM repbase; DNA; INV; 3454 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-19_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3454 RA Bao W. and Jurka J.; RT "Transib transposons from the hydra genome."; RL Repbase Reports 9(2), 459-459 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1587..2327,2281..2997) FT /product="Transib-19_HM_1p" FT /translation="MYTGEEALALILDCRLSKADYQTISTIRIGAVSKGNK FT LYPTYNEVREEKDLCIPRCGIQTTDYSASVDLQSLIDHTTYRLLKILNESV FT KAPEKSILVFIHKVGFDGSTGQSLYKQSTEDKISISYVEEQSLFLSCIVPI FT KLYIEESGESLWTNEKPSSTLYCRPVRFQYVKESAEVITQEYISFKNSNLK FT PTIFNSFVIKHKLECTMVDGKVATIVSPLSNSFQVCSICGLIPFKYEQSKC FT CFFKNLSPSNMNNLNVAFSKIYNTVSTELGLSTLHAWIRCFECCLHISYRI FT ELGKWQIRGEDDKEIVIKKKIIVQQKMKDYLGLIVDVPKSSGSGTTNDGNT FT ARKVFRNYKLSAEILGLDKNLMKRFYVILCVLSCKHQINADELEKYTKETA FT ELYVSLYPWFPMPQIVHKILIHSSQVVRDKPVPVGLLSEEAQECQNKDVKN FT YREHFTRKYSRKITNEDVMRRLLCSSDPIISSLRRSSRMY*" XX SQ Sequence 3454 BP; 1291 A; 503 C; 510 G; 1150 T; 0 other; cacagtgggc cagagatcgg cataaatgcc gaaaatgagt atttgtggaa actttttttt 60 ttttgcaata ttttaaaagt aaatacccca caataaaaaa atctgtcgtt taaatatttt 120 aatttttgtt ttaggagtcc agggaaagtt ttcaagcttg tccatttaaa aaaaaaaaaa 180 aattttttaa aaaaaaaaaa atactcataa aattaaattt catctcgtaa gcgcagtttt 240 ttttaatata attaaaattt tataatctca atatcagggt acatgcttta ttttagaaaa 300 tttaacaaat aaaaataatc ttgtatttta ggactttttt ttaactttaa atatgacagt 360 taaaagtatt acaaaacata ataaaaagtt accataaatg gttcaagatc ttttttcgcc 420 cacgtttccc cactatggga ttaacttttt tacagtgtct taatattgta gaaggtttct 480 cattaaaaat atcttgcgca aagttgcgcc aattaaagtt tagcaggttt taaacataca 540 actcgtttgt catttttaat tgttcaaagc acattgttgt taagtatgaa ctctaaataa 600 ttaattgcaa ttacggaaac tacaaaaaac attaaaatat aatgctattc cactttatta 660 agtaatgtgg ttttttttaa tttataaact taaacttatt tcagctaatt atcaataatg 720 tttttaatta tgtaaaatac gaatcctatt taaaaacttt aaaaaactat aattatattt 780 ttaacttgaa attttcttct accaatcact atgaaaaata aagaaattac tcaaaaattg 840 tttcatgctt tggaaaacaa cttaatcaaa tcagttaccg accctgatgc ttataaagtt 900 acagggatct ctgacataga cttcaaatta ttttctaaac agtttaacaa aaaatggaat 960 aaagtaaaca gacacaagga tagatttgag ttgcattaca aaatctggtt agaaaaagaa 1020 ctacaaacaa gcgatgcatg ttttgaaaca gataaactag acaagggcca aaactcgggc 1080 ttggcttttc ctacttcttc tcacaattta ggtatgatat tttataaaaa ttaaagtaat 1140 ggacttatga tgtttttcac tgtttaattg tgttttagca aatgtttata gtaaatacaa 1200 tcatctgata ttttaatata aattagttgt ctacggaaga ccaccaggtg caaattccat 1260 accattcacc caagcttctg aaaaaacaaa aaatgaggag agcagaaaaa acaaatattg 1320 ctgtcgaaag agaggaatta ttttactcac ttattttgaa actaaaagag gaaaaaaggt 1380 tcagcgatgc caaggtacat taacctcaat actatagtca tttttaattt atcttattta 1440 ttaaaaaaaa atctcaaaaa tactaagggg gataattttt cttagtggat tgtattcacc 1500 tcactcttca aatttacatt tttacactta aggttgtgaa gcaaatatta aaatcagaga 1560 aagatggtgc agttaattca acaacaatgt atactggaga ggaagcatta gcgttgattt 1620 tggactgccg acttagtaaa gccgattacc aaacaatttc aacaattcgc attggagcag 1680 tttctaaagg aaacaaattg tatccaactt ataatgaagt aagggaagaa aaagatttat 1740 gcatacctcg ttgtggaata caaactactg attattcagc ctcagttgat ttacaaagcc 1800 ttattgacca tactacatac aggttattaa aaatactcaa tgaatctgta aaggctcctg 1860 agaaatccat tcttgtcttt atacacaaag taggatttga tggttcaacc ggccaatcat 1920 tatacaaaca aagtacagaa gacaagataa gtatcagtta tgtcgaggaa caatctttat 1980 ttttatcctg catcgttcca attaaacttt atattgaaga atctggcgaa tcactatgga 2040 ctaatgaaaa accatcctct acactttact gccgtccagt acgatttcaa tatgtaaaag 2100 aaagtgctga ggttattaca caagaatata tttcctttaa gaacagtaac ttgaaaccaa 2160 cgatctttaa ttcctttgtt atcaaacata agttagaatg cacaatggtt gatggaaaag 2220 tggctactat agtttcgcct ttaagtaatt cctttcaagt atgtagtatt tgtggactga 2280 tccccttcaa atatgaacaa tctaaatgtt gctttttcaa aaatctataa tacagtttcc 2340 actgagcttg gattgagcac attacatgca tggattaggt gctttgaatg ctgccttcat 2400 ataagttata gaattgaatt aggaaaatgg caaattagag gagaggatga taaagaaatt 2460 gttataaaaa aaaagattat tgttcaacaa aagatgaaag attacttagg tttaattgta 2520 gatgtaccaa aaagcagcgg ttcaggtaca acaaatgatg gaaacacggc acgaaaagta 2580 ttccgaaact ataagttgtc tgctgaaata cttggcttag ataaaaatct catgaaacgt 2640 ttctacgtaa ttctttgtgt tttatcgtgc aaacatcaaa ttaacgcaga tgaacttgaa 2700 aaatatacca aagaaacagc agagctctac gtttctctgt acccatggtt tccaatgcct 2760 caaattgtac ataaaattct aattcatagt tctcaagttg tacgagataa accagtacca 2820 gtaggtcttc tttctgagga agctcaggaa tgtcaaaata aagacgttaa aaattataga 2880 gaacacttta cgagaaaata ttcaagaaaa atcacaaatg aagatgttat gcgaagatta 2940 ttgtgcagtt cagatccaat tattagctct ctaagaagga gttcgagaat gtattaaaga 3000 acttgaactt aacaaagatt gtattaaact tttgttgtaa ctttttttta atttaaagtt 3060 ttgaatattt ttactctgat ttctgataaa atatacaaat aaataaatgt ttctttgttt 3120 ataaaatatc taataaataa atgttttaat aaatgtttct ttgtttatta aagggggtct 3180 aacacgtcac aaatatttta attaattaat gcttcgaatc tgggaaaaac tgtttctaaa 3240 aacaaccaaa agttatgtct aaaatctcta acgccctata cacccttttc tattagggtg 3300 tttagggtgc ctgcatctgc aaaaaaaaaa aatttttttt taaaaaagat cttcatgatg 3360 actagagtca ctgataaaaa tttcaatttg acaactttaa aaataaagga gttatgattt 3420 tttgtccatt ttatgtcgat cttggcccac tgtg 3454 // ID Gypsy-68_AA-I repbase; DNA; INV; 7690 BP. XX AC supercont1.280; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-68_AA_; KW Gypsy-68_AA-LTR; Gypsy-68_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7690 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.280; Positions 1015611 1007922. XX CC Positions [4761-5240] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 376..2364 FT /product="Gypsy-68_AA-I_1p" FT /translation="MALYYFFASHLLNDEVDYELKLRNYGEECSKSLDSKR FT RTLRRLVNQDKQENRDYRSMYSIDQEFDLISSRVNAIACALSQNVDAKLIS FT RLGHYHSRALRSNAHSTEAKAMKDSLVKQISDLVALYRPKSPMIPSGGHEE FT SSQEEDIDHEKAQNKLTGSEGQNDGSKLGLEIRKGKEIHNESGTNVGQSNL FT ELKVQNLEAQMTEIMSMLQQVLAKQQQNGQGQKEPEATVQVNRTGAIPKTS FT ATQMSNSGLQASPLEESRNRLHLGDNFVRNLGESRQNTAGTRLGQSHNQFM FT GECYQGTNSLQHQGFPIQAPIRDGLPRAGFRTVNPPISPQYEVQGEQSMGN FT TFVGQTRYFPGYGTWNENHRNEPFGSPSGSENFQRDCRMQYDRRIEKWNIY FT FSGVSRSPTLEDFIYKVKVLASMNGIPRDILISHIHLLLRDEASNWFFTYY FT EANWNWDEFETRIRYRFGNPNQDQGNRQQIYERKQLKGETFIAFVTEIERL FT SKLLTNPLTAQRKFEIIWENMRQHYRSKLACFQINNLDQLIQMNYRIDASD FT PSLHPVGQKQGVNNIEVDSEGEPSEDEEINELSRQYQRGQGVPRHQDGTRE FT RTNEAARVPLCWNCRKNGHFWRECREAKTTFCYVCGNPGRISTTCESHPKR FT DSARGVAGSQNSGN" FT CDS 2784..5624 FT /product="Gypsy-68_AA-I_2p" FT /translation="MIQNGTNFEEVALIQPTPMDTERVESLHFFIHPIENL FT PNLEKTDPDDSLDIPGLELPEPSRATPESIETEHVLTPEERSQLTEVVREF FT PCTNENRLGRTTLIQHEIVLRDEAKPRRQPLYRCSPSIQAEMEKEIERYKK FT LDAIEECSSEWANPLVPVRKSNGKIRVCLDSRRINALTKKDSYPMRDMKGI FT FHRLENAKYFSVIDLKDAYFQIPLKEECRDLTAFRTSQGLFRFKVCPFGLT FT NAPFTMCRLMDKVIGFDLEPHVFVYLDDIVVATKTLSEHLRLLRIVANRLR FT LANLTISLDKSRFCRKRVNYLGYLLTDEGIAIDNSRIEPIMNYARPKSVKD FT IRRLLGLAGFYQRFIHGYCRIVAPISDLLRKGQKKFVWTEAAEESFQELKA FT ALVSAPILANPDFRLPFVIESDASDNAVGAALIQHIDGEQRVIAYFSKKLS FT STQKKYASVEKECLGVLLAIEHFRHFVEGSKFKVVTDARSLLWLFTIGVES FT GNSKLLRWALKIQSYDIELEYRKGKQNIIADCLSRSIEVVSLASLDQEYQD FT LIEQITKSPQNYRDFKVVDEQVYKFVKRQDSLEDTRFCWKRYPRRDDRVQI FT VREIHEKAHFGFLKTLAAVREKYFWPLMSTQIKSFCQKCVKCQASKASNIN FT TTAPLNVQRKIAEYPWQFITMDYVGPLPASGRGRNTCLLVITDVFSKFVLI FT QPFRQATAESLVPFVESMVFQLFGVPEVVLTDNGSQFVSKPFQELLATYHV FT THWKTPSYHPQINDSERVNRVLTTAIRATIKKDHKEWSNNIQIIANAIRNS FT VHEATKYTPYFVMFGRNMISDGREYRHLRDTSAENGNMGSEDREKLYSEIR FT ENIRKAFEKHSKYYNLRSNANCPRYTIGEKVLKRNTELSDKGKGYCAKLAP FT KYVPAMIKRVVGEHCYELEDEKGKRIGIFNCKFLKKFSLPTSN" XX SQ Sequence 7690 BP; 2397 A; 1438 C; 1739 G; 2116 T; 0 other; tatttggcgc ccaacgtaaa atttatcgtt atacttattc aatgtatttg taatgttgag 60 aaaatattca attctgtttt ataagtgtaa agttcatgtt aatattagtt ttctgtgctt 120 ggttatgaaa atttagtttg agttaaggtt gatttcggat atttgtgaat caaggcttct 180 tgtcgggaat ttattaagga aaattgcaaa ttacagttac ttttaaagta aaatttacgt 240 ttatatctca tatctgttgt attttttttc gtcgcttgta gtgttttgtg gggtgttcta 300 attattacat ttcttttcta gttatatttt gacatacctt gttctatctt ctctgttttt 360 tttttttttt tcaaaatggc tttatattat ttctttgcgt cgcatttatt gaatgatgag 420 gtagactacg agctcaaact tagaaattat ggggaagagt gtagcaaaag cttagattcc 480 aaacgaagaa cgttgagaag gctagtcaat caagataagc aggaaaatag agactatcgt 540 tcaatgtatt ctatcgatca ggaatttgat ctaatttctt ctagagtaaa cgcaatagct 600 tgtgctttgt cccagaatgt tgatgcgaaa ttaatctcac gcctgggaca ctatcattca 660 agagcgctgc gtagtaatgc tcactcgact gaggcaaaag cgatgaagga ttcgttggtt 720 aaacaaattt ctgatttggt ggctttatac agacccaaaa gcccaatgat accgtcaggt 780 ggacacgaag aatcctcaca ggaggaagat atcgaccatg agaaggctca gaataaatta 840 actggttcag agggccaaaa tgatgggagt aagttaggct tagaaatacg caaaggaaag 900 gaaatacata atgaaagtgg aacaaacgtg ggccaaagta acttggagtt aaaagtgcaa 960 aacttagagg cacaaatgac agaaattatg tcgatgttgc agcaggtgtt agcgaaacaa 1020 caacagaatg gtcaaggtca aaaggaacca gaagctacag tacaagtaaa tagaacaggg 1080 gccattccga aaacttctgc aactcagatg tctaattcgg gtttgcaagc atcaccattg 1140 gaagaatcta gaaatagatt gcacctcgga gacaactttg ttaggaactt gggagaaagt 1200 cgacagaaca ctgccggaac cagattagga cagagtcaca accagtttat gggagaatgt 1260 tatcaaggca caaattctct tcaacatcag gggtttccaa tacaggcacc cattcgtgac 1320 ggtttaccgc gggcaggatt tcgaacagtc aatccgccaa taagtccgca atatgaagta 1380 cagggggaac aatctatggg caatacgttt gtaggtcaaa caagatattt tccagggtat 1440 ggtacatgga atgaaaatca ccggaatgaa ccgtttggaa gcccaagtgg cagtgaaaat 1500 tttcagaggg actgtcgtat gcagtacgat agaaggatcg aaaaatggaa catttatttc 1560 tcaggagttt cacgatcgcc cactttggaa gacttcattt ataaggtcaa ggtattagca 1620 agcatgaacg gcattcctag ggatatcctg ataagccaca ttcatttgtt actacgagac 1680 gaggcttcca actggttttt tacctactat gaagcaaact ggaattggga cgagttcgaa 1740 accagaataa ggtataggtt tgggaatcct aaccaagatc agggaaatcg acaacagatc 1800 tacgagagaa agcaactgaa aggagaaacc ttcattgctt ttgtaacgga gattgaaaga 1860 ctgagtaaat tacttacgaa cccactaaca gctcagcgaa agttcgagat tatctgggag 1920 aatatgcgcc agcattatcg gtcgaagtta gcatgttttc agataaacaa tctagaccaa 1980 ttaatacaaa tgaactaccg aattgatgcc agcgacccga gtttacatcc ggtcggtcag 2040 aagcagggag taaacaacat tgaggtagac tccgagggag aaccatcaga ggatgaagag 2100 atcaacgagc tgagcagaca gtatcaacga ggccaaggtg tcccaagaca tcaagatgga 2160 acaagagagc gcacaaacga ggcagcaaga gtaccgctct gctggaactg ccgcaagaac 2220 ggacacttct ggagagagtg cagggaggct aagacaactt tctgctacgt ttgtgggaat 2280 ccaggcagaa tatcaaccac atgtgaaagt catccgaaac gtgattcagc gcgtggggtg 2340 gcgggttcac aaaattcggg aaactgaatg tggagtgcgt agttggggac cttagcattc 2400 ctggggagtc ctctgttccc atccaatcga aactgtatat tgatccgttc aatagccttc 2460 ttgaagtgaa aatccaaact aatcaatgcc ctcaggtaaa agtggccata tttggagtag 2520 aaattgatgc ccttctcgat tctggggccg gtatcagtgt tgccaattcc agtgatctga 2580 ttgatcgtca tggactaaaa ctgttaccgt cgccaatcaa aatttgcacg gctgataaga 2640 cacaatattc ttgtactggt tatgtgaacg ttcccatcca gttcaaaggt gtgaccagag 2700 tggttgctct tgtaatcgtg cctgagatct ccaggcagct tatactagga atcaattttt 2760 ggagggcgtt caacattaaa ccaatgatac agaatggtac gaatttcgaa gaagtagctc 2820 ttattcaacc aacgccgatg gataccgagc gagtggaatc tctacatttc tttattcacc 2880 caattgaaaa tttaccaaat ctcgagaaaa cggaccccga tgactctttg gacattccag 2940 gtttggaatt accggaacca tctcgagcta caccagaatc gattgagact gaacacgtgt 3000 taacgccgga ggaaaggtca cagttaacgg aagtagtgag ggagtttcct tgcactaatg 3060 aaaacaggtt aggacgaact acattgatac agcatgagat cgtactacgc gatgaggcaa 3120 agcctagaag acaacctttg tatcgatgct ctccctccat tcaagcggaa atggaaaaag 3180 agatcgagcg atacaagaag ctagatgcta ttgaggagtg ttcgagtgaa tgggcgaatc 3240 ctttggttcc cgttcggaaa tcaaacggga agatcagagt atgcctggac tcaagacgca 3300 tcaacgcact aactaagaag gattcgtatc caatgaggga tatgaaagga atattccatc 3360 gcctcgaaaa cgccaagtac ttttcagtga tcgatttaaa agatgcttat tttcagatcc 3420 cgctcaagga agaatgtcga gatctgactg catttagaac atcgcaggga ttatttagat 3480 ttaaggtgtg ccctttcggt ctcacgaatg caccgtttac aatgtgccgc cttatggaca 3540 aggttattgg cttcgatttg gagcctcacg tattcgtata tttagacgac atcgtggtgg 3600 ccaccaaaac tttgtcggag cacttgcgtc tactacgaat cgtggcaaac cgtttgaggc 3660 tggcaaacct tacaatctcg ttggacaaat cgcggttttg cagaaaacga gtgaactact 3720 tgggatattt actgaccgat gaaggtattg caatcgacaa ttctcgaatc gaaccgataa 3780 tgaactacgc gagacctaag agcgtcaagg atattaggcg cttgttagga ttggctggat 3840 tctatcagcg atttattcat ggttactgta ggattgtcgc gcctatctcc gatctgttga 3900 ggaaagggca gaagaaattt gtatggacgg aagcagcaga ggagtccttc caggagctca 3960 aagcggcatt ggtgtcggca cccatattgg ccaacccaga ttttcgttta cctttcgtta 4020 tagaatcaga tgcgtctgat aacgcggttg gagcagcatt aatacagcac atcgatgggg 4080 aacagcgagt cattgcttat tttagcaaga aattaagcag cactcaaaag aaatatgcga 4140 gtgttgaaaa agagtgcttg ggagttctct tagcgatcga gcatttccga cactttgtgg 4200 agggcagcaa gttcaaagtt gtaaccgacg caagaagctt gctgtggttg ttcacgatcg 4260 gagtcgagtc aggaaattcc aaactccttc ggtgggcact caaaatccaa tcatacgaca 4320 ttgaattgga gtacaggaaa ggtaaacaga atataatagc ggactgtctt tcacgctcaa 4380 tagaggtcgt ttctttggca tcacttgacc aagaatatca agatttgata gaacagatta 4440 ccaaaagccc acaaaactac agggacttca aggttgtcga cgaacaagtg tacaaatttg 4500 ttaaaagaca ggattcgttg gaagatacgc gcttctgttg gaagcgatat cctcgaagag 4560 acgatcgagt acaaatcgtt cgcgaaatcc atgaaaaagc ccactttggg tttttgaaga 4620 ctctagcagc tgtgcgggag aaatattttt ggccgttgat gagcacacaa atcaagagtt 4680 tctgccagaa atgtgtaaaa tgtcaagcca gcaaagcgtc gaatataaat acgactgcgc 4740 cattaaatgt tcaacggaaa attgcggaat acccctggca attcataacg atggattatg 4800 taggaccttt gccagcttct gggagaggaa gaaacacttg tctgctagtg atcactgatg 4860 ttttcagtaa atttgtgctg atacagccct tcaggcaagc gaccgctgaa tcactagtac 4920 cgtttgtgga aagtatggtt ttccaacttt tcggggtccc ggaggtggtc ttgacggata 4980 acggatctca attcgtatct aaaccgtttc aggaattgct agccacgtac catgtaaccc 5040 actggaaaac tcctagttac catccacaga ttaacgattc cgaacgagta aatcgtgtat 5100 taacaaccgc cataagagca acgatcaaga aagatcataa agagtggtcc aacaatattc 5160 agattatagc caacgcaatc cgaaattcgg tccacgaagc cacgaagtac acgccatact 5220 ttgtgatgtt tggccgtaac atgatatcag acgggaggga gtatagacac ctgagggata 5280 cctcagctga gaatggaaat atggggagtg aggatcgaga aaaactgtac tctgaaatcc 5340 gtgaaaacat aagaaaggcg tttgaaaagc actcaaaata ttacaattta aggtcaaatg 5400 caaactgccc taggtatacg attggagaaa aggtcttgaa gcgaaatacg gaactgtcgg 5460 acaagggaaa agggtactgt gccaagctgg ccccaaaata cgtccctgct atgattaaac 5520 gagtcgtggg agagcattgc tatgagttag aggatgaaaa agggaaaagg attggcattt 5580 tcaactgcaa attcctcaaa aagttctctt tgccaacctc taattaggta aactgctatt 5640 tttcaagcta tgaaacctat ttattaggtt acaaaacgtc taagggcgtg ttaaaataca 5700 taagtggcaa aaactttaac ttgctcgatg ataatctata ataatgcatt tattcatcag 5760 ctatgtattc atttaatgga aactacgcac tctatgagtt gctcaaaaat caacaaagtt 5820 gcattttgca ttggtctcct cattatcgag tcgagctttc attgatcaag ctatgtatat 5880 ttaaaaagtg gacaaagcaa ctttcaaaca agttgcacaa aattgtagcc tgccattcat 5940 aaggtgctcc tcggatttcc gagttgagca gtcccacaac aacatgattc attactggag 6000 acaagcctta tgaatgagcg cggatcacta ttctcacttc aatgacgatt ccattcactt 6060 tcataatccc tacacgacag ctatgaatct caacctgagt aacaatgtac gaaaaacact 6120 ggcgaggaaa cgaagcaata ggtgaagagt cgtgtctacc ttccctgagc tacctgtact 6180 tataattagt tatcctaagt taccttaaac tagattaacg tttctcttaa tatcactata 6240 gttacgtttc gtcgatgatc tgtcgtattc agtcgtctgc gtcaaagtgt acatattagt 6300 tttcgtccat caagtagatg tagtttccaa taagatatcc gtgatcttca aaaactttgg 6360 tcgtggccgt cctgtgattg tatgccattg atacatggat tcattcatgt tgtagataat 6420 cgttgaatac tcattgatgt ttcctcataa gttgccagtg tacatatagg ttaagatacc 6480 atagatcagt ccgagagttc cgcagtgctt cggtagtaag ccaccaataa gaaataattt 6540 ccaataagaa ttcatcaatt gccgcagcat actgcattgc catgtgtaac aaaatcccac 6600 gtcgcggcac ctgcagagag aaaccacaaa gcaataagtt ctttgcagaa gagcaagtaa 6660 ttaatactcc ccccgtaaaa caactccagt tgaagccgat gccaatagtt tttgccattt 6720 tgtaatgaaa tagcctgcaa cctgcctgga actggaattc cagcgcgcac gattaaagct 6780 acggactctt tcagctcaca gtgttcctat ttttcgttga cagttgatga gaatgggttt 6840 ccggtgtttg acagtttgat ggctgcgtac gtgttaggtg gattcagtac ttgggtgtga 6900 agattattgt gaagggcagt agaacagttg agggatttta atttcggtgc ttttttgagt 6960 ttcgggtgac cagaacaaga ttctggtgtg gaaactcaac taagtcttag gtttaggtta 7020 cgttttgtga gatgaatgga catattgcct aaggttaatc tttggggtga gccagatctc 7080 cacaaaatag attattgcgg taaattcggt aaagacttct atactcagcg taatgattga 7140 ggttgttgtc gaatctgaat gagcctgtat tcaggatatt tgatgatttt atgagtggta 7200 tttttccgga ttcatctgaa tctgccaaaa gtacagttga attaagttgt gagagatttt 7260 ttttaagttt gtaaggattg agtacaccta acacgttgca ttatcaagct agttgaacat 7320 tttttatgca tttttttttt cgttgataag tagccttgta aatatgaatt cacaatcaaa 7380 atttaaagat aaattataga taaaaataat tagttgacag atttactaga gaagctaaga 7440 attcgctgga gaccggagtc attgacgctg atggtcagta gcaggtacga tctgtgatgg 7500 atctgtattt tggaaatagg gaaggagtca acttttagtt gatgatgaaa tcatatgacc 7560 ccgaattcat cagcctctaa gtaaataatg tttattaatt ccattttgta aatagtagca 7620 gtaagaaaac ccttacgaaa atttagttga aggaattcaa ctaaattttc gtaaccttag 7680 catggagtga 7690 // ID CR1-108_AAe repbase; DNA; INV; 4939 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-108_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4939 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1196-1196 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >95% CC identity. XX FH Key Location/Qualifiers FT CDS 1746..4862 FT /product="CR1-108_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MEAPNPLISVEPLPPATSSHPGPEFECVEGVFQPITA FT GKYVHVSNNSRSVXSIDSSRQAPMEARLDDGAIEIIESGDVPAAEDSVKER FT MFSMYYQNVRGLRTKIARLRLMMSSCDYDVIVFTETWLREDIDSSEISSDY FT AIFRCDRSSRTSQHLRGGGVLIAVKNVLNCESVHLNDCDELEQVAVCVKLR FT QRSLCIIAVYFPPNSNVELYTAHAKAVQEMTDRLSADDIILSIGDFNLPSL FT RWCLDEDINGYIPMNALTETERCLTEGLFACSLRQVNSFVNVNDRLLDLVF FT VNLPEYFDLVAPPSPLLTIDNHHTPFVLLLDNSEIPTLLDDQLNCFNFDYA FT TCDFDLLNSNFGRIDWDLQLFSDTVDQMLAAFYVLLNDVVSNHVPRKRKAS FT VSVFSKPWWTPELRNLRNGLRKMRKRYFLSKSDIDRVQLRQREAEYEQALS FT STYENYLLRIQSSVKQNPSYFWDFVKQRKSNNRIPFSVSFNGVTSRSNVEA FT ANCFASFFEGVFSRTSPVQRNNHFAQITSHDIHLPRFEFSQDAVRKVLEGL FT DTSKGPGTDDIPPVLLKNCANTLALPIAAVFNRSLRDGIFPKAWKLAAIVP FT IHKSGNLNCVSNYRGVSILCSLSKVFEKMIHEVLYSAAAPYISISQHGFMK FT NRSTTTNLMCYVSEISRGMETKNQVDAVYVDFAKAFDTVPHNLIIDKMKHL FT GFPDWFTKWLYSYLSGRSAFVKVNSIRSRYFSIPSGVPQGSVLGPLIFIIY FT INDLSELISCSKLSFADDLKFFRTIASPADCFDIQADIDLLLVWCGDNGMR FT VNEKKCKVISYTRRSNVIHHEYCMGLNPLERVNSICDLGVTIDSKLRFNEH FT INIIVAKAFTVLGFIRRHASGFNDVYCLKTLFCSLARSILEYATPVWSPYY FT ATHTLIIERVQKHFLRFALRQLPWNDPQNLPCYPDRCKLINLETLSARRTK FT LQRLFIFDLITGNVDSPELIQLIPWNTPPRRFRNTLMFAVPFHRTNYGSNN FT SLNSCLRSFNDVSNAFDFNLSKDMFKLRIRSIE" FT CDS join(147..920,802..1827) FT /product="CR1-108_AAe_1p" FT /translation="MAKQCAKCCEAINGIXYVVCRGYCGAFFHMNACSNVT FT RALQSXFTTNRXNLLWMCDKCADLFENSHFRAISTSADQKSPLNTLTTAIT FT ELHTEIKKINSKPNVQFSPAAXWPIITQRRATKRPLETIVPARASENCCVG FT SKQPLGNVVSVPVCKNEDPKFWLYLSRIRPDVSXEAVSAMVKANXDIDDDL FT VVVKLIPKEKDITTLTFVSFKVGLDPSMKSKALDPETWPQGVLFREFEDYG FT SQKFRFXRTRNRQHLQRSQKRWIXKHGLRACYSXSLRITDLKSFVXLVQET FT GNTFSDSANIVSCYSGNGFELNGLHPGRTILSNPEALDPPXTVVPFQPAFN FT SRPGPVFGFGDGVPVXSGKYTXIKENYVPDSFIVSSEPSQCHQLPSALIVP FT APGCKLRSPMEAPKPLISVEPFLPATSSHPGPEFECVEXVFQTVXXGEYVH FT VSNNSRPVMSIACRHHSXPVXFDVSNHSSHHQLXSSAXIISGCTPASHMEA FT PNPLISVEPPGPEFECVRGSSKPSRRVVQCEQFSSSNVHRFSTICACTSTF FT PIIRHIINFHRPGKSFRPRDARLQASWKPPILSSQSSHSRQRPAVIPVLSL FT SV" XX SQ Sequence 4939 BP; 1348 A; 1167 C; 1038 G; 1357 T; 29 other; ggcaawtttt ttctgatcac cgcgacatct gcagaataat agttgaagca caaatcgcat 60 acaagttcca atacaagcca cgtttgcaac tacaacatac aakcgggcat aagtaatacg 120 cgctgcataa ccacaggcga ttcacsatgg caaagcaatg cgcaaagtgc tgtgaagcaa 180 tcaatggwat ckattatgtt gtctgtcgcg gttattgtgg cgctttcttc cacatgaacg 240 cgtgttcgaa tgtgacacgt gcgctwcaat ccwacttcac aacgaataga aakaatctcc 300 tctggatgtg cgacaagtgt gcggatctat tcgaaaactc acactttcgc gctatwtcga 360 cttctgcsga ccaaaaatca ccattgaaca cgcttaccac tgcgatcaca gagctacata 420 ctgagataaa aaagataaat tcaaagccta acgttcaatt ttctcctgct gctawctggc 480 caattataac tcaacgtaga gcsacaaaga gaccgcttga gacgatcgta ccggcccgag 540 cctcggaaaa ctgttgtgtt ggaagtaaac agcctctggg caatgtcgta tctgtaccag 600 tttgcaaaaa cgaagaccca aaattttggt tgtacctgtc cagaattcga ccagatgttt 660 cawtcgaagc tgtttcagct atggtsaaag ctaacwtgga tatcgatgat gatctcgtag 720 tggtgaaact cattccaaaa gagaaggata taacaacact cacattcgtg tccttcaaag 780 ttggtcttga tccgtcaatg aagtcaaaag cgttggatcc mgaaacatgg cctcagggcg 840 tgttattccg kgagtttgag gattacggat ctcaaaagtt tcgttttsct cgtacaagaa 900 accggcaaca ccttcagcga tagcgcaaac atcgtctcct gctactccgg taatggattt 960 gaattgaatg gactccatcc gggacgcacc attttaagca atccggaagc cctcgatcca 1020 cccwtcacag tcgtgccatt ccagccagcg ttcaacagtc gtcccggccc tgtgtttggt 1080 tttggtgacg gggtgcccgt tgmttcaggc aagtacacac kcattaagga aaattatgta 1140 cctgattcgt tcatcgtttc cagtgaaccg tcacaatgtc accaacttcc atctgcattg 1200 attgttcctg ccccgggatg caaacttcgc agccctatgg aagctcctaa acctctcatc 1260 tcagtcgagc cattcctgcc agcgaccagc agtcatcccg gtcctgagtt tgagtgtgta 1320 gagsgagtct tccaaaccgt aawtscaggc gagtatgtgc acgtgtcgaa taattctcgt 1380 ccagtaatgt ccatcgcttg tagacaccat tctkcgcctg tamacttcga cgtttccaat 1440 cattcgtcac atcatcaact twcatcgtcc gcaasaatca tttcgggatg cacgcctgca 1500 agccacatgg aagcccctaa tcctctcatc tcagtcgagc cacccggtcc tgagtttgag 1560 tgtgtgaggg ggtcttccaa accgtccagg cgagttgtgc agtgcgaaca attctcgtcc 1620 agtaatgtcc atcgctttag caccatctgc gcctgcactt caacgttccc aatcatccgt 1680 cacatcatca actttcatcg tccaggaaaa tcgttccggc cccgggatgc acgcctgcaa 1740 gcctcatgga agcccccaat cctctcatct cagtcgagcc actcccgcca gcgaccagca 1800 gtcatcccgg tcctgagttt gagtgtgtag agggggtctt ccaacccatt accgcaggca 1860 agtatgtgca cgtgtcgaac aattctcgtt cagtamaatc catcgattct agccgccaag 1920 ctccgatgga agctcgattg gatgacggag ctattgaaat cattgaatca ggagatgtac 1980 ctgccgcaga ggatagtgtc aaagagcgca tgttttcgat gtattatcaa aatgttaggg 2040 gattgcgcac caaaatagct cgtttgcgtt tgatgatgtc cagttgtgac tacgacgtga 2100 ttgtcttcac tgagacatgg ctacgcgagg atattgacag cagtgaaatc tcttccgact 2160 acgcaatttt taggtgcgat cgaagttcta ggacaagcca acatttgcgt ggaggaggag 2220 ttctgatagc tgtcaaaaat gtcttgaact gtgaatcagt tcatttgaat gattgtgacg 2280 aactcgagca agtggcagtt tgtgtgaaat tgaggcaacg gtcgctgtgc ataattgctg 2340 tatactttcc accgaattca aacgttgagc tgtacaccgc tcatgcgaag gcagtgcaag 2400 aaatgacaga tcgattgtca gccgatgaca tcatcctgtc gattggtgac ttcaatcttc 2460 ccagcttgcg atggtgtttg gacgaggata ttaatggata catccccatg aatgcactca 2520 cagaaacaga gagatgttta actgaagggt tgtttgcctg cagtctccga caggtcaaca 2580 gttttgttaa cgtaaatgat agacttctgg atctcgtttt cgtcaacctt ccagagtatt 2640 ttgacttggt agcgccgcct tctccacttt taactattga caatcaccat actccattcg 2700 ttttactact tgacaacagc gaaattccga ccttacttga cgaccagcta aactgcttca 2760 attttgacta tgcaacctgc gactttgatt tgcttaattc taatttcgga cgaattgact 2820 gggatcttca attgttcagt gacacagtag atcagatgtt agccgctttc tatgttttac 2880 tcaatgatgt agtcagcaat catgtaccta ggaaaaggaa agcgtctgtc tccgttttta 2940 gcaaaccgtg gtggactcct gagctaagaa atctccgtaa tggcctcaga aaaatgcgta 3000 agcgttattt tctctcaaag agtgacatcg acagagttca gctgcggcag cgtgaagctg 3060 agtacgaaca ggctctttcg tcaacgtatg agaattattt gctaaggatt cagtcatctg 3120 tgaagcaaaa cccttcgtac ttctgggact ttgtcaagca acggaaatcc aacaaccgca 3180 ttcctttctc ggtcagtttc aacggagtga catctcgttc gaatgtggaa gcagccaatt 3240 gttttgcctc attttttgaa ggcgttttta gcagaacctc acccgtccaa cgtaataacc 3300 actttgccca gataacgtct cacgacatac atcttccacg cttcgagttt tctcaagatg 3360 cggtacgaaa ggttctagaa ggactggaca cctcgaaagg acctggcacg gatgacatac 3420 ctccagtgtt actaaagaat tgcgccaata cacttgcatt gcccattgct gctgtgttca 3480 atcgttcgct ccgagatgga atttttccaa aggcatggaa actggcggca attgtaccga 3540 tccacaaatc cgggaatctc aactgtgtct ccaactatcg tggtgtatct atcctgtgca 3600 gtcttagcaa ggtgttcgaa aaaatgattc acgaagttct gtatagtgct gctgctccgt 3660 atatctcaat tagtcagcat ggctttatga agaacagatc aacgacaaca aacctgatgt 3720 gctatgtgtc agaaatatcc cgaggcatgg aaaccaaaaa ccaagtcgac gcagtttatg 3780 tggattttgc gaaagctttc gacacagtac cgcacaatct gatcatcgat aaaatgaagc 3840 atctaggttt ccccgactgg tttacaaagt ggctgtactc atacctctct ggacgtagcg 3900 cttttgtaaa agtgaactcg attagatcaa gatactttag cattccatcc ggagtaccgc 3960 aaggtagcgt acttggacca cttattttca ttatttacat taatgacctg agcgaactga 4020 tctcttgttc caaactatcc tttgccgacg acctaaaatt tttccgaacc atagcttctc 4080 cagctgactg tttcgatatt caagctgata ttgatctatt gttagtttgg tgtggtgaca 4140 atggcatgcg tgtcaatgag aagaaatgta aggttatatc ttacactcgc cgcagtaacg 4200 tgattcacca tgaatattgt atgggtttaa atccacttga acgagtaaac tcaatatgtg 4260 atttgggggt cactattgac tcgaagctcc ggttcaatga acacatcaac atcatagttg 4320 ccaaggcatt cacggtgctg ggctttatac gtcggcatgc atctggattt aacgatgtgt 4380 actgcctgaa gacgctgttc tgttcactag ctcgcagcat tttagagtat gccacaccag 4440 tttggtcgcc ttattatgca acgcatacat tgatcatcga gcgtgtccag aaacattttt 4500 tgagatttgc tctgcgacaa ttaccatgga atgatcctca aaatcttccc tgctaccctg 4560 atcgttgcaa actgatcaac ctagaaactc tctcagcccg tcgtaccaaa ttgcaaagac 4620 tatttatctt tgaccttata acaggtaacg tagatagtcc ggaacttatc cagctgattc 4680 cgtggaacac accacctcga cgcttccgaa atacattaat gtttgctgtc cctttccaca 4740 gaacgaacta tggttcaaat aattctttaa actcgtgttt aagatcattt aacgatgtaa 4800 gcaacgcatt tgattttaac ctctcgaaag atatgttcaa acttagaata agaagtatag 4860 aataatatag tttttaagaa atcagtctgt acgacaaagt cgaagatggt gaataaataa 4920 ataaataaat aaataaata 4939 // ID Gypsy1-I_Dmoj repbase; DNA; INV; 8797 BP. XX AC scaffold_6123; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_Dmoj; KW Gypsy1-LTR_Dmoj; Gypsy1-I_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-8797 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1026-1026 (2009). XX DR Genome; scaffold_6123; Positions 9279 483. XX CC Positions [4870-5301] - Reverse transcriptase CC Positions [6434-6910] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3997..6102 FT /product="Gypsy1-I_Dmoj_1p" FT /translation="MVRTDVSLAGKTAEVNLLIMPTMLDHVILGMDFLCAI FT GTTVRCGNAELEMRMVDDVGEGASPSSEQRVEKGNSSLGKQRQPVEGVKEL FT DSEASHEAATPKEGPKSVIAVSTIGVQQASSQGTPPDENRDKKKRMEKGTP FT EERLRGSLPATRRDKKERSEIMEDKRGAGPPARAVKQSEDLRSGPEERSPE FT FKSVELEREGEPEGGPLTREAAESEYGGHERSAPNSKEWPDDLEKELQEFL FT EAELALFEDLRGVTHIAEHSIRMKDDKPLKQGYYPKNPAMQKVIDEQVDEL FT LQAGAIEPSKSPHSAPIVLVKKKIGDWRMCVDYRQLNAHSIPDAYPVPRIL FT HILKRLRHARFISTLDLKSGYWQIPMAADSREYTAFTVPGRGLFQWRVMPF FT GLHSAGATFQRALDSVIGPDMEPHAFAYLDDIVVIGATKEQHVANLKEVFR FT RLRKANLRLNRKKCSFFREKLAYLGHVISGEGICTDPAKVEAIRSLPSPSC FT LKELRQCLGMASWYRRFVPDFASLVQPMTKLLKKGQKWAWSEEQEEALQKL FT KESLTTARILACPDFSAKFVLQTDASDYGLGAVLTQEVEGQERVIAYASRK FT LLKAELNYSATEKECLAIVWAIRKMRCYLEGYRFDVVTDHLALKWLNSIES FT LTGRIARWALELQQFQFDVRYRRGGRRTVPTAHRQLQAGRGRKSPLRVDKE FT DA" FT CDS 6098..7162 FT /product="Gypsy1-I_Dmoj_2p" FT /translation="MRERIVKEPEKFRDYVEENGQLYRNLGHRIDEEDFIP FT WKLCVPSSLRCRVMRECHDAPTAGHQGVRKTAARLAQRYYWPGMFRDAAKY FT VRCCETCQRFKCVQQKPAGHMLTRQVAEPMAVFCADFVGPLPRSKRENTML FT LVFHDAFAKWVELVPLRKATTALLQLAFRERILGRFGVPRTFVCDNGVQFA FT SRSFKAFMESLGVTLQYTAPYSPQKNPTERTNRTVKTMIAQYIEGHQSSWD FT ELLPEITLAVNSSVADSTGFTPAFLMFGREPRLPAALYDEVTPGSATRETQ FT PEAKEVKMREVFNIVRSNLQRASKDQGRHYNLRRRDWRRKDSRRNWRPNST FT ALTRSLSSYPPT" XX SQ Sequence 8797 BP; 2882 A; 1719 C; 2403 G; 1793 T; 0 other; tgtttgtagg taataagtaa agatatcgaa tgaaatagag ttttgttatg gtgaatggga 60 tttgaacaag gataatttga ttttgtagtg ataaagggaa atgaaatgga ttttgttgca 120 gattttgatt tggttaggct atattttgaa gtgtaagagt tagtttgtaa gagtggattt 180 cgatatagat tttggtttac attttgaagt catttataag ttgataggag ttagttgtta 240 ttagaataag cgttagttcg taaggtagca ctaagtagtt gataattaga tgtaagtgcc 300 gtagcgatta gccttagaaa gtaagcgaaa ttcaaatgag tgggcgggca cgcgatcagc 360 aatgcagctc gaaagctcgc ggtggtgaat tgcattgctg gtcgcgtggt aaaatctgca 420 cagcggcaga agaggattga aagctgcgtg cagtttttgg tcagtctgga gctctcgatg 480 tgagcagacg cgctcaaaag ggaaaaataa aactaactag cagatgtaaa ggttataagg 540 tgaaaagtaa ggaaatgcac gtagagatgg ttgcgtattt attcgtagct agtttaataa 600 tcaatttgtt agtttactaa tcataactaa cactaactgg tagtttgaac gagggcgaat 660 cccgttcgcc agaaggttaa aactctggca tgcaaaaatc gcgtggtctg gagtgaacgc 720 cggtggcaac gctgggcgtt cctcgatccg cggtctaccg atcgtagtaa ccccattaac 780 attccatgac acgctttcgc agccccgacc cccgcaatct gaagtgcgct tgatcgcgcc 840 actcgattgt ggccgccctc ccgcgtgcaa acgcgtgggc gtgggcgtgg tcggcgagct 900 cgggccacgt ggtcgcctaa cctatacaac tttccccccc tctcccattt ctctaactca 960 atacgtcccg tatctagtta tggccctggg cagactgaga ctctgacccc ggctggaaaa 1020 taatgtaacc ttacaagtgg cgcccgagca gggacgcggg gctagagaga gagtggtata 1080 cgggcggcgt aaaaacccgc cggcataccc aaatagaggt gacgggcaat agaggcgatt 1140 gtgtagttgt ttggtggtta atttgaatat aggcagtgtg cgtataaaag cagcaaaagg 1200 caagaacatt catgataaag tgcgaaagtt aaaagtgaag acaccctacg acggccgcgg 1260 ggtcgggggg tcagaaattt cgggaagcgt cgagcgaaaa acgggactag aaaaccgttg 1320 cggcgccgaa aagtacagtg cgtgcgaaaa tattacgcag aagcgtaaaa ccgaaaacgc 1380 atatgcataa gcatgtgcac gcgcgggggc agcgtagcga acgtcgccga aatgcagcaa 1440 cgagaagtgg taagagcaga gtgtgtatat atgtatgcac acatcaaaag taagtaacag 1500 aggacaaaca aaaaaaaaaa aggagcggac agagtaaacg aaattaaaga tcagcaagat 1560 tattgatgcc gctgcaattg caccaaggag ttaatagttg agtataaaag tggttacaga 1620 tgaagtaaga gaaaagtgaa agcatttagt gccgacggca agcgaatata cccgccataa 1680 gaaagtgtag ttgtgaaaaa gaaatagtca agaaaacgcg catagtcaag ggtgcaagtg 1740 tgcatatatg tatatacatg agtgaatgta aggattagcg gggagagaac aaaaagtgaa 1800 atagtgaagt gtatatatgt atgtgcgcat agacgagtgc gcatatgtat atatatacat 1860 attaatgtgc atgtgagaca attagttaag agggggtgag cggcggagtg tatatatgta 1920 tgtgcgtggg tgcatgcgtg cgtatatatg tgtatacatg cgaagcgctc agttaagaga 1980 gagtgagcag tgaagggtac tatgcgcgca taggcaaggg tacatatgta tgtgtatatg 2040 tatgtgagaa aacaaattga gagagagagc gatcggtcaa gcgtatgtgt gtgattatgc 2100 gcatatgaca ctgggcgatt ttttttgtat gcttgcgcat gcgggtgcgt gtatgtgtgc 2160 aattatgcgc atgtgacact aggcgatttt tctgatgcgc atatgacacg aggcgatttt 2220 gatcgtggga gtggacagta ggccgagacc agataaaata aaataaaata aaataaaata 2280 aaataaaaaa aaataaaata aaataaaata aaataaaata aaataaaata aaataaaata 2340 aaataaaata aaaacgcact ggaacggagc ggagcggaac atagcaaggt taagtaggat 2400 aaagcgaagc gaagtaaaag caaagaagat aaagcgaagt caagtcaagt gaatatgaag 2460 tgaggcgata tgagataaag cgaagagaaa atgcagcgag gagaactgaa gcggggtgaa 2520 gcagaataga agcggagtaa taaagtaaag ggagataaaa taacacggaa taagtggaga 2580 cgggaaagtc aaccagagta aagagaagag gagtaagtca agtaaaacgg tagtgaaagc 2640 aaagcaagtt gaagcgagca aactgggata aagggataaa gttgagtcgg atgagcgaac 2700 ggaaagtgaa aataaaaaaa aaggaggggg aataagaaca attaagaagt aaaattgaaa 2760 gagaaaaaaa aaagaaatga aacaatgtaa aataatctaa actaggttaa gaaagaacaa 2820 agaaaagaaa taaaacaaaa tagattaaga aagaacgaag gaaagaaata gaacaaaata 2880 gattaaggaa aataaaggaa acgaaataaa acaaaataga ttaagaaaaa taaagaaaat 2940 gaaataaaac aaaatagatt aagaagaata aagaaagcga aatcaaaata aagtaaacca 3000 agaagggcaa agaaagagaa gtaaagtaaa ttgaggaaat taaaggaaaa agaaataaaa 3060 gaaagtacat taagaagaat aagataaaag agaataaaat agaataagac gaatgaagaa 3120 aagtaaacca aaactagatt aggtgtaata taagatagat taagacagta aaagcaaaca 3180 gaattgtata agataagaaa atttgaaaat ttgtttggtg gaccagatcc gggcgcgccg 3240 gcagcgggaa ggcgaaccct tcaaggacta cttgattgag ctgcgtctat tgatgcgtca 3300 ggctagatac agcccggccc aagaattgaa tagggcctac gagaacgcat ccccggaata 3360 cagattgtac gtacacgaca cgatttctca tctctaacac agctgaccca gatggccgcg 3420 gagttcgaga acgtgaggag tcaacaaaga gaacaaggaa ggaaggctac ggtggagacc 3480 gtctggccca agccagtgga ggtgcggcag agtaatccct tccgcgcaaa gacatctgat 3540 ccccgatctg gaggacaagc cgaggggcgg acagagaagc cggcggcgac acagtcggtg 3600 gaggcaacta gccggatgat ggcccaggac gctggaccta ggccaagccg agtttgttat 3660 cgatgtgcac aacccggaca tttcgcgcga gagtgtcata acgcacaggt gttgttttgc 3720 cgacactgtg gtcagcgagg ccggactgcg agggagtgct gcgaacgaga cagctcggga 3780 aacggccagg ggcgccctcc aagagagcgg agcggcgcca attaaacacg ggagatgagg 3840 agaggaaccc gctccgagtg aaggagtcaa aggacgaccg ccacgcgcag ttttgtcagt 3900 gaaggctgcg cgcgtcgacg gacgataaga ggcgagcgtt gccaggttga gtcgcggatc 3960 aggttggccg atggatcagc actggacgtg acggagatgg tgaggacaga tgtgagcctg 4020 gcaggaaaaa cggcggaagt aaatttgctg ataatgccga ccatgctgga ccatgtcatc 4080 ctcggcatgg acttcctatg cgccatcggc actacggtgc gttgcggcaa cgcggagctc 4140 gagatgagga tggtggacga cgtgggagaa ggagcatcgc cgtcaagtga acaacgagtg 4200 gagaaaggca acagcagtct tggaaagcaa cgccagcccg ttgagggagt gaaggagttg 4260 gactcggagg cgtcgcacga ggccgcaacg ccaaaagagg gcccaaagtc ggtaatcgca 4320 gtgtcgacga tcggcgtcca gcaagcgtcg agccaaggca caccgcccga cgagaatagg 4380 gacaagaaga aaaggatgga aaagggaacg ccggaggaga gacttagagg gagtctccca 4440 gcgacaagga gagataaaaa ggaaagaagc gagataatgg aggacaaaag gggagctggg 4500 ccacccgcga gagctgtgaa gcagagcgag gacttgagga gcggacccga ggagcggtcg 4560 ccggagttta aatcagtcga gttggagagg gaaggtgagc ccgagggcgg cccactgaca 4620 agagaagcag cagagagcga atacggagga catgaacgat ccgcgcccaa tagcaaggag 4680 tggccggacg acttagagaa agaattgcag gagtttctag aagcagagct agcattgttt 4740 gaggacctca gaggcgtgac gcacatcgcc gagcatagta tccggatgaa ggacgacaag 4800 cctttgaagc aagggtacta ccctaaaaat ccggctatgc aaaaggtgat cgacgagcag 4860 gtggacgagc tactccaagc aggggcgatt gagccgtcga agagcccaca tagcgcaccc 4920 atagtcttag tcaagaaaaa gattggggat tggcgcatgt gcgtagatta tcgtcaactc 4980 aatgcccact cgatcccgga tgcgtacccc gttccgcgga tcctgcacat tttaaagagg 5040 ttgcggcacg cacgattcat atccacgctg gatctaaaga gcggttactg gcagatccca 5100 atggcggcgg atagcaggga gtacacggcg ttcaccgtgc caggcagagg attgttccaa 5160 tggcgagtga tgccattcgg tttgcattcg gcgggggcga cattccagag ggcattggat 5220 tcggtcatcg ggcccgacat ggagcctcac gcgtttgcgt atctagacga catagtggtc 5280 atcggagcga cgaaggagca acacgtggcc aacctgaagg aagttttccg tcggttgcga 5340 aaggccaatc taagactcaa cagaaagaaa tgtagcttct ttcgggagaa gttggcgtac 5400 cttggccacg tgataagcgg agaggggatt tgcacggatc ccgcgaaggt tgaggcgatt 5460 cggagcctcc catcgccttc gtgtctaaag gagttgcggc aatgcctagg aatggcatcg 5520 tggtatagac ggttcgtgcc ggacttcgca tcgctagtac aaccaatgac gaagctcctg 5580 aagaagggtc agaagtgggc ctggagcgag gaacaggagg aggcgttgca gaaactcaag 5640 gagagtttga cgaccgcgcg tattctggcc tgtcccgact tctcggccaa attcgtgctg 5700 cagaccgacg ccagcgacta cggcctcggt gcagtgctga cgcaagaagt tgaggggcag 5760 gagcgagtaa tagcatatgc cagtcggaag ttactgaagg cggagctaaa ctactcggcc 5820 acggaaaagg agtgtctcgc gatcgtctgg gcgatccgaa agatgcgatg ttacctggag 5880 ggctacaggt tcgacgtcgt cacggatcac ctagcgctga aatggctcaa ctctatagaa 5940 agtctcacgg gccggatcgc tcgttgggcg ctagagttgc agcaattcca gttcgacgta 6000 cgctatcgcc gtggtggccg acgcactgtc ccgacagccc atagacagtt gcaagcaggc 6060 cgtggaagaa aatccccctt gcgggtggat aaagaggatg cgtgaaagga tcgtaaagga 6120 gcctgagaag ttccgggatt acgtggagga gaacgggcag ttataccgga atttaggaca 6180 taggatagac gaggaggact ttatcccatg gaagctctgc gtcccaagca gtttgaggtg 6240 cagagtgatg agggagtgtc acgacgctcc cacggccgga catcaaggag tgaggaagac 6300 ggcagcacga cttgcgcaaa ggtactattg gccgggcatg ttccgagatg ccgccaagta 6360 cgtgaggtgt tgcgaaacgt gccaaagatt taaatgcgtg caacagaagc cggccgggca 6420 tatgctcacc aggcaggtgg cggagccaat ggcggtcttc tgcgcagact tcgtgggacc 6480 tttgcctcgt tccaagcgtg agaacacaat gttgttggtg ttccacgacg ctttcgctaa 6540 atgggtggag ctggtgccat tgaggaaagc tactaccgct ctgttgcaac tggcgttccg 6600 ggagcgtata ctcggcagat ttggagtacc caggacattc gtttgcgaca acggcgtaca 6660 atttgcaagt aggagtttca aggcgttcat ggaatctttg ggggtgacac tccagtatac 6720 agcgccttat tcgccgcaga agaacccgac ggagagaacg aatcgcacgg tgaagacaat 6780 gatagctcag tatatcgaag gccatcagag ctcatgggac gagctgctgc cggagataac 6840 gctggccgtc aattcgagcg tggccgactc cacgggcttc actccagcat ttctgatgtt 6900 tgggcgggaa ccacgtctac ccgccgcttt gtacgacgaa gtcactcccg gttcggcgac 6960 tagagagacc cagcccgagg ccaaagaagt taaaatgaga gaggttttca acatagtgcg 7020 cagtaacttg cagcgagcat caaaagatca aggcaggcac tataacttaa gacggcgtga 7080 ctggcggcgg aaggattcgc ggcgaaactg gcgcccaaat tcgacggccc ttacaaggtc 7140 gttaagttcc tatcccccaa cgtagtgcgc ctaacgaagg agggcgaacg caagagaagg 7200 gtggccaaca ttgcacagtt gaagccattt catcaaggag atgaggaaat tgacatgatc 7260 ccggtaaccg agaccgggga aacggagaac aggcctatac gatgaccgtc gactcaccaa 7320 agccagaaag atcgaaaacc atcagcagat agacattttc caatacatca acaaagccag 7380 caatattttt caatacacaa atcaaaacca gcaagatcaa caactatcag cggagacaca 7440 tatccaatac acacaccaca gccaggaaga tcaaaaacca tcggcggtga cgcggagcag 7500 caaaatcaac aaacatcaac ggagacacat ttggcaatac acaaattgaa accagcaaaa 7560 tcaacaacca tcagcggtga cacattttct aatacatcac caaagccagc aaaatcaaaa 7620 accaacagcg gaaggtcatt ttccaataca tacatctagc cagcaaaatc agcaactata 7680 agcggagagt cattttccaa tacatcagct aaagccagca agatcaaaaa ccattagctg 7740 tgacacattt ccaatatatg tatattatta taaaagcatt tgaatagagt taatattata 7800 tcaaaagaat cataatacat cagcaaaagc acacaagaaa aaaatccatc agcggagaca 7860 cattttccaa tacctcacca acaccagcaa aatcaacaac catcagcgga gacacatttt 7920 gcaatacaca aattaaaacc agcaagacca acaaccatta gcgatgacaa attttccaat 7980 acatcaccaa caccagcaag atcaaaaacc atcggcggta acaaattttc caatacagca 8040 ccaaagccag aaaatcaaaa accaagaacg gagaaacatt ttcaaataca taccccacag 8100 ccagcacgat aaaaccacca tcggcggtga cacatttatt tatttattat accctgaacc 8160 cattaagatt gtatgtctac aaaagtattc aacaaataaa ttgtcataat attgcccaca 8220 ttgcagcgca gccgctgctt actgcgcagt tagcagagcg cgtgcttata agtgcatgtg 8280 tatgtatgtg tacatagaag aagtgcagaa tgattagaat atcagcttgg agggttcagg 8340 gtatctccta gtcgagcagt ctcgactgga accttcttac ttgtttattt atttaattaa 8400 ttaattaatt aattaattaa ttaattaatt aattaattaa ttatttactt atttatttat 8460 ttatttattt atttatttat gtttgaagct gccgagcttt tttgacaact ctggctccaa 8520 tcatttacga aggcaaaaaa taaaaaccaa aacgatataa tttagcttat tatttattaa 8580 tttaatgccg agtctgtgga cactgtttag gcggagttaa ttagattcga gacaaaatat 8640 gtacgtgttg cgcgaatgaa ggccgttttt gtacagagtt taaaagaata taaataaaga 8700 ggagtatgca attaacagag cggatgagat ttacatatgt tatgttatgt gtgttatgct 8760 ttgttatgtg caagagtgcg ctaacagcga atgagaa 8797 // ID Gypsy-44_CQ-LTR repbase; DNA; INV; 395 BP. XX AC AAWU01034529; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-44_CQ_; KW Gypsy-44_CQ-I; Gypsy-44_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-395 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 468-468 (2011). XX DR Genome; AAWU01034529; Positions 4492 4098. XX SQ Sequence 395 BP; 120 A; 112 C; 49 G; 114 T; 0 other; tgtagcgtac cagatgtaac atatcgaatt atccttatcc ttggattccc ataaatcctt 60 cccaaagctt actcaaattt cattacataa cttaagctat tatccgcatt cctccttctg 120 agaccttctt tgcataccca cacacgacga cactcaagta ataaacagaa ccttttcctc 180 attaccatga cctccttttt attgcaacca aatgttaacg ctgcgctgac tcttcaagta 240 cagcgcattt gtttaaaccc tggccacatg tttgaacaac atgttgaagc cagccaacgc 300 tttcgctagg tttcccaaac tctaagctca gaataaaata ccattcaagt ctgaacctct 360 gcagaacaca gtcattcata tcacaacgcc taaca 395 // ID BEL-18_AA-LTR repbase; DNA; INV; 405 BP. XX AC supercont1.352; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-18_AA_; KW BEL-18_AA-I; BEL-18_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-405 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.352; Positions 1369 1773. XX SQ Sequence 405 BP; 138 A; 69 C; 94 G; 104 T; 0 other; tgtttgcgag aaaatgtaac gatgtgcaat tactgaggat aaattatgtt attgagatgc 60 cgtcattgat ctttattaga acattttggt atcggcgcac cgcatgattg caagaaggca 120 taggtaggag ctttttctgt gtagagtgtt tgcgcgccta gattgaaagc taaccagaag 180 atgtgtagaa gagtaaaagt acagtccatt caagaaaaag ggaactgaag agcaagcatg 240 aatgaacagg caacagtcca ctaacaggct tcgtgaagtt atgttctcac ggagaaatca 300 aagtgaataa agtaaaagtt atattgaaaa tcaacccacg cgtttacagt tcaacttaat 360 gccaatccgg tgaaaatcca aatcccctga gctgtcgtcg gaaca 405 // ID BEL-20_AA-I repbase; DNA; INV; 5796 BP. XX AC supercont1.208; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-20_AA_; KW BEL-20_AA-LTR; BEL-20_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5796 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.208; Positions 1532580 1526785. XX CC Positions [4843-5427] - Integrase core CC 'GCCAG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2125..4050 FT /product="BEL-20_AA-I_1p" FT /translation="MTTSSNKIIQTRVSSRDGKYFAVLDFLVTPAITELPS FT GKIDTHLWPLPAGVELADPSFNVPEEVDMIIGAEVFYDVLKQGRLKIGGDF FT PTLAETTFGWVVSGPVNTQRKAAPRTRKHCHINTTQEDINQTLLRFWELEA FT GYVTTKMTATERAVEQHFGKTHSRNSEGRYVVRLPFNDLQNKLDDSYENAK FT RRCNRLISKKQKAKQKAYTDFMNEYRLLGHMQEIGNDPAGGYFLPHHAVCK FT ENSSTTKTRVVFDASAATTTGISLNDTQLVGPTVQSDLITLILRFCTHQVV FT LTADVPKMYRQVQVHPEDRRYQRIVWLNDKGEMVTFELTTVTYGCSSAPYL FT ATRSLIQLAKDEANEFPLASRVVKKDSYIDDFITGGRTAAEVVETYEQLQA FT MLKRGGFGAHKFCSNSSEVLRAIPAELQEQQVDFESSEVNNTIKILGLIWN FT PTDDYFAFNVTNLINQGTPTKRVVLSEISRLYDPLGFLGPVGTIAKLTMQE FT LWRLKLDWDDELPEEQMEKVVHVTFREQLLSIRNIRKKRCVIPAGAQRIEL FT HGYCDASKRAYGACLYVRCILADGKINVQLLCSKSRVAPLKPVTIPRLELC FT AGLVLAQLTRKSMEALEVNFDSVTLWSDSQIVLSWLKKSPLVLL" FT CDS 4819..5796 FT /product="BEL-20_AA-I_2p" FT /translation="MGDLPSYRVTPSPAFSKIGIDFAGPFVLKESGRKPRF FT FKAYVCVFVCMSVKAVHLELCTDLRSETFLAALQRFVSRRGLPSDLFSDNG FT TTFVGANHELASLRKLFEDQAHTEKLAEFCSAKGIIWHFIPPRSPHFGGIW FT EAGVKSMKHLLKRVVGETRLTYEEMATFLTEAEAVLNSRPLCPLSDDPNDL FT EALTPSHFLIGRPGLAIAEPSYGEQKINRLSRWQHVQSMREHFWNRWSTDY FT LHTLQTRKKWKDGVLDIKIGSLVLLRDENVPPQLWKMGRIVALHPGKDGVV FT RVITIKSSSGEYRRAVAKVCLPPDVESDDPRGGV" XX SQ Sequence 5796 BP; 1689 A; 1229 C; 1436 G; 1442 T; 0 other; ttttttggtc ctatcggaac cggatgtgga cagctcgcga ttttctggtg atttaacccg 60 gattcgttac cgatcgggag tgaatttagt ggatttgcac cgcgtgcaaa atggcgtagc 120 caatgattgc agtataccga ggtggtgtgc gtgcaaagat gcgatcgcaa atgctcgatg 180 ctttgtgaat cgatttttgg tagtgatgaa gtgattggat ctgcttgaaa tgcggaaagc 240 ttgaattgtt tagtgaaaat ccatgatttg ggttttaatt acgaaaatca gcatcagtgt 300 gtgatttatg cgcagacagt gctctgccac cgatgaaaaa agctccaagc tatcagaaag 360 aagaattagg ttacgtttga ttgtgggctt gccagagcga gtcggcaaat gtaaaagctt 420 gaaagctttc aggagtgtgt ttgtgtaaga ttgctacgcc atatctgagc gtcagcaaac 480 gcgatggttt gagctttccg aatgggaaca gtgttggatt gctgcatctc gacgatcagc 540 aaaagcgaaa acttgatagt tttccagaag aattgtggtt gtgttagtaa tgctgcacgc 600 agctaacgtc tgaaaagact tgtgaaagat tcattcgtat aagctcaaag ctttccgaac 660 gagaagtgtg gttatgtgcg attcatgttt gccgcgactg tgaatcggca attgcaaaag 720 ttaaagagcc actgaagaac aaagtgaggt tatgaatttg tttcggtgaa gcgcgatgct 780 taccgaatga aaggaaatgt gttttggtgg ttgtcaatac aacaaccgaa gaaaacttgt 840 caacattggg ttgacaacta tagtgaagtg aaaatcgaaa tgtaaccaca taaccgaaaa 900 atgaccggaa aaaaaagaag aatccaaaat tgaagagtgc tactcaaaat gaaaatgaaa 960 gtgatacacc gaaagtgaaa atgagtcaaa tctcgaagaa gagtcttcag ttcgatctct 1020 cgaaaaacaa aagtgacgaa aatagaacgc cggacactgt taatcagttg tcagaacaat 1080 ttccatcggt cgctcctgct ttgacagtga atcagcaact cgttgattgg agaagcgtca 1140 ggtaaaatag accaggatat aatcaataat aatgattatc aatcagcttg gcgaatgcta 1200 gaagatgcct acgaggatca acgtttgatc atcgatacgc atatcgacgc actgttcaac 1260 cttccaagaa ttacaaagga gaatggagaa gagttgcgga aattggtaga atcatgtaca 1320 aagcacgtgg atgcattgaa aacccatgat cttcctgtgg aaggattatc agaaatgatg 1380 ctcatcaaca tcattagcaa gcggctggat agagagacta ggaaacactg ggaatcgtca 1440 ttgtctcatg atgatcaacc tagttatgat gacttgttag aattcttgaa ggatcgttgt 1500 cgtattcttc agaaactgtc aaatcatggc caagccagtc aaccacagac agtaatcaag 1560 cccaagcaac agaaggtttt cgttcagacc agtaaagaaa catgcccatg ctgctcaggt 1620 tctcacaata tttacaaatg tgaaaccttc aagaaactac aaatccctga acgtttcgag 1680 aaggtgaaaa gggcaggact gtgcttcaac tgtcttcgca ccggacatcg tacagtggaa 1740 cgcaagacag ataagcaatg caaaacgtgt ggcaaacgcc accacagtta tttgcattat 1800 gaacgtgctg aagatcctaa gaagccgagc gacaacaatc aaccgaagaa ccctgtgccg 1860 caagcctaga ttgaacccga accggaacat ctagcggaaa aacgtacgat tagctgttgt 1920 acgcaagcaa cgtccgtgac gaagcagatt ttcctgtcaa ctgcgaaggt gttagtgagt 1980 ggatctggta gtgtgactac tacatgtaga gccttgctcg attgttgctc ggagtcgaat 2040 ctgatttcgg aaaaattggc tacaaaattg aacatcaaac catcggttat tgacccgccg 2100 attttgatct gcggtttaaa cggaatgacg acgagtagca acaagattat ccagacaaga 2160 gtttcgtcta gggatggcaa atactttgct gttctcgact tcctggtgac ccctgccatc 2220 accgaacttc cttcaggtaa gatagacacc catttgtggc ctctccctgc cggcgtagag 2280 ttggcggacc cctcgttcaa tgttcctgaa gaagtcgaca tgattattgg cgccgaggtg 2340 ttctatgacg ttctgaaaca ggggcgtctg aagattggtg gcgattttcc cacactggct 2400 gagactacat ttggttgggt tgtcagtgga cctgtcaaca cacaacggaa ggcagctccc 2460 cgcacgagaa aacattgtca tattaacacg actcaagagg acatcaacca aactcttttg 2520 agattctggg agctggaagc aggatacgtt accaccaaga tgactgcaac tgaacgcgct 2580 gtagaacaac atttcgggaa aacacactct cgaaatagcg aagggaggta cgttgtaagg 2640 ttacctttca atgatcttca aaataagctt gacgattcgt acgaaaatgc caagcgtcgt 2700 tgtaatagat tgatcagtaa aaaacaaaag gcgaaacaaa aggcgtatac agactttatg 2760 aatgagtatc gtctacttgg acatatgcaa gagataggca acgatcccgc tggtggttac 2820 tttctcccac atcatgcggt gtgtaaggag aatagttcaa ctaccaaaac tagagttgtt 2880 tttgatgcct cggcggccac gacaacgggc atatcgttaa acgatacgca attggtgggt 2940 ccaactgtac aaagtgattt aatcacactc atcctgaggt tttgtacgca ccaagtcgta 3000 ttgacagctg atgttcctaa aatgtaccgt caggtacagg tacatccaga ggaccgacga 3060 taccagcgga tcgtttggtt gaatgacaaa ggagagatgg ttacatttga gctgacaacc 3120 gtcacttatg gatgctccag cgctccctac cttgccacga ggagtttgat acagctagcc 3180 aaggacgagg caaacgaatt tccactggca tcgcgagtag tgaaaaagga tagctacata 3240 gatgatttta ttaccggagg acgaacggct gctgaagttg ttgagacata cgagcagttg 3300 caagcgatgc tgaagcgggg tggttttgga gcacacaagt tctgctctaa cagttcagag 3360 gtgctacgag ccataccggc tgagctacaa gagcagcagg tggatttcga gtcatcggag 3420 gtaaacaaca ccatcaaaat tttaggcttg atctggaatc ctactgatga ttattttgcg 3480 ttcaatgtga cgaacttgat caaccaaggc acacctacaa aacgagtcgt tctgtctgag 3540 ataagtcgac tttatgaccc gctaggtttc ctaggaccgg taggaaccat tgcgaaatta 3600 actatgcagg aactatggcg attgaaactt gattgggacg acgagcttcc tgaagagcag 3660 atggaaaaag tcgtccatgt aacttttcgt gagcaattgc tgtccatacg aaacattcga 3720 aagaagagat gtgtcattcc agccggtgcg caacgtatcg aactacatgg ctactgcgac 3780 gcctcgaaac gagcttatgg tgcctgcctg tatgtgcggt gtattctagc agatggtaag 3840 atcaacgttc aacttctctg cagtaaatct cgtgttgcgc ctttgaaacc ggtcaccata 3900 ccacgccttg aattatgtgc tggactggtt ctagcacaac tgacgcgaaa atcaatggaa 3960 gcgttggaag tcaatttcga cagtgtcaca ctctggtcag attcccaaat cgttttgagc 4020 tggctaaaga aatctccgtt ggtactctta tgaattcgtt tgcaatagag ttgtttccat 4080 catcgagctc acaccaaact tccagtggcg ctacgtgaga tccgagtgta atcctggcag 4140 atgcgttatc acgtggaatg cttccagagc atctaataga aaaccggcta tggtggggag 4200 gttcacccga acttcgacat tcgcgatatt tggaagaaga agagatgcct cccattgacg 4260 atgaaggcct gccgaaacta ccaaaaagtg ttttgatcaa catcaaaaag gagcccgctt 4320 tttatttcgc gcgagtcagc aagttcacac gtctgtagcg tgcatgggca tatgtttgga 4380 ggttcatcga caactgccga gcgacgaaga agaactgtgg gcatctaact gcatcagagc 4440 tatccagacc aactcaaaca atcgtgaagc tggttcagga ggaaatgttc ccggatctgc 4500 tcaaggatct gaaaatggag aaaacgaagc gaaataatta ttctgggcta gcaccatatt 4560 tggaccaagc tggcataata agggtcggtg gacggttgaa atattcgctt attccgtacg 4620 agggtaaaca ccagatcctc cttccagaga aacaccatgt taccttgttc cttgttcgtc 4680 aattgcatga aaacaatctt catgtagggc aaaatggatt agtgtccatt atccgccaac 4740 agtactggcc tgttaaggtg gatagctgga cgctgctata cctgctacaa acacaatccg 4800 caacaattga agcagttcat gggcgacctg cctagttatc gtgtcacccc atctccagcg 4860 ttttccaaaa tcggaatcga ctttgcaggc ccattcgtgc tgaaggaaag cggtcgcaag 4920 ccaaggttct tcaaggcgta cgtctgtgta tttgtgtgta tgtccgtaaa agccgtgcat 4980 ctggagctgt gcaccgatct tcgttccgag acattcctcg ctgcactaca acgatttgta 5040 agccgacgtg gcctccctag tgaccttttt tctgacaacg gtacgacatt tgttggcgcc 5100 aaccacgaat tggctagctt gcgaaagtta tttgaagatc aggctcacac tgagaaactg 5160 gctgagttct gcagtgcaaa aggaatcatc tggcacttca tccctcccag aagtccacac 5220 tttggtggga tctgggaggc tggagttaaa tcaatgaagc acctactcaa acgagtcgtt 5280 ggcgaaacta gattgaccta tgaggaaatg gcgacattcc tgacagaagc tgaagctgtt 5340 ttgaattcgc gtccgttgtg tcccctctcg gatgatccga atgatttgga agcactgacg 5400 ccatcgcatt tccttatcgg acgtcccggt ctagcaattg ctgaaccgtc gtatggtgaa 5460 cagaagatca acaggctgtc aaggtggcag cacgtgcaga gtatgaggga gcatttctgg 5520 aaccggtggt ctacggacta cctccacacg ttacaaaccc ggaagaaatg gaaggatggc 5580 gtactagata tcaagatagg atctttggta ttacttcgcg acgagaacgt gccaccacag 5640 ttgtggaaaa tgggtcgcat cgtggccttg caccccggca aggacggcgt agtcagggtg 5700 attaccatca agtcttcgag tggtgaatac cgacgagcag tagcgaaggt gtgtttgcca 5760 ccagatgttg aatcggacga tccaaggggg ggtgta 5796 // ID I-70_AAe repbase; DNA; INV; 6259 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-70_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6259 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1341-1341 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >99% CC identity. XX FH Key Location/Qualifiers FT CDS 340..1671 FT /product="I-70_AAe_1p" FT /translation="MDGHSRPPPSDPPDSSRCPGWMIGDDELQHTIALQMR FT VRTSAASTEENEFHLPTDPFVIGNAVMLAVGPANARKVTASKEARGTRYIL FT RTNSKMISEQLQKITELPDKTPVEIVPHPTLNVVQGIVYDLDTINHTEEYM FT LSNLRAQGICSVRRIKKRKGEAYQNTPLSVLSFQGSVLPQHVYFGLLRIPV FT RMYYPSPMLCFRCANYGHTKKKCDSTKFTEVCLNCSTQHDHPVGENCQNTP FT FCKHCQEGHTPISKVCRVYQEEQAIIKTKVDRGLSYGEARANFREANKATT FT YANVLQNRLRNDESEKDKVIKMLQQEVESLRQVILELKQKRANPSISDNNV FT PTQSSAPSTSKLVQPIKSLPGIPGKTPSVALSRLNSIEQCLKTYTENQVRS FT SNSNPNLELEPTDSMDFESNRNNKRKGNKNKTEPESPERKKGIASSSKKK" FT CDS 1674..6179 FT /product="I-70_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MAHETFSEDNDHAIDPDSNFGQLRYIFKQNCTMDNQQ FT NQSSLSKHSATIQPKLFRKYDSVKSYNSDSPSATHPTSALTRDVPAEVDEP FT LAAADVVCLSSQASTVLFSNSDNNFPRVQLLVDHPNNENCAYTPVHSPFSC FT SPISSDSNHHGHSQSSTAAFDSPSARHSTSALTRDVPVITDEPLAAADVAC FT LNPQASKISYRSTDPSLCRPQIFAGPTIERSSIQRPDMAIALPVEALVNSE FT MYDQIHRITAVPSTSYHVRLLLYPAVPNSNDNPAVPNSNPHRQEHNDQSNT FT NTNESLTASPIESTGSRRILMLQWNVRGLWANHSELSHFVSQKKPISINLQ FT EMKTASVERVLRGEYEWKLADHNIRNGKGMAGLGILKEVPHHFYAFNSNIP FT MCAARLHNPYNVTVVAFYFPNNALDKDIIDAMNLLLATTEPPYVIGGDANA FT AHEAWGSSKASQRGRALLSWIVDNDMVVLNNGTPTFLSDVHCTYSAIDITI FT ASKGIASKYNWEVIDDTMGSDHFPIITFSNETISRQGCRKRWLYKDADWRK FT FEQCIDKILPVDIETDIMHFGSTITKAAESSIPKSSGKIGRKSEIWWTEEV FT ANAVKSRRKILRAMKRLNPDDPRKEEMRKSFQTARAEARKVIAEAKKVSWD FT LFCQTFSPTTSSEQIWGNFNRLNGKRKAGVRTMIIDKTHVTNPTQIAEHFA FT ENFESISQTNDKHCSQEYDRTRDDADIEPSVLDANFTLEELLRAIDSAKGY FT SAGIDDVGYPMIKHLPLHAKISMLKAYNKLWNEGIFPEKWKEGIVIPVPKP FT GKSLQHADSYRPITLLSCIGKIYERLINHRLIVHLEAHQLLNQNQHAFRSG FT KGTATYFADLNEILTDALQRNLHCECALLDLKKAYDTTWRPHIVQRIRELD FT LGINMTKSIESFLNNRCFRVSFGGALSSSKIQENGVPQGSVLSVTLFTIVM FT DTVFNVIPPNVTALIYADDILLITVGKKVTSTRKRLDSAVQAVCEWANNIH FT FEISAQKSSLLHVCRAKHRNWKKHRKDIIVNGESIPNVKVARILGIWLDSR FT ATFKKHFKNTINALQNRLNFIRAIAARAHRSVIWRIANATCISKLLYGIEL FT FGLSCIEPYKPVFNRLIRLASGAFRTSPSLSLAIESGLLPFKERVSLAYIR FT SYCKLVEKSPQIRPALEENIEQLSVTLFGESFPAISKLHRFGYRPWYYPRP FT KVDWTVKARFQVGQGAHKAQAIVNELLNSKYMQHTKIYTDGSKDATAVGCG FT IIGPNFKIEGQLPKCCSIFSAEAAALAIAVYHAGNAPTVILTDSASSLSAL FT DKGNIMHPFIQAIETYSDGKQVTFLWIPGHCAISGNVEADHAAKKGRTSLP FT VEYEVPAIDAIRWTKNKINNQLQISWNENTVDNLALVKQNIGKWCDQKKKS FT DQRILTRCRIGHTRLTKCHLFQNKDPEICDSCNTPLDVKHILLECRKFDDI FT RNRIGINSNIAIALGNNKNEEEKMLKYLKDTGLYSLL" XX SQ Sequence 6259 BP; 2038 A; 1336 C; 1296 G; 1588 T; 1 other; tcagtaaatt atcacaactt gatcagtacg gtcgcgtgag tttgtttact taaaactcaa 60 atcgagttat ttctcttgaa aacggtcgaa attaattgtg attttcacat cgtgaaaggt 120 actagaggtt ttcctaatag tcaatagtgt twtccactaa atcatagacg ctaaagtgaa 180 gttctgttgc aaattgtgaa cgtgttttgt tgacatagaa gccgcagtac atcactagca 240 cggaacccgt cgtgtttgtg gtttcctttt cggctgcatc tcgtcgagag cattaaaaac 300 atccaatagt attggttagt tacaacacgt cctagccgga tggatggtca ttccagacca 360 ccccctagtg atccaccgga cagcagtcgg tgcccgggat ggatgattgg tgatgacgag 420 ctacaacaca caatagcact gcaaatgaga gtgagaacaa gtgcagcaag tactgaggaa 480 aatgaatttc atctccccac ggaccctttc gtgattggaa atgcggtaat gctagctgtt 540 ggtccagcaa atgctcggaa agtgacggca tcaaaagagg cgcgtggaac tcgttacatt 600 ttaaggacca actccaaaat gatcagcgaa cagcttcaaa agattactga gctgccagac 660 aaaactccag ttgaaattgt tccacatcca acgctcaacg tcgttcaagg gatagtttat 720 gacttggata cgatcaacca caccgaagaa tacatgttga gcaatttgag ggcacagggt 780 atttgttcag ttcggcgcat caaaaaacgt aagggagaag catatcagaa tacaccgctt 840 tctgttttgt cgtttcaagg atctgttcta ccacaacatg tgtactttgg acttctgcgc 900 atcccagtgc gaatgtacta cccctcccca atgctttgct tcagatgtgc caactatgga 960 cacactaaga aaaaatgtga tagcaccaag ttcactgaag tttgcctcaa ctgctccact 1020 caacatgatc accctgtcgg agaaaactgc caaaacactc ctttttgcaa acattgtcag 1080 gaaggtcata ctccgatctc gaaagtctgc cgcgtttatc aagaagagca agctatcatc 1140 aaaacaaagg tcgaccgtgg gctatcctat ggagaagccc gtgccaattt ccgtgaagct 1200 aacaaggcaa ccacctatgc caacgtgctt caaaatcgtc tgcgtaatga tgaatctgaa 1260 aaagacaagg tcatcaaaat gttgcaacaa gaagtggagt cattacggca ggtaattctt 1320 gaacttaagc aaaaacgagc gaaccctagc atcagcgaca acaatgtacc aactcaatct 1380 agtgctccct ctactagtaa attagtacaa cccatcaaga gtttacctgg tatccctggt 1440 aagactcctt cggtggcatt gtcacgcttg aattccattg aacaatgcct taaaacgtac 1500 accgaaaacc aagtacgaag ttctaattca aacccgaatt tggaattgga gccgacggac 1560 agtatggact tcgaatcgaa tagaaataac aaacggaaag gtaataaaaa caaaactgaa 1620 ccggaatctc ctgaaagaaa gaaaggtatc gcttcaagct ctaagaagaa ataatggccc 1680 acgaaacctt ttctgaagac aacgaccacg ctattgaccc ggactcgaat tttggacaac 1740 tgagatacat ctttaaacaa aactgcacaa tggacaacca gcaaaatcaa tcaagtctgt 1800 cgaaacactc tgccacgata caacctaagc tatttcgcaa atacgactct gtaaaaagct 1860 acaattctga ttcgccaagc gcgacgcatc ccacgtcagc cctgaccaga gatgttccgg 1920 ccgaggtcga tgagcctctg gcggcagctg acgtggtttg cctatcttca caggcaagta 1980 cagtattatt ttcaaattcg gacaacaact ttccgcgtgt acaacttctt gtagatcacc 2040 ctaacaatga gaactgtgca tatacaccag tacatagtcc gttctcctgt tccccaatat 2100 cttcggactc caaccaccat ggtcattcgc agtcgtcaac tgcggcattc gattcgccaa 2160 gcgcgaggca ttccacgtcg gccctgacca gggatgtacc ggttattacc gacgagcccc 2220 tggcggcagc cgacgtggct tgcctaaatc cacaggcaag taaaatatca tatcgctcta 2280 cagacccaag tttgtgccgc cctcaaatat ttgcaggtcc tacaattgag agatcctcca 2340 ttcagcgacc agacatggca atcgccttgc ctgtcgaagc gctggtcaat tctgagatgt 2400 acgatcaaat ccatcgcatc accgctgttc catcaaccag ctaccacgta cgactactgc 2460 tatatccagc ggttccaaac agcaacgata acccagcggt tccaaatagc aatcctcatc 2520 gtcaggagca caacgaccag agcaatacca acactaacga atctctgaca gcatcaccga 2580 tcgaatcgac tggatcgcga agaatactca tgcttcagtg gaatgtcaga ggtctatggg 2640 ctaatcattc ggagttatcg cacttcgtta gccagaagaa gccaatttct ataaatcttc 2700 aagaaatgaa aaccgcaagc gtagagcgtg ttttgagagg agaatatgaa tggaaattag 2760 ctgatcacaa catacgaaat ggcaaaggaa tggctggact ggggattctg aaagaagttc 2820 cccaccactt ctatgctttc aattccaata ttcccatgtg tgctgctcgg ttgcacaatc 2880 cgtataacgt tacagtggtt gcgttttatt ttccaaataa cgctcttgac aaggacataa 2940 ttgacgcaat gaatctttta ctggcgacaa ccgaacctcc atatgttatc ggaggtgatg 3000 ccaatgctgc tcacgaggct tggggtagct ctaaggcgtc tcagagagga cgggcattac 3060 tgtcatggat tgtagacaat gatatggttg tgcttaataa cggaactcct acttttctca 3120 gcgatgtcca ttgtacttat tcagctatcg atataaccat tgcttctaag ggtatagctt 3180 ctaaatacaa ctgggaagtc atagacgata ctatgggaag tgatcatttt cccattatca 3240 ctttttcaaa tgaaacaata tcaagacaag gttgccgtaa acggtggctc tataaggatg 3300 cagactggag gaaatttgaa caatgtatcg acaaaatctt accagttgac atcgaaactg 3360 acatcatgca ctttggttcc acaataacca aggctgctga aagctcaata cccaaatctt 3420 ctggtaaaat tggaagaaaa tccgaaatat ggtggactga ggaagtagca aatgcagtaa 3480 aaagccgacg taaaattctc cgtgcaatga aacgtctaaa tcctgacgac ccccgtaaag 3540 aagaaatgcg gaaaagtttt caaacagcta gagccgaggc gcggaaagta atcgccgagg 3600 ctaaaaaagt ttcatgggat ctcttttgtc aaacgttctc gccaacaact agttctgaac 3660 aaatttgggg taactttaac cggctaaatg gtaaaagaaa agcaggagtt cgaaccatga 3720 tcatagataa aacccacgta accaacccaa ctcaaattgc agagcatttc gcggagaatt 3780 tcgagagtat atctcagaca aatgataaac attgcagcca agaatacgat cgaacacggg 3840 atgatgcgga tatagaaccc tctgtgctag atgcgaattt cacactggaa gaattgttgc 3900 gagctattga ctctgctaaa ggttattcag cgggaattga tgatgttggc tatccaatga 3960 ttaagcattt accacttcat gcaaaaatta gtatgttaaa agcatacaat aaactatgga 4020 atgaaggcat ttttccagaa aaatggaaag aaggaatagt aatccctgtc ccgaaacccg 4080 gtaaatcact tcaacatgcg gacagttatc gcccaataac tcttttaagt tgtattggga 4140 agatctacga acgactcatc aatcaccgtc ttatagttca tctggaggct catcagctat 4200 tgaaccaaaa tcagcatgca tttcgatctg gaaagggaac cgcaacatat tttgcagatt 4260 taaatgaaat attgactgac gctctccaaa gaaacctaca ctgcgaatgc gctcttctag 4320 atctgaagaa agcctatgac actacatggc gacctcatat agttcaacga attcgagagt 4380 tggatcttgg tatcaacatg acaaaaagca ttgagagttt tttgaacaac agatgctttc 4440 gtgtaagttt cggtggggca ctgtcctcat caaagatcca agaaaatggt gtacctcagg 4500 ggtcggtatt atcagttacg ttgtttacta ttgtaatgga tactgtgttc aacgtgatcc 4560 caccaaacgt tacagcgctg atttatgcgg atgacatctt gttaatcact gttgggaaaa 4620 aagtcaccag caccagaaaa cgtcttgatt cggcagttca agctgtgtgt gaatgggcga 4680 acaatatcca ctttgaaatt tcagctcaaa aatcttcgct tcttcacgta tgtagagcaa 4740 agcacaggaa ttggaagaaa catagaaaag acattatagt aaatggggaa agtataccaa 4800 atgttaaagt tgcgaggatt cttggtattt ggcttgatag tagggcaacc ttcaagaaac 4860 atttcaaaaa cactatcaac gcattacaga atcgtttaaa cttcatcaga gcaatagctg 4920 cccgtgctca tagaagtgta atttggagaa tagctaacgc aacttgtatt tcgaagctgc 4980 tctatggaat agagcttttt ggactaagtt gcatagaacc atataaacct gtattcaata 5040 ggttgattcg attagcctca ggagcattta ggacatcgcc ttctttatca ctggccattg 5100 agagtggcct acttcctttt aaagaaagag tctcattggc atatatccga agttattgca 5160 aacttgtaga gaaaagccct caaatccgac cagcattaga agaaaacata gagcaactgt 5220 ctgtgactct ttttggagaa agttttccag caattagtaa gcttcatcgt ttcgggtatc 5280 ggccgtggta ctatcctcgg cccaaagtgg attggacagt gaaagcgaga tttcaagttg 5340 gacaaggtgc tcacaaagcg caggccatcg taaatgaact tctcaacagt aaatatatgc 5400 agcatacaaa aatctacact gatggatcaa aagacgcgac agcggttgga tgtggaataa 5460 tcggaccgaa ttttaagata gaaggccaac ttccaaaatg ctgttcaatt ttttctgcag 5520 aagcggctgc acttgcaatc gctgtatatc atgctggtaa tgcaccaacg gttattttaa 5580 cggactcagc tagctctctc tctgcgcttg ataaaggaaa catcatgcat ccttttattc 5640 aggctattga gacatactcc gatggcaaac aagtgacttt cttatggata ccgggtcatt 5700 gcgcaatatc cggaaacgta gaagctgatc acgctgcaaa gaagggaaga acaagtttac 5760 ctgttgaata tgaagtacca gctatagatg caatcaggtg gacaaaaaat aaaattaaca 5820 atcagttgca aataagttgg aatgaaaata cggttgataa tttggcatta gtaaaacaaa 5880 atattggtaa atggtgtgat caaaagaaaa aaagtgatca aagaattctt acaaggtgca 5940 ggataggcca tacgcgattg actaaatgcc atttattcca aaacaaagac cctgaaattt 6000 gtgatagttg taatactcca ttagatgtaa agcatatttt actagaatgc agaaaattcg 6060 atgacattag aaacagaata ggaatcaatt caaatatcgc catagcttta ggaaataaca 6120 aaaatgaaga agagaaaatg ttaaaatatt taaaggatac aggactttat agtttacttt 6180 gaacagaact ttagattaat aagaggcgaa tgaattgaaa atttaaagcc tctataaaaa 6240 taaagcaaaa aaaaaaaaa 6259 // ID BEL2-LTR_Dmoj repbase; DNA; INV; 372 BP. XX AC scaffold_6541; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2_Dmoj; KW BEL2-I_Dmoj; BEL2-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1013-1013 (2009). XX DR Genome; scaffold_6541; Positions 197851 197480. XX SQ Sequence 372 BP; 111 A; 82 C; 68 G; 111 T; 0 other; tgttgggctc agcaatcatc gcttcattgt aattaagagc ggcagttcat agagtatgtt 60 atcatgtaag agaatttgta acggagatca cgctctccct tttgcgttgt ctctttcacc 120 cacgcaccgg ctcagcttaa tccgccggct cgcgtacatt gtagttagta tgcaattaac 180 tcttgtttag agcagatgca gtgttagatt gactattaca aacatagtag cggacagcac 240 gacagcctag tcgcttgctc gcgtctatca cataaactct gtaaactcta ataatcgcta 300 ataaactaat cactaagtaa attcaatcgt ttaataaatt aaagctattc gaaaaatctc 360 gatagctcaa ca 372 // ID Copia-111_AA-LTR repbase; DNA; INV; 257 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-111_AA_; KW Ty1_copia_Ele79; Copia-111_AA-I; Copia-111_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-257 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 257 BP; 87 A; 49 C; 52 G; 69 T; 0 other; tgttgaggaa tcaattttca ttgttgtatc aatactttta tgcctcacta tgtacatata 60 aggttaagcc tcactataac tgtagaatac gtgtgtagca aaaccatcgc tagtggacca 120 tgaagcatca gaatcaatac cctgtgtacg accagacaag gcatcagttt tcagatcctt 180 gctggcgaag gcaagataat aaaggaaggt ggatagcctc taagaagaag aaaacccgga 240 gttttcaatt atcaaca 257 // ID Crack-3_CP repbase; DNA; INV; 4776 BP. XX AC . XX DT 22-JUL-2009 (Rel. 14.07, Created) DT 22-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Culex pipiens Crack non-LTR retrotransposon. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-3_CP. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-4776 RA Kapitonov V.V. and Jurka J.; RT "A family of Crack retrotransposons from Culex pipiens."; RL Repbase Reports 9(7), 1334-1334 (2009). XX DR [1] (Consensus) XX CC The Crack-3_CP consensus sequence was derived from multiple CC alignment of 4 copies of Crack-3_CP that are 99% identical to CC each other. XX FH Key Location/Qualifiers FT CDS 382..1425 FT /product="Crack-3_CP_1p" FT /note="ORF1." FT /translation="MSAEPSICAVCKVAEPEATKIVTCVNCYKSEHTECKG FT VYGSSAAALRAKPYFCSVQCCEIHHRCQNSTLFSGIDDKVMSEILIEVRKT FT NTEMHDIQQTMGELKRFNTYVGNQIEVLVTEIKSVKRDYVKVKQDVGYLQS FT DQKQTNETISDLQLSLDRVNRAALIKNAVILGVPTTQTEDTKQLVRQIAAA FT VKCQLPQDAIVEATRLVSKDPSQRTSKSVPIRVVFAEEQHKEDLFAKKKNH FT GLLFTSALGAGSGPATNRIIIRDEMTSFGMKLLKDVRSLQEQADIQYVWPG FT RDGVILTKKSSDSKVEKVRSLLDVRKLQQSKRSRDESLLHSTDLSMSILAE FT PDPKR" FT CDS 1486..4383 FT /product="Crack-3_CP_2p" FT /note="ORF2." FT /translation="MSMYCNIEFECMQQWIVNKFCTDVVSLKILQLNIRGM FT NDLAKFDGVREMLQQYGERIDVLVLGETFLKADRTCIYNLDGYIATFSCRN FT ESSGGLAVFVRNDIDMDVLENRTADGFHHIHCQLRSLNKRIDCHAVYRPPS FT FGATRFLDEMDQILSKTKNGHDCIIVGDMNIPVNKPSINIVQQYSHLLSSY FT GMVVTNTFVTRPASGNILDHVICDEKLSHSVTNETIPSDLSDHCVVLSTFR FT RAEPILKRTLEKQIIDHRRLNQLFFDRITTLPQTLTADQKLSHVINCYNTL FT LPQCTRTVKAQVNIKGHCPWMTLHLWKLIQIKDNTLKRKRRNPNDAHLSEL FT LAHVSKKLQKEKAECKRNYYSNQLSSGDQKASWKVINEAMGKNSSRAYPTE FT LLINGNKTTDSNQIAQHFNDFFSNVGPNLAATINSDREVHKFGTLSALPEG FT NSLFLRPASINETIALINALNPKKSPGPDGIPASFLKANFVFFAHLLTDVF FT NEIVETGVYPAVLKVARVTPVHKSGDKKEANNYRPISCLSILDKVIEKLIV FT SRLVDFITQHKLLYAHQYGFRQGLSTLSASCNLVDDIYDSLDKRELVGALF FT IDLKKAFDTIDHRLLLEKLETYGIRGVAKAVIESYLSDRLQFVSLGQVRST FT LSPVITGVPQGSNLGPILFLLFINDIGKLKLKGTTRLFADDTSVFYHGKDC FT ALIQRQITQDLELLDDYFQTNLLSMNLSKTKYMLIRSPRKQLPARQPIVFN FT GQQIEEVNQYPFLGLTLDDTMSWLAHIDLLKKKLAPICGILWKLSSFLPTS FT CLKKLYFSLVHSRLNYLVTTWGLAYAVHLHDLQVIQNRCIKAVFRKPYLYP FT TSLLYSNPADSLLPLRALQKLQMLTHVRKCHLSTTTASTFPIREQTSENDR FT ATRQLGDFILPLPRTEFGMKRVAYVGCKLHNELPTVCKNASQGAFKLSLRN FT HIKSKIIDYLV" XX SQ Sequence 4776 BP; 1353 A; 1221 C; 1047 G; 1155 T; 0 other; aaagctgcca ccctgcaagt gacgagcaca tcactctcta cagtgtcctg aaactaacag 60 tttttaagga ggaaaaactg attcgtggta tttgcagtta cgtgaaaagt gaagatcacg 120 aggcggagga caggtgaacg tgttatcttg gatgccggtg tatggacagt gcaagaatct 180 tgagggaaaa agttcctaat ttcctggtaa atatactcca ttgggaagcg tagtaagacg 240 ttacacttac tcataagttg tcaataagat gatagcttaa gttccagaat tttcgctgat 300 gcagtactag ttcgccggtt attactgtct gtcacttatc gctgccccca cttgtttgta 360 aaaaaaaaac attcggtgga aatgtctgcc gaaccgagta tttgcgcggt gtgtaaagtt 420 gccgagcccg aggcaaccaa aattgtcacc tgtgtcaact gctacaaaag tgaacacaca 480 gaatgcaaag gagtgtacgg gtcatcagct gctgcgctac gagcgaagcc ctacttctgc 540 tcagtgcaat gttgtgaaat tcatcatcgc tgccagaatt caacgctgtt cagcgggatc 600 gacgacaaag tcatgagcga aattctcatc gaggtacgca aaacaaacac cgagatgcat 660 gatatccaac aaaccatggg ggagctgaaa cggttcaaca cctacgttgg gaaccaaatc 720 gaggtacttg tcacggagat caagtctgta aagcgggact acgtgaaggt gaaacaggac 780 gttgggtacc tccagagtga ccagaagcaa accaacgaaa caatcagcga cctgcagtta 840 agccttgatc gagtcaacag agctgcgctc atcaaaaacg ctgtcattct gggcgtacca 900 acaacccaaa ctgaggacac caaacaactc gtgcgccaaa ttgcagctgc cgtcaaatgc 960 caactaccgc aggatgctat cgttgaagct actcggttgg tatcgaaaga tccttcccag 1020 cgaaccagta aatccgtgcc gattcgagtc gtcttcgcag aagaacaaca caaggaggat 1080 ctcttcgcaa agaagaagaa ccacggtttg ctgtttacat ctgctctagg tgctggttcg 1140 ggccctgcga cgaacagaat catcattcgc gacgagatga ccagcttcgg tatgaagtta 1200 ctgaaggacg tcagatctct ccaggaacaa gccgacatac agtatgtctg gccaggtcga 1260 gacggggtaa ttctcaccaa aaaatcgtca gactcgaagg tcgagaaggt ccgcagcctc 1320 ctggatgtcc gcaaactgca gcagtctaaa cggtcgcgtg atgagtcgct cttacactca 1380 actgatctct cgatgtcgat cttggccgag ccggacccga aacgctgatg gtagtgatct 1440 ggttgatcta gctgctaagc gattttactg taaactactt aaattatgtc tatgtattgt 1500 aatattgagt ttgaatgtat gcaacagtgg attgtaaata aattttgtac tgatgttgtt 1560 tctctgaaaa ttcttcaact aaatatacgg ggcatgaacg atctggccaa gtttgatggt 1620 gtacgggaga tgctgcagca gtatggagaa cgtatcgatg tgttggtgct tggtgaaacc 1680 ttcttgaagg cagatcgaac ttgtatttac aatttggatg gatacatcgc aactttttcc 1740 tgtcgtaatg aatcgagcgg aggtttggct gtgttcgtgc ggaacgatat tgatatggat 1800 gtactcgaga accgaacagc tgacggattt catcacatac actgccagct acgttcctta 1860 aacaaacgta tagactgcca tgctgtgtac aggccaccgt catttggtgc tactcggttc 1920 cttgacgaga tggatcaaat cttgtcgaaa actaaaaatg gtcacgattg tatcatagtc 1980 ggtgacatga acattcccgt taacaaaccc tcgatcaata ttgtgcagca gtattctcac 2040 ctgttatcat cgtacggcat ggtggttacg aacacttttg taacgagacc cgcgagtggt 2100 aatattctgg accatgtgat atgtgatgag aaactatcgc actcggttac caatgaaaca 2160 attccctctg acttgagtga ccattgtgtg gtcctttcta cattcagaag ggctgaaccg 2220 atcctgaaaa gaactctgga gaagcagatc attgatcacc gtcgtctcaa ccaactgttc 2280 ttcgatcgca ttacaacgtt gccgcaaacc ttaacagctg atcagaagct ttctcacgta 2340 attaactgct acaataccct actgccccaa tgtacgagaa ctgtcaaagc ccaggtcaat 2400 atcaaagggc actgtccatg gatgacgctc cacttatgga agcttatcca aataaaagat 2460 aacacgttga agcggaagag gcgtaatccg aatgatgccc acctgtccga actgctggcg 2520 catgtctcca agaaacttca gaaggagaag gctgaatgca agcggaacta ctacagtaat 2580 cagctttcct ccggtgatca aaaggcgtcg tggaaggtca tcaacgaagc tatggggaag 2640 aacagctctc gcgcttaccc aaccgaactg ctgatcaacg gcaacaagac aacggattcg 2700 aaccagatag ctcaacactt caacgatttc ttcagcaacg tgggacccaa ccttgcagca 2760 actatcaaca gcgatcgtga agtccacaaa ttcggaaccc tgtctgcttt acctgaggga 2820 aactccctgt tcctgagacc agcatcaatc aacgagacaa tcgccctcat caatgcattg 2880 aatccaaaaa aatcccccgg cccagatgga attcccgctt ccttcttgaa agcaaacttc 2940 gttttcttcg cgcacctgct gactgacgta ttcaacgaaa tagttgaaac tggagtctac 3000 cctgctgtct tgaaagtggc ccgtgttaca cctgtgcata agtcgggtga caaaaaggaa 3060 gcaaacaact atcgtccgat atcctgtctc tccatcctgg acaaagtcat tgaaaaattg 3120 attgtttcgc gacttgtaga cttcatcact cagcacaaac tcctttacgc ccaccaatac 3180 ggtttcaggc agggtctgag cacactatcc gcctcgtgta acctagtaga tgacatctac 3240 gactcgctgg acaaaagaga actcgttgga gcattgttca tcgacttaaa gaaggcgttt 3300 gatacgatcg accatagact cttactggaa aaactggaaa cctacggcat acgaggtgta 3360 gccaaggcag ttattgagag ttatttgtct gaccgactcc agtttgtatc gctcggacaa 3420 gtacgcagta cgctcagccc tgttataact ggcgtcccgc aaggaagcaa ccttggaccg 3480 attctgtttt tgctcttcat taacgacatc gggaagctga aactcaaggg gacaacacgc 3540 ctgtttgccg acgacacctc ggtcttttac cacggaaaag actgtgctct gattcaacgc 3600 cagattaccc aggacttgga actccttgac gactatttcc aaacaaacct gctctccatg 3660 aacctgtcca agacaaagta catgttgatt cgatcacccc ggaagcagct cccagctcgc 3720 cagcctatag tcttcaacgg tcaacagatt gaagaagtta accagtaccc gttccttggt 3780 cttacccttg atgacacgat gagctggtta gcacacatcg acctgttgaa aaagaaacta 3840 gctccaatct gcggaatact ttggaagctg tcatcgttcc tgcctactag ttgcctcaaa 3900 aagttatact tctcccttgt ccactcccgc ttaaactacc ttgtcacgac ctggggactt 3960 gcttatgctg tacacctcca tgacctgcaa gtgatacaga acaggtgtat caaagccgtc 4020 tttcgcaagc cgtacctgta cccaactagc ctcttgtatt ccaacccggc cgactcactc 4080 cttcctcttc gagccctcca aaaactccaa atgctgaccc acgtccgaaa atgtcacctg 4140 agcactacca ccgcttcaac gtttccaata agagagcaaa cctcagaaaa tgatcgagct 4200 acaagacaac taggtgattt catcttgccc ctaccgagaa ccgagttcgg tatgaaaagg 4260 gtcgcttatg ttgggtgcaa gctgcacaac gagttgccga cggtttgtaa aaatgcgagt 4320 cagggtgctt tcaagctgtc gttacggaac cacatcaagt caaaaataat tgactacctt 4380 gtctaaacac ccgccactaa agagttttct tctctgcttt ccggtcgcca ccgcccgccg 4440 tccaccaccg tcgtccaccg cccaccaccc gccgcccgcc gcccgccgcc caccacccgc 4500 cgcccaccgc ccccccgccg ccaactgcta cccgccacca accgcccact accagcaact 4560 gcggcttaca acccgcaccc agtgcttata tgatttagaa tttagctttt tttaatttat 4620 gtaatagttt gcacagcctt taaagagaaa ctgttctcac tggcatacgt gcatagcagt 4680 ccgagcctaa cctttaaaaa ataaaataaa aggtaggtga aatattccca gcttctgctg 4740 ggaagtgcgt aattcggtat taaaaaaaaa aaaaaa 4776 // ID hAT-4C_AP repbase; DNA; INV; 4014 BP. XX AC Contig37637; XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon. XX KW hAT; DNA transposon; Transposable Element; hAT-4C_AP. XX NM hAT-4C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4014 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1367-1367 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1383..3110 FT /product="hAT-4C_AP_1p" FT /translation="RRYVKLYLFYNLFNIINNILIVLFFISVEFNSTSSSE FT DVESITETESSRSPMPCPLQMKRARLEILTPRLSAVLDKCKISDRDAVHLL FT TACVEATSLNPLDFVINRTSIRLSRQRFRELNAFRIKYNFFDLNLKFITIH FT WDSKILLDITGKTKVDRLPIIATAPNTEQLLGVPELSAGTGYEVSSAVYDT FT LEDWSLLDRVQAFVFDTTASNTGRLNGACVLLEHKLGRNILYLACRHHVFE FT IILQSVFVGSKFAPSSGPDIPLFKRFKNQWETIDLTQFSIWSMDVKTSNVL FT NDVSQQILIFAHKKLRDDFPRDDYKEFLELVVIFLGDVPPGGIKFRQPGPY FT HLARWMAKGIYCLKIMLFQKQFKLSVGEKKALQTICCFIVKCYAESWFTAP FT DAVQAPLNDFIFIKKLNSYKKDDKVIAEIALNKFVNHLWYLNEECAAFSLF FT DDRINVNQKRNMVKKILEEDKEKEKEVQKKLYLKSEEVSNFVNNELPTSLL FT SAKSMVMFKRFDLSTNFLKIDPSEWNSQNDYIQGQQTIRSLKVVNDTAERG FT VKLMEEFNDKITKDEDQKQFLLKVSCLYK" XX SQ Sequence 4014 BP; 1495 A; 525 C; 590 G; 1404 T; 0 other; tattagggtg gtccttatat ataggtgatg aaaaaaaaat taaagatttc ttttcaagca 60 ccccccaatt ttgttcattg ctataagaaa ataatttagg ttaaatttgg tgatgataac 120 tcaccccctt cacgtgccgc aaaagggttg aaaaatgact aacgagggta gtttttctgt 180 ttttcgaatt taaaccaaaa actataagac ccacaaaaaa acacatagaa ccttaattat 240 agaacattaa attatctatg aataatgatc ttatataatt tcttgtcaat cctttatttt 300 ccgagaattt gttaaataac caaattttat gtatattttg aaaaattgat ttttctattg 360 ttattctaat atattttacg tatggttcaa attacgacga agtcttttat acaaaaaatg 420 ttaattagat aattataata tataagataa ttttctaaaa ataaagtcat aagtaaaatt 480 tgcattagac caataacttt gtctttacag taggatttgt gaaatatttt attaaacaac 540 tattttctac caccacgaag aaccaaatac aacataatta ttattttatt ttattgacaa 600 gaaaaatatt actttataat aaagtttgat gaatcaaaca aattattgta tacctaaata 660 tttattgtat aataatttcg gaaaactatt tttaagtttg agataagaca gcaaataata 720 atatgtcctt ttggttatac acattactgt tattgagtat tagattagat tattatcgtg 780 gttgattgtc gtataattga aaccataatg tcacattcac tgttcattcg tctttctgtt 840 catatctata atatcatgcc tcgtcaaaaa atatttattg ttggtgttat gaaaaactaa 900 atacttggta gtaaattacc atctaaaaaa gatgttttaa gtgttttatt ttttaatatg 960 cggatggtaa aattaacgct acaagatagt atttcattgg ttattgatga atgtttaatc 1020 ttttggaaaa aagcccgaat accaactcga gatcgatgtc attgcttaaa acaattgaaa 1080 aaactctatg aagaattacg aaacctcgaa aaaagtaaaa ataggaatag tgagttatgt 1140 aggcagagag agcatacttt cgaagacaat ttaaatgatt tatttgatat tgcgcatgca 1200 tttgctatgg atatgataaa aataaatgaa gataaagaat ttttgaactt tcaacgacaa 1260 aaaggccgtc ctgggtgtat gcttagtaca gatgtaaaat tagccggaat tgaaaaacga 1320 aaacataatc gtgaaaaaaa atacattgat gtacaagaaa agtcgtgttc atcgaaatct 1380 gaagaaggta tgttaaatta tatttatttt ataatttatt taatataatt aataatatac 1440 ttattgtttt attttttatt tcagttgaat ttaattctac atctagttcg gaagatgtag 1500 aatcaataac agagactgaa agttctagat ctccaatgcc gtgtccattg caaatgaaaa 1560 gggcacgatt agaaatatta acacctcgac tttctgctgt acttgataaa tgtaagatca 1620 gcgatagaga tgctgtacat ttattaacag cctgcgtgga agcaacttct ctaaatccat 1680 tagattttgt cataaatcga acttcaattc gtctatctcg tcaacgtttt cgtgagttaa 1740 atgcttttag aataaaatac aatttttttg atttaaattt aaaatttata actatacact 1800 gggattcaaa aattcttctc gatataactg gtaaaacaaa agttgataga cttcccatca 1860 ttgcaacagc tccaaatact gaacaacttc ttggtgttcc cgaactttct gctggaaccg 1920 gttatgaagt ttcatctgct gtgtatgata ctcttgaaga ttggtcactt cttgatagag 1980 tacaagcatt tgtatttgat acaacggcat ctaatactgg acgtttaaat ggtgcttgtg 2040 ttcttttaga acataaactt gggcgtaata ttttatactt ggcttgtaga catcatgttt 2100 ttgaaattat tctccaaagt gtatttgttg gatctaagtt tgctccatct tcaggccctg 2160 atatcccatt atttaagaga tttaaaaatc aatgggaaac aattgattta acacagtttt 2220 caatttggtc aatggatgtt aagaccagta atgttcttaa tgacgtaagt caacaaatat 2280 taatatttgc ccataaaaaa cttagagacg attttcctcg ggacgattat aaagaattcc 2340 tcgaattagt agtcatattt ttaggagatg ttcctccggg tggtatcaaa tttcggcagc 2400 caggacccta tcatttggct agatggatgg caaaaggtat atattgccta aaaattatgt 2460 tatttcagaa acaattcaaa ttaagtgtag gagaaaaaaa agccttacaa accatttgct 2520 gttttattgt aaaatgttat gcagaatctt ggttcacagc tccagatgca gtacaagcac 2580 cacttaatga ttttatattt ataaaaaaac ttaattctta taaaaaagac gataaagtta 2640 ttgcagaaat agctttaaat aaatttgtaa accatttatg gtatcttaat gaagagtgcg 2700 cagcgttcag tttatttgat gacagaatta atgtaaacca aaaacgtaac atggttaaaa 2760 aaatattgga agaagacaaa gaaaaagaaa aagaagtaca aaagaaatta taccttaaat 2820 cagaagaagt ttctaatttt gtaaataatg agctcccaac atcattactc tcagctaaat 2880 cgatggtgat gtttaagaga tttgatttat ctactaactt cttaaaaatt gatccatctg 2940 aatggaactc acaaaatgac tacatccaag gtcaacaaac aattaggtca cttaaggttg 3000 taaatgatac agctgaacgt ggagtgaaac taatggaaga atttaatgat aaaattacaa 3060 aagatgaaga ccaaaaacaa tttttattga aagttagttg tttatataag taataggtat 3120 tagtagatat tcataataaa ttgtttctaa atataatatt aatatttgaa ttacatttta 3180 aatgatttca gactgtacaa gattatcgtc gtaaataccc agggcatagc agagaacagt 3240 tgaaaacgcc ctatgcctaa taattaaata attaatttag taatctcaaa tcataatata 3300 ttaataatta taatatcaat aatctctcag tactcacctt agtaatattt tattattata 3360 caataaatat ttaggtatac aataatttgt ttgattcatc aaactttatt ataaagtaat 3420 atttttcttg tcaataaaat aaaataataa ttatgttgta tttggttctt cgtggtggta 3480 gaaaatagtt gtttaataaa atatttcaca aatcctactg taaagacaaa gttattggtc 3540 taatgcaaat tttacttatg actttatttt tagaaaatta tcttatatat tataattatc 3600 taattaacat tttttgtata aaagacttcg tcgtaatttg aaccatacgt aaaatatatt 3660 agaataacaa tagaaaaatc aatttttcaa aatatacata aaatttggtt atttaacaaa 3720 ttctcggaaa ataaaggatt gacaagaaat tatataagat cattattcat agataattta 3780 atgttctata attaaggttc tatgtgtttt tttgtgggtc ttatagtttt tggtttaaat 3840 tcgaaaaaca gaaaaactac cctcgttagt catttttcaa cccttttgcg gcacgtgaag 3900 ggggtgagtt atcatcacca aatttaacct aaattatttt cttatagcaa tgaacaaaat 3960 tggggggtgc tcaataaaaa atttgaaagt ttaaaaataa ggaccaccct aata 4014 // ID Copia-56_AA-LTR repbase; DNA; INV; 283 BP. XX AC AAGE02023339; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-56_AA_; KW Copia-56_AA-I; Copia-56_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-283 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02023339; Positions 15753 15471. XX SQ Sequence 283 BP; 66 A; 60 C; 58 G; 99 T; 0 other; tgtgggaaac taacaatgac cacctcccat atatatgtac cttgactggg taatttatac 60 acctaatact gtgaactttg tttagagtgt aagcgagctt ccgcgcgaga acagcggaag 120 acattttgtt attcattcat tcgttattgc tcttcgttcg atacagatgt gaataaaagc 180 taagttgtgt cgtaatcgga tgtgcgcgtt tcttttatcc gaagttcctt tttcttcttt 240 ctccgcgagt gatacttcct gctgcctggt gattctgtcc aca 283 // ID Gypsy-50_AA-LTR repbase; DNA; INV; 185 BP. XX AC AAGE02019559; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-50_AA_; KW Gypsy-50_AA-I; Gypsy-50_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02019559; Positions 763 579. XX SQ Sequence 185 BP; 61 A; 30 C; 40 G; 54 T; 0 other; tgtggtgctc taacactgtc actgacaaca cagatataga tcaactatgg ttagagggtt 60 tgtttcgggg gtgattaaac acagaagtga gatattgaat caacaataaa cgtgtctgcg 120 agtttcaata attatattcg taataagcgg tctttattaa tccggaagta tccgagaata 180 ccaca 185 // ID Gypsy-37_CQ-I repbase; DNA; INV; 5678 BP. XX AC AAWU01034069; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_CQ_; KW Gypsy-37_CQ-LTR; Gypsy-37_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5678 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 453-453 (2011). XX DR Genome; AAWU01034069; Positions 21700 16023. XX CC Positions [2538-3077] - Reverse transcriptase CC Positions [3428-3781] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 496..1629 FT /product="Gypsy-37_CQ-I_1p" FT /translation="MDAEQAALAAAAAAAAAAAAQAAQALAAEQAAAEAQR FT TLEEAAHIVNSMKVPNAVDTIPEFTGNPIKLHSFIRSVENLLPFMRPCVNT FT PFYNMWLQAIRRKIIDDADNVLELYGIGLDWEAIKEFLIMYYSDKRDNLTL FT TRELYSLYQTGTIEDFYGKIQHTLSLLINHANVSIQDLNIRADRIGMYRQN FT ALNVFLAGLKEPIGGNIRSRGPDNLKHAFDACMEELNFQRINNSRKFELPP FT MPMQNYKFPKNYFTSNQVPNIPPKNYFALNQVPNVPPRNVFAPFNSKPNLP FT KPEPMDVDRSLRSRNVNYINRPNNQRNHFVQGPQRFHVEELRNLEQDDNED FT EDQVEASGSNEVFDETHNEVDDVNFHMGTTTNNQI" FT CDS 1983..3500 FT /product="Gypsy-37_CQ-I_3p" FT /translation="MYRKYPVIMCKTIPKQSILNIEIPVTEKSGDFFFEDF FT EVLPKLIICSGLYKASENKAVITIRNDSKNDQQFALQNPMQIELNNFEIIS FT SDEINHHEFQPERYRKLIDQLRLDHLNTEERKKLLEVINEYQDLFHVDNEP FT LSATNGTTHVIRTSDDLPVYQKSYRYPYCHREEVSNQISKMLKEGIIRPSS FT SPWNSPIWVVPKKIDASGQQKWRIVVDYRKVNAKTVDDKFPIPNISDILDK FT LGRSHYFTTLDLTSGFHQIKMEPQDIIKTAFSTEDGHYEYTRMPFGLKNAP FT ATFQRHMNVVLAGLIGKICFVYMDDIIVYSTSLEEHCVNLSKVFDALSAAN FT LKIQLDKSEFLKKEVAFLGHVITDEGVKPNPDKIDAIQNWPVPKTEKQIKE FT FLGTIGYYRKFIQDFSKITKPLTQCLRKGEQVEHTPQFIKSFEQCKQILSS FT SSILQYPDFSKPFILTTDASKYALGAVLSQGPIGKDKPIAFASRTLNKSEE FT NYFHFTVYRN" FT CDS 4111..5676 FT /product="Gypsy-37_CQ-I_2p" FT /translation="MTTLAEIIITDLNRNDEFIAFKEYKTKIIRNLNLHLH FT IVNVENIQSNIDIISHALNIFYKNNSTQNFHKVLERRVMKLSDNFKRLKPL FT TPRHKRGLVNPIGTIIKSITGNLDNNDLYDINKQLDITKTKTNYLIESNNR FT QIIINDRLQSKLNEITKHLNEQQIKILKEISIQINKKTIQSDYFCKQNLFE FT LLFNIQFLDQEINDLVESVQLAKLGIICKNIISLNEMETFEKILTDQNITI FT SSLDQIYEYLGLEVFHKNHSIIFVIKVPNFEPFEYDFINIELLPKNNSIIT FT FDYKSVITNGTKSFVSQIPCYTIEKYKICNSKNLKDISEDKCISNALRGTP FT ARCNFKEYQGGTQIDVVNDFTVILKDANYPTSIKSECNIKERSVTGTLLVS FT YEGCDITINNTKFSGKSHVEHHKITMIPMIGISFKQEWFFKEHNISQIQIS FT NINHIEKIEQQYTKLQYFTIGGFMLTLSCSIIIVLLWLITRKTKSAATVEN FT NLESNDPVSASISQHSNKDSGRIFSK" XX SQ Sequence 5678 BP; 2197 A; 970 C; 903 G; 1608 T; 0 other; gtcggtagga tatgctgttc aattgctggt gaacgaatat catcaaagac agcatgaaga 60 ccatccttat ttttatgtaa gtaatttcgt gagcctcaag tgaatattga gtgcatagaa 120 ttccgttggg gactgttaaa ttgaagatta tcatgaattg ttaagaggca atcaattatt 180 ccaagctcaa gtgaatattg agttaagtga actttaaagt gaactgttaa tggggacaaa 240 ttattccgaa actcaaatga atattgagag aagtgaattt ttcgtgaact gttaacggga 300 aacaacatta agttaagtga gttttgtgac aaacaaatct aattaatact caaatgaaca 360 ttgggtacaa atcaagtaat tttttttaca atcaaataaa ctgttgatgt taagtgaaaa 420 ataaaaggac tgtttcaaaa attaaaagga agagaagaaa ccgttaagtg attgaatttt 480 ttttgaaaaa taaaaatgga tgctgagcaa gcagcacttg cagcagcagc agctgcagca 540 gcagctgcag cagcccaagc tgctcaagca cttgcagctg aacaagctgc ggccgaagca 600 caacgcacat tagaagaagc agcgcacatt gttaacagca tgaaagttcc gaatgctgtt 660 gatacgattc cagaattcac cggtaatcca atcaaattac attcattcat aagaagcgta 720 gaaaatttac tgccttttat gaggccttgt gttaacactc ctttttacaa catgtggtta 780 caggcaatta gaagaaaaat aattgatgac gcggataatg ttttggaact atacgggatt 840 gggcttgatt gggaagcaat caaagaattt ttaatcatgt actacagtga taaaagagat 900 aaccttactc ttaccagaga attgtattcg ttgtatcaaa caggtacaat tgaagatttc 960 tatggtaaaa ttcaacatac tttgtctttg ctaataaacc atgctaacgt ttcgatacaa 1020 gatctcaata taagagcaga tcgaattggt atgtaccgtc aaaatgcatt gaacgttttt 1080 ctagctggcc tcaaagaacc gataggagga aatattaggt ctcgcggacc cgataatctg 1140 aaacatgcgt ttgatgcgtg catggaagaa ttaaatttcc aaagaataaa taattcacgc 1200 aaatttgaat taccaccaat gcctatgcaa aactataaat ttccgaaaaa ttattttact 1260 tcaaatcaag ttccaaatat tccaccgaaa aattatttcg ctttaaatca agtgccaaat 1320 gttccaccga gaaatgtttt tgctccattt aactcaaaac caaatttacc aaaaccggag 1380 cctatggatg ttgatcgttc tcttcgatca agaaatgtga attacatcaa tagacccaac 1440 aatcaaagaa atcatttcgt acagggaccc caaagattcc atgttgagga attgagaaat 1500 ttagaacaag atgacaacga ggatgaagat caagtagaag cttccggctc aaatgaagtc 1560 tttgatgaaa cccataatga agtagacgat gtaaattttc atatgggaac tactacaaac 1620 aaccaaattt gacatccaga aattatattc catacctcag acttccaact caagagggat 1680 ttgatttgaa attgttaatt gatacgggat caaataaaaa ctatttgagt ccaaaaagag 1740 tacgaaaagc atcaaagctt caaaaaccaa caattgtttc aaatatttcc ggaaaacatt 1800 tgattaatga atacgttaat tttaatccat ttccttcatt aacgactaaa aaatttattt 1860 tccatgtttt tgattttcat aaatattttg atggactcct tggatacgaa acacttcaag 1920 acctcgatgc caaaattgat tcaaaactga atacgatcca aattctcaat caaacattta 1980 aaatgtatag aaaatatcct gttattatgt gcaaaactat accaaaacaa tcaattttaa 2040 acatagaaat cccagttact gaaaaatctg gtgacttttt ttttgaggat tttgaagttt 2100 taccaaaatt aattatttgt tcaggtcttt acaaagcttc tgaaaataaa gccgttataa 2160 ctatacggaa tgactctaaa aatgatcaac aatttgcttt gcaaaatccc atgcaaattg 2220 aacttaataa ctttgaaatc atatcatcgg atgaaatcaa tcatcatgaa tttcaaccag 2280 aacgatacag aaaacttatt gatcaactaa ggcttgacca tttaaacacc gaagaaagga 2340 aaaaactttt agaggttatt aatgaatatc aagatttatt tcacgtcgat aatgaaccac 2400 tatcagcaac aaacggaaca acacatgtta ttcgaacatc agatgattta ccagtatacc 2460 aaaaatctta taggtatcca tattgtcatc gagaagaggt aagtaatcaa atatcaaaaa 2520 tgctgaaaga aggtatcatt agaccttctt catctccgtg gaattcaccc atttgggtag 2580 tacccaaaaa gatagatgct tcaggacaac aaaaatggcg cattgtggtt gactatagaa 2640 aagtcaatgc caaaaccgtt gatgacaaat ttccgatacc gaacataagt gatatccttg 2700 acaaattagg caggagtcac tatttcacga ccctagattt aacctccggg tttcatcaaa 2760 ttaagatgga accacaagat atcatcaaaa ctgccttctc gactgaagac ggacattatg 2820 agtatacaag gatgcctttt ggacttaaaa atgctccagc cacctttcaa cgtcacatga 2880 atgtcgtttt agctggttta attggaaaaa tatgttttgt ttacatggac gacataatcg 2940 tctatagcac aagccttgaa gaacattgtg ttaatttatc taaagttttt gatgctctca 3000 gcgcagcgaa tctcaaaatt caattagaca aaagcgaatt tttaaaaaaa gaggttgcat 3060 ttttaggtca cgtcataact gatgaaggtg tcaaacctaa cccggacaaa attgatgcta 3120 ttcaaaattg gccagtacct aaaacagaaa aacaaattaa agaattttta ggtacaatag 3180 ggtattacag aaagtttata caagatttta gtaaaattac taaacccctc acacaatgtt 3240 tacggaaagg tgaacaagtt gaacataccc ctcaatttat aaaatccttt gaacaatgta 3300 aacaaatttt gtcaagcagt agcattcttc aatatcctga tttttcaaaa ccatttatat 3360 taactactga cgcatcaaaa tatgcgttag gtgcagtttt atcccagggg cctataggaa 3420 aagataaacc tatagcattt gcgtcaagga ccttaaacaa aagcgaagaa aattattttc 3480 actttacggt ataccgaaat taatcgtgag cgataacgaa ccatcgttaa aatccacaga 3540 agttcgtgga atgctggcag atctcaacat tgaaacttat tacacaccat ctaataaaag 3600 cgaggtgaat gggatcgttg aaaggtttca ttcaacctta tgtgaaatac tacgaacaaa 3660 cgaagaaaaa tttgcgaatt tatcaatgaa ggaaaagtta agaatctctg tggcactata 3720 caataataca atacattctt caaccaatct taaacctacc gaaattttct acggaattcg 3780 ggatggagaa gagaggccta ctgaattgat tgatataatt gcggccagag ataaagttta 3840 tgatgaagct atcatagcac taaaaaatac acaaaaaaag atttatctta ccataataaa 3900 aacagagaac accaaccaaa tctcgaagaa tctgcgtccg tttacataga acgtcaaggc 3960 gtcaaaagta aacataagga caaatttaat aaaacaaaag tagctcaaaa tctacgaaaa 4020 accttcattg atgaaatcgg tagaaaacat cataaaacca aattaagacg ccaaagaaat 4080 taacaaataa cttttatttc taggtgcgtg atgacaactt tagccgaaat catcatcacg 4140 gatttgaaca gaaatgatga attcatcgcc ttcaaagaat ataaaacaaa aatcattcga 4200 aatctaaatc tgcatttgca catagttaat gtagagaata ttcaaagcaa cattgatata 4260 atttcacatg ctttgaacat tttttacaaa aacaattcaa ctcaaaattt tcataaggtt 4320 ctagaaagaa gggtaatgaa gttatccgat aatttcaagc gccttaaacc actgacacca 4380 agacacaaac gaggacttgt aaatccaata ggaaccatta ttaaatcaat aactggcaac 4440 ttagacaaca atgacctata tgacattaac aaacaacttg atataacaaa aactaaaact 4500 aactacttga ttgaaagcaa taatcgacaa ataataataa atgatagact tcaatcaaag 4560 ttaaatgaaa taactaaaca cttgaatgag cagcaaatca aaattttaaa agaaatttca 4620 attcaaatca ataaaaaaac aatccaatct gattactttt gcaaacaaaa cctcttcgaa 4680 ctattattta acattcagtt tttggatcaa gaaatcaatg atttagtaga gtcagtgcag 4740 ttagctaaat taggtatcat ttgcaagaat attatctctc taaatgaaat ggaaacattt 4800 gaaaaaatat tgaccgatca aaacattaca attagtagtc ttgaccaaat ttatgaatat 4860 ctcggtttag aagttttcca taaaaatcat tcaataatat ttgttattaa agttccaaat 4920 tttgaaccat ttgaatacga cttcattaat attgagcttc tccctaaaaa taattcaata 4980 ataacatttg attacaaaag tgttataaca aatggaacaa aatcttttgt gtcacaaata 5040 ccatgttata ctattgagaa atacaaaatt tgtaattcaa aaaatctgaa agacataagc 5100 gaagataaat gcatttcgaa cgcattgaga ggaaccccag cgcgatgcaa cttcaaggag 5160 taccagggag gcacacagat tgacgttgtc aacgatttta cggtaatatt aaaggatgca 5220 aattatccaa cttctattaa atcagagtgt aacatcaaag aacgctctgt tacgggaaca 5280 ttattagtat catacgaagg ttgtgatatt accattaaca acaccaagtt tagtggtaaa 5340 agtcatgttg aacaccataa aattacaatg attcctatga taggaatttc attcaaacaa 5400 gaatggtttt tcaaagagca taacatttca cagattcaaa tctcgaatat aaaccatatc 5460 gagaagatag aacaacaata cacaaaactc caatacttta ccataggagg ttttatgcta 5520 acattatcat gtagcataat aatagtattg ttgtggttga ttacaagaaa gacaaaatca 5580 gcagctactg tggaaaacaa ccttgaatcc aatgatcccg tttcagcatc cataagtcaa 5640 cattcaaata aagattcggg ccgaatcttc tctaagga 5678 // ID SINE_CP1 repbase; DNA; INV; 321 BP. XX AC X79507; XX DT 13-MAR-1998 (Rel. 3.02, Created) DT 13-MAR-1998 (Rel. 3.02, Last updated, Version 2) XX DE C.pallidivittatus Cp1 (SINE transposable element), clone pCp125. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; CPCP1125; KW SINE_CP1. XX OS Chironomus pallidivittatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Chironomoidea; OC Chironomidae; Chironominae; Chironomus. XX RN [1] RP 1-321 RA He H., Rovira C., Recco-Pimentel S., Liao C. and Edstrom E.J.; RT "Polymorphic SINEs in chironomids with DNA derived from the R2 RT insertion site."; RL Unpublished. XX RN [2] RP 1-321 RA Rovira R.C.; RT "SINE_CP1."; RL Direct Submission to Genbank (31-MAY-1994)C.R. Rovira, University RL of Lund, Dept of Molecular Genetics, Solvegatan 29, 223 62 Lund, RL SWEDEN. XX RN [3] RP 1-321 RA He H., Rovira C., Recco-Pimentel S., Liao C. and Edstrom E.J.; RT "Polymorphic SINEs in chironomids with DNA derived from the R2 RT insertion site."; RL J. Mol. Biol 245(1), 34-42 (1995). XX DR GenBank; X79507; Positions 1 321. XX SQ Sequence 321 BP; 91 A; 63 C; 67 G; 100 T; 0 other; aagcttgctc tgctaacaga acgacttatt ctcattttga ataagcatat tatctctcta 60 agcgagcgaa tgctttacaa ccgttgaaaa aagagtggtt catatatatt ggcattctgc 120 caatgccaat ggcaaagccc gatagctcag tggtctgagc acttgaccgg caatcgagag 180 gtgcgaggtt cgattcccgc tcgggaagag tcattgggtg aattactttt ttttcaactt 240 tttcttttaa actatttctc tctaagcgaa gaaatgctta ttcaaaatga gaataagtcg 300 ttctgttagc agagcaagct t 321 // ID Gypsy-47_CQ-I repbase; DNA; INV; 2175 BP. XX AC AAWU01016425; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-47_CQ_; KW Gypsy-47_CQ-LTR; Gypsy-47_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2175 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 473-473 (2011). XX DR GenBank; AAWU01016425; Positions 88159 85985. XX CC Positions [1085-1552] - Integrase core CC 'CTGTG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 218..2152 FT /product="Gypsy-47_CQ-I_1p" FT /translation="MGTQNMGRRFRYVSEAVRVAGSRGVNAKPRDTHTLLA FT TPRIIACASKALSASEQKYPQSQKEVLSIVWGVERFSYYLISKPFVVRTDS FT ESNEIIFDSEQRTGKRAVTRAETWALRLQSFRFKPKRVPGNQNIADALSRL FT IDKSQIDEPFDDINEKHVLYALDAAHMNLSWNTIEIASETDEENCAVRSAL FT HTGVWPENLRRYETQSKELRVLGSLVFKSDKIILPLKLRRTAIEIAHQGHV FT GCGATKKLLRDYFWWPNMSKEAVEFVGKCEACLMVSRKNPPLPLSSRTLPQ FT GPWEILQIDFLTLNGCGSGHFLVIVDMHSRFLQVIEMKTTDCTKTNAALCK FT AFTTWGLPLIIQSDNGPPFQSQEFTDFWEDKGVRVRKSIPLSAQSNGGVER FT QNQGIIKSISAAKAEQRNWREALQEYVHVHNTRKEHSRLGVTPFELLVGWK FT YRGTFPALWETKSSEKVDKEQVAEDDAASKLCSKKFADNSRGAKESDIQPG FT DRVVISIPQRNKLDPSFSRERYTVLARKGAKVVVRSDCGVQLARNVQDVKR FT AQDFSVDSESVGDDADDTLTNVPPDARDHNSETEQQENRQPPATNNSSDVR FT RNLDDSEAAQIPITSGDGSPTNNTGRPKRNILRPSKYKDMYLYNIFQ" XX SQ Sequence 2175 BP; 591 A; 544 C; 581 G; 459 T; 0 other; atggcgcagc tgctcccaca gggtaagaat ccgttaaagt ttggtcgacg gatgaccgga 60 taaccagttc aagtgaccgg gcgtttgcca ataatttggg accgttcgat ggggaaagtg 120 gttttttttc caatatggac gccttctagc gagctgtgat ttttccgttt gatggattgg 180 tattggttgg actccgaaaa ttgcgtgtag tggtgagatg ggcactcaga acatgggacg 240 tcgcttccgt tatgtgagtg aggctgtgcg agttgccggt tcaagaggtg tgaatgcaaa 300 acctcgcgac acgcatacac tcttggctac accccggatc atcgcgtgcg cttctaaagc 360 tttgtcagcg agcgagcaaa agtaccccca gtcgcagaaa gaggtgcttt cgatcgtatg 420 gggcgtcgaa cggttttcgt actatctgat cagcaagccg ttcgttgttc gtacggactc 480 ggaatcaaac gaaataatct tcgacagcga acaaagaacc ggaaaacggg cagtgactcg 540 agctgagacg tgggcactcc gtctgcaatc ttttcggttc aagcccaaaa gagttccagg 600 taaccagaac atcgccgacg cgctgtcacg cttgatagat aaatcacaaa tcgacgaacc 660 cttcgacgac atcaacgaga agcacgtttt gtatgccctt gatgccgccc acatgaacct 720 ctcttggaac acgatagaga tcgcatctga aactgatgaa gaaaattgtg ccgttcgatc 780 tgcattgcat accggagtgt ggccagaaaa ccttcgtcgc tatgaaacac aatcgaaaga 840 gctccgcgtg ctgggttcac tggttttcaa aagcgacaag atcattcttc ctctgaagct 900 tcgtcggacc gcgatcgaga ttgctcacca gggacacgtc gggtgcggtg ccaccaagaa 960 gctccttcgg gactacttct ggtggcccaa catgagcaag gaggcggtcg agtttgtcgg 1020 caagtgcgaa gcttgcctaa tggtatcaag aaaaaacccg cccctaccac tctccagccg 1080 cacacttccc caaggacctt gggagatctt acagatagac tttctgacac tgaacggatg 1140 tggatcgggg catttccttg tcatagttga tatgcactca aggtttttgc aagtcattga 1200 aatgaaaacc accgattgca caaaaaccaa cgccgcgctc tgcaaagcgt tcactacctg 1260 gggacttccg ctcatcatcc aaagcgacaa cggcccgcct tttcagagcc aagaatttac 1320 agacttctgg gaagacaaag gtgttcgggt gcgcaagtcg atccccctca gtgcccagtc 1380 aaatggaggc gtcgaacgac aaaatcaagg cataataaag tcgatttccg ctgccaaagc 1440 cgagcaacga aactggaggg aagccctaca agagtatgtg cacgtgcaca acaccaggaa 1500 ggaacactcg cgcctcgggg ttacaccgtt cgagctgttg gtgggctgga aataccgcgg 1560 aacttttcca gcactctggg aaactaagtc ttcagagaag gtcgataaag aacaggtagc 1620 cgaggacgac gccgcatcta aactgtgcag taaaaagttc gctgacaaca gcaggggcgc 1680 caaggagtcc gacattcaac ctggcgatag ggtggtcatt tctatcccgc aacgtaacaa 1740 actggacccg tccttttcga gagagaggta taccgtgcta gcaaggaaag gggcgaaagt 1800 ggtcgttagg agtgactgcg gggttcagct ggctcgaaac gtgcaggacg tcaagcgcgc 1860 acaagatttc agcgttgata gcgagtctgt tggggacgac gccgatgaca cgctgacgaa 1920 tgtaccaccc gacgcgcggg accataactc ggagacagaa cagcaagaga atcggcagcc 1980 acctgcgacc aacaattcat cggacgttcg caggaacctc gacgattcgg aggcggccca 2040 gattccgatt acttccggag acggttcacc aacgaacaat actggaagac ccaagaggaa 2100 cattctaaga ccaagcaaat acaaggatat gtatctgtac aacatttttc agtaatgagt 2160 aggactaagc gagga 2175 // ID Copia-50_AA-LTR repbase; DNA; INV; 105 BP. XX AC AAGE02025154; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-50_AA_; KW Copia-50_AA-I; Copia-50_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-105 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02025154; Positions 14098 13994. XX SQ Sequence 105 BP; 36 A; 29 C; 18 G; 22 T; 0 other; tgcaatcgta taccccttga ccaacactac acacaacaga gatataccat gaactgtatc 60 cgcaaagagg cggccctgca ttaaaacgtc aaaaggttgc tttca 105 // ID BEL-186_AA-I repbase; DNA; INV; 1916 BP. XX AC supercont1.94; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-186_AA_; KW BEL-186_AA-LTR; BEL-186_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1916 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.94; Positions 1900199 1898284. XX CC 'GCCGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 31..1893 FT /product="BEL-186_AA-I_1p" FT /translation="MEVQCEKCKSPALLVRCLTCSLCENHYHHECLGMVDS FT SPKSDWKCNECVSSTQGNQIASGLQRLQDALSSEREAIDRRFDRLQKHLNA FT LRAQIEKSPTPTNPIVNQLFASDVGESTDDETNITQNGSVLESGLQKMFAR FT QVIPSDLPIFTGKPEEWPVFISCYNNSTKACKFTNVENLMRLQRCLRGSAR FT RAVDSKLLLPECVPQVIETLKLLYGRPELLISTLISKLRDTPAPRDDDLNS FT LITYGLAVQNLSDHMIAAGLKSHLNNPCLLQEMINKLPTHLKLQWAAHKRD FT HDSSTIATFSKFMSELVEAASLVPSTIVLDQNDDRDEHSDDSDDDRGSSRS FT YRNNSRREKRKCCYVCGESNHRVSSCTVVANMPVSKRWELVNQLKICPPCL FT NRHLPWPCKTVKNCGVNECRLKHHQLLHSANDTAHTSGCTIDVGSCNRTLL FT KYIPVTLYGKSKQIDTFAFIDEGSSRTMLEAQVAEELEIDGKPDSLTLKWI FT DGTVNTNTTKTLTVGISGVNHTKRFSLRDVFTVENLDLSIQNFNSALLFDD FT SLKDLPPFRYASAKPRLLIGLSDAQLVVPSIVHVAKNNEMIACKCSLGWYY FT YGNSRGDNEKQNNRGEFVTHSLK" XX SQ Sequence 1916 BP; 594 A; 398 C; 430 G; 494 T; 0 other; ctaaagaatt tttcaataat atccgaccaa atggaggtcc aatgtgaaaa atgcaagtca 60 ccagctttat tagtgagatg cctaacctgc tcactatgtg agaatcacta tcatcacgag 120 tgtctgggca tggtcgattc ctcacctaaa tccgattgga aatgtaacga gtgtgtgtca 180 tcaacccaag gcaatcaaat agcttcaggt ttacaacgac tacaagacgc gctatcatct 240 gagagagaag ctatcgatag gcgatttgat cgtctgcaga agcaccttaa cgcattgaga 300 gcacaaattg aaaagtcacc tacaccaact aatccgatcg taaatcagct gtttgcatcc 360 gatgtaggcg aatctaccga tgacgaaacg aacataactc aaaacggtag cgtattggag 420 tccggattac aaaagatgtt tgctcgccag gtcatcccat cagatttacc aatattcacg 480 ggcaaacctg aagaatggcc cgttttcatt agctgctaca acaactctac gaaagcatgc 540 aaattcacca atgtagagaa ccttatgaga ctgcagcgat gtcttcgtgg ttctgctcga 600 cgagccgttg atagtaaact actgctccct gaatgcgtgc ctcaagtcat cgagacttta 660 aaattgttgt atggaaggcc cgagttgttg atatccactt tgatctccaa actgcgtgat 720 actccagcac cgagagatga tgatctgaac tcgttgatca catacggctt agccgttcaa 780 aacttgtcgg atcatatgat cgctgctgga ctaaaatcac acctaaacaa tccgtgtctc 840 ctccaggaga tgataaacaa acttccaaca catttaaagc tgcaatgggc cgctcataaa 900 cgagatcacg atagtagtac tatagccacg tttagcaaat tcatgtctga attggtggaa 960 gcagcgagtt tggttccatc caccatagta ctggaccaga acgatgatcg agatgagcac 1020 agcgacgata gtgatgacga tagaggatca agccgatcgt atcgcaacaa cagccgacgg 1080 gaaaagagga aatgctgcta tgtgtgcgga gaaagcaacc accgcgtttc tagttgtacg 1140 gtagttgcta atatgccagt gagtaaacgt tgggagcttg ttaaccagtt aaagatatgt 1200 cccccttgct tgaatcgaca cttgccgtgg ccatgcaaaa cggttaagaa ctgcggagtg 1260 aatgaatgtc gattgaaaca tcatcaattg cttcattcag cgaatgatac agctcataca 1320 agtggatgta ccattgatgt tggtagttgc aatcgtacct tgctaaaata catcccagtg 1380 accttatatg ggaagtccaa gcaaattgat acgttcgctt ttatcgacga agggtcgtcg 1440 cggaccatgc tagaggcgca ggttgcagaa gaactcgaga tagatgggaa acctgattca 1500 ctgactctaa aatggataga tggtacagta aacactaaca cgacaaagac cttgactgta 1560 ggaatttctg gggtaaacca cacgaaacgg ttttcattgc gggatgtgtt tacagtggaa 1620 aaccttgatt tatcaattca aaatttcaat tcagcattat tgttcgatga ttccttgaaa 1680 gacttaccgc cgtttaggta cgcaagtgcc aaaccacgtt tactgatagg tttgtcagat 1740 gcgcaactag ttgtaccatc gatagttcat gtagcgaaaa ataatgaaat gattgcttgt 1800 aaatgttctt taggctggta ttattatgga aattcacgag gagataatga aaagcaaaat 1860 aatcgcggag agtttgtaac acattctttg aaataattgt gttacggggg ggagaa 1916 // ID Copia-22_NVi-I repbase; DNA; INV; 4182 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE LTR retrotransposon from parasitic wasp: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; KW internal portion; Copia-22_NVi-I. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4182 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(7), 1512-1512 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(341..829,763..4152) FT /product="Copia-22_NVi-I_1p" FT /translation="MNYIYCSITNEQLEFVGEEDTALKIMNQFDKMYLKES FT TALQICIRNKIDRMKLKDYEEXSTFFIEFEKIINELKSAGAKVNEREKLDY FT MLKTLPESLSYVGDLIDSMQESTCEFLKNKIKMWETRSPNDSGNKKPNVFK FT AERKDLKCFGCGKKGHLRRACTNPKKRLKVFWLRKERSSKESVYKSVKRSQ FT WRNAVDRRCTATARQRLWQHGRGYGQRGRGYGQRGRGRGWQPRADGTSQHD FT ARYTYNEGYQEDGTGTFLTQVEKKNDRESETVVYNCDRGKIEWILDSGCSD FT HIINNDLYFSESIILKKPINVKVGDGRILKGTKVGKIITYFMINNKRIKIT FT INNVFYVKEMDKNLISYAKVTDKNKIISIGNVSKIYNKNNNLIGIAYKENG FT LYKINSYVEKLETYANNVEKITQKEKFHRILGHVNFNYLNTMCKEKLVEGM FT PKSFENVMLKCGTCIQNKMHNLPFQNNRSGANGILELVHTDLNGPHKNEGF FT DGSKYFLTFIDDFSKCALVYTLKSKDEVYNYFLDYINKVENLTGKKIKRLR FT CDNGREYMNKNMFNLCSEKGIIVEPCPPYVHELNGTAERYNRTIMNSARCL FT LYDSKLNIKYWPEIIKAAAYLKNRTITNTSQNKTPFEIFFGKKPNISNLKL FT YGSKVFVRVPEIKRQSKWDRKADIGILVGYESVGYRVLVNGKIIVTRHVDF FT IENENLIGFKGDDESDDESDISINENSESNEIEENVVNENKKERKPSNKSD FT EKIDDQEIELRRSEREKKKPDRYGQSDSYYIYVNIVSADSPKTYEEALQGN FT DCDLWKEAMNREINCLNKNKTWQLVEKPKDKKILDLKWVFTNKSDNHKKAR FT LVVRGFQQREVLEDLYSPVAKIQTLKLLLSYCCENGLMIIQMDVETAFLNG FT KIKSEVFVKQPLGYSDKTEKVYKLEKALYGLRESPRAWYECFDKYMQKLEF FT KRSVSDYCLYTKCENSETIYLILHVDDLLICGKTMRKINEIKTKLSQKFSM FT KDLGEVKTYLGININYDYKNCKMTMDQSNYIESLARQYNIENSKLFKTPME FT QNLSLESAQTASSDIKYRNLIGALLYISSGTRLDVSYSVNYLSRFQNSYNE FT THYKYALRVLKYLYLTKDLKLTYKKNENAEVMDSFVDADWAGDKSDRKSTT FT GFIIRLYGNVIYWKVRKQGSVTKSSTAAEYVALSEAVSEIKVVKNLLKDFS FT INLEKPIKIYEDNSGAIAIAKFGNMTKNSKYIEVHYHFVNECYEKKEIEIV FT KVDSENNIADILTKALGRNKFENFRMNLKIV*" XX SQ Sequence 4182 BP; 1776 A; 455 C; 831 G; 1116 T; 4 other; tggtagcaaa gtgatggttg aaatgttagt gaggttatgt acccacgtgt tgctggtagt 60 aaagtgaaaa aagaaaattg tgagaagctt gaaataaata cagtgaagtt agagcagcta 120 ataaaattac agattaagaa aatctggata ttaaaaaaat atatttttga aaataaaagt 180 gaaactacga gagacagaaa aatgtcgaaa actgatgatt taagatgtca tatatttgac 240 ggtaaagtat atttaaaatg aaaaagtgcg atcaaccagc aacaaaagaa aagttacaat 300 ctgaagacca gactgagtgg aatgagaaga attcaaggca atgaactaca tatattgtag 360 tataacaaat gaacagttag aattcgttgg agaagaggat acggcattaa aaattatgaa 420 tcaattcgac aaaatgtatt tgaaagagtc gacagcattg caaatatgca taagaaacaa 480 aatagacaga atgaaattaa aagattacga agaatsaagy acattcttta tagagtttga 540 gaaaataata aatgaattga aaagtgcagg tgccaaagta aatgaacgag aaaaattaga 600 ctacatgtta aaaactctac cggaatcatt aagttatgtt ggagacttaa tagattcaat 660 gcaagaaagc acgtgtgaat ttcttaaaaa taaaattaaa atgtgggaaa cacgaagccc 720 gaacgatagc ggcaataaaa agccgaatgt atttaaagct gaaagaaaag acttaaagtg 780 ttttggctgc ggaaagaaag gtcatctaag gagagcgtgt acaaatccgt gaaaagaagc 840 caatggagga atgcagtgga caggaggtgc acagcaacag cgaggcagag gttatggcaa 900 cacggacgag gttatggtca acgcggacga ggttatggtc aacgcggacg aggcagaggc 960 tggcaaccac gtgcagacgg aacgagtcaa cacgatgcga ggtacacgta taacgaaggt 1020 tatcaagaag acggaacagg aacattttta acacaggtag aaaagaaaaa tgacagagaa 1080 agcgaaaccg tagtgtataa ttgcgatagg ggtaaaatcg agtggatttt agatagtgga 1140 tgttcagacc acattataaa taatgactta tatttttctg aaagtataat tttgaaaaag 1200 cctattaatg ttaaagtcgg agatggtaga attttaaaag gcactaaagt tggtaaaatt 1260 ataacttatt ttatgataaa caataagaga attaaaataa cgataaataa tgtattctat 1320 gtaaaagaga tggataaaaa tttgattagt tatgctaaag taacagataa gaataaaatt 1380 atatctattg gaaatgtgtc aaagatttat aacaaaaata acaatctgat tggaattgca 1440 tataaagaaa atggtttata taaaataaat agttatgttg aaaaattaga gacatacgca 1500 aataatgttg agaaaataac acaaaaagaa aaatttcata gaatcttggg acatgtaaat 1560 tttaattatc taaatacaat gtgcaaagaa aaattagttg aaggaatgcc aaaaagttty 1620 gaaaatgtaa tgctaaaatg tggaacgtgc atacaaaata aaatgcataa tttgccattc 1680 caaaataatc gtagtggtgc aaatggaata ttagagttag ttcacacaga tttaaatggg 1740 ccacataaaa atgaaggttt tgatggatct aagtatttct tgacttttat agatgatttt 1800 agtaaatgcg ctttagtata tacgttaaaa tcaaaggatg aagtgtacaa ttatttttta 1860 gattatataa ataaagtcga aaatttaact ggcaagaaaa ttaaaagatt aaggtgtgat 1920 aatggtagag aatatatgaa taaaaacatg tttaatttat gtagtgaaaa gggaataata 1980 gtagaaccat gtccaccgta tgtacacgag ctgaacggaa cagctgagcg ctataatagg 2040 acaataatga actcagcgag atgtttactt tatgattcga aattaaatat taagtattgg 2100 ccagaaatta taaaagccgc agcgtattta aaaaatagaa caataacaaa tacaagtcag 2160 aataaaactc cgtttgaaat attttttggg aaaaagccaa acattagtaa tctgaaatta 2220 tacgggagta aagtatttgt tagagttcca gaaataaaaa gacaatcaaa atgggataga 2280 aaagctgaca taggtatact cgtaggctat gaaagcgttg gatacagagt tttggtaaat 2340 ggaaaaataa tagttacacg acatgtagac tttatcgaaa atgaaaattt aattggattt 2400 aaaggtgatg atgagagtga tgatgaatcc gacattagta taaatgaaaa ctctgaatca 2460 aacgaaatcg aagaaaatgt tgtaaatgaa aataagaaag agagaaaacc aagtaataaa 2520 tcagatgaga aaattgatga tcaagaaatt gaactgcgaa ggtcagaacg cgaaaaaaag 2580 aaaccggata gatatggtca atccgattcr tattatattt acgtaaacat tgtaagcgca 2640 gacagcccga aaacttacga agaagcttta cagggaaatg attgtgattt atggaaagag 2700 gctatgaata gagaaataaa ttgtttaaat aaaaataaaa cttggcaatt agtagaaaag 2760 ccgaaagata agaaaatttt agatttaaaa tgggtattta caaataaatc tgataatcat 2820 aaaaaagcga gattagttgt acgaggtttt caacaaagag aagtactgga agacttgtat 2880 tctccagtag ctaaaattca aacattgaag ttactattgt catattgttg tgaaaacggt 2940 ttaatgatta tacagatgga tgtcgaaacc gcattcctga atggaaaaat aaaatcagaa 3000 gtatttgtaa aacaaccttt aggttacagt gataaaaccg aaaaagtgta taaacttgaa 3060 aaagcgttgt atggactgcg agaaagcccg agagcttggt atgaatgttt cgacaaatat 3120 atgcaaaaat tagaatttaa aagaagtgta agcgattatt gtttgtacac aaaatgtgag 3180 aatagcgaaa caatttactt gattctacac gttgatgatc tattgatttg cggaaaaaca 3240 atgcgaaaaa ttaatgagat taaaaccaaa ttgtcacaaa aattttcaat gaaagactta 3300 ggcgaagtaa agacttattt aggaataaat attaattatg attataaaaa ttgtaagatg 3360 acaatggacc aaagtaatta catagagtca ttagctagac aatataatat tgaaaatagt 3420 aaactattta aaacaccgat ggaacaaaat ttaagtctag aatccgcaca aacagcatca 3480 agtgatataa aatatagaaa tctcatagga gcattattat atataagctc tggaacaaga 3540 ctagatgtta gttatagtgt aaattattta agtcgatttc aaaatagcta caatgaaact 3600 cattataaat atgcgttgag agttctaaaa tatttgtatt tgactaaaga cttaaagtta 3660 acttataaaa agaatgaaaa tgctgaagtc atggacagtt ttgttgatgc tgattgggca 3720 ggagataaat ccgataggaa atccacgaca ggatttatta taagattata tggaaatgtt 3780 atatattgga aagtccgaaa acagggcagt gtgacaaagt cgtcaacagc cgctgagtac 3840 gtagctttat cagaagctgt aagtgaaatc aaagttgtta aaaatttgct aaaagacttt 3900 agtataaatt tagaaaaacc aataaaaata tacgaggata actcaggagc catagcaata 3960 gcaaagtttg gtaacatgac gaaaaactcg aagtacattg aagtacatta tcattttgta 4020 aatgagtgct acgaaaagaa agaaattgaa attgtaaaag tagactcaga aaataatatt 4080 gctgatattt taacaaaggc tctgggtaga aataaattcg aaaattttag aatgaattta 4140 aaaattgttt aaagtttaag tagcataaaa attaaggagg tg 4182 // ID DNA4-9C_AP repbase; DNA; INV; 165 BP. XX AC . XX DT 27-AUG-2009 (Rel. 14.09, Created) DT 27-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-9C_AP. XX NM DNA4-9C_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-165 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1957-1957 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 165 BP; 36 A; 54 C; 44 G; 31 T; 0 other; tactctgttc aaaataagac gatgacctgg cggcgaccag ttttcgttcc gcgacggtca 60 gacgccgtcc ccactacggc gacgcgtcgg gcgctgccac ggaacaaagc cacgcccatg 120 cgcacaactc ggccgccggg tcatcgtctt attttgaaca gagta 165 // ID Sat104_Cis repbase; DNA; INV; 99 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; Sat104_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-99 RA Smit A.F.; RT "Sat104_Cis - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000104. XX SQ Sequence 99 BP; 9 A; 17 C; 34 G; 39 T; 0 other; gggtagtgtt gccctgtatt tgtggtatgc ctggggtagt gttgccctgt atttgtcgta 60 tgcctggggt agtgttgccc tgtatttgtc gtatgcctg 99 // ID Gypsy-83_AA-I repbase; DNA; INV; 4450 BP. XX AC supercont1.247; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-83_AA_; KW Gypsy-83_AA-LTR; Gypsy-83_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4450 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.247; Positions 115097 119546. XX CC Positions [3621-4121] - Integrase core CC 'GGCAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 496..1896 FT /product="Gypsy-83_AA-I_1p" FT /translation="MEKWDIPQFKFKSLPRNVVRHEWIKYKRNLDYIIAAT FT EETDRTRIKNIVLAKAGPDLQEVFASIPGADVQEDPEKTVDPFAVAINKLD FT EYFSPKQHETFERNLFWTLKPNAEETLGKFMLRCQDQAAKCNFGSNAEESR FT AISVVDKVILYAPNDLKEQLLQRDVLKLDDVTKIVSSYESVKHQAQSINLP FT GTSYASGTSYASESDYTSPSTINKIRQQSSKRECTRCGRFGHSANDAGCPA FT RSKECIKCKRIGHFVAQCRSLAILKRKQFTTQEPMWPNKRFKPHQVNEIET FT ADDKGNANFLFSISDGGELIRIQLGGIVLQVLVDSGCQKNIVNEQSWKYIK FT ANGAKIWNITKNCDEIFLPYGENAKPLTLLGKFDTTVSIDDAGTILETVAT FT FYVVKGGQQCLLGRVTATKLGVLFIGLPSTHGINAITTTKAQPFPKIKGVQ FT EKKEASSINEISNHELIGICFR" FT CDS 1965..4121 FT /product="Gypsy-83_AA-I_2p" FT /translation="MTKIEEKLNSLLASDIIEPVQGGCQWVSPLVTVVKDN FT GDLRLCVDMRRANAAILRERHIMPTIEDFLPRFTSARWFSRLDVKEAFHQV FT ELDPQSRYITTFITHMGLFRYKRLMYGIACAPELFQRIFEQILSPYSKNVV FT SFIDDVLVFASTEQEHDEVLKIVLSTLAEYGILLNQSKCLFKVSELEFLGH FT AISPDGICTSNSKVESLQKFRAPTTSEEVRSFLGLVTYVGRFLPNLATITA FT PLRELTHSGVKFTWGREQEESFVEIKRMIGNVQHLKFFDNTLRTRVVADAS FT PVALGAVLMQFKGPTDDDPRPIAYASKSLTTTEKRYCQTEKEALALVWAVE FT RFSVYLLGCKFELETDHKPLEAIFKPTSRPCSRIERWVLRLQSFSFVVKYR FT RGSSNVADPLSRLVEEQAPEEFDAESKFMVLAVLESAAVDIQQLEEMSSTD FT DTMEAVKQCLRTGNWDTQQVKSFAPFKNELGFVGDLVVRGNKLVVPGSLRS FT RMLDLAHEGHPGESVMKRRLRDRVWWPGMDRDAEQRVKSCEGCRLVSLPNR FT PEPMSRREMPFKPWIEVAIDFLGPMPCGTYLLVVIDYYSRYKEVEMMAKIT FT AKETVERLDRIFSRLGYPQTITLDNAKQFVGTDLEEYSKIKGITLRHSTPY FT WPQENGSVERQNRSLLKRLQISHALGRDWRRDLREYLLMYWQNSNRTVVWE FT NNQIKNTCFGRPRNDTHRG" XX SQ Sequence 4450 BP; 1303 A; 966 C; 1145 G; 1036 T; 0 other; ttggcgacga ggagcaaaat caggtaatga aacaatcgta gtattttgct ttcatgaaag 60 ttttccagca tttgtgatta taaaaaagtg aggttaagta acagtttatt ttctcgtttc 120 gctacggaaa ctcatagcga tagtcgcaaa ccgcaacggc agactgcggg aaactgcaaa 180 ggcagaccgg ggggaaagac cgcagaggcg gacaacgcaa actcgcgaag gcggacgacc 240 gaaaaaccgc agaggcggac gactgcaaaa ggccgcagag gcgggcacgt tactgattac 300 aggacgtgga atatagccgc aaaggcggta ggctgctata tcgcagtgaa tttcgaatcg 360 aattggacta aaggcattcc ttgcaacgga atagcagcca gagggcaacc cgtgaatagg 420 tatgtaaaaa aaaagctaaa tactaatagt gatagaactc actcaatgta acattgattt 480 tagaatatct cgacgatgga aaagtgggat attccccaat tcaagttcaa atcattgcct 540 cgcaatgtcg ttcggcatga atggatcaag tataaaagaa atcttgatta tattattgcg 600 gcaaccgagg aaaccgatcg tacaaggatt aagaacatcg ttctggcaaa agcgggacca 660 gacctacaag aggtcttcgc ttccatccct ggtgcagatg tacaggagga cccagagaaa 720 acagtcgacc cattcgcggt ggcgataaac aagctagatg agtatttttc gccgaaacaa 780 catgagacgt ttgagcggaa ccttttctgg acattgaagc cgaatgcaga ggaaacatta 840 gggaagttta tgttgcgatg ccaagaccaa gcggcaaagt gtaactttgg aagcaatgca 900 gaggaaagtc gagcaatcag cgtggttgac aaagtcattc tgtacgcacc gaacgatcta 960 aaagagcagc tgcttcaaag ggatgttttg aaactggacg atgtcactaa aattgtcagt 1020 tcatatgagt cggttaagca ccaggcacaa tcgattaacc tgccgggaac gagctatgcc 1080 agcggaacga gctatgctag cgagtcggat tatacatccc cgtcgactat caacaagatt 1140 cgacaacaat catcgaaaag ggaatgtaca cgatgtggcc ggtttgggca ctctgcgaac 1200 gatgctggtt gtccggcgag atccaaagag tgtatcaaat gcaaacgcat cggacatttc 1260 gttgcacaat gccgatctct cgctatttta aaacggaagc agtttacgac gcaagaaccg 1320 atgtggccga ataagcgttt taaaccacac caagtgaacg agattgaaac tgccgatgac 1380 aagggaaatg caaattttct gttcagcatt agcgatggag gtgagctcat ccggatccag 1440 ctaggtggaa tagttctgca agtacttgtg gactctggat gccagaaaaa cattgtcaat 1500 gaacaatctt ggaagtacat caaggcgaat ggagcaaaga tttggaacat aacaaagaac 1560 tgcgatgaga tatttctgcc gtacggcgaa aatgcaaagc cgttgacact tttggggaag 1620 tttgacacaa ctgtgtcaat agatgacgcc ggcacgattc tagaaacggt cgctacgttt 1680 tatgtggtca aaggtggtca acagtgcttg ctcggtcggg tcacggctac gaagttgggc 1740 gttctgttta tcggattgcc cagcacgcac ggaatcaatg cgatcactac tacaaaagct 1800 caaccgtttc ccaagattaa gggtgttcag gaaaaaaaag aggctagctc aattaatgag 1860 atttccaacc atgagttgat tggtatctgt tttaggtgaa aatacccatt gatgaatctg 1920 tgccacctgt ctgccaacaa ccgcggcgac ctcccattgc gctcatgact aagattgaag 1980 aaaaactgaa ttcacttcta gctagtgata tcatcgagcc agttcaaggc ggttgtcagt 2040 gggtatctcc gttggtgact gtggttaagg acaacgggga tcttcgctta tgcgtggata 2100 tgcgccgagc aaacgcggcc attttgcgag aaaggcatat tatgccaaca atcgaggatt 2160 tcttgcccag attcacatct gcacggtggt ttagtcgact tgatgttaaa gaagcgtttc 2220 atcaagtcga gctggatccc caaagtcgtt acataaccac attcattact cacatgggcc 2280 tcttccggta caagcgattg atgtacggta ttgcatgtgc cccagagctg tttcaaagga 2340 tctttgagca gattttgagt ccctacagca agaatgtggt cagcttcata gatgacgttc 2400 tggttttcgc cagcacggag caagaacatg acgaggtcct caaaatagta ttgtccacgt 2460 tggccgaata tggaatccta ttgaatcaga gtaaatgtct attcaaggtt tctgaattgg 2520 agttcttagg gcatgccata tcccctgatg gcatctgtac ttcgaacagt aaggttgaat 2580 cccttcaaaa atttcgtgcg cctacaacat ccgaggaggt ccggagcttc ttgggtctgg 2640 tgacgtatgt cggccgtttc ctaccaaacc tagcaacaat aactgctccg ctccgtgagt 2700 taactcattc gggggttaag tttacctggg gcagggaaca agaggagtcc tttgttgaga 2760 ttaaacggat gataggtaac gtccaacact tgaagttctt tgacaacacg ctgcgcacaa 2820 gggtggtcgc tgacgcttcg cctgtggcac tcggtgctgt tcttatgcag ttcaaaggac 2880 ctactgatga cgacccccgc ccgatagctt atgctagcaa aagtctgaca acgacagaga 2940 agcgctactg ccagacagag aaagaagctc tcgcgcttgt ctgggcagtg gagcgttttt 3000 ctgtctatct cttagggtgc aaatttgagc tagagacaga ccataagccc ttagaagcga 3060 ttttcaaacc tacttctaga ccatgctcac gcattgaacg gtgggtgctt aggctccagt 3120 cattttcgtt cgtagtaaaa taccgtagag gcagtagcaa cgttgctgac ccattatcca 3180 gattggttga ggagcaggca ccggaagagt ttgacgcgga gagcaagttt atggtgctag 3240 cggttctaga atcagcggct gttgatatac aacagctcga agagatgtcc agcacagacg 3300 atactatgga agccgtaaag cagtgtttgc gtaccggtaa ctgggataca caacaagtca 3360 aatcattcgc tccgttcaag aacgaattag ggtttgttgg agacttagta gtcaggggaa 3420 ataaactggt ggtgccgggt agcctaagat cacggatgct ggatctggct catgagggcc 3480 atccgggaga atcggtaatg aagaggcgtt tacgtgaccg agtttggtgg cccggaatgg 3540 accgggacgc agaacaaaga gtgaaatcat gtgaagggtg cagattagtg agccttccca 3600 acagaccgga accgatgagt cgtcgagaga tgccttttaa gccatggatt gaggtggcca 3660 tagactttct cggacccatg ccttgtggaa cctatttact ggttgtaatc gactactata 3720 gccgttacaa ggaggtcgag atgatggcaa aaatcacggc gaaagaaacc gtagaaagac 3780 tcgacagaat atttagccgc ctcggctacc cacaaacaat taccttagat aatgcgaaac 3840 agtttgtagg aaccgatttg gaagaataca gcaaaataaa aggaatcacc ctcagacact 3900 ccacaccata ctggccgcaa gagaatgggt cggtagagcg tcagaaccgc tctctcttga 3960 agcgccttca gatcagtcat gcgctcggta gggactggcg gcgggatctt cgtgagtatc 4020 tgctgatgta ctggcaaaac tccaaccgaa ctgttgtatg ggagaacaat cagatcaaaa 4080 atacctgctt tggacgacct cgaaacgaca cccaccgggg atgagacgcg tgacagagat 4140 cgactgttga aggacaaggg aaaacaatca gaagatctga gacgtcatgc tagagagtct 4200 tctctatcca ctggtgacac agtgctgatg cagaaccttt tgcctggcaa caagctgtca 4260 acgacgttca acccgaagaa gtagctaccc gagagttcca gctcagaaga ggagtttcac 4320 ggattcgact tggaggcggg caacgatgat cctaaaccgg ttccttccac tcgacagcga 4380 cgcataacaa agaaaccctc tagatttgca gattacgttc cttaacattc tactattgaa 4440 aaaggggaga 4450 // ID CR1-40_HM repbase; DNA; INV; 4500 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-40_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4500 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1868-1868 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 876..3971 FT /product="CR1-40_HM_2p" FT /translation="MANDFITKDFDKLRFDFFKTQNNIFANNLYDDDLFIF FT NGPDSPYFDTKTFKAELHNNIDEFTVIHINIRSLNTNIDKLKIFLFECKYS FT FSMICLTETWCSDENSYTNSNLLIPNYNLISYERKANKRGGGIITYIHNDL FT NTKIRNDLSISDSNSEVFTVEVINDTTKNLLISSCYRPPDGDIIMFSSFMK FT QIYTTSNVEHKKLFCIGDMNINYLSYHEHANTKIFFDDMIQHYIFPIINKP FT TRVTPTSITAIDNILTNSIQDPSLKVGIIKTDISDHFPIYFSLSQQATRVN FT NLKIKIIKRSINKFSTQKLKDSLAVVNWSKIYHECNLGHTNSAYNNFINIF FT LNLYNKHFPKVEKEIKAKYLTTPWITQGIRKSSKQKQKLYIKYLKNRNHFN FT LSKYKQYKNLFEKIKLKSKKLYFSKQIQKFKGDNKKIWDIIKEIIGKDKKN FT SNQLPPKVLISEKEFNNKKIISEKFNKYFASIGSKLASKIQPTNNSFENYI FT EDSRCSLVFKELEYEELETAIKTLKINKGPGIDNIPANIILNVFPEISKPI FT FQIFNSSIITGIVPDYLKIAKIIPIYKSGDTSFLNNYRPISILPVFSKLLE FT RIIYNRLYKYLTTNKILNENQYGFQTQHSTEHAIVDLVNRISESIYEKKFV FT LGVFVDLSKAFDTINHEILLKKLEKYGLKNHTLQWFKSYLYNRQQCVITDD FT NYHSKLLKLTCGVPQGSILAPLLFLIYINDLPKVSSKLEFIMFADDTNLFY FT SSTNIKDLYDNVNIELQKLNIWFKTNKLSLNIEKTKYTLFHSNQQRNTLPN FT ILPSLKIENIVIERTHETKFLGVIIDENLSWKSHINALNTKISKNIGLLYK FT AKPMLSQSNLKLIYFSFIHSYLIYANIAWASTHITKLKTLYRRQKHAVRLI FT YHKDKFTHTEPLFKEMNVLNIYQINIYQNILFMLKFKLGQVPNHFLNNFFI FT NNVNKYNTREKGNFITPFKKTKLSQFSISYRGPYLYNKLVTKNKSICNLDS FT ICNQNFLKKKLKNLILNLNRYSEFF*" FT CDS join(61..567,536..814) FT /product="CR1-40_HM_1p" FT /translation="MDITMKSIEKIITKQLEEHKKSILKETERLLKEQEKS FT FTLIMSANLKIITDRLDTIEKDNNNNKSKIKNIEKDLNEIKDSLSFQENKT FT LENISHLKKYYDNQINTLYKKSIDLENRSRRNNLRIDGLKESPGESWDDCE FT KEVKKFFNNQLKISKEVVIERAHRIGQKKRKEHIASVKRKDNKPRTIVLKL FT LNFHDKNKILNSLTHLKGTGIYINEDFAQETIEHRRKLWEEVKKLRSEGKY FT AILKYDKIFSRDFKNTRFQK*" XX SQ Sequence 4500 BP; 1895 A; 715 C; 521 G; 1369 T; 0 other; ttcgcggaga ttttgcggaa acaagacgtg ttttttatta gcgataaaat ttacataaaa 60 atggatatta caatgaaaag catcgaaaaa attatcacaa aacaactcga ggaacacaaa 120 aaaagtattc ttaaagaaac tgaacgacta cttaaagagc aagaaaaatc atttacttta 180 ataatgagtg caaacctcaa aattataact gatagactag acacaataga gaaagacaac 240 aacaacaaca agtcgaaaat aaagaatatc gaaaaagacc tgaacgaaat aaaggacagc 300 ctcagttttc aagagaataa aactttagaa aacatttcac acttaaaaaa atattatgac 360 aatcaaataa atacattata taaaaagtca atagacttgg aaaatcgttc cagaagaaat 420 aatctaagaa tagatggact aaaagagtct ccaggagaaa gctgggatga ttgtgaaaaa 480 gaggtaaaaa aattctttaa taatcaatta aaaatatcaa aggaagtcgt tatagaaaga 540 gcacatcgca tcggtcaaaa gaaaagataa taaaccaaga acaatagttt taaaactcct 600 gaactttcac gacaaaaaca aaatactcaa ctcattaact catcttaaag gaactggcat 660 ttacataaac gaagattttg cccaagaaac tatcgaacat cgaaggaagc tttgggaaga 720 agtgaaaaaa ttacgcagcg aaggtaaata cgctatttta aagtatgata aaatttttag 780 tcgagatttt aaaaatacgc gctttcaaaa ataactttac tcaaatcgaa ttctattaaa 840 aaaaaaaaaa aaattgattt aattcttttt tcgtcatggc taatgatttt ataacaaaag 900 attttgataa gttgcgtttt gactttttta aaactcaaaa taatatcttt gcgaataatt 960 tatacgacga cgatttattt atttttaatg gacctgactc tccatacttt gatactaaaa 1020 cttttaaagc tgaacttcat aacaatatag acgagtttac ggtaatacac ataaacatta 1080 ggagcttaaa tacaaatatc gataagttaa aaattttcct tttcgaatgc aaatattcct 1140 ttagcatgat ttgtcttaca gaaacttggt gttctgacga aaattcttat acaaattcaa 1200 acctcctaat ccctaattat aatcttattt cttatgaaag aaaagctaac aaaagaggag 1260 gtggaataat aacctacatc cataatgact taaacacaaa aattagaaat gatctttcta 1320 tttctgattc aaatagtgag gtctttacag ttgaagtcat taatgatact acaaaaaatc 1380 tcttaatatc ctcttgttat agaccgcccg atggagatat tataatgttt tcaagtttta 1440 tgaaacaaat atatacaaca agcaatgttg aacataaaaa gctattctgt attggtgaca 1500 tgaacataaa ttacctaagc tatcatgaac acgctaatac caaaattttt ttcgatgata 1560 tgatacaaca ttacattttc ccaattataa acaaacccac ccgggtaact cccacctcaa 1620 ttacagctat tgacaatata ttgacaaatt caattcaaga tccatcctta aaagtaggaa 1680 taattaaaac ggatatatct gaccacttcc caatttactt ctcactatca caacaagcaa 1740 cacgtgtcaa taacttgaaa ataaaaatca tcaaaagaag cataaataaa ttctcgaccc 1800 aaaaactaaa agactcacta gcggtagtaa attggtcaaa aatatatcat gaatgtaacc 1860 tcggacacac aaattccgct tacaataatt ttataaatat tttcctaaat ctctacaata 1920 aacattttcc aaaggtcgaa aaagaaatta aagcaaaata cctaaccact ccatggatta 1980 cccaaggaat aagaaaatcg tcaaaacaaa aacaaaaact ctacataaaa tatctaaaaa 2040 atagaaacca ttttaatttg tctaagtata aacaatataa aaatctattt gaaaaaataa 2100 aattaaaatc caaaaaacta tatttttcaa agcaaataca aaaatttaaa ggagataata 2160 aaaaaatctg ggatattata aaagaaataa ttggtaaaga taaaaaaaac tcgaaccaac 2220 tacctccaaa agttctcata agtgaaaaag agtttaataa taaaaaaatt atctctgaaa 2280 aatttaacaa atactttgca agtataggtt ccaaactcgc ctctaaaatt caacctacaa 2340 acaattcatt tgagaattac atcgaggact ctcgttgctc cctagttttt aaagaactag 2400 aatacgaaga acttgaaaca gcaattaaaa cattaaaaat taacaaaggt ccaggtatcg 2460 acaatatccc agccaatatt attttgaatg tcttcccaga aataagtaaa cccatttttc 2520 aaatatttaa ctcttcaata attacaggaa ttgtacccga ctatttaaaa atagctaaaa 2580 ttattcctat ctataaatcc ggagatacat ctttcttaaa taattacaga ccaatatcaa 2640 tccttcccgt attttcaaaa ctgctcgaac gaataatcta caatcgttta tataaatacc 2700 taaccacaaa caaaatcctg aatgaaaacc aatatggttt tcaaactcag cactcgacag 2760 agcatgctat cgtagacctc gtcaatagaa ttagtgaatc aatctatgaa aagaagttcg 2820 tattaggagt ctttgtagac ttgtcaaaag cgttcgacac aataaatcat gaaatcctac 2880 taaaaaagtt ggaaaaatac ggattaaaaa accatacact gcagtggttt aaaagttacc 2940 tgtataacag acaacagtgt gttattacag atgataatta ccacagtaaa ttattaaaat 3000 taacttgtgg tgttcctcaa ggttcgattc ttgctcctct tttgttccta atctatataa 3060 acgaccttcc aaaagtttct agcaaacttg aatttataat gtttgcagac gatacaaatt 3120 tattttactc ttcgacaaac atcaaggatc tgtatgataa tgtaaacata gaactccaaa 3180 aattaaatat atggtttaaa acaaataagt tatcgctaaa catagaaaaa acaaaataca 3240 ctttatttca ctcaaaccaa caaagaaaca cactaccaaa tattctaccc tctctaaaaa 3300 tagaaaatat agtaattgag agaactcacg aaactaaatt tttaggagtt ataattgatg 3360 aaaatttatc ttggaaaagt cacataaatg ctttaaacac taaaatatca aaaaacatag 3420 gtctacttta taaagcaaaa cctatgctgt cgcaaagtaa tttaaaactt atatatttct 3480 cttttataca cagctacctc atatatgcga atatagcttg ggcgagtacg catataacca 3540 aattgaaaac tttatatcga cgtcagaaac atgctgttag acttatctac cataaagaca 3600 aatttacaca cactgaaccc ttatttaaag aaatgaatgt gcttaatatt tatcaaatta 3660 atatttacca aaatatattg tttatgctaa aatttaaatt aggtcaagta ccaaaccatt 3720 ttttaaataa cttttttata aataacgtta ataaatataa cacaagagaa aaaggaaact 3780 ttattacacc tttcaaaaaa acaaaacttt cccagttctc aatttcatat cggggtcctt 3840 atttatacaa caaactagta actaaaaata aatcaatctg caatctagat tcaatttgca 3900 atcaaaattt cttgaaaaaa aagttaaaaa atctgatact gaatctaaac agatactcag 3960 aattcttcta agactataaa ttaaaataca tatatatcaa ctacaaagtg aactgactcg 4020 tttatgtgta ttatctaaat gaatcttttt tttttttctc aaatttctca aacaatatat 4080 caaaatgtgt ttattacaca caaaaaaaac acttgatttt ttcttatcaa gtttagttat 4140 tatgctgcta gttatgtcta atattttata ctgagcttat tttgagttta ttttttatta 4200 tgagctaaat ttaaagaata tataaacata gttgtactta atatataatt atatttatat 4260 attttatata tatttatatt tatatatata tttaacataa tcataataat atatcaaaca 4320 cattgtaagg gcttcgtgat aagatcatta cgatcttctt gaagtccggc ctgtactatt 4380 taattataaa tttcactata tatattttga ctcacgagac ttgtaaatat tacttttaat 4440 ttacgaaact tgtaaatgtt atatgtacgg caaatatatg aataaaaaaa aaaatatata 4500 // ID Galileo_DB repbase; DNA; INV; 5407 BP. XX AC . XX DT 28-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 2) XX DE Drosophila buzzatii transposon Galileo, putative complete. XX KW P; DNA transposon; Transposable Element; Galileo_DB. XX NM Galileo_DB. XX OS Drosophila buzzatii OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-5407 RA Marzo M., Puig M. and Ruiz A.; RT "The Foldback-like element Galileo belongs to the P superfamily RT of DNA transposons and is widespread within the Drosophila RT genus."; RL Proc Natl Acad Sci U S A 105(8), 2957-2962 (2008). XX DR [1] (Consensus) XX CC This sequence was assembled from several PCR products. Therefore, CC it is listed as "consensus". XX FH Key Location/Qualifiers FT CDS 1349..4086 FT /product="Galileo_DB_1p" FT /note="Putative transposase." FT /translation="MAQISVVNEKSVGVKKXLEKCLGXSGAKCVIETCGAS FT YRQSSVNFPVKLFSFPSKEIEFNKWVKTCHLPSNFNRKKAKICDRHFERKY FT IGKRKLKANAVPTLNLCDPNFFSNSADFNDDFRLVDNRGAEHQNENVPNEC FT DDELIENLLIFDDNSKKEFWQNLAVDRTPYCLNCIKREQNEVYYRKKYYEI FT GLDLKKVQERYTKLKRRFISFKRVSNYRGVSFRVRRAKKTVNVFTTINSLA FT HVSEQSKVLCKMLLKNTNFYNSAERVLSQNINFYSARAYEYLRDVLHLKLP FT SKKSLNRWAIFKNLTPGSNPELLENLQGIVEKMSDKGKYAVLVFDEVKIKK FT GLQYNSYLDEIQGFENDGEKRTKFLGQQVCVFLIRGLFENWKYVLSYTVSA FT NGIRHSDLKSKVEANIGLSQALGLNVKAVVCDQGSNNRAVFDRWGIDINKP FT SFHVNDKEIFAVFDAPHLVKSLRNILLRHNISTTQGTVSXNIIRKLYEIES FT KNLTRLCPKLTSKHVSPNCFEKMKVKYATQVFSHSVAAAIRTVIDSGGFSD FT CKDSAVATAIFIEKINRLFDCLNSHVLFDSNPYRCALTRNNNVHEYLQEMR FT DYFHDLQYPQKVYCITGMIITISSVIALAENIWNDNNDLFFVATSKLNQDP FT LENLFYLIRSRGATNTNPTIFEFNSIISKMLSMKVLTSASISGNCILDEDS FT MLANIIKDSGSTLSVFHSQCEIHSSVYEEPSDPDFEIELSLDSTIVNIQNA FT FNENALRYFAGYLLHKLLQRTDCEVCTNLLKGSDEMQCSSEYLILNKNYNY FT IHQYLKLKAPSDNFYNIIKIHFDIFQKIFDKKPFIAKLKEKIILHCMRATA FT KSTLHSDWFSPSHPCFDHRKFMLNQFVLILIRKNCKWLTDSIVSKSSNLSK FT SKLKMIREX" XX SQ Sequence 5407 BP; 1777 A; 928 C; 1027 G; 1668 T; 7 other; cactaaccat acaacacata gactggacaa ctggaacaaa ttaatgtgca cacttatttt 60 ttgacattag aaccccatca acttcgtatt tgctcgggtt tttacttttc gggtcgggag 120 agaaaaattg ctgatgacaa aatcattccg cttcgagcgc tcgacggtga cggtcgattt 180 tccaaaaaat cctgacatgt gccgctcttt cccttgctct ctttctcttt ctgccccatg 240 ctttttactc tctgctctct tctcacaaaa atatatgtat gctggcacaa ccagcaagca 300 gctcacaaaa attgaaattt cttctcgacg cgatcccatt ggttcccttc tgagcgaaga 360 gtataccgaa ttcttaaaca gctttgccat gctcatgcaa aatgttagcg ttttcatgat 420 tatttgtgga aagggtatga aagtgttgat gtgcatacag aatatttttt ctcgctgacc 480 atacacacac acatttacac ttcctcgcta tatgtgtagg catttatgtg cctgtgtgca 540 atatgcttgc tgcctgtagc acattagaac cccagcagtt gatgttcgtt tggttgggta 600 gcgatgtgag cttgcataca cacacacatt tacacttctt cgctatatgt gtaggcattt 660 atgtgcctgt gtgcaatatt cttgctgccg cctgtagagc actattagaa ccccagcagt 720 tgatcgttct gtacacatac acatttacac ttccttgcca tatgtgtagg catttatgtg 780 cctgtgtgcg atacacttgc tgcatgtagc aattctaatt tccgagcgat tgctgttggc 840 tagccaatgt cagcatgcat acgcacacac atacatacat atgtgtacgt ctgtgatgtg 900 ataatcacaa tcgccgagta tcacttgtat aggcacttga atgaatgctc tgcctggggc 960 atttgcatgt ttcgtgcatg ggtctcgaat attgggcgtg tgcctaccta ctttgaatta 1020 agtaaacgat tttgggttca aattgcacga gaggtcaatt attttgtatt acctgttgat 1080 cggttatcat ttgttcttta atatttagta cctgatttga ccctataggc aaaaatgata 1140 catgttattc aaatgcttgc gcaaattcag gtcacatata acccttttta ataaatgcct 1200 acatatacag gtaaaattta cccttttatc ctatataaag tagggggaag tgcacttgtc 1260 actttagttg acactcaact tccgaaccga acggtgtttt tctaattttt acaaaacttt 1320 acaaaatata taagtttttt ttaaaactat ggcgcaaata agtgttgtga acgagaaaag 1380 tgttggtgtg aaaaaannnc tagaaaagtg tttaggaana agtggtgcaa aatgtgttat 1440 agaaacatgc ggtgccagct acaggcagag ttcagtaaat tttccggtga agttgttttc 1500 ttttccatca aaagaaatag aattcaacaa atgggttaaa acatgccatt tacccagtaa 1560 tttcaaccgg aaaaaggcaa aaatatgtga cagacatttc gaacggaaat atattggtaa 1620 gcgtaaatta aaagctaacg cagttccgac cctaaattta tgcgacccaa actttttttc 1680 taatagtgca gattttaatg atgattttcg tttagtagac aatagggggg cggaacacca 1740 gaatgaaaat gttccaaatg aatgcgatga tgaacttata gagaatttat taatctttga 1800 cgacaattca aaaaaagaat tttggcaaaa tttagcagtt gataggactc cttactgttt 1860 aaattgtata aagagggagc aaaatgaggt gtattacaga aaaaaatatt atgaaatagg 1920 tttagatctc aaaaaagtac aggaaagata tacaaagtta aaaagaaggt ttatttcttt 1980 taagagagtt tctaattata gaggggtttc atttagagtt aggagggcaa aaaaaactgt 2040 taatgtcttt acgacaatta acagtttggc acatgtttcc gaacaatcaa aagttttatg 2100 taaaatgctt ttgaaaaata caaattttta taatagtgct gaaagggttc tctcacagaa 2160 tattaatttt tattctgcca gggcatatga atatcttaga gatgttcttc atttaaaact 2220 gccatctaag aaatctttaa atagatgggc gattttcaag aacttaacac ctggatctaa 2280 tccagagttg ctagaaaatc tgcagggtat tgtagaaaaa atgagcgata agggaaagta 2340 tgcagtatta gttttcgatg aggttaaaat taagaaaggt cttcaatata attcatatct 2400 tgacgagatt caggggtttg aaaatgatgg agaaaaaaga actaagtttc taggtcagca 2460 ggtctgtgtt ttccttatta ggggcctttt tgaaaattgg aaatatgttt taagctatac 2520 tgtttcagct aatggcataa gacattcaga tcttaaatct aaagttgagg caaacattgg 2580 actatcgcaa gcattaggtc ttaatgtcaa agccgttgtt tgtgatcagg gttccaataa 2640 cagggccgtt tttgatagat ggggcataga cattaacaag cctagctttc atgttaatga 2700 taaagaaata tttgcggtat tcgatgcacc acatcttgtt aaatcgctta gaaatattct 2760 tttaaggcat aacatctcca caactcaggg cacagtttcg nnnaatataa ttagaaaatt 2820 atacgaaata gaatctaaaa acttgacacg tttatgtcca aaattgactt caaaacatgt 2880 gagtccaaat tgttttgaga aaatgaaagt caaatatgca acccaggttt ttagccacag 2940 tgtagctgct gcgatacgca ctgttattga ttcgggtggg ttttctgatt gtaaggatag 3000 tgcagttgca acggcaatat ttattgaaaa aataaataga ctttttgatt gtttaaatag 3060 ccatgtgtta tttgacagca atccttatag gtgcgccctt acaaggaaca ataatgtgca 3120 tgaatacctt caggaaatga gagactattt tcatgacctg caatatcctc aaaaagtata 3180 ttgcattaca gggatgatta ttacaatctc atctgtaatt gctttagctg aaaatatttg 3240 gaatgacaac aatgatctct tctttgttgc cacgtcaaaa ttaaaccaag atccattaga 3300 aaatttattt tatttaatta gaagtcgggg agcaacaaat acgaacccaa ccatttttga 3360 atttaattcc ataatatcca aaatgctttc tatgaaagtt ttaacatcgg cttcaatatc 3420 tggaaattgt attctagatg aagattcaat gctagctaat ataataaaag atagcgggtc 3480 caccctttct gtatttcata gccaatgtga gatacattcg tctgtatatg aagaaccttc 3540 agatccagat tttgaaatag aattgtcatt agattcaaca attgtaaaca ttcaaaatgc 3600 tttcaatgaa aatgcattgc gatattttgc gggatatctc ctgcacaagc tgttgcaaag 3660 aactgattgt gaagtttgta ccaatctttt aaaaggatca gatgaaatgc agtgttcctc 3720 tgagtatttg atacttaata aaaattataa ttatattcat caatacctta agttaaaggc 3780 cccttctgac aatttttata acataattaa aattcatttt gacatattcc aaaaaatatt 3840 cgacaagaag ccatttatag ctaaacttaa agaaaaaatt attcttcact gcatgcgcgc 3900 tactgctaaa tcaaccttac atagtgattg gttttcccct tcccaccctt gctttgacca 3960 tcgtaaattc atgcttaatc agtttgtttt aatattaata agaaaaaatt gtaaatggct 4020 tacagatagt atagttagta aaagtagtaa tttaagtaaa agtaaactga aaatgattcg 4080 tgaataagaa atttaaaata aataaaacat aaaacataat atatttttgt tgttatttag 4140 tgaaaaatat gtctacataa cggatgcata ttttttgcat aaaagggtaa attttacctg 4200 tatatgtagg catttattaa aaagggttat atgtgacctg aatttgcgca agcatttgaa 4260 taacatgtat catttttgcc tatagggtca aatcaggtac taaatattaa agaacaaatg 4320 ataaccgatc aacaggtaat acaaaataat tgacctctcg tgcaatttga acccaaaatc 4380 gtttacttaa ttcaaagtag gtaggcacac gcccaatatt cgagacccat gcacgaaaca 4440 tgcaaatgcc ccaggcagag cattcattca agtgcctata caagtgatac tcggcgattg 4500 tgattatcac atcacagacg tacacatatg tatgtatgtg tgtgcgtatg catgctgaca 4560 ttggctagcc aacagcaatc gctcggaaat tagaattgct acatgcagca agtgtatcgc 4620 acacaggcac ataaatgcct acacatatgg caaggaagtg taaatgtgta tgtgtacaga 4680 acgatcaact gctggggttc taatagtgct ctacaggcgg cagcaagaat attgcacaca 4740 ggcacataaa tgcctacaca tatagcgaag aagtgtaaat gtgtgtgtgt atgcaagctc 4800 acatcgctac ccaaccaaac gaacatcaac tgctggggtt ctaatgtgct acaggcagca 4860 agcatattgc acacaggcac ataaatgcct acacatatag cgaggaagtg taaatgtgtg 4920 tgtgtatggt cagcgagaaa aaatattctg tatgcacatc aacactttca taccctttcc 4980 acaaataatc atgaaaacgc taacattttg catgagcatg gcaaagctgt ttaagaattc 5040 ggtatactct tcgctcagaa gggaaccaat gggatcgcgt cgagaagaaa tttcaatttt 5100 tgtgagctgc ttgctggttg tgccagcata catatatttt tgtgagaaga gagcagagag 5160 taaaaagcat ggggcagaaa gagaaagaga gcaagggaaa gagcggcaca tgtcaggatt 5220 ttttggaaaa tcgaccgtca ccgtcgagcg ctcgaagcgg aatgattttg tcatcagcaa 5280 tttttctctc ccgacccgaa aagtaaaaac ccgagcaaat acgaagttga tggggttcta 5340 atgtcaaaaa ataagtgtgc acattaattt gttccagttg tccagtctat gtgttgtatg 5400 gttagtg 5407 // ID Gypsy2-LTR_SM repbase; DNA; INV; 808 BP. XX AC . XX DT 18-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy2-LTR_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-808 RA Jurka J.; RT "LTR retrotransposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 92-92 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 808 BP; 345 A; 106 C; 162 G; 195 T; 0 other; tgttgaggat acctaaataa tagagtaaaa taattaataa acatgtaaaa tgattggggg 60 aaacaatgag aagtgagtaa gatgtggttc tggagcttaa taaaaagaga atgaaaaaga 120 aatgttgaat aacgagaaag gaataagaga aaatcaataa aaacagacaa ttaaatcaaa 180 gaaggtcatt tataattaac aagtgcaaat ttaggaaaac aaaaacaaaa ttcgcacatt 240 atctcgaaag ccctttcaag ataagggaag aaatgttgtt tgagaatccg ggaaagggca 300 tctccaaaaa atgatttatc tggacagctg tcagtgatta tggaggccca aaatgaatta 360 gcgttaaaag atatcagaaa tcagtaagtg ttattgaatg atggagtata aaaaggaagc 420 gcaattgaaa tcgtaacaca caaaaaataa gacaaaactt gcagaagaca aagaaagaca 480 aaaaagaaat taaagagaaa tttgaaacag aaattaagca aggtcaaagc aaagcaaagc 540 aactaaatca gagttaccga tctcagaaac ccaccgcagt agtcaggtga ttgtaaagcc 600 aattgatccg aaaagtgcta ttcttcccga agggtcgtaa taagcactaa gtgtgattga 660 ttgataattc gttgtgtcat tttaaagtgg agataaagtg tattaatcaa taaagtttat 720 atcgtttgga gttcgttact gttgtttaaa gcacgcttgg agtcagattg acaataattc 780 ctattcaaat tcaccgacaa ccccaaca 808 // ID DNA-2_SM repbase; DNA; INV; 542 BP. XX AC . XX DT 06-NOV-2008 (Rel. 13.11, Created) DT 06-NOV-2008 (Rel. 13.11, Last updated, Version 2) XX DE Putative DNA transposon. XX KW Merlin; DNA transposon; Transposable Element; Nonautonomous; KW DNA-2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-542 RA Jurka J.; RT "Non-autonomous DNA transposons from flatworms."; RL Repbase Reports 8(11), 1797-1797 (2008). XX DR [1] (Consensus) XX CC 8 bp TSD. Partial ORFs match other Merlin sequences. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 542 BP; 140 A; 115 C; 137 G; 150 T; 0 other; ggtacttgtt cattatcggg acccccagta aattttcaac tattttgatt agaatgaata 60 tctttaactt gccgagcacc gaggaggagg ccgttgcgtt tcttcaagag aagggtatct 120 tgccggccaa tcgaatctgt ccgaacggtc acgaaatgaa gctctacttt ggcacacgca 180 ttttttggaa atgtaacatc aagtcttgca agaagaaggt caacatgcgc gatggaaact 240 ggtacaactt cgtggatccg gacaccggca ctcacaccca gaccgtcgaa cgtatgtggg 300 gatcggccaa gtggcgcaac aagaaacacc gaggcaccgc acgtcatcat ttggagtcgt 360 atttggctga gttcatgtgg cgcaagcatg ttgccagaga ggatgttttc gaggctttgt 420 tagaagcaat tgtggctttc tggcctcctg agtctcagat gcagtagttg taattgttga 480 attaaatgtt ttcaccgaat ttttgatttt tttactgggg gtcccgataa tgaacaagta 540 cc 542 // ID EHAPT2 repbase; DNA; INV; 542 BP. XX AC . XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 29-MAY-2010 (Rel. 15.06, Last updated, Version 2) XX DE Entamoeba histolytica ehapt2 non-LTR retrotransposon, consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; nonautonomous non-LTR retrotransposon; KW Repetitive element; EHAPT2. XX NM EHAPT2. XX OS Entamoeba histolytica OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. XX RN [1] RA Willhoeft U., Buss H. and Tannich E.; RT "The Abundant Polyadenylated Transcript 2 DNA Sequence of the RT Pathogenic Protozoan Parasite Entamoeba histolytica Represents a RT Nonautonomous Non-Long-Terminal-Repeat Retrotransposon- Like RT Element Which Is Absent in the Closely Related Nonpathogenic RT Species Entamoeba dispar."; RL Infect. Immun 70(12), 6798-6804 (2002). XX RN [2] RA Cruz-Reyes J., ur-Rehman T., Spice M.W. and Ackers P.J.; RT "A novel transcribed repeat element from Entamoeba histolytica."; RL Gene 166(1), 183-184 (1995). XX RN [3] RA Gentles A., Kohany O. and Jurka J.; RT "Entamoeba histolytica ehapt2 non-LTR retrotransposon, RT consensus."; RL Direct Submission to Repbase Update (30-JUN-2004). XX DR [3] (Consensus) XX CC Average similarity to consensus 92%. Transcribed into polyA+ CC RNA, with no ORFs. It's 5' termini is homologous to R4. XX SQ Sequence 542 BP; 222 A; 76 C; 96 G; 146 T; 2 other; ggcacgtctg aaacaccaca cacaaaccct agtacaaatt cattcttcga ctctcccagt 60 tattatctgg ttatgacggt gcytttgaat taggaatgta ttagggaatg ctgcaaaggg 120 tgcagcaaga gaatacagta gaatattaca tggatgtaat ataagaatct actgaagtgt 180 gggtatgact aaaagaagat tagtcaaagt aagactaaaa agaarattag tcaaagtaat 240 acagtagtaa taaaatgatt ccttctccca ttcataaaat aagaaaaatg aaattcctta 300 aaattaaggc agaaaacaaa caaaggctta aaaagaagaa ataagcagaa gaagtttgaa 360 aaaccttaat aggaagaaat aaagcaaaga agtgctttcc tcattttgca agaaaaacct 420 aaagaatagg ttaacaaaga gattactctt ttttaataag ctcagggatg ggattagtct 480 cccctgagct aggaagaata gatgaaaatt ctattaatac ttaattaatt actttttctt 540 tt 542 // ID Copia-10_SI-I repbase; DNA; INV; 4071 BP. XX AC AEAQ01016285; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_SI_; KW Copia-10_SI-LTR; Copia-10_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4071 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01016285; Positions 4862 792. XX CC Positions [1563-2063] - Integrase core CC 'AAAAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 108..4037 FT /product="Copia-10_SI-I_1p" FT /translation="MADDEKYKVPLFDGTNYSNWKFRMQVLLEEHDLFDFV FT EKSLDTLISELPTTSAGTQSVTVRKSDRKCKSLITQRIADSHLEYVKEKPT FT AFEMGRALSESFERRGVASQLRLRKMLLTMRYVTTETMAAHFLKFDKLIRE FT LKSTGATLEETDIVCHLLLTMPEEYNMVVTALETLSSEQLTLGFVKTRLLD FT EEAKRGGTSASGKGANSPAVFSAATANRKVNANKHDGTTNEKQRERFGFKC FT HHCGILGHKRSECRKFKQGNKNSSSAKVAFDEDVDKSRKEFVFIAKEENAI FT ASTCWFLDSGSSEHLATKETRLINMRNITPSVTIRVAKSGQVLTASEVGDL FT SVCVRANGEVRKILISGVLSVPGLECNLLSVRKLEMNGFTVTFKDGKGIIH FT KENIIAAIAHRTEKLYKLHFEYNSEVTNICETEENFLLWHQRLGHLSSAGM FT KKLLRMADGIKLKDTDLSSEVCEVCVDGKQTCLPHQTKRIRAKRPLQLVHS FT DLCGPIDITSFDGKRYLLTFIDDYTHFTVAYTLKAKTEVLRHFKMFQSMAE FT AHFNLKISRFRCDNGREYLSNEIRQHFENCGVQYEFTIRYTPQQNGVAERM FT NRTIIEKARCMILNSKMNKTFWSEAVIAAVYLINRSPTIALKGKVPAELWF FT GEKPNLRKLRVFGCVAYLHIPKELVNGKFESRSKHCKMVGYCANGYRLWCP FT EDNKIIFGRDVVFNESKFAFESRDFYNGLIQRNPEEVTNAEETSTSGGEEE FT EEAESDETDTTRKQDVNNLRKSTRKKKQPKYLEDYTILALHAESFVENVPE FT DFDDIQGRDDKEQWLSAVNEEMKSLIDNKIWDLVPLPPGKKAIDNRWVFRI FT KRDHDGNIDKYKARLVIKGCSQRKGLDYNETYAPVARLTTVRTLLSVVNKF FT KLKTRQLDVKNAFLHGTINEVIYMKQPQGFAKNNGLACKLNRSLYGLKQAP FT RAWNSRFHEFIIQQGFERSERDSCLYTVFSKNNRIFLLLYVDDIIIAGDNE FT QWMHRIVNSLRGEFSMKDMGELKTFLGIRIEQTAEGMFLNQKTYMKNLLSR FT FGMTECNPSKTPMEVNPGKHEELEEETVIELKPFRELVGCLMYLMLTTRPD FT LSVAVNFYSRFQSNAKLAHWKGLKRILRYIKGTLNHGLFFKCDSNLEVPLQ FT IYVDANWATDNDRKSTTGYLLQVFGSTVDWSTKKQGGVTLSSTEAEYVALA FT TALTEAIWLKGLLESFGVHVDDPIKVFEDNQSVIHLLGRWDHRRLKHVDIK FT YNFVRNMYQNKEIDVQYLNTKEQIADMLTKSLERGQFVKLRSYIGVCEI" XX SQ Sequence 4071 BP; 1329 A; 727 C; 975 G; 1040 T; 0 other; ggttatgggc ccagttaatt gacacgtata gataaaggaa catagtcacg gaaatctcag 60 tttgtaattt cgagaagttt gcaaggaaga aagaaaacgt gactacaatg gcggacgacg 120 agaaatacaa agtcccatta ttcgacggaa cgaactatag caactggaag ttccggatgc 180 aggttctgct tgaggagcac gatctcttcg atttcgtcga gaagtcgctt gatacgttaa 240 tctcagagct gccgacaaca tctgcaggca ctcaaagtgt cacggtgagg aaaagtgata 300 ggaagtgtaa atcgttgatc acgcagcgca tcgctgacag ccatctcgaa tatgtgaagg 360 agaaacctac tgcatttgag atggggcggg ccctttcgga aagcttcgag cgaagaggag 420 tagcgagtca acttcgtctg aggaagatgc tgctgaccat gcggtatgtt actacggaaa 480 caatggcagc acatttctta aagttcgata aacttatacg tgaattgaag tcgaccggtg 540 ctactttaga agaaacggat atagtgtgcc atttgttgtt gacaatgccg gaagaatata 600 atatggtagt gacggctttg gaaacacttt cgagtgaaca attaactctt ggtttcgtta 660 aaacgagatt gctggatgaa gaggcgaagc gaggtggcac gagtgcgagt ggaaaaggcg 720 cgaactcgcc agcagttttt tcggctgcga cggccaatcg gaaagtaaat gcaaataagc 780 acgatggcac aactaacgaa aagcaacgcg agcgttttgg tttcaaatgt catcattgtg 840 gtatacttgg gcataagcgg tcggagtgtc gtaaatttaa acaaggaaat aaaaattcga 900 gctcggcaaa ggtcgcgttc gacgaagatg tcgacaaaag tcgaaaagaa ttcgtgttca 960 ttgctaaaga agagaacgcc attgcctcta cgtgttggtt tttggactca ggatcctcag 1020 aacatttagc aacgaaggaa acacgcttaa ttaatatgcg gaatataacg ccttctgtga 1080 ctattcgcgt cgcgaaatca ggacaagttc taactgcatc cgaagttggt gatttatccg 1140 tatgtgtgcg agcaaacgga gaagttagga aaatcttaat atccggagtt ctgtctgtgc 1200 ctggacttga gtgtaatctt ctttcggttc gcaaattgga aatgaacggg ttcacggtaa 1260 cttttaaaga cgggaaagga ataatccata aggagaatat aattgctgca attgctcatc 1320 gtacggagaa actgtataaa ctgcattttg aatataattc ggaggtcacg aatatttgcg 1380 agacagagga gaattttttg ttgtggcatc aacgattagg ccatctaagc agtgcaggaa 1440 tgaagaaact tttaaggatg gctgacggaa taaaacttaa ggacacagat ctctcgtcag 1500 aggtatgcga ggtatgcgtg gatggtaagc agacctgtct acctcatcaa actaaacgta 1560 tcagagcaaa gcgtcctctc caactggtgc atagcgattt atgcggccct atagatatta 1620 cgtcattcga tggaaaaagg tatttattaa cattcattga tgattatacg cattttacag 1680 tagcatacac attaaaagcc aaaacggaag tactaaggca tttcaaaatg tttcaatcaa 1740 tggcggaagc acatttcaat ttgaagatta gcagattcag atgcgacaat ggaagggaat 1800 atctctccaa tgagattcga caacatttcg agaactgcgg agtccaatat gagtttacca 1860 ttcgttatac gcctcagcag aatggagtgg ctgaacggat gaaccgaacc ataattgaaa 1920 aagcacggtg tatgatcttg aactccaaga tgaataaaac attctggtcg gaagcggtaa 1980 tagcagcagt ctatctaatt aatcgtagcc caacaatcgc gctgaaaggg aaagtaccag 2040 ctgaactctg gtttggagaa aaaccaaatc ttcgcaaatt aagagttttt ggatgcgttg 2100 catatcttca tattccaaag gagctggtaa acggaaaatt cgagtcccgt tctaaacatt 2160 gcaagatggt cgggtattgt gccaatggtt atcgactctg gtgtcctgaa gacaacaaga 2220 taatattcgg acgagacgtt gttttcaatg agtcaaagtt cgcatttgaa agcagggatt 2280 tttataacgg tttaattcaa aggaatccag aggaagtaac aaatgctgag gaaacctcga 2340 cttcaggcgg agaagaagag gaagaagccg agtcggacga gaccgataca acacggaagc 2400 aggacgtcaa caatctgcgg aaaagcacac ggaagaaaaa gcagcccaaa tatctagaag 2460 attatacgat tctggcgctt catgcggaat cattcgtgga aaatgtacct gaagactttg 2520 atgacatcca aggaagagat gataaggaac aatggctgag tgcagttaat gaagaaatga 2580 aatcactcat tgacaataag atttgggatt tggttccatt accgcctgga aagaaagcta 2640 ttgacaacag gtgggtgttc agaattaaaa gagaccacga tggaaatatc gataaatata 2700 aagcacggct tgtaattaaa gggtgctcgc agaggaaggg tctggattat aatgaaacct 2760 atgcgcctgt cgcacgtcta acaactgtac gaacccttct atcagttgta aataaattca 2820 agttgaagac cagacaacta gatgtcaaga atgcttttct acacggaaca atcaacgaag 2880 tgatttatat gaaacaacct caagggtttg ccaaaaacaa tggtcttgct tgtaagttaa 2940 atcgctcgtt atatggccta aaacaagcac cacgagcatg gaactccagg tttcatgaat 3000 tcattataca gcaaggtttt gaaagatcag aacgagatag ttgcctttac acagtttttt 3060 caaagaataa ccgcattttt ctcttactat atgtggacga cattatcata gcaggagata 3120 atgaacagtg gatgcacagg atcgtaaact cacttagggg tgaattctct atgaaggata 3180 tgggtgaatt aaaaaccttt cttgggataa ggatcgagca gacggcggaa ggcatgtttt 3240 tgaaccaaaa gacatatatg aagaatctcc tttctcgatt cggtatgaca gagtgtaatc 3300 cttcgaagac accgatggaa gtaaatcccg gaaaacatga agaactcgag gaagaaacag 3360 tgatcgagtt gaaacctttt agagagctcg tgggttgcct gatgtatctc atgcttacta 3420 cacgacctga tctgagtgtt gccgtaaatt tctacagccg tttccagagt aacgctaaac 3480 ttgcacactg gaagggattg aagcgaatcc ttagatacat taaaggcact ctaaatcatg 3540 gactattttt taaatgtgac tcaaatctgg aagtgcctct ccagatatat gttgatgcaa 3600 actgggccac ggataatgac cggaagtcaa caactggata tttacttcaa gttttcggat 3660 ctacagtgga ctggtcaaca aagaaacagg gtggagtaac actatcctct acagaagcag 3720 agtatgttgc cttggcaaca gcattaactg aagcaatttg gttaaaagga cttttggaaa 3780 gctttggtgt gcacgtggat gatcctatca aggtgtttga ggacaatcaa tctgttattc 3840 atttgttagg tagatgggat catcgtaggc ttaaacatgt agatataaag tacaattttg 3900 ttcgtaatat gtaccaaaac aaggagatag atgtgcaata cctgaacact aaagaacaaa 3960 tagcggacat gctgacaaag agccttgaaa gaggacagtt cgttaaactt cgttcttata 4020 ttggagtttg tgaaatctaa atacattttg tttgcataca ttgaggcgga g 4071 // ID CR1-127_AAe repbase; DNA; INV; 4561 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-127_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4561 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1215-1215 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 721..1575 FT /product="CR1-127_AAe_1p" FT /translation="MEICNSCAVSMTSSEVVCSGFCKANFHYKCVHLSESL FT YKDICGNASVFWLCKGCCEIMKSARFKNAMTSTHAASLELRDAYQKVVEDL FT KAEIKNSLLAELKQEIQGGFNKLSPAILSPVPRRFQFRDRNTPKRTRDEES FT AQSSEQPSKIFCGTGQSAGSTSEGLAANADDKFWVYLTKISPEVSESDVQC FT LAQDRLQTADVVVKSLVPRGKPLSMLSFVSFKVGVHKDLKSKAMDPATWPE FT GIQFREFIDHDSNVRNFWRPALRLDPGATSSIVMQPQIALQMST" FT CDS 1482..4490 FT /product="CR1-127_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="CSEFLETSAALRSGSNFFDSNAATNRSSDVYVEEHHS FT ALRSRPSCSANSSPNYLTLYYQNVRGIRTKTNELFLRLSSCDYDIVVLTET FT WLHSNISNSELATGYCIFRCDRSSASSDLQRGGGVLIAVKASLNCKSVALQ FT NCENLEQAVVCIELQKSSIYVCCIYLRPNSQVTLYSAHFSAIQCICEGMSE FT ADTIVVVGDYNLPRLSWEIDDDINALLPSNASSEQEVVLVETMVASGLHQI FT NSLQNTNGRILDLAFVNEPGDVELIKPPAPLLGIDNHHMPFLLRINAEDYQ FT LQYVMDSNQTYDYDFRHCNFADLDTAISSIDWTTLFHGKTTDETVRMFYDT FT LNGILSENVPRRRRRCQPFKHPWWTSELQNLRNVLRKSRRRYFRTRTVVDY FT DNLRSIENRYNECQTEAFRAYISRLETTAKQDPSSFWNFIRNRKSSNRIPA FT EMTHNNCTANSPETLADMFADCFENVYRNSSPPFIPDNLRQSPVFDINFPL FT LEFNQHDVSSALNELDISKGPGTDNLPPVFLKECSASLQLPLCIIFNRSLR FT DGTFPQPWKVASISPIFKSGSNQIAENYRGVSILCCVAKVFESLVHNALYA FT AAQPLISDSQHGFVKKRSTTTNLMLFTDFVSTALENRQQVDAIYFDFSKAF FT DRVPHDLAVAKLNHLGFPRWITDWLHSYLTNREAFVKVNGAKSRNFAITSG FT VPQGSVLGPLIFVLFVNDLCFRLKSPKVLFADDLKIYRTISSLLDCCAIQS FT DINEIQLWCVENGMELNTKKCKQISFTRRQSRIDFEYLVGPDILERVESIR FT DLGVIIDYKLQFNEHISIATAKGFAALGFIRRSTKHFQDIYALKSLYCALV FT RSVLEYAVCVWSPHHTTQIIRMERVQRSFVRYALRQLPWANPTDLPDYRSR FT CNLIALETLSARRTKLQRLLVFDLIVGNVNCPSLLENVSLYAPSRQLRERN FT LLFVRRHRTSYGFHNPLDKCFRAFNNVSVVFDFNISKFVFKNRIRNLE" XX SQ Sequence 4561 BP; 1227 A; 1101 C; 967 G; 1264 T; 2 other; ctggcaaccc tgcctgttat tagtcgtgct cttccgtcgc tccaacaatt tttgcgtcaa 60 tcwcatcgaa accgccaccc gaaatcgctt gaagccgcat caagtcgccc gcaaccgctc 120 aaacmtcctc ccacagcagt ttaaagcagc gttcagtcag gtaggattag tgaatccaag 180 ttattccgag ctggctagag cgtggtttga gaaaaactgc ccaccgccat tgcccaaata 240 aattatctcc cacctcgcca ctctccacct gttctgaccc gctactctgt gtacaaacat 300 cgctcgtcac aacatcaact gccatcgttt gccgagcata atacctcctc cgaaaaacgt 360 catatttgca ttaagcaccc tgctgtaacc aagatcaagc caatacacta ctcctgatct 420 tgcccatctt ctcactcaac aatccgaata gaccaccgga gtcccaccag aaataatccg 480 tggtagttca tcggttccat tgatcggatc tcatcgtagc cattcattca acgatcgacg 540 aaggtaaaca aacattgtgc cgtcaaccat tccaagaacc gacatttgtg tgagtgcagt 600 cattttcgca tctgtccctg gcagtgttgc cagtttagcg gctaacatcg ataacgctgt 660 aacgcttgag tgacgcttga aattcgatta cgttttttgt ttaaatcgac ggttgacatt 720 atggaaattt gcaacagctg cgcagtaagc atgacctcgt ctgaagtggt gtgcagtggc 780 ttctgtaaag caaatttcca ttacaaatgc gtccacttgt cggagtcgtt atataaggat 840 atttgcggaa atgcttctgt cttctggctt tgcaagggat gttgcgaaat tatgaaaagt 900 gctcgtttca agaatgctat gacatcaacg catgcagctt ctctagaact tcgagatgca 960 taccagaagg tggttgaaga tttgaaggcg gagattaaaa atagcctttt ggccgagttg 1020 aaacaagaga tccaaggagg attcaacaaa ttgtctcctg ctattctttc accagttcca 1080 cgtcgtttcc agttcaggga tcgtaacaca cccaagcgaa ctcgtgatga agaatccgca 1140 caatcttcgg agcaaccatc gaaaatattc tgcggcaccg gccaatcagc tgggagtaca 1200 tcagaaggac tggcagcgaa tgctgacgat aagttctggg tgtatttgac aaaaatatca 1260 cccgaagtgt ctgaaagtga tgttcagtgt cttgctcagg atcgactaca gactgccgac 1320 gttgttgtga aatcgttggt acctagagga aaaccgcttt cgatgttgtc gtttgtatcc 1380 ttcaaggttg gtgtacacaa ggatctcaaa tcaaaggcta tggatcctgc tacctggcct 1440 gaagggatac aattccgtga gtttatcgac cacgatagta atgttcggaa tttttggaga 1500 ccagcgctgc gcttagatcc gggagcaact tcttcgatag taatgcagcc acaaatcgct 1560 cttcagatgt ctacgtagaa gaacatcatt ccgcactccg aagtcgtcca tcttgctcgg 1620 caaactcatc tccgaattat ctgacgttgt attatcagaa cgtcagaggc atacgtacta 1680 aaacaaacga gttgttcttg aggctttcgt catgcgacta tgatatcgtg gtcctcacag 1740 aaacatggct gcactcaaat atatccaatt cggaactcgc tacaggctat tgcatatttc 1800 gttgtgatcg gagctcggcc agcagtgacc ttcaacgtgg tggcggagtt ttaattgcgg 1860 ttaaagcctc tctcaactgc aagtctgtgg cattgcaaaa ctgcgagaac ctcgaacagg 1920 ctgttgtatg cattgaactc cagaagtcgt cgatctatgt atgttgcatt tatctacgtc 1980 ccaactcgca agtcactctc tactctgcac acttctcagc gatccagtgc atatgtgaag 2040 gcatgtcgga agcagatacg atcgtcgtcg tgggcgatta taatcttcct cgtctaagct 2100 gggaaatcga tgacgatatt aacgcattgc tgccctctaa cgcatcatcc gaacaagaag 2160 tcgtcctagt tgaaacgatg gtagcttcgg gattgcacca aatcaatagc ttgcaaaaca 2220 cgaacggccg aatactcgat ttggcattcg tcaacgaacc aggtgatgta gagttaatca 2280 aaccgccggc acctctcctc gggattgaca accaccatat gccgttcctt cttcgaatca 2340 acgcagaaga ttatcaattg cagtacgtta tggattcgaa tcaaacatat gactacgatt 2400 ttcgtcactg caacttcgcg gatctcgaca ccgccatttc ctccatcgat tggactacgc 2460 tgtttcacgg taagactaca gatgagacgg tacgtatgtt ctatgatact ttgaatggga 2520 ttctcagcga aaatgtaccg cgaagacgtc gtagatgcca accatttaag cacccttggt 2580 ggacttccga gttacagaac ttacggaatg tcctccggaa gtctcgtagg cgttattttc 2640 ggacaagaac ggttgttgat tatgacaatc tccgttcaat agaaaatcgg tataacgaat 2700 gccaaacgga agcatttcgg gcctatattt cacgactgga gacgactgct aaacaagacc 2760 cctcttcgtt ctggaacttt attcgtaacc gcaaaagctc aaatcggata ccagctgaga 2820 tgacacacaa taattgtacc gctaactctc ctgaaacatt agctgacatg tttgcagatt 2880 gcttcgaaaa cgtttaccgt aacagctcgc caccgtttat tcccgataac ctcaggcagt 2940 ctcctgtgtt tgatataaac ttcccgctgt tagaatttaa tcaacacgat gtttcatcag 3000 ccttgaatga acttgatatt tccaaaggac caggaaccga caatcttcct ccagtgttcc 3060 taaaagaatg ctccgcatct ctgcaattgc ctctttgcat tattttcaac agatcgctcc 3120 gtgacggtac ttttccgcag ccgtggaaag tagcttcaat aagtccgatt ttcaaatctg 3180 gttccaatca gattgccgag aactaccgcg gcgtttccat tctgtgctgc gtggctaaag 3240 tcttcgaaag cttagttcat aacgcgttgt atgcagctgc tcaaccgtta atttcagatt 3300 cgcaacacgg gtttgttaaa aagcgatcaa ccactacgaa tctgatgttg ttcactgatt 3360 tcgtatcgac ggccctcgag aacagacaac aagttgatgc catctatttc gacttctcca 3420 aagctttcga cagagttcca catgatctcg ctgttgcgaa gctgaaccat ctgggatttc 3480 cgcgctggat tactgattgg ctccattcat atctgacaaa ccgtgaggcg ttcgtcaaag 3540 taaacggagc aaaatctcga aatttcgcta ttacatctgg tgtgcctcaa ggtagcgtcc 3600 tcgggccact gattttcgtg ctgttcgtga atgatctctg cttccggctt aaatcaccca 3660 aagtgctgtt tgccgacgat ctgaaaattt acagaactat ttcgtccctt ctcgactgct 3720 gtgctattca atccgatatt aacgagattc aactgtggtg cgttgaaaat ggaatggagc 3780 tgaacactaa gaagtgcaaa caaatttctt tcacccgacg ccagtcccga atcgactttg 3840 aatatttggt gggcccagat atcttggagc gcgttgaatc catccgcgac cttggtgtca 3900 taatcgatta taaactgcaa ttcaacgaac atatcagcat cgccactgcc aaaggattcg 3960 ctgccctcgg atttattcga cgtagtacga aacattttca ggatatctac gctttgaagt 4020 cgttgtactg tgctctagtt cgaagcgttc tggaatatgc cgtctgtgtg tggtctccac 4080 atcatactac gcagatcatc cgaatggaaa gagttcaacg tagcttcgtt cggtacgctc 4140 ttcgccagtt gccttgggca aatccaactg atctgccgga ttacagaagc cggtgtaacc 4200 tcatcgcttt ggaaacgctc tctgccagac gtacaaaact acagagactg cttgtgttcg 4260 atctgattgt aggaaatgtc aattgtccgt cactgttgga aaatgtgtcc ctgtatgccc 4320 cgtctcgtca actacgtgaa cgtaacctgc tttttgttag acgccatagg acttcatacg 4380 gttttcataa ccctctggat aaatgttttc gtgcgttcaa taatgtgagt gttgtgttcg 4440 acttcaatat ttcaaagttt gtttttaaaa atagaataag gaacttagag taagatacag 4500 tctggggaat ttatttttaa ttcaagacgg tgataaataa ataaataaat aaataaataa 4560 a 4561 // ID BEL-7_CQ-LTR repbase; DNA; INV; 205 BP. XX AC AAWU01032418; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_CQ_; KW BEL-7_CQ-I; BEL-7_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 168-168 (2011). XX DR Genome; AAWU01032418; Positions 1909 2113. XX SQ Sequence 205 BP; 55 A; 39 C; 34 G; 77 T; 0 other; tgttagtcgt aaaataaggg ttttcagttt agcgtgtaca acttttctat ttttttgttc 60 ccacctttaa ctgtcatctt gaaataaacc taccttaagt ttttactttt cgctcgcaca 120 gaaagcacgc gcgttttttc tctctgtcga aaatttgaat tgaaaaatac agtccgcttg 180 aaattcgcga agtttttgga gaaca 205 // ID Gypsy-232_AA-I repbase; DNA; INV; 4806 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-232_AA_; KW Gypsy-232_AA-LTR; Gypsy-232_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4806 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1067-1067 (2011). XX DR [2] (Consensus) XX CC Positions [3678-4166] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2598..3401,3405..4511) FT /product="Gypsy-232_AA-I_1p" FT /translation="MSPRLLQYPDFTQKFILTTDASDIGCGAVLSQISEAG FT ERPIAFASKTFIPAERNKPTILKELIAIHWAINYFEAYLYGRRFTVRTDHR FT PLVYLFGIKNPTSKLTRIRIDLEGYDYDVVYIKGKENVAADALSRIATTSD FT ELKISNVMIVNTRSMTRKAQKLVKNNLKDISIENEKKETDHLSTYETERPT FT ETRKMMKLCTAVVGNTLKFMILKKNCKTIVAQVHQDVRNGSHAFECALPEI FT EKITKRLGAKNLALPANDFIFTIVPLNLKTIANKILKTLKIIIYKEPIFIN FT NPEEIQTLLSNHHNTPVGGHVGQFRLYLRLREKYRWRDMKRSIAQFVQACE FT LCKRNKIIKHTKEPMVITTTPSKAFEVISIDTVGPLPKSNKNNRYCITIQC FT DLTKYISIIPIPNKEANTIARALVENFILTFGNFLELRSDQGTEYNNEVLE FT QISKLLQIKQTFSTPYHPQSIGSLERNHRCLNEYLRSFTNEHQSDWDDWIK FT YYEFSYNTTPHTDHNFTPFELIFGRKASLPQDVLENSNEPIYNFESFSNEL FT KFKLEKSHEIVKQKLLEHKYRRKKEFDNNINPIKVAINDMIFLKNENRRKL FT DSFYLGPYKIIKISEPNCEIEHIISGKSVNVHKNRIIKA" XX SQ Sequence 4806 BP; 1816 A; 898 C; 838 G; 1251 T; 3 other; cggacgggct atgaaatggc gaccgcgagt taatccgaac cagtaatcgt cggaaaaagt 60 caagtgagaa taattgtgaa taatccagac ccgtgcaagt tagtgaaaca ccacgaagaa 120 tgggctccga ggaatcaaaa gcccgagtga atcataacgg tgatagtcac atcgaaattg 180 tgaacaacca gttagagcac acgacaacac tgaactccat aacaacctgg tagtggttca 240 tatcggctgg tgtaacaatt caacttgttt taagtttgtg gggagaaatc caaaggcgaa 300 tgagcaagtc aaaaagggca actaagcaag ctcagcgagc tcacgatgag ggagcaacag 360 cccaacaaca cagcaacagc atcagcagca acagaacaac gagagcggtg agaagaggaa 420 aatgcctaag tcacaaactg gggacatccc tcgtcaagct tacgatgggc tccggactca 480 actcttcctc gatgcgttgc gttattctcg tcgatggaat ttgcagcggc agaagacagc 540 ggcgacgaaa cagcactggc aaaaatgatt gctgtaaaaa taaggggcac tgtatacttc 600 tgacaaaatc accaaccaag atttcaaaat tccgaggcaa ttatagccaa ggacacagtt 660 atggccaagg acaaaatttt agaggtcaaa acttcagagg acaaaatttc agaggccaat 720 ggaatagagg aagaggacga tacgaaacac gctctaccta tttcatgcag cctataccgc 780 aggcaataca acccgttttc aacaattgcc gccgtagcta ccaacaaaca accagttacc 840 agcagccgca accaacagca acaacagccg gcacaaaaca acactttttt aggcacacag 900 tttggtcgac atttacagga atgcatcgac ctccaattac gttattttaa aaatagattt 960 atcggaaacc cagtgtactt ttattgttga cacaggtctg atatatctat cattaaaagt 1020 cgtaaagtaa aatcgtctca aatttactac cctgaagaaa aatgcgtcat atcaggtaca 1080 ggagatgaag gaattcactc tttaggaagt actttcgcta acatattagt tgagggtata 1140 cctattcagc agaaattcca aatagtttca aatgatttcc ctattccaac ggacggaatc 1200 attggaaaag atttcttggc aagatataag tgtaaaattg actacgaacc gtggatgttg 1260 tcatttactg tagacaatca gcttattttg atacctattg aagataattt ccaaaacaat 1320 ttctttttac ctccacgatg cgaagttacc cgttacatac caaatcaaaa tttacaagag 1380 gatatggtgg tacactcaca ggaaattcaa ccaggaattt tttgtggaaa tacgatcata 1440 tcagctaaag aaccaatatt aaaattcatt aatacaccga taaagcagtt tatattactt 1500 acacagattt taaccctata tggaaccact tagaactttc gagtagtacc acacaaaaaa 1560 cactcaacaa ataaaaccag acgaaataaa attttgaaaa aattgatact cagcacctta 1620 ctccacagct aagcgagaac tgaaaaattg atcattcgat atgaagatat ttttaacctg 1680 gaagatgaga aactgtcaac caataatttt tatacgcaga acatcagtct taacgataat 1740 gttcctgtgt atatacccaa ctataaaact attcatgccc aaggtgaaga aatcgaaagg 1800 caagttcaga aaatgttaaa agacgaaatt atagaaccct cagtttcgtc atataattcc 1860 ccgattttgc tggtcccgaa aaagtcggat aattcagaaa acaaatggcg tctagttgtt 1920 gattttaggc aactaaacaa gaaaattatg ccagacaaat ttccgctacc aagaattgaa 1980 agtatactag accaacttgg tagagcaaaa tattttagca cactggatct gatgtcaggt 2040 tttcatcaga tccccttgga cagtaattcc aggaagtata ccgcattttc taccaactct 2100 gggcactatc agttcaaacg attacctttt ggattaaata ttagtccaaa tagttttcaa 2160 aggatgatgg ctattgctat ggcaggttta accctgaacg tgcatttatc tatatagatg 2220 atattgttat tatcggatgt tcgttgaaac atcatttgtc aaatttagaa tcagtatttg 2280 aacgaatgag gaaatataat ttaaaattaa attatccaaa tgtaaatttt taaaaaccga 2340 agtgactatc ttggtcataa aataaccgat cagggaatat tgcccgacga ctccaaattt 2400 aaagccatta gggactatcc aattcccaaa aatgtagacg atgtgcgaag attcgttgct 2460 ttttgcaatt attatcgaga tttgtcgaaa actttgctga catagcttac ccattaaacc 2520 aattactgaa gaaaaacgtt acttttacat ggactgataa atgtcaaaat gcattcgatc 2580 tgttgagaca acagctgatg tctccaagat tattacaata ccccgatttt acccaaaaat 2640 ttatactaac gacagacgcg tcagacatag gatgcggagc tgtcttatct caaatttctg 2700 aagcaggaga acgacctatt gcgttcgcaa gtaagacgtt tattccagca gaaagaaata 2760 aaccgaccat actaaaagaa ctaatcgcca tacattgggc cataaattat ttcgaagcct 2820 atttgtatgg aagacgattc acagtacgaa cagatcacag acctcttgtg tatctattcg 2880 gaatcaagaa tccaacctcg aaattgacta gaattcgtat cgatcttgaa ggatacgact 2940 acgatgttgt atatattaaa ggtaaggaaa atgtagcagc ggacgcatta tcgcgaatag 3000 ctactacatc tgacgaatta aaaatttcta atgttatgat agttaatact aggtctatga 3060 cccgtaaggc acaaaaatta gttaaaaata atttaaaaga tattagcatt gaaaatgaaa 3120 aaaaggagac tgatcacctc tctacatatg aaactgaacg ccctacggaa acaagaaaaa 3180 tgatgaagct gtgcacagcc gtagtgggca atacattgaa atttatgata ttgaagaaaa 3240 attgcaaaac gattgttgca caagtgcatc aagacgttag aaatggaagt cacgcattcg 3300 agtgtgctct tccagaaatc gaaaaaatta caaaaagact aggcgcgaaa aatttagcat 3360 tgccggcaaa cgattttatt tttaccatag taccattaaa tktattaaaa actatagcta 3420 ataaaatatt aaaaacttta aaaattataa tttataaaga accaatcttt ataaataacc 3480 cggaagaaat tcaaacatta ctaagcaatc atcataatac accagttgga ggacatgtag 3540 gtcaattcag actgtaccta agactccgag aaaaatatag atggcgcgat atgaaaaggt 3600 ctattgcmca attcgtccag gcctgcgaat tgtgcaaaag aaataagatt atcaagcata 3660 cgaaagaacc aatggtcata accacaacac catcaaaagc tttcgaagta atatcaatag 3720 acacagttgg tcctttaccc aaatctaata aaaataaccg gtactgcatt acgattcaat 3780 gtgaccttac aaaatatata tccataattc cgatccctaa taaagaagca aacaccatag 3840 ctagagcatt agtcgaaaat tttattctga cttttggcaa ttttctcgaa ctacgatctg 3900 atcaaggtac tgaatataac aatgaagtac ttgaacagat tagtaagtta ctccaaatta 3960 agcaaacctt ttcaacacca tatcaccctc aatctattgg ttcattggag agaaatcaca 4020 gatgccttaa tgagtatctg cgatcgttca ccaatgagca tcagtctgat tgggatgatt 4080 ggatcaaata ttatgaattc tcatacaaca caaccccgca cacagatcat aattttaccc 4140 catttgagtt gatatttggc agaaaggctt ctctcccgca agacgtatta gaaaatagta 4200 atgaacccat ttataacttt gaatcattca gtaatgaatt aaagtttaaa cttgaaaagt 4260 cacatgaaat agtaaaacaa aaattattgg agcataaata ccgaagaaaa aaagagtttg 4320 ataataacat aaatcccatt aaagttgcaa ttaacgacat gattttctta aaaaatgaaa 4380 atagaagaaa actagattcc ttttatcttg gaccttataa aataataaaa atcagtgaac 4440 caaattgcga aatagaacac ataatttcag gaaaatcagt aaatgtacac aaaaatagaa 4500 taattaaggc gtaaataact ttaactagag tcccaccgag cctaacctct cacaaaaaaa 4560 aaaaaaaawa tatgaaactt aaaaaaataa aataacgggg taggtgaaag tttccagcat 4620 atatgctgag aagtgccaaa actctattag gtagaaataa agttatttag ttaagacaaa 4680 gtgttagaaa cttaaggaaa accaacatat tgtaaagctc aaacaaagac atctgggaag 4740 gcaaaacgca agaatgattt cattacactt tctatttcat tacatcattc tttaaaaggg 4800 gggagg 4806 // ID Crack-14_BF repbase; DNA; INV; 2510 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-14_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-14_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2510 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2510 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 819-819 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 1..2313 FT /product="Crack-14_BF_2p" FT /translation="TATCIDLAFTNCKERCTTTKSTPLGFTDHNIIYLTRK FT TKVPKTQPHVVYKRTYSKFNPDDFQQDMSMVPWHIVYDESDVNEALHLFTT FT LFCNVADQHAPAKRRTVRSNPAGWVDDELREVMRLRDDAKKEALASGLVSD FT ICIYKKLRNAAVKLNRKKKAAYFKSKIAESRNDPKEMWRTLNGMMGRGHRR FT SACTVERDGEILTKAGDVADYFNSFFTQKVNSLRAGLDSQTNGDLLQLIRD FT KVMVGKDCSFDFQLTHWEQIHKHLLSLSDGKATGLDNVDNKLLRLAADQIA FT RPLCYIINLSLITSEYPTAWKRAKVVPLPKSATTRLSGPNSRPISLLPAFS FT KVMEKVVSSQISDYFKVNDLMSPHQHAYRKHHSTCTALLQLVDDWYRKIDQ FT GNIVGVIFLDFSAAFDLVDHSFLLQKLACYGFSDTAVHWMDSYLAGREQCV FT HFNGTNSPFREVNCGVPQGSCLGPLLFTIFTNDLPFAVRETIPEMYADDTS FT AYTCSPSVETVTVNLQEDLNNICNWVRLNRLFLNANKTMCMLLGSRPKMSK FT KPKLTLFADGCPIQQVTTMKLLGSTLDECLTWDLHVEETIKKMARSLGAIR FT RSVAYLDQSLVKMLTETLTLQHLDYCAAVWASTTQKNITALQVMQNKAGRL FT TLGTNRTPIKHMHNVLSWLTVSERLYVSSMCTLHKVLLAREPSLLYSRLHL FT VSGQHSYHTRHAARQNLRTEKPKTNSLKKSFTYRAASDWNKLSHELQTANT FT SRIFRRQLVKHMRDLAKDT*" XX SQ Sequence 2510 BP; 781 A; 538 C; 577 G; 614 T; 0 other; acagctacgt gtatcgatct ggcttttaca aactgcaagg aaaggtgcac aacaacaaag 60 tcaacgcctc tgggttttac ggatcacaac atcatctact tgacaaggaa gactaaggtt 120 ccaaagactc agccacatgt tgtctataaa agaacctaca gtaagttcaa cccggacgac 180 ttccagcaag acatgagcat ggtaccttgg cacattgtat acgatgagtc agatgtaaat 240 gaagcactgc acctgtttac aacattgttt tgcaatgtag ccgatcaaca tgctcctgct 300 aaaagacgga ctgtgaggtc gaacccggca ggttgggtgg acgatgaact tcgagaagta 360 atgaggctca gagatgacgc taagaaggaa gctcttgcgt caggcctcgt ttcagacatc 420 tgcatataca agaaactaag gaatgcagct gtcaaactaa atcggaaaaa gaaagccgcg 480 tacttcaaat ctaagatagc tgagtccaga aatgacccaa aggaaatgtg gagaacactg 540 aatggaatga tgggtagagg gcatcgtcgg tcagcctgta ctgtagagcg ggacggagag 600 atcctcacta aggctggtga cgtagcggac tacttcaact cattctttac gcagaaggtg 660 aactctctga gagcagggtt ggactcgcaa acaaacgggg accttctcca gctgataagg 720 gataaggtca tggtgggaaa ggactgtagt tttgactttc aactgacaca ttgggaacag 780 atccacaaac accttctgtc cttatctgat gggaaagcta ctggacttga caatgtcgac 840 aacaagctgt taaggttggc tgccgatcaa attgccagac cactttgcta catcataaac 900 ttatcgctca ttacctctga gtatccgacg gcatggaaaa gagccaaggt tgttcctctg 960 cccaagtcgg ctacgactcg cttaagcggg ccgaacagca gaccaataag tctcttacct 1020 gctttcagta aagtgatgga gaaagttgtg agttctcaaa tatcggacta tttcaaagtg 1080 aatgacctga tgtcacccca ccaacacgcc tacagaaaac atcactcgac atgcactgca 1140 ctcttgcaac tagtcgatga ctggtatcgt aagatcgacc agggtaacat cgtcggggtc 1200 atattcttgg atttttccgc tgcctttgac ttggtggatc acagctttct gttacaaaaa 1260 ctggcgtgct acggtttctc agacacagca gtgcattgga tggacagtta cctagcagga 1320 agagaacagt gcgtacactt caacggcacg aactcgccgt ttagggaggt taattgtggc 1380 gtaccccaag gtagttgcct tggccccttg ctgttcacaa tatttacaaa tgacctacct 1440 tttgccgtga gggaaacgat ccccgaaatg tatgcagacg acacatctgc atacacgtgc 1500 tcaccatcag tggagacagt aacggtcaat ttacaggaag atttgaataa tatatgtaac 1560 tgggtgaggc taaacagact gttcttgaat gcaaataaga caatgtgtat gttgcttggg 1620 agtagaccca aaatgtcaaa gaaaccgaaa ctcaccttgt ttgctgacgg atgtcctata 1680 caacaagtaa caacaatgaa gctacttggg agcaccttgg atgagtgtct cacgtgggac 1740 cttcatgtag aagaaaccat aaagaaaatg gctaggtccc tgggcgccat cagaaggagt 1800 gttgcatact tggatcagtc cttagtgaaa atgttaacag aaacacttac cttgcaacat 1860 cttgactact gtgcagcggt ttgggcttct actacacaaa agaacatcac cgcactacaa 1920 gtgatgcaga ataaagcagg cagacttacc ttgggaacta accgcacccc gatcaagcat 1980 atgcacaatg tcttatcatg gttgactgta agcgaaagac tctatgtaag cagcatgtgt 2040 acattacaca aggtactcct tgccagggag ccgtctttat tgtacagtag actacatttg 2100 gtatcgggcc agcatagtta tcatacaaga cacgcggcac ggcaaaactt aagaactgaa 2160 aaaccaaaaa caaacagcct gaaaaaatcc ttcacatata gagcagcatc agattggaac 2220 aaactatcac atgagctgca aactgcgaac acatccagga ttttcagacg gcagctggtg 2280 aaacatatgc gagacttagc taaagacaca tgaggggctt gatgatttat gattattgga 2340 taaggcacat gcgttttgtt taatgatttt ttattgacat gttgtatgtg tgtatgttgc 2400 aaactgtatg tgtccaagta ttatgtgtat tactcctgga agattagttg ttgcattgat 2460 atgttaacaa ctaaaggagt taatgaacaa taaacaataa acaataaaca 2510 // ID CCRP1 repbase; DNA; INV; 187 BP. XX AC X04359; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Grasshopper repetitive sequence. XX KW CCRP1; Repetitive sequence. XX OS Caledia captiva OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; OC Acridoidea; Acrididae; Gomphocerinae; Caledia. XX RN [1] RP 1-187 RA Arnold L.M.; RT "The heterochromatin of grasshoppers from the Caledia captiva RT species complex."; RL Chromosoma 94, 183-188 (1986). XX DR GenBank; X04359; Positions 1 187. XX SQ Sequence 187 BP; 45 A; 42 C; 44 G; 56 T; 0 other; tcctttctcc taatcggctc ggtaatctgc cttcttaatt tgttacgtag catttcaacg 60 agggtgctat acctatacaa agtttcataa ggcgcactct gacaggactg tagttacacg 120 cccttgaata tagcagggaa ggtccgtgcc acgtgacgct ggaaatttta tcggttttcg 180 gagagga 187 // ID Gypsy-2_DPer-I repbase; DNA; INV; 4308 BP. XX AC super_2; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_DPer_; KW Gypsy-2_DPer-LTR; Gypsy-2_DPer-I. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-4308 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_2; Positions 5384944 5380637. XX CC Positions [3179-3664] - Integrase core CC 'GTGAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 68..4099 FT /product="Gypsy-2_DPer-I_1p" FT /translation="MNSNMTKNEAPTFYQMATIAEFSPHQETFCIWKEKFD FT IHLCELNVKEENTKKAVLLKSIGTAAYTVLHSLCDPVSPVSKAYKELCEIL FT KTHYTPPTLIFRERKQFHMSTKSEDETVAEWYARVKQLALECKFGSNLEAF FT VLNQFVMSLPNPIYERICEEDENLTLADALKKAMIMETKISTRKTEHNVNY FT MHQKNGTSQWQRQRGENRDQAKSNGRSHFKGNRNVSYRGGRSDGDGDGETD FT CDAGREKKQQPCTHCGWRNHNSNNCKYKACKCHSCGKIGHLASICKNKSKK FT SVNYVSQTKHSDNNFDNDNYNFSIYSVDTTSTSGVYYLPVKIDGNITKIAC FT DTGAPCTLVPKTFYESLNTSRPLRKCLVPYVDYNGDSIKVIGEYDATIEYR FT GLKKQIVVVVTNAVSPPLLGRTFLRAFNFELMQVNNVYVNEPNSVITEQIK FT SEFAEVFEPRLGSYTTNTISLLLKEDAKPIFFKPRSVPLAWKEKIEVQLRK FT FIDDGILEQVVSSEWATPLVPILKPNGDIRICGDYKVTLNRSLVDVKYPLP FT RIDDIFAALEGGTLYSKLDLSNAYNQLNLDEQSQNLCTWSTHIGLLKVKRL FT PFGIKTAAAIFQKTMEGLFQGMRGVVVYQDDITVTGRDLQEHISNLKAVLR FT KLKSVGLKLNDKKCEFFKSKICYLGFSIDRIGLSKNNDRVASVLFAPAPEN FT VSQLRAFIGMVNYYSKFINNFAQIMSPLYALLKKNTKFNWTRECQSAYEEV FT KKEVTSEQVLVHYNPDQPLVLTTDASSHAVSGVLSHKCNEDIKPIAFVSRA FT LSKSEHNYSTIEKEALAIVFCVSKLRQYLLGNKFILRTDHKPLLSIFEDLK FT GLPLMASARMQRWALTLSGFQYTVEHIKGTLNIADGLSRMPQWETAVPNED FT YNYIHFIQSEKVFKISFKEIARETRRDPILSKVCEAINTGSKTRLNGSEFS FT AYISKTNELSVEYDCILWGHRVVIPTKLRQAILAEFHASHLGIVKTKMLAR FT SYVWWPCIDSEIEKLIRDCIPCQELQSSPEKSLLIPWKSTGKAWSRVHIDF FT AGPIRGFHLLVIIDSYTKWAEVFKTKEITSLFTINKLREVFCRYGLPDVLV FT SDNGRQFTSDDFKTFMKNNGVNHIFTAPGHPATNGQAENFVKTVKKSLYAN FT IKANERQDFNTILNRFLIDYRNTIHCTTGESPAKLFFGRTLRTRFSLLKPP FT TVESIINKKQTECVVNHKGRRSTEFFKGQKVMVRDYKNPNKASWTQATVKQ FT QLGPRSYCCILANNNREIKRHLDQIRNQSTNNHTSPNAKTPDVESNCSVDD FT NSKTNNEQIVSQPTPTRRELRPREAGKVVKRI" XX SQ Sequence 4308 BP; 1609 A; 717 C; 843 G; 1139 T; 0 other; taattggcga cgaggataaa ataaaacgta ttaatataca tacacatgtt aaaagtgcat 60 ataatatatg aatagcaaca tgacgaaaaa tgaagcacca acattctatc aaatggcaac 120 aattgctgag ttttcgccac accaagaaac gttctgcatt tggaaggaaa aattcgacat 180 acatttgtgc gaattgaatg tgaaagagga aaacacaaaa aaggcagttt tgttgaagtc 240 gatcggtaca gcggcttaca ctgtactaca tagtttgtgc gatccggtat cacccgtatc 300 aaaagcatac aaagaattat gtgaaatttt aaaaacacac tacacaccac ctacactcat 360 attcagagaa agaaaacaat ttcacatgtc cacaaaaagt gaagatgaga ctgtagcgga 420 gtggtatgct cgagttaaac aattggcatt ggaatgcaaa tttggctcaa atctggaagc 480 gtttgtatta aaccaatttg tgatgagcct tcccaacccc atatatgaaa gaatatgtga 540 agaagacgaa aatctaactc tggctgatgc actcaagaag gcgatgatca tggagacgaa 600 gatcagcaca aggaagacag aacacaacgt caactacatg catcaaaaga acgggactag 660 tcaatggcag aggcaaagag gcgagaacag agatcaagca aagagcaacg gcagaagtca 720 tttcaagggc aaccgaaacg tgagttacag aggcggcaga agcgacggtg acggcgacgg 780 cgaaaccgac tgcgacgctg gcagagaaaa gaagcagcag ccatgcacac actgtggctg 840 gcgaaatcat aactcaaaca attgcaagta taaggcatgc aaatgtcaca gctgtggaaa 900 aataggtcac ttggcgagca tttgtaaaaa caaatcaaaa aaatccgtaa attatgtatc 960 tcaaacaaaa cattctgata ataattttga taatgataat tataactttt ctatctatag 1020 cgttgataca acctcaacta gtggagtgta ttatttacct gtaaaaattg atggaaacat 1080 aacgaaaatt gcttgcgaca ctggtgcacc gtgcacattg gttccaaaaa cattttacga 1140 aagcttaaat accagcagac cacttagaaa atgtctagtt ccatatgtag attacaatgg 1200 agattcaatc aaagttattg gtgagtacga tgcaacgatc gaatatcgag gtctaaaaaa 1260 acaaattgtt gttgttgtaa ccaatgcagt gagtccacct ttgctaggta gaacattttt 1320 gagagctttt aattttgagt taatgcaggt gaacaatgtg tacgtaaatg aaccaaattc 1380 tgtaattaca gaacagatta aatcggagtt tgctgaagtt tttgagcctc gtttaggttc 1440 atatactacg aatacgatat cgttactatt aaaagaagat gcaaaaccaa tatttttcaa 1500 gcctaggtca gtaccattag catggaaaga aaagattgaa gtgcagttac gaaaattcat 1560 agatgatgga attttagagc aagttgttag ttctgaatgg gcaacacctt tagtaccgat 1620 tttaaaacca aacggtgaca tacgaatttg tggtgactat aaggttacat taaaccggtc 1680 tttagtcgac gtaaagtatc cactacctcg aatagacgac atttttgcag cgcttgaagg 1740 gggtacactg tattcaaaac ttgacctgtc aaatgcttac aatcagctaa atttagatga 1800 gcaatctcaa aatttgtgta cttggagcac acatattgga ctattgaaag ttaaacgctt 1860 accatttggt ataaaaactg cagcagctat ttttcaaaaa acaatggaag ggttgtttca 1920 gggtatgcga ggagttgtgg tttatcaaga tgacataaca gttacaggac gagatcttca 1980 agaacatatt tcaaacctaa aagctgtact tagaaaactg aaatcagttg gacttaaatt 2040 aaatgacaag aaatgtgagt tctttaaatc caaaatttgt tacctaggat tttcaattga 2100 caggatagga ctgagcaaaa acaatgatag agttgcgagt gtattgtttg caccggctcc 2160 ggaaaacgta tcacagctta gagctttcat aggcatggta aattattatt caaagtttat 2220 caataatttc gctcagataa tgagtccttt atatgcatta ttaaagaaaa atacaaaatt 2280 taattggaca cgcgaatgtc agagtgcata tgaagaggta aagaaagaag taacttctga 2340 acaagtttta gttcactaca acccagatca accgttagta ttaacgactg atgctagcag 2400 tcatgcagtt tcaggtgttt tatcacataa atgcaatgaa gatatcaaac caatagcatt 2460 tgtatcaaga gcattatcca aaagcgagca taattacagt acaatcgaga aagaggccct 2520 ggctatagtg ttctgtgtga gtaaattaag acagtatctt ttaggaaaca agttcatcct 2580 tcgaacagat cacaaaccat tactaagcat atttgaagac ctgaaaggac ttccattgat 2640 ggcctctgca cgaatgcaaa gatgggcact gactttgtca ggttttcaat atacagttga 2700 acacataaag gggaccttaa atatagcaga tggactctca agaatgcctc aatgggaaac 2760 ggctgtacca aacgaagact ataattacat acattttata cagtccgaaa aggtgttcaa 2820 aattagcttc aaagaaatag ctcgtgaaac acgacgtgac ccaatattat caaaagtttg 2880 tgaggcaatt aacactggtt cgaagacaag attaaatggc agcgagtttt ctgcctacat 2940 ctcaaaaaca aatgaacttt cagtggaata tgattgtatc ttatggggac acagagtagt 3000 aattccaacc aaactgagac aagctatttt agcagaattt catgcatcgc atttgggaat 3060 agtaaaaacc aaaatgttag cacgttctta tgtgtggtgg ccttgcatag attctgaaat 3120 tgagaaatta attcgagatt gtattccttg tcaggaacta caatcaagtc cagaaaagag 3180 tctgttaata ccgtggaaat ccacaggaaa ggcttggagt cgagtacata tagactttgc 3240 aggaccaatt agaggttttc atttattggt tattatagat tcttatacaa aatgggcaga 3300 agtattcaaa acgaaggaaa taacttcatt gtttactatc aacaagctta gagaagtatt 3360 ttgtcgatat ggtttaccag acgtgttagt tagtgacaat ggacgacagt ttacttctga 3420 tgattttaaa acgtttatga aaaacaatgg tgttaatcac atttttactg ctccaggaca 3480 tccagcgact aatggtcaag cagaaaattt tgtaaagact gttaagaaat cactatatgc 3540 aaacataaag gctaatgaaa gacaagattt taatacaatt ttaaatagat tcttgatcga 3600 ttacagaaat acgattcatt gtactacggg cgaatctcct gctaaattat tttttggtcg 3660 tacactaaga acaagatttt cgttgttaaa accacctacg gtcgaaagca taataaataa 3720 aaaacagact gagtgtgttg taaatcacaa aggtagacga agcacagaat tttttaaggg 3780 tcaaaaagtt atggttaggg actacaaaaa tcctaacaaa gcaagctgga ctcaggcaac 3840 tgtaaaacaa caactgggcc cgcgttcgta ttgttgtatt ttggcgaaca ataacagaga 3900 aataaaacgt cacctggatc aaattagaaa ccaaagcact aacaatcata catcgcctaa 3960 tgcgaaaaca cctgatgtcg aaagcaattg ttcagtagat gacaattcca aaaccaacaa 4020 tgaacaaata gtttctcaac caacacccac caggagagag ttgagaccac gtgaggcagg 4080 gaaggtagta aagcgtattt aacaataaaa taatataaag aaaggaatta tgaaatcaaa 4140 ttatgtacaa attaaacact gaaacaaata ctaaatagat taagaagaaa gaaaaatacg 4200 aaatcacata aattttatat acgataatta gattaagtac acaaatcaaa ttgttataag 4260 aaaatagatt tgtaaatgac atgtatatta aacatctaag gagaggcg 4308 // ID Tx1-12_CQ repbase; DNA; INV; 4798 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-12_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4798 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 644-644 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 6 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 13..1188 FT /product="Tx1-12_CQ_1p" FT /translation="MEPKRESTIKLRFGAGSRGPNNAEVFKFFGKQEWTXE FT ELSAMYRDDYSIFVKFKTEQQMREALSKLGPQTRFEYDDGTSMMVPVTAAA FT GSFKYVRIFGLPPEVDDRFIATAMSKYGTVQQMIRERFPVETGFPIHNGVR FT GIHMELVSEIPAQLTIQHVKARIYYDGLQNKCFGCGALDHLKAACPNRKDV FT NKRLTPAQKPGTGSFASVVANGTSGKAEELPPXSGMVLLGKXGAAPPTAPA FT ESVQPSEPTPSGPPXSGPSDQPQLPADPVMPPVPIAPQEPTPPKPADIAEE FT KQDQPMEEDGEREQSESDGEPMEKDGEWIEKXGKGGKGKGKRGRPKGRPGS FT DSSEVDTNGRKKFIVPGQXHDLLDAQGDRTRSRSRSAAKPAGGDLQXXK" FT CDS 1351..4623 FT /product="Tx1-12_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MNFVYTIATLNLNSSNANVNKNLLKDFIMDYNVDIAF FT LQEVSFEDFSFIYSHNSLVNISIDKKGTAVLIRKSFSXSDFILDPSGRIVS FT VRVNDVNLINIYAHSGNNLRKERDNLFTEALTVHLNKPGTKSMLIGGDFNC FT ILHAEDSGSSQKNFSCGLKHLVDLXKLKDIAKTRKECKFTFHRGESASRLD FT RFYGSDNIIGNVTSCSTIPLAFSDHHAVLIKLKLDDNNLSTRGRGYWKINP FT SFLSMEDVTEKFKQEYLKLKQRDIFNRDLSKWWFFHFKGKAKNFYKSESWH FT FNQSSRNCKAVLLNRLNELSDKLANGDNVGDEMSVIKSKLMSLEQNRLKNL FT GEKLPACTLAEGEKITIYQLSNKIHRGNIFGLQLRDGPSHEILTDNAKLGP FT VIHQYFSNQFQQSEDNISSAEIDNSIESISKNLSNDQKINLVKPITVEELY FT ETLKKCNRKKSPGPDGLTYEFYLCHFDILKNDLCRLFNLYLRGESSPPKEF FT SEGIITLVPKKGDKTLLENHRPISLLNTDYKLFTKLLANRIQIYMEDLLGS FT GQSACIAGRSCTDNLNDVRRLITKSVESKTFKGMLLSIDLEKAFDKVNHKF FT LWKILEKFGFPEKIIKILQCLYTKATSRVLYNGFLTNEIKIFSSVRQGCPL FT SMILFVLYIEPLIRKIDQCILGVLTYNKFLRVVAFADDLNVFVRNAEEFDN FT VLSIVDSFSRYSKVKLNYKKSCFLRINSCIGGPFIVQETEQLKLLGITFKN FT TWKDIVDINYSKLISDMKYRINLNNFRMMSLLEKVWFINTFVLSKLWYLSQ FT IFPPKNQHIATIRKIIGMFLWKNNIFRVDKRQLYLDVDQGGLGLIDPESKC FT KALFIKNILKEQNISATDVSEKYLLGLNFTSKLTRNGKEWIMLAADLNCSS FT PTINTAKQIYRNFIHEYLFKPRVQVELPQVNWGALWKIMGQTFILAKDKAN FT LYLLINDLVPNKRKLKQYNIGRIDSEICDECGQIDTNSHRLRTCTAGSELR FT EWMIKVLRKRFGIQLGQIDDILFWQIDENSHLQKAAMWLVVHYVAFCVDKF FT PGLSLFVFQKSIREFRWNSRTFVKRYFDTNLNIC" XX SQ Sequence 4798 BP; 1608 A; 803 C; 1033 G; 1333 T; 21 other; cgtagcacca aaatggaacc caaacgcgag agtacgatca agctccgttt cggagccgga 60 tccaggggac cgaacaatgc ggaagtgttc aagttcttcg gaaagcagga atggaccmac 120 gaggagctwa gtgcgatgta tcgggatgac tactcgattt tcgtgaagtt caagacggaa 180 cagcagatgc gggaagcttt gtcgaagctc ggtccccaga ccagattcga gtacgatgac 240 ggaacctcga tgatggtacc ggtcacggca gcagcgggat ccttcaagta cgtccggatt 300 ttcggactgc ctccggaggt ggacgaccgg ttcattgcta cggcaatgtc caaatacgga 360 acggtgcagc agatgattcg ggagcgtttt ccggtggaaa ccgggttccc gatwcacaat 420 ggagttcgcg gaatacacat ggagctcgts tcggagattc cagcgcagct gacaattcag 480 catgtcaagg cgcgaatcta ctacgacggw ctacaaaaca agtgcttcgg atgtggagct 540 ttggaccacc tcaaagcagc gtgtccaaac cggaaggacg tcaacaaamg gctgactccg 600 gcgcagaaac cagggactgg ttcctttgcc agcgtggtgg ccaatggaac aagcggaaag 660 gcagaagagc taccacccaw ttctgggatg gttttgctgg gaaagmttgg tgcggcgccg 720 ccaacagctc cggcagagtc ggtgcagcca tcggaaccga cgccatctgg accgccggma 780 agcggcccat ccgatcaacc gcagctgcca gcggacccgg tgatgccacc ggtwccgatc 840 gcaccgcaag aaccgacgcc accgaagcca gcggacatcg cggaagaaaa gcaagatcag 900 ccaatggagg aagatgggga gagagagcag tcggagagcg acggcgaacc gatggagaag 960 gatggtgagt ggattgaaaa aamggggaag ggagggaagg ggaaggggaa gcgtggtcgt 1020 ccgaagggca ggcccggttc ggactcttcc gaagtkgaca cgaacggccg aaagaagttc 1080 atcgtgccag gacaggkcca cgatctgctc gatgcacagg gmgaccgaac gagatcgcga 1140 tcgcgatcag cagccaaacc agcaggcggc gacctgcaac samgcaagta aaagaaccga 1200 attttccggc gttgacccat tgcactcaca catggacttg wttttgattc gtggaccatt 1260 tactaacaca accgatatta ctaacccttt tactaacacc ttaacaatag tttacttact 1320 aacacaaaag aaaagacaac ttctgtaaat atgaactttg tatatactat agcaacacta 1380 aacttaaaca gcagcaacgc aaatgtaaat aaaaatttat tgaaggattt tattatggat 1440 tacaatgtag atatagcttt tttgcaggag gttagttttg aagatttttc gtttatttat 1500 tcacataatt cattggtaaa tataagtatt gataagaaag gcacagcagt gcttattagg 1560 aaatcctttt catwttctga ttttattcta gatccatcag gcagaattgt ttcggttagg 1620 gtaaatgatg taaatttgat taatatttat gcwcactctg ggaacaattt aaggaaagag 1680 cgggataatt tgttcaccga ggctttgaca gttcacttaa ataaacctgg aaccaagtcg 1740 atgctgattg gaggtgactt taattgcatt ttgcatgcag aagatagtgg aagttcacaa 1800 aaaaactttt cttgtggatt gaaacattta gttgatttgt wtaaattaaa agatatmgcg 1860 aaaacacgga aagaatgcaa attcacattt cacaggggag aatctgcttc caggttggat 1920 cgattttacg gatctgataa catcattggg aatgttacaa gttgttcaac tattccactc 1980 gcgttttcag accatcatgc tgttttgatc aaattaaaat tagatgataa taatttatcc 2040 acaagaggac gtggatactg gaaaataaat ccatccttct tgtcgatgga agacgtaact 2100 gaaaagttta aacaagaata tttgaaactg aagcaaagag atatatttaa tcgtgatttg 2160 agtaaatggt ggttttttca tttcaaaggg aaggcaaaaa acttctacaa aagtgaaagt 2220 tggcatttta atcaatcgag caggaattgt aaggctgtat tattaaatag attaaatgag 2280 ctttctgaca aattagcaaa tggtgataat gtgggtgatg aaatgtccgt gattaaatcg 2340 aaattgatgt cattagaaca aaatcgttta aagaatttag gggaaaaact accagcttgt 2400 actttagcag aaggagagaa aataacaatt tatcaattat ccaataagat acaccgtgga 2460 aacatttttg gcttacaact gagagacggt ccaagtcacg aaattttaac cgataatgct 2520 aaactcgggc cagttattca tcaatatttt tctaatcaat ttcaacaaag tgaagataat 2580 attagcagtg cggaaataga taattcaatt gaatcaatta gtaaaaattt aagtaatgac 2640 caaaaaatta atttagttaa accaattaca gtagaagaac tttacgaaac tttgaaaaaa 2700 tgcaatcgta aaaagtcgcc tgggcctgat ggattaactt acgaatttta tttatgccac 2760 tttgatattc tcaaaaatga tttatgtcgt ctatttaatc tttatctgag aggagagagt 2820 tcacctccta aagagttttc agaaggaata attactttag ttccaaagaa aggggataaa 2880 acacttttgg aaaaccatag accgatttct ttacttaata cagactacaa actttttact 2940 aagcttctcg caaatagaat tcaaatatat atggaggatt tattaggttc aggtcaaagt 3000 gcatgcattg ctggacgatc atgtactgac aacttgaatg atgttagacg tttaatcacg 3060 aaatcagttg aaagcaaaac ctttaaagga atgctactaa gtatagattt ggaaaaagct 3120 tttgacaaag tcaatcataa atttctttgg aaaattctag aaaagtttgg gtttccagaa 3180 aaaattatca aaattttaca gtgcctctat accaaagcaa catcaagagt attatataat 3240 ggatttctta caaatgaaat taaaatattt tcttctgtca gacaaggatg cccgctaagc 3300 atgattttat ttgtattgta tattgaaccg ttaattcgta aaattgatca atgcattctc 3360 ggggttctta cttacaataa gttcctcaga gttgtggcat tcgcggacga cttgaatgta 3420 tttgttcgaa acgcagaaga gtttgacaat gtcctgagta ttgttgactc cttctcccga 3480 tattcgaagg ttaaacttaa ttataaaaag tcttgttttc tcagaatcaa cagttgcatt 3540 ggtggacctt ttatagttca agaaactgaa caattaaagc ttttaggtat tactttcaag 3600 aacacgtgga aggatatcgt tgatataaat tattctaaat taatcagtga catgaaatac 3660 agaattaatc tcaacaattt tcgaatgatg agtttattag aaaaagtttg gttcattaac 3720 acttttgtgt tatcaaagct atggtattta tcacaaatct ttccaccaaa aaatcagcac 3780 attgctacga tcaggaaaat tattggaatg tttttatgga aaaataatat ttttcgtgtt 3840 gataaaagac aactttatct agatgtggat caaggagggt tgggtttaat cgacccagaa 3900 tcaaaatgta aagcattatt tataaaaaat attttaaagg agcaaaatat atctgcgact 3960 gatgttagcg aaaaatattt attaggttta aattttacat caaaattaac aagaaatggg 4020 aaggagtgga ttatgttagc tgctgattta aactgtagta gtccgactat aaatacagca 4080 aaacaaattt atcggaattt cattcatgaa tatttgttta aaccacgtgt tcaagttgag 4140 cttcctcaag tcaactgggg agctttgtgg aaaatcatgg gccaaacttt cattctggcc 4200 aaggataaag ctaatttgta cttattaatt aatgatttag tgccaaataa acggaaactg 4260 aagcagtaca acattggaag aattgatagt gaaatttgtg atgaatgtgg acaaatagat 4320 acaaactctc atcggttacg aacatgtaca gcaggttctg aattgcgtga gtggatgata 4380 aaagttttaa gaaaaagatt cggaatccaa cttggacaga ttgatgatat tttgttttgg 4440 caaattgacg aaaatagtca cctccaaaaa gcagcaatgt ggttagtagt acattatgtc 4500 gccttttgcg tggacaaatt tcccggtctt agcttatttg tttttcagaa gagtattaga 4560 gaatttagat ggaactcgcg gacctttgta aaacgatatt ttgatactaa cttaaacatt 4620 tgttagatat ataaaggatg gatcttgtag ggcttggtag ggcaaactta attgatactt 4680 tgtacatcag ggaatttaag aatcctcaaa aaagttgctt gctgtttgaa ttgtaatcga 4740 aatggtaaat agttgctaaa taaagaaaag tttttagaaa aaaaaaaaaa aaaaaaaa 4798 // ID BEL-239_AA-I repbase; DNA; INV; 5432 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-239_AA_; KW BEL-239_AA-LTR; BEL-239_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5432 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 933-933 (2011). XX DR [1] (Consensus) XX CC Positions [4400-4972] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(125..1591,1595..5341) FT /product="BEL-239_AA-I_1p" FT /translation="MEKKKEPMRQQAELSESASASHISKSERVRNWITSQQ FT IDKEDKGKIQLGMSNSRIARPASPQRLNAPLLATAYPTPQFSNLSLRDDDR FT CIGNPELPTTYMQIAARQVTGKDLPVFSGSPENWPMFIRIYEETTAACGFS FT DVENLVRLQKCLRGNALETVRSRLMMPAGVPHVIKTLQMRFGRPELIIRSL FT LDRVRHVPAPKPERLDTLIDFGLAVENLVVHLQAAKQENHLTNPVLLQELV FT MKLPAQLRLDWARYELIHQDPTLAAFGTFMNELIQAASEVSFELPISLSAR FT SEKPKDREKVFVHAHDSPDMETKPTGVVKKFPKPCIVCSVVGHRIVECEEF FT KAKSIDDRLKIVRQNNLCRTCLNFHQKWPCRTWNGCNIEGCREKHHSLLHS FT PISSNCVHLSTNHNYILDDDCVRYPYFRILPVVVSMDNKRQLIFAFIDEGS FT SSTLLDRSVAEQLGLEGPTEPVTLQWTANVARQESKSKRVNLQIAAGNTST FT FQINGAHTVEQLLLPKQSVPYSALVKQYPHLGGLPIADYEQAEPKMLIGLD FT NLSLCVPLKIREGRYNEPIAAKCRLGWAIYGFRTGAPMPTVSINFHVPAAQ FT DADHELNEQLNDFFSLDKLGSTSTFEIPESEAVKRARKLLQETTRRVSTRF FT ETGLLWKSDTIEFPDSFPMASKRLELLERKLSRNSSLQGQVHQKIAEYVAK FT GYCHRASLEELNSSDSKRVWYLPLCVVVNPKKPNKVRVVWDAAAKVNGVSF FT NSALLKGPDLLTSLTSILYHFREYRIAVTGDIEEMFLRILIRPEDSSSQRF FT LWRENPGDNPSVYIIDVATFGSTCSPSSAQFVKNANAKDYREKFPRASSAI FT IKYHYVDDYLDSFDTVEEAIEVVKQVKSIHSEGGFNLRHFLSNSNDVVLAV FT SEAIGNDSDEDTKQFNIVRDEKIESVLGMKWNPSRDVFVYTLSLRDDLIVI FT INPSHTPSKREMLKLVMSLFDPLGFLAFYLIHGRILIQDVWATGVDWDTPV FT NSELAQRWWQRISFLPSLNKVQIPRCYFHGYMDNSKQLHVFVDASDAAYAC FT VAYLRAVGTAGVELAIVGAKSKVAPLKVLSVPRLELMAAVIGARMVESVVT FT SHSYTITDVFLWSDSSTVLAWINSDHRKYHKFVGVRIGEILALTKINQWRW FT LPTKQNPADEATKWGDGPEFSSNSRWFRGPPFLYQTESEWPESRISSSTEE FT ESINGRHTVHQNSHQAIPTSIIDFSRFHKWERLLRTQAYVIRFVNNVGSRR FT NGKHVEKGTLTQNELKHAERQLWKQAQAEFYYQERGVLLETQGSPAARHNV FT VHKASPIYKLWPYIDEDGVMRMRGRIGAAWYATPDAKYPVILPKTHPISSL FT LVDWYHRRYNHANHETVVNEMRQRYEIPTLRVLVKQTSKSCFKCRIANATP FT CPPPMAPLPEQRLTPFVRPFTFVGLDYFGPLIVKVGRSEVKRWVALFTCLT FT VRAVHLEVVHSLSSESCVMAVRRFIARRGTPAEFFSDNGTSFVGANKQLQR FT EIASRNEVLSSTFTNTNTRWHFNPPGAPHMGGAWERLVRSVKSAIGMIIDT FT PRRPTDEVLETILLDAEAMINSRPLTYIPLDTADEESLTPNHFLLGNSSGV FT KQPMMEMKEYRTNLRSCWALAQHITDTIWKRWIKEYLPVITRRCKWFEEVR FT DIQEGDLVLMIGTAMRNQYIRGRVEKVFIGRDGRVRQALVRTATGVYRRPV FT AKLALLDVEVSGKRGPDT" XX SQ Sequence 5432 BP; 1556 A; 1212 C; 1303 G; 1356 T; 5 other; atwcatgtaa cttamttgga atatgaagaa aatgttgatg gaagaagaga caaagctccg 60 agaagcggag ttgtcgaagc aaaaagcttt acaggagaag ataatgkttt tkagacggga 120 atcgatggag aagaagaagg agccaatgcg acagcaggcg gagttgagtg aatcggcgtc 180 ggcgtcgcat atttcgaagt ccgaaagggt gagaaactgg atcacgtcac agcagatcga 240 taaggaagat aagggtaaaa tacaacttgg aatgtcgaac tctcgcatag ccaggccagc 300 ttctccacaa agactaaatg ctcctctact agccactgcc tatccaacgc cacagttctc 360 taatctttcg ttgcgtgatg atgatcgttg catcgggaat ccggagttac ccacgacgta 420 tatgcaaatc gctgcccgac aagtaacggg caaggacctt ccggttttca gcggtagccc 480 ggaaaactgg ccgatgttta ttcggatcta cgaagaaacg actgcagctt gcggattctc 540 agatgtagag aatttagtac gccttcagaa gtgtttgcgc ggaaatgccc ttgaaaccgt 600 gcgaagccga cttatgatgc ctgcgggagt gccgcatgta atcaaaaccc tccaaatgcg 660 tttcggtaga ccggaattaa ttatccgttc tctactagat cgagtgcggc atgttcccgc 720 tcctaagccg gaacggctcg atactctaat cgactttggc ttagccgtgg agaatcttgt 780 agtccatttg caagcggcga aacaggaaaa tcacttaaca aatcctgtct tactgcaaga 840 gttggtcatg aagctgcctg ctcagcttag attggattgg gccagatacg aactcatcca 900 tcaagaccct acccttgcag catttggcac cttcatgaac gaacttattc aagcagcgag 960 tgaagtgtcg ttcgagttac cgatcagcct atcagcgaga tccgagaagc caaaagacag 1020 agaaaaagtt tttgtacatg ctcacgactc ccccgatatg gaaaccaaac ctaccggtgt 1080 agtgaagaag tttcccaagc catgtattgt atgcagtgta gtagggcacc gtattgtgga 1140 atgtgaagaa ttcaaagcaa aaagcattga tgatcgcctg aaaatagtac gacagaacaa 1200 cttgtgccgt acctgcttaa atttccacca gaagtggccc tgtcgaacat ggaatggctg 1260 taacatcgaa ggctgtcgcg aaaaacatca ctccctgcta cattcaccaa tttcttctaa 1320 ctgtgtacac ctttcgacaa accacaacta tatattggac gatgactgtg ttcgttatcc 1380 atattttcgc atactcccag tggttgtatc catggacaat aagcgacaat tgatatttgc 1440 gttcattgat gaaggatcct catctacact tttggaccga tctgtggcag aacagctggg 1500 attagaagga ccaacagagc ctgtgacgtt gcaatggacg gccaacgtcg ctcgacaaga 1560 atcgaaatcg aaaagggtga acttacaaat tkcggcggct ggaaatacta gtacattcca 1620 gatcaatgga gctcatacgg ttgagcagct tctattgccg aagcaatcag tcccatatag 1680 cgcattagta aagcaatatc cacatttggg cggattaccc atcgctgatt atgagcaggc 1740 cgagccgaaa atgttgattg gacttgataa ccttagtctg tgcgtacccc ttaaaatacg 1800 cgaagggcgt tataacgaac ctattgctgc gaaatgtagg ctaggctggg cgatttatgg 1860 gttcagaact ggtgcaccga tgcctacggt gtcgattaat ttccatgtac cagccgccca 1920 agacgcagac cacgaattga atgagcaact gaatgacttc ttctcattgg ataaattggg 1980 gtctacatct acgttcgaga tacctgagtc cgaagctgtt aaaagagcta gaaagttgtt 2040 gcaggaaacg acgcgtagag tttccactag attcgaaacc ggtttattat ggaaatctga 2100 caccattgag tttcctgata gttttccgat ggcaagcaaa cgacttgaat tgttagagag 2160 aaagttgtct agaaattcat ctcttcaagg tcaagttcat cagaaaatag cggagtatgt 2220 ggcgaaagga tactgtcatc gggctagtct tgaggaactg aactcgtcgg acagtaagcg 2280 tgtatggtat ctcccgttgt gcgttgttgt aaacccaaaa aagccaaata aggttcgtgt 2340 tgtgtgggac gctgcggcaa aagtaaacgg tgtatcgttc aattccgcgt tgctaaaagg 2400 cccagaccta ttgacgagcc ttacctctat tctataccat tttcgtgaat acagaattgc 2460 cgtgaccggg gacattgaag aaatgtttct ccgaattctc atccgaccag aagacagtag 2520 ttcgcaacgg tttctatgga gggagaatcc tggcgacaat ccttcggtat acattatcga 2580 cgtggcgacc tttgggtcga cttgttcccc aagctcggcg caatttgtta aaaatgcgaa 2640 tgcaaaagac tatcgagaga aatttccccg ggcttcctca gcaatcatca aatatcatta 2700 cgtagatgat taccttgata gttttgatac ggtagaagaa gccatcgagg tagttaagca 2760 ggtgaagtca attcactccg agggaggttt caacctgcgc cattttttgt ccaactccaa 2820 tgatgttgta ttggcagtca gcgaagcaat tggcaatgat agcgacgaag atactaagca 2880 attcaacata gtacgagatg agaagataga atccgtacta ggaatgaagt ggaatccatc 2940 cagagatgta tttgtctaca cgttatcgct ccgtgacgat ctaatcgtca ttataaatcc 3000 ctcacacact cctagtaagc gagaaatgct caagctcgtc atgagcctct tcgaccctct 3060 aggtttcttg gcattttacc tgatacacgg gagaatttta attcaagatg tttgggctac 3120 tggcgttgat tgggacacac cagtaaacag tgaattagct caaagatggt ggcaacggat 3180 cagcttcttg ccatcgctca acaaggtgca aatcccccgc tgctattttc atggttatat 3240 ggataattca aagcagcttc acgttttcgt ggatgcgagc gatgctgcgt atgcttgtgt 3300 cgcctacctg cgcgcagttg gaacagcagg agtggagttg gcaattgtgg gcgcgaaaag 3360 taaggtagcg cctctcaaag ttctgtctgt ccctcgctta gaattgatgg cagcggtaat 3420 cggtgctcgg atggtagaat ctgtcgttac ttctcattca tacaccataa ccgacgtatt 3480 cctctggtct gactcatcaa cagttcttgc ctggatcaat tcggatcacc gaaagtacca 3540 taaattcgta ggagtcagga ttggagaaat tcttgcgttg acaaaaataa atcaatggag 3600 atggttgccc accaaacaaa acccagctga tgaggcaacc aagtggggag atggacccga 3660 attcagttcc aatagtcgat ggtttcgtgg gccgcctttt ctctaccaaa cagaatctga 3720 atggcccgaa agtcggatat cgtcttctac tgaagaggaa tcaataaatg gtcgccatac 3780 cgtccatcag aacagccacc aagcaatccc cacatcaata atagactttt cgcgttttca 3840 taaatgggaa cgattgctta gaacgcaagc ttacgtaata cggttcgtca acaatgttgg 3900 tagtcgcaga aatgggaaac atgttgaaaa gggaacacta acgcagaatg agttaaaaca 3960 cgccgaacga cagctttgga aacaggctca agcagaattt tactatcagg agcgaggtgt 4020 actgctggaa acccaaggga gtccggccgc tcgtcataat gtggtacata aagcaagtcc 4080 aatctacaag ttgtggccat acattgacga agacggcgta atgcgaatgc gaggaagaat 4140 aggagcagca tggtacgcta cacccgatgc caagtatcca gttatacttc caaaaaccca 4200 tcctatttct tcactactgg tagattggta tcaccgacgt tataatcatg ccaatcatga 4260 gacggtggtg aatgaaatga gacaacgcta cgaaatacca acgctacgag tgctagtgaa 4320 acaaacttca aaaagttgct tcaagtgtcg aatagcaaat gcgactcctt gtccaccacc 4380 aatggcacca cttccagaac aacgtttaac accctttgtg agaccattca catttgtcgg 4440 tttggattat ttcggtcctt taatagtcaa agtcggtcgc tctgaagtca aacgatgggt 4500 ggctctattc acgtgcctca cggtgcgtgc tgtacatctc gaggtagtac atagtctatc 4560 cagcgaatca tgtgtgatgg ctgttcgacg cttcattgcc cgccgcggta caccggcgga 4620 gtttttcagc gacaacggga cgagctttgt aggtgcaaat aaacagttac agcgagaaat 4680 agcgtcaaga aatgaagttc tttccagcac tttcacaaat accaacacaa gatggcattt 4740 caatccgccc ggtgcccccc atatgggcgg agcatgggag cgattagttc gctctgtcaa 4800 aagtgccatt ggaatgatta tcgacacacc tcgtcgtcca acagatgaag tgctggaaac 4860 gatcctattg gacgccgaag ctatgataaa ttctcggcct cttacctaca tacctctgga 4920 cactgcagac gaagaatctt tgactcctaa ccatttcctt ttggggaatt cttctggcgt 4980 taaacaaccg atgatggaaa tgaaggagta tcgaactaat ctgcgaagct gttgggcact 5040 agctcaacat atcaccgata ctatttggaa gcgatggata aaagaatatt tacccgttat 5100 aacccgtcgt tgcaaatggt tcgaagaagt ccgggatatc caggaaggcg atttagtgct 5160 aatgattgga acagcgatga ggaaccaata tataaggggc cgcgtggaaa aggtgtttat 5220 cggacgagat ggtcgagtgc gtcaggcttt ggtgcggacc gctactggag tatataggag 5280 acctgtcgct aagcttgcac tactcgacgt tgaagtatct ggtaaacgtg gtccagacac 5340 atagacccgc ctattcttga ccatccttta cgggtggggg gatgttacgt cgcatagcaa 5400 gcccctcgac tggtcacact gtccgaatcg tg 5432 // ID CR1-115_AAe repbase; DNA; INV; 4236 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-115_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4236 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1203-1203 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 18 sequences with >93% CC identity. XX FH Key Location/Qualifiers FT CDS 132..896 FT /product="CR1-115_AAe_1p" FT /translation="MNKTLYSGGQSSKQVNQSLDQTSFQRKIDMLTLDDVD FT DPINRTRSCEETSFFEVLDEINSTVAQPSEKFIVGPNKRVQILTNQPSSSS FT SSHVQENASTPAASSQKRRAPIITSDISSHIGHSSQHRLDLPASNRSRQHD FT NTRPSSGPLSVAKVGQTASDMSDFYVTPFTPNQSEEDIKQYIQEICKVDIS FT SVRVAKLVPRGKKLDDLTFVSFKVSVDNTISEMIGDPWYWPEGVSVRAFDY FT IQKNEPTTLRPTSS" FT CDS 827..4129 FT /product="CR1-115_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="RCLCPCIRLHPKKRTNDTPSDLVIEMTTTSHVPNHQT FT LLAVPDLRFNPGCLTKLGRIHSGIPGDPTPSVTVQLTSAISSNRRRPSSVD FT DFGGRVAQPAISGKFSSNIEHSSPHVTSLSSTVEDISSNRKLTHPSSSINQ FT RTPPSPRRTSIRIYYQNVRGLKTKIDSWFLAVSDCDYDIIAITETWLDDRI FT NSVQLFGPMYDVYRCDRNSENSNKRSGGGVLIAVRKSLSSVQVNVGNMVEQ FT AWVSVKLEKRSLHISSFYLPPDKTNDPNIIEAHLNSIRFIESGAAQDDEIL FT IFGDYNLPRAYWIKDDHGSIVLDSARSTLNASCNAIIDGMAFHNLHQLNLV FT SNKNGRILDLVFSSSELVAVTEASDRLIAIDLHHPPLEMVLLAVPEEICAH FT ETEHRLDFKKINFDSLLTFLDNINWAFVSLCTDVDTATRNFIGILNDWLVT FT HVPRSKSQRYPPWETHELRTLKRRKNAALRRFRKNRAPAFKTQFQCASRAY FT MSMNDSCYRAYVVRTQNQLRRNPKSFWGFVNSKRKETGLPLTMSFRERECS FT DPSGKCALFAECFSSVFNNNLCTSSDIERATCDVSPHVADIDIFDVTDSMI FT LRAVSKLKNSHSPGPDGIPAIVYKKCINSLLAPLRHIFNLSLAQGSFPSIW FT KESFMFPVFKKGSKRDISNYRGITSLSAGCKLFEIIVNEYLHFELKNFIST FT DQHGFFPGRSVSTNLANFSNTCITNMENGYQVDAIYTDLTAAFDRIDHDIL FT LKKIEKIGASVGFIKWLRSYLVGRKLKIRIGASCSSYFESTSGVPQGSNLG FT PLLFSVFFNDVTNVLPRGCRILYADDLKIFLVIRNPDDCVRLQNMLSHFYD FT WCERNKMSVSVSKCFVISFHRKSSPINFNYKLAGCTLNRTTLVRDLGVMLD FT SNLSFNQHRCMVIDKANRQLGFIAKISREFTDLYCLKALYCSLVRSILETA FT DIIWTPYQSTWIERIEKIQKRFLRHTLAHLPWNNPANLPPYTDRCRLLNMD FT TLENRRRANQALFVAKLLKGEVDAPMLLSLLPVNIPSRPFRRYTFLRQAQH FT RTNYGSNAPLPAMIAEFSRVQHLFDFNLSSGVFKNRVLEYLRAS" XX SQ Sequence 4236 BP; 1215 A; 995 C; 837 G; 1189 T; 0 other; cacgtcgtct gaattttgac cccgccgttt atgaccgaga gaagactatt attaaagctt 60 tgcgagagct attaataaga actgattcaa tcgatacgag attgggcaat tacggcgaaa 120 acctacggag aatgaataaa acgctctaca gtggaggcca aagttctaaa caagttaacc 180 agtcgttaga tcaaacgtct ttccagcgca aaatagatat gctcaccctg gatgacgtcg 240 acgatcctat caacagaaca agatcatgcg aagagacttc tttttttgaa gttttggacg 300 aaattaattc tacagtggca caaccgtctg aaaaattcat cgtcgggccc aataaacggg 360 ttcagatcct aaccaatcaa ccatcttcga gctcaagcag tcacgttcaa gaaaacgctt 420 ctactcccgc cgcttcttct cagaaacgta gggctccgat catcacttcg gatatatcgt 480 cccatatcgg ccactctagt cagcaccgcc tcgacctacc tgctagcaat agatcacgac 540 agcatgacaa tacacgacct agttctggac cgctcagtgt ggcgaaagtt ggacaaacgg 600 ccagtgatat gtctgacttt tatgtcactc cattcactcc caaccaatcg gaagaggata 660 taaagcagta cattcaggaa atttgcaaag tggatatttc ttctgtacga gttgctaagc 720 tggtgcccag gggaaaaaaa ctggatgatc ttactttcgt ctctttcaaa gtttccgttg 780 acaatactat ttcggaaatg attggtgacc cttggtattg gcctgaaggt gtctctgtcc 840 gtgcattcga ttacatccaa aaaaacgaac caacgacact ccgtccgacc tcgtcataga 900 gatgaccacc acttctcatg taccgaatca tcaaacgttg ctggctgttc ctgatctccg 960 gttcaaccct ggctgtttga cgaaattggg acgcattcat tcaggcattc cgggcgaccc 1020 tactccatcc gtcacagttc aactcacatc tgccatctca tccaaccgaa gacgtcctag 1080 ttctgtagac gattttggag gcagggtcgc ccagcctgca atctcaggca agttttcttc 1140 caatattgaa cattcgtccc ctcatgtcac ttcgctttcc agcactgttg aagacatctc 1200 cagcaacagg aaattgactc atccaagctc atcaatcaat caacgtaccc caccctctcc 1260 tagacgcaca tcaataagga tctactacca aaatgttcga ggcctgaaaa caaaaatcga 1320 tagttggttc ttggcagtta gtgattgtga ctacgatatt atcgcgatca ctgaaacttg 1380 gctagatgat cgtataaact ccgttcaact tttcggtccg atgtatgacg tttatagatg 1440 cgatcgaaat tccgagaaca gcaacaagcg tagtggaggt ggtgttctga ttgcggtcag 1500 aaaaagttta tcttccgtac aagtaaacgt cggcaacatg gtggagcaag cgtgggttag 1560 tgtgaaactg gaaaaaagat cactccacat ttcttcgttt tatctccctc ccgataaaac 1620 aaatgacccg aatattatcg aggcacattt gaattctatt cgattcattg aatctggcgc 1680 tgctcaagac gatgaaatac taattttcgg ggattacaac ctaccacgag cctattggat 1740 aaaagatgat catggaagca ttgtcctgga ctccgcacgt tctaccctga atgcttcatg 1800 caatgccatt attgatggta tggcgttcca taatcttcac caattgaatt tggtcagcaa 1860 caaaaatggg cgcattctgg accttgtgtt ctccagttct gagctggtag ctgttactga 1920 agcgagtgat agattaatag ctatcgactt gcaccatcca cccttggaaa tggtactttt 1980 ggctgtgcct gaagaaatct gtgcgcatga aacggaacat cgtttagatt tcaagaagat 2040 aaacttcgat tctttactga ccttcctcga caacataaat tgggcgtttg tgagcctctg 2100 tacagatgtc gacactgcca ctcgcaattt tattggtatt ctgaacgatt ggcttgttac 2160 gcatgttcca agaagtaaat cacagcgata ccctccatgg gaaacccacg agctgcgaac 2220 actaaaacga agaaagaatg cagcactgcg tcgttttcgt aaaaatcgtg ctcctgcttt 2280 caagacccaa tttcaatgcg ctagccgagc ctacatgtcg atgaatgatt catgctatcg 2340 agcttatgtt gttcgtaccc aaaatcaact tcgtcgtaat ccgaaatcat tctggggttt 2400 tgtgaacagt aaacgtaagg aaaccggact gcccttaact atgtcattta gagaaaggga 2460 atgcagcgac ccctcaggaa aatgtgcttt attcgccgag tgtttttcga gtgtcttcaa 2520 taataatcta tgcacctcct ctgacataga acgcgccact tgcgatgtct ccccccacgt 2580 tgctgacata gacatttttg acgtaaccga ctccatgatc ctgcgggcag taagcaaact 2640 gaaaaactct cattcacctg gtcctgacgg aattcctgcc atcgtttaca agaagtgtat 2700 caactcactc ttagcacctc ttcgacacat attcaacctt tctctggcgc agggtagctt 2760 cccaagtata tggaaagaat cattcatgtt cccagttttc aaaaaaggat ctaaacgcga 2820 tatcagtaat tatagaggaa tcacctctct cagtgctgga tgcaagctat tcgaaattat 2880 cgttaacgaa tatctacact tcgaattgaa aaatttcatc tccacggacc agcatggatt 2940 cttccctggg agatcggttt caacgaatct agccaacttc tctaacacat gtatcacgaa 3000 catggagaac ggatatcagg tggacgctat ctacactgat ttaaccgctg ctttcgaccg 3060 aatcgaccat gatattcttc taaaaaaaat agaaaaaatt ggagcatctg ttggatttat 3120 caaatggtta cgaagttatc ttgttggacg gaaattgaaa atcagaatcg gtgctagttg 3180 ctcgagttat tttgaaagta cttctggagt tccccaaggc agtaaccttg ggcctttgtt 3240 gttctccgtt tttttcaacg atgtcaccaa tgttcttcca cgtggatgcc gtatcctgta 3300 cgcagatgac cttaaaatat tcttagtgat acgcaatccg gacgattgtg ttagacttca 3360 aaacatgcta agccattttt acgattggtg tgaacgaaac aagatgtctg taagcgtttc 3420 gaaatgtttc gtgatttcat ttcataggaa atcgagtcca atcaatttca actacaaact 3480 ggcaggctgt accctgaata ggactacctt agttcgggac ctaggggtga tgttggacag 3540 taatctctcc ttcaatcaac acagatgtat ggtgattgat aaagctaatc gtcaattagg 3600 ttttatcgca aaaatctctc gtgaattcac cgatctctac tgcttgaaag cactgtactg 3660 ctcgcttgtt cgctcgatac ttgagacagc cgacataatt tggactccct atcaatcaac 3720 atggatcgaa cgaatcgaaa agatccagaa acgtttccta cgacacacac ttgcacattt 3780 gccttggaac aatcctgcga acctgcctcc ctacactgat cgttgtcgac tccttaacat 3840 ggataccctg gaaaaccgga ggcgagcgaa ccaagcgctt ttcgttgcaa aactactgaa 3900 aggtgaagtt gatgccccaa tgctactttc tcttctacct gtaaacatcc cgtcaagacc 3960 gtttcggaga tatacatttc tacgtcaggc tcagcataga actaattacg gatccaacgc 4020 tcctctacca gcaatgattg ctgaattttc tcgtgttcaa cacttatttg attttaattt 4080 atcgtccggt gttttcaaaa acagagtcct tgagtatttg cgagcttcat aaatctttgt 4140 cattagtctt caattttaaa atagttttaa gccttgttca ctaggatact aaatccgatg 4200 aatgtatttt gtaataaata aataaataaa taaaaa 4236 // ID Copia-14_DPu-LTR repbase; DNA; INV; 292 BP. XX AC scaffold_26; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_DPu_; KW Copia-14_DPu-LTR; Copia-14_DPu-I. XX NM Copia-14_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-292 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 692-692 (2010). XX DR Genome; scaffold_26; Positions 1060040 1060331. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 292 BP; 70 A; 61 C; 49 G; 112 T; 0 other; tgttgtttgt atttgttcca aacattatat tccttgttgc ggaacctcat gtggttgccg 60 aaacccagac caaccctacc cctgtcttct tgtcagacag aaatgttgtc tgctgtgact 120 cgattctgtg taactgacaa gtgtggtact gaatatcttc ttctcaattt cgttaggtat 180 taactctgtt attcgttcta ttacctgaga ttccatctaa aggtaataca gttgttttgg 240 ttaagcaata ctccatttca tgttgtattt actttgctaa ttacactcaa ca 292 // ID DNA8-111_AP repbase; DNA; INV; 616 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-111_AP. XX NM DNA8-111_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-616 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2049-2049 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 616 BP; 217 A; 90 C; 76 G; 233 T; 0 other; tagggctgac ggtattgaaa ataattttcg gtattacggt attgatttca ataccgatat 60 ggtacggtat accgaaataa taaataaaat ctaactagta tcgatatttt ttactttaat 120 attgtttcac attaatatta tttcacatta tactttatta tatactataa tatataatat 180 tataatctat attattgcca ttgatactag aaaacagcac taaaacttgc actaataatt 240 attttatact taaaattaag ttaatttatg taaattcttg tactttgctt caattttttg 300 aaaaacataa gtaggtaact agtaaaacta gtggattttt ttaaactgtc cctacctatc 360 ggctatcaac atctattacc tataaaacaa catgattaca tgaataggta ttttatatct 420 ttatttttct acactgctac agtataggtt taaactttaa aagtaatcaa tttgaagcta 480 agtaactcac ctaaatggta ggtaggtact caagttgttt accgtttacg gtataccgtt 540 tagacggttt tttacggtat accgtttacc aaaataattt taaaataccg gtataccgaa 600 aataccgtca gcccta 616 // ID Mariner-34_SM repbase; DNA; INV; 1918 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-34_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1918 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1883-1883 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 384..1820 FT /product="Mariner-34_SM_1p" FT /translation="ASKQKKIIEDAASSHANQRSLARKYNVSLGCINGILK FT KKKIIGDVEINFIRRRKLYKTEQLDPILFKWFQIKRSRGFPVSGALLQSKA FT IKIAEELKIVGFCASSGWLNLFLKRHKIKSRVLSGESALVNGDIISTFFKE FT KGSLIKRYKNEDTFNCDEAALFFKATQPTSLVLTENDSKTGRFPKQRITVL FT LCCSATGEKLTPLIIGSSQKPRCFSGFDFNKLDIVYKANKKSWMTLNIFEE FT WLSLLNRKMRANNRKILLVLDNATVHPVEISLSNVECLYLPKNTTSKTQPL FT DQGIIRSFKSKYRALLLEKIILHEELDYVDVLSKINLLQCANMIHHAWSDV FT SIETIKHCFCKAGFLEKEYLSIGEVTIEAVEDELLEIENNFPTQEESIEED FT STAFIISEANNYVVEDFDAIKEENPQEEIFCYPTNKEVIMAVDLIESWMVY FT NSPKSVCMLKPIIDEIIRERRSRKSKITDFMIKK" XX SQ Sequence 1918 BP; 697 A; 259 C; 338 G; 624 T; 0 other; cagtcaactc tttcttatat gaacatctcc ttatatgaac atcttcctat tatgaacatt 60 tatcaaattc tataatatta atttgagaaa tatatatatt tattatgaac ataattaatt 120 tttatctatg aacaagagaa ctctgtttta ttgacaacaa aaatattaaa aaaatgataa 180 ttatataaat ttcgaattta ggtagattgt tttaaaaatc tttttttgcg tataattaat 240 aattagctta ttttttatgc aaacatttaa ttagtcaagt tcatgatata gtttattaat 300 gattaattaa tgcatacatt tagatatttt taattttcaa cccatgaaat ctccagaaaa 360 gaaacgaaaa agacatgatt tgagcatcga aacaaaaaaa gataatcgaa gatgcagcca 420 gcagccatgc aaatcagcga tcacttgcaa gaaagtataa tgtttcgtta ggatgtataa 480 atggaatatt aaaaaagaag aagatcatag gtgatgttga aataaatttt atacgaagaa 540 ggaaacttta taaaacagag caacttgacc caatactgtt caagtggttt cagattaaga 600 ggtctcgagg ctttccagtt tccggtgcac tgttgcaatc gaaagcaata aaaattgctg 660 aagaattaaa aattgtcgga ttttgtgcat cctctggttg gttgaattta tttctgaaac 720 gccacaaaat aaaatcgcgc gtactatctg gggaatctgc cttggttaat ggtgacataa 780 tttcgacatt tttcaaagaa aaggggtcgt taattaagcg atataaaaac gaggacactt 840 ttaattgtga tgaagctgct ttgttcttta aggcaacaca accaacgtca ttagttctta 900 ctgagaatga ttccaaaaca ggtagatttc cgaaacaacg aattacagtt ttgctttgct 960 gtagtgcaac gggggaaaaa cttacgcctt taataattgg atcctcacaa aaaccaagat 1020 gtttttcggg attcgatttt aacaagttgg atattgttta caaagcgaat aaaaaatcat 1080 ggatgacttt gaatatattt gaggagtggt taagcctttt aaatcgaaaa atgcgggcaa 1140 acaatagaaa gattttgttg gttttagata atgcgaccgt tcatccggtt gaaatttcct 1200 tgtcaaatgt tgaatgtcta tatctaccaa aaaatactac tagcaaaacc cagccactag 1260 atcaaggaat aattcgctca tttaaatcca agtacagagc actgctactc gaaaaaatta 1320 ttttgcatga agaattggat tatgtcgatg ttttatctaa gataaattta ttacagtgtg 1380 caaacatgat acatcatgca tggtctgatg tttccattga aactattaag cattgttttt 1440 gtaaggcggg attcttagaa aaggaatatt tgagtatagg agaggtgaca attgaagcag 1500 ttgaagatga attattggaa attgaaaaca attttccaac acaagaagaa tcaatcgagg 1560 aggattctac tgcatttatt attagtgaag ccaataatta tgttgtcgaa gattttgatg 1620 caataaagga agaaaatcca caggaggaaa ttttttgtta tccgactaac aaggaagtaa 1680 ttatggcagt tgatcttatc gaatcttgga tggtatataa ttccccgaaa tcagtttgca 1740 tgttaaagcc aataattgat gagataatac gcgaaaggcg atctaggaag tctaaaataa 1800 ctgactttat gattaaaaaa taattggcat aatttttgaa cttcttatat gaacagtttt 1860 tatgtatgaa cactattttt tattgaaact aagtgttcat attagaaagg gttgactg 1918 // ID Gypsy19-I_Dya repbase; DNA; INV; 6721 BP. XX AC chrU; XX DT 19-MAY-2009 (Rel. 14.05, Created) DT 19-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy19_Dya; KW Gypsy19-LTR_Dya; Gypsy19-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-6721 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1113-1113 (2009). XX DR Genome; chrU; Positions 1334210 1327490. XX CC Positions [2270-2776] - Reverse transcriptase CC Positions [4121-4606] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 299..5014 FT /product="Gypsy19-I_Dya_1p" FT /translation="MKYSVILNLTMEELAKWLSEKRVPLTGEETIHQLRGM FT VKAMVVDQDNAIKEGGEPIGADGVAESVAGSAAVSVAESVAGSAAVSVAES FT VAGRESEQEERVSDELTEITKQMTLYTARKDLAELKAKVAALEGNSLDRKP FT VVEIRDLEANLRKFSGDDNMSVHAWIRDFGLAAVMYGLDSSQRWTLGSRML FT EGSARSYIMLEKPATWNELSESLRKTFGCQMTNHQVAKQLQSRSIRVNESL FT LQYFIAMRHIAEQGTFEDVDIVKYIVDGLQDHTGCAAPLYYCASLEELREK FT MIRYQMVQTENRRRTPVTPLRQVTVAKPAHAGGDVGSIRCFNCRAMGHFSS FT QCKKPKRPEGACFHCFETGHHYRRCPKRLQAAAPCNDDMLDEKEDDDDSTG FT RNPYIQLVSIQLLSKEGCIDIDNIFALLDTGSPVSFINRELIPQELKDDQL FT KKSRFRGLSGQNVLTFGKIKIVLLFQEEYYSIEAYVIPNNILPTPILLGRD FT VLRKLSLKLVKRKNVIKKNLNQLSLNKTKALSSNPICASLFTSTYYPNNNI FT TESKDRNPYVNAVATPVKPQIFQEWMQQYENCRISTGDDVVQVGSEFGRAS FT LQECREIIQNQYINKFSKSFSEPKYRMGIRLTDSTPFYCAPRRLSYREREI FT VSETVNDLLANRIIRPSNSPYASAVVLVKKKDGNVRMCIDYRGLNKKTARD FT NFPLPLIEDCLEYLEGKSVFSVIDLKSGFHHIGVEAESVKYTAFVTPDGQF FT EYLRMPFGLKNGPAVFQRFISDALRDFVKNRQIVVYMDDIILASYSVEEHK FT SILARLLDRLVEFNLEINLKKSSFLQKRIDYLGYDVSVQGILPNDTHLTAI FT KQYPEPINSKTLHSCLGLFSYFRKFVPNFSRVARPLTDLLRKGGPFPMSCS FT EKDAFLQLRTALSSPPVLSIFNPLYETELHTDASSHGFGAVLLQRQPDNKL FT HPVFFYSRKATAAESRYHSFELETLAIIYALRRFRIYLEHKPFLIVTDCSS FT LVQTLSKKAINPRIARWSLELESFNYSIAHRPGSNMAHVDALSRQTELLRE FT VEGISDPCPIPSRDIEPCKPYGSSEPSEPCKPCGYSEACRPCEPSEPCKPC FT ELSESRGTKRSFQPTTCDVPCNRKKCVKSVRFAPETENIIGIVGASPYDID FT LLLQTEQIRDSEIRRVREKLENGEVENFVLIDGIVYRSKDNEKMCMYVPSR FT MQEDLIRKFHEKLGHFATDKCVNKMREKYWFPGMRSKVDLFIKNCLPCILH FT SVPKSIHNRTLHSIPKVPVPFDTIHIDHLGPLPSINSKRKHLLVIIDAFTK FT FVKLYPVNSTSTKEVNCSLEKYFEYYSRPRRIISDRGTCFTSFEFQEFLES FT RNIDHIKVATAAPQANGQVKRVNRVLTPMLGKLSEPLNQADWYKLLNRVEF FT AINNSVQCSTGKTPSMLLFGCEQRGPVVDELTEYLEEKFSQEKPQCLDTVR FT AEASENIRQSQIRNETAYGLKHRAPALYKVDDFVAIRNVDASAGHCKKFAP FT KYRGPYRVNRVFPNDRYEITDIDNCQLTQLPYKGILEAARLKPWLQVCNKT FT IGLCL" XX SQ Sequence 6721 BP; 2132 A; 1233 C; 1510 G; 1836 T; 10 other; atgattacgt gtgtaaacat cccattttga accgcgcgcc ttagtgtaac caaattgtat 60 aatgttacga tatcgatata aagtgttatc gatctaaata gaattaaggt agattataag 120 aataaagaaa taaggtcact ttttgatttc tggagaaaaa cgaccgacgc gcaaataaaa 180 taacaaaaga aaaatcaaat aaaataaaag caatccggcg ttaaattatt ataaaataaa 240 acctggagta ttacaaatca ggtgtggggt tcgcccaaag taatatttcg gcttgagaat 300 gaagtattca gtcattctta atctgacaat ggaggagctt gcaaaatggc tttccgagaa 360 gcgagtgccg ttgaccggcg aggagaccat tcaccagctg cggggaatgg taaaggctat 420 ggtggtagac caggataatg ccataaaaga aggcggagag ccgatcggag cggatggcgt 480 ggcggagagc gtcgcgggaa gcgccgctgt gagcgtggcg gagagcgtcg cgggaagcgc 540 cgctgtgagc gtggcagaga gcgtcgcggg aagagagagt gagcaggagg agcgggtgtc 600 ggacgagttg accgaaatta cgaagcagat gacgttgtat acagcacgaa aggatctcgc 660 ggagttgaaa gcgaaggttg ccgccttgga aggcaattcg ctggaccgca agcccgtggt 720 tgagatcaga gaccttgagg caaatttacg aaaattttcg ggagacgaca acatgtctgt 780 gcacgcatgg attcgtgact tcgggctggc tgcagttatg tacggactgg acagctctca 840 acgttggaca ctgggaagtc gaatgctgga gggatcggct cgctcataca ttatgctcga 900 aaaaccggct acttggaatg agctaagtga aagtctgcga aaaacttttg gatgccaaat 960 gacgaatcac caggtggcaa agcagttgca gagccgttct attcgtgtga atgaatcgct 1020 gctgcaatat ttcattgcaa tgcgtcatat tgccgagcaa gggacctttg aggacgtgga 1080 catcgtaaaa tacatcgtag atggtctgca ggaccatacc ggatgtgcag ctccattata 1140 ttactgtgct tccctcgaag agctgcgtga gaagatgata cgataccaga tggtacagac 1200 ggagaacagg cgaaggacac cagttacacc attgaggcag gtgactgtag cgaaacctgc 1260 gcatgctgga ggagatgtcg ggagcataag gtgcttcaat tgccgagcga tgggacactt 1320 cagcagccag tgtaagaaac cgaagcgccc ggaaggcgct tgcttccact gtttcgagac 1380 tgggcatcat tatcgacggt gcccgaagcg tttgcaagcg gctgcccctt gtaatgacga 1440 catgctagac gagaaggaag acgacgacga ttcaactggc agaaatcctt atattcaatt 1500 agtaagcata caattactta gtaaagaggg gtgcattgat attgataaca tttttgcgtt 1560 actagatacc ggaagcccag tgagctttat caacagggaa cttattccgc aagaattgaa 1620 ggatgatcag ttgaaaaaaa gtagatttcg cggcctgagc ggccaaaacg ttttgacatt 1680 tggaaaaatt aagattgttc ttttgtttca agaagagtat tacagcattg aagcctatgt 1740 aattcctaat aatatattac caacgcctat attactgggc agagatgtac ttagaaagtt 1800 atcacttaag ttagtcaaac gcaaaaatgt tattaaaaag aatctcaatc aattatcatt 1860 aaacaaaacc aaagcattat cctcaaaccc aatctgtgca tccctattta ctagtacata 1920 ttacccaaac aataatataa ccgaaagtaa agataggaat ccatatgtaa acgcagttgc 1980 aactccggtg aagcctcaga tctttcaaga atggatgcaa cagtacgaga attgtagaat 2040 atcaactggg gatgacgtag ttcaagtagg cagtgagttt ggtcgcgcct cattgcaaga 2100 gtgtcgtgaa ataatacaaa accaatatat taataagttt tccaaaagtt ttagtgagcc 2160 aaagtataga atgggcattc gtctcactga ttctacacca ttttattgtg cccctagaag 2220 gttgtcatat cgtgagagag aaatagtttc cgagaccgtt aatgatctgt tagccaacag 2280 aattatccga cctagtaatt cgccgtatgc ctcagctgtg gttttagtta agaaaaaaga 2340 tggaaacgta aggatgtgta ttgattaccg cgggttaaat aaaaagaccg ctagagataa 2400 tttcccgtta ccccttatcg aagattgctt ggaatacctt gagggtaaat ccgtgttttc 2460 agtcatagat cttaaaagtg gctttcatca cataggtgtt gaagctgaat ccgtgaaata 2520 taccgcattc gtaactccgg atggtcagtt cgaatacttg agaatgcctt ttgggcttaa 2580 gaatggtcct gccgttttcc aaagatttat ttccgatgcc ctgagagact ttgtaaaaaa 2640 ccgtcagatc gttgtttata tggacgatat tattttagca tcatattccg tagaagagca 2700 taaaagcata ttagccagat tattagacag gctcgttgag tttaaccttg aaataaattt 2760 gaaaaaaagt agttttctgc agaaacgaat agattaccta ggttatgatg ttagtgttca 2820 aggaattttg cctaacgata cacatttaac tgctataaaa caatacccgg aaccgattaa 2880 tagcaagacc ttgcattcct gtttaggttt attctcttat tttagaaagt ttgttccaaa 2940 tttttcgcgt gtagcaaggc ctttaactga cctacttaga aaaggggggc cattccccat 3000 gtcgtgttcc gaaaaggatg cattcctgca gttaaggaca gcgttatcga gtcctcccgt 3060 cctgtcaatc tttaatcctc tttatgaaac cgaattgcat acggatgcga gttcccatgg 3120 atttggggct gttttattgc agcgacagcc ggataataaa cttcaccctg ttttctttta 3180 ttcaagaaag gccactgcag ccgaatctcg ttatcatagt tttgaacttg agaccttggc 3240 aatcatatat gccctgcgcc gttttcgaat atatctggaa cataaaccct ttttgatagt 3300 aaccgactgt agttccttag ttcaaacctt gagtaaaaag gctataaatc cccgtatcgc 3360 ccgttggtcc cttgagttgg aaagttttaa ttattctata gcccataggc cggggtcaaa 3420 catggcccat gtagatgcgc ttagtaggca aacagagttg ctaagagagg tggagggaat 3480 ttccgatccg tgtccgattc cgtcaagaga tattgaacct tgtaaaccgt atggatcttc 3540 tgaacctagt gaaccttgta aaccgtgtgg atattccgaa gcttgtagac cgtgtgaacc 3600 tagtgaacct tgtaaaccgt gtgaactttc tgaatcccgt ggaactaaga gatcttttca 3660 acctacgaca tgtgatgtac cttgtaatcg taagaaatgt gttaaatccg ttaggtttgc 3720 cccggaaact gagaatatta ttggaattgt aggtgcgtcc ccttatgata ttgatttact 3780 gttgcagacc gaacagataa gggattccga aatacgtcga gtgcgagaga agttagagaa 3840 tggtgaagtt gaaaattttg tgttgataga tggaatagtg tatagatcga aggataatga 3900 gaaaatgtgc atgtatgttc cgagccgaat gcaggaagat ttaattagga aatttcatga 3960 aaaattggga cattttgcga cagataagtg tgttaataaa atgagagaaa aatattggtt 4020 tcccggtatg agaagtaagg tcgatttgtt tataaagaat tgtctaccgt gtattctaca 4080 ttcagtcccg aagagcattc ataatagaac gcttcatagt atacctaaag ttccggtccc 4140 gtttgatacg atacatattg accacctagg tccccttccg tctattaact cgaagagaaa 4200 gcatttgttg gtgattattg atgcgttcac gaagtttgtt aaattatatc ctgtcaactc 4260 caccagtact aaggaggtaa attgttcatt agaaaaatat ttcgaatact atagtcggcc 4320 ccgtagaatc atttccgata gaggcacgtg tttcacttcc tttgagtttc aagagttcct 4380 cgagtccaga aatattgacc atattaaggt agctacggcg gcccctcagg caaatgggca 4440 agttaagcgt gtgaaccgtg ttctaacacc aatgttggga aaactatcag agcccctaaa 4500 tcaagccgat tggtataagc tgcttaaccg tgttgaattc gctataaaca attccgtgca 4560 gtgtagtacc ggaaagaccc ctagtatgtt gttatttgga tgtgagcaac gtggtccagt 4620 agtagatgag cttaccgagt atttggaaga gaaatttagt caagagaaac cgcaatgtct 4680 agacaccgta agggccgaag caagtgagaa tattcgtcaa tcacagatcc gaaatgaaac 4740 agcatacggc ttaaagcata gggcaccggc gttgtacaag gtagatgatt ttgttgctat 4800 tcgtaatgtt gatgcgagtg caggacattg taagaagttt gcacccaagt atcgtggccc 4860 atatagagta aaccgcgttt ttcctaatga ccgttatgaa ataactgata tagacaattg 4920 tcagctgacg caactcccat ataaggggat ccttgaggct gctaggctta aaccatggtt 4980 gcaagtgtgt aataaaacta taggattatg tttatgatcg aggtcgatca aatgtcaggt 5040 tgcccgaatg taaacatccc attttgaacc gcgcgcctta gtgtaaccaa attgtataat 5100 gttacgatat cgatataaag tgttatcgat ctaaatagaa ttaaggtaga ttataagaat 5160 aaagaaataa tgtcacgcgc aaataaaata acaaaagaaa aatcaaataa aataaaacca 5220 atccggcgtt aaattattat aaaataaaac ctggagtatt acacgtgcaa taaattaggg 5280 agttccatag tcttcttgta agagcagcca gcctgtataa tttttggagg atggggcaga 5340 ggattatgat tatgactact gttggcttac tagagcaatg aaagagactt gatgaggatc 5400 aaagaattga tggaatcctt tagtgacata gtgacatctc tgacatgtat aacttagact 5460 ttaagttgaa cctcgaaaaa aggaagtacg tacagtacac cgatgtttga aaaaccgtgg 5520 gaaaccatgg cgaagcacat ccgttaaatt catttctcag aacgtaagag ttatctacgt 5580 atccgtatga ccaaccaaac tgaaccagga tgccagaact gcttacacgg ttacatgttt 5640 ggctttgtgt gtgactaata agtcactcat gcatgtcccg attctagtga cgacgctaac 5700 cccacggtat ttttttccac ggtgttcgcc aggcaagacc aggaaacccg acggactggg 5760 acctcgtcat ttccgtagat gtatttaagc ccaagcaaat atagcttcac aagaatctaa 5820 atatgcatta gtccgggacc acattatcaa caattcaaaa gaaggagatg acatacagga 5880 cacccgaaag atacaaaagt tttatgataa cagagcagaa cattgacagc tcaaagagaa 5940 aatgctggac agagtggaaa gcgtacgcag cgtactagta gatacaggtg ctagcgctat 6000 ttctattggc aaaggagcaa aataattatt aaaacgaacc cagtcaataa cacagtactc 6060 acatacctaa ctcgcaattc aacgtgccga aacggtttca aaagagtaaa ttgcgcaatg 6120 tttagcgttg tttatgttgt atgcgaaatg agtgaaattc aatccaaagc ggaatggata 6180 gaatgagtaa atgcagagta aatgtttaga gaactcggat aaaattttgg aaaattagtc 6240 gttcatggct tgcataggcg atgatgagaa tacagagctc gaacgaaagc gcaagctctt 6300 agtaaattgc gtacatatgt acaacataat atatgtaaat acataatata tctgtaaata 6360 gaataactca ttaataatat agcattttca gatgctagaa atttatttaa tttaacaatc 6420 taaactataa ataaattatg tttctgcatt cagtaattta ttaatccatg ccttttattt 6480 ttttatgtct tgtattaaaa agcaagacaa aaaagaaaaa acaaatacgc aataaatttg 6540 acgtagccaa tgcggtgcac agttaaccgt ctcggttatt ctgttattat ttgtgcattg 6600 cttcgccgtt cgaatatagt aaatttttta ctcatgcctt taccacattt gtcaaattcg 6660 ttttactcta tgtctcactc agtcagtcag aactttgagt acgagnnnnn nnnnnattag 6720 t 6721 // ID BEL-109_AA-I repbase; DNA; INV; 5448 BP. XX AC supercont1.257; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-109_AA_; KW BEL-109_AA-LTR; BEL-109_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5448 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.257; Positions 450600 456047. XX CC 'TACTA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1175..4771 FT /product="BEL-109_AA-I_2p" FT /translation="MSFKPFPKPQPRGALTYVATNAPFCDVCSNQQHNTYQ FT CGKLLHMNPEARFALVNRIGLCVNCLKRHPEENCRSGMCRKCNLPHHTLLH FT SAISPASIQAFDASVPASNSPGQPFQSLISALDSPANLDASNVLLATAAIN FT VIDSRNRPRTCRAVLDSASQMSFITERCCNELGIRTRSACMDLEGISSVPT FT RAHKCAEIVIASHCSDFRAVVSCIVLEKITNTLPCKPAHIGNWIIPESINL FT ADPLFYRPGNIDVLLGSEIFFQLLQPGQLSLGANGTAPILQNTKLGWVVAG FT RYDNVNPIPNSPVSTCLLISIDDELSKQLRRFWELEEYAALSKHLSEEELL FT CEQHFCQYTNRDESGKFVVRLPFSQHPRNLGNSLQIAERRLYHIERKLERH FT PRLKAEYHAFMREYLDLGHMSAVDNVTTHGLSVYLPHHCVVKETSSTTKCR FT VVFDASAKTTSGLSLNDVFMCGPTIQDSLINILIRFRMPPIVLVGDVKQMY FT RMIWVHEDDRDCLKILWRWNKEEAIKEYRLNTVTFGTKCASFLATRCVQQL FT MESHRVQYPVAVEKVEKGIYVDDVLTGAETEEEARTLREQLTEMFKDGGFH FT LRKWASNCPAVLEGVPEADLEVKINIEESESNVVKALGMQWQPCSEEFHFS FT YQPNQILQPTKHMILSQIASLFDPLGLLAPVIVKAKLVMQRMWELKVAWDA FT NPPGELTNEWLNFWQSFSLLNSFQIPRHVINVKNWSRLYMHGYCDASNVAM FT GACVYIRAVNDKGDTSSHLLCAKSKLAPIGNGKTTIPRLELCAAVILARLI FT TNVRSALSTTSFYEVRAFSDSKVVLAWLAAGASRWKTFVANRITEICSHLP FT AINWFHVPTQDNPADLISRGAFPEHIKTNVFWWFGPEWDPSTSETSALVCA FT FNTNEQRQIEREHRTTAVALAVVHENRFLDGLMSRYYPKLKLLLRITARLL FT RVGHPEFRSTHRLSPDEINYAQRIFVRHTQQQHFCKDISRLQRGLPVERSS FT SLHQLHPFLDEYGLVRVGGRLQQSDLSYDNKHPILLPTHSIFTSFVLLDEH FT HEKLHCGPQLLLAASRKQYWIIRGCSAARKVYRDCVTCSRIKPVPLTQQMG FT QLPADRLKPLPPFTITGVDYAGPVSIIGRKCRAAVPSKGYIALFVCLGTRT FT AVDMGYLAKCTPTMLQIFERQQRSYANFISKLAPPSTVMK" XX SQ Sequence 5448 BP; 1490 A; 1267 C; 1273 G; 1418 T; 0 other; tatggtcctt cgagccggat cgcgaaaata tcgtcgcgag ccgaaatatc taatatatat 60 catcagtgcc cacgaattga caccgtcgca gtcgtgctcc tgtcaatcgg cgaaggtgat 120 atttataaga agtgtgtgat ccagagaaaa gacttcgaac aataagtgcg cgaaatccat 180 aacaatggat aaacttatcc gcgaacgaaa gtcactagag ccgcgtttga aacgaatcgc 240 cgagactgcg gtcaaaatca agcctttaga agctgaagaa gtggatgtcc aaattgagtt 300 ggatgcattg cgtgaagttt gggcttctta tgcaacgttg tacaaacgga ttatcagtgc 360 atgtgaaaat gatgttgagt acgatgaggc cgttagccat caagcaaaat tctaagagca 420 ctacagtacc gcgaaaaatc ggcttttgaa aactttgaaa gccatcaaag ggcgggaaga 480 cagtacggtt gttcagcagg cgtctcatga cgtcatcaag caactcgctg atcaacaggc 540 agaattttta cgaataatgt ctgcaaacat gacctcgtca gcaaacaata gcgctaccgc 600 atctccgcta tcggacctga agctacctcg aatgaatttg ccaatcttta gtggtaacta 660 cctcgagtgg cagtctttct tcgacttgtt tgagagtttg gtggatgcaa atccttcatt 720 gaaggacagt caaaaactct attttttgaa gacaaatctc gatggtgaag ctgcgtcgtt 780 aatctcccat ctcaaaatcg aagacgctaa ttatcaaacc gctttggaca aactcaaatc 840 aagatacgac aagccgagag aaatcgctaa caaacacatt cagcgatttt taactcagca 900 aacgttgacg tcagcatcag caaatggtct acgatcgtta cacgacgtat cggatgaagt 960 tatccgagcg ctccaagcca tgaatcgaga ggatcgtgat acgtggctac tattcatcct 1020 ctcagaaaag gtggatcctg ataccaaaca attatggtgc cagaagattt cagaaatgta 1080 ggattctgac atcaccttgc aatgtttcct caaatttatc gaatcgagaa gcttcgcgct 1140 tcaatcatct caacctgcta gaccaagaac cggtatgtcg ttcaagccgt ttcccaaacc 1200 tcagcctaga ggggcattga cttacgtcgc cacgaacgct ccgttttgcg acgtttgttc 1260 aaatcaacaa cataacacct atcagtgtgg aaagctcctt cacatgaacc ctgaagctcg 1320 atttgccctg gtcaacagga taggtttgtg tgtaaattgt ctcaaaaggc atccggaaga 1380 aaattgccga tcaggtatgt gtagaaaatg taacttaccg caccacacgt tactacattc 1440 tgccatatca ccagcgagca tacaagcctt cgatgcgtct gtgccagctt ctaatagtcc 1500 aggacagcca tttcagtcct tgatttcggc gctggactct cctgccaacc tcgatgcgtc 1560 caatgttctc ctcgctacgg ctgctatcaa cgttatcgac agtcgcaacc gaccacgcac 1620 ttgtcgtgcc gttttagaca gtgcctcaca aatgagtttc attacagaga gatgttgtaa 1680 cgaactcggc atccgaacgc gatcggcatg tatggatctc gaaggcatat catctgtacc 1740 aactcgagca cacaagtgcg ccgaaatcgt tattgcttcc cattgttcag actttcgagc 1800 ggtggtatct tgtatagtat tagagaaaat tactaacacg cttccctgca agccagcaca 1860 tattggcaat tggataattc ccgaatccat caatcttgct gatccgctct tctaccgccc 1920 tggaaacatc gatgttctac taggtagtga aattttcttt caactactac aaccgggtca 1980 actctctctc ggtgccaacg gtaccgcacc aatacttcaa aacaccaagc ttggttgggt 2040 tgttgcgggt cgatatgata atgtgaatcc aataccgaat tcccctgtgt caacatgtct 2100 gttgatatct attgatgatg aactatccaa gcaattacgt agattttggg aactggaaga 2160 gtatgcagcg ctgtccaaac atctctcgga agaagagctg ctctgtgagc agcatttttg 2220 tcagtatacc aatcgtgacg aatcaggtaa attcgtggtt aggctccctt tctcacagca 2280 tccgagaaac ctcgggaact cactacaaat tgctgagcgg agattgtacc atattgagcg 2340 aaaactcgaa cgacatccac ggttgaaagc tgagtaccac gcgttcatgc gtgaatacct 2400 cgatctcgga cacatgtctg ccgtagataa tgtcacaaca cacggccttt cggtatacct 2460 accacaccac tgcgtggtta aagaaacaag ctccaccact aagtgccgtg ttgtcttcga 2520 tgcttcggcg aaaacaacta gcgggctatc attgaatgac gttttcatgt gtggaccaac 2580 tattcaggat tcgttgatca acattttaat acgattccgc atgccaccga tcgtccttgt 2640 aggcgacgtc aaacagatgt atcgtatgat ttgggttcac gaagatgatc gtgactgcct 2700 gaagatttta tggcgatgga ataaggagga ggctataaag gaataccgat tgaacaccgt 2760 cacattcggc acgaaatgcg catccttttt ggctaccagg tgtgtgcagc aattgatgga 2820 gtctcacaga gtgcaatatc cagtggccgt cgaaaaggtg gaaaaaggaa tctacgtgga 2880 cgatgttttg accggtgctg agactgaaga ggaagcaagg actttgaggg aacagttgac 2940 agagatgttc aaggacggtg gatttcattt gaggaagtgg gcttccaact gcccagcggt 3000 tttggaagga gttcctgagg cagatttgga agtgaaaatt aatattgaag aaagtgaaag 3060 caacgtggtg aaggctctcg gaatgcaatg gcagccgtgc agtgaagagt tccacttttc 3120 gtaccagcca aaccagatcc ttcagcccac aaagcacatg atcctgtcac aaattgccag 3180 cctttttgat ccactaggct tgctcgctcc ggtcatcgtg aaggcaaagt tagtgatgca 3240 gcgcatgtgg gagctgaagg tggcctggga tgcaaatcct cctggtgagt taaccaatga 3300 gtggttgaat ttttggcaaa gtttctcgct tttgaattct ttccagatcc ctcgccatgt 3360 gatcaacgtg aagaactggt ctcgtctcta catgcacggc tactgcgatg cttccaatgt 3420 ggcgatgggc gcgtgtgtgt acattcgcgc cgtcaacgac aaaggcgata cctcatcaca 3480 cctgctgtgt gccaaatcga aattggcccc cattggcaac ggtaaaacaa ctataccccg 3540 attggaatta tgcgctgcgg tcattctggc acgattgata acaaatgttc ggtcggctct 3600 ctccacgaca agtttctacg aggtgcgggc tttctctgac tccaaagttg ttctggcttg 3660 gcttgctgct ggtgcttcaa ggtggaaaac atttgtcgca aatcgcatta ccgagatatg 3720 ttcacacttg cccgctataa actggtttca cgttcccact caggataacc cggccgattt 3780 gatctcgcga ggagcatttc cagaacatat aaaaacaaat gttttttggt ggtttggacc 3840 agaatgggat ccgagcacaa gcgaaacatc tgctttagta tgcgctttca acaccaatga 3900 acaacggcaa attgaaagag agcaccgtac gacggccgta gctttggcag ttgtgcatga 3960 aaatcgcttc ttggatggtc tgatgtcgcg atactatccc aaactcaaac tactattgcg 4020 gatcacagca cgattgcttc gtgttggtca tcctgagttt cgtagcactc accgcctctc 4080 tccagatgaa atcaactacg ctcagcgaat cttcgtgcga catacacaac aacaacattt 4140 ctgcaaagat atcagccgtc tacagcgtgg tctccctgtc gaacgaagca gttcacttca 4200 ccagctccat ccatttttgg atgaatacgg tctcgtcagg gtgggcggtc gactacagca 4260 gtcagatttg agttatgata acaagcatcc gattttgtta cccacacact caattttcac 4320 gtcctttgtt ctgttggatg agcatcacga aaagctacat tgtggacctc agttgttgct 4380 tgcggcttca agaaagcagt attggattat acgcggttgc agtgcggcac gtaaggtgta 4440 tcgtgattgt gttacgtgca gcagaatcaa accagtgcca ctaacccaac agatgggtca 4500 actaccagct gatcgtttga agcctcttcc accatttacg attactggcg tagattatgc 4560 cggtccagtg agcatcattg gtcgaaaatg tcgggctgcg gttccatcga agggttatat 4620 cgctctgttt gtgtgcctag gaactcgaac tgctgtagat atggggtacc tagcaaaatg 4680 tactccgaca atgcttcaaa ttttcgagcg tcagcaaaga tcttacgcga actttatcag 4740 caaattagca ccaccgagca cagtaatgaa gtaagcgatt atttggccga taaaggagtg 4800 gagtggttgt ttattccggc tcgatcaccg catcaaggag gattgtggga ggcagctatt 4860 aaggttgcta aaagattgtt aagtcggatt ggagataatt acaagtacac tttcgaggaa 4920 ctaagtacga ttttatcgca agttgctgcg tgcatgaaca cacgtccgat ttcagcaatc 4980 tctattgacc caacggatcc gcaacccttg actcctgccc attttttaat tggacgtcct 5040 cttgatgctt tgccagaaat caaccatctt gatctgcatg ttggctcact gtccaggtgg 5100 gcttacgtgc aacgtgtttc acaggacttc agatcgagat ggcagaccga atatgtggga 5160 ggacttcaaa ggtccttgaa atggaatagt gtatctccga atctcaaaca aggcgacttc 5220 gtgcttcttg tagatgacag ccagaaatgc caccaatggc caatccgacg tatattggaa 5280 ctgttccctg gatcggatgg actagtaaga gtggtttcgg ttaaaacatc gaagggaata 5340 tttcgccggg acatcaggaa acttcgtcga tgtcccttgg acagcgatga gtacgttgca 5400 ggcaggaatg ggactgaaat tactgcccgt aatttggtgg gcggatta 5448 // ID Dparam1cons repbase; DNA; INV; 489 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mellifera DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dparam1cons. XX OS Drosophila paramediostriata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup III. XX RN [1] RP 1-489 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones with show less than eight percent divergence. CC Dparam1cons. XX SQ Sequence 489 BP; 121 A; 122 C; 135 G; 111 T; 0 other; tgggttccgc atgagttgac ggaaaaaaac atttttgccc gtatggatgc atgcgaatcg 60 cttctgaatc gcaacaaaat cgacccgttt ttgaagcgga tggtgactgg cgatgaaaag 120 tggatcactt acgacaacgt gaagcgcaaa cggtcgtggt cgaaaagggg tgaagctgcc 180 cagacggtgg ccaagcctgg attggcggcc aggaaggttc ttctgtgtgt ttggtgggat 240 tggcagggaa tcatccacta tgagctgctc ccctatggcc agacccttaa ttcggacctg 300 tactgccaac aactggaccg cttgaatgca gaactcatgc agaagaggcc atctttgatc 360 aacagaggac gaattgtctt tcatcaggac aacgccaggc cacacacatc ttttgtgacg 420 cgccagaagc tccgggagct cggatgggag gttcttttgc atccaccgta ttcccccgac 480 ctagctcca 489 // ID DeuSINE repbase; DNA; INV; 347 BP. XX AC . XX DT 05-JUL-2006 (Rel. 11.06, Created) DT 02-AUG-2006 (Rel. 11.06, Last updated, Version 2) XX DE Deuterostomia SINE element - consensus. XX KW SINE3/5S; SINE; Non-LTR Retrotransposon; Transposable Element; KW conserved; DeuSINE; CNE. XX NM DeuSINE. XX OS Metazoa OC Eukaryota. XX RN [1] RP 1-347 RA Nishihara H., Smit A.F. and Okada N.; RT "Functional noncoding sequences derived from SINEs in the RT mammalian genome."; RL Genome Res 16(7), 864-874 (2006). XX DR [1] (Consensus) XX SQ Sequence 347 BP; 68 A; 88 C; 87 G; 98 T; 6 other; tatgaatcct aatgccccag catrgtgatt gggggcactg trctgttgra ggtgccgtct 60 tttcggatga gacgttaaac ggaggtcctg tctactctct gtggtcatta aagatcccat 120 ggcacttttc gtaaagagta ggggtgttaa ccccggtgtc ctggtggcca aattcccact 180 ctggccctta wccatcatgg cchcctaata atccccatct actaattggc tmcatcactc 240 tctcctctcc actaatagct gttgtgtggg tgtgcgttct ggcgcaaaat ggctgccgtc 300 gtttcatcca agaggtggct gcactgcact gtcagtggtg aggagaa 347 // ID Chap1a_Cis repbase; DNA; INV; 187 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from Ciona savignyi. XX KW hAT; DNA transposon; Transposable Element; Chap1a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-187 RA Smit A.F.; RT "Chap1a_Cis - hAT DNA transposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000014. Internal deletion product of Chap1b_Cis (95% CC similarity). XX SQ Sequence 187 BP; 46 A; 58 C; 43 G; 40 T; 0 other; caggcatggg caacatacgg cccgcgggcc gggtccggcc cgtgaaggga ttttgaccgg 60 cccgcagact gtatctaacc ataccctgta attaagatat tgaatactat tcatgtaata 120 atccggccca ccaagaactt cattttccca catctggccc gccgactaga gaagttgccc 180 acccctg 187 // ID CR1-1_CQ repbase; DNA; INV; 4887 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4887 RA Kojima K.K. and Jurka J.; RT "CR1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 1-1 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >98% CC identity. XX FH Key Location/Qualifiers FT CDS 158..1822 FT /product="CR1-1_CQ_1p" FT /translation="MQCYVPTCSSPFDQQYLWNCTGICNRKFHAACIGVKR FT GSEDDLQLHVLPICSLCRNNLHMEIDIRKIMQHHLEGSHMILTHIKTNDKT FT NQEALNELHNKIIGLQEEIKKSAEIASRILSAVLTQAPAMDFPLNEVRKAF FT EDTTSDQNQKLAESFCRIQEITSSHRDDIIKTIASAEKCPSDIILAVHDEV FT RALTSSINKLQDNEIEVRQKSLAEELNEQTVLEIESGWRLIGSKKIWKPDW FT TDFDARQRTRRLQEKEAEKARRRNRKHRQQIQHQQQQQSQQHHQQRRQQQH FT LQQQQSHQHLRRQQHQTNCNSDAGRNPSTPHFNLKDNQFDQDNASSSNFFT FT HSSTKLSDKELLDQARVEFSGQPPTTSNSNFINFRKGETINPYRKEKTPIV FT PPLNATTPQPAKTSSQTPEVTDSMFCLDPMKPPIVRLTEQSAVGDGRFLLA FT RLREIKVYDNLRLYLAYLKDQKPDVCIDGLTLTSMHVFFASNGLPTEPEHL FT MNIFMEYNSTIGISPKQTLTDLETYRKYVTTRRLQYLQHSREAANKFYLPT FT TSSNFYKH" FT CDS 1849..4824 FT /product="CR1-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MTPSQEAVSTINNLSMSPKISSTEQQNANEILIYCQN FT FNRMKSPGKMKEISLNILSHSFDIILGTETNWDESVHPEEIFGNNYFVFLG FT NRNLNLSQKKSGGGVLLAINARLNPKEIVTEKHFQFEQVWAKATIAGQVHI FT FASVYFPPDHANKQSYELFFKTVEIITSKMEPEVKLHIYGDFNQSKVEFIS FT DQENEAILLPVIGENETLHFLFDNIANYGLFQINHVKNQRNSFLDLLFTNC FT IEDFHVQESVTPLWKNEVFHTAIEYSIYVHKNTLPIDWEYEEVLEYNKTNF FT VEAKRKLLAIDWQNLFNNEGNVDELVGKFYTEINTITSETVPTRRRRRNNT FT GNKYPVWFTPQLRNLKNRKQKAYKLYRNNTNDTNLLNYLNISDHFFSALNS FT ANEEYNSKVESEVKSCPKNFFNYVKSKSKSSNFPSQMQLDENVGSNSKEIC FT NLFSKFFKEVYTSFSEEDRDRDYFSYIPEFPNDVSVNSLSETEVRQALKDL FT DSSKGPGPDGIAPAFLKNLAEELTYPLHHLFNMSINTGKFPQTWKKSFLVP FT IFKSGPKSDIRNYRGIALLSCIPKLFESIINEKIFQQVKNRITCKQNGFFK FT GRSTSTNLLEFVNFTLNAMHNRNFVEAIYTDFSKAFDRIDIPLLIFKLQKI FT GIQPNLLEWLKSYLTKREQIVRFQNVLSESIHVTSGVPQGSHLGPLLFILY FT VNDISFILKKINVLVYADDMKLYMEIGNANDSHVFQNEINLFYTWCSKSLL FT QLNVKKCNSIAFSRKHETPNITVLLGNQPVEKCKVVRDLGVILDSQLTFVE FT HYNTIINKAKSTLGFIKRFAFNFQDPYTIKLLYITYVRPLLEYCSIVWNPY FT YAVHQARIESVQKQFLLYALRKLNWTAFPLPSYEARCMLINIQSLQERRKF FT AMLSFINDIISQRIQSAALFSVIRNSIHEPSRTLRNSPLFRITAYTTNYLK FT NSPLNQMMRFYNENSQYIHFDMSKPELRKNLYNRNNI" XX SQ Sequence 4887 BP; 1748 A; 1108 C; 786 G; 1245 T; 0 other; gcagttgagt ctttttaccg gacggacgcg ttttttaatt cgctctacac tttattttta 60 tttaccttca ctgacccagt ccacaaacgt aaacgtaaac atatttattt ttttttgtcg 120 cgcgtcgcgt gtgcacttat taacaaacgt aagcaaaatg cagtgctatg tgccgacgtg 180 ctcttcaccc ttcgaccaac aatacctctg gaactgtacg ggcatttgta atcgaaaatt 240 tcacgcagcc tgtattgggg tcaagcgtgg ctccgaagac gacttgcaac tacatgtact 300 tcctatatgc agcctttgtc gaaataacct acatatggaa atcgacatta gaaaaataat 360 gcagcatcat ttagaaggca gtcatatgat cttgactcac atcaagacca atgacaagac 420 aaaccaagaa gcccttaatg aactgcataa taaaatcata ggcctgcagg aagaaattaa 480 gaaatctgca gaaatcgcaa gtcgaatact ttcggcagta ttaacacagg caccggcaat 540 ggacttcccc ttaaacgaag tcagaaaagc atttgaagac accacctcgg accagaatca 600 aaaactcgct gaatcattct gccgtatcca ggaaattacg tcatcccacc gggatgacat 660 aattaaaaca attgcctcag cagaaaaatg cccctcggac ataattttag cagtccacga 720 tgaagtgagg gccctcactt catccatcaa taaactgcaa gacaatgaaa ttgaagtccg 780 gcaaaaatca ctggcagaag aattaaacga acaaaccgta ctggaaatcg aatctggctg 840 gcgacttatc ggcagcaaaa aaatttggaa gccagactgg accgacttcg acgccaggca 900 gcggactcgc cgtcttcagg aaaaggaagc cgaaaaagca cgtcgccgca accgaaaaca 960 tcgtcaacaa attcagcacc agcaacaaca acaatcacaa caacatcatc agcaacgccg 1020 gcaacagcaa cacctgcagc aacaacaatc acatcaacat cttcgtcggc aacagcacca 1080 gaccaactgc aactctgacg caggccgaaa cccaagtacg ccacatttta acttaaagga 1140 caatcaattc gaccaggaca acgcaagtag cagcaacttt ttcactcatt cgtcaaccaa 1200 attgtccgac aaggaacttc tagaccaggc aagggtcgaa ttttctggcc aacctccaac 1260 tacatcgaat tcaaacttca tcaacttccg aaaaggggaa accatcaacc cctaccggaa 1320 ggaaaaaacg ccaatcgtcc cacctctaaa cgccaccacg ccgcagccag caaaaacgtc 1380 atcacaaact cctgaagtaa cagacagtat gttctgtctg gacccaatga agccgcccat 1440 agtacgccta actgaacagt ctgcagtcgg cgatggccgt ttcctgctgg ccagactccg 1500 cgaaattaaa gtctatgaca atctcagact atacttggcg tatctgaagg accaaaaacc 1560 ggacgtctgc atagacggac taacactaac cagcatgcat gtcttttttg catccaatgg 1620 cctgcctact gaacctgaac atcttatgaa catcttcatg gaatacaact caacaattgg 1680 aatttcacct aagcaaaccc tcaccgacct ggaaacctac aggaaatatg taacaactag 1740 aagacttcaa tacctgcagc attcgcgcga agccgccaac aaattttacc tgccgactac 1800 atcgtcgaat ttttacaagc attgacgcct tctaacacgg aagaagaaat gacaccctcg 1860 caagaggcag taagtactat taataattta agtatgtctc caaaaatttc ttcaacggaa 1920 caacagaatg cgaatgaaat tctaatttat tgtcagaatt tcaatcgcat gaaaagccca 1980 ggcaaaatga aggaaatttc tttaaacatt ctctcacatt ctttcgacat tattcttgga 2040 actgaaacta actgggacga aagcgtccac cctgaagaaa tttttggaaa caattatttt 2100 gtattcctag gcaatagaaa tttaaatctc agtcagaaaa agtcaggagg cggcgtcctc 2160 ttagccataa atgccagact taatccaaaa gaaattgtaa ctgaaaaaca ttttcaattt 2220 gaacaagtct gggccaaagc cactattgca ggccaagtac acatttttgc atcagtatac 2280 tttccgccgg accatgcaaa taaacagtcg tatgaactat tctttaaaac agtagaaatc 2340 ataacatcaa aaatggaacc ggaagtaaaa cttcacattt atggcgactt taatcaaagc 2400 aaagtcgaat ttatatctga ccaagaaaat gaagcaattc ttctcccagt cattggggaa 2460 aatgaaactt tgcattttct ctttgacaat attgcaaatt atggactttt ccaaatcaat 2520 catgtaaaaa accaaagaaa ttcattttta gaccttttat tcactaactg tattgaagac 2580 tttcatgtac aagaatctgt aacccccctt tggaagaatg aagtcttcca tacagcaatt 2640 gaatattcta tttatgtcca taaaaacact ttgcccattg actgggaata tgaagaagtc 2700 ctggaataca ataaaacaaa ctttgtagaa gccaaacgta aactacttgc aattgactgg 2760 caaaatttat ttaataatga aggaaatgtc gacgaattag taggaaaatt ctatacggaa 2820 attaacacta taacatctga aaccgtacct actagaagaa gacgacgtaa taacacaggc 2880 aataaatacc cagtgtggtt cactccacaa ctaagaaatt taaaaaatag aaaacaaaaa 2940 gcctacaaac tatacagaaa caatacaaat gacacaaatc ttctgaatta tctgaatatt 3000 tccgaccatt ttttttcggc actcaattct gccaacgaag aatataatag taaagtcgaa 3060 tctgaagtca aatcatgccc gaaaaatttc tttaattacg taaaatccaa atccaaaagt 3120 agtaacttcc catcgcaaat gcaactggac gaaaatgtag gcagtaactc aaaagaaatt 3180 tgcaatcttt tttcaaaatt tttcaaagaa gtatacacct cattttccga agaagaccgc 3240 gaccgcgact acttttcata tataccggaa tttccaaatg acgtctcagt caattctttg 3300 tcagaaacgg aagtacgcca ggcattgaag gacttagact catcaaaagg accaggaccc 3360 gacggaatag cacctgcatt cctaaagaac cttgcagaag aattgacata tccactgcat 3420 catcttttca acatgtcaat aaatacagga aaattcccac aaacatggaa aaagtctttt 3480 ttggtgccta ttttcaagtc aggcccaaaa tcagacatac gtaattatcg cggaattgcc 3540 cttttgtctt gcattccaaa acttttcgaa tctattataa atgaaaaaat ctttcagcaa 3600 gtaaaaaacc gcatcacatg taaacagaac ggctttttta aaggccgctc tactagcacc 3660 aaccttttag aatttgtaaa ttttacactg aatgcaatgc ataatcgcaa tttcgtagaa 3720 gcaatttaca cagactttag taaggcattt gacagaatcg acataccatt attaatcttc 3780 aaactgcaga aaattggaat tcaaccgaat cttttggaat ggcttaagtc atatttgact 3840 aagcgcgaac aaattgttcg cttccaaaat gtactatcgg aatcaattca cgtcacctct 3900 ggggttccgc aaggatccca tctaggacct cttcttttca tcttgtatgt aaacgacatt 3960 tccttcattc ttaaaaaaat taacgtactt gtatatgcag acgacatgaa attgtatatg 4020 gaaataggaa atgccaatga cagtcatgta ttccaaaacg aaattaatct tttctacaca 4080 tggtgtagta aaagcctact ccaattgaat gtaaagaaat gtaattccat tgccttcagc 4140 agaaaacatg aaacaccaaa cataacagta ttattaggaa accaaccagt agaaaaatgc 4200 aaagtagtac gtgatctagg tgtcatccta gactcacaac taacttttgt agaacactat 4260 aacacaataa taaacaaggc aaaaagtaca ttaggcttta taaagcgctt tgcattcaac 4320 ttccaggacc cgtatactat taaattactc tatataacgt atgtcaggcc actcttggaa 4380 tactgtagta tcgtctggaa tccatactat gccgtacacc aagcacgtat tgaatctgtc 4440 caaaaacaat tcttactgta cgcactacgt aaacttaact ggactgcatt tcctctccca 4500 tcgtatgaag cacgctgcat gctcataaac atacaatcat tacaagaacg tcgtaaattt 4560 gccatgctct ctttcatcaa cgacattatt tctcaacgca tacagtcagc agcattattt 4620 tcagtaatac gcaatagtat tcatgaacca agccgtactc ttagaaattc accacttttt 4680 agaataactg catacacgac aaattattta aaaaattcgc cattaaatca aatgatgcgc 4740 ttttataatg aaaattcaca gtacatacat ttcgacatgt ctaaaccgga actacgaaaa 4800 aatctgtaca atagaaataa tatctagtat gtaagaaaat tgtaagtagt ctacataagc 4860 ttgacgaata aacaataaac taataaa 4887 // ID Penelope-12_HM repbase; DNA; INV; 7021 BP. XX AC . XX DT 03-FEB-2009 (Rel. 14.02, Created) DT 03-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Penelope-like element. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-7021 RA Bao W. and Jurka J.; RT "Penelope-like elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 450-450 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1678..3555 FT /product="Penelope-12_HM_1p" FT /translation="MKDLFDMHNIRGSFHYKIRVEKTRATAEMINRMITNK FT SDYDLTEAERQVLSRGLGFIQNDQYVPEKTFKDAISKLKRNMLLRVHFKDN FT GGLPRQTPMKRKLDTVWNPPASNNANLNLFFERIKSETDTVLNERMNIAHP FT NLTKNMKEAILTLKNNNNIIIKQADKGGSICILNRRDYEEKINTMLADDTT FT YKLLLYDPTIEMARNVVSTIDYILFKGVIDEKTANYIRPNNPCRPPLFYGL FT PKIHKTGIPLRPIVSACDGPTDHLSYFVTKFIQPLAETLPSYFRDSTQFLH FT MLQNQPLPPQKYIFVTADVTSLYTNIPHKDGIEAIKLLFNKIPHDLRPPNS FT PPVAYIQIMIDTILTNTTFQFDDRNYLQMTGTSMGTRMAPPYANIFMGTLD FT EAVTGQFKDSITFYKRFIDDIFFIFTGTIEDLEHVITFMNNLHPTIKFTFN FT HSDISIDFMDLTLYKNHLDKIQSTIHRKPTDTMSLLHFSSYHPHHITSGLI FT HSQAIRYNRLISDSHHLYKELRTLARQLVLKGYHLDIINRNITKALLHTQK FT DLIHKNKEADDTHIFPIVTPYNEQGIRINNAILHNWHLIHNDPELCTLFPP FT VPTLTHTNTLSLRDRLIRTKYTANHNN*" XX SQ Sequence 7021 BP; 2138 A; 1480 C; 1079 G; 2318 T; 6 other; gaagtgagtg tgtttagctt tctattcacc cgatttgaaa gtgaacggaa acataacgag 60 accatactag acgacaaacg gtacgtacta atgcactaca ccatcacacc tgtagacatc 120 aacggtcttc aaaatcagta tatcaaatat ttgttctttg ttacatatga tctacataca 180 gagtgattag agtgttagtt tgttctctcc ctgcctccga gcatttactt cgttctttac 240 cttcacaaaa gtcttatcac tcattttata ttytcatatt atttttattt tatattcaat 300 gacaccatct tctaactgtt tttattatca catctctcag gtctacaaaa ctggacggct 360 ccgttggtca ctttttgcga tcattgtgtc gcataaacaa aaatttattc tttgtggctg 420 gtgagccccg actttttcca atgtacgtca caacgacaaa acggacatta ggctatcccg 480 ctggatgtgg ttacggcgct caataaccta atgactacgc tgacactcac gcacacgttc 540 agtcggaagc tccacaatgc acgtcggatc agcgctcgct gactatgtgc tcggatctga 600 aatctcactc catagcacat cattccacac cagcacgggg cagtagctct tttggctaag 660 agtgatggct tgtggcctac ggtgactaat gtgtgcattc gatcaattga aatccattct 720 gtaggtctca cgtctggctc tgatgtgcta gcccataagc gcaaggagtg agtgtgttta 780 gctttctatt cacccgattt taatgtgaac gaagacataa cgagaccata ctagacgacc 840 aaacggtaca tactaacgca ctacaccatc acacctgtag acatcaacgg tcttcaaaat 900 cagtatatca aatatttgtt ctttttttga ttattttctt ttgaatgacc caagagagag 960 tgttacatgg gtgagcgatt atccatttag atttgaccat acgactagac atcaccagcc 1020 gattcaccat gctacatctg tagtcttcac catgcttaca tctgtagtct caatctccat 1080 tttttagtgg catacctcac cacctattat actttttatt tactttttta gttttttaat 1140 ttagatttta aattaatatt atattttttt tttctaataa tatttttttt tttatacttg 1200 ttttatttta gtttatatat gttattttgt ccgtaatata caagttgtta tagtttagtg 1260 gattgtgcgc aacctgcggt gtctaaggtt ccgggttcga tgcgaggata tgtctgatgt 1320 atacatttaa tttattacac aatttttaat tcttgaatgc atcgattcat tttattacaa 1380 ctatacattt tataacacac atttcataac atatatattt aatgacgtaa cacagattac 1440 ttttatacac acattattat gggttataat aaagcttgtt atttttaaat tcctaagtag 1500 cgggctgtct ctctactcac ctaattacgg gtgtacatat acctatagac acactgaact 1560 gatgaaccac ccatagctta ccacctgcaa cgctaaatga ctttctccat aactttctag 1620 aacaacggat caatcttcac accaactgat gaaccaagga cattcactcc gattacaatg 1680 aaagatttgt ttgatatgca taacatccga ggatcgtttc actataagat tagggtggag 1740 aaaacaagag ccaccgcgga aatgattaat cgaatgatta cgaacaaatc ggactacgac 1800 ctcactgaag cggagaggca agtattaagc agagggttgg gatttattca aaacgaccaa 1860 tacgttccgg aaaagacgtt taaagatgca atatcaaaat taaaaagaaa tatgctactg 1920 agagttcatt ttaaggacaa tggaggacta ccccgacaaa caccaatgaa gaggaaactt 1980 gacacggttt ggaaccctcc tgcatccaac aatgcaaatt taaatctttt ctttgaaagg 2040 ataaagagtg aaacagacac agtactaaat gagagaatga acatagcaca ccccaaccta 2100 acgaaaaaca tgaaagaagc aatcttaaca ctgaaaaata ataacaatat catcatcaag 2160 caagctgaca aaggtggtag catctgcatc ttaaacagga gagactatga agagaaaatt 2220 aacaccatgc tagcagatga taccacatat aaactcctac tatacgaccc caccatagaa 2280 atggcacgga acgttgtatc gacaattgat tacatcttat tcaaaggagt gattgacgaa 2340 aagactgcaa attacatacg tccaaacaat ccttgtcgcc ctcccctgtt ctatggactc 2400 cccaaaatac acaaaactgg aattcctctc cgacctattg tatctgcttg cgatggtccc 2460 acagatcacc tctcatattt tgtgactaag ttcatacaac cactagccga aacacttcca 2520 tcatacttta gagactccac acaatttctt cacatgctcc aaaatcaacc actaccgcct 2580 caaaaataca tcttcgtcac cgctgacgtt acctccctgt acactaacat accccataag 2640 gatggtatag aagccatcaa actacttttt aataaaatac cacatgacct aagaccaccc 2700 aattcaccac ctgtagcgta tatacagatt atgattgaca ccattctgac aaacacgacg 2760 ttccagtttg acgaccgcaa ctatttacag atgacaggga catcgatggg tacacgcatg 2820 gcaccaccat acgctaacat attcatgggc acattagacg aagccgtcac tggacaattt 2880 aaagacagca tcacgttcta caaaagattc atcgatgaca tttttttcat attcacaggc 2940 actatagaag atctcgaaca tgttataaca tttatgaaca acctgcaccc caccattaaa 3000 tttactttta accactcaga tatttccatt gactttatgg acctaactct ctacaagaac 3060 catctagata aaatacagag cacaattcac cgtaaaccca ccgacacaat gagccttcta 3120 catttcagtt cataccaccc acatcatatc acatcaggtc tcattcacag ccaagcgatc 3180 agatacaata ggcttatatc tgactcccac cacctatata aggagctccg cacattagcg 3240 cgccaattgg ttcttaaggg gtaccatctt gatattatta acaggaacat cactaaagcc 3300 ctattacaca cacagaaaga tcttatccac aaaaacaaag aagccgatga cacccacatc 3360 ttccccatag tcacacctta caatgaacaa ggtatacgta tcaacaacgc gatattacac 3420 aactggcacc ttatccacaa tgacccagaa ctctgtacac tatttccacc ggtacccacc 3480 ctcacacaca ctaacacact ttccctcaga gatagactta tccgcactaa atacactgcc 3540 aatcacaata actgaacatc cactgtttgc tctagttcgt tttcagctgg taggacgtac 3600 ttctacccct acccgcatgt tacaattttg tatttcaatt attttacaca taaaatatac 3660 cgcctatgtt gtttatgttt tttaccttat tacatgatct atcaataagc caaattgtta 3720 tattattatt cgttcgacgc ttacagttta taattcgctt ataaatacac cagaaaatta 3780 ctagagggct tttgtttata ccccttacac aaaatgctta gacatattta cacgaggata 3840 ctaaacaaat gatttacaca cttaactgct attttaggat gaacatgcta tagagatatt 3900 ccttcgccaa gccacggagg acgcgaacgc attatctaac gcaagtcttg atagggaacg 3960 atacatcgtt tacaaagaaa tgaatggaat atgcttccac cctatgaata cctggccact 4020 atccatacaa ataatattct ggaacataac tataactgac accgaaacat tcaaattgac 4080 actttttttc attggtaacg gatgctctcc atacttaatt ttcaaattcc ttattctcgg 4140 ctttagaaaa aaccgcgcaa aatataaaaa aagaatatac caactgctgt ggctagctag 4200 aaacattgaa aaccactcac atagatggtt ttattttgac atttatctca attctatgag 4260 gtatttaaac atggaccctt gtccatgtga agggaaaaac cttgaggata aacaaaatga 4320 tttatccata ttttatttag aataaaaaaa taaaatatca caatgaatgc attttatatc 4380 ttacataata catactaact acacaaggac tacaacgatc aaactttgct atacactttt 4440 tacatatggt acgccccaaa acaagacaaa aataaattaa gtttagcatg cgttctttac 4500 aagacagatc catctgctta tcacttttac catcctttag gtggtagatt agccacttta 4560 syyacttcct tttaattttt acttattttt tttttatttt taattttttt ctattacctt 4620 ttagtcactc taatagctta tactatgtat ttcggttata ctttttgtat tcgactacca 4680 tattgttaga tttttttgat cctaagaaaa tttatctgac gtcgataagc cagtttgttt 4740 tagttttgca ctatacagtc ttcaatagcg ttcttatggt tattatgtaa acaatctttg 4800 tttttcaact cgatttttta agtccgctag atactctttg atcgattacg cgtttatgac 4860 ctgtatcggt ycactctcat atagtatttt acatattaaa ttattattca tttctggtgt 4920 atccactaat tatattgtgt gttagctggc tatcttgagt ttcaatttat ttatttatta 4980 ttatctaata ctattttcag ttaggaatta tctccatcta tcaggtgaga tttatattct 5040 ctgaacaaat tgtcttacta aactccacac cacatttgta tacctttacc acatactatt 5100 tttatcctag tttaaatcgt tacgagtcgt gatttcctgg attgtgaata tacatatata 5160 tatatatata tatatatata tatatatata tatatatata tatatatata tatataaaca 5220 gatatatatt aggcattact ttattaaaaa atggtaagtc actttgtagt attatgtgct 5280 atctgctgct tattatttat tttgttgtca ttgtcactat tctatatgct ctattgtcac 5340 tattttgtat agatatgttc atgtgtatgt atgtatgtat gtatgtatgt atgtatgtat 5400 gtagggatat gtgtttattt ttttgtttwt tttttttttt ttttatattg tgagtgcgtg 5460 tatttacaca catatttaca gctgtaccac gcgcattcct caccaccctc acgagatctc 5520 taagaatgtt ggagctgttt tcaaaaactg ggatagggat tggttttttg gcttttgtag 5580 tgtagtccta ttatattgca atattgacta gcatgacgta gagtgcaggc ggtgagatct 5640 gttgcaagga cacgcccctg acgagtggcg ttagtgcagg tctcgggctc tccgaaacgt 5700 cattggtgat cgcagtgcat tcctgaggag ggggatcacc atttatccac tacttaattc 5760 taattggtta ttatggcacc aacatttttt tacctcttaa acccatgtat ttttgcgctt 5820 tatgagcttt ttattatgaa tttgttacat ttacgacatt tatatatttt ttatgttaca 5880 caggtgatta tcagttatca attttttatc tagagtttag aacatatata gtaccacatt 5940 tattagacat ttgtatggtg cgattatttt agttatgaat accattataa accagcggat 6000 ttttattgac ttgtttactc atagatcatt atttcatcat ttgttaaaag acgactagag 6060 tgtgagtttt tattcccctt ttcctgagtt gcttactttt actcttccaa ctaaatagtc 6120 tttgcattaa tttatgtttt tttacctaaa tgacatcatc ttcaatctgt ttttattttc 6180 acatctccca ggtctacgta actggacggc tccgttggtc actttttgcg atcattgtgt 6240 cgcataaaca aaaatttatt ctttgtggct ggtgagcccc gactttttcc aatgtacgtc 6300 acaacgacaa aacggacatt aggctatccc gctggatgtg gttacggcgc tcaataacct 6360 aatgactacg ctgacactca cgcacacgtt caatcggaag ctccacaatg cacgtcggat 6420 cagcgctcgc tgactatgtg ctcggatctg aaatctcact tcatagcaca tcattccaca 6480 ccagcacggg gcagtagctc ttttggttaa gagtgatggc ttgtggccta cggtgactaa 6540 tgtgtgcatt cgatcaattg aaatccattc tgtaggtctt acgtctggct ctgatgtgct 6600 agcccataag cgcaagaagt gagtgtgttt agctttctat tcacccgatt tgaaagtgaa 6660 cggaaacata acgagaccat actagacgac aaacggtacg tactaatgca ctacaccatt 6720 acacctgtag acatcaacgg tcttcaaaat cagtatatca aatatttgtt ctttgttaca 6780 tatgatctac atacagagtg attagagtgt tagtttgttc tctccctgcc tccgagcatt 6840 tacttcgttc tttaccttca caaaagtctt atcactcatt ttatattctc atattatttt 6900 tattttatat tcaatgacac catcttctaa ctgtttttat tatcacatct ctcaggtcta 6960 caaaactgga cggctccgtt ggtcactttt tgcgatcatt gtgtcgcata aacaaaaatt 7020 t 7021 // ID Mariner-30_SM repbase; DNA; INV; 2315 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-30_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2315 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1879-1879 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 245..1996 FT /product="Mariner-30_SM_1p" FT /translation="MFFVIFLVKSLIIIFDFICKINKPCKMSPKKKSQEDS FT HKRIKLTIETKRKIIENRERGVSVADLSRTYDRSTSTICTILKNKDKIMEI FT DASKGVTRISTQRLRVLDDVERLLLIWINEKQLQGDTINENIICEKAKMIF FT DDLVKKKPRTSTAEEEVFKGSHGWFEKFKKRTGIHSVVRHGEAASCDKRAA FT ENFIGDFKKLIDSNGYLPQQIFNCDETGLFWKKMPRRTYITAEEDAMPGHK FT PMKDRLTLLFCANASGDLKVKPLLVYHSETPRAFKKCKVQKNRLNVMWRSN FT KKAWVTRDLFTDWLNNVFGPSVKNYLLHENLPLHVLLVMDNAPAHSPGLQD FT DLVEEFKFIKIQFMPPNTTALLQPMDQQVISNFKKLYTKALFEHCFEMTEK FT TNLTLREFWKDHFHIVACLKIIETAWEAVTKRTLISAWKKLWPDGDGECNF FT EGVETLPVESVVNEIVSLATTMGLEVNVNDIHELVKEPSQELTTEELMELH FT YVSQEEVVEEIMSEKEEGTIKQQSSGAIREMLKSWETAASYIEKYHPDKAV FT AMRATNLFNDNAVLPFYQILKRRQKQPSLDSFLIKKN" XX SQ Sequence 2315 BP; 773 A; 418 C; 481 G; 643 T; 0 other; cagtgaaacc tcgcataacg agaaattcgc ataacgagta aaaaaaattg gaaaaaaacg 60 gctcgcttaa cgaacgattt ttcgcataac gagtgaaagt cctggattgt aaaaaaaaat 120 ttaaaacaaa aatatttgca gtattcacga aaacattatt tatggataca tcagtatgtg 180 tttacaattg ttccgctagc atattttact tggatataaa cttttacaaa cccttgttta 240 tacaatgttt tttgttattt ttttagtaaa aagtttaata attatttttg attttatttg 300 taaaatcaat aaaccttgca aaatgtcgcc gaaaaaaaag tcgcaagaag acagccataa 360 gagaattaaa ttaaccatag aaacaaagcg taaaatcatt gaaaatcgag aacggggcgt 420 gagtgtagcg gatctatcgc gcacatacga ccggtctact tctacgatct gtactatcct 480 taaaaacaag gacaagatta tggagataga tgcgtcaaaa ggagttacaa gaatatctac 540 gcaacggtta cgtgttctcg acgatgttga aaggctgctt cttatatgga taaacgagaa 600 gcaattacag ggcgacacta ttaacgagaa catcatttgt gagaaagcaa aaatgatttt 660 cgacgaccta gtaaagaaga agccaagaac atcaacggcc gaagaagaag tgtttaaggg 720 aagtcatggg tggttcgaaa aatttaagaa aagaaccggc atccacagcg tcgtgaggca 780 tggtgaagca gccagctgcg acaagagggc agcagagaac ttcatcggcg acttcaagaa 840 actcatagat tctaatggct atctgccgca acaaattttt aattgtgacg agacgggtct 900 tttctggaaa aagatgccta ggcgaactta tattactgca gaggaggatg caatgcccgg 960 gcacaagcca atgaaagacc gcctcacgct acttttttgt gctaatgcaa gcggcgattt 1020 gaaagttaaa ccgctgcttg tttatcattc tgaaacccca cgagcattca agaagtgtaa 1080 agtccaaaaa aacaggctaa atgtgatgtg gaggtcaaac aaaaaggcgt gggtgacacg 1140 tgaccttttt actgattggc tcaataatgt gtttggtccg tctgtgaaaa attatttact 1200 tcatgagaat ctgccgttac atgtcttgct tgttatggat aacgcacctg ctcattctcc 1260 aggcctacaa gatgatctcg ttgaagaatt caaattcatc aagatccaat tcatgcctcc 1320 caataccact gccttactcc agcctatgga ccagcaggtt atttcgaact ttaagaagct 1380 ctacaccaag gcactcttcg agcactgttt tgaaatgact gaaaagacca atctcactct 1440 cagagagttt tggaaagatc acttccacat cgttgcctgc ctcaagatta tcgaaacggc 1500 atgggaggca gttaccaaga gaactctcat ttctgcgtgg aagaaacttt ggccagatgg 1560 tgatggcgaa tgtaactttg agggggttga aacattaccc gtagagtctg tagttaacga 1620 gatcgtgtct ttggctacga ccatgggact agaggtaaat gtaaatgata tccacgagct 1680 tgtgaaagag cctagccaag agctgaccac tgaagagcta atggagttac attacgtttc 1740 tcaagaagaa gttgtggagg agattatgtc agagaaggag gagggaacaa taaagcagca 1800 atcttctggc gcaataagag aaatgttgaa atcatgggaa actgctgcgt cgtacattga 1860 gaaatatcac cccgataaag cagtagctat gcgcgctaca aatttattta atgataatgc 1920 cgtattgcct ttttaccaaa ttttgaagcg tcgccaaaaa caaccgtcac tagacagctt 1980 tttaataaaa aaaaattagt tatgtatcat tacaacaaat ttattattcg aatacttttt 2040 aatttttgta aaatgaattc gcattacagg ttggtttatt ccgttacaaa aagaaagacc 2100 aagatggtct acaatcctta acaacatgat catgtatata attttcattt aataaaagta 2160 tatgtcataa aattttttgt gtattttttt cggcacggaa cgaattatcg tattttacct 2220 taatttatat gggaagcgtt gtttcgctta acgagtgttt cgctttacga ggaacgttct 2280 ggaacgaatt atgctcgtta tgcgaggttc cactg 2315 // ID Gypsy-201_AA-LTR repbase; DNA; INV; 190 BP. XX AC supercont1.64; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-201_AA_; KW Gypsy-201_AA-I; Gypsy-201_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.64; Positions 1541047 1540858. XX SQ Sequence 190 BP; 56 A; 33 C; 33 G; 68 T; 0 other; tgtaataata ttgatcatat ttaatcagaa ctgtcataac cacagggcat attccatagc 60 tagctatagt taggttgagt ccgccagtct attatgatat tattgtctat catgaaataa 120 agtcagttat catggctacc gtgcattaaa caccgcgttt tattctgctg gtttttcgag 180 aattgttaca 190 // ID Gypsy-3_BM-I repbase; DNA; INV; 2607 BP. XX AC nscaf2937; XX DT 19-MAR-2010 (Rel. 15.07, Created) DT 19-MAR-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from silkworm: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_BM_; KW Gypsy-3_BM-LTR; Gypsy-3_BM-I. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-2607 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from silkworm."; RL Repbase Reports 10(7), 981-981 (2010). XX DR Genome; nscaf2937; Positions 529564 532170. XX CC Positions [1720-2196] - Integrase core CC 'TATA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 160..2592 FT /product="Gypsy-3_BM-I_1p" FT /translation="MPRKGAKEDEECFAEPAPQISFCELEKSMTAFSGDDA FT YGVDTFIKDFEDIANLMQWTGIEKLIYAKRLLKGTAKLFLRSLTGTTTWEL FT LKTGLKEEFGLHLNSAAIHKTISNRKMKPGETYQQYFLQLKELAVLGNIED FT DALMEYIIDGIPDNEMNKTILYGASNIKEFRKKLDLYAEIKKKCRTSNKAY FT NQSTTSKPTTSGWKKPLPKKRCFNCGDLDHESSGCSKGIKCFRCNDFGHKS FT TDCPKNIKKTLEIKCNKENDSQAPYKKVLINNVMVKSLIDTGSDVNLMNES FT TFTQIHGDTCNYHPDLVQSLTGIGDCEVHTRGSCTPRIEIDGCQCEATFYI FT IEDDKIPVDVIIGNPILQDIELSFKSGVISAKRILQVTMLEDQTEEPMIIG FT NFKYKDQIEKILTEYDPGKCCKESKVELKVLEKEHYDDFMIEAGILWKIKD FT GKKLMVVPKKMQNEIIRKNHDKGHFGLIKTEELISRNYYFENMKDKIKSVL FT ENCLECILISHKKGKSEGFLHAIDKGDIPLSTYHIDHLGPLTSTNKNYKYI FT LTVVDGFTKFTWIYPTKTLSTDEVIDKLKIQQQTFGSPQRIISDRSTSFTS FT NTFKEFCETEGILHHIITTGQPRGNGQVERVHQIIIDALSKLSADDPTKWY FT KHISNIQRCLNGSFQRSIKMSPFELLTGVKINNKNEAIYNLINEENLKFFC FT DEREDLRKKARENIQKIQDENRRNFNRRRRKPREYEVGNLVAIKRTQFVQG FT YKLHPRYLGPYEVVKKKRNDRYDLQRIGQGEGPHHTSSSADMMKPWVADAV FT EYESSGTDD" XX SQ Sequence 2607 BP; 991 A; 430 C; 519 G; 667 T; 0 other; aaactttggg ggctcgtccg ggatcgaaat cgacatcaag attggtgatc agtgtacaaa 60 agttgcgaac aaagactacg cacacgggaa aaagtgaggg tattttcaca tttcaaattt 120 cacgtgaaga cgtttacaaa gaattctatt gatagtaaaa tgcctcggaa gggtgctaaa 180 gaagacgaag aatgctttgc tgaacctgct ccacaaattt ccttctgcga attagaaaaa 240 tctatgactg cattttctgg tgacgatgca tacggcgtgg atacctttat aaaggatttt 300 gaagatatcg ctaatctcat gcaatggacg ggcattgaaa aactgatcta cgcaaaaagg 360 cttttgaagg ggacagcaaa attgttcctg agatctttga ctggtacgac tacatgggaa 420 ctcttgaaaa cgggattaaa agaggaattt ggactacatc tgaacagcgc agccattcac 480 aagaccatat caaataggaa gatgaaacca ggtgaaacct accaacaata ttttttgcaa 540 ctcaaagaac tagcagtact cggaaatatt gaagatgacg ccctaatgga atacatcatt 600 gatggcatcc ctgacaatga aatgaataaa actatactat atggagcttc caacatcaaa 660 gaatttagga aaaaacttga tttatatgct gaaataaaga agaaatgtcg aacgtcaaac 720 aaagcttata atcaatcaac aacatccaag cctacaacat ctggatggaa gaaaccatta 780 cctaagaaga gatgtttcaa ctgcggagac ctagatcatg aatcttcagg ctgctctaaa 840 ggaataaagt gcttccgttg taatgacttt ggtcataagt caactgactg tcccaaaaat 900 ataaagaaga ctctagaaat aaaatgtaac aaggagaatg atagccaagc accctacaaa 960 aaagttttga tcaataatgt tatggtaaaa tcacttattg acactggcag tgatgtgaac 1020 ctaatgaatg aatctacatt tactcaaata catggcgaca cttgtaacta ccatcctgat 1080 ttagtacaaa gcctgacagg cattggtgac tgtgaggtac atacaagagg ctcatgtact 1140 ccaagaattg agattgatgg ttgccaatgt gaagcaacgt tctatatcat cgaagatgac 1200 aaaataccag ttgatgtaat aatagggaac ccaatattac aggacataga actcagtttt 1260 aagtctggtg ttatttctgc taaaagaatt ttacaagtca caatgttaga ggatcagact 1320 gaggaaccaa tgattatagg aaatttcaaa tataaagatc agattgagaa aatactgaca 1380 gaatatgatc ctggaaaatg ttgtaaagaa tccaaagttg aattaaaggt attagaaaaa 1440 gaacattatg atgactttat gattgaagct ggaatattat ggaaaataaa agatggaaaa 1500 aaattaatgg tggtacctaa gaaaatgcaa aatgaaatta ttagaaaaaa tcatgacaag 1560 ggacattttg gattaattaa aacagaagag ttgatttcca gaaactatta ctttgaaaac 1620 atgaaagata agatcaaatc agtattagaa aactgcttag agtgtatttt aatttcacat 1680 aaaaagggaa aatccgaagg ttttctgcat gccatagaca aaggagacat accactatct 1740 acttaccata ttgaccattt gggaccgttg acatcaacaa ataagaatta caagtatata 1800 ctcacagttg tagatggatt cacaaaattt acatggattt acccaacaaa gactttgtcc 1860 actgatgaag ttattgacaa actgaagatt caacaacaga cttttggttc accacagagg 1920 attattagtg acagaagtac ctcctttaca tccaacactt ttaaagaatt ttgtgaaact 1980 gaaggaatct tacatcacat tattactact ggccaaccta gaggtaatgg tcaggttgag 2040 agagtccatc agataataat tgatgctcta agtaaattat cagcagatga tcccaccaag 2100 tggtacaaac acatcagtaa tatccagaga tgcttaaatg gttcatttca aagaagcatt 2160 aagatgtcac cctttgaatt actgacagga gtcaaaatta ataataaaaa tgaagctatc 2220 tacaatttaa ttaatgaaga gaatcttaaa tttttctgtg atgaaagaga agacttgagg 2280 aagaaagcaa gggagaacat acaaaaaatt caagatgaga atagaagaaa ttttaaccga 2340 agacgaagga aacccagaga atatgaagta ggaaatcttg ttgctatcaa gaggacacaa 2400 tttgtgcaag gctacaaact gcatccaaga tacttaggac cttatgaagt cgtgaaaaag 2460 aagaggaacg accgttatga tctccagagg attggtcaag gtgaaggacc ccatcacacc 2520 agtagctccg cggatatgat gaaaccttgg gtcgctgatg ctgtcgagta tgagtcatct 2580 gggacagacg actgagcagg acggccg 2607 // ID R1NS-1_CQ repbase; DNA; INV; 5783 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE An R1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1NS-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5783 RA Kojima K.K. and Jurka J.; RT "Non-sequence-specific R1 non-LTR retrotransposons from the RT southern house mosquito."; RL Repbase Reports 11(1), 599-599 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >99% CC identity. It shows no sequence specificity. XX FH Key Location/Qualifiers FT CDS 893..2506 FT /product="R1NS-1_CQ_1p" FT /translation="MVNSKTNVHYTXEEVXRLNXXIRSAGNIRSDITRSGL FT SLXVLARXAEELWKEXRSKYNALAEEASAEKAARLLLEXRVXALEVHNDSH FT PSIEVEDQVTHSEFXXLREQNAALQATVEELSKTIAELAXAKPASXXAALD FT SEVXALREQNXALXAEVAALRETAVTXPPXESQSGAXXXLSSELAKAQQEI FT LXLREXLKGLKEEXAKXESEPKSGKSGKTGKNKGKNXGKNQQNQKPKQAKG FT GAXNADPKPAVPXTPPGSKEXXTSPEGASQGGXPGDGSXXGNANDGFTVVN FT RRKPRSKHKPRXRNEAIAIKADEXSYASLLRNMRSNDDFKXLGEATKSVRR FT TRRNELLLILKKGAKPSSEYARLVAESVGSDEIKVRSLCPETTLQCKNLDE FT TVTAEDLLDAITTQCLTGTLSAPVQLRKYNQGTQTATFKLPAKIAAMVLKV FT GKIKVNWSVCPVSAIERPTVCFRCLEYGHKSWACKGPDRSKLCRRCGAEGH FT QSKGCTAKAKCLICTGDGSNHATGSYTCPSFRSAFDKLRPCK" FT CDS 2491..5508 FT /product="R1NS-1_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="AETVQVIQLNLNHCXTAQQLLFQTVAEKKCDVALLSE FT PYRIPEGNRNWLSDPPKAAAIWVTGQFPIQELVAAEEGFVIAKVNGVFYCS FT CYAPPRWSIDRFDRMLDRLTDALTGKTPAVVAGDFNAWSTQWGSRESNRRG FT QDLLEALARLNVDLANEGSASTFRRNGVSSIVDVTFCSPSLXASMDWKVDE FT GYTHSDHQAVTFRICRVAXXPAQPAPRQXCRWKTSTLXKEVFVEALRRESA FT TRDILELDCDGLMTILVGAXDAAMPRQXKPRNARTPVYWWNDDIAAIRAAS FT LRARRKLQRARSDEQREARRVIYKIAKATLXKAIKESKRNCFGNLCQEANN FT APWGNAYRIVMAKLKSGAVAIDRSPEMMSRIIDGLFPHHEVEPWPTTPYYH FT EGLTXDEXSITNEELIKAASALKANKAPGPDGIPNQVLKLAIEENPDMFRS FT ALQKCMDTGVFPDRWKRQRLVLLPKPGKPPGDPSAYRPICLIDTAGKLLER FT VILNRLTIYLERGLSDRQFGFRKGRSTVDAIRAVLEKAKIAVEPKRRGMRF FT CAVITLDVKNAFNSASWDAIARSLHRFRVPNYLCKLLKSYFENRVLLYETD FT EGLRELIITAGVPQGSLIGPGLWNGMYDGVLTLXXPTGVSIVGFADDIVLM FT VLGESQLQVEVRATEAIRTIEEWMGLHHLGLAHQKTEVVAVNNFKSAQCIN FT VRAGNCXIXSKRSLLYLGIRVDDKLSFNSHVXYVCEKAMKAIASLSRMMGN FT SSAVKSSKRRLLAXVVXSVLRYGAPAWGAALEAKTNLQKVTSVYRLXCLRV FT CSAYRTASAEAACVLASMIPIGLLVREDMRCHELRGVGGHRASARTATMAX FT WQRSWDDSTKGRWTHRVIPNLERWVNRTHGEINFYLTQVLSGHGCFRQYLH FT RFGHATSPNCPNCTGTVETVEHVVFXCPRFQSSREILLLGCGXDTXPDNLT FT DRMCXSEEAWGAASXAITQIMLKLQGKWRTDQRASRNEGPPPSSSVRYVTS FT " XX SQ Sequence 5783 BP; 1442 A; 1505 C; 1678 G; 1004 T; 154 other; ccgtggggac atgtgaaggg tccccgcggg agctgagctg aggctaccgg gcgggttgca 60 gtcggcggat agctgtatgc tcttgcatgt ttacactgct gagcacccgg accaacagcg 120 ggaagtcgat caggacgtgg tggaatccaa cttgggcaag tgttgttgtt gttttcctac 180 aaaaagccac accagtccgt tacataaatc cttccgacaa agcgaatgaa tgccgcttta 240 agcgttcaaa ggcccatact tcacaaaccc aaatccaaga tgtgcagcga tccgtgtcga 300 gggattcatg gctaggaggg tttaaaacag cctagtcgct aacggtagcc tgtggggcaa 360 cagggcgcac cccacagtat tgtagccctt accgcgctaa ggcagggcaa tggcgcggcg 420 gacggtgtat ttccctagcg actcgtggga tgaaccagca atgttgaaaa attcaaataa 480 aatcacaaac gakggttccg akgcacccga wgcatcggtt smagaagaaa gtccggtgga 540 aaacgcggag tctgggggga ccwcctccca gcctgcctta gtagwaccgg ccgtcgcgat 600 ggcgmaccgg cctgagacwk cagwmmakgg ccmgccgatc ggacctgtag ttcccgcgtw 660 ccmkttcgcg aacaccgcgg gagtgcaacg atgaatccgc tgaggagtcc tmcaagmgaw 720 mactckctgg ggagwmgatc swgggggwcc cmaagcgtgt kaggwsgggg scsawagagc 780 gkatgataac tccgaaackg ccgcccatcs cgggasgaaa taggagagww cctcccaccg 840 tagacagtga cgcaatacgc gaacgagcgc acgtgagcta agcgacctac tcatggtaaa 900 cagcaagact aacgtccact acaccmtcga ggaagtcgma aggctaaacg akcwcatmmg 960 aagcgccggc aacatccgga gcgacatcac magatcaggg ctgtcgctcw cggtgctggc 1020 gcgtcawgcg gaggagctct ggaaggagmt gcgatcgaag tacaatgccc tcgctgagga 1080 ggctagtgcg gagaaagckg cgaggttgtt gctggagsaa agggtgawgg ccctkgaggt 1140 ccacaacgac tctcaccctt ccatcgaagt agaggaccag gtaacccata gcgaattcma 1200 wkcgctacgg gagcagaatg cggcsctcca agcmacggtg gaggaactat ccaaaaccat 1260 cgcggagctc gcakctgcta agcckgcmtc cgamcmagct gcgctcgaca gcgaagtgtm 1320 cgcactccgc gagcagaacc wggcgctcas cgccgaggtw gcagccctga gagagacggc 1380 cgtcacacma cctccgwgtg agtctcaatc aggggccsaa kcgmcccttt cktcagagct 1440 tgcgaaggca cagcaggaaa tcctcwccct gcgcgaagam ctgaagggcc tsaaggagga 1500 agmggccaag wgggagtcgg aaccgaagtc tggaaagtct ggcaagactg gaaagaacaa 1560 gggcaagaac mawggtaaga accaacaaaa tcagaagcca aagcaggcga agggtggcgc 1620 cmagaacgca gatcccaaac ctgcagttcc aawmacgccc ccaggtagca aggaggwtgk 1680 caccagcccc gaaggggcgt cccaaggtgg cwmcccgggc gacgggagcg kcgawggaaa 1740 cgctaatgac ggcttcacgg tkgtgaaccg aaggaaaccc aggtccaagc acaagccccg 1800 kwcgaggaac gaggctatag cgatwaaggc cgacgagmaa agttacgcga gcctgttgcg 1860 gaacatgcgc tcgaatgacg actttaagga sctgggggag gctaccaagt cggtgcggcg 1920 aactcgccgg aacgagcttc tcctcatcct caaaaagggt gcgaagccca gttccgagta 1980 cgcccgactg gttgccgaga gtgttggaag tgacgaaatc aaggtccggt cwctctgtcc 2040 mgaaaccact ctccagtgta agaacctgga tgagacggtg acggcwgaag acctccttga 2100 cgccataaca acacagtgcc ttacgggcac cctctcggca cccgtccaac tgaggaagta 2160 caatcaaggc acgcagacgg ctacctttaa gttaccagcg aaaatcgcgg ccatggtgct 2220 taaggtgggt aagatcaaag tcaactggtc agtatgtccg gtgtctgcga tcgaacggcc 2280 aaccgtatgc ttcagatgcc ttgagtacgg ccacaagtcg tgggcctgca agggccccga 2340 ccgcagtaag ctgtgcagac gatgcggagc cgaaggccat cagtcgaagg gctgtacagc 2400 taaggcgaag tgtctaatat gcacggggga cgggagcaac catgctacgg gaagctatac 2460 ctgtcctagc ttcagaagcg cgttcgataa gctgagaccg tgcaagtgat acagttgaac 2520 ctcaatcact gcgamacggc gcagcagctg ctcttccaga cggtggccga gaagaagtgt 2580 gacgtagcgc ttctatcgga accttatcgg attccggagg ggaacaggaa ttggttgagc 2640 gacccaccca aggctgcggc aatctgggtg acgggtcagt tcccaataca ggagctagta 2700 gcagcagagg agggcttcgt gatcgctaag gttaacggag tcttctactg cagctgctac 2760 gctcccccga ggtggtctat agacaggttc gacagaatgc tggatagact gacggacgcg 2820 ctcacaggca aaacccccgc agtggtggcc ggcgacttca acgcgtggtc cacacagtgg 2880 ggtagccgcg agtcgaaccg cagaggccag gacctactcg aggcacttgc caggctaaac 2940 gttgatctag ccaacgaggg atccgcgagc acgttccgta ggaacggagt ctcgtcgatc 3000 gttgacgtta ctttctgtag tcccagcctt mtggcaagca tggactggaa agtggacgag 3060 ggctacactc acagcgatca ccaagcggta acgtttmgga tctgccgtgt ggcccsmsgt 3120 cctgcacaac cggcmccccg mcaagwgtgc cggtggaaga catcgaccct gamcaaggag 3180 gtctttgtcg aagcgctaag aagagaaagt gccactcggg acattctaga actcgactgc 3240 gacgggctaa tgaccatcct ggtgggcgcc wgtgatgcag caatgccgag gcaakcwaag 3300 ccamgaaacg cgcggacccc agtttactgg tggaacgacg atatagckgc cattcgagcg 3360 gccagcctga gggcccggag gaaactccaa cgagctcgct ccgacgagca gagggaagcc 3420 aggcgggtga tctacaaaat cgccaaagct accctwwgca aggcaattaa ggagagcaag 3480 aggaactgct tcggcaacct ctgccaggaa gccaacaatg caccatgggg taacgcctat 3540 cggattgtta tggctaagct gaagagtggc gcggttgcca ttgatcgttc acctgaaatg 3600 atgtcacgaa tcatcgacgg gttgttccct caccacgagg ttgagccctg gccaacgacc 3660 ccctactacc atgaggggtt gacggwagat gaacmmagca tcacgaatga ggaacttatc 3720 aaagccgcat cagccctcaa agcgaataag gcaccggggc ccgatgggat ccccaaccag 3780 gtccttaaac tggcgatmga ggaaaacccg gacatgttca ggtcggctct ccagaagtgc 3840 atggacacgg gggtctttcc ggaccggtgg aagcgacaaa ggctmgtcct gttgccgaag 3900 ccggggaagc cacccggcga cccatccgcg tataggccca tttgtctcat agacaccgct 3960 ggcaagttgc ttgagagagt tattctsaac agactaacga tctatttaga gaggggtctt 4020 tcggaccgtc aatttggctt caggaaaggg aggtccacgg tcgacgcgat cagagcwgtc 4080 ctcgagaaag ctaagattgc mgtcgaacca aagcgtcgag ggatgcgctt ctgcgccgtc 4140 atcactctag atgtcaagaa tgctttcaat agcgctagtt gggacgccat tgcgagatcc 4200 ctccatagat ttcgtgtccc caactacctg tgcaagttgc tcaaaagcta cttcgaaaac 4260 cgagtccttc tctacgagac ggacgagggt cttcgcgaac tgatcatcac ggcgggggtc 4320 ccacaaggct ccttgatcgg tcctgggttg tggaatggta tgtacgacgg ggtgctgact 4380 ctasagttwc cgacgggcgt tagcatagtt ggctttgccg acgacatcgt cctgatggtg 4440 ctcggtgagt cacagctcca agtcgaggta cgagcaaccg aagcaatacg gacaatcgaa 4500 gaatggatgg gactgcacca tctagggctk gcgcatcaaa agacwgaggt agttgccgtg 4560 aacaacttca agtctgcaca gtgcattaac gtcagggcag ggaactgcwc gatcgmctcc 4620 aagaggtcat tgctctacct gggtatccgg gtggatgaca aattgagctt caacagtcat 4680 gttgmctacg tmtgtgagaa agctatgaaa gcgatagcat ccctatctcg catgatgggg 4740 aatagctckg ckgtcaagag tagtaaaagg cgtctgctgg cgasmgtggt cgsttccgtc 4800 ctccgctacg gagcaccggc atggggcgcc gccctagaag caaagactaa cctmcagaag 4860 gttacgagcg tctacaggct gmtttgcctt agggtatgca gcgcgtaccg caccgcatca 4920 gcagaggcgg cctgcgtwct agctagcatg atcccaattg gactacttgt ccgagaagac 4980 atgcgatgcc acgagctaag aggggtcggg gggcaccgtg cttcggccag gacagccact 5040 atggctgwwt ggcaamgaag ctgggatgac tcgaccaaag ggcgatggac acatcgcgtg 5100 atacccaacc tggagaggtg ggtaaacagg acgcatgggg aaataaattt ctacctgacc 5160 caagtcctat cggggcacgg gtgcttcaga cagtacctac acagattcgg acacgcaacc 5220 tcacccaact gcccgaactg tacgggcacc gttgaaacgg tggaacacgt ggtatttgam 5280 tgcccmcggt tccagagtag cagagagatc ttgttgctgg gwtgtggtmc kgacacawcg 5340 ccagacaacc tgacagacag aatgtgccwk agcgaggaag cgtggggtgc agcttccasa 5400 gccataackc agattatgct caaattgcag gggaagtggc gcacggacca gcgtgcgtcc 5460 cggaacgaag ggcccccgcc cagtagttcc gttcgatatg ttacaagttg agaagtatta 5520 aactccaatt gatgaagaag gcacgcgcac gtacaaatgt acatacgagc gtgcctcccg 5580 gacaaagggc tcgggcacaa ccgttccgaa acgagagcgc ttaattagcg ctgctcctcc 5640 ccgacgtaat atcgaaagac agttccgggg aggcttgaag gcctaccgtg ataaagtggc 5700 gttttttagt gagtcggact caactccaac ctcacacgat gtctgcagac agatttttcg 5760 tcctactgaa acaaaaaaaa aaa 5783 // ID Copia-29_DPu-LTR repbase; DNA; INV; 243 BP. XX AC scaffold_212; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-29_DP_; KW Copia-29_DPu-I; Copia-29_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-243 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_212; Positions 133409 133167. XX SQ Sequence 243 BP; 61 A; 49 C; 40 G; 93 T; 0 other; tgaatgtaaa cacggaatca acgtttgaat caacgtttac ttagcatctg tatttccaag 60 tatttcggat cgtttgctaa tcgtcttttc tttctccctc ttccttgtgg aagagaagtt 120 tctggcttag tgtctttcaa gccctcttac agagatctac cttttgattc agaaggttca 180 atacaaattt actagagcgt acaaaaatgt gtttgtcgtc tctcgtttta ctcttataat 240 tca 243 // ID Gypsy-595_AA-I repbase; DNA; INV; 4334 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-595_AA_; KW Gypsy-595_AA-LTR; Ty3_gypsy_Ele22; Gypsy-595_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4334 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3466-3942] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 496..1674 FT /product="Gypsy-595_AA-I_1p" FT /translation="MGSSDEEENVRAPQFTFHDIGDSLEKFSGERSDRGVL FT EWLNDFEKTCDNFGWNSVQKFVYGRRLLKGTAKLFVSSSSGLNDWAALRAA FT LEEEFEDKVTSAEIHELLRNRKKKSDESYLQYIYHMQNIAKRGGIEEEAVC FT DYVVRGITDDPVNKVCLFGATKVNDLKDRVKQYEKMKNQMRDSGRSSAKPV FT VRAEKEKVKKSSKISTKSDTSESTAEEVRCYNCGNRGHYANDCDMKSRGPK FT CFVCGDFGHRAKDCKQQKEKDNEVNILTEERMPVLQISVKNIELRALFDTG FT SRHNLICESAYKRIGEPALTNTPMVFSGFGAMKTRARGKTVVGICVDSEEQ FT PEMPFYVVPVGSMSYDAVLGMDALNRLDAEINKEGVKINKKKNEAKCDDE" FT CDS 1984..4335 FT /product="Gypsy-595_AA-I_2p" FT /translation="MRVCVDYRQLNQKVVKDCFPMRNIEDQVDSLKKAKVF FT TTLDLKNSFFHVPVEVSSQKYTSFVTHTGQYEFLKTPFGFCNSPASFSRFV FT ADVFRDLIRSGRLIIYVDDAIIPSESEEENIEVLKEVLKVAAENGIQFNWE FT KSQFLKNEVDYLGYVIGDGSYRISPTKIRAVQYYREPSNVKELQRFLGLTS FT YFRKFIPEYALIAKPLTSLLKKDVSYEFGDSQRKSFMSLKQCLVSDPVLKI FT YDPDAETELHTDASKEGYGAVLLQKSDDGKLHPVYYMSQQTSSAEKNYSAY FT HLEVLAVIKAVERFRVYLLGIPFKIVTDCAAFQHTMKAKQLGHRVARWALL FT LEEFTYTVEHRAGTRMKHVDALSRAPVMLTSSDPMVEMIRSAQKRDEKLKV FT ITELLKTQTFDDYFVSDDLLMKLIDGREVIVVPAELQSEVIRRAHENGHFG FT VRKLEDVIKRDFFIPSLAEKMKRQIECCVKCILANRKRGKVDGLLAPIPKG FT DVPFDTYHVDHLGPMDVTEKMYKYLFVVIDAFTKFTWIYPTKTTNAHEVIQ FT RLQNQSELFGNPRRIISDKGAAFTSNDFKVYCAEQNIEHVEITTGVPRGNG FT QVERSNQVILAMLTKLSVNDSTKWYKHVANIQRWINASVHQSTTVSPFEAM FT FGVQMRHAGDVRLCELLEEIRVAQFDDQRSGIRAKARAAIEKAQQDQRRSY FT NLRAREPPTYKIGDVVAIKRTQFGPGRKYAAEFLGPYKITNVKPNNRYDVE FT KITGEGPKKTSTAASHLKPYRIWGPEPLQDDRV" XX SQ Sequence 4334 BP; 1258 A; 833 C; 1222 G; 1013 T; 8 other; tgggggctca accgggattg cctcccagtg ttttcagttc cggaagattg agtggaaagc 60 gattttmgtt catgtgttga attgaaattt cgtgccgcca tattgtgtga tcgaatgttg 120 aaaaaatccc gagtgttgtg agattggaga gaagtctcca aattgagagt tgtgagaacg 180 aagcgaaatc ctaccaccac tgtgagttat gtggtcgaat ggagaatatc ttcgcctgtg 240 agagcagagt agtgtttgtg tggacgtgaa acgagatgct gcataatgtc cgaaagtgaa 300 gagcagcttg cgcaacaatc smttcggaat ttgcgagaac tgtgtgcgga gaaaaacctc 360 tcgactagtg gcaaaaaggc gacgctcgtg cgaaggcttt taggaaatag cacggtcgaa 420 gacgaaacag tggtaaacgt tggtggacaa aaagtcacca catcaactcc gaaaattacg 480 agaaaccagc gcgaaatggg ttcgagtgac gaagaagaaa atgtgcgcgc gccgcagttc 540 accttccacg atatcggtga ttcgctcgaa aagttttccg gagaacgttc cgatcgcggt 600 gtgcttgagt ggctgaacga tttcgagaag acgtgtgaca atttcggttg gaatagcgtg 660 caaaagttcg tttacggccg cagactgttg aaaggcactg cgaagctgtt tgtgagtagt 720 tcatccgggc tcaatgattg ggctgcgttg agagccgccc ttgaagaaga atttgaagat 780 aaagtgacga gtgcggaaat ccacgagctg ctgcgaaatc gcaaaaagaa gagtgacgaa 840 tcctatctgc agtatatcta ccacatgcaa aacatcgcga aacgcggtgg aatcgaagag 900 gaggctgtgt gtgattacgt agttcgtggc ataaccgatg atccggtgaa caaagtgtgt 960 ttgtttggtg cgaccaaagt gaacgatttg aaagaccgcg tgaagcaata cgagaaaatg 1020 aaaaaccaaa tgcgagacag tggtcgtagt agcgcgaagc ccgtggtgag agccgagaaa 1080 gaaaaagtga aaaagagctc aaaaatctcc accaagagcg acacatccga gtccactgcc 1140 gaagaagtgc ggtgctacaa ttgtggaaat cgtgggcact atgccaatga ttgcgacatg 1200 aagtcacgcg gaccgaagtg ctttgtgtgt ggtgattttg gtcaccgcgc gaaagactgc 1260 aagcagcaga aagagaagga caacgaagtc aacattctca ccgaagagag aatgcccgtg 1320 ctccagatca gtgtgaaaaa tatcgagctg cgtgcgttgt tcgatacggg cagtcggcat 1380 aacttgatat gtgaaagtgc atacaagcga attggagagc cggcgcttac caatactccg 1440 atggtgttta gtggttttgg tgcaatgaaa acgcgagcac gtggcaaaac cgttgttggt 1500 atttgtgttg actcggaaga gcaaccagag atgccatttt atgtcgtgcc ggtcggtagc 1560 atgtcatatg atgctgtttt gggcatggat gcgttgaatc gactggatgc ggagatcaac 1620 aaagagggag tgaaaatcaa taagaagaaa aacgaagcaa agtgcgacga cgaggktgaa 1680 atgatgatgg ttttggaagg ggagaagagt ggtgttgata tcgacgtgcc accccgatat 1740 gcaagtgtta tcagcgagat gattaagaac tatsaaccaa aaagcgatgt gaaaagtcga 1800 gtggaaacaa aaatcatttt gaatgacgac gttccagtgc acattcctcc gcgacggttt 1860 gctccgaagg aaaaagcgat cctggaaaaa actgtagacg agtggctcaa agctggcata 1920 atcaaggaga gcgtcagtga atatgcaagc cccgttacac ttgcacccaa gaaaaacggt 1980 tcgatgagag tgtgtgtgga ctatcgwcag ttgaatcaga aggttgtgaa agattgcttt 2040 ccgatgagaa acattgaaga tcaagtggac agtctcaaaa aggcgaaagt gtttactaca 2100 ctcgatttga aaaactcktt tttccatgtg cctgtggaag tgtcaagtca gaagtacact 2160 agtttcgtaa ctcacacggg acagtacgag ttcttgaaga ctccattcgg attttgtaac 2220 agtccggcta gcttcagtcg ttttgtggct gacgtgttcc gtgacctaat aagaagtggt 2280 cggctgataa tttatgtcga cgatgcgatt attccgtcgg agtccgaaga agaaaacatc 2340 gaagtgttga aggaagtgct caaagtcgct gctgagaatg ggattcagtt taattgggag 2400 aaatctcaat ttttgaagaa cgaagtggac tatctgggtt atgtgattgg tgatggtagt 2460 taccgcatat ctccaacgaa aattcgtgct gtgcagtatt accgtgaacc ctccaatgtg 2520 aaggagttac aacgtttcct gggtctcacg agctacttcc gaaaattcat tccggagtac 2580 gcgttgattg cgaaaccgct cacttcgctt ttgaagaagg atgtgtcgta cgagttcggt 2640 gattctcagc gaaaaagctt tatgtcgctg aaacagtgtc tagtgtccga ccccgttttg 2700 aaaatttacg atcctgatgc cgaaacggag ctgcacaccg atgcctccaa ggaagggtat 2760 ggtgctgtgc ttcttcagaa gagtgatgac gggaagctgc acccagtgta ttacatgagt 2820 cagcaaactt cgagtgccga gaagaattac agtgcgtacc atttggaagt tttggcagtg 2880 ataaaagccg tcgaaaggtt ccgcgtgtat ctgttgggaa ttccgttcaa aatcgtgacg 2940 gattgtgcgg catttcaaca tacgatgaaa gcgaagcagt tgggacaccg tgtggcgagg 3000 tgggccttgc tgctggaaga gtttacstac actgttgaac accgagcggg aaccagaatg 3060 aagcatgtcg acgcgttgag tcgagctcct gtgatgctga cttcgagtga tccaatggtt 3120 gaaatgataa gaagtgctca gaagcgagat gagaagctaa aagtgattac cgaactattg 3180 aagacacaaa ctttcgatga ttattttgtg agtgatgatt tgctgatgaa actgattgat 3240 ggacgagaag tgattgttgt tccggcagag cttcagagtg aagtgatccg aagagcgcac 3300 gagaacggac attttggtgt gcggaagctc gaagacgtta tcaaacgtga ctttttcata 3360 ccgagtctgg ctgagaagat gaagaggcag attgagtgtt gtgtaaagtg cattttggcg 3420 aatcgcaagc gaggaaaagt tgacggattg ttggctccga ttccgaaagg tgatgtgcct 3480 tttgacacgt accatgtgga tcatcttgga ccaatggacg tcaccgagaa gatgtataag 3540 tacttgtttg tcgttatcga tgcatttacg aagttcacgt ggatataccc gacgaaaact 3600 acgaatgccc atgaagtgat ccagcgattg cagaatcaaa gcgaattgtt tgggaaccca 3660 cgaagaatca tcagtgataa aggggcagcg ttcacctcga atgactttaa ggtgtactgt 3720 gccgagcaga atatcgaaca tgtggagatc acaaccggag ttcctcgggg aaacggccag 3780 gtagaaaggt cgaaccaagt gatactggcg atgctcacga agctcagtgt gaatgattcg 3840 acgaagtggt acaagcacgt tgcgaatata caacggtgga ttaatgcgag cgttcaccaa 3900 agcactactg tgtccccgtt cgaagccatg tttggggtac agatgcgcca tgccggtgac 3960 gtacgattgt gtgagttgtt ggaagagatt cgtgtcgctc agtttgatga tcaacgaagc 4020 ggcatcagag caaaagccag agcagcgatc gagaaagcac agcaggatca gcgacgatca 4080 tataatctgc gagcacgaga gccacctacc tacaaaatcg gcgatgtggt tgctatcaag 4140 cgaacgcagt ttggtccagg taggaagtac gcggctgaat ttctcgggcc gtacaagata 4200 accaacgtca agcccaataa ccgatacgac gtcgagaaga ttaccggaga aggaccgaag 4260 aagacatcga cggcggctag tcatctgaag ccgtaccgga tctggggacc agaacctttg 4320 caggatgacc gagt 4334 // ID Transib-N1_AAe repbase; DNA; INV; 2278 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Transib DNA transposon family from Aedes DE aegypti. XX KW Transib; DNA transposon; Transposable Element; nonautonomous; KW Transib-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2278 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1309-1309 (2011). XX DR [2] (Consensus) XX CC >99% identical to consensus. 5-bp TSDs; usually CGNCG. TIRs are CC 950 bp long. XX SQ Sequence 2278 BP; 795 A; 357 C; 348 G; 778 T; 0 other; cacagtggga cggaatcaaa aaaagtggga catgtgggtt tgcgccaaaa ctatcggttt 60 tagcgttttg gtgtcttctg caatgtttct tctttttaaa agtactttat tatagaaata 120 aaaaaaaatc gttttcaatt gtatacactg agaaaaaaaa attaactttt ttattttttc 180 tcaaaattga aaactaccta aggaagtatt gtaggtaacg tcatttgaag aaactttgtc 240 gaagacgaaa aaattctagc tctcatactt actaagttat atggtaaaaa tgatgttaac 300 ccccttaaaa atgatttttt tacaatatct tttttatatg atttttttaa tttttacaat 360 gttctacaat gttgtagata ctccaaaaat acacattttt gctgaagaga ctaaagcgct 420 atctcttatt ttgaaagagc tatgaactat ttcttctttt ttatacgtca cctttaaggg 480 ttaacagctt gcaaagttgc aggtagtagc agttaatttt gacgttatac tgttcagtaa 540 cttacacact ttcaaaacat atatgaacca gagcgggatc caatgggatc aataatgttt 600 tttaagttta acgatcttga tgttataaca taatattgca gttttcaaca attatgatgg 660 acatattatg cttctgaata attttatatt gcttttgagg ttcatatgta ttgcaaaaca 720 atagtaagca attggtttca atattggagt acctttttta actcatgtta gaatattgat 780 ttattttcgg agatgtttcc ttcatccaaa tttagttcat atttgttcaa caaattcatc 840 gaattctaat attttcttct gctggagtta taattaaagc tcatttaaat cattttcttc 900 ccaatcttca tcgtcggagg aatttgacca atcgttgtga gaaacatctg ctcagctata 960 tgtaaagcat tacgactggt acaaaatgcc atccagtgtt caccgtgttc tgattcatgg 1020 cgctgaaata gtcagacatt gcatcgttcc agtcggaact ttatctgaag aacctcaaga 1080 atgtcgaaat aaggacatca aaagttttcg cgaaaatcga tcaaggaaat gctcacggta 1140 aacttttcaa atatttatgt cttctcaaat aataaagcat acttttcttt ttaacagtgt 1200 agcaaccaat caggacattt tcaaccgcct catggtatct tcggatccgc tgatttcatc 1260 catgcgaaaa attccaaaga ggaaaagtat tccgtttcga aaagaagtat tggaatgttt 1320 gaaagaacca gatgtttctc acaacgattg gtcaaattcc tccgacgatg aagattggga 1380 agaaaatgat ttaaatgagc tttaattata actccagcag aagaaaatat tagaattcga 1440 tgaatttgtt gaacaaatat gaactaaatt tggatgaagg aaacatctcc gaaaataaat 1500 caatattcta acatgagtta aaaaaggtac tccaatattg aaaccaattg cttactattg 1560 ttttgcaata catatgaacc tcaaaagcaa tataaaatta ttcagaagca taatatgtcc 1620 atcataattg ttgaaaactg caatattatg ttataacatc aagatcgtta aacttaaaaa 1680 acattattga tcccattgga tcccgctctg gttcatatat gttttgaaag tgtgtaagtt 1740 actgaacagt ataacgtcaa aattaactgc tactacctgc aactttgcaa gctgttaacc 1800 cttaaaggtg acgtataaaa aagaagaaat agttcatagc tctttcaaaa taagagatag 1860 cgctttagtc tcttcagcaa aaatgtgtat ttttgaagtt tctacaacat tgtagaacat 1920 tgtaaaaatt aaaaaaatca tataaaaaag atattgtaaa aaaatcattt ttaagggggt 1980 taacatcatt tttaccatat aacttagtaa gtatgagagc tagaattttt tcgtcttcga 2040 caaagtttct tcaaatgacg ttacctacaa tacttcctta ggtagttttc aattttgaga 2100 aaaaataaaa aagttaattt ttttttctca gtgtatacaa ttgaaaacga ttttttttta 2160 tttctataat aaagtactct taaaaagaag aaacattgca gaagacacca aaacgctaaa 2220 accgatagtt ttggcgcaaa cccacatgtc ccactttttt tgattccgtc ccactgtg 2278 // ID hATx-6_SM repbase; DNA; INV; 2274 BP. XX AC . XX DT 04-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposons: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-6_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2274 RA Jurka J., Tempel S. and Bao W.; RT "A distinct, diverse family of hAT transposons from Schmidtea RT mediterranea."; RL Repbase Reports 8(2), 24-24 (2008). XX DR [1] (Consensus) XX CC TAGATTT-termini and 8 bp TSD. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 995..2110 FT /product="hATx-6_SM_1p" FT /translation="MVKYGRDCPSESQLCLCHRLHLAVVETFYKKKVFESN FT DVDIIEEDDDDDFQDNEFFDDDVNEEEFADDEDVDIQKNLKNVRILIKFLK FT KSSVRNSILQKKIIDEIGHELELQLDVKHRWNSIHPMLEKLLRVRIPIQNT FT LIELNALHLVSGIDFEKFSSLMNAIKPLTLAVETLGRQDANLLTAKGAMNF FT LYNKLNECNDVICSTLKLNIERRMKERENSTIIDTLACLTNNSIKPSKIVL FT EFASNLFARLFESSSSISPSSSPEHVDLNTDAPVSLEAELDAAINYVTCEN FT FQKSNLNIKDDFIIFKKTGQRPEKLEKLFEALTTIKPSSTEAERTFSVSSH FT FCGKLRTRLSDYSLNCLVFLKYLYINEKK" XX SQ Sequence 2274 BP; 819 A; 339 C; 387 G; 708 T; 21 other; tagagtttga cggttcggat tatccgtatc cgatatccgc gaatttgcag acaaaaatcc 60 gtattcggta tccgctcaaa tatactcgcg attatcggat agtgaaaatt attttaaatt 120 atgtatagaa taaaatttta aatagaataa aaaatcgata taatttgtta atgcgaaatt 180 gttagtaaag ctttcaaatc aaagaattga aaaaaatcaa aggtaattat aattaatttc 240 tttttcaccg taaaacttgt attaaattta tattgtacta ctatttagcc catggcacca 300 tcagctcctg tttgggaata ttttraaaaa attccttcta ccaataaaaa agttgccaaa 360 gctcagtgtt taaagtgtat ggtttttatt tcaaatagta atggaagcac gggwgcwatg 420 mtaaaycatc tgaaactggt wcacaaaaty cattgtaaac gggataatga tgaatccgtt 480 ttgggggagc aagaacaaaa cttcaagaaa atcaaagagc ttcaaattca atgatgratt 540 ttgttaarcg gcacagtttg gaagagatag tgtcaaartt agctgctgaw gayagwattt 600 caatwcgtrc tataacaaat tcctcattta ttmgtgaatc tatttctaga caggggatat 660 aacttaccaa actgtgaaac tagtgtaatg aaattgattc atactaattt cgaagtaaag 720 aaaacaagag gtaataaaaa aattaacaac aattaaraat gatggtaata agtttagtat 780 aacccttgat gagtgggtaa gtgcamgaaa tcgtcgsttt attaatatwa atgtacatgg 840 aaataatgaa aattgcaaca atctaggttt ggtgagagct actggttcgt gtactgcagc 900 tgtagtgtta gaaattgtaa ccaagcattt agagttgttt aatttgtcgt tccaaaacga 960 catcgtttgt actacaaatg atggtgccag cgtaatggtg aagtatggga gagattgtcc 1020 atctgaatca cagttatgtt tatgtcacag gctgcatttg gcggtagttg aaacatttta 1080 taaaaagaaa gtttttgaat caaatgatgt tgacattata gaagaagatg atgatgacga 1140 cttccaagac aatgaatttt tcgatgatga tgttaatgaa gaagaattcg cagatgatga 1200 agatgtcgac atacaaaaaa atctgaagaa cgtgcggatt ttgattaagt ttttgaaaaa 1260 atctagtgtt cgaaattcaa ttttacaaaa gaaaattatc gatgaaattg ggcacgaact 1320 agaacttcaa ctggatgtca aacacaggtg gaactctatt catcctatgt tagaaaaact 1380 actcagagtt cgaattccaa ttcaaaacac gctcatcgaa ttaaatgcac ttcacctagt 1440 atctggaata gatttcgaaa aattttccag cttaatgaat gcaataaaac cattaacatt 1500 agctgttgaa acattgggga gacaagacgc caatttatta acagcaaaag gagcgatgaa 1560 ttttctgtac aacaaattga atgaatgtaa tgacgtgatc tgtagtacat taaaattaaa 1620 tatcgaacga cggatgaaag aacgagaaaa ctctaccatt attgatactc tggcatgtct 1680 caccaataac tctataaaac catcaaaaat tgttttggaa tttgcatcta atctgtttgc 1740 tagacttttt gaatcttctt catctatatc accttcatct tcaccagaac atgtagattt 1800 gaacacggat gcaccagttt cgttggaagc agagctagat gctgcaatta attatgttac 1860 ttgcgaaaat tttcaaaaat ccaacttaaa catcaaagat gactttataa tatttaagaa 1920 aacaggtcaa aggcctgaaa aacttgaaaa attatttgaa gccttaacaa caataaaacc 1980 atcatcaacc gaagccgaaa ggactttctc tgtttcctcc cacttttgtg gcaaacttcg 2040 aacccgttta tctgattatt ctctaaattg cttagttttt ctaaaatatt tgtatataaa 2100 tgaaaaaaar tgaataaatt ttcatcttaa attgacaatt tttcgagttt tttatctttt 2160 ttttttaaat tagctatttc caaaatatcc gtatccgaac ggttaaccgc ggttttcgaa 2220 taatgaaaac cggtattcgc ggatactaaa aaaccggata ttttggaaac tcta 2274 // ID BEL-72_AA-LTR repbase; DNA; INV; 612 BP. XX AC supercont1.274; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-72_AA_; KW BEL-72_AA-I; BEL-72_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-612 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.274; Positions 182461 181850. XX SQ Sequence 612 BP; 209 A; 90 C; 110 G; 203 T; 0 other; tgttacgacg tacaagcgat aggccctgta ggtagttaga tcacgtcaaa gtcaaacgag 60 gtgataatgt tgattgttga gaagctggaa ggaaagatat atgactcagc gtgcatacaa 120 gtgatgaact aaatatcagc taaattccta aaatttattg tttttatcta cttttagatt 180 gaattattac gtgctatagt gtaatgaatg aattccctaa attgccttat aaattgtgtt 240 cctaaaggtg gtgaacttaa tgctttaaaa accatttgta cgaaataacc taaaactatt 300 tacaatttgt atagaaacct acatcgaaga actttgtggc agtgattgaa ttagtatttt 360 catttatttt ttataattgc acagttagtc gtaagttaag cataacacta tttgaattga 420 tattcacact aattatgttc tacctatagc aaatagtacc gagagaagaa atcatccgca 480 agtgcgttta caccggttga gtaggaaacc aatttgtaag aattcgttag tttgtaagca 540 ttgtcaaaat ctaatatcgc aatatatttc agtcgagttg ctagttaaac acaactacgg 600 gaagttttca ca 612 // ID MSAT-1_CQ repbase; DNA; INV; 174 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE Minisatellite-type sequence from the southern house mosquito: DE consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-174 RA Jurka J.; RT "Tandemly repeated DNA from the southern house mosquito."; RL Repbase Reports 11(1), 629-629 (2011). XX DR [2] (Consensus) XX SQ Sequence 174 BP; 40 A; 65 C; 27 G; 42 T; 0 other; ccgaacttct tttcaatgat tccagaatct tccaccactt tctcgcggat tctcgcgcgc 60 gacgtcacac accgatcccc ccgacgaaca ccacacgact aatggcccaa cttccaaggc 120 ctcgacgcga acttcttttc aatgattcca gaatcttcca ccactttctc gcgg 174 // ID Gypsy-87_AA-I repbase; DNA; INV; 5147 BP. XX AC supercont1.127; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-87_AA_; KW Gypsy-87_AA-LTR; Gypsy-87_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5147 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.127; Positions 1727926 1722780. XX CC 'GGTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 750..2198 FT /product="Gypsy-87_AA-I_1p" FT /translation="MLIVCLFSRRGRSTLMLDNLIVADVIRTLGLQLQTFE FT YSSHTDDVGIEWRKWLRSFETMIRASRISDDEWKRDLLLHYAGPSVQQLFD FT TLPELPDSDMRGPLVNIEQYTPNMTSYEEAVSKLGAFFLPKANTTYERHLL FT RQMKQKVGESIDGFTVRLRIQAERCGFGDKVEENVKDQIIQNCQSTTLRRE FT LLKRGDATLEEVLRIAKIFETVAQQEKSFSGVEQKPPISDVNKIEVKTFNR FT RNRFGQPVSVECHRCGYFGHMAKDVKCPARGKSCNKCGGRDHFAKKCRKRK FT HTVEASAKRDFVRQENIDCDARESESTNSNTVKHIVESETEYVFNVTTPDS FT DGELKCVIGGVSTSAVIDSGSKYNLLSDSIWKTLKACKVVVRNQTREVSKV FT FKSYSGNNLPVLGAFTATVQLGERNGPAEFYVVQGNGKLLIGRDTATEMGV FT LKIDTSVNKVDAELPETTEKPSSSTIEGSSTTNELSKM" FT CDS 2844..5093 FT /product="Gypsy-87_AA-I_2p" FT /translation="MFGINCAPELFQKIMEQILSGCEGCVNFIDDVIVFGA FT DQHEHDVRLQMVLRRMNEMNVLLNNSKCIYGVTELKFLGHLLSDNGIKPDT FT DKLESIKNFREPKTPEETRSFLGLVNYVGKFIPNLATLTEPLRQLTKHQHK FT FVWSREHQIAFDELKRHMINPCTLGYFDGDDRTQLIADASPVGLGAVLIQT FT NNRGPRIISYASRSLTDVEKRYAQIEKEALALVWAVERFHYYLFGRSFDLI FT TDHKPLEAIFGPKSKPCARIERWVVRLLTYKGKVIYRPGKTNIADPLSRLA FT ITSDQVGKPFEAYTEHYVNWVASNAAPIALKLREIERESEMDPTIQAVGIG FT LQQGEWSDDAAPFKLFASELCFAGKILLRGTRMVLPEKLRARTLDLAHEGH FT PGMTIIKQRLRVKVWWPKLDSQVEKYVKSCRGCTLVAAPSAPEPMQRKELP FT SHPWQHVAIDFLGPLPSGHYLFDYYSRYIEVEIMKKIDSTETIKHLTTIFA FT RFGLPISITADNGPQFVSEEFRDYCASCNIKLINTTPYWPQQNGEVERQNR FT SILKRLSISQATNTDWVSELNKYLLMYRSSPHSTTGKTPSEMMFGYNIRDK FT VPSINLPKEIDEETADRDKEKKEKGKLYADERRNARPNPISEGDKVLVKRM FT TKPNKLATNFEPVVYDVIKRRGGDIEVASEETGTSYRRHVSHLKRIDGKPE FT STRICDNPPDTNSLDAQGSGFTKRRSADSSQSESKSKRTTRLPGYLRDYV" XX SQ Sequence 5147 BP; 1600 A; 966 C; 1242 G; 1339 T; 0 other; ttggcttcga ggaataaaat ggagcattcg cttggtaagt aaaattgaag aaaagtatgt 60 tttcttttgg aaacattgtt tatgaataaa tagtgtaatg aatctaagcc attagtaatg 120 ggtcagtcgc ttaaatgcgg gaaatttgta tgtctgacac cattctgcgg tagagcaggt 180 ggaattttaa aatagatcag acgcttaaat gcggaacatg gaagaggctg attctattct 240 gcggtagagc aggtactttt cattgcaaat agaacagacg cttgaatgcg gtatgaaatt 300 gctgattcta tactgtggta gcgagagagg caaggaatga acgagacgcg aagagaaaaa 360 gtagagatga agtgaaaaaa aatttgccat tattagatgg acgcaattgt tttgccaggc 420 gttacgagct atttcgatgg agtgaaaatt tgtgattgtg accactatta ccagaacgca 480 aatgttttga tagttcgagc ttcatgtcca agttgaaatc aatgcactat tagatgaacg 540 caattgtttg acaggtgaca acttgaaggt tctcagaaca cgatttgcta attgagctga 600 attgcgagtg tgaagaatat tgaatgtttg tgttttggaa aacctcgtcg tcctgtagat 660 agtgtaaatg gtgattgatt tgatgaaaaa cattatagtg taagatgaat tcagatataa 720 taatgctgtg actgagatga gccaaccata tgttgattgt ttgtttgttt tcacgtcgtg 780 gccgaagcac tctaatgttg gataatctta ttgttgcaga tgtgatccga acgctgggtt 840 tgcagttgca aacatttgag tactcgtcgc atactgacga cgtcggtatc gagtggcgca 900 aatggttgcg atctttcgaa acaatgatcc gggcaagtag aattagtgat gacgagtgga 960 agcgggattt attgctacac tatgctggcc ccagtgtaca gcagctattt gatacgttgc 1020 ccgagttgcc ggatagtgac atgcgtggac cgctagtaaa tattgagcaa tacacaccaa 1080 acatgacaag ttatgaagaa gccgtgtcga aattaggtgc gtttttcctc ccgaaagcaa 1140 atacaacgta cgaaaggcac ctgctacgcc aaatgaagca aaaagtcggt gagtccatcg 1200 atggtttcac ggtaaggctg cgcattcagg ctgaaagatg tggctttggc gataaggtcg 1260 aggaaaacgt gaaggatcaa ataattcaaa actgccaatc gacaacatta cgcagagaat 1320 tgcttaagcg aggtgatgcc actctagaag aagtgctacg gattgcgaag atctttgaga 1380 cagtggcgca acaggagaaa tcattcagcg gtgtcgaaca gaaaccaccc atcagcgacg 1440 tgaacaaaat cgaagtgaaa acgttcaaca gaagaaaccg ttttggtcag ccggtaagtg 1500 tggagtgcca ccggtgtgga tactttggac acatggccaa agacgtgaaa tgcccagcaa 1560 ggggaaagtc gtgcaataaa tgtggtggcc gcgatcattt cgcgaagaaa tgtcgtaaac 1620 gtaagcacac tgtggaagcg agtgcaaaac gagattttgt tcgacaggaa aatattgatt 1680 gcgatgctcg tgagtccgag agcacaaatt cgaacacggt aaagcacatt gttgaatcag 1740 aaacggaata tgtgttcaac gtcacgacac ccgacagtga tggtgaattg aaatgcgtaa 1800 tcggcggcgt atcaacatcg gctgtaattg actcgggctc gaagtacaat ttattgagcg 1860 actcgatatg gaaaacgttg aaggcgtgta aagtagtggt ccgcaatcaa accagagaag 1920 tgtcgaaagt attcaaatcc tacagcggca acaatttacc agttcttggt gcatttactg 1980 cgaccgttca acttggtgaa cgtaatggac cagctgaatt ttacgttgtg caaggtaatg 2040 gaaagcttct gattggacgt gacactgcta cagagatggg agtcttgaag atcgacacat 2100 cagtcaacaa agttgatgcg gaacttccgg agactacaga gaaaccatcg tcatctacca 2160 tcgaaggatc gtcgaccact aacgaactat caaagatgtg atagtggaca taccgatcaa 2220 cgcagaagta acgccggtga ttcaaccgta tcgtcgcatt ccaatggcac tggagaactt 2280 agttgatgaa aagatcgatg aacttttaac tcaaggtgtc atcgaaaaag ttaatgaacc 2340 atccaaatgg gtatctccga tggtggtggt gcccaaaggt gacgacgtac gcatatgtat 2400 cgatatgcgg cgggcaaacg aggcggttgc aagggagaac catccgttgc ctaccatcga 2460 agatttcctg ccacagcttg ccaaagcgaa agtgttttcg agactggata taaagaacgc 2520 gtttcatcag gtgagctaag tttgttttct attttctcga ttgacattat tttgaattga 2580 cagggttctt aaccgatcta gttgaggatt ccgaattact tttttttatc tatttttttt 2640 tttgttctta aattctgatt gattttacaa attgattttg cttagaaaac cattgtcaat 2700 tgcttcaacg ttttgtctaa cataaatgaa gtttgaactg aaaacttata cgatgagttt 2760 tattttaggt ggagatatcg cagaaatccc gagagataac aacctttatt actcgtaaag 2820 gtttgtacag gtacactagg ttgatgtttg ggatcaactg cgccccggag ttgttccaga 2880 agattatgga acaaatcctg agtggatgcg aaggttgtgt aaactttatt gatgatgtca 2940 tcgtattcgg tgctgatcaa cacgaacacg atgtgcgact gcaaatggtt ctccgtagga 3000 tgaatgagat gaacgtcttg ctgaacaaca gcaagtgtat atatggtgtt accgagctga 3060 aatttttggg gcaccttttg tcagataacg gcattaaacc tgatactgac aagttggagt 3120 cgattaaaaa ttttcgtgaa ccaaaaactc ctgaggaaac caggagcttc ttagggttag 3180 taaattatgt gggaaaattc atcccaaacc tagccactct aacggaacct cttcggcagt 3240 tgactaagca tcagcataag ttcgtgtgga gtcgagaaca ccaaatagct tttgatgagc 3300 tcaagcgaca tatgataaat ccttgtactt tgggctattt tgatggcgac gatcgtactc 3360 agctgatcgc cgatgctagt cccgtgggtt taggggctgt tttaatacaa accaataata 3420 gaggtccacg aataatttca tatgcgagta gaagtttgac ggacgtcgaa aagcgatacg 3480 cacaaataga aaaagaagcg ttagcgcttg tgtgggccgt tgaacggttc cattactatt 3540 tgtttggacg ttcatttgat ttgattactg accacaaacc tttggaggca atttttggac 3600 caaaatcaaa accctgtgcc cgaatcgaaa ggtgggtggt tcgtttgtta acgtacaaag 3660 ggaaagtcat ctaccgccca gggaaaacga acatagcaga tcctttatcg cgactggcta 3720 ttacatccga tcaagttggg aaaccgtttg aagcatatac ggaacactat gtaaattggg 3780 tggcgagtaa tgctgcacca atcgcattga aattgaggga aattgaacgg gaatctgaaa 3840 tggatccaac aattcaagca gtcggtatcg gattacaaca aggagaatgg tccgatgatg 3900 cagccccatt taagctgttt gcatctgagt tatgctttgc cgggaaaatt ttactacgtg 3960 ggactagaat ggttctacct gaaaagctga gagcgcgtac cctcgatttg gctcacgagg 4020 gtcatccggg gatgacgata ataaaacagc gattacgtgt taaggtttgg tggcctaagc 4080 tggacagtca agtcgagaag tatgtcaaga gctgccgtgg atgtacatta gtagcggctc 4140 catcagcccc cgagcccatg cagcgaaaag agctaccatc acacccttgg caacatgttg 4200 ctatcgattt cttggggcca ctgccatccg ggcattatct tttcgactat tatagtcgat 4260 atatcgaagt ggaaataatg aaaaaaatag attctaccga aacaatcaag catttaacta 4320 caattttcgc ccgatttgga ctgccgatat caattaccgc ggataatggc cctcaattcg 4380 ttagcgagga gttccgggac tattgtgctt cctgcaacat caagttgatc aataccactc 4440 cctattggcc gcaacaaaac ggcgaggtgg agcggcaaaa taggtccata ttaaagaggc 4500 tctccattag ccaggccaca aatacagatt gggtaagtga gcttaacaaa tatttgttaa 4560 tgtacagatc atccccacac tcaacgacgg gaaagacccc atctgagatg atgttcggct 4620 acaatatccg tgataaggta ccatccatca atctaccaaa agaaattgac gaagaaacgg 4680 ccgataggga caaagaaaaa aaggagaaag gaaaactata tgctgacgaa cgacgtaacg 4740 ctagaccaaa tcctatctca gaaggcgaca aggtacttgt taaacggatg acaaaaccta 4800 acaaattggc tacaaacttc gagcctgtgg tctatgacgt catcaaaaga agaggcgggg 4860 atatagaagt tgcatctgaa gaaacgggta catcatatcg taggcatgtt tctcacttga 4920 aacgtattga tggtaaacct gagtccacaa gaatttgtga caatcctcca gacaccaatt 4980 cattggatgc tcaaggaagc ggttttacaa aacgaagatc ggctgattca tcgcaatctg 5040 aatctaaatc aaagcgaaca acacgactac caggatactt gcgagattat gtgtaagctc 5100 taaacccgtc atgagtaatc taaaataaat taaaaagaga aaagtga 5147 // ID BEL-4_SI-LTR repbase; DNA; INV; 747 BP. XX AC AEAQ01012976; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_SI_; KW BEL-4_SI-I; BEL-4_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-747 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01012976; Positions 6898 6152. XX SQ Sequence 747 BP; 247 A; 157 C; 135 G; 208 T; 0 other; tgttatgtac gtgacacatt aaaagagtgt catgcacgac acattcacac acaaatatat 60 ataaacatat ataaatgcgt acgttataat tgtaaatgaa atatacattc acacacatac 120 acacatacat gtatgtcgga aaacggacga cgggtgaact tagataaaag agaattctcg 180 ggcaattcga catatttggc acttcgagat ttcctgttac taagcgaatt tatgtatcgg 240 tattccgcgt taccatgcac acgcttgttt attaaataag tccaatttgt tatcggtgtt 300 ttttgtccaa aatacgcgtc tcgtcctgac tacgctttcg accaccgaca agagcagaca 360 taaacgtgtg tgaattttca tttcagtaag accattaata cttgagtctt cttccaagag 420 tgataaatgc gatcatatcg ctacgcgata aaatacgcgc ttgttacaac gtatgcgcga 480 gtgacaaaat aacagaggaa cagcatccct tcgacgaatc cgcgacccag ggtagtgact 540 cctaggcgac gtccttacgg actcagggcc gctgtcttta ggcgtcatcg cggtagatac 600 acacatctcc tttagacgtc accacggtat gatgtaaata cacgaacaaa taaataccaa 660 ttataatatt ggcaatttgt acaatattaa ataaaataaa tctttaaaaa atccagtgcg 720 ccaaatttaa gtgttttcca ctgaaca 747 // ID BEL-639_AA-I repbase; DNA; INV; 6108 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-639_AA_; KW BEL-639_AA-LTR; Pao_Bel_Ele103; BEL-639_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6108 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5157-5717] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 491..1624 FT /product="BEL-639_AA-I_1p" FT /translation="MLAKPSRKGRVEQREQLKEPFDQAAKSLKASSVKSAT FT SRKSSRVQLELQLRKLDAERTLLEDKRKLIEQQYSVLQELAELEDRVEDVE FT DGDQVDGDSKVEGWLRDGLNSDYEAESTDSSEEYPGTEDSDDEDVGNSSED FT DIQEDPRSNASHFNPKGRSTPRIDQTKSQRRNVTHSNQLACSLTRNQLAAR FT QVVAKDLPTFTGNAEEWPIFFSTFESTTRMCGYTNDENMIRLRNCLRGEAY FT AAVKSFLLHPSTVDRAIGALKLRFGQPRFVIHSLKEKILAMPPLRPDSINK FT MIDFALAVQNLEATIDACGQKELMRDASLLGDLVGKLPASAKLEWARHTRC FT LRKVNLSAFSKWIYDMAEDACLVAEPRKNQESPQS" FT CDS 2244..6107 FT /product="BEL-639_AA-I_2p" FT /translation="MDEELAKELNLSGERHPLCLKWTGGLHRTEDGSRSVQ FT IEISGIKGKRYHFEDIRTVSELQLPCQTLDVQQLQSEYPYLRGVPVQSYRE FT VRPRLLIGIQHAVATLVRKSREGKPGQPIAIKTNLGWTIYGGAPANQSMNM FT VHYTNHVSFCDHEIENTNANMDRAMKEYFSLESLGIMSPAKHVRSQEDERA FT MSLLHKLTHFNGERYETGLLWRKDNEQLPNNKAMALKRFNHLERRMEKDPE FT LARVVKEKLADYVSKNYVRKLTSEELAEIHERAWYLPTFPVINPNKPGKIR FT IVWDAAATTNGVSLNSALLAGPDLLEPLIRVLYRFRQYRFAICGDIREMFH FT QVAIRNEDQHSQRFFLRDDEGQKEPSTYVMQVMTFGASCSPATAQFVKNMN FT ADRFSNTHPSAARAIVRCTYVDDMLSSAETEQEAIELAKSVWFIHNEGGFE FT IRNWMSNSSTVLAAIRGNSNVEKSLDLTSTLATEKVLGMWWCTKADCFTYK FT INWDRLGEDLLAGKRCPTKREILRTMMTIYDPLGLISHYLMYLKVLLQEIW FT RTGVGWDEKVTQESYDKWQTWLKLLPEIELLQIPRCYRLQTSTDEYTEIQL FT HTFVDASENGMAAVVYLRFAEAGTVECSLVTAKTRVAPLKYLTIPRLELQA FT ALIGARLAHSVIQALDITVARCVFWSDSRNALSWIRADHRRYSQFVAARVS FT EILDLTNAADWKWVPTKWNVADEGTKWQRRPSFTSESCWYKGPEFLLHPEN FT EWPATPVKFEESCEELRANIHVHYEATKLEFPVDKFSEWKRLLHTTAYMLR FT FTRNARPKKFSRIGGCITSEEIRDAEVFHFRTAQANKFSEELSIMRREESK FT AVIPKRSPLFKLSPFIDEQGVLRMRGRTAACPYLLPDAINPIILPREHPVT FT HLIVRSYHVKFHHQNHEAVINEVRQKFCISRLRRVYAKVRLDCQRCKLRDT FT RPRPPAMADLPPCRLTAFIRPFTHTGVDYFGPMEVCIGRRVEKRWGVLLTC FT LSIRAVHIELASSLTTNSCIMALRNFIARRGTPAVFYSDRGTNFIGSEREL FT KQTLKTVDQNRMAQEFVSSNTSWSFNPPAAPHMGGSWERLVQSVKRTLSEL FT KPCPRPNDEELRNALIEVEGILNARPLTHVPIEDEAAPALTPNHWLLGSSD FT GLKPWSLLDNDSIALRRGWHQSQVLANHFWERWLREYLPEITRRSKWYQKV FT PPIQEGEIVLIADPNLPRNCWPKGRVIGTVNRDGQVRKVTIQTMRGIYERP FT AVNVAVLDVKGPEELANSEAESANWGG" XX SQ Sequence 6108 BP; 1735 A; 1447 C; 1612 G; 1312 T; 2 other; cgctagtaac aaattttaaa ttcgtttacc gagttgaacg atgtcggcac cggctggagc 60 ggtaggcatt agcctgccga tcaaggttta atcaagagct ctgcttttca atcaaatgac 120 gcgcatgaag acccccctct gaacccaacg gagaaatcaa ataaaaaccc tcttctcaaa 180 cctcaccaaa aagtaccctc tgtcaaaagc aaatcgtcaa aaaatcggga taaatcatcg 240 gcggtcggtc atagttgtga aacgtgtagg tcagcggaca acagccgtat ggtgcagtgc 300 gatgattgtg acggttggca tcattacatc tgcgtgggtg tcgatgatcg aatagtgaag 360 aaaccttggc ggtgcgtgat gtgtgaggag gtttggacga aacggaagaa tatcaagcat 420 acgaaagtgg aagagggaaa gaagaaaccg aaagccactg gaaaaaccaa ccaggaaggt 480 ggaccgaagg atgttggcaa agccaagcag aaaaggacga gtggagcagc gggaacaact 540 gaaggaaccg tttgaccagg cggcgaaaag cctcaaagcg tcatcagtta agtcggcaac 600 ttcgcgtaaa tcttcaagag tccaactcga actgcagctg cgcaaactcg atgctgaacg 660 gaccttgctg gaagataaac gcaaattgat cgagcaacag tacagtgtcc tgcaagagct 720 tgcagaactt gaagatcgcg ttgaggatgt ggaagatggt gaccaagtag acggggattc 780 caaagtagag ggttggctgc gcgacggact caactcggac tacgaggctg agagtactga 840 ctcaagcgaa gagtacccag gtactgaaga ttcggatgac gaagacgtcg gtaattcatc 900 agaggatgac attcaagaag atccacgtag caacgcatcg cacttcaacc cgaaaggacg 960 ctccacgcca aggattgacc aaacgaaatc gcagaggaga aatgttacgc acagcaacca 1020 gttagcctgt agcctgactc gtaaccaact agcagctcgt caagtcgtgg cgaaagatct 1080 gcccacattc accggcaatg cggaagagtg gccgattttc ttttccacct ttgaaagcac 1140 gacacgaatg tgtggataca ctaatgatga aaatatgatt cgattgagga attgcttgcg 1200 tggagaagcg tacgccgcgg tgaagagctt tctgcttcac ccgtcaacgg tagacagagc 1260 aataggtgct ctcaagctga gattcggtca accacggttt gtcattcatt cgcttaagga 1320 aaagatactc gccatgccac cgctaagacc cgattcaatc aacaaaatga tcgactttgc 1380 gctggcagta cagaatcttg aagctacaat tgatgcgtgt ggacaaaagg agcttatgcg 1440 agatgcgtcg ttgctaggcg acttggttgg aaagcttccg gcctcggcga agttggaatg 1500 ggcaagacat acccgatgct tgcgaaaggt caacctgtcg gccttcagca agtggattta 1560 cgatatggca gaagatgctt gtttggttgc cgaaccacgc aaaaatcaag aatcacccca 1620 gagtcakgag ccacgtaaaa agtccaaggc gtttctgaac acccacaccg atcagccgaa 1680 ctacaggaaa gaggaacggc acgcagctgg tagcaacagt gcttctaaac cgacgggtgc 1740 atataagaaa accccgggcc ggcggggcgc ggcggcgccg gcgcgggggc gcccgcgccc 1800 cgccgggggc cggcgccggg cgcgccggcg ggcgcggcgc ggcggcgggc cggccacaat 1860 ctcggccatg tattgtctgc aaaggaggtt gcgcttctct tgcaaagtgc ggaaggtttt 1920 tggatctctc ttatgacgga aggtgggcaa cgattcgcga agcaagagtc tgtcggaagt 1980 gtctgacgca acacaaagga ggatgcgaat caaagcagtg tggtgtgaac ggatgtggct 2040 ataaacatca tcccttgctg cacaaggagt tgaatgtaga atcatcagcg attgagaggc 2100 cacagcatga agagcagtcg tgtaacactc atcagtctgg ctcaagttcg atccttttcc 2160 gttatattcc agtcgtggtg tacggatgcg gaattgtagt tcactgctac gcctttctag 2220 atgacggatc gtcgaaaacg cttatggacg aagagcttgc aaaagaattg aacctatctg 2280 gagaacgtca tccactgtgc ctcaaatgga ctggcggctt gcatcgaacg gaagacggtt 2340 cgcgtagcgt acaaatagaa atctccggta taaagggaaa gcggtaccat ttcgaggata 2400 tccgaactgt ttcagaactt cagttgccgt gccaaacgct cgacgtgcaa caactacaat 2460 ctgaataccc ctacctaaga ggagttcctg tacaatcgta tcgagaggta cggccgcgac 2520 ttctcatcgg aatccagcac gctgttgcta cgctggtgag gaagagtcgt gaaggaaaac 2580 cgggtcaacc gatcgccatt aaaaccaacc ttggatggac aatctacggc ggggcacctg 2640 ctaatcaatc tatgaacatg gtgcactata cgaaccacgt ttccttctgc gaccatgaga 2700 ttgaaaacac gaatgcgaac atggatcgag caatgaagga gtatttctcc ctcgaaagtc 2760 ttggcatcat gtcaccggct aaacatgtac gttcccaaga agatgaacgc gctatgagtc 2820 tgttacataa gctcacccat ttcaacggag aaaggtatga aaccggtttg ctatggagaa 2880 aagacaacga gcaattacca aacaacaaag ctatggcgtt gaaacgtttc aatcatcttg 2940 aacgccgcat ggaaaaggat cccgagctag cgcgagtagt gaaagagaag ctggcagatt 3000 acgtttcaaa gaactacgtc cgtaagctaa ccagcgaaga gcttgcagag atccacgaac 3060 gcgcttggta tttgcccact tttccggtta tcaatccaaa taaaccggga aagatwagaa 3120 ttgtctggga tgccgcagct acaacgaatg gagtatcact aaactctgcg cttctagccg 3180 ggccagatct attggagccg ctgatacgcg ttctctacag atttcgacaa taccgtttcg 3240 cgatatgtgg tgatatacgc gaaatgtttc accaggtcgc aatacgcaac gaagatcaac 3300 atagtcaacg atttttcctg agagacgacg aaggacaaaa ggagcccagc acttacgtaa 3360 tgcaggtgat gacgtttggt gcgtcctgct ccccggcaac tgctcagttc gtaaagaaca 3420 tgaacgcaga tcgtttcagc aacacccatc catctgcagc aagggcaatc gtgcggtgta 3480 catacgtcga tgacatgctg agcagcgccg aaacagagca ggaagcgata gagcttgcaa 3540 aatctgtatg gtttattcac aatgaaggtg gcttcgaaat ccgtaattgg atgtccaact 3600 cgtcgacagt tctcgcagcc attcgcggaa actcaaatgt tgaaaaaagc ctcgacttaa 3660 cgtctaccct tgctaccgaa aaggtcctcg gtatgtggtg gtgcacgaag gctgactgct 3720 tcacctacaa gatcaattgg gacagacttg gcgaagacct gttggcaggc aagcgctgcc 3780 ccacaaagcg agaaattttg cgtaccatga tgaccatata cgatccgtta ggtttgatat 3840 ctcactatct gatgtacctg aaggtactac tacaggaaat ctggcgaaca ggtgtaggct 3900 gggacgaaaa ggtgactcag gaaagttatg acaaatggca gacatggctg aagcttcttc 3960 cggaaatcga gcttttgcaa attccacgtt gctatcggct gcagacatca actgatgagt 4020 atacggagat tcaactgcat accttcgttg acgccagtga aaatggtatg gcagctgtag 4080 tgtacctccg atttgcggaa gcaggaaccg ttgaatgctc gttggttaca gcgaaaacac 4140 gagtagctcc gctcaaatac cttaccatcc cgagactgga gcttcaagca gcgctcatcg 4200 gcgctaggct ggcgcactcc gtcatccagg cactagacat tacagtcgcc cgatgcgtct 4260 tctggtctga ctctagaaac gccctgtcgt ggatacgcgc agatcaccgt agatacagtc 4320 aatttgttgc cgctagggtc agtgaaatct tggatctgac aaatgctgct gactggaaat 4380 gggttccaac gaaatggaat gtggctgatg aaggcaccaa atggcagcga cgcccctcat 4440 ttacaagtga gagctgttgg tataaaggcc ctgaattcct gttacatcca gaaaatgagt 4500 ggccggctac acctgtaaaa ttcgaggaat cctgtgaaga gttacgagcc aacatccatg 4560 tgcactacga agcaacgaag ctagagtttc cagtagacaa attcagtgaa tggaaaagat 4620 tgcttcatac aaccgcctac atgttgcgat tcaccagaaa cgctcggccg aaaaagtttt 4680 cccgaattgg cggctgcatc acaagcgaag aaattcgcga cgctgaagtc ttccatttcc 4740 gtaccgcaca agccaataaa ttctcagaag agctgtcaat catgcgccgg gaagagagca 4800 aagcagtaat cccaaaacga agtccgctgt tcaaactaag ccctttcatt gatgaacagg 4860 gcgtgctgcg tatgagaggc cgaaccgcag cttgcccata ccttttacca gacgccatta 4920 atcccataat tcttcctcgt gagcatccgg taacgcatct cattgttcga agctatcatg 4980 tcaagtttca tcaccagaac catgaagcgg tcataaacga ggttcggcag aaattctgca 5040 tcagtcggct acgtcgggtg tacgctaaag tcagactcga ctgtcaacgt tgtaagctga 5100 gagatactcg accccgccct ccggcaatgg cagatctacc accttgcagg cttaccgcgt 5160 ttatccgccc gttcactcac actggggtgg actattttgg cccgatggaa gtatgcatag 5220 ggagaagggt tgagaagagg tggggggttc ttctaacatg tctttcaata cgcgctgtcc 5280 atattgaatt agctagctca ttaaccacta actcgtgcat aatggcgtta cgaaacttta 5340 tcgctcggcg aggaactcca gctgtattct acagcgacag gggtacaaat tttattggct 5400 ctgaacgtga gttgaagcag acgttgaaga ctgtagacca aaacagaatg gcacaggagt 5460 tcgtaagctc caatacatca tggagtttca atccgccagc tgctccccat atggggggaa 5520 gttgggaacg actggtgcag tccgttaagc ggactttgtc ggagctcaaa ccatgtccgc 5580 ggccaaacga cgaggaatta aggaacgctt tgattgaggt cgaaggtatt ttgaacgcac 5640 ggccactcac acacgtgcct atcgaagatg aagcagcccc ggcgctcact ccgaatcact 5700 ggctattggg aagctccgat gggttgaaac cgtggtctct actggacaat gactcgattg 5760 cgttaagacg aggctggcat caatctcaag tactcgcaaa ccacttctgg gaaagatggc 5820 ttcgagagta tctaccggag ataactagac gcagtaaatg gtatcagaag gtcccaccga 5880 tacaggaggg tgaaattgtg ctgatcgcag atccgaatct cccaaggaat tgctggccaa 5940 agggtcgtgt cataggtacg gtcaaccggg acggacaggt acgtaaggtg acgatccaga 6000 ctatgagagg aatttacgag aggccagctg tgaacgttgc ggtacttgac gtcaagggcc 6060 cagaagagtt ggcaaactca gaggcggagt ctgcaaactg ggggggag 6108 // ID Kolobok-11_HM repbase; DNA; INV; 2738 BP. XX AC . XX DT 31-DEC-2008 (Rel. 13.12, Created) DT 31-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Kolobok-type family - consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-11_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2738 RA Bao W. and Jurka J.; RT "Kolobok-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2069-2069 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 386..2173 FT /product="Kolobok-11_HM_1p" FT /translation="MANKRNKVRSLKRKYQGNRYSRKEKDVVDCSRKSETT FT SASKLKNKHNFNGATNCDSEVDNFFFICNFRVIKNIFELYSTCSECNSKLT FT FLHNKSVRMGFSIGIIIQCSNCGFESTFYSSPTINQSSKPGSNPYEINLRA FT VMAFREIGRGNEAMSTFTAIMNMPPPLTNHSYNLTNNKLHKVYKEISLQSM FT KAAVSELREILNANASDDEIIDCGISIDGTWQRRGYSSLNGVVAGLSHENK FT KVIDVFSLSKFCMQCEVRKRINNPIDFECWKATHNCQVNHFSSAGSMEAAG FT AQEIFHSSIQKYNLRYTKYLGDGDSSSFSNVVKSKPYGDCIIEKLECIGHY FT QKRVGSRLRQKIKDFKGRLLSDGLKISGKGRLTNKSINTMQNFVGMAIRQN FT KNDLLCMRNSVIAVLYHCTNFPYEVTRHQFCLKGKNSWCKWQSDKVTGKTS FT YRQKINLPVTIMEEIKPIFQELSNPEMLRKCLHGMTQNCNESFNGFIWQRC FT PKATFTARKILEIAVYSAILNYNDGFTSLRYIFKMLGFTGGIYFEKGAFKK FT DKKRLSSMSRKSTDMNKKRRKHLRSIKKGYLDIEKENEDVNFYASGGF*" XX SQ Sequence 2738 BP; 990 A; 363 C; 441 G; 944 T; 0 other; ggtggctcac ccctattaaa aatcaaaaaa aaaaattttt tttttttttt gttttatgct 60 atttcaaaca ttgaagaaca aaatccataa tcaattttta ctactatgtg ctttaaaagt 120 aacaaaaagt acagttattg tggtcaatgc actttcctat tccatagcaa cgccatagca 180 acggtgtaaa caactatttc attaactact tcttaaagat tttcactaga aattttgaag 240 tatttctctg tgttagatct attatgcttt tctactgttc tgcaaagtat atcgttttta 300 aaaaatgtgc tttgtttcat tattctgatg tattttaact aaaactgtct gtctgaattt 360 aactgaaagc tatctaaatc taaatatggc aaacaaaaga aataaagtta gaagtttaaa 420 gagaaaatat caaggaaata gatattcaag gaaagaaaag gatgtagttg attgtagcag 480 gaaatcagaa acaacaagtg catcaaagct taagaataaa cataacttta atggtgcaac 540 gaattgtgat tctgaagtcg acaatttttt ttttatatgt aactttagag ttattaaaaa 600 tatatttgaa ctgtattcta catgcagtga gtgtaatagc aagctaactt ttttgcataa 660 taaatctgta cgtatgggat tttctattgg aattataata caatgttcaa actgtggctt 720 tgaaagtaca ttttattcat ctccaactat aaatcaaagt tcaaaacctg gttcaaaccc 780 atatgaaata aatttacgag ctgttatggc atttcgtgaa attggacgtg gtaatgaagc 840 tatgtctact tttactgcta taatgaatat gcctccacct cttacgaatc acagttataa 900 tcttacaaac aataaattac ataaagttta taaggaaatt agtttacaga gtatgaaagc 960 tgctgtatca gaactaagag aaattttaaa tgcaaatgct tctgatgatg aaattattga 1020 ctgtggcata tcaatagacg gaacatggca acgtcgtgga tattcttccc taaatggtgt 1080 tgtcgctggt ctttcacatg aaaataaaaa ggtgattgat gttttttctt tatctaagtt 1140 ttgtatgcag tgtgaagtac gtaaaagaat aaacaatcca attgattttg aatgttggaa 1200 agctacacat aattgccagg ttaatcattt tagctcagca gggtcaatgg aagctgctgg 1260 agctcaagaa atatttcatt cgtctataca aaagtataat ttaagatata ctaaatacct 1320 tggagacggt gactcaagtt cattttctaa tgttgttaaa agtaaaccat atggtgattg 1380 tattattgaa aaactagagt gtattggcca ttatcagaaa agagtaggct ctcgtttgcg 1440 tcaaaaaatt aaagacttta aaggcaggtt gttaagtgac ggcctaaaaa tatcgggaaa 1500 aggaagatta accaataaat ctattaacac aatgcagaac tttgttggaa tggctataag 1560 acaaaataaa aatgatctac tgtgtatgag aaattctgta atagctgtac tttaccattg 1620 tacaaatttt ccttatgaag ttactcgtca tcaattttgt ttaaaaggga aaaacagctg 1680 gtgcaaatgg caatcagata aagttacagg taaaacaagt tatagacaaa aaattaactt 1740 acctgttaca attatggagg aaattaaacc tatttttcaa gaactgtcaa accctgaaat 1800 gctaagaaag tgtttacatg gaatgacaca aaactgcaat gagtcattta acggtttcat 1860 atggcagaga tgccctaaag ctacttttac tgcaagaaaa attcttgaaa tagctgttta 1920 ttctgctatt ttaaattaca atgatggatt tacttcttta aggtatattt ttaaaatgct 1980 tgggtttact ggaggtattt atttcgaaaa aggagctttc aaaaaagata aaaagcgact 2040 atcaagtatg tcaagaaaat cgactgacat gaataaaaaa cgaagaaaac atttaagatc 2100 tataaaaaaa ggttatttag acattgagaa agagaatgaa gatgtaaact tttatgctag 2160 tggcggtttt tagtaacttt tcacaaaaac acttttcttt tttacattgt tttattttta 2220 aatgcgtttt ttaggacttt cttattttta tacggttttc ttacaaaaac attaatatct 2280 tcagtttggt taaagctttt atcttcaaat tttcacagta tgtgtattaa cttatttatg 2340 agggctggaa ctaaaattgt aatatttcaa gtataaataa attattttca atatagtttg 2400 ttagactgac actggcgatt aattagtttt tgcacattta ttgctgccat tttagttttt 2460 ttccacattt ggaaaaaatt ttagttccag ccctaaagca attatttagc taccatttgg 2520 caaaatatag tgtaataagt gcaaatttta ctcaagatat cattgtttta gataaggtaa 2580 gattttggac aaattatgct gactcagcat caaaaaatgc aaatttcatt tttattttga 2640 aaagttaaag ttttaatgta atttccttat ataaaaaatt aaaaacaagt ttagaacctt 2700 tataaacaaa aagttagatt taatgggggt gagccacc 2738 // ID CR1_Ele37 repbase; DNA; INV; 5091 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele37. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5091 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5091 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 17 CC sequences with >97% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 346..1203 FT /product="CR1_Ele37_1p" FT /translation="MSSSAACDHCAKPIKCEEDCVTCMAFCERMVHLKCTV FT AKLNKAFVRIVHENPNLMWMCDECAKLMKIARFKSTVSSFGEAINAITEKQ FT ESVHAEIKKELAKQGQQIAQLSMRITPLTPILSQEPGTSSRQPPLKRRRDD FT GSVSNKPLVGGTKIVADANVLTVPEPIELFWVYLSRIHPSVKPDAVEQLVK FT DCLHCEGIIKAIPLVKRGIDTNRLNFISFKVGIDPSLREAALNADTWPKGI FT LFREFEDSTSKNLWLPRQNTPAILISPDPASSPFSTPVSGIVPSC" FT CDS 1248..4919 FT /product="CR1_Ele37_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MGALEPLDTVAQAASRHLSRSGPVVEYGNRVSQPSHT FT GKYSSFSSSSPLDQFSRSSAVDPSATTSNRLYSQIAATMASSYLESADNYL FT ISIDGPSIEAHCATQLFSQPGRTVYSFLEAPDSSSEVVPSPAIAQHSRPGP FT LARCGSGVFHRALTGKYIQASVPSSPDAPSIFSVNDDISTSSAAGQPERIV FT ESFMEALDPSATVLPLAAHVHHSRSGPVVGNGERVFQQPDSGEYSPVSTCT FT HSTPDALLHSRISEPSAATSKLDIVLYYQNVGGMNTSIDDYRLAVSDACYD FT IIVLTETWLDSKTLSNQVFGSGYEVFRCDRNPNNSCKRTGGGVLVAVRSGL FT RVKALDNGLWRCLEQVWVSIQFDGRTLFLCALYIAPDRVRDVELINAHCQS FT VFSMMESASPTDEIIIVGDFNLAGVSWKPSHSGFLYPDPECSSFHACAINL FT LDSYSAATLTQINHVVNENNRSLDLCFVSFQEKAPIISEAPCALVKDVAHH FT PALMISIEGYQISEFDDRPASVSYNFVKADHRSIADVLSSIDWATILDPRD FT VEVAATTFSNVLSYVIDRHVPKRIHLQDAHPPWQTKELRQLKSQKRAALRK FT LTKHRTLPLRDDYVRINNEYKRLSRLCYSQHQRDIELKLKNKPKSFWKFVN FT EQRKESGLPSSMELNGEIASTTQEICQLFAEKFANVFNDEGITEDQVSRAA FT SYVPLNGESLSAIDVQFDAITKATSKLKSSNNPGPDGVPSALLKKHIDNLL FT VPLRHLFQMSLSSGTFPSCWKTAYMFPVHKKGSKHDVNNYRGIAALCSVSK FT LFELVVMEPLLSHCQQYLSPDQHGFTAGRSTTTNLLCLTSYITNSMTERTQ FT TDVIYTDLSAAFDKLDHAIAIAKLDRLGVSGNLLCWFRTYLTGRQLLVSIG FT DFKSSSFSASSGIPQGSHLGPLIFLLYFNDVHLLIKGPRLSYADDLKMFLQ FT IRSTTDCHFLQQQIDAFANWCSLNRMVVNPVKCSIITFSRLKQPTLFAYSL FT YGTIIERVNHVKDLGVIMDSQLSFRQHVSYTVDKASRTLGFIFRIAKNFTD FT IYCLKSLYCSLVRSTLEYCSAVWCPAYNNGAERIESVQRRFLRFALRKLPW FT TNPFRLPSYESRCQLIDLELLRTRRDTIRALTIADTLQGRIDCASILEQID FT LNARPRLLRNSSLLRLPLRRTNYSTNGGINGLLKLFNRVASIFDFHLTREF FT LRRRFSSFFAGRNN" XX SQ Sequence 5091 BP; 1285 A; 1338 C; 1076 G; 1392 T; 0 other; tatcatcact gtacgaaagt tggttggttt atgtatcgct ccggaaatta tagtgttatt 60 ccgttattta atcgtattta atcgaatcct ttctgtgtca accgttgaat tgcccgtctg 120 tgaacattaa ttccgtggaa gatccggttt agttggtgat aagattgaga aatcactttt 180 aagattctgt gaaacagtgg tacacttctc tcgcgcgatc ttgttgtttg tgcgcttact 240 acacttttca agcaaccgta cttttgtttt gctgtttacg tacctgtacg ctcagtgcac 300 tttctcgacc accgtacttt caaattagag catctccacc ttaccatgtc atcgtccgct 360 gcttgcgatc attgcgcaaa gcctatcaaa tgcgaagagg attgcgtaac ttgtatggct 420 ttttgcgaga ggatggttca tctgaaatgt acggtcgcaa aactcaacaa ggcgtttgtg 480 agaatcgttc atgaaaatcc gaacctaatg tggatgtgcg atgagtgcgc caaattaatg 540 aaaattgcga ggttcaaatc gacggtttct tcatttggtg aagctattaa cgctattacc 600 gaaaaacaag aatctgtgca tgctgagatc aagaaagaac tagccaaaca aggacagcaa 660 atcgcgcagc tatccatgcg catcaccccg ttaactccaa ttctttccca agaacctggt 720 acttcctcac gacaaccgcc cttgaaaaga cgccgtgatg acggttcggt ttccaacaaa 780 cctctcgtcg ggggcactaa aatagtggct gatgcaaacg tgcttaccgt gcctgagccc 840 atcgagcttt tctgggtgta tctctctcgt attcacccta gtgttaaacc ggatgcggtt 900 gaacaactgg ttaaggattg cttgcactgt gaaggaatca tcaaagcgat tccgcttgtt 960 aagcggggaa tcgataccaa tcgcctgaac ttcatctcat tcaaagtcgg cattgaccct 1020 agccttcgcg aagctgcgct caacgcggac acatggccga aaggaatact gtttcgcgag 1080 tttgaagaca gcacatcaaa aaacttgtgg ctgccacgcc aaaacacgcc tgctatcctg 1140 atttccccag atccagccag ctcgccattt tcaactccgg tatccggcat tgtaccgagc 1200 tgttaacacc tgaatgctgt aaaccagaac gcactgcctg tcgcattatg ggagccctcg 1260 aacctctcga cacagtcgcg caagctgctt cccgccatct cagccgttct ggtcctgttg 1320 tcgagtatgg taacagggtc tcccaaccct ctcacacagg caagtattca tcgtttagta 1380 gctcttctcc gcttgatcag ttctcacgtt ctagcgctgt ggatccctct gccaccacga 1440 gtaatcgctt gtatagtcaa atcgctgcga ccatggcctc ctcatatctg gagtccgccg 1500 acaattatct catttccatc gacggccctt caatagaagc tcactgcgca acccagctat 1560 tttcacaacc aggacgcact gtatacagct ttttggaagc ccctgattcc tctagcgaag 1620 tcgtgccatc tcccgccatc gctcagcaca gtcgtcctgg tcctttagct agatgcggat 1680 caggggtctt ccatcgcgca ctcacaggca agtacattca agcctctgtt ccttcgtcgc 1740 ctgatgcgcc ctcaattttt agtgtcaacg acgatatttc aacttcatct gctgctggac 1800 aaccagaacg catcgtagaa agcttcatgg aagccctcga tccctccgct acagtcctgc 1860 ctcttgctgc ccacgttcat cacagtcgtt ctggtcctgt agtcggtaac ggagagaggg 1920 tcttccaaca acccgactct ggcgagtatt cacctgtttc aacatgcact cattcgacac 1980 ctgacgcact attgcattcc agaataagcg aaccttcggc agcgacttcg aaactagaca 2040 tcgttctgta ctaccaaaat gtcggcggca tgaatacaag tatcgacgat tatcgcttag 2100 ccgtctcgga tgcctgctac gatattatcg tcttaaccga gacttggctt gactctaaaa 2160 cgctttctaa tcaggtattc ggatccggtt acgaggtatt tcgttgcgat cggaatccaa 2220 acaatagctg caaacgcacc ggaggcggtg ttcttgtggc ggttcgctct ggactgagag 2280 tgaaagctct tgacaacggc ctgtggaggt gcttggaaca agtctgggta tctatccagt 2340 tcgacggtcg tacattgttt ttgtgcgcct tgtacattgc acccgatcgg gtgcgcgacg 2400 tcgaacttat taatgctcac tgtcaatctg tgttctctat gatggagtca gcatccccga 2460 cggacgaaat aatcattgtc ggtgacttca atcttgctgg cgtttcgtgg aaaccgtctc 2520 acagcggttt cctctacccg gaccccgagt gctcgtcgtt ccatgcctgt gcgattaatc 2580 tgttggatag ctacagtgcc gccacattga ctcaaatcaa tcacgttgtc aatgaaaata 2640 accgtagtct ggatctatgt ttcgtgagtt tccaagaaaa agcaccaata atttccgagg 2700 ctccgtgcgc cttagtaaaa gatgtcgctc atcatcctgc gttgatgata tctattgaag 2760 gctaccagat atccgaattc gacgatcgcc ccgcctctgt ttcgtacaac tttgtgaaag 2820 ccgatcatcg tagcattgca gatgtcctgt caagtatcga ttgggcaact attcttgacc 2880 cccgcgatgt tgaagtcgct gcaactactt tttcaaatgt tctctcatat gtcatcgaca 2940 ggcatgtacc aaagcgcata catcttcaag acgcacatcc tccctggcaa acaaaagagc 3000 ttcgccaact taagtcgcag aagagagccg ctttaaggaa actaacaaag catcgaacgc 3060 ttcctctaag ggacgattat gtgaggatca ataacgaata taaaagatta agtcgcttat 3120 gttactctca gcaccagcga gatatagagc ttaaacttaa aaataagccc aaatccttct 3180 ggaaatttgt aaatgaacaa cgtaaggaat ctggccttcc gtcgtctatg gagctgaacg 3240 gagaaatcgc atctaccacg caagaaatct gtcagctgtt cgctgaaaaa tttgccaacg 3300 ttttcaacga tgagggcatc accgaggacc aagtaagccg cgcagccagt tatgttcctc 3360 tgaatggtga atcgttgagt gctatcgacg tacaattcga cgcgatcacc aaggccactt 3420 caaagctcaa atcctcaaat aatccggggc cagacggtgt cccctccgca ctcctgaaaa 3480 aacacatcga caacttgcta gttcctctcc gtcatctttt tcaaatgtcg ctttccagtg 3540 gcacttttcc gtcctgctgg aaaaccgcat acatgttccc agtacataaa aaaggaagta 3600 aacacgacgt gaataactac cgaggaattg cggcgctgtg ttccgtctcg aaacttttcg 3660 agcttgtcgt tatggaacct ttgctctcac actgtcagca gtatctcagc cccgaccagc 3720 atggtttcac tgcgggccgg tcaactacca ctaatttact gtgcctcaca tcctacatca 3780 ctaatagcat gacggaacgt acgcagacgg atgtcatata caccgatctc tctgctgcat 3840 tcgacaagct ggaccacgct atcgcgatcg caaaactcga cagacttggt gtcagcggca 3900 atttgttatg ttggttccgg acgtacctca ccggtcgtca attattggtt tcaattggcg 3960 atttcaaatc tagtagcttt tctgcttctt ctggcatccc acaaggtagc catttgggtc 4020 cgttgatctt tctgttgtat ttcaacgatg ttcatctgct aatcaaaggt cctcgtttgt 4080 cttacgccga cgacctaaaa atgttcctac agatccgctc tacaaccgac tgccactttt 4140 tgcaacaaca gattgacgct ttcgctaact ggtgctctct taataggatg gtagttaacc 4200 cagtcaaatg ttcaattata acgttctcac gactgaagca gccgacccta ttcgcctaca 4260 gtctgtatgg gacgattatc gaacgcgtta accacgtgaa agatctgggt gtcatcatgg 4320 actcacaatt gtctttcagg cagcatgtgt cgtacaccgt agacaaggcg tccagaaccc 4380 ttggatttat cttcaggatc gccaaaaact tcacggacat ctactgccta aagtcgcttt 4440 attgctccct cgttcgttcc actctggaat attgttctgc agtctggtgt cctgcataca 4500 acaacggcgc cgagcgaatc gagtcggttc aacgccgctt tctacgtttt gcgcttcgca 4560 aactaccgtg gacgaatccg ttccgcctac ctagttacga gagcagatgc caactaatag 4620 atctcgagct actccgcacg cgaagagaca caatcagagc tttgacgatc gccgatacgc 4680 ttcagggacg aatagactgc gcatcaatcc tcgaacaaat cgacttgaat gctcgaccac 4740 gactgctccg caatagttcg ctcctaaggc ttcctctccg acggacgaac tacagtacta 4800 acggtgggat caacggatta ctgaaacttt tcaaccgagt cgcatcaatc ttcgactttc 4860 atctcactcg tgagttctta cgtcgaagat tttcatcctt ttttgctgga cgaaacaact 4920 gacaacggct ttgaatgtgc tttagattaa gttttagctc ttgacttcca tatgtgttat 4980 ttgtagattt gacctttctt gtctcgattt gtatgttttt aactaccaac atcattgggg 5040 cctcactttg cctgttgatg taacaaataa acaaataaac aaacaaacaa a 5091 // ID Academ-3_CS repbase; DNA; INV; 6580 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-3_CS. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6580 BP; 2109 A; 1234 C; 1312 G; 1925 T; 0 other; tagtccgaga gctcagcacc aattttagaa cgtcatcgag aacaaaaggt ttcatacgcg 60 aatttggtta attatgttat gaacgttcaa tttctgtttg tcagctggtt cggaccgggg 120 gcgtataagc gctacgttac cccattttac ccccaaatgg gtcaatcagt aacgtcaaaa 180 gtacaagaat gaaatttatt taaaaagaaa gccataaaca tattataatt aaaataagtg 240 aaactgacaa aaatttccgg ggcaaagacc cggtcagctg gtcccatttg tgctacccaa 300 aagggcgatt gatgacgtca taacgaacaa ggatgaaacc tattggaaat cgaagataca 360 gcaaagtaca tatctaaaat gatttagctg acttaatttt caggggcaaa gtcctagtca 420 gaccccaatt actgcgctgc ccctttttta acatgacgtc ataacgaaca aggatctatt 480 ggaaattgaa gatatagcaa agtacatatc caaaatgatt tagccgactt aattttaggc 540 aaagttgcgt tcaggacgcc aattgtacaa taaccccttg ctacccatac atcttattgc 600 ttctattatt atgacagaac tttgcacttg catataagct caaactttca ttattacgtt 660 tattattagg tcatagaagt tcaggaattg taaaaagggc aacaatgccg tgtgttgttg 720 tttgctatat atcgctaggc ttagtttacg tacggtttgt atttttaaaa cagcaacaat 780 ctaacgacag ttatggcaga agaagtcgaa ggtatgggtt tgaaagtatt acattactat 840 tgcaatgaaa gcagtgtaag tgtttaacat gttgatttta atttaagacg taaactcact 900 tggggatgat tctggcttac tatctgataa aaaccaagag gaagtagcgg agaaccatca 960 aggtacacgc ttttctttac atacgtttaa tttcatgtat gtacgtatat atgactatgt 1020 gttacacgat acctacatgt gcatgtatta gatacataaa gtatgtttga agtaatattt 1080 tattattttt taaagatttg tgctgtgtcc tccacgttgg tgctggctgt cgaagtggtc 1140 acgtaactct gtttcatgtt gaaactttaa aaaaaatgtc aattgatatt aaaagttaga 1200 aagcatactg cgttgaaata cggaaatttt gatcttcctg aatctgtcga taatcactct 1260 ggttttcacc ttgactgcta caaaaaattc acagccttat cggccaaaca aagaaaattc 1320 cttaaagaaa taaatgattg tcctacaact tccagcaacg ttatttccaa tgaaatctcg 1380 gaatccagta gaacagcacc agcgaatagg ttaatgcgat caacacaaac agcaagcact 1440 tcgtcatcgc ggacgggaat attcaaaaaa gtatgtttgt tttgctccaa ggaaagtaaa 1500 aaacaaagtg gaaacaaaat tcccttgacg gcatgccaaa cgacaaactt tcaggagaat 1560 gtgaaaagat acgctgaaat tttaggtgac accgtaatgc tcttgaaaat caaggatgta 1620 gatttcgtcg caaaagaagt tcattaccat ggcatttgca gaatacaata tcagaaacga 1680 gctgaagctg caggaatgca tcattcgact gaccaagtgt gtaaatcatt gtggcattgc 1740 tctagggatg tgcatgctcg cgcattcgac acaatatgtt tatggataca tgatactgta 1800 gtacttaatg gagaggtgca cgctttgaaa gatgtcaatg aatattacaa gtcattaaca 1860 gtcgaaatag gtggacaaca atttgacgaa acgacaatga attcccagca tttgagagat 1920 aaattaatta aacactttgg agatcaaatt cgaatatgca ttggacaaaa taagcatggc 1980 catctattat ttaacagcac gatggatata gaaacggcag taaggtcttc acataacgtc 2040 aaaaagaatt agcagttaaa atgagggatg ttgcatttag ccttcgtaca gccattctaa 2100 atgctgaaac aaaagaactt ccaaaggatc ttaaagtttc acatgtggaa aagggggaga 2160 tcgcggtccc tgatattatt tcaatctttt ttggtttcct tattaatggt ccagatgtac 2220 gtgttgggaa cagtgagagc aaaaaacgaa gaatcgactc tgtttcccaa gatgttgttt 2280 tctctgttgt aggtggacga tataaaccat caaagcattt gaaattgggg cttgcaatga 2340 aaagtatcac aggaagcaga aaggtaatcg agatcttaaa cagattgggg cactgtgtta 2400 gttaccacac cgttgaagaa attgaaaccg aacttacgtt tgcttcttgt gagaccaatc 2460 gtgttacgcc gtatggaatt tcactatctc caaatgtgcc tatcggatta gcattcgaca 2520 attacgaccg attcgtagaa acattaactg gaaaagacac tcttcatgat accgtgggga 2580 ttgcatatca agaaaaaaat acagaagaaa atgtagtaga gcctgatagt gatgtacctt 2640 cagatgattc agaacggcag gcaagtaatt ctcgtcgacg tacttttgta ccgagtggtc 2700 ttgatataga accatacaga aaaattccta gaatgtcgtc atcggaattt ttaccaacag 2760 atgacccaag gagacatttt gtaccagaaa gctatgcacg agctaaacat ttggctattc 2820 tgtggatgat atctgccttg gttctccatt gaacacacgc catgtgggtt ggatggaacg 2880 ccaacctgga aggaccttca acgtcaactc ataaagtatt ttatttacct cagactaacc 2940 agtctccaac ctcatcatcg gttgtagctg aaacgcttcg aattgcacag cgtgttgcaa 3000 gtgaatgcgg aaagaatgag atttccgtca cgtacgatct cgcaatagca aagaaagcta 3060 tgcaaattca agcagaagaa actcctacct ttgacaacgt ttttgttcag cttggaggat 3120 ttcacatcga aatggcatat tttaaagtga tgggtaaatt cataaatgag tctgggggac 3180 ctcacatttt ggaagaaagt ggatgtttag ctactggctc aatgaatgcg ttcattttgg 3240 gtaagcatta caatcgatgt aaacgaattc accagttact atctcttgcc atggaaatca 3300 cacactttaa agcctttctc aatgcaacta caagttcctt gtctaatact gtgttgacag 3360 aacttaaagg cattcaatcg tatgatgcgg ttgataatct tgcactggag ctacgagatc 3420 ttgtgaaaaa ttacgagaat ttccggaagg aaactgaaca aggaaatcac ggagctactg 3480 ctcaatactg gttgtcatac gtaaacatgg ttcaccttta tcacgaattt tcacgaagca 3540 tacgcactgg cgactttcag ctttatgtgt attgcttgcc aaagctagct tcttttttct 3600 tcgcttttgg tcatgttaat tacgctcgtt ggcttacacg attttccgac aacctgctaa 3660 aaattaagga aacccatcca gaattgggat cagagtttga agatggaaac ttttcattga 3720 gacgtacaca caaaggattt gcaagggtac ctatcgattt aactctggag caaaccataa 3780 atgcagatgc ctcaagccaa cgaactggtg taacagcatt tacgaattct atctcagcac 3840 gacaaaggtg ggcaagaagt cattacattc gtacaaacat tttgtcaata ctgttcgatg 3900 aactcggcct tcatcgtata gaagatgtta gccaagattt gaagccacat caagttcgca 3960 aaaactccga acggttgcaa aatatcctca acaagataaa tgattcgctt aacccgtttt 4020 cgcaagttat taccaatgaa actttatata acattggttc tggaaaagct gcctctgttg 4080 atacaactaa atttctgctg cgtgctcgtg agataggcga aaaagcacga gataccttca 4140 ttgagaaatg tgtcgaaaat cctaaatgct ttgaagaaag aaaagttaaa gtatttacat 4200 ttgcgtcaga aggcattaaa acaaaaagaa aacgcagcaa cggtcaaata tatgaagtga 4260 aaatggaacg tgatttgttt ggaagaatcc tttgcatagc cctggacaac aaagtggata 4320 tgggtgaagt gttagcgtac cccctgacgc cattgcctct ttcgttgtgc cacattgatg 4380 ggattatgca caaaacacaa aaatccacgc tactgaaaga actggagaga cgaatcacaa 4440 ccgacgcccc atcaaagatt gattgtttgg tagttgacgg aatgttttat cttcatttgg 4500 tgcatgatct tcctacaaca tttggaagaa ttgcctacta tctaatgtgc aagttgtgtc 4560 aaaatgattg cagcactatt catttggttt ttgatagaac gatgtcacca tcaatcaagg 4620 attgtgaaag aaatagtaga gagggaactt cagacagatc ggtggcattt cgaataaccg 4680 gtcccgctca aaaacggcct tcagattact tgaaggcgct gaggaatgac aatttcaagg 4740 aatctcttac agaatttctt gttgattatt ggcaagacga ctccatctcc aacgttctaa 4800 aagacaagtt gctgtacgta acctgtagag aaaaatgcta ctcgtatcac aacagcaatg 4860 gtaaagtgat aaggatggaa gaaccaacac ttcaaagttc gcacgaggaa gcagattcgc 4920 ggatgatatt tcatctaact caactttctg tcccaagcaa tgtggtcata cgaacagctg 4980 atactgatgt attagtgatt cttcttggga atatgcataa tttttctcca aatttaaaaa 5040 tatggatgga ggtcgggttg cagtcaaata acactctgcg atatataaat gcttcgaaac 5100 tgcacgttaa gcttggttta cagttatgta aatctctccc tggattccat gcattcacag 5160 gatgtgatta tacagcttcc ttcagcagaa aggggaaaat tcgaccgttt aaacttcttg 5220 aagatgacga catggtccaa aatgcatttg caagtttagg ggaaacggaa gatgttgaat 5280 ccgtagcgat ccaaaacctt gaaaagttca tttgtagaat gtatgggatg aagaacaaca 5340 cttcggtaaa tgatgcacgt ttacatgcgt ttctgaaagc atatactcct caaaatactg 5400 gtaaacctat gaatgacatt aaaggaatca ctgccggcac actgccaccg tgtgctgcct 5460 ctctgcatca gaaaatactg cgaacaaatt acgtttccag tatttggcta aatgcccatt 5520 cctgtaaccc cccgtctttg ctaccagaaa actgtggctg gactttggta gatgagcagt 5580 attctttgtg ctggtatcat ggcgactgtg ttcctccttc atgtgatacc atgataaacg 5640 agaacatcag cgacaacatc gacacagatg acactacact aagggaatat gataatacta 5700 acgaacacca ttcattgagt gattacgatt ctgactaatt gagtgattac gaaaatgatt 5760 tccgataaat cttttaaata taatttgtaa taaattacag tgcatgtgat tgtcgccgat 5820 ggaaaatcgt acttactgga tttattaaaa aacttgccca aaaataagtg aaaaatccga 5880 ttgtaaaaga aactaaccac tacaaatatc ttgaataatg cgtttttcct tattttccta 5940 actcttttga ttaattgcgc ctatgtaatt ttttttaatt tttaattaaa ctcatcgctt 6000 ttattctttt gtgtctataa aaagcacagt tactttcgca atttaaggta tcagtgagaa 6060 tataacccta aaagtttgtc ggttttcagc atatattgtt cctggtcgtt atgacgtcat 6120 gttaacctgt tttgaaaggg gcagcgcagt aattggggtc tgactaggac tttgcccctg 6180 aaaattaggt cagctaaatc attttgatta tgtactttgc tgtatcttct atttccaata 6240 ggtttcatcc ttgttcgtta tgacgtcatc aatcgccctt ttgggtagca caaatgggac 6300 cagctgaccg ggtctttgcc ccggaaattt ttgtcagttt cacgtatttt aattataata 6360 tgtttatggc tttcttttta aataaatttc attcttgtac ttttgacgtt actgattgac 6420 ccctttgggg gtaaaatggg gtaacgtagc gcttatacgc ccccggtccg aaccagctga 6480 caaacagaaa ttgaacgttc atagcataat taaccaaatt cgcgtatgaa accttttgtt 6540 ctcgatgacg ttctaaaatt cgtgctgagc tctcggacta 6580 // ID Copia-14_SI-LTR repbase; DNA; INV; 209 BP. XX AC AEAQ01018334; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_SI_; KW Copia-14_SI-I; Copia-14_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01018334; Positions 157 365. XX SQ Sequence 209 BP; 44 A; 58 C; 37 G; 70 T; 0 other; tgggaattgc gagacgtggc tttagatttg cgtcgccatc ttgctaggga cttgttttct 60 ctctagtctt cgccatctac atattttcta tgttccacac ttacgcgcca ctctgtgcac 120 gcgctgtatc tctcctaaat aaattgtaat accattaatc acagtgtcat ccattaagag 180 cctccacctg ctgcgattat ccgtcctca 209 // ID hAT-N1B_CQ repbase; DNA; INV; 1169 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE Non-autonomous hAT DNA transposon from Culex quinquefasciatus - DE consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N1_CQ; hAT-N1B_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1169 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 97-97 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >92% identity. CC 8-bp TSD. ~87% identical to hAT-N1_CQ. XX SQ Sequence 1169 BP; 410 A; 196 C; 167 G; 396 T; 0 other; caggggtgac caaagtatgg cccgcgggcc aaacgtggcc cgcgaggtga ttttttgtgg 60 cccgcggacc cattttgaat gatcacgtaa aatggcccgt tgaccacttg taaagtgatt 120 ttatgctttt ttttaaatta aggtttattc taaacatttt tatgcttatc ttttatattt 180 tttgttaata aaaaagctaa tgacttatca gtattttgat ccccatttta ggacacacat 240 attttgtcaa aaaatccttt taattaaaac aatttcacac ggttcaatgt gagtgaaact 300 aataaatatc gtgaaaactt tcatcaaaca tttttgaaaa tatttataaa acgttcaaat 360 ttgacttagg tagaaacctg taattgttct aaaaatataa gtattctaga ttaatttgaa 420 agtcataatg acccttttat gaactgaccc tcaaactaac tttacactta gaaaccggga 480 ttcgaactca tgacagttag ataacgaatc tgattgacta ccatctgatt caggtagaca 540 caatgttgaa atcggaggat cgaggctatt gcaaatattt tacaaagctt ttgtcgaccg 600 accctcccct ccccaattat taaaaattgg ctcaaaaaac aaggagcaaa aatatttata 660 aaaaatcgaa aaatcaatgg aaacttaggt acgatcagct gaaatcaatt aaaaatgaat 720 tcctcttcgt tcagaatcat ttgagcatgt tcgggtttat taaaaataaa ttttattttt 780 gtaaattttc gatgaatcaa tgttttttcg caaaaaaaaa tcgtcgaatc tttttttttt 840 gaaaataatg attgcagttt aactttacgg ttgcttaaaa cattttctat cattgaaatt 900 ctagctctaa aagatttttc attagtcaca ctcgatttaa agataaaatt accccgttac 960 ataaattgat cagtttacat gtcaccactt tgaaatataa atacaaaact attatttgta 1020 attaccgaaa atgttcaaaa aataaataaa aaatcaactt atagatgtat tttccagtct 1080 tttaaattaa aattatttag taaatctggc ccgcaagctc aattgagctt taaatttggc 1140 ccggcctcca aaaactttga gcacccctg 1169 // ID SINE3-1_TC repbase; DNA; INV; 262 BP. XX AC . XX DT 28-NOV-2007 (Rel. 12.11, Created) DT 28-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE This is a recently retrotransposed family of SINE3 elements - a DE consensus sequence. XX KW SINE3/5S; SINE; Non-LTR Retrotransposon; Transposable Element; KW SINE3-1_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-262 RA Kapitonov V.V. and Jurka J.; RT "SINE3-1_TC, a family of SINE3 retrotransposons from the red RT flour beetle genome."; RL Repbase Reports 7(11), 1180-1180 (2007). XX DR [1] (Consensus) XX CC This is a first family of SINE3 retrotransposons identified in CC insects. SINE3-1_TC contains a pol III internal promoter (pos. CC 47-93) derived from 5S rRNA (81% identity to the 5S rRNA present CC in the beetle genome). SINE3-1_TC elements are flanked by 8-10 bp CC long target site duplications. The 3' terminus is composed of the CC (ATATTT) microsatellite. Based on presence of TSDs and the CC 3'-microsatellite, it is expected that SINE3-1_TC elements were CC retrotransposed by an endonuclease and reverse transcriptase CC expressed by a I-like non-LTR retrotransposon. This family CC includes >20 copies that are only ~1% divergent from the CC consensus sequences. XX SQ Sequence 262 BP; 68 A; 54 C; 64 G; 76 T; 0 other; agttagttgc ataggttagc tgatttcata gattataatt aatctatgaa gttaagcagc 60 tctgaccccg gtcattcctt ggatgggtga ccggataatg taatgccaaa acatcgcaac 120 taactcgtct ttcggaggag acgttaagcc gtcggtcccg gtcactacta gtggtcgtta 180 ggtcaggtca gaggctgtag atgcgacctg aaaactctgg cacttgggtg atggcattac 240 caccatacgc atatttatat tt 262 // ID I-61_AAe repbase; DNA; INV; 6446 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-61_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6446 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1332-1332 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 371..1831 FT /product="I-61_AAe_1p" FT /translation="MLSGTNPFPIWDTGPANRRNGEFTGPLFPEWGDPDGN FT LGQMVMLRMEAVEGTLPPAPILLRKSVESYLGAKVEGAYPEARGTSYVLKL FT RNTAHVEKLKRMSQLANGFPIRIVEHPILNVSKCVISCSESCNYSDKELLE FT ELQPQGVKEIRRITKSHGETRINTPTIILTIQGTTVPQYVHIGWIRCRTRL FT FYPSPMLCYCCWEFGHTRARCTQLNNPTCGLCSEKHVIDKDNPCTAAAFCK FT RCNTIDHPLSSRKCPTYTKEEEIQHLRVDMGVSYPAAKRQYDLNHSSKSMA FT SVVAAGNDQRYAELSSKLDNVLKDMKTKDNKIEALITEVRNKDAQIEKLQA FT ALKATPQDRLNLVKEHGTIQDLVDKIRSLESALARKDREIATIRDIYIPKR FT TADNLSTDRSTKKQTTKNQESNSTKSSNKGTKKKVDKTDSFESLNKRTKNR FT PSPSATPERTDPAPESPSHTVHMDFSSGDEPMLNASGYISDT" FT CDS 1907..6202 FT /product="I-61_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MLPHRSRIEAINRSIDLETSLVSTHTPTTSTTNSIFN FT LASIYDSQLSTPSTSPAELPSSRGPVSAEVMTTPELADNPRHSVAAGRGTD FT KGGNSYPPEDNAGLKTSKKPASNPVKKQSWKRCSTTPSASGRYPQRRALPE FT SARTNAALYDLIQFKHTRTPFPTAGRSSSPLHESQHVNQEESSEAPSLIRT FT PRLPEAGTRLSIAESSILTTNSPTRLPRDNSSRLTTKAPLAIQWNMNGFYN FT NIGDLELLIRDQQPIILAIQEPHRISTDGLNRSLRGQYKWTLKCNENVYHS FT AAIGILSSVPHSSIPLDTDLPIVGVRLDYPFPLTVISGYLPNGNVPNLETQ FT LMDILKSLDTPSLILWDTNGHHTEWGSPSSNARGSLIMRIAESLDLVVLND FT GAVTFTRGDTQSTVDISLASVNIINRLSWTAGEDPLGSDHHPITIRFDEQP FT VAITRRPRWKYDQANWSQFQETLDTTVADMNPNNIDDFLNAVHHAASSTIP FT RTKGNSGRKSLPWWSPDVKKIIKERRKTLRAAKRLPKDHPNKDHADEVYRK FT KRNECRQLIREAKRKSWEDFLEGINANQTSSDLWNRVNALSGKRRATGMTL FT QVAGGLTRDPHLIANKLADYFESLSSLGQYDNNFIRRNHVSINSIQNIAIP FT EDTAVPLPINSPFRLEELNFALRRSKGKSAGPDELGYPMFQNLSINSKATF FT LELLNKTWAENTLPKSWTHSLVVPIPKVGKVTKSPSDFRPISLTCCASKIL FT ERMVNRRLSRFLEDNQLLDHRQHAFRPGHGTGTYFTGLGDVLQDAMNKELH FT VDIASLDLAKAYNRAWTPKAINQLSEWGLSGHILHFLKNFLSDRTFQVIIG FT NHSSSTRKEETGVPQGSVIAVTIFLVLMNSVFDSLPKEIYIFVYADDIVLL FT VVGRTLKFIRRKLQAAVTAVARWALHSGFKLSAEKSVISHICRYRHRMLSS FT PVKTNGSPIPMKKTVVILGIRLDRELRFEDHLRDTKKNCQTRLNILRTLSK FT PHRSSNRGTLLKVSKAIVNSRLLYGIELFCLAEDSLILSQLGPTYNQSIRI FT ISSLLPSTPADAACVELGVLPFRYQMVETLCCRTIGYLEKTTGDHEVFLLR FT EANRALANLAHLELPPVEQVHWVGARRWDAANLHVDLSIANRFRAGDNSIA FT MRSHVTELLARKYYTHQLRYTDGSKTRGRTGFGVTDTDSSHFFQLPNQCSV FT FSAEAAAILLAITTPASKPICVISDSASVLSTINSPTTQHPWIQAIQKDCP FT ARTVFLWVPGHCGIQGNVEADHLAAIGRTGRMFTRLTPGADLKHWLKSTIR FT VAWAQEWANVRQPFIRKIKGEITQWTDTTNRHDQLILSRLRVGHTHATHNM FT GSTGPFRRICSTCNITMTVEHLLVNCPFYQGIRDRYDISNNIRDILANDPA FT RETALISFIKDGGLHRAI" XX SQ Sequence 6446 BP; 1894 A; 1699 C; 1342 G; 1511 T; 0 other; cagtagacag ctaactatag cagcgaacgg tcgcgttgaa accgttctcg tatatcgctt 60 gattattcgt cgcgttggtg gtaataaatt acctctcgtc ggtttatctt cgctccattt 120 tgagccagtc ctaaatagtg tctgcttgag tagttttgat agcatttaga gcgtaatatc 180 aaatattaat tgtgctgtag cctcggatca agcttttgat ttcagagcgc tgtgtggaaa 240 tcaaacaata gagctgattg ccattgtctg cagctaagca aaaataccgc ctagtgagtg 300 gcatctcgtg ctgtgaaact tgttttctct ttcgagctgg tgaacaataa gtggtctacc 360 agttcccgac atgctgtcgg ggactaaccc tttccccatc tgggatacgg gtccagctaa 420 tcgtcggaat ggagaattca caggacccct cttccctgaa tggggagatc cggatggtaa 480 cctcggtcaa atggtaatgc taaggatgga agccgtggaa ggtactcttc ccccagcacc 540 tatccttctc aggaagtcag tagaaagcta cttgggagcg aaggtggagg gagcttatcc 600 tgaagcgaga ggcacttcgt atgtcttgaa gctaaggaat actgctcatg tggagaaatt 660 gaaaagaatg tctcagcttg cgaacggatt cccgattaga atcgtggagc acccaatcct 720 caacgtatcc aagtgcgtca ttagctgcag tgagtcgtgt aactactcgg acaaggaact 780 actcgaggaa ctgcaaccac aaggcgtgaa ggaaatacgc aggatcacca aatctcacgg 840 cgaaacacgc atcaatacac ccaccatcat tttgaccatc caaggcacca ccgtacccca 900 atatgtgcac atcggttgga tccgctgtcg aacccgccta ttctaccctt caccaatgct 960 ttgctactgc tgctgggaat ttgggcacac ccgcgctcgt tgcactcagc ttaacaaccc 1020 gacttgtggc ctctgctccg aaaaacatgt gatcgacaaa gacaatccgt gtactgctgc 1080 tgctttctgc aaacgatgca acactatcga ccacccactc tcaagccgca agtgccccac 1140 ttatacgaaa gaagaggaga ttcaacactt gcgagtggat atgggagttt cgtatccagc 1200 agccaaacgt caatacgatt tgaatcacag ctcaaaatca atggcttccg tcgtagccgc 1260 tggcaacgat caacgttacg ctgagctctc gtctaaactg gacaacgtgc taaaggacat 1320 gaaaaccaag gataacaaaa tcgaagcgct gattacagaa gtccgaaata aagatgccca 1380 aattgaaaaa ctccaagcag cgcttaaggc cacaccgcaa gacagactaa atctggtgaa 1440 ggaacatggt acgattcagg acctagtcga caagattcgt tctcttgagt ctgctcttgc 1500 cagaaaagac agggaaattg ccacgattcg agacatatac atcccaaaaa gaaccgccga 1560 taatctttct accgaccgct ccaccaagaa gcaaactaca aaaaaccaag agtcgaattc 1620 caccaaaagc tctaacaaag gtacgaagaa gaaagtggat aaaactgaca gttttgaatc 1680 tctcaataaa cgcaccaaaa atcgcccttc gccatcagcc actcctgaac gtacagatcc 1740 tgctccggag tccccctcgc acactgtcca catggatttc tcttcagggg acgaaccgat 1800 gctaaacgca tccggttaca tatccgacac gtaaatgttt cggcagtaag actagttttc 1860 cctcgattct tcataagatc aacagaacca ccaacgacaa acttcaatgt tgccccatcg 1920 aagcagaatt gaagccatca accgtagtat agatctggaa actagtctag tatctactca 1980 cacaccaaca acatcaacca ccaacagcat ctttaacttg gcatccatct acgacagtca 2040 actttccacc ccttcaactt ccccggcgga gctaccgagc agtcgaggcc ccgtcagtgc 2100 ggaagtcatg accacaccgg aactggcgga caacccccga cactccgtag cggccggaag 2160 agggacggac aagggaggaa actcctatcc ccctgaggac aacgcgggac tcaagacttc 2220 aaaaaagccg gcatccaatc ctgttaagaa acaatcctgg aaacgctgta gtactactcc 2280 atcggcatct ggccgatatc cacaaagaag agccttgcct gaaagtgcta gaaccaatgc 2340 agcattatac gacttaatac aattcaagca tacgagaact ccttttccaa cagcgggacg 2400 ttctagctcg cctctccacg aatcacaaca tgtcaaccaa gaagaatcgt ccgaagcacc 2460 aagtctcatc aggactccac gtttacctga agctggtact cgcctatcta tcgcggaatc 2520 atcaatcctg acaacaaatt ctcctactcg actacctcga gacaactcaa gcaggcttac 2580 taccaaagct ccgctggcca tccaatggaa catgaatggc ttttacaaca acatcgggga 2640 cctcgaatta cttatccgtg atcagcaacc aattatcctt gccattcaag aaccccacag 2700 gatcagcacc gatggactaa accgctcttt gagggggcaa tataagtgga cccttaaatg 2760 taacgaaaac gtctatcact cggctgctat cggtattctg tcatctgtcc cccactcttc 2820 tatcccattg gataccgatt taccaatagt aggcgtgaga cttgattacc ccttccccct 2880 gacagttatc tcgggatatc tgcctaatgg aaacgtaccg aatctcgaaa ctcaactcat 2940 ggatatatta aaaagcttgg atacaccaag cctgatccta tgggacacta atggtcatca 3000 cacagaatgg ggtagccctt cttccaacgc tcgtggttcg cttatcatga gaattgcaga 3060 atcgcttgat ctggtagttc ttaacgacgg agctgtaacc tttactagag gggatacaca 3120 atctactgtt gatatcagct tagctagtgt gaatataatc aaccgacttt cctggacagc 3180 cggagaagac cctttgggta gtgaccacca tccgattaca atccgctttg acgaacaacc 3240 agtggcgatt acacgccgcc cccgctggaa gtatgaccag gcaaattggt cccaattcca 3300 agaaacctta gacacgacag tagcagatat gaatccgaac aacatcgatg attttctgaa 3360 tgctgttcac cacgcggcat catctaccat ccctcgaaca aaagggaaca gtggccgaaa 3420 atcgcttccg tggtggtctc cagatgtaaa aaaaataata aaagaaagaa gaaaaactct 3480 ccgtgcagct aaacgattgc ccaaagacca cccgaataaa gaccacgctg atgaagtcta 3540 ccgcaagaaa cgaaacgaat gtcgtcaact tatccgagaa gcaaagcgga agtcctggga 3600 agacttcttg gaggggatca atgcaaacca aacatcctct gacctgtgga accgagtcaa 3660 cgcgttgagt ggcaaacgaa gagcaacagg catgacccta caagtagcgg gaggccttac 3720 ccgcgatccg catttaatag ccaacaagct agcagattac ttcgaatcac tctcttcatt 3780 gggtcaatac gacaacaact tcattcggcg caatcacgta tccattaata gcattcaaaa 3840 catagctatt ccagaagaca ctgctgttcc tctccctatt aactccccat tccgcctgga 3900 agaactgaac ttcgctctac gtcgcagtaa gggcaagtca gctggcccag atgagctcgg 3960 ttacccgatg tttcaaaacc tatccattaa ctcaaaagct acttttcttg agcttctaaa 4020 caaaacgtgg gccgagaaca ctcttcccaa gagttggacc catagcctgg tggtcccaat 4080 ccccaaggta ggaaaagtta ctaaatcacc aagtgatttc cggccgatct cactcacctg 4140 ctgtgctagt aaaattcttg agcgaatggt aaaccgccga ctgagtcgat tcttggagga 4200 taaccagctt cttgaccatc gtcaacatgc gtttcgacca ggtcacggaa caggaacgta 4260 cttcactggg ctgggagatg ttctccagga tgccatgaac aaagagcttc atgtcgatat 4320 cgcttcctta gatctggcaa aagcgtataa tcgcgcctgg actccaaaag ccatcaatca 4380 attgtccgaa tggggcttaa gtggccacat tctccatttc ctgaaaaact ttttaagcga 4440 caggacattc caagtgatca ttggaaacca ctcctcttca actcggaagg aagaaaccgg 4500 ggtcccacaa ggatccgtaa ttgcagtgac tatctttcta gttttgatga atagcgtttt 4560 tgattctctc ccaaaagaga tatacatatt tgtatatgcg gacgatattg tcttgctcgt 4620 tgtgggtcgt actcttaagt ttatccgacg gaaactccag gctgcagtca ctgcagttgc 4680 tagatgggct ctccactctg gttttaagct gtccgctgaa aaaagtgtaa tatcccacat 4740 ctgccgctat cgtcatcgaa tgctatcatc gcctgtgaag acaaacgggt ctccaatccc 4800 aatgaagaaa actgttgtta tacttggaat acgattggac cgcgagctga gatttgagga 4860 ccacttgcgc gataccaaaa agaattgcca aaccagactg aacattctcc gcactctttc 4920 aaaaccacac cgtagtagta atcgaggaac cctccttaaa gtatctaaag caatcgtaaa 4980 tagtcgacta ctttacggta ttgagctgtt ctgcctagca gaagattccc tcattttatc 5040 tcaacttggc cctacgtaca accaaagcat aagaattatt tctagtttgc tcccatccac 5100 accagcagat gcagcttgcg tagaactagg tgtgcttcct tttcggtacc aaatggtgga 5160 gactttgtgc tgccgaacaa tcggctacct cgaaaaaacc acgggagacc acgaggtctt 5220 tctcctcaga gaggcgaata gagccctcgc aaatctggcc catttagaac tccccccggt 5280 tgagcaggtc cactgggttg gagccagaag gtgggacgct gcaaacctac atgttgatct 5340 ttccatcgcg aatcgcttcc gtgcaggaga taactccatt gctatgcgct cacatgtcac 5400 cgaattgtta gctagaaaat attatactca ccaactccgc tacactgatg gttccaagac 5460 cagggggaga accggttttg gtgttaccga tacggacagc agccactttt tccagctccc 5520 caatcagtgc tcagtgtttt ctgctgaggc tgctgcaata ctgttggcca ttacaacacc 5580 agcatccaaa ccaatttgcg taatctccga ttcagcaagc gtcctatcta cgatcaactc 5640 tccaacaacc cagcacccat ggattcaagc tattcagaag gactgtcccg caagaaccgt 5700 tttcctttgg gtacccggcc actgtggtat ccaaggcaat gtagaagcag atcacctcgc 5760 tgccatagga cgaaccggtc gcatgtttac caggttgaca ccaggtgcgg atctgaaaca 5820 ctggttaaag tctacaatcc gcgtagcatg ggctcaagaa tgggccaatg tgagacaacc 5880 cttcatacgc aaaattaagg gagaaattac acaatggacc gacactacca atcgccacga 5940 tcaactgata ctttctcgcc tacgagttgg acatactcac gctacgcaca atatgggatc 6000 cacggggcct tttcgcagaa tatgttcaac atgcaacata accatgacgg tggaacacct 6060 actagtcaac tgtccattct accagggaat acgagatcgc tatgatatat caaacaacat 6120 aagagacatc cttgcgaacg acccagccag agaaacagca ctaatttcat tcatcaaaga 6180 tggcgggtta caccgggcca tttgagcccc aaaacacaga accaacataa aatatccaca 6240 ccattcatca tacggatcaa ctaagctacg attttgtgtt aatgtgttgt atatcgattt 6300 tataagaata tcctttgtaa ttgtaactaa actatgaaac tgtatgatag gggggcccct 6360 cattgcgaag ccctctccct ttttcttcca gagacgaacc agccacgatc tgagctgaaa 6420 gtctctttaa taaagataat aataat 6446 // ID Crack-34_AAe repbase; DNA; INV; 4867 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-34_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4867 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1250-1250 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 15 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 136..1167 FT /product="Crack-34_AAe_1p" FT /translation="MWYYLDDRSEEKCIISCYMVCSAVNSNQYLCSHCSGI FT VTVSNVYIEIFHDVIRVLSFCWCFTIRFGSTQIIKSAREYKWVTELVVNMF FT EDCIICVTCRKEEKDVKKVIECGYCHKCEHYDCKNVFGSAVRKLRSQPYFC FT SLQCSELHQQTKSTVEADSQMHRDIQLVLKEVRETRAEMHAVKNTVGDMEK FT FQSFLSEKLDTLLGEIQSTKSDHKALKTDVENLTFQQQSVCDRVDRLELDL FT DRINRSTVSQNAVIIGIPAVDNENPNEIVRKVAAAVGCQLPDDAILDVKRL FT LPKNANRDARPSSARPAPIKVCFKTVCHKEELLSKKKKSRIAASVGCQPFN FT C" FT CDS 1491..4367 FT /product="Crack-34_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MYCNRAYDCVDDWIANKSILAHLNCKILQINARGINV FT LSKFDCIKELLERSRERFDVVVICETWIKPDRTHLYQIEGFVGIYSCRNES FT HGGLAVFTRADLHVEVLANRTIDGFHHIHCQFKSLSSPVDLHAVYRPPGFE FT VRRFFSELENIISAVRNGQKCILVGDTNIPTNMTSNNIVGEYIRLLASYNL FT VVTNTNTTRPASNNVLDHVICTDTLAEKVVNDTICTDLSDHSFVVSSIDMS FT CHTTKKTLSKQITDHNRLNDLFRQSMINLPLTMTANETLEYVIERYKSHLK FT ECTRTVTCQAKVKNNCPWMTLELWRLIKVKDKQLKISRANPTDERASTLLT FT HVSKMLQSKKAQCKREYYLRIINRNQQRNTWKLIKDVSGRSASKNSVRKIR FT KNDQTITDPLQMCTAFNEYFCNIGNQLASTIPSNRNIHRFDTIQTASSSMY FT LRPSTPNEIILLINGLDIKKSAGHDKISTTFVKVHHDFFAELLHHVFNEII FT QTGQYPSCLKQARVIPVLKSGDPTELNNYRPISTLSVIDKILEKLLVARLV FT EYLSRFDLMYSRQYGFRRGSSTLAATCDLVEDLYDSLDNRKLAGALFIDLK FT KAFDTVDHNLLIEKLECHGIRGTPKQLLESYLAGRTQYVYMDDCESPPLPI FT TIGVPQGSNLGPVLFLLYINDLCRLNLHGKLRLFADDTSLSYNGNDCASIL FT RYIKEDIELLMEYFSENMLSLNLNKTKYMVIHSKRRRISSHDPVIIRGYTL FT EKASEYCFLGLTIDETMSWSAHIKSLKKTLSSLCGLLRRVSPFMPFSCMQK FT IYFSLVHSRLQYLIANWGLASKSLLNELQVLQNRCLKIVYNKPFLFPTNLL FT YSVVDKSILPIYALHELQIVVQIRKIITDPTLHHNTVLTVNRQDRASRQAG FT NFILPRPSTDFGRKKFSFIGSKLFNQLPTTCKTSRSLAEFKRSVKQYLKSK FT IQQR" XX SQ Sequence 4867 BP; 1484 A; 1047 C; 967 G; 1369 T; 0 other; agtttttctg gcagcactgc cagcctaata tccctcttgt ttctaaggaa tataaaattg 60 ttaatttcct cggaaataaa caacctaact ccgataattg tacgagaaag gcgagatgat 120 ggcatagtaa cttggatgtg gtattatctg gatgatcgca gcgaggaaaa atgtataata 180 tcatgttata tggtctgttc agcggtaaat tctaatcagt atttatgcag tcattgttct 240 ggcattgtca ctgtatctaa tgtgtacatt gaaatatttc atgacgtaat ccgggtactt 300 tcattctgct ggtgtttcac gatccgtttc ggtagtacac aaattatcaa gagtgctcgt 360 gaatataagt gggttacgga gctcgtcgtc aacatgttcg aggattgcat aatttgtgtt 420 acgtgccgca aggaggaaaa ggatgttaag aaggtgatcg aatgtgggta ctgccacaaa 480 tgcgagcact atgattgtaa gaatgtcttc ggaagcgctg ttcggaaact acggagtcaa 540 ccttatttct gctccctgca atgtagtgag ctgcatcaac aaacgaaatc gaccgttgaa 600 gctgattccc agatgcaccg agacatacaa ttagtattaa aggaggtacg tgagacacgg 660 gccgaaatgc atgcagtcaa aaacacggta ggagacatgg agaagttcca atcttttctt 720 tcagagaaat tggacactct gcttggcgag attcaatcaa cgaaatctga tcacaaagct 780 ctgaaaacag atgtcgaaaa tctcactttt caacagcaat ctgtgtgtga tcgtgttgat 840 aggttggaac ttgatctgga ccgtataaat cgttctactg tgtcacaaaa cgccgtaatt 900 attggaatac cagcagtcga caacgagaat cctaatgaaa ttgtgcgtaa agtagcagct 960 gcggtggggt gtcaattgcc tgacgacgca attcttgatg tcaaacgctt gctgccgaag 1020 aatgcaaatc gagatgccag gccttccagt gcaagacccg cccctatcaa agtatgcttc 1080 aagacggtat gccataaaga ggaactcctt tcgaagaaaa aaaaatcacg gatcgctgcg 1140 tctgtcggat gtcaaccctt caactgctga acctaatcga aaagtaataa tacgggacga 1200 gctaactcca tacggactcc agctgctgaa cgaaacgaag gatgcacaag accaactagg 1260 cttcaaattt gtctggcccg gacgtaatgg tgtagtttta gtcaagcact ccgagtcttc 1320 gactgtcaag gtgattcgca gcatgaagga cttggaaacg ctgaagagat tgagcaataa 1380 acgactttat gcctctaact cgtcgtcacc ggatgaacaa caaccgaacc cgaagcgacg 1440 attgtagatc tgtctctgtt tccacgcttc aattagtatt ttaatttatc atgtattgta 1500 atagagctta tgattgtgta gatgattgga ttgcaaataa aagtatactt gctcatctca 1560 actgtaagat tctgcaaata aacgcacgag gtatcaatgt attatccaaa tttgattgta 1620 ttaaagagct tcttgagaga agtcgagagc ggtttgatgt ggttgtaatc tgtgagacat 1680 ggattaaacc tgatcgtact catttatacc aaattgaagg ctttgttgga atctactcgt 1740 gcagaaatga gtctcatgga gggttggctg tatttactcg cgctgacctg catgtagaag 1800 ttcttgcgaa tagaacaata gacggatttc atcacataca ttgtcagttt aaatcgttga 1860 gcagtccagt tgatttacat gctgtttatc gacctcctgg atttgaggtt cgtcgattct 1920 tctcggaatt ggagaacatt atctctgcag taagaaatgg gcagaaatgc atccttgtcg 1980 gtgatacaaa cattcctacg aatatgacta gcaacaatat cgttggggaa tacatacgtc 2040 ttcttgcttc gtacaatcta gtagtaacga acaccaatac tactagaccc gccagtaata 2100 atgtacttga tcatgtcata tgtacagata cattagccga aaaagtagtg aatgatacta 2160 tttgtactga tttgagtgat cattcttttg ttgtttcatc cattgatatg tcatgccaca 2220 ccaccaaaaa aactctgtca aagcagatta cagaccataa tcgattgaat gatttgtttc 2280 gccaatcaat gattaattta ccgcttacaa tgactgccaa tgaaactctg gaatatgtga 2340 ttgaacgtta taaatcacat ctcaaagaat gtactagaac agtgacatgc caagctaaag 2400 ttaaaaacaa ctgtccctgg atgactcttg aactctggag actaattaaa gtaaaagata 2460 aacaattgaa aatatcccgt gcgaacccca ccgatgagag agcttctaca ttattgactc 2520 acgtatctaa gatgttacaa tcgaaaaagg cacaatgcaa acgtgaatat tatctgcgta 2580 ttataaacag gaatcagcag agaaatacgt ggaaattgat aaaagatgtc tcgggaagat 2640 ctgcttcaaa aaattcggta aggaaaatcc gaaaaaatga tcaaaccatc actgacccgt 2700 tacaaatgtg caccgctttc aacgaatatt tttgcaatat tggcaatcaa ttggcgtcta 2760 ctattccgag caaccgaaat atccatcggt tcgatacgat acaaactgcc tcgtcttcta 2820 tgtacctacg accttcaacg cctaatgaga ttattttgct gattaacggt ttggatatta 2880 aaaaatctgc aggacacgat aaaatatcca ctacattcgt aaaggttcac catgactttt 2940 tcgccgaact tctgcatcat gtattcaacg aaattatcca gactggacaa tatcctagct 3000 gcctgaagca ggcacgagtc attccggttc ttaaatctgg tgatccgaca gaactgaaca 3060 actatagacc gatatcaact ttgtctgtaa ttgacaaaat cttggaaaaa cttcttgtcg 3120 caagacttgt tgagtacttg tcacgcttcg acctcatgta cagtcgccaa tacggtttca 3180 gaagaggttc gagtacactt gctgctactt gcgaccttgt tgaagatctc tatgattctc 3240 tggacaaccg gaagctcgct ggagctttat tcatcgatct caagaaagct tttgatactg 3300 tagatcacaa tcttctgatt gaaaagctcg agtgtcacgg aatcagagga acgcccaagc 3360 agttgttgga gagttatctt gcgggacgga ctcaatatgt ctatatggat gactgtgaga 3420 gtcctccatt acccataaca attggagtac cacaaggtag taaccttggt cctgtacttt 3480 ttctgctgta catcaacgac ttgtgtagac taaatctcca tggtaagctt cgtctttttg 3540 ctgatgatac gtcattatca tacaacggga acgattgtgc cagcatactg aggtacatta 3600 aagaggatat agagctacta atggagtact tcagtgaaaa tatgctttcg ttgaatctga 3660 ataagactaa gtatatggtg attcactcta aacgtcggcg catttccagt catgatcctg 3720 ttattatccg aggatacact ctcgaaaaag catccgagta ctgtttctta ggattaacca 3780 tcgatgaaac aatgagttgg tcagcccaca tcaagtcact caaaaaaacg ttaagctcgc 3840 tttgcggatt gcttcggaga gtttcaccat ttatgccatt ctcttgcatg cagaagatat 3900 acttctcgtt agtacattcc cggctgcaat atcttatcgc caattggggg ctagccagca 3960 agtcactctt aaatgagttg caagtattgc agaatcgatg cctgaaaata gtttacaaca 4020 aaccgttcct gtttccaaca aatcttctgt attcagttgt cgataaatca attcttccga 4080 tttacgctct gcatgaacta caaatagtag tccaaatcag gaaaatcatc acggatccta 4140 ctctgcatca caacacagtc ttaaccgtca atcgacaaga tcgagcctcg cgtcaggcgg 4200 gaaatttcat actcccgcgg cccagcactg actttggtcg gaaaaaattc tcttttattg 4260 gaagtaaatt gttcaaccag cttccaacga cctgtaaaac ttctagatct cttgctgagt 4320 tcaaaagatc tgtcaaacaa tacctcaaga gtaaaatcca gcaacgctag tctttactga 4380 atcgctcttt gcttcttttt ttttcctcct ttcccgcatg ccaccgccac gaaatgctcc 4440 gccgaccgcc atccaccaac cacccatcgc caactgccaa ccacccaccg cccaccaccc 4500 aacgtcaacc gctcaccaca aaccgccaac caacaatcgt caccacttgc cgccctttgc 4560 tttcctgcat tcaacgtcga tttctacatt gagttacgta tcgcctatca tcatcgtcat 4620 tctaccaagc atttaaaaca cgtgaaccat attgtgatta atgtagagta attgtcagaa 4680 aactactaaa acacttcctt caaagagctc attgctcact ggaatgtgtt ccaaatgagc 4740 tgtatatact tgaattgttc tcaataaatg aaaagacgag gaggttttgt gcctgttgga 4800 ggaagcagct ttaaaaaagt tcacctccaa tgggtttttc cctgctccac cttaaaaaaa 4860 aaaaaaa 4867 // ID Helitron-2_AAe repbase; DNA; INV; 6227 BP. XX AC . XX DT 30-DEC-2010 (Rel. 16.01, Created) DT 30-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE Helitron-like sequence from Aedes aegypti: consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6227 RA Jurka J.; RT "Helitrons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (30-DEC-2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 922..5532 FT /product="Helitron-2_AAe_1p" FT /translation="MVTVLPRELEDDYAFNVCIKKHLIHKSNYLSGFVKKS FT VVKAWLEYLITTPLYRREGIIFNQERLRAFINGPQSATSGGEEMIQLELID FT VNNDAELLAGQQQTVMWNEDKCLELAPAQNRQPVSIVYDDNAEELSFPDIY FT LGYPRTFRAGTRVTPYMKATSELRRSDRRGTKPNHLLYMAMKILRLRVAEG FT LQNVFKNIGTVDITRGQISDRRFVDELMERNLSFMKSIPNSVQYWYLRKQD FT LFAMIRQLGKPTMFLTLSASETQWPLLLKQLHKLSSEYNGIDLSDPLQELN FT AQQRATLVNDDAVTCCLYFNKLVDVLMGILSSPRFSPFGKHYVVDFFKRIE FT FQHRGSPHAHIMLWLANDPNETVSEHMPATMELIRKVCSISAIHLSETIDK FT QIHSHTFTCYKRNEKRCRFNIPYWPMNEERTLVPLSADDSRRDRLRKRASE FT MRNILETKAFDTLEEFLADCRCTYEYYLDVLRSSIQRPTIFLKRSMNEMWT FT NPFNPWIAEKLRSNMDLQFILDVYSCACYLAGYVNKSNRGVSGLHRELINL FT QQQYPDQDYAALLKKVSLKMLNSVEMCAQEAAWVLLRLPMSEASRKVEFVP FT TMWPQERVRSRKQFKQMDEEEIDEDSTDIWTKNIIQKYEEREDMDDICLAD FT FVAWYTPCRGANSYKRRSVPRILKWRGYSMNELEEYKRESVLLFLPFRNEH FT CDILDGTKFLQLYEINEADILRKRKEYDRDLNLEQTVEEYLRTCENEVDGE FT HENAATKKHDEFVRTISMIPNDDDIEYLPTGALNAVIRKRTNVMSKEDYCA FT MVRATNVEQRDLILHVIDGLHSYNEINKPLQTFFTGPAGCGKTFTLRILME FT TINRFSQAHNAQKNAYVACASTGKAAVAIGGTTVHSAFRITMSRQKNSKLS FT FEMLQMYRNAFSNIKAVIIDEVSMIGADVLNTIHTRLQDITGNYDEPFGGI FT TIIFCGDLRQLPPVNARAVYKPTGNSFHGAVLWQVLNFFPLVKVMRQSDVE FT FSSILTKIGNGQKMTADETKLIESRFRTVEWCKQNVPGAIRLYHRNMDVEQ FT YNNEALTDQDTLECTADDVFAGYRDAAQLASSRIKLYKMSVVETGGLPYLL FT RLSVGMPYMITTNVDVEDGIVNGAIGELKYIEKDEDDCIVKLWFKFENEVI FT GAALRIKSRPAVYSRPGILQTDWTPISKRSASIKLSGIIKCKRIQFPVVSA FT CALTIHKSQGGTFPEVVYDYDKSQDQQLVYVGLSRVTSLQGLFLTNSTNSF FT KFHHAKGSNSPKMVDLRNELQRLGNHRLRTLGDELREILDHSGPACTLMSI FT NVQSLNAHAMDIATDQILTSVEFLALSETWLDDHSSVDIAGYSCINQFKRP FT GVRAGGVAIYQKDTAFNVAVPHTIQKISEEYDAMLGVADQVGDICAASIMV FT MDIKVLLITVYISPGTTVQDTKMFLIRNLFNYVKQDTPIVVTGDFNIDVSK FT QENIKFVDFMIKHFNLKLANRLNEATTLGGSCIDLTFIKNISVECSRYCSY FT FSYHRPILSILTIEAPELSNI" XX SQ Sequence 6227 BP; 1870 A; 1315 C; 1377 G; 1664 T; 1 other; tttgattgaa gatgtgctca gatttcctaa gattaactca catttactca aaagtgctgc 60 ctgataccaa cgaaaaagtt taaacaattg cctgatgtaa gtggtctcag aggtgaaaac 120 atcgaagtct gacagaaaac aaggatgaaa atattgaatt ccaacaaaaa cattaacgca 180 ggattgctca gagttactct cgattgttca ctttcttgcc ccttgtgttg attattgaaa 240 attaaatcat ttggaaattt tagggattca cattttaact gactattgtt taacattgca 300 gaaattgttc cacctcattc tatggagaat gagtcttttc aaactcctga tcttattagc 360 cagcatgagt agtcagtaaa tctaaaatta tcaatattat attttgaaca gtgatcgata 420 tgacagaggt tggtccatct agccctacat cagatgaacc taatttggaa agagccgatc 480 gagaatattg caaacgattc atcgataatc aattcgggtt tgcatgcagt gtttgcgata 540 ggctttggtt tctgaacgat ctgaaaccaa tcaccgaagc tgcaggtaaa gtgttgctgg 600 atgctggtca ttttgattct gttacgggat ttaaagtatg ccaaacgtgt cgaagtagct 660 tgcaacggaa ttctgtgccg aatctgtcta catcgaacgg ctttaagtat ccaccttttc 720 cgccgggtct gccaccattg gacccaatca ctgagaggct gatctcacca cgtttgcctt 780 tcatgcaaat tcgtcgcctg cgccgcgcgc aaggtctgtt aatacatcgt aatattgtga 840 gtttgtttaa cattatccga aattttaggc agttacacga tcatcggtca agtgatcaat 900 gtcccagtag atgtagatga gatggttacg gtgcttcctc gtgagttaga agatgattac 960 gcgtttaatg tgtgcatcaa aaagcactta atacataaat caaattactt gtctggattc 1020 gtcaaaaagt ctgttgttaa agcttggctt gaatatttaa taactacccc attgtatcgt 1080 cgggagggta tcattttcaa tcaggagcga ttgagagctt tcatcaatgg gcctcaatct 1140 gccacgtcgg gaggggagga aatgatccag ctagagctca ttgatgttaa caatgatgca 1200 gaattgctcg ctggtcaaca acaaacggtt atgtggaatg aggataagtg tctggagctt 1260 gccccagccc aaaaccggca accagtctca attgtctacg atgacaacgc cgaggagctt 1320 tcttttccgg atatttatct tggatatccg aggactttca gagctggcac tcgtgtgacg 1380 ccgtacatga aagccactag tgaactgcga cgaagtgaca ggcgtggaac aaagccaaac 1440 cacttgttgt atatggctat gaaaattcta agacttcgag tggcggaggg attgcagaac 1500 gtcttcaaga acataggaac agtggatata acacgaggcc agatcagcga ccgaagattc 1560 gtggatgaac taatggaacg aaacctatcg ttcatgaagt caatcccaaa ctcggtccag 1620 tattggtatc taaggaagca ggaccttttc gccatgatac ggcagctggg caagccaaca 1680 atgttcctga cgttaagtgc cagtgaaacg caatggccat tattgttgaa gcaactacac 1740 aagctctcca gtgagtataa cggcatcgat ttgtctgatc cgttgcaaga gttgaacgcc 1800 caacaacgtg caacgctggt caatgacgat gctgttacgt gttgcttata cttcaacaaa 1860 ctcgttgatg tgctgatggg cattctttct tcgccgagat tcagtccatt cggaaagcac 1920 tatgttgttg acttcttcaa gcgcatcgag tttcaacacc gtggaagtcc acatgcccac 1980 attatgcttt ggttggcaaa tgatcctaat gaaacggttt ccgagcatat gcctgctaca 2040 atggagctta tcaggaaagt ttgctccatt agcgcaattc atttatcgga aacgatcgat 2100 aagcagattc atagccatac attcacctgc tataaacgga atgaaaagcg ttgtcggttt 2160 aatattccat actggccaat gaacgaggag cgaacattgg tacctctctc cgccgacgac 2220 agtcgacgtg atcgattgag gaaacgtgcc tcggaaatgc ggaacatttt agaaaccaag 2280 gcatttgaca cgctagagga gtttcttgcc gattgtaggt gcacttacga gtactatctc 2340 gatgtacttc gttcctcgat ccagcgacca acgatcttct taaaacgatc gatgaatgag 2400 atgtggacga atcctttcaa cccgtggatc gcagagaagc ttcgttccaa catggatctg 2460 cagttcattc ttgacgtgta ctcatgtgca tgttatttgg ctggatacgt gaacaagtcg 2520 aaccgaggcg tcagcggatt acatcgtgag ctcatcaatt tgcagcagca ataccccgac 2580 caggactatg cagctttgtt gaagaaggtc agcttgaaga tgctgaattc tgtagagatg 2640 tgtgcccaag aagctgcatg ggtacttctt cgactaccaa tgtcggaagc cagcaggaaa 2700 gtggaatttg tgccaaccat gtggcctcaa gaacgcgttc gatctcggaa gcagttcaaa 2760 cagatggacg aagaagaaat tgatgaggat tcgaccgaca tatggacaaa gaacatcatc 2820 cagaagtacg aagagcgtga agatatggat gatatttgtt tagctgattt tgtagcttgg 2880 tatactccat gcagaggtgc caatagctac aaacgccgga gcgtaccacg aatactaaag 2940 tggcgaggtt atagtatgaa cgagttagaa gagtacaagc gtgaatcggt actcttattt 3000 ctaccgttca gaaatgaaca ttgtgatatt ctggatggta caaagtttct tcagctgtat 3060 gaaataaatg aagcggacat cttgaggaaa cggaaggaat atgatcgcga tctcaatttg 3120 gagcagactg tcgaggagta ccttcgtact tgtgagaacg aagtagatgg agagcacgag 3180 aatgctgcca ccaagaagca tgacgagttt gttcgaacga tttccatgat tccaaacgat 3240 gatgacatcg aatatttgcc aacaggagcg ctaaacgcag tcatcaggaa gcgcaccaac 3300 gtcatgtcaa aagaggatta ctgcgctatg gtacgggcta ccaatgtgga acagcgtgac 3360 ttgatcttgc atgtaatcga tggactacat agttataacg agatcaataa gccattgcaa 3420 actttcttca caggtcctgc aggatgcggc aaaactttta cgctgcgcat cctgatggag 3480 acaatcaacc gcttcagtca ggctcacaac gctcaaaaga acgcgtatgt tgcatgtgct 3540 tccacaggaa aggcagctgt tgctattgga ggaacaacag tgcactctgc tttccggatc 3600 acaatgtcaa gacaaaagaa ctcgaagctt agttttgaaa tgcttcagat gtaccggaat 3660 gctttttcaa atataaaagc tgttatcatc gacgaggtca gcatgatcgg agcggatgtg 3720 ctgaatacta tacatacgcg tctgcaagat attaccggaa attatgacga accgttcggt 3780 ggaatcacca tcattttctg cggtgactta cggcaactac cacccgtaaa tgcaagagcc 3840 gtttacaagc caactggaaa ctctttccat ggtgctgttc tttggcaagt gcttaatttc 3900 ttcccgcttg tcaaagttat gagacagtct gatgttgaat tttccagtat acttaccaag 3960 attggtaacg gccagaaaat gactgccgat gaaactaaat tgatcgaaag tcgattccgt 4020 actgtcgaat ggtgcaagca gaatgtgcca ggagcaataa gactgtacca tcggaacatg 4080 gatgtcgaac agtacaataa cgaggcgcta actgatcagg atacactgga gtgtaccgcc 4140 gatgacgtat ttgcgggata tagggatgcc gcccagttgg caagttctcg catcaagctc 4200 tacaagatga gcgtcgttga aaccggtgga ttaccgtatt tgctacgact gtccgttggt 4260 atgccataca tgatcactac caacgttgat gttgaagatg gcatagtgaa tggtgcaatc 4320 ggtgagctga agtacattga aaaggatgaa gacgactgca tcgtgaaact ctggttcaaa 4380 tttgagaatg aggtgattgg tgctgcattg agaatcaaat cgaggccagc tgtttactcg 4440 agacccggta ttctgcagac cgattggaca cccatctcaa aacgatcggc tagtatcaaa 4500 ctgagtggca taatcaaatg caaacgcatt cagtttccgg tggttagtgc ctgtgctttg 4560 actattcata agtcgcaagg tggcactttc cccgaggtcg tgtacgacta tgacaaaagc 4620 caggatcagc aattggtata tgttggtttg tcacgggtta cttcactgca aggactattt 4680 ttgacaaact ctaccaattc tttcaagttc catcacgcca aaggcagcaa ttctcccaag 4740 atggttgact tgaggaacga gctgcagcgc ttgggcaatc atcgattacg gacgcttgga 4800 gacgaactgc gtgaaatact ggatcacagc ggccctgcct gcacattgat gagtatcaat 4860 gtacagagcc taaatgctca tgcaatggat attgctacag accaaattct gaccagtgta 4920 gaattccttg cacttagtga gacttggcta gatgatcact catccgtcga cattgctggc 4980 tacagctgca tcaaccagtt caaacgccca ggcgtgagag ctggaggtgt ggcgatttac 5040 cagaaggata cggccttcaa cgtagctgtt cctcacacaa ttcaaaagat tagtgaggag 5100 tatgatgcaa tgcttggcgt agcagatcag gttggagaca tttgtgcggc atcgattatg 5160 gttatggaca ttaaagtact tctaataaca gtgtacattt ctccaggtac tactgtgcaa 5220 gatacgaaaa tgttcctgat ccgtaattta ttcaattatg taaaacaaga tactcctatt 5280 gtagtcaccg gtgactttaa tattgacgtt tccaaacaag aaaatattaa atttgtcgat 5340 ttcatgataa aacatttcaa tttaaagttg gcaaaccgtc taaatgaagc gaccacatta 5400 ggtggttctt gtattgattt aacgtttatt aaaaatataa gtgtagagtg tagtcgatat 5460 tgttcatact tttcttacca tagaccgatc ctatcgattc taacgatcga agcacccgag 5520 ctatctaaca tttaatctac tggagaagtt tgacgaacta cagctggagc caggaccgga 5580 accgatcatc caaccaaaca acgcaagcaa gcgacgacga aacaaacaaa ctcaatcatc 5640 tggaacatca ctggatcatt cctggaacac caaacactgg ttctcggaac cacagaacga 5700 caaacggtcc ttttcaggat caccaaaatg aacgacaaac ggtccttttc aggaccacca 5760 aaatgagtcc ataaagagtc aattactgaa aaatttcatc cgatcaataa aaatacaaga 5820 ctagtacact tgcaaattca tacacatgac aaattattaa taatccttca ctacacttat 5880 actcgaactt cactctcatt caaacatcat aatcaactca aatccttttt aaagtatttt 5940 actttgaaat ccgaaactct gaaaaagctt ccaaaccaaa tgacagatcc cccttaaacc 6000 ataacgtaat tgaatactca caatcacagc acctatcatc tatcataaaa cattatattt 6060 atcatctgca aaaagctacc gcactgttgt aagagtctga cccataattg atcgaagttg 6120 tgtccatagt agttctacgt gaaccccgcg gtaatgtaac agacattacc caccccaatt 6180 tttttgatct gtattwaggt tcagcgagaa ttctacagtt tggaaaa 6227 // ID TRAS4_SC repbase; DNA; INV; 1920 BP. XX AC AB046674; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 30-JUN-2010 (Rel. 15.07, Last updated, Version 3) XX DE Samia cynthia TRASSc4 gene, non-LTR retrotransposon, partial cds. XX KW R1; Non-LTR Retrotransposon; Transposable Element; KW endonuclease domain; reverse transciptase domain; TRAS4_SC. XX NM TRAS4_SC. XX OS Samia cynthia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Saturniidae; Saturniinae; Attacini; Samia. XX RN [1] RA Kubo Y., Okazaki S., Anzai T. and Fujiwara H.; RT "Structural and phylogenetic analysis of TRAS, telomeric RT repeat-specific non-LTR retrotransposon families in Lepidopteran RT insects."; RL Mol. Biol. Evol 18(5), 848-857 (2001). XX DR Genbank; AB046674; Positions 1 1920. XX SQ Sequence 1920 BP; 763 A; 332 C; 434 G; 391 T; 0 other; gggactgtga aagccgctat agcaatattt gatgaaagat tgagtattat agagcattct 60 cagctaacaa cgcacaatgt agcagtagcc acacttgaca caggacacac aaaaattggt 120 atcatatcgg tctattttga ggacaccaaa ccattaacgc catacttgga caaaataaaa 180 acaataatag aaaaattgga cacaaaaaaa gttattatag gaggggacgt aaatgcctgg 240 agtagttggt ggggaagtag gagagagaat gatagggggg aagagataac gggatggatt 300 acagaagaag gatatcatgt cctgaatcaa ggtagtatac cgacatttta cacgataaga 360 ggagggaaag agtaccagag ctgtgtggac atcacgatct gttcagacca aatactaagt 420 aagatacgtg gctggacgat tgaccaggaa ttggtaaatt cggaccataa ttgcattaaa 480 ttccaaataa taacaggaga gctaaaaaca agaacacaga aaaagacaac aagaatatat 540 aaaacacaaa aagcgaagtg gacagatttt agattaacga ttagtaaaaa aatagaagca 600 aacaaaataa caactaatgc aatcagagaa attacagaaa cacaagaatt agatagtata 660 atagaaaaat acaacaacat tataacacag gcatgtaaat taaatatccc taaaataaat 720 agaaatacaa ctaataaaca aaaaaataac ttaccgtggt ggaccgtcga gttggaggaa 780 gaaaagagga gagtacttac gatgaagcga agaatccgtt gcgcagcaca gcaaaggaag 840 tcccacgtcg tcgaagaata cctgaaaagg aaagaaaaat atgagttagc agcaaatgaa 900 gcacgtacaa atagctggag agaattttgc accaagcaga aaagggaaac aatgtgggag 960 ggtatatata gagtaatccg gaaagcagcc ccacaatacg aagaccagct actgagccag 1020 aacggacaga acctaaatcc ggaagattcg gtaaaactgc ttggtgcaac cttctttcca 1080 gacgactgta ccacagatga tacggtagaa catacaaaaa taagggatga tgcgaaggta 1140 accaacatag aagtcgatga tacggaagat gaccctccaa taaccgaagc cgagatgatc 1200 cacgcagcac gatcatttaa caaaaagaaa gcacctggaa aagatggatt cactgcagat 1260 atctgtttta acgcaataaa agccaacagt gagacattcc tggaaataat aaataagtgt 1320 atggaattgt catggtatcc gacatcgtgg aagagtgcat tcatactaat tttgcggaaa 1380 ccaaataaag ctagttacga aaacccaaga gcatacagac cgataggatt gctgccagtg 1440 ttagggaaga ttatggaaaa aataatagta aaaagaataa gatggcacac agcaccgaaa 1500 ttgaatccac gacagtacgg gtttacacca cagcgctgta cggaggactc cctctatgat 1560 ctaatgacac acatcatgaa caacttaaca cagagaaaga taaacattgt tgtgtcgttg 1620 gacatagagg gggccttcga cagcgcgtgg tggcccgtgt tgaagtgtag attaaaagaa 1680 ctaaaatgtc ctaggaatct caggaaaata gtagacagct acctagacaa tagacaggtt 1740 gaaatgaatt atgcgggagc ctcatacagt aaaataacga ccaaaggatg tgtacaggga 1800 tccatcagcg gcccagtttt ctggaatata ataatagacc cacttataga tcgattggca 1860 gacaaaaaca tttattgtca ggcgttcgcg gacgatgtgg tcctggtttt cgacggagac 1920 // ID hAT-7_SM repbase; DNA; INV; 3389 BP. XX AC . XX DT 11-OCT-2007 (Rel. 12.1, Created) DT 00-0000 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-7_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3389 RA Jurka J. and Obukhanych T.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1035-1035 (2007). XX DR [1] (Consensus) XX CC This planarian DNA transposon has ~15 bp-long TIRs flanked by 8 CC bp-long TSDs and encodes a protein distantly related to CC Tam3-transposase from a snapdragon Antirrhinum majus. Otherwise CC it shows little similarity to other hAT families. XX FH Key Location/Qualifiers FT CDS 620..2506 FT /product="hAT-7_SM_1p" FT /translation="MNSTRSKTKVYLLESYTEVLTGGKLPSLRQVLGLFLH FT EHKENGKTIREASTIAIESTAKFWQKARIPTRAVQHCQAKLESEFDVWRLL FT KKNASRKAETQKAKEAAFCSRLDDLFDIAHADALKNIKIEEDREFLIAQRE FT KGRRGSMLGLDVTLTNKEQRASEARDKLAARRHRAVCDQQQLDAKAELSEL FT DSHSNSSTSESDESGDEMQGATGGVHTPVRSQRKRGRLEVVSPGLAAMLDR FT TKVSDRKAVYLVAETAKSLGQDVKDLALNRSTIRRQRQDHREMISGRLREE FT FRADVDVPLVVHWDGKMLRDLTGKETIDRLPILVSGKGISQLLVAAKLQSS FT TGKAQAEAVYAALNEWGIADRVRAMSFDTTSSNTGLANGACVLLEQMLGIN FT LLSLACRHHIFELVIGAVFQTCMGPSSAPEVLMFKRFQSCWQFIDQASYST FT GMDDKDVAAVLMDTRNKLLEFAESQLQIMQPRDDYREFLELSIIFLGGVPA FT RGIRFIAPGAMHHARWMSKVLYSLKVYMFRSQFDLRPTEVKGLRDICTFSV FT SLYMKVWFTAPCAATAPHNDLTFLQALIDYEVVHPAISKAASAKMAKHLWY FT ISEELVGLALFDKDLSTATKRELVSAIKVID" XX SQ Sequence 3389 BP; 966 A; 677 C; 792 G; 954 T; 0 other; tagggtggtc cttatttttc aactttcaaa aagtgcctgt ctcacccttt tattttgttc 60 ctatatataa gaaaaaaatt taggccaaat atgagctcaa ttgaacaaca tttagaggta 120 gctcaatgaa gtcaaagttt tataatagac cgcattatca aaaatagtga aatacataac 180 ccccactact gctgagttta aaatttagca atagattatc atttaaatca tccctgattt 240 tgttacttac tactcagtgt gtgggaacca gcctttggtg ttcttgttct acttcttcaa 300 tattttgtgc aactaactgt ctggtgaagt tatagtgaat tgttatttac gtttaagaaa 360 agctctgaat agtatttatt actgtaatta tttaaaggta tgtactgatt atctgctaac 420 ttattgttta catttttgtg tagcttctgg ttgaaaatat tgtaatagtt tttctgttac 480 tgcaatagtg gcaacctgac tgtgaattgt agttgcagtt ttactggact tagtggattg 540 ctagcagtaa ttaataaata atttctcact gtaaatcaaa tcatgacata cattgtattg 600 tacttcactt acttttagaa tgaacagtac caggtctaag acaaaggtgt atctgctgga 660 gagttataca gaggttttga ctggtgggaa gttgccttca ttacggcagg tactcggtct 720 tttccttcat gaacacaaag agaatggcaa aacaatacga gaggcatcaa cgattgccat 780 tgaaagtact gctaagtttt ggcaaaaggc aagaatacca acaagggcag ttcagcattg 840 ccaagctaag ctagagagtg aattcgatgt atggcgtcta ctaaagaaaa atgcatcaag 900 gaaagctgag acacagaagg ctaaggaagc ggcattttgt tctaggctgg atgatttatt 960 tgacatcgcc catgccgatg cactcaagaa tataaagata gaagaagaca gagaatttct 1020 tatagctcag agggagaagg gacggagagg aagtatgctg gggcttgatg taactcttac 1080 caataaagag caaagagcat ctgaagctag ggacaagctt gcggctagac ggcatcgagc 1140 agtgtgtgac cagcagcagt tggatgcgaa ggctgaactc agtgaactgg acagccacag 1200 caacagctcc acatctgaat ctgacgaatc aggtgatgaa atgcagggtg ctactggcgg 1260 agttcacacg cctgttcgat ctcaacgcaa gagagggagg ttggaagtgg ttagccctgg 1320 gttagctgca atgctggaca gaacgaaggt gtctgaccgt aaggctgtgt acctagtggc 1380 agaaacggca aaaagtcttg gtcaagatgt gaaagatctt gccttgaaca ggagtacaat 1440 acgtcgacag cgacaagatc atcgggagat gatatcaggc cgtctaagag aggagttccg 1500 agctgatgtt gacgtcccac tggttgtgca ttgggacggc aaaatgcttc gtgacctaac 1560 gggcaaggag accatagatc gactacccat cctcgtctct ggaaaaggta tttctcagtt 1620 gttggtagca gccaagcttc aatcgtcaac tggaaaagca caagctgaag ccgtgtatgc 1680 tgctctaaat gaatggggga ttgccgaccg ggtgagagcc atgtcatttg acacgacgag 1740 ttccaacaca ggtctggcca atggggcttg tgtccttctc gagcagatgc ttggcatcaa 1800 tctgttgtcc cttgcgtgtc gccatcacat atttgaactg gttattggag ccgtgtttca 1860 gacatgcatg ggaccatcat cagcaccgga ggtcctaatg ttcaagcggt tccagtcatg 1920 ctggcagttc atcgaccagg catcatactc gactggaatg gatgacaaag atgttgcagc 1980 tgtcctgatg gatacacgga ataaactact ggagtttgca gagagccagc tgcagataat 2040 gcagccaagg gatgattatc gggaatttct ggaactgtcg ataattttcc tcggaggagt 2100 ccctgcaaga ggcattcgtt tcatcgctcc aggtgccatg caccatgctc gatggatgag 2160 caaagtactg tacagtttga aggtttatat gtttcgatcg cagtttgatc tcagaccaac 2220 agaagtaaaa gggctgcgag acatctgcac tttctcggtc agcctttaca tgaaagtctg 2280 gttcacagct ccttgtgctg caacagcacc gcataatgat ctcacgttcc ttcaggccct 2340 cattgattat gaagtagtgc acccagccat ctccaaggca gcttctgcta aaatggccaa 2400 acatctttgg tatatctcgg aggaactcgt tggactggct ctgtttgaca aggatctatc 2460 aactgcaacg aagcgcgaac tcgtcagtgc tatcaaggtg attgactgat tgcttgatgt 2520 ttacttgact tatgtcattt atcattaatt tcttgttttt gagtgtttga tagaactgtt 2580 agctctggta ctaaaactaa tttgtaatgt tttaccatac aggaaaatga cggagcggac 2640 aaaccgccaa aacgaatcaa catcgacatt cagtccgttc gcggcaaatc acttgcttac 2700 ttcatgacca agcattcgca tgttctgttc gaacgattgg aacttccaga ttcttttctt 2760 gctgttgatc cggaggagtg gaatggccac gaagactacg agcatgcagc tgcacttgta 2820 cgcgatctga aggtagtcaa cgatcatgcg gagcgcggag tagctttggt tcaggagtta 2880 agcggcatgc tcacaaaaaa tgaacagcaa ttccagttcc tgattcaagt tgtgcaggag 2940 aatcgtagac tgtttcctaa ctccctgaag cagaccctaa cttcagagaa ttttgccaca 3000 gatattcatg aatcggacgc gaatgtgtaa gacaagtgac agtggggtgc ccggtgctgg 3060 ctgccgaatg ccgatgctgc atcctagcaa cagtgtgatg ctaaaacttc tccatttgtc 3120 tttggtttat cttatatacc aactccaatg tcagtgatat ttgttgatgg cattaaaagc 3180 gtgacaattt gcttctttca tttttatata agagttgcga aaaacgcacc attcgtaaaa 3240 actttgatgc tcttgagcta cctctaaatg tcaatatttt ctccccaaaa tttttctcca 3300 gtgttttttg tctaaaagga acaactgtgg agggtgagac attgaaaaac accaaaaaaa 3360 ttttttggcc ctgtataagg accacccta 3389 // ID Copia14-NVi_LTR repbase; DNA; INV; 268 BP. XX AC AAZX01010927; XX DT 13-NOV-2007 (Rel. 12.11, Created) DT 13-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia14-NV; KW Copia14-NVi_I; Copia14-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-268 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1156-1156 (2007). XX DR Genome; AAZX01010927; Positions 926 659. XX SQ Sequence 268 BP; 66 A; 70 C; 55 G; 77 T; 0 other; tgttgaaatc tagtgtgtat tttcgcgttt tcgataggct aagtcctaca catattcaga 60 ctagtatagc gtcctccgat gagacccgac aataagagtg gggatgacac gacatcgcgc 120 gctcctagcg gcgtttcgaa ctccgatctg taaggaccat attcagtcga actcccgagc 180 gagtagcacc tctcgaataa agcctttttg tatttcaata agtgtcttgc atctttactt 240 cgcgttatcc tcccgcatac cctcaaca 268 // ID Gypsy-2_IS-LTR repbase; DNA; INV; 210 BP. XX AC ABJB010439064; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_IS_; KW Gypsy-2_IS-I; Gypsy-2_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-210 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010439064; Positions 2256 2047. XX SQ Sequence 210 BP; 57 A; 53 C; 63 G; 37 T; 0 other; tgtagcggca gctatcagca gcgatccagc gcttgaacgg cgccaagctc acgggacagg 60 cacgagctaa ggacacgagg tgcctggacc acgagctgca cggaatctgg ggattcggcg 120 aagggcccct cgatgagcta gtcagaagga cagtgtaaaa tattgtaaat aaatatgtag 180 agtgtgattc gccgagcctc cactcctaca 210 // ID Gypsy20-I_Dpse repbase; DNA; INV; 5433 BP. XX AC Unknown_singleton_29; XX DT 15-MAY-2009 (Rel. 14.05, Created) DT 15-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy20_Dpse; KW Gypsy20-LTR_Dpse; Gypsy20-I_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-5433 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1116-1116 (2009). XX DR Genome; Unknown_singleton_29; Positions 42475 47907. XX CC Positions [1958-2500] - Reverse transcriptase CC Positions [3407-3883] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(224..3148,3152..4393) FT /product="Gypsy20-I_Dpse_1p" FT /translation="MNRDEILALLVADLRDKLAEIGLNRTGRKVELQNRLL FT AHYGFQTDQNDEEVRDEGETADEINEEVSFRTVASNNEQARGNERRDENPI FT GSGQRRQADSSNSGRSWFTLKDVEGSVSQFSGTSSPDINQWIEELEECALT FT VEWNQLQVFIYAKQLLSGAAKLFIRSQRDIRDWNVLKDALIDEFGVKVSSA FT EIHRRLGKRQKRKNETLHEYLYALMELAKPIEMDDESLIEYFVSGIPDSST FT NKALLYQARNMRQLKVQIEAYQKMTGSVRSQGKFDSHFNKGEKELVKSAEK FT FKKYFNCGDESHIKRDCPKRDNRCFRCNQLGHRAAQCKVDMAIKQEKATNL FT VLENGRTKDTSSGLELKSVNYGKVVFKGLVDTGADLCLLSRNVFLKLGRGK FT LAGHSKCLTGIGESQIMTFGSITIPVQIDDIDLMVEFHVIADEDMGFDAIL FT GRTLLESVDIKVTRDGTARVLRSGSQVSTRRNVPEHRDERENVNSCTELLN FT EFQSLCMVNLEQSEAINIDLSHLPHSERDQKSKIIEEYKPSRNVHCPIEMK FT IVLTDDLPVYQHPRRMAYSEQNIVDEQVNEWFAQKIIKPSTSEYASPIVLV FT PKKNGRKLLCCDYRKLNEKIVRDNFPMVQLQDVIEKLQGALIFTTLDLTNG FT YFHVPVEVQSQKYTSFVTQKGHYEFLFVPFGISNSPAIFTRFIVAVMRELV FT QKGDAVVYMDDIIIPSKDVKEGVQKLKRVLEVAGKNGLKINWSKGQILQSK FT VDFLGYVIQNGTITPGKEETQCVANFPIPKDKKAIQRFIGLTSYFRKFIEG FT YAVIARPLSDLLRKDSKFDFKELQRLAFEQLKAALARKPVLRWYNPKLSTE FT IHTDASMLGFGDVLLQKDPEDGQLHPVMYMSKKTKPCEENYHSFELEVLAV FT IGALTKWRVYVFGLKFKIITDCNAFAMTIKKKDVPLRVARWAMYLQDYDYE FT IEHRSGSKMLHVDSLSRVACLLTEILFKDLNRELIVVPSQMETEIIQIAHR FT QGHFSVKKTQDLVEKSYFIPKLRNKVERVVSGCVQCILVNTKAGRQEGYLT FT PIDKGDKPLVTYHLDHVGPMEITKKRYNHILVVVDEFSKSVWLYPTRSTGV FT EEVLNCLERQAVSFGNPFRIVTDRGAAFTSHLFKEYCDKQKNKHLLIATGV FT PRGNGQVDRMHKIVVPMLSKMSLENPGNWYRHVGRVQQIINNVEPRSTKVA FT PFKLLTGVYMRITDIAELRELVQQSLIGELDEDHEQLRKEARDNIQSLQEE FT NKRAFDKKRKDEKQYKINDLVAIRRTQYGVGLKLRGKFLGPYKMVKIHKHG FT RYDVEGAGEGEGPFKTSTGAEFLKELGANSMSGGPNVGFGGRATESVEITE FT EVTERETRSGRRTMRGENTASTIDAI" XX SQ Sequence 5433 BP; 1854 A; 873 C; 1409 G; 1297 T; 0 other; agcgaaatat tgcaaagaag cagacagaca agaacggcgg tagcagtggt aattaacata 60 gtgttaagtt tacattttca ataacataac attaagttta aagagcggaa tactgttaat 120 tattgtaagg tctgagagga actggattct aaaactgtat tttggcgagc gagcggcata 180 gatccatttc ttattctatt atatttcata caagtttgta aaaatgaata gagatgagat 240 tttggcactg ttggtggcag atttgcgaga taaattggcc gagataggtt taaatagaac 300 cggaagaaaa gttgaattgc aaaacagact gctggctcac tatggttttc aaaccgatca 360 aaacgatgaa gaagttaggg atgagggaga gactgcagat gagattaatg aagaggtgtc 420 atttagaaca gtggcctcaa ataatgaaca agcaagggga aatgaaaggc gagatgaaaa 480 tccaattgga tctggtcaga ggcggcaagc agacagttct aattcaggaa ggtcatggtt 540 tacgttgaaa gatgtagaag gcagtgtgtc acagttttcg ggtacaagtt caccagacat 600 caatcagtgg atcgaagaac tcgaggagtg tgcactcacg gtggaatgga accaattaca 660 ggtgttcatt tatgccaagc aattgctcag tggggcagcg aagcttttta ttcgaagtca 720 acgcgatatt cgtgattgga atgttttaaa ggatgcattg atcgacgaat ttggggtgaa 780 ggtgtcatca gctgagatac atcgtaggct agggaagcgg cagaagagga agaacgaaac 840 gttgcacgag tatctgtacg ctctgatgga gttggcaaaa ccaatcgaga tggatgacga 900 aagtttaatt gaatattttg tcagcgggat tccagattct agtacaaaca aggctttact 960 gtatcaagct aggaatatga ggcaattgaa ggtgcaaatt gaggcatacc agaaaatgac 1020 agggtcggtg agatcacaag ggaagtttga ctcacacttc aataaaggag aaaaagagtt 1080 agtgaaatca gcggaaaagt ttaaaaagta ttttaactgc ggagatgagt cgcatattaa 1140 gcgggattgt ccaaaaagag ataacaggtg tttcagatgc aaccagctag gacatagagc 1200 tgctcaatgc aaggtcgaca tggcaattaa acaggagaaa gcaaccaatc tagttctgga 1260 gaatggtagg acaaaggaca cttcgtcagg attagagttg aaatcggtaa actacggaaa 1320 agttgtgttt aaaggattgg tcgatacagg tgcagatttg tgtcttctaa gcagaaatgt 1380 gtttttgaag cttggtagag gaaaactggc aggacatagt aaatgtttaa cgggaattgg 1440 tgaaagccaa attatgactt tcggtagtat cacaatacct gtgcagattg atgacattga 1500 cttaatggta gaatttcacg taattgcgga cgaggacatg ggatttgatg ctatattggg 1560 tagaactctt ttagagagtg tggacataaa ggtaacaaga gatggaacag cacgagtcct 1620 cagatcagga agtcaagtaa gtactaggcg caatgttcca gaacatagag atgaaaggga 1680 aaatgttaat tcatgtacgg agttgctgaa tgagtttcag agtttgtgta tggtgaactt 1740 agagcagagt gaagctataa atattgattt gtcacatttg ccccattcag agagagatca 1800 aaaaagtaag ataattgagg aatacaagcc gagtcgaaat gtgcattgcc ctatagagat 1860 gaaaattgta ctaacggatg atttacctgt gtatcagcat cctaggcgta tggcttatag 1920 tgagcagaac atagtagacg agcaggtcaa cgagtggttc gcacaaaaga taataaagcc 1980 cagtacatca gaatatgcct ctccgattgt attagtacca aagaagaatg gacggaaact 2040 tctatgctgc gattatcgca agttaaatga gaaaatagtg cgagataact ttccgatggt 2100 acagctgcag gatgttattg agaaattgca aggagctttg atttttacga cactagattt 2160 gactaacggt tatttccacg taccagttga agttcagtca caaaagtata catcatttgt 2220 aacgcagaaa ggacactatg agttcctatt tgtaccgttt gggatttcga attcgccagc 2280 aatcttcaca cgtttcatag ttgcggtaat gagagaattg gttcagaagg gtgacgctgt 2340 agtatatatg gatgatatca ttataccaag taaggatgtt aaggaaggcg tacagaagtt 2400 gaaaagagtg ctagaagtgg caggaaagaa tggtttgaag ataaactgga gtaaaggtca 2460 aattttgcaa agtaaggtag atttcttggg gtatgttata caaaatggta ccataacacc 2520 cggtaaggag gaaacacagt gtgtagcaaa ctttccgatc ccaaaggata aaaaagctat 2580 acaacgattc attgggctga cttcttattt caggaaattc attgaaggat acgcagtcat 2640 agccaggcca ttgtcggatt tgttgagaaa ggattctaag ttcgatttca aggagttgca 2700 gcgattagca ttcgagcaat tgaaagccgc gctcgctagg aaaccagttt tacgatggta 2760 caatcctaaa ttatctacgg agatacacac ggatgcttct atgttaggtt ttggggatgt 2820 cttgttacaa aaggatccag aagatggtca actgcaccca gtgatgtata tgagtaagaa 2880 aacaaagcca tgtgaggaaa actatcattc attcgaactt gaagtgctag cagtaatagg 2940 agcattgact aaatggcgtg tgtatgtttt cggactgaaa tttaaaatca ttacggactg 3000 taacgcgttt gccatgacga taaaaaagaa agacgttcca ttaagagtag ctaggtgggc 3060 aatgtatctg caggactacg attacgaaat agaacatcgt tcaggatcga agatgctaca 3120 cgtagactct ctcagtcgag ttgcttgtta attattaaca gagatcttat tcaaagatct 3180 aaaccgagag ttaatagttg ttccatctca aatggaaacc gaaataatac agatcgccca 3240 caggcaggga catttttcgg tgaagaaaac acaggatctg gtagagaaat cttactttat 3300 ccctaaacta agaaataaag tagaaagagt ggtaagcggt tgcgtgcagt gtatcttggt 3360 taatacaaaa gcgggcaggc aagaagggta cctaactccc attgacaagg gtgataaacc 3420 attagttacc tatcacttag accacgtcgg gccaatggag attacaaaga aaagatataa 3480 tcacattttg gtagttgttg atgaattttc aaaatccgtt tggttgtacc ccacacgtag 3540 taccggagta gaggaggtgc tgaactgttt agaaaggcaa gcagtatcat ttggtaaccc 3600 attcagaatt gtcactgatc gtggcgcagc tttcacatcg cacttattca aagagtactg 3660 tgacaagcag aagaataaac atttgttaat agctacaggc gtacctagag gcaatggaca 3720 ggtcgataga atgcacaaaa tagtagtacc gatgctgtca aagatgagct tagaaaatcc 3780 aggaaattgg tataggcacg tgggtagagt gcaacagata attaacaatg tagaacctag 3840 gagtaccaaa gtcgccccat ttaaactcct aacaggagta tacatgcgta taactgacat 3900 agctgaattg agagaattgg tacagcagtc tctcataggt gagctagacg aagatcatga 3960 gcagttaagg aaagaggcta gggacaatat tcagtcgttg caggaagaaa ataagcgagc 4020 ctttgataaa aagagaaagg atgaaaagca gtacaagatc aatgacttag ttgcaatcag 4080 gcggacccaa tatggtgttg gattgaagtt gaggggaaag tttctagggc cctataagat 4140 ggtcaagatt cacaagcacg gccggtatga tgtagaagga gcgggagaag gagaaggccc 4200 ttttaaaaca tctacggggg ccgagttttt gaaggagctc ggggcgaact ctatgtcagg 4260 agggccgaat gtgggatttg gtgggagagc aacagagtct gtagagataa cagaggaagt 4320 aacagagaga gagactagaa gcggccgtcg aacaatgaga ggagagaaca cagcctcaac 4380 gatcgacgcc atttaattcg gtgccaaagt ctcactcagc cacacgcaaa ttacgagagc 4440 aaatatctgt attaatcaaa atatacttaa agcacagcga ccgttgtatt cataactaat 4500 cttaaataat attcataaat aaagactaat tataataaaa cctggcgtcc tggactaaag 4560 tacaactacg gaaaatccgg ggccacaatt tttgggggct caaccgagat ttgaacgatt 4620 acaaacggaa agtcatcaaa ctaaagttgt gttttacaaa tattgcgaga actcggtgaa 4680 ttgtagtggc gttgtttgga acattctgga agaaatctgc aaaaactctg ctgcgcgaaa 4740 attcaaacaa aaggcaattg gcaaagcagc gcgagagtga gcaagacggc agcaagcagc 4800 tacggtaaaa aggagaagaa aaaccgaagc aaatcgaatc aagaggaacg gcgtaagcag 4860 cagagacggc gataaagaga ccctgtaccc aatctgaagg caacgcaaga cttggtggat 4920 ggtcagcaac gtgcaatagg caggatcgtt ggcggtgagc atcatctaga caaggagaaa 4980 ggcgtggagc agagaaggta gctagtggtt cgcaagtact gtacggcgaa aaaggcagct 5040 ggtcaactca aaaagtctgc agcaagaaga cgttgagaaa attgacaagg cagtgagtgc 5100 gtggaaacga gacgcagcag tcagtaactg taaaagcaga ggaatagaga ggaacggcgt 5160 aaacagcaga gacggcgata aagagaccca gtacccaatc tgaaggcaac gtaagacttg 5220 gtagatggtc agcaacgtgc aataggcagg atcgttggcg gtggacatca tctggacaag 5280 gagaaaggcg tggagcagag aaggtagcta gtggttcgca agtactgtac ggcgaaaaag 5340 acggctggtc acgacgaaaa ttctggagct gtgttagtgt gaagacgaca gaaaattctg 5400 gagctgtgtt agtgtgacga cgacagaaaa ttc 5433 // ID Academ-2_BF repbase; DNA; INV; 7893 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 7893 BP; 2304 A; 1792 C; 1859 G; 1938 T; 0 other; tagtggggca gataacatca aattttggag gaatctttca ctttgtgata caagcaccaa 60 acttggtaca gatactcatt ggactttact gttttgaaaa agtacgcggg ccacttgatt 120 tgactttcgg cgccccctag cggcgcgttt tggtactgca gcagaccttc cggctttgta 180 tctcctgttc taaacatgct acagtcatga ttttatgtgg tagaaagctg gtagggtaag 240 gaagaagtga cgtaagtttg gtttccctag cggcttgtgt tggaactaca gcggcgtttt 300 tttgccgctg ccgaattgcc ctaggggcgc tttttggtac tgcagcagac cttccggact 360 ttgtatctcc tgttctagac atgctacagt catgattttt atgtggtagg aagctgttgg 420 agtaaggacg aagtgatgta agtttggtct ccctagcagc ttgcgttgga actgcagcaa 480 cggttttttg ccgctgacga atgtgaagcg gaacaactcc ggacgtttga aatgtccaaa 540 agtaggttct tgactatggg aaagctaaat gttcaacttt aattgtgata caagcgccaa 600 acttagtaca aatactcatg tgatactact cttttgaaaa agtacgcggg ccacttgatt 660 tgacttttgg cgccccctag gggcgctttt tggtactgca gcagaccttc cgggctttgt 720 atctcctgtt ttaaacatgc tacagtcatg atttttatgt ggtagaaagc tattggggta 780 aggaagaagt gacgtgagtt tggtttccct agccgcttgc gttgaggtca tgggctgttt 840 caaatagacg aggtgagggt agcacacaaa gggtttcgac acttctccac tgcgtcctat 900 taaaccactc attccaataa tatgcctgtg cgagtctttt gtggattaca ttttaaaatg 960 tatactgctg catgagggct acaagtgtct ctgtttatca cacagacagt cctgggaaga 1020 agaacatatt ttctgtgcaa agcatgctgg gaaaagcaaa agcaaacaat cctgggaacg 1080 agttagtttt tggcgggaaa atctcagtga gtgacacagc atgacaggtt cctacacatt 1140 gagcataaag acaacacccc aggctggtta atgtggacct tatctcagga aagaggactg 1200 aattgatgca cagccactga ggaagaggtg agagacagaa attagagtgg ttgtacaccg 1260 ttgtttcctc tcctagcttc ggacccacca ccgacacaat ggcggatagt ggaggcagcc 1320 tgacgtcaag gcacagccat ggcttactgg tgacatcaca ctgccgtgaa acttagcatg 1380 gagttcagtg tttacttgaa tggcactgaa tgaactatag ggctaaatgt agaatttgtc 1440 tgaacctact tgtatctcat ctgaatgtgt ataatcatac taaaggtagt aaaaggagaa 1500 tcgctcggca ttactgcttt aattcactca ttacccaaca agatggcaag atcagttccc 1560 ggatcatcaa agtggaaaac taactggagc aaatgttgta tttgccaaga ggagaagggt 1620 aatgaggatc tgaaatcccc acccatacgt tactcacccg aacaggatgg ctacacaatg 1680 attgcaacca acgtccccct gttccatgcc ctccatgaga tgcccattgt tctcgaccca 1740 accagactgg atgaaggcgg tggaatagaa gaaacactga ggagaaacaa ggcacagtac 1800 caccaaagct gccgttatct atttaacaac accaaactgg agagggcaag aaagagaaga 1860 catgggcttc aaacttctca gtcagaagaa ggacaaacca aactgcgaag aaccagccgt 1920 gacggtctgg aatctgagtg ctttctttgt gaacaagaag aacctgcatt agaactgaga 1980 catgccatga ccatgaagct taacaaaaga gtcaatgact gtgctacaac tctaaaggat 2040 ggaagactgc tgaaaatact gagtggtgga gacgtcgtag cacaggagct caagtatcat 2100 cctgcatgct tgacatccct gtacaacaga gagagagcct acctgagaaa tgtcgaacaa 2160 agagagcaga gccagaagca ggatgtctac ccactggcat tttcagagct tgttacctac 2220 atagtggaga ccaaactcag ttttgaggga cctacaatat tcaagctggc agatatggtt 2280 aatctgtaca aacagcgtct acaacagttg gggatggaga caccggatgt taactgcacc 2340 agactgaagg acaagctgtt ggctgaatta cctgagctgg aagctcacaa gaagggaaga 2400 gacgttctgc tggctttccg aaaagatgtt ggcttttccc tgtcacaagc ctctaactac 2460 accgatgcca tgactcttgc caaagcagcc acgatcttaa ggagacacat gctggaccac 2520 aagttttcgt tcgaaggaac gtttcatgag ggcattgaaa atgccattcc acccagcctc 2580 cttcaattcg tgggcatgat tgagcatggg gctgacatta aatcacaact gagatttggt 2640 gcttcaaagt cagaccttgc aattgcacag ctgttgcagt acaactgcta cacaagatac 2700 aaggaaggag catcgactca caggcattcc aaagatcggg aaacgccatt tcccatttac 2760 atgggaatgt ccatatatgc caagaccaga aagaagttgc tggtgaacat gcttcatgat 2820 catggtcttg gtatttcgta tgacagggta ctggagatat ctgcacagct tggagatgca 2880 agcattaaca ggtacataga ggatggtgtg gtctgcccac acgctctgag gagagggctg 2940 ttcaccataa cagccatgga caacatagac cacaatccta ctgcaacaac tgcaaccacc 3000 tctttccatg gcaccagcat ttcagctttc cagctcccaa cagaaggtaa tcagggtgaa 3060 tttcgggaac ctctgacgct gaggcttggt gaggagaaag tgaagaaagt tccagaactg 3120 cccgagttct ataccaatat tcgtccagcc ttcttcacaa agaaaaaccc ttcgcctcca 3180 cgaagccatg gtgtgcagac agttcaagac aatactctgt taggaccaca attggcacta 3240 gagtatgcgt ggctggagaa agtcagtgtg gaagaagaaa cagacggagc agtgaatctg 3300 acctggtcag cccaccatgc ctcccagaag agaagcccta aagttgaggt tagcgtaacg 3360 tcactgttac ctctctttcg agatccagcc cactcagttg ctactatcag acatgttatg 3420 gacaaagtca tggagactgt aactttcctg aatccaggcc agatacctgt catgactgct 3480 gaccagccaa tctatgcatt ggcaaagcaa atccagtggc actggcctga gcagtatggt 3540 gaggataagt ttgtcatgat gtttggaggc ctacacattg agttggctgc actgaggtca 3600 gttggaacaa ttctacaggg cagtggctgg acgtgggccc tcgtggaggc aggggtggca 3660 tcctctggga ctgcagaatc tttcctatca gcagcaagca tcaccaggac acgccaggct 3720 caccagataa cagcatgtag tctctaccaa ctaatgaagg cagcctacag tgactattgt 3780 accgaggcag ctgacaactc tgaggaactg ctgagctttg atgcttggtg taatagccgc 3840 aagctgcaga gtccccagtt tcagttctgg agtttggtgc tgtcgatgga gctcgtgatc 3900 ctgctactga tccgagcatt tagggaagcc aacttcaacc tgtattgcca ggcattggct 3960 gagctcattc catacttctt cgccaacaac aacacaaagt atgcacgttg gctccccatc 4020 catctcaaag acatgttgac cctgaaagag aagcatccac agctggccga ggagtttgag 4080 agtggaaagt ttgttgttca caagtcaaga cgtgagttct ctggaatggc tattgaccag 4140 gcacacgagc aggccaatgc tgttatcaaa gctgatgggg gggcggttgg tgtgactgaa 4200 gatccctcgg cactgagacg atggatgata gctggtcctc aagtcagcca cttggttgaa 4260 caatatgaag cagcatctga agccaaggag gctgtagaac caaccagcca ccacaaccag 4320 acatcacagg cccaaagagt tttcatggag aacgtcaaga agctgaccca agttttaaag 4380 gagctgagta atcctttcca ggaagagact agggacctgt tgtcactgga taccaaggat 4440 atagctcatc ccagtgtagt tgaactgctc agtacacatt atgaaagggg cagaacatcc 4500 tttcaacagt ttatgggggg cctacaaaat agggaagtga ccaccttcta tgatcccatc 4560 aagaaaaaca tggttgactt cttccgacag caaccagcct ccactgatgc ttcaaagcga 4620 aaagtgctga aggaggattg tcagctcttc tcgaagctgt tcatctcatg ccagagcaga 4680 gaatgtgatc tgcaggagtt ctttcgccat gaaaatcagc agttccctgc ttccttgagt 4740 gaaggtggga agctttatac cagtcagaag tcccagcttg cagccatact tgagagtaag 4800 gttacaatac ctgatgtgga accgcaggct gatactatca tcatcgatgg gtcagcactg 4860 gtgaacaccc tgctgccacg cacctcaaag actttcgaag actatgccat cttggatgtg 4920 ctaccgatag tgcaagcata ctccaccaag tacaagagaa ctgacattgt gtttgatgtc 4980 tataaaccat caagtctcaa gggtgaaacc aggttaaagc gtgggcatgg agcaaaacgt 5040 agagtgacaa acaggggcag aataccctca aactggcgga acttcctgcg ggagagtgac 5100 aacaaaaccg agttattcag gttcctcgct gacaagatta cacagatgtc tacacccaac 5160 ttggtcattg tgaccagaga tgaagatgct gccagcaacc gcacaatcag cttggagggg 5220 atcgcaccat gcagtcagga ggaagcggac acacgcatct ttgtacatgc caggcatgca 5280 gtgcaagaag ggagcaaggt cctgatggtg aaagccagtg acacagatat cctcgtcata 5340 gcactcagtg ttctgcctct ccttcaacag tttggtctgc tgcagttgtg ggtggccttt 5400 ggccaaggat acaacctgag atggtttcct atacatgacc tgtacttctc cataggaatg 5460 gagaaaagca aaggaatact cttcttccat gccttcactg gttgcgatgt tgtgtcaggc 5520 ttccgcggca aagggaagaa gtctgcgtgg caaacttgga acgtgtgtgc tgaggcttct 5580 gatgtctttg caaagctcag ccagtatcca gtagcagtag atgatgatga tctgccagtc 5640 ctagagaaat tcgtcgtgac aatgtacgac agatccagta gtgttgcaag cgtcaatgat 5700 gccagattgg acctgtttgc ccgcaagcag agaccatatg aagccattcc tccaacgaag 5760 gaagcactcc gtcagcacgc aaaacgcgct gcttatcagg cgggctgcat atggagtcag 5820 tcaacactcc gccaaccaga aacacagagt cccgctgact ggggataggc gaagagcgaa 5880 gacacatgga aggtcttctg gacaacactt ccacctattg cagagagttg ccaacagctg 5940 accaagtgtg gatgcaagct ggaatgctat ggaagatgca aatgctacaa attcggtctt 6000 ccttgcactg gactgtgtag ctgcaaatgt gagatttaac ctcattggtg caagtaagat 6060 gttgttagga ctatgtaaaa gattcactgt aagtgtaagt aaaataagcc ctccaacagt 6120 cgtcgcggcc acgatataca tgtactataa actgaacagc caagaatgtt atttggcaga 6180 tcttacagca ggtcttacag ctgtattgaa tttgttctat ttttaaactt aagttttaca 6240 cactcaacac aacggaacgc agggaacacg ccatagctaa caaaacaaac aaaaagtcat 6300 gtttcacaga tttgctataa cgattgcata ctctcaggca tagataatgc tgttctgttc 6360 agttctccat gatgaaattt gcttcgtgtt gaaagatgct gcacagactg gaaacctcac 6420 gtgagtgcca tcctattcac tataccagtg ctcccatact cactccttca aattaatcta 6480 cagaaggctg gatcctcgtt ttttttaggt ggcaagtata cttaacaaaa ttactaaata 6540 taggttggca ttcatagtag tgaacgaata tgacgaccga ggactgatgg tgttttaatt 6600 attcattatg gactcttcac tgttcagttt gataaggatg gcaatgacta tcaatgtcaa 6660 aaaaatgtgg aacattctag ttgtgtaaat aaaatatttc gtcacaatat gtcctagttt 6720 gtctataatt ctcaccatct gttgctcaca aaagtattta aaaaaaactc attagaaggt 6780 tgagaggtga ccatttagct ttcccatagt caaaaaccca cttttagaca ttttaaacgt 6840 ccgaagttgt tccgcttcga attcggcagc ggcaaaaacg cccctgtagt tccaacgcaa 6900 gttcctaggg agaccaaatt taaacttacg tcacttcttc cttgcaccaa cagctttcca 6960 ccacataaaa gtcatgactg tagcatgttt cgaacaggag atacaaagcc cggaaggtct 7020 tctgcagtac caaaaagcgc ccttaggagc aatttggcag cggcaaaaaa cgccactgca 7080 gttccaacgc aagctgctag gaaaaccaaa cttacatcac ttcgtcctta cgctaacagc 7140 tttataccac ataaaaatca tgactgtagc ttgtttcaaa caggagatac aaagcccgaa 7200 aagtctactg cagtaccaaa aagcgcccct agggcaattc ggcagcggca aataagaaac 7260 gctgctgtag ttccaacgca agcagctagg gaaaccaaac ttacgtcact tgttccttac 7320 cctaccagct ttctaccaca gaaaaatcat gactgtagca tgtttagaac aggagatgca 7380 aagcccggaa ggtctgctgc agtaccaaaa ggcgccgcta gggggcgcca aaagtcaaat 7440 aaagtggccc gcgtactttt tcaaaacagt aaagtctaat gagtatctgt actaagtttg 7500 gtgcttgtat cacaattgta gttgaacatt tagctgtccc atacttttag acattttaaa 7560 cattcggagt tgttccgctt caaattcggc agcggcaaaa aaaaacgccc ctgcagttcc 7620 aacgcaagct ggtagggaga caaaacttac gtcacttcat ccttactaca acagctttct 7680 accacttaaa aattatgact gtagcatgtc tagaacagga gatacaaagc ccggaaggtc 7740 tgctgcagta ccaaaacgcg ccgctagggg gcaccaaaag tcaaatcaaa tggcccgcgt 7800 actttttcaa aacagtaaag tctaatgagt atctgtacca agtttggtgc ttgtatcaca 7860 aagtgaaaga ttttttcagt tatctgcccc aat 7893 // ID Copia-37_DPu-I repbase; DNA; INV; 5609 BP. XX AC ACJG01004148; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-37_DPu_; KW Copia-37_DPu-LTR; Copia-37_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5609 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004148; Positions 8148 2540. XX CC Positions [2486-3019] - Integrase core CC 'ATTAT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2396..3439 FT /product="Copia-37_DPu-I_2p" FT /translation="MVKEDLTIGLNINGDRDIPASLCSNCEFGKFSRIPLK FT TGRNRATRIGELTHSDVWGPIATTSIGGARYFVTFKDDYSGYTTVYFMKRK FT NEVPALIRLYRALLLNETGFYMLTLRSDNGKREYVNKENAEWLAQHGIRHE FT TSAPHTPEQNGSAERLNRTLLEPVRCMIIESGLPACLWAEAISYTTYIKNR FT VLSRTAQLTPYEYWNEKKPDMTHIRIFGSKAYVRNPTVSSKLEPRSQEGFF FT VGRCSTQNASRIYIPTTEKIVVSKDVKIDETILYRDHIAKDLLPSKVRSHT FT YFLLYFYIYIYILYVYSYVLLYTFLYMFPCFVHIVYIYIYISLFLLIYILL FT IIHVK" FT CDS 3358..5448 FT /product="Copia-37_DPu-I_1p" FT /translation="MFCTYSIYIYIYFFISTYIHTSYYSCKIIFFSFFPSK FT EQLQLAESNEESSMDTRETDVPITVETPNEEEACPVSAIEDEIPLITDETL FT HEGSNLPTDPLIHNEATQEPGMIIDQDAHDATPIAAEVPQANSVDQPRRST FT RIRARQIQSVAKQAVLSNDSAIQDSTGANPTEPETYREAINCPEAEFWIEA FT MNEEYNSLIKNNTWTLCRLPPDRKAIEGKWICKYKPGFKTTSPRYKARFVI FT KGFSQIHGIDYTETYAPVAKTFSFRMIMAIAAEKDLEMIQLDVKTAFLYGT FT LEEEIYMKQPEGFIIPGKEEEVCRLVKSLYGLKQTSRVWNVKFNEFIVALG FT LERSKCDPCIYYRHLRQGEPDEEITFFILYVDDGLILSNQNAVLLKMIEFL FT GKEFEIRSLPADRFIGVNIERDRTHRTIHLSQPDYVKTILARFGMTNCSSI FT TVPADPCVKLSPSMCSRTEEEEAPMNNVPYLEGVGSLMHLANLTRLDISFA FT VGQVSRFSQNPGMEHWKGLKRILAYLRKTVNHGLLFGGGSNELCGYVDADY FT AGDLENRRSTSGAVFILNNGPISWHSRRQTCVALSTTESEFIAASDGTKEA FT TWLRRLYAELGGADLSVPLHCDNQGAIAQILNPIFHQRTKHMDVRFFFVRD FT AQQEGKIDVIYIETELQLADIFTKALPTPRFEMLRHNLGVQELNIQNA" XX SQ Sequence 5609 BP; 1694 A; 1382 C; 1062 G; 1471 T; 0 other; ggttatgggc ccagatttac gttgctgaac actgactgac aatggccgag aacgaggttc 60 aagataacca acctttaagg gcaattgccc acctgccaaa attcgatggc accaaccacc 120 gagagtggaa ttttgaaatt gatttagtct tccaacacca tgacttgaaa gacgtagtgt 180 tggggaacga agtactccca gaagaggtga caatgtcact cattctattt cttttgtaaa 240 tattttagct gtatcgttct atcagcacat gtcttattaa atgctcaatg cagtttaagg 300 agtgtgattg tcgcatggac aatccactta atcagaacac acagcacttt gcattcagcc 360 ttaacgctgt aaattttctt ttgttcttca ttttacaaac tgatattagt cagccttcac 420 gctgctcatt cagtattact gctctgcagt cactaagagg acaaaaaaaa ggtcagcctt 480 aacgctgtaa attttctttt gttcttcatt ttacaaactt atatcagtca gccttcatgc 540 tgctcatttc actactgcac tgcagtcact aagaagacaa gacagtcagc cttaacgctg 600 tttattcttt actgttcttt gcaaacttac atcagtcagc cttcatgctg cccatttcac 660 tgctcagctt gctcgattgc tctgtaatta ttcagagact ttgtcaatca gccctaacgc 720 tgcttactct attgttcact gtgcagtcac atcaataagc ttgcatggta tccattccat 780 tgcatttttt tacacagaac aatctaatca gccttaacgc tgcagttgat ctctacacat 840 atatgtataa ttgcattcaa tatgtatact ttgttaatat tcttatgttc atcatattaa 900 cctcatattt cctctgttgt aggatcgaaa tgaagctggt gaattggtga atgatgcagc 960 cataaggcta tggacgagga aaaacatcac cgctattaac ttcatatttg cctcaattac 1020 gaaaaaaatg aaagaaaact tatacactcc tggcctcaat gcggcacaaa tatgggcgaa 1080 gttaaatcta caatatcagc tccaaaccga agaacagctg catcttctat ggcaacaata 1140 ctacgacttc aaacatacag ctggtacctt tcctaatatc attcattttt tttcccttac 1200 tgaacaagtt tctctcctta acaggagacg atatgagaac catgattcag aaactctcaa 1260 acatagctga ccagctaaga gaaagagaac agatcttgcc tgaagtccaa cttgtctcca 1320 aggctcttgc aacacttccg gaaaacttcc gaatcgtcag gactgtttgg accagtcttc 1380 cagcgaatga ccggactctt gatcatctac tgcagcgact cataactgaa gaaagcgttc 1440 tgaagtctta ccaaacaaaa acggaacaca acaatgaagc tgcattcacc gctaacaaca 1500 gccgccaaag atttaactct ggtcgcggag gatctggaag agaacgtcat ggggtgcaag 1560 gtggttttgt ggacaaacgg ccacgttgtg gccactgcaa cagcccaacc catgaagaaa 1620 aggtctgttt ccaaaaacac ggctaccccc caaactggaa caccacgaaa agaggacgag 1680 gtagaggcag aggtgggggc aacccatcca attcatccaa cgccctattg tccttatcaa 1740 atctggatcc gagaagtata ctgacattcc tcgtggactc aggagcttcc aaacacatga 1800 gtgaccaaag aagctttttc acaaccttca aaccaatcaa agccggcact tggtctgtca 1860 aaggtaagta attttttttt ttattttttt tttattggcc aaaaggagct tctaatctca 1920 aatcttttct ctctcatccc aggcatcgga gactctgaac tggacgctct aggcgtcgga 1980 aacattgacg ttattgtgga agttgatcag acaacaacat ccagaacatt aacggaagtt 2040 ctatatgtct ctggacttgg ctcaaacctc ttctccgtga gtgcagcaac tgccaatggt 2100 ctaatagtca cgtttgacga taacaaggta tcatattatc ctcaaaacca gttgcatcac 2160 aaaccattca caatttttct ttgggatctt tccccaggtt ctctttaaga gaaacggcga 2220 aacagtcctc acgggggaga aacaaaacga gactttattc caactccaca agaaaccgga 2280 cattcatttg acccagagtg acaacatcca acctctcact ctcgtcgcgc ttcgagcatc 2340 aatcatcacg tggcatcgcc gcttaggtca cataggctac cacacactct tgaagatggt 2400 aaaagaagat ctgaccatcg gattaaacat caatggcgac cgtgacatcc ccgcgtcact 2460 ctgttccaac tgcgagtttg gcaaattttc cagaatacct ttgaaaacgg gaaggaacag 2520 agcaacaagg atcggagaac ttacccactc agacgtatgg ggtcccatcg ccaccacaag 2580 catcggaggg gcccgttatt ttgtcacctt caaggacgac tatagcggat acacaacagt 2640 ctacttcatg aaaaggaaga acgaagttcc ggccttgatt cgactttacc gcgctcttct 2700 actaaacgaa accggcttct acatgctcac acttcgttcg gacaatggca aaagagaata 2760 tgtcaacaaa gaaaacgccg agtggctcgc ccaacatggg attcgtcatg aaaccagcgc 2820 tcctcacaca ccagaacaaa acggctcagc agaacgtctc aaccggaccc ttttagaacc 2880 agtccgatgt atgataattg aaagtggcct acccgcttgc ttgtgggcag aagccatctc 2940 atacaccacc tacataaaga atcgggtcct atcaagaact gcccaactca ctccctatga 3000 gtactggaat gaaaagaaac ccgacatgac tcacatcagg atatttggat caaaagctta 3060 tgtcagaaac ccaaccgtct catcaaaact agaacctcgt agtcaagaag gtttctttgt 3120 cggtcgttgc tcaactcaaa acgcatctag aatctacatt cctacaaccg aaaaaattgt 3180 cgtaagcaaa gacgtcaaaa tcgatgaaac cattctttac cgtgaccata tagcaaagga 3240 tctactacca tcaaaggttc gttcacatac atactttctc ttatatttct atatatatat 3300 atatatactc tatgtatatt catatgtact gctatataca tttctatata tgtttccatg 3360 ttttgtacat atagtatata tatatatata tatttcttta tttctactta tatacatact 3420 tcttattatt catgtaaaat aatttttttt tctttcttcc cttcaaagga gcaattacag 3480 ctcgctgaaa gcaacgaaga atcatcaatg gacactagag aaactgacgt tccaatcacc 3540 gtagaaactc ccaacgaaga agaagcttgc ccagtatctg caatcgaaga cgaaattccc 3600 ctcatcacag acgaaacact tcacgaaggg tcaaaccttc caactgaccc cctgattcac 3660 aatgaagcta ctcaggaacc cggaatgatc atagatcaag atgctcacga tgccactcca 3720 attgcagcag aagttcctca agccaacagc gtcgatcaac cacgccggtc aacgcgcatt 3780 agagcaagac agatccagtc agtcgccaaa caagcagttc tttctaatga ctcagccatc 3840 caagattcaa caggagccaa tccaacggag cctgaaacct atcgtgaggc aattaactgt 3900 ccggaggctg aattctggat tgaagcgatg aatgaagaat acaactctct aatcaagaac 3960 aacacctgga cactatgccg actgccaccc gaccgaaagg caatcgaagg caaatggatc 4020 tgcaagtaca agcccggttt caaaacgact tccccacgat acaaggcaag gttcgtgatc 4080 aaaggatttt ctcaaattca cggcatcgac tacaccgaga cttatgcacc agttgccaag 4140 accttctcct tccgaatgat tatggccatt gcagcagaaa aagatcttga aatgatccaa 4200 ctcgatgtaa aaaccgcatt tctttatgga acactggaag aagaaatcta catgaaacaa 4260 cccgaaggtt tcatcatccc tggaaaagaa gaggaggtgt gtcgtctcgt taaaagtctc 4320 tatggcttga agcagacctc ccgcgtttgg aacgtaaaat ttaatgaatt cattgtagcc 4380 ttgggtcttg aaagatctaa atgtgatccc tgcatttact accgccacct ccgtcagggg 4440 gagccagatg aggaaataac cttctttatc ctgtacgtag atgacggcct gatcctgagc 4500 aaccaaaatg cagttctgct gaaaatgatt gaattccttg gcaaagaatt cgaaatacgc 4560 tctcttccag cagaccgatt cattggtgtc aacattgaac gcgaccgaac tcatcgaaca 4620 attcacctct cccaaccaga ctacgtgaaa acaattctag caagatttgg catgacaaac 4680 tgcagctcca tcaccgttcc agctgaccct tgcgttaaac tgtcgccatc catgtgctct 4740 cgcactgaag aagaagaagc cccaatgaac aatgttccat atttggaggg ggtcggctcc 4800 ctaatgcatc ttgctaatct gacacgacta gacatctcat ttgccgttgg acaggtttca 4860 cgtttttccc aaaaccctgg catggagcac tggaagggac tgaaaagaat cctggcctac 4920 ctgcgcaaaa ccgtcaatca tggactttta tttggtggtg gcagcaacga actctgtggt 4980 tacgtcgacg cggattatgc cggagatctg gagaataggc ggtcaacatc tggagcagta 5040 tttatcctca ataatggtcc catctcttgg cacagccgtc gccaaacctg tgttgctctg 5100 tccaccacag aatcagaatt catcgcagcc tctgatggaa caaaggaggc aacatggcta 5160 agacgtctgt acgctgaact aggtggtgcc gatctatccg tccctttaca ttgcgacaat 5220 caaggcgcca ttgctcaaat tctcaaccca atattccatc aacgcacaaa acatatggac 5280 gtacgcttct tctttgtacg cgatgctcaa caggagggca aaattgatgt catttacatc 5340 gagacagaac ttcagttagc cgacatcttt acaaaggccc tgccgactcc gagattcgag 5400 atgctgcgcc acaacttggg tgttcaagag cttaatattc agaatgccta aattaaggga 5460 cgatcaagtt tccttatatg tcatgaaatc tcaattattc aaatgccttt gaactgcatc 5520 cttatttact aacccattga gggacgtttc ttcttccttg tctttgaatt ccaatattct 5580 tgtttggggt gcgcttaact tgagggacg 5609 // ID SCAR_MA repbase; DNA; INV; 1061 BP. XX AC AF387098; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 2) XX DE Meloidogyne arenaria sequence characterized amplified repeat DE sequence. XX KW SCAR_MA; Dispersed repeat; KW sequence characterized amplified repeat. XX OS Meloidogyne arenaria OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina; OC Tylenchoidea; Meloidogynidae; Meloidogyninae; Meloidogyne; OC Meloidogyne incognita group. XX RN [1] RA Sui D.D., Lewis A.S., Fortnum A.B., Dong K. and Kluepfel A.D.; RT "Simplex PCR Identification and Multiplex PCR Diagnosis of RT Root-knot Nematode Species without Restriction Enzyme RT Digestion."; RL Unpublished. XX DR Genbank; AF387098; Positions 1 1061. XX SQ Sequence 1061 BP; 285 A; 184 C; 148 G; 325 T; 119 other; ttgannngct ncgnnnccta tnkggcgact gggccgcggg aattcgattg gggggttggc 60 caataatgag atagagtcga gggcatctaa taaaggtatc ttttaataag atccgaccat 120 atttttatat ttaaacacct ttttcttctt cctgttcatc tccattttct attcttcttt 180 ccaattccct gttctctgca cgaataacca ttaaatcatc taataatctt tcaagttgtc 240 catccactcc tttytgagtt tccagttcaa ttttatttgt caattcatct aanwnaaagn 300 attccatttc tccttgggtt tgtantgcgc tggnnntaag ggcncattnt tcawganttt 360 ntacttaaan aggaaatttc tttaanagaw wtttttcttc agancagtwc agytcatgtn 420 agtccnnccg tncantccng nataasgtgn gaagggatgg tcattggana nngaatggnm 480 cnytctncaa gawaccaaag gataaaaatc ctagactgay cnatataaaa cttatgatca 540 aaaactgact ttttcanaaa agatacvatt ttygsncatt tatsgatcat aactttytta 600 taaancaggc aaaaaatttt gawttttkgg atcttgkagt gngtttaaty ntatntccan 660 tganmccang aaccnaaraa attatccttc wtaagtnata ggtscnnctt atnamynnng 720 gknctctnna ttnanttwta aattnaccan gcmnttnwta ngcnctttnn cttantnwwc 780 nwtnryywtt tgtttnkktt ctttctgtgt taaagtcatt gaagaggata ttccaccacc 840 ataaacattt ccatttgcac caatccatct gtctacagca gtaaatgttc gatcaagtat 900 tccatcaagt tgaactattt ctcgacgaat ttgttgaaca gaagaggctg tctaaaaaga 960 aataaaaata atttgaaaaa cgattccttt gaatattcag cccccaaccc cccaatcact 1020 agtgaattca ttctagaggt cacgncnngn nacggatnaa a 1061 // ID BEL-19_AA-I repbase; DNA; INV; 5487 BP. XX AC supercont1.314; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-19_AA_; KW BEL-19_AA-LTR; BEL-19_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5487 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.314; Positions 757908 763394. XX CC Positions [4528-5100] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 4273..5487 FT /product="BEL-19_AA-I_1p" FT /translation="MTLTPGFAAKFPTILPQQSRISFLLADKFHRRFRHAN FT RETVINEMRQEFYIPKMRTLVAKVAQDCMFCRVRRAVPRPPPMAPLPKVRL FT TPFVRPFTFVGLDYFGPVLVKVGRSNAKRWIALFTCLSIRAVHMEVVHSLS FT TESCIQAVRRFVSRRGSPAEIFSDNGTSFHGANNQLQKEIKDRNAALASVF FT TNSNTRWSFNPPGTPHMGGVWERMVRSVKEAIKSTLESSRKPDDETLETVI FT IEAESMINTRPLTYIPLESADQEALTPNHFLLGSSSGVKQLPVLPTDYRTT FT LRSSWNLAKHLVDEMWRRWISEYLPVISRRCKWFDHVRDLKEGDLVLVVNG FT NVRNQWIRGRVEHVVVGKDGRVRQAWIRTRAGVDRRSVANLALLDVMENGE FT QDDGSGSREGG" FT CDS join(36..2993,2997..4352) FT /product="BEL-19_AA-I_2p" FT /translation="MSLQVDKADAAASPANCTKCDLPDSAENMVACDGCGS FT WYHYSCAEFDESVAELPWKCDSCGIMNQTPIPTKTRPKKGAQRLVIPGGAE FT KQPKSSGGSKTGSRRGKKSAVGDNVSATSSAKARLALELQMIDEQDRMREE FT ELQAELEMKNKKLMLERQTRDRELELEARKLAEEKAFQQKQLDEEEKNRKA FT LLEMKRLSLEAKKSIVRQFSQIGSEVSSLSGSEVTSSEEKVRSWLQKSSEQ FT TEGNGQSTSAAKGTNNVLENSSKAVFSGIRGPILGSTITTKHSIDKVPHEL FT EAEKDEDDSACQIQPILTSSKAGHGIASGGVHDPEENGPTSRQLAARQVMG FT KDLPTFSGNPEEWPIWISNFQRSTETCGFSLDENLIRLQRSLKGPAMEMVR FT CRLLSPASVPHVIKVLQMRYGRPEILIRALTEKIRQLPPPKMDDLDSIVDF FT GLAVDSLVEHLKTAKQQAHLTNPSLLHDLVAKLPVDYRLKWSAFKSSVAVV FT NLGTFGRFTSSLVELAYEVMDDQPTTKTTKGVTQKPKNRSYVHAHTESSSP FT GPSSANSCSCNANSVKKFCLACERDGHKVAECFIFKAMSIDDRLKIVSQNA FT LCRTCLNQHGKWPCKTWKGCGIEGCRLRHHTLLHTVTPAVPHLVSSQLERG FT ICCPIFRIVPVTLFGKDHRVNVFAFVDEGSQTSLVDDAVAEQLGVTGPSGT FT LSLQWTGNVTRDEVNSRIIQMEISGKSSMNRYKIVGVRTVECLRLPAQSLC FT YKDLAEIYPHLRGLPINDYADVTPKLLIGLDNLKLTIPLKIREGDWGQPMA FT AKCRLGWSIYGCSRSATESVICGFHVGGWTNSESELNQIVRDYITLDNSGV FT QPPISPLESEEDKRARLLLQSTTRRVGNRFETGLLWKWDNVQFPDSYGMAF FT RRLRSLEKKLDKEPQLYECVRRQIQEYQEKGYAHRVTEEELSKTNSGKCWY FT LPLGVVLNPKKPNKVRIIWDAAAKVNGISLNALLKGPDYLASLIEVFYHFR FT LYAVALTGDIKEMFHRLLIRQEDRQFQRFLWRDRDSDEVNVYVMDVAIFGA FT TCSPSSAQYVKNVNALEFESEYSRAVTAIVRYHYVDDYLDSFPTAEEAVKV FT GTDVRRIHAAGGFEIRNFLSNDPAVAVCVGESSTSEEKMIKPEKEINVESV FT LGMKWIPIGDNLTYTFVVRADLEHVLDKTHAPTKRQVLRVVMSLFDPLGLI FT SYFVIHGKILMQEIWASGVNWDEPIGSELCEQWQRWTDLLPQLNSVRIPRC FT YFPTANYQTYSSLQVHVFVDASKSAYASVVYFRVESIGGIAVTLVAGKAKV FT APLKMMSIPRLELQAVVLGTRLLNSVIAMHKLPVSRRVLWTDSQTVIAWLR FT ADHRRYQQFVGFRVAEILSTTDMQEWRKIDTELNVADMATKWGKGPSFDDS FT NTWFRGKIPNDSTAAVSNQFSACG" XX SQ Sequence 5487 BP; 1495 A; 1211 C; 1516 G; 1265 T; 0 other; attctttaga atttttcatc gcgagatccc ctaagatgag tcttcaggta gacaaggccg 60 atgcagctgc atcacctgcg aattgcacaa aatgcgacct tccggacagc gcggagaaca 120 tggtggcatg cgacggttgt gggtcttggt atcactacag ctgcgcggag ttcgacgaga 180 gtgtcgccga gctgccgtgg aagtgcgatt cgtgtggaat tatgaatcaa accccgattc 240 cgacgaaaac acgaccgaag aaaggtgcgc aacgactcgt gatcccgggc ggtgcagaga 300 agcaaccgaa atcgtccggc ggcagcaaga cgggctcacg gagaggtaaa aagtccgctg 360 taggagacaa cgtaagtgcc acttctagtg ctaaggctag gctcgcactt gagctacaga 420 tgattgatga gcaggatcgg atgagagaag aggagttgca ggctgagtta gaaatgaaaa 480 ataaaaagtt aatgctagaa aggcagacta gagatcgaga attagagtta gaagccagga 540 agttagcaga ggagaaagct tttcagcaaa agcagttaga tgaggaagag aaaaatcgca 600 aagctctgtt ggagatgaaa cggctgtcac tcgaagcgaa aaagagcatc gtccgtcaat 660 tttcgcaaat cggtagtgaa gtgagcagct taagtggaag tgaagtgacg agttccgaag 720 aaaaggttcg aagttggttg cagaaatcca gcgagcagac agagggaaac ggacaatcca 780 cgtcagcagc gaagggtacc aacaacgtcc tggagaattc gtcaaaggcg gtgttcagtg 840 gaattcgtgg accgattcta ggctcaacta taacaacgaa gcactcaatc gataaggttc 900 cacacgaact agaagcggaa aaggatgaag acgattctgc gtgtcaaata caaccaatac 960 taacaagcag caaggcaggt catggtattg caagcggtgg tgttcacgat ccagaggaga 1020 atggcccaac cagtcggcag ctagcagcac ggcaagtcat gggaaaggat ctgccaacgt 1080 tttccgggaa tcctgaagag tggcccatct ggataagcaa cttccagcgt tcaacagaga 1140 cttgcggatt ctcactggac gaaaacctga tccgtctgca gcgtagtctc aagggaccag 1200 caatggaaat ggtgcggtgc agactgctct cccctgctag cgttccccac gtaatcaaag 1260 ttctgcaaat gcgttacggt cgaccagaaa tattgatacg ggcactcacg gaaaagattc 1320 gacagctccc ccctccaaag atggatgatt tggatagtat agtagatttc gggctagcgg 1380 ttgatagctt ggtcgaacac ctgaagaccg caaagcaaca agcacatctg acaaatccgt 1440 cattgttgca cgacctggtc gcgaaacttc ctgtcgatta ccggctgaaa tggtcggcct 1500 tcaagagttc tgtggcagtg gttaatctcg gaacgttcgg aagatttacg tcgtcgttgg 1560 tggagttagc ctacgaggtt atggatgacc agcctactac gaagacaaca aagggagtta 1620 cccaaaagcc caagaatcga agctacgttc acgcgcacac ggaaagttct tcgccagggc 1680 caagttcagc gaacagttgt tcatgcaatg ccaactcggt gaagaagttc tgtcttgctt 1740 gcgagaggga tggacacaag gtggcagagt gtttcatctt taaagctatg agcattgacg 1800 accggctcaa gatagttagc cagaatgctc tttgtagaac ctgcctgaac caacatggca 1860 aatggccttg taaaacgtgg aaaggatgcg gaatagaagg ttgtcggtta cgtcatcata 1920 cgctgctgca caccgtcacg ccggcggtac cacatctagt atccagtcag ttggaacggg 1980 gaatttgttg tccaattttc cgaattgtgc cagtgacgtt gtttggaaaa gatcatcgag 2040 ttaatgtgtt cgcatttgtg gatgaaggtt cgcagacttc gcttgtggat gatgctgtag 2100 cagagcaact aggtgtcaca ggtccatctg ggacattaag tctgcaatgg acaggtaacg 2160 taaccaggga cgaagtaaac tcacgcatca tacagatgga gatatccggc aagtcgtcaa 2220 tgaatcgcta caagatagtg ggtgtgcgta cggtggaatg tcttcggctg ccggcgcagt 2280 ccttgtgcta caaagacttg gcagaaatat acccacatct ccgaggtttg ccaatcaacg 2340 actacgcaga tgtaacaccg aagctcctta ttggtcttga caacctgaaa ctcacgattc 2400 cactgaagat tcgagaagga gattggggcc aacccatggc agccaaatgt cgtttagggt 2460 ggagcattta tggctgttca cgatcagcta cggaatcggt aatctgtggg tttcacgttg 2520 gaggatggac caattcggag agtgagttga accaaattgt ccgtgattat ataacgctag 2580 acaattccgg agtgcaacct ccaatatctc ctttggagtc ggaggaagat aagcgggcac 2640 gactgctact ccagtccacc actcgcagag tcggcaaccg tttcgagact ggtttattgt 2700 ggaagtggga caacgtgcaa tttccggata gctacgggat ggcatttcgt cgattgcggt 2760 cgttggagaa gaagcttgat aaggaaccac aactttatga atgcgttcgg cgacaaatac 2820 aggaatatca ggagaaggga tacgctcacc gtgtgactga agaagaactc tccaaaacca 2880 attctgggaa gtgctggtat ttgccgttgg gagtagtcct gaaccctaaa aaacccaaca 2940 aggttcggat tatttgggac gccgccgcaa aggtgaacgg tatatcgctc aattaggcgc 3000 ttctcaaggg tccggattat cttgcttcgc taatagaagt attttatcat ttccgtctct 3060 acgcagtcgc attgacagga gacataaagg agatgtttca tcgcctcctc atccggcagg 3120 aggaccgcca attccaacga tttctatggc gggatcgtga ctccgatgaa gtaaatgtgt 3180 acgttatgga tgtggcgata tttggtgcga catgttctcc aagttcggca caatatgtca 3240 agaatgtgaa tgcactggag ttcgaaagtg aatattctcg cgcggtgact gcaatcgtcc 3300 gttaccacta cgtcgacgat tatctcgaca gctttcctac ggcagaagaa gcagtaaagg 3360 tggggactga cgttcggcgt atacacgcag caggtggatt tgaaatccgg aattttcttt 3420 caaacgatcc ggcggtcgca gtgtgcgtag gagaaagctc cacatctgag gagaaaatga 3480 tcaagccgga gaaagagatt aatgtagagt ctgtccttgg tatgaagtgg atcccaatcg 3540 gtgataacct cacctacaca ttcgtagtac gcgccgacct tgagcatgtg ctggataaga 3600 ctcatgcgcc aaccaaacga caagtgctgc gcgtggtcat gagtctgttt gatccgttag 3660 gattgatcag ttatttcgtg attcacggaa aaattctgat gcaggagatt tgggcttccg 3720 gcgttaactg ggatgaaccg attggcagcg aactttgcga acaatggcaa cggtggacag 3780 atcttctgcc acaactgaac tccgttcgaa ttcctcgttg ctacttccca acagcgaact 3840 accaaaccta ctcttctctc caggttcacg tcttcgtaga tgctagtaag tcagcatatg 3900 ccagcgttgt ctatttccga gtagagtcga taggaggaat agcggtgaca ttggtagcag 3960 gcaaagcaaa ggtagctcca ttgaagatga tgtccattcc tcgattggaa cttcaagctg 4020 ttgtattggg cacccgattg ctcaacagtg tcatcgcgat gcacaagctt ccagtcagtc 4080 gtcgcgtttt gtggacagat tcccagacgg taattgcttg gttgcgtgcc gaccaccgac 4140 gatatcagca gttcgtaggc ttccgggttg cggaaattct ttcgacgacg gacatgcagg 4200 aatggaggaa aatcgacact gagctaaacg tagcggatat ggccacaaaa tggggtaaag 4260 gaccgagttt tgatgactct aacacctggt ttcgcggcaa aattcccaac gattctaccg 4320 cagcagtctc gaatcagttt tctgcttgcg gataagtttc atcgtcgctt tcgtcatgct 4380 aatagggaga ctgtcattaa cgaaatgaga caggagttct atataccgaa gatgcgtacg 4440 ttggtggcaa aagttgcgca ggattgcatg ttctgtcgtg tgcgcagagc tgtaccgcgg 4500 ccgcctccaa tggccccact gccgaaagtt cggttgacgc cgttcgtacg gccattcacg 4560 ttcgtcggct tggattactt cgggccggtt ttggtgaaag ttggccgcag caatgccaaa 4620 cgctggattg cactcttcac gtgcttgagt attcgggcag ttcacatgga ggttgtccat 4680 agtttgtcaa cggaatcctg cattcaagca gtgcgcagat ttgtatctcg tcgaggatcg 4740 ccggcggaaa ttttcagcga caatggtacc agctttcatg gcgccaacaa ccaattgcag 4800 aaggagataa aagatcggaa tgccgcacta gcatcagtgt tcactaattc aaatacccgt 4860 tggtcgttca atccaccggg cacacctcac atgggtgggg tatgggaaag gatggtccgg 4920 tcagtcaagg aagccatcaa gtccacgctg gaatcgtcga ggaaaccaga tgacgaaacg 4980 ctagagaccg tgatcatcga agcggaaagt atgattaata cgcgtccact tacctatatc 5040 cctttggaat cggcggacca ggaggcacta acgccgaacc acttcttgtt gggaagctct 5100 tctggtgtaa aacagttgcc agttcttcca acagactacc gtacgacgtt gagaagcagt 5160 tggaacttgg cgaagcattt ggtcgatgaa atgtggagac ggtggatatc ggagtatctt 5220 ccggtgattt cacgccgttg caaatggttc gaccacgtga gggatttgaa ggaaggagac 5280 cttgtgctgg tggttaatgg aaacgtgcgg aaccagtgga ttagaggacg agttgaacac 5340 gtcgtagtgg gaaaagatgg acgagtgcgg caagcatgga tacgcacgag agctggagtc 5400 gatcgcaggt cggtagcaaa cttggctcta ctcgacgtca tggagaacgg tgaacaagac 5460 gatggtagcg gttcacggga gggggga 5487 // ID Gypsy-14_OD-I repbase; DNA; INV; 8369 BP. XX AC CABV01004575; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_OD_; KW Gypsy-14_OD-LTR; Gypsy-14_OD-I. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-8369 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01004575; Positions 1229 9597. XX CC Positions [4248-4721] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 2368..3477 FT /product="Gypsy-14_OD-I_1p" FT /translation="MKVTPRTNPPRPVPIGLKPLVASELQKWKALGMIRET FT QSGFNIPLLILRKPDGSIRVSLDARLINTKLVQDRFPLPAIPDVFAKVSER FT IKSSDNCFISTVDFARSYNQIQIAENDCHKIAFSHEGRHLESSRLLYGLST FT APGGFSRIMAKLFGENPSFISYMDDLIVVDSDLEDHKKNLRTLFETCRKYG FT LVLNGKKVVLCASSLDFLGHTINKKGIFPLSKHMLNIRMFERPTSRSSLKR FT FLGMCNFQLKFFPKLGLTLDPLNRLLSKEFHTFCWTDQAEQAFVAVKEMLT FT ECTGLAHFDRNLSLWLVNDASGHGLGSTLYQLNKNDEFEPIGYHSRPFKGP FT DLKRSIREKELMAIAVRLPAFFLLPHG" FT CDS 5193..6707 FT /product="Gypsy-14_OD-I_4p" FT /translation="MDRDTNTAWWYSDAGIGVEQEESLYFIDSSVEVIFYR FT HYSSPCLLDINTIAKEAVCSSTSAQKDVGANILKIAEACDREWNSELNILS FT QYASHNFQNSVYYSHPQPLSSSRAPRAISLVAVSAFVSIFTSLITSGLGLY FT SAFATHNSNAIKESLTDYIKFQHGEKMMVALGLENNREATSVFLEYACEKD FT LYDSNFKVTNAASKVLKSYTDAVVQESLSLSFGTWPRNMEFLESLLKLCTE FT TGANTKHFCLDAIYSSQVNFRFLGSVSENGTLSHRLLINMPIQATSLRRNS FT FYKLINLGFWASENFMRFDLPASVIKSDNLYFEITKESCLKNFCNIDLISI FT GKHSSCLSSLFEENTTEDCNVVIVKEPESCDIRRLSNSTIIQAREATVLLD FT SDEGPLTIPLNKGNLMVSLPGKLLCYSRNDSETFLLPKPMVSHYSSLLPGQ FT IEVSHAINTSKLLELEAKLSEGLKIEKRLRSLEFSDDRLEVNGAEYPTPFG FT LLLSPASQF" FT CDS join(39..1424,1428..2489) FT /product="Gypsy-14_OD-I_2p" FT /translation="MSTPGTEFYSKTVEHLSLLELVCLAARSNVETSKRQT FT KREIAKNIGRRFNFFLENTDRRFYFKRETSKISLLPTPTTEDDKIKFPPKE FT VASNNRLNVEEIKLEIKKETETFEGDLFMEIKEENDNGDTDEVDKVPEQQT FT EPQPTQQQQRPDDRSVKEQDDESFPPVTPIPSLIGKDSSTKHVYSNQFSDY FT RRTEPRRTTFNQKIKYDPELGVEAFIQSVQSYCSANFIREEDRIIDIARAA FT LNTSADGVVIQETLTPYELVSWPLFKARIRDALGFSPNDYREDFDTFKRGD FT IKIGVAFAKLVRYYKRGYLNENEEIDSKDQKLICKQFIRSFEQPLRTLLMA FT EEDSLTFSNIVNRAGHLERVYRPQREQIAAIRPSGPTQPNLDTQILELINS FT FKEQSELTKKLLAENTRGKGSRPKSDRPNPEGYCIDNMKGRCKRGAECKYS FT HSPAPAHVAARFQPNEKLVSHGTAPEQIFAVVPNALSNISPRLKYVTVFVS FT GIKSLAMIDSGATRTVVRAGEPFLAGLPTRNAQIKLVTADNTAMTANKLVD FT LPIRFTNQAKSLISTIIVENLASPIILGLDFIKDLTFSSDSIFVHLNGNHL FT RLVDPSVSVKSIRALNKLELEPYAISCIDPKIPHNLIGTFFVDGQSNLRNP FT DFEVIPTVYSSTHPSQIFVKNKTGKVITIRKSERIAIANQIELNCNAIVEV FT TNVEREEDEVENFQAEREERAENLKFEPVIKSYGSLEGDDLDNIKQLVREQ FT RLSFQINDQDLGKCGHFRFTVPLLDESDTAHQPPKASTDRLKTPCSQRVTK FT MESVRNDKGNAKRF" XX SQ Sequence 8369 BP; 2580 A; 1848 C; 1626 G; 2315 T; 0 other; aattggtgac aaagtggtca aaaaaaagag cctacaaaat gagtaccccc ggaactgaat 60 tttattcgaa gaccgtcgaa cacctttcgc tgctagagct tgtttgcctg gccgcgagaa 120 gtaatgtcga gacttcgaaa agacaaacaa aaagagaaat cgccaaaaac atcggtcgca 180 gattcaactt ctttctcgaa aataccgatc gacgatttta cttcaaaaga gaaacatcaa 240 aaatctcact tctcccaaca ccaacaactg aggatgacaa gataaaattt ccacctaagg 300 aggttgcttc taacaataga cttaatgtag aggaaataaa acttgaaata aaaaaagaga 360 cagaaacctt cgaaggggat ctttttatgg agatcaagga ggagaatgac aatggcgata 420 ctgatgaggt agacaaagtt ccagaacagc agacggaacc acagccaaca cagcaacaac 480 agagacccga tgacaggtct gtaaaggaac aagatgatga gtccttccca ccagtcactc 540 cgatcccatc tcttatcgga aaagatagct ctacaaaaca cgtctactca aatcagttct 600 cagactacag aagaacggaa cctcgtcgaa cgactttcaa ccagaagatc aagtacgacc 660 ccgaacttgg agttgaagcc ttcatccaat ctgtccagtc gtactgcagc gccaacttca 720 tccgagaaga agacagaatc atcgacatcg cacgcgccgc actcaacaca agcgctgatg 780 gtgttgttat acaggaaaca ctcactcctt acgagctcgt ctcctggccg cttttcaagg 840 cacgaatccg agacgcactt ggattctcac ctaatgacta ccgcgaagat tttgacacgt 900 tcaagagggg cgatatcaaa attggagtcg cttttgcaaa acttgtcagg tactataagc 960 gaggttacct aaatgaaaac gaggagatcg acagcaagga tcagaaattg atctgcaagc 1020 aatttatccg cagcttcgag cagccacttc gcacgctact gatggctgaa gaagactcgt 1080 tgaccttttc aaacattgtt aaccgcgccg gtcacctgga acgagtttac cgaccccaaa 1140 gagaacaaat tgccgccatc cgcccctcag gaccaactca acctaatctt gatacgcaaa 1200 ttctcgaact cataaattct ttcaaagaac agagtgagtt aactaaaaag cttctcgcag 1260 aaaacacccg cggaaaagga agccgaccca aatctgatcg accaaatccc gaaggctact 1320 gcatcgacaa catgaagggc cgttgcaaaa gaggagcaga atgcaaatac tcccacagcc 1380 ctgcaccagc ccatgtcgca gcccgattcc agcccaacga gaaatgacta gtaagccacg 1440 gtactgcccc agaacaaatt ttcgctgttg tcccaaacgc tctttcaaat atttcgccta 1500 ggttaaaata tgtaactgtt ttcgtaagtg gtattaaatc ccttgctatg attgattccg 1560 gagcaaccag aacggttgtt cgcgccggtg agccttttct tgctggttta cccacgcgta 1620 acgctcaaat aaagcttgtc accgctgata acacggcaat gacagccaat aaactggttg 1680 atttaccaat aagattcacc aatcaagcaa agagtttaat tagcactatt atcgtagaaa 1740 acttagccag cccaataatt ttaggactcg attttattaa ggatttgacc ttttcctctg 1800 actctatttt tgtgcacctg aatgggaatc accttcgatt ggttgaccct tcagtttctg 1860 ttaaatcaat tcgcgctctc aataaattag aacttgaacc ttacgcaatt agttgtattg 1920 acccaaaaat acctcataat ctcattggca cctttttcgt tgatggccaa agcaatttga 1980 gaaatcctga ttttgaggtt attcctacag tttactctag cacgcatcca agtcaaattt 2040 ttgtgaaaaa caagactgga aaggtcataa ctatccgtaa atctgaacga atcgctatcg 2100 caaaccaaat cgagcttaac tgtaatgcga tagtagaagt tactaatgta gaacgagagg 2160 aggacgaggt tgaaaatttt caggcggaaa gagaagagcg ggcggaaaat ttaaaatttg 2220 aacctgttat taaaagttac gggtcactag agggcgacga ccttgataat atcaaacaat 2280 tggtcaggga acaaagactt tcgttccaga ttaatgacca agatttaggt aaatgcggtc 2340 attttagatt tacagtcccc ttgctcgatg aaagtgacac cgcgcaccaa ccccccaagg 2400 ccagtaccga taggcttaaa accccttgta gccagcgagt tacaaaaatg gaaagcgtta 2460 ggaatgataa gggaaacgca aagcggtttt aatattccgt tgcttatttt gcgtaagccc 2520 gacggctcaa ttcgtgtttc acttgatgcc cgtctaatta atacaaaact agtgcaagac 2580 cgttttccac tcccagcgat ccctgacgtt tttgcgaaag tctcggaacg aataaaatca 2640 agtgataatt gttttatttc gactgttgat ttcgcgagat catataatca gatccaaatc 2700 gcggagaatg actgccataa aatcgcattt tcccacgagg gccggcatct agaatcatca 2760 aggcttttat atggcctaag caccgcccca ggtggtttta gccgcataat ggcaaaacta 2820 tttggcgaaa atccaagctt tataagttac atggatgacc ttatcgttgt tgattccgat 2880 ctcgaggacc ataaaaagaa cttgcgcacg ctatttgaaa cgtgccgaaa atatggtctc 2940 gttttaaacg ggaagaaagt agttctttgc gctagcagcc ttgatttttt gggtcacacg 3000 attaataaaa agggaatttt tcccctttca aaacatatgt taaatatacg tatgttcgag 3060 cgaccaacat cacgttcaag ccttaaacgc tttttaggaa tgtgtaattt ccaactgaaa 3120 ttctttccaa agctaggctt aacactagac ccgctcaacc ggttactttc aaaggaattc 3180 catacttttt gctggactga tcaagcggaa caagcctttg tagctgtcaa agaaatgttg 3240 acagagtgta ccggtctagc acatttcgat cgtaatctga gtctatggct tgttaacgac 3300 gcaagcgggc atgggctagg ctcgactctt taccaattga ataagaacga tgagttcgaa 3360 ccaattggat atcactcgcg gccgttcaaa gggccagatc ttaaaagatc tattcgtgaa 3420 aaagagttga tggccatcgc ggtccggttg ccagcatttt tcctattacc tcatgggtag 3480 gaaatttaac gttgttactg atcacaagtc gctgctgtat ctttacagag aacatcttgg 3540 ctcagcgctt gatctcaaat taacaaacat ttttattctc ttgcaaaatt acgattttac 3600 tattgtgcac cgccccggaa cgtcgccgct tttagcgtcg gcagattatc tatctaggtt 3660 acccggaacg actttaaatc agatcgagca ggaatatcaa acgagcgatg tccccgaatt 3720 cattttcaat attgacattt ttcctgaaaa ggggaaggag atgcttaatg atcaaagtga 3780 gcataggcga ctttatttag aaagattaat gcaaagtctc aagccgaagc ctgaggaaaa 3840 taaagaaaag ctatttataa atttcgatac aaatcgtatt agcaaaaccg aacttgtaac 3900 gctacagtcg gagtgccctt cgctgcgaaa tatttttgtc aaacttgagc aaaaatcaaa 3960 agggacaata aagaaattta aaataaacga tgactcactt ttagttcgca ttgatcaaaa 4020 cacctcccgc cctgttcttt ctggtaaact agcacaggaa tttatatcat tcacgcactg 4080 cgagtttggt cacccaggca tacatcagac gatgcgcctg gtttcaaaat gctgctttat 4140 tgttaatctc aaaaagcacg tgaccgactt tcttgggcag atgtttgaca tgtttacgct 4200 ctaaacccat ggcagcacta aagggggccc tacaatctaa tcgagttttt acggatatac 4260 ctttccgcaa gacgagcatc gatttgtatg atctgggtaa accggatgct caaaacaagc 4320 gctatgtttt atcgatgaaa tgcgagctca catgttacta cgatggagtc acgctctcta 4380 acaaaacaga taagctcgtt tcaaacggct tattagagct tatattgcgt tacggtgtta 4440 ccggaaagat tttgagcgat aacggccgtg aattcggacc gttaacaaaa gcccttttta 4500 agaaatttgc gattgaacat gtactaacaa gtgcttataa ctcacgagga aatagcctag 4560 ttgagcgctc tcacagaaca ataacgcaaa aattgaaagt gcttggtgca actaggagaa 4620 attggagctc gcactttcca ctcgtgaagt tctatctaaa taatctgcct tcgaaagctc 4680 ttaataattt aagcccagcg gaatgccttt atggtagatc ccttttgatg ccactaattg 4740 atacccagta cgtgcaacct ttgagctctg agggacctta ttcacaagca atttctaatt 4800 atattttcta aattacaccc ctcattagcc gcttatcact ataagcgtta ttctaccgcg 4860 cttgaaggta acaacgcaaa aagtctaaac ctcaaagtcg gcgaccgcgc gctgatctgg 4920 aaacccgtct taacggatgg caaactctca aaagtgtggg atgggccata tgtcatcgta 4980 aagaaactgg gtcagtgctc attcgtgctc gccgatccat ccaagcgtca aaaattccgc 5040 cgtcatgcga gacaccttcg ccctataaag gaacgcgagc ccttcgaaga ctcagaacga 5100 aatctagaaa gtgaacaaaa tctcgaaaaa ggaaatctag attacacaga gttttcaaaa 5160 gagttcgacc caaccatcga atatccatac aaatggacag agataccaac accgcttggt 5220 ggtattccga cgccggaata ggcgttgagc aggaggagag cttatatttc atagatagtt 5280 cggtagaagt catattctac cggcactaca gctccccctg tctcctagac atcaacacga 5340 tcgcaaagga agctgtctgc agctccacct ccgctcaaaa agatgttgga gccaacattc 5400 tcaagattgc cgaagcctgc gaccgagagt ggaacagtga gttaaacata ctgtctcagt 5460 atgcttccca taattttcaa aattcggtat attattccca tccccaacct ctttcatcct 5520 cccgcgcccc acgtgcaatt agcttagtag ctgtttccgc tttcgtgtct atttttactt 5580 cccttattac tagcggcctc gggttatatt cagctttcgc cacccataac tctaacgcaa 5640 ttaaagagag tttaacagac tatataaaat ttcagcacgg agaaaagatg atggtcgccc 5700 tcggtcttga aaacaaccgt gaggctacca gcgtttttct cgagtacgca tgcgagaaag 5760 acctttacga tagtaatttc aaagttacca atgcggcgag taaggtgctt aagtcttaca 5820 ccgacgcagt ggtacaggaa agcttaagcc tttcgtttgg cacctggcca aggaacatgg 5880 aatttctaga atcactttta aaattatgca ccgagactgg cgcaaacacc aagcatttct 5940 gtttggacgc tatctatagt tcccaggtta atttccgctt tttagggagc gtaagcgaaa 6000 acggcacact ttcgcaccgc cttctgatta atatgcccat tcaggcgacg agcctcagac 6060 gtaactcgtt ttacaagctt ataaatttag gattctgggc atcggaaaat tttatgagat 6120 ttgatctccc cgcttccgtt ataaaatcgg acaatcttta ttttgaaata actaaagagt 6180 cctgccttaa gaatttttgt aatattgatt taatatcgat aggtaaacac tcctcgtgtc 6240 tatcctctct tttcgaggaa aacacaactg aggattgtaa cgttgtaatt gtgaaagagc 6300 cagagtcatg cgatatcaga cgcctgtcaa attccactat tattcaagct agagaagcca 6360 ccgtcctttt ggatagtgac gaaggtccct taactatacc tttgaacaag ggcaatttaa 6420 tggtatccct tcccggcaaa cttttatgct attcgcgaaa cgatagtgag accttccttt 6480 tgcctaagcc gatggtttct cactattcaa gcctattgcc gggtcaaatt gaagtatcgc 6540 atgctataaa tacttcaaag cttttagaac tagaagcgaa actctcagag gggcttaaga 6600 tcgaaaaaag acttcgctcc ctagagtttt ctgatgaccg attggaggtt aatggggctg 6660 aataccctac cccttttggg ttattattat cgccagcttc tcagttttag gggtacaatt 6720 aatgctctgt ttccttagga gtgtccaggt tatgccagcg cttaaatttt tggcaagatt 6780 caccaaagga ctgcgcaagg aaacaggcgc gcccgcccaa gcttctttag aggaaaacat 6840 tgagtccccg tgccttgtcg cggttgttga ccaaaatgtt agtggagaaa tatcggctga 6900 actctttcat aaggaaacaa gcgcctaatc ataaaaatcc aaatttgaat tcgttaattt 6960 tttaaaaaat aaaattttca gtaaaataat gttaccctgt gtctcgtcaa tgccctgtta 7020 gcccgctgta aaattaggca acaaccattt ctggctgcat ctagtggttt tcactcatcc 7080 ctatgggggg tagcgacttg cacaaagtgc gaagccgacc tagcaaccgc ccgtgtatag 7140 agagtcaacg tgaaaactgc gcactaattt tactttagga ttacaaaagc gaaattcaaa 7200 acacaagcca aattattttt gagaaatttt attttttaca aatttgattc agaaaatata 7260 attttttata tcttgagggg catccaagag ggaacgactt ttcattttct tttctttaag 7320 taactcacgt ttttttgccg ttttttcgtt atggatgtcg gaaacgacgg atcgacccca 7380 accgccaatt tgcattcttc ttttattcgt aatttttgtg ccgccttcat gattgctgta 7440 ttcacagagc cagttgtttc ctggtagctt gtctctgcca atgaaatctc taaaaaaata 7500 tgttaagaaa aaaaatatga aaaaaaaaaa aatgaaaaat ttcgaaaatt ttaagaattt 7560 ggataatttt gagaatttag gaaaaaaaat ttttttctga aaaatttcga aaattttgaa 7620 aattttgaaa aatttggaaa ttttgaaaaa tttgaaaatt tctgaaaaat ttcgaaaatc 7680 tagaaaaaaa tttacaaaat ttactgtatt ttagtaaaat ttactgtaat ttactatttt 7740 tactgtaatt tgcaaacgta ccttatcaat gctgcaccgc gcgaggtact tcacaaattc 7800 aggagtatct ctcttctctt tgtttcgctt tttccatatt tcaaagcgtt tgataatctg 7860 cttttctctt ttctgtatat ctttcatcgt tttcattcca aaaatacgca gcgcgcttag 7920 aaaaaatctc aaagttttag taattttgta gtttttcgga tcttcaactt catatgcaag 7980 aagttcttgc atgttttctg gtatttttac catttttgta atttttgata attttgataa 8040 ttgcacacca tatttatgca tattcaagac atctagcgac cgctagatga aatttgtgta 8100 tttactcaac actaaatgtc aattggccgc agccgataac aatccaaatg taccctcctc 8160 cgccaaattg acaacatctg tacatatcgt gcgcgcaatc ggcatatcca ccgctgaaat 8220 ctacttcgcg acgcatctgc acacctactg accgcgattc gactcgccag acacgctcga 8280 ttggcaatca acgcagtccc gattgcagaa aatgatgaac ttttaatttt aaaaactgta 8340 tatttcacac ctttcaagaa aggcggggt 8369 // ID BEL-73_CQ-LTR repbase; DNA; INV; 249 BP. XX AC AAWU01004923; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-73_CQ_; KW BEL-73_CQ-I; BEL-73_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-249 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 292-292 (2011). XX DR GenBank; AAWU01004923; Positions 14646 14894. XX SQ Sequence 249 BP; 74 A; 76 C; 50 G; 49 T; 0 other; tgttgcgcta cttgcgcatc cctagcacga gaaacagcag cggcgttttt ggcacaccct 60 gtacgaagcg cgcgcgcgac tccacacaca cactaacaca cgatctctgt aacgaaacac 120 acgacaaaca cgaataaacc aacagtctca agtagtaacc ttagcaaacg gacgcgtttt 180 tgcctttgct cccgaaatta ctgcttgacc gcgaaagcga tttatacagt ccactcatcc 240 ggaggaaca 249 // ID CR1_Ele6 repbase; DNA; INV; 5241 BP. XX AC . XX DT 19-OCT-2010 (Rel. 15.1, Created) DT 19-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele6. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5241 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5241 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (19-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 12 CC sequences with >98% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 266..1654 FT /product="CR1_Ele6_1p" FT /translation="MTSVICSACTQSIGSESDRVYCFGGCDQILHTRCSEL FT SSSAMAAARGNVALKYFCFGCRKQQTCINEINKKCTDLLNRVNVISDLVSK FT HDTAFENLDKQLCEKVEAMLLPRLLSAIHATKSSASDCNTCVVASNDSYAA FT VVSALPTVNVPNTANTNKQIAMNNSNCTPVISDLGGGLLRSGKRRNLDPSR FT KERATMSTESATPCTPRKSATVARIEQTVLLRPKRKQTSEITQKDVHRKLD FT PAEFLVKSAFFKEDGEAVIRCETSENALKFVNAATKMLSEKYDISVQKPLK FT PRIKIFGISEDFTAEEILTKLRKQNNLPESSDVKIVKLQKAKSRKSNPVDA FT VLEVDAKSFDYLIKLERVNIGWNRCRAMEMVNVLRCYNCSAYGHKASSCSN FT SVCCPRCSGDHNADECEAEYEKCANCERLNKDRKSKNDELLDTNHSAWGTD FT CPIYQRRYKLAKERIDYS" FT CDS 1664..5149 FT /product="CR1_Ele6_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDRSMKRQEQHNACSGNLADSICLAEVTVDTAPKSND FT GHRFQKPSTIRIYHDLPLDDQQGPSQRVKIIDQWNQQTVCPGNLAGSICLA FT EVTVDTAPHSNCGDRFQINSTRPYQCTTIIDEQERHHPVAHQVNNYDEQHS FT ARAGKSMLSICRAEAWQDTTHSSRDFHGFQDTNQQARTDEINNCETNTPDV FT SPAAAHASSSSPSDHSITFYYQNVRGLRTIIEDFYLTVMDAEYDIVVLTET FT WLNSEIQSPQLFGQLYTVYRQDRDSSRCGKSRGGGVLIAVNNRLASSRIDI FT ARIDTIEHLWIKIEVSASTTYVGVVYIPPDVASDSGTISTLVNSLETIVNA FT SGLHDSHMLFGDFNQPSLTWCLSSGHGLIVDHQLSTTTVTSAILLDGLSLL FT NMTQVNPFKNALNRTLDLLLVNDDIAEKCILYDPPEPLTRVDPHHPPFLVV FT YHCQSVTQFRDIVELAAYDFNRTDFEELNQAFQITDWTPLYAAENVNAALD FT YFVSTLTELFRLHVPLRRSPRKPVWSNGTLRRLKRQRAAALRKYSRHRNPF FT TRRSFIVASTKYKCYNRTLYERHARRTQTNLRQNPKRFWSFVNEKRKECGL FT PTSMFYENDSSDSLDGICNLFAKHFSSVFDDTTATTEQLDDAIRYVPENIL FT DFHTCDFSANDILSALRKIKSSTAVGPDGIPTIVLKKCSNALCEPLRYICN FT SSIRQSTFPNRWKDSVMFPVFKKGDKRDVANYRGITSLCAGSKLLETLVSQ FT DLLRASKAYISPDQHGFYPRRSISTNLVQFTSFCITNMGEGAQIDTVYTDL FT KAAFDRVNHQLLLRKIHRLGSPTDFVQWLKSYLVNRQLCVKLGNCQSVAFT FT NHSGVPQGSNLGPLLFSLYFNDVCLVIPAGCRLVYADDLKIFLIVKSADDC FT RQLQRLLDLFSQWCEANFMCISVGKCAVISFSRCRNLLIWPYTIRGNAIER FT VASFRDLGVILDSQLTFREHYSYIIAKANRSLGFIFRLSKEFTDPHCLKAL FT YCSLVRSTLETASTVWSPYHDVWIKRIESIQAKFIRYALRFLPWSNPDELP FT PYENRCRLLELDTLEKRRKMMIAVFAGKIITANIDAPFILSRINLNVVPRP FT LRTRNFLRLDFQRTDYAQHEPIRRMCEVFNCVYDLFDFNMSIDVFRNRLYS FT RNL" XX SQ Sequence 5241 BP; 1536 A; 1128 C; 1110 G; 1467 T; 0 other; aaaacatcgt tattgtcagt tgtgcgtgat aatgtttgca tagtttttcg tgattttaaa 60 gtgtttccgt gagttttata tcgcgtatgt tatagctgtg taaagcgtgg gaagctgttt 120 tccaatatct actgtcgaaa agatcttcgg ttgagcatac gacgaatata cggcatatct 180 ttttttgaag aattgagatc taccaccagc aacaaaacat cagtttggtg gtgcggggct 240 tcgcgagggg cttcatggct ctacaatgac gtcggtgatt tgtagtgcat gtacccagag 300 tataggctct gagagcgaca gagtatattg tttcggcggt tgtgatcaaa ttctccacac 360 acgatgctca gagcttagct cgtcggctat ggctgctgct agagggaatg tcgcactgaa 420 atatttctgc ttcggttgca gaaagcaaca aacatgtatc aacgagatca acaaaaagtg 480 tactgatttg ttgaatcgag tgaacgtaat aagtgatctt gtatcaaaac atgacactgc 540 tttcgaaaat ttggataaac agctctgcga gaaggttgaa gcaatgctac tgccgcgttt 600 gctctctgcg atccatgcaa caaaatcgtc tgcttctgat tgcaacactt gcgttgtagc 660 atcgaatgat tcatacgctg ccgttgtaag tgcattgcct acagtaaatg taccaaacac 720 cgctaacacc aacaagcaga tcgcaatgaa taattctaat tgcacaccgg ttatctctga 780 tctcggcgga ggcctgctta ggtctggaaa acgacgaaat cttgacccat caagaaaaga 840 gagagcaact atgagcactg agtcagcaac accgtgtacc cctcgaaagt ctgccactgt 900 cgccagaata gaacagacag tcctgcttag accgaagcgc aaacaaacat cagaaatcac 960 acagaaagat gttcatcgta agttagaccc agcagaattc ttagtcaaat cagcgttttt 1020 taaagaagat ggcgaagctg tgattagatg tgaaaccagc gaaaatgctc tcaagtttgt 1080 taatgctgct accaagatgc tgtccgaaaa atacgatata tccgtacaga aacctctcaa 1140 gcctcgcatc aaaatttttg gaatatctga agattttact gctgaagaaa tactcaccaa 1200 gcttcgtaaa cagaacaacc ttcccgaatc ctccgatgtt aagatagtta aactgcagaa 1260 agccaaatcc agaaaatcaa atcctgtcga tgccgttttg gaggttgatg cgaaatcgtt 1320 tgattacctg attaaattgg agcgggtcaa cattggatgg aaccgctgtc gagccatgga 1380 gatggttaat gttcttcggt gttataattg ttcagcttac ggccacaagg cgtcttcatg 1440 cagtaatagt gtatgctgtc ctagatgttc aggtgaccat aatgctgatg aatgtgaagc 1500 agaatacgaa aaatgtgcta actgtgagcg gttaaacaag gatcgaaaat caaaaaatga 1560 tgaactcctc gatactaacc attcggcatg gggaactgac tgtccaatct accaaaggcg 1620 ctataagttg gccaaagaac gaatcgatta ttcctagcaa ttaatggata gatcaatgaa 1680 acgacaggag caacataatg cttgttcagg taatttagct gacagtatat gtcttgccga 1740 agttaccgta gacactgcac ccaaatcaaa cgatggtcac cgttttcaga agccttcgac 1800 aatacggatc tatcacgatc taccattaga cgaccagcaa gggccatccc aacgtgtaaa 1860 aatcatcgac caatggaatc agcaaactgt ttgtccaggt aatttagccg gcagtatatg 1920 tcttgccgaa gttaccgtgg atactgcacc ccattcaaat tgtggcgacc ggtttcagat 1980 taactcaacg cggccctacc aatgtacgac cataattgac gagcaagagc gacatcatcc 2040 cgtcgcccac caagtgaata actacgatga gcaacactct gctcgtgcag gtaaatctat 2100 gcttagtata tgtcgcgccg aagcgtggca agatactaca cacagctcaa gagacttcca 2160 tggctttcag gatacaaatc aacaagcaag aaccgacgaa atcaacaact gtgaaaccaa 2220 cacacctgac gtttcacctg ctgccgctca tgcatcgtct tcatcgccat ctgatcattc 2280 gataactttc tattatcaga acgtcagagg gttgcgtacg atcatcgaag acttttacct 2340 gaccgtaatg gacgccgagt atgatattgt cgtactgaca gaaacatggt tgaactcgga 2400 aattcaatcc ccacaattat ttggacagct gtatactgtc taccgacagg atcgcgattc 2460 atcgcgctgt ggaaagtcta gaggtggtgg cgttcttatt gccgtcaaca acaggcttgc 2520 ttcttcacgt atcgatatcg ctcgcattga tacaattgag catctatgga tcaaaattga 2580 agtgtctgct agtacaactt atgtcggagt cgtctatata ccaccagatg ttgcatctga 2640 ctcaggtact attagcactt tggtcaactc tctcgaaaca attgttaatg catctgggct 2700 acatgattca catatgctgt ttggtgactt caatcaaccg agcttgacat ggtgcctttc 2760 ctccggacac ggtttgattg tggatcatca gctttcgact accaccgtga ctagcgcaat 2820 cctgctagat ggactttcat tgctgaatat gacgcaagtc aaccccttta aaaatgctct 2880 caacagaact ctggatcttt tgctcgtaaa cgatgatatc gcggaaaagt gtatcctata 2940 tgatccgcct gaaccattga ctcgtgtgga tccgcaccac ccgccgtttt tagtcgtcta 3000 tcattgtcaa tctgttacac aattcagaga tatcgttgag cttgctgcgt acgattttaa 3060 ccgaaccgat tttgaagagc ttaatcaagc atttcagata actgattgga ctcctttgta 3120 tgcggcagaa aacgttaacg ctgcgttgga ttactttgtc agtacactga ctgaactgtt 3180 tagactccat gttccactac gccgatcacc tcgtaagccg gtgtggtcaa atggaactct 3240 tagaagattg aaacgtcaga gggcggcagc tttacgaaag tattcaagac accggaaccc 3300 atttacgaga agatcattta ttgttgcgag caccaagtat aaatgttaca accgcacatt 3360 atacgaaaga cacgctagac gtacgcaaac gaaccttagg cagaacccaa aacgattttg 3420 gtcattcgtg aatgagaaac gtaaggaatg tggattgcca acctccatgt tttacgaaaa 3480 cgattcttca gactctcttg atggtatctg caatttattt gctaaacact tttcaagtgt 3540 ttttgacgac accactgcta ctactgaaca acttgatgac gcaatcagat atgtacctga 3600 aaatatactt gatttccaca cttgcgattt ttcagctaac gatattttat cagcattgcg 3660 caaaataaag tcctcaactg cggtcggtcc agacggaatt ccgacgattg tgctgaaaaa 3720 atgttcaaat gcactatgcg aaccactacg ctacatttgc aacagctcta tccggcaatc 3780 tacgtttccc aaccggtgga aggactctgt gatgtttcct gtgtttaaaa agggcgataa 3840 gagggacgtg gcgaactata gaggaataac ttcactctgt gctggatcta agttgttgga 3900 aactctagta agccaagacc ttctgcgcgc atcaaaagct tatatttcgc cggaccagca 3960 tggtttctat cctagaagat ctatttcaac gaatctagta caatttactt ctttctgtat 4020 caccaatatg ggagaaggtg ctcaaattga tacggtttat acggatctga aagctgcatt 4080 tgatcgagtc aaccatcagc tactgctccg aaaaatacat cgactgggat caccgacgga 4140 ttttgtgcaa tggttgaaat cgtacctcgt gaatcgacag ttgtgtgtga agctgggcaa 4200 ttgtcagtca gttgcgttca ctaaccattc aggcgttcct caaggaagca accttggacc 4260 gttactcttc tcattgtatt tcaatgatgt gtgccttgtt atacctgctg gatgccgatt 4320 agtatatgcg gatgatttga aaatatttct tatcgttaaa tctgcagacg actgtaggca 4380 actacagaga ctcttagatt tgttttccca gtggtgcgag gccaacttta tgtgcataag 4440 cgttggaaaa tgtgctgtga tatcattctc acgatgtagg aatcttttga tatggcctta 4500 cacaatcaga ggaaatgcga ttgaaagagt tgcaagcttt agagacctcg gggtgattct 4560 tgatagtcag cttacattca gagaacacta ctcgtacatt attgctaaag ccaacaggag 4620 cttaggattt atctttcgtc tgtctaagga gttcaccgat ccacattgtt taaaggctct 4680 ctactgttct ttggtacgtt caacacttga aaccgcttca actgtgtgga gtccgtacca 4740 cgacgtctgg ataaagcgaa tcgagtcaat acaggctaaa tttatccggt acgcattgcg 4800 gtttcttcca tggagcaacc ctgatgagct tcctccatat gagaaccgat gccgcttgtt 4860 ggaactggac acacttgaaa aacggagaaa gatgatgata gccgttttcg ctggaaaaat 4920 cattaccgcg aacattgatg ctccttttat tctatccaga atcaatttga atgtagtacc 4980 cagaccccta cgtactcgaa atttccttag gctggatttc caacgaactg actacgctca 5040 gcatgagcca attcgaagaa tgtgcgaagt ttttaactgt gtgtatgacc tgtttgattt 5100 taatatgagt attgatgtct tccgtaatcg tttatattct agaaatttgt gatgtttaat 5160 gttaaaatta agaatattca tgtagacatt tttgttcgat gaattatgaa ataaataaat 5220 aaataaataa ataaataaat a 5241 // ID Gypsy-11_SI-I repbase; DNA; INV; 8216 BP. XX AC AEAQ01022239; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_SI_; KW Gypsy-11_SI-LTR; Gypsy-11_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-8216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022239; Positions 8472 257. XX CC Positions [4893-5372] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 370..1800 FT /product="Gypsy-11_SI-I_1p" FT /translation="METRGGAKPPVKKVLSEEELDAIRMAQVHEKRLREES FT AALDKERDRLDREKREFERNKNAESTIAADMRGLLSALKNEMMHEINMLRQ FT DVGNRRSPTRSPTPPAGQRMFSNTPSPTRHQGSPDRSGAPKISFREILEGV FT PRFDGYNVPLTQFARACRRAREIAPANHEKHLTRLLVNKLSDRACAAVEDE FT PCESVTQLIDLLNGAFGSPNSIAQYRGELSRAYMKPNEHILDFISRIKELR FT STILDAERRTRGVLDAAIINDVDELTARSFCDGLPLRYRMLMRREHENSPF FT EAFATTKAIAKRDELERERYDPNYRPDRRGGNRPFSGNGLTRDYRDSRPST FT ERYREPTNYSATRDIREPRQDYRPLADRYRRPIEYRNSRESREHYDNRRVV FT ADTMNQRTESAPRDANSIWCRYCKYPGHEIQNCRKRQYNENKQGNAPGPSS FT RPDAARGGPPPNRPVRCVEIDLEEKNELVSSE" FT CDS 1752..5372 FT /product="Gypsy-11_SI-I_3p" FT /translation="MCGNRSRGEERIGLLRIKNFNHAPTVRITSDIFQPDV FT TLMVDTGAALNLIKINKILPNVQIDSNSIIFLTGITEGRVETLGCVSANVL FT GREIKFHVIPSEFPVSCDGILGADFLRGAGKINFVEQTLEWHNASFPFLNR FT EMTKFPARSSVVMCVNVTNAAVATGYVPRLTIDENIYLGEAIVSNREGKAY FT LRAFNTSDKDIILPTPTVELQNFDLPGGSYPEQGTLVPALGKGNDSGNILA FT IAGAGGSRKDEVKSLLRLEHLNKEELKHVDNIVEKHCDIFQLPNDKFECTN FT VMKHKIKTTDEQPVHTKQYRFPSIHKEEIDKQVKTLLENDVIKPSVSPYNS FT PLWIVPKKPDSKGNKRWRMVIDFRMLNEKTIGDAYPLPNITEILDQLGSAK FT YFSIFDLASGFHQILIDEADAHKTAFSTPYGHFEFNRIPFGLKNAPATFQR FT LMDLVLVGLQGTELFVYLDNIVIYSSSLREHAQKFEKLAARLRQANLRLQP FT DKCEFLRKEVTYLGHVIGENGVKPDPEKIRAVKEFPKPTNPKQIKQFLGLA FT GYYRRFIQNFSKIAHPLTNLLKQDSAFLWESPQENAFETLKESLCNYPVLQ FT YPDFSKPFLVTTDASNVAIGGILSQGPLGKDLPVSFAGRRLNQAEQNYSTI FT ERELLAIVYCVNYFRPYLYGQKFQLITDHKPLVWLHSVKDPTSRLVRWRLK FT LAEYEYQVIYKAGKNNTNADALSRNPIPYKAFPISSDNFEESFFPSPSNNV FT NETCYETNQMRDMTTKSPEQTETIKTAPVKVQVEAINDGMADTHTRATGIN FT ETEYETDLADSDTNATFSETETDEPIVDPVDEEYQINRTRINETRDHFKDK FT KDNLAIFVDETGRPCDTGSRHLAEASKLPEIRETNVGRAKILRQGKYYLIV FT LAINTQGSKVTQRETIKMALVSLYDVVRELELRSVSISKGTVAAVPWDIIQ FT GYIARFFYDTTVLISICSNQIITPEEQDRKKIMYKNHNTAIGRHKGVSKTY FT DRIRYNYYWTNMKTDIQNFVKNCKDCQLKKLVRLKTRQPMTLTDTPGMAFE FT KVSLDIMGPLPTSLNGHTYILTIQDLLTKYSLAIPLQQATSTHIADAFVNN FT FICIFGAPKGMLTDRGTNFLSNLMRLIARRFKIKQYSTTAYRPQSNGSVER FT SHQVLWDYLKQFVRGNDWDEYLSLATFSYNTSVHEATRYTPHELVFGR" FT CDS 5843..7834 FT /product="Gypsy-11_SI-I_2p" FT /translation="MYASSIFTLLMIPVCVRGLIGYDCHGEGLNITTLSLL FT DIGSCNLDDVEPINDEVYVQLMQLSDYNPVSTKQCRVEIDRIIYYCGMHSH FT VSIVGNGRRQYIRELGADACRRLHETGVLAISTTTIDKLVINSTNLRSLTL FT AGRVATDGRCHGAQYTDSYGTWDNVVVQATAKISLRTFEANIKQSTGEIIL FT PSGTRCAVSNRVCFDADGSETYWEMTPSDSCHFDRYDILYEGMANRLSPAK FT GQVTPVVYTVTSKDTTFALTKSAEVNVCGYKLFKTEHPKLLIFETQRGQTF FT KSRSRVSVNDLDMFTYVNSKFVYVEKHIKLQLTRLYKDLMEQKCALERQVL FT QNVLSLSSIAPDEMAFRMMKMPGYTAVISGEVIHLIKCVPAQCKIRHTESC FT HNELPVTYRNESYFLLPRSRILTKKGTPRDCHGLLPTMYRIQDTWFRITTR FT PEETIAPPTIQPLTQPSWKYVSPDSLAVSGIYTAEDLDRLRDHIMFPVEKP FT STLNTIARGARGQDIPSGSISILNLLDEKSLDRIGENAGKRIWNGFMSFGS FT FSAGVLAIFVIIRIIKLIIDTLIHGYALHSIYGWSIHLLGAIWSSVTNLLL FT HMGNEKRVPTPVVNREYILIRPEPKDGKIIFSQEAPSTEQAEGQSGNPSTE FT RSYAQLRKYLESESL" XX SQ Sequence 8216 BP; 2513 A; 1936 C; 1867 G; 1900 T; 0 other; taaatatcgg cgctactcat aatccggaac ggtgcatact ttgtcgcatt ccccttacca 60 ccactcaagt gattcgcgat tgtggaacat gccccacgga tctagggtat tttctggagt 120 atctcgatca tgaaaactcg gccgcctacg aggatcccga gccgacaatc ctcgcaatat 180 ccgaagaggg cgtttaaagg agaaaatttt tagcttcgaa taaacttcat ccctccccct 240 tgctcctttc tgtacaatca atcgagcagt gtgaacgtgc cgtaaatctt tgcaccgcgc 300 cgtaacataa aactggtgac agcagtggga tacgcattat tgttgtgcag atattaaaat 360 tcattaaata tggaaacgcg cggcggtgct aaaccccctg taaagaaagt gctctccgaa 420 gaagagctcg atgccatacg gatggcacag gtacacgaaa aacgtctccg tgaagaatct 480 gctgctcttg ataaggagag agaccggcta gaccgcgaga aacgagagtt cgaacgcaat 540 aaaaatgccg agagtactat cgccgcggac atgcgcggat tattatctgc tttaaagaac 600 gagatgatgc atgaaattaa tatgttacgg caagatgtgg gaaatcgccg cagccccacg 660 cggagcccta ccccacccgc cgggcagcgc atgtttagta ataccccctc cccaacgcgc 720 catcagggtt cccctgaccg ctcgggcgca cccaaaatct ccttccgtga aatcctagag 780 ggtgtccctc gcttcgacgg atacaatgtt cccctaacac aatttgctcg cgcttgtcga 840 cgagcccggg aaatcgcacc cgccaaccac gaaaaacatt tgacgcgact tttagtgaat 900 aaattatccg accgggcatg cgccgccgtc gaggacgagc cctgcgagtc ggttactcag 960 ctcattgatt tgttgaacgg tgccttcggt tcaccaaata gtattgctca atatcgcggg 1020 gaactaagca gagcatacat gaaacccaat gagcacatcc tagattttat atcgcgaatc 1080 aaggagctcc gctcaacaat tttagatgcc gagcgtcgca cgcgaggcgt attagatgca 1140 gcaataatta acgacgttga tgaattgacg gctagatcat tctgcgacgg tctcccgcta 1200 cgataccgta tgctcatgcg gcgcgaacac gagaattcgc ctttcgaagc attcgctacc 1260 accaaagcga tagccaagcg cgatgagctg gagagagaga gatacgaccc taactatcgg 1320 ccggatagga gggggggaaa tcgacccttc tcaggtaatg gccttacgcg agattataga 1380 gactctcggc ctagtaccga gcgttatcgg gagccaacta attattcagc cacacgcgac 1440 atacgcgagc cacgtcagga ttaccgccct ctcgcagata ggtaccgcag gccgatcgag 1500 tatcgaaact cacgcgaatc gcgcgaacac tatgataatc ggcgggtcgt ggcggataca 1560 atgaatcaac gcacagaaag tgcaccccga gatgctaact cgatatggtg tcggtattgc 1620 aagtacccgg ggcacgaaat ccagaattgc cgtaaaaggc aatataacga aaataaacag 1680 ggaaacgcac ccggcccctc gagcagaccg gatgcggctc gaggcgggcc cccgccgaat 1740 cgtccggtga gatgtgtgga aatagatctc gaggagaaga acgaattggt ctcctccgaa 1800 taaaaaattt taatcacgcg cccactgtcc gcataaccag cgatatattc cagccagatg 1860 tcaccttaat ggtcgacacg ggggctgcgc ttaacttgat taaaattaat aaaatactgc 1920 ccaatgtcca aatcgattct aattctatta tctttctcac gggcatcact gagggacgcg 1980 ttgagacctt agggtgcgtc agcgcgaatg tgcttggacg agaaatcaaa tttcatgtca 2040 taccttcgga gttcccagtc tcgtgcgacg gcatcttagg agccgatttc ttacgaggcg 2100 cggggaaaat aaattttgtt gagcaaacat tagagtggca taatgcctca ttcccgtttc 2160 taaatcgaga gatgaccaaa ttccctgcac gctcaagtgt agtaatgtgt gtaaatgtaa 2220 caaatgccgc cgtagcaact ggctacgtcc cgcgactaac aatcgatgaa aatatctacc 2280 tcggtgaagc gatagtctca aaccgtgaag gcaaagccta ccttcgcgcc tttaatacat 2340 cggataaaga tattattttg ccaactccta cagtagaact ccagaatttt gacctgcccg 2400 gaggatctta tccggaacaa gggaccctcg tacccgcctt agggaagggt aacgactcag 2460 gcaatatttt agcgattgcc ggagccggag gtagtcgaaa agacgaggta aaaagtttat 2520 tgcgcctgga gcatttgaat aaagaagagt tgaagcacgt tgataatata gtcgagaaac 2580 attgcgatat atttcaatta ccaaatgaca aatttgaatg tactaacgta atgaaacata 2640 aaattaaaac tacagacgag caacctgtac ataccaagca atatcggttc ccctcgatcc 2700 acaaagaaga aatcgataaa caggttaaaa ccttgttaga gaatgatgta ataaaaccat 2760 ctgtttcacc gtacaattct cctttgtgga tcgtgccaaa gaagcccgac tcgaagggta 2820 ataaacgctg gcggatggtc atcgatttca gaatgcttaa tgaaaaaacg attggcgatg 2880 cgtacccttt accaaatata acagaaatcc ttgaccaact cggcagtgca aagtatttca 2940 gtatatttga tctggcgtca ggtttccacc agatattaat agacgaagcc gacgcgcata 3000 aaactgcatt ctccacgcca tatggccatt ttgaattcaa cagaataccc ttcgggttga 3060 aaaatgctcc ggccacattc cagagactaa tggacttggt attggtcgga cttcaaggca 3120 ccgaattatt tgtgtatcta gacaacatcg tcatatattc aagctcgttg cgggaacacg 3180 cgcaaaaatt cgagaaacta gctgcgcgcc ttagacaagc gaatctgcgt ttacagccgg 3240 ataaatgcga atttctccgt aaagaggtaa cgtacctggg tcacgtgata ggagaaaacg 3300 gagttaagcc agatcccgag aaaatccgcg ccgttaagga attcccgaaa cccacgaatc 3360 ccaagcaaat aaaacaattc ctagggctgg ctggttatta ccgccggttc atccaaaact 3420 tctcaaaaat cgcgcacccg ctaacaaatc tcttaaaaca agactcggcg ttcttatggg 3480 aatcaccaca agaaaatgca ttcgaaacat tgaaagaatc attatgtaat tacccggtgt 3540 tacaataccc ggacttctct aagccgttcc ttgtaacaac tgacgcgtct aatgtagcga 3600 taggcggtat attaagtcag ggccccctgg gaaaagactt gccggtgtcg tttgcaggta 3660 ggcgattaaa ccaagccgaa caaaattatt cgacaattga gcgcgaacta ctagccatag 3720 tatattgcgt aaactacttc cgaccgtact tgtatgggca aaaattccag cttatcaccg 3780 accacaagcc gctcgtgtgg ctgcactctg tgaaggatcc tacttccaga cttgtccgat 3840 ggcggttgaa acttgccgaa tacgaatacc aagtaatata caaagcagga aaaaataaca 3900 cgaatgccga cgcactctcc cgaaacccga ttccctataa ggcctttcct atctcttcag 3960 ataatttcga agaatccttt tttccgagcc ctagcaacaa tgtaaatgaa acatgttacg 4020 agacaaacca gatgcgagac atgacgacca agagtcccga acagactgaa accataaaaa 4080 cggcgccggt caaagtacaa gtggaagcca taaatgacgg aatggccgac acacacacac 4140 gcgcgacagg tataaatgag acagaatacg aaactgacct tgcagacagc gatacgaacg 4200 cgacgttctc cgagacggaa accgacgaac ctatcgtcga ccctgtcgac gaagaatatc 4260 aaataaaccg cactaggatt aacgaaaccc gtgaccattt taaggataag aaagataacc 4320 tagcgatatt tgtagacgag acaggccgac cctgcgatac tggatcgaga cacctggctg 4380 aagctagcaa attacctgaa atcagggaaa ctaatgttgg aagggcaaaa attttacgac 4440 aaggaaagta ctatttgata gttctggcaa tcaataccca aggttccaaa gtaactcaac 4500 gggaaacaat aaaaatggcc ctcgtatccc tttatgatgt tgttcgagaa cttgagcttc 4560 gatcggtttc catttccaaa ggaactgtgg cagctgtgcc gtgggacata attcaaggat 4620 atattgcccg ttttttctac gataccacgg tacttatatc tatctgttcc aaccaaatca 4680 ttacgccgga agagcaagac cgaaagaaaa taatgtacaa aaatcataac actgcgatag 4740 ggagacacaa gggtgtctca aagacgtacg acagaatacg gtacaactat tattggacta 4800 acatgaaaac tgacatacaa aactttgtaa aaaattgtaa agactgtcaa ttgaaaaaac 4860 tggtgagact aaaaactaga caaccgatga cgctgacgga caccccaggt atggcatttg 4920 aaaaggtttc cttggatata atgggtcctc tccccacttc ccttaatggt cacacctata 4980 tccttacaat acaggatttg ctgaccaaat attcactggc aataccactg caacaagcca 5040 cttccacgca tatcgccgat gcattcgtaa ataacttcat ttgtattttt ggtgctccga 5100 aggggatgct caccgatcgg ggaaccaact ttttgagtaa tctgatgcga ttaatagcgc 5160 gtagatttaa aattaaacaa tatagcacaa cggcatatcg cccgcaatcg aatgggtccg 5220 tagagcgctc acatcaagtg ttgtgggatt acttaaaaca atttgtaaga ggaaacgatt 5280 gggatgaata tttaagccta gccacatttt cctataatac aagcgtgcac gaagccaccc 5340 gttatactcc gcacgagctg gtctttggta gatagcggaa attccggcgg cggacccgca 5400 ggtagaaggt gtacggaacg aaacctatga aagctatctc actaaccttt tttgtaagat 5460 ccgtgaaact caggaaatag cacgcaagaa tttaatcaga gctaagttac gatctaaaca 5520 atattacgat aagaaaataa gacctgtaac ttttagcgta ggcgatatcg tttatatact 5580 taaagaacca ataagaaaca aactcgacaa ccagtatata ggaccatacg agataagcga 5640 gattattgat aagcataatg ttaagattaa gttaagcagc ggtagatata agatagtaca 5700 cagcgataag ttgaagatag ctcacgtgga gcccccttcc gaagcagcca ccccaccatc 5760 cagcagcgac gacgtggaaa gggaagcagt gggcccgact cctcgtcagt caagggatgc 5820 ctagcagcac aaaaaggaaa ccatgtacgc gagctccatt ttcacgttgc tgatgatacc 5880 tgtgtgtgtg cgtgggctga tagggtatga ctgtcatgga gaggggttaa acataacaac 5940 cctttctttg ttggacatcg gatcctgtaa ccttgacgac gtagaaccta ttaacgacga 6000 agtctacgtc caactaatgc aattatcgga ctacaatcct gtatctacaa agcagtgtcg 6060 cgttgaaata gatcggatta tttattattg tgggatgcac tcccacgtat ctattgttgg 6120 caacggacgg cggcaataca tccgagagtt gggcgctgac gcatgccgcc gactccatga 6180 gacgggggtt ttagctattt caacgaccac aatcgacaaa ctggtgataa actcaacgaa 6240 tttgcggagt cttactctgg ccgggagagt tgcaaccgat ggaagatgcc acggagccca 6300 atacactgat agctatggta cgtgggacaa tgtcgtcgtt caagccaccg ctaaaatctc 6360 ccttcgaaca ttcgaggcaa acatcaagca atcgactggc gaaattattt taccatctgg 6420 aacacgctgc gcggtgagca atcgcgtatg tttcgatgcc gatggatcgg agacatattg 6480 ggaaatgacc ccctctgaca gctgccattt tgatcgctac gacatactat acgaaggcat 6540 ggcaaacaga ctttccccag cgaaggggca ggtaacgccc gtcgtatata cagtgacttc 6600 aaaggacacg acgttcgcat tgacaaagag tgccgaagta aacgtgtgtg gatataagtt 6660 gtttaaaacc gaacatccaa aactgttaat tttcgaaaca cagcgcgggc agacatttaa 6720 atcgcgatcg cgggtgtcgg tgaatgattt ggacatgttc acgtatgtaa attcgaaatt 6780 cgtgtacgtc gagaaacata taaagttaca actcacgcgt ctgtataagg acttaatgga 6840 acaaaagtgc gcgctggagc gtcaagtgtt acagaatgtt ctgtctctct caagcatcgc 6900 cccagacgaa atggcattcc gcatgatgaa gatgccaggg tatactgctg taatctccgg 6960 cgaggtaatt cacctgataa agtgcgtgcc agcacaatgc aagattcgtc atacggagag 7020 ctgtcataac gaacttcccg tgacgtaccg gaacgaatcc tattttttgc tgccgaggtc 7080 acgaattctt acgaagaaag gaacaccgcg agactgtcac ggattactcc cgacaatgta 7140 taggattcag gatacgtggt tccgtattac cactaggccg gaggagacaa ttgcaccgcc 7200 aactatacag ccgttgacgc agccgtcgtg gaagtacgtc agcccggatt ccttggcagt 7260 cagcggaata tacaccgcgg aagatcttga ccgtcttagg gatcatataa tgttcccagt 7320 ggaaaaacca tcgacgctga acacaatagc ccgaggagcc aggggccagg acataccatc 7380 aggtagcatt tcaatcctaa atttattaga cgaaaagtcc ttggaccgta ttggggaaaa 7440 tgcgggtaaa cgaatctgga acggattcat gagttttggc tcattcagcg cgggggtgct 7500 ggcaatattt gtcatcataa ggataataaa actaattatt gacacactta tacacgggta 7560 cgcactacac tccatttacg gatggagcat acacctattg ggtgccatat ggagctcggt 7620 aacaaacctt ttgctgcaca tgggaaatga gaaacgagta cccacaccgg ttgtcaacag 7680 ggaatacatc ctaattcgcc cggaacccaa ggacgggaaa ataatattta gtcaagaagc 7740 gccttccaca gaacaagcgg aggggcagag cggaaatcct tcaaccgagc gatcatacgc 7800 ccaactgcgt aaatatttag aaagcgaaag cctgtaacaa cacttttctt tttttttctt 7860 ctttttttta tttttacaat aagcgtagct gtaagtgata gacataaaac tagtaagcat 7920 gccagattaa ttataaaaaa tgttggacat ggtccaacat tttcatcgat ggggggaggt 7980 gttacatccg ccgtttcccg gtgtaaattt ttgcaaggat gataccgagt aaaattcaga 8040 acataaaact tttgattgag actataacca tggaaacggg ataggaagcg tcccctccct 8100 cgatattagc gccgcagtcc attcacttgg ggcgaccacg aggtatccac cccactctgt 8160 acccaaagta agtgccatat tctcattaag cgtattacgg cgcataaatt tccttt 8216 // ID Gypsy-5_AC-LTR repbase; DNA; INV; 214 BP. XX AC AASC02053826; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_AC_; KW Gypsy-5_AC-I; Gypsy-5_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02053826; Positions 14427 14214. XX SQ Sequence 214 BP; 56 A; 35 C; 73 G; 50 T; 0 other; tgtggttagc tgtgtgattc tcgggaaagt agtcgagtac ggttgacatt gaagaaacca 60 aggacaggcc gaggcgtgag ccacggactt ttgaagactg cgtgagtagc ggtcgaggcg 120 cagcgcagag agttgcgacg agttagtgca agttggatcg ctacatgagg tgggtgtgtg 180 tagtaaaacc gtatgcatgt atagttaaac taca 214 // ID DNA-TA-8_CQ repbase; DNA; INV; 2102 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2102 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 58-58 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >88% CC identity. 11 bp TIRs. TA TSDs. XX SQ Sequence 2102 BP; 775 A; 269 C; 360 G; 696 T; 2 other; cacggagaga ctgtgcccag aaattttaac aattaaaatt gttgatttta ctttcgtaaa 60 ttttaacaaa aacgtacttg ttactgccaa tattccgtta aaataactaa aattcagtat 120 ttatttaata acctgtttgt tgaaaatgct agaggcaaaa agctaacaga tttcccgaac 180 tcgaatgaac gtgctcgact gttagctttt gttttaggaa aatcaaaaat tttgttggtt 240 gaatataatt tacatttggt tgaatattta gaagatgaac ttgggtttgt ttacgtctgt 300 catttgaagt caggattcga acgagagcgg tcggtttgtt gagtttgtgt tgttaattgt 360 gaagtaagag gatgtgaaag gtgaaaatca cactatttgg ctaagttgaa gtggcaaatc 420 gaagtgcaca ctgctgcaac tgtgaaggtg gaattggtga gtaagtttgt tgatgatttc 480 tttgcaggtg ataaactatg aagagcaaga cgaggacatt cgcaatgggg acctctgatg 540 cgaggggact ttatgataat gtttggagga aagagagggg cagtggaagg aagaggcggg 600 ccaactcttg cacgacaata cttgcacggg caaatttaac aaaatgttgg ttaatctaag 660 taagtatggg tttaatttta cacaacaatt tatctattgt gtttttaatt agaaaactga 720 atcaccagga accttcgtgc ttacgatgga tatttcagaa tcatcaacca taaaatacat 780 ggaaatgcgt acacattata tcaaaataaa aattcgatgg tgggaaatgt ctaaaaaaat 840 aaaatatata aaagatatam aaaagaagat attaatattt taaaaatgct gttgaaaaag 900 actgctcaat aaactggtaa gtaaaactca aaatattgtt ttcaggaatc ggagtacgtt 960 gaataaattg aatcaaatgc aatgaaagta tgttgccaaa agtagttgtt aaattagtgc 1020 aaactkggta agtagcatta ttgatatttc ttgtttataa ttacatttat ttacgttcct 1080 tttatcagtt tagagccgga attaattttt ttattagatt aataataaat ttcatatgtc 1140 tatttgcagc aaaaggttcg ccttggtgta aattctgtgt ccaacctcaa gaaaaataac 1200 taaacatgag accgttgtag atgatgacga cttatcggat gatttctcag tgatggtaag 1260 tgaatacaac acttaattgt atacaacata ataaaaatat aagcttaaaa cttggtgaaa 1320 tttttgctcg gtttctactt ttaaggaatt tacgagaaat taaacatatt ttttgaacat 1380 gtttaacaaa atttgtaaaa tgcttttttt attataattt ttttaaaaca ttaatcaaac 1440 ataacatgtt ttataataac aattaaaaaa atacaaataa gtacaagtac aacaagttaa 1500 tttgtgaggc attattacaa aatttactgc aaattaaata aaaatattgc taactacagc 1560 taattactga ctacatgtac gaatattttt ctgtattcaa gtaagacatt agttaaaaat 1620 atagctagaa attcggccca gcctgtatcg ttaaataatt aaagaaaata tatatagaca 1680 ctaacataaa ttatgtttaa ttaagaaatg ttttgttata aaccactaat aatttgctgc 1740 atgcaaaaat atttcagttg atacaacgaa aaaaaaagtt aaaaaatagt tgaaaaatat 1800 tgtccagcct gttttttaca gaatatttaa caattattca gctgaaaaca aaaaaaaata 1860 gttaattttc tgaaaaaaat attctagcag tcgactgcta gaattttttt tcggtcattt 1920 aaccaacaaa aatggcaatt ctaactattt ttttagttac ttttacctac tggtggtcag 1980 caatttcggg ttaacaaaat caacaatcga ttgttgaaaa aaagctgtgt aaaaaattag 2040 cccatacttt tagttaaatt aaccaagaat ttcttagatt tgccctggcc ggtctctccg 2100 tg 2102 // ID hAT-15_HM repbase; DNA; INV; 3902 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3902 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2004-2004 (2008). XX DR [1] (Consensus) XX CC The youngest copies are >99% identical to consensus. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 482..3205 FT /product="hAT-15_HM_1p" FT /translation="MSEDLKRKSKGGAEKQREKKQKLLLQGGEKCKKITEL FT FRLSTPTQFGGDTLVDDLGSQISDGDQAGPAVSASVSCTVLTPSQKSVSSD FT SRPQNQCSEIFMRPQSQCLESFFKSHPVQPKEDGENLPFSNVRRVFYRSQD FT DINRVWLSYCHSSKAMFCTVCLAYSKASESNAFTDGMNDWKHVHQRIEEHE FT SSKHHCRSVEAHIMSSNDKDVYSLLFSNQKNVRAAQVKEKRQVLERVIEIV FT KLIGKRGLSYRSINDAAYCLDNVNIDHGNFLEILILLSKFDPLMKNHLDIV FT TEKSKQRHVTCAASGTKGRGGLVTFISSTTVNYVIESVRRMIKASIADEIQ FT SAGMFSVQLDTTQDVSVKDQCSIVIRYVKDHIYERLVSVSNCTNSTGKGMF FT ELFRDEIVKMGINIENCVGNSTDGAANMQGQYSGFTKWLSEASPNQVHVWC FT YAHVLNLVLVDTTQTTKAAISIFQLINKCAVFLRESYIRMDIWASKQSNNR FT RLNVIGETRWWAKDHALKKIFGSFNNPSTSLYIELIETLQELASSEDFNPT FT VRDTAQTLLDKCISFETILTVQVFLRIFQHTASLSKYLQTSGMDVLQAYRN FT VDCTIKALRDVSRDFDGINASAKKFVEWANNSLEDLKIDVTVEAALPEKRV FT RVPKAMSGERRTHEAVGDRAIDCYRITVHNCVMDKVVKTLEERFKAQDTLF FT ADMACLNPENFYDIKKNGIQPTALVRLSSILTKFNKDATASNLQSELIDFS FT LKWSKFKKTVPEEYLGMNLAENRNDNDELENDEETAVSSSFSEDKEAHGKC FT KSCKNCPVCCFTVLHKYNFYSKAYTHLYNAYKLLLTLSSTQVACERSFSKL FT KYIKNRLRSSMSSTHLEAFMLMSVEVDVMVGLSNEDIIDSVAEIAPGLKTH FT LVL*" XX SQ Sequence 3902 BP; 1190 A; 778 C; 838 G; 1096 T; 0 other; cagggctgga ctgggccgcc gggacaccgg gacgattccc ggtgggccgc ctggttttta 60 gcaggttgtt ttgatttgct tttgacttca aacgcgattt atggctagcg ataattatgc 120 gataccatcc gcagtgagat atttcgtatt tttaatgatt gcattttgtg aagtgaatct 180 tttgaagtga caggtcgtgt aacggtgaaa acggcatttg aaaacagtgt gtgaaactac 240 gtgtagcata aagtgaaaag ttgatgattg cgtattcggg aagcgggagc gatcggagcg 300 atcgttgacc acggaaataa tatttagttg cgaaaaactg gccagatttt ttagtcgttt 360 tgagaacatt tcattttatg tatgtttcca tgtttgtccg caaaatagtt ataaagttaa 420 aatgtaaact taaatgggcc tctaatatta tagtcttctc cttacgtata tatttgataa 480 gatgagtgaa gatttgaagc ggaaaagtaa aggcggggca gaaaagcaac gtgaaaagaa 540 acaaaagctt cttcttcaag ggggagaaaa gtgtaaaaag ataaccgaac tttttagact 600 aagtaccccc acccagtttg gtggtgatac tctggttgat gatttaggtt ctcaaatttc 660 tgacggcgat caagccggtc ctgccgtctc agcctccgtg agctgcactg tgctaactcc 720 atcccagaaa tctgtatcct ctgactcccg gcctcagaat cagtgttcgg aaattttcat 780 gcgtccacaa tcgcagtgcc ttgaatcatt tttcaagagt catccggttc aaccaaaaga 840 ggatggggaa aatttaccat tttcaaatgt tcgtcgagtg ttttatagaa gccaagacga 900 tatcaatagg gtttggcttt cttactgtca cagttcgaag gccatgtttt gcacagtgtg 960 tctggcctac agcaaagcta gtgaatccaa tgcatttact gatggcatga atgattggaa 1020 gcatgttcac caacgaatcg aagaacacga atcgagtaaa caccattgca ggagtgtaga 1080 ggcacacatt atgtcgagca atgacaagga cgtatatagc ttgctttttt caaatcaaaa 1140 aaatgtgaga gctgctcaag ttaaagaaaa acggcaagtt ctagaacgtg tgatagaaat 1200 tgttaaactt attggaaagc gaggacttag ttacagatct atcaatgacg cagcctactg 1260 tctggacaat gtcaatattg accacgggaa ctttttagaa attttaatcc ttctgtctaa 1320 atttgacccg cttatgaaaa atcatcttga catcgtgacg gagaaaagta agcaacgaca 1380 tgtcacgtgc gcagcatcag gaacgaaggg acgaggaggg ttagtaacat ttatttcaag 1440 cactactgtc aattatgtaa ttgaatctgt tcgccgtatg attaaggcat cgattgcaga 1500 tgagattcaa tcggcaggaa tgttctccgt gcaactggac acaacgcaag atgttagcgt 1560 aaaggatcaa tgctcgattg taatccgata tgttaaagac catatttacg agcgactggt 1620 atccgtcagc aattgtacta attcgacagg caagggtatg ttcgagctgt tccgggatga 1680 aattgtcaaa atgggcataa acattgaaaa ctgtgtggga aactctacag acggggcggc 1740 aaacatgcag ggccaatatt ccggtttcac aaaatggctc agtgaggcat ccccaaacca 1800 agtgcatgtg tggtgttacg ctcatgtatt gaatcttgtt cttgttgata ccacacaaac 1860 caccaaagct gctatcagta tatttcagct tataaacaaa tgtgctgttt tcttgcgcga 1920 atcatatata cgaatggaca tatgggcgtc aaaacaatca aacaatagaa gactaaatgt 1980 aattggcgag acacgctggt gggcaaagga ccatgcatta aaaaaaatat ttggaagctt 2040 caacaaccca tcgacgtcct tgtatattga gctaattgaa actttacaag agttagcctc 2100 ctctgaagac tttaacccaa ctgttcgcga caccgcccaa accctattag ataagtgcat 2160 ctcatttgaa acaattctta ccgttcaagt ttttttgcgt atattccaac acacagcatc 2220 actttccaag tacctgcaga caagtgggat ggacgtgctc caagcctacc gaaacgttga 2280 ctgcactatc aaggcactgc gagacgtgtc cagagacttt gatggcatta atgcgagcgc 2340 taaaaaattt gtcgagtggg ccaacaactc attggaagat ctgaaaattg atgtgacggt 2400 tgaagctgca ttgccagaga aacgagttcg cgtcccaaaa gcaatgagtg gtgagcgacg 2460 cactcacgag gctgtcggag atcgcgcaat tgattgctac aggataacgg tacacaactg 2520 cgttatggac aaagtagtga aaactttgga agaacgcttt aaagctcaag acacgttgtt 2580 cgctgacatg gcgtgcttga atccggaaaa tttctacgat atcaagaaaa acggtatcca 2640 acctactgcc ttagtacgtc tcagcagcat tctaacaaaa tttaataaag atgcaacagc 2700 ttcaaattta caatccgaac tcatcgactt ttcgctcaag tggagcaaat ttaagaagac 2760 cgttcctgaa gagtatttgg gcatgaactt agcggaaaac cgaaacgaca acgatgaatt 2820 ggagaacgac gaggaaactg cagtttcttc ttcgttctct gaagacaaag aagctcatgg 2880 taaatgcaag agttgcaaaa attgccccgt ctgttgcttc acggttttac acaaatacaa 2940 tttctacagc aaagcctaca cccatttgta caatgcttat aagctcttgc tgactttatc 3000 gtctactcaa gttgcgtgtg aacggtcctt ttccaaatta aaatacatta aaaatagact 3060 gcgaagttca atgtcgtcca cgcatttaga agctttcatg ctgatgtctg tcgaagtaga 3120 tgttatggtt gggctgtcca acgaagacat catcgactcg gttgcagaaa tcgcacctgg 3180 gctaaaaaca catctggtcc tttgatccac taccatccac attcggtaag tttcgataat 3240 atggcgtcag aaacattatg ttattacgga cgttaccaaa tcataattga tgggttattt 3300 tttttgttaa tatcgagtgc aacacattat aaccacagaa ttctggagac agcatgtata 3360 ccttttatga ttgcaataca gattctattg gtcgaatgca gttttgcact aactatacca 3420 tgtataaggt acgtgctata aatattactg ttaaagcatc aaagtcgatt atcagatgtt 3480 tttgtgcaac ttagtcaaat aattgttttg tagtagagca gagagcttgc acattatagt 3540 atagttgtga cttttcgagc gatattttcg actacctatt aaaacatatt gaaacttcta 3600 tattatacct gtatggtaaa tggtagctat atacctatat agatatatat aaatatatat 3660 tatgcaagaa taaaaatgtg aagcagtcta caatttgtag gccttaaaaa tgtcgccgta 3720 attttatcta actcctgaac atgatacgaa tttttttgct gctgctatgc aagtcggaaa 3780 ttcccagctg aacccccccc cctcatcctc ccccgcccct taatgactta gtattgctgt 3840 atgtgggccg gaccgcttgg tttacagtcc cgggccgctt gaccaccccc agtacagccc 3900 tg 3902 // ID BEL-103_AA-I repbase; DNA; INV; 7258 BP. XX AC supercont1.294; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-103_AA_; KW BEL-103_AA-LTR; BEL-103_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7258 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.294; Positions 168511 175768. XX CC Positions [5200-5790] - Integrase core CC 'AACTG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 864..2651 FT /product="BEL-103_AA-I_2p" FT /translation="MNSDQVKMASELKELKRQIRQLQNTFDGVRRFVGRFK FT KEKHYAHIDTRLEMLEAAMTKFYAIRRKIEEVMEEIDESSVAESKESPEER FT TARLEILSERRCQESSDVIQETEDVYCELRSLLSSLKGEPTSTTSSANPEA FT AAQHVVKPTSTVKLPELRLANFGGRLLEWVTFRDQFQSLIHRNYQLSDMDK FT FSYLRSSLVGEALQEVGALEMTSANYSIAWDLLQKRYENKKLIVKAHLDAL FT FAVEPMKRESYEALNHLISEYDRNLQMMGKIGEETANWSTILVHMVCCRLD FT PATLRHWESHHCSKDVPKYDDLMEFLRNQCSVLQSIAPEKSTDTDVRKART FT SVSHPSTQFSNNRCHFCGEAQHSAFRCQRFLKMKVPERYEKVKKCGLCLNC FT FSPSHLVRLCTKGVCRHCQRKHHTLLHLGSSEGGAAATATQIGSSVPQEQT FT RTQRANNHQTQPHAQNQTAPSQNTGPNTRSFPIASTSTHPLPTTDRTPSQH FT TNSLPVSIHTPSRQVLLSTALVRITDQYGNVQLARALLDSCSEYCFITRSL FT CQKLKLGGSPNHLSIVGIGGSSTKSTRMVNATVMPRSLKISPYPIARFA" FT CDS 3010..6159 FT /product="BEL-103_AA-I_1p" FT /translation="MCKEIFERTTTRDETGRFVVTLPKKESVIQKLGDSKA FT NAFKRFYGLERRFAANTALKEAYFEFINEYRSMGHMEEVSEEEEPAYFLPH FT HAVLKPDSTTTKLRVVFDASCRTSTGVSLNDGLMCILWRPSPDDSIKAYKL FT TTVTYGTSSAPYLATKCLQRLGSVGEATHPAAAKVIKKDFYVDDALTGTDD FT LEEGKALVSELIDLMSSAGFILRKWSSNSAELLGNVPIDLRDERSSFELDS FT STSAVNTLGLIWEPAADIFRFTVPKLELDTPITKRIVLSESAKIFDPLGLI FT GPVVVQAKIFLQTLWKQKCNWDDLLPDELQSSWIEFRRNLLALDTLTVPRW FT VSFAKDLVTVELHGFSDASNVAYGACFYLRCITVDGTITAKLVTAKSRVAP FT LEDLKRKKKVLSTPRLELSAALLLSHLYEKVCSSIKIPFQSFFWTDSTIVK FT YWLASPPSRWQVFVANRVSEIQHITSNGVWNHVAGADNPADIISRGMIPAQ FT LKYETMWFEGPLWLRQDRSMWPSNSSVTPDVEHSLLEERSSVALPARAKPP FT NELFLLRSSFTELVRIVAYIRRFLHNATPNNRTHKRSGQIKLQELNESTEA FT LVRLAQADSFPEEVAALSRAREVKSSSKILMLHPILVDETLRVGGRLANAP FT ISESRKHPMILHHRHPFTKLVLRHYHLAHFHAGQQLLIASVREKFWPTNAR FT DLARTVCHECVTCFRNKPTVHEQLMADLPSVRVNPAAVFLKVGVDLCGPFY FT TKYPVRRSAAVKCFVAIFVCLATKAVHMEVVADLSTQAFLAAFKRFVAVRG FT KPQVVMCDNATNFVGANRELEELRLQFLDQQFQHTVVRTVEDEGIQFEFIP FT ARSPNFGGLWEAAVKSFKGHFRKIVGNQQLSYDELHTIVQQVAAILNSRPL FT TPLSNDPNDYVALTPGHFLVGRPLTAVPEPDLQEIPENRLSVWQRSQDFVQ FT RLWRKWKTHYLSDLHNRTKWTRKRDNIKVGTMVLVKEDNLPPQRWKLGRVA FT EIYTGSDGNVRVVEVRTKDGLFKRAISKICVLPIKDNAPIEEEY" XX SQ Sequence 7258 BP; 1924 A; 1767 C; 1762 G; 1805 T; 0 other; ttttggtcct tcgagccgga tttacgaccg gtggaccgtt ggatttggag aaatccgtga 60 agatcggaca gtgaaccgtg tcgaacgaca atagtgtggt gggactatag tggcgtcggt 120 accggccata gtgtatagtg gtttggcttg gcgtgggtac ccgccatcga caaaagccac 180 agtgaagaaa ggccatagtg agtgaaattg aaattggcgc ggcttgagga gccgccatcg 240 aagaaagcct gaagaaccga aaccggccat cgtaattcag cggtcccttt cggcgcggga 300 aggagcagtg accggtcgat gaccctgcat caagtggata gcatcagcca atctccagca 360 accaaggatc gcggaacgag aatcaaccag gcaaagggaa gtagcgtgag tagttgagat 420 ctgtggtggc atcatgcatg agatgcccac cgaaatattt gtgggtcgtt ttgaataaaa 480 cgaatgtgag atgaaattca aatcaatcat ctttcaattg catgcgatca ttttcctgga 540 atgtggttct tgtgtcctct tgattgacgg tcttggtttc tgctggcgtt tttctggctt 600 gttgttcgct cggttggatc ccctgctgca gtcaatttcg ttcgaattta tctctgtcac 660 ccgtgggacg tcgaggaatc gcgaatcggt gaatcacggc aatttggtgc tattaatgag 720 tggaagtgac taaatgtgta cgctttgaag ctattgatag gttgaatcaa tcgctctgtg 780 actttttcga actttagtgc gtgattggtt caatttaatc tttgcaattt gaactgatat 840 taagtgagag ttgacaattg acgatgaaca gtgatcaagt gaaaatggcc agtgaattaa 900 aagagctgaa acgacaaata cgtcagctgc agaatacctt tgacggtgtt aggcgtttcg 960 tcggtaggtt caagaaggag aaacattatg cgcatatcga taccaggttg gagatgttgg 1020 aggcggccat gacgaaattc tatgcgatcc gtagaaaaat cgaagaggtg atggaagaaa 1080 tcgatgaatc aagcgttgcg gaatccaagg aaagccccga ggaaagaacc gcccggttag 1140 agatactgtc ggagcggcgt tgtcaagaat cttcagatgt gattcaggaa actgaggacg 1200 tctattgcga gttgcgatcc ttgttgtcat ctctcaaggg cgaaccaaca tctacaacat 1260 catctgcgaa ccctgaagct gctgcgcaac atgttgttaa gccaacgtca actgtgaagc 1320 tgccagaact tcgattggct aattttggag gacgcctact agaatgggtt acgtttcgtg 1380 atcagtttca gagcttgatc catcgcaact atcagttgtc ggatatggat aaattcagtt 1440 atcttcggtc atctttagtg ggtgaagcat tgcaggaagt tggggcacta gagatgacgt 1500 cagcgaatta ttcgattgcc tgggatctac tacagaagcg ctacgagaac aaaaagctta 1560 ttgtcaaggc acatctggat gctctcttcg ctgtagaacc catgaagcgt gaaagctatg 1620 aagcactgaa tcatctcatc agcgaatacg accgcaacct tcagatgatg ggaaagattg 1680 gagaggagac tgccaattgg agcacgatat tggtacatat ggtgtgttgc agacttgatc 1740 ctgctacact tcgccactgg gaatcgcacc attgttccaa agatgtcccc aagtacgacg 1800 atttgatgga atttctgcgg aaccaatgct cagttctcca gtctatcgct ccagaaaaat 1860 ccaccgacac tgacgtgagg aaggcgagga cttctgtgag tcatccgtcg acacagtttt 1920 ccaacaatcg atgtcacttc tgtggtgaag ctcagcattc agcgtttagg tgccagcgat 1980 tcctgaaaat gaaggtgcct gaacgttacg agaaggttaa gaaatgtgga ttgtgtttga 2040 actgcttttc tccatcgcat cttgttcggc tctgcacaaa gggtgtctgc cggcactgtc 2100 agcggaaaca tcacactctg ctgcatctag gatcttcaga aggtggagca gctgctacag 2160 caacgcagat cggttcctcc gtcccacaag aacagactcg aacacaaaga gcgaataacc 2220 atcaaacaca gccacacgcc caaaaccaaa cagcaccaag tcaaaacact ggaccaaaca 2280 caagatcatt cccaatcgca agcacaagca cacacccact acccaccaca gatcgtactc 2340 cctctcaaca cacaaactca ctgcctgtca gcattcatac gccatctcgt caagtactgt 2400 tgtccaccgc cttggtgcgt ataacggatc agtatgggaa cgtacagctt gccagagcac 2460 tgctagactc gtgctccgaa tactgcttca ttactcgaag cttgtgccaa aagctcaagc 2520 tgggaggttc tcccaaccat ttgtccatcg tcggtattgg aggatcgtct acgaaatcaa 2580 caagaatggt gaacgcaacc gttatgccac ggtcactcaa aatatcgcct tatccaattg 2640 cacgttttgc ctaagctgac atcagatttg ccgaccgaat ttgtgaatgt acaacaacta 2700 gcgattccag agcatcttac tctagcggat cccacatttt ttgagccggg atcgatagat 2760 ctgatcatcg gcgccgaata ttattacgat ttgctggcag aggggaaggt taagctggta 2820 gacgatggac ccactctgca agaaactgtt ttcggctggg tcgtctctgg acgcgtacct 2880 ggatcttctt cggttgttca acgatccatt tcgtatcctt gtgttgcacc cgatctgaac 2940 gacctgctta ccaagttttg ggaactcgaa tcctgccact ctgggggaac gttgtcagta 3000 gaagagtcaa tgtgtaaaga aatcttcgag cgaactacca cgcgtgacga aactggtaga 3060 ttcgtagtta cgctaccgaa aaaagaatct gtcatccaga aactggggga ttccaaggcc 3120 aacgctttca aacgattcta tggactggag cgtcggtttg cagcaaacac cgccttgaaa 3180 gaggcatatt tcgagttcat caacgaatat agatcgatgg gccatatgga agaagtgtca 3240 gaagaagagg agcctgccta ctttttgcca catcacgcag tgctaaagcc tgatagcacc 3300 acgacgaaac ttcgggtcgt ttttgacgcc tcgtgccgca cgtcgacagg cgtgtctttg 3360 aacgatggat tgatgtgcat cctttggaga ccatcgcccg acgactccat taaggcgtac 3420 aagttaacca cggttaccta cggaactagt tctgcaccct atctcgcaac aaagtgcctt 3480 caacgtttag gaagtgtagg tgaagcaacg catcctgctg cagcgaaagt gatcaagaaa 3540 gacttttacg tagatgatgc gttgacaggg actgacgatt tggaagaagg aaaggcttta 3600 gtatcggagc tgatcgacct gatgagctca gctggtttca tcttgaggaa atggagttcc 3660 aacagcgctg aactacttgg taatgttccg atagatcttc gagacgagcg tagttccttt 3720 gagttggact cgtcaacatc tgccgtgaat acccttggtt tgatttggga accagctgct 3780 gatatctttc gtttcactgt tcccaagctg gagttggata cacctatcac caaaagaatc 3840 gtcttatccg aatcggcaaa gattttcgat ccgttgggac taattggtcc ggtagtcgta 3900 caggccaaaa tctttcttca aacgctatgg aagcagaaat gcaactggga cgatttatta 3960 ccggacgagt tacagagctc ttggatcgaa tttcggcgaa atctcctggc cttagacact 4020 ctcactgtcc ctcgctgggt ctctttcgcc aaggatctag tcacagtaga gttacacgga 4080 ttttcggacg cttctaacgt tgcctatggt gcctgctttt atcttcgctg tattacggtt 4140 gacggtacca taactgccaa gctcgtcact gccaaatcca gagtcgcgcc tctggaggat 4200 ttgaagcgaa agaaaaaggt tttatcaacg cctcggctcg aactttccgc agcgctcttg 4260 ttgagccact tgtatgaaaa ggtatgcagc agtatcaaaa ttccatttca atcgttcttt 4320 tggaccgact ccacaatcgt gaagtactgg ttagcttctc caccatcccg ttggcaagtg 4380 ttcgtcgcaa accgtgtctc ggaaattcag catataacca gcaacggcgt ttggaaccat 4440 gtggccggag cggacaatcc agcggacata atatcgcgcg gaatgatacc cgcacagctg 4500 aaatacgaaa ctatgtggtt tgaaggtcca ctgtggctac gacaagaccg ctcaatgtgg 4560 cctagcaatt cgtctgtaac accggacgtt gagcattcac ttctcgagga gcgatcttcg 4620 gttgcgctac cagctcgcgc aaagcctcca aacgaactat ttctcttgcg ttcatcgttc 4680 accgaactcg tgcgaatcgt agcatacatt cgtagattcc tccataacgc aactccgaac 4740 aaccgcaccc acaagcgttc gggacagata aagcttcaag agctgaatga gtcaaccgaa 4800 gctctagttc ggctagctca agcggacagt tttcccgagg aagttgcagc actttctcgt 4860 gcgcgggagg tgaagtcttc gtctaagatt ctcatgctac atccaatcct cgttgatgaa 4920 acgcttcgcg ttggcggccg gcttgctaac gcgccaattt cggaaagtcg caagcaccca 4980 atgattttac atcaccgtca tccgttcacc aaactcgttt tgcgacacta tcacctcgca 5040 cacttccatg ctggtcaaca gcttctaata gcctcggtca gagagaagtt ctggccaacc 5100 aatgcacgtg atttggcccg cactgtgtgt cacgagtgtg taacgtgctt ccgtaacaag 5160 cctaccgttc acgaacaact aatggccgat ctgccgtcgg ttcgagtaaa ccctgcagcc 5220 gtatttttga aggttggtgt agatctttgt ggaccattct acaccaaata tccagtccgg 5280 cgtagcgcag cggtgaaatg ctttgtagcg atttttgttt gtctggcaac gaaggccgta 5340 cacatggaag ttgtggcgga tctgtcgacc caagccttct tggctgcttt caaacgattt 5400 gtcgcagtta gaggaaaacc acaggtggta atgtgcgaca atgccaccaa ttttgtaggc 5460 gccaacagag aattggaaga gttgcgtctc cagtttctcg atcaacaatt ccaacacact 5520 gtagttcgta cagttgaaga tgaaggcatc cagttcgagt tcataccggc tcgttctcca 5580 aacttcggtg gactctggga ggcggcggtt aagtcgttta aaggtcattt tcgcaagatc 5640 gttggaaatc agcaactcag ctacgatgag ctacatacaa ttgttcagca ggtggccgca 5700 atattgaatt cgcgcccact aaccccgctt agcaacgacc caaacgatta tgttgcttta 5760 accccaggac acttcctcgt tggaagaccg ttgactgcgg ttcctgaacc tgatcttcaa 5820 gagatacccg aaaatcgttt gtcagtctgg cagcgatcgc aagattttgt gcaaagactc 5880 tggcggaaat ggaaaaccca ttatctgtcg gatttgcaca atagaaccaa atggactagg 5940 aagcgcgata acatcaaggt tggcacaatg gtgttggtga aggaggacaa cctgcctccg 6000 caaaggtgga aactggggcg agtagcagaa atttacaccg ggtcggacgg aaacgtccga 6060 gtggtagagg tccgcaccaa ggacgggctc ttcaaacgag ccatctccaa aatctgcgtt 6120 cttccaatca aggataacgc gcccatagaa gaagagtact agactcttcc atcgatggtg 6180 cttcggcacc gcgggggtct tcacattgag ggcctccgct ttccagttaa gtatgtgttt 6240 cattgaataa ttcaaaaagc tcaagaaatg ttcatccccc atagccaccc gtcggtccct 6300 tcggggacgc ccgtcatcag aagacatcgc gctgctatct ggtccatccg tcaatcacgc 6360 tttagttcac catgtagtcg ttgagcatgc cctatccatg atgtccatat cgtccgtcct 6420 gctattgaaa tttcgttgtg acgtcaatct atatcaccaa tgcattaacc actctcactg 6480 aaaggtcatt agcgcatcta ccatcatcaa ccatccagct gaggaattgg taccatatct 6540 cagcatcgct gcaaacgcca tcgacaagtc gtccacaggg cctgatcaac ccgatcggag 6600 ggctaatgtc ttcgtcggtg cagcatcgtt catcgcaaca tagaaccgta gtagaccttt 6660 tcaatccgtt tggagcgacg taccgaccgt cgtcatcggt gcctggggta cctgcggagg 6720 ccaggaggcc tccgctccag gcaagttctg ttttagttca taaattcaag gtaaaaagca 6780 ccagttcatt tctagcgtcg attgatgtca gcataccaca ccggaggacg taggacagca 6840 tcagcgaagc cacccactag aagagaggtt ccattcgacc ccaacaaggg cacggacagc 6900 gtcgaaacaa tccatcaaca tcaagtcgtc atcccagtcc aagcagaatc caatatcgtc 6960 gtcgtcatca ccattgtagt ccatctcaag cccctatcag caatcatcgt ctacatctac 7020 agtccttcga gcgtcgagca atatcagcta tccagcgatc aacaagcagg tcatcgctct 7080 ccggctgaac tcgtcggaca gaatagtgct agtatataag aacatctaag tcaaaacaaa 7140 taatgcaaaa tactaggtgg cagtagcatt cgaagtcggt taggtagagt tacagagtcg 7200 gtcgaaatag tatgcccagg atttgattga aatccctatg ctttcaaggt ggccggta 7258 // ID BEL-224_AA-I repbase; DNA; INV; 6013 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-224_AA_; KW BEL-224_AA-LTR; BEL-224_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6013 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 905-905 (2011). XX DR [2] (Consensus) XX CC Positions [4907-5488] - Integrase core CC LTRs are 90% similar to each other. XX FH Key Location/Qualifiers FT CDS 605..5920 FT /product="BEL-224_AA-I_1p" FT /translation="MAEKKMKSKELKRKNTMDSIQRIEDFLVNFDAERDKH FT EITIRLSRLDKLMEAFESIQGEYEAFDDSPAFVAANAKVRAKVEEQFFRVK FT GGLMSKDIPPPNPAQPLNQQVAPAIAHVTGVKLPTIELPRFDGDLNDWLTF FT RDSFSSLIHSSPEIPCVQKFQYLRSALRGDALKLIESLTITANNYALAWEA FT LLDRYSNSYLLKKKHLQSLMVHTKVSGKSPVALRNVVEDFQRHVKILNQLG FT EPTAQWCSLLVQLLCARIDDHTLKEWEEFVSGNEDPTYENLIHFLTRKIRT FT LESLHISAEQPYTSQGKHYPPVRQSKQQSNSSPKINSFSAVEHTQPSCLAC FT SDCHPLVKCPVFEKMQLKDRLNLVNTKRICSNCFRSNHFARNCSSNFSCRH FT CQKRHHTLLHPGFDSPGTNAGNSSSNQVNRPVSPPKQSAANGDVVSNSSGG FT ANVISSNSVVEPRGVNVFLSTIVVKVLDGYGNEHLARALLDSGSQCSLMSD FT RLCQQLRLVRHRIEQPILGVGESQIRAVCSVNTEARSRFGDFSIPLNCLVL FT KKLTSDLPAVTIPITNWNIPNQIDLADPEFNVPRKVDLIIGAEHFYAILQG FT GRFPLGPKAPTLVESKFGWFVSGKASYEAMPHTLVCCMSTLDSLNQNIEKF FT WKLEELDRALPSPTEQYCEEYYKKTVSRNPDGRYIVRYPKKDNFQDMVGES FT LVNSKRRLEGLERRLDSNPILKERYHAFIAEFIELGHMRKVPPTEPEPHTV FT CYIPHHAVQKESSSTTKVRSVFDASAKTSSGFSLNEALLVGPVIQDELLDI FT VIRLRRHKVVLLADIEKMYRQVEIHPDDRPLQRVLWRFNKSDPITKYEMTR FT VTYGLAPSSFLATRTLLQLAEDEGDSFPLAASALKQDFYMDDFIRSVATIS FT QAIQLRKEMDELLRRGGFPLRKWCSNFSEVLEGVPSENLANPSSQTFAPDE FT AIKALGISWEPASDQFRFNVGPFSVDAPITKSKILSVIAQLYDPLGLIAPV FT IVRAKILMQLLWTISIDWNEEVPVEIRTIWESYVEDLTLLSDFRISRLAFD FT VGEVQFHCFADASESAYGTCVYARTLKPNGEVKVVLLASKSRVAPLKKRSI FT PRLELCAAQLGAQLSSRVVAALKMETTPVFYWTDSTVVLHWLRSPPQTWKT FT FVSNRVADIHTLTNSSRWFHVPGVSNPADLVSRGMPVPQFLGSREWIHGPD FT WLGEGEENWPKLNLSQTKLADVECERKTTTLFARSPPPINPLFARSSSFDR FT LLRTTAYCLRFCHYTRKGPKQPTIALTPIEIETARDRLVKIVQTECFKPEL FT TLLRKGKPVANNSSLKLLNPFLDSVGIIRVGGRLKLSTESYTTKHQILLPG FT FHKFTRLLLMSYHCKLIHGGISLTLGVVRNEYWPTNGRRAVRSVIRTCYRC FT TRANPRPLQQPVGQLPLARVTPSRPFASTGIDYCGPVFVKPAYRKSAPTKA FT YIALFVCFSTKAVHIELVGDLSTASFLSALQRFIARRGKPEHVYSDNATNF FT VGAKNELHALYRMLSNTDEINRISSALATDGIQWHMIPPRAPNFGGLWEAG FT VKVAKKHLLRQLGSSPLLYEDLVTILVIIEGAMNSRPLAPLSEDPNDFQAI FT TPSHFLIGSSLQALPYPDMKDVPVNRLRNRYDVIQQKQQLFWYHWRTEYLK FT ELQRLSASNPQRVVLKIGQVMILQDSCLPPVRWPLVRVVELHPGQDGVTRV FT VTIRTPTGATMKRAVTKLCPLPMTDEEEMIYTDEATDHSSPGIVVEDSSKQ FT Q" XX SQ Sequence 6013 BP; 1576 A; 1504 C; 1390 G; 1543 T; 0 other; tttttggtgc cgtgaccagg atcacgggtg ttggattttc cctggaatca ccacgacgtt 60 cggacccaag ctaaccccac tttcaccctt cggaaggatt gttccagata gcagtacctg 120 ctcaaaaagt acaaggcctg ttgaagacag gctccatgtg agtatctgtc catccatcta 180 ttgtgctttg agaggtactt cctgggtctt cttttgacca atttaggatt tacacgtgtt 240 gctgctgttg gtgttgggca cgcggttgct gctgctgccg gtgcataccc tggtattcct 300 aaccgctacc gcctatctgt tcatcatccc tatcgtgttc gacgaggccc gtcatccacg 360 cactcggaag gttttcagtg ccgattggat tcagtgcatc ccatctttgc attccaaccc 420 atcccaattg ccatctttat cggtggttac aactagttcc agtcttacca ttaaatacaa 480 ggcctgtcat tggacgggca caggtgagtg tcagtccaaa tacttaccgt catccaaaag 540 gtgcttcctg tgtctcctgt ccattcacag cctccggtga cgtcatcatc acctcctcgg 600 catcatggcg gagaagaaaa tgaagtcgaa ggagctgaag cggaaaaaca ccatggattc 660 cattcaacga atagaggact tcctggtaaa cttcgatgca gagcgagaca aacatgagat 720 aacgattcgt ctgagcagac tcgataaatt gatggaggca ttcgaatcaa ttcaaggcga 780 atacgaggcg tttgatgatt ctccagcatt tgtggcggca aacgcgaagg taagggctaa 840 ggtggaagaa caatttttcc gggtaaaagg cggtctcatg tcgaaggata tcccccctcc 900 caacccagcg caacctctaa accagcaagt ggcacccgcc attgctcatg taaccggtgt 960 gaagctccca accattgaac tcccacgatt cgatggcgat ttgaacgatt ggttgacgtt 1020 ccgcgactcg ttctcttcgt taatccactc atcgccagaa attccttgcg tccaaaagtt 1080 ccaatatttg cgatcggctt tgcgtggtga tgcattaaag ttgatcgaat ccctcacaat 1140 tacggcgaat aattatgctc tcgcttggga ggctcttctt gatcgctact ccaattccta 1200 tctactgaag aaaaagcact tgcagtcgtt gatggtccat acgaaggtat ccggaaaatc 1260 cccagtcgca cttcggaacg tggtagaaga tttccagcgg catgtgaaga ttttgaatca 1320 acttggtgag ccgacggcac agtggtgtag tctccttgtt cagcttttgt gcgcgcggat 1380 cgacgatcat actctgaaag aatgggaaga atttgtttct ggtaatgaag atccaactta 1440 cgaaaaccta atacattttc tgacgcggaa gattcgtacc ttggaatcgt tacacatttc 1500 ggctgagcaa ccctacacgt cccaaggaaa gcactatccc cctgtcaggc aatcgaagca 1560 gcaatccaac tcttctccga agatcaactc gttctctgca gtggaacata cacagcccag 1620 ctgccttgcg tgcagcgatt gtcatccttt ggtcaaatgc ccggtgtttg aaaagatgca 1680 gttgaaggac cgcttgaact tggtgaatac caaacgtatc tgtagcaact gcttcagaag 1740 caatcatttt gcacggaatt gttcctccaa tttttcttgc cggcattgtc agaagcggca 1800 tcacacccta ttgcatccag ggtttgattc tccaggaacg aatgcgggaa attcaagctc 1860 caatcaggtc aatcggccgg tctctcctcc aaaacaaagt gctgcgaacg gtgatgtagt 1920 ttcgaattcg tctggtggag caaacgtaat atcctccaat tcggtggtag aaccccgcgg 1980 cgtcaacgtg ttcctctcta ccatcgtcgt caaggttctg gacggttatg ggaacgaaca 2040 tctagcacga gccttattag acagtggatc gcaatgtagt ctcatgagtg accgactttg 2100 ccagcagctt cgattggttc ggcataggat tgagcaacct attctcggtg ttggtgagtc 2160 ccaaattcga gcagtttgtt cggttaatac cgaagctagg tcgagattcg gagatttctc 2220 tattccgctc aattgcttgg tactcaagaa attgacttcc gatctccccg ctgttacgat 2280 cccaataacg aactggaata tccccaacca aattgatctc gccgatccag agttcaatgt 2340 gcctagaaaa gtggatctca tcatcggtgc ggaacacttt tatgcaatcc tacaaggtgg 2400 tcggtttcca ttaggcccta aagcaccaac gcttgtagaa agcaaatttg gatggttcgt 2460 ttcgggaaag gcttcctatg aggcaatgcc ccacacctta gtatgctgta tgtccactct 2520 tgattcactt aatcagaaca tagagaaatt ttggaagttg gaagaactgg atagagccct 2580 cccttcacca accgagcaat attgtgagga gtattataag aaaacggtat cgagaaatcc 2640 cgacggccgg tacattgtaa gatatcccaa aaaggacaat ttccaagata tggtaggcga 2700 atctttggtc aactccaagc ggcgattaga aggtttagaa cggaggttgg atagtaatcc 2760 catcctgaaa gagagatatc atgcatttat cgctgaattt attgaactgg gacacatgcg 2820 aaaggtaccc ccaactgaac ctgaaccaca cacggtttgc tacattcctc atcatgcggt 2880 tcagaaggaa tccagttcga ctaccaaggt gcggagcgtg tttgacgcct ccgctaaaac 2940 tagctcagga ttttcattaa atgaagcact acttgtcggc cctgtcatcc aggatgaact 3000 gttggacatc gttattcgtc tccgaaggca taaggttgta ctattggcgg acatcgagaa 3060 aatgtaccgc caagttgaaa tccatcctga cgatcgtccc ctgcagcgtg tgttgtggcg 3120 cttcaacaag agcgatccaa ttactaaata tgaaatgacc agagtgacct atggattagc 3180 cccatcgtcg tttctcgcaa cacgaacgct acttcaactt gccgaagacg aaggtgattc 3240 gtttcctctc gcagcatcag cactcaagca agacttctat atggatgatt ttattcgaag 3300 cgtagcgact atttcacaag ccatccaact gcgcaaggag atggatgaat tgctgagacg 3360 tggagggttt ccccttagga agtggtgctc aaatttctcc gaagttttgg aaggtgtacc 3420 atctgaaaat ttagccaatc catccagcca aactttcgcc ccggacgaag cgataaaagc 3480 tttgggaatt tcttgggagc ctgcatccga ccaatttcgc ttcaacgttg gccctttttc 3540 ggtagatgct ccaatcacca aaagtaaaat tttgtccgtc attgcccaac tatatgatcc 3600 cttgggtctg atcgcacctg ttatagttcg tgccaaaatt ctgatgcaac tcctgtggac 3660 aatttcgatc gactggaatg aagaagtgcc tgtggagata cgaaccattt gggagagtta 3720 tgtggaggat ttgaccctgc tgtccgactt ccgaatcagt agattggcct ttgatgtggg 3780 ggaggtgcag tttcattgct tcgcggatgc ctccgaatca gcctacggaa cctgtgtcta 3840 tgctcggact ctgaaaccca acggcgaagt gaaggtggtg ttactcgcat caaaatcacg 3900 cgtagcacct ttgaagaagc gcagcattcc cagactggaa ctgtgtgcag ctcaactggg 3960 tgctcaatta tcatcaaggg ttgtagctgc actgaaaatg gaaaccactc ccgtcttcta 4020 ttggaccgac tcaacggtag tgctacactg gctccgctca ccaccacaga cttggaaaac 4080 tttcgtttca aaccgagtag cagacattca caccctgacg aacagttcta gatggtttca 4140 tgtacctgga gtgagcaacc ctgccgatct tgtttcccgt gggatgcccg tcccgcaatt 4200 cctaggcagc agagaatgga tccatggccc agattggttg ggagaggggg aggaaaactg 4260 gccgaagcta aacctttcgc aaacaaaact tgctgatgta gagtgcgaac ggaaaactac 4320 aactttgttc gctcgcagtc caccaccaat caacccttta tttgcccgtt cgtcatcatt 4380 tgatcgcttg ttacgaacca ctgcgtattg tttgcgattc tgccactata caaggaaggg 4440 accaaaacaa ccaacgattg cgctgactcc tattgagatt gaaactgcca gagatcgttt 4500 ggttaaaatt gtacagaccg aatgttttaa accagaactc accctgctaa gaaaaggaaa 4560 acctgttgcc aataattcaa gcctaaaatt gctcaaccca tttctggact ctgtgggcat 4620 aatccgagtc ggtggccggc taaaactatc gacagaatcg tacacaacaa aacatcaaat 4680 tcttctacct ggttttcaca aatttaccag attgttgttg atgtcatacc actgtaaatt 4740 gatccacggg ggaatatcac ttacgcttgg cgtagtgcgc aacgaatatt ggcccactaa 4800 tggaaggaga gcagtgcgta gcgtaatccg aacatgctat cgctgtacgc gtgcaaaccc 4860 tcgccccttg caacagcccg ttggtcagct tccccttgcc cgagttacgc caagtcgccc 4920 cttcgcatcc acaggaatcg actattgtgg cccagtattt gtaaaacccg cctaccgcaa 4980 gtccgccccc accaaagcct atattgccct ctttgtgtgt ttcagcacta aagctgtaca 5040 cattgagctg gtcggtgatc tatcgacagc ttcattcctc tcagctctcc aacgtttcat 5100 agctagacga ggaaagcccg aacatgtata ctcagacaat gccactaatt ttgtcggggc 5160 aaaaaatgag ttacatgccc tctaccgaat gctctcaaat accgacgaaa taaatcgtat 5220 ctcgtccgct ctcgctactg atggaataca gtggcacatg atcccacctc gtgcccctaa 5280 cttcggcggc ctttgggaag ccggggtgaa ggtcgcaaaa aagcatctac ttcgtcagtt 5340 gggtagctca ccgttgctgt acgaggacct agtaacgatc ctagtgataa ttgaaggtgc 5400 aatgaattcg aggccattag ccccgctttc agaagatcca aacgactttc aagccattac 5460 gccaagccat tttttgatcg gctcatcctt gcaagcccta ccttatccgg atatgaagga 5520 cgttccagtg aaccggctta gaaaccggta cgatgttatc cagcagaagc agcagctgtt 5580 ctggtaccac tggcggacag agtacctgaa ggagcttcaa cgcttatctg ctagtaaccc 5640 acaacgtgtc gtactgaaaa ttggtcaagt gatgatcctg caagacagct gcctaccgcc 5700 cgtacgctgg cctctcgttc gtgtagtgga actgcatccc ggacaagatg gtgttacgcg 5760 cgtggtaaca attcgcactc caaccggagc gactatgaaa cgagcggtga cgaagctgtg 5820 tccgctgccc atgacggacg aggaagagat gatctacaca gatgaagcaa ctgaccactc 5880 cagtccaggt atcgtcgtag aagactcttc aaagcagcag tgagtaaatc gattatgttg 5940 aagattatgt atttcgctag atagatagat tagttttttg tagtgaaatg agacatttca 6000 ggtggccggc cta 6013 // ID RTE-14_BF repbase; DNA; INV; 3483 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-14_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-14_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3483 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3483 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1712-1712 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 78..3458 FT /product="RTE-14_BF_1p" FT /translation="GMSPRCNCSYLTWGSSVSATGDGLAVWRGVDCGSSEV FT LYNEGAHWPKLAVVWVTGTLGQDKSLKHPGNEDWQPLPVFAVYGSKMDRES FT SPKPILPGAGVPRSVAGSRVGPRRHHDDSPVNSREGGELDDHIRRNATTKR FT HILRCRHPLRLCTFNIRTLKVSDVGSSKIKTVYKTEELCRNLKQFGIEICG FT IQETRWMHKKDRRDEINSYTEPFGYTLYTVSAWENESKAATGGVGIILGRT FT AQDLLISVERISDRIIKVQLRGNPAVTVIVAYAPTEAADDCSKDTYYSQLR FT QTVEGVAPHDFLAVLTDSNARLGPEDAQFTYNSSTNNNGQRLLEILEDYQL FT LATNTLFEKRKGKLWTWRSPQDTYHQLDYIIVRAKWRNSVTNCEAYSSFSS FT LYSDHRAVTANISLRLRKSKIKNPKQNVKYIWSDLAADEGLQERYAVEVRN FT RYQALLLEDDEGQTEDYDKFISANASAAEECLRKVPKQKKRIKCLDPRVSA FT VREEVEKAYQAYLAGNKKEELREKYKEKKQELYGTYAVIDEEELTSKIEEV FT EKAHKNSQHGAAWKLINDISGRNSAQSSKLKANSPEERVSLWYTHFSKLLG FT SPPVISEEDTPIKPVFDTLNMSDEVFSEAEFIAAKMSILCGKACGDDGITP FT EFLKYAGLDDVVLGFINKAFSTGQLPERWKTLIIVPVPKTGDLTKPDSYRG FT ISLISLVLKLYNRMLLNRLRPLLDPLLRSSQNGFRQGRSTVGQIMAIRRLL FT EGVNHKNLSCIITFIDFKKAFDSIHRGKLMDILRAYGVPEKLVTAIAATYS FT QTWAKVRTPDGDTEPFQILAGVLQGDTLAPFLFIVALDYALRCAIEGKEER FT LGFTLKERASRRIPAKIVTDLDFADDIALISDTAEKACTLLNAVERQCQRI FT GLQLNTKKTKVMAFNSSDNNVATLDGTRLEVVPDFKYLGGWIASTAHDMKV FT RRALAWNALHSMRRVWQSGMADDLKRRLFVSTVECVLLYGSETWTLTVQDE FT RALDGMYTRMLRRALNVSWEDRVRNIALYGNLPRLSDKIRQRRMQLAGHYV FT RHPELVASELILWEPVQGKRKPGRQRTTLIDTLKRDSGLDRLSSTAELRSL FT MKDREEWRRRVQHASRVGI" XX SQ Sequence 3483 BP; 1044 A; 803 C; 900 G; 736 T; 0 other; ctcccctccc cctctctggg ttacgaggac acaggccaga caggggtcta tagctgaagt 60 ggaagcggcc tgtctaggga atgagccccc gatgcaattg ctcctacctt acctggggta 120 gctccgtctc ggcaactggc gatggactgg cagtttggcg tggcgttgat tgtggttctt 180 cggaggtatt gtacaatgaa ggagcacact ggcccaagtt agcagttgtg tgggtcacag 240 gaacactagg acaggacaag tccttaaaac acccaggcaa tgaggactgg caacccttgc 300 ctgtatttgc cgtgtatgga agcaaaatgg atcgagaatc ttcaccaaaa cctatacttc 360 caggggcggg agtccccagg agcgttgctg gttcccgagt gggcccaagg aggcatcatg 420 atgactctcc tgtaaactcg agagaggggg gagagcttga tgaccacata cggagaaacg 480 caacaaccaa gcgacacatt ctgcgatgta ggcatcctct tcgactctgc acatttaata 540 tccgaactct caaagtcagt gacgtcggta gtagcaagat taagacagtg tataagacag 600 aggaactatg cagaaatctt aagcagtttg ggatagagat ctgtggtatc caagaaacaa 660 gatggatgca caaaaaggac agaagggatg agatcaacag ttacactgag ccttttggat 720 acacattgta tactgtatca gcctgggaaa atgagtcaaa agcagctact ggaggagtcg 780 gaatcatact tggtagaact gctcaagacc tccttataag tgttgagcga atttcggacc 840 gtatcatcaa agttcaacta aggggaaacc cagcagtcac agttatcgtt gcctatgctc 900 caacagaagc agctgacgac tgcagcaaag atacttacta cagccaactg cgacaaacag 960 tcgaaggggt ggctcctcat gacttcctag cagtgttaac ggactcgaac gcccgacttg 1020 gcccagaaga tgcacaattc acctacaatt cctcaacaaa caacaatggt cagcggctgc 1080 tagagatcct tgaagactac cagctccttg caacaaatac tctatttgag aagcggaaag 1140 gtaagctgtg gacatggagg tcgcctcagg acacatacca tcagcttgac tatataatcg 1200 tcagggcaaa atggagaaac agcgtgacta actgtgaagc ctatagctct ttcagctcac 1260 tatactctga ccacagagcg gtcacggcaa acatatccct acgcttgcgt aagagtaaga 1320 taaaaaaccc caagcagaat gtcaagtata tctggagtga cctggcagca gatgagggcc 1380 tgcaagagcg ctatgctgtg gaggtgagaa accgctacca ggctctactg cttgaggatg 1440 atgagggcca gacagaggac tatgacaagt ttatctctgc aaatgcctca gcagctgagg 1500 aatgtctgag gaaggttcca aaacagaaaa agaggataaa gtgtcttgac ccaagagtca 1560 gtgcagtaag agaagaagtt gagaaggcgt atcaggcata tttggctggg aataagaagg 1620 aagaactcag agagaagtac aaagagaaaa aacaggagct ctatggcacc tatgctgtaa 1680 tcgacgaaga agagctaact agcaaaatag aagaggttga gaaagcacac aagaactcac 1740 aacatggtgc agcctggaaa ttgatcaacg acatctcagg tcggaacagc gctcagtcgt 1800 ccaagctcaa agcaaacagc ccagaggagc gtgtatcact ctggtacaca cacttcagta 1860 aactccttgg aagtccacct gtgattagtg aggaagacac accaatcaaa ccagtctttg 1920 acacactcaa tatgtctgat gaagtctttt ctgaagctga gttcatagca gctaaaatgt 1980 caatactatg cgggaaagca tgtggggacg acggcatcac tcctgagttc cttaagtatg 2040 ccggtctaga cgatgtagtc ctgggattca tcaacaaggc tttctccact ggccagctac 2100 cggagcgctg gaaaacactc atcattgttc cagttccgaa aacgggagac ctgacaaaac 2160 ctgacagtta caggggcatc agtttgatat ctctggtctt aaagctctac aacaggatgc 2220 tacttaaccg gctgagacca cttctagacc cactgctaag atcctcccaa aatggattca 2280 gacaaggaag atcgaccgtg ggacagataa tggcaatcag gcgtctgctt gaaggagtca 2340 accacaagaa tctaagctgc attataacgt tcatcgactt caaaaaggca ttcgactcca 2400 tccaccgcgg gaaactaatg gacatcctgc gggcatatgg agtgcccgag aagctggtga 2460 cagcaatagc agcgacgtat tcacagacgt gggccaaggt taggacccca gacggagaca 2520 ccgaaccttt ccaaattctg gcaggcgtac ttcagggtga cacgttggcg cctttcctgt 2580 tcatagttgc acttgactat gcactgagat gtgccatcga agggaaggaa gagcggttgg 2640 gcttcaccct caaggaaaga gcaagtcgtc gaataccagc taaaatagtg acggacttag 2700 acttcgcaga tgatatcgcg ttgatatccg acactgcaga aaaggcttgt actcttctta 2760 atgcagtcga gcgtcaatgt caaaggattg gcctacagtt gaatacaaaa aagactaagg 2820 tcatggcttt caattcatca gacaacaatg ttgctactct tgacggcact cgactagagg 2880 tagtacccga cttcaagtac ctcggtggct ggattgcctc aacagcacac gacatgaaag 2940 tgagacgagc tctcgcttgg aacgccttac acagcatgag gagagtgtgg cagtcaggaa 3000 tggctgacga ccttaaacga cgcctctttg tctctactgt tgagtgtgtg ctactgtatg 3060 gctcagaaac ctggacacta acagtacagg atgagagggc actggatggc atgtacacta 3120 gaatgctaag gagagctcta aatgtgtctt gggaggaccg ggtaaggaac atagcactat 3180 atggtaacct gcctagacta agtgataaga tcaggcaaag gcgcatgcag ctcgctggcc 3240 actatgtgcg acacccagaa ctagtggcaa gtgagctgat cttatgggaa ccggtccagg 3300 gtaaaaggaa acctggcagg cagagaacta cccttatcga cacactcaag cgagattccg 3360 ggctcgacag actgagcagc acagctgaac tacgttcact gatgaaggat agggaggagt 3420 ggagaaggag ggtccagcat gcttcccgtg ttggcatctg acgatgtcct cgactgaact 3480 gaa 3483 // ID Gypsy-151_AA-LTR repbase; DNA; INV; 172 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-151_AA_; KW Gypsy-151_AA-I; Gypsy-151_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-172 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1028-1028 (2011). XX DR [2] (Consensus) XX SQ Sequence 172 BP; 53 A; 30 C; 34 G; 54 T; 1 other; tgtaataaga ttgtaaatat atctagcaac actgtagtat tcggtatgaa cttccttcta 60 gagcattagt cagtagtcat tgagcagtgg acggatagtc acgtcgcaaa waaagtcgtc 120 gtattgaacc gtgtaaatcc gtgttttatt accgttaaag acacttctta ca 172 // ID Mariner-9_SM repbase; DNA; INV; 1098 BP. XX AC . XX DT 11-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-9_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1098 RA Jurka J.; RT "Mariner DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 153-153 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 265..903 FT /product="Mariner-9_SM_1p" FT /translation="MDQGTISAFKAYYLRETMRQLINETDGGGKDAMRLWW FT KQFNIMKAIRNIDEAWNEVTQSCMKGVWKRLFSEIVEVHCNYENNIQNIIE FT EISRIAISGGIEDVDEDGIHTLLKSHGEPLTNEELQEMDCHTYLENDVSKP FT VPVRNLTVKTIEKALSLHNQCMELLEENDEDYERVSAVKRGCNALLDSYRQ FT ILRNKRREAKQGNLDSFIQRREK" XX SQ Sequence 1098 BP; 359 A; 194 C; 224 G; 318 T; 3 other; tacaggtaat cccgttcacc gcgaccccgg cgagtgcgaa ttcgttataa cgcgaccgac 60 aatttatttt tcagacactg gccttgacag tgcgataaat cgtcataatg agtgacacat 120 aatgttcttg ctcacttaca cgcatataaa tgtacatagt aagtcatgag agtgagtgag 180 acaaaataat gtcgctacgt ggctcctagt agatacaatt tacatacata cagttgacct 240 agtttagctt tggatcaagt gcctatggat caaggcacaa tatcagcctt caaagcttat 300 tatttaagag aaactatgcg tcaactcatt aatgaaactg atggaggtgg taaagatgcc 360 atgcggcttt ggtggaaaca atttaatatt atgaaagcta tcagaaacat cgacgaagcg 420 tggaatgaag tcacccagtc gtgtatgaag ggtgtttgga aaagattgtt yagtgaaatt 480 gttgaagttc actgcaatta tgaaaataat attcaaaata tcatcgaaga gattagtcgt 540 atagccattt ctggaggcat cgaagatgtt gatgaagatg gtattcatac tttgttaaaa 600 tcacacggcg aacccttgac taatgaagag ctgcaggaaa tggactgcca tacataccta 660 gaaaatgacg tatccaaacc tgttccagtg agaaatttaa ctgttaaaac catcgaaaag 720 gccctttctc ttcacaatca atgcatggag ttgttagaag agaacgayga agactacgag 780 cgagtgagtg ctgttaaacg tgggtgtaat gcattacttg acagttatcg gcaaattttg 840 cgcaataaac gtagagaagc caaacaaggg aatctagatt cctttataca acgtcgagaa 900 aaatgaataa attaataatt tatccattgt actatactta atgttatttg ccttttrcat 960 tttttattct aatgtttatt taataaatat aaagtttttc gagtttttaa atgcaaaaat 1020 cccccgttga tcgcgatatt tcgcttaacg cgatcatgtc cagaacgtat cccccgcgtt 1080 gaacgggggt tacctgta 1098 // ID Gypsy-145_AA-LTR repbase; DNA; INV; 188 BP. XX AC AAGE02022110; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-145_AA_; KW Gypsy-145_AA-I; Gypsy-145_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022110; Positions 81191 81004. XX SQ Sequence 188 BP; 54 A; 44 C; 27 G; 63 T; 0 other; tgttatgatt tacttcagtg taggaatcag tgaactacct cacctacata tataaacaca 60 ttgattcttt gacatttatg tattaccaac ctaaaaccga ttttcattat taaaacgtgt 120 tcgcaacgag tccgaacggt tcttttattt gctccgcgtc cgatcatctg ttacccgccg 180 atacaaca 188 // ID Gypsy-58_AA-LTR repbase; DNA; INV; 202 BP. XX AC supercont1.29; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-58_AA_; KW Gypsy-58_AA-I; Gypsy-58_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-202 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.29; Positions 1486983 1486782. XX SQ Sequence 202 BP; 57 A; 42 C; 35 G; 68 T; 0 other; tgttgtagta ttggatagcc catcgcattc cattcataag cgcacactca cacactcttg 60 ttcatgttgt tccactgtgt tggtagtttt tcctttcatc attcgtgtgc taacagtcag 120 aatagcaagt caagtgactg taattgagtc cgtaataaat ctgaattact aagtaaaata 180 cgcgttttat tgcaccaaaa ca 202 // ID PiggyBac-1_HM repbase; DNA; INV; 3081 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE PiggyBac-type family: consensus. XX KW piggyBac; DNA transposon; Transposable Element; PiggyBac-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3081 RA Jurka J.; RT "PiggyBac families from Hydra magnipapillata."; RL Repbase Reports 8(12), 2100-2100 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 796..2643 FT /product="PiggyBac-1_HM_1p" FT /translation="MNPKNFYGSTKRLRIPFRKTVFPVNQPXLSEVSDLSG FT ESDEDFITSESESEYGTSEWDSETSDGNEQQQAPQLSSSKTHQHLLQQPQP FT TYQPQEKKRNLIWKIKKDDDVQQINLPFQNSEDKEDDNNGISDEPISYVRR FT FLTDELLQKIVDESNKYASQINIENMLRLEKDELEQFIGILYLMSIVKLPS FT TRMYWSNELYFEKVAAVMTLNRFEKIKLFLHCNDDRQRPENCTDKLYKIRP FT IINMLKESFISLKHDEILCIDEQVVPFKGRSSLKQYNPHKPKKWGFKFYIL FT ASVDGLVNNFEIHSGSIDVCVGQPDLKASGNIVMHLLVNVPRHKWHKLFFD FT NWYTGFDLVKTLYHQGIACTGTVRTNRLPNIKMPTDGELKKEGRGSTAIRV FT TTVDNVELRAIKWFDNRGVALLTSYEAVNPINIVNRWDRKTKRIVQVKRPS FT VVTTYNTYMGGVDLLDGMLSLYRIHIRSKKWYHKLIWHFFDLIVVQAWILY FT CRDMKKSQALKKDIFQLRIFKLKVAKCLVQSDKALSSRRGRPSTSVDHLHR FT LKRKRGPAANLPEPSTRKDNIGHFPFFVDKKGRCKYPGCTGITKVFCEKCK FT VHLCFTSNSNCFRKFHE*" XX SQ Sequence 3081 BP; 1099 A; 426 C; 505 G; 1049 T; 2 other; ccctatattg catagcgtgt ctttaaatta acataccaca tattggattt ttacaaagtt 60 taactaaagg tagtaactta aagtatcttt tgagtagttt agagttatga tacagaatat 120 tttaagtgtg accactttgc aataaaacta tatattacct tagtgaaaac atttgaagtt 180 gctaatttag agatgacggc ctcacttcat attgttaaac tttgaagaat ttattttgat 240 atttctgagc aaatttatat caattaaaaa atacttatta attttttaaa gttacaacta 300 tgttttagaa gcatttagat ttaatttata tttattataa aggtataaac ctgaaaaatg 360 tgtttgttgc tttaaagcaa caccatgcaa ctagatatat aaaaacctaa aaacaacata 420 tttatgaatt gtttattttt aaatgttttg tcaaggtttt gtaaaaagtt ttaattgtgt 480 tactagatga gtgttaaatg cttttatgga agcttatcta agtcaaaact taatcatcct 540 cctgtataat ttccattcca aaatccagaa atgtcagatg atagtgattt gtttacagat 600 tttgatgaag aataaaaatg caataaaaac ttaaacttta agcagaatat taagtacaaa 660 tttaatttaa gcattaattt aattatctta aactttttaa aacactgttt aagtattttt 720 atctttcaat ggagtctact agaaagataa aaatatttct tatatattat aaaaatattg 780 ctaaattttt tttagatgaa tcctaaaaac ttttatggat ctacaaagag gttaagaatc 840 ccatttcgaa aaactgtttt tcctgtaaat caacctraac tatctgaagt cagtgatctt 900 tcaggagaat ctgatgaaga ctttattaca tcagagtcag aatcagaata tggtacttca 960 gaatgggata gtgagacttc agatggtaac gaacaacagc aagctccaca gttatcatca 1020 tctaaaacac atcaacattt actgcaacaa ccacaaccaa cataccaacc acaggaaaaa 1080 aaaagaaatc ttatttggaa aattaaaaaa gatgacgatg ttcagcaaat aaaccttcca 1140 tttcaaaaca gtgaagataa agaagatgat aataatggaa tctcagatga gccaatatca 1200 tatgtaagaa gatttttaac tgatgaactt cttcaaaaga ttgttgatga gtctaataaa 1260 tatgcatcac aaataaatat agagaatatg ttacgtttgg aaaaagatga acttgagcaa 1320 tttattggca ttttgtattt gatgagtatt gtgaagctcc cgtcaacacg catgtattgg 1380 agcaatgagt tgtattttga aaaagttgct gcagtgatga cattaaaccg atttgaaaaa 1440 ataaagttat ttttacattg taatgatgac agacagcgtc cagaaaactg tactgacaag 1500 ctttacaaga tacgtcccat cattaatatg ctgaaagaat cttttattag tctgaaacat 1560 gacgaaatcc tgtgtattga tgaacaggtt gttcctttca aaggaagatc ytcgttaaaa 1620 cagtataatc ctcataagcc aaaaaagtgg ggtttcaagt tttatatctt agctagtgtt 1680 gatggactag ttaataattt tgaaatacat tccggttcaa ttgatgtttg tgtaggacag 1740 cctgatttaa aagcatctgg aaatattgta atgcatctgt tagtcaatgt accaaggcat 1800 aaatggcaca agttgttttt tgacaattgg tacactgggt tcgacctggt gaaaacactt 1860 taccaccaag gcatagcatg cactggaaca gttcgtacaa acaggcttcc aaatattaaa 1920 atgcctactg atggtgaatt aaaaaaagaa ggtagaggat caacagccat tagggttacc 1980 acagttgata atgttgaact tcgcgctatt aaatggtttg ataacagggg tgttgcattg 2040 ttaacaagtt atgaggcagt gaatccaata aacattgtaa accgttggga tagaaaaacc 2100 aagagaattg ttcaagtcaa acggccatct gttgttacca catacaacac atatatgggt 2160 ggggtggacc ttcttgatgg catgttgagt ctttacagga tccatattcg atctaaaaag 2220 tggtatcaca aactcatttg gcattttttt gatttgattg tagtacaagc atggatactc 2280 tactgtagag acatgaaaaa gtcacaagct ttaaagaaag atatttttca attgcgcata 2340 tttaagctga aagttgcaaa atgcttagtt cagtctgaca aggctttaag cagtagacga 2400 ggacgaccat ccacttctgt tgaccatctt catagattaa agaggaagag gggtcctgct 2460 gcaaatttac ctgaaccatc cactaggaaa gataatattg gtcattttcc tttttttgtt 2520 gacaaaaaag ggcgttgtaa gtatccaggc tgcactggaa taacgaaagt tttttgtgag 2580 aagtgtaaag ttcatttatg ctttacatct aattcaaatt gttttcgaaa gtttcatgaa 2640 taattcttga aatacttctt gttttttatt tacagttttt tgaaattgag ataatagtta 2700 agcttgtaaa ctaattctta ctttgaaatt tttttgaaag attttcttgt tttaaccaag 2760 agctgtaact aaatgttaag taaatttgat ttttaattac aattattctg tagttatttg 2820 tattttagaa ttcttgttta gactttgtgg tcaatttata atattatctt cttcagataa 2880 actattcttt gacattgtaa cttagtttaa aaaactgtca taactgcata ttgttgcctt 2940 aaagcaacaa agtccaaaaa acctaagtaa aaaggtaaaa aaaaaaattt gattattttt 3000 tgttttctag tcctcaaata acattttaaa tggattttca aattttttaa tcattagcgg 3060 aataaatcat gcaatatagg g 3081 // ID Merlin10_SM repbase; DNA; INV; 1213 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; Merlin10_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1213 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1900-1900 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 157..1050 FT /product="Merlin10_SM_1p" FT /translation="MAESNFLEFIESTTPEQMFDFLLEKKLINEMQNCKHC FT EIEMKCCKFPKSELEKIWRCMNVNCVHYETSLSVLHCSFFHNSTISIKKTL FT KIVYYILKKTKQKDISDHVGVSIKSICSIKKKLSVKIEEYFRNNPIRLGGN FT EKCVQVDETKLNHNAKAHRGRSAIAPTWAITMVDTSTTPARGYAEIVPNRS FT SETMIPIIEKVVRSGTTIQTDEWKAYNPLSNKDCYEHKKITHKFNFVDPVT FT GVHTQNVESFNNKLKLDIKREKGVKRDDRPRFLTFFIFIDTYKDNSLNKML FT EILKIN" XX SQ Sequence 1213 BP; 454 A; 166 C; 199 G; 394 T; 0 other; ggatcatgcg aactttatgc tctgaaagac cataaagttc gcatgatgat tttataagag 60 cataaagttc gtatgttcat attaataaaa tttccgccat cttttcccgc cattttcaaa 120 tttaaaagat ataaatatat ttttagcata ccctaaatgg cagaatcaaa ttttttagaa 180 tttatagaaa gtacaacacc agaacaaatg tttgattttt tgctagaaaa aaaattgatt 240 aacgaaatgc aaaactgtaa acattgcgag atcgagatga agtgttgtaa atttccgaaa 300 tcagaattag agaaaatttg gcgatgtatg aatgtaaact gtgtgcatta tgaaacttct 360 ctatctgttc ttcattgtag cttcttccat aatagtacga tttcaattaa aaaaacacta 420 aaaatagttt attatatttt gaaaaaaaca aaacaaaagg atatatcaga tcatgttggc 480 gtttcaatta aatcaatttg tagtataaaa aagaaactgt cagtaaagat tgaggaatat 540 tttagaaata accctattcg attgggtggt aatgaaaaat gtgttcaagt cgatgaaacg 600 aaactgaacc ataatgcgaa ggcacatagg ggcagatctg ccattgctcc tacctgggca 660 attactatgg ttgatactag cacaactcct gctcgtggtt atgctgaaat tgttcctaat 720 agaagttcag aaactatgat accgatcatc gagaaggtag ttagaagcgg caccacgatc 780 caaacagatg aatggaaagc ctacaatcct ttaagcaaca aggattgcta tgaacataaa 840 aagattactc ataaatttaa ttttgtagat ccagttaccg gcgtacatac gcaaaatgtt 900 gaaagtttta ataataaatt aaaattagac ataaaaagag aaaaaggagt taaaagagac 960 gataggccaa gatttttaac attttttata tttatagata cttataaaga taattccctg 1020 aacaaaatgc ttgagatatt aaaaattaat taaatttttg tgttaacttt tttgtaatat 1080 attttttttg gttaaagtcc atttttttat taatctttct aatattttta taagagtata 1140 aagttcgcat gttgaatttt taagagcata aagttcgtat gatcgtctcc agagcatgaa 1200 gttcgtatga tcc 1213 // ID L1-1a_Cis repbase; DNA; INV; 6478 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-6478 RA Smit A.F.; RT "L1-1a_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000141, Ci000973, Ci000161 1% div. XX SQ Sequence 6478 BP; 2632 A; 1151 C; 1048 G; 1642 T; 5 other; ccctatgtaa cggtgagaaa cgaaattttt ccaaagcgaa atggcagaaa cgaaagtttt 60 gtttctgaaa aaaaatgtat tagataattt ttttagcagt taggtggcac tagaggtcaa 120 acagtaaaat aaccatataa cttcttttaa ttcaataacg gtgtctggcc gaatggttag 180 cgcgactgat tcgtaattat ggaccagccc atgttgggtt cgatacttaa cccggtgaat 240 atttttttta ttcacatgaa ccagcctaat taaaattggt aaaacgtcac agtgacgtca 300 tgcagaacat tggcggcgta agaaatcagc tttgtcgcgc gttattatga tgtcataatt 360 aaactcgagt taacagtaca aacacctgcc aatagtggtg tgaataatga ctcataatgg 420 agttacatta ttacttgtgt aatccccaca acagaaagat ttttttgttt gcnatacaaa 480 attacaatta tggtgctcag tttgtgatca ggtgtttant aattacgtgc gtcactgtgg 540 atatatcccc ctctctcgcc agcaatacct gcaaacgaaa ggtaaaatat ttagttgcct 600 tgtttttttt tgtatttgta ggtataaagc cccgatttgt ttcaattttt cagttaccag 660 cctctaagcc ttttttttgg cttgaggcga atagcttcct ttggttcaat tctcgtatag 720 ttctcctttt ttcttgttgt gtaagatagc atctctgtgc cacgtagctc caaacgtggg 780 caacactggt gctatcttaa cnagtgtgct ctaaccactt cctttacaga agcaattact 840 agtgtttccg cgctcgattt tagcgcattt cctggaaaca ctgataattg tttttgcgtg 900 agaagtgtaa attacttcgt tttttcgctc ttatactgct ctttagagcg aatttgctct 960 gagttacaat aatagtgtta tctttaagtc aagttaacac gacaagtaag tgacttaata 1020 cgtgagtttc cttacaccaa aacacacttt acacaaaaaa catacaatta aaaccactca 1080 cctttaaaca acacgatgaa tgacggcata ccggaaatcg aagcgaaatt ggagagatcg 1140 attgtatttc aacttacggg tacggtcgaa catatgagtg ttagctcatt tatggaacta 1200 tttgcggatg gagcgccgat ggcggatatg tcgcagatcg tagaaggagt aatcatggac 1260 aatttcaaag attgcacctt ccttgttacc ctaaaggaac aaagcggaga agtagtaatt 1320 ccggaaaaac gggagataat tcactacttc aacacaaacg acatcaattt cactaccgat 1380 aagggatcga caatgtccct aaaagctgag cttccccaag gcgaatctga agtcgtctct 1440 ctacacccgg tcactgcaga cacctgcaag gagaaattag aagcaatgat aaataaccaa 1500 aactggggca aaatcaaaaa cataaattac ggaacacacc gtaatttcaa caaaataaaa 1560 aatggatggg taaacataac ccttactgaa acaaacatta aaaacattcc tcccttgata 1620 aaaatcggag gaagaaccat caccgtgacc aggcctggag aggagcacat ggctctctgc 1680 aggtactgta agcaaagagg acacacgcag aacaaatgtc ccaaaaaggg gttctgcgtg 1740 gaatgcaaag cacatggtca tacctccaga aactgcagaa cctcatacca ggaaccgcgc 1800 ccaagaacag tgttccactc ttcctgtaac acagcgcagt tagtgaggac tccaaatagg 1860 aatgccaatc aaagccagtg gcaaactgct aaaaagttgc ctgaaaaaca aatgcataat 1920 acgagaaatg cacaaataaa acttaccaac agattcggca ttcttcagga atcgcagcac 1980 tcggagatcg aagatgacct ggaatgttta aggaagttag ttgggtcaat ggatgaaagt 2040 atcttcatgc ctgaagactt tccagatatt accacggcca acagcacacc aaaaaatgac 2100 aacacaaact taccaaaacg aagaagacga cgcaccaaaa agaggggatc aacatcaaac 2160 ccctcaagcc aggaaaaaaa gattgtttac tacaaccaga ttcaatcaac cagcaaccaa 2220 atcaaacggg aagtcaaaca aacggaaaca gtaaacttaa gcgacagctc cagtgaaaac 2280 gaggccagca caataaatga catacctgaa atcaccctcg gaccacgatt accatctgct 2340 gttgaatcag tttatagaac gcccgaaaat actgcgccct tcccaccaat cccgaaccgc 2400 accctaacgg acataactaa attctacacc gccgaaggtt tacaatctcc caagagacca 2460 agatctggaa cataaagaga actcgcgtaa aacgcaacaa aaatttacaa acaacaataa 2520 catncttaaa atactaatga ttccatatac tatataatct aatggagcct gaaaaagtat 2580 taggcaacaa tttaaaaatt ggatctctga acacaaatgg aatactaaac aaaattaaaa 2640 agataatata ctatatggaa tcaaataaaa tcgacatatt gctaatacag gaaactcacg 2700 tatttacgaa agatatgatg tccaaattta aagcgcagtc caacatcgaa atcttcgtta 2760 atgcgcccga acatccaatt agatcattcc gccaaggaac tgcaatcctt gtcaaaaagc 2820 acctcctgcc aatgtacaaa attcaccata atataatatt tgaaaaccgg gtacaaaaat 2880 taaatctcca atataaacac aatgatatca acatttataa tttatacctg aaatccggac 2940 aaacgcanaa aaatcttatt gccagagaac aaatgatata tgaccttaaa gataaactag 3000 gggacgtaaa tgaaacaatc gacctgttga ttggtgattt taatatggtt tctaatgaaa 3060 tcgacgtaaa agcaaattat gataaaagaa aaaaacgaga tagaatcgcc ttacagcggc 3120 tacaaaatgg aaataacttt caggacgcat ttcgagtaat acataaacaa actatagaat 3180 ttactagaat aacaaaaacg agtgctacaa gaatagacag aatatatgtt aatagactag 3240 cgaaaaacaa aaccttcgcg ctaaatcaca tacgtaatta cttttcggac cataacaatt 3300 gcccggtaat aactctaaaa ataaatagta atcgtaaatg gggtctatcc ttttacaaaa 3360 taaataattc aatattacaa cacaacgaca taattgaaaa cttgaataca atgtggatta 3420 attggcaaaa acaaaaaaca aaatacatga actctgctac atggtgggaa tacggtaaaa 3480 aactgattgc aaacgaagta cgatactttt cccaaaacat aaatcatgcg gaacggaagc 3540 gatatttaac cagagtgctg gaaattaaag aactagaaaa acaataccag tcccaaaata 3600 tagtaacaaa aataacacga ttgaaagaaa atataaataa gtacgaacga aaaattaatg 3660 aaggtgcgat aattagatca aaaataaata taattgaaga tgaagaaaaa ccaactaaag 3720 agttttatag atacgaagaa gcaaaaggta atagagatac tatttacaat atatataatc 3780 aaatgggaga gctaacgaaa aatcaaaacc agactcttaa cgccacgcat gccttttacc 3840 aagatctatg gacaagtgct gaaataaata tagactatat tgatgaatat ctgtattttt 3900 tggatccaat agaatatgac caattggatt taaaaaatat gacgcaacct atcaaccaca 3960 aagaaattca cgagtgcatt ttagagataa atgataatag cacgccaggc tgtgacggtc 4020 taacatcaaa aatatataaa caattatggg aagttataaa atatgatatg gaagaattat 4080 acaataacat ctatttaaag ggaataatgc cagaaacaat gcgaaccgcg atagttaaac 4140 taatatataa gaaaggggat aaaaaagaca ttaaaaattg gagacccatt tcactactta 4200 acacagacta taaaatactt agcaagatta tagcaaaacg tttaggtata ataattaata 4260 aaataattag tccaaatcaa aaatgcgcca tacctggaag atccattaat aatagtttag 4320 aaaatataaa cgcatgcata gaggcggcta aatattttaa caaaaattta acaattcttg 4380 caatagactt tgaaaaagca tttgatcgtg taaattatac ctacttattt aaaatactaa 4440 caaaattaaa catacccgaa tatataatta aatggataaa gataatatat aataaaatac 4500 aaagtaaaat agaaataaac ggagccttca cggataacat aaatataaca aggggaatac 4560 gacaaggttg cccttgtagc atgatccttt tcctaattgg cgtagaagtt ctaactcgaa 4620 agattgaggc taacaaaaat ataaaaggat ttaaactaaa ccatatagaa ttaaaaacag 4680 aacagtacgc agacgattta tctatattaa tatcagacaa tatgtcatta aaggaaacca 4740 ttaacgaaat aaaattattt gaaaaagcat caggccaaaa aatgaatgca agtaaaaccc 4800 aaataataac taatgattct ttaataaata atgtaataaa cgaacacctc ccaaatgaat 4860 gcataaagga aaaaataaaa atactaggcg tttattttag tttaaacaga gaatgcataa 4920 cagaaaacac cgaaaaagct cgccgcgtaa taaaagcctt gtattggaaa aatcttaaac 4980 gaaaactaac attaaaaggt agaattatta taataaacgc gctctttatg ccgcaattat 5040 taacgatcgg aagacatatg atactgccaa aacaatttat taacgaaatt aataactaca 5100 tgtataaatt tatttggttc ccacaaaaaa ttgacagaat agcacgaaaa aaactaatag 5160 cgctacctat tgatggcgga ttaaatgccc cagatataat actaaaatta aaagctgtaa 5220 gagcaactcg cttatatgag attagaaaac tagaaaaatt gcaaacgatc tcacaagaat 5280 ggacacgttt caatttagca tcaaccatta aattaataaa tgaacagcta tatacgaact 5340 cggcattaaa cgcaacagaa ccaaacaaat tttataagga aattcgacaa acgatatata 5400 cgctacgacg gaaagacttc ccatgggaat caaacaaatt aaaacccatt tacgtagaac 5460 taataaaaga taaggcgaaa gcgactataa tacgtgaaaa taatgagata ataaaatggt 5520 cacaaattac acttaatgat aaactaacta aacaacactt caataacgta gaacgagata 5580 gaaattacaa aatagcgcat aacgcatatc acttcggtga ttggtacaga gataaaattg 5640 gaacgcaata tcaaaacggt aaattattaa ttagaaattg caaattttgt ggaaatagat 5700 ctgacaatat aaagcacatt cttacgaaat gtcaattaac cgaaataata attaatgaaa 5760 tagaagtact aacaaacaat gcttgcaaac agaaaacgca aataacaaaa tcaataatat 5820 tatataacca aacgaaaaat aatgcaacgc ctaatttact tgttaccaaa gcaatcaata 5880 tttttaaggc tgaaataata aggaaaaaac atcaactcga ttttggaaat aaatacatag 5940 agagcaaaga tgaatttacc cgacgaatac tttggataat aaatacgaaa atcaaaaata 6000 ttttattacg ggaaagtaca ttgagaggca aacaagaaac ttatgaatta tacgatctga 6060 acgagactta tgtaatttga accaattgga ttcctcctta aaaataaaaa tgtcacttat 6120 gtagactact ttatatcggg caaatatgtt ttgaaactga acaatataat aagaaacaag 6180 gcaactaact attttacctt tatattataa aaataattgg aaaatatgga aaaacaaaat 6240 aactcacaaa aagtaaagct ttgcaggaat tgcttggtga agaaaccatc gatccaaaca 6300 aaagatcttg ctgcacacca acctttcatc caacttgacc tgtaatataa aaagtagtaa 6360 ttaataatcg ttgtttatga tcgtattgaa atttgtgttt tttatgcaaa tattattatt 6420 atgtacggcc ggaaatccgg tgaacgtgtg tttagtcgcc caataaaaaa aaaaaaaa 6478 // ID PIGGYBN1_SM repbase; DNA; INV; 586 BP. XX AC . XX DT 28-JUN-2007 (Rel. 12.1, Created) DT 20-SEP-2007 (Rel. 12.1, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW PIGGYB_SM; PIGGYBN1_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-586 RA Jurka J.; RT "PIGGYBN1_SM: PiggyBac-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 7(10), 1095-1095 (2007). XX DR [1] (Consensus) XX CC The youngest copies are ~95% identical to consensus. XX SQ Sequence 586 BP; 199 A; 76 C; 101 G; 210 T; 0 other; ccctagaaag atagtctgcg taaaatttac gcatgcgttt gcgaaatatt gctctctctt 60 tctaaatagc gcgaatccat agctgtgcgt ttaggacatc tcattcgtcg ctgggagctg 120 ccgtgtggcg tacgtgtcga taaggtaagt gtcaccgatt ttgaaatatg aaaaaaatgt 180 aaaaaaatta tatgtcgtga gcataacgtt gatatgtgcc aaagctgttt gtaactgatt 240 tttatataac tatatgttcc taatatatat aagatatgct aattgttgct tttgtactaa 300 aaaggctgat tattgtagta aaaaataagt tttcgtatga aagagtaagt gttaaagaat 360 ttcgttgatt tatagaagaa gtttaaagtt tttgtttttt taataaataa aggagtataa 420 gtaaattatc tttaatatta ttattcttat atgtgtaaat aaataatcac ataaaattta 480 ttgaaataaa ttaataaacc tcgacctgca gctaattaaa acacatgcgt caattttacg 540 catgagtatc ttttccgtac tacacaatat gattatcttt ctaggg 586 // ID Copia-113_AA-LTR repbase; DNA; INV; 202 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-113_AA_; KW Ty1_copia_Ele104; Copia-113_AA-I; Copia-113_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-202 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 202 BP; 61 A; 42 C; 32 G; 67 T; 0 other; tgttgcaatc gaagcaatcc atgattattt gaatttgtaa ccaacgaata ccttcttatg 60 tacccactct cttatgtact atagcgtagc cacctaggca ataccatagc aacacttagc 120 gacgtaggtt gttgattagt aataaatttt cattattagt tagacgttca accagaacag 180 tcgagtttta ctttactctg ca 202 // ID Gypsy-136_AA-LTR repbase; DNA; INV; 1907 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-136_AA_; KW Gypsy-136_AA-I; Gypsy-136_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1907 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1006-1006 (2011). XX DR [2] (Consensus) XX SQ Sequence 1907 BP; 499 A; 407 C; 485 G; 514 T; 2 other; tgtagcagtc aggaattttg taaatagttt tgtatatawt tggtgtaaat aatgtgaata 60 gaataggttg gtaagaacat gcagtcttta agaatgaatt tgaattttga aaagtattcc 120 cttagcgtag gcaaacgagc attaaacact tgcaaacgct tccctgggtc gaagccattt 180 gttttctcga taaggtggga aagggaaggc agcacgacta gcatacaagt tatccaatgt 240 aactgggtag tcacgacagt acacgaagac tttcgggaaa tgacgtactt ccgggtgcgt 300 tcgagataca ctttccctta tctgaggctt cacagactcg atcatttcgt ccgtgatcat 360 ccaaagagtt ggaagtgtcc gcgtttgcta gttagttttt cgtgtgccga gagtatgtgt 420 tagcaaagtg atgttaagta tagttaagta gtttaatgta gaatgtttcg ttctagtagt 480 aggccaaaaa gtgtgcgtga agaattgtgt gtgcgaaggt agaagctaga acctataatc 540 aaggtgagaa tttgtgctct agacgtgctc tttgtgccag tgcttatgtt ccgtgtcctg 600 ccagattggc ttgtatttcc cgcccacagt tcttcgcacc caagcgcgac atttctttcc 660 ccaacggacg aggaattcct ggctaccgcc aaccagccag aagcggacgg attcgtcacc 720 atatcagtcc gaggaggccc aggggacctt cacgtggacg gagtccactg gctggatcgt 780 caatcgaccg gtaaggcagt actaacagcc gaaagaccgc catcgcttgg gtaagtcaac 840 caccatttcc tttttgatca cgccgtgacg tgtaaaatat ccacgacagc catcaaccaa 900 gccggtgaga gcatcgacct gtcgaacagg tcatcgactg ccggttacgt tcaacagcag 960 caatccagtt agccaaatca ttcttttttc ggaacgaagt gcggcgtccg gtgagggcgc 1020 cgcggagatt tttttttgtg ccagtcaaag actgaagatc taatgtgcca ttagcgccaa 1080 gaaccatcaa gccgtttgaa gatcgaaacc ttttttcgat agcgtcttgg gatccgcctt 1140 cggacccgtg aaaatgcgtg cgtgagtcct tttcgtgacc gccgcaaaac aagcttgtma 1200 ccaggagcgc agacggatac gcagctgagc ttccaacacg tgacgtcacg tagaacgtag 1260 agagcactga gcacagtagg ttagataggg aggagttaag agaaagcaca cacaagctag 1320 gagggaggaa gagagaacga aacacataca gataatgagt tcgaactgaa tccatagcaa 1380 taaaacatgt atgttaccct aagaatttaa acgaatatat ctgtccccaa tagtaaccgc 1440 caataaatgt tctacgttat gagtttcttc cctagccatt tggttgttta attgtattta 1500 acgtcacagc gcgatttgta gtttgatgtc ccgtccccca catgatcgag tttgttttgg 1560 tgggttctga ttctcttcca gttcggtgag atgaccatag gtgagttgag agagtttgcc 1620 gatgatgaga acttctctca agtgagatgt taggggtacg ggtggtctct caccgattct 1680 ctgagtgtcg tcgtgcgtat acggttatgt tgcgataata gggttttccc gtttggccgt 1740 tttagttcgt agccagctgg aacgatccat aaatcctttc gaggaaatta aatggactcg 1800 ccctcaggga agtctcatcc gggcaattag aaattggacg cgtggtctcc ctagtggagt 1860 ggcgcttaag ctgcgtgttt ttgaagcgga tcaccctgtt ggctaca 1907 // ID Gypsy-235_AA-I repbase; DNA; INV; 4486 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-235_AA_; KW Gypsy-235_AA-LTR; Gypsy-235_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4486 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1071-1071 (2011). XX DR [1] (Consensus) XX CC Positions [1846-2352] - Reverse transcriptase CC Positions [3546-3911] - Integrase core CC 'GTTG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 187..3588 FT /product="Gypsy-235_AA-I_1p" FT /translation="MPKIKRRRNATDEREEEEEEEEAVIKQEHQGQQNVLP FT VCSSKMSLNPIPEATEVRMSRPLHPEEVRALISEYDGSYDAAIWIRKVEHY FT QKLYGWTDNATLLYATTRLIGPAKLWYSSVEERIFGFQGFKVMLLTTFPSY FT HDEADIHRELMTVMKAPQETYDNYVFRVQNVASKGHVSETSVVKYIISGLS FT RDKLYHQIAANEYTTVFGLLKRIKWCESNFLMKKADAVGSRQPVRQSGSVV FT KGGASGPPLGASGFICFNCNETGHKSVNCPRPQRRPRCSSCLKVGHSADQC FT FKVANDGGTGTTGTSKTTVALITAGEGENTTQDAVPNEVIDTDEDGMTTCD FT AIIGRRLKQLKLLVDSGSAVSLIKRSELSNLIRFNSKIGNRTVDITGINQS FT KVKIEGCLKTKVIVKGKLVLDVSFWVVGNDTMGVSAILGRDFLKQNNITVI FT RFEKPEISAFSTKFDLNLINDAFGDSLLVATDCGNLDLMVGDNDETRQYAD FT QINADFETFYRQKQPDAEKTVKYMATIRLKEDKYFNVSPQRMGKFEKDAVD FT QIVEDWLHDGTIRESESPYSSRVVLVKKNNGRYRLCINFKTLNKLVERDRF FT PLPIIEDQIAKLEGMKYFSTMDMKNGFFHVELSEDSKKYTSFVTENGQYEF FT NKLPFGYTNAPSVFCRYVAKVLNEFVKSGELVVFMDDFMLFTKTIDEHLDL FT LRRVFSALKDNGIELNLEKCRFLVTQVEFVGYDIRSNQISPCERHVKAVRD FT LPIPQNIKCLQRFMGLLSYFRKFIKGFSSIASPLYDLLKKDSVYKFGPEHL FT EAFETLKSLLVSRPVLSIYSPVAETQLHTDASAQGFGGILMQRQSYDGKFH FT PVMFFSRKTNPAESRLHSFELETLAVVYSLQRFRHYLIGIPFEIVTDCKAL FT KHTLEKRDTNSKIARWADFIAEFDKQIVHRSGDRMQHVDALSRMYVHAVEV FT DEYGSRSLFEDSLYVAQIKDERLCRLKSSVEEGKISGYEIRDNLLYKAENG FT RLLLYIPEEMEESVIYRFHDSLGHFGKDKVLHLIKRSFWFPKMAEKVLAHT FT KKCISCIMFNPKSRKADGELQSIDKGNRPFWMVHADHLGPLETTKGKNKYV FT LAVVDGFSKFIKLYPTKTYRRTRMKL" XX SQ Sequence 4486 BP; 1356 A; 833 C; 1095 G; 1201 T; 1 other; atttttcaga agtgggatgg cggagaatat ttccgcaaaa acgtaattcg agcgtaaata 60 agtgtaccgc gacggtagtg caaatgttat tacaaagcta gctacggaca taacgaaatt 120 tcaacgtaaa acgaaatcat cggaacgttg ttttgccggt gaaaacgaaa cgccattttg 180 gaacgaatgc cgaaaatcaa acgtcggcga aacgcgacgg acgaacgaga agaagaagaa 240 gaagaggagg aagcggtaat taagcaggag caccaaggac aacagaacgt cttgccggtg 300 tgcagcagta aaatgtcgct gaacccaata cctgaagcca cggaagtccg aatgtcacga 360 ccactgcatc cggaagaagt gagagcttta atcagtgagt atgatggatc gtacgatgca 420 gccatttgga tacggaaagt ggaacattat cagaaactgt acggatggac ggataatgcc 480 actcttctat acgcaacgac gaggttgatt ggtccagcga agttgtggta cagctcagta 540 gaagagagaa tctttgggtt ccaaggattc aaagtgatgt tattgacaac ctttccgagt 600 taccacgacg aagcggatat acatcgggag ctaatgacgg twatgaaagc tccacaggaa 660 acgtacgaca actacgtatt tcgagttcaa aacgtcgcca gcaaaggaca tgtttcggaa 720 acctcggtgg tcaaatatat catcagcggc ctttcacgtg acaagctgta tcatcaaatc 780 gctgcaaacg agtacaccac ggttttcggt ttgttgaaaa ggatcaagtg gtgcgaatcc 840 aattttttga tgaaaaaagc agacgcagta ggatcccgtc aaccagtacg tcaatccgga 900 tcggttgtta aaggcggagc aagtggacct cccttaggag cgtccggctt catatgtttc 960 aactgcaacg aaactgggca caagtctgtc aattgcccaa gaccccaacg acgtccgcgc 1020 tgcagttcgt gtttgaaggt gggacacagc gcagaccaat gcttcaaggt ggcgaacgac 1080 ggtggaacgg gtacgacagg tacgtccaag acaacggttg ctttgattac agcgggtgaa 1140 ggagagaata ccacgcagga tgcagtaccc aatgaggtga tcgacactga cgaggatggg 1200 atgacaacat gcgatgcaat tatcggcaga cgattgaaac agctcaagtt gctggtggac 1260 tctggcagcg cggtgagtct aataaaacga tctgaacttt ccaatttgat taggtttaat 1320 agtaagatag ggaaccgtac cgtagatata actggtatca atcagtctaa agtgaaaatt 1380 gaaggttgct tgaaaacaaa agtgatcgtt aagggtaaac tggtactaga tgtctcgttt 1440 tgggttgtgg gtaacgatac tatgggagtg agcgctattt tgggtcgtga tttcctcaaa 1500 cagaacaaca taacggtgat tcgtttcgag aaacctgaaa tatctgcatt tagtactaag 1560 ttcgatctta acctgataaa cgatgctttc ggtgatagtt tgctcgtagc aacagactgt 1620 ggtaatcttg atttgatggt gggtgataac gatgaaacaa ggcaatacgc agatcaaata 1680 aatgctgatt ttgaaacgtt ttatcgccaa aagcagccgg acgccgagaa gactgttaag 1740 tatatggcaa cgatacgttt gaaagaggat aagtatttca atgtttcccc tcaacgaatg 1800 ggaaaatttg agaaagatgc tgttgatcaa atagtcgagg attggttgca tgatggtacg 1860 atacgcgaga gtgaatcgcc gtactcgagt agggtggtcc ttgttaagaa aaataatgga 1920 cggtatcgat tgtgcataaa tttcaagaca ctgaacaaat tggttgaacg agatcgattt 1980 ccactcccga taattgaaga ccaaattgcc aaactagaag gtatgaagta tttctcaaca 2040 atggacatga agaacggatt ttttcacgtt gaattatctg aagattctaa gaaatacaca 2100 tcatttgtta cagaaaatgg gcaatatgag tttaacaaac tcccatttgg atatactaat 2160 gcgccttccg ttttttgtcg ctatgtagca aaagttttga atgagttcgt caagtccggt 2220 gagttagttg tgttcatgga tgatttcatg ctattcacga aaaccataga tgagcatctc 2280 gatctcttac gtcgtgtttt ctctgccttg aaagataacg gaatcgaatt aaatctggag 2340 aaatgtcgat ttttagttac acaagttgaa tttgtggggt atgatatccg atcaaatcag 2400 atcagtccat gtgaacgcca tgtaaaagct gtccgagatt taccgattcc gcaaaatatc 2460 aaatgtctgc aaagattcat gggactgtta agctattttc ggaagtttat taaggggttt 2520 tccagtattg cgagtcctct gtacgatctc cttaaaaagg attcggttta taagttcggt 2580 cctgaacatc tagaagcgtt tgaaactttg aaatccttgc tagtgtcgcg tcccgtacta 2640 agcatttatt cgccagtagc tgaaacgcag ttgcacactg acgcatcggc gcaaggattt 2700 ggtggcattc ttatgcaacg gcaatcgtat gacggcaagt ttcatcctgt gatgttcttc 2760 agtcgcaaaa caaatccggc cgagagtcga ctgcactcgt ttgagctgga aaccttggct 2820 gtcgtgtatt cgcttcaacg tttcagacat tatctcatag gaataccgtt cgaaattgta 2880 acggattgta aagcactcaa gcatacgcta gagaaacggg acaccaacag caaaattgct 2940 agatgggcgg attttatcgc tgagttcgat aaacagattg ttcaccgttc gggagatcga 3000 atgcagcatg ttgatgcgct ctcaaggatg tatgttcatg cggtggaggt agatgaatat 3060 ggaagtcgaa gtttgtttga ggattcattg tacgttgcac agattaaaga tgaaaggctt 3120 tgtcgtctta aatcgtctgt cgaggaagga aaaatttctg gttatgagat tcgtgataac 3180 ttgttgtaca aagcagaaaa cggtcgcctt cttctgtata tccctgaaga aatggaggag 3240 tcggttatat ataggttcca tgattctctc ggtcactttg gtaaagataa ggttctacat 3300 ttgataaaac gttctttctg gtttcctaaa atggccgaaa aggttcttgc acacacgaaa 3360 aagtgtattt cctgtatcat gttcaatccg aagtctagga aagcagatgg agaacttcag 3420 agtatcgata aaggtaatcg ccccttttgg atggttcatg cagatcattt agggccgtta 3480 gaaactacga aggggaaaaa caaatacgtt ttggcggtgg tggacggttt ttccaaattt 3540 attaagcttt atcccaccaa gacttatcga cgaacacgca tgaagttatg aagcatttga 3600 agtcttattt catcaactac agtactccta ggatacttgt cacagatcgt ggagcatgct 3660 ttacttctca ggcgtttaaa agttttgttg agacgcatgg agttgttcat aatctcgcag 3720 caacggcttg tccacaagcc aatgggcaag tggaaagata taatcgcaca ttagtaccct 3780 tattggcaaa actagttgag tcctccggtt cttcttggga tagtgccttg attgacgccg 3840 aatatctttt gaataataca accaaccgtg gatctggagc aataccgtcg aaactgttgt 3900 ttggtgttct gcagctacgc aaaatatcca atgatgccgt ccagtatttt caggatttgc 3960 tagaatctcc agattgtgtt gacttgaagt cttctcggaa tgaagcggct attcatatgc 4020 gtaaaatcca agattataac aagcgaaacc atgatagtaa atgcttgaaa accgtttatt 4080 tggaggggga cctagtcgtg atccgtagcg ttccggttgt tggtggggat aaaaagctga 4140 ggcctcgatt caaaggcccc tatcaagtca agaaagtgtt ggatagaaac cgttatgttg 4200 ttaccgatat tgagggctat caggtatcag ggaaacgttt cgagggaata ttcgacccac 4260 aaaatatgcg gttgtacaag cgtgatccca cgagaccttc agataacgga tctgaaagtg 4320 aattgagtga taatgagcag taatagaatt ttcgatttat tacaggaagg taacagaacc 4380 gattaatgaa ctattgtcaa ttgccgttca gatcgcaacg atacataaaa catgtaatag 4440 atataggaaa ttagggctgg cgagaactct tgtaggatgg ccgagc 4486 // ID CR1-95_AAe repbase; DNA; INV; 4542 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-95_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4542 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1183-1183 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 111..1361 FT /product="CR1-95_AAe_1p" FT /translation="MNCEICALDASVDSALWTCAGCPRKFHAACIGVTVHR FT SSMRRKDKKVVDFSSYVLPCCDSCQELLQAKLDFNRLTEQQKLLTEQLHAN FT TEVAHRSNLQQNKQSMVNEAFEGLEILMTTIKNELATINKTSSLAGCVVAI FT KNHITTILDTVTKMTNDNISKSFESVSSNLATELKNINDGICQINQLHLDL FT AATTAMSSNPYLGIDILDELKTWSAKIIAPRSTEPSTSSHGSCTSLNREND FT ADNSGWRCLGTRKVWRADWTDYDARKIVRLNQQKQAAKAKRRRKQKINRNN FT TADNGMNHHPNDSRNRNSAGRLDGDFIAAGGHSFLPPDRVLLAAAKERFSR FT PPQCSSSGTPAAQRPIQFQRGEILNPSRTCVENQQPAHGRQSHRISTGCSS FT NSAASCEACHAQHSCFRRNRRTP" FT CDS 1313..4402 FT /product="CR1-95_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="SVPCSTFVFSAQSTHSLEVGLDLTHSIEVGQDRSLLY FT LTHSSHSEEVRDSQESGIPIDSTQNTSSNPTQIGTEIVVYCQNFNRMKSSL FT KIREIHNKILSCSFPIILANETSWDEGVRSEEVFGCNYNVYRDDRNFETSE FT KKSGGGVLVAVSIQFNSEIISTEKCEEFEHVWVKILVEGETHVFASVYFPP FT NNARKNSYDKFFQIADEIMSKLSPEVKVHIFGDFNQRSIDFIPDADNECIL FT LPIVGDNETLQFLFDKIASLGLNQVNHIKNKQNNYLDFLLTNMYEDFCVSE FT SLTPLWKNEAFHTAIELSIFVHKNNRPDDCEYEEVFQYHSANYENIRERLN FT RIDWQTILRSERNVEASVDVFYKLLFEIILDEVPLKRIRRRNHSKTPVWYN FT EQIKNLKNRKQKAHKKYKRNNNNENLTSYLSICDLLNQAIDNAYEDYNLKT FT ENEIKSCPKNFFNYVNTKLKSDNFPSIMHLDKNVGDSSEKICNLFAFFFQE FT IYTTFDDEDRDCNYFDFLPEYTRDIEISHINVKDVMEGLNKLDVSKGSGPD FT GIPPLFMKSLAVELTTPLFWLFNMSLESGTFPKSWKSSYLVPIFKNGKKSD FT IRNYRGIAIISCIPKLFEAIINEKLFNQIKSRITNAQHGFFKGRSTNTNLI FT EFVNYTLTAMDNGHHVEALYTDFSKAFDRVDIPMLLFKLQKLGIESNLLKW FT IESYLTNRQQIVRFKGSKSNPIQVTSGVPQGSHLGPLLFILFVNDISLILK FT HIKVLIYADDMKLYLEIINKEDINIFQNEIKVFYTWCNKSLLQLNVKKCNL FT IAFSRKRDTPNVTITLGNQPVVKCDRVRDLGVILDSKLTFIDHYNTIINRA FT NNMLGFIKRFSYSFHDPYTIKTLYTAYVRSIIEYCSVVWSPFSITHEERIE FT SVQKQFLLFALRKLGWTRFPLPSYKARCMLINIQTLKVRREFAMVSFVNDI FT VSHRIDSAELLAKLNFYAPFRQLRNRNIFLTCHHRTNYAKYGPLNQMMATY FT NKHCASIDFTTTKTKLKRYFNSVR" XX SQ Sequence 4542 BP; 1570 A; 882 C; 836 G; 1254 T; 0 other; cagtcagtca gtcgtcaatc agtacggacg tgttttttaa ttgctccgac cccgcgcgtg 60 gttttaattt ttttttaatt cgttaattaa acatattttt tgaatacgtc atgaactgtg 120 aaatttgtgc attggatgcc tccgtcgatt cggctttgtg gacatgtgcg gggtgtccac 180 gtaaatttca tgccgcgtgt atcggcgtta ctgtgcaccg gagttcaatg cggaggaagg 240 acaaaaaagt ggtggacttc agctcatatg ttctgccttg ttgcgatagc tgccaagaac 300 ttctccaggc gaaattggat tttaatcgcc ttactgagca gcaaaagctt cttaccgagc 360 agctccatgc aaacaccgaa gtggctcatc ggtctaatct gcagcagaac aaacaaagca 420 tggtaaatga agctttcgaa gggctcgaga ttttgatgac cacaataaag aatgagcttg 480 ctaccattaa caaaaccagt agtctagctg gttgcgtagt agccataaaa aaccacataa 540 cgacaatact tgatactgtc actaaaatga cgaacgacaa catatcgaag tctttcgaat 600 ctgtatcatc aaacttggcc actgagctca aaaacattaa tgatggcata tgccaaatca 660 atcagctgca cctggatttg gcagccacaa ccgcaatgtc ttcgaaccca tatctaggca 720 ttgacattct cgacgaattg aaaacttggt cagcgaaaat catagccccg aggagcactg 780 aaccttctac ttcgtcacac ggttcatgca ccagtttgaa ccgcgaaaat gatgccgata 840 actcaggttg gcgctgcctt ggaacaagaa aggtgtggag ggctgactgg acggactacg 900 atgcgcgcaa aatagtccgc cttaatcagc aaaaacaggc ggcaaaggct aagcggagaa 960 ggaagcaaaa aatcaaccgc aataacacag ctgacaacgg aatgaatcat catccaaacg 1020 acagtcgaaa tcggaatagt gcgggtcgac tggatggaga cttcatagcg gcaggcggtc 1080 acagctttct accaccggac cgagtacttt tggcagcagc aaaggagaga ttttccagac 1140 caccacagtg ctcatcatca ggaacaccgg ctgctcaacg gcccatacaa tttcaacgag 1200 gagaaatttt gaatcccagc cggacatgcg tcgaaaatca gcaacctgcg catggcaggc 1260 agtcacatag gatttctacc ggatgctcat caaattcagc tgcatcatgt gaagcgtgcc 1320 atgctcaaca ttcgtgtttt cggcgcaatc gacgcactcc ttagaggtag gactcgactt 1380 aacgcactct atagaggtag gacaagatag aagtttactg tacttgacac attcctcaca 1440 cagtgaagag gtaagagaca gccaagaatc aggtattcca attgactcaa cccagaacac 1500 ttcttcaaat ccaacacaaa ttggaacaga aatcgtagta tattgccaaa atttcaaccg 1560 aatgaaaagt tcgttgaaga taagagaaat tcataacaaa attttgagtt gttccttccc 1620 aataatttta gcaaatgaaa caagttggga tgaaggagtt aggagtgaag aagtgtttgg 1680 gtgtaactat aatgtttata gagatgaccg gaattttgaa acgtctgaaa agaagtcagg 1740 gggtggagtt ctagttgctg tttcaattca atttaattcg gaaattatca gcactgaaaa 1800 atgtgaagaa ttcgaacatg tttgggtgaa aatacttgtg gaaggcgaaa cacatgtgtt 1860 tgcctcggta tactttcccc caaataacgc tcgtaaaaac tcttatgata aatttttcca 1920 aatcgctgac gaaatcatgt ctaagctctc tccagaagta aaagttcaca tattcggaga 1980 ttttaatcaa agaagcattg actttattcc agacgctgac aatgagtgca tcttgcttcc 2040 aatcgttggt gataacgaaa cattgcaatt tttatttgac aaaatcgcta gcttaggcct 2100 caatcaagtt aaccacatta aaaataagca aaataactat ttagattttt tattaacgaa 2160 tatgtatgag gacttctgtg tgtctgaatc actaacccca ttatggaaaa atgaagcatt 2220 ccatacggca attgagctct caatttttgt acacaagaat aacagacctg atgactgcga 2280 atatgaagaa gtttttcaat atcattcggc aaattatgaa aatattaggg aaaggttaaa 2340 caggatagat tggcaaacga ttttaagaag cgaaagaaat gtcgaagcct cagtcgacgt 2400 attttacaaa ttgttatttg aaattattct tgacgaggtt ccattaaaaa gaataaggcg 2460 aagaaaccat tccaagactc cagtgtggta caatgaacaa ataaaaaatt tgaaaaaccg 2520 caagcagaaa gcacataaaa aatacaaaag aaataacaat aatgaaaact taacaagcta 2580 tctatcaatt tgtgatctac ttaatcaagc catagataac gcatatgagg attacaactt 2640 aaaaactgaa aatgaaataa aatcttgccc aaaaaatttc tttaattacg tcaatacaaa 2700 actgaagtca gataattttc cctcgataat gcatcttgac aaaaacgttg gtgatagctc 2760 cgagaaaatt tgcaatcttt ttgcattttt ctttcaagaa atctatacta cttttgatga 2820 tgaagaccgc gattgcaatt actttgattt cttaccagaa tatactagag acattgaaat 2880 cagtcatatc aacgtaaagg acgtcatgga aggcttgaat aagttagatg tttcaaaagg 2940 gtctggacca gatggaattc caccattatt catgaagagc ttagctgtag aacttacaac 3000 tccattgttt tggcttttta acatgtcatt agagtctggt acttttccaa aatcatggaa 3060 aagctcatac ctggtaccca tctttaaaaa tggaaaaaaa tctgacataa ggaattatcg 3120 tggtattgcc attatttctt gtattcctaa actattcgaa gcaatcatta atgaaaaact 3180 attcaatcaa attaaaagca gaataacaaa tgctcaacat ggattcttta aaggacgttc 3240 aacaaataca aatctaatag aatttgtcaa ttacacactc actgcaatgg ataatggtca 3300 ccatgtagag gcactttata ctgacttcag taaagcattt gaccgcgttg atatacctat 3360 gctactattc aaattacaaa aactaggaat agagtctaat cttctgaaat ggattgagtc 3420 ctacttaacc aaccgtcaac aaatagttag attcaaagga agtaaatcga atcccattca 3480 agtcacctct ggagtccctc aaggttccca tttgggccct cttctcttca tactgtttgt 3540 caacgacatc tctttgatcc ttaaacacat aaaagttctg atttacgctg atgatatgaa 3600 actttattta gaaattatta ataaagagga cataaatata ttccaaaatg aaataaaagt 3660 cttctacacc tggtgtaata aatccctgct gcaattgaat gtaaaaaaat gtaacttaat 3720 agcttttagt agaaaacgag acacaccaaa cgttacaatc accctaggaa atcagccagt 3780 ggtaaaatgt gacagagtca gagacttagg tgtcatctta gattcaaaac taacattcat 3840 tgatcattat aatacaatca tcaacagagc aaataacatg cttggattca taaaacggtt 3900 tagttatagt ttccatgacc cgtacacaat taaaacgtta tatactgcat atgtaaggtc 3960 aataattgaa tattgtagtg tagtttggtc tcctttttca ataacacatg aagaacggat 4020 agaatcagta caaaaacaat tcctactgtt tgccctacgt aaattaggtt ggacaagatt 4080 tcctctacct tcttataaag cacgctgcat gcttatcaac atacaaactc taaaagtgcg 4140 tcgtgaattc gcaatggtgt ccttcgtaaa tgatattgtt tcacatcgta ttgactccgc 4200 agaactctta gcaaaactaa atttttatgc acccttccga caactgcgaa accggaatat 4260 attccttacg tgccatcatc gtactaacta tgccaaatat gggcctctta atcagatgat 4320 ggcaacatat aataaacatt gcgctagtat tgacttcaca acgacaaaaa ccaaattgaa 4380 gcgatatttc aattcagttc gatgattgta aactcgttta taaaacattg cagcataaaa 4440 tgttccttta taattcctgt tagaaataag aaaaacatgt aaaattagtg tgtaatgaaa 4500 cggtctacta ttgattgacg acaaataaat aaataaataa at 4542 // ID Crack-4_HM repbase; DNA; INV; 4374 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-4_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4374 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1935-1935 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1621..3981 FT /product="Crack-4_HM_1p" FT /translation="MLFKYNILPVINKPTRVTKSSATAIDNIFINNYIETS FT FETGIFKTDISDHFPIYVKINPFKAPSNGDPKVITLKKRNLSEVNKNNLSD FT KLKQETWNQVFNCDNTNKAFNIFLNTFLDIYNETCPYEFKITKKKTILNPW FT MDKTLIKCSRNKQKLYNKFLKNRTEVNEKSYKKYKRFYENLLKKAKKTYYS FT KEITKXKFDVKKTWDLINDIMGRKKINTAILPQRININKTDIYCPLQISNE FT LNKYFTNIGPNLAKKIITTPNSFQKYLKSLNGAMLQDREFSHEEYEIAFSF FT LKKNKSPGFDDVSSNIVISNKKYISKPLFHIIKLSIKEGIFPEALKLAKVC FT PIYKNNDQSEISNYRPVSVLSVFSKIFERVIYNRIYNHFNENNLFYTKQFG FT FRKKHSTEHAIIEIVDQITKGFENNSFTLGLFIDLSKAFDTVDHLILLEKI FT KHYGVINKTYDWLKSYLTERKQYVISKESGQLNILCGVPQGSILGPLLFLI FT YVNDFNNASAKLNSIMFADDTNLFLSNCDINQLYSDMNSELMNVDEWFKAN FT KLSLNVDKTKYTLFHKKSQDEKLPLKLPDLILNDALISKNDSIKFLGILID FT ENLSWLPHINYIQSKISKTIGLMYRICPYVTSESLKLVYFGLIHCFIYYAN FT ITWASTQQKHACRLXYIFYKKXYEHAKPLMEDMKMMNIYEINIYQHLIFMY FT KCNNSLIPKNFNDKFTXNVNKKYSLRINQWNTYKLPRINSKYSEYGISYRG FT PKLWNTFQKTKNLNTVKSLNSFKFIIKNEIFKLDK*" XX SQ Sequence 4374 BP; 1803 A; 596 C; 582 G; 1387 T; 6 other; gctgcatgct ttcttcacga acagacgtgt ttttgcagcg taagaattaa aagagaaaaa 60 aaaaatcngt attttcaagt ttttgaaagg gaatatatcg tatatattaa ataaaacaat 120 atatatcaaa ataaattaaa atgagcaaag ctgaagttgt ttcaatggta atgatgagaa 180 agcttctaaa cctacaaaaa gagacatcat cttgtcattc tttacagaaa cagttaacac 240 gataaattta aaatttgaat cagttcaaaa aaacattcaa gacgtaaaaa aggatcttga 300 cgatatcaaa gctggcctaa catttgttgg tgatgcctgc gacgataagg ttaaaagtgt 360 aaatgaaaaa attgataaag ttaaagaaga gttatccagc gtaaagataa tacaggggaa 420 aatttcgact gacaacaacg acctaaaatt aaaatctgtt gacatagaag atcgaaatcg 480 aagaaacaac ttgcgtattg atggtttatc tgagagtagc gatgaaaaag attgggaaaa 540 aactaaagaa aaagtaaata aactctttgc tgaaaatctg aacatcaaaa gagaagtcaa 600 aattgaacga gcacacagag ttggagtttt gaacaataat cgtgaacgaa caattgtttt 660 gaaactgcac gactacgagg acaaaaaagt gatcactgat aacgcatcga agcttaaagg 720 taccaatatt tttattaatg aagatttttg tagcactact cgaaaaattc gaaaagagtt 780 attcgctaag gcgaaaattc atagacaaaa cggtttttat gctaaagttg tttataataa 840 actaattgtc catgaattcc gacaaaataa atgtgcaagc acagaaaatg taatacctcg 900 ggataaataa tttataaata tatttatagg taaattctca tatacttact taaaaatgga 960 ttcgaataat atttttcaac aaggttttga aaatataatt tttgaccctt ttgaagacaa 1020 tttaaaaaat atcacaaata tttttcaaga aactcaaagg aatttaatga aaacaccgta 1080 ttttgatcct tatgaattta aggggaaaaa gaaacaaatt cattttctat attacaaata 1140 aatattagga gcataaacca aaactttgac aagtttaaag aattcttaaa tattataaat 1200 tacacgtttg acgtcatatc tctttcagag acctggcatg aaaatacact tgattctaat 1260 tttatttacg aattgcctaa ttataaatta attagtcaac cgcgagaaag taataaaaac 1320 ggaggaggag ttggtgttta tgttttaaac aaatattcat ttaaaataaa aaataaacta 1380 tgctcgtcaa atcttaatta tgaaagtcta tttattgaaa taataaatga caaagataaa 1440 aacattttag ttgggtgcgt atatcgtcca ccgagtggaa aaattaaatt gtttgaagaa 1500 ttcataaaaa atacagtctc aaaaataaat aaagaaaaaa aaaaattgta tattgccggc 1560 gacataaatt tgaacgctct gacatataaa aataatccaa aaacaaaatc tttttttgac 1620 atgcttttta aatataatat cttaccagtt ataaataaac caactagggt aactaaaagt 1680 tcggcaactg caatcgataa tatctttatt aacaattata tagaaacgtc atttgagaca 1740 ggtattttta aaactgatat tagtgatcac ttcccaatat acgttaaaat aaatcctttc 1800 aaagccccat ccaacgggga tcccaaagtt ataactctaa aaaaacgcaa cctatcagaa 1860 gttaataaga ataatctttc cgataaacta aagcaagaga cgtggaatca agtttttaat 1920 tgtgataata caaataaagc atttaatatt ttcttaaaca catttcttga catttataac 1980 gaaacgtgcc cgtatgaatt caaaataact aaaaaaaaaa caatattgaa tccttggatg 2040 gataaaactc ttataaaatg ttcaagaaat aaacaaaaac tttacaataa atttttaaaa 2100 aacagaactg aagttaatga aaaaagttac aaaaaatata aacgctttta cgaaaatctt 2160 ctcaagaaag ctaaaaaaac atattatagt aaagaaatta caaaaancaa attcgatgtg 2220 aaaaaaacat gggatttaat taacgatata atgggaagaa agaaaataaa tacggctata 2280 ctacctcaaa gaataaatat caataaaact gatatctatt gtccattaca aatctcgaat 2340 gagttaaata aatattttac aaacattggc cctaaccttg ctaaaaaaat cataactaca 2400 ccaaattctt ttcaaaaata tcttaaatct ttaaatggcg cgatgctgca agatcgcgaa 2460 ttttctcatg aagagtatga aattgctttt tcttttttaa aaaagaataa atcacctggt 2520 ttcgatgacg tttctagtaa tatagttatt tcaaacaaaa aatacatctc aaaacctttg 2580 ttccatatta taaaactttc aataaaagaa ggaatttttc ctgaagcatt aaagttggca 2640 aaagtatgtc caatttacaa aaataatgat caatctgaaa tctctaatta tagacctgta 2700 tcagtacttt ctgtattttc aaaaatattt gaacgtgtaa tttataatag aatctataac 2760 cattttaatg aaaataattt attttacaca aaacagtttg gttttcggaa aaaacattca 2820 acagaacacg caataattga aatagtagac caaattacta aaggatttga aaataatagc 2880 ttcactctag gattatttat tgacctatca aaagcatttg atacggttga tcacctaatt 2940 cttttggaaa aaataaaaca ctacggagtg ataaataaaa cctatgattg gttaaaaagc 3000 tacctcactg agagaaaaca atatgttatt agtaaggaat caggtcaact taatattttg 3060 tgtggagttc cgcaagggtc aattcttggt ccattgcttt tcttgattta tgtaaatgat 3120 tttaataatg catcagctaa gcttaactca ataatgtttg ctgacgacac aaatctcttc 3180 ctttcaaact gtgacattaa tcaactgtat tctgacatga actctgaact aatgaatgta 3240 gatgaatggt ttaaggcaaa taaactttca ctcaatgtag ataaaacaaa atatacttta 3300 tttcataaaa aatcacaaga tgaaaaactt cctttaaaat tacctgattt gattttaaat 3360 gatgcactaa ttagtaaaaa tgattcaata aaattcttgg gaattcttat tgatgaaaat 3420 ttatcatggc ttccccatat taattatatt caatcaaaaa ttagtaaaac aattggtctg 3480 atgtaccgta tttgtccata tgtaacatca gaaagtctga aattagtata ttttggccta 3540 atacattgtt tcatatacta tgccaatatc acatgggcta gtactcaaca aaaacatgca 3600 tgcagactta natatatatt ttacaaaaaa anatatgaac atgcaaaacc attaatggaa 3660 gacatgaaaa tgatgaatat atatgaaatc aacatttacc aacacctaat ttttatgtat 3720 aaatgtaaca acagcttgat tccaaaaaac tttaatgata aatttacaan aaatgtaaac 3780 aaaaaatatt ctcttagaat aaaccaatgg aatacttata aattaccaag aattaatagt 3840 aagtattctg aatatggaat atcataccgc ggaccaaaac tatggaacac ttttcaaaaa 3900 acaaaaaatt taaatactgt aaaatcatta aattccttta agtttataat aaaaaatgaa 3960 atttttaaat tagataaata gttttcttga tttttcgaaa tgaaaaacga gttccattat 4020 tcttagtttt tttacttgtt aattggctct tattttgata cttaacagag ttgccaatat 4080 attttaatga atattgataa aactatatga tgttatttac tttcttgatt taatggaaat 4140 atttatgcaa gctttaacct ttttattata ttggagatac tatttttttt aacttgtaaa 4200 gcccacctag gggctctatg aaaagattgt ggtgacttta gtcatccgta tcttctttga 4260 gcccctgcct gtttnttata aattcacttt tttttgtatg taaacgtttc tttgtaatta 4320 tatttcttat tatataaaca gcaaatatat attaaataaa aaataaaaaa aaaa 4374 // ID BEL-8_DWil-I repbase; DNA; INV; 5591 BP. XX AC scaffold_181123; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_DWil_; KW BEL-8_DWil-LTR; BEL-8_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5591 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181123; Positions 89675 84085. XX CC Positions [4659-5219] - Integrase core CC 'ATAAC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 71..2479 FT /product="BEL-8_DWil-I_2p" FT /translation="MSNKMKTRGRKKRINMDAMETPKSAVNESTAGVSEIE FT RDASNKSKAPKDREDNTNILESVQRMMTQLQEQIASQSSKIEAIHNGQEQQ FT IQEIHLVQSEIVNLKIANESITDCISELEKTHALGGTQGAAILDGAQRHGA FT IMGAPHALGGTQGAANIDGAQRHGAIMGAPHALGGTQGAANLDGAQRHGAI FT MGAPHALGGTQGAANKDGAQRHGAIMGAPHASSGTQGATDDGATQNKMPFK FT TTTGISGVYEPFPINVGTGESGNVTGCVRVGQASQGQYPHNYMSGHRIMDL FT PEFSGRAEDWPVFITMFIESSAAYGYTQLENVIRLQKSLKGAAKREVKGVL FT VNPANVPHIIEHLQFMYGRPELLIKSQPEMVNDIPQIPENNLERMIPFAIK FT VNNLVIFLQSSNAVLQLQNPSLLDQLVSKMPMSKRMQWQEFSSQIYPYANL FT MDFSNWLNSIARTIAKVTLMTTLNRPTSKGDCRHNPKLVMQTNTGEYQKKD FT KKYTKVCTECNGDHSLSECDVYVKADVNEKWNMVKKLGVCFSCLKRNHNMI FT SCKFKRRCNISGCKSSHHKSLHNSNRQTVEEDKAEAATHTVKSSMKRLLVK FT VLPVEIMGPQNTLVKTFAFLDEGSNVTLINENLAKKLGVKKNMDWLTLNWI FT KNHTINLNTTVCHLKIRGINGNKNWLQLKGTTIPDLELPLQSMDHQTLAEE FT YIRVANLPIASYTDAKPTILIGIDNTHLVRTIRSVDIGPRLVASKTPLGWV FT IFGSRADHSNTSPLVTSVFHANSLEDQMNREMHDLMADYFTTELKRYFKTQ FT Q" FT CDS 4143..5405 FT /product="BEL-8_DWil-I_3p" FT /translation="MGWILRFKQRILKRQCDNDVTTLSSRELQDAENTLVK FT IMQKEIYHKELADLQANSCVSNESSIFSLTPTTNADGILVVNGRLDYATSL FT SQSARQPIILPKKHKLTDLIVTSYHDRFHHQNDELIITEIRQKFWIICLRA FT AVKRAKTVCQFCKNRKIQPQATLMGQLPTDRVTPYQRPFSYVGVDYFGPVT FT VTIGRRHEKRWVALFTCFTIRAIHLELAADLSTDACILCIRNFINRRGIPI FT RIRSDMGTNFVGADRLIREVKDLFDWNYIAEEMANKGIEWKFNCPANPSSG FT GCWERLVQSVKKVLVITLQEKAPQVETLRSLLIEAENIINSRPLTHIPVDK FT DDPGPLTPNHFLLGTTNSTQTPGEEDNFCHRKQYRICQQLKNTFWKRWIRE FT YLPTLDKANKVIRTDSTHQDGRLSHNM" FT CDS 2554..3543 FT /product="BEL-8_DWil-I_1p" FT /translation="MALKRLQLIEHKIIRSNSVKTYDEAIMDFVKKGYAVK FT MTKLQTCVISSKTWYLPHFAVYNVNKPGKLRLVFDAAASIEEVSLNSRLLK FT GPDENPPLTQILFQFRQGAVAVCADIKEMFLQVRVRQEDQDCLRFLWRHGD FT TTREVDTYKMTVMIFGATCSQCCAQCVKNINAQNSTSSKEVIDAIQNNHYV FT DDFVASFVSEQKAYDISSEVKRVHELAHFELRGFVSNSKWVQNKLNKEEKN FT KTDKLPVSMDKKAVDKVLGMYWYNDIDQFTFDLKMNRVEQSILKAENCPTK FT RQLLSLVMSIYDPLGMLADLTIHGKILIQEILCHLKLT" XX SQ Sequence 5591 BP; 1957 A; 1001 C; 1213 G; 1420 T; 0 other; ttctttattt taacttaaat aatatgtaaa agtgctttga tggtgtcgga attgactaga 60 aatgtgaaat atgagcaata aaatgaaaac gcgcgggaga aaaaagcgta tcaatatgga 120 cgcgatggaa acgccgaaat cggcggtaaa cgagtctacc gcgggagtga gcgagataga 180 aagagacgct tccaataaaa gcaaagcgcc aaaagacaga gaagacaata cgaatatatt 240 ggaaagtgtg caaagaatga tgacgcaatt gcaagaacaa attgcgtcac aaagttcaaa 300 aattgaagct atccacaatg gacaagaaca acaaatacaa gaaattcatt tagtgcaaag 360 tgaaatagtt aatttaaaaa ttgctaatga atctataact gactgcattt cagaattgga 420 gaagactcat gctttgggcg gcacacaggg tgctgctatc ttggacggtg cacaacgtca 480 tggagccata atgggcgctc cgcatgcttt gggcggcaca cagggtgctg caaatataga 540 cggtgcacaa cgtcatggag ccataatggg cgctccgcat gctttgggcg gcacacaggg 600 tgctgcaaac ttggacggtg cacaacgtca tggagccata atgggcgctc cgcatgcttt 660 gggtggcaca cagggtgctg cgaataaaga cggtgcacaa cgtcatggag ccataatggg 720 cgctccgcat gcttcgagtg gcacacaggg tgctaccgat gatggtgcta cacaaaacaa 780 aatgccattt aaaactacaa ccggaatatc tggtgtctac gaacctttcc ccattaatgt 840 tggtacgggt gagtctggta atgttacggg ttgcgttcgg gttggtcagg ctagtcaggg 900 acaataccca cataattata tgtcggggca cagaattatg gatctgccag aatttagtgg 960 tagggcagaa gattggccag tatttattac aatgttcata gaatcctccg ctgcttatgg 1020 ttatacccaa ctggaaaatg taatacgttt acagaaaagc ttaaagggag cggcaaaaag 1080 ggaggtaaaa ggggtattag tgaatccggc aaacgtgcca catattatcg agcacctaca 1140 gtttatgtac ggaagaccag agttgcttat taaaagtcag ccagaaatgg tcaatgatat 1200 accccaaatc ccagagaata atttagaaag gatgattcca tttgccataa aggtaaacaa 1260 tttggtcata tttttgcaat cttcaaatgc agtattacaa ctgcaaaatc catctttgct 1320 ggatcaatta gtttcaaaaa tgccaatgtc aaaacgcatg cagtggcaag agttttcatc 1380 tcaaatatat ccatacgcca atttgatgga ttttagtaat tggttaaata gtatagcaag 1440 aacgattgcg aaggtaacat taatgacaac tttgaacagg cccacttcaa aaggtgattg 1500 tcgtcacaat ccaaaactgg taatgcaaac aaatactggc gaatatcaaa agaaggataa 1560 gaaatataca aaagtttgca ccgaatgtaa tggtgaccac tctttgtctg aatgcgatgt 1620 atatgtaaag gctgatgtaa atgaaaagtg gaacatggtc aagaaattag gggtttgttt 1680 cagctgtcta aaacgcaacc acaatatgat atcctgtaag ttcaaacgaa gatgtaacat 1740 cagtggctgc aagagttcac atcacaaatc gttgcacaat tctaatagac agacggtaga 1800 agaagataaa gcagaagcag cgacacatac agtcaaatct agcatgaaaa ggttgttagt 1860 taaagtcctg ccagtcgaaa taatggggcc gcagaataca ttagttaaaa cgttcgcgtt 1920 tttggacgag ggttccaacg taacgttaat taacgagaat ttagctaaaa agctaggggt 1980 aaaaaaaaac atggattggt taacattaaa ttggattaag aatcatacca ttaacttgaa 2040 tacgacagta tgtcatctaa aaataagagg tattaatgga aataaaaatt ggttacagtt 2100 aaaaggcact acaatccctg atctggaatt acctctacaa tcaatggatc atcaaacatt 2160 agcggaagag tatattcgag ttgcgaacct cccaatagca agttacacag atgccaaacc 2220 aacaatactg atcggcatag ataataccca cctggtcagg actatacgaa gcgttgatat 2280 cggtccaaga ctagtggcat caaaaacacc tttgggatgg gtgatttttg gatcaagagc 2340 agaccattct aatacaagcc cgttagtgac atcagtgttt catgcgaatt ccttggagga 2400 tcaaatgaac agggaaatgc acgatttgat ggcagattat ttcacaactg agctaaagag 2460 atacttcaaa acacaacagt aaaattagaa ggagaatatg aaaccggatt gttatggtca 2520 aaagaaacat taaatcttcc cgacagttac agtatggctt tgaaaagact ccaattaata 2580 gagcataaaa ttattcgctc aaatagcgta aagacctacg atgaagccat tatggacttt 2640 gtgaagaaag gatatgccgt caaaatgaca aaacttcaaa cttgtgtaat aagttctaag 2700 acatggtatc tacctcattt cgccgtttac aacgtgaaca aaccaggaaa gctacgacta 2760 gtattcgacg cagcagcctc aatcgaagaa gtttcattga attcaagatt attaaaagga 2820 cccgatgaaa atccgccatt gactcaaatt ttattccaat ttcgacaagg agcagtggca 2880 gtttgtgcag atatcaagga aatgttcttg caggtgcgag ttcgacagga agatcaggat 2940 tgcttacggt ttttatggag acatggtgac acaacaaggg aagtcgatac ttataagatg 3000 acggtcatga tattcggagc cacttgttct cagtgttgtg ctcaatgtgt aaaaaatata 3060 aacgcgcaaa acagtacgag ttcaaaggaa gtaattgatg caatacaaaa taatcattac 3120 gttgatgact ttgtagcaag ctttgtatct gagcaaaagg catacgacat aagctctgaa 3180 gtaaaaaggg tgcatgaatt agctcacttt gaactaagag gctttgtttc aaactcgaag 3240 tgggtacaaa acaaattgaa taaagaagag aaaaataaaa ctgataagtt accagtaagt 3300 atggataaga aggctgtcga taaggtattg ggcatgtatt ggtacaacga catagaccaa 3360 tttacttttg acttaaaaat gaatcgcgtg gaacaaagca tacttaaagc tgaaaattgt 3420 ccaacaaaga ggcagcttct aagcttggtc atgtctatct atgatccatt agggatgtta 3480 gccgacctga cgattcatgg aaaaatattg attcaggaaa ttttatgtca tctcaagcta 3540 acatagaatt gcatattttt gtagatgcta gtgaacaagc ctttgctgcc gttggttatt 3600 ggagaataat acttgatgga cttatcgaaa caattttcgt cgcgggaaaa actagatgcg 3660 caccaataaa gactttatct gttccccgat tagagcttca ggcagcggtt ttaggtaccc 3720 gacttaagac tctgatactg gaaagtcatt ctgttaacgt acagaaatat acattatggt 3780 cggattcaaa gactgtcata gcttggattc aatcagatca tcggaaatat aaacaatttg 3840 tggctcaccg tataaatgaa atactggaaa caacaaacga aaatgattga cgatgggttc 3900 caacacattt aaatgctgcc gataaagcca cgagaacaaa ggtaaaaatt aagtacgagc 3960 caggtaacat ttggacaaga ggccccgaat ttttattgaa ggatgaatct agctggccag 4020 tacaggaaaa gatctacgtt gaaaacgatt cctctgaagt aaaacgggta tatttaacaa 4080 cttctgttga aaatattatt gacctgaaca gattttctaa gtatttgaaa ctgaagcgaa 4140 caatgggatg gatattgaga tttaaacaac ggatattgaa acgtcaatgt gataatgatg 4200 taacaacact ttcttcgcgg gaattgcaag atgcagaaaa tacattagta aaaatcatgc 4260 aaaaggaaat ctatcataag gaactagctg atttacaagc aaattcatgc gtatcaaatg 4320 aaagtagcat cttctcttta actcctacga caaatgccga cgggatctta gtggtcaacg 4380 gtaggcttga ctatgcaaca tcactatcgc aaagtgctcg tcaacctatc atattaccca 4440 aaaaacacaa actaactgat ttaatagtaa catcgtatca cgatagattc caccatcaaa 4500 acgatgaact cattatcact gagattcgtc aaaaattttg gataatatgc ttgcgagctg 4560 ctgtaaaaag ggcaaaaaca gtttgccagt tttgtaaaaa tagaaaaatt caaccgcaag 4620 cgactctgat gggtcagtta ccgaccgatc gtgtcactcc ttaccaaaga ccgttttcct 4680 atgtaggagt tgactatttt gggccagtta ctgtaacgat cggccgtcga catgaaaagc 4740 gatgggtggc tctatttacg tgctttacca tacgagctat acacttggaa ttagcagcgg 4800 atttgtctac agacgcatgt attttatgta ttcgaaactt cataaacagg cgtggtatac 4860 caatcagaat acgttcggat atgggaacaa attttgttgg agctgatcgt ctaataaggg 4920 aagtcaaaga cctatttgac tggaactaca tcgctgaaga gatggcaaat aaaggcatag 4980 aatggaagtt caattgtcca gcaaacccaa gttcaggagg atgttgggaa cgcctagttc 5040 agtcagttaa aaaggttttg gtaataacat tgcaagaaaa ggcgccccag gttgaaacac 5100 ttagaagttt gcttatcgaa gcggaaaata tcataaatag cagaccactt acacacattc 5160 cagtggataa agacgatcca ggtccattaa cacctaatca tttcttgctt ggcacaacca 5220 attcgacaca aacccctgga gaggaagaca acttttgtca tcgaaaacaa tataggattt 5280 gtcaacaatt aaaaaatacg ttttggaaac gatggataag ggaatatcta ccaactcttg 5340 acaaagcgaa caaagtgatt cgcaccgatt cgacccatca agatgggaga cttagtcata 5400 atatgtgacg aaaaccaacc aagggcaagc tggaaaaggg gaatcatcgt gaaagttatt 5460 ccagcaagag atgggctaat acgagttgct gaagtacaaa caaatgccgg aattttaaaa 5520 cgaccagtgt ctaagcttgc tgtattagac gttttgggta aaactcccga gtagttttac 5580 ggtaggcggg a 5591 // ID hATm-15_HM repbase; DNA; INV; 2384 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 07-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2384 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1909-1909 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 730..2184 FT /product="hATm-15_HM_1p" FT /translation="MMNVSSGEATNPLMKFLILLKWTEHGYLPRTSDCTIS FT KWKAKEQWDTLQAKRPVKKRSIPLKDKNFLMSHPTHHLTILAGFSDSDSEA FT DDSEYENNEEEDETTPTPSRKHHKSKIAVNLVTSTRVSTNKAANICKQLAR FT EGIDIPTPCQSAIYKSTIKEAIKLKEEMIEKLHMESWSLHFDGKHIDRMDY FT QVVVLKNERREIKLDVLGLVDGRALTIAQGISKVLDEFHLWNSIQMIVADT FT TSVNTGKKNGVVVILQRMFTEKRINKPQFISCQHHVLDRILRLVMDEELGS FT KTQSPNIEYPFVSQLLKEYEQLKTQFDNGTEVILETSGWRDDMKFLFHLTR FT VFRFFEEKGHFPLIRFQKIPNISNARWNSRAILAILAFILIPSTRTVLQRV FT WFTDQLYNENDFNNLAEILNPYKKALSSLKNHWKREPSILQIPRSNQCAER FT AIKVMEEIYSSCKNKDKLHLRFLLCNKQEQGQRSESLLD*" XX SQ Sequence 2384 BP; 855 A; 423 C; 452 G; 654 T; 0 other; ttagggttat gactttttga ctttttaaaa aaaaattaat ttcctgtgtc ataaatcgat 60 gaacttttat tagaaaacat cgaaaacatg ttcaattcct ctcggttgta aaaaaaggtc 120 accctgcaat tcaaattacg tttcgtcatt tatttaatct tgttactgct actactaata 180 cgtgtggctt tcctgcaaat catttaaaac ttgtatagat tagaagtgtt tagaacttta 240 gaacattttg gaaaataaca gaatatttta gaagacttta gaacattcta gaacaattta 300 aaatattttg gaacattcta gaacaattta gaacattctg gaatattcta gaacatttta 360 gactatatat agctgctcat ctaattttat agttattaca gaagcaacac atggcgtcgt 420 acaagcctag cgatagtagc attcagaaga acaaggatgt ctggaccaat aatctggtaa 480 aatttgaaga ctctcataaa agaagaatgg tcccttcgga agaatattca ctggaagaaa 540 agtgcagaat gcctctccaa aagcaaatta tcggacgcta ccatttcctt ggaccaaaaa 600 tcaaaggaag aaaggagcga atagcggaaa tttccaaaga agtcaccaaa ttgtggaaga 660 ataaactgaa ttttcctcgt gtatctgatc aagttataca agcaaagctt gataaaatac 720 tgaaatgtta tgatgaatgt gtcaagcggg gaagctacga accccttgat gaaatttttg 780 atattactaa agtggacgga acatggttat cttccgagga caagcgattg taccatctcc 840 aagtggaaag caaaggaaca gtgggatact ctacaggcca agcggccagt aaagaaacga 900 tccatccctc taaaagacaa aaactttcta atgagtcatc caactcatca tcttactatc 960 ttggctggtt ttagtgattc tgactctgag gctgacgaca gtgaatatga aaacaatgaa 1020 gaagaagatg agacaactcc tacaccaagc agaaaacatc ataaaagcaa aattgcagtc 1080 aacttggtga cctccacaag agtatcaaca aacaaagctg ctaacatatg caagcaattg 1140 gctcgtgaag gcattgacat cccaactcca tgtcaatcag ctatttacaa gtcaacaatc 1200 aaagaggcaa tcaagctgaa ggaagaaatg attgaaaagt tgcatatgga aagttggtcc 1260 ttacactttg atggcaagca catcgacaga atggattatc aagtggtcgt ccttaaaaat 1320 gaaagaagag aaatcaaatt ggatgtcctt ggcttggtag atggcagggc tttaactatt 1380 gctcaaggaa tttctaaagt cctcgatgaa tttcacctat ggaattcaat acagatgatt 1440 gttgcagata ccaccagcgt taacactgga aaaaagaatg gtgttgttgt aatattgcaa 1500 cgaatgttca cagagaaacg catcaacaaa cctcagttta tcagttgtca acaccatgta 1560 ttagatagaa tcctccgttt ggtgatggat gaagaacttg gaagtaaaac acaatcgcca 1620 aacatcgaat atccatttgt atctcaactg ttgaaggagt acgaacaact gaaaacacag 1680 tttgataacg gaacagaagt aattctcgaa acatcaggct ggagagatga tatgaaattt 1740 ttgtttcatc tcactcgagt gttccgattt tttgaggaaa agggacattt tccattgatc 1800 aggtttcaga aaatacccaa catcagcaat gcgaggtgga actccagagc tatactagcg 1860 atcctggcct tcattttaat tccttcaaca aggacagttc tgcagagagt atggtttaca 1920 gaccaactgt acaacgaaaa cgactttaat aacttggctg aaatactgaa tccatacaaa 1980 aaggctttaa gctctttgaa aaatcactgg aagcgagagc catccatact acagataccc 2040 agaagtaatc aatgcgcaga gcgcgcaatc aaagtaatgg aggaaattta ttcatcatgc 2100 aaaaataaag acaaattgca ccttcgattc cttctgtgca acaagcaaga gcaaggtcaa 2160 cggtctgagt cattgttgga ttgaaaattt attgtattac atgactatga cattctaaat 2220 acatttgaat acatagtagt acttatgtca aaatcgattt ttcaatggcg gggtgaactt 2280 ttttgaaaag atatgcaaat aatagttcat ttcgtgcttt ttaggtaaaa gttcatcaaa 2340 ctacagtaca ggagcatggg gttgataaaa agtcataacc ctaa 2384 // ID DNAX-6_AP repbase; DNA; INV; 258 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-6_AP. XX NM DNAX-6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-258 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2059-2059 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is either TA or TATA CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 258 BP; 54 A; 73 C; 74 G; 57 T; 0 other; tattcagaat aagaattccc actcgcttga cgaaaactcg tcttgttcgc gcatgtctgt 60 tgccaccagt tggcgctgat gtttgcagcg ccgcccccgc cgacgttccg caggtgtata 120 cgtatacgat agcgacgggt gacggcagcc gtcgacaact ggcgaccaat cacagcgcag 180 ccgacactcg ccaggggggt ctgacagctg ccggcggttc agttgcgtgc gcgaaagtgg 240 gaattcttat tctgaata 258 // ID RTE-2_BF repbase; DNA; INV; 3062 BP. XX AC . XX DT 29-JUL-2009 (Rel. 14.07, Created) DT 29-JUL-2009 (Rel. 14.07, Last updated, Version -1) XX DE Amphioxus RTE-2_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; RTE-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3062 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3062 RA Kapitonov V. and Jurka J.; RT "Young families of RTE non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1700-1700 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..3036 FT /product="RTE-2_BF_1p" FT /translation="MTVHKGKRRLPVRTRATSSVSENNQADDMNTTIAKKL FT QLRKMNLGIGTWNVRTLNAEGKLQQLEYELKNYTWDILGIAEARLTGSGEI FT TTDEGHKVYYSGHESAHIHGTAIMVHKETKDAVIEHNPVSSRLMSLRLSAK FT PFNVTIVQAYAPTSDATDKDLNDFYNGLTVLMQRIPKKDITVLLGDFNAKI FT GTDAYKDWSGTVGKHGFGNTNDNGLRLLEFARYHSLAIANTFPAHKPSRKA FT TWHSPDGKTHNMIDYILIGKRHLSSLNLAKTRTFPGADVGSDHDMVLTTLS FT VRLKSLQKKKDGRIHYNVKKLKDPNTLAAFQSEIEERLAPLLADNAEKDDL FT ASEAESNLKAAAKAILGKERLTKHPWINEQVLQKCDERREKKKVRFKSRRE FT NREYQTVNREVKQAIKDAKEQWIQQECSKIEAGIRSNNTKQAFQTLKTLTH FT SSTNQITCIEDADGNLLTAKTDIVKRWEEYCRELYNFQLTAEEGVLQDLKS FT RTCQFQEDDEPDILESEVTAAIKALKQGKSPGIDNVPAELLKAGGETVVKV FT YTRICNHVFKTGKWPDSWTTSIVIPLPKKGNLKKCQNYRTISLISHPSKIL FT LKVILNRLQPQAEQILAEEQAGFRKGRSCAEQIFNLRLICEKYRELGKPVY FT HTFVDYKKCFDRVWQDGLWAVMRRFRISTRIVNSIEALYKASKSTVMTAGE FT FSECFPTSVGVRQGCLLSPTLCNIFLENIMREALTPKESPVKLAGRIITHL FT QFADDVDLIDGSAADQQEQFSSLDSTSRRYGMEVSLDKTMCLVSGPPGFRT FT QVTVRGTELEQVGEFTYLGSLQTEDGSSAREVKVRIAKSTLALSKLKHIWS FT SKNITMPTKIRLLRSLVLSIFLYGAETWTLNADIKKRINALEMNCYRRLLG FT VHWTSHTSNKDIQERITTLAGPLPSFLSLVKRKKLQWFGHACRRKGSLTNT FT ILQGKVEGARPRGRPRRTWINDLKEWTGRTSAQLTRLADDRQSWKSLVNSI FT TAPTAT" XX SQ Sequence 3062 BP; 1036 A; 683 C; 710 G; 633 T; 0 other; atgacggttc acaaagggaa gagacgcctt ccagtccgaa ctagagccac gtcgtccgtg 60 agtgaaaaca accaagccga cgacatgaac accaccattg ctaagaagct acaattgcgt 120 aagatgaatt tgggaatagg aacttggaat gttaggacac ttaatgctga aggtaagcta 180 caacaactgg agtatgaact taagaactac acctgggaca tcctgggaat cgcagaggca 240 agactaacag gttctggcga aataactaca gatgaaggtc ataaagtcta ctacagtgga 300 catgaaagcg cccatataca tggaactgca ataatggtac ataaagagac aaaggatgca 360 gtgattgaac acaatccagt ctcgagtcgt ttgatgtcac tacgtctcag tgccaaacca 420 tttaacgtta caatagttca ggcatatgca ccaacaagtg atgccaccga taaagacttg 480 aatgacttct ataatggtct cacagttctc atgcagagaa tcccaaaaaa ggatataacc 540 gtactgctcg gggacttcaa tgccaagata gggactgatg cctacaagga ctggtcagga 600 actgtaggta aacatggttt cggaaacaca aatgacaatg gactacgtct attagagttt 660 gctagatatc atagcctagc catagcaaac acctttcctg cacataagcc tagccggaag 720 gcaacctggc actcccctga tggcaagacc cacaatatga ttgactatat tctgataggg 780 aagcgacacc tttccagtct aaacctagca aaaacccgca cttttccagg tgccgatgtc 840 gggagcgatc acgacatggt gttgacaaca ctatcagtac gacttaagtc actacagaaa 900 aagaaagatg gaagaataca ctataacgtg aaaaaactta aggatccaaa tacacttgcg 960 gcattccaat ctgaaattga agaaagactt gcaccacttc ttgctgacaa tgctgaaaag 1020 gatgaccttg caagcgaagc agagtcaaac ttgaaagctg ctgcaaaggc catactgggg 1080 aaagaaagac tgacaaaaca cccatggata aatgaacaag ttcttcagaa gtgcgatgaa 1140 cggagggaaa agaagaaagt cagattcaaa tcccggcgtg aaaatagaga atatcaaaca 1200 gtaaacagag aagttaaaca ggccatcaaa gatgctaagg agcagtggat ccaacaggaa 1260 tgtagtaaaa ttgaagcagg catacgaagc aataatacaa agcaggcatt ccaaacgttg 1320 aagacactta cacatagcag cacaaaccag atcacatgca ttgaagatgc tgatgggaac 1380 ctgctaacag ctaaaacaga catagtgaaa cgctgggaag aatactgtcg ggagctgtac 1440 aactttcagc tgacagcaga agaaggtgta ctacaagact taaagtctcg cacttgtcag 1500 tttcaagaag acgacgaacc agacattctt gaatccgaag taacggcagc cattaaagca 1560 ttaaaacaag ggaaatcacc aggaatagac aatgttccag cagagctact gaaagcaggt 1620 ggtgaaacag ttgtaaaagt atacacccga atctgcaacc acgtcttcaa aactggtaag 1680 tggcctgatt cctggacaac atctatagtt attcctcttc ctaagaaagg caacctaaag 1740 aagtgccaaa actaccgcac cataagcttg atatcacatc caagcaaaat ccttctgaaa 1800 gtcattctca accgactcca acctcaagct gagcaaatcc tggcagaaga gcaagcgggt 1860 ttcagaaaag gccgatcctg tgctgaacaa atcttcaacc tacgcctgat ctgcgagaaa 1920 tatagagaac taggaaaacc agtataccac actttcgtgg actacaaaaa atgttttgat 1980 agggtgtggc aagatggact gtgggcagtg atgcgccgct tcagaattag cacaaggatt 2040 gtaaactcca ttgaggcact gtacaaggct tcaaagagca cagtcatgac agctggcgag 2100 ttcagcgagt gtttccctac ctcagtggga gtgcgccagg gctgtctctt gtcgcctaca 2160 ctgtgtaaca tattcctaga aaacataatg agagaggctc tcacaccaaa ggagtctcct 2220 gtcaagcttg caggacgcat cataactcac cttcagtttg cagatgacgt agacttgata 2280 gatggctcag cagcagacca gcaagaacag ttcagtagct tagactccac gagcaggagg 2340 tatggcatgg aggtgagcct cgacaagaca atgtgcctag tatcgggtcc accaggcttt 2400 agaacacaag tcacagtaag aggtacagag ttggagcagg tcggggagtt cacctacctc 2460 ggctccttac agacagagga cggcagttca gcccgagaag tgaaggtcag aattgcaaag 2520 tctactcttg cattgtcaaa gctcaaacac atctggtcta gcaaaaatat taccatgccg 2580 accaaaattc gtcttctacg ctctcttgtc ttgtcaatat ttttatatgg tgctgagaca 2640 tggacactga acgcagacat aaaaaagaga ataaatgctc tagagatgaa ctgctacagg 2700 cggttactag gagtccactg gacatcccat acttcaaaca aagatatcca ggaacgcatc 2760 accactctgg caggaccact acccagcttt ttatctttgg tcaaaagaaa gaagcttcag 2820 tggtttggac atgcgtgtag gcgaaaagga agcttgacaa atacaatctt acaaggcaaa 2880 gtggaaggag cgcggccgag gggtcgcccc cgccgaacat ggataaacga cctaaaagaa 2940 tggactggaa ggacgtcagc ccagctgacg aggctggcag atgatcgcca gagctggaag 3000 tcactagtga acagcattac tgcccctacg gccacatgag ctacgggacg agtgagtgag 3060 tg 3062 // ID Mariner-13_SM repbase; DNA; INV; 1307 BP. XX AC . XX DT 20-DEC-2008 (Rel. 13.12, Created) DT 11-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE Mariner DNA transposon from Schmidtea mediterranea: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINER-13_SM. XX NM MARINER-13_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1307 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 8(12), 2245-2245 (2008). XX DR [1] (Consensus) XX CC TSD : TA. CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 124..1185 FT /product="Mariner-13_SM_1p" FT /translation="MSERKDDWNPPESWKRITCIMMIRAGHQNKDIMTAAQ FT CSLNTVKTIRHELETCNGDYEAVARRKIHNRRSDCVRTAEFLKDLQEKVLK FT NPGIGIRALSREMNVSSSTMKLALNEDLRYYSYKRRKGQLLTEKARENRLI FT NAKKLLSKVKHPAEPQTIWFFSDEKNFCQDQKHNTQNNRWLAYSPKDIPRV FT MQTKFPQTVMVFGCVSSEGDVMPPHFFREGLRLNSDGYMELLNTVVKPWIT FT KVVNGRPYVWQQDSAPCHTSGKSQKWLSENFYDFTSPNVWPPNSPDLNPMD FT YFVWGAVEKDTNCYASNTKAQLMDKIKSVFEALPRETVASACSRFRSRIEA FT VIDANGGYFE*" XX SQ Sequence 1307 BP; 404 A; 263 C; 273 G; 367 T; 0 other; cagggtgttc gggataaatt tgacatattt ataattgatt atgttttata atattaattg 60 attattatta acccacacta ctcatcattt aaaagtttgg acccttcttc attcatatac 120 aagatgtcag agaggaagga tgactggaac ccacctgaat catggaagag aataacttgc 180 atcatgatga ttcgtgccgg ccatcaaaat aaggatatta tgactgctgc ccaatgctcc 240 ctgaacacag taaaaacaat caggcatgaa ctagagactt gcaatggtga ctatgaggct 300 gtagcaagaa gaaagatcca taacagacga tctgattgcg ttcgtacagc agaattcctc 360 aaagatctgc aggaaaaggt gttgaagaac cctggcattg gaattcgggc tttgtcacgt 420 gaaatgaatg tttcatcctc cactatgaag cttgcactca atgaagacct tcgctactac 480 tcatacaagc gccgcaaagg tcagctgctc acagaaaagg cccgtgaaaa tcgtttgata 540 aacgcgaaga aacttctgag caaagtgaaa catcctgctg aaccacaaac aatctggttc 600 ttttccgatg agaaaaactt ttgccaggat caaaaacaca acacgcagaa taatagatgg 660 cttgcataca gtccaaagga cattcctcgt gtaatgcaga ctaaatttcc ccaaactgtg 720 atggttttcg gatgtgtgtc ctctgagggc gatgtaatgc ctccgcattt tttcagagag 780 ggcctcaggt tgaattcaga tggctacatg gagttgctaa acactgtagt caagccctgg 840 ataacaaaag tagtcaatgg taggccgtat gtatggcagc aagattcggc cccttgccac 900 acctctggga aaagtcaaaa atggttgtct gaaaattttt acgacttcac cagtccaaat 960 gtttggcctc caaactcccc agaccttaac cctatggatt attttgtatg gggcgcagtt 1020 gaaaaagaca ccaattgcta tgccagtaat acaaaagccc agttaatgga taaaattaag 1080 agtgtttttg aggcccttcc cagggagact gtagcatcag cttgttccag gttccgaagc 1140 aggattgaag ctgtgatcga tgctaatggt ggttattttg agtgaaactt gttgatagta 1200 ttgtaatttc tgtgcctgtt attttttttc aattcattaa atttagtagt tacaaagaag 1260 atttcctttt tacttacgaa attgtcaaat ttatcccgaa caccctg 1307 // ID Copia-103_AA-LTR repbase; DNA; INV; 319 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-103_AA_; KW Ty1_copia_Ele83; Copia-103_AA-I; Copia-103_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-319 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 319 BP; 82 A; 67 C; 71 G; 99 T; 0 other; tgaacagcag ataggtcaac cgtgcgaatg aacagactct gtatccagtt tcaaagttaa 60 taaacaacgt gtttaaacgt aacaatactt tcagctttcc taagaaagag aagctactgg 120 tgagtacata agttcgtggt ggaagacttt cctctgaact tagtctaggt tgtgcattca 180 gctttcaggt gaagattttc acgggaagct tttgcatccg ccctggtttt gctgtttgat 240 ttggttcgga caacggtttg ccaggtgaag tatttcacgg gtaacctttg tctccactcc 300 gctctcgcct atttcaaca 319 // ID DIRS-3_DPu repbase; DNA; INV; 5337 BP. XX AC scaffold_192; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE DIRS retrotransposon from Daphnia. XX KW DIRS; LTR Retrotransposon; Transposable Element; nonautonomous; KW DIRS-3_DPu. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5337 RA Jurka J.; RT "DIRS retrotransposons from Daphnia."; RL Direct Submission to RU (08-JUN-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_192; Positions 80454 75118. XX FH Key Location/Qualifiers FT CDS 137..3937 FT /product="DIRS-3_DPu_1p" FT /translation="MSRPPSDFFRGRFFDPRVNYFDQPVSERERMRDREMR FT IRSHLQHPGYQFRERSPVGFSRQWDGFGFVSPFEDHEGRFYDHRSEYRGDY FT SYDPTFEYNNPPNDRRQFQGYYCGDKDFYFFSEDGLCYDAQGVTYRFEEDG FT FYYDVVTGVRHDSGGFACDEKGVRIGDNLDTSDEDLETDDPPPQNKTVPPP FT SEKSASPKGKAIRPSTGKSVSFQSEKKTPPRKEQANAGPSAAQNQDELNSD FT QTRRSRPWRPSSMFDPVLGEGGGSEAPVTAPVAINPTPTVAAIPVITIEGV FT SLPGGSGSQPLSVSKDISEEISSLMTKGISTDTSKAISGEFPLEFVDAEFS FT LKPPKLDGWISRRALLKTDKSIVRSINAAEESFTKAQLRIMDIAPPLIDLY FT ARLNSLPDSESLKRPVQAALQQWGRAFFHVTKERRSAAIALAEPGAEYLLR FT DPDAFESGKEARSFLFTERYLQAMLTDANQDNTLAQAARATAAAAAARVPK FT RSNPKRGRVDQPGSQFPPPRYGTDQMGRGGRGGRGRNDRGRGQPANPWLQG FT GRRYVAQNSKHLPPCKTAPQSHLVPHAQTKVPSNPVGVSWPHTGPQAVAAR FT LKLFAARWPAVTSDQWVLEAVREGITIDFVSEPIQKFLPPQITMSVEMSAV FT CDAEVRELLSKRAISEVTDGSTGFVCSFFCIKKKQPGQFRPIVNLKPLNKF FT IRYQHFKMENLESVRFLVRKGDWLAKVDLKDAYFTVAVKKSHHKFLRFRWK FT DRVFEFSCMALGLAPAPRIFTKILKTVMAFLRRQGIRLVIYLDDILILNES FT KIGLEADINTVIDLLQSLGFLINWEKSIVVPTQTIEYLGLIVDSTDPSFSL FT PSSKAEAVKRMCEGALSEGRVSLRTLASIQGNFSWAIPAIPFAQAHYRSLQ FT RFYILNAQRVDFNLEAEVCLSPGARLDLNWWVANIEKANGKMFFPRDPDFE FT IFSDASLTGWGAVCNGITTRGPWTLQDKDKHINELELLGAFFAIQSFLAKE FT SGIAVRIFLDNSTAVSYVNKCGGTRSAALTATAKAISAWCEARHISVEAVH FT LAGELNTIADRESRAEADASDWRLDANMFSLISEIWKMDTDLFAASWNYQL FT PQFISWGPQPGAVAANAFSVSWKDICGYAFPPFSVIFKCLEKLRREKASII FT LICPIWTGQPWFPVLLEHTADIPRLLRPSPTLLTSARGEPHPLLRSGALNL FT AAWKLSGDLTTCGAFRDRLSSFSWRGAVAIQSLPTSPPGSVGEIGVWDGIS FT IPCAVI" FT CDS 3574..4737 FT /product="DIRS-3_DPu_2p" FT /note="tyrosine recombinase." FT /translation="MSRKTPAGESVDYTNLPHLDRPALVSGFIGAHSRHPP FT APASVANATNISAGRTSPAAPIRRSQSSRLETLRRSYDVRGFSRSAIELFL FT AGSRGNTIAAYESAWVSWGNWCVGRNIDPLRSDLASVSAYLAHLHASGKSY FT STINLHRSMLSTTLPSIDGSPIGQHPLIIKLLRGCYNQNPPRPRYDSTWDP FT IRVLQFMSTLGNNEFLPLPTLSGKLVTLLALATLLRVSELASVTFASVILA FT ENAVKFSLSKPRKAQRSGPLQTVTLPAFSDSNTCPVKALQSYVNRTSVNRP FT PSQEGMLFISLIAPFRAVTGNTIGRWIKNFLRTAGINTEIYSAHSTRSAAS FT SLAVARGLSVDQILQAGNWASESTFNRFYNREKTATFAASVMTDA" XX SQ Sequence 5337 BP; 1334 A; 1372 C; 1315 G; 1316 T; 0 other; gaccctaagg gtgtagacct attttatttt agcattttca aagatttgaa ggttatgtag 60 ccttcgtcgg ccgcggcacg cccccttttc ctcagactta tttttgctct gttacaagta 120 acatcgctcg acaaacatgt ctcggccgcc aagtgatttt tttaggggcc gtttctttga 180 tccacgcgta aattatttcg accagcccgt atccgagcgt gaaaggatgc gggatcggga 240 gatgcgtatt cggtcacatc tccaacatcc tggctatcag tttagagaaa gatcaccggt 300 tggtttttct cgtcagtggg acggtttcgg tttcgtttct ccctttgagg accacgaggg 360 ccgcttttac gaccaccgat cagaataccg cggtgactat tcatatgatc caacattcga 420 gtataacaat ccccccaacg ataggcgaca atttcagggc tactattgtg gtgacaaaga 480 cttttatttc ttcagcgaag atggcctttg ttacgacgcg cagggcgtta cttatcgttt 540 cgaggaggat ggattttatt acgacgtagt gaccggtgtt cggcatgatt cgggcggttt 600 tgcgtgtgac gaaaaaggcg tgcgcattgg tgacaattta gacacgtcag acgaggatct 660 cgaaacagac gatccgcccc cgcaaaataa aaccgtacct ccgccgagtg aaaaatcagc 720 tagtccgaag ggcaaggcaa ttcggccatc aaccggcaaa tcagtttcct ttcaaagcga 780 aaagaagacc cctcctcgca aggagcaagc gaacgctggc ccgagcgcgg ctcaaaatca 840 ggatgagttg aattcggacc aaacgagaag aagccgtccg tggcgtccta gttcaatgtt 900 cgatccagtt ctaggagagg ggggggggag tgaggcgccg gtgacagcac cggtggcaat 960 caacccgacg ccgaccgtag cggcaatccc cgtgataacc attgaaggag tgagcctacc 1020 cggtggttca ggctctcagc cgctttcagt ttctaaagat atctcggaag agatttcatc 1080 gttgatgacc aagggtattt cgaccgacac ttcgaaagct atttctgggg agtttccgct 1140 tgagttcgta gacgccgaat tttccctaaa accacctaaa ctagatggtt ggatctcgcg 1200 tcgagcgctt ttgaagaccg ataaaagcat cgttaggtcc atcaacgcag cggaagagtc 1260 gttcactaaa gcgcagttaa gaatcatgga tatcgctcca ccgttaatcg acctttacgc 1320 tcgtttaaat tcactgccgg acagtgaatc gctcaaacgc cctgtgcagg ccgctctaca 1380 gcagtgggga cgagccttct tccacgtaac taaggaacgt cgaagcgccg caatcgctct 1440 tgcggagcca ggagcggaat atttgctgcg cgatccggat gccttcgagt ccggaaaaga 1500 agcgcggtcg tttctcttta cggaaaggta tctacaggct atgttgacgg acgccaacca 1560 agacaatacc ttggcccagg cagccagagc gacagcggca gcagcagctg cgcgcgttcc 1620 aaaacgatcg aatccaaagc gcggccgcgt cgatcaacca ggctctcaat tcccgcctcc 1680 gcgttatgga acagaccaga tgggccgtgg tggaagaggt ggacggggca ggaacgaccg 1740 aggtcgtgga cagccggcca atccttggct gcaaggagga agaaggtatg tcgcccaaaa 1800 ttctaaacat ttgcctcctt gtaaaacggc tccacaatcc cacctggtac ctcatgcgca 1860 aacgaaagtt ccgtcgaacc ctgttggtgt gtcgtggccg catacgggcc ctcaagcggt 1920 agcagccaga cttaaacttt tcgcagcgcg ttggcctgct gtaacttccg atcaatgggt 1980 cttagaagct gtccgcgaag gtattactat cgatttcgtt tctgaaccta tccaaaaatt 2040 tctgccgccg cagattacaa tgtccgtcga aatgtctgcg gtttgcgacg cagaagtgcg 2100 cgagttatta tcgaagcgcg caatatcaga ggtgacggac ggatcgaccg gcttcgtttg 2160 ctctttcttt tgcataaaga aaaagcaacc aggtcagttt aggcccatag tcaatttaaa 2220 accgctcaac aaattcattc gctaccaaca ttttaaaatg gaaaaccttg aatccgtccg 2280 tttcttagtc agaaaggggg attggttggc gaaagtagat ctgaaagatg cgtattttac 2340 ggtggcggtt aaaaaatcgc accacaaatt cttgcgattt cgttggaaag atcgcgtttt 2400 tgagtttagt tgcatggcct tgggcctcgc gcccgcgccc agaattttca ctaaaatttt 2460 gaaaacggtc atggcctttt tacgtcgaca aggcatcagg ctggtcatct atttggatga 2520 tattctgatt ttaaacgagt cgaagattgg gctggaagcg gacatcaata cggttattga 2580 cctcctccaa tcgctcggtt tcctgatcaa ttgggaaaaa tcgatcgtcg tcccgactca 2640 gacgatagaa tatttgggct tgatcgttga ctcaaccgac ccatccttct cgctccctag 2700 cagcaaggcg gaagcagtca agaggatgtg tgaaggtgct ctatccgaag gtagagtatc 2760 cttacgaacg ctggcctcga tacagggaaa cttttcttgg gcgattccgg ccatcccatt 2820 cgcgcaagca cattatcgca gccttcagcg tttttatatt ttaaatgcgc aacgggtcga 2880 ctttaacttg gaagcggaag tttgcttatc acctggcgcc cggctagacc tcaactggtg 2940 ggtggccaac attgagaaag caaacggaaa aatgtttttt ccgcgtgacc cagatttcga 3000 gattttctca gatgcgtcgc tgacagggtg gggagcggta tgcaacggga taacaacgcg 3060 gggcccttgg acgttgcaag ataaagataa acatatcaac gaattagaac tcctgggcgc 3120 gttctttgcg atccagtcct ttttggcgaa agagtcgggc attgccgttc gaatattctt 3180 ggataattcg acagccgtaa gttacgttaa taaatgcggc ggaacgagat cggccgctct 3240 cacagccacc gccaaggcca tttcagcttg gtgtgaggcg aggcatattt cggtggaagc 3300 ggtccatctt gcgggggaac tcaatacaat agcggatcgt gaatctagag ccgaggccga 3360 cgccagcgac tggcgtctag acgcaaatat gttctcttta atatcggaaa tatggaagat 3420 ggacactgat ctgttcgccg cgtcttggaa ctatcagcta ccccagttta tttcatgggg 3480 gccccagccc ggggccgtcg cggccaacgc tttttccgtc agttggaaag atatttgcgg 3540 ctatgcgttt cccccctttt ccgtaatttt taaatgtctc gaaaaactcc ggcgggagaa 3600 agcgtcgatt atactaatct gccccatctg gacaggccag ccctggtttc cggttttatt 3660 ggagcacaca gccgacatcc cccggctcct gcgtccgtcg ccaacgctac taacatcagc 3720 gcggggcgaa cctcacccgc tgctccgatc cggcgctctc aatctagccg cctggaaact 3780 ctcaggcgat cttacgacgt gcggggcttt tcgcgatcgg ctatcgagct tttcttggcg 3840 gggagccgtg gcaatacaat cgctgcctac gagtccgcct gggtcagttg gggaaattgg 3900 tgtgtgggac ggaatatcga tcccctgcgc agtgatttag cctcagtgtc agcttacctt 3960 gctcacctcc acgcatccgg taaatcatac agcactataa atttacaccg ctcaatgttg 4020 tcaacgactc tgccgtccat cgatggttca ccgatcgggc aacacccgct cataattaaa 4080 cttctgagag gatgctacaa tcagaacccg ccaaggcccc gttacgattc cacatgggat 4140 ccgattcgcg tccttcaatt tatgtccacg ttgggcaata acgagttcct gcccctgccc 4200 acgttatccg gaaaattagt cacccttcta gctctcgcca ctctgctaag agtgtcagag 4260 ctagcctccg tcactttcgc atcagttata ctggctgaaa atgcagtcaa attctcctta 4320 tcaaagcctc gtaaggcaca gcgtagcgga ccattgcaaa cggtcacact accagcgttt 4380 tccgactcga acacttgccc ggtaaaagca ctacaatctt acgttaatcg aaccagcgtc 4440 aatagaccac cgagccaaga aggcatgttg ttcatttctc tcatcgcccc ctttcgggcc 4500 gtgacgggca atacaatcgg aaggtggata aaaaattttt taagaacggc cggaattaac 4560 acagaaatat acagcgcgca ctcaacgcgg agcgcagcct cgtctctggc cgtagccaga 4620 ggcctctcgg tcgatcaaat tttgcaagcg ggcaattggg ccagtgaatc aacgttcaac 4680 agattctata atcgagaaaa aacagcaaca ttcgcagcat cggttatgac cgatgcttaa 4740 actttaaaat cacccttagg gtcgagcgga atgtattccg ctctacaatt gaaaattacc 4800 agagtgatcg ctctgcgatc acgataggta attgtaattg tagtaggaag gaatgagaga 4860 gaccctaagg gtctctattc ccaaccctca atcctcccgc tcattcttct attcctacat 4920 cttaaatgtt gcattttgta cgggccatta cccaggattc aaccggagag gcattccaaa 4980 agaggccggg caggaggcag cagaccaacg tgagcaaacg gcctgccaaa aaaaaaaaaa 5040 ttgtccgtct ctttccctct ggactcaaca cattgttccc ctcaagaagc cattatctct 5100 ttatgttgtc tctattcggc tttttctgac aagctgtcag ttttttatgc ctccaaaaat 5160 tgtttccctt atatactctg tatgtcaagt acttcttgtt tcaacgttat tgttttcggc 5220 acgcggccgc ctcagtataa gtctgaggaa aagggggcgt gccgcggccg acgaaggcta 5280 cataaccttc aaatctttga aaatgctaaa ataaaatagg tctacaccct tagggtc 5337 // ID DNAX-5_Tad repbase; DNA; INV; 413 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 3) XX DE Putative non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-5_Tad. XX OS Trichoplax adhaerens OC Eukaryota; Metazoa; Placozoa; Trichoplax. XX RN [1] RP 1-413 RA Jurka J.; RT "DNA transposons from Trichoplax adhaerens."; RL Repbase Reports 9(10), 2147-2147 (2009). XX DR [1] (Consensus) XX SQ Sequence 413 BP; 157 A; 58 C; 53 G; 145 T; 0 other; gcgaaatgcc tagtcagttc gcaaactgca aaaataccaa aacacgtaaa tcaggaaacc 60 gtctttcttt ataaatagag aaactatctt tattcatgca gtaatattac taaaaaaaca 120 gagaatctat atttctttga tgattataat aataataata atagactgat tcaaataatc 180 aatacatatg taacttgcaa ttactgaagt aattgccaaa catttagaga tactattatc 240 aattaagtag ttattattga attgataatg ttctgattat gaaaattatt atttattttt 300 acaagttctc tgattatgac atttactgtt acacttaaca ttagccttaa agcatttgca 360 tcggttaata atacaagatg tatggcattt tgcaaattga ctaggcattt cgc 413 // ID Outcast-19_AAe repbase; DNA; INV; 5709 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Outcast non-LTR retrotransposon from Aedes aegypti. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; KW Outcast-19_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5709 RA Kojima K.K. and Jurka J.; RT "Outcast clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1433-1433 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 307..1809 FT /product="Outcast-19_AAe_1p" FT /translation="MSQDSDSADARGGDDPPIENSDTFDSANEEMEMNIQL FT KRTGISDSDSNEEVDKNVIKRMKSNTGAYKKTNTEKQIEVIDKNEQEFQQV FT NRRKRKDKTKDDKVYDDGWIKNSIYTTFFIEPKLQDEQNKNDQNKSKRIHV FT MEVAKILHNIEVKNYKSLTVAGKNRFKVTFDKPKQAEALINSKLLADCFKY FT NVYVPNMFKQTIGVVRNIPPSITEEEIQANIVSNKKIQKIERIKRMINKEL FT ISTYSVKIFVEGEKLPEEITLYGIPAKVEVYIFPLRICFNCWRYGHKAKAC FT KAKIRCKLCGLEHSDKDCQSAIKKCVHCNGEHIANDYNCPERHRQDQIRTA FT MAKMKLTFSEASKQYARPNRSQQIRLQSQTEFPPLNPIPNLRPNLNSSNVS FT PRVQSVSKVIKTNPIIHKNVFPQPEIVEEPIHEFKENPYKVTEIEKLKQQL FT KNEIIIELNNSGVFNKIKKIQDLIKVQTSKDVNDWNSDLLLININEELGSI FT LNTPF" FT CDS 1858..5493 FT /product="Outcast-19_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MANVTRDLQILQYNINSIRPSETRSLLSNFLTKNKVD FT IAILSEIWLKPDEDIKFNGYNFPKITRSKGYGGVGLLVKNDLAFTEIKLPE FT LNPIEVIAIKILNTIEPIIIFSVYIPPPPINNNQLKDPLKQLLKFIDNLKA FT HTILGGDLNAHHPTWNSSNNICPRGELLINLLDDSDLLVLNDGSPTLIKPP FT NTIPSAIDLTLATPQIAAKIDWEVLDEELSGNHKMIFYKLLDSVRFIPYKQ FT IVVNKKKAISLINNISVDIIENSNDLQKKLNECINQSLYNKTEYKKVPKKW FT WTSEIDNLLKEKNDKLKKYYKNLTINNFLDFRRAKAKLKSVIRREKRKSYK FT ELINELTPNLPSKMLWNTVRMLSGGFRKKDNIILMNNEQLAQKFIDINFPP FT IDEPIKYTPKTRSTISVQISVADFLKILKIKKDSSAPGNDYISYHILKQLN FT PLFIDKIVDILNQVLKTRSMPDDWRTVKIVPIHKPGKQQLDPLAYRPISLI FT QIFLKSINILVKKKIESYIESNNIIPKCSFGFKKRSSAVNCVNFLISKIQE FT TKRNNLIPVVTFLDLSKAFDNVDISILLEKLASTGIASDLVDWVYFYLKER FT KATITLQNGIEISAITNKGLPQGCPLSPLLFNLYTAEIHQLADNDVIFFQF FT ADDFAILIIARNISEATSKMNNILDKVQTTLSKLKLKVNPEKSTTICFTNK FT FQDNLNIKIDNNNIKTEPYQKYLGIWIDHKLSFKKHITETVFKIRRKTNVL FT KMISKKNGGAHPQVMLQINKSLVRSHIDYGISIYGSACKTDLNRLQVAQNI FT GLRLSLRLLKSTPNHVVLAETGEIPVDLRANILALKETTKTLYFRNSPLVE FT TLSSIISSDRDFKHLTFLENNATLNNFLLVQLCAMNNAIPNFDTNQLEIQI FT NLKTNSKNNTNKKVLKSIATQIILDNYSDFYKIYTDGTKTSEGVGCGFYDS FT QMLISVSHKLNPLFSIANAELIGILEAIKYAYLKGEKKICILTDSKSGCQM FT ILNGRQLENYIVNEIYNFLCKSDIIKVIVQWIPSHIGILGNERADTAAKLS FT LNKQSVLPFGLTLGDTILACKKSILEDWNSRYRFISEEKGIQHFKIMNSVS FT TKPWFYKMSFNTVDIVRLSRIRSLHTATKERLHSWSLVPSSLCESCNVVED FT LSHILFKCTKYNRIRNKYQILINKTDIIEIAKSKSYTEYKQITKFLEEIKI FT SV" XX SQ Sequence 5709 BP; 2223 A; 871 C; 889 G; 1726 T; 0 other; cattctctct tgaactgttg tgatcgcaag ttcgaagttt tgcttggtga accatcaagt 60 cgcgacacgt ttttgataga aattcttaca gagaaaaaag tgcttttgat tttagctttt 120 ttagtgatta agtgattggc ttttaactaa attgatttag attaagtgaa gagtaaaaga 180 gttccattag tttttatttc ttttagaggt aagtactttt tgtaagactg ttcatagatt 240 tattcctatt ttaaggttag gttaagtgca tccgattagg aggtgggatt cgatgaatcc 300 accgtaatga gtcaggactc agatagtgca gatgcaagag gaggagatga tcctcctatt 360 gaaaattcag acactttcga ttcggcaaat gaggaaatgg aaatgaacat acagctgaaa 420 cgaacaggaa ttagtgattc tgactctaat gaagaagttg acaaaaatgt aattaaaaga 480 atgaaatcaa acacaggtgc ttacaaaaaa acaaatacag aaaaacagat tgaagttatt 540 gacaaaaatg aacaggaatt ccaacaggtg aacaggagga agaggaagga taaaactaaa 600 gatgacaagg tttacgatga cggttggatt aaaaattcaa tttacacaac ttttttcatt 660 gaaccaaaac tacaagatga acagaacaaa aatgaccaaa ataagagtaa acggattcac 720 gtaatggaag tagcaaagat cttacataat attgaggtta agaattacaa atcgctgaca 780 gttgcaggca aaaacaggtt caaggtaacg ttcgataaac caaaacaagc tgaggcatta 840 attaattcaa aacttttggc tgactgtttc aagtacaatg tttatgtccc caatatgttc 900 aaacaaacca ttggcgttgt caggaatatt ccaccgtcaa ttaccgaaga ggaaattcag 960 gcaaatattg tctctaacaa gaagattcaa aagatcgaaa gaattaaaag aatgataaat 1020 aaagaactga tctcaacata ttcggtaaaa atttttgttg aaggcgaaaa acttccagaa 1080 gaaataacac tgtacggtat tccagcaaag gtagaggttt acatatttcc gctgagaatc 1140 tgtttcaact gttggcgcta tggccacaag gccaaagcat gtaaagcgaa gatcagatgc 1200 aagttatgtg gacttgagca tagcgataag gattgtcaat cggcaatcaa gaaatgtgta 1260 cattgtaatg gtgagcacat cgcaaacgat tacaattgtc cagaaagaca cagacaagat 1320 caaattcgaa cagctatggc taagatgaaa cttacttttt ctgaagcttc taaacaatac 1380 gctagaccga acagatctca acagataaga ttacaatccc aaaccgaatt tccacccctc 1440 aatccaatac ctaacttgag acctaaccta aacagttcaa atgtttctcc aagagttcaa 1500 tctgttagca aagtcattaa aaccaaccca ataattcata aaaatgtttt cccgcaaccc 1560 gaaatagttg aagaaccaat ccatgaattc aaggaaaatc cgtataaagt aacagaaata 1620 gaaaaactaa aacaacaatt gaaaaacgaa ataataattg aactgaataa ctctggtgtt 1680 tttaataaaa tcaagaaaat acaagattta atcaaagtac aaacaagtaa agatgtgaat 1740 gattggaatt ccgatttact tttgattaac attaatgaag aactcggatc aatcctgaat 1800 acaccgtttt aattaaaaat cattctcaat ttcttcatcc agttttaatt ttcaaaaatg 1860 gccaatgtta caagagattt acaaatctta caatataata ttaacagcat tcgcccatcc 1920 gagacaagat cgcttctctc aaacttccta acaaaaaata aagtagatat tgctattctt 1980 tccgaaatat ggcttaagcc tgatgaagat atcaagttta atggatataa ttttcctaaa 2040 ataactagat ctaaaggcta tggtggagta ggtcttttag ttaaaaacga tttagctttt 2100 acagaaatta aattaccgga gttaaatcct atagaagtaa tagccattaa aattctcaac 2160 accattgaac cgattatcat tttttctgtt tatattccac ccccaccaat taataataat 2220 caattgaaag atcctctgaa gcaacttttg aaatttattg acaatttaaa agcccacact 2280 attttgggtg gtgatctgaa tgcacatcac ccgacttgga attcatctaa taacatctgt 2340 ccacgaggtg aattattgat caatctgcta gatgacagtg atttattagt attaaatgac 2400 ggttctccga ctttgattaa acctccaaat accataccat ctgcaataga tttaaccctt 2460 gctacaccac aaattgcagc aaaaattgat tgggaagttt tagacgagga attgtctggt 2520 aaccataaaa tgatttttta taaattatta gattctgtaa gatttattcc ttataaacaa 2580 atagtagtta acaagaaaaa agccattagt ttaattaata atattagtgt tgatataatt 2640 gaaaattcaa atgatttgca gaagaaattg aatgaatgca taaatcaatc gttatataat 2700 aaaactgaat ataaaaaagt cccaaagaaa tggtggactt cagaaattga taatttacta 2760 aaagagaaaa atgataaatt aaaaaaatat tacaaaaatt taaccattaa taatttttta 2820 gattttcgta gagcaaaagc aaaacttaaa tctgtaatta gaagagaaaa aagaaagtca 2880 tataaagaac taattaatga acttactcct aacttgccat caaaaatgct ttggaatacg 2940 gttagaatgt taagtggtgg tttccggaaa aaggataata ttattttaat gaacaatgaa 3000 caactggcac aaaaattcat agacataaac tttccaccaa tagatgaacc aattaaatac 3060 actccgaaaa caaggtcaac gatttcagtt caaatttcag tggcagattt tctcaaaata 3120 ttaaaaatta aaaaagactc gtcagcccct ggtaatgatt atatctcata tcatatttta 3180 aaacagttaa atccattgtt tatagacaaa atagttgata ttttaaacca agtattgaaa 3240 actagaagta tgcctgatga ctggcgaaca gttaaaatag ttcccattca taaaccagga 3300 aaacaacagc tagatccgtt agcttatcgt ccaatttcac taattcaaat ttttcttaaa 3360 tctattaata ttttagtcaa aaagaaaata gaaagttaca tagagagcaa taatattatt 3420 cctaaatgtt cttttggctt taagaagaga tcatctgcag taaattgcgt aaattttcta 3480 atatctaaga tacaagaaac gaagagaaat aatttaattc cagttgtaac atttcttgat 3540 ttgtcaaaag cttttgacaa tgtagatatt agtatattat tagagaaatt agcttctaca 3600 gggatagctt ctgaccttgt agactgggtg tacttttatc ttaaagaaag gaaagccact 3660 atcactttac aaaacggaat agaaatttca gctattacta ataaagggct ccctcaaggg 3720 tgccctttat caccattgtt atttaattta tatactgcag aaatacatca attggcagat 3780 aatgatgtaa tattctttca atttgctgat gattttgcaa tattaataat agccagaaat 3840 atttcagaag ctacgtcaaa aatgaacaac atccttgata aagttcaaac tactctatcc 3900 aaattaaaat taaaagtaaa tccggaaaaa tcaactacaa tatgctttac aaacaaattt 3960 caagataatc ttaacattaa aatagataat aataacatta aaacagaacc atatcagaaa 4020 tatcttggta tatggattga ccataaatta agttttaaaa aacatattac agaaacagtg 4080 tttaaaatca ggagaaaaac aaatgttctt aaaatgatca gtaagaaaaa cggtggtgcg 4140 cacccacaag taatgcttca aattaataaa tcattggtaa gatctcacat agattacggt 4200 atttctatat atggttctgc ttgtaaaaca gatttaaata gactccaagt tgcccaaaat 4260 ataggtctca ggttgtcatt gcgattatta aaatcaactc ctaatcatgt tgtgttagcg 4320 gaaaccggag agatacccgt agacttgaga gctaacatac tagcacttaa agaaacgact 4380 aaaaccctat attttagaaa tagcccatta gttgaaacat tatcatcaat tatttcgtca 4440 gacagagatt tcaaacactt aacttttctg gaaaataatg ctaccttaaa taatttccta 4500 ttagttcaac tttgtgctat gaataatgca attccaaatt ttgatacaaa tcagttagag 4560 atccagatta atttaaaaac aaattcgaaa aacaacacta acaaaaaagt tttaaaatcc 4620 attgccactc agattatttt agataattat tccgattttt ataaaattta cactgacgga 4680 actaaaacta gtgaaggtgt aggatgtggt ttttatgatt ctcaaatgtt aatatcagtt 4740 agccataaat taaatcctct tttttcaata gctaatgcag aactaatagg aatactggaa 4800 gctatcaaat acgcttatct gaagggtgaa aagaaaatat gtattcttac cgattctaaa 4860 agtggatgtc aaatgatttt aaatggtagg cagttagaaa attatattgt aaacgaaatt 4920 tataattttt tatgtaaatc agatatcatc aaagttattg ttcagtggat acctagtcat 4980 ataggtattt tagggaatga acgtgcggat acagcagcaa aacttagttt aaataaacaa 5040 tcagttcttc cttttggttt gacattgggt gacaccatac ttgcttgtaa aaaatcaatt 5100 ttagaggatt ggaatagtag atatagattt atatcagaag agaaaggtat tcaacatttt 5160 aaaattatga attctgttag cactaaaccg tggttttaca aaatgtcatt taacacagta 5220 gatattgtta gactttctag aattagatca ctacatactg ccactaaaga gagactacat 5280 agctggagtt tagttccatc ctcattatgt gaatcttgta atgttgttga agacctatcc 5340 catatcctat tcaaatgcac aaaatacaat agaattcgta ataaatacca gatattgatc 5400 aataaaacag atataattga aatagctaaa tcaaagtctt acacagagta caaacaaatt 5460 accaaatttt tagaagaaat aaaaataagt gtttgaacta attgtttaat tacgtagaga 5520 aaatttattt taatttacct gattcgatgt gttttatttt ttcgggaatc actttaattg 5580 cgacaacaaa gataaaatca ttatgctttt tacgcggttt cgttttgtca cttgactaat 5640 ttaaacttga caacacctgg ctaaatggat caacttggtc tgtgccaaaa gagagaaaaa 5700 gaaaaaaaa 5709 // ID Gypsy-615_AA-LTR repbase; DNA; INV; 201 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-615_AA_; KW Ty3_gypsy_Ele18; Gypsy-615_AA-I; Gypsy-615_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-201 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 201 BP; 63 A; 38 C; 39 G; 61 T; 0 other; tgtaagcttt cagagaaaaa taaattatga tacaccctat cgaatatgga ccaccctata 60 gtttatagct agttatgtag aattactctg gagtgtgtac cgataggcag ttagaagtga 120 actctgatcg agaaaacatc aataaacgat ctcttgtgaa gacggtgcgt ttcattccac 180 ttttccggat ttctgcctac a 201 // ID Gypsy-45_CQ-I repbase; DNA; INV; 4763 BP. XX AC AAWU01015815; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-45_CQ_; KW Gypsy-45_CQ-LTR; Gypsy-45_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4763 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 469-469 (2011). XX DR GenBank; AAWU01015815; Positions 31222 35984. XX CC Positions [2073-2606] - Reverse transcriptase CC Positions [3745-4215] - Integrase core CC 'ATCAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 583..1950 FT /product="Gypsy-45_CQ-I_1p" FT /translation="MERFEIPPFTFKTLPANEVRDQWVRWKRNFQYAVFAS FT GETNKTKLKYILLARAGPDLQDVFQTIPGADVEADDTNGVDPYKVAVEKLD FT AYFAPKHHESMERNIFWTMKPDQGESIEKFLFRAQEQANKCNFGKTQQECR FT EICVIDKITLFAPPDLKEKILQTDKNSLDDVLKIVSTHESVKYQASQMVAA FT GPSGSRQPCLVPVEVNRMHTHPPKNHAVECSRCGKRGHLGNDPSCPALNIQ FT CNRCQRIGHYEQKCRTVMAKGAVSRATNSTPKPKRSFQRVRPVDDENIVDE FT GGDKSFIFSIGDGDEFLWIKIGGIMTQVLIDSGSNKNIIDDTTWKRMKVQG FT IEIRNATKQVDKQFRGYGKDAQPLSVIGMFDSTVEIQNENQQMQAEARFYV FT VAKGNQPLLGKETAQELNVLRLGLPGQDDRTWMVSLILILLGHFFHQLKYV FT SRLRTIARSPN" FT CDS 2890..4749 FT /product="Gypsy-45_CQ-I_2p" FT /translation="MISDVKNLSFFDNSLRTRVVADASPVALGAVLLQFES FT STDDSPKFICYASKSLTSTEQRYCQTEKESLALVWAVERFAHYLIGRTFEL FT ETDHKPLEAIFSPTSRPCLRIERWILRLQSFRFRVVYRKGSSNIADSLSRL FT TVHSNDNEDIDPDEKFLILTVLESTAIDISEIEDATIHDQELTLVKDSLRS FT GIWKYPEIKVYETFQNELGLVGELVVRGNKMIVPSSLRRRFLELGHEGHPG FT ESGMKRRLRDRVWWPGMDKVIKKWITDCEGCRLTGLPQKPEAMQRRPLPLE FT AWVDVAIDFLGPLPSGEYLFVIVDYFSRYKEIEIMTKITAKDTVDRLRAIF FT KRLGFPRTISLDNAKQFLSSEFRDYCRECGITLNYTTPYFPQQNGEVERQN FT RSLLKRLQISNALKRDWKQDLDEYLTMYYSTPHSITGKTPSELMLGRTIRT FT KLPFLREIETAPPNEEFCDRDAIAKKHACDRENIKRNARPSSIQEGDKVLM FT QNLLPGNKLTTTYSPVEYTVVRKSGTRCTVQSEDSGTTYERNSSHLKKIPT FT SLPAESVDLTSTRSSTSPSNCSISQEPLREDSPPAPLDTSPKPADIPNIPS FT RPKRVCKRPLKFDDYVDVPLDQ" XX SQ Sequence 4763 BP; 1359 A; 1131 C; 1222 G; 1051 T; 0 other; ttggcgacga ggcgggcgct ctagaaccca gggtaagaaa gaatttgttc gttattcata 60 aaaggaaaag acgtaaacat aaaaattaat cttgcttttt aaagcagccg gacgaaggca 120 tccgccgctc aaaggcatcc gccggtcgaa ggcatccgcc gatcaaaggc atccgccgct 180 caaaggcatc cgccgatcaa aggcatccgc cgctcgaagg catccgccga tcaaaggcat 240 ccgccgctcg aaggcatccg ccgatcaaag gcatccgccg atcaaaggca tccgccgctc 300 gaagggatcc gccgctcgaa gggatccgcc gctcgaaggg atccgccgcg caaaggcatc 360 tggccggtcg aaggcattcg ctagacggaa catccaccgc acggagtagg aaagctagaa 420 gcagtaccag ctgcaccgga agaggcagga cgtacgaagg tcgtccacca caaacatgtt 480 ttctcggaac tgattcggca agacgattca ggaaaccagc aaggtaaaag ttttagtttc 540 aaatgctaag tgaaattttc tgagtatttt cattctatta ggatggaaag gttcgaaatt 600 cccccgttca cattcaagac gctgccggcg aacgaagttc gtgaccaatg ggtccgctgg 660 aagcgaaatt tccagtatgc agtttttgcc agtggagaaa cgaacaaaac aaaactgaag 720 tacattcttc tggcacgggc cggtcctgac cttcaggatg ttttccaaac aatccccgga 780 gccgacgtgg aagcggacga tacgaatgga gtggatccgt acaaagttgc cgttgaaaag 840 ctcgacgctt actttgctcc aaagcaccac gaatcgatgg agcggaatat cttctggacg 900 atgaaaccag accaaggaga gtccatcgaa aagttcctgt tcagagctca agagcaagcc 960 aacaaatgca actttggcaa gacacagcaa gaatgccgcg aaatatgcgt gattgacaag 1020 atcacattgt ttgccccgcc ggacttgaag gagaagatac ttcaaacgga caagaacagc 1080 ctggatgatg tcctcaagat tgtctcaacg cacgaatccg tgaagtatca ggccagccag 1140 atggtagcag caggaccgtc cggtagtcgt caaccatgcc tggtcccggt ggaagtgaac 1200 cgaatgcaca cacatccacc aaagaaccac gctgttgagt gttcaaggtg cggcaaacgc 1260 gggcacttgg gaaatgaccc aagttgtcca gcgctcaaca tccagtgtaa ccgttgccag 1320 agaattggac actacgagca aaaatgtcga acagttatgg cgaagggcgc agtgagccga 1380 gctacgaact caaccccaaa accaaagcgc agttttcaac gagtacgacc agtagatgat 1440 gagaacatcg ttgacgaggg gggagacaaa agtttcatat tctcgatcgg agatggtgac 1500 gaatttctgt ggatcaagat tggaggcatc atgacgcaag tgctgattga ttctggaagc 1560 aacaagaaca taattgacga tacgacttgg aagcgtatga aggtacaagg aatcgaaatt 1620 cgaaatgcta ccaagcaggt cgacaagcag ttcaggggat atggcaaaga cgcacaacca 1680 ctcagcgtca ttggaatgtt tgattctaca gtcgaaatcc agaatgagaa tcaacagatg 1740 caggctgaag cacgattcta cgtcgtcgca aagggaaacc agccgctgct cgggaaggaa 1800 accgcccagg agctcaacgt actaaggctt gggctgccgg gtcaagatga ccggacctgg 1860 atggtaagtt tgattttaat actgctaggt catttctttc atcaacttaa atatgtttca 1920 agattgagaa caatagcccg ttccccaaac tgaagggaat taagctccgc atttcggttg 1980 atggctcggt aactcccgtc gctcagccag caagacgacc accccttgct ctgctgagtc 2040 gcattgagga gaagctgaac caactggaat ctacggacat aatcgagaag gtagagcagt 2100 acagcgactg ggtttcaccc ttggtggtgg tggttaaaga taatggcgac ttgagactgt 2160 gtgttgatat gcgccaggcg aaccgtgcga tcaagcggga acactttgtc atgccgacag 2220 ttgacgatat cttgcctcgg atgaacgcag caaatttttt cacccggctg gacgtcaagg 2280 acgcgttcca ccagattgag ctggaggaaa cgtcacgctc gattactacc tttattacgc 2340 accgcggcat gtaccgatac aaacgactaa tgttcgggat cagctgtgcc ccggaacaat 2400 accagaaaat catgggacag ctactggcgg ggtgcgacaa ctgtgtgcat tacatcgatg 2460 acatcattgt gtttggacgc acagaagaag aacacgatcg ttgtgtggaa aaggtgctaa 2520 ccgtactaaa gagccgtaac gttctgctga atctcaagaa atgtttgttc aaagtgactg 2580 agctcgattt ccttggtcat cacatttcgg acaaaggcat acgaccagct gacgataagg 2640 tgcgagcgat tcgagccttc agatctccgc gaaatgtcga agaacttagg agtttcctgg 2700 gactagtcac ttacgtcgga agattcttgc cgggtcttgg gacaatttcg gctccgctgc 2760 gaaagcttac tcaacgagac gttagcttca gctgggaaaa tcagcacgag caggctttct 2820 tgaggtggat tttcatattt ttcatgtatt tcttggcatg cttaaaattt caattacagg 2880 ttgaaaacga tgatttcgga cgtcaagaac ttgagttttt ttgacaactc ccttcgtaca 2940 agagtcgtcg cggatgcatc tccggtagca ctgggcgctg ttttgttgca atttgaatcc 3000 tcaaccgacg acagcccaaa atttatttgc tacgcaagca aaagtttgac atctaccgag 3060 cagcgatact gccagaccga gaaagagtcc ttggcgttgg tttgggcggt cgaacgattt 3120 gcgcattatc tcatcggtcg gacgtttgaa ttagagactg accacaaacc cctcgaagca 3180 attttctcac caacatcacg cccatgcttg aggatcgaac gatggattct tcgtctacaa 3240 tcgttcaggt tccgagttgt ctaccggaaa gggtcatcta acattgctga ctctttgtcg 3300 aggctaaccg tgcattccaa tgacaacgaa gatatcgatc ctgatgagaa gttcctgatt 3360 ctgaccgtgt tggaatcaac ggcgatcgat attagtgaga ttgaggacgc gaccattcac 3420 gatcaagagc tgaccctcgt gaaggatagc ttgcgctcag gaatttggaa gtacccggag 3480 atcaaagtgt acgaaacctt ccaaaacgaa cttggattgg tcggagagct cgttgtcagg 3540 ggcaacaaga tgatcgttcc gtcaagtttg cgcaggagat tccttgaact gggacatgag 3600 ggacatccag gagaatccgg aatgaagcgg aggttgagag acagagtatg gtggccagga 3660 atggacaagg tcatcaagaa atggattact gattgtgagg gctgccgact aacaggactt 3720 ccgcagaaac cagaagcgat gcaacgaaga ccactgcccc tggaagcttg ggtggatgtt 3780 gccatcgatt ttcttggccc gcttccttct ggggagtacc tcttcgtaat cgttgactac 3840 ttcagccgct ataaggaaat cgaaataatg accaaaatca cagcgaagga taccgttgat 3900 cgacttcgtg ccattttcaa gcggctgggg ttcccgagga caatcagtct cgacaatgcg 3960 aagcagttct tgagttcgga gttccgggac tattgtaggg aatgcggtat aaccctgaac 4020 tacacaacac cgtacttccc gcagcaaaat ggagaggtcg aaaggcaaaa cagatcattg 4080 ctgaagcggc tgcagatcag caacgctttg aaacgagact ggaagcagga ccttgacgag 4140 tacttgacga tgtactactc tacgccgcat tccataaccg gaaagacacc gtcggagttg 4200 atgctgggac gcacgattag aaccaaactt ccgttcttga gggaaattga aaccgctccg 4260 ccaaacgaag agttctgtga tcgcgatgct attgcgaaga aacatgcttg tgaccgtgag 4320 aacatcaagc ggaacgcgag accttcgtcg attcaagaag gggataaagt gttgatgcaa 4380 aatttactac ccggaaacaa gctcacgaca acctactctc cggtcgagta tactgttgta 4440 cggaaatctg gtacacgttg cacagtacag agcgaggata gtggaacaac ctatgagcga 4500 aactcttcgc atctgaaaaa gattccaact tcactgccag cggaatctgt agatctaaca 4560 tcaactcgct catcaacatc tccatctaac tgttctattt cgcaggaacc actccgggaa 4620 gattctccac cagcaccatt ggatacctca ccgaagcctg ctgatatccc taatattccg 4680 tcaaggccaa agcgagtttg caaacgtccg ctgaagtttg atgattacgt tgatgttccg 4740 ctcgatcaat aaaggaaggg aga 4763 // ID Gypsy-76_CQ-I repbase; DNA; INV; 2989 BP. XX AC AAWU01003228; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-76_CQ_; KW Gypsy-76_CQ-LTR; Gypsy-76_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2989 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 531-531 (2011). XX DR Genome; AAWU01003228; Positions 11529 14517. XX CC Positions [1875-2387] - Integrase core CC 'AATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 306..2975 FT /product="Gypsy-76_CQ-I_1p" FT /translation="MSDDESKVEVKEETSKEESPVIAKIDFPRFDTDDIET FT WFICLEAAFSVNGIKRDKTKFNAVIVALENRAKYVYSVIKSCSDSTSTNKY FT DTLKAAVIAHFRPSESQQLTSLLSGLSLGDLEFVFPYVDDLCIASENIVQH FT KAHLRTVFERLRQNGLTINTGKCQIGLPKVEFLGHLITPEGIKPKPDKVKA FT ILDFPRPSVAKQLKRFLGSINFYRRFIKNAAIHQQVLHSMISGNIKNDNTP FT LVWTEATNGAFEKCKQDLVNCTFLAHPSPEAKLALETDASGTAIGAVLHQI FT TEDGPLPLAFFSKKLNESEQKSSTYDRELLGIYEAIKYFKDTLEARDFCVY FT TDHKPLTTAFQQRPERANPTQLRRLCFISEYTTDVRHVPGDDNKVADMLSR FT IESITSADTIDFDLMAEQQKTDPELETFLKNPPTNTTLNLKSLKSPNSKSP FT IYCDISTGVIRPFVPAQLRRSVIRKLHSGSHPGLRATTALVSERYVWPNLR FT RDCKEVVTSCIPCQKSKVHRNNRSPISQIATPDTRFSHVHMDLIGPLPPSD FT GNAYCLTLIDKFTRWPEIIPIPNMTAHTVARSFVAGWIARFGVPATVTTDL FT GRQFESELFRTLTQLLGITHLRTTPYHPQANGQIERVHRHAKSAIMCHQTP FT NWTEVLPLVLLGMRTSLKEDLQATAADLVYGTSLRLPGEFFTAEESKSPTP FT EFVTDLKATMEKLRPTPTSNHSKGTTCVQKDLDTCSHVFVRVGAIKPPLTQ FT PYTGPHKVIRRKGKVFVVEINGKHCPISIDRLKVAFTQAEDEPSRAHVQPN FT TTGSSETSRTSEPKKTPSSKSKPKSKSSLSEPRLQTSLPEPKLFKPLKSVI FT KNTKTSDAEPPTASSEQSTYRTRSGRQVKFPRQYWKI" XX SQ Sequence 2989 BP; 912 A; 753 C; 601 G; 723 T; 0 other; atcagaaaat atataactct cgtctgagaa ggacgccatc agaaccacgc ggtttagatc 60 cggtttctac agcgtccgaa ataagtaaca ttagttgtta cattggtgac cccgaacgta 120 aaactaataa attccgcgtt ccgcagtgac ccgcgaaagt gattttattt ttatttttcg 180 caaaaatcac tcgtgttgcg cgaatcttgt tcttcaaaaa gaaaccattt tgtacgcctt 240 tgtactaacg ccacgttaag tgaactactg cttcagtgca gtaagaagtt actgccttag 300 taacaatgtc ggatgatgaa tctaaagttg aggttaagga agaaacttcc aaggaagaat 360 cgcccgtaat tgcaaagatc gatttcccac gtttcgatac ggatgacatt gagacgtggt 420 ttatttgcct tgaggccgca ttcagtgtta atggaattaa acgagataaa acgaagttta 480 atgcggtgat cgttgcgctt gaaaatcgtg ctaaatatgt gtattcggtg atcaaatcgt 540 gtagtgattc gacttccaca aacaagtacg acacgctgaa agccgcggtg attgcgcatt 600 tccgaccctc ggaatcgcaa cagttaacta gccttttgtc cggactgtct ctcggtgacc 660 tagaatttgt tttcccttat gtcgatgatc tttgcattgc gtctgaaaat atcgtacagc 720 ataaagcaca ccttcgcaca gtatttgaac ggttgaggca aaatggtctc acaatcaaca 780 ctgggaaatg tcaaattggt ctaccaaaag ttgaatttct tggtcatctc atcaccccgg 840 aaggtatcaa gccgaaacca gacaaggtaa aggcgattct tgattttcct cggcctagcg 900 ttgccaaaca actaaagcga tttctaggga gcataaattt ttatcgacgc ttcataaaaa 960 atgctgctat ccaccagcaa gtccttcatt caatgatatc tggcaacata aaaaacgaca 1020 atacgccgct tgtgtggacc gaagcaacga atggagcttt tgagaaatgc aaacaagact 1080 tggttaactg cacttttctc gctcatccgt cgccagaagc aaagctcgcc ttagaaacag 1140 acgcttcggg cacggcaatt ggcgcagttt tacaccagat cactgaggat ggcccacttc 1200 ctctagcctt cttctctaaa aaattgaatg aaagtgaaca aaaatcgagt acatatgacc 1260 gagaattgct aggtatttac gaagccatta aatacttcaa agacacccta gaagctcgtg 1320 atttttgtgt ttacacggat cataagccgc tcacgacagc ctttcaacaa cgcccagaaa 1380 gagctaaccc tacgcagtta cgcagactct gtttcatcag cgagtacaca acagacgtgc 1440 gacacgtgcc tggcgacgat aataaagtcg cagacatgct aagccgtatc gaatcgatta 1500 cttccgctga tacaatcgat ttcgacctca tggcagaaca acaaaaaacc gatcccgaac 1560 tagaaacctt tttaaagaac cccccaacca acaccacgtt gaacctcaag tcactcaaat 1620 cgccgaattc aaaatcacca atctactgtg atatctccac gggagtgatt cgaccgtttg 1680 tacctgcaca gcttcggcgc agcgtaattc ggaaactcca tagtggatct catccgggat 1740 tacgagcaac caccgcactt gtctcggagc gttacgtgtg gccgaacttg cgtcgagatt 1800 gtaaggaagt cgtcacgagt tgcattccct gtcaaaagtc gaaggtccat cgaaacaatc 1860 gctcaccaat ttcgcagatc gccaccccag acacacgttt ctcacacgtc cacatggacc 1920 tcatcggacc gttaccacct tcagacggta acgcgtactg tctaacgctg atcgacaagt 1980 ttactcggtg gcctgagatc attccaatac caaacatgac agcgcacacc gtcgcgcgct 2040 cttttgtcgc aggttggatc gctagatttg gagtaccagc caccgtgaca acagaccttg 2100 gaaggcaatt tgagtctgag ttgtttagaa cgctcactca gttgcttggt attacccatc 2160 tgaggacaac accttatcat ccgcaagcaa acggccaaat cgagcgagtt catcgtcacg 2220 ctaaatcagc aatcatgtgt catcaaactc cgaactggac agaagttctc ccactcgttc 2280 tcctcggaat gcgaacatct ttgaaggaag acttacaagc aacagcagct gatctcgtct 2340 acggaacatc gctgcgctta cccggtgaat tttttacagc agaagaatct aaatcaccta 2400 cgccagagtt tgtaactgac ttgaaggcaa ctatggaaaa gttgcgtcct actccaacat 2460 cgaaccactc gaagggaaca acgtgtgtgc agaaggactt agatacctgc agtcacgtgt 2520 ttgtgagagt tggagccatt aagcctccgc tcacacagcc gtatactgga cctcacaaag 2580 taatacgcag aaaaggaaaa gttttcgtcg tcgaaataaa tggcaaacac tgtcccatct 2640 ccatcgaccg tctcaaagta gcttttacac aagctgaaga cgaaccaagt cgagcacatg 2700 ttcaacccaa cactaccgga tcatccgaaa cctccagaac atctgaacct aaaaaaactc 2760 catccagtaa atctaaaccc aaatctaaat ccagtttatc tgaacccaga cttcaaacca 2820 gtttacctga acccaaatta ttcaaaccct taaaatctgt aatcaaaaac actaaaacca 2880 gcgatgctga accacctaca gcatcatcag agcagagtac ctacagaacc agatcaggcc 2940 gacaagtgaa atttccccgg cagtattgga aaatctaagg gaggagtac 2989 // ID BEL-9_AA-LTR repbase; DNA; INV; 586 BP. XX AC supercont1.344; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-9_AA_; KW BEL-9_AA-I; BEL-9_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-586 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.344; Positions 974817 974232. XX SQ Sequence 586 BP; 175 A; 122 C; 131 G; 158 T; 0 other; tgatcagtat cagctgatca taattttgat gtgaatttgt aaatatgtaa ataacccact 60 aaccctagaa ttaagcaatg aatttgaacc accctaacat tatatagact ctcacgagct 120 ttactgagag cgaatacgaa cggtcaactt tctgcctgaa cgaagaaacg gcaattaaat 180 cagtagcaaa taaaacgttg aacggttcgc taggaaggct atattcggtc ggtgattatt 240 gttcttgttg cccgtagtct acgaaataaa gtgtagtgtc gtggtggaat aagcaaacca 300 atttcaactg tggttccgaa ataattatct attggtcggt cgtgaagtgc gtgcgtgaat 360 aaattcacga aactattcgc tcaattctgt gtggtgatcg tgaagcctcc aagagatctg 420 tgcagatctg gtcttcttat cactttcgaa caagaaggtg aagaaaacgt gtgacagtgt 480 ttcgtaccgc caaattgtcc ccactggttt ctgtgcgccg ttcgaagcac gaacggggag 540 aaacaccccc gaatcccggc aacatatcga gcctgcggct cgaaca 586 // ID Copia-23_SI-I repbase; DNA; INV; 4199 BP. XX AC AEAQ01023869; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-23_SI_; KW Copia-23_SI-LTR; Copia-23_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-4199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01023869; Positions 4681 483. XX CC Positions [1635-2135] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 186..4112 FT /product="Copia-23_SI-I_1p" FT /translation="MEFFHGVEKLTGKDNWTTWKFSSRNLLRGVEGAYEVC FT IGELVKPAVLAAGANPAEQQAYSAVLQKWDKADRAAAQILVKTLDPKVLAL FT LVACESAREMWIKLHAIFEQQTKLAAHAVQSEFFGFSMAPGDDLVSHIAKF FT EALVLRLQQLNKKPDDSSLMVRLLDTLPESYESFRQSWWARPEDKQTLEEL FT IAVLTADDTRRNVYKVKQEKTEALFAAANLKNKNGATSTRVLERKPADTKK FT KKVPKCFGCGEKGHVRKDCKKDKGKKTSSNSNSSSTSGGDFAFVMEVENAE FT CEQDVWLMDSGATNHYCLHREWFVTYEVFAKPLTVLVGNNSTMLAVGKGRI FT NTEALVNGEWLTCHIDNVLHVPDGRRNLMSLSLILDKGLNMRVNRKRCEFV FT KDGVVRLCGERYGMLYKLHIRATSNTPCEVNLASAESLQVWHERLGHQDKR FT HVTKFLRDRGVTVITDDSFCAACVEGKQQRTTFKSRTQRSTRAGEIMHADV FT CGPMEEQSLGGAKYFVCFTCDYSKFRIVQFLKEKSEVVATVEELLKFVENQ FT CGRPLRVFQSDGGLEFNNEKMKKLMKANGVQFVITNPYSPEQNGCAERTNR FT TIVESARTMLLAKKLPKILWAEAVNTAVHLLNRSGPTGESGKAPYEIFTGK FT SVNLHQLHVFGTSCFVQIPKERRHKWDAKGKPGVLVGYSQDIDGYRVWIQG FT EKTVVRSKNVTFEPEVGGKTVTVVSDDCEPKTKAVEPVESHPGRVRADEMP FT DESEIDAERETQFAGGKQTSNNVPVVEYQGRLRDRSTLCPPERYVEMCVVE FT LEPRSYKEAMESVERESWKEAMREELQSLAENSTWELTELPSNRRAITNRW FT VFKVKRNPNGSINRHKARLVVRGFNQRPGVDFLETFSPVVRFDTLRSVLSV FT AATEHLKLAQFDVKTAFLNGNLSEDIYMTQPEGFADGSNRVCKLKRSLYGL FT KQSPRCWNYKFKEVVMGLGMTESSADPCLFVRSVNGNKLLVVLYVDDGLVA FT ATRQSDIDKFLRELRSRFKITIESFGCFLNLQIAQDADGSITIHQQGYAET FT VLRRFKMDGANPVSTPIDQYIHQNTADVVLSSAPYREAVGCLMYLAVATRP FT DIVFAVNYASQFLEKPEMKHWELVKRILRYVRGSSSTGIRYSTSLASGELH FT VYSDADYASDPKTRKSVSGMISMYCGGAITWASNRQKCVSLSTTEAEYVAA FT CEAAKEAVWLKYLFEDISPLKSVPTLIVDSTSAMKLSKNPEFHKKSKHIDV FT RYHYLRDQVVNDRLCMTYVCSKQQVADICTKPIPRVQFCYLRSLIGMY" XX SQ Sequence 4199 BP; 1130 A; 769 C; 1167 G; 1133 T; 0 other; tggtagcaga gcgtggtttt tggtgtagag gaggggagat cgatattgat ttcttgtgtt 60 cgcgtgatct tcgtcggcca tttttttttt cgttgctgtt cattctcgag agacgcgtgt 120 ttcggcgtga gtggaacttt gtcattttcg ttgactgaaa gtttttgctg gtgatcgcta 180 tagacatgga gttttttcac ggagttgaga agttaacggg aaaggacaat tggaccacgt 240 ggaagttctc gtcgcggaat ctgttaagag gcgttgaagg agcttacgag gtgtgcattg 300 gtgaactggt aaaaccggcg gttcttgctg caggtgcgaa tccggccgaa caacaggcct 360 attcggcggt cctgcaaaag tgggataaag cggatcgtgc tgctgctcaa attttggtga 420 aaacacttga tccgaaagtg ctggcgttac ttgtggcatg tgaaagtgca agagaaatgt 480 ggataaaatt gcacgccatt tttgagcagc aaacgaagct ggctgctcac gcagtccagt 540 cagaattttt cggttttagc atggcgccgg gtgacgatct ggtcagccat attgccaaat 600 tcgaggcgct tgtgttgaga ctgcaacagt tgaacaagaa gccggatgac tcatcgctaa 660 tggtgcggtt gctggacact ttgccggaaa gctacgaaag ttttcgacag tcatggtggg 720 cacgcccaga agacaaacag acgctcgagg aacttattgc agttctcacc gccgacgaca 780 cgcgacgcaa cgtatacaag gtgaaacaag aaaaaacgga ggcgttattt gcagcggcta 840 atttgaaaaa caaaaacggt gctacttcga cgcgtgttct tgagcggaaa cctgctgaca 900 cgaagaaaaa gaaagtgcca aagtgtttcg gttgcggcga aaaagggcat gtgaggaagg 960 actgcaaaaa ggacaagggc aagaagacgt caagcaatag taactctagt agtactagtg 1020 gcggtgattt tgcttttgtt atggaagttg agaacgctga atgcgaacag gatgtttggc 1080 ttatggactc gggagctacg aatcattact gtcttcatcg agaatggttt gttacctatg 1140 aagtttttgc aaaacctctc actgttcttg ttggtaataa ctcgacaatg cttgctgttg 1200 gaaaaggaag aatcaatact gaggctctgg taaatggaga gtggttgacc tgtcacattg 1260 ataacgtttt gcatgttcca gacggccgtc gcaatttgat gtcacttagt ctcattcttg 1320 acaaaggttt gaatatgcga gtgaatcgga agaggtgcga gtttgtcaaa gatggagttg 1380 tacgtttatg tggagagagg tatggaatgc tgtataaact gcatatccgt gcaacaagca 1440 ataccccttg tgaggttaat ctggcgtctg ctgagtctct tcaagtgtgg cacgagcgct 1500 tgggtcacca ggacaaacgt catgtcacaa agttcctgcg agatcgtggt gtgacagtca 1560 tcactgatga ctctttttgt gcggcgtgtg tcgaggggaa gcagcagagg actaccttta 1620 agtcgagaac gcaacgttca acgagagctg gcgaaatcat gcatgctgac gtctgtgggc 1680 caatggagga gcagtctttg ggaggcgcta agtattttgt ctgtttcacc tgtgattact 1740 cgaagtttcg cattgtacag tttttgaaag aaaaatctga ggttgttgct acagtcgagg 1800 agcttttgaa gtttgttgaa aatcaatgtg gacgtccatt gcgcgtattt cagagtgatg 1860 gaggacttga gtttaataat gagaagatga agaagctcat gaaggccaac ggagtccagt 1920 ttgtgattac caacccttac tctccagagc agaatggatg cgcggagcgt accaatcgca 1980 caatcgtgga gtcagcacgc acgatgcttt tagcaaagaa attgcctaaa atattgtggg 2040 ctgaagcagt caacaccgct gtccacttac tcaatcgatc cggtcctact ggagagagtg 2100 gaaaagcacc ttatgagatt tttactggaa aatccgtgaa cttgcatcag ttgcacgtgt 2160 ttggaactag ttgttttgta caaattccaa aggagcgacg acacaaatgg gatgcaaagg 2220 gaaagccagg agtcttggtg ggctattctc aagatatcga cggttatcga gtctggattc 2280 aaggtgagaa aactgtagtc cggagcaaga acgtgacttt tgagcctgaa gttggtggaa 2340 agactgttac tgttgtgtct gatgattgtg agccgaagac taaggcagtt gagccagtag 2400 agtctcatcc tggaagagtt agagctgatg aaatgccaga tgagtctgag atcgacgcag 2460 aacgtgagac tcagtttgcc ggtggaaagc agacaagtaa caatgtacct gtagttgagt 2520 atcagggacg tttgcgagac aggagtactt tatgtccacc agaacgttat gttgagatgt 2580 gtgtggttga gcttgaaccg agaagctaca aggaagcaat ggagtcagtc gagagagagt 2640 cttggaaaga agcgatgcga gaggaacttc aatcactggc tgagaactct acttgggaat 2700 tgacggagct gccttccaat cgtcgagcca tcacgaatcg ttgggtattt aaagtcaaaa 2760 ggaatcctaa tggtagtatt aaccgacaca aggctagact ggttgtgcgc ggtttcaacc 2820 agaggccagg tgtggatttc ctggaaactt ttagccctgt ggtacgattc gatactcttc 2880 ggtcggtgct aagtgttgca gccactgagc atttgaaact cgctcaattt gacgtgaaga 2940 cggcgtttct taacgggaac ctatctgagg acatttacat gacacaaccc gagggattcg 3000 cagatggtag caatcgagtg tgcaagctga aacggagctt gtacggactc aagcagtcac 3060 cccgctgctg gaactataaa ttcaaagagg tcgtgatggg acttggtatg acagagagta 3120 gtgcagaccc ttgtttattc gtcaggagtg tcaatggtaa caagttgtta gtggttctgt 3180 atgttgatga cggattggtg gcagcgacaa gacagagtga cattgataaa ttcttgaggg 3240 agttacggtc aaggttcaaa attacaattg agagttttgg ttgcttcttg aatctgcaaa 3300 ttgctcagga tgctgatggt tcaataacaa tccatcaaca aggttatgcc gagacagtgc 3360 tgcgtcgctt caaaatggat ggagcaaacc ctgtgtccac tcctattgat cagtatattc 3420 atcagaatac tgctgatgta gttctgtcat cggcgccgta tcgtgaggca gttggttgtc 3480 ttatgtacct ggcagtggca acccggccag atattgtttt tgctgtcaac tatgcatcgc 3540 agttcttgga gaaaccggag atgaagcact gggaattggt gaagaggatc ttacggtacg 3600 tgcgaggatc atcatctacg gggattcgct attcgacgag cttggcgtct ggagagctgc 3660 atgtatatag tgacgctgat tatgcaagtg atccgaaaac gcgtaaatct gtcagtggca 3720 tgatcagcat gtattgtggc ggtgccatta cgtgggcgag taatcggcaa aagtgtgtca 3780 gtctctcgac aaccgaggct gaatacgtgg ctgcgtgtga agcggcgaaa gaagcagtgt 3840 ggttaaagta tctatttgag gacatttcgc ctttaaagtc cgtgccaacg ttgatcgtag 3900 atagtaccag tgcgatgaaa ttgtctaaaa accctgaatt tcacaagaaa agtaagcaca 3960 ttgatgtgcg ttaccactat ttgcgagatc aagttgtaaa tgatcgtttg tgtatgacgt 4020 atgtgtgtag taagcagcaa gtagctgata tttgtacgaa accgattcca cgagttcagt 4080 tttgctactt gagatctctc attggaatgt attaaacagt tttttttgcg tttgtattgt 4140 ttaatagtct ttcggttatt aaaagagaga tcaatcatct taaattgtag gggaagtat 4199 // ID Copia-30_CQ-I repbase; DNA; INV; 3808 BP. XX AC AAWU01004727; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-30_CQ_; KW Copia-30_CQ-LTR; Copia-30_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3808 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 369-369 (2011). XX DR GenBank; AAWU01004727; Positions 27475 23668. XX CC 'AAAAC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 3..3806 FT /product="Copia-30_CQ-I_1p" FT /translation="MAQYTTGNSRQVKESLFDLNIRHLETDTMADEKQKIL FT LIPTFHGAVKSDYPCWKVRVLHYLKREGLSHCLEKIPAKESYAEVAQYEMR FT LKDDNEVVTIIWHSIDNTAMKHVLGLVFAKQIMDKLDHVYERQGQLTLFDY FT RRQLYRLNLTSFGTLTEQVTDDEKRSTLMAAIPDQFENTMDALAVMRKDDI FT RKMSLDEFKGIFLDAEMKKKLTDDEVVSSLALAARRKKEERTCYRCGMAGY FT TAWKCGQEPEREEFPAGRRHPSPQRYGRVSPQRSGRANASGERIVRFVDAS FT PEREHGREFHQDRSNFALIVQTGPARQIAHGSTRPNALGQPEEQSARGEDR FT GVQERRAGEKPAWQTAHGAPARQIAHGTTQPIALGQPEERQNRGEDQRVQI FT VRREEENPAWQTAYGEAAWQIAHGLTRPNALSQPERQAARGEDYGVQEWRG FT IGKPVWHTAHGEFEKRSTYLQTRLKARGQPARLVARGEDHRVQEWRGVDES FT AWQIAHGLKEPNTLGQPEERQARGEDHEVQIVRRRGNQPERQIAHGREGRR FT IAHVPTHLTAFGQPAMCKNRGEDRRVQNLKREVEDPERHIAQGKSEWRITH FT LHTQPKALGQPLERKTHGEDRKVQNFRREVINSERQIAHGEAARQMAHVST FT HSAVFGQPEEYRTRGADRRKQILRREVVGKSEWLIAHGNSAQRIGHVQARP FT IALDRLSERNAKGEDRSVQQEWRRAEQPARQAAFGADKKQQSGSGAGEQKR FT ARAVKMIMSSSTTDRKVLKECWPEEPRTNTTAKDSEMIIGKKRGVLKSNAI FT LREQVKTLFVLILFLNLLSVSKIFSTRKRVRFDYECKFIVDKKEDDEEYDI FT ECNDQALLSKSGNLAEIWHKRFHRRDTGYNSMVDGCNFEISQIEGKLVVSK FT SCELRKQARHLFIIEEPVGVLPSLICQQAQQVEYNGLRHLAVVYLMRYSNL FT ILEKFKELEALVYTNFLVQNSKLRLGIIEELDCLKENSICSFKVKDRESVL FT MIRAKFLKMCAEASVKLPIDESGERMIGLKLTGRSWKERSNEVVRKMDFDR FT LHGNTSVHVSEDQSLCIALQSYVRGALCRFGIQLNLNCMKIDVPPKKKPYQ FT TVCYFSRFQNCATERHWSGLQQVLRYLRGTTKLFVLNDALDDKVIDNMFSW FT SSKRQKPSISSTEAKSASSCNAAKEGSWMSSQLEYYTFYKDQHCRGARRTS FT TIEEHRHRLRPAGHDTSGETEGRICAEYRTTCRHTYEAVRKTTIREDIRIS FT LIESLRG" XX SQ Sequence 3808 BP; 1136 A; 763 C; 1103 G; 806 T; 0 other; gtatggccca gtacactact ggaaatagca ggcaggtgaa agaatcgttg tttgatttaa 60 atatcagaca tttggagacc gatacgatgg ctgatgagaa gcagaaaatt ctgcttattc 120 cgaccttcca tggagcagtc aagtctgact atccgtgttg gaaggtgagg gttctacact 180 atctcaagcg agaagggttg tcccattgct tagagaaaat accagcgaag gaaagctatg 240 cggaagttgc gcaatacgag atgcggctca aggatgacaa tgaggttgtc accatcattt 300 ggcactcgat tgacaacact gctatgaaac acgtgcttgg attagtattt gccaagcaga 360 tcatggacaa gcttgatcac gtgtacgaac gtcaggggca gctgacactg ttcgattacc 420 gtcggcagct gtaccgtctt aacctgacga gtttcggcac cctgaccgaa caggtgacgg 480 acgacgagaa gcggagtact ctgatggcgg cgattccaga tcagttcgag aacacgatgg 540 acgctctagc cgtgatgagg aaggacgaca tccggaagat gtccctggac gagttcaagg 600 gtatattctt ggacgccgag atgaagaaga agctgactga tgacgaagtt gtatcaagcc 660 tagcgttggc agcaagaagg aagaaggagg agcggacatg ttatcgctgt ggcatggctg 720 gctacacggc ctggaaatgt gggcaagagc cagagcggga ggaatttcca gctggccgac 780 ggcatcccag cccgcagagg tatgggagag taagtccgca gagatccggg agagctaatg 840 cctccggaga gcgcatagtc cgatttgtgg atgcgtcacc agagcgcgaa catggacgtg 900 agtttcacca ggatcgctcg aatttcgcac ttattgtgca aactggacca gcgcggcaga 960 ttgcccatgg atcgacccga ccaaatgcac ttggtcagcc agaagagcaa tctgctcgtg 1020 gcgaggaccg cggtgtgcag gagcggcgag cgggggaaaa accagcgtgg cagactgccc 1080 atggtgcacc agcgcggcag attgcccatg gaacaactca gccgattgca cttggtcagc 1140 cagaagaacg tcagaatcgt ggagaagacc agagagtgca aattgttcgg cgagaggaag 1200 aaaatccagc atggcagaca gcctatggtg aagcagcgtg gcagattgcc catggattaa 1260 cccggccgaa tgcacttagt cagccagaga gacaagcagc tcgtggcgag gactacggtg 1320 tgcaggagtg gcgagggatt ggcaaaccag tgtggcatac agcccatggt gaattcgaga 1380 agcgtagtac ttatttacaa acccggctaa aggcacgtgg ccagccagcg aggcttgttg 1440 ctcgaggcga ggaccataga gtgcaagagt ggcgaggggt agatgaatca gcgtggcaga 1500 ttgcccatgg attaaaagag ccgaatacac taggtcaacc agaagagcgt caggctcgtg 1560 gagaagacca cgaagtgcaa attgtacggc gaagaggaaa ccaaccagag cggcagattg 1620 cccatggcag agaagggcgg cgaattgccc atgtaccaac ccatctgaca gcatttggtc 1680 aaccagcgat gtgtaaaaat cgtggagaag accgcagagt gcagaattta aagagagagg 1740 ttgaagatcc agagcggcac attgcccagg gcaaatcaga gtggcgtata actcatttac 1800 atacccagcc gaaggcactt ggtcaaccat tggagcgtaa gactcatggc gaggaccgta 1860 aagtgcagaa ttttcgcaga gaggtaataa actcggagcg acagatagct catggcgaag 1920 cagcgcggca gatggcccat gtatcaactc actcggcagt gtttggtcaa ccagaggagt 1980 accgaactcg aggagcagac cgaagaaagc agattttacg gagagaggta gtaggaaaat 2040 cagagtggtt gattgcccat ggtaattcag cgcagcgaat cggtcatgta caagcccggc 2100 cgatagcact tgatcgatta tcagagcgga atgctaaagg agaagataga agcgtgcagc 2160 aggaatggcg aagggcagag caaccagcaa ggcaagcagc ctttggtgcc gacaagaaac 2220 agcagtcagg gagtggtgcg ggagagcaaa agcgagccag agcggtcaag atgatcatga 2280 gctcgagtac caccgaccgt aaagtcctga aggaatgctg gccagaggaa ccaagaacaa 2340 acacgaccgc gaaggacagt gagatgataa tcggcaagaa gagaggagtg ttgaagtcaa 2400 atgccatatt gagagaacaa gtaaagacac tttttgttct aattttgttc ttgaacctct 2460 tgtccgtatc aaagatattt tccactagga agcgagtgcg ttttgactac gagtgtaagt 2520 ttatcgtaga caagaaagaa gatgatgagg aatatgatat tgagtgcaat gatcaagctt 2580 tgttgagcaa atcagggaat ttggctgaga tctggcacaa acggttccat cgtcgtgata 2640 cgggctacaa cagcatggta gatggatgta attttgaaat ttcgcagatt gaggggaagc 2700 tggtggtcag taagtcttgt gagttgagaa aacaagccag acacctcttc ataatcgaag 2760 agccagtggg agtgctgccc agtttaatct gtcagcaagc gcagcaggtt gaatacaacg 2820 gcttgaggca cttagcggtt gtttatctga tgaggtattc gaatttgatt ttggagaagt 2880 ttaaggagtt ggaagcattg gtttatacta attttctcgt ccaaaattcg aagttacgat 2940 taggtattat agaagaactt gattgtttga aagagaactc catttgttcg ttcaaagtca 3000 aggaccgtga aagtgtgcta atgattcgag ccaaattttt aaagatgtgc gcagaggcaa 3060 gtgtgaaatt gccgatcgat gagagcgggg agagaatgat cggattgaaa ctaacaggtc 3120 gcagctggaa ggaacgatct aatgaggttg tgaggaagat ggacttcgac agacttcatg 3180 gcaacaccag cgttcatgtt tctgaggatc aatcgctctg catcgctcta caaagctacg 3240 tgcgaggggc actgtgtagg tttggaatac aactgaatct taattgtatg aagatcgatg 3300 ttccccctaa gaaaaagccg taccaaactg tatgctattt cagcagattt caaaattgtg 3360 caacagaaag acattggtct gggttgcagc aagttttgag atacctacgt ggtacaacta 3420 agttgtttgt tctaaacgat gctctagatg acaaggtgat tgacaacatg ttttcttggt 3480 catcgaagcg gcagaaacct agtatttcgt cgacagaagc gaagtcagcg tcttcgtgca 3540 acgctgccaa ggaagggagc tggatgagca gtcaacttga atattacact ttctacaagg 3600 accagcattg cagaggagct aggagaacat caacaatcga agaacatcga catcgacttc 3660 gacctgcggg acatgataca agcggagaaa ctgagggtag aatatgtgct gaatatagaa 3720 caacctgcag acatacttac gaagctgtta ggaagacaac aattcgagaa gatattagaa 3780 tttctttgat tgaaagtttg aggggaag 3808 // ID Gypsy-116_AA-LTR repbase; DNA; INV; 408 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-116_AA_; KW Gypsy-116_AA-I; Gypsy-116_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-408 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1003-1003 (2011). XX DR [2] (Consensus) XX SQ Sequence 408 BP; 103 A; 90 C; 131 G; 80 T; 4 other; tgcgacgctg tttccgactc ggtcggaggg acagaagatc gctctcttgg tgcttgtggc 60 tcgtggaata cggacgtgct tggcgagatt aaaggatacc cgatattcag tgtccgcgta 120 gccgaaagtg aataccgaaa atggatcatt taaaacccta atagcctaat ttggccgcga 180 agaagwttag tggaacggag tggagtgaat agttacccgg gagtgcccgg agcgcagtcg 240 ggaagtgtca gggaccasgt gaaaaggccc tgaggtgtga ccgcgtgtwc ctggcgtgcc 300 scgaaagtag ccagtgtggt gtaccggtgg agcccgaaga aacgcggaaa cctcctaaac 360 cccgagatcc caaacccgaa attagtgtgg gaatagtgcg gtgaggca 408 // ID Gypsy-95_AA-LTR repbase; DNA; INV; 308 BP. XX AC supercont1.1; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-95_AA_; KW Gypsy-95_AA-I; Gypsy-95_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-308 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1; Positions 3659339 3659032. XX SQ Sequence 308 BP; 91 A; 67 C; 61 G; 89 T; 0 other; tgtatactaa gcaacatctt gactgaatac actttagagt gtattcagtc atcaacaatc 60 attagcgcga gcatacggct gcgcaagcgt aagcgtaagc gtatgcgcag cattgcaggc 120 aacactgctg tcaaatgtat aaaagtaaat aaatcgagct gtctggctct cttctacttg 180 aaacttcaaa caatcaacaa gtgttattaa gtctccagaa attcccttcc cgaaagtgat 240 ttgttctacg ttggttcctc ccggttggtc gtagttaagt gaaattctct agttccggat 300 actagaca 308 // ID I_Ele37 repbase; DNA; INV; 6621 BP. XX AC . XX DT 27-OCT-2010 (Rel. 15.1, Created) DT 27-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele37. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6621 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6621 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (08-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 18 sequences with >91% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 305..1963 FT /product="I_Ele37_1p" FT /translation="MALANGFPPLLGDPGGPGGGGGGTGPSNKINGEYTGR FT LLPSFMDHAGTSGELRFLKMEAVSGSLPQDPFLLRLSVEKFINGPIDGAYK FT ENKGIAYVLKVRSKAQADSLRRMTKLADGTSIRINEHDTLNQRKCVVSNYD FT TIGLTEEYLQSQLSAQGVKEVRRIKRKNAIGELENTPTVILTICGTVIPPH FT IDFGWNRCKTRNFYPAPMLCFRCWEYGHTGKRCREPHRICGKCSKVHPEDR FT IVTTAPDLTEEPTVSAGSSNPPSAIERNRTPCLEAAFCKICQSDDHSVSSR FT KCPAYVRECEIQHIRVDMGISYPQARREYESRQSASSCSTAYTGVVNASKD FT KEIADLTAKVQKLQSDTKMKERRIEEMERQLQNRGVGDRLETVQQNGTIED FT LIRKVTALTATVEKLEEALVKKDETIKRQKKEIALFRAMETKSECSNVTVP FT ETQSSTETESIIPGTVEQETTEQVAEWVKCNSRKQPEKNCQKSSKNKDNSP FT AVNSMTDLHGTRNLNTPHALNLPKRNREEESSGDSLSIKAPAIKRSIRARK FT MKGNQK" FT CDS 1950..6212 FT /product="I_Ele37_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="RETKSNPILRNAQSLYEQSNLKQPLDSPHKIFTSDEE FT DECSTSKEWTEQYETQPDKNVATFEKERHHDSNTNTNNQLLPGQTNNSTGE FT TMPRRYPLRTRRKPLRLRYPIRETKIPKYFPPLGEEVSNAQRYDVLHKTRY FT GPSASIIPGTSRLTHKKTNAPSLLPAADPGWNSFPDETVCYTQITEHAPLQ FT VSDCCSRDALEGHSSRISSGSSSHSRTVLAIQWNINGFFNNLADLELLVNN FT MKPVVLALQEIHKASVSSMNNKLRKGYNWIAKSGNNIYHSVALGISSELKF FT TEIPLDTDLPIVAARLDWPFSASVVSFYLPNGNIPNLENQLRNVIQSIPTP FT IIMLGDCNGHHQAWGSHDSNARGSLVAELASSCELSILNDGSPTFIRGLAE FT TAIDISLISTGILNRIHWTAGTDPLGSDHVPITLYLDTATPETSRRPRWLY FT EKANWAEFQSSFEAAIENSKPNSINELTDLIIEVATATIPKTSSTPGRRAL FT PWWSDEVKAVVKARRKALRAFKRLPSSHPEKEAASQRYRETRNQCRQTIRE FT SKEKSWSEFLNGINEHQSTTELWSRVNRIHGKRRVKGMALKTNGIMTRDPA FT LIVDALADHFYELSSIKRYPETFLKKHPSPDKAIKNFVVPPGMGQCFNLPF FT TMTELDFALQKASGKSAGPDEIGYPMLKHLPTEGRIVLLNAINKEWIDGTL FT PDGWRHGFVVPLPKTCGPANDVKSYRPIVLTSCTAKLMERMVNRRLVEYLE FT TNRKLDRRQHAFRPGQGTGTYLATLGQVLNDALSRNEHIEIASLDLSKAYN FT RAWTPGILRQLAQWGITGNMLTFIKNFLNERTFQVMVGNSKSKLVAEETGV FT PQGSVLAVTLFLVAINGVFKVLPKGIYIFVYADDILLVVCGKYPKATRRKL FT QSATNAIGKWADLSGFDIASEKCARLHICSSNHTPPRKPITIHGKPIPTKK FT TLKILGVTLDRNFCFKEHFETTKKKCKNRLNLLKVITNRCTKNDRMTRIRV FT ADAIICSRLTYGIEITCRASDRLIQQLAPVYNNAIRTIAGLLPSTPADAAV FT VESGVLPFRYKITIAICNRAVSFLEHTKDDGSVAFIASEANRILDSVAGVT FT LPPVVGLHRNGPRSWQANLPKIDKRLKNHFRKGNNPIEARALFFEQLRSRY FT SNTEIRYSDGSKAVGKLGIGVCGTGLREHRSLADQCSVFSAEAAALFVAAS FT KPSDRPVLIATDSASTLLALESPRTKHPWIQATQVLIDDETKNVTFTWVPS FT HIGIPGNEDADKLANMGRSSRRLTRKVPGSDAKLWVKNTVQDAWEREWRTN FT RLLFIRKVKGTTRIWDDRPDRREQVVLSRLRTGHTRLSHDYTGGLPFRRTC FT ETCAVHNTVEHIICECPTLEQLRTRYQLGSIRNALNNDKAGETSLICFLKD FT AGLFSQI" XX SQ Sequence 6621 BP; 2066 A; 1547 C; 1477 G; 1530 T; 1 other; cagtcgacag ctatcgttcg tagcgtacgg ttgtgctttt caacctgcct gaatagagtg 60 cgagtctatt gtttcaatac agtagtgata tagtagagcg gtgcctgtag gtaacaattg 120 tgactcggtg ctccccaaac tacggggcca attcatttca tcgttgcgaa aagtktgttg 180 ggactagcgg ggcccaaagc gtaactgcct gctgtggaag tgcaagtcat catatcatca 240 ccaacgatca tccgaagtgt gcaaatagga tacaccacat aaaaaggtca tcccgttcgc 300 tagcatggcg ctagcgaacg gtttcccacc ccttctgggg gatccagggg gacctggagg 360 aggaggagga ggtacaggtc catcaaataa aatcaatggt gagtacactg gacgtttact 420 accaagcttt atggaccatg ctggaacgtc gggagagtta cggttcttga agatggaggc 480 ggtttccggc tcgctcccac aggatccgtt tcttttgcgt ctatccgtgg agaaatttat 540 caatggacca atcgacggcg cgtacaagga gaacaaagga atagcatatg tcctcaaggt 600 ccggagcaag gcgcaggctg acagcttgcg tcgcatgacg aaattggccg acggaacatc 660 gattcgaatc aacgaacacg atactttgaa ccagcgaaaa tgcgtggtgt ctaactacga 720 caccattggt cttacggagg agtacttgca aagtcaatta tctgcgcaag gtgttaagga 780 agtccgcagg atcaaacgca aaaacgcaat tggggagctg gaaaacacac caacggtgat 840 tctcacaata tgtggtacgg taatcccccc gcacattgac ttcggttgga atcgatgtaa 900 aacgcgcaat ttttaccctg ctcccatgct ttgtttccgt tgctgggagt acgggcatac 960 gggcaagcga tgtcgagagc ctcatcgcat atgtgggaaa tgcagcaagg tgcatccgga 1020 ggacaggatc gtaaccacag ctccagattt aactgaagaa ccaacagtct cagctggcag 1080 tagtaacccg ccatccgcta ttgaacgcaa tcgaacgcca tgtcttgaag ctgctttctg 1140 caagatctgc cagagcgacg atcattcagt gtcgagtaga aaatgccccg cgtacgttag 1200 agaatgcgaa attcagcaca ttcgtgtgga catgggcata tcatatcccc aggcccgccg 1260 tgaatatgaa tcccgtcaat ctgctagtag ttgcagtacc gcatataccg gagtagttaa 1320 tgccagtaaa gacaaggaaa ttgccgattt aacagcgaag gtccagaaac tgcagtctga 1380 cacaaagatg aaagagcgac gaatagaaga gatggaacgc caactccaaa acagaggtgt 1440 cggagataga ctggagacag tgcagcaaaa cggaacaatt gaagatttga tccggaaagt 1500 gacagctctt acggccaccg tggagaaact agaggaagca ttagtgaaaa aggacgagac 1560 tataaaaagg cagaagaaag aaatagcgct attccgagcc atggaaacta agtctgaatg 1620 ctctaacgtc accgttcctg aaacacagtc gtcaacggag acagagtcga ttattccagg 1680 taccgtcgaa caagaaacaa ccgaacaagt tgccgaatgg gtcaaatgta actctagaaa 1740 acagccagaa aaaaactgcc aaaaatcctc gaaaaataaa gacaacagcc ccgctgtgaa 1800 ttcgatgacg gacttgcatg gaacgcggaa tttgaataca ccacacgcct taaacctccc 1860 aaagagaaat cgcgaagaag aatctagtgg agactcatta tccatcaaag ccccagccat 1920 caagcgaagt attagagctc gcaagatgaa gggaaaccaa aagtaatccg atccttcgaa 1980 acgctcagtc gctctatgaa cagtccaacc ttaaacagcc tcttgatagt cctcataaaa 2040 tcttcacctc cgacgaagaa gacgaatgct caacaagcaa agaatggact gagcaatacg 2100 agacacaacc tgacaaaaac gttgcgactt ttgaaaaaga aagacaccac gacagcaata 2160 ctaatacgaa caatcagcta cttcccggcc aaacgaacaa cagcactggt gaaacaatgc 2220 caagacgata cccactgcgt actcgtcgta agcccctgcg tctcagatac cccatccgtg 2280 aaacaaaaat tcctaaatac tttccccccc ttggtgaaga agtttccaat gcccagcgat 2340 atgatgtact acataagacc cgctatggac catctgcgtc tattatacca ggtacgtcta 2400 gacttactca caaaaaaacc aatgccccat cgcttttgcc agctgcagat cccggatgga 2460 attcttttcc tgacgagact gtatgctata cccagattac agaacatgct cctttgcaag 2520 tatcagactg ctgttcaagg gatgcgttgg aaggacacag tagtagaatc tcatccggct 2580 caagctcgca tagtcgaaca gtcttagcaa tacaatggaa tattaacggg tttttcaata 2640 acttggctga cttagaactg ctagttaata acatgaagcc tgttgtccta gccctacaag 2700 aaatacacaa agcgtctgta tcttcaatga ataataagct aaggaaggga tacaactgga 2760 tagcaaaatc cggtaacaat atataccatt cggtcgcgct gggtatttcg tccgaactca 2820 aatttactga aatcccctta gacaccgatc ttcctattgt tgcagcaaga ctagattggc 2880 ccttttcggc atcggtagta tccttttacc taccgaatgg aaatattcca aatctagaaa 2940 atcagctaag aaatgtaata caatcaatcc ctacaccgat cattatgcta ggtgattgca 3000 atgggcatca tcaagcatgg ggaagtcatg attccaatgc tcgtggttca cttgtagctg 3060 aactagccag ttcatgcgaa ctttcgattc taaatgacgg ttcacccacc tttatcagag 3120 gtctagctga gactgctata gatatctctc ttatttcgac tggaatacta aaccgaattc 3180 attggacggc aggcactgat ccgttgggaa gcgaccatgt acccataacc ttgtatctgg 3240 acacagcaac tccagaaacg tctcgtcgac ctagatggtt atatgaaaag gccaattggg 3300 ctgaatttca atcttctttt gaagctgcga ttgaaaattc caagccaaat tcgataaatg 3360 agcttactga tttgattatt gaagtcgcta cagctacaat tcctaagacg agtagtacac 3420 ctggtcgccg agcccttcct tggtggtcgg acgaagtgaa ggcggtggtg aaagctcgta 3480 gaaaagctct tcgagctttt aagaggctgc cttctagtca ccctgaaaaa gaagcagcaa 3540 gccagcgata ccgtgaaacc agaaatcaat gtcgacaaac aattagggaa tcaaaggaaa 3600 aatcgtggtc cgagttcttg aacggtatta acgaacatca atccaccaca gaactgtgga 3660 gtcgcgtgaa tagaatccat gggaaaagac gggtgaaggg catggctcta aaaacaaatg 3720 gaataatgac cagggatccg gcattgattg tagacgcgct tgctgaccat ttttacgaac 3780 tttcttccat taaaagatat cctgaaacat ttctgaaaaa acatccttca cctgataagg 3840 cgataaagaa ttttgtggtt ccacctggaa tgggtcaatg ttttaacctt cctttcacaa 3900 tgacggaatt ggattttgcc ctccaaaagg catcagggaa gtcggcgggt cccgatgaaa 3960 taggctaccc gatgcttaaa catcttccaa cagaaggaag aatagtgtta ctgaatgcaa 4020 taaataaaga gtggatagac ggtacccttc ccgatggctg gaggcatggt tttgtggtgc 4080 ctttacccaa aacctgtggt ccagcaaatg acgtcaaaag ctaccgacca attgtgctca 4140 cgagttgcac tgcaaaactt atggaacgaa tggttaacag gcgactcgtt gaatatttag 4200 aaaccaatcg taagcttgac cgaagacaac atgcattcag accaggccaa ggtacaggaa 4260 catacctggc tacgcttgga caagtcctca acgatgctct atcacggaac gaacatatag 4320 aaatcgcatc attagattta tcaaaggctt acaatcgagc ctggactcca ggcatactac 4380 gacagctagc tcagtggggt atcacgggaa acatgctgac attcataaaa aatttcctga 4440 atgaacgtac cttccaggta atggtgggca atagtaaatc aaaattagta gcagaggaaa 4500 ccggtgtacc acaaggctct gttttggcgg taaccctgtt tcttgtggca ataaatgggg 4560 ttttcaaagt tctccccaaa ggtatttata tattcgtcta tgcggacgat attcttttgg 4620 tagtttgtgg aaaataccca aaagctacta ggagaaaact tcaatccgca acaaacgcaa 4680 tcggtaaatg ggctgacctc tctggttttg atattgcatc tgaaaaatgc gccagacttc 4740 atatttgctc cagcaaccac acccctcccc gaaaaccaat cacaatccac gggaaaccaa 4800 tccccacgaa gaaaacattg aaaatccttg gagtaacact agatcgtaac ttctgcttca 4860 aagaacattt cgaaacaacc aagaaaaaat gcaaaaatcg actaaatctc ctaaaggtta 4920 ttaccaacag atgtaccaaa aacgacagaa tgacccgaat aagagttgcg gacgcgatta 4980 tatgcagtcg gctcacctac ggcatagaaa taacctgccg ggcttcggac cgtctcatcc 5040 aacaactagc gccagtatat aacaatgcca taagaactat agcggggttg ctgccatcaa 5100 cgccggccga tgcggccgtt gtggaatccg gtgtccttcc cttcagatat aaaataacca 5160 ttgccatctg caatagggca gttagttttc tggaacacac caaagatgac ggatcggtgg 5220 ctttcattgc cagtgaagcc aaccgaatcc tggattctgt ggccggtgtc acactccccc 5280 cggtggtagg actccaccgc aacggaccaa ggagttggca ggccaatctt cctaaaattg 5340 acaaaagatt gaagaatcat ttccgcaagg gtaataaccc aatcgaggca cgagcgctgt 5400 tttttgaaca actgagaagc cgatactcaa atactgaaat cagatattct gatggctcta 5460 aggctgtggg taaattagga attggtgtgt gtggcacagg actgcgcgaa caccgcagtc 5520 tggccgatca atgttccgta ttttcagccg aagctgcagc tttgtttgta gcagcgtcca 5580 aaccgagtga tagaccggtt ctgattgcca ctgactcggc tagtactctg cttgcactgg 5640 aatctcctag aaccaaacac ccatggattc aggcaaccca agtacttatc gatgacgaaa 5700 ctaagaatgt aacttttact tgggttccga gtcacatcgg catacccggt aacgaagatg 5760 cggacaaact agcaaatatg gggagatcga gccgtcgtct cacacgcaaa gttcctggat 5820 cagacgcaaa gctttgggta aaaaacaccg ttcaagacgc atgggaaaga gagtggagaa 5880 cgaaccgcct tcttttcatc cgaaaagtta agggaacaac cagaatctgg gacgacagac 5940 cggacagaag ggaacaggtt gtgctatctc gactccgcac agggcatacg aggttatcac 6000 acgactacac tggaggcctt ccttttcggc gaacctgcga gacatgcgcg gttcataaca 6060 cggtggaaca tataatctgc gaatgtccca ccctggaaca actcagaacg cgctaccaac 6120 taggtagcat ccgaaatgct ctcaacaacg ataaggcagg tgagaccagc ttaatatgtt 6180 ttttaaaaga tgcaggctta ttttcacaaa tttaatccca aaacaaacag caacacacct 6240 caccggatgt gggttccatt tggaacctac atctggcagc cacgaaacga accaacaacc 6300 agcgaaaaca gacctatggc atgaaaatgg acagcgataa ggaaccaaag atacccacga 6360 cacagatgag gatttatttt agtttaacgc cagataaatc aagttatgac tgtctctttc 6420 tatgcgaagg tccttctgat cttcttatct ctatttgtac cattatataa tatgtttatt 6480 tgattcaaac aaactctgta catacacatt attcgcgagg ttgaatcctt caggttcctc 6540 ctcctttttt atatttttgt cgaggtgaac cagccacggg ctgaaagcct cgttaataaa 6600 gacaataata ataataataa t 6621 // ID BEL-230_AA-LTR repbase; DNA; INV; 481 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-230_AA_; KW BEL-230_AA-I; BEL-230_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-481 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 916-916 (2011). XX DR [1] (Consensus) XX SQ Sequence 481 BP; 152 A; 112 C; 90 G; 125 T; 2 other; tgttgaggcc gccagcaagc tgccgggaag tttcgaccgg cagacgatca tgcctcatgc 60 cggcagccca acaaacacac cacaacaatc aacacaacat actcaccatc ttttccacct 120 ggatcgttct cgaacgtacc taaaatagaa gaaaacaatt agaactaatg aattcatact 180 cacctttatg ctaatgatgc cactagaacc taccagaact attgtttttg ttttgcttga 240 attgacaata tgctagaatg ttcgggcgct gctacacttt gcattcgaaw gcacgctaat 300 aatacgtaag gcaacaatag tgtattcagc aaatggaaag ttctacacgc ttattgtcta 360 atcgagtata aatatcaccc agttggcata ggtcaaatca gtttcgaata aaccgttgaa 420 cagtgaaaat cggatcagca agcgttttwa tttatcctcc ggtgtggaag tcgcgttaac 480 a 481 // ID L1_Cis99Sat repbase; DNA; INV; 190 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from Ciona savignyi. XX KW Satellite; Simple Repeat; L1_Cis99Sat. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-190 RA Smit A.F.; RT "L1_Cis99Sat - Satellite from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Minisatellite emanating from L1_Cis99ext. XX SQ Sequence 190 BP; 40 A; 10 C; 60 G; 80 T; 0 other; tacggtgtat tgagtattgt acggtgtatt gagtattgta cggtgtattg agtattgtac 60 ggtgtattga gtattgtacg gtgtattgag tattgtacgg tgtattgagt attgtacggt 120 gtattgagta ttgtacggtg tattgagtat tgtacggtgt attgagtatt gtacggtgta 180 ttgagtattg 190 // ID Tx1-15_BF repbase; DNA; INV; 5843 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-15_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-15_BF; KW Tx1-15_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5843 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5843 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 852-852 (2009). XX DR [2] (Consensus) XX CC ORF1 is corrupted by mutations. XX FH Key Location/Qualifiers FT CDS 1826..5593 FT /product="Tx1-15_BF_2p" FT /note="endonuclease and RT." FT /translation="MTQVEVHISSFNCNGIGNSIKRREVFTWLRDKQHHII FT CLQETHSTLSVEKRWQNEWGGSMIFSHGTSNQRGTAILFQNSIKPCIHQTK FT TDKDGRWLIVDLSLDEYRFCLVNIYAPNEDSPEFFFNISEELDDFSESNEH FT LIITGDFNTVQNPLVDRLATNTTYHPKAFESISELKCKFDLQDIWRFRHPD FT TVRYTWRRRRQASRIDYFLVSFSLINRINNCKIADSFRSDHRLISLSFVTA FT DFPRGRGYWKFNTSLLEDKSFHTKTVEIMKEFFSINSGTANPHVVWEAAKC FT FFRGHCIKFSSFKNKQYLSREKTLIDDINALQTELDSTPSSPDSVLDALDQ FT KQKELELLYSQRVQGVMTRSRAKWMELGDRCSKYFLNLVHRNYTRKNIQKL FT QISENSFTCNPTEILERQTEFYSSLYSFKDPPVPLTPENCKDFFPEDYCRV FT LSENQRQSCEGLITEDELLDAINSFSSGKSPGLDGIPVEVYKQFYSVFKAL FT MLECFNFSLTQGFLTNTQRHGAISLLLKQGGNGQDKDPTLLDNWRPLTLLC FT CDTRILSKCLALRVKSVISHIIDKDQSGFIQGRFIGENIRRILDIIDHYEK FT EQKPGLIFISDYKKAFDSIRWDFIIKSLNFFNFGPQFSAWVKVLYNDITSS FT VLNNGYISQPFCLHRGVRQGCPLSPYLFIIAVEMLAIKVRSNEDLTGLSIL FT GKSTKISQFADDTDFPFTPTLASFYALLKDLESFSCISALTLNFEKCRILR FT IGTLKNTNFKLPTHLPFQWVDGNVEVLGVHIPQDLDTIVDLNFEPRLAKLD FT RLLYPWRIKGISLFGKVTIINSLITSQFTHLFQVLQTPDKSFFQQYERKFF FT SFIWNGGPERISRKTIYNSIENGGLNLTHLYAFACTIKASWVPRLYFNQDW FT STTWVIRLHPSLGSSLFPFFQIRSTKHLKLSPFLLDVLDAWFKYQYKPPTC FT AAEVKQQLLFMNDSILIDSVPIFMNTFINRNIIFVNDILNDMGTISTYEEF FT SRKYDAICDYFKYKQLISAIPQKWKSMLCGNVFESVCKPVQRNSCWLKQVK FT INKDMYQFFLSYYNLIDISHNVQLKWLYLFDTPIPWKQVYSSIIFCTIDSS FT TRFFQYKIVHKFLPTNKLLYIWKCIDTPLCSFCHEEEETYLHVFWECPHLT FT PFWDKIKNWYHSKTTINLKLNGFNIIFGNLHFGTPPIENLITLLAKIYIYR FT CRKPSTLNFDSFLRYVNFFNKVEYYVALKKGKLDKHLGKWGSLCS*" XX SQ Sequence 5843 BP; 1777 A; 1090 C; 1093 G; 1883 T; 0 other; tttcttgaaa taatgaatca gaaattagcc gtcaatgtca ttgttgctga agacatgata 60 cttcccacat tgattgttat gtcagatacc aaaggcaata tggcggccgg gtgagtcgtg 120 gttacatgcg ctccagcatt ttgttctttt gtgcacgttt atcgtactgt cacttagtga 180 acatttacat gactttgtta catgaactgt tgtaacttgt acgcatatgg gaaagcggca 240 tagaggctct aggggcagca gctcagagga aaacagtccg acatcgcgac cgtgcccccc 300 tcccaaaaca gcaaaaatgg cgtcccccga ggctgcgctg acatcgttgc cgccagacct 360 tcagaagtta gcgcagttat tttttgacca tactaagaag caacaggagg atactgtggc 420 ccaagtacag actatgggtg acaaactgtc taggaaaatt gatgcgctgg acgctaaaat 480 gcacaccctg tcagcggagc tgaccacagt gcgggaacga gtcggtgcgg tggaagcttc 540 agcagaattc cacgagacgg aactacaaaa cctacggaaa caactagatg aggaacggcg 600 tgctcgagcc aaggcggtca tcctggccga gagatactcc aggaagccgg acgtcatcat 660 tcgaggaata aggttccaca aggacgaaga ctgcaagcag atacttggtg atttcttgac 720 gacggaacta cacctggagc tgaagccaat tgtcgccata catcgtctgt caaaaccaac 780 aacaacacgt ccgaaccctc ccctgcttgt caggtttgtg aactttcatg atcgggacag 840 agttttgttc gagggacgta aactccgcgg taacaacaag ggttttgcag tgtatgaaca 900 tctcccccct cccctccagt cggcgagggc taaactggtt cctgagaggg atgagtcgat 960 caagaacaag ggtaaagctt ctattgttgt acctcccaga agcacctttg cggtgctgtt 1020 tgtagatggt aaggagaaga agagaattga tgctgtggat ttactcttga actctgagat 1080 cacattaact tgaacttttg tggccagaaa cacattactc tacatgctat agtgctcgca 1140 gaaaagaatg tcaagcgaga aactgcaata aatgatcttg ttattttcct cttgagagcc 1200 tttgtattcc tttaactgta agatgggaaa aaggcatgag tgtatgatct acatgtgtta 1260 aattatgtct gcttctgtca tttacttctg ctttgttctg ttttcactcc ttttatagga 1320 gaattgacga ttatgttcga attgttttgc tttgcatctt ggatgcagtt tgatttctct 1380 gtgaatatgg tttttgctac tactgtcatt tcatccgact tcttcctatt tcatttggac 1440 tggttttgat tatgttattg ttaacatgct atgctatcta tgttttacca tgcgtacaca 1500 aactaccatt agataattac ctttggtgtg ctataactaa gttaaactta gaaattggag 1560 tgcacgcaaa ggaaccgcta actcaggatt atcaacaccc tgcctctgtt tgtttattta 1620 tgttttaggg ggaaccatcc acttattctg tacttgagtt ttgttacttt gttctatgtt 1680 gagaaccatt cagtattatt gtattcttgt tctgttaact tttgttcagt tttgtttgtt 1740 tacattagtt tatttgacat gatgactttg ctgcgcccca atcccccaga tgacttccct 1800 gcattcctga cattccccag atgccatgac ccaggttgag gtacatattt ctagttttaa 1860 ttgcaatggt ataggtaatt ctattaaacg gagggaggtt tttacatggt taagggataa 1920 acaacaccac ataatatgtc tacaagaaac acactcaacc ttgtctgtgg aaaagagatg 1980 gcagaacgag tggggagggt ccatgatttt ttctcatggt acctcaaatc aaagaggcac 2040 tgccattttg tttcaaaata gtattaaacc ctgtattcac caaactaaaa ctgacaaaga 2100 tggaaggtgg cttattgtgg acctatcact tgatgaatat agattttgtt tggttaatat 2160 atatgcccca aatgaggatt ctcctgagtt tttcttcaat atatctgaag agttagatga 2220 tttctctgag tccaatgaac atttgattat tactggtgat tttaatactg ttcaaaaccc 2280 cttagttgat agattagcca caaacaccac ctaccaccca aaagcatttg aatctatttc 2340 tgaattgaaa tgtaagtttg acctacaaga catatggcgg tttagacatc cagatacggt 2400 tagatacact tggcgtcgta gacgccaagc aagcaggata gattactttt tagtaagttt 2460 ttctttgata aatagaatta ataattgtaa aatcgcagat agttttagat ccgatcatag 2520 attgatttcc ttatcatttg tcactgcaga ttttccaagg gggagggggt attggaagtt 2580 caacacatct ttattagaag acaagtcatt tcatacaaaa actgtagaaa tcatgaaaga 2640 attctttagt attaactctg gtacagccaa tccccatgtg gtctgggaag ctgccaagtg 2700 tttctttaga ggtcattgta ttaaattttc gagtttcaaa aataagcaat acctttctag 2760 agagaaaaca ttgattgatg atataaatgc tcttcaaact gaactagata gcacaccttc 2820 ttccccagac tcagtccttg atgctcttga tcaaaaacaa aaagaattag aattattata 2880 tagccaacgt gttcagggtg ttatgacaag atccagagca aaatggatgg agttagggga 2940 taggtgttct aaatattttc ttaacctagt gcacagaaat tataccagaa aaaacattca 3000 aaaattacaa atttcagaga attctttcac atgtaacccc acagagatct tagaaagaca 3060 aacagaattt tactcgtctc tttattcctt taaagatccc ccagtacccc tgacccctga 3120 aaattgtaaa gactttttcc ccgaagatta ttgtagagta ttgtcagaaa accaaaggca 3180 aagttgtgag ggcctcataa cagaagatga attgctagat gctattaact ctttttcaag 3240 tggaaaatcg cctgggttgg atgggattcc tgtggaagtg tataaacagt tttattcagt 3300 atttaaagct ctaatgttag aatgttttaa tttttcattg actcaaggtt ttttgacgaa 3360 tacacaaaga cacggtgcta tttctttatt attaaaacag gggggaaatg gtcaagataa 3420 agaccctaca ttattagata actggagacc actcactttg ttgtgctgcg ataccagaat 3480 actttctaaa tgtctggcac ttagggtaaa atctgtcata tcacacatta ttgataaaga 3540 ccaaagtgga ttcattcagg gcagatttat tggtgagaac attagacgta tactcgatat 3600 tattgaccat tatgaaaaag aacaaaaacc aggacttatt tttatctcag actataaaaa 3660 agcatttgat tctatcagat gggacttcat cataaaatca ttaaactttt ttaactttgg 3720 acctcaattt tcagcctggg tgaaagtctt atacaatgac attacaagtt ctgttttaaa 3780 caatggatat atatctcaac ctttctgttt acatcgtgga gttagacaag gctgtcccct 3840 tagcccttac ctgttcataa ttgcagtaga aatgttagct attaaggtcc gcagtaatga 3900 agatttgacc ggcctgtcaa ttcttggaaa gagtactaaa atttctcagt tcgcagatga 3960 tacggacttc cccttcactc ctactctagc atctttctat gccctcctta aggatctaga 4020 aagtttttct tgtatttctg cccttacttt gaattttgaa aaatgtagaa tacttaggat 4080 tggaaccttg aaaaatacta attttaaatt acccacccat ctcccatttc aatgggtaga 4140 tggtaatgta gaagttttgg gtgttcacat accacaggat ttagatacca ttgttgatct 4200 gaactttgaa cccagattag caaaattaga tagactttta tatccatgga gaattaaggg 4260 aatatctttg tttggtaaag taactattat taattcattg atcacttctc aatttacaca 4320 tctctttcaa gtccttcaaa cccctgataa gtcttttttt caacaatatg aaaggaaatt 4380 tttttcattt atctggaatg gaggaccaga gagaataagt aggaaaacaa tttataattc 4440 gattgagaat ggggggttaa atcttacaca tctttacgcc tttgcctgta ccatcaaggc 4500 atcgtgggta ccaagactat attttaatca agactggtca acaacctggg taataagatt 4560 gcacccatct ttgggatcta gtttgtttcc ctttttccaa ataaggtcta caaagcatct 4620 taagctaagt ccatttttat tggatgtatt ggatgcatgg tttaagtatc agtacaagcc 4680 ccccacatgt gctgcagaag taaaacaaca acttcttttc atgaacgata gcatccttat 4740 agatagtgta cccattttca tgaacacatt tataaataga aatattatct ttgtcaatga 4800 tattctcaat gatatgggta ccataagtac atatgaagaa ttctcaagaa aatatgatgc 4860 tatttgtgat tattttaaat ataagcaatt aatttctgcc ataccccaaa aatggaaatc 4920 catgttatgt ggtaatgttt ttgaaagcgt ttgtaaacct gttcagagaa atagctgctg 4980 gttaaaacag gttaagatta ataaggacat gtatcagttt ttcttatcat actataattt 5040 gattgatatt tctcacaatg tgcaactgaa atggctttat ctctttgaca cccctatccc 5100 ctggaaacaa gtttactcct ccataatatt ctgtacaatt gactcatcaa ctagattttt 5160 ccagtataaa attgtgcata agttcttacc tacaaataag ttattgtata tttggaaatg 5220 tattgatacc ccattatgct cattctgcca tgaggaagaa gaaacttacc tgcatgtttt 5280 ctgggaatgt ccacacttaa cccccttttg ggacaaaatt aaaaactggt accacagtaa 5340 aacaaccata aatttgaaat tgaatggatt taatatcatt tttggaaatc tgcactttgg 5400 aactccccca attgaaaacc ttataacgtt actagccaaa atttatatat atagatgtag 5460 aaaacccagt accttaaact ttgactcctt tctccgttac gttaatttct ttaataaagt 5520 agaatattat gtagccttga aaaaaggcaa actggataag catctgggga agtggggctc 5580 cctatgcagc tgatatcgta tccattaacc tacagttaag cattgtcaaa cactaatcta 5640 agttccttat tgttatggtt tgagatctca gttatattgt ttgtttgttt ttaaaatcta 5700 agttcatcat cgctatcgtt tgagatctca gttatattat cgttgaaatc taagctcatt 5760 attgttatgg cttgagatct cagctatatc cttttttgaa actaagttca ttattgttat 5820 gatttgagat ctcagttata tta 5843 // ID Dmedpu1cons repbase; DNA; INV; 492 BP. XX AC . XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Consensus of mariner-like element internal region of mauritiana DE subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dmedpu1cons. XX OS Drosophila mediopunctata OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; tripunctata group; OC tripunctata subgroup II. XX RN [1] RP 1-492 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR [1] (Consensus) XX CC Consensus of clones that show less than eight percent divergence. CC Dmedpu1cons. XX SQ Sequence 492 BP; 141 A; 117 C; 126 G; 108 T; 0 other; tgggtgccgc atgaactgaa gccaagagac gttgaacgcc gttttatggc atgcgaacaa 60 ctgcttcaac ggcacaaaag aaagggtttt ttgcatcgaa ttgtgactgg cgatgaaaag 120 tgggtccatt acgacaatcc aaaacgtcgg gcaacgtatg gataccctgg ccatgcttca 180 acatcgacgt cggcgcagaa tattcatggc ctgaaggtta tgctgtgtat ctggtgggac 240 cagctgggtg ttgtgtatta tgagctactg aaaccgaatg aaacgattac gggggatgtc 300 taccgacgac aattgatgcg tttgagccga gcactgcgag aaaaacggcc gcaatacgcc 360 gatagacacg acaaagttat tttgcaacat gacaatgctc ggccacatgt tgcacaagtg 420 gtcaaaacat atttagaaac gctcaaatgg gatgtcctac cccacccgcc gtacacaccc 480 gacctagctc ca 492 // ID hAT-52_HM repbase; DNA; INV; 3914 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-52_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3914 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2040-2040 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1471..3219 FT /product="hAT-52_HM_1p" FT /translation="MFHLFRLSPFLTSILIKYIYYIFLLQLQAEEDHGTPP FT QTHKQATVKEAFQWQQKYDKNSPEARKLNLAVAEFICIDKVPIYTVQKCGF FT QQMLHHFNPKYQLPSRNFFMYTEIPRMYNDTRDLIIQHLSDKSFYSCTTDL FT WTSRTADTFMSLTLQYITKSWELQSWCLGCCGLNTDHTAESLKEAFDEKLE FT DWKLDIARMSGITTDSASNNKKAFEDYTWIPCFGHNLHLAVNKALDINRVS FT AVLSRLRKTISAFTRSPKLSRQLTKKQKDLSSPDHKLIHDEPTRWNSSYDM FT VERFLEQQQVVCAVLAEDRKKWHLMPKDSDITVLETVKAVLEPLSPFTDAL FT SGEKHTTLSSVLPLLWKIFECLSHEQSDSALAKEMKEKIHEYLQHRYDDLQ FT LRFLLNTATYLDPRFKNSFVSLKDDVKQSLLDEVKKMGNEKEGIASRVSGG FT LSSKEVRPSKKSKNHLKFLLSTIQGEKKEKGEPASSSQAPLSASDEINGEF FT LVYDQMPEVSAEDDPLIWWKTNMGTLPQLSEFARKYLCIAASSCSSERVFS FT TAGYIVSPRRSRLSQEHVDMLVFLSENLEMAKKVT*" XX SQ Sequence 3914 BP; 1342 A; 618 C; 680 G; 1274 T; 0 other; tagtgttgtc acggtacacg gtacttcggt accaagtcgg tactaaaaaa atttaaacgt 60 cacggtacca agttttctta agtaccggta gtaccgaaga cccgttcaaa cccgattcct 120 tgatgcgcat gcgtgcagta gcgcgtcctc tttattaaac tgctgagaaa tggcgaccgg 180 tggtgcagct gatgtggtag ctctaaatcc acttgttgag aaaaaaggag gaaagagcca 240 tgtgtggaaa tattttggat ttgcggctga tgacaagggg aatatcattg ataatcaaaa 300 acccatttgc aaacgatgtc gccgaagttt tctctcaaaa ggaggaaaca catcgaactt 360 aataaaacac ctcaaagacc gacacccgga tctgacgaaa gagttcaagc aggtaagttt 420 caaactaatt taaaatatag agcttctgtt atgaattata tatcatattt taatttgaat 480 gtctgaacca gtcggataga aataaacagc caaaaataag tcttacagaa agattaaaat 540 aacaataata agcttaaaaa ttaaattaaa aaaatatata ctttttttac gtttttttac 600 aacctattgt taagactttt cttttaataa gtatttctgt aaaaagtcgt ttgtaaaaag 660 ttgtttttaa aaagtcgctc ataaaaagtc gtttgtaaaa agtcgtttgt aaaaagttat 720 ttgtaaaaag tcttttgtaa aaaagatttt gacgtcttta atagaatgac tttacagaga 780 aaataaaatc ggtgacttca aaagcgaaat gtaacttttt aagtgacttc gaattaaaac 840 gtttaagcat taaaagtttt gaacaacgtt caatgttttg aataaagtcg ttgttgttaa 900 taaaattaat attaaaaata gttctaaacg atttaattta ataaaaaaaa ttataattta 960 gtaaaagata atataaatat atatttttgc tgattctttc cttcttcaaa aaaataaata 1020 aaaaaattcc aaaaaaataa aaaatattta tcgaagaaaa cgttttagcg aataaagttt 1080 tatgtatagt tttttcgatt aaagtattaa aggtttttat cgcattataa aaatggaaat 1140 aaatatagtt atagaaatag ttttaaattt attataaaaa atacatactt ataaaataaa 1200 taaaactaca tatattatat ttaattattg ggaaaatcct tctaatcctt tactttttta 1260 cttccatggt gctactaata gaaattatgt aattacctac cgtgtaaagg attaaacaat 1320 tttatacaaa aagtattttt taacaaaatt taaaatattt aatcttttta caagtactaa 1380 ttatttatta aaattgttta tataggcttt tatataggct attatatagg ctattataaa 1440 agctgaggtt tgttatttag agaaaaacta atgtttcatt tgttcagact cagcccattt 1500 ttgacttcta tacttataaa gtatatttat tatattttcc ttttacagct gcaagctgag 1560 gaagaccatg gaacaccccc gcaaactcat aaacaggcaa cagttaaaga agcttttcag 1620 tggcaacaaa aatatgacaa aaactcacct gaggcaagaa agctgaacct agctgttgct 1680 gaattcatat gtattgataa agtgcccatt tatacagttc aaaaatgtgg attccaacaa 1740 atgctccatc atttcaaccc aaaataccag cttccaagtc gaaacttttt tatgtacact 1800 gagattcccc gcatgtacaa tgacaccaga gacctcataa tccagcatct cagtgacaaa 1860 tcattttaca gctgcacaac agatctttgg acaagtagaa ctgcagacac tttcatgtct 1920 ttaactctgc agtacattac taaatcatgg gaattgcaat cctggtgtct tggctgttgt 1980 ggtctcaaca cagaccacac tgctgaaagt ttgaaggagg cctttgatga aaaacttgaa 2040 gactggaagt tggacattgc aagaatgtca ggcatcacaa cagacagtgc ctcaaataac 2100 aaaaaagcct ttgaggacta cacctggatt ccttgttttg gacacaatct ccaccttgct 2160 gtaaataagg cattagacat aaacagggtg tctgcagtgc tgtcaaggct tcgaaaaact 2220 atttctgcct ttacaagatc cccaaaactt tcacgccagc tgaccaagaa acagaaagat 2280 ctgtcctctc cagaccacaa actcattcat gatgaaccca ctcgctggaa ttcttcatac 2340 gatatggtgg aacgtttctt ggagcagcag caggttgtct gtgctgtcct ggctgaagac 2400 aggaaaaaat ggcatctgat gccaaaagac tcagatatta cagttttaga aactgttaaa 2460 gcagttcttg agccactcag tccttttact gatgcactta gcggggaaaa acataccacc 2520 ctgtcttcag tcttgccgtt actctggaag atatttgagt gtctcagtca tgaacaaagt 2580 gactctgcat tagccaaaga gatgaaggaa aagatacacg agtatcttca gcatcgctat 2640 gatgacctgc agcttcggtt tcttttgaac actgcaacct accttgatcc acgatttaag 2700 aacagctttg tctctctaaa ggatgatgtg aagcaaagtc tcctggatga ggtgaaaaaa 2760 atgggcaatg aaaaagaagg aattgcatct cgggtgtctg gaggattgtc aagtaaagaa 2820 gtcaggccat caaaaaaatc caaaaatcac ctaaagtttc tgctttccac cattcaaggg 2880 gaaaagaaag aaaaaggaga gccagcatcc tcaagtcaag cacctctttc tgcaagtgac 2940 gagataaatg gtgaattctt ggtgtacgat cagatgcctg aggtcagtgc tgaggatgat 3000 ccacttatct ggtggaaaac aaatatgggt actctacctc aactatcaga gtttgccagg 3060 aagtaccttt gcattgctgc atctagctgt tcatctgaaa gagtgttcag tactgcaggg 3120 tacattgtta gcccaagacg ctcaagactg agtcaggaac atgttgatat gttagtgttt 3180 ctatcggaaa atctggaaat ggcaaagaaa gtaacttaag gaataactgt gtagaccagt 3240 attgatttgg gtgaaacttg tttgaaaaaa atgttgttat ttatctagtt atttaattac 3300 tcaaatgttt acaatccttg gcttgaaata cttaaattat gccgttgttg ttttgttttt 3360 ttacatgtat aataatattt taatttacaa cttgttgttc agttatggca tattttgttt 3420 aatttataaa aatgttaaag tttataaata agagtgatta ttcagtcatg gcttattgtt 3480 ttgttattta tctagttatt taattactta aatgtttaca atccttggct tgaaatactt 3540 aaattatgcc attgtttttt tacatttata ataatatttt agtttattac ttgttggtca 3600 tggcatattt tgtttaattt ataaaaattg cgttttttag gtttgtaatt tttcatttat 3660 tatttgtgga aaaagttgaa acacaatttt aaacaagtat gaagaatggc attgtttaac 3720 tttattaata aaaatagtgt gtttttcagc tagcatgttt ttgtgttttg ctttgataca 3780 aagaattgtg aatttttatt ggcaaattta gtctaaatta atctttggtt tggtaccgaa 3840 attggtaccg agaaccgtgg atttttactg gtatcggtac cgaatactga aattttggta 3900 ccgtgacaac acta 3914 // ID Tx1-1_HM repbase; DNA; INV; 5060 BP. XX AC . XX DT 20-JAN-2009 (Rel. 14.02, Created) DT 20-MAY-2010 (Rel. 15.06, Last updated, Version 2) XX DE a non-LTR retrotransposon from the Tx1 clade - consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1-6_HM; Tx1-1_HM. XX NM L1-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5060 RA Bao W. and Jurka J.; RT "L1-like retrotransposon from Hydra magnipapillata."; RL Repbase Reports 9(2), 430-430 (2009). XX RN [2] RP 1-5060 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol. Biol. Evol 21(2), 207-217 (2004). XX RN [3] RP 1-5060 RA Kojima K.K. and Jurka J.; RT "Reclassification into the Tx1 clade."; RL Direct Submission to Repbase Update (20-MAY-2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. The previous version includes CC downstream U2 snRNA sequence [1]. U2 snRNA gene is a target of CC Tx1 clade elements named Keno [2]. The target of this CC retrotransposon appears 2-bp upstream of those of fish Keno CC elements. This retrotransposon is reclassified into the Tx1 CC clade, renamed, and U2 snRNA sequence is deleted [3]. XX FH Key Location/Qualifiers FT CDS 805..4332 FT /product="Tx1-1_HM_1p" FT /translation="MAVKILSLNVNGFNNDQKRNFLMNYFLNFDIIFLQET FT HCCXXTQLXWVNEWXKKSNGHSIWNNGXNREXGVAXLSKXXFXFSQITNDK FT XGRILAANITLGKKIFRIINIYGPNDPQEKEKFFKSLNKHAXXXDXYILXG FT DFNMVEDPXLDREGGDRFNPNNXIGXKNLXLFTEKYQLTDIFRKQXPXKKV FT FSXEHQXYKSRIDRIYXSNXLXNLNPXCEYKKVGXXDHSSVSCEIXXEXEN FT PRGPGYWHLNXSLINDKKFLEMLKCEFEFNQKQNXFENSNQAWEYTKXMVR FT VIAEQRGXEIRKEKREKIXXLEXXLNTENSKXDIDDVKIKRIKKELSELNN FT KNGVFIRTKQVIIEEGEIPSRSLFLLEKHNQXKKTIKTISYEXKTVTDTDX FT IQKVLRKFYXKLYNKEKLDENXQNRFIDKIQNILKDDENTFLNEXFTVNEL FT XQAALSLEKNKSPGIDGIPVEFWQSTWGIYGEEFTRLANENFFENXENQIP FT WTQRTAVISLLPKDGDLSLLQNWRPISLLCADYKIVTKALALRMAKVLXKI FT LEPSQTCSVPGRNIFSNLFLVRDLIHYTNEKKVGAFLMSLDQEKAFDKMDR FT NFLIKVLEKFNLGNXFINAIKQTLKNTQSIITNNGYLSKPIQIKRGVRQGD FT PISLMLYCVAVETLALDFKXSQRIEGIQLPGTTPLKILQFADDTTLXCKXP FT QCVXEIXSVLIAFEKATGSNINMXKTKALALGGLXNXQSHLQFXNXXXKWS FT NQDGLKVLGITFFTNLQXTADXNXDKAVEKFXKILKNFKYRNVSLKCRGMF FT VNTLALSKVWXVANIFPINKLHEKXIKKEISXYLWQSNFAEPIKRNILNLK FT VDKGGLGILDVLRQCIXLRLKHLMLFRXDPNXHESFYLKXYLLAYSLSKYA FT KKGYPQWNFXTEINFPKSLKELPFXYSDVVXLLKTKQNLFKEKTXTTKKVY FT LXLGDQEDYXNLTKTQXFWENTFQTKXPWDKIWWRXFRSYAQGPSQNTXWK FT ILQNILPTXEKCKLWXKNRGRXXTNCLXCNQNENTLHPFXYCKVARAIWNS FT LXFVYXKILPETNFNXIHVIFYXNIENLPKLDPKAXLVTTLTNIITXEMWR FT ARNMRVMENKILSPSKITKNIIKEIKYIWKIKYEKYKRENNVEKFHALFSI FT NDAISKKHEKELVFTI*" XX SQ Sequence 5060 BP; 1805 A; 726 C; 758 G; 1441 T; 330 other; aagaagcggg taagtctgtg gaggaagrtc yggaggatct gaggaatrat gtcgagcttg 60 tggcttcttc cacagwgakg ggtrtagaaa ctgcatcagc tattaytgat gatgtagcta 120 ccccctccct cgrcacctcg gtcctccccg gaacaatagg ggwtttccct accggagccg 180 attcwtctca cattaacaas acaatcatgt ctcttgcgaa aaygaagcat ygatgtcagc 240 atcagaactc gacactacgg aggtcaaawa tccgaaaagr aarattkcyc waccgamwga 300 tctcgttata gtagtaacag ataaaaaraa gaaatcgatt caataccgaa actaatcccg 360 artwtmgaga aaaaywtcac aaacatcaac tctracatct caaaaaaact tcagctgcag 420 atagagsamg ratkywcgga caactwatgg ctctacgmtw tggtamamtm aatgagacca 480 aygttatcga rttraaagaa caaaacycaa rttcgacgta attaaaacay tggaaaaraa 540 rygstgawgc agataaaagt wttttarttr ramttttgaa twmgcaatac cgaaagtayt 600 gcaatgartt caagatttcs atgkcgacgk aaragtaakw tcattatttt tctttttata 660 yawtttwatw tatacgattt tttttkattt atttaagggc gcawtatttt wctgagaaaa 720 ctccaatatt tcattttaaa aaattgtgwt tttttattgt tttaactttt aaayttcgww 780 ttttagatat atytttctca gaaaatggca gtaaaaattt tatcgttaaa tgttaatggt 840 tttaataayg atcaaaaaag raacttttta atgaattatt ttttaaattt tgatataatw 900 ttcytacaag aaacycaytg ctgtracwyt actcaactgm artgggtaaa tgaatggtya 960 aaaaartcaa atggccattc aatwtggaat aacggaarta ayagagaaar aggagtagct 1020 wttttaagya aaamagawtt tawattttca caaatwacaa acgataaaar aggtagaatt 1080 cttgctgcca atatwactct aggcaaaaaa atttttcgaa taatcaatat atacggacca 1140 aatgatccgc argaaaaaga aaaatttttt aaatctttaa ayaaacatgc gcywyttwgt 1200 gattwttata tattaaawgg wgaytttaat atggttgagg acccgtytct ygacagagaa 1260 ggyggggacc gttttaatcc aaataatrtt ataggtarga aaaatttaaa kttgtttacc 1320 gaaaaatatc agytractga tatttttcgg aaacaamayc cwamcaaaaa agtattttca 1380 twcgaacacc aaaaktataa aagtagaatt gayagratwt ayrtttctaa trawttgtyt 1440 aatttaaatc caawatgcga atacaaaaag gtgggttkka stgaccatag ctcggtttcc 1500 tgygaaatcc yyyttgarrr ggagaatccr cgggggccgg gatattggca yttgaacwca 1560 tctttratwa atgataaaaa atttttagaa atgctwaaat gygaatttga atttaatcaa 1620 aaacaaaact yttttgaaaa ctctaaycaa gcttgggaat ataccaagts yatggttaga 1680 gttatagcag aacaaagagg aygcgaaatt agaaaagaaa aaagagaaaa aattawtwtt 1740 ttagagmarg ytttaaatac agaaaattct aaaakygata ttgatgatgt aaaaattaaa 1800 cgaattaaaa aagaattatc tgaattaaat aacaaaaatg gtgtgtttat ccgaacgaaa 1860 caagtaataa ttgaagaggg agaaataccc tccagatcct tgtttctttt ggaaaaacac 1920 aaccaaarca aaaaaactat taaaacgata tcctatgaaa rcaaaactgt tactgataca 1980 gayrgcattc aaaaagtttt ragaaaattt tatraraaac tctayaataa agaaaaactt 2040 gatgaaaatw tgcaaaaccg attcatwgat aaaattcaaa atattctgaa agatgatgaa 2100 aatacttttc ttaatgaaaa wtttactgta aatgaactct wtcaagccgc cctttccctt 2160 gaaaaaaata aatctcccgg aatcgacggw attccygtcg aattctggca gagtacatgg 2220 ggtatttatg gggaggagtt yacccgatta gcaaatgaaa acttttttga aaacayygaa 2280 aaccaaatcc cgtggaccca gagracagca gtgattagtc tyctwccgaa ggatggagac 2340 ttgtcgctgc tgcagaactg gaggccaatc tcattgctgt gcgctgacta caagattgtc 2400 acaaaagcgc trgctttgag aatggccaar gttctcsaaa aaatccttga accctcccaa 2460 acttgctcag tacccggaag aaacatcttt tcgaacttgt tcctggtccg agacctcata 2520 cactacacca acgaaaaaaa agtcggygcy ttcttgatgt ctttggacca ggaaaaagct 2580 ttcgataaga tggatcgaaa yttcctaata aaagttttag aaaaatttaa tctyggaaay 2640 aaktttatta aygccatmaa acaaacttta aaaaatacyc aatcaatwat waccaacaac 2700 gggtayctga gcaaaccmat tcaaataaaa agaggagttc gacagggaga cccratctct 2760 ttgatgcttt actgtgtggc agttgaaacm cttgcccttg actttaaacr cagycaaaga 2820 atygaaggga tycaacttcc yggcacaacy cccytaaaaa tattacaatt tgcwgaygac 2880 accacactag yttgcaaama tccgcaatgc gtagrygaaa ttttwtctgt tttaattgca 2940 tttgaaaaag cracaggttc raacataaat atgrcaaaaa caaaagcgct tgcaytgggg 3000 ggtcttsara acwcccaaag tcatytacaa tttgraaatw twracrtaaa atggtcaaac 3060 caagacggtt taaaagtact tggaataacc ttttttacaa atttacaaaw gacagctgat 3120 wtyaattwtg ataaagctgt tgaaaaattt wctaaaattt traaaaattt taaatataga 3180 aatgtttcat taaaatgtag ggggatgttt gtaaacacgc tagctctgtc aaaggtctgg 3240 twtgttgcaa acatcttccc yataaataaa ttgcatgaaa aaaakatyaa aaaagaaatt 3300 tcartttatc tctggcaaag yaaytttgct garccaatwa aaagaaatat wytaaattta 3360 aaagttgata aaggrggatt aggaattcty gaygtcttra ggcaatgyat tkcyttaaga 3420 ctaaagcatt tgatgctctt ccgasawgay cctaattsyc atgaaagttt ttatttaaaa 3480 aratatttac ttgcatattc gttaagcaaa tatgcaaaaa aaggttatcc ccartggaay 3540 tttytwacyg aaataaactt tccaaartct ttaaaagaac tccccttytw ttactctgat 3600 gtggtagamc ttttaaaaac aaaacaaaay cttttcaaag aaaaaactwt wacaactaaa 3660 aaagtttatt tattmttrgg tgaycaagaa gattatgwaa atttaactaa aacmcaawta 3720 ttttgggaaa acacytttca aacmaaamtg ccttgggata agatctggtg gcgawgctty 3780 mgatcttatg cccaaggacc ytctcaaaac acamamtgga aaatcytaca aaacattyta 3840 cccactmrwg aaaaatgtaa attatggrta aaraaycgcg grcgagscam tacmaattgc 3900 ctcgyytgca accaaaacga aaacackctg caycctttyr tatattgcaa rgtwgcgcgw 3960 gcaatttgga acagtttgag ktttgtatat rrgaarattc tccctgaaac aaactttaay 4020 ayyatacatg taatttttta twtaaayatw gaaaatctrc caaaattaga yccaaaagca 4080 awattagtta ccacwctaac aaatatwata actaswgaaa tgtggcgcgc taggaatatg 4140 agagtaatgg aaaataaaat tctctctccc tctaaaataa caaaaaatat tataaaagaa 4200 attaagtata tatggaaaat aaagtatgaa aaatacaaac gtgaaaacaa cgttgaaaaa 4260 tttcatgctt tattttccat aaatgatgcc atttctaaga aacacgagaa agaactwgtg 4320 tttactatct agaaacaaca accatattaa agcaaccttt ttctttgttt rtttttcaat 4380 aaatatatac ggaaaggcct gtccccagca accctggtgg gtgtgacgga catgcttatt 4440 ttttataaat gcgaggctta atggccagca atatttatta taaaccttag aattacgaaa 4500 aagcatgacc ccagtaacct tgtggtgcga cggtcatgcg tttttattat aaatgcccgg 4560 cttaatggcc agccaaattt attataaaca mgactgtgtt ttattttata aaaatttttt 4620 tttgaaagca gctgaaatrg ttttttatat accatgtgtt tttttgtaaa tattttgttt 4680 taattaacat agattttgta gtaaaaaata taaacttagt ttatattaac ttaattttat 4740 tttgcggttt gaaatttcga gtttttcgtt ttcgttttcg ttttcttttt tcgtttttcg 4800 ttttagtaaa aaccgcaaaa aaaacaaaac aagaaarttt tgagttattt gacttgacat 4860 ttcatatatt cttgacaatt tatatgaatt ttcaaagaaa tttttgtgtt ttatatcaat 4920 ttctggtaat tttttgtaaa taactttggt catatatagt ttcacgattt attgttttat 4980 aaacattatt tctttgtaaa tttaacggtt tttttgtaaa tactttttgc aagaaataaa 5040 ttgtaaagaa aaaaaaaaaa 5060 // ID Transib-15_HM repbase; DNA; INV; 3654 BP. XX AC . XX DT 14-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-15_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3654 RA Jurka J.; RT "Transib transposons from the hydra genome."; RL Repbase Reports 8(12), 2104-2104 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1401..3197 FT /product="Transib-15_HM_1p" FT /translation="MGRPPLNFDEASERTKRRKTEEMRSTFSSSELAFASQ FT MSLRASGACNAGVTVKDLPSTIQHRVANYIESLKLSQISQTSELCAETALS FT ILMEAKLSKHQYCIIRSAAIANSSSFLPSYEKLKEAKKMCYPEDITVTEIS FT AEVKLQSLLNHTCLRIIKTQQDVIDSLNVNILSNFILILKWGFDGSSGHSE FT YKQRFADGRNSDANVFLTSLVPLQLLCTNSVLGQDVILWKNRTPSSTRFCR FT PIRLQFLHENVHTSINEKDYIEDKIKSLVPLIIVQHKIEIKIHYKLVMTMI FT DGKVCNAITLTKSTMRCYLCSATSSQFNNINFVKEIKVDESKLEYGLSTLH FT AWIRFFECFLHLGYKLGIKKWQARSDIEKSIILQQKLLIQNAFKLQLGLIV FT DRPKPGCGNTNDGNTARRFFKNSQVSADILGVDLELINRFHVILQVLSSGF FT PIDADKLDNYSTETALLFVEKYPWYNMPTTVHKVLIHGALITKTAILPIGQ FT LSEDAQESRNKDLKKYREDFSRKNSRKNTMHDVFCRLLVSSDPAISSLLKS FT HAKVSRSLSSQALQMLLPPEINENSETHSSVIEFINNISIINDSDDSGDLF FT *" XX SQ Sequence 3654 BP; 1335 A; 542 C; 556 G; 1221 T; 0 other; gacgcacagt gggctaattt tcaaaaaagc tggccaaaag tcaaaaaaaa aaaatcacta 60 ttttgaattt attattttga aactacataa acaaagccct gttgcatttt attgtataag 120 aaaaaattgt ataaatttta aagttaccta tatctacgca gttaaacatc tgttattaaa 180 aacacttaat tggcgtttta aaaacaaaaa aaaaatgtgg ttaaataagt atcgtttaca 240 aaaaagttat ttatcgttac taaaaaacta actatggacg atactattca aggtatttat 300 ataatatttt agtaaaaagt tttatattat ttctctaaat tttatgcgaa aagtgcgaac 360 aaaaaagtac agaataaagt ttatttattt tctttaacct atttttcaag cattgaaaaa 420 gtagtgaacc cgcgagattc cgcttagtta taatttggat gttgttcata aaagtgtatt 480 ctaaagattg gttaaaagaa aaggcaccta tttttgccta ttacttcatt aacttatatt 540 gtgtagtgct agtcgtgttt tttagtaaac aaaatggcgg tgttttgttt tctttacaaa 600 atcagtcaaa ttgtatatag atatagttat tatgtttctt gagaaactga ttttgttaag 660 tgcaataaaa ctacaataaa attagtagca cacatgaatg agattttgac aaagttagag 720 tgtagggggg gggggggaca aaccccctcg gttacgacgt cttttttacc tcagctacga 780 cgatttttgt atagttttat tgaattttac aaaatcagtt tctcaaaaaa cataataact 840 atatctatat acaatcatat gattaacagt atattttacc tttctcttat atatcgaatt 900 tatataatat aatatattat cgattacttt ttattttatt tttaattagt ttagtaaagt 960 tataatgttt ttatgatttt ttcaattatt atattttcaa ttaatatcta aggttcacaa 1020 aaagatcatt tgcaaattcc tcgaattgaa atttttgaaa ttttaaaaaa cgagttatct 1080 tcagatctag atgaccagct tgatattttg caatcacagt tgtgtctaag aagaaaaatt 1140 tgtttccaat tttcaacaga tgataaaaag aaactaaaaa atgtattgac aaaattaaaa 1200 ataaaatgga aaagtagtaa ccgaacaaaa aaaatatttt tgaagacaca tgacacttgg 1260 ttaaaagcct caacaaatat ttgtgtaagt tatttaatgc ttatctctat taaagtttta 1320 aactttttta tatttactaa acatttaaat aatttcagat tgttcaagca cgaaacaaag 1380 agaaaaaaag ccatagtgca atgggtcgac cacctttgaa ttttgatgaa gcaagtgagc 1440 gaacgaaacg gcgaaaaaca gaggaaatgc gttcgacatt tagttcatcc gagttagcat 1500 ttgcttctca gatgagtctt cgggcttcag gagcatgcaa tgccggagtt acagttaaag 1560 atttaccttc caccattcaa caccgagtcg caaattatat agagagcctt aaattgtccc 1620 aaatttcaca aacatctgaa ctatgtgctg aaacagctct ttccatacta atggaagcaa 1680 agctttctaa acaccaatat tgcattataa gaagtgctgc aattgcgaat agctcttctt 1740 ttttaccatc ttacgaaaaa cttaaagaag ccaagaaaat gtgttaccct gaagatataa 1800 cggtaactga aataagcgcc gaagtaaaac tgcaatcttt attaaatcac acttgtttac 1860 gaattataaa aactcaacaa gatgtaattg actccttgaa tgtaaatatt ttatcaaatt 1920 ttattttaat tcttaagtgg gggtttgatg gcagttccgg tcatagcgaa tataaacaga 1980 gatttgccga tggaagaaac tcagatgcaa atgtttttct gacttcactt gtaccattac 2040 aacttttgtg tacaaattca gttttaggtc aagatgttat tctatggaaa aataggacac 2100 cctcatctac tcgattttgc cgacctatca gactgcaatt tcttcacgaa aatgtccata 2160 caagcattaa tgaaaaagac tatattgaag ataaaattaa atcactggtt cctctcatca 2220 ttgttcaaca taaaatagaa attaaaatac attataaatt ggttatgaca atgattgacg 2280 gaaaagtttg caatgcaata acattgacga aatcaacaat gcggtgttat ttgtgttctg 2340 ctacttcgag tcaattcaat aatatcaatt tcgttaaaga aataaaagtt gacgaatcaa 2400 aattggaata cggtttatca acgttgcacg catggattcg cttttttgaa tgttttttgc 2460 atcttgggta taaacttgga ataaaaaaat ggcaagcacg ttctgacatt gaaaaaagta 2520 taatattaca acaaaaattg ttaatccaga atgcttttaa gctgcagcta gggttaatag 2580 ttgatcgacc gaaaccagga tgcggtaata caaatgacgg aaatacagct cgaagatttt 2640 ttaaaaactc acaagtgtct gctgatattc taggagtgga tttagaacta ataaatcggt 2700 ttcatgtaat tttacaggta ttatctagtg gctttcctat agacgcagac aaacttgata 2760 actattctac cgaaactgca ctactttttg ttgaaaagta tccttggtac aacatgccta 2820 caacagtaca caaagtactt attcatggag ccttaataac aaaaacagca atactaccca 2880 taggacaact atctgaggat gcacaggaat cgagaaacaa agacttaaaa aaatatcgcg 2940 aagatttctc taggaaaaat tccagaaaaa atacaatgca tgacgttttt tgcagacttc 3000 tggtttcgtc agatcctgca atatcttcac ttttaaaatc tcatgcaaaa gttagcagat 3060 ctttgtcttc tcaagcttta caaatgttat tacctccgga gattaacgaa aactctgaaa 3120 ctcattcgtc tgtgattgag tttatcaaca atatttcaat aataaatgat tcagatgaca 3180 gtggtgattt attttaaagt aatataaatg aaacaattat gataaatgat gcaaataaaa 3240 gttttccttg ttttccttga acaaataaat aagtagaata aataatgaat ttttcaattt 3300 atttttctta acattgcggc ataataatac ggtcataata cattatatat cggtattata 3360 tataatgttt atatatgtaa tgtaaataat acattagtct attgaactta aatgtcgaat 3420 tagaactaat tggttaaaca cgaaggccca aaggcatttc taatgattaa tgtttttaaa 3480 acaccgccat tttggatctg ccattattga attttggtac acttatgtaa aattcaaagt 3540 tggggattca aaaaaatgct atacagaaat tttccattaa ttataatcat tttttgactt 3600 ttggccaggt tttttgaaat tccgcccact gtgcactggt cacaaaaacc agtc 3654 // ID hATm-42_HM repbase; DNA; INV; 3856 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-42_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3856 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1936-1936 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1297..3195 FT /product="hATm-42_HM_1p" FT /translation="MVKFGNVDKIFSENVHRAQEKLLKMSEKIQRHAEKNK FT KKESNYSEVDFLSTDSENQSEYDEIYLPPAKVRKKQKRFQNDLKVCLQLSL FT GDILHNWIPIMTRFRIGVRPGMCLLNSLYKAGGVDIDNIPISRSTLNRKTH FT TIVESEAQMIREENLDKVRGLKIVVHFDTKILKRYNWEKGVCKSEDRIAIS FT ASSPESGPLDFLLGILKIESSKGSDQATAIQAMLEYYEMSDQIIGLCSDTT FT SSNTGRKKGAVSIIVSYALGRPVLWLMCRHHIYERHVAHVMKEIFGPTNGP FT SRKLYVTLQKIWPQIYRDVNKLERIVKFDWTQDAFRPNSLLYKLASDTKEF FT CITALRESTFQRGDYRYLCELLAFFLGAELPNFSFKQPGAHHEARFMADCI FT YLLVIQMTQMYYPVDSESGNISTLSNLKAATNYIVFFHGFFFLKCAMASQA FT PSNDLMAFNIAFQLQTMDEFKEFAVVGKVLYQSLSRHTWYLSPQQVVFALA FT DKELKVEVKNNMLNKLLSYDVPVLQDLIKKKPESIIEIVPTSKLDDFINEQ FT SYLLFLLLDISKKELLLWKEKGIEACESEKTSQSFTYFSKCVRTLAVVNDR FT AERHIKLIQDFVERTHNEDRLQDTLQVFYYIHAI*" XX SQ Sequence 3856 BP; 1353 A; 607 C; 668 G; 1228 T; 0 other; ttagggtgtc tcaaaaaagt atcgtatatt gaattttgat gtgcggaata ctgcattctg 60 cgtcattttg aaaatgaaga cgaacaaaaa aaaaattaag tctctagctc tagcggaagg 120 ttatttaggt caatttcctt ttttttatat atatataaat tttgagataa aattaatttt 180 attcttcttc aaacaaacac atgtttcgac aaaagaatgg cgaacaatag aaattaatga 240 tgcgtatcta ataactcgag ctaaattatt attaaatcac ttttaaaagt tgttttcaac 300 aaaaacaaaa ctctttactt actaaaactc tttacttacc aaaactcagg tattaacttg 360 attttaaaat ttagtctttt tttttcattg ataataatca ttattagata aaaatgtggg 420 atcatttaat attatattta aataaattca atattatgtt atcaaagttt tctaattaaa 480 cgatggcaat aaaataataa taataaaaaa ctcgtccaat ataatgtgca atgcattgtg 540 catccaacta ttctgatttt attttaaatg aaaaatagtt gattttagaa aatgcctcta 600 gaccaagagc gcgacaatca gataatttat tcacttaaag cagatattct taacgcgtcg 660 aagattacaa ggaagcgaag acaattttat ttgcttgggt ttcctttgca gagttttgcc 720 cccaatagac ttcccacaaa tggagaaatg ctcagaaggt actactggct taatctcgac 780 aaaaacacaa ggtaaattag taactaacca acgtaattaa tgtaaataac taactaacta 840 atttttaaaa tgttttagta ccacacagat caacgtcaaa attggctgcc cagttgcttc 900 tggtaagact tatttgagat gctccaggtt tgctgacgga gggtgcggaa agaaaaccaa 960 gcagttaaat acaaacgtct gtattctgag ggagcttgtc actttgtggg aacaagctgg 1020 gtttagcgat actttcactg tgaccgagca aactataaag accagaatta ccaagaaagt 1080 taaagagtat cacactttaa aaatgcttaa aacactagat ttgcccgagg gcagctacaa 1140 caagaaggtt gacaaatttt tgaaagagag tagtgctctt ttttttctat tcaaaaatca 1200 aacataattt atgatttaat taaaattgat aaaaataggg atgacgaggc taagagtgaa 1260 gacataaaat ttttgaggca ttgtctgaga ggggatatgg ttaaatttgg caacgttgat 1320 aaaatattta gtgaaaatgt tcacagagca caggaaaaac ttctaaaaat gtcagaaaag 1380 atccaaaggc atgcggaaaa aaataagaaa aaagaaagta actattcgga agtagacttt 1440 ttaagcacag atagtgagaa tcaaagtgaa tacgacgaaa tttatttacc tcctgctaaa 1500 gtccggaaaa aacaaaaaag atttcagaat gatttgaaag tttgccttca attaagtctg 1560 ggagacattc ttcacaactg gattccaatt atgaccagat ttagaattgg agtcagacca 1620 ggtatgtgtc ttttgaattc tctttacaaa gctggtggtg ttgatattga taatatccct 1680 atatcaagat caactctaaa caggaagacc catactattg tggagtctga ggcacagatg 1740 atcagggagg aaaatttaga taaagtaagg ggtttgaaaa ttgtggtaca ctttgacaca 1800 aaaatattga aaagatacaa ttgggaaaaa ggagtatgta aaagtgagga tagaattgcc 1860 ataagcgcat cttcacccga gtctggtccg ctagacttcc ttctgggcat cttaaaaatc 1920 gaaagttcta agggaagtga tcaagctact gccattcaag ctatgcttga atattatgag 1980 atgtctgatc agattattgg attgtgttcg gacacaacat ccagtaacac tgggagaaaa 2040 aagggtgccg taagcattat tgtttcatac gccctaggta gaccggttct ttggctaatg 2100 tgtagacatc acatctatga aagacatgtg gctcatgtaa tgaaggaaat atttggtcca 2160 accaatggtc ccagcaggaa gctatatgtt actctccaaa aaatatggcc gcagatttat 2220 agagatgtaa acaaattaga gagaattgtg aaatttgact ggactcaaga tgcctttaga 2280 cctaactctt tgttgtacaa acttgcttcg gatacaaaag aattctgtat cacagctctt 2340 cgtgagagta ctttccagag aggagattac aggtatcttt gtgagcttct ggcatttttt 2400 ctaggagctg agttaccaaa cttctcattt aaacaaccgg gagctcatca tgaagcaaga 2460 tttatggcag actgtatcta cttgctggtg attcaaatga ctcaaatgta ctacccagtt 2520 gattccgaat ctggtaatat aagcacgtta agtaatctga aggctgcaac caactacatt 2580 gtattttttc acggcttctt ttttctgaaa tgcgcaatgg cttctcaagc accatcgaat 2640 gatttgatgg cattcaatat tgcttttcag ctgcaaacaa tggacgagtt taaagaattt 2700 gcagttgtgg gaaaggttct ttaccaaagc cttagccgtc acacttggta cctgtccccc 2760 caacaagtag tttttgctct tgctgataag gaactgaaag ttgaagttaa gaacaatatg 2820 ttaaataagc tgctttccta tgatgtcccg gtgttgcaag atctgattaa aaagaaacct 2880 gaatctatta ttgaaattgt tcctacgtct aaactagatg attttatcaa tgaacagtcc 2940 tatcttttgt ttttgctgtt ggatatttcc aaaaaagaac ttctcctctg gaaagaaaaa 3000 ggcatcgaag catgtgaaag tgagaaaacc tctcagtctt ttacatactt ctcaaagtgt 3060 gtaagaacac ttgcagttgt aaatgaccgt gcagaacgtc acatcaagtt gattcaagat 3120 tttgttgaga gaacacacaa tgaagatagg cttcaggata ctctccaggt attttattat 3180 atacacgcaa tataattggc acttattagc aagtataata tgaatagtta atatataata 3240 tttacttaat atatttatat ttttaggttg tccagagaaa ccggcagcag atatcgaaaa 3300 aggctacaaa gaaagatctt aatactatta tttagttgag taatattttg tagttaaaaa 3360 tataggttta tatttgtatt attattaatt tttatttctt ttgaaattaa atgataaacc 3420 taatctccga attatttttt ttacttttca cactaaaata ttgcttcaag gtcaaaacat 3480 tgacgttttt actataattc gcaaaaatga tgctaaatgt atcgttttat cccatttaaa 3540 agtgaactca tctctaaact caattagtta ttttacaatg aaattaaaaa atttcatagc 3600 acgcaaattc acctgtttga cacgcaccaa caattttctt ttgttttaag taacaattgc 3660 tatttttata ttaagataac attcaatttt atctcaaaat ttatatatta aaaaaaaagg 3720 aaattgacct aaataacctt ccgctagagc tagagactta attttttttt tgttcgtctt 3780 cattttcaaa atgacgcaga atgcagtatt ccgcacatca aaattcaata tatgatactt 3840 ttttgagaca ccctaa 3856 // ID Copia-7_AA-I repbase; DNA; INV; 4207 BP. XX AC AAGE02017938; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_AA_; KW Copia-7_AA-LTR; Copia-7_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017938; Positions 10057 5851. XX CC Positions [1570-2073] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 286..4197 FT /product="Copia-7_AA-I_1p" FT /translation="MQEPANSPDCRGSARVCQGTKDGQRHLGRAEEQLRTD FT WNCRQVNPEKAVPRAAAGGRWRREGFLAPVREDSARAKDGGSQHAGGRRGV FT STTAGASRILRSAYHGSGETIQPQQLTLEYVKKRLLDEQVKRLNSSPAGES FT ENSGDSAFSGKKLGIKCFGCGKYGHKRAECPENRPAEDSSSTRLRKKNGKF FT SRKTKFGASVAEEMTFVATETMECLSVATVSKKNVDWILDSGATDHMVKSK FT TFFDEMHALDNKVRIAVAKSGQAVFAEYAGTVKVNMLIDGRKVPAVVKDVL FT YVPQLACNLFSVRRLEDNGMMVRISNGTVKVLKNGSVVATGRRAGQLYKMD FT IELRDVAYSASGGECEAGAMASKKDGFDLWHRRFGHLGTANMKTLWKHGMV FT ETTTKISDWPERCVCEVCLQGKQSRQPFEDGGERRASRPLEIIHSDVCGPF FT TQKTWDKKRSFVTFIDDFSHFTVVYLLESKDEVQSKFEEYCARVTAFFGTR FT VSRLRCDNGGEYTGKSFRKYCRTQGINVEATIPYSPQQNGVAERMNRTLLE FT KARSMVHEAGLHKSMWGEAVLTAAYLCNRSPTSALKEKKTPYEIWYGRKPN FT IDKMRVFGSVAHTWVPKEKRDKLDPKSEKNVMVGYTANGYRIWDPRKRRVT FT VARDVVFDEQKDGSVAAPNNAKQVLWRNDDTEDCEDEPGAPVVGQPNIAPQ FT PEKAPPDVNAEHITDDDEDADTALPSQPERVDCPAEATTRRSERERKLPGK FT FLDFITGSRASTAAECSTGVSEDHDDATMAFALNAETFVENLPTTIDGLRR FT RSDWQHWKKAIISEMESLQKNETWELVPKPPNKNIVDCKWVFKIKRDEAGN FT VNRYKARLVAKGFSQRKGYDYDETYAPVAKGTTVRVLLAVANQMGYHLHQM FT DVKTAFLNGQLKEEIFMRQPEGFESGDPSLVCHLKKSLYGLKQAPRSWNSE FT FHNFVIQLGFQRSNLDSCLYWWSDGKIVVYLLLYVDDIILVSNQMDVIAEM FT KKKLSTRFEMTDVGLLKMFLGLKIDRDQAKGTMKISQPKYVDDLLQRFGMA FT NCKPAPTPLEPNLKLERREDEAVTSEPYRELIGCLTYLALSSRPDISAAVN FT FFSRYQSAPTSEHWSHLKRILRYLKGTVNHGLAFRRNEGSAPLVGYADADW FT GNGPADRRSISGNVFQVFDGTFSWMTRKQSSVALSSTEAEYVSLSHAVCEA FT IWLRNLFLELGIKLDQPVVLHEDNQSCICIAQEPRDHKRMKHVDIRYNFIR FT EKLQDGTFKIQNIPTNEQLADLFTKGLSRGPFEMLRSKLGLFG" XX SQ Sequence 4207 BP; 1142 A; 949 C; 1240 G; 876 T; 0 other; ataggttatg ggcccagaac ggaagtagtt tatttgaaga aacgtttggt caagttctgc 60 ggaagaagtc aagatgagcg acgaaaaact ctggctgttc aacggtgaaa acttcggcaa 120 ctggaaattc aggattgaag tgctgatgga agacaggggc ttgctggagt gcctggaaac 180 ggcaattgcg gatcaagagt acgctgagga gcttccggac gatactgctg aggagaggca 240 gcggaagaag aagctactgg acgaacggat ggccaaggat cgacgatgca agaacctgct 300 aattcaccgg attgcagagg atcagctcga gtatgccaag gaacaaaaga cggccaaaga 360 catttgggac gagctgaaga acaacttcga acggattgga attgcaggca agttaatcct 420 gaaaaagcag ttccaagagc tgcggctggc ggaaggtggc gacgtgaagg ctttcttgct 480 ccggttcgag aagattctgc gcgagctaag gacggcggaa gtcaacatgc aggaggaaga 540 cgtggtgtgt caactactgc tggcgcttcc cggatcctac ggagcgctta tcacggctct 600 ggagagacga tacagccaca gcagcttacg ttggagtatg tgaaaaagcg gcttctggac 660 gagcaggtga agcgactaaa ttcaagtcct gcaggtgagt cggaaaattc tggagattcg 720 gcgttttctg gtaagaagct aggaatcaag tgcttcggct gcggcaagta cggtcacaaa 780 cgagctgagt gtcccgaaaa ccgtcctgct gaagattcta gctcaactcg gctccgaaag 840 aagaatggaa aattttcaag aaaaacaaag tttggagcga gcgttgccga ggaaatgacg 900 ttcgttgcta cggaaaccat ggagtgtttg tctgttgcta ctgtgtcgaa gaaaaacgtc 960 gactggatcc tggactcggg cgcgacggat cacatggtga agagcaagac tttctttgat 1020 gagatgcatg cgttggacaa taaggtgcgt atagcggtgg caaaatctgg tcaggccgtt 1080 ttcgcggagt atgcgggcac cgtgaaagtc aatatgctga ttgatggaag gaaagtaccg 1140 gctgttgtca aggatgtgct ctacgtccca cagttggcat gcaacttgtt ctcggttcgg 1200 cgcctggaag ataacggaat gatggtaaga atctctaacg gtacagtgaa ggttttgaag 1260 aacggaagtg tggtcgcaac tggcagaaga gctggtcagc tgtacaagat ggacatcgag 1320 ctacgagatg tggcttactc agcttctggt ggcgaatgcg aagctggtgc gatggcttcg 1380 aagaaggacg ggttcgattt gtggcatcga cgattcggcc acttgggaac ggcaaacatg 1440 aagaccctgt ggaagcacgg aatggtggaa acgaccacga aaatctcgga ctggccggaa 1500 cggtgtgtct gtgaagtctg tctccaaggg aagcaatcaa ggcaaccatt cgaagacggt 1560 ggagagcgac gagcgtctcg tccactggag atcatccatt cggacgtgtg tggtccgttt 1620 acgcagaaga cgtgggacaa gaagcggtca ttcgtcacgt tcattgacga ctttagccat 1680 ttcacggtcg tgtacctcct ggagtccaag gatgaggtgc aatcgaagtt tgaagaatac 1740 tgtgcacggg tgacggcgtt ctttggaact cgagtgtccc gactgaggtg cgacaacggc 1800 ggagaatata ctggaaaatc gttccgtaag tattgccgca ctcaagggat caacgtcgaa 1860 gcaaccattc catacagccc acaacaaaac ggagtggctg agcggatgaa ccgcacgctg 1920 ttggagaagg cccgttcgat ggtacacgaa gctggactgc acaaatccat gtggggcgag 1980 gcggtgttga ctgctgccta tctatgcaac cggagcccta catctgcgct gaaagagaag 2040 aagacaccgt atgagatatg gtatggacga aaacccaaca tcgacaaaat gcgagtcttc 2100 ggttcggtgg cacatacctg ggtgccaaag gagaaacgtg ataagttgga ccccaaatcg 2160 gagaagaatg tgatggttgg ttacactgcg aacggatata gaatctggga cccacgtaag 2220 cggagagtta cggttgcgag agacgtggtg ttcgacgagc agaaagatgg ttccgttgct 2280 gcgccgaata atgccaagca ggtgctgtgg cgtaacgacg atacagaaga ttgtgaagat 2340 gaaccgggag ctccagtggt gggccaaccc aatattgccc cccagcctga aaaggcacca 2400 ccagatgtga atgcagaaca cataactgac gacgacgagg atgccgacac cgcgctccct 2460 tcgcaaccag aacgtgtcga ctgccctgca gaagcgacca cgaggcgcag cgaacgggag 2520 cgcaaactcc ccggtaagtt ccttgatttt attaccggct ctcgagcatc tactgctgcg 2580 gaatgttcta caggtgtttc tgaggaccat gacgatgcga cgatggcgtt tgcgttgaat 2640 gctgagacgt tcgtggagaa cctgccgacc acgatcgatg ggttgcgacg gcgaagcgac 2700 tggcaacact ggaagaaggc catcatcagc gagatggagt cgttgcagaa gaatgagacg 2760 tgggagctcg ttccgaagcc acccaataag aacatcgtcg actgtaaatg ggtgttcaag 2820 atcaagcgag atgaagctgg caacgtcaac cggtacaaag cccgcctggt cgcaaagggc 2880 ttttcacaac ggaagggtta cgactacgac gagacgtacg cacctgtagc gaaggggacc 2940 acggtgcgtg tgctgttggc ggtggcgaac cagatgggat accacctaca ccaaatggac 3000 gtcaaaaccg cgttcctcaa tggacaactg aaggaggaga tattcatgcg acaacccgaa 3060 ggattcgagt ctggtgatcc tagccttgtg tgtcacctga agaagtccct ctacggattg 3120 aagcaggctc cccggagctg gaactcggag tttcacaact tcgtaatcca gctgggcttc 3180 caacgttcca atctagatag ctgtctttat tggtggagcg atgggaagat cgttgtgtat 3240 ctcctgctat atgtggacga cataattctg gtctcaaacc agatggacgt gatagccgaa 3300 atgaagaaga aactttcaac aagatttgaa atgactgacg tcggcttatt gaagatgttc 3360 ctcggcctca agatcgaccg agatcaagcg aaaggaacga tgaagatcag ccagccgaag 3420 tacgttgacg acctcctgca gcggtttggc atggcgaatt gcaaacctgc accgacgcca 3480 ctggaaccga acctgaagct agaacgccgc gaagatgaag cagtgacctc cgaaccatat 3540 cgagagctga taggatgctt gacgtacctg gcactgtcgt caagaccgga tataagtgct 3600 gcggtcaact tcttcagcag gtaccaatct gcaccgacaa gcgaacattg gagtcacctg 3660 aagcgcattc tccgctactt gaagggtact gtgaaccacg gtctggcatt ccgaaggaac 3720 gaaggatcgg cgccactagt tggatatgct gacgctgatt gggggaacgg cccggccgac 3780 cgccggtcca tctcgggtaa cgtgtttcaa gtgtttgatg gtacattttc ctggatgact 3840 cgcaagcaaa gcagcgttgc cctctcttca actgaagctg agtatgtttc gctcagtcac 3900 gctgtgtgtg aagcaatctg gctgcgaaac cttttcctcg aactcggaat aaagctggac 3960 caacccgtag tgctgcatga agacaatcag tcttgcatct gcattgccca ggagccacgt 4020 gaccacaaaa ggatgaagca cgtggacatc cggtataatt tcattcgtga gaagctacaa 4080 gatggtacgt tcaagatcca gaacattccg acaaatgagc agttggcaga cctgttcacc 4140 aaaggattgt cacgtggacc attcgagatg ctacgaagta agctaggatt attcggttga 4200 gcagggg 4207 // ID BEL-42_AA-LTR repbase; DNA; INV; 652 BP. XX AC AAGE02017346; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-42_AA_; KW BEL-42_AA-I; BEL-42_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-652 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017346; Positions 7899 8550. XX SQ Sequence 652 BP; 224 A; 95 C; 135 G; 198 T; 0 other; tgtgaaaaac ccactgagct ggccgcacgg tttggctcaa ttacccgtcc atagcgatac 60 gaggtcgtta ggatgactta gtggatagag taggattaca ggaaagagac tcaaaacaaa 120 aataaaattg aacttagtag tgtgaacatt tgaagaatta gaataaatta ttatagcctt 180 gtttaatcgc ctatttgcaa cggtgtgaat aattaatagg ttgtgatctg tgcctagtgt 240 ttaccaagtt ttaaacaggt aaatttgaat tgagtgcatt tgaaatcaat attgaaattg 300 tatgattact tatggctagg ttacattgga agacaacaca accgttttcg tacgttctca 360 aacctagatt tactatcgga catagttcta aaccgtgagt aaataatcaa ttagattaca 420 gatggagaat taaatttaat actatgttta ggctagcgtt gtgaacaagt cgagatctag 480 gaaggcgatt ggtcagaatc actcattgca agaggtgtta aggagaaact atctttgtaa 540 gtaagaacaa aatttgttaa attgatgaga aactactgga tattaaaatt tacagtttga 600 gctgcttgta agctgctaca aaactttagc gtattccacc gaattccgaa ca 652 // ID Gypsy-255_AA-I repbase; DNA; INV; 5228 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-255_AA_; KW Gypsy-255_AA-LTR; Gypsy-255_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5228 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1111-1111 (2011). XX DR [1] (Consensus) XX CC Positions [2228-2605] - Reverse transcriptase CC Positions [4078-4548] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 710..2794 FT /product="Gypsy-255_AA-I_2p" FT /translation="MSGNIVGSIEHYVRGTSFSNYAERLKFIFEYNKVPEE FT SQKSLFITLSGPAVFEELKLLYPATNISTLNYTDIIKKLQERFDKKEPDLI FT QRYKFYNRSQTQFETAENFVLEVKLQAEFCDFGEFKDIAIRDKLVMGVYDE FT ALQQKLLGEEKLTLATAERMIVNWELAGERAKMISNKVQTNGVGSVRARLG FT HVPAKPKNNEDEHFDDRFQRSRGRDRNWNGGRSYNRASGEYRSRSRSGSRY FT RNYADRQRTELVCYYCGKRGHMVKKCFKYINDQRKPAVKFADAKEPENNLE FT DLFDRFKTNLEDSSEDESGELFCMMVNSNTNLNEPCLTNVLVEGKKVKMEI FT DCGSSVTIMGKQLYNRMFDMPVEKFSKKLVVVNGANLTIKGKVHVEVVLNG FT SIKHLDLIILDSNNDFLPLLGRDWLDEFFPQWRNAFSNSLFVRNINEERQK FT EAAIVAVKQKFSSVFHKDFSSPIVGYEADLVLKEDSPIFKRAYEVPYRLQD FT KVVEHLETLEQQNVITPIKTSEWASPVVVVIKKDQQIRLVIDCKVSINKLI FT IPNSYPIPIAQDIFAKLSGCKVFCSLDLEGAYTQLSLSEKSRKFVVINTIK FT GLFRYNRLPQGATSSAAIFQQVMEQVLEGLKYVSVYLDDVLIAGTDFADCC FT EKLHTVLDRLSKANIKVNWSKCKLFVSNLPYLGHVITENGLLPCPE" FT CDS 3553..5091 FT /product="Gypsy-255_AA-I_1p" FT /translation="MSIYDFEIAYRPSGQMGNADFCSRFPLEQMVPKENDQ FT EFVKNINFSQEFPVDFSLVATETKQDCFLQKVISYYQHGWPERIDKELLNI FT FSNQHDLEVINGCLIYQDRVVIPTSMQKGILKLLHCNHSGIVKMKQLARRS FT VYWFNINADIESYIRNCDPCNRMLITPKEKVETKWIPTNRPFCRIHADFFY FT LNQKVFLIIVDSFSKWIELEWMKYGTDAERVIRKFVALFVRFGLPDVVVTD FT GGPPFNSHAFVSFLEKQGIKVMKSPPYHPASNGQAERSVKLVKEVLKKFLL FT DKEVMNLQMEDQINLFLINYRNTCLTKEGSFPSEKVFNYTPKTMLDLINPK FT KAYVPKINNDANNSEVGVNPKTTSNRTRSMATVDPIFKLKTGETIWYKNNN FT SKDFTRWVEAKFVRPCSPTVFQILIGNVVAKAHRGQLRVPKTVAKPPPLNL FT TTPCRKNYKRARSISNEEDFPGYGEDDVATRKRRGLEVEKEGAGNEGVISP FT VVRRSKRLLEKNKTRTP" XX SQ Sequence 5228 BP; 1752 A; 844 C; 1157 G; 1474 T; 1 other; aatactaaaa gtgactacga ggaaaaagga gaatcgcgtg tgaagtgtta tagtgaaaac 60 tattagaaga aagatagtgg ctctaaagaa gatattgtat ggtgagcaac aaaggttgtc 120 gcaacaaagg tgattaaaag aaaagaaaag tgttttagaa ataaattgaa agggaaagaa 180 aagaaattag agttgcaaag gcacaaatta aattctagag gtcatctcgg tcattccgtt 240 acaaaggtaa caaaaagcgt tgctgggttc taagtgagtc acgctggtga cattttgttt 300 tctagataaa aaaaaaggat cttcagaggg ctgagggctc caacgggaga aaagtgattt 360 taccctggtt ccctagtgaa tatttcaacg ccagacctgg cagtggaaag tgtgaatttg 420 tgaaaacgaa gccattaaaa gtaatcgcta agccaaagat tggtggataa ggtcattaga 480 agcagattgg atcccagtgt ataggagatc cattactgga ttatagtggt ggtacagcga 540 gcaggtgtgt gagctgtgag atcgagtttg gcgcttcaag gtggtgagac gtgagtttcc 600 tctttggtga gtgattctta tataccatat accctataat ttgctctgat tgtgtatcaa 660 ctagctgagt ttttttttct ctctgcgttt gttttgattt tctaaggaga tgagtggaaa 720 cattgttggt tctattgagc actatgtgcg tggcacgtct tttagcaact atgctgagag 780 acttaaattt attttcgaat ataacaaagt accagaagaa tcgcaaaaat cattgttcat 840 taccttaagt gggcccgctg tttttgaaga gctaaaattg ctttacccgg cgacgaatat 900 ttcgactttg aattataccg atataataaa aaaattgcaa gaaagatttg ataaaaaaga 960 gccggatttg attcagagat ataaatttta taatagatct caaacacaat ttgaaactgc 1020 tgagaatttt gtattagaag tgaagttaca agcagagttt tgtgactttg gggaatttaa 1080 ggatatagct atccgtgata aactagttat gggtgtttat gatgaagcat tgcagcagaa 1140 attattaggt gaagagaaat taactttggc aacggccgag cgtatgattg taaactggga 1200 attagctggg gagcgcgcca aaatgatttc gaacaaggtt caaacaaatg gcgtaggatc 1260 ggtgcgagca cgccttgggc atgttccagc taagcctaaa aacaatgaag atgagcattt 1320 tgatgatcgt ttccaaagaa gccggggcag agatcgaaac tggaatggtg gtcgctctta 1380 caatagggcg tctggagagt acagaagcag aagtaggagt ggatcaagat atagaaacta 1440 tgctgataga cagagaaccg agttggtttg ttactactgt ggaaaacgag gacacatggt 1500 gaaaaaatgc ttcaagtata tcaacgacca gcgcaaacca gcagtgaaat tcgctgatgc 1560 aaaagaaccg gagaataatc ttgaggattt gttcgaccga ttcaagacga atttggagga 1620 ttcttcggag gatgaatcag gtgaactatt ttgcatgatg gttaacagta acactaattt 1680 aaacgagcca tgtttaacga atgtactagt tgaaggcaaa aaggtaaaaa tggagataga 1740 ttgtgggtcc tcagttacga taatgggaaa gcaattatac aataggatgt tcgacatgcc 1800 agttgaaaaa tttagcaaaa agttggtagt agtgaatggc gcaaatttaa ccataaaagg 1860 taaagttcac gttgaagtgg ttttaaatgg ttcgattaaa catttagatt tgattatttt 1920 ggacagcaat aatgattttc tgccattatt agggcgtgat tggttggatg aatttttccc 1980 tcagtggaga aatgcctttt caaactcctt atttgttagg aacatcaacg aagaacgaca 2040 aaaggaagct gctattgttg ctgtaaaaca aaagttttct agcgttttcc ataaagattt 2100 ttcgtcacca atcgtaggct atgaagcaga cttggtgctt aaagaagact cgcctatttt 2160 caaaagagct tatgaggtac cgtacagact gcaggataag gtggtcgagc atttggaaac 2220 gttagaacaa caaaacgtca taactcctat taagaccagc gaatgggcct cacctgtggt 2280 cgtggtcatc aaaaaggatc agcaaataag attagtaata gactgcaaag tttctattaa 2340 taaacttatt atacctaact catatcctat tcctattgcc caagatattt ttgctaaatt 2400 gtcagggtgc aaagtttttt gttctttgga cctagaaggg gcctacactc aactctcttt 2460 gtctgaaaag tctagaaaat ttgtcgtcat aaatacaatc aaaggattgt tcagatataa 2520 tagattacca cagggagcta cgtcaagcgc agccattttc cagcaggtta tggagcaagt 2580 acttgagggt ttaaaatatg tatcagtata tttggatgac gttctcatag cgggaactga 2640 ttttgcggat tgttgcgaga aactccatac ggtacttgac agactttcaa aagctaacat 2700 taaagtaaat tggagcaagt gtaagctctt tgtatcaaat ttgccatatt taggccatgt 2760 tatcacagaa aatggtttgc tgccttgtcc agaamaaaat aatacaatta gcaaagctcc 2820 agtgcccaaa aatgttaatg aattgaaatc gtttttgggc cttatcaatt attatggaag 2880 atttattcct aagttgtcgc ccaggctaca ctgtttgtac aatttgttga aaaaagatgt 2940 gagtttcatg tgggacaaca aatgcgagca agcattccaa aacagcaagc gacaactatt 3000 agatgcaaat atattagagt tttatgaccc gaataaaggc cggtttgcta ctgacaagaa 3060 aaggaatcgc ctatctcgat gcgtggaaaa tttctcccaa gaaatggtca agaaaatcac 3120 ttttttctga tcgcagtacg cagttgagaa gaaatgtcac ttcaaagggg gctctctgta 3180 ggaaatttct tgaccctttc agtcaaggaa tttctttact tttcttgagc aaaacagcat 3240 acttagctta aacctataat tgtagtgaca gatgcgtctg gatatggtct cggtggagtt 3300 atagcacatg ttgaaaatgg cattgaaaaa ccaattagct tcacttcatt ttctttgaac 3360 aaggcacaac gttcctatcc cattttacat ttggaggcat tggctttagt atgctgcgtg 3420 aagaagttcc ataagtatct ttatgggaaa acattcacag tgtacacgga tcataagcca 3480 cttgtgtcaa tttttggaaa acctggaaga aatacgttat acgtaactag actgcaaaga 3540 tacgtcttgg aaatgtcaat atacgatttt gagattgctt atcgaccgtc aggacaaatg 3600 ggcaacgctg acttctgttc tcgcttccca ctggaacaga tggtgcccaa agaaaacgat 3660 caagaattcg ttaaaaatat aaattttagt caagaatttc ctgttgattt ctcattagta 3720 gcaacagaaa cgaagcagga ttgttttttg caaaaagtaa tatcttatta tcaacatgga 3780 tggcctgaga gaattgataa agaattatta aacatatttt caaaccagca tgacctggaa 3840 gttattaacg gatgcctaat atatcaagat agggtagtta ttccaacatc catgcaaaag 3900 ggaattctga aattgctaca ctgtaaccac agtgggattg taaaaatgaa acaattggca 3960 cgaaggtctg tgtactggtt taacatcaac gcagacattg aatcttatat acgtaattgt 4020 gatccgtgta acagaatgtt gataactccc aaggaaaaag tagaaactaa atggattcct 4080 accaatcgac cgttttgtcg catccacgca gatttttttt accttaatca aaaagtattt 4140 ctgatcattg ttgatagttt ttcaaagtgg atagagttgg agtggatgaa atacggtaca 4200 gacgctgaga gagtgattcg taagtttgtt gctctgttcg tccgctttgg tctaccagac 4260 gtagtagtta ctgatggagg gcctccattc aattctcatg catttgtatc ctttttggag 4320 aaacagggaa ttaaagttat gaaaagccca ccatatcatc cggcaagcaa tggacaagca 4380 gaaaggtccg taaaactagt aaaagaagtg cttaaaaagt tcctgttaga caaagaagta 4440 atgaacctac aaatggaaga ccaaatcaat ctatttctta ttaactacag aaatacctgt 4500 ttgacgaagg agggaagttt cccttcggaa aaagtgttca actatacacc taaaactatg 4560 ttagatttga taaatcctaa gaaagcatat gtaccaaaaa tcaataacga tgctaacaat 4620 agtgaagttg gtgttaatcc aaagacgacg agtaatcgaa ctagatccat ggctaccgta 4680 gatcctatat tcaagctaaa aacaggggaa acgatatggt ataaaaataa caattcaaag 4740 gattttacta ggtgggtaga ggctaaattt gtaagaccat gttctccgac tgtttttcag 4800 atcttaattg gaaacgtggt ggcaaaggcc cacagaggac aattaagggt tcctaagaca 4860 gtagcgaagc caccaccgct aaacctgacg actccatgca ggaagaacta caagcgtgcc 4920 cgttcgattt ccaatgagga agatttccct ggatatggag aagatgacgt tgcgacacga 4980 aagcgtcgtg gtctggaagt cgagaaagaa ggggccggaa acgaaggtgt aatatctcca 5040 gtggtacgaa ggtccaaacg gctattagaa aagaataaga caagaactcc ttgaattcta 5100 taaccgatat aagcgtaaga cgcattcgag ttgaataata aatgaattgt tagatctgaa 5160 atattgtgag caatcaaaat atctaagtgc attaaaatgc attcggaatc ttttataggg 5220 gtgagaat 5228 // ID Kiri-5_CQ repbase; DNA; INV; 3316 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 16-FEB-2011 (Rel. 16.01, Last updated, Version 2) XX DE A Kiri non-LTR retrotransposon family from Culex quinquefasciatus DE - consensus. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-5_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3316 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 124-124 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS join(178..606,552..3017) FT /product="Kiri-5_CQ_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MTNTVNNVQELRFEPINATLKNNLSKENGFKVGHLNT FT CSIICHHDDVKNIFTRTGFDCIGFTETWLNSDIKDALIKFDNFEILRHDRN FT STTRVXGGGVALFVRPEYKTKIIKTSPRDSIEYMFCELKVVSCKLWLVLSI FT NPQPIKSSQLQVVVGVIYKPPTLXDFSFFFEDLGNISSTYNNIILMGDFNI FT NILVSNPMSNNFTRLINSFSLSIVNKSFPTHFHNTGSSLLDYFICSENLXS FT TLFSQVSVPAVSHHDLIYTVFNLPVPVASLKTISFRNFNRIDKEAVRNDFP FT NFNFDQLNGIVDCETLASNFNHKLIQLVNKHAPLKTKTILPTSCPWYSREA FT YNACVDRDLAYRFWKNNRTLENKLIYNRFRNRATNLIRIAKKQYSAKLFDP FT TLPSKTIWKNFKSLGFRDDPDNLDDKFKPEDVNNCFIQKIENYPTGNRQPT FT IIPDIVEEDQFKFRAVTEDEVHKAILRIKSDAIGYDEIPLKFVKLFLDLLV FT PYITLLYNKIISTNTFPTIYKLAIVKPIPKIKNPSNPNDLRPISILPSLSK FT GCEILLKDQIVQYVTFKKLITPFQSGFREHHSTSTIITKVCQDINASFDSN FT YVTVLILLDFTKAFDLIDHRIFIDKLKEQFQFSSDACDLMKSYLSDRSQLV FT QINGVRSQFRSTKRGVPQGSVFGPLAYTLSVNDLPSFITHCNKQSFADDTQ FT IYLSCPVDQLPNKISLINSDLSAISRYCKDNGLVLNTSKTKAIXFSRQNTV FT LPHLPPIQIDNENVEIVDECLNLGIKMNKHLTWXDHVSLIKKKVYSTLRTL FT NCHRHTLSIDTKIKLVHTLLMPHFIYGNVIFSKMNVDTERHLQVCFNSCIR FT FIYNLRRFDHVSQFQRSIFDLSLSKFYQFQLLLFLFKLINFKEPQYLFESL FT VFSRSNRTNNIIIPVHHTNFMGNSFAVRGARLWNDLPHNLKSITLFSEFKR FT RCKEHLKNSI" XX SQ Sequence 3316 BP; 1101 A; 569 C; 470 G; 1168 T; 8 other; ttgtttggat ttagatgata gtagatttgc ccagcagtac aggatttttt gttgttattg 60 ttattttgtt tgtttaagsa gtttatctga aaaccaaaac agcactatta tcttttgttt 120 tgccctgatt gctcttttcc tacgcgtgtt tttctctaca tagatttata tatttttatg 180 actaatactg tgaacaacgt gcaagaactg cgatttgaac caatcaatgc aactctgaaa 240 aataatcttt cgaaagaaaa tggatttaaa gttggtcatt taaacacatg ttctataatc 300 tgtcatcacg atgatgttaa aaacattttc actcgaacag gatttgactg tataggtttt 360 accgaaacat ggttgaattc tgatataaag gatgctctga ttaaattcga caattttgaa 420 attctgagac atgatcgtaa ttctacaaca agagtgagwg gaggaggagt agcacttttt 480 gtacgccctg aatacaaaac taaaattatt aaaacatccc caagagatag tattgaatat 540 atgttttgtg aattaaaagt agtcagttgc aagttgtggt tggtgttatc tataaacccc 600 caaccttgaw kgattttagt tttttctttg aggatcttgg taacatatct tcaacttata 660 ataatattat tcttatgggt gatttcaata ttaatattct tgtctcaaat ccaatgtcaa 720 acaactttac tcgtcttata aatagctttt ctctctcaat agtcaacaaa tcatttccaa 780 ctcattttca taacacaggt tcctcattgc ttgactattt catatgttca gaaaatctwc 840 matctactct attctctcaa gtttctgttc cagccgtgtc acaccatgat cttatttaca 900 ctgtttttaa tttacctgtt cctgttgcgt ctttaaaaac catatctttc agaaatttta 960 atcgcattga taaggaagcc gttagaaatg attttccaaa ctttaatttt gatcagctta 1020 atggaatagt ggattgcgaa actttggctt caaattttaa tcataaatta attcaactag 1080 tgaacaagca tgcccctcta aaaacgaaaa caattctacc aacaagctgc ccatggtatt 1140 ctagagaagc atataatgca tgtgtagatc gggatttagc ttaccgtttt tggaaaaaca 1200 atagaactct tgaaaacaaa cttatttaca atcgttttcg aaatcgtgct acaaatctga 1260 ttagaattgc aaaaaaacag tattctgcta aattatttga tccaaccttg ccatccaaaa 1320 caatttggaa aaactttaaa agtcttggat ttagggacga cccagataac cttgacgata 1380 aatttaaacc tgaagatgtt aataactgtt ttatccaaaa aattgagaac tatccaactg 1440 gaaacagaca acctactatt attcctgata tcgttgaaga agaccaattc aaattccgag 1500 ctgttactga agatgaagtc cataaagcta tacttagaat taaatctgat gcaatagggt 1560 atgatgaaat tccacttaaa tttgttaaac tttttcttga tcttttggtt ccatacatta 1620 cgcttttata caataaaatc atttctacaa atacctttcc aacaatatat aaactggcta 1680 tagttaaacc gatacctaaa ataaaaaatc catctaatcc taacgatctg agacctattt 1740 ctattttgcc atcattatca aaaggttgtg aaatcttgct gaaggatcaa atagttcaat 1800 atgttacatt caaaaaactt atcacaccat ttcaatctgg ttttcgagaa caccatagta 1860 catccaccat tattacaaaa gtttgccagg atataaatgc atcatttgat tctaactatg 1920 ttacagtact tatcttattg gacttcacta aagcctttga tttgattgac cacagaatat 1980 ttatagataa acttaaagaa caatttcaat tttctagtga tgcttgtgat cttatgaaaa 2040 gctacctgag cgaccgttcg caattagttc aaattaacgg agtccgatct cagtttaggt 2100 ccacaaaaag aggtgtaccg caaggatccg tttttggtcc gttggcgtat acattatccg 2160 ttaacgattt gcctagtttt ataactcatt gtaataaaca atcttttgcc gatgatacac 2220 agatttacct ttcttgccca gttgatcagc ttcctaacaa aatatctcta ataaactctg 2280 atttatctgc tatttctcgt tactgtaagg ataatggact tgttttaaat acttcaaaaa 2340 caaaagcaat awttttttcc cgtcagaata cagttcttcc tcacttacct cctattcaaa 2400 tcgacaatga aaatgtagaa atagttgatg aatgcttgaa cttaggtatt aaaatgaaca 2460 aacatcttac atgggakgat catgtatctc taataaagaa aaaggtctat tcaactcttc 2520 gaactctaaa ttgccataga cacacattat caattgacac taaaataaag ctagtacaca 2580 cactattaat gcctcatttc atttacggca atgttatatt ttctaaaatg aatgttgaca 2640 ctgaaagaca cctacaagtt tgcttcaaca gttgtatccg tttcatctac aatttaagga 2700 gattcgatca tgtttcacaa tttcaaaggt caatttttga tttgtctttg agcaagttct 2760 atcagtttca attgttgtta tttttattca aacttataaa tttcaaagag ccgcaatatc 2820 ttttcgaaag tttagttttc agtcgttcaa atcgcacaaa taatattata ataccagttc 2880 accacactaa ctttatggga aattcattcg cagttcgagg cgcacgtttg tggaatgatt 2940 tgccgcataa cttaaaatct ataactttat ttagtgaatt caagcgtagg tgtaaagagc 3000 atttaaaaaa ttcaatttga gttttgtttt ggtttttgat taagcaatgc cttgtattac 3060 acactctttt ataactacat acacagctga taatcagtct taaatttatt actgaattct 3120 acttaaacaa gtaatttgtc gtattatcga catttttata tctttagctt atttcgtttt 3180 agtttttaat ttagttgtta gtttttagtt ttaatgaaac ataaggtcct tatcatttaa 3240 gtcaatgtat acaaaaaagg ttgtaccttc ggtatattgt tgaaaaataa aatacaaata 3300 caaatacaaa tacaat 3316 // ID MIMO repbase; DNA; INV; 381 BP. XX AC . XX DT 03-AUG-2000 (Rel. 5.07, Created) DT 03-AUG-2000 (Rel. 5.07, Last updated, Version 1) XX DE Mosquito DNA transposon "MIMO" - a consensus. XX KW DNA transposon; Transposable Element; MIMO; MITE; TE. XX OS Culex pipiens OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RP 1-381 RA Feschotte C. and Mouches C.; RT "Recent amplification of miniature inverted-repeat transposable RT elements in the vector mosquito Culex pipiens: characterization RT of the Mimo family."; RL Gene 250(1-2), 109-116 (2000). XX RN [2] RP 1-381 RA Feschotte C.; RT "MIMO."; RL Direct Submission to Repbase Update (02-AUG-2000). XX DR [2] (Consensus) XX CC TA target site duplication. Similar to so called "MITES" CC which are likely to represent non-autonomous DNA transposons. CC Mimo is a family of miniature inverted-repeat transposable CC elements CC (MITEs) from the house mosquito Culex pipiens. The 381-bp CC consensus CC sequence is derived from a multiple alignment of MIMO copies, CC which CC share both length and sequence conservation (pairwise identity CC ranging CC from 75 to 96%). Copies possess 23-bp terminal inverted-repeats CC and CC are flanked by a TA target site duplication. CC The estimated copy number of Mimo elements in the C. pipiens CC genome CC is ~1,000. Despite evidence for recent mobility of some of these CC MITEs, it is not known how these elements have spread in the CC genome CC since none has been found to encode a protein. However, TA target CC site CC duplication and sequence similarity in terminal inverted-repeats CC suggest CC a possible affiliation of this mosquito MITE family with CC pogo-like DNA CC transposons. XX SQ Sequence 381 BP; 126 A; 68 C; 72 G; 114 T; 1 other; cagtagttgt tcggtaactg ggcgttgttt aactgggctg ctttttaact gggcgctcga 60 taactgggcc gtagcccagt taaaaagcag acaaacgtca aaaaaccaaa acaaaccgaa 120 atgaccgagg ggttaatgga tgcaaaaatc atattcaata aataaaaaaa cttttttcaa 180 acttttgttt aatttcaagt tacaattaaa gaaaaatgaa aagtaagcat tgtaaataaa 240 tttgagtaac ttttattttt atatttcatc tggaaaatat tatgcatcgg tggtcccctc 300 gttattttty gttcagccca gttaacgagc agcattcggt gactgggcta cgttctcagc 360 ccagttaccg aacaactact g 381 // ID Jockey-N8_CQ repbase; DNA; INV; 1839 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A non-autonomous Jockey-like non-LTR retrotransposon family from DE Culex quinquefasciatus - consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW nonautonomous; Jockey-N8_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1839 RA Kojima K.K. and Jurka J.; RT "Non-autonomous Jockey non-LTR retrotransposons from the southern RT house mosquito."; RL Repbase Reports 11(1), 593-593 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. This family encodes a protein similar to Jockey ORF1p CC but does not encode ORF2p. Thus it is a non-autonomous non-LTR CC retrotransposon derived from Jockey, like HeT-A. XX FH Key Location/Qualifiers FT CDS 112..1551 FT /product="Jockey-N8_CQ_1p" FT /translation="MPRAGRGRGSSSAASKNPRSSSTGRVQKGKHTNAAST FT AIAHGSVPGDVTDPSFLHVSYRPRESPVRTRQSTRASGSTSDQQQQQQSTS FT TTTTTTSNVPTSNEFEMLSGEENNTASDSSSAGDDDDDDELARKQGKQKKC FT NSPKERRPPPIFVLDTLADDVDELLEGLQYCLKISKTSVQVITHKKQNFDL FT VVKKLKLRNFKFFTFDPAEKVPVKIVLQGYPDRPISDLEEHLSGVKVKPRE FT VKVLSKSTTVTGTYTLYLLYFDRGSVKIQDLRQIKALDGFFVTWRFYTKNP FT TDAAQCHRCQKFGHGSRNCNLPPRCVKCGETHFTERCNLPRKDQLGENAQQ FT HKARVKCANCGGNHTANFRGCAARKKYLEEQDKKKKAPASRQPPKTSSTNA FT AAAGGGAVPSTNPAFPPGWGGSYARVAASGSGTSPGQADVTGDDLFTLPEF FT FALAGEMLTRFRACRNKAEQFQALSELMMKYIYNG" XX SQ Sequence 1839 BP; 512 A; 471 C; 450 G; 406 T; 0 other; catcgggact ggcaacacca gccagcaaca gtcaactgtt cggagctcct caaacttttc 60 gattttttcg ccgtaattaa cagtgtttac ccgcgagttt gctgctccag catgccaagg 120 gccggccgtg gccgtggcag ttcgagtgcg gcctcgaaaa acccgcgaag ttcatctacc 180 ggcagagtgc agaaaggtaa acataccaac gccgcatcga ccgccatcgc gcacgggagc 240 gtgcccggtg acgtcaccga tccttctttt ttacacgtgt cataccgtcc acgtgagtcc 300 ccagtacgga cgagacaatc gacccgggca tccggaagta cttccgacca gcagcagcag 360 cagcagtcca ccagcactac gacaacaacc acttccaacg ttcccaccag caacgaattt 420 gagatgctga gtggagaaga aaacaacacc gccagtgaca gcagcagtgc tggcgacgac 480 gatgacgatg atgaacttgc gcgcaagcaa ggaaagcaga aaaagtgtaa ctctccgaag 540 gaacgacgac cacctccaat ttttgttttg gatacgttgg cggacgatgt tgacgagttg 600 cttgagggac tccaatattg tctcaaaatt agtaaaactt cggtgcaagt gattacgcat 660 aaaaaacaga atttcgactt ggtagtgaag aagttgaagt tgcgaaactt caaattcttc 720 acatttgacc cagcagagaa ggtcccagtg aagatcgtcc ttcagggata tccggaccgc 780 ccgatctccg acctggaaga acacctgtcg ggcgtcaagg ttaagcctcg agaggttaag 840 gtgctctcga aatcgacaac ggtgacaggt acgtacacat tgtacctcct gtacttcgat 900 cgtgggtcgg tcaaaatcca ggacctacgg cagatcaaag cactggacgg cttcttcgtg 960 acgtggcgat tctacacgaa gaatccgacc gacgcagcgc aatgccaccg ctgtcaaaaa 1020 ttcggacacg gttcaagaaa ctgtaatctt ccgccccggt gcgttaagtg cggtgagacc 1080 cacttcaccg aaaggtgcaa tcttccgcgg aaagaccagc tgggggaaaa cgcccagcag 1140 cacaaagcgc gcgtcaagtg cgcgaactgt ggtggaaacc acacggcgaa tttccgtggc 1200 tgtgccgcgc ggaaaaagta cctcgaggag caggacaaga agaagaaagc gccagcgtcc 1260 cgccaacctc caaagacttc gagcacgaac gctgcagcag ctggcggcgg agcggttcca 1320 tcgaccaacc cagcgttccc tcccggttgg ggaggttcgt acgctagagt agccgcttct 1380 gggagcggta cttcaccggg acaagcagac gttaccggag atgatctctt cacgcttccc 1440 gagtttttcg ctcttgctgg agagatgctc acgcgctttc gtgcctgccg gaacaaggca 1500 gaacaattcc aagctctgag tgagcttatg atgaagtata tctacaacgg ataagctgcc 1560 ttgtgcagcg aagtttttcg acgtcaaaag acaaaacaaa ctgtgatcta gttttaagct 1620 tttctatctc tatccttttc cttagcaaat ttagaaggtt tttttttaaa ataatttttc 1680 cttctgttgg caaatccatt gttagaatat ccaattacat caaaatgaac tatagtacaa 1740 atcatagttg aaaggtactc caaaactcta ttaggttata agaattacga aattgtgaat 1800 tgattattta ctaataaaaa ctaattgaat cgaattgaa 1839 // ID Gypsy-18_DPu-I repbase; DNA; INV; 4654 BP. XX AC scaffold_318; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_DPu_; KW Gypsy-18_DPu-LTR; Gypsy-18_DPu-I. XX NM Gypsy-18_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4654 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 751-751 (2010). XX DR Genome; scaffold_318; Positions 3774 8427. XX CC Positions [3659-4150] - Integrase core CC LTRs are 91% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1115..4570 FT /product="Gypsy-18_DPu-I_1p" FT /translation="MVDSGANTSVIRHSIIASINHLIIPVSSILKMADGRS FT VESVGELELKVEWQSKKEKITVLVLMELSHDLILGTDWINQIGGISIYPST FT SSVIIEPFTDHQFVHSPPEIYRTDRTVIVPPLSLAFLPIPKQADERGASRS FT QCLIVNRSFSANPGAEWVIPNSIVTELGDHYYVSILNPSRKTITVQSGQQV FT SGIELIKEAQWSVLSDNVEVSSVSGGKFSTPEFLQQTLDTIPVEYRSQMEK FT LLTRYSHLFHENDPSFLKSTTVAEHHINTENHQPIRSAPYRVSHTEREAIR FT SQVEEMLQAGIIEPSTSPWSSPVVMVPKKNGELRFCIDYRKLNAVTVRDVY FT PLPRIDDFLDHLGGATVFTCLDLKSGYWQIPMGKESQQKTAFVTPDGLYQS FT KRMPFGLCNGPAAFQRTMDEILTGLKWNVCLVYLDDLVICGNTYQEHQDKL FT ETVFLAVEKAQLTLNLSKCHFAQHQILCLGHQVTCDGITPDPSKIDAIIDF FT PSPDKSEPKQRISVLRSFLGAVSFYRRFFDHFASLLAPLYELLKKNASWKW FT DRKHELAFQYVKMRLVNAPVLSHPTATGKFEVHVDASGVGLGAVLMQANLT FT TNEFHPISYLSRRLTSAEANYHSNELECLALVWALTKFRHYLYGQVFTVKT FT DNNVVRWLSQKKDIRGKLARWVLILQEFNFSIEHLKGTENKVADALSRHPV FT TGDDPSFPEVLRDVCNYHNYSKEELAAWQQGDSTIREPLLQLQGLRLDEGQ FT SNENLEKFVLENGVLYRKGDVHQRKFRLVVPSLLRREILSSCHDTPDSGHF FT GIAKTTEKISKNYWWPGLSLSVKSYVAACSFCQKNKSRTSIPVGKLQPISP FT PARIFSLIGIDHLGPFRLTTSGNRYLLVAIDYLSKWVIAKPVASTAAALIR FT TFIETEVIAQHGYPHRIITDRGTGFTAKFLQNQLQQWGINHSFITTEHHQS FT NGQVERINRCLVMAFKPFVNTTHSDWDTKITHATLAINTAKQESTKASPFE FT IVYGRQPEFPTERQFPWPPDDKESLQHFLRRVTRLRKKIQWRLVVQQRQVK FT ERYDKSRKKSPKFSVGDLVLVARTLRFPQLTQKFLPKFIGPFQIAEKRSSL FT IYLVESLPAARKKLWCRFPAHVNQLKLFKTPNEVDWPALGPTEA" XX SQ Sequence 4654 BP; 1364 A; 1088 C; 1026 G; 1176 T; 0 other; tggtgtcaga agtgggatcg ggccatagaa tggttggaat tttttccgac cgctcgattg 60 aaccaatgtt tgttgtttac tgtgatttaa aaaaaaaaaa aattcatgtt ttaacgtcat 120 taatgtttct cttctcctag tttttcgctt tctaaaaatc gacatggcag acgatcagct 180 tgccattcta attgggatcg tccaaaatca acacggtctg caggagcgta tggttgagca 240 gaatcaacaa cgtaatcatg agcttcccac ttatagcgga aaggcggaag aagatgtcct 300 tgagttcgta gaaaccgtca atcgggaagc agtggccgga aactggcccg aaaatcaaaa 360 actgcaactg gcaaaaggag cattacggga aattgcgtcc aaatggcgtt ggttacatga 420 tgcggaggca cctcggaatt gggagggatg gtcggctgct ttatgcgcag cctttcgcaa 480 acgctatacg tttggagaat gggagtgctt tatgcgcagc ctttcgcaaa cgctatacgt 540 ttggagaatg ggagaagatg gtaaccgcca aagtacagca gaccaacgag acaggaccgc 600 aatatgcatt gaccaaggct aaattgcgcc gccattgtcc ctatccaatg acggaggcag 660 atttcgtccc ttatcttgtt caaggaattc gccatcgaca gttccgtacg gttttgctac 720 agaaccttcc tcccaccact acagcgttta tccaggttta tggcctgctc gaacaaaata 780 gtgatagagc tcctgaacga gaatcggaac ttgaagctcg aattgaagct caaacgcaag 840 aattgtcaga gctaaggaaa cagttgggta aaaatcagcg attccaaccg tatcagactc 900 gaacggggcg acaggcggac accactggca gagaacgttc ttgttttcaa tgtaatcgac 960 ctggccacat gaaaagagat tgtcccgaaa ggaatcgttc ggaaaacgac aaggccggac 1020 cagtgggcca ggcccggcag taagtgctct tgaagtccca aataatcggc ccatcgttga 1080 cgtcacaatt ccggggatag gaattgtcaa ggcaatggtc gactctggcg ccaacactag 1140 tgttattcga cactccatca tcgcatcaat taatcatctc atcattcctg tgtcttcaat 1200 tctcaaaatg gcggacggtc gctctgtaga aagtgttgga gaattagaac tgaaggtcga 1260 atggcaatca aaaaaagaaa agataactgt gctggtttta atggagctct ctcatgacct 1320 catacttgga acagattgga ttaatcagat tggcggaatc tccatctatc catcaacaag 1380 ttcagtaatc atcgaacctt ttaccgacca ccaattcgtc catagtccac ccgaaatcta 1440 tcgaacagat cgtaccgtca tcgtcccacc cttgtcactc gcattcctac caataccaaa 1500 gcaagccgac gaaagaggtg ccagccgctc acagtgcctt attgttaatc gttccttctc 1560 tgctaatcca ggtgctgagt gggtgatacc gaactctatt gtaacggaac tgggagacca 1620 ctattacgtc tctattctca atccgtcgag gaagacgatc accgtccagt cgggacaaca 1680 agtatccggg atcgaattaa ttaaagaagc tcagtggtca gttttaagtg ataacgtgga 1740 agtgagtagt gtgtcgggag gaaaattctc aaccccagag tttctacaac aaactctcga 1800 cactattcct gttgagtatc gttctcagat ggaaaaactt ctgaccagat acagccacct 1860 cttccacgag aacgatccat cgtttctcaa aagtaccaca gtcgccgaac atcacatcaa 1920 cactgaaaat catcaaccga tacgatcagc cccctaccga gttagtcata cggaacgaga 1980 agccatccgc agtcaagttg aagaaatgct gcaagccggc attatagaac catccactag 2040 tccttggtcc tcaccggtgg tcatggttcc aaagaagaat ggtgaacttc gattctgcat 2100 cgattaccgg aagttaaatg ctgtcacggt ccgtgacgta tacccgctgc cgagaattga 2160 cgatttcctg gatcaccttg gaggagcaac tgtgttcacc tgcttagacc taaaaagcgg 2220 atactggcag atcccgatgg gtaaagagag tcaacagaaa actgcatttg tcacgccaga 2280 tggattatac cagagtaagc gcatgccctt tggactctgt aatggaccag ctgcatttca 2340 gcggacaatg gacgaaattc taaccggatt gaaatggaat gtttgtttag tatacctcga 2400 tgatttggta atatgtggta acacatatca agaacatcaa gacaagttag aaactgtgtt 2460 cctggccgtc gaaaaagctc agcttactct caacctgagt aaatgccatt tcgctcaaca 2520 ccaaattctt tgcctaggtc atcaggtgac gtgtgacggt atcactccag atccttcaaa 2580 aatcgacgcc atcatagatt ttccttcccc tgacaaaagt gaaccgaagc aaaggatttc 2640 agttctacga agttttctgg gagcagtttc tttttatcgg cgattctttg atcatttcgc 2700 ttcactcctt gcacccttgt atgaacttct aaagaaaaat gcctcctgga aatgggacag 2760 gaaacacgag ctcgctttcc aatacgtcaa gatgcgtctg gtaaacgcac cagtattgag 2820 tcaccccaca gcaacaggaa aatttgaagt tcacgttgat gcgagtggag ttggcctcgg 2880 ggcagttcta atgcaagcca atcttaccac aaatgaattt catcctattt cttatcttag 2940 tcgtaggctg acctcggcgg aagctaatta ccattctaac gagttggaat gcctagccct 3000 tgtgtgggca ttgactaaat ttcgtcacta cctatacggt caagtgttta cagtcaagac 3060 agataacaat gtcgttcgat ggctgtcaca gaaaaaagac attcggggaa aattggctcg 3120 ttgggtttta atcctacaag aattcaattt ttctatagag catttaaaag gaaccgaaaa 3180 caaagtagct gacgctcttt ctcgccaccc agtgactggt gatgatccat cttttcccga 3240 agtattaaga gatgtttgta actatcacaa ttattcaaaa gaggaactag cggcatggca 3300 acagggcgac agcactattc gagaacctct tctgcaacta caaggcctac gactggatga 3360 ggggcaaagc aacgaaaact tggaaaagtt tgttttggaa aatggtgtgc tctacaggaa 3420 aggcgatgtc catcaaagaa aatttcggct ggtcgtgcct tcactgctac gacgagaaat 3480 tctcagtagc tgccatgata caccagacag cggacatttt ggaattgcca aaaccacaga 3540 aaaaattagc aagaactact ggtggccagg actctccctt agtgtgaagt catacgtagc 3600 tgcgtgctca ttctgccaaa agaataagag taggacatcg attcctgttg gcaagcttca 3660 acccatttcc ccacctgcaa ggatattctc gttaatagga atagaccatc tcggtccttt 3720 ccgactaact accagcggta atcggtatct cctagtggcc attgattacc tgtcaaaatg 3780 ggtgattgcc aaaccagtag ccagcacggc tgccgctctc attcgaacgt tcatagaaac 3840 ggaggttatt gcacaacatg gttatcctca tcgtatcatc actgaccgcg gaactggttt 3900 tacggcgaaa tttcttcaga atcaactgca gcagtgggga atcaatcatt ccttcatcac 3960 aacggaacat catcagtcca atggtcaagt agaaagaatc aatcgatgct tggtaatggc 4020 cttcaaacca ttcgtgaaca ccacccactc tgattgggat accaagatta ctcatgcaac 4080 actagccatt aacacggcca aacaagagag tactaaagct tctccatttg aaattgtgta 4140 tggaagacaa ccagagtttc ctactgaacg gcagtttccc tggcctcccg acgacaaaga 4200 aagtcttcaa catttcctgc gtcgtgttac acgactaaga aaaaaaatcc aatggagact 4260 cgtcgttcag caacgccaag tcaaagaacg gtacgataaa agtcggaaga aatcgccaaa 4320 attctcggtt ggagacctcg tattagtagc aagaactcta cgtttccctc aattgactca 4380 aaaattcttg cccaagttta ttggtccgtt ccaaatcgcc gaaaaaaggt catccttaat 4440 ttatcttgtt gaatctcttc cagctgctcg gaagaagttg tggtgtcgtt tccctgcaca 4500 tgtcaatcaa ctaaaattat tcaaaacccc gaacgaagta gattggccag ccttggggcc 4560 gacggaggcc tgaaattgtg tcaaatttgt gttctctaat tcattccttt tagtttcggt 4620 ttttggtcag aaaaactcga gtcaggagag gccg 4654 // ID BEL-24_CQ-LTR repbase; DNA; INV; 206 BP. XX AC AAWU01010212; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-24_CQ_; KW BEL-24_CQ-I; BEL-24_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-206 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 202-202 (2011). XX DR GenBank; AAWU01010212; Positions 21544 21749. XX SQ Sequence 206 BP; 56 A; 39 C; 44 G; 67 T; 0 other; tgttcggcga gattttgttg tgaccacctt atgtgtaaat aaccttgatt ccgtttttat 60 cgccacctat tttgtacaga gtcttgttat taaacgtaaa gttgttgacc agaaaacgga 120 gcggaaagac gtgtgttttg tgtctcttcc gccaaattcg aaaccgagaa aatacagtcc 180 gctgaattct gatttatcga ggaaca 206 // ID BEL-7_CQ-I repbase; DNA; INV; 5755 BP. XX AC AAWU01032418; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_CQ_; KW BEL-7_CQ-LTR; BEL-7_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5755 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 167-167 (2011). XX DR Genome; AAWU01032418; Positions 2114 7868. XX CC Positions [4777-5367] - Integrase core CC 'AAACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 377..1792 FT /product="BEL-7_CQ-I_2p" FT /translation="MNLEVQEEEHESGDDTKKEVTPVKKQLEEQRRKQKIR FT EMEERLKVLVQQRGAVKGKLTRLKTALTQRTGTINPNLRKVNFLKMHLETV FT KACYSEYNGFQNEIYALGLSEDQKKVHESQYEEFEMLHNMLVVELNDLLDE FT VSKPVANAALVPVAAANAALQNYLPPLSVPLPKFDGTYETWYSFKSMFQNI FT MARYTNEAPAIKLYHLRDALIGKAAGVIDQEMINNNDYDAAWAVLEELYED FT KRAIIDRHIDNIFALPKITRDNAAELRKVIDICVKNTEALKKVNLPVDGLG FT EQMVVNLLASRMDKDTRRAWEATQKAGVLPTYEATITFLKEKRRIADKLEQ FT NSESDKVKPQRSVTKTKTLVVASEAKCSVCNQDHEVIKCEQFKLKSVNERY FT SHLRKHGLCFNCLRKVHVDELVPEVQQAAPYPASHRRAKEAGTRLPTRRQQ FT LPRIPAQKAALNPFHQPRRDRPSLAKV" FT CDS 2200..5727 FT /product="BEL-7_CQ-I_1p" FT /translation="MLLGAGIFWDLMKAKRITLAANLPSLRDTELGWVVGG FT VMSECTPVIARTFCTVNEDEELNKLLNRFWEIEGVEDLRRPVTSTAEESLE FT HFRSTYSRKPDGRIVVRLPFNERKSDLGASRDMAIRRFLNLERRLDQQPDL FT KREYAKFIHEYEQLGHMKEVDVGPSEPPGSAYYLPHHCVLRPSSTTTKLRV FT VFDGSAKTSTGVSINDALKVGPTVQNDLLSILLNFRCYRYVFTTDIPKMFR FT QIELHPEDTPYQRILWRDDRSQLLKVFELKTVTYGLASSPFHATMALNQIA FT EDGVKEFPLASAALKKSFYVDDGLCGAQSVEDARLLSRELRKLLNTGGFDA FT HKWCANDPAILEDIPEELWGTTFDLKDAAAKSVVKTLGVAWNASQDWFTFS FT VLPPEEETKPLTRQKVLSEIAKFFDPLGLAGPVVTTAKLILRKVGTLKNIG FT WLDPVPESVANEWRRFRKQLPALNDFRVSRCVYDEAAERVELHGYSDASDD FT AYGASVYARNIYPDGEITMRLICSKSKLLPKKVKNVPKELSTPKGELEGAL FT LLAELVDKLVNVVDVCFDSVNLWCDSQVVLSWIAKKPDDLELFMANRVRKI FT QQLTSKYRWGYIPTDDNPADLISREVMPQNLIQRVIWKDGPVSMNNRTIEV FT VQPPLLDGADLPGLRSTKCLALAAPIQRLRIFDKLGDFRRLLRSMSLVVRF FT ANYIISGRKVVVKGLPTSEERAAALKLILRLVQRETFQTELLALKENHSHR FT LRGLNPFIDPDDGLLRVGGRIKQAFVPYDSRHQILMPAKHPVTESFIRYEH FT VKNLHVGQKGLLALIRQRFWPMNVKTTIRKVIRGCVTCFRVDPTKTVQLMG FT DLPSYRVQPAPAFFNTGVDFAGPFLIKSLTAARRPMMTKGYVFLFVCMSTR FT AIHLELVSSLTTEGFLAALRRFTGRRGLVNKMYSDNATNFAGSETELERLA FT KLFAEEQHIEQVEEFCSSRGITWSFIPPRSPHFGGIWEAGVKSVKHHLKRV FT VGNQKLTFEELTTVLVQIEAVLNSRPLTPCSNDPNDLTAVTPAHFLIGREM FT QAVPEPSYLQLKQSTLSRWQHVQAMQQQFWKRWIAEYLPELQNRQKWFKTT FT KIQPGALVLISDPNAPPMQWQLGRIIALHPGKDDVTRVVTLRTAKGECKRG FT VSEICLLPLDQETTEED" XX SQ Sequence 5755 BP; 1402 A; 1497 C; 1770 G; 1086 T; 0 other; ttttggtcct tcatggccgg atagctgaag tagtgtccgg gagtgaaaag tgacggatta 60 accggtgggg aaatccgaat cgaactttcg cgagcgaagt gtttgcgagt gaaggcgtgg 120 ggcgcggggc gtagcaccca cgcgcgaaaa agtgctttat tccggaagcc agtggacggg 180 cggctgtaga gcagtccgaa aagagacggt tccggaagaa ataaagtgcg aaaaccgacc 240 cttggggtcc gcgtgtgaaa aaaactgcga caaaagaaac ggagtggttc gtggggcgta 300 gtagccacga ggcgagaaag actcttccgg cttggggcgt agcacccttc ttgtgaaaaa 360 aagagcatgg aaaagcatga atctggaagt gcaagaggaa gaacacgaaa gcggtgacga 420 cacgaaaaaa gaagtgaccc cggtgaagaa gcaactggaa gagcagcgtc ggaaacagaa 480 aatccgagaa atggaggaaa ggctgaaggt gctcgttcag cagagagggg cagtgaaagg 540 gaagctcacg cgcctgaaga ccgctcttac gcaacgtacc gggacgataa atccgaactt 600 gcgtaaagtg aacttcctca agatgcacct ggaaactgtc aaggcctgct actccgaata 660 caacgggttc cagaacgaga tctacgcgct gggcctctcg gaagatcaga aaaaggttca 720 cgagagccag tacgaggagt tcgagatgct gcacaatatg ttggtcgtcg agctcaacga 780 tttgttggac gaagtgtcca aacccgtagc gaatgctgcc ctggttccgg ttgctgccgc 840 gaacgccgct ttgcagaact acctcccgcc gttgagtgtg ccactcccga agtttgacgg 900 tacctatgag acgtggtact cgtttaagtc catgttccaa aacatcatgg cacggtacac 960 caacgaggca ccggcgatca agctgtacca tctccgggac gcgctgatcg gcaaggcagc 1020 cggcgtcatc gatcaggaaa tgatcaacaa taatgattat gacgcggcgt gggctgtctt 1080 ggaggagctg tacgaggaca aacgtgcgat catcgatcgg cacattgaca acatcttcgc 1140 tctgccgaag atcacccgcg acaacgccgc agagctccgg aaggtcattg acatctgcgt 1200 gaagaacacc gaagccctga agaaagtgaa cctcccggtc gacggactgg gggagcagat 1260 ggtcgtgaac ctgctggcct ccagaatgga caaggacacg cgcagagcat gggaagcaac 1320 ccagaaggcg ggtgtgttgc cgacgtacga ggccacgatc acgttcctga aggagaagcg 1380 cagaatcgcg gataagctgg agcagaacag tgaaagtgat aaagtgaaac cgcaacgttc 1440 ggtaacgaaa acgaagacgc tggtggtggc cagcgaagca aagtgttcag tgtgtaacca 1500 ggatcacgaa gtgattaagt gtgagcagtt caagctgaaa agtgtgaacg agaggtacag 1560 ccacttgcgg aagcatggcc tgtgctttaa ttgtctgagg aaggtgcacg tcgacgaact 1620 cgtgccagaa gtgcagcaag cggcaccata ccctgcttca caccgacggg ccaaagaagc 1680 aggaacccgg ctgccaacac ggcgtcaaca gttgcccaga atcccagcgc agaaggccgc 1740 actcaatccg ttccaccagc cacggcggga tcgtcccagt ctggcgaagg tctgacacta 1800 tgcaccacat ccgaggcccc aaggaagcaa attctactct cgacggctgt agtgctggtt 1860 tacggcgccg gcagcgtacc ctacctgtgt agggcgctga ttgactcgtg ttcacagaac 1920 catttcgtga ccgagcgttt cgctaacctg ttggcgagca aaaaggaacg ggctgattac 1980 caggtcagcg gattgaacgg gggaacgacc agaatcagtc acctggtgcg cgcgaaggtc 2040 aagtcccgcg ttggcaactt tgctgccgac ctcgagctcc tggtggcccc gaagatcact 2100 ggcgacgtgc cggtgaagac gatcgacatc gctgggtgga acttgccacc agacgttgaa 2160 cttgcggatc ccaactttaa ccagcgaggc cgagtcgaca tgctgctcgg agcgggcata 2220 ttctgggatc tcatgaaggc aaagagaatc acgctggcgg cgaacctccc atcgctcaga 2280 gacacggaac tgggctgggt ggtcggtggc gtgatgtcag aatgtacacc agtgattgca 2340 cgtacgttct gcaccgtgaa cgaggacgaa gagctgaaca agctgctgaa ccggttttgg 2400 gagattgaag gagtcgaaga tctgcggcga ccggtgacgt cgacggctga agagagtctc 2460 gagcacttcc ggagcacgta cagtcggaag ccagacggga ggattgttgt tcgtcttccg 2520 ttcaacgagc gcaagagtga cctgggcgcg tcacgagaca tggcgatccg tcggttcctg 2580 aaccttgagc ggcggctaga tcaacaacct gacctgaaac gagagtacgc caagtttatc 2640 cacgagtacg agcagcttgg acacatgaag gaagtcgatg tcggaccgtc cgaaccacct 2700 ggatccgcgt actacctgcc gcatcactgc gtcttgcggc ccagcagtac aacaacgaag 2760 ttgagagtcg tgtttgacgg ctcggcgaaa acgtcgactg gagtgtccat aaacgacgca 2820 ctgaaggttg gccccacggt acaaaacgat ctgctgtcga tcctgctgaa cttccgctgc 2880 taccggtacg tcttcactac ggacatcccg aagatgttcc gccagattga gctgcacccc 2940 gaagacacgc cgtaccagcg aatactgtgg cgcgacgacc ggtcgcagct gctgaaggtg 3000 tttgaactga agacggtgac ttacggcctg gcgtcatctc cgttccacgc gacgatggcg 3060 ttgaaccaga ttgcagaaga cggcgtgaag gagttcccgt tggcgtctgc agcgttgaag 3120 aaatcgttct acgtggacga cgggctctgc ggagcgcaga gtgtggagga tgcccgattg 3180 ctgagtcggg aactgcgcaa gctgctgaac accggcggct tcgacgcgca caaatggtgc 3240 gcgaacgatc cagccattct ggaagatatc ccggaggagc tctggggtac gacgttcgac 3300 ctgaaggacg cggctgcaaa gtcagtcgtg aagacgctgg gtgtggcgtg gaacgcgtcg 3360 caggactggt tcaccttcag cgtgctgcca cccgaagagg agacgaagcc gctgacgcgg 3420 cagaaagtac tcagcgaaat tgccaaattc tttgatccac ttggcctggc tggtccggtg 3480 gtgacaactg cgaagctgat cttgcggaag gtcggcactt tgaagaacat cggctggctg 3540 gatcccgttc cggagagcgt tgccaacgag tggcgtcgat tccggaagca gctaccagca 3600 ttgaacgact tccgggtctc ccggtgcgtc tacgatgaag cggcagaacg cgtggaactc 3660 cacggttatt ccgacgcgtc ggacgatgcg tatggcgcca gcgtgtacgc acgcaacatc 3720 tacccggacg gcgagatcac gatgcggttg atttgcagca agtccaagct cctcccgaag 3780 aaggtcaaga acgtcccgaa agaactcagc actccaaaag gcgaacttga gggtgcgctg 3840 cttctcgctg aactggttga caagttggta aacgttgtcg atgtttgctt cgattcggta 3900 aacttgtggt gcgactcgca agtggtgctg agctggatcg cgaagaagcc tgatgacctg 3960 gaacttttta tggccaacag ggtgaggaaa atccagcagc tgaccagtaa gtaccggtgg 4020 ggatacatcc cgacggatga caatcctgcc gacctgattt cgagagaggt gatgccccag 4080 aacttgatcc agcgggtcat ttggaaagac gggccggtgt cgatgaacaa tcgcacaatc 4140 gaggttgtgc aaccccctct tctcgacggc gcggacctcc cgggattgcg atccacgaag 4200 tgtctggcgc tggcggcgcc gattcagcga ttgcggatct ttgacaagct cggagatttc 4260 cgccgactgc ttcgcagcat gagcctggtg gtgcggtttg cgaactacat catctcagga 4320 aggaaggtgg tggtgaaggg cttgccgacg agcgaggagc gcgcggcggc gctgaagcta 4380 atcctgcgct tggtgcaacg agagacgttc cagaccgagc tgttggcgtt gaaggagaac 4440 cactcgcacc gtctgcgagg actgaacccg ttcatcgatc ctgacgacgg ccttctgaga 4500 gttggcggtc ggatcaaaca agctttcgtg ccgtacgaca gccgacacca gatcctgatg 4560 cctgccaaac accctgtcac ggaatccttc atccgctacg agcacgtaaa gaacctccac 4620 gtgggccaga agggcctgct tgcgctgata cgccaacgct tttggccgat gaacgtcaag 4680 acgactatcc ggaaggtgat tcggggatgc gtgacgtgct ttcgagtcga tccaacgaag 4740 acggttcagc tgatgggaga tctcccgtcg taccgcgtac aacccgcgcc ggcgttcttc 4800 aacactggag ttgatttcgc tgggccgttc ctgatcaagt cgctgaccgc agcacgaagg 4860 ccgatgatga cgaaggggta cgtgttcctc ttcgtgtgta tgagcacacg agccatacac 4920 ctggagctgg tctcgagtct gacgacggag gggtttctcg ctgccttgcg gcgtttcact 4980 ggacggcggg gcctggtgaa caagatgtac tcggacaacg cgaccaattt cgctggatcg 5040 gagacagaac tcgagcgact cgccaagttg tttgcagagg agcagcacat cgagcaggtg 5100 gaggaatttt gcagcagtcg cggaatcacc tggagtttca tcccgccgcg cagtccgcat 5160 ttcggtggga tctgggaagc tggggtaaaa tcggtcaagc atcacctgaa gcgagtagtc 5220 ggcaatcaga aactcacctt cgaagaactg accaccgtgc tggtgcagat agaagccgtg 5280 ctgaattcgc ggccgttgac gccgtgctcc aacgacccga acgatttgac cgcagtgacc 5340 ccagcccact ttctcatcgg acgagaaatg caagcggttc cagagccgtc gtatctccag 5400 ctcaagcagt cgacgttgtc ccgctggcaa cacgtgcagg caatgcagca gcagttctgg 5460 aagaggtgga ttgcggagta tttgccggag ctgcagaacc gtcagaagtg gttcaaaacc 5520 accaagattc aaccgggcgc tttggtgctt atcagcgacc cgaatgctcc gccgatgcag 5580 tggcaacttg gtcggattat cgcactgcat cccggtaagg acgacgtcac cagggtggtg 5640 acactacgga cggcgaaggg cgaatgcaag cgtggcgtgt cagagatatg cctgcttccg 5700 ctggaccagg agacgaccga ggaggattga aatgcaccat ttcaatgggg gagga 5755 // ID Shinagawa-5_AAe repbase; DNA; INV; 2104 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 11-APR-2011 (Rel. 16.04, Last updated, Version 2) XX DE A non-autonomous DNA transposon family from Aedes aegypti. XX KW DNA transposon; Transposable Element; nonautonomous; KW Shinagawa-5_AAe. XX NM Shinagawa-5_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2104 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 842-842 (2011). XX DR [2] (Consensus) XX CC >88% identical to consensus. 8-bp TSDs. TIRs are ~110 bp long CC and composed by degenerate repeats. Related non-autonomous CC elements, named Shinagawa, are found in Aedes aegypti and CC Culex quinquefasciatus. CC The region 1766-1496 is an inserted FEILAI-1B-like element (~86% CC identical to the FEILAI-1B_AAe consensus). XX SQ Sequence 2104 BP; 679 A; 368 C; 353 G; 704 T; 0 other; gattgtatga catttgccag aaaaccattt gccagaatca attcgccaga atgatttttg 60 ccagaaaacc attccccaga atgtaccatt cgccagaaag ccattcccca gaatggacca 120 tttgccagaa aaccattccc cagaatcatt tttttgtaat ttattttcaa cctttatatc 180 tattgatcga caaacttata gaaaatttaa ggctacccaa ttttccgata cttgctatgt 240 tagtgtcttg aatttactca tattatttgt acttagatca taaaacattg taaagccaac 300 tttcagatta tttcatcaag agaattggac tattagcact atataaaagc atataaaaca 360 aagcaattga tcgatccgtg taatttaggc tattttgtta cattccttcg caaacaattt 420 tatgctcaca acgtgaaaga tgtgtggcag aactggcgta agattaactt tttttttgct 480 atatagctga cctgaggaaa aaaagctgag acgtttggca cgaattagtt tagttgaatg 540 gatttattgc tgaatagtgt atttttgcga actatcgcag aacatttaaa aagaacaacc 600 tttcattgaa agaagggaaa attatgcttg aaatatgata agtttagcgc caataattat 660 tttaacagta taactataat gataataaat ttataaacat tattcaagtc ttgaatttga 720 aaaacgtgtt tttttgtttt aaaataattt tacacaattt gcgacaggct accgaaaaac 780 tcttcaatat tcagtattga ccctgatttt ttgtttctgc tatctggtac cagagatatt 840 tagaaatccc ttggaggact gccacgtgga catgattcac ggtggaaacc gaaaaactca 900 tagatttttt ttatattcag tcacttgctg tacgaaacaa gtcggttatg aagattgatg 960 ttttaggaat gacttgagcc attgaagatt acagattgga taaaggtcat ggttagcatt 1020 ccatgatcga ctcactttca atacagctcc ttttttagga ctgtcagtga ctctttttcc 1080 gatttttgca tttttttttc atttcactta aatcctcttt gcagacatct tcattgttac 1140 cagtctattt ctgatttgtt cctaaatact ctaattattg aaaattgaaa tgacacagta 1200 ttgttaatat aacttccatt gcttttgtta gctgcaggca gcaaaacacc ttttcagtac 1260 aattatgaag aacagccaat gatataaaga agggtaaatc tcatatgaag catataagtc 1320 cttgtatttg ttagccctga agaacagctt aacatttaaa gaagggaaaa tcattatgaa 1380 aatgttgaaa atacagttgc ctataatagc ttgaataaaa tagtctggaa gatgatcatc 1440 agaacattgt aaaaatagtt atttcttaga aactatgaac acttcggaag ttaaacttct 1500 tatctggcgt tacgtcccaa ctgggacaga gcctacttct cagctaagtg ttcctatgag 1560 cacttccaca gttattaact gaacttttat tgccaattga ccatttttca tgtgcttatc 1620 gtattgcagg tacgaagata cttcgaaggg gaatcgagaa aatttccaat ccgaaaagat 1680 ctttgaccgg tgagattcga acccacttcc caaagcttgg tcttgttgaa tagctgttag 1740 tttaccgcta cagctatctg ccccccgtag tctataatac tattagttaa aaatatgttt 1800 ttaaacgaaa aactcatcta agaacacaat tattttctat ataaggcaaa atatataaca 1860 acaatttcca tatatatcgt acgctcaaaa gcataatcat tcgaaggatt tgacacgcct 1920 aagtattgct acgcatagca tagtaatgaa gtattttttt ctgggaaatg gtctttctgg 1980 ggaatgactt tctggggaat ggtccattct ggcaaatggt tttctggcaa gtgatccatt 2040 ctggcaaatt gattctggca aacgactttc tggcaagtgg ttttctggca aatgtcatac 2100 aacc 2104 // ID Academ-1_Aplcal repbase; DNA; INV; 6157 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Academ; DNA transposon; Transposable Element; Academ-1_Aplcal. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 6157 BP; 2045 A; 1216 C; 1222 G; 1674 T; 0 other; taggctatgg gctactggga gtgaacctcc cctccccccc cctggggatt tttcgccact 60 atgattttct gaaagggtag gacctaggga cttcccaaaa aaagtttgca ccaaaatgtt 120 ggcaatctac agagttactg cccttgattg tccagaaatc aagaaaatac ttgattttta 180 ctttggggtt ctctgtgaag ccaaaaatgt gttctttcat gtgacgtaaa attcaatgtt 240 atacgatgat gtaagtgttg taaaaatatt tttttgttaa tagattacaa ggtgcaaaga 300 ggaaaaaata aaattcaaag gctgatttct caaaatctat agaaatgatg attccattgc 360 ttcaaatctt ctataacccg gtacaatagg tttctgatag acacacactt tgaggcacaa 420 aatgagaagt attttatgca aaatgactaa atatctatga gattgctata ttagaaagca 480 aaactgcgta tgtcatttta aaaacgtggg caaataggcg tcatatgtac ttttgtctat 540 atagtctaga ccagtcatat gggcatgata aaggtgtgtt aaaaaacatt ttgttagttt 600 gaactaactt aaacttaaaa gaagacagaa gtaaaaaaat ttgaagtctg gtttctctaa 660 caacgtacaa ttttctttgt atgtttcagg gaattaatta ttgaattgaa cgtacaaaat 720 caagcatggc agaactgaaa tcagtacccc tgacattaaa gaaagagagg tcaaaatctc 780 aaaagaaaga aaagcaactt gaagaaagga catgtgtagt acataactgt aacttcaaag 840 gccatgatga aataaaaccc ctcagtgaga gacgctggga caccattcaa aaatcaaaaa 900 cacaacggtg tcttgcagaa actgacagtg atagactatc ctcaatttgt gaaagaatac 960 cgactgagtt tgactctgct caacatgggt ttcatgatta ctgctacaat acattcatca 1020 atttgaggag tctaagaaaa aggcattcat caggagagtc tgacaaatca gcacaagaag 1080 ggtcttgctc cactaggaaa aaggtaaagc aatcgttttc aacacgcctc ttgcctattc 1140 aatgcatttt ctgtgaaaag actaccaaat ggacacgcga cagcaagaga accaagaaag 1200 ttcataaact tgtcaaatgt gttaccagaa cagctgaggc ctctataaaa gaagctgtaa 1260 acacgaaatc tgactcaaaa cttcggggcc aaattgaaaa tgttgatctc attgcccaag 1320 aggcatggta tcatgaacca tgtagaaaag cttacacacg taaggagggt cgacgttcac 1380 aagtttctac atctaactac tgttcggaaa aagaaaaggc agaagctgaa gaaagaaaga 1440 aagctcaagc tgcagaggaa gaagcccact ccaaggcttt tcagcacgtc tgtcagtttg 1500 tggaagaaca tgtcactgcg ctggcccctt tgttaggatt actcttttga aagagatgta 1560 ttgcactttc atgcaaacaa attatccaca gttttacaac aagaactaca agacatataa 1620 gttgaaggac aaacttgtga agcacttcca gtccaggatc aacttctggc agagatattc 1680 tgggacaagt gatctcgtgt actcagacga aattccaact ggacaggctg ttggagttgc 1740 ctttgaaaat gcaacctctg aagagagact tgttatagaa gcggcaatgg tgatacgacg 1800 agcagtgctt gatggatgca atgaaagtca aaagttacct tggccaccaa cagacacaga 1860 tctgcgctca gaaactttgc agataccaca actattaaca acatttcttt cctttcttta 1920 ctccaagaac ggaaaaccac agtcaacaag atgccacagg agagttgttt ctacaggtca 1980 ggacatatgc tacaatgtga cgaaagggga gtggaaaatg cctaaacata ttatgattgg 2040 tgtatgtgtc cgtcatctaa caggaaacac tcagctgatc aatatcttaa accggcatgg 2100 acattctgta tcccactcat ttctcctaga aatggaaact gccatgtgtg acagtataca 2160 ggtttcttca ggtagtttgc ccccatccat catgcctgac aacaacctca tcactcactt 2220 ctgttgggac aattttgact tgaatgagga aactctatca ggagctggaa ccacacattc 2280 aactcatgga atagtcatac aagaaatcaa agagcagcca tatattcccc agccaacacc 2340 agaggttgac agaacaaaga aaaggtcaat ccatattcca caaaaacagc ttgacccatg 2400 tttcctcaat tctaaggtgg aacccacctt gataacaaac agagtatcgc ttcacaaaaa 2460 agttgacttc aagattgact tttcaaactt tgtgtggaca ctatcaagac gtgaaaagaa 2520 caaagatgat ccgagtgttc ctggatggaa tggatgggta tcaaatgagt tgaacactgc 2580 cgaaacaacc tgcacaactg tagattacat ggaacctctg agcaaaccaa ttacagaact 2640 ttcaactgta caagaagtac tgaacatttc acaagaggca agcacagcag ttggtcaaaa 2700 gtataccttt atcacatttg atctggctgt agcaaagatg gcatattcac ttgtgtagca 2760 gaacaaggtt ctctataata acgtgatcat tcacctcggt gtgtttcaca tactttgtgc 2820 atatctgaaa gcaataggca aaatgatgtg tggaagtgga tttgaagaag tagttataga 2880 ttccaaaatc tgtgccagtg gatctattga aaaggtgatg aaagggaaac attacaacag 2940 atcactcagg gtgcacaaag ctgttttgga ggctctggag cgtctttcat ttattgcttt 3000 tcaacaacat ggacagtatg acttgttggt tgaaaaggca aaggaggaat tgaaagattt 3060 ttcttattcc aacaatgata ttaatgacga gcctatgaac tcagtgaaag aacttgctga 3120 gagctatttt cagttcaaac aggatgttcg tcacgggaaa ctgggcaaaa ctgctcagtt 3180 ttggattatg tacatggatg cagtgtggaa agttctcaac tgtcttacag caacaaagag 3240 aaatgacttt gacttgcaca taacttgtct tgaacagatg tgtccccttt tctttagcat 3300 ggaccacccc aattatgcta gatacctctc tgcatacatc atccttctac tgaacctgaa 3360 agattctcat cctggtgccg aggacctttt gcgatacaaa ggattcagtg tgcgtaggtc 3420 aagagtgtca ggtgcaagaa atgctgtgga cttaacaatt gagcaaacaa ttaaccgaca 3480 agccaaatgt aagggtggca ttgttggttt tagtcaaaat gttgctgcat accacaagtg 3540 gtgcatgacg agacataaaa gggcgagcat tgtcactgct ctgttggaag aatctggact 3600 ggacagtaag gatcaagaac acaaagactg ccaccaatca caaatgagac tatcagagaa 3660 aagtgtacaa agtgtccagc aatcttttga atcgtttatc aacccatttg acactgttgc 3720 aaatgacaaa ctgatttctc tgtcttcagg aatggaagcg acagtgaaag tagaagcaga 3780 tcttttgtca attgaaaaag atggaaaaca gctatatgaa aattttgtca atacaaccct 3840 cattgaacaa gctgagagtt tccatgcacc tctcaaaaga aacaggaaac tgacatttgc 3900 ttcacaacaa aagacagcaa agctgaagac aacaaagaaa aaggaggtta aattaactgc 3960 acaaagaaac atgtttgggc agcttctcat gctgtctaca gaaaatgatc tggaccttca 4020 agaagtgatg gattaccctc taggaccagt gccttgggcc ctcgccaccc cagatggatt 4080 tcctattaaa acaaacaaag cagttcttat gcataaactg gaagataagt cagctcttca 4140 aagcccatct caagaccatg aacacatcca tattatagat ggcaatgctt tttatcatac 4200 tctgcatccc tctgatttac cagatacttt tggcgagtta gccagtacaa tattttgtgc 4260 tctacccaaa gttaacaaag tccactttgt gacagacaac tacaaagaag attcaattaa 4320 atctttggaa cgaattcgca gaggagaatc tcagacattc acagtccgag gcttttctac 4380 aaaggttccc aaagatttca aaatgtttct tatgaataat gaaaacaaaa agcaactgac 4440 acaattctta ctttatgaat ggcaacaaga ttgttatgca atgatgttgc tgaacagaga 4500 aatatacttt gcatgtgacc aacagtgctt tgtcctcagt agcagtgatg gaaagactac 4560 agactcaaga cctgtcccag agttatcatc cagccatgag gaagcagaca cactactgat 4620 cgtgcatgcc gtcttctctg atcagaacat tgctactcca aacacagaca tcattattcg 4680 gtccccagac actgatgtct ttctactgat gattgccttt gttaaacatt tcacacaccg 4740 tttgtatttt gacacagggg ttgggaacaa aagaagatct ctacacattc agacgctctg 4800 tgacaaaatg gaaacacata tactggattc tattcttgga ctccatgcat tctctggatg 4860 tgatgtgact agcgcttttg tccagaaagg aaaggtgaag cccctcacca ttctccacaa 4920 gcacccagag tttgctgttt catttaagga gctaggaaca tctgaaactg tttcttcaga 4980 actgttctcc aatctggaaa agtttgtttg tcacctgtac ggaaaaccag cttacttcag 5040 cacaaacaag cttcgtcatg accttgtaag gctaaagtac ttggctaaag gtcaaagctt 5100 actgtcatgc ttcgatggat ttgacatcag cttgctacca ccgtgcagag aagccctgaa 5160 gctgcacatc ctacgtgtta actatcaaac tttaatatgg aagcaagcac atctggctga 5220 accattaata cctgatccgc aagatcatgg ttggaagaga ggtgacggtg gagtgctatc 5280 agtggagtgg tgcaaagact tggtccctca acaattggtt gatattttgt caggatcaga 5340 tacagagcaa gaggaagaca aaccggaaaa ctgtcctctc tattatgata acagcgatga 5400 agaagatgag atttcagatt ctggaagtga cagcagtgaa gatgaactca tgacatagaa 5460 agcatttaag taagacttta ctgtacttta catgaaacac caactgactt acaaactaca 5520 gtacaaaatt tggataatta gcactcaagc aactcacttc tgttacataa atgccctaag 5580 caagtcgaat ttctagatgt tattatcttt ttctcaatta tttcttttac gagttcactc 5640 aataatactt acttcaagta cacatatagc caatggctat aactgcattg ctaagggctg 5700 tctgtaactg tgcattgaac ttattttttt gttgaacaaa atgtttgcat gtgtagctct 5760 tcctatgttg tgtaataccc agtagggaat atactacgct aactttgtgc gactgttttc 5820 aatttttaga gaaaacggac tttgaaaatt atttttttct ctatacggta tgtaaataac 5880 taccacaaaa tggtctgtat acctgtttct ctttaaccta gggcaccaac tgaagtcata 5940 tttaatagaa agtcaaatta tctactcgaa aagtgaattt ttttaaaaaa gttaattttc 6000 acaattttcg acaattcagg ggcaataact ccgcagattg ccaacatttt gctgcaaact 6060 tttttttgta tgtagtcctg agtgtgctct ttcagaaaat catagtgagc aaaaatcccc 6120 agggggggga gggggggttc cagtagccca tagtcta 6157 // ID Copia-13_AA-LTR repbase; DNA; INV; 126 BP. XX AC AAGE02020441; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_AA_; KW Copia-13_AA-I; Copia-13_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-126 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020441; Positions 7855 7730. XX SQ Sequence 126 BP; 35 A; 21 C; 24 G; 46 T; 0 other; tgcgatcgta gtagcacaat aatacgcgtc gctatgtgtt ttgttattag ttgctaagat 60 agaataaaat tttcattcca ctgctagtaa tcagtcaagc tagacgtttt atttgattgc 120 tctgca 126 // ID BEL-21_AA-LTR repbase; DNA; INV; 690 BP. XX AC supercont1.128; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-21_AA_; KW BEL-21_AA-I; BEL-21_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-690 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.128; Positions 464578 465267. XX SQ Sequence 690 BP; 261 A; 119 C; 123 G; 187 T; 0 other; tgtcgcgacg agaccccccg ttgcagctca actgtggccg gtcaatgtgg ggatcacaac 60 tgtggttgac tgaacgatca tcaccacaaa gggagaatga taaaaacaat gaccaagcaa 120 ataagtctaa atacgtgtcc aaaaagtgaa catccaactt ataaaattaa gaaaaattag 180 gatctaaatt aaagctaaaa tttgaattta tatctaaata ctttcacgag taaatgcatt 240 gaatttagta aatagaattg cagtattgag gcggataaca ctgaattact taagctaata 300 gtacggagag gtatgacatg cagtttttat gatcttatgc tagaggcctt acatgaaatt 360 atacttcaaa tctaggtttt acgcgatcat cctaatctaa acgcatatta aacactaaac 420 ggtactagta aacaccaatc aaaccgtaag tagaggaatc gaagaagttt ttttagttaa 480 ttcaacatat cttatacata cagagtaatc caactatcaa cagcgtcgag tgtgataaga 540 tcctagattt cgaaggtcac caaaattgta agtagaagag taaggcaatt atgactatgg 600 agaaattaaa actgaattca tttttagcta aaagcttacc acaataaaac ctgaggtttg 660 ctttacgtga tttgaaagac ccccacaaca 690 // ID Transib-6_HM repbase; DNA; INV; 3706 BP. XX AC . XX DT 30-JAN-2008 (Rel. 13.01, Created) DT 30-JAN-2008 (Rel. 13.01, Last updated, Version 1) XX DE Autonomous Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-6_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3706 RA Kapitonov V.V. and Jurka J.; RT "Autonomous Transib transposons from the hydra genome."; RL Repbase Reports 8(1), 6-6 (2008). XX DR [1] (Consensus) XX CC Transib-6_HM is a very young family of autonomous Transib DNA CC transposons that may be still active in the hydra genome (copies CC are ~0.1% divergent from their consensus sequence). The consensus CC sequence was obtained based on a multiple alignment of 10 copies; CC it codes for a 648-aa Transib transposase. Like other Transib CC transposons, Transib-6_HM is characterized by 5-bp target site CC duplications and short terminal inverted repeats. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1043..2986 FT /product="Transib-6_HMp" FT /note="Transib transposase." FT /translation="MNRRQLYIYIKETDCNKNDLLCKLVEYVENVSPNTFL FT DIKPTLSSFVHQYKRKMTTVKREYTQFENKYATWLDAIISFKLIKAVTPEK FT NYGGRPETPFNKSSIRSKDRKVSNIIHMYSTNEILYAASRSLKKDGRNAEA FT ISVKSILYTPQASLFSDKALALFLDANLSKATYQLLRNNALDLDSAIYPTY FT HDLKSAKDRCYPDDVIISDYSAEIPLQSLLQHTSQRLCDSQIEVLTADSLK FT IFKKLTLRSKIGFDGATGLSVYKQSSTDEAGRSLANEVSLFITCLVPLDLY FT FFTDAKRQLVWRNHKPSSPIFCRPVRFKYIKETKEVVLEEFESIKNDIKTL FT PVSKVIIAGQTLDVHHIIDITMIDGKVQTIISTATTSTQCCSVCGISPKFM FT NDLPIVLKKEVKNENLQNGISTLHAWIRLFECLLHIAYKIPIQKWQARGVD FT DKAIVATRKKKIQVAFQNEMGLVVDQPCSGGAGTSNDGNTARRVFQQAEKS FT AEIMGVSPVLIRRLHSILSVMSSGFDIEPNKFKKYCFETAELYTQLYIWYP FT MSQTLHKVLIHGHLVIEFFSLPLGMMSEEAQEASNKYFKKYRECFSRKCDR FT KKTNLDVFHRLLCHSDPFIAQYRKTNSTRKAILPADAIALLKEPSIICSL" XX SQ Sequence 3706 BP; 1275 A; 560 C; 563 G; 1308 T; 0 other; cacagtgggt cagaagtcgg caaaacctaa aacataaagt ttaaaaaaac tttttttttt 60 ttttttttaa tataggttaa ataagtctaa tatacgaaaa taatattatt tatgataata 120 gattatgatt aaatagaaat acttggccgc aaactactcc agttccgtca gcaagcaatg 180 ttgaaaatta ttcattattt tttctaaata ttgccgttta tttttacatg cttcctgctt 240 taaaaggtag gaaaatttta tgctatattt tttgaatttg tatatagatg tataatttat 300 cctcaaaata tcattataaa aaatataaca gtattttttt atgcctaact gtacaggctg 360 ttttttaaat tatgtaaatt ctttttttct ttgttggtgt ttaaatagta gtgtattaag 420 caatattaaa gatatgttgc agctgttttc ataagtactt gatgtgtact actattatca 480 agcaaaatat tttgccagtt tttgccagtc tagagtgagc aatgatttta atatgtaata 540 attgtaattt ttataaaccg gccgtgttgt tatctcatta atataatttt tatagtccgg 600 acggactata ctatttattg gcgaatattt aagagaacat aataataaga aagaaaatat 660 aataataata ataataaaaa aatattatta gacaatatat tccttttgta tatatctgtt 720 catattgggc ataaaatatt acagtaggtt gcctagtttt taacaaacac atgctcatta 780 ttttgtttgt acatttaatc tactctctct ctctttccct ctctcactct ctctctctct 840 ctctctctct ctctctctct ctatatatat atatatatat atatatatat atatatatat 900 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 960 atatacatat atatacacac attattatat atattattat attatctatt cttatattaa 1020 tagatgttct ttttaatttt agatgaatag aagacagtta tacatttata ttaaagaaac 1080 tgactgtaat aaaaatgatt tactttgtaa actggttgag tatgtggaaa atgtttcacc 1140 aaacacattt ttagatataa agccaacatt atcatcattt gttcatcaat acaaacgtaa 1200 aatgacaact gtaaaaaggg agtatactca atttgaaaat aaatatgcta cttggctaga 1260 tgctattatt tcttttaaac taataaaagc tgttactcct gaaaaaaatt atggtggaag 1320 accagaaaca ccttttaaca aatcttcaat acgatcaaag gacagaaaag tttctaacat 1380 tatacatatg tactctacta acgaaatatt atatgcagct tcaaggtcat taaaaaaaga 1440 tgggagaaat gcagaggcta tcagtgtcaa aagtatttta tatacccctc aagcaagtct 1500 tttttctgac aaagcactag cactcttctt agatgctaat ctgtcaaagg caacatacca 1560 acttctccgc aacaatgctt tggatctaga ttcagctatt tatccaacat accatgattt 1620 aaagagtgca aaagacagat gttatccaga tgatgttata atctctgatt actcagcaga 1680 aattccgttg cagtctctac ttcagcacac atcacaacga ctttgtgatt ctcagataga 1740 agtgctgact gctgattccc ttaaaatctt taaaaaactt acacttagat caaaaatagg 1800 atttgatggt gcaactggtc tgtcagtgta taagcagtcg agtacagatg aggctggcag 1860 gagtttagcc aatgaagtat ctttatttat tacatgctta gtgccattag acttatactt 1920 ttttacagat gccaagcggc agcttgtttg gaggaatcat aaaccttcgt cgcctatttt 1980 ttgcagacct gtccggttta aatacatcaa agaaacaaag gaagtagtct tagaagaatt 2040 tgagtcaatt aaaaatgaca tcaaaacact cccagtttct aaagtgataa ttgctggaca 2100 gacattagat gtacatcata tcatagatat cactatgatt gatgggaaag tgcagaccat 2160 aatttccact gctacaactt ccactcagtg ttgcagtgtt tgtggcatct caccaaagtt 2220 catgaatgat ctgccaatag ttttaaaaaa agaagtgaaa aatgagaatc tgcagaatgg 2280 catctcaaca cttcatgcct ggattaggct ttttgagtgt ttactacata ttgcatacaa 2340 gattccaatt cagaaatggc aagcaagagg tgtagatgat aaagctatag ttgccaccag 2400 aaagaagaaa atacaagttg cattccaaaa tgagatgggt cttgttgtag accagccatg 2460 ctctgggggt gctggtacct caaatgatgg gaacactgca cgtcgtgtgt ttcagcaggc 2520 tgaaaagtct gctgaaataa tgggtgttag tcctgttctt atcagaagac tacattcaat 2580 attatcagtg atgtcttctg gatttgatat tgaacctaac aaatttaaaa aatattgttt 2640 tgagactgca gagttgtata cacagttata tatatggtat cctatgtcac agactttaca 2700 taaagttctg attcatggac atctagtgat agaatttttt agcttgcctc ttggcatgat 2760 gtctgaggaa gcacaagaag cctcaaataa gtactttaaa aaatacaggg agtgcttctc 2820 tagaaaatgt gacagaaaaa aaaccaattt ggacgtgttt catcgccttt tatgccattc 2880 agatcctttt attgctcaat accgtaaaac aaacagtaca agaaaagcta ttctgcctgc 2940 cgatgcaatt gctctgttga aagagcccag catcatttgc tctttgtgag tttttgatga 3000 tatcttagaa ataaatgttt ttgccaatga atgtacgtgt ttagcaataa atgtttaatt 3060 atatataaga tttttttttt caatataaat aaaaacaaca tcttaatata ttaatatagt 3120 ttttaatcat agttttattt gttttaattt attatgaggt tttggctaga aattaactat 3180 tttaacccat tatagttttt tattattgtg catagtaata ataaaggttc taccatctag 3240 tgttgctaga ttattgcaag aatctagaat agtatacttt ttatactatt ctagattcaa 3300 aaaaaacttt aaataatatt tttgtttcta attttattgc cgaattaagg gaggggggct 3360 atggccccgg ggccttaaat tttaggggcc ccgtaacatt caagaagacc ttccccttac 3420 aaatcgtgct atataaattt ggctaataaa gtactgcaga aaaaaatagc cccgggtccc 3480 gaatgttctt aatccgcctc tggctagttt acggaatttt ttttataggc taaaaaattg 3540 gaatttttca aaattatgca ataaaaatcc aagtttttat aaaactaaaa tttttttttg 3600 cataatttgt ttttcaatat aatacaccgt ttgtaataaa aaaaaaaaaa aattaaaata 3660 tattttagag gttttggctg aaaattggca cctctgaccc actgtg 3706 // ID Copia-100_AA-LTR repbase; DNA; INV; 243 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-100_AA_; KW Copia-100_AA-I; Copia-100_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-243 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 243 BP; 58 A; 64 C; 46 G; 75 T; 0 other; tgttaggagt acaacgtgtc atccctctac accgggcagg gcatctgcca taccagagca 60 taaaccacat tgtgtggtat acctttcagt atgatatact ttgattgttg ttatctaata 120 aaaacgccat tcgagagtac cgcgtgcgtt catattttgg tccgattatt ttccgagatt 180 ttcccgtcga tttccttcct accggtcgcg taatccactg aatcctgttc cgccaaaccc 240 aca 243 // ID Crack-7_AAe repbase; DNA; INV; 4626 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4626 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1223-1223 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >97% CC identity. CC Closely related to Crack elements in Culex pipiens (Crack-1_CP CC to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 395..1393 FT /product="Crack-7_AAe_1p" FT /translation="MESESFTCKICERTETASDTNPVVTCDCCLNTFHVKC FT KKISKAALNSLSGAPYFCSPKCSEIFRRFTAFTSNKKLCEADKEYITATIK FT DALAEKLDSVIDEIRNLQIGQDVIAKEQDDLKERCSHLENAVSDLEDEVEF FT LHRDRISNNVMFFGVPLMASENTTDVVQAIINRLTYAQPQLITRNDILSVF FT RFPQKSKNSSAPPIRVVFKSCDCKGQFMQTARQIQNLASTDVDATWTVNGA FT GSKIVIREEATKLSQHIYKQLKSKQKDLGLQYVWIGRNSVVLVKKDDSSST FT AKIKNWNDLDKLQGSLLKETNKRTASEISPQQDHIPKKYCS" FT CDS 1489..4398 FT /product="Crack-7_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MNSKIFNFKFSDLDEFNLNTNFNALSNKENSMVILQQ FT NMQGMNNFSKFDNFSLFLNELQCNIDIIVLTETWINDENEKYYKIPGFNKV FT NSSRVDSRGGGLMIFFHSRYEIDVIDVDNSRFSFQFIHIRIKLSHAEHFHV FT CAVYRPPATNMNEFRIFLESIMEENVRLPTIILGDINIAVNKSNTIVTDYK FT NLLSSCSFEVTNTFETRPASQNILDHVICNSSNFIEIQNFTIFNDLSDHLP FT VLSKFTLKTKLQIVKLTKNLLNYRTFHREFLVFLENLSFDASPNEILIDIM FT NKYNELRSIHTRTIELQARVKDNAAPWITVDVWKLMQIKNKLFRKLKRCPN FT DNHTKDLLKHVNKKLHFAKNIAKSNFYKALFSDGDSKKKWNNINKLLGRGG FT KNKQHLSLTTNGLLINDPLQVATEFNNYFSRIGSNLANNLRSERNYNKYRT FT ITPVQQSIFLDPTSVVEIKKLINDLSPNKCAGPDNVSVSDVKKYSEIFAPI FT LTRIFNEVLHSGEYPSCLKLSRVIPVFKGGDPQDSSNYRPISTISVFSKIL FT EKLLTTRITSFMNHNNVFYCNQFGFRKGSSTETACIESVDDILRAFDSKNI FT VGGLFIDLRKAFDTLDHNILKSKLGLLGIRGIASDLLNSYLSSRSQYVEIL FT GQRSSACPVTVGVPQGSSLGPLLFLLYINDMGNLPLKGKLRLFADDAASFY FT VGRNVSDIRGLMQADLNILNDFFKQNVLSMNVSKTKSVVFKSQHNRISTDL FT QLHLEQIPVEQVKEYKYLGLILDETLSWRAHIEHLMKKLSPLCGILRKISY FT FVPVQVLLKIYFAHLHSRLQYLLSIWGSAPKTTIRKLQIIQNKALRNVYKL FT PYSFSKSQLYMEKAKNILPLLGLHEYKQIVYIYKITKITGTHSNQIIPRPN FT HSHYTRQSNDFQRHRIRTEYGRKRITYAGITLYNNLPDDIKNLPNMGLFTK FT RLKQYLSENLMRYIQ" XX SQ Sequence 4626 BP; 1600 A; 807 C; 821 G; 1397 T; 1 other; cactgttaag tgaacaagtc acagtaaaac ggtggataga ttttatcaaa aattcaacgt 60 ttttcgaagt gaatactctt caaactacct gcaaaagttt aatttgccag actggtttgg 120 cttaaaagtg aaatcaacgt ttgatgttca ggcaaagtgt tgcaatcatc atttaatcaa 180 tagactaaat gagtagcttc tatataatgt gctagagatg ttaacagtta tgatgatgta 240 aaactgtgaa caacctgatc caccgttagc accaatagct cgacacgggc cgataagatc 300 gtttctactt attgagatca ctacctaaac tgttgttgtt gttgctgtcg ctggggtttc 360 tgctgttgtt gtacgtgttt gctgctggga agtcatggaa tcggaatcgt tcacctgtaa 420 gatctgcgag aggactgaaa cagcctctga caccaaccct gtggtcacat gtgactgttg 480 tctcaacacc ttccatgtga aatgcaaaaa aatcagcaag gctgcgctga atagcctgtc 540 tggagcgccg tacttttgtt ccccgaagtg cagtgaaata tttcgacggt tcacagcttt 600 cacctctaac aaaaagttgt gtgaagccga caaggaatac atcacagcga ctataaaaga 660 tgctctggct gaaaaactcg attctgttat tgacgaaata cgcaatctcc aaatcggtca 720 ggatgtaata gcaaaggaac aggatgacct caaggaacgt tgttcgcatc tggagaatgc 780 agtcagcgat ctggaagatg aagtggaatt tttgcatcga gataggatat caaataacgt 840 catgttcttc ggagtcccat tgatggccag cgaaaacacc actgacgttg ttcaagcaat 900 cattaaccga ttgacctatg cgcaaccaca gttaatcacc agaaacgata tcctctccgt 960 gtttcgtttt cctcagaaaa gcaaaaactc tagtgcacct cctatcagag tcgtctttaa 1020 aagctgtgac tgtaaagggc aattcatgca aactgcgcgt caaattcaga acttggcgtc 1080 aactgacgtg gatgcaactt ggacagttaa tggggcagga tccaaaattg ttattcgaga 1140 ggaagcaaca aaactaagtc aacatatcta caagcagcta aaatccaagc aaaaggatct 1200 gggtcttcag tacgtttgga ttggtcgcaa cagcgttgta ctggttaaga aagatgacag 1260 ctcatctact gcgaaaatca aaaactggaa tgacctcgat aaacttcagg gctcactact 1320 gaaagaaaca aacaaaagaa ctgcaagtga gatcagtcct cagcaagatc atattccaaa 1380 gaaatattgc tcataaatga taattgtctt tctcaagatg tctattcaca tttacttagt 1440 tttagagatt tcggtaggat taggtttgaa ctagaaaata tcttacttat gaattcgaaa 1500 atatttaatt ttaaattttc agatctagat gaatttaatt taaataccaa tttcaatgca 1560 ttatctaaca aggaaaattc tatggtaatt cttcaacaaa atatgcaagg aatgaataat 1620 ttctcaaagt tcgataactt ctctctgttt cttaatgaat tacagtgcaa tattgatatt 1680 atagtattaa ctgaaacatg gattaatgat gaaaatgaaa aatattataa aatccctggt 1740 tttaataaag ttaattcaag tagggtagat agtcgaggtg gtgggctaat gatatttttc 1800 cattcaagat atgaaattga tgtgattgat gtagataact ctcgatttag ttttcagttc 1860 attcatatcc gcattaaatt atcacatgct gaacatttcc atgtttgtgc cgtatatcgt 1920 ccaccagcta caaatatgaa tgaatttcga atatttttgg aatctattat ggaagaaaat 1980 gttagattgc ctaccattat actcggtgat ataaatattg cagttaataa atcgaataca 2040 attgttactg attacaaaaa tctactatca tcttgctcgt ttgaagtgac caatacattt 2100 gaaactagac cggccagcca aaatatctta gatcatgtta tctgtaattc tagtaatttt 2160 attgaaatac aaaatttcac aatattcaat gatttaagtg atcatttacc tgttttatca 2220 aagtttacat tgaaaaccaa attacaaatt gttaaattaa ctaaaaattt gctgaactat 2280 agaacttttc atagggagtt cctcgtattt ttagaaaatt tgtcattcga cgcaagcccg 2340 aatgaaatac tcatagatat catgaataag tataatgaat tgcgtagtat acatactaga 2400 actatcgaat tgcaagctag agttaaggat aatgctgctc catggattac agttgatgtt 2460 tggaagctaa tgcaaataaa aaataaactt tttcgaaagc tgaaacgttg tccgaacgat 2520 aaccacacaa aggatcttct taaacacgta aacaaaaaat tgcatttcgc aaaaaacata 2580 gcaaaatcta atttttataa agcattgttt agtgatggtg attcaaaaaa aaagtggaat 2640 aatataaaca aattgcttgg taggggagga aaaaacaagc aacatctgtc gcttactacc 2700 aatggcttat tgataaatga ccctctccaa gttgcaacag aatttaataa ctatttctct 2760 agaatcggat ctaatttagc aaacaattta cgtagcgaaa gaaattataa caaatatcga 2820 acaatcacgc cagttcagca atcaatattt ttagatccaa cttctgtggt tgagattaag 2880 aaattgatca acgatttatc accaaacaaa tgtgcaggcc ctgacaacgt ttccgtgtct 2940 gatgtcaaga aatattctga aatatttgca ccaattttga caagaatctt taatgaagtt 3000 ctacacagtg gagagtatcc cagttgttta aaactttcaa gagtgattcc cgtgttcaag 3060 ggaggagacc cacaggactc cagtaattac agaccaattt ctaccatttc agtatttagt 3120 aaaatcctag aaaaactttt aactacacga ataaccagtt tcatgaatca caacaacgta 3180 ttttactgca atcaatttgg gtttaggaaa ggctctagta ccgaaaccgc ttgcatagaa 3240 tcagttgacg atatccttcg tgcttttgac tctaaaaata ttgtaggtgg gttgttcatt 3300 gatctacgca aagcatttga cacgcttgac cataacattc tcaaatctaa actaggactg 3360 ctgggaatcc gcggaatagc tagcgatctg ttgaatagct atttaagctc taggtcacag 3420 tatgttgaaa ttcttggcca gcgtagttca gcatgtccag taacagttgg tgtacctcag 3480 ggaagcagct taggcccact gttgtttctg ttgtatatta acgacatggg caatcttcct 3540 ttaaaaggga aactgagatt atttgccgat gatgctgcgt ccttctatgt tggtagaaat 3600 gtttcagata taaggggtct aatgcaggca gatttgaata ttcttaatga cttttttaaa 3660 caaaatgtcc tatcaatgaa tgtttctaaa acaaaatcag ttgtattcaa atctcaacat 3720 aatcgcattt ccactgattt acagttgcat ttagaacaga taccagtaga acaagttaaa 3780 gaatataaat atcttgggct aattttggac gaaacgttgt catggcgagc acacatagaa 3840 cacctaatga aaaagttgtc cccattatgc ggcattctgc gtaaaatctc gtattttgtt 3900 ccagtacaag ttttgttgaa gatatatttt gcmcacttac actctagatt acaatattta 3960 cttagtattt ggggatctgc tccaaaaaca actatacgca aacttcaaat tattcaaaac 4020 aaggcgttaa gaaatgttta taaattacct tacagctttt caaaatctca actgtacatg 4080 gaaaaagcaa aaaatatact ccctttgctc gggttacacg aatacaaaca aattgtttac 4140 atttataaaa ttactaaaat tacaggaact cattcgaatc aaataattcc taggccaaat 4200 catagccatt atactaggca gtctaacgac ttccaaaggc atagaattag aacagaatat 4260 ggaaggaaaa gaattacgta tgctggaatt accctatata ataatttgcc ggatgatata 4320 aaaaatttgc caaatatggg attgtttaca aaaagattaa aacaatactt aagtgaaaat 4380 ttaatgcggt acattcaata atttattaaa gattaacaaa taagttctaa ttacagtgtt 4440 gctacatatt aaagaaatca ttagcctacc cttcaaagaa caaaatgttc attgggttag 4500 ctagtaggaa taaaattctt tgtatgtatt attgaaaaga caagcaggtt ttgtgcctca 4560 aggagaagta acatctgaga cagttacact ccttggggtt tttccctgct tcaataaata 4620 aataaa 4626 // ID Penelope-13_HM repbase; DNA; INV; 3045 BP. XX AC . XX DT 14-SEP-2009 (Rel. 14.09, Created) DT 14-SEP-2009 (Rel. 14.09, Last updated, Version 2) XX DE Non-LTR retrotransposon: consensus. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-13_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3045 RA Jurka J.; RT "Non-LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(9), 1937-1937 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1527..2711 FT /product="Penelope-13_HM_2p" FT /translation="MGAFDGAEVCELVGIFLLFEISKFYNKFEVGLYRDDG FT LAVFRNKSGPQMEKIKKHLSEIFKYNGLRISIQCNMKIVNYLDVTLNLSEC FT TFKPYLKPDNTLNYIHADSNHPPSILKQIPHSVELRLSTNSSTKEIFQXAV FT PLYNEALLKSGFKCNLTYNPKAISTKKRNRARNIIWFNPPFSKNVSTNVGK FT CFLKLIGKHFPNNNKLHKIFNRNTVKVSYSCMPSVKSIINSHNKYILYNDT FT NSNQENCNCLDKNFCPLSNRCLTSNIVYQATVTTTSDSSPNEKSYIGVSET FT SFKLRYANHVKSFNIPKYKNDTELSKQVWKIKEANLVPIVKWKILRRCKAY FT NPTSKSCKLCLTEKYEILNYKNTNLLNKRSEIVSKCRHRNRFLLSQFDTGD FT *" FT CDS 2258..2779 FT /product="Penelope-13_HM_1p" FT /translation="MPSVKSIINSHNKYILYNDTNSNQENCNCLDKNFCPL FT SNRCLTSNIVYQATVTTTSDSSPNEKSYIGVSETSFKLRYANHVKSFNIPK FT YKNDTELSKQVWKIKEANLVPIVKWKILRRCKAYNPTSKSCKLCLTEKYEI FT LNYKNTNLLNKRSEIVSKCRHRNRFLLSQFDTGD*" XX SQ Sequence 3045 BP; 1109 A; 455 C; 427 G; 1053 T; 1 other; ataaataaaa ggttagatga gaaaaaacaa cttttttacg gtacttaaag tttcatgccg 60 ttttcggcac tcatcagcca tatatttaaa tcaagaaaaa accgttcaaa aatacaatta 120 aaaaaccgtt acaaaattca aaaaaaacaa tggaatgagt tctgacgtca aataattata 180 atacaaataa tagcaataat aataatgata ataataataa taatataata ataataataa 240 taataataat aataataata ataacaataa taataataac aataataata acaataataa 300 taatattatg gaaaaaacat cttttctaat cgccagtgtc aaattgagaa aggaggaatc 360 tgtttctatg tctgcactta gaaacaatct cactcctttt atttagtaga ttagtatttt 420 tataatttag gatttcatat ttttcggtga gacataattt acaagatttg gatgtaggat 480 tatatgcttt acatcgcctg agaattttcc acttaactat aggaactaaa ttggcttctt 540 ttattttcca tacctgcttt gacagctcag tatcattttt gtattttggg atgttaaaag 600 atttaacgtg gttagcgtat ctcaatttaa aagaagtctc acttacaccg atgtaggatt 660 tttcgttagg agatgaatca ctagtggtgg ttacagttgc ttggtataca atgttgcttg 720 ttaaacatct attggacaac gggcagaaat ttttatctaa gcagttacag ttttcttgat 780 tagagtttgt atcgttgtat aaaatgtatt tgttgtgcga atttatgatc gatttcacac 840 tcggcataca actgtagcta actttgactg tgttcctgtt aaagatttta tgtagtttgt 900 tattattagg aaaatgtttg ccaattaatt tcaaaaaaca ttttcccacg ttggtgctag 960 ctttttgaga acgaactcat tgatcttgta aaacacatta agtttcgaaa ggtacaaagc 1020 aattttcaaa ataaactaaa gcaagcatag aaggtaaacg tctttcgaaa aatcatgaag 1080 ttttcgataa aattgatatt aatacagctt ccaattgttt tatcacctta aaagaccata 1140 aagataattt tgaaaacaat ccgactgtcc gtttattgaa tcctgctaaa aatgaggttg 1200 gtagaattag caaggatgtt ctttctaaaa ttaatacaga cctaagaagt atattacaat 1260 taaaccaatg gaaaaacact cgaaatgtta tcgattggtt taaagccata aaagacaaac 1320 acctttataa atttttaatt tttgatatca ttgacttcta cccttctatt agtgaaacac 1380 ttctaaaaaa ctctatcaaa tttgcagagc aacacttaac actaaataaa gagaatatat 1440 cgttaatttt tcatgcaaga aagtctttgc ttttcaataa taatcaggtt tggataaagc 1500 aatcaagtgg tttatttgat gtttccatgg gagcttttga cggcgcggag gtttgtgagc 1560 ttgtaggaat atttcttctt ttcgaaattt caaaatttta taacaagttt gaggttggtt 1620 tataccgtga cgatggatta gcagttttta ggaataaaag cggccctcaa atggaaaaaa 1680 taaaaaagca tctttccgaa atatttaaat acaatggctt acggatttct atacaatgca 1740 acatgaaaat tgtcaactac ctcgacgtca ctctaaatct tagtgagtgc acctttaaac 1800 cttatctcaa acctgataat acattaaatt atatccatgc tgattcaaat caccctccaa 1860 gtatactaaa acaaatacca cactcggttg aacttagatt atctacaaat tcttctacca 1920 aagaaatatt tcaacangct gttcctctat ataatgaggc tttattaaag tctggtttta 1980 aatgtaatct aacatataat cctaaagcaa tatcaactaa aaaacggaac agagcacgca 2040 acatcatttg gtttaatcct ccatttagca aaaatgttag caccaacgtg ggaaaatgtt 2100 ttttgaaatt aattggcaaa cattttccta ataataacaa actacataaa atctttaaca 2160 ggaacacagt caaagttagc tacagttgta tgccaagtgt gaaatcgatc ataaattcgc 2220 acaacaaata cattttatac aacgatacaa actctaatca agaaaactgt aactgcttag 2280 ataaaaattt ctgcccgttg tccaatagat gtttaacaag caacattgta taccaagcaa 2340 ctgtaaccac cactagtgat tcatctccta acgaaaaatc ctacatcggt gtaagtgaga 2400 cttcttttaa attgagatac gctaaccacg ttaaatcttt taacatccca aaatacaaaa 2460 atgatactga gctgtcaaag caggtatgga aaataaaaga agccaattta gttcctatag 2520 ttaagtggaa aattctcagg cgatgtaaag catataatcc tacatccaaa tcttgtaaat 2580 tatgtctcac cgaaaaatat gaaatcctaa attataaaaa tactaatcta ctaaataaaa 2640 ggagtgagat tgtttctaag tgcagacata gaaacagatt cctcctttct caatttgaca 2700 ctggcgatta gaaaagatgt tttttccata atattattat tattgttatt attattgtta 2760 ttattattat tgctattatt attattatta ttattattat tattattatt gttattatta 2820 tattattatt attattatca ttattattat tgctattatt tgtattataa ttatttgacg 2880 tcagaactca ttccattgtt ttttttgaat tttgtaacgg ttttttaatt gtatttttga 2940 acggtttttt cttgatttaa atatatggct gatgagtgcc gaaaacggca tgaaacttta 3000 agtaccgtaa aaaagttgtt ttttctcatc taacctttta tttat 3045 // ID P-1_HM repbase; DNA; INV; 3369 BP. XX AC . XX DT 15-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3369 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 346-346 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 400..2961 FT /product="P-1_HM_1p" FT /translation="MVNKCVVTNCKTGYSTGPKKSTFHFPEESSLRERWIY FT FVNRKDWLPSKYSAICIDHFEDKFIKYGKRCTMKWDLQPVPTIHTDKKSSS FT STLRVPKLPRKEPTLRYLGKDEFSDFQNIDKIISLNSLKEQHCPPGFTFKK FT LHDSVVFYKLCFDEISGIPTVFESITVNKDLNVSLSYKGYHISLPEWFRSG FT HNCKLTNCSMIENFPAHIRNKANAFNSILTELNEIRYYSPQGRPPYSASVI FT RYALILRHTSAQSYKLLLEQLPLPSFSLLRKIQSGDINAFKAIKVLLQKNC FT VSSDCVLLVDEMYLQKAAQYQSGKYVGEDSESNLYKGIVVFMIVGLQKSIP FT YVVRSSPEVYITGSWLKKEIDECIISLHQSGFKVRAVVADNHSTNVSAFSE FT FHKAYNGDGKLFICHPVYNGVLKTYLLFDIIHLIKNVRNNLLNAKKYVFPS FT FSFDLFKDKVEVPAGYIAWSLFHKIYEKDQLLNGNLKKARKINYQVTHPGN FT NKQDVNLALAIFDETTTAAILSYFPERNDAAQFLSLFHKLFITLNSKQKFN FT TSNQLGNAAVKGDNKPTFYREIANWIDIWSKCDYFTFSKQTSLALITTLRA FT TASLIDDLLEEDYTYVLTSRLQSDPLELRFSKYRQMSGGRFLVSLLEVCNS FT EKILAVRSLLKEDINFWEENIYQTFDNSSLENIMQEIDLLSTEILECQLSE FT DSMEVAVTIAGYVAKKLKKKFGCHTCNQKMVSTDQDILNNEYLKILSRGGL FT ICPSQPLSDFICHLFSILDVISPILIKHCSYSFSIKSFAEQIFKNYYSSVD FT FTCDNHQELGIKYGSRIVINIFYNNLQKETTDSVRKDQVKQFKKRQRTNNN FT S" XX SQ Sequence 3369 BP; 1197 A; 470 C; 533 G; 1169 T; 0 other; atcatggcct actataatta cacggcctga tttgttgcat tttccgtaaa aaattaagag 60 gccagttttt ttacggctta ttttctatta taagacaact aacaatattt aataattaaa 120 ataatgtgat tattacaatt attatatcaa agtttggttt atattttcat attcatgtag 180 tatatatttc tgattatatt ctgactgatt gtctgtttca ttataatcca ttagagagtg 240 gaataataaa atagacttac agacagtcta cttggtatat tgtatacagt cgaactgctt 300 ttttttatgc aatgttattc aagtggagat aacttcaatt ataaatgtaa gaaatctcaa 360 tattattgta gtttttgaaa ttttttttta gttataaaaa tggttaacaa atgtgttgtc 420 acaaactgta aaactggtta ctctactggt ccaaaaaagt ctacatttca ttttcctgaa 480 gaaagcagtt tgcgagaacg atggatatat tttgtcaata gaaaagattg gcttccatct 540 aaatactctg ctatttgtat tgatcatttc gaggataaat ttattaaata tgggaaaaga 600 tgcaccatga aatgggatct ccaacctgtt ccaactattc acacagataa aaaaagtagt 660 tcatcaactt taagggtgcc aaagttgcct agaaaagaac caactttaag atacttggga 720 aaagatgagt tcagtgattt tcaaaatatt gataaaataa ttagtcttaa ttcgttaaaa 780 gaacagcatt gcccaccagg atttacattt aagaaacttc atgatagtgt tgttttctat 840 aaactctgtt ttgatgaaat atcaggtata ccaactgtgt ttgagtcaat aacagttaat 900 aaagatctta atgtttcatt gtcgtataaa ggatatcata tttctttacc tgaatggttt 960 cgcagtggtc ataattgcaa attaacaaac tgtagtatga tagaaaattt tcctgcccat 1020 attcgaaata aagctaatgc ttttaattca attttgacag agctaaatga aataagatac 1080 tattctccac aaggcagacc tccatattct gcttccgtta tacgttatgc attaatttta 1140 cgtcacactt cagcacagtc atataagttg ttgttagaac aattaccctt accatctttt 1200 agtttattaa gaaaaattca aagtggtgat ataaatgcat ttaaggctat aaaagttttg 1260 ttacaaaaaa attgtgtatc aagtgactgt gtgcttcttg tggatgaaat gtatttgcaa 1320 aaagcagccc aatatcaaag tgggaaatat gttggtgaag attcagaaag taatctttat 1380 aaaggtattg ttgtttttat gattgttggg cttcaaaaat ctattcctta tgttgtgaga 1440 tcatcacctg aagtgtatat aacaggttca tggttaaaaa aagaaattga tgaatgcatt 1500 atctcattgc accaatctgg atttaaagta agagctgttg ttgcagataa tcattctacc 1560 aatgttagtg cattttcaga gtttcataaa gcatataatg gagatggaaa gttatttata 1620 tgtcatccag tttacaatgg agtattaaaa acatacttgt tgttcgatat aatccattta 1680 ataaaaaatg ttagaaataa cttgttaaat gcaaaaaaat atgtttttcc ttctttttca 1740 tttgatttgt ttaaagacaa agttgaagtt cctgctggtt acatagcatg gagtttattt 1800 cataaaatct atgaaaaaga tcagttatta aacgggaact taaaaaaagc gagaaaaatc 1860 aattatcagg tcactcatcc tggaaataac aaacaagatg tcaaccttgc attggcaata 1920 tttgatgaaa caactacagc tgcaatacta agttattttc ctgaaagaaa tgatgcagca 1980 caatttttat ccttgtttca taagttgttc atcacactta attcaaagca gaaatttaat 2040 acatccaatc aacttggcaa tgctgccgtg aagggtgata acaagcccac attttataga 2100 gaaattgcaa attggattga tatttggtca aaatgtgatt attttacatt ttccaaacaa 2160 acttctcttg ctcttattac aacattacgg gcaactgcat ctcttatcga tgatttgctt 2220 gaagaagact atacatatgt tcttacttcc aggttacaaa gtgatccttt ggagctacgt 2280 tttagtaagt atagacaaat gagtggtggc agatttctag tgagtctttt agaagtatgc 2340 aatagtgaaa aaattttggc agtaagaagt cttctaaagg aagatattaa tttctgggaa 2400 gaaaatattt atcagacatt tgataattct agtttagaaa atattatgca agaaattgat 2460 ttgttgtcta cagagatttt agagtgtcag ctttctgaag atagcatgga agttgctgtt 2520 acaatagcag ggtatgtcgc aaaaaagttg aaaaaaaaat ttggatgtca tacttgtaat 2580 caaaaaatgg tttctacaga ccaagatatt cttaataatg aatacttgaa aatattatcc 2640 agaggtggac ttatatgtcc aagtcaacct ctgtctgatt ttatttgtca tctttttagt 2700 attctagatg tcatatctcc tattttaatc aaacattgca gttattcttt ttcaattaaa 2760 agttttgcag aacaaatttt taaaaattat tacagcagtg ttgattttac ttgtgacaat 2820 caccaagagt tgggtataaa atatggatct cgtatcgtta taaatatctt ttataacaat 2880 ttgcaaaaag aaacaactga ttcagtaaga aaagatcagg tgaaacaatt taaaaaaaga 2940 cagagaacca ataataattc ctaataaaaa gctatttctt ttatactgtt tgattcaatt 3000 ataataaaaa caatataaca aacctttgat tttatttcca ctaaagattt gtagatggtc 3060 tttgcaacat gtaatgttat tttaagtacc atgtttatat atagctatct gaaagcctat 3120 ttcagtattc tgttttactg gaataaactg aagcaggcaa tcagttagcg aaaaatccag 3180 tataacataa tgtaactaca ctaagaaaat ccgagtcatt tgttccaatt ttaataaaat 3240 atgaataaaa ctttcacaat gctaacagaa gaactcaagc tttcaaacgt tagccgtaaa 3300 aaaactggcc tcttaatttt ttacggaaaa tgcaacaaat caggccgtgt aattatagta 3360 ggccatgat 3369 // ID TRE3C repbase; DNA; INV; 4751 BP. XX AC AF134171; XX DT 24-APR-2000 (Rel. 5.03, Created) DT 16-AUG-2009 (Rel. 14.09, Last updated, Version 2) XX DE TRE3C is a non-LTR retrotransposon - a consensus sequence. XX KW L1; Non-LTR Retrotransposon; Transposable Element; ORF1; ORF2; KW LINE; TRE3; TRE3C. XX NM TRE3C. XX OS Dictyostelium discoideum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. XX RN [1] RP 1-4751 RA Szafranski K., Glockner G., Dingermann T., Dannat K., Noegel A.A., RA Eichinger L., Rosenthal A. and Winckler T.; RT "Non-LTR retrotransposons with unique integration preferences RT downstream of Dictyostelium discoideum tRNA genes."; RL Mol. Gen. Genet 262(4-5), 772-780 (1999). XX DR GenBank; AF134171; Positions 1 4751. XX CC ORF1: 259-1131; ORF2 (reverse transcriptase):1131-4574. CC This consensus sequence has been constructed [1] from 161 CC independent reads on genomic shotgun clones. XX FH Key Location/Qualifiers FT CDS 259..1131 FT /product="TRE3C_1p" FT /translation="MNFFDKLKGELNLGEEELNKKTIQIRLFYPHTLELNS FT NSTKLIDNFIVNGTLNIVRRQKGITVFGRILKEKFTLLHECYSTFKYEDFP FT LQKFILYTPGYHYVHFYTFTKNIIKKINNQMVKENWGLEPLLIDECNIVML FT PDVIRYSCILLFNKEDDISIIKTNIDAIKGSKHPIRAFISEYNNKEIRDVR FT KKKNTSNKNQQPTTTNTTKSIPSTPPTTNASTTTPITTTPITTTQTTTTPT FT TTTHQLPTSNTTINNLNKTTTTTTTTTKPINKITTSNGTKFPIASSIRK*" FT CDS 1131..4574 FT /product="TRE3C_2p" FT /translation="MEQLKLLLWNCRGNQSTNAKNKTEETIKRIGTQLALL FT TETNFNGFNHHKTMFNFERIDHGSGKGTGIAIENRDTRKGHISINFKDDDG FT RILSIKYNSFNSINILLIYAPATISERNTFIINSKSLFKKYNSINHQIIAG FT DFNNNHDCNSFFGTELRKIIDQDMLLDTGIEENTPTFPRSMKRLDRIYCHP FT TLLNQNSKLVVHNTVFNKSDHFPITITIQTNRETTTTTTTKLERLPWTLCK FT EILNNKHIHDGLSELISKNKDKIKSVEEWTKFKNNVIRDYLKKEQNKIKKE FT KNKRKYVIHKLLGNSDIIPKMRKELNEEISRILEEERKVKAWDIKLKLHLH FT QETPSKYLTSILKSRAKDKSIFQIKDKDNKTISDKENIAKRFVEFYQDQYE FT EKEDNEETHKKLLEKWEVDVDLIKKLEIDRPIRIHEVTKAIKTSSIHKSPG FT LDGINALFYKYHINSIARILTIAFNDLLTNKKEIPTKFKEGVITTIFKKGD FT ELNISNRRPITLLNTDYKILSKILNSRLLDITSKIINKFQNGFVPNRFIQD FT NIQIMKEVIEISNKRKNNTLITFYDFNKAFDSISHKSITRTLEHIGIPPKF FT TAILLNLLKDTKNKIKINDFLVNGITIRRGTKQGDPISPTIFALVLEPLLI FT DIINDNTIKGFTLPNSKSLKLTAFADDIATFTNSTEELMKINTKIQKYCSA FT TSSSLNKEKTVMIAIGDKPHDLPFQESTVPERYLGLNFTKTGLNSKYNTLI FT QEMKNNLIKWKSQAITMKAKMTILKTYVLSKLTYHQYMDNLNEEQIEEINN FT MTRWFLFSSVKNTYTEERKYKTMMKIDRAYADWKEGGIKLWDIELRHIAFK FT IWYMNRLLHNNYNNNNNTLQEWYMEQLSRKKAHTSTLNDMCRHWGVFRVKF FT YQNHPKINELPDCIRNDNDEPLKLKEIYELMIKDRHPTPRRTEWQKLWAVR FT YNTAIPKVFININSISHQKGRNTLFRFFSRSLPGINHERDTRCKICGHLFR FT DPYSHLFTLCQDILDIEKTIISTVNKLSFIKIHRWSMDTLDISKYNRTERI FT FPNLIGIIAHQLWKIICHKLFNTDESKPEPKFEQKVIETELLNLIETEKFI FT TLKKIKHDEAILKNTNQDLHKYKFNKAWQTPAAPNPLPI*" XX SQ Sequence 4751 BP; 2208 A; 910 C; 589 G; 1044 T; 0 other; aaaagcctac agttgatcaa acggatacta gatacagaaa aacatatcca tcatcaatct 60 aacatcaact acattcaatc tatctactct acacactaat tccttcaatc cttggattgg 120 gaataagaat agatctgaat aaacgagaaa gaaaaaaaaa aaaaataaaa ataaaaatca 180 aaaatatttt tatataaaca cataattaaa aaaaaaaata ataaaaataa tataaaaaaa 240 aaaaaaaaaa aaaaaaaaat gaattttttc gataaactta aaggtgaact caacttaggc 300 gaagaagagt taaataaaaa aactattcaa ataaggttat tctacccaca taccttggaa 360 ttaaatagta attcaacaaa attaatagat aatttcatag ttaatggtac tcttaacatt 420 gtcagaagac aaaagggtat aacagtattc ggtagaatac taaaagaaaa attcacatta 480 ttacacgagt gttacagcac tttcaagtat gaagatttcc cactgcaaaa attcatactt 540 tacaccccag gatatcacta tgtccacttt tacacattta caaaaaacat aataaaaaaa 600 ataaacaatc aaatggtaaa agaaaactgg ggattggagc cattactaat agatgaatgc 660 aacatagtaa tgctaccaga tgtgattaga tattcatgca tactattatt taataaagaa 720 gatgatatct caatcattaa aacaaacatt gatgctataa aaggtagtaa gcatccaata 780 agagccttca tctctgaata taacaataaa gaaataagag atgttagaaa aaaaaaaaac 840 acaagtaata aaaatcaaca accaacaact acaaatacaa ctaaatcaat accatcaact 900 ccaccaacaa ctaatgcatc aacaaccaca ccaataacaa ccacaccaat aacaaccaca 960 caaacaacaa ccacaccaac aacaaccaca catcaattac caacaagcaa tacaacaatc 1020 aacaacctaa acaaaactac tactactact actactacta caaaacctat caataaaatt 1080 actacaagca atggtaccaa gtttccaatt gccagtagta ttagaaaata atggaacaat 1140 taaaattatt actatggaac tgcagaggta atcaatcaac caacgctaag aataaaacag 1200 aagaaacgat aaaaagaatt ggcactcaat tagcactcct aactgaaact aatttcaatg 1260 gattcaacca ccacaaaaca atgttcaact ttgaaagaat agatcatggt tcaggtaaag 1320 gaactggcat agcaatagaa aatagagata caagaaaagg ccacatatca ataaacttca 1380 aagacgatga tggtagaata ctatcaatta aatacaattc ctttaattct atcaacatat 1440 tattgatcta tgctccagcg actatatctg aaagaaacac cttcataatc aactcaaaat 1500 cactgtttaa aaaatataat tcaatcaacc atcaaataat agcaggcgac ttcaataaca 1560 accacgattg caacagcttc ttcggtactg aattacgtaa aataatagac caagacatgc 1620 tactagatac tggtattgaa gaaaacactc caacatttcc aagatcaatg aagagactag 1680 atagaatcta ctgtcatcca acactactaa accaaaattc aaaactggta gtccacaaca 1740 cagtatttaa caaatcagat cactttccca tcacaatcac aatacaaacc aacagagaaa 1800 caacaacaac aacaacaacc aaactagaaa gactcccttg gacattatgt aaagaaattc 1860 tcaacaacaa acatatacat gatgggttat cagaattaat cagtaagaac aaagataaga 1920 tcaaatcagt ggaggaatgg acgaaattta aaaataatgt catcagagat tatttaaaaa 1980 aagaacaaaa caaaataaag aaagaaaaga acaaaagaaa atacgtaatc cacaaattac 2040 ttggaaatag tgatatcatt ccaaagatga gaaaagaact caatgaagaa ataagtagaa 2100 tattagaaga agaaagaaag gtaaaagcat gggatatcaa attaaaactc cacctgcatc 2160 aagaaacacc aagcaaatac ctcacaagta tccttaaatc cagagccaaa gataaatcaa 2220 tattccaaat aaaagataaa gataataaga ccatatcaga taaagagaac atagcaaaaa 2280 gatttgttga attctatcaa gatcaatatg aagagaaaga agataacgaa gaaacccaca 2340 agaagttatt agaaaaatgg gaagtagacg ttgatttaat caagaaactc gagatagaca 2400 gaccgataag aatacatgaa gtgaccaaag caatcaaaac ttcaagtatc cacaaatctc 2460 caggtttaga cggtatcaat gcactattct acaagtacca catcaactca atagcaagaa 2520 tattgactat agctttcaac gacctcctca caaataagaa agaaatcccc acaaaattca 2580 aagaaggagt aataacaaca atattcaaaa aaggagatga actaaacatt tcaaatagaa 2640 gaccaatcac ccttctcaat acagattaca aaattttaag caaaattctc aacagtagat 2700 tattagatat caccagtaaa attatcaaca aatttcaaaa cggctttgtt ccaaacaggt 2760 tcatccaaga caatattcag atcatgaagg aagtaataga aataagtaac aaaagaaaga 2820 acaatacact cattaccttt tacgatttta acaaagcatt cgattcaatc agccacaaga 2880 gtatcacaag aacattagag catattggta tccccccaaa attcacagcg atcctactaa 2940 acctactcaa agatactaaa aacaaaataa agatcaacga ctttttagtc aatggaataa 3000 caatcagaag aggaacaaaa caaggagacc caatatcacc aacaatcttt gctctagttt 3060 tagaaccact tctaatagat atcatcaatg ataatacaat caaaggcttc actctaccaa 3120 attccaaatc attaaaactc actgcatttg ctgacgacat agcaacattt acaaactcca 3180 ctgaagaact aatgaaaatt aatactaaaa tccaaaagta ctgctcagca acatcatcat 3240 cgttaaacaa agagaaaaca gtaatgatag caataggaga taaaccacac gatctaccat 3300 tccaagagag tacagtccca gaaagatatc ttggcctcaa tttcaccaaa acaggtttaa 3360 attcaaaata caacacttta atccaagaaa tgaaaaacaa tctaatcaaa tggaaatcac 3420 aagcaataac gatgaaggca aagatgacaa ttctcaaaac atacgttcta tccaaattaa 3480 cataccatca atacatggat aatctaaatg aagaacaaat tgaggaaatc aataacatga 3540 ccagatggtt cttattctcc tcagttaaga atacatatac agaagagaga aaatacaaaa 3600 ctatgatgaa aatagacaga gcatatgcag attggaaaga aggaggcata aaattatggg 3660 atatagaact aagacatata gcattcaaaa tctggtacat gaacaggcta ctccataaca 3720 actacaacaa caacaacaat accttacaag aatggtacat ggagcaatta agtagaaaaa 3780 aagcccacac ttcaaccctc aacgatatgt gcagacactg gggtgtattc agagtcaaat 3840 tttaccaaaa ccatccaaag ataaatgaac ttccagactg tataagaaac gacaatgacg 3900 aaccactaaa actgaaagaa atttacgaac tcatgatcaa agacagacac ccaacaccaa 3960 gaagaacaga atggcagaag ttatgggcgg tgagatacaa cacagcaata cccaaagtat 4020 tcataaacat caacagcatt tctcaccaaa aaggtagaaa caccctcttc agattcttct 4080 caagatcact tccaggtatc aaccacgaaa gagacaccag atgcaagatc tgtggccacc 4140 tattcagaga cccttattct cacctcttca ctctatgcca agatatccta gatattgaaa 4200 aaaccatcat atcaacagtt aacaaattat cattcatcaa aatccacaga tggtcaatgg 4260 ataccttaga catatcaaaa tacaacagaa ctgagagaat cttccccaat ctcataggaa 4320 taatagcaca ccaattatgg aagataatct gtcacaaatt gttcaacact gatgaaagca 4380 aaccagagcc aaaattcgaa caaaaggtca tagaaacaga attactaaat ctcatcgaaa 4440 ctgaaaaatt catcacacta aagaaaatca aacacgacga agcaatacta aaaaacacca 4500 atcaagatct tcacaaatac aaattcaaca aagcctggca aaccccagca gctccgaacc 4560 ctcttccaat ttaagtagta gtaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4620 aaaaaatata atatagttta aaaaatttaa tacaattgaa taataaaaat aaaaccgttt 4680 aacaaacacc gccgaaaaga cacaaggaca ttccttgaag gtctgatcac aggtaaaaaa 4740 aaaaaaaaaa a 4751 // ID Gypsy-228_AA-LTR repbase; DNA; INV; 116 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-228_AA_; KW Gypsy-228_AA-I; Gypsy-228_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-116 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1060-1060 (2011). XX DR [2] (Consensus) XX SQ Sequence 116 BP; 39 A; 18 C; 24 G; 35 T; 0 other; tgtagtaggc tttggtgcat agttctagag ctaagtaggc attgacttgg acagtgacaa 60 ataagaccat tctgaaataa aacgttgaaa agataacatc gccttttatt cttaca 116 // ID Gypsy3-SM_LTR repbase; DNA; INV; 172 BP. XX AC Contig139; XX DT 15-AUG-2007 (Rel. 12.08, Created) DT 15-AUG-2007 (Rel. 12.08, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3-SM_LTR; KW Interspersed repeat; LG_I. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-172 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(8), 759-759 (2007). XX DR [1] (Consensus) XX SQ Sequence 172 BP; 69 A; 14 C; 32 G; 57 T; 0 other; tgaggaagtt ttatgatgaa gaaatatctt cgtagaaaat ttgtttaaat aagaaagatt 60 aaagcagaca atttttagag tttgatattt attgaaataa aatttatgga aacaaaaact 120 gattttaatg ttttacagga tgaacctcct aatatggtgg cagcggtaaa ca 172 // ID DNA4-6_AP repbase; DNA; INV; 242 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-6_AP. XX NM DNA4-6_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-242 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1953-1953 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD: TATA CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 242 BP; 47 A; 81 C; 72 G; 42 T; 0 other; ctctattcgc cgagagttga cccactcgct cgacgaaaac cggtcgcggc cgcgcaggcc 60 accgcgtctt tggagcgcta gtggtgatcg tcgccagtag cagctgccta ctacgacgag 120 cactgccgat cgccgaacac cgccgaccaa tcacagaccg ccggtctcct actcgggggg 180 tctgaacgtt ccggcggctc agtatgcatg cgcgcaagtg ggtcaactct cggcgaatag 240 ag 242 // ID Gypsy-15_RP-I repbase; DNA; INV; 3989 BP. XX AC ACPB02047260; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_RP_; KW Gypsy-15_RP-LTR; Gypsy-15_RP-I. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-3989 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02047260; Positions 4134 146. XX CC Positions [3190-3648] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 44..2026 FT /product="Gypsy-15_RP-I_2p" FT /translation="MPKSRKSRDLMEQEELDGMGELQKKMERFSFDPFDRT FT KVSLEQYLDQFERRCEVKGLGGTCMTRREIRKQLLLAYVGAESLGAVKNLL FT IPRQLDSCDYSDILGALQALYKPEKTIFTARIDFEQAVRSEEESLTAFLSR FT LKSLSADCKYGSSLDERLRDRLLGGVRLSKLEKEARMRWPDGQDSSGEPVK FT LEQVYQLARAMERVTEELKQEEIYKVGQSNKFKSKQQDLRARRKEREMREE FT NKGNRAKCENCGRRNHEREECPARKAKCFRCDKEGHFARMCKSSIKRIEED FT QQEYDDTDFSEDDSLKAIKSRVHKVGRAEIKVVLNGVSCEMEFDSGARVST FT ISRSWWTRLGQPPLQPLGNLQGYGKHLLEVLGQALVEVELNGERKKLKVAV FT MKEEDIPLFGLPWIQAFRLHLPKEVSIKKVKNQGKHSDVNVKVGNIINEFK FT ELFQDSLGTIKGTQAAVHLKPYANPIAFRARRVPFPLRRPVERELERLLQA FT GVIEKVDPTKTPILWATPTVNVDKGNGKVRICGDFRVTLNQYIIPDIHPMP FT TFADLTAKLVGGVEFSIIDLRDAYLQMEVAEQVRDFLVIATHLGFFRYGRL FT PLGLSSAPAIFQRWMENLIADIPNVGVNLDDIIITGPTREQHLATVKKSSG FT ENESRRIKCKIR" FT CDS 3031..3948 FT /product="Gypsy-15_RP-I_1p" FT /translation="MKGTQGVIAMKSIARYYVWWPAMEKDIEQYVAKCQGC FT QENRSNVPEVPLYSWNIPDYPWERIHLDFAGPFQGNYWMVGVDAYSKWPEI FT AIMGKITTEATINRLREWFSRYGVPKKLVTDNGPQFRSGEFKNFCDRNAIK FT HIRTTPYHPKSNGLVERFIRTFKQRYLAGKDEGGSPQEKVARLLLSYRNTP FT HRTTHKSPAELFLKRRLPTVLDRLKPDPKEEMEKEIWKQKRYHDLSIQKRE FT FIEGQEVWIKNELGKGWKPGMINCRTGELSYLVMSEGELKRKHADQLRARI FT GATAPEKEEKEEMS" XX SQ Sequence 3989 BP; 1359 A; 691 C; 1020 G; 919 T; 0 other; aattcttgat tacgtattta ctatattggc gacgaggatc aaaatgccta aatcaagaaa 60 aagccgagac ctgatggagc aggaggaatt ggacgggatg ggggagttgc agaagaaaat 120 ggagagattt tcttttgacc catttgaccg gacgaaggtg agtctagaac aatatttaga 180 ccagtttgaa aggagatgcg aggtgaaggg ccttggaggt acgtgtatga cccgcagaga 240 aataagaaaa caattattac tggcctatgt gggtgcagag agcctaggag cagtaaaaaa 300 tttacttata cccaggcagc tggatagttg tgactattca gatatcctag gtgcactgca 360 agctttgtat aagccagaaa aaactatttt tacagctagg attgattttg aacaggcagt 420 tagaagtgag gaagagtcac ttactgcatt cttaagcagg ttaaaatcac tgagtgcaga 480 ttgtaaatat ggatctagct tagacgagag gttacgagac agattattag ggggtgtcag 540 gctaagtaag ctagagaaag aagctcgcat gaggtggcca gatggacaag actcttcagg 600 agaaccagta aaattagagc aggtatacca gttggctaga gctatggaaa gagtaactga 660 ggaattaaaa caggaagaaa tatataaggt tggacaatcg aataagttta aatctaaaca 720 gcaagatttg agggcacgaa ggaaagaaag ggagatgcgt gaagaaaata agggtaatag 780 agcgaaatgt gaaaattgtg gaagaaggaa ccatgagcgg gaagaatgcc ctgcgaggaa 840 agcaaaatgt tttaggtgtg ataaagaagg acacttcgct cgtatgtgca aaagttcgat 900 aaaaagaatt gaagaagacc aacaggaata cgacgatact gattttagtg aggacgactc 960 cttaaaagcc atcaaatcaa gggtgcataa agtagggaga gcggaaatca aggttgtact 1020 aaatggagtt tcctgcgaaa tggagttcga ttcaggcgcc cgagtgtcta caattagccg 1080 gtcctggtgg acaagattag gtcaacctcc attacagcct cttggtaacc tccaaggtta 1140 tgggaaacat ttattagagg ttttggggca ggctctagtg gaagttgaat tgaatggaga 1200 gcggaaaaag ctgaaagtag cggtaatgaa ggaagaagat attcccttgt ttggcctgcc 1260 ttggatacag gctttcaggc tgcacttacc aaaagaagtg tccataaaaa aagtgaaaaa 1320 tcagggaaaa cacagtgatg taaacgtaaa ggtaggaaac ataataaacg aattcaaaga 1380 attgtttcag gattccctag gcaccatcaa gggcacgcaa gcggccgtcc atttaaaacc 1440 ttatgcaaac ccaatagcat tcagggctcg aagggttcca ttcccattga gacgaccagt 1500 agaacgagaa ttagagaggt tgctccaagc tggagtgatt gaaaaagtgg accctacaaa 1560 gacgccaata ctctgggcca cgcctaccgt aaacgtagac aaaggcaatg ggaaggtaag 1620 gatttgtggc gatttccggg tcactctaaa ccagtacatt atcccagata tccatccaat 1680 gccaaccttt gcagatctca cagcaaagtt agttggggga gtggaatttt caatcataga 1740 tctccgggat gcctatctac agatggaggt ggcagaacag gtcagagatt ttttggtcat 1800 tgccactcac ttaggatttt ttcgctatgg cagattaccg ttgggattat cttccgctcc 1860 cgcgatattc caacgatgga tggaaaattt aatagcggac attccgaatg tgggtgttaa 1920 tctggacgac atcataataa caggaccaac tcgcgaacaa catttggcaa cagtgaaaaa 1980 aagttctggc gagaatgaga gccgtaggat taagtgcaaa ataagataaa tgtaggtttc 2040 ttcaaccaga ggtaacatat ctggggcatc gcattgatca gtgcggtatc catccaacaa 2100 tggaaaagct ggaagctatt cgtggtgcca agtctccctc tacaaaaagg gagctaagag 2160 catttctagg ggcaattaat tactatgaaa aatttattcc ccatctgcac agcctctgcg 2220 cctgtttaca tcgcttgaca attttgaagg tgagatggaa ttggactgaa gatgaagaaa 2280 aggtgtttca agcgacaaaa cagttattta tgggtaagga gtgcattgta ccatatgacc 2340 caagaatgcc tatgaccctt atgtgtgacg cgtctgaatc agggttgggg cggttttgtt 2400 gcataacttc cctgacggaa ccgaacggcc ggtggcatac gcctcaagga cattagcggc 2460 ggccgaaaaa cattatgcgt ccattgacag agaagcacta gcatgtgtat ttggagtgag 2520 aaaattccac caatatcttt atgggaacaa attcacttta gttacggatc ataaaccctt 2580 agagagattg tttggtccca aaagagatct tcccaaagtg actaacaaca gactaacccg 2640 ctgggcggtt gttctcaata attacaactt cgatgtagaa taccgaaaag ggtcagataa 2700 ttctgtagcg gatgcattat caagacttcc actgaaaaca gaagataatg ataataatat 2760 cgatgtcgca gatagcttta tttcaataaa aaagttggaa gatttggaat taacaaaaga 2820 agaattacaa attcagacta agagggatca aatcttaagc aaggtgagta ctttcgtcat 2880 caccagctgg ccgccgaaac tcaaagaggt aggaataaaa tactatttcg acaaaaagaa 2940 tgaaataatt ttagaagggg gcatacttat gtggaacggg agaataatag ttccccagac 3000 actcagaggg aaaaacgcta gcgactctcc atgaagggca cccagggtgt gatagcgatg 3060 aagtcgatag cgcgttatta tgtttggtgg cctgcgatgg aaaaagatat agagcagtat 3120 gtggccaaat gtcaaggttg ccaagagaac aggtcaaacg tgcctgaagt accactttac 3180 tcatggaaca taccagacta tccatgggag aggattcacc tggatttcgc tgggccgttc 3240 caaggtaact actggatggt gggagtggat gcttactcca agtggccaga aatagccata 3300 atgggaaaga taaccacgga agcgacaata aatagactcc gagaatggtt ctccaggtac 3360 ggggtaccaa aaaaattagt gactgataat ggacctcaat ttaggtccgg agagttcaag 3420 aatttttgtg atagaaacgc gataaaacac atcagaacca ctccatatca cccaaaatct 3480 aatggattag tggagcgatt cattagaacc tttaaacaac gatatctggc ggggaaggac 3540 gaaggaggca gccctcaaga aaaggtagca aggttgttgc tgagctatcg gaacacacct 3600 cacagaacta ctcataaatc accggctgag ttatttttga aaagaagact ccctactgtg 3660 ttggataggt tgaagccaga tcctaaggaa gaaatggaaa aggagatctg gaaacaaaaa 3720 agataccacg atctaagtat ccaaaaaagg gaatttatag aggggcaaga ggtttggata 3780 aagaatgaat tagggaaagg atggaaaccg ggtatgatta actgcaggac tggagagtta 3840 tcatatctag tcatgtcaga aggtgaactt aagagaaagc atgcggacca attaagggca 3900 aggataggtg ctactgcacc agaaaaggaa gagaaggaag agatgagctg aagatggaga 3960 ggtagatagc taagtaaagc ggggagaaa 3989 // ID hAT-38_SM repbase; DNA; INV; 2383 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-38_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2383 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1841-1841 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 423..1739 FT /product="hAT-38_SM_1p" FT /translation="MANSSTRSSCRHGIFGQSSVLPINQLPIQEEVFKCFL FT WYRNQSHEASSQIQSKRDIMKLVAVDVIAIWAKASIPTIAQHSVINKMEKD FT SFQTIFDICSCKCISRGISDRNRRICKMKVPRIEWEFWVDQISERNIVIGN FT VDLDTSAALERQIQRKEKCANLKRESTNSMQMVSGYSVEENLSPQEAESSE FT GKEDXMECEISAETRISNRDACKVINACLQDMNINTQENLPMSSKLRRQRI FT LYRTAAVTNHCISNQELLCIGFXGRIDETRLYEGEPSSLHIVHFKPIIGET FT DTIFDKLESGLSQDQIYLYRIANVIQLRFKNSDHYELNYILTASPGNLSQA FT RWLTCANRILRYYIGTKTPSTSLISIVSYILNVYTPSWFRIKTHPYLADGT FT KNFFYIVQRIQLFCNKSIRQVPQSSLQKKQLLLSPGKCSCVGTVR" XX SQ Sequence 2383 BP; 796 A; 424 C; 430 G; 731 T; 2 other; gggcgggtcg attttcacaa cctataccgt ctttagtggg aaaagttgcc aatcacgcat 60 acaagtacta ttataatcat taaaagtgtt ttagaatatg tttaggtccc ccgcatttga 120 tcagactgca tataatattg atatattaat atatctatgt acattttaat ggattcatta 180 ttccttatta aaattacctg aataaaatat aaaatgttgg tactggcctt tcattattat 240 cgtatttcat aaaaataaga aattataatt catttaaatg tgttttactg tatcattgtc 300 tcatacagcg cttattcgat aatccatctg tatttctgat ttatagaagc tataacataa 360 gtataatttt atggtatatt tgtaatttac ctaaatactt tattatttta atttagttct 420 agatggctaa tagctcaact agatccagtt gtcgccatgg aatatttggt cagtcaagtg 480 tacttcctat aaatcagctt cctattcaag aagaagtatt taagtgtttc ttgtggtatc 540 gaaatcaatc ccacgaagct tcatcacaaa tacagtctaa gcgagatatt atgaaacttg 600 tagcagtgga tgtaattgcc atctgggcga aagctagcat tcctaccata gctcagcata 660 gtgttattaa caaaatggag aaagattcat ttcagacaat ttttgacata tgttcttgca 720 aatgcattag cagaggaatc tcagatagaa accggcgcat ttgtaaaatg aaagttccta 780 gaattgagtg ggaattttgg gtggaccaga tttcagaaag aaatatagtg ataggaaatg 840 tagatttgga tacatcagct gctctcgaaa gacaaattca aagaaaagag aaatgcgcga 900 acctaaagcg agaaagtaca aatagcatgc aaatggtgtc aggatattca gttgaagaaa 960 acctttcgcc acaagaagca gaatcaagtg aaggcaaaga agattncatg gaatgcgaaa 1020 tctcggctga aaccagaata agtaaccgag acgcctgtaa ggtaattaat gcttgtttac 1080 aagatatgaa catcaataca caagaaaatc taccgatgtc atcaaaattg cgaagacagc 1140 gtattcttta tagaacagca gctgttacta atcactgtat ttccaatcaa gaattactat 1200 gcatcggttt tganggacga attgatgaaa ccaggttgta tgaaggagaa ccttcatctc 1260 ttcacatagt ccacttcaaa cctataattg gtgaaacaga tacaatcttt gacaaattgg 1320 agagcggttt aagccaagat caaatttacc tctatcgaat cgctaatgtg attcaattgc 1380 gtttcaaaaa ctctgatcat tatgagctca attacatttt aactgcttct cctggtaatc 1440 ttagccaagc acgatggcta acctgtgcga atcgtattct tcgatactac attggtacaa 1500 aaactccatc tactagtttg atatctattg tctcctacat tctgaacgta tatactccgt 1560 cttggtttcg gataaaaact catccatatt tggcagacgg caccaagaat ttcttctaca 1620 tcgttcaacg aattcaatta ttttgtaaca aatcaatacg ccaggtgccc cagtcgtctc 1680 tacagaaaaa acagctactt ctgtcacccg gaaaatgttc ttgtgttggc actgtcagat 1740 gaaaacatta acgtgaggcg agatgcagtg cagaaagtcg tggtggctag attttcaaga 1800 aaagaaggtg aaatttgtca attttcaaag tcaaatttca tcatcaactt tcaagccgag 1860 aattactttg acctgatcga ttggaacccg tcttttatca cgcctgcacc tattctaaat 1920 gacatatctg acgagtatct aagacagcta gttgaagcag ctccattgcc aataccgaaa 1980 tttcattgtc atactcaggc agttgaacgg actgtaaagg aagttactcg tgtttcctca 2040 aaggtttacg gccacgaagc aagacatggc atgattgtat cagcagaaaa ctcaaggaga 2100 aagtatgaaa ataaagatat tgaaactaaa gaaggatatt tcacaagttg atcgtattca 2160 gtgttcaacc taactctatc gctaatgttt gtattgtgac tttaaagtag ttgaaaaata 2220 tacacagtgt gtaatttcac ctttttcttt tacataaatg cgggggccct agacattaac 2280 caatttgaaa gaaaatttgc atgtgaactt tggatataca aagacaactt tttttagtag 2340 agacaaggta gtcctgtaag ttttgtattc catcgacccg ccc 2383 // ID Ingi repbase; DNA; INV; 5257 BP. XX AC . XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 15-JUL-2010 (Rel. 15.08, Last updated, Version 3) XX DE Non-LTR retrotransposon family Ingi in Trypanosoma brucei. XX KW Ingi; Non-LTR Retrotransposon; Transposable Element; KW Repetitive sequence. XX NM INGI. XX OS Trypanosoma brucei OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma. XX RN [1] RP 1-5257 RA Kimmel E.B., Ole-Moiyoi K.O. and Young R.J.; RT "Ingi, a 5.2-kb dispersed sequence element from Trypanosoma RT brucei that carries half of a smaller mobile element at either RT end and has homology with mammalian LINEs."; RL Mol. Cell. Biol 7(4), 1465-1475 (1987). XX RN [2] RP 1-5257 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (15-JUL-2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 9..4982 FT /product="Ingi_1p" FT /note="includes AP-like endonuclease, reverse FT transcriptase, and ribonuclease H." FT /translation="MPATSTWCQGPVPRIIGGSQEPAAFLSWGTLLCSGYG FT IIQHRDQQRLAGTPFFICRSLGTCQRAISSIIRTKMLLSGDVEENPGPSLR FT GMQWNCAGLSQGKRLALHKTLVDERIAFCLLSETRMTPGEAACFSVAGYQH FT HGIARNCKGGGVSILVREDLPVETGMAVVGRIEQVHATIHLARGTALTVTS FT AYIPPKHTFTATDLDTLLTTDGAQLIGADANAHALAWDRASPPNTKGETLT FT QWCIDNQFLVCNTGECTRYARHHGESTPDVTLSRNCTVYTWTSLYSPDSDH FT HHIFFDVIVGDDTDALSCPRLRKPMYAWLKADWRNFRLKVDELCRKIGREK FT NVNTLEQKLSSAIRIATKVSVPRGCRATPPHWTPELAKLDEEIAGCGPSHR FT REKLVATRKQILDRTTKKRWSTLCSRLAVSDRCSWHIVKKVYAPRPLTTPA FT VLVDNAAITDYRQAERFSKLYSSRARRHPDSHPPAPIKTIASEFSPITMAE FT LRRSIKLLPSGSAAGPDCLYNEALQHLGRTALNVVLRLFNESLRTGVVPPA FT WKTGVIIPILKAGKKAEDLDSYRPVTLTSCLCKVMERIIAARLRDTVESQL FT TPQQSGFRPGCSTLEQLLHVRAALCRPTHQYRTGAVFVDYEKAFDTVDHDK FT IAREMHRMKVSPHIVKWCVSFLSNRTGRVRFKEKLSRSRTFERGVPQGTVL FT GPIMFIIVMNSLSQRLAEVPLLQHGFFADDLTLLARHTERDVINHTLQCGL FT NVVLQWSKEYFMSVNVAKTKCTLFGCIERHPLTLQLDGERIGADRTPKLLG FT VTFQCLQGMATHAAETRRKMDFRLLQIAAISASTWGPRRQVLRAFYLALVQ FT AHTMYGIEVWYWDASERSRDLLAAAQHKASRIIAGIPHGTRKEDSLLEANL FT LPLKTTTLVRSMKFMLMCESRGGCLRRSAEEVYHSKHPVRALHSRIMRSYP FT HLRIEPREHPLETSTLRHSCRPLFHTQIKPVCADDPDDVKREASEKWIARH FT FARRGKEPPRREHYELWTDGSVSLGEKSGAAALLYRNNTLICAPKTGAGEL FT SCSYRAECVALEIGLQRLLKWLPAYRSTPSRLSIFSDSLSMLTALQTGPLA FT VTDPILRRLWRLLLQVQRRKIRIRLQFVFGHCGVKRNEVCDEMAKKAADLP FT QLRDTWIPDIIAYAKRVLRSEEVHENTHRFGITGNHFPTKHKEELTREEET FT ALARFRVGSSRHYGWMLRKINPSVPPQCRWCNPQHAAIGPTIQTAPTVATR FT TLQRTSEPTKCTECDATYQCRSSAVTHMVNKHGFVRADALRRIKYGDATPA FT VDIPPEPPPVVAIVPLPSSTRVPMRPQVLHCTLCASKFAVPGRLLHHLRTI FT HGIGSGSCRVKRGRENEDSXQGDGRAPAAPASQDTRKLPFQCDLCEASFGT FT RSSLSLHKKFKHKSIVTEDGTVVVVQFPRKRAREGNVDVPGKGEVQCGVCQ FT KVLSCRDSLIRHCKAFHKGEGVELKCSKSTKLCATDSPHTNTSMLVCPTCG FT RQCASKTGLTLHQKKMHGMKVERAVTSQRGDCEETSLHLMSCPGLKELRVR FT FAVEGVCVQDVCFSKRLAQFLIAVERSRPQVKSSPKVTVIQPTLPSATSPL FT IAECGASRKSTGRRNSPDTLRDRGGMQSTSNRNRKKRGRE" XX SQ Sequence 5257 BP; 1402 A; 1395 C; 1423 G; 1036 T; 1 other; ccctggcgat gccggccacc tcaacgtggt gccagggtcc agtaccccgt atcatcgggg 60 gaagccaaga gccagcagcg ttcctttcat ggggaacact gctgtgctcc ggctacggca 120 tcatacagca cagggatcag cagcgtcttg ctgggacacc gtttttcatt tgtcggtccc 180 tgggcacgtg ccagcgtgcc atcagcagta tcatccgcac taagatgctg ctgtccggtg 240 atgtggaaga gaatcccggc ccgtcgttgc gcgggatgca gtggaactgc gccgggctat 300 ctcaaggaaa gagattagca ctccacaaaa cccttgttga tgagcggatc gccttttgtc 360 tgttgagcga gacaaggatg acgcctggag aggcggcttg ctttagcgtt gctggctacc 420 aacatcacgg aatagctcgt aactgcaaag gaggtggtgt atcaatacta gtgagggagg 480 acctaccagt cgagaccggt atggccgttg ttggtcgcat tgaacaagtg catgcaacaa 540 ttcaccttgc acgtggaacg gcgctgactg tcacgtcggc atacatccca ccaaaacaca 600 ccttcacggc aactgaccta gatacgcttc tgacgactga cggtgcccag ctcatcggtg 660 cagacgctaa cgcacacgcg ttggcatggg atcgcgcgag cccaccaaat accaagggtg 720 aaaccctcac acagtggtgc attgacaacc agtttctggt ctgcaacact ggtgagtgca 780 ccaggtacgc gcgccaccac ggggagtcca cccctgatgt gacactgtca aggaattgca 840 cagtgtacac gtggacatcg ctgtattctc ccgatagcga tcaccatcac atatttttcg 900 acgttatcgt tggagacgac acagatgcac tgagttgccc gcggcttcga aagcccatgt 960 acgcatggct aaaagctgac tggaggaatt tccgtctcaa ggttgacgag ctctgcagga 1020 aaattggtag agagaagaac gtcaacaccc tggaacagaa attgagctcc gccatccgca 1080 ttgcgacgaa ggtctccgtc ccccgtggct gcagagcaac gccaccacat tggacacctg 1140 agctcgcgaa actcgatgaa gagatcgctg gatgcggacc ctcccaccga agggaaaagt 1200 tggtagccac gcgtaagcag atcctggacc gtaccacaaa gaagagatgg agcacgctat 1260 gctccagact tgcggtgtca gaccgctgca gttggcacat tgtcaagaag gtatatgcgc 1320 cacgaccact aaccacaccg gctgtacttg ttgataacgc ggccatcacg gactaccgtc 1380 aagctgagag gttcagtaaa ctgtactcgt cccgcgcaag aaggcacccc gactcacacc 1440 caccggcacc aataaagacg atagcgagtg agttcagtcc catcacgatg gctgaactac 1500 ggagatcgat caaactgcta ccgagtggat ccgcagccgg acctgattgc ttatacaacg 1560 aggcactgca acatctcggt agaacagcgc tgaatgttgt tctgaggcta ttcaatgaga 1620 gcctacgaac gggagtcgtg ccgcctgcat ggaagactgg tgttatcatc cccatcctga 1680 aggccggaaa aaaggcggag gacctcgatt cttacaggcc tgtgacgctc acgagctgtc 1740 tctgcaaagt catggagcgc ataattgccg cgaggcttag agacactgtt gagtcccagc 1800 tgacgccgca gcaatcaggc tttcgccccg gatgctcaac gctcgaacaa ctcctgcacg 1860 tccgcgctgc cctctgccgt cccacgcacc aatatcgtac gggtgctgta ttcgttgact 1920 acgagaaggc attcgataca gtagaccacg acaaaattgc gagggaaatg cacagaatga 1980 aggtatcacc ccacattgtg aagtggtgcg tatcatttct gagtaaccga actggcagag 2040 tgagattcaa ggagaagctt tccagaagca gaacatttga gcgaggagtg ccacaaggaa 2100 ctgtccttgg cccaatcatg ttcattattg tcatgaactc gttgagccaa cgccttgcag 2160 aagtgccgtt actgcagcac ggattctttg cagacgacct gacgctactt gcgaggcaca 2220 cagagaggga tgtcatcaac cacacactac aatgcggcct aaacgtggtg ttacagtggt 2280 caaaagagta cttcatgtct gtcaacgtag cgaaaacaaa gtgcacactc ttcgggtgta 2340 tagagcgcca cccccttaca ttacaactgg acggcgaaag aataggagct gacaggacac 2400 cgaagcttct aggagtaaca ttccagtgtc tgcaggggat ggcaacacat gcggccgaaa 2460 cgagacgcaa gatggacttc agactactgc agatagcagc catctcagct tctacatggg 2520 ggccaagacg acaagtactg agagcttttt atctagcact cgtacaggca cacaccatgt 2580 atggcattga ggtatggtac tgggacgctt cggaacgaag tcgcgacctc cttgcagcag 2640 cacaacacaa agccagtcgc atcatagccg gcataccgca tgggacgcgc aaagaggact 2700 ctctgctgga agcaaacctc ctgccactca agacgaccac tcttgtgcgc agcatgaaat 2760 tcatgctgat gtgtgagtca cgaggcggat gtttgcggcg cagtgctgag gaagtatacc 2820 acagcaaaca cccagtcaga gccctacatt cccgcatcat gcggtcctac ccccacctcc 2880 gcattgagcc acgcgagcac ccactagaga catcgacgct ccgccacagc tgccgaccgc 2940 tatttcacac gcagataaag cctgtgtgcg ctgatgaccc tgacgatgtc aaaagggagg 3000 cttccgaaaa gtggattgca cggcattttg cacggagggg gaaggagcca ccgcggcgag 3060 agcactacga attgtggact gatggatccg tgtccctcgg tgagaagtcc ggagcagctg 3120 ccctgctcta tagaaacaac acgctgattt gtgcacccaa gaccggagca ggggaactct 3180 catgcagtta cagagcggaa tgcgtagcat tagagatagg actgcaacgg ctgctgaaat 3240 ggcttccggc atacagaagc acaccgagca ggttgtccat cttctctgac tcgctgtcaa 3300 tgttaacagc actgcagaca ggccccctag ccgtaacgga cccaattcta agacgactat 3360 ggaggcttct gcttcaagtt cagagaagga agatacgtat ccgactgcaa tttgtgtttg 3420 gccattgtgg cgtgaaacgg aatgaggttt gtgatgaaat ggccaaaaag gccgcagatt 3480 taccacagtt gcgagacaca tggatccccg acatcattgc ttatgcgaag cgagtgctta 3540 ggtcggaaga agtccatgag aacactcata ggtttggtat cacgggcaac cactttccaa 3600 caaaacataa ggaagaactg acgagggaag aagaaacggc actggcacgc tttcgggttg 3660 ggtcttcaag acactatgga tggatgttgc gaaagatcaa cccgagtgtg cctccacagt 3720 gccgatggtg caacccgcaa catgcagcga tagggccaac aatacaaaca gccccaactg 3780 ttgcgacacg cactcttcag agaacctccg aaccgaccaa atgtacggaa tgtgatgcca 3840 cataccaatg ccgctcgagt gctgtaacgc acatggtaaa caaacatggc tttgtgcgag 3900 ctgatgccct ccggaggatc aaatacggcg atgcaacacc tgcagtggat atccccccgg 3960 agccccctcc agtggtggcg atcgttcctc taccatcgag cacacgagtc ccgatgagac 4020 cgcaggtgct tcattgtacc ctctgtgcct ccaaattcgc agtgccaggc cgactattac 4080 accaccttag aacaatacat ggcataggca gcggtagttg ccgcgtgaaa agggggcgag 4140 agaacgagga ctcaktgcaa ggagatggta gagccccagc agcgccagcc tctcaggata 4200 cacggaagct gccgtttcaa tgtgacctgt gcgaggcgag cttcggtaca cgctcttccc 4260 tgtcactaca caagaaattc aaacataaga gcatagtgac ggaggacggt accgtggtgg 4320 tggtgcaatt ccctcgtaag cgtgcccgtg agggaaacgt tgacgtcccg gggaaaggcg 4380 aggtgcagtg tggtgtgtgc caaaaagtgc tcagttgcag ggactccctc atccgacact 4440 gtaaggcttt ccacaaaggt gagggagtcg agcttaaatg cagcaaaagc acaaaattgt 4500 gtgccactga ttccccgcat acaaacacat ccatgttggt gtgcccgaca tgcggaaggc 4560 agtgtgctag caaaactggc ctcaccctac atcaaaagaa gatgcacggt atgaaggtag 4620 agcgcgctgt taccagccaa cgcggcgact gcgaagaaac gtcgctgcac ttgatgagct 4680 gtcccggttt gaaggagtta cgtgtcaggt ttgcggtaga aggtgtgtgt gtgcaggatg 4740 tatgtttctc taaaaggctg gcgcagtttc tcattgcggt tgaacggagc cggccgcagg 4800 tgaagtcctc acctaaggta acagtcatac aacccactct cccttctgca acttcccccc 4860 ttattgccga gtgtggagct agcaggaaaa gcaccggacg caggaatagt ccagacaccc 4920 tcagagacag agggggtatg cagtcaacaa gcaacagaaa cagaaagaag agggggagag 4980 aataataaac gaataacaaa aatcaaaaca aaaggattgc taattgacat cttttggaga 5040 gtccggggtg gggggcttct cgccccatct gctgtattcc gttcaactgc ggagctacaa 5100 caaaaattat agagggtgtg ttaggctgaa taaaaaaggg agactctgcc acagtcgcca 5160 gaccgatagc atctcagggc tctacggtga tggctgatgg ccgcgccagt ggggggaaac 5220 tctcacgaag gcacgaagaa aattccaaaa aaaaaaa 5257 // ID PRSAT1 repbase; DNA; INV; 142 BP. XX AC M76467; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE P.ratzeburgii satellite repeat sequence. XX KW SAT; Satellite; Simple Repeat; Constitutive heterochromatin; KW PRSAT1; heterochromatin repeat; Satellite repetitive element. XX OS Palorus ratzeburgii OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Palorus. XX RN [1] RP 1-142 RA Ugarkovic D., Plohl M., Lucijanic-Justic V. and Borstnik B.; RT "Detection of satellite DNA in Polorus ratzeburgii: Analysis of RT curvature profiles and comparison with Tenebrio molitor satellite RT DNA."; RL Biochimie 74, 1075-1082 (1993). XX DR GenBank; M76467; Positions 1 142. XX SQ Sequence 142 BP; 53 A; 19 C; 26 G; 44 T; 0 other; gaatccgaaa caaaatcgtt ccttaaatac gaaatgcgaa tgatttaaga acgtttgcag 60 agaaatctgg ctgatacatt cgctaatttt taaaattaag cagtaaaatg taaagtttgc 120 atggtttcag ctgtatttaa ga 142 // ID BEL-128_AA-I repbase; DNA; INV; 3265 BP. XX AC supercont1.1; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-128_AA_; KW BEL-128_AA-LTR; BEL-128_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.1; Positions 3322077 3325341. XX CC 'GGCTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 501..2531 FT /product="BEL-128_AA-I_1p" FT /translation="MAEEAKTIDEWQQKKQNVVDSVALLEYYVTNFDKEES FT RKEQVEAWAEKLERFYDDFHRMAVKIEALSSEEDPIDLKGERQKFDSRYYG FT LRAFYLKQMAKATSTSSSNLTQPGMSMNIRLPEINLPKFSGRLEEWCVFRD FT SFQSAVGSRSDIGPVEKLHYLKGLIQGEAARILDPIKVSEQGYKDAWRALK FT LRFENKRQLIKCHIKTLFDTPAMCEESSDELLALADRFEQQISVLKSLGEP FT ADKWSSLLVYLLTIRLDPCTLREWENYCTRLDTDNIASVLGGIASTSSTIA FT DDSTSMPSYVQMVNFLQNYSRVLHAVSSATFDTSLRQYTKSPTKSAGYLLA FT SPPNKASNPIPDSSSPSKPERHCEKCGQNHFLFHCPEFQNLDVSRRTELVR FT QGNMCMNCLRSSSHIAQTCSGTSCRVCSRKHHTLLHTDASDIMPTSNTQSA FT DSTCYVAFEQVRPSTEPSKFVSQHLIQQPSTSTASVNPYFGVHDTSGNLQC FT DLVSQSQAAITGTVFLPTALVNIRNSRGRTITACCLLDCASQRDYVSAGLC FT DKLQLPQVQLPLPITVSGIGNTTTLVEHEAKVTVFSRMSPISVESSMLILP FT SITTKLPHCSVNVHQWSIPHHVNLADPTFAVTSNVDMILGAAHFFHVLRDG FT RIHLGNDLPLLQNTEFGWVVSGEYAGES" XX SQ Sequence 3265 BP; 838 A; 880 C; 772 G; 775 T; 0 other; taatttggtc cttcgaaccg gatattggac cgccggtgtt gtgaaaagtg tccaccgaag 60 gcctagaaga ggcaaaaaat cacgctactg atgcgagatc atcgtagcaa ccgagaaaca 120 aggtgcaaag tttgactttg ccgctgtgaa gaaccagtaa aagtgcgcgg aaactgattc 180 acgatagcgt ttcgacctca agaaccgcca tcacgatgtc gttagtgaac taacagaagt 240 acgatttccc cagaaaatcc cccattttgc cttttaccgc gagtgaaatt gaacgaaaag 300 tcaatctttc gtccccgtcg ttcaactccc gacggctgtt agtgtgtggt tatcttctgc 360 tgctgaatgc acagtgggct aaagtggtga agtgggtgaa cgaaattcga acaagataaa 420 gttttgcaaa tcaacaacat tttgattgtg agacattttt ttggtttttt ggcggtggaa 480 aagtgtgttc ggaaagaaca atggcggaag aagcgaagac cattgatgag tggcagcaga 540 agaagcagaa cgtagtggac tccgtggctt tgttggagta ctacgtgacg aacttcgaca 600 aggaagagtc ccgcaaggag caggtagagg cttgggctga gaagcttgaa cgtttctacg 660 atgacttcca tcgtatggct gtcaaaatcg aagccttgtc atctgaggag gaccctatcg 720 atttaaaagg tgagcgacag aaatttgaca gtagatatta tggcttgcgc gcattctacc 780 tgaagcaaat ggcgaaggcc accagtacct cttcttccaa tcttacccaa cccggcatgt 840 caatgaatat tcggcttccc gagatcaatc tcccgaaatt cagcggccgt ttggaggagt 900 ggtgtgtctt ccgcgattca tttcaatctg ctgtcggttc ccggagcgat atcgggcctg 960 tcgagaagct gcactacctc aaaggactta ttcaagggga agcggcaagg attcttgacc 1020 ctataaaagt cagcgagcag ggctacaagg acgcatggcg ggcactcaag ttgcgcttcg 1080 agaataagcg gcagctcatc aagtgccaca tcaagacgct cttcgatact ccagcaatgt 1140 gtgaggaatc gtccgatgag ctcctggcgc tggccgatcg cttcgaacag caaatctccg 1200 tgctgaagag tttgggtgaa ccagccgaca agtggagttc gttactggtc tacctactga 1260 cgattcgcct cgatccttgt acactccgag aatgggagaa ctactgcacc aggctcgata 1320 ccgacaacat agcttcggtt ctgggaggaa tcgcttcaac atcgagcacc atagctgatg 1380 attcgacttc catgccgtcg tacgtacaga tggtgaactt cctccagaac tattcgcgtg 1440 tcttgcatgc tgtctcctca gccacgttcg atacttctct tcgtcaatac accaagtccc 1500 cgacaaaatc agcaggatat ctcctagcat cgccgccgaa caaggcatct aacccgatac 1560 cagacagttc cagtcccagt aagccggaaa ggcattgtga aaagtgtggt cagaatcact 1620 tcctgtttca ctgtccagag ttccagaacc tagacgtcag ccgccggacc gaactggtaa 1680 ggcaaggaaa catgtgcatg aattgtttgc gctcctcttc ccatatcgcc caaacatgct 1740 cgggcacaag ctgccgagtc tgctctcgaa aacaccacac gctgcttcac acggatgcca 1800 gcgatatcat gcccacttct aacactcagt ctgctgattc gacttgttac gtcgcctttg 1860 aacaagtccg accaagtacg gagccatcga agttcgtttc tcagcatcta attcagcaac 1920 cgtcaacttc tactgcttcc gttaatcctt atttcggtgt tcatgacact tccgggaatc 1980 tccagtgcga tctggtctcg caatcccaag ccgcgatcac tggaaccgtc tttcttccaa 2040 ctgcgctcgt gaacatccgg aatagtagag gccgtaccat caccgcttgt tgtttgctgg 2100 attgtgcgtc ccaacgagac tacgtgtctg ccggactgtg cgacaagcta caactgccac 2160 aagttcaact accactacca atcactgtca gcggaattgg caacactacc acgcttgtcg 2220 agcatgaagc taaagtgacc gttttctcac ggatgtcgcc aatatccgtg gaaagttcga 2280 tgctgatcct cccgtcgatt accacgaagc tgccacattg ttccgtcaac gttcatcagt 2340 ggtcgatacc acatcacgtt aacctcgctg atccaacctt cgctgtcacc agtaacgtcg 2400 acatgatcct gggagctgcc catttcttcc atgttcttcg tgacggacgg attcatctcg 2460 gcaacgatct gcccttgctc cagaacactg aattcggatg ggtggtctcc ggagaatatg 2520 ctggagaatc atgaaaattc cagtagccgt tgtatcaaaa atccaggcac cctcggagaa 2580 cctaccgaga atgctcctct ctggtcgagc ggaccgtcat gtcttgacga attggaccac 2640 tcgaggctac ccaagcacgt tccagccaaa tcaaacgctt ttgaaaactc caaagacagc 2700 tcgatcgtcc tgccattaac tgagaaggtt tatcacgaaa ttaatcttca gttcctgcac 2760 ccccagcttg tgaacgacga actcctcgtg acgagagagc accatgttga tcccggcttg 2820 ccgctgtcat gctcttcggc agaaatgtcg gtctcttggc ggccggaatc tggtacagca 2880 gagcgtttac cgttgcataa ttcccgtcca cgcaaaacca ccaacacttc tccggtgcga 2940 gtcacccaag tgcagccttt cgaatccgtt ggcaccgttc ttgccaccaa ggttgttcac 3000 caagatattc aacgccagcc tcaggcaatt caacagtcac aggagaagaa tcgcgcgcat 3060 ttactgcgac gcgacaagca gcacagatgt tccgcttttc cagcaatgat ccgttaccat 3120 ccatggaaat cccggagctg gaaagtggcc tgtgccagct gcgccaaacg tgaacccaac 3180 gcctattcta gctgacgacc tctgtccgaa gaagttcctc cgccgatgat aaggaaaaat 3240 cacgaaattt cattggcggc cggaa 3265 // ID DNA-1_BTe repbase; DNA; INV; 702 BP. XX AC . XX DT 01-FEB-2011 (Rel. 16.02, Created) DT 01-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE DNA transposon from the buff-tailed bumblebee. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-1_BTe. XX OS Bombus terrestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Bombus; Bombus. XX RN [1] RP 1-702 RA Jurka J.; RT "DNA transposons from the buff-tailed bumblebee."; RL Direct Submission to RU (15-JAN-2011). XX DR [1] (Consensus) XX CC ~92% identical to consensus. TTAA tsd. XX SQ Sequence 702 BP; 242 A; 104 C; 92 G; 264 T; 0 other; gggggtgtcc tgaattagaa tgttctaaaa cagcgtcttt ttgtgatttt ttttagaagg 60 gaaagaaaga aacgaagtta ttgaattttg aggtatggtt ttatatatat ttaacgaata 120 caaaaaaatt ttttttaaag taaaaaaaaa attataacag atttctaagc ctatttcatg 180 ggctgcagtt ttcaatcggc gggaaagatc cacctcgtaa ttcttgaccg aaacgaaaaa 240 ccaaaaaagg attaaattat tatatatttt tcttctcgat gaactaaaaa agttcgtcaa 300 aaagtttatt taaataattg ttatagcacc ttaaagttta ttttttcttt ttataaaaca 360 cattttttct ttaaacccgc caaaattttt aatttcacac tattttctgc ctttttttag 420 ttcatcgaga agaaaaatat ataataattt aatccttttt tggtttttcg tttcggtcaa 480 gaattacgag gtggatcttt cccgccgatt gaaaactgca gcccatgaaa taggcttaga 540 aatctgttat aatttttttt tttactttaa aaaatttttt tgtattcgtt aaatatatat 600 aaaaccatac ctcaaaattc aataacttcg tttctttctt tcccttctaa aaaaaatcac 660 aaaaagacgc tgttttagaa cattctaatt caggacaccc cc 702 // ID Copia-6_CQ-LTR repbase; DNA; INV; 222 BP. XX AC AAWU01041312; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_CQ_; KW Copia-6_CQ-I; Copia-6_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 328-328 (2011). XX DR Genome; AAWU01041312; Positions 4391 4170. XX SQ Sequence 222 BP; 60 A; 66 C; 43 G; 53 T; 0 other; tgttgagatg aacaaccctg tagtcgtcat cgagtcgacc tagtcggccg aaccccctgt 60 cgacctcacg cgcagctgtc agagctgcaa aacaacaaca acaaatgtgt tgcgtcatcc 120 gttacaaaag cacgcggttc aataaagtcc tccagtttag ttttgttaaa ctacgcgttt 180 ttctccggaa caaatccgac caacctgcca ctctgctcat ca 222 // ID Ginger1-1_HM repbase; DNA; INV; 3425 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.01, Created) DT 02-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3425 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 270-bp long. Tpase contain one intron (463-795). XX FH Key Location/Qualifiers FT CDS join(310..462,796..2592) FT /product="Ginger1-1_HM_1p" FT /translation="MHTEFADVYNFLKRCIYPKTAISKGDKSNFRRKTKPF FT VIEDNELYHLGKSGRKVIWCKEEQHHIIKSVHDGNDISNEANALSGHIGIN FT STTEHIKKRFFWFGMIKDITAYVSECDNCQKAKNKKLQVKPLLQSISIPKG FT NMKQVGIDLTQLPEVNGYKYLIVLVDYFSKWVEAEPLFDKTAKSVAIFLYK FT QICRHGCFEIQINDQGREFVNELSNELHRKTGTRQRMTSAYHPQANGLVER FT QNQSIKRTLVKVLEDAALEWPFIIDGVLFSMRIRKHKSTGFSPFELLYQRE FT PVLPIDIDQNLIDYNDNATLIEDDSQNKKAILETFNKMNEMKKCIFDEAYE FT NIQKSQVRQKRDYDKRVNPKNSIAIGTKVLLRNHKRDDRKGGKLVKPWIGP FT YTVTNISSTNNCSLKNEKNVILKKKYNLTSLSLYKEKTLSNGSIQIEEVNK FT LSNDNDIKIEPVIKDVESSKSYINTQHESCVNVLDKEIAKCATFEMRKLIA FT EKFEIVWKHNVAVGNINLTLCPQGLHRIKGDGNCFFRAVSFIITGSEDEHK FT QIRDKVVNHMCNQIHDEMTGYLSMPIQTHICKTRMLEDATWATDAEIIGCA FT SFMEVDIQVYSKYGEKTKWMIYPCSLRLNKLSDYSIFLDNSTGFHFDVVIN FT S" XX SQ Sequence 3425 BP; 1292 A; 502 C; 531 G; 1100 T; 0 other; tgtagtatga gattattact cccgcttttt ttttactccc cagtaaaaat ctcatattaa 60 atttttactc cccagtaaaa accgcttatc aaatttttac tccccagtaa aaatcttata 120 ttaaattttt actcccatta tgagattttt actcccataa aaatattatg cggcaaaacg 180 tatttttgta acaaaatcac gttgtatttt gagaatagtt ttcaacgttt taattactgc 240 ggtttcttga aaagcaataa gaaacgaatc tttttatgca gtggctttca taattctaaa 300 aatttaacta tgcatactga atttgccgac gtttataatt ttttaaaaag atgcatatat 360 ccaaagacag ctattagtaa aggcgataag tcgaactttc gaagaaaaac aaaaccattt 420 gtaattgaag ataatgagct gtatcatctg ggtaaatctg gcgttaaggt aaacagttta 480 tataattaga gtttttttat actagcgtgg tttgcgataa atacatacat ttttatgtat 540 gtgtttaagt atcacgtttt acataataac gtaatacata ataatatcat gttttaataa 600 tgtatcatgt aataataata cgtaatatat aataatatca tgttttacat aattaaataa 660 taaaaagctt gtactttact agtataatac tgccgtgata aggaaaaaca cagtaattgt 720 aaaaatatcc actttgcact cttttaataa atatatatat atatatttat aacagcatct 780 ttaatttttt tttagagaaa agttatttgg tgcaaagaag agcaacacca tataattaag 840 tctgttcatg atggtaatga tatttctaat gaggctaacg cattatctgg ccacataggc 900 atcaattcaa caacagagca tattaaaaag agattttttt ggtttggtat gataaaagac 960 attacagcct atgtttcaga atgtgataat tgtcaaaaag caaaaaacaa gaaacttcaa 1020 gttaaaccat tgttacaaag catatcgatt ccaaaaggta acatgaaaca ggttgggatt 1080 gatttgactc aactcccaga agtaaatgga tataaatatc tgatagtact agttgattat 1140 ttcagcaaat gggtagaagc tgaacctctt tttgataaaa cagcaaaatc tgttgcaata 1200 ttcctgtata aacaaatatg tagacacggg tgttttgaaa tacagattaa cgatcaaggc 1260 cgtgaatttg taaatgagct ttcaaatgaa cttcacagaa aaactgggac tcgccaacgc 1320 atgacgagtg cataccatcc acaagcaaac ggtttagtag aacgtcaaaa tcagtcaata 1380 aaacgaacat tggtaaaggt gctagaagat gctgctcttg aatggccatt tattattgac 1440 ggtgttttat tttcaatgcg aattagaaaa cacaaatcaa ctggattttc accctttgaa 1500 ttactttatc aaagagaacc tgttctgccg attgatatag atcagaacct aattgactat 1560 aatgataatg ctactctgat tgaagacgat tcccaaaaca aaaaagcaat tttggaaaca 1620 ttcaataaaa tgaatgaaat gaagaagtgt attttcgatg aagcttacga aaacatccaa 1680 aagtcacaag taaggcaaaa acgcgactat gacaaaagag ttaacccaaa gaatagtatt 1740 gctataggaa caaaagtttt attgcgaaat cataaacgtg atgacagaaa aggtggaaag 1800 cttgttaagc catggatagg accatacacg gtaactaata tttcaagtac taataactgt 1860 agtttaaaaa atgaaaaaaa tgtaatattg aaaaaaaagt acaatctaac aagtttgtcc 1920 ttgtacaaag aaaaaacact tagtaatgga agcattcaaa tagaagaagt taacaaatta 1980 tcaaatgata atgatatcaa aattgaacca gttatcaaag acgttgaatc aagcaagtca 2040 tatataaata ctcaacatga aagctgtgta aatgttttag acaaagaaat agcaaagtgt 2100 gcaacttttg aaatgagaaa attaattgca gaaaagtttg aaattgtttg gaaacacaat 2160 gtcgccgtag gaaacataaa cctaactctt tgtcctcagg gactccatag aattaaagga 2220 gatggaaact gcttttttag agcagtttct tttatcataa caggaagtga ggatgagcat 2280 aagcaaatta gagacaaggt tgttaaccat atgtgtaacc aaatacatga tgaaatgact 2340 ggatatttga gtatgccaat tcaaacgcat atttgtaaaa ctaggatgct tgaagatgcg 2400 acatgggcaa ctgatgcgga gataattgga tgtgcttcat ttatggaagt agatattcaa 2460 gtatacagca aatatggaga gaaaacaaaa tggatgatct acccatgcag tttacggctt 2520 aataagttaa gtgactattc tatattttta gacaattcaa ctgggttcca ctttgatgtt 2580 gtgattaata gttgaaaatc tagtccatat acatttgtat ttcttttacg agttttaaat 2640 aatgctgttt aatttcaatt attgtatttt cttttcttgt actatgtaag tttatatatt 2700 aaaaacagca atttgtttga aaacggaagt aatatagtta ggctgtctaa ataaaaaaaa 2760 gattttaata aaattacgtg gtgcaaattt aaaattgttt tacattacat attgtcttac 2820 aaatcgttat ctgaaaattg cccctcccct attttcaaac acaaaaaaat ttatttcatt 2880 tcgcaatggt gtttgaatct gctgctacac ctttcaaaaa taaatttgaa aattttcttt 2940 aacattacct tcataaaaac acatttaaaa aaatctataa agtgaagtcc ttttctaaaa 3000 acctccccac ctcccttctc tcatcctttt tatctcaaac ccctcctcca accctatttg 3060 gacgacgtct tctattgatg accccttact acatgatctg gatcggctct tgttttttca 3120 tttaaacact acaatatttt aaaataatgt tcatattaaa cggaatcgtt tttttattgc 3180 tatttaaaaa acaactgtct ttcaaacatt aaaaataaaa attcaaaata caacaaagtc 3240 gcaaaacaca ttttgccgcg taatattatg ggagtaaaaa tctcatagtg ggagtaaaaa 3300 tttaatataa gatttttact ggggagtaaa aatttaataa gcggttttta ctggggagta 3360 aaaatttaat atgagatttt tactggggag taaaaaaaaa gcgggagtaa taatctcata 3420 ctaca 3425 // ID BEL-2_DWil-LTR repbase; DNA; INV; 332 BP. XX AC scaffold_177039; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_DWil_; KW BEL-2_DWil-I; BEL-2_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-332 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (04-MAR-2011). XX DR Genome; scaffold_177039; Positions 1 332. XX SQ Sequence 332 BP; 113 A; 67 C; 64 G; 88 T; 0 other; tgttaccgcc gtcattacaa atttaaatat taaccgtcaa cccgaaaata ctgggttatg 60 gcagccctga aaataccatg gtatggcagc cctgaaaaac acgagcggga ccgaccgaga 120 gtttggcgtc agtctgtgat ggcaccaggg gaagaaagcg acaaactttt ttaaagcgtc 180 tcgttaaaag atttattaaa agcgttagga agccttcttc gtttaaaatg tcaaattaca 240 tcacacatgt tgtaaaatcg cgctaaataa actaaatgct atataaactg ttacttaagt 300 ttgttttaat acaaacaaaa ccccgtgcgt ca 332 // ID Gypsy-592_AA-LTR repbase; DNA; INV; 415 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-592_AA_; KW Ty3_gypsy_Ele160; Gypsy-592_AA-I; Gypsy-592_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-415 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 415 BP; 126 A; 108 C; 66 G; 115 T; 0 other; tggtacctac cgcgaaagtc aaaacgttcc catagcatcg aactgcttag gcgacacatg 60 caatgtacgc ataagcagga cgatgaccac caaccgtttc gacaaatctg acatgacaat 120 aatgtgtcat actcatattt caatttctca aagcacagtc ggtaagttat caccaagtgc 180 cttcagcctt gaaactgcct tctccacatg tgcattcatt ctctctcttt ctttctcgat 240 accgatcgtt atcgatcgct cacacgaata catgcatatg atatccagct acctccagta 300 ggacctaatg ttaaacgaaa ctttgaataa acgattcatt ctattatcaa cctcagtcaa 360 gtagagttga ggatcatttc acttcgaaaa accattatca aggctgatcg cctca 415 // ID Crack-3_AAe repbase; DNA; INV; 3747 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-3_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3747 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1219-1219 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 7 sequences with >95% CC identity. CC Closely related to Crack elements in Culex pipiens (Crack-1_CP CC to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 19..702 FT /product="Crack-3_AAe_1p" FT /translation="MSVEIKQSVSKLESEMECIKTSQQYISDEFDGMKDII FT SQHKXEICSLQKDVTSIKSDCVTTHQHVEELNYELNVLRQANFEGHMLISN FT VIKVAQEDLGELLRNMFSLLNINYDPEGILSVGRLSSSNQNGIQPILVRFA FT SILTKDKLMRAARERPICCDEIGLGVKQRIFFNHRLTPANQRLLAAARKFK FT REHSFKFVWFTNGEIFLRKDEDCRAIKISDVRDLCGLI" FT CDS 690..3611 FT /product="Crack-3_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MWINLNSISLRSFLNEFIIIVVDHCPNIINEIKIDYN FT NDISELSSTSLNILHLNTRSCRNKIDELTQMMYELNKTIHVIVFSETWLYE FT NEICNIADYTSYHSCREDRGGGVSIFVLGSLRSQLTINLTYDVNNFLIVDL FT IDIGIKVMGVYNPGRNVFEFLDKFEQIIINHEKLYICGDFNINLLDVSNDL FT VQNYRCRIESLGFFILNSSDSKYATRISNTISTTIDHFITNQFQFKMQLIT FT KDTESHLSDHKTLILSIEIRVEKPEQQRTAHVIKYERFLNETLHHNIEECN FT SFEHLTQNLSQIISENKEETVYKKTFEIRKPYINTALLRDIKHKNRLYKSY FT KNAPDNSAIKNELYRKYITERNRLRNKTKAAKENYYKQQIENHKCNAKKTW FT DLLRQVIFERKTIEQSSQKLSLMQNGVLLNNDKLIANCFNTFFINVGQVTS FT PPQTLGDFHSFMTQEIETEFGFHHIEEDTINNIIINLNSSAASGLDKISTK FT FLQKCKNYLIPKIVELVNHMIDSSVFDNVLKQAKVVPLFKSGDKLNTINYR FT PISILPALSKISEKVLYQQLSSYLFQNNLIHSNQFGFIPKSSTESATLELV FT NFVIRGLDDGKFVACIFIDLQKAFDCIPHDILIEKLAYYGLTNQAVALVSS FT YISNRQQICCVNDTQSDSMSIRTGVPQGSILGPILFNIFINDLLKLPLKGM FT LQCYADDAASKYKANSLALLKEMMQYDLDLMNQWFSSNRMSVNTSKSNFMI FT FTLSNDVPSLNLMIDNKPLKQVYETNYLGLIIDFKLRWQSHIQKVKSKIVP FT YIFAIRKARRCLGLQSCWLLYNSYILPRLSYLICLWGSAASSQLNVLKVLQ FT NRVIKIIKRLPVLSPSIALYSPRILSLINLHKYSIVFVIYKIKNGIIKHNI FT ELMPATEVHSHFTRNRERFFTSTPRTELAVRNIFYTGVINFNSLPQRLKNE FT TNIVNFKRNLRIHVFENGL" XX SQ Sequence 3747 BP; 1347 A; 604 C; 618 G; 1177 T; 1 other; aacgcccgta tcgacgaaat gtcggtcgaa ataaagcagt ctgtatcgaa gctagaatca 60 gaaatggaat gcatcaaaac atcacagcag tacatatcag atgagttcga tggtatgaag 120 gatattattt cccagcataa acakgaaatt tgctctctcc aaaaggatgt aacttcgatc 180 aaatctgatt gtgtaactac tcatcaacac gttgaggagt taaattatga gttgaacgtt 240 ttgaggcaag cgaattttga aggacatatg cttatttcga atgtaatcaa agtagcgcaa 300 gaagatcttg gtgaactttt gaggaatatg ttttccttgt tgaacataaa ttatgatcct 360 gaaggcattc taagtgtagg acgattatct tcatcgaatc aaaatggaat tcaaccgata 420 ttggtccgtt ttgctagcat tttgacgaag gacaagttga tgagggcagc acgagagcgt 480 ccaatctgct gtgatgaaat cggcttggga gtgaagcagc gcattttctt caatcatcgc 540 ctaacacctg ctaatcaacg tcttctggct gctgctcgaa aattcaaaag ggaacatagt 600 ttcaagttcg tttggtttac aaatggtgag atattcttgc gaaaagatga agattgtaga 660 gcaatcaaaa tttcagatgt tcgagattta tgtggattaa tttaaactca atttccttga 720 gatctttcct taatgaattt attattatcg tagtggatca ttgtccaaat attatcaacg 780 aaataaaaat tgattacaat aatgatattt ctgaattatc tagcacttct ttgaatattc 840 ttcatctaaa cactcgcagc tgtagaaata aaattgatga attgactcaa atgatgtatg 900 aactaaataa aactattcat gttatagtgt tctctgaaac atggttgtat gaaaacgaaa 960 tttgtaatat tgccgattat acttcatatc atagttgcag agaagatcgt ggaggtggtg 1020 tctccatatt tgtcttgggt agtttgagaa gtcaactgac tataaatctc acctatgatg 1080 taaacaattt cttgattgta gacctaatag atattggtat aaaagtaatg ggtgtttata 1140 atccgggaag aaatgtcttt gaatttcttg ataaatttga gcagataatt attaatcatg 1200 aaaaactata catatgtggt gattttaata taaatttgtt agacgtgtca aatgaccttg 1260 tgcaaaacta ccgatgccga attgaaagct taggtttctt tatactaaat agctcagatt 1320 caaaatatgc aacccgtata tctaatacta tttcaacaac catcgatcat ttcattacaa 1380 atcagttcca gttcaaaatg caactaatca caaaagatac tgaaagccat ctttcggatc 1440 acaaaactct tattttgtca attgaaattc gtgttgaaaa acctgaacag caaagaacgg 1500 cgcatgttat taagtatgaa agatttctta atgaaacgct acatcataat atcgaggagt 1560 gtaatagttt tgaacatctt actcaaaatc tttctcaaat tatcagcgaa aacaaagaag 1620 aaacagtata caaaaaaact tttgaaatac ggaagccgta cataaacaca gcgttactga 1680 gagatataaa acataaaaac agattgtaca aaagttataa aaatgctcca gacaattctg 1740 cgataaaaaa tgaactatat aggaaataca taactgagcg aaatagacta agaaataaaa 1800 caaaagcagc gaaagaaaac tattacaagc aacaaattga aaatcacaaa tgtaacgcta 1860 agaaaacatg ggatctacta aggcaagtaa tatttgaacg gaaaactatt gagcaatcct 1920 ctcaaaagtt gtctcttatg caaaatggtg tcttgttgaa taacgataag ttgattgcca 1980 attgtttcaa tacctttttt ataaatgttg ggcaagtcac ttcaccacca caaacattag 2040 gcgattttca ttcatttatg acacaagaaa tcgaaactga atttggtttt caccatattg 2100 aagaagacac aataaacaac atcattatca atcttaattc cagtgccgca agtggtctag 2160 acaaaatatc aacaaaattt ttgcaaaaat gtaagaacta tttaattcca aaaatagttg 2220 aactggtcaa tcatatgatt gattcttctg tatttgacaa tgttttgaaa caagccaaag 2280 tagttccttt gttcaaatct ggtgacaaat taaatacaat caattatcgc cccatttcta 2340 tcttacccgc tttatccaaa atttctgaaa aagttctcta tcaacaacta agtagctact 2400 tgtttcaaaa taatcttata catagtaatc agtttggatt tatacccaaa tcaagtacag 2460 aatctgcgac gttagaattg gtgaactttg taatcagagg cttagatgat ggaaagtttg 2520 tggcatgtat atttattgat ttgcagaaag cattcgattg cattcctcat gatattttga 2580 ttgaaaagct tgcatattat ggcctaacta accaagctgt tgcgcttgta agctcataca 2640 tttctaatcg ccaacagata tgctgtgtga atgatacaca aagtgattcc atgtcaatcc 2700 gtactggtgt accacaagga tctatattag gaccaatttt gttcaacatt ttcataaatg 2760 atttgttaaa actgcctctt aaaggaatgc tgcaatgcta tgctgatgac gcagccagta 2820 agtacaaagc aaatagttta gccttattaa aagaaatgat gcaatacgac cttgatttga 2880 tgaatcagtg gttttccagt aacaggatgt ctgtcaacac aagtaaatcc aatttcatga 2940 tattcacgtt atccaacgat gttcctagtt taaacctaat gatagataac aaacctttaa 3000 agcaagttta cgaaacaaac tacctaggtt taattataga ctttaagctg agatggcaat 3060 cacacataca aaaagtgaaa agcaaaattg tgccctacat ttttgctatt agaaaagcgc 3120 gtagatgtct tggtttacaa agttgctggc ttttatataa ttcatacatt ttacctcgcc 3180 tctcttattt aatttgcctt tggggttcag ctgctagtag tcaattaaat gttttaaaag 3240 tgcttcaaaa tagagttatt aaaattatta aacgcttgcc tgttctttcc ccttcaattg 3300 cattatactc accaagaatt ttatctctaa ttaatcttca caaatacagc attgtatttg 3360 ttatatacaa aatcaaaaat ggaattatca aacacaacat tgaattaatg ccggctacag 3420 aagttcactc acattttact agaaacagag aaagattttt tactagtaca cctcgtactg 3480 aattggcagt aagaaacatt ttttataccg gagtcattaa ttttaattca ctacctcaaa 3540 gactaaaaaa tgagacaaat atagtaaatt tcaaaagaaa cctaaggata catgtttttg 3600 aaaatggatt gtagttttaa gtgtattagc gtattttaag ttaatttagt gatacttgaa 3660 cataattgaa ttcgctagat aagtagctgt acacagtttg tattaacttt tcagctatgt 3720 atgcaataaa gaaaaaaaaa aaaaaaa 3747 // ID DNA-TTAA-1_AP repbase; DNA; INV; 423 BP. XX AC . XX DT 24-MAR-2009 (Rel. 14.03, Created) DT 24-MAR-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA-TTAA-1_AP. XX NM DNA-TTAA-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-423 RA Bao W. and Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(3), 659-659 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. XX SQ Sequence 423 BP; 139 A; 75 C; 72 G; 137 T; 0 other; gaggacgtcg cacccgcatg tgttgtctcc gtcttacaca cgtacgacat agcaaatgtt 60 cgttcagcag attcaattgt gtgctgttag tttttatatt agagtgaatt gacctattat 120 caaatttaaa ggtaagatta ttatctaggg ccccacgtag gcttttattc atattattag 180 ttttaagtaa gttatgacct ttttgaaagt gtacatttta aaatgatcat aactcactta 240 aaaataataa aattaataaa agcctacgtg gggccctaaa taataatctt acctttaaat 300 ttgataatag gtcaattcac tctaatatca aaactaacaa cacactattg aatctgctga 360 acgaaaattt gctatgtcgt acgtgtgtaa gacggagaca acacatgcgg gtacgacgtc 420 ctc 423 // ID L2B-1_HM repbase; DNA; INV; 4365 BP. XX AC . XX DT 21-JUL-2009 (Rel. 14.07, Created) DT 21-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Hydra L2B non-LTR retrotransposon - consensus. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4365 RA Kapitonov V.V. and Jurka J.; RT "L2B, a novel clade of non-LTR retrotransposons from animals."; RL Repbase Reports 9(7), 1413-1413 (2009). XX DR [1] (Consensus) XX CC This family, together with L2B-1_CP and CR1-1_AG from the CC mosquito genomes, belongs to a novel L2B clade of non-LTR CC retrotransposons. This clade appears as a sister clade of the CC L2A, L2, Daphne and Crack clades. The L2B-1HM consensus sequence CC was derived from multiple alignment of several copies of CC L2B-1_HM, which are ~96% identical to each other. XX FH Key Location/Qualifiers FT CDS 126..1334 FT /product="L2B-1_HM_1p" FT /note="ORF1." FT /translation="MTKTKKHLSPHNADKLDEDSDHSISNLKTSVFNCIEC FT EISIHGKSVECGVCKEWCHLKCSKLTNDDFNAITKLKKSRVIWFCFKCEKK FT LVKMNKNDKKVKYTFIKELYTDNKILNKEITELKIENELIKKELQDKIKIK FT QIPIISSNVNSISERTFSEIVVKKINNHDKINTKKLIMDNIKPKELGVEIQ FT HIRNQNNEVIIRCNDEKSKDKLYQEIKGKLGVNVSVEKGNTEKMKRIEFKI FT TKHDEVFNEILANDIIALNNLNNKAKVIVVKRINMTKNKYELVIADIDINT FT YEYIINKKNGRLAIGWESVQIRKHINIIQCYHCQGYNHIAKNCHFNTPSCC FT YCSGDHYSKDCQNHESNWCVNCLSKNKKFNLMINPQHCAFDKKCPVYNDIK FT NRVELKYDN" FT CDS 1338..4280 FT /product="L2B-1_HM_2p" FT /note="ORF2." FT /translation="QSIVVKDIKIAYTNINGLINKLDRVTLYVQKFKPHIL FT GLTETHITSLIFEKELEIFGYSFINLESSSAHTGGCILYYKNGIKVNSDPK FT SIVKDKCHWLLSTELVINKIKFHFSVLYKSLQENKTIFLEFFDTWITTIRS FT KNIIILGDFNINTLSNDKHSIALKQSIKNNQMYIFNSTPTKISEKSSTCID FT LAISNCTLIETNEKEKLSDHEMITCKYSIKEKRQFKTKKAKNIVVNSSKQN FT FHKIDYSLLNKNISVELKNIKKEIDIEVMANSYTTIIKNNINMQEKFYKTN FT KEINNNNPSILNNWLRNSKYTDIVKEYNQEWKTFVWFKYHGFSESKCKIQY FT NKFKKIRNKKTSLLRKLKADYYKIKIDSVKNDPKAMWNSLKYLTSVKPVIK FT SNIMDDIKFDEENVIRSFQRDDTKKSIPDKYNEFLIQSIIDIETSIQSDMS FT YNAMENLSLNDNIPCFCEFTPLTMQDLNDIVNQLQNKNSMCEGLNVIIFKE FT LYSSFSSILLKIINLSLKNGKVPSKWKKSTIIPIPKLMSPKLPQDLRPINM FT LPTYEKILEIAVHKQLSSYFESNNIFYNNQFGFRKNRSTESAIQLLLSKWR FT TSLNNNKFVVTVMLDLKRAFETVNRKILIQKLQGYKIKGTVLNWIADYLSM FT RTQVVKIKGEESGELVCDIGVPQGSVLGPLLFIIYMNDIASKIEYSFVNLF FT ADDTLVCIEGTNLEETFFKLNQDLKILCKWLSNNKLKLNTLKSMAMLIVNS FT KKRKNSLINHNNNIKVILDDVVIEFSDSVKYLGIIIDYQLNFTEHIKYISN FT KIAKKTGYLGRISKYLTYWTKKIIFQTIIAPHYEYCASIFLDVNTNNLNIL FT EKLQNKAMRIILKCHWLTHKTEMLNLLGWISVKNRIIFKSLCFIHNVTSTV FT NDIFKPFYSKNSELHQHYTRQASYFHKSSQCNRAGQKLLFVFGLKLYQEIA FT GGFVQLDYKKFKKKLLQTLKNNQLIQ" XX SQ Sequence 4365 BP; 1869 A; 546 C; 597 G; 1353 T; 0 other; gttgaacatc attgtatgaa catcaaatta tatctatagc tgcatgaata attcattaag 60 atcgacttat tatcttgtaa ctgtaattat acaaatcttt cattttgtac gaataaaagg 120 tgaaaatgac aaaaacaaaa aaacacttaa gcccacataa tgctgataaa ttagatgaag 180 atagtgatca ttcaatatca aacttaaaaa cctctgtctt taattgcatt gaatgtgaaa 240 ttagtattca tggaaaaagt gttgaatgtg gagtatgtaa agaatggtgt catcttaaat 300 gtagtaaact cactaatgat gactttaatg caataacgaa actaaaaaaa tccagagtta 360 tatggttttg ttttaaatgt gaaaaaaaac ttgttaaaat gaataaaaac gataaaaaag 420 tgaaatatac atttataaaa gaattatata ctgataacaa aatattaaat aaagagatca 480 cagaactaaa aattgaaaat gagttaataa agaaagagtt gcaagataaa ataaaaataa 540 aacaaatccc aataatatca agcaatgtaa atagtatatc tgaaagaaca tttagtgaaa 600 ttgttgtcaa aaaaattaac aatcatgata aaattaacac aaagaagttg attatggaca 660 atatcaaacc aaaagaatta ggagtagaaa ttcaacatat ccgaaatcaa aataatgaag 720 taatcataag gtgcaatgac gaaaaatcaa aagacaaact ctatcaggaa attaaaggaa 780 aattaggtgt aaatgtctca gttgaaaaag gtaacactga aaaaatgaaa aggattgaat 840 ttaagattac caaacacgat gaggtattca atgagatatt ggcaaacgat ataatagctc 900 taaataattt gaataataag gctaaagtca tagttgtgaa aagaattaac atgacaaaga 960 ataaatatga actggttata gctgatatag atataaacac atatgaatac attataaata 1020 agaaaaatgg tagattagca attggctggg agtctgtgca aattcgaaaa catataaata 1080 ttattcaatg ttaccattgt caaggttaca atcatatagc aaaaaattgt cactttaata 1140 ctccttcatg ttgctactgt tcaggtgatc attacagtaa agattgtcaa aatcacgaaa 1200 gtaattggtg tgttaattgc cttagcaaaa acaaaaaatt caatttaatg ataaatcctc 1260 aacattgtgc attcgataaa aaatgtccag tatataatga cattaaaaat agagtggaat 1320 taaagtatga caactagcaa tcaatagtag taaaagacat taaaattgca tatacaaata 1380 taaatggtct cataaataaa cttgacagag ttacactata tgtacaaaaa tttaaacctc 1440 atatacttgg attaactgaa acccacataa catccttaat ctttgaaaaa gagcttgaaa 1500 tttttggtta ttcttttata aatttagaat catcaagtgc tcataccggt ggttgtatat 1560 tatattataa aaatggcata aaagtaaatt ctgatcctaa atctatcgtc aaggacaagt 1620 gtcattggtt attatctact gaactagtca ttaacaaaat taagtttcat tttagtgttt 1680 tgtataaatc tctgcaagag aacaaaacca tcttcttgga gtttttcgat acttggatta 1740 ctacaattcg ttcaaaaaat attatcattc ttggtgactt taacataaat acactttcaa 1800 atgacaaaca ctctatagca ctaaagcaat ctatcaaaaa taatcaaatg tatattttca 1860 acagtacacc aacaaaaata tctgaaaaat caagtacttg tattgactta gcaatttcaa 1920 attgtactct tatcgaaact aatgaaaaag aaaaactttc tgatcatgaa atgataactt 1980 gtaaatactc tatcaaagaa aaaagacaat tcaaaactaa aaaagcaaag aatattgttg 2040 tcaacagtag taaacagaac ttccataaaa ttgattattc tttattaaat aaaaacattt 2100 cagttgaatt aaaaaatata aaaaaagaaa ttgatattga agtaatggct aatagttata 2160 caacaattat taaaaataac ataaatatgc aagaaaagtt ttataaaacg aataaggaaa 2220 tcaacaacaa caatccatct attctcaata actggttaag gaattctaaa tacaccgata 2280 tagtaaaaga atataatcaa gaatggaaaa catttgtttg gtttaaatac catggtttta 2340 gtgaaagcaa atgtaaaata caatataaca aatttaaaaa aataagaaat aaaaaaacat 2400 cactattaag aaaactaaaa gctgattact ataaaataaa aattgattca gtcaaaaatg 2460 atcctaaagc tatgtggaat tcgttgaaat atctaacatc tgtcaaacca gtaattaaat 2520 caaatataat ggatgatatt aaatttgatg aagaaaatgt tattagaagt tttcaaagag 2580 atgacacaaa aaaatccatt ccagataaat ataatgaatt tttaatccaa agtataatag 2640 atattgaaac atcaattcaa agtgatatga gttataatgc tatggaaaat ttatcactaa 2700 atgataatat tccttgtttt tgtgaattta ctcctttaac tatgcaagac ttaaatgata 2760 tagtaaatca attacaaaac aagaattcaa tgtgtgaggg cttaaatgtt attatattca 2820 aagaattata ctcatcattt tcaagtattc tattaaaaat tattaattta tcgttaaaaa 2880 atgggaaagt tccttctaaa tggaaaaagt caacaatcat tccaattcca aagctaatgt 2940 cgccaaagtt accacaagac ttaagaccaa ttaatatgct tcctacttat gaaaaaatat 3000 tagaaattgc tgttcacaag cagttgtcaa gttattttga aagtaataat attttttata 3060 ataatcaatt tggtttcaga aaaaatagat caactgaatc agctattcaa ctactacttt 3120 ctaaatggag aacatctttg aacaacaata aatttgttgt aacagtaatg ctagatctta 3180 aaagggcatt tgaaacagtt aatagaaaga ttcttataca aaaattacaa ggttataaaa 3240 ttaaaggtac agtactcaat tggattgcag actatctttc gatgagaact caagtagtca 3300 aaataaaagg tgaagaatct ggcgaactag tttgtgatat aggggttcca cagggtagtg 3360 tacttggacc tcttttgttt atcatttaca tgaacgatat tgcatcaaaa attgagtatt 3420 cttttgtaaa tttgtttgct gatgatacat tggtttgtat tgagggcaca aacttggaag 3480 aaactttttt taaattaaat caagacttaa aaatactttg taaatggtta agtaataaca 3540 aacttaaatt aaatactctc aagtccatgg caatgcttat tgttaattca aaaaaaagaa 3600 aaaatagttt aataaatcat aacaataaca taaaagtgat attggatgat gtggtcattg 3660 aatttagtga tagcgtgaaa tatctaggta ttattattga ttaccaactt aattttactg 3720 aacatatcaa atatatctct aataaaattg caaagaaaac tggatatctt ggtagaataa 3780 gcaaatactt aacttattgg accaaaaaga taatctttca aactataatt gctccccact 3840 atgaatactg tgcttctatt tttcttgatg tcaacacaaa taacttaaat atcctagaaa 3900 aacttcaaaa caaagcaatg agaattatat taaaatgtca ctggttaact cataaaacag 3960 aaatgttgaa tttgcttggt tggatcagtg taaaaaacag aataattttt aaatcactct 4020 gtttcattca taatgttaca tcgactgtaa atgatatatt taaaccgttc tattcaaaaa 4080 atagcgagtt acatcaacat tacactagac aagctagtta ttttcataaa agcagccaat 4140 gcaatagagc aggacagaag ttattgtttg tgtttggttt gaaactatac caggaaatag 4200 ctggtggatt tgttcagctg gactacaaaa aattcaaaaa aaaactgctt caaacactaa 4260 agaataacca acttatacag tgaacattta ttttgtaaag gggattattt attttgtata 4320 agttactaaa ctagcattaa gttgctaaat aaataaataa ataaa 4365 // ID BEL-14_CQ-I repbase; DNA; INV; 6744 BP. XX AC AAWU01030126; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-14_CQ_; KW BEL-14_CQ-LTR; BEL-14_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6744 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 181-181 (2011). XX DR Genome; AAWU01030126; Positions 18807 25550. XX CC Positions [5687-6265] - Integrase core CC 'GTTGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1369..5316 FT /product="BEL-14_CQ-I_1p" FT /translation="MPPKIVPSTSKATLKQLQTKLKSLQSSFNIIEKFKNE FT FKEDTTANEITVRLERFDELWERIVECSSEVECHDGYVAVEGDPFTVERLK FT FEQTYFEVKSFLLDKIKDMQDGSTLDQTTRTADSSFHAVQDHVRLPQIKLR FT TFDGNIDEWFSFRDLYSSLIHWKQELPEVEKFHYLKGCLEGEARSLIEPIK FT ITTRNYQVAWDLLLKRYNNSKLLKRRQVQALFKLPVLTKESVTDLHRTLDS FT FQRTVQSLDQIVQPADYRNLLLVEILSSRLDPYTRRGWEEYTSTQENGTLV FT NLTDFIQRRISVLESLPPKLSTGTSESKQESSQGLRKKPFSVHVSNNAVQS FT TTQKCQACSESHYLHTCPTFTKMPISNRESLIRSHSLCRNCLRSGHLAKKC FT QSKYSCWKCKGRHHTMLCFHAEGAATQNQTGKRVSSDGSGPSTAKPTETSS FT NSATTDSVSSNAATGQSSSILLATAVVLVEDNEGNTVTARALLDSGSECNF FT IAERLYQRLNVSMQKVDVSVVGIGQAAMKSKRRIQAVVKSRTSPYNQEMSF FT LVLPKVTVNLPTVSINTTNWRIPDNVELADPTFFHSRVVDLVLGIQHFFNF FT FQTGKAIPLGNELPRLTDSVFGWVVSGEVSSPCSLTQITCNMAISDSMEEL FT MSRFWSCEEVGTGNIYSPEETRCEDQYARSVSRAANGRYTVGLPKNEQTWA FT QLGESRTMATRRLLGLERRLAQDAALRKQYQDFMMEYLELGHMKKVEAGPA FT NDSPRCFLPHHPVVREESTTTKVRVVFDASAKTSTGLSLNDVLLVGPTIQP FT DLRDIVLRSRMRQVMVVSDVEKMFRQIDMAPEDSPYQSILYRFNPEEEIAT FT YELTTVTYGTKPAPFLATRTLKQLAQDEQQRFPLAAQALETDVYMDDVISG FT ADDVATATELRAQLDSLLGSGGFKLRKWASNEPSVLRGVARENLALPDEID FT WDQEASVNTLGLKWLPKTDCFRFKFKNPAVTPGQPLTKQLILSIIATLFDP FT LGLLGASLVLLKMFMQKLWTLQRVDGTRYDWKDPLPPTVGEEWRKLSQHLG FT VLNNIRIPRCTVIKGAVDIQIHCYTDASEKAYGACVYIRSKNVERKLARRL FT LTSKSKVAPLKQQPLPRLELNGSDLGADLVEWVIRVLGKRFPVFYWTDSTC FT VLLWLKKPPSYWQTYVGNRVARIQRITEELGAVWNHIPGVSNPADLISRGV FT PPEELEGNEMWEEGPDWDDEEPSSWPAQPDLSGEEAPERRRVVVACVATVE FT FIDILLSRFSSYISLIRAVALARRFSEANVWIRHTEGAEGSGRSHRPISSK FT NHIC" FT CDS 5465..6661 FT /product="BEL-14_CQ-I_2p" FT /translation="MVLPAKHRFTELLFKYYHVLLLHGGPTLVLTSVQYWP FT LGGKDLVRLVIHKCKKCYLAKPTSIQQQMGNLPKARVTVSRPFSQTGVDYF FT GPVYVKAGRGRQPTKAYVALFVCMATKAVHMELVTDLSTECFLQALRRFIS FT RRNRPTDIYSDNGTNFVGAKNVLEQLLKQLKDVSHHEKIARVCELEGIRWH FT FNPPSAPHFGGLWEAAVRSAKHHLLRVLGENSASFEDYNTLLTQVEACMNS FT RPLTALSNDPTDLEPLTPGHFLTGASLQALPEPDYSHIPGNRLNRWQMVQQ FT QTQNFWKRWRSEYLTQLQSRTKNWQHPKKVEVGKLVVIVDNNQPPMRWKMG FT RIHELHPGEDGVVRVVTVKTATNLLQRPVAKLCILPSQEEDEEESSAQPEA FT TAVEE" XX SQ Sequence 6744 BP; 1739 A; 1672 C; 1760 G; 1573 T; 0 other; ttagtggtcc ttcgaaccgg atcaacgttc cacggtacca ggaggatcta tctggtgtcg 60 gacaaaggga ctaggatcgc gttggtcgct ggaggacaaa ggaaatcatc gcttcgccgc 120 catcgcggat tcaaacaata ccgtctgtac gtcatcatcg ggctattgtc cacggacgcc 180 atcgcgggca tcattggcgc caacagcacg cggtcacaca cgcacccgcg aggaaggact 240 tcgccattgg ttcatcgctg gacatcaccg aacaaccgga gttgttcact tccgttcagg 300 taaagtagaa ctgcacaggt gtatgtctat gccgaagcct tctacagcca taaactaaca 360 cacgcacttc gactaccact ctctacgatc cggacggtcg actaccgtcg caaacttcca 420 tctactgcca tcgggttgga gcacccggcg gaatcgggtg aactcgccgt ttcgagctga 480 tttcacgagc ggtcgcggaa cggaagacaa cgcatgaggg acaggacgag ttcccggagc 540 gttcactgcc tgtaacgcat gagggacagg acgggttccc ggagcgttca ctgcctgcac 600 cacgaggctg gccattgact acggccattg aggatcaccg cacgctgatc tacgacttcg 660 acttgacatc agtagatgag gcagtttgtc caggtaaaaa cttaaaaata gatatgtctt 720 gtcgaagtta agactttttg tatgctgcac ctaaaaataa acttcacttc tcgacgaatc 780 tgcgttggta ctacttcttg gcgagcaatc cctgtactca atcaattgag ggtcgtttca 840 tctggctggt tgcttcggat ttggtgacgt gtctggttgt ccaggtaaag cacacgtata 900 tgtctaccga agccaaccat ttcgatacta catccgcact ttctctcgtt acgttcttct 960 gctgaaatac actcgctttg gagactgact gacgttcttg gcaaggttca aactcgattt 1020 ctgcaacgtg gactctgggg gtggcacacc tgttagaccc acaggtaaag ctagaagtat 1080 atgtctaccg aagcttcata ctgtggtact acataagttt ccctctgtcg atgtttgccg 1140 cactacttgg ttgattggct ggagagatca ttacgcccac gcacgcacgt acacacattt 1200 gtggatcatt tcgttgttgg agaggactgt actggatatt caactcgagc cggccagcac 1260 tgctgttagt tgtctaggta aagctacagt atatgtccgg tcgaagttgg acgtttttgg 1320 aatactacac cgtacattct cttctcgtta ctatcatcaa caacaacaat gccgccgaag 1380 atcgtgccat cgacctccaa ggctaccctt aaacaacttc aaaccaaact gaagagtctg 1440 cagagctcgt tcaacatcat cgaaaagttc aagaatgagt ttaaggagga caccactgcc 1500 aacgagatca ctgtccggct tgagcggttc gatgaattgt gggagcgaat tgttgaatgt 1560 tcttctgaag tggaatgcca tgacggttat gttgcggtgg aaggtgatcc attcacagtt 1620 gaaaggttga aatttgagca gacctatttc gaggtcaagt cgtttttgct tgataaaatc 1680 aaggacatgc aagatgggtc tacactagat caaacaactc gaactgctga ttcttcgttt 1740 cacgctgttc aagaccacgt tcgcctaccg caaatcaagc ttcgcacttt cgacgggaac 1800 atcgatgaat ggttcagctt cagggatctc tattcatctc ttattcactg gaagcaggaa 1860 ctacctgagg tggagaaatt ccactatctc aagggatgtt tggagggaga agcgcgatca 1920 ctaatcgagc caatcaagat caccacacgg aactaccaag ttgcttggga cttgttgctg 1980 aagcggtaca acaacagcaa gctgctgaag aggagacagg ttcaagctct tttcaagcta 2040 cctgttctca ccaaggaatc ggttacggac ttgcacagga cgctcgacag ctttcagagg 2100 acggtgcaat ctctggatca gattgtccaa cctgcggact atcggaactt gctgctggtc 2160 gagattctga gctctcgttt ggatccttac accagaaggg gttgggagga gtacacttca 2220 acgcaggaga atggcacgct ggtcaatttg acggatttta tccagcgtcg catttcggtt 2280 ctggaatcac ttccaccgaa attatccact ggaacttcag agagtaaaca agaatcatcc 2340 caaggactca ggaagaagcc gttttcagta cacgtgagca acaacgcggt acagtcgaca 2400 acacaaaagt gtcaggcgtg ttcggagtcg cactatcttc acacgtgtcc gacctttacg 2460 aaaatgccga taagcaacag agagtcactg attcgaagcc actcgctctg tcggaactgc 2520 ttgcgaagcg gacatctcgc caagaaatgt caatccaagt attcctgctg gaaatgtaag 2580 ggacgtcatc atacgatgct gtgcttccat gcggagggag ctgccacgca gaatcagacg 2640 ggaaaacggg tctcgtctga tgggagcggt ccgtcgaccg ctaagcctac tgaaacttct 2700 tcaaactcgg cgactacgga ctcggtgtca tccaacgcgg ctaccggtca gtcgtctagc 2760 atcctcctcg cgactgcggt tgtacttgtc gaggacaacg agggcaacac cgtcaccgca 2820 cgcgcgctgc tggactctgg ttctgaatgc aattttattg cggaacgact ttatcaacgg 2880 ttgaacgtgt ctatgcaaaa ggttgacgtt tcggtcgttg gcattggaca agcggccatg 2940 aagtccaagc ggagaataca agctgtcgta aagtctcgga catctcccta caatcaggag 3000 atgagttttc tggtattacc gaaggttact gtgaacctac ccacagtttc catcaacact 3060 acgaactgga gaattccgga caatgttgaa ctcgctgatc cgacgttctt ccattcacgg 3120 gttgtggatc tggtactcgg cattcaacat ttcttcaact tcttccaaac gggaaaggcg 3180 attccgttgg gcaacgaatt gccgcgactc accgactcgg tattcggttg ggttgtttct 3240 ggagaagtgt cgtcaccatg tagcttgacg caaatcactt gcaacatggc gatatcggac 3300 agcatggagg agctgatgtc cagattttgg tcctgcgagg aagtaggaac tggtaatatc 3360 tactccccag aggaaacgcg ctgtgaggac cagtatgcac gctcggttag ccgcgcagcg 3420 aacggacggt acactgttgg acttcctaaa aacgaacaaa cctgggctca attgggagag 3480 tccagaacga tggccaccag gcgtcttcta ggcctggaac gaagacttgc tcaggacgca 3540 gcacttcgta agcaatacca agacttcatg atggagtatc tggagcttgg tcacatgaag 3600 aaggtggagg cggggccagc gaacgactcc ccgcggtgct tcctaccgca ccatccagta 3660 gtaagagaag aaagtactac cacgaaggtg agagtagtct tcgacgcgtc ggccaaaacg 3720 tcgaccgggc tctctctcaa cgatgttcta cttgttggac cgaccattca accagatctt 3780 cgagacattg tgctacggag tagaatgcgg caggtgatgg ttgtatcgga tgtggaaaag 3840 atgttccgcc agatagacat ggcgccggag gattcacctt accagagcat tttgtaccgc 3900 ttcaaccctg aggaggaaat cgcgacgtac gaacttacga cggtgacgta cggtacaaaa 3960 ccggcaccgt ttctcgccac tcgtacacta aagcagctag ctcaggacga gcagcaacgg 4020 tttccactag cagctcaagc attggaaacg gacgtctaca tggacgatgt catttctgga 4080 gcagatgacg tcgcaaccgc taccgaattg agggcacaac tggacagttt gctgggttcg 4140 ggaggtttca agctacggaa gtgggcgtcc aacgagccat cggtactacg tggggtagca 4200 cgggagaatc tagcacttcc tgacgagatc gattgggatc aagaggcgtc agtcaacact 4260 ttgggtctta aatggctgcc caaaactgat tgcttcaggt tcaagttcaa aaatccagcg 4320 gtgacgccgg gacaacctct aactaagcag ctgattttgt ccatcatcgc aaccttgttt 4380 gatcccttgg gactcctggg cgcatctctg gttctgctga agatgttcat gcaaaaactg 4440 tggacgttgc agcgagtgga tggcactcgg tacgattgga aggatccatt gcctcctacg 4500 gtgggtgagg aatggcggaa gctaagtcaa catctaggcg tactcaacaa catccggatt 4560 ccgagatgta cggtaatcaa gggagcagtc gacatccaaa ttcactgcta cacggatgcc 4620 tcggagaagg cgtacggggc atgtgtctat attcggtcca agaatgtaga gcggaagcta 4680 gcaaggcggc tactaacatc caaatcaaaa gtcgcacctc tcaaacaaca acctctacct 4740 cgcctggagc tcaatggatc ggatctgggt gcagatttgg tagagtgggt cattcgagtc 4800 ttgggcaaac ggtttccagt tttttactgg acggactcga cttgcgttct tctttggctc 4860 aagaaaccac caagctactg gcagacctac gtcggaaaca gggttgctag aattcaacgg 4920 attacggagg aacttggtgc tgtttggaac cacataccag gtgtcagcaa cccagctgat 4980 ctcatctcac gtggtgttcc accggaagaa ctagaaggta atgagatgtg ggaggaaggt 5040 cctgattggg acgatgagga gccaagcagt tggccagcgc agcctgatct gtcaggagag 5100 gaagcaccag aaaggcgacg tgtcgtggtg gcatgcgttg caacagtcga gttcatcgac 5160 attctgctct caagattctc ctcttacatc tcactaattc gagctgtggc cctggcgcga 5220 cggttcagcg aagcaaacgt atggatacgt cacactgaag gagctgaggg aagcggaagg 5280 agtcatcgtc cgattagttc aaagaaccac atttgctgag gaaatcgaag atctgttagc 5340 tgagaaatca gtctctgctc attcacgttt acgctggttc aacccacaca tcgacgaagc 5400 tggtgtgctt cgagttggag gacgtttgca acactcgaag gaaccgccgg ggaggaaaca 5460 ccctatggtt ctcccggcaa agcacaggtt tacagaactt cttttcaaat actaccacgt 5520 gttactgctg cacggtgggc ccacactcgt gctcacatca gttcagtatt ggccattagg 5580 tggcaaggac cttgtgcgac tggtcattca taaatgcaag aaatgttacc tagcaaaacc 5640 aacttctatt cagcagcaga tgggaaatct gcccaaggcg agggtgacag tgtcacgacc 5700 tttctctcaa accggagtgg actattttgg tccggtgtac gtgaaggctg gtcgagggcg 5760 tcaaccaacg aaggcgtacg tggcactctt cgtgtgcatg gctacaaagg cagtgcacat 5820 ggaacttgtc accgatctgt ctacggaatg ctttctacaa gcacttcgta gattcatttc 5880 tcggcgaaac cgtccaactg atatctacag tgacaacggt acaaatttcg tcggggctaa 5940 aaatgtactg gagcaactgt tgaagcagct gaaggatgtc tcacatcacg agaagatcgc 6000 aagggtttgc gaactcgaag gcatccgttg gcactttaac ccacccagtg caccacattt 6060 tggaggcctg tgggaggcag cggtgcggtc ggccaagcat cacctcctgc gagttctcgg 6120 cgagaattca gcatcgtttg aggactacaa cacactactg acgcaggttg aagcctgcat 6180 gaattcaagg ccgcttactg cactctccaa cgatcctaca gacctcgaac cacttacacc 6240 tggtcatttt ctgaccgggg catcgctgca agccttgccg gaacctgatt acagtcacat 6300 ccctgggaat cgactgaaca ggtggcagat ggtacagcag caaacgcaga acttctggaa 6360 acgctggcga tcggagtacc ttacgcaact acagagcaga acgaagaact ggcagcatcc 6420 taagaaagtg gaggtgggca agctggtcgt catcgttgac aacaaccaac ccccaatgcg 6480 gtggaaaatg ggtcgtatcc atgagctgca tcctggtgaa gatggtgttg tgcgcgtcgt 6540 gacggtcaaa acagcaacaa atctccttca acgtccagtt gccaaacttt gcatcctgcc 6600 atcccaagaa gaagacgaag aagaatcttc agctcagcca gaagcaacag cggttgagga 6660 gtagtgtcca atcagtcgag cgtcgagagg gtttctgttt ctttattttc agaagtgttc 6720 gagcaacttc agggtgggtg cgga 6744 // ID hAT-35_SM repbase; DNA; INV; 2627 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-35_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2627 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1838-1838 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 572..1837 FT /product="hAT-35_SM_1p" FT /translation="MQIVTEKEELVTSKLRETMTKKRKYDESYLSLGFVDA FT NNMPQCVLCAKIFPNSSMVPAKMRRHLETVHAEYKDKKTEYFMRKRDDFLR FT GQNLIVSTFKTENEKATEASYLVSYRIASAGEAYTIGERLIKPCAIDIAKC FT LLDEKSAKAVSAVQLSNDTVTRRIIDLAANVKYELISRLKYCNFALQMDES FT TDVAGLAVLLVFVRYQYLNSIEEDLLLCETLQSYTTGEEIFNIVHNFVENN FT GLDWKNCIDICTDGAKAMVGKTAGAVSRIKNLAPNSTNSHCILHRQALAVK FT KIPILLKNVLDEAIKIINFVKSRPLKTRLFKILCDDMGSIHKNLLLHTEVR FT WLSRGKVLLRLFELRCELASFFNENEMNLQQRLTDKLWLFRLSYLADIFSK FT TNEVNLSLQGKQVTLFTANDKIEAFKKN" FT CDS 1894..2406 FT /product="hAT-35_SM_2p" FT /translation="MDFYNEIDGEIGENDMSGLCGEICEHLVGILNCVNQY FT FPDEHNKKLKINEWVRKPFEVNQKPIEFTVEEYEAFIDMISDSSLQSKFEK FT LPLSDFWCSIKEEYPNLSKKAVNILIPFATTYMCESGFSSYASTKTKYRNK FT LNAEADMRIQLSSIQPDIKKICQNNKQVHSSH" XX SQ Sequence 2627 BP; 944 A; 410 C; 444 G; 829 T; 0 other; cagcggttct caaagtgtgc tccgcggagc tcctccaagg gttccgcgaa aaaattcgaa 60 aattatttgt acctttcctc tttgcaataa taattaattt ttaaaattct aatctggaaa 120 tcaaaaatca ctcaattgaa aattatgtct attttataaa aaacactcat attattctct 180 ctgagaaatt tgcatttttt tgagtataat ttgaatggtt ttatacgtaa aacaaaaaca 240 aaaaattaag aaggaaagca aaaggaaaga gaacgaaaat taaacaaaat tttatatgga 300 cggtacttga aatattttat gcttgttcag tatgttgctg ctggtgtaca tgcaacatat 360 agaaagtaaa atgtactgaa atatgtacct atctatttat aatttgtatt ttaatcattt 420 ttaaatagaa ataaatcatt aaccatggat ccttggctaa ttaaagagac agcaaaaaaa 480 tgaacaatca tctggattac gtgcgtccgt aaatattgag aaaacagcaa cagatatttt 540 actgaaaact gattcctctg ctactcaaac catgcaaata gttactgaaa aagaagaact 600 tgtcaccagt aaactacgcg aaaccatgac taagaaacga aaatatgacg aaagttattt 660 atctcttgga tttgtagatg ctaataatat gcctcaatgt gttttatgtg caaaaatatt 720 tccaaatagt tctatggtgc ctgcaaagat gcgccgtcat ttggaaactg tacatgcaga 780 atataaagat aaaaaaactg aatatttcat gcgtaaacga gatgatttct tacgaggtca 840 aaacttaatt gtttcaactt tcaaaactga gaatgaaaaa gctactgaag catcttatct 900 tgttagttat cgtattgctt cagcaggcga agcctataca ataggtgaga gattaataaa 960 accttgtgca attgatattg ctaaatgcct ccttgatgaa aaatcagcaa aagcagtttc 1020 tgcagtacag ctatccaatg acacagtaac acgtcgtatt atagatttag cagccaacgt 1080 caaatatgaa ttaatatctc gcttaaagta ttgtaatttc gctctgcaaa tggatgagtc 1140 tacagatgtg gctggacttg cagtattact tgtatttgta agatatcagt atcttaattc 1200 cattgaagaa gatcttttat tatgtgaaac attgcaaagc tatactacag gcgaagaaat 1260 tttcaacatc gtgcataatt ttgttgagaa caatggatta gattggaaaa actgcattga 1320 tatatgtact gatggtgcta aggcaatggt aggtaagact gctggggctg tctcaagaat 1380 aaaaaattta gcaccaaaca gcaccaatag ccactgcatt cttcaccgtc aagctcttgc 1440 agtaaaaaaa ataccaattt tattgaaaaa tgtccttgat gaggcaataa aaataatcaa 1500 ttttgttaaa tcgcgacctt tgaaaacacg acttttcaaa attctttgcg atgatatggg 1560 cagtattcat aaaaatctac tcttacatac cgaggtgcga tggctatccc gtggcaaggt 1620 acttttaaga ctgttcgagt tgcgttgtga attggcatct ttttttaatg aaaacgagat 1680 gaatttacaa caaaggttga cagataaact atggttgttt cgactttcgt acttggctga 1740 cattttttca aaaacaaacg aagtgaatct ctcacttcaa ggaaaacaag tgactttatt 1800 tacagccaac gataaaattg aagcttttaa aaaaaattag acttttggat tacttgttct 1860 cgaactcgtg atttgaattg cttcccaaca ctaatggatt tttataatga aattgatggc 1920 gagattggag aaaatgatat gtcgggattg tgtggtgaaa tttgtgaaca tttggtgggt 1980 atattaaatt gtgtcaacca atatttccca gatgagcata ataaaaaatt aaaaataaat 2040 gaatgggtca gaaaaccttt tgaagtcaat caaaaaccga tagaattcac agttgaggaa 2100 tatgaagctt ttattgatat gatttcggat tcctcactgc agtcaaaatt tgaaaaacta 2160 cctttgtcgg atttctggtg ctctattaaa gaagaatatc caaacttatc aaagaaagca 2220 gttaatatac ttattccctt cgctactacc tatatgtgtg agtctggatt ttcatcttac 2280 gcgtcgacga aaaccaaata ccgcaataaa ctaaacgcag aagctgacat gcggattcaa 2340 ctatcatcca tacaaccaga cattaaaaaa atctgccaaa ataacaaaca agtgcattca 2400 tcgcattaaa aaaattattt ttagttttgt ttttgtagtc tttgtttcta atgtgcaaaa 2460 tatgtttaat ttttgttttt tttgttaaaa ataaagaaat gtacataaat acaatttaat 2520 gctgttagac atatcatatt ttttaaaaaa tagtaaacaa ctagggttcc gtagaaatct 2580 atatgtcttt caagggctcc gcaactgaaa aagtttgaga accgctg 2627 // ID Gypsy-164_AA-I repbase; DNA; INV; 5978 BP. XX AC AAGE02017464; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-164_AA_; KW Gypsy-164_AA-LTR; Gypsy-164_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5978 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017464; Positions 6427 450. XX CC Positions [5036-5383] - Integrase core CC 'AGAG' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS join(3099..3704,3708..5069) FT /product="Gypsy-164_AA-I_1p" FT /translation="MLNDEIIEEAKSEWNSPLLLVPKKSENGKNKWRLVID FT YRKLNANLQDDKFPLPNIEEVIDSLSGAQHFTHLDLSQGYYQCELKPEDRH FT VTAFSTPTGQYQMTRLPMGLKISPSSFSRLMTVAMSGLNLERCLIYLDDII FT VFGRTFDEHNKNLISVFQRLREVNLKLNPAKCNFFKQELIYLGHFISKEGV FT LPVPQKIETIKSKSPSSSDEVKRFVAFANYYRKHIRNFASLCSPLNYLTRK FT GSEFNLNSECETAFQQLKASFINPPLLDYPDFSEENTFKLHTDASGYGIGA FT VLSNKNEKPIAYASKPLNNAEKNYSTIEKELLAIVWAIRHFRAYLYGRKFE FT LYTDHRPLVYLFTLTDPSSRLTKFRLALEEYNFDVFYKKGCENAVADALSR FT ISTTELKDMHTKLTQEAFITTRMQTRREKNKTNHEKNVTGHDNPTMKIIQI FT IVKDHSSEENAKWIPENETILIKPTETLVSLRRTMKELGKICTQQNVDELY FT IKINNQSAHKFYEQITSNDLQKGIPNIIRIASNIKVIEDEVTKKLILNDYH FT VLLTAGHAGVKRTMNTIRQRYYWNGMKQDVHKFVKSCEKCQKYKSINVPKP FT EMTITTTANTAFEKVYLDLVGPLIPSEGYEYILTTQCELTKFVTATSTRQQ FT KQLRRHL" XX SQ Sequence 5978 BP; 2162 A; 1114 C; 1152 G; 1550 T; 0 other; tggcgatcct gccagttctg aaatcttctt aagaaaaaag aaagtggtaa gtgcaatcgt 60 gataattaaa aagcaatagc tgatcaatcc aattttcccg attgaaaggg tccaaacagt 120 gtgtaaagac atcgatttca ctccgttttt gtttcaaacc aaagtgcaag tgacaatggg 180 ctggttttcg gcggacgaaa tcgtcgccgc cccagccgcc gcaagtgcat cagaagggca 240 ctacaccgcc caatcggttt ctccgtgtat tttagctaca gtagccgtag gctacgtagt 300 gctacgaggg ttagcgaaaa tacatcgcca actcacagaa cgggtcgcag aaagaacaac 360 ccgccgcgtc gcggcgcagg tgtaaagaaa ccattaatta tgaacaattt attagtgacc 420 gtcgtttaat tatgaacaat ttgttagaaa ccgtcgcgaa gaagaacagt tagttataaa 480 ctttgggaaa tgaataagtg aaggatggat caaaaagtgg atcattatac ctacactctg 540 tgctttaccc agattgtcca aataagaaac aatttaaaag tgagggcagt gaaaagcttc 600 acaacagaag aagctaaagc ccagctaagg attgtgacag ccaccggagt accagcattc 660 ctttgccaga cccttggaac agaaacattc gcagccccgg tgatgcgaaa actgactcag 720 gagattcgga aatggatgca gccagccagc gtctacgagt gaatcaatca gtggatcaga 780 tgcaggacca gctgaagccc agaagtgacg tgatgtcctg atcaatcccg accccatccg 840 acgccatctg gtaacgaatc gaagtgtctc tgtttagtca tcttgtacgt ataaaaaatt 900 ttccattttt tttttatgta aacaacaata ccttactagc aagataagta gctgtaacaa 960 ttcctatttg aaatccaaga acatgatttt gaaatgttgc cagtaaaact tcttctaagt 1020 actagttcct ctaagttaat tgtttttgtg tcctacttga accatacatg gaccaatatt 1080 tcgaaaagga aacagtggtt aggaatttat taaagtctgt tgccggagct aacgcaaaaa 1140 ggcctcagta caataaaaag aaggatacgg tactaaggca cgtaaattga taacctgttc 1200 gcggagttca ggcaacaggt aaacgatttt ttcatcaaaa aagagaagga ggacgaagaa 1260 gaaggaggtg aaaaagaaga aagcgaagaa gataaagaaa aaatgccgcc aaaactggac 1320 ctcagcattg ggctgaaact tgtggacaag ttcagcggtg acgctgagaa aatagcagga 1380 ttttttgagt cagtagatct gttgaaagat tactccgact gaagcggata tactgaagtt 1440 tatcaaaaca cgtctggttg gtcccgcaca tggggtgatc aacaccgcag tgacattggc 1500 aaatgcgaaa gctttgctaa aaaacaaatt cgctatgaaa ttcagcccac aagccatcga 1560 atctgagatg gctagcgtta aacagaagaa atcttctgtt agtgaatatg gacaacagat 1620 taacgagcta gctgcaaagc tcgcagcagc tcatgtatca cgtggaacct ttgatgacga 1680 ggcagcagct gacgcagtgg ttcagcctgt caccattaag gcttttgtaa acgggttgaa 1740 ggacccaaaa gcgcaatttt tcctaaaagc gcggaaccca caaacactta ccaaggcaat 1800 cagcgatgct ttggaagtaa acgacagcga aaccgaaacg gcaatgtggt ttgccgccgg 1860 tccttcgcga tatcataatc aaccgaatta ttcaaatcga ggataccgaa gtaaccgagg 1920 ttaccgaggc aaccgaaatt tcagaggtcg aggtagatct gactacaata gatccgacca 1980 ccaaagccaa aattatgaga cgtctagttt ccgaggaaat cgaggccata ggggtaacac 2040 gaatcaacgt tacccgtctc aacatcaaca taggggacac gttgcacacg ttgcacacgt 2100 agcagaagcc cctcaagaaa ccagaccaaa caatcaagca cccacacaac gtccggaaca 2160 agaagcaaat ttaattgatt tatttcgttg atttgaatca acagaaatta aaatgttcac 2220 cgagaactat tttctcagta tttaatactg atattgtttt tattatcgat agtggtgcgt 2280 cttgtagcat tatcgatagt agatttttgc caaataatgt taaatttcga accgatgaca 2340 tcatttcaat aaaaggaatc aatggagtaa ccaaatcctt agggtccatt gatactattt 2400 tacgttttgg aaacgataat tacgatatta aatttaatgt agtttcaaac ttgcctagta 2460 atatagcagg tttaaaagga actgattttt tggtccatta tagggcaaat attgacttcg 2520 aggcattaac attatcccta agatacagga atgacaaaca cataataccc ttgacactga 2580 atggtactgt agcaattacg ataccagccc gaacagaaat tacaacatat gtcaaaacgc 2640 aacatgtaaa cacatgtgta gtacttaacc aggaagttac ttcacatgtt tatattgcaa 2700 attctattgg agaaccttca aacggattga taccagttcg gattgttaat ttcaaaaaca 2760 aaccggtcgt tattgatcat attgcacccc acattgagcc agcttcaaat tacaatataa 2820 ttgaattaag caaaaatgac aacaatattg ataaaaaccg agccaataaa ttgctcagag 2880 aacttaaact aggtcacctt tctggaattg aagagaagac aattaaacag atttgtctca 2940 aatatgcata tatcttttgt ttagaaggtg ataaacttgg aacaactaac gtttattgcc 3000 cgacgatatc tgtcaaaccc aatagtcaac cctcctttag taaaccatat aaaattccac 3060 attcacaaaa ggaagaagtc attaagcaag ttgagaaaat gcttaacgac gaaattatcg 3120 aggaagcaaa atctgagtgg aacagcccct tactgcttgt tccgaaaaaa tccgaaaatg 3180 ggaaaaataa atggagatta gtcattgact ataggaaact caatgctaat cttcaagatg 3240 acaaatttcc gctacctaat atagaggaag taattgattc cctatcagga gcacaacatt 3300 tcacgcattt ggacctgtcg caaggttatt atcaatgcga attgaagcct gaagacagac 3360 atgtaacagc attttcaaca ccaactggtc aatatcaaat gactagattg cccatgggat 3420 tgaaaataag tccctctagt ttttcacggc taatgactgt agctatgtca ggcctaaatc 3480 ttgaaagatg tttgatatac ttagatgata ttattgtgtt cgggagaaca tttgacgaac 3540 acaataaaaa tttgatttcc gtttttcaac gactacgaga agtcaacttg aaactcaacc 3600 ctgctaaatg taatttcttc aagcaggagt tgatatattt agggcatttt atttcaaaag 3660 aaggtgtact acctgttccc caaaaaattg aaacaattaa aagttagaag agtccttcat 3720 cctcggatga agtcaagaga ttcgttgctt tcgcgaatta ttataggaag catatacgaa 3780 actttgcgag tttatgttcg cctttgaact atttaacgcg aaagggaagc gaattcaatt 3840 tgaattcgga atgtgaaact gcatttcaac agttgaaagc gagtttcatc aacccccctt 3900 tgttggatta tccggatttt agtgaggaaa atacatttaa attacacact gacgcatctg 3960 gttatggaat tggcgcagtg ttgagcaata aaaatgaaaa acctatagct tatgctagta 4020 aaccgttgaa taacgctgag aaaaactatt caaccatcga aaaggaactt ttggcaatag 4080 tttgggcaat tcgtcatttt cgtgcatatt tgtatgggcg aaaatttgaa ttgtatactg 4140 accatcgccc attagtatac ctttttactc ttacagatcc ttcaagcaga ctgacgaaat 4200 ttcgtctggc gcttgaagaa tacaactttg atgtctttta taagaaaggt tgcgagaacg 4260 ctgtggcgga cgcgctatca cgtatctcga caactgaatt aaaagacatg cacacaaaat 4320 tgacacaaga agcattcata accacacgaa tgcaaactcg aagagagaaa aataaaacta 4380 atcatgaaaa gaacgtcact ggccatgaca atcctaccat gaagattatt caaatcatag 4440 tgaaagatca tagctctgaa gaaaatgcga aatggatccc agaaaatgaa acgatactca 4500 tcaaaccaac ggaaacattg gtttcgttac gacgaacaat gaaggaactt ggtaagattt 4560 gtactcaaca aaatgtcgac gaattatata taaaaataaa taatcaaagt gcgcacaaat 4620 tttatgagca gataacgagc aatgacttac aaaaagggat accaaacata ataagaatag 4680 caagtaatat taaagtcatc gaggacgaag taaccaaaaa actgatcctt aacgactatc 4740 atgtgttact tactgcagga catgcagggg tcaaaagaac catgaataca atcagacaaa 4800 ggtactattg gaatggtatg aagcaagatg tacacaagtt tgtgaagtct tgtgaaaaat 4860 gccaaaagta taagtcaata aatgtaccta aaccggaaat gacaataacg acaacagcaa 4920 ataccgcatt tgaaaaagtt tacctagacc tcgtaggacc tttgattcct tcagaaggct 4980 acgagtatat cttaacaact cagtgcgagt taacaaagtt cgtaaccgcc acatcaacaa 5040 gacaacagaa acagttgcgg aggcatttgt aaaaaacgtt gtaataaaat atggagtacc 5100 agacagaatc gcatcagaca gaggaacaga gtttatgtct gaactgttca cttcagtagc 5160 gaaactgtta aacatcgaaa aactaaacag caccgcttac catcatcaag caataggatc 5220 tctagaaaac actcataaat gtttaggaaa ttttttgagg acccaatgtg acaacaaact 5280 gttttcgtgg tcaacttggg taccatacta tgaatttgcg tataacaaca caacgcactc 5340 aacaacgaac tatactccat tttacttagt ttatggcaaa ttatcgaaaa tgccatccaa 5400 cataatagat gctccacctg agcctattta taatgtagat gactattgca agcaattaaa 5460 gtgcagattg caaatatgtc atagcgaagt taggaaccgt ttgatagagg agaaaacgaa 5520 gcgaacggca gagtttaata aaactgcaga attaaagatt taccatccag gagatttagt 5580 ttggcttaaa aatgaaacag ccaagaaact tgaagcaaag tatattagtc catacaaagt 5640 aattgaagat ttgaatccga atcttaaaat tttaattaac aaaaaggaag atttagttca 5700 taagagtagg gtaaaatcat atgaagaaaa agtagaagat taggttagac gtgtggtgta 5760 taatcaagtt aagttaaatt tcaataacac catataattc tcatatattt tacaaagtat 5820 tagaattcaa aaacattctt tattaatatt accttattac aaatacactt atatgaagta 5880 acacagaata ttcaaatatc aaaaaagtaa aaaaacagtt acataaatga tttaaaaatt 5940 aaacttaagc taatttttaa attttattag gtaaggcg 5978 // ID BEL-635_AA-LTR repbase; DNA; INV; 618 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-635_AA_; KW Pao_Bel_Ele49; BEL-635_AA-I; BEL-635_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-618 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 618 BP; 218 A; 104 C; 107 G; 189 T; 0 other; tgtgacggca aacccctcgt tgcagcgacg ccgtagatga aagaagagga gttatgtcaa 60 cggtcgatga cgtggactgt cagatcgata agtgattttg catatcatgc taaataaatt 120 ggatagaaca atcaagttat tttcagtcag tgtgaactaa aatttcttaa gcaaatctta 180 aattatcaag ttaatctagt gttgaattca tgcagttagt gttcttaaac ctagtgagta 240 aaacctttat gcagaattgt gaaataatac ttaactaaat tcctatttct attgatctcg 300 atattgttta cttagaacct aaattcaact taatactaga tcacagttta atctttggct 360 agagtttggc tacacgcagt aagtaataga aaatgatcct aaaaagaaat attacaagta 420 ataaacattt gtaggaagca agtatcatca gcagtcgacg aaacgaatcg aacgtttcgg 480 aggttatcac caaaatatgt aagtgacaat agatttacat aaagagattc taccttacaa 540 taaaatttat ttgtagcttt aagcagtaca ctacaaccct gtgtttgctc taaagaattt 600 ggcattccca cccccaca 618 // ID Gypsy-43_CQ-LTR repbase; DNA; INV; 1997 BP. XX AC AAWU01034657; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-43_CQ_; KW Gypsy-43_CQ-I; Gypsy-43_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1997 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 466-466 (2011). XX DR Genome; AAWU01034657; Positions 9390 11386. XX SQ Sequence 1997 BP; 517 A; 528 C; 508 G; 444 T; 0 other; tgtaacgagt acatagggag taaaagtaca gtcggtatga ttgggaagta gaaataccta 60 aactgtatta aacgtacacg ccgcagagta gagctttctt gctgagtgac ctttgctcat 120 tcctttggtc actcaaatgc cgagtacctt ttgcgccagt tatcccaaaa gctttccgtt 180 caaccagatg gccgagtggt ttaaactctg ctctttaggt gagccagcac aggagatcgg 240 ggagagccag ccgcacgaat ctcgcgaaga gtttcgcgtt ttcgcggttc gggggtgatg 300 agatcgtgga ggtgataaag cggtctaggg gaatgacaag ctattgtcca gggttggtaa 360 ccgtgagagg gaacggaatt cggaactgaa gcctctcata agtctcggcg ttctgatgac 420 gcgcccaacg gttcactttt atcagttccg cggaagaggc accggtggcc ctcgattgtg 480 cgcgtctcgt gattaaattg taggttctag ctttccatga gtttgtcttt ttaggttatc 540 tctagtgtaa tgtgtttctc tgtcctttcc ctgttagcta gtttagcgag attagtagat 600 tgtgctagtg ccgtgtcgtg tgcgtgtgtg taagcctgcg tgcatgtgcg ttccagaatt 660 cctggtaagt ccggaacccg accacctcgt ggcttaacgt gcgctaaccc gtgattccgt 720 gtcccgtgcc agttctgcga caccgccacg ctcgccaaag acggccacga tctaccgggg 780 accagttacc gcttcgatcc ggccgcaacg cccacgccgc cgaaggccag cgctcgccaa 840 actgcgctag gcagcggaac tgtacggcca cgccgtaaca ggacagcagc aaccgtcacc 900 gcagcgtcct tgaataaccg ccatcgcgat cagcatccgc catcgcgatc gccaaccgcc 960 atcaacctaa cctggaaggt taggaaagca cgtgagtgag tgaagaggga gagaagtccg 1020 gcctcgaagg cggacacgcg aaccggcaac caacaaccag ctaaccagcg ttccagccga 1080 gcagcagcag cagatcttca aaagtaggtc gtcgaggttc cctcgcaggt gcgaccaacc 1140 acgtgctcga gtcccgtccg gttctaaccg gagagcagca acagcagcag cagcaagtga 1200 tgacgtcgta gacgccgtcc gcagcgaaac accaaccacc aaaccaccga gtttcttcga 1260 aaccgtcgat cagcagcagc agaacaggta tcgagacgac cacgtgatcg tcgaccgcga 1320 acaccaagca aaccgtgagt aaagcatgct ttaagtgata agtccatggc acacttttcg 1380 cccgctacca accttaccaa ctagtagcgc ggcgccgcgt ctgtcgagac tgttccgact 1440 cgcctaaaat gaacacataa aacgcacata gcacacacac ctaaaccgga agaagaagag 1500 agaagaagga agacaaacac acggtagaat ctgaactgaa gtaacaggat ctcagggaaa 1560 ggaagagggg aggatcgccg gcaaacacaa acgtacgtac gaaaacatga atgactcact 1620 gtaaataaac gaacacaaaa cactacaaat gtttttcctg ctgcacttaa cctaaacaca 1680 cagtagctat aaaacactta tgctagggtg accgaaagag tcgactcatg ttttggtcgc 1740 gaccgtttaa ataaatgtag attttctttc aatattctat tttactttat ttttctactg 1800 gagacagggc tccgaatccc ttggaccatt ggctattttt cttgcagatc tggttcaggt 1860 agactgttga aggccgcagg tgatgaatca cgccctgagg tctccagtcc gtttccggtt 1920 tctttttcag ctacccttcc tgggtagtgg cttggtacgc tagtcggagt acctccgacg 1980 cgttttacac ccccaca 1997 // ID Mariner-9_HM repbase; DNA; INV; 3542 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE Mariner-type family: consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3542 RA Jurka J.; RT "Families of Mariner elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 226-226 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(558..2303,2297..2629) FT /product="Mariner-9_HM_1p" FT /translation="MVRNYVKSKQKLSKTYTDYELKKAVELVNDGEPCKRV FT ARLLNIPHRTLRCHVSGLRKTSVVGRRSALLPEEEVTIAQYIATFSDFGYA FT FEIVDLKLFIQSFLQKSDRDCPYFKENLPGNDWVRSFLKRHKKLLSYRACQ FT NICRKRAAVSCDSVNRFFSNLEETIKNILPQNIINYDETNLSDDPKSKQMI FT FRKGTKHAERVMNTSKSSVSIMFACSADGVFLPPYTVYKAERLMDTWILGG FT PMGARYNRTKSGWFDSHCFIDWMQTLVIPYFRHVDNDAPKILIGDNLACHL FT SIEVIEICEANNIMMVFLPPNSTHLLQPLDLAVYGPMKSTWRKVLTAWKIG FT EGRFHTSLPKNVFPRLLLNLISNMDHIQEFAVNGFKTSGIYPLNRKKIIDK FT ILRADISTSSQNLVSPLVLERLKELREASAKKPGAVLRGKKVKVSPGKSVS FT LDDVAVSSVLKSQSKRKLIKIKNILLNDSLDKITSSVDLDSFKDIIDYDMP FT SSSKMYIELQEDSAKKSEGIQKVKSKTLKKKDLFAKKSNLYNGVSGLYGSI FT FHQKQTHIQKKLEDCLNEDCLKKSFESDPNSNKEIRNIVHDSGMVVTSKYP FT LVEGSYVIVRFEGKKIPYHFVGQVVNSEEDNNWNIKYFRRDFDASNPRIIS FT FKEPQVPDYLLTESKSIVKQLPCPQYEKNNISFLKEIIGDLYMR" XX SQ Sequence 3542 BP; 1263 A; 503 C; 574 G; 1202 T; 0 other; taccgtagcg tggggtaatt gagcactttc tggggtaatt gagcacccgt aaacatttta 60 attttaaaag tcatataaaa tagataatat caatgcttag atcattttca agatataaat 120 atttaaatta acttttttaa gttgtatgtt gctaattaca actgttttgg tcaaaaataa 180 atagctgata acattttaaa tatatcacga tgcagaatca tagaattccg taaaaactaa 240 taactggaaa tactaagggt aagcagttat agttaaaacc ttcttagagt tttgaatttt 300 ttgtagtata ctatacatat gttgaggtaa attcgtataa attttaaagt ctcataaatg 360 aattgttaat aaaatattgt tgtttaaagt aatgtggggt aattgagcac ttttatttgg 420 ggtaaatgag caccttttca aatagctgtt ttttaccctt taaatactat tgttaatgaa 480 ataaatatct aaaggtgcaa ataatttaaa tttttagctt atatgtttta ctttttacat 540 attagatatc tgataaaatg gtgagaaact acgtaaaatc aaaacaaaaa ctttccaaaa 600 cctacacaga ttatgaactt aaaaaagcag ttgaacttgt taatgatgga gaaccttgta 660 aacgtgtggc aaggctattg aacataccac acagaactct aagatgtcat gttagtggtt 720 taagaaaaac tagtgtggtt ggtcgaagga gtgcattgtt accagaagag gaagtaacaa 780 ttgctcaata cattgcaact tttagtgatt ttggctatgc attcgaaata gtagacctaa 840 agctgtttat acaaagtttt cttcaaaaat ctgataggga ttgcccatat tttaaagaaa 900 atttgcctgg taatgattgg gtgagatcat ttttaaaacg acataaaaaa ttgctatctt 960 atcgagcatg tcaaaatatt tgtagaaaac gtgctgcagt ttcatgtgat tctgtaaata 1020 gattttttag taatttagaa gaaactataa aaaatatttt gcctcaaaat ataataaact 1080 atgatgaaac aaatctttct gatgacccaa aatcaaaaca aatgatattc cgaaagggta 1140 ctaaacatgc tgaacgagtt atgaatacat caaagtcatc agtgtctatt atgtttgcat 1200 gttctgctga tggtgtattt cttcctccat atacagtgta taaagctgag cgtctaatgg 1260 acacatggat acttggaggt ccaatgggtg cacgatataa ccgaacaaag tctggttggt 1320 ttgatagtca ctgttttata gattggatgc aaacattagt aataccttac ttcagacatg 1380 ttgacaatga cgcaccaaaa attctcattg gtgataattt agcatgccat ttgtctattg 1440 aggttataga aatttgtgag gctaataata ttatgatggt ctttttacct ccaaacagta 1500 cacatttgct acaacctcta gatcttgcag tttatggacc catgaaatca acatggagaa 1560 aagtgcttac tgcatggaaa attggggaag ggaggtttca tacatcccta ccaaaaaatg 1620 ttttcccgag attgctcctt aatctaatat ctaatatgga ccacatacaa gagtttgctg 1680 taaatgggtt taaaacatct ggtatctacc cgctaaacag aaaaaaaatt attgataaga 1740 ttttaagagc agacatttct acatcctctc aaaacttagt atccccgtta gttcttgaac 1800 gtcttaaaga acttagagaa gcatctgcta aaaagcctgg ggcagtgttg cgtgggaaaa 1860 aagtcaaggt ttcaccggga aaaagtgttt cactggatga tgttgcagtt tcttctgttt 1920 taaaatccca atcaaaacgc aagctaatca aaataaaaaa tatccttttg aacgactcct 1980 tagataaaat aacttcttca gtagatttgg atagttttaa agatattatc gattatgata 2040 tgccttccag ctccaagatg tatatagagc tacaagaaga ttctgcaaaa aaatctgagg 2100 gtatacagaa agtcaaaagt aaaacgttaa aaaaaaaaga tttatttgct aaaaaaagta 2160 atttatataa tggtgtttct ggcctttatg gttccatttt ccatcaaaag caaacacata 2220 tccaaaaaaa gcttgaagac tgcttaaatg aagactgctt aaaaaagagc tttgaatcag 2280 atcctaattc taataaagaa atatagtaca tgattctggt atggttgtaa cctctaaata 2340 tccattagta gagggttctt atgttattgt aaggtttgaa ggaaaaaaga taccatatca 2400 ttttgtaggc caagttgtaa attcagagga agataataat tggaacatta aatactttag 2460 acgagatttt gatgcttcaa acccaagaat catatccttc aaagaaccgc aggttcctga 2520 ttacttatta acagaaagca aatctattgt taaacagctt ccttgtcccc aatatgaaaa 2580 aaacaacata tcttttttaa aagaaatcat tggtgactta tatatgcgtt gatatatctg 2640 tatcatatat ttatgttttt ttgttttatt ttgttataaa gtttgaacaa aatatattag 2700 actatttttt catttgttta ttttatgttt gagaaagttg acttaaacca acttcaaact 2760 tatgtaagtt tatactagtg ttcacatcct tcacccatgt atataatatg agtcagggtg 2820 ataggaagtc cggttagttt gtctggatta tttatttttt taatacacaa gaaacctttt 2880 tttttaataa attatgaaaa ttgaaataaa aaaaaattgt tccatttgat tatgttcaac 2940 atacaaataa aaaatatcta agtttatttg aatgctaggt aaagaatatt agattagata 3000 aagaaaattc gtctgctgca ttgtttgtct ttggatttaa ttttttgcag ggggccagga 3060 tgacccaaaa atgggactga gtgtcagata aggatctttt gcgcgcgcac acacacacac 3120 acaaagtatg atgcttttga taaaatactg tcaataattt ttataaaatg aaattaaatc 3180 aatattcaat ataaaatata gatttattat tgcatctatc tccgcgttaa aaagctatat 3240 ccgcgtaaga ctttcccaaa tttgtatata tgtatatata tatatatata tatatatata 3300 tatatgttta tgaaaaagat attaaggtgc tcgtttaccc caaatcgtgg ggtaaatgcg 3360 cacccgcata ttttgtgtca ttaactcttt gctggaattt tttttttaat tcaaccttcc 3420 taaatgatag acctaatata gtactaaaat aacatatctc actttttaaa attttctaca 3480 agtttttatt ttatttaaat aaaagcttca aaaatggtgc tcaattaccc cacgctacgg 3540 ta 3542 // ID Gypsy-73_CQ-LTR repbase; DNA; INV; 136 BP. XX AC AAWU01041308; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-73_CQ_; KW Gypsy-73_CQ-I; Gypsy-73_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-136 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 526-526 (2011). XX DR Genome; AAWU01041308; Positions 19145 19010. XX SQ Sequence 136 BP; 41 A; 26 C; 25 G; 44 T; 0 other; tgctaggccg cggtagttgt aacatcgatg tcagtaatta tttagaaaca taatatagtt 60 gttagagata aactttgatc tttaaattac tttaatatca catcatatgc gccaatcctt 120 accgctcggc ggcgca 136 // ID Gypsy-213_AA-I repbase; DNA; INV; 5648 BP. XX AC AAGE02029464; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-213_AA_; KW Gypsy-213_AA-LTR; Gypsy-213_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5648 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029464; Positions 137856 132209. XX CC Positions [4608-5075] - Integrase core CC 'AAAGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 669..2177 FT /product="Gypsy-213_AA-I_2p" FT /translation="MYFSLFLKDFGTRVRIGNYQSVWPLLFVIRKTKRTSD FT EVSLFFFRSESPDTIYITRPKVTKTELRNKLKSEKARTAALLEEIKQLREE FT NEQLRKSSAEVLAVATNSNDVETNMVPSGTQQTESSVKNEESKLLSSVHQL FT SISSLNVPECKPDVEGEGIQRHSFEMWRDLLIDSMKLLGVNDEPTMFTIFR FT VKAGQQLLEIFKNTKSDSLAPDENEFPFSNALHRLKSYFGSGSDVLLQRRK FT LAVMDQKSNESDLSYISRVGAVARLCDYEEGKEFEQIVSTIAEHATSKEVR FT AIALKMLSRNASFTDLVDKIRELEAIRLNEEFFMKKRMKSEPAAALAPVRA FT DFPGQPRYEHRSPQNYFPARSYNRSSTGSGRYSRSRANSTGGRREFSRYYR FT SPRDRNLVPPRDRSIGASREREGRCWRCNSVYHLPVDCDAADKICRNCGVM FT GHIARACTRRFPTSSSSRKRYSEADITGDGPSAKRMAAIEAPKVENVDKEE FT VSGFDEA" FT CDS 2363..4081 FT /product="Gypsy-213_AA-I_1p" FT /translation="MISVTLAGMPCEFLIDSGAQVNTISEKNFIKLNSDEK FT YREQIFNFRRQTDRSLKPYASSEYIKVLGTFEAPLYISDDRPVLLEKFYIV FT NERYSLLGRSTAIRYCVLLLGFKVPAHSISINAEANDQIKSIRSLNTGVAF FT PKFNIPPVVIHYDKSAPPCRNIFTNIPISLKPIVHQRLLELVRADIIEPVK FT PNMDTSFCSSMLVVPKGKDNIRLVIDLRGPNRYIMRTPFAMPTLETILAEL FT SGARWFSTIDLSNAFFHIELAEESRHLTNFFTEFGTFRCVRLPFGLCNAPD FT LFQEALERKVLAGCPGVKNYLDDVLVFGRTEAEHDENLRQVMARFNDHNVK FT LNEAKCHFKKQSVTFVGFKLTADGWEVEEEKINAIQNFRKPTSCQEVKSFL FT GLVTYIDRFLISRAEKTEYLRRLANSETFYWTTDEDNEFESLRNISTYTIK FT RLGYYSPEDRTELFVDASPIGLGAILVQFNSQSIPRVIACASKSLTATEQR FT YPQTHKEALAVVWGVERFATYLLSKSFIVRTDSEANQFIYNGKYRLGRRAV FT SRAESWALRLQQYDFRIERVPGEFLW" FT CDS 4236..5627 FT /product="Gypsy-213_AA-I_3p" FT /translation="MELTWHRIELSSENDHELQSVKEAIESGYWGSDLTKF FT EPHKKELRFLGNLLFKDNKVILPLSLRNEAMKSAHEGHVGEMAMKRIMREF FT FWWPGMSAGVEKFVRNCETCIRLSKKNPPVPLSSRELPDGPWEVIQIDFLS FT LPGCGSGEFLIVVDIYSRYLSVVEMRCMDAESTNAALCSIFQLWGLPKIMQ FT SDNGPPFQGTNFVSTWEDRGVRVRKSIPLSPQSNGSVERQNQGIIKAVAAS FT KIDKLSWRNAIQKYVHHHNTVVPHSRLNATPFELLVGWKFRGTFPSLWSAK FT RLDRTDIRERDAESKLKSKRYADKTRHATASDIVMGDWVLLAQHKRSKTDP FT TFPADRYRVIAIDGPKITVMSKNGIQYSRNIQEVKRVPFFEEQTIIQPPEF FT DTDNNSSYADKDDQEGFSSLPSPAHDNEGFDDSRDDAAPNNPAKNRNLRPR FT AVLRKPSRFDDKFVYTVIN" XX SQ Sequence 5648 BP; 1734 A; 1110 C; 1317 G; 1487 T; 0 other; atggcgcagt ggaggaaagc agttctcggc ggaggtaagt atttttttta tttttatatt 60 taattgaagt gaagtgaaaa gaaatgacgg catcaaaaat caattaagac agtgcagtat 120 tctaggtcaa gcgataggac ttgacaagag aatgcaagaa gtatttccgg cgaaagtaaa 180 gtgaaatgag aaattgattt taaatcatga gaatttgaat ttatttattg atttttttca 240 agtagaagga gcgagaggcc tcctcgaaaa gaaacatcgg agtgtgtggg tttcatccgg 300 aatcaaagtt ttatttcgac agtaaagtat gctcggtcaa gcgataggac ttgacaagag 360 aatttaagaa gaatctatcg cggaagtgaa gtccgtgcga tgtttttttt tttaatcaag 420 aatttgagcg tattttgttg attttttttt ttcacaattt gaaggagcga gcggcctcct 480 gggaaaaaaa aacggagagt gtcaatttga ttccagcatc aaaattccat tcagacattg 540 aagtatttta ggacaagtga tagggcttgg taagagattg caagaaaaat ctctgatgga 600 agtgatgaca gtagaaatta tatttcggga gaagtgagtg gactctccga gatgaaaaat 660 cgaaatgtat gtatttttcg ctatttttaa aagactttgg aacgcgtgtt agaattggaa 720 actatcagag cgtttggcct cttttattcg tgataagaaa aacgaaacga acatctgatg 780 aagtgtcttt gttttttttt cgcagcgaat cgccggacac gatttacatc acgcgaccaa 840 aagttacaaa aactgaactc agaaataaac taaaatccga aaaagctcgt actgctgcgc 900 tgcttgaaga gatcaaacaa ttgcgggagg aaaatgaaca gttgcgcaaa tcgagcgcag 960 aagttcttgc ggttgctacg aatagtaatg atgtggaaac caatatggtt ccaagtggca 1020 ctcaacaaac cgagtcatcg gtcaaaaatg aggagtcgaa gctgttgtct agtgttcatc 1080 agctttccat ttcttcttta aatgttccgg aatgcaaacc ggatgtagaa ggcgaaggca 1140 tacagcgcca ttcattcgaa atgtggcgag acttactgat tgattcgatg aagttgctgg 1200 gagtaaatga tgaaccgacc atgttcacca tttttagagt gaaagctgga caacaactcc 1260 ttgaaatctt taaaaatacg aagtcagact ctcttgcgcc ggatgaaaat gagtttccct 1320 tctcgaatgc tctacaccgg cttaagtcgt acttcgggtc tggttcagac gttctgttgc 1380 aaagacgaaa gttggccgtg atggatcaaa aatcgaatga atccgatctc tcctacatca 1440 gtcgtgtggg agcagtggct cggctttgcg actatgaaga aggcaaagag tttgagcaga 1500 tcgtaagtac catcgccgaa cacgctacaa gtaaggaagt tcgtgccatt gctctcaaaa 1560 tgttgagtag aaacgcatca tttacggacc tcgtggataa aatccgagag ctggaggcga 1620 ttcggctaaa cgaagagttt tttatgaaga agcgaatgaa gtctgagcct gcagcagcgc 1680 tagcaccagt acgagcggat tttccaggtc agccaagata cgagcataga agcccacaaa 1740 actatttccc cgctagaagc tacaatcgtt cgagcaccgg gtcggggagg tactcgagaa 1800 gtcgtgcgaa ttccactggc ggtcgccgtg agttcagcag gtactacaga tcacctcggg 1860 acaggaatct agtaccacct cgtgatagga gcatcggtgc atctcgtgaa cgtgaaggca 1920 gatgttggcg gtgtaacagc gtctaccact taccggtgga ttgcgatgcg gcagacaaaa 1980 tatgtcgaaa ttgcggtgtg atgggacata ttgcgagagc ttgtacccga cgattcccca 2040 caagttcgtc atcacgtaag agatattcgg aagcagacat cactggagat ggtccaagtg 2100 cgaagcggat ggcagctatc gaagctccaa aagtagagaa cgttgacaaa gaagaagtaa 2160 gtggatttga cgaagcctga agtttcaagt cgctaaattt gaattctgaa atattgtttt 2220 ttcttttttg aatgaaatga tgtatctttt tttgacttat tgataaatat aataaactga 2280 agcattttga ttattttttg tccaggtaat atcatctgat atattcgaat tggtatctgc 2340 acttaacatt ggagatgaag ggatgatctc agtaacgtta gctggcatgc cctgcgagtt 2400 cctgatcgac tccggggctc aagttaacac aatttctgag aaaaatttca taaaattgaa 2460 cagtgacgag aaataccgag aacaaatttt caatttccgt aggcagacgg atcgttcttt 2520 gaaaccatat gcatcttctg agtacattaa agtgctcggc acgttcgaag ctcctctgta 2580 catatcggat gacagaccag tactcctcga aaagttttac atagtaaatg aacgttactc 2640 tttactcgga aggtcgaccg caatccgcta ctgtgtgtta ttactgggct tcaaagttcc 2700 agcacactca atttccatca acgcggaggc aaatgaccaa ataaaaagca tacgatcgtt 2760 aaacacaggt gtagcatttc ctaagttcaa cattccgcct gttgttatac actatgataa 2820 gtcagcgcca ccatgcagga atattttcac taatattccg atttcattga aaccaatcgt 2880 gcatcaaagg ctgttggaat tggtcagagc ggatatcatt gaaccagtga aaccaaatat 2940 ggacacatca ttttgttctt cgatgctggt ggttccgaaa ggcaaggaca acattcgatt 3000 ggtaatcgac ttgagaggtc caaatcgtta tataatgaga acacctttcg ccatgccaac 3060 tcttgaaact atcctcgcag aacttagcgg agcccgttgg ttttcgacca tcgatctttc 3120 aaacgcgttt tttcatatcg agctagcaga agagtcccga catctcacga acttttttac 3180 tgagtttgga acgttcagat gtgtcagatt accgtttgga ttatgcaatg ccccggattt 3240 gtttcaggag gctttggaac ggaaagtatt ggcaggatgt cccggagtaa aaaactacct 3300 cgatgacgta ttggtgttcg gccgtacaga agcagagcac gatgaaaatc tgcggcaagt 3360 catggcccga tttaatgatc ataatgtaaa acttaacgag gctaaatgcc acttcaagaa 3420 gcagtcagtc acgtttgtag gcttcaaatt aacggcagat ggatgggaag ttgaggaaga 3480 aaagattaac gcaatacaaa acttcagaaa gccaacatcg tgccaagaag tgaaaagctt 3540 ccttggactg gttacgtaca ttgaccgatt cttgatcagt cgagccgaga aaacggaata 3600 cttgcgccgg ttggcaaata gcgaaacctt ctattggacc accgatgaag ataacgaatt 3660 tgaatcactt cgcaacatat caacatatac tatcaagaga ttggggtact atagtcccga 3720 agacagaacg gaactttttg ttgacgcctc cccaataggt ttgggagcca ttcttgtcca 3780 gttcaacagc cagtccatac ctagagtgat cgcgtgtgct tccaagtcac tgacagctac 3840 agagcaacgt tatccccaaa cgcataagga ggcactagcc gtggtatggg gagtggagcg 3900 atttgcgacc taccttttga gcaagtcctt catcgtgcga acggactcag aagctaatca 3960 gttcatctac aatggcaagt acagattggg aagacgcgca gtctccaggg ctgagtcatg 4020 ggctttacgt ttacagcaat acgattttcg catcgaaaga gttccaggtg agttcctgtg 4080 gtaacagacc ttaaacatgt tctcaccgat tttttataat aggtaactgc aatgtggcag 4140 acgcgctctc tagattggta catgatgcaa aggaagcagt tccatttgaa agtgataatg 4200 aaaaccattt cttgtattca ctcgatttcg caagtatgga acttacatgg catcgcattg 4260 aattgtcttc cgaaaatgac catgagttgc aatcggtgaa agaggccatc gagagtggtt 4320 attggggatc cgatttgacc aaattcgagc ctcataagaa agagctgcgg tttctaggta 4380 atctgctgtt caaagacaat aaagtgattc ttcctttgtc tctaagaaat gaagcgatga 4440 aatcagcaca cgaaggacat gttggagaaa tggcgatgaa aaggattatg cgagaattct 4500 tttggtggcc aggtatgtca gcaggtgtgg aaaagtttgt cagaaactgt gaaacatgta 4560 taaggttatc taagaagaat ccaccagttc cattatcatc ccgcgaactt cctgatggac 4620 cttgggaagt aatacaaatc gatttcttgt cccttcctgg ttgtggctct ggagaatttc 4680 ttattgtcgt tgacatatat tcacgttacc tgtccgtggt cgaaatgcga tgcatggatg 4740 cggaaagtac gaatgcagca ttatgcagta ttttccagct ctggggtctc ccgaagatta 4800 tgcaaagtga caatggcccg ccatttcaag gaactaattt tgtttctacg tgggaagata 4860 gaggagtaag agttcgtaag tcgattccac tgagccctca gtctaacggc tcagttgaac 4920 gccaaaatca gggcattata aaagcggtag cagcttcaaa aattgacaaa ctcagttgga 4980 ggaatgccat tcagaagtat gtacaccatc acaacacagt agtaccgcat tctcgtttga 5040 acgccacccc cttcgagtta ttagtaggat ggaagttcag aggaacgttt ccttcgttat 5100 ggtcagctaa gagattggat cgaacagaca ttcgagaaag agacgccgaa tccaaactga 5160 aaagcaaacg atatgcggac aaaacgcgac acgctacggc ctccgatata gtgatgggag 5220 attgggttct attggcgcaa cataaaagaa gcaaaactga tcctactttt cctgcagaca 5280 gataccgcgt tattgctatc gatggaccaa aaatcactgt gatgagtaaa aatggtattc 5340 aatactccag aaacatccag gaagtaaaac gtgtaccatt tttcgaagag caaacaataa 5400 tacaaccgcc agaattcgac acggacaaca actcaagtta tgcagacaaa gatgatcaag 5460 aaggcttctc atcactacca tccccggctc atgacaacga aggttttgac gattcccgag 5520 acgatgccgc acctaacaac ccggctaaaa atagaaattt gcgtccccga gccgttctac 5580 gtaaaccatc gaggtttgac gacaaatttg tgtatacagt tataaattag agtagacata 5640 gtaagaaa 5648 // ID DNA4-1_CQ repbase; DNA; INV; 1716 BP. XX AC . XX DT 26-DEC-2010 (Rel. 16.01, Created) DT 26-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA4-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1716 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 71-71 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >95% CC identity. ~580 bp TIRs. 4-bp TSDs. XX SQ Sequence 1716 BP; 575 A; 275 C; 288 G; 578 T; 0 other; cacggcgtct ttttcagggg ggtagtattt tttgagaaat agcaagaaaa ctgtcatata 60 catatgaaag tgtggaacat ccccgttcat ttgcggggcg aacgaaaatc tttggtcggg 120 aaaaatcaaa gttatcctca tttgaagttt ttgaactttt ttcctatggg gccaaatttc 180 gtctatttca atttagtatt gttataagtc tacatgggca tgatttatgg ttcagatcat 240 atgatttctt atgggaaatg atgcaaggaa catttttcct gaagcacgca aagtgattga 300 aaataaggga aaaaagttat aaaggtttta ttgaaaattt gggagatttt ctgataaaat 360 tgcacacccc tttcccaaaa accacaatgt ttttgatatt cagatcatta tttcgatgaa 420 ttcttttttg agaaatgttt ctggacccat tgagtatata aatcactaca gaaaagtgta 480 attggttggg tttaagctca actgtatgtc acatttatga attttggatt tcaccgaatc 540 aacatatttt tgtgtgtttt ggttataaaa tgtaccgttt acattaaatt ttcacttttt 600 tttgtgagta catggtccac atgatatgtt ttgtatctaa ttgagacatg cagtgtaaat 660 acaatcacag ggtttctatt tctatgcctt ttgttaattt gtatttccat ttgggctata 720 ttatttgtat tgaaactaca gagttgaact tgcaattatt ggtaataaca taaatattgc 780 gtaaaaatca ttaaaaagtt taaatgtatg taaatctatt aatttaatct caaataattg 840 agcttaacat actaaacaag tctggcatcc aaatttgaaa caacgaacaa tacaacaatt 900 aaattgtttt gctaaaaaac acacaaataa ttgatcacag tggcagtgcg tggccgaatg 960 gttacgctgt ccgctttgta agcggatgat tctgggttcg attcccaacc tgggggtcgt 1020 taatgtggat ggtttgattt gattttgatt tacaatatta tcacctgacc tcaaaataag 1080 tacgagatct cttgtgaggt ttaataagac taatacaaat gtgaaaatct aatggaaacg 1140 gtacatttta taacaaaaac aaacaaaaat atgttgattc ggttgaaatc caaaattcat 1200 aaaagtggct tacagttgag cataaaccca accaattaca cttttctgta gtgatttata 1260 tactcaatgg gcccagaaac atttctcaaa aaagaattaa tcaaaataat gatctgaata 1320 tcaaaaacat tgcggttttt gggaaagagg tgtgcaattt tatcagaaaa tctcccaaat 1380 ttccaacaaa acctttataa cttttttccc ttattttcaa tcactttgcg tgcttcagga 1440 aaaatgttcc ttgcatcatt tcccataaga aatcatatga tctgaaccat aaatcatgcc 1500 catgtagact tataacaata ctaaattgaa atagacgaaa tttggcccca taggaaaaaa 1560 gttcaaaaac ttcaaatgag gataactttg atttttcccg accaaagatt ttcgttcgcc 1620 ccgcaaatga acggggatgt tccacacttt catatgtata tgacagtttt cttgctattt 1680 ctcaaaaaat actacccccc tgaaaaagac gccgtg 1716 // ID CR1-83_AAe repbase; DNA; INV; 4127 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-83_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4127 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1171-1171 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 21 sequences with >91% CC identity. Closely related to T1 and Q. This consensus is likely CC 5'-truncated. XX FH Key Location/Qualifiers FT CDS 3..434 FT /product="CR1-83_AAe_1p" FT /translation="TEMKNELMETKALTKSLAAKISSNDVTSNQARVAWPS FT IKRSREATSARETPRSRPKVKLVVGIKSVEKESMTVDTVAKPAEKFWIYLS FT RIARHVTEADIAELVKKCLNTNAIDVRKLVRKDADLEQLAFISFKIGVDLK FT LTTAQ" FT CDS 471..3998 FT /product="CR1-83_AAe_2p" FT /translation="MMEASKPSDTVEPCRFISSTNCSGHPSRSVPVFGIGE FT EVXQLAHSGKYNNNXVSSYTEYCSTSSTSVCNDGHPIPRSLSFVTSSTTEG FT TSLSMMEASNSSDTVEPRRIVSSIADFVSSTELHHRSSTTERTSQSTMGAP FT KPSDTVEQCRFPSPASCSGRPSRSVPVFGIGDGVSQPACSGKYNDIGAFSD FT PESRPFCSNNVSPVSGYLPNLKSPSSLAIQPHGKPDNRLWAYYQNVRGLRT FT KVDELFLAAADCGYDIIFLTETGLDESINSVQLFGTSFNVFRCDRNSSNSD FT KASFGGVLIAVAQCYPSSLVTVEHGSNLEQVCVSTSIKGIRLSLAAVYIPP FT DKSREVEIMEAHIASVKELTDKDGKVIVCGDYNQPRLQWIERDNKVYLAET FT SSLNAASIALLDGMDFLNLNQCNTIKNELGRTLDLVFYNVESEAVIDEAAV FT PLLPTDAHHPPLVLSLPSLLAPSSRSVDAQIRLFDYRKIDYASLIQFLSDR FT NWNDIIESSNVDEMTVQFCNVVKHWLESNVPFKRRPFSPAWSTPSLRKLKR FT KRNAAQRRLRRCKTVDAKREFKRTSTAYSTLNAALYKSHVLRAQWNLRRNP FT RAFWSFVNSKRKNPSIPINIRLEDRVAACAAEASELFAEHFASVFETCEAS FT DIEMESAACDLPSDVLNLSTFVVTEEMIITASRKLKNSFSPGPDGIPAVIF FT RRCAVVLAKPLAAIFTRSFGLGIFPALWKQSFMFPVFKSGDRRDVKNYRGI FT TSLSAASKLFEIILSDAVLNASKSYVSTDQHGFIPGRSVVSNLLEFTSTCI FT TAMEQNTQVDVIYTDLKAAFDKIDHRILLCKLAKLGASETLTSWLHSYLTG FT RTLRVQLDSRVSKPFTSSSGVPQGSNLGPLLFTLYFNDVAALLGAKVKLVY FT ADDLKLYLIVRTVEDCRRLQCLLNLFVDWCRRNKLIVSIPKCVVMTFHRIK FT DPILYNYYIDGNLLRRVDRVNDLGVLLDPKLDFRLHYSSIIAKANRQLGFI FT SKIARDFKDPHCLKALYCSLVRPIIENASVVWCPYEVTWILRLEGVQKRFV FT RLALRNLPWRDPVNLPPYPNRCLLLSLETLERRRKIQQATLVAKIINGEMD FT CPELLSQLNFRIPQRTLRSYTLLQPRNHRTAYGYHEPLTAIIRVFTSMESA FT FEFGEPSYKFRNRIRRM" XX SQ Sequence 4127 BP; 1142 A; 903 C; 905 G; 1174 T; 3 other; tgacggaaat gaagaatgaa ctcatggaaa cgaaggctct aaccaaatca ttggcagcga 60 aaataagttc aaacgatgtt acttccaatc aagctcgagt cgcatggcct agcatcaaac 120 gatctcgtga agccacaagt gctagagaaa cacccagatc tcgaccaaaa gtaaaacttg 180 ttgttggaat taaatcagtc gagaaagaaa gtatgactgt tgacacggtt gccaagccag 240 cagaaaaatt ctggatatat ctttccagaa ttgcccgtca tgtaactgag gcagatatcg 300 cggaattagt aaaaaaatgt ttgaatacca atgcaattga tgtgaggaaa ttggtgcgta 360 aagatgctga tttggagcaa ttggcattca tttctttcaa aattggtgtc gacctgaagt 420 tgactacwgc acaatgagat tcacctttaa cggaacgcac gtcactaagc atgatggaag 480 cctccaagcc atccgacaca gtcgagccct gccgttttat ttcttctaca aactgttccg 540 gccatccaag tcgttccgtt cctgtgttcg gaattggtga ggaggtcttm caacttgctc 600 actcaggcaa gtataacaat aatasagtct cttcgtacac tgaatattgc tcaacttcca 660 gcacaagcgt gtgcaacgat ggccatccga tccctcgctc actcagcttt gtaacatctt 720 caactacgga aggcacgtca ctcagcatga tggaagcctc gaattcatcc gacacagtcg 780 agccccgccg aattgtttct tctatcgccg acttcgtttc ttctactgaa ctacatcatc 840 gctcatcgac aacggaacgc acatcacaaa gcacgatggg agcccccaag ccatccgaca 900 cagtcgagca atgccggttt ccatctcctg caagttgttc cggccgtcca agtcgttccg 960 ttcctgtgtt cggaatcggt gacggggtct cccaacctgc ttgctcaggc aagtacaatg 1020 acataggagc tttctcggat cctgaatctc gtcctttttg tagcaacaac gtttcgccgg 1080 tcagtggtta cctgccgaac cttaaatctc cgtcatcatt agccattcag ccccatggta 1140 aacctgacaa ccgactttgg gcttactacc aaaatgtgcg aggacttcga actaaagttg 1200 acgagttgtt tcttgctgct gctgattgcg ggtatgatat tatatttttg actgaaacgg 1260 gtctagacga gagcatcaat tccgttcaac tattcggtac gagtttcaac gttttccgtt 1320 gcgatcgcaa ctcaagcaat agcgataaag ctagttttgg tggagtgttg attgccgtcg 1380 cgcaatgcta tcccagttct cttgtaactg tggaacatgg cagcaattta gaacaagtct 1440 gtgtctcgac gagcataaag ggtatcaggc tcagtcttgc cgcggtatat ataccaccgg 1500 ataaaagtcg agaagttgaa attatggagg cccatatcgc ttccgttaaa gaactgactg 1560 ataaagatgg caaagtgata gtttgcggtg attataatca acctcgatta caatggatag 1620 agagagataa taaggtctac cttgctgaaa catcatcact gaatgctgcc agtattgcgc 1680 ttctagacgg gatggatttt cttaatctta atcagtgcaa tactatcaag aatgaacttg 1740 gacgtacgct tgatttagta ttctataatg tagagagcga agctgtaatc gatgaagctg 1800 cagtaccgct tttgccaacc gacgcccacc accctcctct ggttttatcg ctaccttcct 1860 tgcttgctcc ttcttctcgc tcagtagatg cccaaattag actatttgat tatcggaaaa 1920 tcgattacgc ttcgcttata cagtttcttt ctgatcggaa ctggaacgat atcattgagt 1980 catcgaatgt tgatgaaatg acagtgcaat tttgtaatgt tgtaaaacat tggttggaat 2040 caaatgtgcc attcaaaaga cgaccttttt cacctgcgtg gagtactccg tctcttcgga 2100 aacttaaacg aaaacggaac gcagctcaac gaagattacg gagatgtaag acggtagatg 2160 caaagcggga atttaagcgc acgagtaccg cctattcaac gttaaatgcg gcgctatata 2220 aatcgcatgt tcttagggcc caatggaatt tgcgacgaaa tccacgtgcc ttttggagtt 2280 ttgtaaactc taagcgcaaa aaccccagca ttccgatcaa tatacggctc gaagatcggg 2340 tggctgcctg cgcagctgaa gcgagtgaat tatttgctga acactttgca tctgtgtttg 2400 aaacgtgtga agctagcgat attgaaatgg agtctgccgc ttgtgatctt ccgtcagatg 2460 tgttaaactt gtctacgttt gttgtaacag aagaaatgat tataacggcg tccagaaagt 2520 tgaaaaattc attctctccc ggtccggatg gtattcctgc cgtgattttc cggcgttgcg 2580 ctgttgtctt agccaaacct ctcgctgcta tcttcactcg atcttttggt ctgggcattt 2640 ttccagcctt gtggaagcaa tcattcatgt tcccggtgtt caagtcaggc gatcgacgag 2700 atgtgaaaaa ttatcgagga atcacaagtt tatctgctgc atcgaagtta ttcgagatca 2760 ttctgagcga cgctgtattg aacgcctcta aaagttatgt ctccacagac cagcatggtt 2820 ttattcctgg tagatcggtg gtatcaaact tgctggagtt cacaagtaca tgtatcacgg 2880 caatggaaca gaacacccag gtcgatgtca tatacactga cttgaaagct gccttcgaca 2940 agatcgatca taggatacta ctgtgtaaac tcgctaaact cggagcgtcg gaaacactca 3000 catcttggct gcattcttac ctgacaggta gaacattacg cgtgcaattg gattcacgtg 3060 tatctaagcc atttacaagt tcatctggtg tacctcaggg tagtaatttg ggaccgttac 3120 tttttacgct atatttcaac gatgttgcgg cattactggg agctaaagtc aaacttgtat 3180 atgctgatga tttaaaactc taccttatcg tccgtacggt agaagattgt cggcgtcttc 3240 aatgtttact aaatttgttc gttgattggt gtcgtcggaa taaattgatc gttagtattc 3300 caaaatgcgt ggtaatgaca ttccatcgta taaaggatcc aatactgtac aactactaca 3360 tcgatggcaa tttgcttcgt agagttgata gagtaaatga ccttggagta ctcctggacc 3420 cgaagctcga ttttcgtcta cattattcat caatcatcgc gaaggctaac agacagcttg 3480 gctttatttc taagatagct cgagacttca aggatcctca ctgccttaag gcattgtact 3540 gctcactggt taggcctata attgaaaatg cttctgtggt ctggtgtccg tacgaggtta 3600 cctggatcct tcgccttgaa ggagtgcaga agagattcgt gaggctagcc ttacggaatc 3660 ttccatggcg tgacccagta aacctaccgc cttacccgaa cagatgtttg ttgctaagct 3720 tggaaacatt ggaacgaaga aggaaaattc aacaagcaac gttggttgca aagataataa 3780 atggtgaaat ggattgcccg gaattattat cgcaattgaa ctttcgcatt ccgcaacgga 3840 ctctaaggag ctacacgtta ctacagccaa ggaatcatcg tacggcatac ggctaccacg 3900 aacctctaac ggcgattata agagtgttta cttcgatgga atcagcgttt gagtttggtg 3960 agccctcata taagttccgc aatagaataa gaagaatgta atgtactgtt tttaatgtag 4020 ttttgtgctg tagattgtct ttgttcatta gttttttttt atcaattaac cacattcatg 4080 tagactgttg ttgtcagatg aattaaatca ataaaacaat ataaaaa 4127 // ID CR1_Ele18 repbase; DNA; INV; 4697 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele18. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4697 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4697 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 22 CC sequences with >91% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 763..1620 FT /product="CR1_Ele18_1p" FT /translation="MAQICEKCANEISGETVTCGGFCSSIVCLKCSAISDA FT MYASIKANMHLVWMCTCCRNLLSKARFSNSLVSVNKASESVIESMKAEIRD FT SILADIKHEIRSNFKTLINSVPRTPASHYRTPLVASSKSKRPRENDDDXDN FT PASRPAKSMCCIGTNAADTNLVVPEAPTEDSNMFWLYLSGILPEVPDSKVT FT ELAESKLKSTNLRVVKLVAPGRDIKSLTFVSYKIGMPADLKPTALSTETWP FT RGIRFREFENKGSKKQFFWRPTEPTPNQSDLTETPRASGQSHFQR" FT CDS 1527..4628 FT /product="CR1_Ele18_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="EAVFLEADRTNAQSIRFNRNTESIRSESLPAINSAVV FT SRIEGGSPTVDHNVINHSPNATITVSHSTPTTAPSPIADRYTPRSSNTPLT FT VYYQNVGGMRSKTNDFFLATASCEYDVIVLTETWLRSDVSNSELSSDYNIF FT RCDRSSASCHFSRGGGVLIAVKKAISSLPVRLERCNSLEQIVVKLRLQSKS FT VYVCGIYLRPNSDPDKYAAHADSVQQIMQKTSDGDSVLIIGDYNLPHLHWA FT YDDDLDSHLPTNASSEQELVFTENMIASGLLQICDVRNSNDRLLDLAFVSN FT SSEVELFEVPFSILPPDRHHKPFVLRLFTSELEASHAVSAQHDFNFNRCNY FT NAVIEELSLFRWEELFQETDVNAAVANFYNVVFGVIRRHTPLRRSSPRRPN FT SLPWWNADLRRRRNVLRKVRRRFLQHRSEENRSLLRRLEAEYNECVSSSFR FT EYIFRIEDEVKADPSVFWRFFKERKGAQSLPSEMSFGGITSHSPTESVELF FT ADFFKSVYNPSSPLISADALDLLPSFDIHFPCPTFSIADVTEALNAIDSSK FT GPGSDKLPPVFIKRCAEVLAIPVCRLFNLSLTEAIFPDAWKLSAITPIHKA FT GNLHKIENYRPISILCCLAKTFERLIHDRMYSAAKPLISQYQHGFMKNRST FT TTNLMAYVSSLNFSMEKRFQVDSIYVDFSKAFDKVPHDLTLRKLDRLGFPD FT WFINWLRSYLRDRTAYVNLKSVESTRFPTPSGVPQGSHLGPLIFIIFVNDL FT IDRINSHRLMYADDLKIYRIISSSIDCAALQQDIDTVSSWCVLNGMEMNAI FT KCKVISFTKSRAQMRFDYSANGTLLDRVKSIKDLGVVMDEKLNFNEHVAAT FT TAKAFAVLGFIRRNASEFRDPYALKSAYCSLVRSILEYAVQVWAPYHETQI FT ARIERVQRCFVRYALRRLPWSNPIVLPAYDSRRQLIGLESLRHRRVYLQRM FT FAYDVISSRIDCPELLQQANFFVPARLLRQRSLIRISRHRTVFGQNHPLEK FT CFNLLNDVDIFDFTIDRNRFKNIVRQIY" XX SQ Sequence 4697 BP; 1235 A; 1144 C; 1005 G; 1312 T; 1 other; agattttctg gcaccactgt tcttgcaact tgtgttgttt ttgtgtcgct ccatcaattt 60 ttcgattact gccgtcgtac atctcaacaa atcatcgccg atcatctttc tggctcctga 120 ctcatcaatt actcgcccgc attgaaacca ccctcaatct tggacaggta ggatagtgat 180 tttgaaataa taaaatccga gccggctaga gagtggatgt tttgcccgca ccgccattgc 240 cacccgctgc atttttccca cctcgccgct ctcctcctgc ttccactgta ttcatccaac 300 acaagtaagc ctcattcaag cactcgcagc caagcaccgt gtattaacac caccacactg 360 gcgctatctc cgatactgca cggaatttaa taactcggac tgtactacgc ttatcaagac 420 tgctggataa tatatccccg tttgctgtcc cgtcttatta ctttgacaca tccgaatgcc 480 cttcgcataa ccgcattggc ctgttccgca tccattcgta cccaggatcc acccaacgta 540 agtcgagtca tttcgccgta gtgttagttg ttgacataca tacgtccact taaatcgctc 600 acgcaacgga atcagctgat acgagcaagt gagacgtgag ttacattcat catctcactg 660 cgtttcttct ggatcatcat ctagccatct catcgatcgt attggatcgg ccatctacct 720 gacactagcg ggaacttatc cagtgttgct tgttccactg aaatggctca aatctgcgaa 780 aaatgtgcga acgaaatcag cggtgaaact gtgacatgtg gcggtttctg ttcttcgatc 840 gtatgcctaa aatgctccgc aatttctgac gcaatgtatg cttctattaa ggcaaacatg 900 catctggtat ggatgtgcac gtgttgcaga aatttgcttt ccaaagcgcg tttttccaac 960 tctctggttt cggttaacaa ggctagtgaa tctgtgatcg agtccatgaa ggctgagatt 1020 cgtgacagta ttctggcgga tatcaaacat gaaatccgtt ccaatttcaa gactctcatc 1080 aattccgttc cacgtacgcc agcatctcat tatcgaacac cgctcgttgc atcatcaaaa 1140 tcgaaacgac cacgtgagaa cgatgatgac gwcgacaatc ccgctagtcg accagcaaag 1200 agcatgtgtt gtattggcac taatgctgcg gatacaaacc tggttgttcc agaggctcca 1260 acagaagatt ccaacatgtt ctggttgtat ctatctggaa ttcttccaga agttccagat 1320 agcaaggtga ccgaattagc agagtcgaag ctgaaatcaa ccaatttgcg ggtggttaaa 1380 cttgtcgcgc ctggtagaga catcaaatcg ctgactttcg tgtcgtataa aatcggaatg 1440 ccggcggatc ttaaacctac tgcgctttcg acagaaacat ggccccgtgg aattcgcttc 1500 cgcgagtttg aaaacaaggg tagtaagaag cagttttttt ggaggccgac agaaccaacg 1560 cccaatcaat cagatttaac cgaaacaccg agagcatccg gtcagagtca cttccagcga 1620 taaactctgc agtggtttcc cgtatcgaag gtggttctcc cacagttgat cacaacgtta 1680 tcaatcattc accaaacgcc accatcactg tatctcattc aactcctacc actgcgccgt 1740 ctccgatagc cgatcggtat actcctcgtt cgtcaaatac gccgttgact gtctactacc 1800 agaatgtcgg tggaatgcga agcaaaacca acgacttttt cttggctact gcctcctgtg 1860 aatatgatgt tatcgttctg actgaaacgt ggcttcggag cgatgtgagc aactccgaac 1920 tatcctccga ttataacatt ttccgttgcg accgtagcag tgcatcttgt cacttcagca 1980 gaggtggggg agtgctaatc gctgttaaga aggctatctc aagtctccct gttcgactcg 2040 aaagatgtaa ctcactcgag caaattgtgg taaaactccg cctccagtca aagtctgttt 2100 atgtctgtgg gatatacctc aggccgaaca gcgatcccga caagtatgcc gctcatgctg 2160 acagtgtgca acaaataatg caaaagactt ccgacggtga ttctgtgctg ataatcggtg 2220 actacaacct gcctcatctg cactgggcat atgacgacga tttggattca cacctaccga 2280 cgaacgcgtc ttcggagcaa gaactagtgt tcacagagaa catgattgcc agtggtttat 2340 tgcagatttg tgacgtacgt aactcaaacg atagactact tgacttggcc ttcgtcagca 2400 acagtagcga agttgaattg ttcgaagtgc cattctcgat tttacctcct gaccgccacc 2460 acaaaccgtt cgttttacgt ttattcacaa gtgaattgga ggcttcgcat gcagtgtcgg 2520 cccagcacga tttcaacttt aaccgttgca attacaatgc tgtaatcgag gagctgagct 2580 tgtttagatg ggaagaacta ttccaggaga ccgatgttaa cgcagcggtt gccaatttct 2640 acaacgtggt tttcggggta atacgcagac acactcctct tcgccgatct tctcctcgcc 2700 gcccgaatag tttaccgtgg tggaacgctg atcttcgtcg ccggcgaaat gtcctccgca 2760 aagtgcgtcg ccgtttcctg caacaccgtt ccgaagaaaa cagatccctg ctacgtagac 2820 tcgaggctga atacaacgag tgcgtctctt catccttccg tgagtacatt ttccgcatcg 2880 aagatgaggt taaggcggat ccttccgtgt tttggcgttt cttcaaagaa aggaaagggg 2940 cacaatcact cccatcggaa atgtcttttg gtggcataac aagtcacagc cctacggaat 3000 cagtcgaact ttttgctgat ttttttaaat cagtgtacaa tccctcgtct cctttgatat 3060 ctgcggatgc gttggatttg ttaccatctt tcgatattca ttttccgtgt ccgacgtttt 3120 caatagcaga tgtcaccgaa gccctgaacg ccatcgactc atcgaaaggc ccaggatcag 3180 ataaacttcc tcctgttttt atcaaacgtt gtgctgaggt actcgcaata cctgtttgtc 3240 ggttattcaa cttatccttg actgaagcaa ttttcccaga cgcatggaag ttatctgcaa 3300 ttacaccgat acataaagct ggaaaccttc acaagattga gaactatcga ccgatttcga 3360 tcctctgctg cttagcgaag actttcgaac gcttgattca cgacagaatg tattcggctg 3420 ccaagcccct aatttcacag tatcagcatg gcttcatgaa gaatcgctct actactacca 3480 acttgatggc gtatgttagt tcgctgaact tcagtatgga aaaacgcttc caagtggata 3540 gtatttacgt ggacttttct aaagcgttcg acaaggtacc gcacgactta accctaagaa 3600 agctcgatag actaggtttt cctgattggt tcattaactg gttgagatcg tacctgcgtg 3660 atagaaccgc ttatgttaat ttaaagtctg ttgagtctac gcgttttcca actccttcgg 3720 gtgtcccaca aggtagccac ctgggaccct tgatatttat aattttcgtc aatgacttaa 3780 tcgatcgaat caattcacat cgtctcatgt atgcggatga tttgaaaata tatcgaatta 3840 taagctcgtc gatcgattgt gccgcgctac agcaagacat cgacaccgta tctagttggt 3900 gtgtattgaa cggtatggag atgaatgcga taaagtgtaa agtgattagt ttcaccaaat 3960 cgcgggcgca aatgagattc gactatagcg cgaatggaac gttgttggac cgcgtaaaat 4020 ctattaaaga tcttggagtt gttatggacg aaaagttgaa tttcaacgag cacgttgctg 4080 cgaccactgc caaggccttt gctgtactag gattcattcg acgaaacgca tccgaattcc 4140 gtgatccata tgctttaaaa tccgcttact gctcgctcgt aagaagcatc ctggaatatg 4200 ctgttcaagt atgggctcct taccacgaaa ctcaaatcgc tcgtatagaa agagtgcaac 4260 gatgcttcgt ccgatatgcc ctgaggagat taccgtggtc aaacccgatc gtgctgcctg 4320 cgtatgatag tcggcgccaa ctaatagggc tggagtcact acgccatcga agagtttacc 4380 ttcaaaggat gttcgcatat gacgtaataa gtagccggat cgactgccca gagcttctgc 4440 aacaggcgaa tttctttgtc ccagcgcgtc ttttacgtca acgttcttta atcaggataa 4500 gtcgacaccg tactgttttc ggccaaaatc atccactcga aaaatgtttt aatcttttaa 4560 atgatgtaga tatttttgat tttactattg atagaaatcg ttttaagaat attgtgagac 4620 aaatttatta gtaagataag gggtattcag tctgtacagc aaagctgaag acggtgttaa 4680 ataaataaat aaataaa 4697 // ID Nobel_LTR repbase; DNA; INV; 314 BP. XX AC . XX DT 09-MAY-2009 (Rel. 14.06, Created) DT 09-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE Nobel_LTR: LTR sequence flanking Nobel_I in Drosophila DE persimilis. XX KW BEL; LTR Retrotransposon; Transposable Element; Nobel_LTR. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-314 RA Austin S. and Styles P.; RT "Nobel_LTR: LTR sequence flanking Nobel_I in Drosophila RT persimilis."; RL Repbase Reports 9(6), 1152-1152 (2009). XX DR [1] (Consensus) XX CC Nobel_LTR is the LTR sequence flanking Nobel_I in Drosophila CC pseudoobscura and D. persimilis. Solo LTRs are observed in both CC species. There are 25 copies of the LTR in D. pseudoobscura and CC 110 copies in D. persimilis. Nobel_LTR is 314bp in length. 51 of CC the 110 LTRs in D. persimilis share a series of 5 mutations from CC the consensus sequence. XX SQ Sequence 314 BP; 104 A; 53 C; 63 G; 94 T; 0 other; tgtagcgtca tttgaatatt aataattgtt attaggatag tatttcatga agtctagtgt 60 aagttaggct agggcaaagc gggagcgcaa taaaagctcg ttctctcgct cttggcgaat 120 tcgccccgct ttgccaccgt aggtaaggta ggcttagaag aaattgtttg aatatattga 180 aaagaagtcg agaaaaatcc ctgaaatcga acgcacgcac cccttttctt tcgtcgtatt 240 aaaataaaaa aaaaattgca gtcacccaag aaatttcggt attggttatt tagaaaaact 300 attctaaaat ttca 314 // ID hATx-3_SM repbase; DNA; INV; 2705 BP. XX AC . XX DT 21-OCT-2007 (Rel. 12.1, Created) DT 21-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hATx-3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2705 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 7(10), 1039-1039 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 690..2549 FT /product="hATx-2_SM_1p" FT /translation="MASNEKESWVWKYFIKRIVNRNTVGAKCTICKNEKDL FT VANTSTLARHLTSIHNINIKSAHLDSTIIATQRVQTNVISFLQRKTLEYLI FT AELAAVDGFSIRGITNSKFIRESITAKGYHLPRCETQVMGLVHKFYDQVKE FT AAIQQIIDDKSENLKFSITLDEWVDATARKFFNVTLNRSRNGVINLGLVRI FT IGSCTAEKLFEAVNNHLLSFGIDFTRDIVAATGDGASTMLKFGRLSPVIYQ FT ICLNHAFHLGVIKTFYHKKERLLDINQAIETEVLEDNFNEDEYPMEGDEDE FT EFYLNEDIESMLASVRRIVVLFRRSSLKTDLLHKKIQELNVVTENDISAED FT ELQKKQLDEKVKLQLDVKTRWNTIVHMIRIFIKYYEPIKDALIELDEERML FT YGINKENLQMLINILTPIELAVLELGKRDATLRKAMVSIEFVRDKISQNTS FT AISLKLLQNINEYIDDRMDPNLFHLIECLDTAAVPPNSVLKFATEIMKRLF FT GTDETNDENESDNILATVSREDNNFSSLAQELTQRINSSSIPKTLQLKFDK FT LKQEFQIFKSTGNRTENLERLRNALDTIQPTSTESERTFSVAGNFLSKLRC FT KLSDKSLNALIFLKYYFKNPWC" XX SQ Sequence 2705 BP; 980 A; 412 C; 465 G; 848 T; 0 other; tagggattcg gcttgattaa gcttgagctt gaaagcttgg tgtttttggc caagctcgaa 60 ggctcgggct tgacaggtaa agcttgacaa gcttcgagct taatatattg cttatgtact 120 gttgcagata agtgataaaa aaataaattt tagtttaatt tcttcaactt caatcaaaat 180 ttttaaaaac ttaaaaagat aaatttaacg tacaaaaata atttgaacta tcccaaagtt 240 tcttgtgatg gaactgttgg aattctcctt tttcagttca atttctagtt ttaatccgtg 300 ttgtcataac gtatatttca tatttaaata caattatgta aatagtagtt gtaagcccct 360 caatagaaaa taaatagtta ctttctggaa gtcgacaaga aaggtcgtct tcattactta 420 caagccgtat ggcatttcct gtgtctatat tactatataa aacgtttgga gtttaaatgc 480 gtgaaattat taaagccggc ggcatttgag tgatttcatt actatttaaa agtgagtaaa 540 cattaggaaa ttgtaaaata aagaagaaat aaagaaaaaa cttaacctca tcagagtaca 600 aaattaacta gaaaaagtgt atttggtata ttttttattc atataattgc taatgaaata 660 tatacatatt ttattattta gacaacataa tggcatcaaa tgagaaggaa tcgtgggttt 720 ggaaatactt tatcaaacga attgttaaca gaaatactgt cggtgcaaaa tgtacaattt 780 gtaaaaatga aaaggattta gttgctaaca cttctacttt ggctcgacat cttacttcta 840 ttcacaacat taacatcaaa tcggctcatc tagacagcac aattatagct acgcagagag 900 ttcaaacaaa tgttatttcg tttttgcagc gaaaaacatt agaatatctt atcgccgagc 960 tagcagctgt tgatggtttt tcaattcgag gcataacaaa ttcaaaattt attcgcgaaa 1020 gcattactgc gaaaggttat cacttgccaa gatgcgaaac tcaagtcatg ggtcttgtgc 1080 acaaatttta tgatcaagta aaggaagctg caatacaaca gattattgac gataaatcag 1140 aaaatttaaa gttttccatt actctcgatg aatgggttga tgccacagca cggaaatttt 1200 tcaatgtcac attaaatcga tcacgcaacg gtgtaattaa tctaggacta gtgagaataa 1260 ttggaagctg tacagctgaa aaattatttg aagctgttaa taatcattta ttgagttttg 1320 ggatcgactt cacacgcgac attgtagctg caactggtga tggagcaagt acgatgttga 1380 aatttggcag attaagtccc gtcatatatc aaatctgtct taatcatgct tttcatctgg 1440 gcgtcatcaa aactttctat cataaaaagg aaagattact tgatataaat caagcaattg 1500 aaactgaagt tcttgaggat aactttaatg aagatgaata tcctatggaa ggtgacgaag 1560 acgaagaatt ttatctcaac gaagatattg aatcaatgtt agcttctgtc cgtagaattg 1620 tagtgttatt cagacgatct tcactcaaaa ctgatctttt acataagaaa atacaagaac 1680 tgaatgtcgt cactgaaaat gatatcagtg cagaagatga acttcaaaag aaacaacttg 1740 atgaaaaagt taagctccaa cttgatgtga agacaagatg gaatacaatc gtccacatga 1800 taagaatttt cataaaatat tatgaaccaa ttaaagatgc attgatagag ctggatgaag 1860 aaagaatgtt gtatggtatt aataaagaaa accttcaaat gcttatcaac atccttactc 1920 ctattgaatt agcagtattg gaattaggaa agcgagatgc aactcttcga aaagcaatgg 1980 tttctattga atttgttcga gataaaatat cacaaaacac ttcggctatt agtttgaaat 2040 tattgcaaaa tataaatgaa tatattgatg atagaatgga tccaaacctt tttcatctca 2100 tcgaatgcct tgacactgct gctgtaccac cgaattctgt tctaaaattt gctacagaga 2160 taatgaaacg actttttggc acggatgaga caaatgatga aaatgaatct gataatattc 2220 tggcaacagt ttcacgtgag gataataatt tttccagtct ggctcaagag ttaacacaac 2280 gtattaattc gtcttcgatt ccaaaaacat tacagttgaa atttgataag ttaaaacaag 2340 aattccaaat ttttaaatcg actggaaata gaacagaaaa tctagaaaga ctacgaaatg 2400 ccctggacac aattcaaccg acatcaactg aatcagaaag aacattttca gtagcaggaa 2460 attttttatc aaagttgcgt tgtaaattat cagataaatc gttgaatgct ttgatcttcc 2520 tcaaatatta ctttaaaaat ccttggtgtt aaataaaata aaaattttta gttataaaaa 2580 ataaatagtg tgtatgttgt tttaattttt tttcttaagt tctgaagctt gaaagcttga 2640 caaatatgac caagctcaag ccgagccaag cccaagcctg aaacaccaag ctttccgaaa 2700 ctctg 2705 // ID BEL-145_AA-I repbase; DNA; INV; 6798 BP. XX AC AAGE02022721; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-145_AA_; KW BEL-145_AA-LTR; BEL-145_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6798 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02022721; Positions 8924 15721. XX CC Positions [5771-6349] - Integrase core CC 'GTAGC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1415..6703 FT /product="BEL-145_AA-I_1p" FT /translation="MPPKTIASTKKELSLRVLQTRLNSSLAMFDEILKFSQ FT SMNEDTTAIQVSIRVDKLEELWETVNEAIIEVESHEDYADDDEDCSKVRAK FT FIERYFTVKTSLLEKAKEVEEVASNQSRILDSSQSQSTLDHVRLPQIKLQS FT FSGNIDEWLSFRDLYTSLIHWKTDLPDVEKFHYLKGCLTGEAKALIDPLAI FT TKANYLVAWDTLTKRYNDSKLLKRRQVQALFKLPMLTRESASELQSLLEGF FT ERIIQNLDQLVQPQDYKDLLLLDIIGSRLDPVTRRGWEEYSASREQDSIKD FT LTEFLQKRIRVLGSLPSRNADLKGDSAAITKKKFFGPRTSHSAVQASGGRC FT VACSESHPLYQCSSFQRLTVSARDKLLRNHSLCRNCFRRGHQASDCSSRFV FT CRNCKAKHHTMVCFHSDKGDGAKSNSSSNSTQADSSNVTLERNAHPTTSTT FT TDSVSSNTAFQRSLSVLLATAVVLVEDETGNSFPARALLDSGSECNFMTEN FT LCQRLKIQRRRSDISVLGIGQANTRVKHKVTATIKSRVSEFSRNMEFLILP FT RVTADLPTTSIQTACWEIPGGVELADPAFFNSKAVDMVLGIQYFFAFFKTG FT KEIELGKGLPLLTESVFGWVVSGLVNADRPGSTISCNMAVTDNLEEILSRF FT WACEEIEFPNNNSPVEAKCEEHYSCTVQRGVDGRYTVSLPKDDTVRAQLGS FT SRDIALRRLQGIERRLLREPILRDQYEKFMEEYLELGHMRRVDVPTSANRC FT YLPHHPVVKESSTTTKVRVVFDASCKTSSGVSLNDVLLVGPVIQDDLRAII FT LRARTKQILIVADVEKMFRQIWIDQKDLVLQNILWRKDFNERAETYELCTV FT TYGTKPAPYLATRTLQQLAVDEQQRFPMAARAILEDVYMDDVLTGEDDTEA FT AKQLQVQLEEMMESGGFHLRKWASNSASVLEGIANENLAVAKEDGVQLDPD FT PAVKTLGLYWFPETDVFKFQFKIAKIGPVENLSKRKLLSMIATLFDPLGLI FT GAVIVTAKIFMQRLWCWVDEDGKKLDWDRPVDANECEKWIELHQQIPILNE FT ISIPRCIVLPGATNVQIHFFSDASKAAYGVCAYVRSTDNCGRSYVALLTSK FT SKVSPLKTQTIPRLELCGALLAAQLKEKVFNALKIDARAFFWVDSTCVLQW FT LKAIPATWNTFVANRVAKIQRSTENCEWNHVQGVENPADLISRGLLPQEML FT ENQLWWEGPPWLKREREFWPNQSTGGAPDEATEEVVEEVRRSRAHCVTTET FT ETFGRNYVKKFSTFTDMICRTAYILRFIKMLEKSKENCKFPSFLTTEERAK FT AELAIIRCIQREEFDGEWAALSKNDAVARSSPLRWFNPILSKDNVIRVGGR FT LGHSQETENTKHPIVIPASHPLTRLLFQHYHQKLLHAGPQLLLATMRQRYW FT PLGGRNIARKVVHDCIRCFRTKPNTVEQFMGELPPSRVTVARPFSRTGVDY FT FGPVYFRPGPRRTAVKAYVALFVCMCTKAVHLELVTDLSTDRFLQAFRRFI FT ARRGRCTDVYSDNGTNFVGARNQLEQLFELLRSNDHRQKMVTECAKECITW FT HFNPPSAPHFGGLWEAAVRSAKGHLLKVLGELVVPFEDLCTLLVQVESCMN FT SRPLTPMSDDPSDLEPLTPGHFLIGAPLEQLPDHDYTDFPVNRLHMCQLIQ FT QKLQIFWKRWRMEYLTQLQGRMKRWKPAIPISVGKLVVIKEDNTPPLRWKM FT ARIQEVHPGADGVVRVVTLKTASGLLKRAVEKICLLPPSSQPV" XX SQ Sequence 6798 BP; 1923 A; 1480 C; 1609 G; 1786 T; 0 other; tttggtcctt cgaaccggat gactgaagga aggtcgtcca aaaatcaccg gtttgcctat 60 cgatccaccc aaggataacc gccattggag gcacagttgg gatcgcctat tcacgccatc 120 acaaagccag tcgtcttttg gctcaatatt cgcccgatca accgccatct cgatcatctt 180 cgcatcttcg aacaatcgcc atcgttgaag acaccaaggc tgcgggcagt aacacactgt 240 aatacgtaaa aaagcacttt agaacttctt gctggattcg gtaccgatta atggtcccat 300 aggtgggtac acattagtat tatgtccagc cgaagctagg atcaacaact attgctaaca 360 ctacatgcat tccatctatg aacacttcgc cattcatcac tgcaccattc aacatttcca 420 ccaacactcg agggagcact agaatccgat atccattgga gatcttcgaa ggcaacgttt 480 aaccactggg tagtaaaccc tggttcatcc aggtaaatat tcacggatat atgtccagcc 540 gtctctagga tcttcggata atcttcacca ccaattattt cgatttcata ctgctccaat 600 tcgacgagta aaccaaaaca aactacggaa tacgttcatc acgattagca gtattacacc 660 taccactact atcgatgtga gaaagatcgc catctcgtta cgtcatcatc tgcccaattg 720 atcatcgatg tgaatcgatc cctttcgtca tcactatctc cattgagcat tacccggacg 780 cgaatccact gcatcattgc atcactggtc atcagttgca gaatttagtt gcacaggtta 840 gttagaacaa acacttttcg cactaattag tatatggcca gccgatgctg ggaatctacg 900 gaaactacat gtggcacttt tgacttttct ttgaacactg atacgtggtg atgtatgcgt 960 gggatgatcg atatcaatat ttgattttgt ggaacgtgtt ttcgaggggc cgatcggagc 1020 attctctgca cgttctacgt attgaggagg aacagtagtg attaccgtga ctagccggca 1080 ctctgttttt tggaacacca gtaccacgaa tttgtctctc aggtcagttt aagacagtat 1140 atgccctgcc gaggcttcgg actccgatgt atactacatg aatgtttttt cttcatgcaa 1200 tcccatggta ctggaaacct attgaacgag attttcccaa tataaatttg aatcaaccgt 1260 caattggaag ttgtgcatac ctgtagagag gaggagttca tcttttgtga aggtcagtaa 1320 gctctacagt gtatgtcttg ccgaagcaag taggaaatac ggaatattac aactactttt 1380 ctttcaactt tcctgggacg aatcttgccg tacgatgccg ccgaagacaa tcgcctcaac 1440 gaagaaggag ctgtcactcc gagtgttaca aactcgactc aactcatcat tggcaatgtt 1500 tgatgagatt ctaaaatttt ctcaatcgat gaatgaggat accacagcca ttcaggtgtc 1560 aattagggtt gataagttgg aagagttatg ggaaactgtt aatgaggcta ttattgaggt 1620 agaatcgcat gaagattacg cagatgatga tgaagattgt tctaaggtta gggcaaaatt 1680 tatagagcga tattttacgg tgaaaacgtc attgttggag aaggccaagg aagtggaaga 1740 agtcgcttcg aatcaatccc gtattttgga ttcatcacaa tcgcagtcca cgctggatca 1800 tgtccgcctc ccacaaatca aacttcaaag tttcagtggt aacattgacg agtggttgag 1860 ttttcgggat ctatacacat cgttgatcca ctggaaaacc gacctccccg atgtcgagaa 1920 gtttcattat cttaagggat gtctcaccgg cgaagcaaag gcgttgatag atccacttgc 1980 tattactaaa gcaaactatc tagtagcgtg ggacacatta acgaaacgtt acaatgatag 2040 taagctcttg aaacgtcgcc aagtccaagc ccttttcaag ttgccaatgc tgacaaggga 2100 gtcagcgtcg gagcttcagt ccttactgga aggatttgaa aggatcattc aaaacctgga 2160 tcaacttgtg cagcctcaag attacaagga tctgctgtta cttgacatca tagggtcgcg 2220 ccttgatcca gtcacacgac gaggatggga agagtattca gcttcaaggg aacaggattc 2280 gattaaggac ctcacggaat ttcttcaaaa gcggattcga gtattgggct cgctaccatc 2340 caggaatgct gatcttaagg gagattcagc agctattacc aagaagaagt tctttggtcc 2400 acgaacaagt catagcgcag tacaagcatc tggtggacga tgcgtggctt gttcagaatc 2460 ccatccgcta taccaatgtt catccttcca acgtctaacg gtgtccgctc gagataagct 2520 tcttcgtaat cattcgttat gccgcaactg tttccgacga ggacatcagg cttctgactg 2580 ttcttctcgt ttcgtttgcc gaaattgtaa agcgaagcac cataccatgg tttgtttcca 2640 ctcggacaag ggcgatggcg ccaagagtaa ctcttcatca aattctacgc aagctgattc 2700 gagtaatgtg acattggaac gaaatgcgca cccaactact tctacaacta cagattcggt 2760 ctcctcaaat actgcctttc aacgttcttt gtcagtactg ttagctacgg cggtagtttt 2820 ggtagaagac gaaacaggta acagtttccc tgctcgggct ttgctagatt ctgggtccga 2880 atgcaatttt atgacagaga atttgtgtca acggttaaaa atccaacgaa ggcgttcgga 2940 tatatctgtc ctggggatag gacaagcaaa tacgcgagtc aaacataagg ttacggcaac 3000 catcaaatca agggtgtcag aattttctcg gaatatggag ttcctcattt tgccaagggt 3060 tacagcagat ctacccacca caagtatcca aacggcttgc tgggagattc ccggaggcgt 3120 ggaattagcg gatccagcat ttttcaattc caaggcggtg gatatggttc taggaatcca 3180 gtatttcttt gcgtttttca agaccggcaa ggaaattgag ttagggaaag gactcccatt 3240 gttgacagag tcggtattcg gctgggtggt atccggtttg gtgaatgctg atcgtcctgg 3300 gtcaacaatt tcctgtaaca tggcggttac agataacctg gaagaaatat tatctcgctt 3360 ttgggcctgc gaagaaatcg agtttcccaa caacaactca ccggtagaag caaagtgcga 3420 ggagcactat agctgcacgg ttcaacgcgg tgttgatggc cggtatacag tctctcttcc 3480 gaaggatgat acggttcggg cgcagttggg ttcatcaagg gatattgccc ttcggcgtct 3540 ccaaggaata gaacgaagat tacttcgaga accaatatta cgagatcagt acgagaaatt 3600 tatggaggaa tatttggagc tagggcatat gcgcagggtg gatgtgccca catcggcgaa 3660 caggtgctat ttacctcacc atcccgttgt aaaggagagt agtacaacaa ccaaagtcag 3720 ggtagtattc gacgcttcgt gtaagacgag ttcgggagta tcgcttaacg atgtgttact 3780 ggttgggcca gtgatccaag atgatttacg cgccataatt ctgcgggctc gaaccaaaca 3840 gatcctcatc gtggcagatg tggaaaaaat gtttcgccaa atttggatag accagaagga 3900 tctggttttg caaaacatct tgtggaggaa ggatttcaac gaacgcgccg aaacgtacga 3960 actttgcacg gttacgtacg gtacgaaacc cgctccgtat ttagcgactc gaacattaca 4020 gcagctggcg gtagatgagc aacaacgatt tcctatggca gcaagagcaa tcctggaaga 4080 cgtatatatg gatgatgttc tgacaggtga ggatgatacc gaagcggcaa aacaattaca 4140 agttcaactt gaagaaatga tggaaagcgg aggatttcat ttgaggaaat gggcctcgaa 4200 ttcagcatca gttctggaag gaatcgcaaa cgaaaatctt gcggtagcga aagaggatgg 4260 tgttcagttg gatccagatc cggctgtgaa gaccttaggg ttatactggt ttcccgaaac 4320 ggacgtgttc aaattccaat tcaagattgc aaagatcggc ccggtcgaaa atttgtcaaa 4380 acggaaactt ttgtcgatga tagccactct gtttgaccct cttggcttga taggcgcagt 4440 cattgtcact gctaaaatct ttatgcagcg tttgtggtgt tgggtcgatg aagatggaaa 4500 gaagctggat tgggatcgac ctgttgacgc caatgaatgt gagaagtgga tagagctaca 4560 tcaacaaatt ccaattctta acgaaatcag cataccacgg tgtattgtac ttccaggagc 4620 aacaaatgtt caaattcatt tcttctctga cgcatccaag gctgcgtatg gagtttgtgc 4680 ttatgttcgt agcacagata actgcggaag aagctacgtc gcattattaa cttcgaaatc 4740 caaggtttcc ccattgaaaa cgcaaacaat tccgcggctg gagctttgtg gagcgttgtt 4800 agcagcacaa ctgaaagaga aggtatttaa cgcgctaaaa atagatgcaa gggcattctt 4860 ttgggtagat tccacctgcg ttcttcaatg gttgaaggca atcccggcta cttggaacac 4920 ctttgtagct aatcgtgtgg caaaaattca gcgttcaacg gaaaattgtg aatggaacca 4980 tgttcaaggc gttgaaaacc cagcagacct tatttcgagg ggattgttac cgcaagaaat 5040 gttggaaaac caactttggt gggaaggtcc accgtggctc aagcgggaaa gagaattctg 5100 gccgaatcaa tctacaggag gtgcaccaga tgaagcaacc gaagaagtag ttgaagaagt 5160 tcgtagatct cgtgcacatt gtgttacaac tgaaacagaa acatttggaa gaaattatgt 5220 gaagaaattt tccactttca ccgatatgat ttgccgaaca gcgtacatac ttcgtttcat 5280 aaaaatgttg gaaaaatcga aggagaactg caaatttccc agttttctaa caacagaaga 5340 gcgtgccaag gcggaattgg cgattatacg ctgtattcaa cgtgaggagt tcgacggaga 5400 atgggcggcc ctgagcaaaa atgatgctgt agcaagaagc tcaccacttc gttggttcaa 5460 tccaatactt tcgaaggata acgtgattcg agttggtggc agattaggac attcccagga 5520 aaccgaaaac actaaacatc ctattgtgat acctgcatct catccactga caagactact 5580 tttccaacat tatcaccaaa aacttctgca tgccggaccg cagcttctac tagcaaccat 5640 gagacagcgg tattggcctt taggtggaag aaatatagca cgaaaggtcg tgcatgactg 5700 catacggtgt tttaggacta aaccaaatac agtcgagcag tttatgggag agctaccacc 5760 ttctcgagta acagtagccc gaccattttc caggacagga gtggattatt ttggccctgt 5820 atacttccga ccaggcccac gtcgtacagc tgtgaaggcg tacgtggcct tatttgtttg 5880 tatgtgtacg aaggcagttc atttggaact tgtgaccgac ctatcgacag atcgctttct 5940 gcaagcattc aggagattta tagccaggcg aggacgttgc acggatgtgt attccgataa 6000 cgggaccaat tttgtaggtg cccgaaatca actggaacag ctattcgaac ttcttcgtag 6060 caacgatcat cgtcaaaaaa tggtaacgga gtgtgccaaa gaatgtatta cttggcattt 6120 taaccctcct agtgcaccac atttcggtgg gctctgggag gctgcagtgc gttctgcaaa 6180 gggacacctt ttgaaggttt taggtgagct cgtcgttcca tttgaggatc tctgcactct 6240 cctagtccaa gtagagtcct gtatgaattc cagaccgcta acacctatgt cagacgaccc 6300 atctgatctg gaaccactca ctcctgggca tttccttatc ggtgctccac tagagcaatt 6360 accggaccac gactataccg attttccggt gaatagattg cacatgtgcc agttgatcca 6420 gcaaaaactg cagatattct ggaagcgctg gagaatggag taccttactc aactgcaagg 6480 tcggatgaaa cgttggaagc ctgcaatacc gatttcagtg ggaaaactgg tagtgattaa 6540 ggaagacaat acgccaccac ttcggtggaa gatggcaaga attcaggaag tacatccggg 6600 ggcagacggt gtagttagag ttgttaccct gaagaccgct tcaggattgt tgaaacgcgc 6660 ggtagagaaa atttgccttc ttcccccatc gtctcaacct gtttaagcca tgaatcgatc 6720 aaaacccacc accaatttag tgttcgaaga ggtcattctt tattttcaga aattccggaa 6780 tttcaggttg ggggagta 6798 // ID Gypsy-5_CQ-LTR repbase; DNA; INV; 153 BP. XX AC AAWU01034000; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_CQ_; KW Gypsy-5_CQ-I; Gypsy-5_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-153 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 390-390 (2011). XX DR Genome; AAWU01034000; Positions 49522 49674. XX SQ Sequence 153 BP; 30 A; 34 C; 38 G; 51 T; 0 other; tgtggcggag tctgtggact cgccctttgc gtcattagtc gtgtcatcgt agtgacagtt 60 tgatgtccgg tgtaaataaa gagtcagtct gattttaacc tgtacgtaac cggtcgtctt 120 tatttcgatg cgccctcatt gcgattatcc aca 153 // ID Gypsy-5_TCa-LTR repbase; DNA; INV; 245 BP. XX AC ChLG7; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_TCa_; KW Gypsy-5_TCa-I; Gypsy-5_TCa-LTR. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; ChLG7; Positions 16910614 16910858. XX SQ Sequence 245 BP; 64 A; 67 C; 53 G; 61 T; 0 other; tgtggcggca ctggaaccac acaggcgatt cacggttgat cgtaaccgta gccacaacac 60 ttcccatgac aagtgacagc ccgccagaac gacactagcg acaccacctc tcttggcgtc 120 gagcgtcact cggtcacacg tgcataattt tagttttgta accgtttcaa tttgttgcgt 180 taataaagga gtttatgtaa agtgagtttc gttggtctcc tcgcgacaca ccatataaac 240 ccaca 245 // ID Kiri-27_AAe repbase; DNA; INV; 4099 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-27_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4099 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 722-722 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 3 sequences with >99% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 324..1067 FT /product="Kiri-27_AAe_1p" FT /translation="MMSTRSQKPITSPAATTGYTLESVMTMLSKRFDETNQ FT NIDTSKDELNNKLDSIKEDLQKQIMSVEQNFSALKAACDDEHRSLSSRIDQ FT ATESVQRLENRAELIVAGIPFITNENLRDYILHIAETIGFDCDRVSHIHCK FT RLRSGNLPDGSECFILLQFSVVCLRDEFYSKYLSKRDLKLSHIGFSSDRRI FT YINENLTVNARAIKRAALKLRRENKLAAVSTKLGVVQVKKTANGPSMSVAS FT IDQLMQI" FT CDS 1240..4029 FT /product="Kiri-27_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLSSRNSTLSQHTIPRIVMNTRRFCKFEEIKLMLLDS FT KISVACFSETWLNLSIPDQAISVPGYHLVRNDRVYKKGGGIAIYIREGISF FT KSVYSSNVTASSVCKTECLALELQIDETSMLISAIYNPPDNDASTFMLCEL FT VTLLSTYEHCVLIGDLNTDWSRRSTRKERLAELFTTYSMTVVGTEPTHFYN FT DGCSQLDLLVTNRPDIATSFHQVSVPGLSKHDLIFCSFDIDMCPSSRNYIG FT DYRDYVNFDPNTLRNNINNISWNAFYDIDDVNHLIEFFNNSILAVHDSSIP FT IRTVRRRKRYNTWFNDDIRKAILERDLAYKDWRVAPRERKDEKKHVYTQLR FT NRTNALIRRVKMEHIGRNIVMAGGSPTALWKMVKDLGVGKSATQTDCSFDP FT ELVNQKFASHFTIDNESSFEYQDPHPVGSFNFRQIEAYEVINAFYDVKSNA FT VGIDEIPTRFIKTILPLIVDHVVHLFNKIIVSSVFPEIWKCAKIIPLKKLR FT HLNSMDNLRPISLLSVLSKVLEKLLKKQISAHISSLNLANEDQAGFRRGHS FT VKTALLRVYDDIAAFVDRKDKVVLLLIDFSKAFDTISHQRLLHKLQTRFSF FT SLSAVELIKSYLVGRTQVVRCNDLSSDAVAVTSGVPQGSVLGPLLFSLYID FT DLPGVLRYCRVQMFADDVQLYCSHPEAATAAAMLNEDLQRVINWASRNFLR FT INHTKTKAILFGDNRHVDPAILPLTVEGSQIQFTDYANNLGIIFQSNLGWD FT KAVSTQCGKVYAGLRTLRATTSDLPQNIKLQLFKSLLLPHFIFGDVVHLGM FT SMAMFSKLEVALNDCCRYVYNMNRYTSVRNLQRNLLGCPFRRFYEMRACLQ FT LWRIFHRSEPANLARKLRLLQGRRSLNFVVPLHRTAIYGDSLFVRGVVYWN FT AQPMELKQIHSEAMYKKAALEYFNRN" XX SQ Sequence 4099 BP; 1175 A; 834 C; 869 G; 1221 T; 0 other; agttctggca accatgccaa tgaagtataa cagacgcgaa tagtgtacgg acgacaaaca 60 gttttttaag tatgctaagt tcatcccggg agtttgagtt tgcattaatc gagtaatata 120 aatatgtatg gacattgttg aagtgaaatc gcgtaaaatt cgtttcgccg cgctacatat 180 tctccgaccg aaaatcattg ctgcaggtca ttagaagaaa gccataagat cccgaattcg 240 cacaattgtt tccgtacata gtggtctact gtgggaatca tctcatattg ctatattctt 300 ccgctgttcg taaacaagtt cgaatgatgt caaccagaag ccaaaaaccc attactagtc 360 cggctgctac aactggatat acgttggaaa gtgtgatgac aatgctttcg aaacgttttg 420 acgaaacgaa tcagaacatt gatacgtcta aagatgagct caacaataaa ctcgatagca 480 tcaaagagga tcttcagaag cagattatgt cggtggaaca gaacttcagt gctttgaaag 540 ctgcctgcga tgacgagcac agatcactca gcagtcgtat tgatcaggcc accgagagcg 600 ttcaacggtt ggagaacaga gctgaactga ttgttgcagg gataccgttt attaccaacg 660 agaatctgag ggattatatt ctgcatattg cggaaacgat aggttttgac tgcgacagag 720 tgtcccatat ccactgcaag cgtctacgat ctggtaacct tcctgatggc agcgaatgtt 780 tcatcctctt gcagttctcc gtggtgtgtt taagagacga attctactcg aagtacttat 840 ccaaacgaga cctaaagcta agtcacatcg gtttttcctc ggatcgtcgt atttacatca 900 acgagaactt aactgtgaat gctagggcaa tcaagcgagc tgctttgaag cttcgtcgag 960 aaaataaact agctgctgtt tcgactaaac ttggagttgt acaagtgaag aagacggcga 1020 atggtccttc catgtctgtc gcatctatag accagttaat gcaaatttaa ttatgtattt 1080 gaatctctta gtgtgcttac ttattgtttg ttaatgtaaa aatgtttgta aaagtttttg 1140 tttttgaaat tgtcggaata ttgtaattgt atgtgaaaaa atggagaaga gtatgtgtat 1200 gcagcctatt agtgtacatt gcttcccaca cggtacctga tgttatcatc gagaaactcc 1260 acactctcac aacacactat ccctagaatt gtaatgaaca ctcgacggtt ctgcaagttc 1320 gaggaaatca aactaatgct gcttgattcc aagataagtg ttgcttgttt ctcagaaaca 1380 tggctaaatc tttctattcc tgatcaagca atatctgttc ctggttacca tctcgtacgt 1440 aatgatcgag tgtataagaa gggtggtgga atagccatct atattcgtga aggcatctct 1500 ttcaaatctg tttactcttc aaatgtaact gcttcttcgg tatgcaaaac agaatgtctg 1560 gccctagagc tgcaaataga tgaaacttct atgttgatct ctgcaatcta caatccacct 1620 gataatgatg catcgacctt catgctgtgt gagttggtta cacttttatc aacttacgag 1680 cactgcgtgc tcatcggaga tcttaatacg gattggtcac gtagaagcac tagaaaagaa 1740 cgattggcgg aactgtttac tacttattct atgaccgtag tgggaactga gccaacccat 1800 ttttataatg atggatgctc tcaactggat ctgctcgtca caaaccgtcc ggatattgct 1860 acttcctttc atcaagtttc tgttcctggc ttgtcaaaac acgatctcat attttgttcg 1920 tttgatatcg atatgtgccc atcgagtcgg aattacatag gagactatcg tgattatgta 1980 aatttcgacc caaatacatt gcgtaacaac atcaacaata tctcttggaa tgctttctac 2040 gatatcgatg atgtgaacca cctcattgaa ttcttcaata atagtattct agcagtgcat 2100 gattccagta ttcctataag aacagtaaga cgacgcaaac gttacaatac ctggttcaac 2160 gacgacattc gaaaggctat tttggaaagg gacttggcgt ataaggactg gagagttgcg 2220 ccaagggaac gcaaagacga gaagaaacat gtgtatacac agttgaggaa tagaacgaat 2280 gctctgattc gccgtgttaa aatggagcat attggcagga atattgttat ggctggtggt 2340 tcacctaccg ctctctggaa aatggtaaaa gatctaggtg tcggcaaatc tgcaactcag 2400 acagattgtt cttttgaccc agaattggtg aatcaaaagt tcgccagtca ttttacaatc 2460 gacaatgaat ccagctttga ataccaagat ccacatcctg ttgggagctt caattttcgt 2520 cagattgaag cttatgaagt gattaatgct ttttatgacg tcaagtctaa tgcggttggc 2580 atcgatgaga ttcctaccag atttatcaag acaatactgc cactaattgt cgaccacgta 2640 gttcacctat tcaacaagat tatcgtatca tctgtcttcc cggaaatttg gaagtgtgct 2700 aagataattc ctttgaagaa gttgaggcac ctaaattcaa tggataatct gcgtcctatc 2760 agcttgttga gtgttttgtc gaaggtctta gaaaagctgc tgaaaaagca aataagtgct 2820 catatttctt cactaaatct agcgaacgaa gaccaagctg gcttccgacg tggtcatagc 2880 gtgaaaactg cgctgcttcg tgtatatgac gatattgctg catttgtcga caggaaggat 2940 aaagtagtac tattgctgat tgatttcagc aaagcgttcg atacgatctc tcatcagcgt 3000 ctcttgcata agctgcaaac acgtttctct ttctccttat cggctgttga gttgattaaa 3060 tcgtatctcg ttggtagaac acaagttgta cgctgtaacg atttatcatc tgatgcagtt 3120 gccgtgactt ctggtgtacc acagggctca gtacttgggc ccttgctgtt ctccttatac 3180 attgatgatc tccctggcgt actgcgatac tgtcgggtac aaatgttcgc agatgatgtg 3240 caactttact gttctcatcc cgaagctgca actgcagcgg caatgctgaa cgaagatcta 3300 caaagagtga taaactgggc ctcccgaaac tttctccgca taaaccatac aaaaactaaa 3360 gcgattttgt tcggtgataa ccggcacgtg gatccagcta tcctgcctct taccgttgaa 3420 gggagtcaaa ttcaattcac cgattatgcg aacaaccttg ggattatttt ccagagtaac 3480 ttaggctggg acaaagctgt atcaacccaa tgtggaaaag tgtatgctgg tctccgtact 3540 ctgcgtgcca ctacgtcaga cctgcctcaa aacattaagc ttcagctttt caagtccttg 3600 ttgttaccac acttcatctt tggcgatgtc gttcacctcg gaatgagtat ggctatgttt 3660 tcgaagctgg aagtagcact aaatgactgt tgccgctacg tttataacat gaatcgctac 3720 acttctgttc gaaatctcca aagaaatctt cttggctgtc cttttcgccg gttttatgaa 3780 atgagagcat gtttgcagtt gtggaggatt tttcatcgca gtgagcctgc caatcttgct 3840 cgaaaactga ggctgcttca aggacgtcgt tcactaaatt ttgtggttcc attacatcgt 3900 actgcaatat atggtgactc gttatttgtg agaggagtgg tgtattggaa tgcgcaacca 3960 atggagctaa aacaaattca ctcggaagca atgtacaaaa aggctgcact agaatatttt 4020 aacagaaact agtttatttc tcgttcaaaa aaacgtttag tgttcgctag aacttaagtt 4080 actacaatgt aaatatcta 4099 // ID Gypsy-167_AA-LTR repbase; DNA; INV; 1014 BP. XX AC supercont1.341; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-167_AA_; KW Gypsy-167_AA-I; Gypsy-167_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1014 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.341; Positions 269616 268603. XX SQ Sequence 1014 BP; 319 A; 220 C; 233 G; 242 T; 0 other; tgtaacagtg ttacaagaag aacacttggt tatcatttca taacgctatt cattcacgcc 60 aaaaaaaaaa ataaaataaa atttaaaacc taagttgacg catgggacca gcaagggtta 120 aacgataact tttgtacaac atgagcgccg ctagtagtgg gagattgaac cacaatttta 180 aaccactcga ttggtaactt tgttgtgatt atttcatcac ctaagaacga tgatcatcag 240 gctcatcagt tggactgcca actgaagagt tgtttttcta aaatacaata gacttacaga 300 acaaaaaaac ggcttagcgt cggggcacaa ttgaaaaata aaaactacga tcaagtcttc 360 tcagttcggt cgttgaaaga gtcagacgtt ctttagatga aaagttgaaa tctaagtggt 420 atcataagac gatcagaaag ttaaatagga gaagttaagg tggataaaag tgatgctaaa 480 ttgattggcc ttaatgaaaa gtttttgagt acaaagtaaa actcctgaag ttacaaaaac 540 tgaaaaaaaa ctgcacacaa ttctgtagtg caggtgcaga attccctcga ggaccaggac 600 atcattatcc ttcaagtgaa ggtgtcggtg tagggtgcag gaaatagcga cacgccgtcg 660 ttgtttctag ggacgaatcc gaagcccggc ggcttgtagg cctgccctcg tcgtcgagcc 720 agcattcgac cctcgcgcat tccagggccc aagggtatcc acgcccttag gagacgatcg 780 gtcgccaaat tcgcatcgtg ctccgcggtc tcgaagggaa tatcaagagg agtcctcgtt 840 catggaagct ccctccatcg gtgcaaagcg tgaaaccatc agcatttgaa ctgacagcat 900 cgggatcgct taaacgcaat cgaagacaat cacctgcgta gttccgatcg tcgagccgcc 960 gtgcagttcg gccatcagtc aatacgccgc cgaagtagtt tggccgtcgc acca 1014 // ID Polinton-3_HM repbase; DNA; INV; 38791 BP. XX AC . XX DT 02-FEB-2009 (Rel. 14.02, Created) DT 02-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Polinton DNA transposon; Maverick; Polinton-3_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-38791 RA Bao W. and Jurka J.; RT "Polinton DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 457-457 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 3281..1992 FT /product="Polinton-3_HM_1p" FT /translation="MNEDQEKAYVALLNNQNVFLTGKAGTGKSFTINIIYE FT RFIKSKKNICITASTAIAAQMYSDASTVHSFAGLGYRLESVETILKNMTDV FT VKERLKNVQILVIDEISMLPVHVLKTLNMILKNIKKSHRLFGGCVLLVAGD FT FYQLTPVGEETLLFEDPFWNECEFVNVELVKNFRQNKDLTFQTHLNNVREG FT IFCESTLAFFRKRLEPADKHLDPSYTRLFMFNKDRIAHNEEYLKKTNSPYF FT IFXSEIKTFNNFKINEKWPFKTPQNVTLKKGVPVICVRNISELGIVNGTQG FT VVKSVNLERVVIMVDEQRIVITPQTEMLYDCKRNVIAECKGMPLVLAYALT FT IHKVQGLTLSNVVIHLTRXQFTWHLFYVALSRAQTRKSVYIIAQPSFFQTF FT LKNINIHSAVKSFYSNLIKPTPEENDYKKRIKLQM*" FT CDS 23347..24969 FT /product="Polinton-3_HM_13p" FT /translation="MDEESLPSLTAKLTKEEIEGYQDRWRGFDVPVQQGEV FT LTDYDVEYYPDTNTDMTKKIQFTIQPGENLIYRTMVLNFTLNLYKSILNNV FT LTPFTAADAAFIRKIAMRANFCLNLIKGIKVYPNFEWNNQTTANNSLNTNE FT KNEQRGYIQRFKYPTSYLDKVYLAQTANYRVDSSGPTPNLGLSVSRAATLA FT HMTDPNTPIIADTTTGPTNYQALNNAQFVIPACSPTGCQFSIPLYHLDDIF FT ELEEYFGKNLKLVIILEIETNLAQLFENMPQTTAADMTAMNAMAFRLNNPT FT ITANEVKVTDTYAFERNKFIDAFPNTQYVAMHKNYWEVRSIDVPPQTKTMV FT IPFNTFKQDSLTPHAVCVSLFCKDQAEHSNFRCRAPRNMAQELIASVTFNN FT VRINFLQGGGSTFTYDLSLPKTRKQLYREYLSFINDGTNSENDMRNQILRY FT STDYVGEFAATENDYWLNSYNEVLAFDLTADKNRVTDMDSPNVLGSSNASM FT TLKFTTSINNAVVRIDMLHKSQFVMTGKGFNENLALMPKSLSNT*" FT CDS 22562..23140 FT /product="Polinton-3_HM_12p" FT /translation="MFHNKRNRCFVKKNSRHLLNPLGRVVDRSKYDLEDEN FT VKLKNGFINLNDQLQLLQQQLEKEKQEQHHQFLKFKDDLAKQKIPKQVESP FT VEKITDVPVAVEKTPIFERPTIRDTYEKPAIQVEKTRYLCAMCQNAVNAVD FT IIHGYCKDCNRRESLFVKKTPPVDKKVIKKKYMSKGYKKALKEVKKKFEQE FT ED*" FT CDS 16165..18237 FT /product="Polinton-3_HM_9p" FT /translation="MKRVYNPPTVDELYNGIIKKLKDNRKELTKLDYLKEA FT AKIEKSNEVMINGNKLDPNKKYTIEPELLTLPQFGQYAENQWTDRGQYEIA FT AEQQLEFNRSLQEQYPYMTLLAVDAPNVITDPNVWNAFQSVNQDIPVFVDL FT LKKDKIEEFNARNSTRQLVEFNRTQRQTKNSQTLRSEALKDFYKPPTKPAR FT HVLVPLESHEFNALYRDVNEIEKTQLLNDSRLSSEKLIKDWIGKALKGHSG FT SLQQNAYKLSFAVLKTDSNVPNNVRLFLTYKSKVFDTNETLEMIISNMTNK FT QLPPLSDERLVYQEANNIFTLINSHVSEEPKYVLYDKYDLVYLPFSKLLKA FT ATKGQVPFRDTKKWDSVSLDKTQQKDFKWVSLNTSFATANLLDDTSLFNRI FT IEQLIGSLEDVVNRWIMDLLAYYRMSGQQFIDDMMDIVKDSQKFIEPERLN FT KLDYLTKKFYKADDMDSALLFFYILSYRFFHQRFIALRKQAFSAGIYAAIM FT NIMEKEKEFMANVIDNLPNLFNTNSNVSVQAKIETIMVISTLYSNNASSLH FT FLVSEYAALGPEGNPKKVKDFCLKYASFIKFDQNTPPNVITKKIHTMQLRS FT DAKKSQKTSETFRQRLEFFDKPESSFNTDTSQNFSTSTPLKNKDYVDPKDV FT RNPIKVDEETYSPDTRKQIDDLKAEIAKKSAAFSLSKNGSD*" FT CDS 18224..19468 FT /product="Polinton-3_HM_10p" FT /translation="MDQIDDLVSSALTTIQSSLFGLNTIAKIKKNLLALQN FT GSKSAQDQKLFDNIRKFREENEPYNKLIKRWSEMSLLFKVTNVLSKLEEPR FT TPYSYRTHKKSYNLPGQYFHCDLADMSVFNTDKKVYRRNYCVVLVDGVSGF FT VRFKASIGKSAREILVALKSLFEHMNRQEEVFLQTDRGGEFYNKQFEAWCN FT ENKITHFSSKNLQKAYLAENAIRRLKDIYQKLRIAKKLTDKTDWTSFLPLV FT EEKLNKSKRTVSGITPEQLNLDTKQGEALRMLDSFQDQKKARLTFKSQNLY FT RVENETLKKKLPILKPLLVGMKVYKRNFNTDPRLKFAKASTNVVSKWDLKN FT IYNIVKVIRNNDVANNFYSPPDLYKVANALDEKEVYYLKRADITPYIRDRK FT KDYYSNLNEYETEFFDKLNL*" FT CDS 15352..16122 FT /product="Polinton-3_HM_8p" FT /translation="MFKTIEDLDIWLASELQSDESQVKKEITRLNELEKKS FT LVDLEPHSKEERKEELDRVITYEYKEEELDRVITYEYKETELQCKIDKKKA FT MRRISLAARSNLFIPDLKKRKILTENDIIVYMGHFVQKLNPSFMLSCDFFT FT NDERNDIIKAFQDYVDPCIKLFLGEMLFPYSTEDESGKLKRKTFKKQERNN FT LIIDYVSVHKDTLLRYFNSLDFRKFINMVNMYFSKIYAFKSSAPSDIIICH FT ARTYFYNTYGINKKL*" FT CDS 19677..22496 FT /product="Polinton-3_HM_11p" FT /translation="MELPANIIVLTEDQYELRKLVNDYMRRMISTELVPVF FT EKGGMDSILNTEIKNGFYLRSIQYYMDFGNEAGAYTESLNLATDLVTQKLS FT GVSSTRTLLYALQKLQPGVDVFSKWFKVPLTHRKPNSTKQVVKRSDLYYDV FT NSFYVNNQFKNNALLLPLQLLGTYQTALGNDLFGIEFVKDANGNATPELQY FT TWSKAAAPDFYLLYGRTKFSYEDTSGWDKTNPWFPMEISSKRQLALPDLMI FT YSHSSDRDPDTPVAFDFGPWSNITRDTQGPTPYETLFNWARTFYQFFGPDP FT TDPSVHVWQNKSQAEIAALPFLVFVEHYKNTTNVEVLNLIDPTRPYYNMTK FT TSNKRIVYRITPVRYNILDSFYYVSEVKMDLYLANMMKSTGPTPSTPELKE FT LQAFYYKQFLPWNVAQCIKHVVNDPLVKKTAIQYNDKTQSMSPWYDLDYVP FT LVDTIYRTCVDSIPIAFTSDKNSPDVLKFLLGNMVGTYMRKNNVISNNLPV FT DKTGVPQFPPHHFMFLQQQLFFPVDFTKDHYVTYKFTLENVQFNDMPMPYL FT ASNELVWKEFQWQTPQFFPNSLGGSEPLLCNKPYGLTELEVPYDYTNISVG FT TGNESFWILPSFTNIFNNDHSVRTARRFFVPPLTIKTKTPSTISEDVFMLM FT KNVDILNIDPTADYGKYLEVLFYIDEADPIFSNSLKVSLNSTAFIERKIAP FT SDNKYFKFTNSSKSYLEQLSLKQLTVRGVSNTGGIILQVADDGSIIEDWKT FT SLPIQCLKMSLYEPMYKIDKRITVLLERDRSLVGVEKNPYSFKMSAANVRD FT LSTLLNGKRKFICTLANILNISNRVLNFKPTDKNVWITMDGVEQTPTFVNN FT TVLDNVIMSFDMAPYLAIQDSVVQISFENTIRPQFEGFYKTFVINDADDFK FT NLKITFVNSDLAPLQFDETXTFNNPNFTFYIQPFN*" FT CDS 8763..13913 FT /product="Polinton-3_HM_6p" FT /translation="MKFKKEISEEKYNNDVYYKKILWTSKRYSDESGVDDL FT IVSDQMTWDDVRVFIENKTRDVILKEMSRTKRQMPESRNFLVGFVSVVMFY FT RSEADGYRLVKSIFRSPVHYVRTMEISKKIEELVHVNVEGSRHVSHIDSGF FT VLYNLSQFQAQIIGGMSTERFVLLRNSIIGARDTLWYKIKKYDYRNKALCD FT NPLSKRNLKMVQQYLKRIQDDPLNRRKQYEDDPLSRRSLAFVSRFLKKKNQ FT AELFRFCPSYINKFELRNTCIDYSYMVEKGNCFYMSFLVSLVQIKKQWFSP FT NDWQESLDEERLIEYATNDCLDTKEQLEWLKIRSFNISYEGLSILLDEISE FT RSVLNKFNLVIFKWLPKPVRVHLKFEKRNRFQRVAYKRFTKLYESKKIFRD FT DHNPHRVFANEIERVESNENLKRQNTIQLMLLDNKKGTHCVSLLTKRFQYR FT FENAFQGYVCDFCQNCYDRHRAHRTCAGFYKENKLVYPLKKRFYEPQQVLF FT HQSRNCFPLYITFDCETETRDGQLFYYCHSVTLFCTNEKLSAEIFGYDTQG FT NVLRESSMFFLNTKESMEFLFQKLPWSLRQHVNPLLEVVRLKRWQKVEDVS FT DLESLRVFDLFAICYVVISLMQEKLKENQLLRISRSERQQLFDEELPKCTI FT CKLSVDKRVYEKTILSSNHNVYQCEDAYHAPVVYILFGNISKMNHFTLSRF FT SSNYLQVTSKENIEKHLLYLFRIFVCFYLFESRVLCRVQNGSLCEYLMVSN FT DTLLAIVNTVRRRFELEEETFEDMLSDSYFFYNEILFLVGKINCLYTDYSV FT VLKYFDFTSSVSKLGQVWDHCLVAGSLSMSEETPSEYVNNNKATLETYLQS FT LVYRYIDHAVIDHDHWTGSVLGLAHSYCNVQYGRVEKPRVDIFSHNAPFDH FT SVMINKHFEEFCSFPWSHKDDPDLGDFDNIVKRLSLEHSFFITDNMSKIKS FT LSFGDFCVFKDSYRITNTSLADATRNLDTKTKEQVRLSVIHYLIKTKIVSH FT QRGAXLISNVEGTQFFTGKSCFPYSKISYESLEEPLVDVDGNPCCPPIEDF FT LHNKLSRSKNEEDERVALKEPYENVVAMWKMLDLKKYSTLLEIYNVIDTIA FT LAFVLFDFDNKIYNELHFHAISQLSSFSFSSKMNSYIGNIAIQHPPSKEHH FT RMIEECMKGGMSTNGTQSVALDTSYFENVLNEQPNVSNKILIMDENGQYAG FT AQELLLPFAMRSFIEYHYQTLQEVVDVYGGESDGSFVKSALVRCVIKIDDR FT YHWLTGGISPCIKRDSISICDLSPSEILDKRKWDTRSNSFQIALDKGERLV FT YKLDEHETVESLDRLLWMTKHEHVIIVKILAVVGVVRYNLAKEFVSSLSKK FT RAEANAVGDKVKDGSFKLMINGHYGWTARKSETYKNCKFIVNKQKCLRKSL FT QIIIEAVNYLKENGIFEIETYLPASLTTLLDVHDVEQVCIDLSDKRIRRDG FT IDSYSSIYKHLILTEIFHFDTKSVYSPDKTINEFEAMSIALRNEKINEAFC FT LITTKRNVLNDSLRLVACGILGHAKKSVTVFASVIKRALLKENIFCQLVAT FT DTDSLFFSIYGDHCEDFDIKVENVLANDKDFESMMDYSNYPKHHPLYSTRY FT QKMTHRFQNESPNKLITFVIALAAKCYKVTFVDDSHKFRAKGFPQQHLLSS FT HYVNGLSLFVLENYFDSTRSCLNRSSTLFKQNRLQFSTNGIALCEKK*" FT CDS 7137..7976 FT /product="Polinton-3_HM_4p" FT /translation="MFQVKSQSISVFKQILLKRCLIVCQEWRYAADVQDFI FT KDFILNCRHDTDIHDIRDIHEDWIYVKQLITCERNAEMEEHMEDYVQYISL FT ALLKQCRILKYFLYGKMSYEDVEQLDCSLEKVRAPSPYFTLTNLFAPTWES FT LNHSIFTKLQKASTXFSHRKAFYVRPTAYYKYEKLWLIFRYIYLQQIYLKK FT KSHYELLTTSFLSPTDLEEERFHYDMIELFNNFLVRANCYLDDCQIEIKVQ FT IEDTKHHNKHCRLRSCKDISLTDILNMPNLRYMYLQFN*" FT CDS 6093..6686 FT /product="Polinton-3_HM_3p" FT /translation="MKPQLMCKNPLNETSNDVYKSFKSKDKRKKCLGKNLM FT DLLVDDKIEKPQHVCDNCIDKCVIEYNLIRSRRNARLVKKMLKFVHEIQLS FT EQERHFAFSKYMIDKQSYYDSERKCSFYTRNLLNEELESYIDGQLHSKPIF FT KAVQQYKLGIQDVTYKIVTPTDWRVVSQEAMGFALKTIEKAHDVSITKTAK FT NALQKYI*" FT CDS 14220..15359 FT /product="Polinton-3_HM_7p" FT /translation="MKIKMENAYEALCQKRGFPLPLNKSSEIGDKELPSEK FT YDIFKEQVDVYQGNVLIVGPVNSGKSKLTRSIVMNGKFEAKHYHLINKREL FT AKEQKQEWQRVCRSDYSKIKSVYFHELDGKKSLDDIIASLLQIQQEAYETQ FT DSNNRKTKTVFIFDDIQDDCLKSSQYSNLITSGSHLQIGTISCFHALPFKQ FT NTKWNTLVGNYKLIVAFNESPEINRILAEEFPSSNRNRNSAVHLLKEITAD FT RTKHDHLALFREHTAKLRSQIDNVDVQIVYVPIASNGEADYEYTQTKIENK FT PWTTFVNHNYIRVLCKKNVYKLENLIKKDPELDDDRPPDYENEPIVFSDNK FT KRKLDFVNKKLKKRRFSSSAESSSCSSSESNNDSDV*" FT CDS 8124..8555 FT /product="Polinton-3_HM_5p" FT /translation="MFMSLKGSAQMPIRATQGSAGFDIFAMEDVYLLKRAT FT VKVPTGIRMQIPDGYYGQLLSRSSLAAQNITVEGGVIDSDYRGEICVLLKN FT NNDREFVIKTDLAIAQLVFLQCFSPQLFIVVPEEELTSTGRGIGGFGSTDC FT MQRI*" FT CDS 4935..5885 FT /product="Polinton-3_HM_2p" FT /translation="MFFALKRFLMSHSSELCVPFGNCSYEPCSFLEYFILI FT GESGCIYDIAAEYFRQTKYKKLGSDYSYIEYSTSLLMLKNITYSDEFHDEA FT YRTIFYLYAIPNPYTETDFKVFLERIQYVYELRKRKVNITFRPRKNTQSDD FT YYYYEMSLYRYISNLLDSTCNRVHGSRFNRFEKIDCLIELILDPASIYRNK FT DGCAYGVNVFANCYGNPPLVISLLRGGYLAFIWQCVLKYTFSFCCETNCRK FT DGMSKDLFMQSKDFLKIKDEIELCGMWCLFILYLKKNSKDFLGDENVSSFN FT EACDCAAELLDKKTYDNEYGITKM*" FT CDS 27186..28298 FT /product="Polinton-3_HM_14p" FT /translation="MTTDLSLEKYCKKNFKNFKGVFPFDQVNNVFKQNFQV FT GDFFIYNSDSSKKNGTHWRMMLKLNNNSLFCFDSLGKESFFQQIPIAVQNN FT PDLLEYAETNSLSTALSQMTLNFNDHLYTRRKKHFPLRDTQDEWFTKFCFY FT YSQKHDIQNVNIVFNDLAYQLGDSSLCGPWILYFLHLLRQQWFNSSDIEID FT TIDSILNQYFTFAGDNRTKNECVSNLSDFFYFVKTSFWLYADKTAQKLYEK FT DFQIFDRLNVKERTAGALYRRRPTPYSGINKDVYKNDIKKNNTLTAFLSQN FT NLLLVKDPATPFLTNDDLKNLPAILKSNLMQTYLDDDQQRHTYNNYQKLKN FT IAQFYSIPVEQRNELSELKKMLQKYEL*" XX SQ Sequence 38791 BP; 14048 A; 5814 C; 6010 G; 12881 T; 38 other; tgatgatacg tgtataccaa taattttttt taaaaaaatt aaaaataaya agaaaagtct 60 ttttcttctt ataaaaatta aagggggggg ggtagtcata aaaaaagtaa aaataaaaaa 120 gaggggtaaa atttttcaaa aaataaaaaa ataaaaaytt gggttagagg gaaagaaact 180 tgggttagag ggttaggagg gttagggggt aggtaggtgt cagggtagga ggcagggggg 240 tagggtggna aaaatataaa aacataaatc aagtgataag caggtaggtt aaaaaatcaa 300 tgttattggg tgtaatcagt gtgtgtgaaa ataggcgtaa tcaggtggat aagcaggtgc 360 gtaatcagtg tgtgtgaaaa taggggcagg ggtgggagag gggagcaaaa aaaacataaa 420 ttaagcaggt aggttaaaaa tcaatgttat tkgtttaatc agtgtgtgtg tgaaaatagg 480 gtgtaatcag tgtgtgtgaa aataggcgtt atcaggtgtt aagaggtgtt aaggaggtaa 540 gcgaatgtgt taaaaaatat atataaaaac ggttaagcag gtgcgtaatc agtgtgtgtg 600 tgaaaatagg tgtaatcagg ggttaggggg ttagggggtt aggaggggta ggaggcagrg 660 ggcaaaaata actcaaatga gacaattttt tttaaaaaac aaacaaaacg ctcaacataa 720 ctcggacgtg gactttgttg tggaaaagag gtaggcacgc gcaaacaaaa cgataaatat 780 taatgatgtg aatttttcaa taaaaaatta cattgaataa tattttggaa aaatatattt 840 taaaaaaaaa tcaaaatacc atttgttaaa aataacgttt gtaggaacac tatatttaaa 900 aataaaatct ggacatataa aaaatttatt ctctatatag aaaatgcaac gtgttataaa 960 aaccataagt tctttaagat taattataaa atttggatct ggtttaattg ccaaatcttg 1020 ttaaccaatc ttgttgaatg actgataaat aaaccaatct tgttgaatga gagtttcata 1080 tgacttaatt aattcaccac ataaaaattg catttcataa aagtcatgtt cgcgtaaact 1140 tttaataaaa tcaacataag cagataaacg aaataatgtg caaatgttat tagaatgttg 1200 atcaataaaa tagccacaaa aaggtgttgc aacgtgttga tgatagtagt ccatggctag 1260 tttagttttg ataatttctt gcaaagtttt cattttttct aattattttt ttttgtaata 1320 actactattc taaattaaaa aaaaaggttt aaataagcca aaaaaattca acaaaaaaat 1380 ttttttaatt tgtaaccgcg aaatttataa caaattaacg actctatagg tatttaagaa 1440 tatataatat ttaaaaaaaa attatttaaa aaaaaattaa acttcttctc gttcaaattt 1500 tttttcttta cttcttttag tgcttttttt gtaactcttt gcgataccaa gctgaataat 1560 aatcatcttc atatttacgc caatcatatt catcttcata ccatacataa ccatgttcat 1620 ttcgatataa aatttctcct ttacctcgca ctcgatattt ctctatttct tyttttaatt 1680 gtgcagtaag aatgtgttca aaactatgtt catccatttt tttttctaac twctattcta 1740 aataaaaaaa aggtttaamt aagccaaata aaattcgaca aaaaaatctg aatatatata 1800 aaaaaattta ataaaaaaaa tatatttttc aattaaattg caaatacata taatgttttt 1860 tttttatttt ataaaaaaaa gaataaacat ttttttatga tatcaattta catttttttt 1920 aaaaaaaaat ctaaatacaa tttttttaaa taaacaaaca aatcacatat ttgtaattta 1980 atacgttttt ttcacatttg taatttaata cgttttttat aatcattttc ttcaggcgta 2040 ggtttaatta aatttgaata aaaacttttt acagctgaat gtatgttaat attttttaaa 2100 aaagtttgaa aaaaagacgg ctgagcaata atgtatacac tcttgcgtgt ttgcgctctc 2160 gacaaagcta cataaaacaa atgccacgta aattgayttc tcgttaaatg aataactaca 2220 ttagaaagag tgagaccttg cactttgtga atagtgagcg cataggctaa cacaagcggc 2280 attcctttgc attcggctat tacatttctt ttacagtcgt acagcatttc tgtttgaggc 2340 gtgattacaa ttcgttgttc gtctaccata atgacgactc gctctaggtt cactgacttt 2400 acaacgcctt gcgtgccgtt gacaatacca agctcggata tattgcgaac acaaataact 2460 ggtactcctt tttttaaagt tacattttga ggcgttttaa atggccattt ttcgttgatt 2520 ttaaaattgt taaatgtttt tatttcactg tyaaatataa aatagggtga atttgttttt 2580 tttaaatatt cctcgttgtg tgcaattcga tccttgttaa acataaacaa ccttgtgtaa 2640 ctcggatcta aatgtttatc ggcaggttct agccttttcc taaaaaaagc caaagtggat 2700 tcgcaaaaga tgccctcacg cacgttgttc aaatgcgttt gaaaggtgag gtctttgttt 2760 tgtctaaagt ttttcacaag ttctacatta acaaattcgc attcgttcca gaaagggtct 2820 tcgaataaca atgtttcttc gccaacaggc gtcagctggt aaaaatctcc ggcaacgagc 2880 aacacgcatc ctccaaatag acgatgtgat tttttaatgt tttttaaaat catatttaat 2940 gtttttagca cgtgtactgg taacattgat atttcgtcta taaccaaaat ttgaacattt 3000 tttaaacgtt cttttacgac gtcagtcata tttttcaaaa tggtttctac ggattctaga 3060 cgatagccga ggccagcaaa cgagtgaaca gtgcttgcat cggaatacat ttgcgcagcg 3120 atagccgtac ttgccgttat acatatgttt ttttttgatt ttataaaacg ttcgtagatg 3180 atattgattg taaatgattt tcctgtgcca gcctttcctg ttagaaaaac attttggttg 3240 tttaacagtg caacataagc tttttcttga tcttcgttca ttttttaccc cctttgtttt 3300 tttttttaaa ttgtgttgtt tttatacgaa ccaatgacgt aaatacttaa gaaggcgagc 3360 aaataaaaaa atgtttaaga aaacacaagc cgcaacaatt ttcgctttca tgtttacttt 3420 tgcaagttaa ataaaaatgt ttatataagc aaaagtcact gttatgagaa aaaaaaattg 3480 gcaaactttt tacattttta aaatatggat gtcgtaaaga tgaaagtaga aaaaaataaa 3540 tgatagtcat ctgacgactg tgtgataata ttttcgtaga gaaaaaaacg gcgagattaa 3600 taagaattat gaaaatgaaa gtcaagtgtt gagactataa tttcgaaata aaaatagtca 3660 aaaatagtca actgacgcaa aatagtcaac tgacatagga gttatgttac aaaatgaatt 3720 aatttttttt aaaaacgact gagagataat attgttttaa aaaaagtacg gtgagataaa 3780 aaaattgatg ttgacaaaaa attggcgtta ttgatttaaa aaaaacctat aaaaagggtc 3840 acttttggca ataaaaaatc aaaaaaacat ttgaagcaaa aaactcagag gtgttcaaaa 3900 aaaatggaaa acgaaaaaat acaaggcttt gagctgactg gaaagacagc gaaatttatt 3960 acgagcgatt gctatgctcc tggtatttat cagttttcag atgtcgactt taagaacatg 4020 tccacttatt ttttagcaac agtgcgtctt gtatgtacag tatgaccaga tgaaaaagac 4080 atggatattt tgtgataaca gcgacttcaa caagcgcata ttggttaatc ctagtcaagt 4140 taaactgata aacaattcac tggtgggaca gtttttttta cgtgcacatt atcctaacgt 4200 gactcgcgtt aataaaaaag gagaaacggt ttttgtggta agagatcaat tttttcgcat 4260 ggatttgaaa gaaaaaagta aatgctaaaa acattagttt atttctttgc cggaactgcg 4320 caaatatcaa aatgaagttt atgatgagta ctttgctact aaaaggtgta caattgaaaa 4380 caagcgagga ttatttgaaa ttaaatatag agacaggaar tcttgagcag atgcatttct 4440 atcattttga gtatgaaaca gccaactcga ctgagatgga gtgtgtcatt attgacgaaa 4500 aactaaaaaa tgcaaatgca gtaatgaatc aaatcgcggt ttcgttaaac gaagagcctc 4560 caacaaaaaa aaccaagaaa acaaaaaatt gtatccgatg acgaagataa ttagtgtaac 4620 tattttttat tagaaaaaat atttttttaa aaaagtttat ttgttttttt tattaaataa 4680 aaaaaaagtt ttttaatatt atttttttaa tttttgtttg tttgttttga acaataaagc 4740 gtttttttaa aaaagaagcc ttgaaaaaaa cagcaagttt tttttttatt tgctaatctt 4800 ttttgttata gccaatatat attattgttt ctcttaaaaa attaagttta aataattttt 4860 aagtattgca aatattattt tgttttttat gtttttatac ataaaaatat atttttttag 4920 aaaaaaataa aaaaatgttt tttgcattaa aaagattttt aatgtcacat tcgagcgagt 4980 tatgtgttcc ttttggaaac tgttcgtatg aaccttgttc ttttttagaa tattttattt 5040 tgattggcga gtcaggatgc atttatgaca tagctgcaga atattttcgt caaacaaaat 5100 ataagaagtt aggtagtgat tattcgtaca tcgaatattc gacaagtttg ttaatgttaa 5160 aaaacataac atattctgat gaatttcatg atgaagccta ccgaactatt ttttatcttt 5220 atgctattcc taacccttac acagagacag attttaaggt atttttagaa cgtattcaat 5280 atgtttatga gttgaggaaa agaaaggtga atattacatt tcgtcctcga aaaaacactc 5340 aaagtgatga ctattactat tacgagatgt cgctttaccg ttacatttca aatttattgg 5400 atagcacttg taatcgtgtt catggatcga gattcaatag atttgaaaaa atagattgtt 5460 taattgaatt aattttggat ccagcttcta tttaccgaaa taaagacggt tgcgcgtatg 5520 gtgttaatgt ttttgcaaat tgctatggta atccaccgtt agtgataagt ttgttgagag 5580 gtggttatct tgcttttata tggcagtgtg ttttaaagta cactttttct ttttgttgcg 5640 aaactaactg tcggaaagat ggtatgagca aggatttgtt tatgcaaagc aaggattttt 5700 taaaaattaa agacgaaata gaattatgtg gaatgtggtg tttgtttata ctttatttga 5760 aaaaaaattc aaaggatttt ttaggcgacg aaaatgtatc gagttttaac gaagcttgtg 5820 attgtgctgc tgaacttttg gataaaaaaa catatgacaa tgaatacggc atcactaaaa 5880 tgtgaaaaca tatttaaaaa ataatttttt ttattatctt tgtatctttt ttttttttta 5940 aaaaaaaata tttttttaaa aataattaat aaatttatgg taaataattt ttaataattt 6000 tttgttattt tttttataat tattttttat gtaaacaaaa taaaatttta tattatttta 6060 aatatatatt tttttgtarg tgaaacaaaa aaatgaaacc tcagctgatg tgtaaaaatc 6120 ctttaaatga aacctcaaat gacgtgtaca aatcctttaa aagtaaagac aaacggaaaa 6180 agtgtttagg taaaaatcta atggatttgt tggtagatga taaaattgaa aaaccgcaac 6240 atgtatgtga taattgtatt gataagtgtg ttattgagta taatttaata agaagtagaa 6300 gaaatgcaag actagtcaaa aagatgttga agtttgtgca tgaaattcag ttatcagaac 6360 aggagaggca ttttgctttt tcaaagtata tgatagacaa acaaagttat tatgacagcg 6420 aaagaaaatg ttctttttat actcgaaatt tgttaaatga ggaattggaa tcgtatattg 6480 atgggcagtt acattcaaag cctattttta aggcagtgca acaatataaa ttaggcatac 6540 aagacgtgac ttataaaatt gtaacaccga cggattggag agttgtttct caagaggcga 6600 tgggttttgc tttraaaaca attgaaaaag cccacgatgt aagcattaca aaaacagcaa 6660 aaaatgcatt gcaaaaatat atttagtttt ttttttttgt ttatgttatt atgaaaaaaa 6720 ataaaaaaat aaaaaaaatt gtattttttt taatgttctt tttttaatat atttactatt 6780 tttttttaat aaatttttaa cattagtttc tttattttaa tacatttttt attacatttt 6840 tttatttatt acaataaaat aaaaataatt ttttttcaat ttttttataa aaaatatttt 6900 ttttttactt atttattaaa gataaaaaat gtctgtaaga gaaaaaataa tatcatttta 6960 tgttattgac gaagcaggaa taacaaaaag atatactcaa ccttttactg ggytttcgat 7020 acgaaaaaac acatttggrg aataaaaaca atttggaaaa taataacaaa acggaaagcg 7080 atttggaatt gaaaaaaaga aaaaaagatt attcattttt ttagtttaaa aaaaaaatgt 7140 ttcaagtaaa aagtcaatcg attagtgtgt ttaagcaaat acttttaaaa cgttgtttga 7200 ttgtgtgtca agaatggagg tatgcagctg atgtgcagga ttttataaaa gattttatat 7260 tgaattgtcg acacgatacg gatattcatg atatacgcga tattcatgag gactggattt 7320 acgtaaaaca attgataact tgcgaacgaa atgcagagat ggaagaacac atggaagact 7380 atgttcaata tatatcacta gctttgttaa aacaatgtcg tattttgaag tattttttgt 7440 acggtaaaat gagttatgaa gatgttgaac agttagattg cagtttggaa aaagtgagag 7500 caccgagtcc gtattttact ttaactaatt tatttgcgcc cacttgggaa tcattaaacc 7560 attcgatttt tacaaaatta caaaaagctt ctactgygtt ttcgcatcga aaagcatttt 7620 atgtgcgtcc aacagcgtat tacaaatatg aaaagttatg gttgatattt cgttacattt 7680 atctgcagca aatttattta aaaaaaaaat cccattatga gttattaaca actagtttct 7740 tgtcgcctac tgatttggaa gaagagcgct ttcattatga tatgattgaa ttatttaata 7800 attttttagt gcgtgctaat tgttaccttg acgattgtca aatagaaata aaagtgcaaa 7860 tagaagatac aaaacatcat aacaaacatt gtagattacg aagctgtaaa gatatatctt 7920 tgactgatat tttaaatatg cctaatctac gttatatgta tttgcaattt aattgaaaaa 7980 aaaatatttt tttattaaat ttcttttttg tatatatcaa atataacttt tttttaacaa 8040 aaaaaaaacc atatataaaa tataactttt tttattaatt tttaaaaaaa aacttttttt 8100 tagagaaaaa aaactaaaaa aaaatgttta tgtcattgaa aggttcagcg caaatgccaa 8160 tacgagctac acaaggtagc gctggktttg acatttttgc tatggaagac gtgtatttat 8220 taaagcgagc aactgttaaa gtgcctactg gtattcgaat gcaaattcct gacgggtatt 8280 atggacaatt gttatcgaga tcatcattag ctgctcaaaa tataacagta gaaggtggcg 8340 ttattgatag cgattatcgc ggagaaattt gtgttttgtt aaagaacaat aacgaccgtg 8400 agtttgtaat taaaacagat ttagctatag ctcaattagt atttttgcaa tgtttttcgc 8460 cacaactttt cattgttgtt ccagaagagg agttaacaag cacaggaaga ggaattggtg 8520 gattcggttc aactgactgt atgcaaagaa tataattgta taaaaaacaa taaaaaatac 8580 attgtaaaaa aaaagttttt tttacagaag aaaaaaacgt gtaagaattt ttttttaaaa 8640 aaattttttt tataatgatt attaagattg agataaattg gtttacttag tgagaaaaat 8700 aagtttttat ttttatatat atttgttcaa taaaaaaaaa aaattttttt tagagtaaaa 8760 aaatgaaatt taaaaaagaa atcagcgaag aaaaatacaa taatgatgtt tattataaaa 8820 aaattttatg gacatcgaaa cgttattcgg atgaaagcgg cgtcgatgat ttaattgttt 8880 ctgatcaaat gacgtgggac gacgtccgtg tgtttatcga aaacaaaaca agagatgtta 8940 ttcttaaaga aatgagcaga actaagagac agatgccgga aagtcggaat tttttagttg 9000 gttttgtgtc agtggtcatg ttttatcggt cagaagcgga cggatatcgt cttgtaaaat 9060 caatatttag atctccagtg cattatgttc gaacaatgga aatttctaaa aaaattgaag 9120 aactagtgca tgttaatgta gaagggagcc gtcacgtttc tcatatagac agcggatttg 9180 ttttgtataa tttatctcaa tttcaagcgc aaatcatcgg tggaatgagt actgaacgat 9240 ttgtcttgtt aagaaattcc attattggcg cgcgcgatac cctttggtat aaaataaaaa 9300 agtatgatta tcgaaacaag gctttgtgtg acaatccgtt atcaaaacgt aatttaaaaa 9360 tggttcaaca atatctcaaa cgaatacaag atgatccgtt aaatcgacga aaacaatatg 9420 aagatgatcc gttaagtcga cgatcgcttg cctttgtaag tcgttttcta aaaaagaaaa 9480 atcaagcaga attgtttcgt ttttgtccgt cgtacattaa taaatttgaa ttgagaaaca 9540 catgtatcga ctattcgtat atggtagaaa aaggtaattg tttctatatg agttttttag 9600 tatcgttggt gcaaattaag aagcaatggt tttcccctaa cgattggcag gaaagtttag 9660 atgaagaacg acttatagaa tatgcaacaa acgattgcct agacacaaaa gaacaactgg 9720 aatggttaaa aataagaagt tttaatataa gttatgaagg tttatctatt cttttagatg 9780 aaataagtga acggtcggtt ttaaacaaat ttaacttggt cattttcaag tggttaccta 9840 aaccggtgag ggttcattta aaatttgaga aaagaaatcg ttttcaacgt gtagcctata 9900 aacgctttac aaaactttat gaaagtaaaa aaatctttcg cgatgatcat aatccacatc 9960 gtgtgtttgc aaacgaaata gagcgagtag aaagtaacga aaatttaaaa cgacaaaata 10020 ctattcaatt aatgttgcta gataacaaaa aaggcacgca ttgcgtttcc ttactaacta 10080 aacgctttca atatcgattt gaaaatgcct ttcagggtta tgtatgtgat ttttgtcaaa 10140 attgttatga tagacataga gcgcatcgaa catgtgcagg attttataaa gaaaacaaac 10200 ttgtttatcc gctaaaaaag cgtttttacg aacctcaaca agttcttttc catcaatctc 10260 gcaactgttt tcctttgtat atcacgtttg attgtgaaac tgaaacacgc gacggacaat 10320 tattttatta ttgtcacagt gtgactcttt tttgtacgaa tgaaaagttg agtgctgaaa 10380 tttttggtta cgatactcaa ggcaacgttt tgagagaatc aagcatgttt tttttaaata 10440 caaaagagag tatggaattt ttgtttcaaa agttgccttg gtctcttcgt caacacgtga 10500 atccgttatt agaggtagtt cgtttaaaac gttggcaaaa agttgaagat gtgtcggact 10560 tggagagttt acgtgttttt gatcttttcg ctatatgtta tgtagttatt tctttaatgc 10620 aagaaaagtt aaaagaaaat caattgcttc gtatttcgcg gagcgaaaga caacaattat 10680 ttgatgaaga gcttccgaaa tgtaccatat gtaaattgag tgtggacaaa cgcgtttatg 10740 aaaaaaccat tttgtcaagc aatcataacg tgtaccaatg cgaagatgct tatcatgctc 10800 ctgtcgttta cattttgttc ggtaatatta gcaaaatgaa tcatttcact cttagtcgtt 10860 ttagttcaaa ctatttgcaa gtaacaagca aggagaatat cgaaaagcat ttgctttatc 10920 tttttcgcat ttttgtatgt ttttatttgt ttgaaagtcg cgtattgtgt cgtgttcaaa 10980 atggaagttt gtgtgaatat ttgatggtca gcaatgatac actattagca atagtcaata 11040 cagttcgaag aagatttgaa ttagaggaag aaacgtttga agacatgtta tccgattcgt 11100 atttttttta taatgaaatt ttgtttctag taggcaaaat caattgtctc tatacagact 11160 attcagtcgt tttaaagtat tttgatttta cgagtagcgt tagtaaattg ggacaagtat 11220 gggaccattg tctagtggca ggaagtttaa gtatgtctga agaaactcca agcgaatatg 11280 tcaacaataa taaagcaacc ctggaaacat atttgcaatc tctcgtgtat cgctatatcg 11340 atcatgcagt tatcgatcac gaccattgga ctggttccgt attgggcttg gcgcattcgt 11400 attgcaatgt acaatatggc agagttgaaa agcctcgcgt agacattttt tcgcacaatg 11460 ctcctttcga tcacagcgta atgattaata agcactttga agaattttgc agctttccgt 11520 ggtcacataa agacgatcct gatttaggcg attttgataa catcgtcaaa cgactgtcgt 11580 tagaacattc tttttttatt actgacaaca tgagcaagat taaatcgtta tcgtttgggg 11640 atttttgtgt gtttaaagac agttatcgta ttacaaacac gtcgctagca gatgctactc 11700 gtaatttaga cacaaaaact aaagaacaag tacgtttgtc ggttatacat tatttgatta 11760 aaacaaaaat cgtttcgcat caacgaggag ctgasttgat tagcaatgtc gaaggaactc 11820 agttttttac aggaaaaagt tgctttcctt actctaaaat tagttatgaa agtttggaag 11880 agccgctcgt cgacgttgac ggtaacccgt gttgcccgcc tatcgaagat tttttacaca 11940 acaaattgag tcgaagtaaa aacgaagaag atgaacgagt cgcgctaaaa gaaccttatg 12000 aaaacgtggt agccatgtgg aaaatgttag atttgaaaaa atactctact ttgttggaaa 12060 tttataatgt cattgacacg atcgctttgg catttgtttt atttgatttt gacaataaaa 12120 tttacaacga acttcatttt catgccatct cgcaactttc ttctttttcc ttttcaagta 12180 aaatgaattc gtatattggc aatatcgcca ttcagcatcc tccttctaaa gaacatcatc 12240 gtatgattga agaatgcatg aaaggaggca tgtctacaaa cggcacacag agcgtcgcct 12300 tagacacttc gtattttgaa aatgttttaa atgaacaacc taatgtgtcg aataaaattt 12360 tgataatgga tgaaaatggt cagtatgctg gcgcccaaga actgcttcta ccgtttgcta 12420 tgcgtagttt tatcgaatat cattatcaaa cgttgcaaga agtagttgat gtttatggtg 12480 gcgaaagtga tggttcgttt gttaaaagyg cacttgtaag atgcgttatc aaaattgacg 12540 atcgttatca ctggctcact ggcggtattt cgccttgtat taaacgcgac agcatttcga 12600 tatgcgatct gtcgccgagc gaaattttag acaaacgaaa atgggacact cgctccaatt 12660 cttttcaaat agcgctcgat aaaggcgaaa gactcgtgta caaattagac gagcacgaaa 12720 ctgtcgaatc tttggatcgt cttctatgga tgactaaaca tgaacatgtt attatagtta 12780 aaatactagc cgttgtaggt gttgttcgtt ataatttagc aaaagaattt gtgagcagtt 12840 tatcgaaaaa gagagctgaa gcaaacgctg tcggtgataa agtaaaagac ggatctttta 12900 aattgatgat taacgggcat tatggctgga cagcgagaaa gagcgaaact tataaaaatt 12960 gcaagtttat agtgaacaag caaaagtgtc tgagaaaaag tttgcaaatt ataatcgaag 13020 cagtaaatta tttaaaagaa aacggtatat tcgaaattga aacctatttg cctgccagtt 13080 taacaactct tttagacgtg cacgatgtag agcaagtatg cattgattta agtgataaac 13140 gaattcgtcg cgatggaatt gattcttatt caagtattta taagcattta atattgactg 13200 aaatatttca ttttgataca aaaagtgtgt attcacctga taaaacaata aacgagtttg 13260 aagcaatgtc aatcgctctg agaaacgaaa aaataaacga agcattctgt ttgatcacca 13320 caaaaagaaa tgttttaaac gatagcttgc gcttagtagc ttgtggcatt ttaggacacg 13380 ctaaaaaaag tgttactgtt tttgcaagcg ttattaaaag agctctgctg aaagagaaca 13440 ttttttgtca gcttgttgca acagatacgg attctctatt ttttagtata tacggagatc 13500 actgtgaaga ttttgatata aaagttgaaa acgtattagc taacgacaag gattttgaaa 13560 gcatgatgga ttaytcgaat tatcctaaac atcacccact ttattctact cgttatcaaa 13620 agatgaccca tcgctttcaa aacgaatcac ctaataaact tatcactttt gtgatagctc 13680 ttgcagctaa atgttataaa gtgacgtttg tagacgactc gcacaaattt agagctaaag 13740 gttttcctca acagcatttg ttatcgtcgc attatgtcaa cggactttct ttgtttgttt 13800 tggaaaacta ttttgacagc accagatcct gcttgaacag atcgagcact ttatttaaac 13860 aaaatcgact ccagttttca actaatggca tcgcattgtg cgagaaaaaa taataaattt 13920 tatgaataaa gtcaactcaa agctgtacgt atgcaataac gggttgggac tgattcgtgg 13980 cgatattcga ttgcgtgctt tgagaaaatt aaaccaaaca gtcgagaacg acatggcaaa 14040 attgtgcagc gacgagcact tacgtcaatt gtttgaatta gaaaaacaaa tgataaacgc 14100 ttgtcctgat ttacaaaaaa gagacgtctt catgagacgc tttgcgttgc ctattttaaa 14160 atcgatacaa gaaaatactg aataaatttt ttttttgtaa tttttttttt ttcataacca 14220 tgaaaataaa aatggaaaac gcttacgagg cactgtgtca aaaaagagga tttcctttgc 14280 cgctcaataa atctagcgaa attggcgaca aagaactgcc ttctgaaaag tatgacattt 14340 tcaaggaaca agtcgatgtt tatcagggca atgtgcttat cgtaggtcct gtaaattcag 14400 gcaaaagcaa attgactcgc tccattgtaa tgaacggaaa gttcgaagcc aaacactatc 14460 acttgattaa taaacgcgag ctggctaagg agcaaaaaca agaatggcaa cgcgtgtgtc 14520 gaagcgacta ttcgaaaata aaaagcgttt attttcacga gctcgacgga aagaagtctt 14580 tggacgacat tattgctagc cttttgcaaa ttcaacagga agcctacgaa acacaagata 14640 gcaacaatag aaaaacgaaa acagtgttta tttttgacga tatccaagat gactgtttaa 14700 aaagttcaca gtatagcaat ttgattacaa gcggttctca tttgcaaata ggcactataa 14760 gttgttttca tgctcttcct tttaaacaaa ataccaaatg gaacacttta gttggtaatt 14820 ataaattgat agttgcattt aacgagtcgc ctgaaatcaa tcgcatcctc gcagaagaat 14880 ttccgtcgag taatcgtaat cgaaattcag cggtgcatct tttaaaagaa atcacagctg 14940 atcgaacaaa acacgatcat ttagcattgt tccgagaaca tacagcaaag ttacgaagtc 15000 aaatcgacaa tgtcgacgtg caaattgttt acgtaccgat cgctagcaac ggcgaagctg 15060 actatgagta tacgcaaaca aaaattgaaa acaaaccgtg gacaaccttt gtaaaccaca 15120 attacattcg tgtattatgc aaaaaaaatg tttacaaatt agaaaattta attaaaaaag 15180 acccagaatt agacgatgac agaccccctg attatgaaaa tgaacctatt gttttttcag 15240 ataataaaaa aagaaaactt gattttgtaa ataaaaaact taaaaaaaga cgcttttcaa 15300 gttcagcaga gagcagtagt tgtagcagta gcgagagtaa caacgacagt gatgtttaag 15360 accattgaag atcttgatat ttggcttgct agcgaacttc aaagcgacga gtcgcaggtc 15420 aagaaagaaa ttacacgatt aaacgaactt gaaaaaaaat ctttagtcga tcttgaaccg 15480 catagcaaag aggaaagaaa agaagaactt gatcgtgtca ttacttacga gtataaagaa 15540 gaagaacttg atcgtgtcat tacctacgag tataaagaga cagaacttca atgcaaaata 15600 gacaagaaaa aagccatgcg aagaataagt ttggctgcgc gaagtaatct ttttattcct 15660 gatttaaaaa aaagaaaaat cctaaccgaa aacgatatca tcgtgtacat gggtcatttt 15720 gtacagaaac tcaatccgtc gttcatgctg tcctgtgatt tttttacgaa cgacgaacga 15780 aacgatatca tcaaagcgtt ccaagactat gttgatccgt gcatcaagtt gtttttggga 15840 gaaatgttat ttccgtactc gacagaagac gaatcgggca agttaaaaag aaaaacattt 15900 aaaaaacaag aaagaaataa tttaataata gactatgtta gcgttcacaa agatacgctt 15960 ttacgttatt tcaattcgct ggactttcgc aaatttatca acatggtcaa tatgtacttt 16020 tctaaaattt atgcttttaa aagcagcgct ccgtctgaca ttattatatg tcacgcgcgc 16080 acttactttt ataatactta tggaataaat aaaaaattat aaaaaaaaaa attagttttt 16140 tttttttttg ttttgaaaaa aaaaatgaaa cgagtttata atcctcctac tgttgacgaa 16200 ctttataatg gcattataaa aaaactaaaa gataaccgaa aagaattgac taaattggat 16260 tatttaaaag aggctgcaaa aatagaaaaa agcaatgaag ttatgattaa cggcaataaa 16320 cttgatccaa ataaaaagta tactattgag ccggaactat taactttacc acagtttggt 16380 caatatgctg aaaatcaatg gaccgatcgt ggtcagtatg aaatagcagc cgaacaacaa 16440 cttgaattta atcgctcatt gcaagagcaa tacccttata tgactttgct tgctgttgat 16500 gctcctaacg taataacaga cccaaacgtg tggaatgcgt ttcaaagtgt caatcaagat 16560 attcctgttt ttgttgattt gcttaagaaa gataaaattg aagaatttaa tgcgagaaat 16620 agcacgcgac aattggttga atttaatcga acacaaagac aaacaaaaaa ttcgcaaacg 16680 ttacgaagtg aagcactaaa agacttttat aaacctccaa ccaaacctgc gcgacatgtg 16740 cttgttccgt tagaatctca cgaatttaat gctctctacc gtgacgtaaa cgaaattgaa 16800 aaaacgcaat tgctaaacga tagcagactt tcaagcgaaa aacttatcaa agattggata 16860 gggaaggcgt tgaaaggaca cagcggaagt cttcaacaaa atgcctataa attatcgttt 16920 gctgttttga aaacggacag caatgtgcca aacaacgttc gccttttttt aacgtacaaa 16980 agtaaagttt ttgatacaaa tgaaacttta gagatgatca tatccaatat gacaaataaa 17040 caactacctc ctctttctga cgagcgactc gtttatcagg aagccaacaa catttttact 17100 ttaatcaact ctcacgtcag tgaagaaccc aaatatgtat tatacgataa gtacgatttg 17160 gtgtatttgc ctttttcaaa actattgaaa gctgccacaa aaggtcaagt tccatttcgc 17220 gatacaaaga aatgggacag tgtttctttg gacaaaacac aacaaaaaga ttttaaatgg 17280 gtttctttaa ataccagttt tgccacagcg aatcttctcg acgatacttc tcttttcaac 17340 agaatcattg aacaactcat tggcagtctt gaagatgtag tcaacagatg gatcatggat 17400 cttttggcat actatcgcat gtccggacaa cagtttatcg atgacatgat ggacattgta 17460 aaagattctc aaaagtttat cgaacctgaa cgactaaaca aattagatta tttgacaaaa 17520 aaattttata aagctgacga tatggactct gctcttttgt ttttttatat attatcctat 17580 cggttttttc atcagcgatt tatcgcatta agaaaacaag cgtttagtgc aggcatttat 17640 gcggctatca tgaacattat ggaaaaggaa aaagaattta tggcaaacgt tatcgacaac 17700 ttgcctaact tgttcaatac aaacagcaac gtttctgttc aagccaaaat tgaaacgatc 17760 atggtgatca gcacgcttta cagtaacaac gcaagttctt tgcattttct tgttagcgaa 17820 tatgcagcgc tcggtcccga aggtaatcct aaaaaggtga aagatttttg tttaaaatac 17880 gcttctttta taaaatttga tcaaaacacg ccaccgaatg tcatcactaa aaagattcac 17940 actatgcaac tgagaagcga cgcaaaaaaa agtcaaaaaa catctgaaac gtttcgtcag 18000 cgacttgagt tttttgacaa acccgaaagc agttttaaca cggacacaag tcagaatttt 18060 tctacgtcca ctcctctcaa aaataaagat tatgtagatc ctaaagacgt aagaaaccct 18120 ataaaggtcg acgaggaaac atattcgccg gatacacgaa agcaaattga cgatttaaag 18180 gcagaaattg ctaaaaaatc agcagctttc agtttatcaa aaaatggatc agattgatga 18240 tcttgtgagt tcggcgttga cgaccattca aagttcgctt tttggactga acacgattgc 18300 caaaatcaaa aaaaacttgt tagccttgca aaacggcagc aaaagtgccc aagatcaaaa 18360 actgttcgat aacattagaa aatttcgaga agaaaacgaa ccctataaca aattaataaa 18420 aagatggagc gaaatgtcgt tattgtttaa agtgaccaac gtgttgagca agttagaaga 18480 gccacgaaca ccgtacagtt atcgcacgca taaaaaaagt tataatttgc caggacaata 18540 ttttcattgc gatttggccg acatgagtgt ttttaatacg gataaaaaag tatatcgtcg 18600 caactattgc gttgttctcg ttgacggtgt cagcgggttt gttcgtttca aagccagcat 18660 cggtaagtca gcaagagaaa tacttgttgc tctaaaaagc ttgttcgaac acatgaacag 18720 acaagaagaa gtctttttgc agaccgacag aggtggcgaa ttttataaca aacagtttga 18780 agcgtggtgt aatgaaaata aaattacgca tttctcatcg aaaaatctgc aaaaggctta 18840 tcttgctgaa aacgctattc ggcgcctcaa agatatatat caaaaattaa gaatcgctaa 18900 aaaattgaca gacaagacag attggaccag ttttttgccg ctagtagaag aaaagttaaa 18960 caaatcaaaa cgcacagtgt ctggcatcac gcctgaacaa ttgaatttag atacaaaaca 19020 aggtgaagct ttgcgcatgc ttgatagttt tcaagatcaa aagaaagctc gtttgacgtt 19080 taaaagtcaa aatttgtatc gagtagaaaa cgaaacgttg aaaaaaaaac tacccatttt 19140 aaaacctttg ctagtcggta tgaaagtgta caaaagaaat ttcaatacgg atccgcgact 19200 taaatttgca aaagcgagca ccaatgtagt ttcaaagtgg gatttgaaga acatttataa 19260 tattgtaaaa gttattcgca ataacgatgt tgcaaacaat ttttatagtc cgcctgattt 19320 gtataaagta gcgaacgctc tcgacgaaaa agaagtgtac tatttaaaac gcgctgacat 19380 tactccctat ataagagaca gaaaaaaaga ctattactct aacttgaacg aatacgaaac 19440 cgaatttttt gataaattaa atttgtaaaa aaaaaaatga gtttgtattt agaagagtta 19500 tatgaaggaa aatataagac acatgatttt ttttttcaaa aattaattca ggcgactcgt 19560 tatttaatga tacaacagaa cattggaata gcatcaacga atttaaaatt ccgctctcct 19620 tgcctagcga accggcttta acatttaaag aggacactat tatacacatc aaggaaatgg 19680 aactacctgc aaacattata gttttaacag aagatcagta cgaattaaga aaattagtga 19740 acgattacat gcgacgcatg ataagtacag aactcgtgcc tgtatttgaa aaaggcggaa 19800 tggactctat cctcaatacc gaaataaaaa atggttttta tttaagaagc atacagtatt 19860 acatggattt tggaaacgaa gccggcgcct acactgaatc gttgaattta gcaacagatt 19920 tggtcactca aaaacttagc ggtgtttctt cgacacgcac tcttctttat gcgttacaaa 19980 aactgcaacc aggcgtggac gtgttctcaa aatggtttaa agtgccgttg actcatcgca 20040 aaccaaattc aacaaaacaa gtggtgaaac gcagcgactt atattacgat gtaaactctt 20100 tttatgtcaa taatcaattt aaaaataacg cgcttttgtt acctttgcaa cttttaggta 20160 cctatcaaac tgcactcgga aacgatttat ttggtattga atttgttaaa gatgccaacg 20220 gtaatgcaac tcctgaatta caatacactt ggtcaaaagc cgcagcgcct gatttttatt 20280 tactttacgg acgaacaaaa ttcagttatg aagacacaag cggctgggac aaaacaaatc 20340 catggtttcc aatggagatt agctcaaagc gacaactcgc attacctgat ctaatgattt 20400 attcgcactc gagcgatcgc gatcctgata cgcccgttgc gtttgatttt ggtccttggt 20460 caaacattac acgcgacacg caaggtccaa ctccttatga aacattattt aattgggcgc 20520 gaacatttta tcaatttttc ggaccagatc caacagatcc ttcagtacac gtttggcaga 20580 ataaatctca agctgaaatt gcagctttac cgtttttagt atttgttgaa cattataaga 20640 atactacaaa tgtagaagta ttaaatctca ttgatcctac gcgtccttat tacaatatga 20700 ctaaaacaag caacaagaga atcgtttacc gcattacgcc tgttcgttac aacatattgg 20760 actcgttcta ttatgttagc gaagtgaaga tggatttgta tttggctaat atgatgaaga 20820 gtaccggacc gacgcctagc actcctgaac tcaaagaatt gcaagcgttt tattataaac 20880 aatttttacc atggaatgtt gctcaatgca ttaaacatgt agtaaacgat ccgcttgtca 20940 aaaaaacagc cattcaatat aatgataaga ctcaaagcat gtcgccttgg tacgatttag 21000 attatgtccc tcttgtcgat acaatttatc gcacatgtgt agattcgata ccgattgcgt 21060 tyacgtctga caaaaattct ccagacgtac ttaagttttt gctaggaaat atggtaggaa 21120 cgtacatgcg aaaaaataat gttatttcga acaatcttcc agtcgataaa acaggtgttc 21180 ctcaatttcc tcctcatcat ttcatgttct tgcagcaaca actttttttt cctgttgatt 21240 tcactaaaga tcattatgtt acttacaaat tcactttaga aaatgttcaa tttaatgaca 21300 tgcccatgcc ctatttggca agcaacgaat tggtatggaa agaatttcaa tggcaaactc 21360 ctcaattttt ccctaattca ttaggaggca gcgagcctct tctatgcaat aaaccgtacg 21420 gcttgacaga attagaagtt ccgtacgatt acactaacat tagtgtaggc acaggcaatg 21480 aatcattttg gatattgccg tcatttacaa acatttttaa caacgatcac agcgtaagga 21540 cagcgcgaag gttttttgtg cctccgctga caataaaaac aaagacaccg tctactatat 21600 ccgaagatgt gtttatgtta atgaaaaatg tcgacatatt aaatattgat ccgacagctg 21660 actatggcaa atacttagaa gttttatttt atatagacga ggccgaccct atattcagta 21720 acagccttaa agtcagctta aattcgacag cgtttatcga aagaaaaatt gctccgagcg 21780 acaataagta ttttaagttt actaacagtt ccaaatcgta tttagaacag ttgtcgttga 21840 aacaattgac agtgcgaggt gtctcaaata caggcggaat aatattgcaa gtagcagacg 21900 acggttcgat cattgaagac tggaaaacaa gtttgcctat tcagtgcctc aagatgtctc 21960 tctacgagcc aatgtataaa atcgacaaaa gaataactgt gcttttagag cgcgatagat 22020 cactggtcgg agtagaaaaa aatccgtatt ctttcaaaat gtctgcagct aacgttcgtg 22080 acctgtcaac gcttcttaac ggaaaaagaa agtttatctg cacgctagct aatattttaa 22140 acatatctaa ccgtgtattg aattttaaac ctaccgacaa aaacgtttgg attactatgg 22200 atggcgtaga gcaaactccg acgtttgtta ataacacagt tttagacaac gtcattatgt 22260 catttgacat ggctccctat ctcgctattc aagatagcgt agttcaaatt agttttgaaa 22320 acacaattag accgcaattt gaaggattct acaaaacatt tgtcattaac gatgctgacg 22380 attttaaaaa tttaaaaata accttcgtca attcagatct ggcacctctt caattcgatg 22440 aaactrcgac attcaataac cctaatttta cgttttatat tcagcctttc aattgaaaat 22500 cttttwttat aaacacctcg agcttctaac aaataaaaaa aaaaaaaacc atcagaaaaa 22560 aatgtttcac aataaaagaa acagatgttt tgttaaaaaa aattcaaggc acttgttaaa 22620 tccgctcgga cgagttgtcg atcgaagcaa atatgattta gaagatgaaa atgtaaaatt 22680 aaaaaatgga tttattaact taaacgatca acttcaactg ttacaacagc aactcgaaaa 22740 agaaaaacag gaacaacatc accaattttt aaaatttaaa gatgatctag cgaaacaaaa 22800 gataccaaaa caagtcgagt cgccagttga aaaaatcaca gacgttcctg ttgctgtaga 22860 aaaaacacct attttcgaaa gacctacgat tagagatacc tatgaaaaac ctgccattca 22920 agtcgaaaaa actcgttatc tctgtgcaat gtgtcaaaat gcagtaaatg cagtggatat 22980 catacatgga tattgcaaag actgcaatag aagagagtcg ttgtttgtaa aaaaaacacc 23040 accagttgac aagaaagtaa taaagaaaaa gtatatgtcg aagggataca aaaaagcact 23100 aaaagaagta aagaaaaaat ttgaacaaga agaggactaa tttttttaaa aaaatttttt 23160 ttttaaatat tatatattct taaataccta tagagtcgtt aatttgttat aaatttcgcg 23220 gttacaaatt aaaaaaagag aagagcacat caattcgctt ttttacaacg cttgcattca 23280 aaaaaaaatg taataaaaaa aaaagcatta cttaaagttt tttttttttg ttattacaaa 23340 aaaaaaatgg atgaagagag cttgccttcg cttactgcga aattaacaaa agaagaaata 23400 gagggatatc aagatcgttg gcgaggtttt gatgtacccg tacaacaagg agaggtttta 23460 accgattatg acgttgaata ttatccagac actaacactg atatgacaaa aaaaattcaa 23520 tttacaattc aaccaggtga aaacctaatt tatcgcacca tggtgctcaa ttttacactg 23580 aatctttaca aatcaatact aaacaatgtg ctcactccat tcactgctgc tgacgctgcg 23640 tttataagaa aaatcgctat gagagctaat ttttgcctga atctgatcaa aggcataaaa 23700 gtgtatccaa attttgaatg gaataatcaa acaacagcaa ataacagttt aaatacaaac 23760 gaaaaaaatg aacaacgtgg ttacattcaa cgctttaaat atccgacatc ttatctcgat 23820 aaagtatact tagctcaaac tgcaaactat cgagttgata gctctggccc tacrcctaac 23880 cttggtttaa gtgttagcag ggcagcaact ttagcacata tgactgatcc aaatactcct 23940 attattgctg atactacaac tggaccgaca aattatcaag ctctaaataa tgcgcaattt 24000 gtcattccgg catgctcacc aacaggatgc caattcagca ttcccctcta tcatttagat 24060 gatatttttg aattagaaga atattttgga aaaaatttaa aattggtaat tattttagaa 24120 atcgaaacaa atcttgctca attattcgaa aatatgcctc aaaccaccgc agctgatatg 24180 acagcaatga atgcaatggc ttttcgactt aataacccga ccatcacggc aaacgaagtt 24240 aaagtgaccg acacgtatgc atttgaaaga aataaattta ttgacgcgtt tcctaacaca 24300 caatatgtag caatgcataa aaactattgg gaagtgcgat caatcgacgt acctcctcaa 24360 acaaaaacta tggtcatacc wtttaatact tttaaacaag atagcttgac tccgcatgca 24420 gtatgcgtgt ctctgttttg taaagatcaa gccgaacaca gcaactttag atgcagagca 24480 cctagaaata tggctcaaga attaatagca tcggtcacat ttaacaacgt gcgtatcaac 24540 tttctacaag gaggcggtag tacttttact tacgacttat cgttacctaa aacaagaaag 24600 caactttatc gagaatattt atcatttatt aatgacggca caaattctga aaacgatatg 24660 cgaaatcaaa tattacgtta ttctacagat tatgtaggcg aatttgcggc aactgaaaat 24720 gactattggc tcaattctta taacgaggta ctagcttttg atcttactgc agataaaaac 24780 agagttacag atatggactc acctaacgtg ttaggttcat ctaatgcgtc gatgactctt 24840 aaatttacta catctataaa taatgcagtt gtacgcattg atatgcttca caaaagtcaa 24900 tttgtgatga ccggaaaagg atttaatgaa aatctagcct tgatgcctaa atcactaagc 24960 aacacttaaa tcattcgttt ttaaaaaaca ggacagtcgc ttttaattta tttagctgtc 25020 gctgaaatac tgaaaaaata attgcaaagt atttttatag cgaagctcat tatgtgtttt 25080 tttcttattt caaataatat ttttttacgt gcatgatctg acatgaaacg acgcgataga 25140 cgaggcagat ttgttaaaac aaaaaaatca aaaaagcatc gaggaaaagg acctaaattt 25200 gaagccgtca aacgttttgc aaaaagaatc ataaacgcag gtctcaaatc gagcattggc 25260 aaagaaggtg ttcagcgatt agtcrattat caaaattcgt tataaaaaag aaaaaaaaaa 25320 tggctcgcaa acgaggagga aaccgagttt tattaccctc tatggtcatt cgagtaccaa 25380 aacaagtagg aaacggacga aaaataaaac gaaaaaaatg gaaaaaacaa cgcggtggtt 25440 tttttccgct cctgttggca ccgattattg ctgcttttgg agctaaagct gctgctgcag 25500 cagctattgc aggtcctaca gctctagctg ttggaaaagc cgctgcactt ggagcagtag 25560 gaggagcaac cggattagca gtacaaagtg caataaaaaa agccatgtaa aaaaaaagat 25620 attgtatttt tttatataat aataaaaaaa aaaactttaa tacgttttta ttttttttct 25680 aaaaaaaaat aaatgctaac aatatttttt ttttatattt atataaatta catttttttt 25740 aaaaaaaact ttgattaagc gtttgtcatc attgacgcag cgagtccaca aactgacaag 25800 gaagaactca ttagagcata gtctcgaatg atttgaataa cttcaggcac aacatatgtg 25860 ttgattggtt caggaagagg agcaataaga gttgcatctg cgcctgcgcc caaagcgttt 25920 cgataagcgt taatagccaa attcagatct acaaaattac caacaagtcc ttctccgata 25980 agagcgttat taatagctaa aagtcttgca gttacattat tttgccaagt tgtatgcaat 26040 attccaaaaa gacgttcttg ttcatctttt ttcacaaatt cagcttcgat gacttttaaa 26100 aatcgcgttt gcaaattgtt aatttgttga gtcatttttt tattaaaaat cttttttttt 26160 ttataaaaaa aaaatatctt ttttattaac agcttttttt tttttaataa aaaataactt 26220 tttataaaaa atgtcttact ttatttctat tagaaaaaaa agttgatata ttaaacatgc 26280 gataatttaa ctttttgcaa gtatcatttt gctctatcgg gcctagtcca tgttttttta 26340 ctttttttca tttgtttttt ttatgcattt ttttttaaaa aaaatgcatt ctgctattcg 26400 aaaaaacagc tcttttttat aaattaaaca ttagtattta catgtttaat ccattttttt 26460 ttgaatttaa ttttttgtct tgtctctttg cattcatgct tgtgacatta agtgaaagca 26520 acgttaaggt ttttttcaaa aaattattaa gaaatttatt tgaaaaaaaa agaagttatt 26580 gaaaaaatgg aagaaaacaa aatgacaaaa acaacttgta acttctgctt gattttagaa 26640 ctatggaaaa cacacaatat cgaaaatttc gaattaaaaa taaaaaatga tgcaaagatg 26700 cacgttttac aacgactaat taaagccgaa ccaaaagaac tcaaaaagaa aacgttttgg 26760 tacgaagtga gtcatgaaat caatagtctt gatatttcaa aagaaagcat gtgcgacaga 26820 catcaattaa aatttaatca gacaacaata taccaaaatg aaaatacact cgacatgaaa 26880 aaagaattga aatttttaaa catcacctac gggtcagggt tgacatctat gcttagtcta 26940 tcaaatggat tttcattaga agagttacaa aatgaagaca atgttgacgg tatcgctgac 27000 attattaccc tctctatgcc tgttataaaa aaaaattttc aaagctcttg aagtgatgat 27060 ttcaaacggg tttacaaagc gactcttgtc tggcgaaata aaaatagaaa acactgtttt 27120 gttttataaa gaaattgtca atgccgtata cagtctacta attaacaatc tcgcagaaaa 27180 aaaagatgac cactgatctt agtttggaaa aatattgcaa aaagaacttt aaaaatttta 27240 aaggtgtttt tccctttgac caagtcaaca acgtatttaa acagaatttt caagtcggtg 27300 atttttttat ctacaacagt gactcgtcta aaaaaaatgg aacgcattgg cgcatgatgc 27360 tcaaattaaa caataatagc ttgttttgct ttgacagttt aggtaaggaa agtttttttc 27420 aacaaattcc gattgcagtt caaaacaatc ctgacctctt ggaatatgcc gaaacaaact 27480 cgctatcgac tgcgttaagt caaatgactc ttaactttaa cgaccatctt tatactagac 27540 gtaaaaaaca ttttccgctt cgcgacacgc aagacgaatg gtttacaaag ttttgttttt 27600 attattcgca aaaacacgac attcaaaacg tgaacattgt ttttaatgat ttggcatatc 27660 aacttggcga ttcgtcgctt tgcggaccgt ggatcctcta ttttctccat ttactaaggc 27720 agcaatggtt taattctagc gatatcgaaa ttgataccat agacagtatt cttaatcaat 27780 attttacttt tgcaggagat aacagaacaa agaacgaatg cgtttcaaat ttgtcagatt 27840 ttttttattt tgtcaaaacc tcgttttggt tatatgcaga caaaacagcg caaaagttat 27900 acgaaaaaga ttttcaaatt tttgatcgtt taaatgtcaa agaaagaaca gccggcgcat 27960 tgtatcgtcg aagaccgact ccttattcag gcataaacaa agacgtttac aaaaatgata 28020 ttaaaaaaaa caatacgttg actgcttttt tatcgcaaaa caacttgctg ttagtaaagg 28080 atccagctac accgttttta acaaacgacg acttgaaaaa tttgcctgcc attttaaaat 28140 caaaccttat gcaaacttat ttggacgatg atcaacaaag acacacctat aacaactatc 28200 aaaaattaaa aaacattgct caattttatt caattccagt tgaacaacgc aacgaactta 28260 gcgaattgaa aaaaatgcta caaaaatatg aactgtaaaa aaaagattat ttaataaaca 28320 aaaaaaaatt tatatataca tagttttttt tttattattt acatattttt aaaaaatgtt 28380 attatttaca tactttaaaa gttttattaa aagtttcgtc ttccgactcc caatcagaaa 28440 aatcattgtt aaagtcattt acttcatcta ctatgtcatt attttcataa aatacactat 28500 ttgratttat ataacagttt tcataagatg atgttttttt ttcattaaca tcttcttttt 28560 tcatgcgcgt ttgccaacgg gctttaacta ctgtgcagat ttctttaaaa tcatcttttt 28620 tcatgcgcgt tcgccacgtt tgccaacgga ctttagatgt agatgctgta ctattttcat 28680 cgacatcttc atttttcatg cgtgttaccc aacggacgtt agctgctgta caatttaatt 28740 cattcatttt tctaaaaaaa ctaaaaaaaa taaaaaaatt atacctgaaa aaaaaaaaaa 28800 aaaagcgaat gaaattaaaa aataaattat ttttcatagt tttaattttt tattaattca 28860 tgaatacaaa atatattgta ttcttcttaa aaaaatwaaa ttaattataa caaattatat 28920 ctycttttta aatctgtatc attaaagtaa ccagtgtcaa ttagtttctt aataaaaaaa 28980 taacattcgg aattagggca atgcattttt aagttatttc cgtgatgtcc gcaaccaaac 29040 tttttattaa tagataagta ttcgatgtcc ttattcactg acggaatcgt atgcaaactg 29100 ttaatgtcaa acgaaacgca accaatatac tctattaagt ttttttccat agcgccttta 29160 tgataaataa cactcttttc cttgcaaagt actttcagca actctttaac gtcatcacaa 29220 cggtaaatac taaatacaaa aatgtcaagg ttattgatat tacgtttgac atattcagct 29280 ctttgttgat aaaattcttt catttgttct tttgttttac atacttgcaa atgataatta 29340 atacaagtca aatgtcgatt cacaatgctt atttcacgaa tgcaaaagtt tccccattca 29400 tgctttactc cgcaatcaat agcttcaata tccattacaa aatcaaacat tttttcttat 29460 gaacctgttt tttttaaaag aaaaaaataa atttaaaaaa attttttaag aacatgtcat 29520 tcttaaaaat atatcattct taaaaattat ataagttttt taatgttttt taatatctta 29580 ttttttaaag aaaaagaaaa aaattgcaat aataactttt tttatttaca aaaaaaattt 29640 aacaaaaaaa tcattaaaca ataaggcgtc caaaaaagta atgatttttg actttttccc 29700 agaaatcatc tttgtcatct tcctcataca acgaacctat cgcattgtta gcttcaagaa 29760 cataattaaa tttagtacga tgactatcgc tgataaaata ttcgctttgc tttttaaaat 29820 acgtcaaaaa tgcttgcatc tgatgtcgat ccacttgaat atcattaaac gtctgatttt 29880 cataacgctc aatttgatca gacgtccaaa aatcatattc atcaaaatga tttttcatgc 29940 tcggtgattt tttccaacat tcaagaatca aggccaaata gtgcacttta agtaatttaa 30000 taacaagagg aaccttgttg aatctagtag caaaaatatc gattgcatct tcagaaataa 30060 aaaaatcaat caactgatcc atgtgacaat cccattttgt gtacggttta attaatttcg 30120 aaataaaatt ataaacagaa ttatacaaaa aatattcatc atcattcata ttgtaacgta 30180 atttaagaca aaactcgatg cgttctataa attcatcttc tgtcattgtt tccatatcca 30240 ttttgtaatt atcatattct ctacacacaa cctgataaac ctctctccat ttgttgtcga 30300 tcattacatt gtgatctgaa aaaaaatatt atataatata ataaattata aataaacatt 30360 acgttgtgat ctgaaaaaaa aaaggacaaa ctgtttttta cctctccacg aatcgaataa 30420 tttcagattt tttcgataaa ctaccaaact ttgtccagaa tttctttttc ttttaacgcc 30480 ttgataaaca ttttttaaaa aatgatcttc taaaatcact atgacttctt gcaattcatt 30540 caatgtcaaa agtaactcga acatattttg aggttcaaat ttgttttttg aaaatttcaa 30600 aaacaaacgc tcaggatact tgattaaatg agcaacgatc agctttaaaa aacccattat 30660 ttttttacta tcgctaaaaa atgtaagtat cgcaatataa taaaaaaaaa cattttttat 30720 tcatattttt aatgcaaaaa attataacaa atagtttttt taatgtaaat aaaaatgcaa 30780 aaaattataa caaatagttt tttaatgtaa taaaaatgtt ttttatctat ttttttaaat 30840 tttttaaaaa aatgtaaaaa atgttttttt ttttmaaaaa actatataaa aaaaataaaa 30900 aaaagataat ctttaaaaaa aaataaaaaa agataatctt aacttaatca aataaaaaat 30960 aaaaaaacaa acaaacatac tatttaaatg agttataaga aaaaaccatt atttttcata 31020 acatattttt tttaaagaaa taaaataaat acaatttttt aaaaataaaa aaagatctcg 31080 aatcaataaa gtaagcaatt caaaaagaaa aaaatatagt taatatgtat aatatgtata 31140 atatgtacaa ttaatatgta tagtctgtta ttgactgcgg aaaaaaataa tcatcacttt 31200 cagttccgac cattaacgct attttttgtt cattaacact acttttcgct tttaaaaagc 31260 tcacaatgtc gttacgacct atttgttcaa atattgaaat caccattttc aaatcaccat 31320 tttcaataac ataatttata aattcaaaac catcttgaat agtttctttt tttccaagtg 31380 gtactcgtaa catatataac aatttcttga cgttgtcttt agtcaaatgt tcaccgacta 31440 ctaaccaatc atctcgagtt actctattca ttttttatta aataatatag agttgcgttt 31500 agtcgtttat cgttgtattt gttgtatttc gaaatcgtta ttgaccatat ggatttcgat 31560 caatgaaggc ttaattaact aatgtattta tttatttttt tccatttttt atattataag 31620 caatttactt tttttcaaat ctatttaatc tcgatccatg aacacgatta caatcgctat 31680 ttaatatata gtcacttaat aaaaaaagta cttcattttg atatttgcgt aaacttcatt 31740 tttgatattt agtaaacttc attttgatat ttgcacgata aagtaaaact tggaaaaatg 31800 gcaaatttat ttaattattc tcttgttttg ttttcgataa acacacgatc ataaacaatt 31860 agatcatcga ggttttcatc cgaataacgt ttcgatgtcc ataaaatttt tctaggttta 31920 agattttttt tgatttaaga ctgtcatttc gagttaatct attcattttt tttatttaat 31980 aatagaaatt ctgtaaaaaa ataattcgtc tattttcacc aataatcatc caacttatta 32040 ggagagtaag taggcattct tctgaccgaa ttaagagaac atccactaat atggatttcg 32100 atcaatgatg gcttaatcaa ctattgtatt gatttatttt ttcgattttt tataagcaat 32160 ttacgataat gatgcctaag atttcctgac tctattttta aataatctaa acttgttttt 32220 atttgaacac cttttaataa aaaaagtact tgtcataaaa ttattcaaaa tgaacttcac 32280 atacctcatg ttgctgaatg tattcgtatt gacaaaaatc tttacgttaa actttttttt 32340 aaaaatccat acgaaaaaat tgatctctca cgattctgga aatgttgtat gactcaaaaa 32400 gaaacggttt ttttagcaga acagaaagaa cagaaagaac aagtgttcga acaatcgatt 32460 ataaaaaatt aacaaggata ttatagcttt tttgtttcta atacacaata tagagtttgt 32520 aaatcatatt atctttcaac attgaatatt agcagccata gagtgtttta ctttcataag 32580 tttaataaat ctttaactac tggaacacca ttgcaatcaa aatcagggaa acatgttaaa 32640 aaaaggttac tagaagagtc taaacaagtt gtacgagact ttgtgtatta attaaattta 32700 aaaatgttgt tggcaataat ctaaagccca cacctgatct aaaagaaggt gaatgacgtc 32760 aagtgttcga acagtcgatt cttggaattt aacaagtaga tgaagaagtt taaaaaaatt 32820 ataatttata actattttct atatatttaa tacaaaatat tatattatat tttctatata 32880 tttctattat aaaagaaacc ttggttaata gaacattcra aattatctat ttaaagaaaa 32940 gaaatttgtg tagattttgt ttaattatat gtaaattaaa ttttatatca cagctattaa 33000 aaaaaaaact taaagttaaa aaacttaaag ttaatataaa gagaaaatat tattatttat 33060 gtttaatgca acattcaata aaagcattaa aaattagagt tatattatat aaataagagt 33120 tatattatat atcactcatt gttagtttaa taaatcattt ccwaattgaa taagtccatt 33180 atttaacgaa taagattaaa aaaattatag ttatagtcat tgttagtttt tagtcatatt 33240 traacaaata ggtcgtaacg atattgttag ctttttaaaa gcgaaaaaat agtgttaatg 33300 agcaaaaaat agcgttaatg gtcataaaaa aattatagtt atagtcattg ttagtttttt 33360 aaaaaaaatc atttccaact tgaataaata agtccatcat ttaacgaata agattaagaa 33420 aaatataaat tttttaaaaa acattttttt aattaaaaaa aaaaaagaaa gamattttta 33480 taagaaattg aggaagtact aggagtgaaa actgtttgta gagatgcaaa gaaaatgtta 33540 ttctcaacga gcttgcataa aaaaatgttc agaaataaaa attttcttgt aataaacatc 33600 attattgtat ttttttcttc actgaattct ttttttaaat tccacacttt tttttttaaa 33660 atttccatta atcaaacata atgcactgga tatttaaata ttgattttta caagacgaca 33720 agactttaag ttttttttat atctgtccac ttctgagcga taaaacatga gcactgacac 33780 aaaaccaact aaatatttga acattttgat cagaaacaat tagttcattg acgctttcat 33840 ccgaattttc tttaaaaatc gcaataatgt ggttaagatc tgctgattat ttctatgcca 33900 aaagcttgtg tagacgtcat ttattacaga agtccattcc aaaaagttgg caaaaaaaaa 33960 aggagatgat ctaagtgact atataagatt tgaaagatga ttcttatttt attattactg 34020 ttcgtctaat agcgcaatat gatgacagtt ctttttataa gaaaaaaata caaactatcg 34080 agttttctag taaaactaca aagagataaa taaaaatcag aaataaaaat tttcttgtaa 34140 taaacatcat tattgtattt ttttcttcac tgagctatag ctaaatctgt tttaattaca 34200 aactcacggt cgttattgtt ctttaacaaa acacagattt ctccgcgata atgtgatgca 34260 aattaaatga gaacttaatt taagaaagtt aatagcaata aaaaaacgat aatagatagc 34320 gatagcgata gtttcttttg aaaattcaaa attcctttat caataaaggc aaatcaatta 34380 attaaaatct ttattgtaat tctttattta aatcttacac atgtttgtcg tttatcttgt 34440 aaaagtatgc ttcataattt tcctacctac attaaatcac aaacaaatca caaatcgtat 34500 aatagcgcaa tatgatgaca tcagtagtga aataaaaaca tttaataatt ttaaaattaa 34560 caatttgcca cctggtattt atcagttgtt tggatgaaga tttgaaaaat gattcttatt 34620 ttattattac tgttcgtcta atagcgcaat ataatgacat cagtaaaaaa tggttagatt 34680 gcaatacaag tgaccatawt agttagcgat agtttctttt gaaatatacc acaatatacc 34740 atagtgacgg ttaaacaatt gaaaaaaact aaagttgata acaaacattt atctgaaaat 34800 gatataaact actataaatt tttatttata agatttattt tggctcttat aaattgtttt 34860 gatcagtata taccataata taccatagag cttagtaaaa gtattggagt agattggaga 34920 actctagatc tttttttatt aaagacattt atgaatggat tcgttgttca aacattaaaa 34980 aagttattga aaaaagaaca tttaaaaaat gtgcaaattt tgtttataga ctaaaaaagt 35040 ttttttcact tacaattctt tttatcagtc tttaaaatat ccaaggcatt tttgcataag 35100 tcgatgtttg cgttttaaat ttaaaattct aactctgttt aaagatttct cacaggctga 35160 agttcggtcg ctaatataag catcttgtag tatttgcatc gtgtgaaaaa catcattttt 35220 ccatttaaca aatcgattat gtaaacattt tagttgttca tttgcgctag aatatcgcat 35280 ctgcaataaa taatgtttta acagagcata cattttcgct tcatcattgt aatgtgtttt 35340 ccttatgtta tactttgtgt ttacaggtaa cattaagaaa tctgcaaatt ctttccaaca 35400 atacgacaaa tgtttcacta gtttataaac tacacctttt aattcaatat tgttattacc 35460 aagcgttgcc attactctgg ttttttaata aaaaagctcg tctttttata cccttaattt 35520 atttacacta caaacaaagc atgcaaataa aaaactgact tgtaaaaaca tcatatgatg 35580 ttttttttgc ttcaccttca aaaacgtatt aaaggttacg caaatattct gacttgaaaa 35640 aacatcatat gatatttttt tttgcttcat cctcaaaacg tattaaaggt tatgcaaata 35700 aaaaaacaca ctcaaataac atcttatgat aaagcgctat taatctagca aataaaaact 35760 atttacgaaa aaaatataaa aatattttta agtgtaatat tacatcaaat agcgctttct 35820 tcaaagaaac tttattatca tatgattttt ttaaaacctt cccacgatgt tataaaactg 35880 ttttatttat ttaataacta gaaaatccag ttttttaatc aaagcttata aaaaaaaagt 35940 ttattataat ttttttttaa aaagaaaaga ataaaatatg gacgcagaaa cgaaaaaaga 36000 aaatttatca aatttagaga cagataagga aaaattatta aaaatacaaa ttaaattagc 36060 aaaaacaatt caagaagaag ctaccaacct tttgattaat tttcgttgcg atcttgaatc 36120 ttactatgtc attacgtaag tgcttttttt tatttaccat aaaacaaatt atcatatttt 36180 ttaaaatggt ttctacagat tctacacgat agcctaggcc agcaattttt tttcaatctt 36240 ttacaacgtc agtcatattt ttttttaaaa atatgactga cgttgtaaaa gaacgtttaa 36300 aaacttttaa aaaaacttta aaattcaaga agaagctacc aaccttttga ttaattttcg 36360 ttgcgatctg aatcttacta tgtcattacg taagtgcttt ttttatttac cataaaaaca 36420 aattatttac ttgtatagtt tttttttatt ttttttttat ttttttagta ctgaacgaaa 36480 tccagataat ataattagtc gactttctga aaacacaagc gcactattac aaattaataa 36540 acttttgacc aaacttaaaa ataaccaaat tcaaatcaga gtctttaaaa aaagatatca 36600 acaacaacta agaaaagcgt catgggcaca tgaaagaaaa aaaagagtca attacgaaaa 36660 aaacatggaa atattcgatg aaattaaaaa caaaattaaa aaaaaaaacc attttgcgat 36720 aataacgatc cgatgcatga accattttgt attaaacaag aaagaatatc agaagactcg 36780 agcaaacaag aagaattgtc agatgacgat ttaaacttta aataatttgt atgctttttt 36840 tattatttat aaaaaaaata atttttttgt tacaaatttt tataataagc ttgtctttct 36900 tcgtatgaca tttgtgaaag atcagtaggc aacgttttta ttagcgtttc ttgcgttttt 36960 acaggcgttt ctttcttttt tcctttaatt ttattataaa gcgaattagc taaagtgcct 37020 aaaacgtcag cattttgaat tgcaaatttt ccaatcattt ttgcattgtc cactgttatt 37080 aacgaagcca atccttttcc tcgttttacc ttttttcgag tttttcgctt ggtttttttg 37140 taatgcgtat ttcttttttg tctttttttc atctaaatgt ataaaataaa aataatttaa 37200 tttttttttt agataaataa aaaaaatgga tttcaataaa atcgatttya caaaygattt 37260 agtttgcatg acaatgggga tgttgcagtc actattagat tacgatttga aattacaaaa 37320 attagaccct gatgattcgc aaatcgatat cttcaaatat attgtgaaca gactagaaaa 37380 mactttttta aaatattttg aactcctacc aaccgatcaa aaaaaagcaa taatagaaac 37440 taaaatatta gaacattatt tcgatgtttt tgttttgtaa atatttattt tgtaaatatt 37500 attattttgt aaaattatta attataaaaa attctttttt attaaaaaaa aaaaacaata 37560 ccataataac aacttttttt attttttttt tacctttttg ttaataccca tttatgtact 37620 atatctccta ccacgccgtt tttaatatta ttttttgtat ttttatattt ttgttttgtt 37680 gcgcgttttt tgtaccaaac cgttttgcaa tccagctttt ttactgttcg atttgcaatt 37740 tctttactat ccattttgtt tactacaact tgaggctgtc gacctaattg acgccattga 37800 tttgcaattt ttccaccgtg ccgatggtat attctgttta attcttttaa ctcacttttt 37860 gatatttttt ttgtacgtga tatttttttt gtacgcgcca taatgcaatt taataatcaa 37920 ttttttttta ttgttaactt atttaaaatt attttcatta actaaacata taatatacct 37980 taaacttttc tttgtcaata cccatttata tacactcttt ccaactcgtc cactctttct 38040 attttgtttg ttatctttat atttttgttt tgtttttctt ttgcacatta accctgatcg 38100 agtattaggt gaatcgctac ctctttccac aacaaagtcc acgtccgagt tattttgttg 38160 agctaggtcg gcgctaggtt gttttaaacg acgaaacaaa tcggctacgt atcttttgtt 38220 ttcctcttgt tcttttatat attgtacaga atgtcgcaca gattttttgt ctcgaaccat 38280 ttttttaaaa aaaatttaaa aacacacgtg aaataaataa aaaattttat ttttttttta 38340 cttttattta tttaacatat tttaacggtt tatatatata ttttttaaca cattccgctt 38400 acctccttaa cacctcttga cacctgataa cgcctatttt cacacacact gattacacct 38460 gatttcacac acactgatta caccaataac attgattttt taagctacct gcttatcact 38520 tgatttatgt ttttatattt cctcccctcc ccctaccccc tgcctcctac cctgacacct 38580 acctagcccc ctaaccccta acccccctaa ccctctaacc caagtttacc ctttcttatt 38640 ttttattttt tttggaaaat ttttacccct cttttttatt tttttacttt ttttatgact 38700 accctccccc cccccctttt ttataagaag aaaaagtttc ttatattttt aatttttttt 38760 aaaaaaaaac actggtatac tcgtatcatc a 38791 // ID BEL-611_AA-I repbase; DNA; INV; 5931 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-611_AA_; KW BEL-611_AA-LTR; Pao_Bel_Ele16; BEL-611_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5931 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4582-5154] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 277..5532 FT /product="BEL-611_AA-I_1p" FT /translation="MSTERRIKALKLRLRSLMTSLNLIKVFVDDFDEDTQA FT DEVPVRLENLIKLWSDYNTVQTELEALDEAALEAHLKERSQLESTYYRVKG FT FLLAHNKAANNQNLMSPTHSEPQSLSSASQVRLPDVKLPVFDGKVENWLNF FT HDLYTSLVHSAVGLSNIQKFYYLRSSLSNAALQLIQTIPISANNYPVAWKL FT LVDHYQNPARLKQTYVDSLFDFSPLKRESATELHSLAEKFEANVKVLQQLG FT ERTEFWDVLLIRLLSTRLDSTTRRDWEEYASAKGAVIFKDLLAFIQRRVTV FT LQSIQAKVTETLPLNQQKRSIQRAVSSHGASQVSHRKCIVCSDHHPLYMCA FT TFSKLSTDEKEKEVRRHQLCRNCLRKGHLSKECSSSSNCRKCRGRHHSQLC FT PNEPTSSSTSKPSSPQQPNSTSSSNSDPPTTSASAAVDETISCATIGQNKK FT TVLLATALIILVDDNGVEHIGRALLDSGSECCFVTERFSQRIKVQRRKINL FT PISGIGQASTQAKQKFTSTIRSRVGDYAATVEFLVLPRVTIDLPATSVDTS FT TWNMPPGIQLADPSFDSSSPVDIIIGAEIFFELFRVPGRIPLGDHLPALVN FT SVFGWIVSGKSSSGSPSSPVVANLATLSEVHQLMEKFWKIEEIDSSTTYSV FT EEHSCEEHFRQNVSRASDGRYVVRLPFKENFLEHLSDNRRIAVRRFHLLQG FT RLLRNPVLHQQYKAFIDEYLELGHMHRIHEYDDAKVNHYYLPHHAVLREES FT TTTKLRVVFDASCKTPTGYSLNDMLMVGPTVQEDIHAITMRSRKHQVMIVA FT DIKMMYRQILVDSRDTPMQLIVWKENPDQPLETFELKTVTYGTASAPFLAT FT RVLIQLADDEGTEFPLAASVLKKDFYVDDLFSGGRNAPEVVNLRNQLDALL FT ARGGFQLRKWASNDESVLEGIPPENRALQASFAFDPDQVIKTLGLHWEPAN FT DCLRYRIELPFETTIRLTKRHALSLIARLYDPLGLVGPVVTTAKVFMQDLW FT TLKDNNGTQWGWDQELPAEYCARWIIYQSLLPRLNELRIERCVLLPDSDSV FT QIHIFSDASQIAYGACAYIRSTNALGLIKVSLLTSKSRIAPLKRQSIPRLE FT LCGALLAAQLYKKVKTSLQIDAQVFFWVDSTVVLHWLNESPSIWSTFVANR FT TSQIQLATQDCQWNHVSGVENPADCISRGMSADTILESNLWWHGPEWLRRQ FT QSQWPIPGAQLTNSPEALKEKRRTSVAATTSRSEPTFIDLLTEKHSDYQRL FT LRVVAYCQRFARNCQKLSSERVKTTFFIVDELREAETVLLRLVQQQCFAEE FT WKQLHKSQPIAMTSRLKWFHPFMSSEQLIRIGGRLGNTEGTYDCKHQIILP FT SSHMISILLARSYHIKHLHAAPQLLINLLRQRYWITGARKVAKLVVHRCVT FT CSRARPKMVQQLMAELPSSRATASRPFTITGVDYWGPIQLAPIHRRASPRK FT AYVAVFVCFSTKAVHIELVGDLSTPKFLQALRRFVSRRGLCSDLHSDNGRN FT FVGAAKELQELVRSKNHQESIGKECNTLGIRWHFNPPKASHFGGLWESAIR FT SAQKHFVRVLGPHTLPFDDMETLLCQIESCLNSRPIVPLSDDPSDYEPLTP FT GHFLIGTSLKAVPDNDLSSIPLDRLQKWHRVQKMFQDLWKRWHLEYLSTLQ FT PRAKWCNPPIQILPDQLVVLRDENTEPMRWPMARILQVHPGSDGVVRVVTV FT QTPTGRYVRPVAKICLLPIASPTSSSDQAATN" XX SQ Sequence 5931 BP; 1591 A; 1528 C; 1302 G; 1509 T; 1 other; tttggtcctt cgagccggat cagcgtcgat ctccgctgtt ccaacatccc tacgcctggt 60 gagtgagtcg ccattgcatg actggaaagt accagaagtt cttcggattc gccatctttc 120 tgcctacact ctcttcaagt tttggatttg cccgccgttc aactttaact cattattcaa 180 ggcattgaaa ttgcctctag caggtgatta aatttccaat tatttcactt tctcgtgtgt 240 gcgttctacg ctagcctttt gtccccagat cccgccatgt caacggaacg ccggatcaaa 300 gcactcaagc tgcgactcag aagcttgatg acgtcgctga atttaatcaa agtgttcgtc 360 gatgacttcg acgaagacac ccaagcggac gaagtcccgg tccgattgga gaatctgatc 420 aagctatggt cggattacaa cacagtgcag acggagctag aggcacttga tgaagctgca 480 cttgaggcac atctgaagga acggtcgcaa cttgaatcaa catactaccg agtcaaaggc 540 ttcttgttgg ctcataataa agcagcaaac aatcagaatt tgatgtctcc aactcattca 600 gagccgcaat ccctttcgtc cgcatcgcag gtgcgattgc cggacgttaa gctcccggtc 660 ttcgatggaa aagttgaaaa ttggcttaac tttcatgatc tctacacttc gcttgttcac 720 tcggctgttg gtctatccaa tattcagaaa ttttattatc tacgatcctc gctttcgaac 780 gccgctctgc agctgattca gacgattccg attagtgcca acaattatcc tgtggcgtgg 840 aagctactag tagaccacta tcaaaaccca gcgcgattaa aacagacgta tgttgattct 900 ctattcgact tttcwccctt gaagcgtgaa tcagcaacgg agcttcacag cctagcggag 960 aaattcgaag ccaatgtcaa agtgcttcag caactcggag agcgaacgga attttgggat 1020 gtcctactta tccgtttgct tagtacacgc cttgactcaa ccactcgaag agactgggag 1080 gagtatgctt cagccaaggg agccgtaatc ttcaaggatc tacttgcctt cattcaacgt 1140 cgcgtgacgg tactacaatc tatccaagcg aaggtgactg aaaccctgcc gttgaatcaa 1200 caaaaacggt caatacagcg tgcagtgtcg agccacggag ccagccaggt tagccatcga 1260 aagtgtattg tttgttccga ccaccatccc ctctacatgt gtgcgacgtt ctcgaagctt 1320 tcaacggacg agaaggaaaa ggaggttcgt cggcaccaac tttgccgaaa ctgtttaagg 1380 aagggtcatc tttctaaaga gtgttcgtca tcctcaaact gccgcaagtg ccgtggtcgt 1440 catcacagtc agctttgtcc gaacgagcca acttcttctt caacatcaaa gccgtcaagc 1500 cctcagcagc ctaattcaac aagttcttcg aattctgatc cgccaaccac gtccgcctct 1560 gccgcggtcg atgaaacaat cagttgtgct acgattggtc agaataagaa aactgttctc 1620 ctcgctaccg ccctcatcat tcttgtagac gataacggcg tcgaacacat tggtcgtgcc 1680 ctactagatt ccggtagcga atgttgcttc gtgactgaga ggttttccca gcgcatcaag 1740 gtgcaaagaa ggaagataaa cttaccgata agcggaattg gtcaagcatc aacacaagcg 1800 aaacaaaagt tcacctccac cattcgttct cgagtcggtg actacgccgc taccgtcgaa 1860 tttctagttc tcccaagagt gacaattgac ttaccagcga cttcagttga cacttcgacc 1920 tggaatatgc ctcctggtat tcagctggct gatccatcgt tcgatagtag cagccctgtc 1980 gacattatca taggcgcaga aatatttttc gagttgttcc gagtccctgg tcgcattccc 2040 cttggtgacc atctccccgc gcttgtcaat tctgtatttg gctggatagt gtcaggaaaa 2100 tcctcatcag gctcaccgtc ttctcccgtc gtcgctaacc tcgcaacact gtcggaagtc 2160 catcaactta tggagaaatt ttggaagatt gaggaaattg attcgtcaac aacctactcc 2220 gtcgaagagc attcatgcga ggaacatttt cgtcaaaatg tgtcacgagc ttctgacgga 2280 cgctacgtag ttcgtttgcc attcaaagaa aactttctag agcatctaag cgacaaccgt 2340 cgcattgcgg ttcgccgatt tcaccttctg caaggtcgtc tgctgcgcaa tcctgttctt 2400 catcaacagt acaaggcttt cattgatgag tatctagagc ttggacacat gcatcggatc 2460 catgagtatg acgacgctaa agtcaaccac tattatcttc cacatcacgc tgttttacgt 2520 gaagaaagca caaccacgaa gctgcgagtt gtctttgatg catcttgtaa gacaccgaca 2580 ggatattcac tcaatgatat gcttatggtg ggacctaccg tccaggaaga cattcatgca 2640 atcacgatgc gatcccgcaa gcatcaagtg atgattgttg cggatatcaa gatgatgtat 2700 cgccagattc tcgtggactc tcgagacact ccaatgcagc tcatcgtttg gaaagaaaat 2760 cccgatcaac cattggagac atttgaactg aagaccgtga cttacggaac agcgagtgca 2820 ccctttcttg ccacccgcgt attaatccaa ttagctgatg atgaaggcac agagtttccg 2880 ttggctgctt cagttctaaa aaaggatttc tatgtggacg atctcttttc cggcggacga 2940 aacgcaccag aggttgtcaa tcttcgaaat cagttagatg cactcctagc gaggggtgga 3000 tttcaactaa gaaaatgggc atcgaacgat gaatccgttt tagagggaat tccacctgaa 3060 aacagagcgt tgcaagcatc cttcgctttc gaccctgacc aagtcatcaa aactctcgga 3120 ctacactggg aacccgccaa tgactgtcta cgatacagaa ttgaattgcc atttgaaacc 3180 acaattcgat tgaccaaacg ccacgctctt tcactcatcg ctcgtctcta tgacccgcta 3240 ggcttagtag ggcctgttgt aacgaccgca aaagtgttca tgcaggacct gtggaccctt 3300 aaagacaaca acggaacaca atggggttgg gatcaggagt taccagctga atattgcgca 3360 cgatggatca tctatcaatc actactgcca aggctaaatg aactccgaat cgaacgctgc 3420 gttttgcttc cagattccga cagcgtacag attcatattt tttccgatgc ctcgcagatt 3480 gcttacggtg cttgcgccta tatcagatct acaaatgctt tgggactgat taaagtctcc 3540 ctgcttacct cgaaatctcg gatagcacct ctaaaacgac agagcatccc acgcctcgag 3600 ttgtgtgggg ctttactggc agctcaatta tataaaaagg taaaaacatc cctacaaatc 3660 gacgctcagg ttttcttttg ggttgacagc accgttgtac tacattggct taatgaatca 3720 ccatccattt ggtcaacatt cgtcgcaaat cgtacatcgc aaattcaact cgccactcag 3780 gactgccagt ggaatcatgt atccggagta gaaaacccag ccgactgcat ttcgcgcggt 3840 atgtctgctg acactatttt ggaatcaaac ctttggtggc acggaccaga atggttacga 3900 cgtcagcaaa gtcagtggcc aatacctggt gctcaactta ctaactcccc tgaagcactg 3960 aaggaaaagc gtcgaacatc tgtggcagct acaacatctc gcagcgaacc gacctttatc 4020 gacttgctaa ccgaaaaaca ttctgattac caacgtcttc ttcgagtagt agcctactgc 4080 caacgttttg ctagaaattg ccagaaactg tccagcgaaa gagttaaaac tacatttttc 4140 atcgttgatg aactgcgaga agcggaaacg gttctacttc gactagtaca gcagcaatgt 4200 ttcgccgaag agtggaagca gcttcataaa tcgcagccca tcgccatgac atcccgcctc 4260 aaatggtttc atccgttcat gtcctcagaa cagttgatac gaatcggtgg taggttggga 4320 aacactgaag gaacgtatga ttgcaagcac caaattatct tgccatcatc gcatatgatc 4380 tcgattctcc tggctcgtag ctaccatatt aaacacctcc acgcagctcc tcagctactt 4440 ataaatttac tccgacagag gtactggatc acaggggcca gaaaagtagc taaactggtg 4500 gttcaccgtt gcgtaacttg ctcgcgggcg cgtccaaaga tggttcagca acttatggcg 4560 gagcttcctt cttcgcgagc cacggcgagt cgacccttta caattaccgg agtggattat 4620 tggggaccca ttcaactagc acccattcat cgccgtgctt ctcccaggaa agcctacgta 4680 gcagtttttg tctgcttcag taccaaggcg gtgcacattg aacttgttgg agatctcagt 4740 acaccaaaat tccttcaagc actaagacga ttcgtatctc gccgtggtct ctgctctgac 4800 ctacacagcg ataatggtcg gaatttcgta ggagctgcaa aagaactgca agaactagtc 4860 agaagcaaga atcaccaaga atccatcggg aaagagtgca acacgctggg aatccgctgg 4920 catttcaatc ctccgaaggc ttcgcatttc ggtggacttt gggagtctgc catacggtcg 4980 gcacaaaagc actttgtccg ggtgcttgga ccacacacgt taccttttga tgacatggaa 5040 acgctcctgt gtcagatcga aagctgcttg aattctcgac caattgtccc acttagcgac 5100 gatccctccg actacgagcc gcttactccg ggtcattttc tgattgggac gtcattgaag 5160 gctgtcccag acaacgatct aagtagcatt ccattggatc gtctccaaaa atggcatcga 5220 gtccagaaaa tgttccagga tttgtggaaa aggtggcacc tggaatacct ctcgacgtta 5280 caaccacggg caaaatggtg caatcctcca atacagattc taccagatca actggtagtt 5340 ctgcgcgatg aaaacaccga gccaatgcgt tggcctatgg cacgaatact tcaagttcac 5400 ccgggttcag acggagtagt tcgcgtcgtt acggttcaga cacccaccgg ccgatatgtg 5460 cgtcccgtag caaaaatttg cctgcttccg attgcatcac cgacatcatc gtccgaccaa 5520 gcagcaacaa attgaatcca gaatatgaca tcaactagta ctattccaag gcgccttcaa 5580 caattcatct gtacaggatc acctaaacgg aaagaataca atgcccttca tttgaaggga 5640 ctaggtaagc accagtccaa tcaaattaat tatgtttaaa cccgctaccg ggtctcaatg 5700 tttgcagaca ccatcagaaa tttcggtttg cttgacgttt cattgaacaa caatgttgct 5760 gtcacactca tccgtcaaag ttacatcgat tcgtcgacga caagggacaa cccattcgtt 5820 tgcgttcaag gtacatcgac taccaacagg tgcaaccaac gtgaaaggcg ttcgtttgaa 5880 atcatacaag tggatcagaa acaatgttcc tgaaggggcc aggatgtttc g 5931 // ID DNA8-1_TCa repbase; DNA; INV; 1767 BP. XX AC . XX DT 21-MAR-2009 (Rel. 14.03, Created) DT 21-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-1_TCa. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-1767 RA Jurka J.; RT "DNA transposons from insects."; RL Repbase Reports 9(3), 668-668 (2009). XX DR [1] (Consensus) XX CC 8bp TSD. Unclassified. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Tribolium CC castaneum Genome Project. XX SQ Sequence 1767 BP; 577 A; 286 C; 306 G; 598 T; 0 other; accaaagcca gtagatatac taaagtagcg acacggacct tgacagggca acaataacga 60 ttatacactc tagagagggc gcaactatcg aaattcgaca tataccgcat tatttacata 120 aaatgcctag ccttgaaaag ccttagtaag tgcgttttgt ccctgtttct acgtttaatc 180 ttcactaaat gaaattgcta aaggggatgc tcaaaagaga tattcgttaa aggtcacatt 240 agatgataga tttaaatcga gaatgtattt aaacaggcag tttagttttc caaggctcga 300 cattttttgt aataacgctg taattttaaa ggttcgtcaa attgtcagta attttgaaca 360 tacgtcaaaa ctgtcaagat tctctcactt ttgaaaaatt ggtaaagagc acagtgattc 420 tggtgttcat ctgtagtttt actgtggtta ttgattatga acagattcca tggaactttc 480 aaacctgtgc ttcaatcacg taaataataa atcgacacca aaaacattaa acaaaagggc 540 ctccagactg ttcaaagtcg tccacattac gaagcacgtc tgtgtgagaa tactggaaac 600 tcattcaaac aaaaattgtt taagaaacgt aaagtggtag gtatcaactt gaaagagaaa 660 tttctaagat accaaatttt cattaagtat ttcaggtaga tttaaacaat acttccttga 720 aaattcattc taaattatta gattccgccc taatgggaaa gaatttaaga ccgctctgct 780 tacactgacc ttcgtttcac aaaatttaat gcatctaaat ctgggaattg tagcgcatcg 840 aagtttggtg gactttttat agccacttat ttacagcaac tatgttatac aaaatttaaa 900 taataaagag aaattcaggt tgtattattt ctatttctta ggtaatatcg aggtgtaaac 960 cgtaccccaa attttgtaca ccagatggtt tcctgtgatt taatcatgat tccttctgat 1020 tttccctaaa tttttaaagt gatttcctta aaatttcttg atttactcaa ttttgtacac 1080 tgatttcctc tgcggtttac ctctcgggta atattgaaaa catttacttt tcttattgcg 1140 aaagatgtct tggtttcata gcagttgtta atagtgttga gggaaaaaat taaatttaat 1200 taaaaaaaaa taaatcgacc aatttaaaat cctggagtaa ttaaatacga taacctgtaa 1260 aaacacttag ggaggagttt attaaaatta gaaatgtagg taacacaatt ttttggattt 1320 aaacaactat tattttaaaa cgtgttgttg gctttttcgt taaatacaaa gttcaaattc 1380 ttttaaggaa tgaattcact gttgaagcgc cagcaccttc ttcgaaatgt ccataagagg 1440 tgctgtattc tgcgcattct aaaagccttt ctcattgcaa tgttcagttg aatttaggga 1500 cgatctgatt tatttttacg tgagtgaagc acaagtttga aggtttcatg gtatctgtac 1560 acataaacag taataaccag ggtgaaatta tagacaatca ccgaatttat tgttctgatt 1620 caccaatttt tcaaacgtga gaaaatcttg acatttttga cgtacgtttt tgaattttga 1680 tggtggagcc atctgtagag tgtataatcg ttattgttgc cctgtcaagg tccgtgtcgc 1740 tactttagta tatctactgg ctttggt 1767 // ID P-27_HM repbase; DNA; INV; 6803 BP. XX AC . XX DT 19-DEC-2008 (Rel. 13.12, Created) DT 19-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-27_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6803 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(12), 2080-2080 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 179..2884 FT /product="P-27_HM_1p" FT /translation="MPGENCAIFGCSTSRRHKGISIFKVPTPTNDANKKWS FT NDLINVVTRDRLIDDSLKKRIDSFNLYICELHFTEDQFWVYSSRKILKDGA FT LPTLNLPKKSNPSQRSSLSITKREEFLETQELISQTTPLCYKSFDDFTKRI FT LKLSLGEXWKISFDERFAIITCXSPEHILPKFEIYVDELLDYCIRIFGWML FT PIDHLLYLQYKGSFFNVSLSKFISHIETLILCPGVEVKNLDLINQNFKFEK FT HVITKKFNYLLFKLLPNKPRLNQDEFYRSPYCNVLIKSQSCCSSCHSLQLK FT SNYEVNNKTKTLKQPAKLNAPLKFTSTERIKLSLQQQRLKCKQLESQIEQM FT KTVLNMESEKVNLELDQDLKTLYSGFNQKNIPDFXKLFWNEQQKYIQASSP FT CSVKYHPMIIKFCLNLAAKSSSAYSDLRFDSKTGSGFLVLPSLRTLRDYKN FT YIHPSRGFNPQVISDLAYKTASFSSAERFVTILFDEMKVQEDLVWDKYSGE FT LIGFVDLGDXQTNYATLKNVRELASYVLVFHVKSVVNPLSYSLATFATTGV FT TSTQLMPIFWKAVRYLESINLKVIAATADGASPNRKFFKMHKKLDGGSGKN FT IIYRTKNLFSNDNRFIFFFADVPHLIKTSRNCLSNSGSGCKSRYMWNSGFF FT LLWNHISTFYNSDLERGLKLVPKLTSDHINLTPYSVMRVRLASQVLSETVG FT NVLKQFGPPEAAGTAEFCLMMDMFFDCLNVKNSAEYITKRKPFLKPYETVD FT DSRFTWLDNFLLYFKLWKESIDQREGSFDESSKSKMFISTQTYEGLQITVH FT SFKEVVKFLLENGVQFVFSERFCQDDLENYFGRQRAIGRRKDNPSIRDFGY FT NDNTIKSQFSVRPINGNVQSNSNNIIDIENNPLPKRKRKSLIQIKSQH*" FT CDS 6695..5526 FT /product="P-27_HM_2p" FT /translation="MASFSSDFETNSDLECLFDVFEGGFLNDDINLVTDID FT SVVSDIDIYRDNVKQHACTQCPKLYKTIRGLKRHIASKHILECFEKCSVKM FT SDTENSSNKLSPLKLKSIVEECASLLEMDMCLPFDVRKKFSCEQFSFSIDD FT AEHLWQKLKPIIEKFKGNSDLFYQEYYGLLSENLLASHFSDITLSNTLLTE FT MSAIILNNLSNINPNIIESVGTTQEITDIEKKSLQYLIGYIFQRLYTKFQS FT SKNLCKYSQQCVSILLRCRVMEDDTQALVNIRDRGGLWKVNNIIQDLFLTC FT EIIFRKKTTQVSSKLDPEVLIDSMSTNYIVLSLFKQVCNDPESTEVSKNLL FT EHLITLFVRVRSFSYAKDVIEKHKMRKKSCKTRSLRCEIKKASEGN*" XX SQ Sequence 6803 BP; 2375 A; 1044 C; 1054 G; 2317 T; 13 other; catagttata agaaagaata gtttggaagc gaacaatagc gtattgtttt aactctttca 60 tactgttttt tctcgtttcc ggacttccga acctgcacaa aaagattgtt gttgttagcg 120 tttgcccgtt attttactaa gttgtatttg tgttttgatt gaaattagtt ttcttgatat 180 gcctggtgaa aattgtgcaa tatttggttg ttcaacttca agaagacaca aaggtatcag 240 cattttcaaa gttcctactc caactaatga tgcaaataag aaatggagta atgaccttat 300 taacgtagtt actagagaca gactcatcga tgattcttta aaaaaacgaa ttgattcttt 360 taatctttat atttgtgagc ttcactttac tgaagatcaa ttttgggttt actcatctcg 420 aaagatttta aaggatggag cattgccaac cctaaatctt cccaagaaaa gtaatccatc 480 gcagcgaagt tcgctatcta ttacaaaacg tgaggagttt ttagaaactc aagagttgat 540 ttctcaaact acaccgttat gttacaaatc ttttgatgat tttacyaaac gtattttaaa 600 actctcttta ggtgaawgtt ggaaaatatc ttttgatgag agatttgcaa ttattacttg 660 tarttctcca gaacatattt tacctaaatt cgaaatttat gtcgacgagt tgctagacta 720 ttgtataaga atttttggtt ggatgttacc aatagatcat cttttgtacc ttcaatataa 780 aggatctttt tttaatgttt ctttatccaa atttataagt catattgaaa ctcttattct 840 ttgcccaggt gttgaagtaa aaaatttaga cttaatyaac caaaatttta aatttgaaaa 900 acatgttata actaaaaaat ttaattactt gctttttaag ctgcttccta ataaacctag 960 attaaatcaa gatgagtttt atcgttcacc gtattgtaat gttttaataa aatcccaaag 1020 ttgttgttct tcatgccatt ctttgcaact taaatctaat tatgaagtaa ataataaaac 1080 taaaacttta aaacaacctg caaagctgaa cgctcctctt aagtttacct ccacagaaag 1140 aattaaatta agtttgcaac aacaacgttt aaaatgcaaa cagttggaaa gtcaaattga 1200 acaaatgaaa acagttttaa atatggaaag tgaaaaggtt aatttagaac ttgaccaaga 1260 ccttaaaaca ttgtattctg gttttaatca aaaaaatatt ccagacttca traaactatt 1320 ttggaatgag cagcaaaaat atattcaagc ttctagtccc tgcagtgtta agtatcatcc 1380 aatgataata aaattttgyc ttaatttagc agctaaatct tcgtcagcat attctgattt 1440 gcgttttgat agtaagactg gttcagggtt tttagttctt ccaagtttac gcacattgag 1500 agactataaa aattatatac atccttcaag aggttttaat cctcaagtca taagtgatct 1560 tgcatataaa actgctagtt tctctagtgc tgaacgattt gttacaattc tgtttgacga 1620 aatgaaagtt caagaagatt tggtctggga taagtactct ggtgagttga ttggatttgt 1680 agatttagga gatrttcaaa caaactatgc aacacttaaa aatgtaagag agctagcatc 1740 ttatgtttta gtttttcatg ttaaaagtgt agtaaaccca ctttcctata gtctagcaac 1800 atttgccaca actggagtaa catcaacaca acttatgcca atattttgga aagcagtgcg 1860 ttatcttgaa agtataaatt tgaaagtaat tgctgcaact gcwgatggtg catctccaaa 1920 tagaaagttt tttaaaatgc ataagaaatt agatggtggc tcaggaaaaa acattatata 1980 tcgcactaaa aacttattta gtaatgataa tagattcatc tttttctttg cagatgtacc 2040 tcatctaatt aaaacttcta gaaactgcct aagtaattct ggttctggat gtaaatcacg 2100 ctacatgtgg aatagtggat tttttcttct ttggaatcat atttcaacat tttataattc 2160 cgatttagaa agaggtttga aacttgttcc taaactaaca agcgatcata taaacttaac 2220 accgtattct gtaatgagag tacgacttgc atcccaagta ttgagcgaaa cagtcggaaa 2280 tgttttaaag cagtttggac caccagaagc tgctggaaca gcagaatttt gtttaatgat 2340 ggatatgttt tttgattgtc ttaatgtaaa aaatagtgct gaatatataa cgaaaagaaa 2400 gccattttta aaaccatatg aaacagtaga tgattcacga tttacttggt tagataactt 2460 tctcctatac tttaaattgt ggaaagaatc tatagatcaa agagaaggaa gttttgatga 2520 gagctcaaaa tcaaaaatgt ttatttctac acaaacttat gaaggtttgc aaattacagt 2580 gcattcattc aaagaagttg ttaaattttt acttgaaaat ggtgttcaat ttgttttttc 2640 tgagagattc tgccaagatg accttgaaaa ctattttggt agacaacgtg caattggtcg 2700 cagaaaagat aatccgtcta tcagagattt tggctacaac gacaatacaa tcaagtcgca 2760 gttttcagtt cgtccaatta atggtaatgt tcaatctaac agcaataata tcatagatat 2820 tgaaaataat cctttaccaa aaagaaaaag aaagtcgtta attcaaatta aaagccaaca 2880 ttaagctctt cattttcaca tttattgatt taaaaaaaaa ggttattttc ataaccacaa 2940 aatgttaaaa gcattatttg tcatttatga tgacattatg acaaatcagt cgcccaatat 3000 agacttttta gaccattcag tggcagtggc ccatacaaat ggggggtgga gggggtaact 3060 actcctaaaa aatttcaaac ttcaaaatat tattgataca attaaaaaat ttaataaaaa 3120 acaccggatc aaggaggggg gctagaggga ctatagcccc gggccccgca aatttagggg 3180 cccccaaaaa ttttaaaaat tttttgagaa cctttttttt tttaagtcac ttttaggctc 3240 tggctcataa aaaggcctcc aaaaacaaca atagctccaa gaagttttaa ttcgccactg 3300 ctaggagggc tagacttacc ccctttaagg ggagtaagcc cccccccctc maaagaactt 3360 caaactatta acgacaaaat taaaaaaaag ttttawgcgt catttaaaac aagaaaatta 3420 aaaagatcca cgaatgtact accaattttc aaacaacaat tcaaatttct tccaacaaat 3480 caggrccgtw ccgaacgact taaatttaaa tttgaacgtc tattattata gatgtgctca 3540 aagcacataa acgtttttgg tgaaattgga aacggagagc taacttaggc catttaataa 3600 gttacttatt aaatgataat cttcttgtgg cagtgtactg ttaaggtgct ctaagaagcc 3660 attaggtctg cttggggcac gttaataaat ttaaaaaaaa atccagggaa accgaaccgc 3720 ctgaactttg tytcacctac gaatttttaa gctcgtagat atcaaagaaa ccgatgacct 3780 tttattttag ttgaccaaat agggttcccc ctgtcccaac ttaaaaaaat attcacactg 3840 ctgtatttgg gttctaccaa aaaaaagggt actctgattt caataaacta ggggattttt 3900 aagaataata aaacgacggt cgtatactat tctaacgccc ctttacaaaa ggtgttggca 3960 ctggcagatt aatacttctt ggggtctggg gctattaatt tttttggatt ttttttttta 4020 tgtgtcagag gctaaaaata gtaaaaagta aaagtaaaaa gtccttcata atattttata 4080 taccctaaat tgtggggctc gggtcaatat gcggcagtgt tgtagtggta agagcgctcg 4140 ctttgtatgc gagaggttcg gagttcgact cccaccactt cctgagaggt tcggagctcg 4200 acttaatttc tccgcgcaac catcttgctt gtcaaggttt gtgttttgga gttaaagagt 4260 tgagagaggg ttgtaccacg atttaaaaaa aaaaaaaaaa aaagagtagc ctccttgact 4320 gtagtggccc ctcgggcatt ggggaggtga ataatgaata atgaaaaaat aatacaaaaa 4380 gtagccccct cccttaatcc gccactgggt gtttgaaata aacatgaaaa tgaaataacc 4440 aattttaata catttaattg aaataataaa ctaatcaagg aaaaggaaaa aattatattg 4500 attgatccac atgatgattg aacaacacat gacattgata gggaataaca ttttttttaa 4560 atgagcaaag aaaaatctaa attattttcc aggtgggttg atattttata tttggttatt 4620 tataattata agtgaatgtt atatcaatat agttaatgta accacaatag tgaacgttaa 4680 gctgttgtag tttagttaaa aaaaaagttc attcatgtta caatccaata acagtatttg 4740 ataaaatatt aaaattcagt atttaataaa caataataca ataaataatt ttcaataaca 4800 aaataataaa ttgctgggta acaggtacac ttaattaata aatttaataa ctgcatatat 4860 atcgtgtcag gttaggttat cctaacgtgc taaagtatat acagttatta actgttagta 4920 attgttcctc gctatgtata tattttggca tgtttacatc catgttttca cacttgcgtg 4980 gtttggatgg ttctccatag tgacataaat atgccactaa gaaagaattg aagcaatttt 5040 ttactaatgc cctccctggc gttattacaa aatgccctcc caggagttat cacaaagata 5100 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 5160 tatatatata tatatatata tacatatata tgtatatata catatatata tatatatata 5220 tatatatata tatatatata tatatatata tatatataca tatgtatata tatatatata 5280 tatatatata tatatatata tatatatata tatatatata tatatatatg taagtataat 5340 aaaagttaga aataattgac cctaactcta agtttatcat atctggatca ttgcaaacat 5400 tgaaggatta tcttattata taaagtaact attaagtaaa tttttaaaca gaatttaagt 5460 agtttatgtg ccattaatgt tttaaaaaga tatgtggcta tgaaaatagc cttttatgtt 5520 caatttcaat ttccttcact tgctttctta atttcacatc gcaatgaacg agtcttacaa 5580 gatttctttc tcattttatg cttttcaata acatctttgg cataagaaaa cgaacgaact 5640 cttacaaata atgtaattaa gtgctctagc aaattcttac taacttcggt tgattctgga 5700 tcattgcaaa cttgtttaaa aagagataaa acaatataat ttgttgacat ggagtcaatt 5760 agaacttcag gatctagttt tgaagaaact tgtgttgttt ttttacgaaa aataatctcg 5820 catgttagaa aaagatcctg aattatattg tttactttcc acaaaccccc tctatctcta 5880 atatttacaa gtgcctgagt atcatcttcc ataaccctgc agcgcagtaa aattgaaaca 5940 cactgttggc tatacttaca taaattttta ctactttgaa atttagtata caatctctga 6000 aatatatatc caataagata ttgtaaactc tttttctcta tatcagttat ttcttgagtt 6060 gtaccaacag attcaattat atttgggttt atattagaaa gattatttaa tataatggca 6120 gacatttcag tgagtagtgt attgctcaaa gttatatcac tgaagtgtga agctagtaga 6180 ttttcgctca gcagtccata atactcttga taaaataaat cactattgcc tttaaatttt 6240 tctataatcg gttttaattt ttgccaaaga tgttcagcat catctattga aaaactaaat 6300 tgctcgcatg agaatttttt tctaacatca aatggcaggc acatgtccat ttccaataaa 6360 cttgcacatt cttcaactat gcttttcagc tttagcggtg acaatttatt agaagaattc 6420 tccgtatcag acatttttac agaacatttt tcaaaacatt ctaaaatatg cttcgatgca 6480 atgtgtcttt ttaaacctct aattgttttg tatagtttag gacactgagt gcatgcatgt 6540 tgtttgacat tatctctata aatatcgata tcagaaacaa cggaatctat atctgttact 6600 aagttaatgt catcattcaa aaatcctcct tcaaacacgt caaacaaaca ttccaaatca 6660 ctatttgtct caaaatcgct gctaaaactt gccattatta taaacttgtg caggttcgcc 6720 gggtcgtcgt gtagttttca agttaaaatc ggaaaacaat ggaaatggtc cgtgtttcca 6780 gactatgtat tcttatgact atg 6803 // ID BEL-69_AA-LTR repbase; DNA; INV; 697 BP. XX AC supercont1.21; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-69_AA_; KW BEL-69_AA-I; BEL-69_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-697 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.21; Positions 3460068 3460764. XX SQ Sequence 697 BP; 239 A; 130 C; 122 G; 206 T; 0 other; tgtcacgtca cccccctgct acaaatctac tgtggtcgat tcactgtggc cgttcaagtg 60 gtcaactaag cttgaccaat acaacggtcg attaaccgag cgtaaaagag aggataaagg 120 cgcttttcga tgtatgtagg agttcagaac caaaacaaaa ttggaacaac tctacgaagt 180 catgcaaatg agcatttcaa aagttttatc tactattctg cgcatttttc cgttaattta 240 aagctaaaag gtagtgagga accaaaatct aaactctagt ttaacttagt tctagttcgt 300 gcaaaattgg cttaaaatct atagtaccga ggtaagggaa atacttctta aaagtacgtt 360 gaatttataa cctaaaatga attcgttata tatagaaacc gctaatcatc taccaagtcc 420 tagacgaggt tgagcaattg aaaatcctaa ctagctgtaa gtattattgt ttttggttcg 480 aatttgataa ttgagagttt ctaatcctta gtatatctac aaaattatta agcccgagtc 540 accgtggtaa gacccaaccg tctgaaggtc accaaaaatt gtaagtgatt gcaaaaatat 600 tgaattcatt attctaatga aataaaattt agcttttagc ttacccacat acaaaacggt 660 ttgctcaaaa gagtttggga aacttccccc cacaaca 697 // ID Mariner-3_AP repbase; DNA; INV; 2939 BP. XX AC Contig54199; XX DT 07-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 15.12, Last updated, Version 2) XX DE Mariner-type transposable element. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-3_AP. XX NM Mariner-3_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2939 RA Jurka J.; RT "Mariner families from Acyrthosiphon pisum."; RL Repbase Reports 8(3), 342-342 (2008). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(358..537,585..1205,1209..1625) FT /product="Mariner-3_AP_1p" FT /translation="MDQESSNISPMKKNARGNVRIILTIIIVLFRTVLMYY FT SVLYYTTITTIFNSSLLVLVKNGDVPKLTAKEMIKKISEESGIGQRTVSVT FT LSEYRNKKTVTSPNKTKIRPKVTDKVDEFDQNAIRQKVHQFWHNHQIPTLN FT KILTAVNEDDSLPSFSKISLHRVLKHLNFEYVRKSRNSVLIERNDIVCWRR FT NYLETIKNYRQLGRQIYYLDETWVNAGETTSKSWVDITVKSTRDAFLRGLT FT TGQKEPSGKGKRLIVVHIGSSDGFVPGLFCFESKKNTQDYHDEMNGDNFYE FT WFNKILPLLNENAVIVMDNASDHSVKKDPCPVISWKKADIINWLENKGEVV FT DHIKIKSQLLERAQVLKPQYEQYVIDELAKAANKTVVRLPPYHCELNPIEL FT VWSSVKKLCSHE" XX SQ Sequence 2939 BP; 968 A; 466 C; 548 G; 957 T; 0 other; ctccgtacca cgagagaacg catacggcgg ccgacgaaaa ctggtttatt tcgcgcattt 60 cccctctacg aaacagtttc tcagccggaa taagtcggtt gctatcggaa ctataactat 120 agtcggtttt cggttaggtc atggtataca gaatggttgc tgagcagctg acaattgttg 180 attcacgcgt cgtgtacact actgttttga gtttttttta ccgagtgaat cttgttttag 240 tttgactgtg ccgtaaattg tattatattt tttttgcggc agtgtttgcg ttactacgtg 300 tgcgtatatc attttatttt gactgtcagg tgtatacctt attttttttt tattcaaatg 360 gatcaagaaa gcagcaatat ttcaccaatg aaaaaaaatg ctcgtggaaa tgtaagaatt 420 attttaacta ttataattgt attatttcgg actgtactta tgtactatag tgtgttatat 480 tatactacaa taaccactat ttttaattct agtttgttgg ttctggtcaa aaatggatga 540 ttgtaaattt gtacaaaaac aaaatggcac tgcaagcaac ttgagatgtg ccaaaattga 600 ctgcgaagga aatgataaaa aaaatatcgg aagaaagtgg tatcggacaa cggactgtga 660 gcgttactct gtcggaatat cgcaacaaaa agactgtaac atcaccaaac aagaccaaaa 720 tcagaccaaa agtaaccgat aaagtagacg agtttgacca aaatgccatc aggcaaaaag 780 ttcatcaatt ttggcacaac catcaaattc cgacgttgaa taaaatatta acagcagtca 840 atgaagatga cagtttgcca agtttctcaa aaatttcttt gcacagagtt ttgaaacatc 900 tgaatttcga gtatgtccgt aagtctcgta atagtgtatt aatcgaacgg aatgatattg 960 tgtgttggcg ccgaaactat ttggaaacta taaagaatta tcgtcagtta ggtagacaaa 1020 tatattacct tgatgaaaca tgggtaaatg caggtgaaac tacatcgaaa tcatgggtgg 1080 atattacagt gaaatcgaca agggacgcat ttctcagagg tcttacaact ggacaaaagg 1140 aaccatctgg taagggcaag cggctcattg ttgtgcacat cgggtcatct gacgggtttg 1200 ttccttgagg tctattttgt ttcgaatcga aaaaaaatac ccaagattac catgacgaaa 1260 tgaacggtga taatttttat gagtggttca acaaaatttt gccgttattg aacgaaaatg 1320 ccgtgatcgt catggacaat gcgtctgatc attcggtaaa gaaggaccca tgccccgtta 1380 tatcttggaa aaaagcagat attatcaact ggctggaaaa taaaggtgag gtagtcgatc 1440 atataaagat taaatcccaa ttattggaaa gagctcaagt tttaaagcct caatatgaac 1500 aatatgtgat tgatgaatta gcaaaggctg ctaacaaaac tgttgtacgt cttccccctt 1560 atcactgcga attaaaccct atagaactcg tgtggtcatc agtaaaaaaa ttatgttcgc 1620 atgaataata caacatataa attgcaagat gttcgtaaac tattggaaga aggcgtggaa 1680 cgggttacac cagacatgtg gaaaaacttt attactcatg tgacaaaaga ggaagacaaa 1740 ttttggcaaa tcgatgtctt atctgatgaa ctctttgatg aacaagaatt ccacgtctta 1800 acaatcacag gtgacacaag ttcagacttc gattctgatt aaattattta ccaataaatg 1860 tttgtatatt atttttaaat atgactatgg ttttttaata ttagtgtatt atttgtattt 1920 ttattttttg tacatattgt ttttaaatca atcgtaagat ttagtacatt tttaatttat 1980 gattttttgt atgactgtat tatttgtatt tttaatttaa tgacttatga aaaaaatata 2040 ataaatgtaa tatgattata taaccaaaaa aaaattggtt tattgatctt ataacaatat 2100 taaataatat taagtgatac atcaactaat acgcgtgaat ccgtgattat acaagcacgt 2160 aatatattaa tcacttgagc cgctgtataa gctcgtgtga agtgggaatg ttacgtattg 2220 ccgtatgcgt tctcttgtga tacggtgggg gaaatgtgcc ctcgaaacga cggagcgcaa 2280 tcattcggat gcgcgagccg tatgaaggga tggcaaaaat tttccagacc tcgatatatc 2340 ggttatcgat aaatcgatac accgatatat cggatcagaa aatatcgata tatcgcgttt 2400 tttggatttt gaattttgag ttttttttta attactagat agcgtgtttt gacctactct 2460 cgctctcact tgtgccggcg tggttactgg ttaaatgata actatataaa ggctttccat 2520 acttattact atcagtgcgt agtattccct gtccctcttg atttcgggtc atgtttatac 2580 actggatagt gttgtagtct gtgaggctac atcaatcaca acggaattaa ttttttctta 2640 ctcaatttta ttgaataaat taccaatttt taaatgtttt tacttaacat cttagcataa 2700 tatatgccat agtataatat aatataatgt ccacaaaaaa catccattca gaaaaattat 2760 ttaagaagaa acgcaaatac gttcatggaa aaaattcatg gtagtaaaaa tatcgatatc 2820 cgtcaataat tatcgatatc acaaatatat cgaaactaaa atatcgatat ttttacaaga 2880 aatatcgata tattttttgc catccatacg gcccgtatgc gttctctcgt ggtacggag 2939 // ID Troyka-3-LTR_BF repbase; DNA; INV; 199 BP. XX AC . XX DT 29-APR-2008 (Rel. 13.04, Created) DT 29-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE LTR of the amphioxus Troyka-3_BF autonomous LTR retrotransposon - DE consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Troyka group; Troyka-3-LTR_BF; Troyka-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-199 RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., RA Salamov A., Terry A., Shapiro H., Lindquist E. et al.; RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire RT and genomic organization."; RL Science 317(5834), 86-94 (2007). XX RN [2] RP 1-199 RA Kapitonov V.V. and Jurka J.; RT "Troyka - a distinctive group of gypsy-like LTR retrotransposons RT inducing 3-bp target-site duplications."; RL Repbase Reports 8(4), 516-516 (2008). XX DR [2] (Consensus) XX CC The internal portion of Troyka-3_BF is not reconstructed. Solo CC Troyka-3-LTR_BF elements are flanked by 3-bp target-site CC duplications and are less than 2% divergent from their consensus CC sequence. XX SQ Sequence 199 BP; 49 A; 45 C; 46 G; 59 T; 0 other; cgccagatat gttatgtaat gttatgacgt cagcggtatg ctaatgaacc actactgtat 60 caccaatatg gtaatgaggg aagtgtcccg gatgttctcc gccatcttgt tgttaagcca 120 gttagaagta cattaaaccg agttgcgttc atctccacac tggagtcgtc cttgcggtcc 180 ctgtagttaa ctactgacg 199 // ID Gypsy-612_AA-LTR repbase; DNA; INV; 271 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-612_AA_; KW Ty3_gypsy_Ele47; Gypsy-612_AA-I; Gypsy-612_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-271 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 271 BP; 75 A; 60 C; 63 G; 72 T; 1 other; tgagttttac acaggaattt gattaagtgt acatcctgga attctactac ttctaaaagg 60 ctggtaaatt gaagtgtaga atccatctcc tctgaaaagc ctttatgcac aaacgtcaag 120 gtcaatgtgt gagtgtgttc gagttcctaa gcggttagag gtgaaactcg tggacgacga 180 ttgactcggc cacttggatt cagaccagcg tagcggacgg tccgacttat tcaacgctcc 240 cgtagttkgc aatacccgaa ccgactcaac a 271 // ID Copia-2_RP-LTR repbase; DNA; INV; 150 BP. XX AC ACPB02046047; XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Rhodnius prolixus genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_RP_; KW Copia-2_RP-I; Copia-2_RP-LTR. XX OS Rhodnius prolixus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Heteroptera; OC Panheteroptera; Cimicomorpha; Reduviidae; Triatominae; Rhodnius. XX RN [1] RP 1-150 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhodnius prolixus genome."; RL Direct Submission to RU (28-JAN-2011). XX DR Genome; ACPB02046047; Positions 31063 30914. XX SQ Sequence 150 BP; 34 A; 19 C; 19 G; 78 T; 0 other; tgaatttgta tgatcgctat gcatttgcac catgtgattt tttttttttt ttcttttaca 60 tatgtggtgt taatatcttt gtatatttta gttacattaa agttttctat tctagctgaa 120 cctttcgttt tttttttatg aaatccaaca 150 // ID BEL-38_AA-I repbase; DNA; INV; 6769 BP. XX AC supercont1.26; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-38_AA_; KW BEL-38_AA-LTR; BEL-38_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6769 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.26; Positions 1265005 1258237. XX CC Positions [5727-6305] - Integrase core CC 'TTATA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1374..6677 FT /product="BEL-38_AA-I_1p" FT /translation="MAPPQRKKPPTLKLLMVQLRNIQTSLDDIWRFVNDYQ FT PTTTANQVNVRLQKLDELWERYGETLVEVQTHDDFEDEDETFEKARIVYSD FT RYYHCKAFLMDRAKELEDPDEVEHSMRGNETLGYGHSLDHVRLPQIKLQVF FT NGDIDEWISFRDLFCSLIHRKTDLPEVEKFHYLKGCLQGEPKGLIDSLKIT FT RTNYQIAWDMLLKRYDNSKQLKKRQVQALFNLPSLSKESVADLHTLIDGFE FT KVVQNLDQVIEPEDYKDLLLVTLLTIRLDPVTRRSWEEYSSTKDQDTLADL FT SDFLHRRIRILEALPTKVADTRSSQPQPFRQKSSAVKASYGSVQSSGGRCV FT VCKEDHQLYQCSSFQRLSVRERDEVLRSNSLCRNCFRSGHLARDCSSKYFC FT RNCKGRHHTMVCFKQNKASSANFTTAVGNNNPPPQTESDEPSPSTSSQVVN FT MVATESTVSGTTQQFSSKVLLATAVVIVEDDEGSRFPARALLDSGSESNFI FT AERLSQRLRTHRERVDVSVFGIGHAATKVKHQVSAMVRSRVTAFSQNMSFL FT VLPKVAMDLPTAKVNTQGWTVPDGINLADPTFFNPGSVDMVLGIEHFFDFF FT GSGRRIPLGDQLPALNDSVFGWVVCGGLTNPTKSLRINCSTAATQGLEELV FT ARFWASEEVGVSRLLSLEERKCEESFRKSVRRESDGRYTVSLPKDEGAISK FT LGESRDIAFRRLQGTERRLARDAGLCEQYHEFMAEYIKLGHMTKAEETADD FT LKRCYLPHHPVIKEASITTKVRVVFDASCKTSSGVSLNDTLLVGPIVQEDL FT RSIMLRSRVRQIMLVADVEKMFRQIFITQVDRPLQCILWRFNPVDKVDVYE FT LDTVTYWTKPAPFLATRTLNQLSIDEGEHFPMAAKAIKEDTYMDDVITGAD FT TEEAAQKLCKQLIEMTSKGGFKLRKWVSNIPEVLDSVSEDCLAIRVSEGIN FT LDPDPTVKTLGLTWMPITDQLRFQFSIPNWNSTHQPTKRQVLSVIASLFDP FT LGLLGAAITTAKIIMQLLWRIRNEAGEALDWDQTLPQTVGEMWRKYYEELH FT LFNNIRIERCVMIPEAVLVELHCFSDASNKAFGGCIYIRSHDSEGRLLVRL FT LSSKSKVAPLKTQSIPRLELCGALLTAQLYEKVRDSIRVPVECYFWTDSTC FT VLRWIQAPSNVWNTFVANRVAKIQEIAEYSQWHHVSGKQNPADLISRGISP FT KDIVDNSFWWEGPDWLKESQDRWPVCSENLNVEEEEHERRRVAVCTVSANM FT EFNEWYMEKFASYSDLIRKTAIWLRFMEYLRTESHSRVPLRFISTEELRKA FT EFTIVRRVQQETFADEWKALSKGEAVSRKSPLRWFNPYISNDLVIRIGGRL FT EHSMEQDDAKHPMVLPARHRLTRMILEHYHLQLLHAGPQLLLGVVRLKFWP FT LGGRSVARNVVHQCLKCWRAKPSPAQQLMGELPAPRVTVSRPFSQTGVDYF FT GPFYVRPAPRRPVVKAYAAVFICLCTKAVHLELVSDLSTDRFLQALHRFTS FT RRGRCRDLYSDNGTNFVGAKNKMQDFLKLLWNSEHQERITKDCTEQGIQWH FT FNPPSAPHFGGLWEAAVRSAKNLLNKVVGETPLSPEDFNTLLIQVEGCLNS FT RPLTPMSDDPTDLEPLTPGHFLVGSSIQSIPEPNLEGIRLNRLSQYQLIQR FT MLQDFWRRWRREYLCQLQGRSKRWKPAINVEIGKLVVVRDDNLPPMKWRMG FT RIIQLHPGDDGVVRVVTLKTVAGSMKRPVEKLCFLPIPDEDSSRKSHE" XX SQ Sequence 6769 BP; 1806 A; 1461 C; 1722 G; 1780 T; 0 other; tttttggtcc ttcgagccgg atggtggacc ccgatgagga cttggtcagg acgatatcgg 60 ctggaaagga aacaatagcc tggggcaggc ttggtttggc gccatcgtgg ataggaaaac 120 tcgccatagc tgcgagtgaa taattgccat cagcacaacc tttgacaccg cgccatcgcg 180 acactgcacc acaaaggaca taaaggagga taaaaggatc ttcttccatc ggcttggctt 240 cgtttgggca tccatcgctt ggattgtttc cacagattgg ttggactgcc cgcactttga 300 ggattttgga ttggattcac atacgatacg atttgatgga ctggatttgg agcatctatt 360 ggaccgaagg atcgtttgaa ggctcggact actgatccag taccggattt acttctccag 420 gtaaatatac ctttgatcag tatatgtcca gcctttgcta gcgtatatgt gatactaatg 480 cccattttcg acgtgtttcc ttgttcaacc tactatcaac cgagccggag ggaattgcat 540 gttatccata ccgacgatac gaagctggat ccgatagacc gcaggtcagg tcaggcgtta 600 ttcaaccgct agaagcaact ggttcgagct tggccattcg tgaagggatt cggaggacgg 660 caagaagagg tggcatcaat ttatattttg tatggtgagt gctaagaatg tatatgtctc 720 cgaagcggga cttcagataa tacacatttt taaatgaaaa tttgaatcct cttcacatca 780 tttgccacgc aatccctgag caaaactgtt cgttatcaga acaatcgact cactctgctg 840 gaattttgca cgcttcggaa ttcttctgat cgatttggat ttggattggt gccgttgcta 900 gttccacggg ttagtttgaa cagtatatgt tctgccgaag cttctctttg aatactacat 960 gagtttctcc ctcttctgtg gatccctctg tgaaaaatca tttgcaaacg agctgactgt 1020 gtggtttttg gctttgatga taggacgagc tttggaatac aattagttgc agctgtttgt 1080 tctcacaggt gagctacaac agtatatgtc cgccgaagtc aaaatttgga tactacagct 1140 ggttttttac ccctctcctc ccacattcaa acggatacga gtcggaatgc agtaaacgtt 1200 gtatgacgtc atatcagtgg ttcctcaact gtggataagt gtagttgatc gtagcacatc 1260 cttctgttgc gtcaggtagg cttcaacagt atatgtccgc cgaagccttg gactttcgga 1320 tcactacact atccaattcg tatatttacc ggttcgagtg acgtaaacgc accatggcac 1380 caccccaaag aaagaagccg cctaccctca agctgttgat ggtccagctt aggaatattc 1440 aaacatcctt ggacgacatc tggagatttg tgaatgatta ccagccgact actactgcaa 1500 accaggtgaa cgttcgtttg caaaagcttg atgagttgtg ggagagatat ggcgagaccc 1560 tggtagaagt tcagactcat gacgattttg aggatgagga tgaaacattc gaaaaggcta 1620 ggatagttta cagtgatagg tactaccatt gtaaggcttt tttgatggat agggccaagg 1680 aactggagga tccagatgag gttgaacatt cgatgcgagg aaatgagacg ttaggatatg 1740 gccattccct tgaccacgta cggttgcccc aaatcaagct ccaggttttc aacggggaca 1800 ttgatgaatg gatcagcttt cgcgatctgt tttgttcact catccatcga aagacggatt 1860 taccggaggt ggagaagttc cactacctta agggatgtct tcaaggggaa ccaaagggcc 1920 tgattgattc cttgaagata accagaacga actatcaaat tgcgtgggac atgctgttaa 1980 aacggtatga caacagcaag caattgaaaa agcgacaggt gcaagctttg ttcaatttgc 2040 catcgttatc caaggaatcg gtagcggatc tgcatactct tatcgatggt ttcgaaaagg 2100 tcgttcaaaa tttggatcag gtgatcgaac cggaggatta caaggacctt ttgttagtca 2160 ccttgcttac aatacggttg gatccggtaa ctcgccgaag ctgggaggaa tattcctcaa 2220 ctaaggatca agatacgttg gcagatctct cggatttcct ccaccgcaga atccgaatct 2280 tagaagcact accaacgaag gttgcggaca ccaggagttc tcagccacaa ccgttcaggc 2340 agaagtcgtc tgctgtgaag gccagctacg gttctgtaca gtcgtctggg gggcgatgtg 2400 tggtgtgtaa ggaggatcat caactctacc agtgttcttc cttccaacgg ttgtcagtgc 2460 gagagagaga tgaggtgctg aggtcaaact cgctatgtag aaactgcttc agatcgggac 2520 atctagcgag ggattgttcg tcgaagtatt tctgtcgcaa ttgtaagggt cgacatcaca 2580 caatggtatg cttcaagcag aataaggcta gttcggcgaa ctttactact gctgttggga 2640 acaataaccc tcctccacaa acagaatcgg atgaaccatc tccttctaca tcctcccagg 2700 tggtcaacat ggtagccacc gaatcaacag tctctggtac tactcaacag ttctcttcta 2760 aggtgttatt ggctactgcg gtcgttatcg tggaagatga cgagggtagt cgatttccgg 2820 ctcgtgcttt gttggattcc ggttccgaaa gcaattttat cgcagaacgg ttgagtcaac 2880 gtctcaggac acaccgagaa agggtagatg tatcggtatt tggtattggt catgcagcaa 2940 cgaaggtgaa gcatcaggtt tcggcaatgg tacgttcgcg agttacggct ttctcgcaaa 3000 acatgagttt cctcgttcta ccaaaagtcg cgatggattt acctacggca aaggtgaata 3060 ctcaaggatg gacggtgccc gatgggatca atttggcaga tcccaccttt ttcaatccag 3120 gatcagtgga tatggtgttg ggcatcgaac acttttttga cttcttcgga tctggccgaa 3180 gaattcccct cggagatcaa ctaccagcac taaacgactc tgtgttcggt tgggtggtat 3240 gcggaggctt gacgaatcct actaaaagcc ttcgcatcaa ttgtagtacg gcagctacac 3300 aaggattgga ggaactagtc gcacggttct gggctagtga agaggttggc gtttctaggc 3360 tgctttcgct ggaggaaagg aagtgtgagg agagcttccg gaaatcggtt cgtagggaat 3420 ccgatggtcg atacactgtt tcgttgccga aggatgaagg cgcaatttcc aagttgggcg 3480 agtcacggga catcgcattc cggcgacttc aaggcacgga acgcagattg gcgagggatg 3540 ctggtctgtg cgaacagtat catgagttca tggcagagta tatcaaactg ggacacatga 3600 cgaaagcaga agaaacggcc gatgatttga aacggtgcta cttgccccat catccggtca 3660 ttaaggaagc gagtataacc accaaggtcc gcgtggtatt cgatgcgtcg tgcaagacgt 3720 cgtccggcgt ttcgctaaac gatactctgc tggtgggacc gattgtgcag gaggacttgc 3780 gatcgattat gcttcggagt cgcgtaaggc aaatcatgct ggtagccgat gtggaaaaga 3840 tgtttcggca aattttcatc actcaagtgg acagaccgct gcaatgtatt ctgtggcgtt 3900 tcaatccggt agacaaggtg gacgtatatg aactggatac tgtcacgtac tggacgaagc 3960 ccgcaccgtt tctggcaacg cgtacgttaa atcaactctc gattgatgaa ggagaacatt 4020 ttccaatggc tgctaaggcg atcaaggagg atacatatat ggatgatgtt atcacaggag 4080 cggatacgga ggaagctgca caaaaactat gcaagcaact catcgagatg acatcaaagg 4140 gaggcttcaa actccggaaa tgggtgtcaa atattccgga agtattggat agcgtttctg 4200 aggactgtct agcgattcga gtatcggaag ggatcaatct cgaccctgat ccaactgtaa 4260 aaactctggg cttaacatgg atgccaatca ctgatcagct aaggtttcag ttctctatcc 4320 cgaattggaa ttcaacccac cagccaacaa aacggcaagt tctgtcggta atcgccagtt 4380 tattcgatcc tttaggacta ctaggcgctg caataactac agccaagatc attatgcagc 4440 ttttgtggag gatccggaac gaagctggtg aagcattgga ctgggaccaa acactacctc 4500 agacggtggg tgagatgtgg agaaaatatt acgaagagct gcatctgttc aacaatattc 4560 gtatcgaacg atgtgtcatg atcccggaag cagttttggt tgagcttcat tgcttttctg 4620 acgcttcaaa taaggcattc ggtggatgta tctacatacg tagtcatgat tcggaaggga 4680 gacttctagt tcggctgttg tcctccaaat ctaaggtggc gccgcttaag acgcagtcaa 4740 ttcccaggtt ggagctgtgc ggggctttac tgacggcgca gctatatgag aaggttcgag 4800 attccattag agttcccgtt gagtgttatt tctggactga ttcgacgtgc gttctacgat 4860 ggatacaagc cccttcaaac gtttggaaca cttttgtggc gaaccgggtc gccaaaatcc 4920 aagaaattgc ggaatattcg caatggcatc atgtatcagg caaacaaaat ccggcggacc 4980 tgatctccag aggaatttct ccaaaggaca tcgttgataa ctccttctgg tgggaaggac 5040 cagactggtt gaaggaaagt caagatcgtt ggcccgtatg ttctgaaaat ttaaatgtag 5100 aggaagaaga gcatgagaga cgacgagtgg ctgtatgcac ggtgtcggcc aacatggaat 5160 ttaacgagtg gtacatggaa aagttcgctt cgtattctga tctcatacga aaaacagcaa 5220 tttggttgcg gttcatggaa tatctgcgaa ctgagtccca tagcagagta cctttaaggt 5280 ttatatcaac ggaagaactt aggaaggcag agttcactat tgttcgtcgt gtccaacagg 5340 agactttcgc ggatgaatgg aaggcacttt ctaagggaga agcggtgtct agaaaatctc 5400 cacttcgatg gtttaaccct tatatttcca acgatttggt tattcgcatt ggaggacgtc 5460 tagaacattc catggaacaa gatgacgcaa aacatccgat ggttcttcct gcaaggcata 5520 ggcttactcg tatgatcttg gagcactatc acctacagct tttacacgca ggaccacaat 5580 tgctgcttgg agttgttcgt cttaaatttt ggccgttggg cggacgaagc gttgctagaa 5640 acgtcgtcca tcaatgttta aaatgttggc gtgcaaaacc ctcaccagca cagcagctca 5700 tgggagagct accagcaccg agagtcaccg tgtcaaggcc attttcgcag actggcgtcg 5760 actatttcgg accattttat gtaagaccag ctccaagacg acccgtggtg aaggcgtacg 5820 cggccgtttt tatctgcctt tgtacaaaag cagtacactt agaattggtc tccgatctga 5880 gtactgatcg gttcctgcaa gccctccatc gtttcacttc aagaagagga aggtgtcgag 5940 acctctattc cgacaacgga acaaatttcg ttggagccaa gaataaaatg caggattttt 6000 taaaactgtt gtggaattca gagcatcaag aacggattac caaagattgc actgaacagg 6060 gaatccaatg gcatttcaat ccgcccagcg ctccacattt cggaggactg tgggaagccg 6120 ctgtccggtc agccaagaat cttttgaaca aggtcgttgg agaaacacca ctctcgcctg 6180 aagacttcaa tactcttctt atacaagttg agggatgtct taactcgaga ccgctaaccc 6240 caatgtccga cgaccccaca gatttagaac cgttgacgcc tggacatttt ctagttggct 6300 cttctatcca atctattcct gaaccaaatc tggaaggtat acggttgaac cgtttgagcc 6360 aatatcagtt gatccagcga atgttgcagg atttttggag aagatggcgt agagagtatc 6420 tttgccaatt acagggtaga agcaaacgtt ggaaaccagc tatcaatgta gaaattggca 6480 aattggttgt tgtgcgagat gataatctac caccaatgaa gtggagaatg ggtcgaatca 6540 ttcagctgca tcccggagat gatggagtag tacgagtcgt aacattaaaa accgttgcag 6600 gatcgatgaa gcggcccgtt gaaaagctct gttttctgcc aataccggat gaagattcgt 6660 ccaggaaatc acacgaatag tcagcatact atccccttcc ttccttcctt tcccttcaga 6720 agaggattct tttcttttca gaaagttagc atttctgggt gggtgagga 6769 // ID L1-N3_CQ repbase; DNA; INV; 1402 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A HAL1-like non-LTR retrotransposon family from Culex DE quinquefasciatus - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; nonautonomous; KW L1-N3_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-1402 RA Kojima K.K. and Jurka J.; RT "HAL1-like non-autonomous non-LTR retrotransposons from the RT southern house mosquito."; RL Repbase Reports 11(1), 102-102 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 4 sequences with >98% CC identity. CC This family encodes a protein similar to ORF1ps of L1 in CC mosquitoes. Thus it is likely a HAL1-type element. XX FH Key Location/Qualifiers FT CDS 157..1224 FT /product="L1-N3_CQ_1p" FT /translation="MSQVQTRENTFKVLLSNFPKRPTFEELHKFVHVNLAL FT RPEQVKRLQINHAQNCAHVKCSELKIAQDIVAMHNEKHELVINKTKVKVRL FT MMEDGGVEVKIHDLSENVTDDEIVAFLRHYGDVLSIQETVWGEKFAFRGVS FT SGIRIAKMVLRRHIKSFVTIRNELTLVTYTGQPATCRHCTLTQHIGMSCVE FT NKKLLGQKAXLSERLKSAQTTAPQSYAGVVNEGTSTTVLQTQTAPPQTNPI FT DCISISGTDTATILPNHSTXVTNELSGIGTGDDNEHQIENSRNEESNDEEM FT PVSLTPANMILENNFKVPFPIRDPKTSPSLDTISESESDISPGGSAPNSGR FT RGPGRPKKKKTRH" XX SQ Sequence 1402 BP; 435 A; 342 C; 301 G; 317 T; 7 other; tcagttcgca tcgcactctt gatgcgacca gtcgcataaa ccagagttcc tttgttcaag 60 tgttttttgc tttcttttcc acaccatcgg gtgtggaatc gagttttcgc gtcaaatcat 120 ctaattgtga ttaatttgtg cttcgccatc acaactatgt ctcaagtgca aactcgtgag 180 aacaccttca aggtgcttct atcgaacttc ccaaaacgcc caacatttga agagctgcac 240 aagtttgtgc acgttaatct ggccttgagg ccagagcaag tgaagaggct ccagatcaac 300 catgcgcaaa actgcgcaca tgttaagtgc agtgagttga aaatcgctca agacattgtt 360 gctatgcaca acgaaaagca cgagctcgtg attaacaaaa ccaaggtcaa agttcggttg 420 atgatggagg acgggggagt ggaggtcaaa atccacgatc tctccgaaaa cgtcaccgat 480 gacgaaattg tggcctttct tcggcactac ggtgatgtct tgagcatcca agaaacagtc 540 tggggagaaa aattcgcatt ccgcggcgta tcgtcaggaa tccggattgc aaagatggtg 600 cttcgccgcc atattaagtc gttcgttaca atcagaaacg aactaacgct ggtcacctac 660 acaggtcaac cagcgacgtg cagacactgc acactaactc aacacatagg gatgtcatgt 720 gttgagaaca aaaaacttct cggccagaag gcckacctca gcgagagact gaaaagcgca 780 cagacaactg caccacaaag ctacgctggt gtggtgaacg aaggtacctc caccacggtt 840 ttgcaaaccc aaactgcacc accacaaaca aatccaatcg actgcatcag tatctcgggg 900 accgacaccg cgacgatctt gccgaatcac tcaactgakg tcacgaacga gctcagcggw 960 atcggcactg gagacgacaa cgaacatcaa atcgagaact cacgaaacga agagagcaat 1020 gatgaagaga tgccagtctc tttgacgcct gctaacatga tcttggaaaa caatttcaag 1080 gtcccttttc ccatccggga cccgaagaca tcgccatcwt tggacaccat ctcagagagc 1140 gaaagtgata tctctcctgg aggctctgcg cccaattcgg gccgacgagg acccggccgc 1200 ccgaaaaaga agaagacgcg ccattgaagc tcaaagatag aagaagaaca aactctaata 1260 atatttattg taaacgataa acaatttttg gtatagctgt tcaaataatc tgtatttmtc 1320 gacaaaactt atgcttttct atgtaaacca tgtacaaawt atwatgctct aaataaactt 1380 tttttttaca aaaaaaaaaa aa 1402 // ID Harbinger-5_BF repbase; DNA; INV; 5285 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 02-JUN-2011 (Rel. 16.05, Last updated, Version 2) XX DE Amphioxus Harbinger-5_BF autonomous DNA transposon - consensus. XX KW Harbinger; DNA transposon; Transposable Element; Harbinger-5_BF. XX NM Harbinger-5_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5285 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5285 RA Kapitonov V. and Jurka J.; RT "Harbinger-5_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 802-802 (2008). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 965..1867 FT /product="Harbinger-5_BF_1p" FT /translation="MYNFRVAHNTISLFIPEVCQAIVEEYKDEVITCPTTP FT EEWTPIEEVFRERWNVPHAVGAVDGKHVAIRKPGKSGSLYHNYKGFFSLVL FT MALVDGDYKFLWMDISSSGSMSDAQIFNDSELKDCLEDGSIGFPEPFPLPQ FT DSEPMPYFLLGDDAFGIRTFLMKPYSHRHLQRDERIFNYRISRGRRVVENA FT FGILVQRWRVLLTTIQPRPSVVQDIVESCVCLHNLMRIRYPSLHSGMMDEE FT DDQHNVVPGAWRQLAQLEDVEHVTGRNRDTKAGKRQRALLKHYFNNPVGAV FT PWQNRMIDG" XX SQ Sequence 5285 BP; 1425 A; 1191 C; 1253 G; 1416 T; 0 other; agggccttca cacggaaatc gaattagcac gaatcaagcc gaaccaggcc gaatgaagac 60 tttgcaaaat tcttgccgca ttcgagtggg aattcgaaat ggacctacat tcttgcagcc 120 ttatcagaat gtagtttgaa taaatagaat gcgcttcgaa cccaacttga atcattcttg 180 ctcattcttg ttgtatgaac tctcgaattc agttcgaata tagttggaac ataatacgaa 240 tatttagaat gcagttggaa tgcaataaga gtatttagaa tgcacttcga atgctgtacg 300 agtgccgttc gattgatggc cgaatatggc ctgaaatctg gtaggaaggt tcctcgagtg 360 ttgtacgaat ttatttcgaa tgcagtcaga atcctcccgg aaaacttcga attatgttcc 420 gaacgttcag gaatgtaacc caaatgcaat tggatgtctt aaaatgaact tcgaatgcag 480 ttattgctag acgagtgcat cacgaatgcc accggaatgg tacctgatag ggtataaaat 540 gtgggtgaat ctagcatgtt cctcagtttc atcccaacca ccataaagta tggaccccct 600 ggaccacgtt gcgttgcttc agattgctat cctccagcaa gacctgcttg agaaaaatga 660 cgagaagaca aggagacgaa agaggggaag accaagaaga tattggaccc gaccatggct 720 gtcagaggag agaaggaggc tgtatggtca tcactcaaaa ttgatggagg aactcagagt 780 agaagaccca tcgtccttct tcaactacct caggatggag cctgtgatgt tcgatgagtt 840 ggtgcaaaga gtggggccaa ggataaaaaa gcaggacacc aaaatgaggt aggcgctacc 900 tccaagcttg acgctagcca tcactgtgag gttcctagct acaggtgata acaaccccag 960 cctcatgtac aacttccgcg tggcacataa caccatcagc ctcttcatcc ctgaagtctg 1020 ccaagccatt gtggaagaat acaaggatga ggtcatcacc tgccccacca ctccagaaga 1080 atggacacca attgaggagg ttttcagaga aaggtggaat gttcctcatg ccgtgggagc 1140 tgtagatggc aagcatgtgg ccattaggaa acctggcaag agtggctcct tgtaccacaa 1200 ctacaagggc tttttctcgc tggtcctcat ggcccttgtc gatggcgatt acaaatttct 1260 gtggatggac atcagttcat ctgggagtat gtctgatgct cagatcttta atgactctga 1320 gctaaaggac tgcctcgaag acgggagcat aggatttccg gagccattcc ctttacccca 1380 ggatagcgag cctatgccat acttcctcct gggagatgat gcctttggga taaggacctt 1440 cttgatgaag ccatatagcc acagacatct tcagagggac gaacggatct tcaactacag 1500 gatctccagg ggccgacgcg tggtggagaa tgccttcggt atcctggtcc agagatggcg 1560 cgttctcctc acgaccatac aaccaaggcc ttccgttgta caggacatcg ttgaaagctg 1620 tgtctgcctg cacaacttga tgagaatacg ttatccttct ctgcacagtg gtatgatgga 1680 tgaggaagat gaccagcaca atgtggtgcc aggagcctgg agacaacttg ctcagcttga 1740 agatgtggag catgtcacag gaagaaacag ggacaccaag gctggaaaga ggcagcgggc 1800 gctcctcaag cattatttca acaatccagt aggagccgtg ccatggcaga accgaatgat 1860 tgatggttaa gagactgact agacgagaat ctaggacatg ctaacactgg atttttttta 1920 ataatttgtg atcatttatg aaagtaattt gtggctattt tcatttccat acagaataaa 1980 gagcatgggc ctcctgcatc aagtactggg ctttcttttg tacggtcggg gtataaaatc 2040 atgatcatag ggattttaat tacaatatgg ctctagcaaa gataagagta tatgcttacg 2100 actgatctgg gtccaggaaa agacatgtaa ttcatttcac caaattattt cattccctgc 2160 atttgcacat cacgttctcc aagaatgata ttgatgttcc acagtagcca cttaaaccat 2220 agcaagtata aagaaaaaca gattgcgaca agataacggt tttctcagta ttaacatctg 2280 gatccatgcc ttattaatat tcgtttgaat ggagttctac aaccataaaa aataaataga 2340 ctcaaataag actgaattct cattatatac gcaacatact agatgtacgg tacatgcagg 2400 tctgtatcct ttttcttgct gttctactgt tccaaaataa aattgtacgt accaaaagaa 2460 aacaaaaaca aggagaaaac atattcaaaa tgttacacaa atcaaactac tcatggctcc 2520 catattctga ctgttcagac tgttgggtaa gtgtgggaga gtgcacaact agacatggag 2580 tgttcagctg gtcatcatct tggttgcttt gggtgaagtt gttgcaggag acactgttga 2640 atccactcag gccagtaaag ttagccatgt ccttgctgtg gatgggtgtt gacgaaggtg 2700 gaggtgtcct cgccgtgtag gtgggtggag cagactgtcg cctgacaaga ggatgtggct 2760 gaaactgttg ctgcacccag ctggcctcct gggatcccca aacggaagtc ggttgtttga 2820 cctgtcctgg ccactgtgat ggatgaggct gccagctcat cggaagacac tgctggctgc 2880 tgggcactgg gataggtaca ggagggtggg atctaaggaa cccatcttgc tgctggtgca 2940 gagtctgcat ggtagactgc tggtgaagct ccaactggta cttctgcaac aagctgttga 3000 cttcaccctg aaacctcagc cacatgggcg gggggaacgt gtatgtagcc tccttcatga 3060 agtcagcaaa ggcgtccttc tggcgttcca tcggtgtctt ctcggccttg gagagcttgt 3120 tcagcagctg tcgttgtagt tcagcaatgt tgcccttggt aacctcagtg tcatcagtag 3180 gactggtctt gggaaaagtc tcttcatccg actgcgtagc tagaggtgga gaatggtggt 3240 catcagagtc atgattggga gctcttcctg ccgctgcagc tgcagccgcc gtggcttgga 3300 ccttggcctt catctaaaga ataaacaatc ataaatttgt cattccaacc atatagaaaa 3360 tgaagttcat ttctagtaga gtgttctata tacagtgata tccgcaattc ttttaacagc 3420 actcttaatg tatacacacg ttgtaaacag atataaaaag acgaactatt tgtttatata 3480 atgacagcca cagacgacac tgtcaacaaa aactatatgt gagttcgtat acaccagttt 3540 taagaataaa gaaagggcat gtaaaaaata gttatagtga tggcatagct ccagatgtgt 3600 tgtgtatcca actgttcgct tgcaacactt gttcgcttgt aacactatct agtctctagt 3660 attccatgta atactatcgc atcactgcat ttagcccatg agggcaagaa caatgaacag 3720 agatcttcat tagtcataat tagggtgaaa acatgcggaa agtatttaca atttcattac 3780 aagcaagcaa atgaaggttg gaaaatcaat gtaaaatact tacactgaca gtcaccctct 3840 tttttccatg gacaatcagg tatggcctga ggaattcgaa atggcgtaat atccaatggt 3900 cccttgctgt cagttccgtg ttgccagagc ccgagggagt aagtttgatc tttccaaagc 3960 gtgtccgcat gttagagtac cactttttga gctgactgct gctcttgccc atctcagtgg 4020 ctttgtgctc ccaaagtgcg tccttgagtc cccttttgat gtaattaggg tgcttcttat 4080 tgagtaggca gttctgctcc ggggcctgga gccactcaac catctcttgc tcctgctctt 4140 cgttgaaatc gacagtcacc agtactctcc tacgcttctt ctgagacttg gcatcctcac 4200 tactgggaga aacctcagca ttcatctctt cctcctgccc ctcagtgaaa tcatccgtca 4260 ccgtctctcc gtgtcgcttc ttctgagact cagtgagaga gatcaccatc catctcttcc 4320 tcttgctcct cagtgacatc tgtactcatc atctctccct tgcgcttctt ctgagactca 4380 gtgagagaaa gatcaccatc catatcttcc tcctgctcct cagtgacatc agttgtgacc 4440 atctttccct tgcgcttctt ccgagactca gtgagagaaa gatcaccatc actgtcagag 4500 gaaggctgtg catcttcctg atggacctcc gccacaatct caatctgaga agtcggggga 4560 gggcgagggt ctacaatttc agacactgga ggtgatggag tatcctcagc aacaggactt 4620 ggctcttcat gagcttcttt tactctagcc cctttacccc gtccacggcc acgggcagag 4680 gtggacttct tgactttggc tgtctttttc ctgggagcca tagcgtggtg ttgaccaggc 4740 gattcctcta tgccaactga cgttcaagtg acagctgctg tggcttttat gcgattcggc 4800 ggcatgtggc tggcggtgat acaattagtc cgaagtcatt cgtggaccaa tcgggacgtg 4860 ccgactgcat tccagccgcg ttctgatggc attcgaggtg cgttccagct aaattctttc 4920 aatatcgaaa tcttcgggca gtaatcgaga tgcaatcgag tcgcattcga aatggcttgc 4980 cgtttcgatt ctccatggaa tgagatcaga atgtctcgaa taatgttgga acgccgtaga 5040 atgcgttcag actgcatttc gaatacaatt cgaatgccat tcgatatttc tccacttcaa 5100 atgcagctcg aaagtttttg acatgtcaaa aactttcgag ccagccaagc gaacggggac 5160 gaatatctcg aatgcagtaa gaatgtttag aatacagtac gaattgcgag gaattgccac 5220 gaatggaaaa aaaatttcat tcggacggca ttccggctca ttcgtgctct cgtgtgaagg 5280 gggct 5285 // ID SMARN1 repbase; DNA; INV; 288 BP. XX AC . XX DT 24-SEP-2007 (Rel. 12.09, Created) DT 01-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE Consensus sequence of non-autonomous Mariner-type family of DE repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW SMARN1. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-288 RA Jurka J.; RT "SMARN1: Non-autonomous Mariner-type element from freshwater RT planarian (Schmidtea mediterranea)."; RL Repbase Reports 7(9), 998-998 (2007). XX DR [1] (Consensus) XX SQ Sequence 288 BP; 116 A; 36 C; 43 G; 93 T; 0 other; taccgtatat actcgtgtat aagtcgactc gagtataagt cgaattgaaa atttaaagcg 60 tgtttttagt ccaaaattgg aaacccgttt ataagtcgaa gtaaaaaatt tttaattgaa 120 agtaattaat ttgattaaaa gcaatacgat taaaaaactc attctccaaa aagaattttt 180 tggaaaaaat aaatatgatt tttgtatgac ccgtgtataa gtcgaaatga aaaaaagtcc 240 tcaaatttag tacaaaattt ctcgacttat acacgagtat atacggta 288 // ID BEL-632_AA-LTR repbase; DNA; INV; 580 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-632_AA_; KW BEL-632_AA-I; Pao_Bel_Ele55; BEL-632_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-580 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 580 BP; 154 A; 94 C; 149 G; 183 T; 0 other; tgttgcggcc cactgttcac aattgcaacg ccgcggccca ctgttgcgaa agagcacaac 60 cgtgaacgtg ttctgttttg ttgttgttgt gcagcataaa tgttgattaa cggagagtgc 120 aaaaggaggg agaaaacagc catcgcagct gatcggtgat caattagaaa ggcaactgcg 180 gtggaagatg gtgttttctc catcaagaat tagtgtttgc cagtgaagtt tcctgtaatt 240 tgtttgttct attgtgatcg tttaacatag aatatcgtcg agctgtagcc aagtagtgac 300 gttagttatt cgtcaccgtt gtttaccgta agattacgtt tgttcaatgt gtgtcgtatg 360 tgctaatgtt gattattgta aataggatta ccctgcacac ccggaggtag agtaagagga 420 gcaattgtgg ccgaagggaa aaaactagat tgtaagtcgt ttgtatttaa ttgtttgtaa 480 taaggtttaa cgtaaatgtt gtaaatatat ttctagttga gtttgttgcg agccgacata 540 ctgtgagatc gtttttaccc gcatccgaag ggtgcgaaca 580 // ID Gypsy-6_OD-LTR repbase; DNA; INV; 192 BP. XX AC CABV01000158; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_OD_; KW Gypsy-6_OD-I; Gypsy-6_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01000158; Positions 51948 51757. XX SQ Sequence 192 BP; 55 A; 42 C; 40 G; 55 T; 0 other; tgtgaggatc gaataaagtc ggttgcagtc ttatcaagat ctgcgccgcg ctctcttgca 60 cttgtacatt ttctgttcag tctagcacaa ttacaagtgc gaataaaagc tacaattaat 120 cggaactctt ccgtgttttt attctagagg attatagaag caggatcagc cgacagagac 180 tcgaccctaa ca 192 // ID TTAA4B_AP repbase; DNA; INV; 430 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; TTAA4B_AP. XX NM TTAA4B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-430 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2072-2072 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 430 BP; 155 A; 73 C; 77 G; 125 T; 0 other; gaggacgcta cacccgcatg tgttgtctcc gtcttacaaa tgtaaaacat agcaaaaact 60 gttttgcgcg ggaaagaacc gagaaggcta ctagtcataa ctaagaaacg aaattttcac 120 cacataaaaa ggagaacttt ggctgtgaaa tatcgttttt attttttcga tatcggaact 180 acaaaaaaag ttattcaagt ttgaaaaaca caaaaaataa tggattttga aacaataaga 240 acttttttcg tataagtgat atcaaaaaaa taaaaacgat atttcacaga taatgttctg 300 aactatattg tatatcaatt tggagtctta actatcacta cagcctgagt ggttctatcc 360 cgcgcaaaac agtttttgct atgttgtaca tttgtaagac ggagacaaca catgcgggtg 420 tagcgtcctc 430 // ID hAT-43_HM repbase; DNA; INV; 5569 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 14-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-43_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5569 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2031-2031 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1200..1868,1783..2937,2796..4400) FT /product="hAT-43_HM_1p" FT /translation="MYSSASLFFVADNTLMDVLRVLTSLEIYIHADYYCKH FT SLISKIFSDYSSHFINLKSLLCMSVFHSALDSGFENNNIIKAEAIANCEST FT EKWAGFLCLLGLSSVLSCNIVSCYPDFGVEKYKIFFNQEISPLTPIKFCSP FT FYILFCYDGNFQSGTFQHNHYVPLIFIPKKRKLSSKSKTIISKKLLPVLSP FT NKFLDTKQTIDQVATIVPSTEAKNPNKKPYILSIQNRPLIKLLQLYPVQKL FT KIQIKSLTFFHKTNPDLEPTSAVLQNTSCILKSNSFSSQLAVSSSQSTTSF FT QSLTLNKDVSSHSKTSKDSVLSKSFFTKSSSIRKVPNFYNNNIECPSNIIL FT PPKEVLNALQSFEQNDVAFFRQKVEFLSDHEIFLLTKNMFIPNEDYIFPKN FT SGRQFRYLWLKEFSWLRYSKSTDGAFCLSCVLFGNKFKLRSRKLTHLYLEP FT FNKWSDAIRSFSNHESTKNGLHAFTMPVFNIFLSHSSGISQPINVIIDTNV FT KEKIIKNRQILRPIVDAIIFCGQTNTPLRGHRDDSQYLPEAGEYSKCGTGC FT FNKLLNFAVRNGNDVLGSILITVQRMPPTFRRLHKMKLLNAVVKKLVNLYY FT LKFAKMFFFQCFRLHLNNCSKNASYISKTSQNEIIKCCGEEISESILSEVR FT KNVFFSIIADEACDSSTKEQMSLVLRFVDSDFNIREDFIQFIHCSEGVKGK FT DLFNVLLNCVSNLNLDIKNCRGQGYDGASSVSGYINXLSAQVLNINSKALY FT THCHSHXLNLSVCKSCNVQLVSEVFNKVRELSYFFNYSENRQKFLEASILE FT REPQTHKKKLKDICRTRWIERIDGLNTFLEHYLSIFHALCIMASLESSVNK FT DTQNKSSTFLNSIGTFQFVFTLVVTTRVFDFTLPVTRLLQSKTIDILDGLY FT LITALKSTFISIRNDIDSFHNNCYEAACLLSSKGEIFVLKPRTCSIQKNRS FT NVPSESVSEYFKRAVTIPLIDHVSTSISTRFKSETVAAYKGLSIILFNIIS FT FLKRPKTQLCWRKQFETFCDFYIDDFPNISALNGELVLWEKYWESFSGTIP FT DNISLTLKSISFPGFENIKIALRILGTLPVTSCECERSFSVMRRLKDYMRT FT TMSEDRLNGLALMYIHQEIVPCIDNVINRFSIKK*" XX SQ Sequence 5569 BP; 1836 A; 861 C; 835 G; 2033 T; 4 other; caggggcgga cttaaacccc gtcaagtgct gcaagtgacg gggtaaaaaa acaaaaataa 60 atataagtat tatgacgtct aacgcccgcc taaagttaat tgccgatcta tatatttttg 120 ttagaaaata taattagaaa aagtttacag tgaaaactat gctttgcaga cgtttgaaaa 180 agtgttttat aaaaattgct aatgtaaact atactttgca gggtattaga aaattattta 240 taaaacttgc taatgtaaac tatactttgc aggatatgag aaaattattt tataaaactt 300 gctaatgtaa actctacttt gcaggatatg agaaaattgg ataagttcat tggtttacgc 360 tattaaaagt agcaatttga ttcgttaaaa aaaaaaaaca tgtttagtgt aaactatcct 420 ttgcagcatt tgcagcagta atttgattgg ttgcaaagat cgataaacaa ctcaaaagaa 480 ctcaaggaac tcaaagaact ccgttctaaa aacaaagtat ttccttaaaa ttttaacgta 540 taagcatggc ttacaacttg attgatgatg tacttaaaag atttaaaacc gataatcttg 600 tcgatttaaa tactttttat gaaactttat tggatacatt tatttacaac aataacaaaa 660 tggttagtaa cgaaggtctt ttaagtcgtt ttattgatgg cacaataaga agaaatatat 720 gtgccgaagt ttgtatccct tcttacttaa aagacaataa aatgcattgt attgctctaa 780 ggtaagtcaa tttacttttt ttttcaaaaa taattatctt attgtatttt attacaaaat 840 ttaatacttt tttgtaaaaa tatttattgt taataccgta ttataaaaat atctaatcta 900 taaatattta ttgttaaaat ctcgttgact aatcaatatt taataagtgt caacaccgtc 960 aagtgatttt gacatgtatg aatataatat gttcatgatg tatgtattaa actcctacat 1020 gtcaaaatcg tttgttggtt tgtctgaatt cttgactatt gatttgattt atgaatttgt 1080 ttaaaaagca aagatttttt actcaatttt aattttattg ataaaagaag tttagctctt 1140 aaaaacatga agtaaatata tatatttttt tcttaaggtc tgaagcatct ggaagttgta 1200 tgtacagctc tgcatcatta ttttttgttg ctgacaatac actgatggat gtgttaagag 1260 ttttaacatc ccttgaaatt tatatacatg ctgattatta ctgtaagcat tctttaattt 1320 caaaaatatt ttctgattat agttctcatt ttattaatct caaaagtttg ttgtgtatgt 1380 cagtatttca ttctgcatta gattctggct tcgaaaataa taatattatt aaagctgaag 1440 caattgcaaa ctgtgagagc acagagaaat gggctggttt tttatgtttg ttaggtttat 1500 catcggttct ttcttgtaat attgtttcct gttacccaga tttcggtgtt gaaaagtata 1560 aaatattttt taaccaagag atctcaccac ttactcccat taaattttgt tccccatttt 1620 acattttgtt ttgttatgat gggaattttc agtcaggtac atttcagcat aatcactatg 1680 tcccactaat ttttattcct aaaaagcgta aactttccag taaatcaaaa acaataattt 1740 cgaaaaaatt gctgccagtc ttgtcaccta ataaatttct agatacaaaa cagaccattg 1800 atcaagttgc tacaattgta cccagtacag aagctaaaaa tccaaataaa aagccttaca 1860 ttctttcata aaacaaaccc agacttagaa cctactagtg cagtcttaca aaatacatct 1920 tgtattctaa aatcaaatag tttttcttct caattagcag tgtcttcttc tcaatccaca 1980 acatcttttc aatcattaac attaaataaa gatgtttctt ctcactcaaa aacatctaag 2040 gacagtgttc tatcaaaatc tttttttaca aaatcttcaa gcatcaggaa agtacctaat 2100 ttttataata ataatattga atgcccttcg aacattattt taccacctaa agaagtacta 2160 aatgctctcc agtcttttga gcaaaatgat gttgctttct ttagacaaaa agttgaattt 2220 ttatctgacc atgaaatttt tttattaact aaaaatatgt ttattcccaa tgaagattac 2280 attttcccaa aaaattctgg ccgtcagttt agatatcttt ggttgaaaga gttttcttgg 2340 ctgagatatt ctaaaagtac tgatggggca ttttgtttat cttgtgtatt atttggcaat 2400 aaattcaaac taagatcacg taagttaaca catttgtatt tggagccttt caataaatgg 2460 tctgatgcta ttagatcatt ttcaaatcat gagtcaacta aaaatggttt acatgcattt 2520 actatgcctg tttttaatat ttttttgagt cattcctcag gcataagtca accaataaat 2580 gtcatcattg atactaatgt aaaagaaaaa ataataaaaa atcgccaaat acttcgtcct 2640 attgttgatg ctataatttt ttgcggtcaa accaatacac ctttgagagg ccacagagat 2700 gattcacaat atcttccaga ggctggagag tattcgaaat gtggaactgg ttgttttaat 2760 aaacttttaa attttgctgt acgcaatgga aatgatgttt taggctccat cttaataact 2820 gttcaaagaa tgcctcctac atttcgaaga cttcacaaaa tgaaattatt aaatgctgtg 2880 gtgaagaaat tagtgaatct atactatctg aagttcgcaa aaatgttttt ttttcaataa 2940 ttgcagatga agcctgtgac tcatctacta aagagcaaat gtcattagtt ttacgatttg 3000 ttgatagtga tttcaatatc agagaagact ttattcaatt tattcattgt agtgaaggag 3060 ttaaaggaaa agatttgttt aatgttcttt taaattgtgt aagcaactta aatctagata 3120 taaaaaactg taggggtcag ggatatgatg gagccagttc tgtctctggg tatataaatg 3180 rtctttctgc tcaggttctc aatataaatt caaaggcttt gtatacacat tgtcatagcc 3240 atsractgaa cttatcagtt tgtaaatcgt gtaatgtcca attagtttca gaggttttta 3300 ataaagttcg cgaactttct tattttttta attattcaga aaatagacaa aagtttttag 3360 aagctagcat tttagagcga gaacctcaaa cgcataaaaa aaaattaaaa gatatctgta 3420 gaactagatg gattgagcgt atagatggtt taaacacttt tcttgagcat tatttatcta 3480 ttttccatgc tttatgtatc atggcttccc ttgagagttc tgttaacaaa gacacacaaa 3540 acaaatcttc tactttttta aattccattg gcacttttca attcgtcttt actttagttg 3600 tgacaacacg tgtctttgat tttactttac ctgttactcg gttactgcaa tcaaaaacta 3660 ttgatatctt agacggtttg taccttatta cagctcttaa aagtacattt atttccatca 3720 gaaacgatat agatagtttt cataataact gctatgaagc agcatgtttg ctttcaagta 3780 aaggtgaaat ttttgtttta aaaccaagaa cctgttctat tcaaaaaaat cgttctaatg 3840 ttccatcaga gtctgtttca gaatacttca aaagagctgt tactatccct cttatagatc 3900 atgtatctac atctatatct acaagattta aatcagaaac tgtagctgca tataaagggt 3960 tatcaattat tctatttaat attatttctt ttcttaaaag acccaaaacc caattgtgtt 4020 ggaggaagca gtttgaaaca ttctgtgatt tttatataga tgatttccct aatatcagtg 4080 ccttgaatgg ggaactagtt ctttgggaaa aatactggga aagtttctct gggactattc 4140 ctgacaatat atctttaact cttaaatcga tatcatttcc tggatttgaa aatattaaga 4200 tagctttaag gatacttggg acacttcctg ttacttcatg tgagtgtgag aggagttttt 4260 ctgttatgcg tcggttaaaa gactatatga gaacaacaat gtcagaagat cgacttaacg 4320 gtttggcatt aatgtatatc catcaagaaa ttgttccttg tattgataat gttatcaata 4380 gattttctat aaaaaaatag acgacttgat ttcaatgatt agtgtctata ctgtttattc 4440 tataacacac caacaacttt tgaaattttt gtctattatc acctctcttc atataacagc 4500 taacaaatag gcatggaaac ccctggaaag gccaagactt caatagtgtc cttaacagga 4560 cgggttaaat gtaaccccat acttttgttt gggattagct atatgattag tgatcatagt 4620 tcaggcaaac attaaccatc tctttttgac cattggnatg acaatctctt tatatcattt 4680 tctagcaaca aaataaaaaa ttaaaaaaat tattttcaat aaattttctc tcaatcaacc 4740 taataatatt cataagtttt tgtgccattt aaatcagttt tttgctgtag ttgtcaaaac 4800 cattagcttt ctttttattt catcaaaaat gatcttcttt tatgttgtta aagcttaact 4860 tggtatgtct ttttttttta atcttgttat ataagaatat atataacttt ttatttcacg 4920 atgtatataa atccatttgt atttttgtct ttctcttttc agttcttaat ttttctttct 4980 ctgattgtcc atgtccggaa taaatatgtt cctagacatg ctgatgagcc ctgatattgg 5040 gcgaaacaat catttgttgg tttatatatt tattatatat ttatatagaa aaaaagtttc 5100 aaagtaaaac taagatatga gcttctcaat gaaatagcaa aaagaaaatc cctttaaatc 5160 ctccagccca agggcaggag gatttagggg atacataggg ggcaggctat gggagcgatg 5220 gcctccctta gatgtttttt ttataagaac attttgtaaa aacgtaagta aatttggtag 5280 gctgtttaaa agtggcataa aattgtgtag tctaaattct tttttttttt gggcttgcct 5340 gcttaaaaat cgttaaacct gtaaatcacc ccctttaatc gtttggatcg aacgaaaaga 5400 aaatttgccc acccctcttt actacatagt ccggatccgc ccttacttat tccccctccc 5460 ccccccccca ttttttctat ttgctttgtg ggcgtagttg cgtcacttac aacatggcgt 5520 aaggcgcatt tttttgacgg ggtaaattaa aaacctgcgt ccgcccctg 5569 // ID CR1_Ele38 repbase; DNA; INV; 5019 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele38. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5019 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5019 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 19 sequences with >98% identity, and ~99% identical to the CC original sequence in [1]. XX FH Key Location/Qualifiers FT CDS 280..1140 FT /product="CR1_Ele38_1p" FT /translation="MSDACDHCAKPVRNDDEYISCMAFCERMVHIRCSVTK FT LNKPFVKIIQESPNLIWMCDECVKLMKIARFKSVVSSFGEAIQSITEKQES FT VHAEIRKELAKQGQQIALLSKRMTPSSPFLREPGSSFQQPPSKRRRDEEFN FT FKQVATKPLLGGTKDTTTASILTVPEPEELFWLYLSRVHPNVKPEAIEKLA FT KDCLQCDKPIKAIPLIKRGTDTSRLSFISYKIGIDPKFRDAALSPDTWPKG FT VLFREFEDQSSKNMWLPRLHTPTITVSPEIGASQFSTPTSVVNLAN" FT CDS 1155..4889 FT /product="CR1_Ele38_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MQRSNVATSVTKCRKPERFACRIKGALDPLDTVAPAC FT SCHRSRSGPVVEIGDGVSQPSISGKYACFCSNSLPDQLQFFSFPSDLVDHN FT NIDTGTQYSIQLNPNNQSTPTTTEHQVERSFTSSSQPGRSAASILEVSGSS FT SSVRPFPASVLHSRPGPVARCGSEIFQSALLGKYTQVSAFPMPDAVPNFSI FT LDSTDASDASSTLTHGHPERTVESHMEALNPSDAVQPPAVVVLHSRSGPAV FT GSGARVFQPLNNGEYSTYVSSQSMPDAHVPSRNIGVSKVRRKEDIVIYYQN FT VGGMNSRVNDYRMAVSDSCFDIIVLTETWLDSRTLSSQVFGTDYEVFRCDR FT NPNNSRKLTGGGVLVAVRSRLKAKLIEKESWKCLEQVWVSIKLGDRSLLLC FT TLYIAPDRVRDNELIEAHCDSVITAMESANPVDDVFILGDFNLPGISWLSS FT QNGFLYPDPVHSSMHDCAVSLLDSYSAATLAQINHVVNENNRSLDLCFVSV FT QDQAPFIGAAPCELVKAVPHHPPLIIAVENNQAHDFDNCPAGVSYDFSKAD FT HRSIAAVLADINWDNVLDLRDINTSAQTFSHILTYVIDRHVPKKVVHHDSR FT PPWQTSELRKLKSVKRAALRRYTKYRTPSLRSYYLRVNYEYQRVSRHCFLQ FT YQRSIELKFKRKPKSFWKYVNEQRKESGFPSSMEWNGKSSSCLQEICQFFA FT SKFSSVFSGESISNDQVICAANNVPMNGQALGSLDLDINAVSRAASQLKSS FT NHPGPDGVPSAFLKEHIGNLLAPLCCLFRLSLSSGIFPSCWKVANMFPVHK FT KGRKRDVDNYRGITSLSAVSKLFELVVMEPLLSHCKHLLDSDQHGFIAGRS FT TTTNLLCFTTFITDSMIARVQTDAIYTDLSAAFDKLNHDIAIAKLDRFGVC FT GNLLSWFGSYLTNRQLRVVIGDCSSDSFHATSGIPQGSHLGPLIFLLYFND FT VHLVIKGPRLSYADDLKIFLRICSVADCLYLQQQLDSFASWCSLNGMVVNP FT TKCSIITFSRKKQPITFDYSLLGTKIERVNHIKDLGVFLDSQLTYKQHISY FT TVSKASTTLGFIFRIAKNFSDIYCLKSLYCSLVRSTLEYGSAVWSPIYNNG FT AERIESVQRRFLRFALRKLPWRDPFRLPSYESRCRLIDLDLLRNRRDTFRA FT LTIADVLLGRIDCGTILERVHLNARPRSLRNSVMLRLPLTRTNYGLHGALS FT GLQRVFNKVASLFDFNTTREMLRRRFLSFFDERRV" XX SQ Sequence 5019 BP; 1295 A; 1243 C; 1086 G; 1394 T; 1 other; ttgtttacat cgttgcttgt cggatcgttc tgtttctcgc accgtaatcg aacagttttt 60 cccgtgtaaa cagtgtattt ttcacctgtg tttagtttat tttcgttaat tatcgatctg 120 tgctctccgg catatctatt attstggtgt tctcagtgac aatacgtaga aaacgtcgat 180 agcataatcg tctcgcgaac aacggcggat tattttcctt cacgcttaca gtgactcgaa 240 ctcatattcg tacaggtgcg gtgttagaga atctccatca tgtctgatgc atgcgatcat 300 tgtgcgaaac cggtcagaaa cgatgacgag tatatttctt gtatggcttt ctgtgaacgg 360 atggtgcata taagatgctc ggttacgaag ctcaacaaac cgtttgtgaa aatcatccag 420 gaaagcccga atttgatttg gatgtgtgat gaatgcgtga aactgatgaa gattgccaga 480 ttcaaatcgg tcgtttcgtc gttcggtgag gctattcagt ctattaccga aaaacaagag 540 tcagtacacg ctgaaatcag aaaggaactg gcaaaacaag gacagcaaat cgctcttttg 600 tctaaacgaa tgaccccatc ttctccgttt ttgcgcgaac ctggctcgtc ttttcaacaa 660 ccaccttcca aaagacggcg tgacgaagaa ttcaacttta aacaggtcgc tactaaacct 720 ctgctcggcg gaacgaaaga tacaactacc gccagcatac tcacagtccc cgagccggaa 780 gaattatttt ggctgtatct ttcccgcgtc catccaaatg ttaagccaga agcaatcgag 840 aaattggcca aagattgttt acaatgcgat aagcccatca aagcaattcc gctaataaaa 900 cgagggactg acacaagccg cttgagtttc atttcgtaca agattggaat cgatcctaag 960 tttcgtgatg ctgcactgag ccctgacact tggcccaaag gagttttgtt ccgagaattc 1020 gaagatcaga gttcaaaaaa catgtggtta ccccgcctgc atacgccgac tatcacggtc 1080 tccccggaaa taggagcttc acagttctcc actcccacga gtgtagtcaa cttagcgaac 1140 tagatctgaa caacatgcaa cgtagcaacg tagctacatc tgtcaccaaa tgccgtaaac 1200 cagaacgctt tgcctgccgc attaagggag ctcttgatcc actcgacaca gtcgcgccag 1260 cttgttcctg ccatcgaagt cgttctggtc ctgttgtcga gatcggtgac ggagtctccc 1320 aaccttctat ctcaggcaag tacgcatgct tttgtagcaa ttctctgcct gatcagctcc 1380 aatttttcag ttttccttct gatcttgtcg atcacaataa catcgatact ggaacccagt 1440 attccatcca actaaatccg aacaaccagt ctactccgac gacaacagaa catcaagtgg 1500 aacgcagttt tacgtcaagc tcacaaccag gacgcagtgc agctagcatt ttggaagtct 1560 ctggatcctc tagctcagtc cggccttttc cagccagcgt tcttcacagt cgtcctggtc 1620 ctgttgctag atgtggttca gagatcttcc agtctgctct gctaggcaag tacactcaag 1680 tttccgcttt cccgatgcct gatgcggttc caaatttcag tattttggac tcgaccgacg 1740 caagtgatgc ttcttcaaca cttactcatg gacacccgga acgcaccgta gaaagccata 1800 tggaagccct caatccctcc gacgcagtcc agccccctgc cgtcgtcgtt ctccatagtc 1860 gttccggccc tgctgtcgga agtggggcga gggtcttcca accacttaac aacggcgagt 1920 actccactta tgttagctct caatcaatgc ctgatgcaca cgtgccttcc agaaatattg 1980 gcgtctccaa agtgcggcgg aaagaagaca tcgttatcta ctaccagaac gtcggcggta 2040 tgaactcgcg cgtaaacgac taccgaatgg ccgtgtcgga ttcctgcttc gacattatcg 2100 tcctgacaga aacttggctt gactcgcgaa cgctatctag tcaggtgttc ggaactgatt 2160 acgaggtgtt tcgctgcgat cgtaatccta acaacagcag gaagctcacg ggaggcggcg 2220 tactcgtagc tgtccgttcc agattgaaag caaagttaat cgaaaaagaa tcgtggaaat 2280 gtctggaaca agtttgggtg tccatcaagc ttggcgatcg atctctttta ttgtgcacgc 2340 tctacattgc tcctgatcga gttcgtgata atgagctcat agaggcgcac tgtgactcag 2400 tcatcactgc aatggaatcc gccaatcccg tcgacgatgt tttcatcttg ggcgacttca 2460 atctaccagg catttcgtgg ttatcatctc aaaacggctt cctttatcca gaccctgtac 2520 actcttctat gcatgattgt gcagtcagtc ttctagacag ttacagcgca gccacactag 2580 cacaaatcaa ccacgtggtc aacgaaaata atcgtagcct ggatctttgt tttgtttctg 2640 tgcaagacca agctccattc attggagcgg caccctgtga gcttgtaaaa gcagtaccgc 2700 accaccctcc gttgattatt gccgttgaga ataaccaggc ccatgatttc gacaactgtc 2760 ctgctggtgt gtcatatgac ttttctaagg cggatcaccg tagcatagca gccgtattgg 2820 ccgacatcaa ctgggataac gttctcgatc tcagagacat taatacttct gcgcaaactt 2880 tttctcacat cttgacatac gtcatagaca gacacgtgcc gaagaaagtt gtccatcatg 2940 actcccgccc tccatggcaa actagcgaat tgcgaaaatt gaaatcagtc aagagagcag 3000 ctctaaggag atatacaaag tatcggacac cttcattgcg aagctactac ctgagagtca 3060 actacgagta ccaacgtgtc agtcgtcact gttttttgca gtaccagcgg agcatagaac 3120 tgaaatttaa acgcaaacca aaatcgttct ggaagtacgt aaatgagcag cgcaaagaat 3180 cgggctttcc ttcttcgatg gaatggaatg gcaaatcctc gtcatgctta caggaaatct 3240 gtcagttctt cgctagcaaa ttttccagcg tattcagcgg ggaaagtata agcaatgatc 3300 aggtcatttg tgcagccaac aatgtcccca tgaatggcca agctttgggg agtcttgatt 3360 tggatatcaa tgctgtatcc agggccgctt cgcagttgaa atcgtccaac catcctggac 3420 cagatggagt tccctcagca tttctcaaag agcatatcgg aaatttgctt gcaccgctct 3480 gttgtttatt ccgattatct ctatcttccg gaattttccc ttcctgttgg aaagtggcaa 3540 acatgttccc ggtgcataaa aaaggaagga aacgcgacgt ggataattac cgtggtatta 3600 cgtccttgag tgctgtgtcg aagttattcg agcttgttgt gatggaacct ttgctctctc 3660 attgcaagca cctgctggat tccgatcagc acggattcat cgccggccgc tcaacaacca 3720 ctaacctact gtgcttcacc actttcatta ccgacagtat gatcgctcga gtgcagactg 3780 acgctatata tactgacctg tcggccgctt ttgacaagtt gaatcacgat attgctattg 3840 ccaaacttga cagatttggc gtctgcggta atcttctgag ctggttcggc tcttacctaa 3900 ccaatcgtca gctaagagtt gtgataggcg attgcagttc cgacagtttt cacgctacat 3960 ccggtatacc acaaggaagc cacctgggac cactgatttt tctgctttac ttcaacgacg 4020 ttcatctcgt tatcaaaggc ccccgcttat cgtatgcaga cgacctgaaa atatttttac 4080 ggatatgctc cgttgccgac tgtttatacc tacaacagca acttgatagt ttcgccagtt 4140 ggtgctcgct gaatggaatg gtggttaacc ccaccaagtg ctccatcatc acgttttcta 4200 gaaaaaagca accaatcacg ttcgattata gtttgcttgg cacgaaaatt gaacgtgtga 4260 atcacatcaa ggatttgggc gtgttccttg attcacagtt aacgtataaa cagcatatct 4320 cgtataccgt cagtaaagcg tcaacgactt tgggattcat ctttagaatc gccaagaatt 4380 tctccgacat ctactgtcta aaatcactct attgttctct cgtgcgttcc acgttggaat 4440 atggttccgc tgtttggagt cctatttata ataatggcgc agaaaggatc gaatctgttc 4500 aacgccgatt tctccgattc gcactccgta agctaccttg gagagatcca ttccgcttgc 4560 cgagctatga gagtcgatgc cgattgatag acttggacct cctccggaac agaagggaca 4620 cattcagagc tttaacaatt gcagacgttt tactgggtcg tattgactgt gggacaatcc 4680 tggagcgagt ccatctaaat gcccgtccac gctccctccg gaacagcgtc atgctgagat 4740 tgcctctaac tcgaaccaac tacggacttc acggagcact tagtgggttg cagcgagtgt 4800 tcaataaagt cgcatcattg ttcgacttca ataccacccg agagatgctt cgccgaaggt 4860 ttttatcatt ttttgacgaa cgaagagttt aatttaagtt ttatctcttg actgtttagt 4920 gcttttgtta agtttttgtc ctctgattgt tttatatata ttttagacat cattggggct 4980 acaatttgcc tgttgatgtg tttcgaataa ataaataaa 5019 // ID Copia-37_CQ-LTR repbase; DNA; INV; 129 BP. XX AC AAWU01006939; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-37_CQ_; KW Copia-37_CQ-I; Copia-37_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-129 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 378-378 (2011). XX DR GenBank; AAWU01006939; Positions 4545 4673. XX SQ Sequence 129 BP; 38 A; 25 C; 28 G; 38 T; 0 other; tgtcggtgca agtaggcgtc ctgattgagg acccaatcgt agtagacgta attgatgacg 60 gaaaataaaa ttatcattct tgttcaaacc actgagctga gtagtttact ttttacgagc 120 tctaccaca 129 // ID Chapaev-4_HMa repbase; DNA; INV; 3307 BP. XX AC . XX DT 22-JUN-2010 (Rel. 15.06, Created) DT 22-JUN-2010 (Rel. 15.06, Last updated, Version 3) XX DE Chapaev-type DNA transposon: consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-4_HMa. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3307 RA Jurka J.; RT "DNA transposons from Hydra magnipapillata."; RL Repbase Reports 10(6), 790-790 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1287..2807 FT /product="Chapaev-4_HMa_1p" FT /translation="MPRKCVNLVDNFCYICGEITFFSQKRTMTPLVQTAYQ FT HYFNMKVRNQDKSWAPHICCSSCSVTLREWLKNKKRSLPFAVPMVWREPKN FT HTDDCYFCLTPPIKAGLSMKKIGSVKYPNLSSATRPIPHSDSLPVPTPPQS FT YELEAENEYGNDVEIEENRPSTSNDPDFLLTDVVPHRLSQAELSDLVRDLD FT LSQEKAELLGSRLKQWNLLQFDVKVSHYRKRQQNLLSFFEKKNNLVACIDV FT FGLMYCLNLNYDPTEWRLFIDSSKLSLKAVLLHNGNRLPSVPIGHAVHMKE FT TYINMKVLLNSINYNEHKWKICGDLKVIAILLGMQLGYTKYCCFLCMWDSR FT DRSSHYIKEDWPARNLNTEEKNVIAEPLVDPKDVLLPPLHIKLGLMKNFVK FT GMNKEGQAFMYLRNKFPKISDIKVKEGVFVGPQIRQLMKDPAFDEVLKGQE FT KEAWEALKGVICGFLGNKRDDNYIQLVTILLQKYHQLGCNMSLKIHFLHSH FT LDFFPPQLWSC" XX SQ Sequence 3307 BP; 1151 A; 490 C; 536 G; 1130 T; 0 other; cactgcccaa cagtgatttt gtatcggaac caaaattagt tatttcttat gtatttccat 60 ctgctgaatt caaaaatgac attaaaaatg acatattagc tctagtttaa gaaataaaag 120 caaaaatttt ctctcgaggc aagggtttaa ttaagcgatc gcgaaatgaa aaatccggaa 180 aaatatcagg tcttcaaaat tttagtttac gcaaaaccgt gtactttgtt ttctaggggg 240 gaatcaactt aaatttgtta gggaaaaagt tataaaatat aaaataataa acttacgact 300 gcaaaagtaa taacactcta gtactaactg cattataaaa gtaataacac tataatagtg 360 taattacttt tataacgcac ttataacttt tataattcat tcttagttca aacaacataa 420 gtaatcccaa attttaattg aagtagaaaa gcctctagaa aaatagagtg gatatttaaa 480 taatatttaa aacttattaa agaacttaaa caacttaata aaaacgtact atataaacag 540 taagcattac gaaaaacgtt actaatagtc cgtaaaaacg cacgttaatt ataaaatcaa 600 ccaataaaaa gtttttgttg attgatttta aaatttgcgc gcgtttttac ggactgtttg 660 taacgttttt cataatgcta aacttttcat aaagctcgtt tatattgttg ttttagtgta 720 tttgtaaaag tccctgaagt ttttaataat atttgattat ctactttatt tttttagatg 780 ttttttactt tgattaaagt ttgaaattac taatgttgtt tgaacaaaga atgaactttt 840 aaagttataa gtgctttata aaagtaataa tactattata gttttattac ttatataatg 900 cagttagtga ctggagtgtt attacattcg taatcgtaag tttattgttt tgttttgttt 960 tttataactt tttcgcaaac aaacttaaaa ttatttgctc cttaaaaaca aagtacacgg 1020 ctttgcataa actaaaattt tgaagaccag atatttttcc agattttgca aatcgcgatc 1080 gcttaattca actcttgcaa gggtaacttt cattttttga aaaaagagat ttttgttttt 1140 gactatccat aatgtttgat tcatgtttaa aacttgtgat actagttatg tgtgtttgct 1200 aacgtttctg gtctaatcta ttcatagttt tttttttcat tttctagtgt aaaattttac 1260 aaatctactc atctaaaatt ttacaaatgc caaggaagtg tgtaaattta gtggataatt 1320 tttgttacat ttgtggtgaa ataacatttt tctcacagaa aagaactatg acacctcttg 1380 tgcagactgc ttaccagcat tacttcaata tgaaggtaag gaaccaggat aagtcatggg 1440 ctccacatat atgctgtagc tcttgttcgg ttacactacg agaatggttg aaaaataaaa 1500 aaagatcttt gccttttgct gttcctatgg tatggagaga gcctaaaaac catacagatg 1560 actgctattt ctgcttgacc ccacccatta aggcaggttt gtcaatgaag aaaataggat 1620 cagttaaata tccaaatctt tcctctgcca ctcgacctat tcctcattca gatagtttac 1680 cagttcctac tccaccacaa agttatgagt tagaagcaga aaatgaatac ggaaatgatg 1740 tagaaataga agaaaacagg ccttctacat caaatgatcc tgattttttg ttaacagatg 1800 ttgtgccaca cagattgagt caagcagaat taagtgatct tgttcgagat ctggacttgt 1860 cacaggaaaa ggcagaactt ctagggtcaa gactaaagca gtggaatctt ctccaatttg 1920 atgtcaaggt ttcacattat agaaaacggc agcagaatct gctttctttt tttgagaaga 1980 aaaataatct tgttgcttgc attgatgttt ttggattaat gtattgtctt aatttgaact 2040 atgacccaac tgaatggaga ttattcatag actcatctaa gctaagcttg aaagctgtat 2100 tactgcacaa cggaaaccgt cttccttctg ttcctattgg ccatgcagtt cacatgaaag 2160 agacttatat aaacatgaaa gttctcctca actcaattaa ttacaatgaa cacaaatgga 2220 aaatttgtgg tgatctaaaa gtcattgcta tacttttagg aatgcaatta gggtatacta 2280 aatactgctg ctttttgtgt atgtgggata gtagagacag aagttcacat tatataaaag 2340 aagactggcc tgcaagaaat ctcaacacag aagaaaaaaa tgttattgct gaaccacttg 2400 tggatccaaa agatgttctt cttccaccat tacatattaa gttgggttta atgaaaaatt 2460 ttgtcaaagg tatgaacaaa gaaggacagg cttttatgta tttaagaaac aaatttccaa 2520 aaataagtga tataaaagtt aaagagggtg tttttgttgg accacaaata cgccagctta 2580 tgaaggatcc tgcatttgac gaagttttga aagggcaaga aaaagaagct tgggaagctc 2640 ttaagggagt gatttgtgga tttttaggca acaaaagaga tgataactac attcaattgg 2700 taacaatact tctgcaaaaa taccatcaac ttggatgtaa catgtccctc aagattcatt 2760 tcctccactc acacctagac ttcttccccc cccaattgtg gagctgttag tgatgaacat 2820 ggggaaaggt ttcaccaagt tatttctgta atggaacaaa gatatcaggg tcgttggaat 2880 gaggctatgc ttgcagatta ctgttggttt ttgtgtaggg atgctccaga actagtctac 2940 aaaaggaaag caaaaagatc acaatctcgt gacgataccc cataatcatc tttaaaccta 3000 ttaacttgat gatcatgatt tttttgttta aaactattag aatttttttt gtttaaaact 3060 attagaatat atctgtaata aaaaaaaacc aaatgtactt taaatgttat tagaatatgt 3120 aaattatatg tcatattgtt ctgttgaaaa ttaactgaaa taaaaattta tttccattgt 3180 atatctcaga aactagagct aataaaaaat ttttgttgtt attttctttt ttctcagccc 3240 aaaattagta taatttgaat aaaattgcta aagatacaaa aaaaatatat ttttttgttg 3300 cccagtg 3307 // ID Gypsy5-LTR_Dpse repbase; DNA; INV; 1223 BP. XX AC Unknown_singleton_95; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5_Dpse; KW Gypsy5-I_Dpse; Gypsy5-LTR_Dpse. XX OS Drosophila pseudoobscura OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-1223 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1054-1054 (2009). XX DR Genome; Unknown_singleton_95; Positions 32939 34161. XX SQ Sequence 1223 BP; 375 A; 317 C; 269 G; 262 T; 0 other; tgcgcttgtt gtaagaaaaa aattggttgc atcgacgtct taattttccc taaaattagg 60 ctacgcgaat gggctaagtc ttctgttggg tccaagaaag ctaagctttt aacacgccat 120 ctactggggg cgcttcgtca ccgtgggcta cccatattat tttgtttggc tgacgttcta 180 atcggcctct tcctcccact gtcattttga caaatcgaaa agaaaaaggg aagaacgagg 240 ggatgaggcc ttgatttccg ggagaagcag ctctctctct cctcgtggta tccgaacagt 300 acagacgtgc ttatccagat caaaaaaaaa aaaatttaat aaaaagttaa aaatcaaata 360 aaagttcttc ggagatcagc acgatccgcc ggcctttggg aagcaagacc ccatccaccc 420 agacgagaaa ccaatcaccg atccactgtg cagaggatgg tgcgaatttt ctgacgatcg 480 ccccaaagcg ggccaccgta gagttgggaa gctccaccca tcgccccaac gccctccgcc 540 aacgcggtcc accgtagagt tgggaagctc cacataaagc cccaacgcca tccaccacag 600 cgagcccttc agcagttccc gagcatcagc gccgacagag agcggcccct cagccgtttc 660 cgagctttag cgccgccagc agcgagacct tcgccgcccc agcgcgagcc gtcgcgaccg 720 agagtgggcc atccagccac cgggccaagc gccgtaaaca aaagtgccaa gtgatcttcc 780 ttgccaggaa taaccagact cgtcccagaa aaaaaaaaga ccctgaccag aatttttttt 840 tgtcaccgtg cttcgcccta acctaaaagc caggtcatga catgtgaagg ttaaaggtgt 900 taaataaacc ccaacggaac gggcagaact cgtgagaacc tcttgtggcc cattgggcgt 960 ttattcccaa agatcttatg tgcgctgtta ggccgttgac ccctttgtgc atataaaacg 1020 aaaccagctg cttggagctt tttccgtgcg ccaacacctt ggtctatgga tcaagaccta 1080 cctcaaagaa actaaataaa cgaataaata ttgaataaag tataaaatat agaagttaaa 1140 gatggaataa tttgatttag gaactatata tgctcaaaaa aaaaaaagcc ctaaacaaaa 1200 agaaaaagaa agaaagaaga tca 1223 // ID BEL-65_CQ-I repbase; DNA; INV; 3145 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-65_CQ_; KW BEL-65_CQ-LTR; BEL-65_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3145 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 283-283 (2011). XX DR [2] (Consensus) XX CC 'TGGGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 217..3144 FT /product="BEL-65_CQ-I_1p" FT /translation="MFRRSSPALPDPVPEAFLALPGPVPGVSPSLSDREPE FT ANHVLPNPVSEASLALPGPVPGVFPSLSDRVPEANHVLPNTASEASLALPG FT PVPGVFPSLSDRVPEANHVLPNTASEASLALPGPVPGVFPSLSDRVPEANH FT VLPNPVSEASLALPGPVPGVFPSLSDRVPEANHVLPNTTSGASLALPGPVP FT GARPALADRVPEALQNLDKLAAALYRRERVRQRIIQIQRDFRDQKHPTLKS FT VTMTKNKVWSEYHEFNRCHGEVIILVPEEALNEQEEVYWVFESLFNEVAVP FT VEELLLDIQKTSFKKLDPQEVRYEQFSSAPKPKPAATPKPVPTSASTHVNP FT MVFTTTPIKSDPRSSTSPHEVILVPGRYAPPKKSIEVVGLKPEPNPESSLI FT QLKTPEKHIATESICELVSDSRPILSSAQPSAVDSRPREVFNMLPIADIKS FT DFPETTDYLKNVTAKEESGPVLPAIPFIQVSPHHKVIDTDPRRVHSEEPKN FT HPDFELQQNCAQSLSLSAMAACRIRSGDSNRHRGTKSVAANEARSQKCVEI FT RLPIVSADLVFAEVPEDVANSANSERDEEATNSRQRASFCEVLPDSIPEQN FT PEAVPLDQTVALDPDINRFHPESGLNNPQDEADSSRVTRDSGSKVTKIPTK FT MIPRSRGAPVFPERNSQKRNDTNSNPVPRVQISVKNTLHVHLDIVPKTSIS FT LVKTVAEKDPLMPPVSKATGKLELFSTIGKSQPRKCGVPPARCQPRQLCQE FT CAKLQNQTKEVTNQEINYNKDTDDCALQVGRPYAPPSRREQVTYKLTWINL FT IKPGGMCRIAHSETASCNPFPALYREAIGVPKKQSFDNPDRTDDLALTAAV FT AVLQLLQGPRDTLQNQDKKPATQSRPNDTLSDEKVAQEWLINRELVATKML FT NTSDPYLAEETLPVNEHETERQLGEARTTVKILSRPWDPGSTSHTSSPIQN FT KPRNKTEDPELLLNQRFNGGN" XX SQ Sequence 3145 BP; 792 A; 969 C; 804 G; 580 T; 0 other; taaaatggtc caaacgaacc ggattccaag tgactgcttc tgctgctgcc ccgtgctgac 60 gtgtccgatc gtgttcgttt cttgaacacc gcgcgcgagt gaaaaccaat ttcgtcgcgc 120 cactcgacac gaaaaaagtg acacttcaaa aacagtgcaa aagtgcacct gtttggccgg 180 acccagttcc ggagacaatc ctggccttgc cgaaccatgt ttcggaggtc atccccagcc 240 ttgccggatc ctgttccgga ggcattcctc gccctgccgg gtcctgtccc gggggtttcc 300 ccgtccttgt cggaccgtga accggaggca aaccacgtcc tgccgaatcc cgtttcggag 360 gcatccctcg ccctgccggg tcctgtcccg ggggttttcc cgtccttgtc ggaccgtgtt 420 ccggaggcaa accacgtctt gccgaatacc gcttcggagg catccctcgc cctgccgggt 480 cctgtcccgg gggttttccc gtccttgtcg gaccgtgttc cggaggcaaa ccacgtcttg 540 ccgaataccg cttcggaggc atccctcgcc ctgccgggtc ctgtcccggg ggttttcccg 600 tccttgtcgg accgtgttcc ggaggcaaac cacgtcctgc cgaatcccgt ttcggaggca 660 tccctcgccc tgccgggtcc tgtcccgggg gttttcccgt ccttgtcgga ccgtgttccg 720 gaggcaaacc acgtcctgcc gaataccact tcgggggcat ccctcgccct gccgggtcct 780 gtcccggggg ctcgcccggc cttggcggac cgtgttccgg aggcactcca gaatctagac 840 aagctagcag cggcactcta ccgccgggaa agagtaagac agcgaatcat ccagatccag 900 cgtgactttc gggaccagaa acatccgacc ctgaagtcag tgacgatgac gaagaacaaa 960 gtgtggtcgg agtatcacga attcaaccgg tgccacggcg aagtcatcat acttgttccg 1020 gaagaagcac tcaacgagca agaggaagtt tactgggttt tcgaatcgtt gttcaacgag 1080 gtggccgtgc cagtcgaaga gttgctgttg gatatccaga agaccagttt caagaaactg 1140 gatccccagg aggtcaggta tgaacagttc tcatccgccc ctaagccaaa gccagcggca 1200 acaccgaaac cagttcccac tagcgcttcc acccacgtaa atccgatggt cttcacgaca 1260 acgcccatca agtctgatcc acgaagctcg acatcgccgc atgaggtgat actggttcca 1320 ggccgttacg cgccgcccaa gaagtcgatt gaggtcgttg gtctaaaacc tgaaccaaac 1380 cccgagtcga gtctgatcca gctgaaaacc ccagagaagc atatcgctac cgagtccatc 1440 tgcgagcttg tcagcgattc ccggccaatc ctgagctcgg cacaaccgag cgcagtcgac 1500 tcccgaccaa gggaggtttt caacatgctc ccgatcgcgg acatcaagtc cgacttcccc 1560 gaaacgaccg actacctcaa gaacgttacc gccaaagaag aatcaggtcc agtgctgcca 1620 gcaatcccgt ttatccaagt ttcccctcac cacaaggtaa ttgacactga tccacgacgt 1680 gtccacagcg aggaaccaaa gaaccatcca gacttcgagc tccagcaaaa ctgcgcccaa 1740 tcgttgtcgt tgtcagccat ggcagcgtgt agaatacgga gcggagattc gaaccggcat 1800 cgaggcacca agtccgttgc agccaacgaa gccagatcgc agaaatgtgt ggagataagg 1860 ctgccgattg tctcggcgga cctagtgttc gcagaagttc ctgaagatgt cgccaacagt 1920 gcaaactccg aacgtgacga agaagctacc aacagtcgtc aacgtgcgag tttctgtgaa 1980 gtccttccgg acagtatccc cgagcagaac cctgaagctg tgcctctgga tcagacggtg 2040 gcgcttgacc cggacatcaa ccgtttccat ccggaaagcg gtctaaacaa cccacaagat 2100 gaagccgatt ccagccgtgt cacgagggac agcggctcca aagtgacgaa gataccaaca 2160 aagatgatcc cccggagtag aggtgcaccg gtcttcccag aacgcaattc ccaaaaacga 2220 aacgacacca actcaaaccc agtgcctaga gtgcagattt ccgtcaagaa caccctccac 2280 gtccacctag atatcgttcc caagaccagt atttcgttgg tgaagaccgt tgcggaaaag 2340 gaccccctga tgccacccgt gtcaaaggct acaggcaagc tcgaactgtt cagcacaata 2400 ggaaaaagcc aaccgcgaaa gtgtggtgtc cctccggcca gatgccaacc acgccagcta 2460 tgccaagaat gtgccaagct ccaaaaccag accaaggaag tgaccaacca agagatcaac 2520 tacaataagg ataccgacga ttgcgcgttg caagtaggcc gtccgtacgc gccaccatca 2580 cgccgcgagc aggtcacata caagctgact tggatcaatc tgatcaagcc tggcggaatg 2640 tgtaggatcg cccattccga aactgcgagc tgcaacccgt ttcctgcgct gtaccgagaa 2700 gcgatcggtg tccccaagaa acagtcattc gacaacccgg atcgaaccga cgatctggca 2760 ctcaccgcag ctgtggcagt gctgcagttg ctccaaggac ctcgagatac tttacagaac 2820 caagacaaga aacctgccac ccaaagcagg cccaacgata cactgtccga cgaaaaggtt 2880 gcgcaggagt ggctcatcaa ccgagagttg gttgcaacca agatgctgaa cacaagcgac 2940 ccgtaccttg ccgaagaaac cctaccagta aatgaacacg aaacagaacg acagttaggt 3000 gaagccagaa caacagtcaa gatcctgagt cggccctggg acccgggatc gacaagccac 3060 acgtcgtcac ctatccagaa caagccgaga aacaagaccg aagatccaga actgctgttg 3120 aaccagaggt tcaacggggg gaata 3145 // ID Crack-2_AAe repbase; DNA; INV; 4560 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-2_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4560 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1218-1218 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 14 sequences with >96% CC identity. Closely related to Crack elements in Culex pipiens CC (Crack-1_CP to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 344..1102 FT /product="Crack-2_AAe_1p" FT /translation="MEDHISVSKATKLFKELLKREFMDMVQSSLNGTMNAM FT QEDFKILQGKLIDIKTSQQFLSDSFDDMKQQLATLRYKLDQFEKTNAHIET FT NTLNLVSIESRMDELEQQALENNLIISNLPERVDENVIDVVKKAVALHGHV FT LTGSEIIACNRMRTENKRGIKPILVKFSNNTSKRMVMSAINGKRPLTATLK FT TGPKVYANQHLTKRNQYMLNTAKEYKLHHGFSNVWFYKGAVYLKRTATSTP FT MRIENIQELRKL" FT CDS 1257..4085 FT /product="Crack-2_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MSEHNCIELMTTCSRNFEHLSETSLNILYFNIRSCRS FT KIDYLETYIHSLQQPIHVIVLSETWLYSWEIFNIQNYQSYHCTRDSDRGGG FT VSIFVRRDVSSQLILSSHKDVNDFVVVELIDERIKIMGVYNPGKNQLEFLC FT ELDNVLSDYPKTIVLGDFNINLLDTHQRIVSDYKNTVEGNGFLFINPMTNE FT YSTRVSNSIHTLIDHVFTDICKYNYGFRITDSEPHISDHRTILVSVGCNVR FT TVEKRKTKTIISYDRITPESLALDNIXSFDELTHAIRSQITQNTREISITT FT KKNPKKPYITNEILSMIEIKNRLYKHSQRDARYTPEYVQYRNSLTNRIKWT FT KRNYYNSKLAENTLNPKNFWNTLNELVHNDSSQQLHNYAIDCPSGRVTNDL FT ELCKIFNEYFINSVDTIIAHRPSSVILPTITARDLFDFSTVTENXVLLAFD FT SINSNAATGYDGIPNKFLKNFKHILSQKYTEVLNGCILTSHFPSTLKIAKV FT VPLFKAGARSNISNYRPISVLNSASKPFERLLHTQIREFLDQHNIINQNQF FT GFIQKSNTLTASINFVSKIVDGLDKSKKVAAMFIDLKKAFDCVDHTILLQK FT LAEIGFSHAAQDMIKSYLENRVQIVKINNTVGPPAHIKYGVPQGSILGPLL FT FIIFINDIFSLPLRGYLQLYADDASLYYEAEDEATLIDNMQHDIELINQWC FT IRNCLALNIDKTLVMTFSLRGQSANIRVACENTYLKQVEHIKFLGLTIDSN FT LKWTSHIFNVENKLRTMAFAMKKIRPITTEDTAWKIYFAFVHPHIMYMNCI FT WGGTNNVHLLKLARLQNGIVKTVKKLPRLTNSTELYSEKVLALNNLNKFEI FT MVYMYKVQHGLIKCNIPLIRVSAVHNQRTRQQNNIYQVQSRTSMAQNSVFQ FT KGIRLFNNLPIEIKLSRSLPVFKTRLRVFLSQQLVQNQQ" XX SQ Sequence 4560 BP; 1589 A; 829 C; 814 G; 1320 T; 8 other; tatctgktgm taatagtgtg tgtgcctttg acactgcgtt cggttttacc atagcgtwac 60 aaatcttgtt aatatttatg aacaagctaa gcttaaatag taaaaacmct ttaatttgat 120 ggttcaaacc acacctgtat agtttacggt gaaaagttac gaaaatagtt agcattttct 180 aaaaaaaaac gttgaatatt aaaccgcgtt ggactgttcg gtgtagagtc acctatacta 240 aaaagggata ccctstgtta gtggagctgt cgatagtatt ggtgctgttg ttgctgtatt 300 ggttgttgtt gttgctgtaa ctcaggctgc tgttgttgtt gaaatggaag accatatttc 360 tgtctcaaag gcaaccaaat tgtttaaaga gctgttgaaa cgggagttca tggatatggt 420 gcagtcttcg ctcaacggga ctatgaacgc aatgcaagag gacttcaaaa tacttcaagg 480 gaagctaata gacatcaaga catcacaaca gtttctatcg gattcgtttg acgatatgaa 540 acaacagctg gcaacactgc gctataagtt ggatcaattt gagaagacaa atgcccatat 600 cgagaccaac acactcaatt tggtttcsat tgaatcgagg atggacgagt tagaacaaca 660 agcattggaa aacaatctga tcatttccaa tttgccagag agagtagatg agaatgtaat 720 cgacgtggtg aaaaaagcag tggctcttca tggacatgta ctaactggat cggagatcat 780 tgcctgtaat cgtatgcgta ctgagaacaa acgaggaatc aagccaatac tcgtgaagtt 840 ttcaaacaac acctcaaaac gcatggtaat gagtgctatc aatgggaaaa ggccactaac 900 tgctacattg aaaacgggac caaaagtata cgcaaatcag catctcacta agaggaatca 960 atacatgctg aacactgcaa aggagtacaa actacatcac ggcttcagca atgtctggtt 1020 ctacaaagga gcagtttacc tgaagcgtac ggctacgtct acaccgatgc gaattgaaaa 1080 cattcaagaa ttgagaaaac tataattact cataattgaa aagtaaatta tcacaagtat 1140 atgtaaaatg cctcacacaa acattatcat gatacacttt ccactacact taactgttgg 1200 ttcaaagtat ataccaccac actgtatcac atgccctgaa acaacacaca taccttatgt 1260 cggaacacaa ttgcattgaa ctaatgacta catgttcaag aaactttgaa catctctctg 1320 agacgtctct caatatacta tatttcaata tcagaagctg ccgtagtaaa atagattatt 1380 tggaaacata catacacagt ttacaacagc caatacatgt aatagtattg agcgaaactt 1440 ggctctacag ttgggaaatt tttaatatac aaaattacca atcctaccat tgtacacggg 1500 actctgatcg tgggggtggt gtgtcgatat ttgtacgacg agatgttagt tcgcaattaa 1560 ttctctcctc tcacaaagat gttaacgatt ttgttgttgt tgagttgatc gatgaaagaa 1620 tcaaaataat gggagtttac aatccgggta aaaatcaatt ggagttttta tgtgagttgg 1680 ataatgtctt atctgattat cctaaaacta tagttctggg ggactttaat attaatttat 1740 tggatacaca ccaaaggata gtatccgatt ataaaaacac agtggaagga aatggttttc 1800 tgtttataaa tccaatgact aacgaatatt ctacgcgagt gtctaattcg atacacacat 1860 tgattgacca tgtattcacc gatatttgta aatataacta tggttttcga attactgact 1920 ctgaaccaca tatttcagat catcgcacga tattggtatc cgtaggctgt aatgtgcgaa 1980 cagtggaaaa gcgaaaaaca aaaaccataa tttcgtatga ccgaataact cccgaatcgt 2040 tagcccttga caacatakca tcatttgatg agttaacaca tgccattaga tctcaaatta 2100 cacaaaatac cagagaaatc agcattacaa cgaaaaaaaa tccaaaaaaa ccctatatta 2160 cgaatgaaat tctatcaatg atcgaaatta aaaataggct ctataaacac tcacaaaggg 2220 atgctaggta tacccctgag tatgtccagt acagaaatag tttgactaac cgtatcaaat 2280 ggacaaaacg aaattattac aacagtaaac ttgctgagaa cactctcaat ccaaaaaatt 2340 tttggaacac attgaatgaa cttgtgcata acgattcaag ccagcaatta cataactatg 2400 cgattgattg tcccagtggt cgggttacca atgatctgga actttgcaaa atatttaatg 2460 aatatttcat taatagtgtg gatacaataa tagctcatag accgtctagt gtgattttac 2520 caacaattac tgcaagagat ttatttgatt tttcaactgt cacagaaaat kaagttctac 2580 ttgcctttga tagcattaac tcaaatgcgg ctacaggata cgatggcatt cctaataaat 2640 tcctgaaaaa tttcaaacat atactttccc agaagtacac tgaggtgctt aatggttgta 2700 ttcttacaag tcatttccct agtacactaa aaatagcaaa agttgttcca ttgttcaagg 2760 ctggagcaag atctaatatt agcaattata gaccaatatc agtattgaat tctgcatcaa 2820 aaccgtttga acggttgttg catactcaga ttcgagaatt tcttgatcaa cataacatta 2880 taaatcaaaa tcaatttggt ttcattcaaa aatcaaatac acttacagcc tctattaatt 2940 ttgtttctaa aattgtagat ggattggata aatcaaaaaa agttgcggca atgtttatag 3000 atttgaaaaa ggccttcgat tgtgttgacc acacaatatt gctacagaag ctagcagaaa 3060 tcggattttc acatgctgcc caagatatga tcaaaagtta cctcgaaaac agagtacaga 3120 tagtgaaaat taataataca gttggaccac ccgctcatat caaatatggc gtccctcaag 3180 gatccatatt agggccactc ctctttataa tttttataaa tgatattttt tcacttccac 3240 tgagaggtta cctccagtta tacgcagatg atgcctcatt atactacgaa gctgaagatg 3300 aagccacttt gatcgataac atgcaacacg atatagagct tataaatcaa tggtgtatta 3360 ggaattgttt agcgctgaac attgataaaa ctttggtaat gacattcagc ctacgtggtc 3420 agtcagcaaa tattagagta gcatgcgaga atacatattt gaaacaggtt gagcacatca 3480 aatttttggg gttgaccatt gactccaatc ttaaatggac ctcccatatt ttcaatgtag 3540 aaaacaaact tagaacaatg gcctttgcga tgaaaaaaat acgaccaata acaaccgagg 3600 atactgcatg gaaaatatac tttgcgtttg ttcatcctca cattatgtac atgaattgca 3660 tttggggagg tactaataat gtgcatttac ttaagctcgc tagactgcaa aatgggattg 3720 taaaaactgt taagaaatta cctagattga caaattcaac cgaattgtac agtgaaaagg 3780 ttttggcgtt gaataacctg aataaatttg aaataatggt ttatatgtac aaagtgcaac 3840 acggtttaat taaatgtaat atacccctta taagagtatc tgcagtacac aatcaacgta 3900 ctagacagca aaataatatt tatcaagttc aatcaagaac atcaatggct caaaacagtg 3960 tttttcaaaa aggaattagg ctttttaaca atctgcctat tgaaataaaa ctatcaaggt 4020 cactaccagt ttttaaaaca cgtttacgtg tattcctatc acaacaatta gtacaaaacc 4080 agcaataata cattgactac gattgtgaaa ccacactgta caaagctggt gtgactagca 4140 tcgagttatt tggattattg tttttcgacg aaacaactct taagctcaca gtgctggctc 4200 cctacgataa tcaaaagtaa cgcgactgta ggcgcgggat ccgtccccag gatcattaac 4260 ctacttgtat gggctcgtag ggttgggaag tccccggtgt acaaaaagta cagttgggta 4320 aatggcttcc tttccggcac tttaaagatg tccatcatgt ttgttacgct ttgaaccaac 4380 atcactaagc caggtacact atcattcaga cataataaca aagaatttag tatttagtta 4440 gttgttcaat atacacactg gaaacacaca catatactga atgaattaca ctttgaggta 4500 actgcgcaca gttttcatta acttctgcag tggcaaataa accaaataaa taaataaata 4560 // ID BEL-99_AA-I repbase; DNA; INV; 2075 BP. XX AC AAGE02034270; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-99_AA_; KW BEL-99_AA-LTR; BEL-99_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2075 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02034270; Positions 6697 4623. XX CC 'AGCCC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 52..1554 FT /product="BEL-99_AA-I_1p" FT /translation="MDSVARKRTALMGRLAGIRSRVSAMNSTTARIDDLDA FT EMECLSDWWLDYRKLHDGILDLCKDDDLLDDIIKSGIEADQEHNSVKAMIT FT QFQRVIRNRDNPVGNTTSVLALDHNASPVGLPELNLPKGMLPTFSGDYGEW FT TSFFDLFISSVHNNPRLTDAQRLLYLKTYTSGSAAALLRHIKVEDRAYQGA FT LDALKKRFDRKDHIVSHQIQRYLDIPTINAASASSLRRVYSIADDVIRALK FT ASQREERDSWLIQILLVKLDPETRQLWADRSSLNEVDPQVRSIDSFLEFLD FT QRAFTIEATQRPSYHRNNESSSKSSQRHCSSFVSTNSAVIPNCTLCEDQQH FT NLYFCQRFKTLPSSERLRLTSELGVCSNCLRHGHDENRCSASKCRKCDQPH FT HTLLHDAFLRQPSPQVTTMLAYNECSSVLLATAVVNVLDSQNQTHSVRAVL FT DSASQACFITTSLKSRLLLQCETVNMSLQGISRKATQFRKLSPSTCVHEPQ FT ATTNN" XX SQ Sequence 2075 BP; 601 A; 525 C; 458 G; 491 T; 0 other; gtaatttggc gctgtagaca ggatagtaaa agtgttacgt gtaaagaagt gatggatagt 60 gtcgcgagga aacgtactgc gttaatgggc cgacttgctg gtatacggtc cagagtgagt 120 gccatgaact ctaccactgc gcgcatcgac gacttagatg ccgaaatgga atgcctgagt 180 gattggtggc tagactatcg aaaacttcac gacggaattc tggacctctg caaggacgac 240 gacctattag acgacattat caagagtggc atcgaggcag accaggagca caactcggtg 300 aaggcaatga tcactcaatt tcaaagggtg attcgaaacc gagataatcc ggtcggtaat 360 actacttcag ttctggcatt ggaccacaac gcttcacctg tgggactacc tgaactaaat 420 ctcccaaagg gaatgcttcc gactttttct ggagactacg gagagtggac atcgttcttc 480 gaccttttta tcagctcggt tcacaataat cccaggctca ccgacgctca aaggttgctt 540 tatttaaaaa cgtatacatc tggatctgcc gcagcactcc ttcggcacat caaagtggaa 600 gatagagcct accagggtgc tttagatgct ctgaaaaagc ggtttgatag aaaggaccac 660 atcgtaagcc atcaaattca gcgctatctg gatataccaa ccatcaacgc agcttcggca 720 tcgtcgcttc gtcgggtata cagcatagct gatgacgtaa taagagcatt gaaagcatca 780 cagcgagagg aaagagatag ttggctcatt caaattcttt tggtgaagct ggatccagag 840 actcgtcagc tatgggcaga tagatcatcc ttgaacgaag tcgacccaca agttcgttcc 900 atcgatagct ttctggagtt tcttgaccaa cgagcgttca cgatagaagc cactcaacgt 960 ccgtcgtatc atcgaaacaa tgaatcatca tccaaatcgt ctcaacgaca ctgctcaagc 1020 ttcgtatcta ccaacagcgc agtgattccc aactgcacac tctgtgagga ccaacaacat 1080 aatttgtatt tctgccaacg attcaaaaca cttccatcat cggaacggct acgtctgacc 1140 agtgagctcg gcgtctgcag caactgtctt cggcatggcc acgatgagaa tcgttgttcg 1200 gcaagcaagt gcagaaagtg tgaccaaccg catcacacct tgcttcatga tgcatttctt 1260 cgacaaccat cacctcaagt tacaaccatg ctcgcctaca atgaatgctc gtcggtatta 1320 ctcgccacgg ccgttgtaaa cgtcctcgac tcgcagaatc aaactcactc tgtgcgtgca 1380 gtcttggact ctgcctctca agcctgtttc atcacgacct ctttgaagtc acgtctactc 1440 ttgcaatgcg aaacggtcaa catgtcactt caagggattt cccgaaaggc cacccaattt 1500 cggaagttgt caccgtcgac ctgcgttcac gaacctcaag ctacaacaaa caactgaaat 1560 gtgctgtact cgaacaaatt tccgaaccac tgcctcgtca cttcatcaac atcgaaaagt 1620 ggaatttatc gccctatgca ccactggcgg atgagaaatt caacataccc agtggcatcg 1680 atcttctcat tggagcaaac atctttttcg aactactaca gcaggacaaa gtcgacctag 1740 gaccagcaaa accaacactg caaaatagca agctaggatg gatcgtagca ggaagttaca 1800 ccaccgaaag ttcatctgta gacatctgtg caaccgtaaa tagtttttcc aaggtgggcg 1860 gtgtcatcga atcaaccact agagcacctg aggaagacca gaaggtgagc ggcgatcgtt 1920 taataaaatc atcattagta tcatcacaca acttttccgg caccgagagc agctactgcg 1980 aagctcattt gatgacgtcg atggaaacca tttcatcctg cagccaacaa gctatatatg 2040 aagccaaaaa ttgactacgc aattggtggg cggaa 2075 // ID Outcast-2_BF repbase; DNA; INV; 5546 BP. XX AC . XX DT 21-JUL-2009 (Rel. 14.07, Created) DT 21-JUL-2009 (Rel. 14.07, Last updated, Version 3) XX DE Amphioxus I-2_BF autonomous Non-LTR Retrotransposon - consensus. XX KW Outcast; Non-LTR Retrotransposon; Transposable Element; I; I-2_BF; KW Outcast-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5546 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5546 RA Kapitonov V.V. and Jurka J.; RT "Young families of I non-LTR retrotransposons from the amphioxus RT genome."; RL Repbase Reports 9(5), 1141-1141 (2009). XX RN [3] RP 1-5546 RA Kapitonov V.V. and Jurka J.; RT "Outcast-2_BF family of non-LTR retrotransposons in the amphioxus RT genome."; RL Direct Submission to Repbase Update (21-JUL-2009). XX DR [3] (Consensus) XX CC Outcast-2_BF is a consensus sequence of the young Outcast-2_BF CC family of non-LTR retrotransposons that belong to the Outcaste CC clade (I group) [3]. Originally, this family was reported as CC I-2_BF, a member of the I clade [1-2]. The Outcast-2_BF consensus CC sequence has two ORFs. ORF1 codes for a protein that contains the CC PHD and Zinc knucle domains at its N and C termini. ORF2 codes CC for a proteins composed of the apurinic endonuclease, reverse CC transcriptase and ribonuclease H. XX FH Key Location/Qualifiers FT CDS 69..1319 FT /product="Outcast-2_BF_1p" FT /note="ORF1 protein." FT /translation="TKKTTKAKSNTRENLKKQKTHNSKQKQYCNKVLPRRR FT KQKTTNKKDNPTNNRKQSQQRNKSPSKRQTLSEEEELCICGNDHPEGGCWI FT CCDQCNTWWHGTCAKLSPSTVAYFVKSGEEYYCAYCTIQNINKSTKHIRKK FT DPSHEDVNTPKQTIRSREGPATDTKHKPNKEETNEDKKEEIIIIDNISNPS FT SYKNSAKILAEISRHISGITHSIDLAYSLPRGGIAIHCRESKAASEILKPW FT PEGAFGNAHEQLKVHKVHNIYGRRAVLKNVPTNTNEAAIEQNIYEQTRVKV FT KAHRFHYHDTGKPLKVVRIEAEEGQLSELFTQNIRIENALVKVEAYKTKRH FT TPIRCYQCHKLGHIAKECKESATCAKCGGKKDHTKSCRPNCVNCKGNHPAN FT DSRCKAFISLQQKLVDRDRRQHQ" FT CDS 1418..5395 FT /product="Outcast-2_BF_2p" FT /note="ORF2 protein composed of the APE, RT, and FT RNAse H domains." FT /translation="MLKCLITLIVCLVTLTEQTVHCQTSTTRDKLATRMKI FT AQINIRSINTSATLVEKMCEKKNIDVLCLSEVWHKAKQPTCLKTWTWTHKN FT RTDKRGGGTAIVTRENIKVMEEPIQQTTHADIVALNIYTKALNFVIVSAYV FT PPEDRLALQELLSIVNELNQKKKHLVLCGDLNTKHIAWGNKKNNKLGQMLY FT EAMETGDFQIMNNNTPTRDDSIIDLTIVNKNTTKHMQSWRVEPEVQLRTDH FT NLITMSIGSKEETQKQQKWNLRNVDWKEWQDKTDTVFQQWVEETDWATEKA FT NSRSEEVYNSFKENLLTCAEEVITKKTITQHSKGYWNKALDVQMQITKTAL FT RKFKRRRDKTNLTKYLHEKEILTNMEEEARNTYWNEQLKDMDPRKPQKFWK FT AIKNQLGRSSRPTIQPIRQKDGTIATTDQQIAHAVTTEYAPGGVEITGELA FT TWKQNITQAVEISIKHEKQRLQDDHTYEEDTHEDTLNLDLTLEEVKAAIRK FT MDASSSPSPTEGILPVMIKYGGDGLTIALQHMLNLVWETGQIPKAMKQDNK FT ILIKKPGKDDYNKVRSYRPITLSSVVGKLMERIVDNRLTWWAEANHILSPY FT QEAYRKHRTATHGVLRLVQHIEEAWKNNETTVAVFADYEGCFDRIWQEGLL FT YKLITKGVKGRTLCYLESFLRGRETRFKVNTITTEPEISKVGIPQGAVLST FT TLCNIYTADAYHQTGLDNFQYADDGAAWCSGADVKEVSKQVETSIDHVIST FT WCPLWNMRIEESKTKAMVFSPPHIDQPTADSLEVNNKNIDIIPEVKLVGIT FT LDEKLNFQSHICNTQTKAYKALKAISKVTNAKKNPNQEAHLQLYRAMIRPI FT LEYGTECTLRAGQKHDAAYAPIQRKALLAATGCKIRTSTDALEVLTGIMPI FT DIHLTCRQAQAYLRMATKHTGNPIYDKIAQERSEEKIGTTLHLLETRFKEM FT KGEIEVSTVDKECYYDSRLPPFSVGRITGSFLPTTTNPANKEEAKEKVKQI FT LHEMKKTPTTVVFTDGSSLGNPGPTGCAAVIYEQWGVTEPYTVRKPVAAKS FT NNYEGELQGIYLALNTLHRSQSKNRRILILCDCKAALENVSSLQQAEAYND FT LVNAARQRLFEIQQKGHNIQIEWCPGHMGVEGNELADMQAKLAAEEAKHTE FT NNTTWTKQQAMKHIEEQAVKRWQRRRENQTTSSHMQKANTSLNKKCSTWGS FT RTTQISINQLVSGHTELNAYKNWIDPTETPNCSTCGTRENIDHYMYECPKY FT ENTRQHLMKEIDNIYEDYGIPQQERTMDVVSLAGMRSDLTNEANKRMYLAF FT TKYIEDTRRFAEQV" XX SQ Sequence 5546 BP; 2219 A; 1283 C; 1166 G; 878 T; 0 other; aagcaaacag caatcgacaa caaaacaacc agcagaaaaa caggaggata aaagtaacaa 60 agaactaaac caagaaaaca acaaaagcga aaagcaacac aagagaaaac ctcaagaaac 120 agaaaacaca caactccaag caaaaacagt actgcaacaa agtccttccg agacgaagaa 180 aacagaaaac aactaacaaa aaagacaatc ccacaaacaa caggaaacaa tcacaacaaa 240 gaaacaagtc accaagcaaa cgacaaacat tgtcagaaga agaagagttg tgcatctgtg 300 gtaacgacca cccggaaggc ggctgctgga tctgctgtga ccagtgcaac acatggtggc 360 acggcacatg tgccaaactc tcaccatcaa ccgttgcata ttttgtaaaa agtggagaag 420 aatattactg tgcatattgc accatacaaa acatcaacaa gtcgacaaag cacataagaa 480 agaaagatcc ctcacatgaa gacgtaaaca ctccaaaaca gacaatcaga agcagagaag 540 ggccagctac tgataccaag cacaaaccaa acaaagaaga aacaaacgaa gataagaagg 600 aagaaataat catcattgac aatatatcta atccatcttc atacaagaac agtgcaaaaa 660 tattggcgga aatctccaga cacatatctg gaatcacaca cagcattgac ttagcctact 720 cactgccgag aggaggcatc gccatacact gcagagaaag caaagcagcc tcggagatac 780 tgaaaccctg gccggagggc gcttttggta acgcacatga acaactaaaa gtacataagg 840 tacacaacat atacggaaga agagcagtac tcaagaacgt tccaactaac accaacgaag 900 cagcaatcga gcagaacatt tacgaacaaa caagagtaaa agtaaaggca cacagattcc 960 actaccacga cacaggaaag cccttgaaag tggtacgaat agaagctgag gaggggcagc 1020 tgtctgagct attcacacag aacatcagaa tcgaaaacgc attggtgaaa gtcgaagcat 1080 acaaaacaaa aaggcataca cccattagat gctatcagtg ccacaaacta ggccacattg 1140 ctaaagagtg taaggaaagc gctacatgcg caaagtgtgg ggggaaaaaa gaccacacaa 1200 aaagctgcag acctaactgt gtcaactgca aaggcaacca cccagcaaac gacagcaggt 1260 gcaaggcatt catcagccta caacaaaagt tggttgacag agaccgtaga caacaccaat 1320 aagagacacc agaaatccag cagttagcac tctccacaga agaggcaaaa catacaccac 1380 ggtacaacta ccaaccacaa ccgaagaaag aagaagcatg cttaagtgcc ttataacgct 1440 catagtgtgc ctagtaacac tcaccgagca aacagtacat tgtcagactt caacaacacg 1500 agacaaactt gccacaagga tgaagatagc acagatcaac attagatcaa ttaatacctc 1560 cgccacattg gtagaaaaga tgtgcgagaa aaagaacata gatgtattgt gcttgtcgga 1620 agtgtggcac aaagccaaac aaccaacatg cctcaaaaca tggacatgga ctcataagaa 1680 cagaacagac aaaagaggtg gtggaacagc cattgtcaca agagagaaca taaaagtaat 1740 ggaagagccc atccaacaaa caacacacgc cgacatagta gcactaaaca tctacacaaa 1800 agcgcttaac ttcgtgatag tgtcagcgta cgtcccacca gaagacagat tagcactgca 1860 ggaactcttg agtatcgtca acgagctcaa ccaaaagaag aagcacctag tcctctgcgg 1920 tgacctcaac acaaaacata tagcctgggg aaataaaaag aacaacaaac taggacaaat 1980 gctgtatgaa gctatggaaa ctggcgactt ccaaatcatg aataacaaca caccaacccg 2040 agatgacagt attatagact tgacaattgt aaacaaaaac actacgaagc atatgcagag 2100 ctggagagtt gaaccagagg tgcagttacg aacagaccac aatctcatca caatgagcat 2160 aggttctaaa gaagaaacac aaaaacaaca gaagtggaac ctgagaaacg tagactggaa 2220 agaatggcag gacaaaaccg acacggtctt ccaacagtgg gtagaggaaa cagattgggc 2280 aacagagaaa gcaaacagca gaagtgaaga agtctacaac agcttcaagg aaaacttgct 2340 gacatgtgcg gaagaagtga taacgaagaa aactataaca caacatagca agggctactg 2400 gaacaaagcc ttggatgtac aaatgcaaat cacaaagact gcactaagga agttcaagcg 2460 aagaagagac aaaaccaatc tgacgaaata cctacatgag aaagagatct taacaaacat 2520 ggaagaagaa gcacgaaaca catactggaa cgaacagctt aaagatatgg accccagaaa 2580 accacagaaa ttctggaaag ctatcaaaaa ccagctaggc agaagttcca gacccaccat 2640 ccaacctata agacagaaag atggcacaat agccacaact gatcaacaaa tagcacatgc 2700 agtcacaaca gagtatgctc cagggggtgt ggaaatcaca ggagaactgg caacctggaa 2760 acaaaacatt acccaagcgg tggagatctc cataaaacat gagaaacaaa gactgcaaga 2820 tgatcacaca tatgaagaag acacacatga agacaccctc aacctagacc ttaccctaga 2880 agaggtgaaa gccgccatca gaaaaatgga cgcaagcagc tctcctagcc ccacagaagg 2940 aatactacca gttatgatca agtatggggg agatggcctg accatagccc tacaacacat 3000 gctgaaccta gtatgggaaa ctggacagat ccctaaggca atgaaacaag acaataaaat 3060 actcattaag aaacctggaa aagacgacta caacaaagta agaagctaca gacctataac 3120 actctcgagt gtggttggaa agctaatgga aagaatagtt gacaacagac taacgtggtg 3180 ggcagaagcc aaccacatat tatcaccata ccaagaggca tacagaaaac atcgtacagc 3240 tacgcacgga gtgctaaggc tagtgcaaca catcgaagaa gcctggaaga ataatgaaac 3300 cacggttgcg gtgtttgccg actatgaagg atgtttcgat cgcatatggc aagaaggact 3360 attgtacaaa ctaataacaa aaggggtcaa gggtaggaca ctctgctacc tagagagttt 3420 cctgagaggt cgagaaacca gatttaaagt caacaccatc acaacagaac cggagatcag 3480 taaagtaggc atcccacagg gagctgtact ttctaccacc ctctgtaata tctacacagc 3540 ggatgcctac catcaaactg gactagacaa cttccaatac gctgatgatg gtgcagcctg 3600 gtgtagtggc gcagacgtaa aagaggtatc aaaacaagta gagacaagta tcgatcacgt 3660 aatatcgact tggtgccctt tgtggaacat gcgtatcgaa gaaagcaaaa ccaaagccat 3720 ggttttctct ccacctcaca tagaccagcc aactgctgac agccttgaag taaacaacaa 3780 gaacatcgac atcatcccag aagtcaaact ggtagggatt acactagacg agaagctgaa 3840 cttccaaagt cacatctgca acacccagac taaagcatac aaagccttaa aagcaattag 3900 caaagtaacc aatgctaaga aaaatcccaa ccaggaggct cacctccagc tgtacagagc 3960 aatgatcaga cctattctcg aatacggaac agaatgtaca ctcagagcag gacagaagca 4020 cgacgctgcg tacgccccca tacagagaaa agctctcttg gctgcaaccg gttgcaagat 4080 cagaacgagt acagacgcac tggaagtgct gactggcatt atgcctattg acatacacct 4140 aacatgccgg caagcacaag cctacttgag aatggcaaca aaacacacgg ggaaccccat 4200 atatgataaa atagcacaag aaagatccga agagaagatc ggcacaaccc tacatcttct 4260 cgaaaccagg tttaaagaaa tgaaggggga gatagaggtc agcacagtag acaaggaatg 4320 ctactacgac tccaggctac cccctttctc agtgggcaga ataacaggat ctttccttcc 4380 caccacaaca aaccctgcca acaaggagga agccaaagag aaagtaaagc aaatactaca 4440 cgagatgaag aaaacaccaa ccacggtagt tttcacagat ggttcatcct taggaaaccc 4500 cggtccaaca gggtgtgctg cggtcatcta cgagcagtgg ggagtaactg aaccttacac 4560 agttagaaaa ccagtagctg caaagtccaa caactacgaa ggagagttgc aggggatcta 4620 cctagcactg aacaccctac acagaagtca aagcaaaaac agaagaatcc tcatactctg 4680 tgattgtaaa gcggcccttg agaacgtcag ctcgttacaa caggcagaag cttacaacga 4740 tctagtgaat gcagcaagac agaggctctt tgaaatccaa cagaaaggac acaacataca 4800 aattgaatgg tgcccaggcc acatgggggt tgaaggaaac gagctagcag acatgcaagc 4860 taagctagct gcagaagaag ctaaacacac agagaacaac acgacatgga ctaagcaaca 4920 agcaatgaag cacatagaag aacaagcagt aaaaagatgg caaagaagaa gagaaaacca 4980 aacaactagc agccacatgc agaaagccaa cacgagtttg aataagaaat gcagcacatg 5040 ggggtcaaga acgactcaga tctctataaa ccaactcgtg agtggacaca cggagctaaa 5100 cgcctacaaa aactggattg accccacgga gacaccaaac tgtagcacat gcggtacaag 5160 ggaaaatata gaccattaca tgtatgaatg cccaaaatat gagaacacca gacaacatct 5220 aatgaaagaa attgacaaca tatatgaaga ctacggaatc cctcaacaag aaagaacgat 5280 ggatgttgta tccctagcag gaatgcgaag cgacctgacg aacgaagcaa acaaaaggat 5340 gtatttggcg ttcaccaaat acattgaaga cacgagaagg ttcgccgagc aggtctagaa 5400 caccccaaac acgcatatgc aaagcaccag tgtacaagca ccagcgaggt caacaaatct 5460 aagagcaccc agcagaaaaa gcaccaagag accatgtcta gcggagaagt agacgttaaa 5520 caaggacaac aacaacaaca acaaca 5546 // ID piggyBac-2_BF repbase; DNA; INV; 4195 BP. XX AC . XX DT 13-APR-2011 (Rel. 16.04, Created) DT 13-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE piggyBac-2_BF DNA transposon DNA transposon - consensus. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-2_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4195 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M. and Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4195 RA Kapitonov V.V. and Jurka J.; RT "Transposable elements in the amphioxus genome."; RL Deposited to Repbase as the tmplanrep.ref file (02-May-2008). XX RN [3] RP 1-4195 RA Kapitonov V. and Jurka J.; RT "A family of piggyBac-2_BF DNA transposons from the amphioxus RT genome."; RL Direct Submission to Repbase Update (13-APR-2011). XX DR [3] (Consensus) XX FH Key Location/Qualifiers FT CDS join(317..814,1153..2763) FT /product="piggyBac-2_BF_1p" FT /note="TPase and the zinc finger." FT /translation="MSRYSLRQVHEEIFSNDSDPEYPTSTEDSDESSSEDD FT ESPPRSPSPGGASGEGPSDDLDELDTGDNGANGRGRARGTGRGRRGRARGG FT RGRGRGRGRGASSARGRGRRGRPATRGRGARGRTTPGGTRRTAVLDSRLAD FT ENWQEVDGDTLPQPPHPFTAERGFTDQVTLPDDPVELDFLNLMFPDHLYKV FT LTEQTNKYAEDFFAQNPKPGLPPNSRPRDWPENGITEPDMKAFMALTIAMG FT IIHQQDINDYWSVDEVQETPFFRSVMSRDKFQLIFKFMHLNDNDEYKARGD FT PEYDPFQKLGVFYPTITQQFKSTWSPGREICIDEGMVPFRGNVHFRTYNPD FT KPDKYGLKAYELCDSSNSYCCCFELYGGKGKEPSEKGLTYDIVMRLMEPYV FT GTGRTLYCDNYYSSPQLFLDLADANTNTCGTMRNRKGVPDAFKDASVTTPQ FT QKFCMANGPLLAVKYKDRRDVKMLTTAHSAKLVGTKKNNHRGEEKVKPECI FT HEYNLSMGAVDTSDQMVAYLSFRRRSLKWWKKAFFHIYSLGICNAFILYKE FT HKEQQRAQQRAQQRALQRANNTDGDDDEQQAEPKPLHKVFRREIVKQLIST FT SGYAKPQRKSLSASGAVLQRLTGRHFLEELPPSESGRVPNRVCQVCNTAKM FT RKAEMRKEPKRKQYRVSRYQCDECKVALCAAPCFKLYHTCKDFVASYLNNH FT DKD" XX SQ Sequence 4195 BP; 1155 A; 887 C; 968 G; 1185 T; 0 other; cccttgtcct gcgggcccgg ttatacaacc gggtacacaa gcagtgcctg tgtgcgggcc 60 cggttatata accgggtggg tgtatttccg gtaagaatgc gtcatacacc cgcgaaccgc 120 gagaaacccg gaagtaactt ggcggcatct cccgccaaaa ggttaagggt catttttccg 180 attgcgcaat ctatcccgtg gtgctttgcg cggcataatt tatgttaatg agcaacatgg 240 cgaacgtcgt ccgctagttt gtttacttct tttgtttggg tgtttctggc ggcaaatttg 300 ttatctaaac gccaaaatga gtcggtattc gttgagacaa gtacacgagg agattttttc 360 gaacgattca gacccagaat atcctacttc gaccgaagat tctgatgaga gcagcagcga 420 agacgacgag tccccacctc gttctccaag ccctggcggg gctagcggag agggaccctc 480 agacgatctt gatgaacttg acacgggcga caacggtgcc aatggccgtg gaagagctcg 540 gggtaccggc cggggacgtc gtggacgagc tcgaggaggt cgtggacgtg gacgcggcag 600 gggaagaggc gcaagttctg caagggggcg aggcaggcgt ggtcgccctg caacccgcgg 660 acgtggcgct cggggacgca caactcccgg tggcacgcgc cgaacagctg tcttggattc 720 ccgtcttgcc gatgagaact ggcaagaggt ggatggtgac actttgcccc agcctcccca 780 tccttttact gcagagcggg ggtttactga ccaggtaaat taatctttat ttgttattta 840 ttttgacact tttgctatct ctatctacca tatgtgtaac gttacttcgt gtgattcatt 900 cggtatttga catttttggc tactaaagtg tatgttatta ttctgaatac tataaaaaaa 960 cgtgccttta gattttcatt ctgtcttttg ctttatattt atatattgat attgataatt 1020 ccctcacatt ggtggacttt gctactatat catttcatta ccattcttga acatacctcg 1080 acattgtact tttacttcac aagtgcacac agacttattt gacttataag attatattct 1140 gtcattcttc aggtcacatt gccagatgat cctgtggaat tggacttttt aaacctcatg 1200 ttcccggacc atctatacaa agttctcaca gagcagacta acaaatatgc ggaggacttc 1260 tttgcccaga accctaaacc tggcctgccc ccaaactcac gtccaaggga ctggccggag 1320 aatggaatta ccgagccaga tatgaaagca tttatggctt tgactatagc gatgggtatc 1380 attcaccaac aagacatcaa tgactattgg tcggtggatg aggtgcagga aacacccttc 1440 ttccggtcag tcatgtcgcg cgacaagttt cagctcatat tcaagttcat gcacctcaat 1500 gacaacgatg agtacaaggc ccgtggtgac ccagaatacg atccgtttca gaaactgggg 1560 gtattctacc caactatcac acagcagttt aagtcaacat ggtcgcccgg cagagagatc 1620 tgcatcgacg agggcatggt tcccttcagg ggaaatgtgc atttccgaac atacaatcca 1680 gacaaaccgg acaagtatgg gctcaaagct tacgagctgt gtgactcaag caacagctac 1740 tgctgttgtt ttgagctgta tggcgggaag ggcaaggaac catcagagaa ggggctgaca 1800 tatgacattg ttatgcgact aatggagcct tacgtcggga caggccgaac attgtactgc 1860 gacaactact acagctcgcc acagctattc cttgaccttg ccgatgccaa caccaacacc 1920 tgtggcacga tgcgcaacag aaagggcgtc ccagacgcat tcaaagacgc ttcagtaact 1980 acgccccaac agaagttttg catggcaaat ggacccctcc ttgcagtgaa gtacaaggac 2040 aggagggatg tcaagatgtt gacaactgca cacagtgcaa agcttgtcgg cacgaagaag 2100 aacaaccatc gtggggaaga gaaagtgaag ccggagtgta tccatgagta caacctgtcc 2160 atgggtgctg tagacacctc ggatcagatg gtggcctact tgagcttcag aaggcgctcc 2220 ctaaagtggt ggaagaaggc tttcttccac atttacagtc ttggcatctg taatgcattc 2280 atcctgtaca aagaacacaa agaacagcaa agagcacagc aaagagcaca gcaaagagca 2340 ctgcaaagag caaacaacac ggatggtgat gatgacgagc aacaagcaga accaaagcca 2400 cttcacaagg tcttccgaag agagattgtg aaacaactca tctccacatc aggatatgcg 2460 aagccacaga ggaaaagtct gagtgcaagt ggcgctgtcc tgcaacggct cactggtcgc 2520 catttcttgg aggagctccc tccgtccgaa agtggaaggg ttccgaatag ggtgtgccag 2580 gtgtgtaata ctgccaagat gaggaaggcg gagatgcgaa aggagccaaa gaggaagcag 2640 taccgtgtgt ctcgctacca atgcgatgag tgcaaggtcg cactgtgtgc ggcgccatgt 2700 ttcaagctct atcacacatg caaagatttc gtcgcctcct atctaaacaa ccacgataag 2760 gactaggcct ctgtatgttt ttcttcaaat atatactttt gacatgatct atacaatata 2820 tactttcatc attctgacat tgtagacttg tgtaaggaaa tgtgttttta ttgcttttga 2880 tgcctttatt gcctttcata tggatgataa caactctata ttgatgtaac atgaaagtac 2940 acagtgtaag ctttctattc atgtacagga agttgtcatt tcctttattg tatgatcaca 3000 aaatgtagta tattcttttg tataagtgtg tggattttgc tgacttgcca gttcagcact 3060 aaggacagtc cacggccatt aacttacctt ttatgataca tgtattaatt ttcctcttca 3120 tatgtttttc cttacattgt caacactagt caggagcttg gcagtccata ttgagacatc 3180 tctggtggtg ctagtgagtg ttgtatggat atagtatgaa gaaatttgtt tttattgctt 3240 ttgatgcctt tattgcctta agtatagatg ataacaactc tatattgatg taacatgaaa 3300 gtacacagtg taagctttct attcatgtac aggaagttgt cctttccttt attgtatgat 3360 cacaaaatgt agtatattct tttgtataag tgtgtggatt ttgctgactt gccagttcag 3420 cactaaggac agtccacagc cattaactta cctgtaatga tccttgtatg cattttcctc 3480 ttcatatgta ttatctgaca ttgccaacac tagtcaggag ctttgtagtc tacattgagg 3540 cagatgtggt ggtgatagtg aaggtttctg tgaaacaatg ctagaaattg tagtttttat 3600 tgccttttat accttaatta ccttcatatt tatatgttat tactatgtta caaaactaga 3660 aagacagtac aagttataag ctttcagttg atgtatagga aagtagtgta gtgtttgtta 3720 taaagctaca gaaagatgta tatgcatgtc gaatagaaca cccttttctg cagaaatttc 3780 agttcagtac cgggacaatc aaaggccata aactcacctg caatgatacc tgggtgattt 3840 tttttctccc acttgtagct ctgatgttgt cagaatacat aagcaatgtt gtagtgttca 3900 tactatcatt ttaatatgcg tcagatacta taatacaaga agtttatcaa aataatttgt 3960 tttattgcct tattgccctt gtttctcatt attgatgact ttgggtaaca tttgtttgac 4020 attttcaaat agtagagact tttagctttg taatgatgta taactttcta gggttacgtg 4080 ggagaaaagt aactttaaaa gggcaaagta aaagaagtgc ccattctttc aacagggtaa 4140 aaaatgccca gcacagggag ggtgtaatag taaaaattcg ccagcaggac aaggg 4195 // ID hAT-69_HM repbase; DNA; INV; 3561 BP. XX AC . XX DT 05-JAN-2009 (Rel. 14.02, Created) DT 05-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-69_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3561 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 409-409 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 647..3355 FT /product="hAT-69_HM_1p" FT /translation="MYQCYIFFKFKFFKRYNIMDRGKRQRLSGSQYKKIRL FT QREANSAKQKNALLHFLSRSSSEKALAPIVASTSNQNEVEIEIESGGEIEY FT ADSVSVNEYRNSGNDDTAKEPENVDEVEFISNLSEVVVQNVDDIGQNENNE FT DQSGEDPKTNKDLPLNFNDPAKWPIIDGKFKALLMKYGPIQKLPSLFPEDE FT QGRKFSSFHFYRRMSNGDKIKRTWLVYSITKDTVFCFCCKIFSPNKFSLSS FT HDGCRDWKNISVIIKRHETSSGHTTAYLNWRDLEVRLKSGATIDHINQKII FT QSETQHWRQVLARIITLIRTLAGQNLALRGTCEKLFEPNNGNFLKFIEFLG FT NYDPIMSKHIQRITTSEIHTTYLGKTIQNEIVELLANKIKNDILAKLERAK FT YYSLILDCTPDISHMEQMTVVFRFVSATESSSEKPAEVVVSEHFVTFLELQ FT DTTGANMTKVVIDKLQELGVNLDDMRGQGYDNGANMRGKYNGVQARIRQLN FT PRAFFVPCSAHSLNLVLNDAANCCMEAVAFFDLIQGIYNFFSASTHRWSIL FT LNHLSDLTIKPLSATRWESRVGAVRALRFQISGVYDALIEIAEDNTLTTAP FT QVKSRAEAKGIARNISNFKFVCSLVLWYDILFQINIVSKMLQSQALDLSLA FT LEHLNATKSFLCDYRCDEEFAAMVENAKKLAVELEIFEGFDVDDPVRVRRK FT SRQFLYEGRDEPIVSPEQNFRVSFFNRILDIAIQAINERFTQLSEYNELFG FT FLYNIGSKPLTDDELLKHCKDLHLALMSDGQSDINGVELWYEIKAIGRRLD FT TSNSDPKSVLKCIYTSNVVEVFPNLSIALRILLTLPVTVASAERSFSKLKI FT IKSYLRSQMCQDRLVGLATISIEKEIADHLDIEDLVKDFAELKARKIHF*" XX SQ Sequence 3561 BP; 1205 A; 560 C; 646 G; 1150 T; 0 other; caggcccggt tttaagcgta ggcgacgtcg gcgacagccg acggcggcac ttttctagag 60 cggcactttt cttaatttat tttgtgtttt cccaaaaaaa aaaaaaaatt taaatttttt 120 ttttgtaaca agaaattatt ttattttttt tacgggaact ttaatacata ctaatttttt 180 caaataaata atatagtaat aaaagtgttt gtttgtatac atgttttaca tttgggattt 240 ggcgaagaag aaacagtaaa aaaaagtttc ttttttcttt ttcagggatg gatgttagca 300 aaccgccaaa tatacaaaca aacgaattct attggttcat aaaatacgct gactaagcaa 360 attaaagttc cagaatattc ctaagaaact ttcttaaaaa ttttattgaa gttatctttg 420 agatatctat tgtaaattgt tataaaagtg tattattaaa cgaactttaa ctttgaaaaa 480 aaaaaaaaaa ttattatacc cttttataaa ttgattaata tttgtctaac aataattttt 540 ataagtaagt agttaaagtt agttttttcg ttttcttttt catgtatttg aataaactta 600 ttattattat tattattatt atcattaaat ataaaaattt aggtatatgt atcaatgcta 660 tatttttttt aagtttaaat tctttaaacg atacaatata atggaccgtg gtaagagaca 720 gcgcttgtct gggtcacaat acaagaaaat taggcttcaa cgtgaagcaa attctgccaa 780 gcagaagaat gctttgttgc actttttgag caggtcatca agcgaaaaag cactggcacc 840 tatagtagct tcaacttcaa accaaaatga agttgaaatt gaaatcgaaa gcggtggaga 900 aatagaatat gcggatagtg ttagtgttaa tgaatataga aattctggaa atgatgatac 960 tgctaaagaa ccagaaaatg ttgatgaagt ggaatttatc tccaatttaa gtgaagttgt 1020 tgtacagaac gtcgatgata taggacaaaa tgaaaacaat gaagatcaaa gtggtgaaga 1080 ccccaaaacc aataaagatt tgcctcttaa ttttaatgat ccggcaaaat ggcctataat 1140 agatggtaaa tttaaagcat tgctgatgaa gtatggtcca attcaaaaat tgccgtcttt 1200 atttcccgaa gatgaacaag gtagaaaatt ttcaagcttt catttttatc gccgaatgtc 1260 taatggcgat aaaattaaac gcacttggtt ggtgtactcc attacgaaag acactgtatt 1320 ttgcttttgc tgtaagatat ttagtccgaa taaattctcc ctgtcatcgc atgacggctg 1380 ccgagattgg aaaaatattt ctgttattat taaacgacat gaaacatcaa gtggccacac 1440 tactgcatat ctaaattgga gagatctgga ggtgagactt aaatcagggg ctacgattga 1500 tcacattaat cagaaaataa ttcaatctga aactcaacat tggcgacaag ttcttgcaag 1560 aataataacg ttaattagaa ctcttgctgg acaaaatcta gcattaagag gaacatgtga 1620 aaagcttttc gaaccgaata acggaaattt tcttaaattt attgaattct taggaaatta 1680 tgatcctatt atgagtaaac atattcaacg aataacaaca agtgaaattc acaccaccta 1740 tctcggaaaa actattcaaa atgaaatcgt tgaattgctt gctaataaaa tcaaaaatga 1800 catcttagca aaactagagc gagcaaagta ctactcactt attttagact gtacccccga 1860 tataagccac atggagcaga tgacagttgt ttttaggttt gtcagtgcaa ctgaatcatc 1920 ttcggaaaaa ccagctgaag tcgtagtttc tgaacacttc gtaacttttc ttgaattaca 1980 agacacaacg ggagcaaata tgacaaaagt tgttatcgat aagttgcagg aattaggcgt 2040 aaatcttgat gacatgaggg gacaagggta cgataatgga gccaacatgc gcggaaaata 2100 taatggtgtg caagccagaa ttcgtcaact aaatcctcga gcttttttcg ttccttgtag 2160 tgcccactcg ctcaacttgg tattgaacga tgcagctaat tgttgtatgg aggctgtcgc 2220 attctttgat ttaattcaag gcatttataa ctttttctca gcatcaactc accgttggag 2280 tatattatta aaccacctaa gcgatctgac tataaaaccg ttaagtgcca ctcgttggga 2340 aagcagggtc ggtgccgtac gggcacttcg atttcagatt agtggagttt acgatgcact 2400 aattgaaatc gcagaagaca atacattaac tactgcacca caagtcaaga gtcgcgcaga 2460 ggctaaggga atagccagaa atatttcaaa tttcaaattt gtttgctcct tggtattgtg 2520 gtatgacatt ttattccaaa ttaatattgt cagcaaaatg ctccaatcac aggctttaga 2580 cttgtcgctt gctctagagc atttaaacgc taccaaatct ttcctatgcg attatcgctg 2640 tgatgaagaa tttgccgcaa tggttgaaaa tgcaaaaaaa ttggcagttg aactagaaat 2700 ttttgaagga tttgatgtgg atgatcccgt acgcgtacgc agaaagagta gacaattcct 2760 atacgaaggt cgtgatgaac cgatagtatc accagaacaa aacttcagag tatctttttt 2820 caatcgaata ttggacatcg caattcaagc aataaatgaa agatttacgc aattatcgga 2880 atataacgag ttatttgggt ttctttacaa tatcggcagc aaacccttaa cggatgatga 2940 gttattgaaa cactgtaagg acttgcattt agcgcttatg tctgatggcc aatctgatat 3000 caacggggtc gaactttggt atgaaattaa agctattggg agacgacttg atacttccaa 3060 tagcgatcca aaaagtgtat taaaatgtat ttacacatct aatgttgtcg aagtattccc 3120 aaacctttcg atagctcttc gcatactact aacgttacca gtcacagttg ctagtgctga 3180 aagaagcttt tcaaagctga aaataataaa atcctatttg agatcacaaa tgtgtcaaga 3240 tcgtttagtt ggtctagcta cgatttccat cgagaaagaa attgctgatc atttggacat 3300 agaagatttg gttaaagatt tcgcagaatt aaaagctagg aaaattcact tttaattacg 3360 atttgttttt tattttttca tgaatggaat ttgttttgtt ttttattttt gtataaattg 3420 agttttttta ttttctatat taaagtttga ttttcagtgt ttttctattt tccttcaaat 3480 tggaaattat tgcattttta atttttgcgg cggcaaattt gcatctcgcc tacacataga 3540 aaactgttaa aaccgggcct g 3561 // ID Gypsy-21_IS-LTR repbase; DNA; INV; 175 BP. XX AC ABJB010901549; XX DT 15-FEB-2011 (Rel. 16.02, Created) DT 15-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_IS_; KW Gypsy-21_IS-I; Gypsy-21_IS-LTR. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-175 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (15-FEB-2011). XX DR Genome; ABJB010901549; Positions 318 144. XX SQ Sequence 175 BP; 48 A; 44 C; 38 G; 45 T; 0 other; tgttgtgttc agctgcatat gtatattcat tcaagcaacc ctgtgcctga gagcacatgg 60 gtctaagcaa gaccgccatg accacgacgt tgctccgatt gtaaccggac tcccattgta 120 tgccgaaatg cggctcctta gtatgtaatt aaaagtgtta caccaagaca cgaca 175 // ID Harbinger-3_BF repbase; DNA; INV; 7602 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus Harbinger-3_BF autonomous DNA transposon - consensus. XX KW Harbinger; DNA transposon; Transposable Element; Harbinger-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-7602 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-7602 RA Kapitonov V. and Jurka J.; RT "Harbinger-3_BF - a family of autonomous DNA transposons from the RT amphioxus genome."; RL Repbase Reports 8(8), 799-799 (2008). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(197..942,1317..1422,1627..2043) FT /product="Harbinger-3_BF_1p" FT /note="transposase." FT /translation="MAGVPPAFPLVMNQVMFFLYLMRVVLNMRRHARVGRR FT RRLRYRERRQRPARVNAAFFLAALLVAEGGEPVHRGVWQHPRNPASFWDQI FT VLDTLSDAEWYKRFRMRKATFQMLCDELDPELRHKDTRMRDAISVQKRLAI FT GLYWLASGDLMRSVADLFGVSVSSACNIIHEVHNAINEVLFRRYLTFPTGQ FT KLRETIHGFMEKWQFPQCAGAVDGSHIPINAPTKDPTDFFNRKGYHSVILQ FT GVVDHLYRFTDICVGRPGSVHDARVLRDSPFFTRMESGTMLPRELTREIEG FT VPVPVAVLGDAAYPQMPWLVKPYPDSGALSREKFDFNYRQSRARMTVECAF FT GRLKGRWRCLSKRLDVDLDNVPSIVAACCVLHNVCEIHGDQHQEELVLPAN FT DVQRRPGDHGRIVPNAARDALARHFAAGN" FT CDS join(7290..6927,6091..5816,5387..5310,4910..4447) FT /product="Harbinger-3_BF_2p" FT /note="SANT/Myb." FT /translation="MAEGNAGRSRGSRCTWNYEETAFLISVWSEEEMLRQL FT ETPRNKFAYVKIVDRLAEEGIIRTFDQVKTKIRDLKHDFKKTWQHNHKSGN FT GRLDPPHYQALKEFLGCREALEPSSLFESTEITGVLCLEVRAAQSLLPLSS FT IDLDTSTVNASNESSNSNQLDMSTTNASNESSNSDQLDTSTTSASSSGQLD FT TSTSARSSGQLDTSTSASSSGQTGGKRKAQEDGDNNPKSARQEDKNGKKGF FT SKKERKRKTRLETSFSELEKTIHNVHKDQDTKLMEAENKRAEDWKAHDLRV FT MKMQQDHEKETMSNMMGQFVAMMGQFMTAFAPGQQGNSSASNQNPPTAAFQ FT GPPTSDFQRPSAANFQHPQSSASGFQRPHSPAPGFRLPGSPYIRLSAPLCC FT " XX SQ Sequence 7602 BP; 2222 A; 1542 C; 1631 G; 2207 T; 0 other; ggcccgatct agacacattt tttcccgata tagccccatc cgatatagcc ccatagccga 60 tatcggggcg catctaaaca cattttgcaa atggtagcgc gggacatccg ctcgtagcgc 120 ggaacgcccg cggcgcggaa cattcgtgac cccttttgac ctttggtgac cttcgacata 180 tcccgcggat ttcaaaatgg cgggagtgcc gcctgctttt ccgcttgtta tgaatcaggt 240 gatgttcttc ctgtatctca tgcgcgtggt gttaaacatg agacgccatg caagggtagg 300 gcgcaggagg cgattgagat acagggagag acgacagcgg ccggctagag tgaacgcagc 360 gttcttcctc gctgccctgc ttgtggcgga gggtggagag cctgtacacc ggggtgtatg 420 gcagcaccca aggaatcccg cgtccttttg ggatcagatc gtcctggaca cactgagcga 480 tgcagaatgg tacaagagat tcaggatgag gaaagctaca ttccagatgc tttgtgatga 540 gctggacccg gaacttcggc acaaggacac ccggatgaga gacgctatat cagtccagaa 600 gcgcctcgcc attggcctat actggctcgc ctcgggtgat ctgatgagga gcgtagcaga 660 tctctttgga gtgagcgtgt cttccgcatg caatatcatc catgaagtgc ataatgctat 720 aaatgaagtt ctgtttagac ggtacttaac tttcccaacc ggacaaaagt tgagggaaac 780 aatccacggg ttcatggaaa agtggcaatt cccgcagtgt gctggggcgg tggatggatc 840 gcacattccc atcaacgctc ccacaaaaga tcccacggat ttttttaaca ggaaaggata 900 ccattccgtc atactgcaag gggtcgttga ccacctgtac aggtaagttt attacaacta 960 cttactgtgt tcatgataat atataatgtg tgtgtatgtc tctgtttgtt tgtttgtttg 1020 tgtgtgtgtg tgtgtgtgtg tgtgcgattt gacctaggca ctgtcacgtc acagtcggaa 1080 gcgtgtatta acagaaatgt ctgaaataag cattcaaata gtatacttga taaacttcaa 1140 tcatactaac ttatatcatt tctcaaccta cctggattat ttcgaagatc ttgatttcca 1200 tgcaacgacg ttagaacacc aacctatcaa ccattttcca attcctccat aagtattttc 1260 ttaaatttaa gttatgacga gcattcgcaa aattgaagat ttatttttgt ccaaaggttt 1320 acagacatct gcgtcgggag gccagggagt gtacatgacg ctcgtgtcct aagggattca 1380 ccgttcttca cccggatgga gagcggaacc atgcttcctc gggtacgtat ccgtttccgg 1440 cgtgttcaga acataacgtt gaaggattgt gttaaatatt taagcactac aagtatttat 1500 aaaatagaca ttctgatttc gtgttccata tgaacgtcat agaaatgcac tattgacgac 1560 gccgacaaca accacattat tacttattac aataacgaat atccttcttc atcatatctc 1620 ttttaggagt tgacacgaga aatcgaaggc gttcccgtac cggtggcggt cttgggtgac 1680 gcagcatacc ctcaaatgcc ctggctggtc aagccgtacc ctgacagtgg agctctcagc 1740 agagagaagt ttgacttcaa ctatcgtcag agcagagccc ggatgacggt agagtgtgcc 1800 tttggccggc tgaaagggag gtggaggtgc ctgtcaaaac gccttgacgt cgacctggac 1860 aatgtaccgt ccatcgtagc agcctgctgt gtgctgcaca acgtgtgcga gatacacggg 1920 gatcagcacc aggaagagct tgtcctcccc gccaacgacg tccaacggcg accaggtgat 1980 cacgggcgaa tcgttcccaa tgcagccagg gacgccctgg ccaggcattt cgctgctgga 2040 aattagtata gctgtttatt ccatattgct tggttctttt tcgagttata aaatgcacta 2100 tgatacttgt atttgattgc acgaagtttg atatatgtaa acttgcacgt tttcagtaac 2160 gttttgatag ccaaaaatac gtgtttccta tatatgatat caagttaata tcaaacagca 2220 aagactgtaa cagcaatgac tacattgcaa caaattccta cgggggaata acttctttat 2280 attttccttt gcactgtcaa atgttatatg tagtttatag ctcagagaaa atggtagttt 2340 atgtatgttt tcagtaatgt tttgataggc agcaagatgt gtttcttaca tgtgatataa 2400 agtttttatt aaacaccaaa gagtaaatgc aacagattcc caatggggaa taatataaac 2460 taccttatat ataaacttct taggtctttt ttgtctactt tgctgggtac aaaagtacac 2520 catgatattt tcacaaagat aatatagatg gtttttgcct tgatgtacaa aggtagagtt 2580 tgacattaca ttgcacaaag tacatagcct atgttctgat cggtggcaac atgtgttata 2640 tatcatgtaa gttaactatg aagaaagttc cttgcaacaa attcccaacg aggaatacta 2700 tatactaaat acataaaaaa aactttactc ttctttacct tgagttacaa aagtattctt 2760 actatgtttt cttcgccctg agttataaag atacactttg atttttgtat aaaaagggat 2820 gtttttgcct taatgtatag aagtagactt tgacattaca ttgcacaaag tacatagccc 2880 atgttctgat cggtggcaac atgtgttata tatcatgtaa gttaactatg aataaagttc 2940 cttgcaacaa attcccaacg aggaatacta tatactaaat acataaaata aactttactc 3000 ttctttacct agagttacaa aagtattctt actatgtttt cttcgccctg agttataaag 3060 atacactttg atttttgtat aaaaagggat gtttttgcct taatgtatag aagtagactt 3120 tgacattaca ttgcacaaag tacataaccc atgttctgat cggtggtaac atgtgttata 3180 tatcatgtaa gttaactatg aataaagttc cttgcaacaa attcccaacg aggaatacta 3240 tatactaaat acataaaata aactttactc ttctttacct agagttacaa aagtattctt 3300 actatgtttt cttcgccctg agttataaag atacactttg atttttgtat aaaaagggat 3360 gtttttgcct taatgtatag aagtagactt tgacattaca ttgcacaaag tacatagccc 3420 atgttctgat cggtggtaac atgtgttata tatcatgtaa gttaactatg aataaagttc 3480 cttgcaacaa attcccaacg aggaatacta tatactaaat acataaaata aactttactc 3540 ttctttacct agagttacaa aagtattctt actatgtttt cttcgccctg agttataaag 3600 atacactttg atttttgtat aaaaagggat gtttttgcct taatgtatag aagtagactt 3660 tgacattaca ttgcacaaag tacatagccc atgttctgat cggtggtaac atgtgttata 3720 tatcatgtaa gttaactatg aataaagttc cttgcaacaa attcccaacg aggaatacta 3780 tatactaaat acataaaata aactttactc ttctttacct tgagttacaa aagtattctt 3840 actatgtttt cttcgccctg agttataaag atacactttg atttttgtat aaaaagggat 3900 gtttttgcct taatgtatag aagtagactt tgacattaca ttgcacaaag tacatagccc 3960 atgttctgat cggtggtaac atgtgttata tatcatgtaa gttaactatg aataaagttc 4020 cttgcaacaa attcccaacg aggaatacta tatactaaat acataaaata aactttactc 4080 ttaagtttct ttgccatgga caaaaaaaac aaaaagcaaa taactcaaag aagataaaac 4140 tctggtttac accttcgggt aaacatatga ggatctacag atgaaagtag atcaactcag 4200 aagcattctc ctcactggga gatgctgtgc gctctaagtt agaaacgcgg gggggctgaa 4260 aacctggggc aggactgggg gggcgctgga agccagaagc cgaactctgg gggtgctgaa 4320 aacctggggc aggactatgg gggcgctgga agccagaagc cgaactctgg gggtgctgaa 4380 aacctggggc aggactatgg gggcgctgga agccagaagc cgaactctgg gggtgctgaa 4440 aagttagcag cagaggggcg ctgaaagtcg gatgtagggg gaccctggaa ggcgaaaacc 4500 tggggcagga ctatgggggc gctggaagcc agaagccgaa ctctgggggt gctgaaagtt 4560 agcagcagag gggcgctgaa agtcggatgt agggggaccc tggaaggcag cagtgggggg 4620 gttctggttt gaagctgaag agttaccctg ttggcccggt gcaaatgcgg tcatgaactg 4680 gcccatcata gctacaaact gccccatcat gttgctcatg gtctccttct catggtcttg 4740 ctgcatcttc atgaccctca ggtcatgcgc cttccagtcc tctgccctct tgttttctgc 4800 ctccatgagt tttgtgtcct ggtctttatg cacgttgtgg attgttttct ctagttcaga 4860 gaaggacgtt tctagcctgg tcttcctctt cctctccttc tttgaaaaac ctagacatag 4920 taaaaacaaa taaacactga ttctacttgg atgaagattt tctaaacatg aggcaaccac 4980 tggtctggtt ttattttcct acttttctta aacctttaga aaatgcaaat aaggaaaaac 5040 tggtaaaggg taaaacttag gctttacagt agattcacca tacatcaaca acctgcttga 5100 ggcttgagaa gctatgaaag tgagagcaag caagcaagag agggagagag atggagtgag 5160 agaggagcct aggacagctc tctatcatca tttttacacc cccccccccc caattaactg 5220 gaggtcaagg taggtaatac aatacctccg taagtattgt aacaagcctg tctcttggaa 5280 aacaatgata catgtaatag taaaatcacc tttcttccca ttcttgtcct cttgtcttgc 5340 tgactttggg ttgttgtctc catcttcttg tgctttcctc tttccaccta tgaattgtta 5400 acaaagcaaa gaaaactaca ttatattaac atataacaca tacacttaga gaaaaaaatg 5460 ttgaaatatt aatgctacaa ctcgataata gtatgacttt gtaagtttgt atgtatgtca 5520 tgccccaaca caatttatgg aacaactagg cgtagagata aatatacaca tatatctcca 5580 tgaaaaaaga aaccatggca ttaattcata agagaacgaa catatgtagc tgcatagtag 5640 acatataaca cctcattgtg tcttctatca atattgtatg gcatacaatt tatatgtaga 5700 ttgtacaagc aaatagtatt atattctagt attacactgt gcctcagata tactaaaaaa 5760 ctacaaaagt cattagaagt cgtgcctgtc agagtttaag gaatcagcac agtacctgtc 5820 tgaccagagc tgctggcact cgtggacgtg tccagctgac cagagcttct ggcactcgtg 5880 gacgtgtcca gctgaccaga gctgctggca ctcgttgtgg acgtgtccag ctgatcggag 5940 ttgctggatt cgttgctagc attcgttgtc gacatgtcca gctgattgga gttgctggat 6000 tcgttgctag cattcactgt ggacgtgtcc agatctatag aggaaagggg aagcagagac 6060 tgagcagcac gtacctcaag acataataca cctgataagt aagacaatgc gtaatttaag 6120 attagaaaca gcatagtcat ggaccaattt ttgaatgaat aaattgacat taagttaccc 6180 taagcctttg aggaattata tactatctat ttgcaaaaac tttttaaagc tgggaatatc 6240 aaaagtatca tatattatga taggttataa atagaaattg tatgcataca tcttcaattt 6300 gtttatggaa aagtaaagta ataataaaag gtaaggaaca cacactgacc tgtggaagaa 6360 ttcaaggata catctggctg tggggattct gtttggcttg gtccaggtga caagtctcca 6420 ttcccatcat cctcatgtga atcttgtaat atatgtaaaa atcatgcatt agggttgcat 6480 tctttgacaa catcaacatt gacaacaata atatgataat aataaaaatg acagtaacaa 6540 tgtttcccca gaaaaatacc gatgtctcca ttttagggta tacacactcg aaatgtgtgt 6600 catctataca attgttataa gacaaaaata agacatgaat agtaattaac gttacattat 6660 ttaatattta ttttttattt atctggaaat gtgacaaata catttacaca gactcattgg 6720 cagcatttag cattgggtta tgaatatatt cctgtctgtg atcatatgac ttttaaaata 6780 tttttctgaa tcagctattc catatagtta atgaagctaa caaatacaca tcacgtgggg 6840 cacccacccc tccaggaaaa gtggtgcccc cctccccaac aaaaacaatt gtcattacgt 6900 agtaacaccg aaaattcaaa attcacctgt gatttcggtc gactcaaaca gagagctggg 6960 ctccagggcc tccctacagc cgaggaactc tttcagggcc tggtaatggg gagggtcgag 7020 ccggccgttc ccgcttttgt ggttgtgttg ccacgttttt ttgaagtcgt gcttcaggtc 7080 acgaattttg gtcttcacct ggtcgaaggt gcggatgatt ccctcctccg ccagcctatc 7140 cacgatcttt acgtacgcaa atttgttccg aggagtttcc aactgccgga gcatctcctc 7200 ctcagaccat acgcttatga gaaaggcagt ttcttcatag ttccatgtgc aacgggatcc 7260 gcgggagcgc ccagcattcc cctccgccat cttgctctcc cgcgctaatc tgtcgctgtt 7320 cccgcgttat atgctaatga gcccgcgtta acctgtagtt atatgctaat gagcttcgcg 7380 aattcgcgga gcatctacac gcccgccgaa catgtcggga acccctgctg aggtatcggt 7440 aaggtatcgt tagatggctt acgaatggtc cggaccattc gggagaaatt ttcccgatat 7500 agtaatggcg atatagtagt tgcatctaga cgctgagaaa aagctgtcgg cgagatgtag 7560 taggctcgcc gatatcggga aaaactgtgt gtagatcgcg cc 7602 // ID Gypsy-180_AA-I repbase; DNA; INV; 7401 BP. XX AC supercont1.157; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-180_AA_; KW Gypsy-180_AA-LTR; Gypsy-180_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7401 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.157; Positions 768348 760948. XX CC Positions [2971-3474] - Reverse transcriptase CC Positions [4540-5016] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 149..2308 FT /product="Gypsy-180_AA-I_2p" FT /translation="MDYTHLTDEEVTYELALRHVVNLGPTTHRGKVLRLKA FT LIQEENLRDSKPTSSEHVMSSQSNIELCESQIQQLHISTEAAIRAADNAAL FT SHIRTRLYHYRDRLKLIRPPSELRETHGMLTVHVDVLLNKICGTSVVSGVP FT RSESVVSQAPSTGAVRRSSGENEGAVGGTTDATIPVGDSSRQDSTPPPVTS FT FSGGQGRGLLFSSSFRDRGNEDTGEREFGQQAASPPPPYHPRDRNPWNSTT FT REHQSLETRELQARITEMQEREQRMREEYYRMRDDLERLIRRERPAPERAD FT ERRIQKAVHNWPFKFRGEKDTTSLNVFLDRVETFARSEGMSDATLLSSIKH FT LLQEDAIDWYARATSQNLLRTWDQFKREIRREFLPSGYSQILRLEASFRFQ FT GREESFAKFYRDISALFRFVDPPIPNDEKFFLVKKNMNENYAAIVTAARPR FT SLEEMVEVCTGYDETKMLLNRQRRIPIPHSALLEPNFATPVATNRPPPIQH FT QQQPHRFSRVHAVELEENHHEREAGEEQCEDAWQHTIDELVEQVNALKMNL FT ERRSTRPSFTTRDERQFRSTDRLNRTEANVLGAGRQQARETQAQQQRPQAV FT PYELRQPQQQPEQVQRGQGWQARQWTFRTVEESNENNRRSYRLQPEPVSQP FT ESHHEEQQGAPNNQRLAMVCWNCDEEGHRFMDCPKPQAILFCYRCGRKGYS FT LRSCFTCRTDAVNYPAENQQ" FT CDS 2266..5388 FT /product="Gypsy-180_AA-I_1p" FT /translation="MPHGRGKLSSGEPAVEGSSSPHNFGDPSVIPEFDNIN FT SVIINLGSDNRPHAVISVLGKELTALLDSGANCSLLGGARAKLAEECGLQK FT GTVCGGIKTADGTKHRITNFVYLPIVYNNRNEVLPVLLVPSIPDCIILGMN FT FWEKFGVKAVCCTVESQQEEEEGSEEQEMKQLSPEQKRRLEQAIQKFPKAV FT EGKLGRTRLYEHRIDVGNAPPRKQRHYPMSPYVLQEVNREIDRMLALDVIE FT EAQFSPWNNPLVAVKKKTGQYRVCLDARHLNSIMVNEGYPIPQIAAITNNL FT RSSKYISSIDLKDAFWQLPLHSGSRPLTAFTVPSRGHFQFKVVPFGLCTAS FT QALARLMTHLFADLEPLVFHYLDDIIICSETFKEHIALLEEVARRLRQANL FT TISAEKSKFCRKSMKYLGYVIDEQGWRVDDEKIQAIVQFPTPTTRKEVRFL FT GVCNWYRRFIAGFSQLAAPLTNLTSGKAKFRWNPIAEEAFLKLKAALVSAP FT VLAMPDYSKPFAIACDASDTAIGAVLTQEINDEEHPISYFSQKLSASERKY FT SVTERECLAVIRAIEKFRGYVEGIRFVVHCDHAALSYLKSMKNPTALMSRW FT LLRLNAFDFEIRYRKGSINVVPDALSRTVAEAVLTVDQVQDPWYKQLIERV FT KSEGDQFPDFRIANDTLFKNCRCKDEVGAVYHKWKQCVPKEERLQLIRRFH FT DEPAAAHLGFYKTWHKLQAHYYWPQMQLEINDYVRNCATCKACKAPGKRMM FT PQMGNPKPAKTPWEMISVDFVGPLTRSKRGNTVLLVVVDWVSKYVIVKPMR FT AADSQKMVEFLEEEVCLKFSRPRLILSDNGKQFESMVFKSWLAKHKIGHMK FT TAFYCPQVNNAERVNRVVVTCIRALLDGDHREWDEKLPAITAAITAARHEA FT TGVSPHEANFGRNLLLHTDLYTQQELNTPEDPKVAQDLRLSAIRRIQKLII FT ERIKNSHQRAKQRYNLRTRTVAFKAGDLVWRRSFVLSSKADQINSKLEPKF FT VPAIVKEILGANLYVLEDVLSGKKGRFHAKDIKAD" XX SQ Sequence 7401 BP; 2115 A; 1854 C; 1976 G; 1456 T; 0 other; ctggcgccca actgaaaatc cacaacattt agctaccgtt cagcgatttt tttggttttg 60 ttgagctttt tggttttgtg catttgatta tccggccttg ataatcaagg atattttaac 120 ccttacgcat aagatttgat attaagatat ggattataca catttgactg acgaagaagt 180 gacttacgag ttggctttac gccacgtagt caacctaggt ccgaccaccc acagagggaa 240 ggttctccgc ctcaaggctc taatccaaga ggagaatctt agagatagta agcccactag 300 ctcggaacat gtcatgagtt cccaatccaa tatagaactt tgcgagtcgc aaattcagca 360 gctacatatt agcaccgaag cagccattcg ggcagctgat aatgcagctt tgagtcatat 420 ccggacacgt ctttaccact atcgcgaccg tttgaagcta atccgccctc cgagtgagtt 480 gcgggagacg catggcatgc taaccgtgca tgtcgacgtg ttgttgaaca aaatctgcgg 540 gacaagtgta gtgagtggtg taccaaggag tgagagtgtt gtgagtcaag caccgtcgac 600 cggtgcagtg agacgaagca gtggcgagaa cgaaggagca gttggaggaa cgacggatgc 660 cacgattccg gtaggcgatt cttcgaggca agactccaca ccaccaccgg tgacgtcatt 720 ctctggtggg caaggacgag gattgctgtt ttcgagttcg tttcgagacc gaggcaacga 780 agacactggg gaaagggagt ttggacagca agccgcatca ccaccacctc cctaccatcc 840 ccgggatcgc aatccatgga actccacgac acgggaacac cagtcgctag aaacccgtga 900 actgcaagca aggatcacgg agatgcagga gagagaacag cggatgcgcg aagaatacta 960 ccgaatgcga gatgatttgg agcggttgat tcgccgagaa cgacctgcgc cggagcgagc 1020 ggacgaaagg agaatccaga aggcggtgca caactggcca ttcaagttcc gaggcgagaa 1080 ggacacgacg tcactcaacg tgttcctgga tcgtgtggag acttttgcga ggtcggaagg 1140 aatgagcgac gctactcttc tgagctctat caagcacctt cttcaagagg acgccattga 1200 ctggtatgcg cgagcaacgt cacagaacct tttgcggacc tgggaccagt tcaagcgaga 1260 gatccgcaga gaatttttgc ccagcggtta ctcccaaatt cttcgtctgg aggccagctt 1320 cagattccaa ggaagggagg aatccttcgc gaaattctac cgcgacattt cggcgctctt 1380 ccgcttcgtg gatccaccaa ttccgaacga tgagaagttc ttcctggtaa agaagaacat 1440 gaacgagaac tacgcagcta tcgtcacagc agcaagacca cgttcactgg aagaaatggt 1500 agaagtttgc accggttacg acgagacaaa gatgctcctg aataggcaac gccgtattcc 1560 aattccccac agcgcgcttc tcgagccgaa cttcgccacg ccagtagcaa ccaacagacc 1620 accaccgatt cagcatcagc agcagccaca ccggttcagc agggtccatg cggtggagct 1680 ggaagaaaac caccatgaaa gagaagcagg ggaagagcag tgtgaagacg catggcaaca 1740 caccatcgac gagcttgttg aacaagtcaa cgcgcttaag atgaacctgg agcgaagatc 1800 gacgaggcca agttttacca cacgcgacga aagacaattc agatcgacag acaggctcaa 1860 ccgaaccgaa gctaacgttt tgggggcagg gcggcagcaa gcaagagaga cgcaagcaca 1920 gcagcagcga ccgcaagccg taccatatga actacgtcag ccacaacagc agccagaaca 1980 ggttcaaaga ggacagggct ggcaagcacg acagtggact tttcgcacgg tagaggaatc 2040 gaacgaaaac aatcgaagaa gctaccgtct acagccagaa ccggtcagcc aaccggaaag 2100 tcaccatgaa gaacagcaag gagctccgaa caatcagcgg ctagcgatgg tttgctggaa 2160 ctgcgacgag gaaggtcaca gattcatgga ctgtccgaag ccgcaagcga tcctcttctg 2220 ctaccggtgt gggcggaaag gctactcgtt gcgcagttgc ttcacatgcc gcacggacgc 2280 ggtaaactat ccagcggaga accagcagta gaggggagca gctctccgca caacttcgga 2340 gacccctccg ttatacccga attcgacaac atcaactccg tcatcatcaa tctcggaagc 2400 gacaatagac cgcacgccgt tatttcagtg ctaggaaagg aactcacagc tttgctggac 2460 agtggagcga actgctctct gctaggagga gcaagagcca agctagcaga ggaatgcggt 2520 ttgcagaaag gaaccgtctg tggtggaata aaaacggcgg acggcacaaa acatagaata 2580 acaaatttcg tctatcttcc tatcgtatac aacaaccgca acgaagttct tccagtgcta 2640 ctggtgcctt ccattccgga ctgcatcatt ctcggaatga acttctggga aaagtttgga 2700 gttaaagcgg tgtgttgcac agtcgaatcc cagcaggaag aggaagaagg cagcgaagaa 2760 caagagatga agcagctgag tcccgaacag aagagacgtc tagagcaggc gatccaaaag 2820 ttcccgaaag cggttgaagg caagctagga cgtaccaggc tgtacgagca tcgaatagac 2880 gtaggaaacg caccgccacg gaagcaacgg cactatccca tgtcgccgta tgttctgcag 2940 gaggtgaata gagagatcga ccgaatgtta gcgctggacg tgatcgagga ggcgcagttc 3000 tcgccatgga acaacccgct ggtcgccgtc aagaagaaga ctgggcaata tcgcgtttgc 3060 ctggacgcta gacacctcaa ctccatcatg gtgaacgagg gctacccgat tccccagatc 3120 gcggcgatta ccaacaacct ccggagcagc aagtatatct cgtccatcga cctcaaggat 3180 gccttctggc aactacctct ccattccggt tcgaggccgc taacagcatt caccgtccca 3240 tccagaggcc acttccagtt caaggtggtc ccattcggac tatgtacagc tagccaagct 3300 ctggcgcgac ttatgacaca cctattcgcc gacttagaac cgctggtatt tcattacctc 3360 gatgacatca tcatctgctc cgagaccttc aaagaacaca tagcacttct tgaagaagtt 3420 gcccgccggc tacgccaagc aaatctcacg atttcggcgg aaaaatcgaa gttctgcagg 3480 aagagcatga agtacctcgg ctacgtgatt gacgagcaag gctggcgtgt agatgatgaa 3540 aaaatccaag caatcgtgca gtttccaact ccaacgacaa ggaaagaagt acgttttctc 3600 ggtgtctgca attggtaccg acgctttatc gccggattct cgcaactggc ggctccgtta 3660 acgaacctga cgtcaggcaa ggccaagttc cgctggaatc cgatcgccga agaagcgttc 3720 ctcaagctaa aagcagccct agtttcggcg ccggtgctag cgatgcccga ctacagcaag 3780 ccattcgcca tagcgtgcga tgccagtgat actgcaattg gagcggtgct aacgcaggag 3840 attaacgacg aggagcaccc aatcagctat ttctcgcaga agctgtcagc gtcggaaaga 3900 aagtactcag tcacggagcg agagtgtctt gcggtgatcc gagcgatcga gaagtttaga 3960 ggatacgtag aaggaatccg attcgtcgtg cactgcgatc atgcagcgct cagctacctg 4020 aagtccatga agaacccgac ggcgctgatg agtcgttggc tgctacgtct gaacgcgttc 4080 gactttgaga tccggtacag gaagggatct atcaacgttg ttccagatgc tctctcgaga 4140 acggtggctg aagcggtgct cacggttgat caggtgcaag atccgtggta caagcagcta 4200 atcgaaaggg taaagagcga aggagaccag ttcccggact ttcgcattgc aaacgatacg 4260 ctgttcaaga actgccgctg caaggacgaa gtaggagcgg tctatcacaa gtggaagcaa 4320 tgcgttccga aagaggagag acttcagttg atccgacgat tccacgacga gccagcagca 4380 gcgcatctag gcttctacaa aacctggcac aagctccaag cccactacta ctggccgcaa 4440 atgcagctgg agattaacga ctacgtgagg aactgtgcga cctgtaaagc gtgcaaagca 4500 ccagggaaga gaatgatgcc gcagatggga aacccgaagc cggcgaagac tccttgggag 4560 atgatttcgg tggattttgt cggtccgctt acacgttcca aacgaggaaa tacggtgctg 4620 ttagtagtag tagattgggt gagcaagtac gtgatcgtta agccaatgcg cgcagcggat 4680 tcgcagaaga tggtggagtt cctagaagaa gaagtttgtc taaagttctc tcgcccacgg 4740 ttaatactgt ctgacaacgg gaagcaattc gaatcgatgg tcttcaaatc ctggttggcg 4800 aagcacaaga tcggacatat gaagacggct ttctactgtc cacaggtcaa caacgcagaa 4860 cgcgtcaacc gcgtcgtagt aacctgtatt cgggcgttgc tggacggcga tcaccgagag 4920 tgggacgaaa agctaccggc aatcacggcg gcgataaccg cagctcggca cgaggcgacg 4980 ggtgtgagtc cgcacgaagc caactttggg cgaaacctgt tgttgcacac agacctctac 5040 acacagcagg agctcaacac gccagaagat ccgaaggtag cgcaggacct gagactctca 5100 gcaattcgtc gaatccagaa gctgatcatc gagcggatca agaacagcca ccagcgggcg 5160 aagcagcgat acaaccttcg cacaagaaca gtagcgttca aggcaggaga tttggtttgg 5220 cgacgatcgt tcgttctctc gtcaaaggcg gatcagatca acagcaagct agagccgaaa 5280 ttcgttccgg cgatcgtgaa ggaaatcctc ggggcgaacc tctacgtgtt ggaagatgtt 5340 ctaagcggca agaaaggaag attccacgcg aaggacatca aggcagacta agcgaatgtg 5400 ttcaagctat gcaaacaggc ggtgcctgtc aacgagcaag caaaacgctg aagaaaacac 5460 catcattggg cacaattaca gcgaccagga gggcagagaa gatcgatcct gactctgcac 5520 ttccgttaca tcgatttgaa gacatcatcc acggggatct tcgttcctag aagcgagatt 5580 ccggaccggt cgagctatgt acccaggcgg tgcctgtaga ccaacaacag tagcgatcaa 5640 aaacaccatt ttgggagcat caaccacgaa gactgagtga gacgaaccgc tgcaatatta 5700 ctcacctcag caacaccgat tgtgagtcga cagcaaggat ttccgttcat ccgagcgagg 5760 ttccacaact ataaagctat gtccacaggc agtgcctgtc aaccatgagt accaagcttc 5820 aaaaacacgt accaagcggg aaaatggcaa aagtcaggtg tgatgagcaa taacttcaat 5880 atgactacct ctccacctcg acagtttcgt cagctttgtt gtagatcacg atcaacgcta 5940 gagacagtac acgatttgcg atcccatcct tcagaaatca ggacaggcac agtagcaggc 6000 aggagaaaca acaagcggta ccgaaatact tccatcgcgc caggaagcgt accgcgcgcc 6060 gttcgcgaag cgggagaaat cgcaccctgt gtctttggaa acggggagca gggatctctg 6120 cagtgcacca gctggctttc ggataggagg agtggagagt cggagatcga cgaccaacca 6180 tcggttcgac gcggctcaaa gcttacggcc actatcaggc gcggcgaaat cgcatctggt 6240 atcactgaga gaggaggagt cgaggcgagg cctacgcggg caagggagat tgggaccctg 6300 aaagccatcg tgagtaggga ggctccaaat accaacaagg agaacgcagc aagacatcag 6360 cctacagcgg gcgagggaga ttgggaccct gaaagccatc gtaaataggg aggctccaaa 6420 taccattaca agcacctagc aaatcgacgg cctacagcgg gcaagggaga ctgggaccct 6480 gaaaaccgtc gtgagtaggg aggctccaga taccaaaaac aggacaatga atcaagcgta 6540 gcgacatccc gagaagcacc atcatcaagc catggcagca tcatgccaac caggaagaga 6600 aacactcaca cacaaacaca cctagtggtc aggccgttag ccgatcgtga caaatcgagc 6660 acatcatcaa caatacgacc accctcacta cacagagcta tgagtggcac cactacgaag 6720 aaacgaaaca cgatctcggc aatccagcca atccacgact ggagcggtgc acacgaaccc 6780 tcaagtgttt ccagtcaatt cgaatgcgtg gccaagggtt aaagatctgg gggtcgtcat 6840 ggaatcttca tcccaacggt aacgcgacca ggggcgcggt aaaaccctat ttctcgggcc 6900 catgacgatt tagtcgcgga aatgcttctt ttctcagtat atccgcacaa accaaaacaa 6960 aaccattttt ccttagtctg taaatatagc ttcttaatgt ttagtgtgca attaagccta 7020 ttctagtcaa cacagtaggc tagaatccaa tttgaatgat gatttgccgt aggatttgat 7080 gaagctggac aaggactttg gtcagagacg atttgagagg ttgaaaaatc tgccaggcgc 7140 agctggcgta aaatttggta tagattaaga ttatgtaatt aaggcgatag tgaagatctg 7200 atttggcaag cagccgaggc ttgcgtatga tgaagtttag tagagttaag cacacgatgt 7260 aacttgaggt ctgctcgaga gcttagttga ccgacttgtc tctgaccatc atcctaggag 7320 gatgattaaa gatttgttgg aatttttcat gaattgtttg atcaccaatt catgaaaaat 7380 tcttccgcta cggtagagtt g 7401 // ID TBE1 repbase; DNA; INV; 4076 BP. XX AC U85403; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Oxytricha fallax transposon TBE1 fal4 insert in a TBE2 element, DE in micronuclear clone 123K1. XX KW OFU85403; TBE1. XX OS Oxytricha fallax OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Spirotrichea; Stichotrichia; Sporadotrichida; Oxytrichidae; OC Oxytrichinae; Oxytricha. XX RN [1] RP 1-4076 RA Herrick G., Cartinhour S., Dawson D., Ang D., Sheets R., Lee A. RA and Williams K.; RT "Mobile elements bounded by C4A4 telomeric repeats in Oxytricha RT fallax."; RL Cell 43 (3 Pt 2), 759-768 (1985). XX RN [2] RP 1-4076 RA Williams K., Doak G.T. and Herrick G.; RT "Developmental precise excision of Oxytricha trifallax RT telomere-bearing elements and formation of circles closed by a RT copy of the flanking target duplication."; RL EMBO J 12(12), 4593-4601 (1993). XX RN [3] RP 1-4076 RA Doak G.T., Witherspoon J.D., Doerder P.F., Williams K. RA and Herrick D.G.; RT "Conserved features of ciliate TBE1 transposons."; RL Unpublished (1997). XX RN [4] RP 1-4076 RA Doak G.T., Witherspoon J.D., Doerder P.F., Williams K. RA and Herrick G.; RT "TBE1."; RL Direct Submission to Genbank (14-JAN-1997)Oncological Sciences, RL University of Utah, Rm5C334 School of Medicine, 50 N Medical RL Drive, Salt Lake City, UT 84132, USA. XX DR GenBank; U85403; Positions 324 4399. XX SQ Sequence 4076 BP; 1255 A; 746 C; 701 G; 1374 T; 0 other; caaaacccca aaacccctta atgagatgtt ttagtgtatt gatttgtagg gaatttgtta 60 ggggttgggg ttattaatag gttttaatgt gaaatttaaa gtatatgtat ttaatttaat 120 cactcaaaat gcacttttgg atcgtgtggg agaggcctga agtttatctt acttaaccac 180 aactctatct gttctattga tatgtctgat cttgagtttg gttttattat tatttactct 240 gccatttgtt tcattgattg agagactctc gttttgtcaa taaattgctt aattctttgt 300 tttgcgtcct tctcgcacct tgtagagtag tattcgttgc gaggcatatc gcactctatt 360 attattgctg ccatagccca cacatcccat tgaaagtctc catctctcca gtcttctctc 420 tctggataat agccgtgagt ccccataaca gttccaagcc tagatacctc cactcttaag 480 gctctgttga aatcgataat tgctactctt acaggtgaaa agttcaacat tatattctca 540 ggcttgagat ccctgtgaac aaagcctagc ttgtgaagct ctttgatacc ctgtatagtt 600 tatgttgcta ttaatgcaag ctatttatga tttctttact cagttatgaa atctctgagt 660 gtcatttcat atagtgggag tctgaaagct acatgatctt tggctaggag aacagcatct 720 agctttatta agtatttgca ctctttaaac ttagactgag tgagtactta atattcatta 780 gtaaactcca tttaatactt atcatcacta aggagatagt tgaatgggta tactttgtaa 840 gcatactgat gatcctatat aactttcttg tcataggagt cctctaggat taggtagctt 900 tagagagaag tatctaaatc aatctatgca agcactggaa ttttcttgta ttatatttta 960 atcatgattt tagtgacctc tgatgccaag aggttgcgtg caatcctcac gtactctgtt 1020 tcacttgctt gttaaacgtc cgcaacttag agaggctccc ttgcttttct tttgttgagc 1080 agctttagtt gagtgtctta atcttcaatt tctatttctt tcttgtgggc ttttaataga 1140 tgcttcttta agtctcttgt atgagagtac tacttatggc actcaaagca atcatacttt 1200 ccatcaactt tgtcatgctt taagtataca gtaggcttcg cttccttatg caccttcata 1260 tgcctgtacc agctagattt atgagtgaag tctctctcac acttctcaca aaaatatact 1320 tcaacctcct ctgcttacat tgaaacttca tttctaactt tctcctcctc tctgtctccg 1380 acacacttgt ttaccattaa ttttctatta ttaatttcta tttaaataac agagacctca 1440 agtgatagct ctggctgttc tttaagtgag ttcatctcac agctatcaca ctttttattg 1500 ctgccaaagg ttagtctcat ttagtttgat ggaagtgctg ttgtttttct taaaatgtta 1560 gcgctcatat ttaaatttga atttaaaatt ataaaactgc ctctataact gagagcttat 1620 atgagagctt tgcaaggcta caagcttcaa tggaagttct tagagtatta gagaagaatt 1680 atgagtgtta agagacgatc cagctctaaa tatgggtgag gaagtaacca tttttacttt 1740 caataataag ttgattatag tacgtaccga tttaggaggt gcacctaatt aagtacgtac 1800 tgctttagga ggtattttgt gattttatga ttaaaattag ggtgctccta attaagtact 1860 agctttaatt aatgctccta attaagtact agcattttag gagctgctcc taattaagta 1920 cgtaccaatt taaatttaaa aatatggaac tataacagaa gaaattgata tcaatctctc 1980 aaatttgaaa ccagttctca aaaccttctc catatccttc ttagaggttg acctcctata 2040 ggatgcccat atttaatagc aagttttaca gaggtatata attgaaatca ataaacttaa 2100 atttgaatat catttcttag atctgaaaac aagatggagt caatgaatga tgtaatgaaa 2160 agagtgcaag cacaagaatt gattcaaaga tttagaagta aggaggattt gcaccgctac 2220 ctggtacagc aaggtaagga actcatagta actaaactta attatgtagt tggcttgttc 2280 ctgccaacaa tggaaggtac taaaatttca tttctaagag caatactctg tgaagagaag 2340 aaagcattga aaacttagga tgtaaagttt gcagagattc ctaactatcc agaaatctca 2400 gtgactaacc tatatgacga cgctattaat gacccagagg tttcgcagta cctgcctact 2460 aagaagcagt tgtcaaacaa actacctgag agaaactatt tctttggtat acttgcaaca 2520 atcaaaggtg attacttgaa agagattatt caagaatctc ataagaagag gtacagcatc 2580 tctccagaag acaagagcaa acaaggcatc aagataagtg agagctggct atctgaactg 2640 cagaagcacc cttatatatc aagtaagatt tatagtttaa ctctttagaa aaatcgggta 2700 ctggcatctt cttgatgaag gagcagagca agcttcacag agagcgaaag tcgcagacaa 2760 aatatgaagt cagcaagaga ctatattcaa atatttagcc tccagaggaa atgaacatag 2820 atgctggagc tggcagtaag aggaggaacc taggaggagg gaagtagatg gaggaagaaa 2880 agaaagagaa gccagctcct actaaagaat tctaaatgtg acttatatat tcattaactt 2940 aactctaaaa cttatttata aacattcact ctcccttctt tggcacactt ttaagccaac 3000 taattatcta ctctcttcgc tgaacttttt ataaagtcaa ctgtatcgat tggtaggtca 3060 tgcatcaagt ggtagagcag caatttctta taatgacctt ttaatatacc ccagtagaat 3120 tcaattccat taaactatgg gctgtaaggc acgttgaaca ctggcgttat gcttagttgt 3180 tcataactct tccttgtttc tttcgtctta tgaacagaaa ggttatctac gaatagtata 3240 atctcttatt cagggtactt ctctctcagt tgctctaaga acttaatata ctgctctgtc 3300 tttatagagc gaggatgaat aacatatgtt tctaaaccag catcctcgct aatacctccc 3360 agaattgcca ttgtttatac cttgactttc tagtcatata cctctatatt actattcctc 3420 ttataccaag acttctatat gaaagtattg aaggtgaaca cagcctcgtc agcatgtata 3480 aacttaatat tattctatta cgctctctgc atttaatcat aaatataatt taaattgttt 3540 ctgtaaaatg gctgagtgaa atcaatcact ctcttgatac gattcacacc tttgtaagta 3600 ataccatttc gtttataaat gttagcaaca gattatgcag atatcttcaa ctctgggaac 3660 tggcggtgaa acatagtagc tcgttggtcg agagagaagt gagcccaata cttcaaagtt 3720 ttttcattga taagatagtc aatatgcttt tataagaact ttgacttcct cttaggctta 3780 gaattcagct attattgctt taattataca gctataagac agtgggaacg aactgttgac 3840 ggacttcgct ttaagagcag agcaatagac ttatagttca gtactggttt gacttagtca 3900 ctatatacat cacatccata gcgcaattta aacactaaat ctatatactc ttcatcataa 3960 atagacccca atagcttgta attcttgtct cgcatttaat taataacccc aacccctaac 4020 aaattcccta caaatcaata cactaaaaca tctcattaag gggttttggg gttttg 4076 // ID BEL-11_DWil-I repbase; DNA; INV; 5963 BP. XX AC scaffold_181148; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-11_DWil_; KW BEL-11_DWil-LTR; BEL-11_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5963 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181148; Positions 842174 836212. XX CC Positions [4986-5573] - Integrase core CC 'GTTG' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(69..4583,4587..5948) FT /product="BEL-11_DWil-I_1p" FT /translation="MWSWPKVVEKKLEALKSQELRDVHFAIYTKFVRDNFV FT KRELRKFKGLPYDSKKRLEHIDCISKELGANALQRVCQFLEISPQEGIDCT FT EMASKIYDAISYETSGKSSNVIQASNANQSENQFENMNQLEVLRGIEAIDV FT ENVELQDEISKKEILMLGVKRQLSKAQTGGASADVIEEHEYEIQQLSLEVK FT VLNARAEALVSKKHTLNTTVALIRSSTSSQKSVMSDGDHPTNESLSISNGD FT NNIQSTVITMLARQSLGNKNLEFDGSSHLAWRHFKTQFIKSGSECQYSGEE FT DLYRLDKSLKGAARKKVDILLMSAKEPLKILAILDNIYGAPENVAFELMAK FT AQEIKYVKEIGDFHEFNCSVQNINSALEFVQCKFIDIQKLLATLIERLDES FT FLREWGSFLTARKLRAAEWNFEIFAEWTQEQSRLYHDICFYKKIITINNVD FT NEAQKKFGSQIQPQKVTQTRSTSKCFICDEDKHSLDKCSKFRKAEVKKRWD FT YASTSNACFTCLKFHKGQCNNREQCKIDNCERYHHRLLHKAKQPHLQSLPQ FT AFSGCIQEVQEAAILKMLPVILRGPTGEKEILAFLDDGSTTTMLDINVARE FT VGIIGKPEPYCFGWTEGISRYDETSEIADISITGDSKIQKLQNVRLIKDLN FT LPTQRIDYEAIQKEFPYTKMTKLNSIRNVQPHLLIGQRHATLLTTQTILNV FT DPKGPIMAKTKLGWVIHGPIKAKVREADFVCCHIEDEHLGDMLKNQFSIEG FT FGVCKPATDKTKSTEEKLAISKISSSIIRVNGQYQVNLPWKSDMVELPDSR FT QMAEQRLKCLERKLNRNSELARAYQKEVEKLLKNGYAKIVTNVNLVKSPKL FT WFLPHFAVINPKKPGKVRVVFDAAAICNGTSLNENLLTGPDWLSSIVGVLI FT RFRLNTIAFAADIKEMFNQVKIAKEDAVSQCFLWRNMNTQIPPTVYQMTSM FT IFGATSSPFIAQFVKNHNAAIHKNEAPEAAQAIIENHYVDDYLDSKATVRE FT AVEIIHAVIKIHAAGHFQIRNFICNNHDVLQGIPEDLRGPLTRIRLGEKGD FT ISQKVLGMIWDAQKDIFIFSTTTMHSISEDILNGTRMPTKRKVLAAVNSIY FT DPLGMVTPVTIKGRILMRSIWELGIGWDDVIKGDPVKKWQEWLNDLKRLHD FT VSIPRCYSQYLSIATKVELHVFCDASSKAYAAAAYIRVSYKDAEAATKYDV FT MLIASRARVSPLKHVTIPKLETLAAVMGCRLAETIENEMHLKFDKRFFWTD FT SLTVISWINSNERLQVFVEAYAAAAYIRVSYKDAEAATKYDVMLIASRARV FT LPLKHVTIPKLETLAAVMGCRLAETIENEMHLKFDKRFFWTDSLTVISWIN FT SNERLQVFVANRISEICEKSKRAEWNWVPTMENIADIITRDSPIKVATDRW FT QQGPASLGLDKKMWPQPKLKAPTEPGLMMLVHQDDHPFKILENRISKFTKL FT VRATAWVKRFIENVKAKRCKMELRRGDIVVEEYNWSLSKWLHVQAECFHND FT IIYAKTGKITGGLRLTKYCLSMSEDEILRINSRIIRAEGVVATSPIVLDGD FT QVFTKLMVQHFHNEVQHNGIQVVVHKFKSQYWMPKMNTKIKRIINECMWCR FT RRKAKPVKAQMGILPKERFSTFQKPFTSVGIDYFGPFYTKDGRGSSGRYNK FT CLRKRYGVLFTCLTSRAIHAEIAHSLTTDSCILAIRRMMARRGEIKTLWSD FT NGTNLRGASRELTEALEEIDIQAARGELTQKGIEWKFIPANSPNFGGCWER FT LIGCFKRTLEAVLSVESNPTDEVLQTLFCEAEHLVNSRPYILESDDPDDGL FT GLCPNDIIIQQSGSIEAPCKINGSSEKKTWRMTQMLVEAFWRRFIKEYIPS FT LVNRSKWEKNCDNLKEGELVVVKCMTPRGEWPIGIVLKTHPGEDGIVRVAD FT IKTTKNTYRRPLRNLCRLGLESRDIKDRAEDLVQH" XX SQ Sequence 5963 BP; 2098 A; 1143 C; 1296 G; 1426 T; 0 other; caacataaaa ttggtccttc gagccggatc attaacgcgc ataatacagt ccacaatagt 60 taataaaaat gtggagttgg cctaaagtgg tagaaaagaa attggaggcg ttgaagtcgc 120 aagaattgcg cgatgttcat ttcgccatat atacaaaatt tgttagagat aatttcgtaa 180 agcgcgaact acgaaaattc aagggtttac cttacgattc caaaaagagg ctggaacaca 240 tcgactgcat ctcgaaagag ctcggcgcaa acgctttgca acgagtatgc caattcctag 300 aaatttcgcc gcaagaaggg atagactgta ccgagatggc ctcaaaaatt tatgacgcca 360 tttcgtacga aacttctggc aaatcaagca acgtcataca agcatcaaac gcgaaccagt 420 ccgagaacca gtttgaaaac atgaaccagt tggaagtatt aaggggaata gaagccatcg 480 atgtagaaaa cgtggaactt caagacgaaa tctcaaaaaa ggaaatcttg atgcttggtg 540 taaaaagaca gctaagcaaa gcccaaactg ggggcgcttc tgctgacgtc atcgaagagc 600 atgaatatga aatacaacag ctgtcattag aggtgaaagt tctgaatgca cgcgcggaag 660 ctttggtgag caaaaaacat actttgaaca ctactgtagc actgattcgt tcgtctacct 720 cctcccaaaa atctgtcatg tctgatggag accacccgac aaatgaatcg ctatcgatct 780 ccaacggcga caacaacata caaagcactg taatcaccat gttagccaga caaagcttag 840 gcaataaaaa tctggaattt gacggttcct ctcatttggc ttggcgacat tttaagacgc 900 agttcataaa atcggggagc gagtgtcagt attctggaga agaggattta tatcgtctcg 960 ataagtcact gaaaggggca gcgcgtaaga aggttgacat tttgttaatg tcggccaagg 1020 agccattgaa aattctagcc attttggaca acatatacgg agctccagaa aatgtggcgt 1080 tcgaactgat ggcaaaagca caggaaataa aatatgtaaa ggaaattgga gattttcacg 1140 agttcaactg ctctgttcaa aatatcaatt cagcgctgga gtttgtacaa tgcaagttca 1200 tcgatataca aaagttgctt gcaaccctaa ttgagcggct agatgaaagt tttcttcgag 1260 aatggggatc atttttaacc gctcgaaaat taagagctgc agaatggaat tttgaaatat 1320 ttgctgaatg gacgcaggag cagtcgagat tgtatcatga tatatgtttt tataagaaga 1380 taattacaat caataacgta gataatgaag cacaaaagaa gttcggatca caaattcaac 1440 cacaaaaggt tacacaaact cgctcaacca gcaagtgttt tatttgtgat gaggacaagc 1500 attcccttga taaatgtagt aaattcagaa aagcagaagt gaagaaacgg tgggattacg 1560 caagcacaag taacgcctgc ttcacatgtc taaaatttca taagggtcaa tgcaataatc 1620 gtgagcaatg caagatcgac aactgtgaac gatatcatca tcgcttacta cataaggcaa 1680 aacagcctca tctacaatcc ttgccgcaag cttttagcgg atgcattcaa gaagtccaag 1740 aagcagcaat tcttaaaatg ttacctgtta tattacgtgg tccaactggc gaaaaagaaa 1800 ttctggcttt tctcgacgat gggagcacaa caacaatgtt ggatattaat gtagcacgtg 1860 aggtcggaat aattgggaaa cctgaaccgt attgctttgg atggactgaa ggaatctcta 1920 gatatgatga aacttcagaa atcgcagata tttctattac tggcgatagc aaaattcaga 1980 agcttcaaaa tgtacgtctt atcaaggatt tgaatttacc aactcaacga atcgattacg 2040 aagctataca gaaagaattt ccatatacaa aaatgacaaa attgaactcg attcggaatg 2100 ttcagcctca tctcctcatt ggacaacgac atgcaacact ccttacaact caaaccatac 2160 taaatgtaga tccaaagggt ccgattatgg caaaaaccaa gctcggatgg gtcatacacg 2220 gtccgataaa agcaaaagta agagaagcag actttgtatg ctgccacatc gaggatgaac 2280 atctgggcga tatgcttaaa aaccaattta gcatagaagg atttggtgtc tgtaaaccag 2340 caacggacaa aactaaatca acagaagaaa aattggctat tagcaagata tcctcatcta 2400 taataagagt aaatggacaa tatcaggtaa acttaccctg gaaaagcgac atggtggaac 2460 tccccgatag tcgccaaatg gcagaacaaa gactaaaatg tttagaaaga aagctgaaca 2520 gaaacagcga actagcgcga gcataccaaa aggaagtgga gaagctactc aagaatggtt 2580 atgcaaaaat agtcactaat gtcaacttgg ttaagtcccc aaaactgtgg ttcctgccac 2640 attttgcagt aattaatccc aagaaaccgg gaaaagttcg agtagtgttc gacgcagcag 2700 ctatttgtaa tggcacttcc ttaaacgaaa atctactaac tgggccggat tggctttcat 2760 caatcgtagg cgtgctaata agatttagat taaacacaat agcctttgct gcagatatta 2820 aagaaatgtt caaccaggta aaaatcgcca aggaagatgc agtctcacaa tgctttttat 2880 ggagaaatat gaatacccaa ataccaccaa cagtctacca gatgacgtca atgatattcg 2940 gagcaacctc ctcaccattt atagcacaat ttgtaaaaaa ccacaatgca gcgatacata 3000 aaaatgaggc accagaagca gctcaagcca tcattgaaaa ccattacgtc gacgactatc 3060 tggattcaaa agcaacagtt cgcgaagctg ttgagataat tcatgcagta attaaaatac 3120 acgcagcggg acattttcaa attcgaaatt ttatttgtaa caatcatgat gtcttacagg 3180 gaatacccga agatctacgt ggtccgttaa cacggattcg attgggcgaa aaaggagaca 3240 tatctcaaaa ggttctcggc atgatttggg acgcacaaaa agacatattt attttctcaa 3300 caactacaat gcacagcata agcgaggata tactgaatgg aacaaggatg cctacaaaaa 3360 gaaaggtgct tgcagcagtc aactcaatat atgatccctt gggaatggtg actccagtaa 3420 caataaaagg tcggatcttg atgcgtagta tttgggaact tggcattgga tgggatgatg 3480 tgattaaagg cgaccctgtc aaaaagtggc aagagtggtt aaacgatctg aaaagactac 3540 atgatgttag tataccaaga tgttactccc aatatttatc tattgctacg aaagtcgaac 3600 tacacgtttt ttgcgacgct agcagtaaag catacgctgc agcagcatat attcgagtaa 3660 gttacaagga tgcagaagcc gcaacaaaat atgacgtcat gcttatagca tctcgggctc 3720 gagtttcacc tttgaaacat gttaccattc caaaacttga aacattggca gctgtgatgg 3780 gatgtagact ggcggaaact attgaaaacg aaatgcatct aaaatttgat aaaaggtttt 3840 tctggacaga ctccctaaca gtgatttcat ggataaacag taacgaaagg ctgcaagttt 3900 tcgtagaagc atacgctgca gcagcatata ttcgagtaag ttacaaggat gcagaagccg 3960 caacaaaata tgacgtcatg cttatagcat ctcgggctcg agttttacct ttgaaacatg 4020 ttaccattcc aaaacttgaa acattggcag ctgtgatggg atgtagactg gcggaaacta 4080 ttgaaaacga aatgcatcta aaatttgata aaaggttttt ctggacagac tccctaacag 4140 tgatttcatg gataaacagt aacgaaaggc tgcaagtttt cgtagccaac agaataagcg 4200 aaatatgcga aaagtccaag agggcagaat ggaactgggt tccaaccatg gaaaacattg 4260 ctgacatcat aacaagagat tcgccaataa aagttgcaac agacagatgg caacaaggtc 4320 cagcctctct tggtctggat aaaaagatgt ggccacagcc aaaattaaag gcaccaaccg 4380 aaccagggct tatgatgctg gttcaccaag atgaccaccc atttaaaata ctggaaaata 4440 gaatttcgaa gttcacaaaa ctggtaagag ccacagcatg ggtcaaaaga tttatagaaa 4500 atgtgaaagc gaaacgttgt aagatggaac tccgtcgggg cgatatcgtc gtggaggagt 4560 ataattggtc tctatcaaaa tggtaactcc atgtccaagc ggaatgcttt cacaatgaca 4620 taatctatgc taagactggc aagatcacag ggggtttacg cctcactaaa tattgtcttt 4680 ctatgagtga agatgaaatt ctacgtataa acagccgaat aatacgtgca gaaggcgtcg 4740 tcgcgacaag tcctattgtg ttggatggag atcaagtatt cacaaaattg atggtacaac 4800 attttcacaa cgaagtgcag cataacggaa tacaagtcgt cgtacataaa ttcaagtcgc 4860 aatattggat gccgaagatg aataccaaga tcaaaagaat tatcaacgaa tgtatgtggt 4920 gtagacgacg caaagccaaa ccggttaaag cgcaaatggg tatattaccc aaggagaggt 4980 tctctacatt tcaaaaacca ttcacatccg ttggcataga ttatttcggc ccgttctata 5040 cgaaagatgg tcggggctcc agcggcagat acaataaatg tctgcgaaag cgatatggcg 5100 ttctctttac atgcttaaca tctagagcaa tacatgcgga aatcgcacac tctttgacta 5160 cagactcgtg cattttggca atccgtcgaa tgatggcaag gcgaggcgaa attaaaacat 5220 tatggtccga taatggtact aatcttagag gcgcttcaag agaactaaca gaagctctag 5280 aggaaataga tatacaagca gcgcgcggag aactaacaca aaagggaatc gaatggaaat 5340 ttataccagc aaactcacca aattttggag gttgctggga aaggctaatt ggatgcttta 5400 aaagaacatt agaggccgta ctatccgtcg aaagtaatcc cactgacgaa gtgctgcaaa 5460 ctctcttttg tgaagcggaa catttggtaa atagtcgacc gtatattctc gaatcggatg 5520 atccagatga cggcttagga ttatgcccca atgatataat catacaacaa agtggatcta 5580 ttgaagcacc atgcaaaatt aacggatcca gtgaaaagaa gacatggagg atgacgcaaa 5640 tgttagtgga agcattctgg agacgattca ttaaggaata tatcccaagt ttagttaacc 5700 gatcgaaatg ggaaaagaat tgcgacaacc tgaaagaggg tgaactcgtc gtagtaaaat 5760 gcatgacacc aagaggtgaa tggccaattg gaatagtact caaaacacac ccaggagaag 5820 atggaatagt gagagtagca gacataaaaa caaccaaaaa cacttatcgt cggccactaa 5880 gaaacctatg cagattagga ctggaatccc gcgacataaa ggatcgtgct gaagacttgg 5940 ttcaacacta gggggcggaa ata 5963 // ID Copia-23_CQ-I repbase; DNA; INV; 3481 BP. XX AC AAWU01016733; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-23_CQ_; KW Copia-23_CQ-LTR; Copia-23_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3481 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 361-361 (2011). XX DR GenBank; AAWU01016733; Positions 9201 12681. XX CC Positions [885-1409] - Integrase core CC 'GTATC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 879..2444 FT /product="Copia-23_CQ-I_2p" FT /translation="MARPPFPAAAVKRSKEILDIVHSDVCGPMTPTPGGCK FT YYMTMIDDHSRYTVVYFLKAKSEVEERIRQYVRFVQTQFGRKPKVVRSDRG FT GEYTGNSLRKFYEDEGIKAEFTAGYTPQQNGSAERKNRTLNEMGLCMLLDA FT GLDRQFWAEAVNTAAYIQNRLPTKAIETTPFEMWFGEKPDLQHMKIFGCHA FT YVWTPSQKRKKLDAKSEKMVFVGYSSEHKAYRLLNPKSGKIVVSRDVKFLE FT LTPPSKKSEAEEIDQPEEVPVHLNAANDESESDDEFFGFDETTDESGDEFG FT GQPPNEAAPGPDGPNEDVAAQEEEPALLQEEQLDQNNSPVAQPVPLGPRRS FT TREGRGVPAARYEGSIGIAAGSTAEPRTYDEAVNSVESVQWKTAMEEEMAA FT HRENGTWKLTQLPTNRRDIGCKWVFKRKEDETGEVARFKARLVAQGFSQQY FT GSDYDQVFAPVVKQVTLRAVLTLASKRQMIVRHIDIRTAYLYGELQEELYM FT RQPPGYSNGDPSVVCRLCGTTASTRR" XX SQ Sequence 3481 BP; 903 A; 852 C; 1039 G; 687 T; 0 other; ggttatgggc gcagcccgag ccccgcggaa gattgaggtt acgttttatt tctaatcgtt 60 tcggaaaagt tggagtacgt ggtttcgcgc gtgaagagat aaaagttttt ttttgtgctc 120 ggccgggatt gttgagtgaa caatggattt ggcgcgagtt actaaattgg gtgacacgaa 180 ttacgaaacg tgggcttttt cggtcaaggc gctcttgcgc cgtctgaaac tctggaagta 240 cgttgagccg ggtactcccc ccgccgcgcc gagtgcggaa tggtccaccg gagacgagga 300 tgctctcacg acaatccaat tggtggtgga ggaaagccag cagagcgtca tccgcgataa 360 ggtgactgcg aaggccacct gggaggcact gaaggaacgc cacagtaagc tgcggctcgg 420 ccagatgatc atcatacaaa tcgccactca gaactaccaa gatggcgaca gtattgagac 480 ttatttgagc tccatagaaa agctttacgc tcgtttggac aacgctggtg tgcggatgca 540 ggagtgcatc aaggtcgggc tgatcttacg ggggctgcca ccgtcgtatc gcccacttat 600 catggtactg gaagcgcgtg acgaagagga cctgacgttg gccatggtca aggcgaagct 660 gttagacgag gcgaacaact tccagcacca gcaggcacgt gggcaggccg agcaagcgct 720 caaggtgaaa acccggccaa aggtgctggt gcgctaccgc tgcggtaagg cagggcatcg 780 caaggcagat tgcgacgagt cggacgacga gggatccaag aagttttcga agaagaagcc 840 aaaagaaaag gcaaaacaag tgctgtatcg agggcaatat ggcgcgacct ccgtttccgg 900 ctgccgcggt aaaacgatcc aaggagatcc tcgacatcgt gcacagcgac gtttgcggcc 960 ccatgacgcc aacacccggt gggtgtaagt attacatgac aatgattgac gatcacagcc 1020 gttacactgt tgtgtatttc ctcaaagcta aatccgaggt tgaggaaaga atccgccagt 1080 acgtacgctt cgtgcagacc cagttcggcc ggaagccgaa agtcgtgcga tcggatcgtg 1140 gcggcgagta cacgggaaac tcactccgga aattctacga agacgaagga atcaaggcgg 1200 agtttacggc cggctacact ccacagcaaa atggaagtgc cgaacggaag aacaggacgc 1260 tgaacgagat ggggctgtgc atgctgctgg atgccgggtt ggatcgacaa ttctgggctg 1320 aagcagtaaa cacggcggct tacatccaga atcgcctgcc gacgaaggcc atcgagacaa 1380 caccgttcga gatgtggttc ggcgagaagc ccgacttgca gcacatgaag atcttcggat 1440 gccacgccta cgtttggacg ccctcgcaaa agcgaaagaa gctggatgca aaatccgaaa 1500 agatggtgtt cgtcggttac tcgtcagagc acaaagcgta ccgactgctc aatccgaaat 1560 ccggtaaaat cgtcgtgagc cgagatgtta agttcctgga gctaacccca ccgagcaaga 1620 aaagtgaagc agaggaaatc gatcaaccgg aagaagttcc tgtccacttg aacgcggcga 1680 acgacgaatc ggaatccgac gacgagtttt tcggatttga tgagacaacg gacgaatcgg 1740 gcgacgagtt tggaggtcaa ccaccgaacg aagctgcacc cggtccggat ggaccgaacg 1800 aagatgtagc tgcccaagaa gaggaacctg cgcttcttca ggaggagcag ttggaccaaa 1860 ataactcgcc ggtagctcaa ccagtgccgc ttggtcccag acggtcaacg cgagaaggac 1920 ggggtgttcc agctgcaagg tacgaagggt cgatcggaat cgcagctggt tcaacagcag 1980 agccccgcac ctacgacgaa gcggtgaaca gtgttgaaag tgtgcaatgg aagaccgcta 2040 tggaagagga gatggccgcc caccgagaga acggcacctg gaagctcaca caattaccaa 2100 caaatcgacg agatattggt tgcaagtggg tgttcaagcg gaaggaggac gaaactggcg 2160 aagttgcccg cttcaaagca cgattggtcg cacaaggctt ctcgcagcag tacggaagcg 2220 actacgacca ggtgtttgct ccagttgtga aacaggtgac gctgcgggct gtactcacct 2280 tggcaagcaa gaggcagatg atagtgcggc acatcgacat ccggacggcg tatctgtacg 2340 gtgagctgca ggaggagcta tacatgcgac agccgcctgg gtacagcaac ggagatccga 2400 gtgtcgtctg ccgtctgtgt ggaacaaccg catcgacacg gcgttgaaaa gcatagagtt 2460 cacgcgttct tctgcagatc aatgcctcta cacccgaggc acgatttgcg tcctcatcta 2520 tgtcgacgac atcgttgtcg cttgccacac atctgaggaa tacgaatcga ttgtgaaatt 2580 gctgcgccag agtttcaagg tcgtcgagct cggtgacatc aagtttttcc tgggaatcca 2640 cgtcaggaag ggagatggcc actatgtcct gagccagaag tcgtacattt cgaaaactct 2700 cgctcgattt ggtatggacc agtgcaagac gtcgaagatc ccgatggcaa gcggctttct 2760 acaacaaagg gaggaggatg gcgaagctct tttgaaccca ggacagttcc agagcctcat 2820 cggtgttttg ttgtacatcg ccgtcaacac tcgtccggat attagcatcg ccacttccat 2880 tctcggacga cgtgtgacaa ccgcaacaac agcagactgg accgaaggca agaaggtgct 2940 gcggtatctg aaaggtactc tagactacga gttacatctg ggaggcaagc agaatttgca 3000 gttcgagtgc tttgtcgatg cggattgggc aggtgaggca tccgatcaca agtccaacac 3060 tgggttcatc ttcaaactcg gaggaggact gctcagctgg ggctgtcgga agcaaacagc 3120 agttgcgctg tccagcaccg aagcagaata cgtggcgctc gcggagtgtc tccaggagct 3180 catgtggctc cgcaaactga tggtggatct gtcggaacca cttcctgagc cgatggtggt 3240 ctacgaggat aaccagagtt gcatcgctct gacagctgcc gacaggactt cgcgtagatc 3300 gaagcacatc gacacaaagt actgtttcgt gaaggatttg gtcaacgaag gagtggttgc 3360 ggtcaagtac tgttcaacag acctgatgga tgctgacttg ctgaccaagc ccctgggagc 3420 tgtgaaactt caacatttcc ggtcggcgat tggagtaaag tgcgtcgacg ttgaggagga 3480 g 3481 // ID CR1-3_IS repbase; DNA; INV; 2987 BP. XX AC ABJB010111403; XX DT 18-FEB-2011 (Rel. 16.02, Created) DT 18-FEB-2011 (Rel. 16.02, Last updated, Version 2) XX DE CR1-like autonomous non-LTR retrotransposon from deer tick. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-3_IS. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-2987 RA Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from animals."; RL Direct Submission to Repbase Update (18-FEB-2011). XX DR EMBL/GenBank/DDBJ; ABJB010111403; Positions 11507 14493. XX CC The 5'-terminal portion may be incomplete. XX FH Key Location/Qualifiers FT CDS 121..2844 FT /product="CR1-3_IS_1p" FT /translation="MVMVNKKWNIKNSKKITFSRAEAVQIDIENEKEQIQI FT LAVYRAPQGNIAEFIDDIGNWIKVENSKNLIITGDINIDITKQNNLVDKYP FT NCLAEHGLQSMIKDITREEILGLQLARSCIDHICIRSKKYCKAHVIAEKLA FT DHYFVGIAIPELGIPESERNQVTTYINNAKVDKYVREINWESIANEEDPEI FT IYNKIKNAFTDIYKKSSEQVTVKKRKTGNQWVTKEIDQLIRKKNKAWRLLK FT SSPGSSQRKEEYRSIRNKVTSIIKKEKRNYFRKKFEDTKDLSQCWSVINQV FT LGRNTRLSIDQTIKNNFGTYCNDEKIANDFNNEFIQSIELLCNHGYQNETE FT TRARIGNSMYLPEMTEEDLVEIVSNMKTNKAVGTDSIRPRDVKENVNIIKT FT VLLALYNKSLAIGKVPSGLKTAVVRPIYKKGDKKTLLNYRPVCILPVLNYI FT MEKYVYEALNGFINKYNLLTKTQYGFKTNSSTITLLEDFSDFVNKKLDEGN FT YVLALFIDFSRAFETERHETLIDRLFSLGVRGKLLDWFKDYLSGRTQVVKI FT ASSFSEKRLVKQGVPQGTVLGPILFNIYINDIVNVHLNSVLFQFADDTVIL FT YADNDYTTAENKIQEDVSKLVKWYKENQIIINTKKTELMCFRSPQRRTQME FT GNIKIHSDSCKNCNCEPLKLTTSVKYLGLYIDEHLKWNQHVEHICKRLRSV FT AAQLYHLRSCLPQPILKMIYHSLGKSVLVYGITIYGHCAKYLQYKINSILN FT RVVKNLYVDEHRPLHYMYQHLDIQKLSDLLELNIIIQHQFSDKYKTKYIPI FT RNLRPKNLFQEPKVKTKYGKAVRASYVPRVFNKLPPRIIEITKISELKREL FT KQWFINRKEQQLAEEKSTRGWNLNSVVLCVLFCRELWSKGVYMSCFHFFCL FT FARN" XX SQ Sequence 2987 BP; 1198 A; 426 C; 575 G; 788 T; 0 other; tctattgatt gcttagtcat atctgaaacg aacataaagg actgcgacga agcattattc 60 caaaataaaa gttttgaaac gattacaaat aacaggaaga acaagcgagg aggtggaatc 120 atggttatgg tcaataaaaa gtggaatata aagaatagta aaaaaataac tttttcgagg 180 gcagaagcag tacaaataga tattgagaat gaaaaggaac agatacaaat tctggcagta 240 tatagggcac ctcaaggaaa tatagcagaa ttcattgacg atatcggtaa ctggataaag 300 gtagagaatt cgaaaaattt aataattact ggagatataa atattgatat aactaagcaa 360 aataatctag tagacaagta cccaaattgt ttggctgaac atggacttca atcaatgatt 420 aaagatatta cgagagaaga aattttgggt ttgcaattgg caagatcatg tattgaccat 480 atttgcatta ggagcaaaaa atactgtaaa gctcacgtca ttgctgaaaa attagccgac 540 cactactttg tgggaatagc aattccagaa ttgggtattc ctgaaagcga aaggaaccaa 600 gtaactacat atatcaacaa cgcaaaagta gataaatacg tgagagagat aaactgggaa 660 agcatagcca atgaagaaga ccccgaaatt atatacaata aaataaaaaa tgcgtttacg 720 gatatatata aaaagtcgtc tgaacaggtc accgtaaaga aaagaaaaac agggaaccaa 780 tgggtgacaa aagaaataga tcaactaatc aggaaaaaaa acaaagcgtg gcgactactt 840 aaaagcagtc caggaagcag ccaacgcaag gaagagtaca gatccataag aaataaagtg 900 acatcaataa taaaaaagga gaaaagaaat tatttcagga aaaagtttga ggacaccaag 960 gatctatctc aatgttggtc tgtaattaat caagtactgg gtagaaacac tagactgtca 1020 atagatcaaa caatcaaaaa taactttggg acatactgca atgatgagaa aatagcaaat 1080 gattttaata acgaatttat acaatcaata gaattacttt gcaatcatgg ataccaaaat 1140 gaaacggaaa ctagagctag aattggaaac tcaatgtatc tccctgaaat gaccgaggaa 1200 gatctagtag aaatagtaag taacatgaaa acaaacaaag cagttggaac cgactcgatc 1260 agaccgcgcg acgtcaaaga aaatgtaaat ataattaaga cagtattgct agcattgtat 1320 aataagagtt tggccatcgg gaaggtacca agtggtttaa aaacagctgt ggtaagaccg 1380 atttacaaga aaggggataa aaaaactctt cttaactata ggcctgtctg tattcttcca 1440 gtccttaatt acataatgga aaaatatgtc tatgaagcgt taaatggctt cattaataaa 1500 tataacctat taacaaaaac gcaatatggt ttcaagacaa attcaagtac aattactcta 1560 ctagaagact ttagtgattt cgtaaataag aaactagacg aagggaatta tgtgctagca 1620 ctgtttatag atttttctcg ggcatttgaa acagaaagac atgaaacact aatagacaga 1680 ctattctctt taggtgtcag aggcaaacta ttagactggt tcaaggacta cctttctggt 1740 cggactcagg ttgtcaagat agcaagtagc ttcagtgaaa agagactagt aaaacaggga 1800 gtgcctcaag ggactgtatt gggacctata ttgtttaata tatatattaa tgatattgtg 1860 aatgttcatt taaatagcgt gttgtttcag tttgcagatg acaccgttat tctgtatgca 1920 gataacgatt atacgacagc cgaaaataaa attcaagagg atgtttccaa attggttaag 1980 tggtacaaag aaaatcaaat tataattaac acaaagaaaa cagagttgat gtgttttaga 2040 agtccacaaa gacgaactca aatggaaggg aatataaaga tacatagtga ctcatgtaaa 2100 aattgtaatt gtgaacccct taaactaact acgagtgtga aatacctagg actatacatt 2160 gacgagcatc tcaaatggaa ccaacatgtg gaacatatct gtaaacgact tagatcagtt 2220 gcagcccaat tataccacct aagatcgtgt ttacctcaac caatattaaa gatgatatat 2280 catagcttag gaaaatcagt cttagtatat ggaattacta tatacgggca ctgtgcaaag 2340 tatttacaat ataaaataaa ttctatctta aatagagtag taaaaaactt gtatgtagat 2400 gaacatagac ctttacatta catgtatcaa catttggata ttcaaaaact tagtgactta 2460 ctggaattaa atattataat tcaacatcag ttcagcgata aatataaaac aaagtatatc 2520 ccgataagaa acctaaggcc taaaaattta ttccaagaac caaaggttaa gacgaaatat 2580 ggaaaagcag tcagagcatc atatgtacct agagtgttca ataagctacc accaagaata 2640 atagaaatta caaaaataag cgagttaaaa cgtgagttaa aacaatggtt tattaacaga 2700 aaagagcagc aattagcaga agagaagtct accaggggtt ggaatctaaa tagtgtagta 2760 ttgtgtgtgt tattttgtag agaattatgg agtaaaggtg tatatatgtc ttgttttcat 2820 tttttttgtt tgtttgcaag gaactgaaca agtgtaaaaa aaagggtttg ttataagtga 2880 ctagagctct gtcgctgccc tacgcacaag ccgcttaggc ctatgtgggg tatgaatctc 2940 tgtaatattg tatatggaaa agatgaaata aagattatta ttattat 2987 // ID hAT-51_HM repbase; DNA; INV; 3834 BP. XX AC . XX DT 18-DEC-2008 (Rel. 13.12, Created) DT 18-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-51_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3834 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2039-2039 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(967..1542,1546..3339) FT /product="hAT-51_HM_1p" FT /translation="LIFFNSGVENDPALWPSILKQREKEELIKKGPSTLPD FT DFPRDASNNRAFPKEVLFIALPNGEKVQRDYLIWSPSSNSLLCFPCSLFGC FT ENSGSYGDRSLLLRWNGGIKNDWRKLSDRIKSHHSNPVHQSHYLNWKTLHS FT ALVDQRGIDSEFQKALEKEVARWREILRCILDAILFLTSQNLSFRGKYSRP FT ELILKNKFLFLVLQNFYYLHLTNLENIVLGQYSSMEKNEKGNFLSCLELIS FT RHNETLKNHLETVARHQNKGTRMQAHYLSWQSQNEFISECATLVQAAVVKE FT VKDAVYFTIITDGTPDVSHTEQITFILRFVRFNSGKMIWEVKERFLCVEDM FT EKKKGADIAYLICNVLAKVNLDLQKLRGQGYDNCSNMAGIYNGAQAHILEK FT NPFALYIPCGAHSLNLAGVHAAESCAEIKTFFGNIQALYVFFSCSPARWKI FT LHEETALSLHNLSDTRWSAHIAAVKPLVKKPREIIAALDRTVNQLDLTADM FT LNQTKSLGKYFKSFESVVLLTIWYKTLQNIDNVSRCLQSESITIEEEVILI FT QLLLDDLGRIRSSWNCILTEAKLVAGNLGFETEFKVKRTRRVKKFHEEPSN FT TAHYHDDTEKNFEVNIFNVALDHIILQIQSRFNVVKKVSSQFSFIWLQSED FT PSSQDDKARQLAKFYSNDVKEDDLIEEICHFDRLKASSLFSKGNSLNLLNQ FT IYSKGLQPIFPSICILLRIFHTMPVSVAEGERSFSKLKIIKNHFRATMGQE FT RLSNLMALSIENDLAKSLSYDEVISNFASKKARKFYLN*" XX SQ Sequence 3834 BP; 1299 A; 641 C; 700 G; 1194 T; 0 other; cagggccgcc gagagggggg ggggccacca ggacagtttg tcccgggcgg cagatgtcca 60 tggggcgcca gatgccgaag taaaataaaa tagttttttt attcttaaaa ttctataatt 120 aaaatgtata tattactcgc gttatatcat tgttttttat tcctcgaaaa gttttaaaaa 180 aatacgaact catcattgat ttcgtattct atacgctatt aaattaaaaa tgtcccttga 240 gatttttgca ttcatcagac taaagtaaaa aaaaacgcaa acgttgataa ttagcaatag 300 caaactgtga atacaattta gaaataattt tattcagaat ttagaaagta atagtaatta 360 tttttcaact attcgttatg atcgacttat ttatggtatt tcatgtttac gatttattta 420 tatttaacta ttgaaatatt gatttatgac attagctttt tgagtttatt attttttgat 480 gagtttgcgt atttgacttg gctattattt tattgatgcg tactgttatt tacagtaact 540 gttgataatg aatcgtaaaa aactttctgg tgcccaaaat cgggcattgg cttccaaaag 600 ggcaaaagac tgtgaaaaaa ataaacaaac gctccaagaa ttaggttggt tttacaaagc 660 tactaatcaa aatactgatt tgaaggtgag ttagtagaaa tatataggtc aataaatata 720 tttcttttat ttccgaagca atttaagaaa aataattaaa cattttttaa gtcggcggaa 780 ttaaataaat cagaacaaac aatggaatca tcgtattatg atgatgttga tctacattct 840 cttggttcag tggaaaatga aacagacgac agttttaacg atgaattaaa acccgaagaa 900 aataaaggtt tttatacaaa caataattca aaatagttaa ataattaatt aagtattagg 960 ttttaattaa ttttttttaa ttcaggcgtt gaaaatgacc cagcactctg gccatcgatt 1020 cttaagcaaa gagaaaaaga agagttaatc aaaaaaggtc cgtcaacttt acctgatgat 1080 ttcccacgag atgcttctaa taacagggcc tttccaaaag aagttctttt cattgcattg 1140 cctaatggag aaaaagttca acgagattat ctgatatgga gcccatcgtc taactctcta 1200 ctttgtttcc cgtgttcttt atttgggtgt gaaaattccg gatcctatgg ggatcgatct 1260 cttcttcttc gctggaatgg aggaataaaa aatgactgga gaaaattgag tgatcgtatc 1320 aaaagtcatc atagcaaccc agttcatcaa agtcattatc tcaattggaa gactttgcac 1380 tcagctcttg tcgatcagcg cgggatcgac tcagagttcc aaaaggcact agaaaaggaa 1440 gtagccagat ggcgtgaaat acttcgttgt attttagatg caattttatt tttaacatct 1500 caaaacttat cctttagagg taaatattct aggccagaac tctaaatact taagaataag 1560 ttcttatttt tagtattaca gaatttctat tatctacatt taactaactt agaaaacatt 1620 gttttagggc aatattcgtc gatggagaag aacgagaaag ggaacttttt aagttgcctt 1680 gaacttattt ctcgtcacaa cgaaacatta aagaatcatc ttgaaacagt tgctcgtcat 1740 caaaataaag gaaccagaat gcaagctcac tatctctcct ggcaatcaca aaacgaattc 1800 atcagtgaat gcgctacatt agttcaagca gcagttgtga aagaagtaaa agatgccgta 1860 tattttacga taattaccga tgggacccca gatgtatctc acaccgaaca aatcacgttt 1920 attcttcgat ttgtgcgatt caactcgggt aagatgattt gggaagttaa agaacgattt 1980 ctctgcgttg aagatatgga gaaaaagaaa ggtgcagata ttgcttattt aatttgcaac 2040 gttttagcga aagtgaatct cgatttacaa aaattgagag gccaaggata cgacaattgt 2100 tctaatatgg ctggaatcta taatggagcc caagctcata tactggaaaa aaatccgttc 2160 gctttataca ttccttgcgg tgctcatagt ttaaatttag ctggagttca tgctgctgag 2220 tcttgcgctg agatcaaaac gttttttggc aatatccaag ccctctacgt gttctttagt 2280 tgcagcccag caagatggaa aattcttcac gaagagactg ctttatcact tcacaatctc 2340 tcagatacta gatggtcagc ccacatagca gctgtaaaac ctttagtgaa aaaaccacga 2400 gaaatcattg ctgctctgga tagaaccgtt aatcaacttg acctaactgc ggacatgttg 2460 aatcaaacga aatcgttggg caaatatttt aagtcatttg aatcagtagt tcttctgacc 2520 atttggtata agacactgca aaatattgat aatgttagtc gatgtctcca gtcggagtca 2580 atcaccattg aagaagaagt catcctcatt caactacttc tagatgacct aggtcgcatt 2640 cgttcgtcgt ggaattgtat tttgactgaa gcaaagctag tggctggaaa cctaggcttt 2700 gaaacagaat ttaaagtaaa acgcacaaga agagtgaaaa aattccacga ggaaccatca 2760 aatacagccc actaccacga cgatacagaa aagaattttg aagtaaatat tttcaacgtg 2820 gcattggacc acataattct acaaattcag tcaaggttca atgtggtgaa aaaggtttct 2880 tctcagtttt cctttatctg gcttcagtct gaagaccctt cttcacaaga tgacaaagct 2940 cgtcaacttg cgaaattcta ttcaaatgac gttaaagaag acgatcttat tgaggaaatt 3000 tgccactttg accggttgaa agcatcgtcc ttgttcagta aaggcaactc tctcaatctt 3060 ttaaatcaaa tatattccaa aggcttacaa ccaattttcc cgtcgatatg catacttttg 3120 cgcatcttcc acacaatgcc agtatccgtt gcagagggtg aaagatcttt tagtaaactg 3180 aagattataa aaaaccactt cagagccacc atgggacaag agcgactttc aaatcttatg 3240 gcgttgtcta ttgaaaatga tctcgcgaaa tcattgtcat atgatgaagt aatttctaat 3300 tttgcctcga aaaaagctcg aaaattttat ttaaattagg aattgtttat gatgtggtta 3360 gttagcaaaa aatgaaataa aaaaaaaagt taaagtagct ataaaggcat actgtattgt 3420 tttccttcta tgccaagtat atgtatatat atatatatat atatatatat atatatatat 3480 atatataata tatatatata tatatatata tatatatata tatatatata tatatatata 3540 tatatatata tatatatata tatgagccgt ggcgcagtgg ttagagcatt ggactcgtga 3600 ctcaagggtt gcgggttcga tcccagctcg aagcacataa acgttatcgg tgaagttgga 3660 aacggagagc caacttaata aatgctgttc tccttgtggc ggtgctctgt gacaagacca 3720 ttaggacttc ttagagcacc ttattaactg aaaaaaaaaa gaaaaaaagc gaaatccata 3780 agggcgccaa aaaagttgtt gtcccgggcg gcttgaaggc tctcggcggc cctg 3834 // ID BEL-152_AA-LTR repbase; DNA; INV; 217 BP. XX AC supercont1.375; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-152_AA_; KW BEL-152_AA-I; BEL-152_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.375; Positions 336614 336830. XX SQ Sequence 217 BP; 43 A; 59 C; 50 G; 65 T; 0 other; tggtgaagcc aaaatcaatc agccgctgtt gcccgaccga cgggaatcgt gtgtgaccag 60 atgcgttcca catctaccgt aggtagtggt ctttttctgt tacacgcatt tggtcagtcc 120 gtcccacttg ttcatgttcg tttttgtgat acctatatgt atgtacattc ccttgcgatc 180 gcttctcgat cgccagccag ccagcaatca gctggca 217 // ID BEL-211_AA-LTR repbase; DNA; INV; 666 BP. XX AC AAGE02024031; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-211_AA_; KW BEL-211_AA-I; BEL-211_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-666 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02024031; Positions 15350 14685. XX SQ Sequence 666 BP; 237 A; 110 C; 115 G; 204 T; 0 other; tgttgcgtac gcccctcgtg cgacgctgca cctactcagc aacctaattg aaatgtttcg 60 atgacattga ctgtcaatgt actgtctgat gacatgactg tcaatgtact gtcaggagaa 120 aagtgataaa gggacaaaga gtcataccat gcattacgag ctacaacgtg aattcttata 180 ctaaaataat ttgatatttt ctaattaaaa tactaaagct attgtgaatt gattattatc 240 taaattgaaa ctagtaggta taaaataagt ctaaatttga attccatcta aaaattgcaa 300 tattatctac taggttcgtt gctgatttaa ggttagattg acagaatgac agtgcggtaa 360 actaaatgat ttgctaaaat taaggtaaat aatttgctgc tttgaattat ggtcaaactc 420 aacatggatt aatttaaaca gattagatac cgcaaatatc atcaagcgct ccttagaatc 480 cgggtaacaa tattcgatta cgtaaatata gctgaactca cgaaacgtag gttaaccaaa 540 aaatcaccca gactgaatta aactaatgaa atttataact gctattgtag gaaaatttta 600 accgttcccg ttgaataaac gagttgaaaa tcccggatat agttcgctgt cgttactgct 660 gtaaca 666 // ID Transib-N6_AAe repbase; DNA; INV; 1964 BP. XX AC . XX DT 11-OCT-2010 (Rel. 15.1, Created) DT 11-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A non-autonomous Transib DNA transposon family from Aedes DE aegypti. XX KW Transib; DNA transposon; Transposable Element; Nonautonomous; KW otherMITEs_Ele18; Transib-N6_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-1964 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1964 RA Kojima K.K. and Jurka J.; RT "Transib-type DNA transposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (11-OCT-2010). XX DR [2] (Consensus) XX CC [1] Named as otherMITEs_Ele18. CC [2] Consensus update and characterization as a non-autonomous CC Transib. ~97% identical to consensus. This consensus is ~99% CC identical to the original sequence in [1]. TSDs are 5-bp; usually CC CANTG. TIRs are 595 bp long. XX SQ Sequence 1964 BP; 656 A; 318 C; 350 G; 640 T; 0 other; cactatgggc cagggacgca atctggcggg acaaaattaa taactcggta acgaagcgtt 60 tccggtattt ggtgtcttcg gcaaagtttt ttgtaacaac gaggaccatc tactgacata 120 aacggatagt tcgtaaatcg tccgcatagg tggcgccaca atctaacttt ttactgtgac 180 gttctagaga cttggcatgt tcggcaaagt tgttcatttt gataaaataa acaactctct 240 cgaagacgtc aaaattccac agcctactgt tttcgagtta ttggcaaaat tagagaaaat 300 tagtgaaaaa acgatttttt tcaccgaatt tgacattttt cctaagaatc atgtcatata 360 atattttttg ctgataaata tataacaaac gcaagctctg gtggaaaaat tttaataata 420 cgacgcacaa aaattaagaa actatgaaaa aacttgaaaa atagagcaat ttttgaacct 480 taatatctcc aaaagcgcaa aaaatcgcaa gctcaaattt tcaggcaaca tagagcaata 540 cttgatgaaa cggatgtcaa aattccagag aggtatattt tggtgttttt gagatatttc 600 attttttaca atgattatat tactccaata cagttagttt ccatgaggaa tctgaaggga 660 cgatcgactt taagaaataa cgagtcgtgg aacaaaaact ttcggaaaac tgtttaaaaa 720 agccatcata gtagccatga aattcagttt aagagcagtt ccttgagttg atatcgagta 780 atgtagatga aagtgtagat taaaactggg tacataattt tttttcttca ataattcaaa 840 tggtatagta tgccatgata aacttgatca tgtatcagtt aattgttgtc atcatggttc 900 gaggtgagta atttattttt gtgaatgtgt tacttttaac acggaaaatc aaattttaaa 960 gcatattcat gtcgatatct ctaaatacta aaatttcaga gcgcacgatc tcttgtccgg 1020 agcttgtata tttgtttcaa agcggaaaga gcgttgatcc ttggatgttt cgaaacagag 1080 gatacagatc aaaggtatca atttggagat tccttcgatt ttgagaactg ttaggttgtt 1140 taaccttaaa tatgtgttta atcaacggat ttgcttaaat tttgagcttc cggaaaattg 1200 taacaacgaa ttcgagatga aatccaaagc cagcttgtac tgtgagcaga tataataatt 1260 tcatgaaata ataaaatata tttttccgct aattttaaaa tacacttaaa tatttgccac 1320 gaaaatatta atcgcgtaat gaaaagaaaa ccgtaaaaaa aattaaaaaa tctcaaaaac 1380 accaaaatat acctctctgg aattttgaca tccgtttcat caagtattgc tctatgttgc 1440 ctgaaaattt gagcttgcgg ttttttgcgc ttttggagat attaaggttc aaaaattgct 1500 ctatttttca agtttttcca tgatttctta agttttatgc gtcgtattat taaatttttt 1560 ccaccagagc ttgcgtttgt tatatattta tcagcaaaaa atattatatg acatggttct 1620 taggaaaaat gtcaaattcg gtgaaaaaaa tcgttttttc actaattttc tctaattttg 1680 ccaataactc gaaaacagta ggctgtggaa ttttgacgtc ttcgagagag ttgtttattt 1740 tatcaaaatg aacaactttg ccgaacatgc caagtctcta gaacgtcaca gtaaaaagtt 1800 agattgtggc gccacctatg cggacgattt acgaactatc cgtttatgtc agtagatggt 1860 cctcgttgtt acaaaaaact ttgccgaaga caccaaatac cggaaacgct tcgttaccga 1920 gttattaatt ttgtcccgcc agattgcgtc cctggcccat agtg 1964 // ID TCRP2 repbase; DNA; INV; 114 BP. XX AC M21331; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.cruzi antigen DNA repetitive sequence. XX KW Repetitive element; TCRP2; tandem repeat. XX OS Trypanosoma cruzi OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma; Schizotrypanum. XX RN [1] RP 1-114 RA Ibanez F.C., Affranchino L.J., Macina A.R., Reyes B.M., RA Leguizamon S., Camargo E.M., Aslund L., Pettersson U. et al.; RT "Multiple Trypanosoma cruzi antigens containing tandemly repeated RT amino acid sequence motifs."; RL Mol. Biochem. Parasitol 30(1), 27-33 (1988). XX DR GenBank; M21331; Positions 1 114. XX SQ Sequence 114 BP; 23 A; 39 C; 35 G; 17 T; 0 other; gccttgccgc aggaagagca agaggatgtg gggccgcgcc acgttgatcc cgaccacttc 60 cgctcgacga ctcaagacgc gtacaggccc gttgatccct cggcgtacaa gcgc 114 // ID Copia-124_AA-I repbase; DNA; INV; 4144 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-124_AA_; KW Copia-124_AA-LTR; Ty1_copia_Ele102; Copia-124_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4144 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1495-2034] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 65..970 FT /product="Copia-124_AA-I_2p" FT /translation="MDREGEVARVPQFDGSNYPQWKFRMSVLLQEHELQEC FT IEVDLVEVDELKVKEEDSAAVRDAKSKAVEKRAKKDRRCKSLLISRVSDNM FT LEYIQRQRVAERRVVCAGTSLSTQKYCKSPPLAKEAADTSTRWWLFARPLS FT HVRADCPGVQIDRCEAGGHRYSVLFAANVGSGVCDRRYRPGNKPEDKLSLD FT FVKCRLLDEEIKHSSGESDDRVIRVSMLPHSVVPARSSKRRRRVLSGSASD FT AIAKGIKLPIVRRRETMRRRRNRRKQVRIWEKARRVFASSVVRVQVCGNRV FT GSSTQDRRSI" FT CDS 1408..2934 FT /product="Copia-124_AA-I_1p" FT /translation="MDNGMVDGISAKPKKSGDEFVCEPCLAGRQTRKPFAV FT REDRQTKRVLELVHSDVCGPVTPVGIEGERYFVTFVDDWSRFTVVFLMNSK FT DEVMDKFEEYEALVSAKFGKRISRLRCDNGGEYVGREFRKFCKKKGIKIEY FT TVPYSPEQNGVSERMNRTLVEKVRSMLEDCGIGKEFWGPAIQTAAFLVNRS FT PASAIGQRKTPYEVWEGNRPNVRGLRAFGAVVHVHIPKERRKKMDAKSWKG FT IFVGYAPCGYRVWDPKQKKIVVARDVVFVENATKMSSEMDPKPERSADVFP FT IRAIREDSDVDSDAEGEEVSDESSDDAGNSDEESDESDAYESGGDDQYDSC FT ADRTIREAESEEQAEAESPGRRSGRSRKPPEWHQNYEMNYTGFALNATSFV FT DNLPSSLAEMKKRPDWDKWQAAVHEEMDSLERNQTWTLVKLPEGRVPITCK FT WLFKVKHNEGDGEERYKARLVARGFSQKQVLTMQKPILLLLGWIPFELCWQ FT LRTSTGWQCIRWM" XX SQ Sequence 4144 BP; 1061 A; 865 C; 1309 G; 906 T; 3 other; ttwggttatk gtcccagtga ccgcggctac cggaaaagcg taaaagttac cgaaaagtgc 60 gaaaatggac cgtgaaggag aagtcgctcg agtgccacag ttcgacgggt cgaactatcc 120 acagtggaag ttccggatgt cggttctcct gcaggagcac gagctccagg agtgtatcga 180 agtggatctg gtggaagttg acgagttgaa ggtgaaagaa gaggactctg ctgccgtccg 240 ggatgcgaaa tcgaaagcag tcgagaagcg ggcgaaaaag gatcgccgct gcaagtcgct 300 gctgatctcg cgagtcagcg acaacatgtt ggagtacatc caacgacaaa gagtcgccga 360 aagacgtgtg gtgtgcgctg gaacgagttt atcaacgcaa aagtattgca agtcgcctcc 420 acttgcaaaa gaagctgctg acacttcgac acggtggtgg ttgtttgcaa gaccactttc 480 tcacgttcga gcggattgtc cgggagtaca aatcgaccga tgcgaagctg gaggacatcg 540 atacagtgtg ctatttgctg ctaacgttgg gtccggagtt tgcgaccgtc gttaccgccc 600 tggaaacaag ccagaggaca agttgtcctt ggattttgtg aagtgcagat tgcttgatga 660 ggaaatcaag cacagcagtg gtgaaagtga cgatcgcgtg attcgggtgt cgatgctgcc 720 gcattcagtg gtgccggcgc gaagcagcaa aagaagaaga agagtgttaa gtggaagtgc 780 ttcggatgcg atcgcgaagg gcataaaatt gccgattgtc cggagaagag aaacaatgag 840 aagaagaagg aatcgaagaa aacaagtgcg catctgggaa aaagcgagaa gagtgtttgc 900 ttcctcagtg gtgagagtgc aggtttgcgg aaatcgagtt ggatcatcga ctcaggatcg 960 tcggagcatc tgacgaacga tcggaagcta ttcaagaagc tgtttccgat gaaggagcca 1020 atgcacatct cggtggcaaa ggaaggtgag tccatcgttg ccaaagagta tggtgacgtg 1080 gatgtgtttg ccgcagcgag gggtaagtcc attccaatta cgcttaagaa ggttcttttc 1140 attccggaag cgcgagtgaa cttgttatcc gtccggaaga tggaaatggc tggactaaaa 1200 gttgttttcg cgaatggcat cgtcastatc gagcgtgaat cgaaagtgat cgcagttggc 1260 gagcgtcgcg gtaaactgta tgaactagac ttccatcggg aaaaaccgtc tgaatcagtg 1320 ctctattcgt gcggtcgcgt gccaaaagaa cttgagttgt ggcatcgtcg ttacgggcac 1380 ttaagcgcga aaaacttaaa agtgctgatg gataacggga tggtcgacgg gataagtgct 1440 aaaccgaaga aatccggcga tgaattcgtt tgtgaaccat gtctcgccgg tagacaaacg 1500 cgtaaaccgt tcgcagtgcg cgaggacaga cagacgaagc gtgttttgga acttgtccat 1560 tcggacgttt gtggtcccgt gacgccggtt gggatcgaag gcgaaaggta ttttgttact 1620 ttcgtcgacg actggagccg gtttacagtc gtgttcctaa tgaactcgaa ggacgaagtg 1680 atggacaaat tcgaagagta cgaagcgtta gtgagtgcga agttcgggaa acgcatttcg 1740 cggttgcgct gcgataacgg tggcgagtac gtcggtcgcg agtttcgaaa gttctgcaag 1800 aagaagggga taaaaatcga gtacaccgtc ccctactctc cagagcagaa cggtgtaagc 1860 gagcgaatga accgtacgct cgtggagaag gttcgttcca tgctcgaaga ctgcggtatt 1920 gggaaggagt tctggggtcc tgcgatccag actgcggcat tcctggtgaa tcgcagccct 1980 gctagtgcga tcggtcaacg gaaaacaccg tacgaagttt gggaaggtaa tcgacctaat 2040 gttcgtggtc tgcgcgcgtt cggtgcggtg gtccacgttc acatccctaa ggaacgtagg 2100 aagaagatgg atgcgaagtc ctggaagggg attttcgtgg ggtatgctcc ctgcggatac 2160 cgagtgtggg atcccaaaca gaagaagatc gttgttgcgc gagacgttgt gtttgtggag 2220 aatgcaacga agatgagttc ggaaatggat ccgaaacccg agcgtagtgc tgacgtgttt 2280 ccgatccggg cgattcggga agattccgat gtggatagtg atgccgaagg tgaagaagtt 2340 tccgatgaat caagtgacga tgcgggtaat tcggatgaag agtcggacga aagcgatgcg 2400 tacgaaagcg gcggagatga ccagtacgac agttgtgccg acagaacgat ccgagaagca 2460 gaatctgaag agcaagcgga agcagaatcg ccggggcgcc gatccggccg aagccggaag 2520 ccgccggagt ggcatcagaa ctacgagatg aactacacgg gatttgcgct gaacgctaca 2580 agcttcgtcg acaacttgcc aagctcgctg gcagaaatga agaagagacc ggattgggat 2640 aagtggcaag cagcggttca cgaagagatg gattccctcg agcggaacca aacgtggaca 2700 cttgtcaagt tacccgaggg gagagtgcct attacgtgca agtggctgtt taaagtgaag 2760 cataacgaag gtgacggaga agagcggtat aaagccagac tcgtggctcg tggatttagt 2820 cagaaacagg ttttgactat gcagaaacct attctcctgt tgctcggttg gataccgttc 2880 gagttgtgtt ggcagttgcg aacgagcaca ggatggcagt gcatcagatg gatgtgaaga 2940 ctgccttcct caacgggcat ttggaagagg acatctatat gacccaaccg gaaggcttcg 3000 agcgtggcaa acatctggtg tgtcgactaa accggtcgct gtatggcctg aaacaggcat 3060 caagggcctg gaatgcaaga ttccacagct tcgtcgaacg attgggattc cgccgaagtt 3120 tgagcgaccc gtgtctctac gtgaagggtt ccggatgcaa ccaagttatc ctggtattgt 3180 acgtggatga tttgcttgtg gttggtcgtc agctgaaggc ggttgaagtg gtgaagcgtt 3240 gcctggccgg agagttcgag atgacggaca taggagaggt tcgacgtttt cggcatgagg 3300 attgaccggg acaccgagca gaggtccctc cggatcagtc agaagggatt tttggagaat 3360 ttgcttcgtc gtttcaacat gcaagagtgc aaggctgcat caacacctat cgagtgccgc 3420 ctgcgcctga agaaaggaga agaagctgaa cgtactgaca aaccgtaccg agagttgatc 3480 ggttgtctca cgtacgttac tttgacgtca aggccggact tgtgcgcggc agttagctac 3540 ctgagccagt tccagagctg ccctacggaa gtgcactggg tgcatgcgaa gagagtactg 3600 cggtacatta aagggacgct ggatttggga ctggtgttcg tggcaaagga gtcggcaccg 3660 gtgatcgaag cattcgccga tgcggattgg gcgaacgacc ctgtggatag gcgttcgctt 3720 acgggattcg tctttcgggt atgtggttcg actgttagtt ggcttaccag aaagcaatcg 3780 accatctcac tgtcctcgac ggaagcggag ctggtggctc tgagtacggc tgtatgtcac 3840 ggaatttggc tggagcgtct actcaaggat ttggcgatcg aaccggagcg tccggttgtc 3900 tatcacgaag ataatcaatc gacgattagg gtggcggaag aagagcgtga taccggtcgc 3960 ttgaagcacg ttgatgtaaa gcatcgattc gtccgtgaag agattcagcg ggggcgtata 4020 gcagttcgtt acatccctac cggtaagcag atagccgata tcatgaccaa agggttgccg 4080 gtaagtgtgt tccagaagca tcgagccagc ctggggttgg ctagttccgg acattgagcg 4140 gggg 4144 // ID Gypsy-183_AA-LTR repbase; DNA; INV; 1053 BP. XX AC supercont1.145; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-183_AA_; KW Gypsy-183_AA-I; Gypsy-183_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1053 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.145; Positions 1291261 1292313. XX SQ Sequence 1053 BP; 264 A; 262 C; 265 G; 262 T; 0 other; tgtaaccgtt gttacaagaa ttcaaatcct tttttttaaa ttgcaccatg tggtcgagta 60 ttcaaatatt caaatagcat aagcagtcaa aaccactggg cagttcgaca taacttaatg 120 taacggtcgt aaaatcatgg cgtgaagaag ttacagcaaa atggggacac ggtttcacga 180 atgggctgaa gggttcaaag gatgcttgct gggcggccat cgcccacttt taatcaggcg 240 agcgtgtaga ccagaagtgc tcagtggagt agtaaaattt ataaaatagt gcgagtagtg 300 aatcgtgtta agttggagag accgttcacc gcaattcagg taaaccatta gcctaatggc 360 ataaaaggga gactgaatcc taattacctg ctcgccagga cctttttttg ttcttgcttt 420 aaagcaatct ggttggcgcg tcctaagacc gccattgcac gctacggtcg aagtgtcctt 480 tgtagtcact agacctaaca gtcaaggacc attgtttcct ctggccgagg gccatcaacg 540 tcaaggagga cattactcgg gcaacggccc taattaccaa ggagtatttc ccttcacggt 600 tcgacagagg gcggccttgc cgtcgtcgac cggtttcgtc cgaaggaagc gcctgtgcat 660 ccgaaccaac tccacatcgc tgaagaccgt cattccagcc cattcacatc cgccattgcc 720 ccaagctttg tccgccggta tcggtgccat cttgccgatc gtcgcagcgt cagtaggaga 780 gaccgtcaac gccggagtga atccggtaaa tcgtcgcgag cgctcgctgg actgcagaac 840 accaggtacg tgcccgaaaa tccagtgtga cccaccagaa cagtttggcc aaatacatcg 900 ttaaaatttg aaccgctcgt gtgtgtgatt atttgaatag ttgagtgccc tttgaatccg 960 cgaagcccct ccggttgccc ggtagcgcgc cgaccctgtg actgtgtccg agtctcgcgc 1020 tgctgagttg ttgtcgggta gaatattgtt tca 1053 // ID Jockey-6_CQ repbase; DNA; INV; 4391 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-6_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4391 RA Kojima K.K. and Jurka J.; RT "Jockey non-LTR retrotransposons from the southern house RT mosquito."; RL Repbase Reports 11(1), 117-117 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 158..1474 FT /product="Jockey-6_CQ_1p" FT /translation="MKPNVRLKGGSAKNSNLTSQRVVNAAKNSRIQRQNMF FT ESGRRTRSLDRITPGGSSSGSGGYQTRSKTNGSSPFLSQNCFDALGDDDAG FT DGVQGEPEKKQPRVRIPPIHIMGKSVGEVVKLLATNGVPQSDDYWLKYTKT FT SVQFQTKVKELFTKTTALLKSSDVQFFTHDTSEDAPTKFVLSGLPAVSIPD FT LKEELEAMGILPLDIKVLSSKKSGADEHTLYLVYFKRGTVKIQDLRRTKAI FT FNVVVSWRFFSKHPNDAAQCHRCQQFGHGSSNCNLRPKCVKCGGKHLTDVC FT MLPRRAELNNNNNSKSQLKCANCGGSHTANFRGCPARKAYLDELEKRKKKP FT SRPAPPPSGPAPGRNSGAQGGPTHRSSFAATGRPTYAQVSSMNTPPCANLS FT DGEGLFTVTEFLSLARDMFSRLVGCRTKQQQFDALAELMAKYLYG" FT CDS 1470..4145 FT /product="Jockey-6_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MGSLSHQLKIVNWNSRSVLLKKIEFFDFLSRHQVDIA FT TVSETWLKTKNSFHHPDYRCFRSDRTSTDDERGGGVLIALRKSLNVDYSVL FT DLNTSAIEAVGLEIRFPTQTVHFIAAYFPGIHRGAAWSTFRRDINTLVRRP FT VPFFVAGDFNARHRQWNCLKANRAGNFLASRAVSSDFFIHAPRSFTYQPPG FT GRRPSTLDIVLSNNLVNMSSLTVVNDLSSDHLPVLFSVDVAVPFQRFPTTT FT RCYNRANWPAFQRIVNEKIDLNSPVATNLDSPAGINRCIEFFTSTLLEAEA FT AAVPVVPVRTYEDAKFPESARQLVTLRNTRRRQWYRTRDPLLGAIVESLNN FT RIRFECSIARNQHFSNNIRYLENGAKDVWKISKALRKQVKYSPPLQAGDTL FT KASPEEKANLLADNFARAHRNPSPGDPTTSSAVEASVQQIADSTIPVESVP FT VVRPKEIARIIKSLKPKKAPGKDNIRNTLLKRLPRKGLVMLTLIVNACLRT FT SYFPSSWKHAIVTAIPKPGKDITNPTNYRPISLLPVMSKILERVVLTRIGN FT FVEQRNVVPPQQFGFKRGHSTNHQLARLTRNVKSSLARGESAGMVLLDVEK FT AYDSVWQDAILHKMKLAGFPAYILKLLNSFLKQRSFQVAVDGALSRRHEIP FT FGVPQGAVLSPALYNIFTSDLVMVDGVEYYLFADDTGFVAADKSAEVIIEK FT LQAAQDSLESYQRKWRIKINPAKTQAIFFTRRRAERFLPQSRVLALGHEVP FT WSDEVKYLGLALDPKLKYDKHIDAALQKCDKLTRMLYPLVSRRSRLSNANK FT LLLYKLIFRPTLTYGFPAWHSCASTRRKKLQTRQNKILKMMLDLPFNFPTD FT ELEEAANTESLETWTSKLLQRFWTSCTMSENTLISSLVP" XX SQ Sequence 4391 BP; 1060 A; 1273 C; 1155 G; 903 T; 0 other; aattgggcat ccctagttgg atgactgttt acaagcctcc gagtgctccg cgatttttat 60 cgattttctt gttgaaatcg acgttagatt tcgcgttccg ccgtcgtcga gtaaaccggt 120 agccctggcg aaccaggaca agtggtggaa gcacggtatg aagccgaacg tgcggcttaa 180 aggcggttct gctaagaaca gcaacctaac ctcacagcgt gtggtgaatg cggcaaaaaa 240 ctcccgtatc cagaggcaga acatgttcga gagtggccgg cgaactcgtt cgctggaccg 300 gatcacgccc ggcggcagca gcagcggcag cggcggctac cagaccagga gcaaaacaaa 360 cggcagctcc ccgttcctgt cacaaaactg tttcgacgca cttggtgatg acgacgctgg 420 agacggcgta caaggagaac ctgaaaagaa acagccccgg gtaaggattc caccgatcca 480 cattatgggt aagtcggtcg gagaagtggt gaaactcctc gccacgaacg gtgttccgca 540 gagtgacgac tactggctga agtacaccaa aacatcggtg cagttccaga caaaagtgaa 600 ggaattgttt acgaagacga ctgcgctgct gaaaagtagt gatgttcaat tcttcacaca 660 cgacacatcg gaggatgccc cgacaaagtt tgtgctttcg ggtctgccgg cagtgagtat 720 tccggatctg aaggaggagc tggaggcaat ggggatcttg cctctggaca tcaaggtgct 780 gtcctcaaaa aagtcggggg cggatgaaca tacgttgtac ctcgtgtact tcaagcgcgg 840 gacagtgaaa attcaagact tgcggagaac gaaggcgatc tttaacgttg ttgtttcgtg 900 gcggttcttc tcgaagcacc ccaacgatgc tgcccaatgc caccggtgtc agcaatttgg 960 ccacgggtca tcgaactgca acctgcgccc caagtgtgtg aagtgcggcg gaaaacacct 1020 gacggatgtc tgcatgcttc cgaggagggc ggagttgaac aacaacaaca acagcaagtc 1080 gcaactgaag tgtgcgaact gtggcggaag ccacaccgct aatttccgtg ggtgcccagc 1140 tagaaaagcc tatctcgacg agctcgagaa aaggaagaaa aagccctctc gcccagcgcc 1200 gcctccgagc ggccctgccc caggccgaaa cagcggagcc caaggtggac ctactcatcg 1260 gtcctcattc gctgcaaccg gacgacccac ctacgcgcag gtgtcgtcca tgaacactcc 1320 accgtgcgct aacctttccg acggcgaagg tctcttcacc gttacggaat tcctgtccct 1380 tgccagggac atgtttagtc ggctcgttgg ctgccgcacc aagcagcagc aattcgacgc 1440 gctcgcagaa ctgatggcca aataccttta tgggtagttt gagccatcaa cttaaaatcg 1500 ttaattggaa cagcagatcc gttctgctga aaaaaatcga gtttttcgat tttttgtcgc 1560 ggcaccaagt cgacattgca acagtctctg aaacctggct caagactaag aattcgttcc 1620 atcatcccga ctaccgatgc ttccgatccg acaggacgag tactgacgac gagcgaggcg 1680 gcggtgtcct cattgccctg cgaaaaagcc tcaacgtcga ttattcggtg ctggacctca 1740 acacgagcgc catcgaagcg gttggcctag aaatccgatt ccccacgcaa accgtccact 1800 tcattgcggc ctactttcct ggaatccacc gtggagctgc ttggagcaca ttccggcggg 1860 acattaacac actggtgcgg cgccccgtgc cgtttttcgt tgctggagat ttcaacgcgc 1920 gccatcgaca gtggaactgt ctgaaggcga acagagctgg aaacttcctg gcgtctcgtg 1980 cagtttcttc ggacttcttc atccacgctc cgaggtcgtt cacctaccaa ccccccggcg 2040 gccgccggcc gtcaacgctt gacattgttc tctccaacaa cttggtgaat atgtcatcgc 2100 tgaccgtagt gaacgacctg tcgtccgacc atctgcctgt tctcttcagc gtggacgtag 2160 ccgtgccctt ccaacgcttc ccgacaacga ccagatgcta caacagggcg aactggccag 2220 ccttccagcg catcgtgaac gagaaaatcg acttgaactc ccccgtcgca accaacctgg 2280 acagccctgc aggaatcaac cgatgcatcg agttctttac aagcacgcta ctggaggccg 2340 aagccgccgc cgtccccgtt gtaccggtga gaacgtacga agatgccaag tttccggagt 2400 ccgcgcgcca gcttgtcacc ctgagaaaca cgcgccggcg ccagtggtac cgcacacgcg 2460 atccactgct cggcgcgata gttgagtcgc tgaacaaccg gattcgcttc gagtgttcta 2520 tcgccagaaa ccagcatttt tcgaacaaca tccggtacct ggagaacggc gccaaggatg 2580 tctggaagat cagcaaagcg ctccgtaaac aggtgaagta cagcccgcct ctccaggctg 2640 gcgacacgct gaaggcatcc ccggaagaga aggcgaacct gctggctgac aattttgcac 2700 gagcgcatcg caacccttct cccggcgacc cgactacgtc cagtgctgtc gaagcgagcg 2760 tacagcagat cgccgacagt accatcccgg tcgaatcagt gccagtcgtc cgccccaagg 2820 agatagcacg gatcatcaag tcgctgaagc cgaagaaggc gcctggcaag gacaacattc 2880 ggaacaccct gctcaagcgg ctcccccgca aaggactggt gatgctgacg ctgatcgtga 2940 acgcgtgtct gaggacatcg tactttccgt ccagctggaa gcacgccatc gtcacggcaa 3000 tcccaaagcc gggcaaggac atcacgaatc ccaccaacta ccgtccgatc agcttgcttc 3060 ccgtgatgag caaaatcctc gagcgtgtgg tgcttacccg tatcggaaac ttcgtcgagc 3120 aacgaaatgt cgttcctccg cagcagttcg gcttcaaacg gggccactcc acaaaccacc 3180 agctcgccag actaacgcgc aacgtcaaga gctcgctcgc ccggggcgag tcagctggca 3240 tggttcttct ggacgtggag aaggcgtacg attcggtgtg gcaggatgcg atcctgcaca 3300 agatgaagct ggccgggttc ccggcgtaca ttctgaagct gctcaactcc ttcctgaagc 3360 agcgcagctt ccaagttgcc gtcgatggtg cgctctcgcg ccgtcatgaa attcctttcg 3420 gcgtcccgca aggagctgtc ctgagtccgg cgctgtacaa catcttcacc tcggatctcg 3480 tcatggtgga cggggtcgaa tactacctct tcgccgacga cactggattc gtcgctgctg 3540 acaagagtgc agaagtgatc atcgagaagc tgcaggcggc ccaggactcg ctcgagagct 3600 atcagcggaa gtggcgaatc aaaataaacc cggcgaagac gcaggcgatc tttttcaccc 3660 ggcggcgcgc tgaaaggttc ctcccccagt cacgtgtgct ggcactgggc cacgaagtcc 3720 cctggtcgga cgaggtcaag taccttggcc tcgcgctcga cccgaaactg aagtacgaca 3780 agcacatcga cgccgcgttg cagaaatgtg acaagctgac gaggatgctc tacccgctgg 3840 tgagccgaag gtcaaggctg agcaacgcca acaaacttct gctctacaag ctcatcttcc 3900 gtccgacgct cacgtacgga ttcccagcct ggcacagctg tgcttcaacc cgccgcaaaa 3960 agctgcagac gcgccaaaac aagatcctga aaatgatgct ggacttgccg tttaatttcc 4020 cgacggacga gctcgaggag gcggccaaca cggaatcgct ggaaacctgg accagcaaac 4080 tgctacagcg gttttggacg agttgcacga tgtcagagaa tactctgata tcgagtttgg 4140 tgccgtgata ctgtgatcta gtttttaaga aacccctttt cctcctccta tcccccttct 4200 atcctagcaa aagtgacagg ttttttgcga gttgttttcc tgttttctgt ttcctttaca 4260 accaacttta tttgtatcca ttgtgataaa tgccttatac acagctgaaa ggatccccca 4320 aaactctgta ctgcacttat aaaactaatg ttagtcgata aactaaagaa ataaactgaa 4380 ttgaattgaa a 4391 // ID DNA-TA-14_CQ repbase; DNA; INV; 151 BP. XX AC . XX DT 29-DEC-2010 (Rel. 16.01, Created) DT 29-DEC-2010 (Rel. 16.01, Last updated, Version 1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA-TA-14_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-151 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 64-64 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >94% CC identity. TA TSDs. ~70-bp TIRs. XX SQ Sequence 151 BP; 57 A; 24 C; 19 G; 51 T; 0 other; cactgaaata aaaaaaagta ccaaattcct aaattctagg aattttgcct cctttcctta 60 ttaagtaatc caaaaaaccc gattactaaa taaggaaaag tggcaaaatt cctagaattt 120 aggaatttgg tacttttttt ttatttcagt g 151 // ID Zator-1_HM repbase; DNA; INV; 3381 BP. XX AC . XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Hydra magnipapillata. XX KW Zator; DNA transposon; Transposable Element; Zator-1_HM. XX NM Zator-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3381 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 791..3163 FT /product="Zator-1_HM_1p" FT /translation="MSSEKEQLYKSIYEGCLKAFTNKPKQKSQEETNKIWS FT EIKAKSKTLLDINVEVEKKLKELKQIELRNKVGILSYWTKLQNYTPKKLVI FT ESKITFDDLTDATPSITNSEDSEKGLQENSNSLKVLKRPAPKQDQLKNEIF FT ILQKDIAYFTEKRNCNLLSQEDHHILRQKQKLLIEKNRDLVLRQKATERQT FT ATRLKRKKAIEALDPDIRKKLCGIKNAVSKGRPLACDQDSLIKTIVDIALK FT GSASDDRRRTEIVRTVKTLDDLVEELKNHGYNLSRSGVYLRLLPSRSLSTE FT GKRHVKTAPVRLIRAQNSEHKYHADTKFAKSSISALEELAAVLGPKEVTFL FT SQDDKARVPIGLTAANKQAPIMMHMEYKVTLADHDFVIAPQHKLIPSVIAA FT IEIKKDCMEKQAVTYSGPTYIAIRSAKHDSSTALSHLEDINRINHLPAFKS FT SLFTEDEVSKPVMIKTVDGGPDENPRYEKTIHCAKTYFKEHDLDGMFVATN FT APGRSAFNRVERRMAPLSHDLAGIILPFNHFGSHLNSRGETTDKLMEKANF FT AKAGEVLAQIWSSTVIDGFETIAEYREPNSENRVQLKQFGAEWDAKHVKES FT QYLLQIVKCNDQNCCGHRRSSLFHLIPTGFLPPPISLFQGENGLECNNTKN FT NFASLFLNLSLSKTIFPKTITKKFPKGLPYDYSCPSIQSSLPKRICTNCSL FT YFATIKSMKQHKTVCSSIGLSNEIQKVKPHRLVAQRLSELMCVFLNSDKET FT IEWHDQDDVDTSDLNWPSIIIEEHGTPILEGQSPIWEQI*" XX SQ Sequence 3381 BP; 1241 A; 539 C; 584 G; 1017 T; 0 other; ggggctgttc aaataatacg tgacagtctt agggggggga gggagtatta gaaagtgtca 60 ctaaatgtca catgggggag gggggtggtt tgcctcagtg tgacgtgaca aaaaacattt 120 taaattaaat tgtttatcat gcgtaattaa attagtttat aatgcaaact cggaggtatt 180 tatatataat actgatttcg agaaaaatta tatgcaacga cttcacgaat agcgagcgtt 240 ttttgtttaa ctttcattta acacaaattt ttgtttttgt taaatgattg gatttgatta 300 atttatattt cctccgtaca aaattatttt tgtggaaatt gtacgcataa aaagaaactc 360 taatcctacg aacatagaac tagatagtgg gcaagtatct aaaaatgatt tatattacta 420 tcaaaaatca tattttattt ttttaacact atttatatta ataaactaaa aagttaagtg 480 ttattttcgc ttaaaattag gtcttcaaat ataatgcata aacgatatta ttctattttt 540 aatgttatta ttagtattac ttttaatatt ttgtgtcact ctattgcaaa ttattggtct 600 aattacttta ttattttatt attacgttat tattttatat ttatattttt attattcaat 660 aagttagtta gattgttaac gaattttagt attaataatg ctattgttaa cattaattca 720 taatgtgctt gtttttaatt aatggttgtt aggaaaataa atttattgtc gttgttgtta 780 tagaataaaa atgtctagtg aaaaggaaca actttacaaa tctatttatg aaggatgttt 840 aaaagcattt acaaacaaac caaagcaaaa gtctcaagaa gaaacaaata aaatctggtc 900 tgagatcaaa gcaaaaagca aaacattact cgatataaat gtagaagttg aaaaaaaatt 960 gaaagaacta aagcaaattg aacttcgaaa taaagttggt attttatctt actggacaaa 1020 attacaaaac tatacaccaa aaaagcttgt gattgaatct aaaatcactt ttgatgactt 1080 gactgatgca accccatcaa ttacaaatag tgaagacagt gaaaaaggct tacaagaaaa 1140 ctccaattcg ttaaaagttt taaaaagacc tgcgccaaaa caagatcagt taaagaatga 1200 aatttttatt ttgcaaaaag atatcgcata ttttactgaa aaaagaaact gcaacttgtt 1260 gtcacaagag gaccaccata tattaagaca aaaacaaaag cttttgatag aaaaaaaccg 1320 agatcttgta cttcgtcaaa aagccactga acgtcaaaca gcaaccaggc tgaaaagaaa 1380 aaaagcaata gaagctttag atccggatat tagaaaaaag ttgtgtggta ttaaaaatgc 1440 agtatcaaaa ggtcgaccgc tagcctgtga tcaggacagt ttaattaaaa cgattgtaga 1500 tattgccttg aaaggttctg cttctgatga taggagaaga actgaaattg ttcgaactgt 1560 aaaaacatta gatgacttgg ttgaagaatt aaaaaaccat ggttacaatc tttccagatc 1620 aggtgtttac ttaagattac ttccaagcag atctttgagc acagaaggaa aaagacatgt 1680 taaaactgca cctgttagac taataagagc acaaaattct gagcataaat atcatgcaga 1740 cacaaaattt gccaaatcat ccatcagtgc ccttgaagaa cttgctgcag tcttgggtcc 1800 aaaagaagtg acatttcttt cccaggatga taaagctaga gtgccaattg gattgacagc 1860 tgcaaacaaa caagcaccaa tcatgatgca tatggagtac aaggtgactt tagctgatca 1920 tgattttgta attgctcctc aacacaaact tattccttca gtaattgcag ccatagaaat 1980 caaaaaagat tgcatggaaa aacaagcagt cacatactct ggcccaacat acatagcaat 2040 cagaagtgca aaacatgatt catcaacagc acttagtcac ttagaagaca ttaatagaat 2100 caaccatctt cctgcattta aaagcagctt atttactgaa gatgaagtta gcaaaccagt 2160 catgattaaa actgttgatg gtgggccaga cgaaaatcca agatatgaaa aaacaattca 2220 ttgtgccaag acttatttca aagagcatga ccttgatgga atgtttgtgg caaccaatgc 2280 tcctggcaga agtgccttta acagggtaga gagaagaatg gctccgctaa gccatgacct 2340 tgcaggaatt atacttccat ttaatcattt tggaagccat cttaattcaa gaggagaaac 2400 tacagacaaa cttatggaga aggcaaattt tgccaaagct ggcgaagttc ttgcacaaat 2460 ttggtccagc acagtcattg atggatttga aactattgca gagtacagag aaccaaacag 2520 tgaaaatcgt gtacaattaa agcagtttgg agcagaatgg gatgctaaac atgtcaaaga 2580 aagtcaatac ttgcttcaaa ttgtaaaatg taatgatcaa aactgctgtg gtcatcgtcg 2640 cagctcactg ttccatctta ttccaactgg ttttcttcct ccaccaattt ctttgttcca 2700 aggagaaaat ggccttgagt gcaataatac aaagaataac tttgcaagcc tctttttaaa 2760 cttatcacta agcaaaacaa tttttcccaa gacaattaca aaaaagtttc ccaaaggatt 2820 gccatacgac tattcatgtc cgagcattca aagttcctta ccaaaaagaa tttgtaccaa 2880 ttgtagtcta tattttgcca ctatcaaatc aatgaaacag cacaaaacag tttgctcaag 2940 tatcggatta tcgaatgaga ttcaaaaagt taaacctcat cgcctagttg ctcaacgtct 3000 atctgaactc atgtgtgtct ttttaaattc agacaaagaa acaattgaat ggcatgatca 3060 ggacgatgtt gatacctctg atctcaactg gccttcaata attattgaag agcatggaac 3120 tccaatactt gaaggtcaaa gtccaatttg ggaacaaatt taggacatgt ttcttaaaca 3180 gaaacatgtc aattgtattt ttttttaata tgaactgtta caattaaata gacacttgtg 3240 aaaattccaa acttttatct ttgtcacgtc acacaggggt gggggtgggg gagtttttaa 3300 aaatgccaca aagtgtcaca taggggaggg ggggtctcta aaaatgccta aaaaaatgtc 3360 acgtattatt tgaacagccc c 3381 // ID P-1_AP repbase; DNA; INV; 3575 BP. XX AC Contig35933; XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 15.12, Last updated, Version 2) XX DE DNA transposon. XX KW P; DNA transposon; Transposable Element; P-1_AP. XX NM P-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-3575 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1362-1362 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(489..1187,1268..3229) FT /product="P-1_AP_1p" FT /translation="MDFRFPLKNPERNQLWINAVGRKGFIPTKNSAICSSH FT FVPSDFKINPGGNYRLHLNDTSVPSVFPGNTSEKKSQIEQNIEQNSPIREI FT IDTNNLLLTTPSKVTARRPLFSPKTKSKHTSEPLKPRKIESKKKIKLLQQG FT IRRRDKKINNLKSLLQNMRFKGLIEEQCEQLLIDQFEGTSQEIFCNELKNK FT GKRPTGYRYSIQLKEFAATLHYYSPKALKYCRYYTFCYHKLFCYCFRTFLK FT LPSGKSILNWTSSINGEPGFFKEVFDTLQTMSPDDRHCNLIFDAMSIKKQI FT SWDERLGKFVGYCDYGNAFELEGSETPATETLVFMLTSINGKWKLPIGYVF FT QNKITASIQAELIKSALTHAHNAGMTVWSVTCDGAYTNVCTLKLLRCKISN FT SYDDIESWFEHPVTRSKVYYIPDACHMLKLARNILANNYVLESDTGYIRWD FT HIRNLFKVQKDLTLKLANKLSMVHVNWHNNKMRVQYAAQTLSSSTADSLEY FT LKNINFPGFENVEATVEYCRAIDRIFDFLNSKSKFSNAFKSPIYYNTIEKR FT EEIIIPLIKYLYTLKFKGSPLHISSKKTFILGFAIAVKSFFSMSRSHFVQH FT PNFKYLLTYKFSQDHLELLFGRIRQRLGSNNNPNTAQFKTAIKQILMKNAI FT KCRSKHNCNTFDDDPIGSLFDFKWKTKKDDIDYEKVNFETIDIEALKKLQL FT LNSYIPESNNYSSTLQDAKNNILYYIVGYLIRKLNLDCSSCENAVLDKFNE FT HDYCKSLSFTKFVNFKNRSGLVFGSKSVFLIILEAEKMFLFLTDNFKSLQI FT PNLEIKIIKHVIVTFSTDKNIFPNLNCENISILERPHKILLITLLTKKCLK FT LRLKSFSKMYSSDIMNPVSKRHKLSKLILFSNQ*" XX SQ Sequence 3575 BP; 1326 A; 523 C; 492 G; 1234 T; 0 other; catagacata taaaacaact agatagccga ctgtatatta cattgccata agaatgttga 60 agtcgggccg ccatatgcca acagggcaat atttacataa taattcaata gataagatac 120 tgtattgata tctgataata aacaatatta caaaaaacgt atttcttttc atattttact 180 tttaccagaa tataaaatac aaattgaatt tctttagtct aaattactaa ttagtaataa 240 aattaccaaa cggtacctac ataacctaca tactaaagta attaataaga tgccagtttc 300 tcgttccgcc catgcctgta ctaatcgaag gcaacgtgat ggcaatatcc attttttttc 360 gtaagtacct gcgaaataat ttttttttac tataatttaa cttttaagat tgtaggttca 420 atccatatct acagttacta ataagtttaa ttagataaag catttcttat ttaattgttt 480 attattgtat ggattttaga tttcctttaa aaaatcctga aagaaaccaa ctttggataa 540 atgctgtagg gcgtaaagga tttattccaa caaaaaacag tgctatttgc agcagtcatt 600 ttgtaccatc cgatttcaaa ataaatccag gaggcaatta ccgtttacat ttaaatgata 660 cttctgtacc atctgttttc cctggtaata catcagaaaa aaaatctcaa attgaacaaa 720 acattgagca aaattcacct attagagaga ttattgatac aaataattta ttattgacta 780 ctccgtccaa agtaacagca agaaggcctt tattttctcc taaaacaaaa agtaaacata 840 caagtgaacc tctaaaacct aggaaaattg agtcgaaaaa aaaaataaaa ttgttgcaac 900 aaggaataag acgaagggat aaaaaaataa ataacttaaa aagcttatta caaaatatgc 960 gttttaaagg tttgatcgaa gaacaatgtg agcaattatt aattgatcaa tttgaaggta 1020 catcccaaga aattttttgc aacgagctca aaaacaaagg taaaagacca accgggtata 1080 gatactctat acagcttaaa gagtttgctg ctacgcttca ttattattca cctaaagcgc 1140 ttaaatattg caggtattat actttttgct atcacaaatt attttgctaa tcgcatatta 1200 aaatacatgc tttgcactta atttacaatc aaatcatttt tttatttttt tttcctattt 1260 aatataatat tgttttagaa catttttgaa attgccaagt ggtaaatcaa ttcttaactg 1320 gacttcaagc attaatggcg agccaggttt tttcaaagaa gtttttgata cacttcaaac 1380 catgagtcca gatgacagac actgtaatct aatttttgat gccatgtcta taaaaaaaca 1440 gatttcgtgg gatgaaagac tgggtaagtt tgttggatac tgtgactatg gcaatgcatt 1500 tgaacttgag ggatctgaaa caccagctac tgagacatta gtatttatgt tgaccagtat 1560 taatggaaag tggaaattac caattgggta tgtttttcaa aataaaataa ctgcttctat 1620 tcaagctgaa ctaattaaat cagcccttac tcatgctcat aatgctggga tgactgtctg 1680 gagtgtaacg tgtgatgggg cctatacaaa tgtctgtact ttaaaacttt taagatgcaa 1740 aataagcaat agctatgatg atattgaaag ttggtttgag catccagtaa ctagatccaa 1800 agtatattat attccggacg catgtcacat gttaaaatta gcccggaaca ttttagctaa 1860 taattatgta ttagaatctg atactggtta cattagatgg gaccatatca gaaatctatt 1920 taaggttcaa aaagatttaa cattaaaatt agctaataaa ttgagtatgg tccatgttaa 1980 ctggcataat aataaaatgc gagtccagta tgctgcccag accctgagct catctacagc 2040 agattcctta gaatatctaa aaaatattaa cttccctggt tttgagaatg ttgaagccac 2100 tgtagaatac tgtagagcaa tagaccgtat atttgatttt ttaaattcaa aaagtaaatt 2160 ttcaaacgct tttaaaagtc caatttatta taataccatt gaaaaacgtg aagaaatcat 2220 tattccacta attaaatatt tatacacttt aaaattcaaa ggatctccat tacatatatc 2280 aagtaagaaa accttcatat taggttttgc aatagcagtt aaatctttct tttcaatgtc 2340 cagatcacat tttgtacaac accctaattt caaatacctt ttaacgtata aattctcaca 2400 ggatcatctc gaattgcttt ttggccgtat taggcagcgt ttaggatcaa acaataatcc 2460 taatacagct caatttaaaa cagccataaa acagatcttg atgaaaaatg caattaaatg 2520 tagatcaaaa cataactgca acacatttga tgacgatcca ataggttctt tatttgattt 2580 taaatggaag actaaaaaag atgatattga ctatgaaaag gtaaacttcg aaacaattga 2640 cattgaggca ttaaaaaaat tacaacttct aaattcttac attcctgaat caaataatta 2700 ctcaagtact ttacaagatg ctaagaacaa cattttatat tatattgtgg gttacctaat 2760 tagaaaactt aatttggatt gttcttcatg tgaaaacgct gtattagata aatttaatga 2820 acatgactat tgtaaaagtt tatcatttac taagtttgta aatttcaaaa atagaagtgg 2880 tcttgtattt ggttctaaat ctgtgttttt aattatacta gaagcagaaa aaatgttttt 2940 atttttaaca gataatttta aatcacttca aattccaaat cttgaaatta aaataattaa 3000 acatgttata gttactttct caacagataa aaatatattt ccaaacctaa attgtgaaaa 3060 tatatccatt ctagaacggc cccataagat tctattaatt acactattaa caaaaaaatg 3120 tttaaagtta aggttaaaat cattctcaaa aatgtattct tcagatatca tgaatccagt 3180 aagcaaaaga cataaacttt ctaaattaat tttattttca aatcagtaat cactaattgt 3240 tataaaaatt gtctaacact atattttact gttttgtttt gtaaataata tttattataa 3300 tatagattga aatgtataat gtacttaata atgttatctt gtatgtattt attttcatta 3360 tttgtataaa ataaaatgtt taaaaaagtt aaaatataat aaaattatac tgtattaaat 3420 tatacatttt gagttttgac atttattttg ttatcatttt gttattaaat attcttatca 3480 atcatgcgat tttgtaaaca ttgccctgtt ggcatatggc ggttgacttc aatattatcc 3540 tgtagtcggc tatctagttg ttttatatgt ctatg 3575 // ID BEL-243_AA-LTR repbase; DNA; INV; 609 BP. XX AC . XX DT 13-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-243_AA_; KW BEL-243_AA-I; BEL-243_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-609 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (13-JAN-2011). XX DR [1] (Consensus) XX SQ Sequence 609 BP; 197 A; 107 C; 117 G; 188 T; 0 other; tgtcgacgat acccctcatc gttactgaga ggcggtgaaa caactatccg ttcccgcact 60 gtcggtgtaa gggagatcag atgtcatcgc aaactgaatg ttgttgacac cgaagaagca 120 ttagtcagca ttgttgatca tagtcctgaa ttcatggtga tatatcgttg ctctataaag 180 tgggtttgaa attagtcagt atgaattgct ttatctaatg aattgtttaa ttatagccta 240 gttattgaag taaatttatt attggtcgag cggcagttag tcatttgacg aaaaagtgca 300 ttaaaagtac gttgaagttg aactcgtttg catatactta ccagcgttgt cctcataacc 360 tgaaatgaga aatagaatga ttaatactta cctaaattac ataaaaattc tgtagatttc 420 tggagattgc gcgctagttc aaatccgagt gattgagatc ctgttaaaac cagaaaaaca 480 ctgacaatgt aagacaaatc tttaatactt accaatgaat cacatctaat taccttacaa 540 taaattacag cttagtagct gcttagtcac gcatctaaca agacggtgtt tactctactt 600 acgggaaca 609 // ID P-7_HM repbase; DNA; INV; 3170 BP. XX AC . XX DT 17-MAR-2008 (Rel. 13.03, Created) DT 17-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3170 RA Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(3), 353-353 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 223..2778 FT /product="P-7_HM_1p" FT /translation="MVNKCVAYGCKTGYNSSLEDKKIASFHFPLKNQQLLN FT YWVHFVNRTDWVPSPNSVLCEKHFNEKFIIRGKKCKLNWNLDPIPEIFSSD FT CLKRPSTLPSPIALRRPPKSRIFQEDELSSFSENDKVTEWTMLSIKHSPEG FT FQFKKHLNCVVYYNLIFDPITFFPSILESIKIDYCLCVALQYKGNNVPLPP FT WLSKGRNSKLTSLSMLENLAAYIKNTASSNRNILLDEINSRTYYKAKGQPQ FT YSADMIRHALLLRYTSKQAYEILLESFPLPSISLLNKIKQGGVDAIKAVKT FT LREKGSIHEDVILIVDEFYLQKQAQYCSGIYVGADDTGTLYKGVVAFMIVS FT LKKSIPFIIKAIPEVNISGKWLSDQISDSISSLSLSGFKIRGVVTDNHKSN FT VSAFSLLRSKFGNQSNFFINHPDSPNTKIYLFYDSVHLVKNIRNNLLHCKK FT FVFPSFTFNIKDDSNNCPAGYITWGDFHHLHSKDEKLQGNLRKAPKISYQS FT LHPGNKKQNVSLALSIFDETTIAGFKSYFPERKDISGFLTIFHKWWTISNS FT KERFSPNKLGNAIILNDGKTTFFRQLANWIELWQKSPYFTLSQQTSSALIT FT TLRSQAMLIDDLIEEGYLYVLTSRLQSDPIERRFSQYRQMSGGRFLVSLLE FT VQHSEIILACRSLIKENINFWDEDLKNNHNNPINNELLAILDLNSNEISES FT TLSPETEEVATTIAGYIAKKLIKRSKCNYCKPLLATDTSNIIICPYLQILS FT RGGLTVPSLSLKDFTCGCFAVLDNLSDLITKQGANVKDVSSFVLSKYFPTV FT CFTCDIHKKWGFNFASKIIINVFFNNKQNLINATVRKDAVKSFKSRQREK" XX SQ Sequence 3170 BP; 1150 A; 476 C; 457 G; 1086 T; 1 other; catggcctac tatattatac ggccgtatat gttacatttt agaggccgat tctaagaggc 60 ctattttcgc acaaaaaaag acaacaagtt gaatgataaa actattaatt ttaaacttgc 120 atattatgca aaataaaaag caacttgcat aataagtaga acttgcaaat tataataact 180 gttttagaaa gttaataaat actaaaaaag ctgaatcata tcatggttaa caagtgtgtt 240 gcatatggtt gtaaaactgg atacaacagt tctttagaag ataaaaaaat tgcttctttt 300 cattttccgt taaaaaatca acagttactg aattattggg ttcattttgt taatcgtact 360 gattgggtgc cttcaccaaa ctccgttttg tgtgaaaagc attttaatga aaagtttatc 420 atccgtggta aaaaatgcaa gttaaattgg aatctagatc ctattcctga aattttctcc 480 agtgattgct taaaacgtcc ttccacatta ccatcgccta ttgctttgag aagaccacct 540 aaaagtcgta tttttcaaga agatgaacta agttcttttt ctgaaaatga taaagttact 600 gaatggacca tgctaagtat taaacattca ccagaaggtt tccaatttaa aaaacatctc 660 aattgtgtag tttattataa tttaatattt gatcctataa cattttttcc atcaattctt 720 gaatcaataa aaattgatta ttgcttatgt gtagctttgc aatataaagg aaataatgtt 780 ccacttcctc catggttatc taaaggtcgt aattctaaat taactagttt gagtatgtta 840 gaaaatcttg cagcgtacat aaaaaatacc gcttcttcta atagaaatat acttcttgat 900 gaaataaata gtcgaacata ctataaagcc aagggtcaac cacaatattc tgctgatatg 960 attagacatg ctttattatt aagatataca tctaaacaag catatgaaat tttactagaa 1020 tcatttccat taccttcaat ttctttgtta aataaaataa aacaaggtgg agttgatgca 1080 attaaggctg ttaaaacact tcgtgagaaa ggttcaatac atgaagatgt aatattaatt 1140 gtagatgaat tttatcttca gaaacaagca caatattgta gtggtattta tgtgggtgca 1200 gatgatacag gtactttgta caaaggagtt gttgcattta tgatagttag tttaaaaaaa 1260 tcaattcctt tcattattaa agcgatacct gaagttaaca taagtggtaa atggctttca 1320 gatcaaatat ccgattctat tagttctctt tcgctatctg gttttaaaat tcgaggtgtt 1380 gttaccgata atcataaatc gaatgtaagt gcatttagtt tgcttcgttc caaatttggt 1440 aatcaatcaa atttctttat taatcatcct gatagtccaa atactaaaat atacttattt 1500 tatgatagtg ttcatcttgt taaaaatatc agaaataatt tattacactg taaaaaattt 1560 gtctttccat cctttacatt taatataaaa gatgattcta ataattgtcc agctggttat 1620 attacttggg gagattttca tcatctacat tctaaagatg agaaacttca aggcaattta 1680 cgaaaagctc caaaaatttc atatcagagt cttcaccctg gtaacaaaaa gcaaaacgtt 1740 tctcttgcac tatcaatttt tgatgaaaca acaattgcag gtttcaagag ttattttcca 1800 gaacgtaaag acatttctgg ttttcttaca atttttcaca aatggtggac tatttcaaat 1860 tctaaagagc gatttagccc aaataaactt ggtaatgcga ttattcttaa tgatggtaaa 1920 acaacttttt ttcgacaatt agctaattgg attgaacttt ggcagaaatc tccatatttt 1980 accttatcac aacaaacatc atcggctctc atcacaactc tgcgttctca agcaatgctt 2040 atagatgacc ttattgaaga aggatattta tatgtactta cttcacggct tcaaagtgac 2100 ccaatagagc gtagattttc gcaatatcga cagatgagtg gtggaagatt tctagttagc 2160 ttactagaag ttcaacattc tgaaattatt ttagcatgtc ggtctttaat taaggagaat 2220 ataaattttt gggatgaaga tctaaaaaac aaccacaata acccaattaa taatgaatta 2280 ctggcaatat tagatttaaa ctcaaacgag atatctgaat caactctctc acctgaaaca 2340 gaagaagttg caacaacaat agcaggttac attgctaaaa aattaatcaa aagaagcaaa 2400 tgtaattatt gcaaaccttt gctagctaca gatacaagca atataattat ttgtccatat 2460 ctacaaatac tttccagagg tggtcttaca gtaccatcac tttcacttaa agattttaca 2520 tgtggatgtt ttgcagtgtt agataacctt tcagatttaa taactaaaca aggtgctaat 2580 gtcaaagacg tatcaagctt tgttttatca aaatactttc caacagtttg ttttacctgt 2640 gatatacaca aaaaatgggg ttttaatttt gcttctaaaa tcataattaa tgtttttttt 2700 aacaataagc agaacttaat caatgctact gtacgaaaag atgctgttaa aagttttaaa 2760 agtagacaga gagaaaaata accctataaa aaaaaataaa atatatatat atatattaac 2820 ataaagatag ataaaacaat aaaagaacat aaatttaaac cttggttttc tttgtaaatt 2880 tctttgtaaa aatcgtgttg tcaatttaaa tgtataatgt acaaatataa aaatatatat 2940 tagatacaaa tacatgtata atatacatag atatatatgt ttataaatac acaaatctct 3000 atgattaatt atcgctgatg aatacacttt tctttgtact cgaaattaca tttagtctgg 3060 atgatgagtt ttttyctctg taattgcgca gccatctttt taataaatag gcctcttaga 3120 atcggcctct aaaatgtaac atatacggcc gtataatata gtaggccatg 3170 // ID BEL-56_CQ-LTR repbase; DNA; INV; 687 BP. XX AC AAWU01004237; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-56_CQ_; KW BEL-56_CQ-I; BEL-56_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-687 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 266-266 (2011). XX DR GenBank; AAWU01004237; Positions 44705 44019. XX SQ Sequence 687 BP; 236 A; 118 C; 167 G; 166 T; 0 other; tgttacgacc gctgagatgg ccacactgcc gaacggtgca gcgctggcgg tgtaaccgcc 60 cgactccacc tcaactgtca acgtgcgcgc gaagaagaga gaaaacaaaa acaaagtgcg 120 tgtgtgacac ccagtcacaa gaaaccatgc tgggcaagtc accatcgcga attattagga 180 aaagtaagaa ctttttgctt agtttagctt ttcggataag taggggaaag attgtgtaga 240 cgctgtagaa gttgagtttt tattgaactt gctcacatca ggcacacctg taagtaaaga 300 agaagttgaa atttagtgcg attgattgtg aaattaactt caaatactta ctcagctgcg 360 aaagacgtcc gggaaactgc gccactaccg taccgtcagt tggtgctaaa tcgtaccgcg 420 gtgataacct gtaagtttac ctgaaaagaa acacaaacaa atattaataa aatacacaaa 480 aatataggaa aaggtgaacg gaaaggtgaa ggataaggag gaatgaggag aggaaggagt 540 cgtgatttgt aggaaagagg aagactattt aagtgagtat ggagattgaa ggaaatgtaa 600 acagtaatta atttgaaaaa tatatttaca gtttttgagc tgttttcaca ctgctacaaa 660 aattggtctc tctggttcat ccgaaca 687 // ID Tx1-10_CQ repbase; DNA; INV; 4930 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A Tx1 non-LTR retrotransposon from Culex quinquefasciatus - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; Tx1-10_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4930 RA Kojima K.K. and Jurka J.; RT "Tx1 non-LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 642-642 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 156..1478 FT /product="Tx1-10_CQ_1p" FT /translation="MAESDLKTNSLKIRFAGNCREPTEPEIFDFMRGKMKL FT KADHLLSMYKDKLDASVIVKFKNEEEFKATLARLPGTMEFAYNKYESTQVR FT LSPANAVVKYVRLFNLPPEVDDREIGTALSKYGRIQRLIREKYGEETGFPI FT WTSVRGAYIELKEGTEIPASLHVRNIRARVFYEGLVNKCFLCGATDHIKAE FT CPERKTVNTRLDASRGSYSAAVAGGAARWFKQKMAEPTLSKEPEAXMTNLN FT ALFPQTALMKPQQSMVEVKEKAPPAGPSSGKQIAAAKETDDLWSQALAAVA FT DQLQQPQQENKQDGDGWRVVPPKKEKNKKNPKKVRSSSAGSSGASGSVDPK FT FLRPRTPRPTAISDAQRNRDRSRSAPKNGDSKKPKIIVTAETNEEAKQPSS FT GEMDVDGVDEMKVAGSGGGQGAGGEQGTGGEQGCGGGSFGRRFTVXW" FT CDS 1570..4836 FT /product="Tx1-10_CQ_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MMNFTYKLGSVNLNSSNSLVNKQLLKDFIYINDLDVI FT FLQEVSYDNFSFVPSHAPIVNLSDSGSGTAILIRKTFDFSSPLLDPNGRIS FT SVIINHINFINVYAFSGTNRKKERDDLFLDNMTIHLNKAGSKFCVVGGDFN FT CILNAFDTAGSNKNYSTGLKRLIEIFNFKDIAVELRKHQFTFYRGDTESRL FT DRFYAPSAFLENVLDFTTLPVAFSDHHAIIMKIKSCSDEFSLKGRGYWKIN FT STIANDSSVSDQFSECFETWKLRHQYTDDKNLWWNVIAKKKFKQFYQNKSW FT ELNQQISNEKSYFYRKMLEWMDKKNRNEETLLDIKMIKSRLYEIESNRMKH FT LGNNLNNGNILQNEMVNLYQVSAQVKRFQSSSNLRLKDGNILVVDINVLKN FT KVYQYYANLFTEDNNSSQQLSAEDPIHCLNSSLDQDDRDDIMRPITVVEIE FT EILKNCTRKKSPGPDGLTYEFYINNFDSLKADLVLLFNGYLSGNMRPSKTF FT TDGIITLIPKKGANSVHLDDYRPISLLNCDYKLFMKIIAERLKFYMGKLLG FT FGQTACLPNSSCIDNLKDIRRILTRACESKRFKGCLVSVDLNKAFDRVNHS FT YLWKVLEKFGFPTQLIDCLKNLYANAASRVLVNGFLTPEINISSSVRQGCP FT LSMILFVLYIEPLLRKISVTIPPVLVYGKFINVLAFADDIVVFVKNDYEFD FT NFIKVIDTFSKYASIKLNFAKSVFLRLNNAKVGPQKFKETEALKVLGVIFE FT KSWSSTVNNNYKRLVTNINLSMLQHNVRKINIIERCIVLNTYILSKLWYVA FT QLFPPANSHLAQIKSACGKFIWKGYIFKLDRKQLYLDFFKGGLRLVDPEAK FT SKALFLKNIFYNLNAHGTQPDESYLIDLDSPQKLTRNAREWLNEGKAVVAN FT HDLNTVKLMYNYFVSLNSESPAIEGKLSLKWTLIWKNCSESFLEMKDRAKM FT FGFVNDLLPNKEKLLSYNIGNLTSAVCDKCGGIDSNSHRIKECPRAKEVWE FT WSSRIVQNRYKMNVVDLEDILQLDLDEKSESEKAALWLAVKTISYNLRVSE FT PSLFVLKKEIREHRWNERKNIRIIFGNSLNIC" XX SQ Sequence 4930 BP; 1571 A; 774 C; 1126 G; 1452 T; 7 other; cagttcgtgt ttggacattg tcacagctag acgtgtttca agagcgagta tctaaaatct 60 tccgattcgt gaccagcgtg cgcgctgcct gttctttgtt ataagaacaa tagttcgttt 120 ggtttctttt gttatttcga taacgcggag tgaaaatggc ggaaagtgac ctcaagacca 180 actccttgaa gattcggttt gccggcaact gccgagaacc tacggaaccg gaaatctttg 240 atttcatgcg cgggaagatg aagctgaagg cggaccatct gctgtccatg tacaaggaca 300 agttggacgc ttctgtgatc gtgaagttca agaacgagga agaattcaag gccactctgg 360 ccaggttgcc cggaaccatg gagtttgcgt acaacaagta cgagtcgacg caggttcgcc 420 tgtcaccggc aaacgcggtg gtgaagtacg tgcggctgtt caacctaccg ccggaggtgg 480 acgatcgcga gatcgggacg gckctgtcca agtacggccg aatccagcgg ctgattcgcg 540 aaaagtacgg cgaggaaacg ggtttcccca tctggacatc cgtccgagga gcgtacatcg 600 agctgaagga gggtacggag attccggcgt cccttcatgt gcggaacatw cgagcaaggg 660 ttttctacga aggtttggts aacaagtgct tcctttgcgg tgcgacggac catatcaagg 720 cagagtgtcc cgaacggaaa acggtcaata cccggttgga tgctagccgc ggttcctaca 780 gtgccgcggt tgctggtgga gctgctcggt ggttcaagca gaaaatggcg gaaccgacct 840 tgagcaagga accagaggca amgatgacga acctcaacgc tctgtttcct caaacggctt 900 tgatgaaacc gcagcagagt atggtggaag tgaaggaaaa ggcgcccccc gctggaccgt 960 ccagcggaaa gcaaattgca gcagcgaagg agacggacga tctttggtct caagcgttag 1020 cagcagtcgc ggatcaattg cagcagccac agcaggagaa caagcaagat ggagacggct 1080 ggcgcgtggt gccaccgaag aaggagaaga acaagaagaa tccaaagaaa gtgcgttcaa 1140 gttcggctgg ttcatcgggt gcatcgggtt cggtcgatcc gaagtttttg cgtccgcgwa 1200 ctccgcgtcc gaccgctatc agtgatgcgc agcggaacag agacagaagt cgwtcggcac 1260 cgaagaacgg cgactcgaag aagccgaaga tcatcgtgac ggcagagacg aacgaggaag 1320 ccaagcagcc gagtagtggt gagatggatg tggatggcgt cgacgaaatg aaggtagcgg 1380 ggtctggggg gggacaggga gccggggggg aacaggggac tgggggggaa caggggtgtg 1440 gggggggaag ttttggacga cgtttcactg tgatktggta ggggaatagg gtttgcttct 1500 ttgctttttt aaactttagt tataattttg tatagtagat ttagttagtt tagtttttgg 1560 tagattttaa tgatgaattt cacttacaaa ttaggatctg tgaatttaaa tagttcgaat 1620 agtttagtta acaaacagct cttgaaagat tttatttata ttaatgattt agacgttatt 1680 ttcctgcaag aggtttccta tgataacttt tcgtttgttc catcgcatgc tcccatcgtt 1740 aatttaagtg atagtggttc tggtactgcc attttgatta gaaaaacatt tgatttcagc 1800 agcccccttc tcgaccctaa tggaagaata tcctcagtga taattaatca catcaacttt 1860 attaatgttt atgcattctc gggaactaac cgtaaaaaag agcgggatga tttatttttg 1920 gacaatatga caattcattt gaataaagcg gggtccaaat tctgtgtggt tggaggcgat 1980 tttaactgca ttttgaacgc gtttgacact gctggttcga ataaaaatta ttctaccggg 2040 ttgaagcgac ttattgaaat atttaatttt aaggacattg ctgttgaatt gcgcaaacat 2100 cagtttactt tttatcgggg tgacacagag tcaaggcttg atcgatttta tgcgccgagc 2160 gcttttttag agaatgtttt agatttcaca accctcccag ttgcattttc tgaccatcat 2220 gcgattatta tgaaaatcaa atcttgcagt gatgagtttt ctttaaaagg tcgaggatat 2280 tggaaaataa acagtacgat cgctaacgat agttcagttt cggatcaatt ttctgaatgt 2340 ttcgaaacgt ggaagttaag acatcagtac accgatgata aaaatctttg gtggaacgta 2400 attgcaaaaa aaaagtttaa acaattttat caaaataaga gttgggagtt aaaccaacaa 2460 atttctaatg agaaatctta cttttataga aaaatgttag aatggatgga taagaaaaat 2520 agaaatgagg aaactttatt agatattaaa atgattaaat ctcgtttata tgagattgaa 2580 tcaaaccgta tgaaacattt aggaaacaat ttaaataatg gaaatatttt gcaaaatgaa 2640 atggtaaatt tatatcaagt ttctgctcaa gttaaaagat ttcagtctag ttctaattta 2700 agacttaaag acggtaacat cttagttgta gacataaatg ttttaaaaaa taaggtttat 2760 caatattatg cgaatttatt taccgaagat aataacagtt ctcagcaact gagtgcagaa 2820 gatcctatac attgtttaaa tagttcgcta gatcaagatg accgagatga cattatgcga 2880 cctataacag tagttgaaat cgaggaaatt ttaaaaaatt gtacccgtaa aaagtcaccg 2940 ggccctgatg gattaacata tgagttttat ataaataatt ttgattcttt aaaagctgat 3000 ttagtattat tatttaatgg atatctttca ggtaatatgc gaccatcaaa aacttttaca 3060 gatggaatta ttacgttaat acctaaaaaa ggtgcaaata gtgtgcattt ggatgattat 3120 cgtccgatta gtttgttaaa ttgcgattat aaattattta tgaaaattat tgctgaacgt 3180 ttgaagtttt acatgggtaa attgctaggt tttggtcaaa cagcatgttt gccaaattct 3240 tcttgtatag acaatttaaa ggacattcgc agaattctca ctcgagcatg tgagtctaag 3300 cgttttaagg gttgtttagt aagcgtggat ttgaataagg cttttgatag agttaatcac 3360 tcctatctat ggaaagtcct ggagaagttt gggtttccta ctcaactgat agattgtttg 3420 aaaaacttgt acgctaatgc agcctctcga gttttggtta atggattttt aactcctgaa 3480 ataaatatta gcagctcagt acgacagggt tgtcctttaa gcatgatcct atttgtactt 3540 tacatagaac ctctgcttag aaaaatttca gttactattc ctcccgtttt agtatatggg 3600 aaattcataa atgttttagc ttttgccgat gatatagtag tttttgttaa gaatgattat 3660 gaatttgata attttataaa agtaattgat acattttcga agtatgcatc tatcaaatta 3720 aattttgcta aatcagtatt tcttaggttg aacaatgcta aagttggtcc acagaaattc 3780 aaagagactg aagcattgaa agttcttggg gttatatttg agaaaagttg gtcaagtact 3840 gtgaataata attataaacg acttgtcact aatatcaatt tatctatgct tcagcataat 3900 gtaaggaaaa ttaacataat tgaaagatgt attgttttga acacctatat tttatcgaaa 3960 ttatggtacg ttgcacaatt gtttccacct gcaaattccc atctagctca aataaaatcg 4020 gcgtgcggca agtttatttg gaaaggatac atttttaaat tagatagaaa acaactttat 4080 ttagattttt tcaagggagg tcttaggctt gttgatccag aagcaaaatc taaggcatta 4140 tttctgaaaa acatttttta taatttaaat gcacatggaa cacaaccgga tgaaagttat 4200 ttgatagatt tagattcacc gcagaaactt acacgtaatg caagggaatg gttgaatgaa 4260 gggaaggcgg ttgttgccaa tcacgattta aatactgtaa aattgatgta taattatttt 4320 gtttcactta atagtgaatc accagctatt gaagggaaat tgagtttaaa gtggacttta 4380 atttggaaaa attgtagtga aagcttcctc gaaatgaaag atcgtgcaaa aatgtttggt 4440 ttcgtaaatg atttgttacc taacaaagaa aaattattaa gttacaatat tggaaatttg 4500 acgtcagctg tatgcgacaa atgtggtgga attgattcta attcacacag aataaaagag 4560 tgcccaagag caaaggaagt gtgggaatgg agtagtcgta ttgtccaaaa tcgatataaa 4620 atgaatgtag ttgatttaga ggatattctt caattagatc tcgatgagaa aagcgaatca 4680 gaaaaggcag ctttatggct tgcagtaaaa acaatcagtt acaatttgag agtatcagaa 4740 ccgtcattat ttgttttgaa aaaggaaata agagagcata gatggaatga gcgaaaaaac 4800 ataaggatta tttttggaaa tagtttgaac atttgttaag agcaggttat gatagtttgt 4860 aacacgaaca atgggtaaat aaactatttt taaaaaccga aaaaaaaaaa aaaaaaaaaa 4920 aaaaaaaaaa 4930 // ID Gypsy-82_CQ-I repbase; DNA; INV; 7750 BP. XX AC AAWU01003460; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-82_CQ_; KW Gypsy-82_CQ-LTR; Gypsy-82_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-7750 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 543-543 (2011). XX DR Genome; AAWU01003460; Positions 8138 389. XX CC Positions [4701-5180] - Integrase core CC 'CATA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2223..5540 FT /product="Gypsy-82_CQ-I_2p" FT /translation="MLKAKNYVLIPARMEKFQKIFVPANEDILCHSGQIVE FT NVFIGNCIVTPINNFVTIPILNSSEEDILVDLNQLEFEPLSHFQILNLTNK FT NERCKTIIDNINMEHLNSEESDSITNIINEYNDVFYVEGDKLSATNATTHR FT ITTNNGDKPIFTRQYRIPTIHKDVIEKEVQTMLKDGLIKESNSPWNSPLLV FT VPKKQDKDGNKRWRMVVDFRKINDATISDAYPLPLISDILDQLGRARYFSN FT LDLASGFHQILMDPKDSAKTAFSTTYGHYEFLRMPFGLKNAPATLQRFMNT FT ILCGLQGVKCFIYMDDIVVYGDTLKSHNTKLIDVLKRLRQHNLKLQPNKCN FT FLQKEILFLGHKITPDGIKPDEGKIEAVQKFPPPQNQKEIKTFLGMISYYR FT KFIPNLSNLAEPINRLLKKGIQFIWSDKCQQSFNELKSLLTSAPILQYPDF FT TKPFVITTDASNVAIGAVLSNEDKNLPIAYASRILNPAERRYSTIERELLA FT IVWAIKHFKPYVYGREFFVHSDHKPLVYLFNVTHASDRLMRWRLLLEEYDF FT KVIYKPGKNNAVADALSRIQLLTLDDRHEIVKEIGQQITNQKIEKLNVVTR FT HQANKNNAMIANDFKTFELLTNPSLNQNVLQRTESGSKHLKVFVLPSILDA FT KIDYSFNKNEFETAKIEDVGYVDSIESYFVISKNNHLDKICKHKLYKKLTF FT LKNQMKNINKKTISFCFILSSCDEINWNLFQNIVQHIFSDVNCKIHFSNNL FT VQKVSSKNEIEQILKLYHDSPLGGHEGVERTFKRISEKYKWPKMRQDIKNY FT IEKCDLCQRNKILKVNKIPLKITTTAEKPFEKIFLDIVGPLPKTKNGNNYI FT LTIQDDLTKTFIAKAIPEASAETTCKAFMEIGICVYGVPKTLVTDRGTNFT FT SKLFEHLCKLLRVEQVQTSAYHPQSNGALERCHRTLKEYLRSYVCENLNDW FT DDYLSQFCFTYNTSAHSSTGYSPYELMFGHKAELPVAIRTAQKENTNYFCY FT LKDLRFKLESIHKLAKDSLIKTKESRKETYDTKSNEWVPMWGDKVLLMNIT FT IGSGQKLQGIWNGPHEVVAIKSPETTQIRLKPSNKITTVHNNRLKLYKE" FT CDS 5575..7464 FT /product="Gypsy-82_CQ-I_1p" FT /translation="MIIPLFLINLIQLALTTSPVEVSPLYNHGIIYEKLPD FT VKLELGKYNLQSFHDLSSIAEEINQLEKLTPVLNNSCTKLATSFGIDCIRI FT VNDVSFVMTDIINAREIFSDSCNNKRSKRQINFVGHIFKEITGVMDHDDSA FT RIDTDLKNLHENEMILKNQLNKKTIAIDSIFEENNKTIYSMQNKLTEFSKI FT IDHYNNKNSDERTKNLVRNEFSFILTELNFGLSHLRRKINLFQTLMETGKV FT PLTPSIYSPSRFYDDLLKINEKLPYGFSMPFELKLQNIMKYYAISEVTSVL FT HDCKIYVNIQFSISNHVLYEAYKGTSVPIMKNGKMFITTFEKEVILKSPNS FT KIGMLATYAEYKTCKIVQNLRICPSFHILEDLSKLTDCNVQAFFNTSSNSC FT FSQPINLKHQIWVQMVDENSWVFAVPNITQLDVLFNNKHMSYEISGIGKLK FT ILQPALIKSQEVLLHYYPKDFTKIQYNNFKMSFSHTPLEVKLIDKKVPDTL FT ETKIITNANSKQLFHEIVNLKNMNQDKMQLLNVELERHTFSFWTFFMYVIL FT AVLFLFTLFRIYTYCKSKFSRGRGSRCRTQEEEINSQEIQELNPRINTNML FT LPSSAYIVELDNNKMNSKKNVRKKPVYMNINV" XX SQ Sequence 7750 BP; 2961 A; 1257 C; 1217 G; 2315 T; 0 other; ttggtggcag caaaaagaac acacgccgta aagtcaaagc gtgaagtgaa gtaaattcgt 60 gaaaaaagtg aagttatttg cgcatctaaa gtagatgtac atatgtcggc cctattacaa 120 agtgaagcga tttacaaatc tctatgcaaa gctcgacagt cattgaaagt ttgctgtaat 180 caaaacgtct gtgcaaagtg aattacagtg caaaatagaa ataataataa taacagcaaa 240 gtggatccag gaacaaaatc ctgaatttat gcctgacgcc aaaaggattt tttttccgtg 300 ttgttaacgt aaaaataaaa tcgtcgtttt ccgaaacaat ttgacaccgc gcaattttgc 360 caaaaacgtg atcgataata ataagttagt gcaaattcca agtggaaaac acgaaacact 420 atgacagttt ttcaaatctt ctgttaagga aaattaagtg tcatcagtgt cgtttacaat 480 cataaaaaaa gtgctgaatg caaattagct gaaaatccgt acagaaggaa agtcatcaaa 540 catcgaagtc atcgattcca gcatcatcga cagcagcagc agcagcagca ggagctctgc 600 cttcctgtga cagcagatca cgtcaaaatc atcgattcca gcaacattaa cagcagcagc 660 agcaggagct ctgtcttcct gtgattcaac agcagaattc gtcaaggtta gcaagcaaca 720 ggttagtgtt acatgttaac cagaagtaag gcagtagcgc ttaacgcttc cttactgact 780 gaagacaata gtccggcacc gagtgtttcg gatcattctt ctcaagattc aacttacaga 840 ttaccagatt caatttctga ttttgattca gattcagaat caagttttgc atctgacctg 900 tcgtctacct tttcaaacat gccattcgat agagcttatg ccttcaacat tattccggaa 960 ctcaatggac gttccaatat taatttattc ttggatcagt gcgattcgat ttacgatgaa 1020 ttttgagata attgtgggcc tagtttaatt tcaattatga agtctaaaat ccctgacaca 1080 ctttgggtgt cagttagggg ttgtacgact tatgaagact tgaaggagaa gctcatccgt 1140 aaatttgcgg atgacactcc tattagcatg atgctaacta ctttggtaac attaaaccaa 1200 agcaaatttg aagatctaaa atgttttaca gacagagcac aaatgatttt cgacaaattg 1260 gtgctattaa cagaatcttt tttaaattta aaaaccgaag actccgagtc aacagcactc 1320 gtaaaaacgc tattcaaaaa ttttttacta gatttctttg ttaaaggaat caatgatgac 1380 aaattaagac aactcgtctt gtccaataat tttaatgatt ttaataaagc agcagattat 1440 gctaaatgta tggaagttgt tttcaaatcg tatgattctc atacgccagc tccatacaaa 1500 actgattcaa atttcacaaa tcaaataaga aaccaaaagc aatgcaattt ttgcaaaatg 1560 ataggacatt ttgaaaaaga ctgcagaaaa aaacaacaaa tcaaaaattt taaaactagt 1620 gatgaaccga gacagggctc atccaatcaa aaacaaaaac tttgtaattt ttgtggaata 1680 aaaggacatt gggaaaacaa ctgtaggaaa aaacagtccc aagtccacac gttgaactca 1740 aaaaaccaac aggtttcatc aaaccttcct ccagacattt tactggcaga attggtcaga 1800 catttcaatc tgaaataata aaatttgacg agcagaacgt aacaaacaga cctttaatat 1860 ccgtttcatc ttcagataat aattttgata aaattatctt tttattagat cctggtgctg 1920 aagtttctgt attaaaaatc agcaaacttc attcttcaat taaaattttt tctgatgata 1980 aaatcccctt gcaaggaata acaaattccg ttgaattttc aaaaggatca attaattttc 2040 ctttattttt aaagcataca gttttagaac acaaattgta tgttgttgac gatcaatttc 2100 ctatacctgg agatggtcta ctaggatcag attttttcac aaaattttcc tgtgatattt 2160 cttttaaaaa taaaaattta aaaatttatt tgaatccttc tttagaaaat acacaattag 2220 ctatgttgaa agctaaaaac tatgtattaa ttccagcgcg aatggaaaaa tttcagaaaa 2280 tttttgtacc tgccaatgaa gatatattat gtcattcagg gcagattgtc gagaatgttt 2340 ttattggtaa ttgtattgtt acaccaataa acaattttgt aacgattccc atactaaata 2400 gtagcgagga agatattttg gttgatttga atcaacttga attcgaacct ttatctcatt 2460 ttcaaatttt gaatttaaca aataaaaatg aacgttgtaa aaccatcatc gacaatatta 2520 atatggaaca cttaaactcc gaggagagtg attccattac taatattatt aatgaatata 2580 atgatgtatt ctatgtcgaa ggtgacaaat tgtcggctac taatgccact acccacagaa 2640 taacgactaa caacggcgac aagccaattt ttacacgtca atacaggata cccactatcc 2700 acaaagacgt gattgaaaaa gaagtacaga ccatgttaaa agatggtcta attaaagaga 2760 gtaattctcc atggaactca cctcttttgg tggttccaaa gaagcaagat aaagacggaa 2820 ataaaagatg gcgaatggtg gttgactttc gtaaaattaa cgatgcaacc ataagtgatg 2880 catacccatt gccacttatt tccgatatcc ttgatcaact aggtagagca agatattttt 2940 ctaacctaga tcttgctagt ggattccatc aaatattaat ggatccaaaa gatagtgcca 3000 aaacggcatt ttcaacaaca tatgggcatt acgagttcct tcgaatgcca ttcggcctaa 3060 aaaatgcacc agctacactc caacgtttca tgaataccat actgtgtggt ttacaaggag 3120 taaaatgctt catctatatg gatgatattg tggtatatgg agatacactt aaatcacata 3180 atacaaaatt aatagatgtt ctgaaacgat taagacaaca taaccttaaa ctccagccaa 3240 ataaatgtaa tttcttacaa aaagaaattt tattcttagg gcacaaaatt accccagatg 3300 gtattaagcc cgatgaggga aaaatagaag cggttcaaaa gtttccgccc cctcaaaatc 3360 agaaggagat taaaacgttt ttgggaatga tcagttatta taggaaattt attcctaatt 3420 tatcaaattt agctgaaccc ataaacagac tcctcaaaaa agggatacaa tttatatggt 3480 cagataaatg tcagcaatcc ttcaatgagt tgaagtcatt gttgacatca gcaccaatat 3540 tacaatatcc cgatttcaca aaaccttttg tgattactac cgacgcaagt aacgttgcaa 3600 ttggagcagt attatcgaac gaggataaga acttgccaat tgcatacgca agtcgtattc 3660 taaaccctgc cgagaggagg tactcaacaa tcgaaagaga attgttagca atagtatggg 3720 caataaagca ttttaagcca tacgtttatg gccgagaatt ttttgtacat tccgaccaca 3780 agcctctagt atatttattc aatgttaccc acgcatccga taggttaatg cgatggagat 3840 tattgctaga agaatatgac ttcaaagtca tttataaacc cggaaaaaat aacgccgtag 3900 cggacgcgct aagtcgtatt caattattaa cattagacga tcgtcatgaa atcgtaaaag 3960 aaattggaca acaaataacc aatcaaaaaa tcgaaaaatt aaatgtcgtg acgcggcatc 4020 aagcaaataa aaataatgct atgatagcaa atgatttcaa aacatttgaa ttgttaacca 4080 atccatcttt gaatcaaaac gttttacaaa gaactgaaag tggttcgaaa cacttgaaag 4140 tttttgtatt accttccatt cttgatgcaa aaatcgatta tagttttaac aaaaatgaat 4200 ttgaaacagc aaagattgaa gatgtaggat acgttgactc cattgaatct tattttgtta 4260 ttagtaaaaa taaccattta gacaaaattt gtaaacacaa attatataaa aaattaactt 4320 ttcttaaaaa ccaaatgaaa aacattaata aaaagacaat aagcttttgt tttatattgt 4380 cttcatgtga tgaaataaat tggaatttat ttcaaaacat tgtacaacat attttttctg 4440 acgtcaattg caaaatacat ttttcaaata atctagttca aaaagtttca tctaagaatg 4500 aaattgaaca aattttaaaa ttatatcatg atagcccttt aggtggtcat gaaggagttg 4560 aacgtacttt taaaagaata tctgaaaaat acaagtggcc aaaaatgcgc caagatatta 4620 aaaattacat cgaaaaatgt gatttatgtc aaaggaataa aatattgaaa gtaaataaaa 4680 ttcctttgaa aatcacaaca acagcagaaa aaccattcga aaaaattttc ttggatatag 4740 ttggtccatt acctaaaacc aagaatggaa ataactatat tctgacaatt caggatgatc 4800 ttacgaaaac gttcatagcg aaagcaattc cagaagcttc agctgaaaca acttgcaaag 4860 cattcatgga aattggtatc tgtgtgtatg gtgtccctaa aacactagtc acagatcgcg 4920 gaacgaattt taccagtaag ctatttgaac atttgtgtaa attactacgt gtcgaacaag 4980 tacaaacatc agcctatcat ccacaaagta atggtgcatt agaaagatgc catcgaacgt 5040 taaaagaata tttacgttca tatgtctgtg aaaatttaaa tgattgggat gactatctca 5100 gtcaattttg cttcacatat aatacaagtg cacattcatc tactggatac tctccatatg 5160 aattaatgtt tggtcataaa gcagagttgc ctgtggccat acgaacagct caaaaagaaa 5220 acactaatta tttttgttat ttgaaagacc ttagattcaa actagaatct atacataaat 5280 tagcaaaaga tagtctaata aaaacaaagg aatcaagaaa ggaaacatat gacacgaagt 5340 ctaatgaatg ggttccaatg tggggtgata aagttctttt aatgaacatc accattggat 5400 ccggccaaaa actccaaggt atttggaatg gaccacatga agtagtggca ataaaatcgc 5460 cagaaactac tcaaattaga cttaaaccgt caaacaaaat cacaacggtt cataataacc 5520 gattgaaatt atacaaagag taatattttc ctttcaccac tttctttcac agaaatgatt 5580 attcccttgt tcctgattaa tttaatccag ctggcattaa caacgagtcc cgtggaagta 5640 tccccgctct acaatcatgg tattatatat gaaaaattac cagatgttaa attggaacta 5700 ggcaaatata atttgcagtc ttttcatgat ttaagttcca ttgcagaaga aataaatcaa 5760 cttgaaaaat taactcccgt tttaaataat agttgcacaa aattagcaac atcttttgga 5820 attgattgca tcagaattgt taatgatgtt tcgtttgtta tgactgacat tattaatgca 5880 agagaaatat tttcagactc ttgcaataat aaaaggtcca aacgacaaat caattttgta 5940 ggccatatat ttaaagaaat tacaggggta atggatcatg atgattccgc taggatagat 6000 acagatttaa aaaatttaca tgaaaatgaa atgatactta aaaaccaact aaacaagaaa 6060 acaatcgcca ttgactctat ttttgaggaa aataataaaa caatttattc tatgcaaaat 6120 aaactaacag aattttcaaa aatcattgat cactataata acaaaaacag cgatgagaga 6180 accaaaaatt tagtaagaaa tgaattctcg tttattttaa ctgaattaaa ttttggatta 6240 tcacatctaa ggagaaagat aaatttattc caaactttaa tggaaactgg aaaagttcct 6300 cttactcctt caatatacag tccatccaga ttttatgatg atttgttaaa aataaatgaa 6360 aaattaccgt acgggttttc aatgccattt gaattaaaat tgcaaaatat tatgaaatat 6420 tatgcaataa gtgaagtaac ttcagtgtta catgattgta aaatttatgt gaacatccaa 6480 ttttctataa gcaatcatgt tttgtatgaa gcttataagg gaacaagtgt acctattatg 6540 aaaaacggta aaatgtttat cactacattt gaaaaggagg taattcttaa atctcctaat 6600 tcaaaaattg gtatgcttgc aacatatgca gaatacaaaa cctgcaaaat agtgcaaaac 6660 cttagaattt gtccatcgtt tcacatattg gaagacttat ctaaattgac agattgcaat 6720 gtacaagctt tcttcaatac ttcgagtaat tcctgttttt ctcaacctat taatttaaaa 6780 catcaaattt gggtacaaat ggtagatgaa aactcatggg ttttcgctgt tccaaacata 6840 acccaactag atgtattatt taataacaaa cacatgtcat acgaaatatc aggaatagga 6900 aaacttaaaa tcttacaacc agctttgata aagtcccaag aagtgctttt gcattattat 6960 ccaaaagatt tcacaaaaat tcaatataat aattttaaga tgagtttttc acacacacct 7020 cttgaagtga aattaattga taaaaaggtt cctgatactt tggaaacaaa aataatcaca 7080 aatgcaaata gtaaacaatt atttcatgaa atcgtaaatt tgaaaaacat gaatcaagat 7140 aagatgcaac ttttaaatgt cgaattagaa agacacacct ttagtttttg gacgtttttt 7200 atgtatgtaa ttttggcagt tctatttttg tttactctgt ttcggattta cacgtattgc 7260 aaatccaaat tttcgagggg gaggggatct cggtgccgga ctcaagaaga agaaataaat 7320 tctcaggaaa tacaggagct taacccaaga atcaatacaa atatgttact acctagttca 7380 gcgtatatag tagaactcga taataataaa atgaattcta aaaaaaatgt aaggaaaaaa 7440 ccagtataca tgaatattaa tgtataaact acaagattat cacattattg aaatatattt 7500 atataaataa ataaataact aataataaat ataataaatg aatcaataaa tataattaat 7560 atattataat aaataaatgt gaaataaaaa tcaaatgatc aaaagcttga tttctacacc 7620 agagaacatt ttcgcaaaat gatcttgaaa ataaaatgta tagatgttca tggattgaag 7680 attatattta taataaaaaa aaaaaaacaa aagtttcata aacgaaactt tttcagataa 7740 gggtgagtga 7750 // ID BEL-7_SI-LTR repbase; DNA; INV; 287 BP. XX AC AEAQ01030469; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-7_SI_; KW BEL-7_SI-I; BEL-7_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-287 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01030469; Positions 355 641. XX SQ Sequence 287 BP; 76 A; 66 C; 55 G; 90 T; 0 other; tgtttacgcc gtttatataa ttgtgctcaa tcgactagtt attttctttt ggtttcttcc 60 cgctgtcact agtaatggcc gtaccgcggc gctgtagaaa agtttagtta agttgggagt 120 tgggagaact gcgacgaacg accgttagca tttttactct ctcacgtatt gtatcactag 180 tggagtcata ccgcttatta tcactcgtcc aaataaactg gtagcagttc acgatccaga 240 aacattcatt caatcccgat accacccaac aattcaagta tcgaaca 287 // ID BEL-17_CQ-I repbase; DNA; INV; 5696 BP. XX AC AAWU01036025; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-17_CQ_; KW BEL-17_CQ-LTR; BEL-17_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-5696 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 187-187 (2011). XX DR Genome; AAWU01036025; Positions 17296 22991. XX CC Positions [4612-5202] - Integrase core CC 'AAAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 119..2110 FT /product="BEL-17_CQ-I_2p" FT /translation="MTPKKRYVRSNSVKLKELKRRMATILGSAEQIKVFEA FT NYTADQANQLVIRIEMLDELWKQYSEVQEEIELLEDLSGEEISEERLVFQN FT KYVELKALLVSKLPNLSVNTEPTRVASNSASIVSQSVRLPEIQIPKFDGNP FT ELWIEFRDTFKSLIHANPNLTAIQKLHYLKDALIGEAYSSVANLEMSTEGY FT NIGWKLVEKRHNDTNFLIKRHISALFAIAPIKKESSALLMDLADQFDRHVS FT ILNKLEKTEEHWNSVLVEALSRRLDTTTLKEWEKDCKEGERPDFDALVTFI FT RKQARTLQSVKMSHTPTATQETKIVVKSRSTNNFMIAEAAHKCPMCKAAHY FT LYECDQFNKLSPRHCFNIAKKYKLCINCLRGHHLAKNCNSSNCKSCGQRHH FT TLLHLPPAVDGNSYNSNQRTTFTAFGQQVENLQVQTQMYCPPHLMGGVSGA FT RHPNTIDSFVPGTIETPFAHVAQSNQTPSHTNYSAYSAILSEGKPAQNHYN FT EQHKIRSANRAAEDSSGVLDTSRAKTEVFLATAIVRVEDRDGTNHFARVVL FT DSCSQCNFISEAVATKLRLRRTKTNTEVAGIGPGKVRVTESVTVKLRSRTR FT AFETSVVCLIIAKIPAILPSQNIDIRDWQVPRDVKLADPNSTLVRVWTFYS FT GPNSSSRYCLARVTK" FT CDS 2167..5613 FT /product="BEL-17_CQ-I_1p" FT /translation="MYAGRGANPSLCIISSLSGLDAILERFWQVDDFDRGR FT ALSLDDRWCEDHFNRTVSRRDDGRYVVRLPIREDRRPFLGDSYNLAQRRFL FT ANERRFGYDQQLRDDYLRFMDEYAELGHLEPGLRTGKEEFNLPHHAIRRPD FT STTTKTRVVFDASVKGCNQLSLNEILHSGPTVQPTLLSTVLNFRLPKYVFI FT ADIEKMFRQIWIHPEDRKLLKIVWRTNTQSPLETYQLKTVTYGTACAPFQA FT ARVLNKLADDEGHRYPLGAHILKNHTYVDDTLAGRDSLNEAIEATVQLRQL FT LQSGGFNLRKWCSNDRRVLQQIPTDLVEMPHEIEIDRSNTIKTLGLFWNPH FT EDSFGFKIPKLSALDKITKRIVASEMAQLFDPCGLIGSVVVWAKMFVQGLW FT YEKFAWDQELPEGLAKSWHTFRDQLPYLLEFRVPRRVVVSRKFMLHCFCDA FT SERGYGCSVYVVSKDQHKSYSNLLSTKSRVAPLREHDITRLELCAFLLGCQ FT LIDTILTTTEFVAPVVFWTDSSVVLQWIQTPPAIWKVFVSNRIAEIQRLMK FT GHTLRHVPTHLNPADRLSRGILPIEILNDKLHVHGPDFICESIESWPATKF FT DKSVLSVVESERRPIVALFTALVQSEPEEIFTKFSELAKLLKVTVYCLRFC FT TILRERSKGQSLGKIQPREIDNALKTLIRLAQSYSFPHELKLLEKCKAEDC FT TQTELNFKSLIKGLDLFLDEFGLIRIKGRLEKLSAPFDTRCPILLHDDHPL FT TKLIARSIHLQTLHGGPTLMLATLRQRFWPLHGRKLVRNIAYQCVQCFRCN FT PRKRVQIMAPLPTVRITPARVFANCGLDYCGPFLVRPLIGRGAALKMYVAC FT FICMVSKAIHLELVPDLSSQACINAIKRLIARRGRLLTLHCDNATTFKGAD FT RELKELRNQYIAQFQTEMWNNFCLAQGITFHHIPPRSPHFGGLWEAGVKSF FT KHHFKRILGNKSLRLDEVTTCVTQIEGILNSRPLTPLTNDADDLSALTPGH FT LIIGEPLFSLPEPNVSELSINRLNRFQEMRRTVQDFWKRWTRDYVHHLHQR FT PKWHQGTRNPQVGELVLLKQENLPPLQWNIGRITATHPGRDGRCRVVDVRT FT TRGTYRRAAAEVCLLPIETESSKVENPAQHQMQLPTSQPEL" XX SQ Sequence 5696 BP; 1687 A; 1251 C; 1307 G; 1451 T; 0 other; tttggtgccg aaacccggga taatcgctgt cgccggataa atctccaacg tcgtcgagtc 60 atcgccagca cgaagccgct tataagttaa gtatattagt taagtgagtg aagtgaagat 120 gactccaaag aagagatatg tgcggagcaa ttccgtcaag ttaaaagaat tgaagaggcg 180 gatggccacg atactaggtt cagcggaaca aatcaaagta ttcgaggcca attacacggc 240 cgaccaagca aatcaactag taattagaat cgaaatgctt gatgagcttt ggaagcagta 300 ctcggaggta caagaagaaa tagagcttct agaggacctt tcgggtgaag aaatttctga 360 agaacgatta gtatttcaaa acaaatacgt tgaattaaaa gcactacttg ttagtaagtt 420 accgaatcta tcggtaaata cggagcccac tcgcgtggct tcaaattcgg cgagcatcgt 480 gtcgcaatcg gtacgtctgc ccgagatcca aattcccaag ttcgacggga atccggaatt 540 gtggatcgaa tttcgtgaca catttaaatc actcattcat gcaaacccaa atctaacagc 600 tatccaaaag ttgcactatt taaaagatgc actaattggt gaggcttatt cgtccgtagc 660 caatcttgag atgtcgacgg aaggttataa tatcgggtgg aaattagttg aaaagcgaca 720 caacgatact aactttttga tcaaaagaca catttctgct ttatttgcga tagcaccaat 780 taaaaaagag tcttctgctc tgttgatgga cctcgctgat caattcgatc gtcacgtcag 840 catacttaat aagctagaaa aaacagaaga acattggaac tctgtactag ttgaagcttt 900 gagcagacgt ttagatacta ctacattgaa ggagtgggaa aaagattgca aagaaggcga 960 acgtcctgat ttcgacgctt tagtcacatt tattcgcaaa caagcgcgta cgttacagtc 1020 agtaaaaatg tctcacacac cgacagctac acaagagacc aagatcgtag taaaatcaag 1080 gtccacaaac aactttatga tcgctgaggc agcgcacaaa tgtccgatgt gcaaagcagc 1140 tcattatttg tatgagtgcg atcagtttaa caaactttca ccaagacatt gctttaatat 1200 cgccaaaaag tacaaactgt gtatcaactg tttacgtgga catcacctcg ccaagaactg 1260 caactcctct aactgcaaat cttgtggaca aaggcaccac accctattgc atctaccgcc 1320 agctgtcgat ggtaatagtt acaacagcaa tcagcgaact acatttacag cgtttgggca 1380 acaggtagaa aatctacaag tacaaacaca aatgtattgt cctcctcatt tgatgggggg 1440 agtgtcgggg gctcgacacc ccaatactat tgattctttt gtccctggga caatagaaac 1500 cccttttgca catgtagctc agtccaatca aaccccaagc cacacaaact actcggctta 1560 ttcggcaatt ctcagcgaag ggaaacctgc acagaaccac tacaacgaac agcacaagat 1620 tagatcagct aaccgtgcag cagaagactc atcgggagtt ctggacacgt cgcgggcaaa 1680 gacagaggtc ttcctcgcaa ccgcaatcgt gcgggtagaa gatcgtgacg ggacgaatca 1740 cttcgcgcgc gtggtgttag atagttgttc tcagtgcaat tttatatcgg aagcggtcgc 1800 caccaagtta cgattacggc ggaccaagac aaatacggaa gtcgcgggaa tcggtccggg 1860 taaagtgcgt gttacggaat cggtcactgt gaagcttcgc tcgcggactc gcgctttcga 1920 aacgtcggtc gtttgcctca ttattgccaa aattccggca attttaccga gccagaatat 1980 cgatattcgc gactggcagg ttccgcgtga cgtgaaattg gctgatccga attcaacatt 2040 agtgcgggtg tggacatttt actcggggcc gaactcttct tctcgttatt gcttagccag 2100 agtaactaaa taaatgtgtg atatccactc ttacaggaag cactgtttgg acatatcgtg 2160 tcagggatgt acgcggggcg tggagcaaac ccatcgctgt gcataatcag cagtctcagc 2220 gggctggatg ccatcctaga gcgattctgg caagtggacg actttgaccg cggtcgagcg 2280 ctgtcgctag acgataggtg gtgtgaggat cactttaatc gcacggtttc gcgtcgggat 2340 gacgggcgtt acgtggtacg gttaccaatc cgcgaagatc gaagaccttt tctcggtgac 2400 tcgtataatc tagcccagcg tcggttcctg gctaacgagc gcagatttgg atacgaccag 2460 cagcttcgcg acgactacct gcgatttatg gacgaatatg ctgaactcgg acacttggaa 2520 ccgggcttac gcacaggtaa agaagaattt aatttgcctc atcacgccat tcggcgaccg 2580 gatagtacaa ctactaaaac tcgcgtcgtt tttgatgcat ccgttaaagg ttgcaatcaa 2640 ctatcattga atgaaatact gcactcgggc ccgacagttc agcctacact attgtcgact 2700 gttttgaatt ttagattgcc aaaatatgtt tttattgccg acattgaaaa aatgttcagg 2760 caaatttgga ttcatccaga agatcgaaaa ttgctaaaaa ttgtttggcg tacaaatacg 2820 caatcacctt tggaaactta tcagttaaaa acggtcacgt atggaactgc ctgcgctcct 2880 ttccaggctg ctagagttct aaataagtta gctgatgatg aagggcatcg ctatccctta 2940 ggagctcata ttttgaaaaa tcacacatac gtggacgaca ctctagcagg aagagacagt 3000 ctgaatgaag caatcgaagc aactgtgcaa cttcggcaac tgttacagtc tggcggattt 3060 aatttgagaa aatggtgttc aaacgatcgc cgagttctgc aacaaatacc aacagacctg 3120 gtggagatgc cccacgagat cgaaatcgat cgctccaata caattaaaac gcttggattg 3180 ttttggaatc cacacgaaga tagcttcgga tttaaaattc ccaagttgtc tgcgttggac 3240 aaaatcacca aacgcatcgt tgcctcagag atggcccagc tatttgaccc ttgcggtcta 3300 attgggtcag tagtcgtgtg ggcaaaaatg tttgttcaag ggttgtggta tgaaaagttc 3360 gcatgggatc aggaacttcc cgaagggctt gccaaatctt ggcacacatt cagagatcag 3420 ttgccatatc ttttggaatt ccgagtacca cggagagtgg tggtatcgag aaagtttatg 3480 ttgcattgtt tttgcgatgc ttccgaaaga ggctacggat gctctgtcta tgtggtgtca 3540 aaagaccaac ataagtcgta tagcaattta ctcagcacga agtctcgagt cgcaccatta 3600 cgagaacacg acattacccg ccttgagctg tgtgcgtttt tgttaggttg ccagttaata 3660 gatacgatat taaccaccac agagtttgta gctccagttg tgttctggac cgattcgtcg 3720 gtggtattac aatggataca aacaccacca gcaatctgga aggtatttgt gtcaaatagg 3780 attgcggaaa tacaaagact catgaaagga cacactttac gacacgtccc aacacatcta 3840 aatcctgctg atagactgtc aaggggaatt cttccaatcg aaatattaaa cgacaaatta 3900 cacgtacacg gaccggattt tatttgtgaa tcaatcgaat cctggcccgc aaccaagttt 3960 gacaaatcag tattgagcgt tgtcgaatcg gaacgcagac caatagttgc tctttttacg 4020 gcgctagttc aaagcgagcc cgaagaaatc tttacaaaat tctctgaatt ggcaaaacta 4080 ttaaaagtga cagtttactg tttaagattt tgcaccattc ttcgagaacg gtctaaggga 4140 caaagtctgg gaaaaataca accgagagaa attgataacg ctctcaagac tttaattcgc 4200 ttagcgcaat cttattcttt tccacacgaa ttgaagttgc tggaaaaatg taaggcagaa 4260 gactgtacgc aaaccgaact caactttaaa tcgctaataa agggtttgga tctattccta 4320 gatgaatttg gattaattag aattaagggt agattagaaa aactgtctgc tccatttgac 4380 acacggtgtc cgattttgtt gcatgatgat caccccttga ctaaattaat cgctcgctct 4440 attcatctcc aaaccttgca tggtgggcca actttaatgc tagctacttt gcgccagaga 4500 ttttggcctt tacacggaag aaagctggtg cgaaatattg cgtaccaatg tgtgcaatgt 4560 tttcgatgta accctagaaa aagggtgcaa attatggccc cattacctac agtaagaatt 4620 acaccagctc gcgtcttcgc gaactgtgga ttagattatt gcggtccctt tttagttcgg 4680 cccttgatcg gaagaggcgc tgcattaaaa atgtatgttg catgtttcat ttgcatggta 4740 tcaaaggcga ttcatttgga gttggtacca gatttatcat cacaggcctg cataaatgca 4800 atcaagagat tgattgcgcg cagaggacgt ttgctcacac tgcactgtga taatgcaacc 4860 acatttaaag gagcagatcg tgaacttaaa gaacttcgaa atcaatatat tgctcagttt 4920 caaactgaga tgtggaacaa tttctgtttg gctcaaggaa ttacattcca tcacattcct 4980 ccacgctctc ctcattttgg aggattatgg gaggctggtg taaaatcctt taagcaccat 5040 tttaagcgaa ttttaggaaa taagtcgttg cgcctcgacg aagttaccac atgtgtgact 5100 caaatagagg gtattttaaa ttctcggccg ttaacgccgc ttacaaatga tgctgatgat 5160 ttaagcgcat tgactcctgg acacctgatt attggagaac cgctattctc cctcccagaa 5220 ccaaacgtgt cagagttgag tatcaaccga ttgaatcgat tccaggaaat gcggcgtacc 5280 gtccaagatt tttggaaacg atggactcgg gattacgttc atcatttgca tcaacgtcca 5340 aaatggcacc aaggaactcg aaaccctcag gtcggcgagt tagttttgtt gaaacaagag 5400 aaccttccgc cgttgcaatg gaacatcgga cgaataaccg ccacacatcc gggcagagac 5460 ggcaggtgca gggtggtcga cgtccgcacc acaaggggaa cttaccggcg tgccgcagca 5520 gaagtatgcc tgctgccaat cgaaaccgag agcagcaaag tagaaaaccc agcccagcac 5580 cagatgcagc tcccgacaag ccagcccgaa ctgtaaaatt gtcagaagct cggttcaagg 5640 agactgatga agcagaaagt gagtaaagga gttgacccag tcaactaggg gcagga 5696 // ID BEL-172_AA-LTR repbase; DNA; INV; 710 BP. XX AC supercont1.269; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-172_AA_; KW BEL-172_AA-I; BEL-172_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-710 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.269; Positions 1510166 1510875. XX SQ Sequence 710 BP; 233 A; 116 C; 127 G; 234 T; 0 other; tgttggcgac gccacccctc atttggcaac ctagccgtct aataacggca gccgtttcta 60 caaccgacaa aacgaagtgt cagaaagcag gagagctatt agacagaatg tatagagaag 120 gatagagcaa tgatacgtgt agtgaagcag atgcagtgaa tttgttatcc taattcttga 180 atcttatgct aatttattat attttctaat tgataaatta acttctacaa ttacttgggt 240 gcataattag cttatcttta agttgcttta aaattcggaa aagtggcagt gaatgaattc 300 gagattagct gaaatatgtt aagccactag tgaattgagg ttagatgtga aatttgatga 360 ttgttcatga tttttctaaa gattaatcta tgtccccata gcaatcctag aagtcaccta 420 caactaaacg taactctcac ttcgataaat ttatcgagta aattaaataa gtaagatctg 480 gagttgtttt tcctttttcg attaactttc taattgacca atgcaaattc attatagaca 540 cccgctacac ccagcaccat ataatttgtt cgtggaatag ttagcttcta tgagtaaacc 600 gaatttgtaa gtgggtattc tataggtaaa ttaaactcat aaataataaa tttatatttt 660 agcttagagc atcacgctaa agaatcggtg ttttgttgta ttcccgaaca 710 // ID Chapaev-N1_AAe repbase; DNA; INV; 2794 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.03, Created) DT 23-DEC-2010 (Rel. 16.03, Last updated, Version -1) XX DE A non-autonomous Chapaev DNA transposon family from Aedes DE aegypti. XX KW Chapaev; DNA transposon; Transposable Element; nonautonomous; KW Chapaev-N1_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2794 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(3), 832-832 (2011). XX DR [2] (Consensus) XX CC >97% identical to consensus. 4-bp TSDs. TIRs are ~300 bp long. CC Both terminal ~150 bp are ~74% identical to Chapaev-1_AA. XX SQ Sequence 2794 BP; 902 A; 471 C; 483 G; 938 T; 0 other; cacggcgtgt atatgggaaa ggtttatcac aaaaaaatag gcatcgctaa atctttcacc 60 attcgaaaat ataggttttt agctttcatt tgacgtgttc cccaaaaaaa tccaccgagg 120 gatcccgaac attttttttt atttaaaatt tttgttcttt agagtacatt atttttaaat 180 gtaatcattg caaaagacag cttcgttgga taaaagtttt atttccatat aaaagttcat 240 aaaataaata tatatacata tctattgagt ttcattttgt aaaaaacaaa actgtcctat 300 gtgattttga ggatttattg agctatagga cacttattcg attagatatg ttgtaagttc 360 gactgttcac ttattcatac tgtggaagta tatctaaccg agcactgttt cttttcatgg 420 ttagaataca ttaaatttat tcttcttctg tctggcatta cacctctgct gggacatagc 480 ctgcttccca gcttagagtt cgttgagtac ttagttattg attgagaatt ttctttgcca 540 atcaaccgtt aaagattaaa agtagaatgg atcaacatta caacggccta cgaaagtaaa 600 tggaatatgc ttaaacgccc tgttacccaa gccttagcaa aaagcttaaa aaatcctcat 660 taagaatatt atccaggcac ccggagtaag tgacactaca cagacttttt taatagtatt 720 gaagttattt tgatgttttt agcttggaat ttcaaaattt aataagaata tagtggaaga 780 tatttcctgt taaatcaaat cataaaaata attccaccgt gaaagtggtt ttctgacaag 840 cagaacaaca ataatacaat tttcacaatt actgcagtaa ttaaaacacg taagtagctt 900 acaaacaaga aagttttacc attttatact ccttttagat tgatatcata ttcatataat 960 atatgttttc aacatgaaat atggttaatt atgtgattat tggcaacggt aagtgttatg 1020 aggaagaata aaaagttaaa tttcatgtgc ccacttgccc cgtagagttg agtaactgca 1080 atttacagta gatttttatg actttattct aagggtttca atatctcgaa aagatacctt 1140 cccattcgtg cggacatcag cgaatacagg gtgcaacttt tgcaattttt tgggctattt 1200 catgaaagca acatcaatca agtggctgca tttattggtc gtagcaaaat gagctccata 1260 cagcatgcat tttacattac cttttggagc tttttggggg gttttgcgat gatagcgact 1320 caagagtctg ccttgagctc tgcctctgtt tttacagtag ctccataaac ttaagtcagg 1380 aggagtcaaa ttaggatctc caaattcaaa tcatgtgtcc cttttttatg tcagatgtcc 1440 atggaatttc cgttgcagag acataatgaa caaacttcca aagatatcga atttatcgac 1500 tgtttcgtac tgtttgaaaa aaaaaatgac tcaatgattc ttttggtctg gtccaaactg 1560 ttagtactta cttggtcatg gaatggcgtt tcttgaatct tgaatagatt ttctgttcct 1620 caaatgtggc atttttgctt atttatgaag ccagattgtg cctgttgacc acgatatttt 1680 caaggatata acagttgagt taacacaact gcaagactat tttcaattca aagagtcact 1740 attttggcat gttgtttttt tatatcatat taattgtgga aaatatgaca atattaaccc 1800 agcagcaagt gatattgatt tgtaaaaatg gttcaaaaga gctctatttc gtatcaatcg 1860 tttggcagcg tccatttatt acgtaacgct aaaatcggaa atttttgacc acctcacccc 1920 atccccctcc gtaacgtttt ttgtatggaa aaattaaacg ctattccccc ctccccctag 1980 agcgttacgt agtttgtgga attttcagga tggttgatta gatctcatcg gtcgagtata 2040 tttttcttac acacaatata aaaaaaaaaa caatcatgat aggcgaatca aattctcagt 2100 taatgagtaa ccgaagaact cttatttgag aacctggctt tgtctcaata ggggtgtaat 2160 tccagaaaga aggaaaagaa aattaatctt ttctaaatat gaacagaaac agttatacac 2220 tagaatcaaa tacacactta aatccgaatg ccgatctcag ctgtgcaaat ctcggtaaaa 2280 ggtcgtttgt tgacatctta gtaaaagtga cgtttggtaa cggcacccaa aggtgctgtt 2340 ttaagtaaac ttgatgttag gctgacatat cagttaaaat taatttgctt accgctcagc 2400 tgtgcggatc ttggtaaaag aatgaaaatt agttgaactc aactgagtgt gtcctccact 2460 gtacgagtaa gtgtcgtatc cccaagatca cataggacag ttatgtctat tacaatatgt 2520 aatcttatat atctatgtta tatcaacttt catttgaaat ttgggctacc atcgagcttg 2580 agagaaaaaa aaactgtctt ttaccatgat tatatttgaa aatattgtac ttcaaggaac 2640 aaaaatttaa aataaaaaaa aatgttcggg atccctcggt ggattttttt ggggaacccg 2700 tcaaatgaaa gctaaaaacc tatattttcg aatgctgaaa gatttagcga tgcctatttt 2760 tttgtgataa tattgttcta ccagacacgc cgtg 2794 // ID L2B-4B_AAe repbase; DNA; INV; 5210 BP. XX AC . XX DT 07-OCT-2010 (Rel. 15.1, Created) DT 07-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L2B clade non-LTR retrotransposon family from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2_Ele5; KW L2B-4B_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5210 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5210 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (14-OCT-2010). XX DR [2] (Consensus) XX CC [1] Originally named as L2_Ele5. CC [2] Consensus update. ~96% identical to consensus. The consensus CC is ~99% identical to the original sequence in [1] and ~80% CC identical to L2B-4_AAe. It belongs to the L2B clade and renamed. XX FH Key Location/Qualifiers FT CDS 305..1720 FT /product="L2B-4B_AAe_1p" FT /translation="MTECKQCGAPVLISEIPTTCTGCLSVYHYKCTPLTKA FT VAKLVSENENLLFKCSDCLSSQCCGEQNLTLPTVRMFEEEMKKISSISDSF FT GGVREQIAAQINDALKTGMEELVRSLNSTMDKAIFEMSKSVSKELESMKCS FT FFEFNSNNNLVAMNDKHGTSSTASEPSLLPRGKKRKVQEDVCAMSDDVFVE FT SISFADVVKSNAGIKIRNKSTKKKSTANDVMQVTRKARPVIVIKPKESNQS FT SEDTRKILINKLDPKKHKISNLRHGKDGSIIAECATGYNVNVVKDGIQSDL FT GENYNAVVPSSVPRLLVKGMSDQFSSDEFIRILKDQNEDIAINEVKVIRMY FT ENPHFKYKKYNVVIEVDKETHSCLLTAQQVYVKFDRCRVVPEISVLRCFKC FT GEFGHMSTNCKNCDACCRCSGSHKTSECTSTVLKCINCIKMNKERKMNLDV FT NHGAFSYECEVFKNLYQRKKSSLHFNE" FT CDS 1724..4534 FT /product="L2B-4B_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QPSSSSNVNQFDILYLNIAGLSTNYVELRQIVEEKRP FT YLVFLSETHIVDIDAFDQYSIPGYNVAACLSHSRHTGGVAIYVKESVQYRL FT QLNEVCEGNWFLGIAVDRGMKMGNYGVLYHSPSSSDQRFIEILENWLEMFV FT DCSKLNVLAGDFNINWCDNSNSNHLKRLADFFNLKQKVNVYTRISRHSQTL FT IDHVYTNLDSVSVIKNSDLKVTDHETLVINIDDNSRNKSDFVKLKCWKKYS FT KHAVSSLVERNVDFHTNDGSLDQKSAVLTDVLKTCTNKLVEHKFVSLKNSN FT SWYNLDLLRCKRNRDKLYKKWRRNNNNTNWNRYKVARNKYSQMVKQTRCEY FT IQRKIDQHQNNSKELWKILKSLLKPSNCKPRSITFNGTLEESEQVIACKFN FT EYFVDSVSSINQSIELVDEPDEIKQPINNNCTFECFHPITLEDLRNICFSM FT GKTAGVDNVNARVIQDCFDVIGHNLLDLINESLQTGHVPQVWKESLVIPIQ FT KVAGTIKAEEFRPINMLHTLEKILELVVKGQLLAYLNSNNLLIPEQSGYRE FT GHSCETALNLVLAKWKENVERKDTIVAVFLDLKRAFETISRPLLLRTIKRF FT GFSGSAYRWFESYLTDRTQRTAFNDSVSSPVVNSLGVPQGSVLGPVLFIMY FT INDMRRVLRFCDINLFADDTVIFIAAKNLDDAVAHMNADLRSLSRWLKYKQ FT LKLNISKTKFMLISRNRTNIDVSIQIDDETIDRVREIKYLGVIIDDRLKFD FT AHIDNVIKKIAKKYGILCRLKNDLTIASKIHLYKSIISPHLEFCASILFLA FT NETQISRMQRLQNKIMRLILRCNRFTSSSFLLDALQWLSVKQRIVFLTMVF FT IFKVINGLLPRYLCDRIERGSDVHRYNTRNATELRTPQFLYGASQNSLFFK FT GINVFNSMPIHIKRAATLSQFKRLCISHVKAAF" XX SQ Sequence 5210 BP; 1661 A; 796 C; 1107 G; 1646 T; 0 other; ttcttgatgg taatggtgaa cagtgtagtt tgtttctgca gtgcagtgat aaataaatta 60 aattacgtgt gaaatagtgg acttgccgcg gaacaacgat ttctatcgcg agcaacattg 120 tattttgaaa caggataagt aggaggcaaa ttgtctctaa cgcttgtcct cgtttaattt 180 tgcatttaat tttgccccag cagctgctgg aggctggtaa cttttttgta cactgtattg 240 tttcatctgt gttaagtgtt gcatgcatgt ttcgtttggt ggaagtgtat aggtgctaag 300 caaaatgact gagtgcaagc aatgtggggc cccggtatta atttctgaaa taccgaccac 360 atgtacaggt tgtttgagtg tgtaccatta caagtgcaca ccactgacga aagctgttgc 420 taagcttgtc agtgagaacg aaaatctgct atttaaatgt agtgattgtt tatcaagcca 480 gtgttgtggc gagcagaatt taacgctccc gaccgtacga atgttcgaag aagagatgaa 540 gaagatttct tctatttcgg attcgttcgg tggcgttcgg gagcaaattg ctgctcagat 600 aaacgacgcg ttaaaaaccg ggatggagga attggtgaga agtctaaata gtaccatgga 660 taaagcaatt tttgaaatgt caaaatctgt gagcaaagag ttggaaagta tgaaatgttc 720 attttttgaa tttaactcga ataataattt agttgcaatg aatgataagc acggtacatc 780 aagtacggca tcggagccga gcctgcttcc cagaggcaag aaacgtaagg ttcaggaaga 840 cgtttgcgct atgtcggatg atgttttcgt tgaaagtatt agtttcgcgg atgttgtcaa 900 gagtaatgct gggattaaaa ttaggaataa gagtacaaaa aagaaaagca ctgcgaatga 960 tgtaatgcaa gtgactcgta aggctcgtcc ggtcattgtg atcaagccaa aagagtccaa 1020 ccagagtagc gaggatactc gcaaaatttt gattaacaaa ttggatccga aaaaacacaa 1080 aataagtaac ttgagacacg ggaaagatgg ctctattatt gctgagtgtg caacaggata 1140 caatgttaat gttgtgaaag atggcattca aagcgatttg ggtgaaaact acaacgctgt 1200 tgttccctca tcggtgccaa gacttttagt gaagggcatg agcgatcagt tttcctccga 1260 tgaattcatt cggattttaa aagatcagaa tgaagacatc gcgataaatg aagtcaaagt 1320 aataaggatg tatgaaaatc cacatttcaa gtacaaaaag tacaatgttg ttattgaagt 1380 tgacaaagag acgcatagtt gtttgttaac ggcgcaacaa gtgtacgtta aatttgatcg 1440 gtgtcgtgtt gttccagaga ttagtgtttt gagatgtttc aaatgtggag aattcgggca 1500 catgagcacg aattgtaaaa attgcgacgc atgctgtagg tgcagtggaa gtcacaagac 1560 atccgaatgt acgtcaactg ttttgaaatg tatcaattgt ataaaaatga ataaagaacg 1620 taaaatgaac ttggacgtca accacggagc attcagttat gaatgtgaag tctttaagaa 1680 tttgtatcaa agaaaaaaga gcagcctaca tttcaatgaa tagcaaccaa gttccagtag 1740 taatgtgaat cagttcgata tcttgtattt aaatatagcc ggtttatcta caaactacgt 1800 tgaattacgt cagattgtag aagaaaaacg tccatatcta gtatttttat cagaaacgca 1860 tattgttgac atagatgcat ttgatcaata tagtattccg ggttataatg tggctgcatg 1920 tttatcacat tcgagacaca ctggcggagt tgctatttat gtcaaagaat cagttcagta 1980 caggcttcaa ttgaacgaag tttgtgaagg taattggttc ttgggcatcg cagttgatcg 2040 tggcatgaag atgggtaatt acggtgtatt gtatcactct cccagttcaa gtgaccagcg 2100 ttttattgaa attttggaaa actggcttga gatgtttgta gattgtagta aattaaatgt 2160 tcttgctggt gactttaata ttaattggtg tgacaattcg aattcgaatc atttgaaacg 2220 attagctgac ttttttaatt taaaacaaaa ggtaaatgtt tatacacgta tttccagaca 2280 tagtcaaact ttgattgatc acgtttatac taatttggat tctgttagtg ttattaaaaa 2340 ttctgattta aaagtaacag accatgaaac attagttatt aatattgatg acaatagtag 2400 aaataaaagt gactttgtaa aattaaagtg ttggaaaaag tattcaaaac atgccgtttc 2460 aagtcttgtc gaaagaaatg ttgattttca tacaaacgac ggttcgttag atcaaaagtc 2520 agctgttcta actgatgttt taaaaacctg taccaataaa ttagtcgaac ataaatttgt 2580 ttcgttgaaa aattcgaaca gctggtacaa tttggatttg ttgcgttgta aacgcaatag 2640 ggacaaactg tacaaaaaat ggcgtagaaa taataataat acaaattgga ataggtacaa 2700 ggtcgcgcgt aataaatact cgcaaatggt taaacaaact cgttgcgagt atattcagag 2760 gaaaattgat cagcatcaaa acaacagcaa agagttatgg aaaattttga aatccttgtt 2820 aaaacctagt aattgtaaac cgcggtccat aactttcaat ggcacattag aagaatcaga 2880 acaagtaata gcatgtaaat ttaatgagta ttttgtggac agtgtttcat cgattaacca 2940 atccattgag ttagtagatg aacctgacga aataaaacag ccgattaaca ataattgcac 3000 atttgaatgt tttcacccaa taacgttgga agacctacga aatatttgct tttctatggg 3060 aaaaacggct ggcgtagata acgttaatgc gagagtaata caagattgct ttgatgtcat 3120 cggccacaat ctgctggacc tgattaatga atcgttacaa actgggcacg tgccacaagt 3180 ttggaaggaa tcacttgtga ttcctattca aaaagttgct gggacgatta aagccgaaga 3240 gtttcgtccc atcaacatgt tgcacacatt agagaaaatt ttagaacttg ttgtaaaagg 3300 ccagctgtta gcatatttaa atagtaataa tttgctaata ccagagcaat cgggatatcg 3360 ggaaggtcat tcctgtgaaa ccgcattgaa cctggtgttg gcaaagtgga aagaaaacgt 3420 agagcgtaaa gatacgattg ttgctgtttt tttggatctc aaacgcgctt ttgaaacgat 3480 ttctcggccc ttgttgttaa gaacaatcaa gcgctttgga ttttcgggtt ctgcatatag 3540 atggtttgag agctatttaa ctgatagaac ccaacggact gctttcaatg attctgtttc 3600 gagccccgtg gtgaacagcc ttggcgtacc acaggggagt gtattagggc ccgttttatt 3660 cattatgtat ataaatgaca tgcgacgagt tttacgtttt tgtgacataa atctttttgc 3720 agatgatacc gtgatattca ttgcggctaa aaatcttgat gatgccgttg cacatatgaa 3780 tgcagattta cgttctctga gtagatggtt gaagtacaaa cagttgaaat tgaatataag 3840 taaaactaaa tttatgttga tatcgcggaa tcgaacaaat atagacgtct caatacaaat 3900 tgatgatgag acaattgatc gcgtgcgcga aattaaatat cttggcgtga ttattgatga 3960 cagattaaag ttcgacgctc acatcgacaa tgttatcaag aaaatagcca agaagtatgg 4020 aatactctgc cgattaaaaa acgatttaac gattgctagt aaaatacatt tgtacaaatc 4080 aatcatctct ccacatctgg agttttgcgc ttccattttg tttttagcaa atgaaacaca 4140 aatatcgaga atgcagcgtt tgcaaaataa aataatgcgt ttgattttga gatgcaacag 4200 attcacttcc tcgtctttct tattagatgc tcttcagtgg ttatcggtga agcaaagaat 4260 tgtattcttg acgatggtgt tcatttttaa agtaattaac ggtttgttgc ctcgatattt 4320 gtgtgatcga attgaaagag gaagtgacgt tcatagatat aatactagaa acgcgactga 4380 attaagaaca ccacagtttt tgtacggtgc ttcacaaaac tctttgtttt ttaaaggaat 4440 aaatgttttc aattcgatgc ctatacacat caaacgtgct gcaactctat cacagtttaa 4500 gagactatgt atttcacacg tcaaagctgc cttctgaaca gctacatgtt aacttttttg 4560 tacttgacta atattgtatc ttgacgaagt tttgactaac ggatattttg tattgactat 4620 attgtaaaat atgtttaata attattgggg cactaataaa ctacgatgtt cgcctcgatg 4680 atgatgatgg attttttata tattgacgta gttatagttt attccttttt atgtgtagtc 4740 tacggaggtt tgagtcccgc gcgcggtaca caggtataaa cgactgtttg attacttatc 4800 attcgaattg agaatttatg ctgttggtgg atgccatgca ttagtagtaa gttttctggt 4860 tgaatgtcgg aaatttcttg aaaatgcaaa tgtttccgat agttttgaca tcatcggcgg 4920 ggttatcaat tctggcaatg ttttggattt tgttgaactt tcttgatcac tattaaaact 4980 tatgaaaagt gtggttgaga ccaacagatt tttaccattt tttttttgct gtatagtatg 5040 ggaacttgta atgttgttta tatctgttga aataatcaga ccgtttattg tttgcaaatc 5100 tggttaacat tgatcataat taattatctt aaagatatat cgtctcgctc aaacctttgt 5160 aggggtatgt ggtgggacca tcatcatcat catcatcatc atcatcatca 5210 // ID Gypsy-97_CQ-LTR repbase; DNA; INV; 675 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-97_CQ_; KW Gypsy-97_CQ-I; Gypsy-97_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-675 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 574-574 (2011). XX DR [2] (Consensus) XX SQ Sequence 675 BP; 245 A; 101 C; 160 G; 169 T; 0 other; tgtagcgacc ctcagttaat cagaatataa atttagattt ttagttcgac caacagggaa 60 gaatttgtca gaagtacatt ggtctttaca atcagacgaa gtgcaacatc ttcagatgca 120 ctcattaaca catgttgaca cgagtcaagt gttatatttc taggagcacg taggttaaga 180 tataaataaa cacagtagga ggatggaagg aaaagcacaa agcgaaaaca gaagggtcat 240 gaactctgcc aggatccctg ctgagaattt tcgagcagaa agatttcaag atcaacgcgg 300 tgtagtcaaa acgcacgtgg ctgagctact cccgtagaga tgaatattgc gcatgcgtct 360 tgactaaaaa cgcttcagag tataaaagcc taagttagat ttagaatcag gggagttgaa 420 attaggaaag agttaagagt gggagttgag agttgcagtt gggagttgga gttgattatt 480 cggtgtaagt gctacgatgc acagattcaa ccaaaatgcc acggggcata gtgaaaaatg 540 ttaaaataag attaagaaaa atctaagaat acatattttg gagttaaagt gaaaatggtg 600 tcgttctgtg ctaaaggaat aaaaatgaaa gaaatcccaa tgatattaga gactaggctt 660 acacaagctt acaca 675 // ID Gypsy-9_DVir-LTR repbase; DNA; INV; 416 BP. XX AC scaffold_13045; XX DT 10-MAR-2011 (Rel. 16.03, Created) DT 10-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_DVir_; KW Gypsy-9_DVir-I; Gypsy-9_DVir-LTR. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-416 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (09-MAR-2011). XX DR Genome; scaffold_13045; Positions 1191346 1191761. XX SQ Sequence 416 BP; 108 A; 88 C; 116 G; 104 T; 0 other; tgtcgcagtg tgataggtca gaaaggccgt ttcgcagtgt gattcctagt aattgtacgc 60 gggtacattc ctcagtagca ttctgggaat tcgaggattc ggggaattcc ctgcttttgc 120 tttactggct gatattgaac gaaagcacgg cagcaaaggc agagacgcta tgctgctgag 180 tgaggttcgg tggtaagata ccaagagtct tgagtaagac agagatggct acatacactc 240 gacatgagtg agtttacgac atgagtgagt ttagacatga gtgcgtaacc tgacatgggc 300 agttgagctt gggtatttga accgatcgga actcgctgtg cagccgcgtc aatcatcaga 360 agtatcaaca tggatgacat acctttgctc cgacagacat acccctgccg ccgaca 416 // ID hAT-2_TV repbase; DNA; INV; 3644 BP. XX AC . XX DT 08-OCT-2008 (Rel. 13.1, Created) DT 08-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE hAT transposons from Trichomonas vaginalis. XX KW hAT; DNA transposon; Transposable Element; hAT-1_TV; hAT-2_TV. XX OS Trichomonas vaginalis OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; OC Trichomonas. XX RN [1] RP 1-3644 RA Bao W. and Jurka J.; RT "hAT transposons from Trichomonas vaginalis."; RL Repbase Reports 8(10), 1197-1197 (2008). XX DR [1] (Consensus) XX CC Consensus was built from a few copies. The terminal 4 or 5 base CC pair is identical to that of hAT-1_TV. The TSD is 8-bp long, and CC the TIR is 11-bp long. CC This sequence was derived from sequence data generated by TIGR CC Database: The Trichomonas vaginalis Genome Sequencing Project. XX FH Key Location/Qualifiers FT CDS 357..2546 FT /product="hAT-2_TV_1p" FT /translation="MTETQDEYESKFEGWIDCKLPPKNIPRKVPPLPKFED FT TKFSTNNKCRVCFNIAYLPDTSEKSKERKRLYCYCLAIHYNLLTKTYMRCI FT YHRKPSEFDEDHQHIYQIQCENNPYTRRDLIKQDRKLLYVVKFVVDTNMSI FT NMAASGVLEDFARHVYSTFKGVINFQLKIPYRRDEIRDGIITLADLVAPKI FT HVELDKMGYYSITIDAASHGSRHGFFVMASNPAKMKNHIFIDTMIADGWNA FT RDYNNFFECFTFPKDFLSAIVADGCPAQVKGLCHWRRCGALYPGLQTVYIR FT CVAHLLNLVVKHTADNCAAFAMQINYCKAISKELSKPDTRRALNMRVIHVP FT DHRWMFFWDLLKWIKEKHPKIYELFNMDDPPESVQILRDMGIDFKDGLPQW FT LLNFEEILLPIKKLLNYAESDVGCFCYIYPMFQSTIEQLTGIANRPELSSS FT SKRIADALIANMNEVFDKQGNYALMKFAFGLTPWGRSVIRAELNMHTEYKP FT DVSLFFQEPIPINAELEIPPRSYTPDHLDEVAEDQLDRNLDIEEPEIENEA FT IEGDGQVDAEDIAQDEEVEENEPRRRLGRNLGEWASLSMQEVEEYSFNDFM FT IEMLSRRAEAEARFYKRSSAKSIANEYKKALSNFIHMNESEIRNNTDLTCL FT DQRFWVACDANFKALKEIALKFMSIAATEADVERMISIVRNLRNRFNQNAR FT EDLIKCRILLKLFMTNEILKKIFPDLTDL*" XX SQ Sequence 3644 BP; 1178 A; 626 C; 694 G; 1143 T; 3 other; tagggttgtt agattcccga cgataggtgt cttggaagat ttgcctggtg tcatcatagt 60 gtctttaaat caaaaatttt attttggagt tagttttatt gatttttcaa taaaaaacat 120 ttctaaaaca agaaaataaa acgataaaat atttatattt tatataagga tcttatatat 180 ttaaaataaa tatcatttta tttttttcat tggtgtctgt caaaccataa tacactattc 240 aaaaattcaa tgtgtcttgc ggtttcaaaa gacaccttca aattgaacga tagcacagac 300 actattgaat tttacccatt ttatttagtt tttacccttt ttatttttaa gaattaatga 360 ctgaaacaca agatgaatat gaatccaaat ttgagggatg gattgattgc aaacttcctc 420 ccaaaaatat ccctcgaaaa gtacctccct taccaaaatt cgaggatact aaattctcaa 480 ctaataataa atgtcgagta tgtttcaata ttgcttatct tccagataca agcgaaaaat 540 cgaaagaacg aaaaagactt tattgttatt gtttagcgat acattacaat ttactaacaa 600 aaacatatat gcgctgcatt tatcatcgaa aaccatcaga atttgatgaa gatcatcagc 660 atatatatca aatacaatgt gaaaataatc catatacgag aagagattta ataaaacaag 720 atagaaaatt actttatgtt gtaaaatttg ttgtggatac caacatgtca atcaatatgg 780 ccgcttcagg tgtactagaa gatttcgccc gacatgtcta ctccaccttc aagggagtaa 840 tcaattttca actcaagatc ccatatagaa gagacgagat aagggatgga atcattacat 900 tagccgactt ggtagctcca aaaatacacg ttgagcttga caagatgggc tactattcga 960 taacaattga tgcggcatca cacggaagcc ggcatggctt ttttgtgatg gctagcaatc 1020 cagcaaaaat gaagaaccat atcttcattg atactatgat cgctgatggt tggaatgcca 1080 gggattacaa taatttcttt gaatgtttta ccttcccaaa ggatttcctt tctgcaattg 1140 ttgccgatgg atgccccgcc caagttaaag ggctgtgcca ctggaggaga tgcggagcac 1200 tatatccggg tcttcaaacc gtttatatcc gatgtgttgc gcatctcctc aacttagtgg 1260 tgaagcatac ggcagacaat tgtgcagctt tcgcaatgca aattaattat tgcaaagcga 1320 taagcaagga gttatcgaag ccggacacga gaagggcatt gaatatgagg gttattcatg 1380 ttccggatca tcgctggatg tttttctggg atcttctgaa gtggataaaa gaaaaacacc 1440 ctaaaattta tgaattgttc aatatggatg atccaccgga atctgttcag atcttaaggg 1500 atatgggaat tgatttcaag gatggacttc cacaatggtt gttaaatttc gaagagattc 1560 ttttaccaat caagaaactc ttgaattatg cagagagtga cgttggctgc ttctgctaca 1620 tctatccaat gtttcaatcg acgattgagc agttgaccgg aattgcaaat cgtcctgaat 1680 tatcaagttc gtcaaagcgc attgcagatg cattaatcgc caacatgaat gaagtttttg 1740 acaaacaggg caactacgct ttgatgaaat ttgcctttgg tctaactcca tggggccggt 1800 cagtcatccg cgcagaattg aacatgcata ctgagtacaa gccagacgtt tcactcttct 1860 tccaagaacc aatacctatc aatgctgaac ttgaaatccc tccaaggagc tacacccctg 1920 atcatttgga tgaagtggca gaggatcaat tggacagaaa tctagatata gaagagcccg 1980 agattgagaa cgaagcaatt gaaggtgatg gccaagtaga tgcagaagac attgctcagg 2040 atgaagaagt tgaagagaat gaaccaagga ggagactagg caggaacctt ggtgaatggg 2100 cgtctctgtc aatgcaagaa gtggaagagt acagcttcaa cgattttatg atcgagatgc 2160 tttcgagacg tgcagaagca gaagcgaggt tctacaagag aagttcggcc aagagcatag 2220 ctaatgagta caagaaagct ctttccaact ttatacatat gaatgagagt gaaatcagga 2280 acaacaccga cctcacctgc ttggatcaaa ggttttgggt agcatgtgat gcaaatttca 2340 aggcattgaa agagattgcc ttaaaattca tgtctattgc tgcaaccgaa gctgatgtgg 2400 aaagaatgat ttcaattgtg cggaacctgc gtaataggtt taatcaaaac gcacgtgaag 2460 atctaataaa gtgtagaatc ctattgaagc ttttcatgac aaatgaaatc ctgaaaaaga 2520 tatttccaga tctgactgat ctttaacttg ccgtagcctc ttgctgccgt agctccttga 2580 tgctgcttga cctgcygtag ctccttgatg ctgcttgatt gtagatgttr gctycgaatg 2640 ctgcagcttg acctactgta gccccttgat gctgcttgat tgtagatgtt ggccttgaat 2700 gatgccgctt gacctgccgt agccccttgc tgccgtagtt cgttgctgcc gtagctcctt 2760 gctgctgctt gatcgtagat gttgcccttg aatgctgccg cttgattgta gatgttgccc 2820 ttgaatgctg ccgcttgacc gtagatgttg gctgtgaatg ttatcgcttc aaccgctagc 2880 tcctaactgc ctccaaacag ccgttttgct atgacctcaa tcacttttat ttattatttt 2940 aaatattatt taattttatt atttgaatag gccaacttat atttgagaag ctcaatttga 3000 gaccacttaa tacattgaaa tatcataaat attgaaatat taaaaagtgt atttcaaaat 3060 attgaaatta taaaaattta taatactgaa attgagacaa attaataaat tcaatgtgtt 3120 ttttttgaaa taactctcaa atattatttt tttgtttatt taatgatgaa taaatatgat 3180 tgctttagtc aatcataaaa actaatttaa gtttaaaata atgaaaggat aatttgatcg 3240 ttttttaaaa tacaattatt tttgatgatt tgcaagaatc tttgaaaaga atataatata 3300 ggttctattt ttgtaaaatt tatccaaatc atgtattaca taaattacga ttaaaacgat 3360 gaaagaacca aaaagtgtgt tttgtgtttt gttgttgcgt gttggatgcg ggatcctgtt 3420 atttccgaaa tcgacaaaaa atttggaaat ttaatttccg aaatcgaaat tatattaatg 3480 tttttaatat tcttatatca aaaggattaa agttcattaa gagaaaacaa ttcttcttct 3540 tcatctattg gcgcatttat ttctacacgc caataaggaa gggaaaatta taagacacca 3600 tggtgtcttc aaaacgctaa tttttgatat ttttaacaac ccta 3644 // ID Hopers2 repbase; DNA; INV; 2756 BP. XX AC . XX DT 08-OCT-2009 (Rel. 14.1, Created) DT 08-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Hopers2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Hopers2. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-2756 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1521..2381 FT /product="Hopers2_1p" FT /translation="MRKTIVLSFFYLFAGTLIRTKVEQCLNSFGLDLNDAI FT IVTDRGSNMISAFKESAHIHCINHLLNNVVEKAFKAVPELLQIEKKCAKLA FT KYFKRSGQNLKLELSLKGVCQTRWNTIFYTMKSIEHNWHDVVQILNEQNEM FT NRLDGLNYSDITAITNVLSKIEEISKFLEADEYPTMHWVIPRIHALQKFIK FT DANSDKDIEQKFSWSLSMHIDSVLLKYITKYHRIALFLFPPTNQLRQVDRT FT LTIEECKEMMRSFLTDDKENLDLSATCSNKFEEFLADFIEPLSGVK" XX SQ Sequence 2756 BP; 826 A; 505 C; 599 G; 826 T; 0 other; tcagattttg cactctagaa gatatagtca tcttctacga ttctgcgttt ttagttttct 60 cgtatcgtcg aaattgtgga tgccacagat tttcgccctt tgtgggggcg gaagtgggcg 120 gggcaaagtt ttgaaatatt tgtagagcag tgacatatca cagaagtctg gatccaaaac 180 atcgttgctc tagctcttat agtctttgag cattaggcgc tgaaggggac ggacagacgg 240 acggacgggc agacagacag ggctcaatcg actcggctat tgatgctgat caagaattag 300 agccctgcac gagtacacac tacttcgtat taacagttag gagccacact gtgcacagta 360 tcaaaacatg attcatgtat ccttttcaca ctatctttca attcattttc tttacttcat 420 ttgctgactc attgactcaa ggcataattg tcgagtgact cctgtcaaat cgtatgactc 480 gcactcacac tgttactaaa aacgttgtga cagagagaga gtgtatgagg tgtctctctc 540 gctcaatttg ttgtgcattg ttttgctcac tgcagtgtgg aatgtgtgag tcaaacgaat 600 cgaaagaaaa gaaaacgtca aaaagctaac gagaaggaca ttttcattta attttttgtc 660 ggtgcttgta ctgtttgcta attactgaaa atggaaaatt cgattgttgg ttcttttgat 720 tatgtaagtt tagtagaaaa atttatgcgt gtgtttctat tcgaatatta tatgcacatt 780 taattcgcag tccttgtcac catctggctg cgaattacca ggcgatataa aagcaaaaat 840 aatggctggc caatacaaaa ttagccaaaa acgtggcaaa agcaaggtgt gggaagtttt 900 tggtcctgtc gttggagagg acagcaatgt tattccaaac atcgtggcat gtcgaaattg 960 ttttacgatt ttcaagttta ctggtgccac gtccaatttg gtccggcata agtgctataa 1020 ggttaccaaa tctgaagaaa acggtaacgt ttatgaacct gtcgtagatg ccgaaacgaa 1080 attatcggtg acaaaggtcg tgacgaagtg ggtcgttcaa aattgccgac catttaatat 1140 tatcagcgat tcaggactta aggacgttat aaccgtggcc cttgatattg gaagtcgttt 1200 tggaccaaac ataaacgtta gcaacctgct acctcatcca acaactgtat cgcgtaatgt 1260 cacagattta tacaataagg aacttcgagc tgtcacagaa gaaatagcct taatgaaagg 1320 taatggctac actattacgt cggacgcatg gacagacacc ttttcgcaga agtcttattt 1380 atgtgtaact ttacactata tatttaaagg aagcatgaaa aaccgtttgt tggaaatgat 1440 ctgcattaac ggtgaatctg caacaggtac gtgtttatac ccttacagag ggtataaatt 1500 atttacgaaa tgtgaggcaa atgaggaaga caattgttct ttctttcttt tatttatttg 1560 caggcacatt aattcgtact aaagtggaac aatgtttgaa ttcttttggg ctagacttaa 1620 acgacgccat tattgtaaca gaccgtggat caaacatgat atctgcattt aaggaatcag 1680 ctcatattca ttgtataaat catttgctga acaacgtcgt tgaaaaagca ttcaaagcag 1740 ttcctgaatt gcttcaaatc gaaaaaaaat gcgcaaaact ggctaaatat ttcaagaggt 1800 ccggacagaa cttaaaactc gagttgagtc ttaagggtgt atgtcagacg agatggaata 1860 caatttttta tacaatgaaa tcgatagagc acaattggca tgacgtcgtt caaattttaa 1920 atgaacaaaa cgaaatgaat cgtttagatg ggttgaacta tagcgacatt accgcaatca 1980 caaatgtact ctctaaaatc gaggaaattt cgaaattcct ggaggcggac gaatatccga 2040 ccatgcactg ggttattccc cgcattcatg ctcttcaaaa gttcataaaa gacgctaatt 2100 cagataagga tattgaacaa aaatttagct ggtccttgag catgcatatc gactcagtac 2160 ttttaaagta cataacgaag taccacagaa ttgcgttgtt tttattccct ccaacaaatc 2220 aattgagaca agtggataga accctcacga ttgaagagtg taaggaaatg atgcgttcat 2280 ttctcactga tgacaaagag aacttagacc tgtcggcgac ttgttcgaat aaattcgaag 2340 agtttctagc tgattttatc gaaccgcttt caggtgtaaa atgaaatgtt caaatccttt 2400 ttgttggatc gttggggctc aaccctccga tctgttttat tgtcatttct gttctgtatt 2460 ttacaattgt ggtttgggct tttgtaagcc aaaagccgcg gagctgcgct gccagtgacg 2520 ccggcagcgg agcggcagaa ctatgtgttc ttatctctta tatgttaata tgtatatgct 2580 cagcgctgtt ggtagcgaga gcgcgaggga agatatatgt gtgtatgcat gtatattagt 2640 gccgtcaagt catgggaaca tgctgacgct gcacttgggt cgctcgaggc attcttgagt 2700 gaatcgtaat acgatcaccc agaatgtctg ggactttatt gttacgtaaa cactga 2756 // ID TTAA27_AP repbase; DNA; INV; 289 BP. XX AC . XX DT 30-AUG-2009 (Rel. 14.09, Created) DT 30-AUG-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; nonautonomous; TTAA27_AP. XX NM TTAA27_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-289 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2096-2096 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC TSD is TTAA tetranucleotide. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea CC Aphid Genome Project. XX SQ Sequence 289 BP; 79 A; 57 C; 46 G; 107 T; 0 other; cacgttcgct gcggatgacg ccaaatggcg tcattttaca tagttcgcat atgctgaatg 60 acgccaattg gcgtcattcc ccttgttatt aaatgcacta taatttggtc agcgttacga 120 ttaaattttt tgtttttcgt cagcgtcttc tataagcatt ttttcataaa caagtgcaaa 180 ccgtacaatg atattttaat ttttcaattt ttaggaattt tttttttaaa actccaattt 240 tttcagcgtc acgcttaagt acacacaaat atcatccgca gtgaacgtg 289 // ID BEL-219_AA-LTR repbase; DNA; INV; 822 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-219_AA_; KW BEL-219_AA-I; BEL-219_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-822 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 896-896 (2011). XX DR [2] (Consensus) XX SQ Sequence 822 BP; 274 A; 150 C; 141 G; 257 T; 0 other; tgttacgacc cctcggtcgg cgcgtcgatg tacaaacgat aggtagatca tcgattaact 60 gtcaagatgt ggcctacaac tctaggacgg gtcgattggt catgtttgct tcaacgtcac 120 aacaagatcg aacgagaatg aattgcaaga attgaattct gttagtgatc aatatccatt 180 agttcgccta agtgaaattt accttcaact ctcatgaatt cgaacctaat attattcaga 240 ttacggctaa agtggtacag attgctcaga atcattgaat tataccttat aactaatttc 300 aaattatttc aagggtctgt cagctaaatt aggatataaa attgcaccct tacgtgtgtg 360 tagatcttta aataccagta aatctcctct tattatatat atatatactg gtaaggcaaa 420 caaattgttt gaaatttgtc accatattca ctacataacg ttcttagtag caaaccaatc 480 tttatcatta tttggataac ctttatcgag ggtaacgcat tcaggattaa cttcacgtat 540 acaggttagt ttgttataac aagatgtatg tacttgggag agatcttcat ctaacaacgt 600 acattacaga ggaaaggatc gcgcgaaaac tctgcgaaga aaaccagacg aaaagaagaa 660 aataaggaaa ctaaacgtaa gtcaaattct gattcattta attattactc tactaaaaca 720 atgtattgcc atttgcagga agattttaat cctgttccaa taaactgatt aaattctcgg 780 aaacgtttcc tttctgtacc ttctcgttga cgattgtcaa ca 822 // ID BEL-160_AA-I repbase; DNA; INV; 6261 BP. XX AC supercont1.160; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-160_AA_; KW BEL-160_AA-LTR; BEL-160_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6261 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.160; Positions 676593 670333. XX CC 'CCTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 34..6189 FT /product="BEL-160_AA-I_1p" FT /translation="MTGHHDQSMYQCKTCHETDSADAHMIICDQCRQWEHF FT RCAGISEASRSRPFICKQCGTGATGTQRVLRSKTKGNQLTNPSVGGALSQK FT GSKISSVRSFSSRSSVVKAQLELAEEEARMKQKELEEEQELKHLELEEEKK FT QLAEKKKLLEEEARLRQRALEASRERLAKQQSIRRESLEKRNEILLQISER FT GSVLESTTSSVEKVSKWLTAHPQGGMTEENITGAPSDPLLPTPNEQQGEVS FT NALAQPLQTGVRSSQTHDFPLQPSYQEARNIRPLDTFRPGLRAQAAPVIDE FT KLPLPRTVIPREVHSVTVKEPFQQYIASSHPLRDPRSFSPAKPPNFVKTYS FT IPAVNHHVGLPPSDSGDQIRQMHQQAPVWSTPVQHEPVLGPKSFSHRLLTP FT ATELPFPYRNSLPNFDATREGVISGAVRNPLETVLDDGVLNSQKIAARQVV FT GKDLPSFNGNPTDWPMFITSFEQSTAACGFSNAENLVRLQRCLTGHAREAV FT RSRLLLPANVPHVIETLRTLYGRPELLIRSLHEKIRKTPGPRHDRPETILE FT FGLVVQNFVDHLLAAQQGEHLSNPMLLQELVEKLPGPMRMDWAVFKGQQPR FT ATIVLFGEFMSKLVKAASEVSFELPGLFKDLNDGKHQRAKERVRIQTHSTE FT EKSTFKLANSGSRKAPKPCAICDREGHRVAECSKFKQLTVDNRWKLVQNKG FT LCRTCLNNHGKWPCKSWQGCGSEGCHLKHHTLLHSPSTPPSSSHSVNVSLS FT QSSSEEYRTLFRVLPVVLHGKGKSVTIFAFIDEGSQITMLEEKVAKELGVT FT GPTRPLTLQWTGDIKRDESKSQEVSLQIAGKNSGTRYDLRHARTVSCLLLP FT TQSLNYRELCVRFPHLKGLPVEDYDLVQPKLLIGLDNLRLGVPLKLREGGP FT YDLIAAKCRLGWGIYGSSSTNPVPRVRVNFHTAAQPSSDDLLNVQLRDYFA FT FENCGVVAPTEKLESEEDKRARKLLEETTRRTAKGFETGLLWKTDDREFPD FT TYPMALRRMKTLEKKLQRDPLMMQRVREHIIEFEKKGYIRKVSKEEQSTFD FT HRKSWFLPLGVVVNPKKPGKLRIIWDAAAKVDGVSFNSHLIKGPDLLTPLP FT RVLSGFRLYPVAVSGDIREMFLQIGLQASDRNAQMFLFRDHPQDPVQIYAV FT NVTMFGSTCSPSSAQYVKNVNAEQHALHYPRAATAIIEHHYVDDYLDSFRT FT VDEAVQIVRDVKFVHSRGGFEIRNFLFNKDEVLRRTGDIETNTSKEFALVR FT AETTESVLGMKWIPTDDVFTYTLAMRNDLMPILDDNHVPTKREVLRVVMSL FT FDPLGFVAYFLVHGKVLMQDIWASGIDWDDKIGEELFLRWQQWIRYFPQLD FT NIRIPRCYFQPPFPADFDRLELHVLVDASDSAYACVAYYRLETESGVQVVL FT IGAKTKVAPLKALSIPRLELNAAILGSRLLDTIQGYHAYPINRRFLWSDSS FT TVLAWIRSEHRRYNKFVAVRVGEILTTTNANEWRWIPSGVNTADIATKWKN FT IPDLSYDSPWFRGPSFLRKSEEHWPQQKKITPTTEELRSIHVHFSMYPMID FT ASRFSKWTKMLRSMAFVVRYVQNLRRKCKGSTLQLGELQQDELKCAEEMLW FT KVAQAEAFPEEIATLSDSQGLQNARPGRVSKASTIYKNWPFLDERGVLRMR FT GRIGAASFASAEAKYPAILPRQHLITFLLTDWYHCRFRHANRETVVNEMRQ FT RFEIAKLRALIQKVSKNCMICRVSRATPNPPAMAPLPASRLQPFVRPFTFV FT GLDYFGPVLVRVGRSQVKRWVALFTCLTIRAVHLELVHSLSTESCIMAVRR FT FIARRGPPAEFYTDNGTCFQGASNQLEKEKTEARNNSLAAKFTSSKTQWRF FT IPPAAPHMGGAWERLVRSVKVALGSVAEASRIPDDEVLETALLEAEALINA FT RPLTYIPLESADQESLTPNHFLLGSSSGDKVAPFDPIENPAVLRSGWRLAQ FT SITQVFWARWLKEYLPVITRQGKWFEESPNIAVGDLVLVVGGTARDQWVRG FT RIETVIPGRDGRVRQALVRTASVLDV" XX SQ Sequence 6261 BP; 1723 A; 1549 C; 1573 G; 1416 T; 0 other; atcttttaga aagtttgtac gtgggtaatc gatatgacag gacatcacga ccaatcgatg 60 taccagtgca agacgtgcca cgaaacggac tccgcggacg ctcatatgat catctgtgac 120 cagtgtcgcc aatgggagca ctttcgatgc gctggaatta gcgaagctag tcgaagtcgt 180 cctttcattt gtaagcagtg tggaacggga gccactggta cccaacgagt attgaggtca 240 aaaacgaaag gtaatcagct cactaacccg tctgtgggag gcgctctttc ccaaaagggc 300 agcaaaatat cgtcagtccg tagttttagt tcccgatcgt ctgtcgtaaa agcgcaactg 360 gaattggcag aggaagaagc ccggatgaag cagaaggagc tagaggagga acaagaactc 420 aagcatctcg agcttgaaga ggagaaaaaa cagttagctg agaagaagaa attgttggag 480 gaagaagccc gattacgaca gcgagcacta gaagcaagta gggagcgact ggcgaagcag 540 caatcgatcc gtcgagagtc attggagaaa aggaacgaga ttcttttgca gatatccgag 600 cgtggaagtg tgctggagtc tacgacaagt tcggttgaaa aggtatcaaa atggttgaca 660 gctcatccac agggagggat gaccgaggag aacattacag gggcgccaag tgatcccctt 720 ctcccgaccc caaacgaaca acagggagag gtctcaaatg ccttggcaca acctttgcaa 780 accggagtgc gttcgtccca aacgcatgac tttccattac aaccctcgta ccaagaagca 840 aggaatattc ggcctttgga cacatttcga ccggggcttc gggctcaagc agcacccgta 900 attgatgaga agctcccact tccgagaact gtaattcctc gagaagtcca ttcggtaacg 960 gtaaaagagc cattccagca atacatcgca tcgtcgcacc ctctacgtga tcctcgatcg 1020 ttttctcctg caaagcctcc aaacttcgtt aagacatatt ctattccagc ggtgaaccat 1080 cacgttggac taccgccttc cgattctggt gatcagatca gacaaatgca tcaacaagct 1140 ccagtgtgga gcacgcccgt acaacatgag ccagtactgg gacccaaatc gttctcccat 1200 cgcttactca cccctgctac ggaactacca tttccatatc ggaactctct tccgaacttt 1260 gacgctactc gagaaggagt catttccggt gcagtacgca atcccctaga aacggttttg 1320 gatgatggcg ttttaaactc ccaaaagata gccgcgcgtc aggttgttgg taaagatctg 1380 ccatcgttca acggaaaccc taccgactgg ccgatgttca ttacctcctt tgaacaatct 1440 acagctgcat gcgggttttc aaacgcagag aatttggtac gtctacaacg atgcctcaca 1500 ggccatgctc gagaggcggt tcgaagcagg cttttgcttc cagccaacgt gccgcacgta 1560 attgagacat tgcgaacgct ctatggacgt ccagaactat tgattcgttc gctccacgag 1620 aagataagaa agactcctgg acccagacac gatcggccgg aaacgatact cgaattcgga 1680 ctggtagtcc aaaacttcgt tgatcatctg ctagctgcgc aacaaggaga acatctttca 1740 aacccgatgc ttctacagga actagtagaa aagcttccag ggccaatgag aatggattgg 1800 gccgtcttca aaggtcaaca accgcgggct acaattgtgc tgttcggtga gttcatgtcc 1860 aaactggtaa aggcagccag tgaggtcagt tttgaacttc cagggttgtt caaagactta 1920 aacgacggta aacatcagcg agcgaaagaa agagtcagaa tacagacgca ttccaccgaa 1980 gaaaaatcaa ccttcaagct agccaacagc ggtagccgca aagcaccgaa gccgtgtgcg 2040 atttgcgatc gcgaaggcca cagggtggca gaatgctcga agttcaagca gttgactgta 2100 gataaccgat ggaagttagt gcaaaacaaa ggtttgtgca gaacgtgcct gaacaaccac 2160 ggaaagtggc catgtaaatc ttggcaaggt tgtggtagcg aaggatgcca cttgaaacac 2220 cacacgcttc ttcattcgcc gtctactcct cccagttctt cgcactcggt gaacgtatcg 2280 ttgagccagt cttcatctga agaatatcgg acactctttc gagtgttacc cgtagtcctc 2340 cacggaaagg ggaaaagtgt gacgatcttc gcgttcattg atgaaggatc acaaattacg 2400 atgttggagg agaaagtcgc gaaggagctc ggcgttaccg gtccgacaag gcctctcacg 2460 ctccagtgga cgggggatat aaaacgcgat gaatcaaaat ctcaggaagt cagcctgcaa 2520 atagcaggaa aaaatagtgg tacacgttat gatcttcgtc atgcacgcac agttagctgc 2580 ttgctactac caacccaaag cctgaactac cgagaactgt gcgtccgttt ccctcatctc 2640 aaaggcctgc cggtggaaga ttatgatctc gtgcaaccaa aactgctcat tggtttagac 2700 aatcttcgac ttggagtacc gctaaagttg cgtgaaggag gaccatacga tcttattgca 2760 gccaaatgcc gtttgggatg gggaatctat ggtagttcgt ctacaaaccc cgtaccaaga 2820 gttagagtca actttcacac tgccgcacag ccgtcctccg acgacctgct caacgtacag 2880 ttgcgagact acttcgcatt cgagaactgt ggggttgttg ctccgaccga aaagctagaa 2940 tccgaggaag ataaacgagc aagaaagttg cttgaagaaa cgacgcgacg aactgcaaaa 3000 ggatttgaaa caggccttct gtggaaaact gatgaccgag aatttccaga cacgtatcca 3060 atggcacttc ggcggatgaa aaccttggag aagaagctgc agcgagaccc gttgatgatg 3120 cagcgagtac gagagcacat aattgaattt gagaaaaagg gctacattcg caaggttagc 3180 aaagaagagc aatccacctt cgatcaccgg aaaagctggt ttcttcccct gggagttgtc 3240 gtaaacccga aaaaacctgg gaaattaaga attatctggg atgcggcagc gaaagtcgat 3300 ggagtatcgt tcaactcaca cttgataaaa ggcccagatc tcctaacgcc tttgccgcga 3360 gttctcagtg gtttccgttt gtatccggtg gcggtttccg gagatataag ggagatgttc 3420 cttcaaatag ggctacaagc aagtgatcga aatgcgcaaa tgttcttgtt ccgcgatcat 3480 cctcaagatc cggtgcaaat atatgctgtc aacgtcacaa tgttcggatc gacctgttct 3540 ccttcgtcag ctcagtacgt caagaacgtg aacgctgaac aacatgcttt acactaccca 3600 agggcggcga ctgcaattat agaacaccat tatgtcgacg attacttgga cagcttcaga 3660 acggtggacg aggctgttca gatcgtaagg gacgtcaagt ttgttcactc caggggcgga 3720 ttcgaaatcc gcaattttct cttcaataag gacgaagtcc tccggcgaac tggggacatc 3780 gaaacgaata cgtctaaaga attcgccctc gtacgggcag aaacaacaga atcagtgctc 3840 ggaatgaagt ggattcctac cgacgatgtt ttcacttata ccttggctat gcgaaacgac 3900 ctgatgccaa ttctcgacga caaccacgtt ccaacgaagc gggaagtgct tagggtggtg 3960 atgagcctgt ttgaccctct tggtttcgtt gcatacttcc tggtgcatgg taaggtgcta 4020 atgcaggaca tctgggcatc aggtatcgat tgggacgaca aaataggtga agaacttttt 4080 ctccgttggc agcagtggat acgctacttt ccgcagttag ataatatacg tattccacga 4140 tgctattttc aacccccttt ccctgcagat ttcgatcgac tcgagctcca cgtgctggtc 4200 gacgccagcg attctgcata tgcatgtgtc gcttactatc gattggaaac tgaaagcggc 4260 gttcaggtgg tgctgattgg tgccaaaacc aaagtagcgc cgttaaaggc gttgtccatc 4320 ccgcgcctgg agctaaacgc tgcgatctta ggaagccgcc tactggacac catccaaggt 4380 tatcatgcgt atcccattaa ccgcaggttc ctctggagtg attccagtac agtgctagcc 4440 tggatacgat ccgaacatcg cagatacaat aaattcgtcg ctgtacgagt tggtgaaatt 4500 ctaacgacca ccaacgccaa cgaatggaga tggattcctt ccggtgtaaa cacagcggat 4560 attgccacta agtggaagaa tattccagat ctttcctacg acagcccgtg gttccgcggc 4620 ccttcctttc ttcgtaagtc ggaagagcac tggccgcaac aaaagaaaat tactccaacg 4680 accgaagaac ttcgttccat ccacgtccac ttctcgatgt acccaatgat tgacgcttca 4740 cggttcagta aatggacaaa aatgctacga tctatggcct ttgtagtgcg atatgtccaa 4800 aatctgcggc gtaaatgtaa aggttccacg ttgcagttag gagaacttca acaagacgaa 4860 ctgaagtgcg cagaagaaat gctctggaaa gtagcccaag cagaggcgtt tcctgaagaa 4920 atcgccaccc tctcggactc tcaaggactt caaaatgctc gaccgggacg cgtttctaag 4980 gccagtacaa tctataaaaa ttggcccttc ctagatgaac gcggtgtgct gcggatgcgc 5040 ggaagaatcg gagctgcaag tttcgcttca gccgaagcta agtacccagc gattcttccc 5100 aggcaacatt taatcacgtt tcttcttaca gactggtacc actgccgatt ccgccacgcc 5160 aacagagaaa ctgtcgtcaa cgaaatgcgt caacggttcg aaatcgccaa attacgagca 5220 ttgattcaaa aggtttccaa gaattgtatg atatgtagag tttcaagagc cacccctaac 5280 ccgccggcga tggcaccact tcctgcgtcg cggctacaac ccttcgtacg tcctttcaca 5340 tttgttggac tagattattt tgggcctgta ttggtcagag tcggccgcag tcaggtgaaa 5400 agatgggttg ctctcttcac ctgtctcacc atccgagctg tacacttgga actagtccac 5460 agcctatcca ccgaatcctg cattatggca gtacggcgct tcattgcacg ccgtggccca 5520 cccgcagaat tttataccga caacggaacg tgttttcaag gcgccagtaa tcagctagaa 5580 aaggagaaga cagaggctcg gaacaactcg ttagcggcaa agttcacgag ttcaaaaacc 5640 cagtggcgct tcatcccacc agcagcacca catatgggag gcgcctggga gcgccttgtc 5700 cggtcggtca aagtagcgtt agggtcggtg gcggaggcat cccgcattcc agatgacgag 5760 gttctagaaa ccgccctact agaagcagag gccctaatca atgctcgtcc tctcacttat 5820 ataccccttg aatcggctga ccaagaatcc ctcaccccaa atcattttct tttgggatct 5880 tcgagtgggg acaaagtggc accctttgat ccaatagaaa accctgcggt gctacggagc 5940 ggttggaggc tcgcgcaatc cattacgcaa gtattttggg ctagatggtt gaaagagtat 6000 cttccagtga tcactcgtca agggaagtgg tttgaagaaa gcccgaacat agctgtaggc 6060 gacctggtgc tggtggttgg cggaacggca agggaccagt gggtgagagg acgtatagaa 6120 acggtgatcc ctggacgcga cggaagggtg cgtcaagctc ttgtgcggac agcatcggtt 6180 ctcgacgtgt gaaaggatta tgaacccata ccgagagttc ctgagcaaca gaacttggac 6240 cagggttcac gggacggggg a 6261 // ID Gypsy-23_DWil-I repbase; DNA; INV; 3051 BP. XX AC scaffold_181117; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_DWil_; KW Gypsy-23_DWil-LTR; Gypsy-23_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-3051 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181117; Positions 36072 39122. XX CC Positions [2172-2660] - Integrase core CC 'GTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 735..3002 FT /product="Gypsy-23_DWil-I_1p" FT /translation="MDDLIVIGCSESHMIRNLRDVFETCRKYNLKLHPDKC FT SFFRHEVTFLGHKCTNKGILPDESKFKTIQDYPTPKNGDEAKRFVAFCNYY FT RRFIPNFAYHAQFINRLSRKKVEFIWDANCQAAFEYLKNKLLSPQILQYPD FT FNKKFCITTDASKIACGAVLSQEFDGLQLPISYASKAFTKGESNKSTIEQE FT LTAIHWAIQYFKPYIYGKKFLIRTDHQPLTYLFSLKNPSSKLTRIRLDLEE FT FDFTIEYIRGQENFAADALSRIEFENIKNIETNKILKVATRSETKKLVHKE FT NKINNSRIETPRVYEIINPSEAKKYAQIKFKKCKCIITKNKEKLLSFNIGD FT LIISGRIDLDRFLPRLEKEAGAQKIYKLKIDMSEKLLRSLTPNELKQKGKD FT TLKHLEIAVLPQVKVITNKEEIRNTLNKYHNDPIEGGHCGVNRTLGKIKRD FT FFWKNMSRDVATYIKNCEKCKLAKVTRHNHSPLVITPTPATTFDTIIIDTI FT GPLPKTMNGNEYAVTIICDLSKYLISIPIPNKSAKVVAKAIFEKFILIYGS FT VKTILTDMGTEYMNSILIELCKNFNITKINSTAHHHQTLGTVERSHRTFNE FT YIRSYISIKKNDWDEYLAYFTYCFNTTPSTVHGYAPFELVFAKHPKTVPFS FT SETVEPIYNIDDFNKEIKFKLQIAQNRAKLLIEKFKALQKETYDKNTKKVS FT LTEGQEILLKNDTGHKLDNKYTGKYTIQSIGPNNNIVIADDKNKKQTVHLD FT RIKIT" XX SQ Sequence 3051 BP; 1250 A; 567 C; 471 G; 763 T; 0 other; tggcgatcct gcctaacatt ggcaacaaac aaaaaggcaa taagaaaaaa aaaaatcctt 60 gccatacata ggcaaagtga taaaaggact cacataccga tgaccaggac cattacccaa 120 ataaagcggc gaggtggcct tctcgtcaga cttagaaggc agcgagcggc cctgctggcc 180 caagtaaaca gcgacccacc accaccatgg gcagaggcat gccaaatttg gattaggata 240 tgccgacttg aggaacagca gcaacaacaa aaagaggcaa agcagcacaa ccaacagcag 300 cagaaccgac cagcaacaac agcagaaaaa taaatcaaaa aaaaaaatca aaatccgcta 360 ctagttgttt gttaattttt ctatagtcaa ctaccaatct ccaccgtttt tctttactgc 420 caggtagcgc ttttttagga accaataaaa tcggactatt gaactcagac attgatggtt 480 caataatatc atcattgatc aatttgtcca cttgtttatt gatttcggtt ttttgactgt 540 gtgctatcct gtagtttttg gtgtagactg gtgttgtgtc tttgagtctt aatttttgtt 600 tatagaaatt gttggacgat atttgttcag attctattgc gaataccata cggattaaaa 660 attgcaccaa actcgtttca aagaatgatg tcccttgcat tcacggtcta tcaccatccc 720 aaacattcct ttatatggat gatctaattg taattggttg ctctgaaagt cacatgatca 780 gaaacttaag ggatgttttt gaaacttgcc gaaaatataa cctaaaattg cacccagaca 840 aatgctcctt cttcagacat gaagttactt ttctaggaca caaatgcaca aataaaggaa 900 ttttaccaga cgaaagcaag ttcaaaacga ttcaagatta cccgactccc aaaaatggcg 960 acgaagccaa aagatttgta gcattctgta attactaccg tagattcatt ccaaactttg 1020 cataccatgc acaatttatc aatagattat ctaggaaaaa ggtagaattt atctgggatg 1080 caaactgtca agcagctttt gaatacttaa aaaacaaatt attaagccca caaatacttc 1140 aatatccaga ttttaataaa aaattctgta tcacaactga tgcaagcaag atcgcatgtg 1200 gagccgtcct cagccaagaa tttgatggac tccaattacc catatcatat gcatctaaag 1260 catttacaaa aggcgaaagc aataagtcaa caatagagca agaattaaca gcaatacatt 1320 gggcaattca atatttcaaa ccatacatat acgggaaaaa attcctaatc agaactgacc 1380 atcagccatt aacataccta ttttctctaa aaaacccatc gtcaaagctt actagaataa 1440 ggctagattt agaagagttc gattttacta tagaatacat caggggacaa gaaaatttcg 1500 cagcggatgc tctctccaga attgaatttg aaaatataaa aaacatagaa actaataaaa 1560 tacttaaagt agcaacaaga tcagaaacaa aaaaattagt acataaggaa aacaaaatta 1620 acaatagcag aatagaaaca ccaagagtat atgaaataat aaatcctagc gaagccaaaa 1680 aatacgcgca aattaaattc aaaaagtgca aatgtattat taccaaaaat aaagaaaaat 1740 tactctcctt caacataggc gatttgatca ttagtggaag aatagatcta gatcgattcc 1800 ttccaaggct tgaaaaagaa gccggtgctc aaaaaatcta taaactgaag atagacatgt 1860 cagaaaaact attaaggtca cttacaccta atgaactaaa acaaaaagga aaagacacat 1920 taaaacattt agaaatagca gttctacccc aagtaaaagt tataacaaat aaagaagaaa 1980 taagaaatac attaaataaa tatcataatg acccaataga gggaggtcac tgcggggtaa 2040 acagaacact aggaaaaata aaaagagatt tcttttggaa aaatatgtct cgggatgttg 2100 caacatacat caaaaattgc gaaaagtgca aactagccaa agtaaccaga cacaaccact 2160 ccccattggt cataacacca acccctgcaa ccacttttga cacaataata atagatacaa 2220 ttggaccatt accaaaaaca atgaacggta atgaatacgc agttactatt atctgcgatc 2280 tgtccaaata tctcatatca ataccaatac caaacaaaag tgcaaaagtg gttgcaaagg 2340 caatattcga aaaattcatt ttaatttacg gttcagtgaa aacaattcta acagacatgg 2400 gaacagaata tatgaattca atactaatag aactttgcaa aaattttaac ataactaaaa 2460 ttaattccac ggcgcaccat catcaaactc ttggaacagt agaaagaagc catagaactt 2520 tcaatgaata catcaggtca tatatatcca taaaaaaaaa cgattgggat gaatatctag 2580 catacttcac atattgcttt aacaccactc catccacggt acatggttat gcaccttttg 2640 aactagtctt tgcaaaacat cctaaaactg tcccattctc gtctgaaacg gtagaaccca 2700 tttacaacat tgatgatttc aataaagaaa taaaatttaa attacaaata gcacaaaata 2760 gagcgaaact gcttatcgaa aaatttaaag cattacaaaa agaaacatat gataaaaaca 2820 caaaaaaagt atcgctaact gaaggacaag aaatactttt aaaaaatgat acaggacata 2880 aattagataa caaatacaca ggcaaatata caatacagag cataggtcct aacaataata 2940 tagtaatagc ggatgataaa aacaaaaagc aaacagtaca tttagataga ataaaaataa 3000 cataaaaagg atgttagaaa aatataaatt tttcttttta aaaaaaggag g 3051 // ID Gypsy-32-LTR_NVi repbase; DNA; INV; 217 BP. XX AC . XX DT 14-MAY-2009 (Rel. 14.05, Created) DT 14-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-32-LTR_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-217 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(5), 1001-1001 (2009). XX DR [1] (Consensus) XX SQ Sequence 217 BP; 54 A; 59 C; 69 G; 35 T; 0 other; tgtaggtgcc ccactgatag atggcggtac gtccgaccgc ggagatagag agagcagcgc 60 tagctgcgag cgagcgctga cgccgccgtc agttccgttc tgaccagagc acacgccaga 120 cgggctagag agagagagag agagagagag accgaaataa agtgccttgt ccaggagaag 180 agccttgttc ttggtctccg accagcctct tcccaca 217 // ID EnSpm-2_NVi repbase; DNA; INV; 2611 BP. XX AC . XX DT 11-MAY-2009 (Rel. 14.05, Created) DT 11-MAY-2009 (Rel. 14.05, Last updated, Version 1) XX DE EnSpm-type family - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-2_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-2611 RA Bao W. and Jurka J.; RT "EnSpm-type families from Nasonia vitripennis."; RL Repbase Reports 9(5), 940-940 (2009). XX DR [1] (Consensus) XX CC The 5'-end is not reached. XX FH Key Location/Qualifiers FT CDS 209..2269 FT /product="EnSpm-2_NVi_1p" FT /translation="MSLKYSVSNKLSVTGILNLLKFVNRLCGKNILPESKY FT QFDQLCKNENTLTLHAVCLTCSMYIGTFEKSNKSVNCENCNSLIDVSNPSN FT ACYFAIVNPSNAVRDYIESHENYYDYIVSERIHEPNVIKDIYDGECYKNFL FT NNLNTIDRHAYATAVFNTDGAPVFESSNVSIWPIYIMLNEIPIHKRLNSTI FT VTGLWFGTSKPEMSVFLDAFVESMNHLSTIGISATIKNQHRKIKLFTLLAC FT VDTVARAPMNGTTLFNGKYGCDWCLHPGHYYGGSMRYPFNIPFPKERTTES FT VVEHAREAVRTQKRVFGVVNASPLINLKNFNIIDGFTPDYMHCFLAGVASQ FT FTERILQHVTVPDIQYMDALLISIKAPHAIGRLSRPLSQRSNWKATEWENW FT LLFYSIPVLNTVLTNRKIMQHWALLVESLSICLGTRITYSELNRANGMLYK FT FVSEVEDMYSLTAMTYNVHQLLHLVKSVHNWGPLWAHSTFPFEAANHKLLT FT AIHSAKGVVLQIIRYTNIQRTVQILENRLYPYCSKLVLDFCQQITSPLARK FT TLKISCITYFGCGNIVPPNIALLFNVAKTTLIFSKMVLNGCLYATSEKINK FT RSCNYYAQLTDETFIKILYFLVDTESKVESTVCEVIKTRPNKYANVVKEVT FT NISEKICVPTSLIVKPCIFIDTKDGMYIIPVPNVISH*" XX SQ Sequence 2611 BP; 920 A; 387 C; 421 G; 883 T; 0 other; aaataattga ataaatagaa tttttttata ggtcactgat actaatgatg agtctgatat 60 attgagttat tcgagcgatg aagaactgct aaagatttta gaaggaaacg acgaagattc 120 taacggtttt aacaagttgt ttaacgatac aaactttttt gaatctctta atttaactgt 180 tgatagttct ccagcagaat tactgttgat gtctttgaag tattctgtgt caaataaatt 240 atcagttaca ggaatattaa atttattaaa gtttgtaaat agattatgtg gtaagaatat 300 tttaccagaa tcaaaatatc aatttgatca attatgcaag aatgaaaata ctttaactct 360 tcacgcagta tgtctgactt gctcaatgta cataggtacg tttgaaaaat caaataaatc 420 tgtcaactgt gagaattgta atagcttaat agatgtatct aatccttcga atgcgtgtta 480 ctttgctata gtaaatccat cgaatgctgt tcgtgattac atagaaagtc acgaaaatta 540 ttatgattat attgtgtccg aaagaattca cgaaccaaat gtcataaaag atatttatga 600 tggggaatgt tataaaaatt ttctgaataa cttaaataca attgatcgac atgcgtacgc 660 aacagcagtc ttcaatacag atggtgcacc agtttttgaa tcttcgaatg tatctatttg 720 gcctatttat ataatgctaa atgaaattcc aattcataaa aggttaaaca gtactattgt 780 tactggttta tggtttggca caagcaaacc agaaatgtca gtttttctcg atgcatttgt 840 tgaaagcatg aatcatttat caaccatagg aatatcggct acgattaaaa atcaacatcg 900 taaaattaaa ttgtttacat tgcttgcatg cgttgacaca gtcgctcgtg caccaatgaa 960 tggtaccact ctatttaatg gaaagtatgg ctgtgattgg tgtttgcacc caggtcatta 1020 ttatggtgga tccatgaggt atccatttaa tattccattt ccaaaagaaa gaactacaga 1080 atcagttgtt gaacatgcaa gagaggcagt gcgtacgcaa aaacgtgttt ttggagttgt 1140 aaacgcatca ccgcttataa atttaaaaaa ttttaatatt atagatggat ttactcctga 1200 ttacatgcac tgctttttag ctggagtagc gtcccaattc acagaaagaa tacttcaaca 1260 tgtaacagtt ccagacatac aatatatgga tgctttatta atttcaatca aagctcctca 1320 tgcaatcgga agattatcga ggccattatc acagcgtagt aactggaaag ccacagaatg 1380 ggagaattgg cttcttttct atagtatacc tgttttaaat acagtactaa caaacagaaa 1440 aataatgcaa cattgggcgc tattagttga atctctatcg atttgcttgg gaacgagaat 1500 cacatattct gaattaaata gagcgaatgg aatgttatat aagtttgttt cggaggttga 1560 agacatgtat tctttaacgg cgatgacata taatgttcat caattgcttc atcttgtcaa 1620 aagcgttcat aattggggac cattatgggc acactctact ttcccatttg aagcagcaaa 1680 tcacaaacta ttaacagcga ttcattctgc gaaaggtgtt gttttacaaa tcataaggta 1740 cacaaacata caacggacag tacaaatctt agaaaacagg ttatatccat attgctctaa 1800 acttgtatta gatttttgtc aacaaatcac ttcaccgtta gctcgaaaaa ctttaaaaat 1860 atcctgtatc acatattttg gatgtggaaa tattgtgcca ccaaatatag cattactttt 1920 taatgtagca aaaactacat taatattttc aaaaatggtc ctaaatggtt gcttatatgc 1980 tacatcagaa aaaattaata agcgttcgtg taattactat gctcaactga ctgatgaaac 2040 tttcataaaa atactatatt ttttagttga cacagaaagt aaagtagaat caacagtatg 2100 cgaagtcatt aaaacgcgac ctaacaaata tgcaaatgta gtgaaagaag ttacgaatat 2160 ttcagaaaaa atttgcgtac ctactagttt aattgtaaaa ccttgtattt ttattgatac 2220 gaaagatggg atgtatataa ttcctgtacc aaatgtaatt tcgcactaag ttcaatatgt 2280 gattacaaat gtatttgtac tgtacaatga ttatttattt ttataattca tgtatatatt 2340 ttaaataaat ctatatttat cttttaataa tgtgcatcac attaattatt taataaaatc 2400 gtattgtaat gcaaacttgt ttatttcaat tccatttata acaagtaata acattataac 2460 agattaataa tttaaaatgc aatttttttt taataatttt tcagaatatc atgtgattat 2520 cacgtgataa attggttgaa aatatcacgc aattatcacg cgattatcac gtgattatca 2580 cgtgataatc acgtgacttt tttcagtagg g 2611 // ID BEL-19_DPu-LTR repbase; DNA; INV; 456 BP. XX AC ACJG01001375; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: long terminal DE repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-19_DPu_; KW BEL-19_DPu-I; BEL-19_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-456 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (09-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01001375; Positions 3777 3322. XX SQ Sequence 456 BP; 102 A; 116 C; 65 G; 173 T; 0 other; tgttgcggcg ataattattg ttcgcgatct ggcaacgctt tcctccctct cccttctttt 60 ctctcccttc tttcttgtca aaaccatttc tagtcttcgt ctgcaaccct cattttcctc 120 acgtgtcatt ttctgttctg atcacctgct gtgcatttta ctgtgatctt gctatctgtt 180 gtcaggtatg tttctataga gtattacgcc caatttacac ttattcatgt tatatttata 240 tatagaagtg atctgtgact ctatacatgt ccgtcctgtt gtgtcgttac cctaatggtc 300 aacagccttg acacaggtag taattatata aatttcacgt gaaaatattc ccatctaacc 360 cttttctctt tgccatctag gcaccattac tgtcaaatac aacattaaga cttgtcagca 420 tctctgtatt cagctctact catagctccc gaaaca 456 // ID Copia-9_CQ-LTR repbase; DNA; INV; 158 BP. XX AC AAWU01013178; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_CQ_; KW Copia-9_CQ-I; Copia-9_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-158 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 334-334 (2011). XX DR GenBank; AAWU01013178; Positions 13051 12894. XX SQ Sequence 158 BP; 41 A; 42 C; 25 G; 50 T; 0 other; tgttgagatg gcaaccactg ccccattacc tttgcgcgct actgcgccaa cagaactgtc 60 agcagatgtt cctgtttact ttgtttacaa taaatcttag tcttttaaca accgctctaa 120 acgtaacaca cgtgttttaa ttatcctcct gtaagcca 158 // ID Gypsy-127_AA-I repbase; DNA; INV; 6243 BP. XX AC AAGE02017298; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-127_AA_; KW Gypsy-127_AA-LTR; Gypsy-127_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6243 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017298; Positions 123135 116893. XX CC 'CATGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 674..2071 FT /product="Gypsy-127_AA-I_2p" FT /translation="MSNKGKYVRRIVDYDSDDDEDSPIGCGDMTDTEGVEN FT HETCNSDSDESSIDIGTRTDDTRNDKSDSNSDCETSSSSSYSTRSPSCSPS FT KSPSPVKSTTQETPSKAELEHATTERFDRMEKMIANMNNALARFQAGAGPT FT PDDDAELSWKLSNDVDQRAGGESSSSNIRWDHLKPFPSGIASCKMWEEWNS FT YVENFELAASLSNVNDPVKRTQLLFLSMGNELQGIVKAAKLRPSLSNPNCY FT RTFVTNIQNYLRKMTDTAAEHEEFSRMKQEIGESVVAFHARLTSKAKACKY FT SVDEDRFVRAQLLAGLRNRELVRQARVYGHDTEFIVQSATRDETFESERKQ FT QEGNHVLEIRSNRNEHFNRKRSVRGSGTGGPPVRQQRRGDTNRRPQGQRSR FT CTRCSLFNHRNGQCPALTRNCNECGKRGHYAAACRQKQVNALQNKQNFDSF FT RDKSDVPDDDKQSNQQVTNDY" FT CDS 3085..5673 FT /product="Gypsy-127_AA-I_1p" FT /translation="MEDQGVIEKVTSAPGWISGMSAVAKGRNDFRLVVNMR FT APNRANHREFFKLPLLEEMRVKLHGAKYFTKLDLQNAFYHLELSEESRDLT FT TFMTEDGMYRFTRLMFGVNCAPEIFQREMVRILKEVGDYTIVYIDDILIFS FT DTLEGLHATVGKVLQILRDNHLTLNTSKCEFDRTRLQFIGHELDQEGFHID FT NEKVKSVQNFREPTTLSELRSFLGLASYLSPYLKNFADISSPLWTATTTKA FT WLWGLEQKKAFELVKQHIIDCTVSLGFFSEHDKTVLYTDASPVALGAVLVQ FT EGENQSPRIISFASKSLSSTERKYAQNQREALGVVWAVEHFSYFLLGRRFT FT LRTDAQGVTFIFNRSREDSRRALTRADGWAIRLSPYHYDIEYVRGIDNIAD FT PPSRLYKGEDDAPFDDENSPWEIASLEANSVEFLTEAEIKSATDQDETLQQ FT VMKSLKTGVWSESLRRYRTVENDLYILNGIIVKTGCAVVPKALQTHALEVA FT HEGHPSIAKTKSILRQRVWWPGMPKDVTSWVQSCETCCINGKPERTTPMER FT VFAPKVPWNTVALDFNGPYIKFGGVLILVIVDYKSRYVIAKPVKSTKFECV FT KKVLDDVFQNEGFPKGIKTDNGPPFNGQDFAEYCNKRDISITFSTPLFPQQ FT NGLAESYMKLINKAMAVATTNKTNYVDELKKAVNAHNAAAHSITKVPPEEV FT MYGRKIKRGLPLLQHGKSDFDEDLLQCRDREAKLAGKQREDSRRGARECKV FT KPGDEVVIERPNRTKGDSRFSPTRYTVLEERNGSLILNDKHGKVSKRHVSQ FT TKRVQPWRDVERDSSQYSGPPTSTLQQPNEQPSRALVRPARERRTPGFLND FT YVRVIGIDGE" XX SQ Sequence 6243 BP; 1985 A; 1366 C; 1418 G; 1474 T; 0 other; atggcggatc ctgccaggtg ttgcactttt tgttgtaaca caaacataga accattatct 60 aataaaatta aaacagatgg ggaaagggac cagtcggagc tgaagcagac gacatgcatt 120 tttgtttgac gatttcaaga aaaaatcttg gaagattttt tcgttccaca aagtgaggtt 180 attcggaatg agagaccaaa gcgagcaaaa attgaaaata ccctgagtgc actagtgatt 240 accaacagga aaagctctat ccaaacaatg gcaccgagcg gatcttaagt atcgcgctga 300 gcgagctaga ataagctgcg ctgattccat taaaatcgat tgtgaagcgg aacataagta 360 tcgcgctgag acagttaact ttatagagcg gatcaaaaga tcgcgctacg acaatttaga 420 gtcatcgagc gagtcggaaa gactgcgcca aaatgtatcc atttttaagc gggtcaggag 480 caccgcgctg ataagattaa cagaatggag cgtgtcgaac tggcagcgct gagcaaatag 540 actacatgca tagtggagcg gatttcaaaa tcgcgctcat ggaaaaaaaa ataataaaac 600 taattgcgat tatttttctt gtaccctctc tagacagaaa caaacttccg aaatttagcg 660 aaaacagaaa agcatgtcca ataaagggaa atatgtccgt cgtatcgtcg actacgactc 720 agatgacgat gaggatagcc cgattggttg tggtgacatg acagacacgg aaggagtgga 780 aaaccacgaa acgtgtaaca gcgactctga tgaatcatcc atcgacatcg gcacaagaac 840 cgacgacact cgtaatgaca aaagcgattc caactctgac tgcgaaacat caagttcatc 900 ctcgtattca acgagatctc catcctgttc tccatcaaaa tcaccgtcgc ccgttaagtc 960 aacaacacag gaaactccaa gcaaggcgga actggaacac gccaccaccg aacgattcga 1020 ccgcatggaa aagatgatcg cgaatatgaa taacgccctg gctcggttcc aagcgggagc 1080 cggaccaaca ccagatgacg acgccgaatt aagctggaag ctatcgaacg acgtagacca 1140 acgcgcaggt ggtgaaagct catcgtcaaa tattcgttgg gaccacttga aaccattccc 1200 tagtggtatt gcatcgtgta agatgtggga ggagtggaac agctacgtgg aaaacttcga 1260 gttagcggca tccctaagca acgttaatga ccccgttaaa agaacccaat tgctattcct 1320 ttctatgggt aacgaactac aagggatcgt caaagctgct aagttacgtc caagcttgtc 1380 caatcccaat tgttacagga catttgtgac caacattcaa aactacttac ggaagatgac 1440 cgacaccgcc gccgagcatg aggagttctc gagaatgaag caagagatcg gcgaatccgt 1500 ggtcgctttc catgcgcgat tgacatccaa agctaaggca tgtaagtaca gcgttgatga 1560 ggacagattt gtacgggcgc agcttctagc aggcctccgg aatcgagagc ttgtgagaca 1620 agctagagtg tacgggcacg atacggagtt catcgttcaa tcagctactc gcgatgaaac 1680 gttcgaatct gagagaaaac agcaggaagg aaaccatgtg ctggaaatca ggagcaaccg 1740 aaacgagcat ttcaatcgga agcgttccgt caggggttcc ggaacgggtg gaccaccggt 1800 aaggcaacag cgtagaggtg acaccaaccg acgaccgcaa ggacaacgct cacgatgcac 1860 cagatgctcc ctctttaacc acaggaacgg acaatgtcca gcgctaaccc ggaattgcaa 1920 cgagtgtggc aaacgtggcc attacgcagc ggcgtgtcga cagaagcagg tcaatgcttt 1980 gcagaacaag caaaatttcg acagcttccg tgataaatct gacgtgccag acgatgacaa 2040 gcagagcaat cagcaggtaa caaatgatta ctaatttgta ttatcatttt ccctaaatca 2100 tggcacacta agaatgattt aaaactaaac catcgatacg ttgaataaaa cacataataa 2160 ataaattata tttcctttat tttctgaatg tcacatttta tgcaaatcag ctggttaatg 2220 ccctttcact cgaggacgtc atagttacct gttcaattgg atcatctact cccattcgat 2280 tcctcattga ctcgggagct gacgtaaatg ttatcggagg aaatgattgg gctttactga 2340 cacgagaaaa gcaaatgggg aaagcaaaat tagaaatcat gaggacaatc gataaaaagt 2400 tacatgcata cggaagtcaa gaaccaatgg ttgtcgagtg cacaatcaaa gccgaaatta 2460 ctactccagg atccatccaa cctgctatag ccgccgtctt ccacgtgatt ctcaaaggaa 2520 caagatcact gctaggacga tcgacgtcga gtgacatggg attattgcac ataggaagta 2580 aaatcaacaa ctgtgaacta cggagcaaca accaatttcc aaaaatgcca gggttgaagg 2640 tcaaatttag cattgatcag acggttccac ccacaaaaag cgcgtattat aatgtaccag 2700 cagcgtacag gtaagtttaa acgaaaaaca ttggcagtat acatgattga tactctccca 2760 aaatgcattt aaggtgaaac tgtttttctc taggatccga aatccaaaaa tgcataaaag 2820 gattagttcg aagaagattt caatgcagaa ccatatgtgt aattgtaatt gattcattgg 2880 catttgttcc ctaggtaaca cgaaacaata ttgccgttat ggctgcattg ttggttcgca 2940 aaacgtttca ccttaacata catgcattag cagtcatgct tccttttgtt catcaaacac 3000 caagtcagtt gcctatcaat tacagacagt ctttacagtt ttgatgatgt ttacagggaa 3060 gcagcgcgac aaagactgcg agacatggaa gaccaaggag tgatagaaaa agtaacatcc 3120 gctcctggtt ggataagcgg tatgtcagcc gttgccaaag ggcggaatga tttccgcctt 3180 gtagttaaca tgcgcgcccc caaccgagcg aatcatcgcg aattcttcaa acttccttta 3240 cttgaggaaa tgcgagttaa gctccatggc gcaaaatatt ttacgaagct cgatttgcaa 3300 aacgcttttt atcatctgga attgtccgaa gagtcgcgcg atctgaccac ctttatgacg 3360 gaagatggca tgtacagatt caccagactc atgtttggcg ttaactgcgc gccagaaata 3420 ttccaacgtg agatggtcag aatcctgaag gaggtcggtg attataccat cgtgtatatc 3480 gacgacatct tgattttttc ggatacgctt gaaggtctac acgcaactgt aggcaaagta 3540 ttgcaaatct tgagagacaa ccatctaacc ctaaacacat cgaagtgcga attcgatagg 3600 accaggttac agttcatagg tcatgaactc gaccaagaag ggtttcatat cgacaacgaa 3660 aaagttaaaa gcgtacaaaa cttcagagag cctacaacac tgtcagagct tcgtagcttt 3720 cttggactgg cgtcgtacct gagtccatac ttgaaaaatt tcgctgatat ctctagtcct 3780 ttgtggactg cgacaactac taaggcatgg ttatggggat tggagcagaa gaaagcgttt 3840 gaattagtca aacagcatat catcgattgc actgtatcat taggtttctt ctcggaacat 3900 gataagacag tactatatac agatgcttcc cctgtcgcat tgggagctgt cctagtgcaa 3960 gaaggagaaa atcaatcgcc gcgcataatc agctttgcat ccaagtcact ctcatcaaca 4020 gaaaggaaat acgcgcagaa ccaacgggaa gcattaggag ttgtgtgggc ggtggaacat 4080 ttttcctact tcctacttgg cagacgcttc acactccgta cagatgcgca aggggtcaca 4140 ttcatcttca atagatcacg ggaagattca agacgagcat tgacgcgtgc tgacggttgg 4200 gctatcagac taagcccgta tcattacgat attgaatatg tccgaggtat cgataatatt 4260 gccgaccctc catctagact ctacaagggt gaagacgacg ctccgttcga cgatgaaaac 4320 agtccgtggg aaatagctag tcttgaggcg aattcagtcg agttcctaac ggaagcggaa 4380 ataaagagtg ctaccgatca agatgagaca ttacaacaag taatgaagtc actaaaaaca 4440 ggagtgtggt ctgaaagctt gaggagatat cgcacggttg aaaacgattt atacatcttg 4500 aatgggatca tcgttaaaac tggatgtgca gtggttccaa aagcactaca gacacatgct 4560 ctggaggtag cacacgaagg ccacccatcc atagcaaaaa ctaaaagtat tttgcgacaa 4620 cgcgtttggt ggccaggcat gcccaaagac gtaactagct gggtacaatc gtgcgaaaca 4680 tgttgtatca atggcaagcc agagagaaca acaccaatgg aacgggtatt tgctccaaaa 4740 gtgccttgga atacagttgc cctggatttc aatggccctt acatcaaatt cggcggagtt 4800 ctcatcttag tgatagtcga ttataaatca cggtacgtca tcgccaaacc cgtcaaatca 4860 accaaatttg aatgcgttaa gaaagtcctt gatgacgtat ttcagaacga agggttcccc 4920 aaaggcatca aaacggacaa tgggccgcca ttcaatggac aagacttcgc ggaatattgc 4980 aacaaacgag atatttcaat aactttttct acgccactct ttccgcagca gaacggcttg 5040 gcggaaagtt atatgaaatt gattaataag gctatggcag tggcaactac caacaaaacg 5100 aattatgtcg atgaattgaa aaaagcagta aacgcccaca acgcagctgc tcacagtatc 5160 acaaaggttc cacctgaaga ggtgatgtac ggccgaaaaa tcaagcgtgg actgcctctg 5220 cttcaacacg gtaagtctga ctttgacgag gaccttctac aatgcaggga tcgtgaagcc 5280 aagttagcgg gaaagcagcg tgaagattcc cggcgaggtg ctcgcgaatg taaagttaaa 5340 ccgggggacg aagtggtcat cgaacggcct aatcgcacaa aaggagattc tcgtttttct 5400 ccgacacgat atacggtgct ggaagagcgg aacggcagtc tcatcctcaa tgataagcat 5460 gggaaagtct ctaaacgaca cgtatcccaa acaaagaggg ttcaaccatg gcgcgacgtc 5520 gagcgtgatt catcacaata ttccggacca ccgaccagta ctcttcagca accaaatgag 5580 cagccctcac gtgcattagt ccgaccagca cgcgaaagac gaacaccagg attcctcaac 5640 gattacgtga gagtgatcgg aatcgatggg gagtaatact tcgtcggtaa taccgcaaac 5700 ttgacctgaa ataaaatgaa aatgaattcg aaaatgtaaa tattgaactg caccatcatt 5760 tactcaccgt cggtatagaa caaatcacaa aggcaaacat taaaggaaaa gaacggaatg 5820 cagcagtaat gttttttcgc ggagataaac ttttgttgtt gttgaccgtc cgactttgac 5880 aacgcatcaa caacatcctt ttcttgcttt tctcccttct tcactctttc gaccgatttg 5940 acattttgca gcgagacgcg cacccaaaag ctgcccctgc agcatattat gttgacaaac 6000 ttgccgaggg gtgatgttcg gaaggtgagt aacacacacg gaaagaaaaa aggtactaaa 6060 tttgattaga cttgcacaat gtgcaagcat tttttttctc tttcgtacat ctcattctgg 6120 ttgtctttgt gcatttgtga gatatattag tcgaagatta tttcttccag attggctgta 6180 aacaagttaa aagcacatgt taaaaaaatc aactttattt tattttccga agaggaagtg 6240 aga 6243 // ID hAT-27_HM repbase; DNA; INV; 3009 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-27_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3009 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2016-2016 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 902..2812 FT /product="hAT-27_HM_1p" FT /translation="MSSNRKTSLVWKYFIIEANNPRMVQCKICNSKVSRGS FT DDPKKQGLSGLTSHLKNHHPDIKLTPEKAENPKANEEADLNSNKRKAVTIF FT NMRSKKQRSDMLQSTIPNWVQASSKIDSNSEKGQKFHKSIFEMMILDLQPW FT SIVNDAGFLRHHALIAPNYEIASEKYYRSLLDPTYEKIKVALKEKLVQSEA FT ENVSVCLDAWSSFHHGYLGMTVHFISKDWNRVKFCISCSPFDESHTAKNIF FT KEIKAAAEEWEISSKIGVCLRDNAANVKAAFNEPSCNYKSAGCLNHSLQLV FT IKKDLLSDPTINELIEKSRKLCSYASHSIGFYTELYRQQEIQMDRKDRLGL FT KNDVATRWNSTYYMLERILHLKPAIAATLLKFPSVGIEFSVQDWSLYEKVV FT RILSVFEEATKMLSGSDSCISSCIPIVTTIIKALETTSGDVSVQKLKIAMK FT NAMEKRFSAIEKTEHYSVATLLNPKYKWYFFRSQAALQNAKHIVMSQMENF FT EPLEDELAPVTQIPSGDNEKNTAFSHIMSKIIAQSQTGPNEHSSSKMTREV FT LKDFLDCPMASHCLEFWKNYEKNATSKIKLALTKVAKKYLTPPPTSTDVER FT LFSTAGDILSNERNRLLPENLEKLLFCRENLAVVGFCY*" XX SQ Sequence 3009 BP; 1015 A; 508 C; 490 G; 996 T; 0 other; tagtgatcga ccgaaattcg gttcggttcc ggttccggtc ataaatttgg ccggaatcga 60 aattcctgtt tcggttccgg ctagatttga gatttcggtt tcggccggaa tttcggttct 120 aaatgctacc gaaaatagat attataaaac tttattttaa ataaattatt tagcattgta 180 gcgtaaatat attttgatta aaaaaattta aataattttt ttgctagtta tttcaacttc 240 ttatattatt ttttgcgtcc gatcgcgttg ttacttttat ctggttttta attatcatgt 300 ttaaaatatc atgatctaca aatttaatga cgctacaatg ttatctacat tgtagagtca 360 tatgtttatg gacttgatta tttatttatt ttaattatcg catggaattg gtaacggtaa 420 aatctttacg aacttggtga ttttttttta aattatcacg tggaatttgt ttgaaataaa 480 ttaaaaaagt ttttattggt ttcattcatt tttttaaagc aagttgtttt aatgttttct 540 cgtcgttaaa acattaaaac aataatgttt tcttgtcgtt ataacttttt taatgttttc 600 tcgtttaaat tgttaattgt ctttttaacg aactttttaa tgtattgaac tttttaatgt 660 aacgtaagaa atttttctta aatatatata tatatatata tatatatata tatatatata 720 tatatatata tatatatata tattgttaag ttaacaacaa tattaataca aattaatata 780 attaataatt aatataatta acaaagaact tgttaattat attttaaatc cttgttcttg 840 actcaaacta gcaacattaa aaaactcatc tgattaagtt cttcgatttc attctaataa 900 aatgagctca aaccgaaaaa caagtttagt gtggaagtac ttcataattg aagcaaacaa 960 tccaagaatg gttcagtgca aaatctgcaa cagcaaagtt tcaagaggat ctgacgaccc 1020 aaaaaaacaa ggattaagtg gactcacgtc acacctgaag aatcatcatc ctgacatcaa 1080 acttacgccc gaaaaagcag aaaatccaaa ggcaaatgaa gaagctgact taaattccaa 1140 caagagaaag gcagttacaa tcttcaacat gcgatcaaaa aaacaaagat ccgacatgtt 1200 acaatctaca attccaaact gggttcaagc ttcttcaaaa attgactcta attctgaaaa 1260 aggccaaaaa tttcacaagt caatttttga aatgatgatt cttgacctcc aaccatggtc 1320 tattgtcaat gatgctggat ttttgaggca ccatgctctt attgctccaa attacgaaat 1380 tgctagcgaa aaatactaca gaagtttact ggatccgact tacgaaaaaa ttaaagttgc 1440 tctcaaggaa aagctggttc aatcagaagc tgaaaatgtg tctgtctgcc ttgatgcatg 1500 gtcatcattt catcatggct atctgggaat gactgttcac tttatttcca aagactggaa 1560 ccgtgtcaaa ttctgcatct cttgttctcc atttgatgaa agtcatactg caaaaaatat 1620 tttcaaagag atcaaagcag cagcagagga atgggaaatt tcttcaaaaa ttggagtttg 1680 cttaagagac aacgcagcaa atgtaaaagc agccttcaat gagccaagct gtaattataa 1740 gtcagctgga tgccttaacc actcccttca acttgtgata aaaaaagacc tattatcaga 1800 tcctaccatc aacgaactga ttgaaaagag tcgaaaactt tgttcttacg catcccattc 1860 tattggtttc tacactgaac tttacagaca gcaagaaatc caaatggata gaaaagacag 1920 acttggctta aaaaacgatg ttgccacaag atggaactct acctattaca tgcttgagcg 1980 aattttacac ctgaaaccag ccattgctgc aaccttactc aaatttcctt ctgtgggaat 2040 tgaattttcc gtccaagatt ggagcctcta tgaaaaagtg gtgaggattt taagtgtttt 2100 tgaggaggca acaaaaatgt tgtcaggtag tgattcttgt atcagttcct gcattccaat 2160 tgtgacaaca ataattaaag ctttggaaac aacctccggc gacgtcagtg ttcagaaatt 2220 gaaaatagct atgaaaaatg caatggaaaa acgtttttca gcaattgaaa aaacagaaca 2280 ttattcagtt gctactttac ttaatccaaa atacaaatgg tattttttcc gatctcaagc 2340 tgctttgcaa aatgccaaac acattgtcat gtcccaaatg gagaattttg aacctcttga 2400 agatgaattg gctcctgtta ctcaaattcc atctggtgac aatgaaaaaa acacggcatt 2460 ctcgcacata atgtcaaaaa ttattgcaca gtctcaaaca ggaccaaatg agcattcatc 2520 ttccaaaatg acgagggaag tgttgaagga ttttttggat tgtccaatgg cttctcattg 2580 cctagaattt tggaaaaact atgaaaaaaa tgcgactagc aaaatcaagt tagcattgac 2640 taaggtggca aaaaagtacc tcacaccacc cccaacatcc acagatgttg aaagattgtt 2700 ttctacggct ggagatattc tatctaacga aagaaacaga ctccttccag aaaatttgga 2760 aaaactcctt ttttgtcgag aaaatcttgc agttgttgga ttctgctatt aatcattttt 2820 tattaattta actttgtgtt ttaattggaa taaacttgca aacaaaaact attatttatt 2880 ttcggttccg gtttcggttt cggctctggt tagggtttac tttcggttcc ggttccggtt 2940 aggaatattt aaaatctatt ttcggttcgg ttccggttcc ggcaaaattt cagtgccggt 3000 cgatcacta 3009 // ID Gypsy-152_AA-I repbase; DNA; INV; 6675 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-152_AA_; KW Gypsy-152_AA-LTR; Gypsy-152_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6675 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1029-1029 (2011). XX DR [2] (Consensus) XX CC Positions [4721-5203] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1062..2306 FT /product="Gypsy-152_AA-I_2p" FT /translation="MRQQYPGPTVEDLSRRLKTVELHDRDEREILRSVNQN FT QLRSVIPNNLHMPESVKRNFNPQVTFNSIGNLNAHQTERQTGNLNRFNEDT FT FNQYQQLPRSTDFGFLPNQVPQSHSPYNSPLRNLNRSVNQNNDQHYLRRQP FT YQTCNIVEKWPKFFGDSSSVPLVCFLRNVDILCRSYGIDKNELIKHAHLLF FT SGDASIWYTTYVEKFVDWDSLVYYLTLRYDNPNRDRFIKEEMRNRKQKQNE FT LFSAFLTDIESLAQRLINKMSEAEKFDIIVDNMKMSYKRRLALHQIASIED FT LANMCYKFDALESSLYTPRSKPVDVLNLEEVDDSDSEFEEINVLQKSRIGK FT SQSSTETRPKLEKAETPPENCRNIREDGHIRKSRPKQAAIFCHVCGMTGKT FT TSTCPKKHPPFNPSLQGPKNL" FT CDS 2762..5242 FT /product="Gypsy-152_AA-I_1p" FT /translation="MFRIGFAEDYFSNSSEIFSISVSPKITLGETKCLDTV FT EDESLNIPTIEIPESKIETPEDIKTEHELLPEERKKLFEAIQELPITEDNS FT LGRTQVLEHKIELLPDARPKKLPIYRQSLQKEEGIQQDIQRMLRLGVIEEC FT EGPVDFLNPILAIKKPNNKWRICLDSRRLNSCTKRDDFPFPDMVGILQRIQ FT KSRYFSIIDLSESYYQVPLDCNSKNKTAFRTSRGLYRFTVMPFGLCNAPAT FT MARLMTKVLGHDLEPFVYVYLDDIVIVSNSLEEHLRLIKIVANRLSNANLT FT INVLKSKFCQKQIQYLGYILSEHGLSVDSSKIKPVLDYPQPQNVKDVRRLL FT GLSGFYRRFIPSYSDKTAKISDLLKKGKKKFIWTKEADDSFKQLKSALVSA FT PILANPNFSLPFEIETDSSDQAIGAVLTQNQEGERKIIAYYSKKLSSTQRK FT YSATERECLGVLLSIEQFRHFVEGTKFIVKTDAMSLTFLKTMSIESKSPRI FT ARWALKLSKYDIDLQYKKGSENVTADALSRIVNIIDPLLPDEYTSALKEQV FT AKFPEKFKDFRIVDDKLFKYITNTTQLEDNVFKWKYVVPLAERLQIVKNIH FT NEAHFGFSKTLAKVRERNYWPKMSTDIRNFCSSCEICKESKVDNSNPLPPC FT GKPKLCSRPWQMISLDFLGPYPRSKQGNVWALVISDFFSKFVLIQCMRSAT FT AGPVCTFLENMVFLLFGAPQICISDNATVFLSREFKKLLDKYNVSHWNLAV FT YHPAPNPTERVNRVIVTAIRCALNESSSHRDWDKEIHVIAKAIRTSVHEST FT GFSPTLMFSRIWKEKGFREHTMVRF" XX SQ Sequence 6675 BP; 2329 A; 1166 C; 1289 G; 1890 T; 1 other; ttttggcgcc caacttttgg gtctttaaga taattgaatt tcggattagc ttgaattatt 60 atagaataag actattcagt ttgcgtaaga tcagtttgca ataattttaa ttgagtgaaa 120 ttgaatgaat gagagtgagt gagtacggat taagataagt tgagtacgat ttacatttag 180 tttgagttga attcgattga tttagtattt agttgtaatt tggagtgaaa gattgagagc 240 ttaattaacc ttattttttt ttgctttctt atatttgaat ttttataata agtgtatata 300 aataattact gtaaatacga accatggata actatcttag aatcaattta gatgaccttg 360 aatatgagga gttaacgtat gaactcgttc ttagaggaat tccggttacg ggatccatag 420 atgcacagaa aagagaattg cgtgctagga tgcgaagcga aattaaaaat aatattaaaa 480 ttgagagctt tcgtactttg agtgaagaat atgaaattgt accagaaatg ttggagcaga 540 tggagataaa tctcaatctg aagactgaac cgacatacga atcaagactt tatcactacc 600 ttgaacgcgt ggctcgtgca aaagagtttg atgatgtgga cagaggcaac aaactaattc 660 tactcgaaat tatttgtgaa atgcttaggg tttattttag gcgggatata agcttaaacg 720 aggagtcttt tgaagattca cttgagctag gagcggtagg aggacaaagt tctaatgatt 780 cggttacaaa tgtggaactt caggctaatg ataagaataa tacaggccat gaaaccggaa 840 cacgtccaaa agcaaatagg gatttaactg gaaatcctac ttcacgacaa gataaaaacc 900 cgataacatc aagacttcga aatacggtta cacaacaaca aagtaaaact caaaattcgt 960 tcgttccaac tcagaatcca aatakgaatc gtgaatatct gcatatttca gaaatcgaca 1020 gttatgttac agcttgcgta caacaaaagc tcaatgagca aatgaggcaa cagtatccag 1080 gaccgacagt tgaggattta tccagaagat taaaaaccgt agaacttcat gatagggacg 1140 agagggaaat tcttcgatct gttaatcaaa atcagcttag atcggttata ccaaacaatt 1200 tgcacatgcc agagtcagtg aagagaaatt tcaacccaca ggtgacattt aattcgattg 1260 ggaacttgaa tgcacaccag acagaaagac aaacgggaaa tttgaaccgg ttcaatgaag 1320 atacattcaa tcaatatcaa cagctaccga gatcgacaga tttcggtttt ctaccaaatc 1380 aagtcccaca atcacactcg ccatacaata gtcctttgcg taatctgaat agatctgtca 1440 atcaaaataa cgatcaacat tacttgcgta gacaacccta tcaaacatgc aatattgtag 1500 aaaaatggcc gaaatttttt ggtgactcca gttccgtacc attagtatgc tttttgagaa 1560 atgttgacat actatgtagg tcatatggca tagataagaa tgaactaatc aaacatgctc 1620 acttgctatt ctcaggagat gctagcattt ggtacacaac atacgttgag aagtttgtag 1680 actgggatag tctcgtatat tacttaactt tgcgttacga taatccaaat cgcgacagat 1740 tcataaaaga ggaaatgcga aacaggaaac aaaagcaaaa cgaattattt agtgcgtttt 1800 taacagatat tgaatctctg gcccaacgat taattaataa aatgagtgag gcagaaaagt 1860 ttgacataat cgtagataat atgaaaatgt cttataagag aaggttagca ctacatcaga 1920 tagcttcgat cgaagacctt gccaacatgt gttacaaatt tgacgcactt gaaagttcat 1980 tgtatacgcc tagaagcaaa cctgttgatg ttctcaacct cgaagaagta gacgactctg 2040 attcagaatt cgaagaaatt aacgtccttc agaagtcaag aataggaaaa agccagagta 2100 gtactgagac aagacccaaa ctagaaaagg cagaaacacc tccggaaaat tgccggaata 2160 ttcgtgaaga tggtcacatt cgtaaaagtc gtcctaaaca agctgcaatt ttttgccacg 2220 tttgtggtat gacgggcaag acaacatcca cgtgtcccaa aaagcaccct ccgttcaacc 2280 cttctttaca agggccaaaa aacttgtaaa gggagagagg tataggaatc gaactcttga 2340 atccctccca tcagagattc cagaacatga atcgttctca tacttcaatc aaatttttca 2400 aattaatatt gtcccatcca ggtgcccaca tttgagtgtc aaaatactcg attcggaagt 2460 tgaagctctt gtagacacag gtgcgggaat atcggtaatt agttccgttc cattaattga 2520 aaaactcaac cttaaaattc atgactcaaa tttgaaagtt gcaactgctg acagaacatt 2580 ttacaaatgt ttaggctatg taaatgtccc gtataccgtg gactcaatca ctcatgtgat 2640 accgacagta atagtgcctg agttaacaaa agatttaatt ctcggagttg attttcttca 2700 cgcttttgga tttagtttgt ccagaaatca aaaagatagt actgaaaaac aaaaaatcga 2760 catgttcaga atagggtttg ccgaggatta tttttcaaac agtagcgaga tcttttcaat 2820 ttcagtttct ccgaagatca cattgggcga aacgaaatgt ttagatacag tagaagatga 2880 aagtttgaat attccaacaa ttgaaattcc agagtcaaaa attgagacac ctgaagacat 2940 taaaacagaa catgaattgc tacctgaaga acgtaaaaaa ctgttcgaag caattcaaga 3000 gcttcctatc actgaagata acagcctagg tcgaacacag gtgctagaac acaaaataga 3060 acttttgcct gacgcacgtc caaaaaaatt acctatctat cgccaatcct tacaaaaaga 3120 agaaggaata cagcaagata ttcaacgaat gctaagatta ggagtaattg aagaatgtga 3180 aggtccagta gactttttga accccattct tgctataaaa aaaccgaaca ataaatggag 3240 aatatgttta gatagtcgaa ggctgaattc atgcactaag cgggatgatt tcccattccc 3300 tgatatggta ggcatattgc aacgtattca aaaatcacga tatttcagta taattgacct 3360 ttcagaatca tactatcaag tcccattaga ctgcaattcg aaaaataaga cagcattcag 3420 aactagtaga ggcttataca ggttcaccgt gatgcctttt ggactttgta atgccccagc 3480 aactatggca agattaatga caaaagtgtt aggccatgat cttgaaccgt ttgtatacgt 3540 atatctcgac gatattgtaa tagtttcaaa ctcattggaa gaacatttaa ggctcattaa 3600 aatagtagca aatcgtctga gtaacgcaaa cttaacaata aatgttctca aatcaaagtt 3660 ttgccagaaa caaatacagt atttaggata cattctctct gaacacggct tatccgttga 3720 tagttccaag atcaaaccag tattagacta tcctcagcca cagaacgtaa aagatgtacg 3780 tagactgttg ggtctttcag gattctatag aagatttata ccaagttatt ctgataaaac 3840 ggcaaaaatc tctgatctct taaaaaaggg taaaaagaaa ttcatttgga ctaaagaagc 3900 ggacgattcg tttaaacagc tcaaatcagc cttagtatca gctccaattc tcgccaatcc 3960 aaacttctcg ttgccatttg agatagaaac cgatagctca gatcaagcta taggcgcagt 4020 cctcacacag aatcaggaag gtgaacgtaa aataattgct tactactcga agaaactatc 4080 aagtacacag cgaaaataca gcgcgacaga acgcgaatgt ctaggagttt tattgagcat 4140 agaacagttc agacatttcg ttgaaggtac taaattcata gttaagaccg acgcgatgag 4200 tttaactttt ctcaaaacta tgtcgatcga atcaaaatcg ccgcgaatag cgcgttgggc 4260 acttaaattg tccaaatacg acattgattt acaatacaaa aaaggctctg agaatgttac 4320 cgcggacgca ctctcaagaa tcgtgaacat tatagatcct ttgttgccag atgaatacac 4380 atccgcgttg aaagagcaag tagcgaaatt cccagaaaaa ttcaaagatt ttcgtatcgt 4440 agatgacaag cttttcaaat acataacaaa cacaacgcag ctagaagata atgtattcaa 4500 atggaaatat gtagttccat tagcagaaag attgcaaata gtcaagaaca ttcataacga 4560 agctcatttt ggttttagca aaacattagc taaagttaga gaacgaaact attggccaaa 4620 aatgtcaaca gacatcagaa acttctgttc gtcgtgtgaa atttgtaagg aatcaaaagt 4680 agataactca aatccgttac caccatgtgg caagccgaaa ttgtgctcca ggccatggca 4740 aatgatctca ctagattttc taggtcctta tccgcgatcg aaacagggta atgtttgggc 4800 attagtaata tcagattttt tctctaaatt tgttcttatc cagtgtatgc gcagtgcaac 4860 agccggacct gtttgcacat ttttggaaaa tatggtgttc cttttattcg gtgcacctca 4920 aatctgtatt agtgataacg caaccgtatt tctttctcgc gagtttaaaa aactgcttga 4980 taaatacaac gtgagccatt ggaacttagc ggtttaccat ccggcaccga accctactga 5040 aagggttaat agagtaatcg taactgccat ccggtgtgct ctaaatgaaa gctcttccca 5100 tcgagattgg gataaagaaa ttcacgttat cgcaaaagcc attcgtacat cagtgcacga 5160 aagcacagga tttagcccaa cacttatgtt ctcgaggata tggaaggaaa aaggattccg 5220 ggaacatacc atggttcgtt tttaaagaaa ggatagtaga aaatttattc ccatagctat 5280 gaatgcactc ttgccgggaa gtgcaatgta caaatcaaaa ttacaaattc caatgagaaa 5340 atactccatc gaggtgtaac ggttgagcca ttttgagtag gtgtcgaatt gtttccatct 5400 ctcaccatag gccagtcaga tcctaaaacg attcgagaaa ttaatataga atacaaaaaa 5460 ggttaccgca atgtagtgat gtcacaagtt gtttccactc attgttgggc tgaactgaaa 5520 agaaaaatat gttaatttct tggatctgtt tgtaaagcta tgtacataaa caactgcagt 5580 tagaactaaa ttcttcgact agcaacaaaa acaccaatag gcaggtcaat agccactaca 5640 gttcttcgtg aactatatgt gtatagcact gtacatagta gtttaggata ggtttaagat 5700 tcacttaggg taatagcttt gtacgtatgt aagtaacgaa acactttcct gttccataat 5760 ccggttcttc aattgtagat gcaacgtagt ttcaggttca aagaggttcc atccatcctt 5820 ctttttcttg tagtttttcc tttgagaagt agtgtactat cagcatcagg tatcaagtag 5880 tcagtccagg tatttttcca tccagttaaa gcgtaaatcc ttccagcaat agtaaataat 5940 tcacgaaact ttgtagatcc aaacattttc acgactggtt ttcactagaa gtaaacaaac 6000 tttgacagtt cggcgaaaaa gagactgaga gaaactttca ctctcatgag cgtagccgtc 6060 ggagagcgaa gacgttatgc ttagaattag ttagaatagt atagtataag cgcaggcgta 6120 gtagcattag atcgaattac atcggataat tgtagggcgc gtcagaaaat tgagattgcg 6180 tatgaacgag tcgagtatga atgaaattcg tcagtcaaac acagtatttt ttccagttgt 6240 aactggagta atgatcgttt aagtttgagt atgagatttt ccgtaagtat acggagttat 6300 tgttgagatt tttttccgtt gaagtacgga gtaagaaatt tatgttttcg atgtgtgaga 6360 gtgtgagtaa gaatattaaa gtgaacaccc gtaaaggtcg tcacaaggaa tggtttaaca 6420 gaagggatga taatgaaagc tgatcttact acgcaattga ataagttagt ccacgtaata 6480 atgtaacttt aatgaaatat gaatttataa aaacaaagtt ttgagttgag aatgtaaata 6540 ataagaacct tcagattatg attaaattac atttaactct gattgctgta aatttaaaat 6600 atacaatttt atggaaacct aataaaaatt ttgaaatttt catttcaaaa tttttattaa 6660 attggcatgg tcgaa 6675 // ID Homo2 repbase; DNA; INV; 2774 BP. XX AC . XX DT 09-OCT-2009 (Rel. 14.1, Created) DT 09-OCT-2009 (Rel. 14.1, Last updated, Version 1) XX DE Homo2 is a putative autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HOBO; transposase; KW Homo2. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-2774 RA Ortiz M.F. and Loreto E.L.S.; RT "Characterization of new hAT transposable elements in 12 RT Drosophila genomes."; RL Genetica 135(1), 67-75 (2009)DOI 10.1007/s10709-008-9259-5. XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 429..1823 FT /product="Homo2_1p" FT /translation="MSLIRKHSEIWNHFEDIGSQQAKCKYCKCTISWKSLS FT NLSRHLKSKHPTAMEPVVRQSEEITVVVQNPQPKIKNFVHKPLSDGKAEQI FT DRQLVKMIAKGHHALRLVEEPEFKKLIDDVSHAPGYKLPTRKTLTTSLIPK FT VRDELLGQILDNLRMATAVCLTTDGWTSLSNESYIAVTVHYINQDKTVLQS FT HTIACEAFEESHTSENLFGFLKKIVGKWELQNKVMAIASDNAHNIVGAIKL FT GNWKQVRCFAHSLNIIVQKALEKISSVRTKVKAIAEYFNRSSSGLKKLKDM FT QAMFNLPQLKLTQDVPTRWNSTFKMFQRLSMLKEAVVAALSTRTDLILSPE FT DWSLIEGVLPILQPFYQLTEEICAEKNVTLSKIIVLVGLLQKKMVTLNASI FT ASHTLQEVVETLIFEMDGRFREVETNVLYAESTILDPRFKRRVFKSAEAFQ FT NVVSDIKKKTDKNAHRCRTE" XX SQ Sequence 2774 BP; 893 A; 529 C; 565 G; 787 T; 0 other; catttcatat ttatttaaga aaatatacca ttgcgctata aaacctaaac cacaagacta 60 attaattatc gcaggggctt caacaaataa atttgttggc aacgaatcca cagtcagctg 120 aatagtgttg ggttgctcat gagtgagtga acaaaaaaga gttgttcact aaactgagca 180 actgaacatg ttcctctcga attgttcttt attgctcact tgttcttttt tgctcacttg 240 ctcactatag ctcacttgtt cttttttgct cacatgctct tttgctcagc tgtttttcaa 300 cactcactgt tgcctttgct catttttacg tactgttagc tcattttgcg ttcttccgtt 360 ttttattgtc ttcaaactca cattttcgtg tttcgcgcca acttttgttt cgcttttaaa 420 aattcattat gtcgctaatc cggaagcaca gtgagatttg gaatcatttt gaggacatag 480 ggagtcaaca ggccaaatgc aaatattgca aatgcaccat tagctggaaa tcgctgagca 540 atttgagccg gcaccttaaa agcaaacacc ctacagctat ggagcccgtt gttcgacaga 600 gcgaggagat aacggttgtt gtacaaaacc ctcaaccaaa aattaaaaat tttgtacata 660 agccgttgtc agatggaaaa gcagaacaaa ttgaccgcca gcttgtgaaa atgatcgcca 720 agggtcatca cgctttgcga ttggttgaag agccggaatt caaaaagctt attgatgatg 780 tctcgcacgc tccagggtat aagcttccca caagaaagac gttaaccacc tctttgatac 840 ccaaagttcg tgatgaactt ttgggacaaa ttttggacaa tttgcgaatg gccactgcgg 900 tatgtctaac aacagacggc tggacatcac tcagcaatga gagctatata gcagtaactg 960 tgcactacat aaatcaggat aaaacagtgc tacaatcaca caccattgct tgcgaggcct 1020 ttgaagagtc acacacttca gaaaaccttt ttggtttcct caaaaaaatt gttggcaagt 1080 gggagttaca aaataaagta atggctatag cttcggacaa tgcccacaac attgttggtg 1140 ctataaaatt gggcaattgg aaacaggtga ggtgttttgc gcactctcta aacatcattg 1200 ttcaaaaggc tttggagaaa atcagcagtg tgcgcacgaa agtaaaagct atcgcagagt 1260 actttaaccg tagctcatca gggctaaaaa agttaaagga catgcaagct atgtttaact 1320 tgccacaact gaaactcacg caggatgtgc ccacaaggtg gaattccacc ttcaaaatgt 1380 ttcaacgact gtcgatgttg aaagaggcag ttgtagcagc cctctccaca cggacggacc 1440 ttattttgtc cccagaagac tggagtttga tagaaggtgt acttccaatt ttacagccct 1500 tctatcaact aacggaagag atctgcgcgg aaaaaaacgt gacgctatca aagataattg 1560 tcttagttgg gttgctgcaa aaaaaaatgg tgacactaaa tgcgtctatt gcaagccata 1620 ctttgcagga ggttgtcgag acgctcattt ttgaaatgga tggcagattc agagaggtgg 1680 aaacaaatgt tttgtacgca gaaagcacca ttctggatcc aagatttaaa agacgagtat 1740 ttaaatcagc agaagccttt caaaacgtgg tttctgacat aaaaaaaaaa actgacaaaa 1800 atgcgcaccg ttgtcgcact gaatgaagtt gtcccagctg aagaattggg ttctactgac 1860 aaagacgaca tctggaacga gtacgacaat aagttccaac aagtaaccca ccccacaaat 1920 acaattgctg ctggtattcg agaaatggat aaatatttag ctgaagagta tatatctcgg 1980 aaggctgatc cattggaatg gtggaatcag cgaaaaatgc aatacccgca tttatataca 2040 tatatgctaa gtcgcctatg catagttgca acatctgtgc catgcgagcg catattttcc 2100 agtgctggtg aaacagtaac caaaagacgg tccctgttga agccgaccaa tgtggaaaat 2160 cttatgattt tacataataa catgtaaatt ttatgtatgt tcctaaataa tagtattttt 2220 atttaacgag aagactaata ttcccaatga agggaatttc tttaacttta actttttgtt 2280 ttttattttc taatttagtt ttaagtacaa gttttataga acataaatac tcccaatgaa 2340 aggaataaaa tctgaataaa gtttttggtt tgttttgtta aattagtttt aagtacaaga 2400 cttataaaaa tctttaaaaa agatttaaaa tggaacataa cgtatttcat tgtctgttgt 2460 gtgaagaatt cagtagcttt gagctgttca tgaacagtga gcaactgagc aatgtgagtg 2520 agcaatgtga atgagcaact gagcaaagtg agcgagtcga ctcacttcga aaaaaagaac 2580 aagtgaacat gttcaccaaa ttgagcaacg tttacccaac actacagctg aaactagccc 2640 agcttagact cgggataaac cgaaggttcg ttacaactgg cgcccaacgt ggggccctgc 2700 gataagagtg tgattagtga atacttgtga atatggtgcg agcgtgggta tacaatctta 2760 aaaaagatga agtg 2774 // ID Gypsy-625_AA-I repbase; DNA; INV; 4695 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-625_AA_; KW Gypsy-625_AA-LTR; Ty3_gypsy_Ele63; Gypsy-625_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4695 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [3727-4047] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 101..3916 FT /product="Gypsy-625_AA-I_2p" FT /translation="MADGDSDRNGQNGVDPPGVINRQQLNQPIIPAPAAHQ FT QQQQQQQIFVPVVTTHQQQQPFVPVSSAPQQQFSTFQSHQTPQAQVSTDML FT MQIMHMMQQSMTQSQQRHQQLMGQIIAQQQQFLSSVASSINVQVPPNPEQI FT LDSLASNIKEFRYEAESNATFAAWYSRYDDLFEKDAARLDDEAKVRLLMRK FT LGLSEHERYVSYILPKFPKEFSFAQTVAKLKSLFGAKESVISRRYRCLQIA FT KNPTEDHVAFACRVNKACVEFELGKLTEEQFKCLVYVCGLKSENDVEIRTR FT LLTKIEDNNDVTLEQLSEECQRLFNLKHDSAMIEAPSPYNQVQAVRKFGGK FT RFDKRDREAQTHFSSDAVKKPTYPCWLCGALHYARDCSYKNHKCSDCGQFG FT HREGYCESAKSRKPGSKRKKRVVSTKVVVVDVCSVQQRRRFVSVGLSGTNI FT RLQLDTASDITVISRESWQKLGSPALSPATVKAKTASGNILSLDGEFECDV FT TIGESTRRELIRVTGKQLQLLGSDLVDSFNLWSVPMDSFCCHVSGSPASPA FT ALKSSFPNVFSEQLGLCSKTKVKLELKESVRPVFCPKRPVAYAMYDAVDQE FT LDRLEKLNIITPVEYSEWAAPIVVVRKANGSIRICGDYSTGLNAALQPNQY FT PLPLPDDIFAKLANCKVFSQIDLSDAFLQVEVDEQRRKLLTINTHRGLYSY FT NRLPPGVKVAPGAFQQITDTMLAGLECTSGYLDDVIVGGRTEEEHDRNLRA FT VLKRIQDFGFTIRPEKCTFRKQQVQYVGHVVDSRGLRPDPAKIEAITKLPP FT PTDVSGLRSFLGAINYYGKFVPNMRKLRYPLDNLLKDDAKFQWTPECQKAF FT EQFKAILSSDLLLTHYDPKREIIVSADASSVGLGATISHKFPDGSIKVVQH FT ASRALTKAEQGYSQPDREGLAIIFAVTKFHKMLFGRHFRLQTDHQPLLRIF FT GSKKGIPVYTANRLQRFALHLLLYDFDIEYVPTHKFGNADLLSRLINQHVR FT PEEDYVIASLNLEEDLRSVVSNTVKVLPLNFRAVAQSTQADPLLRQVYHHV FT QHGWPQSKLSGSDIQRFQVRQESLSVVDGCVMFAERLVIPSLLRKRCLEQL FT HRGHPGMQRMKALARSYVYWPSLDADIVDFVKACRHCASVARSPPHSPPVP FT WPKPAAPWQRVHVDYAGPIEGDYYLIVVDSFSKWPEIVQTNRITSAVTIPF FT SVVCLLGWVCPLHWSATTVPSSQAPNLPISAPPTASNTSRRPHSIHNRMAK FT RNDSWTLSRGP" FT CDS 3523..4683 FT /product="Gypsy-625_AA-I_1p" FT /translation="MSTLCVRSQIASSLSTGAVAQTSRSMAARPRRLRRSN FT RRRLLPHRRRFLLQVAGDRPNESYHLGGDHSILRGLFARLGMPVTLVSDNG FT TQFTSAEFADFCASNGIEHLTTAPFHPQSNGQAERFVDTFKRAVKKIREGR FT GSIQHALDIFLLTYRSTPNRALPDQKSPSEIMFGRKIRTCLELLRPPPIRT FT PVPTSDDLKKSRSFSRNDPVYAKLHGRNGWKWVPGTVVEKIGDVMYNVWID FT DRRMLRSHINQLRSRHAADTTPKPSIGQATSNQHLLPLDILLGAWNLPSQP FT AGTPTPSLVPSASLSSPTLAPVSSPEPTLLGSTPARASSTPRHEVPAVPST FT SSSSTTSTSPEFESAIEVEPVVDLPRRSSRTRRPPIRFDPYHLY" XX SQ Sequence 4695 BP; 1128 A; 1382 C; 1184 G; 999 T; 2 other; aaagtggcga cgagaattcg cgagtgaggg tagtttttcc gtcgataaaa actcacccaa 60 catttcgtca cccgcgcgga agaaaagaaa gtggagtgcg atggctgacg gtgatagtga 120 tcgaaacggg caaaatggtg ttgacccccc gggagtgata aatcggcagc aattaaacca 180 gccgattatc cccgcgccgg cggctcacca gcagcagcag cagcagcagc aaattttcgt 240 ccccgttgtg actacacacc agcagcagca gccattcgtt ccggtttctt cagctccaca 300 acagcagttc tcaaccttcc aatcccacca aacaccgcaa gcccaagtaa gtaccgatat 360 gctcatgcaa attatgcata tgatgcagca atcgatgacc cagagtcagc agcggcacca 420 acagcttatg gggcaaatta ttgcgcagca gcagcagttc ctaagtagcg tcgcgtcgtc 480 aataaatgtt caagtgccac ccaatccgga acaaatcctt gactccttgg ccagtaatat 540 caaggaattc cggtacgagg ccgaaagcaa tgccactttc gcggcgtggt actctcgcta 600 cgacgacctc ttcgaaaagg atgctgcsag gctggacgat gaggcgaaag ttcgcctact 660 tatgcgcaaa ttgggtttat ctgagcatga gaggtatgtg agctatatct tgcctaaatt 720 tccgaaggag tttagcttcg cacagacagt tgccaaactc aaaagcctgt ttggagcgaa 780 agagtcggtg attagccgcc gataccgatg cctgcagatc gcgaaaaatc ccacggagga 840 tcatgtggcg ttcgcatgca gggtaaataa ggcttgcgta gaatttgaac taggaaagct 900 cacagaagag cagttcaagt gcttggtgta cgtttgtggc ctgaaatcgg agaatgacgt 960 cgagatccgc actcgtctcc tcacgaaaat cgaggacaac aacgacgtca cgttggagca 1020 gctctccgag gagtgtcaac gcctgttcaa tctcaagcat gacagtgcga tgatcgaagc 1080 tccatctccg tacaaccaag tgcaagcggt gaggaagttt ggtgggaaga ggttcgacaa 1140 gcgtgatcgc gaggcacaaa cacatttttc cagtgacgcc gtcaagaagc caacctaccc 1200 gtgttggctc tgcggtgcat tacactacgc ccgtgattgc agctacaaga accacaaatg 1260 ctccgattgc ggccaattcg gacaccgtga aggttactgc gaaagtgcca aatcccggaa 1320 acckggaagc aaacgcaaga agcgagtagt ctcgacgaag gtggtggttg tcgacgtgtg 1380 cagcgtgcag cagcgacgca gattcgtttc cgtcggcctt tctggaacga atatcagact 1440 gcaactcgac accgcctcag atatcaccgt catcagcagg gaaagttggc agaaactcgg 1500 cagccctgca ttatcgcctg caacggtgaa agcgaagaca gcatccggca acatcttgtc 1560 gctcgatggg gaattcgagt gcgacgtcac catcggagaa agcacacggc gagagctcat 1620 ccgtgtcacc gggaaacaac ttcagctact cggttccgac ttggtggaca gcttcaatct 1680 ctggtccgtg cccatggaca gtttctgttg ccacgtgtca ggctctcctg cgtcacccgc 1740 tgcactcaag tcgtctttcc ccaacgtttt cagcgagcag ctcggcttgt gcagcaaaac 1800 gaaagttaaa ttggagttga aagaaagtgt tcgacccgtt ttctgtccga agcgtccggt 1860 tgcgtacgcg atgtacgatg ccgtcgacca ggaactcgac cggctggaaa agctaaatat 1920 catcaccccg gtcgaatatt cggagtgggc cgctccgatc gtcgtcgtcc gcaaagccaa 1980 tggttccatt cgaatttgcg gagactattc cacgggtttg aacgctgcgc tccaaccaaa 2040 ccagtatcca cttcccctgc ccgacgacat cttcgccaag ctggctaact gcaaggtctt 2100 cagccagatc gatttgtccg acgccttctt gcaggtggaa gtcgacgagc agcgccgtaa 2160 gttgctaacc atcaacactc atcgtggtct ttactcctac aaccgcctcc cgccgggtgt 2220 gaaagttgcg cctggtgcgt ttcagcagat caccgataca atgttagccg gtctggaatg 2280 cacttccggc tatctcgatg atgttatcgt tggcggccga actgaagagg agcacgaccg 2340 caatttacgg gctgttttga agcgaatcca ggatttcggc ttcaccatcc gacccgaaaa 2400 gtgcacgttt cgcaagcagc aggtgcaata cgtgggtcac gtcgtcgata gtcgcgggtt 2460 acgtccggat ccggccaaaa tcgaggcgat tacgaagctg ccgccgccta cagatgtgtc 2520 cggactacga tccttcctgg gggccatcaa ctactacggc aagtttgttc ccaacatgcg 2580 taagctccga tacccgctcg acaatctcct gaaagacgac gcaaaattcc agtggactcc 2640 agagtgccag aaagcgttcg agcagttcaa ggcaattctc tcctcggatc tgctactcac 2700 acattacgat ccgaagcggg agatcatcgt ttctgccgac gcttcttccg ttgggcttgg 2760 ggcaacaatc agccacaagt tccccgacgg tagcatcaag gtcgtccagc acgcttccag 2820 ggcactcacg aaggccgagc aaggctacag ccaaccggac cgtgaaggtt tggccatcat 2880 atttgccgtc acgaaattcc acaaaatgct cttcggacgg cactttcggt tgcaaaccga 2940 ccaccagcct ctgctccgta tcttcggctc gaaaaaagga ataccggtct acactgccaa 3000 ccggctccaa cgtttcgctc tccatctgct gctctacgat ttcgacatcg agtacgtgcc 3060 cacccacaag tttggcaacg cagacctgct ttcccggttg atcaaccagc acgtcaggcc 3120 cgaagaggac tacgtcatcg cgagcctcaa cctggaagag gatctcaggt cagttgtttc 3180 taatacggta aaggttttac ctctcaattt cagagccgtc gcgcaaagca cccaagcaga 3240 cccactgctc cgccaagtct accaccacgt tcaacatggc tggccccagt caaagctttc 3300 ggggtccgac attcaacggt tccaagtcag gcaggaatcg ctctccgtgg tagatgggtg 3360 cgtcatgttt gccgaacggc tcgtcatccc gtcgctgctc cgcaagcggt gcctcgaaca 3420 gcttcatcgt ggccatcccg gcatgcagcg tatgaaggcc ctcgccagaa gctacgtgta 3480 ttggcccagt ttggatgccg atatcgtcga cttcgtcaag gcatgtcgac attgtgcgtc 3540 cgtagccaga tcgcctcctc actctccacc ggtgccgtgg cccaaaccag ccgctccatg 3600 gcagcgcgtc cacgtcgact acgccggtcc aatcgaaggc gactattacc tcatcgtcgt 3660 cgattccttc tccaagtggc cggagatcgt ccaaacgaat cgtatcacct cggcggtgac 3720 cattccattc tccgtggttt gtttgctcgg ctgggtatgc ccgttacact ggtcagcgac 3780 aacggtaccc agttcacaag cgccgaattt gccgatttct gcgcctccaa cggcatcgaa 3840 cacctcacga cggccccatt ccatccacaa tcgaatggcc aagcggaacg attcgtggac 3900 actttcaaga gggccgtgaa gaagattcga gaggggagag gatcgataca gcacgcactg 3960 gacattttcc tgctcacgta ccgcagcaca cccaaccggg ctctgccaga ccagaagtcg 4020 ccatcagaga tcatgttcgg gcgcaaaatc cgcacgtgtc tcgagcttct gcgtcctccg 4080 ccgatacgta caccagtgcc aacgtccgac gacctcaaga aatcgagatc cttcagccga 4140 aacgaccctg tttacgccaa gctccacggt cgtaacggtt ggaaatgggt tcccggtaca 4200 gtcgtcgaaa agatcggaga cgtgatgtat aatgtgtgga tcgacgatcg ccgaatgttg 4260 cgctctcaca tcaaccagct ccggagtcgt catgctgctg acacgacacc gaagccttcc 4320 atcggtcaag ccacctctaa ccagcatttg ttgccgctgg acatcctgtt gggtgcctgg 4380 aatcttccca gccaaccggc cggcacgcca actccttcgc ttgttccgag cgcctcgtta 4440 tcgtcaccta cgttagcgcc tgtctccagt cccgagccta ccttgctagg ttccacgccg 4500 gcccgtgcat cgtctacgcc acgacacgag gttccggctg tcccgtccac ctcttcatca 4560 tcaacgacat caacatcacc cgaattcgaa tccgcaatcg aagttgaacc ggtggtagat 4620 cttcctaggc gatcttcacg gactcgaaga ccgccaatca ggttcgaccc ctaccaccta 4680 tattaagagg ggaga 4695 // ID hATm-12_HM repbase; DNA; INV; 3972 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3972 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 216-216 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(970..1197,1297..3429) FT /product="hATm-12_HM_1p" FT /translation="MNKIKRTRSNSCPGSKKVKEFIGLGADLLPSELPTLR FT NVLQLMLLEMESLDTNLQYVSKEIVHEAAKKSSCKVSKVVVAQYKKVNINI FT QIQPNHYIVQRIEKEWKRVTQTVWGRVKSSQKDKLYNEIDVLFDILRCKHI FT IICKEDPTCSDPKCSHSAISPTCNCPSNLKIPEIELKFIKAQRLKIGMKSV FT YQIHCIDKKEHTRLAKQCKRKEKALIFKKLEEKKVELNNNKFFNEEIEVDV FT IEKANVSSNQFIDTEIDDGIKESVINTNHFPLLAQTVLRYGTSNREAAAIS FT SAVLVDLGIVNKTNNKLIIDHHKIHRETKKVMEKVQKEDLQLYQSEEIKAL FT FFDSRKNLTLINNQDKTTLKYYSKTIKMDFYTVTSEPGGKYVFHFSPLKPG FT VGEKPALMIAKPIADWCRKFDINLQYIGADSTATNTGRLGGAIRRIEDLLN FT KKLTWIICMLHTNELPLRHLIEELDGKTTSNNGFSGVVGKLLNCATLLPVA FT QAFPTITIGKRPVELSKDVIDDLSTDQLYGYKMVLAIRSGNIPIDLIKMEC FT GPVNHSRWLTTANRLMRIWVSVHGLSGQNFNNLSCLVEYVVGVYYPMWFDI FT KVRWSFIEGPRHVLETLRLVKMQSQVTRDIVEKYIILGAWFSHSEAVLTTL FT LCSSIVEERIFAVDRILQIRRGFDKGETSVRERTHSSFLNIDAKNLTELCS FT WTHNVFEPIQTCSFSTETIVDFKVNPMVVDSIPCHTQSIERAVKQTTRACA FT AVYGHDSRDGFIRAGCHHRKLLPKNNTKKNLKQMII" XX SQ Sequence 3972 BP; 1446 A; 516 C; 583 G; 1425 T; 2 other; tagggtgtag tgaaaaaact tttttttcaa aatttaaaat ttaaatgtta ttttcagatg 60 tccatagaca tattaaggtt tactatgcaa tttttttttt cacaacttta aaataaatta 120 ccaaaatgaa gttttaaata ttttataatg atcaaattaa atagttacta tttttggtcg 180 aaaacagaat taacggaaac atcgtataaa aaacaaaaaa cagacttata taaattaaat 240 taatacacat attttttaac tactaaataa attttatttt atcaataaat tttattttat 300 ttgtataaaa tttattatta taaaaattat attaaatatt tttatagcaa aacaaaaacc 360 tttaacttca acttttaatg gtctgtaaaa ttatttttta tttattttat ataatttgca 420 atgtttacat tttaaaatta tatttgcgtt ttcttgattt tgttgtgcaa agtttaaaga 480 gttatttagt tattttaagt agagttataa ataagttcag agagaaaaat taataacttt 540 taggataatt actgaaattt tgtaattttt caatgtattt ttgtttatgt ttatgattta 600 aagaatcaaa aataatctaa agttgacttt tttgttttta aattkaattt ctatactttt 660 tattaattat attttatacc aaaatattca cttgaaaaat tttgcaaacc aagaatttgt 720 ttaatgataa gtataaatat attagccaaa tatactttcc tgcacatacc ctatacattt 780 tttcaataac ttatatatag tataactaac aattagttat attatattta agttctttta 840 aaaaatttgt aagatctata acataaagat atatatatat atatatatat atatatatat 900 atatatatat atatatatat atatatatat atataataaa tatataattt ttatagatta 960 ataaatatta tgaataaaat aaaaaggaca cgtagtaatt cttgcccagg cagcaaaaaa 1020 gttaaagaat tcattggtct tggtgctgat ttgttaccct ctgaattacc aaccctcaga 1080 aatgttttac aactaatgtt gttagaaatg gagtccttag acactaatct tcagtatgtt 1140 tcaaaagaga ttgttcacga agctgccaaa aaatcatctt gtaaagttag caaagtctaa 1200 tattttaata aaattgttat ataatatata tttgttttct actgtttcat tattatattt 1260 tgctatttta aataatattt tttatttctt atttaggttg ttgcacagta taaaaaagta 1320 aatatcaata ttcaaattca acctaatcac tatatagttc agagaattga aaaagagtgg 1380 aaaagagtta ctcaaactgt ttggggaaga gtgaaaagtt ctcaaaagga taagttatat 1440 aatgaaattg atgtattgtt tgatattctt cgttgcaagc acattattat atgcaaggaa 1500 gatccaactt gttctgaccc taaatgttct cattctgcta tttctccaac ctgtaattgt 1560 ccatccaatt taaaaattcc tgaaatagag ctaaagttta tcaaagctca gagattgaaa 1620 attggaatga aaagtgttta ccaaatacat tgtatagata aaaaagaaca cactcgacta 1680 gctaagcaat gcaaaagaaa agaaaaagct ttaatcttta aaaaattaga ggagaagaaa 1740 gtagagttaa ataataacaa gttttttaat gaagaaatag aggtagatgt cattgagaaa 1800 gcgaacgtat cttctaacca gtttattgac actgaaattg atgacggaat aaaggaatct 1860 gtcatcaata caaatcactt tcctcttctt gctcaaacag ttttaagata cggaacaagt 1920 aacagagaag ctgctgctat ttcttctgca gttcttgtag atcttggcat tgttaataaa 1980 acaaataata aattgattat tgaccaccat aaaattcaca gggaaacgaa aaaagtaatg 2040 gaaaaagtcc agaaagaaga tttacagttg tatcagtcag aagaaataaa agcattattt 2100 tttgacagta gaaagaattt aactcttatt aataatcaag ataaaacaac attaaaatat 2160 tattctaaaa caataaaaat ggacttctac actgtcacta gtgaaccagg tggaaaatat 2220 gtattccatt tctctcctct taagcctggc gtgggcgaaa aacctgcatt aatgatagct 2280 aaacctatag cagattggtg tagaaaattt gatattaact tacagtatat tggagctgac 2340 tctacagcta ctaacactgg aagattaggt ggggctataa gaagaataga ggatcttttg 2400 aataaaaaat taacatggat tatatgtatg ctgcatacta atgagcttcc gcttaggcat 2460 cttattgagg aactagatgg taaaacaact tccaataatg ggttcagcgg agttgtgggt 2520 aaattattaa attgtgctac actacttcca gtagcacagg catttccaac aataaccatt 2580 ggtaaaagac cagttgaatt aagtaaagat gtgattgatg atctttcaac tgatcagctc 2640 tatggttata aaatggtttt agctatacgt tcaggtaaca tacctataga tcttataaag 2700 atggagtgtg gtcctgttaa tcacagcaga tggttgacaa ctgcaaatag actgatgcga 2760 atttgggtat cagttcatgg tctgtctggg caaaacttta acaatttaag ttgtttagtt 2820 gaatatgttg ttggtgttta ctatccaatg tggtttgata taaaagtgag gtggtctttt 2880 attgagggtc caagacatgt kctagaaacc ttgagattag taaaaatgca aagccaagtt 2940 actagggata ttgttgaaaa atatatcata ttaggagcct ggttttctca cagtgaagct 3000 gttctaacca ctttgctttg tagctctata gtggaagaac ggatttttgc agtagatcgt 3060 attctgcaga ttagaagagg ttttgacaaa ggagaaacat ctgtccgaga aaggactcat 3120 tcttcttttt taaacataga tgctaagaat cttactgagt tatgttcttg gactcataat 3180 gtatttgagc cgattcaaac atgtagtttt tctactgaga caatagttga ttttaaggtg 3240 aatcccatgg ttgttgattc cataccatgc cacacacaga gtattgagcg ggctgttaaa 3300 caaacaacaa gagcttgtgc tgctgtctat ggacacgatt caagagatgg ttttattcga 3360 gcaggctgcc atcaccgtaa acttttacct aaaaataata ccaaaaagaa cctaaaacag 3420 atgattatat aatttttata tttttaatat gagtgtatat attctttact tattggatct 3480 tcacaaattt atattttttt atttatacta aatggggtcc ataaccaata aagggctcat 3540 tttgcaggtc tactctttgt atagtttttt tctatttttg gaatgggtta tgccaaaaat 3600 tttgatatat tatacccacc ctatatgtgg caaataagga tctagactta gtcttaaaac 3660 tcaattcccc cccccccccg accgcccctc ctcatttaaa aagtctctga taaacttttt 3720 tttttttttt ttaactattg ttttgaaata gataaaaatt taatagcagt ttttaaattt 3780 tgacaaaaat atgatgtgtt taaaagttat gactatttat tgttaataac tcccccattt 3840 accccatttt ggtaatttat tttgaagtta tgaaaaaaaa tttttaacag tatatcttaa 3900 aatgtctatg gacatctgaa aataatattt aaattttcaa ttttgaaaaa aaagtttttt 3960 cactacaccc ta 3972 // ID DNA-2_PPac repbase; DNA; INV; 627 BP. XX AC . XX DT 07-JUL-2010 (Rel. 15.07, Created) DT 07-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE Non-autonomous DNA transposon from the Pristionchus pacificus DE genome. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2_PPac. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-627 RA Jurka J.; RT "DNA transposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 954-954 (2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. XX SQ Sequence 627 BP; 165 A; 145 C; 147 G; 169 T; 1 other; taggggccac aaagtggata gccgatttta tgcactggga tcaaattagt gttcaaacaa 60 tccctcatgg acccaacaac atcatgtttt ttgccgtttg tcgatatcgt gtctctgagc 120 tgcgctagag cccgagaagt acgcgcgcaa cgcactgggg aagaggaggg aaaggcgaga 180 cgagagggaa gggggcggag tctcatttgc gtgtgtctct ctccctcccc ccgccctctc 240 gcgcgcctct ccccgcgttc caagcacacc ttagcggggc atgataaaaa tctgaaaaaa 300 attctaaaaa tcctaaaata ttctaattag tcaanaaatt gctgtagaaa tcctaaaata 360 ttctaattag tcaataaatt gctgtaggtt tggggtacat actcaagaat aggtcaaatt 420 gctactctcg cgaccttctt tatcgatttt gacctactcg ctgaaaactt tttcaagacc 480 tacaagattt ttctgaaagg gggtgctagt ggctatccaa cgcgctatcg gacgacctat 540 ttcgtttaga atgaactttg tgctgctccg tctcgttttt attggaggag catgtcacac 600 aaaagtgggc ggggcttgtg gccccta 627 // ID Chapaev3-1_DA repbase; DNA; INV; 3126 BP. XX AC . XX DT 29-FEB-2008 (Rel. 13.02, Created) DT 29-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Chapaev3-1_DA is an autonomous DNA transposon - consensus. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev3; KW Chapaev3-1_DA. XX OS Drosophila ananassae OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; ananassae subgroup. XX RN [1] RP 1-3126 RA Kapitonov V.V. and Jurka J.; RT "Chapaev3, a distinctive group of animal Chapaev transposons."; RL Repbase Reports 8(2), 44-44 (2008). XX DR [1] (Consensus) XX CC Chapaev3-1_DA belongs to the Chapaev3 group of the Chapaev CC superfamily of DNA transposons (see comments on Chapaev3-1_PM). CC Chapaev3-1_DA is a very young family of fruit fly Chapaev3 CC transposons: genomic copies of Chapae3-1_DA elements are ~99.5% CC identical to their consensus sequence, which was derived from CC multiple alignment of three Chapaev3-1_DA elements. Chapaev3-1_DA CC contains 12-bp TIRs, ~550-bp subterminal inverted repeats, and CC encodes a 554-aa transposase (2 exons). CC This sequence was derived from sequence data generated by CC Agencourt Bioscience Corporation. XX FH Key Location/Qualifiers FT CDS join(910..2323,2393..2640) FT /product="Chapaev3-1_DAp" FT /note="transposase." FT /translation="MCKHSLYKFCFVCGSFILKKTSMKKLTKNIILAYKHY FT FGFDAKFIDKPWTPKSVCAACFCKLIQFGNGKKVYLPFGKPVEWRNPIDHV FT TDCYFCLTKVQGGKHFKVIYPDVQSVTKAAPHSSLYPKREPPAPKTTASDP FT LEELCSDRSSESDLENKDYEPKLMNQSELNDLCRDLNLSKNKSELLASRLQ FT EKKYLHPKTGVTYYRNRGKPFEQFFSKTEEFCFCNNVNALFSAFGEKHNAN FT EWRLFIDGSKYSVKAVLLHQGNKKPSIPVGHSTIAKETYGTMKQLLELIDY FT NSYKWKICCDLKVVAILTGLQGGYTKYCCFLCLWDSRANTKHYSSHDWPRR FT EQFVVGNNNVVAKPLVDIKNVIIPPLHVKLGLMKNFVRALAKDPESKGLKY FT LRTKFPKLSSEKIKAGIFVGPQIRTLFKDENFMQSLNHIEAGAWAAFRDVV FT EGFLGNKKSHNARELIHKMMEKFQEIGARMSLKMHFLHNHFDAFPENLGAE FT SDEQGERFHQDLSAIEKRYEGFWDKGMMGDYCWMQVREEKIDNKRKNLSDK FT CFFRPNL" XX SQ Sequence 3126 BP; 1070 A; 485 C; 556 G; 1015 T; 0 other; cactagttta caaatttaaa tgtaatcgag tgaaccttaa aattgatgat tgttttatag 60 taatttgggg tcggagaaca cgaaaatgac atccgtttat ttttcttaag aaccgttttt 120 gagatatttt tgaaaatcgt gtttttgagg tcctttttca gttttttggt aatatctcat 180 gaaaaaaact gtttacaaac aaaaaaagga tattgattga caggtattga agtaacgttt 240 acgtggatat ataatatgtg cagattgaat caaatctctg taatgcacaa gcgaacaaag 300 tacaaaaata gtgtcatttt cgaccaattt tgaaaatttc aatttttcag atcgattttt 360 gccaaaaaaa acaatttttt ttctaaggtt tcaatatttg agggaaaaga tttattattt 420 accaacaatt taagcctaag attatccaaa ttgagtaaga agtttaaaag ttatagtaat 480 attaacattt tgacacttca taaattaggt cgaaaaaaac atttatcact caaatttttt 540 cgtttgttat caattttctc ttcacgcact tgcatcttgc aaacatttta tctgaaatcc 600 aagacgactg ttcttcttgt tctgtttaca gtttactttt gatgggtcta tgttttgagg 660 ggggatatga tgggagaagc actaaatgtg gggacatatt ttactgattg gaccaccttg 720 ttgttattta atcatgaatt tttcaaagta agacagaact gttgagtaaa tagaaataca 780 acagtaaata caaagaaata caaaataaag ttgtgaatat actgatgaac ataccgattc 840 gttcgtatta gcgagtgttt agttgataga ctgagctagt gtatacttag ttcaaaaact 900 taattcaata tgtgcaaaca ttcattatat aagttttgtt ttgtctgcgg ttccttcata 960 ttaaaaaaaa cttccatgaa aaagttaacc aaaaacatta ttttggctta taaacattac 1020 tttggatttg atgctaaatt tatagacaag ccatggactc ctaagtcagt ctgtgccgcg 1080 tgtttttgta agttaattca atttggtaat ggcaagaagg tgtatttgcc ctttggaaaa 1140 cctgtagagt ggaggaatcc gattgaccat gtgactgatt gttacttctg tttaacgaaa 1200 gtccagggag gaaaacattt taaagttatt tatcccgacg tacaatcagt tacaaaagcc 1260 gcaccacact cttctctgta ccctaaacga gagcctccag cacctaaaac aacagcaagt 1320 gatcctctgg aagaattgtg ttcagatcgc agtagcgaga gcgatcttga aaacaaagat 1380 tacgaaccaa agttaatgaa tcagagtgaa cttaatgatt tatgccgtga tcttaacctg 1440 tccaagaata aatctgaact gcttgcatcc cgcttacaag agaagaaata tttacaccca 1500 aaaacaggag tcacgtatta tcgtaatcgc ggaaaacctt ttgaacaatt tttttcgaaa 1560 actgaagaat tttgcttttg caataatgta aatgcgttat tttctgcatt cggggagaaa 1620 cacaatgcta acgagtggcg tttgtttatc gatggcagca aatatagtgt taaggctgta 1680 ttactccatc aaggtaacaa aaaaccgtcc attccggtcg gccattcaac tattgcgaaa 1740 gaaacctacg gtactatgaa acaacttctt gaactaatag attataactc ttacaagtgg 1800 aagatttgtt gtgacctgaa ggttgtcgcg atactcactg gtttgcaagg cggttatact 1860 aaatactgct gttttttatg tctctgggat agcagggcaa acacaaaaca ttactcctca 1920 cacgattggc cacgacgtga acaatttgtg gtaggaaata acaatgttgt tgcaaaacca 1980 cttgttgata taaaaaatgt gattatacca ccattgcatg tgaaacttgg actcatgaaa 2040 aattttgtca gagctttggc taaagatcca gaaagcaagg gcttgaagta tttgaggacc 2100 aagtttccga aattatcatc cgaaaaaatt aaagcgggaa tatttgttgg tccacaaata 2160 cggactttat ttaaagatga aaatttcatg caatctttaa accacatcga ggcaggagca 2220 tgggcagcat ttcgcgacgt tgttgaaggt ttcttgggaa acaaaaagag tcacaacgct 2280 cgtgaattaa tacataaaat gatggaaaaa tttcaagaaa tcggtgagga caatatataa 2340 atatataatt ataaaatgtt tatgctgaaa caactgctct ttttaatttc aggtgcacgc 2400 atgtccctga aaatgcattt tttgcataat cattttgatg cattccctga aaatttagga 2460 gctgaaagcg acgaacaagg cgaaagattt catcaggatc tttcagctat tgaaaaacgt 2520 tatgagggtt tttgggacaa gggtatgatg ggtgattact gctggatgca agtgcgtgaa 2580 gagaaaattg ataacaaacg aaaaaatttg agtgataaat gtttttttcg acctaattta 2640 tgaagtgtca aaatgttaat attactataa cttttaaact tcttactcaa tttggataat 2700 cttaggctta aattgttggt aaataataaa tcttttccct caaatattga aaccttagaa 2760 aaaaaattgt tttttttggc aaaaatcgat ctgaaaaatt gaaattttca aaattggtcg 2820 aaaatgacac tatttttgta ctttgttcgc ttgtgcatta cagagatttg attcaatctg 2880 cacatattat atatccacgt aaacgttact tcaatacctg tcaatcaata tccttttttt 2940 gtttgtaaac agtttttttc atgagatatt accaaaaaac tgaaaaagga cctcaaaaac 3000 acgattttca aaaatatctc aaaaacggtt cttaagaaaa ataaacggat gtcattttcg 3060 tgttctccga ccccaaatta ctataaaaaa cactttttta aggttcattt ttggtgtaaa 3120 ctagtg 3126 // ID Copia-18_CQ-I repbase; DNA; INV; 3185 BP. XX AC AAWU01015423; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_CQ_; KW Copia-18_CQ-LTR; Copia-18_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 351-351 (2011). XX DR GenBank; AAWU01015423; Positions 69169 65985. XX CC Positions [1540-2064] - Integrase core CC 'AGAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 178..2835 FT /product="Copia-18_CQ-I_1p" FT /translation="MADPKTGITRLNEANWSTWKLRMEALLDTEELWDVIE FT EEVPAAEQQDAVWTRKDRKARGHLIVALEDSQLRHIKGKVHARDIFGALKA FT HHEQATRSVRVSLLKKMCALILPDGGDLVAHLREFEVLFDRLEAAGTSLDT FT DTLICMLLRSLPSSYDGVVTALDCLADDDISLEIVKAKLEDEYSRQQERKG FT GSRQVEKAMRTAESKNRKKPPKCFNCGQLGHFKGNCPKLGGDQKKGGSSPK FT PDGGGGAKAKAAQNDARGVAFTVGGKQTACWVIDSGASAHMTNDREFFESL FT REFAGGNITLADGKKTEIQGEGSGTVFGIDGEGRKVKLDVQEVKYVPGLST FT NLISVGKLANKDYSVVFDKVSCSVVSDSKVIATGGRHGGLYYLRQAEEESM FT VAAERRHKVDCQHQWHRRLGHRDWAAAEKLVKDELATGMKVTDCGLRLECE FT CCMEGKSARLPFPPVIDRKSSKVLDIVHTDLWGPIKTVTPSGNRYVMTMID FT DYSRFTVIYLLKQKSEAADRIKEYVRWVENLFGRKPRVVRSDGGGEYDNRE FT LRTYYKAEGIKPQFTTAHSPQSNGVAERKNRSLTEMATCMLNDAVLDMRFW FT GEAMITAAYLQNRTPSRSIQGTPYELWWGRKPDLKHLRVFGSEAYVHVPGV FT NRGKLDSKARKLVFVGYAFEQKAYRFVDVETDKITVSRDARFLELGNGTST FT VEWSTDNPDEEIQLKPFVVKKEELKEEDAIEEDTAGDDVAGGQISESDDEE FT FQEAESTPNASPRRSTRSNLGTRPRYLDDFELENAVGIAACAVEEPVDHKE FT ALKEPVWRTAMLDELESHQKNGTWKLVQLPEGRKAIGSKWIYKVKRDENNQ FT IVKFKARLVAQGYAKHHVGEPSPTLRDAFIDEE" XX SQ Sequence 3185 BP; 790 A; 788 C; 1061 G; 546 T; 0 other; ggttatgggc ccagaagtgg acgagttttg gtgaaagagg acgattccac aggtactcgc 60 tggtgctccg cgtcgcgcgc ggtgcacggg aaaaagtgag gttggaatcg gggcgaaagt 120 ttgttcgcgt gttggacaaa agtgcgtgca aacaaaaaga caaaagtgct agtaaagatg 180 gccgatccga agaccggaat cacccggttg aacgaggcca actggtccac gtggaagctg 240 cggatggagg cgctgctgga tacggaagag ctgtgggacg tgatcgagga agaagtcccc 300 gcagcggagc aacaggatgc tgtctggacc cggaaggacc ggaaggcgcg aggtcatctg 360 atcgttgcgt tggaggacag ccagctgcgg catattaagg gcaaggttca cgcgcgggac 420 atcttcgggg ctctcaaggc ccaccacgaa caggcgacgc gctccgtccg tgtttcgctg 480 ctgaagaaaa tgtgtgcgct cattctcccg gatggcggcg accttgtggc gcatctgcgc 540 gagttcgagg ttctcttcga ccgtctggaa gcggccggaa cgtcgctcga cacggacacc 600 ctgatctgca tgctgcttcg aagcctcccg tcgtcgtacg atggggtcgt gaccgcgctg 660 gattgcctcg cggacgatga catttctctt gaaattgtca aggcaaagct ggaagatgag 720 tacagccggc agcaggaacg gaagggaggt tctcgccagg tggagaaagc gatgcggact 780 gcagaaagca agaaccggaa gaagccgccg aagtgcttca actgtgggca gcttggacac 840 ttcaagggga actgcccgaa gctcggcggc gaccagaaga agggcggaag ttcgcccaag 900 ccagacggtg gtggaggcgc gaaggcgaag gccgcgcaga acgatgcgcg gggcgtcgcg 960 ttcaccgtcg gagggaagca aactgcctgc tgggtgatcg acagtggtgc cagcgctcac 1020 atgaccaacg accgggagtt cttcgagtcg ctgcgtgagt tcgccggtgg caatatcacg 1080 ctggcggacg gtaagaagac ggagatccag ggagagggca gcggaacagt gtttggcatc 1140 gacggtgaag gaagaaaagt gaaactcgat gtccaagaag tgaagtacgt gccgggactg 1200 tccacgaacc tgatctcggt cgggaaactg gccaacaaag actacagtgt agtgttcgac 1260 aaagtgagct gttcggtagt gagtgacagt aaagtgatcg ccacgggtgg acgccacgga 1320 gggctctact acttgcgcca agcggaggag gaatcgatgg tggccgccga acgtcggcac 1380 aaagtagatt gccagcacca gtggcaccga cggcttggtc atcgcgactg ggcagcggcg 1440 gagaagcttg tcaaggacga gctggccacg gggatgaagg taacggactg cggtctgaga 1500 ctggagtgtg agtgctgtat ggaggggaag tctgctcggc ttccgtttcc accggtcatc 1560 gaccggaagt cgtcgaaggt gctggacatc gtccacaccg acctgtgggg accgataaaa 1620 accgttacgc caagcggtaa tcggtacgtg atgacgatga ttgacgacta cagccgcttc 1680 acagtcatct acctcctgaa gcagaagtcg gaggcggccg atcggatcaa agagtacgtc 1740 cgttgggtgg agaacctgtt tggtcgcaag ccgcgggtgg tgaggtcgga tggaggggga 1800 gagtacgaca accgtgaact ccgcacctac tacaaggcgg aagggatcaa gcctcagttt 1860 accaccgcgc attccccgca gtcgaacggc gtcgctgaac gaaagaacag atcgttgacg 1920 gaaatggcca cctgcatgct gaacgacgca gtgctggaca tgcgattctg gggcgaagcc 1980 atgatcacgg ccgcgtacct gcagaacaga acgccatcca gatcgatcca aggcacccct 2040 tacgagctct ggtggggacg aaagcccgac ttgaaacatc ttcgggtgtt cggcagcgaa 2100 gcgtacgttc atgtcccggg tgtcaatcgt ggcaagctgg acagtaaggc aaggaagctg 2160 gtgttcgtcg ggtacgcatt cgagcaaaaa gcgtatcgtt tcgtagacgt ggagacggac 2220 aagatcaccg ttagccgtga tgccagattt ctcgagctgg gcaacggcac gtcaacggtg 2280 gagtggtcaa cggacaaccc agatgaggag attcagctga agccgttcgt cgtcaagaag 2340 gaggagctca aagaggagga cgccatcgaa gaggacacgg ctggtgacga tgtcgctggc 2400 ggccagatca gcgaaagtga cgacgaagag tttcaagagg cagaaagcac cccaaacgca 2460 agccctcgtc ggtcgacaag atcgaatctt ggtactcggc cgcgttatct ggatgacttc 2520 gagctggaaa acgctgtcgg gatcgcggcg tgcgctgtgg aggaacccgt cgatcacaag 2580 gaagcgctga aggagcccgt ttggcggacg gcgatgctgg acgaacttga gtcacaccag 2640 aaaaacggaa cgtggaagct ggttcagcta ccggaaggac ggaaagcgat cgggtcgaag 2700 tggatctaca aagtgaagcg agacgagaac aaccagatcg tgaagttcaa agccagactt 2760 gtcgcgcagg gctatgcgaa gcatcacgtg ggcgagccgt cgccaacctt gcgtgacgct 2820 ttcatcgatg aagagtgagt acgtcgccct ctgcgagtcg tgccaagaag cggtgtggct 2880 cagacaactt ctccgtgact tcggagagca acaagacaac ccgacggtca tcaacgaaga 2940 caaccaagac tgcttggcgt ttgtgcgttc cgaacgaacg agccgacgat ccaagcacat 3000 cgacacgaag cagaagtaca tccacgagtt gtgcctgagg aaggagatca agctggagta 3060 ttgcccaacc gaccagatga ctgctgacgt gttgacgaga ccgttggggc caacgaagca 3120 ccgggagttc tgcgaactac tcggactcgg gcagcatcag tgagaaggtt cacactgagg 3180 gggag 3185 // ID Gypsy-107_AA-I repbase; DNA; INV; 4479 BP. XX AC AAGE02029780; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-107_AA_; KW Gypsy-107_AA-LTR; Gypsy-107_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4479 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029780; Positions 32786 28308. XX CC Positions [3451-3969] - Integrase core CC 'GAGGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 154..4452 FT /product="Gypsy-107_AA-I_1p" FT /translation="MDTNQFKMFLEHQTNIFHQLMQSVNATATSVAAATTS FT SLKQQQHQENHQRPCAANVPVPQPSPLALEGDMEENFNFFVKSWNDYSQAI FT GMNEWPPGDNAKKVSFLLAIIGEPARKKYNNFELTAEESATTELALKAIKT FT KVVAKRNVIVDRLDFFSAVQASRETVDDFCARLRNLARISKLGDLESELMT FT YKLVTANKWSNLRTKMLTMSDITLEKAVDICRAEEITEKRSQELNYTVPES FT DVNKITKGKPRTKHSKPPRCKFCGDAHEFTKGTCPAFGKRCRKCNGKNHFE FT RVCQAAMKRRSRWSRKVREVREEGSSDEETTSADESTEESEQEYEISKIFD FT NSDSGGGVTAELEMNFTKTWKSVLCDLDTGANTSLIGFNRLLELTGQREPM FT LQPSNLKLQSFGGNPINVLGQVKVRCRRLGRKYRLVLQVVDVDHRPLLSAK FT ASREFGFVKFCKTVEFDSSRPNTTTDDKMLKVYRVQAQRIINEHQELFEGY FT GKFPGEVNLELDESVQPIIQPPRRVPIALREDLKKELEALEQSGIIVKEQN FT HTSWVSNLVLVQRGGPNSGIRICLDPVMLNKALRRPNLQTTTLDEILPELG FT KARIFSTLDTKKGFWHVVLNEESSKLTTFWTPFGRYRWTRLPFGIASAPEI FT FQLKLQQVIQDLEGVECIADDLLIYGIGDTMEEALKNHNRCMTMLFHRLKE FT SNVKLNRAKLNICQTAVRFYGHVLTNRGLRPDESKLLTIKNYPTPGNRKEV FT HRFVGMVNYLSRFIRNLSVNLTNLRKLIVETVPWKWTSAEQNEFDQVKALV FT SNIDTLRYYDVTKPLTIECDASCFGLGVAVYQCDGIIGYASRTLTSTERNY FT AQIEKELLAIVFACTRFDQLIVGNPKTTIRTDHKPLLNVFHKPLLTAPKRL FT QHMLLKLQRYNLDLEFVTGKNNVVADALSRAPVANEEGSSTYEKRHVYEVF FT EEMSQVKLSSFLSVSDSRLAEIVEHTGRDSNMQTIISYIQHGWPNTVDRVS FT DGAKIFFAHRHELSTQDGLVFRGDRIVVPHSLRRKLVDSCHASHNGVEATL FT KLARSNLFWPGMSSQVKDVVKQCSVCAKFAASQANPPMRSHPIPVYPFQMV FT SLDVFFSDYKGLKRKFLITVDHYSDFFEVNILKDLTPESVIAVCKENFARH FT GIPQVVLSDNGTNFVNHKMIQFASDWGFQQTTSAPHHQQSNGKSEAAVKIA FT KRLLKKAEESGSEFWYALLHWRNIPNKIGSSPVARLFARSTRCGIPTSANR FT LLPKVVENVPSAIENQRKHNKKHYDRKVRYLPELKVGSPVYIQLEPASSKT FT WTPGTVCNRLSERSYLVDVEGNKYRRSRVHVKPREEQIPSPSSEGSSGEIQ FT NKGQGDNPSTGEDAGVTFQNIPFDESHAVINSDDFLAVSSTPPRIIPSTGY FT ERPRRDIRRPARLVDYQLE" XX SQ Sequence 4479 BP; 1361 A; 960 C; 1091 G; 1067 T; 0 other; tggtgtcaga agtcggcgcg agtttcccgt agtttatttc cggcgtcatt tgtgtacatc 60 gcggaaaatc ggtctcgaaa gtgaatcggg cggcgacgcg acagcggcca ttttgttttc 120 ggtgaaaaaa cgaattggta gatcctcatc gacatggata ccaatcagtt taagatgttt 180 ttggagcacc aaacgaacat tttccaccaa ttgatgcaat cagtgaacgc aacagctacg 240 tcggtggctg cggcgacaac atcatcgtta aaacaacagc agcatcaaga aaatcatcag 300 cgtccatgtg ctgcaaatgt tccagttccg cagccatctc cactagcact ggagggggac 360 atggaggaaa attttaattt tttcgtcaag agctggaacg actattctca agcaatcggc 420 atgaacgagt ggccacccgg tgacaatgct aaaaaagtga gttttttgct ggccattatc 480 ggggaacccg ctagaaagaa atacaataat ttcgaactca ccgccgagga atcagcaact 540 acggaattgg cattaaaggc catcaaaacc aaggttgttg ctaaacgcaa cgttatcgtt 600 gacagactgg attttttctc agcggttcaa gcatctcgag aaacagtcga tgatttctgc 660 gcccgcctca ggaatttggc aagaatttcc aaactgggcg atctcgaatc ggaactcatg 720 acgtacaagc ttgtcactgc gaataaatgg tcaaatttgc gtacgaaaat gttgacgatg 780 tccgatatta cattggaaaa agcagtagac atttgtcgtg ctgaggaaat cacggagaaa 840 aggtctcagg agttaaacta cacggttccg gagtcggatg tcaataaaat cacaaaaggt 900 aaacctcgaa caaagcattc gaagcctcca aggtgtaagt tctgcggaga tgcccatgag 960 ttcacaaaag gaacttgtcc ggcttttgga aaacgctgtc ggaaatgtaa tggtaaaaat 1020 cattttgaaa gggtgtgtca agcagcgatg aaacgcagga gtcgttggtc ccgaaaagtt 1080 cgagaagtga gagaagaggg cagttcggat gaggaaacga ctagcgcaga tgaatcgaca 1140 gaggaaagcg aacaggaata tgaaataagc aagattttcg acaattcgga cagcggtggt 1200 ggcgtaactg cagagctgga aatgaatttt accaaaacgt ggaaatcagt actctgcgat 1260 ctagacacag gagccaacac cagcttgatt ggtttcaacc gtttattgga gcttacagga 1320 caacgggaac caatgttaca gccatcgaat ctaaaattgc agagcttcgg tggcaatccg 1380 attaatgttt tgggacaagt gaaagtgagg tgtcgtcgtt tgggccggaa gtatcggttg 1440 gtactgcaag tcgtcgatgt agatcaccgt cccctactct ctgccaaagc atcacgtgaa 1500 tttggtttcg tcaagttctg taaaacggtc gagtttgatt cgtcaagacc aaacacaacc 1560 actgatgata agatgttaaa agtgtacaga gttcaagcac aacggatcat taacgaacat 1620 caggaattat tcgaagggta cgggaagttt cctggtgagg tgaatctgga actcgatgaa 1680 agcgttcaac caataataca gccacctcga agagtcccga tcgcattacg tgaagatctc 1740 aagaaagaac tggaagctct ggagcaaagt ggcatcatcg tcaaggaaca aaatcatacc 1800 agctgggtca gtaatctggt cctggtgcaa agaggtggac caaactcagg cattcgaatt 1860 tgtttggatc cagtgatgct gaacaaggca ctgcgtagac caaatctgca aacaacaacg 1920 ttggatgaaa tcctcccaga gcttggaaag gctcgtatat tttctacgct agacacgaag 1980 aaggggttct ggcatgtggt tttaaacgag gagagcagta aacttaccac gttttggaca 2040 ccctttggaa ggtaccgctg gactcgacta ccattcggaa tcgcatcagc accagaaatc 2100 tttcagttaa agctacaaca ggtaatacaa gatttggaag gtgttgaatg cattgcggac 2160 gacttactca tatatggaat aggagataca atggaagaag ccttgaaaaa tcacaatcga 2220 tgcatgacga tgctgtttca tcgactgaag gaaagcaatg tgaaactgaa tcgtgcgaag 2280 ttgaacatct gccagacggc ggtaagattt tatggacatg tgctgaccaa tcgaggacta 2340 cgaccagatg aatcaaagct gcttaccata aaaaattatc caactccagg caacaggaag 2400 gaagttcata ggttcgttgg aatggttaac tatcttagcc gcttcatacg aaacctgagc 2460 gttaacctta ccaaccttcg caagcttatt gtggaaaccg taccatggaa atggacttct 2520 gcggaacaga acgagttcga ccaagtaaaa gcccttgtat ccaacatcga cacgctgcga 2580 tattacgacg ttaccaaacc tttgactatc gagtgcgacg ccagctgctt tggtttgggc 2640 gtggcagtct accaatgcga tggaatcatc ggttacgctt caaggacatt aacttctact 2700 gagcgaaatt atgctcaaat agaaaaggag ttgttagcaa ttgtcttcgc ttgtaccaga 2760 ttcgatcaat tgatcgtcgg aaatcccaaa actacaataa gaaccgacca caaaccgctc 2820 ctgaatgtat tccataaacc cctgttgact gcaccgaaac gcctgcaaca tatgctgttg 2880 aaacttcaaa gatataattt ggatctggaa tttgtgacgg gcaaaaataa tgtggtagca 2940 gacgcgctat cacgtgctcc ggtcgccaac gaagagggtt cgagcaccta cgaaaaacgc 3000 cacgtctacg aagtgttcga agaaatgtcc caagtgaagc tcagcagttt tctcagcgta 3060 tcagattcga ggttggcaga aattgtcgaa cacactggga gagattcgaa tatgcagact 3120 atcatcagct acatccagca tggttggcct aacacggtgg atcgagtatc agacggcgct 3180 aaaatattct tcgctcatcg ccacgaattg tctacccaag acggcttagt gttccgtggt 3240 gatcgtattg tagtgccaca ttccttgaga agaaaacttg ttgatagctg ccacgcaagc 3300 cataatggag tagaggccac gttgaagcta gctcgatcca atttgttttg gccgggtatg 3360 agttcgcaag tgaaggacgt cgtaaagcaa tgctcagttt gtgcgaagtt cgctgcatca 3420 caagcaaacc caccaatgag aagtcatccg atcccggtat acccatttca aatggtatcg 3480 ttggacgtat ttttttccga ctataaagga ttgaagcgta agtttttgat aacggtggac 3540 cattattcgg actttttcga ggtgaacatt ttgaaagatt taacgccaga gtccgttatt 3600 gctgtctgca aagagaattt tgcacgccac ggcattccac aggtggtact atccgacaac 3660 gggactaatt ttgtcaacca caaaatgatt cagtttgctt cggattgggg attccaacaa 3720 acaacatcag cgccacacca tcagcaatcg aacggtaaat cggaggctgc ggttaagatt 3780 gcaaaacgtt tgttgaaaaa ggctgaagaa agcggatcgg aattctggta tgccttgctc 3840 cactggcgca acattccaaa caaaatcggc tcaagtcctg tggcgcgttt gtttgcccgt 3900 tcaacaagat gtggaatacc gacgagtgcg aaccgattac ttccaaaagt ggttgaaaat 3960 gttccatctg caatagagaa tcaacgtaaa cacaacaaga agcattacga cagaaaagtt 4020 cgctacttac ctgagctgaa agttggttcg ccagtataca tccaattgga accggcgtca 4080 tccaagacgt ggacaccagg aaccgtctgt aaccgattga gtgaacgttc ctaccttgtg 4140 gacgtggaag gaaataagta ccgacgaagc cgtgtacatg tgaagccacg cgaagagcaa 4200 attccatcgc ccagctcgga gggatcgtca ggtgaaattc aaaacaaggg acaaggtgat 4260 aatccatcga caggtgaaga cgctggagtg acatttcaaa atatcccatt tgatgaatcg 4320 catgctgtga tcaattcaga cgactttctt gcggtttcat caacaccgcc gaggataata 4380 ccaagcactg gatatgaacg cccgagacga gacattagac ggccggcgcg tttggtggac 4440 tatcagctgg aatagatctt cttcttattg tggggagga 4479 // ID Gypsy-92_AA-LTR repbase; DNA; INV; 235 BP. XX AC supercont1.321; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-92_AA_; KW Gypsy-92_AA-I; Gypsy-92_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-235 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.321; Positions 720061 719827. XX SQ Sequence 235 BP; 70 A; 57 C; 32 G; 76 T; 0 other; tgttatatat tttccgtgtc catatcaaac cattacctta attactactc tattacaata 60 accttgaagt taacttgcct tgctcccccc aactataaca accttgttca ttcgccattg 120 taattagaag aaaagtacac gtgaatacat tataaaacta tcggaaagta ttccggagtg 180 ttttactgtt tcccggttcc gtaaaatacc ctcgcgagtg ctctcggtca caaca 235 // ID BEL-114_AA-I repbase; DNA; INV; 4235 BP. XX AC AAGE02020048; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-114_AA_; KW BEL-114_AA-LTR; BEL-114_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4235 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02020048; Positions 115278 119512. XX CC 'AGATG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 863..2875 FT /product="BEL-114_AA-I_1p" FT /translation="MPIEHTPKKQPVDPEELKMLIHQRGQIKGKVTKINRS FT VEEAEDDPTKLSISLLKVFSKKLEMHYNEYISIHREVIAAIPPSKIEEQDE FT KLDEFDQLHTETLERIEQLMESFTKPVDAQHQANADGSAPQVIVQQQPLRA FT PIPTFDGNTENWPKFKAMFEDLVGRSRDSDAIKLHHLDKALVGDAADLINA FT KMIQDNDFRQVWKQITDQFENPRVIVDTHIDGLIQLKPAAKGSLKDLVSLT FT KACDRHVAGLEYQGLVIDKLSGLIITKLVVNCLDNHTRQLWERTQKHGELP FT DYNQTMLFLKNECQILDRCNNSRVAGSTKEPSTKSSNPKPSLKSHISTSVN FT GSNNCLICGEDHRHFECSKFHALNMSERIAKVKELKICFNCLRPGHRVADC FT ASKKTCSTCRKKHHSLLHEESFAKAQPESKPQVHKPAPPKTDSSMVASTSG FT SGSTAVPEQQLPVSSTCSCNHSQSLKTVMLLTAVVMLESNGELIPCRVLLD FT SGSQVNFLAERMADRLKVPREAMSVPITGVGGSKMYAREKLSVTVQSTYSA FT FTSSVECLVVPKVTGIIPGTKIDVSSWPIPTGFHLADPEFNVPKGIDMLIG FT ASKFFSLLKSGQLHLADGLPELQETHFGWAFSGEISDVDSNELLSHAASLA FT SLDIPIIHSREVENFPDSTSLT" FT CDS 2886..3899 FT /product="BEL-114_AA-I_2p" FT /translation="MYQCVKSSQQTHQHLPSGRFSIHLPYRQELSDLQWHL FT SKNLDFKPQCTKFIDESDALIHSCEDNALEDKYQSTGPPVKSLNDGYRVGK FT PVESELFSGLRFRKHSVLVTADIEERYRQNLVVTLHPKCHCILSGCNLSWL FT QFRNASAQVSEVTILRYDFSTSTIAVKIHRCADFSAYGAVICVRIILSDSS FT DRLHILCRNSRVPFKRLLAARFPSKFNDSYVQRFIEACSLKSSTLQIRFIN FT LEVIIKVVQYEIQRVQNEKEYKRIVDLNPLCLVGVLKVIERLKPSPLLFNS FT KYPYISPKHPIVYRDYPIEEVHEGSENLLVALPWIFYPFTVVPKHN" XX SQ Sequence 4235 BP; 1213 A; 948 C; 915 G; 1159 T; 0 other; tggtccttcg agccggatcg ccaccaacga acgccggatg gtagcctttg gtctggatgc 60 ttggagaaca gtgtcgaaaa gtgagaagtg aatacgaaaa gtacagtccg cgaaaagttt 120 gagtgaaaat ccgtcgttgt cggttgagaa gaaattcagc cggtgtgttg ttcgcgtcat 180 cgtcggctgt attctgccga gccgattgca agctgaaaca tccagccaac ccgtgacatc 240 attactgcaa acttccggta gcaaaacgtg aaaaagttgt gaagaagaaa aacttccgtg 300 atgagtaagt aagccgaact gcaagtgtga aaaactcgac taacgttaca gcagccggcg 360 agtgttatgg aaaatactag aacagtgaac attctgggtt gatagtccac catttcctag 420 ttttgaaaaa gccagttgtt ttttgtgata cgttgtgcct caccgtgctc tagttagtga 480 tctgtactgc ccccgtttcc gaatcaaagt tcagtaaacg cagccagaaa actttacgat 540 tttgaatcaa tgtttagtga acacagccag caaagtactg tacaaggagc tacggtgagt 600 ttatccaagc cttagtttga aaagtccgcc attttgttga taaactagtc gttatccccg 660 tcgatatccc tgtcgttccc cgttgtcctc cgcattcgtc ttcttattca tcgtcgttcc 720 ccgttgtcca ctgctgtgaa aagtttgtgc gactggagta tccacgtcgt ttgtgccgga 780 agagagtatt tgtgctaccg gaagtgagaa ttttggaaga aagtgttgtg aaagactaaa 840 aagactaaag tgaagtgtaa aaatgcctat cgaacataca ccaaagaagc agcccgtgga 900 tccggaagaa ctgaaaatgt tgatccacca aagagggcaa atcaaaggca aagtgacaaa 960 aataaacaga agtgtagaag aagcagaaga tgatccaaca aaattgagca tatcattgct 1020 aaaagtgttt tccaaaaagc tcgaaatgca ttacaacgag tacatttcga tacatcgcga 1080 agtgattgca gccattccgc cgtcgaagat agaagagcaa gacgagaagc tggatgaatt 1140 cgatcagctt cacaccgaaa ctctcgaacg aattgaacaa ctgatggagt cgttcaccaa 1200 acctgttgat gcacaacacc aagccaatgc tgacggatca gcacctcaag tcatcgtgca 1260 gcaacaaccc ctgagagcac caatcccaac cttcgacggc aatacggaga attggcctaa 1320 gtttaaggcc atgtttgaag acttagtcgg acgcagccgc gattccgatg ccatcaagct 1380 gcaccacctg gacaaagcct tggttggtga tgcagctgac ttgatcaacg ccaaaatgat 1440 ccaggataat gatttccgac aagtttggaa acaaatcacc gatcagttcg agaatccaag 1500 ggtcatcgtg gacacccaca tcgatggtct catccaactg aagccagcag cgaaaggaag 1560 ccttaaggat cttgtgtcgc tgaccaaggc ctgcgatcgc catgtcgcag gattggagta 1620 tcagggtctg gtcatcgaca aattatccgg actcatcatc acgaagctcg tcgtcaactg 1680 cctagacaac cacactcggc agctttggga gagaacgcag aaacacggtg agttacctga 1740 ctataatcaa actatgctat ttttgaaaaa cgaatgccaa attcttgatc gttgtaataa 1800 ttctcgtgtc gcgggttcaa ctaaggagcc aagcaccaaa tcatctaacc ccaaaccatc 1860 cctcaaatct catatttcca catctgtaaa tggttcaaat aattgtttaa tttgtgggga 1920 ggatcaccgc cattttgaat gttcaaaatt tcacgctttg aatatgtcgg aacgtattgc 1980 caaggttaag gagttgaaaa tttgtttcaa ttgccttcga ccgggtcacc gagttgctga 2040 ctgtgcctcc aagaaaactt gttcaacctg tcgtaaaaag caccattccc tgctacatga 2100 agagtctttt gcaaaagctc aaccggaatc gaaacctcaa gttcataaac ccgcacctcc 2160 gaagacagat tcctccatgg tagcttccac ttctggttcc ggtagtaccg ccgtccctga 2220 gcaacaacta cctgtgagtt ccacgtgttc ttgtaaccat tctcagagtt tgaaaactgt 2280 gatgttgctg acagctgttg taatgcttga gagtaatggt gaattgattc cttgccgtgt 2340 acttctggat agcggttccc aagttaattt cctcgccgag cgaatggctg accgattgaa 2400 agttcctcga gaagccatga gtgtccctat tacaggagtc gggggttcaa agatgtacgc 2460 tagagaaaag ttatcggtta cagtacagtc aacatattcc gctttcacct caagtgtcga 2520 gtgtttagta gttccgaaag taactggaat catcccaggg acgaagatcg acgtttcttc 2580 ttggccgatc cccactggat tccaccttgc agatcctgag tttaatgttc ccaaaggtat 2640 cgatatgttg attggtgcct ctaagttttt cagtctactt aagtcagggc agcttcacct 2700 agctgatggt ctaccggagc ttcaagaaac ccactttggt tgggctttct ccggtgagat 2760 tagtgatgtt gacagcaatg agttgctgtc tcatgcagct tcactcgctt ctttggatat 2820 acccataatc cattcacggg aagtggaaaa tttccctgac tcaacttcgt tgacttgacg 2880 tatcaatgta tcaatgtgta aagtcgtccc agcaaactca ccaacatttg ccttctggtc 2940 gattctcaat ccacctaccc tatcgacaag agttgtctga cttgcagtgg cacctcagta 3000 aaaatctgga ttttaagcca caatgtacca aattcattga cgaatctgat gcattgattc 3060 acagttgtga ggacaatgct cttgaagaca aatatcaatc gactggtccg cctgtaaagt 3120 ctctgaatga tggttatcgc gtagggaagc ctgtcgaaag cgagttgttc agcggccttc 3180 gttttcgtaa acactctgtg ctagtcactg ccgatattga ggaaaggtat cggcagaatc 3240 ttgttgtaac tctccacccc aaatgccatt gcatactatc tggatgtaat ctatcgtggt 3300 tgcaattccg caacgcgtca gcgcaagtta gtgaggttac tattctacgt tatgattttt 3360 caacctcgac aattgctgtg aaaatccacc gttgcgcaga cttctctgct tacggtgcag 3420 tgatttgcgt tcgtattatt ctttccgata gttcagatcg tctccacatt ctttgtagaa 3480 attcgagagt acccttcaaa aggctcctgg ctgccagatt cccatctaaa ttcaatgatt 3540 cgtacgtgca acgattcata gaagcttgca gtctgaagag ttcaaccctt caaattcgat 3600 ttatcaacct agaggtcatc attaaagtcg tccagtatga aatccagcga gtccaaaatg 3660 aaaaagaata taaacgtatt gtagacctaa atcccttgtg tctcgttgga gttctcaaag 3720 tcatagagag gcttaaacca tcccctttgt tgttcaattc caaatatccg tatatttcac 3780 cgaaacatcc tattgtgtat cgcgattatc cgattgaaga agtccatgaa ggttcagaaa 3840 atctgcttgt cgcattgcct tggatattct acccattcac cgtcgttcct aaacacaatt 3900 gagaaggttt tcaattgaac tgtcttaatc attggcaatg tgttcaatgt ttgcgtgatg 3960 attttttcat taaatggcct cgtttttgtt tgcaaacagt atacccttcc aataaaaatc 4020 gtattgaatt acaccaagta taattgttta gcatgagagt aaataaaata ccagtactta 4080 tgtggaaatt tgaaacaata aataaccaaa cctatcctgg gaacagataa tttggtgtat 4140 accgtcgagt gcaggtagtt tattagcgta taattgcatt aatatggcaa gtttgattga 4200 acagtcagag aattcatctc agcccggcgg gagta 4235 // ID Gypsy-40_DPu-I repbase; DNA; INV; 10213 BP. XX AC ACJG01004461; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the common water flea: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_DPu_; KW Gypsy-40_DPu-LTR; Gypsy-40_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-10213 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the common water flea."; RL Direct Submission to RU (08-FEB-2011). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; ACJG01004461; Positions 21474 11262. XX CC Positions [4438-4941] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 41..2014 FT /product="Gypsy-40_DPu-I_2p" FT /translation="MHKIPKFFRKAKTQEGDSVVSPTKQILSPEELDKLKN FT KIKTEISTAEIRIQYILNKTTRPNERDPKEIENLENIIKNLRAKYAELIDP FT TVTIYKTKKRAVEKTDQAASQGPDKPNQTNAAPARPGTVTLLDSNDEAEEG FT ATARPTRVEERKTECAKCHTPIVLSITTDKEQCRQCNVHRDTGDVEEQKPR FT TTQPLKEDRPKENIARVETKEEEVQPQIPRQVHLPLMQHQAQQQQQVARYR FT AQARNIFDDSDDDDEMALTDVAINAWQAAVERQAEILETGLVFINAQSVKK FT EIPTFTGEIEGKMAIEEWFKLAERMATHAAWTDEQKLHFFQERMSKSAANF FT NDSLPNADRVTYAVWKQAILNGLADNTTKARKKEQLKTLKQEEKERVRDFK FT TRIDDTYRIAYGVNAATSNHADVVALRDETKKDVLLNGLKTQIADLVWNRP FT EINDATYVETVELAEECEKVVEIKKIAKNKDLSSAVTVMSEEAEKTKEEIN FT NLKGLIQKLMTTPIAQPAVQLEEKTINALNRLAISNTDGKWPSQNRERRTV FT RFASESPNRSRSQTPENRRRYNTPTPYRGDHEGSNWGDQKQYESRRCYVCD FT KKGHLARQCWRAKQSPFAQQNARQNGQQTYRRNQPGMRTNQPRNRNFNNFR FT NGTGRNGNN" FT CDS 1819..5487 FT /product="Gypsy-40_DPu-I_1p" FT /translation="MLRMRQERSSSKTVLEGKAKPIRTTKCPPKRTTNVPT FT KSTWHENKSAKEQKFQQFPKWNWAERQQLKKLRRKRPNSLSYKLFHVEISI FT DNVVTQALVDTGASVSAISEYFFSRLAKDIKKNKIESEDEKLHSICGKSMT FT IAGIYDLPIKLDANKDEIYQQFYVIPNLTETCVLGMDFITQNAVTYDGKTR FT KLTYTINNKTFTIDDEVKRLTHYKITLAGVKVAVKEPTAATINIEDPNLEI FT HRDKITKLIDKYKDMTAENLYELGKAEGSKHSIPTTGQIVYQRPRRQARAL FT QGVIQKEVKEMLDCGIISPSTSPYNTPIHLAKKKGGGYRFCMDFRQLNSTT FT TKDKFPLPRIDETIDYLYGSKFFSTLDLISGYWQIEIDEKDRPKTAFSTED FT GHYEYNRMPFGLTNAPSTFQRLMNQILQPVIKKFALVYLDDVIIYSTSIEQ FT HIKHIDIVLDLLKKAGLKIKLSKCTFLQTSVKYLGHVISEKGISPDPMNIK FT AIENYPTPKTVSQLKSFLGLAGYYRKFIKNFADRAHALTILTKKNNPWKWK FT EEEEEAFQFLKKCLIHPPILRYPNFSREFLIHTDASGYGVGSVLSQKHIED FT GEEKEVVIAYASRHLNDVEKNWSTIEKEAYAIVHAVKQFYPYLYGRKFQVL FT SDHKPLRELLRKKDTSAKLARWAFCLQDYDIDIEYRSGKTNQNADCLSRIP FT EPDNNTTAQPEQTPVINALTTINFAEEQEKDKYCRRAREKYKKMLKMKEEL FT INESETEDDNMSIRDNAIDDSPQDSDQGYLTDEDNVYNDEEEVVELENGLL FT GTSAGNLVPESLKEKIFKRFHDSPYAGHLGIKKTTARIQRRFKWHKMGKDI FT KEYVKGCEICAKRKAVGANKAPLNPIPPPKDVWQTMAMDIMGPLVESGRER FT NQYILVMGEYLTRYIITAPMPDQTAETVARTFVNNVVLIHGVPETVLTDQG FT TNFQSELMNIMYKQYGITRLKTTAYRPQCDGMVERVNRTLADIIASYVSKE FT PTTWSDFLPSATFAYNTAVHSSTGYTPFYLMYGREATEPQDMIKPVRNRNM FT TDVNMIFSQMWYDALDITKEKLEEAKEKQKAYYDRNTKRIEFKIGDKILLK FT EMANTPGKFNMRWEGPYTVKERKGNVNYKIYADNGKKMMIVHADRMKHFHK FT RKSPEETAEEITQNKSIAEKPEEVKQNEKKRKTKENPTKEPNLSEEPRYSL FT RKTIRMPDRFRPN" FT CDS 5508..7544 FT /product="Gypsy-40_DPu-I_4p" FT /translation="MAKFISIIILIIIIVYITPSSSLNITVCDCKKPKFIG FT IMDTEKPEYCRNKKKDDPILADYQFFVKEEPHMSWEGYVCKAWLKRKTIDG FT FFFGGFDTVFTAEIQHLSENDCWRMKQTKRCGENEMIGDENQFSFTSTPNG FT EGVWMQKVSFTSNNCMVQKISLKKDCATCPTTSPFGILKEKPRETFVYHQD FT SVIVWIQPKRNLEEECKIKIKRKSTGTITKIDDNTYKLVDAPGQLDYIYTA FT ETQNICDFTLHKLKNVDGAYLQVQNKNWSHIYNVETKYCLNPQIRTPSKCS FT NLTGGEFKIVDNKLNIIPKNNSDKKICIAMSPFFHHMDECSNLYSLKWDPD FT SLEITGNTRCLQMNDNSSVFMNICSGENHQKWIFGMTEMNITESVDTKESL FT LLQHHQYVEDQSVTNENIIEDELKRIYCNNLQMARQATSLISESSGLLAAR FT ANNLPMCHRLKPMGEAFLVQKCSAINITIGAKQTACGYEPFYDNQTIGKDG FT FSLHPFSECFWEDGIVNFNGKTFMWKKELNDWAYVHPNIHQSTLRLTAKFT FT ELNDNEANYQLNHHEFYQKQEYEKINSINDLITRIQVTDASPSVMMNQEAE FT SKFGKFELWHRIRKGLIAVISILFFIILCMLTIIVFIKVRTLRSKQRFEHY FT AEQLQENRKLIRAESLERKRAETNNDESLV" XX SQ Sequence 10213 BP; 4026 A; 2105 C; 1956 G; 2126 T; 0 other; tttggtgaac gtgccaatcc gaacaacaca aatgtaagta atgcacaaaa taccaaaatt 60 tttccgtaag gctaagacac aagaaggcga ttccgtagtt tccccaacta aacaaatatt 120 gtctccggaa gaattagaca aactaaaaaa taaaattaaa acagaaatat caacagcaga 180 aatcagaatc caatatatac taaacaaaac tacccgacca aatgaaagag accccaaaga 240 gatagaaaat ctagagaaca tcataaagaa tctgagagct aaatacgcag aactcataga 300 cccaacggtt accatatata aaacaaagaa acgagcagtt gaaaaaacag accaagcagc 360 atcacaaggc cctgacaagc ctaatcaaac taacgccgcc ccggcaaggc caggtacagt 420 gaccttactc gattcaaacg acgaggctga agaaggagca acagcccgac caactcgagt 480 agaggaaaga aaaacagagt gcgcaaagtg tcacaccccc atcgtcctat cgataacaac 540 tgacaaagaa cagtgtaggc agtgtaacgt acacagagac acaggagacg tggaagaaca 600 aaaaccgaga acaacccaac ccctgaagga agaccggcca aaagaaaata tagcaagagt 660 cgaaactaaa gaagaagaag tacaaccaca aatccccagg caagtgcacc tgccgctcat 720 gcaacatcaa gcacaacaac agcaacaggt agccagatac agagcacaag ccagaaatat 780 ttttgacgac agcgacgacg acgacgaaat ggccctcaca gatgtagcta taaacgcttg 840 gcaagcagca gtagagaggc aagctgaaat tctcgaaacc ggactagtat ttataaatgc 900 gcaaagcgta aagaaggaga taccaacatt caccggagaa atcgaaggaa aaatggcaat 960 cgaggaatgg ttcaaactag ccgaacgtat ggcgacacat gcagcatgga cggacgagca 1020 aaagctacat tttttccaag aaagaatgag caaatcagct gctaacttca acgattccct 1080 accaaacgcc gacagagtta cttatgcagt atggaaacaa gcaatactaa acggcctagc 1140 ggacaatacg acaaaagcaa gaaagaaaga gcaactaaag accctcaaac aagaagaaaa 1200 agaacgagtc agagatttta aaaccagaat agacgacacc tacagaattg catacggcgt 1260 aaacgcagca acaagtaatc acgcagacgt ggtagcactt cgtgatgaaa caaagaaaga 1320 cgttttgcta aacggcctca aaacacaaat agcagacctc gtctggaata ggcccgaaat 1380 aaacgacgca acatacgtag aaactgtaga actggccgaa gaatgcgaaa aagttgtaga 1440 aatcaagaaa atagcaaaga ataaagattt gtcttctgcc gttacagtaa tgtcagaaga 1500 agccgagaaa actaaagaag aaataaacaa cctaaaagga ctgatccaaa agctgatgac 1560 aacaccgatt gctcaaccgg ccgttcaact agaagaaaaa actataaacg cactaaatag 1620 attggcaatc tcaaatacag acggcaaatg gcccagccaa aaccgtgaaa gaagaactgt 1680 tcgcttcgca agtgaatcac ccaacagaag tagaagccag acgcccgaaa atagacgaag 1740 atacaacacg ccaactccgt acagaggaga ccacgaagga tcaaattggg gagatcagaa 1800 gcaatacgaa tctagaagat gttacgtatg cgacaagaaa ggtcatctag caagacagtg 1860 ctggagggca aagcaaagcc cattcgcaca acaaaatgcc cgccaaaacg gacaacaaac 1920 gtaccgacga aatcaacctg gcatgagaac aaatcagcca aggaacagaa atttcaacaa 1980 tttccgaaat ggaactgggc ggaacggcaa caactaaaaa agctaagaag aaaacgccca 2040 aatagcctta gttataaact ttttcatgta gaaatcagca tagataatgt agtaacacaa 2100 gctttagtag acactggcgc atcagtgagc gcaatttcag aatatttctt ttcaagatta 2160 gccaaggata taaaaaagaa taaaattgaa tcggaggacg aaaaactcca tagtatttgt 2220 ggaaaaagca tgaccatagc aggaatatat gacctaccga taaagctaga cgcgaataag 2280 gatgagattt atcaacagtt ctatgttata cccaatctaa cagaaacttg cgtgttagga 2340 atggatttta taacacaaaa tgctgtcacg tatgatggaa aaacaagaaa gttaacctac 2400 acaattaaca acaaaacgtt tacaatagac gatgaagtaa aaagattaac acattataaa 2460 ataacattag ccggagtaaa agttgcagta aaagaaccga cagcagcaac aattaatata 2520 gaagacccaa acttagaaat ccatagggat aagataacga agctaataga taaatataaa 2580 gatatgacag cagaaaattt atacgaacta ggaaaagcag aaggaagtaa gcattcaatc 2640 ccaacaacag gacaaatcgt ctaccaaaga cctagacgac aagcaagagc tttacaaggc 2700 gttattcaaa aagaagtaaa agaaatgctc gactgcggga ttataagccc aagtacaagc 2760 ccatacaaca cacctatcca cctagctaaa aagaaaggag gaggataccg tttttgtatg 2820 gatttccgtc agttaaattc aacaacaacg aaagacaaat tcccccttcc aagaatcgat 2880 gaaacaatcg actacctgta cggatcaaaa tttttctcga cactcgacct gatcagcggg 2940 tattggcaaa ttgagataga cgaaaaggat aggccgaaaa ccgctttctc gacggaagac 3000 ggacactatg aatataatag aatgcccttt ggtctgacga atgccccttc tacatttcag 3060 cgtttgatga atcaaatttt acaaccagta attaaaaaat tcgcactagt ttacttggac 3120 gacgttatca tatactccac ttctatagag cagcatataa agcacataga tattgtttta 3180 gatttactaa agaaggcagg actaaaaatt aaattgtcaa aatgtacgtt cttgcaaacg 3240 tcagttaaat acctgggaca cgtaatctca gaaaaaggaa tctctcccga tcccatgaat 3300 ataaaagcaa tcgaaaatta cccaactccc aaaacagtta gccaattaaa aagttttctt 3360 ggtctagccg gatattatag gaaatttatt aagaattttg ctgatagagc ccatgcacta 3420 acaattttaa caaagaaaaa taacccatgg aaatggaagg aagaagaaga agaagctttc 3480 caatttttaa agaagtgtct gatccatcca ccaatcctac gctatccgaa cttttcgcga 3540 gagtttctta ttcatacaga tgcgtctgga tatggtgtag gatcagtttt aagccaaaaa 3600 catatagaag atggcgaaga gaaagaagtc gtcatcgctt acgcttcacg acatctgaat 3660 gatgttgaga aaaactggag taccattgag aaagaagcgt acgcaatagt gcatgcggtg 3720 aaacaatttt acccgtacct atacggaagg aaatttcaag tattaagtga tcacaaacca 3780 ttgagagaat tattaaggaa gaaagatact tcagcaaaac tagctagatg ggctttttgc 3840 ctgcaagatt atgatattga tatcgaatat cgatctggaa aaacaaacca aaatgcagac 3900 tgcctcagta gaattccaga accagataac aatactacgg cgcaacctga gcaaacgcca 3960 gtgataaatg cgctgacaac cataaacttc gcagaagaac aggaaaaaga taaatactgc 4020 aggcgcgcgc gcgaaaaata taagaaaatg ttgaaaatga aagaagaatt aattaatgaa 4080 tcagaaacag aagatgataa tatgtcgatt agggataatg caatcgacga cagcccccaa 4140 gacagtgatc aaggatatct tacagatgaa gacaacgtat acaacgatga agaagaagta 4200 gttgaactag aaaatggcct actaggtaca tctgcaggaa atctggtccc ggaatcctta 4260 aaagagaaaa tttttaaaag attccatgac agtccatacg caggacacct tggaataaag 4320 aagacgacag caaggattca acgaagattt aaatggcaca aaatgggaaa agatataaaa 4380 gaatatgtta aaggttgcga aatttgcgct aaacgaaaag cagtaggagc aaataaggca 4440 ccactaaacc cgattcctcc cccaaaagac gtatggcaga cgatggccat ggacatcatg 4500 ggaccgttag tagaatcagg aagagaaaga aaccaatata ttctagtaat gggagaatat 4560 ctgacacggt acattatcac agcaccgatg ccagatcaaa cagcggagac agtagcaaga 4620 acgtttgtaa ataatgtagt attaattcat ggagtaccag aaacggtatt aaccgatcaa 4680 ggtacgaatt ttcaatcaga gctaatgaat attatgtata aacaatatgg aatcactcga 4740 ctaaaaacaa ccgcgtatag accccaatgc gatgggatgg ttgaaagagt gaacagaaca 4800 ttagcggata taatagcaag ttacgtatcg aaggagccaa ccacatggtc cgatttttta 4860 ccatcagcaa catttgcata taacacagca gtccactcaa gtacaggata tactcccttc 4920 tatctcatgt atggcagaga agcaaccgag ccccaagata tgattaaacc agtccgaaac 4980 cggaacatga ctgatgtaaa tatgatcttc tcacaaatgt ggtacgatgc actcgatatc 5040 actaaagaaa agctagagga agctaaggaa aagcaaaaag cctactacga cagaaatacg 5100 aagagaatag aatttaaaat aggagacaaa atcctattaa aagaaatggc taatacgcca 5160 ggaaaattca acatgagatg ggaaggcccc tatactgtga aagaaagaaa agggaatgtt 5220 aactataaaa tatatgcaga caacgggaaa aagatgatga tagtacatgc ggataggatg 5280 aaacattttc acaaaagaaa gtcaccggaa gaaacggcag aagagataac acaaaacaaa 5340 tcgatagcag aaaagccaga agaagtaaaa caaaacgaaa agaaaagaaa aacaaaggaa 5400 aatcccacta aagagcctaa cctgtcagag gagcccagat actctttaag aaaaacaatc 5460 cgcatgccag atcgtttcag accaaactaa tcaagaaatc attacagatg gcaaaattca 5520 tttcaatcat catcctgatc atcatcatcg tctacatcac cccatcttcg tctttaaata 5580 tcacagtatg cgactgcaag aaaccaaaat tcattggaat aatggacacc gagaaacctg 5640 aatattgtcg aaataagaag aaagacgacc caatcctagc agattatcaa ttcttcgtaa 5700 aagaagaacc acacatgtca tgggaagggt acgtctgtaa agcatggttg aaaagaaaaa 5760 cgatcgacgg attctttttc ggaggttttg atactgtatt taccgcagaa atacagcatc 5820 tctccgaaaa cgactgttgg cgaatgaaac aaacaaaacg atgcggagaa aatgaaatga 5880 ttggagacga aaatcaattt tcattcacgt caaccccaaa cggggaaggc gtttggatgc 5940 aaaaagtatc tttcacatcc aataattgta tggtacaaaa gatatcattg aaaaaagatt 6000 gcgcaacttg ccccactaca tctccattcg gaattctaaa ggaaaaacct agagaaactt 6060 ttgtgtacca tcaggattca gtcattgtat ggatacaacc aaaaagaaat ctagaagaag 6120 aatgtaagat taaaataaaa agaaaaagca caggcacgat cacaaaaata gacgacaaca 6180 cttacaaatt agtagatgcc ccaggacaat tagactacat ttacacggcg gaaacgcaaa 6240 acatctgcga ctttaccctc cacaaactaa aaaacgttga tggcgcatat ctccaagtac 6300 aaaacaaaaa ttggtcccac atttataacg tagaaacaaa atattgcttg aaccctcaaa 6360 ttagaacacc atcaaagtgc agtaacttga ccggaggaga atttaaaatt gtagataaca 6420 aattaaatat aatacccaaa aacaactcag ataaaaagat atgcatcgca atgtcaccat 6480 tctttcatca catggatgaa tgtagtaatt tgtactcact gaaatgggac ccagactcat 6540 tagaaatcac tggaaacaca agatgcctac aaatgaacga taattcaagt gtatttatga 6600 acatttgctc cggagaaaat caccagaaat ggatattcgg aatgacagaa atgaatataa 6660 cagaaagtgt ggacacaaaa gaatcccttc tcctacaaca ccaccaatac gtagaagacc 6720 aatcagtgac aaatgagaat attattgaag atgaattaaa acgcatctac tgcaataacc 6780 tacagatggc tcgtcaagca acttctctca tatcagaatc aagtggattg cttgcagcaa 6840 gagcaaataa cctcccaatg tgccatcgat taaaacctat gggtgaagcg tttctcgtac 6900 aaaagtgcag tgcaataaat atcaccatcg gagcaaaaca aacagcctgt ggatacgaac 6960 cattttacga caatcaaaca atcggcaagg acggattctc attacaccca ttctcagaat 7020 gtttttggga agatggaatc gtaaacttta atggaaaaac atttatgtgg aagaaagaat 7080 taaatgactg ggcgtatgtg cacccaaata ttcaccaatc aactctccga ctaacagcaa 7140 agttcaccga attaaacgac aatgaagcaa attatcagct aaatcatcac gaattttatc 7200 agaagcaaga atatgaaaaa attaattcaa ttaacgattt gatcactcga attcaagtta 7260 ctgatgccag tccatccgta atgatgaacc aagaggcaga atctaaattc gggaaatttg 7320 aattatggca ccgtatccga aaaggactga tagcagtcat ctcgattctg ttctttatta 7380 ttctctgcat gctaacaata attgttttta taaaagttcg aaccctaaga tcaaaacaac 7440 gtttcgaaca ttacgcagag caactgcagg agaacagaaa actgatacgt gcagaaagtc 7500 tggagcggaa acgcgcagag accaacaacg acgaaagtct tgtttaaaaa aaaagaatat 7560 aaccaaacgg acacgcaaac caaattttta ccgaccccca gcgcaccggc cgttcccccc 7620 cataaaacaa agacttaacc tttaaccccc cgcggacaaa taaaaaaaaa aaagggctaa 7680 cacccaggcg acaaatatct ctatataagg atacgaagca ataacggaaa aattatcagt 7740 ccccatcatg caagcaaact cgtcacacgg caaaaatatg gaagtcctca tccaaaataa 7800 agcaaagaac tcaacaaaag aaatggacaa agcagtaaat gaagcgatat ctaacaaaca 7860 agctccggtg agaatcaata aaataaatat gacatatagt cgaattgcaa gcacggcaag 7920 ttggtttgga actagaggaa aatctgttaa ccgtctcatt gaacgagtca aaacagagaa 7980 aaacgtcaat attttcaacc accgaaacga gacacctctg ataattgcaa gccaactggg 8040 agcctatagc ttcgcaagtt taatacttac gcagaaggct tcaattaaca tccaagacga 8100 aaaaggctat tcagcactcc actacgccat cgaagcaaac gcgcacaaat tagtgcttga 8160 attattaaga aatggcgcag acgtaaacgc tcgtacaaaa gaaggaaaca cccccatgca 8220 tattgcagtg caactaaaca acatagaaat tggacgagcg ttaaggaaga atcgatattt 8280 caattcgaca atattaaaca acgaaggaga cagtgcgctg atactagcag tcaaacaaaa 8340 taaaataaaa atgttgcaca tcctattgat atcatcggag aactacaaca aacaggacaa 8400 agaaggaaaa acgccattgc attacgcatg catgaatgct agcccttgca ttttgcacaa 8460 acttctagat aaaacccaag gatcaatcaa agacgtttac ggggacatac ccttgacaat 8520 tgccattaag tacaacaacg aaagagcagt ggacatgctg tggcatagtg tccactttga 8580 tgcatcaaca agagacaatc aacacaatac gctaattcac atggcctgcc gccacgaaaa 8640 cgtgtcactt ttaaaaataa tttcctctcg cacaacagac ttcaatgcga gaaatatgta 8700 tgacgaaaca cctcttcaca tcgcatgctc tgtcaacaat ctagcggcaa ttaaaatgct 8760 aaaagaaaag tacgtggaca gaaacgccct agatgtgaga ggacaaacgc cgttagtcat 8820 aacggcaagt tttaattaca cggaagcagc aaaggaactg ttcgacgaca cagacacacc 8880 agtcgatcct atttttttaa atgacgactt aaattcgact gatgtaatca tcgtaaatgg 8940 cgaacctgtc aacataaaaa cgtgcattct tctggacgcc gcagacgaaa agggaaacac 9000 ggcattgcat tattgttccc tttttggaaa taatgaacta gccaatttcc tcatccacaa 9060 aggcgcaaga ataacattac tcaacaacga acaagactcc ccagtgacca ttctagctaa 9120 actgccatct ggatacaatt gccctatata tacggaggta tatgcacccc agtaagagat 9180 gccgtggaat gcaccagaat tggacactaa gcaattattt cccgatgaag gactaagtga 9240 cccagaaact gacctaatgt ttctttaagt tcaaatatca aatggtcaaa agaataaaat 9300 aatgaaaata aaagtgagta aaagaaaaga aaagaaaagt aaagttcagt cttgcatcaa 9360 ccctcttctc tacaaatacc aaaaatgtac ggtcgaggac gcggacacca attgttaaac 9420 agtgcaacaa atggtgaagt aaacaattca tttagaaatg aaggagcaac aacaagtgaa 9480 acaaccagtg aaattgtgac aattccatca atgacttggc gacgtgagct ctcgccagca 9540 cggcctgttt cagaggacgt gcaacgcctc acggaaatga gtttccgacg aatggaggaa 9600 gcgatggaag agcaaaggaa tgctcgtcgc catcaaagaa atatcccgga cactccaaga 9660 agcccgtctc cgatgtccat ggaagaacag gagaattacg tgcagtcacc gagaatcatc 9720 ccgatggccc ccggaagtcc ccctccgcct gaccatcacg gacgtcatct gcccccatca 9780 tacgattcat tgtacggtcc gcccacaaca cgccgcctgc ccgactttct acaagagcat 9840 cagcaggtgc aacgcgagtt gatggaatta agcgaaaaat tcttggagac gcaagcgcgg 9900 ctaatggccg tataccagca agaccgccag cgtatcctcc ggcgttttta ttaagatatt 9960 ataataataa taaccaactt tttgtagtca aatgatagaa acagtagaca attttgaaaa 10020 acaaaatttt agataccgcc gggccgacgg ttaattgttc tttatacttt gtctttgtat 10080 ttggaattcg taatcaccaa tctaaattcg tatgaataaa attgtatcct taatccatac 10140 ctgcattccc gtgcctttgt cataacctct attgattaga cacgagggcg tgtcttttca 10200 aaggaaggag gta 10213 // ID hAT-22_SM repbase; DNA; INV; 3498 BP. XX AC . XX DT 13-FEB-2008 (Rel. 13.02, Created) DT 03-MAR-2008 (Rel. 13.02, Last updated, Version 1) XX DE hAT DNA transposon from Schmidtea mediterranea: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-22_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-3498 RA Jurka J., Bao W. and Tempel S.; RT "hAT DNA transposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 71-71 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 1217..3382 FT /product="hAT-22_SM_1p" FT /translation="MENPDVSIIKDLLENPFSRRDVATKNEIVKLSRPVPD FT IFLSQNVKIKGQQYVRHFQKSIYEKFKWLTGCRELNKLFCWPCLLFVVNEK FT DKNIWIKQGYSDLNHFHEAAKKHERTQSHINAVIQLKTFGTVRIDLMLDQQ FT KASSIVQHNEKVTRNRNILKRLIDVVCYLAKQEEAFRGHNENINSFNRGNY FT LELVTLISKYDDLLANHLRTATVFSGTSNRIQNDIIASIKNILISEIRNEV FT KETPFVAIMLDETSDISHKSQLSVVLRYVNYGVPVERFLFFKDVSGDRSAE FT ALCNVVQETILSWGCENKLIAQTYDGAAVMAGALNGTQKKVKDIFPEAMFV FT HCCAHVLNLVLSQSVLFIKDCKLFFATISGIASFFSVSTKRNHVLNETIKR FT KFPKLAATRWNYSSLLVNTIKSERQGLSELFENIIENPEGWDNDTINIAKG FT YHMFFEEFNSNFLLNIFAKVFSYTDVLFYILQKTNLDIAYSTQKVEEVKNA FT IRNMRNDVAFIEVYEETISVVGPPNLKKRRVNLQNSSVDQHFKSLYYEIMD FT TIFCQLDIRFKKISSLKFFELLNFSFFKKYQSKFPDECLNSLKNAYAKHFD FT FVKLTNELCVLYRSESYHEMNLAKLIQHILVNDLTDALSEVNKLAHLICTI FT PFSTASVERSFSALKRVKNYARNTMSQDRLINLSVITIEKEMLHLLQSKPE FT FYDHVIDDFVKNTDRRMDFVYK" XX SQ Sequence 3498 BP; 1243 A; 493 C; 590 G; 1172 T; 0 other; gcagtggcgt gcggtctata tgtgcagggt gtgcactgca cacccagtta aaattatgca 60 tgacaataat tttatcaact atgtctacca atatttatat tttatatagt acatattctt 120 tataatatgt attatttata accaaatgtt ttaaaaattc actcattcaa atatttgctt 180 gtatttctgc agttcgtgcg gtactaggtg ctgcacaacg tagcgcgttc aaaagcaatc 240 gaacagtcca agtattgata tataaacgca ctgcgcacta gcagaaaatc atgctttata 300 tgcatgagct actggagaag cacagcgtac tatagcattg aggcaagtgc agctactttt 360 cctgcacttg ttactctgaa tcatccgacc gaaggtgcga tttattaagc gttgtattaa 420 atcggggact tttgtgctga agcgattttg cttttttaat tggccatacg atttttcaga 480 atgttaatgt ttaaatttat atattgtaaa atgctgtttt catgaaaata taaaaaaagt 540 caatatataa agagaaatac tttaatgtgg taaaagtaaa taatttctgt gatgtattat 600 aagtatagaa ataatcgctg aaaaactatt ttgttttgtt agacaaaaaa gaaacgaatt 660 tttaatttta acagaaaagt ttaaatttta ccattcattg tatcttacag agaaagagaa 720 tttataaatt atatctgtaa tttgtattac cacatccctg gttttggttt gaaaaatagt 780 caagattttt ttcgcagtat atatgatttc caaatgttaa agaaaaatta ataaataaat 840 ataatttaat tcactgaccg aacaacgaac aagaaacagt tcgttcttcg ttctattgcg 900 ccagcgtgca gtgaaatcct ctgcacatgc aatagacgtt cggtttgtta gcaagatgcg 960 acgacgcacg cacatataat catacaccac acaacacaat atatcagttt tgctagtgct 1020 ggtgcctcta gtacttttct caagtttcgc tggtttaaat gaaaataata acattaaaaa 1080 atttttttaa attatagtta ttatatcttg attgataaat tgatttaatt tgatacattt 1140 gatacatcat ttcaggtata aattttattt attgtattct ttaaatttaa attcaaaata 1200 aataatccat ttcagaatgg agaacccaga tgtctcaata attaaagatt tattggaaaa 1260 tccattttca agacgagatg ttgctacaaa aaacgaaata gttaaattaa gtagaccggt 1320 gcctgatatt tttctatcgc aaaacgttaa gattaaagga caacagtatg tgcggcattt 1380 tcaaaaatct atctatgaaa aatttaaatg gttaaccggt tgtagggaat taaataagtt 1440 gttctgttgg ccatgtttac tttttgttgt aaatgaaaaa gataagaaca tttggataaa 1500 acaaggctac agtgatttaa atcattttca cgaagctgca aagaaacatg aaaggacaca 1560 atctcacata aatgccgtaa tacaattaaa aacatttggt actgtcagga tagatttaat 1620 gttagatcaa caaaaggctt cgtcgattgt tcaacacaat gaaaaagtaa caagaaatag 1680 aaatattctt aagagattaa tagatgtagt ttgttactta gcaaaacaag aagaggcatt 1740 tcgtggacac aatgagaata taaattcctt taatcgagga aattatctag aattagtaac 1800 tttaattagt aaatatgatg accttttagc aaatcatcta agaaccgcga ctgtattttc 1860 tggaacctct aatagaatac aaaatgatat tattgcatcg attaaaaata ttttgatttc 1920 cgaaattaga aatgaggtaa aagaaacgcc attcgttgct attatgctgg atgaaacgtc 1980 tgacatatct cacaaatcgc aattatcggt tgtgctacga tatgttaatt atggtgtacc 2040 agtggaaaga tttttgtttt tcaaagatgt tagtggtgac cgttcagcag aagctttgtg 2100 taatgttgtg caagaaacga ttttgagttg gggatgtgag aataaactga ttgctcaaac 2160 gtacgatgga gcggcagtca tggcaggagc tttgaatggg acgcagaaaa aagtcaaaga 2220 catttttcca gaagctatgt tcgtgcattg ttgcgcacat gtacttaatt tagttctgtc 2280 acaatctgta ttgtttataa aagattgtaa attatttttt gcaacaattt ctggcatagc 2340 gtcatttttt tcggtgtcta caaaaagaaa tcatgtttta aatgaaacta ttaaaagaaa 2400 atttccaaaa ttagctgcaa cacgttggaa ttattcatct ctcttggtaa ataccataaa 2460 aagcgaaagg caaggcctat cggaattatt tgaaaatatt attgagaatc cagaaggctg 2520 ggacaacgac acaattaata ttgcaaaggg ttatcatatg ttttttgaag aatttaatag 2580 taattttctc ctaaatattt ttgccaaagt gtttagttat accgatgttt tgttttatat 2640 tctacaaaaa actaacttag atatagcata ttctactcaa aaagtagagg aagtaaaaaa 2700 cgcaattcga aatatgcgaa atgatgttgc atttattgaa gtctatgaag agactatttc 2760 tgtagtgggt cccccaaatt taaagaaacg tcgtgtaaat ttacagaact caagtgtaga 2820 ccaacatttt aaaagtttat attacgaaat aatggacact attttttgtc agttagatat 2880 tcgttttaaa aaaataagtt cgctcaaatt ttttgaatta ttgaatttca gcttttttaa 2940 gaaatatcaa agtaaatttc cggatgaatg tcttaattcg ttaaagaacg cgtatgcaaa 3000 acatttcgat tttgtaaagc taacaaatga attatgcgtt ctgtatcgta gtgaatcata 3060 tcatgaaatg aatttagcga aacttattca gcatatactg gtaaatgatt tgacagatgc 3120 cctaagcgaa gtaaataaat tggcgcactt aatttgtact attccatttt caactgcttc 3180 agtggagaga tcattttcag cattgaaaag agtaaaaaat tatgcacgga atacaatgag 3240 ccaggataga ttaatcaatt tatctgtaat aacaatagaa aaagaaatgt tacacttatt 3300 acaatcaaaa ccagagtttt atgatcatgt aattgatgat tttgttaaaa atacagatag 3360 gcgtatggat tttgtgtaca aataaaaaat ttatagtaaa aacgttttta atgtatattt 3420 ttatggtttt attattatgc ttacctatta tcgttgctct ctgcacaccc tgcatggaaa 3480 cccaccgcac gccactgc 3498 // ID Zator-2_AA repbase; DNA; INV; 4043 BP. XX AC . XX DT 29-JAN-2009 (Rel. 14.02, Created) DT 25-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Zator DNA transposons from Aedes aegypti. XX KW Zator; DNA transposon; Transposable Element; Zator-2_AA. XX NM Zator-2_AA. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4043 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Mol Biol Evol 26(5), 983-993 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(350..1672,1690..2208,2304..3257,3297..3893) FT /product="Zator-2_AA_1p" FT /translation="MKLNLFYREFGVEFPTTRMEQVPIAVRIEKEGKQVLQ FT AIVSVPTTSSFQSIIARVLGRIENGHEKIDKIYTGSLSMEQKDLFEVDQTS FT VLQDASRVIGTCTKVLCRLKSTLEPSKQPKNAFHQMMQSASAAVYPEKKVE FT KDGRDQLYNCVVQYLKTMNAGFGRFQKDDMQAFMSVIVSALWYLDGQSEKI FT ASAPGVRSVPKRFCFTPENGETFRVLTHGNMRKKPLPRMTRDEMIVVFDRL FT DALCATGITKSKSWSEVRSDVEGLRETFAGYINYLNQVENRAENETIDDSK FT LHDNRSLKTVPGSSSQSALAKLMYKPLHQFLIQEEPCTPVNLILYAPVERK FT ARYKYIQGLCFEFDVNVYRSFKPNATFVWRLPQERSESQIANTIHRLESGL FT ADRIRVDKAKRAEEAFGGLASWTDKDLNDFLTICKGNNFLIVIHIIHFLDD FT KDTDDANSEDLSDTVKGVQLLLNDGHEVEDVILLRDQEQEARDGKFNNFWK FT AVEAVISENDLAVAQERRHGTTSWISPLCVSLRDLMEKSEEKMNQLFPESL FT MNNVPSMEFFRLQFSPRNSRTNSAKRYYSRFDIKYGLQIRTLRKSHPDQHY FT GAKQFQYMKIMAGTFWNIFYAVHKKVNSMFTGLFCDTSIVFFLDDKAAIPV FT GFEGAPVSATRRQRSVLKAGLDNRGLNALDHDNIPQHLTPSITIKLIPPSE FT LSNAWYAGQPYIIIKDAIFQHSTAFRHASELVARYSKNDLERPMLFIGSDG FT GPDHNLTSIMVMLSYIAIFLHLNLDFLCAVRTPPNFSFINPAERFMATANL FT ALIGVSLSRNHLGENEKFVKNLLSKKQWREAQEKHPQNDYPRLALEGTKDA FT RELLQSRFESLCYKGEKVKVFPAASEESIQAIKSEITNQWPALNLNKISKA FT QAMSDSSFKQFFDKHVKATAYTFQVRTILFLQVKKCQDESCAYHKSKRLDL FT NVNWLPTPELKGDKYKSFDEVYGSEPNDAYQPSKTKTSRGDSLPQPSFPLA FT STRARLIVNCTECKFPRVLYSRYSLNEKELFDTKRFFDENYYICGSQLETF FT PDIFQHPKISCFDPIHLHYYQCPSLKGYKNMCAKCLSSNANITYNKINLCE FT KCHSPKSKTVVVKQPKGRSVKK*" XX SQ Sequence 4043 BP; 1275 A; 815 C; 794 G; 1159 T; 0 other; ggccggaaca aatctgaaaa ttttcttttg tcaccccccc cttcaaaacc atcgaaaatt 60 tgaaagggga aaataaaaaa aagttaactt tttctcagtg catttttccg tgttttggat 120 tatttcatat ttttaacgtt gggcccgcag taatgttttg tcctgaggag agttttgtca 180 ttgttgatac aagctacttg ctagaaaccg cctcaacgtc aaatgtcaaa atatttctaa 240 gctgtcttgt ttgtatacct cacttaaaat tttataatct cgtggcaaca gtcttatatt 300 cggtaaatat gcatacatat tttgagttta tgttctgaat catcaacata tgaaattaaa 360 tttattttac agagaattcg gagtagaatt ccccaccacc aggatggagc aagttcccat 420 tgcggtacgg atagagaagg aaggtaaaca ggtgttgcaa gccatagtct ctgttccgac 480 gacatcaagc ttccagtcga ttatcgctcg agtgcttgga aggatagaaa acgggcatga 540 aaagattgat aaaatctaca ccgggagtct atcaatggag cagaaggatc tgtttgaagt 600 cgatcaaact tctgttcttc aggacgcttc aagagtgatt ggaacgtgca ctaaagtttt 660 gtgtagactg aaatcaactc tcgagccttc gaaacaacca aaaaatgcat ttcatcaaat 720 gatgcaaagt gcttcggctg ctgtatatcc tgagaagaag gtggaaaagg acggtcgaga 780 tcaactctac aactgcgttg tgcaatattt aaagacgatg aatgcaggat tcggacgatt 840 tcagaaagat gacatgcagg ctttcatgag tgtgattgtt tcggctctat ggtatctgga 900 tggccaatca gaaaaaatag cttcagcgcc aggagttcgt tcagtaccaa aacggttctg 960 tttcacacct gaaaatggcg aaacattccg tgtgcttaca catggcaaca tgagaaagaa 1020 acctctccca cgaatgacaa gagatgaaat gattgttgta tttgaccgct tagacgctct 1080 atgcgctaca ggtattacaa aatcaaaatc gtggtcagaa gttcgatcgg atgtggaagg 1140 attacgtgaa accttcgcag gttatatcaa ttacctgaat caagtggaga atcgagcaga 1200 aaatgaaacc attgatgatt ctaaactgca tgacaacaga tctctgaaaa ctgtacctgg 1260 tagcagttcc cagtctgcat tagcaaaact tatgtataaa ccgcttcatc agtttctgat 1320 ccaagaagaa ccgtgcactc ccgtgaattt gatcctgtat gctccggttg aaagaaaagc 1380 gagatacaaa tatattcaag gactatgttt tgaattcgat gtaaacgtct atcgttcatt 1440 caaaccgaat gccacttttg tgtggaggct gccccaagag cgatcagaat ctcagatagc 1500 aaacacaatc catcgtttag aatcaggttt ggctgaccgt atcagagttg acaaagcaaa 1560 acgggcggaa gaagcttttg gaggcctagc ttcgtggacc gacaaagatt taaatgattt 1620 tttgactatt tgtaaaggta ataactttct aatagtgatc catatcatcc actaacaact 1680 tttgtttgat ttttagatga taaagacacc gatgatgcta attctgaaga cctttcggac 1740 accgtaaaag gagttcagct gcttttgaat gacggccacg aagttgaaga cgttatcctt 1800 cttcgagatc aagaacaaga agctcgtgat ggtaaattca acaacttttg gaaagctgtt 1860 gaagctgtga tatccgagaa cgatttagct gttgctcagg aaaggcgaca tgggaccact 1920 agttggatat ctcctctttg tgtctcccta cgcgatctga tggaaaaaag tgaagagaaa 1980 atgaatcaac tttttccaga atcgttgatg aacaatgtcc cctcaatgga attcttccgt 2040 cttcaattct ctcccagaaa ctcacgtact aactcggcca aacgatacta cagcagattt 2100 gacataaaat atggactaca aatacgtact ttgcgaaaat cacatccgga tcagcactat 2160 ggagcaaagc agttccaata catgaaaatt atggcaggta cgttttggta aaccagtact 2220 aaaataggat aaattcaaac aatttgcata ttccaattgc cctggaaagt tcattgaaat 2280 tactgtttta agtccagccc taaaatattt tttatgcagt tcacaaaaaa gttaattcca 2340 tgtttacagg tctattctgt gatacaagta ttgtattctt tttggacgat aaagcagcga 2400 ttccagtggg atttgagggt gctccagtca gtgctactcg gaggcagcga tctgttttga 2460 aagcaggttt ggataacaga ggactgaacg ctttggacca tgacaatatt ccgcagcatt 2520 tgaccccttc cataacaatc aagctcattc ctccttctga actttccaat gcatggtatg 2580 ctggacagcc ttacatcatt atcaaagatg ctatatttca gcactcaact gctttccgac 2640 atgcttcaga gcttgttgct cgctactcga aaaatgatct tgaaagacca atgctgttca 2700 ttggatcaga tgggggacct gatcataatt tgacctcgat catggtaatg cttagttaca 2760 ttgccatatt tcttcacttg aatctcgatt ttctgtgcgc agtgcgcacg ccaccaaact 2820 tttcattcat caacccagca gaacgtttca tggccaccgc gaatctagct ttgatcggag 2880 tttctttgtc gaggaaccat ctcggtgaaa acgagaagtt tgtcaaaaac ctactgtcaa 2940 aaaagcaatg gcgtgaagct caagagaaac acccccaaaa cgactatcct cgtttagctc 3000 tagaaggtac aaaagatgca cgagagttgc ttcagtcaag attcgaatca ttatgctaca 3060 aaggggaaaa ggtgaaagta tttccagctg caagcgaaga gagcatccag gctatcaaat 3120 ctgaaataac taaccaatgg cctgcgctaa atttgaacaa aatctccaag gctcaagcta 3180 tgagcgattc atcgtttaag cagtttttcg ataaacatgt caaagcgact gcttatacct 3240 ttcaggtaag aacaatatag caatcctaag atttgttttt aattttttat tcctaactct 3300 tcttacaggt aaagaaatgt caagatgaga gttgtgctta ccacaaatca aagcggttgg 3360 acctcaacgt gaattggcta ccaacacctg aactcaaagg cgacaaatac aagtcatttg 3420 atgaagtcta cggctccgaa cccaacgatg catatcaacc aagtaaaact aaaacttcaa 3480 gaggagatag tttgccccaa ccatcattcc cactagcttc taccagagct cggttgattg 3540 taaactgtac ggagtgcaaa tttccgcgag ttttgtattc gcgatatagc ctgaatgaaa 3600 aggagttgtt cgatacgaaa cgtttcttcg atgagaacta ctacatttgt ggtagtcagc 3660 tagaaacatt tcctgacatt tttcaacatc ctaaaattag ctgctttgat cctatccatc 3720 ttcattatta ccagtgtcca tctctgaaag gctataaaaa tatgtgtgca aaatgccttt 3780 ccagtaacgc gaatatcaca tataataaaa tcaatctgtg tgaaaaatgt cattcgccaa 3840 agtccaaaac cgtggttgtt aagcagccaa aaggacgatc cgttaagaaa taatgtgtat 3900 ttttcgatta aaatactgtt ttgttgcaat aaaaattaaa tttattgaat ttacctaaat 3960 ttccatttta ttattttttc tcccccccct tcacaaattt ctaagcagag tgacaaaaga 4020 aaattttaat atttgttccg gcc 4043 // ID Gypsy5-NVi_I repbase; DNA; INV; 10634 BP. XX AC AAZX01005749; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy5-NVi; KW Gypsy5-NVi_I; Gypsy5-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-10634 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1135-1135 (2007). XX DR Genome; AAZX01005749; Positions 21716 11083. XX CC Positions [6845-7282] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1629..3152 FT /product="Gypsy5-NVi_I_3p" FT /translation="MSDKRVTRNQSKNDPELQKEIDKFRQFRQVEDEDLGN FT ESELSDTFNSATSHNSTLNTDIFDESENNTTGEKTVVDQVTLDNSTNSVEN FT KDNMSTAGTNISLKDALKVVPEFSGDLSELSNFLEGCQEAKDMIPTTAEEN FT LTKVLRGKLIGEARKAILGEKFKKVDELTNYIRDIYLPAKTVHQLQGELGS FT SYQKENESVIVFANRMRDLGTRILEAKRVATNAEVTEEFKKATTDNIIECF FT KNGLKPEIERKMIVADTVLDIVKNAITAERLLNAQNALRGINKSESSKLTE FT KKSVYFCKICRNNEHDTQNCNKKSNDICSICNRVGHPTSKCRFGNNSNEAC FT QLCGVAGHQAKQCTGGKLNIICQICGIRGHGANTCRTIRNSVQCQLCGISG FT HKANTCNRLNNQSSVLNQEQMINRSKLHCDKCNRNGHTAARCYAKTVAYVS FT NENNGNQFVCFYCKQPGHFIRECPVKRSENSRVLQNTGAIPKSQSQEQRQR FT ENSPDLSELVPI" FT CDS 3029..5212 FT /product="Gypsy5-NVi_I_1p" FT /translation="MPSKKVGKLQGSSEHRRNSEEPKPRAASEGEFARFIG FT ASTNINKNAGIPYIELGDKNKTFNIKFMIDTGAQSNIIKISSVPNTVEVDT FT TTNFNLKGISDKPVESLGVIEMKVIGSSTRFFVVDDDFQIPYNGILGCEFL FT YKNYCKVNFDSKTLEVNGNSIKFKQDNEINKLNKFVNYEVSNADIKLPFID FT VLNSPTQEITQFLLDTGAEYSLIKKTLVPDYCKIDNEDKMFLRGIDGEIIK FT TEGSTIIDIFGIEHRIQVVPDDIPIPAEGVIGVDYLSKHDTTLNFKTKSLR FT VHDKYSPFKDRKFINLTINNKMPINEEDFDLESEEENVVDSMYNNYSVHDE FT KNKYYSNEFSNMGQIYSFCVSKEETFNELEIKQNADILESRLCNIFNALRE FT EHGIEPVEQPSIHPEILDARADEVMGILRLDGLTEEEVEQIDALIRKHADR FT IQLQGEDLEATNVLTHRINTTDDIPIYVRQYRLPHSLKEEIETQIKELLRK FT RIIQHSNSAYNSSIWVVPKKPDSTGKIRWRIVLDFRELNEKTVPDKYPIPN FT IADIFDQVGGAKYFTILDCVSGFHQIMLDPKDRHKTAFSTPLGHFEYLRMP FT FGLKNAATEYQRLMNIVLGDLIGKGVFVYIDDIVIYAKNLEEHNRLVNEVM FT NRLRKANLKLQPDKCEFLRREVVYLGHRLSEEGIQPDLGKIEAVSKFPQPI FT DVKGIRQFLGLAGYYRRFIKDFAKIAKP" FT CDS 7727..9667 FT /product="Gypsy5-NVi_I_2p" FT /translation="MAKLVLWGILVLPVVLGLVGYDCGSRSLNITTLSLLD FT VGPCEAPQDTINVTREYVQLLQINEYSEATVIQCKLEIKRTVYYCGMHSHI FT SIVNNGQSEYLYDVSRKACQELHRSGTFMFANVHVIAGVKVNRTVTHSMQF FT TGFVNSEGTCNGGAYSDPYGTWENVVVQGVMQISLKEYSAKININNNKIFL FT KSGTVCNLSEESCIDVEGGYTFWSAIPRDNCKFDRYGVLYEGFANKMIDSE FT YNKEQIIYSLATNDITFALTSKSKDYLCGYTIIRTEHPKLVIFETVKGQTF FT ATKSKISVSNLDIFAYVNSKFVFVEKHMRHQIKQLYRDVVVQRCNIERQTL FT KNALAIATQAPDEFAYHLMKGPGYMALVAGEVVHVIKCIPVEVKVKHGDNC FT YSELEVTKGNRTMYLTPRTHILKLRGTQISCNHLLPAYYSIEGTWYKILPK FT PTDTKDPITIQPNSKSSWEYKSPESLATSGIYTEKDLDELRERIMFPVEKP FT ALLNDIAREMRGHIVTDKEGTLIKLLNADAVEKIIASTWDRIWMKFLTFGT FT VSAGVIGIFLIFHLIKTIIDIGIQGYTLHAIYGWSIHLLGAVWGSITYLLL FT HLNKTDTDQKQDENQDIELQEPLNQEPSTSQQINQVDDKKRNEGFFSRC" XX SQ Sequence 10634 BP; 4273 A; 1285 C; 2064 G; 3012 T; 0 other; ggataaaatc agggaacgtc tccatcccga gttcagcgga agctgactca aaacggagga 60 atattttata gtgcgccgag catcggacgt aacaaatttt gggggttcag ccgggatcat 120 aacgtttcta aattttattt tgataaaacg cgttgagttt attcgatttt attcggtttt 180 tagtgttggt gatttctata tggtcttaag tgaaattttt ataaaggcaa agtttttgaa 240 aagtttattt gaaattaaaa tacgtgaaat ttagtttgaa gattttgtga gaaaatttta 300 tgaaaaataa taatagtttt tgaaatttta tgtgaacaaa aataagaaaa tttgaagttg 360 atttgtttga attgagaaga aaaagtatga aggttatatt gtcagtttaa aagaaagaaa 420 aacgtgaata ttgaaaaatg agttttaaat gaattttata agggaaacta gtgttgaaaa 480 atataaataa agtgaaagtt agttaaagaa taactaagga ataaaataaa aggaaatttt 540 ggaaatttat atgggaaatt atatgtgaat atttgtagaa gtaaacaaat acaaagaaat 600 tataaaagtt ataaatcgga cagagtaacc gtgaaaaact taagaaatat tgaaaaatta 660 ataaaaattg tgttaaataa agaaattaca ttgatgtttt gaatatgaaa cgttgatagg 720 aatatgaaat tgaagtaatt ttatttgcaa tttaagagaa attgcgtgaa attttatgat 780 acaaaaaaaa agcagttttg aaaaagtgat tattgaaaaa aagtgttatt tatgatataa 840 aatttgaaaa gtagatattg ggttaaaatt aaagtgaaat ttgaataatt gaattataaa 900 gttttgaatt ttgaaaaatt tgaatgttga ataaaaaggt tttatgcaac aaatgttaat 960 aaataataaa ataaataaat aaattgagaa tagattttca aaaagattta aaacgagtaa 1020 aaacattata tgtttgtgaa caaaaattta tatttaatgt cagtaaggca aaagaaaatt 1080 ttagataaaa gttaaaatca gtttatgaaa aatagtaagg ttttattatt gttttaattt 1140 gttttagtgg gcaataaaaa gtcacctaag caacggaaaa ggttagctaa gaaaaagtta 1200 gaagaaaatt taaatcaagt aagtcaatcg gacacattca aaagaatcaa acagtcaaaa 1260 agtaaaagga ggaaaattaa gattcaggaa aagttgatag aaaaagatta tgtggataat 1320 ttatgtttta ttaggcagtt aaagaacaaa ttagcaatag tgcgatattt ggatgttatt 1380 aaatctccaa ttaaaaaggt aaaatcaaga agaatcgaca tgagtaaatg tgataaagtt 1440 agaaaaaatt tgtttattaa agaaaatgat agagattgcg ataaatgtat aaatttaaaa 1500 attcgtgata aatgtttaga atgttttata ggtagaaata atttgccacg agcagtttta 1560 gttgaattga atttgaggaa aaccgttatc taaaaatatt aaaggcaatt tgaaaattaa 1620 acgtttacat gagtgataaa cgggtcacac ggaaccaatc aaaaaacgat ccagagttac 1680 aaaaagagat agataaattc agacaattta gacaagtaga agacgaagat cttggaaacg 1740 aatcagaact atcagatact tttaattctg cgacatcaca caatagcact ttgaacacag 1800 acatttttga cgaatcagaa aataacacga ctggggaaaa aacggtagta gaccaagtta 1860 ctttagacaa tagcacaaat tcagtagaaa acaaagacaa catgtcgaca gcagggacta 1920 acattagttt aaaagacgcg cttaaagtag tgccggagtt tagcggagat ttgtcggaat 1980 tatccaattt tttagaagga tgccaggagg caaaggatat gattccaacc actgcggagg 2040 aaaatcttac gaaagttctg agaggtaaac ttataggcga agctagaaaa gccatattag 2100 gagaaaagtt taaaaaagta gatgagttaa ctaattatat tcgcgacata tatttacctg 2160 ctaaaaccgt gcatcagtta cagggagagt taggtagttc gtatcaaaag gagaatgaga 2220 gtgttatagt atttgctaat agaatgagag atttgggaac aagaatatta gaagctaagc 2280 gagtcgccac gaacgcggag gtaactgagg aatttaaaaa agcgacaacg gacaacataa 2340 ttgaatgttt taaaaacgga ttaaaaccgg aaatagagag aaaaatgatt gtggccgaca 2400 cagtattaga cattgtaaaa aatgcgatta cagcggagag acttttaaat gcacaaaatg 2460 ccttgcgtgg aataaataaa agcgaaagta gtaaattaac tgagaagaaa agtgtatatt 2520 tttgtaaaat ttgtagaaat aatgaacacg atactcaaaa ttgtaataaa aagagtaatg 2580 atatttgtag tatttgtaat cgcgtgggac atccgactag caagtgtaga ttcggaaata 2640 atagtaatga ggcgtgccaa ttatgcggag ttgccggaca tcaggctaag caatgcacag 2700 ggggaaaatt aaatatcata tgtcaaattt gcggaattag gggtcatggc gctaatactt 2760 gtagaacaat aagaaatagt gttcaatgtc agttatgtgg aatttcgggg cataaggcaa 2820 atacgtgcaa taggttaaat aatcagagca gtgtcctaaa tcaagaacaa atgataaata 2880 gatcgaaatt acattgcgac aaatgcaatc gcaatggaca cacggcagct aggtgctatg 2940 caaaaactgt agcgtacgtt agtaatgaga ataatggcaa tcaatttgtg tgtttttatt 3000 gtaaacaacc gggacatttt attagggaat gcccagtaaa aaggtcggaa aactccaggg 3060 ttcttcagaa cacaggcgca attccgaaga gccaaagcca agagcagcgt cagagggaga 3120 attcgccaga tttatcggag ctagtaccaa tataaataaa aatgcaggaa taccgtatat 3180 agaattagga gacaaaaata aaacattcaa tattaaattt atgatagata caggggctca 3240 aagtaatatt attaaaatta gctcagttcc aaatacagta gaggttgata caacaacaaa 3300 ttttaattta aagggaattt ctgacaaacc agtagagagt ttaggagtaa ttgagatgaa 3360 agttataggt agtagtacta gattttttgt agttgatgac gattttcaga taccgtataa 3420 tggaatttta ggctgtgaat ttttatacaa aaattattgt aaagtaaatt ttgatagtaa 3480 aactttagaa gtaaacggaa attcaattaa attcaaacag gataatgaaa taaataaact 3540 aaataaattc gtaaattatg aagtttcaaa tgcagatatt aaattacctt ttatagatgt 3600 gttaaattct ccaacgcaag agatcacaca atttttacta gatacaggag cagaatatag 3660 tttaataaaa aaaactttag taccagatta ttgtaaaatt gataacgaag ataaaatgtt 3720 tttgagaggt atagatggcg aaataataaa aactgaagga tcaactataa tagatatttt 3780 tggaattgaa catagaatac aagtcgtacc ggatgacatt cctataccag ctgagggagt 3840 aataggagtc gattatttaa gtaaacatga cacaactttg aattttaaaa ccaaaagttt 3900 aagagtacat gataaatatt caccatttaa agatagaaaa tttataaatt taacaataaa 3960 taataaaatg ccaattaacg aggaagattt tgatttagag tcagaggaag aaaatgttgt 4020 cgattcaatg tacaataatt attcggttca tgacgagaaa aataaatatt attcaaacga 4080 attttcaaac atgggacaaa tttatagttt ttgtgtttca aaggaagaaa cttttaacga 4140 acttgaaata aaacaaaacg cagatatttt agaaagtaga ttatgtaata tttttaatgc 4200 acttagagag gagcacggaa tagagccagt ggaacagcca agtatacatc cagaaatttt 4260 agatgctaga gcagatgagg taatgggaat tttaaggtta gatgggctta cagaagaaga 4320 ggtagagcag atagacgctt tgattagaaa acatgctgat agaattcaat tacaagggga 4380 agatttagaa gctactaatg ttttgacgca tagaattaat actaccgatg atataccaat 4440 ttacgttaga caatataggt taccgcattc attaaaagaa gagatagaga cacagattaa 4500 agagttatta agaaaaagga ttattcaaca ttcaaattca gcatataata gttcaatttg 4560 ggtagtacca aaaaaaccag attcgacagg aaaaatacga tggagaatag ttttagattt 4620 tagagaatta aatgaaaaga cagtaccgga taaatatcct attcctaata tagctgatat 4680 ttttgatcaa gtgggagggg cgaaatattt tactatttta gattgcgtat caggttttca 4740 tcaaataatg ttagatccga aagatagaca taagacagct ttttcaacac ctttaggaca 4800 ctttgaatat ttaagaatgc cgtttggatt aaagaacgct gcgacggagt atcagagatt 4860 aatgaatata gttttaggcg atttaattgg taaaggggtt tttgtatata ttgatgatat 4920 tgtaatctac gcgaaaaatt tagaagagca taataggtta gttaatgaag taatgaatcg 4980 attaagaaaa gctaatttaa aactacagcc ggataagtgt gaatttttaa gacgagaagt 5040 agtatattta ggacataggt taagtgaaga agggattcaa cctgatttag gtaagattga 5100 agcggtttcg aagtttccac agccaataga tgttaaggga attaggcaat ttttaggttt 5160 agctggctat tatagaaggt ttatcaagga ttttgcaaag atagcaaaac cttagacaaa 5220 actgttagaa aaagaaacag aatttgactg gaatacagaa tgcgaggaat cgtttgtaaa 5280 acttaaagag atattgtgca atgcaccatt gttgcaatat ccagacttca caaaaccatt 5340 tttaataacg actgatgcat cgggatatgc tttgggaggg attttatctc aagggactat 5400 tggaaaggac cggcccatag catacacctc gagagtattg agaggcgcgg agttacaata 5460 tgacgtgtac gaaaaagagg cactcgccat gatacatagt gttaaaacgt ttagacctta 5520 tgtatttgga cgccgattta caataatcac agatcacaag cctttattat ggttcaaatc 5580 tgctgattta aacactcggg ttcaaaaatg gcgatttaaa ttatctgagt acgactacac 5640 gatcgagtat aaaccaggca agcaaaactg taacgcagat gctttgtcta ggaatcccgt 5700 ggagttaaac gttgtgacaa gagcaagggc taaaataaat aaagacttaa ctgaacaaga 5760 caaaaatgaa gtaacaaaaa ttcgaaaacc aaatgttaag agattagtga aaagtaaaaa 5820 ccggaaagta actaaaaaaa taaatcaagg gccacttaaa gatcggtacc caaagagaaa 5880 tagaaataaa attaattacg aagagagtga ggagagtgac ggtgatattg aagctgaagt 5940 agaaaattta cctaaattaa ataaaataaa accaaaacct ttaaaataag ctaatgataa 6000 agatgaggat agtttatcgt cgtcctcaga cgaagacatt tttaatgata ataatgacaa 6060 tttagacaac aaggaaattt taaataataa taatgatgat attttaaata atgataattc 6120 agatgacgaa gttgaaacaa aatcacaaga tgaagttcat agttcaaaaa ttgtttatag 6180 taaagaattg attcagtata gaaaaagtaa tgtagcctat tttatagata gtgctggaaa 6240 accttgtgat tttggtgcga cagagttgat taaatttaaa aaaaatagat ccaaatacaa 6300 tattagaact aaatgaagtt aaaagcaata aaagatcaaa caataaatat tattttccta 6360 tgtgcattaa aaatgaaaaa ccagagtcaa tatcacaaat taaaataaat atcgaaacag 6420 tgttaattga attaaaagaa aatttagaaa aattaaatca gaaagattgt agttttgcaa 6480 aaagtaaaga aatagaaaaa ttagagtgga atgagatttt aaatttaatt agtaaagttt 6540 ttgaggattc taatattaaa ataattattt gtaaaggaac tttaaagtat gtaccagtag 6600 aaaaaagaga cttgatattt aacgaattac atacatcagc gataggaggg catagaggtg 6660 tttcaaaaac ttataagcga attaaagaaa attattattg ggaaaattta aaggaagata 6720 tacagaaaag aattcaaatg tgtttagatt gtcagttaaa aaaattagtt agattaaata 6780 ccaaacaacc aatgataatt acagatacgc cgggaacaac ttttgttaaa atagcaatgg 6840 ataagtagga cctttaccga aaactaaaaa cgggaatgag tatattttaa ctttacaaga 6900 tcagctctca aaattttgta tagcaatacc tataaaagac acttatgcag ctactattgc 6960 ggatgtattt gtaaagaaag taatatgtat ttttggagca ccgcgagtag ttttaactga 7020 tcagggaagc aactttttaa gtaagctaat gacaagagtg gcaaaacgat ttaaaataaa 7080 gaaagttaga actactgctt ttcatccaca atcgaacggg tcattagaac gttcacatca 7140 tgcgttgagc gagtttttaa agcaatatgc tgacggtgat gatgattggg atgaatgggt 7200 tgatgtagca actttaaact ataataccgg aattcacgaa gggacaaaac acactccatt 7260 tgaagtagtt tttggtcgac tagcaaggat tccttctagc gagccactac gagaaggtga 7320 tgtgtgtcca acatataaag gatatatcat agatttagta agtaggttac atgaaattag 7380 aaaattagtt cacgataatt tagtagatgc aaaaataaaa tcaaaaaaat attatgataa 7440 tcatattaac cctaagaatt ttaatgtagg agatttcgta ttcttaaaat caggacataa 7500 gccaaaaaaa ttaaaaaatc actattccgg accatataga atcatggaaa ttttaaacaa 7560 aaataacgtt cgtataaaaa cagaaaaggg agacaaaata gttcacatca atcgattaag 7620 aatttttaaa attaaaattc aggcaaaaag gaaaaaacca ttactacacc gagatagtga 7680 ttaataaatt ttgatttttt ttttgattac agaataatac ctaaatatgg cgaaattagt 7740 cttatgggga atcttagtct tgccagtagt tttaggatta gtaggttatg attgcggatc 7800 ccgatcgcta aatataacaa cattatcttt actagatgta gggccatgcg aagcgccaca 7860 agatacgatt aatgtaacgc gcgagtacgt acagttgctg cagataaatg agtattctga 7920 agcaactgta atacaatgta aattagaaat taagagaacg gtatattatt gcggaatgca 7980 ttcacacatt tcaattgtta ataacgggca aagcgagtat ctatatgatg taagtagaaa 8040 agcatgtcaa gaattgcata gatcaggcac atttatgttc gcaaacgttc atgtaatagc 8100 aggcgtaaaa gtcaatagaa cagtcacaca cagtatgcaa ttcacaggtt ttgtaaatag 8160 tgagggaacg tgcaacggag gtgcgtattc tgatccttac ggaacttggg aaaatgtagt 8220 cgttcaggga gtaatgcaaa ttagtttaaa agaatattca gcaaaaatta atatcaataa 8280 taacaaaata tttttaaaat cagggacagt ttgtaatctt tcggaagaga gttgcattga 8340 tgttgaaggc ggatacactt tttggagcgc aataccaaga gataattgca aattcgatcg 8400 gtacggagta ttatatgaag gttttgcgaa taaaatgata gattcagaat ataataagga 8460 acaaattata tattcacttg cgacaaatga tataactttt gcgttaacta gtaaaagtaa 8520 agactattta tgcggttata ctattatcag aacagaacat cccaaattag taatttttga 8580 aacagttaaa ggacagacat ttgcgactaa aagtaaaatt tcagtttcaa acttagatat 8640 ttttgcgtat gtaaattcaa aatttgtatt tgtagaaaaa catatgagac atcaaattaa 8700 gcaattatat agagacgtcg ttgtacagcg gtgtaatata gagcgacaga ccttaaagaa 8760 tgcgttggca atagctacac aggcacctga tgagtttgct tatcacttga tgaagggtcc 8820 aggatatatg gcgttagtag caggagaagt agtacacgta ataaaatgca taccagtaga 8880 agtgaaagta aagcacggcg acaattgtta ttcagaatta gaagtcacaa aaggaaatag 8940 aactatgtat ctcacaccga gaactcatat tttaaaatta agaggtacgc agatttcatg 9000 caaccatctt ttaccagcat actactcaat agaaggaacc tggtataaaa tactgccaaa 9060 accaacagat acaaaagatc caattaccat tcaaccaaat tcaaaaagta gctgggagta 9120 caaaagtcca gaatccttag caacatctgg catttacaca gaaaaagatt tagacgaact 9180 gagagaaaga atcatgtttc cagtcgaaaa accagcctta ctcaacgata ttgctcgaga 9240 gatgcgcgga cacattgtga cagataaaga aggcacatta attaagttac ttaatgcaga 9300 tgcagtagaa aagataattg catcaacttg ggacaggatc tggatgaagt tcctaacgtt 9360 cgggacagta agtgcaggag tgattggaat atttttaata tttcatttaa taaaaacaat 9420 aattgacatc ggcatccaag gatatacatt gcatgcaata tacggatggt ccatacatct 9480 gttaggtgca gtttggggat caataactta cttattatta catttaaata aaaccgatac 9540 ggatcaaaag caggacgaaa accaagacat agaattgcaa gaaccattga atcaagagcc 9600 atcaacatct caacaaataa atcaagtaga cgacaagaag agaaacgagg gatttttctc 9660 aaggtgttag taaactcatg ggagtactgt ggagttaaaa ggtctgcgta ataaattaca 9720 acgcaattca tataatttta ttcttcagtt atagattaag taaaaattat atttaagatt 9780 tctaattaat atatctctat ttttcattac tcaggagagt aataattact atggtctttc 9840 agaccctagt aatagtttaa gggtgggggt gttacgtcat agcctaattt aaaaaaaaaa 9900 caaaaaaaat attaaataaa ataaagtgtg cgcagctgcg cgcatttcaa aaaaaaatgt 9960 gtttgtgttt accgtagata agcgagcgtc cggacagcgc agttagacag tccgaggcca 10020 ttaagagcga gcgtgggcat agcgattggg atcggtgcgg cgggcagcgg ccctaccata 10080 aaatttgtgc gcagctggga aaaacccaga aattttgttt actatgctta tgttacgccg 10140 tagaacacga agccgtggat aaaattttta tttttgaagt agaatattta tattaaataa 10200 tgtagaagat ttaaattcat gtaaagtatg gacagctgaa aatactaagc tacagctgtg 10260 catattttat ttagttcaag agtaagagca cgtgttcgcg gagtaaaact taaaataata 10320 ctcgtacaaa ttttatgatt agtttttagt ttaatagtta catagattta ttatgttaga 10380 taaaaagttg gcgaccacgg cgtaaaatga ttatttatcc gtgcagattt tacgatttga 10440 tttcaattaa aaaatgaatg ttataaaaaa ttagttaaaa atagcaggca gaatattata 10500 atgatactat tattacaaaa tatgaatttt acaaacatat tataaataac gcgtagaaaa 10560 attgaaaacg gtacagagag ggccatctaa cgaaaaatcg gcaaggctaa gagtagtgtg 10620 ggagggacaa aggg 10634 // ID Gypsy-24-I_NVi repbase; DNA; INV; 10116 BP. XX AC . XX DT 22-APR-2009 (Rel. 14.04, Created) DT 22-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; Gypsy-24-I_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-10116 RA Bao W. and Jurka J.; RT "LTR retrotransposon from Nasonia parasitic wasp."; RL Repbase Reports 9(4), 785-785 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(2960..7543,7553..9604) FT /product="Gypsy-24-I_NVi_2p" FT /translation="HKKLYRSKIIGNACEQILQSLQKYKPYDRRMFPITKP FT AERRVGKRIKPSRTEWPRSYNTNPLSNTENTVKTNVITASQNQNRANYEDH FT DQQEVQLLEEKEIKLEIYHLKLNDLLEYVEIINPYSKNLCNLLVDSGAQIN FT IIKQNQIPIGTQLLDKKILLSGIANNSIKTLGIVELPINGKYFDFHVAPND FT FVIPYDGILGIKLLRGSKLRLDEGYLEVNNVKLKLQRGPITKTGMKLSQKE FT TLNIILKEIIRDNDAYNIEQQNILKETDHENNIDKIRMQDEANQEICEESR FT TSKISNIEDELKQTEQSHEKQNTEINEAEPLLENDSVTENTFNNDVFAILN FT TISKIHEINDKSELQSLQQKKINENKPRNLENFLKNDDKNIENFILDVNTV FT LQEQVTPEEAPTQTKRLNYKEILTIMKENNVKLSKADKKIIFEEKKYQYVS FT IDIDYIPEGAQVVEELNIDEKLILNIETEILNKNQENKTRKPSRKEILRKK FT LQLKNGGKRNDERIERMINQYPDVFWIEGDKLGTCNVEQHEINLTSDKPVY FT VKQYPLAHKSKGIALKEMGKLQETKAIRETKSAFNAPALVVPKKELIPGEK FT RYRLVIDYRALNEVTIADPYLLPNINEILDLIGKRKYFDVLDLKSGFHQVK FT MRPSDIHKTAFSLHPLGRFEFVVLPFGLKNSARTFQRVINKVLAKYIDKIC FT FNYIDDIIIFGDTIEELEERFGLIAQALIEAGLKLEPEKCEFSKTEVCFLG FT HVISEAGIKPDENKIQAVKKFPIPNNVKKVRQFLGFVNYYRRFIKNLAEIT FT KPLTILLQKDRPFAWGLEQQTAFDKLIESLCSAPVLQYPQFDKPFIITTDA FT SQYALGCVISQGEIGKDRPIAYASRVLQPAELKYATYEKEALGIMYAIKTF FT KNYIYGNKFIIVTDHRPLLWLKSADNNERVQRWRLKLADYDYEIVYKAGKQ FT NTNADALSRNPVAKMDICVVTRAQKRREENQTSQDLTNDEKIKIINEQRSK FT LHNKKLQIKKRKRAKVQKSRKKSVPEEEKPHDKQKNIVYSKNTIQFRNDNI FT IHIVNNKGESLEDNIGELLKKENIHWNKNLSDNEIEILNKGKNKIFVLCLN FT TAESVVTVKTNIIKLFQLLKTIFLQKNYLIFSISKDLIINNISWEEIEEIL FT KTIFEGTNIKVVICLNNIVYVEEKQRDKIFEMLHSTKIGGHSGINRTYNRI FT KDKYYWENLKVDIQNRVNACEDCQRNKLKRIKIKQPMVITDTPLKTFDKIC FT MDVVGPFNVTKNNNKYILSIQDQLTKFIIIASMSDQTAESVADALVKKFIC FT IFGSPKLVLTDRGANFTSKLLKQVARRFKFTKIETTAFSPQSNGSLERAHH FT PLCEYIKNFTSKKIEWDELLEFAQFHYNTSVHTSHKFTPHELVFGYPARLP FT SSETLKKSEQLPTFNGYLENLVLKMQEMAKLARENLIDSKLKSKAYYDRFI FT NPIDLKIGEKVWLIKEPKPGKLEKNHNLGPYEILKVHENSNVTINYNGKPK FT TVHTNKLSRCHTNISLSRNMDNMIIQMIIIFTLFRNNHAIIGYDCGTTTPN FT ITTFSLLDSGECDFHNTQVNSTSVNIELIQVAEFREVTVIQCKIEIHRTVS FT SCGIFGHLIPTENGEQEYIYEISHEQCKLIHDTGIFKYDNIHTIANLKVNS FT TITKGIDFAGSAEGNSCSGASYADSFGSWNKVFVHGLIKITLIQNSAKVSL FT KNDKLRLSSGTVCKFSEKHCIDMEGGHTFWEVLPQGECFRNSFDILFKGIA FT TKYFSDTNHEIVYALNSNEVTFALAIKDKIMRCNREFIRTEHPKLLIIEKS FT SYLNAYHFQNDKISNINSVNLDLFTYVNSKFVYVEKHLGKQLNNLYFNLIT FT EKCEIERKVIQNSLAIASLSPDEFAYSIMKKPGYIARIAGEVVHLIKCVPK FT TVKILHLKECYNQLPVLSGNETWFLTPKTHILVKKGVQVNCNSIVPTYYRI FT SEEWIKFTPTPTKTVSPHILKPNTKIGWTFESVETLATSGVYSQEELDQLQ FT QQLMFPIEKASLLNVIAREMAGEKTSENIFSENTIRNLAESTWHMLIEKLI FT TCGSFSSVFIMILIILHIGKLIIDTIIRGYTLHTIFGWSIHLCGAIFSSVT FT HLLTTLENTSSIKEQIHTNKKKNRVYEETELQEIKITAPISAPMPPPIPSR FT PPHANTYTSKITIPKIQTQSKASISLFSLTPL*" FT CDS join(763..2040,2027..2482,2337..3098) FT /product="Gypsy-24-I_NVi_1p" FT /translation="MSDNKKEVNKNDTQRPRLTRNAVRENPALTKDIVIPN FT IQNPFTKSKKIIRSPPTNSGWIKDSQQVLLSDINNKKLEDSQQIVTPDTDQ FT QSKQNLEIQEILKENKDLKQEDTTPGRTTGAIKKQIKQLNKNHLSLSIPLT FT PAKQKEFETVEKPNHSTFSPITPATAIPIGLDRTVNNTKFLQGVEETSLID FT IEGISDQPSLIGNPSNSKFNFTFDFDNLLDGNRANDINMAENPLRLSLKYV FT ANLVPEFDGKNISVNEYIEKLKHAKNMLSAADQANLIPILKIKLKGETYKA FT MLNATINTIEDFIKAIRQIYPSTENMHSLYGQIEEILQKPDESVLSFANRL FT QELVLKIKDSKEVQGLTATEKAAFETKINNDAFQGFKEXLKQEIRIELGAV FT TNLTEAITKAIEIESKFNKRGILCHKINNIYIIIYIYSDSHNHNTSNATIY FT ACQICDSYGHEALFCHNSGCVYCKKKEHLSNNCRKVKDKIELICTFCNNQG FT HSIDACKLNKIKGNHCQYCQVMGHTVTQCPFIIEYELCWKCKESGHDPTAC FT TKKADVSNSCEFCNSAGPYDKSMPRGYMQTMNMNCVGSVKKVAMIQLPVQK FT RQMLAIHVNFAIARAHTIKACPEVICKRCNQLGHPVKYCPLVRSPIANNRF FT IMCSICNAEDHEAIECEQVKILVAQHKTKSFQTNIQCQVCDEKGHSAKNCP FT QIGNKQQSFNNSNSKYQNFSNSNNFRKYSNYNNQNTNFQNTNNSQIKCDYC FT EMRGHPVYKCRKLKNLENIARREENCTYCNKSGHNIKNCTEVKSLEMRASK FT FCSLCKNTNHMTEECFRLQNQQSVESGNE*" XX SQ Sequence 10116 BP; 4184 A; 1451 C; 1742 G; 2733 T; 6 other; ttggttccga tacaaatcga attatagagt cttaaaagaa aagttacata taaaaaaaag 60 ggaaaatttt aacaaattaa tcaaactaca gttgcatttg taaatttaaa caaatataaa 120 ttacatatat aaatgcccat accccaaaag taagtcaaag tgttgaaaaa aaaaaaaaaa 180 aaacaagtgg gtgctagaat tttgtgcaaa aacctatcct aaagacgctg aaaagcgact 240 tatcctttga tcgcttaaat agcgattact cttttccata aatcaatatc ctaacggaaa 300 ccatataact cgggaaattc agttagtcaa ctataaaatt atatctgaat aaacagagat 360 aaaaaaaaaa tatatatata aaatttgcaa acaaaacggt agttagattg gaaaatcaat 420 cattaaaaca gaaatataaa atcgcgagtt caaagaaaga ttgggcgagt gacctgcttg 480 cgctcatcgg cgatcgcgct cacctgacgt tgcagtgtca gcaaggcgat tgaaagaagc 540 gcttcaaatt ggtagcgctt gtgtcactag gtcaatcgag aatattcaaa acatatgtga 600 agaaattaat tacgcgcata cctagaaagt agatcgtgtg ttaaaatata agtaggacag 660 taaaatttcg tttgaattgt ttgtaagcga tttgaaattc taaaattcgg actaaataag 720 attttgtaat aaataaaaag gtaacaaaaa tgagattatc aaatgagtga taataagaaa 780 gaagtaaata aaaacgacac tcagaggcca agattaacta gaaacgctgt tcgtgaaaat 840 ccagcattaa caaaggacat agtaatccct aatatccaaa acccattcac aaaaagtaag 900 aaaataattc gatcgccgcc gacaaacagc ggatggataa aagatagtca gcaagtacta 960 ttatctgata taaataataa aaagctagaa gatagccaac aaatagtaac acccgataca 1020 gatcagcaaa gtaaacagaa tctagaaatt caagaaattt tgaaagaaaa taaggacttg 1080 aagcaggaag atacaacacc agggcgaact acgggagcaa tcaaaaagca aatcaagcag 1140 cttaataaaa accatctaag cctaagtata ccgctaacac ctgcaaaaca aaaagaattc 1200 gagacagtag aaaagccaaa tcattcaaca ttttcaccta taacaccagc cacagcgatt 1260 ccaataggcc tagacagaac agttaataat accaaatttc tacaaggagt tgaggagaca 1320 tcgttaatag atatagaagg tattagtgac cagcctagct taattggaaa cccaagtaat 1380 tcaaagttta attttacttt tgattttgat aatcttcttg acggcaatcg tgccaacgat 1440 atcaacatgg cggagaatcc gctacgctta tcgttaaaat atgtagctaa tttggtacca 1500 gaatttgatg gaaaaaatat ttcggtaaat gaatacatcg aaaaattgaa acatgccaag 1560 aatatgttat cagcagcaga tcaagctaat ttaattccaa ttttgaaaat taaattaaaa 1620 ggggaaactt acaaggcaat gttgaatgca acaatcaata ctattgaaga tttcataaaa 1680 gcaatacgac agatataccc atctactgag aatatgcact ctctgtatgg gcaaattgaa 1740 gaaattttgc aaaaaccgga cgaatcggtt cttagttttg ccaatagatt gcaagagtta 1800 gtgctgaaga ttaaggacag caaagaagta caaggtctca cagcaacaga aaaggccgcg 1860 tttgaaacta aaataaataa tgatgctttt caaggtttta aggaakgatt aaagcaagaa 1920 atcagaatag aattgggggc agtaaccaat ttaacagaag ccattacaaa agctatagaa 1980 atagaatcga aattcaataa aagaggaatt ttatgccata aaataaataa tatatatata 2040 tagcgattca cataatcaca atacaagtaa tgctactata tacgcatgcc aaatctgtga 2100 cagttatggg catgaagcat tattctgtca taattcggga tgtgtatatt gtaaaaagaa 2160 ggagcatttg tcgaacaact gcagaaaagt taaggataaa attgaattga tttgtacatt 2220 ttgcaataat caaggacatt ctatagacgc gtgtaaactg aataaaatta aaggcaatca 2280 ttgccaatat tgccaggtta tgggccatac agtgactcaa tgcccattta taatagaata 2340 tgaattgtgt tggaagtgta aagaaagtgg ccatgatcca actgcctgta caaaaaaggc 2400 agatgttagc aattcatgtg aattttgcaa tagcgcgggc ccatacgata aaagcatgcc 2460 cagaggttat atgcaaacga tgtaatcaac ttggacatcc agtaaagtac tgcccgttag 2520 taagaagtcc gattgcaaat aacagattta taatgtgctc aatttgtaat gcagaagacc 2580 atgaagccat agaatgtgaa caagtcaaaa tattagtagc gcaacacaag acaaaatctt 2640 ttcaaacgaa tattcaatgt caagtctgtg atgaaaaagg ccactcagct aaaaactgtc 2700 ctcaaatagg taataaacag caaagtttta ataattcaaa tagtaaatat caaaatttta 2760 gtaactctaa taactttaga aaatatagca actataacaa tcaaaatact aattttcaaa 2820 atacaaataa tagccaaata aagtgtgatt attgcgaaat gcgtggtcac cctgtatata 2880 agtgtagaaa attgaagaat ttagaaaata ttgctagaag agaggaaaac tgcacatatt 2940 gtaacaaaag cggtcataac ataaaaaatt gtaccgaagt aaaatcattg gaaatgcgtg 3000 cgagcaaatt ttgcagtctc tgcaaaaata caaaccatat gacagaagaa tgtttccgat 3060 tacaaaacca gcagagcgta gagtcgggaa acgaataaag ccttctcgga ccgagtggcc 3120 gagaagttac aacacaaatc cactcagcaa tacagaaaac acggttaaaa caaatgtcat 3180 aacagcatct caraatcaga acagggcaaa ttatgaagat catgatcagc aggaggttca 3240 attgctcgag gaaaaagaaa tcaaactaga aatatatcat ttaaaattaa atgatttatt 3300 agaatatgtt gagataataa atccctattc taaaaattta tgcaatttac ttgtcgatag 3360 tggagcacaa ataaatataa taaaacaaaa tcaaattcct ataggaacac aactgctaga 3420 taaaaaaata ttattatccg gcatcgcaaa taattctata aaaacattag gtatagtaga 3480 actaccgata aatggtaaat attttgattt tcacgtagca cctaatgatt ttgtgattcc 3540 atatgatgga atactgggaa ttaaattgtt aagaggaagt aaactacgtt tagatgaagg 3600 gtacttagag gtcaataatg taaaattaaa attacagaga ggtccgatta caaaaaccgg 3660 aatgaagtta agccaaaaag aaactctgaa tataatttta aaagaaatta ttcgcgacaa 3720 tgatgcatat aatatagaac aacaaaatat tttaaaagaa acggatcatg agaataatat 3780 tgataaaatc agaatgcaag atgaagcaaa tcaagagatt tgtgaagaga gtaggactag 3840 caaaatcagt aacatagagg atgagttaaa gcaaactgaa caatcacatg aaaaacagaa 3900 tactgagata aatgaggcag aaccattact agaaaatgat agtgtcacag aaaacacatt 3960 taataatgat gttttcgcca ttttaaacac aatatcgaaa atccatgaaa taaatgacaa 4020 atcagaatta caaagtttac aacaaaaaaa gataaatgaa aacaagccta ggaacttgga 4080 aaattttttg aaaaatgatg ataaaaatat agaaaatttt atacttgacg taaatacagt 4140 attacaagag caagtaacgc ctgaagaagc gccaacccaa accaaaaggc tgaattataa 4200 ggaaatatta actattatga aggaaaataa tgtaaaacta tctaaagcag ataaaaaaat 4260 aatttttgaa gaaaaaaaat atcaatacgt ttcaatagac atcgattata taccagaagg 4320 ggctcaagta gtagaagaac ttaatattga tgagaaatta attctaaata tagaaactga 4380 aattttaaat aaaaaccaag aaaataaaac cagaaaaccg agtcggaagg aaatattgcg 4440 aaagaaattg caattaaaaa acggaggtaa aagaaacgac gaaagaatcg agcgaatgat 4500 aaaccaatat ccggatgttt tttggataga gggcgataaa ttaggtacat gtaacgtaga 4560 acaacatgaa atcaacttaa cttctgacaa gccagtatac gtaaagcagt atccattagc 4620 acacaaatct aagggaatag ctttaaaaga aatgggcaaa ttacaagaaa cgaaagcaat 4680 cagagaaaca aaaagtgcgt tcaatgcgcc agcgttagta gtcccgaaga aggagctaat 4740 cccaggagag aaaagatata gattggtgat agattataga gccttgaatg aggtgactat 4800 agccgatcca tatttacttc ccaatattaa tgaaatttta gacttaattg gaaagaggaa 4860 atattttgat gtattagatt taaaatctgg ctttcatcaa gtaaagatga gaccttcgga 4920 tattcacaaa actgcattta gcttacatcc attaggaaga tttgaatttg tcgtcctacc 4980 gttcggtctt aaaaactcag cccggacttt tcagcgggtt ataaataaag ttttagccaa 5040 atatatcgat aaaatatgtt ttaattatat tgacgacatc attattttcg gtgacacgat 5100 tgaagaatta gaggaaagat tcggtctaat agctcaagca ttaatagaag cagggctcaa 5160 attagagccg gaaaaatgtg aatttagcaa gacagaagtg tgttttttag ggcatgttat 5220 tagtgaagca ggtataaaac cagatgaaaa caaaatacaa gcagtaaaga aattcccaat 5280 tcccaataac gttaaaaagg ttagacagtt tttaggtttt gtaaattatt atagaaggtt 5340 tattaaaaat ttagcagaaa ttaccaagcc actaaccata ttactacaaa aggataggcc 5400 atttgcgtgg ggattggagc aacaaactgc atttgataaa ctcatagaat cattatgttc 5460 agcaccggtc ttgcaatatc cgcaattcga taaaccgttt ataataacga cggacgcaag 5520 tcaatatgcg ttaggttgtg taatttctca aggggagata ggtaaagaca gaccaatagc 5580 ttatgcatca agagtactgc agcctgcgga acttaaatat gcaacctatg aaaaagaagc 5640 tctgggtatt atgtatgcta taaaaacatt taaaaattat atttatggaa ataaatttat 5700 aattgtaact gaccatagac cactattatg gttgaaatct gcggataata acgaaagggt 5760 tcagagatgg agattaaaac tagcagatta tgattatgaa attgtgtata aagctggaaa 5820 acaaaataca aatgcagatg cattatcaag gaatcctgta gcgaaaatgg atatatgcgt 5880 agtaacaaga gcccagaaaa ggcgagaaga aaatcagacg tcgcaagatt tgacgaatga 5940 tgaaaagata aaaattatta atgaacaaag atcgaaactt cataataaga agttacagat 6000 aaaaaaacgc aaaagggcca aggtacaaaa gtctagaaag aaaagtgtac cagaggagga 6060 aaaaccacac gataagcaga aaaatatagt atattccaag aatacgatac aatttcgcaa 6120 tgataatatt attcatatag ttaataataa aggagagtca ttagaagaca atatagggga 6180 attattaaaa aaggaaaata ttcattggaa taaaaactta agtgataatg aaattgaaat 6240 attaaataag ggtaaaaata aaatttttgt tttatgttta aatacagccg aatcagtagt 6300 tactgttaaa actaatatca ttaagttatt tcagttatta aaaacaatat ttttgcaaaa 6360 aaattattta atatttagca tttcaaaaga tctaataata aataatataa gttgggagga 6420 aattgaggaa attttgaaaa ctatatttga aggtacaaat ataaaggttg tgatctgtct 6480 aaataatata gtttatgtag aggaaaagca aagagataaa atatttgaaa tgctacactc 6540 tactaaaata ggaggacatt cgggtataaa tagaacatat aatagaataa aagataagta 6600 ttattgggaa aatcttaagg tagatattca aaacagagta aatgcatgcg aggattgcca 6660 gagaaataag ctaaaaagaa taaaaattaa acaacctatg gtaattacgg atacaccctt 6720 aaaaactttt gataaaatct gtatggatgt agtaggtccg tttaatgtta ctaaaaataa 6780 taataaatat atactttcaa tacaggatca actaactaaa tttataatta tagcatctat 6840 gtcagatcaa acggcggaat ccgtagctga cgcgttagtt aagaaattta tatgtatttt 6900 tgggtcacca aagctcgtac taacagatag aggggcaaat tttacaagta aactattaaa 6960 acaagtagcg cgtagattta agtttacaaa gattgaaact acggcatttt caccccaatc 7020 gaatggttct ttagagagag ctcatcaccc tctgtgcgaa tatataaaaa attttacatc 7080 aaagaaaatc gaatgggatg agttgttaga gttcgcacag tttcattata acacgagcgt 7140 tcatacgagc cataagttca caccacatga actagttttt ggttatccag ctagacttcc 7200 atctagcgaa acattaaaaa aatcagagca attgccaacg tttaatggat atttagaaaa 7260 tcttgtatta aaaatgcaag agatggcaaa attagctcga gaaaatttga tagattcaaa 7320 attaaaatca aaagcatatt atgatcgatt tataaatcca atcgacttaa aaattggtga 7380 aaaggtctgg ttaataaaag aaccaaaacc aggaaaattg gagaaaaacc ataatttggg 7440 tccatatgag atcctgaaag tacatgaaaa tagtaatgta actatcaatt acaatggaaa 7500 accaaaaaca gttcatacaa ataaattaag tcgttgccat acttgatact aaaatatttc 7560 tctttctagg aatatggaca acatgattat tcaaatgatt ataattttta cgctttttcg 7620 aaacaatcac gctataatag gatacgactg cggcacaaca acaccaaaca taacaacatt 7680 ctctctactg gactcaggag aatgcgactt ccacaataca caagtaaatt caacaagcgt 7740 caacatcgag ctgatacaag tcgctgaatt ccgagaagtc acggtaatac aatgtaaaat 7800 agaaatccac cgaacagtat caagctgtgg aatcttcgga catcttattc ctacagaaaa 7860 tggagagcag gaatatattt atgaaataag tcatgaacaa tgtaaactta ttcatgatac 7920 gggaattttt aagtatgata atattcacac aatagcaaat ttgaaggtaa attcaacaat 7980 tacaaaagga atagattttg ccggaagtgc agagggcaac tcttgctcag gtgcatctta 8040 cgctgatagt tttggatcct ggaataaggt atttgttcat ggcttaatta aaataacgtt 8100 aattcaaaat agtgctaagg ttagtttaaa aaatgataaa ttaagattaa gctcaggaac 8160 ggtgtgtaaa ttttctgaaa aacattgtat agatatggaa ggaggacaca cgttttggga 8220 agttttaccg caaggggaat gttttagaaa ctcttttgat attcttttca aagggatagc 8280 aacaaaatat tttagcgata ctaatcatga aattgtttac gcactaaatt caaatgaagt 8340 aacgttcgct ttagctatta aggataaaat tatgcgttgt aatagagaat ttatacgtac 8400 agaacatcca aaattactaa tcatagaaaa gagtagttat ttaaatgcgt atcattttca 8460 aaacgataaa attagtaata taaatagcgt aaacttagat ttatttacat atgttaattc 8520 gaaatttgta tatgtagaaa agcatttagg gaagcagcta aataatttat attttaattt 8580 aataacggaa aaatgtgaaa ttgaaagaaa ggtaatccaa aattccttag ccatagcctc 8640 tctttccccg gatgaatttg cctatagtat tatgaaaaaa cctggataca tagcaagaat 8700 tgcaggagaa gtggtacatt taataaaatg tgtaccaaaa acagtaaaaa ttttgcatct 8760 aaaggaatgc tacaatcaat tgccagtttt atcaggaaat gaaacctggt tcttaacgcc 8820 aaaaacgcat attctggtga agaagggagt tcaagtaaat tgtaactcaa tagtacctac 8880 atactacaga atttcggagg aatggataaa attcacacca acaccaacta aaacagtatc 8940 acctcatatt ttgaaaccaa acacgaagat tggatggaca tttgagtctg tcgaaacact 9000 ggcaacgagt ggagtatatt ctcaagaaga gttggaccaa ttacaacagc aattaatgtt 9060 tccaatagag aaggcatcac ttttaaatgt gattgccaga gaaatggcag gagagaaaac 9120 atctgaaaac atattttckg aaaatacaat aagaaattta gccgaaagta catggcacat 9180 gcttattgaa aaactgatca catgtgggtc attttcatct gtgtttatca tgatacttat 9240 aatcctgcat attggaaaac tgatcatcga taccattatt agaggctaca cgttacacac 9300 gatttttgga tggtcaatac atctatgtgg agccattttc agctcagtaa cgcatttatt 9360 gacaacactg gaaaatacaa gcagcataaa agagcagatt catacgaata aaaagaagaa 9420 cagagtttat gaagagacag aattacaaga aataaagatt accgctccaa tatccgcccc 9480 aatgccacca cctattcctt cacgcccacc acatgcaaat acctatacaa gtaaaatcac 9540 aataccgaaa attcagacac aatctaaggc tagtatatct ttattttcat taacaccayt 9600 ataaaagaaa tctattatat aaaatcccta taatgttatt ctacgtatac aacaamatag 9660 gtatacaaaa tttatatttt ttatgaaaat tgttagttta tagatatttg agactaatgt 9720 taaaactccg attatattat tctgcatata cgatgtattg atagagtaaa tttgtattaa 9780 aatcaactga tgtttttaat gaatgttatg taaaatgata tctataagat ttttaaatat 9840 gtaaataaaa ttccagcata tttagtaatt taaatataaa acagtaaaaa tgaaaacatt 9900 taatgatttt aaacaataat aattaataaa agaaaactat tcttgaaatt ttwaagtaaa 9960 taagttggaa tttttaagtg aaaaaaaaat tattgttaaa atatgttaga attttaagga 10020 tcaaaattat ttgtaaaccg ctagttaatt ttagttttat gtaatattac aataaagttg 10080 ccattgtacc aatgggcaat cggctagttg tggggg 10116 // ID Gypsy-40_CQ-I repbase; DNA; INV; 4487 BP. XX AC AAWU01014791; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_CQ_; KW Gypsy-40_CQ-LTR; Gypsy-40_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4487 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 459-459 (2011). XX DR GenBank; AAWU01014791; Positions 8716 13202. XX CC Positions [3346-3813] - Integrase core CC 'AGCAT' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 134..1165 FT /product="Gypsy-40_CQ-I_2p" FT /translation="MPPKEENQAEGSPVQVIRSGMFGQIEQYVLGEDWDEY FT MNRLEMFFQVNDTPDEMKVPVMFTVAGPSLYTLANRLCAPVSPRIKTYTEL FT TALFKDHFGPTTNVVSERYQLRFCEQTPSQRIADFIVTLKAKAQTCDYGNF FT LQDALRDQFVAGIHDQSLRKKLLTESTLTFEKACTIAKAYEAALSQNKDMS FT APSTSKMAALHNGGPSRQSKGKQPNRNQSSKSSRPPPKPKKPCFRCGRDHD FT PDKCPALDWTCYACGKKGHVSSVCQSKSRQSQPKGSSKRVGEMSEAVEVLR FT LNVLAGVGETAVRAARQPSSAAQSAAGSGSVSVLEPSSSKTPSSSNGGSPD FT R" FT CDS 1438..4347 FT /product="Gypsy-40_CQ-I_1p" FT /translation="MYPLLGRDGLDLMYPEWRKVFSVKSVSSAQTSFESEL FT LKRFPKVVSETAKDVITGFSAEIVLKPDATPVFHKAYSVPYALREKVDQAL FT DKMVEEAILVPVRTSPWASPIVVVPKKDSSVRLCLDGKATLNRFITTEHYP FT LPRIDDILAKMANWKVFCKIDLSGAYLQVSLSESSQSICTINTHKGLFQYT FT RMPFGISPAPAIFQSIIEQILFQSPGIAYLDDIIVGGSTMEECRENLFQVL FT QRLNDHNVKINLSKSSFFQSKIEYLGYSITSEGIRPSESKVKAIIDAPAPK FT NVQQLQVYLGLLNYYHRFLPNLSIELRPLYDLLRKNCRFVWTESCQSAFDR FT TKSLLLENDLLEPFDPSKPIILAVDASPYGVGAVLSHLVDNVEKPVCFASS FT TLTPAQVNYAQVHKEALAVVFGVKKFHKYLYGSRFILITDNSGVKEIFNPT FT KGTSAIAAARLQRWALIVAGYNYVIEHRPGKLMNHADALSRLPLPEEPDVE FT HVSLGINSLAAGSQVVNLDLIRSKQKSDPILSKIFDFVKSGWPTNLNSSFK FT PYLSLNSHLGIDENVLYFDDRVVVPDALKANVMNQLHSNHDGIVRMKMLGR FT LYVWWKNFDKDLNDFVKKCEVCQQRQPVPRESLQSAWPRCDRPFQRIHMDL FT FYLEGHTMLIIVDAFSKFIDIRLLKSGNSIHLIEQVESFFATFGIVEEVVS FT DNGPPFNSELFVAFLSANEVKVSKSPPYHPQSNGLAERGVRSAKDVLKKYL FT LDEKCKPLSMARKLNRFLINYRNTPSTVTNRTPSSMIFSYTPRTLTNVVNP FT QKVQVESTNAKPMIVRKKPIIQVSHESAKMKSFNAGDKVLYRNHFKDIIRW FT IPAIVLQKLSPLTYLISLEGNVRMVHSNQIRYSDLSDKFHPSVPVHPVKAN FT NDDESSGYEDTSSEDGNPVQVPATIRVTPPSKNKKKTSKRRRSETKSPKVR FT RSNRLRGQPKLKYPK" XX SQ Sequence 4487 BP; 1150 A; 1074 C; 1129 G; 1134 T; 0 other; cttggcgacg agggtaaaac ggaacggtaa actgaacagt acaatcagtg atcgcgagtg 60 agtcgtagtt cgcgaaaaag tgtaagtaaa cggtcgaaag tgcagtacac gtggtgtgga 120 cataacctca accatgccac cgaaggaaga gaaccaggct gaaggcagcc cagtgcaagt 180 gattcgtagt ggaatgttcg gtcaaatcga gcagtatgta ctcggtgagg actgggacga 240 gtacatgaac cggctggaaa tgtttttcca agtgaacgac acgccggacg agatgaaggt 300 tccggttatg ttcaccgtgg ccgggccgag cttgtatacg ctagcaaaca ggctctgcgc 360 cccagtctcc ccgcgtatca agacgtacac cgagctaacc gccctgttca aggatcactt 420 cggtcccacg accaacgtgg tgtcagagcg ctaccagttg cggttctgtg agcagacgcc 480 ttcccagagg attgcggact tcatcgtcac cctgaaggcc aaggcacaaa cctgcgatta 540 cggtaatttc ctgcaggatg ccctgcgcga tcagttcgtc gccgggatcc acgaccaaag 600 tctgcgcaag aagttgttaa cggagtccac gttaacgttc gagaaggcct gcacgatcgc 660 caaggcctac gaagctgcgc tgagccagaa taaggacatg tcggccccgt cgacgtccaa 720 aatggcggcc ctgcacaatg gcggaccgag tcgccagtcg aagggtaagc aaccgaaccg 780 gaaccagtca agtaagtcaa gccgtccccc gccgaagcca aagaagccat gcttccggtg 840 cggccgtgat cacgatccag ataagtgccc cgccctggat tggacgtgct acgcgtgtgg 900 gaagaagggc cacgtgtcgt cagtgtgcca gagcaaaagt cgccagtcgc agcccaaggg 960 aagcagcaaa cgtgtaggtg agatgtcgga agctgtggaa gttctccgat tgaacgttct 1020 tgccggagtc ggcgagaccg cagtgcgagc agcccgtcag ccgtcgtctg cagcccaaag 1080 cgcagcagga agcggtagtg tgtccgtcct ggagccgagc agcagcaaaa cgccgtcgtc 1140 aagcaacggt ggatccccgg atcgctgaat cgtgtcgatt cgcccgaact cgtgtcgctg 1200 gactgcgaag gtcgccgtct ggagttcgaa gctgactgcg gagcctgcaa aagcgtgatc 1260 tctgcaacta cgtaccagaa atacttctcg cattgccctt tagtgcccac tgctcaagaa 1320 ttcatctctg ttcccggtca acgcatccaa ccgcagggtc tggtttcgct tcgcgtttcc 1380 gcaccttcgg ggatccaagg taagttggac ttgatagtga ttaccactgc taagaagatg 1440 tatcctctgc tcggtcgcga tggtcttgat ttgatgtatc cggagtggag aaaagtcttt 1500 tcggtcaagt ctgtgagtag tgcgcaaacg tccttcgaat ctgagctctt gaaacgattt 1560 ccaaaagtag tttccgaaac cgcgaaagac gtaattactg gtttttcagc agaaattgtg 1620 ttaaaacctg acgctacgcc agtttttcat aaggcgtact cagttccgta cgcgcttcgt 1680 gagaaagtcg atcaagcgtt ggacaaaatg gtggaagaag ccattttggt acccgtgcgt 1740 acatcgcctt gggcgagccc tattgtggtc gttccaaaga aagattcgtc agtaaggctt 1800 tgtttggatg gaaaagcaac cttgaaccgt tttattacca ccgaacacta ccctttgccc 1860 agaattgacg atattttggc aaagatggcg aattggaaag ttttttgtaa aattgatttg 1920 tctggagctt atttgcaagt ctccttgtct gagtcgtcac agtccatctg tacgatcaat 1980 acgcataaag gtctctttca gtacaccagg atgcctttcg ggatatctcc cgcccccgca 2040 atttttcagt cgatcatcga acagattctg ttccagtcac cgggcattgc gtacctggat 2100 gacatcattg tgggtggatc tacgatggaa gagtgtcgtg aaaatctttt ccaggtcttg 2160 caacgcttaa acgatcataa tgtcaagatc aatctgtcga agtcaagctt ttttcagtcg 2220 aaaatagagt atttaggtta cagcataacg tccgaaggta ttcggccaag tgagtcgaaa 2280 gttaaggcga ttattgacgc ccccgcaccg aaaaacgtac aacagttgca agtttatctg 2340 ggattgctta attattatca tcgttttctg ccgaatttgt cgatcgaatt gcgtcccttg 2400 tacgatttgc ttcgcaaaaa ttgtcgtttc gtgtggaccg agagctgtca gtcggctttc 2460 gataggacga agtcgctttt gctcgaaaat gatctactcg aaccgtttga cccttcgaaa 2520 ccgatcattc tcgccgtcga tgctagtccg tacggcgtcg gcgcggtgtt atcgcacctg 2580 gtcgataatg tcgaaaagcc cgtctgtttc gcgtcgtcca ctctcacccc cgctcaggtc 2640 aactatgctc aagtgcataa agaggcatta gcagtagttt ttggagtcaa aaagttccac 2700 aaatacttgt atggctcgag attcattctt atcacggaca acagtggagt gaaagaaatt 2760 ttcaatccga ctaagggcac ttccgcgatc gccgccgcca ggctgcaacg gtgggctctg 2820 attgtcgctg gttacaacta cgtaatcgag caccgtcctg gtaaactcat gaatcacgct 2880 gatgcactct cccgtctccc tctgcccgag gaacccgatg tcgaacatgt cagtctcgga 2940 ataaacagtc ttgccgccgg atcgcaagtc gtaaatcttg atcttattcg tagtaagcag 3000 aaatctgatc caattttgtc gaaaattttt gattttgtga agtccggatg gccaacgaat 3060 ctcaattcaa gctttaagcc ctatctgagt ttgaactctc acttgggtat cgacgaaaat 3120 gtgttgtatt tcgacgatag agttgttgtt ccagatgctc ttaaagcgaa tgtaatgaat 3180 cagctgcatt cgaatcacga tggaatcgtt cgaatgaaaa tgctaggtcg tttgtacgtg 3240 tggtggaaga attttgataa agatttaaat gattttgtca agaagtgtga agtttgccaa 3300 caacgacagc cagtgccacg cgagtcgctg cagtcggcat ggccacgttg tgatcgtccc 3360 ttccagagga ttcacatgga tttgttttac cttgagggcc acacgatgct tatcattgtt 3420 gacgcgtttt ctaaattcat tgatattcga ttgctaaagt ctggtaacag tattcacttg 3480 attgagcaag ttgagtcctt ttttgctaca ttcggcatag ttgaagaagt tgtttcagat 3540 aacgggcctc cctttaattc tgaattgttc gttgcatttt tgagtgccaa cgaagtcaaa 3600 gtgtcgaagt ctccgcctta ccacccccaa tctaacgggt tggccgagag aggcgtgagg 3660 agcgcaaagg atgttctcaa gaagtatctc ttggacgaaa aatgcaaacc gctgtccatg 3720 gcgcgaaagc tcaatcggtt tttgattaat tatcgtaaca cgccgtcgac agtaaccaac 3780 cgcacacctt cttcaatgat cttctcgtac acgccgcgta cgctgaccaa tgtggtaaat 3840 ccgcagaagg ttcaggtcga aagcaccaac gccaagccca tgatcgtcag aaagaaaccg 3900 ataattcagg tctcccacga atccgcgaaa atgaagtcgt tcaacgcagg tgataaagtc 3960 ttgtaccgta accatttcaa agacatcatc cgttggattc ccgcgatcgt gcttcagaag 4020 cttagtccgt tgacttattt gattagtctt gaaggtaatg ttaggatggt tcattctaac 4080 cagattcgct actcagactt gtccgacaag tttcatccct ctgtccctgt tcaccccgta 4140 aaagcgaata atgatgatga aagtagtgga tatgaggaca cgtcttctga agatggtaat 4200 cccgttcagg tgccagccac cattcgcgtc actccaccgt ctaaaaacaa gaaaaagact 4260 tctaagcgta gacggagtga aacgaaatcc ccaaaagtgc gtcgttccaa tagacttcga 4320 ggtcagccga aactgaagta cccaaaataa tggttgatct tgcgttgtac ctggacccgg 4380 gaaaagttct ttaatgaaga agcagtttag atttagtttg agttgtattt ggcagcgatt 4440 agtcttctag tgcgtagatg tggaggatcg tttgaagcgg gagaaac 4487 // ID Copia-4_DWil-LTR repbase; DNA; INV; 201 BP. XX AC scaffold_181075; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_DWil_; KW Copia-4_DWil-I; Copia-4_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-201 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181075; Positions 978930 979130. XX SQ Sequence 201 BP; 52 A; 39 C; 33 G; 77 T; 0 other; tgttaaaata tgtttataag caccgcagta tgtctatctc tttcttatta ttctcgaatg 60 atccgatctg tgaattgtaa cggacattct ttcatttctg ctttaaattg attttctcgc 120 tgcctgcgct gcagtgaata ttcttatgca taagtttagt ctctaataaa cctctactag 180 aataggttat gggcccagac a 201 // ID BEL-94_CQ-I repbase; DNA; INV; 6293 BP. XX AC AAWU01007086; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-94_CQ_; KW BEL-94_CQ-LTR; BEL-94_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-6293 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 315-315 (2011). XX DR GenBank; AAWU01007086; Positions 22836 16544. XX CC Positions [5336-5875] - Integrase core CC 'CTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 14..6271 FT /product="BEL-94_CQ-I_1p" FT /translation="MTTIRPDENCPYCGLPDEADREMVQCDRCDVWYHFRC FT AGVTRAVKHQSWFCGPCQAERHYGPNPELNPAGSEEDNFSVPPSELSSLRS FT HTSSSIRQGLQLLDEQFQVRQKRLAEEKAMQERRLALEEEYAKQQNDLREE FT HLKVRQELLSSLRDMSVSGSRGSARSRASAVRSERVRQWMEKQQNPVENPR FT KVVPRGAFPNIKPAQDVSPLRNPNMPQQCERASKIDGTVRVGKPQEFPSNE FT ASGGTGYEGVSDGLGRRNVPANPEQNVGRPCPAPRQQHNMPSTFAQDTRRP FT CPAPRNFVDGLGQRKMPSTSARDALGTRRVEDYVPFAGQPREGVEGEAGQY FT WPLISRGNDGRTDRNSFGDPPAPHADGTLALLSGAPNQQLQPQPPQPFDVG FT DGRYRFKSPQEHLQYNQSEQRDGNDRGPHDRGQFGGITPEQLASRQVLPRK FT LPIFSGSYEEWPLFISCYNTTTEACGYTDNENLVRLLDCLRDPALGHVRGQ FT LLLPQSVPRVIETLRRIYGRPEQLLQSLLMKARRCEPPKSDRLETFITFSL FT VIEQLCDHLEATNLQDHLVNPMLLQELVDKTPAGTKMEWVRFKRRHGQVTL FT RTFADFMAGIAEDACEVAPMIDNKQLAGDKRVEKRTKEKGFVHAHDATTVP FT EQTTKERVDIKPCRVCQSTEHKIRNCDVFKNLTQADKTTLVDKWKICRICL FT NEHGDSRCRFKIQCNVEQCRERHHPLMHGEAATTHAAEATEAAHNAHRADI FT GQMQFRLVAVTLHHGTRSVRTTAFLDEGSSMSLVERSLVDVLGVKGGELEP FT LRLIWTADISRTEKNSKKISVQISTAGCDQKHQLALRTVTELKLPTPSFKI FT SSLTKQFKHLGDLEVDDNLGKPRLLLGLDNLHLFAPQESRIGQPNEPIAVK FT SKLGWSVYGPSSDNKCGGGVVGSHFTENISNQELHDLIKEYFAVEEAGVTA FT GVLESDDDRRARVILENTTVKVGGRYETGLLWRSDDYDFPNSYPMAVRRLE FT QLEKKLSKNPELAENVRRQITEYQLKEYAHKATPKELSDSDPRKVWYIPLS FT VVVNPRKAKVRLVWDAAASVNGISLNSQLLKGPDLLVSLIEVLCRFRERPV FT AFGGDLQEMYHQVRIRSEDKQAQRFLFRDNPTLKPDVYVMDVATFGATSSP FT ASAQFVKNKNAAEFAHVHPEAAAAIIQRHYVDDYFDSADTEEEAIERAQAV FT KHIHAQGGFHIRNWVSNSTRFLEALGEPNQKQTVQFTSDKHTNTERILGIT FT WNSVLDVFVFSIPTNEALQPFLTGKARPTKSIVLKFLMSFFDPLGLIAHYL FT VHGKILMQDIWRSGCDWKATINDDCYAGWTQWIKWLPHLEQLQIPRSYFGD FT IRSSRIESVELHVFSDASEKAYGSGAYLRIVCDGAIRCTLALAKTKVAPLK FT LLSIPRLEVSGAVLGARLANTAETCHSLPISRRVFWVDSATVIAWIHSDHR FT KYKPFVAHRISEILSLTSVDEWHFVSTKSNVADDLTKWRKRFTMSSEDRSI FT RGPDFIYTDSKSWSVPATPIENVEEELRAAFLFHEIHLPEPVIDAERFSSW FT KTLVRSLAVIFRFRSNCQRKARGLPIEAVPATSAVTGHVKGAVSSTEVPLR FT QEEYAQAEHCFWRLVQADMFPDEVKVLMKNRELPREEWHQVEKTSVIYQLS FT PFLDHAGVLRMEGRAEAAEELPFDLRFPVILPKKHPITIKLVEHYHRKLRH FT GNNETVVNEMRQRFFIGNLRAVVRMVANDCQLCKIRKSRPAVPRMAPLPIQ FT RLQPYLRPFSFVGVDYCGPITVTVGRRSEKRWIAVFTCFNTRAVHLEVAHS FT LNKQSCLMALRRFMCKYGVPREIFSDNGTNFHGANNEGVLIRAINNNCADV FT VTDARLKWNFNPPSAPHMGGVWERMVRSVKECLKVLDDGRNLTDEVLLTVL FT AEAAEIINSRPLTYQPQDASSPEALTPNHFLRVGPANEELVLEAANVGQAL FT TDSYQRSQMLAQKMWKRWISEYVPSLNRRPKWHEEREPVKAGDLVFIADEE FT QRKTWIRGIVDEVIKAKDGRVRQAVVRAQGKLYKRPVAKLAVIEVGANGRS FT EPHSEKPRD" XX SQ Sequence 6293 BP; 1605 A; 1645 C; 1768 G; 1275 T; 0 other; ttctcgaaag attatgacaa ctatccggcc agatgagaac tgcccgtatt gcgggcttcc 60 ggacgaagcc gatcgggaaa tggtccaatg cgaccgatgc gacgtctggt accacttccg 120 atgtgccgga gtgacgcggg cagtgaaaca ccagtcatgg ttctgcgggc cgtgccaggc 180 cgaacgacat tacggcccga atcccgaact aaatcctgcc ggcagcgagg aggacaactt 240 ctcggtccct ccatctgaac tttcgtcgct ccgatcccac accagttcgt ccattcggca 300 aggactgcag ctgttggatg agcagtttca agttcggcag aagaggttgg ccgaggagaa 360 ggccatgcag gagcgaagat tggcgctcga agaagaatac gccaagcaac agaacgatct 420 acgtgaagag cacctcaagg ttcgccaaga gctgctctcg tctctgcgag acatgagcgt 480 tagtggaagt cgaggcagtg cgcgcagtcg agcaagtgct gtgcgcagtg aaagagttcg 540 gcaatggatg gaaaaacagc aaaatccggt ggaaaatcca cggaaagtgg ttccacgcgg 600 agcttttcca aacatcaagc cagctcaaga tgtgtctcct cttcgaaacc ctaacatgcc 660 acaacagtgc gagagggcga gtaaaattga cggcactgtc cgtgtgggta agccgcaaga 720 gtttccgtcc aatgaagcaa gtggcggtac aggatatgaa ggagtttctg acgggctggg 780 gcgccggaat gttcccgcaa acccagaaca aaatgtcgga cgcccatgcc ccgctccacg 840 acaacaacac aacatgccat caactttcgc ccaagatacg cggcgtccgt gtccggcacc 900 aagaaacttc gtcgacgggc tggggcagcg caaaatgcca tcgacctcag caagagacgc 960 gctaggaacg agacgagttg aggattatgt gccctttgca ggacagccaa gagaaggagt 1020 agaaggagaa gcggggcaat attggccgct aatctcacgt ggaaacgacg gcagaactga 1080 tcggaacagc ttcggagatc cacctgcgcc ccacgcagac ggcacacttg cgttgttgag 1140 cggtgctcct aatcaacaac tgcaaccaca accgccacaa ccttttgacg taggtgacgg 1200 acgctaccgt ttcaagtcac cacaagaaca tctacagtac aaccagagtg agcagcgcga 1260 tggaaacgat cgtggccccc atgaccgtgg tcagtttggt ggaataacgc cagaacagct 1320 tgcctcgcga caagttctgc cccgaaagct gccaatcttc tctgggagct acgaagagtg 1380 gccgctgttt atcagctgct ataatacaac caccgaagcg tgcggctata cggacaacga 1440 gaaccttgtt cgtctcttgg attgtttgag agatcctgct cttgggcatg tacgcggtca 1500 gttgttgctc ccacagtcgg ttcccagagt tatcgaaacg ttgcggagga tctacggacg 1560 tcctgaacaa ctgttgcagt ctctgttgat gaaggctaga cgttgcgaac cccccaaatc 1620 tgatcgactg gaaacgttta tcaccttcag cctggtcatt gaacagctgt gcgaccactt 1680 ggaggcaacg aaccttcaag accacctggt aaaccccatg ctacttcagg aacttgtgga 1740 caaaaccccg gctggcacga agatggagtg ggtccgattc aaacgccgtc acggtcaagt 1800 cactctgaga acgttcgcgg atttcatggc cggcatagcc gaagacgcgt gtgaagtggc 1860 gcccatgata gacaacaagc agcttgcggg tgacaaacga gtggaaaagc ggaccaagga 1920 aaaggggttc gttcacgccc atgatgcgac tactgttcca gagcagacca ctaaagaacg 1980 agttgatatc aagccctgtc gagtctgcca gagcaccgag cacaagatcc ggaactgcga 2040 tgtgttcaag aacctaacgc aagcagacaa gacgacacta gttgacaagt ggaagatctg 2100 tcgcatctgc ctcaacgaac acggggactc gcgatgccgg tttaagatcc agtgcaatgt 2160 cgaacaatgc cgggagcgtc accatccgtt gatgcacggc gaagctgcaa caacccatgc 2220 tgcggaagca acggaagcgg cacacaatgc ccatcgagct gatattgggc aaatgcagtt 2280 tcgcctcgtc gcggtcacgc tgcaccacgg aacgcgttct gtgcgaacga ccgcattcct 2340 ggatgagggc tcatcgatga gtttggtgga gcgctcactg gttgatgttc tcggcgtcaa 2400 aggaggagaa cttgagccgt tgcgcttgat ttggacggcg gatatcagtc gaactgagaa 2460 aaactccaag aaaatcagcg tccagatatc aacggccggg tgtgaccaga aacatcagct 2520 ggcgttgcgc accgttaccg agctgaaact accaacaccg tcgtttaaaa tatccagcct 2580 gacgaagcag ttcaagcacc tgggagacct cgaagtcgac gataacctag gcaagccgag 2640 gttgcttttg ggattagata acctgcattt gttcgctccc caagaatccc gcattggcca 2700 gcccaacgaa cccatagccg tcaagtcaaa gctgggttgg agcgtgtacg gccctagcag 2760 tgacaacaag tgcggcggag gtgtcgtcgg ttcacacttc acggaaaaca tctcgaacca 2820 agagctgcac gacctcatca aggagtactt cgccgtggag gaagctggcg taaccgctgg 2880 agttctagaa tccgacgatg atcgccgagc aagagttatt ttggagaaca ccacagtgaa 2940 agtcggcggc cgatacgaaa cgggactctt gtggcgctcg gatgattacg attttccgaa 3000 cagctatccc atggcggtgc gacgtctaga gcagttggag aagaagctct ccaagaaccc 3060 agaactggcg gagaatgtgc ggcgacagat tacggagtac cagctgaagg aatacgcgca 3120 caaggctacg ccaaaggaat tgtccgattc ggatcctcgc aaagtgtggt acataccact 3180 cagcgtggtt gtaaacccca ggaaggcaaa agtgcgactg gtgtgggacg cggctgcttc 3240 cgtcaacgga atatcgctga acagtcaact cctcaaaggc cctgaccttc tggtgagctt 3300 gatcgaagtt ctctgtcgct ttcgtgagcg gccagtggct ttcggaggtg atctgcaaga 3360 gatgtaccac caggtccgca tccggtcgga agacaagcaa gcacaacgat tcctttttag 3420 agacaacccg actctgaagc ccgacgtgta cgttatggac gtggctacat ttggagccac 3480 tagttctcct gcatccgcac agttcgtcaa gaacaagaac gcagccgagt tcgcccacgt 3540 ccacccggaa gcagcagcgg cgataatcca gcggcactat gttgatgatt attttgacag 3600 tgcggatacg gaagaagagg cgatagaacg agctcaagca gtgaagcata tccacgcaca 3660 aggagggttc catatacgta actgggtgtc caactcgacg cgcttcctgg aagcactagg 3720 agaaccaaac cagaaacaga ccgtccagtt caccagcgac aagcacacca acaccgagcg 3780 tattcttgga atcacctgga actccgtgct ggacgttttc gtgttctcca tcccgacaaa 3840 cgaagcactg cagccattcc tgaccggaaa agcccggccg accaaaagta tcgtgctgaa 3900 gttcctcatg tcgttctttg acccgctagg cctgatcgct cactacttgg tacacggaaa 3960 aatcctcatg caggacatct ggcgttccgg ttgtgactgg aaggcgacga taaacgacga 4020 ctgttacgct ggatggacac agtggattaa gtggctgccc cacctggaac aacttcaaat 4080 ccctcgcagc tactttggcg acattcgcag ctccagaatt gaatccgtcg agcttcatgt 4140 gttcagtgac gctagcgaga aggcctacgg aagtggggcc tatctgcgca ttgtttgcga 4200 tggagccatc cgctgtacgc tcgccctagc gaagaccaag gttgcgccgc tcaagctgct 4260 gtcgattcct cgcctcgaag tttcaggggc agtccttgga gccagactag ccaacacagc 4320 agaaacttgt cattcgttgc caatttctcg acgagtgttc tgggtggact cggctactgt 4380 gattgcctgg attcactctg atcatcgaaa gtacaaaccg ttcgtcgcgc accggataag 4440 cgaaattctc tcgctcacga gtgtcgacga gtggcacttc gtttctacga aatccaacgt 4500 ggcggacgac ctgacgaagt ggaggaagcg atttacgatg agcagcgaag accgttcaat 4560 ccgaggacca gattttatct acacggactc taagagctgg tctgtgcctg caacgccgat 4620 tgaaaatgtg gaagaagaac tgagagcagc tttcctgttc catgaaattc acttgccgga 4680 gcccgtcatc gatgcagagc gcttttcatc ctggaagacg ctggtcagat cgctggccgt 4740 catcttccgc ttccgatcaa actgccagcg caaggcgcgt ggtttgccga ttgaagctgt 4800 tcccgctaca tctgcagtaa ccggacacgt gaaaggagca gtttcgtcca cagaagttcc 4860 tttgcgccaa gaagagtacg cccaggctga acattgtttc tggaggttgg tgcaggcaga 4920 tatgtttccg gatgaagtga aggtgctgat gaagaatcgg gagctgcctc gagaggagtg 4980 gcaccaggtg gaaaaaacga gtgtcatcta ccaattgtcg ccttttctcg accacgctgg 5040 ggtgcttcga atggaaggcc gagccgaggc tgcggaagag ttgccttttg acctgcgttt 5100 tcccgtgatc ctaccgaaga aacatccgat taccatcaag cttgtagaac attatcatcg 5160 caagctccgg catggcaaca acgaaacggt cgtcaatgaa atgcggcaac ggtttttcat 5220 cgggaatcta cgagccgtcg tacggatggt cgcgaacgac tgccagctgt gtaagatccg 5280 taagagtcga cctgctgtac cccgtatggc accgctaccg attcagcgtt tgcagccgta 5340 cctgcggccg ttcagtttcg tcggcgtgga ttactgtggc ccaatcactg taactgttgg 5400 gagacgcagc gagaagcggt ggattgccgt gtttacctgt ttcaacacca gagcggttca 5460 cctagaagta gcccacagtc tcaacaagca gtcctgtctc atggcactga ggaggttcat 5520 gtgcaagtac ggcgtgccca gagaaatatt ctccgacaac ggaacaaatt tccacggcgc 5580 gaacaacgag ggagtcctga tccgtgccat taacaacaac tgtgcggacg tggtgacaga 5640 tgctcgactc aagtggaatt tcaaccctcc gtccgcccca cacatggggg gagtttggga 5700 acgcatggtc cggtccgtta aagagtgtct gaaggtgctg gatgacggta gaaatttgac 5760 ggacgaagta ctgttgacgg tgctagcaga agctgcagaa atcattaact cgagaccttt 5820 aacgtatcaa ccgcaagatg cttcgtctcc ggaggcgctc acgccgaatc actttttgcg 5880 cgtaggccct gcaaatgaag aattggttct tgaagcggcg aacgtaggtc aagcgctgac 5940 ggacagctac cagcggtccc agatgttggc tcaaaagatg tggaagcgct ggatatccga 6000 gtacgtgccg tcgctcaacc gccgacccaa gtggcacgag gaacgggaac cggtgaaggc 6060 gggtgacttg gtgttcatcg cggatgaaga gcagcggaag acgtggatcc gcgggatcgt 6120 ggatgaggtg attaaggcga aggatgggcg agttcgacaa gctgttgttc gtgctcaagg 6180 aaagttgtac aagcggccgg tggcgaaact ggcggtcatc gaggttggag ctaatggacg 6240 gagtgaacct cattccgaga aacctcggga ctaaggttca cgggctgggg aac 6293 // ID DNA8-55B_AP repbase; DNA; INV; 186 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-55B_AP. XX NM DNA8-55B_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-186 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1987-1987 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 186 BP; 54 A; 46 C; 37 G; 49 T; 0 other; caggggtttt caaccttttt gggtccacgg ctcttccgat tttgaaaaca atttaaacgg 60 ctcccatacg aataatgata ataataaaat aaacaaattt acattatttt ctactgcgga 120 cgcggctccc ttgataataa tcggcgacgc ccctgggagc cgcgacgcac aggttgaaaa 180 cccctg 186 // ID Copia-29_CQ-LTR repbase; DNA; INV; 144 BP. XX AC AAWU01004852; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-29_CQ_; KW Copia-29_CQ-I; Copia-29_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 368-368 (2011). XX DR GenBank; AAWU01004852; Positions 83018 83161. XX SQ Sequence 144 BP; 40 A; 40 C; 27 G; 37 T; 0 other; tgttggaagt caagcctaag tgcggccctt gaagattagt ttaagttagc tttaggcgca 60 gtaggccatg ccacgcgcaa actttataaa aactccaccc actgtgaatt cctcctcttt 120 ccccgaataa acaccaactc gtca 144 // ID L1-99ext_Cis repbase; DNA; INV; 1452 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Ciona savignyi. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-99ext_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-1452 RA Smit A.F.; RT "L1-99ext_Cis - L1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Ci000682, Ci000325 appears to be extended ' UTR of L1-99. Ends CC in minisatellite L1-99Sat. Contains Cis1_SINE insert. XX SQ Sequence 1452 BP; 408 A; 286 C; 322 G; 426 T; 10 other; gtggaccaac atagggggcg atccatgatt acaaacgtgg ttgtatacga gagatattgt 60 agcatggacc gtctatgctg agatgattat ctgcggaaga tcatatagag agaggaagaa 120 atttcaaaca ttcgggaata aaagctatta atgtgacgta acaaagaaat caaggaggga 180 gatatacggt cgtgcctgat accgactttc aacacaagat gtattgaaat gaaccatgca 240 ttgacaaatt ggtttaaagc aatgtcaaaa tgaaaccatg agtttctgat ttttcaggga 300 aaatgcaaca aatcgcttgt ntcanncggc aaggtactaa ccagatgttt ccttaaccca 360 gtggttaaat aggtgaacaa atattctgga cgcctaacct ataatcttaa ttccattcca 420 ccgcttagtc tggtggttta gtgcgacacc tttcaaccga aaggttgcag gttcgaaatt 480 ggtcgcaaga tagtcggtta tgttcttggg caaggcttta aattgcaaaa ctcaaagttc 540 cgtgtgtcaa agttaacgaa cattgcctga acccagtgga ttaatgggtt ctaccaaatt 600 naaggaacgt gtctatcata tacaacacac ttcaatagct ccggtataac ccgaggtgat 660 ctgacgattg cccgtgtgtt aaccctcttg gtttgccccc attcacaggg ataaacatga 720 atatcctatc ttaaacgcgg cgggttacga gcaaatgcan ttggctgctg gtcgcttggc 780 acagagtaaa ccttaatctt tacttcctga tgcaatgatc ggtgactgac ttatactgat 840 atatatggtg gatcgcaatg taacttattg atttcacgat gtgaacacga gatgcgatcg 900 ggaagagcga acggacaaca aagaggaaac aagcacacac tcccagaaac cacacaccgt 960 tgacaatgga ctggcactac cacaaaggcc agatcggcca aagcgtacac ggcggaagcc 1020 atagtatttc cacaatgaaa tttgtaacct ccaggaggag gttgaatata gttggggggt 1080 tatggattgg ttggagtaag agtnttctgt tttaatgttg ttgtttttct tttattgttt 1140 cactattcgt atcacttact tggacgtttc cacttgtttt actttgcatt ttcatcttta 1200 ctttgcactt atctatggtc actgtgtact taagttgcac ctcttcaggc gttctagcng 1260 tttccacccg cacacacggt ttatcgattg tcggcactaa ttatttttcg agctcttacc 1320 gttgtccggc cnaaacgaat gtgtcagggn agttctcgtt gtacntctca acaagcgcac 1380 agtattttcc tttcttgagc tttgagtatc gttgtcgagt atattgagta ttgtacggtg 1440 tattgagtat tg 1452 // ID piggyBac-6_SM repbase; DNA; INV; 2372 BP. XX AC . XX DT 29-MAY-2008 (Rel. 13.05, Created) DT 29-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-6_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2372 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 525-525 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-6_SM is a very young family of piggyBac transposons, CC characterized by 18-bp TIRs (two mismatches) and TTAA target-site CC duplications. The consensus sequence was reconstructed based on CC multiple alignment of 17 copies (they are ~99.6% identical to the CC consensus). This transposon may be currently active; and the CC consensus sequences is a good approximation of the active CC transposon. XX FH Key Location/Qualifiers FT CDS 508..2241 FT /product="piggyBac-6_SMp" FT /note="piggyBac transposase." FT /translation="MAKRIKLTDGEILRALEESSEDEVEIEDNSEEMIEND FT LSGSETDVLETASETSDEMFISEDSEESDNENQYISYIGKDGTEWNSVPYI FT KGRTAAHNIIRGGINKVVLPPGKHIDCSLDAFSLFFNDNIINIIVKFTNLH FT GRTVLGKNWKETDQMEILAYIGLLIDAGLKKQGLFDYDEFWDPLFGCTIYR FT ACMSKNRFAALSRHLRFDDLLSRSARRDQDRFAPIRDIWDLVNKNLRKYYI FT PGENLTIDEQLVPFRGRVKFRQYMPSKPDKYGMKIWWICDSKTSYPLFGIP FT YLGQEQAGRAVNLARTIVEQLCEPFERTNRNITFDNYFTSYELANSLLSKG FT LTCVGTLRKNKRCIPQNFLPHRTREVESNVFGFRRTMTLVSYVPKKNRAVV FT LLSTMHHTSEVDNTNKNKSEINLYYNSTKGGVDTLDQKCHAFSVKRKTRRW FT PIAQFYNLVDVCGVAAEIIWKNLYPDWNKTKMNSRRKIFLKNLVQELVIPN FT IKRRSVKHLPKSTIATINETLENSSNIVVIDKSVPETNIRRRCYLCPSAKG FT RASKQCCGNCGKNVCNEHSQKIVVCKTCLNK" XX SQ Sequence 2372 BP; 862 A; 332 C; 427 G; 751 T; 0 other; ccctattaac actaggttgt gactttgaat cacatttcga tttaatttgt agatattata 60 tgcaaataca ctaattttgc caaatgataa taatgtattc tatcatttat aatttcttcg 120 cataaaaaaa ctaaaaaaac aaattttatt tcttatagtt gctatagaaa cttcatatta 180 tataattccg acgtaaaata ccaaattgtg actctaagac acatattaat ttacttagat 240 aaagtaatgg ttttactata ctttacttgg taatggtttt caaatattca aaagtgaata 300 ctaagtaggt atagtgttgt tatagaaact taaaatcttg tgacaaagta caatagtaaa 360 ttttattatt tatttttgtt tctggttggt ggtatattag tgcccatact tttatatttg 420 gttcaagata agcatttttt ctaaaagcag gtaagaggct ataatataat attgtccaga 480 taacaaattg tatattttgt ttcaaggatg gctaaaagaa ttaaattaac tgacggagaa 540 atactcagag ctttagaaga aagtagtgaa gacgaagtag aaattgaaga taattcagaa 600 gaaatgattg aaaatgatct ctcaggtagt gaaactgacg tattagaaac tgcttccgaa 660 actagtgatg aaatgtttat aagtgaagat agtgaagaaa gtgataacga aaatcaatat 720 atttcgtaca taggtaaaga tggtacagag tggaactctg taccatatat caagggccga 780 actgctgctc ataatattat tagaggagga ataaataaag ttgtcttacc gccaggcaag 840 catatcgatt gttctctaga tgcgttttcg ttatttttca atgataatat tattaatatt 900 atcgttaaat tcacaaattt acatggcaga actgttcttg gcaaaaattg gaaggaaacc 960 gaccaaatgg aaatacttgc atatattgga ttattgatag atgctggatt aaaaaagcaa 1020 ggattatttg attatgatga attttgggat ccattatttg gctgcacaat ctatcgtgcg 1080 tgcatgtcaa aaaacagatt tgcagcatta tctcgtcatt tacgttttga cgacctgtta 1140 tcaagatcgg cgcgtagaga tcaagataga tttgcaccaa ttagagatat ctgggatctc 1200 gtcaacaaaa atttgagaaa gtattatatc ccaggagaaa atttaacaat tgatgagcaa 1260 ctggtgccat ttagaggtag agttaaattc cgacagtata tgccctccaa accagataaa 1320 tacggaatga agatttggtg gatatgtgat agcaagacta gttatccttt atttggaata 1380 ccgtaccttg gacaagagca agctggtaga gcagtaaatt tggccagaac tattgttgaa 1440 caattatgcg aaccatttga gagaacaaat agaaatatca cattcgacaa ttattttaca 1500 agttacgaac ttgcaaatag cttattgtcc aagggcctta catgtgtagg taccttacga 1560 aagaacaaac gatgtatacc acaaaacttt cttcctcata gaactagaga agttgaaagt 1620 aacgtttttg gtttccgaag gacaatgacg ctagtcagtt acgtaccaaa aaaaaatcga 1680 gctgttgtct tactatcaac catgcaccac acttctgaag tggacaacac aaataaaaat 1740 aaatcagaaa taaacttgta ttataatagt acaaaaggtg gggttgatac gttggaccaa 1800 aagtgtcatg ctttttcggt aaaacggaaa acccgtagat ggccgattgc acaattttat 1860 aatctcgtgg acgtatgtgg tgtagctgca gaaattattt ggaaaaattt atatccggat 1920 tggaacaaaa ctaaaatgaa ttcgcgacgc aagatatttt tgaagaattt agtgcaagaa 1980 ttagtaatac caaatataaa acgtagaagt gtaaaacatt tgcctaaaag taccattgct 2040 acaatcaacg aaactttgga aaattcttca aatattgtgg tgattgataa atcagttcct 2100 gaaacaaata ttcgcagacg atgttacctt tgtccatcag caaaaggtcg agcctccaaa 2160 caatgttgtg gtaattgtgg gaaaaatgta tgtaatgaac attcccaaaa aatcgtggtt 2220 tgcaaaacat gtttaaataa ataattgata ttttatttta tttttcttat gaataaatga 2280 cattacagat ataactatat gcatttttaa tatgtgtctt aaagtcacat cgtgatgttt 2340 tacgtaagta tattaaccta gtaataatag gg 2372 // ID NVBRP5 repbase; DNA; INV; 175 BP. XX AC X64092; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE N.vitripennis repetitive DNA from B chromosome. XX KW SAT; Satellite; Simple Repeat; NVBRP5; Repetitive DNA; KW satellite DNA. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-175 RA Eickbaum C.D.; RT "NVBRP5."; RL Direct Submission to Genbank (27-DEC-1991)D.C. Eickbaum, RL University of Rochester, Dept. of Biology, Hutchison Hall 334, RL Rochester, NY 14627, USA. XX RN [2] RP 1-175 RA Eickbush G.D., Eickbush H.T. and Werren H.J.; RT "Molecular characterization of repetitive DNA sequences from a B RT chromosome."; RL Chromosoma 101, 575-583 (1992). XX DR GenBank; X64092; Positions 1 175. XX SQ Sequence 175 BP; 51 A; 30 C; 38 G; 56 T; 0 other; taagttctcg gcttacccac ctattatccc ttctattgcg atggtttatt tctactagta 60 gtataagaaa taattacagc aagtttggtt gaaatcgaag gttgggaatc tggacaactg 120 ggcgctagta gttgtcaaga aggaatctga atcacgaata tagatctttg tcgac 175 // ID P-28_HM repbase; DNA; INV; 3885 BP. XX AC . XX DT 22-DEC-2008 (Rel. 13.12, Created) DT 22-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE P-type DNA transposon family: consensus. XX KW P; DNA transposon; Transposable Element; P-28_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3885 RA Bao W. and Jurka J.; RT "P-type DNA transposon families from Hydra magnipapillata."; RL Repbase Reports 8(12), 2081-2081 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1169..2725,2641..3159) FT /product="P-28_HM_1p" FT /translation="MSLHNFILYCQVNRKRKALSVSLTSLENGVEGSVIDI FT NVNFSPFKVQKNKIQNLKSKLKSAQQKNRRLKSKVKSLKKVVQCLKSNNLI FT SSQCEEMLTTTFSGVPLEIMKRVSSIKNQHSIRTKFPDELRSFAMTLQFYS FT SKAYEYVRKTFQLALPHQSQIRHWYTKIPAEPGFTKAAFIALQAAVCASKK FT IGREILCSLMLDEMAIKKQVQWDGKKFRGFVDLGNGIDNDDSLPLARDALV FT LMVVSINSNWKVPCGYFFIDGLNGAERANLVKNCLKKLFDVGVKVVSLTCD FT GPSYNFAMLSELGANLKPSNLLPSFPNPSKSCEKVYVLLDVCHMLKLLRNT FT LADIGIIVDSNNNKIRWHYLVELEKLQSKEGIRMGNKLKLAHIEFRQQKMK FT VNLAAQIFSSSVADALLYCSENLKLSQFEGCGATVEFIRIIDRLYDILNSR FT NPFAKGYKSALRTCNKALWFPFLEMAYNYIISLKDTTGTQMILSSRKTGFI FT GFLIELNLYKVYFLTGLKKKVHHKNWFYWIFNRIKSVQGIFLDWVEKEGSP FT LNYLLTYKLSQDHLELFFGAVRSAGMFNNNPNCQQFTATYKRLLLRSSIQC FT SNGNCSIRDKTHILNVLDDSFADFSTKISMTEIELIRKYDLLERKPTINDH FT DYADIPNFSNLSEYKKVSISYIAGYVAKMTSKKLSVLNVKMH*" XX SQ Sequence 3885 BP; 1450 A; 553 C; 582 G; 1300 T; 0 other; cactgatctc aaaatataac tggatagcgg attctattgt tttttccgaa aaaagttgaa 60 gccaaaattg gcctcccaag atggcggcga tttaagttcc tagcggtaaa accgatttta 120 tatggcttta tcaaactacg atagaaaata attaaataag gctaataaat ttgtcaaaaa 180 taaaagaaaa caaatgaatt ttataaatat tcgagaaaaa aaaaagaaat tattttttaa 240 ataatttaat ttattattaa atggcattta caattttagt taaattttgt atataattta 300 tgtacctttt agttcttagt tacccaatgc tgtatgctag aaggttaata acataacagc 360 atggtaagca gcaatcatct ctacctacaa tttatatttt ctattttttt agctagagac 420 cattttgtac aaatgctgca tgaactatta actgagtgct gtgagtacac acaagctcag 480 gggttccttc taggattata ttatacttga atgttgaaag tgtcactttc actgtttgga 540 aaaatgtatt tattaaattt ttgatataga aatatttgaa acattacttc tttaaaatat 600 cttcttaact aaagattacc tggtgttaat tattacggtt atatagtttg ctgacttatt 660 tatattaatg agctgtaaag tagcaggtaa ataattttat aaacatgcca agttgctgtg 720 tctttggatg tacaaataga actggatcta aaaacaattt aaatgaaaag gtttcttttc 780 acaagtaagt caaactttaa tataaataat ttgtataaat aacttatatt caaatataaa 840 taaaccagaa attataatat aaatactcac gactttagat ttccaaaaga tatagaaaag 900 aaaaaaaaat ggatacataa cattggacaa gaaaattaca taccctcttc taatgtcacg 960 ctttgctcag aacattttga aaaatcttgt tttgataaaa ctggccagac aattcgttta 1020 aagcaaaatt ctgagccaac agtttttata ttaccaaaac aatctttaaa ggtttgtata 1080 agttaactta taataaaaaa acctaacaat tattttctaa caaattgtaa tattgtcaac 1140 agaattatta acatttttaa aattcacaat gagtttgcat aattttatat tgtattgcca 1200 ggtaaacaga aaaagaaaag cattatctgt aagcttaacc tctctggaaa atggagttga 1260 aggttcagta atagatatta atgtaaactt ttcaccattt aaagtgcaaa agaataaaat 1320 tcaaaattta aaatcaaaat taaaaagtgc acagcaaaag aatcgtcgcc tcaaatctaa 1380 agtcaagtct ttaaagaagg ttgttcaatg ccttaaaagt aataacttaa tttcatctca 1440 atgtgaagaa atgcttacaa caacattttc aggagtacct ctagaaatta tgaaaagagt 1500 ttcatcaata aaaaatcaac atagtataag aactaaattt ccagatgaac tgcgatcttt 1560 tgcaatgact ttgcaatttt actcgtctaa agcatacgag tatgttcgaa agacttttca 1620 attggcacta ccacatcaat ctcaaataag acattggtac accaaaatac cagccgaacc 1680 tggatttacc aaagctgctt ttattgctct gcaagctgct gtatgtgctt caaaaaaaat 1740 aggtcgtgag attttatgct cccttatgct tgatgagatg gcaataaaga agcaagttca 1800 atgggatggg aaaaaattta gaggatttgt tgatcttgga aatggaattg ataatgacga 1860 ttctttacca ttggcaagag atgcacttgt tctaatggtg gtttcaataa atagtaattg 1920 gaaagtacca tgtggttatt tttttattga tggacttaat ggagccgaac gtgccaattt 1980 agttaaaaat tgtttgaaga aactttttga tgttggtgta aaagttgttt cgctaacttg 2040 tgatggtcca tcatacaact ttgcaatgct ttctgaactt ggtgcaaatt taaaaccaag 2100 taatttatta cccagttttc ccaatccaag taagtcatgt gaaaaagttt atgtgttgtt 2160 agatgtatgt cacatgttaa aacttctcag aaataccctg gcagatattg gtattattgt 2220 ggacagcaat aataataaaa tcaggtggca ttatttagtt gaacttgaaa aattgcagtc 2280 caaagaagga attagaatgg gaaacaagtt aaaattagcc catatagaat ttagacaaca 2340 aaaaatgaaa gtaaacctag ctgctcaaat atttagttca agtgttgctg atgccttact 2400 ttattgctca gaaaatctta aattatcaca gtttgaagga tgtggtgcta ctgttgagtt 2460 tattcgcata atagatcgat tatatgacat attaaattca agaaacccat ttgccaaagg 2520 atataagtca gcattgcgta catgtaacaa ggctctttgg tttccatttc ttgaaatggc 2580 atataattat atcatcagtc ttaaagatac tactggtacg caaatgatct tgtctagtag 2640 aaaaactggt tttattggat ttttaataga attaaatctg tacaaggtat atttcttgac 2700 tgggttgaaa aagaaggttc accactaaat tatttgttaa catataagct cagtcaagac 2760 catctagagc tcttttttgg tgctgtaaga tctgcaggaa tgtttaacaa caatcccaat 2820 tgccaacaat tcacagccac atataaacga ctattgctgc gaagtagcat tcaatgttca 2880 aatggtaatt gcagcatcag agataaaaca catattttaa atgttttaga cgattctttt 2940 gctgacttca gcactaaaat atctatgacc gaaattgaat taataagaaa atacgatttg 3000 ttggagagaa aaccaactat taatgatcat gactatgcag atattccaaa tttcagcaac 3060 ctatctgaat acaaaaaagt ttcaatttca tatattgctg gatacgtagc taaaatgact 3120 tcaaaaaaat tatctgtatt aaatgtcaaa atgcactagt actccctttt catttagaac 3180 aaaatagctt tttaaaattc aaagacagag gaggtttgat taaaccaacc ataagtgtta 3240 ctatcatatg cgaagaaact gaaaagtgtt ttcaaagatt gcttgcttca acaaacgcac 3300 ctactaatga aataggaatt caaaaaagca atcgcaagtg ctgttttgag atcagtagat 3360 ttctctaaaa ttttcattga actagcttct catatgtttg actcagctat tagtgataac 3420 catttatttt cactggttca acttatttca gaaaattatg caaaaatacg cctttatcac 3480 cttggaaaag aacaaactgc caaaataaca ggaaaaaagc ttcgtaagca attggcaaaa 3540 ctcgtcttat ttaaaaatca ataaattcat taatgattgt atatatatac acatataata 3600 tatgttttat attatatata tatatatata tatatattgt atagtactaa tttcttattt 3660 gttatttact aattttaaat gaattatttc ttaataactt tcacaagaaa attttatctt 3720 atttaattat atttatttaa aaaaagaaag agaaatgaaa ggtattatat ataatttaat 3780 tttaaaagtt ttgtttaaga aaaaacccgc cgccatcttg agaggccaaa actggcttca 3840 aaaaatttgc cgtatacgct atccaattat attttgagat cagtg 3885 // ID Crack-11_AAe repbase; DNA; INV; 4632 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Crack non-LTR retrotransposon family from Aedes aegypti. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-11_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4632 RA Kojima K.K. and Jurka J.; RT "Crack clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1227-1227 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >98% CC identity. CC Closely related to Crack elements in Culex pipiens (Crack-1_CP CC to Crack-4_CP). XX FH Key Location/Qualifiers FT CDS 289..1323 FT /product="Crack-11_AAe_1p" FT /translation="MQKSMSKNVSTCVICRREEKDLRKLLECGYCQKVEHV FT TCKNIVGSAVRKLRCQTYYCSEDCKAQHLRSPKPAEMESQVLKEVRMVLSE FT VKGTRAEMQAEMLAVRTTIGELEKFHNFLSEKLDCLMDDMKLVRREQSELK FT TKYEGLQSDQRATSSTVEQLELEVDRLKRRDLSKNVVIVGVPMVKNESTVQ FT IVKKISVVIGYDLPDDAVLDAKRILSKVQSQNGMKSAPIKVVFSNETHKEE FT LLLKKKSHGPLLLSTIDATFGDGKIVLRDELTSYGMNLLKEVREVQKQFDL FT KYVWPGRNGAVLVKKGDNSKIDVIRCLSDVDLLQRRNLKRQLTVSPEKPAQ FT KR" FT CDS 1368..4256 FT /product="Crack-11_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MYTNNKFDCISDWIEQNKVPIGGSNSLKILQWNIRGI FT NDLAKFDCVGELLQRCKKRIDIVILCETWVKSDRTQMYKLDGYTGVFSSRS FT ESHGGLAVFMRNGLIVEVCQNRAVEGFHHIHCQLLSSGKRINFHAVYRPPS FT YGVREFLDNIENMLSVNRKGDECIIVGDMNVPINMGSNNVVQEYLRLLISY FT NVVVSNTHVTRPSSCNVLDHVICSENLANTVINDTVENTISDHCLIMSSFN FT LACVSETCTLQKEIVDHSRLSELFVSSLVSLPDNMAPGEKLSHIVECYRNC FT LIQCTRTVSIQAKVKGHCPWMTIELWKLISIKDRLFKRHKIHPNDQHTSDL FT LDHVSKLLLKKKKLAKRNYYHTIIEKASQSNAWKVVNEVVGKNNNTDTPNR FT IWKDGVLLSDQGLICSAFNSFFCNIGAQLAATIPSARDPDRFRTTTGHSSS FT IYLSPATLNETITLINDLKERKSPGPDMISAKFIKIHYAIFAPLLTDVFNE FT MISTGNFPECLKVAKVIPIFKSGDPRELNNYRPISCLSVLDKILEKMLACR FT IMNYASHFGLIFSHQYGFRKGSGTLSATCDLVEGIYESLDAKRMVGAVFID FT LKKAFDTINHELLLEKLEFFGIRGTTLTLLESYLTNRQQFVSIGNTISEIR FT TISSGVPQGSNLGPLLFLLFINDICKLNLKGEICLFADDTSLFYKDVQYRN FT IQQQMDHDLNLLYDYFCANKLSLNLKKTKCMFIHSPRRRFPPRLPLTVHGV FT NVEEVHEYVFLGLTIDSTMSWSGHIRNLKKKLSSICGALRRVSNFIPGKWL FT MQLYYTLVHSRLSYLVALWGSASKSILRELQVVQNRCLKIALNKPFRFSTA FT MLYSNRSDNVLPIKALYDLQTLTHVHRISHDPSLHHNIAIRRIQRSRASRQ FT AGNFSLVRPNTEMGRKKLTFYGFKLHNDLRPECKSVNNISIFKKMLTMEMK FT QNVSKYLF" XX SQ Sequence 4632 BP; 1427 A; 954 C; 969 G; 1282 T; 0 other; ttcattcaac agtgatctct acattaacca ggccaaggca catcgctgga cgtgttatct 60 ccacccacgc tagcccggat aaggcaaaaa aatccgaatc tctagtggtc tggtaactat 120 agccatccgt agcaaaccaa ttgattcttg actcatctat tgtctcacca ccagcatgca 180 caagcacttt gtcgtataga agctagttga gctgtctttg ttatacacat tcgctttcat 240 tcgttctaca gttttgtctg tccgtggccg tgaaattatt gttagttgat gcagaagtca 300 atgtcaaaaa atgtttcgac ttgtgtgatt tgtcggagag aagagaaaga tctgcgcaaa 360 ttactggaat gcggttactg ccaaaaagtt gaacacgtca catgcaagaa tatcgttggg 420 agtgcggtac ggaaattacg ttgtcaaacc tactactgtt ctgaggattg caaggcacag 480 catttaaggt caccaaaacc ggcggaaatg gaatcccagg tcctcaagga ggtccgcatg 540 gtcctgagcg aagtcaaggg aacgcgtgct gaaatgcaag ctgagatgct agctgtgaga 600 acaacgatcg gtgagctcga gaagtttcac aacttcttat ccgaaaagct agattgtttg 660 atggatgaca tgaaattggt aagacgggag caaagtgagc tgaaaacaaa gtatgaagga 720 ctacaaagtg atcaacgagc aacttcttca actgtagagc aactggagct ggaggtagat 780 cgtttgaaaa gaagagatct gtcgaaaaac gtcgtcatcg ttggagttcc gatggtgaaa 840 aacgaatcga ccgtccaaat cgtcaaaaaa atttccgtcg tgattggata tgatctaccc 900 gatgatgcag ttctggacgc taagcggatc ttatccaaag ttcaatcgca aaacggaatg 960 aaatcggcgc ctatcaaagt ggtattctcc aatgaaactc acaaggaaga gctgttattg 1020 aaaaagaagt cacatgggcc actcttactt tcgaccattg atgccacctt tggagatgga 1080 aaaatcgttc tgcgggatga gctcacatct tatggaatga atctactgaa ggaggtccgt 1140 gaagttcaga agcaatttga cttgaagtat gtatggccag ggcgtaatgg agctgtacta 1200 gtcaagaagg gggacaactc taagattgat gtcattcggt gcctcagtga tgtcgacttg 1260 cttcagcgta gaaatctaaa gcgtcaacta acagtatcac cagaaaaacc agcacaaaaa 1320 cgatagatgt tggatgtgta caccgctgtt cttgtttgta acccacaatg tatactaata 1380 ataagtttga ttgtatatct gattggattg agcaaaataa agtgcctatt ggcggatcaa 1440 attcattgaa aatacttcag tggaatatca ggggcattaa tgatcttgca aaatttgatt 1500 gtgttggtga actgttgcag agatgtaaga agcgtatcga tattgttata ttatgcgaaa 1560 cctgggttaa atccgataga actcagatgt ataaattgga tggttatact ggagtgttct 1620 cctcaagatc ggagtcacac ggggggttag ctgtgttcat gaggaatgga ttgatcgttg 1680 aagtttgtca gaaccgagcc gtggaggggt ttcaccacat tcattgtcaa ctactttcaa 1740 gtgggaaacg aattaatttt cacgctgtat atagacctcc atcgtacgga gtgagagaat 1800 tccttgacaa tatcgagaat atgctatcag taaataggaa aggagatgaa tgtataattg 1860 tgggagacat gaatgtacct atcaacatgg gatcgaataa tgttgtacag gaataccttc 1920 gcctgttaat atcctacaat gttgtagttt cgaacacaca tgttaccagg ccatctagtt 1980 gtaacgttct tgatcatgta atttgctcgg aaaacttggc caataccgtt ataaatgaca 2040 ctgttgaaaa cacaattagc gaccattgct tgattatgtc gtcatttaat cttgcatgtg 2100 tatcggaaac ctgcacgcta cagaaagaaa ttgtggatca ctcacgcctc agcgaattgt 2160 tcgtttcctc tcttgtgtcc ttgcctgata atatggctcc tggagaaaaa ctgtctcata 2220 tagttgaatg ctaccgtaac tgccttatac aatgcaccag aacggtttcc attcaagcga 2280 aggtgaaagg gcactgtcca tggatgacaa tagaactttg gaaactaatc agtattaaag 2340 accgcctatt caagcgacat aaaattcatc ccaatgatca acatacttcc gatctattgg 2400 accatgtttc aaaattgcta ttgaagaaga aaaaattggc taaacggaac tactaccata 2460 ctatcattga aaaagcttct caaagtaatg cgtggaaagt agtcaatgaa gtagttggca 2520 agaataacaa tactgatacc cccaatcgaa tctggaaaga cggagtttta ctgtccgatc 2580 aaggcttgat ttgttctgca ttcaactcgt ttttctgcaa tattggagct cagttagcag 2640 caacaattcc cagtgcgaga gaccctgacc gtttcagaac aacaaccggc cattcgtcat 2700 caatatatct ctctcctgca acgcttaatg aaacaatcac gctgataaat gatttgaaag 2760 agaggaaatc gccgggtccc gacatgatat cagccaaatt catcaaaatt cattatgcca 2820 tatttgcccc tctgctgacg gatgttttca acgagatgat aagtaccggt aattttcctg 2880 agtgcttaaa agtggcgaaa gtgataccaa tcttcaagtc tggagaccct agagaactga 2940 acaattatcg acctatctcc tgcttatcag ttttggacaa aatactggag aagatgcttg 3000 cttgtaggat tatgaactat gcatctcatt ttggtctgat cttttcgcat cagtatggct 3060 tcagaaaagg gtctggcact ctaagcgcga cttgtgatct tgtcgaggga atatatgaat 3120 ctctggatgc taaaagaatg gtcggtgcag tctttattga cctcaaaaag gcttttgaca 3180 ccataaatca cgagctgctc ttggaaaaac tggaattttt tggaatcaga ggaacaactt 3240 taactcttct ggaaagctat ctcaccaatc ggcagcagtt cgtatcaatc ggaaatacta 3300 tcagtgagat tcgtaccatt agctcagggg ttccccaagg cagtaatctc ggcccattat 3360 tgttcttatt gttcatcaac gatatctgca agctgaatct taaaggggaa atctgcttat 3420 ttgctgatga cacttcactt ttctacaaag atgttcaata ccgtaatatt caacaacaaa 3480 tggaccatga tctcaatctg ttgtatgatt acttctgcgc taataagctg tctctgaacc 3540 tgaagaaaac gaaatgcatg ttcattcact caccaaggcg tagatttcct cctcgcttac 3600 ctctcacagt acatggcgtc aatgttgagg aagtgcatga atatgttttc ttgggactta 3660 caatcgactc cacaatgagc tggtctggac acatcagaaa cctcaaaaag aaactcagtt 3720 ctatatgcgg agccttacgg agagtatcca acttcattcc tggtaaatgg ttgatgcagc 3780 tgtattatac gttggttcac tcaagactga gttacttagt agcactttgg ggatccgcta 3840 gcaaatcgat tctccgtgag cttcaagtgg tacaaaatcg atgccttaag attgctttaa 3900 acaaaccttt ccggttttct accgccatgc tgtattccaa tcgaagcgat aatgttttac 3960 cgattaaagc actgtacgat ctgcaaacgc tcactcatgt ccacagaata tcccatgacc 4020 cttcactgca tcacaacatc gccataagaa ggattcaacg tagtcgtgca tcaagacaag 4080 caggaaactt ttctcttgtc aggccaaata ctgaaatggg ccgtaagaag ctaacttttt 4140 atggattcaa gctgcataat gaccttcgtc ccgaatgcaa atcagttaac aacatcagta 4200 tcttcaaaaa gatgttaaca atggagatga aacaaaatgt ttcaaaatac ttattttaat 4260 gtcccaaacc acatttcact tatacattac gtatattctt ttgatccgct tccatccaca 4320 gccgccgccc tccacccacc agccaccgcc cgccatcaac cacccgccgc caacctccca 4380 ccatcactca acgccttcga aacgaaatgt taagcaatag tagtattgat gtaaagtgtc 4440 tgaatattat tgttgtagat agtttatgaa gcacacttcc ttaaaagagc ttgagctcac 4500 tggaaggtgc acacttgtaa taatgaatga aaagatgagg gggtttttac gccttttgga 4560 gagaaaactt gaaagaagtt tactccaaag ggcttttccc tactccgaaa aaaagataaa 4620 taaataaata aa 4632 // ID piggyBac-N1_BM repbase; DNA; INV; 242 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE A piggyBac nonautonomous DNA transposon from a silkworm - a DE consensus sequence. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW horizontal transfer; piggyBac-15_SM; piggyBac-N1_BM. XX OS Bombyx OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae. XX RN [1] RP 1-242 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 536-536 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-N1_BM is a relatively old family of nonautonomous CC piggyBac transposons, characterized by 15-bp TIRs (one mismatch) CC and TTAA target-site duplications. The consensus sequence was CC reconstructed based on multiple alignment of 25 copies that are CC ~93% identical to the consensus). The genome contains ~200 copies CC of piggyBac-N1_Bm that are ~90% identical to the consensus. The CC complete piggyBac-N1_BM DNA sequence is 88% identical to the CC planarian piggyBac-15_SM (Schmidtea mediterranea). piggyBac-N1_BM CC is a nonautonomous deletion derivate of piggyBac-15_SM (pos. CC 1-196, 2312-2356). The high nucleotide identity, including CC non-coding regions, between these transposons in two species CC diverged from their last common ancestor over 500 million years CC is a clear evidence of horizontal transfer of piggyBac CC transposons. XX SQ Sequence 242 BP; 63 A; 39 C; 36 G; 104 T; 0 other; cactaaacat accataacgg gtcaaatgac ccattttgaa cttttgcatt gaaaatgcac 60 gtatcatctt attgcttttg cacatttgac ttcatgactt tttcttacca attaatctac 120 agttattata ctattttttt tttttttgtt ttgttttgtt tattttgagt aatacttgtc 180 gtataccgtt atctacccgc tatggtagaa ataggtacca gtaagtgttg gtatgattag 240 tg 242 // ID Merlin7_SM repbase; DNA; INV; 1063 BP. XX AC . XX DT 12-AUG-2009 (Rel. 14.08, Created) DT 12-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Consensus sequence of Merlin-type family of repeats. XX KW Merlin; DNA transposon; Transposable Element; Merlin7_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1063 RA Jurka J.; RT "Merlin-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 9(8), 1897-1897 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 95..991 FT /product="Merlin7_SM_1p" FT /translation="MSYYEREILSLRFQPTKVIIEKLMSLGLIKDTFLCQH FT CKIPMIMRKRQTKDNYAWVCNNACCDFYRTTKSIRSDSIFEDLRCPLVDAF FT TVIFSWTMDKTIKSLVTEFNLKKRTIIKIFLRLRLIVKRHLDADPIRLGGI FT NKVCQIDESMFCHKVKAHRGRAPYKQVWVFGIVDTSFVPARGYMEIVSDRS FT ARTLLPIIRRVCRPGTIIHSDEWASYNQIQSSLGFEHQTINHSLYFVDPIT FT KIHTQNIESYWNKQKLKLKSLRGVRRVDLSLYLAEFMWKEIYKNDIYNTLF FT GVIRLYN" XX SQ Sequence 1063 BP; 395 A; 141 C; 177 G; 350 T; 0 other; agtacgttat ctgccctaaa cagggcagat aacgtactaa taaataaata aatatttaaa 60 ttaacaaatt aataaataat caattaataa taaaatgtca tattatgaaa gagaaatttt 120 aagtttaaga tttcaaccaa caaaagtaat tattgaaaaa ttgatgtcgc tgggtcttat 180 taaagatact tttctatgcc aacattgtaa gatccccatg attatgagaa aaagacaaac 240 aaaagataat tatgcatggg tatgtaacaa tgcgtgttgc gatttttaca gaacaacaaa 300 aagtataaga tcagattcta tatttgaaga tctgagatgt ccacttgttg atgcttttac 360 cgtcattttt tcttggacta tggacaagac aattaaatca ttggtaactg aatttaatct 420 caaaaaaaga actataataa agatttttct gcgtctacgt ctaatagtaa aacgtcatct 480 tgacgcagat ccgattcgtc ttggaggaat aaacaaagtg tgccaaattg atgaatctat 540 gttctgtcat aaagtaaagg cacaccgagg tagagcacca tataaacagg tttgggtgtt 600 cggtatagtt gacacatcat ttgttccagc taggggatat atggaaatag tttcagatag 660 gtctgcaaga acactactac ccattataag acgagtttgt agaccaggaa caattataca 720 ctctgatgaa tgggcttctt ataatcaaat tcaatcatct ttggggtttg aacatcaaac 780 gataaatcat tcgctgtatt ttgtagatcc aattactaag attcatacac aaaatattga 840 atcgtattgg aataaacaaa aactcaagtt aaagtcttta cgaggagtaa gaagagtgga 900 tttgtcttta tatcttgcag aatttatgtg gaaggaaatt tataaaaatg atatctataa 960 taccctattt ggtgtaataa gattatataa ttaaatatat cggtttttgt ttgatttttt 1020 atttatttta tattttataa aaatttaggg cagataatgt act 1063 // ID DNA-TA-1_CQ repbase; DNA; INV; 238 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA-TA-1_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-238 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 51-51 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >99% CC identity. TSDs are TA. XX SQ Sequence 238 BP; 70 A; 55 C; 52 G; 61 T; 0 other; cactcaaacc ccgatggttt gacaccaact gttgtcaaac gaacggggtc actttttagt 60 ttgacacccc ttttacacgg agttcacaca cactaccaaa cgtttgtttt gatagtgtgc 120 gtgagcgccg tgtaaaaagt gacagttcgt cactttttag tttgactttg accaaccaac 180 ggggtacaaa ctaaaaaagt gtcaaacgaa aaagtgacca accaccgggg gttgagtg 238 // ID hATm-23_HM repbase; DNA; INV; 3408 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 09-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-23_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3408 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1917-1917 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 647..2827 FT /product="hATm-23_HM_1p" FT /translation="MTNENLLNYFRENVSKLIYHHKKLVSYRSKKTETQKL FT HENNFLALEKELFDIGHPKLYEIIEKDRIRSKDAKKCDILFYEDQKTARTM FT IIDKIDKKYQKVYSNREFRKRKAKELEQKEESATKISKLKQFSSNSNCINE FT NQDLFLLENFDNEEFQYNNYDEIEKQSTNIKQNKVTVTVDIDELIKKTSTF FT CGRFGISVGVQTGLLALFLRCGNINLNEVKLSKTKVEKIRKDLYVSESELI FT KEKIKEKCKNSNLILHIDTKKVTTLNEDYISEVSERLAIVVTSPDWDLNTN FT GKQQDELLGIVECQSGKGYDQALACFNVIKKFGIESIIIGICADTTACNTG FT CNKGCISILEELLNRPLLRLLCRKHSQERHILHAINAITKTESKGPSKSIY FT TRFKAAWPMHYENVQNNMDQLSKFPWCEVSGTPLEDMAKKSLSFCKKALEL FT NTFPRNDYKHLCQYVIVFLEGREAVPEFIIHKPAAEHEARFMADALYILAM FT KLTQPIIKFLTLEEEIVFSKAAIYVASVHAKNFLQSSQIVSAAHNDLCTVK FT EMIKIKDFDADVSQAFISSYLRHSYYLSEETVVFCLFDKSLDNKIKEKVAQ FT QLLKTNLPEIFKLKKFKPVEINEHSELNDYVGRNSWLIFKIVPYLSKWLES FT SPDLWTYDSDYKNMQRFIKRLVCVNDCCERAIKLVKDFIDSTIKEEKLQDI FT LLVLKEHRVNFPFYKCNWSKEILNKL*" XX SQ Sequence 3408 BP; 1317 A; 440 C; 533 G; 1118 T; 0 other; ggcaggtccg tttttagggg ttgaaaaaac tacaactttg ccaatttatt ttcccattct 60 atgttgccat aaaagacttt tcctgaaaaa atttttgaaa aattttaaat tctctgtcct 120 ccctaggagt ttcaatattt catatttatt acatttttta gagaattttt ttttttagtt 180 tgcaccgccg gacctatgcg gagttataat cacagcgcgc aataataaat tttatgaaac 240 tgcgctgcgt ttgtaacttt ttaaatttaa aatggcaata tcaacacgtt caaatactga 300 aatattttta ttaggacaaa cccacgaaat tggttgcaga actttgccta caaaaggcga 360 cgttttaaga tatttttatc actgctgcag attactgaat ttcaaaaaaa aaaaaaaaaa 420 aaacaccgaa attatttcct gccctctaga tataaatcac gagctactat gtaatgataa 480 aatatgtcag cccaaatgtg ttgtttctgc tcttagaaaa ccgtggctta tgggtggtat 540 accacacatt agttgcagaa gtattaggta actattgatt taaataatgt ttatagttta 600 ttacgtaatt attgataatt attgatataa aaatttaagc tgacaaatga caaatgaaaa 660 tttattaaat tatttcaggg aaaatgtttc caaattaata tatcatcaca agaaactagt 720 atcatatcgt tcaaaaaaaa cggaaacaca aaagctgcat gaaaataatt ttttggcatt 780 agaaaaagaa ttgtttgaca tagggcaccc aaaactttat gaaataattg aaaaagatag 840 aattcgatca aaagatgcaa aaaaatgcga tattttattt tatgaagatc aaaaaactgc 900 aagaacaatg ataattgaca aaatagacaa aaaatatcaa aaagtttatt ccaacagaga 960 gtttcgcaaa aggaaagcca aggagttaga gcaaaaggag gaatctgcca caaaaatctc 1020 aaagttaaaa caattttcat caaatagtaa ttgtattaat gaaaatcaag atttattttt 1080 gttagaaaat tttgataatg aagaatttca atataataat tatgatgaaa ttgaaaaaca 1140 atcaactaac ataaaacaaa ataaagtgac agtaacagta gatattgatg aacttattaa 1200 aaaaacttca acattttgtg ggcgttttgg aatttctgtt ggagtacaga caggattatt 1260 agcattattt ttaagatgtg gaaatatcaa tcttaatgaa gtaaaacttt caaaaacaaa 1320 agtagaaaaa ataagaaaag atttgtatgt aagtgaatca gagttaatta aggaaaaaat 1380 aaaagaaaaa tgtaaaaata gtaatttaat tttgcacatt gatacaaaaa aagttacaac 1440 tttaaatgaa gactatattt ctgaagtaag cgaacgttta gctattgttg ttacaagtcc 1500 agactgggat ttaaatacaa atggaaaaca acaggatgag ctgctaggca tagttgaatg 1560 tcaatcgggg aaaggatatg atcaagcatt ggcatgtttt aatgttataa aaaagtttgg 1620 aattgaaagc attataattg ggatttgtgc tgacacaaca gcatgtaaca ctggttgtaa 1680 taagggttgt ataagtattt tagaagaact tttaaatcga cctctgttaa gacttctttg 1740 tagaaaacat tcccaagaaa gacatatttt acatgcaata aatgcaatca ctaaaacaga 1800 aagcaaaggt ccatcaaaat ctatttatac gagattcaaa gctgcttggc caatgcatta 1860 tgaaaatgtt caaaataata tggatcagtt atcaaagttt ccgtggtgcg aagttagtgg 1920 aaccccatta gaagatatgg ctaaaaaaag tttatctttt tgcaaaaaag ctttggaact 1980 taatactttt cccagaaatg attataaaca tctgtgtcaa tatgtcattg tttttctgga 2040 aggaagggag gcagttcctg aatttattat acacaaacca gctgctgaac atgaagctag 2100 atttatggca gacgctttat acattttggc aatgaagctt actcaaccaa tcataaagtt 2160 tttaacccta gaggaagaaa tagtattttc aaaggctgcg atatatgtag cttcagttca 2220 tgcaaaaaac tttcttcaat caagtcaaat tgtttctgct gctcataatg atttgtgtac 2280 agttaaagag atgataaaaa taaaagattt tgatgctgat gtttctcaag cttttatttc 2340 aagctatctt agacacagtt attatctttc tgaagaaacg gttgtctttt gtttatttga 2400 taaaagtctt gacaacaaaa ttaaagaaaa agttgctcag cagttgttaa aaacaaattt 2460 acctgaaatc tttaaactta aaaagtttaa gcctgtagaa ataaatgaac attcagaact 2520 taatgattac gttggtcgaa attcttggct aatttttaaa atagttcctt acttatcaaa 2580 gtggttagaa agttcacctg atttgtggac ttatgattct gattataaaa atatgcaaag 2640 gtttataaaa agattagtct gtgtcaatga ctgttgcgaa agagctatta aattggtaaa 2700 agactttatc gattcaacaa ttaaagagga gaagttacaa gatattttac tggttttaaa 2760 ggaacacaga gttaattttc cattttataa atgtaactgg tccaaagaaa tacttaataa 2820 attgtaagaa aagtttaact ttttataaga aaaattcaac tgaatttttt atgtatatta 2880 tttacaaaac gtatttgtaa aacctttagt caaagtctca tacattttta acaactttta 2940 ttttttatct ttattaagta caatatttgg attccttatt aaacaaactt ttagaatgca 3000 agttttattg agttttttaa tcaatattta agagtattta gataaatata caattaaaaa 3060 catagtaaaa accaggtctt taaaagaaac cgggttatat ttcaaaatgt gttatctcga 3120 gaaattttta atattttgag ctcaaattta gtaccaacct tcctaatagt agacccagct 3180 cactcaaaaa aaaattataa gttttgttgg tccttgatcc aattaaaaaa gcaatttaac 3240 gaataattgg tcaaaaatgg cttttttaaa aatatttaaa ccctcaggga ggtcaggaaa 3300 attgaaatat tttaaatttt tttttggaat tgtcttgtat atcaacgtag aatgggaaaa 3360 taaagtggca aagttatagt ttttctgacc cctaaaaacg gacctgcc 3408 // ID MuDr-2_HM repbase; DNA; INV; 10128 BP. XX AC . XX DT 07-JAN-2009 (Rel. 14.02, Created) DT 28-FEB-2011 (Rel. 16.02, Last updated, Version 2) XX DE MuDr-type DNA transposons from Hydra magnipapillata. XX KW MuDR; DNA transposon; Transposable Element; MuDr-2_HM. XX NM MuDr-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-10128 RA Bao W. and Jurka J.; RT "MuDr-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 442-442 (2009). XX DR [1] (Consensus) XX CC TSD is 9-bp long. TIRs are 55-bp long. MuDr-2_HM_1p is the CC transposase, which is distantly similar to other MuDR CC transposases. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 2008..5235 FT /product="MuDr-2_HM_1p" FT /translation="MKKNISFSKFKFMANQQERSLFYEQKYSLQHLESFRH FT QFLFVFQNDHNYALQIIPVDQSYSNTINDKTSTAYLNKNASLKDQNVIEVN FT MHNETSDKVFIPLSMTSEKSVTSKNSKATLNITAATPAITKKTETLTFNEV FT LTLLQNATYFNNGEETIPVRPKAGEIYGFTISSDKTLKKHLRADGYMWRNK FT KSCALGNGLITKTYFYARVGKTYTNNFEKHEYMLQNGKKALIHYLGNEKSY FT VDRCHGNSTKSKSFFYRSSHYVFDQVLAQGKNKPMDAYRSLLSNDPGGHIS FT TSIIPRNPKQVANIMTRNAEKNNFHNKDEVWNSHLVAYSIHPFVRKIESYP FT DICIVMCCEKIVAQMEEMLELLPCKEITLHYDTTFNLGNYYLSCLAYRHVC FT LETKRNKYLYSSPTVPFAFFIHEKRLAQNHKYFFQTLKENVCGKNFNKSRK FT LFISDREFDGSHYFDNTKSLVCWNHLLQNVKEWMRKNGKFSSSEYDEAVKA FT MKYILRSNSVFEYESRLKSLTKGNINPWNDQKFQKNYIQYMHSDVISRAAA FT FVLAEEGIKYPNFGVTNNASESLNKVIKDLIDNKITKMDSCILAMNYLSAY FT FHREVEKGYYNVGLYQLSDEFAFMLRSRDELSANQIILPDDIVKHIKETEI FT HYKDHNNISSPYASIEMLVEKPVKSDLPKLVYSISSLAKDILNKNGVVFVP FT QNRSWSVKESETIIHSVNLNPPAHCTCGASKNCVHILAVQYLSGNVNSLEH FT FEGKKRNLTSLIQKNRLKVERKSGKKKPRKCDIDSNLKTLNFCELAVPKNE FT SFSIKQNEQSTVPEIKYPIPRGLQWGVGLVLNSCTIDNTLTMLKLRCHADK FT YFLNRLGTSEAEIELKKAMRYLNNDDDNYGKEIWACFLKSKIQITSSPCFN FT LYGSSKTFFFNHFKEAFTFESVGKCFTCLNSSFHPFDFIELPPNHDPTDYL FT LGYLKDKRLLLNCLVCGNKELNHCGTTLKSQNTWMIVIECDIPNISISSLM FT NLPKKILIEKCYFSLACITLGLNAHFTSLHYIRDQWFFYDGMQKNGKLTKM FT IPYNWQNKVANHVIYLID*" FT CDS 7874..7323 FT /product="MuDr-2_HM_2p" FT /translation="MMKQFVKALLKDGETFKFFSSKFPGLSEAKLKEGAFV FT GPDIRELMKDNNFENMVNYVERSAWNSFKDVVTNFLGNQKDSDLKNIVENM FT LCNSKNLGCSMSSKLHFLNSHLDYFPENLGAVSEEQGERFHQDIKEMERRH FT QGRWDTNMMGDYCWMLHRENVTSHKRKSSKRSFETKKTRFYKAL*" XX SQ Sequence 10128 BP; 3547 A; 1448 C; 1486 G; 3647 T; 0 other; gagagtttct aaaacgccga attctaaaac gccgaaaagc cgaaatgccg aaaaaccatc 60 ttggagtaac agaataaaaa acggaatgcg agcaaagaac aaccggtagt caacccccgc 120 attattagtt taaattaatg ttttgaattt tataaaattc aaaacatgta aaattaaaaa 180 atataatgta gcaataagtt ttgtattgct atgccatcta tgtgttgtat accaaaataa 240 tagtggcctg caacaataaa gtaagtttag cggaagctgc acaataaggg gaaagcctgt 300 cactaatgct aaacatgcac agtaagataa aaatctgtta tcacaacaat aggaaatagt 360 aaaattttcg ttgtttttct ttctatttgt ttctttattt tgtgttagct aacacagatt 420 aaagattaat taatctttaa tctgtgttag ctaacaaata ttaagacatt aaactttaat 480 gttttaatag tatggttgac ccatcagttt gacaaacctc aagtactttt aacagccata 540 ctttcctttt ataagctaaa acgattgcta aacatattta ttttattgaa agtaaatata 600 gagtagttag tttagcaaca aaattcaata aagcttgttg aagaaacttc aatgaagtct 660 taataagtat tctaatttgt ttttttgctt gtttgtttat gtttagaaaa atgacaagga 720 atgcaagcat tgggagaagc tttgagcttt gcaaagtgct gtatatgaag gaaggtacct 780 aaaggtatgt tctattcaaa ttattattgt aaaactgtct agtgagaaga gagttcgtta 840 ctgtatagaa aaaaaaagaa ttagtatatg aagtttaata atttgttaaa attggaaaat 900 aaacttctag gcatgaattt attataaaca gaacagaaaa tttatggaac acattgccat 960 tggaagttgt caagtaaaca aatgtaaaca gtttcaagag aaaattagac aaatatttag 1020 tatcaaaaaa gttaattatc ttaaaaatta ttttttgttt aaactattca tcttttgcat 1080 tatattattg aaagttatat tttgttatat ataagttata ttgcagtata ttttcaaaag 1140 ctgtcaaatt aaatttctaa ctcacttgct gttaacgcag ctacagtttg actactacta 1200 tgactattat atatttttaa atgttagcat cagttttttc agtaattttt actattttgt 1260 taataatgct aatatatata tatatatata tatatatata tatatatata tatatatata 1320 tatatatata tatatatata tatatatata tatatatata tgaaagattt gtgaaattta 1380 actactcctt tattagtttt tgattctgaa attaagtaat ttctaatccg catagtttcc 1440 agtggattaa tatttttctt agaatgcaaa actttcatta gttctctgaa ttgtgacact 1500 atttttgcat aacattgcaa agtcttcact tgttacacat acaagttttt atttatgttt 1560 ttaaataaat ttaatctgga gttgataaga ttatggcaat gtttttaaat tgttaattag 1620 aatgtagttt tctgagaaaa gataaaataa attgacctat atcttaatca attttaatat 1680 gaagctaata gaatagtgta aacttttttt atgttttttt ggaaatcata ataagatgta 1740 ggttttatta aatttttaag atgtagatga taatttttag atagattaga acatttacaa 1800 aaattttccc aaataaatat gacagttttt accataacaa tttaatttat gtatgccttt 1860 aatgtgtatt tgtttttact aagtatttaa tattattttt attaccttta ttgttatttt 1920 tttacttaaa tttatcaaca aaaaaaatta ccaatttttt tctgaattat attttgtatc 1980 tttgaatgaa tctcaaaatt gaaaattatg aaaaagaaca tttctttctc aaaattcaaa 2040 tttatggcca accaacaaga aagatctttg ttttatgagc agaaatattc ccttcaacat 2100 ttagaaagtt tcagacacca atttctattt gtgtttcaaa atgaccacaa ctatgcattg 2160 caaataatac ctgttgatca atcttattct aatacaatca atgataaaac atcaacagct 2220 tatttaaata aaaatgcgag ccttaaagat caaaatgtaa ttgaggttaa tatgcataat 2280 gaaacttctg acaaagtttt tattccactt tccatgacat cagaaaaatc tgtgacatca 2340 aaaaattcaa aagcaacttt gaacataact gcagcaactc ctgccataac taaaaaaaca 2400 gagacactta cctttaatga agttctgact ctattacaaa atgctactta ttttaataat 2460 ggagaagaaa ctatacctgt tcgtcccaaa gcaggagaaa tttatgggtt tacaatttcc 2520 agtgacaaaa cattaaaaaa gcatttgcga gcagatggct atatgtggag gaacaaaaaa 2580 agttgtgcat taggaaatgg tttaatcact aagacatatt tttacgcaag agttggtaaa 2640 acatatacta acaattttga aaagcatgaa tatatgttgc aaaatggaaa aaaagcattg 2700 attcattact tgggcaatga aaaatcatat gttgatagat gtcatggaaa ttcaacaaaa 2760 tcaaaatctt ttttttacag atcatcgcat tatgtttttg atcaagtttt ggctcaaggg 2820 aaaaataaac ccatggatgc atataggagc cttttatcaa atgacccagg tggtcatatt 2880 tcaacttcta ttattccaag aaatcctaaa caggttgcaa atattatgac aagaaatgct 2940 gaaaaaaata attttcataa taaggatgag gtttggaata gtcatcttgt tgcgtattcc 3000 attcatccat ttgttcgaaa aatagaaagc tacccagaca tctgtattgt tatgtgttgt 3060 gaaaaaattg ttgcacaaat ggaagaaatg ttagaattac taccttgtaa agaaattaca 3120 cttcactatg acacaacttt caatttagga aattattatt tgtcatgctt agcttatcga 3180 cacgtttgtt tggaaacaaa aaggaataaa tacttatatt cttcgcctac tgttcctttt 3240 gcatttttta tacatgaaaa aagattagca caaaatcaca aatacttttt tcaaacatta 3300 aaagaaaatg tttgcggaaa aaattttaat aaatccagga agttattcat ttctgatcgt 3360 gagtttgatg gctcccatta ttttgataat acaaaaagtt tggtatgttg gaaccattta 3420 ttgcaaaatg taaaggaatg gatgcgaaag aatggtaaat tttcatcatc agagtatgat 3480 gaagctgtaa aagccatgaa atacattctt cgatcaaatt ctgtttttga atatgaatca 3540 agacttaaaa gtctaacgaa aggaaatatc aacccttgga atgatcaaaa atttcaaaaa 3600 aactacattc aatacatgca ctctgatgtt atttctcgtg ctgctgcatt tgttttagca 3660 gaagaaggga taaagtaccc aaactttggt gttaccaata atgcgtcaga gtctttaaat 3720 aaagttatta aggatcttat tgacaacaaa ataacaaaaa tggactcttg cattttagct 3780 atgaattatc tcagtgccta ttttcacaga gaagttgaaa agggatacta taatgttggt 3840 ttgtatcagt taagtgatga gtttgccttt atgttaagat caagagatga acttagcgct 3900 aatcaaatca tcctaccaga tgacatagta aagcatataa aagaaactga aatacattac 3960 aaagatcata acaatatttc ttctccttat gcttctatag agatgttagt tgaaaaacca 4020 gttaaaagtg atcttccaaa acttgtttat tcaatatcct cattagcaaa ggatatatta 4080 aataaaaacg gagttgtatt tgttccccaa aatcgaagtt ggtcagtgaa agaatcagaa 4140 acaattattc attctgtaaa tttgaaccca cctgcacatt gtacttgtgg tgcctcgaaa 4200 aattgtgttc acattttagc agttcaatat ctctctggaa atgtaaatag tttagaacac 4260 tttgaaggga aaaaaagaaa cttaacttca cttatacaaa aaaataggct taaagttgag 4320 agaaaatcag gcaaaaaaaa accaaggaaa tgtgatattg attcgaattt aaaaacattg 4380 aatttttgtg aacttgcagt tccaaaaaat gaaagtttca gtataaagca gaatgagcaa 4440 tctactgtgc ctgaaataaa atatcctatt ccaagaggac ttcaatgggg tgttggtttg 4500 gttttaaaca gttgcacaat tgataacact ttaacaatgt tgaagttgcg ttgccatgca 4560 gataagtact ttttaaatcg ccttggtact tcagaagcag aaatagagct taaaaaagca 4620 atgagatatt taaataatga tgatgacaat tatggtaaag aaatctgggc atgcttttta 4680 aaaagtaaaa tacaaattac aagctcacct tgttttaact tatatggatc atctaaaaca 4740 ttttttttca accactttaa agaagcattt acattcgaga gtgttggaaa atgttttaca 4800 tgtttaaata gttcttttca tccttttgat tttatagaac tgcctccaaa ccatgaccca 4860 acagattatt tgctaggtta tttgaaagat aaaagattgc ttctaaattg tttagtttgt 4920 ggcaataaag aattaaatca ctgtggcacg acattaaaat ctcaaaatac ttggatgatt 4980 gtgattgagt gtgatattcc taacatttca atttcttcgc tgatgaattt gccaaaaaaa 5040 attttaattg aaaaatgtta cttttccttg gcatgcataa ctcttggact taatgctcac 5100 tttacatctt tgcactacat aagagatcaa tggttttttt atgatggtat gcaaaaaaat 5160 ggaaaactaa ccaagatgat accatataat tggcaaaata aagttgcaaa ccatgtcatt 5220 tatcttattg attaatgctt tttgaactga tatgaattgc actgtaaagt tttaagattt 5280 tagattcatt tttactatat agtgataata ttataactaa cacgatatat ttatgtttta 5340 taataaaaaa gttgtgtttt tttatttatt agagtaattt ctgttattta tatatagctt 5400 ttgaaacact tacagtgagt ttgcgaataa tttgaaagaa gttttagtta aaattttcct 5460 tgtgttagat atttttttat acccatttgt tttcatatag tatttagttt taaatacttt 5520 tagtaatagt agaaaagaaa atgtttttaa agttgttgct ttttttatca ataaatctgg 5580 tataattggt attgttttgt tttgcatcat aatacaaatt ttcataaaat agttatatat 5640 tgttatatat tttcatgtta tgtaaaactt ctttaacata gatagattta tgtaacatag 5700 acacccgaaa attgcgatat aggttgacat ctatttcaaa tgatatcaaa aattaacttt 5760 gaatatgggt gtatgtaaaa acactggcaa ccagcttgta tgccaatata taaaagtctc 5820 attgacccat gtttcaggta aacagcttat aaggtttttc ttttttattg ataaattcta 5880 aaaaataaat tattttttaa aatgtatttt agtgtatgct gatgtagctt ctaaaatgtg 5940 agagggcaag gaaattagac gttacagtac ttttcacaaa ctaatctggc tttagtctat 6000 atactacaag aagtaataag aaaaatcaat ttcagaagat taccaatcct ttcttcctgg 6060 catctttcaa aaactcttaa aagtgaattt ggcattgaca ggttatatac tttttttgtt 6120 gctggtacat aagatatttg tcatttttaa aatatagaca acccaatact tttcttattt 6180 tatatatact gtgggtgggt gaaataaaaa cattacttaa gaagactttt tactcattta 6240 ttcaaaagtg ttaatttata aggcataaat ttggcagtga tttttatttg acaattgctg 6300 tttcaaattt ttttaaaact caatcacttc atattcaatc caatttattt agccttatta 6360 ttaaatatta tgaaatattt atttatctgt agatactaaa tggcgataat tatgttatat 6420 cagcaaaacg agtaatgaac agttttagtc ttcaacggtt gaaattactc agcaaacatg 6480 aattcattga agaaactgat tgtcgttgct ctagcataaa gatcgttgca cgatcgacct 6540 gataaaagat gaaattactt atatatgtga ttaattaaaa aaagacaatt agtttgtaaa 6600 taatcaggtg aatggtagcc tttattatat ggctggttgt gttcagcaca ggatgttgtg 6660 catagttaat gtaacaatat cagtttttca gttccaaaat taaaaattat gtttcacata 6720 gtaaattggg ttttcttact cttttgcatt ttattcgttt ttgttatata ttatttaaca 6780 gtgctccaaa atatgaaaaa acgtttttat ttgtaatttt aaagaaactc tttttttatt 6840 cctacacttg acatcatcaa ttaatttaga aagtaacagt atcggtgcta aataattatt 6900 tctctaataa ctaactttta aaaaaaagtt ttaacattta aaaatttatt acggtctatg 6960 tgggtaccgc caatattttt ttaattcgca ttttatattt gctattacta ctaaaatttt 7020 taagttttta taaataatgg agctttaaaa tatatttatt aattacacag gtctgcaaat 7080 taaaactttt tttttttttt tggaatgggt gaattatatg tattttggta tgctgatttc 7140 aataatacca tccatttcga cccattacgt aaggattttc cgctaagtcg aaaataaaat 7200 atgggcaaag cccgccttac actactattt tagacaataa tcataagaag aaaaaaaatg 7260 gtaaaactat gcaaatttgg ttcattttaa atcataaaat aatttgtata cattatatac 7320 aactataaag ctttataaaa tctagttttt ttagtttcga agcttcgttt agagcttttt 7380 cttttgtggg atgttacatt ttccctatgc agcatccagc agtagtcgcc catcatattt 7440 gtatcccatc gaccttgatg ccttctctcc atttctttta tatcttggtg aaacctctca 7500 ccttgctctt cactaactgc accaagattt tccgggaagt aatccaaatg ggagttcaaa 7560 aaatggagtt ttgaactcat tgagcatccc agatttttag aattgcacaa catattttct 7620 acgatatttt ttaagtccga gtctttctga ttccctagaa aatttgttac tacgtcttta 7680 aaagaattcc atgctgacct ctcaacatag ttcaccatat tttcaaagtt gttatctttc 7740 attagctcac gaatgtcagg accaacaaaa gctccctctt tcagctttgc ttcagacaga 7800 ccaggaaact tgcttgaaaa aaatttaaat gtctcaccat cttttagcaa tgccttcaca 7860 aactgcttca tcatgccaag tttgatatgg agtggtggta gaaggacctt tttaggatca 7920 accaaagatt caaagtgcac gttttcatct ccgatcttca atgttctttt tggccacact 7980 ttttgcttcc agtgtttagt tttagcacga ctatcccatt aacaaagaaa aaagggtaat 8040 tttttgtaac caccttgctg tccaagtatc atagaaatta ctttaagatc cccacaaact 8100 gtccactcga attctttgta ctttaattaa tccaagatca atgccaagtt acaataagtt 8160 gccttcaaat gcactgagtg accaacgggt attgatgcgt acttattctt gttgtgaagt 8220 aaaaaagctt ttattcttct ttttgaagaa tcaataatct gtctccaact ttcagccata 8280 tgttgaactt tgaacatgga tattaaccca ttgttatcgg aacaatagac catctcacca 8340 tattgaggaa aataagggct caaaatattt attttgcaga tatgttttga atagcagatt 8400 gttttgagag tcctaaatct cggatcaggt tatttaattc agcttgaaaa aaatgttgag 8460 gactactgta gtcgcctaac tgaaactcgt cacaagaact ttcttctgat tcactttcta 8520 atgatggtaa atcatctaga ctttggggtg cttgtggtgt aggaatgcca ggtccgtgag 8580 ggactgggtg gagtgtcgat tccatgtttg gataaaaaat ttatttttta tttttggtac 8640 tatacccctg cacataacac gaacaaaagt agcagtcatc gctgtggttt cgaggctctc 8700 tccagattat gggtacacca aaacgaaaag atttaacttt accttttgac cacattctaa 8760 gttcttcaac gcaccctgaa caaactccat gcggagccca aaatttgtcc tgattgccta 8820 tcttcacacc aagatacaca agtacacatt tttaacaaat tcagtaatat ttcattgttg 8880 ttttgtatat atcgggattt tccatactta ctatggaaaa tcccgatata tatatggttc 8940 aacgcttact aatattatct tcatgagtta ctaaatattt tgttttagcc tacgaaacat 9000 aaaattatct tataataagt taataaataa aataataaat attttggcaa taaaccccgt 9060 tttgcgccct aaaataaaca atcccacagc attttgcgtt ttttagaaaa accttacatg 9120 atggaatttt tttgataact ttttcgaatt cagcgaccta aaatttataa gaattaactc 9180 tttaatctta gaaaaaaaat atgttgcaat taattgcccg cgtattttag agattacacg 9240 aggctttatt ctatttaatg aaataattga tgtgattcaa acccatatat atttaaaaca 9300 aggtatatta aaaaactctt tctaaaaagt ctttaactct ttaatgagac caaaaaagct 9360 aaaaggtata ttttgttctg tcatttgcga tgtttcaatg ttttggtctg tttaacatcg 9420 ctcacaaaag ctttgaatgc gggaaaagat tcgtagaagg cgcaaattat tggtagagat 9480 tctcaaagat atcttttgtc aatgttgctc agtgtgcaag gattcattct tacatgagtt 9540 tgaagcctac atagaatatt ttccaacctc gtataacaag aaatattatc ataaaagatg 9600 ttctctgctt aaggaccaag tttggttaga attgttgtag tgtaccagta gtttctctta 9660 cacaacaaca agggttttta tctttacaca gttttcttgt gcaattattt atgagaattc 9720 gcttgcttta tggttcaccc tggtcataat gaataactct aattaaaaat tttaatattt 9780 aattagttaa taatttttac gcaaattttt tcgataacac aactaaatat aaaattagca 9840 aaaaagtttc aaaataattt tataacatta atttttattt cgtctctttt tctattttta 9900 ttcgcgaaag tttgaaagga cagaaccaaa agccaaattt ggcttttgcc aagtttaaaa 9960 ggacagaagt ccttttaaac tttgcattta caaatgcgca ataaaaaatc cgggggatat 10020 ttaacttttc gttgtttttt ttgacggtta ccgaagtcta ttattacgcc atttttttcg 10080 gcatttcggc ttttcggcgt tttagaattc ggctttttag gcgatccc 10128 // ID BMMAR3 repbase; DNA; INV; 1307 BP. XX AC AF461149; XX DT 12-SEP-2002 (Rel. 7.08, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Bombyx mori DNA transposase gene BMMAR3, complete cds. XX KW Mariner/Tc1; DNA transposon; Transposable Element; BMMAR3; KW complete cds; transposase gene. XX NM BMMAR3. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1307 RA Robertson H.M. and Walden K.K.; RT "Bmmar6, a second mori subfamily mariner transposon from the RT silkworm moth Bombyx mori."; RL Insect Mol Biol 12(2), 167-171 (2003). XX DR Genbank; AF461149; Positions 1 1307. XX SQ Sequence 1307 BP; 411 A; 243 C; 297 G; 356 T; 0 other; actagtcagg tcataagtat tgtcacacag taaaaacttt tcttttagaa tgctggccac 60 aaaaaagttt attgaattcg aatttcgaat tgttcatgaa aataaaaatg tatactttta 120 gaattttact catttttaaa tatggagtgg acgcttaaag aagaccgtgt tgcagttatt 180 gcgttgcatc gttgcggtta cgcgccaatt caaattttta acatactgaa aaatttgaat 240 ataaccaaaa gattcgttta tcgtaccatc aaacgataca atgaagactc tagtgtagat 300 gacaggtcaa gaagtggtcg ccctcggtct gttaggactc cagcagtgat aaaagctgtg 360 aaggcgcgaa ttcaaagaaa tcccaaacgt aagcagaaac tgttggccct tcagatgggg 420 ttaagcagaa ccacggtgaa aagggtgtta aatgaagact tagggcttcg ggcatatcga 480 agaaaaacag gacatcgttt gaatgctcgt ctaatggacc tgagactgaa gagatgccgc 540 gctttgttga agcggtacgc gggaaaaaaa tatcgggaaa ttcttttttc ggatgaaaaa 600 atttttaccg tagaagagag ctacaacaaa caaaatgata aggtgtacgc acacagtagt 660 gaagaagcga gcaaccgtat tccgcgtgtc caacgaggtc attttccatc ctcgctcatg 720 gtatggttgg gagtttctta ttggggctta acagaggtac atttttgtga gaaaggtgta 780 aaaacgaatg cagttgtgta tcaaaataca gtcctgacga accttgtgga acctgtttct 840 cataccatgt tcaataacag gcactgggta ttccaacaag attcggcgcc agctcataga 900 gcgaagagca cacaagactg gctggcggcg cgtgaaatcg acttcatccg gcacgaagac 960 tggccctcct ccagtccaga tttgaatccg ttagattaca agatatggca acacttggag 1020 gaaaaggcgt gctcaaagcc tcatcccaat ttggagtcac tcaagacatc cttgattaag 1080 gcagccgccg atattgacat ggacctcgtt cgtgctgcga tagacgactg gccgcgcaga 1140 ttgaaggcct gtattcaaaa tcacggaggt cattttgaat aaactttagt gtcataagaa 1200 tctatgtttt gttaagttca ttttggtata tgaatggtta cataatgaat aaacttgttt 1260 caattatttt acattaaaca tgtgacagaa tttatgacct gactagg 1307 // ID SINE2-1_AP repbase; DNA; INV; 1770 BP. XX AC . XX DT 18-MAR-2009 (Rel. 14.03, Created) DT 18-MAR-2009 (Rel. 15.12, Last updated, Version 2) XX DE A family of SINE retrotransposons - a consensus sequence. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Interspersed repeat; SINE2-1_AP. XX NM SINE2-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1770 RA Bao W. and Jurka J.; RT "SINE retrotransposons from Acyrthosiphon pisum."; RL Repbase Reports 9(3), 661-661 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC Elements are 93% to the consensus. The 5'- 10-20 bp is the CC internal box-A found in tRNA. gene. XX SQ Sequence 1770 BP; 618 A; 206 C; 229 G; 716 T; 1 other; ttagattctg agtggaacga tgaatgtatt gattttacaa tgatgtgtgt ttttttttta 60 tttttttttt tattttattt tttttttttt gtgtctgtca tcacctttta ggacagtaaa 120 agtgcttgga ttttcttcaa cagtatcttt tctgatagga aagtgaatct agttggtact 180 ttgggggggt caaaagtaat ttaaaatttt tccaatagtt ttcaaaagcg ccgtgaaaaa 240 caaaagaaaa attaaggaaa aacgggaatt tttacgcaaa atctgttttc gagaaaattg 300 attttggttt ttggtgtaac tttaaaacaa atgaccgtag atacatgaaa ttttcactgg 360 ttgtttatat ttccattttc tatacatgat aaaattttca aaatattttg atttgttttg 420 aactgtttag ggacattttc agtttccaat tttattagtt tttttttcta tgaatgtcaa 480 taaaacttta tttgttgagt aaaaatactt gaaaatttaa tacaaggctc ctactatatt 540 gttacaatga catttgaaaa atattaaaaa tccttagtca caggtttttt tttattagca 600 tttaaagttc aaaaattgac aaaatatgta aaaatcacga aaattagcaa attattttga 660 gttaagaatt cgtaaaaatt tttcttttta aatctaagat ttgaaaatgt aatacaagat 720 tcctcataat attgtctacc tttatcaaaa aaaaaaaatg tctacaagaa agtcaaatta 780 aatttttatg agcgtttgaa attcatattt ttacaacatt tgatattcac tcgatttctc 840 atgtgacgat tttcttattt tattgtaatt aaaaaacgaa tgactgtaga tatttgaaaa 900 tttcactgaa tgtttatatt atcattttct atacaccata aaattttgaa aatattttga 960 ctctttttga gctgtttacg gacattgtca gttttcaatt tttttagttt ttttttctat 1020 aaatatcaat aaagttttat ctgttgggcc aaaaagtgta aaaatttaat acaaggctcc 1080 tgatatattg ttacaatagc agttgaaaaa tattaaaaat acataggcac aatttttttt 1140 tataagcatt taaagatcga attttgacaa aatttatcaa atttaaaatt gaataattat 1200 tttgtagtta aaaatttata aaatgttcaa cttttatatc taaggattga aaatttaaaa 1260 caagattcca cgtaagtagt taattctgtt accaaaaaat ctaaaaaata cataagcaca 1320 gtttattttt atagtcattt taagttcaaa tttggacgaa attacatatt aaaaaaccta 1380 gaataactat tttagttatt ttgttgtgat tgtataatat tattcgtggg tatacttgaa 1440 acttctaaag tatactatta tatatctatg atagtatcac ggtttgttgt tgatgtataa 1500 cgcgttataa gtacctaatg gatattgtga tatgattaat ttggaattta ttataggtac 1560 ctattatagg tcaatttttt tttaatacca tagataagta tatmtataat aaatatgtct 1620 tatacctaga ctgacatacc gtctccgctc agaatcgttt ttcttataca atgatattat 1680 atcattgaat tcaaatttaa taccatccat tatacagtga cccacttgta acctactgta 1740 cagcagagcg acatccactt acccaccttt 1770 // ID I-56_AAe repbase; DNA; INV; 6348 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-56_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6348 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1327-1327 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 8 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 396..1739 FT /product="I-56_AAe_1p" FT /translation="MAGVQSGPHGGPQNDPGGSVLVRRQPDWMLSEDELGQ FT TMVLLMRRKPEDAVSQTQASSSDHHKALPNPFIVSASIDVAVGLRESSKVT FT ITKEGRGTRYLLRTSSKAAMKKLCQISQLTDGTLVEVISHPTLNAVQGVVY FT EADSTDVDDDTLLKYLEPQGVKSVRRIIKRVNGVIRNTPLIVMTFHGTILP FT EIVLFGKLRVQVRPYYPTPMICFNCGSYGHSKKFCGQAGVCLQCSQVHSLE FT DGEKCSNAPYCKNCEGSHPVTSRTCQKYKDEQNIIRMKIDRNISSAEARKI FT YFAETTKETLASEVQKRLDQSESAKDKIIAELRAEIESLKTKLTTILNKQK FT AKNNTESRKEPGMAQTSTKPSSSESCQSLQRLSRKDQSFKSPPSMARDSAR FT RNESKNWQIDNERRTRSKSRKHAMEISPTDMNFKNIKRPSLPPDANGSTVE FT VDE" FT CDS 1798..6231 FT /product="I-56_AAe_2p" FT /note="endonuclease, reverse transcriptase, and FT ribonuclease H." FT /translation="MDKTTDTQRNIFHIGINENRFSMDNKSDYAFELHQNT FT RKPLPEAALSNSCIYDQLHNFQVEPSTSSKLLVIETSDSPSARPSTPTLTR FT DVPSLLDEPLAAVGVDWPSSRASILHSIASYYPTHHTSKRYVSATQTIADV FT GCNSSMHNKLQNLHNPQKMSSEADTAVTSDSPSARPSTPTLTRDVPSSLDE FT PLAAVGVDCPPSRASISQLPTKTLPTHRASKNVSENTLQQRENPLEYTLWN FT PNRQHVQEAPPSPTSTHSSLTETPSNRRRLAIQWNINGLYRNLSDLDVLIK FT GENLPTVLAIQEHHLRSNATRDLASLLRGRYTWRTLTGSNIFQGSALAVRI FT DEPHSNIPVDTPMLALATKLEGRFHCTVVSIYIPVQPYPTFEHDFDHLLSN FT LEPPFLILGDFNAHHRAWGSSRDDPRGRAIMRICEKYDLIVLNDGSDTYIS FT GQRRTAIDLSLCSSSIIGHLRWHVHRDPAGSDHCPIEILGNDSPPCTGRRR FT RWIFDRANWEAFESNIDSQIEMNREYSAVELGKIIVGAAKQHIPRTSKTPG FT RQAVPWWSDEVKNAIKLRRKALRVWKRTKSDDPFHDAKKERWLEARRVCRS FT IIKQAKESSWQHFIDGISPKSSTQEIWQRLNALRGKRRFSGFQLKVDGHYT FT DNPKDIADELGKYFSSISSSALYSESFLAKKTKIEVNPTSFSLDTLTSDES FT YNKAFTMSELLWALDHAKGKSCGPDLIAYPLLKRLPIHVKQKLLDSYNLLW FT VNGCFPDEWKNSTVVPIPKVGARSKDVQDFRPISLTSCNGKIMERMVNRRL FT MTCLRERKLLDPRQHAFCQGRGTSTYMTDLGNTLNQAINENLHIDIATLDL FT SKAYNRVWREGVLRQLQEWGLNGNLPLFIQSFLSKRCFQVAIGHQLSRTCI FT EENGVPQGSVLAVSLFLIAMESLFKVLPKDVLIFVYADDIVLVVVGNNATR FT VRIKLQAAIRAVARWANSTGFQMAAEKCSISHICNHRHNPWSREVTVDNIR FT IPYRKVTKILGVYFDRHLSFNHHFEQLKTNLKQRSCLVRTISRCQSTSNRK FT TIMNIGNSLILSKIFYGVEATCRNTEQFKKSLSPFYNKMVREASGLLPSSP FT TKSSCVEAGLLPFDSLLAKVVATRATSYAEKAKHKSPQCLLSTVANNLFKN FT LTRSRLPKVLQLIRIXDRIWNMPSPRIDWGLKHKLSTKDSPLKVQAEFRST FT VETNYKSHIKLYTDGSKLGNNVGFGVYGPGINKSLRLPNYCSVFSAEAAAL FT KLAVEEMYPSDAPVVIFTDSASVLKSIESGKSRHPWVQLIENKYKSNISFC FT WIPGHKGIAGNELADRLAAVGRTSQRQSTKVPGDDIRKYIASSIEIAWKVT FT WSKETDLFLRKVKGTTEKWNDREKRKEQRILSRLRVGHTKITHAHYLGSAR FT DAPECEVCLCRLTVEHLLVSCPIYDNYRNDVNLPASVRDILSNDSNTEEQL FT LLFLKNSQLIDKI" XX SQ Sequence 6348 BP; 1994 A; 1469 C; 1333 G; 1551 T; 1 other; acatcgagtt cgggatacta tacctagcta cacttgtgtc gcgatcgctc gtattatttt 60 gcaaaaactc gtcggttttt cgactaattt aatataataa gttagttcaa ctagagactc 120 tcgaaatacg gaacaattat cggtaaccag cacttgtgat aagggcaata tagtgcgtaa 180 aaaccacaaa atactactgc caaatagcgt cgccaataag caaacgaatg tgacttcgcg 240 ttattgcttt gcgttctatt gtgttgtgaa ctgaacaatc gtagtctgct tcaacccgag 300 tgcaagttga taataaaacg gtgcctatcg gggttgtgaa ataaaagaag tgggacgtat 360 aaaagtgtga aaaaaaaaaa cgtcatcgtg gcctcatggc cggggtccaa tcgggccccc 420 atggggggcc acaaaatgac ccgggggggt cagttttagt gagaaggcaa ccagactgga 480 tgttaagcga agatgagcta ggacaaacta tggtacttct gatgcgtcgt aagccagaag 540 atgctgtctc gcaaactcaa gcctcaagca gtgatcacca taaagcacta ccaaatccgt 600 tcattgttag tgcatcgatt gatgtagcgg taggtttaag agaatcatcg aaagtcacga 660 ttaccaagga gggtagagga acccgttatc tgctccgtac cagctcgaag gcagcaatga 720 aaaaattgtg ccaaatttca caactcactg acggtacttt ggttgaagta atttcccatc 780 ccacgctgaa tgctgtgcaa ggagtggttt atgaagcgga ttccacagat gttgatgatg 840 atactctttt gaaatatctc gaaccccaag gtgtgaaatc ggttcgccgt atcattaagc 900 gggttaatgg agttattcgt aacacaccgc tgatagtgat gacatttcac ggaaccatcc 960 ttcctgaaat cgtactattt ggcaaacttc gtgttcaagt gagaccctac tatccaacac 1020 cgatgatttg ttttaattgc ggctcctacg gacactctaa aaaattctgt ggccaggcag 1080 gtgtgtgtct tcaatgctct caagttcaca gcctagagga tggagaaaag tgcagcaatg 1140 ctccttattg caagaattgt gagggctctc atccggttac ctcacgcaca tgccaaaagt 1200 acaaggacga gcaaaacatc attcgcatga aaatcgaccg caatatttca tctgctgagg 1260 cccggaaaat ctattttgca gagaccacga aggaaacatt agcaagtgag gttcaaaaga 1320 gattagacca atctgaatct gccaaggata aaatcattgc tgaactgcgt gctgaaattg 1380 aatcattgaa aacaaaactc actactattt tgaataaaca aaaagccaaa aataataccg 1440 aatcacgaaa ggaaccggga atggcccaaa cttcaacgaa accatcatct tcagaatctt 1500 gccaaagcct acaacgattg tcacgaaagg atcaaagctt caaatcccca ccctccatgg 1560 cccgcgacag cgcaaggaga aacgaaagta aaaattggca aattgataac gaaagacgaa 1620 cgcgaagcaa aagcaggaaa catgccatgg aaatctcccc tactgatatg aactttaaaa 1680 acatcaaacg cccatccctt ccgccagatg ctaatggaag caccgttgaa gtagatgagt 1740 aaccagtagc acaagtttcg caaaatcaac cctcatataa cggacctcgc aatacatatg 1800 gacaagacaa cggataccca acggaatatc ttccatattg gaataaacga aaatcgcttt 1860 tcaatggata ataaatcgga ttacgctttc gaacttcatc aaaacacaag aaaacctcta 1920 ccggaagctg ccttatcaaa ttcctgtatc tatgaccagc tgcacaattt ccaagtagaa 1980 cccagtacct cgtcgaaatt attagtcatc gaaacctctg attcgccaag cgcgaggcct 2040 tccacgccga ccctgaccag ggatgttccg agtttactcg atgagcctct ggcggcagtc 2100 ggcgtggact ggccctcatc aagggcaagt atccttcatt cgattgctag ttattaccca 2160 acgcaccata cttcaaaacg ttacgtttca gcaacccaaa ctattgccga tgttggctgc 2220 aactcctcca tgcataacaa actgcaaaat ctacacaacc cccagaagat gtcttcagaa 2280 gcagataccg ccgtaacctc tgattcgcca agcgcgaggc cttccacgcc gaccctgacc 2340 agggatgttc cgagctcgct cgatgagcct ctggcggcag tcggcgtgga ctgccctcca 2400 tcaagggcaa gtatttctca attacccact aagactctcc caacgcatcg tgcttctaaa 2460 aacgtttcag aaaacaccct acaacaacga gagaaccctc tagagtacac cctctggaac 2520 cctaacaggc agcatgtgca agaagcgcca ccatctccaa caagtacaca cagtagccta 2580 accgaaacac caagtaaccg ccgccgttta gctatccagt ggaatattaa cggcttatac 2640 cgaaacctaa gcgacctgga tgtcttgatc aaaggagaaa acctaccaac agtcttggct 2700 atacaagaac atcatctgcg ctccaacgct acacgtgatc tggcttcctt gctccggggc 2760 cgatatacgt ggcgtacact gaccggtagc aacattttcc aaggatccgc ccttgctgta 2820 cgtatagacg aacctcactc gaatatacct gtggacacac caatgctcgc tctggcaaca 2880 aagcttgagg gtcgctttca ctgtaccgtg gtaagcatct atattccagt tcaaccgtac 2940 cctacatttg aacacgactt cgaccatcta ctgtctaatt tggagccccc ttttttgatt 3000 ctgggagact ttaatgcaca tcatagggca tggggttctt cacgggatga ccccagaggt 3060 agagcgataa tgcggatttg cgaaaaatac gatttaatcg ttttgaacga tggtagcgat 3120 acctacatca gcggacaaag gagaaccgct attgatttat cgttgtgttc cagcagtatt 3180 attggtcatc tccgctggca tgttcaccgt gacccagcag gcagcgatca ctgccctata 3240 gaaatccttg ggaatgacag ccccccatgc actggaagaa gacggcgttg gatatttgat 3300 cgagcgaact gggaagcctt tgagtcaaac atcgacagtc aaatcgaaat gaatcgggaa 3360 tattcagcag tagaattagg caaaataatc gttggcgcag ccaagcaaca catcccacgg 3420 acttcgaaaa cgccaggaag acaagccgtc ccatggtgga gtgacgaggt caaaaacgcg 3480 attaaattgc gtaggaaggc cctacgtgtc tggaagcgca caaaatcaga tgatccattt 3540 cacgacgcga agaaggaacg ttggctggag gcaagaagag tatgtagatc aataattaag 3600 caagctaaag aatcttcgtg gcaacatttt atcgatggta tcagccccaa atcatccact 3660 caggaaatat ggcagcgact gaatgcctta agaggaaaaa gacgcttttc tggttttcaa 3720 ctaaaagttg atggacatta tactgataat ccaaaggata tcgcggatga gttaggcaaa 3780 tattttagct ctatctcttc atctgctttg tactccgaat catttttggc caagaaaaca 3840 aaaatcgaag tcaacccaac aagcttctcc ctcgacacac tcacctctga cgaatcatac 3900 aacaaagcgt tcactatgtc agaactactt tgggcgctag atcatgccaa gggaaaatca 3960 tgcgggcctg acttaatagc atatccactt ctcaaaaggc tccctatcca cgtaaagcaa 4020 aaacttctcg atagctataa tctgttatgg gtcaatggat gctttccaga tgagtggaaa 4080 aatagtaccg tcgttccaat tcccaaagta ggagcaaggt cgaaagatgt gcaagatttt 4140 cgtcccatca gcttaactag ttgcaacggg aaaattatgg agcgcatggt gaatcgtcgc 4200 ttgatgacat gtctcagaga aaggaagcta ctcgaccctc gtcaacatgc cttctgtcaa 4260 ggtcgtggca cttcaacata tatgacggac ttaggtaata cactgaacca agcgataaac 4320 gaaaatctgc acatcgacat agctacccta gacctttcga aagcttacaa tcgagtgtgg 4380 cgtgaaggtg ttcttcgtca gcttcaggaa tggggattga acggcaattt acccctgttt 4440 attcaatcct tcttatcaaa acgttgcttc caagtagcaa taggacatca gttatcacgc 4500 acctgtatag aagaaaacgg agttcctcaa ggctcggtct tggcagtatc tttgtttttg 4560 atcgctatgg agtctctctt caaagttctc ccgaaagatg tgctaatatt cgtatatgca 4620 gatgacatag tcttagttgt tgtgggaaac aacgcaactc gagttcgcat caagcttcaa 4680 gcagctatta gagcagttgc aaggtgggca aactcaactg gatttcaaat ggcggcagag 4740 aaatgttcta tctctcacat atgtaaccac cggcataatc catggtctag ggaagtaaca 4800 gtggacaata ttcgcattcc atatagaaag gtgacaaaaa tactaggagt gtacttcgac 4860 agacatcttt cgtttaacca tcacttcgaa cagctaaaaa ccaatctgaa acagagaagc 4920 tgtttagttc ggacaattag tcggtgtcaa tccaccagta accggaaaac aatcatgaac 4980 attggcaata gtctgatttt gtcaaaaatc ttctacggtg tagaagcaac atgccgcaat 5040 actgaacaat tcaaaaaatc gctttcacca ttttataaca aaatggtacg agaagcctca 5100 ggattactcc cgtcgtcacc aacgaaatca tcgtgcgtag aggctggact tctcccattt 5160 gattctttgt tagcaaaagt tgttgccaca cgtgcaacat catatgccga aaaagcgaag 5220 cataaatctc cccaatgtct tttatcaacg gttgctaata atttgttcaa aaatctaaca 5280 agaagtcgac taccaaaagt tctacagctg atccgaataw ccgatcgcat atggaatatg 5340 cccagcccta gaatcgactg gggactgaaa cataaactct caacaaaaga ttctccactg 5400 aaagtacaag ccgagttccg ctcaacagta gaaacaaatt acaaatccca cattaaactt 5460 tatacagatg gttcgaagct cggaaacaat gtaggctttg gggtatatgg tccgggtatc 5520 aacaaatcgc ttcggttacc caattattgc tccgttttct ccgctgaggc agccgccttg 5580 aagctagccg ttgaagaaat gtatccatca gatgctcctg tagtgatttt tactgactct 5640 gccagtgtcc taaagtcaat cgagagtggg aaatcacgac atccatgggt acagctcatt 5700 gaaaataagt ataaaagcaa catatcattc tgctggattc ctggacacaa gggaattgct 5760 ggcaatgaat tggctgatag actcgcagct gtcggtagaa ccagccagcg tcaaagcacc 5820 aaggtaccag gtgacgacat tcgaaaatat attgcgtcat ctatagaaat tgcctggaaa 5880 gttacgtggt cgaaggaaac tgacctgttc ttgaggaagg tgaaaggaac tacagaaaag 5940 tggaacgaca gagagaagcg caaggagcaa aggattttat cccgtctgag agttggacat 6000 acaaaaataa cacacgccca ctatttagga agcgctcgtg atgctcctga atgtgaagta 6060 tgcctgtgtc gcctgacagt ggagcacttg cttgtatcct gcccaatcta cgacaactat 6120 cggaacgatg ttaacctacc tgcatcagta cgagatattc taagtaacga ttctaacact 6180 gaagaacaac tgttgctgtt tcttaaaaat agtcaactta ttgataagat ttgattgtat 6240 aggctaatgc aaaaccaatt tgtttccttt ttcttctcat tagaggtgaa tgaatttata 6300 aaatttaaaa cctcttaaat ataataataa taataataat aataataa 6348 // ID Rehavkus-2_TC repbase; DNA; INV; 2740 BP. XX AC . XX DT 30-APR-2006 (Rel. 11.04, Created) DT 26-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE This is a recently transposed Rehavkus-2_TC DNA transposon - a DE consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW Rehavkus-2_TC; Rehavkus group. XX NM Rehavkus-2_TC. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-2740 RA Kapitonov V.V., Gentles A.J. and Jurka J.; RT "Rehavkus-2_TC, a family of Rehavkus DNA transposons from the red RT flour beetle genome."; RL Repbase Reports 6(4), 195-195 (2006). XX DR [1] (Consensus) XX CC Rehavkus-2_TC belongs to the Rehavkus group of MuDR "cut and CC paste" DNA transposons. Transposons from this group are CC widespread in different metazoa, including insects, sea squirts, CC sea urchin and fish. The beetle genome harbors several ~95% CC identical copies of Rehavkus-2_TC. Its 580-bp inverted termini CC are composed of a 139-bp terminal inverted repeat and 148-bp CC subterminal minisatellite-like unit. The transposon is flanked by CC 9-bp target site duplications and encodes an incomplete 439-aa CC Rehavkus-2_TC transposase. XX FH Key Location/Qualifiers FT CDS 657..1973 FT /product="Rehavkus-2_TCp" FT /translation="TKTFSKILAKAILQIKRAGKTTVTFNSKFIRLRKVLK FT IYIETLEGAIPLVNSKNKDSMLVWKGEENIFPKYTNKNEDKLYSAVPKEAD FT DPLSLLEKTHKNSKEVETTNDDNDETMKDEEDRAVENWKGEGATKPKRPRR FT SYLENQKEWEFMELSARTKTIPISLIKNGNCSTLRTITINKKKFMFTNTCG FT FDALVQILSSCYCDCPAFREGVVDDNFLAELLQNLVKGKISRHSYILRAKI FT LSQIFDEVLRPDGLIEIPCVCTQQLLIDRLATNNFLEWSKKEITVCSKCGY FT NETRNYATIMLSDNNLVAELVTKTMEKTCFCPKCDEKYLNVKSNLNENYIF FT IDWASVSLTNTNRSVQLCQIPANIALTHKNLNYKLQGVVEIIGEGILASDV FT GHFIAYCKRPNEKWEIYDDLQDKVNTNCPPEKVLRGCALLVYRCL" XX SQ Sequence 2740 BP; 973 A; 437 C; 481 G; 849 T; 0 other; acgtgcgtta agttggccca accactttca aactttgaaa atgcacatac ggcattcaaa 60 tgtgctcgga aagtgctgaa gaatgctgct atttatttaa aaaatgctac tttagtttta 120 tagatttttt gtgttttttt ggttgggcta acttaacggt ggcccaacca ttttatgtgc 180 tggacgtttt gtttatgcca tttaaaagtc ctttaaaaat gctggcaaat gccgtatttt 240 aaataaaaag tgctgggaaa ataacggagt tttttaaatt taaataatgg ttgggctaac 300 ttaacgacag cccaaccatt catgtgctaa acgatttctt tatgtcaact aaaagatctt 360 gaaaaatgct ggcgaatgct gtactttaaa taaaaagtgc agagaaaaaa acgaagtttt 420 ttatttaaat aatggttggg ctaacttaac gacagcccaa ccacaccgag tgccttcgag 480 ttttttatgt caactaaaag accttaaaaa atgctggcga gtgctgtact taattcaaaa 540 agtgctgaga aaataacatt ggtttttaat atcgaaaatt ggtgggctaa cttcttaacg 600 gtggccggac agtccaatct tacaccgagt gctctacgat ttatttatac atatgaacta 660 aaacgttttc aaaaatactg gcaaaggcta tactgcaaat aaaaagagct gggaaaacaa 720 cagttacttt taattcaaaa tttattaggc ttagaaaagt tttgaaaatc tacatagaaa 780 ctttagaagg agctatacct ctagtaaatt caaaaaacaa ggacagcatg ttagtatgga 840 aaggcgaaga aaatattttc cctaaatata ctaacaaaaa tgaagataag ttatattcag 900 ctgtacccaa agaagcagac gaccctcttt ctttgctaga gaaaacacat aaaaattcca 960 aagaagtgga aacaacaaac gatgacaacg acgagacaat gaaagatgaa gaagatagag 1020 cagttgaaaa ctggaaaggg gaaggtgcaa ctaaacctaa acgtccacgg agatcttatc 1080 tggaaaatca aaaagaatgg gaatttatgg aactttcggc acggacaaaa acaattccta 1140 tatctttaat taaaaatgga aactgttcta cattaagaac aatcacgatt aataaaaaaa 1200 agtttatgtt tactaatacg tgtggatttg atgcacttgt tcaaatattg tcttcatgtt 1260 actgtgactg ccccgcattt agagagggtg ttgtagatga taattttttg gccgaacttt 1320 tacagaattt ggtcaaggga aaaatatcta gacattctta catactaaga gctaaaattt 1380 tgtctcaaat ttttgatgaa gtcctcaggc ccgatggtct aatagaaatt ccttgcgtat 1440 gtacgcagca attattaata gatagattag ctactaataa tttcttagaa tggtctaaga 1500 aagaaatcac ggtttgctct aagtgtggtt acaatgaaac cagaaattac gcaacaatta 1560 tgttaagtga caataattta gtagctgagc ttgttacaaa gacaatggag aaaacttgct 1620 tttgtccaaa atgtgacgaa aagtatctga acgtgaaatc aaatttgaat gaaaactaca 1680 tatttattga ctgggcatca gtttcattga ccaacacaaa tagaagtgtt cagttatgcc 1740 aaattcctgc taatatagca ttaacccata aaaacctaaa ctacaaactt cagggggtag 1800 tggaaattat cggtgaagga attcttgcta gtgatgtagg ccattttatt gcctattgta 1860 aaaggccaaa tgaaaaatgg gaaatatatg acgatcttca agacaaagtg aacacaaact 1920 gtcccccgga aaaagtttta agaggttgtg ccctacttgt ttatcgctgt ttgtaaaagt 1980 ataatattta tattattttt ttattgttta ttttatgtta aaatgaaatg tttttttaaa 2040 taaagtacta attattgtgt ttttattaaa caattccaat tcatgtgaat ctacttaaaa 2100 aaaataaacg atatatgatg tacctaagta agcactacag ctcctgcatt taagggggag 2160 aattttcgat attaaaaacc aatgttattt tctcagcact ttttgaatta agtacagcac 2220 tcgccagcat tttttaaggt cttttagttg acataaaaaa ctcgaaggca ctcggtgtgg 2280 ttgggctgtc gttaagttag cccaaccatt atttaaataa aaaacttgtt ttttttctca 2340 gcacttttta tttaaagtac agcactcgcc agcatttttc aagatctttt agttgacata 2400 aagaaatcat ttagcacatg aatggttggg ctgtcgttaa gttagtccaa ccattattta 2460 aatttaaaaa actccgttat tttcccagca ctttttattt aaaatacggc atttgccagc 2520 atttttaaag gacttttaaa tggcataaac aaaacgtcca gcacataaaa tggttgggcc 2580 accgttaagt tagcccaacc aaaaaaacac aaaacatctc taaaactaaa gtagcatttt 2640 ttaaataaat agcagcattc ttcagcactt tccgagcaca tttgaatgcc gtatgtgcat 2700 tttcaaaatt tgaaagtggt tgggccaact taacgcacgt 2740 // ID HOPPER_BD repbase; DNA; INV; 3131 BP. XX AC AF486809; XX DT 09-DEC-2004 (Rel. 9.11, Created) DT 09-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE Bactrocera dorsalis strain white eye transposon hopper, complete DE sequence. XX KW hAT; DNA transposon; Transposable Element; HOPPER_BD; KW hAT transposon; terminal inverted repeat. XX OS Bactrocera dorsalis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Tephritoidea; Tephritidae; Bactrocera; Bactrocera. XX RN [1] RA Handler M.A.; RT "A new hopper element from the hAT transposon family exists in RT the Bactrocera dorsalis white eye strain."; RL Unpublished. XX DR Genbank; AF486809; Positions 1 3131. XX SQ Sequence 3131 BP; 1092 A; 487 C; 565 G; 987 T; 0 other; tagtgttggg gactatcgaa tcaaaacaat caagttattc ggtggttcag caattttcat 60 tcattcgata gttttcaaga aatcactcat tactattttt tcgttcattc gaattttttc 120 gttcatacca atatttcgtt caaaacgaat attttgttcg atactaattc ttcgtttatt 180 actatttttg cattcattag tattttttcg ctattgaatg atttttcgaa tgaatgattt 240 tatgcaagta gatatctagc aaagattttt tgtacgaatt catttattcc atttcaagta 300 aatggtatta tattcagcaa atgtacgaat aatgctgaaa ttgtgtgtat ttcaaaatta 360 tgttgaatga tttatgtgtg ataatggaaa ttaaattgaa tgcatgtatg agtggtagaa 420 gttaatagat gcttaaaata gatttaagtg aaaagtatta ataataataa atatagactt 480 ataaagcgat aatgtaggag gcattataat taataagatg gctaaagtag attcatatgg 540 tcatcggtat ttggcaagat taaatgacag tttctagtcg gtttgaaaaa atccaatggc 600 tttgaaatta agcaggttgg caacttggtt acgtattgcg attcgaaatt gtagctatgt 660 atacattcac ttaaccgttt taactgtcgg tattatttgc agcatcaata gtaaaacatt 720 tgaattgggt cgcaacttat ataaatattg gttttagatt tttaattgca gtgttttctg 780 tggtaatata aaatgtattt atgtattttg taaagaaaat gctgaaaatg tttttataca 840 taattagata aaacaaaaat gcgccctgga ttaaagagat cagccgtgtg ggagtttttt 900 actaagaatg gtaataacgt caactgcaat atttgtaaaa aaaatttaaa gtttgctggt 960 aacacatcaa atatgaatga tcacctacga cgaaggcatc catcatcaca tagtgtgggg 1020 gagacttctt cggcgatagt cgaacgagca cctgctgtaa gtgacttaga tatgccaagc 1080 tgttcttcaa cgcaaagaga tgtgaactta acagtggaaa ctattgaaag tgagataagt 1140 agtgcttctt tggtacctcc tgtgaacgta gaaactacaa gttcttcaac aatacccagt 1200 agaaatattt caggtggtgg accattaaaa agaagggcaa tgcaaacaaa attatttgtt 1260 acaagcacgc gatctgagct ttcggaaact gaaaagagaa gtatagatga gtctcttatt 1320 aaaatggtta caagagatat gcaaccactc tctattgttg aaaatgaagg atttcgagaa 1380 tatacaaaaa agcttcagcc tttgtattca atcccaaaca gaaaacttct ttccaacaca 1440 atgttgcctt caaaatataa cgagacacgg aaaaagcttc atgccatttt gcaaaatatt 1500 tcacaccttt ctataacgac agatatgtgg actactgaca gccagaagtc ttttttaact 1560 gtaactagtc atttcatttg ggaaagcaaa atgaactcag cagtcctggc aactaaagta 1620 gtgttcggct cacatactgc tcaaaatata gccacagaat taaaaagcat ttttgacgag 1680 tggtcaattt ttaacaaaat agtgacgata gtgagtgata acggtgcaaa tattaaaaaa 1740 gcgataaggg atatactcca aaaacaccac cacccatgtg tcgctcacac cctaaattta 1800 tgcgttgtgg atgcgataaa gactgttcct cagattttag agcttataac gaaatgtaga 1860 gctatagtaa catatttcca tcatagctct caagcagcag aaaaactaaa aaatatgcaa 1920 aagcaaatgg gagtagctga acttaagatg aagcaagacg ttgctactag atggaactct 1980 ggccttataa tgatggaacg tatatgcttg ataaaagaac cgctctctgc tgtactaact 2040 tccttaccta gtgcaccgaa ctttctgaat gcatcagaat gggaaaggtt acgtgattcc 2100 atcactgtat tgaagccgat agagcatatg acaatagaac tttcagcaca aaactaccca 2160 acgatgtcac tagtggtgcc tatagtcaga ggactgcaat atgcaataag atcccagcaa 2220 atgaaaacta cggagggaga atgcttgaaa agtagtttgt tggaagtaat ttcgagacga 2280 cttggccaac cagagtctga caagatgtgt gccaaatcga ccttcttaga cccacggttt 2340 aaaaagattg cctttggaaa tgaaagtaac tctagtaatg ctcaaaaatg gttgggagaa 2400 gaggtatcag cattcataga gcggaaccaa aggactgcta cagccccagt catagaatta 2460 cccgcggata aaagtaaatt atctctttgg actctacttg accaaaaagt agcggaagca 2520 aaaactattt gtcacaacgc ccctagtgtt aatgcacata tttcgttgga acaatatctt 2580 agacaagatt tcgttgagag acatcaaaac ccgttaaact attgggacag caaaaaggca 2640 acttttccag aactctacga gctttccaac aaatatttat gtatacctgc tacttcagtt 2700 ccttccgaaa gggttttttc taaagctggg caaataataa atgatagaag aaatagactt 2760 aaaggtgaaa agctagatca aataatgttt ttaaatagca attttaatat ataattaata 2820 tttttttgcg ttcttctgat atttatattt atatttatat ttattttata tgatattgaa 2880 tatcttttct gtgctttact atttttatgt tttattaaaa tattgaatta aattaaacct 2940 ttgtattggt ttaattaaaa atgaataaaa tgccagtact tttttgttgt ttgaagagca 3000 cgcctttgct tagttattga atttgattaa ataaatgaat gaatgaatga acgaaaaaat 3060 tcattcgata gtcaaaatca ttcagtcatt aacaatcgat agttgaaatc actcgatagt 3120 tcccaacact a 3131 // ID Copia-97_AA-I repbase; DNA; INV; 4179 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-97_AA_; KW Copia-97_AA-LTR; Ty1_copia_Ele184; Copia-97_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4179 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1465-1992] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 118..2871 FT /product="Copia-97_AA-I_1p" FT /translation="MADAKVSVEKLNDQNYAIWKFKMRLLLTREKLLDVVI FT QPKPPAADATWVSNDEKAQAVIGLALEDAQLIHVMQKSTAKEMWDALKDYH FT ERSSLSSIIHVVRQLITLRMSDNGDMAEHLKQMTALRIRLSALGEDVKDNW FT FVALMLSSLPESYGGLIMALESRPDADLTVDFVKGKLLDEGRRRVENASLS FT EKALATGSWKKKPAEKPTVKKDKVCHYCKKEGHFRRDCRKLMQDRQEKEKS FT QQANIVVKSESGGELCLAAGVTGDAAVWYLDSGATAHMTNNSSILEEVDVA FT KATTICLADGKSIKSAGSGSGRLISVNGRGARMDVKLENVFHVPSLAGNLL FT SVSRIADLGFRVMFDKSECEVLKGEEVVLVGQRKGGLYHLKQLPEQAFLVD FT PKHGELCPHLWHRRFGHRDPQAVMKIVREDLGHGLKMGSCEVQSMCGCCCE FT GKMSRDSFPKASVTRSKVVGELIHVDLGGPMEVATPRGNRFYIALVDDYSR FT YSVVYPLQRKSDAEEKIREYCALLKTQFGRYPKCFRSDNGGEFAGASLMKF FT FSTNGIVVQRTAPYSPQQNGVVERKNRYLVEMMRCMLSEAGLDKRFWAEAI FT CTANYLQNRLPSSAIDRTPYELWSGKKPSYAHLRIFGSEAYVHVPKEKRRK FT LDLKAEKLVFIGYEEGSKAYRFLNLETDRVTISRDAKFLELCTVKEAVRRE FT PFATGGVLEVPLALPPEKVEETAEAPDLETDTESEVDDVNDSRYGSASEGD FT SSVHDFPLDENLRRSKRSTKGKPPSRLIEEIFVAESVSEEFEPGNLREALT FT CGKKKEWAAAMKDELKSHSDNDTWELVNLPPGRKAVGCRWVFKAKKNAAGE FT VTKYEARLVAQGYSQKYGEDFDEVFAPVTRYTTLRALPALASSRKMTLKHF FT DVKTAYLYGKLGEP" FT CDS 3051..4151 FT /product="Copia-97_AA-I_2p" FT /translation="MVCRLKRSIYGLKQSARCWNRRLDSVLREMGFRPSAA FT DPCLYTKVVNGKWVYLLVYVDDIMVGSEDESQIGAVYKALKKQFEMTDLGE FT VNYFLGLEIGHENGQYSVSLENYIQKVAEKFGLREAKSAKSPMETGYIRME FT DTSPLLSDTTAYRSLVGALLYIAACARPDVAACVAILGRKVSAPTEADWVA FT AKRVVRYLKTTKDWRLKFGSNDGLVGYCDADWAGDVGTRKSTTGFVFHYAG FT GAVSWVSRRQNCVTLSSMESEYVSLSEACQELVWLLSLFKDLGVQQSGPVT FT IREDNQSCIKFVGSERSTRRSKHIETKQHYVKELYEDKVLSLEYCSTDEMV FT ADILTKPLGTIKHRKLAGMLGLSA" XX SQ Sequence 4179 BP; 1075 A; 821 C; 1290 G; 993 T; 0 other; ataggttatt ggcccagtaa tcgtggaaag tgcgtgagtt tgtgtcgtcg cgagtgtgaa 60 ttgtgtggac aatttcctga accgaaaaat cgtgtttggt cgtgtcggaa agacaagatg 120 gcggacgcga aagtatcggt ggaaaagtta aacgaccaga actacgcgat ctggaagttt 180 aagatgcggc ttttgttgac acgggagaaa ttgttggatg tggtgattca accgaagcct 240 ccagctgctg atgcgacctg ggtcagtaac gacgagaagg cgcaggcggt catcgggttg 300 gcactggaag acgcccaact gatccatgtg atgcagaagt ccacggcgaa ggagatgtgg 360 gatgctctga aagattacca cgagcgttct tctctttcga gcatcattca cgtcgtgcgt 420 caattgataa cgttgcggat gtcggataac ggtgatatgg cggaacacct gaagcagatg 480 acggcgttgc gaattcgctt gtctgcttta ggagaggacg tgaaggacaa ctggttcgtg 540 gctctgatgc tgtccagtct gccggagtcc tacggtggcc tcatcatggc tctcgaaagt 600 cgaccagatg cggatctgac tgtcgatttc gtgaagggca agctgctaga cgagggaagg 660 cgtcgtgttg aaaacgcgtc gttgagtgag aaggcgttgg cgaccggatc ttggaagaag 720 aagccggctg agaaaccaac agtgaagaaa gacaaagtgt gccattattg taagaaggag 780 gggcacttcc ggcgcgactg tcggaagctg atgcaggatc gacaggagaa agaaaagtcg 840 cagcaagcaa acatcgtcgt gaaatcggag tcaggaggag agctttgtct tgctgctggc 900 gtgactggag acgctgccgt gtggtatctg gattcgggtg ccacagcgca tatgacgaac 960 aactccagca ttcttgaaga ggtggatgtg gcgaaagcga cgacaatctg tctggctgat 1020 ggaaagtcga tcaaatcggc cgggtccggc tctgggcggc taatttcagt gaatggaaga 1080 ggagcccgga tggatgtgaa actggagaac gtgttccatg tgccatcgtt ggcaggaaac 1140 ttgctgtcag tcagcagaat tgccgacttg gggtttcgag tcatgtttga caaatccgag 1200 tgcgaagtgc taaaaggaga agaagtagtg ttagtgggtc aaaggaaggg tggtctatat 1260 catctgaaac aacttccaga acaagcgttc ttagtggatc caaagcacgg cgaactgtgt 1320 ccgcatttgt ggcaccgtag atttggccac agagatccac aagctgtgat gaaaatagtg 1380 cgtgaagacc tggggcacgg gttgaagatg ggctcttgtg aagtgcaatc gatgtgtggc 1440 tgttgctgcg aggggaagat gagcagagat tcttttccga aggcatctgt gacaaggtcg 1500 aaagtagtag gagagctgat tcacgtcgac ctaggaggtc cgatggaggt agctacacca 1560 aggggtaatc gcttttacat tgctctggtg gacgactaca gccgttactc cgtcgtctac 1620 ccgctccaga ggaagtctga tgctgaggag aaaattcgtg agtattgcgc tttgttgaaa 1680 acccaatttg gtcgataccc gaaatgtttt cgctctgata acggtggtga gttcgcgggt 1740 gcttcgttga tgaagttttt ctcgacgaat ggcatcgtgg ttcagcgaac ggcaccgtat 1800 tcgccacagc aaaacggtgt ggtggagcgc aagaacaggt accttgtgga gatgatgcga 1860 tgcatgctgt ccgaagcggg actcgataag cggttctggg ccgaagcgat ctgcacagca 1920 aattacctgc agaatcgttt gccatcgtct gcgattgaca ggaccccgta cgagttgtgg 1980 agcggaaaga agccgtcgta tgcgcatctc aggatatttg gttcggaagc gtatgtccac 2040 gttccgaagg aaaagcgccg taagcttgac ctgaaggctg agaaactggt gttcatcggg 2100 tacgaggaag gaagtaaagc ctatcggttc ttgaatctgg agacggatcg tgtgacgatc 2160 agcagagacg caaagtttct tgagctgtgt accgtcaagg aagctgttcg tcgtgagcct 2220 tttgcgacag gaggagttct tgaagtccct ttggctttac cgccggagaa agtggaagaa 2280 actgctgaag cgccggattt ggagaccgac acagaatctg aagttgatga cgttaacgat 2340 tcccggtatg gtagtgcttc tgaaggcgat tcttctgttc acgattttcc gttggatgaa 2400 aatttgcgtc gttcgaagcg gtcgacgaaa gggaagccac cgtccaggtt gatcgaagag 2460 attttcgtgg ctgaaagtgt ttccgaggag ttcgagcccg ggaacttgcg agaagccttg 2520 acctgtggaa agaaaaagga atgggcagca gcgatgaagg acgaattgaa atcacattcc 2580 gacaatgata cgtgggagtt ggtgaatttg ccgcccggcc ggaaagcagt tggttgtcgt 2640 tgggtcttca aagcgaagaa gaatgcggct ggtgaagtga ccaagtacga ggcccggctg 2700 gtagcccaag ggtactccca gaagtacggg gaagattttg atgaagtttt cgctccggtg 2760 accaggtaca caactcttcg agctcttcca gctttggcga gtagcagaaa aatgacgttg 2820 aagcacttcg acgtgaaaac cgcctacttg tacggcaaac taggagagcc ataaaactaa 2880 gtcttgatgg tatgtttgat tgaaaagtac gccgtcacgc caactccgga cctgacactt 2940 ttttcgcagc tttggtgtag gaacgagcaa ctgattttct tcttgatgac cgtgatgagg 3000 agttgtttac gaggcaaccg cctggttacg aagtcgttgg gaaagagcac atggtgtgtc 3060 gattgaagcg tagcatttat ggcctgaagc agtcggccag gtgctggaat cgtcggctgg 3120 atagtgtttt gcgtgaaatg ggatttcgac cgagtgcagc agacccctgc ttgtatacca 3180 aagttgtgaa tgggaagtgg gtttatctgc ttgtgtatgt tgatgacatt atggttggaa 3240 gtgaagatga atcccagatc ggtgcagtgt ataaagcgtt gaagaaacaa tttgaaatga 3300 ccgatcttgg tgaagtgaac tattttctcg gacttgagat tggacatgaa aatggacaat 3360 acagtgtatc gttggaaaat tacattcaaa aagtggcgga gaaatttggc ctccgtgaag 3420 cgaaaagcgc caaatcgcca atggaaaccg gttatattcg gatggaggat acaagtccgt 3480 tgctatcgga cacaacagcc tacaggagtt tggttggcgc gctgctatac atagctgcgt 3540 gtgctagacc agatgtagcg gcatgtgttg cgatacttgg ccggaaggta agtgctccaa 3600 cagaagcgga ctgggtggcg gcgaagaggg tagttcgcta ccttaagact acgaaggact 3660 ggagattgaa attcggtagc aatgacggtc tggttggcta ctgcgatgcc gattgggctg 3720 gagatgttgg aacccgaaaa tcaacgacag gattcgtatt ccactatgcg ggtggagcgg 3780 tttcgtgggt gagccgtaga caaaattgtg tgacgctatc atccatggaa tccgaatacg 3840 tgtcgttaag cgaggcatgt caagagctgg tgtggttgct gtctctgttc aaggatcttg 3900 gtgtgcaaca aagtggtcca gtcactattc gagaggataa ccaaagttgc ataaagtttg 3960 ttggttcgga gaggtctacc cgtcgttcaa aacacattga gaccaagcag cactacgtga 4020 aggagttgta cgaggacaaa gtgttaagct tggaatactg ttcgaccgac gagatggtgg 4080 cagatatatt gacaaaacca ctgggcacca tcaagcaccg gaagttggca ggaatgctgg 4140 gtctatccgc ttaaggtgga ctcgttcgtt gaggaggag 4179 // ID I-54_AAe repbase; DNA; INV; 4454 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-54_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4454 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1325-1325 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >97% CC identity. Both termini are uncertain. XX FH Key Location/Qualifiers FT CDS 403..1821 FT /product="I-54_AAe_1p" FT /translation="MGGHPGXFRGPGPSNMLGGQFTGRSFPEWGDSQESHG FT NLVLLRMEAANGSLPNAPFLLRKSVEAYLGVPIDGAFPEAKGASYVLKLRS FT DRLVQRLKNMDQLADGTVIKISEHPVLNKTKCVISCKDTVGYSDEELKAEL FT QDQGVVDIRRITRSEGKERVNTPTVILTINGTVAPEYIEVGWLKCRTRPFY FT PAPMLCFGCYNFGHTKLRCQQQLPVCGNCSENHQSDRETPCKAAAFCSRCE FT SSDHSLSSRKCPTYKVEEEIQHLRVDLGISYPAAKQAFEQRKGKTTYSSVV FT KSDQDKKFEDLAMKVEKLQHDMLNKDGKIEALIQQVEARDLEIKKRDARIG FT QLETALRAGPQERLSIAQEYGTIEDLLQKLKTLEQKLEQKNKEVITLRKIY FT TPESLNILPLVTQASSKTSTTHPTAEKKSYNKTTEHTKQEKMTSFELERKK FT LGTLSSTRNKSKTRDSPKSKRQKKRCVG" FT CDS 2075..4453 FT /product="I-54_AAe_2p" FT /note="endonuclease, reverse transcriptase, FT C-terminal truncated." FT /translation="MNERSRLTEHEHQNEENAPTTLIARSPESALPNPPRS FT QNTHQNPYLIPTEARTNALLYDTFQLVTRNIRSPGPSNSWHHVDTVTRNHF FT EAERRKETSSAVKHSHSTTPYLKYPVEPLGSRGPVSAEVMGSPELADNLLR FT PLASGRGTDKGTCSYSPSPLAGGSMQGKPSSPPQDDVGCFRYQSYSNESLE FT TTSISTDHHSNSDDHTNKKAFALQWNINGFHNNLADLEMLVRELCPTVIAI FT QEVHRASKTGMGRTLSGGYNWELKINQNIYHSVAIGILAHISFTAIDLDTT FT LPIIAIRIDSPLPISIVNFYLPCSKIDNLDQQLGKIMEKIPEPRLLLGDCN FT GHHPAWGSSSTNLRGRIMTDMAEVTDSVILNDGSKTYISGGRESTVDITMV FT CSSVANRFLWKVHDDPLGSDHRPIIILLNDQPPKTSRRPRWVYDQADWASY FT QSAIEATLDTTEPTTISNFVKSIHQIASDTIPKTSSRPGRKALPWWSPETK FT KAIKARRKALRKFKRLPDSHPQKEEVHLIYRQARNNCRQVVRDSKKQSWSK FT FLDSINDQQTPAELWRRVNALNGKRRLKGFEIHHQGSSTSEPDIVANVLAK FT YFAELSSINRYDPKFCRVKKVSLTSLNHVQIPADNNXEINVPFSLAELNFA FT MKTAKSKSVGPDEIGYPMLKQLTERGKRCLLQLLNKEWNNNTLPREWKHSL FT VVPIPKNQAPSNEPNNYRPISLTCCISKVLERMVNRRLIRYLNEGDFLDHR FT QHAFRPGHGTGTYFATLGQVLDDALKRDDHVEVATLDLAKAY" XX SQ Sequence 4454 BP; 1419 A; 1044 C; 970 G; 1013 T; 8 other; agttcgacag ttggtcttgt gacgaacagt cgtgtttttc tctgcgctga tagactgtgc 60 gacwctkctc caatcgaaat ctgttttcta cgtgtccgag agatataaaa ggattgtgca 120 ctcagtgggt gtcccgtcga gcgaacaatt tctgcgacat ctgtttcgtg gcataaagaa 180 aagtgctaat catctgaaat caaagccaca aaaaagcacg cgggtgcatw tcgcacataa 240 tagtatagac aggtaccaaa gtgaacaaag gcaatagaca atagtgttgt gctaattgtt 300 ctgtgtgacg atacgagccc gataacgccg tatcggacgg tggagatttt cgtagaattg 360 taaagtaaag tctgaacacg agtggaagta ttctcgacaa aaatgggagg ccaccctggm 420 mmmtttcggg gcccgggacc gtcaaacatg ctaggagggc aattcacagg ccgcagtttt 480 cctgaatggg gcgactccca ggaaagccat ggcaatctag tcttgctgcg aatggaggct 540 gccaatggat ctttaccgaa tgcaccattt ctgttgagaa aatcggtgga ggcatatctt 600 ggcgttccaa tagacggagc ctttcccgag gccaaaggag cttcctatgt gctgaagctt 660 cgaagtgatc gtctagttca acggttgaag aatatggacc agctggcgga cggtaccgtg 720 attaaaatta gcgaacaccc tgtcctcaac aaaactaagt gtgtaattag ctgcaaggac 780 acagttggtt actcagatga agaactgaaa gctgagctcc aggaccaagg agttgtcgac 840 atccgacgta tcacccgctc ggaaggaaaa gaaagagtca acacaccaac agtcatctta 900 accatcaacg gaacagtggc tccggaatat atcgaggtag gttggctgaa atgccgaacg 960 cggccttttt accctgcccc gatgttatgt tttgggtgct acaatttcgg acacaccaaa 1020 ttacgctgtc agcagcagtt accagtatgt ggtaattgct cggaaaacca tcaaagcgac 1080 cgtgagactc cttgcaaggc agcagctttt tgcagtcgtt gcgaatcttc tgatcactct 1140 ctaagctcac gaaaatgccc tacgtacaaa gttgaggagg aaatccaaca cctacgtgtg 1200 gacttaggta tctcataccc ggcggccaaa caggccttcg agcagcgaaa gggaaaaact 1260 acgtactcct ctgttgtcaa aagtgaccag gataagaaat ttgaagatct agccatgaaa 1320 gtagaaaaac tccaacatga catgctaaat aaggatggca aaatcgaggc acttatccaa 1380 caagttgagg ccagagacct ggagattaag aagcgcgacg caagaatcgg acagctagag 1440 acggctttgc gagctgggcc ccaggaacgg ctttccattg cacaagaata tggaactatc 1500 gaagacctcc tccagaaatt gaagacgctg gaacagaaac tggagcaaaa gaataaggaa 1560 gttatcacac tgcgaaaaat ttacacacca gaatcgctca atattctccc gttggtcact 1620 caagccagtt cgaagaccag caccactcac ccgaccgcag aaaagaaaag ctacaacaag 1680 accaccgaac ataccaagca agaaaagatg accagctttg aattggaaag aaaaaaatta 1740 ggtacactct cgagcaccag aaataaatca aagacaagag acagtccgaa gtcgaaacga 1800 caaaaaaaac gatgcgtcgg atgatgacaa taacgataat tcgacgataa cgataccaga 1860 gatcatcgaa gtcgattacg acttctcgtc gaacgacgaa acgatgatgg aagaatcttc 1920 aaactaccca taaataatcg aaatcctctc cagtcatcta tcatatctgt aatacacaga 1980 aaccttctaa ttaattttca agcccattca agcaaaacta gaagacatta cgcagactgg 2040 gaggaacacc accgacaacg actttttttc atatatgaac gaacgatctc ggttgacgga 2100 acatgaacac caaaacgagg aaaacgctcc aaccacatta atagccagat ctcccgaatc 2160 agctttaccg aatccaccaa gaagccaaaa tacccaccag aatccatatc tgataccaac 2220 tgaagcaagg acaaatgcat tgctgtatga tacttttcag cttgttacta gaaatatccg 2280 ttcaccaggt cctagcaatt cctggcatca tgtcgataca gtcactagga atcacttcga 2340 agcagaaaga agaaaggaaa cttcctccgc tgtaaagcat agtcattcga ccacacctta 2400 tctcaaatac ccggtggagc cattgggtag taggggcccc gtcagtgcgg aagtcatggg 2460 ttcaccggaa ctggcggaca acctcctacg ccctttggcg tccggaagag ggacggacaa 2520 gggaacatgt tcctattccc cttcgccact tgctggaggg tcgatgcaag gaaaaccgag 2580 ctctccccct caggacgacg tgggatgctt ccgatatcaa tcatattcga acgaatcact 2640 cgaaactact tctatcagta cagatcacca ttctaactcc gacgaccaca ccaataaaaa 2700 ggcatttgca ctccaatgga acataaatgg attccataat aatcttgcgg atttggagat 2760 gctggttagg gaattatgtc ctacagtgat tgctatccag gaagttcatc gtgccagtaa 2820 aacaggaatg ggcagaacac tatcgggtgg ttacaattgg gagctaaaaa taaaccagaa 2880 catctatcac tcagtagcca ttggtatact agctcatatt tcgttcaccg caatcgacct 2940 tgataccact cttccaatta tcgccattcg tatagattcg ccgcttccca tatctatagt 3000 gaatttctat ctcccatgta gtaaaatcga caatctcgac caacaacttg gtaaaattat 3060 ggagaaaatt cccgaaccaa gactactcct cggagattgt aacggtcatc atccagcgtg 3120 gggtagttca tctacaaacc ttcgtggtag aattatgacg gatatggcag aagtgacaga 3180 ctctgtcata ctgaacgatg gaagcaaaac atatattagt ggcggaagag agtctactgt 3240 cgatataact atggtatgtt catctgtcgc caaccgtttt ctttggaaag ttcacgatga 3300 ccctctcgga agtgaccacc gaccgataat catcctcctt aatgatcaac cgccaaaaac 3360 gtcccgtcgg ccccgctggg tttatgatca ggcagactgg gcttcatacc aatctgcaat 3420 cgaagcaaca ttggacacta ccgagccaac taccatatcg aacttcgtga agtctattca 3480 tcaaatagca tcggatacaa tcccaaagac cagttcaagg cctggtcgaa aggctctccc 3540 ttggtggtca cctgaaacaa aaaaagctat aaaagctaga cgaaaagccc ttcgtaagtt 3600 caagcgacta ccagatagtc atccacaaaa ggaagaagta catttgatat atagacaagc 3660 gagaaataac tgccgacagg ttgttagaga ctctaaaaag caaagctggt ctaaatttct 3720 agatagcata aatgaccaac agacccctgc tgagctgtgg cgcagagtaa atgcccttaa 3780 tggtaagagg agacttaagg gctttgagat acaccatcaa ggctcttcaa caagcgaacc 3840 agacattgtg gcaaacgtgc ttgctaaata ttttgcagaa ttgtcttcta taaaccgata 3900 cgatccgaaa ttctgccgag tgaaaaaggt atcactgact tcgcttaatc atgttcagat 3960 acctgcagat aacaaccmcg aaatcaatgt tccgttttca ttggcagaat tgaattttgc 4020 aatgaaaacg gcaaaaagca agtctgtcgg tcccgatgag attgggtatc ccatgctaaa 4080 gcaactcaca gaaagaggta agcgctgcct gttacaactt ttgaataaag agtggaacaa 4140 taatactctc ccacgcgagt ggaaacatag cctagttgtt cctataccga agaaccaggc 4200 tccctccaac gaaccaaaca attatcgtcc catatccttg acttgctgca tttcaaaggt 4260 attagaaaga atggtcaacc gacgcttaat tcgctacttg aatgaagggg actttttaga 4320 ccatcgacag cacgctttta gaccaggtca cggaacagga acttatttcg ccactcttgg 4380 tcaagtgctg gacgatgcgt taaaaaggga tgatcacgtt gaagtagcca cactggacct 4440 tgccaaggct tata 4454 // ID Gypsy-22_DWil-I repbase; DNA; INV; 6217 BP. XX AC scaffold_181130; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_DWil_; KW Gypsy-22_DWil-LTR; Gypsy-22_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-6217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181130; Positions 413095 406879. XX CC Positions [5075-5581] - Integrase core CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 1422..2393 FT /product="Gypsy-22_DWil-I_2p" FT /translation="MDSGNASVSLPTTISTPNVTPTISSIANNPTSLSPTD FT ILAFIKELPTFDGIPNKLQDFIVNVEEVIALIRGTDQTPYSSLLLRAIRNK FT IVGRANDALKLSNTALTWDDIKANLIQHYGNKKSEAMLIRDLQNFPDSLSL FT GQLFYSILKIRSQLTDILQNTNQNSTDKPKLYDEICLNTFLINLREPLKTL FT IKSKNPQNIKEAYEHCRVEKTFYSQKPNTSQNFRAQVTPYLRPQNIRNHPN FT NNPYSNNRSQLFRNNSTISQPNINNQANNNYHNRNSTVNPFNSTGPTRNQL FT HNINEQQSSTELCNINDAGNFFETASENQQGT" FT CDS 2630..5899 FT /product="Gypsy-22_DWil-I_1p" FT /translation="MKIDLLLFDFHPHFKGLLGMDVLRKLKANINLASFCL FT ETPKSCLKISLQDNPVTEVHIIPASSKSLITLPVRMKEGNFYSQQTMYNKH FT LEISEGLYNSVNGYSKMVIVNNSLEDQQLYLEQPLETIPYDQNHYIEIYNI FT NSDNQSLPHVHKYVTKLKTDHLNSEEKIALIQLCSKYTDLFYDDNQKLTFT FT SDIKHYIRTTDDTPIFVKSYRYPYALKKEVKDQIDSMLKQNIIKNSYSPWS FT APVWVVPKKSNSLETQKWRLVLDYRKLNEKTISDRYPIPNINEILDQLGKS FT NYFSTLDLASGFHQIEMNQADRSKTGFTVEGGHFEFIRMPFGLKNAPATFQ FT RVMDHVLGDLVGSICLVYLDDIIIFSPSLQKHLIDIRQVFDRLRSANLKLQ FT IAKSDFLRREVNFLGHIVTQDGVKPNPNKIKSIQEFPIPKNRRQIKSFLGL FT LGYYRKFIKDFARITKPLTQQLKGTKAVTIDEEYCKTFEFCKTLLCNNPIL FT TYPNFKEPFILTTDASNYAIGAVLSQKLSGSDKPISYASRTLSDEETRYSA FT TEKEMLAVHWAVKHFRPYLFGKKFILVSDHKPLTWNTGPFSTNDKVARWKL FT QLLEYEYEFQYKKGSQNVVADALSRIRPEANIIVNNIDVENFSNNKVTSSL FT YNLASSIIIVEKPLNDFNLQIVLAEGSPASYQVSTPFKNKLRRTITEPRLN FT SDTVLDILKKILKINRTLAMYTSDSIFQIVLDVFSKHFSYNKLFKLLRCTT FT LLDDITNTTDQEKIVNDYHNNNNHRGIDETLMHLKRKVYFPHMKSKIAQII FT KGCEICLTLKYDRQPQKPPFEVPETPCKPMDIVHIDIYTINKNYNLTLIDK FT FSKFACAYPIPNRTCINVVTAMKHFFSLFGIPKKIVYDQGAEFSGSIFKDF FT CNQYDIQCHITSFQQSSSNYPVERLHSTLTETYRIITELKKKNKQINKHEE FT TLSETLLTYNNAIHSSTKYTPFELFTGRTPIFEQTIKYDTNHEYLQKLNFF FT QSKLYQEVKEILHKNTKSKLDKLNETRKSAAPLSTNESIFRKENRRNKLTP FT RFSKHKVLRDNGNTFITTRLQKIHKSKIRRTTK" XX SQ Sequence 6217 BP; 2426 A; 1213 C; 896 G; 1682 T; 0 other; tctagctccc ggtggtaatt ggcgcagctg ggtcaggaat agacctactt aatgttcaca 60 ttgagaacta tgtcttaaaa gcattggtaa ttggaacgga atcaattgtt gataatcgtt 120 tacatggcaa aaccaatcat ccccaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaattc 180 ataaattaac ttccaatttt tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg 240 aatatgtata tatgtacatg gatgcatata cacatgtgta catgtaaatc ttataattca 300 atggaaatgt ttatatgctg caaagctaca agatctcttt tacgaaatac acattgtttg 360 tcgctcattc tgctttcaat ttcataattc ggcggaattg ccaaaacgct acaaaaccac 420 aatacttagt attgtatttc gggcgaaata gatctgatag cagaaccacg ctgaacaaac 480 gacgcgtcgt cgcgacgcta aaaagctagc aatttctggc gaaactgatc agctagctga 540 gctgcgatgg gcagacgaag cggcaaagcg gcacattcta gtgaaatcga tctgctaagc 600 agagctgcaa ttggcaaacg aagcattcat aagtagtaac gttttattgt aggctcattt 660 acaaaagtga gctcgagttg caaagacgtt aataaaatta gaagttcagt ttaaagttgt 720 gacaccaagt ttaagaacaa tatactatta attactatta acaatatact attaatacat 780 aataaaagta ataaaggttt gttacaaaat atttttaaag aacattaaga acaatacaaa 840 cgatgcgatg ttagtgccat acagtgaaac aaaaaatata catatatata tatatatatg 900 tgtaccacct atgtgtgtat atacatgtat atatatatag atgtatatgt gtgcgggtgt 960 gtgttgtgtg tatgtgtatg tccatagaca attataatta tgttcaaaac attgcagaat 1020 tttaaaacac tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacaca aaaaaaaaaa 1080 aattaaaaaa atattagaaa taaaaaaaaa aaaaataata ataataataa ggataataat 1140 acagatgtat attagtgtac gtgtgtgcat gtgtatgtgt gtgtgtgtgt atgtgtatat 1200 atatgtatgt gtgtgtcggt gttacatata tgtacatgta tataaataaa tataataatg 1260 tttaaaacat tacagaattt agaaacactc acgtcgacaa atcacgtcaa cataaatgcc 1320 aaaaaaaatc tacattccaa gcagccacaa aaagaacatc taccagacga acggcaagac 1380 tcatttatga agatccagtt cattcaccca acgacgtagc aatggattct ggaaacgcct 1440 cagtgtcact tcccacaact atatcgactc caaacgttac gcctacaata agcagcatag 1500 ccaacaaccc aacatcttta agtccaacgg acatccttgc ctttatcaag gagctaccta 1560 cttttgacgg catccctaat aaactgcaag acttcattgt caatgtggaa gaagtcatcg 1620 ctttgatcag aggaactgac cagacaccat acagctccct tctgttaaga gccatcagga 1680 acaaaatcgt aggtagagct aatgacgcct taaaactatc caatactgca ctcacctggg 1740 atgacatcaa ggccaatctc atacagcact atggtaacaa gaaaagcgaa gctatgctta 1800 ttcgcgattt acaaaacttt cccgatagct tgtcattagg tcaactattt tatagtattt 1860 taaaaataag atcacaattg acagacattt tacaaaacac aaatcaaaac tcaactgaca 1920 aaccaaaatt atatgacgag atatgcttga ataccttttt aattaatcta cgagaacccc 1980 ttaaaacttt aataaaatca aaaaacccac aaaatatcaa ggaagcttat gaacattgcc 2040 gtgtcgaaaa gaccttctat tcgcaaaaac caaatactag ccagaatttt cgagcacagg 2100 taacaccata cttacgcccc caaaacataa gaaatcatcc aaacaataac ccctattcta 2160 acaatagaag ccaattattt agaaataatt caaccatttc tcagcctaac attaacaacc 2220 aggctaataa caactaccat aataggaatt caactgtaaa tccttttaat tcaacaggcc 2280 caactaggaa ccaattacac aatatcaatg agcaacaatc ttcaactgaa ctttgtaata 2340 ttaatgacgc aggaaatttt ttcgaaactg cctccgagaa ccaacagggt acttaacaaa 2400 aaaatcacac ggatcgacac tcccttttat atggataaat tccgttacaa ctcaacccat 2460 aaaattctta atagatacag gttccacaaa ttcatttgta agcccaacca tagtgagcaa 2520 tgtcgatcgc aaaactcttg ataatcctat aactgtacag tcagtgttga accaacacgt 2580 cataaaagaa aaaaccacga ttcaatcact taaagagttt aatgaacaaa tgaagataga 2640 cttacttctt tttgattttc acccacattt caaaggtttg cttgggatgg atgtccttag 2700 aaagttaaaa gccaatatta atctggccag tttttgttta gaaactccaa aatcatgttt 2760 aaaaatttct ttacaagaca acccagtaac cgaagtacat attatcccag cttcttcaaa 2820 atctttaata acccttccgg tccggatgaa ggaaggtaat ttctactctc aacagacaat 2880 gtataataag cacttagaga tttcagaagg tctatacaat tcagttaacg gatatagcaa 2940 gatggtaatc gtaaataatt cacttgaaga tcagcaatta tatttagaac agccattaga 3000 aaccataccc tatgatcaaa atcactatat tgaaatatac aatataaatt cggacaatca 3060 aagtctccca catgtccaca aatacgttac caaattaaaa accgatcacc taaactccga 3120 agaaaaaatt gcattaatac aactctgtag caagtacaca gacctttttt atgacgacaa 3180 tcaaaaacta acatttacca gcgatattaa acattatata cgcacaactg acgacacgcc 3240 aatctttgtt aaatcatatc gataccccta cgccttaaag aaagaagtaa aagaccaaat 3300 agactcaatg ctaaagcaaa acataattaa aaacagttat tcgccttgga gcgcccctgt 3360 ttgggtagtt ccaaaaaaat cgaactccct tgaaacgcag aaatggagat tagttttaga 3420 ttatcgtaaa cttaacgaaa aaacaatttc agaccgttat ccaattccaa atataaatga 3480 gattttagac caacttggta aatctaatta cttctcaact cttgatttag ctagtggatt 3540 tcaccagatt gaaatgaacc aagcagatag gagtaaaaca ggttttacag tcgaaggtgg 3600 ccactttgaa ttcatccgta tgccgttcgg gttgaagaat gcacctgcaa cgttccaaag 3660 agtaatggac catgtccttg gagatctagt aggttccatt tgtttagtat acctagacga 3720 cattataatt ttttcaccat ctctacaaaa acacctgata gatataaggc aagtatttga 3780 tagactcaga tccgctaatc ttaagctaca aatcgcaaaa tctgacttcc ttcggagaga 3840 agttaatttt ctaggtcata ttgtaactca agatggggtc aagcctaacc caaataagat 3900 aaaatccatt caggagttcc ctatcccaaa aaacaggaga caaattaaat catttctagg 3960 ccttcttggc tattatagaa aatttattaa agactttgcc cgcataacaa agccactaac 4020 acaacaatta aaaggtacaa aagcagtaac aattgatgaa gaatactgta aaacttttga 4080 attctgtaag acattacttt gtaataatcc aatattgacc tacccaaact ttaaagaacc 4140 ctttatatta actactgatg caagcaatta tgccatcggc gctgttttat cccaaaaatt 4200 atctggaagc gacaaaccta tttcctacgc cagtcgcaca ttatccgatg aagaaacccg 4260 ttattcagct acagaaaaag aaatgcttgc agtccattgg gcagtaaaac acttcagacc 4320 ctacttattc ggaaaaaaat ttattttagt ttcagaccac aaaccattaa catggaacac 4380 aggaccattt tccaccaacg acaaggtcgc ccgttggaaa ctacaattac tagaatacga 4440 atacgaattt caatacaaaa aaggctctca gaacgtagtt gcagatgcac tgagtcgtat 4500 ccgacccgaa gcaaatatta tcgttaacaa tatagatgtc gaaaattttt caaataacaa 4560 agtaacatct tctctgtata atcttgcatc aagcataata attgtcgaaa aaccacttaa 4620 cgattttaat ttgcagattg ttttagctga aggttctcct gcttcttacc aggtttctac 4680 tccctttaaa aacaaattaa ggagaaccat cacagaaccc agacttaact cagatactgt 4740 actagacatt ctaaaaaaaa ttttaaagat taacaggaca ttagcaatgt acacatcaga 4800 cagtattttt caaatagtac tagatgtgtt ctctaaacat ttttcttaca ataaattatt 4860 taaattactt aggtgtacaa ctcttttaga cgatatcact aatacgaccg atcaagaaaa 4920 aatagtcaat gattatcaca ataataacaa tcaccggggt attgatgaaa cattaatgca 4980 ccttaaaaga aaagtctact ttccacatat gaaaagcaaa atagcgcaaa taataaaggg 5040 ttgtgaaatt tgcctcacac ttaaatacga tagacagccg caaaaaccac cttttgaagt 5100 accagaaact ccttgcaaac caatggacat agtccatata gacatataca cgataaacaa 5160 aaactataat cttacactaa tagataagtt ttcaaaattt gcgtgtgcct accctatacc 5220 caacaggacc tgcataaacg ttgttaccgc catgaaacat tttttcagcc tttttggaat 5280 ccctaaaaaa atagtttatg accaaggtgc ggaattttct ggcagtatat ttaaagattt 5340 ctgtaatcaa tacgacatcc aatgtcatat aacttccttc cagcagtcat ccagcaacta 5400 tccagtagaa cgattacatt cgacactgac tgaaacctat agaataatca cagaattaaa 5460 aaagaaaaat aagcagatta acaaacacga ggaaactctt tcggaaacac ttctcacgta 5520 caataacgct atacattcat ctacaaaata cacacctttt gaacttttta ctggacgcac 5580 acctatattt gaacaaacaa ttaaatacga cactaatcat gaatatttac aaaaattaaa 5640 tttttttcaa tcaaaattat accaagaggt gaaagagatt ttacataaaa atactaaaag 5700 caaacttgat aaacttaacg aaactcggaa atccgcagcc cccttatcaa ctaatgaaag 5760 catatttcga aaagaaaacc gtagaaataa gttaacgcca agattttcta aacataaagt 5820 cttacgagat aatggtaaca catttatcac cacacgcttg caaaaaatcc ataaatctaa 5880 aataaggcga acaacaaaat aataattagt ataatctttt ttttttacat atgtcattca 5940 ttgtacccaa acgcattgta aacaatatac agtatttgtt taacatttgg aaaaaaaaaa 6000 ttttttgtaa cataaaatgc tacaaaaaca actagaaaaa ttaagctttc agcataccat 6060 taaatccttt tgtgttgcgt tggaatcaca agtggacaac gacggtccga tgaacaagcc 6120 caagtcacaa ggaaaatcag cggcatccac cccgacagcc ccattctaaa tgcatttgct 6180 tcattttggg gagggaggag ttaccaaccc aacaccc 6217 // ID LanceleTn-3a repbase; DNA; INV; 173 BP. XX AC . XX DT 15-MAY-2006 (Rel. 11.05, Created) DT 15-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Non-autonomous DNA transposon - consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; LanceleTn-3a. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-173 RA Osborne P.W., Luke G.N., Holland P.W.H. and Ferrier D.E.K.; RT "Identification and characterization of five novel miniature RT inverted-repeat transposable elements (MITEs) in amphioxus RT (Branchiostoma floridae)."; RL Int. J. Biol. Sci 2(2), 54-60 (2006). XX DR [1] (Consensus) XX SQ Sequence 173 BP; 48 A; 43 C; 33 G; 45 T; 4 other; taatctccaa gcagatccta caatggcata agatagtayc aaactggcca aggagtgtag 60 tcagccaara ggtgtccatt tgccatgtag craaacacac tcctaagccg gcttcactcc 120 tctgccagct tttgatacta tcttatgcta ccrtaggatc tgcttggaga tta 173 // ID CR1_Ele10 repbase; DNA; INV; 4624 BP. XX AC . XX DT 28-SEP-2010 (Rel. 15.1, Created) DT 28-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE A CR1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1_Ele10. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4624 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4624 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 30 CC sequences with >95% identity, and ~99% identical to the original CC sequence in [1]. Closely related to T1 and Q. This consensus is CC likely 5'-truncated. XX FH Key Location/Qualifiers FT CDS 2..1513 FT /product="CR1_Ele10_1p" FT /translation="QVESVVGCRRIFKRSVVILNXFFFAVYKHRARPVKMQ FT CSVESCNISDDRFLWKCEFCFKNYHAACIGVQRHQESFILSFMVPLCGDCQ FT HNLKTGIDTRKVLHQQQQLIDSIRAQTDANLRVAADLKKLSAMGDLFDQIE FT LQLKESVLSINNSTSSIVSNAVSALSRVAEKNSTIKNDTPNSDLLAIKNHL FT TGLLDISMKSSKQNIEDFVETLTTDLTGELKKICSEVQSLSSLTIEMAAHC FT NDHNASQPHMTTDAIVEMQSIGNEILSEVKVLIDTVNSPQTNIDAVRFVEV FT ESPPSLLDELADQGIETVNLNPGRNSNGTAGWRLLGTKKVWRSDWTEYDNR FT QQRRASQQKTKEKAQKRKKANKHRGHNNSTNRKTNSIFSKTPSKHTRSKNE FT RENMKNNTSQKWFNYNDSNMKRRSNGAVSNGLPSDKDLLAAAKLTFSKPPV FT KQRGIKFQRGEILNPYPVDDAFPSCSTSSSNWLFGPALRNSFSQHCSSCSC FT EKTCFRST" FT CDS 1633..4566 FT /product="CR1_Ele10_2p" FT /translation="MQRDVLPSTSSEANATEILVYCQNFNRMKSASKMNEI FT YKRILTSSFSVVMGTETSWNEKIKSEEVFGNSFNVFRDDRDLVLTQRMSGG FT GVLIAISTKFNSELIISSKHKEFEHVWVKAHIAGETHIFASVYFPPDQACK FT SSYEIFFQTAEEIVSQLSPEFKVHIYGDFNQRNADFIRDSENESILLPVVG FT ENESLQFIFEKAAFLGLNQINHVKNRQNCYLDFLFSNIMDDFCVNESVSPL FT WKNEAFHTAIEYSIFVHNNHNSDNCDFENFFDYKNANYDGIKQKLNRIDWQ FT SVLKNQGDLEIAVTDFYSILSEIIQMEVPLLRRRRRNGSKNPVWINKQITN FT LKNRKQKAHKIYRQQKSRESLENYLYIVNQLNQAISAALEEYNTKIENDLK FT SCPRNFFNYAKSKMKSNNFPSKMTLDDRAADNPDGMCNLFATFFQETFTNF FT SDKDRDFEYFSSFPDFTSDIGVSQILLQDILTGLKNLDSTKGSGPDGIPPI FT LLKNLANEFTAPLFWLFNKSLEMSRFPKEWKRSYLIPIYKSGRKSDVRNYR FT GIAIISSIPKLFESIINKNVFSQIKHRITNTQHGFYKGRSTTTNLLEFVNY FT TLNAMDRGNHVEALYTDFSKAFDRLDIPMLIFKLNKMGIAMRLLNWIESYL FT TDRQQIVRFEGKMSKPVHVTSGVPQGSHLGPLLFILFVNDISYVLKHLKIL FT IYADDMKLYMEINSNKDSDIYLNEISIFDKWCSKSLLQLNVKKCNLITYSR FT KRNTPQITVSLGNQIVQKCDKIRDLGVILDSKLTFTDHYNTIIHKATNMLS FT FIKRFSYNFRDPYTIKTLYIAYVRSVLEYCSIVWSPHIKIHGDRIESVQKQ FT FLLYALRKLGWTTFPLPSYEARCMLIDIQTLNERREIAMISFINDIVSQRI FT DSTNLLSCLNFYTPARQLRNRNLFASNNHRTNYAKFGPMNRMMSLYNQHCT FT EIDLTMSRTKLRQYFNSLRYNTT" XX SQ Sequence 4624 BP; 1580 A; 832 C; 828 G; 1383 T; 1 other; tcaagttgag tcagtcgtcg ggtgtagacg catttttaag cgctccgtag ttattttaaa 60 ttawtttttt tttgctgttt acaaacatcg cgcgcgaccg gtaaaaatgc agtgtagtgt 120 tgaatcatgc aatatcagtg acgatcgctt cctatggaag tgcgagttct gctttaaaaa 180 ttaccacgcg gcatgcattg gtgtgcaacg ccatcaggag agttttattt tatcatttat 240 ggtacctctc tgcggcgatt gccagcataa tttgaaaact ggaattgaca cccgcaaagt 300 gctccatcaa cagcagcagc taatcgattc aattagggca caaactgacg ccaatcttcg 360 agtagccgcc gatctaaaaa agctcagtgc aatgggggat ttatttgacc aaattgagct 420 tcaactaaag gaatcggtgc tgagtattaa caacagcaca tcgtcgatcg tttcgaatgc 480 tgtgtcggca ttatcgcgtg tggctgaaaa aaactcaaca ataaaaaatg atacgcccaa 540 cagtgattta cttgcaataa aaaatcattt gaccgggctt ttggacatat caatgaaatc 600 ttcaaaacag aacattgaag attttgttga aacgttgact acagatctaa ccggcgaact 660 aaaaaagatc tgttctgaag tccagtcact gagcagtcta accattgaga tggctgctca 720 ttgcaacgat cataatgcca gtcaaccaca catgacaacg gatgcgatag ttgaaatgca 780 atcgatcggc aatgaaattt tgagtgaagt taaagtttta atcgatactg ttaactcacc 840 acaaaccaac attgacgctg ttcgattcgt ggaagttgag tcacccccta gccttttaga 900 tgagttggct gatcaaggaa tcgaaactgt taatttgaac ccgggccgaa attcgaatgg 960 cactgcaggt tggcgtttgc taggaacaaa aaaggtttgg cgctctgatt ggacagaata 1020 tgacaatcgt cagcagcgtc gtgcatcgca gcaaaaaact aaggaaaagg cccaaaaacg 1080 gaagaaagcg aataaacaca gaggccataa caacagtacc aatcgaaaaa caaactccat 1140 tttttcaaaa actccctcca agcacaccag aagtaaaaat gaacgcgaaa atatgaaaaa 1200 caatacttca caaaaatggt tcaactacaa cgatagcaac atgaagcgca gaagcaatgg 1260 agcggtttct aatggtcttc cctcggataa ggatttactc gcagcagcaa agctgacttt 1320 ttccaagcct ccagttaagc aacgtgggat caaatttcag agaggcgaaa ttttgaatcc 1380 gtatcctgtc gatgatgcat tcccttcgtg ctcaacatct agttcgaact ggttgtttgg 1440 tcctgcctta agaaactcct tttcgcagca ttgttcctcg tgctcctgtg aaaaaacgtg 1500 ttttcgatca acttgacgca gtcctcagac gtagaccctg acgatccaat cgtggctgat 1560 gaacaactag ttggtaatat tttaaataat gacaatgtat ctaatgatag atctgaactt 1620 tcaaataatg ttatgcaaag ggatgtgtta ccatcaactt cttcagaagc aaatgctacc 1680 gaaattttag tatattgcca aaattttaat cgcatgaaga gtgcatctaa aatgaacgaa 1740 atttataaaa gaattctaac ttcatccttc tcagttgtta tgggcactga aactagttgg 1800 aatgaaaaaa ttaaaagtga agaagttttt ggtaatagtt ttaatgtatt cagagatgac 1860 cgagatttgg tccttaccca aagaatgtct ggtggtggag ttctcatcgc aatatctact 1920 aaattcaatt cagaattaat tatttcatca aaacataaag aatttgaaca tgtgtgggtg 1980 aaagcacaca tcgccggaga aacacatata tttgcttctg tgtattttcc gccagaccaa 2040 gcttgcaaat catcatatga aattttcttt caaactgctg aagaaattgt atcccagtta 2100 tctccagaat tcaaagtgca tatctatggc gactttaatc agcgtaatgc tgattttatt 2160 cgtgattccg aaaacgaaag cattctgctt ccggttgtcg gtgaaaatga atcattacaa 2220 tttatttttg aaaaagctgc atttttaggt ttaaatcaaa taaatcatgt gaaaaaccgg 2280 caaaactgtt atttggactt cttattctca aacattatgg atgatttctg tgtgaatgaa 2340 tctgtttctc cactatggaa aaatgaagca tttcacacag ctattgagta ctcaattttt 2400 gttcacaata atcacaattc tgacaactgt gactttgaga atttctttga ttataaaaac 2460 gctaattatg acggaattaa acaaaaatta aatagaatag attggcaatc cgttttgaaa 2520 aaccagggag atttagaaat cgcagtgacg gatttttatt caattttgtc ggaaattatt 2580 caaatggaag ttcctttatt gcgaagacgt cgtagaaatg ggtcgaaaaa ccctgtttgg 2640 attaataagc aaataactaa tttaaaaaat cgtaagcaaa aggctcataa aatttacagg 2700 caacaaaaat ctcgagaaag tttagaaaat tatttgtata ttgtcaacca acttaatcaa 2760 gcaatatccg ctgcgcttga agagtacaac acaaaaatag aaaatgattt aaaatcttgc 2820 cctaggaatt tcttcaatta tgctaaatcg aagatgaagt ctaacaattt tccatctaaa 2880 atgacattgg atgatagagc agctgataat ccagatggta tgtgcaatct ttttgctact 2940 tttttccaag agacttttac taacttttcg gataaagatc gtgactttga atatttttct 3000 agttttcctg attttacaag cgatatcggt gtaagtcaaa tacttttaca ggacatttta 3060 acaggactga agaatttaga ttccaccaaa ggatcagggc cagatggaat cccacctatt 3120 ttactgaaaa acttagctaa tgaatttaca gcaccattat tttggctatt caacaagtca 3180 cttgaaatga gcagatttcc aaaagaatgg aaaagatcgt atctcatacc catttacaag 3240 tctggtagaa aatctgatgt tcgaaattat cgtggaattg ctattatttc atccattcca 3300 aaactatttg aatcaattat aaataaaaat gtgttcagcc aaataaaaca tagaataacg 3360 aatacacaac acggtttcta taaaggtcgt tctactacaa caaacctttt ggaatttgta 3420 aattacactt tgaatgcaat ggatagaggt aaccacgtcg aagcacttta caccgatttt 3480 agtaaggcat ttgacagact agacattcct atgttgattt tcaagctcaa taaaatggga 3540 attgcgatga gactcctcaa ctggattgaa tcttatttaa cagatcgcca acaaatagtt 3600 agatttgagg gaaaaatgtc taaacctgtt cacgttacat caggagttcc ccaaggttcg 3660 catttaggtc cattgttatt tatattattt gttaacgata tttcatatgt gcttaaacat 3720 cttaaaattc tgatttatgc cgacgatatg aaactttata tggaaatcaa tagcaacaaa 3780 gatagcgaca tttatctgaa tgaaataagt atatttgata aatggtgtag caaaagctta 3840 cttcaattaa acgtcaaaaa atgtaattta atcacatata gcagaaaacg gaatacacca 3900 cagattactg tctctttagg aaatcaaatt gttcaaaagt gcgataagat tagagattta 3960 ggtgtcatat tagattctaa acttaccttt actgatcatt ataatacgat tatccataaa 4020 gcaaccaata tgttgagctt tattaagcgt tttagctaca attttcggga tccatatacg 4080 attaaaacct tgtacattgc ttatgtaaga tcagttctag aatattgtag tattgtgtgg 4140 tctccgcata taaaaataca tggtgataga atcgaatcag tacagaagca atttctttta 4200 tatgccttac gcaaattagg ctggacaacg tttccactgc catcatacga ggctcgatgc 4260 atgcttatcg atattcaaac cttaaatgag cgtcgcgaaa tagccatgat ttcatttatc 4320 aatgatattg tatcgcagcg tattgattcc accaatcttt tatcatgttt gaatttctac 4380 acgcccgcta gacagttaag aaatcggaat ttgtttgcat caaataatca tcgaactaac 4440 tatgcaaaat tcggcccaat gaatcgaatg atgtctcttt ataatcagca ttgcacagaa 4500 attgatctga ccatgtcgcg aacaaagcta agacaatatt ttaattcctt aagatataat 4560 acaacataga attaagcaaa tgtgtagtct actattgatt gacgaaataa ataaataaat 4620 aaat 4624 // ID Transib-17_HM repbase; DNA; INV; 3356 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Transib DNA transposon -a consensus sequence. XX KW Transib; DNA transposon; Transposable Element; Transib-17_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3356 RA Jurka J.; RT "Transib transposons from the hydra genome."; RL Repbase Reports 8(12), 2106-2106 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1043..1324,1416..3104) FT /product="Transib-16_HM_1p" FT /translation="MDRRCLMNYIMENNLLQNDDLHAKILSFVSQSNFEES FT NDNLSIISKFIYQAKNKWKKSNRTSAVFEKRYKVWLNQTIFDCKKKKSKQK FT PVVCLLGGRPSILFEDASVRTKDRIIKELLSKYNLNELLYAASKSLKRDKR FT FIEAKVVKKIYNKELSTESLTSEQSLALVLDANLSKETYQHLRNNALPFSS FT LYPSYHIVKEAKNKCYPENIQVSDYSAEVPLQALLFHTASRICSAYSVVLN FT SPVLKDCTKLFIIYKAGFDGATGQSVYKQLSDVSSELHTDESLFITCLVPL FT ELYCFKDDKKQVVWRNPKPSSTNYCRPLRFKYRKESNEVILAEYESIMASI FT KEITLSVVPVLLEDGTIKEVEVQHIVNLTMIDGKIQTIISTATNSYQCCSV FT CGASPIEMNNLSLVVQKKTLEGNIKHGVSTLHAWIRFLECMLHISYKMPIM FT KWQARNQDEKNIVKLRKLEVCAEFRTRIGLVIDQPRIGGAGTSNDGNTARR FT FFQQYDVSASIMNIDVDLMKRLHIILATISSGYEINSSKFKTFCWETASLY FT VFLYPWYPMTQTLHKILIHGYEVIELFTLPLGMFSEEAQEATNKVFKIFRE FT NFTRKCDRKKNNIDLFHRLLCASDPLICTLRKKIHNKKKSLPSGVLELLQE FT PTIEYSA*" XX SQ Sequence 3356 BP; 1258 A; 460 C; 484 G; 1153 T; 1 other; cacagtgggt cagaagccgg caaaaccata caaaaaagtt tttaaatttt gaaawtggca 60 aattttgctc aatatattaa taagccagaa attatatttt attaaaaaat aatttttcaa 120 taaaacgcca tatttgttgt ttaaatattt ccttaaatac tttttatcaa taacaacaaa 180 aaagtgtgaa aattttattt ttttacatct tttgaaaact ttgatttctt attttaaaga 240 agtttaagtt tattttttaa cttataaatg acttatatag tattatgtta atatttaaat 300 cacataaaaa aaaatatttt ttgcattttg tataaatatt ttcaacaatg caaacatcaa 360 taattttgag ttataagttt ttattttttt caaaaatatc atttaaaacc atcggatcaa 420 cctgtatcaa ccgcttaaaa tgatatataa ttgaatacca atatgacatt gatcttgtgg 480 tataaaaatt ggctgcctat agctgcctaa acgaatacac ttttaaattc aattaactac 540 gtccacgttt cttgttaatg cgtttttgat cgataacaaa aaatttaaga agatttaaaa 600 tttaaaaact gcagtttgac tttataataa tttttattaa agacataagt ttaaaattaa 660 atatttttgt attttttaat tgctttccag taacttcctc gtatttgctc aatgaacatt 720 tagatcgcat tgaaaccatt attcctaatt ttttatgcta ccgttttaaa gtaataaaaa 780 ctcagagact gtgataaaat atgaattata tacttctacg tagtacaagt ttgggacata 840 atatttatta tttattttat aatttccgaa aacaataaaa ttatatgtaa tcgataaaaa 900 acacgataac aaccaatgag aaataaagga aaggaactga gtttcatagt tactgctcgt 960 aaaaaaatta ctttattaaa aacattttaa attactttct ttcaaattaa aagtatttta 1020 ttgaacagac aatctcttaa aaatggatag gagatgtctt atgaattata taatggaaaa 1080 taatttacta caaaatgatg atttacatgc aaagattttg agttttgttt cccaaagcaa 1140 ttttgaagaa tctaatgata acttgtcaat tatttcgaaa ttcatttacc aagctaaaaa 1200 caagtggaaa aaatcgaata gaacaagtgc agtatttgaa aaaagatata aagtttggtt 1260 aaaccaaact atatttgact gtaaaaaaaa aaaatcgaaa caaaaacctg tggtatgttt 1320 actttaacta tgtgtaaatt tattataatg tgtataattt atttgtatta ttattacgac 1380 gatttaataa ttgataaaat aatattatac tttaaggtgg acgcccttcg attttgtttg 1440 aagatgcgtc tgtgagaact aaagatcgca taattaaaga gcttttgagt aaatataatt 1500 taaacgaact tttatatgca gcaagtaaaa gtttgaaaag ggataaaagg tttatagaag 1560 caaaagtggt taaaaaaata tacaacaaag aattatcgac agaatcactt acttcagaac 1620 aatcattagc gctagtctta gatgctaatt tatccaaaga aacttatcag catttaagaa 1680 acaatgctct tccattcagc tctctatacc catcctatca tattgtaaaa gaggctaaaa 1740 ataaatgcta ccctgaaaac attcaggtca gtgactactc tgcagaggtt ccattacaag 1800 ctcttctttt tcatactgca tcaagaatat gttcagctta tagtgttgtg ttaaattcac 1860 ctgtgcttaa agattgtact aaactattta taatatacaa agctggtttt gatggtgcaa 1920 caggacaatc tgtttataaa caactgtcag atgtttctag tgaacttcat acagatgaat 1980 cacttttcat aacttgcttg gttcctctag aactatattg tttcaaagat gacaaaaaac 2040 aggttgtttg gagaaatccg aaaccatcgt caactaatta ctgcagacct ctgagattta 2100 aatatagaaa agaatcaaat gaagtaattc ttgcagaata tgagtcaatt atggcttcta 2160 ttaaagagat aacattatct gttgtacctg ttttattgga ggacggtaca ataaaggaag 2220 ttgaagtaca acatattgta aatttaacaa tgatagatgg aaaaattcaa actattattt 2280 ccacagcaac caactcatac caatgctgtt cagtgtgtgg tgcatctcca atcgaaatga 2340 acaatttatc tttagttgtt caaaaaaaaa ctttagaagg caatataaaa catggcgttt 2400 caactttgca tgcttggata aggtttttag aatgtatgct ccatatctct tataagatgc 2460 cgattatgaa gtggcaggct agaaatcaag acgaaaaaaa tattgttaaa ctgagaaagc 2520 ttgaagtctg tgctgaattt cgaacaagga taggtcttgt catagaccaa ccgagaatag 2580 gtggggcagg tacatcaaac gatggcaaca cagctaggcg attttttcag caatacgatg 2640 tctcagctag tataatgaac atcgatgtag atcttatgaa aaggcttcat attattctgg 2700 ccacaatatc aagtggttat gaaattaact catccaaatt taaaacattc tgttgggaaa 2760 ctgcttcgct ctatgtattc ctgtaccctt ggtatcctat gactcaaaca cttcataaga 2820 ttttgattca tggctatgaa gtcattgaac tatttactct tccattaggt atgttttctg 2880 aagaggctca agaggctacc aataaggttt ttaagatttt tcgtgaaaat tttacaagaa 2940 aatgcgatcg gaaaaaaaac aatatagacc tttttcaccg ccttttatgt gcatctgatc 3000 ctttaatttg cacattaaga aagaagatac ataacaaaaa aaaaagtcta ccatctggtg 3060 tacttgaact attgcaagag ccaacgatag aatactctgc gtaaaacata aatttaaata 3120 atatccttgt tgttaatttt gttgggtaaa aacggaattt tttataaatg gaaaataaat 3180 gagtttttta ataatttgcg taattatgta ataaaatttg aagtttttaa aaaacaaatg 3240 ttattttttc cacaaattat gtttttcaat ataataaact gtttctaatt aaaaaaaata 3300 aaaaaatttt ggttttagag gttttggcta gaaaatgtca cttttgaccc actgtg 3356 // ID DNA8-7_CQ repbase; DNA; INV; 2900 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A DNA transposon family from Culex quinquefasciatus - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNA8-7_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-2900 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the southern house mosquito."; RL Repbase Reports 11(1), 84-84 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >87% CC identity. 8bp TSDs. 16-bp TIRs. It encodes a protein similar to CC DNA ligase 1. XX SQ Sequence 2900 BP; 800 A; 629 C; 795 G; 675 T; 1 other; cgaggggcag gacacatgct ctaatcaact aaaataatcg gctaattgat tagaaattat 60 cagttaatca attagtgtga cgctaatcaa ctaacggttc ccaccttcta tcattatctt 120 ggtctgatga agaaatgtca aaaacacaaa acaaaaacaa tttttttccg cagaagccac 180 aaaagttccg tcgggacccg gttgctgctg gtgccactgc aaacgacgag ggtcatttcc 240 gttcgccgga tgtgaaagaa gtcccagctc caacacggtg tcatccgaca gcagcagaac 300 aagcagctgt tcattgtgga aggtcgtttg tttttgtttt ttcgtcacgt ccgcggtctg 360 cagcgccggc aagaaaccca aactggaggc ttcctccacc gaagagccgg cggccgggac 420 gcaagtcgac gaagtggagg cgaattgtgt cgttcagtga ggaggaaacg ccgggaaggc 480 caggtaggta gattataacc gattcactgt tatggagctg gctaaaacga ctatttttca 540 cagcccctca tcggtcaaga aatccccgag cgacagcatt ccaggaggag gagaaaaatg 600 tatagaaagt ccagatccag acagagtcca tggatcgttc aagtcggctg aaaagaagac 660 ttttaccgtt gtaaagaagg agcaaccgga agtgaagcgc gaaccgtcgg aatcgccgaa 720 aaggaggtga acamtaagaa caacatgatg agtttctcca cgagcgcaaa gagaaggcgg 780 ttgcggcgat tctggccgcc tccaagaact acgggacgga gtacaatacg ggagaaagtt 840 ctggcgaaag ggcgaaacgg tgacgtgctt ggcgcggacg ttacagatca tcgagcggct 900 gcggatgatc gagattctgg ccaactactt ctggtcggtg gttctgattc tgagtccgga 960 cgatctgctg gcgagtgtgt acttaagctt gaaccggctt gcttggcgcc ggcgtacgcg 1020 ggcgtcgagt tggacatgac ggggagcact cgttgatgat gcgattaggc agagcacggg 1080 ccgaagtttg gcgcaaattt tgatggacgc acagacgacg agggatttgg gcctggtggc 1140 ggagcagagc aagagtagct atcggatgat gttccggccg gctccgcact cagtcaagga 1200 cgtgtttggg aagttgagga agattaacaa gatgacgggg acggatgatg gacaagattc 1260 agtcgatgtt tgtggcgggt cggcacggtt tggtttggtg gagcagtcgc tgctgtaagc 1320 gttggcgcaa gcttgtgcga tgacgccgcc gaaccatgac ggttttaatg aaccggtggc 1380 gaacgcattg agcaagtgtg agaaagcttc cgaaggcgaa ggtggaggac gttgcgttgg 1440 tgctgaagac ggagtccgaa tttcaaccag attgtacagt tgctgctgga ggacggggtg 1500 gttcacctgg tggacaagtg ttcgatgatg ccgggaacac cgttgaagtc catgttggcg 1560 catccgacca agggcgttca ggaagtgttg gagcgattcg acgggatgat ttaagtgcga 1620 gtggaagtac gacggagatc gggcgcttta tccacctgct ggaggacggc agcgtgaaca 1680 tttacagttg gaactagggg aacaacacga gcatgtacct ggacgtgatt gcccggttgg 1740 accgcactcg gaaggattcg ctcaagagta cattcctgga ctacgaaaaa ttgttggtgt 1800 tgagcagatc tcgtgtctcg gaaaggctat catttgtaac acgagtgcta aattggggat 1860 atccggatta tcattctaat ctggagatta cgtgtgaaat aaactttggt ttttacaaat 1920 gggggtgggg gaatttcttt cgaatgaaag tacagctcaa gaactggccc acgtcaatga 1980 tgatcttttg gtaattagca ctgccggaat cacaatggaa atagtgatca tattacccag 2040 gcaacaacat tctggacatc gtaaactgtt cttttcctct tctaactcgt caaacgttga 2100 attatccgca atttccatgg aggtccgcaa cttcgaagcg ccttaaaatt ccccccggaa 2160 gaagctcgat tcgaatggtc tgaattatta gaaccgtgct tgcttccttg tttccggtta 2220 tctgtctgtg aaaaaagcaa accttccggt tacggccagc ccaacctatc tgggccacgt 2280 ttttagatcg ttactgccac aatagcagca agcggtagca gaaccaaaaa tcgtgccacc 2340 ggaaaaaaag ggagtttggc tgagttggca gcatctaaag aaatccgtgt caaaaacaaa 2400 aacttcatac agacccccca acgataaaaa cgactcggtg gaagcatcat ccgccggtac 2460 ctgatgcgga gaaactccag gaccatctaa taaaagctcg atctccggga agttccgctg 2520 gggacaccaa agattgaacc tatatttgtg cttcaaaacc atacttctca ctcaaaataa 2580 aaaataaatc tccgaaaata atgaccagga ttggtgccgc agtccacctc tttcgcttgg 2640 accacaggaa taacgataaa tgtcgtcctg gaaggagcaa ttttcacgaa attccgaatc 2700 ggggttttag aatccttcca aaaacattgg aaacaaaaca cgtttgctgc tgtcaaaatg 2760 actcttctcg atatgacaag atagtttaaa tcggggtggg aatgacgtgt cagttcagcg 2820 ctaattgatt agctaattga cagaaaatta cgctagttga ttagcgttac aataattgat 2880 taggtgtgtc ctgcccctcg 2900 // ID CR1-51_AAe repbase; DNA; INV; 6056 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-51_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6056 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1138-1138 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >97% CC identity. XX FH Key Location/Qualifiers FT CDS 236..1675 FT /product="CR1-51_AAe_1p" FT /translation="MASVVCSGCVHKITTESDRVYCFFGCDKILHTKCAEI FT NLTGAKALKENPALRYICFDCRKNQLSINDVQSKCESVLKSVNELHSIQKS FT TIERIVNLERSIADSVSSSCEKVVSKCIEKLLTDLRDSACYVQHAVPNIAA FT GPSYAAVVQTGVPLSQRTENYKRKVADTYSQNSAKKLKPNSADDDVANDEG FT QWLRSGKRRNRIIQSTHLKATSVESSIRTPIGRPKPSQVPKMNRTVVIKPK FT VSQDVDTTKSEIKSNLDPVVHAVKDVFFNDNGNAIIRCDSQQSALKLIDSA FT QSSLGTKYEIGIQTALRPRVKIVGFSEVMQDEFISKLIKQNNLPDNAELKV FT VRFTKSKKLKDNPMSVVLEVDACTFKILLNAKYVNIEWERCPVFESIDVLR FT CFRCSEFGHIAANCNKPFCCPKCTECHEATECMSEYEKCINCTIANKAQNV FT PADQQLEVNHCSWSNECPVYLKRLEKSRQRIDYST" FT CDS 1691..5647 FT /product="CR1-51_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLLITPLCCESGAYNAAYSVNTSPQGSGHQHNELAGP FT GKAFCSICPTEAFIGTTPALNRVHILQSTATSFDDMVQDDKAGPTDTLSPD FT SGHQHNELPGPGKASYSICPAEVYLDTAPALNRIHALQNSDLQQENVAVTL FT LPDSGHQHNELAGPGKVFYSICPTEAFTGTTPALNRVHILQSAATSLDEII FT QDDDAGPTDTLSPDSGHQHNEXPGPGKASYSICPSEAFTGTTPSLNQVHVL FT QNVPTYLNAIVSDASVVHTELYFPNASEKTESPIDATTSDTGHQHQELAGP FT GKALCSICPSEAKTTAPALNRDQTFQIMQSELSSTTDVDSGHQLSNSHLSA FT ALSSIISVSPPPSFKKLRIYYQNVRGLRTKIDDFFLAVADLHYDIIVLTET FT WLDDKIYSAQLFGDFYTVFRNDRSTLNSRKCRGGGVLIAVSSGLNCNLDPT FT PVDSSLEQLWVVLSLPEQKISVGVIYLPPDRKNDMPSISKHVESIQLIQSN FT LGIQDHAFLFGDYNQSALVWNYSTRNRLQINPLESHISTQCATLLDGFSLS FT GLNQINGIVNQNGRILDLALVTDSALNQCELLEAVEPLVPLDADHPAFTVE FT VKIPSRLIFEDMSNECGLNFRKADFIGLNEALGEIDWNIIESTPNLDDAVN FT LFNHLITRTMSDYVPVRRPAPKPPWTNARLKKLKRIRSKFLRKFCRTRCPH FT VKQQFDHASNNYRLYNRFLYQEYTLSIQENLRRNPKQFWSFVNSKRNENGL FT PVQMFLQDNSANTSAEKCELFATHFKHAFNSCVASNDEAEATCRDVPYNVC FT ELSNVQIDHQQVKAAISKLKYSTAAGSDGIPSCLLKKCANNLIRPLAFLFN FT ESLRQRLFPSNWKISVMFPVFKKGDKQNVENYRGITSLCACSKVFEIIIYE FT LLFASCKNYISSDQHGFYPRRSASTNLLHFSSFCIRNIDAGSQIDAIYTDL FT KSAFDRVDHKILLAKLNKLGISSDLIDWFNSYLTNRKLLVKIGDESSEFFT FT NPTGVPQGSNLGPLLFSIYVNELSRLLPPGCRLFYADDVKIFIVVNSIQDC FT YTLQRILNSFVAWCSNNKLTISLAKCSVITFHRKLQPIVHHYTILNQHLER FT VDHIRDLGVILDQAFSFRLHYEHIISKANKQLGFIFKLTSEFHDPLCLRAL FT YCALVRSILESNAVVWCPYQANWISRIEAIQRKFVRQALRSLPWRDPLNLP FT SYEDRCGLLGLRTLEERRCISQAVFVGKTLQGEVDSTEILGRLNIYAPERI FT LRQRNFLQLEPRRAVYGFHDPIRFISARFNEVFHVFDFNISTTAFKRRLQG FT SLPTLNR" XX SQ Sequence 6056 BP; 1756 A; 1327 C; 1226 G; 1745 T; 2 other; taatgtgaat tcgtgaattt tccgcgtgac atgacatatt actggatagc gaaaatcatt 60 ctaaatacgt gcgtcgtgaa gttatcataa ttttctgttt tctggatgtg tgatacttga 120 ttttataatt ctgttctgat cgatccaaaa caagcgattt gcatatagca gataaagctt 180 tgtttgtcgc ccgtggggcg aataagatgc caacaataaa tcaccgcttg ctaccatggc 240 atcggtggtt tgcagtggtt gtgtgcataa aatcacaaca gagagcgatc gagtgtattg 300 cttttttggc tgcgataaaa ttttgcacac taaatgtgct gagattaatt taactggagc 360 gaaagcgttg aaagagaatc cagcacttcg ttacatctgc ttcgattgcc gcaagaatca 420 attatctatt aatgatgttc aaagtaaatg tgagtctgtt ctgaaatccg ttaatgagct 480 gcacagcata caaaaaagta ccattgaaag gatcgttaat ctcgagcgct ctattgctga 540 ttcggtatca agttcttgcg aaaaagttgt ttcgaaatgc attgagaaat tattaaccga 600 cctccgtgat tctgcttgct atgtccaaca tgctgtacct aacatcgctg ccggtccttc 660 gtatgctgct gtggttcaaa ctggagtgcc cttatcccaa cgaactgaaa attacaaacg 720 caaggtagct gatacttatt ctcaaaactc tgcaaagaag ctgaaaccta actctgctga 780 tgatgacgtt gcaaatgatg agggacaatg gttgcgctcc gggaaaaggc ggaatcggat 840 tattcagagt acacatttaa aagccacttc tgtcgaatct tcaatcagaa ctcctattgg 900 ccgacccaag cctagccagg tgcccaaaat gaatcgtact gtagtaataa aaccgaaagt 960 atcacaggac gtagacacga cgaaatctga gataaaaagc aatttggacc ccgtcgttca 1020 tgccgttaaa gatgtgttct tcaacgataa tggaaatgcc atcattcgat gtgattcgca 1080 acaatctgcc cttaaactaa ttgattctgc tcaaagttca ttgggtacca aatatgagat 1140 aggtattcaa actgcactgc gtcctagagt taaaatcgtt ggattttctg aagttatgca 1200 agacgaattt atttcgaagc tcataaaaca aaacaacctg cctgataacg cagagttgaa 1260 agttgtgcga tttaccaaat ccaaaaaact gaaagacaac ccaatgtctg tcgttttgga 1320 agtggatgcc tgcaccttta aaattttgct caacgctaag tacgtgaata ttgaatggga 1380 acgctgtccg gtttttgaat caatcgatgt tcttcgttgt ttccgctgct ctgaatttgg 1440 acatattgcg gccaattgta ataaaccctt ctgctgtcca aaatgtactg aatgtcatga 1500 ggcaactgaa tgcatgtctg aatacgagaa gtgtattaac tgcactattg cgaataaagc 1560 acaaaacgta cctgctgatc aacagcttga agtcaaccat tgctcctgga gtaatgaatg 1620 tccggtgtat ctgaagcgtt tggaaaagtc aagacaaagg attgactact ccacatagca 1680 gtcagaactg atgctgctga tcactcccct ttgctgtgaa tctggagctt acaatgcagc 1740 ttattcagtc aacacgtcac ctcaaggctc aggacaccag cacaacgaac tggctggtcc 1800 aggtaaagct ttttgcagta tatgtcctac cgaagcgttt ataggtacta ctcccgccct 1860 caatcgagta cacatattgc agagtactgc tacttctttc gacgatatgg ttcaagacga 1920 taaagctggt ccaaccgaca cgttgtctcc ggattcagga caccagcata acgaactgcc 1980 tggtccaggt aaagcttcct atagtatatg tcctgccgaa gtctatttag atactgcgcc 2040 cgccctcaat cgaatacacg cattacagaa ttcagatctc cagcaagaaa acgtagctgt 2100 cacgttactg ccggattcag gacaccagca caacgaactg gctggtccag gtaaagtttt 2160 ttacagtata tgtcctaccg aagcgtttac aggtactacg cccgccctca atcgagtaca 2220 catattgcag agtgctgcta cttctcttga cgaaataatt caagacgatg atgctggtcc 2280 aaccgacacg ctgtctccgg attcaggaca ccagcataac gaacwgcctg gtccaggtaa 2340 agcttcttac agtatatgtc cttccgaagc gtttacaggt actacgccct ccctaaatca 2400 agtacacgtg ttgcagaatg ttcctactta tctgaacgca attgtttctg atgcttccgt 2460 tgtgcacacc gaattgtatt tccctaatgc ttcagagaaa acggagtctc caatagatgc 2520 taccacttcg gatacaggac atcagcacca agagctagct ggtccaggta aagctttatg 2580 tagtatatgt ccttccgaag caaaaactac tgcacccgcc ctcaatcgag atcaaacgtt 2640 tcagataatg caaagcgaac tgtcttctac gactgatgtg gattctggac atcagttaag 2700 taattcccat ctcagtgcag ctctttcgtc gatcatcagc gtttctccac cacccagctt 2760 caaaaaactg cgtatttact atcagaatgt tcggggtttg cgcacaaaaa tcgatgattt 2820 tttcctggct gtggcagatt tgcattatga tatcatcgtg cttacagaga cgtggctcga 2880 cgataagatc tattccgcgc agctttttgg agatttttat accgtattta gaaacgatcg 2940 tagcacgctg aatagcagaa aatgtagagg tggcggggta cttattgctg tttcctctgg 3000 tttgaactgt aatcttgacc ctactcctgt cgatagctct ctcgaacaat tatgggtagt 3060 gctgagtctt cctgagcaga aaataagcgt tggtgtcatt tatctgcctc ctgatcgcaa 3120 gaacgacatg ccaagtatat ccaaacatgt tgaatcaatc cagctaattc aatcgaatct 3180 cggaatacag gatcacgctt tcttattcgg tgactataat caatcagcgc ttgtttggaa 3240 ctattcaaca agaaatcgtt tgcaaatcaa cccattagaa tcacacattt cgactcaatg 3300 cgccacactt cttgatggat tcagtctaag cggtttaaat caaatcaatg gaattgttaa 3360 tcaaaacggg cggattcttg atttagccct ggtcactgat tctgcattaa accaatgtga 3420 attattagag gccgtcgaac ctttagttcc tcttgatgct gaccatcctg cgtttaccgt 3480 ggaagtaaaa attccgtctc gtttgatttt tgaagatatg tcaaacgaat gtggtttgaa 3540 ttttagaaag gccgatttta tcggtctgaa cgaagccctc ggagaaattg actggaacat 3600 tatagaatct accccgaact tagacgacgc cgtgaacctc tttaatcacc tgattaccag 3660 gacaatgtcc gattacgttc ctgttcgtag acctgctccc aagcctccct ggactaatgc 3720 tcgcttgaag aagcttaaac gaattaggtc aaaattttta cggaagttct gcagaactag 3780 atgcccacat gttaaacaac aatttgatca tgcgagcaac aactaccggc tatataacag 3840 atttttatac caagaatata ccttgagcat tcaggaaaac ctgcgtcgaa atcctaaaca 3900 gttctggtcg tttgtgaact ccaagcgtaa tgaaaacggc ttacctgtac aaatgtttct 3960 acaagacaat tctgctaaca catccgccga aaaatgtgag ttgtttgcca cgcattttaa 4020 gcatgctttc aacagctgtg ttgcatctaa tgacgaagca gaagctacgt gccgcgatgt 4080 accatataat gtatgtgaat tgagcaacgt tcaaatcgat caccagcagg ttaaggccgc 4140 catctctaaa ttgaagtatt caaccgctgc cggctcggat ggaattccat cctgtctctt 4200 gaaaaaatgc gcaaacaatt tgatccgtcc gctagctttt ttgttcaacg aatctctgcg 4260 acagcgtctg tttccatcta actggaaaat ttctgtaatg tttccagtgt tcaaaaaggg 4320 cgacaagcaa aacgttgaga actaccgagg aataacctca ctatgcgcgt gctcaaaagt 4380 ttttgaaatc atcatctacg agttgctttt tgcgagctgt aagaattata tctcttcgga 4440 tcaacacggt ttttacccta gaaggtcagc ctcaacgaac cttttgcact tttcctcgtt 4500 ttgcattcga aatattgatg ctggatcaca gattgatgcg atatatactg atctaaagtc 4560 ggcgttcgac cgagttgatc acaagatatt acttgccaaa ctgaataagt taggaatctc 4620 atcagacctt attgattggt tcaattccta tctcacgaat cgcaagctgc tagttaaaat 4680 aggcgatgag tcgtcggaat tcttcactaa tccaactggc gttcctcaag gaagtaattt 4740 gggacctctg ctgttttcaa tctacgtgaa tgagctttca agattgctac caccgggctg 4800 tcgattgttt tatgcggacg acgttaagat tttcattgta gttaacagca ttcaagattg 4860 ttacacactc caacggattt taaactcatt cgtagcatgg tgctccaata ataagctgac 4920 aatcagccta gctaaatgca gcgtgataac atttcacaga aaattgcaac caattgttca 4980 ccactataca atactgaatc aacatctgga gcgcgtcgac catatccgag atctgggagt 5040 tatactcgat caggcatttt ctttccgtct acattacgaa cacatcatat ctaaggccaa 5100 caaacaactc ggcttcatct ttaaactaac cagcgagttc catgatcctc tatgtttgcg 5160 ggcattatat tgtgcactag ttcgctcaat tttggaatct aatgctgttg tatggtgccc 5220 atatcaggcg aactggatct ccagaatwga ggccatacaa agaaagtttg tcaggcaagc 5280 gcttcgaagc cttccatggc gtgacccgct aaacttgccg tcgtacgaag atcgttgcgg 5340 actactcgga ctgcggacat tagaagaaag gagatgtatt agccaagcag tgtttgttgg 5400 aaaaacattg cagggagaag tcgactctac cgaaatttta ggacgattga acatctatgc 5460 tcctgaaaga attctacgcc agcgcaactt tctacagctt gagccacgcc gtgcagtata 5520 tggctttcat gacccaatcc gtttcatatc tgccagattt aacgaagtat ttcacgtatt 5580 cgactttaac atctctacaa ccgcgtttaa acgccgatta caaggctcat taccgacttt 5640 gaacaggtga gttcctctcc aacaactcta aaggttgtaa atggtgctac cagatgtttt 5700 ttttttctct cattctcgat tacaattgtc gccgagtctt gaccagagat catcgtaatg 5760 tttcctgtag accacgatgt ttggatggat ggatgatgca cgaatagttt ttaggtttta 5820 tgatgattga caatgtttgt agaaaagtaa ttgtgatcgt attgtattta atgactttgt 5880 ttataattgt tactgacagt gattgtaatg tttatgtaga aaagatatga ggtttttacg 5940 cctttttgag caagtcgcga gttgttggtc tactcaaatc ggcttttcct catcaaacgc 6000 ttcattaagc aaaagcagat gaagctagaa accaaataaa caaataaaca aataaa 6056 // ID Gypsy-93_CQ-LTR repbase; DNA; INV; 211 BP. XX AC AAWU01007335; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-93_CQ_; KW Gypsy-93_CQ-I; Gypsy-93_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-211 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 566-566 (2011). XX DR GenBank; AAWU01007335; Positions 831 1041. XX SQ Sequence 211 BP; 52 A; 64 C; 42 G; 53 T; 0 other; tgtggtggtg tcctgcccct gcgaacattt acacataaca cacacactta cacctacacg 60 ttgccggctc agcacacccc tgttgcagac acacgcacac agatcatttg ctcattcttc 120 tctgtgccgc gcaagagaca agtcgaataa aaagtagagt tttagtcggt gaactacgcg 180 ttttattccc tgcggcgatt ccgttcccac a 211 // ID Gypsy-35_DPu-I repbase; DNA; INV; 4614 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_DP_; KW Gypsy-35_DPu-LTR; Gypsy-35_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4614 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC Positions [3562-4023] - Integrase core CC 'AAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 208..4542 FT /product="Gypsy-35_DPu-I_1p" FT /translation="MPTDEAKVTVPASKALERSPKVPDLSKMAPAAKAFKP FT YGSPPRFDLDEFKDSFELWHAQWKIFLALSTIDTVLEEDERPEYKTNVLLS FT CLSKETLQAVLSMGLDDDELEDHEVIIQQLRDRCNAGRNRHVWRQQFAMKK FT QRANESADNWLCELRELASKCEFLKDCCSKCQPTRILGQIVFGVYDDDVRR FT KLLERGDKLKLDEAIDILRVAEAASTQAKNLTQGDAMAIQALSKSSYKKDK FT LNKSSKPQHKQQSSSFQPSQSKKDEKSVKFDTSSKGCWNCGGETRHPRSEC FT PANGKDCGKCGKTGHFRKVCNSSSNKKNTTPSAQTGCIGLEPESPAQVSGI FT GALELVRMNISPKEGSKSISLSVLPDTGAQIDAIPADLYHSEFSPIKLMPG FT ETNTITAIGSRIVNLGHFTATICWPHGNSNTKATSIIHVLQDLKQPVISQA FT TQKNLGMLPACYPNQRVEIFNINAHQEAFSNLLANIHAHPTEGIKQNDLKN FT LTNDFPLIFDGVCRPMAGPACHFELKKDAVPVAFRGSRPVAEPLMPLLKKE FT LDLLEQQDVVRKVSKPTAWVHPIVIAPKSDGGIRVCGDFTILNKSIIRPRF FT ETATPFQAVRTIPPGMHFFTVIDALKGYHQVLLDDESAEMTTFSTPFGRYQ FT YKRLPFGVSLAGDDYCRRVSEVFDDLENCRRVVEDVLVFSKTYEEHLVLVR FT KLFERAAQHNVAINVSKLVFAQPSAKFGGYIVNSTGFSPHPDLMKAIREFP FT VPRNATDVRSFHGLCQQVGHFSTKVAESLKPLSPLLKKGYIWEWTTTHDQA FT FQNARKALSEITDLAFYDMNHPTALHVDASRLFGLGFVLKQKDADGNWRMV FT QAGSRYLSDAESRYAMIELECLSAAWAMHKCRQFIEGLPHFELITDHRPLI FT PILNDYSLDKLDNPRILRLRLKMQRYQFTARWVRGKDNLDADALSRAPVDI FT ASPGDELAEGPPSFSARAAIINTINCSDISVTDPVLDRIKTAALEDKQMIE FT LREAIINGFPNDKCNLSLLLRPFWNIRTQLAIDESDGMIMAGARIVIPESC FT RRPILQDLIQMHQGATKLKQRARLSLYWPGMDNDIDMAAQSCKHCTENQPS FT LPREPLQQREAATRPFEQIHMDIASVNGRDFLILVDQYSGWPDVIPFKDKH FT TTARRIVDASREFFVHGPGAPVKVWSDNGPQFSAMEFKNFAKDWGFSTGTS FT SPHYPQSNGLAEAAVKSMKKLIAGSWTAGSFNVDKFAKSLLLFRNAPRSGA FT ASPAQMVLNRPVRDALPAHRRSFAPEWQQKTDVLEKRARRAKEVQIEHYNK FT TAHSLPPLSIGDHVVIQHPISKCWSTPAVVVEIGPHRDYLLKTPAGRLFRR FT NRRMLRKRVPVMPGTKSTSTPTPPAPIPVKPETTSATKSMPPAPEASPKQA FT KPRQRGKPKPPTFAVLRRSTRTTKPPDRYTN" XX SQ Sequence 4614 BP; 1293 A; 1197 C; 1037 G; 1087 T; 0 other; tggcgcagtt gttttcttaa cgaacctaac ggagtaagtt gaacagttta caatatattc 60 tttcattggc gctgtgcata cttttgcgga cggttttcaa acagtattgt ctactggacg 120 gaggcgtgtt ttgtttgggg gctcttgaaa aaggtacaga accacatttc acggccgtca 180 tagtcaatat tcctttcttt aacttctatg cctactgacg aagctaaagt aactgttccc 240 gctagtaaag cacttgaacg tagcccaaag gtgcccgacc tctcaaaaat ggcgcctgca 300 gccaaagcgt tcaagccgta cggctcacca cctcgatttg atttagacga atttaaggac 360 tcattcgagc tatggcacgc gcaatggaaa atatttttag ctttatcgac gattgacaca 420 gtgctggagg aggacgaacg tccggaatac aagacaaacg tccttttatc atgtttatct 480 aaagagaccc tccaagctgt tttatcgatg ggtctggacg atgacgaatt agaagatcac 540 gaagtcatca ttcaacaatt acgtgacagg tgtaacgcgg ggcgtaatcg ccacgtatgg 600 cgtcaacagt ttgctatgaa gaagcaacgt gcaaacgaat cagcagacaa ctggctctgc 660 gaactcaggg aactggctag caaatgcgag ttcctcaaag attgctgttc gaaatgccaa 720 ccaacccgta tacttggtca gattgtgttt ggtgtgtacg atgatgatgt gcgccgcaaa 780 ctcctcgaac gaggagataa gttaaaattg gacgaagcga ttgacattct ccgcgtcgcg 840 gaggccgctt caacccaagc aaagaacttg acacaaggcg acgcgatggc cattcaagcg 900 ctatcaaagt catcctacaa gaaagataag ttaaacaaat cgtctaaacc tcaacacaaa 960 cagcagtcgt catcatttca accaagtcaa tcaaaaaagg acgagaagtc cgttaaattc 1020 gacacttctt ctaaaggttg ctggaactgt gggggtgaaa cccgtcaccc acgctcagaa 1080 tgccctgcta acggaaaaga ctgcgggaaa tgcggaaaaa ccggacactt taggaaagtg 1140 tgcaacagca gcagcaacaa gaagaatacg acaccatccg ctcaaacagg atgcattgga 1200 ttggaacctg aatcgccagc ccaagtgtca ggcattggag cattggaact cgttcgaatg 1260 aacatttctc cgaaagaagg cagtaaatca atcagccttt ccgtcttacc agacaccgga 1320 gctcagatcg acgccatacc agccgatttg taccacagcg agttctcacc cattaaatta 1380 atgcctggag aaactaatac gatcacggct atcggaagcc gtattgttaa ccttggccat 1440 ttcacagcaa caatctgctg gcctcatggt aattcaaaca ctaaagcaac gtcaatcatt 1500 cacgttttac aggatcttaa gcagccagtt atttctcaag cgactcagaa gaatttgggt 1560 atgcttcccg cttgctatcc taatcaacgc gttgagattt ttaacatcaa cgcccatcaa 1620 gaagcgtttt ccaacctgtt ggctaatatt catgcacatc caactgaggg tatcaaacaa 1680 aacgacttga aaaatttaac taacgacttt cctcttattt tcgacggagt ctgccgacca 1740 atggctggtc cggcatgcca ttttgaactc aagaaagacg ctgtccctgt agcgttcaga 1800 ggttcacgcc ccgtcgcaga gccgctgatg ccgcttttaa agaaagagct tgacctccta 1860 gaacaacagg acgttgttcg caaggtttcg aagccaacgg cttgggtcca ccccatcgtg 1920 atcgctccaa agagcgacgg gggtatccga gtgtgcggtg actttactat tttaaacaaa 1980 agtatcatcc gcccacgttt tgaaactgct acgccatttc aagcggtgag gacgatccca 2040 ccaggcatgc atttcttcac cgtcatcgac gctttgaaag gttatcacca ggtgctcctg 2100 gatgatgaat cggcggagat gaccacattt tcgaccccat ttggccgtta tcaatacaag 2160 cgtctcccgt tcggcgtctc cctcgcaggc gatgactact gccgccgcgt gtcggaggtc 2220 tttgacgacc tcgaaaactg ccgtagagta gtagaagacg tccttgtctt ctcgaagact 2280 tacgaagaac atttggtgtt ggtgcgaaaa ctctttgaaa gagccgcaca acacaatgtg 2340 gcaatcaacg tcagtaaact tgtcttcgca cagccgtcgg caaaatttgg cggatatatc 2400 gtgaactcaa ctggcttcag tccacacccg gatttgatga aggcaatcag agagtttccg 2460 gtcccgcgca acgccactga cgtacggtct ttccatggtc tgtgtcaaca agtcggtcat 2520 ttttcaacga aggtggccga atcccttaaa cctttgtcac ccctcctgaa aaagggttac 2580 atttgggaat ggacgacgac acacgaccag gcttttcaaa acgcaaggaa agctttatcc 2640 gaaatcactg atctggcttt ttacgacatg aatcacccga ctgctttaca tgtcgacgca 2700 tctcgtcttt tcggcctcgg ctttgtcctc aaacaaaagg acgcagacgg aaattggcga 2760 atggtccagg ctggatcacg atacctgtcg gatgcggaat cacgttacgc tatgatcgag 2820 ttggaatgtt tgagtgccgc ttgggcaatg cacaagtgcc gacaatttat cgaaggactc 2880 ccgcattttg aacttatcac agatcataga ccactaattc ctattttaaa cgattacagc 2940 ttggacaagc tggataaccc gcgtattctt cgcctcagac tgaaaatgca aaggtatcaa 3000 ttcacggccc gttgggtgcg aggaaaagac aacctcgacg ctgacgcgct atctcgcgca 3060 cccgtcgaca tcgcatctcc cggcgacgag ctagccgagg ggccaccctc tttttcagca 3120 agagcggcca tcatcaatac gatcaattgc tctgacattt ctgttacaga cccggttttg 3180 gacagaatca agacggcagc gcttgaagac aaacaaatga tcgaattgag agaagctatc 3240 atcaatggtt tccctaatga taaatgtaat ctctctcttt tacttcgtcc gttttggaac 3300 atccggactc agcttgcaat cgacgagtcg gatgggatga tcatggccgg ggccaggatc 3360 gtcattccgg agtcatgtcg ccgcccgatt cttcaggacc ttattcagat gcatcaaggc 3420 gcgaccaagc taaaacaacg tgcacgatta tcgttatact ggcccggtat ggacaacgat 3480 attgacatgg cagctcagtc atgcaaacat tgcactgaaa atcagccttc gcttccccgt 3540 gagccgcttc aacagcgtga agcagcaaca cgcccattcg agcagataca catggatatt 3600 gcctcagtca acggacgcga cttccttatt cttgtcgacc aatacagcgg atggcctgac 3660 gtcattccgt ttaaggacaa acacactaca gctcgacgca tcgtcgacgc ctctcgcgaa 3720 ttttttgttc atggacctgg cgcccctgta aaagtatgga gcgacaacgg tccccaattt 3780 agcgcaatgg agtttaaaaa tttcgcaaag gattggggtt tttcaacagg cacctcatcg 3840 ccgcactacc ctcaatcaaa tggtttagcc gaagccgctg tcaagagcat gaaaaaactc 3900 atcgctggat cttggacagc aggatctttc aacgtcgaca aattcgctaa atcgttattg 3960 ttattcagga acgcacctcg ttcaggcgca gcctccccgg cacagatggt gctcaaccgc 4020 cccgttcgag acgcgctacc cgctcatcgt cgctcatttg ctccagaatg gcaacaaaag 4080 actgatgtct tggaaaaaag ggcgagacgt gccaaagaag ttcagattga gcattacaat 4140 aaaacagctc actcacttcc acctctctct ataggcgatc acgtcgtcat tcagcacccc 4200 atttcgaagt gctggtcaac cccagccgtt gtggtggaaa ttggtccaca tcgggactat 4260 ttgttgaaga caccagcagg tcgactcttt cgccgtaacc gccggatgct ccggaaaagg 4320 gtaccggtta tgccaggaac aaaatcaaca tcgacgccaa cccctcccgc gccgatacca 4380 gttaagccag aaacaacgtc agccacgaag tcgatgcctc ccgcaccaga agcatcacct 4440 aaacaagcta aacctcgcca acgcggaaag ccaaagccgc ctacgtttgc tgtactacga 4500 cgatctaccc ggacaacgaa gccaccagac cgttacacaa attgatatgt ttgcagcacc 4560 cttttagtac atgtttaact gtatttgaaa aaaaaaaaaa aaaaaaaagg gaca 4614 // ID L1_Ele16 repbase; DNA; INV; 4463 BP. XX AC . XX DT 15-OCT-2010 (Rel. 15.1, Created) DT 15-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An L1 clade non-LTR retrotransposon family from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1_Ele16. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4463 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4463 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (15-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update. This consensus is generated from 4 CC sequences with >99% identity, and ~100% identical to the original CC sequence in [1]. XX FH Key Location/Qualifiers FT CDS 164..1201 FT /product="L1_Ele16_1p" FT /translation="MKRRENTFCIDYSAVPKKPSYEELHAFIGTQLGLTRN FT DVQRIQCSRSTGCAFVKVNNLELAQKIVVEHDSKHAIEVDGQRYTLRLQME FT DGAVAVRLYDLTEGTSREQISEYLSAYGDIISAEPEIWDDKYAFGGIETGV FT WIVKMMVTRNIPSYITIDGDSTYVAYYGQRHTCRHCGEYVHNGTSCVQNKK FT LLMQKLAVDPAARPTYANVTKTKAXSSINTKKPTPVGSPMLLRSAVSKSKD FT TTNLNHADKQPSDGLNVSASAATAAALDFTTTSNPHSMAAPTKTFKVPRLP FT NISGDRGLKNSKGNDDDTDDSTTSSSSRRSQGRPPGKKQKHYAGFEALGDN FT TDL" FT CDS 1204..4386 FT /product="L1_Ele16_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MDFLSYNIATVNINTITNATKLESLRTFIRTLELDIV FT FLQEVENENLQIPGFNVICNVDNMRRGTAIALRDYIHYTHVEKSLDGRLIT FT ARVNNVTLCCVYAHSGSQMRNDRERFFNDTLSYYLRHNTPHVILAGDFNCI FT LRQCDGTSTNSSPSLQATVTQLQLCDVWVKLKSRLPGHTYITHHSASRLDR FT IYVSSDLVSHLRTVDTHCCAFSDHKAVSARICLPSLGRAPGRGFWTLRAHL FT LTAENIDEFQVRWQYWTRQRRNYRSWXQWWLFYAKPKIKSFYRWKSKSVYD FT AFHLEHQRLYAQLQQAYEGLQGNRCMLMDINRIKGKLLTLQRNFTQTFVRS FT NENFLPGETFSSFHLGERRRKKTTIARLSDDQGNVIEGQEEIENHVVAYFQ FT TLYTAEQTDVLAEEEFVSDQAIPNDDAANDAIMNEINTAEIFAAIKTSAAN FT KSPGPDGIPREFYLRSFEVIHREMNLVLNEALAGDFPTEFVDGVIVLVKKK FT GAGTDMHSYRPISLLNVDYKILSRILKLRLETVLRVHRVLNDSQKCGNSDK FT NIFQATLAIKDRIAHLIRRKQPGKLISFDLEQAFDRVSRRFLSRTMTSLGL FT NRQMITLLNAIAERSSSRILINGRLSQTFPIQRSVRQGDPLSMCLFVIYLH FT PLLRQLELVCDTDLIVAYADDITVIVTSTQKIERMRSLFLCFERAAGAKLN FT MMKTTSIDVGFIDQGTLVVPWLNTEVKIKILGVIFANSIRVMIKLNWDAVI FT NNFARLTWLHSLRCLNLNQKVLVLNTFITSKIWYLASILPPYSVHTAKITA FT TMGTFLWSRVPARVPMQQLARNREEGGLKLQLPAIKCKALLVNRHLQELNC FT MPFYKSFIDGNNTTPVTIPADLQCLRLLHHQLGYIPQNIQQNPCAGSILQF FT YLQATEPPKVEKNNPRHDWKRIWRNINDSKFSTKTRSELFMIVNEKIETRR FT LMNILGRTDGENCVHCGARVETVIHKYSECQRVTEAWALVQRRISTIAGGW FT RRFSFVDLLRPTLIGINRNNRIKILNLFSLYITFINLCNDVVDIDVLEFDI FT QCGI" XX SQ Sequence 4463 BP; 1387 A; 1023 C; 962 G; 1089 T; 2 other; agtttatgct catcttttga gccaagcaga cggtttgtac tatccgcgcg gtttgtaacc 60 gatttctgcg agtttttcgt taaagttctc cgcaactgtc gtgccgtggg cacgtggtgt 120 caatcgtatt agcggccacg attgctccgg gcgattaatc gcgatgaaac gccgtgaaaa 180 tacattttgt attgattact cggcggtacc caaaaaaccc tcatatgagg aactacacgc 240 tttcatcgga actcagcttg ggctgacgcg aaatgatgta caacgcatac aatgcagcag 300 atccaccggc tgcgcgtttg tcaaagtgaa caatctggag cttgcccaga aaatcgttgt 360 cgaacacgac agcaaacacg caatcgaagt tgatggacag cggtacactt tacgactaca 420 gatggaagat ggagctgtag cagtgaggct ctacgatctc accgaaggta cttctcgtga 480 gcagatctct gagtatcttt ctgcatatgg agatatcatc tcagccgagc ccgaaatatg 540 ggacgacaaa tacgcgttcg gtggtattga aaccggggtc tggattgtaa aaatgatggt 600 cacccggaat atcccgtcat acataacaat cgatggagac agcacatatg ttgcttatta 660 tgggcagcgt catacatgtc gccattgtgg agagtacgtc cataatggga catcttgcgt 720 acagaataag aagctgctaa tgcagaaatt agctgttgac ccagcagcaa gacccacgta 780 tgccaatgta acaaaaacaa aggctwcgag tagcatcaac accaaaaaac ctacacctgt 840 aggatcccct atgttgttac gtagtgcagt gtccaaatct aaagatacca ctaacctcaa 900 ccatgccgac aaacaacctt ccgatggatt aaatgtgtct gcatccgctg ccacggctgc 960 cgcattggac ttcaccacaa caagcaatcc gcattccatg gcggctccaa cgaaaacatt 1020 caaagtacca cgactaccga acatcagcgg agataggggt ttgaagaact ccaaaggaaa 1080 tgatgacgat accgatgact caaccacgtc ctccagcagc aggcgctctc aggggcggcc 1140 tccaggtaaa aaacaaaaac actatgctgg attcgaagca ctgggagata atactgattt 1200 gtaatggatt tcttgagcta caatattgcc acagttaata taaacactat aacaaacgcg 1260 actaaactgg aatcactacg tacttttatc cgaacacttg agcttgacat tgtgttctta 1320 caagaggtcg agaacgaaaa cctgcaaatt ccaggtttca acgttatttg caacgttgat 1380 aatatgcgtc gaggcacggc aatcgctctc cgagattata ttcattacac acatgtcgag 1440 aaaagcttag acggtcgttt aataacagca agagtaaaca acgttacgct ctgttgcgtc 1500 tacgctcact caggtagcca aatgcgtaat gatcgagaga ggttcttcaa cgatacactg 1560 tcctactatc tgcgccataa cacaccacac gtgatcctcg ccggtgattt caattgtata 1620 ctccggcagt gcgatggtac tagtaccaac tctagtcctt ccttacaagc aacagtaaca 1680 caactgcaac tgtgcgatgt ctgggtaaag cttaagtctc ggctcccagg gcatacatac 1740 ataacgcatc actcagcatc caggcttgat cgcatatatg tatcgtcgga tctcgtaagc 1800 catctacgaa cagtggacac acactgttgc gctttctccg accataaagc tgtatctgcc 1860 cgaatttgtc taccatcact aggtagagca ccaggcagag ggttctggac gctccgcgca 1920 cacctcctaa cagcagaaaa tatcgacgag ttccaagtcc gctggcaata ctggacacga 1980 caacggcgga actatcgctc ctggmtacag tggtggctat tctatgcaaa gccaaaaatc 2040 aaaagtttct accgttggaa atctaagtca gtatatgatg catttcacct agaacatcaa 2100 cgtctgtacg ctcagctgca acaagcatat gaagggctcc agggaaaccg ttgcatgctg 2160 atggatatta atagaatcaa gggcaaatta ctaacactcc agcgaaattt tacccagaca 2220 ttcgtccggt caaacgaaaa ctttctgcca ggggaaacat tctcgtcgtt tcacctagga 2280 gagaggagaa ggaaaaaaac aacgatagcc cggctgagcg atgatcaagg caacgtaatc 2340 gaaggccaag aagagattga gaaccacgta gtagcatact tccaaaccct gtacactgcc 2400 gaacaaacag atgtgttagc ggaagaggaa tttgtgagtg accaggcaat accgaacgac 2460 gacgcagcaa acgatgccat catgaatgaa atcaacacag ccgaaatttt cgcagcaatt 2520 aagacaagcg cagcaaacaa atctccagga cctgatggta ttccacgtga gttctatttg 2580 cgatccttcg aggtgataca ccgagagatg aatctggtac taaacgaagc tcttgctgga 2640 gacttcccaa cagagttcgt ggacggtgta atagtactag tgaagaaaaa aggagctggg 2700 accgatatgc attcatatcg accaatatca ctactaaatg tcgattataa aatcctctcg 2760 cgaatactca aattgagatt ggagaccgtc ttacgagtac atcgagttct caatgattct 2820 cagaaatgcg gtaactccga caaaaacatt tttcaagcga ctctcgctat caaagataga 2880 atcgctcacc tgattcgccg taaacagcct gggaaactca tctcatttga tttggagcaa 2940 gcttttgatc gtgtctctcg gagattccta tcacgcacga tgacttctct tggtctaaat 3000 aggcagatga taacactact aaacgctatt gcagaacgtt cttcgtctcg aatactgatc 3060 aacgggagac tatcacaaac ttttcctata caacgatcag ttagacaagg agatccgtta 3120 tcgatgtgtt tattcgtgat ctacttgcat ccactgttgc gacaactaga actcgtctgc 3180 gatacggatc taatagtggc ctatgctgat gacatcactg tgatagtgac atcgacacag 3240 aaaattgagc gaatgagaag cctattcctc tgcttcgaac gtgcagctgg ggcaaagctg 3300 aacatgatga aaacaacttc aattgatgtg ggcttcattg atcaagggac attggtcgtt 3360 ccatggctaa acaccgaagt aaaaatcaaa atacttggag tgatctttgc aaattcaata 3420 cgcgtcatga tcaaactgaa ctgggatgca gttataaaca attttgctcg gctaacgtgg 3480 ttgcactcac ttcgttgcct taatttaaac caaaaggttt tagtgttgaa tactttcatc 3540 acttcgaaga tttggtactt agcctcaata ttgccaccat acagcgtaca tactgctaag 3600 attacggcca caatggggac atttctatgg agcagagttc cagctcgtgt ccctatgcag 3660 cagttggctc ggaatcgaga agaaggagga ctcaaactac aactgccagc tatcaaatgt 3720 aaagccctct tagtgaatag acacttgcag gagttgaatt gcatgccgtt ctacaaatcg 3780 ttcattgatg gaaacaacac gacccctgta accattccag ctgatctcca gtgcttgagg 3840 ctactgcatc atcaactagg atacatcccc cagaatatcc aacaaaatcc ttgcgctggt 3900 agcattctcc agttttatct ccaagcaacc gaacctccaa aagtagaaaa gaacaatcct 3960 aggcacgatt ggaaaaggat atggcgaaac atcaatgact ccaagttttc aacaaagaca 4020 agaagtgagc tgttcatgat cgtcaacgag aaaatagaaa cgagaagact aatgaacata 4080 cttggacgga ctgacggaga aaactgcgtg cactgtgggg ctcgagtaga gacggttatt 4140 cacaaataca gcgagtgcca acgagtcacg gaggcatggg cactggtgca aagaagaatc 4200 tcaacgattg caggcggatg gaggaggttt tcatttgtgg acttgctgcg accgaccctg 4260 attggaatca acagaaacaa cagaattaaa atcttgaatt tgttttccct ttatatcacc 4320 tttataaatc tttgcaatga tgttgtcgac attgatgttc ttgaatttga tatacaatgt 4380 ggaatttagt ttattatgtg atgtaatatg acaataattt tactcaataa attaaccaac 4440 tttaaaaaaa aaaaaaaaaa aaa 4463 // ID BEL-192_AA-I repbase; DNA; INV; 6367 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-192_AA_; KW BEL-192_AA-LTR; BEL-192_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6367 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 875-875 (2011). XX DR [2] (Consensus) XX CC Positions [4999-5559] - Integrase core CC 'ACAT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 24..3416 FT /product="BEL-192_AA-I_2p" FT /translation="MDVIYPCCEEDDQYDAMVQCGRCQKWHHFSCVGVDQS FT IAELVWYCRTCLGEDVSARESLGQPGVTRIQPSRAAKKSKDHTGKISDKTS FT QKRIDILVGTKGQKEVAERPTSQQEGNKGQETCQAGTSAAGQTSIAKSVKP FT SRSRVSIISENSRSESNKSGKSSKISGTSKSSSSKRDLALKRIEELKERSR FT LEALEEETQLQMLKIKQKQIERQRRELEELQQLSKLVEEESECNEEPDNQV FT GSDISYIDKVQGWMNSCNGANPGRPDTASEANASQFSLMFPEVSENEIIRR FT QQVQQRTSSMVVGPTTHQIASRQVFPKNLPAFAGYPQDWPLFISAFEQANA FT SCGFTHAENLVRLQQALQGKALEIVRNLLLLPENVPMIIDKLRRRFGNPEI FT LSTMLAQRIQKLEGPDSENLESLIEFGSAIGEFTQHLEVSKLNDHLKNPIL FT MQSLIQKLPPCYAMQWVEYKRNCQVVDLKTFGKFMEDLVDKALEVTFERIE FT VGSSKRREKPKAKVFTHLMKEDHEVNDSKANLYNKPSSGQVQISQRKDVSC FT SICEELGHLGRDCREFRNASLERRWMIVKHMGLCPLCIFNHGKWPCRSRNR FT CEVEGCNGLHHSLLHNPTNAEAVNREARCNSHCLPNKTVLFRIIPVSLYGE FT HRRWNTLALVDEGSSTSLIDNEVAEFLQLDGPREPFEMRWTNGVSRTESHS FT RNVEMKISGIDKIKESEICVARTVNKLDLPAQQLDASKLIEEFRHLEGVEI FT SSYALDTPKLLIGIDNMHLIAPVASQIGNKGQPIAVKCKLGWTIYGPRPNL FT LSNVHFLGHHRCGCEECSKADQDLNQMLRDSFQLEAVGVSPVRLESRDDCK FT AREILERTIKRVGDRFEIGLLWKDGVPRFPKSYSMAYNRLKNLEARLKRNP FT SIRDILQRQIDEYVSKGYAHKITERELLETPQEHCWYLPLNYVVNPKKPEK FT VRMVWDAAARTNGVSFNDSLLKGPDLVASLSGVINGFRERKVAFGGDIREM FT FHQILVRQEDKQAQRFLFRSDTNSEPDIYVMDVVIFGASCSPCLAQFVKNF FT NALEHQQKYPIASDAIIRKHYVDDYFDSVDTEAEAISRAKDVRKIHADAGF FT EIRNWVSNSSKVLEELGEAEVQS" FT CDS 4060..5904 FT /product="BEL-192_AA-I_1p" FT /translation="MGEIRLRRYKPFVAHRVGEILSKTSPEDWRWVPSKHN FT PADDVTKWGEATQLSSNNRWFCGPEFLYQTEDNWPVQKWNSLEIEEELRAN FT VLFHDIVLPNAVLPRVEHISKWKVLVRMVATIYRFVSNCRRRIDKQPIEAL FT PITKKGLAKIPAKQVPLRQEEYLLAENLIWRVTQGEAFADEVKTLIKNQDS FT SVNQFGAIEKSSPLYRLSPFLDQSGVVRMEGRTTAANYATFDVRFPVILPK FT EHIIARKLVEFYHQQCAHGSREMVVNEILQRFYIPGLRSLVESVTRDCIWC FT KVRKAKPVVPRMAPLPPSRMAVREHPFSYVGVDYFGPVEVAVGRRREKRWV FT ALFTCMTIRAVHLEVVHSLTTQSCKMAIRRFVKRRGSPIEIFSDNGTNFVG FT ASKDLAEQIRAINSECADTFTGARTKWTFNPPSAPHMGGVWERMVRSVKEA FT MQVLAYGERLTDEILQTTLVEAENLINSRPLTYVSTNVQEDKEAITPNHFL FT TSCPLVDCMPSRSSTELADRLRNSYSQAQYLADELWERWQREYLPTMNRRT FT KWYKESKPLAVGDLVYVADSEKRRTWERGVIEEVFAGDDGRIRSAIVRTKT FT GLKKRAVAKLAVLEIGA" XX SQ Sequence 6367 BP; 1947 A; 1208 C; 1671 G; 1541 T; 0 other; acaaagctca aaagaattta ccgatggatg taatctaccc gtgctgcgag gaagacgacc 60 agtatgacgc gatggtgcag tgcggtcggt gtcaaaagtg gcaccacttc tcatgtgtcg 120 gggtggacca aagcatagcg gagctagtgt ggtactgccg tacgtgcctc ggagaggacg 180 tttcagcacg tgagtcgtta ggacagccag gcgtcactcg tatacaacct tctagggcag 240 ctaagaagag taaagatcat accggtaaga taagcgataa gaccagtcag aaacgaatag 300 atatactagt cggaacgaaa gggcagaagg aagtagctga acggccgacg agccagcaag 360 aaggaaataa aggccaagaa acatgtcagg cagggacatc agcggcagga caaacaagta 420 ttgctaagtc agtcaagccg tcgcgtagtc gcgtttcgat tatcagtgaa aattctcgtt 480 ctgaaagtaa caaaagtggt aaaagttcaa agataagtgg caccagtaag tcgagcagtt 540 ccaaacgtga tttggcgttg aaacggatcg aagaattgaa ggaaaggagt cgtttggaag 600 cgctggaaga agaaactcag ttgcagatgt tgaaaataaa gcagaagcag attgaacggc 660 agcggcgcga attggaagag ctacagcaac tctctaaatt ggtcgaggaa gagtcggagt 720 gcaacgaaga acccgacaat caagttggaa gtgacatcag ttacatcgat aaggtacagg 780 gatggatgaa ctcttgtaat ggtgcgaatc ctggtaggcc ggacaccgcg tcagaggcta 840 atgctagtca gttctcactt atgttccccg aggttagtga gaacgaaatc attcgaagac 900 aacaggtaca gcaaagaaca tccagcatgg tagtaggtcc aacgacacat cagattgctt 960 cgcgtcaggt gttcccaaag aatcttcctg ctttcgcagg ttatccacag gattggccac 1020 tttttatcag cgcattcgaa caagcaaatg cttcttgtgg attcacccat gcagaaaacc 1080 tcgtcagact tcaacaagcg ttacaaggaa aggcgttgga gatagttcgt aatctgttgc 1140 tgttaccaga gaacgtgcca atgattatcg acaagctacg tcgtcgcttc ggaaatcctg 1200 aaattctgtc aactatgcta gctcagcgga tccagaaact cgaaggacca gattcggaaa 1260 atttggagtc cttgattgag tttgggagtg caatcggaga attcacgcag catctggagg 1320 tttcgaaact gaacgaccac ttgaagaacc ccatattgat gcaaagtctc atacagaaac 1380 taccaccatg ctatgccatg caatgggtgg agtacaaacg caactgccaa gtggttgact 1440 tgaaaacttt tggaaaattt atggaggatc tggttgacaa ggcgttggaa gtaacgtttg 1500 aaaggatcga agtagggtca tcaaaaagac gtgaaaaacc gaaggccaag gttttcaccc 1560 atctgatgaa agaagatcat gaagtaaacg atagtaaagc taatctgtac aataagccat 1620 cgtcaggaca ggttcagatt tctcaaagaa aggacgtcag ttgctctatc tgtgaggaac 1680 tgggtcattt gggtcgggat tgccgagaat tccgcaacgc cagtttggaa cgaaggtgga 1740 tgattgtaaa gcatatgggt ttgtgcccac tgtgcatatt caaccatgga aaatggcctt 1800 gtaggtcgag gaatcgttgc gaggtcgaag ggtgtaatgg cttacatcat tcgttactac 1860 acaacccgac taatgcagag gctgtcaata gagaagcacg ttgcaacagt cactgtttac 1920 caaataagac ggtgttgttt cgcataattc ccgtttcgtt gtatggagaa caccgaagat 1980 ggaatacact agcgttggta gatgaaggat cgtctacatc gcttatcgat aatgaggtag 2040 ctgagtttct acagttggac ggaccccgcg aaccatttga aatgcgctgg acgaacggag 2100 ttagtcggac cgaatctcat tcaaggaacg ttgagatgaa gatttcggga atcgacaaaa 2160 ttaaggaatc agaaatatgt gtagcaagga cagttaataa gctggatctt ccagctcaac 2220 aattggatgc aagcaagctg atcgaagaat tccgacactt agaaggagtt gaaatatcga 2280 gttacgcgtt ggatactccg aagctgttga ttggaatcga taatatgcac ctcatcgctc 2340 cggttgcatc gcaaatagga aacaaaggac agccgatcgc tgttaaatgc aaattagggt 2400 ggacaattta cggtccacga ccgaatctac tgtcgaatgt acatttcctt ggtcatcatc 2460 gatgtggttg cgaggagtgc agtaaagcgg atcaagattt gaaccagatg ttgcgggaca 2520 gttttcaact cgaagccgta ggagtttctc cagtaagatt agaatcaagg gatgattgta 2580 aagcgcgaga gattctcgaa agaacaataa aacgagtagg tgatcgattc gagatcggcc 2640 ttctttggaa agatggagta cccagatttc caaaaagtta ctccatggcg tataatcgat 2700 tgaagaatct agaagctcga ttgaaacgga atccgagtat ccgggacata ctgcaaaggc 2760 aaattgacga gtatgtgtcc aagggttacg ctcacaaaat cacggaacgt gagctgttag 2820 aaacaccgca ggaacattgt tggtatttac ctttgaacta tgtcgtcaac ccgaagaaac 2880 ccgaaaaagt aagaatggtg tgggatgctg cagcaaggac taatggagtc tcgtttaacg 2940 atagtcttct aaagggacca gaccttgtcg cttcgttgtc cggtgttatc aatgggtttc 3000 gtgagcgaaa ggtggcattt ggcggtgata tccgagagat gtttcaccag atcctggtgc 3060 gacaggaaga taagcaggct caacgtttct tgttccgatc tgatacgaac agcgaaccgg 3120 atatttatgt gatggacgtg gttatttttg gggcaagctg ttccccttgt ttggcgcaat 3180 ttgtgaaaaa cttcaatgcc ttggaacatc aacagaagta tccaatagca tccgatgcca 3240 ttatccggaa acattacgtt gacgactatt tcgatagtgt cgacacggaa gccgaagcta 3300 tttcgcgtgc taaagatgtc cggaaaatac atgcagacgc tgggttcgaa attcgaaact 3360 gggtgagcaa ctcatcgaag gtactagaag agctcggcga agcggaggta caatcatgac 3420 gaagtctttt gataaagctt cagatgcaga aagagtcctt ggaattatgt ggcaaccgga 3480 gaaagattcg ttcgtattct ccaccgaatt cagggaagat ttgcgaccat acattcagga 3540 aggagcttgg cctacgaaac gaatagcttt acgctgtatt atgagcatgt ttgacccgaa 3600 gcagtttctt gcgccggtgt tgatccacgg acgtatttta atgcaagact tgtggcgaag 3660 cggtatcggc tgggacgaga aaattggaaa ggcacatcac gaccgttggc tacgatggac 3720 agcattgttt ccattgatag ataatataag tattccacgc tgctatcttg ggaagatgag 3780 ccctaatact tacgatacag tacagctaca tgtttttgcg gatgctggag aggacgctta 3840 cggttgcgta gcatacttcc ggtttacaga tggagaaaat tttcattgtg cgttggtaga 3900 ggcaaaggca aaggtggctc ccttacaaca cctgtctatc ccgagaaagg agttagaagc 3960 agctgtacta ggagcaagat tgcttgtagc gataagtgaa aatcattcaa ttgaggtgaa 4020 gaagcgtttt ctttggactg attcgaatgc agttgtttca tgggtgaaat cagacttagg 4080 cgttataaac cattcgtggc tcatcgtgtc ggagagatac ttagtaaaac aagtccggaa 4140 gattggcgtt gggtaccctc gaaacacaat cctgcagatg atgttaccaa atggggagaa 4200 gcgacacaac taagttccaa caatcgatgg ttttgtggtc ctgaatttct gtatcaaaca 4260 gaagacaact ggccagtaca gaagtggaat tcgttagaaa ttgaggaaga attacgagct 4320 aatgtgctat ttcatgacat agttttgcca aacgcggtgt taccaagggt ggaacacatt 4380 tccaagtgga aagtacttgt tcggatggtg gcaacaatat atcgtttcgt ttcgaactgt 4440 cgacgtagaa tcgataaaca accgattgaa gccttgccta taacaaaaaa ggggcttgcg 4500 aagattccag caaagcaagt tcctttacgt caggaggagt atctgttagc agaaaattta 4560 atttggcgtg taacgcaggg tgaagccttc gcagacgagg tgaagacgtt gatcaagaat 4620 caagattcat ccgttaacca atttggagca attgagaaga gcagtccatt gtaccggtta 4680 tcgccgttct tggaccaatc tggagtagtg cgaatggaag gtagaacaac agcagctaac 4740 tacgcaactt ttgatgtgcg gtttccagta attttaccga aagaacatat cattgcccga 4800 aaattggtgg agttttacca ccaacaatgt gcccacggta gccgagaaat ggtggtaaat 4860 gaaatccttc agcgattcta cattccaggg ctgagatctt tagtcgaaag tgttacaaga 4920 gactgcatat ggtgcaaggt tcggaaggcg aaaccagttg ttcctagaat ggctccactt 4980 ccaccatccc ggatggctgt gagagaacat ccattttctt acgttggtgt agattatttc 5040 gggccggtgg aggtagctgt tgggcgcaga cgtgagaaac gttgggtggc attatttacg 5100 tgtatgacaa tacgtgccgt acacctagaa gtagtccaca gcctgacaac acagtcatgt 5160 aaaatggcaa ttcgcagatt cgtgaaacga cgtggaagcc cgattgagat cttttcagac 5220 aacgggacaa attttgttgg agcgagcaaa gacttagcag aacaaatccg agctatcaac 5280 tcagaatgcg ctgatacgtt cactggagca agaacaaaat ggacgttcaa cccaccttcg 5340 gccccccata tgggaggtgt ttgggagcgc atggtacggt cagtaaaaga ggcgatgcaa 5400 gtacttgcgt atggagaaag attgacagac gaaattctgc agacaactct tgtagaagca 5460 gaaaacctaa ttaactctcg tccgttaaca tatgtgtcta cgaacgtaca ggaagacaag 5520 gaagccatta caccgaatca ttttttgacg agttgtcctt tagtggattg catgccttcg 5580 agaagttcaa cagagctagc tgatcgacta cggaacagtt atagccaagc acagtatttg 5640 gcagacgaat tgtgggaacg ttggcaacgg gagtatctac ccacgatgaa ccggagaacg 5700 aagtggtata aggaaagcaa acctttggct gttggagatt tggtatatgt tgcggattcg 5760 gagaagcgaa ggacctggga gcgtggagta attgaagaag tttttgctgg agacgatgga 5820 agaatacggt cggcgatagt gcgtactaaa acaggactga agaagcgagc agtagcaaaa 5880 ttagcagttc tggagatcgg agcctaggag tggtaagcac agtagtgact accatcaact 5940 acagggctta cgggccgggg agatgttacg ccgggataga aagggttcac tccgtggacg 6000 cgaccatgaa aacgttggtt tagctgagag cgtgacggag atataaacag atggaatgac 6060 tgtcaaaccg tagggcgctg acatattagt tgaatgctag ttgggagagc tgaaactgaa 6120 gtgtaagaat attgaattta tcgaagatat attggaaata ttagagaaat aggattgtta 6180 ctgaaattaa atgttgaatt gaaatagtaa gtagaaaatt gtattaatta aaccgcaaat 6240 tgatgccgta gaattggatg aactgaaagg attctatgaa atagcgcaca agtggttaca 6300 taccgtgaat tccataatat acaacctaat acttactgat tgttcagtgt cgattcaggg 6360 tatacat 6367 // ID Gypsy-6_DPu-I repbase; DNA; INV; 5608 BP. XX AC scaffold_54; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_DPu_; KW Gypsy-6_DPu-LTR; Gypsy-6_DPu-I. XX NM Gypsy-6_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5608 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 727-727 (2010). XX DR Genome; scaffold_54; Positions 506564 500957. XX CC Positions [2562-3065] - Reverse transcriptase CC Positions [4185-4661] - Integrase core CC 'CCAG' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 226..1290 FT /product="Gypsy-6_DPu-I_1p" FT /translation="MPSKRVAVVPPVDYHIRTRSASRNLSQVVDSPRLVAL FT IPDSDSESEERPSRRLNFDISPVREEPEQVNVPFTMDPDLAQALTNFANGA FT AQDRAQYQGRHDALMNLLMAQQQESAALRALVTTQQQQAGAARGHINNAVV FT ESVPMFEGKLGESGRHWVRILNQVGDAEGWTNAQKRQVAVRRLGGIALQWH FT LQSGANNPNWQNWSTALGTNFTQRLSPSEWYKLIEERVQKTGEPGIAYAVE FT KSRLLDLSPHVLTNAQKVSYLIGGLANWQHVAAMMNDTPEDVDEFMTRLRN FT LETLFSTLRSNVPPSPQIIRFFSRLRCHQCRCIHHQLRHPCLLQHHTLMSD FT RHCCQWETGSLD" FT CDS 1731..5270 FT /product="Gypsy-6_DPu-I_2p" FT /translation="MAPFPLILGIDWLVKANINLVCKDNQIVPVFIKDVEQ FT EADIAKTDASIPEEEKRSLPSEEFFQDLARDLPAVRRKGSVRFRIRKAVEV FT PGESLRMLKGSIPINFTGTGIVRFGFSAEPSKTWVVPSALVSFKNGKAKVP FT LLNLESKSVILKRRNCVIFIDLDLDAQIAVIPTEQKKKSNDRAARPARMSC FT AAVRGYLRAAIRGDVNTGPNLSPAEKDKVLELLDQHRRCLPSEETQLGRAL FT GIQHNIDTGNSRPITSRPYRISQFERRIISEKVNEMLKSGVIQPSNSPWSS FT PVVLVKKKSGEYRFCIDYRRLNAVSKRDVYPLPRIDDVFDRLAGAKYFSTL FT DLANGYWQVPVAEKDCQKTAFVTPDGLFEFKRMPFGLANAPATFQRLMDQV FT LNRLKWTACLVYLDDILVFGKTFEEHQSRLNLVLTALGEANLVLNMKKCLF FT AADEVNHLGHIVKADGIRPDPEKVCALEKMEVNSVKTLRAFLGLASYYRKF FT IPEFSHLTAPLVTLLKKNAKWNWSEKQKEAVRALVRLLSEEPVLVHFDESL FT PTEIHTDASNFGLGAVLAQKVDGEVKPVAFLSRALSDAESRYHSNELECLA FT LVWALKKFRCYVYGRPFTVRTDNSAVKWLWDKKELSGKFSRWILSIQEYDF FT QIEHVKGKNNVVADALSRNLGGKEECRKSSTEHRVSCVLKSDGYTAKELAF FT LQQVDKELRPITNKILTNKKDPDFALRKGVVYKKSKSNTGRKLLLMVPSIL FT RRDLISKCHDEPQANHFGIEKTLARVVENYWWPRMESSVRAYVLSCIHCQF FT HNVPPGAPVGFLNPIPPPTRPFDTIGIDHQGPLLETSSGNRHILFAIDYLT FT KWVEAVAVPSTATCYVIKLLKEIINRHGVPSRIISDPGSAFTSRELATELD FT KWKIRHVIATAEHPQTNGLVERPNRTAARAIAGFISPNHRDWDERLSDAVS FT AIYSAKQSTTKHSPFQLVYGRLPNRPDNNLFEWPEDGPVSRRRFIKRVEQL FT RKAARFNIIRRQDRTKLLCDARRKASKPFHQGDLVLVRRMLKKKGLTKKFL FT PKYIGPFQIVKKVAETTYRVEELPSRRKKKVVRRFNAHVVQLKPFRARSDE FT WHVSEIDEPTGSENDIENRTEAEVTQVAVVPVMEEILAQQLPVPEVPIRPP FT VRTRSGRISRLPTIFEDFDLN" XX SQ Sequence 5608 BP; 1564 A; 1166 C; 1419 G; 1459 T; 0 other; attggtgtca gaagagggat acaagcctta gaaaaattca cattcgtgaa ctgctcgtag 60 aaccatcgtc ctgaattaat tttgttgtcc tttgcccaaa tcgtttgaaa ttttttctcg 120 tcgccggggg cgatagcgtg gtcgcccgga cttaagaatt ttttttttct ctctctctct 180 ttttttttgt ttgttctttc aatttattta atttttgtgg tatttatgcc gtcgaaacgg 240 gttgctgtag ttcccccagt agattatcat attaggacgc gaagtgcttc gagaaatctg 300 tcccaagtag tcgattcccc tcgattagtt gcattaattc ccgacagcga ttcagaaagt 360 gaagagaggc cctcacgtcg attgaatttt gatatctcgc ctgtcagaga agaaccggaa 420 caagtaaacg taccgttcac catggacccc gacctagccc aagctctcac aaattttgca 480 aatggcgctg ctcaggatcg ggcacagtat caggggcgtc acgatgcgtt aatgaatctc 540 cttatggccc agcaacaaga aagtgcagcc ttgcgggcac tggtaactac tcagcagcaa 600 caagcgggtg cggcgcgtgg acatataaat aatgcagttg tagagtcagt ccccatgttt 660 gaaggtaaat tgggagagtc cggtagacat tgggtccgta ttctcaatca ggttggtgat 720 gcggaagggt ggacaaacgc acagaagcga caagtggccg tcaggcgtct aggaggaatt 780 gccctccagt ggcatttgca atcgggcgcc aataatccaa attggcaaaa ctggtccact 840 gcgctgggga ctaactttac acaacgttta tcaccatctg agtggtacaa attgatcgaa 900 gaaagagttc agaaaacggg agaaccgggg atcgcgtatg cagtagaaaa atctcgtttg 960 cttgatcttt ccccccatgt cttgactaat gcgcagaagg tttcgtatct gattggtggg 1020 ttggcgaatt ggcaacacgt ggcggccatg atgaatgaca caccggaaga cgtcgacgaa 1080 ttcatgacga ggttgagaaa cctcgagacg ttattttcta cattacggtc aaatgtgccc 1140 ccctcccccc aaataatccg ttttttcagc cgtctacgtt gccaccagtg ccgatgcatc 1200 caccaccagc tccgacaccc ctgcctcctg cagcaccaca cattgatgtc ggatcggcac 1260 tgctgtcaat gggaaaccgg atcgctggat tgacggaacg actgaatgga ttgacgctgg 1320 gaggaggcca gccaacagcc agataatcgt ggttgctttc agtgtggagc gatcggacac 1380 atcaggagaa attgtcctcg tggcgcggga aacggcctgg ccccgtcggc tggccaaggc 1440 cagcgctgac gggaaaacac attgatgaat cgtcccaacc aactcaactg cctgtcatcc 1500 gcgttttcat tcaaggagtt ggtgagacgg acgccttgct ggacacagga tccagcataa 1560 caacagtaag gttaccagta gtaagacgaa ttgttaagaa taagagtact cgtatccttc 1620 cagggataag gggaatagat aataaggttg taaaagtagt agatgaaatt cctttgcaaa 1680 tttgttttaa agacactaaa gtgtctcttg aaaatgtcgc agttgttgaa atggccccct 1740 ttcctttgat tttgggaata gattggctgg tgaaagccaa tataaattta gtctgtaagg 1800 ataatcaaat tgttccagtg tttataaaag atgtagagca ggaagccgat attgccaaga 1860 ccgatgcatc gattccggaa gaagaaaaaa gatcgctgcc gtcggaagaa ttttttcagg 1920 atttggctag agatttgccg gcggtgcgac ggaaaggatc cgttagattt agaataagaa 1980 aggccgttga agttccagga gaatcgcttc gcatgttaaa aggatcaatc ccaattaatt 2040 tcaccggaac gggcatcgtg cggttcggat tttcagcaga gccatcgaaa acctgggtgg 2100 tgccttccgc cttagtttcg ttcaagaacg ggaaggccaa agttccgttg ttgaacttgg 2160 aatcaaagtc agtaatatta aaacggagaa attgtgtaat tttcattgat cttgatttag 2220 atgctcaaat cgccgtcatt ccaacagaac agaaaaagaa atcgaatgac agagcagcaa 2280 ggccagcgcg gatgagctgc gctgcggtga gaggatattt acgagcggcc attcgtggag 2340 acgtcaacac gggacctaat ctatcgccag ccgaaaagga taaagtcctt gaactgctgg 2400 accagcatcg tcgctgtctc ccttccgaag agactcagct aggcagagca ttgggcatcc 2460 aacacaacat cgatactggt aattctcgtc caatcactag ccgcccgtat agaatttcac 2520 agtttgaaag aaggataatc tcagaaaagg tgaatgagat gctaaagagt ggggtaattc 2580 agccatccaa tagcccgtgg tcatcacctg tagttcttgt aaaaaagaaa tccggcgagt 2640 atagattttg tattgattat aggaggttaa atgctgtttc taaaagggac gtttatccgc 2700 tgccacggat agacgatgtt tttgataggc tcgccggcgc aaagtatttt tctacgttag 2760 atcttgctaa tggatactgg caggtaccag ttgcggagaa agactgccaa aaaacagctt 2820 ttgtaacgcc cgatgggcta tttgaattta aaagaatgcc gtttggcttg gcaaatgcgc 2880 cggccacatt ccagagatta atggatcagg ttttgaatag gctcaaatgg acggcatgtc 2940 tcgtctattt ggatgacatt cttgtttttg gtaaaacttt cgaagagcat cagagtaggt 3000 taaatttagt tttgacggcc ctgggcgagg ctaatttagt tcttaatatg aaaaagtgtc 3060 tttttgcagc agatgaagtt aatcatttag gtcacatagt caaagcggat ggaataaggc 3120 cggacccaga gaaagtatgt gccttggaga agatggaagt aaattcggtg aagacactaa 3180 gggcattcct gggccttgcg tcatattacc gaaaattcat cccggagttt tcacatttga 3240 ctgcgccgct cgtgaccctg ctgaaaaaga acgcaaagtg gaattggagt gagaagcaga 3300 aggaagccgt tcgtgccctt gtccgtctat tatcggaaga accggtgttg gtgcattttg 3360 acgaaagcct tccgacggag atccatacag acgcaagtaa ttttggatta ggagccgtcc 3420 ttgctcaaaa agtggacgga gaggtaaaac cggttgcttt tcttagtaga gcgttgtctg 3480 acgctgaatc gaggtatcat tcaaatgagc tggaatgctt ggccttagta tgggcattga 3540 aaaaatttcg gtgttatgtg tatgggcgac cttttaccgt ccgtacagac aactctgcag 3600 taaaatggtt gtgggacaag aaagagttgt cggggaaatt ttcaaggtgg attttgagca 3660 ttcaagaata cgattttcaa attgaacatg tgaaaggtaa aaacaatgtg gtcgctgatg 3720 ctctctcgag gaatcttggc ggaaaagagg aatgccgaaa atcctcgaca gaacatcgcg 3780 tgagctgtgt attaaaatct gatggttaca cagccaagga acttgcattt ttgcagcaag 3840 tggacaaaga attacgtcca attactaaca aaatcctaac taacaaaaaa gaccctgatt 3900 ttgcgttaag aaaaggggtt gtctataaaa agagtaaaag taatacgggt aggaaattgc 3960 ttttgatggt cccttcaatt ctacgtagag atttaatcag taaatgtcat gacgagccgc 4020 aggctaatca ttttgggata gaaaaaactt tggctcgagt agtagaaaat tattggtggc 4080 cacggatgga gtcgagtgtg cgcgcatatg tgttatcatg catccattgt caatttcata 4140 atgttcctcc tggagcccct gttggttttc tgaatcctat ccctccacct actagaccat 4200 tcgacaccat cggaatcgac catcaaggcc cgttgctgga aacttcatct gggaatcgtc 4260 atattttgtt cgcgatcgat tatctcacca aatgggtgga ggcggttgct gtgccaagca 4320 ccgccacatg ttatgtgata aagctgctta aggagatcat caatcgacat ggagtgccga 4380 gccgaattat ttctgaccct ggatcggcat tcactagccg tgagctagcc acggagctgg 4440 acaaatggaa gattcgacac gtaatcgcca ctgcggagca tccccagacg aacggattgg 4500 tggagaggcc gaaccgcaca gccgcaagag ccatcgctgg atttatttcg cctaaccatc 4560 gagattggga cgagcggctg tcggatgctg tgtcagccat ttactctgct aagcagtcaa 4620 ccacgaagca ttccccgttt cagcttgtct atggacgact ccccaataga ccggataaca 4680 atctgttcga gtggccggaa gatggaccag tgtccaggag aagattcatc aaacgtgtgg 4740 agcaattgag aaaggcggcc agattcaaca ttatccgccg ccaggacagg acgaaattgc 4800 tgtgcgacgc tcgcagaaag gcatccaagc cgtttcatca aggtgacctt gtgctagtgc 4860 gtcggatgtt aaagaaaaaa ggattgacaa aaaaattcct gccaaaatat atcggcccat 4920 tccagatcgt caagaaagtg gcagaaacca cttatcgagt ggaagagttg ccgtctagaa 4980 gaaagaagaa agtggtacga aggtttaacg ctcatgttgt gcaactaaag ccatttcgag 5040 cgagaagtga tgagtggcat gtgagcgaga tcgacgaacc aaccggaagt gaaaacgaca 5100 ttgaaaatag aactgaagcc gaagtgactc aagttgccgt tgttccagtc atggaagaga 5160 ttctggccca gcagttaccc gtccctgaag ttcccattcg cccgcctgtg agaacaagaa 5220 gcggacgaat ttcccgtctc cccactatct ttgaagattt tgatttgaat taagagttat 5280 gattttgata ttaatgtgtg tcatctgatc ccggaatttg ttattcaatg ggtttttttt 5340 tgcgctttgc cccaacggcc catagtttcc cccatccgtg agctattttg gtttagtctt 5400 gatttttttt tctcccaccc agaagtagga attgaaggaa tacgaaggga atggtagaaa 5460 ttttattgga gtatgatgtg gcaagaacct tcattgtgtg tgttatgtgt gttcagtgcc 5520 ttttggcacg ggtgaaatgt tgttggtgtt atgtattgtt tattgtgtat gtggatcgtg 5580 caaaagtcgc aggccaggaa aggccgaa 5608 // ID Transib-N7_AAe repbase; DNA; INV; 1785 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Transib DNA transposon family from Aedes DE aegypti. XX KW Transib; DNA transposon; Transposable Element; nonautonomous; KW Transib-N7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1785 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1311-1311 (2011). XX DR [2] (Consensus) XX CC >92% identical to consensus. TSDs are 5-bp. TIRs are ~700 bp CC long. XX SQ Sequence 1785 BP; 601 A; 276 C; 293 G; 615 T; 0 other; cacacagtgg gacgaaactg aatattgacg gccaaaacca cttagaggcg ttttagctca 60 taggtgtctt cgggacattg ttttgaatta gcatcccgag taataataaa aaataatttt 120 tatattagac taaacagagc acccgagcaa aaaatatttt cttaaaaaaa atttttaggg 180 cgatcattgc ttcagcaaag ttttagatct ttttattttc aacaactttg ctgaacatac 240 ttgatatttt taacttcgtt ttcacggtta aatttttaaa tatcacctat tttaccataa 300 atatcagttt tttgaccatt tctatgctca atgtgcaatt tatgaacatg gcatgttcta 360 caacatttta tatgttacaa aaatgcacaa ctttgccgaa tatatcaaag ctgtaataac 420 atcacatcag aagatataat gaatttatgg taaactttat ataatttttg aacatttttc 480 tatgcaagtt tctcctgaat cggcactttt cggcagccaa acttagtata ctttgatgcc 540 tcattatatc agctaaagcc ccatggcaaa agaatgatcg ggtctcacag ggctcattga 600 tttcaaccgt tttttgacaa gaatggaatt ttactttgac atcaaatatt tttttgacct 660 tttgtgtgct cacagtgcaa tcaataaact tatgatgttt tgttattttt tgtatcacaa 720 aagtcaataa ctgtatcgaa tgcatctgta taagctgaca agctgtagaa gcgttgcatt 780 gaaaaacaaa atgtttttat tttaaaaaaa tactattttt gagaatttac ttggtagaat 840 tcttatgtat cagcaatttc gagcacaagt taacatttgc tggcacaaca acctaatggt 900 tgaaaatgag gggaattagg tttaacaggt gaaattatat ttcactgtat attattatga 960 ttggcttttt gtcataaaat atatattatt attggttctt aaaattttca cgtattcatg 1020 ctcgaagccg aatcatcgtc ggctactacg atttgatgca gttttagtat gacaaaggtg 1080 ttgaatacat agtattccaa atttattgat tgcactgtgt gcatacaaaa ggtcaaaaaa 1140 ctatttaaga tcgaaattcc aagctcgtca aaaaatggtt gaaatcaatg agccctgtga 1200 gacccgatca ttcttttgcc atggggcttt agctgatata atgaggcatc aaagtatact 1260 aagtttggct gccgaaaagt gccgattcag gagaaacttg catagaaaaa tggtcaaaaa 1320 ttatataaat tttaccataa attcattata tcttctgatg tgatgttatt acagctttga 1380 tatattcggc aaagttgtgc atttttgtaa catataaaat gttgtagaac atgccatgtt 1440 cataaattgc acattgagca tagaaatggt caaaaaactg atatttatgg taaaataggt 1500 gatatttaaa aatttaaccg tgtaaacgaa gtcaaaaata tcaagtatgt ttagcaaagt 1560 tgttgaaaat aaaaagatct aaaactttgc tgaagcaatg attgccccaa aaaatttttt 1620 tcaagaaaat atgttttgct cgggtgctct gtttagtcta atataagaat tattttttat 1680 tattactcgg gatgctaatt caaaacaatg ccccgaagac acctatgagc taaaacgcct 1740 ctaagaggtt ttgaccgtca atattcagtt tcgtcccact gtgtg 1785 // ID MSAT-4_CQ repbase; DNA; INV; 163 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE A satellite repetitive sequence family from Culex DE quinquefasciatus - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-4_CQ. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-163 RA Kojima K.K. and Jurka J.; RT "Satellite sequences from the southern house mosquito."; RL Repbase Reports 11(1), 616-616 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >95% CC identity. XX SQ Sequence 163 BP; 57 A; 23 C; 18 G; 65 T; 0 other; caaaactacg taaaatacag aaaatttgat actttccttt agaactttat ttgcttcatg 60 gtaaaatcat tgatttattt tatgaaaatg tttcaaatga tctaaattat aagtttccga 120 atcattctga gtgtgtttct tcctgaatac tacaaaattt tac 163 // ID Gypsy-7_DPu-I repbase; DNA; INV; 5056 BP. XX AC scaffold_126; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_DPu_; KW Gypsy-7_DPu-LTR; Gypsy-7_DPu-I. XX NM Gypsy-7_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5056 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 729-729 (2010). XX DR Genome; scaffold_126; Positions 284153 289208. XX CC Positions [3913-4377] - Integrase core CC 'CCCAG' target site duplication CC LTRs are 100% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 1369..4848 FT /product="Gypsy-7_DPu-I_1p" FT /translation="MGRNCHACGKPDHFSSRCPNRDKGKFGANGGMGGGAK FT SKMAHITIGNVQDTHRRRCSPTILLDVICDDGSVGAQISDVIPDPGAEVSV FT GGRDVMASLGLSEKDLAASSFDLVMADRSSPLLSIGQRDIHVRYGDRSAHI FT TIVFCPEIRGMLLCRLDCVELNILHRQYPKPLSRVHSVTFSSPEESLPRDS FT TSPSSGETFLKDIYIPLEPTAEQISTIEAAISAEFEVVFDQEEGLRQMTGP FT DMLIQLRDDAVPFYVNGARPIAFGDRADVKRVLDDLVAKNVIIPVSEASEW FT AAPLVVIRNAKTGKIRICVDHTRLNKFVLRPTHPTRTPRDAVAEVDSESRY FT FTSFDAANGYYQIPLHPSCQHLTTFMTPWGRYKFLRASMGLCCSGDEYNRR FT ADVAFAALSNTVRVVDDLLRFDRTFPAHVAGVCAVLQAARTAGITFSKEKF FT KFAQPRISWVGYDIRHGGITIEEEKLKALSHFPKPTNVSELRSFMGLVEQL FT AGFSTEVAAAKSPLRPLLSTRNSFVWTEDHDRSFEAVKLALVSPPVLVHFD FT PMRETTIQVDASRKHGMGYALLQRHGDSWKLVDANSRWCSDTESRYAVVEL FT ELAAVEWAIRKCRLYLSGLPNFTLVVDHQALVAILDRYTLDAIDNPKIQRL FT KERLSPYAFTTVWRKGKDHAIPDALSRAPVNDPAADDECVGAELAYSVRRI FT TLQGISSICSPPDEPDLSSHLTDSLCENLRATASADPHYVDLVAAVESGFA FT VDRARTPDHIRQFWSIRNHLSVDNGIVLYGSRIVVPMAARKDMLNRLHAAH FT QGIVRTKRRAQQTIYWPGITNDVTMLIERCAICQERLPSHCQEPMMSDPPP FT TYVFQDVSADLFQHGSLHVLVYADRLSGWPVVHQWRRDPTAREVVQAVVSN FT FVELGVPMRFRSDGGPQFSAGIFQEALKRWGVEWGNSSPHYPQSNGHAEVA FT VKAVKELVAKIAPSGDLSSEAFLTGLLEFRNMPHESGLSPAQIVFGHQLRS FT ILPAHRSSYASKWKEAIAARELQAAADAAVRFRYDARTRPLHPLSIGASVR FT VQDPDTKLWDRVGVIVAIGRYRSYRIKFASGSVLWRNRRFLHLLVPATSAE FT EAEASSPGGDGGETAAGDPDSTSTGQRPLPAPSVATPRRSTRTRKPRVIIS FT V" XX SQ Sequence 5056 BP; 1097 A; 1312 C; 1435 G; 1212 T; 0 other; tggcgcagtt ggcataacct ttttctcagt ttcggctttg tgatttttcg tttgctggcg 60 tcccaacgtt ggccctcacg tgttcagtgt gtcggtgggc gccgccattt tggtttggtg 120 ctccgtttcc ccgatgcatg tgtgattggg cgcgtctcag tttggtttat ctggcaatcg 180 aatagacatc gtgtagacat ttttagtgtg cgatattttc gccatctcag ctcatatata 240 ctgtcgtttt cagtgacacg cggttcttga gcatcccctt ttgacttttt aaacttgatg 300 aacactccat attttcttcg ctcgtcaaaa catcggccgt cacatacttc cccccgcatt 360 tcgatcccgg tcagcatgac gacggcagct gatgcgttgg ctgcggccaa tgcggcgacg 420 gccgccatta cggccaatga tgctcgcatg gacgccatgg aacaagggca gacgaccatc 480 gccaatcagc tgacggcctt gacggcgcaa cttcaagctc tttttatggc cggtggtggc 540 gctggtggag gtgctggtgg cggtggcgct ggtggcgttg gtggaggcgg tggcgctggt 600 ggaggtggag ctggtggagg cggcggtggc gctggtggag gcggcggtgg tgccggtgga 660 ggaggtgttg ctggcggcgt tattgggcat gctgttccac atcagaggcg gcggctcgat 720 ccatctggca tggataaact tcacggtgac atcactattt cgttactgcg ttcttggaga 780 aatcggtgga acgattttgc ggagttgaac cagcttctca cctatccagt aacggaacag 840 atggccgctt ttcgcatggc cctagactcc acgatgcaac aggtggtaga agtggcactg 900 ggaatcacac ccgcaaccgt tacgaccccc gaccaagttc tcaacctcat cgcggattag 960 atacgtgcga aacgcaatgt agcactcgac cgagtggcgt ttgaagaacg tcgtcaaggt 1020 acatcagaat cattcgatga tttttacatc ggccttcgcc gtctagcaga agcagcagat 1080 ttatgtggtg cgtgttccga aacgagattg gtgactcgtg tcattgccgg gacgcgcgac 1140 gctgagacca agaagaaact attggccatc agtcctttcc ccagtttgca ggtggcggtc 1200 aacatctgcc gcagtgaaga gtcggcccgt gcgaacgagc gcactcttag tgggcaatca 1260 ggtgttgcag ctattcatcc caagaacggc aaggttgaca atcggtcatc gaatgagtgc 1320 ggcgcgtgtg gccgcttggc tcatgtgaat ggtgcgacat gtccggcgat gggcagaaat 1380 tgccatgcgt gtggaaaacc agatcatttc tcttccaggt gccccaatcg tgacaaaggg 1440 aagttcggcg caaacggagg aatgggcggc ggagctaaat ccaaaatggc tcacatcacc 1500 atcggaaatg tccaggatac tcatcgacgg cgctgttctc cgaccatatt gttggacgtg 1560 atttgtgacg atggcagtgt cggtgctcaa atcagtgacg ttattcccga tccaggcgct 1620 gaggtcagtg tcggcggccg tgatgttatg gcatctttag gcctctctga aaaggatctg 1680 gcagcttcgt cttttgactt agtgatggcc gacagatcct ctccgttgtt atccattgga 1740 cagcgcgaca ttcacgtccg gtatggagat cggagcgctc atatcacgat tgtgttttgc 1800 ccggagattc gtggcatgct cttatgccga ttggattgcg ttgaattaaa tattctgcat 1860 cgacaatacc cgaagccgct gtctcgagtg cattcggtaa cattttcatc gccggaggaa 1920 agtctaccac gtgattcaac ttcaccttca tcgggcgaga catttctaaa ggatatttac 1980 atccctctgg agcctaccgc ggagcagatt tcgaccatcg aggcggctat cagcgccgag 2040 tttgaagtgg tttttgacca ggaagagggg ttacgccaaa tgacaggccc ggacatgtta 2100 attcagctcc gcgacgatgc agtcccattt tatgtcaatg gtgcgcgacc tattgctttc 2160 ggggatcgcg ctgacgtgaa acgtgtactc gatgacctag tggcgaaaaa cgtgatcatt 2220 ccagtcagtg aggcgtcgga gtgggccgct ccattagtcg tcattcggaa cgcaaaaact 2280 ggaaaaattc gtatatgtgt ggatcataca cgtctaaaca agttcgtgct acgccctacc 2340 catccaacgc gtacaccacg cgacgcagta gcggaagtgg atagtgaaag ccgatacttc 2400 accagcttcg atgcggccaa cgggtattat cagatacccc tccatccctc ctgccaacat 2460 ctaactacgt ttatgacccc ctggggtcgc tacaaattcc ttcgggcgtc catgggatta 2520 tgctgttccg gggacgagta caatcggcga gcggatgtcg catttgcagc gttgtccaac 2580 accgtgagag tggttgacga cttgctgcga tttgaccgga catttccagc tcacgtcgcg 2640 ggagtatgtg cagtattgca ggcagcgcgg acggcaggaa tcaccttcag caaggagaaa 2700 tttaagtttg ctcaaccccg catttcttgg gtcggatatg acatccggca cggcggcatc 2760 accatcgagg aagaaaaatt aaaagcgttg tcgcattttc ccaagccaac caacgtatcg 2820 gagctgcgat ctttcatggg attggttgag caattggccg gcttttctac cgaagtggcg 2880 gcggcaaaga gtccccttcg tcccctgctc agcacgcgca attctttcgt atggaccgag 2940 gatcatgatc ggtctttcga agctgtaaag ctggccctcg tctctccgcc agtgttggtg 3000 catttcgacc ccatgcgtga gacaacaatc caggtggatg cttcgcgcaa acacggcatg 3060 ggatatgcgc ttttacaacg ccatggcgat tcctggaagc tggtggacgc taactcacgc 3120 tggtgctcgg atacggagtc acgatatgca gtggtggaac tggagttggc ggcggtggaa 3180 tgggcaatcc ggaagtgccg actttattta tccggattgc ccaattttac attggtggtc 3240 gatcatcaag cattggtggc tatattggat cgatatacat tggacgctat cgacaatccc 3300 aagatccagc gcctgaagga gcgtctctcc ccatacgcct tcacaacggt atggcgaaag 3360 gggaaagacc acgctattcc tgatgcacta tcacgggcgc cggtaaatga cccggcggcg 3420 gacgatgaat gcgtcggagc agagttggcg tattctgtgc gtcgcatcac acttcaaggc 3480 atcagcagca tttgcagccc acctgacgag ccggatttat cgagccattt gacggatagc 3540 ttatgtgaaa acttgcgagc gacggcgtca gcggaccccc attacgttga tcttgtggcc 3600 gccgtggaat ccggatttgc tgtcgaccgt gcacgcaccc ccgatcacat tcggcaattc 3660 tggtcgattc gaaatcattt atcggtggat aacggcatcg tcctatacgg ctcaagaatc 3720 gtcgtaccta tggcggcacg caaggacatg ttgaacaggc tgcatgcggc tcaccagggc 3780 attgtcagga caaaaaggcg cgcccagcag acgatctact ggcctggaat caccaatgac 3840 gtgaccatgc tcatcgagcg atgcgcgatt tgccaggagc gcttgccgag ccattgccag 3900 gagccaatga tgtccgaccc gccaccaacg tacgttttcc aagacgtgtc agcagatctc 3960 tttcagcacg ggtcgctgca tgtgctggtg tacgccgaca ggctatccgg ttggccggtg 4020 gttcatcagt ggcgacgcga tccgactgca cgcgaagtgg tgcaagccgt cgtaagtaat 4080 ttcgtggaac tgggtgtgcc gatgcgcttc cgatcggatg gtggtcccca attcagcgcc 4140 ggaatcttcc aggaggctct gaagcggtgg ggggtcgaat ggggaaattc gtcgccacat 4200 tatccccaaa gcaacgggca cgcggaggtg gcggtgaagg cggtaaagga gttggtggca 4260 aagatcgcac catcaggcga cttatcatcg gaagctttcc tcactggact cctcgaattc 4320 cgtaatatgc cgcatgaaag tggtttatcg ccggctcaaa tagtatttgg tcatcaactt 4380 cgttctatcc taccagccca ccggtcatcg tacgcgagta aatggaaaga agcaatagca 4440 gcacgggagc tccaggcggc agctgatgcc gcggtccgat tccgttacga cgcccggacg 4500 cgccctctcc atccgctatc catcggtgcg tcggtgagag tgcaagaccc ggacacaaaa 4560 ttatgggacc gtgttggcgt aatagtggcg atcgggcgat accgctcgta tcgcataaaa 4620 tttgcgagcg gcagcgtttt atggagaaac aggcgattcc tccacttgtt ggttccagcg 4680 acatcggcgg aggaggcgga agcatcttcc ccgggcggtg atggaggtga aacggcggca 4740 ggcgatcccg attctacgtc tactgggcag cgaccattac cggccccatc ggtagcgacg 4800 cctcgcagaa gcacccgcac acgcaagccg cgagttatta tttctgttta atttggcatg 4860 tgaagaatgt attcttcccc gctattatat ccatgttatg ttcccactat cctccaatcc 4920 cgtcggcgtg cgcgacattc caatgtatct ttgtcgcctt catctatttt tttaattatc 4980 aatctcccat attgcgtccg gcgccgccat tcgatggaga tcgaccatca ccggagagcg 5040 ggggctcggg aagggt 5056 // ID Ginger1-7_HM repbase; DNA; INV; 6088 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.01, Created) DT 02-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-7_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-6088 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 57-bp long. Tpase contains 2 introns: 261-1067, 1230-1412. XX FH Key Location/Qualifiers FT CDS join(134..260,1068..1229,1413..3421) FT /product="Ginger1-7_HM_1p" FT /translation="MADRDYSVEDVKKYLYDGTIKKCLNLKQRKSFLKYAK FT SFQFIDGKLYYVTATKKLQVLSTDEEKKLAFRDVHDSNHGAHVGLNNTRVK FT LKSAFYWLGMVSDITKWIKECDNCQRMEKIKTCAPELKPIKVNGLWEFLGI FT DLIGPLPLTTQGNKYILTVTDLWSKYVEAFSIPEKSAFFVSKSLTTLFYRF FT GPPKKVLSDQGREFVNDLNEFLFSIFNIKHLITSAYHPQTNGQDERSNQII FT KRALSKVANKSQDNWDTLLEPVLFGLRTCVQSSTKFTPFSLMFGREARLFS FT TLALNDNDPNFNDININIAESNATESQIQEIMDSRIAVHAIVNDNICIAQK FT KMQKNHDKKHSKGFKSFNFKVGDKVLVKNYKKIGRKGCRMEHDWVGPAIIT FT DFKPTGAVVSLNGKVWKSAVAMPNMKPYLEKNLNFISSNILKEHNYCLQIT FT DKIHLNEESCECASSSFMALNIASTLMYDLERKKMKDSQVRDFPTSPSDFY FT NCSNSNYDYSNKILAVEEVKPVDSVVKSVKSNSSCRSLNFSKVSCAIPVAS FT NFYSELSTQCKNKALDILKNPSGWLDDTLIDIAQSLLSYQFPKISGFQSSC FT ILSCFNSGGYSESCSFIQIINIRNSHWVLLSNVSSDDLSDSKCVQFYDSLF FT NSSNECDVPLLVHTVARSSLNCHTSSSLIVEVMDCQMQQNGNDCGLFAIAN FT ATALCNGIDPSLIIWKHNDMRKHYLKCIENGKLEMFPFVTVLDGHKRCAFT FT IDCNHFCKCRQKNNN" XX SQ Sequence 6088 BP; 2086 A; 792 C; 898 G; 2312 T; 0 other; tgtcacgtta aataagtaga tcgttaaata agtagacaaa cagttctgcg catgcataaa 60 aacgctgaaa agtttaaaac aaggatccta aaacccaacg ttaaataagt agcatagctt 120 ataaacataa aaaatggctg atcgcgatta ctctgttgaa gatgttaaaa aatatcttta 180 tgatggaaca attaagaaat gtcttaattt aaaacagcgg aaatcatttc taaagtatgc 240 taaaagtttt cagtttattg gtatgtatta atattaagaa taattttgta tttatacatc 300 aagtttgttt gaaggtttag aataatatct caggtaaaat gtaaacattg taaacaaatg 360 taaacatttg catataaatt gtaaatattg cagtatatta taattataat tctgcaatga 420 ctaaacatca aacacataca aaattaacct ttgcatacac cttttgtata caaatgtaaa 480 catcaaacac atacaaaatt aaccgttgca tacacctttt gtatacaaat gtaaatattg 540 cagatgaaat tttgtgataa ctagtcttca agtacacatt aagttatttt taaaaaataa 600 attttagtta taaagctata aagttatata cgtaaataaa taaataaatg tatatatata 660 attatatata agactcatca gaaatgcatc atttctgatg agtctaaatt gatgaaacac 720 tgtgtaaaat gaaaaaagat tgttcagtga ttttctacta atttatattt ttttatttat 780 ttacgtatat aactttatag ctttataact aaaaaaaata tatatatact caaggtattt 840 actttattat agataagata caagaaagta aagttcatga aaattacgtg ttttttgtca 900 aacatcaaaa agatttttgt tgcactgtgg aattgctata tttgtgtgtg ttatctgtac 960 aatataactt gtttaacata cataaagtct tttttaaaca cttttaaacc ataccatgta 1020 atttttgatt taagaagtgg aaactttttt ttattttttt cttttagatg gaaaactgta 1080 ctatgtaacc gctactaaaa agctccaagt tctttcaaca gatgaagaaa aaaaacttgc 1140 ttttagagat gtccatgact ctaatcatgg agctcatgtg gggcttaata acactagagt 1200 taagctaaaa agtgcttttt attggttagg taatgagttc gattacaccc catttcaatt 1260 gagtatattt tttatcaaat tttcttatac aataattttc ttctttgcat aaatgagttt 1320 tatcatggtt cattttctat attaatgtaa tagtaataat aattttttac agaataatat 1380 acttatcata ttgtttgcat aatgttatct aggtatggtt agcgatataa caaaatggat 1440 aaaagagtgc gacaattgtc agcgcatgga gaagatcaaa acttgtgctc cagaattgaa 1500 acccattaag gtcaacggat tgtgggaatt tttaggtata gaccttattg gtccccttcc 1560 gcttacaaca caaggtaata aatacatttt aacagttact gacctttgga gtaagtatgt 1620 tgaagctttt tcaataccag aaaagtctgc tttttttgtt tctaagagtt taactacttt 1680 gttttatcga tttggccctc caaaaaaagt tctatcagac caaggtagag agtttgtaaa 1740 tgatttaaat gaatttttat tttctatatt taatataaag catttaataa catctgctta 1800 tcatccgcaa acaaatggac aagatgagag aagcaatcaa attattaagc gagctctttc 1860 aaaagtagct aataaatcac aagataattg ggatacttta ttagaaccag tattgtttgg 1920 tcttcgcacg tgcgtacaat cttcaactaa gttcactcca ttttctttaa tgtttggcag 1980 ggaggcccgg cttttttcaa ctttagctct taatgataat gatcctaatt ttaatgatat 2040 taatattaac attgcagaaa gtaatgcaac tgaatcgcag attcaagaaa taatggattc 2100 ccgtattgca gttcatgcca tagtgaatga taatatttgt attgcacaaa aaaaaatgca 2160 gaagaaccat gataaaaaac attcaaaagg tttcaaatcg ttcaatttta aagtaggtga 2220 caaagttttg gtaaaaaatt acaaaaagat tggtcgtaaa ggttgccgta tggagcatga 2280 ttgggttgga cctgccataa ttactgattt taagccaact ggagctgttg tgtcccttaa 2340 tggtaaagtt tggaaatcag ctgtagctat gcccaatatg aagccttatt tggaaaaaaa 2400 tctaaatttt atctcttcaa atatcttaaa agaacataat tattgtttgc aaattactga 2460 caaaatacat ttaaatgaag agtcgtgtga atgtgcaagc tcttctttca tggcattaaa 2520 cattgcatcc acactaatgt atgacttaga aagaaagaaa atgaaagata gtcaagtgag 2580 agattttcca acttctccta gtgattttta caactgtagt aatagtaact atgattatag 2640 taacaaaatt ttagcagttg aggaagtgaa gccagttgat agtgttgtaa aaagtgttaa 2700 atctaatagt agctgtagaa gtttaaattt ttctaaagtt tcttgtgcaa ttcctgttgc 2760 cagtaatttt tattctgaat tgtcaacaca gtgtaaaaac aaggccttag atattttaaa 2820 aaacccatct ggctggctcg acgacacttt gattgatata gcacaaagtt tgctttcata 2880 ccagtttcct aaaatatctg gatttcaaag ttcatgtatt ttaagttgtt ttaattcagg 2940 tggttattca gaaagttgca gttttataca aataattaat attagaaatt cacattgggt 3000 gcttttaagt aatgtatctt ctgatgatct ttcagattca aagtgtgttc agttctatga 3060 ttcactcttt aatagttcaa atgaatgtga tgtaccattg ttagtgcata cagttgcacg 3120 ttcctccttg aattgtcata caagcagttc tttaattgtt gaagtaatgg attgtcaaat 3180 gcaacaaaat ggaaatgatt gtgggctttt tgcgatagca aatgcaacag ctttgtgtaa 3240 tggtatcgac ccaagtttaa tcatatggaa acataatgat atgaggaaac attacctcaa 3300 atgtattgag aatggtaaat tagagatgtt tccatttgtt actgttttag atggtcataa 3360 aaggtgtgcc tttactattg attgcaatca tttttgtaag tgcagacaaa aaaataataa 3420 ttgattgtca ttcaagtttt tctgaatatg gatttataat tcctcttaca tcattaaggg 3480 gaatgtatct tgttatttct ggtgtctttt tgcttttatg acttttatga atatttaaat 3540 tcaacctata tgcataaaaa agttttcttt gttttttatt gtcggtttgt ttatgtactt 3600 tatgttttag ttaagttcat tattttcttt tttttttttt tttttatgtc ttatagatgt 3660 tcaaagttgt ataattggct aatttaggtc aactattgct tggtttttta ttttgttatt 3720 agcatgacga tttgctattt tttaactttg tagttgttta tttcttattg ttatcagtta 3780 tttattttat caactagggt tgttgatata agacatagtt aatagctgca gttattaaag 3840 tcagttcttt atttacatgc atttgttgta gtagcagtta actctaacta ctactactac 3900 tactatgaca agagtaatac taaaaattac aattatgtta tttttaattt taatgagata 3960 tttaatcaat ttgttataga atattgtgaa agtttagact tagttctcac atgtagatgc 4020 catgactatt ttaacatttt taacattcat gttaatgaat gttaaaacaa tttgaatgat 4080 tcaaagattc aaatccttct ggcacacact tagatttcat tttttgacat atattgatca 4140 atgcagttgt taagtacttt tgtggcacaa aagcaggtat tttttcattt tttttttttt 4200 tcattacgat ttaagtatac tatttatgtt atactatatt tctatactat ataggattgt 4260 aattccatgt gcaaaaatca aatttttttc cgaaaatgcc caagtttgtt cttattttca 4320 aaaaaaagtt tcagtttagt atgaagtgtg tcttttcttc aatgtcgttt taatttaaaa 4380 cattcacaca gtaagaaata gtaaactaaa tttaggtcct ggtattttat tattattgtg 4440 ctattctaat gtaatgatgc tgcattttat tgttaaaagt tatgcattag gactttcatt 4500 tattgaactg actgcttcta aaatttacta aaattcaaga caatatttag ctatttaaat 4560 acaatttatt tttatctcaa tttcgttttt tatctttaat gtttatagtt tggatccttg 4620 aagcatttta tattttgttt tttaagtaat aatattttaa atccattgtg gtgtcttttt 4680 aagttaacta aagtttagta aaagccaaca tgaaataaca acattaaaca acactgacaa 4740 gaactgctgc atcacgacag caactgcagt atcacaacag gttaccaaaa gcaggtaaat 4800 tgttttctag tttttttttt cttttctttt atatctaatt acctgttaca atttttattt 4860 taggtttgtc atataacgca ttgatgcaat atgaaagatc aaagtttaca tatttttcct 4920 tatttgtgat agcatgtttg attgtaaatt aaaaattttt gtttaaaacg tttttgtcat 4980 caattgctta gttatttata ataaactaat aaaattaata gctcttgcat tacagaagtt 5040 aacacataat ctttattaaa gttcatgtta ttataaaaaa tattacaaga agttacttta 5100 tattttcgtc aatttaaacc atgatgaaga gtttaatttt tttaataatt taattaattt 5160 gtgtcagtaa ctaaaaactt attgtgttga actttgggaa tgtcgaggaa tatggttctg 5220 cctcaaacta acatgaccta agtcagttgt aaattattca aataatgata gatattaatt 5280 gataaattat tttatagcct caatttaaag gcgtttctct aaaaaggtat agatcaatta 5340 tttctgaagt taaggtttct caactttaaa ataaaatgtg gaaaacagct cgagggtcgt 5400 gtaaaatgtc aggtaaattg aacacaaagt tggcacactc gctataaatt cacaatttat 5460 tctgtaacaa accgaccata acccgtatta cgggttgcac tctgatagtt tgcttaattt 5520 tactaaaatg cgcaagttgt tcaataacca aagagtgcag taaatttgca cgtgtgatga 5580 aataactatt tcatttctta tcagctacta gaaaattgtt gatatcatta aagtatgtct 5640 gctttagtac acttatccag tgagtgacct aggaaaaata aaacttttgc taaaaaatat 5700 gtttcttgtt atactattca acaaagataa aatttcgaat agaatataca tcgtgagtta 5760 aatattttta agtgtaaatt atcactataa aaatgccgta aaacaatatt tcccgtgaaa 5820 aagtaaatgc ttttaatacg ttatacgttt aaatacgtta atacgtttaa aatacgttaa 5880 ccgaactaaa gttcggttaa cgtatgtttt ttttttaatt tattcgtctg ggttaacata 5940 taatgccttg attgttcaat ttgtctactt atttaacgat ctacttattt aacgtgactt 6000 actacgttaa ataagtatcc gtaaactaac gcatgcgcag aatatcattt tgtctactta 6060 tttaacgatc tacttattta acgtgaca 6088 // ID Gypsy-619_AA-I repbase; DNA; INV; 6475 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-619_AA_; KW Gypsy-619_AA-LTR; Ty3_gypsy_Ele152; Gypsy-619_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6475 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5259-5738] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3276..6083 FT /product="Gypsy-619_AA-I_1p" FT /translation="MLTCNNFYKQKINLNDQTPVYIKNYRTPEVHRTEIDR FT HIDKMLEEQIIQPSISPYNSPVLLVPKKSATGDKKWRLVVDFRQLNKKVIA FT DKFPLPRIDDILDQLGRAKYFTTLDLMSGFHQIELDEESKQYTAFSTNSGH FT YEFNRLPFGLNISPNSFQRMMTIALSGLPPECAFLYIDDIIVVGCSIKHHL FT NNLEQVFQNLRKYNLKLNPAKCRFFCADVTYLGHHISEDGIQPDKSKYSTI FT LNYPKPQNADDVRRFVAFCNYYRRFIPYFSDKAAPLNALLAKKTIFKWTEE FT CQNAFETLKNELLSPRILQFPDFSKPFILSTDASKIACGAVLEQEHNGIRL FT PISYASKKFTKGESNKSTIEQELTAIHWAISHFRPYLYGRKFTVKTDHNPL FT VHLFSMKDPASKLTRMRLDLEEFNFEIEYVKGRENVGPDALSRIDIDSEIL FT KNMSILPMQTRSKTRNMTTEAKVNKPNAEKIDQLHAYDSINNIDAFNLPKL FT TFQALSEQLIINIKSKNLKGIIAQGQIFQQKGMVYINECMNLLNEMAKTTN FT MKKLAISVMDPIFNYISREQFKQSCNENLSDVTIIIYTPAMVIQDENEINK FT LIKENHDTPSGGHVGINKLLLKLRRKYYWKNMKNSISKYVKNCISCKLNKH FT SIKTQENFEKTTTPTKPFELVSIDTVGPLTRSIKGNRYALTLQCDLTKYIV FT TVPIVDKQASTLAKALVEHFILIYGCPRAIKSDMGTEYKNEVFQQICNILE FT INQKFSTAYHPETIGALERSHRCLNEYLRQFINEQHDDWDSWLPYFTFFYN FT TTPHTEHSFAPFELIFGTQVNYPTKFKNNTTIDPIYNFEAYYEELQFKLHT FT AALKAKLLLEKSKEKRISKQIQNSNPIEIQTGSKVWLKKENRRKLDPVYTG FT PFTIIEINHPNVTIKNQMSNETQTVHKNRIII" XX SQ Sequence 6475 BP; 2561 A; 1270 C; 989 G; 1651 T; 4 other; tggcgaccgt gacaggttat cttttatctt ttataagacc ctgtaatcct acgattcttc 60 cgcgcgcatc caagtatcta tgtcggttac ccgtcggtcg cgcgcatcga agttaaatta 120 actgtgtgtc tcaagtgaaa agtgatatac accgtgaaaa atgggatggt tctctagtga 180 cgaaatcgtc acaaatagtg ctgtgtgtcc tccgcaagaa aattatcacc taacacaggc 240 aatcgcactc gctgtgttag caggcattgc cgttgggtac attgtaacga aaatagtggt 300 aaaagctcat aaaaatcaca ccataagagt agcggaaaga gcagttcgcg ttgctaacgt 360 gtaaagtgac aataaagtta acagtgaaca ctggacaaat ccaaaatgaa agtttaagat 420 ggcagcaaaa gaaaatatta caataacatt gaacagattt agaaaatcaa tcgcagcaaa 480 agtgccaata atagtgaaca caaattcaag ttgcttaaaa ggacttgttc aggcactaga 540 acacgaaaaa ttaataacag tgcttaaagt gtacaaagac tcatgtgtag tgaaagtaac 600 aaccggaata accacaatag aagtactacc gcatttccaa caaacgaaga aaagccaact 660 cgcagtttgg cggcaaaaat ttctaccacg tggaaaagga catttaatca tctcaaccaa 720 taaagggatg atgtcggaca gcactgcaga acaaaaacaa ttaggtggtc aaatagtagg 780 attcgtctac taagaaaata cacaaaaatg ttgttctctt ctctcaaaaa aacaaaaaca 840 tgttcaaaaa acaagaagat taagttcaag ccatttgaag ggaaaacggt gaatacgcag 900 tatgaacgat acgacgttga acaatatttc tgcgtaggag gtaatttagg cgacaaaaca 960 aaagtgaccc gttacgaaga aaaagaaaca agaaaaaaag acgagtgcat accgtattgt 1020 ttacagttcc gctcgtttaa agcacatgaa cgataccaca gtgaaataaa caaaaaaata 1080 gacaaagaaa aaaacaaaca aacaataagt gcagtgtcga aagcagaata atgaggccga 1140 tattacaatt tttggctacg aatgtgtacg agatcaacgc gtcgacaggt gacgtatttc 1200 actacgtcag tttgactaac gatgactttg aagctgacga gttaatgaga tcactacaac 1260 aagtgcagtc ccaaccccag caacaggacg aacacccatc aacgccaagt gcgccttcaa 1320 tgtaacagat accagatgag cagcatttag taagtaatag gaaatatatc atcaaaaata 1380 gcccacagcc cacatgtcat acctagaaat ggttattgaa gaaaaaattc atgaactatt 1440 gattttgaaa acaaacttcg ataaaaaccc ttatcgaaat taccgtagaa taactctmtt 1500 gaacaaattg gttcaagcta aaactattta caataaaata attgaattaa ttacaacaaa 1560 cgaatttaaa ttaaccgaaa gcaaactaaa cataaatatt aaactaagtc gaaatttata 1620 ttacgaaatc aatcaactaa ttgcctcaaa actaaaaaac accaaagaca atttcaaatt 1680 tcgttcagtg gtgaaaggtg tgatattttg tctaagaata aaaaaattta actatacaaa 1740 aacaagcaat ggctagtata attgaaatta taaaagtggt tacttcactt gtacctattt 1800 atgatggaaa aggtgaaaag ttggaaaata ttatagcagc tttagaggca tgtaaacctc 1860 taataaacga tgccaatcga caagcagcta ttcaaacaat tttatctaga ttagaaaata 1920 aagcacgtgc agcagtaact gataatcctg ctagcattga tataatcata aacaaattga 1980 aggaaaagtg cactgttaaa gtagcaccag acactatatt agctaaaatt aacgcaacaa 2040 ggcaaaatgg aaattttgac aattttgcca ctaccataga aaaattaacc atcgatttag 2100 aaaaagctta tmtaggagac gaagttcccc ttgatacagc taccagactt gcaaataatg 2160 ccggaataag aggtttaact aatggaataa aaagtgagca aacaaaactg ttgttgaaag 2220 ccggacaatt taccacaatt tccaaagcaa ttgaaaaagc ctcagaactc gaagcagaaa 2280 acaatttgaa cagatcccct agtatcctac attacagttc caatccagga ccaggatcaa 2340 gaaaccaggc ggtccgcagg ccgcggcggc cccggggccc ggggccaaca aaatacccaa 2400 actcaaaata tccaacagca acaacagcca caacaacaac ctcacttcca agtagcatca 2460 gcaaacatgc atcaacaaag ataaaaggaa tttttaatat ctctactgcc ctgtcaaatt 2520 ttgtgattct gaaaacagaa tccacgcact ctgaatgtac tttcctagta gattcaggag 2580 ctgaaatatc agtcctaaag cccacaaaat tgaaacaggc aacaaatatc aacaattcgc 2640 atacatgttc tataacaggc attcagccca accccattga cacactcgga tcaatcaatt 2700 tatctttaca tctaccgtat gaagtttcag caacacacac attccatatt atacccgaaa 2760 ctcttacaat acccacagac ggaatcatag gacgagattt cctgacaaaa tttaaatgcg 2820 ttatcgacta cgacacctgg actctttcaa gctttataaa ttcagaatta attgagatcc 2880 ctatacaaga taaattacat gacaaaattc taataccacc ccgttgcgaa gtatttagaa 2940 aaattcctat ccaaaacccc caccaagatt acgttatttt atcaaaagaa ataaaacccg 3000 gagtattttg cgcgaactca gtgttaaatg ctaaaaattc acttgtaaaa attatcaata 3060 ccacaaacta cacggtaaaa attcatgaga atctgtcaga tctacttcaa cccctcgaaa 3120 attataacat atttactatc gatcaaacaa atcaaacaaa tcgtaaagac aaatgtatag 3180 ctgaactcaa tttaaatgac actccttgtc acattaaaga cagattaaaa gacttatgtg 3240 ccaaatacaa tgacttattt gcattacagg gtgacatgtt gacatgcaat aatttctaca 3300 aacaaaaaat aaacttgaac gaccaaacac cagtatacat taaaaactac agaacaccgg 3360 aagttcatag aactgaaatt gaccgacaca tcgacaaaat gttagaagag caaataatac 3420 aaccatctat ttcaccctac aactctcctg tattactagt ccctaaaaaa tccgccacag 3480 gcgacaagaa gtggcgtctt gtcgtagatt tccgacaatt aaataaaaaa gtaattgctg 3540 acaaatttcc actaccccga attgatgaca tactagatca gttaggaaga gcgaagtatt 3600 ttacaacact cgacctaatg tcaggttttc atcaaatcga gttagatgaa gaatcgaagc 3660 aatacacagc tttttcaaca aattctggac attatgagtt taaccgtctt ccttttggat 3720 taaacatctc ccctaatagt ttccaacgca tgatgacgat agccctcagt ggactaccgc 3780 cagaatgtgc ttttttgtac atcgacgaca taatagttgt cggatgctcc attaaacatc 3840 atttgaataa tttagaacaa gttttccaaa atcttcgaaa atacaatcta aaactgaacc 3900 cagcaaaatg tagatttttt tgtgccgacg tcacatattt aggacaccat atttcagaag 3960 atggtatcca accagataaa tcaaaatatt caactatact taattacccc aaaccacaaa 4020 atgcagatga cgttaggaga ttcgtcgcat tttgcaacta ttacagaaga tttatccctt 4080 acttttctga taaagcagct ccattaaacg cacttttagc caagaaaaca atattcaaat 4140 ggacagaaga atgccaaaac gcttttgaaa ccctgaaaaa cgaactacta agtccaagaa 4200 ttcttcaatt cccagatttt tccaaaccct tcatactaag cacagacgca tcaaaaatag 4260 catgcggagc tgtgttagag caagaacata acggaattag attacccata tcatatgcta 4320 gtaagaaatt tacaaaagga gaaagtaaca aatccactat tgagcaagaa cttacagcaa 4380 tacattgggc tatctctcac tttagaccct atctctatgg tcgcaaattc acagtaaaaa 4440 cagaccataa ccctctcgta cacctttttt ccatgaaaga tcccgcgtct aaattaacta 4500 gaatgcgatt agatttagag gaattcaatt ttgaaatcga atacgtcaaa ggaagagaaa 4560 acgtagggcc cgacgctctt tcacgtatag atattgattc tgaaattctt aaaaatatgt 4620 ctattcttcc aatgcagacc agatccaaaa ctcgaaatat gacgacagaa gctaaagtta 4680 acaagccaaa tgcagagaag attgatcaac ttcacgcgta tgactcgatt aacaatatcg 4740 acgctttcaa tttacctaaa ctaaccttcc aagctctaag cgaacaatta attataaata 4800 ttaaatccaa aaacctgaaa ggaattattg ctcaagggca aatattccaa caaaagggaa 4860 tggtttatat aaatgaatgc atgaacctac tcaacgaaat ggccaaaact acgaacatga 4920 aaaaactcgc gatcagtgta atggatccga tttttaatta tatttcccgg gaacaattta 4980 aacaatcatg caatgagaat ttaagtgacg tcacgattat tatatataca ccagccatgg 5040 tgatacaaga cgagaatgaa ataaataaat tgataaaaga aaaccacgat accccttcgg 5100 gaggacacgt aggtattaac aaactacttc ttaaacttag acgtaaatat tactggaaaa 5160 acatgaaaaa cagcatttct aaatatgtga aaaactgcat ttcgtgtaaa ctaaataaac 5220 attcaatcaa aacgcaagaa aactttgaaa aaacaacaac tcctacaaaa ccctttgaat 5280 tagtttcaat cgataccgta ggaccactta ctaggtcaat taaaggaaat cgttacgcat 5340 taaccttaca atgcgatcta acgaaatata tagttaccgt accaatagta gataaacaag 5400 caagtacact agctaaggct ttagtggaac actttatttt aatttatggt tgtccacgag 5460 caatcaaatc cgacatgggt accgaatata aaaatgaagt attccaacaa atttgcaata 5520 ttctcgaaat aaaccagaaa ttttccacag cgtaccatcc agaaaccata ggagctctcg 5580 aaagaagtca tcgttgttta aatgaatatc tcaggcaatt catcaatgaa caacatgatg 5640 actgggactc ttggttacca tattttacat tcttttataa cacaacacca cacactgaac 5700 attctttcgc accatttgaa cttatctttg gcactcaagt aaattatcca acaaaattta 5760 aaaataatac cacaatcgat ccaatctaca attttgaagc ctattacgaa gaattacaat 5820 tcaaattaca tacagcagcg ctaaaagcaa aactattact agaaaaatcc aaagaaaaaa 5880 gaatctcgaa acaaatacaa aattcaaacc caattgaaat tcaaacagga agcaaagttt 5940 ggcttaagaa agaaaaccga cgaaaactag atccagttta tacaggaccg tttactatca 6000 tcgaaataaa tcacccaaat gttacaatca aaaatcaaat gtcaaacgaa actcaaacag 6060 tacataaaaa cagaattatt atctaaacta aatcttaaat ttatccttaa ctcatttatt 6120 ttcatacaaa ttaatccatg tttataaaat tctatatwtt tttcaatttt ttttttttgt 6180 ttataaaaac catwattttt gtaaaacaat gttatcttaa gaagtgttta ttttaagaca 6240 cccgagccta atctcaaaaa aaaataagaa acgaaaaaat aagtgaaaaa agagataggt 6300 caaaagtcta acctcgagtt agacgtgaga aattcggcat ttaatattaa atagattaat 6360 aaccaaaatt ttgttaaact tacctaaaaa ttgtaagaga ctactggaga aagacattaa 6420 ctgaataatt acacttcatt tcattatgtt acattattct cctaaagggg gaagg 6475 // ID piggyBac-13_SM repbase; DNA; INV; 2554 BP. XX AC . XX DT 30-MAY-2008 (Rel. 13.05, Created) DT 30-MAY-2008 (Rel. 13.05, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-13_SM. XX OS Schmidtea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae. XX RN [1] RP 1-2554 RA Kapitonov V.V. and Jurka J.; RT "Families of autonomous piggyBac element from the freshwater RT planarian genome and two clear examples of horizontal transfer of RT piggyBac transposons between a flatworm and insect."; RL Repbase Reports 8(5), 532-532 (2008)Repbase Reports. XX DR [1] (Consensus) XX CC piggyBac-13_SM is a young family of piggyBac transposons, CC characterized by 14-bp TIRs (1 mismatch) and TTAA target-site CC duplications. The consensus sequence was reconstructed based on CC multiple alignment of 7 copies, which are ~98% identical to the CC consensus sequence. XX FH Key Location/Qualifiers FT CDS join(554..1510,1573..2232) FT /product="piggyBac-13_SMp" FT /note="piggyBac transposase." FT /translation="MSHHNEDSDQEIYADNLSDTSCESDFDSDDSDLVDDE FT DIIPLPTTFTRRTISSSSDEEEVDGWSINDNPPVLEDFLGIPGITLNNPPK FT TQMDVLKLFIDDDFFKFLVIESNRYYYQNIDRLKSQTKGIKWKDITIPEMK FT KFVGLIIFMGLVRKDRRNEYWTTDPVLETPFFGKTMSRDRFNQIWRAWHFN FT NNEDITDPSDRLEKIRPIINHFVEKFKNIYKPKQQVSLDEGIIPWRGRLFF FT RVYNAGKIIKYGILVRILSESDTGYICNFEIYAAQGLRLIETIQTVVSPYT FT DVWHHLYMDNYYNSVDNTVALLQKKIRRGDTIFRRKNDILLQVWKSKKEVS FT LISSIHSAEMKESHNVDRMTRQKIIKPNALIDYNKYMKGVDRADQYLSYYN FT ILRKTTKWTKRVAMYMINCALFNSFVVYNSITTKKIKYKRFLYDIAFEWTS FT DSIHEEVISEDPGPSTSSWKRAPSRDPPGRLSMDMRKHVIAKIESKGKKKF FT VQRQCRVCSARKVRSETCFICKFCEVPLHRGKCFERYHTLKQY" XX SQ Sequence 2554 BP; 889 A; 404 C; 478 G; 783 T; 0 other; cacattgtaa tcggatcacg tattgatacg tgtttagcaa attaatgcta gagagcggat 60 cacgtattta tacgtttttg atagattgtt agtggagcgc gggtcacgtt aaaatgcggg 120 ttccaatata cttgagtttt cgctatatag cgtctgtagg gttgtagcca tgcaggtaaa 180 aaccttttga gaacaataga tggggtgtac ggtgggtacc gaggtagtgt tattcatcac 240 aaaaatagtc gaagataacg gtacatataa cggtctacat gcaacactaa gacgaagaat 300 aataattttt tttagtttat ttcactactc tttggtggtc ggacgttgct aaaagctaaa 360 tatcgagtac atagatatta tatatatata tatatattac gtatatattt acattttatt 420 ttagtgttta aaagttaaaa tacttttgat aaattttcga tgaaacagac atttttacca 480 agattgtgat tgaaagtttg ttattttttt tagttagtta tattattata ttgtgtttta 540 tcccataaga aaaatgtctc accacaatga agatagtgac caagaaatat acgccgacaa 600 tttatccgac acatcatgtg aaagtgattt tgatagtgat gatagcgatc tagttgatga 660 tgaagatatt attccactgc ctacaacatt tacccgcaga accatttcgt ccagttctga 720 cgaagaagag gttgatggtt ggtctataaa tgataatccc ccagttcttg aagatttttt 780 gggtattcct ggaattacct tgaataatcc tccaaaaacg cagatggatg tgttgaaatt 840 gttcatagac gacgattttt ttaagttttt ggttatcgaa tccaacagat attattatca 900 aaacatcgac cgactaaaat cacaaaccaa aggtatcaaa tggaaagata tcaccattcc 960 ggaaatgaaa aagtttgtag gtctcataat attcatgggg ctagtacgga aagatagacg 1020 gaatgaatat tggaccacag atcctgtttt agaaactccg tttttcggta aaacaatgtc 1080 aagagatagg tttaaccaaa tatggcgcgc ttggcatttc aataacaatg aagatattac 1140 tgacccatct gatcgtctgg aaaaaataag acctataatc aatcacttcg ttgaaaagtt 1200 caaaaacatc tacaagccaa aacagcaagt ctcattggat gaaggcataa taccatggag 1260 aggaagacta ttcttcaggg tgtacaatgc tggaaaaatc ataaagtacg ggattctcgt 1320 gagaatatta tccgagagtg atacaggata tatctgtaat tttgaaattt acgctgcgca 1380 aggactacga ttgattgaaa caatccaaac tgtagtttcg ccatatactg atgtctggca 1440 ccatttgtat atggataact attataatag tgtcgacaat actgttgcgc ttttacaaaa 1500 gaaaataagg gtttgtggaa ccattagaaa aaatagaggc ttacctgaat gcctaaaaaa 1560 acggatttac agagaggaga tacaatattc cgtcggaaaa atgatatatt gttacaagtt 1620 tggaaatcca agaaagaggt atctctcata tcttctatcc attctgcaga aatgaaggag 1680 agccataacg tcgaccgaat gacgcgacaa aagattatca agccaaatgc ccttattgat 1740 tacaataagt atatgaaagg cgtcgatcgt gccgatcaat atttatcgta ttacaatatt 1800 ttgagaaaaa ccacgaaatg gacaaaacgt gtggcaatgt atatgataaa ttgtgcctta 1860 ttcaattctt ttgttgtata caactccatt acaactaaaa agattaaata taaaagattt 1920 ttatatgaca ttgccttcga atggacttcc gattcgattc atgaagaagt tatttctgaa 1980 gatccagggc catcgacatc ttcgtggaaa agggctcctt ctcgtgaccc tcctggaagg 2040 ctttccatgg acatgcgcaa acatgtgata gccaaaattg aaagcaaggg aaagaagaaa 2100 tttgttcaaa ggcaatgtcg tgtctgttca gcgcgcaagg tgcgtagtga aacatgcttc 2160 atatgcaaat tttgtgaagt tcccctacac aggggcaaat gttttgaaag atatcacacc 2220 ctaaaacaat actaattttt attttacatt tagaaataat ttttgtgtat aatttgtaaa 2280 ataaaggatg ttatgtaaaa aaatattatt tttttcaaaa ctttcctgct ttatgtgaaa 2340 ttctgcacta gtttaacgat gaagaataca tttctaaata taataaaata aaatgaaata 2400 aaataaataa tttgcaaaaa gaattattga aatcaaacaa atatttttga agttatcgca 2460 atttttctag aggtctacgt tagcgcgaaa ttacgcgaaa aatcacccgt taccagcact 2520 agtacgcggc cgaaaactgc ccgatcacaa tgtg 2554 // ID Ginger2-1_BF repbase; DNA; INV; 6110 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger2 DNA transposon from Branchiostoma floridae. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Ginger; KW Ginger2; integrase; Ginger2-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6110 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC TIR is 161-bp long. XX FH Key Location/Qualifiers FT CDS 2052..3794 FT /product="Ginger2-1_BF_1p" FT /translation="MSCILEHNSIYYIVLQLANVLYPYSCHTRVGHSRRDK FT TWDEIKNNYAWIRYDLIQLYLRTCRECSTRAPLKKPAAGRPIISLGFMTRM FT QIDLIDMTSRPDDDNKWILHMRDHFSKFSWTHPLTSKRASGVAEKLVQTFC FT LFGSPHILQSDNGKEFVAGVINELTEKWPGLVIIHGRPRHPQSQGCIERAN FT GDLQLKLGKWMEEHPEKGWVEGLQHVTYAMNTSVCATTGKSPYAVVFGQSP FT RTHCAELEILAEQGIRHEDDLPDFFADTVDTADTADTADTGASEQPELTPQ FT QEHSSNKTPENSTQDPEPSRSLQSLAQESRPTPIKHGRDYHLFHGARMVAV FT GTEVLERETVHGQTVDPDHQAVFQLTNIADPTHIPACFNPFEEPLTEGQFV FT IWEIEKTTPVDESDVPHAKIRKIATENYLKAANRQQKNYDATVKHLIKSYS FT PGDTVGVRISEVDRTNTDPRLLPCKVLEANKQGQDTTYRVYSAEGKLKQAF FT SSEDLVDMRNVCFPRLQSIDVQDLEEVTLIHAARKHSGWQASAANGHSVCA FT CKGSCISNKCRCKKAGIPCATKCHPATLNVCQNR" XX SQ Sequence 6110 BP; 1830 A; 1276 C; 1232 G; 1772 T; 0 other; tgtcacaggg gagattggaa tccaggggga ttcggaatcc tcctagggga gattggaatt 60 ccaatctccc ctaggggaga ttagaattct tctggtgttc tactgggatc ccaatctccc 120 ctaggggaat ctggaatcct cctaggggag attggaattc ttctggtgtt ctactgggat 180 tccaatctcc cctaggggaa tctggaatcc tcctagggga gattagaatt cttctggtgt 240 tctactggga ttccaatctc ccctagggga atctggaatc ctcctagggg agattggaat 300 tctaccagga caatctttga atctaccagg agtggattca aagtttcaaa caatgcaaca 360 aaggatttat aattatttct aagaataaat cataccttat gcttaatcat atcttaatat 420 tcttatgctt tatatctaac acttatcatt tttctgacaa ttagaggtta agaatacttc 480 tctatagtgt acaatagcat cttatcctat actatagctc attaaacatc acattttcaa 540 acatttggta atagaggaga atccagaatt ctactaggac aatctttgat tttaccaata 600 atcagaatgg atgaaattgt tttggaaagt agactgaatc atttgagtca ctaactcata 660 atcatataat ggtatgcaaa tgtcaccaat atgtaaatta gctacatttg catatgtcgt 720 tataagttag tgactccata ctttagaggt cattttccaa tcgaagccag ttcaaaattt 780 gacaacattg catgataaag tactagattg atcaagcgat ggcagagaat cctccggagg 840 agcagaaagc agaattttac agactgctgg aagctcacat cagcagcctc ggggagaaga 900 agaaggacaa ctactgcatt tcacaggaaa gatataataa ggctctacaa gcactacagc 960 taaccaaatg gtccaagtgc cctgaagggg ggaagttcaa gttttgggct actaagaact 1020 ttgatctgca agaaatcggt tccaaaaaga ttctgtattg caaaaaatct tcccgccctg 1080 tcgttccgaa ggaagacatc tttgacacta taaaatggtg agtaaatgtt gtgtatttca 1140 cttttataca ggtgtatagt ataggagtaa aatttattgt ttgtttttct caattttcac 1200 ttcgaataaa ccgctggtat gggggaaaac aaatttttgt ggtaaatttt caaggtagga 1260 agtcctctta ctatcagcgt gcgccccctt actcaaatta caatcaatgt aacattattt 1320 accactctca cagtgcagtc tgtttcaaaa tgatcttttt actgtgaaaa tgcaacaaaa 1380 ataatacaca acaatgtatc ttatcatttc aagcttcatt actctcaaaa tattccatag 1440 catgtaaatc aaaccacaaa aataataatt tctggttttg agtaaatcgt ggaatcaggc 1500 ttctaatgtc ctatggctag cacacaccaa atcagtcttc ttgtaatctg gggttggttc 1560 aaatctgaga tttatcattt agtttttaaa aagtacagta tatacaattt attataatgg 1620 tgatttgtgg atggtttaca tttacaatga taaattcatg ttcttcatac atgtatgtta 1680 cagaaaaaaa ggaacgttgg tgaagaaaaa gtagtatcaa agttcatcat tttccttatg 1740 gcttttgaaa atatggtatc gtcatgaagt tagcatgata ttttggaaat tttgcaatgt 1800 gtataaaatt gtatatctca agtaaaaatt ctggtgaagt tttaattgtt gccactgaga 1860 aagaagaata tttcatcagt ccaagttttg gcagtatttg tcattatctt cattgtatat 1920 ctcactctac caaatatgcc ttctacacat acataacaca cattcaagtt gaaatagctt 1980 cagaatgtcc catttctacc gcagattttc aaaggcttgg tacccaccct cggtgtatat 2040 aagactatat catgtcatgc atacttgaac acaactctat atattatata gtcttacaat 2100 tggcaaatgt tttgtatcct tacagctgcc acaccagggt tggtcactct cgtcgcgaca 2160 agacttggga cgaaatcaag aacaactatg catggattcg ttatgacctt atccaactgt 2220 acctccgcac ctgtcgagag tgttccacaa gggctccact gaagaaacca gctgcgggca 2280 gacccatcat ctcacttggc ttcatgacac gtatgcagat agacctcatc gacatgacca 2340 gtagaccaga cgatgacaac aaatggattc tccacatgcg ggaccacttc agcaagttta 2400 gctggaccca tcccctgacc tctaaacgag cttctggtgt agctgagaag cttgtacaga 2460 ccttctgctt gtttggttca ccccacatac tacagtctga caatggaaag gaattcgtcg 2520 caggagtcat caatgagctc actgagaagt ggccaggcct ggtcatcatt cacgggcgtc 2580 cgcgtcaccc acagtctcag ggatgcatag agcgggccaa tggagaccta caacttaagc 2640 tcggcaagtg gatggaagag catccagaga agggctgggt ggagggactc caacatgtca 2700 cgtatgccat gaacacctca gtgtgcgcaa caacaggaaa gtccccctac gcagttgtct 2760 ttgggcaatc tccacgcaca cactgtgctg aattggagat tttggcagaa caagggattc 2820 gacatgaaga tgatctccca gatttctttg ctgacaccgt tgacaccgct gacaccgctg 2880 acaccgctga cactggagca tctgagcaac cagaactgac accacaacaa gagcattcct 2940 ccaacaagac tccagagaac tcaacacaag acccagaacc atctagaagc ctgcaatcct 3000 tggcacagga gtccagacca acaccaatca agcatggaag agactaccat ctcttccatg 3060 gagcaaggat ggttgcagtt ggcacagaag tcttggaacg ggaaactgtg catgggcaga 3120 ccgttgaccc agaccatcag gcagtgtttc aactgaccaa catcgcagac ccaacccata 3180 taccagcatg tttcaacccg tttgaagaac ctctcaccga aggacagttt gtcatttggg 3240 agattgagaa aacaacacca gtggatgaat cggacgtacc tcatgccaag attcggaaaa 3300 ttgcaacaga gaactatctg aaagctgcca accgacagca aaagaactat gatgcaacgg 3360 tgaagcacct catcaagtct tatagcccag gtgacacggt gggagtccgc atcagtgagg 3420 tggatcggac gaacacagat ccgcgactcc ttccctgcaa agttctggaa gctaacaagc 3480 aaggacagga caccacatac agagtctact cggccgaagg gaagttgaaa caagcattca 3540 gttctgagga ccttgtggac atgagaaacg tgtgcttccc acgactccaa agcattgatg 3600 tccaggacct tgaggaggtg accctcatcc atgctgcaag gaagcactca ggatggcaag 3660 ccagtgctgc aaatggacat tctgtttgtg cctgcaaagg aagctgtatc tcaaacaaat 3720 gcaggtgcaa gaaagccggc ataccatgtg caacaaagtg ccaccctgct actctgaatg 3780 tatgtcagaa tagatagaat caatcacagt ttcaagatgt tacagttcca cacctacatt 3840 gtacatgcat gtattcagat cagtacattt cactagtcag actttagtgt agttaacgtt 3900 gtagttaaag tttggtatcc ttcagtagag tcaattccaa gtcaccgaga tggttttgaa 3960 ataatatttg taataagttt gaactatttt gagtaaaaga acatttcagt cactaaagtt 4020 caaaattgtt caaacatttc aaaatgaaag tttaaacatg tttgaacctt tttatatttg 4080 ttggaacatt tgggaaagta atgatatcat tatctcttta tttagatcaa ttttcaaaca 4140 tctatgtgtt tttgttattg ttccaacatg ttccaacatt ttgtgtcatt ttttccctcc 4200 ataaaaactt ttattgaccc cataaacaaa gctgctaagc ttggtccttt gtttggctct 4260 tgtattttta gactttcttt agtatacact tgtgagtctt ttgtgagaag gtggcagaca 4320 gatataattt cctgtacttg gcagggatgt gtgaattgcc ctcttcccag cccactgacc 4380 cagtttcatc atattgtttc catgttcgaa catgtgatca caaatgatgc atctatgttg 4440 gaacagttta gaaactaaca ggaataatgt atttgatgtt agaaatgttt ctaacatgtt 4500 tgaatactat agaaatattt agaacgtcac atagatgtta tagatagttc taaacagtta 4560 cttatcacag ttagattgtt acaaagattt ccaaattgtt agaacaaaag gtaagaaaca 4620 ccctctcggt aatatggaat cgactctata cttgtaaaag tgacaatacc caacagaggg 4680 taggtttctc ttgcggtagg ctcaccatga gtcgacgtta cccctgctac cttgtagggc 4740 atctcttcac cacaggacag gttcttctcc tagagccaaa cacgtctacg ggagtcggac 4800 agtcttttgc ctgactttgc aggactttga ccgtatatgt taatgactaa gtccttcacc 4860 atccgtgagg tttcttactt aggctgttga tggttactcg attgtttgtt tcatctatca 4920 gcaccattcc gtcagtatag cagggtatac tatgttattg accactgact gttaagtgac 4980 acacaagcct tgctcctgtc gctctagctg ataggtttac ctccttcttg tctagcaacc 5040 tataagtatg tatgcctcgg tttggctagc ttgcctatag tacagacgta ccacgttctt 5100 tcttagttaa cgattgtgat tgttttagcg gttgttttaa ataattctcg tcactctcta 5160 gatttcatgt tagccatctg atttgcactc cacgcctctg cactctccac tccaaacacc 5220 tcactcttcc tccctgcggg cttggtcctg gtggttttca atgttgtgaa aacgccatta 5280 aactttacca agcacactca atctcagtct cattcaatct cagagagcag ataattgagt 5340 agaagtgtgc attgagtaaa gttggagaag tgactctggt gtggtagtct tcggtgcagt 5400 taaaatagaa gtgacacaag taggcttgag agtagttggt aatgagtcag tatgtttgca 5460 ttaatactaa ataaaatcgt ttatcaaaaa agtattgttg cttcctacat gttatttaag 5520 attggatcag ggtttgtcct aggacagatc ccgattgcaa tctgtacaag tagaatcgcc 5580 aattgtccta ggacagatct cgatcgcatt ttgtttcctg catcttcgca tcatttgagt 5640 ttggatcgga ggttgtccta ggacagattg agatcgcaga atagaattga gaattccaat 5700 ctgtcccaga acaatccacg attgcccaaa caaataagat gaacgacttg accagacggc 5760 tgtcaccata gttgcaatat aacaggtggc acgacaaaag aacagtaaag aatatgaagt 5820 tttttaccta gaatagcttt tcctgatgaa tgtaggcttt ataatcccca aggtggaatt 5880 gattaaaagt ggtgaaatgg tgtagtctac attcccaaac aagttcatca aagatggtcc 5940 tggtagaatg ccaatctctc cttagctagg agaattctgg attctcctag gggagattca 6000 aattgcagta gaatgacaga agaattctaa tctcccctag gggagattgg aattccaatc 6060 tcccctagga ggattccgaa tcctcctgga ttccaatctc ccctgtgaca 6110 // ID Copia-17_AA-LTR repbase; DNA; INV; 143 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_AA_; KW Copia-17_AA-I; Copia-17_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-143 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 945-945 (2011). XX DR [2] (Consensus) XX SQ Sequence 143 BP; 45 A; 33 C; 19 G; 46 T; 0 other; tgatgtgtag taacccgcga cttatgttag ctacgcaatc agatccttag caacgccatt 60 aaccttagaa acctaagttt gttatttcaa ataaatattc attagtcttc caaccttcga 120 accagaagca cctattttat tca 143 // ID Mariner-7_BM repbase; DNA; INV; 1734 BP. XX AC . XX DT 28-APR-2010 (Rel. 15.07, Created) DT 28-APR-2010 (Rel. 15.07, Last updated, Version 2) XX DE Mariner-like DNA transposon - consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-7_BM. XX OS Bombyx mori OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. XX RN [1] RP 1-1734 RA Jurka J.; RT "DNA transposons from Bombyx mori."; RL Repbase Reports 10(7), 942-942 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. XX FH Key Location/Qualifiers FT CDS 377..1399 FT /product="Mariner-7_BM_1p" FT /translation="MGKNKSCDPTLRKIIMKFVERGWSYRRIAQHLNCSKT FT MVCNAVQHFKRYNTALNVSRTKRSRKTTPQEDRMITRLAKKDPFLGSILIK FT HEVFGANERAGVSARTVRRRLVEAGLFGRVARKVPLLKKEHRNARLAFARK FT YGHWTYAQWQHVLFSDETKVNLISSDGRQYVRRPVKAEMNPRFTKKQVKHG FT GGNIKIWGSFSGRGVGPVRKVQGNLDQHQYKAILEETMLPYAEEHLPIIWT FT FQQDNDPKHTAHSVKSFLTSQFVSVLDWPANSPDLNPIEHLWHEIKKKVAT FT YRNTNLDELHEAFCEAWANIPVETCQKLIMSMPHRCQAVITNKGFATGY" XX SQ Sequence 1734 BP; 560 A; 297 C; 339 G; 538 T; 0 other; cagtcgtggt caactaatta gggacaatct ctagttaagc caaattgtga atatttttta 60 ttattttcca acgagtctta gcaatattcc atatctttta catgtaagtg taattagtca 120 tttatgtgcg cgattataca aacaataaca acttcttaaa cattaaacct ctttacagta 180 tagctgatac aatatcatag tgctattggt tcaaagaatg ctggctggcc atttaattag 240 ggacactgag gtattgcgtt ctaatttcaa aattggaatt ttaatacttt tttgtttctt 300 ttcttattgt ttagtggtcg ttttggcggc aatccagatt ttttatttta agttttgaga 360 attattttga agaaaaatgg gcaaaaataa aagttgcgat ccaacactac gaaaaattat 420 catgaagttt gtcgagcgtg gttggtcata ccgaaggata gcacaacatt taaattgttc 480 aaaaactatg gtatgcaatg cagtgcaaca tttcaagcgc tataacacag ctttaaatgt 540 gtccagaaca aaaagatcga gaaaaacgac tccgcaagaa gatagaatga taaccaggtt 600 agctaagaaa gatccgtttc taggctctat tttaattaaa cacgaagtgt ttggggcaaa 660 cgaaagagcc ggtgtatctg cgaggaccgt tcgacgacgt ttggtagaag cagggttgtt 720 cggaagagtg gctcgcaaag taccgttgtt gaagaaagaa catagaaatg ctcgtttggc 780 atttgcacgt aaatatggac attggaccta cgcccaatgg caacatgtac ttttctctga 840 cgagaccaag gttaatttga tttcttcaga tggaaggcaa tatgtacggc gtcctgtaaa 900 agcggaaatg aatccaagat tcacaaaaaa acaggtgaaa catggcggtg gaaacattaa 960 aatatggggt tcattttctg gacgtggagt gggacctgtg aggaaagttc agggtaactt 1020 agaccaacat caatacaagg caatattgga agaaactatg cttccctatg ctgaggagca 1080 tctccctatt atatggacgt ttcagcagga taacgacccg aaacataccg cccattcagt 1140 caagtcgttt cttaccagtc agtttgtatc ggtgctagat tggccagcaa atagcccgga 1200 cctcaaccca attgagcacc tttggcacga aatcaagaaa aaagttgcca cttatcgtaa 1260 tacaaatttg gatgaacttc atgaagcttt ttgtgaggca tgggcaaaca ttcctgtcga 1320 aacatgtcag aaactgataa tgtctatgcc acatcggtgt caagcggtaa ttactaacaa 1380 aggttttgct actgggtact gattttacac gtaataaaat ggagctgtta aagatattat 1440 aacactgaag attataagtt ttgtgtaact gtatttttca ttatacttta ctgtccttaa 1500 ttaaatgtcc actacagaat tgtaccttaa tatttaatac atgtttaatc tttgatatta 1560 aatacttttt ctatttatta aatcttttat ttacagccat ttatagcata ataatcggac 1620 cacattatgc aaataatagg tgctgagaaa tcaatagggt tttatattta aatgtacata 1680 accggttata gcccttcact atcttgaatg tccctaatta gttgaccacg actg 1734 // ID Gypsy3-LTR_Dmoj repbase; DNA; INV; 217 BP. XX AC scaffold_6498; XX DT 13-MAY-2009 (Rel. 14.05, Created) DT 13-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy3_Dmoj; KW Gypsy3-I_Dmoj; Gypsy3-LTR_Dmoj. XX OS Drosophila mojavensis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; repleta group; OC mulleri subgroup. XX RN [1] RP 1-217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1041-1041 (2009). XX DR Genome; scaffold_6498; Positions 3079322 3079538. XX SQ Sequence 217 BP; 62 A; 47 C; 46 G; 62 T; 0 other; tgtgggaacc ggcaccgaga tgctaagcag aaagcgacgc ctacaaagca gagcccgtta 60 acactctctc ttgataatct cttgttactt ctctccccat tcaagctcta gcgtttagta 120 cttttgtgac aagctaagcg gccgcattga agcggaacaa tatatgtatc attactttgg 180 aaataaagtg ttttatgtaa agtagcgggt tattaca 217 // ID BEL-4_DPu-I repbase; DNA; INV; 7944 BP. XX AC scaffold_26; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-4_DPu_; KW BEL-4_DPu-LTR; BEL-4_DPu-I. XX NM BEL-4_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-7944 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 655-655 (2010). XX DR Genome; scaffold_26; Positions 948083 956026. XX CC Positions [6815-7393] - Integrase core CC 'AGTAG' target site duplication CC LTRs are 99% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS join(1859..4279,4283..7768) FT /product="BEL-4_DPu-I_1p" FT /translation="MVQTLKNSRASTKGHMTRSIGLINGYANKVMTQQDVN FT SLDVQEGKLKGLFETYVTASREILEMLKASKAPQQELDDEQTNTLQTQEEV FT LGARAIIKQNKQEWLDDERDRRLLALVQTSNQAANQASNQASNQATSQAQM FT AQLIAQIVAAIPAPPAPVINVTAAPAPASAVQSIRLPQRQIKHFRGDVLEW FT TQFWESFNAAVHSSSLSNVQKFDYLKEYLKGEAYLLVNNLELTDANYQVAV FT DELKRMYGKTDVLIDAHFEKLDALQPVKDGNDVPALRSFQLNLQSHISALE FT TLGVPTSSFGGLLGSRLIKLIPSRLQLEWAKSATNKTTDIEMVIKFIGEQI FT DAAERYNRIKGVEKEKSSPAPKQPKPANPPPATASQLAVGAKASPAPQVKS FT SLKTKNVKFNPSWIVCERPCIFCSEIHWPTKCPKGLAERKKMIFDLKRCVN FT CFGDKHALKDCTSTRNCNKCGGRHHTALCDKGDVRVSNPTTAASIVASNPT FT TTTTACASTFGQLMLKTATVIISGQNGNETRAILFADDGSHRSWVLKSLSS FT QLNMKTVAVENISTRVFKKKEPSKPEMTKNVEMQVRGTWQGAPKVTLMALE FT SDHIADTGSYVGSEFASSLWIQNEKLADDRFEMAHTEEQPIGILVGMDQMF FT QIMSNEAAIQSPCGLRAFQTKLGRMIAGPSQEAKSKTGTQVIQNLILTSSY FT PIPQITSTFAASSQKRKILPTQANKTPMEKETGTASLQTPPLGASFSDAPI FT GSSGEADSKWFEKEEQVATNFDLSLFWKIENFANLEGADAVELENRFDFFD FT EKITRPDGRFCTPIPWTTDKWRLQKNLPLATGRVESTITRLRKNPGDLVNY FT HAEIQKLIDSKFVEEANMDYEELHTYLPHHPVFRGDKSTTKIRPVFDGAAK FT TKFGPSLNDVLETGPNLNPDLLSVLIRFRKYEIAWIADIEKAFLNIALQPE FT DAEAIRFLWPRDPAVAGSPLIAYKWKRVPFGLSSSPFLLRITINKHFKSLK FT SRFPETIQQLEENLYVDDYLGGASNLSIAKKRVDETDEIFSDAQLNMRSWA FT TNNEDLKQHLKEKGLSIKVVGLLSPALDGQQKVLGIRWDTGSDSFKFDPAS FT IIQAVEEVRAEITKRKILSISARIFDPIGFLSPTVLLLKIIYQKLWEKEIG FT WDQAAPTDIQKSWKIVMTGLTDFTELQIPRWIGLSEKVTSHEIHVFGDASE FT AAYGAVAYARFQRGQEDPCIVLLASKTRVAPLPKKKVTLPRLELLSSLLAV FT RLGEVVKNAMQIQYWRTIYWSDSLVALGWIRGEPNKWKPFVQNQVETIRKF FT SQPEWWRHCPGLQNPADPAPTLVTSTLWWNGPIWLKGEEAEWPDSPNDQIP FT QTTQSEMEAEARSTTVSTAAAIAIPTATVEWNLDKISTWNRLLRRTAWVSR FT FHSRSQKKLRPPGPERMESIKANGKVIRITRLSREELDEAELTIYRQLQRE FT RYPKAFESLQLDNTIQPKEKIAALFPIWDARDRLIRIHGKVSLALRDRNID FT PPILLPASHIVITFLITDKHESLLHAGAKVTLSELKEKFWIVKGRQQVKKI FT LFTCVECKKLTSPPFQELAAPLPLNRLKHAQSFHVAGVDFAGPLLYKPAQR FT RKKRKAPSGVQDPTPADDPTEEQIEEPPTEGDAPIEAPIVGEEDSIEEDVE FT IQDILTTTTKAKKTNHLKCYVCLFTCAVTRAVHLEILPDMTARSFLLAFRK FT FAARREPVSVMYSDNAQTFRCVERFLRTIHADPVIQDFLASRKTLWIFSAS FT LAPWWGGFWERMVRSVKDLLRRSNGRACLDYMELEASLAEIESVINARPLS FT YIGEGADDPLPITPNQFLNNRRSTRADPEPAVNLLAPTSTSIVLQEMDKNR FT REYVADICARFVDDYLLQLDNFHSKGKSGRKIRLGEVVVIHDEHSKRLMWS FT TGVVKELIPSRDGLIRSVMLKVPNGNIINRAIQCLHPIELREDLA" XX SQ Sequence 7944 BP; 2541 A; 1942 C; 1801 G; 1660 T; 0 other; tggtccttcg aacggtttaa ttcagtgcaa caccgtgtcc tccctctcaa aacgactaac 60 tcaaaattaa aaaattgtga agccggttgg ggctacacaa tacagacccc ccaaacgttt 120 atattgttcc cctttttgtt aatatttgcc caaatcagac taaattgtaa gaacaaagac 180 ggccaaaggg gggaacctat gcccaaaaat ccattttttt tactcaattc tttctcgcgt 240 ttcattgtcg ggcaatcctg agggggcggc cattaaccac gccgactcaa acgtctgcct 300 aagcctaaga attgtgctct catctattct tctgtctggg ttggtgttgg cttcctgtgt 360 ccgtttctgt ttttcctttt ctcttgttgc tcagagtaag aattaatatt cctttttttt 420 tcctctgtct caacatttat agaggagccg aaaggtctcc aaatcatcca tcaaaatagg 480 aaaacgagtc cagagggcgc tgttgcgcaa ggcggggctg ttgccagctt gggtggaacc 540 gccaccgcgt ccaccaccaa gggagctgat cccactcgat cagctgccac aaggcgccct 600 agcccctcat ctgcagccga tattggatat tgcaagaagg cacctcaacg cagagcagct 660 agttgacgcg ttccggccaa gatattgggg cgaccgacgc taccggccag aagaggaacc 720 ggccccacaa ccggttccac cgccggctca ggatttactg ccgaacgtga ttgaggaacc 780 ggaagcagag caggagccaa tccaggcacc tccgcccgac atcgagcaac cacaacgcca 840 agtacgctac cgtcagcagt tcgtcggaag aggcttactt cggcgagtgg tacatcaaca 900 agaaagacag gaggaagcag cagctgctgc agccccaaaa gcagaaagtg agtcggagtc 960 ggaggacgga atccagcttt gccacgaaag ctcgtcggaa gaatcaacag ccgaaagcag 1020 cgccacagga gaaaacaggc gccacttcga gagagaattg gcaacaaatc cagctctcag 1080 aagagcttac gagaggagcg aggcagcgag gagagaacga ggccgtttca acgcacgtcc 1140 tggccacctc tgcacacgaa gaaatacaag aggaacggag aggaggagcc caaactaagt 1200 aaaacttaat tgtattcttg aaaatggcga aaatctccgt taaatacaat tgttgaatgg 1260 gtatcatttc tttatctgtt cttctttgaa aaaaaaaaaa aacctctctt acataaggat 1320 cccgagaaat cggtgcatcc atacgaaaaa aaaaaaaacc ttttttctct ctactcgctc 1380 acgctcactg tagtggcacg ccgcaagggg cgccaccaaa aaaaaaaaaa acctctcttc 1440 ctctatcacg ctcactttag tggcacgccg caaggggcgc caccaaaaaa aaaaaaacta 1500 tcctatctct ctcccacagt gtagtggcaa gccgcaagtg gcgccaccaa caacaaaaca 1560 aaccagtttc gcttaattga gtcgaccagc ccggattttt tgcaggagcc tgggtataac 1620 cctagctcca aattcaaatc gattctccct cttttttttt ttttttttag atacatccaa 1680 gggatttcaa acaaagtgga aacaaaaacg gaaacacatc cgttccacga acccccaagg 1740 aaaataggaa aaggagacgg gcaatcaatc aataagtgtt caaaaatcaa attaaattct 1800 ctattttcct ccttttttcc cagtcaacag ctcttgaaga actacacaac caggaagaat 1860 ggttcagaca ctgaaaaatt caagggcaag cactaaaggc cacatgacaa gatcaatagg 1920 cctgatcaac ggatatgcca acaaagtaat gacccaacaa gatgtaaata gtctagacgt 1980 acaagaaggg aaattgaaag gtctcttcga gacttacgtt accgcatcaa gagagatttt 2040 ggagatgtta aaagcgagta aagcgccaca gcaggaactt gatgacgagc agaccaacac 2100 gctccaaact caagaagaag tactcggggc gagagcaatc atcaagcaga acaagcaaga 2160 atggttggat gatgaaagag acagacgatt gttagcgctg gtccaaacat caaatcaagc 2220 agcaaatcaa gcatccaatc aagcgtccaa tcaagcaaca agtcaagcac agatggcgca 2280 acttattgcc caaatcgtgg cggccattcc ggcacctccc gcgccagtaa tcaacgtgac 2340 ggcagcgcca gctccagctt cagccgtaca gtcgataaga ttgccgcagc gacaaattaa 2400 gcacttcaga ggggacgtac tggaatggac ccaattctgg gaatcattta acgcagctgt 2460 ccactcttca tctctatcga acgtgcaaaa attcgactac ctcaaagagt atttaaaagg 2520 tgaagcgtat ctgttagtta acaatcttga acttacagat gcgaattatc aagttgccgt 2580 cgacgaacta aaaaggatgt acgggaagac ggatgtccta atcgacgcgc actttgagaa 2640 actggatgcc ttgcagcccg taaaggacgg gaacgacgtt ccagctctga ggagttttca 2700 gctgaatctc caatcgcaca tcagcgcgtt agaaacgtta ggagtcccca caagttcctt 2760 tggcggactc ctcggatcaa gattaatcaa attgattccg tccaggctcc aactagaatg 2820 ggcgaaatcg gcaacgaaca aaaccacaga catcgaaatg gtcatcaaat tcattggaga 2880 acaaatcgat gcagccgaga ggtacaaccg gatcaaagga gtggaaaagg agaaatcatc 2940 tccagctcct aaacagccaa aaccggcaaa tccaccgcca gcaacagcat cacaactggc 3000 agtaggagcc aaagcaagcc cagctcctca ggttaaatcc tccttaaaaa ctaaaaacgt 3060 taaatttaac ccaagttgga ttgtatgtga aaggccctgc atcttctgca gcgaaattca 3120 ctggccgaca aaatgtccga aaggactggc ggaaaggaag aagatgattt ttgatctaaa 3180 aaggtgcgtc aattgctttg gagacaaaca cgcgctaaag gattgcacat ccacccggaa 3240 ctgcaacaag tgcggaggaa ggcaccacac cgctctctgc gacaaaggag acgtcagagt 3300 ctccaatcca acaacagcag caagcatcgt ggccagcaat ccaactacaa ctactacagc 3360 ctgtgcaagc acctttggac aattgatgct gaaaacggca acagtcatca taagcggcca 3420 gaacggtaac gaaacaagag cgattttatt tgcagacgac gggagtcatc gttcctgggt 3480 gttgaaatca ctctcatcgc aactcaatat gaagacggta gcggtggaga acatcagcac 3540 aagagtgttt aagaaaaagg aacccagcaa gcccgaaatg acaaaaaacg ttgaaatgca 3600 agtaaggggc acctggcaag gcgccccaaa agttacatta atggctttag aatcagatca 3660 cattgcagat actggctcgt acgtaggatc cgaatttgca agcagcctct ggatccaaaa 3720 cgaaaaattg gccgacgaca gattcgagat ggcacacacc gaagagcagc caattggaat 3780 tttagtcgga atggaccaga tgttccaaat catgtcaaat gaagccgcca tacagagtcc 3840 atgcggatta cgcgcgtttc agacgaaact cggaagaatg attgctggac catcacaaga 3900 agcaaaatcg aagacaggaa ctcaagtgat tcaaaattta attttaacgt caagttatcc 3960 aattccacag attacgtcaa cattcgcggc aagtagccaa aaaagaaaaa ttcttccaac 4020 tcaagcaaac aaaactccga tggaaaagga gactggaacg gccagtctcc aaactccgcc 4080 actaggagcg tcattctcag acgccccaat cggatccagc ggagaagcag atagtaaatg 4140 gtttgagaag gaagaacaag ttgctactaa tttcgacctt tctctttttt ggaaaataga 4200 aaacttcgcg aatcttgaag gcgctgatgc agtggaatta gagaatcgct tcgatttctt 4260 tgacgaaaaa atcacaagat gaccagatgg aagattttgt acaccaatcc cttggacgac 4320 ggacaagtgg agacttcaga aaaatctccc gttggccacc ggaagagtgg aaagtacaat 4380 aacaagattg agaaaaaatc caggagattt ggtgaattac cacgcagaaa tccaaaagct 4440 aatcgatagc aaattcgtcg aggaggcgaa catggactac gaagagctcc acacctatct 4500 cccacatcat ccggtcttca gaggcgacaa atcaacgacg aaaatcagac ccgtctttga 4560 cggagcagca aaaacgaagt ttggaccaag tctgaacgac gtgttggaga ccggaccaaa 4620 tctcaatccc gacctcctat cagtgctaat acgctttcga aagtatgaga tcgcctggat 4680 tgccgacata gaaaaggctt ttctaaacat cgccctgcag ccggaagatg ctgaagcaat 4740 caggttttta tggcccaggg atccagcagt ggctggatcc cctctaattg cctacaagtg 4800 gaagagagtg ccctttggcc ttagttctag cccctttctc ctacgcataa caattaataa 4860 gcactttaaa tcactgaaat ctcgttttcc agaaactatt caacagctgg aagaaaattt 4920 gtacgtagac gattacctcg gcggagcaag caatttatcg atcgccaaga aaagagtcga 4980 tgaaacggac gagatcttta gtgacgccca actcaacatg aggagctggg caaccaacaa 5040 cgaagaccta aaacaacacc tgaaggagaa aggactgtcg attaaagtcg taggcctact 5100 ctccccagcc ttggacggcc aacagaaggt gttaggaatc cggtgggaca ccggatccga 5160 ttctttcaaa tttgatccag catccatcat ccaagcagtg gaagaagtga gagcggaaat 5220 caccaaaaga aagatactca gtatctcggc aaggattttc gatccaattg gatttttatc 5280 tcccactgta cttttactaa agattattta ccagaaactc tgggaaaagg agataggttg 5340 ggaccaagca gctccaaccg atatccaaaa atcttggaag atagtgatga ctggtctcac 5400 cgatttcacc gaactacaaa ttccaagatg gataggatta tctgaaaagg tcacttctca 5460 cgaaattcac gtcttcggag acgcatctga agcagcgtat ggagccgtag cttatgcccg 5520 attccaaaga gggcaagaag atccatgcat cgttctactg gccagcaaaa caagagttgc 5580 acctctacct aagaagaaag taactttacc gcgtttagaa ttgctgagct cacttttagc 5640 cgtacgttta ggtgaagtcg taaaaaacgc gatgcaaatc cagtattgga gaactatcta 5700 ctggtcggac tccttggttg cactaggatg gattagagga gagccgaaca aatggaaacc 5760 attcgtccaa aatcaagtgg agacgattcg aaaattttca cagcccgagt ggtggagaca 5820 ctgtccaggt ctccaaaatc cggccgatcc agcgccaacg ctggtaacct caactctatg 5880 gtggaacggc ccaatttggc ttaaaggaga agaagcggaa tggccagact ctccgaacga 5940 ccaaattcca caaacgaccc aatccgaaat ggaagcggaa gcaaggagta cgacggtcag 6000 cacagcagca gccatcgcaa ttccaacagc caccgtcgaa tggaatctcg acaaaatatc 6060 aacctggaat cgactgttaa gacgtacagc ctgggtctca cgattccaca gcaggagtca 6120 aaagaagctc agacctcccg gtccagaacg aatggagtcg ataaaggcaa atggaaaagt 6180 gatccggatt acaagattgt ccagagagga gctggatgaa gcggaactaa cgatctacag 6240 acagctccaa cgagagcggt atcctaaggc gtttgaatcg ctacagctag acaacacaat 6300 ccagcccaag gaaaaaatcg ctgcactttt tccaatttgg gacgcaagag accggctcat 6360 tcgaatccat ggaaaagtat cactcgccct aagagacaga aatatcgatc ccccaatctt 6420 gcttcctgca tcacatatag taatcacttt cttaattaca gacaaacacg agtcactatt 6480 acatgcagga gcaaaggtga cactatcaga gttaaaagaa aaattctgga tagtcaaagg 6540 acgacagcaa gtgaaaaaga ttttgtttac ttgtgtggaa tgtaaaaagc tgacatcgcc 6600 accattccag gaattagcag cccctcttcc tttaaaccga ctcaaacatg cacaatcttt 6660 tcacgttgca ggagtcgatt tcgctggacc attgctgtac aagccagcac agcgaaggaa 6720 gaaaaggaaa gccccatcag gcgtccagga tccgacgcca gcagacgatc caactgaaga 6780 gcaaatcgag gagccaccca ctgaaggaga cgctccaatc gaagctccaa tcgtaggcga 6840 agaggattca atcgaagaag atgtcgaaat ccaagacatt ttaacaacaa caacaaaagc 6900 taagaaaact aaccatctca aatgttatgt ttgtcttttt acgtgcgcag ttactcgggc 6960 ggttcacttg gaaatactac cagatatgac ggcgcgttcc ttcttactag ccttccgaaa 7020 attcgcagct agacgagaac ccgtctcagt gatgtattcg gacaacgcgc aaactttccg 7080 atgtgttgaa cgattcctga gaactatcca cgccgatcca gtcattcaag atttcctcgc 7140 gtccagaaaa accctgtgga tattttccgc cagcctagcg ccatggtggg gaggattctg 7200 ggaacggatg gtgaggagcg tcaaggatct gctgcgacgc tccaacggtc gagcctgcct 7260 tgactacatg gaactggaag cgagtttagc agaaatcgag agcgtaatca acgcccgccc 7320 actcagctat attggggaag gagctgacga tccacttccg ataactccaa accaatttct 7380 aaacaataga cgttctactc gcgctgatcc ggagccagcc gttaacttgc tagctcctac 7440 atcgaccagc atcgtactac aagagatgga caagaaccgg agggaatatg tcgctgacat 7500 ttgtgcgaga ttcgtcgacg attatctact ccaactagac aatttccact ccaaaggaaa 7560 atccggaagg aagatccgcc taggagaagt cgtcgttatc cacgacgaac actccaaacg 7620 actcatgtgg tctaccgggg tagtgaagga attgatacca agccgcgacg gactcattcg 7680 ctctgtcatg ctcaaggttc caaatggtaa tataattaat agagccattc aatgtcttca 7740 ccccatagag ctgagagaag acctcgccta agacgtcgaa attggaaatc gggaaccgat 7800 ccaggaaccg gaagaggatc caaaatcgac tataccagaa caagaaatcg atctcgctgt 7860 cggagacgct acgccagctg ccgaagaaac ggacgacgtc gtcgaggatg tcgagccgga 7920 tgctacgggc tctagtgggg agtg 7944 // ID BEL-51_AA-LTR repbase; DNA; INV; 692 BP. XX AC supercont1.362; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-51_AA_; KW BEL-51_AA-I; BEL-51_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-692 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.362; Positions 695134 694443. XX SQ Sequence 692 BP; 236 A; 116 C; 131 G; 209 T; 0 other; tgttgcggga ggcgttacga acgttgacgc ttcagcaaaa agctccacag ggatgctgcc 60 ggcctgacgg atcagcttga catccaatga cttctattgt atgagtggca cgaatgacag 120 aaaagcagat gaattgtata atgtggtgat gaaattgtgt cggatcctga attaattatt 180 actacggaaa aactgttaat ttactattga ttagaactag ttttggatta ccttttgatt 240 attgctagtg agttaaacac gtaaaactac aatatgtaca agaaaaatgt gtataattca 300 caaatttatg attttaggtt atgttccatg cacttactcc tacttaatct tacgacaatg 360 aattaaaatt gaattataac ttaaaatact ttcttaaact atacaacagt gagtaaactt 420 atattctaat gaatgactta agactaagac tttaaagctg acttaaacag gtagactgac 480 ctcgagactg actataaatt acaggacggt aaccaggacg tagacgttgt attttgagtc 540 accgaaatgt gagtacaaat ggacataatt atagactgta ggaaaagtga ccaacaattt 600 atattttagc tttaaagcgc tattgccgaa taaaatggaa ttcgctaaaa ggactgtgac 660 ccgttttttc tacgtctccc accgcgccaa ca 692 // ID Copia-18_SI-LTR repbase; DNA; INV; 270 BP. XX AC AEAQ01022549; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_SI_; KW Copia-18_SI-I; Copia-18_SI-LTR. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01022549; Positions 557 288. XX SQ Sequence 270 BP; 68 A; 59 C; 52 G; 91 T; 0 other; tggaactata cgtgttcaag acgttgaatg tgtgtagttg gcaacgcgga gaggttgcaa 60 gtacaagact ttctgaacgc ttgacttttt cgtgcctcga ttgcgcgcca aaactccctc 120 acgcgatccg ctcttactct ctgtacgcct ctatagtaaa gcgtcgtgtt gtctctgtga 180 taatctttta tgtctgtcta tgtaacttgt actaactata taaaagcctt tttcgactgt 240 attttcaata aacaaataga cttttcaaca 270 // ID Copia-2_AA-I repbase; DNA; INV; 2015 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_AA_; KW Copia-2_AA-LTR; Copia-2_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2015 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 939-939 (2011). XX DR [2] (Consensus) XX CC 'CTGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 452..2002 FT /product="Copia-2_AA-I_1p" FT /translation="MSDAKFSIPKLNGSNWQTWKVRVEMLLARDDLWYVID FT EPVFEEEEDRTDEWKKDNRKAKATIILLLEDSQLPLVKNCVLARDVYDALK FT RYHQKTTRSVRVSLLKKLCACDLAEDGDVEAHLRDFDDLFDRLDAAGTTLD FT KDTKICMLLRSLPPSFDGLVTALDSRSDDDVSLDVVKSRLIDEYNRQLERK FT GGESAKIEKALRSTEGKPDFESRVCHFCKNPGHIKRNCRKFLATLKKEVDS FT SCTSCGECGKAKTAQGDSRSIAFAVGEGKSISWVIDSGACAHMCRDKSFFT FT FLEEFAGDHVVLADGKNIPIQGEGSGVLYGIDGSGEVMKIEMNKVKYVPEL FT SANLISVEQLAQKSYKVLFTGEGCDIVSAGGAVVATGHQHGGLYYLNTIKP FT PTALDGRHQHRRQLNPRYRIETKRFPKEESTVKVKLESSSPSMNVSFQEEV FT VWYDAIDEPEAHRIPSGVGDDSVNRKQFLPERFTTSQRPPGVMDRTLSLMK FT FEGLRKLFRLDKSPPGNESSH" XX SQ Sequence 2015 BP; 548 A; 411 C; 590 G; 466 T; 0 other; ggttttggcc ccagcaagcg acccagaaag gaagaaaatt tgtgtggaaa gattcggccg 60 ggaagagccg aattggtcgt cgtgcgttaa tttttccgct gctgcgtcag ctcggaaggt 120 gccgcgtcgt ggagtgaaaa ctatagcaac gagttgcgga caggtgttcc actccaggtg 180 agggtgaaga aaattgtttt ccgagctagt gctcgaagaa agtgacgaag agtgtttgct 240 tctccgggta gagagaaaag cgatttccga gcaagggacg ctcgataaaa gtgtgttccc 300 cagtgcatct agccgtcgcc gctcgaggaa gggaagagaa agaagaagtg ccgagtggag 360 ctggagcagg cgaacaacaa cgaagtggtt gcctaccggt tcggttagtg tgttgactga 420 tcacccgagc agaaagtttt taccggtcag aatgagtgac gcaaagtttt cgattccgaa 480 attaaatggt tccaattggc aaacatggaa agtgcgtgtg gagatgttat tggctcgtga 540 cgatttgtgg tacgttatcg acgagccggt gttcgaagaa gaagaagatc gtaccgatga 600 atggaagaaa gacaacagaa aggccaaagc gacgatcatc ttgcttctcg aagacagtca 660 gttaccgttg gttaaaaatt gtgtcctcgc gcgtgatgtg tacgatgcgt tgaagcgata 720 ccatcaaaag acgacacgat cggttcgagt gtcgctcttg aaaaagttgt gtgcctgtga 780 tctcgcggaa gacggtgatg tggaagcaca tctacgtgat tttgacgacc ttttcgatcg 840 tctcgatgcg gctggtacga ccttggataa ggacaccaaa atatgcatgt tgctccgaag 900 tctcccgccg tcatttgacg gtctagttac cgcactggac agtcgctcag acgacgatgt 960 ctcgctcgat gttgttaaat cgagattgat cgacgagtac aaccgacagc tggaaagaaa 1020 aggaggtgaa tctgccaaaa ttgaaaaagc gttgcggtcc acggaaggca aaccggactt 1080 cgaatcccgt gtctgccatt tctgcaaaaa tcccggccat atcaagcgga actgccggaa 1140 gtttttggcc acgctgaaga aggaagtaga ctcttcgtgt accagttgcg gcgagtgtgg 1200 caaagcgaaa acagcacagg gtgattcgcg gagtattgct tttgccgtcg gtgaaggaaa 1260 atcgatttcc tgggtgatcg acagtggcgc atgtgcccat atgtgtaggg acaagtcttt 1320 tttcacgttc ttggaggaat ttgctggtga tcacgtcgtt ttggcggatg ggaagaatat 1380 tccgatccaa ggtgaaggaa gtggtgtact gtatggcata gatggttcag gtgaagtcat 1440 gaagattgaa atgaacaaag tgaaatacgt gccagaactt tcagcaaacc ttatttccgt 1500 cgaacagttg gcacagaaga gctacaaagt actattcacc ggtgaaggat gtgacattgt 1560 gagcgctggt ggagcagttg ttgctacagg acaccaacat ggcgggttgt actatctgaa 1620 cacgataaag ccgccgaccg ccttggatgg ccggcatcag caccgaaggc agttaaatcc 1680 ccgttaccgg attgaaacta aacgttttcc gaaggaagaa tcaaccgtga aagtgaagtt 1740 ggagagcagt tctccgtcga tgaatgtatc attccaggag gaagtcgtgt ggtacgacgc 1800 aatcgatgaa ccggaggccc accggattcc atccggagta ggagatgatt cggttaacag 1860 aaagcaattt ctaccagagc gtttcacgac aagccagaga ccgccgggcg ttatggacag 1920 gacgctgagt ttaatgaagt tcgagggatt acgtaagctg ttcagacttg acaagagccc 1980 accaggtaat gagtcttcgc attgaggggg agtgt 2015 // ID hAT-N2_BF repbase; DNA; INV; 351 BP. XX AC . XX DT 30-SEP-2008 (Rel. 13.09, Created) DT 30-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Amphioxus hAT-N2_BF non-autonomous DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N2_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-351 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-351 RA Kapitonov V. and Jurka J.; RT "hAT transposons in the amphioxus genome."; RL Repbase Reports 8(9), 911-911 (2008). XX DR [2] (Consensus) XX SQ Sequence 351 BP; 100 A; 85 C; 80 G; 86 T; 0 other; cagggctcta gccagctcga attttttttc cgtcagccaa ttcccattgt cgcgaaaacc 60 gcgaaaacac tattccagga gttagaaaga ctgaactgag accttgaaaa acgggttttt 120 aatgagatta tcgtatgcga aagggcgccg acacacttcg tcaacaataa taacaatagc 180 aaaacaaaca gtgtgtggct tttacggccg tggacggcgt ccccaagccg cctgtagctt 240 ccaccaagga tttttgtccg tcaaggttga cggattggtt ttaaaatttt tccgtcacac 300 gcagcaaatt tccgtcaatt gacggaaaaa cggacgctgg ctagagccct g 351 // ID Copia4-NVi_LTR repbase; DNA; INV; 755 BP. XX AC AAZX01004118; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia4-NVi; KW Copia4-NVi_I; Copia4-NVi_LTR. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-755 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1111-1111 (2007). XX DR Genome; AAZX01004118; Positions 1933 1179. XX SQ Sequence 755 BP; 210 A; 144 C; 127 G; 274 T; 0 other; tgatgtaatt ctgtagaaac aactaatata attgtctatt cgagctttcg aattgccgct 60 tagattctct attgatcaga gtagttgtaa cgcgtctaat cataggtacg gtagaacgcg 120 ttaaattctt actctctctc tctctctctt cccggataac caccgaagcg ttcggacgat 180 cgcctaatac tcatagtgta attgagtcgc aaattcatta aagataatat attcttaaga 240 gatagtgagt gactttattt tacctggctt cgcacacctg cttccatacc taaaacatca 300 aattctatat tacatttttt tttacatcac ctttttaatc ataactagcg cgcgacacac 360 ggccttaccg gcgctcgcct cgatattgcc tgatggtcgc actccgctcg aacgtgaggt 420 aatcaattaa agcttttggg ccaaatgggg cgtgtcacac cgaaattgtt attcaattat 480 atgtaccaaa aatcgtttct agggttgttg tttatgtata agcgttttga acacatgttt 540 gtcatggtaa aaagttgtgt tttcaaatta tctacaactt ttacgtttaa ggtttttatg 600 taaaatgtat actagtttat tcctataaca aaatcgattt tgccctgtta actcgaatta 660 aaagcatttt cacgcttaag tgtctagagg aaagagtttt tttccttgag atctataact 720 tttctattta actttttttt tgtttgtaga atgca 755 // ID Copia-38_AA-LTR repbase; DNA; INV; 284 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-38_AA_; KW Copia-38_AA-I; Copia-38_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-284 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 962-962 (2011). XX DR [2] (Consensus) XX SQ Sequence 284 BP; 69 A; 74 C; 53 G; 88 T; 0 other; tgtaggtatt cgatgatcac ctctccacta tacacacacc ttggttcgcc cctggtgttg 60 ccatacctga cacatcgcct catattgagc gcgggaacat agaggaaaag ttcattccaa 120 accaaacttt caacgaagca agttcgtgcg tgttatttcc tttctttcaa taaaagtaca 180 gttactaaag taatctcgcg tgtgcaaatt ctttctgtac cgaattcctt acgctgttat 240 ccactgtgtg gtttgctctg tccgctcgtc aggttatgac ccca 284 // ID BEL-617_AA-LTR repbase; DNA; INV; 616 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-617_AA_; KW Pao_Bel_Ele194; BEL-617_AA-I; BEL-617_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-616 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 616 BP; 204 A; 97 C; 130 G; 182 T; 3 other; tgttatggca acactgaaca acttcagccg tctattgtac ctgtagaaca aatgatgttc 60 gacgttagtg agaaggagtc tgtcactgca attgccaaca aaaaggggac agagtgaata 120 acgtgctcga cgtcgtatgg caattggatg tgtaaaagtt attggttaaa ttacgtttaa 180 ataatatgct aacwttgtac gctaatcata tgttgmtaaa ttagctagaa gtgcgttgaa 240 tagcagagat ttgggtaagg atactacatc tgtatcggta ctcaactata attgatccgc 300 attcgtaggt taagagaggt tataggcata ggcaactaam catatcgacc aaatgttgga 360 ttaaataggt caaggcaagg taaaactaca catttttatc aactatgata tattctactt 420 caaaattata attgtagtat tcagagctca attgcgactc gccattgaag gcaatagaga 480 agttggagtg gtacggtaga agaccatgta agcaaaataa aactctgttg gattatttac 540 taaagcatta aatatatttt agctttgagc tgtgttgagt aactccgctg ctacatcgat 600 ttcaccgatc cgaaca 616 // ID Poseidon_Hyd repbase; DNA; INV; 2690 BP. XX AC . XX DT 07-DEC-2006 (Rel. 11.11, Created) DT 07-DEC-2006 (Rel. 11.11, Last updated, Version 1) XX DE Poseidon_Hyd is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW retrotransposon; Penelope-like element; reverse transcriptase; KW GIY-YIG endonuclease; Poseidon_Hyd. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2690 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Poseidon_Hyd is a Penelope-like element (PLE) from the freshwater CC hydra, Hydra magnipapillata. It belongs to the Poseidon group of CC PLEs. Its a single ORF contains regions homologous to reverse CC transcriptases and to GIY-YIG endonucleases. Consensus sequence CC was assembled from GenBank trace archives. The element is likely CC inactive, and the average divergence from the consensus is 5%. CC The 38-bp terminal repeat may or may not be present at the 3' CC end. XX FH Key Location/Qualifiers FT CDS 23..2503 FT /product="Poseidon_Hyd_1p" FT /translation="MSLKFYITSSYGGKLYKNTVQLQVEKTKNAASKNQWI FT FLERCIKNNIIPKSFHIHCPVKTKNGYNLMVEYRKKLLIIAKNDAKKRFYQ FT SKRRVIELSNAIKNIMKPEDMLNIERITNKSREIKFEQTKQHLIRKFNFLI FT NSKKNDSLKKFNPLNIISIPPTNLIKNPILNLSNISISHEQENLLSLGPQF FT IPTQNKIPFIDIITSTEECALNIEKNVNVERAEFLRQNVSKTLTKYLRVKI FT DSNISKNQRLAINELKNSTNIKLYPFDKGNGFAILNSDSAIQKIKEQLGNC FT AISAVDPTQKFLSLIQRTLSKLKKDKKLDKKTYGKIYPSDAIPPRLYGCIK FT AHKSEKGYPMRIIVSTIGTPPHKLSQHLVEIIQPTLNKNECRVKNSSSFVS FT TAKEWFINDEVQVSFDVINLYPSVPLDKATITIIDMLNKDRDDLIQRTKLS FT LVDIHKLIDLCISKCYFLWEDKIYILKNSGPIGLSIMVVLSEAYLQHIESN FT AIQQALNVNIAPKTYKRYVDDSHARFTNIDDANSFLTLLNSQDKSIQYTCE FT YENAQHQLNFLDVCISTNHSLHKYEFSIHRKDAITNVLIKPKSSISPNIAG FT IFKGFLARAYKICSEHNIQDEINFLIKIFTENGHSYKQYSDIAKNYKYSTN FT PSCSQKFDTKNLVILPWVPKLGLVLRREFRKVGVKTVFRFGNPLINILCRN FT KSKLPPNSHPGVYQLNCTCGAYYIGETRKKISTRIQEHKSNVTKANWETSG FT IVEHSQHCNGKINWENPKTLSIISNNYTRKIREAIEIQRVQCSKPKENVLN FT RDNGNLVMTHHWKPFFVKLKNVEI*" XX SQ Sequence 2690 BP; 1069 A; 442 C; 352 G; 827 T; 0 other; ataattacac atactattaa agatgtcact taaattttat ataacttctt cctacggtgg 60 gaagctctac aagaataccg ttcaattaca agtagaaaag accaagaacg ccgctagcaa 120 aaaccaatgg atatttttag agagatgtat aaaaaataat atcataccta aatcttttca 180 cattcattgt ccagttaaaa caaaaaatgg ttacaatcta atggtcgaat atagaaaaaa 240 actattaatt attgcaaaaa atgacgctaa aaaaagattt taccagagca aacgtagggt 300 catcgaacta tcaaatgcaa ttaaaaatat tatgaaacct gaagatatgt taaacataga 360 acgcataact aataaatcaa gagaaattaa atttgaacaa acaaaacaac accttatacg 420 taagttcaat tttcttatca attctaaaaa gaacgatagt ttgaaaaaat ttaatccttt 480 gaatattatt tcaataccac ctactaacct tataaaaaat ccaatattaa atctttcgaa 540 tatatcaatc tcacatgaac aggaaaattt attaagtctt ggtccgcaat ttattccaac 600 acaaaataaa ataccattta ttgatattat aacatcaact gaggaatgcg ctttaaatat 660 tgaaaaaaat gtgaacgtag aaagagcaga atttttaaga caaaatgtca gcaaaacatt 720 aaccaaatat ttaagagtga aaattgatag taatatttct aaaaatcaac gtctagctat 780 caatgaactc aaaaattcaa ccaacataaa gttatatcct tttgataaag gtaatgggtt 840 tgccatttta aattcggact cagcaataca aaaaataaaa gaacaacttg gcaattgcgc 900 aatatctgca gttgatccaa ctcagaagtt tctaagtctt attcaacgta cccttagtaa 960 attaaaaaaa gataaaaaat tagataaaaa aacatatggt aaaatatatc cttcagatgc 1020 aattccgccc cgtttgtatg gttgtattaa agcacacaaa agtgaaaaag gttatccaat 1080 gcgaataata gtctcaacaa tcggcacacc acctcataaa ctttcgcaac atcttgttga 1140 gatcatacaa ccaactttaa ataaaaatga atgccgtgtt aaaaattcat cttcatttgt 1200 atcaacagca aaagaatggt ttattaatga cgaagttcaa gtgtcatttg atgttattaa 1260 tttataccca tcagtaccac ttgataaagc aactattaca attattgata tgctaaacaa 1320 agatagagat gatctaatac aacgaactaa gttatcttta gtagacattc ataaactaat 1380 tgatctctgc attagtaagt gttatttttt atgggaagac aaaatatata ttttaaaaaa 1440 ctctggtcca ataggtctat ccataatggt cgtcttgtca gaagcatatc tacaacatat 1500 tgaatctaat gctattcaac aagctctaaa cgtcaatatt gcaccaaaaa cttataaacg 1560 ttacgttgat gatagccatg ctcgatttac taatattgat gacgcaaatt cttttttaac 1620 acttttaaat tcacaagata aatctattca atatacatgc gaatacgaaa atgctcaaca 1680 ccaattaaat tttttagatg tttgtatttc aaccaatcac tctcttcaca aatatgagtt 1740 ttccatacac cgtaaagatg caattacaaa cgtacttata aaaccgaaat catctatttc 1800 accaaatatc gctggtattt ttaaaggttt tttagctcga gcatacaaaa tttgttcaga 1860 acataacatt caggatgaaa ttaattttct tattaagatt tttactgaaa atggacattc 1920 ctataaacag tattcagata ttgctaaaaa ttataaatac tccacaaatc catcttgttc 1980 tcaaaaattt gataccaaaa atctagttat tttaccatgg gtcccaaagt taggccttgt 2040 tcttagaaga gaatttcgta aagtcggcgt taaaacagtc ttccgttttg gtaatccctt 2100 aatcaacata ctttgccgta acaagtctaa actgcctccg aatagtcatc caggagtgta 2160 ccaacttaat tgcacatgtg gtgcatatta tattggtgaa acaagaaaaa agatttccac 2220 acgaattcaa gaacataaaa gtaatgttac aaaagccaac tgggaaacat caggaatagt 2280 agaacactca caacattgta atggcaaaat aaactgggag aaccctaaaa cgctttctat 2340 tatttcaaac aactacacca gaaaaattcg tgaggctatt gaaatacaac gtgtacaatg 2400 ttcaaaacct aaagaaaacg ttttaaatcg cgacaatggc aatttggtga tgactcacca 2460 ttggaaacca ttttttgtaa aattaaaaaa tgttgaaatc tgacgttgca atgacatcat 2520 ttgtttacgt ttattataat tttttaaaat ggttttatta gtttgataat ggctctcact 2580 atatgagccg aaatatcact tatagagaaa agaaataaat agtatgatac cgtatatgat 2640 gttttaatgc ttataattac acatactatt aaagatgtca cttaaatttt 2690 // ID Gypsy-7_PPc-LTR repbase; DNA; INV; 286 BP. XX AC . XX DT 08-JUL-2010 (Rel. 15.07, Created) DT 08-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_PPc_; KW Gypsy-7_PPc-I; Gypsy-7_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-286 RA Jurka J.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1007-1007 (2010). XX DR [1] (Consensus) XX SQ Sequence 286 BP; 64 A; 74 C; 48 G; 96 T; 4 other; tgtwgtatag taatataatt ctatgtaaat tattaatcac aagtaattaa tgtgacccgt 60 tgatgctact acagtagtca gtacatttca tgccccctgg gctcccaccg cgatctgttc 120 ctactgtaag tattgcaatt ccccgctggc ctacttttcg tcagttagtt ctctccccct 180 cgttgcggct agttgtcgct gcmtattctc ctgtcawcta tctactctgc gattcttcaa 240 taaagccgat tctattggag ctgcmagtat tctctcaaga cacaca 286 // ID Sola1-N7_AAe repbase; DNA; INV; 1262 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A non-autonomous Sola1 DNA transposon family from Aedes aegypti. XX KW Sola; DNA transposon; Transposable Element; nonautonomous; KW Sola1-N7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1262 RA Kojima K.K. and Jurka J.; RT "DNA transposons from the yellow fever mosquito."; RL Repbase Reports 11(4), 1297-1297 (2011). XX DR [2] (Consensus) XX CC >93% identical to consensus. 4-bp TSDs. TIRs are 27 bp long. CC ~74% identical to Sola1-N4_AAe. XX SQ Sequence 1262 BP; 402 A; 236 C; 236 G; 387 T; 1 other; ctgcccatac tcgcaataca gtcccattag gaaaatcatc atgtcgagaa aaacgcttct 60 aaacataata caaaacgttt tcgtttcacc gttgctgaca gtagtaaatt gtggtggcat 120 tactcaacac attatcattg tagtattata aaaccttcaa gaactaaaat atagttaaag 180 tggttgtttt ccatatctta atctaagatt caatccaatg ggactcgtta tccgagtatt 240 tttcttgcca aacacaaata tacccgcata ccagtcccat tacgggggac tggcttacta 300 tgtttttacg caacttgaca caatcgtttt caagtttaca gtagatctca cgctaaatcc 360 caaggaatkt ttctaactgc aactgttagt actagtaaat ttggattaaa aatgtccttc 420 aaagttattg tactattcgg tggcgacagt gatcatgttg aagactctcc tggattttat 480 ttaactgcag aacatgttcc ggtcggccaa tctagaaagt taattgaatc caccacgggg 540 atgttggttt caagcaatgc aacaaagcaa ttttcggtca gcatgaagac ggcaaagcta 600 ttgaagattc cttagatata tgcggattgt cctgctgtat gccatgaacc accttacttc 660 ttggttttag aagtcgcatg tacctactta gggtgactgt ggaaactctg gctggccaaa 720 aaaatcaacc caatttcatg ccataacaag ttaagcctac cggagagtcc ggtgatcatt 780 ggccggagta caatagtagt gagagcagca gcattgactt aaagggaagc tattttgact 840 atttttggtt tagaggctcg caagcacaca tttcgagttg cggttctaga acatcgcagc 900 agaagtcatt ctgggaaaac tttaatatca cgtaagaata gatactttac atgaataaac 960 atggtttgag taattgaaat ttgtattatt tctcctttta ataaaaaaaa tacgatgcaa 1020 gattacaaaa cttataacag ggccaggttt gcttcaactt ctttgtaatt cactaatggg 1080 actattatgc gggtactttc aatatgggac gcacatgatt tggtattttt ttcgcatttt 1140 gtccatacaa aaacgcaatt ttaagcatga tttcataaag tacactaaaa ataaacacca 1200 ttcatgaagt taactaaaaa atggttaaaa tccatatggg actgttatgc gagtatgggc 1260 ag 1262 // ID BEL-10_DPu-I repbase; DNA; INV; 5607 BP. XX AC . XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE BEL-type retrotransposon from Daphnia (internal portion). XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-10_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-5607 RA Jurka J.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR [1] (Consensus) XX CC ~97% identical to consensus. XX FH Key Location/Qualifiers FT CDS 766..3120 FT /product="BEL-10_DPu-I_1p" FT /translation="MAEDIATFTIDQCRGRQKVLRQKITGCCTRMRKVITN FT KLSRREATRLLDEARTLLGDSGPINDRLLELLEEAEGEQQQESFLRYGGDV FT DAVADEVAAYISSREGDEASVPGWDPADPEVXAARQRAQDAAKVYQEAAKA FT AEEALKEMEAAKQSVQDLGEEKDLNGDREDDFKSIISVPPYKPVARVHAPD FT EWITNYCSGREKPFVYSGDHRHTSVRVDLEVYSGRALDWFEWIDLFHSLVH FT QTGKAPGEKLAILKRNVRGETADLVYGLGGGELAYKESLRRLKETCGSRAV FT IRAAHLQALDRVEPPTPATFRRFAEKVRTHLFNLTWIGETGHADLIERLTQ FT KLQPQDRLHWNDGQRGGLERRTMNEFGHWLCTRAAAYQNAYSIAAEQHQRS FT NNPARPPQTGNNPHHQKRHSRAHQASTNHXAGPARGAPAPTPREGTTPASK FT PPYCFKCEGEHRLADCQFFKNMPVAGRMTFIMVRGLCYCCFGVRHGAGNCS FT FKKECGKDGCKTFHHPLIHTDRGLPERTGTSHSARATSGTIAFGVIRLDAM FT NADGELVPINVMLDAGSNTTFIREGLVRSLRISGERQTLRIQGVADAASTH FT PNSEELFLQLKTAFGDMVTLRGSTLKTVTQPVPVYNWEQLRHRWTHLNDLP FT PLRSSGGRIDVLIGLNHTTLITPTDYRLGADDEPAAIKTRLGWTILGVLGS FT GSTTEALCHRAFASSDVHITAELVEQLRRFCDTESFGTEYQGAGMSTEDRR FT AVARLDAETKKLDVGYQAPILWKDGRLPGFQQDH" FT CDS 3056..5605 FT /product="BEL-10_DPu-I_2p" FT /translation="MWVTRPQFSGRMDDYLDSSKTTDEAMRRATAVNKALS FT SGDFXLNHWVTNDARLLEQLSIPDPKESGVNTVNLGADDAEMVLGVTWRPA FT TDVLGFRVRLAEINYTRAGLLSKVAGLFDPLGAAAPITVKAKIRLRHLGVK FT GLKWEDVVTGPDREWWESYFDTMQQLKTVEFARCLFPDEDRIVRTELDTFV FT DASEEACAAVCFIRNVYSDHRVIVRFIKAVTKLAPLKTVSVCKLELNAGLL FT GARLARFVESSLTRKIAARRFWTDSSTVRNWVRALSGDYQVYVSNRIGEIQ FT TLSDPSEWRFVPGVLNPADAATRSQLDDRAIPAWWLDGPPFSYQEETAWPR FT DLPWTLEKAEMRGARAHVSDVVSEKIVPFDWKTVKIAAGDVPTLIRLENDY FT LDLVRRCQRETYPDEISRLERGKVIRPTSTLQPLTPFLDADKVLRLGGRLS FT RANLPYEVLHPPILNGRHLLARAIIRAFHEQLHHSGTDMVLAQVRQHFWIT FT AGREAVKRARNECLACRRFRPKAALQMMADVHRARLGAHQPPFTYTSVDYF FT GPIDIAYHRGTAKRWGVLFTCLVTRAVYVDVAISLSANDFLLVLRRFVSIY FT RKPAEMFSDNGTNLTGAERLLREELERLKESSALETELKALGIRWWFQPAQ FT TPHFGGAHESLVRSVKRALYPVLDKELNGQRNPSEEILRTLLFEASGLLNS FT RPLTYVSSDADDIRPLTPNDFLNRAPIADLPAGDFSRALPRDHYRYVQRMA FT DRFWELWHGAFLQSMTSRRKWTRPARNLAVGDFVLDDWKDAPRGRWRTGKV FT ARTYPXKDELVRAVDVEFSTGILRRGANQLALLEPSSLSPEAGSSSGE" XX SQ Sequence 5607 BP; 1235 A; 1550 C; 1623 G; 1194 T; 5 other; tctggtcctt cgagtcggag ctgccggcat cgtcctgaac cacgccgttc agcatttgat 60 tcgacttcga ccggcgttgt ggcagacctg cccacgcctt gcagcatttg attcgacttc 120 gactggcgtt gtagcagacc tgcccacgcc ttgcagcatt tgattcgact tcgactggcg 180 ttgtagcaga cctgcccacg cctttcagca tttgattcga cttcgactgg cgttgtagca 240 gacctgccca cgccttncag catttgattc gacttcgact ggcgttgtgg cagacctgcc 300 cacgccttgc agcatttgat tcgacttcga ctggcgttgt agcagacctg cccacgcctt 360 tcagcatttg attcgacttc gactggcgtt gtagcagacc tgcccacgcc gttcagcgtt 420 tgatttgacc tcgactgcgc tgtggcgcct ttgacaaaga ttgcctgccg ttaagcagac 480 ccgaccaagc caacgagaac caagttggaa ggaactgacc agttccagct aagtggatct 540 ttgcccgtcg agcagagccg actactcgac gagaaccaag ttggggagtc cagctataga 600 cagacggtgt tccaggatcc taattttacc ctcatttgaa ctgactaaga ccctgccgtt 660 aagcaggcag cccagtagac gaagattctc catcgacgag cagagctgcc atccactacg 720 ttgctgttag ctctgatcct tcgctttctt ctagcctttg tcatcatggc ggaagatatt 780 gcaacattca ccatcgacca atgcaggggc cgccagaagg tactgcgcca gaagataacc 840 ggatgctgca cccgtatgcg gaaggttata accaacaagc tttcgcgccg tgaagccacg 900 cgcctcctgg atgaggctcg caccctcctt ggtgattccg gcccgattaa tgaccgttta 960 ttggagctgt tggaggaggc tgaaggagag caacaacaag aatcattcct gcgttacgga 1020 ggagatgttg acgcagtggc ggacgaggtg gctgcttata tctctagccg agagggggat 1080 gaagcttccg tccctggatg ggatcctgca gatccggagg tantcgccgc ccgacaacga 1140 gcccaagacg ccgcgaaggt ttaccaggag gctgccaaag cagccgaaga agcattgaag 1200 gagatggagg cagcaaagca gtccgttcag gacctagggg aggagaagga cttaaatgga 1260 gatcgagagg acgattttaa gtcgattatc tccgtacccc cgtacaaacc ggtagcacga 1320 gttcatgctc cggatgaatg gatcaccaac tattgcagtg gaagagagaa gcccttcgtt 1380 tattccggtg accatcgcca cacatcggtt cgagttgatt tggaggttta ctccgggcgc 1440 gctctggatt ggtttgagtg gatcgacctc tttcactcac tcgtacacca gaccggcaag 1500 gcacctggag agaagttggc gattttgaag aggaacgtac ggggagagac ggccgattta 1560 gtttacggat taggtggagg cgagctggcc tacaaggagt ctctccgacg tttgaaggag 1620 acatgtggca gccgtgcagt tatccgtgca gcacaccttc aagccttgga tcgagtcgag 1680 cctccgacgc ccgctacatt ccgtagattt gccgaaaagg tgaggactca cctcttcaat 1740 ctcacctgga tcggggagac cggccatgct gatttgatcg agcggctcac gcagaaactt 1800 cagccccagg atcggttgca ctggaatgac ggccagcgtg gaggattgga gcggcgcacc 1860 atgaacgagt tcggccactg gctatgtacg cgagcggctg cctaccagaa tgcttactcc 1920 attgccgccg aacagcacca gaggagcaac aatccagcgc ggccacccca gacgggaaac 1980 aacccgcatc atcagaagag gcattcacgg gcccaccagg cctccaccaa tcacngggcc 2040 ggaccggctc gaggggctcc tgcccctacg cctcgtgagg gaacgacacc agcatccaag 2100 cctccatatt gttttaaatg tgagggtgag catcggttgg ctgattgcca attcttcaag 2160 aatatgcccg ttgctggacg aatgaccttc atcatggtgc gtggcctgtg ttactgttgc 2220 ttcggcgtac gacatggagc tggcaactgt agcttcaaga aagaatgcgg caaggacgga 2280 tgcaagacgt tccaccaccc tctgatccac actgaccggg gacttccgga gagaacgggc 2340 acctcacatt cggccagagc aacgtccggt accatcgcct ttggtgtgat ccgcctcgac 2400 gccatgaatg ctgacgggga gctggtgccc atcaatgtca tgttggacgc cgggagcaac 2460 acaactttca tccgtgaggg attggttcgt tcgctacgga tttcaggcga gcgacagaca 2520 cttcggatcc agggagtagc tgatgcggct tcaacgcatc cgaattcaga agaattgttt 2580 ctccagctga aaacggcctt cggtgacatg gtgaccctta ggggctcgac cttaaagacc 2640 gtgacacagc cagttcccgt gtacaattgg gagcagcttc gccaccgttg gacccacctt 2700 aacgacttgc cgccgctacg cagttccggc ggaagaatcg acgtcctgat tggtttgaac 2760 catacaacgc tcatcacacc gacagattat cgacttggag ccgacgacga gccagcagct 2820 atcaagacca ggcttggatg gacaatcctt ggtgtgctgg gatctggatc gacgacggag 2880 gccctctgcc accgtgcctt tgcctcatca gacgtacaca tcacagcgga gctggttgaa 2940 caactacgac ggttctgcga caccgaatca ttcggtaccg agtaccaggg tgccggaatg 3000 tccaccgagg atcgacgggc agtcgcgagg ctggacgctg agaccaagaa gctggatgtg 3060 ggttaccagg ccccaattct ctggaaggat ggacgattac ctggattcca gcaagaccac 3120 tgacgaggcg atgcgccggg cgacagcagt caacaaggct ctgagcagtg gcgacttcca 3180 nctgaaccat tgggtgacca acgacgcacg cctgctggag cagctttcga ttccggaccc 3240 aaaggagagc ggcgtcaaca cggtcaacct tggcgctgac gacgccgaga tggtccttgg 3300 cgtcacctgg aggcccgcca ctgatgtttt ggggtttcgt gtccggttgg cagagatcaa 3360 ctacacccgt gccgggctcc tttcgaaagt tgccggatta ttcgatcccc ttggcgccgc 3420 tgctccaatc accgttaagg ccaagatacg actacgccac cttggcgtga aaggcctgaa 3480 gtgggaggat gtggtgactg gtccggaccg tgaatggtgg gagagttatt tcgatacaat 3540 gcagcagctg aagacggtgg aatttgcccg ctgcctgttt ccggatgaag accgaatcgt 3600 ccggaccgaa ctggacacgt tcgtggacgc ctctgaggag gcctgcgccg ccgtatgttt 3660 tatccgcaat gtttacagtg accaccgagt gatcgtgcgc ttcatcaagg cagtgacaaa 3720 gttggctccg ctgaaaacag tgtccgtgtg caagttggag ctgaacgctg gacttcttgg 3780 agcgcgtttg gcccgttttg tggaatcttc attgacccgg aagatcgcag cgcgtcgttt 3840 ctggaccgat agcagtacgg tgcgcaattg ggttcgagcg ctatctggtg actaccaggt 3900 ctacgtgagc aaccgtatcg gtgaaatcca gaccctatcg gacccttcgg agtggcgttt 3960 cgtcccggga gtcctaaacc cagcggatgc ggcgacccgc tctcaactgg atgaccgggc 4020 aatcccagcg tggtggttgg acggaccacc cttctcgtat caggaggaga cggcctggcc 4080 gcgagacctc ccctggacct tggagaaggc ggagatgcgt ggggcccgcg cccatgtgag 4140 tgacgtcgtg tcggagaaaa tagtgccatt cgactggaag acggtgaaga tagctgccgg 4200 cgacgtgccc actctcatca gattagagaa tgactatctc gatctcgtcc gacggtgcca 4260 gagggagacc tatcccgatg agataagcag actggagcgt ggaaaagtga ttcgccctac 4320 atcaaccttg cagcccttga cccccttttt ggatgctgat aaggtgttgc ggcttggagg 4380 ccgtctcagc agagcgaatt taccgtacga ggtcctccac ccgccgattc tgaatggccg 4440 acatctgtta gcgcgagcaa tcatccgcgc ctttcatgag cagcttcatc actccggaac 4500 cgacatggtg cttgcccagg tgcgacaaca tttctggatc acagcaggaa gagaggcggt 4560 caagagagct cgcaacgagt gtctggcgtg tcggcgtttc cgaccaaaag cagcgctaca 4620 gatgatggcg gacgttcaca gggcccggct aggtgctcac cagcctccgt tcacctacac 4680 ctccgtcgac tattttggcc ccattgacat cgcctaccac agagggacgg ccaagcgatg 4740 gggcgtcctg tttacgtgtt tagtgacgcg tgccgtgtac gtcgacgtgg cgatctctct 4800 atcagccaac gatttcttgc tggttttgcg ccgcttcgtt tctatctacc gcaagccagc 4860 tgagatgttc tccgacaacg gaactaattt gaccggggcg gagcgactgc tacgtgaaga 4920 gctggagaga ctaaaggaga gttcggcgtt ggagacggag ctcaaagctc tcgggattcg 4980 atggtggttc cagccagccc agacacccca ctttggtgga gcccacgaat cgctggtgcg 5040 ttcggtcaag cgagctctct accccgtgct ggacaaggag ttgaacggac agcgtaatcc 5100 gagcgaggag atccttcgga cgctcctgtt tgaggcgtcg ggactgctca attcccgccc 5160 actgacttat gtcagctcag atgctgacga tattcggccc ttgacgccga acgatttttt 5220 gaatcgcgcg ccaattgcgg atctaccggc gggtgatttc tctcgtgcct tgccacggga 5280 ccactaccgg tacgttcaac ggatggcgga ccgcttctgg gagctgtggc atggagcatt 5340 cctgcagtcc atgacctcgc gacgcaaatg gacaagacca gcccggaact tagcggttgg 5400 cgatttcgtg ctcgacgatt ggaaggatgc accgcgagga cgctggcgga ccggaaaagt 5460 tgcgagaacg tatccgngaa aggatgaact agttcgagcg gtagacgtcg agttttctac 5520 agggatattg cgccgtggag ctaatcaact cgccttattg gaaccaagct cactctctcc 5580 ggaagccgga tccagttcgg gggagaa 5607 // ID BEL-8_AA-LTR repbase; DNA; INV; 476 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-8_AA_; KW BEL-8_AA-I; BEL-8_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-476 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 866-866 (2011). XX DR [2] (Consensus) XX SQ Sequence 476 BP; 160 A; 71 C; 101 G; 144 T; 0 other; tgttcaccgc aacagcccct cgtaaaggac acccctgcca agagccattg tgagccattt 60 ggcccaagtg tggattgacg tttgagatga tgacgggcaa gctgataagc gatatgtcat 120 agattgataa caacgatgat aagcaaagtt gattggtcgg ctattgaaaa ttgaaattgt 180 tttggataaa ttacggttta ttgcgaatat ttatatttac actgtaagtt gaattacata 240 tatcttatga actaggtcaa taaaagtgtg aatttacaga ttaaggaatt aaaatacgtt 300 tgaacaagtt gtgaattgga agagtagttg tcggagactg aattgtaagt acaattaaag 360 taaagctggt ttgtaaactg atcctaaata aaaaatactt ttagctttaa gctgcttaac 420 acatatatcg ggacgtgttt tctgctgaaa gaactccgga atttccaatc gcaaca 476 // ID Gypsy-4_CQ-LTR repbase; DNA; INV; 650 BP. XX AC AAWU01033212; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_CQ_; KW Gypsy-4_CQ-I; Gypsy-4_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 388-388 (2011). XX DR Genome; AAWU01033212; Positions 16919 16270. XX SQ Sequence 650 BP; 272 A; 71 C; 148 G; 159 T; 0 other; tgacccactt aagtggcacg cgcagaaagg tgcccgacga gaaacaaaca aataacaata 60 tctgtgaaaa aggacaggca cgactggaaa ataaataaag attacattat ctatgggtta 120 agtcataaaa ggactgcgac taatttctga gaaaaatctg agaattagag cgcggtggaa 180 agaaatagaa catttatatt aggattttaa gagtgagaga aataaaatag gccaggatat 240 tatgtataac ttggaaggat ttttagagtg gggggaaaaa taaataaaat aagaaaaaaa 300 gagagttata tattgtagaa ttctgggaat taaggaaact tttcattata tcaagaaaaa 360 tacaggttgg agtgggggtt gtaaatgaga caaaaatata ttatataagt tagaggaaga 420 atttttggtg tgaaaccgaa aaggacggat atattatgca acatcaaaaa ctagagtggg 480 ggtaaaaata taagatatta aataattaag taaataacta gatttagtat aaaaacccaa 540 gctgaaatca aataaagtcg gtcagtaaat gagacagtga aagagagtga aagtgtttga 600 tccgaaatat ccagtcgaag tgccttggac ttagccggct gcacgcttca 650 // ID RP1_GL repbase; DNA; INV; 830 BP. XX AC L11331; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Giardia lamblia repetitive fragment showing homology with 3' DE non-coding region of a surface antigen gene. XX KW RP1_GL; Repetitive element. XX OS Giardia intestinalis OC Eukaryota; Diplomonadida; Hexamitidae; Giardiinae; Giardia. XX RN [1] RP 1-830 RA Ey L.P., Khanna K., Andrews H.R., Manning A.P. and Mayrhofer G.; RT "Distinct genetic groups of Giardia intestinalis distinguished by RT restriction fragment length polymorphisms."; RL J. Gen. Microbiol 138(12), 2629-2637 (1992). XX RN [2] RP 1-830 RA Khanna K., Mayrhofer G. and Ey L.P.; RT "A 0.8-kb repetitive fragment of Giardia intestinalis DNA showing RT homology with the 3' non-coding region of a surface antigen RT gene."; RL Unpublished (1993). XX DR GenBank; L11331; Positions 1 830. XX SQ Sequence 830 BP; 138 A; 219 C; 266 G; 207 T; 0 other; gatccattga attgtgttgc tgtctgagtt ccaatagttt ttgtttccgc tcttctgtag 60 aaaggttaag atgaaaggat gtcataaggg catctctctg ctgcaatatg atcaaggagt 120 caaaggggct gccctccaca gctgactccc gaagctcgat tgatctcacg ggcaattctt 180 attcaatttc tggccgtccg gagatgaggg ttggttttgg ataacataac gagagagtcc 240 tctccagtga tatatccaga gcctagcgcc ttccccggtc gctccgcagg gggtgtgctc 300 agaccgaatg cgtacttcgg tcctttcggg gttcaggcct tcttacggac tctctggtcc 360 accaatggcg cctcttcttg gtcttccttc gtggcgggtg aagctccggt cgctgtcttt 420 tttagatgcc ccgtctgtcg ggggctgcca tcgggctctc cgctggctgg gccctcggtg 480 tccgggcagc tgtggctcga gtcagcgctc ccacagtcca gtcacctgtg tctcagtggg 540 ccggctggtg ggtggggcgg gggtatgggg cggtgaggct ggaggggctc tggtgggacc 600 aggtggtggg tgttcgtatg tgcccctgag ctcctgatga gtggggtgat cgcagctgtg 660 gtgactgggg gctgtccatc tgtgactcac ccctctgtgt gctcgtggtc tgtgagtgcc 720 tccacccaga caacgcctca ggagccgtgc acaccgggac agagaggatg accaaggggg 780 tcaagcagcc ccgctacgag agatgagagt gcacagcaac cagcctgatc 830 // ID Copia-4_AA-LTR repbase; DNA; INV; 173 BP. XX AC supercont1.10; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_AA_; KW Copia-4_AA-I; Copia-4_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-173 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.10; Positions 4191044 4190872. XX SQ Sequence 173 BP; 44 A; 35 C; 28 G; 66 T; 0 other; tgttgtgaaa cagacgacag cgtaattact caagtttgtt atgttatatt ttagagtgta 60 agccccaata ttaaaccgtg ccactctttg ttatgatgct cttcattttc tcttttgtcg 120 tctcattctt aataaagcca tcactgcgtt aacagcaaac tgtgttttcg tca 173 // ID Gypsy-29_DWil-I repbase; DNA; INV; 5762 BP. XX AC scaffold_181154; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_DWil_; KW Gypsy-29_DWil-LTR; Gypsy-29_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5762 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181154; Positions 1355984 1350223. XX CC Positions [1959-2420] - Reverse transcriptase CC Positions [3664-4125] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 113..1228 FT /product="Gypsy-29_DWil-I_1p" FT /translation="MEAEMDNLRAALLQQEGELNLLRQQNQQQQQQLQQQQ FT QQQQQQPFQWLSNKDIIQQFRQLRQLDDQHDVLAFIKSVEFLMTLCQGNAL FT LIQFGTSIVANEKVSGTAANFIRQLGMEPSWDQMKTKLMEQMRPRMTYEDV FT FDRCRFIKVSSLRDLFQEFEKAKCEINKIYMFDESKPAMYENDKVDRDLLN FT MLMNKIDTPFRIHVDQGISMHALQTKYSNIKALDDPRAILQRYRKNSNNTY FT NKQGNTNANTNISTTHKNATQNNQFASNNNNTKQNQGGQPNTNNNASPPRS FT DQVNQQSYNRSQPTQNPNFKPNYNNKGNSSRQTRMSHMSVDTQKNDPTPME FT IGTLEEKQAEEEEEEQVNFLIPCLEHPYP" FT CDS 1263..3632 FT /product="Gypsy-29_DWil-I_3p" FT /translation="MLSRHWCDDKHLKSKILEFQYPKTELTKPCSYKTLNG FT INVVKYTVTTPLPKEIPYSGTLQWNLIDFSNKNFSAIIGQNFLKAFNAQIN FT NTERYVQLLDRKFYFLDYEYPEAMHNACALQPVNKEEIRDRFNLAHLNDEE FT RRAVETLLAESDDLFFQEGDVLSATNQIFHEIITTVDRPLYSKIYRYPQIH FT EKEINRQIKEMLEQNIIKKSNSPYNSPLWIVEKKLDNSGSKKFRIVIDYRK FT LNEFTVDDRFPIPNLNSLLDKLGRSQYFTTLDLAKGFHQILVREEDRPKTA FT FSTPSGHYEFVRMPFGLKNAPSTFQRLMNEVLKDHINDNCVVYMDDILIFS FT TSLQEHMTTLRKIFRTLKEANLKIQVDKCDFLKKETQFLGHILTTHGIKPN FT PDKVHIIQNLKLPKTAKQIKSFLGMTGFYRKFVKDYAKIAFPMSRYLKKNE FT TINTNDPSYIAAFEKLKTLVTNSPVLRYPNFSKKFTVTTDASNFAVGAVLS FT QESHPIAYASRTLNQHECNYSTIEKELLAIVWAVKYFRPYVYGREFDLESD FT HQPLKWLMAKYTGKDISPRLQRWLINLGEYNFHIEYIKGKNNNIADFLSRI FT KEDEINLMEVEPTDEEDNKSLTETVHSKEEDQGFNMSILETAVNRFKTQII FT FTEKKPNTIVQVFGRKRVYISKEDLENNQAVNILRREITAGKIGVFSHLSD FT HEFYQFQKILTKEYTSNPKVRFVKCTRFATDIESEDELHTQIALFHKNESG FT HCGIVATYQKLKLKIYHPNLKTHIHRIINNCERLQWWEI" FT CDS 4358..5761 FT /product="Gypsy-29_DWil-I_2p" FT /translation="MCLLPTLSTAMINITPIQAETGFISLGESTVELVSEF FT KMVLHIISPEEILELTNIIQNNTKILVKKDRHRLDTEIETIKIKIRSITPN FT SLARQKRGLINAIGKAQKWLTGLMDDDDRETIMNHLLNNQENQNNIINNIN FT KQVNINNDLQNSINTLKEVILDDREKITKTLNNIETYNTMLRTDIMYHDQL FT LKLKFLHDKVDTILDTIASAKYNLFHPSILTNKEFILYNIDFYKIKLIRMG FT VMTYDTNIIIAIKIPTNIITTTVFLITPLTNQKNKRIDMSPERVIKIENKT FT YTYEEQKTINELKLSSNCIIKRNCKMIEDVEFKTKLINEGTLLISNAQDNE FT IQYNCNNTTNSEILRGNFLIEFTNCTVRIDETYYKNNQEIYIQSFANEEEE FT ENSFNFTEKQTFKDIVLKEIENTKTIKEIKIQTIVSNSFGIVSITLVLLFI FT GIYSYNKCKRGIQENPSSKGGGV" XX SQ Sequence 5762 BP; 2339 A; 1140 C; 957 G; 1326 T; 0 other; acgaatataa ttggcgcagt cgaaatcgga caaacacgtt cagaattaac caaagttaat 60 ctgcgagaaa tttttttttt tttttttttt ttttttttta atttaaacca caatggaagc 120 agagatggat aatttaagag cagcattgct tcaacaagaa ggagagttga acctattaag 180 gcaacaaaat caacaacaac aacaacaact gcaacaacaa cagcaacaac aacaacaaca 240 accattccag tggctgtcaa acaaagatat catccagcag tttcgccagc taaggcagct 300 cgacgaccaa catgatgtat tggcttttat aaaatctgtg gagttcctca tgacattatg 360 ccagggaaac gcattgctga tccagtttgg cactagcatc gtcgctaacg aaaaggtgtc 420 gggaacagca gcaaatttca tcagacaatt ggggatggag cctagctggg accaaatgaa 480 gaccaaactg atggagcaaa tgcgacccag gatgacctac gaggacgtat tcgatagatg 540 cagattcatc aaagtgagta gtttaagaga tttgttccag gagtttgaga aagcaaagtg 600 cgaaataaat aaaatatata tgttcgacga atccaaacca gccatgtatg aaaatgataa 660 agtagataga gatttgttaa acatgttaat gaataaaata gacactccat tcaggattca 720 tgtcgatcag ggcatatcta tgcatgcttt acaaacgaaa tattcaaata ttaaagcact 780 ggacgatccc agagctatct tacagaggta taggaaaaat tcgaacaata cctataacaa 840 acagggtaac actaacgcta acaccaacat tagcaccact cataaaaacg caacccagaa 900 taatcagttt gctagtaata ataacaatac caaacagaat caaggtggcc aaccaaacac 960 caataataac gccagcccac ctcggagtga ccaagtcaac caacagtcgt ataacaggtc 1020 acaacccacc caaaatccaa attttaaacc taactataac aacaagggta acagctcaag 1080 acagacaaga atgtctcata tgagcgttga cactcaaaaa aatgatccca cacccatgga 1140 gattgggact ttagaagaaa aacaggcaga agaagaagag gaagaacagg taaatttttt 1200 gattccttgc ctagagcatc cttacccata atagtgtgga caataggaaa agaaaaaatt 1260 aaatgcttag tcgacactgg tgcgacgaca agcacttaaa atcaaaaatt ctcgaatttc 1320 aataccccaa aacagaactc accaaaccat gctcttataa aactcttaac ggcattaacg 1380 tcgtgaaata caccgtaaca acaccacttc caaaagagat accctactcg gggacactac 1440 agtggaatct tatagatttc tcaaacaaaa atttctcggc aataataggc caaaatttct 1500 taaaagcctt caatgctcaa atcaacaaca cagagcgata tgtacaatta ctcgatagaa 1560 aattttattt tttggattac gaatacccag aagccatgca caatgcttgt gctctgcaac 1620 ccgtgaacaa agaagaaata cgagatcgct tcaacctcgc tcatttaaat gacgaggaac 1680 gtcgagccgt cgaaacccta ttagcggaat ctgacgactt attttttcaa gagggggacg 1740 ttttgtcagc aacaaaccag attttccacg aaataattac cacagtggat aggcctttat 1800 attctaaaat atatagatat ccccaaatac atgaaaaaga aataaataga cagataaagg 1860 aaatgcttga gcagaacata ataaaaaaga gtaactcacc ctacaacagt cccttatgga 1920 tcgtcgagaa aaaactcgat aattcagggt caaaaaagtt cagaatcgtc atcgattacc 1980 gtaaattaaa cgaattcacg gtcgatgaca gatttccaat ccctaatcta aactctctat 2040 tggataaatt aggtagatcc caatatttca caaccctcga tctggcgaag ggttttcacc 2100 aaattctcgt aagagaggaa gacagaccaa aaacggcatt ttcaacaccg tcaggtcact 2160 atgagtttgt cagaatgccg ttcggactaa aaaatgctcc atcgacattt cagaggctca 2220 tgaatgaagt cttgaaagac catattaatg acaactgtgt cgtctacatg gacgacatat 2280 taatttttag cacttccctt caggaacata tgaccaccct aaggaagata tttagaacct 2340 tgaaggaggc aaatttgaaa attcaagtag ataagtgtga cttccttaag aaagaaacac 2400 aatttcttgg ccacatcctg accactcacg gtattaaacc gaacccagac aaagtacata 2460 ttatacaaaa tttaaaatta cctaaaacag caaaacagat taaatcattc ctgggaatga 2520 caggatttta cagaaagttc gttaaggatt atgcaaaaat agcattccct atgtcaaggt 2580 atctgaagaa aaacgagacg atcaacacta atgacccaag ctatatagct gcattcgaaa 2640 agttgaagac actagttaca aacagtccag tcctacgata cccaaatttt tccaagaaat 2700 tcactgtcac tacagacgcc agtaattttg cggtgggagc agtgctatcg caagagagcc 2760 acccaatcgc atatgcatcc cgaacattaa atcaacatga atgcaactac tcgactatag 2820 agaaagagct tctagccata gtgtgggccg taaagtactt tcgaccgtat gtttatggta 2880 gagaattcga cctcgagagc gatcaccaac ccttaaaatg gctgatggct aaatatacgg 2940 gcaaagatat aagcccaaga ctgcagagat ggctcattaa tctaggggag tacaatttcc 3000 atatagaata cataaaagga aaaaataata atattgccga ttttttgagt aggattaaag 3060 aagacgaaat taacctaatg gaggtcgaac caacagacga agaagacaac aaatccctga 3120 ctgaaacggt tcactctaaa gaagaggatc agggctttaa catgtccata ctagaaacgg 3180 cagtcaacag attcaaaaca caaataattt tcacagaaaa gaaacccaac accatagtac 3240 aagtgtttgg cagaaagaga gtttatatta gcaaagaaga cttggaaaac aaccaagcag 3300 taaacatcct cagacgagaa ataacagcag gaaagatagg agttttttca cacctaagcg 3360 accatgaatt ttaccaattt caaaaaatat tgacaaagga atacacttcc aatccaaaag 3420 taaggttcgt aaaatgtacc agattcgcta cagacataga gtctgaagat gagttgcaca 3480 cccaaatagc tctgtttcat aaaaacgaga gtggccattg cgggatagtt gctacctatc 3540 agaaacttaa acttaaaatt taccatccca atctaaaaac acatatccac agaataataa 3600 ataattgtga acgtttgcag tggtgggaaa tatgatagga aacccattaa aaacaacttc 3660 catctaacag aaacaccaaa aacctgcaac gagataatac atgtggacac atatgtgaat 3720 tcaaaacaat ctttcataat attcattgat aaattctcaa aacacgccac ttgcttgcca 3780 ctgaccgata gaaatagtat cacaatagta gaacacattc aacagttctt gtccattaaa 3840 gggaaagtac agaaatttgt cttcgacaac gaattcaata gcttgaacgt cagggaattc 3900 ctggaaaaag aaggcataga ataccacgta acaaagccaa atagtcatac tggcaatagc 3960 gatgtcgaaa gacttaataa tacattgaca gagaaaattc gtacactaaa tatagaagag 4020 aaaagaccta taactgaaca aatggccaga gctgtatact tttacaacaa tacacaccac 4080 accaccacaa agtctacacc atttgaagta caaaaccata aagtcgatca caaagaatta 4140 tatgacagaa tgtcagcaca aaaagctgaa aaaatatcaa aactaaacaa aaacagggaa 4200 acttacatag aaaccagaac agaaggattt ataaaaaatt acaaaaactt aagacacaag 4260 gaagaaccca aatttaggaa aacaaaatta caaaacatcc atacaacaaa cattaaaaga 4320 cctacaaaat tttcagataa cgttgacact cctcatcatg tgcctgttgc caacactgtc 4380 gacggcaatg atcaacataa ccccgatcca ggctgaaacc ggattcatat cgctggggga 4440 aagtacggtg gagctggtta gcgagttcaa aatggtccta catataataa gtccagaaga 4500 aatattagaa ttaaccaaca taatccagaa taatacgaaa attctggtta aaaaagacag 4560 gcataggcta gatacagaga tagaaactat aaaaatcaaa ataagatcca taaccccaaa 4620 ttcgctagca agacaaaaac gcggccttat caacgcaata ggaaaagctc agaagtggct 4680 caccggtctc atggatgatg acgacaggga aaccattatg aatcatctcc ttaataacca 4740 agaaaaccaa aataacataa ttaacaatat taacaaacaa gtaaacataa ataacgactt 4800 acaaaattct attaacacac ttaaagaagt aatattggat gacagggaaa aaataaccaa 4860 aacccttaac aacattgaga catataatac catgctcaga acagacataa tgtatcatga 4920 ccagctgttg aagttgaaat ttttgcatga taaagtagac accatcttgg acacaatagc 4980 gtcagccaaa tataacctct ttcacccctc aatactaacg aacaaagagt ttatactcta 5040 taatatagat ttttacaaaa ttaaattaat tcgcatggga gtcatgacct atgacactaa 5100 tataatcata gcaatcaaaa taccaactaa cataataact accacagtat tcctaataac 5160 cccactaact aatcagaaga acaaaagaat cgacatgagt cccgaaagag taataaaaat 5220 agagaacaaa acttatacat acgaagaaca gaaaacaata aatgaactca agcttagctc 5280 taactgtata attaaaagaa attgtaaaat gatagaagat gtagaattta agacaaaatt 5340 aataaacgaa ggtacattgt taataagtaa tgcacaagat aatgaaattc aatacaattg 5400 caacaatacc accaatagcg aaatattaag aggaaacttc ctcatagaat tcacaaattg 5460 cacagtccgc atagacgaaa catactacaa aaataaccaa gaaatttata tacaatcatt 5520 tgccaatgaa gaagaggagg aaaactcatt taactttacc gaaaagcaga cgtttaaaga 5580 tatcgtatta aaagaaatag aaaacacaaa aacaattaaa gaaataaaaa tacaaacaat 5640 agtttccaac agttttggca tagtatcaat tactcttgta ctcctcttca taggaatata 5700 ttcctataac aaatgtaaaa gaggaatcca ggagaatccc tcatcaaagg ggggaggagt 5760 ta 5762 // ID hATm-38_HM repbase; DNA; INV; 3261 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-38_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3261 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(12), 1932-1932 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(657..938,1095..1442,1414..2964) FT /product="hATm-38_HM_1p" FT /translation="MGKRKITKIKKSNYNVNLNPRRRFTFSEVRFLFFFNI FT NFQQMEQFYDIISMLLQKKVKGSSLPKKIFHAKQRVGVGIWFALTKISVMK FT IWRYAAVLRLTSLQLLVIVISIFQKAMKISFQHTKLKRQKTSKSGCSEIAI FT EFDRDKLLLATTPVSVRCGISIRSQTMFLGAVIKALGGNVHDLNISRSSIA FT RSRFKYIEDLGKFFLHSKYLVNFFYIQNIYNKIKNNDYKEKKNLFNLGATI FT RSSHYKEMLGKNLISHFDSKLVKEIEEKLKITVKKERVAISVTSPEFNTKD FT DRLLGIIAVESNKGKDHAIVIQNILEYFEIADNIIGVCTDTTSSNTGRLNG FT AIIIISRILNKCLFWFMCRHHIYELHITHAVQAITKVKSKGPSKSMYLAFQ FT RHWETSHESVNSRTPELKKFDWNKLEIGSRLYILALAAKDYAKYSLENKIF FT PRNDYNHLVKYLAFYLGVESDKLNEFKIHQPGACHEARFMADALYILTLEM FT TSTIITFLPDENKNEIETCAFIVAVCYAPWYLKSRSAQHAVLNDFNAFKAA FT YVIKDEFNADVGNALIKSFHKHSWYLSPAVVVFSLVDSHLEMFVKYEILKS FT LISFPVPELEKINMEKPNPVLILPNSKLSELVTADSWLIFIRLGIVNQVKD FT WVHSHPNFSKTESYSVFENFVKGLSVTNDCAERNIGLINDFFKYSQKEEQR FT QNILLIAREERKKVTKKVTQNELMKI*" XX SQ Sequence 3261 BP; 1195 A; 453 C; 505 G; 1105 T; 3 other; gggtgtccca aaaatatgaa aaaaaaaatt gtacacatca aattctttgt ctatatactt 60 atagaagacc gctagtattg aaaaaagatt tataagacca ctctaagtgc ttactgtaga 120 ttaaaaatcg gatgttatta agtgttctga gttttttaac ttaaatattt cggctgcatc 180 taaatgaaaa cttactcttt taacaattct aatagctgtt gtttttcata atcttttttt 240 ttaaaaactg tattaggata tgggtggtaa ctgattaact agtagtatat taacaaactt 300 ttcttgatca ctgttccaaa tattaggaaa ttttaaaaac aatatatatt aatgtgtttt 360 taaaacatcg atattttttt ttttttkgac taacaggtat ttttaaatta ttatatacca 420 atataccaac tttatcttat ttgactttaa aatatcgact tttaaatata taaactttga 480 aagtggcaat cttgattacc tttttagaaa acataaacaa acttaaacca atcatttttt 540 tctgataaaa aaacaactaa tcaaattttw actmattttg acaaaagaat cttttgcgta 600 atttagtttt taaaaaatat aaaaaaataa atacttataa caaattctta tcacaaatgg 660 ggaaaagaaa gatcactaaa ataaaaaagt caaactacaa tgttaattta aacccacgac 720 gacgctttac cttctcagaa gtccgatttc tgtttttctt caacatcaac ttccaacaaa 780 tggaacagtt ctacgatatt atcagtatgt tattacagaa aaaagtaaaa ggttcaagtc 840 tacccaagaa aatttttcat gcaaaacaaa gagtgggagt agggatatgg tttgccttga 900 caaagatttc tgttatgaaa atatggagat atgctgccta agaaaaattg ttaatatgaa 960 accttttttt tttgtttttt gtttttggca tcaggtaaat tatatcttaa tcttttattc 1020 caaaatattt ttatgaaaaa aaagataaaa cttttattca ctttattaag tcattaacac 1080 tttttgttac ttaggttctg cgactcacca gtcttcagtt gctagtgata gtgatttcga 1140 ttttccagaa agcaatgaaa atcagcttcc aacacacaaa gttaaagcgt caaaaaacat 1200 ctaagtctgg ctgttcagaa atagcaatag agttcgacag agacaagctt ctattagcta 1260 caacacctgt tagtgttaga tgcggaatta gcattcgctc acaaacaatg tttctaggag 1320 ctgttattaa agctttaggg ggaaatgttc atgatttaaa tatatctaga agctcaatag 1380 ctagaagtag attcaagtat attgaagatt taggtaaatt ttttttacat tcaaaatatt 1440 tataataaaa taaaaaataa tgattataaa gagaaaaaaa atctttttaa tctaggagcc 1500 accatcagat caagccatta taaagaaatg ttgggaaaga atttgatctc gcactttgac 1560 agtaagttgg taaaagaaat agaggaaaag ttaaaaatta cagttaaaaa ggaaagggtt 1620 gcgatatctg ttaccagtcc tgaatttaat acaaaagatg atcgtctact tggaattata 1680 gctgtagaga gcaataaagg aaaagatcac gctattgtta ttcaaaacat tttagagtat 1740 tttgaaatag ctgataatat aattggagtc tgtactgata ctacatcaag taatacaggt 1800 agactaaatg gagctataat aattattagt agaattctta acaagtgttt gttttggttt 1860 atgtgtcgtc accacatata tgaactccat ataactcatg cagtacaggc tatcacaaaa 1920 gttaaatcta agggtccaag caagtctatg tacttagctt tccaaagaca ctgggaaact 1980 tctcatgaat ctgtaaactc aagaactcca gaactgaaaa aatttgattg gaacaagctg 2040 gaaataggta gcaggcttta catcttggcg ctagcagcta aagattatgc aaagtattcg 2100 ttggagaata aaatttttcc aagaaatgac tataatcacc ttgtaaaata tttggctttt 2160 tatcttggcg tcgaatctga taagcttaat gaatttaaaa ttcatcaacc tggagcttgt 2220 cacgaagcaa gattcatggc tgatgctttg tatattctta cactagaaat gacatccact 2280 attataacct ttttacctga tgaaaataaa aatgaaattg aaacatgtgc ctttattgtt 2340 gctgtttgtt atgctccatg gtacctcaag tctagaagcg ctcaacatgc tgtacttaat 2400 gattttaatg catttaaagc agcatatgtt atcaaggatg aatttaatgc cgatgttggc 2460 aatgctttga ttaaaagttt tcataaacat tcgtggtatc tttctccagc agtagttgtt 2520 ttttcactag ttgactctca cttggaaatg tttgtcaagt atgagatttt gaagtcatta 2580 atttcctttc cagttccaga gctagaaaaa ataaatatgg aaaaacctaa tcctgtttta 2640 atcttaccaa acagcaagct ttcagagctt gttactgctg atagctggct tatatttata 2700 cgcctaggaa tcgtaaatca agtcaaagat tgggttcatt ctcatcccaa cttctcgaaa 2760 actgaatcat atagtgtttt tgaaaatttt gtcaaaggac ttagtgtaac taacgactgc 2820 gctgaaagaa acatcggatt aataaatgat ttttttaaat attctcaaaa agaagaacag 2880 cgtcaaaata tactattgat agcgagagag gagagaaaaa aagtaacaaa gaaagtaact 2940 caaaacgaac ttatgaaaat ttaaagcttt ctatatgata atatataatg agatatattt 3000 gtgaattaaa gaattatact gtcagttaat gataataatg agatatattt atgaataaaa 3060 gaattatact gtcagttatt taattttgat cttttctagc taaaaaagga agatttagga 3120 attttacagt aagcacttaa aatgatctaa gtgcaatttt ttagtatgta tttatcttct 3180 acctttatat agacaaggaa ttttgggtgt acaagcgaag tttagaaaaa aaaggttttt 3240 ctatatattt ttgggacacc c 3261 // ID Gypsy-109_AA-LTR repbase; DNA; INV; 172 BP. XX AC AAGE02027654; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-109_AA_; KW Gypsy-109_AA-I; Gypsy-109_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-172 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027654; Positions 15944 15773. XX SQ Sequence 172 BP; 51 A; 23 C; 47 G; 51 T; 0 other; tggtacttag ttacggccct gcgttaaatg agattcggga gcggggagta cggccctggt 60 agagaaaatg tcaattattg ttgtttgaga gcgtggaaat aaaggacgtg tttgagtgaa 120 ctgtaaaata aatattctat gttcttatga gtgcattacg cagaatatca ca 172 // ID Gypsy-89_CQ-I repbase; DNA; INV; 3962 BP. XX AC AAWU01006656; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-89_CQ_; KW Gypsy-89_CQ-LTR; Gypsy-89_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-3962 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 557-557 (2011). XX DR GenBank; AAWU01006656; Positions 9913 13874. XX CC Positions [3069-3428] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 915..3554 FT /product="Gypsy-89_CQ-I_1p" FT /translation="MGTKSSSSRLIIFDRTANTRFLIDTGSDLSIVPATTK FT EKQGSSTQWQLHAANGTVIRTFGQRFVTTDLGLRRRFSWNFVIADVSAPII FT GADFLAHFGIVVDLKRRRLVDSTTKLHSIAGLATPTVYNVTTVLADHPFKD FT LLDEFRDVTLPSTIRSEVQHDVTHHIQTKGPPLSCKSRRLPPDKLQAAKQE FT FETMLELGICRHSSSSWASPLHCVLKKNGAWRFVGDYRRLNAVTVPDRYPV FT PHIHDLLNALHGKSIFTTLDLERAYHQIPVEPCDIPKTAVITPFGLFEFTR FT MQFGLCNASQTFQRFMHKIFGDLDFVTVFVDDICIASANEDQHRDHIRCVL FT ERLHKNGLVINPAKCRFAQREVEFLGYLVNKDGLRPLPERVQALQDYKLPG FT TVKELRRFLALINGYKRFLPHATDEQAALRKLIPGNRKNDTRKLEWCDEAR FT AAFDNCKQSIASAALLHYPDPTKRLSLMVDASNTSAGAVLQQFEHGRWAPL FT GFFSQKFSEAQTRYSTFGRELTAMKLAVQYFRYLVEGRSFTIFTDHRPLTQ FT ALGSTSNNHLPHEERYLRYISQFTSDVQHVSGSKNEVADALSRVATIGFPS FT SVNFEEVAAEQINDEELKRLLNDGRSSLKLEPKPLPGTQAQIYCDVSPEGR FT VRPYIPEKLRCMILDTLHKVSHPGVRGSRKLLGDRFVWPSMHKDVRKYVQS FT CDGCQRAKVHRHTSSPLQSFAIPKSRFQHVHIDLVGPLPTSNGNRYVLTMI FT DRFTRWPEAVPLVDMTAETVAQALLTGWISRFGIPETITTDQGRQFESELF FT RELSQLLGIKRIRTTAYHPQANGMVERFHRTMKAAIMCVTRNTGATTPLIL FT LGLRSSFREDMKCTSAECVRSNAEAAW" XX SQ Sequence 3962 BP; 959 A; 1156 C; 1019 G; 828 T; 0 other; ctggtgacca ccgtcgcgaa tgtaacggaa atcaatttct tcgttgcgca aaataccgaa 60 aaaaacgcct ttttcttcgt ggaaactctg ctgccgtccg ccatcttgga tgcaaaaatg 120 acgcttcccg gttcgaacgt ctcgcttcca gcgtctccga cttctgtcgc cgctgtcgcc 180 gtgaaattgc cggagttttg gaagacggac ccgctgatgt ggttcgccca agcggaggcc 240 cagttcgcgc tcgctggtgt aacttccgac caaacgatgt actaccacat tatctccaag 300 gtggaccaaa cagttctgcg acacatttcc gacatcgtcg cgaacccccc agaagagaac 360 aaatacccgg cggtcaaggc gcgcctgatc gcccacttcg agttgtcggc cgaggaaaag 420 ttcgagaagc tgctcaattc ctgcgatctc ggagacgccc gtccgtcgca tcttcttgcc 480 aaaatgcaag atctatccac gggcctgaac gtgagcgctg gattgatgag gatgctcttc 540 ctgcagcgga tgccagtcaa cattcgaacg gtcctcgcgg tgtccaacgc caagctcgac 600 gagctcgctg ccatggccga caagatgatc gacgtcgccg gtccgcaggt ggctgcgacg 660 caacaagctg ctccaacaag cgtacaggac ctggaagctc aaattgccgc cctaagcagc 720 caactgcgac gattggaagc aaatccgaac cgaggaagat ctcgctctcc gtcgcggcgc 780 cgcagcgctt cgcagtcgtc ctcgcaagaa atttgctggt accatcgcaa gttccgagac 840 gcggcccaac agtgccgttc cccctgccaa tacagctcaa aaaactagat tccttctcac 900 tcgagacggc ggagatggga acgaaatcga gtagcagccg tcttatcatc ttcgatcgga 960 ccgcaaacac gcgcttcctg atcgataccg ggtccgacct ttctattgta ccggccacaa 1020 ccaaggaaaa acaaggcagt tcaacccagt ggcagttgca cgctgctaac ggaacggtca 1080 ttcggacttt tggacaacgc ttcgtcacaa ccgacttggg tctgcgccgt agattcagtt 1140 ggaattttgt gatcgctgat gtgagtgcac ccataatcgg agctgacttt ctggcgcatt 1200 ttgggatcgt cgtcgacctg aaaaggcgtc gactagtcga ctcgaccacc aagctccact 1260 caatcgctgg gttggcaacg ccgacggtgt acaacgtgac gacggtgctc gctgatcatc 1320 cgttcaagga tctgctggat gaattccgtg atgtgacact tccgtccacg attcgctcgg 1380 aagttcaaca cgatgtcacg caccacatcc agactaaagg ccccccgctg tcctgcaaga 1440 gccgtcgact tcctccagat aaactgcagg ccgccaaaca agaattcgag acgatgcttg 1500 aactgggaat atgccgtcac tcgagcagct cctgggccag cccactgcac tgcgtgttga 1560 agaagaacgg agcttggaga ttcgtcggcg actaccgacg tctgaatgcc gtgacggtgc 1620 cggaccgata cccagttccg cacatccatg accttctcaa cgcgttacac ggtaagtcca 1680 tattcaccac tttggatttg gagcgggcct accaccaaat tcccgtagaa ccgtgcgaca 1740 tcccgaagac tgcggtgatc acgcccttcg ggctgttcga atttacaaga atgcagttcg 1800 gattatgtaa cgcgtcgcag acgttccaac gcttcatgca caagattttt ggggatcttg 1860 attttgtgac cgtttttgtt gacgatattt gcatcgcctc agccaacgaa gatcaacatc 1920 gcgatcacat tcggtgcgtc ctggaacgtc tgcacaaaaa tggtctggtc atcaaccctg 1980 ctaaatgccg gttcgcgcag cgtgaagttg agtttctcgg gtatctcgtg aacaaggatg 2040 ggctgcgacc tctacccgaa cgagttcaag cgcttcaaga ctacaagctg ccaggtaccg 2100 tcaaagaact ccgtcggttt ctcgctctga tcaacggcta caaacggttc cttcctcacg 2160 ccacagatga acaagctgcc ctgcgaaagc tcatacctgg aaaccgcaag aacgacacaa 2220 gaaaactcga gtggtgtgac gaagcacggg ctgccttcga caactgcaag cagagcatcg 2280 ccagcgcagc cctcctgcac tacccggacc caacaaaacg gctgagtttg atggtcgatg 2340 cgtcgaacac gtcggccggc gccgtgctgc agcagtttga acacggccga tgggctcctc 2400 ttggcttctt ctcgcagaag ttctctgaag cgcaaaccag atattcaacc ttcggaagag 2460 aactaaccgc aatgaagctg gctgtgcagt acttcaggta cttggtcgaa gggcgatcgt 2520 tcaccatctt caccgatcat cggccgctga ctcaagcact gggatctaca agcaacaacc 2580 acctaccaca cgaggaacgg taccttcggt acatttctca gttcacttcg gatgtccagc 2640 acgtcagcgg tagcaagaac gaagtcgcag acgccctgtc tcgtgtggca accatcgggt 2700 ttcccagttc ggtgaacttt gaggaggttg cagcggaaca gatcaacgat gaagagctga 2760 agcggctgct taacgatggt cgttcatccc tcaagttgga gccgaagccg ctaccgggta 2820 cacaagcgca aatctactgt gatgtatccc ctgaaggtcg agtccgccct tacatccccg 2880 agaaacttcg ctgtatgatt ctcgacacgc tgcacaaggt gtcccatcca ggagtccggg 2940 gaagccgcaa actgctgggt gatcgattcg tctggccgtc gatgcacaag gacgtgcgga 3000 agtacgtgca gtcttgcgat ggctgtcagc gtgcaaaggt ccatcgacac acatcttcgc 3060 cgctgcagag cttcgccatt cccaagagca gattccaaca cgtgcacatc gatcttgtcg 3120 gaccactccc gacatccaac ggaaaccgct acgtcctgac aatgatcgat agattcacaa 3180 gatggccaga agcagtcccg ttggtcgaca tgacagcaga aaccgttgct caagcactgc 3240 tcactggatg gatatctcgc tttggcattc cggaaaccat aacaaccgat caaggacgac 3300 agttcgaatc cgaactgttc cgggaactga gccagctctt gggaattaaa cgcattcgga 3360 caacagcgta tcaccctcag gcgaacggca tggtcgagcg cttccacagg acgatgaaag 3420 ctgctatcat gtgtgtgact cgaaacactg gagcgacaac tccgttgatc ctcttggggc 3480 ttcgatcgtc cttccgggag gacatgaaat gcacatctgc cgaatgtgta cggtcaaacg 3540 ctgaagctgc ctggtgagtt ttcgaagcgc cggcgcgagg ggaagtggac cgctcggagt 3600 tcgtccgttc gcttaagcaa actctccaac aactcgcacc tgttggcggt agcaaccacg 3660 caaagcccaa ggtatttgtg ccgaaggacc tgaccacttg cgactttgtg ttcgttcgag 3720 ttgaccacgt caagcggccg ttgcagcaac cgtacgaggg accttacgag gtcgtcaacc 3780 gaagcagcaa gttcttcgat gtccgcatcg gtggcaagga gaagcgcata gccatcgaca 3840 ggatcaaacc agcattcaac tccaccgagt accacgacca aagcccagac gaagacgcca 3900 aaactaaagt tacaccttcg ggacatcgtg tccgattcct cgtgtaactg gggggagccc 3960 tg 3962 // ID Gypsy-71_AA-I repbase; DNA; INV; 4348 BP. XX AC supercont1.332; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-71_AA_; KW Gypsy-71_AA-LTR; Gypsy-71_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4348 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.332; Positions 367997 372344. XX CC Positions [1776-2144] - Reverse transcriptase CC Positions [3413-3724] - Integrase core CC 'ATTCA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 231..3425 FT /product="Gypsy-71_AA-I_1p" FT /translation="MAENQVAALPQLPAGPVLPAAQIPAGNMSVLGVRPPN FT PLNVGSEHLAEEFRDFRQSWDIFVLAAEIETKPDQVKVALLKNFLGLDAVK FT VLNTLLPEADQVTTAAILTALEGYCLPQTNETYERFVFNSSKQSDSEDINQ FT FVSRLRKLADTCNFGNLKDSLIRDRIVIGIQSDEARKVLLKTRNLTLVDAI FT DVCRSQKLVENRMAAICVKEEPLGAEANSEIIAKVRMESARSEKWSCKFCG FT RKDHKLGDRDSCPANNAKCRICGKLNHFAKVCRSKRDEEQIERKVVYGKQK FT RTAKVKTVEEQSDSSDESEYISTIETVNVVEKKIAVDVKFSYGKADPKAVQ FT CQVDTGASCNVITRYQLEKLVGKAVKLEASVTKLKDFGGNMVPAIGLVVLR FT HQSKDKRFKVAFQVVDLLPEVLQMPLLGLKTCKALGIVSIHDINTCERDAE FT ESKCEKKEVKNKEVEEIVKRYANLFSGDGKLEGKVDIKMNDAMPKQQEPRR FT VPVLLREKLECELNDLERRGIIAKVDQPTEWTSNIVVVVRKEKVRICLDPV FT ELNQVIKRPRFQMTTLEEILPDLHNAKVFSTFDVRNGFWHVELNERSSMLT FT AFWTPFGRYVWKRLPFGLSCSPEIFQQKLFQALQGLKGIEVLADDILCFGT FT GENEQQAIENHNNNLVALLERCRQTGIKLNRTKMRLNQSEVQFFGHVLSKD FT GLKPDPGKTTAISKMPQPSNAAELQRFLGLANYMSKFIPKLSEMSVDLREA FT LAEDSWSWGKRRREAFEVLKNQIAQITSLKFFNPGIETTIQCDASSYALGA FT VLMQTGQPILFASRMLTKTEQKYAQIEKETLAILFACKRFDQYIFGSRQVM FT VETDHLPLLSIFKKPISSAPKRIQRMVLSLQRYNLQVNYVKGSKLILADTL FT SRNVNVDQTPDAEEMDALWKAIEEVSTLEHIEIQDELLQHIKKETEQDEVC FT AQLKDYIVNGWPKEKNTVLDCAKPYFPFHNELVMQDGVILRGERLLVPAKC FT RMKALNLLHYSHQGEQATLRKARSVVYWPNMNDHIRNYVKKCEACNKFKSS FT QASEPIAR" XX SQ Sequence 4348 BP; 1316 A; 871 C; 1206 G; 955 T; 0 other; tggtgtcaga agtggttaat cttttttcgg tatttttgcg tggaaaagtc ggaatctaat 60 cggaaaaagt ggttacccgg aagcggaatt cgcgacggaa tcggaagata tcggaaactc 120 ggaggtgcgg cctctcgctg gttagaaagc actgggagaa gggatcggcg tcggcacgag 180 gttttcggga agaagcgcag tggttcatgc attagtagtg gtgttgattg atggctgaaa 240 atcaagtagc ggcactacca caactgccag caggacccgt cctaccagca gcacaaatac 300 cagcaggaaa catgagtgta ttgggtgtgc gtccaccaaa tcccttgaat gtgggtagtg 360 agcatttggc ggaagagttt cgtgattttc gacagtcgtg ggacattttt gtgctagctg 420 cagaaatcga aacgaaacct gatcaagtga aagtggcttt gttgaaaaac tttttggggt 480 tagatgcggt gaaggtgctt aacacattgc tacccgaggc ggaccaggtg acgacggcgg 540 caatactcac ggcgctggaa ggttattgtt taccgcaaac aaatgaaaca tacgagcgat 600 ttgttttcaa ttcgtcaaaa caaagtgaca gtgaagacat aaatcaattt gtgagcagat 660 tgcgaaaatt agccgatacg tgcaattttg ggaacttgaa ggattcgttg atcagggatc 720 ggattgtgat tggcattcaa tccgatgagg cgagaaaagt gctactgaaa acgcggaatc 780 tgacattggt tgatgcgatt gatgtgtgtc gctcacaaaa gctggtggaa aaccgtatgg 840 cggctatttg tgtgaaagag gaaccacttg gggctgaagc aaactccgaa atcattgcga 900 aagtacgaat ggaaagtgcg cgtagtgaaa aatggtcttg caaattttgt ggacgaaaag 960 accataagct gggtgatcgc gacagctgtc cggccaacaa tgcaaagtgt agaatctgcg 1020 ggaagcttaa tcatttcgcg aaagtgtgcc gctccaagag ggatgaagaa cagattgagc 1080 ggaaagtggt ctatgggaag caaaagcgta cggccaaagt gaagacagtc gaagaacaaa 1140 gtgacagtag tgatgagtca gagtatattt ctacgattga gactgtgaat gttgttgaaa 1200 agaagatcgc tgtcgatgtg aaattttcgt acggaaaagc cgatccgaag gcagtgcagt 1260 gtcaagtgga tacaggtgcg tcctgtaacg taattactcg gtatcagcta gagaagctag 1320 tgggtaaagc ggtaaagctc gaggccagtg ttaccaaact gaaggatttt gggggtaata 1380 tggtaccagc gattggatta gttgtcctcc ggcaccagag caaagataag cggttcaaag 1440 ttgcgtttca agtggtggat cttctaccgg aagtgctgca aatgccgctc ctaggcctca 1500 aaacgtgcaa ggcactagga atagtgagca tacatgacat caatacgtgc gaacgcgatg 1560 ctgaagaaag taagtgcgag aagaaagagg tgaagaataa ggaagtggaa gaaatcgtga 1620 agagatatgc gaatttgttt agcggtgatg gaaagctgga aggaaaagtg gacataaaaa 1680 tgaatgacgc tatgccgaaa caacaagaac caagaagagt gccagtgcta ttgagagaaa 1740 aacttgagtg tgaactgaac gaccttgaaa gaagaggaat catagcaaaa gtggaccaac 1800 ctaccgaatg gacgagtaac attgtggttg tggtgagaaa agagaaagtg agaatttgcc 1860 ttgatcctgt ggaactcaat caagtgatca aacgtcccag atttcaaatg acgacgttgg 1920 aagaaatttt gccagattta cacaatgcga aggttttctc tacgttcgac gtgcgcaacg 1980 gtttctggca tgtcgagctg aatgagagga gcagcatgct gacggcgttt tggacaccgt 2040 ttggccgata tgtctggaag cggctgccct tcggactatc gtgctcgcca gaaatcttcc 2100 aacagaagct gttccaagcg ctgcaaggac taaaaggaat cgaagtactt gcggatgaca 2160 tactgtgttt cggtacagga gagaacgagc aacaggcgat agagaaccac aacaacaatc 2220 tggttgctct attggaacgc tgcagacaga ccggaatcaa gctgaatcgg acgaagatga 2280 ggttgaacca atccgaagtt cagtttttcg gacatgtgct ctcgaaggat ggtttgaaac 2340 cagatccagg taagacaact gctattagta aaatgcccca gcctagcaat gcggcggaat 2400 tgcaacgctt tcttggtctc gcgaactaca tgtccaaatt cattccgaag ctttcggaaa 2460 tgagtgtgga tctccgggag gctctagcag aagattcgtg gtcttgggga aagcgtcgta 2520 gggaagcatt tgaggttttg aaaaaccaga ttgcgcagat tacgtccctc aaatttttca 2580 accctggtat cgagacgacg atccaatgtg atgcgagtag ctatgctcta ggagctgtcc 2640 taatgcaaac cggtcagccg attttgttcg cttctcgtat gctgaccaaa accgaacaaa 2700 aatatgccca gatagagaaa gaaactctgg caattttgtt cgcctgcaag aggttcgacc 2760 agtatatttt tggaagccga caagtgatgg tagagacgga tcatctaccg ctgctctcta 2820 tcttcaagaa gccgatatcg tcagcaccaa aacgtatcca gaggatggtc ctaagtctcc 2880 aacggtataa tctgcaggta aattacgtga aagggagcaa gctgatactg gctgacactc 2940 tctctcggaa cgtgaacgtt gatcagacac cggatgcaga ggaaatggac gcgttgtgga 3000 aggcgattga ggaggtcagc actctggagc acatcgagat acaggacgaa ctgttgcagc 3060 atatcaagaa ggaaacggag caggacgagg tttgtgcaca gttgaaggac tacatagtaa 3120 atggatggcc gaaagagaaa aatacggtgc tggattgtgc gaagccttac ttcccgttcc 3180 ataacgaact agtgatgcag gatggtgtga ttttgcgagg tgaaaggttg ctggttccgg 3240 ccaaatgtcg aatgaaagcg ttaaatctac tgcactacag tcaccaaggt gagcaagcta 3300 cactacggaa agcgcgttcg gtggtgtact ggccgaacat gaatgatcac atccgcaact 3360 acgtcaagaa gtgcgaggcg tgcaacaagt tcaaatcgag tcaagccagt gaaccaatcg 3420 ccaggtgatc acagattctg cgaaacagtt cacaagcgaa gaatggaaga cactgatggc 3480 ggagtacggc attcaacatt cgactagtgc tccgtaccac catgaagcaa acggcaaggc 3540 agaatccgcg gtcaaaatcg gtaaaaatct catcaagaag gcgttagagg aaggaaagga 3600 tgtatggcta gctctactgg aatggcgcaa cacaccgcag tccgatggtt acactccaac 3660 gcagaagctg atgggcagga aaactcgagg aatatttcca gttccgcaac accagctaca 3720 agtgaaccct gttgatgctg atcggatcag agacaacata gaagcaagga aggtgaagtc 3780 caaattctac catgatcgtg cagcacagga gttgccaaaa ctacagcgtg gacaggaagt 3840 gtatgtacag ttaaagccgg aaactactac gcaatggact cgtggaagag tagcggaggt 3900 tctgaacgat cgagattatc aaatacaagt aggagaggca gtctatcgtc gaaataggaa 3960 gtatgtgcgg gatgcaaata cgtcggcatt ctgtggctac caggattctc cagagttaag 4020 cagcaacaac gaccgtcagg atgaatcaac tgctgacagc ttcctcagtg cggccagtcg 4080 acgagatcaa caggtttcaa gcacaccaaa cgccaatagg ttgccttcga tcaagtttga 4140 gcctgctgaa gggtgtcaaa gcactccgac agtccacctc agcccctctc caagccagtc 4200 atcgtatgct gagcaacatc ctccatcaac agtgcgtgca gagccaacca accggccgaa 4260 gcggatggtc agacgaccat tgcggctgga tgactaccag ttggacgatt gagttatgtg 4320 gtgagagcca tcttaaagca gaggaagg 4348 // ID R2_DSe repbase; DNA; INV; 3595 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R2 in Drosophila DE sechellia. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2_DSe. XX OS Drosophila sechellia OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-3595 RA Stage D.E. and Eickbush T.H.; RT "Origin of nascent lineages and the mechanisms used to prime RT second-strand DNA synthesis in the R1 and R2 retrotransposons of RT Drosophila."; RL Genome Biology 10(5), R49-R49 (2009). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 178..3348 FT /product="R2_DSe_1p" FT /note="reverse transcriptase and restriction-like FT endonuclease." FT /translation="FERQNFSDGLVPQRKFIHIGTTNRNNEPRSNLRNLMT FT TRPSVDIFPEDQYEPNAAATLSRVPCTVCGRSFNSKRGLGVHMRSRHPDEL FT DEERRRVDIKARWSEEEKWMMARKEVELTANGHKHINKQLAVYFANRSVEA FT IKKLRQRGDYKEKIEQIRRQSALVPEVANLTIRRRPSRSEQNHQVTTSETT FT PITPFEQSNREILRTLRGYSPVECHSKWRAQELQTIIDRAELEGKETTLQC FT LSLYLLGIFPAQGVRHTLTRPPRRPRNRRESRRQQYAVVQRNWDKHKGRCI FT KSLLNGTDESVMPSQEVMVPYWREVMTQPSPSSCSREVIQMDHSLERVWSA FT ITEHDLRASRISLSSSPGPDGITPKTAREVPSGIMLRIMNLILWCGNLPHS FT IRLARTVFIPKTVTAKRPQDFRPISVPSVLVRQLNAILATRLNSSINWDPR FT QRGFLPTDGCADNATIVDLVLRHSHKHFRSCYIANLDVSKAFDSLSHASIY FT DTLRAYGAPKGFVDYVQNTYEGGGTSLNGDGWSSEEFVPARGVKQGDPLSP FT ILFNLVMDRLLRNLPSEIGARVGNAITNAAAFADDLVLFAETRMGLQVLLD FT RTLDFLSLVGLKLNADKCFTVGIKGQPKQKCTVLEAQSFYVGSREIPSLKR FT TDEWKYLGINFTATGRVRCNPAEDIGPKLQRLTKAPLKPQQRMFALRTVLI FT PQLYHKLALGSVAIGILRKTDKLIRYYVRRWLNLPLDIPIAFIHAPPKSGG FT LGIPSLRWVAPMLRLRRLSNIKWPHLTQNEVASSFLEAEKQRARDRLLAEQ FT NELLSRPAIEKYWANKLYLSVDGSGLREAGHWGPQHGWVNQPTRLLTGKEY FT IDGIRLRINALPTKSRTTRGRHELERQCRAGCDAPETTNHIMQKCYRSHGR FT RIARHNCVVNRIKRGLEERGCVVIVEPSLQCESGLNKPDLVALRQNHIDVI FT DIQIVTDGHSMDDAHQRKINRYDRPDIRTELRRRFEAAGDIEFHSATLNWR FT GIWSGQSVKRLIAKGLLSKYDSHIISVQVMRGSLGCFKQFMYLSGFSRDWT FT " XX SQ Sequence 3595 BP; 1079 A; 800 C; 890 G; 826 T; 0 other; gggatcaggg gtaattgcga gcagaggggg agtatttttc tgtaattcgt aagtcatatc 60 atatggtgtg cggaagggga attttactct gtaactcaca agtctctcct ttactcaagt 120 cgactcaaaa cctcctcgtg gtggtccccg gtaatgctaa acttgtttag cagctaattt 180 gagcggcaaa acttttccga tgggttggtt ccccagagga aatttattca tattggaact 240 acaaatagaa ataacgagcc tcggagcaat ttacgcaatc tgatgacgac ccgaccctcc 300 gtggacatct tcccggagga ccaatatgaa ccaaacgcag cggctactct atctagggta 360 ccctgcacag tatgtggccg gtcctttaac agcaagagag gactcggtgt tcacatgcga 420 tctcggcacc cagacgaact tgatgaagaa cgtcgacgtg tcgatataaa ggcaagatgg 480 agtgaggaag agaagtggat gatggcgaga aaggaggttg agctcacagc aaatggacat 540 aaacacataa acaagcaact agcggtgtat tttgcaaacc gcagcgtcga agccatcaaa 600 aagctaagac aaaggggcga ttataaggag aaaatagagc agataagaag gcaatcagct 660 ctcgtcccgg aagttgcaaa tctaaccata aggcgccgcc ctagtagaag tgagcaaaac 720 caccaagtaa caacatcaga aacaactcca atcacaccct tcgaacagtc gaacagggaa 780 attttgcgga cactgcgtgg gtatagcccc gtagaatgcc attccaaatg gagagcccaa 840 gagctacaaa cgatcattga cagggcagag ctcgagggaa aggaaaccac tctccaatgc 900 ttatcgctat atctcctagg aatttttccg gcacagggtg tacgtcacac tctgacgaga 960 cctcctcgga ggcctcgaaa taggagagaa agcagaaggc agcagtatgc tgtcgtccag 1020 cgtaactggg ataagcataa aggaagatgc atcaagtcct tgctaaatgg aactgatgag 1080 tcggtaatgc caagccaaga agtaatggtt ccttactgga gagaagtaat gactcagcct 1140 agcccaagct cttgcagtag agaagtgata caaatggatc actcgcttga gagggtttgg 1200 tctgctatta cagagcacga ccttcgggca tcaagaatct cattatcttc atctccgggg 1260 cctgacggga taactccaaa aacagccagg gaggtgccgt caggtattat gttgcgaata 1320 atgaacctaa ttctatggtg cggcaatcta ccacactcta tccgactggc cagaaccgtc 1380 ttcatcccga agacggtgac ggcgaagcga ccgcaagact ttcgtccaat atcggtgcct 1440 tcagtcctgg taagacagct aaatgcaata ttggcaaccc ggttgaactc atcaatcaat 1500 tgggacccgc gccagcgggg cttcttacct accgacggat gtgccgataa cgcgacgata 1560 gttgacttag tcttgaggca tagccataag cactttagat cttgctacat tgcaaattta 1620 gatgtaagca aggcattcga ttctttatcg catgcatcta tatatgacac cttacgtgct 1680 tatggtgcgc caaagggctt cgttgactac gtacagaata cgtacgaggg tggcggtacc 1740 agtctcaatg gggacggttg gagttcagag gaattcgtcc ctgctagagg agtgaagcag 1800 ggtgaccctt tgtctcctat tctatttaac ttggtaatgg acaggttact tagaaaccta 1860 cccagcgaaa ttggtgccag agtcggaaat gccattacta acgcggccgc gtttgcagat 1920 gatttggtac tatttgctga aactcgaatg ggacttcaag tattgttgga cagaacgttg 1980 gattttctat ctctcgtcgg cctcaaactt aatgccgaca aatgttttac cgttggcatt 2040 aagggccagc cgaaacagaa gtgtaccgtg ctagaggcac agagctttta cgtaggctca 2100 agggagattc catcattgaa gcgaacggac gagtggaagt acttaggcat caacttcact 2160 gcaaccggga gggttcgatg caatccggcc gaggacattg gtccgaagct acaaagattg 2220 acaaaggccc ccctcaaacc acaacagagg atgttcgccc ttaggactgt ccttatccca 2280 cagctctatc acaagttagc ccttgggagt gtggcgatag gcatcctacg aaaaactgac 2340 aaactaatac gatactatgt gcgaagatgg ctaaatcttc cgctggatat accgatagca 2400 ttcattcacg cacccccaaa aagtggaggt ctcggaattc catcacttag atgggtagct 2460 ccaatgttaa ggctaagacg tttgagtaat attaaatggc ctcacctcac gcaaaacgag 2520 gtagccagct ctttcctcga agcagaaaaa caacgggccc gagatagatt attagcagaa 2580 caaaatgaat tgttatcgcg tccggcaata gaaaaatatt gggcgaacaa attgtacctc 2640 tcagttgatg gtagcggact ccgtgaagca ggccattggg gcccgcaaca cgggtgggtt 2700 aatcaaccca cgcgtttact aacaggaaag gaatatatag acggtattcg tctgcggata 2760 aatgccctac ccacgaagtc tcgtactaca aggggaaggc acgaattgga acgacagtgt 2820 cgtgcaggat gtgacgctcc cgaaacaaca aaccatatta tgcaaaaatg ttaccgatcg 2880 catgggaggc gcatagctag acacaactgc gtagtaaatc gaatcaagcg gggacttgag 2940 gagagaggct gcgtagtcat tgttgaacca agtctgcagt gcgaatccgg tcttaataaa 3000 ccagacctgg tggcactacg acaaaatcac attgatgtga tcgacataca aattgtgaca 3060 gacggacact ctatggatga tgcacaccag cgcaaaatca atagatacga cagaccggac 3120 atacgaactg agttgcgtcg cagattcgaa gccgcaggtg acattgaatt ccattctgcc 3180 accctgaact ggagggggat ctggagtggt caatccgtta aaagattgat agcgaagggt 3240 ctcctcagca aatatgatag tcatatcatt agcgtccagg ttatgagagg cagtctcggt 3300 tgttttaaac agttcatgta cctgagcggg ttttcccgag attggactta gctaaaacgt 3360 ttggttcaaa acatttgctt gctgtcttgg cataacatca ataaaggcat aaacatcgca 3420 aataatggta atatataaat tggctatgag gatggtttta gtacgtaggc gttgcggaac 3480 ttcggttcag atagagcaat gaatcgtgca tgctaggaaa ctgaagtgtt gacagaccta 3540 gtatctttcg atagatttcc atacctccgc gatcaaaaaa aaaaaaaaaa aaaaa 3595 // ID CR1-38_HM repbase; DNA; INV; 4308 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-38_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4308 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1866-1866 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 3..623 FT /product="CR1-38_HM_1p" FT /translation="KVEKSILENAKKISTLATEVEEIKVSLNFHEELIENK FT IKTALDIFMKSKCTPNEIQNNNTELKKINSKLREIEDRSRRNNLRVDGVKE FT DDNESWLESELKVKTIFDEYLGIKNVKIERAHRAGKVGIKEHRTIVLKLLD FT FKDKEAILKNSSKLKGKNIFINEDFCTETNRIRKDLREKMKIERQSGKFAY FT ISYDKLIVREWNAKKK*" FT CDS join(780..1835,1798..3222,3098..3919) FT /product="CR1-38_HM_2p" FT /translation="MNPKTDKYKNLFENISDKVFKINDQNTFNKHDPDNSF FT IDNFSLMNMETPYMFPHEIKKYLKEVEPYEGLSLIHLNIRSAKANFENFKI FT FLEGSNFIFNIICLSETWLTDDEFNESVRFQLNNYEGVHLQRKSKNRGGGV FT VIYIKNSLRYKLRNDLCISDYDREFVSIEIINNGFKNTIISCCYKPPQSST FT EIFSNHLQNIILKSFQEKKSLFVVGDFNLNALNYDKNKETQNFYNDIFRFG FT VIPLINRPTRITRNSSTLIDNILTNSLFEISLKKGVIKTPISDHFPIFISI FT KTSNKIKISQKMTITRRHFSLDSQVYFKRELANIDWSDLESSNDTNLMYNK FT FINIFFIFLCITSLSIFFLFFYEKHFPKVTKTIKLKDINSPWLSKGLKKSS FT KRKQKLYIKYLKKKSDKTKTEYKNYANLFEKLKKTAKATYYNKLLKKCQTD FT SRKTWQLLNEIIGKPKTSKPCFPKTIKINHKSIDNENDIANEFNNFFVNIG FT SKLASQIPRVNKSFDKYLIRNRNKLNNEKLTMLEIEEAFKSLKRNKATGID FT DVNSNIVINCYNELKNPLFKIFKHSLKEGIFPEKLKTAKVKPIFKSGDASE FT IGNFRPISILSTFSKILERIMHNRLYTFFKENNLIYSKQFGFQKNTSTEHA FT ILHLVEEIKNSFTNGEVTLGVFVDLSKAFDTVDHKILLSKLNMYGIRGRSN FT KWIESYLDKRVQYVYYGDNKLSNPSIIECGVPQGSILGPLLFLIYVNDLCM FT ASKRISTIMFADDSNFFISGKYNSELCVEMNNELEKISEWFRANKLSINSS FT KTKFSLFHPSFKKKRSSIFFSEIVYKRFQSGLGQTNSLLTQVKQSFLYFIL FT LLKKKEVQSFFPKLFIENREIGRDTVTKFLGVFIDENLNWNKHIAYIGSKI FT SKSIGIIYQSRYLLKKPLLKQIYYSFVQSYLNYGNIAWGSTYKTKLEPLYR FT KQKHAIRVINFKNKTEHSKPLFEQMSLLTLFELNVYNVLSLMYNSKMQKSP FT SIFYSLYKQKPEYKYLLRTNSTLLEPSCKNKLEQFKMTYRAPHIWNKFLLI FT NLNNIDLKNLNCFKKTIKKSILKYKNLVSDFFLNLFINIFFFI*" XX SQ Sequence 4308 BP; 1771 A; 548 C; 592 G; 1397 T; 0 other; acaaagtaga aaaaagtatt ttggaaaatg ctaaaaaaat atccacactt gcaacagaag 60 ttgaagaaat aaaagtaagt ttaaattttc atgaggaact cattgaaaac aaaattaaaa 120 ccgcgttaga tattttcatg aaaagtaaat gtactcctaa cgaaattcaa aataacaaca 180 ctgagctcaa aaagataaat agtaagctaa gagaaataga agatagatca agaaggaata 240 atttaagagt tgacggagtt aaagaagatg ataatgaaag ctggttggaa agcgagttaa 300 aagtaaaaac aatatttgac gaatacttgg gtataaaaaa tgtaaaaatt gaaagagcgc 360 atagagctgg taaggtaggt ataaaagagc acagaacaat tgttctgaaa ttattggact 420 ttaaagataa ggaggcaatt ttaaaaaact cttcaaagtt aaaaggaaag aacattttta 480 taaacgaaga tttttgcacg gagacgaatc gaattagaaa ggatttacga gagaaaatga 540 aaattgaaag acaatcggga aaatttgcgt atatttccta cgacaagctt attgtacgcg 600 agtggaatgc aaaaaaaaag taattttatc ttcttttttt aacctttttt atttacgtta 660 tattttaaaa ttttaaatgt tttattgaat atatatatat atatatatat atatatatat 720 atatatatat atatatatat atatatatat atatatatat ataacggaaa agctttaaaa 780 tgaatccaaa aacggataaa tacaaaaatc ttttcgaaaa catttcagat aaggtattta 840 aaataaatga tcaaaacact tttaataaac atgacccaga caacagcttt attgataact 900 tttcattaat gaatatggaa acaccttata tgtttccgca tgaaataaaa aaatatttaa 960 aagaagttga gccttatgag ggtctgtctc ttattcatct taatataaga agtgctaaag 1020 caaattttga aaactttaaa atattcttag aaggaagcaa tttcattttt aatattattt 1080 gtttaagcga aacatggtta actgatgacg aatttaacga aagcgtgcgt tttcaattaa 1140 ataattatga aggagtgcat ttgcaaagaa aatcaaaaaa tagaggcggg ggtgttgtaa 1200 tttacattaa aaatagtttg cggtataagt tacgaaacga tctatgcata tctgattacg 1260 acagggaatt tgtttctatt gaaatcatta ataatggctt taaaaatacc ataatatcat 1320 gctgttataa accgccacag tcttcaacag aaatattttc aaaccatctt caaaatatta 1380 ttttgaaaag cttccaagaa aaaaaaagtt tatttgtagt aggagatttt aatcttaacg 1440 ctctaaacta tgacaaaaat aaagaaacac aaaactttta caatgatatt ttccgatttg 1500 gtgtaattcc tcttataaat aggccaactc gaattacaag aaattcatca acattaatcg 1560 ataatatact aacaaattct ttatttgaaa tctcgctaaa aaaaggagta attaaaacac 1620 caatatcgga tcattttccg atatttattt caataaagac ctcaaacaaa ataaaaataa 1680 gtcaaaaaat gacaatcaca aggcggcatt tttctttgga cagccaggtc tactttaaaa 1740 gagagttagc gaatattgac tggtcagatt tagaatcttc taatgacacc aatttaatgt 1800 ataacaagtt tatcaatatt ttttttattt ttttatgaaa aacattttcc caaagtaaca 1860 aaaacaatta aacttaaaga cataaattca ccttggttaa gtaaaggtct taaaaagtca 1920 tctaaacgga aacaaaaatt atatataaaa taccttaaaa aaaaaagtga taaaacaaaa 1980 acagaataca agaattatgc aaatttattt gagaaactta aaaaaaccgc taaagctact 2040 tattataaca aacttcttaa aaagtgtcaa acagattcca gaaaaacatg gcaattatta 2100 aatgaaataa ttggaaaacc aaaaactagc aaaccatgct ttcctaagac aataaaaatc 2160 aatcacaaat caattgataa tgaaaacgac attgcaaatg agtttaacaa cttctttgtt 2220 aatataggat ctaaacttgc atctcaaatt cctcgtgtaa ataaatcttt tgataaatat 2280 ttaattcgta atagaaataa actaaataac gaaaaactaa ccatgttaga aattgaagag 2340 gcttttaaaa gccttaaaag aaacaaggct actggaattg acgatgtcaa tagcaacata 2400 gtaattaact gttacaatga gcttaaaaat cctttattca aaatttttaa acactcttta 2460 aaggaaggta tatttcctga aaaactaaaa actgctaaag taaaacctat atttaaatca 2520 ggagatgcaa gtgaaatagg taattttaga ccaatatcaa tactttcaac attctctaag 2580 atacttgaga gaattatgca caataggtta tatacttttt ttaaagaaaa caatcttatt 2640 tattccaagc agttcggctt tcaaaaaaat acatcaactg aacacgcaat ccttcacttg 2700 gttgaagaaa taaaaaactc ctttacaaac ggagaagtta ctttgggggt ttttgtagat 2760 ctctcgaagg cttttgatac agtcgaccac aaaatattat tatcaaagct taatatgtac 2820 ggtataaggg gtagatccaa caagtggata gaaagctatt tagataaacg tgtgcaatat 2880 gtatattacg gagacaataa actctctaat ccgtctataa tagagtgtgg tgtacctcaa 2940 ggttcaatac ttgggccctt gcttttctta atctatgtaa acgatctctg tatggcttca 3000 aaaagaattt caactataat gtttgctgac gatagcaact tttttatctc tggaaaatat 3060 aatagcgagc tgtgcgtaga aatgaacaat gaattagaaa agatttcaga gtggtttagg 3120 gcaaacaaac tctctattaa ctcaagtaaa acaaagtttt ctttatttca tccttctttt 3180 aaaaaaaaaa gaagttcaat ctttttttcc gaaattgttt attgaaaata gagaaatagg 3240 tagggatact gttacgaaat ttttgggtgt ttttattgac gaaaatctaa attggaataa 3300 acatatcgct tatataggta gtaaaatatc caaaagcata ggaataatat accagtcacg 3360 ttatttatta aaaaaaccac tactcaaaca aatctactat agttttgttc aaagttacct 3420 aaactacgga aatatagctt ggggtagtac ctataaaacg aaactagaac cactttatcg 3480 taaacagaag catgctatac gtgtgattaa ttttaaaaat aaaacagaac actcaaaacc 3540 tctttttgaa caaatgtctt tattaacatt gtttgaacta aatgtatata atgttctaag 3600 tcttatgtat aatagcaaaa tgcaaaaaag tccttcaatt ttctatagtc tttataaaca 3660 aaaaccagag tacaaatatt tattgcgtac aaatagtacc ctgttagagc cttcatgcaa 3720 aaataagctt gaacaattca aaatgactta tcgtgcacct catatctgga ataagttttt 3780 actaattaat ttaaataata ttgatttaaa aaacttaaat tgctttaaaa aaactattaa 3840 aaagagtatt ttaaagtata aaaatttagt tagtgatttt tttttaaatt tatttataaa 3900 tattttcttt tttatttaat ataaatatat tttcaacatt gcattagtaa taaatgtgta 3960 aatattattt taaaaaaaaa aaaaaacaaa acgatttttg taaaaatata tttgtatact 4020 tgttatctga cattttaata ccattataat ttaatgattt tatgtaaaaa ttttatgaat 4080 atgtttttat atttacggca atgtaaagcg gttcttgatg ataagaccat ttggtcttct 4140 gcaagttttc cgcgttcttt ttacagtgtt agagtgaatt tattattttt gtaaaaccac 4200 tttgtaatat tttgttttta tttatatatt cactgttaac gaatctttat tatgtggtta 4260 cgaatttgta atattaagga acaaaaaata aacttaaaat aaaataaa 4308 // ID Gypsy-34_DWil-I repbase; DNA; INV; 5633 BP. XX AC scaffold_181096; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_DWil_; KW Gypsy-34_DWil-LTR; Gypsy-34_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-5633 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181096; Positions 694345 699977. XX CC Positions [4732-5208] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 879..2297 FT /product="Gypsy-34_DWil-I_2p" FT /translation="MPLIRSNEELSVLPEAECTICLKALLETEELRITKCN FT HVFHQECIENYIQEHQECNFCKKLCHQKDLVQFIMSDQAEGPIINKNRKTH FT NTRAQSQSSNVASPASQLKSGGRGRKPKSSTARKPENSNQQRSARSQQQSI FT SNRAAEINVNSIFERMEYLIDQRLGNLNLNNSVGRSNNHDDRVPTPRSDSV FT SHHYINPGKAAQLVQSWGIKFDASPKGPTVSEFLYRIKALTVDNFQSDFTP FT VCRNLHLLLTGRAKEWYWRYRKEHDEVIWEEFCMDLKQEYDDFKDDDLILE FT ETRNRKQRNGEAFDVYYEDVMTLISRLVIPLEEKRLVHQLQRNLISEVREA FT LLYVPIESVSKLRQLVHRREQFLREESGKRITKYTNVRKNIALLEETDENP FT EEEDIQPELNAIQVKEKKYICWNCDIEGHRWDMCLEERRIFCYGCGKKNVY FT KPQCDVCLSRSKNSNQGPSFRRSRVDPQ" FT CDS 3097..5577 FT /product="Gypsy-34_DWil-I_1p" FT /translation="MYKEIDRMLEMGVIERANSPWSSPMRLVIKPGKVRLC FT LDARKINEVTKKDAYPLQSIDGIFARLPKANFITKIDLKDAYWQIALSEKS FT KPITAFTIPGRPLYQFTVMPFGLCNASQTMARLIDDLIPADLKYCVFGYLD FT DLCVVSEDFDTHLEILIRMSQKFRKANLTLNIEKTKFCVTEVPYLGYIIGR FT GGISTDPAKIEAIKTWPIPKTIKETRGFLGVVGWYRRFIDNFSDITFPITQ FT LLTKKKKFLWTQEAQIAFEKLKEKLTSAPILINPDFNKKFFVHCDASNYGI FT GAMLVQLDEEGAEKPIAFMSKKLTTAQRNYSVTERECLAALEAIEKFRCYL FT ELQEFEVITDHASLVWLMRQSDLKGRLARWAMKLQGYKFEVSHRRGQDNVV FT PDALSRMFSEDLCALEELRPLVDLNSPSFNDEEYQQLRQKIEKEQERYPDL FT KIIDQFVYIRTEHDDGTESSQVNIWKLWIPQSLSKEIIKRAHEDTISAHGG FT MAKTLELIRRTFFWPGMVVQVRDYIRNCETCKSTKAPNQILKPPMGKPAVS FT ERPFQRLYIDLLGTYPRSKRGHIGLLIVVDHLSKYHWLQPLKVFKSNVIQE FT YLQDNIFFRYGVPEILVSDNGSQFKASELNAWLTTLGIKHIYTAVYSPQSN FT ASERVNRSIIAAIRAYLKEDQREWDLHIPAISCALRNSYHQSIGCSPFHAL FT FGFDMATHGSQYTLLEKINMLNESNNVLRKDDQLALRRNFLKKNIAQAAEQ FT NAKVYNLRTKEVNFKVGQEVFRRNFGQSNFQKCVSAKLNLLWLKSRVRERL FT GNCYYVLEDLNGKLIGTYHAKDIRQ" XX SQ Sequence 5633 BP; 1886 A; 1006 C; 1152 G; 1589 T; 0 other; taattttggc gcccaacgtg gggcttgtta gacttgtata cagtttctcg gttcaatcgt 60 gtttctttat aggattcagg ctgaactgaa gggttcaaga atttcaggtt cctttttcct 120 tcttctactt cttattctat ctcttaataa aatccccagc attttaatct atcctttgct 180 atcaaaaatt tttattttcc tttcatttcg ctttcttttt gtattcacgg atagttggtt 240 gagtgactta ttcattagga atgcagatat ttggagaatc cgagttctga ccagtgataa 300 ggcgcttgag cattaatcgg tgaatatact tagaaacata tacaaactct acaaaatagt 360 aacagacgta tattataggc tgtgtttttc ttggaattcg ttaatttttg attgtttcga 420 ttttacgacc agaaagcaca ataggcttgt gatgtcggcg ttagcattgt agtttattga 480 ttccattcat ttcttgtaat tacaatagaa caaatgcttc agcggatgtc aatgaattga 540 ctgcttgttc atctggaatg cagagatttg gctaggcccg aattctgacc gatgatacaa 600 agtgagttca gtttatgatg ttggttgaaa ttagtctttt tgttttggat ttgatataga 660 tgattaggtc tagaacgaca agttttctct atcccttcac atataaattt ctgagattga 720 actatccatt agttcaattc tccctctata ttagtaccgc acctataatt ttgctcaccc 780 tcactttcaa aacaaggaag ataatcgtta tatgatataa tatattaaaa caaattaaat 840 aaagtatcta aagtaatttt tgtggaaaaa taatcaaaat gccactaatt cggagtaatg 900 aagaattgag tgtgctacca gaagcagaat gtacaatttg cttgaaagcc ttattagaaa 960 cagaagagct taggattact aaatgtaatc atgtttttca tcaggagtgt atagaaaatt 1020 atatacaaga acatcaggaa tgtaattttt gtaagaaact atgccaccag aaagatttag 1080 tccaatttat aatgtcagat caagccgagg gtcccataat taataaaaac aggaagaccc 1140 ataatactag agctcaatca caatcaagca atgtagcatc tccggctagt cagttaaaat 1200 caggaggtag aggtagaaaa cccaaatctt ctacggcacg taaaccggag aactcgaatc 1260 aacagcgttc agctcggtca cagcaacaat caatttcaaa tcgagcagct gagataaatg 1320 ttaattcaat ttttgaaaga atggaatatt taattgatca gagattggga aatttgaatt 1380 taaataattc agttggacga tctaacaacc acgatgatcg cgttccaacc ccaagatcag 1440 attcagtatc acaccattat attaatcctg gaaaagccgc ccaattagta caaagttggg 1500 gtattaaatt tgatgcgtct cctaaaggtc ctacagtatc agagtttcta tacagaataa 1560 aagcattaac tgtagacaac tttcagagtg attttacacc ggtatgcaga aatttacacc 1620 tactgttaac cggaagagcc aaggaatggt attggcgtta tcgaaaagaa catgacgaag 1680 ttatttggga agaattttgt atggatttaa aacaagaata cgacgatttt aaggatgatg 1740 atttgattct tgaagaaaca cgtaaccgaa aacaaaggaa tggtgaagca tttgatgttt 1800 actatgaaga cgtaatgact ttgatatcga gattggtaat acctttggag gaaaaacgac 1860 tcgtacatca gctacaaaga aacctgattt cggaagtacg agaagcttta ttatacgttc 1920 caattgaatc ggtatctaaa cttcgccagt tggtgcaccg ccgagagcaa tttctccgtg 1980 aagaatcagg aaaacgcatt acgaagtaca caaatgttag gaagaatatc gctttactcg 2040 aagagacaga cgaaaatccc gaggaagaag atattcaacc ggagttaaat gcaattcaag 2100 tcaaggaaaa gaaatatatt tgttggaatt gtgacatcga aggacacaga tgggacatgt 2160 gcttagaaga gcgccgcatt ttctgctacg gatgtggaaa gaaaaatgtc tacaaaccac 2220 aatgtgatgt atgcctatcc cgttcgaaaa actccaatca gggcccatca ttccgtcgca 2280 gccgagtgga ccctcagtag atgtcaagaa taacacagcg acgaacaaca taacatcaga 2340 tttgacaaat aaaatttagc cgtatcccag actaccatac catcaacggc tgcaaaacta 2400 tttagcgatt cgagataaaa tatttgcaca ggaggttgca acttgccagt ttattaggag 2460 tcactcaagc gaacgtctgc gtcgcttttg gaagaaagta aaagatgtcc aaaaacgaat 2520 aatagcgtct atgattaacg aagccgcaga tggccgtacg ctccagtaaa gtttctcgaa 2580 ttttctgagc ttggtctatt ggatactgga gccaatatca gttgtatcgg atcagattta 2640 gcagcaaaag agtggacgaa atatcccgga ttttcaacta ttgtttcaag tgtggcaaca 2700 gcagatggga aaagacatcc agttaccgga ataataagaa cagaggtgca atacgaaagc 2760 aagcgaaaac ctctttctct ttttattatc ccgtcattag cgaaacgagt tattttagga 2820 atcgatttct ggaaatcttt tgagttatgt ccagaatttt catctcgttg taagactagc 2880 atattagata aaaatattca ctcgattgat gacgaaaaga aatgtttatt aaacccagag 2940 caagaaaaaa gattaaacga tgtgattcaa ttgtttccgg attttaaaca acagggatta 3000 ggccgaactc cacttattgt tcatgacata gagatcaaag aagcaactcc aatcaaacaa 3060 cgcttctatc ctgtgtcgcc agcggttgaa aagttgatgt ataaagaaat cgaccgaatg 3120 ttggaaatgg gagtgatcga gagagccaat agtccttgga gttctcctat gcgtctggtt 3180 ataaaacccg gaaaagtgcg attatgtttg gatgcaagaa aaataaatga agtcaccaaa 3240 aaggacgcat atccgctgca aagcatagat ggaattttcg cgcgactgcc caaagcaaac 3300 tttataacaa aaattgactt aaaagatgcc tactggcaaa ttgctctttc agaaaaatcc 3360 aaacccataa cagctttcac gatacccggc cgccccttgt accagttcac tgtaatgcca 3420 ttcggattat gcaacgcttc gcaaaccatg gcacgattaa tagacgatct aattccagca 3480 gatctgaaat actgtgtgtt cggataccta gacgatctct gtgtggtgtc agaagacttt 3540 gatactcatt tagaaattct aattcgtatg tctcagaaat ttagaaaggc aaatctaact 3600 ttaaatatag aaaaaacaaa gttttgtgtt acagaagtgc catatctggg ctatattata 3660 ggaagaggag gaatttcaac agaccctgct aaaatagaag caataaagac ttggccaatt 3720 ccgaaaacga taaaggaaac cagaggtttc ttgggtgttg ttggttggta tcgtagattc 3780 attgataatt tttctgatat tacatttccg ataacccaat tactaactaa gaaaaagaaa 3840 tttttatgga cacaggaagc tcaaattgcc ttcgaaaaac ttaaggaaaa gcttacctct 3900 gcaccgatct tgatcaatcc agacttcaat aagaaattct tcgtgcattg tgatgcaagc 3960 aactatggca tcggagcaat gttggtacaa ctggacgaag aaggagctga gaaacctata 4020 gcatttatgt cgaagaagtt aacaacagct caacgaaact atagcgtaac agaacgcgag 4080 tgcttggctg ctttggaagc aattgaaaaa ttccgctgct atctggaact acaggaattc 4140 gaagtcataa ccgaccacgc cagtttagtt tggttgatgc gacaatcaga tttgaaagga 4200 agactcgcga gatgggccat gaagctgcaa ggttataagt ttgaagtaag tcaccgacga 4260 ggtcaagata atgtagtacc cgacgccttg tccagaatgt tttcagagga tttgtgtgct 4320 ttggaagaac ttcgcccctt agtcgatttg aactctccta gtttcaatga cgaagagtat 4380 caacaactaa gacagaaaat tgaaaaagaa caggaaagat acccggattt gaaaataata 4440 gatcaatttg tttatatacg cacggagcat gatgatggaa cagaatccag tcaagtcaat 4500 atttggaaat tatggattcc acaaagtttg tccaaagaaa ttataaaacg agctcatgaa 4560 gacacgatat ctgctcatgg cggaatggca aaaactttgg agctcatacg tcggacattc 4620 ttttggcctg gcatggtggt acaggtgcgt gattatatac gaaactgtga gacgtgcaag 4680 tcaacgaaag cacctaacca aatcttaaag ccaccaatgg gtaagcccgc agtatcagag 4740 agaccttttc agcgattata tattgatctt ttgggaacat atccgagaag taaacgtggt 4800 catatcggac tcctgatagt ggttgatcat ttatcgaagt atcattggct tcaacccctt 4860 aaagtattta agagcaatgt tattcaagag tacctgcagg ataatatatt ctttagatac 4920 ggagttcccg agatattggt tagcgacaac ggatcacaat tcaaggcaag cgaattaaat 4980 gcatggttga cgacgttggg tataaaacat atatatacag cagtttattc cccacagtcg 5040 aatgcctctg agagggtaaa cagatccata attgcagcaa tacgcgccta tctcaaagag 5100 gaccagcgcg aatgggatct acacatacct gctataagtt gtgcactaag gaattcgtat 5160 catcagagta taggatgctc tcctttccat gctttgtttg gctttgacat ggccacacat 5220 ggttcccaat atactcttct cgaaaagatt aacatgctga atgaatcgaa taacgtgttg 5280 cgaaaggacg atcagttagc attacggcgt aattttctca agaagaatat agcacaggca 5340 gcggagcaga atgcaaaggt atataatctg agaaccaagg aggttaattt caaagtagga 5400 caagaagtct ttagacgtaa ttttggacag agtaatttcc aaaaatgcgt tagtgctaag 5460 ttaaacttgt tgtggttaaa gtctcgagtt cgagagagat taggaaactg ttattatgtg 5520 ttagaggatt taaatggtaa actgatagga acctatcatg ctaaggatat taggcagtaa 5580 ctgtcttttt tcaggttaat actccgaagc aagctctagt attatcctgt ggt 5633 // ID DNA8-106_AP repbase; DNA; INV; 1366 BP. XX AC . XX DT 29-AUG-2009 (Rel. 14.09, Created) DT 29-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-106_AP. XX NM DNA8-106_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-1366 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2044-2044 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 1366 BP; 486 A; 206 C; 229 G; 443 T; 2 other; cagtggcgta tttaggaatt ttgatagggg gggggggatg aaatatataa taataggtac 60 actcaataaa tacgaatata atatacattt ttaaaatccc ctaacttgtt gagattagtg 120 tggatcaagg tgaaataaga attaacacaa attgtatagt acctaatagc gtttattata 180 aattcatact aacgaaaatc ccgtgtttaa tacgataaac aaaaaaaata gctttataat 240 acaaaatccc gttttctatt tttttggatg ctcagttcat cgataacctt atctggcatg 300 attattgtat tgatcggtga cgtgcaaata aagccaacac attaagggtc cgtattacaa 360 tagttaaact aatgttaaac cgcggaattc gactggtaaa atcagttggt taagcgtaac 420 cgaggcaggg actggaacct tgatgatttt tacggttccg gttcctattc ggttcccagt 480 aaaactaaaa tttcggtttc ggttcctatt cggttcccag aaaaactaan atttaggttt 540 cggtttcttt tcggttctta aaaaataaaa aaattgttac ctgatttggt tttttactta 600 tgtaaaaaaa tgtgtcaagt aaaagtatcg gttccggtaa ggttccggtt tttaaattaa 660 tttaaataag ggttccggtg aggttccggt attaaaatta atttaaataa gggtttcggt 720 gaggttccgg tccctgaacc gaggtttaaa ctatgattgt aaaacgggcc ctaagtggct 780 aagtngctaa gtcggttata ataaacaaag tatataataa ttactattta atttatagtt 840 attattcaat attagatagg aaaattgata aaatatttac ttctgtcata ctatttatta 900 agtattactt aatacttaat aaaaacttga aaaacatcgt tcagttgtat agcagttgat 960 acaggcaaag tacaaaatat tttcaagagt tcatgaatat ttggataaaa ctctttatca 1020 caaacatcta aagcatgtaa cccagaggtg aatacaatgt tttcagtcac taaaaataaa 1080 cccatgccct atggtaaatc tacagtcttc atggtaattt tgttttattt accgaccatt 1140 tatcgggttt attataaaca ctcatgcatg aaaaataatt cgcagaaaat tcaaaaatac 1200 agtataccat accaccatag aggtatatgt ttttcagata tacctactta aaacttaaaa 1260 gtagtatcat attgagaaat acagaaaata agtaaactta actttacaaa aaaaaaaggt 1320 caagggggga tctatccccc atatcccccc cgtaaatacg ccactg 1366 // ID hATm-56_HM repbase; DNA; INV; 4118 BP. XX AC . XX DT 16-JAN-2009 (Rel. 14.02, Created) DT 16-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-56_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4118 RA Bao W. and Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 9(2), 390-390 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(1858..1941,1884..3242) FT /product="hATm-56_HM_1p" FT /translation="MKEIRGRAEKVAMNVIIFNVYFLMDEKIKSSNECDNI FT QCIFFDGRKDLTKYMREEEDGKLHQSEKKEEHYSVCSEPGGKYQFHFTNNS FT EERVETAAEQIADNLYEWIMNHDVGDYLVAIGADSTNLNTGWKGGVIHYLE FT HKLDKKLMWIICALHTNELPLRHLMIQLDGKTASKNNFSGSIGKLIGKATS FT FKVKDSIKKIEFPNAIIELDSEVVKSLSDDQKYLYQITHAINAGVFPENLK FT GKEIGPHSHARWLNFANRICLLWCSEIVLSVEETYNLQLLVEFIVGVYAPL FT WFEIKVKWKWTEGPNHILKQLKLLSYQRLKVKDIVMPYVLSSSWNAHSENV FT LQALLCSLEKAERQFAVETILKLRGDLKYGNTNVRSRKHHIHNENAESLFD FT LIDWSLDVHEPLLTCLLSKDELLKYNDIPMCVPCFPVHGQAIERCVQVVTR FT ASQSVFGKDRRDGFIRATLAHREIMPVNQSKKRSI*" XX SQ Sequence 4118 BP; 1496 A; 573 C; 674 G; 1374 T; 1 other; tagggtgggt cgtactttta aaattttgga tatgggagcg cgcgcaataa tttttttatg 60 tattaggata ttagaagaga cgtgaataat ttttttttct taaaaagttc atctagtgtg 120 ggcgcagggc gtttgaaaaa tgtcagtttt acgaaaaact catcatattg gcatttttcg 180 gcctttttta tattagccaa aggtaaggca attatcaaaa aacaatcaga cataacatag 240 acataaaaat aaaacataat attcggtgct acttttctta atgctataca aatacggtag 300 acgtatgaaa aaatgtctac cattatggat tttacaaaaa aaatatctat atataaatgt 360 tagaaatggg atggcgcgca atattttttt tatgtattag gatattagaa aagacgtgaa 420 taatttattt tttcttaaaa agttcatctt ttgtgggcgc agggcgttta aaaaatgtca 480 gttttttcga aaaactctaa aaaaaattag tattaactta ttattgttat tttcaatgcg 540 cataatgcta ctagtataaa atgaggtgaa tcatagtcat catagtcagt atgatttttt 600 atttgttata aagttccaat agcttatttg taggataatt actaagataa tagtttcctg 660 acgatttgcg ttttaaacgt gttctattta tttatgatca ctttgacatc ctaaatattg 720 ttataaaaat gatgtaatta acaaatttat aacatatata taaaggtagt ctttttacct 780 actacaaaaa gctactaata tttttaaaca aacataacat caatttcatt gcgataaatg 840 gctaaatcta ctcgagctca aacaatatta aaatgcccta aattgcacga gctattgggt 900 aatggactcg aactttgttt atcagagctg cctacgctga gagattgttt acgatatgga 960 attttgttgc gtgatagatc agttacacat ttggatacaa gatgtatgtg tatacaaata 1020 tttattaaaa taaaaagtat ttggatgtca gcaaattcta caatcccgat ggtctcagaa 1080 aaatatgcag tcgataaact tgtccttgaa tggaacattg caaaagacat tggaaaattg 1140 agaactacga aaaaaaagaa atatgtatac ggacaaactt gataagctat ttgacttgac 1200 aaaatgtaag tgcaaaatct tatgctgtaa tgaaaatgga tgcaatggtt gtctttttaa 1260 agctcatgta acttgcatat gtccaaaagt agcaaagata cctttgcaag aactattttt 1320 cctacaagct caaagaaata aaattggtaa caaaagttct ttacaaatag gattagttga 1380 tatatcagag acacatcggt taacaagata tttggatcga aaagaatcag actctaaaat 1440 acctggagca aatagtgagg atataggtaa atgctaaaat attaaaatat tttttgtcaa 1500 tttttgttta aactgtgaat taaaaaattg agtataaatt gaatataaat tattttgatt 1560 aaaattattt aacataattt attttctcct ttagcagtag aatttcaaac aacaaacgat 1620 tcggacattg gcaacaaaaa agcaaatgat acagaaaaag ctgacttcag tttgaaaaaa 1680 aattacttag atattaccag ggtttctcaa gccgctatta ggtatggggt atctaacaga 1740 gctgctgcag ccatagctac tgctacttta gcttctgcaa aagatgcaca ttttttaaag 1800 aaaaatactg atatttttgt tgatcacaac aaaataaaac gaagtaagac taagttaatg 1860 aaagaaatcc gaggtagagc tgaaaaagta gcaatgaatg tgataatatt caatgtatat 1920 tttttgatgg acgaaaagat ttaactaagt acatgagaga agaagaagat ggaaaattac 1980 atcaatcgga aaaaaaagaa gaacattatt cagtttgttc tgaacctgga ggaaaatacc 2040 aatttcattt cactaacaat tctgaagaaa gagtagagac tgcagctgag caaatagctg 2100 ataatcttta cgaatggatt atgaatcatg atgttggaga ctatttagtt gccattggtg 2160 cggactcaac aaatttaaac actggttgga aaggaggtgt tatccattac cttgaacata 2220 aattggataa aaaactaatg tggataattt gtgctcttca tacaaatgag ctacctctcc 2280 gacatcttat gattcagctt gacggaaaaa ctgcatctaa aaataacttt tctggttcta 2340 ttggaaaact tataggtaag gcaactagct tcaaagtcaa agatagcata aaaaaaattg 2400 aatttcctaa tgcaataata gaacttgact ctgaagttgt aaaatcgcta tcagatgatc 2460 aaaagtatct ttatcaaatt acccatgcaa ttaacgctgg ggtttttccg gaaaatctta 2520 aaggtaaaga aattgggcca cattcgcatg caagatggtt gaatttcgca aatagaatat 2580 gtttactttg gtgctccgaa attgtacttt cagttgaaga aacatataat cttcaacttc 2640 tggtggagtt tattgttgga gtttatgcgc cactttggtt tgaaatcaaa gtaaagtgga 2700 agtggactga agggcctaac catatactca agcaattaaa acttctttct tatcaaagac 2760 ttaaagttaa agacattgta atgccctatg ttttatcttc ttcatggaat gctcatagtg 2820 aaaatgttct tcaagcattg ttgtgtagtc tagaaaaagc agaaagacaa tttgcagttg 2880 aaacaatttt aaaattaaga ggagatttaa aatacggaaa cactaatgta agaagtagaa 2940 aacatcatat tcataatgaa aatgctgaat cactttttga tctaattgat tggagtttgg 3000 atgtacacga acctttactt acatgtttat tgtcaaagga tgaactttta aaatacaatg 3060 atattccgat gtgtgtacct tgttttcctg ttcatgggca agctattgaa cgatgtgtcc 3120 aagtagtcac aagagcttct caatcggtgt ttggaaaaga tagaagagat gggttcatca 3180 gagcaacatt agcacatcga gaaatcatgc ccgtaaacca atcaaaaaaa agatctattt 3240 aaattaattt aagcaatatg aatttttttg ttgttttgct ttggatttta tttatgatct 3300 caataaataa tttgagcttt aaatatattg tatcttttag ataacatttt atttaggaac 3360 aataaatttt taatccttca gaaagtttgt aaattattgt aggaactatc ttatttatta 3420 tctatagagc aaataaattt ttttcctctt ctaatatcct aaagcaaaca cggtatactt 3480 atgaatacgt ttttgtatga ctttatgaac agtagcaccc aaagttatgt tttattttaa 3540 tgtctatatt atgccttgtt gttttttgat gattgtatca ccataggctt atgaaaaaaa 3600 ggcagaaaag tgctaatttg atgcgttttt tgtaaaactg acatttttta aacgccctgc 3660 gcccacataa gatgaacttt ttaagaaaaa ataaattatt cacgtctttt ctaatatcct 3720 aatacataaa aaaaatattg cgcgcaatcc catttctaac atttatatat agatattttt 3780 ttgtaaaatc cataatggta gacaactttt catacgtcta ccgtgtttgt atagcattaa 3840 gaaaagtagc accaaatatt atgttttayt tttatgtcta tgttatgtct gattgttttt 3900 tgataattgc cttacctttg gctaatataa aaaaggccga aaaatgccaa tatgatgagt 3960 ttttcgtaaa actgacattt ttcaaacgcc ctgcgcccac actagataaa ctttttaaga 4020 aaaaaaaaat tattcacgtc tcttctaata tcctaataca taaaaaaatt attgcgcgcg 4080 ctcccatatc caaaatttta aaagtacgac ccacccta 4118 // ID DNA8-87_AP repbase; DNA; INV; 822 BP. XX AC . XX DT 26-AUG-2009 (Rel. 14.09, Created) DT 26-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-87_AP. XX NM DNA8-87_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-822 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 2023-2023 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 822 BP; 291 A; 144 C; 130 G; 255 T; 2 other; cagtggcgta atttggtcat gtttgtgggg ggggtagaca tttaaccccc cacaccaacc 60 accaaaaaaa aaaaattggt ttatatcata aaaacctctc agtccaacca ttgtgatctg 120 taatctgtat aatgtaagta gggtacatgt ataaatgcat gcatacatga aaataaataa 180 ttacatacta tttaataaag ttttaaatgt actttttcga gtagattcgt ggaataatct 240 gcgtagacgt aaaatactca aaattgcaaa cttcaaaagg ttataacctt ggaactaagt 300 ccgaccgaca aattctgatt gcaccattgt acttaactcg ttgaaatata tagatttccg 360 aaaaaaaatt tccaaaaatc gttcgagagc taaactgcat tttcnatgta gattcgtgga 420 ataatctgcg tatgcgtata atatacgcaa aatcccgaac ttcaaaangt tataaccttg 480 gaactaagtc cgaccgacaa attctgattg caccattgtg cttaactcgt tgaaatatac 540 ctattactat aggtttccga aaaaattccc aaaaatcgag ctaaactgca ttatttcaat 600 atatttgtgc aatattcaat ttgcaatagt gattagtgtt tattacgaag taatattatc 660 ttagttacag ctaccatagg ggatttgggt ttttatttgt tgtttctata aaatatacta 720 tattagtacc aataaataat aagtacataa atatggttcg ccagttaaaa aaaagtcatg 780 ggggatacga cacccaaatc cccccccgta attacgccac tg 822 // ID L2-5_NVi repbase; DNA; INV; 4321 BP. XX AC . XX DT 15-APR-2009 (Rel. 14.04, Created) DT 15-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like retrotransposon: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-5_NVi. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-4321 RA Bao W. and Jurka J.; RT "CR1-like elements from the parasitic wasp Nasonia vitripennis."; RL Repbase Reports 9(4), 755-755 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(439..1551,1503..3626,3630..4319) FT /product="L2-5_NVi_1p" FT /translation="GILPHSPVLPVRMNTPPARPRPRRRELNQQQRQQQQI FT HLPRARLERMCPKRINRHPHRLLAPEPNRLREHRLRLSRRPLRTSSRRWMT FT FGRSRMRPFRHKTPCLKGSLELRKGLALLQSRLKALDDLPALKTRLNNVES FT SISELQAQYQELSSRSPAEQQNISIAPASEEINNLRSELAEVKRRQERSAN FT NVVVISDLAYTQETSLQLLAFTVLAALDSTVLRRDVATVRTMGRLDATTTN FT AQGDSRFPPLAVTLSSSALARSIIVAKARKRKLHTSELDAALLEEARALCP FT DHQGLININELLPPDIHKLRIKARLEAKRRRGCRSFVRDGRIYIRSGEDSE FT RATIISTDAELEAFLAHTPPIANTDITHGFFSPYAANSQHGHHTLRHTPII FT PPKPPSKSSNSSTSASKASLSEGLRVCHFNANSLTGHIEMVRLFLSTRSPF FT HVIAVTETWLSDKITSIPSLDDYILYRRDRNRNGGGVALYIHHSLTASVIS FT SSDGEWSGKPGKPEYLFCEISAKGVSPIFVGVVYRPPHSPFIQGSNFIEQL FT TTHMHNYSTKVIMGDFNSDQLSSSEDANFIRAFIDENSLTSVPYGATHHRQ FT DSDTWLDLCLIDEQDCLLSHWKTDTPFINGHDLITATLDVQIPRHVPATYS FT YRNYKGICAEKLRDFLSACDWSSFTTSSLDECITILNANLTNAINHLAPLK FT TVTPGRKRHPWFTAALRDLLTERNRLYRRFRKSRLPWDLYFYRIARDDAHK FT QVEEARLNYYYSRLSTLTDVAEIWRELENLGITTSKSPSPARFSSDALNKH FT FSSISNDPQAPSVEEYLRTLESLDLPEHFIFRAITESDVSAAVSHFNTQAR FT GSDGIPQVVISKALPILAPLLCQIFNLSLSEARFPSAWKSSLVRALNKVNS FT PTALTDYRPISLLCFLSKALEWLVHKQISEYLETRLYLDNFQTGFRTGHST FT QSGLIKLTDDVRLGIDRKKVTLLLLFDFSKAFDTVCHVRLLGKLSSFGFSK FT QVIRWLASYLSGREQAVIGDNNELSTSLPLNTGVPQGSVLGPLLFALYIND FT IGFCLDSDVSHLIYADDLQIYSCHLEELDSCSVRMSANAERIMGWAALNRL FT KLNVLKTKAIVLGSPYYINALPSIANTFINIGGARVDYESSVRNLGLVLDS FT KLTWKEHVTQICKRAHSLMYRLYFFRKSTNFRLRQHLVQALLFPIIDYCSL FT AYCDLTQELDTKLQRLVNTGIRYIYGVRRDEHISPYRRELQWLTTAGRRKY FT FMACFLRKIFNTSVPTYILAFFDFHVALRPVRGEVTPLDIPSFKTETL" XX SQ Sequence 4321 BP; 1111 A; 1194 C; 960 G; 1055 T; 1 other; tgctgtaacc taacctacaa cctctcatgg cttttctcct tgctgcacgg cgtctatttc 60 tcctcaaagc gatccacgtg gagccacctg tcgggtacac ggaacaccac cagcaacatc 120 atcaatcaca ggtcggtcaa ttctcacctc tctcgattag caccccatct ggcttcgaga 180 gtattaagtc tctagctctg tcgataactt accacccgct actatcttac aacactctct 240 acgttatcga ctgccagtaa gttgtcgacg gagacttaac aataacatcg aaaatgagcg 300 gaattggtgt gaagaatgac gagtcgctcc tcatgtgtgc tcactgcaaa aaaacaaaaa 360 gagggaaaat gaagacctgc gagttgtgtg accggcagtt ccacaccata tgtgtcgcct 420 tcaggtctat aatggtgagg gatactgcca cactcgcctg tgctgcctgt gcggatgaat 480 acgccaccgg cacgcccaag accacgcaga cgagagctaa atcagcagca gcggcagcag 540 cagcaaatac atctgccaag ggcacgcctg gaacggatgt gcccaaaaag aataaatcgg 600 cacccgcatc gcttgctggc tccagaaccc aatcgcctgc gagaacacag gctacgttta 660 tcgagacgtc ccttaaggac atcttcgcgg cgctggatga cattcggaag atccagaatg 720 aggcccttca ggcacaaaac accgtgtctg aaagggtctc tcgaattgag aaaaggcttg 780 gccctcttgc aaagccgcct caaagccctt gacgacttgc ctgcgctcaa gactcgacta 840 aacaatgtcg agtcctccat ctctgagctg caggcacaat atcaagagct gtcgtcaagg 900 agtcccgcag agcagcagaa catcagcatt gctcctgcct ccgaggagat caacaacctt 960 cgcagcgagc tggctgaggt taagaggagg caggagcgtt cagcgaataa tgtggtggtg 1020 atctctgacc tggcctacac ccaggagaca tcactgcaac tcctggcttt taccgtcctg 1080 gctgcactgg attccacggt cctaaggcgg gatgttgcaa ccgtcagaac catggggaga 1140 cttgacgcaa ccaccaccaa cgcgcaaggt gacagcagat tcccaccttt agctgtcact 1200 ctctcttcca gtgcgctggc ccgctccatc atcgtggcca aagccaggaa acgcaagctt 1260 cacactagtg aactggatgc tgccctactg gaggaagcga gagctctctg cccagatcat 1320 cagggactta ttaacatcaa tgagctgctt cctccagaca tccacaagct gcgcatcaag 1380 gccaggctag aggcaaagag gagaagggga tgccgctcct tcgtgaggga cggaaggatc 1440 tacatccgct ctggcgaaga cagtgaacgg gccactatca tctctactga tgctgaactt 1500 gaggcttttt tagcccatac gccgccaata gccaacacgg acatcacaca ctaagacaca 1560 cacctattat cccaccaaaa ccaccatcta aatcatctaa ctcatcyaca tctgcttcta 1620 aagcgtctct gtctgagggt ctgagagtct gtcatttcaa tgcgaattcc cttacgggtc 1680 acattgagat ggtcaggctc ttcttgtcca ctcgttcccc ctttcacgta atagctgtaa 1740 ctgagacctg gctgagcgac aagataacat ccatcccttc actggatgat tacatattgt 1800 acagacggga cagaaacaga aacggtggag gtgtggccct ctacattcat cattcattga 1860 cagccagcgt tatctcgtca tctgacggcg aatggtcggg caagccgggt aagccggagt 1920 atcttttctg tgagatctca gctaagggag tatcgcccat cttcgtgggg gttgtgtatc 1980 gccctccgca ctcacccttt atccagggat ctaatttcat tgagcaacta acaacgcata 2040 tgcacaatta ctccacgaag gtcatcatgg gtgactttaa ttccgatcaa ctttcctcat 2100 ctgaagacgc caacttcatc agggccttca ttgatgagaa ctctcttacc tctgtgccat 2160 atggtgcaac ccatcacaga caggactctg acacctggct tgacctatgc ttaattgatg 2220 agcaggattg cctgctctca cactggaaga cggacactcc tttcatcaac ggacatgacc 2280 ttatcacggc cacactcgac gtacagattc cacgccacgt acctgctaca tactcttaca 2340 gaaactacaa aggaatctgt gctgagaagt taagggactt ccttagcgca tgtgactggt 2400 catccttcac cacgtcatca cttgacgaat gcattaccat tcttaacgct aacttaacga 2460 acgccataaa tcatctcgcc ccattaaaga ctgtgacacc aggacgtaaa cgtcacccgt 2520 ggttcaccgc ggctcttcgt gaccttttga ctgaaaggaa cagactctac aggcgttttc 2580 gcaaaagtcg tttaccttgg gatctatact tctatagaat cgcaagggac gacgctcata 2640 aacaggttga ggaggctagg ctgaattact actattcacg cttgtctacc ttgaccgacg 2700 ttgcagagat ctggagagag cttgaaaatc ttggcattac cacctctaaa tcaccttcac 2760 ctgctcggtt ctcttcagat gctctcaaca aacacttcag ttccatctct aatgaccctc 2820 aagctccatc tgttgaggag tatctgcgaa ccctggagag tctggacctc cctgaacact 2880 tcatcttcag ggctatcaca gagtcggatg tgtcggctgc tgtatcacac tttaacaccc 2940 aggccagggg aagcgatggc atcccacagg ttgtaatctc caaagcactg ccaatactcg 3000 ctcctctact atgtcaaata ttcaacctgt ctctgagcga ggcacgcttt ccctccgcct 3060 ggaagtcgtc gctcgtaaga gcactaaaca aagttaactc tccaacagct ttaactgact 3120 accgtccgat ctctctactc tgtttcctgt ccaaggcgct ggagtggctg gtgcacaagc 3180 aaatctctga atatcttgaa acaagacttt accttgacaa tttccaaact ggctttcgca 3240 ctggccacag cacgcagtct ggcttaatta agctgactga tgatgtcagg cttgggatag 3300 acaggaagaa agtcactctg ctacttctat ttgattttag caaggcgttt gatactgtgt 3360 gtcacgtcag gctactcgga aagctatcct ctttcggctt ttctaagcag gttatccgct 3420 ggcttgcatc ttacctctcg ggaagagaac aggccgtcat tggtgacaac aacgaactct 3480 ccacctctct accattaaac accggtgtcc cacaggggtc agttttaggc cctctgttgt 3540 tcgcgttgta catcaatgat attggcttct gtctagattc agatgtgtcc catctaatct 3600 atgcggatga cctgcagata tacagctaat gccaccttga ggagctcgat tcctgttctg 3660 tcaggatgag tgctaacgcc gaaaggataa tgggctgggc tgcactaaat cgacttaaac 3720 tgaatgtcct taagactaag gcaatcgtcc tgggttcccc ctactatata aatgcattac 3780 cttctattgc taacaccttt ataaatatag ggggagcccg ggtcgactat gaatcatctg 3840 tgcgcaatct ggggttggtg cttgactcca aacttacatg gaaagagcac gttacacaaa 3900 tatgtaaacg tgctcactct ctgatgtacc ggctctattt ttttcggaag agcactaact 3960 tcaggctgcg ccaacaccta gtgcaagcac tcctatttcc catcatcgac tattgctcac 4020 tggcttactg cgacctgaca caagaactgg acacgaaact gcagagactt gtcaacacgg 4080 ggatccgtta tatctacggt gtaaggaggg acgagcacat ctccccttac aggcgtgagt 4140 tgcaatggct taccaccgcc gggcgtagga aatacttcat ggcctgcttt ctaagaaaaa 4200 tttttaacac ttcagtgcct acctacatat tagccttttt tgatttccac gtcgcgctca 4260 ggcctgtgag gggtgaggtg acccccctgg atatcccgtc cttcaagacg gagacgctga 4320 a 4321 // ID CR1-35_HM repbase; DNA; INV; 5005 BP. XX AC . XX DT 17-DEC-2008 (Rel. 13.12, Created) DT 17-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-35_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-5005 RA Bao W. and Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1863-1863 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 80..793 FT /product="CR1-35_HM_1p" FT /translation="MAKFSATNIKDLLEIHENTLIKLFNDKFDKMEIKFNI FT IQEENNILKNEMKELRKAVDFVSEKYESTLAELKELKKNTISNVELTDNIK FT FNDKNNDVKDKLAELEDRSRRNNLRFNGVEESENESWEDSEKKIQEVLKSK FT FGFTNNLEIERAHRTGKKEKGVKKNRTIIVKFLNYKERNAVFEKFIKLKLW FT NEKLYVNEDFSERTMEIRKKLFKEAKELRAKGKFAKVIYNKLITRDF*" FT CDS 832..3927 FT /product="CR1-35_HM_2p" FT /translation="MDSNYQNNFDSLSFNFFQSNDLVLDTNSDPDLNYFCD FT VGALRNNCSYFYNKEIKEFISRDHLNVIHFNIRSLKKNFENLCTIIEETSN FT IFSIICLTETWCEFDDAKSNLNLHLPGFNMIPLARKANKRGGGVLFYIKEN FT LRFFIRPEMSISDDNKEILTIELLTNNSKNILISCCYRPPSGHIESFNAFL FT CNDILKNSFKEKKLNYLIGDVNLNCFEYHVNSNIKKFYDDLFEHGSIPLIN FT KPTRVTSTSATLIDNIITTDIFNETLKKGIIKSDVSDHFPIFFSINIDNKL FT IQQHKKVIIKRFFTDANLASFKDQLSLINWNHINSSFEANEVYDTFFQTFY FT EIYDVNFPKREFIKTNKSLKSPWVNKDLRKSSKIKQKLYIKYLKKPTLENK FT IIYKNYAKMFESQRKKFKKRYYYDLLEKYKHNSKRTWQIISQITGNSKLKL FT CSIPNTIKVDGIFLYEPKQIARELNKYFVSVGPNLAKNIPNMINPINDYVF FT PLISSLNKFEVSSSEFESAFKMLKTNKAVGPDDINCNIVIDSYDTIKHILF FT KVFKCSINQGIFPDQLKIAKITPVFKGGEPSNVSNYRPISILSVFSKILER FT ILFNQISNHLAINNILYINQYGFKRNNSTEHAIVHLTRSITDSFEKSEFTL FT GVFIDLSKAFDTIDHEILFKKLEHYGIIGNALKLLKSYLKHRKQFVYIDGS FT SSQDYLNIDYLNITCGVPQGSILGPLLFLIYVNDLHEVSKLTTIMFADDTN FT LFLSNNNIDKLFYDMNIELIKISDWFKLNKLSLNIDKTKWIFFHPRFKKHL FT LPSNMPLLHIDNIQIKRVVFTNFLGIIVDENLSWKNHIGHLCNKIAKSIGV FT LYKARSVLNKHSLTQLYYSFIHCHINYANIAWGSTHKSKLKPLYCQQKHIA FT RIINFKNRFTHSRSLLFNMNVFNIYQVNVYNVLCFMFKCKTNSSPASFFDL FT YSLKEKNKYSLRNDNFIKQPKFQTNFGKFCISFRGAFLWNQIILKNFDFSH FT EWNYYMFKRKLKKVIFSIENIFIYF*" XX SQ Sequence 5005 BP; 1971 A; 658 C; 636 G; 1740 T; 0 other; ttacgtttca accgcgatct tgaacggacg tattatttta tcgaagaaaa aaaaaaaaaa 60 aaaaaaaaaa aaaaaaaaaa tggcgaaatt ttcagcaacc aatataaagg atttactgga 120 gattcatgaa aacacattaa taaaactttt taatgataaa ttcgataaaa tggaaattaa 180 atttaatatt atacaagaag aaaataatat tcttaaaaat gaaatgaagg aattaagaaa 240 ggcagtcgac tttgtaagtg agaagtatga atcaactcta gccgaattga aagaactgaa 300 aaaaaacaca atttcaaatg tcgaattaac agataacata aagtttaatg acaaaaacaa 360 tgatgtgaaa gacaagcttg ccgaacttga agatcgaagt cgcagaaata atcttaggtt 420 taatggagtc gaagaaagcg agaacgaatc atgggaagac agcgagaaaa aaattcagga 480 agtcttaaaa tccaagtttg gttttactaa taatttagag attgaaaggg cacatcgaac 540 aggaaaaaaa gaaaaaggag ttaaaaagaa cagaacaata attgtaaaat ttcttaatta 600 caaagaaaga aacgcagttt ttgaaaaatt tatcaaatta aaactttgga atgaaaaact 660 atatgttaat gaagatttta gcgaaagaac catggaaatt aggaaaaagt tgtttaaaga 720 agcgaaggaa ttaagagcca aaggtaaatt tgctaaagtt atttacaaca aacttattac 780 gcgcgatttt taaaataaat tcttttattt ctagttattt aacttttaaa aatggattct 840 aattatcaaa acaattttga ttcgttatca tttaactttt ttcaatctaa cgacttggtg 900 cttgacacta attcagatcc ggatttaaac tatttttgtg atgtaggagc tttgcgaaac 960 aattgttcct acttctataa caaagaaata aaagaattta tttctcggga tcatttaaat 1020 gttattcatt ttaacattag aagtcttaaa aaaaactttg aaaatttgtg tactataata 1080 gaggaaactt caaatatatt tagtataatt tgcttaactg aaacgtggtg tgaatttgat 1140 gacgctaaat ctaatttaaa tctccatctt ccaggtttta atatgatccc actagcgcgt 1200 aaagcaaata aacgcggagg tggtgtactt ttttatataa aagaaaattt gaggtttttt 1260 attaggcctg agatgagtat ttctgatgac aataaagaaa ttttaacaat tgaactttta 1320 accaataatt ctaaaaacat acttataagc tgctgttatc gcccaccatc tggccatatt 1380 gaaagtttta atgcattttt gtgtaatgat attttaaaaa atagttttaa ggaaaagaaa 1440 ctaaattatc taattggtga cgtaaattta aattgttttg aatatcacgt taatagcaac 1500 attaaaaagt tttacgatga cttattcgaa catggatcaa ttccactaat taacaaacct 1560 actagagtaa cctcaacttc agctacctta attgacaaca ttataacaac tgatatcttc 1620 aatgaaactc ttaaaaaagg tataataaaa agtgacgttt ccgaccactt ccctattttt 1680 ttctctatta atattgacaa caaattaata caacaacata aaaaagtcat tatcaaacgc 1740 tttttcaccg atgctaatct tgcgtcgttt aaggatcaat tatctttaat aaattggaat 1800 catataaaca gttcgtttga agctaatgaa gtatacgata cattctttca aactttctat 1860 gaaatatatg atgtaaattt tccaaaacgt gagtttatta aaacaaataa aagtttaaag 1920 tcaccatggg tcaataaaga tcttaggaaa tcctcaaaaa taaagcaaaa actgtatata 1980 aagtacttaa aaaaaccaac actagaaaat aaaattattt ataaaaatta tgctaaaatg 2040 tttgaaagtc aaagaaaaaa atttaaaaaa agatattact atgatttact agaaaaatat 2100 aaacataatt caaaacgcac atggcaaatt ataagtcaaa ttactggaaa tagtaaatta 2160 aaattatgct ctatacctaa cactataaaa gttgatggta tatttttata tgaacccaaa 2220 caaattgcaa gagaattaaa taaatatttc gtgtctgttg gcccaaactt agctaaaaat 2280 attccaaata tgattaatcc aataaatgat tacgtgtttc cattaatctc tagtttaaat 2340 aaatttgaag tatcttcttc agaatttgaa agtgctttta aaatgcttaa aactaataaa 2400 gcagttggtc ctgatgatat aaattgtaat atagtcatag attcatacga taccataaaa 2460 catattcttt ttaaagtttt taaatgttct attaatcaag gaattttccc cgatcaattg 2520 aaaatagcca aaataacgcc tgtatttaaa ggaggtgaac cgtctaatgt cagtaattat 2580 cgaccaatat ctatcctctc tgttttttca aaaattttag aaagaatttt gtttaatcaa 2640 atatctaatc atcttgctat taataatata ctttacataa accaatatgg ttttaaaaga 2700 aataactcta ctgaacatgc aatagttcat cttactcgta gtataactga ttctttcgaa 2760 aagtcagagt tcactttagg cgtctttata gacttatcta aggctttcga cacaattgac 2820 catgaaatac tgtttaaaaa acttgaacat tatggtatca tcggtaatgc tttaaaacta 2880 ctaaaaagct atttaaaaca tcgcaaacaa tttgtctata tcgacggatc ttcttcacaa 2940 gattatttaa atatagatta tctaaatata acctgtggag ttccacaggg atctatacta 3000 gggcccttat tatttctaat ttatgttaac gatctccatg aagtctcaaa attaacaact 3060 ataatgtttg ctgatgatac aaacctattc ctatcaaata ataacatcga caaacttttt 3120 tatgatatga acattgaact aataaaaata tctgactggt ttaagttaaa taaactttca 3180 ctcaacattg ataaaactaa atggattttc tttcatccac gatttaaaaa acatttactt 3240 ccaagtaata tgcctcttct tcatattgat aatattcaaa taaaaagagt agttttcaca 3300 aattttctag gtatcattgt tgatgaaaat ctatcatgga agaatcacat cggacattta 3360 tgcaacaaaa ttgcaaaaag cattggagtt ttatataaag caagaagcgt tttaaacaaa 3420 cattcactaa ctcaactata ttactctttc atacactgtc atattaatta tgctaatatt 3480 gcatggggta gtacccataa aagtaaatta aagccacttt attgtcagca gaagcatatt 3540 gcacgtatta taaatttcaa aaatcgtttt actcactcaa ggtcactttt atttaacatg 3600 aatgttttta atatatatca agtaaatgtt tataatgttc tttgttttat gtttaaatgt 3660 aaaaccaact cttctccagc ttcctttttt gatttatatt ctttaaaaga aaaaaacaaa 3720 tattctttaa gaaatgataa ttttattaaa caaccaaaat ttcaaacaaa ctttggtaaa 3780 ttttgcattt cttttcgtgg agctttttta tggaaccaaa taattttaaa aaattttgat 3840 ttttctcatg aatggaatta ttatatgttc aaacgaaaac taaaaaaagt aatattctca 3900 attgaaaata tatttatata cttttaattt tacccagttt tgtcaacact catttatatt 3960 ttacttttat acaatattga tttaaagttt atgtcagaaa ctgtagttaa acatcattat 4020 ttgaattttg ttaaaagttc cactgacatt atttgtacag taggcattta ttatttactt 4080 gtttacttta atttttattt tatttcgtaa ttttgtaaat aatttgtatt acgaacggta 4140 tatactttta atattgaatt taattttgta atgaaaaaaa atatatatat atatatatat 4200 atatataaat atatatatat atatatatat atatatatat atatatatat atatatatat 4260 atatatatat atattattat taaaaaaaaa caaaaaaaaa aaaaaataca aaaaaaaaaa 4320 atttgatgta tgtgactgga tgtatctgca aacattgtat gtatgatact tttggaagtt 4380 aatcatcctt tgacagtggt ttagcatttt tgtcggcgaa agcgttaacg taaatgcacc 4440 aacatttgat aattatggct tctcacacaa tgacaccatc atttggctat ctaaattctt 4500 ctacacaatc ttttggacaa tcttgaagtg catttggatg tacctagtgc aatcatattc 4560 ttattatttt cgtttattta ttaattaatt aacaattaag atcagtgtaa ctgatagatt 4620 tgagtaacaa gaatcagtga tcgagtcaga tgcgagacgt atttatttat taagaatact 4680 atgatattat tgttttcgct tgaatcatcc ggttcagaag aattagtact ttaattccag 4740 ccataaagac cataacttaa aaaagttaaa aacttgtgtt tatttatttt tgtttttcct 4800 aaaatctaaa ctaagatcaa agttcttcaa cagcggttct cggtgacaag acctgtatgg 4860 tcttctttga gtatccgcgt tctttatctt aattctttat ttttttgtaa cgattttttt 4920 tgcatgtaaa tttttcttat tatatatttg taaatattgt attaaatatt gtaataaaga 4980 acaaaaaaaa aaaaaaaaaa aaaaa 5005 // ID Gypsy-160_AA-LTR repbase; DNA; INV; 1628 BP. XX AC AAGE02018302; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-160_AA_; KW Gypsy-160_AA-I; Gypsy-160_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-1628 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018302; Positions 18450 20077. XX SQ Sequence 1628 BP; 492 A; 360 C; 419 G; 357 T; 0 other; tgttacgtta gcttttagtg cgtaaagata aggtttagtt tgacaaaaat aaacctcaag 60 tatggaacaa atatagagct tatagaaaat ctcaagtgaa atcatacgaa cgtgcataaa 120 cctcgactcg caaagcaaga aacccctcgc aaaacgctca acgaaacgca aaacaacggg 180 aagaacttca acgctctcgc acactacagc caaccaaaag agatcgaatc gatctactgt 240 agcgtgatca tcattcgata gtacgtcatt gccaagaacg gatggtgcct ctggtccggt 300 ggtgaatagg agcaatagaa catcacaggt aaccgaaaaa aagaaagata tatcgacagc 360 agataacacc cttttgatca tgacaacatc acctacagac gctccagtgc aacgacgcaa 420 tctacgcaga ttgcgctgtc gaaccagacc ggactcatct ccatccaaga acagagggat 480 cacaattgcg agcacgtgca tattcaatcg ctaagcatag taagtggtac acatagagcc 540 aatggattca gcacactgtg atccatctct caagggaatc tttctttctc tctaccatag 600 ctatcgatcc ctccaaccgc acaagaaagc tagttcggcg aatcacgtgt gaagccaaca 660 tacgaaggtg caaactcatc cgaaaactag agaagagccc cggcaagggt gagttgatat 720 ggaagtgatc attcattcag ggttctaatg ttcagtgtcc attagggctc ttttgcgttg 780 ttcatcattc cagcggaaaa cagagagacg tagtctggcc cactgcgacg cttacggtgt 840 gtgcgcttcg agccaggaag agagcgcctt gatctccaac acccatagca aagcaattaa 900 cagtcgcccg gcgcctcgtg tccgaatgta gtgtggctcc gttactcaag aagcacggaa 960 ggagggtgga agagacaata tttctgcaaa gtggggcgtc agctgtgaag aagagcccat 1020 agtgcaagtg gctgatcacc attgaaaatc gttgcttcgt tcggagaaga taattgggag 1080 agttgcagaa agaaaatttg agtgtggtgg agtgacatcg gtggtgtgcc agtgggtgaa 1140 gaacgtgccg caaagcggtg cttactcaga agaggcaagg agaaagtcaa ccgtaccaag 1200 tcgatcgttg gtacgcggag agaagttggt gtgagacgag agacggttgc gttccatcca 1260 gcacgccgct ccgacgtcga gagaagtgag tggcacggtg gttgacgtcg ctgagaagtt 1320 aatcagagcg acgccgcgag gacgagccga ggcaatgcga gttcgagggt tgcgttcgtc 1380 atcagcgtga gttccgtgag cccgtgagta ccaagagaaa atgaacttgt acatttaact 1440 ttaagcaccc agagagctag atagggccct aaggggcatt cagggcaaga tgaacaagat 1500 aacaaaataa aatcaaatgt cgtaatttta ggctgttgct tgcgtaattt gttggtgcga 1560 aattccctga attgagcttt gttctggtgg cctagccttg attccaccat aggacatatt 1620 tcgtaaca 1628 // ID Copia-11_CQ-LTR repbase; DNA; INV; 302 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_CQ_; KW Copia-11_CQ-I; Copia-11_CQ-LTR. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-302 RA Jurka J.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 338-338 (2011). XX DR [2] (Consensus) XX SQ Sequence 302 BP; 116 A; 53 C; 50 G; 83 T; 0 other; tgttgaaata aacacgcgaa gatcaagcaa aacatcttgt ttataaacac aaaagtcaat 60 tcgacattga cagttcaatt aggtttacac agtccgttga attaggtcaa cacatgaaaa 120 tacaaaacat ttatagggtt ccggctaggg ccagtgcaac cagttctaac agaatgttat 180 tcttatgaag tgttgctagt tgaattaata accatcgata acgtacatag tacagcttta 240 gttgtaagat acaccgagaa ataaagacga caagttttat caaacaaact caaatcttat 300 ca 302 // ID REX1-1_BF repbase; DNA; INV; 3128 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus REX1-1_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Rex1; Non-LTR Retrotransposon; Transposable Element; REX1-1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3128 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3128 RA Kapitonov V. and Jurka J.; RT "Young families of REX1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1698-1698 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 47..2935 FT /product="REX1-1_BF_1p" FT /translation="IMCYANISNVRTRGNRGGLARKQKHCRIKSLVTNRLD FT VYANDYVSRRCVVNNRVLRKITTVPATHGRSTQQNFRTLPCLFLSNVRSLV FT SKKCELEATLAYHSAQLACITETWTVPEIPDDVLAIDGFRLFRKDRLYGKG FT GGVALYIAEDIPVKRLYDLEIDGVEVMWLQARPKWLPRTVPILFVGIVYHP FT PCNNNKEGATKMVEHLITSCDCLYRKQPHAGLLLCGDFNNLPIQKLITVHP FT HLKQVVTQATRGSATLDLILTNLAQYYCTPRTLPPIGSSDHSSVLLPPCVE FT HPTRSKPVKVMRRVATTVTKLNLGLSLAMTDWSPVYEAALVEEKVKTFYSL FT TMTLVNQHLPVKVSATRACDKKWMTEGVKRAIRKRAEEFKRHGKSARWKSL FT RNNVQTSIRHAKNWHYRNFIQTLKQEDPRKWWGAVNRELGRAQERSNSTTI FT EDVPDHEVAEVLNQYFSSAWCPGTSLHLFPLQCPTPCVDLCSIGEVKTLLK FT GLNPHKASGPDDLPTWTLKHYADDLAPVITHLFNASYEEGVVPSIWKAANV FT VPVPKSKGAANASEMRPVSLLPVAAKLMERCILKRLLPSITPAIRNQYAYL FT KGSSTVLAAIRMVHTWLSALDSRRHAAVLALFADMSKAFDRVNHSILLQRV FT NDVVTNPRMVAWIQNYLQGRTQRVVANGKVSEWRVLTSGVPQGGVLSPYLF FT LLFMSTRDVVYSDTLDVGYADDVGLSRSISLEKDRVDNSMREEALQLDSWA FT ECNDMLLNGKKSQSLLICFSRNIPLLPPLSLGGEPVPFSRVAKGLGFIFDC FT KLSWHDHVQSLVSKASSRLHYLRLLTKQGMCVADLVQIYLSLIRSVLEYGH FT VLLVGCSKEQSDSMERVQKRALRIISLGGRRSVPNLPSLKERREAAAVKLF FT KDMLRPEHPLHDLVPPQRRTVTGRQLRNKNAFTLPKARTDRLKQSFLHTAV FT RLYNDSIS" XX SQ Sequence 3128 BP; 837 A; 754 C; 746 G; 791 T; 0 other; atttgttgac gatctttcta agcagatatc acccgtctag atctgaatca tgtgttatgc 60 aaacatttca aatgtaagaa caagagggaa ccgcggtggg ctggctagga aacagaaaca 120 ttgtagaatt aaatcattgg ttacaaatag acttgatgta tatgcaaatg actatgtaag 180 tcgcagatgt gtcgtaaaca atagagtact tcgtaagata acaacagtcc ccgccactca 240 tggcaggtca acacaacaga actttaggac tcttccatgc ctgttcctgt ctaatgtacg 300 atcacttgtt agtaagaaat gtgagttaga ggctactctg gcctatcact ctgcacaact 360 ggcatgtatc actgagacat ggactgtacc tgaaatccct gatgacgttt tagccatcga 420 tggttttagg ctgttcagaa aggataggtt atatggtaaa ggtgggggtg tggctcttta 480 catcgccgaa gacatacctg tcaaaagact gtacgacctc gagatagacg gagtggaggt 540 tatgtggcta caggctcgtc caaagtggct gccgagaact gttccgattc tttttgtggg 600 tatagtgtat catcctccat gcaacaataa caaagagggg gctactaaga tggttgaaca 660 tctcattacc tcctgtgact gtttatatcg taaacaacca catgctggac tgttgctgtg 720 tggcgacttc aacaatctac caattcagaa gcttatcaca gtgcatcctc accttaagca 780 agtggtgact caggccacgc ggggctcggc tacattagac ttgattctaa ccaacttagc 840 tcaatactac tgtacacccc gtaccctgcc acccatcggt tccagcgacc actcctctgt 900 tcttttgcca ccttgtgtag aacatccgac cagaagtaaa ccagtcaagg tcatgcgcag 960 agtggcaact acagtaacga agttgaacct cggtctgtcc ctcgccatga cggactggtc 1020 tcctgtttat gaagcagcct tggtcgaaga gaaagtcaag accttctaca gtttgaccat 1080 gaccttggtg aatcagcatc ttcctgtaaa agtgtcggcg acaagagcct gtgacaagaa 1140 gtggatgacc gagggcgtca aacgcgctat caggaaaagg gctgaggagt tcaagcggca 1200 tggcaaatct gctcggtgga aatctctcag aaacaacgtt caaaccagca tcaggcatgc 1260 caagaactgg cactacagaa acttcataca aaccctcaaa caggaagacc cacgtaagtg 1320 gtggggcgct gttaaccgtg agcttggcag agcacaagag agatctaata gtaccaccat 1380 tgaggacgtg ccagaccacg aggtggccga ggttctaaat cagtactttt cctcagcgtg 1440 gtgtcctgga acgtctttgc acctgttccc cctgcagtgt ccgactccat gcgttgactt 1500 gtgttcgatt ggtgaggtga agactcttct taagggctta aaccctcaca aggcaagcgg 1560 tcctgatgat ctcccaacat ggactctcaa gcattacgct gacgacttgg cgccagtcat 1620 cacccacctg tttaacgctt cgtatgagga gggggttgtt cctagcatat ggaaggcggc 1680 aaacgttgtc cctgtgccaa aatctaaggg tgcagctaac gccagtgaga tgaggcccgt 1740 gtccctctta cccgtagcgg ccaagctcat ggaacggtgt attctgaaaa gactattgcc 1800 ctctattaca cctgccatta gaaaccagta cgcttacctg aagggttctt ccactgtctt 1860 agcggccata aggatggtac acacgtggct gtcagcctta gactcaagac gtcacgcagc 1920 agtccttgct ttgttcgcag acatgagcaa agcatttgat agagtgaacc attcaattct 1980 cctccaacga gtgaacgacg tggtgaccaa tccacgcatg gtagcctgga tacagaacta 2040 tctgcagggc cgtacccaga gagtggttgc taacggtaaa gtcagcgagt ggagagtact 2100 cacttcgggg gtacctcagg gaggtgtact gtcaccctac cttttcctcc tgttcatgag 2160 tactcgagac gtagtctaca gcgacactct tgatgtcggc tacgcggacg acgtgggttt 2220 gtccaggtct atctccctgg agaaggatcg cgtggataac agtatgagag aggaagcgct 2280 acagcttgat tcctgggcgg agtgcaatga catgctgcta aacggcaaaa aaagtcagtc 2340 tctgttgatc tgcttcagca gaaacattcc acttttaccg ccactctctc tcggcggtga 2400 acctgtacca ttctcaaggg tcgcaaaggg ccttgggttc atcttcgact gcaagctgtc 2460 atggcacgat catgtacaat ccctcgtttc aaaagcatcg agcaggctgc actatctcag 2520 actactgact aaacagggga tgtgcgtggc tgacctggtt cagatttacc tgtcacttat 2580 tcgtagcgtt cttgaatacg gtcacgttct gctggtaggt tgcagcaaag aacaatcgga 2640 cagtatggaa cgtgtacaga aacgggccct ccgtatcatc tctctcgggg gtaggcgatc 2700 agttcccaac ttaccttcct taaaggaaag gagagaagct gctgcagtaa agctctttaa 2760 ggacatgtta agacccgaac atccactgca cgaccttgtg cctcctcaga gacgcacagt 2820 tactggcagg caactgagga acaagaacgc gttcactctc cccaaggcaa gaactgacag 2880 actgaaacag tcgttcctcc acactgcagt cagactttat aatgactcta tttcctagca 2940 ttaccatcag cctgtgagtt cggatctatg cttagtgttt gttaatatat tatgtatttt 3000 tgtttgttga cattctgcac gtataagttt ctttgtaaca aacctacgca attcagggta 3060 taccatgtgt gtatttccct gcctatgtac gtttgaggaa gaaataaata aaataaataa 3120 ataaataa 3128 // ID Gypsy1-I_DV repbase; DNA; INV; 4946 BP. XX AC scaffold_12963; XX DT 15-OCT-2009 (Rel. 14.12, Created) DT 15-OCT-2009 (Rel. 14.12, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy1_DV; KW Gypsy1-LTR_DV; Gypsy1-I_DV. XX OS Drosophila virilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; virilis group. XX RN [1] RP 1-4946 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(12), 3094-3094 (2009). XX DR Genome; scaffold_12963; Positions 18184979 18180034. XX CC Positions [2429-2935] - Reverse transcriptase CC Positions [4001-4477] - Integrase core CC 'ATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 47..4864 FT /product="Gypsy1-I_DV_1p" FT /translation="MENININDVSIATLKSWLALLNLPTEGTKTELMARLN FT KVPVDIRDDAAKELEVQRNKEQTIEAQNELQNIMQQQRDEIANGAEMLKLM FT RLEIEASRKFLEEFQVTVNRNSHIGGSDGGEQESEVGDLLQLEDGHTEERP FT RSTDGNAGAADYTGTDGNAGAADHAGAAGNAGAAGRIDESGNAVGGNLNNC FT IQTGAMMLAKEILLEFTGESEVRKWVMQFFNVAKIYRLNDMQQHLLCISKL FT KGGALKWLHADPMRIIAPIDEMLNQLVLAFGGGFSKSELRQKFEDRVWKPD FT EVFATYFSEKSILAQDINIDVEELMEGIIRGIPCENLRTQASMHCFTNPAQ FT ILRAFAAIKLPIKRVRNHVVKQTAQEAQADKQQRCYNCNVKGHWAKDCLKP FT KREAGSCYACGSKDHLIAGCPNKKYDMENKYVRYLRIHFSNSFNKSIIAEC FT LIDSGSPISFLKEKFVPLKVKRMPDANSYVGLNESKLKNFGKILCFMKKKL FT INVYFNVIIVANESMRYDAVLGRDFMDSFGLNIDLRTLKIVNGHKVDHQNN FT GKNDIIATSYKEKQKKVDDEDIKHEQNETILLNDKIVNKEIVSNETVSDEL FT VSEKIVSPETLGDKVGSDEIVRNKIISGEIVRNEVIGIETVSDEIVSYGEL FT KLEPDSEEVVSDEIIYNQTGIEKVIIEDIGNSAVKINNHVNEAPVNSEIMG FT VLEENFEKDILTISWEDSNKRENIYILSEDIDYSTKCEFENLFENTYVKAK FT RPNQPKLRHELSIRLENDKVFNCTPRRLSYMEKEKLQELLDDYLKDGIIRP FT SCSEYASPIVLVKKKTGDMRLCIDFRRLNKIICRDNYPLPLIDDLLDRLCG FT KTVFSKLDLKNGYFHVFVNEQSIKYTSFVTPLGQYEFLRMPMGLKTAPHVF FT QRFVNNIFCDMIKDNKVIVYMDDIMIASTNIKEHMETLKEVFIRLVQNKLE FT LKLDKCEFFKSSVKYLGFVVDSKGVRADEKGLEAVKSFPIPNKIQGVQSFL FT GLCSYFRRFILNFSILAKPLYDLLKKDKKFKFGEEEFKCFNMLKDKLLEAP FT VLSIYNYKEEIELHCDASAVGFGAVLFQKKEDRKLHPIFYFSKRTTEVESK FT YHSFELETLAIIYALRRFRIYLQGKRFKIMTDCNSLTLTLNKIDLNPRIAR FT WALELQNYDFELVHRSGKQMQHVDALSRCYEDILVIESNSFEDNLVICQSK FT DDKLVDIRKMLEKEEHKLFEMRNGVIYRKTNDGRLVFYVPKDMEEHVLYKY FT HDELGHAGRDKMIDAIGKSYWFPYIKEKAIKHIGNCLKCIAFAPKTGKSEG FT LLHSIPKGNKPWEMIHIDHYGPINAGRANKYILAVVDGFSKFVRLYTAKTT FT STKEAIKALKDYFRAYSRPKFIVSDRGSCFTAKEFDEFLKNEGVTHIKIAT FT GCPQANGQVERLNRDLGPMMAKMAEPENGLHWDVIIENVEYAINNTQHKSI FT KKFPSEMLFGLQQKGKVVDELKEKLEEFTQVNTCDKRNLENIRKSGEIYQK FT EAQIRNEERINKKKKVSVEYKIGDYVMVKNFDSTPGVSKKLIPRYKGPYSI FT EKILKNDRYLLKDVEGFQLSRNPYVGIWNSQNFKHWMRK" XX SQ Sequence 4946 BP; 1869 A; 648 C; 1107 G; 1322 T; 0 other; atatcagaag tgggattgac tcagaaaaaa aaaaaaacac aacaaaatgg aaaatataaa 60 tataaatgac gtgtcaatag cgacattgaa aagttggttg gcattactaa atttgccaac 120 agagggtacc aaaactgagc tgatggcgag attaaataag gtaccagtgg atatccgaga 180 cgatgctgca aaggaacttg aagttcaacg taacaaggaa cagacaattg aggctcaaaa 240 tgagttgcaa aacataatgc aacaacagcg tgacgaaata gcaaatggcg ccgagatgct 300 gaaattgatg cgtctcgaaa ttgaagcgtc tcgaaaattc ctagaagaat ttcaagtaac 360 ggtgaaccgc aactcgcaca tcggcgggag cgacggggga gagcaagaaa gtgaagtcgg 420 cgatttattg cagctagagg atggtcacac tgaagaaaga ccacggtcga cggatggcaa 480 cgctggtgca gctgattaca ctggaacgga tggcaacgct ggtgcagctg atcacgctgg 540 tgcagcgggc aacgctgggg cagctggaag gatcgacgaa agtggcaacg ctgtcggagg 600 caacttaaat aattgcatcc aaactggagc gatgatgtta gccaaggaaa ttttattaga 660 atttactgga gaaagtgagg tgcgtaaatg ggtaatgcaa ttcttcaatg tggccaagat 720 ctacagacta aatgatatgc aacagcactt gctttgtata agcaaattga aaggcggtgc 780 tttgaagtgg ttgcatgcag accctatgcg catcattgct ccgattgacg agatgctaaa 840 tcaattggtt ttggccttcg ggggaggatt ttcgaagtcg gaactacgac agaagttcga 900 ggatcgggtt tggaaaccag atgaggtgtt cgccacatat tttagcgaaa aaagcatatt 960 ggcacaggac atcaacattg atgtagagga gttaatggag ggtattattc gaggcatacc 1020 ttgcgaaaac ttgcgcactc aagctagtat gcattgcttt accaatccgg ctcaaatttt 1080 acgtgcgttt gcagctataa agttgccaat taaacgggta cgaaaccatg tagtgaaaca 1140 aactgcacag gaagcacagg cggacaaaca acaacgttgc tacaattgca atgttaaggg 1200 ccattgggcc aaagattgtt taaagcccaa acgggaggca ggatcttgct atgcgtgcgg 1260 ctcaaaggat catttaatag caggatgtcc aaataaaaag tatgatatgg aaaacaaata 1320 tgtaagatac ttaagaattc attttagtaa ctcatttaac aaatcaataa ttgcagaatg 1380 ccttatagac tctggcagcc ctatttcatt tttaaaggaa aagtttgtgc cattaaaagt 1440 aaagcgtatg ccagatgcaa attcgtatgt aggtttaaat gaaagtaaat tgaaaaattt 1500 tggaaaaatt ttatgtttta tgaagaaaaa gttaataaat gtttatttta atgttataat 1560 tgtggcaaat gagtctatga ggtatgacgc tgtgcttggt agagatttta tggattcttt 1620 tggtttaaat attgatttaa gaacattaaa aattgtgaac gggcataaag ttgatcacca 1680 gaataatggt aaaaatgata tcatagctac aagctacaaa gaaaaacaaa aaaaagtgga 1740 cgatgaagat atcaaacacg agcaaaatga aactatattg ttaaatgata aaatcgttaa 1800 taaggaaata gtcagtaatg aaactgttag tgatgaatta gttagtgaga aaattgttag 1860 tccggaaacg cttggtgata aagttggtag tgatgagata gttagaaata aaataattag 1920 tggtgaaata gttagaaatg aagtgattgg gatcgaaact gttagtgatg agatagttag 1980 ttatggagag cttaagctcg aacctgatag tgaagaagtt gttagtgatg aaattatcta 2040 taatcaaacc gggattgaaa aagtgattat agaagatata ggaaacagtg cagttaaaat 2100 taataatcat gttaatgaag cgccagtgaa tagcgaaatc atgggagtgt tagaagagaa 2160 ttttgagaag gatatcttaa caataagttg ggaagattct aacaaaagag aaaatatata 2220 tattttaagt gaagatattg attacagcac aaagtgtgaa tttgaaaatt tatttgaaaa 2280 tacttatgta aaagcgaaaa ggccaaatca gccaaaatta agacatgagt taagcataag 2340 attagaaaac gacaaagtat ttaattgtac acctaggcgg ttatcttata tggaaaaaga 2400 aaagttgcaa gaattattag acgactattt aaaagatgga atcattaggc caagttgttc 2460 agagtatgca tcacctatag tactagtgaa gaaaaaaact ggagacatga ggttatgtat 2520 tgattttaga agactgaaca aaattatatg tagagacaac tacccgctac cattgatcga 2580 cgatttatta gacagactat gcggaaaaac agtattttca aagttggatc taaaaaacgg 2640 gtattttcat gtttttgtta atgagcaatc tattaaatac acttcatttg ttaccccttt 2700 aggacaatat gaatttctta ggatgcccat gggtttaaaa actgcgccac atgtatttca 2760 aagatttgtt aataacatat tttgtgatat gataaaagat aataaggtaa ttgtttacat 2820 ggatgatatt atgatagcaa gtacaaacat aaaggaacac atggagacct taaaggaagt 2880 gtttataaga ctggtgcaaa acaaattaga gttgaaatta gataagtgtg aattttttaa 2940 aagcagcgtt aaatatttag gatttgtggt ggatagtaaa ggcgttaggg ctgatgaaaa 3000 agggttagaa gcggttaaat cctttccaat accaaacaaa attcagggag tacaaagttt 3060 cctagggtta tgctcctact tcaggaggtt cattttaaac ttttctatat tagctaaacc 3120 cttatatgat ttattgaaaa aagataagaa attcaagttt ggagaagagg agtttaagtg 3180 ttttaatatg ttaaaagata aattgttaga ggcgccagta ctatcaatat ataattacaa 3240 agaagaaata gaactacatt gtgatgccag cgcagtggga tttggggctg ttttatttca 3300 aaaaaaagaa gataggaaat tacacccgat tttttatttt tctaaaagaa caaccgaagt 3360 agagtctaaa tatcatagtt ttgagttaga aacgctagcc atcatttacg cattacgcag 3420 atttcgtata tatctgcagg gaaaacgatt taaaataatg acagattgca actctttaac 3480 tttaacgctt aacaaaatag atttaaatcc gagaattgct agatgggcgt tagaactaca 3540 aaattacgat tttgagttag tgcatagatc gggcaagcaa atgcaacatg tggatgcgtt 3600 aagcagatgt tatgaggaca ttttagttat tgaatccaat agttttgagg acaatttagt 3660 tatttgccaa agcaaagatg acaaactggt agatattaga aagatgttag aaaaagaaga 3720 gcacaagcta ttcgaaatga gaaatggggt catttacaga aaaacaaatg atggcagatt 3780 agttttttat gtaccaaagg atatggaaga acatgtactt tacaaatatc atgatgagtt 3840 gggacatgct gggagagata agatgatcga tgccattgga aagagttact ggttcccata 3900 tattaaggaa aaagcaataa agcatatagg gaattgttta aaatgcattg ccttcgcccc 3960 caagacgggt aagagtgaag gactattaca tagcataccg aaaggaaaca aaccgtggga 4020 gatgatccat atagaccatt atggaccaat taatgcagga agggctaata aatatatatt 4080 ggcggtggtt gacggatttt caaaatttgt tagactatat acagcaaaga caacaagtac 4140 gaaagaagcg attaaggcat taaaagatta tttcagagcg tatagtagac ctaagttcat 4200 tgtatcggac agagggagtt gttttactgc aaaagagttt gatgaatttt taaaaaatga 4260 gggagtaacg catattaaaa tagcaacggg ttgtccgcaa gcaaatggtc aggtggaaag 4320 gttgaataga gacttgggtc cgatgatggc aaaaatggcg gagccagaaa atggtttaca 4380 ttgggatgtt attatagaaa atgttgaata tgcaataaac aacacacaac acaaaagcat 4440 aaagaaattt ccaagcgaaa tgttattcgg attacaacag aaaggaaaag ttgttgatga 4500 attaaaagaa aaattagaag agtttacaca ggtgaacacg tgtgataaaa gaaatttaga 4560 aaatataaga aagagtggag aaatatatca aaaagaagct caaatcagaa atgaagagcg 4620 gattaataaa aagaagaaag tatcagtaga atataaaata ggtgactatg tcatggtaaa 4680 aaattttgat agcacaccag gagtttcaaa aaagttaata ccaagatata aagggcctta 4740 tagtatagaa aagattttaa aaaatgatag atacttgtta aaggatgtag aaggctttca 4800 attatctcgc aacccgtacg ttggtatctg gaatagccaa aactttaagc attggatgcg 4860 aaaataggca ataagcatta gttataaaga tagatacaca tataaaagac agagatcggg 4920 agctctcttt gtcaggatgg ccgagt 4946 // ID Gypsy-32_DWil-I repbase; DNA; INV; 4218 BP. XX AC scaffold_181155; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_DWil_; KW Gypsy-32_DWil-LTR; Gypsy-32_DWil-I. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-4218 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181155; Positions 1213952 1218169. XX CC Positions [3321-3638] - Integrase core CC 'TACA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2160..3779 FT /product="Gypsy-32_DWil-I_1p" FT /translation="MGDIIMHAETVKEGLTKLRKVLDVAAEYGLKIKWRKC FT RFLQTEIEFLGHQIGNGNIKPGAEKTKAIRKFPMPKNIKAVQAFLGLTGFF FT CKFIRDYSLIAKPLTDLLRKDIEFKMGPKEQNAVEVLKDALIKEPVLKLYR FT RDARTEIHTDASKDGFGAALLQWQESQLHPVLFWSKKTSESEAWLHSYVQE FT AKAIFLACKKFRQYILGINFKLVTDCAAFKQTLSKKDVPREVAQWVLYLED FT FDFEAVHRPGERLKYVDCLSRYTSEVMVVSSVVTAKIRKAQQDDTMIEAIS FT EILKSKPYDNFKLKGGLVIQGTDLLAIPKSLERKIITEAHNAGHFAVQKTM FT HAVQQSFWIPHLEGKVMQIINNCVKCIIYNKKLGKKEGFLYQINKGSEPLH FT TLQVDHLGPMDATSKLYKYIFAMVDSFSKFLCMYPTKTTNADEVIKKLQEW FT SDIFGHPARIISDRGSAFTSSSFEEYTRQNGIEHVWSTTGVPRKMDKLNMS FT TDRSLTLYPGYHRRSLKDGISSYREFKKRLILLSISPQSGLHLS" XX SQ Sequence 4218 BP; 1460 A; 825 C; 974 G; 959 T; 0 other; tttgggggct caaccgatcg cggactttgg aaatatatga ttggaaagct gacagtggaa 60 agagtgagaa agcgaaagtt ttcgacagca gaaagagcga gtacagtcag acgaaaaaac 120 gaatgccaat ttcgcaagaa gacgcgatgc cctggcgcca cacgaaataa gcaaaacgaa 180 gaaagagaca gcaagcagac gcgaaatttt ggcggcaaac aaaacgaaga aagagacagc 240 aagcagacgc gaaattctgg cggcaaacaa aacgaagaaa gagacagcaa gcggacgcga 300 aattctggcg gcaaacgaag gcagcaatcg gaaaaaacga gacgaataca gcaagccaaa 360 aacgaagagg aaaaatcgct agcaatattc caaaaagaga gtgtatgctt ccaataagag 420 tgttacgaaa tcgcatcatt atgtcggacc aagaggacca attagacccg atgtcgcacc 480 caggtgaaat ggttaatgaa ttagaggcaa ttgagaacaa ccatggaaat gatgaaaagg 540 ccggaataaa tgcagcaatg ctggttgaga acaccgcgac tcttcaacag ctggtccaac 600 tattttgtct gaaagtgcat gccgacgaaa tggacaaaaa gaatgacacc aagttggtgg 660 tggagaactt ttccaaaatc atcccagaat tctgcgggga aggaatgtct gtacaacagt 720 ggtttcaaaa cttcgaactc aacgccgatg catatggatt aaacaataaa cagaagtacg 780 ttcaagcacg agcgataatg attggtaccg ccgctctatt tctcgagtct acggctgtct 840 acgaatatga acaacttcgc cagcaagtcc tggaggaatt tgattgcgat cagccgtgca 900 gtgccgaaat tcacaaccaa ttatctctac gtacgaagtc cgatggtgaa agttttcatt 960 aggacatttt acaaatgaaa cggatcgctg cccgtgggat tattgatatg gagtcagtta 1020 tcaagcacat cgttaatgga ttgaacctaa agagtgacta caaatacacg ttgtacggct 1080 gcaagtcact gaaggatctg aaagacaaat atgacattta caaaaaaatc gccagtacca 1140 aggttaagac cgaagcagca aaacgagctc cgcagcggtt tacgccgaaa tcaaacttca 1200 ataggaatga caggaagcaa cattgcttca aatgcggatc aatggcgcat ctgcgcaagg 1260 actgtaagga ggaagtcaag tgtttccgct gcaaccaaac tggacacatg tccaaaaatt 1320 gtcctgcttc ctactctgta aacattgtgt atgaagaaaa ccgactcaaa tcattgcaga 1380 ctattacaga acaccacacg tcctgtaggt cagttcaagg cagaggtgca gatcgatggt 1440 atttgcctca cacacacgtt tctggtagtg gatgactgca caattggaat taaattactg 1500 ttgggatacg atttcatatc aaagttcgca cagctatttg aggaagaagc tgccaaccaa 1560 aatcagataa gtatttacaa tgtagtcaaa acaaacactg accctgatgt gccttcacag 1620 tatactcatg ttgttcaatc gatgattgag gattttggca aaagaaagaa gacagctgac 1680 tgcgcgatca cgttgaagat cgtgcctgac gacaacatca agctatttag gcacgcacca 1740 agccgaatag gcattaccga agcagatgtg gtcaagcagc agatcgacga atggctgaaa 1800 gatgacacaa tacgacgttc atcatctaac ttcgccagtc gtacagttat cgtaaaaaag 1860 aaggatggca ataatcgagt gtgcattgac taccggcagt taaacaagat ggtattgaag 1920 gattgtttcc ctgtgcctat cgtcgaagat gttttggaaa aattgggcaa tgcaaaagtt 1980 ttcaccatca tggaccttga aaatgggttc ttccacgttc cagttgagga aagcagcaag 2040 aaatacaccg cattcattac gaaaagccct ttggcttttg caactccgca gccgctttta 2100 tccgttttgt taatgacgta tttcaaaacc tcattaatga taacgttctc gatctttaca 2160 tgggcgatat cataatgcat gccgaaacag tcaaagaagg tcttacgaaa ttgcgaaaag 2220 ttctggatgt agcagctgag tacgggctaa aaataaaatg gaggaaatgc cgatttctgc 2280 agactgaaat agaatttttg ggtcaccaaa ttggaaacgg aaacataaag ccgggagcag 2340 aaaagaccaa agccattaga aagtttccaa tgccaaaaaa catcaaggct gttcaggctt 2400 ttctcggatt aactggcttt ttctgcaaat ttataagaga ctactcattg atcgccaagc 2460 cgctgacaga tctattgcga aaagatatcg aatttaaaat gggcccgaag gaacaaaacg 2520 ctgttgaagt cttgaaagat gccttaataa aagagcctgt tttgaagttg tatcgaaggg 2580 atgccagaac tgaaatacat accgacgcat ctaaagacgg gtttggtgct gcattgctac 2640 agtggcaaga gagtcaactt caccccgttc tgttttggag caagaaaacc tcagagtctg 2700 aagcatggct gcacagctat gtacaagaag ccaaagctat atttctagct tgcaaaaaat 2760 ttcggcagta tattcttgga attaatttta agcttgtaac cgattgcgcg gcgttcaaac 2820 aaacattgag taagaaagac gttcctcggg aggttgccca atgggttcta tacttggaag 2880 attttgattt tgaggcagta cacagaccag gagagcggct aaaatacgtg gattgcctaa 2940 gccgatacac aagtgaagta atggtagtat cgtcagtagt cacagcgaag attagaaaag 3000 ctcagcaaga tgatacgatg attgaagcaa tttccgaaat tcttaaaagc aagccatatg 3060 acaactttaa gcttaaagga ggtttggtga tacaaggaac agatttgctc gctataccaa 3120 aatccctaga aagaaaaata attacagaag ctcacaacgc aggacacttt gcagtgcaga 3180 aaactatgca cgcagtacaa caaagttttt ggataccaca tctggaaggc aaagtgatgc 3240 agataataaa caattgtgtc aaatgcatta tctacaataa gaaacttggc aaaaaagaag 3300 ggtttcttta tcaaatcaac aaaggatccg aaccgctgca caccttacaa gtggatcacc 3360 tcggaccgat ggatgctaca tcaaagctgt ataaatacat atttgcaatg gtcgatagtt 3420 tcagcaaatt tctatgtatg taccccacaa aaaccactaa tgccgacgag gtcatcaaaa 3480 agctacagga atggtcagat atattcggac atccagctcg catcataagt gatcgtggat 3540 cagcatttac atcgtctagt tttgaagagt atacgaggca gaatggaatt gaacatgttt 3600 ggagcaccac tggagtgcca aggaaaatgg acaaattgaa catgtcaacc gatcgatcct 3660 taacattata tccaggttat catcggagga gcctgaaaga tggtataagt tcgtaccgag 3720 agttcaaaaa gcgattaatt cttctgtcca taagtccaca aagcggtctc catttgagtt 3780 gatgctcggc gtgaaaatga ggagcggaac caaaagcgat atcttacagt tgctcgagga 3840 agaaatgctt gacacctttg actcagaaag gcaagagatg cgtgacgaag caaaagaaaa 3900 gatacagatg gctcagaaac gttataaaga acagtttgat aaaaaccgaa agaacgaggt 3960 tggatacaat gttgaagatc tggtggccat tcgtagaacc caatttgcag gcaaaaagtt 4020 ggcaagcgag tttttaggac cgtacgagat agtgaagttg aaacgtgcag gacgctatga 4080 tgttcgaaaa gcagcaacag acaccgaggg accaaaatat acgaccacta gcaatgacaa 4140 catgaaattg tggagttttg tagcagagaa cgaggatgca tggtcatcag agcctgatga 4200 ctaatcagga aggccgaa 4218 // ID Gypsy-74_AA-LTR repbase; DNA; INV; 255 BP. XX AC supercont1.281; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-74_AA_; KW Gypsy-74_AA-I; Gypsy-74_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-255 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.281; Positions 676301 676047. XX SQ Sequence 255 BP; 93 A; 38 C; 60 G; 64 T; 0 other; tgtaacccta tggattgaac cctggtggga gcatgaacgg gaaaaggaga gagcgagatt 60 gaatcggaga gtactgacgc catactcagt ggaggatggc gggtttttgc tacgctgcgt 120 gaagaagttg cggaaaacta aataatatta aatctgtaga accttattgt gaatttctta 180 accaaaacca aactgaatta taacaaaaag tgaattaaac caaatttgac tattaattaa 240 agtgatagag catca 255 // ID BEL1-I_Dya repbase; DNA; INV; 8382 BP. XX AC chr3R; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1_Dya; KW BEL1-LTR_Dya; BEL1-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-8382 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1005-1005 (2009). XX DR Genome; chr3R; Positions 168265 176646. XX CC 'CTTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1692..2744 FT /product="BEL1-I_Dya_2p" FT /translation="MPIGHGEEVKTRLTVSKTSQRVRAKSESSVPFKSPSP FT TRSKSPSTTPRKSTSTSLQVPRSAPNTKPNSDFSPQIIRATAKMAQAEADR FT ALMKFMTITDRVGQFEARVNTPAVTSPSIHTSRVRLEQIRALWDKVEKEYE FT ACSEALSGLGSTDTITVMQSKYDYCYAVYERCAASLNEIIEEGSRSQQAVQ FT ASIPAPPQGGCRLPPVECDSFSGDYVHWPTFRDLFSAIYIYNPRLSEVEKL FT FHLNAKTSGEAKSIVALSPLTNDGFESAWRNLQNRFENKRLLVNSQLKILF FT NLPSVGLECGNSLKQLKSTIQGCLTALKIAKIATSNWDCLLVFLCSAKLPK FT LTLGQQSA" FT CDS 4133..6085 FT /product="BEL1-I_Dya_1p" FT /translation="MGGMWFRCPLKIFNNINLGHSRSIALAQFLRNEIRLN FT KDVASKKQYDSVIQEYLDLGHMHQVSPDDSNNFYLPHHAVFKPDSTTTKVR FT VVFNASNPSSNGNSLNDILHAGPVLQSDLTIQILKWRFFKYVFNADITKMY FT RQIRVNSTHKRFQRILFRNKDGELCDYELDTVTFGVNCAPFLAIRVLQQLA FT QDIRGQYPLASDNISNFMYVDDVLAGTHNKQSAVLAIEELRLALESAGFPL FT RKWTSNERKLLQEVPKEHLISADFLELEEASTAKTLGIRWQATSDSFFFIP FT MVIHLQTAYTKREVLSQIAKLFDPAGWLSPFVVRAKIFMQKIWLRELGWDQ FT PLPRDLATQWREFLEGYPALKEIRIPRWVRFHPAAKLQYHAFCDASQDAYG FT AAIFVRIETKEGCCTHLLTSKTRVAPVRSISIPRLELCGAVLLTELAALVI FT SEMPPHVYETFYWTESTIVLAWLSKPACTWTTFVANRVAKIEQCTDGSKWG FT HVRSEDNPADLASRGVSPQELKDSALWWRGPAWLHLKQEQWPSELLVHPET FT ELEQRPVKCHTVAVPSAVEILERFSAFDRALRVLAYVFRFVKSCRREGVAS FT SAELTAAELSEVQERLIVLTQKNEFPAEYKVLSNQQPATSSTRKRNFEPKP FT LS" FT CDS 6135..7337 FT /product="BEL1-I_Dya_3p" FT /translation="MLSYDEKHPIILPARCRFAELQVLFIHRISLHGGNQL FT MIRLIRSKFWIPKLKMLVKRTIHSCRVCVIHKKILQTQLMGNLPRARSTFS FT RPFTHTGVDFAGPFDVKSYVGRACKITKGYVCVFVCFTTRAIHLEATSDLT FT TEKFLAAFSRFFARRGCPQHVYSDNGKTFVGASTSLSKDFIEATRTSVLSQ FT HSLQNVSWHFNPPGAPHMGGLWEAGVKSFKSHFYKYTAAGKYTFEELTTLL FT AKIEACLSSRPISPMSEDPTDLIALSPGHFLIGGPLLAVAESEVKEGPSSI FT INRWRRLKALNQQFCLRWKEEYLKELHKRNKWKFPTRDLQAGDMVVIKEEN FT LPSNEWRLGRIQVVCPGVDGKVRVADALTARGVVRRLVAKMIRLPMDSPNE FT SKDSSIL" XX SQ Sequence 8382 BP; 2161 A; 2154 C; 1911 G; 2156 T; 0 other; ttttggtcct tcgagccgga tcatccagcg ctttgccgag ataagtacgg tgccaaaatc 60 cccaccatat atgtatatac atacgcatat atattgctgc gatcaaaatc cgctcactat 120 caactagtgg ccacctgcgc aattatatat gttcgtgtgt atattttgta tgcatccttt 180 cccttttatg ttatcaacca atcatcaact ttccaataaa aacaccaaat taaaaaacaa 240 aattcgccac ccaaactttc acatacacaa aaacatacag acattccatg tatgttggcc 300 gatcgatgta catacatatg tccattgcat ctgcatacat tgcgtacatg gcacatacgc 360 aacgtcgtac atctgcatgg tcatagggtg tttcattaat tgccaaaaca ttcaaaccat 420 tcaaacagac attaatatca tccaacatat gtacatgcat acattaatgt tattgtttac 480 tttcacatac acgaaagctt ccacacacac gacaagcaca gacgattttt tggaccggca 540 gataagaacc ggctgcagcc aacactagcg acgaacacat ccacgaacac aacctccagc 600 gcagtaagag aaaaaaaaaa agaataacaa aaaaacaaat tttaaaaaca accatcggcg 660 ctaacattta catattgcat atatatatat gtgtgtacca gcatatgtac gtatgtgtca 720 aggtagccac tctagggcct attccaataa gggaccgttc acactgacac tacttttttc 780 tagtttccca tagaaagtgt caatgtgtgt ttatttttgt ttctttaata ctttttttaa 840 cttcgttctt gacttattcg tatatccgat tcgcatatat cgtctcaaaa aaattggccg 900 aacatccacg cttatccagc gtcccacatc aacaattata gtccagtgtg atccagcgcc 960 ctcatatctg ttttttttct caatcgatag caacgtttat tccgattatt gtgtattccg 1020 cgcttaatcc agcgacccca ggttttatat aaaacgataa cataatttgt aacgattacc 1080 atttgttttt caagcgcaat ccggcgcccc cttacctttg tctaaatcga ttaattccga 1140 ttattcattt ttcttttgat tttgtctctt gctcaataca gcgaccccgg tgtttgtcac 1200 atcttttttc acgcgttgtc ctgcgccccc acaccctgtc taaattgatg cgataatttg 1260 gttcttaaat tacttttgtg cgctttatcg agcgacccag tataactcca aatcaacaat 1320 cctaagcttt cgattatttt gggtgtccgc tttatctagc gactccagtt atttttttcc 1380 aaatacggaa cattccgatc gatccaactg gtcgcacgct tatccagcgt cccaatatat 1440 aagttgtttc gatcttggtt gcacgttttt tccagcgtcc caatatagaa gttgtttcga 1500 gcttgtttgc acgctttatc cagcgtccca atatataagt tatttcgatc ttggttgcac 1560 gccttatcca gcgtcccaat atataagttg tttcgagctt ggttgcacgc tttatccagc 1620 gtcccagtat ttgtccaagt gaacagactg tttcgccccc acacggtttt cggctcctcc 1680 gagagcttaa tatgccaatt ggccacggtg aggaggtaaa aacccgtttg actgtttcca 1740 aaacgtctca acgagttaga gccaagagcg agtcctctgt gcccttcaaa agcccaagtc 1800 ccactcgaag caaaagccca tcgacaactc cacgaaaatc cacgagcacg tcgctccaag 1860 ttccacggtc agcaccgaac acgaaaccga attcagactt ctcaccccaa attattagag 1920 caaccgcaaa aatggcgcaa gctgaggcag acagggcctt gatgaaattc atgaccataa 1980 ccgatagagt aggtcagttc gaggccagag tcaacacccc agcagtcaca agtccgtcaa 2040 tccacacaag tagggtgcga ctcgagcaga tcagggccct atgggacaag gtggagaagg 2100 aatacgaggc gtgctccgaa gctctgtccg gtctgggatc cacagacaca ataacagtca 2160 tgcagtccaa gtatgactac tgttatgccg tttacgagcg gtgcgcagcg agcctcaacg 2220 agatcattga agaaggttca cgctcgcaac aggccgttca ggcctccatt ccggcgcctc 2280 ctcagggagg gtgtcgctta cccccagtcg agtgcgacag tttcagtggg gactatgttc 2340 actggcccac gtttcgggat cttttctcag ccatctacat atacaaccct cggctttcag 2400 aagttgaaaa gttgttccat ctcaatgcta aaactagtgg tgaggctaaa tccattgtgg 2460 ccttatcacc cttaaccaat gatggattcg aatcagcctg gagaaacttg cagaatcgtt 2520 tcgaaaacaa gaggctactg gtcaacagtc agcttaagat tttgtttaat ttaccttctg 2580 ttggccttga atgcggcaat tccttgaaac agttaaagag cacaattcaa ggttgtttaa 2640 cagctttgaa aatagcaaaa atcgccacct ctaactggga ttgcctgctc gtttttctgt 2700 gctctgccaa gttgccaaag ctaacgctgg gtcaacagtc agcttaagat tttgtttaat 2760 ttaccttctg ttggccttga atgcggcaat tccttgaaac agttagagag cacaattcaa 2820 ggttgtttaa cagctttgaa aatagcaaaa atcgccacct ctaactggga ttgcctgctc 2880 gtttttctgt gctctgccaa gttgccaaag ctaacgctgt ctctatggga acaatccctt 2940 atgagtaaga cagatattcc actgtgggct gaattccagg ctttcttaca agatcgtcat 3000 cgaacgctcg aggcgatcga agattttaag ccaaacgctc aatccagagc actgcggact 3060 gacaatagcc cgcgggcgat ccagaccttt gagaatagag tgacggtaac tccacaatcc 3120 tgtaaacttt gttcgcaaga gaaccatcct atccgagtat gtccgttatt cttacgaatg 3180 ccagtcggag agcgggagaa atatataaaa cagcagaagt tgtgcttaaa ctgtttttct 3240 agaacgcatc tgcttaggga ttgcactagc acgcataatt gcaatacttg taaagggcgc 3300 cacaacactc tcctgcaccg aagcggcact ccaccagtcc agtcagcggt caatcctgac 3360 caccaggccc acgtatccac atttaaccac cagcaagata tacagtccac tccattaacg 3420 aatcaaccga atgtgcaaac gtttctggct gtaaacactc aaggggtcct gctgagtaca 3480 gctttaatag agatttgcca ttaaggcatc aaatactcag cacgggcact tattgactcg 3540 ggttctgagg caacctttat ttcagaacgc ttgttcaact tgattaagtt gccttatgag 3600 tccatccatg ctcaagtttc agggctcaac ctcactgttg cggcttagcc ccgtaagcgc 3660 tgtcaactca gcataggttc tccggtcaag ccccacatcc aaattaagac ttctgcatat 3720 gtactaccac aattggccgg gagtctccca tcctttactt tacctgagga ctctttaaag 3780 catctgccgc ctttgtaact agcagaccca aatttctacc ggagttcgca gatcgacgtt 3840 ctgatcggtg ccgatatcct gccatcgatc attctcagcg gctcccatcc caatatctgt 3900 ggcacgcttc ttggccagaa gaccatattc ggatggattc tttgcggtcc gatcgcaaca 3960 aatcccacaa gcagaatatg ctcattttcg tcccgactag ctgtcaccga gtccagactg 4020 gacaacattc tcacaaaatt ttgggaggtg gaggatgttc cagtgaagcg ggttagggaa 4080 tccacctcga tttgcgagga aaactttgtc caatctacca aaaggaacga gaatgggagg 4140 tatgtggttt cgttgccctt taaagatatt taacaatatc aaccttgggc actccaggtc 4200 gatcgcactg gctcagttcc tccgaaacga gattcgactg aataaagacg ttgcgtcaaa 4260 aaagcagtac gactcagtca ttcaagagta tttagattta ggtcatatgc accaagtctc 4320 tccggacgac tctaataatt tttatttgcc ccatcatgct gtcttcaaac cagacagcac 4380 cacaacgaag gttcgcgttg tgtttaatgc ttccaatcct tcatcaaacg ggaacagcct 4440 caatgatatc cttcatgcgg ggcccgtact gcagtccgat cttactatcc agattctaaa 4500 atggcgtttc ttcaagtatg tgttcaatgc ggacatcaca aaaatgtaca ggcaaattcg 4560 ggtaaattca acccataagc gctttcagag aatcctcttt cgcaataagg atggagagct 4620 ctgtgactat gaactcgata cagtcacctt cggcgtaaac tgtgcgcctt tcctagccat 4680 ccgcgtacta caacagttgg ctcaggatat ccgaggccaa tatcctttgg caagtgacaa 4740 catctcgaac ttcatgtacg ttgacgatgt gctcgcaggc actcacaata agcagtcggc 4800 agttttagcc atcgaggagc ttcggctggc tcttgagagt gctggttttc cattacggaa 4860 gtggacctcg aacgaaagaa aactcctaca agaggttcca aaggagcact tgataagtgc 4920 agactttctg gagcttgaag aggctagtac ggcgaaaact cttgggatcc gttggcaagc 4980 tacatccgat agtttctttt ttattccaat ggtgatccac ctccaaactg cttacacaaa 5040 acgagaggtt ttatctcaga tagccaaact gttcgaccca gcaggatggc tatcaccttt 5100 cgtagtccga gccaagattt ttatgcagaa aatttggcta cgcgagctgg gctgggatca 5160 gccactcccc agggatctcg cgacccagtg gcgggagttc ctagaagggt atcctgccct 5220 gaaggagatt cgtattccga gatgggtacg tttccatcca gctgcgaaac ttcagtacca 5280 tgcgttttgc gacgcgtcac aggacgccta tggggctgct attttcgtcc gaatcgaaac 5340 gaaggaaggc tgttgtaccc atctacttac atccaaaacc cgagtcgctc ctgtcagatc 5400 catctccata ccacgcttgg aattgtgcgg agcagtgctg ctcacggagc tggcagctct 5460 agtaatttct gaaatgcctc ctcatgttta tgagaccttt tactggaccg aatccacgat 5520 cgttcttgca tggctgagta agccagcgtg cacctggaca acattcgttg ccaacagggt 5580 cgcaaaaatc gaacagtgta ccgacggaag caagtgggga catgtacgat ccgaagacaa 5640 tccagctgat ctggcaagcc ggggcgtctc accccaagag ctgaaggaca gcgcactgtg 5700 gtggcgtggg ccagcttggt tgcacctcaa acaggagcag tggccgagtg agttgttggt 5760 ccatccggag actgagcttg agcagcgccc cgtcaaatgt cacacggtcg cagtgccctc 5820 agcagttgaa atcctagaga ggttctcagc ttttgaccga gctctgcgtg ttcttgccta 5880 tgtgtttcgt tttgtgaaaa gttgccggag ggaaggtgta gcctcatcgg cagaactcac 5940 tgcggcggag ttgtcggagg tgcaggagcg gctaattgtg ctcactcaga agaatgagtt 6000 tcccgccgaa tataaggttt tgagcaacca gcaaccagca accagttcca cccgcaagcg 6060 caatttcgaa cctaaacccc tttcttgacg gggatggggt cttgagagca agcggcaggt 6120 tacaagcatc tgaaatgctt agttatgatg aaaaacatcc gatcatcctt cctgcacggt 6180 gtaggttcgc cgaacttcaa gtcttgttta ttcatcgcat atccttgcat ggggggaatc 6240 aattaatgat ccgactgatc cgttccaaat tctggattcc caagcttaaa atgttggtga 6300 aacgaacaat acattcgtgc cgagtgtgtg ttattcacaa gaagatactg cagacgcagt 6360 tgatggggaa tttgccccgt gcgaggtcca cgttttccag accgttcacc catacgggag 6420 tggacttcgc aggaccattt gacgtcaaga gttatgtcgg tcgagcctgc aaaatcacaa 6480 aggggtacgt gtgcgttttt gtgtgtttta ctacccgggc tatccacctc gaagcgacct 6540 cggacttgac tacagagaag ttcctggccg cattctcccg ttttttcgct cggcgtgggt 6600 gtccacaaca cgtctactct gacaacggga aaacgtttgt cggagcttcc acgtccttat 6660 caaaggattt cattgaagct accagaacat cggttttatc gcagcacagt cttcagaatg 6720 tgtcctggca tttcaaccct cctggcgcac ctcatatggg agggctttgg gaggcaggtg 6780 tgaagagctt caagtcgcat ttttacaagt acacagccgc tgggaaatac acgttcgagg 6840 aactgactac cctcctggcg aagattgagg cctgtttgag ctccaggccc atctcgccaa 6900 tgtccgaaga tcccacggac cttattgctc tgagccctgg gcattttcta atcggtggac 6960 cattacttgc ggtagcagaa tctgaggtta aagaaggtcc cagctccatt atcaatcggt 7020 ggcggaggtt gaaagccctg aatcagcagt tctgcctgcg ttggaaggag gaatacctaa 7080 aggaactcca caaaaggaat aaatggaagt tcccgacgcg agatctccag gctggagaca 7140 tggtggtgat caaagaggag aacttgccat ccaacgagtg gcgacttggg cgtatccaag 7200 tggtctgtcc cggcgtggac ggcaaggtca gagtggcaga cgctcttaca gcacgggggg 7260 tcgtccgaag actagtggca aaaatgatcc gccttccaat ggacagcccg aatgagtcga 7320 aagactcctc catactttaa gtatttgaaa acctgctact gacgagtcct aagttgtgtc 7380 gtcctacgtc gtccatgtgt catgatcatc cgcatctatc tcatctaatt tttttatttt 7440 gcccatttct tatgtagccg catatacaat ggctccagcg atccgccacg atgtataccg 7500 ctgtcgagtg tgcagggggg ttcatccatt gaggagctga cagcgattcc tgcaactacc 7560 ggctgtgaag cggctccggg cagttttgat taacaagtac tgcgctaact gtctggccca 7620 cgagcattcg ggccggtcgt gtcgcagcaa agcaaagtgc cgcatctgcc gagaggccca 7680 ccacactctg ctgcacatcc gcggtgaaca gccaggctct tcgaggcctc aacggaagcg 7740 tgagccacaa gcagcgcctc agcctcaacg gcgtcaggag gtgggttcga ccgggtcacg 7800 acgctcagcc tcgccgatgg atagtttccc accccgcagc tcattgtcaa cgccgacggg 7860 cgcgcctagc ttggtgcgga cgagcgtctg catccttccg accgctgccg tgctcattgg 7920 gtccgagcag aagagattcg atgtgcgagt cttggtggat ccgtgctgcc ccgtgagtcg 7980 catccacgtc ttgctggtga aggccctcaa cttggccgtc acgggcgtgg gcaacgaacg 8040 gctgtgcacc accgagattc gctcaaaaac gtcccggatg agcaaacagc tgctgatgct 8100 ggtggacgag cagatgcaga gccggactcc ggcacagccc ctggatccga aggtgcagga 8160 catgttccac aacatgacgc tagcggatga ccggtggttc cacccagcgc ttgtctcggc 8220 tgtccttgga gcagacgttt acgctgacct gaagctgccg ggtatcatcg cgagccagga 8280 cggcctgccc atggcacaaa attccaagct gggatggatt ctttcgggac gctgttctct 8340 gtcctaaatt tgcaagtatt cctcttgcaa ggggggggag aa 8382 // ID CR1-72_AAe repbase; DNA; INV; 4826 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A CR1 non-LTR retrotransposon from Aedes aegypti. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-72_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4826 RA Kojima K.K. and Jurka J.; RT "CR1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1160-1160 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 10 sequences with >94% CC identity. XX FH Key Location/Qualifiers FT CDS 182..1762 FT /product="CR1-72_AAe_1p" FT /translation="MVQTRKCDASPCFGSSSRTGKISCVRCQKSFHLKCTG FT ITTNQFKSVRDCPGAVWLCIICRNTPDLDQAATNFTYNMILDRLNSVYRLV FT NSQIDVTRWLCSTFGHNPPHQKNSTPPPVSHVHNETSNHNFMDQIENLQFN FT FTREFNDFGDASINNTGNNRSSSPPSILTSSKSGHSTEVVVPSSIPISAIT FT TVQSFVENSRALTNVANTDVTTQATTSATMPATATESTAAYPTSTVIIDTA FT ATVQATVTKPTTATAPNVTTSTTMPTATIALTLPKTATATVTTSTTATAMH FT ATTSKTNAPPAASVTRSSGKNACTQTIPYPTTHIAPPVSARANTLPSHQPA FT RPTSSTTPSVSTNYPTSVPVLAIAPPSSNSSANDTWYYVTRFQPNETVQNI FT VAYISHKANCKPEHIRCLKLTRQINSESLTFVSFKVSVPKSIEELISATEF FT WPIGITVSPFLERRSNIYRRKKPFSPVPSRLVPRQMRGAQLNPPANVPTST FT LLRAVPQHYFNSNKVQPRMPPPPRRFQSSLV" FT CDS 1513..4653 FT /product="CR1-72_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="ILAYRNNRVAFFRTALEYLPTQKAFFAGSLALGSTTN FT ARSPTESTSECTNFNSSSSGSTTLFQLQQGSTPNATTATPFPIIPRLDNVL FT SLYYQNVGGINSSLHSYNLACSDASYDLVAFTETWLDEGTLSKQIFGSDYI FT VFRQDRNEFNSRKTRGGGVLLALRSTLACKQLYPPNCTIVEQIWVSVNLTS FT FTLFICVIYIPPDRVNDLAIIESHLDSLTWITSKMQLTDKIIIIGDFNLPG FT ISWTRARTNFLFPDSSKSTTTSSSINLLDGYSLANLCQINDVPNSNGRCLD FT LCYITEELLSSDSIKLSAAPGPLVKTGCHHPPLHISISVPQPLKYIDIADS FT MYYDFNKTNFMSMNSFLLGIDWESILADSEISSVVKTFSNVLLYAIDRFTP FT KRQKRSKSYPPWSNPHLKRLKSIKRCHLKKYSKHHTDFFKLQYNASNRMYK FT DLNDQLFSDYQQRMQLDLKNNPKKFWSYINDQRKESGLPSSMKLGNVETSS FT IARICNLFREHFSSVFTDELLNDDEVLRAANNVPHHSSCGQHPTVTTQMVE FT TALCKLKSSSNPGPDAIPSIILKRCSSSLSAPLSFIFNLSLKCGTFPSHWK FT DSFVFPIYKKGCKRDVSNYRGIAALCAVSKLFELIVLEFITHNCSLYISET FT QHGFMPKRSTTTNLLSYISFVTRCIQDRLQVDAIYIDLSAAFDKINHKIAV FT AKLEKLGFSGAFLSWLRSYLVGRTMSVKIGDCISSPFFVSSGVPQGSHLGP FT FIFLLYINDVNLSLECFKLSYADDFKLYNVIKCDNDTALLQKQLDVFVVWC FT KNNKMILNASKCSSISFSRKHTITEFNYHIGEVTLNRVKSIKDLGVMVDYK FT LTFNDHISYIVSKASKSLGFIFRVAKHFSDIHCLKSLYCSLVRSVLEYAAI FT VWAPFYQNSIQRIESIQRKFLRFALRNLHWRDPLHLPSYIDRCQLIHLNLL FT STRRDVCKLLFVSDLIQCRIDCSNLLEKLNFDIHIRTLRSHLFFRLPTNRT FT NYGYHEPVISMCRLFNQYCNVFDFHLSRPVLKHQFVQMLGV" XX SQ Sequence 4826 BP; 1388 A; 1209 C; 817 G; 1412 T; 0 other; tttttttttc attttcttac acgaaattca ttgacaaatt attaaagtgt agtgtaaagc 60 atatttcgct aaacggacaa tcccatctta tggaattaca tccccaagtg gccaccgtcg 120 cgaatttcat ctgattatcg gtcacgctat tgttcgctcc acgaagcagt atacaacaac 180 aatggtgcaa accaggaaat gtgatgcatc accttgtttc ggaagctctt ctcgtactgg 240 aaagattagc tgtgttagat gtcagaaatc ctttcacctc aaatgcactg gaatcactac 300 caatcagttc aaatctgttc gagactgtcc cggtgctgtg tggctgtgta taatatgccg 360 caatactcct gatctggatc aagctgccac aaattttaca tacaacatga tactggatcg 420 actcaattca gtctatcgac tggttaatag ccaaattgat gttacccgat ggttatgtag 480 tactttcgga cataacccac cccatcaaaa aaactctacg cctccaccgg tttctcatgt 540 ccacaatgaa acatccaacc acaacttcat ggaccaaatc gaaaatctgc agttcaactt 600 cacacgtgaa tttaacgact tcggcgacgc ttccatcaac aatactggaa ataaccgttc 660 cagcagccct ccatcaattt tgacatcgtc aaaatccgga cattccacgg aagttgttgt 720 tccatctagt attcctatat ctgcaatcac tacagttcaa tcatttgtag aaaactctcg 780 cgctctgacg aatgttgcca atactgacgt cacaactcaa gcaactacct ccgccaccat 840 gccagctact gccactgaat ccactgccgc ataccccacc tctaccgtta tcatcgatac 900 tgccgccacc gtccaagcca ccgtcaccaa gcccactacc gccactgctc ccaacgttac 960 aacctccacc actatgccca ctgctaccat cgccctcacc ttgcctaaaa ccgctacagc 1020 cactgttaca acatctacaa ccgccaccgc catgcatgct acgacatcca aaacaaatgc 1080 tcctcccgct gcatccgtca ctcgctctag tgggaaaaat gcatgtactc aaacaattcc 1140 atacccaaca actcacatcg caccgccagt atcagcccgc gccaatactc tgccatcgca 1200 ccaaccagct agaccaacat catcaacaac accatcagta tcaaccaact atcctaccag 1260 tgttccagtt ttagctattg ctccaccttc gtctaacagc agcgcaaatg atacatggta 1320 ctatgttact aggttccaac caaatgaaac cgtgcagaat atcgtagcgt atatatccca 1380 taaggcaaac tgcaaacctg agcacatccg ttgcctcaaa ctaacacgtc agattaacag 1440 cgaatcacta acatttgtat cgttcaaagt cagcgttcca aaatcaatcg aagagctgat 1500 ttctgctact gaattttggc ctataggaat aaccgtgtcg ccttttttag aacggcgctc 1560 gaatatttac cgacgcaaaa agcctttttc gccggttccc tcgcgcttgg ttccacgaca 1620 aatgcgcgga gcccaactga atccaccagc gaatgtacca acttcaactc ttcttcgagc 1680 ggttccacaa cattatttca actccaacaa ggttcaaccc cgaatgccac caccgccacg 1740 ccgtttccaa tcatccctcg tctagacaac gttctgtcgc tttattatca gaacgttgga 1800 ggaataaatt cttccttgca ctcctacaat cttgcatgct cagacgcttc atatgatctc 1860 gttgctttca ctgaaacatg gttagatgaa ggtacgcttt ccaaacaaat ttttggtagt 1920 gactatatcg tgttcaggca ggatcgtaat gaattcaaca gtcgtaaaac tcgcggggga 1980 ggtgtcctat tagctctacg ctcgacgcta gcttgtaaac agctttaccc tccaaattgt 2040 acaattgtgg aacaaatttg ggtatctgtt aacttgacta gttttacact ttttatctgc 2100 gtcatttaca tacccccgga tcgtgtgaat gatctcgcca tcattgagtc acaccttgat 2160 agtcttacct ggattacttc taaaatgcaa ctcaccgaca aaatcatcat aatcggcgat 2220 tttaacttgc ctggaatatc gtggacccgt gctagaacaa attttctgtt ccccgactca 2280 tctaagtcaa caaccacatc ttcatcaatc aatctgctag atggatatag cttagctaat 2340 ttatgtcaaa tcaatgatgt ccccaacagt aatggtcgtt gcttggatct ctgttacatc 2400 accgaagagc ttctgtccag cgacagcatc aaattatcgg ctgccccagg tcctctcgtt 2460 aaaaccggtt gccatcatcc tccattgcac attagcatca gcgtacccca gccactcaag 2520 tacatcgaca tcgcggattc catgtactac gatttcaaca aaacaaactt tatgtctatg 2580 aacagctttc ttcttggaat cgactgggaa tcaattctgg cagatagtga gatcagttca 2640 gtagtgaaaa ctttttccaa cgttttgttg tatgctatcg atcgttttac accaaaaaga 2700 caaaaacggt cgaagtctta tcctccatgg tccaatcctc atctaaaacg gctgaaatct 2760 atcaaaagat gccatcttaa aaagtactcc aagcaccata ctgatttctt caagttacag 2820 tacaatgcat caaataggat gtacaaagac ttgaacgatc aacttttctc cgattatcag 2880 caacgcatgc aactcgatct gaaaaacaat cctaaaaagt tctggagtta tatcaacgac 2940 cagcgcaaag aaagcggtct accatcttct atgaaactcg gaaatgtcga aacatcatca 3000 atcgctagaa tttgtaatct tttccgtgaa cacttctcaa gtgtgtttac tgacgagcta 3060 ctaaacgatg atgaagtttt gagagctgct aataatgtac ctcaccactc atcatgcggt 3120 cagcacccta cagtcacaac tcaaatggtg gaaacagcat tatgcaagct caaatcatct 3180 tcgaaccccg gcccagatgc aatcccatcc atcatcctaa aaaggtgctc atcaagtctg 3240 tctgccccac tatcattcat ttttaatttg tcgttaaaat gtggaacgtt tcccagtcac 3300 tggaaagatt cattcgtatt tcccatttac aaaaaaggtt gtaaacgtga cgtttctaat 3360 tatcgcggaa ttgctgccct gtgtgcggtc tcaaaattgt ttgaactgat agttttggag 3420 ttcattacac ataactgctc tctttatatc tctgagaccc aacacggatt catgccgaag 3480 aggtccacta caactaatct gttgtcatac atatcatttg ttactcgctg catccaggat 3540 cgacttcaag tagatgcaat ctatattgat ctctccgctg ctttcgacaa aatcaaccat 3600 aaaatagcgg tcgctaaact cgagaagctg ggattcagtg gcgctttcct ctcatggctt 3660 cggtcttatc tcgttggtcg tacaatgtcc gttaagattg gtgactgcat ctcatcacct 3720 ttctttgtgt catctggcgt tcctcaaggt agccacttag gacctttcat ctttcttctc 3780 tacattaatg atgttaattt gtccctcgag tgttttaaac tgtcttatgc cgatgatttt 3840 aagttgtata atgttataaa atgcgacaac gataccgcat tactacaaaa acaactagat 3900 gtcttcgtag tttggtgcaa gaacaacaaa atgattttga acgcgtctaa atgctcaagc 3960 atttctttta gccgtaagca caccatcaca gaattcaatt atcacatagg agaagttact 4020 ctgaaccgcg tgaagtcgat aaaagattta ggagttatgg tagattataa attgacattt 4080 aatgatcata tttcatatat tgtttcgaag gcatctaaga gtctaggttt tatattccgc 4140 gttgccaaac acttctctga tattcactgc ctcaaatcgc tttactgctc tttagtacgc 4200 tccgttttag agtatgcagc aattgtgtgg gctcctttct accagaatag cattcaacgc 4260 attgaatcca ttcaacgcaa atttttgcgg tttgcgctgc ggaatttgca ttggagagat 4320 cctttgcatc ttcctagcta cattgacaga tgtcaattaa tacatttgaa cctgctatcc 4380 acacgtcgag acgtttgcaa acttcttttt gtatcggatt tgattcaatg tagaattgat 4440 tgttcaaatc ttttagaaaa attgaatttt gatattcata ttcgaactct tcggtctcat 4500 ctctttttcc gattacccac aaatcggacc aattatggtt atcatgaacc tgtgattagc 4560 atgtgccgtc tgttcaacca atattgtaat gttttcgatt ttcatctttc ccgtcctgtt 4620 ttgaagcatc agtttgtcca aatgttaggt gtttaatatg tagataagga aaagttaagt 4680 taagttttat ttatgtatca ttgggaattg taatctgttg atacgaaaag aagggtaggt 4740 tttgtgccta tttgagagag agcattagtt gacagctcaa ctcaaacggg tttttcccta 4800 ctccgaaaaa aaaacacaaa ataaac 4826 // ID LmeSINE1c repbase; DNA; INV; 404 BP. XX AC . XX DT 07-APR-2009 (Rel. 14.04, Created) DT 07-APR-2009 (Rel. 14.04, Last updated, Version 2) XX DE Coelacanth DeuSINE element - consensus. XX KW SINE3/5S; SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; SINE3; DeuSINE; conserved; LmeSINE1c; CNE. XX OS Metazoa OC Eukaryota. XX RN [1] RP 1-404 RA Jurka J.; RT "Coelacanth DeuSINE element."; RL Repbase Reports 9(4), 922-922 (2009). XX DR [1] (Consensus) XX SQ Sequence 404 BP; 116 A; 80 C; 83 G; 124 T; 1 other; gggtgtcggt grctcagtag gtagcactct tgcctcttag tcagaaggtc gcaggttcaa 60 atcccacttg aggaacctgg acaaaccccc agcaggtttt tggagtgcta tgctactgaa 120 ggtgttttct ttcagatgag atgccaaagg aatgttttgc ctacctttta gagatcataa 180 attctatgct atgttttgaa agaatatggg agtcctagtc aagttccctc aatagagtct 240 atcaggtcat aatgattgga agttgttgtg tgatatgcat tcaaaacata gttgttgtgt 300 tccatcccaa agacaacact tcagtgactt acatatgccc tcaatgcaat gtaacatcca 360 tggataaaac actacagaaa tgtaaattct actctactct aatt 404 // ID RTEX-10_BF repbase; DNA; INV; 6695 BP. XX AC . XX DT 30-JUN-2009 (Rel. 14.07, Created) DT 30-JUN-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus RTEX-10_BF autonomous non-LTR retrotransposon - DE consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; Jockey-7_BF; KW RTEX-10_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-6695 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-6695 RA Kapitonov V. and Jurka J.; RT "Young families of RTEX non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1726-1726 (2009). XX DR [2] (Consensus) XX CC The complete RTEX-10_BF consensus sequence contains two ORFs. The CC RTEX-10_BF ORF1 protein contains the DnaJ (1-70 aa) and esterase CC domains (487-672 aa). The 3' terminus is composed of the (CATT)n CC microsatellite. The DnaJ domain is conserved in the ORF1 proteins CC encoded by several families of RTEX retrotransposons in the B. CC floridae genome. Therefore, DnaJ is necessary for proliferation CC of the RTEX retrotransposons. XX FH Key Location/Qualifiers FT CDS 219..2984 FT /product="RTEX-10_BF_1p" FT /note="DnaJ, PHD, and RT domains." FT /translation="MATAASPTPSRIYELLQLPESTTFSDVHESYATYIQD FT YTREREATKKLKPAERKAYMYRKSGAELREVSAAFLSLFETHLGTTSDCLR FT GPERYSVTHNISCITVNIPGESVSSWKSVCGAYYGVDGIDRGPNGVQYSTT FT FTDADTRIGTELGSLHITVYKNKLLVQGTSYLLWFLSHYNILQGMVLSDMT FT EKSADVTPVCTTSELQPPIEETICAVCNKREVEGDEFVGCSTCSSWTHFDC FT TGLDDATKSTLRNTDEKYYCINCHVPRVATCTIMLSDPPINTDSTTAGKTM FT TARKTNLDHSSNADVNTVRTKAVQSSTSETKEDQYSAALHEKIETLQRAVM FT ELESSLASSKLQANVFETSVSGHLDVMRKEMESAKCTCSSKIDDTASRKLR FT EENGKLRSRVQHLEQEFSSLKTTSLTLKSELDDMKLSQSLLKVKVGNVKDL FT IKSSEHVSHPKPAPRKEAVISENNPEVPVLEYTLTTSNRFDVLNQEESERT FT SESSVEEAVEVLAEVRSPKVHAATRTVNDETSSVSRQEGQGRKRVVLIGDS FT NAQKLKPNLLCPTADTSKPYWCPNLFSAPTVLEDISNEGITPQTVILHLGT FT NDVMAKTKDTVITEFEVTVSTAQSLFPQAEVVISAVPPRRNAKYRPNVNED FT ISAINLHLQKMCDTNKQLVYVNHPQLWKGTDDYNAHMYEPDGYHLSGDGVR FT TMAYNLKKHASKALGLPPLRNHGTSPRRQGHRNASNAYRGNNTPSSQSKGR FT KPQRNMFDEQGSRTTLAPDRPGLPQHQSSVGLAKPPAPPPRNSYYNQPRAP FT KGAPTSLPGQDRGPQWGNFSPGPASAPYNRQGNFAPPTLPDGPPAPAPPFF FT PNRPRQMPPAPWFQEWPSPAEAYGMPPVGFGQHRWIGSPPFPYFHMDRGWY FT GHPPDPSFGLSDRPHGSR" FT CDS 2990..6562 FT /product="RTEX-10_BF_2p" FT /note="AP endonuclease, RT." FT /translation="MCIPHKSKQTKECRRNDFPSSSRNFGISFGFWNIHGL FT GSKLDDRDFISQIEGFDFFSLSETWHTGELSFPNFLYFSSHRTKSKKAKRN FT SGGLIFMYKKQYRYLVTRLNSKSEDILWVKIDRQLFNHDKDIYLASVYISP FT KGSTIHSNRTYDIFDILEEEIAYFSDLGDILVGGDFNARVGTLPDYIVNDA FT PLDMQILPTNYDFDHPLPRNNMDEGQPTVFGKNLIDLCITSKIRILNGRTP FT GDLLGKPTCFQPKGCSVVDYILASESIVLRKSPFFHVHSLSPLSDHCKLSL FT IIEGNPLAYQKKKTKLRTSPYRKFVWSTDSKEKFMETLNSTAFSSSMSSFI FT RKSYEPTPESLELALHDFVTPMQEAGQKSLSIRCTIAKRQRCPKQLNKKKW FT FNQTCASAKIKLQNICKLLSKNPRDPHIRGSYFIQKKQYIRLIRRTKKEYK FT NNILNQLNSFSGPGANPSAFWSLLKEYKSKDQTHSHEIISDESLHSHFSSL FT YGEDSTTNLPKLPNAFEIQIQRELLNLEDMLQNTHCNSLNEPINKDEIKQA FT ISNLKNGKASGGDLIRNEMLKCSSHILLEPLSKLFNLFLSSGYFPQDWCTS FT HIVAIHKGGAKDDPGNYRGISVTSCLAKLFTSILNTRLTTFLDENNLISPN FT QAGFRKEFGTNDNLFVLDSLLSKHLSENKRIYSCFIDFKKAFDSVWREGLR FT FKLLNSGIGGNFYKLIKCMYTKPQTCVKTESGLTPPFVTKKGVRQGCNLSP FT TLFNLFINDLVHVLDNAECEPPSLHNLFVSCLLYADDVVILSESDKGLQCA FT LNRLSAFCKKWKLQVNLKKTKVMIFNKSGRSFRNQTFVFDQDTLDIVPSYC FT YLGLTITSSGSFSLTKKQLSLKARKAMYSLKSLIRSSVSPKCLLKIFDSCI FT KPILLYGCEIWGTESINDNSPIEISHHRFCKNILGVSRNCSNLASRAELGR FT FPLDLDISVRVIRYWLRLLQMDPHRHLLQADALLYQHKSISNSRRRTLLQR FT VKEILDGSGHSYLFYSCNSLSDIRSICTMIRSRISDMFIQTSRHNLSIDTG FT KLRFYKTCKTAHTMEKYLELDNFDLRSAISKIRISAHPLEIERGRYKRLSV FT CERTCKQCTSNVVEDETHFMSNCSLYNQERKELFEEAIKFHPNFLTFTDKK FT KTVLLLTSQNPHIIEKVGSFVFHCLRKRSRTQ" XX SQ Sequence 6695 BP; 2071 A; 1603 C; 1343 G; 1677 T; 1 other; cgtgatcacc ggattggacg gacgcgcgag ttacgctctc tacaaatgtt gtaatttgcc 60 gctttttgag gtcgaattga gtccggaatt cgggtgtatt tagtttctaa cgttgttttt 120 gtctagttgt aaactttata gatcttaaac gaacgtttta gtttagtttt gtgaagttct 180 gtctctgagt ggctttcgtc gggcaccaag cgtccaaaat ggcgactgcg gcgtccccca 240 ccccaagccg gatctacgaa ctgctacaac tgccagagtc gacaacattc tccgatgtac 300 atgaatctta cgccacatac atccaggact acaccaggga gcgagaagct acgaagaagt 360 taaaacctgc tgagagaaaa gcctacatgt acaggaaaag cggtgccgag ctcagggagg 420 taagcgcggc gttcctctca cttttcgaga cgcacctggg tacaacttct gactgtcttc 480 gcggccccga acgctactct gtaactcaca acatttcatg catcaccgtt aatatacctg 540 gtgagtcagt ctcgtcttgg aagtcagttt gtggagcgta ttatggcgta gacgggatag 600 ataggggtcc gaatggtgtg cagtacagca ccactttcac tgacgctgac acacggatcg 660 gaacagaact cggctctctg cacattacgg tgtataagaa caagcttcta gtgcaaggaa 720 caagctacct gctgtggttc ctatcgcatt ataacattct acaaggtatg gtgctgtcgg 780 acatgaccga gaagagtgct gatgttaccc cagtttgcac caccagtgag ctccaacccc 840 ccatcgaaga aacaatctgc gccgtgtgta acaaacgtga agtagaaggg gacgagtttg 900 tgggctgtag cacttgttct agttggaccc attttgattg tactggactc gacgacgcta 960 ctaaatcaac tttgagaaac actgacgaga aatactattg tattaactgc catgtcccac 1020 gggtcgctac atgcacaata atgttgtctg atccaccaat taacactgac tcaaccacag 1080 cagggaaaac aatgacagcg cgtaagacca atcttgacca cagttcaaat gcagatgtga 1140 acaccgtcag aacaaaggcc gtacaaagta gcacctctga aacaaaggaa gaccagtatt 1200 cggcagctct ccacgagaaa attgaaacgc tacaaagggc cgtcatggag ctagagtcaa 1260 gcctggccag ttcaaagttg caagccaatg tgtttgagac aagtgtatcc ggacatctgg 1320 acgtcatgag aaaggaaatg gagtctgcta agtgcacgtg cagcagcaaa attgacgaca 1380 cagcatccag gaagcttcga gaagaaaacg gaaaactgcg gagcagggtc cagcacctgg 1440 aacaagagtt ttcctcactc aaaactactt cattgacttt gaaatcagaa ctagatgaca 1500 tgaaactctc acagagtctg ttaaaggtta aagtgggtaa cgtcaaggat ctcatcaagt 1560 cttcggaaca cgtcagtcac cccaagccag ctccaagaaa ggaagcagtc ataagtgaaa 1620 acaaccccga agtacctgtc ctagagtaca ctttgacaac ttcaaaccgc tttgatgtgt 1680 taaaccaaga agagtctgag agaacctccg aaagctcagt cgaagaggca gttgaagtct 1740 tggctgaagt tagatctcca aaagtgcatg ccgccacccg cacagtaaat gacgaaacgt 1800 catcagtttc cagacaggaa ggtcaagggc ggaaaagggt tgttttgatt ggtgactcaa 1860 acgcacagaa gttaaagcct aaccttttgt gccccacagc agacacctcc aagccatact 1920 ggtgccctaa cctgttctcg gcccccacag tgctagagga tatatctaat gaaggcatca 1980 ctccacaaac agtcatcctt catttaggta cgaacgatgt aatggccaag acaaaggaca 2040 ccgttataac agagtttgag gttaccgtca gcacagcaca gtcgctcttc cctcaggcgg 2100 aagtggttat ctccgctgtt ccgcctagaa gaaacgccaa gtaccgtcca aatgtcaacg 2160 aagacatctc agctataaac ctacatctgc agaagatgtg tgatacaaac aaacaactgg 2220 tctatgtcaa ccacccacag ctatggaaag gcaccgatga ctacaatgcc cacatgtacg 2280 aacctgacgg gtaccacctg agtggggacg gagtcagaac catggcttat aaccttaaaa 2340 aacatgcatc caaggctctc gggcttccac cattgcggaa ccacggaacc tcacctcggc 2400 gccaaggtca cagaaacgcc agtaacgcgt atcgaggaaa caatactcca tcctctcaaa 2460 gcaaaggtag gaagccccag cggaacatgt tcgacgagca aggaagccgt acgacgctgg 2520 cgccagatcg acccggtctg ccccaacacc aaagttctgt cggtctagcc aaaccgcccg 2580 cacctcctcc caggaattcg tactacaacc agcccagggc accaaaaggc gcgcccacga 2640 gcctacctgg tcaggacaga ggtccgcagt ggggaaactt ctcaccagga ccagccagcg 2700 ccccttacaa cagacagggt aacttcgccc cacccacgct acctgacgga ccgcctgctc 2760 ctgccccacc tttcttccct aaccggcccc ggcagatgcc accggcgccg tggttccagg 2820 agtggccttc tccagctgag gcgtacggca tgcccccagt cggcttcgga cagcaccgtt 2880 ggataggatc tccacccttc ccttacttcc acatggacag gggctggtat ggacaccccc 2940 cagacccgtc tttcggactc agtgacagac ctcatggatc ccgctgatca tgtgcatccc 3000 tcataagagc aaacagacta aggagtgccg cagaaatgac tttcctagta gcagtagaaa 3060 ctttggcata tcctttggtt tctggaacat acatggatta ggtagtaaat tagatgatag 3120 agatttcatt agtcagattg aagggttcga cttcttttcc ttatccgaaa cctggcatac 3180 cggtgagttg tcctttccta atttccttta tttcagtagt catagaacca aatcaaagaa 3240 agctaagcgt aattcgggtg gtttaatatt tatgtacaaa aagcaatata gatacctagt 3300 cactagattg aatagtaaaa gtgaagacat cctttgggtt aaaatagaca gacaattatt 3360 taaccacgac aaggatatat atttagcgtc tgtatatatc agtccgaaag gctctaccat 3420 tcattccaac agaacatacg atatttttga tatactcgag gaagagattg catattttag 3480 cgaccttgga gatatactag ttggcggcga tttcaacgcc cgagtaggga cgttaccaga 3540 ctatattgta aacgacgccc ctttagacat gcaaattctt cctacgaact acgattttga 3600 ccaccccctt cccagaaaca acatggatga agggcaacca acagttttcg gtaaaaacct 3660 aattgacctt tgcattacaa gtaaaataag aatactgaac ggaagaacac caggcgatct 3720 actcggcaaa cccacctgtt tccagcctaa gggctgcagt gtagtggact atatactggc 3780 gagtgaatct attgttctcc gcaaatctcc cttcttccat gtacattcac tatcaccatt 3840 atccgaccac tgcaaacttt ctctaatcat agaaggaaac cccttagctt accaaaagaa 3900 gaaaactaaa ctcagaacca gtccttaccg caaatttgtc tggagcactg actcaaaaga 3960 aaaatttatg gaaacgctta attcgacagc attttcatct agcatgagtt ctttcattcg 4020 caaaagctac gaaccaactc cagagtcgtt agagctagca cttcatgact ttgtcacgcc 4080 aatgcaggaa gccggtcaaa agtccctgtc catacgctgc accatagcaa aaagacagcg 4140 ctgcccaaaa caactaaata agaaaaaatg gtttaaccag acttgcgctt cagctaaaat 4200 caaactacaa aatatctgca aattactatc taaaaatccc agagaccctc atataagagg 4260 ctcctacttt atccagaaaa aacagtacat tagacttatc cgcagaacaa agaaagaata 4320 caaaaacaat atcttaaacc aactgaactc gttctcgggg ccgggggcca acccttcagc 4380 attttggtca ttactaaaag aatacaaatc taaagatcaa acacacagcc atgaaataat 4440 ctccgacgaa agtcttcatt cccattttag tagcctttac ggcgaggatt ccaccactaa 4500 cctgcctaaa ttacctaatg cttttgaaat tcaaatacaa agagagctat tgaatctaga 4560 agacatgctt caaaatacac attgcaattc attgaatgaa ccaattaata aagacgaaat 4620 taaacaagct atctctaacc tgaaaaacgg gaaagcttca ggtggtgacc tcattcgtaa 4680 tgagatgtta aaatgttcgt cacatattct cttagaaccg ctatcgaaac tttttaactt 4740 attcttatct tctggttact ttccccaaga ctggtgtacc agccacattg ttgctattca 4800 taaaggcggg gcaaaagacg accctggcaa ctacagagga atttcagtca ctagttgcct 4860 tgccaagcta tttacatcaa tactaaacac gcgcctaacc acatttctag acgaaaataa 4920 tctaatctcc ccaaaccaag ccggctttag aaaagaattt ggcacgaatg ataacttatt 4980 tgtcctggac tcgctgttaa gtaaacacct ttccgaaaac aaacgcattt attcatgttt 5040 tatcgatttt aaaaaagcct tcgactccgt ttggagagag ggtcttcgtt tcaaactgct 5100 aaactcggga atagggggta acttttacaa gctaattaaa tgtatgtaca ctaaacccca 5160 gacgtgtgta aaaaccgagt ccggtctaac gcctccattt gtaaccaaaa aaggcgttag 5220 acaaggctgc aatctaagtc ctacattgtt caacctattc attaatgacc tcgttcatgt 5280 acttgataat gccgaatgtg aaccaccatc tctacataat ttatttgtat cgtgcttgtt 5340 atatgcagac gatgtggtta tactttctga atcagataaa gggttgcaat gcgcgctcaa 5400 cagattaagt gccttctgca agaaatggaa actccaagtt aatcttaaga aaactaaagt 5460 tatgattttc aacaaatctg gccgatcctt ccgaaaccaa acattcgtat ttgatcaaga 5520 cactctggac atagtacctt cttactgcta tctaggcctg actattacat cgtcgggaag 5580 cttctccctg accaaaaaac aactttcact caaagctcgt aaagctatgt atagtttgaa 5640 atctctcatt cgctcatctg tctcgccaaa gtgcttgcta aaaatatttg attcctgtat 5700 aaagccgatt ctcctatatg ggtgtgaaat ttggggcaca gagtctatta atgacaactc 5760 tccaattgaa atttcgcatc accgcttttg taagaatata cttggagttt ccagaaattg 5820 tagtaattta gcatctagag cggagcttgg cagatttcct ttagacttag atatatctgt 5880 gcgtgttatt cgttactggc tacgcttatt acaaatggat cctcatagac atttattaca 5940 agctgatgcc cttttgtatc aacataaatc tatttctaat agtcgtagaa gaactttgtt 6000 acaacgagtg aaagaaattt tagatggtag tggccactcc taccttttct attcatgtaa 6060 ctcacttagt gatatcagat ctatatgtac aatgataaga tctagaatct ctgatatgtt 6120 tatccaaacc tctcgtcata acttgagtat agatactggt aagctaaggt tttataagac 6180 ctgtaaaaca gcacatacta tggaaaaata cttagaatta gataattttg atttacgctc 6240 ggcaatcagc aaaattagaa ttagtgccca ccctctcgaa attgagagag ggcggtacaa 6300 gaggctatcc gtgtgtgaaa gaacgtgcaa acagtgcacg tcaaacgtgg tggaagacga 6360 aacacatttt atgtcaaact gctctctcta taatcaagaa agaaaagagc tgttcgaaga 6420 ggccatcaaa ttccatccaa acttcctcac attcactgac aaaaagaaaa cagttctcct 6480 tctaacatca caaaacccac acataataga aaaggtggga tctttcgtct tccactgtct 6540 tcgaaaaagg agtcgaaccc aataacccag gatctagagt tagcttttac tagttttaga 6600 ctagagtaga cacytgtgat attgtatttt acattgtttg aatcttttat gttacctgca 6660 attagcccac gggcaagaat ttgcaataaa cttta 6695 // ID I-76_AAe repbase; DNA; INV; 7078 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An I non-LTR retrotransposon from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-76_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7078 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1347-1347 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 9 sequences with >96% CC identity. XX FH Key Location/Qualifiers FT CDS 3183..6881 FT /product="I-76_AAe_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MTSMATPNSSMTSLSSNRTDLHSRSTPLILQWNINGF FT FKNLSELERLVHNYEPLVIALQEPHRAFTDTMNKTLRKKYTWTTKCGQNIY FT HSVAIGVSIEIPHEPILIDSDLPMIAVRVSWPFPVTIVCLYLPNGRQPDLK FT NRLMKAFESVADPIIILADGNGHHEAWGNRPSNTRGRIIHDFATEMNLHVL FT NDGSSTFIRGQQETAIDISLASSTIANRFMWKTDKDSAGSDHIPIWINANN FT IAPETSRRPRWLYKKADWMNFQFALASTIDSQPTSIEVLTTRIYEAATVSI FT PRSSATPGRRALPWWSDGVKKAVKNRRKKLRAARRLPADHPEKKNAMTAYR FT VAHLACRQLIREAKEASWRDFLDGINDTQSSTELWSRVNAIQGKRRTRGIA FT LKINDHTTREPLVVADALADYFAGLSSHQKYPSNFLRKHPTPQQNVINFIV FT PPDKGQAFNRLFSEKELSFALSKANGQSAGPDEIGYPMLRHLPAEGKVALL FT EGLNREWTAGTLPSSWRTSLVVPLPKMTKSAHDVEHYRPIALTSCVAKIME FT RLVNRRLMEYLVENSLLDKRQHAFRAGYGTGTYFASLGQALNDALEKEEHI FT EMASLDLAKAYNRAWAPNILEQLAQWEISGNMLNFVKNFLSNRSFRVIIGN FT HRSKLVNEETGVPQGSVIAVTLFLISMNSVFGILPKGVQIFIYADDILLVV FT AGKHPAATRRKLQAAVNAVSRWTNRVGFEISAEKCSRIHICSEKHPVIRKP FT IKIEGKAIPTRKTVKVLGVTFDRHLNFRQHFDLVKENCRNRVNLVKSIANK FT RTKSDRKAQLRVANAVVCSKLFHGVEITSRAYEDMLLKLGPVYNNTIRAIS FT GLLPSTPASSACVEAGVPPFKFKAVTAISMKAVSFLEHTKNRRSETFITNQ FT TNRILNSVVNIKLPLVAKRQRNGAQSWQAKEPSIDSHIKNHFSRNCLPEEV FT RAHFKEWVRTKYPFTEIRYTDGSKVPGINNTNVGIGIHSSSYEEAHGLSKL FT CSVFSAEAAAIFQAASRHSEGPLLIVTDSASALSALQATENPHPWIQATQE FT IVAGQGNHITFMWIPGHKGISGNEEADRLANLGRSKRPLTRNIPGADIKMW FT LKNKTTDAWATVWQQDRTNFTGKIKNYVGTWKDLPNRRDQRVLSRLRTGHT FT RISHNFNGGVGFRKQCDICRVHNTVEHFICSCPSTEHLRQQYGISSIRMAL FT QNERNSETAILSFLKDAGMYEAI" FT CDS join(761..2305,2269..3183) FT /product="I-76_AAe_1p" FT /translation="MAASSWGPPPSSWGDAPPDGAYPGQIDGSYTAATLPP FT FMDSEQNHGELLLLRISGVDRPLPNAPFVIRKSVQRYVCGRIEGAFPEANR FT ATYALKVRSLRQFNRLLTMTQLIDGTPVKITEHPTLNSTRCVVSCRDVIDV FT TESELLEELKEQGVRDVRRITRRAGNGRENTPSIILTCRGTNRPDHIDFGY FT IRCRSRPYYPSPMQCFNCWCFGHTKLRCKSKTSTCGKCSGNHPIPEDRTCS FT NENYCKQCETKEHAISSRSCPLFKMENTIQRVKIDQGLSYPAARATVEGNL FT GTRSYSKAVEVGNNDAIAELTRKVDQLTAVVEKKDNEITELRSALAARNTP FT PPVDRSEIEELQAIIATQSNQIEALTNQLSNFLKMVMPAGTVALPSTSVQA FT SEVPVPMANDPPSKNIHLTQNNQKSIELEISTDSDSASPNQSPNRINPTIN FT LRTTRTSTAQHISKSTRPDTPVPTAAKSISSSPRTPNKRLLPRTEPVIQQQ FT QKRIRHKSIPEGTGTIPMRKHPGRNRDHSHAVVPSKTNIIQSLSLLHHFAV FT TTKRHTMPSFERQHNPSQTEKKHPKKQSCRRINKLEAMVQPDRPQHSIEQQ FT YLLFFVDSRGASVLEAIPQPESLGRLRHPEETDEEFCASDSRGTDRPEVKE FT PPEQFDRPRHPTDEENKESKNVKNPMRPPGSLGPSSADVFLTPELGDLPWR FT QGVVGRGTDKGWLSYTPKDDEGYVASPSKNENGEVRQIISRSSSRANKRRT FT FGHPGNLLPGGKSHPVSFYPVGLLLASEGPPSGALTHSTPITTKAKHTTCT FT DWASKRACSGHRLYNYSQ" XX SQ Sequence 7078 BP; 2174 A; 1738 C; 1586 G; 1578 T; 2 other; tgttcgtctt gctttcgctw cgaacagacg cgtttactac catcgtgcga agtttttatc 60 atcgtttgtt aaccgtttgc ttttaatccg ttttttgtgt tacacgtgtg ttctgtaatc 120 tgcaggggtt atcgtagctg gtctacgatc gcagtttttg tgtacgtgta tttacaacga 180 tatcgcgaaa acaacgaacg tttgattgta gaaaagcccc ccttatccca ttttgtgcag 240 ggagaggggg cggtgagact ccgcaagatc tcatttattg cgccaaagcg cattcgaaag 300 cctcaaagcg aagacagttt cgtgagagtg aaccctattg aacttkgctg actattgtgc 360 gggacatttc tctggtaata cttacctccc aacgcgtcgg tgggtgtccg aagtctttcg 420 gaatctcttt cgagagagta acatttgtcg tccgtgagtt tgggttacgt cgggtacaga 480 gaacgatact gtgagtggcc gtcagccgaa gaaaagagag ataagtgaga cgtgagactt 540 tctccaggaa gagaagtgag aaaagggcga gaagaaaata taaacactga gcaacaagca 600 acggcccacc aattcttgat gtgaagtgat cgaaaatatt tcctcgaata gtttcttgaa 660 gaactcctgg ttcacgtaaa gtggacagta aattcttgcc gtctagtgca cgaaatcctt 720 gcgtgtggtt ggtggtggtt ctctttcctc gggagccgcc atggcggcca gttcatgggg 780 accacctcct tcgtcatggg gagatgcccc cccagacgga gcatatcccg gtcaaattga 840 cggatcgtat accgcagcaa cgcttcctcc tttcatggac tccgagcaaa atcatggaga 900 gcttcttcta ttaaggatct caggtgttga tcgacccctc ccaaatgcac ccttcgttat 960 ccgtaaatcg gtgcagcgct acgtttgtgg acgtatcgaa ggtgctttcc cggaagcaaa 1020 cagggccact tatgctttga aggtaagaag cctcagacag ttcaaccgcc tactaaccat 1080 gacgcaactg attgacggca caccagtgaa gatcaccgaa catccgacac taaactcaac 1140 gcgatgcgta gtgagctgtc gcgacgttat tgatgttact gaatcggagc tcctggagga 1200 attgaaagag caaggagtta gagatgtacg gaggataact cgtcgagccg gaaatggaag 1260 agagaacacc ccgtcgataa ttttaacctg tcggggcact aatcgaccgg atcacatcga 1320 ttttgggtat attcggtgcc gatctaggcc gtactaccct agccctatgc aatgcttcaa 1380 ttgctggtgc tttggtcata ccaagctacg ctgcaaaagc aaaacaagta catgtgggaa 1440 gtgttcaggg aaccacccaa ttcccgagga caggacatgc tctaatgaaa actactgcaa 1500 gcaatgtgag actaaagaac atgccatctc tagccgttcc tgtccgctct tcaaaatgga 1560 aaacacgatt caacgggtta agattgacca agggctatcg tatccggcag ctcgtgcaac 1620 agttgaggga aatcttggca ccagatctta ttccaaagcc gttgaagtag gtaataacga 1680 tgccattgcg gaactcacca ggaaagtcga tcaattaact gccgtagttg aaaagaagga 1740 taacgaaatt accgagcttc gctctgcact ggccgcgcgt aataccccgc caccagtgga 1800 tagatcagaa atagaggaat tacaagccat tatagcaacc caatcaaacc aaatcgaagc 1860 tctcaccaat cagctttcca actttcttaa gatggttatg cccgcaggta cagtcgcctt 1920 gccgtcaaca tcagtacaag cttcagaagt accagtcccc atggcgaatg atccaccgag 1980 caaaaacatc cacctgacgc aaaataacca aaaatcgatc gaactcgaaa tctctaccga 2040 ctccgactca gcatcaccaa accagagccc aaaccgcatc aacccaacaa tcaatctacg 2100 aacaacaaga acttcaactg ctcagcatat ctccaaatcc actcgaccgg ataccccggt 2160 tccaaccgcc gcaaaatcaa tctccagctc tccaaggaca cctaataaaa gactgcttcc 2220 acgcactgaa cccgtcatcc agcaacaaca aaagcgtatt aggcataaaa gcatcccgga 2280 aggaacaggg accattccca tgcggtagtg ccatccaaaa ccaacatcat tcagtcgtta 2340 tcactactcc accactttgc agttaccacc aagagacata caatgccttc atttgaacga 2400 caacacaatc catctcaaac cgagaaaaaa cacccgaaga aacagagctg ccgacgcatc 2460 aacaaactgg aagccatggt ccaaccggac cgtcctcagc attcaatcga acaacaatat 2520 ctgctcttct tcgtggatag tcgaggcgcc agtgtactgg aagctatacc ccaaccggaa 2580 tcactgggtc gacttcgaca tccagaggaa acagatgaag aattttgcgc ctcggatagt 2640 cgaggcacgg acagaccgga agttaaggag ccaccggaac agttcgatcg acctcggcat 2700 ccgacggacg aagaaaataa ggaaagcaaa aatgtcaaga acccgatgag accccctggt 2760 agccttggcc cctccagtgc ggacgttttc ctcacaccgg aactggggga tctcccttgg 2820 cgccaagggg ttgtcggaag agggacggac aagggatggt tatcctatac ccctaaggac 2880 gacgagggat acgttgcaag ccccagcaaa aatgaaaatg gtgaagtaag acaaataatc 2940 tcgagatcat cctcacgagc aaataaaaga aggacctttg ggcaccctgg gaatctcctc 3000 ccaggtggta agtcccatcc cgtgtcgttt tatcctgttg ggttactgtt agctagcgaa 3060 ggcccaccta gtggagcttt gacccattcg actcccataa ccacgaaagc gaaacacaca 3120 acctgtacgg attgggcatc aaaacgggcc tgcagcggtc atagactgta taattactct 3180 caatgacatc aatggctacg cctaatagtt ctatgacatc cttgtcatcg aatagaaccg 3240 atcttcatag tagaagcaca ccgctgatcc tacaatggaa cataaatggt tttttcaaga 3300 acttgtcgga acttgaaagg ctggtgcata attatgaacc tcttgttatc gcactccaag 3360 aacctcacag agcattcacc gatacaatga acaagactct gaggaaaaag tatacctgga 3420 caactaaatg cggccaaaat atataccatt cagtggccat aggggtttca atcgagatac 3480 ctcatgaacc catattgatt gactccgatc taccgatgat tgccgtccgt gtttcctggc 3540 cttttccagt aacgatagtc tgcctctacc taccgaacgg caggcaacca gatttgaaga 3600 acagactaat gaaagctttc gaatcagttg cggatccgat tattatactt gctgatggaa 3660 acgggcacca tgaggcatgg gggaatcgtc ccagtaacac caggggacga atcatccatg 3720 atttcgcaac ggaaatgaat ctccacgttc tgaatgacgg ttcatcgacc ttcatccgag 3780 gacaacagga aacagctatc gatatctcct tggcatcctc cacaattgca aaccgtttta 3840 tgtggaaaac cgataaagac tcagctggta gtgatcacat cccgatctgg atcaacgcaa 3900 acaacatcgc ccccgaaaca tcccgaaggc ctcgctggct gtacaaaaaa gcagattgga 3960 tgaatttcca atttgcttta gccagcacaa ttgattccca accaacatcg attgaagtgc 4020 tcaccacaag aatctatgaa gctgcaaccg tatctatccc cagatcaagt gctacacctg 4080 gacgtcgcgc tctcccctgg tggtcagatg gagttaaaaa agcggtgaag aatcgaagga 4140 aaaaactgcg agccgcaagg cggttgccag ccgaccaccc cgaaaagaag aatgccatga 4200 cagcctatcg ggtagcgcat ctcgcttgcc gacagctaat cagggaagcc aaagaagcat 4260 cttggcgcga ctttctcgat gggataaacg atacgcaatc ttccactgaa ctctggagtc 4320 gggttaatgc aatacaaggc aaaagacgca ctcgaggaat tgccttgaaa atcaatgacc 4380 acaccacccg agaaccactt gtcgttgctg acgccttggc tgattatttc gccggcttat 4440 cttcgcatca aaaataccca agcaactttc taagaaagca tccaactcca caacagaacg 4500 tgattaactt tatcgttcct ccagacaagg gccaggcctt caatcgttta ttttctgaaa 4560 aagaacttag ttttgctctt agtaaagcaa acggccaatc agcgggccct gacgaaatcg 4620 gctacccaat gctaaggcac ctccccgccg aaggaaaagt tgcccttttg gaagggctca 4680 atagagaatg gacagctggc acacttccat ccagctggag gactagtctt gtcgttccct 4740 taccaaaaat gacgaaatcc gcacacgatg tggagcacta tcgacccata gccctaacaa 4800 gctgtgtggc gaaaataatg gaaaggttgg ttaaccgccg gctgatggaa tatcttgtag 4860 aaaactcttt actagacaag cgccagcacg cctttcgggc tggatatggc actggaacct 4920 attttgcctc actaggacaa gctctaaacg acgccttaga aaaagaggaa cacattgaga 4980 tggcttcgct tgacctagca aaagcataca acagggcttg ggcaccaaat atattggagc 5040 aattagccca atgggaaata tccggtaata tgctaaactt cgtgaaaaac tttttatcaa 5100 accggagctt cagggtcata atcggcaatc atcgctccaa attggtcaat gaggaaacag 5160 gagtgcctca aggatcggta atagcggtga ccctcttcct gataagcatg aacagcgtct 5220 tcggaatact ccctaaaggg gtacaaattt tcatctatgc agacgatatt ttattagtag 5280 tggccggaaa acatcctgcc gctaccagaa gaaaacttca agcggcggtg aacgccgtct 5340 cacgctggac aaatcgagtt gggttcgaaa tttcagccga aaaatgttcc agaattcata 5400 tctgctcgga aaaacatcca gtcatacgaa aacctatcaa gatagagggt aaggccatac 5460 caaccagaaa aactgtaaaa gtattaggag tgacctttga caggcaccta aactttcgac 5520 agcattttga ccttgtcaaa gaaaactgta gaaaccgggt gaacctggta aaaagtattg 5580 cgaataaacg tacaaagagc gatagaaaag cacagttgcg agtagctaac gcagtagtct 5640 gtagtaaatt gttccatgga gtagaaatca caagccgcgc gtacgaagat atgctactga 5700 aattaggacc cgtatataac aatacaatcc gggcaatatc aggcctgcta ccgtctactc 5760 ctgcctcttc agcatgcgta gaagccggag tgccgccttt caagttcaaa gctgtcactg 5820 ccatcagtat gaaagctgtt agcttcctag aacatacaaa aaacagaaga tcggaaactt 5880 ttatcaccaa tcaaacaaac cgaattctca actctgtggt caacattaag cttcccttgg 5940 tggccaaacg gcaacggaat ggagcccaaa gctggcaagc taaggagccc agcattgact 6000 ctcatattaa aaaccatttc agtcggaact gccttcctga agaagttcga gcgcatttca 6060 aggaatgggt aaggaccaaa taccctttca ctgaaattcg atacacagat ggatcgaagg 6120 tgcctggaat aaacaacaca aacgtcggaa ttggaatcca ctccagttca tacgaagaag 6180 cacatggtct ttctaagctc tgttccgttt tctcggcgga agctgccgct atcttccaag 6240 ctgcttctcg tcatagtgaa ggtccattgc tgatagtcac ggattcggcc agtgcactgt 6300 cagctctcca agcaacagag aatccacatc catggattca agcgacacag gaaatagtag 6360 caggacaagg gaatcacata accttcatgt ggatcccagg acataaaggg atttcaggaa 6420 atgaagaggc agaccgccta gcaaatcttg ggcgcagtaa aaggccatta acgcggaata 6480 ttccaggggc agatatcaaa atgtggttga aaaacaagac aacagatgcc tgggctaccg 6540 tatggcagca agacagaacg aatttcaccg ggaaaatcaa gaattacgtt ggcacttgga 6600 aggacctacc aaaccgtagg gaccaacgag ttctctcgcg actcagaact gggcacacaa 6660 gaatttcaca caactttaac ggtggagtag gtttccggaa gcagtgtgat atatgtagag 6720 tccacaacac cgtggaacat tttatctgta gctgtccgtc aacggaacat ctgaggcagc 6780 agtatggcat atcaagtatt agaatggcat tacagaacga acgtaatagt gaaactgcga 6840 tcctaagttt ccttaaagac gcgggaatgt atgaagcaat ttaaaactaa catcttcctt 6900 cggacacgaa aactggctcc acgaatgaag ggagaatcta aatagacatg ataatgttga 6960 tgcacaagaa aatactgata aatggtgttt aaaacattgt tacgcatata tgaacgagat 7020 tgagtggcga actcgcccag ggcgaaaagc cactataata aagataaaaa aaaaaaaa 7078 // ID BEL-2_DPu-LTR repbase; DNA; INV; 413 BP. XX AC scaffold_119; XX DT 13-MAY-2010 (Rel. 15.05, Created) DT 13-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-2_DPu_; KW BEL-2_DPu-LTR; BEL-2_DPu-I. XX NM BEL-2_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-413 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 652-652 (2010). XX DR Genome; scaffold_119; Positions 159988 159576. XX CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX SQ Sequence 413 BP; 81 A; 125 C; 46 G; 161 T; 0 other; tgtcgaaatt catgttcaaa ttctttctat cttgtctatt cttgttttcc cacgccgtat 60 tccccacctc tcttttgcaa catctcgaac cattctttac atcttcattt cctgtctcac 120 ccttcccctc tacatttcat catcccgact cttctcttct ctttttgctc acccttcctc 180 taatatgatc tttgtctcga ccaccactgt ttcaggtcag ttgaatttct gtcccggtcg 240 acgacgcact gttctcgtct cgtcgatctc tgcccctttc tctctcagta attttactca 300 atttgtctcc tcagttccca atacagattc tttcttttcc ttacctttcg cgtctaggta 360 attaaaaaaa cacacaacgt agggcctaga tttaatttaa gtgaattctt aca 413 // ID I_Ele25 repbase; DNA; INV; 7013 BP. XX AC . XX DT 06-OCT-2010 (Rel. 15.1, Created) DT 06-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE An I clade non-LTR retrotransposon family from Aedes aegypti. XX KW I; Non-LTR Retrotransposon; Transposable Element; I_Ele25. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7013 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-7013 RA Kojima K.K. and Jurka J.; RT "I clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Direct Submission to Repbase Update (06-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 29 sequences with >91% identity, and ~98% identical to the CC original sequence in [1]. Closely related to MosquI_Aa2. XX FH Key Location/Qualifiers FT CDS 264..1895 FT /product="I_Ele25_1p" FT /translation="MAASSSGPPPTAWWDGPENDFGGNRVDGTHSGPTLPS FT FMDPEGFHGELHLMRMSGANGPLPNRPFVIRRSVEKFIGGKIEGAFPEGRT FT SSYALKVRSQKQYCKLLTMSQLIDGTAICITEHPTLNSVRCVVSCRNVIER FT SEDELLEELKEQGIKEVRRITRRNGQSRENTPALVLTCRGTIRPDIVDFGY FT IRCRTRPYYPSPMQCFNCWLFGHTKLRCQAKXATCGXCSGDHPIAENRECS FT HESFCKSCNAKGHKISSRDCPXWQFENAVQRIKVDQGLSYPSARRIVEQNR FT SGRSFASVTGPATTXIXQQTNVXSEEQATILAAKDAEIAELRAALAARSAP FT PAXANQEIEQLKAXVAEQAKQIEALTVQISVFLKAVMPAANLSCPSEPTAT FT SIPSATSTVVLPVSSKINTTTSMLSTTSDGSINTPSKNHHTNDRITGTETD FT SGHTGQNVLSDSDTASSDRSVNVEFYSPNPMKTMVSPKTPKPTRTRTSLPS FT GSEITSDQVTPSVKRPISAVSRTESLFQLQKKTKKKASEGLGTIPKKR" FT CDS 2018..6619 FT /product="I_Ele25_2p" FT /note="endonuclease, reverse transcriptase and FT ribonuclease H." FT /translation="MASDRRSAGELEVLPQPASPDRLRHHAQRISPLTSLD FT SRGANVPDALPQPESLGRPRRLEGEMDEELLFLSVRCPAKTPGSQSPVGAV FT VSHSGTGGQPLALGEDLEEGNQASEPISLDDVGGEGFEVIGTRRALPATPM FT DNVGSQALFLQTNAKSTKTSNFEALLPSSIPAGSFAMQPGHPGHPNPGGKS FT HSVNSFSGTLSSPNSMVPGVSVRHHMQRYPNPVETSLSPNAPSIPSSGFSE FT TSLSDNRSQRIDSPSGLHGHFNPCDKSPSVISGLTTRPSETASLATNVAAA FT AHSQHINLGDRSFLRQTSQNSSGTSSISRTSFPTPTLPKPITFLLQWNMIG FT FYNNLANLELLVHDTAPWCLALQEVNRVSVEQLNRSLRGQYRWTMLRGSNL FT RHSVALGVLTSIPFEVLRLDSDLPVVGIRLNGPIRISVLNVYLPCGTLPNL FT RERIDRILEVIPGPTLFVGDMNAHHPIWGGHRSDARGIALQNIFEEHDLIV FT LNNGANTFFNGHTSSAIDVAAVSRSFVNRFQWSVDSDMYGSDHHPIRIGSC FT ASPPALARRPRWMYEKADWNEYTQSFRYLSRNHAPTNLTDLAKVIHNAAES FT SIPRTSNRPGRKALHWWTDDTRKAVKSRRKALRTVKRIPIGHPNKENALNL FT YRRRHMECKRIIADAKRASWENFLDSINPTQTSFDLWSKINALSGKRKATP FT LTLQIQGSDVSDPPAVAMALGEYFAKLAAIDSYSDFFKRRIQPTLSSVADF FT SVPDDHDNAPINNPFTLSELIFAIKCSTGKSAGPDGIGYPLIKNLPPSGKL FT QLLDLINKTWLTDSFPSEWSESLIIPIPKTNSDSRGPSKFRPVALTCCMSK FT VMERMVNRRLKQFLESHDLLDHRQHAFRAGHGTSTYFSQLGDVLHTAYSSG FT LHTEIVSLDLSKAFNRTWGPLVLRQLVQWNLSGHIMRFVQNFLQQRSFRVL FT VGDFLSDSYREETGVPQGSVIAVTLFLVAMNGVFSYLPSGIYVFVYADDIL FT LVVSGKTPVRVRIKTQAAVNAVYKWASAVGFELSASKSVRCHVCPSNHRNP FT VPIMVNGAVIPNKKTVKVLGVTVDHGLTFRHHFDSVKQSCQTRLNIIRTIS FT RPHRTNNRAVRFRVAQAIIDSRLTYGLELTCIALDRLVSTLAPVYHGYIRI FT ISGLLPSTPADSSCVEAGLLPFRFFIYTAVCRKAAAITEKTSGDDRIFILS FT EGCRILQSAANVDLPPVAKVHWYGDICWHSSKPNIDNAIANQFRAGDNSSF FT LRATVLDWLHNKYPSHEKRYTDGSLSNLGVGLGVSASDISLSLSLPPTCSV FT FSAEAAAIFIAATIPSDRPVLILTDSASVVSALQSERPTHPWIQGTLENAL FT PNTTYAWIPGHCGIPGNVAADNLAGTGHSSPRFEETVPFDDVKRWISKTFK FT NVWGTEWAQSYSPHLRKIKLSTENWIDCPLLHDQRVISRLRTGHTRVSHNM FT DGRGPFRRMCSICGVQNSVEHFICTCPQYESARRSFGISGSIRSALGNDAA FT TVSTLIGFLKAIXLYGQI" XX SQ Sequence 7013 BP; 1780 A; 1952 C; 1602 G; 1645 T; 34 other; tagcaaggga macggwttcc cccccaastm ttggggtgct ggggccgaaa tagacccttg 60 agmaggtttc gkaaaaccwg tggawccttt aattagtgcg cgawtcwtma ttatccccac 120 gtattgggaa aaggctkccw gtagtgcgcg agtgcaakcc gttmaaccgc cgcttcctgt 180 ttggtgtttc tgggcgctac ctgcatcaag gccgtgattc aggtgagttg gtggtggtgg 240 tgtctttaga gtaaggggcc gccatggcgg ctagttcgtc cggacctccc ccaaccgctt 300 ggtgggacgg tcccgaaaac gatttcggag gaaatagggt tgatggcacg cattctggtc 360 ctacgcttcc atcgtttatg gacccggaag gcttccatgg agaattacac ctaatgagga 420 tgtcgggggc aaatggacca ctccccaatc gaccattcgt aatccgaagg tcggtggaaa 480 aatttattgg tggaaaaatt gaaggggctt ttcctgaagg gaggacctct tcttatgcgc 540 ttaaagttag gagtcaaaaa caatactgta agttgttgac aatgagccag cttattgacg 600 gtactgcaat ttgcattacc gagcacccta ccctcaactc tgtccgttgt gtggtaagct 660 gccggaacgt tatcgaacgg tcggaggatg aactgttaga ggaactcaag gaacagggta 720 tcaaagaggt tcgcaggatc acgcgacgaa atgggcaatc gagggaaaat actccggcat 780 tggtcctcac ctgccgtggg actatccggc ctgatattgt tgactttggc tatattcgct 840 gcaggactcg accttactat ccgagtccaa tgcagtgttt caactgttgg cttttcggtc 900 acaccaaatt gaggtgccag gccaagaasg ctacctgtgg gamctgttca ggggatcacc 960 cgatcgccga gaacagggag tgtagccacg aaagcttttg caagtcmtgc aacgccaagg 1020 gtcacaaaat ttcgagcmga gactgcccwc watggcagtt cgaaaacgcg gtmcaaagga 1080 tcaaggtcga ccagggcctt tcctaccctt ccgcccgcag aatagtggag cagaaccgaa 1140 gcggwagatc tttcgcctcg gtgactggtc ccgccaccac tgstatcgwt cagcaaacta 1200 acgtgakatc wgaagagcag gcaacgatcc tggcggcgaa agacgccgag attgctgaac 1260 ttcgagcggc acttgccgcc cgttcagccc ctccggcckc cgccaaccaa gaaattgaac 1320 agctaaaagc cwtagttgca gagcaggcwa agcaaattga ggcactcacm gtgcaaatct 1380 cagtcttcct caaggctgtg atgccagccg ctaacttgtc atgtccatca gaacccactg 1440 cgaccagcat cccgtcagct acttccacag tcgtattacc cgtttctagc aaaataaaca 1500 ccactacctc gatgctatcg accacctcgg acggttcgat caacactcca tcaaaaaatc 1560 accatacaaa cgatagaatc acgggcactg aaactgactc tggccatacc ggccaaaatg 1620 tactctcaga ttccgacaca gcctcctctg accgcagcgt taatgttgaa ttttattcac 1680 ccaatccgat gaaaaccatg gtttcaccga aaacaccaaa accgacacgc accagaacct 1740 ctctcccgag tggaagtgag ataaccagcg accaagtcac accttcagtg aaacgtccaa 1800 tcagtgccgt ctcccgaacg gaatcgttgt tccaactaca gaaaaagacg aaaaagaagg 1860 cctccgaagg tcttgggacc attcccaaga agcggtaagt ccatcaacct ttttctctat 1920 tttctcaccg ccaccccaac cccccaaacc tgacaccccc gaaagagcct tccttaatgg 1980 taccgtaaat agcagcatac agcccaacac cgatttcatg gcttctgatc gtcggagcgc 2040 cggcgaactg gaagtcttgc cccaaccagc atcgccggac cgactccggc atcatgccca 2100 acgtatcagt cctctcacct ctcttgacag tcggggcgcc aacgtaccgg acgctctacc 2160 ccaaccggaa tcgttgggtc gaccccggcg tcttgaagga gaaatggacg aagagctcct 2220 gtttctttct gttcgttgcc cagctaaaac tcccggcagc cagagccccg tcggtgcggt 2280 cgtttcccac tccggaactg gcggacaacc tctggcgttg ggagaagacc tggaagaggg 2340 aaatcaagcc tctgagccwa tttccctgga cgacgtggga ggtgaaggtt ttgaagtaat 2400 cgggacgcgt cgggctctac ccgctacccc aatggacaac gtgggaagtc aagccctctt 2460 tcttcaaacg aatgccaaat caaccaagac cagcaatttt gaagccttat taccctccag 2520 tatccctgcc ggctctttcg cgatgcagcc tgggcacccc gggcacccca acccgggtgg 2580 taagtcccac tccgtcaata gcttttccgg tactctcagc tcgcccaact ccatggtacc 2640 tggagtctct gttagacatc acatgcaacg ctacccaaat cccgtcgaaa cttcactgtc 2700 accgaatgct ccttcaatac catcgtcagg gttttccgaa acatccctct cggacaaccg 2760 gtcccagcgg atagactccc cctctgggct ccacgggcac ttcaacccgt gtgacaagtc 2820 cccctccgtc attagcggtt tgacaaccag gccttccgaa actgccagct tggccaccaa 2880 tgttgcagcg gcagctcatt ctcagcacat caacctgggc gacagatcct ttctacggca 2940 aacgtctcag aactccagcg gaacctcatc gatttcccga acttcttttc ccacgccaac 3000 tctgccaaaa ccaattacct ttctccttca atggaacatg attggtttct ataataacct 3060 cgcaaacctc gaacttctag ttcatgacac tgcaccgtgg tgcttagctc tgcaggaagt 3120 gaacagggta tcggttgagc agctcaaccg ctcgcttcgt ggccaatatc gttggactat 3180 gctccgtggc agtaacttga gacactcagt cgccctgggt gtattaacgt cgatcccttt 3240 tgaggtcctc aggctcgact cagacctacc cgttgtcgga atacgcctta acgggcctat 3300 caggatttcc gttttaaacg tctatctccc ctgcggaacc ttacccaacc tgcgtgagcg 3360 gattgatcga atccttgagg tgatacccgg acctaccctt ttcgttggcg acatgaacgc 3420 tcatcaccct atttggggcg ggcaccgctc tgatgccaga ggaatcgccc tacaaaatat 3480 tttcgaggaa catgacctta ttgtccttaa caacggtgca aataccttct ttaacgggca 3540 tacctcctca gccatcgacg tagctgcagt aagtcgctcc ttcgttaacc gttttcagtg 3600 gagtgtggac tccgatatgt acggcagcga tcaccatcct attcgaatag gctcatgtgc 3660 ctctccgccg gctttggccc gtcggcctag atggatgtac gagaaggcag actggaacga 3720 atacactcaa agctttcgct atctttcccg aaatcatgca cccaccaatt taaccgattt 3780 ggccaaagtg atccacaatg ccgcggaatc atccatacct cgtactagca atcgaccggg 3840 caggaaggct ctacactggt ggacggatga caccagaaaa gcagtcaaat ctaggaggaa 3900 agcactacgc acggtcaagc gtattcccat tggccaccct aacaaggaga atgcactgaa 3960 cctatatcgg cggcgacata tggaatgcaa aaggataatc gcggacgcca agcgtgccag 4020 ctgggaaaat ttcttggaca gcatcaatcc aacacaaact tcctttgacc tctggagcaa 4080 gataaatgct ctcagtggaa agcgtaaagc aacaccactt acacttcaaa ttcaaggctc 4140 tgatgtttcg gatcctccag cagtagcaat ggctctaggc gaatatttcg caaagctagc 4200 agccatcgat agctacagcg atttctttaa acgccgtatt caaccaactc tcagttctgt 4260 ggctgatttt tccgttcctg atgatcatga taacgcgcca atcaacaatc cattcactct 4320 cagtgagcta atattcgcaa tcaaatgtag taccggtaaa tctgctggcc cagatggcat 4380 cggttatcct ctcataaaaa atcttcctcc ttccggcaaa ctacaactcc tggatctcat 4440 caacaaaaca tggctcacgg acagttttcc gtcagaatgg agcgaaagcc ttatcatccc 4500 cataccgaag actaacagtg actctcgtgg cccatccaaa ttccggccag ttgcactcac 4560 gtgctgtatg tcaaaggtga tggagcggat ggtgaaccgt cgactgaaac agttcctgga 4620 atctcatgac ctcttggacc atagacaaca tgcctttaga gcagggcacg ggacttcaac 4680 ctatttttcc caactcggtg atgtcctaca tacggcttac tccagtggct tgcataccga 4740 gatcgtttcc ctagatctat cgaaggcgtt caaccggaca tggggtccat tagttctccg 4800 tcaactagtt caatggaatc tatcggggca catcatgcgt ttcgtgcaga attttttaca 4860 acagcgttcc ttccgagtcc ttgttggtga cttcctatcc gactcatatc gggaagagac 4920 cggcgttcca cagggctccg tgatagcagt caccttattt ctggtggcaa tgaacggagt 4980 gtttagctac cttcccagtg gcatatatgt atttgtatac gctgacgata tccttcttgt 5040 cgttagcggt aaaacaccag ttcgtgtcag aattaaaacc caagcagctg tcaacgcggt 5100 atataagtgg gcctctgcag tcggcttcga gctgtccgca tcgaaaagcg ttagatgtca 5160 tgtgtgtcca tctaatcatc gtaaccccgt cccaattatg gtcaatgggg cagttattcc 5220 taacaaaaaa accgtcaagg tgcttggagt cactgttgac catgggctca ccttccggca 5280 tcactttgac tcggtgaagc agagttgtca aacccgccta aatataatcc ggacgatctc 5340 acgcccccac cgaacgaaca atcgggcagt gcgcttccgg gttgcgcaag ccataatcga 5400 tagcaggcta acatacgggt tagaactcac ttgtatagcg ttagaccgac tcgttagcac 5460 tctcgctccc gtataccacg ggtatatacg cataatatca gggttgctcc cttcaactcc 5520 ggcggactca agctgcgtcg aagcaggtct tctccccttc cgttttttca tttacacggc 5580 tgtctgcaga aaagctgccg ctattaccga aaaaacatcc ggagacgaca ggatttttat 5640 cctatctgaa gggtgcagaa tcctccagtc ggccgccaac gtggacctcc ccccagtggc 5700 caaggttcac tggtacggag atatttgttg gcactcatcg aaaccaaaca tagacaacgc 5760 aattgcaaat cagtttcgcg caggtgacaa ttcatcgttt cttagagcta ccgtcctcga 5820 ttggctacat aacaagtatc ccagtcacga aaaacgctac accgatgggt cactctcgaa 5880 ccttggcgtc ggtttgggag tttcggcctc cgacatttca ctaagcctta gtcttcctcc 5940 tacatgctcc gttttttccg cggaagccgc tgctattttc atagcagcca ccataccatc 6000 cgaccgacca gtattaatcc ttacagattc agcaagcgtg gtttctgcct tgcaatctga 6060 aaggcccaca catccatgga ttcaggggac gctggagaat gctctgccga acacaaccta 6120 cgcatggatt cccggacatt gtgggatacc cggaaatgta gctgccgata acctcgccgg 6180 gactggccat tcttctcctc gtttcgaaga aacggttcct ttcgatgacg tcaaaagatg 6240 gatttcgaaa accttcaaga acgtatgggg tactgaatgg gctcagtctt actctccgca 6300 cctgcgaaaa atcaaactct caaccgaaaa ctggatcgat tgtcctcttc tccacgatca 6360 acgagtcatc tcccgtttaa gaaccgggca tacgcgggtt tctcataaca tggatggacg 6420 tggtcctttc agacggatgt gctcwatctg cggcgttcag aactccgtcg agcactttat 6480 atgtacatgt ccgcaatacg aatcagcccg gagatccttc ggaatttcgg gaagtatccg 6540 cagcgcccta ggaaacgatg cggccaccgt gtcaacttta atagggttcc taaaagctat 6600 caamctgtac ggacaaatat gacccaacac tacaagccag cttcagtccg gctaagcagc 6660 ggcgatgtgc ccggggggcc accgacaaag tcggcggcta tacgcaatta ggttatcaaa 6720 tttttcaaaa taaggaagtt tcgcattatg cggtgcgcat ttacaattag gatatttcaa 6780 atcaagactg tctgtgggtg ttacatgtaa aacaacgaca ggtcgactgt ccagtgagtc 6840 agctaaaata gcgttactct gtgtttggcg agaccccttg gttgggctcg aaagaggcac 6900 cccttaaggg tcctcttctt tttttgaggc accatttcgg tgacctcttc tgacgagtgt 6960 tgaactagct ttaagttaaa aaacactaaa ataaagaaaa aaaaaaaaaa aaa 7013 // ID PIGGYB2_SM repbase; DNA; INV; 2513 BP. XX AC . XX DT 28-JUN-2007 (Rel. 12.1, Created) DT 07-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE DNA transposon from Schmidtea mediterranea. XX KW piggyBac; DNA transposon; Transposable Element; PIGGYB2_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2513 RA Jurka J.; RT "PIGGYB2_SM: PiggyBac-type element from freshwater planarian RT (Schmidtea mediterranea)."; RL Repbase Reports 7(10), 1094-1094 (2007). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 458..2308 FT /product="PIGGYB2_SM_1p" FT /translation="MARKRQTSQDIRNLFESIIDSDDEELANDNFDSDFSD FT FDIATDEAAVEMQNIETDEEQEVQNIEAAEEAEVEEEGIWYYAQNKKYKRK FT WRASPPNVTRVRDFQILRERQGISVETAQNINTHLAAFKKLLPETLINVIV FT AETNRKARHFYTINSNTARIWYDTNSNEIYALIGILIYAGAHKNWDERLNE FT LYHLDNRPFYRAAMSLQRVQQLLRFLRFDNYQTRAERLKEDKLAAFRDVWN FT MFLANIKGPYKPSSFLTIDEQLVLTRGRCSFRQYIPSKPGKYGIKIFWICD FT ALNSYPLKAEIYVGKQPNELRAVDYALNLVHRLSAPYINKGRTITMDNFFT FT SCPLAEQMLQKKTTIIGTIRANKPDLPKEFTNKAEIKNRVVNSSKFCFDDH FT LTLVSYVPKPNKNVLLLSTLHNDDIVDLDSNKPNIILDYNKTKGGVDTLDK FT LVRTYTCKRKTKRWPMVLFFNMIDITGIAAYNLYKCANPQLFIQRKSNERK FT IFLKELAMQLVKDHILNRISNPRAMKKNVKIAVQQIGYNIINLRQSSTAEG FT TAANRSVSKRPRCYICPRSADKKTGTRCYVCNQPSCTTHIKTVCINCINEE FT VMDSLEEENNEIMSDDASG" XX SQ Sequence 2513 BP; 921 A; 403 C; 446 G; 743 T; 0 other; ccttcagtct gtgcattggg gtcaaatgga ccccagtctt tgtaataagt gatataataa 60 gtttttacac caactaacta tgtcaagcaa ctagatttgc ctcaactaag gttttccaca 120 cacattacaa aagaaataat caaaaataca ttatatatga cgtagctacg atagttttaa 180 gcaatttaat tatattttgg tgcattgggg ttaatctgac cccaattttt ttcccgccta 240 acaatctata gaagttgtct ctttttcaaa tgtaaaaagt gaaattcagt gaatattatt 300 tttgaataga agtgttgtta ttataaatta atttgtttgt gataattata tattcatata 360 aaaaactatt tggtgactat ttttataagg tgagaatatt tatatatttt tgtggttgtt 420 gtgtttagtc taattttcta ggcatataat aatttggatg gcccgaaaaa gacaaactag 480 ccaggatatt cgaaatctat tcgaaagcat tatcgattct gatgatgaag agctagcaaa 540 tgataacttt gatagtgact tttctgattt cgatatagca acagatgaag ctgctgttga 600 aatgcaaaat atagaaactg atgaagaaca agaagtacaa aatatagaag ctgctgaaga 660 agctgaagtg gaagaagaag gaatttggta ttacgcgcaa aataaaaagt acaaaagaaa 720 atggagggca agtccaccaa atgtaactag agttcgcgac tttcaaatct tgagagaaag 780 acaaggaatt tctgttgaaa cagctcaaaa tataaatact catctggccg cgtttaaaaa 840 attacttccc gagacattaa taaatgtaat agtggccgaa acaaatagaa aagctcggca 900 tttctacact ataaatagta acactgcacg tatttggtat gatacaaatt caaacgaaat 960 ttatgcactg attggtatac ttatatacgc tggggctcat aagaactggg atgaaagatt 1020 gaatgaatta taccatctgg acaaccgtcc tttttatcgg gctgcaatgt cattacaaag 1080 agtgcagcag cttttacgat tcttgcgctt cgacaattat caaactagag cagaacgttt 1140 gaaggaagat aagctagcag ctttccgtga tgtatggaat atgtttttgg caaatatcaa 1200 aggaccttat aaaccatcat cattcttaac tatagatgaa caactggtat taactcgtgg 1260 ccgatgttca ttcagacaat atataccatc aaagccaggc aagtacggga ttaaaatttt 1320 ttggatttgc gatgcgttga attcgtaccc attaaaagca gaaatttatg taggcaagca 1380 accgaatgag ctacgtgcgg ttgattacgc attaaattta gttcatagat tatctgcacc 1440 atatatcaat aaaggacgaa ctataactat ggataacttt tttacaagct gtccattagc 1500 agaacaaatg ttacagaaaa aaacaacaat aattggaaca attcgcgcaa ataaaccaga 1560 tctacctaaa gaatttacga acaaggccga aattaagaac cgagtcgtga attcttccaa 1620 attttgcttc gacgatcatc tgacgttggt aagttatgtt ccgaagccca acaagaatgt 1680 attgttacta tccactctac acaatgatga tattgttgac ctcgattcaa ataaacccaa 1740 tatcatattg gactataata aaacgaaggg aggagttgac acactcgaca aattagttcg 1800 tacctatacc tgcaaaagaa agaccaaacg atggccaatg gttttatttt ttaatatgat 1860 tgatatcaca ggtatagctg cttataacct gtacaaatgt gccaatccac aactattcat 1920 tcagcggaaa agcaatgaac gtaaaatctt tttgaaggag ctggctatgc aattggtgaa 1980 ggatcatata ttaaaccgaa tatcgaaccc acgggcaatg aaaaaaaatg ttaaaatcgc 2040 tgttcagcaa ataggatata atataataaa ccttagacag agttcaacag ccgaaggaac 2100 agcggcgaat agatctgtca gtaagcgacc aagatgttat atttgccctc gatccgctga 2160 taagaaaaca ggaacaagat gttatgtttg taatcagcca tcttgcacaa cacatataaa 2220 aaccgtttgc ataaattgta tcaacgaaga agtaatggat tcgctcgaag aagaaaacaa 2280 cgaaatcatg tcagacgatg cgtctggtta attttttttc tgtaatataa tataatttaa 2340 attatcttaa tcttaatctt atatattttg aataaaaaag tttttttcat taattaataa 2400 aaataacatg actatttttt aacaaaaaga tcataaaaag cattaaaata tatatggggt 2460 cgggttgacc ccatgcacag actatggaaa atttactatg cacagactga agg 2513 // ID BEL-142_AA-LTR repbase; DNA; INV; 553 BP. XX AC supercont1.305; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-142_AA_; KW BEL-142_AA-I; BEL-142_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-553 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.305; Positions 1021894 1021342. XX SQ Sequence 553 BP; 224 A; 75 C; 119 G; 135 T; 0 other; tgtttagcac gtaagcaaaa taaaaggaat acatttctta gattgaattt tgaatcaccc 60 taaccgtccc aaaatcaagc gcccagttcc aaatgaaacg gtatggggcg caacatacga 120 agagacagat gtaaacaaaa tgtaaaagag atcaataaaa gttatgagcg tagagagaat 180 caataggtac atgtatagga agatacaaca caggttagca cagctggttt tgggtgaaag 240 tgaaagtttg ttgacaagtt agttaagaat gcgaaacagc tcagagcagc tcgttgcaat 300 accggtcgcg caagttaaat aaaattacca tcgtagatcc aaaagttaaa cgtgtagtgt 360 taacaaattt attttaaaca acattatgaa taataaaaga aagtttagaa taaaaggaaa 420 gtgttaaagg tgttataaaa ataaagaagg aaggtagagg aaagtgtgac cagtgaaaac 480 aaaagaataa gtgaattgta ttaccggttc cgacgttgga cccggaattt ataactgatg 540 aatttcatgt aca 553 // ID TBKIN1 repbase; DNA; INV; 1004 BP. XX AC J01454; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE T.brucei kinetoplast closed minicircle repetitive sequence. XX KW Origin of replication; TBKIN1; Repetitive sequence. XX OS Trypanosoma brucei OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Trypanosoma. XX RN [1] RP 1-1004 RA Chen K.K. and Donelson E.J.; RT "Sequences of two kinetoplast DNA minicircles of Trypanosoma RT brucei."; RL Proc. Natl. Acad. Sci. U.S.A 77, 2445-2449 (1980). XX DR GenBank; J01454; Positions 1 1004. XX SQ Sequence 1004 BP; 380 A; 62 C; 213 G; 349 T; 0 other; cagaaaacag tataatttta gtagtatagg ataaaatatc tacagaaata tggcaaggtg 60 gttagaggaa aagaaatatg ataatagata agaattagaa ttttatagtt atatatgata 120 gtaaataaaa caaacagtgt atatggtctc agagatattg tataattatg gtgatttata 180 gttattaatt attgtaatat atttattatt atattttaag ccaagggaga taaaaatgat 240 agaattagta tggagtaagt tgggtgagga tgggagttgt aattgtaata ttgaagttaa 300 gaagatgtag gtaaagttag gtaaagttag gtaaagttag gtaaagttag gtaaagttag 360 agggtggtat atgaaaagtt gaagttagaa cgtaatagat aaaactattg aaaatggtga 420 aaatggtgaa aaaatagcga tttctgagct cgaaaaaacc gaaaatctta tgggcgtgca 480 gatttcacca tacacaaatc ccgtgctatt ttggggggtt tttgaggtcc gaggtacttc 540 gaaaggggtt ggtgtaatac tcacacggtt tttcctcgag attttcatga ttttggtgtt 600 tgtgggtttc gagactagat gtttgtgatt ttaatttgag atttatccta tgaaaagaaa 660 tgagataata gatagacttg aagtaattat agataatatc attgtatata tattaacaaa 720 taagccatta acaggtagat gaagtgtata tatagattat aaattttata tattatttat 780 gtatatattt attatattat tttttattat agggagatag gaggtgattt gatcttggtg 840 agataagaga aatgggataa tagatacgat ataaaagata ttataattaa tcatagtata 900 tatactgggt aatcatggat ttatgtagtg agataaagtg agtaaataac tataaaataa 960 agtaaattaa tatactatta tattctttta tttatatagg gctg 1004 // ID Gypsy-23_DYa-I repbase; DNA; INV; 4674 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_DYa_; KW Gypsy-23_DYa-LTR; Gypsy-23_DYa-I. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-4674 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 3035812 3040485. XX CC 'TACT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 842..3487 FT /product="Gypsy-23_DYa-I_1p" FT /translation="MPYHNMTGGGAGLRNEKRAELCVVDEPQTSMLLRGED FT FPIFFDSGAECSLVKESFAEKLSGRKVNNVVSLKGIGNANVCSTMQIDNNI FT LINGNSLNVTFHVVPDVCMSNNIVLGREVLPEGYAAVVSCKGVEIIKDETK FT IVKTCSKFIDNSFDIIESNVNGENKNEIITLLKSYSDSFIQGASQTRVNSG FT EMQIRLIDQTKTVQRRPYRLSPSERNIVRQKVQELLDCKVIRPSSSPFASP FT VLLVKKKNGSARLCVDYRELNSNTVADKFPLPLISDQIDRLRGAKFFTCLD FT MASGYYQVPMHPNSVEYTALVTPDGQYKFLSMPFGLKNAPSVFQRAVIKAL FT GDLAYSYVIVYMDDIMIVAPTKELALERLKIVLNILSKAGFSFNVEKCFFL FT RTSVQYLGFEVRAGEIGPNNRKVESLNALPPPRSVTTLRQFIGLASCFRQF FT VPGFSQLMKPLYFLTSEKNKFEWNTEHEAIRRKIIHILTNKPVLIIFDPQY FT PIELHTDASAIRYGAILLHRIENKPYVVEYYSKTTSVCESKYTSYELETPA FT VVNAVKHFRHYLHGRKFVIFTDCNSLKSSRNKIELLPRVYRWWAYLQSFDF FT DIQYREGKRMAHADFFSRNPLPPENSPEYAKIVEKRIELTEVSEDWIRAEQ FT QKDQDIVEIITKLNDDELPFELSKTYDLRGGVLYRVIQRNGKTLCLPVVPR FT GFRWSVINQVHESIMHLGWEKTLDKVYDYYWFEGMAKYVRRFVENCITCRV FT SKSSSGKVQMELHPIPKINVPWHTIHIDITGKLSGKSDRKEYVIVQIDAFT FT KYVYLYHTFTLSAESCVMALKSSISLFGNPIRIVADQGRSFTGAKFVEFCS FT SQNIKLHLIAIGASRGNGQVERVMSTLKN" XX SQ Sequence 4674 BP; 1556 A; 756 C; 1080 G; 1282 T; 0 other; tcagaagtgg gattgtactt tcgtctacta atcatggaat accaactgcg catcttgatg 60 gagcaccaaa gccaaagttt cctggaattg gcaaaaactt tgcaagcgtc ggtacgagaa 120 caggcacaaa cttgagtcgt cacattgttg aaatttaacg ccgacattcc tggcgccgac 180 gctactcaat ggtgtgctac ggttgacatt attttggttg aaaaaccgct ggaaggcagt 240 gctctaatat tggccttaac aaaatcgttg gaaggaagct ttctgaagtg tgctacgcag 300 gcgtgacttg gccggaattt aaacagttat ttcaacaacg ctatgcaaat actgaaacgc 360 cagccgccat gtttttgaag ttattaaact gtcgccccgg caacagtgaa tgccttgcga 420 tatacgctag tcgcatggtg acttcgcttg tcaacaagtg gaagggcttg gatgctgaga 480 ggattgcggt gtccgtggtt ttgggacatt tggccaactt tgaccgccga gtgcaaagaa 540 ttgcatacac tgccgagatc aaaaatagac gtgacctaca ggctgagttg caggtcttcg 600 cttttgacaa gcgcacacat cattctggaa atgaggaatc tggagcaaag cgagctaagt 660 tggcgtcact gaagtgccat ctttgcggaa agatggggca caaaatgatt gattgccgaa 720 acaagaagca gagcgccggc atgtccagac aggatcaacg aggaagccaa ccatttcagc 780 agcattttaa agacaatata acttgctatc gttgccgaga gcaaggtcat atagcctcac 840 aatgccctat cataatatga caggcggtgg agctggattg cggaacgaaa agagagctga 900 gctgtgcgtg gtggatgaac cgcagacaag tatgctgtta cgaggtgagg actttccaat 960 ctttttcgat tctggagcag agtgctcttt ggttaaagag agttttgcag agaagctaag 1020 cggtagaaaa gtaaataacg tagttagctt aaaaggtatc ggaaatgcaa atgtttgtag 1080 tacaatgcag attgataaca atattcttat taatggaaat agtttaaatg taacatttca 1140 tgttgtaccc gatgtttgta tgtcaaataa tattgtactg ggacgagaag tgttgcctga 1200 gggatatgct gcagtagtat cttgtaaagg ggttgaaata attaaggatg aaacaaaaat 1260 tgtaaaaacg tgctctaaat tcatagacaa ctcatttgat ataatagaaa gcaatgtaaa 1320 tggagaaaac aaaaacgaaa taataacatt gttaaaatcc tactcagatt cttttataca 1380 gggtgcgtct cagactcgcg tcaattctgg tgaaatgcaa attcgattga ttgatcagac 1440 caaaacggta cagagaagac catatagatt gagtcccagc gagagaaata tagttcgaca 1500 aaaagttcaa gaattgttag attgcaaggt tatcagaccg agtagctccc cttttgcgag 1560 ccctgttcta ttggtgaaga agaagaatgg atcagccaga ctttgcgtgg attatagaga 1620 attgaattcc aatacagttg ccgacaagtt tcccttgcct ttaatatctg atcagataga 1680 caggctgcgc ggtgcaaagt ttttcacctg cttagacatg gcaagtggat attaccaagt 1740 tcctatgcat ccgaactctg tcgaatatac ggcattagtc acccctgatg gacagtacaa 1800 gttcctgtcg atgccattcg gactcaagaa tgcaccgtca gtttttcaaa gagcagtcat 1860 aaaggccttg ggtgatctcg cctattcgta tgtcattgtg tatatggacg acataatgat 1920 tgtagctcca accaaggaat tggcacttga aagactaaaa attgttttga atattttatc 1980 aaaggctgga ttctcattta atgttgaaaa atgtttcttt ttacggactt ctgtccaata 2040 tttaggattt gaagtacgag caggggagat tggcccaaat aatcgcaagg tagaatctct 2100 gaatgctttg ccccctccga ggtctgtaac aacattgaga cagtttattg ggctagcatc 2160 gtgctttcga cagtttgttc ccggattttc tcagctaatg aagcctttat attttctcac 2220 ctcagagaaa aataagtttg agtggaatac agaacacgag gcaattcgaa gaaaaattat 2280 tcacattctt acaaataaac ctgtcttaat tatctttgac ccgcagtatc caatcgaatt 2340 gcataccgat gccagtgcaa tccgatatgg tgctatactc ttacacagaa ttgaaaataa 2400 accgtatgta gtggaatatt atagcaagac aacaagtgta tgtgaatcca aatatacctc 2460 atatgagttg gaaacacctg ccgtggttaa tgctgttaag cattttcggc actacttaca 2520 tggcagaaag tttgtaatat tcactgactg taattcgtta aaatcatcac gaaataagat 2580 tgagttgttg cctagagtct atcgctggtg ggcttatttg cagtccttcg attttgatat 2640 acagtatagg gaaggcaagc gtatggccca tgctgatttc ttttccagaa atccacttcc 2700 tccggaaaat tcccctgaat atgcaaagat cgttgagaag cgaatcgaat taactgaagt 2760 atctgaggat tggattcgtg cagagcaaca gaaagatcaa gatattgtag aaatcattac 2820 caagttgaac gatgatgaat taccattcga gctgtctaag acatatgatt tgagaggtgg 2880 agttttgtat agagtgatac aaagaaatgg caagacgctg tgtttgccag tagtgcctcg 2940 cggatttaga tggtcggtta ttaaccaagt acatgaatcc ataatgcatc tgggctggga 3000 gaaaacactc gacaaggtat acgattacta ttggtttgag ggcatggcca aatacgttcg 3060 aagatttgtt gaaaattgca taacgtgcag agtttccaaa tcatcctctg ggaaagtaca 3120 aatggaatta catcctattc ccaaaataaa cgttccatgg catactatcc acatagacat 3180 aacaggcaag cttagtggta aaagcgatcg taaggaatat gtaattgtac agatcgatgc 3240 atttaccaag tacgtttatc tttaccacac tttcacatta agtgcagaaa gttgtgtaat 3300 ggcgttgaaa tcatcaattt ctttgtttgg aaaccctatt cgtattgtgg cggatcaggg 3360 aagaagtttc acaggtgcca agtttgttga attttgctct tcccaaaaca taaaactaca 3420 tttgatagca ataggagcaa gccgaggtaa tggccaggtt gagagggtta tgagtacctt 3480 aaaaaattaa ttgacagctg ttgaaacaag ttctagatct tggcaagagg cattgggcga 3540 cgtgcagtta gctatttatt gtactgctaa taagataacc aaagcgagcc cgttagaatt 3600 attaatggga aaggttgctc gtcctttagg tcttgttcca attcatgatc cagaaacaga 3660 aatagatctg accaagataa gaactcaagc tttagaaaat atagaaagat cggcaaagtg 3720 tgagaaggaa aggtttgata aaggcagagc taaggtagta aaatttagtt ttggagattt 3780 tgtgttgctt aataatagtg agagaaacca gactaaatta gatgctaaat atagaggtcc 3840 atttgagatt acagaagtat tggataatga cagatatacc ttaaagagta tgaagggtaa 3900 tagacgttat aagtatgcgc atgaaagatt gagaagattg ccaaatgggc aggtgccaag 3960 cgagctggag atcgatgata gcccgtcagt ggaagctgtt gaaagagaca gtagtgaagc 4020 cgttgaaaga gacggtatgg aagttgatga aagagaaagt gaatggagat gttgaaatag 4080 agagtagaat ggcggaaggt gtatggaagt acacagttgt cgatggctac gtggcggtac 4140 ctgaccctag agtggtatta taatgatgtt attgtttgga aagaaaataa aaaataaaat 4200 aaaaaaataa aagagaatga ttaaaagttg agataaaaga agagagaaat taagtaagaa 4260 atgatacgaa tgaattgaat tgaattcaaa ttagttgaac tatggagaga aacatttaaa 4320 gaattattaa aagaatgatt tgagttgtat tatgaaaagt taaggaatac tttgatttga 4380 aatatgaaaa attaaaaaaa aaaaaaaaag atctttaacg aaggatttaa ggagaaagat 4440 gtaaaacgaa atagtttaga acttgaagta ttagaacaaa tgatgaagta taacgttggt 4500 taatgtaaac taaagattat gttaaaggaa aacaacgatg ttaatgacta agactaagac 4560 taagttgatt gtcacattaa aattagaact gaagtcagaa tgcagtgtgt gaatctctga 4620 tttatggcag gacgtagctc atacacgagg gcgtgtaatt tatcagaaag gccg 4674 // ID Gypsy-9_TCa-I repbase; DNA; INV; 4066 BP. XX AC chrUn_2; XX DT 21-FEB-2011 (Rel. 16.02, Created) DT 21-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the red flour beetle genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_TCa_; KW Gypsy-9_TCa-LTR; Gypsy-9_TCa-I. XX OS Tribolium castaneum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia; OC Tenebrionidae; Tribolium. XX RN [1] RP 1-4066 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the red flour beetle genome."; RL Direct Submission to RU (21-FEB-2011). XX DR Genome; chrUn_2; Positions 136026 140091. XX CC Positions [1455-1880] - Reverse transcriptase CC Positions [3064-3558] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 21..2351 FT /product="Gypsy-9_TCa-I_2p" FT /translation="MPTDRYVTIQRDRLAQMTRHLDEEWQIDVVYGLLRHK FT IRDKIPRSDIGSFNELLNRARAIEENEKESAAGPVKSQKTNGTTTVKANGG FT KNRARCSFCRAFGHDVGECRKLKTNTTSGPPSAPNAIVPRVSTDIRCYGCQ FT RPGVFRRNCPDCNPPTAPVSAPTTDPTGFCAINTSSQPRRRPTVNIEILGI FT TGTALLDTAAGNSVASISLYQHLLRQGCTPRKEKVEATLADGGTRLMEVPT FT VTTNVKLAGKITKTTFIVLSTSEDTKTLLGADFIEDSGVIIDLKRQQFYFE FT QNPNEKVAFANDCATPTDEDIRTVFQATKVEFLSPLDPEYIANGYGPPPPD FT NSSMIPSVITPGGTWNIYQAPGGADYMLQDANEMLMHYDQYHDEYQTDLFA FT TIPYKTEICSLELRPTEGQFLTTTEKHKINRLVKEYRDLFAEFGPPTPYAT FT HRINTGDSLPVAVPPYRLTCEKRKVLQMEIERLLQQGIIEECDSAWASPVV FT MVPKANGTIRLCVDYRKLNAVTKPDVYPLPRLDDLLHATGKIGCITTLDLQ FT AGYWQIQVEPGDRDKTSFICPFGLYRFTRMPFGLRNAPASFQRLMDKFKTG FT IPDVPILAYLDDLIIISPDGHTHIRHLRSTFDQLRRFQLRINRNKCLIGCS FT KVRYLGHRISPSGIAPDEKKVEAIKQIPPPRNLKQLQSFLQTCSWFRRFID FT QFASVARPLSELTKKNASWKWGNAQDEAFTTLKEKLTTAPILRAADPTQPF FT ILRTDSSAYALGAALLQGEGPDEETR" FT CDS 2299..4029 FT /product="Gypsy-9_TCa-I_1p" FT /translation="MHSVRLFSRGRAPTRRPVEYASRLLTSSERNYSTTER FT EALAIVWAISKFRGYVGENSTTVITDHQPLRWFMSLKTPTGRLARWALQLQ FT PYNLVIEYTPGKANTIADFLSRPATEESSMICTVQIDLPARSDADIRNEQL FT KDPEVKKILDELMSSNPEIAQPWSSRGYVTNRGVLYRYSQADDYEEAQLVV FT PHHERLHIMKQHHDSPTGGHYGVDKTYSKIACRYYWPGMRRDVSDYLRTCI FT ACQRYKASNLKPPGLLQTPVLQQRLEVISIDLFGPLPPSNCGEKWIFIVED FT TCTRWVELFPLVDATAEMCAWTLLNDVCFRYGLPRRIISDNGSQFVSAIMQ FT KLTFCLGISQSFTPVYHPEANPVERKNRDLKTQLAIQIGDNPHSTWPEKLP FT TIRFAMNTTPCSSTSVTPAFLMFGRELRTIDDVTNDLRHIVESETFVAEAT FT PKLLLLSDTWKKARETHEDKQDHRKEAADRTRRTAPEFQPGDLVLVQLHTL FT SSSAKEFSSKLAPKRDGPYVILRRHGPTSYEIAAQDQPSQALGTYHSSQLT FT PYRGLEKEVPPPVVPIRRRGRPRKHNVQNQ" XX SQ Sequence 4066 BP; 1184 A; 1069 C; 902 G; 911 T; 0 other; ccgccttcca accagaagga atgcccacgg acagatatgt gacgatccaa cgggaccgcc 60 tcgcacagat gacacgccat cttgacgaag aatggcaaat tgacgttgtg tacggcctac 120 taagacataa aattcgcgac aaaattccgc gcagtgatat tggatcattt aacgaacttt 180 tgaatcgcgc gcgagcaatc gaagaaaacg aaaaagaatc ggccgcgggc ccggtaaaat 240 cccaaaaaac gaatgggacc acgaccgtta aagccaacgg cgggaaaaat cgcgcacggt 300 gtagtttctg tagagcattt ggacatgatg ttggtgaatg tagaaaacta aagaccaaca 360 ccacttccgg accaccttct gcaccaaatg caatagtgcc gcgagtgagt acggacattc 420 ggtgttacgg atgccaaagg ccgggggtgt tccgacgcaa ttgtccggat tgcaatccac 480 ctactgctcc agtaagtgct ccaaccacgg atccaactgg attctgtgca ataaacacaa 540 gcagtcaacc tcggcgtcgg ccaacggtta atatagaaat cttgggcatc acgggaaccg 600 cattgttgga caccgccgcc ggcaatagtg tggcaagcat aagcctgtat caacacctcc 660 tccgacaagg ttgtactcca cgtaaagaaa aagtggaagc aactcttgcg gatggtggaa 720 cccgcctgat ggaagtcccc actgtaacaa ctaatgtaaa gctggcaggc aaaatcacaa 780 aaaccacttt tatcgtgctc tcaacctcag aggatacaaa aactttgttg ggagccgact 840 ttattgaaga cagtggagtg atcatcgacc taaagcgtca gcagttttac ttcgaacaaa 900 atccaaatga aaaggttgcc ttcgctaacg attgcgccac tcccaccgat gaagacattc 960 gtaccgtgtt ccaagcaaca aaggtagaat ttctgtcacc cttggacccc gaatatattg 1020 ctaatggata cggccctcca ccacccgata attcaagtat gataccgtcc gtaattactc 1080 ccggaggtac gtggaacatt tatcaagctc ccggtggcgc cgactacatg ctacaagatg 1140 caaacgagat gctgatgcat tacgaccagt atcacgatga ataccagacc gatctatttg 1200 cgacgatccc ttacaaaacg gaaatttgtt cattggagct tcgccctaca gaaggtcagt 1260 ttttaaccac cacggagaag cataaaatca accggttagt gaaggaatac agagacttat 1320 ttgcggaatt tggtccacca accccatacg ccacccatcg aattaatacc ggtgacagcc 1380 tccctgtagc agtacctcca tatcgcctga catgtgagaa acgaaaggtc ctccaaatgg 1440 aaatagagcg actccttcaa cagggtatca tcgaagaatg tgatagtgcg tgggcgtctc 1500 ccgtggtaat ggtacccaaa gccaatggga ccattaggct atgtgtggat taccgtaaac 1560 taaatgctgt aacaaaacca gatgtctatc cacttcctag actggacgat ctgttacacg 1620 caactggaaa gattggttgt attacaacgt tggacctaca ggccggctat tggcagatcc 1680 aggtggaacc gggagatagg gataaaacat cttttatctg cccttttgga ctatatcgct 1740 tcacacggat gccattcggt ctacgcaatg caccggcttc gttccaacgt ttaatggaca 1800 aatttaaaac cggcattccc gacgttccca tactggcata cctggatgat ctcattataa 1860 tctcccccga tggtcataca catattcgcc acttgagaag tacttttgac cagttacgac 1920 gctttcaact acgcataaac aggaacaagt gcttgatcgg atgttcgaag gtacgatatc 1980 ttggacatcg tatctctccc tctggcatcg ccccagatga aaagaaggtg gaagctatta 2040 aacaaatacc acctccacga aacctgaagc agctgcagtc cttcttacaa acctgctctt 2100 ggtttcgccg atttattgat cagttcgcat ccgtggcacg tccactaagt gaactaacca 2160 aaaagaacgc ttcttggaaa tggggaaacg cacaagacga ggcattcacc actttaaagg 2220 aaaagttgac aacagctcct atattacgag cagccgatcc aacccaaccg tttatcttac 2280 gaaccgatag cagtgcttat gcactcggtg cggctcttct ccagggggag ggccccgacg 2340 aggagacccg ttgaatacgc tagccgcttg ttaacttcat cggaacgcaa ctattcaacc 2400 acggaacggg aagccctcgc catcgtatgg gccattagca aattcagagg atatgtgggg 2460 gaaaactcaa ctaccgtaat caccgatcat caacctctgc gatggttcat gtcgctaaaa 2520 actcctacgg gacggcttgc tcgatgggct cttcagctcc agccatacaa tctggtaatc 2580 gaatatacac cgggaaaagc aaatactatt gccgactttt tgtcacgacc agcaactgaa 2640 gaatctagta tgatctgcac agtccaaatt gatttaccgg ctcgctcgga cgccgacatt 2700 cgcaacgaac aattaaagga cccagaagta aagaagattc tcgacgaatt aatgtcctca 2760 aatcctgaaa tcgctcagcc ctggtcttct cgtggatacg tcacgaatcg aggtgtgttg 2820 taccgctact cccaagccga tgattatgaa gaggcccaac tcgtagtacc acatcatgaa 2880 cgtctccata ttatgaaaca acaccatgac tcccctaccg gcggccatta cggcgttgac 2940 aaaacctaca gtaagattgc ctgccgctat tactggcctg gtatgagaag agatgtttcc 3000 gactacctgc gcacttgtat tgcttgccag cgatacaagg cctctaacct gaaaccccca 3060 ggactcttgc aaaccccggt acttcagcaa cgcctggaag ttatttcgat agacttgttt 3120 ggacctctac ccccttccaa ttgcggagaa aaatggattt tcatagtgga agatacatgc 3180 actaggtggg tggaactatt ccctcttgtt gatgcaactg ccgagatgtg cgcttggact 3240 cttcttaatg acgtgtgctt tcgatatggc ctacctcgac gcataatcag tgacaacgga 3300 tcacaatttg tgagcgccat aatgcagaaa ctcaccttct gcttaggaat aagccaatcc 3360 ttcactccag tttatcatcc ggaggctaac ccagtagaac ggaaaaatcg ggaccttaaa 3420 acgcaactcg ctatccaaat aggcgacaat ccacactcca cttggccaga aaagcttccg 3480 accatacgct ttgctatgaa tacaacgccc tgttccagca ctagtgtcac tcctgcattc 3540 ctgatgtttg gacgagaatt acgaaccatc gacgacgtca ctaatgattt acgacacata 3600 gttgaaagtg agacctttgt ggccgaagcc acaccgaaac ttttactgtt gagcgatacg 3660 tggaaaaagg ctagagaaac acatgaagac aaacaagatc atcggaagga agccgctgat 3720 cgcacccgaa gaacagcccc cgagtttcaa cccggagatt tggttttggt ccaactccat 3780 actctcagca gtagcgccaa agaatttagt tcgaaactgg ccccaaaacg ggatggtccc 3840 tacgtaatcc tacgacggca tggccccact agttatgaaa ttgcagccca ggatcaacca 3900 agtcaagcac tagggacata ccacagctct caactcacac cataccgcgg cctggaaaag 3960 gaagtgccac cccctgttgt gcctattcgc cggcggggac gaccccgaaa acataacgtc 4020 cagaaccagt gatcgggacg cttcacatgg ttctgagggg gagaac 4066 // ID R1_Ele7 repbase; DNA; INV; 3371 BP. XX AC . XX DT 18-OCT-2010 (Rel. 15.1, Created) DT 18-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE A Waldo non-LTR retrotransposon family from Aedes aegypti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1_Ele7. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3371 RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3371 RA Kojima K.K. and Jurka J.; RT "Waldo, (AC)n microsatellite-specific families of non-LTR RT retrotransposons from the yellow fever mosquito."; RL Direct Submission to Repbase Update (18-OCT-2010). XX DR [2] (Consensus) XX CC [2] Consensus update and extension. This consensus is generated CC from 13 sequences with 87-93% identity, and ~92% identical to the CC original sequence in [1]. Both sides are truncated, but quite CC similar to Waldo elements. XX FH Key Location/Qualifiers FT CDS 1548..3368 FT /product="R1_Ele7_1p" FT /note="reverse transcriptase." FT /translation="HRMVTACDATMPRKLEPRNKRRPAYWWNDTLSTLRTA FT CLRARRRAQRAGTEMERRERRTAFREARTALKREIKLSKSNCYKELCRDAD FT TNPWGNAYRVVMAKIKGPSTPAEMCPDKLKLIVEGLFPKHDPTSWPPTPYG FT EDAEAIAEERQVSNDELVEAAKRLKSKKAPGPDGIPNVALKAAILAYPDMF FT RKVLQKCLDEGNFPGIWKIQKLVLLPKPGKPLGDPASFRPICLLDTLGKLL FT ERIILNRLTKCTESERGLSQMQFGFRKGLSTVDAIRAVLECAEKASKQKRR FT GDRYCAVVTIDVKNAFNSASWEVPDYLCKILKSYFQSRILVYETNEGQKSM FT QVTAGVPQGSILGPTLWNGMYDGVLTLRLPRKVKIVGFADDVSLTVMGETL FT EEVEMSVMETIDAIEGWMSGVKLQIAHHKTEVLLVSNCKAVQQMEVNVGRY FT VIAPKRVLKHLGVMIDDRLNFNSHVDYACEKSEKAMNAIGRIMPNLGGPSS FT STRRLLANVSSSILRYGVPAWGKALETKRNREKLNRAFRLMAVRVASAYRT FT ISSDAVCVIAGMIPMHHFGGGYRVLPTKRRQKRTEDGQSRIYGEVEHEWDC FT AEKGRWTHR" XX SQ Sequence 3371 BP; 973 A; 739 C; 1027 G; 629 T; 3 other; gactccaaag agagccagaa ccttgccggk agatgggaga ccaggtggat ccaagaagca 60 gacgcgacag gatactaggg acgccgctcc aagagagatg cgctagtaca aacgccactc 120 cacgggagaa tgagtggaga aaagtcgttg gcagaaggga gaaaagacaa cagcaaaagc 180 ggcggaggta caaaaaatgg agcagaagaa gaaagaaaag cctccacttc ggcagatgcc 240 accaaagagg gacgccgtac tcgtcaaggc taacgatgct acgacgtacg cagcgcttct 300 caagagcgtc agagaagacc cgatgttaag ggatctgagc gaaaacgtgg tcaaagccag 360 acgtacccaa aaaggtgaga tgctctttga gttgaaggag aatmctacag tttggctatt 420 caggagcgca tgactcaatc gctgggtgat aaggcagaag tgaaggctcc aacttgtctg 480 caaggatctg gacgaaatta cgtcagaaga agagctgagt ggtgcactta agtcacagtg 540 taacttaggc gacgtgcaaa tggcgatccg attgaggagg catacgagac agtgatgatt 600 cgtctgccgg tgaccgctgc gaacaaggat tggatggtcg aaatgcgcga ttgagcaagc 660 agaaaaatgt ttcaagtgct taagctttgg acaccgggca ggcgcttgca acgggccgga 720 cagatccaaa atgtgctgga attgtggggg aacgggccat cttgtgaaag actgcacgca 780 gaggccaaag tgcatgctat gcactccaga agatggaaac gaccatcaaa cgtgtggctt 840 caaatgcaca gcagctgttg tggcagtcaa cgacagaaaa tagtgtgacg tggcgattat 900 tgcagagccg tatcaagttc cacctgataa cggcaactgg gtggcggata gaacgggaac 960 ggctgcgata catgttatgg gcagattccc tatccaggaa gtggtggaca attcaggtga 1020 aggcttcgtc atcgccaaaa tcaacggtgt tgwtgtatgc agttgctatg cacctccgag 1080 gtggacagtg gagcagttca gccagatggt ggatcagcta actgacaagt tgatcggaag 1140 gaagccgata ctcattgcag gagacttcaa tgcctgggct gtggtgtggg gcagcagagt 1200 gaccaacgca agaggatata tcctgcagga agccttagcg aagctagatg tacgattgtg 1260 caatgaaggc tccgttagca catttcggag aaacggaagg gaatccatca ttgacgttac 1320 gttttgcagt cctgcactgt cggcgaatat gaactgagag tttgtgaggg gtacacccag 1380 cgaccatcta gcgattcgct accgcatcgg tcaacggaac cctgcgccaa caatgggaag 1440 aataactagc gagcggaagt ggaagacgaa agcctttaac gaggaactat tcgttgaggc 1500 acttcgtcaa gacggcgata acgaggatat cgacgccacc gagctaacac aggatggtaa 1560 cggcttgtga cgcaactatg ccgcgaaaac tggagccgag gaacaaacgg cgaccagctt 1620 actggtggaa tgacacgctc agcacgcttc gcactgcttg tctcagagcc agaaggcggg 1680 ctcagagagc agggacggaa atggagagaa gggagcgaag gacggctttt cgagaagcta 1740 ggaccgcact caaacgggag atcaagctta gtaaatctaa ttgctacaag gagctgtgcc 1800 gagatgcaga caccaatccg tggggtaatg cgtatcgagt cgttatggca aagatcaagg 1860 gcccttcgac gccagctgag atgtgcccag acaagctgaa gttgatcgtg gagggtctct 1920 tcccgaagca cgatccaacg tcctggccac caacgccgta cggtgaagac gcagaagcaa 1980 tcgccgagga acggcaagtg tctaacgacg agcttgtaga agcggcgaaa cgactaaaat 2040 cgaagaaagc ccctggtcca gatggaatac cgaacgtggc gctgaaagct gccatcctgg 2100 catatccgga catgttcagg aaggtgttgc agaagtgcct tgacgaaggt aactttccag 2160 gaatatggaa gatccagaaa ctggtactgc tgccaaagcc agggaagcca ctgggagatc 2220 cagcctcgtt taggcccatt tgcctgctgg atacgctcgg aaaactcctg gaaagaatca 2280 tccttaacag gctgacgaag tgtacggaga gtgagcgcgg gttgtcgcaa atgcagtttg 2340 gattccgcaa aggcttatcg acggtagacg caattcgagc agtactcgag tgtgctgaaa 2400 aggcgtcgaa gcagaaacga agaggagatc gatactgcgc cgtggtcacg atagatgtga 2460 aaaacgcatt caacagcgcc agctgggagg tccccgacta tctgtgcaag atcctgaaga 2520 gctacttcca gagcagaatc ctggtgtacg aaacaaatga agggcagaag tcaatgcaag 2580 tgacagcggg cgtccctcaa ggttccattc tcggtccaac tctttggaac gggatgtacg 2640 acggagtctt gacactgcgg ctacccagga aggtgaagat tgttggtttt gcggatgatg 2700 tgtcactaac ggtgatgggc gagacgctgg aagaagtgga gatgtcggtg atggaaacaa 2760 ttgacgcaat cgagggctgg atgagcggag tcaaactgca gatagctcac cacaaaacag 2820 aggtactgct agtcagcaac tgcaaggcgg ttcagcagat ggaggtcaac gtcggaaggt 2880 atgtgattgc accgaagcgc gtgttgaagc acctgggagt aatgatcgac gatcggctga 2940 atttcaacag ccacgttgat tacgcatgcg aaaagtccga gaaggcaatg aacgcaatag 3000 ggagaatcat gccaaactta ggcggtccaa gcagcagtac gaggcgtctg ctagctaacg 3060 tctcgtcatc gatactgagg tatggagttc ctgcttgggg caaggcacta gaaacaaagc 3120 ggaaccgcga aaagctgaac agagcgtttc ggctgatggc agtgcgagtc gcgagtgcgt 3180 atagaacgat atcgtcagat gctgtgtgcg ttatagccgg tatgattccc atgcatcact 3240 ttggcggagg atatcgagtg ctaccaacga agagacgtca gaaacgtacg gaagatggcc 3300 agagtcggat ctatggcgag gtggagcacg agtgggactg cgcggagaaa ggaagatgga 3360 cccaccggct t 3371 // ID Gypsy-49_AA-I repbase; DNA; INV; 6668 BP. XX AC supercont1.288; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-49_AA_; KW Gypsy-49_AA-LTR; Gypsy-49_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6668 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.288; Positions 224321 217654. XX CC Positions [4869-5345] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2789..4798 FT /product="Gypsy-49_AA-I_1p" FT /translation="MLGTAKLKSIKIPSNRALKAYATNVPMKVDFAFRAEI FT EPLESTNTVLADFLVVMEGRRSLLGRTTASDLELLFVRAQINSCEVVTDKQ FT FPKMPGVTVKFSVDESVTPTRNAYYNIPAAYREGARRRLLEMEESGIIERV FT TAAPEWISGMSAVPKGKNDFRLVVNMRAPNRAIKREYFRLPILDEMRVKLH FT GAKYFTKLDLKSAFYHLELSPESRDLTTFLAENGMFRFTRLMFGVNCAPEI FT FQREMTRILENVENKIVYIDDVLLFADNLDDLHEIVNQVLEILKENNLTLN FT EDKCEFDKTQITFLGHELDEDGFHVEEAKIGAIRKFRRPSTSSELRSFLGL FT ASFVSPYLKNFANISSPLWSLVGSKTWFWGEKQEDAFNEIKEQIIRCTVSL FT GYFSEHEDTILYTDASPNALGAVLVQESAGRNPRIISFASKALTDTEKRYA FT QNQREALAAVWAVEHFSYFLLGRHFTLRTDAKGITFIFNRSRESPKRALTR FT ADGWALRLSPYSYTVEFVKGSDNVADPSSRLYQGVDQPFGEESSPWEIAHV FT EANMARFLGEEEIREFTTNDEVLQMVVVVCHKFLNKLIFIFLKLIGYQSHR FT LRRMVKTFEQISGSFGRLVRQRWHTNEIRMCRDPGKITPQNASGSTRGTSL FT SSKTEEHSTRACLVAGNVCGC" FT CDS 4785..5789 FT /product="Gypsy-49_AA-I_2p" FT /translation="MSADAENWVKSCKTCAVNGRPEKPPPMRRVFAPQTVW FT DTIALDFNGPYARLGGISILVIIDYRSRYAIAKPVKSTCFESTRKVLDDVF FT NREGFPLAIKSDNGPPFNSDDYKSYCQERGIQTIFSTPFFPQQNGLVENFM FT KVVNRAMSTAISNSSNFNDELQAAVNAHNASAHTVTKMPPEEVLSGRKIRR FT GLPLINPGKVDIDEQLLNQRDADAKLHSKAIADMRRGAKPCSVSSGDVVIL FT ERQTKSKGDTRFDEKRYTVIKENNGSLLLSDDEGKLCRRHVSQTKKVHEWR FT KLDSEDVVPAKQSRTTATSSRSVRQKRLPTHLSDYVRLTEQDH" XX SQ Sequence 6668 BP; 2191 A; 1247 C; 1509 G; 1721 T; 0 other; ttggcgatcc tgccaggagt gtaaatgtta aggataatag caaatggaaa agaatcactc 60 gaccgaagat tatttaagaa aaccctaaaa gtgattttgg tgtgcaacaa aaagtatcag 120 ggagaggaaa acaaaatggc gttttttttt tcggagtacc accgtagtga atcagtaagg 180 atggctgaat cgcacttgca aaaaaaaaaa aatggagacg agtaatgaag accgtgaata 240 gaagaaagtt ttgtttgaaa gaatgataga cgtgtaagga aaaccgtctt cagatagtgc 300 atagtgaaaa ccacaaatta aaaacttttc gtgaatggtg cataatgaaa accaccaatt 360 aaaaatgttc ttgtgaatgg tgcgtaatga aaaccgccaa tcaaaaagtt ttcgtgaatg 420 gtgcgtaatg aaaaccacca attaaaatgt tctggtgcgt aatgaaaacc accgattaaa 480 aaagttttcg cgaatggtgc gtaatgaaaa ccaccaatta aaaaaaatcg tgaatggtgc 540 gtaatgaaaa ccaccaatta aaaacttttc gtgaatggtg cgtaatgaaa accaccaatt 600 aaaaactttt catgaatggt gcgtaatgaa aaccaccaat taaaaacttt tcgtgaatgg 660 tgcgtaatga aaaccaccaa ttaaaaagtt tttcgtgaat ggtgcgtaat gaaaaccacc 720 aattaagagg ttctcgtgaa tggtgcgtaa tgaaaaccac caattaaaaa cttttcgtga 780 atggtgcgta atgaaaacca ccaattaaaa agtttttcgt gaatggtgcg taatgaaaac 840 caccaataaa aaagttttcg tgaatggtgc gtaatgaaaa ccaccaatta aaaaattcgt 900 aaatagtgcg taatgaaaac caccaattaa aacgttttca tgaatggtgc gtaatgaaaa 960 ccaccaatta aaatgttctc atgaacggtg tgtagttaaa accaacaatt aaaatgttct 1020 cgtgaatggg gtgtaatgaa aaccaccatg aatccactaa tttcaaagga tacgtaataa 1080 aaaccgtaat gaatggattt attgaataat ttggatataa aaataattct gttaaatttt 1140 gttaacagga ctgaaatcaa tgatagtaat atttaaaaga attagagaat ctacaaccgt 1200 gagcgataat gccaaactct aattcaattt gttttcagtc attttacagt ttgcggacaa 1260 tttataaacg tgaacaagtt caaaggtgtg aagagtgaaa atgtcgaacg aaggcaagta 1320 cgttcgtatg taccgccata gcgacgatga ggaagaagct gaaaccagtg gtgcgggaag 1380 gtatgttcgt gccccactag tgtccaacga agaatcagga cacgatgagc acgaatttga 1440 cgattacgat atcgtggtag accataacaa gcagaacgaa atgtttaaac aacaactagc 1500 gaagatgcag aaagctatcg aggaaataac ttcgatgaac atgcgatccc gggaacgtag 1560 tggaagtaat gatgccgaag aaagttggac cttctctcgt aaccatccat cgtcacaagc 1620 aaatgaatac gtgcctagca tccggtggga tcagatgaag ccattcccga agaatattcc 1680 tgcgaacaag atgtgggaag agtgggcgaa gtatatcgaa aacttcgaaa tcgctactac 1740 cttatcaaat gccaacgatc ctgtgcgccg ttctcaactt ctctttctat caatgggtga 1800 agagttgcag ggcattgttc gcgcagctaa actgaggcca aatctgaatg acagcagttg 1860 ctaccgaatc tttgtgaaga acgttgaaga acatcttcgc tccatgaccg atgcgaacgc 1920 ggaacacgaa gaatttttcc gtatgcgaca ggaaagagga gaaactgttg tatcttttca 1980 tgccaggctc atggaaaagg cgctcaattg tcgttatggc ccaagcgtcc tggaacgatt 2040 tgttcatgcg cagttactca aaggtaagaa cgactgaagc taaaaaaaaa aagaaaaaat 2100 tgcttattga tctatttagg tatgacgaac caagaactgg ctaaaatggc gaggacattc 2160 ggccataaaa caagtgatat tgtgctagca gcaactcgcg acgaggcaca tcgaactgaa 2220 tcaacgcagg cagttaatac agacgattca ttggtacacc aggtgcgtag accaccattc 2280 agggctccgg gaaaacgcta tggcagttac agcaaaccgg atgatcctcg caacaaaaag 2340 actcgtttcg acagtcgtcg tctcagtcag aaacaagaac gttgttcaag atgcaataag 2400 tggatgcata agaaccgacc gtgtccagca ttgagactca aatgtcatga atgtggttca 2460 tttggtcact tcgctgtggt ttgcaagaag aaacgcgtca atgccatcgt tgatcagact 2520 cctcaggaag cgggattgga gcctaaggaa gaggaggtaa aatgaattaa ttctctacaa 2580 tgaaattcgt gaaatataat cttgaactat agctaactaa gtttctatct tattttatcg 2640 ttgatgacag ctcaatgcat tatctttcga agatgtactg atcaaatgtc gattgggctc 2700 ctccagtccc gtacaatttc tcatcgactc cggggcagat gtcaacgtga taggtggaaa 2760 tgattgggaa accctgcaga aggaatatat gttggggaca gctaaactga aatcaattaa 2820 aataccatcg aacagagctc taaaagcata tgcaactaac gttccaatga aggttgattt 2880 tgcttttaga gcagagattg aaccactgga atctacaaat accgttttgg ctgactttct 2940 cgtggttatg gaaggtagaa gatctctcct tggaagaacg acggcaagcg atttggaact 3000 gctatttgtg cgagcgcaaa tcaattcctg tgaagttgtg acagacaaac agttcccaaa 3060 aatgccaggt gtcacagtta aatttagcgt agatgaaagc gttacaccga ctcgcaacgc 3120 ctactacaat atccctgccg cataccgcga gggggctcgc cgtcggctgt tagaaatgga 3180 ggaaagtggt attatcgaga gggtgacagc ggctcctgag tggataagtg gtatgtcggc 3240 agtgccgaag ggaaagaacg atttcaggct cgtcgttaat atgagagcac caaacagagc 3300 aatcaaacgc gagtactttc gtttgccaat actagacgaa atgagagtta aattacatgg 3360 tgcgaaatat ttcactaaac tggatctgaa gagcgcattc tatcatctgg aattgagccc 3420 agagtcccga gatctaacaa cgttcctggc tgagaacgga atgtttagat ttacgaggtt 3480 aatgtttggc gtaaattgcg cgccggaaat tttccaaagg gaaatgacca gaatcctgga 3540 gaatgtcgag aacaaaattg tgtatataga cgacgttcta ctattcgcag acaacttgga 3600 tgatttgcac gaaatagtga atcaggtact cgaaatcctg aaggagaata atcttactct 3660 gaatgaagac aaatgcgaat ttgataaaac acaaatcacg tttctcggac atgaactaga 3720 tgaggatggc tttcatgttg aagaggcaaa gattggcgcg atacggaaat ttcggcgtcc 3780 ttctacatcg tctgagctac ggagtttcct tggattagca tcgttcgtta gtccatactt 3840 aaaaaatttt gccaacattt cgagcccctt gtggtctttg gttggatcaa aaacttggtt 3900 ctggggcgaa aaacaagaag acgcgttcaa tgagataaag gagcaaataa ttcgctgcac 3960 tgtatcattg ggctattttt cggaacatga ggacaccata ttatacacag atgcttcacc 4020 aaacgcatta ggggctgtgc tggtgcaaga atcagctgga agaaaccctc gaatcatcag 4080 ctttgcgtct aaggcgttaa cagatacgga aaaacgatat gctcagaacc agagagaagc 4140 cctagctgct gtctgggcag tggagcactt ttcttatttc ttgttgggca gacatttcac 4200 tctacggacg gatgcaaaag gaatcacttt tatcttcaac cgttcccgtg aatcaccgaa 4260 aagagcccta accagagcgg atggctgggc tttaaggctt agtccgtaca gctacactgt 4320 tgaatttgtc aagggaagtg acaatgtggc ggatccctct tctcggttat accaaggagt 4380 tgaccaacct ttcggcgagg aatcgagtcc ctgggaaata gcacatgtag aagcaaacat 4440 ggctagattc ttgggcgaag aagaaattcg agaattcaca accaacgacg aagttttaca 4500 aatggtagtt gtggtttgcc ataagtttct caataaacta atttttatct ttttgaaatt 4560 aataggttat caaagccata gactcaggag aatggtcaaa acatttgagc agatttcagg 4620 cagtttcgga agacttgtcc gtcaaagatg gcatactaac gaaattagga tgtgccgtga 4680 tcccggaaaa attacgccac aaaacgcttc aggtagcaca cgcgggacat cccttagcag 4740 caaaactgaa gagcattcta cgagagcgtg tctggtggcc gggaatgtct gcggatgctg 4800 aaaattgggt caaatcatgc aaaacatgtg ccgtgaatgg aaggccagag aaacccccac 4860 ccatgcgtcg tgtgtttgct ccacagactg tttgggatac gatagcactg gacttcaatg 4920 gtccgtacgc gaggcttgga gggatctcca tcctagtgat aatagactac aggtcaaggt 4980 atgcaattgc taaaccggta aaatcaacct gctttgaaag tacgcggaaa gttttggatg 5040 atgtgttcaa tagagaagga ttcccattgg ccataaagtc tgacaatggt ccaccattca 5100 acagcgacga ctacaaatcg tattgtcagg aaagaggtat tcaaaccatt ttctctacac 5160 catttttccc ccagcaaaat ggcctggtgg aaaatttcat gaaagtggtg aatagagcta 5220 tgtcaacagc gatatcaaat tccagcaact tcaatgatga attgcaggca gcggtaaacg 5280 ctcacaacgc ttcagcccac actgtcacaa agatgcctcc agaagaagtt ttgagtggca 5340 gaaaaatccg ccgaggtcta cctttgataa atcccggtaa agtcgatatt gatgagcagc 5400 tactgaacca gagagatgct gatgcaaaac tgcactctaa agctattgct gatatgcgac 5460 ggggagcaaa accttgcagt gtcagctctg gtgatgttgt catactagaa aggcaaacaa 5520 agtccaaggg cgatacaagg ttcgatgaaa aacgttacac agttatcaaa gagaacaacg 5580 gcagtttgct attaagtgat gacgaaggta agttatgtcg tcgtcatgtc tcacagacta 5640 aaaaggtaca tgagtggcga aaactcgact ccgaagatgt agtaccagca aagcagagtc 5700 gaacaacagc tacgtcttct agatccgtaa gacagaaaag acttcctact catctatcgg 5760 actatgtcag gctcacagaa caggatcatt agaactaaag tgctaagttc ataacacctg 5820 taaataaaac tacctgtttg aatctatctg catactcact gttttttttt ttgtgaagat 5880 cagggaagcg cactgaagca acctccactc attagcctgt ttttccttag cgcacaagaa 5940 aggaccatat ttttgttgct cctatgatgg taacgtctta ttcaggtggc acatgaaaag 6000 ggtttttttt tgtttttcgc gtgcagggaa attttatcgg ataaaatttc agaatcgcgg 6060 aaaagctcca aaattttggt ggttttgttt tggttttgtt atacttgcct ttgacagttg 6120 taactgggtt tgctgaatat gcgtaaattc attgacgtcg aaaaaggagg gggtctgctg 6180 aaaagaaaac cacacggaaa cgcgtattta ctcaaacctg aggtgcatat acgcaaaacc 6240 gtaattagga atattaaccc gaaatctaat gtccactgaa ttaggttcat tcaactcaaa 6300 catgagaata ttatatatgc attcacttgg cttatttggg tcaattttgg tagatgcatt 6360 ttctctctct atttttgaca acattcgtgg ggaagtagga gagaattccg agagtttgaa 6420 aaataatcgt tatacttaca ttttaggtaa gatagaaaaa tatctttgtt tctgtttaat 6480 acattgatgt ggaaaccaaa ttcatgattg aacattaagt ttaccaatat tttgaatgga 6540 gtgcgctcac tgagtgatta aattattttc ttgctagtat gtgagctatt catttcgcac 6600 gaaaacgtga agcgagacgg agcatagaca atcggaagca gaattttttt tatagagaag 6660 aaggagga 6668 // ID hAT-75_HM repbase; DNA; INV; 2891 BP. XX AC . XX DT 15-JAN-2009 (Rel. 14.02, Created) DT 15-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hAT-type DNA transposon family - consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-75_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-2891 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 415-415 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 6..2063 FT /product="hAT-75_HM_1p" FT /translation="MAKWRLASRMRLLECYGGSRAAYGFFPNCYMLIYFHI FT IISIILIVXKLAKMDKVKVNTSLLRKIRRKRKIKDENRIFHNDWTEKYAFI FT ERNNKPMCLICNIELAHNKTCNVKRHYETKHKQFSEEYPLSSNYRKNKIEL FT LKTKNKNQQLNISSYSQESINTTEASFVIAWNIARGKHPYTDGEFVKQNIS FT DVLSVLDPSNSKIQRLVSQIPISRHTTERRISEISIDVEQQLLNDLKNCEA FT FSLAVDESTDIQDKPQMAVFVRYVTSDVIVKEELLDLVELKDTTRGVDIKE FT ALDKALLKANVPINKLVSVATDGASAMAGKHLGLIGLLKSDPKYPEFIPIH FT CVVHREHLAAKHFNFPIVFKSVLKIVNYIRANAKNHRQFRNFINELDLADE FT PSDVSFFCSVRWLSSSDVFYRFVELLEPIKCFLIEKEKTFEILDDINFVQD FT LLFLTDIMQHLKSLNLFLQGKEKNISDLAQTIFSFQKKIILFQKDLSLKKF FT NHFPQMKKMITSNPSIQYDDDKIESYLDKLKDLQKDFQKRFKDLHELKSSL FT GFIVNPFLINIISNGCPIPEKMLEDISQFEMELLDLQEDQNLLMLHKTITF FT MEFWKIVPEVKYPQLKKIAIKFISIFGSTYTCESLFSTMKFVKSKYRANLT FT NEHLSELLRTATTSLKPDFKKLTSKVMSISLIKVKN*" XX SQ Sequence 2891 BP; 1079 A; 393 C; 453 G; 964 T; 2 other; cagggatggc caaatggcgg ctcgcgagcc gcatgcggct cttggaatgt tatggcggct 60 cgcgagccgc atacggcttt tttccaaatt gttatatgct aatatatttc catataatta 120 tttccatcat tcttattgtc wtaaaacttg caaaaatgga taaagtaaaa gtwaacacta 180 gtttgcttcg caaaataagg aggaaaagaa aaattaaaga cgaaaacaga atatttcata 240 atgattggac agaaaaatat gcatttattg aaagaaacaa taaacctatg tgtttaattt 300 gcaatataga acttgctcat aacaagacat gcaatgttaa acgtcattat gaaacaaaac 360 ataaacaatt ttctgaagaa tatccgttaa gttcaaatta cagaaagaat aaaattgaat 420 tgctcaaaac gaaaaataaa aatcaacaac taaatatttc atcttacagc caagaatcaa 480 ttaatacaac agaagcaagt tttgttatag catggaatat tgccagaggt aaacatcctt 540 ataccgatgg agaatttgta aagcaaaaca tttcagatgt cctgtcggtt ttagatccta 600 gtaatagcaa aatccaaaga ttagtatcac aaattcctat ttctcgacat acaaccgaaa 660 gaagaatatc tgagattagt attgatgttg aacagcaatt actaaacgat ttaaaaaact 720 gtgaagcatt tagtttagct gtagatgaat caacagatat tcaagacaag cctcaaatgg 780 ctgtattcgt tagatatgtg acttcagacg taattgtaaa agaagagtta ctagatttag 840 ttgagttgaa agacacaact cgtggagttg atatcaagga agcccttgac aaagctctgc 900 ttaaagctaa tgtacctatc aacaagcttg ttagtgttgc tacagatggt gcatcagcaa 960 tggctggaaa acatttagga cttataggct tattaaaaag tgaccctaag tatccagaat 1020 ttatcccaat tcattgtgtt gttcatcgag aacatttagc tgcaaaacat tttaactttc 1080 caattgtttt taaatctgtt ttaaaaattg taaattatat tcgagcaaat gcaaaaaatc 1140 acagacagtt tagaaacttt ataaatgaat tggatcttgc tgacgagcca agtgacgtgt 1200 catttttctg ttctgtaaga tggctttcaa gtagtgatgt tttttataga tttgtagaac 1260 tattagagcc aattaaatgt tttttgattg aaaaagaaaa gacatttgaa atcctagacg 1320 atataaattt tgtgcaagat cttttatttt tgactgatat aatgcaacat ttaaaaagtc 1380 tgaatttgtt tcttcaggga aaagaaaaaa atatatctga tcttgcacaa acgatattta 1440 gttttcaaaa aaaaattatt ctttttcaga aagatcttag cttgaaaaaa tttaatcatt 1500 ttcctcaaat gaagaagatg attacttcta atccttcaat tcagtatgat gacgataaga 1560 ttgaatcata cttggacaaa ctaaaggatt tacaaaaaga ctttcaaaaa aggtttaaag 1620 atttgcatga actaaaatct tctttaggat ttatagtaaa tcctttttta attaatatta 1680 ttagtaatgg ctgtccaatt cctgaaaaaa tgcttgaaga tatatctcaa tttgaaatgg 1740 aactactaga tctccaagaa gatcaaaatc tacttatgct gcataaaaca ataactttta 1800 tggaattttg gaagatagtt cccgaggtta agtatcctca acttaaaaag attgccataa 1860 aattcatttc aatttttggt tcaacgtata catgtgaatc tttattctct actatgaaat 1920 ttgtcaaatc aaagtatcgt gcaaatctaa ccaacgagca tttgtctgag ttacttagaa 1980 cagccactac aagtttgaaa ccagatttta agaaacttac cagtaaagtt atgagtatta 2040 gtttgatcaa agtaaaaaat tgataaaatg atgaaaaagt tatttcttga ttgtaaaaat 2100 caagtaaaat aatctaattt tagtgttggt tttttaatta ttatttcatt gatataagca 2160 ttgttataat tttgttaatt aaaaaaaagt ttatgatcag catgtgcttg tgcaaaaatc 2220 tatcataaaa aagatcagtg tgttgcacag cacatggcca gtgtgttgca cagcacctaa 2280 tcgtgtgctg tgcaaaatca gtgtgatcat gtgattgtcg aaaactatag aaatgatttt 2340 ttagtctgaa gggaatttta taaaataaat cagatataat ggcattgtaa acaaaatctt 2400 taaaggaata gttttttctt taaaaaattt attgagtgtt ttaaaacaat taactaaatt 2460 caattgtata catattttac ttaaatcttt aaaaaatgat tttttgtatc caagaaatat 2520 ctaaatttca tattaaaatg atattgttga atagtaatac aacaattatt tctacccctt 2580 aggagcatat ggaatttcct aaataatttg tattttctaa aaaattttct ttaaaaaatg 2640 attttaaata gtgaattaca aattgttgtt tttatatgaa accaagaaaa tgggctagta 2700 ctagaagttt cttatttttt gggtagtttt tttttactta tctagtacga aagtctctta 2760 aaattcagat tatcaaaaga aagagtataa aagtttctta tataaaaaaa caaacatgaa 2820 gttgttgcgg ctcttcggct cacgtaagta tataaatgcg gctcttgtga caaaacgtct 2880 ggccagcgct g 2891 // ID Gypsy-4_PPP-I repbase; DNA; INV; 5511 BP. XX AC ADBJ01000042; XX DT 13-DEC-2010 (Rel. 15.12, Created) DT 13-DEC-2010 (Rel. 15.12, Last updated, Version -1) XX DE LTR retrotransposon from the Polysphondylium pallidum (slime DE mold) genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_PPP_; KW Gypsy-4_PPP-LTR; Gypsy-4_PPP-I. XX OS Polysphondylium pallidum OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. XX RN [1] RP 1-5511 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Polysphondylium pallidum (slime RT mold) genome."; RL Repbase Reports 10(12), 2164-2164 (2010). XX DR GenBank; ADBJ01000042; Positions 87844 93354. XX CC Positions [3633-4148] - Integrase core CC 'CTCTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 117..5405 FT /product="Gypsy-4_PPP-I_1p" FT /translation="MMTVNDFLCRNDPLRSTHKSFCDFFVKVLSMYSISVK FT QAYDWTTATFAADPNSSSFNTWLNSNKVVIGSIQKFKLGDDIYSWIDAIYN FT LVPEEHRFNFTEVWLTPDAFSFFFANRIPYKDFAELSAALVKVFGCKIDAD FT SLYAKFSHQVVYDPKWSITSFNMSFNRLARIMNMGEAINVRMYKAAFINLG FT EIRTRLSEFTDLSQCLRKAEELFAFHYSAKNVNQSHLEKLDVADPCGWIDK FT HFPGFANKSVGQTTHHVSKPSFKCFNCGLPGHKSFECRKPRGPYHRDKRDR FT FRHSDRRSDSRSDRPKHSDRPSSGDKNPYINKMVGSVDTSSLPYVPIFVNG FT ITKDAKLDTGSEINLLSRTVWNNLKLATPLTPACINICGVNGTPTRADGMV FT SLLVGFTTVKGHRVHIPMDFIIADVDPKVCIIGVDSLAKLDFRIVDNTLTI FT SNNIVKTFRRGTTNHSALRNNIDFHVSGRKRKSALKCLFASTQKCDILHEI FT TKICHQIHVTKEKLKTDFENFHFGNFNKFGYSPMTMATLNPSNPTNAGIEH FT INTADIADSYLVWIDPDDVKTPLPTATTLTANATATSADGLDGVREDIINM FT LKSDFQDVVSPMTSPPPFREDVDMKINLNAPVQKQRMFPVPQSLLEELHNQ FT IKDLLHKGFIVPSNAEYGAPVLFVKKKTGNWRMCVDYRQLNKSTVKNSYPL FT PRINELLDRTKAAHWLSKLDLLHGYHQLRMNLQDADKTTFRCPFGSFKYTV FT IPFGLSNAPAVFQSFMDSVFKLEVFQKLLLVYLDDLLIMTNNSSITEHIDG FT IKHVFTILRKNNLHVNLSKCTFLVRSVDYLGHMVGNGKIDPLQDKIDSIIA FT WKEPNNKDELRSFLGLVNYYNRFIPRLATVQAPLTFLLRKFAPFHWSDDCQ FT LAFDTIKSAIKEKPSLFPPNYDLPFIFECDASDVGTGYRLFQLVDGKENVI FT CYGSKKLKDSETRYSTLEKELLAIVTALKANYYHIFGRDIEIRTDHKNITY FT INEANKIGINQTINRWIAHINLYQPKIKFIKGKDNTIADGLSRYTFNVSVS FT LSHPTIANDIIDGYRIEDNSNLSNRITDYTTINGLRYTHDGKIIVPRVMSI FT VNNLLYQAHDSEFGCHRSASKTLHSISELYVWPNMAADIRKYCDDCIVCRR FT ASDHARKRIGHLIPLPIPVKCFQRINIDFIPNLPVEEYMGRKVDAIMVVVD FT ALSKMTRLIPLGSDYTSKEVIDLFHKEIFKLHGIPIEIVSDNDSKLTSKLF FT KQLTKAYGIHHSTSAVSNQQANGQAERIIRAVKNATRKLMVDDRLHNTTGS FT WSKHIDTVEFSLNSTVHSTTEYTPFKIYLGHNPLTPLNLVKESDALVDVTD FT KQIIESIKERKAIIKLVKRNICIGNQEMRFTWNKGKKENQIKVGDYVFYNN FT NRDGKKKKLDVRYEGPRKVVEVDNYNNIFIDVIGKDGISTKKKKFNSKHIM FT LYDPKMNDNLINDVDDLLVENGATAETDRGSTVGDSSDDYDDSPRPIRRHT FT RRGLKNVESRVDINVERSVDKDDMEDNELNDDDDSDYDDNENNNNNNYNNN FT NNNDDMDVDDDDSDNLLINNNNNSNNSNNNNNNNNNNLNIDYFKFISNDSK FT FHQNITARGTLSNNLSKSILRNVENNFNELRNEVDKFDHNLKCNLNKSEII FT ATKNIISYINSDDFIPNTPSIIIADDVKVLAYTKWRKDIVAILAKRKFTLG FT TSKLARVEYLVKVGRALAWVPLPGNTLATKFSSAHSVPDSTDG" XX SQ Sequence 5511 BP; 1727 A; 1204 C; 1051 G; 1529 T; 0 other; tggtggactc agaagagtct tttcctttta cgccatattt taaacggacc cctactaagc 60 ccaccacccc tgccggttta accaaacgaa cagtgcctcg caatcacgct agtgctatga 120 tgacagttaa cgatttcctt tgcaggaatg accctttaag atcgactcat aagagctttt 180 gcgatttctt tgtcaaagtg ttatccatgt attccatctc cgtaaagcaa gcttacgatt 240 ggaccactgc tacttttgcg gccgatccaa attcatcaag tttcaacacg tggctgaact 300 cgaataaggt agtgattggc tcgatccaga agtttaaatt aggtgatgac atctactcat 360 ggatagatgc gatctataat ctggttccgg aggagcaccg tttcaacttt accgaggtat 420 ggctgacacc tgacgccttt agctttttct ttgccaacag aatcccttac aaggatttcg 480 ccgagttgtc cgctgctcta gtaaaagttt tcggttgtaa gattgatgct gactcactat 540 atgccaagtt ttcacaccaa gtagtatacg atccaaaatg gtcaatcacc agcttcaaca 600 tgtctttcaa ccgccttgcc agaattatga atatgggtga agccatcaac gtccgcatgt 660 ataaagctgc attcatcaac ctgggcgaga tccgtacccg tttgagtgag tttaccgatc 720 tctcacaatg ccttcgcaaa gctgaggagt tgttcgcatt ccattattca gctaagaacg 780 tgaaccaatc ccatctcgag aaacttgacg ttgctgatcc atgtggttgg attgacaagc 840 acttccctgg ttttgctaac aagtcggttg gtcaaaccac acatcacgtt tctaagccca 900 gctttaagtg tttcaattgt ggtctccctg gtcacaaatc ttttgaatgt cgcaagcctc 960 gtggcccata tcaccgcgat aaacgtgata gattcagaca ttcagatcgt cgttcagata 1020 gtcgctctga cagacccaaa cactcagatc gccctagttc tggtgataaa aacccgtaca 1080 taaataaaat ggtcggctct gtcgatacgt cttctttacc atacgttcct atttttgtca 1140 atggaattac caaagacgct aagcttgaca ctggttctga gattaatcta ctttctagaa 1200 ccgtttggaa taatcttaag ttggccacac ccctcactcc tgcatgcatt aatatatgtg 1260 gtgtcaatgg tacccccaca cgtgcagatg gtatggtctc attgcttgta ggtttcacca 1320 ccgttaaagg tcaccgtgtc catattccaa tggattttat cattgcggat gtggacccta 1380 aagtatgtat catcggtgta gattcactag ccaaactaga ttttagaatt gtcgataata 1440 cactcactat ttctaataat atagttaaaa cattcagaag aggtaccact aatcatagtg 1500 cgcttcgcaa caacatcgat tttcatgtat ctggacgtaa aaggaaatct gctttgaaat 1560 gtttatttgc gtcaactcag aaatgtgaca ttttgcacga aattacaaaa atttgtcacc 1620 aaatccacgt caccaaggaa aaattaaaaa ctgattttga aaatttccat tttgggaatt 1680 tcaataagtt tgggtactct cctatgacca tggccaccct taatccaagt aatccaacta 1740 acgctggcat cgagcatatt aacaccgctg atattgctga tagttacttg gtatggattg 1800 accctgatga cgtgaagact ccactgccga cagccactac tcttactgcg aacgccaccg 1860 ctacttcggc agatggtctt gatggtgttc gagaggacat cattaacatg ttgaaaagcg 1920 atttccagga tgtggtatct ccaatgactt caccaccacc attccgagag gatgtggata 1980 tgaagatcaa cttaaacgca cctgtccaga aacagaggat gttccctgtt ccgcaatcac 2040 tgttagagga actacacaac caaatcaagg acttgctaca taaaggtttc atcgttccat 2100 ctaacgctga atacggcgct ccagtattgt tcgttaagaa gaaaactggt aactggcgta 2160 tgtgtgtcga ctaccgccaa ctcaataagt ctaccgtcaa gaactcttat ccgcttccac 2220 gtatcaacga actgctcgac cgtaccaagg ctgctcactg gctaagcaag ttggacttgc 2280 tacatggtta ccaccaactt cgcatgaacc tacaagacgc tgacaagacc acttttcgtt 2340 gtccatttgg ttcattcaaa tacacggtta taccattcgg attatcaaat gcaccagcgg 2400 tcttccagtc tttcatggat tctgtattta aattggaagt atttcaaaaa ttattattag 2460 tatacttaga cgatttatta attatgacca ataattcttc tattaccgaa cacatcgatg 2520 gtatcaagca tgtattcaca atacttagaa agaacaactt acatgtcaac ttatcaaaat 2580 gcacattctt ggttcgatct gtcgactacc ttggtcacat ggttggtaat ggcaaaatag 2640 acccactgca ggacaagatc gattctatca tcgcttggaa agagccgaac aacaaagacg 2700 agctacgtag tttcctcggt cttgttaact actacaatag attcattccg aggttagcca 2760 ctgtccaagc accactcact ttcctgctta gaaagttcgc tccattccac tggtctgatg 2820 attgccaact tgcattcgat actatcaaat cagcaattaa agagaagcca tcgttgttcc 2880 caccaaatta cgatcttccg tttatatttg aatgcgacgc ttccgatgtc ggtaccggat 2940 atcgattgtt tcaattggta gacggtaaag agaatgtaat ctgctatggt tccaagaaat 3000 tgaaagattc ggaaacacgt tattcaacac tggaaaagga actattagcc atcgttactg 3060 cgttgaaagc caactactat cacatcttcg gcagagatat cgagattcgt actgaccata 3120 agaacatcac ctacatcaac gaggccaaca agatcggtat caaccaaact attaatcgct 3180 ggatcgcaca cattaatctt taccaaccaa agatcaaatt catcaaagga aaggacaata 3240 cgatcgccga tggtttgtct aggtacacat tcaatgtatc cgtatcactt agtcatccaa 3300 cgatcgccaa tgacatcatc gacggttacc gcatagagga caactccaat cttagtaaca 3360 gaatcactga ttacacgaca atcaacggtc tccgatacac ccacgacggt aagattatag 3420 tacctagagt catgtcaata gtcaataatt tactatatca agcccacgat tctgaattcg 3480 gttgtcatag gtcggcctcc aaaactctcc attctatatc cgaactatac gtatggccga 3540 acatggcggc agatatcagg aagtactgtg atgactgtat cgtctgtcga cgtgccagcg 3600 accatgcacg aaaaagaatt ggccacctga ttccattgcc cattccagtc aaatgcttcc 3660 aacgtattaa catcgacttc atcccaaatc taccagtcga ggagtacatg ggtaggaagg 3720 ttgacgctat aatggtcgtc gtcgatgcct tatctaaaat gaccagactg attcctttag 3780 gttcggacta taccagtaaa gaggtgattg atctgtttca taaagagatc tttaaactcc 3840 atggtatccc gattgaaatt gtttctgata acgattctaa attgacctct aagttattca 3900 agcagttgac gaaagcgtac ggtattcacc actccacatc ggccgtatcc aatcaacaag 3960 ccaacggaca agctgaaaga atcattcgtg cagtcaagaa tgcaactaga aagcttatgg 4020 ttgatgaccg tctccacaat actactggtt cctggtctaa gcacatcgat actgtcgagt 4080 tttcattgaa ttctactgtt cattctacta ccgagtacac accgttcaag atctatttgg 4140 ggcacaatcc gttaacacca ctcaacctcg tcaaagaatc ggatgctctg gtcgacgtca 4200 ctgataagca aatcatcgaa tctattaagg aaaggaaagc aatcattaag ttagtcaaac 4260 gaaacatctg tattggaaat caagaaatga gatttacatg gaacaaaggt aaaaaagaaa 4320 accaaatcaa ggtaggtgat tacgttttct acaataataa tagagatggt aagaaaaaga 4380 aattggatgt gaggtatgaa ggaccgcgaa aagtagtgga agtcgataat tataataata 4440 tattcataga tgtgattgga aaagatggaa taagcacaaa gaagaagaag tttaattcga 4500 aacacataat gttatacgat ccaaagatga acgataattt aattaatgat gtcgacgact 4560 tattagtcga aaacggtgct acggcggaga ccgatcgggg tagtaccgtt ggcgatagtt 4620 cagacgacta tgacgatagt ccgaggccta tacgccgtca cacacggagg gggctgaaaa 4680 atgtcgaaag tagagtagac ataaatgtcg aacgtagtgt tgacaaggat gacatggagg 4740 ataatgaatt aaatgacgat gatgatagcg actacgatga taatgaaaat aataacaata 4800 acaactacaa taacaataac aataacgacg acatggatgt cgatgacgat gatagcgata 4860 acttacttat taataataat aataactcta ataactctaa taataataac aataataata 4920 ataataattt aaatatcgat tattttaaat ttatttcgaa tgattcaaaa tttcatcaaa 4980 acatcaccgc acgtggaact ttatcaaata atttatcaaa atcaattttg cgaaatgtcg 5040 aaaataattt taacgaacta cgtaacgaag tggacaaatt cgatcataat ttaaaatgca 5100 atcttaacaa atcagaaata atagcgacaa aaaacattat atcatacatc aacagcgacg 5160 acttcatacc aaacactcct tctattatta ttgcagatga tgtaaaggtt ctagcataca 5220 ccaagtggag aaaggatatt gttgccatcc tcgccaagcg caagttcact cttggtactt 5280 ccaaattagc tcgtgtcgaa tatttggtta aggttggaag agctctcgct tgggtaccgt 5340 tacctggtaa tactctggcg actaagttta gctctgctca ttcagttcca gattctacag 5400 acggatgagt cggcacccgt ctctcgcgta agtcctgata aggttttctt atgcaaagcc 5460 gcgtataagt ttttaatcct gtcaggctgc tgcctcgaag aagaagggca g 5511 // ID Gypsy-11_DWil-LTR repbase; DNA; INV; 214 BP. XX AC scaffold_181036; XX DT 05-MAR-2011 (Rel. 16.03, Created) DT 05-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_DWil_; KW Gypsy-11_DWil-I; Gypsy-11_DWil-LTR. XX OS Drosophila willistoni OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC willistoni group; willistoni subgroup. XX RN [1] RP 1-214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; scaffold_181036; Positions 36088 35875. XX SQ Sequence 214 BP; 79 A; 30 C; 54 G; 51 T; 0 other; tgtcaagaat gacaaagact atacagagag agagagagag cttaaataga gtgcatacag 60 agagagagca cagagggaga gtacgggagt cttgttggag aagccgcgac gatcggacgt 120 ctttggaagt acaccatata catatatata atttgtatga gttcgatcaa atttaagctg 180 taataaacgt atccaataaa ttggtatctt gaca 214 // ID Gypsy-39_AA-I repbase; DNA; INV; 3934 BP. XX AC AAGE02027584; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_AA_; KW Gypsy-39_AA-LTR; Gypsy-39_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-3934 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02027584; Positions 58842 62775. XX CC Positions [2983-3471] - Integrase core CC 'ACAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(37..2385,2389..3930) FT /product="Gypsy-39_AA-I_1p" FT /translation="MNFTKPPEFNVGDSWPLYEERLKRFFVAYEIGEDEDE FT RRAAFLLTAVSMEVYQIVKNLCFPELPEQKKFSEICELLKQRFTPTLVVFR FT ERARFFEARQGDAESVVEWSTRLKKLAADCEFGEDLNTFIKNLFVVGLRRG FT PIFERVCEEDAATSFDDLMKIAQKKECTLQQRGVLDVHKIQYGSDKKQSSS FT GAKCFACGRGDHDFRKCQYKSYVCKLCDKKGHLAKVCPTKTLEQPKEKKKS FT FGAEKRGPRVNHLRINKLDVPPPVMVRIQVNGCPIRFEVDTGSPVNAVSKE FT LYDTLFSALPLNINSKDEFICYNGSGFRAVGTFNAEMQYKDHVSTEEIYVF FT EGTRQPLLGRRTMCRWNLKIDFCHVSKEEVDCNKQLKPLLSKHAEVFDGEL FT GHYKHGKVRLALKDEAVPKFCKPRKVPLAFKEKVEAELDRLEETGIISKAP FT SSEAEWGTPLVPVLKKDSSIRLCADYRITVNPFLLDDHHPFPVIEEIFAAL FT QGGKYFSKLDLKNAYYQLEVDNDTKKLLAWSTHRGVYWMNRLPFGTKPACS FT IFQATLEKVLQGCRGTVIYLDDIMVSGRTIMEHLANLDAVLTRLKEAGFLL FT NKMKCEFFVQKVGYLGHVIDQDGLHKDPEKVKAIMDVKSPTDVKEVRAFVG FT LANYYAKFCPSLAQCLKPLYELLKDDVKFVWTKNRQKAFEMAKKLLSEDTV FT LVHYDPKLPVKLYCDASNEGIGAVIVHEFPDKSERPISFASRIYKKHEANY FT SVIDKEALAIYYGINKFNNYLQGRHFVLMTDHPLTGLFSPKGVPETAAGRL FT QRWAAFLSGYDYDIQHVKGVQNVPADYLSRYPTGSDKCVEDEDEDELASFL FT NFIEAETRSLVDRKQIVIESRRDKLLSRVAEYVKFGWPNSIQEDNLKTFFQ FT KREELTVEEDVLLWGYRIVVPVKLRKFLLDELHSVHLGIVKMKSLARSYFW FT WPSLDKEIEDICKKCESCIQQRPERSDPISPWRLTNAPCDRVHVDHFSFRG FT ADYLVLLDSYSKWIEAFPVRTQTSKESIEKISEFTSRFGSIGTLVSDNGTA FT FSSEEFQAFCSSRGIKHLRTAPYSPCSNGAAENAVKTVKNALKKLSSDPAF FT QRKSVALQLSSYLEMYRATKHATTGESPFKLMFGREMRIRFDKLKADPERR FT QREAIEEYNVKKKNVNFEVGETVYARDYRNPKKPLWMRARVIKKLGAVLYE FT CLVAELGVIKRRSHQLLKYPFDDYEEDTKRTNNNDVQLPNEQSDDEMDYVS FT ISESDAEEQTVQPRPAGSYVTRYNRVVTQPRRFGGE" XX SQ Sequence 3934 BP; 1107 A; 822 C; 1062 G; 943 T; 0 other; ttttggcgac gaaaggagaa aggaagttct gcaagaatga attttaccaa gccaccggag 60 ttcaacgttg gagattcatg gccactgtat gaagaacgtt tgaagcggtt ttttgtggcc 120 tacgaaatcg gtgaggatga agacgagcga agagcagcgt tcctgttgac ggccgtttcc 180 atggaagtat atcaaatcgt gaagaatttg tgtttcccgg aactgccaga acagaagaag 240 ttttccgaga tctgcgagtt gctgaaacag cggtttaccc cgacgctggt tgtgttccga 300 gagcgagcac ggttctttga agcccggcaa ggagatgcag aatcggtcgt agaatggtcc 360 acacgcctga agaagttggc cgccgattgc gaatttggag aagacctgaa cacgtttata 420 aagaatttgt tcgtggtggg cctccgtcgt ggtccgattt tcgagcgagt ttgcgaggaa 480 gatgcggcaa cgagtttcga cgatttaatg aagattgcgc agaagaagga atgtacccta 540 caacagcgag gagttctgga cgtccataaa atccagtatg gttccgacaa gaagcagtca 600 agttctggag cgaagtgttt tgcctgtggt agaggcgacc acgatttccg gaagtgccag 660 tacaagagtt atgtgtgcaa gctctgcgat aagaagggtc acttggcgaa agtttgcccc 720 acgaagacgt tggagcagcc aaaggagaag aaaaaatcgt ttggagcaga gaagcgagga 780 ccaagagtga accacctgag gatcaacaag ctggatgtgc caccaccggt gatggtacgc 840 attcaagtca acggatgccc tattcggttc gaggtggaca ccggtagtcc agtgaatgcc 900 gtttccaagg aactttacga cactttgttt tctgcgctgc cgttgaacat caattccaaa 960 gatgaattca tttgctacaa tggatctggt ttccgagcgg tcggcacatt caatgcagaa 1020 atgcaataca aggatcacgt gtccacggag gagatttacg tgttcgaggg aacaagacag 1080 ccgttgctgg gccggcggac catgtgccgt tggaacttga aaatcgattt ttgccacgtt 1140 tcgaaggaag aggttgattg taataagcag ctgaagccac tattgagtaa gcatgctgag 1200 gttttcgatg gtgagttggg ccattacaag catggcaagg tgcgactagc ccttaaggat 1260 gaagcagtcc cgaaattctg caagccccgg aaggttcctc tcgcattcaa ggagaaggtg 1320 gaggcggaac tggaccggct ggaagaaact ggaatcattt cgaaagcacc atcgtcagag 1380 gccgaatggg gtacaccact tgtccccgtt ctcaaaaagg attcgtcgat acgactatgt 1440 gcagattatc gcatcacggt caacccattc ctgctggacg atcatcatcc gtttcctgtg 1500 atcgaggaaa tttttgcagc gcttcaagga ggcaagtatt tttcaaagct cgacctgaag 1560 aacgcgtatt atcagctaga ggtggataat gatacaaaga agctgttggc ctggagcacg 1620 cacaggggtg tctactggat gaatcgactt ccgttcggaa cgaagccagc gtgttccatt 1680 tttcaagcga cattggagaa agttcttcag gggtgccgtg ggacagtaat ttatctggac 1740 gatataatgg tgtcaggccg gaccattatg gagcacttgg cgaatctgga tgccgtactg 1800 acccggctca aggaggccgg attcctgctg aacaaaatga agtgtgagtt tttcgttcag 1860 aaagttggtt atcttggtca cgttatcgat caggatggct tacataagga tcctgagaag 1920 gtgaaagcaa ttatggatgt caaatcaccg acggatgtca aggaggttcg agcgtttgtt 1980 ggcctggcca actattatgc caaattttgc ccaagtttgg cccagtgttt gaagccgtta 2040 tacgagctat tgaaggacga cgtcaagttt gtgtggacga agaaccggca gaaggctttc 2100 gagatggcga agaaacttct ttccgaagat acagtacttg tccactacga tccgaaactt 2160 ccagtcaagc tttattgtga cgcttcgaac gaaggaattg gagcagtgat tgttcacgag 2220 ttcccggaca aaagcgagcg tcccatctca tttgcgtcca ggatctataa gaagcatgaa 2280 gctaattact cggtaatcga caaggaagca ctggccattt attacggaat caacaaattc 2340 aacaattatt tgcaaggccg gcattttgtg ctgatgacag accactagcc attgaccggt 2400 ctattcagcc ccaaaggtgt cccggagacg gcagctggtc ggcttcaacg ttgggcggct 2460 ttcctatcag gttatgatta tgatattcaa catgtgaaag gagttcaaaa tgttcccgca 2520 gattatttgt ctcggtatcc caccggtagc gacaagtgtg tagaagacga agacgaggac 2580 gaattagcaa gttttctgaa tttcatcgaa gctgaaactc gttcactggt ggaccgcaaa 2640 caaatcgtta ttgaaagccg tcgtgacaag ctgctcagcc gtgtagctga atacgtgaaa 2700 tttggctggc cgaactcaat tcaagaggac aatctgaaga cattttttca aaaacgagaa 2760 gagctgaccg tagaagaaga cgttctcctg tggggctacc gcatcgtagt tccagtcaag 2820 cttagaaagt ttctgctaga cgagctccac tctgttcatt tggggattgt gaagatgaaa 2880 agcttggcca gatcgtattt ttggtggcca tcactagaca aagaaattga agatatttgc 2940 aagaaatgtg agtcatgtat ccagcagcga cccgaacgga gcgatccaat ctcaccatgg 3000 cgtttgacga atgcaccctg cgaccgagtg cacgtggacc atttctcgtt ccgaggggcg 3060 gattacctgg tgctgctgga cagctatagt aaatggatcg aagcattccc ggttcgtacg 3120 cagacttcga aagaatcaat cgagaagatt tcggagttta ccagcagatt tggttccatc 3180 ggtacactgg tgtcggacaa tggaactgcg ttttcttcag aagaatttca agctttctgt 3240 tcatcccgag gaattaagca tttgcgtaca gcaccatata gtccctgttc caatggtgct 3300 gcggagaatg ctgtgaagac tgtgaaaaat gcgttgaaga agttgtcatc ggatccggca 3360 ttccagcgta agtcggtggc gttacaactg agctcatatc tggagatgta tcgtgccacc 3420 aagcatgcaa caaccggaga gagtccattc aagctgatgt ttggaagaga gatgagaatc 3480 cgattcgata agctgaaggc agatcctgaa cgacgacaac gtgaggcgat tgaagaatac 3540 aacgtcaaga aaaagaatgt gaatttcgaa gtaggagaaa ccgtgtatgc cagagactac 3600 cgaaacccca agaaaccatt atggatgcga gcgagagtca tcaaaaagct tggtgcagtc 3660 ctctatgaat gtctggtagc agagctagga gttatcaagc gtaggagtca tcaattgctg 3720 aagtatccat tcgatgacta cgaggaagac acgaagagaa ctaacaataa cgatgtgcaa 3780 cttccaaacg agcaatctga tgacgagatg gactatgttt caataagtga atcagatgct 3840 gaggagcaaa ccgtacaacc acgtccagcc ggatcatatg ttaccagata caatcgagtg 3900 gtaacacaac cacgtcgctt tgggggagag tagt 3934 // ID Copia-1_DYa-LTR repbase; DNA; INV; 213 BP. XX AC chr2L_random; XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_DYa_; KW Copia-1_DYa-I; Copia-1_DYa-LTR. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-213 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (05-MAR-2011). XX DR Genome; chr2L_random; Positions 925053 924841. XX SQ Sequence 213 BP; 67 A; 44 C; 23 G; 79 T; 0 other; tgaaatacta aaaataatat gtatgttaat ttggcaacaa ccatatactg aaatttggta 60 acactgatat gaatctattc tttttttttt tctgagaact tccttcttct tgtgttaact 120 tccgaacgac tcacaacgct gtcacgctta tgtcaaataa aacaattatt aaactttaac 180 taaatcgcct cgcgttttac tcctccattt cca 213 // ID Crack-4_BF repbase; DNA; INV; 3624 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-4_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; Crack-4_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3624 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3624 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 809-809 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 58..3195 FT /product="Crack-4_BF_2p" FT /translation="MTNNLLSKTIMTTTSDQPSDQPLHETSNCDLSSLLFN FT PFDLNENVSESPYSSYDPDVNFFSYYHSSFKNTTQYYNSDSLNTCLSSKST FT STSLSMFHLNVRSLPRHFEELNEYLYTLKHRFSAIALTETWLTDINCQNYN FT IPGYKSIHHCRQNKLGGGVSLYIRDDLSYTERKDLEFEIDESGTQCIFIEL FT CSSRDEGKKVIIGCVYRPPNTEIQQFNDQLTECMSKINKEKSTCFILGDFN FT IDLLKTSHSPTNDHINNLFSSSFFPLVYKPTRITTHSATLIDNIYTNTLNS FT NLRVGIMVTDISDHLPIFLISGIEEPLNNDTWYSYRQINETNKEKFIESLT FT RTTWDPVLEETDTEMAYEKFLTTFTLLFDESFPMKTAKIGQHSRRKPWLTR FT GLLKSIKKKNKLYVKYVKNPTPLSESTYKVYKNKLNHTLRTAKKNYYHRKF FT TEASKNVKQTWNVINDLLNRSDKSKAAPHKIQTGARSITDKNKMCEEFNDF FT FVNVGPTLESKIPIVAKDPLDYIKTHLNRQLTSFEPPSTAEILDIVRNLKD FT SAAGHDGISAKIIKISLPYIICPLIHILTLSIQSGVVPKSTKIAKVIPIFK FT SNDSTSVSNYRPISVLPCISKILEKLIYNRLLKHLDQNDILYKHQYGFRRG FT FSTSMALIQLIDKITTAIDNNEFTIGIFLDLSKAFDTVNHSILLKKLNKYG FT INDTALKWFKSYLSDRKQYVTLNGYTSSCKLINCGVPQGSILGPLLFLIYV FT NDLAEASKIFFFILFADDTNLFLSHADYDSLIRLTNQELEKVITWFETNKL FT SVNIKKTYYLIFCSKNKTYNKKNTNIFLKNLPLSQECQAKFLGVLIDDRLT FT WKPHIAMILNKISKTIGIIGKIRHLLSQRTFITIYNSLIYPYLIYCNIVWG FT NAYKTSLHPLLILQKKFLRIATCSSFYTHSSPLFEKLRILNIYDVNRFQLA FT LFTAQHINHTLPDTFDSFLNFRSQFHNYQTRQSTNLHIPLFRTSLAQMSVK FT YKCVKIWNDLPPSLKNLTSSLLTFKRNLKIHLLIHPSPL*" XX SQ Sequence 3624 BP; 1216 A; 773 C; 494 G; 1141 T; 0 other; tctacttctg actcgtgata atcctgaaca ttttggacac atccgagact gcgtcacatg 60 accaacaatc tgttaagtaa aacaatcatg accactactt ccgatcaacc ctcagatcaa 120 cccctccacg aaactagcaa ctgtgattta agctctttac tctttaaccc atttgactta 180 aatgaaaatg taagtgaatc gccctattcc tcttatgacc ctgatgttaa cttctttagc 240 tactatcata gttccttcaa gaacactaca cagtactaca acagtgactc tttgaacaca 300 tgtctatcaa gtaaaagtac atcaacttct ctctccatgt tccatttgaa tgtcagaagc 360 cttccccgcc acttcgagga gctcaatgaa tacctctaca ctctaaaaca taggttttca 420 gcaattgccc ttacagaaac gtggcttacc gatatcaatt gtcaaaatta caatattcct 480 ggatataaaa gtattcatca ctgcagacaa aacaagttag gagggggggt ttccctatac 540 attagggatg atttatcata tacagaaaga aaagatctag agttcgaaat tgatgaatca 600 ggcacgcagt gtatattcat tgaactgtgc tcaagtcgcg atgaggggaa aaaagtaata 660 attggttgtg tatacaggcc tcctaatacc gaaattcaac agtttaatga tcaattgaca 720 gagtgtatga gtaagataaa caaagaaaag agtacctgct ttatcctagg tgattttaat 780 attgatttac ttaaaacctc acattcccct actaatgacc acattaataa ccttttctcc 840 tcctcattct tccctcttgt gtacaagcct accagaatca ccacacattc tgctacttta 900 attgataata tatatacgaa cacacttaac tccaacttaa gggttggaat catggttacg 960 gacatatcgg atcacttacc aatattcttg atatctggaa tagaggagcc tttgaataac 1020 gatacttggt attcttatcg ccagattaat gagacgaaca aagaaaaatt cattgaatcc 1080 ctaacaagga caacttggga ccccgttttg gaagaaacag atacagaaat ggcttacgaa 1140 aaattcctta ctacatttac tttacttttt gacgaatcat ttccaatgaa aacagcgaaa 1200 atcggtcaac attcccgcag aaaaccatgg ttgacacgtg gtctactcaa gtcgattaag 1260 aagaaaaaca aactttatgt gaaatatgtt aaaaacccta cccctctgtc tgagagtacc 1320 tacaaagtgt acaaaaacaa attaaatcat acattaagaa ctgccaaaaa gaattattac 1380 cacaggaaat tcacagaggc ctccaagaat gtaaagcaaa catggaacgt aattaacgac 1440 ttactaaatc gtagtgataa atctaaagca gccccacaca aaatacaaac cggtgcccga 1500 tctatcacag ataaaaataa gatgtgcgaa gaatttaatg atttttttgt taatgtcggc 1560 cctaccctag aaagcaaaat acccatcgta gcgaaagatc ccctcgatta catcaaaact 1620 catctgaata ggcaactaac gtcatttgaa cccccgtcca ctgctgaaat tttagatata 1680 gtacgaaacc tgaaagattc cgctgctgga cacgatggaa tcagcgcaaa aataatcaaa 1740 atatccctac cttatattat ttgtcctctt atacatatcc taacattatc cattcagagc 1800 ggtgtggtcc ctaaatcaac caaaattgcc aaagtgattc ctatatttaa aagtaatgat 1860 tctacctcgg taagtaatta ccgccctatt tcagttctac cgtgcatctc caaaattctc 1920 gaaaaattga tatacaatcg tcttttaaag catttagatc aaaatgacat tctatacaaa 1980 catcaatatg gtttcagaag gggattctct acttccatgg cacttattca attaatcgat 2040 aaaattacta cagctattga caacaatgaa tttaccatag ggatattttt agatctaagt 2100 aaggcattcg acacggtaaa ccattctatt cttctcaaaa agctaaataa atatggcatt 2160 aatgacacgg cactcaagtg gtttaagagc tatctatctg atagaaaaca atatgtcacc 2220 ctcaatggct atacatcttc atgcaaactg ataaattgtg gagttcccca gggatcaatt 2280 ctgggaccct tattgttttt aatatatgtt aatgatcttg ctgaagcatc caaaatattc 2340 ttttttatcc tatttgctga tgatacaaat ctcttcctct cacatgcaga ctatgattcg 2400 ttaatacgtc taaccaatca ggaattagaa aaagtcatca catggttcga aaccaacaag 2460 ctatctgtta atattaaaaa gacgtactat cttatctttt gtagtaaaaa caagacttat 2520 aataagaaaa ataccaatat cttcctaaag aaccttcctc tctcccaaga atgccaggct 2580 aagttcctag gagtacttat tgatgatcga ctaacatgga aaccccacat tgctatgata 2640 ttaaataaaa tttcaaaaac tatcgggatc atagggaaaa ttagacatct tttatcccag 2700 agaaccttta taaccatata taacagtcta atctatccat atctcattta ttgtaatatt 2760 gtttggggaa acgcctacaa aacttctcta caccctttat taattctaca gaaaaaattt 2820 ctacgcattg cgacatgttc gagtttctat actcactctt caccactatt tgagaaactt 2880 agaattttga atatctatga cgttaacaga tttcaacttg ctctattcac agcccaacac 2940 ataaaccata ctctccccga tacctttgac agctttctta attttcgctc gcagtttcac 3000 aattatcaaa ctcgccaaag cacaaatctg cacatccccc tattcagaac ctctcttgcc 3060 caaatgtctg ttaaatacaa gtgtgttaaa atatggaatg atcttccccc ctcccttaag 3120 aatcttactt cctctctcct gacttttaaa cgtaatctga aaatccatct gttaatccat 3180 ccgagccccc tgtagtagcc acattttttt tctttttaaa tcatattatc ttaattcatt 3240 ccattatttg attatcctta ttgcactatt atcattttct aagtattatg ctgatttatt 3300 atttattgtt aagtcaaacc attaattact tggttatcct tattgcatca ttatcatttt 3360 ctacatatta cgttacttta tcattcattg ttaattcata ccattaatta cttgtttatc 3420 cttattgtat cattattatt ctatgtatta ctttacttta ttattatata tcataatcaa 3480 ttttgtttag tttaaaggag gagggctaag tataagcttc acaagctttt tccctcctcc 3540 tgcacctgtt ttatccattg ttttcgtttt gtgttttaat gtaatacttt gtgtgcgaat 3600 aaataaacaa acaaacaaac aaac 3624 // ID EnSpm-1_HM repbase; DNA; INV; 4112 BP. XX AC . XX DT 21-MAR-2008 (Rel. 13.03, Created) DT 01-APR-2008 (Rel. 13.03, Last updated, Version 1) XX DE EnSpm-type family - consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-1_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4112 RA Jurka J.; RT "EnSpm-type families from Hydra magnipapillata."; RL Repbase Reports 8(3), 183-183 (2008). XX DR [1] (Consensus) XX CC This family is highly diverged from most known EnSpm elements. CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 1878..2675 FT /product="EnSpm-1_HM_1p" FT /translation="MLSTSIYTLSNEIVSSNDIDACQQLIYLFQKALCSYF FT GEGIRTFTFHALAHLSDQVKNFGPLTATSAMVFENVNRQLKRSVTGTKGHG FT RQMAEKFIRFQTEDHLNTFSLNPFVLGAAQNISDKLRNMDLNDLNVDLNLY FT QFYNRFRVADRTFHAYHYGKKLKSASYYAYLIEFNVIVKILFIAHNQRQIL FT CICRLYKNVSKFHRHMDSFNIPIKIKHVLSKISPYLLLKKCGLAVFDSSKF FT THYALVKKTKKFYYSVVIQNNYEHD" XX SQ Sequence 4112 BP; 1474 A; 483 C; 483 G; 1670 T; 2 other; cccgagaaaa aaaaacccgc taatactact aggaatatgt ttaacttcct gctaatcctg 60 ttaagttcct gctaatacgg ctaatcctgg taagttttcg cttaatacgg ctaaacctat 120 taagtttccc attagtgaaa tgacgtttat gaatttcctg ctaatcctag taagttccta 180 ctaagtactt atcaggatta aacgaattac ttagtaagtt cctagtaagt tttcaaaaat 240 aaaacaagta taaaaaatcg tctttgttat acgccatctt ttatttattc acacggaaga 300 caagtcagcc ataaaacagc ttttttagat tcaatagttt tactaattaa tatttgtaat 360 tctataagga aaaagctcaa gaacatctgg ggatgattgc gtggtttgca gtttatttga 420 tgaatatttg ttaatgtgtt ataatgtggt tttaaatgtt aattttaact ctatataaat 480 tcgagcattt tctgttgtat caattcgagc attttctgat tcatagtgat atgtatggtg 540 atgtgttttt taaattatat tttattttta tttctgatkt ttttattttc attgtttatg 600 attttatatt gtttaataaa cataaaatca taaatcataa acaattatat tgtcactctt 660 tctttttact atttcttttt actaaaaaga gtaagtgact tatctttatt aataaagatt 720 tgtcactttt ttttagtaaa aagaaagact tttattttta attttcttaa aattttattt 780 gtaattttat ttaaaaagtc aaatttttat gtcaaaaaac cactcacttc aagcttgact 840 aaaagtttta gtcagactaa aagggagtaa accatttgac attaaaaatt gatcaatatt 900 gaaatcactc tctcataaaa tatatataaa cattttttac tggtaaatat atttttattg 960 ttatctcttg ttaacaagat ttaacatttc tttttaacat gttaaacaaa ctataaaatt 1020 attgaatatt gttatatttt gtattataat ttttatttac ttaacattaa gatttttgtt 1080 ataaatgcca ttatataaat taatttaaaa tttccataat ttaattttat gatattttga 1140 ttaattacta ctgtatatag ttatataatg ctagttactc aagtttttag taatatatat 1200 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 1260 atatatatat atatatatat atatatatat atatatatat atatatatat atttatatac 1320 atatatgtat atatatatat atatatatat atatatatat atatatatat ctatatatat 1380 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 1440 atatatacat cctgcaaaat tcaggttggt aatgccaacg caaaagccaa cctatttttg 1500 cttcacgccg gcatgcaact gctgacctaa atctttgtaa caagtagmaa aagtaatttt 1560 tttgtccacc tgacaaatat ttattatttt gataaataaa aaaatgataa ttttgaaata 1620 gacaaaataa aaaaataaaa actaagaaaa atttaagaag attttttctt atataaaaag 1680 atgttcatca aagtcttgta gatcttttta gtctaataga ctccataaaa ttgccacatt 1740 tttttcgacg gaagccaaaa aatttgaata atgaacttgt taagtggaaa gctcaagagc 1800 acaagtattt tctattgtat tatgggccat ttgtgtttta caaacttggt aatgctgaaa 1860 tttgttctct ttatttaatg ctttcaacat ctatttatac attatccaat gaaattgttt 1920 catcaaatga tattgatgct tgtcagcaac taatttattt atttcaaaag gcattatgct 1980 cctatttcgg tgaaggtatt cgcacattca ctttccatgc attagcgcat ctctctgacc 2040 aggtaaaaaa ttttggacct cttacagcaa cgtctgctat ggtgtttgaa aatgtaaatc 2100 gccaactaaa gaggtctgtt acaggaacta aaggccatgg aaggcaaatg gctgaaaaat 2160 ttattagatt tcagacggaa gaccacttaa atactttttc tttgaatcct tttgtgttag 2220 gtgcagcaca aaatatatca gataagttgc ggaatatgga tttgaatgat ttaaatgttg 2280 acctaaatct ttatcagttt tataaccgtt ttagagttgc tgatagaact tttcatgctt 2340 atcattatgg taaaaaatta aaatcagctt catattatgc atatttaata gaatttaatg 2400 tgattgtgaa aattttgttt attgcacata accaaagaca aatcttgtgt atatgtcgat 2460 tgtataaaaa tgtaagtaag tttcatcgtc atatggattc ttttaatatt cctatcaaaa 2520 tcaagcatgt tttgtcaaaa atatctcctt atttattgtt gaaaaagtgt ggattagctg 2580 tttttgattc aagcaaattt actcactacg ctcttgtaaa aaaaacaaag aaattttatt 2640 atagtgttgt tattcaaaat aattatgaac atgattaggt tttatttcta ttttttaaag 2700 tttgaaattg tattatgttt ttttgtatta ttatatattt actttacaat attcttttta 2760 ttagaagttg ttatttagga ttctgtttgc aatcaattaa ttaatgaaac atcaaactta 2820 attcataatg gcgacaagag tttgttgttt acaatgatct aaatcagatt tctttttgta 2880 ccattttgac tttgtgtagt gtttctttta agaattaaat aaatttgtta agagaaaaga 2940 aatcgtatcc acctgttatc gcggataaca ggtggatacg attaatatct attaaattaa 3000 ttaaatattt attttagata caattattaa ttgtttttcg tcaactgtaa attttccttt 3060 ttttatatgt atataaaaat atgtgtatat atatgtgtat atatatatgt gtatatatat 3120 gtgtatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3180 atatatatat atatatatat atatatatat atatatatat atatatacac acacgaggct 3240 tatttaaatt tttttaattt aacttagttt catattttca tacactttta ttttgaaatt 3300 ttagattata atagctttga aacaatgata atgaaagttt tttcatcgtt ctaaatttat 3360 ggtaagttaa atatgctttt ttgttataaa atatatgtta ttatgttgtt gaattacatt 3420 aatattttta taaattacgt tatagctaca gttatggtcg taaaagatta agaaaaatct 3480 tatgggtgca attattccac tgacacctgc gaaattacca aaagaactga ccatcgctgt 3540 ttattgtatt tttttaaatg ttctagttaa acaatctaac acttaattat ttgttatttt 3600 tattattgtt ttataaataa gcgtccttat tgcttagcga ctagttaagt tatattataa 3660 ggattatatt ttcaagaaga ttctgtgttt aagtaaaaat tattattttt acttaaactc 3720 gtatatttct tgaaaaataa gcctactaat atatctatga aatctttaaa actcaaataa 3780 ttaaataaaa ccctaataaa catatgaatt taactaaaaa aactattaac taaaactaga 3840 aagtaatact taaccgagaa aaaaaaacct actaatcctg ttaagtacct actaagttcc 3900 gagtaaaaac cctagtaata ctactaagtt tccttctaaa aacttgctaa tcctactaag 3960 tttcctacta aaacttgcta atcctactaa gtttcctact aaaacctgct aatcctacta 4020 agttcctagt aagtactcgg cgggaactta ctaagtttta gtaggaaact tattaggaac 4080 ttaacaggat tagcgggttt ttttttctcg gg 4112 // ID EnSpm-N6_BF repbase; DNA; INV; 3902 BP. XX AC . XX DT 04-AUG-2008 (Rel. 13.08, Created) DT 04-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Amphioxus EnSpm-N6_BF non-autonomous DNA transposon - consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N6_BF; non-autonomous. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3902 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3902 RA Kapitonov V. and Jurka J.; RT "EnSpm-N6_BF - a family of non-autonomous DNA transposons from RT the amphioxus genome."; RL Repbase Reports 8(8), 794-794 (2008). XX DR [2] (Consensus) XX SQ Sequence 3902 BP; 1051 A; 827 C; 816 G; 1208 T; 0 other; cactgtgcga aagggtgtga tttccggaca aagtcggagt tgacatattt ttggacatgc 60 taaatattcg gactgtgtcc gggcacgatg ttcaatattt cccatatctg gaaatatcat 120 gagtcaggaa ttgtcttgtt atgtatccgg acaaataaga ctttcctgct tttcaagata 180 cgcgccattg acgtttcttg cctccgaata cgtgatatgt tctacattct ttacatatcc 240 aaggtccata tatttcgata aggaataccc ggacaaatta ggagtttgca cgaagcatga 300 tgtgcattat ttctgggcca ttgtcagatg acattttcaa ttcattatgt ctcagaatat 360 catgagtcat gaaatgtttc agcacatgcc cggacggata gaatttcccg ttcttccgaa 420 tacgtgtcag ggacgtttca tgttttcgaa ttcatactat tttcttttct ttccatttac 480 gagacatatc cgcgacagat attccatgga ttagttttag gacagattcg aagttagcat 540 attctgaccg ttaccgataa aggtattcta tacgtgaatt gtgaatcggt gattgtttta 600 ttgcgcatct gaacaaataa gactgcattt ttttgccgaa attttcgctc cttccgaatt 660 catattccgt tccatttacg acagatgtcc aaaacagaaa cattccagag atcagttttc 720 ggacaactta atggttaaca aattactttt cttgtcatac atgaatctga gattgttttg 780 ttgggtatcc ttaaaaaggt ttcctatctt gtcaaataca cgtcattaac gtttttgctt 840 ctgtatatgt gtttttattc tacaacgagt atgcccgatg tcaatgtatg tttactcttt 900 aattttctgg taattgaagc ttaagcttcc atatacatgt gcaaatatac tgacatcgac 960 atttacaatc aaatgacaca gaaagcaatc tttgcgatgg taacaattat gagtcaagta 1020 ttattgtttc tttgcatatc cggacaatat cttgaacaaa gttcagtcaa aacactccac 1080 cccagcggtg cagtttggga taggtccata tccggacaat atcatgaaca aagttcagtc 1140 aaaacactcc accccagcgg tgcagtttgg gataggtcca tatccggaca atatcttgaa 1200 caaagttcag tcaaaacact ccaccccagc ggtgcagttt gggataggtc catatccgga 1260 caatatcatg aacaaagttc agtcaaaaca ctccacccca gcggtgcagt ttgggatagg 1320 tccatatccg gacaatatct tgaacaaagt tcagtcaaaa cactccaccc cagcggtgca 1380 gtttgggata ggtccatatc cggacaatat cttgaacaaa gttcagtcaa aacactccac 1440 accagcggtg cagtttggga taggtccata tccggacaat atcatgacaa aaagtcactc 1500 atgatattga ataccagcgt tgctgtgtgg gaatggccca tatccggaca atatcacgag 1560 gcaactttac tcgtaaagcg gttatgcaag ctgtgcaaag tgccgagcag accagtacgt 1620 tggcatgtca acggtgcgcc tactggggca gtcgttgcag ataacagcct acaccgcgcc 1680 acagcgtttg ctgctgtgta caactttgaa actggtaagt acatgatgca tttattcaca 1740 aaaataatgc acaatgccat gtgtttccgt gaaacaggca tgatggatga cactaacttg 1800 cttggaacca tctggaggag tccttaggta acaagactat attaggaagt cacatgactt 1860 gactagacca atagtagacc agctggtgac agcagttagg tttgaaacta gatttgaatc 1920 tcagttgccg tctgccgcta gccgctaccg gaccaagaag tgagtagtgt agctctctag 1980 tacatagatt gtatatacat agcagttacc gttcttagtt ccgtatatgt ctgtcagcaa 2040 tgtctgttat cccttcctag tgttttgtgt atttaataat tagtagtgtg atctctttgt 2100 aaagattgtt gtatgataga tcacttctct ttcagctgga acccccgcgt gtcatccccg 2160 gcttgtttac aaattgcccg ttccatagtt agttaaattc tcatattatt ttgtgctgta 2220 tgtgccgtaa cggctgcttc taaactttcc tgtcctgttg ttttggctgg ctaaatcttt 2280 aatagaacca aggagaaact gtaatctcct cactaggacc aatgtgattg tgtgcatata 2340 caaattccat tcgttctctg ggggtcctta tctgtgccat ctgtgtggcc cgcgcccttt 2400 gatgatagcc gactccagat tcctgccttt gtccccgtct tgtcctgctg tgatttttat 2460 ggccgtgttg tttgaataat tttacatgta ccgttcatgg gttcggggca gttactagcc 2520 gttttttttc cattccagcc gctgagcctc tgtttcccaa tcgcctgtcc tgccatgccg 2580 acgaccgagg cctacctcct ggtaacactt acgacagtct gtgcccggtg ccgttgtatc 2640 ctgccagaac tgtttctact ctgaagacgg acagtgcctg gactatgcaa tggagcggac 2700 tgtcctcgag acgtgccgtc gatattgcgg ggtcttgcca gagcatcttc cgtcgccaat 2760 gatcgtccac taccacaaag acgcagaaga gggcgccccg gacgatgatt ccacagtcaa 2820 cacgatacgg acttatagag gacttgaaaa tggactgtcg tactgatttc gtaatgacat 2880 ttttatccta cgaatattga ttgtatggtt tataacgtta gacttgtgtc ggatactgta 2940 catttgatat tgtacattta aactgatcag ttttgttatg atgttattga cttgggcttg 3000 atattattgg atctgacagt gaactgttta actttggctt cattcgagct tcactttttg 3060 tgaagaccaa tgaggttgta gtgagctaaa acttccttgg taagactaag agacaaatgc 3120 atattgattt gccccttgac gttaacatga tctgttcctc cacataatgt atttgttgtg 3180 atacttgtgt catctgattt atacaggaaa gtctttgaaa agctttcaca aaatgtagcc 3240 ttaccgagtt agcattttcg gggttaaaac ggacaggaaa cgttaaatat taggtattta 3300 gcatgatcgt aaagttactg gatattctga aataatgtgt tgtattgttc ctatgatgtt 3360 tcagtatatc ttaagtttat gcaacctttc cggggcgtaa tagccccgaa cgcgctcaga 3420 aaatatgaaa tattcggaac gtgttttgtc tccaaagagt cctagaacgt tccattactt 3480 gaaatccgga tatgcgacat gattatatgg tcccattctt gactcctgaa attatccgga 3540 ttaatacgaa atattcggaa cgtgatttgt atccaaagag tccgaacacg ttccattact 3600 ggaaatccgg atatgagacg taattatttg gtcccgttcc ggacttctga aattatccgg 3660 atatatacga aatattcgga acgtggtttg tctccaaaga gtccgaaaac gttccattac 3720 tggaaatccg gatatgaggc aggaattaaa tggtcccgtt ccggactcct gaaattttcc 3780 gaaaacgcag gggagtcatg tcagatttcg tccggatatt gtccggatat atccagaaac 3840 ggtcccgatc cggatgtata caaacataga aacgtccgga aatgacaccc tttcgcacag 3900 tg 3902 // ID Copia-9_AA-LTR repbase; DNA; INV; 154 BP. XX AC supercont1.25; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_AA_; KW Copia-9_AA-I; Copia-9_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-154 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.25; Positions 1086112 1085959. XX SQ Sequence 154 BP; 45 A; 32 C; 23 G; 54 T; 0 other; tgtagcagca aagtgtaacg caagaaccgc catcttattt ccttcaaagt aaaatgtgct 60 ctcgctgtgt aaacatcgtt tcgtttaata caattcttta tttcgtttga ttaattaaat 120 actgcgtttt aattccactg gaacatacgt ccca 154 // ID BEL-182_AA-I repbase; DNA; INV; 6552 BP. XX AC AAGE02029203; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-182_AA_; KW BEL-182_AA-LTR; BEL-182_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6552 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02029203; Positions 16776 23327. XX CC Positions [5555-6133] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 41..6552 FT /product="BEL-182_AA-I_1p" FT /translation="MDMDGHVGGSARGESSCTDCQRPDTAEDMVQCDQCDV FT WKHYTCAGVNESIANRSFVCGNCTAQQSAVDQGAVDQNASDVISVRSGSSR FT SSSTSAFSVCLSQMVERQKLERARIELELQRRHLNEQQQLIDKVLDGEETA FT SHRSGGLPVGAKGGVNNELAPPASKALPTPPPSGALPTRAAGPSKQSTVKT FT TDSIEPLMVTIPLSVLDPSTVESLKLRENPLELRNRIEHCEFRANPTPEQE FT KNSSMMVSIPFSALDSGTVQSLKAKKDPQTALRELRNLVEQCENRANPTPE FT EMADLQVQVERCRKLLERFQTRAEPCKAARQQETSYPEKLQPLNPDANSTR FT IDKQTGAIPKATRNLPVVVENDQVLSQAGAVGGIDTTKEIWPLKKPVSRTC FT WAPVSQSIAAPETRPNDAGPSGKRQALNNLLNINEYTSKNCPTTNNPMTRE FT QPPFDARPGETHTLPTNSHEQLRESFARSGDATLFNSELNQNSTRLLTRNL FT PTRAVDSSLPPRHSTNYQANPNPDPVRPTQQQLATRQSLAKDLPRFSGDPA FT EWPIFISNYRYTTEACGYTNGENMLRLQRCLFGPALETVRSRLVLPAAVPQ FT VIETLRLRFGRPELLINALLRRVREIPAPKSDKLEGLIDFGMAVQALCDHI FT EAANERAHLSNPSLLQELVAKLPADQRMMWAGYKRGFQNVDLRTFGDYMAA FT VVQDATSVTTFEPEVKRINARDRPKHKGFINSHAIETTERNGDSSYASKET FT PKQIICAHCNKSGHRLRECNSFIQLKLDDRWRRIRELKLCQSCLFSHGRRA FT CRIRKTCEIDGCQYSHHKLLHSSGGPPKPSAPQLQVAENHTHRQSTSSTLF FT RIIPVTLHGDSGSISTFAFLDEGSDMTLVENDLITSLGVKGTPLPLCLRWT FT GNTSRIERGSQQVTIEIAGIGQKRRHKLLNARTVSNLGLPRQTFRMEEAEK FT RYQHLKGIPVCSYENVVPRILIGIDNLRLALPLKVREGEGTAPVAVKTRLG FT WCVYGPQVRNNRESFSFHICECSCDDTLHEVVKEFFAIEGAGAGPVDIPQT FT QEEERALTTMEQTTRRIGNRFETGLIWKQDDVEFPDSCSMAVRRLECLERR FT MDRNPELKENLHRQISEYESKGYAHKASKAEMEAADPRRVWYLPVGAVINP FT KKPGKVRVIWDAAAKVEGVSLNSQLLKGPDQLSSLPAVLFRFRQYGVAVSS FT DIQEMFHQIRIREQDKHSQRFLWRTNPSDVPTIYLMDVATFGSTCSPASAQ FT FVKNLNAEQHRERYPEAASAIIDDHYVDDYLASFGSEKEAAKVARDVRYVH FT GNGGFKLHNWRSNSTRVLEQLDEVQPKADKLLNLVDGGKSERVLGMLWSPS FT ADELSFSTQMTEEMQTLIQTQTRPTKRQVLRCVMTLFDPLGLLSPFIIHGK FT VLIQDLWREGTEWDEQISDSVNEKWKRWVRMIVYIAEIRIPRCYFHQATKE FT TYKKSEYHVFVDASGEAYCCAIYLRTINEKGEPQCCLVAAKSKVAPIKPWS FT IPKLELQGCVLGVRWSKFVKENHSLPVSKMVFWCDATVALGWIKADPKNYR FT PFVSFRVGEILEHTTTNQWRWVPSKDNPADEATKWGSGPYFDPESKWFHGP FT NFLRLPESEWPRPKERVIAPTEEMRASILHHCSFIPVIDFERFSSWVRLQR FT ATAYVLRFLHNLAKKQPILTGQLSQAELQVAEEVVFKLVQFECYPDEIAAL FT SNKAPNEIDQKAIGKESSIYRLMPMLDNVGVLRERGRISAASDVCYDVRHP FT IILPRNHRVTELLVHRFHQQYRHGNAETVVNEIRQRYTIPRLRLVVKQVTR FT DCTLCKVRRARPAIPVMAPLPLARLAHHERAFTYTGVDYFGPLLVKLGRSN FT VKRWIALFTCLTVRAVHLEVAYTMSTESCISCVRRFVCRRGPPAEFFSDNG FT TNFQGADRVLRHQISQGLSNTFTSTNTRWNFIPPGAPHMGGAWERLVRSVK FT AAMAEAYSEGKLDDEGLQTLVVEAESLVNSRPLTYLPLDSEEAEALTPNHF FT LMLSSSGVKFCAEGGSATAHNQSSELVRREILGRSWELIQRQLNVFWRRWL FT VEYLPVIRRQSKWFAQTRNLEVGDLVMVAEPTKRSGWERGRIVRTIPNTDG FT QHRQAVVKIGQKSVVRPASRLAQLDLKVYSETPEDSGPHRGET" XX SQ Sequence 6552 BP; 1736 A; 1726 C; 1762 G; 1328 T; 0 other; cgcccgactc cctcgataat cttcaaaaat tcgttgctac atggacatgg atggccacgt 60 gggtggatct gcaagaggag aatccagctg cacggattgt caacgaccgg acacggcgga 120 ggacatggtc cagtgtgacc aatgcgacgt gtggaagcac tacacgtgtg ccggggtcaa 180 cgagagcatc gctaaccgat cgttcgtgtg cggcaactgt accgcccagc agagcgcggt 240 cgaccaaggt gcagtggacc agaacgctag tgacgtgata tcggtgcgca gtggatctag 300 ccgctcaagt tcgacgtcag cgttctccgt ctgcctgagc cagatggtgg agcggcagaa 360 gctggagcga gcacgtatcg aattagagct gcaacgacgt catctcaacg agcaacagca 420 gctgatcgat aaagtgctag acggagaaga aacagcgagt catcgcagtg gaggtctacc 480 tgttggggcg aagggtggtg tgaacaacga gctagcacct cctgcatcga aggcacttcc 540 aacaccaccc ccttctggcg ctctcccaac aagggccgct ggtccatcaa aacaatctac 600 cgtcaaaaca actgattcca tcgaaccact tatggtgacg attccccttt ctgtgctgga 660 tccaagcacg gtggaatcat tgaaattacg agaaaatccc ctggaactca ggaaccggat 720 tgagcattgc gagtttcgcg ccaatccaac acctgaacag gagaagaatt catcgatgat 780 ggtgtcaatt cccttttctg ccttggattc gggcacagtg caatctttaa aagcaaaaaa 840 ggatccccaa actgccctgc gggagctcag gaacctggtt gagcagtgcg agaatcgcgc 900 caacccaaca cctgaagaga tggcagattt gcaggtgcaa gtagaacggt gccggaaact 960 cctggaaaga ttccagacaa gagcggaacc gtgcaaagca gctcgacaac aggagacgag 1020 ctacccagaa aaacttcagc ccttaaatcc agatgccaac tcgacacgca tcgacaagca 1080 gaccggcgcc attccgaaag cgactcggaa tttgcccgtg gtcgtggaaa acgatcaggt 1140 tctttcccag gctggcgcag tgggtggaat cgatactacg aaagagatct ggcccctgaa 1200 gaagccggta agccgaactt gctgggcccc ggtaagtcaa tccatcgcag cacccgaaac 1260 gcgtccgaac gatgcgggac cctcaggtaa gcgtcaagct ttgaataacc ttttaaatat 1320 aaacgaatac acctccaaaa attgccccac gaccaataat ccaatgaccc gagagcagcc 1380 gccttttgat gcgcgcccgg gagaaacgca tacacttccc acaaactctc acgagcagct 1440 gcgagaatct ttcgcgcgct ctggtgatgc gactttgttc aatagcgaat tgaatcaaaa 1500 ttcaacccgt ttactaacgc gtaacctccc cacgcgtgcg gttgattcgt ctctgccccc 1560 tcgtcattca acaaactacc aagccaatcc aaacccagat ccagtgcgcc ccacacagca 1620 gcagttagca acaaggcagt ccctggctaa ggatcttcct cgcttctctg gcgacccagc 1680 tgaatggccg attttcattt ccaactaccg ctacaccaca gaagcctgcg gctacacgaa 1740 cggcgaaaac atgcttcgac tccagcgttg cttattcggg ccggcattgg aaacggtgcg 1800 cagtcggttg gtgctgccag cagcagtgcc acaggttatc gagacgcttc gtttgcggtt 1860 cggacgtcca gagttgctga tcaatgcgct gctacgaaga gtgcgtgaaa ttccagctcc 1920 gaagtccgac aagctggaag gactcatcga cttcgggatg gcggtacaag cattgtgcga 1980 ccacatcgaa gctgcgaatg aacgcgctca cctttccaat ccatcacttc tccaagaact 2040 cgtcgccaag ttgcccgcgg atcaacggat gatgtgggca gggtacaaac gaggcttcca 2100 gaacgtcgac ttaagaacgt ttggagatta tatggcagcc gttgtacagg atgcaaccag 2160 cgtaacgaca tttgaaccag aggtgaagcg gatcaacgct cgtgatcgtc cgaagcacaa 2220 ggggttcatc aattcccatg cgatagagac aactgaaaga aatggagatt cgtcttacgc 2280 ttccaaggaa acgccgaagc agatcatctg tgctcactgc aacaaaagtg ggcaccgatt 2340 gcgagagtgc aattcgttca ttcaactgaa gctcgacgat cgctggcgac ggattcgtga 2400 gctaaagctg tgtcaaagct gtctgtttag ccacggtcgt agagcatgcc gcattcgaaa 2460 gacttgcgaa atcgacggtt gccagtatag ccaccataag cttcttcact cttctggagg 2520 acctccaaag ccatcggctc cacagcttca ggtggcggaa aaccataccc accgtcaatc 2580 tacctcctcg acattgtttc ggatcatccc agtgacgctg cacggagata gcgggtcgat 2640 cagcaccttc gcctttcttg acgaagggtc cgacatgaca ttggtggaga atgacttgat 2700 tacgtcgttg ggggtcaaag gtaccccgct tccgctatgc ctaaggtgga ctgggaacac 2760 ctcacgaatc gaaaggggat cgcaacaggt cacaattgag atcgctggaa tcgggcagaa 2820 aaggcgacac aagctactga acgcaagaac agtcagcaac cttggactgc cccgccaaac 2880 tttccgaatg gaagaagcgg agaaaaggta ccagcacctg aagggcatcc cggtctgcag 2940 ctacgagaac gtagttccca ggattctgat cggtatcgac aacctgcgcc tagcgctgcc 3000 gttgaaggta cgggaaggag aaggtacagc gccagtggcg gtgaaaacca ggctgggttg 3060 gtgcgtctat ggacctcaag tacgcaacaa tcgcgaatcc ttcagtttcc atatatgtga 3120 atgctcctgc gacgataccc tgcacgaagt agtgaaagag ttcttcgcta ttgaaggagc 3180 tggcgcaggt ccggtggata ttccacagac acaggaggaa gaacgagcgc taaccaccat 3240 ggagcaaacg actcgaagaa ttgggaatcg tttcgaaacg ggattgattt ggaagcaaga 3300 tgatgtggaa ttcccagaca gctgttctat ggccgtgcga cgactggaat gcctagaacg 3360 ccgaatggac cgcaatccgg aactgaagga gaatctccat cggcaaatct ccgagtatga 3420 atcgaagggc tatgctcaca aagcttcaaa agcggaaatg gaagcagccg atccgaggag 3480 agtatggtac cttccggtcg gagcggtaat caaccccaag aaaccaggaa aagttcgggt 3540 catctgggac gccgctgcca aggttgaagg cgtatcccta aacagtcaac ttctgaaagg 3600 tcccgatcag ctttcctctc ttccggcggt tctttttcgc tttcgccaat acggagtggc 3660 agtcagctcg gacatccagg agatgtttca tcaaatccga atccgagagc aggataaaca 3720 ttctcagcgt ttcttgtggc gtaccaatcc gtccgatgta ccgaccattt acctgatgga 3780 cgtcgcaaca ttcgggagca cctgctcccc agcttccgca caattcgtca aaaatttaaa 3840 cgctgaacaa catcgcgagc ggtatcctga ggcggcgagt gctatcattg acgaccacta 3900 cgtcgacgac tatttggcga gtttcggatc cgagaaagaa gcagcaaagg tagcccgaga 3960 tgtacgatac gtacacggca acggtggctt caagctgcac aactggcgtt cgaacagcac 4020 tagggtactg gaacagctgg acgaagtgca gcccaaagct gacaagctgc tgaacctagt 4080 cgacggtggg aagtccgaga gagtgcttgg aatgctgtgg agtccgtcgg ccgatgaact 4140 gagcttctcc actcagatga ccgaagagat gcagacactg atccaaaccc aaacgcgacc 4200 gaccaaaagg caagtgctac ggtgcgtcat gacgttgttt gaccctttgg ggttgctctc 4260 accgttcatc attcacggca aagtcctgat acaggactta tggcgagaag gcacggagtg 4320 ggacgaacaa ataagcgata gtgtcaacga gaagtggaag agatgggtga gaatgatcgt 4380 gtacatcgcg gaaatacgga tccccagatg ctacttccac caagccacta aggaaaccta 4440 caaaaaatcg gagtaccacg tgttcgtcga cgctagcggg gaggcgtatt gctgcgcaat 4500 ttatctacgc acgatcaacg aaaaaggaga accgcaatgt tgcctggtcg ccgccaaatc 4560 gaaagtggct ccgataaaac catggtcgat ccccaaactt gagctccagg gatgtgttct 4620 cggggttcgc tggtcgaaat ttgtcaagga gaatcatagt ttgcccgttt caaagatggt 4680 cttctggtgt gacgccacag tggcactggg ctggatcaaa gctgacccca agaactatcg 4740 accatttgta tccttcagag taggcgagat tctggagcac acaacgacaa accagtggcg 4800 ctgggtacca tcaaaggata atccagctga tgaagcaaca aaatggggta gcggaccgta 4860 tttcgatcca gaaagcaagt ggttccatgg acccaacttc cttcgtctgc cagaaagcga 4920 gtggccgcgg cccaaagagc gggtgatcgc ccctaccgaa gaaatgcgtg catcaatcct 4980 tcatcattgt tcgttcatac ccgtcatcga ctttgaacgg ttttcttcct gggttcgact 5040 gcaacgagcg acggcatacg tccttcgatt cctgcataat ttggccaaga agcagccgat 5100 attgactgga caactctcgc aagctgaact ccaggtagct gaagaagttg ttttcaaact 5160 ggtgcaattc gaatgttacc cagacgagat cgcggcactt tcgaacaagg cgccaaacga 5220 aatcgatcaa aaggctatcg ggaaagagag ctcaatttac cgactgatgc cgatgttgga 5280 taacgtcggt gtactgcgtg agcgtggccg aataagcgct gcctcagacg tctgctacga 5340 cgtgcggcat ccgatcatcc ttccacgaaa ccatcgagta acggagctgc tcgtacaccg 5400 attccatcag caataccgcc acgggaatgc cgagacagtg gttaatgaga tccgtcaacg 5460 atacactata ccgagactcc ggctggttgt caagcaagtg acccgcgact gcacattatg 5520 taaagtccgg cgagcacgac cagcaattcc agtgatggct cctcttccac ttgcgcgcct 5580 agcccaccac gagcgagcat tcacctacac aggggttgac tacttcgggc cattgctggt 5640 aaagctgggc cgctccaatg ttaaacggtg gatcgctttg tttacgtgtt taacagtgcg 5700 cgccgtccat ctcgaggtcg cctatacgat gtcaacagaa tcctgcattt cctgtgtccg 5760 tcggttcgtg tgccgtcgag gtccacccgc tgaattcttc agcgataacg gaacaaactt 5820 tcaaggcgcc gaccgggtat tgcgacacca gattagccag ggactgtcca acacgttcac 5880 cagcacaaac acaagatgga acttcatccc acctggagcg ccacacatgg gaggagcgtg 5940 ggaacgcttg gtacgatcgg tgaaggctgc aatggccgaa gcgtactccg agggaaaatt 6000 agatgacgag gggctgcaga cactggtcgt ggaggcggag agcctagtaa attcaaggcc 6060 tctgacgtat ttacccttgg attctgagga agcggaggcg ctcacaccca atcatttctt 6120 gatgctcagc tcgagtgggg tgaaattttg tgctgaagga ggttctgcta ccgcccataa 6180 ccagtccagt gaactcgtcc gccgggaaat actggggaga tcctgggaac tgatccagcg 6240 tcagttgaac gtattttgga gacgctggtt agtggaatac ctgccagtga ttcgtcggca 6300 gtcgaagtgg ttcgcgcaaa cccggaatct ggaagtggga gacctggtaa tggtagcgga 6360 gcctacgaag cgaagcggat gggaacgagg acgaatcgtc cgcacaattc caaatacaga 6420 cggccagcat cgacaggctg tcgtaaagat tggacagaag agcgtagtgc gaccggcatc 6480 gcggttggcg cagcttgacc tcaaggttta cagtgagact ccggaggact ccggacctca 6540 ccggggggag ac 6552 // ID Helitron-2_HM repbase; DNA; INV; 14302 BP. XX AC . XX DT 30-DEC-2008 (Rel. 13.12, Created) DT 30-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Helitron DNA transposon from hydra - consensus. XX KW Helitron; DNA transposon; Transposable Element; Helitron-2_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-14302 RA Bao W. and Jurka J.; RT "Helitron DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2057-2057 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(2176..4527,4514..6625) FT /product="Helitron-2_HM_1p" FT /translation="MSNNLRNEIIRLRQNERRQARRAEINRRRNERDEPRR FT AEINRRRIERDEPRRAEINRRRIERDEPRRAEINRRRIERDEPRRAEINCN FT QNERNQARRELNRYNMHRIARNNIAIESNYLGELNHNCQYCSAKKFLNETH FT FLCCHSGKVVLAPLSLYPPLLTGLMTGNHVDHAVNQNFFKHIRSYNSSLSF FT ASFTAEIAPPSNNGPFCFRVCGQILHRVGNLRPAEGCLPKYCQLYIYDPNA FT AVSFRMEQPGNDGCIHELMQLLQTLINQENPFALAFKNMAEVEDEEIRQAA FT LEGRPTSVVRMSLLEGHDRRRYNLPSHEEVAIVFVGDDGAPPASREIVIYP FT RGQPLRTISSMSANLDPLVYPIFFPRGDAGWHNQLEHNPDRATRVRNHVTL FT SQYYNYRLAVRQTFSPIFYGKKLFQQYVVDAYVKVEGQRLAFIRNNQNKLR FT SEQYDALHEHVINRANDLNVRPGRVVILPSSYVGSPRALKENFEDAMAVIK FT KYGKPDLFITFTCNPKWREIVENLNPGQTANDRPDLVCRVFKMKLKFFLED FT IFKHGVLGKVVSHVQVIEFQKRGLPHVHILLHFVNADKLETAEDIDSLISA FT EIPDPAVDPELFEIIKSCMIHGPCGILNPNSPCMKDGKCTKKFPKEFNPHT FT VAIFNGYPRYRRVDNGRIVNIKGNQVDNRWVVPYNPWLSKKYQAHINVEAC FT MTIKSVKYLYKYIYKGHDCANVVINEQVNHDEINTFLNCRYVSAPEALWRI FT FEYSLSDMSHNIIRLQVHLPDNQMIYFVEGEEQKAKNSEEQAALDRAAQRD FT THLTAWFKLNVENEQARHYPYVEIPYHFVFDSKHCKWKVRQRGSNKVIVRM FT YKVSPIGEIFYLRMLLLHVRGAVSFEDLRTVNGTVFNTFREACSQLGLLQD FT DAEWRNTLTEAAATRLPNQIRQLFSIILTFCEPDDPLNLWNAFKDFMMEDY FT IHHSMPPIIAEQAALRQIESIINQNGKTLADFNLPTLDEFLDYVPQNEEED FT VQVLIDEANRVRPLLNDNQRQIADAILSALSEQPNNENKQSRLFFMDGPAG FT CGKTFTYNYLIAETSSRHIITATAAWTGIAATLLKKGCTLHGLFKLPVPIL FT ENSTCNVTPNSIHGKFLRQVSLYLLDEASMIPKHALNAIDKLLQDVCNNKF FT PFGGKVILMGGDFRQILPVVKRGQPADIVEACIKCSQHWQYVQRFSLTENM FT RVQAEEEEFSQWLLKLGSGTLPLKADGSFRGCIEIPEQCLLGENESLVDKI FT FGVAEEDDYAKRAILTPNNVDSLAINEEVLDRLPGDVKVYLSADTIETDDL FT NEINNFPVEFLNSLTPSGMPVHCLKLKIGAVIMLLRNLDLKAGLCNGTRLI FT VRALQNNYIDGQVLTGVSVGKRVFVPRVQLTQSDSNLPFTLKRRQFPVRLA FT YSMTINKSQGQTFDKVGIYLKNPCFSHGQLYVACSRTRSFNTLFFKIDEHS FT LQGMSFNKCYTNNVVFTNVLNL*" XX SQ Sequence 14302 BP; 4639 A; 1932 C; 2259 G; 5471 T; 1 other; tcgcttataa ttaatactta acttaacgct gccgctgctg ctgctcacca ttcgtgcacc 60 tttacacgga cttttttacc atacttggcc tttgagtttt attataaatt gctttacttc 120 tgaactcgcg taatgaggct ctttcaaatc ttaataatct gtcagtatct gcttaatcag 180 taatgagatc aaaaactttg tttcatcagc ttaattagtg ataagatcaa caatttttat 240 ttgataactt ctgtagtaaa ggatgaacat tatgttatta ttgaattagt aaagtaactt 300 ttaaagtcag gtgatttatt taaatcatgc attttagtct tttgtcaatt gtatttaaat 360 ttattaattt ataaaattaa tctgtaattt ttaaattaaa agacgactca agaagacgtt 420 gctcatttca aaggtcaaga ggtcaaactc ttgataaagt tagtgtttaa aacacttgca 480 tctcacacgg tcaactcaat gttacatgtt aaagaacaag atcgtttaat accttattta 540 aaagaagtta taaaaatccg tataaaaaac cagattgttt gatagtttgt ttttgaaagt 600 tgttaaacat ttattacgag gtatttcaaa aaacaaaaat aaggttgtat ttgtaaagtt 660 tcaaatttat atatttttct taatttatat atttatctat atatgtgtgt tatttatatt 720 gtgttttatt gtgtatatat gtgttaaggt tatatgggtt tatattttgt aagtgtgtaa 780 atatatgaaa tgtttcaatg tttgtggtgt gttatatgta atatgtttat agtttgtaag 840 tgataattat gttgtttgca tggtttcaaa ttgttgtgac tgggcttagt ttacagcaag 900 attgttgacc ttgctagcca agtggtgtac tttgctataa tgtgacctgg ctttgttgag 960 agcaaaaata ttgtcctcgc cagcagagtg gtgtattgtc ataataaaac acataaatta 1020 attctgtttg taattagtgt atttctaaaa tgtgttgaaa aaatattgtt gaaaaaatgt 1080 gttttaatgt tgttatcctc tgttcttaac aatgtatgct tttgacaata aataaatttt 1140 gttaaaaagc aggctttact aacagctcct cctcattaaa atttttaaga gtttcgattt 1200 tcttcattaa atatgtttaa ctcaaatttt ttttatatat atataaagta agtatacagc 1260 gtgttgaagc ataagtttta ataacttttg ttaaattttt aggatttttt gtatagagaa 1320 tgttttcttt atattattat tttaatgttt tgtatgtaca tcaatatata tatatatata 1380 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 1440 tatatatata tatatatata tatatagaca cacacacaca ataaaatata tagaattaaa 1500 atgtctatac atatattata tgtatagaca ttttaatttt tttgccttaa attattgtcc 1560 ctttttcttt agaatattaa atatctaaaa aaaaaattaa aatagtaaaa gaagtttcaa 1620 tatttccaag tttcttcact tgaagcaaat atatgattgt ttatagaggt tcttttagta 1680 gttttgtatt tatttattgc aaatcttgcg ttaaatgcct ttcaaagtaa tgaaagggtt 1740 ttgttgatgg atgattacat tttatccttc catgctagca gtacttgctg agttgacagg 1800 ccttaaattt ttttaatttg ctttatttat gagttcttat atgtaaaaaa tttgtttttt 1860 aaattacatt tgtttttttt tttttttttt ttttttgatt aaatacttat aatattttgt 1920 caataattaa taacaactac cataaatata taaacaatac ttagttgaaa atattacatt 1980 aatttataac tttagtttaa gtttttaata aaactatgtt atcttaacac aatgctatat 2040 atggtagaat ataatagatt atagtattat gctctgtatt atttgcttac actttatttt 2100 agataagttt aaaaaagaaa cttcaacttt tattatatta ggcatcattt ttacacactc 2160 ctttgatgcc taattatgtc taataattta agaaatgaaa ttattagact tcgtcaaaat 2220 gaaagaaggc aggctaggcg agctgagata aaccgccgtc gaaatgaaag agatgagccc 2280 aggcgagctg agataaatcg tcgtcgaatt gaaagagatg agccccggcg agctgagata 2340 aatcgtcgtc gaattgaaag agatgagccc cggcgagctg agataaatcg tcgtcgaatt 2400 gaaagagatg agccccggcg agctgaaata aattgtaatc aaaatgaaag aaatcaggct 2460 aggcgagaac ttaataggta taatatgcat cgcatagcaa gaaataatat tgctatagag 2520 agtaactacc tcggagaatt aaaccataat tgtcagtatt gcagtgcaaa gaagttttta 2580 aatgaaactc attttttgtg ctgccattca ggaaaagttg tattagctcc tctttctctt 2640 tatccgccat tgttgactgg tttaatgaca ggaaaccatg ttgatcatgc tgtaaatcaa 2700 aattttttta aacacataag gagttataat tcctctttat cttttgcttc cttcactgcc 2760 gaaatagctc ctccctcaaa taatggccct ttctgtttta gggtctgtgg gcaaattttg 2820 catcgtgttg gaaatttaag gcctgcagaa ggttgtctgc ccaaatattg tcagctgtat 2880 atctatgatc ctaatgcagc agtgtcattc agaatggaac aacctggtaa tgatggttgt 2940 atacatgagt taatgcaact cttgcaaaca ttaattaatc aagaaaaccc atttgctctt 3000 gcatttaaaa acatggcaga agtagaggat gaagaaatcc gccaagcagc tttagaaggt 3060 agaccaacct ctgttgtaag aatgtctttg cttgaaggtc atgatagacg tcgatacaat 3120 cttccttccc atgaggaagt tgctattgtg tttgtgggag atgatggtgc tccacctgca 3180 tccagggaaa ttgttatata ccctcgaggt cagcctttaa gaacaatttc gtctatgtct 3240 gcaaatttgg atcctttggt atatccaata ttcttcccga gaggtgatgc tggatggcac 3300 aatcaactag agcacaaccc tgaccgtgca acccgtgtta gaaaccatgt tactttgtct 3360 cagtattata attataggct tgctgttaga caaactttta gtcctatatt ctatggtaag 3420 aaactttttc agcaatatgt tgttgatgca tatgtcaaag ttgagggaca gcgccttgct 3480 tttattagaa ataaccaaaa caaacttaga agtgagcaat atgatgcatt gcatgaacat 3540 gtaattaatc gtgctaatga tcttaatgta agaccaggcc gtgttgttat attgccttct 3600 tcatatgttg gtagtccaag agcattaaaa gaaaactttg aagatgcgat ggcagtcata 3660 aagaaatatg gtaaaccaga tctattcatt acttttacct gcaatccaaa atggagagaa 3720 atagtagaaa atttaaaccc aggccaaact gctaatgaca gacctgattt agtttgtcgg 3780 gtatttaaaa tgaaacttaa atttttctta gaggatattt ttaagcatgg agtcttaggc 3840 aaagttgtaa gtcatgtgca ggttattgag tttcaaaagc gtggtttgcc acatgtacat 3900 attttattgc actttgtaaa tgctgataag ctagaaacag cagaagatat tgatagtttg 3960 atatcagcag aaatcccaga cccagctgtt gatcctgaac tttttgaaat tataaaatca 4020 tgcatgatac atggaccatg tggaatttta aatcccaatt ctccctgcat gaaagatggt 4080 aaatgcacaa aaaaatttcc taaggaattt aatccccaca ctgttgcaat ttttaatggt 4140 tatccacgct ataggcgcgt cgataatgga agaattgtca atataaaagg taatcaagtt 4200 gacaaccgtt gggttgtacc ttataaccca tggctttcta aaaaatatca agcacacata 4260 aatgttgaag cctgcatgac cataaagtca gttaagtatt tatataagta catttacaaa 4320 gggcatgact gtgccaatgt ggtgataaat gagcaagtta accatgatga aataaacacg 4380 tttttaaatt gtcgctatgt ctcagctcca gaagcattgt ggcgaatatt tgagtattct 4440 ttaagtgata tgtcacataa catcattcga cttcaggttc atctgccaga taatcagatg 4500 atatattttg tagaaggcga agaacagtga agaacaggca gctttagacc gtgcagcaca 4560 gcgtgacact cacctaactg catggtttaa attaaatgtt gaaaatgaac aagcccgaca 4620 ctatccatat gttgaaattc cttaccactt tgtttttgat agcaaacatt gtaaatggaa 4680 agttaggcaa agaggtagca ataaggtgat tgttaggatg tataaagtga gtccaatagg 4740 tgaaatattt tatcttagaa tgttactttt gcatgttaga ggtgcagttt cttttgaaga 4800 tttgcgtact gttaatggta cagtttttaa tacattccga gaagcatgtt ctcaattagg 4860 tctgttacaa gacgacgctg aatggagaaa tacattaact gaggctgctg caactcggtt 4920 gccaaatcaa attagacagt tgttttccat tattttgact ttctgtgaac ctgatgatcc 4980 attaaatctt tggaatgcct ttaaagattt tatgatggag gattacattc accattccat 5040 gccaccaata attgctgagc aggcagcttt gcgtcaaatt gaatctataa ttaatcagaa 5100 tggtaagact ttagctgatt ttaatcttcc tactttagat gagtttttag attatgtccc 5160 acaaaatgaa gaggaagatg ttcaggtttt aattgatgaa gcaaataggg ttagaccatt 5220 gttaaatgat aatcagcgtc aaattgctga tgctatcctt tctgctctta gtgagcaacc 5280 taacaatgaa aacaagcagt ctagattgtt ttttatggat ggccctgcag gttgtggtaa 5340 aacatttaca tacaattatt taatcgctga aaccagcagt agacatatta taactgcaac 5400 agctgcatgg acaggaatag ctgcaactct tttaaaaaaa ggatgtacac ttcatggttt 5460 gttcaaactt cctgtgccga ttttggaaaa tagcacatgc aatgtaactc caaattctat 5520 acacggaaaa ttcttaaggc aagttagcct ttatttacta gatgaggcat ccatgatacc 5580 taaacatgct ttaaatgcca ttgataagtt acttcaagac gtttgcaata acaagtttcc 5640 tttcggtggt aaagttattc ttatgggtgg ggattttagg caaatacttc cggttgtaaa 5700 gagagggcaa ccagctgata ttgttgaagc ctgtataaaa tgttctcaac attggcaata 5760 tgttcagcga ttttcattaa cagaaaatat gagggttcag gccgaagaag aggaattctc 5820 tcaatggctt ttaaaacttg gtagtggaac attaccttta aaagcagatg gttcctttcg 5880 cgggtgcatt gagattcctg agcaatgctt gcttggagaa aatgaatctt tagtagacaa 5940 gatttttgga gttgcagaag aagatgatta tgctaaacgt gccattttga cacccaataa 6000 tgttgattct ttagcaataa atgaagaagt tttggaccgt ttaccaggtg acgttaaggt 6060 ttatttaagt gctgatacta ttgaaacaga tgatcttaac gaaatcaata attttcctgt 6120 tgagttttta aacagtctta ctccatcagg aatgcctgtt cattgcctaa agctaaaaat 6180 tggtgctgtt ataatgttac ttcgaaattt agatctaaaa gctggacttt gcaatggtac 6240 ccgattgata gtgcgtgctc tacaaaataa ttacattgat gggcaagttc taacaggtgt 6300 ttcagttggc aagagggtat ttgtccctcg ggttcagtta acacaatctg attcaaattt 6360 accttttact cttaagcgtc gtcaatttcc tgtaagatta gcatactcaa tgaccataaa 6420 taaaagtcaa ggtcaaactt ttgacaaagt tggtatttat ctcaaaaacc cttgcttttc 6480 acatggtcaa ctctatgttg catgttcaag aacaagatcg tttaatacct tattttttaa 6540 aattgatgaa cattcattac aaggtatgtc atttaacaaa tgctacacaa acaatgttgt 6600 atttaccaat gtactaaatc tataaataat taatttttta caattaaact cctcatttct 6660 tataaatatt ttaactgtaa tttaaagtat ttttgtgtgt tgcttgatgc agtaatgtgt 6720 gatatttatt atgtgtttaa atatgtgtta aggttatatt ttgtaagtgt ttatgtgtaa 6780 ctaatgtgta aatgactgtt atatgtgaag tgtatttttt atgttatctg tttatagttt 6840 gtgagttgtt attatgttgt tttcatggtt tcaaagtggt gtgacttggc tttgtttatg 6900 gcaagagtgt tgtccttgct agccaagcgg tgtactttga ataatgtgac ttggctttgt 6960 ttatggcaag agtgttgtcc ttgctagcca agcggtgtac tttgaataat gtgacttggc 7020 tttgtttatg gcaagagtgt tgtccttgct agccaagcgg cgtactgaca tgaaacacat 7080 aataaaatct gtttgtaatc acaaaatgtg tacttgcaag tatgttatgc tgttagtaaa 7140 taaagtattt acaataaagt ctggataact caataaatca acttaaataa atcaaactaa 7200 ttttaaaatt ccagttgagc gattgtttat ggatcatact accactaact gaaatatttt 7260 ctaataagct aattgcttga tcctttggtg gtttaagtta ttgagtcttt actgtaagat 7320 tttgtgctat ttgttattga tgtttgtgat gttactgttg tgatgttttg tgttgataga 7380 cgacttggct taccttcacg tcggtgtcta tcgttaaaaa tgattatagg gttgtttcat 7440 actgaaactg actatatata atcattcact aaaagccaag cattcaattt ttatattaaa 7500 ctgtaagggt ttatcttttt ttgattattt ttagtgtgct ttttttgaaa agagtttata 7560 atcgaaactt gcaacaaaaa atcttgaaac ctttacacaa aacctattca cacaaaaaaa 7620 acaatctcaa agtaagtttc ttatttgata tcgccttggt cacacttttg agtttttaaa 7680 taaaagagta cattaatgtg ttatgttttc aataattttg ttttatttta gttcgaatgc 7740 cttctcttaa tgttgatcgt tttttaacat ccaaagggaa gttttgttaa tctttatacc 7800 atttactaaa agatatatta tagggaaaca attcaaattg attttataaa gtacttctga 7860 tactttttat gttatatata atagatattg taaaatttaa aaaaaaaaaa attatttttt 7920 taattttatc tcaagaaaaa tgttttttgt ttctcacaca aatttgtaaa ttttttctct 7980 ttggaattat agtatttgat ggatacattg ctcattttgc tttttttata tatatctccg 8040 ttatatatga atggaaaccg ttaatacttt tattaccatg gtaattacag atctagatag 8100 ttccattccc attaagtatt ttacaaattt tcggtacaaa taatagctat agtagtttaa 8160 taatttgcta atttataaac atttagaaaa atttgttgcg agtttattgt agaaaatttg 8220 cttttcaaac aaatagagct aatttttgac ataattagga ttcgtcacct gattctttgt 8280 caaatacttc tcctttttaa ataggaaagg ttaaatttta taaaaaagtt tgccaaaaca 8340 ctgggagaaa tgtcacgtaa ttggtaacta tgttaaatac acttttgtta ggtccacgta 8400 attttcgaat ctgctcatca ctacgttaga tggcgttatt tctttaaaat accttaaaaa 8460 tcttcaattc aaagtttttg tttcgaatca tcaaaatatt ggcatatgat ataaatatac 8520 gattctaata aaccaaaagt atgtggaaaa attgcgcata cttaggcatc ataaaaactt 8580 aattcacacc aaccatttac ttcgatattt tttagtgtat ggcagaaaaa taaattacga 8640 attgtcttag ataaataaag ttaccggacc tttattactt aatccataaa caattattgt 8700 ttcattcgat ttaacaccac aataaaaaac aaagcttttt ctatttattt tcgaaaaacg 8760 ttttactgga aaccgagttc ttttgtagca ttttagttag agttatagaa aaatctataa 8820 ctcttatttt gttaagtaaa aaaaaaaatc ttatgctctt ctaattttag tcaccaataa 8880 caactatgtt aggaatagta gaaaatgtta tgtgtggtag agaaactgtt ttagtgtggt 8940 gttgtaggag tactgcgggt ctaagaaact ttgttttcat cgataaatta ttctattggt 9000 ctattaaaac agtatgaaaa ctttccagta tttatatttg atgtcttttt ttcattttat 9060 tttataattg taatttaagt ttcaaaaagc agcaaaaaca acttgaaaag aactatcgca 9120 caatgactcg cttaaacaca ggattccaat ttcctgatat acatggaaat gctgatattg 9180 ctttcactat attgtctctt tgtaatgtgt aacatcattt ttacttaggt ttgccaaatg 9240 accggctttt acagagtaga atatagtccc aaacatatac agttcagttc ttttactgct 9300 ttcttcttca taaaagccgg acgtcttgca acccttgaat ttccccttct ttttgtttgc 9360 ataaattaca taattctctt aattataaat actccaaatc cgattttcga acagcttatt 9420 ttaaaatcta aagaaatata agcgattaaa gattacattt tgcaagccaa ctagatgctt 9480 aaacaactca aaagacagcc acgcgctcat tataactctc ttatagtttt agaatctgag 9540 tgagttactg ttttgaatat ctaaaatctt caaattaagt ttttatgatt ttaagttaaa 9600 ttagacttaa gaatatgtgt aagaattgag aataaatatt tactttttat tttagaaatg 9660 aacacacaag aaaatttaaa cggtatgtat ttattgcata tgtgtgttag tgctgtatgc 9720 gcttatatta gatattatat ctatatatct atatatatct aatatatgcg tatacagcac 9780 aaacgctata ctttatttag tacgcatata taaatatata aatataaata gacaaacaat 9840 ttacactctt ataaacagtg ttgggttaac tgatagacaa aataagcact gctatcatat 9900 tttatttatt ggaaataaat aaaaatatca agtaattctc tttttagacg atgatggtga 9960 cttttttaga agtaagtaaa ctataatttc tttagaagta aacaattttt atttatataa 10020 agtgatagtt ataaaaaatt tatttcaagt atggagaaat cagtagcata ttttttatca 10080 aatatttgta tactaatagt aatttgattg ttggtaatag ttatatagtt atatacacgc 10140 agctatttaa tagtgaaatt tcattattat atgtaaaatt caaatcttga aataagaaaa 10200 aaatctttat gtgtttttca actagaaact tggaatttat tttctagacc agatgttgtg 10260 attattatat tttctagact agatgtagtg ctttatcata tgatatgcta aatgtatata 10320 ttgtattgtg tgtatatctt agatgcgcag gagataatgg cagaagagac agagtctttt 10380 ctgccttatt ttaaagaatt cttcgataat caaagttagc attttctatt ttttcttttt 10440 tacttcagtt ttttcgaagt ttatgttaca gtaaataaat ttatgaacta atctttcata 10500 cattttgttt atttagaaaa tttagaaagt agagttagta agcaaaattt cattatatta 10560 cttttatgta tattaaataa aaaataatga ggttggcttt agtaatttgt tcatctattt 10620 agaaaaatta gaagatttac aaaagtccat aggtgagtaa aattgctttt atgtttatta 10680 tacaccctaa ttatttatgt acatctaata ttaaatgcgt tttttcaatt tgagattagt 10740 tttggacttt gtactttttg gtcaattttt ttagatatca ttttaaacta catttgtaag 10800 taactatatt atgttttatg tctcataaaa gttttaataa ataatataat gtaatcactt 10860 atatagtgac aaatccaact tcctcagtaa cgtatgttgt tgtactcact tagacttgac 10920 ttattacatt gattgattgt ttttattgca tttttaattt tcttttattt atctctagct 10980 gctgcttctt ctgttacaag taagttgttc tattcactta tacttgactt aatttatgga 11040 ttaattggtt ttattgcatt tttaattttc ttttatttat ctctagctgc tgcttcttct 11100 gttacaagta agttgttcta ttcacttata cttgacttaa tttatggatt aattggtttt 11160 attgcatttt taattttctt ttatttatct ctagctgctg cttcttctgt tacaagtaag 11220 ttgttctatt cacttatact tgacttaatt tatggattaa ttggttttat tgcattttta 11280 actttctttt acttaactta ttacattgat taattggttt tattgcattt ttaactttct 11340 tttatttatc tctagctgct gcttcttctg ttacaagtaa gttgttctat tcacttttac 11400 ttgacttatt acattgatta attgttttta ttgcattttt aattttcttt tatttatctc 11460 tagctgctgc ttcttctgtt acaagtaagt tgttctattc acttatactt gacttaattt 11520 atggattaat tggttttatt gcatttttaa ttttctttta tttatctcta gctgctgctt 11580 cttctgttac aagtaagttg ttctattcac ttatacttga cttaatttat ggattaattg 11640 gttttattgc atttttaact ttcttttatt tatctctagc tgctgcttct tctgttacaa 11700 gtaagttgtt atattcacct ttacttgact tattacattg attgattgct tttattgtac 11760 cttttgtttt ttaatatagc tacaaatgtg ctttccagtt taattagtta atgtaataag 11820 tcaagtatta gtgagtaaaa taaataagtt gttgtgctca cttgtactta atttattaca 11880 ttatttgctt ttattgtttt tttacagtag attttttgta agtattacct tggtagatag 11940 ttgtatttat atataatttt tttttcaggc tcgaagtcat gggaagattt cagaaagtat 12000 cgctcccagc atccaggccc tctatctatg agggccttcc atcgctggtg tttgtatggg 12060 gacgaccatc cctcctttag atggagggat ggcaccccat acaacacggg gaatagagcg 12120 gatgctggga ggagaagaca acgtggcgca aaagttgtca ataatttttt tacatattaa 12180 aatgtaaaga aattgacaac ttttgagcca cttcattttt tttttgtaat tttttttttg 12240 taatatattt ttgtaaattt atttttgtgt tgttttttaa tttttcccat gaatgttaaa 12300 tgttatcaaa gtaaatcatt taaataaaga cacctttatc aacaatgctc attaactaga 12360 aataaaagtc acggtcatac atttgataaa gttaatgtta atctcaaagg gatttggttt 12420 tcacacaatc aaattctatg tagcaccttc cagaactaag aatataagtg tgtggtgtta 12480 gtagtttaaa atatgtataa taaactcaga ataactcaat aaccctcaac aaaactatga 12540 actaattttt aaattcaagt tgagcttttt tatttattaa tacctcccct cccaataact 12600 ctaattttcc ttaaaaaact aatttttttg tcctttgaag gtggttctgc tatttaatta 12660 attgttgtga tgttttatgc aaatagattt agctcacatt catgttggtc tgtcagggtt 12720 atgtaaaaat aaaaaagtaa gtttcttatt cgcatttttg agttttcaaa agtattgatt 12780 gcatgacatg ttaatttgta attaatgtgt atgattatga ttatgcgttt aaataacttt 12840 gttccatttt tagttccgtt tcttatagct gacttgatta tggcttacct aaagcttaat 12900 atttatgaaa caatgattca atttaaattg tattataagt agctctgttg aataaatttt 12960 ttttttgtaa tttgtactaa aggttatatt tttattttat atcaagaaaa atatgttttt 13020 aagtaaatat ctatattttt ttattttggg atttattgtt ttaaatggat ataaaaatca 13080 tttttgcttt gtcttttata tttcctttta ttatatatga ataggaacca tttatgcttt 13140 aatcaccatt agcactcgaa gtttccattt gcattaagta tcgatttttg gcacacacaa 13200 caggtttaat aacttaggca gtaaaaacag cttgaatgat ttgttaagag tctttaaaga 13260 aacgctttta cgaaaagcat taaatagata aagctaactt atttaaaata ataagtcttt 13320 aacccgtttt tctatatcca aatttacttt ttaaataaat gttgatgaaa gggtaaaata 13380 tcgttatagc ctacgttaat tttaaattct ttttaaatta aaggaaagag ttttgatact 13440 tatttaaata gggttaataa aatattttaa attatattat atatattata tttaagaaat 13500 tattaaacta aattaaataa gatttttaga tgaaacagta aaaataattg tttaaaatac 13560 gtgcgtgcaa ataattccta caaatactaa acgtaaaata acgcttacaa acaccaaaag 13620 tatgtttgca gtgaataaag ttaagttcaa agataaatag gaagaaatgt ttaaaactgt 13680 ttcaataata acatttattt ttaattactt ttgttgtaag ttgggatgaa aaaaaaaagt 13740 cactggagta tgaactcccg gggtgagatg actatatctg tctaagtgtt gcgcgtctac 13800 caattagccc ataaacgtat tttctattta gaatttaaga ttatttatga cgcataagaa 13860 gtgtaattaa aaattcaaaa ccaagtttgc aacaactgaa tttaaataaa atctaattag 13920 atgtgtatat gttatgaacg ttaactaagg cgtactttaa cggcgtcacg tgttgaatgt 13980 tttagcattt ttctaagttc gtagtctttt ttacgttttt tcttattaaa atttttataa 14040 tcggcataat attaaaatcg tttgaagttt ttaaaagaac attttaacag aaatgttttt 14100 gtcatatcaa aaagtaagta tagtttttta tatgtacatt tttaaacggt tttaaaaaaa 14160 tattgaggta aggaataagt atagaaaagg tcggtgtrga atagatttac taagaccctt 14220 agtttacgtc cggatagcta tagataagta taaaaatggt cggttcggta aagacccgta 14280 ttttacgggt ccccccagct ag 14302 // ID BEL-171_AA-I repbase; DNA; INV; 6278 BP. XX AC supercont1.315; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-171_AA_; KW BEL-171_AA-LTR; BEL-171_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6278 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.315; Positions 1017538 1023815. XX CC 'CAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 39..6227 FT /product="BEL-171_AA-I_1p" FT /translation="MEGHHGQLEYSCKSCSRRDTSEAHMLICDQCRLWEHY FT ICAGVSETTPRRPFVCKQCQDKDRGSTISTRLRSHAKRVPSAKTSQGKPTS FT NVESKVSSIGSSSRSSIVKARLQLIEEEERLKQRELEEEEILKELEHVEAQ FT RQLEQKKKQMEEQKKLMEEESILRQRKLEADKARLMKQHMIRRESLEKKNE FT ILLQISERGSIVESATSSREKVSSWLTKHLPKGTTGEKALEIDHRTELISP FT APETVPSMTIEVPINRASKSETGALHSKRQATNPQSFTLGLTVADPIPRVT FT KTIRGVQSTTVREPFQRFVACSRPSQSQPVAALSCPPVTHQACTPYKEAGC FT SALPTNYQQGASWSTPIRYVANPQTATQRSTTSPNVYRGPTEECHHPLEFP FT DRVPMFTNVAGNNYLQNGLPGMLSSQQIAARQVVGKDLPHFSGDPTDWPMF FT ISSFEQATITCGYSNAENLVRLQRCLTGHAREAVRSRLLLPANVPHVISTL FT RTLYGRPELLIRSLHDKIRRTPGPKHDRPESILEFGLAVQNFVDHLQAAEQ FT EEHLSNPMLMQELVEKLPGPMRMDWASYKNLQPRATVLTFGEFMSKLVLAA FT SEVSFELPGLTKVREFDKQQRPREKARIQAHSADGNPYPIHETESAKKPSK FT MCRYCDREGHRVADCSKFIQLCLDERWKQVQDRDLCRTCLNNHGKWPCKSW FT KGCSVEGCRLKHHTLLHSPSVPSSTSHSVNVSANKLPAGDGETLFRILPVM FT LYANGKNVMVFAFIDEGSQITMLEEKIAKELGITGPRKPLTLQWTGNVKRN FT ELRSEEVSLEIAGKHNGNRYELRYARTVSRLHLPAQKLNYTEMTACFPHLK FT GLPIADYDLVQPKLLIGLDNLRLGVPLKLREGGSSDPIAAKCRLGWSIYGR FT ASRESAPRAIVNFHTAAVASPDDLMNEQLRDFFTLENNGVAAPNVKLESDE FT DNRARQILEATTRRTTNGFETGLLWKVDDPQFPNTYPMAMRRMKSLEKKLI FT GNRALAQRVSEQILDFERKGYIHKLTESEFSTVDPRKTWFLPLGVVINPKK FT PGKIRLIWDAAAKTGEVSFNSYLLKGPDLLTPLPRVLSGFRLFPIAVSGDI FT REMFLQIKLQAKDRNAQMFVFRHNPEEPIQVYAIDVTMFGSTCSPSSAQFV FT KNLNAEQYALHYPRAAVAIKEHHYVDDYLDSFRTIEEAVQVVNDVKFVHSK FT GGFEIRNFLSNEGEVLKRTGEIEPDSSKEFALVRGETTESVLGMKWIPVDD FT VFTYTFAMRDDLRSILVETHVPTKREVLKVIMSLFDPLGFVSFFLVHGKIL FT MQDVWASGIGWDEKINDQLFLRWQQWTAYFSQLDTMRIPRCYFSSPFPKNL FT DRLEVHILVDASDTAYACVAYYRLETENGIQMALICAKTKVAPLKVLSIPR FT LELKAAILGVRMLEAIQGFHTYAIHRHFLWSDSSTVLAWIRSEHRRYNKFV FT AVRIGEILTTTEVRDWRWVPSGLNTADLATKWKNEPNFTSGNPWLSGPSFL FT HQPEEFWPQQTLIATTCEELRPNHTHTTIPPLIDASRFSRWSCLQRTMAFV FT LRFIENLRRKRNGDKVLVDRFSQNELNRAETAVLKVAQAEAYSEEINILSE FT TQGPPDARHRRVSKGSPIYKNWPFLDEHGVLRQRGRIGAASFAPIEAKFPA FT ILPRQHPITFLITDWYHRRFRHANRETIVNEMRQRFEIAKLRALIAKVTKN FT CMMCRVKNAQPYPPAMAPLPAARLQPFIRPFTSVGVDYFGPVLVKVGRSSV FT KRWVALFTCLTVRAVHLEVVHSLSTESCIMAVRRFVARRGCPAEFYSDNGT FT CFQGASKELQREETIKRNNALANTFTTSKTRWCFIPPAAPHMGGAWERLVR FT SVKVAIGSLADVSRKPNDEVLETVLLEAEALINSRPLTYIPLESADQEAIT FT PNHFLLGSSNGDKLLPVDDFDSPMILRSSWKLARAIAQQFWSRWVKEYLPV FT ITRRSKWFEEGRDLTVGDLVLVVDGTVRDQWIRGRIEAVMPGRDGRVRQAL FT VRTASGVLRRPAVKLAVLDVLSNGEPGSAVLAEHGRD" XX SQ Sequence 6278 BP; 1767 A; 1501 C; 1566 G; 1444 T; 0 other; caacatccta aaagattatt tgtacgtggg tagacaggat ggaaggacat cacgggcaat 60 tagaatattc atgtaaatct tgcagtcggc gagacacttc ggaagcgcat atgctaattt 120 gcgaccagtg tcggttgtgg gagcattaca tttgcgccgg tgtgagcgag acgacaccga 180 ggcggccgtt tgtctgcaag cagtgccaag acaaggatcg aggttcaact atttctaccc 240 gactacgttc ccatgcaaaa cgtgtaccat ccgctaagac atcgcagggt aagccaacct 300 ccaatgtaga aagtaaagtt tcgtcgatag gctcaagcag tcgttcgtca atcgtgaagg 360 cacgtctgca actgatcgag gaggaagaga gattaaagca aagggagctt gaagaagaag 420 aaatcctcaa ggaacttgaa cacgtagagg ctcaacggca gttggaacaa aagaagaagc 480 agatggagga gcagaaaaaa ctaatggaag aagaatcgat actacgtcaa cgaaaactag 540 aagcggacaa ggcacgtttg atgaaacaac atatgatccg cagagaatcg ctggaaaaga 600 agaacgaaat cctgctgcag atttccgaac gaggcagcat tgtagaatcc gccactagct 660 ctcgagagaa ggtatcgagc tggctgacga aacatttacc taaggggaca accggtgaga 720 aagcactcga gatagaccac cgaactgaac tgatttctcc ggcccctgaa acggttccgt 780 caatgaccat cgaggtgcca atcaacagag catccaagtc agaaaccggt gcactccatt 840 cgaaacgtca agcaacaaac cctcaaagtt ttactctcgg tttgactgtt gcggatccca 900 ttcctcgagt taccaaaacg atcaggggtg tacaatcaac cacagtccgt gagccgtttc 960 aacgttttgt ggcttgttcg aggccgtcac agtcacaacc cgttgcagca ttatcgtgtc 1020 ccccggtgac ccatcaggct tgtacgccat ataaagaggc aggatgttcg gcgcttccga 1080 cgaattatca gcaaggagcg tcatggagca ctcctattcg ctatgtggct aatccccaaa 1140 cggcaacgca gaggtcaaca acatcgccta acgtgtaccg cggaccaact gaggagtgtc 1200 atcatccgtt ggagtttccc gatcgtgttc ctatgttcac caatgtagct ggaaataact 1260 atctacagaa cgggctacca gggatgctta gttcacaaca aatcgcagca aggcaggtcg 1320 tcggtaaaga tctacctcac ttcagtggtg atccgaccga ctggccgatg tttattagca 1380 gttttgaaca ggccacgatt acatgtggat actcgaacgc tgagaacctt gttcgtctcc 1440 aaagatgttt aactgggcac gcaagagagg cagtacgtag tagactttta cttcctgcca 1500 atgttccgca tgtgatcagc actctacgca cactttacgg tcgccccgag ttgctgatcc 1560 gatcgttgca tgataaaatt cgaagaactc cagggcccaa acacgatcgt ccagaatcga 1620 tcctggaatt cggattagcg gttcagaact tcgtcgatca tctccaagca gcggaacaag 1680 aggaacattt gtccaatcct atgcttatgc aagaattggt cgagaaactg ccagggccca 1740 tgaggatgga ttgggccagt tacaaaaatt tacagccacg ggccaccgtc ctgacattcg 1800 gagagttcat gtccaaacta gtattagcag ccagcgaagt aagtttcgag cttccgggac 1860 tcacgaaagt gagggaattc gataagcaac aacgaccaag agagaaggcg aggatacaag 1920 cacactcggc cgacggtaat ccgtacccca tacacgaaac agaaagtgcg aagaaaccat 1980 cgaaaatgtg tcggtactgt gatcgtgaag gtcatcgagt tgccgattgc tcgaagttca 2040 tacaactatg tctagatgag cgttggaaac aagtgcaaga tagggatctc tgtcgaacgt 2100 gcttgaacaa ccatggcaaa tggccatgca aatcgtggaa ggggtgcagt gttgaaggat 2160 gtcgattgaa acatcatact ctcctccatt caccttccgt accttcatca acaagccact 2220 ctgtgaatgt ttcagcaaat aaacttccag cgggcgacgg cgaaacgttg ttcagaatat 2280 tgccagtcat gctctatgca aacggtaaaa atgtaatggt tttcgcattt atcgatgagg 2340 gatcacagat aacgatgcta gaggagaaga tagccaagga acttggcata actggtcctc 2400 gaaaaccact tacactacaa tggacaggga acgtgaagcg taacgagctg agatcagagg 2460 aggtcagttt agaaatagcg ggaaaacaca atgggaaccg ctacgaactt cgttatgcgc 2520 gtacagtcag ccgtttgcat cttccagcac agaagttgaa ctacacggaa atgaccgcgt 2580 gtttcccgca tctcaaagga cttccgatag cggattatga cctggtgcag ccgaagctac 2640 taatcggcct agataatctt cggctcggtg taccgctgaa gctacgagag ggaggatcat 2700 ccgatcctat agctgcaaaa tgccgattag ggtggagtat ttatggccgt gcttcaaggg 2760 aaagtgctcc aagggcgatt gtaaacttcc ataccgcagc agtggcaagt ccagatgacc 2820 taatgaacga acagctgagg gatttcttca cgttagaaaa caatggagtc gccgctccaa 2880 acgtgaagct cgagtcagat gaagacaaca gagcgcgaca gatactagaa gcgacgacac 2940 gacgcactac taatggtttc gaaacaggtc ttctgtggaa agtcgatgac ccacaatttc 3000 caaacaccta tccgatggct atgcgacgaa tgaaatcgtt ggaaaaaaaa ttgataggga 3060 atcgagcgtt agcgcagcga gtaagtgaac agatccttga cttcgaacgg aagggatata 3120 ttcacaaatt aaccgaatca gaattctcca ctgtcgatcc tcgaaagaca tggtttcttc 3180 ccctcggagt cgtcatcaac cccaaaaagc caggaaaaat tcgattaatt tgggacgctg 3240 ctgcaaagac tggggaagtt tctttcaact cgtacttgtt gaaagggcca gaccttctta 3300 ctcctctgcc aagggtgctc agtggcttcc gtctgtttcc cattgcagta tctggagata 3360 tcagggagat gtttcttcaa atcaaactgc aggcaaagga cagaaacgct cagatgttcg 3420 tgttccgtca taatcccgag gaacctattc aagtttacgc gatcgatgtg acgatgtttg 3480 gttccacgtg ctctccgtcc tctgcgcagt tcgtcaagaa cttgaacgcc gaacagtacg 3540 cgctacatta ccctcgtgct gcggtggcta ttaaggagca tcactacgtg gatgactacc 3600 tggatagctt taggaccatc gaggaagctg tgcaggtagt gaacgacgtg aagtttgtcc 3660 attcaaaggg tggattcgaa atacgcaact ttttatccaa cgaaggcgaa gtcttgaaac 3720 gaactgggga gatcgaacca gattcgtcaa aagaattcgc tcttgtccgt ggggagacta 3780 ccgaatctgt tctcgggatg aaatggattc cagtggacga cgtttttacg tacacctttg 3840 caatgcgtga cgatcttcga tcaatcctcg tagaaactca cgtaccgaca aaacgagaag 3900 ttctgaaggt aatcatgagc ctttttgatc cgttagggtt tgtctcgttt ttcttagtcc 3960 atggcaagat attgatgcag gatgtttggg cttcgggcat cggctgggat gaaaagatta 4020 acgatcaact tttcctacga tggcaacaat ggactgcata cttttcacaa ctagacacca 4080 tgcggatacc gcgctgttat ttcagttctc catttcccaa aaacctcgac cgcttagaag 4140 tccatatcct cgtcgatgca agtgacactg cctatgcctg cgtggcatac taccgattgg 4200 aaactgagaa tggaatccaa atggcactca tatgcgcaaa aacgaaggta gcacccttga 4260 aagttctatc aattcctcgg ctagaactaa aggcggcgat cctaggtgtc cggatgttgg 4320 aagctattca aggattccac acttatgcta tccaccgtca ttttctctgg agtgattcaa 4380 gtaccgtctt ggcttggatc agatcagaac atcgtcgata caacaagttc gttgccgtcc 4440 gaattggtga aatccttacg acaaccgaag ttcgagactg gagatgggtc ccctccggat 4500 tgaacacagc cgatctcgcc acgaaatgga agaatgaacc taactttact tcgggtaacc 4560 catggctaag tggaccatcg ttccttcacc agccggagga attctggcct cagcaaaccc 4620 tgattgccac tacttgtgaa gaacttcgac cgaatcacac ccatactaca attccaccgc 4680 ttatcgatgc gtctcgattc agcaggtggt catgtcttca acgaacgatg gcgtttgtac 4740 ttcggttcat agaaaactta cgccgtaaac ggaatgggga caaagtacta gtagatagat 4800 ttagccagaa cgagttaaac cgcgctgaga cagcggtttt gaaagttgca caagccgaag 4860 cctattccga agaaattaat attctgtcgg agacacaggg acctccggac gcacgacacc 4920 gacgtgtttc aaagggtagt ccaatctata agaactggcc tttccttgac gaacacggag 4980 tattacgcca acgtggcaga atcggggccg catcgtttgc gccaatcgaa gctaagttcc 5040 ccgctattct tcctagacaa catccaataa cattcctcat tacagactgg taccatcgcc 5100 gtttccgcca cgccaaccga gaaaccatcg tcaatgagat gcgacaacgt ttcgagattg 5160 ccaaactacg tgctcttatc gcgaaggtca caaaaaactg catgatgtgt cgagttaaaa 5220 acgcccagcc ctatcctcca gcaatggcac cgctccctgc agcacgacta cagccgttca 5280 ttcgtccttt tacctcagtt ggggtagatt acttcgggcc agtactagtg aaggttggga 5340 ggagctcagt gaagagatgg gtggccttgt ttacatgtct cactgttcgc gcagtgcatt 5400 tggaggtagt ccatagtttg tctaccgaat cctgcattat ggcagtgcgg cgcttcgtag 5460 cacggcgtgg atgtccggct gaattttatt ccgacaacgg gacatgtttc caaggtgcaa 5520 gcaaggagct gcaaagggag gagactataa aacgtaacaa cgcccttgca aatacgttca 5580 cgacctccaa gacacgttgg tgtttcatcc caccagctgc tcctcatatg ggaggtgcat 5640 gggagcggct agtgaggtca gtaaaggtag cgataggctc cctagctgac gtgtcacgta 5700 aaccaaacga tgaagtcctc gaaacagttc tactcgaagc tgaggctttg attaactcgc 5760 gacctctcac ctacatccca ctagaatcag cggaccagga ggccatcacg cctaatcatt 5820 tcctcctcgg gtcttcaaac ggcgataagc ttctaccagt ggacgatttc gatagtccta 5880 tgattcttag aagtagttgg aaacttgcac gagccatagc gcagcagttc tggtcaagat 5940 gggtcaaaga atacctaccg gtgatcacac gacggtcgaa atggtttgaa gaaggtcgag 6000 acttgaccgt aggagatttg gtgttggttg tggacggaac ggtgagggac cagtggatac 6060 gaggaaggat agaggccgta atgcccggcc gtgatggaag agttcgacag gcattagtcc 6120 ggacagcatc cggagttttg cgtcggccag ctgttaaact tgcagtacta gacgtgctga 6180 gtaatggtga acccggatcc gcagttcttg cggaacatgg tcgtgactag ggttcacggg 6240 aggggggatg tcacgttacc cccctgctac agctcaac 6278 // ID Sola3-1_Lgigantea repbase; DNA; INV; 4837 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Sola; DNA transposon; Transposable Element; Sola3-1_Lgigantea. XX OS Lottia gigantea OC Eukaryota; Metazoa; Mollusca; Gastropoda; Patellogastropoda; OC Lottioidea; Lottiidae; Lottia. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4837 BP; 1569 A; 921 C; 970 G; 1377 T; 0 other; gaacgcgtcc gcgagagcac caggcaaaaa tcaagaaaag tgctaaggtg ggaagttgga 60 atatttttac caaaatcatg tcataagacc tcataggtgc taggagtcac atgctgaaat 120 gattcattcc ccgacccctt acaggcgaag aaatcgaata cacttttatc atgaaaaaat 180 catcgtaata tttgatcatt ttgtccaaac ttcgcccata gtgtatgaat aacaattgat 240 accctcagtc tgattttttg atttgaatga tgtatcattt agtccacttt cacccgaaca 300 taagtatgct gctagtacta gtccagtagt aatgttactg aactttctga aataggtctt 360 tttcatacgc cttttcctat acaaatctaa ataagctcta gctctgtctt tgcaaggctg 420 agaccaacta gcatgtgatg ttcgaataga ccaatggttg ccttaaaagt gccagccaaa 480 aaaaattgag gaacgcagat cactgcatag aaagaggccc ttaaaaaaaa gaaaaaattt 540 gaaattttaa gtcatatttt tctcatgtga atgaagactt tctcaaatct tcgagaacta 600 atatagttgt gatacaaatt acctgttcaa aaggttatag ccattctgaa agaagaattt 660 aatctgctca gtgacacctt tcaaaagcct ttgttgtggt gagagaactc gacttacgct 720 ttacagaaat taacgtcaaa atgactacca actgctcgat tgcacggttt gtctccatga 780 tttcatgcgg aaatcacccc gcatacagta ataaaacgac catcatacca cttacatcct 840 gtaacaagga catttctgga catttgaggc gagtaaacct cccaaaacaa aatatatcat 900 gtgaatatga attgattttg tgccggactg gtaatttttc tatgacagat ttaccaattg 960 catctatgac tgtttgccct gcacatagat acagacttgg attatgctgg aataaaccca 1020 gaggttgtgc acacccactt cataagcaaa gattagcaaa aatgaaacca agtcgaatgg 1080 gagtgaagct gtctttatca caagaaattt atgaacgatg gaatgttctg gtcccagttg 1140 gttccggcat ctgcaaaaca tgctatagtc agcatagtgc tgttgcatct tcacaactgt 1200 cttcaatgcc actaactcct tccattccat ctgcaacaag tttaacgaca atgcctacag 1260 aactcaactc gccattgccc tcgacttcaa cacctatgtt tgcagaagca catttcccac 1320 agatacagat acagaatcag atcaggggga tagtagtagc tcagatccac catcgaattc 1380 tgagactata gcaacatttg aagatgcatt ctcttgtgac aagtcctggg ctccaacacc 1440 ggctacagct agaagtgaaa ttggtattct gaatgcattt ttgaaaactt ttggaagaga 1500 accgattaga ggtcagctga aaaaacctat cgaagatatt gctccttcaa caatgcggta 1560 ttataaaaac aaaacacatg atactatatc aattgtgttg aatttgattg ccccgaatca 1620 ggaggaaaaa ttaatggata ttgtgatgaa atcagaaaag acgggtattg attcggatga 1680 tagactcctt gaacaacttg taaactgcta caatcgagcc tcctcttggc aaacaaagcg 1740 ccaaattttg tcaatatctg ttcagaacca tactaagtca gcgctattgg atttgatacc 1800 gggtctgaca aagtacagga tcgaccaggc aagaaaacat gccatggaag ctggccacgc 1860 acaaccagtt gagagccatc ctgtaataag aaaaagactg gacgatacaa aagttgacca 1920 tttcttggat ttcatttcaa gacctgaata cacacaagac gtggcatatg gaacaaaaaa 1980 tatgaagtta agtcacggag aaaaattaga gataccaaat gtggtacgta cagtcattgc 2040 atctagactt atagacctgt acaagtccta ctgtcaagag attgactttg aaccacttgg 2100 acgcactatt atgtataata tacttcaagt ctgtgctgca tcaaagaagg tttctcttgc 2160 agttgaggga acggaaggct tttcaacgct tagacaaatc tcgaaaaccc tctcggagct 2220 cggtttgagc acagcatggt tggatgagat aacgaaatca ctcgcttcaa ctgaactgta 2280 ttttaaaacc gatatgaaat tgaacttatc agtctcctcg caatgtatca gtcattgcat 2340 tacctatgct ctgtcggaac aagagtgtga tcatgaacac actacgagct gttcaaactg 2400 tgagaaaatt cattcagttt tggaaaaaat tgagagcggt tactccgctg ctcaattgaa 2460 gttttcatac gaaggtcaaa aagaagaatt gatttatgac tttcaaaatg ccaaaagtgc 2520 agtgttaaat tacagggcgc atatcgtgcg tgccatcaat caagagcggg cgaaaaatga 2580 cattttgaac cggttagttc aaaatgaagg ttttattaca atggactggg ctatgaagtt 2640 tttgcccact cgattcagag aatcccagca agattggttc gcaaaaaagg ggatcagttg 2700 gcatgttact gctttcgtgg tgaaatcgtc tgatggagat tttgatatat actgttttgt 2760 tcatttaatc gaaaaatgta atcaaaactg gaatagtgta caagtgtctt ggaagcgagt 2820 attgaaaaaa tgaaagaatt agtgcctcaa gtgaacaaac tgtatgtcag acgcgacaac 2880 gccggatgct accattgtgc accgttaatt ttatccatgg ctgcgcttgc aaaacgtcaa 2940 aatgttacta tcactgctta tgatttttct gaagctcagg ctggaaagga tacttgtgac 3000 agaaaaatcg cctcattaaa agcacatatc caacgatata ttaatgatgg ccacaatgtt 3060 acaacggcat ctgagatgtt tgatgcatgt aggtcgtata acggagtgcg cggctgctgc 3120 gttagtgtca tgaacattga tgaaacagta ataagtgaac gcgtcacatg gactggaatc 3180 actacaatga ccaatttcac ttatgaagaa aatggtgtaa gggtttggaa agcgtttgga 3240 atcggaaaag gagtactaca cagttatgat aaactcttac ggaataatga gtgtaatgtg 3300 cggccaatgt tcactatggg ggaatattcc actcctcaaa tatctactgg tgctgtgcat 3360 cgtaccactg aaacactttt ctcctgttca gaagagggct gttcgaaact gtttcgaaca 3420 gaaaatgaac tcacaagaca tttatctgtc ggaagacatg agaggacgcc tgtacttgac 3480 acaaagtttg actcaattaa acgaaaatgg caaggtcgtg ttgcggaggt tgcggcacag 3540 aagaaagtac tatacactgt cagtgcttca gcgagtactt caacgaaatc cgttgtaagt 3600 agttgccctg atttacctat gggttgggcc ctgaagatca ccacaaagcg ttcaacataa 3660 tattcgaaac aattgaaaga ctttctcttg gaacaattta tgattggagt gcacactggt 3720 agaaaagaag atgccgcgac ggtagccaaa cgtatccagt ctaagttcag caaggatgat 3780 tggttgaatt cgagacaaat tgcctcttac ttttcaagat tggcctcctt gcaaaaatgt 3840 ggacacctag atgcagaatc aaataagcct gcgacaacca tggatgtgga aaataacgac 3900 tgtaatattg tagaacaggt gctggaacga cgtcgcctac gacgtagaat aatatccgat 3960 ttggatgaat aaattgatcg cgatgtgata ttcacggatt taatgtcaat gaattgtatg 4020 cagttgaaaa aatctctgct tgatattata tcgggtggta ctaaaaaaac gataacactt 4080 ttcatatcgg gattatgcac aaagtattga caaggaattt taaaaatgct gccaattcaa 4140 agtgttttaa catatgaggg tacttttttg tgaaaagatg ttttatatat ctatgattat 4200 gagggacaaa atattatggt ctgagctttg atacccgatg aataatccag aaatgaaaag 4260 agttatcgtt ttttttttgt accacccgat ttcagatatt tacatattta gtcatgatat 4320 tcagttctgt ctaccgtcct ctttttatgc attcccacaa attttatttt tgatgttgat 4380 agtagagggg tagaggatca taagcagctc aatactttct tgctcacccc tgcaaagaga 4440 gtgctacaag gtgttaaaac taacatatga aaaataataa ttttcacgtt gtgttttgta 4500 cctagtcagt tttcaggtgt cagttgagca atctattcca aacttgccat gtagtgagag 4560 gataacatat cctaaaacat gtagaaattt caaattcatc taatcaaagg gaagggcgta 4620 gtcattgtgc aaacttttca tagaaaaatg gctgaaaata gtcattttcc ctcatcgaaa 4680 cagttttctt tcaattttca ccgttatgaa atgggtgatc agtctggtat tatatattta 4740 agtgtttagg tactacccct acctatatac aaaatttggt ccaaatccct gactacagcc 4800 ttagcacctc ctgcctggtg ctctcacgga cacgctc 4837 // ID Copia-15_CQ-I repbase; DNA; INV; 4183 BP. XX AC AAWU01015815; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_CQ_; KW Copia-15_CQ-LTR; Copia-15_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4183 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 345-345 (2011). XX DR GenBank; AAWU01015815; Positions 53755 57937. XX CC Positions [1403-1939] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 41..4174 FT /product="Copia-15_CQ-I_1p" FT /translation="MSDRNYGIERLVGRENWADWKFAVQTHLEVEDLWDAV FT EPKVKADGTVEAVDPAKSKLARGKIIQFLEPVNYYHVREAKTAQEAWKNLV FT NAFEESGLTREVGLLHKLIRTDLDSCGTMEIYANQIISACHQLKSIGFPIP FT DRFVGALLLAGLPDRFKPMVMALKNSGMAITGDSVKTILLQEEDELLPGEK FT TFVAKRRVNKPGGYRRNTQTAGGFHGDDKTASGSRSNHGSRKSQPEERGPR FT CRKCNKIGHIAKHCPEKDAVRGKQALCTVLSTFKSAGTDDWYLDSGATQHM FT TRNLELLEDVRDRSGTVVAANKESMEIQACGTATLFPSCFNGEEPVPVSDV FT HFVPDLSTNLLSIAQIVRKGNTVLFNADGCIIRNADGGVVATGVIEDDLFK FT VDQRKNKALACSRVETLETWHKRLGHLNVSSIKKLANGVASGVQINGDGMD FT DCQICPMGKQARLPFSKSGSRAEELLEVIHSDIAGPMELPSLGGCRYYVSF FT IDDKSRRAFVYFLKTKSEDEVLACFKEFHVLAERQTGRKLKTLRTDNGKEY FT VNKGFREYLKRMGIRHQTTNDYTPEQNGMAERFNRTAVERGRCMLFEAKLP FT KPFWAEAVNAAVYLMNRSPTSGHSLTPEEMWSGKKPDLSHVRVFGTKAMVQ FT VPKVKRKKWDSKSRECVLIGFEEDTKGYRLYDLASKTFLKSREVTFINEGL FT GEKAAVTKKKPEVISLDFGTTPKVHAAQQPGLVPPEPDAVEEEQPLDVDDV FT LEDDGHEEPVVDAAVPGPRTDAVVDDQVASSSSAMTNVLPPQNISKPPGSG FT VLGRSGREHILPGKYKDFIVSGKGLPVSTISQALEDDSSDSSEYDDADDVE FT FGGLATGHRDDPLAEPRTYAEAMASPDAERWKQAMVDELESIKANETWTLV FT DLPPDRKAVGSKWVYKIKRDADGRVLRFKARLVAQGFSQQYGTDYDEVFAP FT VVRQTTFRALMAVAAKKRMTVKQYDIKTAFLYGDLEEEIYMRQPRGFEGAK FT GKVCKLKKSLYGLKQAARSWNQKLHDALKRQGFERCVADTCLYRKRRAGRW FT CYVLVYVDDLIVASEDPKMIEALAGALQQNFEMSVLGDIRFYLGMEIEKNA FT QGDYFLSQRKYIGEVVESSGLADAKVSTVPLDPGYVKRESKEEPLPDNKEY FT QKLVGKLLYIAVNTRPDISAAVSILSRKTSRPTQDDWNELKRVVRYLKGTK FT DYRLRLSQNGTDNGIVGFCDADWAENREDRKSNSGFVFKVNGGTVSWACRK FT QSCVSLSTAEAEFVSLSEAVQEALWLKVLLRELNDEQQVVIQEDNQSCLKM FT LSAEKFSNRTKHIATRFHFTKDQIEKGEVSCVYCPTEDMVADLLTKPLARI FT RTEKLAGMIGLTAAV" XX SQ Sequence 4183 BP; 1012 A; 1041 C; 1360 G; 770 T; 0 other; agaggttatg agcccggacg tgcgcgaaga ctagcggaag atgtccgatc ggaattacgg 60 gatcgaacgg ctggtcggcc gggagaactg ggccgattgg aagtttgcgg tccaaaccca 120 cctcgaagtt gaagacctgt gggatgccgt cgaaccgaag gtgaaggcgg atggaacggt 180 ggaagcggtc gacccggcca agagcaagct tgcgcgaggg aagatcatcc agttcctgga 240 accggtaaac tactaccacg ttcgcgaagc gaagacggcg caagaagctt ggaagaatct 300 ggtcaacgcg ttcgaggaga gcggactcac gcgggaggtc ggattgctgc acaagctcat 360 caggaccgat ctggacagct gcggaacgat ggaaatctac gcgaaccaga tcatctcggc 420 gtgccaccag ctgaaaagca tcggattccc gattccagac aggttcgtag gagcgttgct 480 gttggcgggg ctgccggacc ggttcaagcc catggtaatg gcgctcaaga actccggaat 540 ggcgatcacg ggggactccg tgaagacgat tctgctgcag gaagaggacg agctgttgcc 600 aggtgaaaaa actttcgttg ctaagcggcg agtcaacaag ccaggtggtt accgacgcaa 660 cacccaaaca gcgggcggtt tccatggcga cgacaaaaca gcgagcggct ctcgtagcaa 720 ccacggttcg cgaaaaagtc agccggagga acgcggccca cggtgcagga agtgcaacaa 780 aattggccac attgcgaagc actgccccga gaaggatgcc gtgcgtggga agcaagcgtt 840 gtgcacggtg ctgtcaacat tcaagtcggc cgggacggac gactggtacc tggactcagg 900 tgcgacacag cacatgacgc ggaacctgga gctgctggag gacgtgcggg accggagcgg 960 aacggttgta gctgccaaca aggagtcgat ggagatccag gcatgcggaa cggccacgct 1020 cttcccgtcg tgtttcaatg gtgaggaacc tgttccggta agcgatgtgc actttgtccc 1080 cgatctttcg acaaatttgc tgtcgatcgc acaaatcgta cgcaagggca acacggtgct 1140 gttcaacgcg gacggctgca ttatccggaa cgctgatggc ggcgttgttg cgaccggtgt 1200 tatcgaggat gacctgttca aggtggacca gcgtaagaac aaggcgctgg cgtgctcgcg 1260 cgtggagact ctcgagactt ggcacaagcg cctgggtcac ctgaacgtga gcagcatcaa 1320 gaaactggcc aacggtgtag ccagcggtgt gcagatcaac ggagacggca tggacgattg 1380 tcagatttgc ccgatgggga aacaagctcg attgccgttc agcaagagtg gttcccgcgc 1440 cgaggagttg cttgaggtaa ttcattccga tatcgctggt ccgatggaat tgccgtccct 1500 cggcggttgt cgatactacg tttcgtttat cgacgacaag tcccggcgag catttgtgta 1560 cttcctcaag actaagtcgg aggacgaagt actggcttgc ttcaaagaat tccatgttct 1620 ggcggagcgc cagacaggcc ggaagttgaa gacgttgcgt acggacaacg gcaaagagta 1680 cgtcaacaag ggattccggg agtacttgaa gcggatgggc atccgacacc agaccaccaa 1740 cgactacact ccagagcaga acgggatggc cgaacgcttc aaccggacag cggtcgagag 1800 agggcgctgc atgttgttcg aggccaagct cccgaaacca ttctgggctg aggcggtgaa 1860 tgctgcagtt tacctcatga accgatcgcc aaccagtggt cacagtctca caccggagga 1920 gatgtggagc ggaaagaaac cggatctctc tcacgttcgc gtgttcggaa cgaaggcgat 1980 ggtgcaagtg ccgaaagtca agcgcaagaa gtgggattcg aagtcgcgtg agtgtgtgct 2040 gattggtttt gaagaggaca cgaaggggta ccgcctgtac gacctcgcga gcaagacttt 2100 cctgaagagt cgcgaagtga ctttcatcaa cgaaggcttg ggcgagaaag cggctgtcac 2160 gaagaagaaa ccggaggtaa tttcccttga cttcggtaca actccgaagg ttcatgccgc 2220 gcagcagcct ggactggttc cgccggaacc ggatgccgtt gaggaggagc agcctctcga 2280 tgttgatgat gtcctcgaag acgacggcca cgaagagcct gtggtggatg ctgcggtccc 2340 tgggcctagg accgatgctg tggtcgatga ccaagtagct agcagtagtt cggccatgac 2400 caacgtgctc cctccgcaaa acatttccaa accacccggg tctggggtgt tggggcgcag 2460 cggtcgggag cacatccttc caggcaagta caaagacttt atcgtttctg gcaaaggtct 2520 acctgtgtcc accatttctc aggcactcga ggacgattcg agcgactcca gcgaatacga 2580 tgatgccgac gatgttgagt ttggagggct agcaactggc caccgagacg atccacttgc 2640 ggagccgcgg acctacgcgg aggcgatggc gtcgccggac gcggagcgct ggaagcaagc 2700 gatggtcgac gagcttgagt cgatcaaggc caacgagacg tggactctcg tcgacctgcc 2760 cccggatcgg aaggctgttg gctcgaaatg ggtctacaaa atcaagcggg acgcggatgg 2820 tcgcgtgttg cggttcaagg cacggttggt tgcacaagga ttcagccaac aatacgggac 2880 cgactacgac gaggtgttcg cgccggtggt gcgacaaacg acctttcgcg cgctgatggc 2940 tgttgctgcg aagaaacgca tgaccgtgaa gcagtacgac atcaagaccg ccttcctgta 3000 cggcgatctt gaggaggaga tttacatgcg tcagccacgc ggattcgaag gcgccaaagg 3060 caaggtgtgc aagctgaaga agagcctgta cggactgaag caggcggcgc ggtcctggaa 3120 tcagaagctt cacgatgcct tgaagcggca aggattcgag cgatgtgtgg cggacacgtg 3180 cctgtaccgc aagcgacgag ctggacgttg gtgctatgtt cttgtgtacg tggacgactt 3240 gattgttgcg agcgaagatc cgaagatgat cgaggccttg gccggagcgc tgcagcaaaa 3300 cttcgagatg agcgttctgg gagacatccg gttctacctc ggtatggaga tcgagaagaa 3360 cgcacaaggt gattacttcc tgagccaacg gaagtacatc ggagaggtcg tcgagtcgag 3420 tggacttgcg gacgcgaagg tgtccactgt tcccctggat ccggggtacg tcaagcgcga 3480 gtcgaaggag gagcctcttc cggacaacaa ggagtaccag aagttggttg ggaagctgct 3540 gtatattgcc gtcaacacga ggccggacat ctcggcggcg gtgtccatct tgagccgcaa 3600 gaccagcagg ccaacgcaag acgactggaa cgagctgaag cgggtcgtga ggtatctgaa 3660 aggaacaaag gattaccgat tgcgactgag ccagaacgga accgacaacg ggatcgttgg 3720 attttgcgac gccgactggg cggagaaccg agaggacagg aaatctaaca gcgggttcgt 3780 gttcaaggtc aacggcggaa ctgtaagctg ggcatgcagg aagcagagct gtgtctcgct 3840 ttcaaccgcc gaggccgagt ttgtatcgct ttccgaagcg gtgcaggagg cgctgtggct 3900 gaaggtgctt ctgcgagagt tgaacgacga gcagcaagtc gttatccagg aagacaacca 3960 aagctgcctg aagatgttgt ctgccgagaa gttcagcaac aggacgaagc acattgccac 4020 gcgattccac ttcacgaagg accagatcga gaagggggag gtaagctgcg tttactgtcc 4080 aacggaggac atggtggcgg accttctcac gaagccacta gccaggatca ggacggagaa 4140 gctggctggg atgataggct tgaccgcagc agtctaggag gag 4183 // ID MuDR11x_AP repbase; DNA; INV; 2220 BP. XX AC Contig25811; XX DT 25-JUN-2009 (Rel. 14.07, Created) DT 25-JUN-2009 (Rel. 15.12, Last updated, Version 2) XX DE MuDR-type DNA transposon. XX KW MuDR; DNA transposon; Transposable Element; MuDR11x_AP. XX NM MuDR11x_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-2220 RA Jurka J.; RT "DNA transposons from pea aphid."; RL Repbase Reports 9(7), 1360-1360 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS join(944..1549,1553..1708) FT /product="MuDR11x_AP_1p" FT /translation="MSYNSAIDNTDTSTTNAEEFLLINDQETNIIIFSCET FT NLIFLCESHTIFVDGTFEYFPKYFTQLFTIHGMKNNFYVPLVFCLLKNKNS FT ETYTQAFKYIQNKCYEKNLTFDPKNVTDDFEISIHNAILSIWPSTNLIGCR FT FHLRQAWYRKIQELGLSTEYKENGWLRTAFGLTFLDPQEVSESFVEDFMSI FT IPEEFIINMLIILTIISLKKMCFLQQYGPILVQVYPEQQIIVNLSTHILMN FT NFINHIQILIHF*" XX SQ Sequence 2220 BP; 810 A; 340 C; 337 G; 733 T; 0 other; gggcgtcagt cgtttccgcc aaagaacccg atttttgttt ccgccaaaac gaaaaaggtt 60 cgattttcat ttccgccaaa tctatttcta cctatattat tgagctggag tttgagaact 120 tctggtctgt ctgtagtgct atcgaccatc gatgtaccta tgacgataaa ccatttaata 180 tttaagatat cataaagaga taattgattt taaacgtaga aatacctaca taatattata 240 aataaatata aatcgattta acctatatca cgtgtagata tatcaacagt gtctgtgtta 300 tcgggatcgg agttctgtga tcggatctgt cagttctgtg gtaacaacga cgaaattagg 360 agtattaacc aaatattaca acatttagcc gcgtccgagt gaagcgtgaa taatatacgt 420 agattcattt ttgaatatat tggaatttta tattatggaa ttcataacaa gtaaacgtgg 480 aaaaaaaatg actgtgattg atgagttcaa atttagtttt tcttacaaat cggtccaaac 540 aaatatttca aggtatcgct gcttcgtaaa aacttgtacc gcaactattt atgttgatca 600 gttaaatcag ttgatcagta taacaggtat gttaaaaaat gtgtttattt taaatttact 660 ttttcttcga tattttaaaa ctaattagtt acgtattgat tacttgtact gtagaccaca 720 tatgcaatgg gccatgttct aatatcgata ggcaaaaagt gacgactgct tgtaagcgaa 780 aagcagtaaa tgatatcagt gaaagacctc aaaaaataat tttgaatgaa attcaatcga 840 ataacagtgt ttcgaccttt acggtcaatt atgtatcagc tattcgtcaa tgtatatatc 900 gagcacgaaa gcattcattt ccaatttaat ccttcgtcat taaatgagtt acaattctgc 960 aatagataac actgacacat ctaccacaaa tgccgaagaa tttttgttaa ttaacgacca 1020 agaaacaaat atcattatat tttcttgtga aacaaattta atttttctat gcgaatcaca 1080 tacaattttt gtcgacggaa cgtttgaata ttttccaaaa tattttactc agttatttac 1140 cattcacggc atgaagaaca atttttacgt acctcttgtt ttttgtttat tgaagaataa 1200 aaactctgaa acttatacac aagcttttaa atatattcaa aataaatgtt atgaaaaaaa 1260 tttgactttt gacccaaaaa atgtaacaga cgatttcgaa attagtattc ataatgctat 1320 attatctatt tggccttcaa ctaatcttat tggctgccgt tttcatttaa gacaagcgtg 1380 gtatcgtaaa attcaagagt taggacttag tacagaatat aaagaaaacg ggtggctaag 1440 gactgcattt ggattaacat ttttagatcc tcaagaagtt tcagaatcat ttgtcgaaga 1500 ttttatgtct ataataccag aagaatttat tataaatatg ctgattattt aactgacaat 1560 tatatctctg aaaaaaatgt gtttcctcca acaatatggt ccaattttag tgcaagtata 1620 tccagaacaa caaataattg tgaatctttc cacgcacatt ttaatgaaca attttataaa 1680 tcacatccaa atattaatac atttttaaaa attttaatct cgaatgttca aaacaacgta 1740 tacattcaaa ttaacagttg caacgtaggt atgcaaaaac ctgtaagatg taatataaaa 1800 aagcgattga taagtacaac aaacgcaata gacgaataca aaaataaaac aatttcaaga 1860 atgacatttt taaaaaaggt atgcaataat tattctaaat aataattaac aattaattat 1920 agaagtataa aactgtcaga aataattgat ttactatata tatacatata atataacaca 1980 aattgactta acctaattca aattcaaata ttttaacttc aattaaaaaa tgaagtcaaa 2040 caattttttg ggggtcaatg agaaccgcct cctcgtggtg gcccctgggt aactctaaat 2100 ggtcctcaaa ctatagttta gaccggtagg tagaaataga tttggcggaa atgaaaatcg 2160 aacctttttc gttttggcgg aaacgaaaat cgagtttttt ggcggaaacg atgatcgccc 2220 // ID BEL-627_AA-I repbase; DNA; INV; 6735 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-627_AA_; KW BEL-627_AA-LTR; Pao_Bel_Ele182; BEL-627_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-6735 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [5680-6258] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1312..6651 FT /product="BEL-627_AA-I_1p" FT /translation="MIAMKPDVKKSPSMRALTARLKEIQLSFGDIDRFAQL FT FKETTTSTEIEIRLSKLDEFWESFSEALVDIFSHDDYNPDSTSLERERIEF FT SDRYYQLKSFLSDKLKDRQRPQELENSIRGGDASAHGNMDHVRLPQITLQP FT FNGNIDEWLSFRDLFTSLIHWKAELPEVEKFHYLKGCLQGEPRALIDPLPI FT TKANYQVAWDMLLKRYNNSKQLKKRQVQALFKLPTLAKESGSELHILLESF FT ERIVQTLDQVIQPVDYKDLLLVNILTTRLDPVTRRGWEEVSATKEQDTLED FT LREFLQRRIQVLDCLPAKSTDTRGVTSIPQQSKSKSQPMKTSYSSTQAPRT FT RCVVCSSDHFLFHCKEFQRLAVTDRDALLKSNGLCRNCFRPGHLARDCPSK FT YSCKNCRGRHHTLVCFKSEREKDALVAAVAGSTKPSTSQNHQESSSSSSVQ FT VSNIASTEPVVSATSHQQSGQVLLATAVVLIEDDSGNRLPARALLDSGSES FT NFITERLSQQMKVCRDRVDISVLGIGQTGTKVKHRIHAVIRSRLSTFAREM FT DFLVLPKVTVNLPTTTTNTDNWAFPSGIQLADPAFFERNGVDLVLGIEHFF FT DFFQTGRRISLGVGNPTLNESVFGWIVSGGISAPNRSLQISCNVSTMDDLD FT ALISRFWSCEEVETGKALSLEEKRCEDNFSRTVRRGSDGRYTVSLPKQDDG FT ISRLGDSRDIAYRRLQGTERRLARDPLLRQQYHDFMEEYAKLGHMRLVKGS FT ELETIKRCYLPHHPVIKEASTTTKVRVVFDASCKTATGASLNDILLTGPIV FT QEDLRSIILRSRTKQIMLVSDVEKMFRQIYVDTKDRPLQSILWRNSPSDIA FT LTYELNTVTYGTKPAPFLATRTLQQLAFDEEKNFPLAASAATDDTYMDDVI FT TGADNAEMAIEMRKQLDEMSSSGGFRLRKWASNCPAVLEGVAEEDRAIRTS FT EGIVLDPDPSIRTLGLTWMPGTDTLRFQFAIPSLIPDKPLTKRYVLSVIAS FT LFDPLGLLGAAITSAKIFMQLLWTLHDDHNQRLGWDQPLPSTVGESWRRLH FT VQLPLFNEIRINRCVIIPDAALIELHCFSDASEKAYGACLYVRSENLAGEV FT LVRLLTSKSKVAPLKCQTIPRLELCGALLAAQLFEKAAQSIRFTGNSYFWT FT DSTCVLRWIVATPTTWTTFVANRVAKIQSATQNSRWSHVPGVQNPADLISR FT GISPENIVKNTLWWQGPSWLQEKEENWPLLPEGLSAGAEDEERRRTVVVTA FT SSTVSEFYQAYFEKISSYNDLVRRLAYWLRFMRILQTPKEERRNYSFLSTM FT ELKQAENTLVRLVQREFFVTEWKALSSKTDVPRGSPLRWFNPFIDEDNLIR FT LGGRLKHCMESEEAKHPVVLPARHHFTRLLLRHYHERLIHAGPQLMLSVVR FT LRFWPLGGRSVVKQIVHQCLKCFRTKPKTVQQLMGDLPTSRVTVSRPFLRT FT GVDYFGPLYVRPAPRRPAVKAYVAIFICMSTKAVHMELVSDLSTDRFLQAL FT RRFISRRGRCTDMFSDNGTNFVGARNKMQEVLKMLQDCNYHESITRELAKE FT GIQWHFNPPSAPHFGGLWESAVRSAKHHLVKVIGDTPVTPEDMCTLLAQVE FT ACLNSRPLTPLSDDPNDLQPLTPAHFLIGESLHALPEPDFTSILPNRLNLL FT QLTQRRLQDFWKRWKREYLCQLQGRAKRWKSAIPIDVGTLVVIHDDNLPPL FT RWKMGRITQVHPGDDGTVRVVTLKTATGLLTRPVEKICILPLPIDDENNEL FT EPVTITKSK" XX SQ Sequence 6735 BP; 1816 A; 1563 C; 1618 G; 1733 T; 5 other; ttttggtcct tcgagccgga ttctggacaa tacacgaatg ttggcaattc ccgccggcat 60 taagtcgata caatttggtt tggaacgctt tacaacgcca tagtgatacc tcgcggtaga 120 ggtcactgca ctaaaggtca ttgtatgcac atgaacaaca ccgctgccat tggacatctg 180 gacaaagcat tacctttcgg gatggcgccg cttggacaga agcgccattg atgcaagctt 240 cgtttggctt cgcttgggat agcaattgtg gaactggact tcaactgaag tgttgcatgc 300 tacaacatca gcgaggttat tgcatgatac tcgctagtaa cgggaccact caacggattc 360 tgctatttgg attggatcaa aggtcgtcgc ctgaaggaat tcacgtttta ccggaaagga 420 gatcgcctct tcgaaccgct ggaccggcgc aacgctttct ttctcaggta aataagatag 480 aatagtgtat gtccagccga agcgtaggct ttctatatac taataccaac aattsgacct 540 atttttccgg acctgtgacc acctgacctg cgacaaggga agtcgcctcc gttgatgtgg 600 attggcattc cgccggaatc gaggatttac gtttacaacc aaggaagatt ggacgatatt 660 tcggactttg gattcttcga gatcgcctac actagggtaa tctggccaaa caactacagt 720 aactacacta ccactggttt tcaaggatcg tcggcaagaa gagggagcgt catcaaaggt 780 tttctcgggt gagtgaactt tttgtatact gccgaagcta gacgcagaaa atacaccatt 840 ttgaatgaaa tctatcctct tcgactttgc cattttcctg ctgagccctg aatgctcttc 900 tgagcgaccc attcttgaaa tttggawtag ttattcggac ttcgccggga tttgcaccgc 960 tgctggtttc tccaggtgag ttgccgcagt atatgcctgg ccgaagccgg aaactttcgg 1020 ttactacacc aacatttctc ctcttcgcct tctcctcgct cgagtcatcg ctgaatttgg 1080 tcggttattg agttggcacg acgctgatta cctggaataa gccgttgcgt tgtattggac 1140 ccttgccatc tggttgtcga ttgattgtcg tttcgtgcgt gttggattct tggagtagca 1200 gtactcgttt ctccgggtaa atactagcag tatatgccta gccgtggcta ggtctaccgt 1260 cagcatacta cacctcgcct ctctctctct ttgtgtcgcc aacacgtacg aatgatmgcc 1320 atgaaacctg acgtcaagaa atctccgtca atgagggcat tgacagccag attgaaggaa 1380 atccagctat ccttcggtga catcgatcgg tttgcgcaat tgttcaagga aactaccacg 1440 tccaccgaaa tagagattcg cctgagtaag ttagacgagt tttgggaaag cttcagtgaa 1500 gcwttagtcg atattttctc ccacgatgat tacaatccgg atagcacttc actggaaagg 1560 gagcgcatcg aattcagtga ccgttattac cagttgaagt cctttttgtc ggataagctt 1620 aaggatcggc agcggccaca agaattggag aattccattc gaggaggaga cgcatcagcg 1680 cacggaaaca tggatcacgt gcgcctgccg cagatcacgc tacaaccatt caacggaaac 1740 atcgacgaat ggttgagttt tcgggacctt ttcacctcgc ttatccattg gaaggcggaa 1800 ctaccggagg tggaaaagtt ccattatttg aaggggtgcc ttcaaggtga accgagagca 1860 ctwatagatc ctctcccaat caccaaggcc aactaccagg tggcctggga tatgctgttg 1920 aagcgttaca ataacagtaa gcagctaaaa aagcgacaag tacaagccct gttcaagttg 1980 cccactttag ctaaggaatc cggatcagaa ttacacatcc ttctggaaag cttcgagaga 2040 attgtacaga ctcttgatca agtcattcag ccagttgatt acaaggatct gctcttggtg 2100 aatattctta cgacccgttt agatccagtt actcggaggg ggtgggaaga ggtgtcagcc 2160 acgaaggaac aggacacgct cgaggatttg cgtgaatttt tgcagcgcag gattcaggtt 2220 ttggactgct tgccagcaaa atcaactgac actaggggtg tcacatcgat accacagcaa 2280 tcgaaatcta aatcgcagcc gatgaagacc agctatagct ccacccaagc acctaggact 2340 cgctgtgtcg tttgttcatc ggaccatttt ctgttccact gtaaggaatt tcaacggttg 2400 gcagttactg atagagatgc tctgctaaaa tcaaacggac tctgtcggaa ttgtttccgt 2460 cctggacatc tggctagaga ttgtccttca aaatacagct gtaaaaattg tagaggtcgt 2520 catcacacac tggtatgctt caagtcggaa agagagaagg atgccctcgt tgcggcagtt 2580 gctggaagca cgaagccgtc tacttcgcag aatcatcagg agtcatccag ttcatcttcc 2640 gttcaagtat ccaacattgc atccacggaa cccgtagtgt ccgctacatc tcatcaacag 2700 tcagggcaag ttttgttggc tacagcagtt gtattaatcg aggacgacag cggaaatcgt 2760 ctacctgctc gcgctctgtt ggattctggg tcggagtcca atttcatcac ggagaggttg 2820 agtcagcaga tgaaggtctg ccgagatcga gtggacattt ctgttctcgg aattggacag 2880 actggaacca aggtaaaaca caggatacat gcagtgataa ggtcacgctt gtctacgttt 2940 gcacgcgaga tggatttctt ggttctccca aaggtaacgg tgaaccttcc aaccaccacc 3000 actaatacgg ataattgggc gtttccgagc gggatccaac tagcagatcc tgcattcttc 3060 gaaagaaacg gtgtggatct cgtgctgggc atcgaacatt tcttcgattt cttccaaacc 3120 ggtcggagga tttctctggg cgtaggaaat cctactctta acgaatctgt attcggatgg 3180 atagtgagcg gcggtatttc ggctccgaat cgatcgctac aaatcagttg caatgtttca 3240 acgatggatg acttagatgc actgatctct cggttctggt cctgcgagga agtagaaaca 3300 ggaaaggcgc tctcattgga agagaagcgg tgcgaagaca atttttcacg aacagttcgc 3360 cggggttcag acggtcggta caccgtctca ttaccaaaac aggacgatgg catctcacgg 3420 ttgggcgatt caagggatat cgcataccga cgtctgcaag gaactgaacg tcgactggct 3480 cgggatccac tgcttcgaca acagtatcac gatttcatgg aagaatacgc caaattgggt 3540 cacatgcgat tagtgaaagg atctgaattg gaaacaataa agcggtgcta tttaccgcat 3600 caccctgtta tcaaagaggc aagcactact accaaggtac gggtggtgtt tgatgcttcc 3660 tgcaaaacgg cgaccggtgc atcgctcaac gatatactgc tgacagggcc aatcgttcag 3720 gaggacctcc ggtctatcat tctacgtagc cgaacaaaac aaattatgct tgtttcggac 3780 gtagagaaaa tgtttcggca aatctacgtt gatactaagg accgtccgtt gcagtccatt 3840 ctgtggcgca attcacccag cgatattgcc ctcacctacg agcttaacac agtaacctac 3900 ggaactaaac cagctccgtt tctggccact aggacactgc aacagttggc tttcgacgaa 3960 gagaaaaatt tcccactcgc agcaagtgct gcgaccgatg atacctatat ggacgacgtt 4020 attacaggcg ctgataatgc agaaatggcg attgaaatga gaaagcagtt ggacgaaatg 4080 tcttcaagtg gtggattccg tttacgaaaa tgggcgtcta actgtccagc tgtgttggaa 4140 ggtgtggcag aggaagatcg agctattcgt acatcggaag gaatcgtatt agatccagat 4200 ccctcaatca gaacgttggg tttgacttgg atgcctggta cggatactct gagattccag 4260 ttcgcaattc caagtttgat tcctgacaaa cctctcacca aacgatatgt actatcggtg 4320 atagcttctc tattcgatcc cttgggactt ttaggagcag caataacgtc agcaaaaatc 4380 ttcatgcagt tgctgtggac acttcatgac gaccataatc aacgtctggg ctgggaccag 4440 ccacttcctt cgacggtggg tgagtcttgg aggagattac atgtgcagct tccattattc 4500 aacgaaatac gaatcaaccg ttgcgtcatt attcctgatg ccgccttgat agaactgcat 4560 tgtttctcag atgcatcaga gaaggcatat ggagcttgcc tatacgtaag aagcgagaat 4620 ttggctggag aggtgctagt tcgtcttcta acttccaagt ctaaggtggc accgttaaag 4680 tgccaaacga tacctcggtt ggaactgtgt ggagcgctct tggcagcaca attattcgaa 4740 aaggctgccc aatctattcg attcacgggc aactcctatt tctggacgga ttcaacatgc 4800 gtacttcgtt ggattgttgc aacccctact acatggacca ctttcgtcgc caaccgagtg 4860 gcaaaaatac aatccgctac ccaaaactct agatggagtc acgttccggg tgttcaaaac 4920 ccagcagatc tcatatctag gggaatttca ccagaaaaca ttgtaaaaaa tactttgtgg 4980 tggcaaggtc catcgtggct acaagagaag gaagaaaatt ggccccttct accagaagga 5040 ttatcagcag gagccgaaga cgaagaaaga cggcgtacgg tagtagtcac cgcaagttca 5100 acagtttcgg agttttatca agcgtacttt gaaaaaattt catcgtacaa tgatcttgtt 5160 cgtcgccttg catattggtt acgttttatg agaatccttc aaactccaaa ggaagaaaga 5220 agaaactact cgtttttgtc cacgatggaa ctgaaacagg ctgaaaatac tttggttcga 5280 ttagtgcaaa gggagttttt tgtaacggaa tggaaggcat tgtcttctaa aactgatgtt 5340 cctagaggat cgccgttgag gtggttcaat ccatttatag atgaagacaa tttgattcgg 5400 ctgggcggcc ggcttaaaca ttgtatggaa tccgaggaag ccaagcatcc agtagtactt 5460 ccagccagac atcacttcac tcgtttgtta ttacgtcatt atcatgaacg attgatccac 5520 gctggaccac aactaatgct aagtgtcgtt cgactacgtt tctggccact aggaggaaga 5580 agcgtagtaa agcagatcgt ccaccaatgt ctgaaatgct ttcgtacgaa gcctaaaaca 5640 gttcagcagt tgatggggga tctacctaca tcacgagtaa cggtatcacg accatttctt 5700 cgaacaggcg tggactactt tggacctctc tacgttagac ctgctccaag acgtccggca 5760 gtcaaggcat acgttgcaat tttcatttgt atgagtacga aggcagtcca tatggagctt 5820 gtctccgatc tttcaaccga tagatttctt caagcgctaa gacggtttat ctcaagaaga 5880 ggacggtgca cagatatgtt ttccgacaac ggtactaatt ttgtcggtgc gcgaaataaa 5940 atgcaagaag tattaaaaat gctccaggac tgcaattatc atgaatcaat cacgagagaa 6000 ttagcaaagg aggggataca gtggcatttc aacccgccca gtgcgccaca ttttggcgga 6060 ttgtgggagt ccgctgtacg atcagcaaag catcatctcg tgaaggtaat tggtgatact 6120 ccagtgacac ctgaggacat gtgtacgttg ttagcacaag ttgaagcttg cctaaatagt 6180 cgccctctca cccccttgtc cgatgatccg aatgatctcc agccgttaac acccgcccac 6240 tttttgattg gagaatcatt gcacgcgttg cctgagccag atttcaccag cattctgcct 6300 aatcggctta atctgttaca actaacccaa cggagattgc aggacttctg gaagcgctgg 6360 aaacgagaat acttatgcca gctgcaaggt cgagccaaaa ggtggaaatc cgccatacca 6420 attgacgtag gaacgctcgt agtaatccac gacgacaatt taccacctct acgttggaaa 6480 atggggcgaa tcacccaggt acatccggga gatgatggaa ctgtgcgagt ggtaaccctc 6540 aaaaccgcta caggcctgtt aactcgtcca gtcgaaaaaa tttgcatatt gccactccca 6600 atcgacgatg agaacaacga attggaacca gtcacgataa ccaaatccaa atgatcagtc 6660 ctttccctcc ctataccgcg aagaggattt tttatctatt tacagaaact tatgaacttt 6720 ctgggtgggt gagaa 6735 // ID Neptune2_Ap repbase; DNA; INV; 1579 BP. XX AC . XX DT 27-DEC-2006 (Rel. 11.12, Created) DT 27-DEC-2006 (Rel. 11.12, Last updated, Version 1) XX DE Neptune2_Ap is a Penelope-like retrotransposon. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; Neptune2_Ap. XX OS Acropora palmata OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; OC Scleractinia; Astrocoeniina; Acroporidae; Acropora. XX RN [1] RP 1-1579 RA Arkhipova I.R.; RT "Distribution and Phylogeny of Penelope-like elements in RT Eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX DR [1] (Consensus) XX CC Neptune2_Ap is a Penelope-like element (PLE) from the elkhorn CC coral, Acropora palmata. It belongs to the Neptune group of PLEs. CC Its incomplete ORF contains homology to reverse transcriptases, CC and is truncated at both ends. The element is apparently CC inactive, its copies are 80-95% identical. Consensus sequence was CC assembled from trace archives; a complete copy could not be CC assembled because of insufficient coverage. XX FH Key Location/Qualifiers FT CDS 1..1542 FT /product="Neptune2_Ap_1p" FT /translation="ANTRPVVDSENIDKYRNTAKKIGKYRNTASKVDEIPK FT LQKRGIRRNLLKDFDLFARRMRLKDIFSRERNKQHPFHVKSTWEPPVQQSV FT ALETFLEEVEFELANSPSKRPKDNLSPGERRALHNLLGDKTIIVKKADKGT FT TTVIMSREQKIKEGQILLNDLDNYRPLEKQMADETVEKIKQLTTSMLTESH FT IDEMTVKWLSQTPNPPRIPEFYTLTKIHRPTLVGRPIISGCDGPTERISCF FT VDRLIQPIAQQQESYLKDSKDFINFIENTKLPKNTILASKDVTSLYTNIPQ FT EEGITTVCKAYEDFYKNRLPIPTNFLRRMLCLILKENSFQFNKRHYLQTHG FT TAMGTKMAVAFANIFMAKIEKGIISKSIIKPLVWKRYIDDVFCLWDTNEDN FT IKEFVTRANHYHDNIKFTAEISDSEIAFLGHKSVQRREIQQRLPPSMCKRI FT INKQRPFNTRNFYSCHPPGVKKGFIKGEALHLLRTNSSHSTFNKNMQSFKT FT RLKNRGYPNEFLEKKGP*" XX SQ Sequence 1579 BP; 613 A; 339 C; 282 G; 345 T; 0 other; gcgaatactc gccctgtggt ggattcagaa aatatcgaca aataccgtaa taccgcaaaa 60 aaaatcggca aatatcgaaa taccgcatcg aaagtcgatg aaataccgaa attgcaaaaa 120 cgtggtataa ggcgcaacct tttaaaagac tttgaccttt ttgcaagacg aatgcgcctc 180 aaagacattt tttcacggga aagaaacaaa caacaccctt tccacgttaa atcaacatgg 240 gaaccaccgg ttcaacagtc agtggccttg gaaaccttcc tagaggaggt ggaatttgaa 300 ctggcgaaca gccctagtaa gaggccaaaa gacaacctat cacccggtga aaggcgtgca 360 ttacataacc tattaggtga caaaactatc attgtaaaga aagccgataa aggaaccact 420 accgtaataa tgagcagaga acaaaaaatt aaagagggtc aaattctgtt gaacgatctg 480 gataactaca ggcccttaga aaaacaaatg gctgacgaaa cagttgaaaa aataaaacaa 540 ctaacaacgt ctatgctcac agaaagccac attgacgaaa tgacagtaaa atggctctcc 600 caaacaccaa atccacctag aattccagaa ttctatacac taacaaagat tcacagacct 660 actttagtag gcagacctat aatctctggg tgtgacggcc ccacagagag aatttcatgc 720 tttgtcgacc gcctcataca gccgatagca caacaacaag aatcttatct gaaagattca 780 aaagatttta taaacttcat tgaaaacaca aaactgccaa agaacactat ccttgcctca 840 aaggacgtta ctagcttata cacaaacatt cctcaagaag aagggattac aactgtatgc 900 aaagcatacg aagacttcta taaaaaccgt ttaccaatcc ctactaattt cttgaggcga 960 atgctctgcc taatacttaa agagaattca tttcaattta ataaacgaca ttatctacaa 1020 acccacggaa cggctatggg aaccaaaatg gccgtggcct ttgcaaatat tttcatggca 1080 aaaatagaaa aaggcattat cagcaagagc ataattaaac cgctagtttg gaagagatat 1140 attgacgatg tcttctgtct gtgggacaca aacgaagaca atatcaagga atttgttaca 1200 agggcaaacc actaccacga taatataaaa tttacggctg aaatatcaga ctcagaaatt 1260 gcattcttgg gacacaaaag tgtacaaagg cgagagattc aacagagact cccaccctcg 1320 atgtgcaaac gcattataaa caaacagaga cctttcaata cacgaaattt ttattcgtgt 1380 catccaccag gcgtaaagaa aggattcata aaaggagagg cgctgcacct gctaagaaca 1440 aattcgtcac actcgacgtt taacaaaaac atgcagagtt tcaaaacacg cctaaagaac 1500 agaggatacc caaatgagtt tttagagaaa aaaggccctt gaaacaaaga caaaagcaca 1560 aaaaataaaa tattgcctt 1579 // ID L2B-7_AAe repbase; DNA; INV; 4443 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L2B non-LTR retrotransposon from Aedes aegypti. XX KW L2B; Non-LTR Retrotransposon; Transposable Element; L2B-7_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4443 RA Kojima K.K. and Jurka J.; RT "L2B clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1412-1412 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 30 sequences with >92% CC identity. Closely related to CR1-1_AG and L2B-1_CP. XX FH Key Location/Qualifiers FT CDS 48..1424 FT /product="L2B-7_AAe_1p" FT /translation="MYVCDTCLALHEYDDRMNIEKKIDVMLEKIGEMEQKF FT SFFKDFDGKVRALFKEELNKLCTSTNVSGEISLSDTPRYNLRSNKNKTGSE FT NTGRNVENTPKSGIVNEGMVSYSDVVKNNVDSVVESSLSAGQKGEWRAVSR FT KVYRSAGKTPMSDMRREKSSTNIVVENHKVNGRNAIPGSSSSFATKNRMND FT SNESKKKPNPRVIIKPRNSDDQRNTKKVLSEKLKTCSVKVNDVITRHDGAV FT IVELNDENSSETFKQAVKSAMGDGYEVEMSQPFRPMVKLLGITEELNKDEL FT KRSLLDHNEILNDVKHLKVLKIYSVNGLFNAIIEVDAYTYGKLMERGKLIC FT EWDRCRVVDGIDVLRCYKCCAFNHKGADCSARGITCPRCAGSHSIQQCDSV FT EYKCANCIRMLKSGRQGICSSLVDKKYFVPTMLVFICVDHAVWSEQCPVYR FT RMKDKKKEKIDFEA" FT CDS 1428..4232 FT /product="L2B-7_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="QSKSDILYANCAISSHIEEIRLIIRQHNPNIFVITET FT HLTDKHNVNLFDIKGYDMINCLSNSSHTGGVTFYVKNNIKWVVHSNESCSG FT NWFLTIYVSVGVMKGMYGGLYHSPNANDATFLMYLEEEWLPRVLEHNGRIT FT VIGDFNINWLENMDRRELKNLMDSLSLHQFISFPTRVNHRSNTMIDLLFSN FT YDIPQVTELPSDKISDHATISVLVSSIADKSTPIRTKIRSWKKYSKVRLQS FT ELRWKLRDFHQCSNIDESANMLAEALEVTVNNMVELKEITVTETAPWYTSS FT LHGMKLCRDEAHFKFRTSRSQEDWKEYQRKRNNYVNAIRDAKKTAIQQELR FT ASQSDSKKLWKSLKKLINPAKGTDNGVKFSDIDEDLGDKAKADKLNEYFID FT SVKKIHDSILECGSSDTGNDNLNPNYRWSTFKPIDMSRLRNTVLNLKKCGG FT INNVSTSVLIDAFEAIKHEFLILINKSLSRGEFPSGWKKSLVVPLPKVPNS FT QKPEDRRPINMLPVYEKVLEVLVKEDLMDFVNKYDILLPEQSGFRKQHSCE FT TALNLLLFKWKKALEDKKVIMTIFLDLKRAFETIDRKKLINTLGKLGIGGK FT VIEWFRSYLTMRMQCTLYRGQESECRRVDLGVPQGSVLGPILFILYINDVK FT DVVTLGNLNLFADDTVLFVIADSIEAAYAVMRVELNKLTKWLKIKKLKLNV FT TKTKYMVITNKSRNGYLDTLSIDGEVVEQVKLVKYLGVIVDEKLNFNEHID FT YIIRKAASKYGMLCKINKYLTFDNKIMIYKTLIAPHFEYCASILFLASKRQ FT MRRMQTVQNRTMRLILGCDRKTSSQLMREVLQWMSVSERITFRTLTLVFKI FT RNGLTPKYLTENVQYGSDVHRYATRRANDFRLPNLTMTGTQNSLFFKGFRL FT FNSLPDEVKYETNENRFKRKCKPLIKNLMET" XX SQ Sequence 4443 BP; 1634 A; 625 C; 966 G; 1218 T; 0 other; gtgtcaacga agatgagtgg aaggtgctaa aaggaataaa aaatgcgatg tatgtttgtg 60 atacgtgttt ggctttacac gaatatgatg atcgtatgaa tattgagaaa aaaattgacg 120 ttatgctgga aaaaattggg gagatggaac aaaagtttag tttcttcaaa gacttcgacg 180 gaaaagtccg agcactgttt aaagaagaac taaataaatt gtgtacatca acgaatgtga 240 gtggtgaaat ctcattgagt gatacaccgc gatacaattt gcgctcaaat aaaaacaaaa 300 ctgggtcaga gaacactggg agaaatgtag aaaatacgcc caaaagtgga atcgtaaacg 360 agggcatggt ttcatatagt gacgttgtta aaaataatgt agattctgtg gttgaatcat 420 ctttaagtgc cggtcaaaaa ggtgaatgga gggctgttag taggaaggtt tatcgaagtg 480 cgggtaaaac acccatgtcg gatatgcgac gtgaaaaatc aagtacgaat atcgttgtgg 540 agaatcacaa agtgaatggt cgcaatgcaa ttccaggatc atcaagctct tttgcaacga 600 aaaatcgtat gaatgattca aatgaatcga aaaagaaacc aaacccgaga gtgattatca 660 aaccgcgtaa ttcggacgat caaagaaata cgaagaaagt gttgagcgaa aaacttaaga 720 cgtgttccgt aaaagtcaat gacgtaatca ctcgtcacga tggtgcggta attgttgagt 780 taaacgatga aaattcaagt gaaacgttca agcaagctgt aaaaagcgct atgggagacg 840 gatatgaggt agaaatgtct cagccattta gacctatggt aaaattacta ggtataactg 900 aagagcttaa taaagatgaa cttaagagat ctctcttgga tcataacgaa atattgaatg 960 atgtgaagca cctaaaagtg ttgaaaattt attcagtcaa tgggctgttc aatgctatta 1020 tcgaagttga tgcttataca tatggtaaac ttatggagcg cggaaaatta atctgtgaat 1080 gggatcgttg tcgtgttgtc gatggcattg atgtgcttag gtgttataag tgttgtgcat 1140 tcaatcataa aggagctgat tgtagtgcta gagggataac atgtccccga tgtgctggtt 1200 ctcattcaat tcaacaatgt gattctgtag agtataaatg tgctaactgt atcagaatgt 1260 tgaaatcagg caggcagggt atctgttcat ctttagtcga taaaaaatat tttgtgccaa 1320 caatgttagt gtttatctgt gtggatcatg ctgtctggag tgagcaatgt ccagtgtatc 1380 gaagaatgaa agataaaaag aaagaaaaga tagattttga agcatagcaa tcgaaaagtg 1440 atattttgta tgcaaattgt gctattagta gtcacattga agaaataaga ttgataatac 1500 gacaacacaa tccaaatatt tttgtcatta ctgagactca cttgacagat aaacataatg 1560 tgaatttgtt tgatataaag ggatatgata tgatcaattg cttgtctaat tcgagccata 1620 ctggtggagt gacattttat gtaaaaaata atattaaatg ggttgtgcat tctaatgaat 1680 cgtgtagtgg aaattggttt ttgaccattt atgtatctgt tggagttatg aaaggtatgt 1740 atggtggttt atatcattca cctaatgcta atgatgcaac gtttctcatg taccttgaag 1800 aggaatggtt gcccagggta cttgaacata atggtagaat cacagtcatt ggtgacttta 1860 acatcaactg gttagaaaat atggatagaa gagaacttaa aaatctaatg gactcattat 1920 cactgcatca gtttataagt tttcctacaa gagtaaacca tagaagtaat acaatgattg 1980 accttttgtt tagtaattac gatataccac aagtgacgga attaccatca gacaaaatat 2040 ctgaccatgc aacaataagt gttttagtga gtagcatagc tgacaaatcg acacctatta 2100 gaactaaaat cagaagttgg aaaaaatatt ctaaagtacg tttgcaatcc gagctccgtt 2160 ggaaattgag agattttcat caatgtagta atattgacga gtctgcaaac atgttggctg 2220 aagcgttaga agttactgta aataatatgg tagagctgaa agagattact gttactgaga 2280 cagcaccgtg gtacacgagt tcattacatg ggatgaaatt gtgtagggac gaagctcatt 2340 ttaaatttag aacatcgcgt agtcaagaag attggaaaga ataccagaga aaaagaaata 2400 attatgtcaa tgcaattcgt gatgctaaga agacggctat acaacaggag ctaagagcaa 2460 gtcaaagtga ctctaaaaag ttatggaaat ctttgaaaaa attaataaac ccagctaaag 2520 gcacggataa tggtgtaaaa ttttcagata ttgatgaaga tcttggtgac aaagcaaaag 2580 ctgataaatt aaatgaatac tttattgata gtgtcaagaa gattcacgac agtattctag 2640 agtgtggatc aagtgatact ggaaacgaca atctaaatcc aaattatagg tggtctactt 2700 ttaaaccaat cgatatgtca cgactgagaa acacagtact aaatctgaaa aaatgtggtg 2760 gaatcaataa cgtcagtaca tcagtattaa ttgatgcatt cgaagcaatc aaacatgagt 2820 ttctcatcct cattaacaaa tcccttagca gaggagagtt cccatcggga tggaaaaaat 2880 cattagttgt accactgcca aaagttccca attctcaaaa acccgaggac agaagaccaa 2940 taaacatgtt accagtatat gaaaaggttt tggaggtgtt agtgaaggaa gacttgatgg 3000 attttgtgaa caaatatgac attttgttgc cagagcaatc aggtttcaga aaacaacatt 3060 cttgtgaaac ggctttgaat ttgctactgt tcaaatggaa aaaagctcta gaggataaga 3120 aagttattat gacgatattt ttagatttaa aaagagcctt cgaaacaatt gacaggaaaa 3180 aattgatcaa tactctagga aaactaggaa ttggtggaaa ggttattgaa tggtttagaa 3240 gttatttgac tatgagaatg cagtgtactt tatatagagg acaggaatcg gagtgcagaa 3300 gagttgacct aggggttcca caaggtagcg tgctagggcc gattcttttt attttataca 3360 taaacgatgt caaggatgta gtaacgttgg gtaatttaaa tttgtttgct gatgatacag 3420 tattgtttgt tattgcagac agcattgaag cagcatatgc ggtaatgagg gttgaactaa 3480 ataagttaac gaaatggttg aagatcaaga agttgaagtt aaatgtaaca aaaactaagt 3540 atatggtaat aacaaacaaa tctaggaacg gatatttgga tacattgtct attgatggtg 3600 aggttgtgga gcaagtaaaa ctggtgaaat acttaggagt tatagttgac gagaaactca 3660 atttcaacga acatattgac tacataatac gtaaagcagc atcaaagtat gggatgttat 3720 gtaaaatcaa caaatatctg acattcgata acaaaataat gatatataaa actttaatcg 3780 ccccacattt cgaatactgc gcttcgatac tgtttctggc ttcaaaacga caaatgagaa 3840 ggatgcaaac tgtgcagaac cggacaatga gattaatatt aggatgtgac cgaaagactt 3900 caagtcaact aatgagagaa gttctacaat ggatgtcagt atcagagaga ataaccttcc 3960 gaacactcac acttgttttt aagataagaa atggattgac accgaaatat ttgacagaga 4020 acgtgcaata cggatcagac gtgcatcgat atgcgacacg aagagcaaat gactttcgac 4080 taccaaattt gacaatgacg ggaacacaaa attcattatt tttcaaagga tttagacttt 4140 tcaactcact tccagatgaa gtaaagtatg agactaatga gaatcgattc aagcggaaat 4200 gtaaaccatt aataaaaaat ctgatggaaa cttaagaaaa caagacgatg gacatacgcg 4260 ggagccagac gaataaaaaa aacggataga agacgatgac acatgaggac actacgaata 4320 aaatatcaaa ttgataaacc aagtcttgta agcctcccct ttgcaaaaaa gcgtatgggg 4380 tggaggagga gttcgaaagt taaaactcct atgggaccat caagaaaaaa aaaaaaaaaa 4440 aaa 4443 // ID Gypsy-29_DPu-LTR repbase; DNA; INV; 333 BP. XX AC scaffold_68; XX DT 09-FEB-2011 (Rel. 16.02, Created) DT 09-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Daphnia: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_DP_; KW Gypsy-29_DPu-I; Gypsy-29_DPu-LTR. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-333 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Direct Submission to RU (16-DEC-2010). XX RN [2] RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A. et al.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331(6017), 555-561 (2011). XX DR Genome; scaffold_68; Positions 629040 629372. XX SQ Sequence 333 BP; 97 A; 82 C; 57 G; 97 T; 0 other; tgtaatggga gcaatcgcca taaggcgact agtatcaaga acgagcgcag cctcgggact 60 gatatctaat tcattgttaa cgacatgatc gcgtgtctag gctgtaagat cttagacttt 120 ctataatatt agaaagtcgc cagttataac ctacccacat tcataggtat ataagcgcaa 180 ctccatcaga gttcgcgcac gcattcgtac acctttgtat ctccagtatt attgtatgta 240 ccgtcaatgt tagtactcta atacaaagtg catcatccca acgaatcgtc atctctactt 300 aatcatcttt accctccgta accatccgtt aca 333 // ID Vingi-1_Pp repbase; DNA; INV; 2912 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.02, Created) DT 17-AUG-2010 (Rel. 15.09, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons from Phlebotomus DE papatasi. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Ingi-1_Pp; KW Vingi-1_Pp. XX NM Ingi-1_Pp. XX OS Phlebotomus papatasi OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Psychodoidea; OC Psychodidae; Phlebotomus; Phlebotomus. XX RN [1] RP 1-2912 RA Kojima K. and Jurka J.; RT "Ingi non-LTR retrotransposons from insects."; RL Repbase Reports 10(2), 148-148 (2010). XX RN [2] RP 1-2912 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX CC Originally classified as Ingi [1] and re-classified as Vingi [2]. CC 5' terminus is temporarily positioned at the start codon. XX FH Key Location/Qualifiers FT CDS 1..2868 FT /product="Vingi-1_Pp_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="MTKEDILCEFCVNFECDVICLQETHRGPTQGRPKING FT MKLVIERPHDKYGSAIFVKSDVDVKSVALTDCDDVEILTVNFEKFSVTSVY FT KPPGTCFQFEEPNNFGNSNAHFVLGDFNCHNILWGYGKNNDDGDALEEWSE FT SKGLTLIHDAKLPASFNSARWQRGYNPDLIFVDEKIAPSCCKNVLPPIPHT FT QHRPIMCQLFAAIKPREVPFRRRFNFSKANWNRFASDVDLSLGDIEPTSEN FT YDRFVEAVKKVSRRHIPRGCRTQYIPGLSSEISSIMSEYSKSFEADPFSDA FT TLELANVLTDAVAESRRERWVSIVESMDLSKNSHRAWRLIRHLNNDPTSVK FT TLPQVTANQIAHTLLMNGKVGSRSAKPKFIRDPTTEVQHLSAPFRVDELVL FT AVKSMKNKKAAGYDGLYSEQVKSFGPRTIDWLLKLFNYCLSHRKIPKAWRK FT AHVVAIPKPGKDINDPKSYRPVSLLCHLFKVFERMLLNRILTVVEEKLISE FT QAGYRPGKSSSNQVLNLTQFIEDGFEEGKITGVAFVDLSAAFDTVNHRVLL FT TKFYKLTSDYGLTQMIGTLLQNRRFFVEFRGQKSRWRLQKNGLPQGSVLAP FT MLFNIYTNDQPLPSPSKAFLYADDLALAVKASTFENVEAHLQTSLIELSKF FT YRENQLRPNPSKTEVCCFHLRNRWAKRRLNVEWEGRTLNHNFTPKYLGVTL FT DRSLTYKTHCNNIKDKVKTRNNLLRKLVNTQWGAQPKVLRTSALALSFSAA FT EYAAEVWERSAHAKKVDIALNDTARIVTGCLKSTPVNKLYPLAGIAPPQVR FT RQVISRKEMSKASTEPRHLLYGQVRVRQRLKSRKSFSQTVTCLEATPEEAK FT LSLWKNTFPPDFVEPKENLPPGGDQDWKTWRALNRLRSGVARSRDNRRRWG FT FLEGDVLCDCGVVQTTPHLYVCPLAPASCETGDLMTANSAAIDVARFWARH FT V" XX SQ Sequence 2912 BP; 847 A; 641 C; 673 G; 751 T; 0 other; atgaccaaag aagatattct ttgcgaattt tgtgttaact ttgaatgtga cgttatttgt 60 ctccaagaaa cccacagagg ccctacgcaa ggacgaccaa agatcaatgg tatgaaattg 120 gtgattgaga gaccccacga taagtacggc agcgcgattt tcgtaaagtc ggacgttgat 180 gtcaagtcgg tggcccttac tgactgcgac gatgttgaga tcctgacagt gaactttgaa 240 aagttctcag ttacatcagt ctacaagccc cctggcacat gttttcaatt tgaagaaccg 300 aataactttg ggaattctaa tgcccatttt gttctcgggg attttaattg tcataacatc 360 ctatggggct acggcaaaaa caatgatgat ggggatgcat tggaggagtg gagtgaaagc 420 aaggggttga cactaataca tgatgccaaa ttaccagcat cattcaacag tgcaagatgg 480 caaaggggct acaaccctga cctcatcttt gttgatgaaa aaatagctcc ctcttgctgc 540 aaaaatgttt tgccacccat tccccatacg caacacagac ctatcatgtg ccaattgttt 600 gctgcgatta agcctagaga ggtgcctttt cgccgaaggt ttaacttctc taaggccaat 660 tggaacaggt ttgcatcgga cgtggatctc agcttagggg acatcgaacc aacatctgag 720 aactatgacc ggtttgtgga agcagtaaag aaggtgtcac ggaggcacat tccaagaggg 780 tgtcgtactc aatacatccc aggtttgtcc tctgagatat ccagcattat gtcggaatac 840 agcaagtctt ttgaggctga tcctttcagt gacgccacat tagagcttgc taatgtattg 900 accgacgccg ttgctgagag tagacgggaa cggtgggtgt caatcgttga gagtatggat 960 ttgtcaaaaa atagccaccg agcttggcga ctgatccgac acctcaacaa cgaccctacc 1020 tccgtcaaaa ctctccctca agtgactgct aaccagatcg ctcatacgtt gcttatgaat 1080 ggaaaagtag gctctcgttc tgcgaagcca aagtttatta gagatcccac tacagaggtt 1140 caacaccttt ctgcaccatt tcgtgtagat gagcttgttt tggctgtcaa gtcgatgaag 1200 aataagaaag cggcaggcta tgatggcctg tattcagaac aagtgaagag ttttgggccc 1260 aggactattg actggctcct taaacttttc aactactgtt tatcacatcg caagataccg 1320 aaagcatggc gaaaagctca cgtagtggcg atccccaaac ctggaaaaga cattaatgac 1380 ccaaaaagct accgacctgt ctcacttctc tgtcatctct ttaaagtctt tgagagaatg 1440 ctgttgaata ggattttaac tgtcgtagag gaaaagttga tcagcgagca agctggatat 1500 cgtccaggaa aatccagctc aaaccaagtc ttaaacctga ctcaattcat tgaagatggt 1560 tttgaggaag gtaagattac tggtgttgct tttgttgacc tctcggcagc ctttgataca 1620 gtaaatcaca gagttcttct aaccaaattc tacaagctta catctgacta cgggctcaca 1680 caaatgatcg gtactctttt acaaaaccgt cgtttcttcg ttgagtttcg tggtcaaaag 1740 agccgatggc gactgcagaa aaatggctta ccacaaggta gtgttttagc tcccatgcta 1800 tttaacatct acacaaatga tcagccattg ccgtctccat caaaagcctt cctttacgcg 1860 gatgatctgg ccttggctgt aaaagcgtct acatttgaaa atgtggaagc ccaccttcaa 1920 accagcctta ttgaactttc aaagttctat agggaaaatc agttaaggcc caacccctcg 1980 aaaacagaag tgtgttgttt tcatttgcgc aataggtggg caaaaagacg actgaatgtg 2040 gagtgggaag gaagaacttt gaatcataat ttcacaccca aatatctcgg agtcaccctt 2100 gatagatctc taacgtacaa aacccactgc aataacataa aggacaaagt aaaaacgcgt 2160 aacaatttac tacgtaaact tgtaaatact cagtggggcg ctcagccgaa agtgctacgc 2220 acttctgctc tagctttgtc attctccgca gcggagtacg cggctgaggt atgggagaga 2280 tcggcacatg caaagaaagt ggacatcgct cttaacgata ccgcgcgcat cgtcacaggg 2340 tgtctaaaaa gcaccccggt aaacaaatta tatccgctag cgggaattgc tccaccacag 2400 gtcaggagac aagtgatttc caggaaggaa atgtcgaagg cttctacaga gccaagacat 2460 ttactctatg gtcaggtaag agtacgacag cggctgaaat ctcgcaagag tttcagtcaa 2520 actgttacgt gtctcgaagc aacaccggag gaagcaaaac ttagtctgtg gaagaatacc 2580 ttccctccag attttgttga gcccaaggaa aatctccctc ctggtggtga ccaagactgg 2640 aaaacttgga gagcgctaaa tcgtttgaga agcggagttg cgcggtcaag ggataacaga 2700 cggcgatggg gtttcttgga aggagacgtt ctgtgtgact gtggggtggt tcaaactact 2760 ccccacttat atgtttgtcc actagctcct gcatcatgtg agacaggaga tttgatgacg 2820 gctaattcag ccgctattga cgtggcccgt ttctgggcaa gacatgtata agtattttta 2880 tgcctcgact cgattaataa taataataac aa 2912 // ID Dmacul5 repbase; DNA; INV; 495 BP. XX AC GU229978; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Mariner-like element internal region of mauritiana subfamily. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Dmacul5. XX OS Drosophila maculifrons OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; guarani group; OC guaramunu subgroup. XX RN [1] RP 1-495 RA Wallau G.L., Hua-Van A., Capy P. and Loreto E.L.; RT "The evolutionary history of mariner-like elements in Neotropical RT drosophilids."; RL Genetica 139(3), 327-338 (2011). XX DR EMBL/GenBank/DDBJ; GU229978; Positions 1 495. XX CC Clone Dmacul5. XX SQ Sequence 495 BP; 161 A; 127 C; 118 G; 89 T; 0 other; tgggtgccgc acgagctgaa accaagagac attgaaaggc gattatgcat tgctgaacaa 60 actgcttgca agacaacaaa gaaagggttt tttgcaccga attgtgacag gggatgaaaa 120 gtggatccat tacaacaacg aaacccggcg caaatcttgg ggtaagcccg gtcacaaagc 180 agtatccact ccgaaaccca atttccatgg aaccaaggtt atgctctgtg tttggtggga 240 ccagctaggc ccgatccact acgaattgct gaaacagggg ccagactatc aacggggagc 300 tctaccgaca acaattgagc cgtctgagcc gggcactcaa agaaaaaagg ccacaattcg 360 aagaaaggca cgacaaagtc atccttcagc aagacaatgc aagaccacac accagcagag 420 tggtcaaaga ttacctcaac gagctgaaat gggagatttt gccccacccg ccatactcta 480 ccagatcttg cacca 495 // ID DNA3-4_AP repbase; DNA; INV; 127 BP. XX AC . XX DT 25-AUG-2009 (Rel. 14.09, Created) DT 25-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA3-4_AP. XX NM DNA3-4_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-127 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(9), 1945-1945 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 3 bp TSD CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 127 BP; 34 A; 33 C; 29 G; 31 T; 0 other; gggccgttct tacgacgtcc ggttaaagtt aaccgacact taaccggccg aattctgccg 60 gtaaaaaaat ctgttatcag tcggttagcg ttaaccgaca ctttaccgga cattgtaaga 120 acggccc 127 // ID BEL-1_SI-I repbase; DNA; INV; 6291 BP. XX AC AEAQ01000373; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the fire ant genome: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_SI_; KW BEL-1_SI-LTR; BEL-1_SI-I. XX OS Solenopsis invicta OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. XX RN [1] RP 1-6291 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the fire ant genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; AEAQ01000373; Positions 3429 9719. XX CC Positions [5147-5707] - Integrase core CC 'TGCTA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 935..5737 FT /product="BEL-1_SI-I_1p" FT /translation="MSTSDIALLKRQRTTIKSSCTRIKTYVDGVTVATPAV FT AAQLEERRLKLNEHWSQYNNFQSQIESLDATEGNDRIGFEEAYYTLCAKIR FT ELLNPTLAVRAPPTSPSTSNASNHSENHISVRLPKLNLPSFSGKYDEWFPF FT FDSFNSIIHSNASLSNVQKLQYLKSSVTGDASSVISSLEISDLNYNVAWTL FT LRERYDNKRIIIHTHIKALMELPRIVKEDSTELRRIADGAVRHVQALKALR FT CPTSHWDDLLVFILSSKLDPLTLREWQSSLVGTEPPTFMQFNEFISRHCQM FT LEATNKSTAASKDATKRSQNNSKRQASCVATVKSKCHYCKGEHAIYYCQDF FT LGLSARQRSTEARTRKLCTNCLRSSTHSADKCIARGCRTCGSKHNTLLHLS FT TNASEERNSTTDGTKSPKPEGSTSKVVTHASSDNNKDFMLSTAVVNAIDDN FT GAARPCRVLLDSGSQANFITREFAEVLGLPSHSLNASISAINNLTTNSTRA FT VKVTFQSRLNSYSRTLDCVITEQIIGKLPALTMNRREFNLPRNIHLADPRF FT NVSSRIDVLFGVEIFWELLCIGQIKATPDHPTLQKTRLGWILAGRHINVTS FT PARTLQTFTTTISNAQLSEQLVRFWQIEDVGEVVINNKNDAYCEEHFLNNV FT SQDSHGRYVVKMPVRDQLIHKIGNSRDIALKRLRGVERRFSRDATLREQYI FT HFMNEYQMLGHMKEVEVGVDDNQPSFYLPHHCVFKKADQSSKIRVVFDASC FT KSDTGVSLNDALRVGPVVQQDLMSIVIRFRTFSYVLIADIIKMYRQVLVHP FT SQTSLQRILWRDDIESDIKTYELTTVTYGTASAPFLATRCLKHLADQHSSA FT YPIGSTVIKRDFYVDDLLTGADTISEAKSVRNEVIQLLRLGAFELSKWASN FT SPELLDSLNNQNEEPVIISDNVDTNILGINWHPKTDTLHFSYEPDQSHNAV FT SKRTILSDISRLFDPLGLLGPTIVIAKLILQDLWRSNIGWDESIPQTIHTR FT WFMFQSQLTELNQLTIPRCVKHGSNQGIQMHGFCDASQYAFGACVYFRTDL FT GDGNHQCELVCSKSRVAPLKAISLPRLELSAALLLARLISKLKTAIDLTNI FT KIYLWSDSTITLNWIASESRRYSVFVANRIGEIQRLTSNISWQHVPSGQNP FT ADLLSRGLNPHELIQSTTWWNGPDFLQASEDHWPSVETSSHQDEPLELRKI FT YAHVVTMNDNIVETLLNNHSNLDKACRILAYCLRFLRKIPRATTHFISHEE FT IKSALHLMCRIVQQTTFPEEYKALERNQTVSASSKILSITPFRDKNGLIRV FT GGRLRRSPLNHDACHPILLPKNHELTRKIIMQTHERNLHSGTQATIASVRQ FT QFWPLSLRSVTRSIILKCIKCFKVKPVFSEALMGSLPTSRVTISKPFSHCG FT VYYAGPLILREGKRRNARNHKAYVAMFVCFATKAVHVELVSDLTSDAFLAA FT LKRFISRRGKPTQMYSDNGTAFVGAHNQLQEFFEFLSKADVQDDVKLFLRE FT QQIAWNFIPPSAPHCGGLWEAAVRSAKTHLYRIVGKAHLTFEEMQTVLCEI FT EAILNSRPISPLSEDPNGSHLSKPRTFLDWYCYE" XX SQ Sequence 6291 BP; 1854 A; 1519 C; 1361 G; 1557 T; 0 other; tctttggtcc ttcgagccgg atacagcagt gatccgactc ttcaccggca tcacaacagc 60 acggctacac accacgcgaa cgaggtatcc cgccatcaat acgcggagct cgttcgctgc 120 agcagacatc tctcagtgca gaacgatttc aaggcggtac accgtgtaac gttcagtgcg 180 cggactcagg cgaggggacg atcgacgaca gctcgatcgc gcgccaaaac agagaagaag 240 tctcacctcg aggaaggtat cctgtgacca tcgcagagct tccagcatat caggagcgtt 300 cagacgacag cgggacggca tcctacaatt cacataagca gacgtcggca ggcgataaga 360 cgatcggcac agcacgatcg cgcgccgagg caagaaaggt attccgccat tactgcgcgg 420 agcttccgtt gtcaagggga ttgcgccatc ttgggcgaaa tttctattgt cgaggacggt 480 atcctgtgac catcgcagaa cgtccgacct acgagaaggc gtcccggaat ccagaggaca 540 tctaacctta agctacatac acggtcgccg acaggcgaac gaacgaacgg catcaccagg 600 aggcctcgcc agcatcacag tgttgcgatc gcagttcccg aggtgcgcac atgtgcagat 660 aattccccga ggccacgacc aacggaggat cattcggaca gcagcgacac tgcgagcttt 720 tctacaagca ccgccgcgta cagtcccggt aagggaaact tattacttgg cagtaccaat 780 caattttcat ttgtcacaat taatttatta atatctgtca aatctcaatt tgccgcggac 840 ggttggtcaa aactattttc gaatttattt gtcgcaaaca attaaagtta ttcgagtctc 900 tgcgctgcga acatagcctt aaggtcaatt cacaatgtcc acttcggata tcgctttgtt 960 aaaaaggcaa cgaacgacga tcaagagttc gtgtacacgg attaaaacat acgtcgacgg 1020 agtcacggtt gccactccgg ccgttgcagc tcaactcgaa gagcgtaggt taaaattaaa 1080 tgaacattgg tctcaataca ataattttca atctcaaatc gaatcgctcg acgcgactga 1140 gggcaacgac cgaataggct ttgaggaagc atactatact ctctgtgcaa aaatacgcga 1200 acttctaaat ccaacattag ctgttcgtgc gccaccgacg tcaccgtcta cgtcaaatgc 1260 gtcaaatcat tcagaaaacc atatctctgt tcgactgcct aaattgaatc ttccttcatt 1320 ctccgggaaa tatgatgaat ggtttccctt tttcgattcg tttaattcaa tcattcattc 1380 aaacgcctct ttgagcaatg tgcaaaaatt gcagtattta aagtcgtccg taacaggcga 1440 cgccagcagc gtaataagtt cattagagat ttctgactta aattataacg tagcctggac 1500 tctcttaaga gaacgatacg ataacaaaag aattatcatt catacgcaca tcaaggcatt 1560 aatggaatta ccacgtatag tgaaggagga ttcaaccgaa ctgcgtcgca ttgcggacgg 1620 ggcagttcga catgtgcagg cgcttaaagc gctcagatgt cccacgtcac attgggatga 1680 tttgctcgtg tttatcctaa gctcaaaatt agatccgctt accttgcgtg agtggcaatc 1740 ttcgctggta ggcacggagc cccccacctt tatgcaattt aacgaattca tttctcgtca 1800 ttgtcaaatg cttgaggcca caaacaaatc gaccgccgct tccaaggatg ctaccaagcg 1860 ttctcaaaac aattcgaaac ggcaagcgtc gtgtgtggcc actgtcaaat caaaatgcca 1920 ctattgcaag ggcgaacacg ccatatacta ttgtcaggat tttttaggac tttcagcaag 1980 gcaaaggagt acggaagcac gtacacgcaa actttgcacc aattgcttgc gctcttcgac 2040 gcattctgca gataagtgca tcgccagagg atgcagaacc tgcggcagca aacacaatac 2100 gttgctccac ttgagcacaa acgcgtctga agaacggaat agcaccacag atggtacaaa 2160 gagcccgaaa ccggaaggct cgacatcgaa ggtagtgaca catgcatcga gtgacaacaa 2220 caaggatttc atgctatcca cagcagtggt caacgccatt gacgacaacg gagctgcacg 2280 gccttgccgt gtattgctag attcaggctc acaggccaac tttatcacaa gggaattcgc 2340 agaggttctt ggcctaccat cgcactcact aaacgcatcg atatctgcca tcaataatct 2400 caccacgaat tctactcgag cagtcaaggt aacattccaa tctcgattaa attcctattc 2460 caggacattg gattgtgtaa ttaccgagca aataataggt aaattgccag cattaacaat 2520 gaatcgaaga gagtttaatt tacctcgcaa tatccattta gcagatccac ggttcaacgt 2580 ttcttcgaga attgacgtac tgttcggagt agaaatcttc tgggagctac tctgcatagg 2640 ccaaatcaag gctacaccag atcatccgac gctgcagaaa acaaggcttg gatggatttt 2700 agcagggcgt cacatcaatg tgacttcacc cgcaagaact ttgcagacct tcaccacaac 2760 catatcaaac gcgcaactga gcgagcaact agttcgattc tggcaaatcg aagatgttgg 2820 tgaggtagtc attaacaata aaaatgatgc ctattgcgag gaacattttt taaacaacgt 2880 gtcacaggac tcccatggta gatacgtcgt aaaaatgcca gtaagagatc agctgattca 2940 caaaattgga aattcaagag acattgcctt gaaacggctg cgaggagtcg aacggcgttt 3000 ttctcgagat gcaaccttaa gggagcaata cattcatttt atgaatgaat accagatgtt 3060 gggacacatg aaggaagtcg aggttggcgt tgacgataat caaccatcct tctatttgcc 3120 acatcattgc gtattcaaga aggctgatca gtcatcaaaa attcgcgtcg ttttcgacgc 3180 ctcatgtaag agcgacacag gagtgtcatt aaatgatgcc ctgagagttg gaccagtggt 3240 gcaacaggac ctgatgtcaa ttgtaatacg ctttcggact ttcagttatg tgctgattgc 3300 cgacatcatc aagatgtatc gtcaagtact tgttcatcct tcccaaactt ctctacaaag 3360 gatcctgtgg cgtgatgata tagaatccga catcaaaacc tacgaattaa ctaccgtcac 3420 ttatggcaca gcatcagcac cattcctcgc aacaaggtgc cttaaacatt tagccgacca 3480 gcactcttca gcttacccaa tcggctcaac agtcatcaaa cgagacttct atgtcgatga 3540 cctccttacg ggggccgaca ctatttctga agccaaatct gttcgcaatg aggtaattca 3600 attgttgaga ctgggcgcat ttgaattaag caaatgggct tccaatagcc cagaattatt 3660 ggattcgcta aacaatcaga acgaggaacc agtaatcatc agtgacaatg tggacaccaa 3720 tatccttggc attaattggc atcctaaaac ggatacattg cacttttcct acgagcctga 3780 ccaatcacac aatgcggtgt ccaaacgtac aattctatca gacatttcaa ggttgtttga 3840 tcctttggga cttctgggtc caacgatcgt tatcgccaaa ctaattttgc aagatctttg 3900 gcgttcaaac atcggatggg acgaatccat accacagacc atccacacac gctggttcat 3960 gtttcaatca cagttaaccg aactcaatca actgacaatt ccaagatgcg tcaagcatgg 4020 aagcaatcaa ggcatacaaa tgcatggttt ctgtgacgcc agtcaatacg catttggagc 4080 atgtgtgtat tttcgaaccg acctcggtga tggcaaccat caatgtgaac tggtgtgctc 4140 aaaatcaaga gtagcaccgc ttaaagcaat ctctctacca aggttagaac tgtcggcagc 4200 cttgttgcta gcacggttaa taagcaagct aaaaacggca atagatttga ctaacatcaa 4260 gatatacctg tggtcagatt caaccatcac actaaattgg atagcttcag agtcacgaag 4320 atactccgta tttgtcgcaa accgcatcgg tgagatacag cgactcacat ctaacataag 4380 ttggcaacac gtgccatctg gccaaaatcc agccgatctg ttgtcaagag gattgaaccc 4440 acacgaacta attcagtcaa ctacgtggtg gaatggacct gactttttac aagcttccga 4500 ggaccattgg ccctctgtcg aaacttcatc tcaccaagat gaacctctag aacttcggaa 4560 aatctatgca cacgtcgtca cgatgaatga caacatagta gaaaccctac tcaacaatca 4620 ttcgaatctc gacaaggctt gtcgcatcct ggcgtactgt ctaagatttc tgcggaaaat 4680 tccaagagcc accactcatt ttatttcaca cgaagaaatc aaatcagcac tgcacctaat 4740 gtgtaggatt gtgcaacaaa ctacctttcc ggaggagtac aaagccttgg aaaggaatca 4800 gactgtcagt gccagcagca agattctatc aattaccccg ttccgtgaca agaacggact 4860 gatcagggtg ggaggcagac taaggcgttc gcctttaaat cacgatgcgt gccatccaat 4920 tcttttgcca aagaatcatg agctaacgcg caaaatcatt atgcaaacac acgagcgtaa 4980 tttacactcc ggcactcaag ccaccatagc aagcgttaga caacaattct ggcctttgtc 5040 cctgcgatca gtaacgcgga gtataattct taaatgtatc aaatgtttca aggtcaaacc 5100 agtattttca gaggccctca tgggctcctt gcctaccagt cgcgttacaa tttccaaacc 5160 gttttctcac tgtggcgtgt actatgccgg tccattaata ctacgggagg gtaaacgtcg 5220 caacgcgcgg aatcataagg cgtatgttgc catgttcgtg tgcttcgcca cgaaagcggt 5280 gcacgtagaa ttagtaagtg acctaacgtc agacgcgttt ttagcagcgc tcaaaaggtt 5340 tatatcgcga cggggcaaac cgactcaaat gtattccgac aacggcaccg cctttgtcgg 5400 agctcacaat cagttgcaag agtttttcga attccttagc aaggccgatg tacaagatga 5460 tgtcaaatta ttcttgcgag aacaacaaat tgcttggaat tttattcctc caagtgctcc 5520 gcattgcgga ggactatggg aagctgcggt aaggtctgcg aaaacccatc tgtatcgaat 5580 tgtcgggaaa gcgcatttaa cgttcgaaga aatgcaaaca gttctgtgcg agatagaagc 5640 gatactgaat tcgcgaccta tatctccatt gagtgaggat cccaatggat ctcacttatc 5700 taagccccgg acatttcttg attggtactg ctatgaatag tttccctaat cccgatctaa 5760 ccgatgtaaa cgaaaaccga cttattcgct ggcaacgttt ggaacaatta aagcaacatt 5820 tttggaaacg ttggaactta gaatatttac atacattgca gacacgttcc aaatggcgag 5880 cgagtaaggg cgatcaattg aaaattgggc aacttgtact cattagacaa caagatttac 5940 accctctcca ttggttgtta gggcgagtac aacaggtcta tccagatgac gaaggtgtta 6000 ttagaagcgc cgaagttaaa acggccaagg gtatacttac gagaccgcta gtcaaattag 6060 caatactacc catcgagtca acagacagta acgacctttt gtagcaccta tgcgtaacgt 6120 attatataaa caactcaagt atttcattat catattcgtt tgtttttagt tttaagtaac 6180 attttcataa ttagagacct atctgcgttt gcgtacaaga aattcctgat tgcgggtctc 6240 gcattaattc ttccgtttat tgaaagccga gcctttcaag gggggcggcg t 6291 // ID Gypsy-8_IS-I repbase; DNA; INV; 3948 BP. XX AC ABJB010050798; XX DT 14-FEB-2011 (Rel. 16.02, Created) DT 14-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the black-legged tick genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_IS_; KW Gypsy-8_IS-LTR; Gypsy-8_IS-I. XX OS Ixodes scapularis OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Acari; OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. XX RN [1] RP 1-3948 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the black-legged tick genome."; RL Direct Submission to RU (14-FEB-2011). XX DR Genome; ABJB010050798; Positions 8435 4488. XX CC Positions [3223-3555] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 348..3254 FT /product="Gypsy-8_IS-I_1p" FT /translation="MGREAEDVLTSLHLSEAELNDYGVVKSKFELHFIPRV FT NVIFERAKFNMRKQEPHETVEAFITALHSLADSCDYGLLREELIRDRIVVG FT LQDRKLGERLQLDAKLTLQSATASARSFETVKRQQAELHSDKTESREIEEL FT RNRYPKDNGKKKNTDTRGTSHRQGDSNSAAWKCRWCGSLCKHEKSQCAARD FT KKCNACKKVGHFESVFFSKLASGSHRSQVHKPGLNEVFLGALVHADNGDPW FT YIKAKVNNQEITFKVDTGADVTANPHTEYKVGEMAKLEPAGEDLRGAGDSP FT LTTLGKFQATITWNGRTTRQTVYVSERLRHPLLGLPVIQALQVLPSLNEIK FT AGNHLEHDRVLTKYPSLSQGLGKMRVLYTISVQPDAKPFAITYPRRVPITL FT LPSVELELKRMQEMDVIEKVEHATVWCSPMVVANKQDGALRICVDYTQLNH FT QIIRERIIMPTVEENLAKLAGATVFSKLDANAGYWQAPLAPESRELTTFIT FT PVGRFQFKRLPFGISTAPDFFQRKMLRILEGIQGQVCHMDDILIFGRNKPE FT HDRNLELTLRRLLEAGVTLHSNKCDFGKSSIKFLGHIIDERGIAPDPKKPE FT AMAKIEAPANLVELRSFLGMVNHLGKFLPNMAEETKPLRDLLKGDASWLWS FT EAQEQALQRIKYLLGKTPILRLYNPSAHRTLSADASSYGLGAVLLQEDEDG FT IRRPVAYASRALTPTEQKYAQVEKEALAIVWACEHFRTYLVGQGFHVETDH FT KPLVPLLSTKRLDEMTPRLQRLRMRIFEYDFTISHVPGKHMYTADVLSRKP FT LRMMEHPIDSTEALVREYEMLTVDLLPASVSFLDRLRSELKKDPCTARVMT FT YCEKEWPGRETLPTDMKGFASLSADLSITQGLLMRNDSLVIPWSLQKEVLE FT RLHCGHQGVVRCRARARDSARWPGISKQIQAYVSRCKACIQHRLTKRASAQ FT IFVRPWMPRSTTLR" XX SQ Sequence 3948 BP; 1133 A; 1007 C; 1047 G; 761 T; 0 other; cgacatggtg gcagcggtgg gatttggcaa ccggtaaata tcgtctccga ggaagctgga 60 agcaccacag gaggagacct gacgagaaca gcgaaatctt ccacctggcc ggctccatga 120 acgggaacag tgacggctcc ggcacttcta cctcccaagc gagcagctcc caagcgagca 180 gcggtccttc tttcttcaat cccttcgcgg tcgagctccc tgaaaagttg gatttcaggg 240 atccccagga ctggaaaaga tggagcaccc gctgggaaag ataccggatc atttcgtgac 300 tccatcaacg ggacgccgcg acacaggtaa acacattcct gtacgccatg gggagagaag 360 cggaagacgt tctcacgtca ctccatctgt cagaggcaga gctaaatgac tacggggtag 420 tgaagagcaa gtttgaactc cacttcattc cgcgcgtaaa cgtcatcttt gaaagagcga 480 aatttaacat gaggaagcaa gaaccacacg agacagtgga ggccttcatc acggccttac 540 acagtttagc ggactcctgc gactatggac tacttcgcga ggagctgatc cgtgatcgca 600 tcgtcgtagg tcttcaggac aggaagctgg gcgaaaggct ccagctcgac gccaagctca 660 cgcttcaaag tgcgacagcg tcagcacgtt ctttcgagac agtgaaacga cagcaagcgg 720 agcttcacag cgataagacg gagtccaggg aaatcgagga gctccgaaac agatacccaa 780 aggataatgg caagaaaaaa aacacagaca ctcggggtac atctcatcgt caaggagatt 840 ctaacagtgc ggcgtggaaa tgccgatggt gtggcagttt gtgcaaacat gaaaaatcac 900 aatgcgccgc aagagacaag aaatgcaacg cttgcaagaa ggtaggacac tttgagagcg 960 tattcttctc caagcttgca tcagggtctc acaggtcgca agtgcacaaa ccaggtctca 1020 acgaagtgtt cctgggtgca ctggtacacg cagataacgg agatccgtgg tacatcaagg 1080 caaaagtcaa caaccaggag atcacattca aggtggacac tggggcggac gttaccgcca 1140 atcctcatac cgagtacaag gtcggagaga tggcaaagct agagccagcc ggagaagatc 1200 ttcggggggc aggagactcg ccgttgacaa cactcgggaa gttccaggcg acgataactt 1260 ggaacggcag aacaacgaga caaactgtct acgtttcgga aagactccga catcctttac 1320 tcggtcttcc ggtgattcag gctctgcaag tgctgccaag cctgaatgaa atcaaggctg 1380 gcaatcatct ggagcatgat cgcgtactta ccaagtaccc gtctctatct caaggactgg 1440 ggaagatgag ggtactttat acgatctccg ttcagccaga tgcgaagcct ttcgccatca 1500 cgtacccacg acgagtgccc atcacacttc tgcccagtgt agagctggaa ttgaaaagaa 1560 tgcaagagat ggacgtcatc gagaaagtgg aacatgccac agtctggtgc tcacccatgg 1620 tggtggccaa caagcaagat ggagctctcc gaatctgtgt ggactataca cagttaaacc 1680 atcagattat acgggaacga atcatcatgc cgacggtgga ggaaaaccta gccaaactcg 1740 caggagcgac tgtcttcagc aagctggatg cgaatgcagg atattggcag gcaccactgg 1800 ctccagaatc aagagagctg acgacattta ttacaccggt gggacggttc cagttcaagc 1860 gactcccgtt cgggatatca acagcgccag acttctttca gcgcaaaatg ctgcggatcc 1920 tcgaaggaat ccagggccaa gtgtgtcaca tggatgacat tttaattttt ggtcgaaaca 1980 agcctgaaca tgaccgaaat cttgaactta cactccgccg ccttcttgaa gctggagtga 2040 ccttacactc gaacaagtgc gacttcggca aaagcagcat caagtttcta gggcacataa 2100 tagacgaaag aggtattgcg cctgacccca agaaacctga agccatggcc aagatagaag 2160 ctcccgcaaa cttggttgag ctacggtcat ttctggggat ggtgaaccac ctcggcaaat 2220 ttctaccgaa catggcggag gaaaccaagc cgctccgaga tcttctgaaa ggagatgcaa 2280 gttggctctg gagtgaagcc caagagcaag cacttcaaag gatcaaatac ttgttaggca 2340 agactccaat cctcaggctc tacaacccaa gcgcgcaccg gacactatca gctgacgctt 2400 catcgtatgg acttggagcg gtgctactcc aggaggatga ggacggcatc aggcgtccag 2460 tagcgtacgc ttcgagagca ctgactccga cggagcagaa gtacgcgcaa gtcgaaaagg 2520 aagcattggc gatcgtttgg gcttgcgaac atttccgcac ctacttggta ggtcaggggt 2580 ttcatgtcga gacggaccat aagccattgg taccgttgtt gtcaacaaaa cgtctagacg 2640 agatgacgcc aagattacag cggttgcgaa tgcgcatttt tgaatacgac tttacgatat 2700 cacacgtccc gggcaagcat atgtacacag cagatgtgct gtccaggaaa cctctacgaa 2760 tgatggaaca cccgatagac tctacggaag cgctagtccg agagtatgaa atgctcacgg 2820 tcgacctact cccggcgtct gtttcatttt tggacagatt gaggtccgaa ctaaagaaag 2880 acccatgcac ggcaagggta atgacgtact gcgaaaaaga gtggccgggg agagaaacgc 2940 ttcctacgga tatgaaggga ttcgctagcc tttctgcaga tttgagcatt acacaaggcc 3000 tactaatgcg caatgatagc cttgttatcc catggagcct ccaaaaagaa gtactagagc 3060 gacttcactg tggacatcag ggggtcgttc ggtgcagagc acgagcaaga gattctgcgc 3120 ggtggcccgg aatcagcaaa cagatacagg cctatgtcag ccggtgtaaa gcttgcattc 3180 agcatcgact aaccaaaaga gcctcagctc agatatttgt gagaccatgg atgcccagaa 3240 gtactacact cagataatgg ccctccgttt cgatcgaacg actttcggcg gtttgcgaac 3300 gaatggggag tcaaattggt gacttcaagc ctttactacg ctcaaagtaa cggagaggct 3360 gagcgagctg tacaaacagc caaaaatctt tttagcaaat cacccaatgt agcacaggct 3420 cttcaagccc acagaacaac tccaggccca gaaggtatct ctccagcgga actcctcatg 3480 gggaggaagt tacgttcaac tgttccaaca gcgccaaata cactgacgcc tcggtggcac 3540 cacgtacgca agtataggca acagtacggt aaacaaaaac aagagaagcc ttgcactaca 3600 atcgacgcca tcgtacaaga gatcgcagca cgatacctcc tcactcagag gtgtggatac 3660 gcgcgggcgc agcttctcaa ggaaggatac accgccagac ggaaatgccg cggtcatacg 3720 tggcggaaac aagtgagggc acaatgagac gcacaagccg tcaccttcag cccgcgtttt 3780 tggaagtaga catccagacg gtcgggcaac agcagccctg cgacttcggg tgtacacaat 3840 cagacagtcg ggcaacaaag caccgaaacc tccgcaactg taacaaggag cggcagggta 3900 tcacgcccac cagaccgcta cggacaataa tgcttcgaaa gggggagg 3948 // ID DNA4-1_AP repbase; DNA; INV; 823 BP. XX AC . XX DT 21-AUG-2009 (Rel. 14.08, Created) DT 21-AUG-2009 (Rel. 15.12, Last updated, Version 3) XX DE A family of DNA transposons - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA4-1_AP. XX NM DNA4-1_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-823 RA Jurka J.; RT "DNA transposons from Acyrthosiphon pisum."; RL Repbase Reports 9(8), 1737-1737 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC 4 bp TSD. CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX SQ Sequence 823 BP; 307 A; 115 C; 94 G; 302 T; 5 other; cagggtgatt ttttaagcat gctcaccctc atttttccct ttaatagtac atattttatt 60 caaattctga tttttagaat ttttaagtat acctaaaatc taaasaccat attttcaaat 120 ttctgatatt ttttgtacta cttaaggagt gtcctgtggy gatacaaact tctatttttc 180 aaatgagaaa ccccctacct ttttttactg caaattattt agcgratgat tttcttgaaa 240 atgttgatgt atctaaatca aaattgaaaa ttctaatgag tagtttctga gttattaaaa 300 tgtttatact aagaataata atccttaaaa atggttttta aaaaatataa cctttataag 360 ttttatgtca taggtattag atgtatgatt tataatttat tatacttgga tyatttttac 420 aaattattta tattgataga tcaatagtaa ttactaacat tttcaaatcg tctaagcccc 480 catatctact taacccatta ccatagaaca taatccatag acatattatc cttagtacac 540 aaagttatat aactcaaaaa ctactcgctt gaattttgat ttagatacat caaaattctc 600 aaaaaaaaaa ttatctgctt aataattaaa agtaaaaaag gagttcctat ttgaaaaaac 660 agaagtttgt attgtcacag gatcatcctt aagtggtaca aaaaatctca aggattcgaa 720 aatatggtct tcaagtgtat ttaaaaattc caaaaatcag aatttraata tatgcataat 780 ttaagcataa aaatagggtg agcatgctta aaaaatcacc ctg 823 // ID Mariner-2N_SM repbase; DNA; INV; 909 BP. XX AC . XX DT 10-AUG-2009 (Rel. 14.08, Created) DT 10-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE Mariner DNA-transposon from Schmidtea mediterranea. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner-2N_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-909 RA Jurka J.; RT "Mariner DNA transposons from Schmidtea mediterranea."; RL Repbase Reports 9(8), 1862-1862 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX SQ Sequence 909 BP; 306 A; 102 C; 128 G; 372 T; 1 other; caccaagccg cgctgagata aattccgcta taatgcaagt tgatacataa atcaacttat 60 tggaatttat ttttagtgaa aaatcaatgg atttcagcca tttttgaaat ggctgcaaag 120 taagtgattt tttaaagttt taagagttga attatgattt tacaaggaaa ttaaatcatt 180 attcaagctt ttaaatataa cttttgatct acaagttcat atatattatt aataatgttt 240 atgaatattt aggaaatgat ttattttatg atgacttatt tttcggaatt aattcaaaaa 300 aagtgagtta aattattttt gaagtaataa aaataattaa tcgccataaa tatgtatttt 360 ttatacatct tggaaaaaat cttcgaatat gtacacaaaa taattgggtt catgattatt 420 ttttaaaact tattaataaa atttacaatt tatgcttttt tcggatctca ggttttgttt 480 ttttcgacag aaatgtctga aaacttatga aatattaaat taattttagt tttaagggta 540 aatcttacga agatacatcc tcatttacac gaaatttctg agcagaaacc tcaaaattta 600 aattaaaaax tttttccgcc attttttttc tctaaaaatg tctcaaaaat gtctataaat 660 taacttatta taggggtttg tctcaacgct ggttgactac acgttgaaag tttacatgca 720 ttttttatta tgtgttgtgt taatgtgcat tattttattt ttctatcact ttttgatttt 780 ctatagtgta gggtttatct caacgcacgt tttacaaaca acctcaaggt gagttggtgt 840 gtttatatgt tataaaatta ttgatttttg ttttatatta tagaagattt atctgaacgc 900 ggcttggtg 909 // ID BEL-201_AA-I repbase; DNA; INV; 5961 BP. XX AC AAGE02028917; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-201_AA_; KW BEL-201_AA-LTR; BEL-201_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5961 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02028917; Positions 35819 41779. XX CC Positions [5002-5550] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 339..1424 FT /product="BEL-201_AA-I_2p" FT /translation="MSNRDHSPRNCQSCSRRDSAESEMVQCNSCQLWEHFG FT CAGVDEQIRRSDTTYVCKRCGKPGVSNDDYLNVPFADQRSKGAKAVSKASS FT KVGSKRGKKVPDPTGSVTSSVRVAMLAEQLKLVEEEQLLEEQEIQEQEEIR FT KRLFDEKERQLEEKKRLTEEAERIRENKLQEELEAKRKQQQIRKESFEKRQ FT EIIGQMAEASSRGGSVVSSGQKVRDWLNNQGARKSDGNLVGGPEERNFKLI FT PEDPIESSNLAQQPAPQNTAHEQSLLPAQPNSVPLLPSPEPLPTCSARMLS FT QAQIAARQVLGKDLPVFDGNPEDWPVFISNYEQSTITCGYSDAENLVRLQR FT SLKGNALESVRSSRPVCPM" FT CDS 1828..5961 FT /product="BEL-201_AA-I_1p" FT /translation="MPTVFCRKHRRTLEIGSTERSLSDLSQQSRQRPCRSW FT SGCGVEGCRQRHHSLLHSAASSENFNVSPSHVTSGEYQWPLFRIIPVEIHG FT RDSSQTIYAFIDEGSSYTLLEDSVAERLGVRGEIVPLTLKWTGNVTRIESS FT SQIIQFDISGIGASTRYTIAHARTVSRLMLPTQTLKYNELAYRFPHLRGLP FT IEDYELVQPKILIGLDNLRLGVPLKLREGGLTDPIGAKCRLGWSIYGCIPN FT QVSHLTIVGFHVKATSDSVREMNDLLRDYFTLEDIGVNERSEMVESDEEKR FT EKTDELNFPDSYPMAIRRLESLERRLKKEPVLQESVRDQIEDYERKGYAHR FT ASLMELTSVEPNRIWYLPLGVVINPRKPGKVRLIWDAAAKVGDISFNSQLL FT KGPDLLTPLPKVLYQFRQYPVAVTSDIMEMFHQINIRSPDCQSQRFLFREN FT SADIPKIYVMDVATFGSTCSPASAQYVKNRNAEEFATEYPRAAAAIVNKHY FT VDDYLDSFETISEAVDVVNEVKLVHSKGGFTLRRFLSNEPAVLRGIGEEVV FT TEAKDLDLERAGKIESVLGIKWVPNEDVFIYTFGARDDILRLMHNDYVSSK FT REVAKVVMSLFDPFGLIAFLLVHGKVLIQELWVKGTGWDQKIPQDADVRWR FT RWVDSLSQLERLRIPRYYFPSSLPKNFICLQIHLFVDASDAAYAAVVYFRL FT QIEQGTQVALVGAKTKVAPLKALSIPRLELKAAVLGIRLMDTIQCQHTYPI FT TQRFCWTDANTVLAWIRANDHRRYHKFVAVRVGEILSSTQQSEWRWVPSKM FT NVADLATKWNDDPQATNPWFHGPPFRFEAESHWPKQRTTIQTEEELRPAHS FT NLVHFSSTFELERFNKWSKLLRVVAYVFRWINNLQKRRNGENLELGCLTSK FT EFHQAENALLKTAQWELYADEVVVLSKTQGPPENRHPTVNKSSTIYKKYPF FT VDEFGVLRSRGRIDAAPYALVESKFPIILPNQHLITFLIVDSYHRRFRHAN FT RETIFNEIRQRYEISALRRLLTKVERACMFCRIAKVVPRPPIMAPLPKIRL FT TAFIQPFTYTGLDYFGPILIMLGRSNVKRWVALFTCLTIRAVYLEVVHSFS FT TVSCIMAVQRFVARRGQPREFWSDNATCFQGTSNELRMHNITLAEKFTTSQ FT CTWKFIPPSTPHMRGAWERLVRSVKVAMGAVSESSRKPDDETFETILTEAE FT AMIDSRPLTYVPLESADQEALTPNHFLLGNSSGTKFLSTEPLDERVVLRSS FT WKMARYIVDELWRRWLKEYLPMITRRCKWFEDVKDLRVGDLVLIVGETARN FT QWIRGQIEEVYPGRDGRVRQALVRTSSGTVRRAAVKLAVLDVKENSKPAIE FT EAIVSDPHQGLQAGV" XX SQ Sequence 5961 BP; 1741 A; 1356 C; 1414 G; 1450 T; 0 other; tatgtcattc tataatcatt ttttcttaaa ttaacattga aattcaataa ttatcgtaag 60 tacgaaaata aactctaaaa ttatctggaa tgctaatgag cttgtaaaat ttatagaatc 120 cctaaatact catagcggga atacgttcaa ttgctctcca ctttcgcgat aagtacacca 180 aatttgtatg tgaactaaaa caaccaatta aatgtatact aatataaagt tctatttcta 240 gctaaaagca tacaaacaca aaccacaccg tattgctatt tggagttggt gacataaccc 300 aacatttctt ttagaaattt gtccgggtat tggaagtaat gtcaaatcga gatcattccc 360 ctaggaactg ccagtcctgc tcacgaaggg actcggcgga aagtgagatg gtgcagtgca 420 actcctgcca gctctgggag cactttggat gtgccggggt tgacgaacag atccgacgat 480 cggataccac ttatgtatgc aaacgttgtg gtaagccagg ggtttccaac gacgactatc 540 tcaacgttcc cttcgcggat caacgctcga aaggagcgaa agcggtttca aaggctagtt 600 ctaaagtggg ctcaaaaagg ggaaagaagg ttcctgaccc aaccggcagc gtgacgtcta 660 gtgtacgtgt tgcaatgctg gccgaacagc tgaaactagt cgaagaagaa caacttcttg 720 aagaacagga gatacaggag caggaagaaa tcaggaaacg cctgtttgat gaaaaggaac 780 gtcaattaga agagaaaaaa agactcacgg aggaggctga gcggattcgt gagaacaagc 840 tacaggaaga attagaagca aaacgaaaac aacagcaaat acggaaagag tcgtttgaaa 900 agcgtcagga aattatcggt cagatggcag aagcaagcag cagaggaggg tcggttgtga 960 gctccggcca aaaagttcgc gactggttaa acaatcaggg tgctagaaaa tcagatggga 1020 accttgttgg aggtccggag gaacgtaact tcaaactgat tcccgaagac ccaatcgaat 1080 catccaacct agcacaacaa ccagcgccac agaatactgc tcatgaacag tcattgttac 1140 ccgctcaacc aaattccgtt ccgttgctac cgtcaccaga acctcttccc acctgctctg 1200 ctcggatgct atcgcaagca caaattgcag ctagacaggt tctggggaaa gaccttccag 1260 tttttgatgg taacccggaa gactggccag tcttcataag taattatgag cagtcgacga 1320 tcacttgtgg ctattcggat gcagaaaacc ttgtacgcct tcaacgatct ctcaaaggaa 1380 acgccttgga atcagtgaga agctcccggc cagtgtgccc catgtaatac agactttgcg 1440 aacccttttt cgtagaacgg aattgcttat tagatcgatg ctgaataaga tccatcaaat 1500 tccgccgcct cgacatgacc gtctggagac actgatgcac tttggacaca agcatcaacc 1560 tgtgactctt gacacgttcg gagaatttat gtcggggttg gtaactgcgg caagcgaagt 1620 ctctttcaat cttccattct caatgaacca aaagcaaact tcagcgatag atttccgaag 1680 tggaaaacaa aaacctggga atacgggaat gctccaagct cacctgacca acgatactca 1740 cttcttggac tgtgacagta cccctacaag ttccaaacct gcaaagcaat gtcaagcgtg 1800 tggtagagta ggtcacagag tagctgaatg ccaacagttt tctgcagaaa gcatcggcga 1860 acgttggaaa ttggttcaac agaaaggtct ttgtcggact tgtctcaaca gtcacggcaa 1920 aggccgtgcc ggtcgtggag tggatgtggt gtcgaaggtt gtcgacaaag gcatcatagt 1980 ctacttcact cagctgcatc atcggaaaac ttcaatgttt caccgagcca cgtcacatcc 2040 ggagaatacc agtggccact tttccggatc attccagtgg aaatacatgg tcgtgacagt 2100 tcccagacaa tctacgcctt catcgacgaa ggttcatctt atacccttct agaggactcc 2160 gtagctgagc gactaggcgt tagaggtgaa atagtaccgc tgacactaaa atggacagga 2220 aacgttacac ggatcgaatc cagttcgcag attattcagt ttgatatatc cggcattggg 2280 gctagcactc gatacacaat cgcacacgct cgcactgtta gtcgattaat gttgcctacc 2340 caaacattga aatacaatga attggcatat cgttttccac atctacgagg attgcccatc 2400 gaagactatg agttagttca gccaaaaata ctaatcggcc tggataactt aagattgggc 2460 gtcccgttga agctacgaga aggggggtta acagatccca tcggagccaa atgtcgacta 2520 ggatggagca tttatggatg catcccgaat caagtatctc atttgacgat cgttggtttt 2580 cacgtcaagg caacctctga ttcagtacgc gaaatgaacg acttacttcg agactatttt 2640 actttggaag atattggagt caacgaacga agcgaaatgg tggaatctga cgaagagaaa 2700 cgagagaaga ccgacgagct taatttcccc gatagttatc caatggccat tcggcgtttg 2760 gaatcactag agcgtcgact caaaaaagaa cccgtcttgc aagaaagtgt gcgcgatcag 2820 attgaggatt atgagcgaaa gggttatgct caccgggcaa gtttaatgga gctgacgtcg 2880 gttgaaccaa accgcatatg gtatcttcct ctgggcgttg ttattaaccc tagaaaaccc 2940 gggaaagttc gtctgatctg ggatgcagcg gccaaggtgg gagacatttc gttcaactcc 3000 caactcctta aaggaccgga tcttctgacc ccacttccga aggttctgta tcagttccga 3060 caatatccgg ttgccgtaac tagcgacata atggagatgt tccatcagat caacatccgc 3120 agcccagatt gtcagtctca acgttttctg ttccgagaaa actcagcaga tatcccaaaa 3180 atttacgtga tggacgtggc cacctttggg tctacttgct ccccagcatc ggcccaatac 3240 gtaaagaatc gtaacgcaga agagtttgca acagagtacc ctcgcgcggc agcagccatc 3300 gtgaataaac actacgtcga tgactacttg gatagtttcg agacaatttc cgaggcagtg 3360 gatgtagtga atgaggtcaa actggtacat tctaagggtg gctttaccct acgtcgattt 3420 ctatctaatg aacctgcagt tttacgtggc atcggggagg aagtggtaac ggaagccaag 3480 gacctagact tggagcgagc tggcaaaatc gagtctgtat tgggaataaa gtgggtaccg 3540 aacgaggatg tgtttatcta cacttttgga gctcgagatg atattttgcg tttgatgcac 3600 aacgattatg tttcttcaaa acgtgaagta gccaaagttg tcatgagtct gttcgatccc 3660 tttggcctta tagcctttct attggtgcac ggaaaagtgc taattcaaga actctgggtg 3720 aaaggaaccg gatgggatca aaaaatccct caagatgcgg acgtacgctg gcgacggtgg 3780 gtggattctc taagtcaact agaaagatta cgcatacctc gctactactt tccatcatct 3840 ctaccgaaaa acttcatttg tctacaaatc cacctgtttg tggatgcaag tgatgcagca 3900 tacgcggcag tagtttactt ccgtcttcaa atcgaacaag gaacacaagt agcactcgtt 3960 ggagcgaaaa ccaaagtggc gccactgaag gctctatcga taccgagact tgagcttaag 4020 gcagcagtcc tcggcattcg cctgatggat acaatccaat gtcagcacac ttacccgatt 4080 acccaacgct tctgctggac tgatgccaac actgtactag catggattcg tgcaaatgat 4140 catcgacgtt atcacaagtt cgtagctgtt cgagtaggag aaattctttc gtctacacaa 4200 cagtcagaat ggcgatgggt gccatcaaaa atgaacgttg cagatctcgc cactaagtgg 4260 aacgacgatc cacaggcgac caatccatgg ttccacggtc cgccttttcg ttttgaagct 4320 gaatctcact ggcctaagca aagaacaacc atccaaacag aagaagaact acgtcctgcc 4380 cactccaatc tggtccactt ttcatcaact tttgaactcg agagattcaa taagtggtcc 4440 aaattactac gcgtagttgc gtatgtattt cgttggatta acaatcttca gaagcggaga 4500 aacggtgaaa atctggaact cggctgtcta accagtaagg agttccacca agcagaaaat 4560 gctctgctta agacggctca gtgggagttg tacgcagacg aagttgttgt tctcagtaag 4620 actcaagggc cgcctgagaa tcggcatccc actgtgaaca aatcgagtac catctacaaa 4680 aaatacccat tcgtggacga gtttggagtt ttgcgcagtc gtggccgtat tgatgctgca 4740 ccatacgcac tggtcgaatc caagtttccg ataattcttc ccaatcaaca cctcattacg 4800 tttctaattg tagattcgta ccatcgtcgc tttcgtcacg ccaatcgcga gacaatattc 4860 aacgaaatcc gccaacgcta cgagatatct gcacttcgac gactgttgac taaagttgaa 4920 agagcgtgca tgttttgtcg tattgcgaaa gtagtaccaa gaccaccgat catggcgcct 4980 cttcccaaaa tacgtttgac agcatttatc caaccattta cttatactgg cttggattac 5040 ttcggaccca tcttgataat gctaggcagg agcaatgtga agcgatgggt ggcactcttc 5100 acctgcctca ccatcagagc ggtgtactta gaggtagtac actcctttag caccgtatca 5160 tgcatcatgg cagtccaaag atttgtagct cgtagaggcc aaccaagaga gttctggtct 5220 gataacgcaa cctgtttcca ggggaccagt aacgagctac gaatgcataa cataactttg 5280 gcggaaaagt tcactacgtc acaatgcacg tggaagttca taccaccttc aacaccccat 5340 atgagaggtg cttgggagag gcttgttcgc tcggtgaaag tggcaatggg agctgtatcg 5400 gaaagttcac ggaaacctga cgacgagacg tttgaaacta ttctcacgga agcagaggct 5460 atgattgact ccagaccact cacatatgta cctctagaat cggctgacca agaagctctg 5520 acacctaacc actttttgtt gggcaactct tctgggacaa aatttctatc caccgagcct 5580 ttagatgaac gcgtggttct taggagcagt tggaaaatgg ctagatatat tgtggatgag 5640 ctttggcgta ggtggctcaa agagtatctg ccaatgatca cccggagatg caaatggttt 5700 gaagatgtaa aggatctgcg agtaggtgac ttggtattaa tagtcggtga gacagcgaga 5760 aaccagtgga ttcgagggca gatagaagaa gtgtaccccg gacgagacgg tagagtgcga 5820 caagcattag tcaggacatc gtcgggaact gtacgtcgag ctgcagtgaa actggctgta 5880 ctggacgtca aggagaatag taaacctgct atcgaagaag ccatagtttc ggaccctcac 5940 caaggtttac aggcgggggt a 5961 // ID CR1-23_BF repbase; DNA; INV; 2628 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-23_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-23_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2628 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-2628 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1594-1594 (2009). XX DR [2] (Consensus) XX SQ Sequence 2628 BP; 754 A; 721 C; 555 G; 598 T; 0 other; agggctgatt tcgacaattt tagactggag ctaggtaaag ctaactggac ttcctgcatg 60 tcaggttcgg ttaatgctag atggaacagc tggaaggaca aattcttaaa aatagcaaaa 120 aacacaattc ccaacaaaac tgtcaaagtg aaccatagat cccacaaacc ctggatcacc 180 accgacatat tggacatgat ccggcaaaaa acggaccttt atcgagagta caagtcatca 240 cctacccagc aaaactggac gaactataca aggtgtaaaa acacactcac ctcggaactc 300 cgtcgtgcag aacgcgagta ttacaactcc atcggcgaca acatcaaaag tgctgacgac 360 tgcaaactcc tgtggtcggt actgaaaaac gctaccggaa agggaaggat gggtatccct 420 tcacttctct accagggtcg tactctcgac aaggactgcg agaaggctga actactgaac 480 aacatattcg ttcagacaac catggacagc aaacacccgg actacatgaa gagactacca 540 ctgtacacag ataacgagct cagcttcata cagctgtcga ccgaggaggt ctacgacgtg 600 ctactgggat tgccaacaaa caaggctccc ggtccggacg gaataacaaa ccgcctactc 660 agagaagcag ccccagctat cagtgcctca ctctgtgagc tgtttaactt ctcactcgcg 720 actggacact tcccaactga gtggaaacag tgtaacatat caccggtcca caagaaagga 780 gaccgcacag aaccatccaa ctacaggcct attgctctcc tgcaaacgat cgccaaagtc 840 cttgagaggc ttgtttacaa tcgtctatac acgttcctta ccgacaacaa cctgatgaac 900 cccaagcaat caggcttcaa gcagggggat gggactgttc tccaactgct ccgtctgact 960 gatgactggg caaaatctat agacgatcca aatattgcct gcactgctgc agtatttctc 1020 gatgtaaaac gtgccttcga taccgtgtgg catgacggcc taatatacaa gttaaccaga 1080 tacggaatcc atggctcgct tcttgcctgg ttcacgaact acctgtcggg gcgccagcaa 1140 agggtggtca ttaacggtac agcttctacg tgggggtctc caactgcagg agtgccacag 1200 gggagtattc ttggcccgct gttgtttgcg atctacctaa acgacattaa agacgtgccg 1260 acgatatcca gtttgaactg ttttgctgac gatacatcga tgtacaactc cggacataca 1320 gccgaggaag ttgccagcac aactaatgcc gatctttgtc ttgtttccga ttggttttcc 1380 gactgggggc tcgaactaca ccctgataag tgcaaagtta tgtgtataaa ggcttaccaa 1440 agcaaggtcc aactaccccc catctaccta gccggtcaaa tcctcgaaca ggtgacttcc 1500 tacactcacc taggcgtcac catgcactat acactccggt ggaaagaaca tgctgaaacc 1560 gtttccagta agtcgaaaaa ggttctcggg ctcctgagca aactacagcg caaactgtcc 1620 cgagaagcct tggagcttgc ctataacact cttgtacgca cgaaattaga gtacgcatcc 1680 atcctcttca gcaacattag cgtcactgcg agcaagacca tggagagggt acagtaccat 1740 gctggacgcc tcgtctcagg agcgatggca cgtacccctc atgacaaact actagaagaa 1800 ctagagtggg acagtctcgc aaccagaaga gactacaaca gactacttac catgcaaaaa 1860 ctagtgacag gctccgtacc agcccatctc cagccactag tccctactac cagagagagc 1920 cgacggcgac tacaccttcg tcttcggaac gacacacatc tacaagtccc gtactgccga 1980 accaccacgt acaagaacag ttttgtcccc tacagcacac gactctggaa tagtcttccc 2040 atggaagtaa aggaagccac atcttacaac cagttcaaaa agaagtgcag agatcacatg 2100 ctgtcttccc gtcaacacca gaagtttcgt agacttggaa atcgccagag caacatcctt 2160 actaccagac tacgtcttgg ctggtgtcag ctcaactcta cgttggctaa gttcaacatc 2220 acaacccgga gctgtgcttg tggtgctacg tccgagacgg ttgcacactt tctacttcac 2280 tgtcctctgc acacggctgc acggcaaacc ctagcttctt cagtgctcca acttgtcggc 2340 cagtctctct caaccggagt cctccttaat ggttctccag gtcaaaactc tgctacaaac 2400 caaaaactct cagacgctct gcattcattt ttaacatcaa caaaccgatt tacagcacct 2460 tccatctgct cctcatagct tacgtagcct atgtggccta ttatgtacat atgtgtaatt 2520 tagatttgtt tgttgtatct ttgtgggtca gccgtcatca gcttgaatag actgcctagg 2580 ctggcccagt gtaatgactt gttgtcattt tcaataaata aaataaaa 2628 // ID Copia-35_AA-LTR repbase; DNA; INV; 294 BP. XX AC AAGE02018832; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-35_AA_; KW Copia-35_AA-I; Copia-35_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-294 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02018832; Positions 40508 40215. XX SQ Sequence 294 BP; 65 A; 79 C; 54 G; 96 T; 0 other; tgatttatcc cattccattt gttagcctag attatccgca cacccctcac tcatcgaacc 60 aagagacaat cattattaca acaaagtctg ccgcgcatta taaacgtgtt tttaccgctg 120 ctaataaaag ttaatttcct aatcgtagtt tttcctcgcg agttaagtga ttcgattcca 180 aatccgattt tcctttcgtc cgctggtgct tggctgtgtc cgccgcgtat tcgctactct 240 gctcgtcgtc gagtttttcc gggaagacct gtcgttccgg agtcgaatac tcca 294 // ID Tx1-3_BF repbase; DNA; INV; 5276 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.04, Created) DT 28-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Tx1-3_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; L1-3_BF; KW Tx1-3_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-5276 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-5276 RA Kapitonov V. and Jurka J.; RT "Young families of Tx1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 840-840 (2009). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 448..1299 FT /product="Tx1-3_BF_1p" FT /note="ORF1." FT /translation="MSRKKAITQDDSELVSMSTLKDLLDNQKEVFNNMIEL FT QQKNFRSFMESTMECHNKRIDNLIREVQEIKVSLQYSQKDIEDIQTNSKQQ FT EQRIKETEAKILQLKTDITSNHNKTDYLENQTRRNNILIDGITDVKEESWE FT ECEQKVRSLLKDKLQIDPKKIEIERAHRNGRFQPDARPRSIVAKMLRFKDK FT ETILQRARYLKGSHIFINEDFSEAVRQKRKELIPEMKAARERGNVAYLRYD FT RLIVHPPRPTTRSTTQRHSNNEGTDTGRRRQDLRSTRATFGAV" FT CDS 1315..4974 FT /product="Tx1-3_BF_2p" FT /note="endonuclease and RT." FT /translation="MDSDTTTQCYLTSELTCLSLNVRGLNDQVKRRTVFRW FT LHQQRVDVVFLQETFSNAKIERVWRAEWGGTILYSHGTNHSKGVAILVNPR FT KSCEIINCKNDTNGRLLIVNAVIAGEQVCFVNLYAPNEITEQYMFFQDIRT FT KLRENSTCHTIMAGDFNCSLTDMDKRGGRPVKEKLRTIQAINLLTKGQDLI FT DIWREKNPKERRYTWRDANQGIMCRLDYIFVSDSLKARCGSANIIHAPSPD FT HSAVTVHFEPQNACKRGPGFWRFNNTLLNDKTYTDTMKLKIDDARAKYSEI FT EDKRLLWDLIKMEIRSFTMYYSKMKAKERKKDELSAEKQTNSYIQQYESNP FT SQENWCKYNRAKEKLDEIRHLKVKGSIIRSRCRWHEEGEKSTKYFLNLEKR FT NYEQKCINELLTENNSVITDPGKILEQGKQFFESLYTSKETNPHDPKFEEF FT FTNNEINKLTGEQRDNCEGNITVKEALTALKQMAKKKNRSPGTDGLSFEFL FT LYFWDKIKNEIVDSFNTALEEGELAISQKRGIIRLLPKQGDCRKLENWRPI FT ALLNTDYKILTKCLALRLEPVLPSIIHENQTGYVKGRYIGTNIRRMWDIIQ FT ETERTQTPGIALFVDFRKAFDTLEWNYLEKALMTFEFGPIFRRWVSVLYKN FT SCSCVINNGFTSDWFTLKRGVRQGDPLSGLLFILGLELLACKLRSDTSIVG FT ITINNQITNNLMYADDTTIFANSVTSARRALATIETFGQFSGLQASKKKTE FT AMWIGGYKERKDKPLDVHWPDKPIKCLGVHFSYDKAKCEELDFNDKLTKMD FT NILKVWSGRNLTLIGRILIYKCLAVSKLIYNCSVLTTPPNFERKVNAIGLN FT FVWNHKPHKVKFSTLIAEKDKGGLKMMDFASMVKALRASWVGRICTTQALE FT SHFLQYGGAFLFKCNFSIKTLKLGNMPTFYIKILEAWEELNSHRPQTSMEI FT KQEFLWNNQYILQERKSVFWKEMYLCGIRTIQNIIDDGGNILTPKAITREF FT GFEFNASDILRYYGLCSSIPPGWKKVLTDPINSKPDVRSTLSYKCKQLYQL FT FVNKIVQPPSKQDEITRCLQDEEDIKKVYMLPYICSQETKMQYFQYRIIHS FT FLTTNAYLKKIRIKDSDQCPYCEEKQTIVHLFAECHTVKTFWKDFTDWWKK FT ENQHPVGLTNIEIIYGHLSHIENNKTLNRCILQAKYYLFCTAISKAYPTFH FT AVKHRILEYA" XX SQ Sequence 5276 BP; 1862 A; 1044 C; 1067 G; 1303 T; 0 other; gggatgacgt catgaccaag atggcagccg cttttgttta ggtgtccatc ttcataaatc 60 ttcttccaaa cttactcaat aagaaccttc cattaaacgt acatcgagca tgtatcgtga 120 agtggagatc ctgtaggact gctcagtatc gaaaaattgg gaaatttact cttcctcgca 180 tcaaattttg aagaagcgtc gtttccaggg tgactccata ccgacacaac aaaggacgaa 240 gccattacca cccagcaaac aacgaagccg attcaccgca gcgggctccc taagaggagc 300 cctcgccggt aggtctccct gatatcccac agattctgtt attacaggtt ctgctgtttc 360 cctgtaccat tttcatcttc cagttggttg ccgcgctttg ttgttgttgt ttcagtgcac 420 acccccctcc ccctgacaag ccaagccatg tctcggaaga aagctattac ccaagacgac 480 tcggagcttg tctcaatgtc cacactaaaa gatcttcttg acaatcaaaa ggaagtcttc 540 aacaacatga tcgaactgca gcagaagaac ttcagatcat ttatggaatc aacaatggaa 600 tgccacaaca aaagaattga caaccttata cgcgaagtcc aagaaatcaa ggtcagtctc 660 cagtactcac aaaaagatat tgaagacatt cagactaatt ccaaacaaca agaacagaga 720 atcaaagaga ctgaagcaaa gatattgcaa ctaaagaccg acataacctc gaatcacaac 780 aaaacagact acctagaaaa tcagacgagg cgtaacaaca ttctgataga tggcataacc 840 gacgtcaagg aagaatcatg ggaagaatgt gagcaaaagg taagaagcct actgaaagac 900 aaattgcaga tagatccaaa gaagatcgaa atcgagcggg ctcacagaaa tggacgcttc 960 cagccagacg cacgtccacg atctattgtg gccaagatgc tgaggttcaa ggacaaggag 1020 accattctac aacgtgccag gtaccttaag ggcagccaca ttttcattaa cgaagacttt 1080 tctgaggctg tcagacagaa aagaaaggag ctcatcccag aaatgaaggc cgccagggag 1140 aggggcaacg tcgcgtacct ccgctacgat cgtctcatcg tgcatccccc gaggccgaca 1200 accagaagta ctacccagag gcactccaac aacgaaggaa cggataccgg acgtcgcaga 1260 caagacctaa gatccaccag ggcaactttt ggtgctgtgt aattgtttta aaccatggat 1320 tctgatacta ctactcagtg ctatttaacg tcagagctta cctgtttaag tcttaacgta 1380 agggggttaa acgatcaagt taaaagaaga acagtttttc ggtggttgca ccagcagagg 1440 gtggatgtag tgttcctaca ggagacattt tcaaacgcca aaatagaacg cgtatggagg 1500 gcagagtggg ggggaacaat tctctatagt catggcacaa accacagcaa aggtgtcgcc 1560 atcctcgtaa atcccagaaa atcttgtgaa ataattaact gtaaaaacga tacaaatgga 1620 agattattga tagttaatgc agtcattgct ggtgaacaag tatgttttgt taatctttat 1680 gccccaaatg aaattacaga acagtatatg ttttttcaag acataaggac taagcttaga 1740 gaaaactcca cctgccatac aattatggca ggagatttca actgttctct cacagacatg 1800 gacaaacgtg gcgggaggcc tgtaaaggaa aagctaagaa caatacaagc cattaattta 1860 ctaactaaag ggcaagacct gattgatatt tggcgggaaa aaaatcccaa agagagaagg 1920 tacacctggc gggatgcgaa tcaaggcata atgtgcagat tagactacat tttcgtgtct 1980 gatagtttaa aagcccggtg tggcagcgcg aatattatcc atgcaccgtc acccgaccac 2040 tcggcagtta cggtacactt tgaacctcaa aatgcatgta agagaggccc aggattctgg 2100 agatttaata atacacttct taatgataag acatatactg atacgatgaa actcaaaatt 2160 gatgacgccc gtgcgaaata ttccgaaata gaggataaaa gacttctatg ggacttgata 2220 aaaatggaga taagatcttt tactatgtat tattctaaaa tgaaagcaaa agagagaaaa 2280 aaagatgagt tatcagcaga aaagcagaca aactcctaca ttcaacaata tgagtcaaac 2340 ccaagccagg aaaattggtg taagtacaac cgcgctaagg aaaaactaga tgagattaga 2400 catttaaaag tgaaaggatc aatcatcaga agccgatgcc gttggcacga ggagggtgaa 2460 aaaagcacca aatacttcct aaacctagaa aaacgtaatt acgaacaaaa gtgtattaat 2520 gagctactga cagaaaataa ctcagttatc acggatccag gtaagatttt agagcaaggg 2580 aaacaatttt tcgaatcact gtatacatca aaagaaacaa atcctcatga cccaaagttt 2640 gaagaattct tcacaaacaa cgaaataaat aaactaactg gagagcaacg agataattgt 2700 gaagggaata taacggttaa agaagcactt acagctctaa aacagatggc aaagaagaaa 2760 aatagatcac cagggacaga tggtctgtcg tttgagtttc ttttatactt ttgggacaag 2820 attaaaaatg aaatcgtcga ctcctttaac actgcacttg aagaggggga attagcaata 2880 tcacagaaac gagggatcat ccgcctcctc ccaaaacaag gtgattgtag aaaactcgag 2940 aactggagac ctattgcgct cttaaataca gactacaaaa tcctgacgaa atgtttagct 3000 ttacgcctag aacccgtgtt accatctatt atacatgaga accaaacagg atatgtaaaa 3060 ggaagataca tcggcactaa cataagacgc atgtgggata taattcaaga gactgaaaga 3120 acacaaaccc ccggtatagc cttattcgtt gattttagaa aggcatttga tactttagaa 3180 tggaattatt tggaaaaggc actcatgact tttgaatttg gaccaatctt cagacgatgg 3240 gtatcggttc tgtacaaaaa ctcctgtagt tgtgtaatta ataatggttt tacctcagac 3300 tggttcacgt tgaaacgcgg agtaaggcag ggggacccac tctcgggtct cctgttcata 3360 ctaggcctag agttactagc atgtaaatta cgctcagaca caagcatagt aggaataact 3420 ataaataatc agataacaaa taaccttatg tatgccgatg acacgacaat tttcgcaaac 3480 agtgtgacct cggcaagacg tgcgttggct acgatagaaa cattcggcca attctccgga 3540 ctacaggcca gtaaaaagaa aacggaggcc atgtggatcg gaggatacaa agaacgaaaa 3600 gacaaaccac ttgatgttca ctggccagac aaaccaatta agtgtctagg tgtacacttt 3660 tcctacgaca aggcaaaatg cgaagaactc gattttaatg acaaactaac gaaaatggat 3720 aacatactta aagtttggtc aggaagaaac ttaactctaa ttgggagaat actaatatat 3780 aaatgtcttg ctgtttcaaa attaatttac aactgctctg ttcttacaac accaccaaat 3840 ttcgaacgaa aagttaatgc tataggatta aattttgtat ggaaccacaa gccgcataag 3900 gttaaatttt caactttgat tgcagagaaa gataaaggag ggttaaaaat gatggatttt 3960 gctagcatgg ttaaggcatt aagagccagc tgggtaggga gaatttgtac gacacaagcg 4020 ttagagtctc acttcctcca gtatggagga gctttccttt ttaaatgcaa cttttcaatt 4080 aaaactctga aacttggaaa tatgcctacc ttttacataa agatactcga agcctgggaa 4140 gaattgaact cacatcgccc tcaaacttca atggaaatta aacaagagtt tttatggaat 4200 aatcaataca ttttacaaga aagaaaatct gttttttgga aggaaatgta tctatgtggc 4260 atacggacga tacaaaacat tatagatgac gggggaaata ttctaactcc aaaagctatt 4320 acgagggaat tcggattcga attcaatgcg tccgacatcc taaggtatta tggtctttgc 4380 tcctcaattc cccccgggtg gaaaaaggtg ttaacagacc ccattaatag taaacctgat 4440 gttagatcga ctcttagcta caaatgcaaa caattatatc agctcttcgt aaataaaata 4500 gtgcaaccac cttccaagca agatgaaatt actcgatgtc tacaggatga agaggacata 4560 aaaaaagtct atatgttacc ctatatatgc tcacaagaaa ccaaaatgca gtactttcag 4620 tacagaatta tacacagttt cttgaccacc aatgcatact taaaaaagat aagaattaaa 4680 gactctgacc aatgcccata ttgcgaagaa aagcagacaa ttgttcatct attcgcggaa 4740 tgccacaccg tcaaaacatt ctggaaagat ttcactgact ggtggaaaaa agaaaaccag 4800 catcccgtag gtctcaccaa cattgaaatt atttatggtc acttgtccca cattgaaaac 4860 aataaaactt tgaacaggtg tatcttacag gccaaatatt accttttctg caccgccatt 4920 agcaaggcct accccacgtt ccatgccgta aaacatagaa tactagagta tgcctaaggt 4980 tgtgttaatg gtttcaaagt gatgtaagtg tctaggaatt tgagagccaa ttgtaaagtt 5040 tcataattgt gtatctgttg tagtcgaaga tgaagtcatg tagctgtaac ggttacatgt 5100 aattgtaaat gtaacaacgt ccaggatttc gagaaccaat tgttaagttt tatagttata 5160 tatctgccgt agtcggagat gtagttacca agctatacat gtaaatgtca caatgtgtgt 5220 aattgttatg attttccaat aaaggttaat ttttaaaaaa aaaaatataa aaaaaa 5276 // ID Gypsy-28_OD-LTR repbase; DNA; INV; 142 BP. XX AC CABV01003315; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Oikopleura dioica genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_OD_; KW Gypsy-28_OD-I; Gypsy-28_OD-LTR. XX OS Oikopleura dioica OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; OC Oikopleuridae; Oikopleura. XX RN [1] RP 1-142 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Oikopleura dioica genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; CABV01003315; Positions 9660 9801. XX SQ Sequence 142 BP; 44 A; 33 C; 26 G; 39 T; 0 other; tgtaagatga tcggacaagc cctgcttatc acccgccctg ctcgccttga aatctcgatt 60 tgagacgagc agtcttgact gtaacagaaa tagaaataaa cacattttta atatattgag 120 agactcgcat cttgactcta ca 142 // ID hAT-41_SM repbase; DNA; INV; 2682 BP. XX AC . XX DT 14-AUG-2009 (Rel. 14.08, Created) DT 14-AUG-2009 (Rel. 14.08, Last updated, Version 2) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-41_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-2682 RA Jurka J.; RT "haT-type repetitive family from freshwater planarian Schmidtea RT mediterranea."; RL Repbase Reports 9(8), 1844-1844 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 448..2307 FT /product="hAT-41_SM_1p" FT /translation="MSSRKRKVTEENRLFNSSWELEFLFTMPGEKPVCLLC FT RNSISVLKKANLQRHYTSTRSNFEKNYPLGTKVREEKVVSLKKNLLGQQHV FT FRRLLTASDCVTEASLKVSWVLAKKQKPYSDGDTVKQCFQECADSLFAEFK FT NKDEIKKQISDLQLSHQTVARRIEMLSSDIYEQLCSDLREASGFSLALDES FT CDTTDTEQLIIWIRFDVKDNFKEELLALLPLKNTTRGEDIYKALKECLVKN FT GIDIKKLISVTTDGAPAMVGRKIGLIGLLVADKDFPNFQAYHCIIHQQALC FT SKLKDDELKKVMDVVVKIVNFIRANPLNHRQFKILLEEYDSNYNDLVLHTD FT VRWLSRGKVLARFWDLLQEVKIFLGGKRNEELLMHLDNPVFISRLAFLIDI FT TDHLNKLNLKLQGRYQILPSLIKEISVFESKLDLFMSQLQQSNYTHFPALS FT KVAQEYQDAIKSEQFDQSLEIVKLDFSTRFKDIRNLTPLLLFIDNPFTCNI FT SVVSTKMQLLGGDEGEVQLEIIELKHSVALQDKHKETTPTEFWMMYVPSDQ FT FPNIYNCCRKVLTMFGSTYVCEAGFSAMTNIKSKKRNSLTDEHLENLMRAA FT VTQYQPQIKKIANIIQSQISH*" XX SQ Sequence 2682 BP; 885 A; 460 C; 511 G; 825 T; 1 other; accagtgccg tccaacctgc ggcccgctgc acgatcttaa gtggcccgcc agagtgcttg 60 ctctttcgtt caaaacacgt ttaattttcc attctcctgt acaaaacgtt agctgcaatt 120 ttaagcaaac taatccacct tatatcaaac ctcactgcat tgtttcgact atacttaatt 180 caacattttt aatatattaa aattaaactg actacttaac tactaggagg cctgttttat 240 aaatttataa attgctgagt gtaattagct aacttattta cttacttacc agttattgtc 300 aaaccggaag tgcatgtcac aatgtacaat aataatattt atattaccat tatttttaca 360 aacagaaaaa gactactnac tttgaagtat aaaatataga ttattaaatt aacattatta 420 attttattat tttttaagtt ttcagatatg agcagcagaa agcgaaaagt tacggaagaa 480 aatcgtttgt tcaattcatc ttgggaactt gaatttttgt ttacaatgcc tggtgagaag 540 ccggtttgtt tgttgtgtcg aaattccatc tcagttttga agaaggccaa tctacagaga 600 cactacacgt ccacgcgtag taattttgaa aagaattatc cgcttggtac aaaggtgaga 660 gaagaaaaag ttgtttcttt aaaaaagaat ttacttggcc agcagcatgt tttcagacga 720 cttctgaccg caagtgactg tgtaacggag gcatccttga aggtcagctg ggttttggct 780 aaaaagcaaa aaccatattc agatggtgac acagttaagc aatgttttca agaatgtgcc 840 gattctttgt ttgctgagtt taaaaacaaa gatgagatca agaaacagat ttcagatctt 900 cagttatctc atcagactgt tgccagacga attgagatgt tatccagtga catatatgag 960 cagctgtgca gcgatttgag ggaagcatct ggattcagtt tagcgttgga tgaatcttgc 1020 gatacaactg acacggaaca gctcataatt tggattcgat ttgacgtgaa ggataacttt 1080 aaagaagaac tgttggcact gttgccactg aaaaatacaa ccagaggtga agatatatac 1140 aaagcactga aggaatgttt agtaaaaaat ggaattgaca ttaagaagct tatatcagtt 1200 accactgatg gtgccccagc aatggtagga aggaaaattg gcttgattgg cttgctggtg 1260 gcagacaagg atttcccaaa ctttcaagct tatcactgca tcattcatca gcaagcactt 1320 tgttcgaaat tgaaagatga tgaactaaaa aaagtaatgg atgttgttgt taagattgta 1380 aattttattc gggccaatcc tttgaatcac aggcagttca agatactgtt agaagagtat 1440 gacagcaatt ataacgattt ggtcctacat actgatgtgc gttggctgag cagagggaaa 1500 gtcctggcaa gattttggga cttattgcaa gaagtcaaaa ttttccttgg tgggaaacgc 1560 aacgaagagt tattaatgca tcttgataat ccagtgttta tttcaagact agcattcttg 1620 attgacatca cagatcacct taacaagctg aatttgaaat tgcaagggag atatcaaatt 1680 ttgccttctc tgataaagga aattagtgtg tttgaatcaa agcttgactt atttatgagt 1740 caattgcagc aatcaaatta tacacatttt ccagcacttt ccaaggtagc acaagaatac 1800 caagatgcaa ttaaaagtga gcagttcgac caatcactgg aaatagtgaa attagatttt 1860 tccactcgtt tcaaggacat tcgcaatttg actcctctgt tgctttttat tgacaatcca 1920 ttcacgtgta acatatcagt tgtatcaaca aagatgcagt tattaggagg agatgaaggt 1980 gaagttcagc tggaaattat tgaattgaag catagtgtcg ctcttcagga taaacacaag 2040 gaaacgaccc caactgagtt ttggatgatg tatgttcctt cagaccagtt tccaaacata 2100 tataactgtt gtaggaaagt attgaccatg tttggaagca cctatgtatg tgaagctgga 2160 ttttcggcaa tgaccaacat caaatcaaaa aagagaaact cgttgactga cgaacattta 2220 gaaaacctga tgagagcagc tgttactcaa tatcagccac aaattaaaaa aattgcaaat 2280 attattcaaa gtcaaatttc tcattaaaaa caataaattt ttaaaagttg taaacacatt 2340 ctggttttta tttaaaattt aataaagtta catagcagaa ttttagatta aattttttac 2400 catacttcaa catctctaaa atgcattgat caatgatgac agttctctag atctttatta 2460 taccacaaag taattcttta tattattaca aggggacaaa aatggacttc tatttgttgc 2520 caaatgctcc aaactgcatt agggaggtta atgatcgcta gttgctccgt gcatcacatg 2580 gccggacgct tgcggtgcca atcctcaacc catttgcggc ccttggcatc ttgaaaatta 2640 tttgagtggc ccattgtgtc aaaaaggttg gacagcactg gt 2682 // ID Copia1-NVi_I repbase; DNA; INV; 5397 BP. XX AC AAZX01001609; XX DT 05-NOV-2007 (Rel. 12.11, Created) DT 05-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE LTR retrotransposon from Nasonia parasitic wasp: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1-NVi; KW Copia1-NVi_I; Copia1-NVi_LTR; internal portion. XX OS Nasonia vitripennis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-5397 RA Kohany O. and Jurka J.; RT "LTR retrotransposons from Nasonia parasitic wasp."; RL Repbase Reports 7(11), 1106-1106 (2007). XX DR Genome; AAZX01001609; Positions 11872 17268. XX CC 'AAAAT' target site duplication CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 1842..3854 FT /product="Copia1-NVi_I_1p" FT /translation="MEEQNKMFLKGIELLVKQLVAPARQDLDITPATDSAA FT KPIEAENATNSKPGNSSSLRVKEKPRITSNVFLKSSPFIREAASTPIPKQK FT VGPERVTELHEMNINMSSFITEKELGQSKLPPSPNDEILENPSDSESQKSN FT MVFTEKRDLLGRKTTALSFFNGRDTCSSASNNDKVTNNDNDTNRLSNETFT FT AALINELVESKVKLMINENEKENSLLNLSEQLKIEFSNRDNQSTRNYKLTS FT NMKFEIFEDYLKSELRIKKLIYILDNESLHDYDEQKLTDDKFKVQDIIINR FT IDLIYYNKIINLTEPIEILKKIKDVKRYETRTTSISARNDLNSIKYQPSRE FT TTSDFYDKFQEKVRVFENIPGAGKLMEKEKSDYFIQAVSEAVPGIITADSV FT YTETTAAEMSCDQLMNHLLKVEAARSGKEPKIPVKTFYTQAKKTGQGRCIC FT HGCGMEGHVQSECPHPVGLPQSNWDLAVKTAVYIYNRTTHKSIGMKTPLSE FT ILPSQSDYLKQIKRFGCAAYIKILRNTETKFSPQAMLGFLVGYFNTGYTVL FT VPKENKLYDSKHVRFVESLTYKDFPSENSETTDDLELSFSMAQESMDSIKE FT SSENQVVNEPLQPKRKRGRPKKSIKVMFFLCETPEEKDLEYDRELSDLKYH FT ALLAKYQSRTLCDKRQAGLYSS" XX SQ Sequence 5397 BP; 1979 A; 919 C; 1114 G; 1385 T; 0 other; ggttctgggc cagtgaggtt ggacgcttcc actgcacaga gtgctcgggt tccgcggatt 60 caaattaacc gtgtgagcgc atcggcgcgt acgcctttgt gcgcttatcc gtgtcgtgtt 120 ctcgtctcga acgatcgtga gtgtagtgag gtgacaagga cagacgaact caagggttgt 180 gtaaatcaca cccgcaccct tggtcatcct acacgagaga gaaaaacgac agagacaagg 240 agcttcagga aagagatttg cacgtgtacg gcacaacaaa gagggctgcc atgacacaac 300 agatatcaac taccggcact tctgcgaata agtagttatt atacatatat cacgagattc 360 agacaaagag ttcgcgctta aaattcacta ccatagaaag atgtcttgtt gcaaatagtt 420 aaaatagtgt taaaattaga taaagtcagt acatagatcg ggaagtgaaa taaagtaaaa 480 tgtaaaaaga atacagtgat agacgaaaaa tagccgtgac aaagaaagca gcgaaaatcc 540 gaaaaatttc gagtacaaga aaagtaacag aggctcttgc gcgttttgtc gattccgaga 600 atcgagaagt cgcgactggc taaacaaggg gagaggcgtt acgtcataag atggcgggcg 660 cctgcttcct cgtgtagagt caacaagtct ttcattgcct agcagcgaga gtgcggttga 720 gcgggacaag gggagcggcc atattgggac ttgttctctc tcgctcttgc gagtcgcggg 780 ctgagagagg gagactgcga cggccggtat tcggccgggc gagcggatga atttttggcc 840 tctctctttc tctcgctgcc gcgacgtggg ctgtaacgcg tatatgcaga tacgacgttg 900 ctgcactaca caaagagcat agagcggcgc tacagtagcg cgcatattta ccgatataga 960 gtcgaatgca taactttcat tcaaataatg agtaaataaa gtattaaata agacaccttg 1020 gttctaataa caatataata taacgaatat acacatatgt acataactga atttgagaat 1080 tgagaaattt tatataaaac ggatttatag aatttaaatt cagaaattct gggatttcaa 1140 aattcaggac aagaattttt tattcaagat aaataattaa ttcttgtaaa attaaaatcc 1200 tgtgttgcaa gattataact gcagttcaaa actttaaaat cgtattcagt gagggcccga 1260 ctcgaatacg taccagtttt agctctaaaa actgagcaaa gcttatgact tagtcaaatt 1320 aattcgtttg aaaattaatt tcagtcattt ttaattaaaa gccaacacag taacatagag 1380 ttaagtcaat agatagacat tcaagaacaa gagcattctt tagaagaatg taaatagata 1440 aagataaaag aaagaaagaa taagaaaata attataataa aatttacaaa gaaagtaaaa 1500 acgaaaaaat gatattgttt actaattata ttgttgaacc aacaggtgca gacagtgagg 1560 agtttaggaa aatcatccta agagagtggg aagcttcaga ggagtaagag tctccaggta 1620 atccatatac cgggaactcc attaccttgc taagaaaaac ggatggttct ctgtattttt 1680 ggaaaacaga acatcaaata agaagatttg acaactatct tcgaccaaaa gaaagggcgc 1740 tcgtcgacag agtgtggcca ctcagagaaa agtttctcaa atttaacctg gaatatgtac 1800 caacagcaga gtacgcaaca aagaagaaag tacttggact catggaggaa caaaataaga 1860 tgttcctgaa gggcatcgag cttcttgtga agcagctggt agccccagca agacaagacc 1920 ttgatataac accagcaacc gactcagctg caaagccaat cgaggctgag aatgctacga 1980 attctaagcc aggaaactca agcagcctca gagtcaaaga aaagcctaga ataacgtcaa 2040 acgtgtttct aaaatcatct ccttttatta gggaagctgc atctacgcca attccgaagc 2100 agaaagttgg gccggaacgg gtgaccgagc ttcatgaaat gaacataaac atgtcgtctt 2160 tcatcactga gaaagaatta ggacagagca agctgccacc ttctccaaat gacgagattc 2220 tagaaaatcc atctgattct gagagtcaaa aatccaacat ggtttttacg gagaaacgag 2280 atctactagg aagaaagaca actgccctaa gtttcttcaa tggacgtgac acctgttctt 2340 ctgcttcaaa taatgacaag gtaacaaata acgataatga tacgaataga ctttctaatg 2400 aaacttttac agctgcttta attaatgagt tagtagaaag taaagtcaaa ttaatgataa 2460 atgaaaatga aaaagaaaac agtttattaa atttatcaga acaattgaaa attgaatttt 2520 caaatagaga caatcaatcg acaagaaatt ataaactaac atcaaatatg aaatttgaaa 2580 tatttgagga ttatctcaaa tcggaactaa gaatcaaaaa attaatttat attcttgaca 2640 acgagtctct tcatgactat gacgaacaaa aattgactga tgataagttc aaggtgcaag 2700 acataataat aaatagaata gacctaattt attataataa aataataaac ttgacagaac 2760 ccatagagat actgaaaaag atcaaagacg ttaagcgata cgaaactaga acgactagta 2820 tatctgctag aaacgattta aattcaataa aatatcaacc aagcagagaa actacttcag 2880 atttttatga taagtttcaa gaaaaggtga gagtatttga aaatataccg ggagctggaa 2940 aactcatgga gaaagaaaag agtgattact ttattcaggc ggtttcagaa gccgtacctg 3000 gaattatcac agcagatagt gtttatacag aaactacagc ggccgaaatg tcgtgcgacc 3060 aactaatgaa tcacttgtta aaggttgaag cggcaaggag cggcaaagag ccaaagatcc 3120 ctgtaaagac cttctacaca caagccaaga aaactggtca aggacgctgc atctgccatg 3180 gctgtggcat ggaaggacat gtacagtcgg aatgtccaca tccagttgga ttgcctcaat 3240 ccaactggga tttagcagtc aaaacagcag tatacattta caatagaacc actcataagt 3300 ctataggaat gaaaactcca ctaagtgaga ttctgccatc ccaatcagat tatttaaagc 3360 agattaaaag gttcggttgc gcagcatata ttaagatact acgtaacaca gagacgaaat 3420 tttcacctca agcaatgtta ggttttctgg taggatactt taatacaggg tacactgtcc 3480 ttgtacctaa agaaaataaa ctctacgaca gtaaacatgt gagatttgtt gaaagtttaa 3540 catacaaaga ttttccgtct gagaattcag aaacaacaga tgacttagag ttatctttct 3600 caatggcaca agagtcaatg gactcaataa aagagtctag tgagaaccaa gttgtaaacg 3660 aacctttgca acccaaaagg aaacgaggta ggcctaagaa aagcatcaaa gtaatgtttt 3720 tcttatgtga gactcctgaa gaaaaagacc tggagtatga ccgagagctg agcgacctca 3780 aatatcatgc gttactagca aaatatcaga gcagaacttt atgcgataaa agacaagcag 3840 gtctatacag tagttgaaag atcaaaaata ccaatgggta ataaaaggcc taatatcata 3900 gattcgagat gggtttttaa gaaaaagaca gacgagaaag gatcagtaaa acacaaagtt 3960 agacttgtta ttcgaggctt taaggacaaa aatagttatg atttaaaaga aacgtatgct 4020 ccggtatcga gaatatcgtt gatcagagca ttcttttcga tttcaaacaa atacaagtat 4080 ataattagac aactagatgt cgaaactgct tttctctatg acgaactatc tgaggatata 4140 tatctggaaa tcccagaggg agttcaagta aacgaagata cgaaaaaaca atttgtttgg 4200 aaactcaaca aatctttgta cggtttaaag ataagtccta aaaaatggaa cgataaattt 4260 tccagtgtaa taaactcatt aggttttacg tctcacgata ttgatccttg cttgtttatc 4320 tataacaaaa actctgacat tgtgcttgca attctctacg tagacgatat tctcttagct 4380 gggcctaatg gcaatctact gaacaagttt agcaagagct taagtttaaa attcaaaatt 4440 aaagatttag gaagtcctaa agaatttttg aatatcaata tcagcagaga cttagacaaa 4500 caaattatta aattgaatca gtgataaatt taaatttaga aagactccta ttaaacctac 4560 tgatgcagcg agctgtgata ggaaagagag ggaggaagac gaatatacag aagagagaat 4620 accaaaccga ctgtacagag aagctgttgg atctctcttg tatctagcag gtacaacgag 4680 atctggaatt tcatatgctg ttaatgtgct tagcaggcac caagtcaatc ctacaaccaa 4740 cgagtggaag atggtaaaga gagtatttca gtatttatca gggacaaaga attatagctt 4800 aatctttctg aatgcttctg agaatatgac cgcttatgct gatgcgagtc tgtcagattg 4860 taagaacact ctgactacct gcgggtatgt gattcagctg ttcggtaatg ctgttgcctg 4920 gagaacgcac aagcaacaaa gtgttgcttt atcaacttgt caggctgaat atgtagctat 4980 gagtgaggct tgtcaagagg caatgtcctt acataactct gtatcaatta tgcttgaaaa 5040 gaatttgtat ccaataacac tgcgatgcga taatacatct gcgatatctt gtgctaaagt 5100 taacggtgga aatagactta gacatatggt agaaagacgc gaccactata tcaaagaatg 5160 tgtaaaccga ggtcatgttg tagtagaatg ggttaaatct aaagaacaac ttgaggatat 5220 ttttatgaaa gctctacaag atactttaca taaggactta acttataaaa taactaattt 5280 aaatcaaaac aatcaatgag ttctctttat gtttatttag ataattccgg agaggagaaa 5340 tcacaggaag aaaaggatga agagtcctgg gactaaagag tcaaaagtga gagagag 5397 // ID Ginger1-12_HM repbase; DNA; INV; 3044 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger1 DNA transposon from Hydra magnipapillata. XX KW Ginger1; DNA transposon; Transposable Element; Ginger; integrase; KW Ginger1-12_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3044 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC The 3'-end is not complete. Tpase contains one intron: 1408-1686. XX FH Key Location/Qualifiers FT CDS join(650..1407,1687..1765,1762..2652,2654..2986) FT /product="Ginger1-12_HM_1p" FT /translation="MEATLNKIQHEIEDKNNKLDEEIYMNLKHYILNKVYP FT PGYDKAQKRRLRQKATSFLIVKDFLYHKSKSVHSKLGRVIIDEDQQKNIIK FT SMHDGIIGGCHFGQSATIAKVTDRYWWPCIAADVKDFVRKCTACQAVNPVN FT KPGPSSLHPVKVMHMFHRWGIDLIGPLKETKKGKKSICVATEYLTKWAEVK FT AIREKNAISVHAFLMKLVFRFGASNVILHDQDKEFNNQLVDDLCEQMKITV FT AMTSAYHPQTNGLTEQFNQTLVRHLLKLINKENDDWDLLINPVVFAYRINI FT QASVKYSLFQLMYGVKARLPIDIEKDICEASVDENDDVLQRIITLGQSLQN FT TREQAKKNIEKAQAQQKVKYDLKHSAATYKVGQMVLKYNRRRDTRMGDKLS FT TRFTGPFKIIGELGKGVYRLQLGEKIIKQVVNASNLKLWFPPTLITSPTTV FT SLEDVHLKQAITSIEIIDLENSPPPIAWWIQELNLKQEDKEILKSNEWLND FT RIIDAVNTIVSLNLANEEPCQSTLLSQGIEGFKALSYDSIKILHDLNHWVC FT TACFDGNVYFADSLGGSISEYVRRQMRQLGDSYMRQLYAKFVNKSGNLAIK FT VVPVQHQTNAYDCGVFAAAFAFTWALGLEDFYYNEDDMRKHLASCLETRIP FT IAFPKKINKQTRKKISNYNNCHITFNTVFICNLFILKM" XX SQ Sequence 3044 BP; 1087 A; 470 C; 522 G; 965 T; 0 other; tgtattatga ttagaagctt actgatttat gaaaaatgaa aaaacgtgaa atgtctaaaa 60 gtgtatgcgt cataaatgta tttatattaa gatttaataa taatgctaac gctacatgag 120 gaactcttga aaagatgcct aacccacctt tttgaaattt cgcaggtgat cactttaaac 180 cacattaaaa gtgggttaaa gtggttacca tggaaattta agaaatgtgg gctacgcacc 240 ttttccactg agcccattat atgtagggtt ggatcggcat cagcctgccg atgcgtcgac 300 catttagcct atgcatcagc ctagataacc atcgattatt gccgatggtt ttaagaacag 360 tttttatttt ttgtctataa tgcaattatt aaaaatatat aattaatagt tattgctgac 420 catttaagga tctaggtaga taatgtcaaa tttttgcaag tcttgccatc cgacaaagaa 480 ccaatacaac gtagttttct cattaaaaat atttttgtct aatatttttg gcaattatta 540 aaaagattat tgtaaataat agttattact gagcaattaa tttaaaaata aattgaagaa 600 aaagtataga aatattaaat tgttatttag ttgtataaca gtctcaaaaa tggaagccac 660 attaaataaa attcaacacg aaattgaaga taaaaacaat aaactagacg aagaaattta 720 tatgaatttg aaacattata ttttaaataa agtgtatcca cctggttacg ataaagcaca 780 aaaaagacga ctgcggcaaa aagctacttc ttttttgatt gtaaaagatt ttctttatca 840 caagagtaag agtgtacata gcaaattagg aagagttatc attgatgaag accaacaaaa 900 aaatataata aagtctatgc atgatggaat tataggcgga tgtcacttcg gtcaatccgc 960 aacaatagct aaagtaaccg atcggtattg gtggccttgt atagcagcag acgtcaaaga 1020 ttttgtacgt aaatgtacgg cttgtcaagc tgtaaatcct gttaacaaac caggtccttc 1080 atcattgcat cctgttaaag tgatgcatat gttccatcgg tggggtatcg accttattgg 1140 tcctcttaaa gagacaaaaa aagggaaaaa gtctatttgt gtagctactg aatatcttac 1200 aaaatgggcg gaagtgaaag ctattcggga aaaaaatgct attagtgttc atgcattttt 1260 gatgaaactt gttttcagat ttggtgcttc taatgtaatt ttacacgacc aagataagga 1320 attcaacaac cagcttgtcg acgatctctg tgaacaaatg aaaataacag ttgctatgac 1380 atctgcttac catcctcaga ctaatgggta tatcttcttt tccataatta attatttaat 1440 tattttaaat aaatttactc aacgtaacct tttaacgacg tttcttcaat tataaatata 1500 tgagaggaaa aaagaggata tttctttttc aagatggctg aatattttta tatagttatt 1560 aaaacttgaa aaaaattatt tttaaaacgg taattttatg tccttataac atatacacac 1620 catatcatat tgaacaagtt tttttctata attataaaac aaatttttat cttattggat 1680 ttccaggttg acggagcaat ttaatcaaac cttggtaaga catttgctta agttaataaa 1740 caaagaaaat gacgattggg acttattaat ccagttgttt ttgcttatcg cattaacatt 1800 caagcatcgg taaagtattc gcttttccaa ctaatgtatg gagtaaaagc tcgtcttcca 1860 atcgatatag aaaaagacat atgcgaggct tctgttgatg aaaacgatga cgtacttcaa 1920 cgtataatta cccttggaca atccttacaa aatacacgag aacaggcaaa aaaaaatatt 1980 gaaaaagcgc aagcgcaaca aaaggttaag tacgatctaa agcattcagc tgcaacctat 2040 aaagtcgggc aaatggtact taaatataac cgtcgtcgtg acacacgaat gggtgacaag 2100 ctttctacgc ggtttacagg tccttttaaa atcataggcg agttaggaaa aggagtttac 2160 aggcttcaac ttggagaaaa aattattaaa caagttgtta atgctagcaa tttaaaactt 2220 tggtttccac ctacattaat tacttcaccg actactgttt cattggaaga tgttcattta 2280 aaacaagcga taacttcaat agaaattata gatttagaaa attcaccccc accaattgca 2340 tggtggatac aggaactgaa tctaaaacaa gaagataaag aaattttgaa gtctaatgaa 2400 tggctaaatg accgcattat cgacgctgtt aatactatcg ttagccttaa tttggcgaat 2460 gaagaaccat gtcagtcaac actactttcg caagggatag aagggtttaa agctttgagt 2520 tatgactcaa ttaaaatttt gcatgattta aatcactggg tatgcacagc atgttttgat 2580 ggaaacgttt attttgcaga cagtcttggt ggatcaatat cagagtatgt ccgacgccaa 2640 atgcgacagt tacggcgaca gttatatgcg acagttatat gcaaaatttg taaataaaag 2700 tggaaacctc gctataaaag ttgtacctgt tcagcatcaa actaacgcat atgactgtgg 2760 agtttttgct gctgcatttg cctttacgtg ggctttggga ttagaagatt tttattacaa 2820 tgaagacgat atgcgaaagc atttggcatc ctgtctagaa acgagaatac caattgcttt 2880 cccaaaaaaa atcaataaac aaacgaggaa gaaaatctca aattacaaca attgtcatat 2940 aactttcaat acggttttca tttgcaatct atttattttg aaaatgtaat atatttatga 3000 tgtatacatt tttcaatcat gctgtctttt tattttttca gaca 3044 // ID CR1-8_HM repbase; DNA; INV; 4075 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-type family: consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-8_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4075 RA Jurka J.; RT "CR1 families from Hydra magnipapillata."; RL Repbase Reports 8(12), 1836-1836 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 456..3494 FT /product="CR1-8_HM_1p" FT /translation="MEQNSIFQFSNENINDNDTETDCFDKLNFESKYYSVY FT QSAQYLKNNKNAFSILNINIRSLNKNFENLKILLDEIKHDFSIICLTETWC FT KCNKQSTQFDLINYTSIHQPRDSYGGGGVSIFVHNSIDFFPRNDLCVNETD FT CESLCIELYNKTTKNIIVNAIYRKPSGNLKKFKTHLNTFLTKVNKERKHIY FT VVGDINLNLLKQASNKIKHFIDKLLQHNVIPTINKPTRITKKSSTLLDNII FT TNNFYNNYISTGIIKTDLSDHFPTFLITNNITINNISSKFIIYRRQINKYS FT LQKFQHLLKYNVDWDLIMQSVEVNNAYELFLKFFCEQYEKAFPEKKIVINT FT KSYQAPWMSKGLLKSSKKKKKLYEKFLKKKTYKNETNYKLYKNIFEKTKQQ FT SKKIYYSKLLKKTNGNAKSTWSVIKEIIGRKNNGISIQPKKLLVDGKIIYD FT KSIVAENLNSFFLNIGPNLAEKIPSSSSSFDSYLKTYDKVMDESKLDINEL FT RYALDNLKNNTSVGFDNISINVVKSVFHIIETSLYHIFNLSLQSGTVPEKL FT KIARVLPIFKSGDDTDSSNYRPISVLPCFSKLLERIMYNRLNSYLTVNNML FT YNKQFGFKKNHSTDHALTELVTHVTNAFNDDRFTLGVFIDLSKAFDTVNHV FT ILLKKLEHYGIKNNNLLWFKDYLSNRQQYIQYENVKTTKSIIKCGVPQGSI FT LGPLLFSLYINDLSLTSNILNFILFADDSNLFYSHRDIKTLFKVVNEEVNK FT VNEWFICNKLSLNVNKTKFILFHKPTKTGNLPLKLPDLKINNINITRQTTV FT NFLGVILDENLTWRKHINTLERKLSINIAMMYKAKPFLDIRSLKSLYFSFI FT HSYLTYGNIVWASNSYAKLKKIYTKQKHACRIVFGVKRTVPCEPLLRKLNA FT LNVFKINIFLVIQFMYQIKHGLSPAIFYPYFREVSHKYPTKFSENNFVTQK FT YNLKLCSYSIQYRGPILWKNFQHIVNIKEKTLPQFKSKLRRYLVLTDFNIA FT DLLCHF*" XX SQ Sequence 4075 BP; 1614 A; 571 C; 499 G; 1388 T; 3 other; aaaattagaa aactgaagat aggaatagaa gaaacaatat tagaattgac ggtgtacgag 60 aaaaaaacat gggaaaactg aaaaaaaagt tcaacaaata tttaatgatc agttaaaaat 120 aagtaaatta ttattgaacg agcgcaccgt atcgaaagaa ataaagaata aaccgaacaa 180 tcatactcaa actgttaaat tataaagaca aagaaaaaat tttggaaaac gcaaagattg 240 aaaggtacag gtatttttat taatgaagat ttttcatttg aaacaacaaa ataagacgcg 300 aattaagtga aaaaatgaag attgagcgta acgctggaaa atatgctatt attgagtacg 360 ataagttaat agtgagagag tttaatctaa aagctaaata agtaatttta ttttttattt 420 atgtttcaaa attttatata aaaccatgta taaaaatgga gcaaaactct atatttcaat 480 tttctaatga aaatattaat gataatgaca ctgaaacaga ctgttttgac aaattgaatt 540 ttgaaagcaa atactactca gtttaccaat ctgcacaata tcttaaaaat aataaaaacg 600 cattttcaat tttaaatatt aacatccgaa gtctaaataa aaactttgaa aatttaaaaa 660 ttttattgga cgaaataaaa catgatttta gcattatttg tctaacagaa acgtggtgta 720 aatgcaataa acaaagtaca cagtttgatt taattaatta cacgtcaatt catcaaccac 780 gtgatagtta tggtggcgga ggcgttagta tttttgtcca caattcaatt gattttttcc 840 ctcgtaatga cctatgcgtt aatgaaacag attgtgagtc attgtgcatt gaactttata 900 ataaaaccac aaaaaatata attgttaatg ctatttatag aaagccatct ggtaacttaa 960 aaaaatttaa aactcactta aacacatttt taactaaagt caataaagaa cggaagcata 1020 tttacgttgt cggcgatatc aatttgaact tgttaaaaca agcttcaaac aaaattaagc 1080 attttatcga taaacttctt caacataatg ttattccaac aataaataaa ccaaccagaa 1140 taactaagaa atcttctact ttacttgata atataatcac taacaatttt tataacaact 1200 atatttccac aggtataatc aaaacagact tatcagatca ttttcctact tttttaataa 1260 ctaataatat aacaattaat aatatttctt ctaaatttat aatatacagg cggcagatca 1320 ataaatactc cttacaaaaa tttcaacacc ttttaaaata taacgttgat tgggatctta 1380 taatgcaatc ggtagaggta aataatgcct atgaactatt tcttaaattt ttttgtgagc 1440 aatatgaaaa agcttttccg gagaaaaaaa ttgttataaa taccaagtca tatcaagctc 1500 catggatgtc taaaggctta ttaaaatctt caaagaaaaa aaaaaagttg tatgaaaaat 1560 tcttaaaaaa aaaaacgtat aaaaatgaaa caaactacaa actttataaa aatatatttg 1620 aaaaaactaa acaacaatca aaaaaaattt actattcaaa gttgttaaaa aaaactaacg 1680 gaaatgccaa aagtacgtgg agcgtaatta aagaaattat tggcagaaaa aataatggaa 1740 taagtattca accaaagaaa ctcttagtag atggtaaaat tatatatgac aagtcaatag 1800 tagctgaaaa tctaaacagc ttctttctta atattggtcc aaacttagct gaaaaaattc 1860 cttcaagttc atcatcgttt gattcttatt taaaaacata tgataaggtc atggacgaat 1920 ctaaactaga tattaatgag ttacgatatg ctctagataa tcttaaaaac aacacaagtg 1980 taggttttga taatattagc ataaatgttg ttaaatcagt ttttcatatt atagaaactt 2040 ctttatatca catttttaat ctttctcttc aatctggaac agttccagaa aaattaaaaa 2100 ttgcacgggt cttgcctatt tttaagtctg gcgatgacac agattcttca aattatagac 2160 caatatcagt tttaccatgc ttttctaagt tactcgagcg tataatgtat aataggctaa 2220 acagctactt aacagtaaat aacatgttgt acaacaagca atttggcttt aaaaaaaatc 2280 attcaactga tcacgctttg accgagttag taactcatgt aaccaatgcc tttaatgatg 2340 accgttttac attaggagta tttattgatc tgtctaaagc ctttgacaca gtcaaccatg 2400 tcattctttt aaaaaaactt gaacattatg gaataaaaaa caataattta ctatggttta 2460 aagattactt atcaaacaga cagcaataca ttcaatatga aaatgttaaa acgacaaaaa 2520 gtataataaa atgtggagtt cctcaaggct caatcttagg accgttactg ttttctcttt 2580 atataaatga tctatctctt acctcgaaca tactaaactt tattttgttt gctgatgact 2640 ccaatctgtt ttattctcat agggatatta aaaccctctt taaagtagtt aatgaagaag 2700 taaacaaagt aaatgaatgg tttatttgta ataaactttc acttaacgtt aacaaaacta 2760 aatttattct atttcacaaa ccaaccaaaa caggaaacct tcctttaaaa ttacccgatc 2820 ttaaaatcaa taatataaat attactaggc aaacaactgt aaacttttta ggcgttattt 2880 tagatgaaaa tttaacatgg aggaaacata taaacactct tgaaagaaaa ctttcaataa 2940 acattgcaat gatgtataaa gcaaaaccat ttttggatat aagatctcta aaaagtttat 3000 atttttcctt catacatagc tatctaactt acggcaatat cgtatgggca agcaacagct 3060 acgctaagtt aaagaaaatt tatactaaac aaaaacatgc atgtagaatt gtatttggag 3120 taaaaagaac tgtgccttgt gagccacttt tacgtaaact caatgcatta aatgtgttta 3180 aaattaacat attcctcgtc atacagttca tgtatcaaat aaaacatgga ctgtctccag 3240 cgatattcta tccttatttt agagaagtaa gccataaata ccccaccaag ttctcagaaa 3300 ataactttgt aactcaaaag tataatctaa agttatgttc ttattcwatt cagtacaggg 3360 gtccaatttt gtggaaaaac tttcagcata ttgttaatat taaagaaaaa acacttccgc 3420 aattcaaatc taaattaaga cgttatttag tacttacgga ttttaatata gcagatcttc 3480 tatgtcactt ttaaagcttt aaaatttatt aaactaaatt agcaccataa atttaaaaat 3540 ctagaccatt taacaatact aagaatttca atgtctttat catactttca ttgaatatat 3600 atattctctt aaagttacct acaattttct tgatatattt tataatattg gaacagaatt 3660 tttatttgaa ttttgaatta ttttgaaaga aaataatata tacaaatata tatgtcgaat 3720 cacatccaat ttataaccaa aaacagaaaa tttttgtcat ttttattact atattttaat 3780 catttctttt tatttwtttt ttttaattat tttaatactt tttttttttt tttttttttt 3840 ttttyttctt tttaactttg tttcttaaca ttttaaattg tcttttttta aatacttttg 3900 cttatatatt acggattatt gcgtattacg gatatttatt tgcgttacta ttggggctcg 3960 gtgataaggc tattttatgc cttcttcttg ctccagccaa ttgtactttt tatattgttg 4020 tactttgtaa atatttttaa cggcgaaata aactaataaa ctaaaaaaaa aaaaa 4075 // ID L2-1a_Cis repbase; DNA; INV; 5657 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2010 (Rel. 15.1, Last updated, Version 2) XX DE CR1 Non-LTR Retrotransposon from Ciona savignyi. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; L2-1a_Cis. XX NM L2-1a_Cis. XX OS Ciona savignyi OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. XX RN [1] RP 1-5657 RA Smit A.F.; RT "L2-1a_Cis - CR1 Non-LTR Retrotransposon from Ciona savignyi."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC includes Ci20, Ci56, Ci120; several subs at 3' end; 94% id to CC L2-1_b. XX FH Key Location/Qualifiers FT CDS 112..792 FT /product="L2-1a_Cis_1p" FT /translation="MNTMESALMKKLEELDTALAHIKSKQDKYESNCSLRP FT DTAPPVPVSDSTHPKELQEIRKSLNDLEASLKDLKAVVRDTDERLDNLEQY FT GRRNCLIIHGCKRXPQXNFISYVLSILNRLKLPYTISKAAIDIAHVLPSKR FT DSTPIIVKFVQRMVRNDVYDSKKHLKGTGMSMTESLTLRRLRIVEKAREAF FT GFRSVWTNNGVIFTVHQNTRRVIHRLSDISSILARSK" FT CDS 1216..4203 FT /product="L2-1a_Cis_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MLDEEDELTSIRCPESYVNTNQCKSFLFEKHQGLTVL FT SLNIRSLANPNNFTKLEALVASLQFKPEVIAITETWIVXNRSGHYSNLPGY FT VFVSNSRXKSRGGGVGMYIRSHLEFQIKDCISIMEEKVFESLFVVFPNLTP FT HLKPNHKSSLICGVIYRSPKLDTQSNSAFIHILSETLEKIDNRNCNCIIVG FT DFNYDLLNINNNTVTDFTNTMQDFCYESIINKPTRFTDSGATAIDHIWTTI FT ESPLVKACILTDNISDHLPVILALRPNSDNNSKSSKSQSSSRSFSDANIVS FT FNNNLQHLNIDNILCESDANFAYSNLMEQYNTEFENSFPLVANKAPKYHLK FT WFNEEIKELNKTKQKLYKKYLLKRTVSSKLEYNKARNLYHHKIQAAKKCHY FT RKMFKTNRNNIKATWRSINELLGKSKPQISKSFEINGLSTSNPIHIANHFN FT DHFSNVATELVNRIPPSTSHHTDYLKSQSSSTMYIFPTSPQEIKTIILDLK FT SKSSSGLDQIPTKVVKSTPVNILQALSHIFNLSLQSGKFINDFKIAKVIPV FT FKKGKHSLINNYRPISLLPSFSKIIEKLMYSRTHSFLNHNNVLHNNQFGFR FT KDHSTSHASNLLVNIISDHLENKKSVIGVFLDLSKAFDTIDHDILLNKMSH FT YGIRGVALDWFTSYLSNRQQIVDFNGTLSSNPNPVKLGVPQGSILGPLLFL FT IYINDLPNCLTHSKAIMFADDTTIFTPGRNQSITCVNANADLDRLXTWLSS FT NKLVLNTDKTKYMXFSNSXKSTLSVPPLISINNYKIEQVNSFKFLGLTLND FT NLSWKPHMQILLKKLNTNLLMVRKIKPLVDQPSLLTLYHSLILSHXHYCIS FT TWCYGNNXMLTKLQRVCNKFIRLIFNLGKRENTVSIMRQHGLLTIYDMHKL FT EILSIMHKCHNNTLPPALHNCIPNKPKTTMKTRSNSQFLIPFCNKSLSQQS FT LKYIGPKFWQQLPNNIREIKSLKKFTKAVKQHLLEDPHPLP" XX SQ Sequence 5657 BP; 1793 A; 1211 C; 824 G; 1802 T; 27 other; attctatttg aataaggttg catgagcgaa taggattttt caaataactg ggcatanttg 60 ttaatcttta tcttctgata ctgaaattga actctctgac aagcaagaaa tatgaataca 120 atggaatctg cactgatgaa gaagttggag gaattggaca ccgcattagc tcatatcaaa 180 agtaaacagg ataagtatga gtccaattgt agtttaaggc cagatactgc tcctcctgtt 240 cctgtttcag attcgactca tccaaaagaa ctgcaggaga ttcggaagtc gctgaacgat 300 ctcgaagcct ctctaaaaga tctaaaggcn gtggtcagag acacagacga gcgacttgac 360 aacttggagc agtatggcag gaggaactgt ctcattatcc atggctgcaa aagaatnccc 420 caaganaatt tcataagtta tgtgctntcc attctnaacc gccttaaact gccatatact 480 atttctaagg cagcaatcga catcgcacac gtgctaccat cgaaaaggga ctccactcct 540 atcatcgtga agtttgtcca acggatggtg aggaacgatg tgtacgactc caagaaacac 600 ctgaagggga cggggatgtc gatgacagaa tcactgaccc tgcggcggct ccgaatcgtg 660 gagaaagcca gagaggcctt tggctttcga agcgtttgga ccaacaatgg ggtcatattc 720 actgtgcacc aaaacacaag gcgagttatt catagattaa gcgatatttc ttctattctt 780 gctaggtcta aataatttat taatttatca atttcagtgc catttatttt tgcactatgc 840 ctagttnaaa tcctagctcc ttaatatgca aaccgtatcc aaaaaaaaga aagtctgtct 900 naactgcagc aagacaatcc gagttaatca aaattttatt tactgtgaca agtgtaaatc 960 aaatttccat tgcaaatgct tgccgacaat accaaagcaa cagttaaata ttaaaaagaa 1020 tgtcacaact ttacaccata aatggttttg tgataactgc caagatccac cgtgccttcc 1080 attctcgaat atttccgatc tggaactctc ttctttactt ttcatcccct cctctcaaac 1140 caccccgacc aaatcaccca gtcgaccctc gacccagnaa aattaaataa cttttttgac 1200 aatttaaatg aaaatatgct agatgaagag gatgagttga cttccattag gtgcccagaa 1260 tcatatgtca acaccaacca atgtaaatcc tttttatttg aaaagcacca aggcttaaca 1320 gttttaagtc ttaatatcag atcactggcc aacccaaata actttaccaa actagaggca 1380 ttagttgcat cccttcagtt taaaccggaa gtnattgcca ttacagaaac ctggatagtt 1440 ganaatcgta gtggtcacta ctcnaattta cccggctatg tttttgtaag taatagtcga 1500 nctaaatcta gaggcggggg ggtaggaatg tacattcgtt ctcatttgga gtttcaaatt 1560 aaggactgta tttctattat ggaggaaaaa gtgtttgaat ccctatttgt agtttttccc 1620 aatttaacac cacacctgaa acccaatcat aaatcttcgt taatctgcgg tgttatctat 1680 aggtctccaa aattggatac acaatcaaac tcagccttta ttcacatact atcagaaaca 1740 cttgaaaaaa ttgacaaccg aaactgtaac tgtattatag ttggggactt taattatgac 1800 ctccttaata ttaacaataa cactgtcact gatttcacta atacaatgca ggacttctgc 1860 tatgaatcaa tcattaacaa accaacccgg tttacagact caggtgcaac tgcaattgat 1920 catatttgga ccactattga atctccatta gttaaagctt gcattttaac tgataacata 1980 tcagaccatt taccagttat attagcactg aggccnaatt cagataacaa tagcaaatct 2040 agtaaatctc aatctagctc aagatccttc agtgatgcaa atatagtctc ntttaacaac 2100 aatctncaac atcttaacat agacaatata ttatgtgaat cagatgcnaa ctttgcatat 2160 tccaacctaa tggagcaata caacacagag tttgaaaaca gctttccact agtagccaac 2220 aaagcaccca aataccactt aaagtggttc aatgaagaaa taaaggaatt aaataaaact 2280 aaacaaaaac tatacaaaaa gtatttacta aaaagaacag tctcatcaaa attggaatac 2340 aataaagctc gtaatttgta ccatcacaaa atccaagctg caaaaaaatg tcattataga 2400 aaaatgttca agactaacag aaataacata aaggctacat ggaggtccat aaatgaacta 2460 ctcgggaaat ctaaaccaca aatctcnaag tcatttgaaa ttaatggatt atctaccagc 2520 aacccaatac acattgctaa ccattttaat gatcattttt caaatgttgc aactgagttg 2580 gtaaaccgga ttccaccatc cacctcacac cacacagatt acctaaaatc ccaatcatct 2640 tccaccatgt acatattccc cactagtcca caagagatta aaacaataat actggactta 2700 aaatctaaat ccagcagcgg cttagatcag attccgacaa aagttgtaaa atccacaccg 2760 gtcaatatat tgcaagcact atcccatata ttcaatctat cattacaatc aggcaaattt 2820 ataaatgact tcaaaattgc aaaagtcatc ccagtgttta aaaaaggcaa gcactcactg 2880 ataaataact accgaccgat tagtctactg ccttcattct ctaaaataat agaaaaatta 2940 atgtacagca gaacccactc cttcttaaat cacaataatg ttcttcataa caaccaattt 3000 ggatttagga aagatcattc aactagccat gctagcaatc tcctggttaa tataatatct 3060 gatcatttgg aaaataagaa atctgttatt ggtgtttttt tagacctgtc aaaagccttc 3120 gacaccattg atcatgacat tttgcttaac aaaatgtctc attatggcat ccggggtgta 3180 gcacttgatt ggttcaccag ttatttatca aaccgacaac aaattgttga ttttaatgga 3240 actctatctt ccaacccgaa cccagttaaa ttaggagttc ctcaaggatc cattctaggt 3300 cctttactat tccttatata tatcaatgac ctgccaaatt gcttaaccca tagtaaagcc 3360 attatgtttg ccgatgacac caccattttt acccccggcc gcaaccaatc cataacctgt 3420 gtaaatgcca atgcggatct agacagatta tncacatggc tttccagcaa caaactggtt 3480 cttaacactg ataaaacaaa atacatgtnc ttttctaact ccnttaaatc cactctttca 3540 gtaccacccc tcatttcaat aaataactat aaaattgaac aagtcaacag ctttaaattc 3600 cttggtctaa ctttaaatga caacctttcc tggaaaccac acatgcaaat actccttaag 3660 aaacttaaca ctaatttact catggtacgt aaaataaaac cccttgttga ccaaccctca 3720 cttctcacat tataccactc actnatacta agtcacnttc actattgcat atccacctgg 3780 tgttatggta ataaccnaat gctaactaaa ttacagcgag tgtgcaataa gttcattcgc 3840 ctaatattta accttggtaa acgcgaaaat acagtttcta taatgagaca acatggactt 3900 ttaacaattt atgacatgca caaattagaa atattatcca taatgcacaa atgccataac 3960 aatacacttc cccctgccct ccacaattgc atacccaaca aacctaaaac tactatgaaa 4020 acaagaagca attcccaatt tttaattcct ttttgtaaca agtccctatc ccaacaatcc 4080 ctgaaataca taggcccaaa gttttggcaa caattaccca acaacattcg tgagattaaa 4140 tcgcttaaaa agtttaccaa agctgtcaaa caacacctct tggaagaccc tcatccgctt 4200 ccctagtttc actaatattt accccaaata aatttttttt tttttttatt tttttttttt 4260 tatatttttt ttttttttat aataaatccc ctttgtaagt ttaaatttca tttactcctt 4320 gcacaaataa attcctataa tctgctttac cttntctatc cctttgcata tattgacacc 4380 tgcatgcatt tcttttcact tgttttcaca cgttgatttc cctattaaaa tcacatgcat 4440 gaagtttcca gattttgttc tttttaaatt atgatcacca cgttaaatca cagtcttgac 4500 ttcctttgca tgttaacaat attgatttgt ttacacaaac acaagtgtta tttacggtga 4560 tgttttatta cagtattctc tgttacttgg taagccgntg gcacatttgg ttttgttttt 4620 ctgtgaaata gtttttgccg gccctaacag agctctgtat tttaaaactt atcactttag 4680 tctttttatt tgtttttggt ttttataata tttttcacca cccgattgca cactaccctg 4740 ttccctaacc tgtctttatt tctatctatt tttcgatttc tttactgtct tgtagtattt 4800 ttacaaccat actgccaaga cactattgtt ccagatatcc ggacaccgaa atttcatcta 4860 cgaggagaag gtcgtccggt tttcttgcgt ggcctcctcc tcctcaaatt tactattcga 4920 gagttggcgg tgtgatctcc agaagaagat gncaatcaat attaaactaa tatgtcacaa 4980 ccaatctgct ttaatataat cggatccttg ctgcagtctt tagtattttt cacttgctag 5040 caaccctttt ttggcttgct attatttatt tatttatttt ttcttattta cttatttatt 5100 tatttaaaca ttttgtttgt tttgtgttgt tttgcgcttt ttttgttatt tatggtgcac 5160 tagtgtagtg ctcctctttt cgcactccta cataagtgct ctaatgcatt agtgttatgc 5220 aatctgtgtg ttaaatgaga atatttaatg taaatatgcc ccggtaagtg tgcttgaaac 5280 taatagtgtc atctgacatt agtggaattt cattttaggg ggtttttctt tttccactct 5340 ggtccacgct ttcgggccac accttctcct aaagccggct cccccacagt actggtcgcg 5400 ctttaccgca cacttttctt cagggcttta tatcgtttgg tcttaattaa atttggatcg 5460 ctaaatattg ttctttcgcc taggtttggc ggccgcgtcc gctatattta ttacattata 5520 tccttttttt tttaatgacg tgttttaacg actgttcgtt ttgccgtctt ttcccgttcc 5580 tgtgtcgtga ttatattcgt ttgccatttt taaaaactgg ggcaaaaaat gaataaacta 5640 aactaaaact aaactaa 5657 // ID Gypsy-35_AA-LTR repbase; DNA; INV; 220 BP. XX AC supercont1.299; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_AA_; KW Gypsy-35_AA-I; Gypsy-35_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-220 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.299; Positions 82903 82684. XX SQ Sequence 220 BP; 77 A; 27 C; 64 G; 52 T; 0 other; tgtagggatt tactactact atgacaacag aaatgagatg aacctgaaag tagattacct 60 tgggaggtag ctgggtggga gaaggaaaga tgacggggat tagtgacggg catgagacgg 120 tgtgtagagg gagaaattga acaagtgaaa taaaatcgaa gtgtgacttg taaaccagaa 180 tttaaaataa tttattcagt gtccggagct gcccattaca 220 // ID BEL-595_AA-I repbase; DNA; INV; 5957 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-595_AA_; KW BEL-595_AA-LTR; Pao_Bel_Ele34; BEL-595_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-5957 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [4905-5486] - Integrase core CC LTRs are 93% similar to each other. XX FH Key Location/Qualifiers FT CDS 592..3903 FT /product="BEL-595_AA-I_1p" FT /translation="MADERTRQQLINRRTTLIAALGRAEQFVEKYVAERDR FT GQVKLRLENLDTVWMGLEDVQTQLEDTELTNQGMAQNLAFRSHNETRYFQI FT KAALQSYLPIVPPSTNAIPQPSPSGLSAIKLPTISLPEFDGDYNNWLAFHD FT TFVALIHSNPEVHDIQKFHYLRAALKGEAAQLVESIGISSANYPIAWQTLV FT SRYANEYLLKKRHLQALLDCPRMKKESAAALHSVVDEYERHTKTLRQLGEP FT VDSWSTMLEHLLCLRLDDSTLKAWEDFATTTTNPDYACLIDFLQRRIRVLE FT SMSVNHQAQPTPNQQSSMFRRPPFHKTVSHAVTEATPRKCHSCDQHHPLFQ FT CPRFEKMTVADRLKVVNDQHLCHNCFRQDHLARNCQSKFSCRHCKRRHHSL FT LHPGYHPLDNDQSSTTSRTPTYTPKPIAKQDAAANHNTRPPTSTNNAVSAP FT TTNSSQTSNANVLLSTVVLMVLDCNGQAHPARALLDSGSQSNIISERLCQL FT LRLKRNKINIPVFGVGESSSSVNHSVSATIKSRNSDFEIGLDFLVMPRITI FT DLPVVSSLANDWRAPKNLPLADPTFNKTGAIDMLLGAEHFFTYVNPGTRIE FT DGESPALIKTVFGWIVTGRNPAPLRSPPVACHVSLSDPLHEAMERFWLLEE FT VENRPNYSVEEQQCESHYASNVSRTPEGRYIVRLPRRPNFDHLLGESKSAA FT LRRFHCLERRLGREPHLMEEYHNFLSEYLSLGHMRLVPPSEKEPLQVHYLP FT HHAVVKEASTTTKVRVVFDGSAKTSTGYSLNDALQVGPIVQDELLTLVLRF FT RKYPIALVADIAKMYRQVILHPDDAPLQRILWRFHPNDPIATYELRTVTYG FT LAPSSFLATRTLQQLAADEGHAYPRGGPALKKGFYVDDYIGGAETEEEAIE FT TRRELDELLAKGGFQLRKWASNNAKVLEGLDPSQIGTQPTLKLDNAEPTKA FT LGVCWEPGTDQLRFDSAPVVGNGPPTKRSILSAVSKHFDPLGLTAPIVIRA FT KMLLQELWLQPCGWDDEVSDSIRTKWDSYRTDLPKICSYRADRYAFLPNST FT VQLHTFADASQQAYGACIYARSTDEKGSTNVQLIASKSKVAPLKRIS" FT CDS 4161..5609 FT /product="BEL-595_AA-I_2p" FT /translation="MSVEEFLSSGLWHHGPAWLTAPETEWPISTQQQKTPE FT EILERRRMVAAIQNKPEVNEIFDTSSCYNRLIRVTAYCLRFIHACRRQTTP FT SPYPESDVITNAITVEEMSAARMKLIKLAQADVFAEDMRYLGKGKVLPKHS FT SMRLLSPFVDELGILRVGGRLRLSDQPYITKHPILLPSSHPLTRLVAMHYH FT MRLLHGGGRVTLAAMRHEFWPIQGRRLVNSVVRNCFRCSRASPVPAKQQTG FT QLPLQRITPSRPFSVTGIDYAGPVYIRAIHKRASPTKAYISVFVCFVTKAV FT HLELVSDLSTPAFLNALRRFIARRGCPAHIHSDNGKNFEGARNELRELYRM FT LQQEESTNSVATHCANQGIQWHMTPPKAPHFGGLWEAAVKIAKKHLYRQLG FT NANLSFEDMSTVLTQIEGAMNSRPLTPLTEDPNDLAVLTPAHFLIGTTLDA FT LPDTEVSEVPINRLDHYRALQHRVQQFWHQWQAEYLQE" XX SQ Sequence 5957 BP; 1488 A; 1762 C; 1350 G; 1324 T; 33 other; tcttggtgcc gtgaccagga tctggtttcc tcccagaaaa tccgccatcg catcacgggc 60 ttcgacgcta tctaccgcca tcgttccgca ctattgtgtg ccccattggt attcwtcaag 120 atcgccatct tccccggagg ctattgtgct ccatcgccat ctagaagctt ctccggagtt 180 ccagcttgag ttttttaata caaggcctaa tacaattagg cactcggtac gtataattcc 240 gaagttgcgc agaagtgtac cgcgggtgcc ttgagatctc cgtatttttc cgattttcat 300 cctatccgtt ccggagtttt caccgaatat tctggaagcg tgcccagccg ttccgtgagc 360 acggctattg tgagagtgag cgacgccatt gccatccgtg ttccgaatac cacccgtaga 420 gctacaccgc cggataggag aaccgttgca ggtttgaatt taatacaagg cttaatccta 480 gcctttagaa ggttagtagc gctccaaact atctactcca tttgtccggc tctaccgggt 540 ctttctctgt gtcacccgat ctagtgtgtc cagcgataac cagtttccgt aatggcggac 600 gaacgtacga gacagcagct catcaaccga cgtaccacac tcatcgctgc gcttggaaga 660 gcggagcagt tcgtcgagaa gtacgtggcg gaacgggata gaggccaggt gaaactacgt 720 ctcgagaacc tcgataccgt gtggatggga ttggaggatg tgcagaccca gctcgaggac 780 acggaactca ccaatcaagg tatggcacag aacctagctt tccgttccca caatgaaacc 840 cgctactttc aaataaaggc tgctcttcaa tcctatctcc ccattgtacc tccttcgaca 900 aacgcaattc cccaaccgtc tccctctgga ttgtccgcta tcaagttgcc gactatttct 960 ttgccggagt tcgacggaga ctataacaac tggctagcct tccatgacac gttcgtcgct 1020 ctaatccatt ccaaccccga ggttcacgat atccaaaagt ttcattacct ccgtgctgct 1080 ttgaaaggag aagcagcaca gctcgtcgaa tcaattggta tcagctccgc caattatccc 1140 attgcttggc aaactcttgt gtctcgctac gctaatgagt atctactcaa aaaacgccac 1200 ttacaagccc tcctcgattg cccccgaatg aaaaaggaat ccgccgctgc attgcattct 1260 gtcgttgacg agtacgaacg ccataccaaa acccttcgtc aactaggtga acccgtagac 1320 tcgtggagta ctatgttgga gcatctccta tgcctccgac tcgacgatag taccctaaaa 1380 gcatgggaag atttcgctac gaccaccacg aaccccgact acgcttgtct gatcgacttt 1440 cttcaacgcc gcattcgtgt actggaatct atgtccgtga atcatcaagc gcagcctact 1500 cccaatcagc aatcatcaat gtttcggcgc cccccattcc ataaaaccgt ttcccatgct 1560 gtaactgaag ctactccacg aaaatgccac tcatgcgatc aacatcatcc acttttccaa 1620 tgcccacgat ttgaaaaaat gaccgttgcc gatcgtctaa aagtcgtcaa tgaccagcac 1680 ctctgccaca attgttttcg ccaagaccac ctcgctcgca attgtcaatc aaaattctcg 1740 tgtcgtcatt gtaaaaggcg acaccattca ttgctccatc caggctacca cccactcgat 1800 aatgatcaat catccaccac ctctcgcaca ccaacgtaca caccgaagcc tatcgccaaa 1860 caagatgcag cagcaaacca taacacacgt ccacccactt ccaccaataa tgctgtcagt 1920 gctccgacga ccaattcatc gcaaacatca aacgccaatg ttttgctttc gacagtagtg 1980 ctgatggttc tcgattgtaa tgggcaagcg catcccgctc gagctctgtt ggatagtgga 2040 tcgcaatcca atatcattag cgaacgacta tgccagcttc tccgactaaa gcgaaacaag 2100 ataaacatcc ctgtgttcgg tgtcggtgag tcctcctcca gcgtaaacca ctctgtcagt 2160 gccaccatca aatcgagaaa ctccgatttt gagattggtc tcgatttcct cgtcatgccc 2220 cgtatcacga tagatttgcc cgttgtttct tctctggcga acgactggcg tgccccgaag 2280 aatcttccgt tggccgaccc taccttcaat aagaccggtg caatcgatat gctgctcgga 2340 gcggagcact ttttcaccta cgtaaaccct ggtacccgaa tcgaagatgg cgaaagtccc 2400 gccctcataa agaccgtttt tggctggatt gtcaccggta ggaacccggc ccccctcaga 2460 tcgccccccg tagcatgcca cgtatcactt tccgatcccc tccacgaagc catggagcgc 2520 ttttggctcc tcgaggaagt mgagaatcgt ccgaactact cggtggagga acagcagtgc 2580 gaatcccact atgcttccaa tgtttcacgc accccagagg gaagatacat cgtccgtttg 2640 cctcgccgtc ccaatttcga ccatttgctc ggcgaatcga agtccgctgc ccttcgacgc 2700 ttccactgcc tggagagacg tctcggcagg gagccccacc tgatggaaga gtaccacaat 2760 ttcctttccg agtacctgtc actcggccac atgcggctcg tcccaccgag cgaaaaggaa 2820 ccactgcagg tgcactattt gccccaccat gccgtagtga aagaggccag caccaccaca 2880 aaggtacgtg tggtctttga tggatcggca aaaacgtcca ccggatattc cctcaacgat 2940 gccctccaag ttggtccaat tgtccaggat gagcttttga ccctagtact ccgattccgg 3000 aagtatccga ttgctctggt agcggacatc gcaaaaatgt accgccaggt gatactccac 3060 ccagacgatg ccccactgca gcggattctc tggcgtttcc acccgaacga ccccatcgcc 3120 acatatgagt tgcgaacggt tacctacgga ttggcccctt cgtccttcct tgccaccaga 3180 accttgcagc agctcgctgc ggacgaggga cacgcctacc cgcgaggagg ccctgcgctc 3240 aagaagggat tctacgttga cgactatatt gggggagcgg aaacggagga agaggcaatc 3300 gaaacacgaa gggagcttga cgagcttctg gccaaaggtg gattccaact aagaaagtgg 3360 gcatccaaca acgcaaaggt actggaagga ctcgacccat cgcaaatcgg aacacagcca 3420 acactgaaat tggacaacgc cgaacccacc aaggcactcg gggtgtgttg ggagcctgga 3480 actgaccaac tacgatttga ctcggcccca gttgtaggaa acggcccgcc aacgaagcgc 3540 tcaatactct ctgccgtttc gaagcacttc gatccgcttg gcctgaccgc tcctatcgtc 3600 atacgagcaa aaatgctgct gcaggagctc tggctgcagc cctgcgggtg ggacgatgag 3660 gtttccgact ccattcgaac caagtgggac agctatcgca ccgatctgcc gaagatctgc 3720 tcctaccgtg ctgaccgata tgcattcctg ccaaattcca ccgtccagct gcacacattc 3780 gcggatgcat cccagcaagc atacggagct tgcatctatg cccgttccac agacgagaaa 3840 ggctcaacta acgtccagct gatcgcatca aaatctaagg tggcccccct caagcgaatt 3900 tctmtccccc gattggagct gtccgctgcc gtcctcgctg ctcgcctgca ccaaagggtt 3960 tcccaagcmc tsgacatgmc atckccgctt ctcatttttg gtccgattck acmgtcacac 4020 tcgagtggct tcgatctcct ccctacacct ggaccacctt cgtcgccaac agggtctccg 4080 aaatccaaac aacaacacag ggatcctatt ggcatcatat cgcagggaaa cagaatccag 4140 cggacctaat cacaagaggg atgagcgtcg aagagttcct ttctagtggt ttgtggcacc 4200 atggaccagc ctggctaaca gcacctgaaa cggagtggcc aattagtacc caacagcaga 4260 aaacacctga agaaattcta gagcgacgta gaatggtggc ggctatccaa aacaaacccg 4320 aagtgaatga gatttttgac acttcttcgt gctataatag attgatacga gtgacagcat 4380 attgtctacg cttcatccat gcttgtcgtc gtcaaactac cccgtcaccc tatcccgagt 4440 cagatgtgat aacaaatgcg atcaccgtag aggaaatgtc agcagcgaga atgaaactta 4500 taaagctagc ccaagcagac gtttttgcag aggacatgag gtacttgggg aaggggaagg 4560 tattacccaa gcattcgagc atgcgccttt tgagcccatt cgtcgatgag ttggggatcc 4620 ttagagtcgg gggcagattg cgtctgtcag accagcccta catcaccaaa caccccatcc 4680 tccttccaag ttcccatcca ctaacacggt tagttgccat gcactaccac atgagactac 4740 tgcacggtgg cggccgtgta accctagcgg ccatgcgaca cgaattctgg cccatacagg 4800 gtagacgttt ggtcaacagc gtcgtgcgaa actgcttccg ctgttctcga gcctcgcccg 4860 ttcccgccaa gcaacaaact ggacaactac ctctgcagag aattacacca agccgaccgt 4920 tctccgtcac cggaattgat tacgccggcc cggtatacat cagggcgatc cacaagcgcg 4980 cttcaccgac caaggcctac atcagcgtct tcgtctgctt cgtcacgaaa gcggtacact 5040 tagagctggt aagcgacctg tccacaccag catttctcaa cgctctccga agattcatcg 5100 ctcgtcgtgg ttgccccgca cacatacact ccgacaacgg aaagaacttc gaaggagccc 5160 gcaatgaact gcgcgaattg taccgaatgc tacagcagga ggaatccacc aattcggtcg 5220 ccacacactg cgccaaccag ggcatccagt ggcacatgac tcctccgaaa gccccacatt 5280 tcggtgggtt gtgggaagct gccgtcaaga tcgcgaaaaa acacttgtat cggcagctgg 5340 gtaatgcaaa cctttccttt gaggacatgt cgactgttct aacgcagata gaaggtgcca 5400 tgaactctcg cccgttgaca cckctgacsg aagacccaaa tgacctggcc gtcctcacac 5460 cagctcattt tcttatcggt accacactgg acgccctacc cgacaccgaa gtcagcgaag 5520 ttccaatcaa ccgactggat cactatcgag ctctgcagca ccgagttcaa caattctggc 5580 accagtggca ggcsgaatat ttgcaagagm tgcaaaaaga aagcaaamkc atcgccccga 5640 acacwgmgat caagcccggw aggatggtcg tcgtcgtmga tgagttcctc gctcccgtsa 5700 agtggccgtt agcgagaatc gtcaacgcca tcsmwggacc agacgggctc atccgagtcg 5760 tsgacctscg aaccagcaga ggaatcatcc gccgcccaat cacsaagatt tgtttgctgc 5820 ccttcgaaga cacgaagaat gcctagaatt ataatgtcaa cagaattkkw tcaktagatt 5880 agscwtcaat ggattgtktg ttgttgtgat aggagagctt tcgggtagtg aaattcagga 5940 atttcaggtg gcggcga 5957 // ID Kiri-30_AAe repbase; DNA; INV; 4723 BP. XX AC . XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 16-FEB-2011 (Rel. 16.02, Last updated, Version 1) XX DE A Kiri non-LTR retrotransposon family from Aedes aegypti. XX KW Kiri; Non-LTR Retrotransposon; Transposable Element; Kiri-30_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4723 RA Kojima K.K. and Jurka J.; RT "Kiri non-LTR retrotransposons from the yellow fever mosquito."; RL Repbase Reports 11(2), 725-725 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 11 sequences with >94% CC identity. CC This family and related Kiri elements found in two mosquito CC species, Culex quinquefasciatus and Aedes aegypti, constitute a CC distinct lineage belonging to the CR1 group. The phylogenetic CC analysis with PhyML indicates that Kiri is a sister lineage of CC the L2 clade. Although several Kiri elements do not encode two CC proteins, probably due to 5' truncation, Kiri elements generally CC encode two proteins. Their ORF1 proteins commonly contain an CC L1-like RNA-binding domain. XX FH Key Location/Qualifiers FT CDS 249..1049 FT /product="Kiri-30_AAe_1p" FT /translation="MSNENNKPLTRSTSTSSSSLNTDGRLSKQNKSDDVET FT LSDLWNKIRRMFAESKSDIEGKIESCKTELEQKITGVEQQLSSLKMGCDVQ FT IKDVSEAVSDVRHDLELTKRNVNRLTTTNELIISGVPYTSDENLTLIFQNI FT SVALASTPSNVPNVYLKRLSKNPINIGSSPPILCQFALRGVRDEFYGRYLR FT QRSLNLRHIGFNNDNRVYINENLTMNDREIRTQAIKLKHQGRIQQVFTRNG FT VVYARMKNGDEAEPFYSLEQLFASVN" FT CDS 1762..4578 FT /product="Kiri-30_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MADNAVSNASATMLIPKAVFHAVLPEDKINVCHINVQ FT SICARNFTKFNELKNNFINSKMDIVCMTETWLSDVITDPMIGIDGFSLLRN FT DRNRHGGGICIYFKSNLTCRIVKMSTAIDQLSMTEFLLVEVRGGGEPFLLA FT VYYNPPEIDCSDSLKEHLEEFTVKYNHTFFIGDFNTDLMKSNSKSLKFKET FT LSALSYTCVNNEPTYFYSNGCSLLDLLITDSPDMIYNLNQVSMPFASKHDL FT IFASINIMKTRTESVSYRDYKNFDANRLNNAFHSMDWNSMISITDPDMFLE FT IFNSRLKFLHDECIPIRKSKPKVNPWFNQDIEKAIVDRDIAYSNWKRSRNE FT LDMAHFKRLRNRVNLLIRDSKRFYDRERFNVNLPAKQLWANIKKLGISKPR FT STDGNSSFSVDEINNYFTDNFTVDDTTSSANFSSIDGFNFRFVEDIEIINA FT VHEIKSNATGLDDISIVFIKIMLPLILPFVKHLFNIIISNSKFPRAWKVVK FT IIPIKKKKSKDDIHNLRPISILCALSKVWEKLLKIQISEFVTEMNFLHPLQ FT SGFRQHHGTNTALLKVHDDISSVIDRKGVAILLLIDFAKAFDRVSHKKLLM FT KLSDIFQFSTSANRLIHSYLTGRSQAVCCNNKLSSFKNIESGVPQGSVLGP FT LLFSLFINDLPSCLRYCSVHLFADDVQIYLCEKNRNNYNDIGFKINHDLRN FT LLLWSQNNLLPINPTKTKAMLICKNRNPITCPDLFMDGERIEFVSQVDNLG FT VTFTSNLNWDVHINAQCGKIYGSLKRLNLTTRHFSIDTKIKLFKSLILPHF FT LYGDFIFSNALASSIDKMRLALNACIRYVYKLPRFSRVSHLHKDLIGCNFS FT NFYKFRACVTLFKLIRSRKPEYLFTKLNPMRSDRNRNYLIPQHQSAYYSQS FT LFARGVVYWNQLPLSIKTNVSLFNFKRDLKQHLAE" XX SQ Sequence 4723 BP; 1488 A; 841 C; 831 G; 1562 T; 1 other; cattctgtag ttgcagttga attggcgatt aaccaccgtg tctgattgag tttgaaccag 60 attctaatgc ataaacgaat ctttagtgac agtgatccta taatcaattg aagtttttat 120 cagttgaaca atttctgtag tgatacaaca aacattaaat acttcttcaa gwccattcga 180 catattgaac cccttcagaa aaatttgttc gtttatcttg aatctgctga gcgtcgaaat 240 cagacacaat gtcaaacgaa aacaacaaac cccttactcg ttccacatct acctcatcct 300 catccttgaa tactgacggt agattgagca agcaaaacaa aagtgatgat gttgagactc 360 taagcgatct atggaataaa atacggagaa tgttcgctga gtctaaatcc gatatcgaag 420 gaaaaatcga gagctgtaaa acggagttgg agcagaaaat taccggagtg gagcagcagc 480 tatcaagtct caagatgggt tgtgatgttc agatcaagga tgtttctgag gctgtaagtg 540 atgttcggca tgacctggag ctaacaaaac ggaatgttaa tcgtctcaca acaaccaatg 600 agcttattat ctctggtgtt ccatatacct ccgatgagaa tctaacgctc attttccaaa 660 acatcagtgt tgccttggct tctactccat ccaacgtgcc taatgtttat ctgaagaggc 720 tctccaaaaa tccgattaat attggatcat caccaccaat actttgccag tttgcgttac 780 gtggtgtgcg agacgagttc tatggacgat accttcggca aagatcatta aatctgcgtc 840 acattggttt caacaacgac aaccgtgtgt atatcaacga aaacttgacc atgaatgaca 900 gagaaattcg aacgcaagcg atcaaactaa aacaccaggg tcggatacaa caagtcttca 960 caaggaatgg tgttgtctat gctcgaatga agaatggaga tgaagcggag ccgttttact 1020 cattggaaca actttttgca tctgtaaact gacctttcct aagtattact cttttccttc 1080 cattgaatcc ttgtattctt tcctgaatta tccgtgtttc cgagcctctc ctaaaagcta 1140 attaaagttt aaaaaaaacc tttccctgtt atagaccgtt tttccttcca agacattcca 1200 tgattccgtt cctaagaaat ccatgtattc ttcagtccta aaagttataa tcagccacat 1260 atttgtggtt ttgccctgat gaattcgaga ttgatcttgc cttcaacaat gtcgaatgat 1320 attgtttggg taagcctgct tactggatgc tttcggcgac tttgatgagt gccacatgtt 1380 gccgttgttg ctgctgctgc tgctgctgtt gaggctttcc atggtgattc tgatgctgct 1440 gatgttgatg ctgtttctga ctgcacctgt ttttataacc aatgatgtta ccgttactag 1500 aaagtttttc tccatcgttt aatactttac cagaattgtt acattattga ataccatcaa 1560 cttttacttt aatattgcac tccgttgttt tggtagctag tcaattcaat tgcaactaga 1620 aaaagcttaa gctcatgttc agaaagttag aatataatat tagtcgtaag ttggctttcc 1680 ttatctttca tgctgacttg ttcctgccaa ttctgtttaa aaggtttctt ttaggtttaa 1740 ctttattgtt tatatttgac tatggctgat aatgcagtta gcaacgcctc tgcgaccatg 1800 ctaattccaa aagcagtttt tcatgctgtt ttgcctgaag ataaaataaa cgtttgtcat 1860 attaatgttc aaagtatttg cgcacgtaat tttacaaaat tcaatgaatt gaaaaacaat 1920 tttatcaata gcaaaatgga tattgtttgt atgactgaaa cttggctcag cgatgttatt 1980 actgacccta tgattggtat tgacggtttc agcttgcttc gaaatgaccg taataggcat 2040 ggaggtggaa tttgtattta ttttaaatct aatcttactt gtagaattgt gaaaatgtct 2100 acagctattg atcaactgag tatgaccgag tttttactag ttgaagttcg tggtggcggt 2160 gaaccatttt tattggccgt atattacaat cccccagaga ttgactgttc ggatagttta 2220 aaagagcatt tagaagagtt tactgttaaa tacaatcaca ctttctttat tggtgacttc 2280 aatacggacc tcatgaagtc aaattcaaaa tcactgaaat ttaaagaaac tctatctgca 2340 ttgtcttata catgcgttaa taatgaacca acctactttt attcaaacgg ttgttcacta 2400 ctagacctcc tgattactga ttctcctgat atgatttata atttaaacca agtttctatg 2460 ccatttgcat caaaacatga tttgatattt gcctctataa acatcatgaa aacaagaact 2520 gaatcagtta gctatagaga ctacaaaaat tttgatgcta atcgattgaa taatgctttc 2580 catagtatgg attggaactc aatgataagt ataactgatc ctgatatgtt tctagaaatt 2640 tttaacagcc gtttgaagtt tttgcacgat gaatgtatac ctattagaaa aagtaagcca 2700 aaggttaacc catggttcaa ccaagacatt gaaaaggcga ttgttgatcg cgatattgca 2760 tattccaatt ggaaacgaag ccgtaatgaa ttggatatgg cacactttaa acgacttagg 2820 aatagagtga atttacttat aagagattct aagcgttttt atgatcgtga aagatttaat 2880 gttaacttac ctgcaaaaca attgtgggcg aacattaaga aattaggtat ttccaagcct 2940 cggtccacgg atggtaattc aagttttagt gttgatgaaa taaacaatta tttcactgat 3000 aactttaccg tcgatgatac tactagttct gctaacttca gcagcataga tgggttcaat 3060 ttccgtttcg ttgaagatat tgaaataata aatgctgtcc atgagataaa atccaatgct 3120 actggattag acgacatttc gatagtattc attaaaatta tgttacctct aatcttacca 3180 tttgttaaac atttgttcaa cattatcata tcaaattcca agttcccacg tgcttggaaa 3240 gttgttaaaa taataccgat taagaaaaag aagagtaaag atgatataca taatcttcgt 3300 ccaataagca ttctgtgtgc cctttcgaag gtttgggaga aacttcttaa aatccaaata 3360 tcagaattcg ttaccgaaat gaattttttg catcctctgc aatcgggatt tcgtcaacat 3420 catgggacaa acacagctct cttaaaagtt catgatgata tttcaagtgt aattgacaga 3480 aaaggagttg caattttact gttaatcgac tttgcgaaag cttttgatcg tgtatcccat 3540 aaaaagttgc taatgaagtt aagcgatatt tttcagtttt caaccagtgc caaccgttta 3600 attcactcat atttaactgg tcgttctcaa gccgtgtgct gcaacaataa attatcgagt 3660 tttaagaata ttgaatctgg tgttccgcaa gggtcagttt tgggtccctt attattttct 3720 ttattcatca atgacctgcc ttcgtgtttg agatactgtt cggttcacct ctttgctgat 3780 gatgttcaga tttatttgtg tgaaaaaaat agaaataact ataatgacat tggctttaaa 3840 atcaatcatg atcttcgaaa tcttcttttg tggtcgcaaa ataatttatt acctataaac 3900 ccaactaaaa caaaggcaat gttgatatgc aagaatagaa acccaattac atgtcctgat 3960 ttgttcatgg atggtgaaag aattgaattc gttagtcaag ttgataattt aggagtaact 4020 tttacgtcca atttgaattg ggacgtccat attaatgcac aatgcggtaa gatttatggc 4080 tccttaaaga gattaaattt gacaacacga cattttagta ttgacaccaa aataaaactt 4140 tttaaatctc taattttacc acattttctt tacggtgatt tcattttttc gaatgcgtta 4200 gcttcttcaa tcgataaaat gaggttagct ttgaatgctt gtataagata tgtgtataaa 4260 ttacctaggt tttccagagt gtctcatcta cacaaagatc taataggctg taacttttca 4320 aatttctaca agtttagagc ttgtgtcacc ttattcaaac taatacggtc tagaaagcct 4380 gaatatttgt ttactaaatt gaacccaatg cgaagcgata gaaatcgtaa ctatcttatc 4440 ccacagcatc aatctgctta ttacagccaa tcactgttcg cacgaggcgt tgtgtactgg 4500 aaccagttac cactcagtat taaaactaat gtttcattgt ttaactttaa aagggacctt 4560 aaacagcatt tggcagagta ggatgcaaaa agattaggaa atgcaaaaga acatgaaatc 4620 aatatatata tatatcatca taataaacct tgtatagcgg taaaaaaagg ctttcagcct 4680 tatgttatac gaatctacat aataaataaa taaataaata aat 4723 // ID Gypsy-155_AA-I repbase; DNA; INV; 6803 BP. XX AC AAGE02017419; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-155_AA_; KW Gypsy-155_AA-LTR; Gypsy-155_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-6803 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02017419; Positions 67599 60797. XX CC Positions [4384-4860] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 602..1660 FT /product="Gypsy-155_AA-I_3p" FT /translation="MPLWVSVIRDKITDEASQALVSRHTECEWDAIKKVLI FT EYFGDKRDLYDLVSQITYLQQNRKSITEFYNECQELLADIIDKLALDNTTK FT HCVHSLSEGYESMIKNSFIDGLNEPFSVLTRVTNPGTLLEAYHAALGIYNT FT EQRKKDKFSRQFLKVQTHQISNKPPLQGAAPNKLQSPNQRFDHNLGRPPYQ FT NFGYNQYRGYQYRGNFVNQQNNRNNNFVPQPNNRYNNLVPQQNNRFNPPQN FT RPPIKAEATSQQLRQNPQQNQNFYHANSLYHHETFHPQNFQHTNFPHHEPP FT YYQNEPPNGEHCENQSPEYESQYDQQNQEEYPNNIQPENTESAMESENLNF FT HFLLDIPQEE" FT CDS 1660..5163 FT /product="Gypsy-155_AA-I_1p" FT /translation="MTNNFLPYIIIKSPKFGVIKFLIDTGANKNYISPKLV FT ALNNVMTGAKHKITNISGTFFVNKFTYSNPFMDYDNTLPPQRFFLFDFHNY FT FHGLIGYESLRSLGAVIDTAKDVLKINSISIPLYKKYPETVNLNANETKAI FT FMPTSASDGDFLMGEEIELVPNVHILSGIYTSKNHQTEILIHNFSKDTKEI FT NLGDTLNVEINNFEITSPQKNNTDKENPLRKQLRMDHLNSEERSKLTKMLS FT KYQDVFHQEKQPLTFTNAIHHEINTKDDLPVYTKSYRYPFCHKEEVQRQII FT KMLEQGIIRHSNSPWSSPVWVVPKKLDASGQRKWRLVIDYRKLNQKTLDDR FT YPIPNITDILDKLGKCQYFSTIDLASGFHQIELAQKDIRKTAFSVDGGHYE FT FVRMPFGLKNAPATFQRVMDNILRKHLGVRCLVYMDDIIIFSTSLQEHLQN FT LKMILDTLREYNMKIQIDKCEFLQKEVAFLGHIVTPEGVKPNPEKIKVIQE FT WPLPNNEKELKGFLGMIGYYRKFIRDFAKIAKPLTQQLRKGETIQHTPAFV FT SAFKRCRNLLTSSHVLQYPDFTKPFVLTTDASNYALGAVLSQGPIGKDKPI FT AFASRTLTRAEENYSAIEKELLAIDWVCKYFRPYLFGRKFTLYTDHKPLTY FT ALDLKTSNDRLIKMKLRLEQFNYDIKYRPGKQNVVADSLSRIKPTINVNEQ FT ETLPSSSSDDSEPDENNDNDDNESNDDSDSGTVHSSSSDASNLIKMTEKPI FT NVFKNQILITEGPKNDESYEEIFPSIFRRTITRTTFGMVNAVDIFRDYMHP FT TRTNCILCPERWLKWVQLAYRNHFSRNKTFKIVLTQSILQDIPSDEEQDQI FT IEETHNRAHRGIEDNCRVISKKFFFPKMRNKVTSFVNLCTTCLENKYERNP FT YKIKFAYTPIPKKPLDILHVDIFISNPNIFLSAVDKLSRYAMLIPVKSRSI FT PDIKRGLTKLFTTFGTPKMIVCDNEVALKAIEIRGLLQRLDVETYNTPSDH FT SEVNGIVERFHSTLSEIFRCIRNKFPDLSNKEIYKIATNLYNTTIHSVTKL FT KPLEVFFGVKDGEERPLNLQRILENRNEIFDEVVQELEARQKKQIDARNKF FT REEDPNFEEHDKIYVKTTGIPNKKKAKFRKQIVKNNRRKTIIDQRNIRLHK FT SNLKRKRKL" FT CDS 5344..6804 FT /product="Gypsy-155_AA-I_2p" FT /translation="MRTTFKNYKNNERSHSSDKNYRLQNSNGIHKNYTPDQ FT FSIEEAIEQITNTFYNKLTPKNELHEVIKFRVKRLYSTLYGLKPQQTHRHR FT RWDSIGTAWKWIAGSPDAQDMYIINSTMNEIIDQNNYQYRINESINKRIAQ FT LTGTINQMANVINNNKANLEVLDAITIMLNVDVINELLDNIQEAVTLSKIS FT VTNNKMLSTCEVNVIKSTLHEQGIEVNFPDEALQFVTPKIAVKQGDLLYIL FT HVPHLENSTSTVIRIFPLIVDNQIIKSYPSHIIRHGHRLFTTANPRDFVQR FT SSNIEEFEDECIQNIILGKRSKCNSIFKDETTQQLANENTVIISNAKNHTL FT STNCGPEDRNITGNFIVIFRNCTVHFNNQSFTSTEMTVDTEVIHGAFHNSF FT IEWNLHKQHDIATISKTAVSNRQQLNHVYLRQDSLHFKLWTTFGGFSFSTI FT LGFVIVGVLIKIIYKNRTNGTGRSSLRGGLVTEVTEQHEVTP" XX SQ Sequence 6803 BP; 2432 A; 1518 C; 1191 G; 1662 T; 0 other; ttacattttt agcgaacgaa tagctatctt ttaggtagtg agattagtga agtaacggcg 60 attagcgcac aaaatttctc cggttaacaa taatttttaa ttagcgaacc aaaatcctcg 120 tatcaaacga tcgggatttt gagaattaag agaattcctt tcggaaaccg tttccaaatt 180 tttaattcgt gaagttcatg caaacaagaa gaatgacgga agagctatcg agacaaatcg 240 aacagctcac cgctcagctt acagagcaga tcgcctcact ctcacaagaa cgcgaagaca 300 gcgaagaggc tgcaccaaac cgttcgattg agcaactagc tgcccaactc gaacagcaga 360 ggttgcaaat agcctcgctg acgcaaagca gattagacga actggaagta ggaccctcta 420 ggatggtaac aacaccaaac agagtggctt cctttaaccc gtctcgtgtt ccggactcaa 480 tcaagctgat tgttcctata aaggagataa gaaatctcta tcagcgtgga taacttctgt 540 agaaaagaag cttgagcacg caaagagtct ctgctcctct actgtggaaa ttgacgctgt 600 catgccacta tgggtgagcg taatcagaga caaaattaca gacgaagcta gtcaagctct 660 cgtctccagg cataccgagt gtgaatggga tgccatcaaa aaggtattga tcgaatattt 720 tggagacaaa cgagatctat atgatcttgt ctcgcaaata acttacctcc aacaaaatag 780 aaaaagcata accgaattct acaatgaatg tcaagagcta ttagctgaca tcattgataa 840 attggcatta gataatacca ctaaacattg tgtgcatagc ctgtccgagg gttatgaatc 900 aatgattaag aactccttca tagatggtct aaacgagcca ttttctgtgc tcactagagt 960 aacaaacccg ggcaccctat tagaagctta tcatgcagct ttagggatat acaatactga 1020 acaaaggaaa aaggataaat tttcaaggca atttctcaag gtgcagacgc accaaatttc 1080 aaataagcca ccactccaag gagcggctcc aaataaatta caaagcccga atcagagatt 1140 tgaccacaat cttggtaggc ctccctacca aaattttgga tacaaccaat acagaggtta 1200 ccaataccgt ggaaattttg ttaatcagca aaataaccgc aacaataatt ttgttcctca 1260 accaaataac agatataaca atcttgtccc tcagcagaac aatcgcttca acccaccaca 1320 aaatagacct ccaatcaaag cagaggctac gtctcaacaa ttgaggcaaa atccccaaca 1380 aaaccaaaat ttctatcacg caaactctct ttatcatcat gaaacctttc atccccagaa 1440 ttttcaacat actaatttcc cacaccacga acctccttac taccaaaatg aaccaccaaa 1500 cggtgaacat tgcgagaatc aatcccctga atacgagtct caatatgacc aacaaaacca 1560 ggaagaatat ccaaacaata ttcagcctga aaacactgaa tccgctatgg aaagtgaaaa 1620 tttaaatttt cattttctcc tggacattcc tcaagaggaa tgacaaacaa ttttctaccc 1680 tacattataa ttaagtcacc gaaatttgga gtgattaaat ttctcattga cacaggtgcc 1740 aataaaaatt acatctcacc aaaattggtc gcattgaata atgtaatgac tggtgcaaaa 1800 cacaaaatta ctaatataag cggtacgttt tttgttaata aattcacata ttctaaccca 1860 ttcatggact acgacaacac ccttccaccg caacggtttt tcctgttcga tttccataat 1920 tatttccatg ggctgatagg atacgagtcc ttgaggtcat taggagctgt catagacaca 1980 gcaaaagatg ttctgaaaat caactccatt tcaatacctc tatataaaaa atatcccgaa 2040 acggttaacc taaatgccaa tgagaccaag gctatcttca tgccaacatc tgcttcggat 2100 ggcgattttc ttatgggaga agagatcgaa ttggtcccaa atgtacatat cctatctgga 2160 atctacacca gtaagaacca tcaaaccgaa attttaatac acaattttag taaagatacc 2220 aaagagatca accttggcga tactcttaat gttgaaataa ataacttcga aattacttcc 2280 cctcagaaaa acaacactga taaagaaaac cctttaagaa aacaactgcg aatggaccat 2340 ttaaattccg aagaaagatc aaaactaaca aaaatgctaa gtaaatatca agacgtattt 2400 catcaagaaa aacaaccact gaccttcaca aatgccattc atcatgaaat taatacaaaa 2460 gatgatcttc ctgtatacac aaagtcttac agatacccct tctgccacaa agaggaggta 2520 caaagacaaa ttattaaaat gcttgaacag ggcatcattc gacactcgaa tagcccctgg 2580 tcgtccccag tatgggtagt gccgaagaag ctagatgctt ctggccaaag aaaatggcga 2640 ttggttatcg actatagaaa gttgaaccaa aaaactttgg atgatcgata cccgatccca 2700 aatattactg acattctaga taaactagga aaatgtcagt atttctctac aattgatcta 2760 gcttccggtt tccatcaaat cgagctagct cagaaggaca ttcgaaaaac ggcttttagc 2820 gtcgacggag gacactatga gttcgtacga atgcctttcg gcttgaaaaa tgcacccgca 2880 acgttccaaa gagtcatgga caacatactc aggaagcatt tgggtgtacg ttgtctggta 2940 tacatggacg acattattat tttttcaaca agtttgcaag aacatttgca aaacttgaag 3000 atgatattgg atactcttag agagtacaat atgaaaattc aaattgacaa gtgcgaattc 3060 ttacaaaaag aagtcgcctt tctgggtcat atcgtaactc ccgaaggagt taagccaaac 3120 ccagaaaaaa ttaaagttat tcaagaatgg ccactgccga ataacgaaaa ggaactgaaa 3180 gggttcctgg gcatgattgg ctattataga aaatttataa gagatttcgc aaaaatcgcg 3240 aaacctctaa cacaacaact tcggaaagga gaaactattc aacatactcc cgcttttgtc 3300 tctgctttta aacgttgcag aaatttatta actagtagcc atgtgctaca atatccagat 3360 tttacgaaac catttgtgtt aacaacagat gcttcgaact atgccttagg cgcggttctg 3420 tcccaaggac ccatcgggaa ggacaaacca atagcctttg catcacgtac cttaacccga 3480 gctgaagaga actactcggc tatcgagaaa gaacttttgg ccatagattg ggtgtgcaaa 3540 tattttaggc cttatttgtt cggtcgtaaa ttcactcttt acaccgatca taagcctctt 3600 acctatgcct tagatttgaa aacctcaaat gataggctca tcaaaatgaa actacgtctt 3660 gaacaattca actacgacat taaatatcgc cccggcaaac aaaatgttgt tgccgacagc 3720 ctatcccgga ttaaacccac aataaatgta aacgaacaag aaacgttacc atcttcatcc 3780 agtgatgaca gcgaacctga cgagaataat gacaatgatg acaatgaaag caatgatgac 3840 agcgatagtg gaactgttca ctctagttca tcagacgcaa gcaacctgat taaaatgacg 3900 gaaaaaccta ttaacgtttt taaaaaccag attctcatta cagaagggcc aaagaacgac 3960 gaatcatatg aagagatctt cccctctata tttcggcgga ccatcaccag aacaacgttc 4020 ggtatggtga atgccgtgga catcttccga gattacatgc atccgacccg aaccaattgt 4080 atcctatgcc cggaaagatg gctaaaatgg gtacaattag cgtatcggaa ccacttttcc 4140 cgaaacaaaa ccttcaagat cgtactaaca cagtctattt tgcaggatat tccttccgac 4200 gaagaacagg accaaatcat tgaagagacg cacaaccgtg cccaccgagg aatcgaggac 4260 aattgcagag tcatctctaa gaagttcttc ttcccgaaga tgcggaacaa ggtaacaagt 4320 tttgttaatc tttgtaccac atgtctcgaa aacaaatacg agagaaatcc ttataaaatc 4380 aaatttgcat acacccctat ccccaagaag cctctcgaca ttctccatgt ggatattttc 4440 atatccaatc ccaacatttt tctttccgca gtggataaac tatcgcgata tgcgatgctt 4500 atccctgtca aatcaaggtc cattccggac ataaaaaggg gactcactaa acttttcacc 4560 acctttggaa caccaaagat gattgtgtgc gacaatgaag ttgcacttaa agccatagaa 4620 atacgtggtc tccttcagag actagacgtg gaaacctata acacaccctc agatcatagc 4680 gaagtgaatg ggatcgttga acgatttcat tctactttaa gcgagatctt caggtgcatt 4740 agaaacaagt ttccagatct ctctaacaaa gagatttata aaatcgcaac taatttgtac 4800 aacacaacca tccattcggt aactaaattg aaacccttag aagtattctt tggagttaaa 4860 gacggcgaag aaaggccact taacctccag aggatactcg agaatagaaa cgagatcttt 4920 gacgaggtcg tccaagaact cgaggccaga cagaaaaagc aaattgatgc acgaaacaag 4980 ttcagagaag aagatcctaa cttcgaagaa catgacaaaa tttacgtaaa aactaccggt 5040 atccctaaca aaaagaaagc aaaattcaga aaacaaatag tcaaaaacaa taggaggaaa 5100 acaattatcg atcaaagaaa cattaggctc cataaatcaa atttgaaaag gaaaagaaaa 5160 ctctaaatcc atgctaactt taaattttga taaccgaaaa gggactttcg gatctaaagg 5220 ggggatactg tttccgattc attttagtta taataacaaa ttctaacaaa ctaacacact 5280 aacccttaaa ttacagatgc cctgccttct cgttttgctt tgtataacac tcgaattatt 5340 atcatgcgaa ccacttttaa aaattataaa aacaacgaac gatcccatag ctcagataaa 5400 aattacagat tgcaaaattc aaacgggatc cataaaaatt atacacccga tcaatttagc 5460 attgaggagg ctatagagca aatcactaac actttttata ataaactaac acctaaaaat 5520 gaattacacg aagtgataaa gtttagggtc aaacgactgt actcgactct atatggactg 5580 aagccacagc aaactcatcg gcaccgacga tgggactcca taggcacggc gtggaaatgg 5640 attgctggat cacccgacgc ccaggacatg tacatcataa actccactat gaatgagatc 5700 atcgaccaga acaactacca atacaggata aacgagagca ttaataaacg gattgcacag 5760 ctaacaggaa caatcaacca gatggccaat gtgatcaaca ataacaaggc gaatctggag 5820 gttttggacg ccatcacaat catgctgaac gtagacgtca tcaacgaact tttggataac 5880 attcaagaag cagtcaccct ttccaaaatt tccgtcacca acaacaaaat gttatcaacc 5940 tgcgaagtga acgtcatcaa atccaccctc catgaacaag gtatagaagt caactttccg 6000 gacgaagcac tacaattcgt cacccccaaa attgcagtga agcaaggaga cctgctctac 6060 atactacatg tcccccacct ggaaaactca acatcaaccg tcatcagaat ttttccactc 6120 attgtggaca accagatcat caaatcatac ccgagccaca tcatacggca cggtcatagg 6180 cttttcacaa cagctaatcc cagagatttt gtacaaagat catccaacat cgaagaattc 6240 gaagacgaat gtattcagaa cattatactg ggaaaacggt ctaaatgcaa ttctatcttc 6300 aaggacgaaa caacgcaaca gcttgctaac gagaacacag tcattatctc gaatgcaaaa 6360 aatcacacac taagtacaaa ttgtggacca gaggatagaa acataaccgg gaatttcatc 6420 gttatatttc gaaactgtac agttcacttc aacaaccagt ctttcaccag cacagaaatg 6480 accgttgata cggaggtaat acatggagca tttcacaaca gctttatcga atggaattta 6540 cacaaacaac atgatatcgc aacaatcagc aaaacagcag ttagtaaccg acaacaacta 6600 aatcacgttt acctgaggca agacagttta catttcaaac tgtggacaac attcggtgga 6660 ttttcattct ccaccatact tggatttgtc atcgttggcg tactcatcaa aatcatctac 6720 aaaaacagaa ccaacggaac gggacgttcc tctcttaggg gcggactagt tacggaagta 6780 acggagcagc acgaggtcac acc 6803 // ID Copia-118_AA-I repbase; DNA; INV; 3918 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-118_AA_; KW Copia-118_AA-LTR; Ty1_copia_Ele28; Copia-118_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-3918 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [1359-1862] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 5..1135 FT /product="Copia-118_AA-I_1p" FT /translation="MGDHRKEAAVHGIQQFVGKGYDVWQFRVQTYLESVNV FT WDAIVKDVPVTAAEAAKFHETDRKAKAILVGFVADEYLGCIRDKGTAKAMW FT KSLEDTFAKKSAGRQTLIRKQIARLHLKDGASVRTHLLEFEDLIRQLRMAG FT SKLEESDVCSSLALTLPESYDALVTALENLPENELTYEVMKTRLIDEESKR FT NDRSYTVEDKPAAFIGDNQKKGNRFNGKCHSCGKRGHMKKDCKKAKGNANV FT ASIFKPVVFMADRAVTKRDSVGKIVFKLDSGSSDHLVNKKTYFKSVKSVQS FT PIIINVAKDGQFLEAKQVGEISGSSNLGVPLTCEGRFVRAKSTGQFDVSKE FT IGKSRSRSGVQQQGGNVEAKRENHCDGLSARQLV" FT CDS join(1020..2156,2160..3062) FT /product="Copia-118_AA-I_2p" FT /translation="MSVKKLAKAGVEVAFNSKVATLKLNGKTIATAYLRGN FT LYEHEIEVPEKEANLCSSETANLWHRRLGHLSENSMKMMVREDLAKGLNFK FT PQKLQFCEACVQGKMCREPFDGKRERATRPLGRIHSDVCGPIEPASWDGHR FT YFVSFIDDYSHFAMVFLLKKKSQVFEKFQEYEAMVTAMFGASISKLTVDQG FT REYCSGNQMKFYKAKGIQIEATVAYSPQQNGVAERFNRTLVEKVRSMLIDS FT RMPKSMWGEAVLTGTFLMNRSPTSALPRNLTPTECWTKQKPNLEKLKVFGS FT QAFAWIPNQYRKKLDAKSRELVMIGYASNGYRLWDRNSRKIVIARDVKFNE FT ACYPYASNSSKSEDIPLMILTSSDQEGESKNYLVSGENEARNSVEMIEAED FT SEYDDELPTALPSQQERDSNSDESSTRRSERERKFPGKLLEYLTGFEATAV FT GSPQFTDIPQCFNDIAKSDDRDHWMEAVLDELKSMEANRVWQLMKRPPGVK FT PLKSKWVFRLKENECGQAIRHKARLVVKGFLQRPGIDYEETFAPVARLATV FT RVVLAVAVRYGFYLHQMDVKTAFLHGNLKEELYMEVPDGVQAAPGTVCKLL FT KSIYGLKQSPRCWNEKLNTVLLKLGFSRSKRDYCLYTSTKENDEIFLVLYV FT DDLLIIGRNIRTIEHLKQQLSREFKMSDCGE" XX SQ Sequence 3918 BP; 1187 A; 761 C; 1026 G; 937 T; 7 other; ggttatgggc gaccaccgta aagaagctgc ggttcacgga attcagcaat tcgttggtaa 60 aggttacgat gtatggcaat tccgtgttca gacctatttg gagtcggtga acgtatggga 120 cgctatcgta aaggatgtac ccgttacggc tgcggaagcg gcaaagttcc acgagacgga 180 caggaaggcc aaagcgattt tggtgggctt cgtagcagac gagtacctag gctgcattcg 240 ggacaagggc acagcaaaag caatgtggaa aagtctcgag gatacattcg cgaagaaatc 300 agcaggtagg cagaccctaa tcaggaagca aattgcaaga ttgcacctga aagatggcgc 360 ctcggttcgg actcatctgc tggagtttga agatctgatt cgtcagttac gcatggcggg 420 atcgaagttg gaagaaagcg atgtttgctc gtctctggcg ttaacgcttc cggaatcata 480 cgacgctctg gtgacggctt tagagaattt gccggagaac gaacttacct acgaagtgat 540 gaagacccgt ttgatcgatg aagaatcgaa acgaaacgat cgaagctata ctgtagaaga 600 caaaccagca gcgttcattg gcgataacca gaagaaaggc aaccggttca atggaaaatg 660 tcattcctgc ggaaaaagag ggcatatgaa gaaagattgc aagaaagcaa aaggaaatgc 720 aaatgtggca agtattttta aaccagtggt gttcatggcg gaccgtgcag ttactaagcg 780 agacagcgtc gggaagatcg tgttcaagtt ggactctgga tcgagtgacc atctcgtgaa 840 caagaaaacg tatttcaaat ccgtcaaatc agtacaatcg ccaattatca tcaacgtagc 900 taaggatggc caatttcttg aagccaagca ggtaggagag atctcaggat caagtaatct 960 tggagttccg ttgacatgtg aaggacgttt tgtacgtgcc aagtctacgg gacaatttga 1020 tgtcagtaaa gaaattggca aaagcaggag tcgaagtggc gttcaacagc aaggtggcaa 1080 cgttgaagct aaacgggaaa accattgcga cggcctatct gcgaggcaac ttgtatgaac 1140 atgaaattga agtcccggag aaagaagcga atttgtgcag ttcagagaca gcaaatctgt 1200 ggcatagacg actaggtcat ctgtctgaga atagtatgaa gatgatggtg cgggaagatt 1260 tagccaaagg actgaacttt aagcctcaga agctccaatt ttgtgaagct tgtgttcaag 1320 gtaagatgtg tcgagaacca tttgatggaa agcgcgaaag agcaacgaga ccactcggaa 1380 gaatccattc tgatgtctgt ggacctatag agcccgcgtc atgggatggt caccgttatt 1440 ttgtcagctt tatcgacgac tattcacact tcgcgatggt gttcctactg aagaaaaagt 1500 ctcaagtttt tgaaaaattt caagagtacg aagcgatggt aacagctatg tttggagcat 1560 cgatatcgaa gttgacagta gatcaaggtc gagaatattg ctccggaaat cagatgaaat 1620 tctacaaggc taaaggtatt caaatagaag cgaccgttgc atattcccct caacaaaacg 1680 gggtagcgga gcggttcaac cgaaccctag tggagaaagt tcgaagcatg cttattgact 1740 cgaggatgcc taaatcgatg tggggagaag cagttctaac aggtacgttt ttgatgaata 1800 ggagtcccac ttctgctctc ccgaggaatc ttaccccgac ggaatgctgg accaaacaga 1860 agccgaactt ggagaaactg aaagtttttg gttctcaagc atttgcatgg atccccaacc 1920 aatatcgcaa gaagttagat gcgaagagtc gagaactggt gatgatcgga tacgcttcaa 1980 atggatatcg cttatgggac aggaattctc ggaaaatcgt aattgctcga gatgtgaagt 2040 ttaatgaagc ttgttatccg tatgcgtcca atagtagcaa atctgaagat attcctttga 2100 tgatcttgac atcgtccgat caagaggggg aaagcaagaa ttacctcgta tctggagwag 2160 aaaacgaagc gcgaaattcc gttgaaatga ttgaagctga agatagcgaa tacgatgatg 2220 aactacccac cgcgctccct tcgcaacaag aacgggacag caattcagat gaatcgtcca 2280 cgaggcgtag cgaacgggag cgcaagtttc cgggtaagtt gctggaatat ttaactggtt 2340 ttgaagcaac tgctgttggc tctccacaat tcacagatat acctcagtgt ttcaatgata 2400 tcgcaaagag tgatgaccgt gaccattgga tggaggctgt actcgacgag ctcaaatcga 2460 tggaggccaa tcgtgtgtgg caactgatga aacgaccacc aggtgttaag cccttgaaat 2520 ccaaatgggt attccgattg aaggaaaacg aatgtggaca agcaattcga cacaaagcaa 2580 ggttagtggt gaaaggattt ctgcaacgtc ctggmatcga ttatgaagaa acgtttgctc 2640 ctgtggctcg cttggcaaca gtacgtgtgg tgctcgctgt ggccgtacga tatggtttct 2700 atctacacca aatggatgtg aagacggcct ttctccatgg caacctgaag gaagagctgt 2760 acatggaggt accggatgga gtgcaggcag cgccaggaac agtatgcaag ctactgaagt 2820 ctatttacgg gctgaaacag tctccacgat gttggaatga aaaactcaat acggtgttgc 2880 tgaagctagg tttttcaaga tcgaaaagag attattgctt gtacacttca acaaaagaga 2940 acgacgaaat ctttctggtg ctgtatgtag acgatttact tatcattggc agaaacattc 3000 gaaccattga acacctgaag caacaactgt cccgtgagtt caaaatgtcs gattgcggcg 3060 aggmaaagtt cttccttggc attaaattgg attacgatcg tcagagaggt atcctwmagt 3120 tgtcccaagg aagcaatgcg caaaaaatcc tgaagaaatt cggcatgata gattgcaacg 3180 cagtgaamac gcctatggaa aaaggattac agttgagtcg agccggaagc agaacgaatg 3240 aaccctacag agaacttcta ggcagtttaa tgtaccagat gctatgtgct agacctgata 3300 tttgttttac tgtaagttac cttggacgtt accaacagaa tccggtggat catcattggc 3360 aggccttgaa gcgcgttgtg agatacctga aaggcactag cgagcttggt ttgctctaca 3420 agcggaatga atcttcagct ccgttgattg gctttgtgga ctcagattgg gcatcagatc 3480 aagaagatcg aaaatccgta tctggctttg tcttcaaggt gttcggatca accgtatcgt 3540 gggcaagtag aaaacaagta accgtatcta cttcatctag tgaagcagaa tacgttgctc 3600 tcagctctgc agtaagtgaa gcgatttggt tgggcggtct cctggaagat ttgaaatgca 3660 aatcaaaggc agaagcaatt ccaatatttg aagataatcg tggctgtatt ggactggcaa 3720 gaaacatgga atcaaaaaga atcaagcaca tcgatatcaa acatcatttt ctgagggacc 3780 atgttgctgc tggaaccgtg acgattgagc ctattggaac tgcagatcaa gaagccgaca 3840 tcttcacgaa atcattggat actgtacgat tccaagatct acgttcaaga ctgggagtca 3900 ccgattgaga gggggtgt 3918 // ID hAT-34_HM repbase; DNA; INV; 4060 BP. XX AC . XX DT 07-DEC-2008 (Rel. 13.12, Created) DT 13-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE hAT-type DNA transposon family: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-34_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-4060 RA Jurka J.; RT "hAT-type DNA transposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 2023-2023 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS 717..3275 FT /product="hAT-34_HM_1p" FT /translation="MPRGVHKMKRLSGHQQRKLKEAKLKSLKKYAGSILQY FT IDNPNDGQCSSKDSTLIQTPLVVEESENEASNNSTFTVDMEKNDQENEAES FT PQVLEITDLEKHDGKTNFKIFSDVSFWKLPLQDHLRIEIIKKGSSTFQNKD FT GPFNVVQRNVTQMKGHDRHLSSEWFYKQMKNGEKILRSWMVYSPNSGNLYC FT FCCRLFASNITNLTSKFVTGFQKWWKLNPKVYEHESSDNHLRFLEKWKILE FT TNLKSNNTIDAEIISFTENEKKKWRNILYRLLDVTLFLAKQNLPFRGHQED FT ESSLNKGNFLELVDLLSTYDPILKEHLIKLRESYSKVSYLSPKIQNEFISC FT LASHVKDKLIEEIKSARYFGMMFDSTPDISHVDQMSEVIRYIKIENKKVEI FT KEAFLGFFPLKGKKALDLSTEILKQLESDGLDIMMCRSQGYDNAATMAGIH FT GGVQKIISDKNNKAIFNGCIDHSLNLCGQHSFAVNASCVTFFGTIDAIYSF FT FAASTHRWEVLIKHTNISVKRLSTTRWSAHYNAVKPLQENFEKFIEAIEAL FT CGTNENLDTRGAAESLLPAVCQYTFLCFLYFWHDILEEVNNTQIYLQTKGL FT SLDKVIIKIEALSLFLNEERQNVVDKALNKALCVSKEHDIPIERRIRLRKK FT MAGELVCDTKLTIEEENKRTMYECIDRLNVELQARSKPLKNILDMFECVQA FT KFLVSEHNNEEDLTLSIKKLTSFYDEISHDELILEIPRLKRHLKAAGINLE FT VVKNWTALDILTFITEWDFTETLPMLSLTLRLFLTICVSVASCERSFSKLK FT LIKNYLRSTMGQSRLSDLAILYIESEFVKSIDFDNVIDKFASLKVRKGTF* FT " XX SQ Sequence 4060 BP; 1501 A; 534 C; 663 G; 1362 T; 0 other; cagtggcgta cctacggagg ggggggggag ggctcaccgc cccgggtgtc atccactttt 60 gggtgacacc caatttatta atatcttata ttttattttg gtttataatt aaacagtctt 120 acataaaaaa ataagtgggt atttatttac acgacctacg aaatgctgaa tgctgaaaga 180 tatagttatc attatttaga aaatataatc gcttagcttt cttcttataa tataaacaaa 240 atgaaataaa gacttaatgt cgtctaaaaa aattcttcat aatcaaaaca tactcaaatt 300 ttgtttgggt ttggataact ttgtgtgggt gcgattttaa ccgtttttca ctttatcagc 360 tttcttagtc ttattcgtat attttttcaa tgttagtact gcatcacatt gcagaatatc 420 aaaaccatct aattcgaact aaaacattcg aaaacatatt ctttgtttat gataacaatt 480 ttaaaaattt gccatattac atattactga aaattttata aaatttgtgc ataaattata 540 gtgactttgt aatagataca ttttgaaaca atatcgtata ttattaattg ttttaatttt 600 tcgtttttag tgcttaataa ataccattta atttaaaaag tttttagtgc ttaaattgta 660 agttttaaaa ttttgtttta atatttagtt taaagtttta aatattttgt ataacaatgc 720 cgcgtggggt gcataaaatg aagagattaa gtggacatca acaaagaaaa ttaaaggaag 780 ctaaacttaa atcccttaaa aaatatgcag ggtcaatact tcaatacata gataacccaa 840 atgacggcca atgttcttca aaagatagta ctttgataca aactccatta gtagtagaag 900 aaagtgaaaa tgaagcttct aataatagca ccttcactgt tgacatggaa aaaaacgacc 960 aagaaaatga agctgaatct ccacaagtat tagaaatcac cgatttagag aaacatgatg 1020 gaaaaactaa ctttaaaatc ttttctgacg tctctttttg gaaattaccg cttcaagacc 1080 atcttcgaat tgagattatt aaaaagggaa gttctacatt tcaaaacaaa gatggaccat 1140 ttaatgtagt acaacgaaat gtcactcaaa tgaaaggcca tgatcgtcac ctttcctctg 1200 aatggtttta taaacaaatg aaaaatggtg aaaaaatttt acgttcatgg atggtatatt 1260 cacccaacag tggaaattta tactgttttt gttgtcgatt atttgccagt aatattacca 1320 atttgacatc aaaattcgtc actggatttc aaaaatggtg gaaattaaat ccaaaagtat 1380 atgagcatga atcatctgac aatcatttgc gatttttgga aaaatggaag attctggaaa 1440 caaatttaaa atctaataat acaatcgatg ccgaaattat ttcttttaca gaaaatgaaa 1500 agaaaaaatg gcgaaacata ttatataggc tcttagatgt aactttgttt cttgctaagc 1560 aaaatttgcc atttcgtgga catcaggagg atgaatcgtc tttgaataaa ggtaattttt 1620 tagaattagt ggatttgctt tcaacgtatg accctatatt aaaggagcat ttaattaaat 1680 taagggaaag ttattccaag gtttcgtatc tttccccaaa aattcaaaat gaatttatca 1740 gttgtctggc aagtcacgta aaagacaaac ttattgaaga aataaaatcc gcgcgatatt 1800 ttggcatgat gtttgatagc actcctgata tatctcacgt ggatcaaatg tctgaggtga 1860 tcagatatat taaaattgaa aataaaaagg tagaaataaa agaggcattt ttaggatttt 1920 ttccattaaa aggaaaaaag gctctggact taagtactga aatacttaaa caattagaat 1980 ctgatgggtt agatataatg atgtgccgct cccagggata tgataatgct gctacaatgg 2040 ctggaatcca cggaggtgta cagaaaatta tttctgataa aaataataag gcaattttta 2100 atggatgtat tgatcattcc ctaaatttat gtggtcagca ttcttttgca gtaaatgcat 2160 cttgcgtcac atttttcgga acaattgatg caatttattc attttttgct gcttctacgc 2220 ataggtggga agtgttaata aagcatacca atatatcagt gaaaagactg tcgaccacgc 2280 gttggagtgc acattataac gccgtaaaac ctcttcaaga gaattttgaa aaatttatag 2340 aagcaattga agctctttgc ggtaccaatg aaaatttaga tacaagagga gcagctgaat 2400 ctcttttgcc ggcagtgtgt caatatacat ttttatgctt tttgtatttt tggcacgata 2460 tactggaaga agtaaataat actcaaattt atctacagac aaaaggtcta tctcttgata 2520 aagttataat aaaaatagaa gctcttagtc tttttttaaa cgaagaaagg caaaatgtag 2580 ttgacaaggc actaaataaa gcactttgtg tatcaaaaga acacgatatt ccaattgaga 2640 gaagaataag attgagaaag aagatggcgg gagaattggt atgtgatact aaactcacaa 2700 tagaagaaga aaataaaaga actatgtatg aatgtattga tcgtctaaat gttgaacttc 2760 aagcaagatc aaaaccatta aaaaatattt tagatatgtt tgaatgtgtc caagctaaat 2820 ttttagtttc tgaacataat aatgaagaag acttgacgtt atctattaaa aaattaactt 2880 cattttatga tgaaatatcc catgatgaac taattttgga aattccacga ctcaagcgac 2940 atcttaaagc agcaggaatt aatttggaag tagttaaaaa ttggacagct ttagatattt 3000 tgactttcat cacagaatgg gacttcacag aaacgcttcc aatgctttct ctcaccttaa 3060 gattattctt aactatttgc gtgtccgttg catcatgtga aaggagcttt tcaaaattaa 3120 aactgataaa aaattattta cgatcaacta tgggccaatc aagactttct gatcttgcaa 3180 tattatatat agagagtgag tttgtcaaaa gcattgactt tgataacgta attgataaat 3240 ttgcaagttt aaaagtaaga aaaggaacat tttaaaggtt aagtgttatg aaaatttagg 3300 ataataatta tataatgatt aaaaaattta gtttagtgtt tgaaaaattt agtttagtga 3360 taaaaacatt tagtttagtg attaaaaaat taatttagtg attgaaaaat ttagtttaat 3420 gattgaaaaa tttagttttg tgattaaaaa ttagtgtagt gattgcatag tgaataaaaa 3480 ctatacaaag ttcaaacttt tgtgcatatt attttaaaaa aatatattta gttcatatag 3540 atagtaaata attaagtaca ttaatatgtt gttatacatg aacacatcag tggcatagat 3600 tcatttatag gacagatctg cgtatttaac aaagggtaag tgaacgttta acttattttt 3660 cacagataag aagttaaaaa atcagtaggt ttgctatttt attattatac tttactatca 3720 aatcaaatcc aggtactact gtttttttaa gcatataaaa gaataaagct gattttagac 3780 gattttagat gataataatt tattatcgtc taaaacagag ataataatta tttggcaaca 3840 atttattttc aaataattat tttctctgtt atagacgaga attatttata gagagttata 3900 gacactaaat tattattgtc gaaaacaagg tttaaaatta catgtgtcaa cgtttctaaa 3960 tactgcttta ataatttcgt atataaaaat aattacatat aatatatggg tgacacccta 4020 tttgccgccc cgggtgacac cataactagg aacgccactg 4060 // ID PrD37D repbase; DNA; INV; 1761 BP. XX AC . XX DT 18-AUG-2005 (Rel. 10.08, Created) DT 26-APR-2010 (Rel. 15.05, Last updated, Version 2) XX DE DNA transposon from Philodina roseola. XX KW Mariner/Tc1; DNA transposon; Transposable Element; PrD37D. XX NM PrD37D. XX OS Philodina roseola OC Eukaryota; Metazoa; Rotifera; Bdelloidea; Philodinida; OC Philodinidae; Philodina. XX RN [1] RP 1-1761 RA Arkhipova I.R. and Meselson M.; RT "Diverse DNA transposons in rotifers of the class Bdelloidea."; RL Proc Natl Acad Sci U S A 102(33), 11781-11786 (2005). XX DR [1] (Consensus) XX CC This element belongs to the ITM D,D(37)D family, which occupies CC an intermediate position between the Tc and mariner families. XX FH Key Location/Qualifiers FT CDS join(90..320,402..1046,1050..1433) FT /product="PrD37D_1p" FT /translation="MRRRTLYNLQFRTSFIHLTFPCRRSHFLGGHFLLLRT FT TSSSSDSDDHIGVSAFERSHDRRQPVATLGPITVYMNRDLKREMKSQDMRN FT VVMRLFDDGLSSVQIARQLRNVVSERTVRRWENSYKATGGIDLKSPPGRPR FT IVRTKGLIQKVKQRLSSKSRQSARRLAKSLGVSRETMLRLLQDDLNLRAYR FT ITTQPKLTEDHKKRRVSFAYWVRKELRKRDHGQILFTDEKYFSLEGVFNRQ FT NERVYAVSRDDADQQGGINQTSKYPKRIMIWLGASENGLTSPIIFQPGETL FT HENYINVVLPHARAEGERLLGTDFIYQQDNATPHVHRKSLTWCAENFSNFI FT DNERWPPNSPDLNVLDYYVWDAVTTNMQWDKVNDYHSLTEEIKRGIRRVPL FT EDVCRSVQSWSKRILTILKTKGEYIK" XX SQ Sequence 1761 BP; 557 A; 327 C; 384 G; 493 T; 0 other; acgggtgacc cataataaat ggccgcattt tgcatttcag ttttctgtcg ataaggtcaa 60 tctactaagg acaataaata taccaaaaga tgcggcgtcg aacgctctac aatctacaat 120 tccgaacttc tttcattcat ctaacttttc cttgcagaag gagccatttt ctcggaggtc 180 attttttgct tttgcgaacg acatcatctt ctagtgattc agatgatcat attggtgttt 240 ctgcgttcga aaggtctcat gatagacgtc aaccagtggc tactttgggt ccaatcactg 300 tatacatgaa tcgggatctt tgatgatttt atcgtcaagg atagaggctg aagagacagt 360 atataaggat aaatgacaat gattggagtt cattcagttg aaaaagagaa atgaagtcac 420 aggatatgcg aaacgttgtg atgcgtttgt ttgatgatgg gttatcatca gtgcaaattg 480 cgagacaatt acgaaacgtt gttagtgaac gtactgtacg gcgttgggag aattcctaca 540 aggcaacagg tggtatcgat ctaaaatcac caccaggaag accacgcatt gtacgaacga 600 aaggcttaat tcaaaaagtg aaacagcgac tctcctccaa gagtcgtcaa agtgcaagac 660 gactggctaa atcattgggt gtctctagag aaacaatgct tcgtctactt caggatgatc 720 ttaatttacg tgcttaccgt atcacaacgc aaccaaagtt gaccgaagat cacaagaagc 780 ggcgagtttc attcgcatat tgggttcgaa aagaattacg aaaacgagat catggtcaaa 840 tcttgtttac cgatgaaaaa tatttttcat tggaaggtgt atttaaccgc caaaatgagc 900 gtgtctatgc tgtaagtcgc gacgatgccg atcaacaagg tggaatcaac caaacaagca 960 aatacccgaa gcgaataatg atatggttag gtgcatccga aaatggtctc acgtctccga 1020 ttatttttca gcctggtgaa acattataac atgaaaatta catcaatgtc gtgttgccac 1080 atgcacgtgc ggaaggtgaa cggttactgg gtactgactt catttatcag caagacaatg 1140 ctactcctca tgttcatcgg aaatcattga cctggtgtgc agaaaatttt tcgaatttta 1200 tagacaatga acgatggcca ccgaacagcc ctgacctgaa tgtccttgat tattacgtgt 1260 gggatgcagt cactacgaat atgcagtggg acaaagtcaa tgattatcat tctttaactg 1320 aagagataaa gagaggaatt cgtcgcgtgc cgctggaaga tgtctgtcgc agtgttcaaa 1380 gttggtctaa acgaattttg acaattttga agactaaagg tgaatatatt aaatgaacac 1440 gttttcataa catcaagtcg tagattttcg tgtgatcttc tgcttttgaa ataaaacatc 1500 tcagatgatt gagttaagaa gcacaaacaa cataatcacc tgaatcacta gaagatgatg 1560 tcgttcgcaa aagcaaaaaa tgacctccga gaaaatggct ccttctgcaa ggaaaagtta 1620 gatgaatgaa agaagttcgg aattgtagat tgtagagcgt tcgacgccgc atcttttggt 1680 atatttattg tccttagtag attgacctta tcgacagaaa actgaaatgc aaaatgcggc 1740 catttattat gggtcacccg t 1761 // ID hATw-3N1_BF repbase; DNA; INV; 2190 BP. XX AC . XX DT 12-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Branchiostoma floridae. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; hATw; KW 7-bp TSD; hATw-3_BF; hATw-3N1_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-2190 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Branchiostoma floridae."; RL Repbase Reports 9(2), 515-515 (2009). XX DR [1] (Consensus) XX SQ Sequence 2190 BP; 714 A; 362 C; 422 G; 692 T; 0 other; taggcatgtc aaacctaccc cgttcccgca ttagaacgaa gcggcccgcg gggccgtaaa 60 tgtttttaaa aaaatcttaa aaatcaaccg cttgaataat gccaatcaaa atcgttgtct 120 ataatctatg gaatgttgac tacgatacta tctgattttt agttgacaat atccataata 180 ataagtgagc taaaaatcat aaaagttgga gatacagccg gaaaaatgcc cacgttttga 240 cgccgcgcgc gcggcacgct atttcgaagg ccgcccggcg gagatgcaat tttcaactca 300 aaagccacaa gccgccgtgc atcggagaac tctgggaatg attccaggtg aaagggcaga 360 gatcaaggtt cactccagtg gttgcacaat tttgctacgt cattcctggc ggaagttatg 420 ggccagagta gggaaacctc cgaaagtaag tggatatttt tgccattttt cttgaattta 480 gcaattttgt tcatacacag tgacaaatat agatatattg cgatcatcat ggtgtagttt 540 ctgtttctat ttacgcgagt ttggcttgtg gctttagcga tacgatttga gagttttttt 600 aggcagcccg cgcacggccc gccgtaaggt ggagggggag gggggcgtga catcacggac 660 gtccgcggac attgtctgcg ctataaattc aacatattat cataaattct gcgctatttg 720 accgcagaat ggcaatatat tttatgtatt gtgtagaata cctttcataa ccgatgatgt 780 ggcatatttt gtaacaaaaa aacaacgttt cactatcgta aataatatgt tgcgtgcaga 840 cacgttatcg ccatgctcaa tcggaatttg tcagttgcat ggtattaatt tcgagatcat 900 ttttaatttt ctagttgaca caaaagggac atatcctggg cagatcagcg gaaacagcaa 960 aatagagact tgtcaaaact gctcaccaga gacaaggctg cagacaccga gacaacagat 1020 caagtttaga cacgacatgt ttggagttgc aataattggt gacgaacatt gaaagttatt 1080 tgatcttatt cttcattatt gttttgattt ttaaagaaga aaagctaagt taagagaaac 1140 tgaatagaaa agcaacctac taataataat cagaaagtta cttgaactaa acagttattt 1200 tttagaatgg gtaaaacatg tatgcatact aattagcact tactagttaa ttagcatatc 1260 ttattactaa tgtcattaat tacgatgtta attaggacaa ctgctaatta agcatttgat 1320 tgttaaaagt tctttaaagt aacatgcaat ttacatctgt acaacccatt taagttaatg 1380 cctacaacat aaatgtgatt aaaagccaat gtgtggatag ttttcaccat atatgtatca 1440 ttagcatatt gctaattagc ataaattctt aattatttat tgatgttcct aaaagtgtgg 1500 atacattcct gtacttacgt caacatgcaa tataggttag cactgtctat ttttatttca 1560 gggaatatgt aattcaaata tggacaagtg atacaattac atgttttgtt ggaagtactt 1620 tagggtaata accctcattt gcatattttg taatcaataa aaactacaaa acataccttc 1680 attttttaat gattagtctt actttgaatt agaaacgtga agtttgcttt cttaagcttg 1740 tagcatgttt aagaaatgca gattaattga atggtactat tgtggatgag tagattaatt 1800 agggttaatt aggattaatt aatacaaagt aatgatgata tgtcgagaaa acaatattat 1860 agtctgtcca gtaacattat cttcatgtgc ttgaaaattt gaagaacatt tgttaaagcg 1920 tttggaagat acattaaaaa gtgtgtaaat aactgtgtgc ttgtggtcat tttcggtgcc 1980 atgtcgaggt caactgttcc catagggcta atagataagc ctcttctaat gaaaacgtcc 2040 aacttttcaa ctttgaaaaa ttctagaaaa tttcaacatt gtcggatttc aacacaataa 2100 aaagcattgt aatcagaaga atctgatctg taatttgaaa gcagtttcag caactttaac 2160 cttaaccaaa attttggtgt gacatgccta 2190 // ID BEL-3-LTR_HM repbase; DNA; INV; 229 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from Hydra magnipapillata: long terminal DE repeat - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-3-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-229 RA Bao W. and Jurka J.; RT "LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 435-435 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 229 BP; 67 A; 40 C; 30 G; 91 T; 1 other; tgttgaagcc tatttggctt cccgcgaata atctattgrt tataactatt tcccgcagtg 60 cattgcgtct taagccatgc tctcttatgc tagttgtata ttatatattg accttactaa 120 cttacaactc ttcttactat ttaatatatt attgaaactc acatagtagt tatttcttag 180 tgttcaaatt atataaagta aaccgttgag tagatttaag tactcaata 229 // ID LIN3_SM repbase; DNA; INV; 7988 BP. XX AC . XX DT 04-FEB-2008 (Rel. 13.02, Created) DT 09-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Non-LTR retrotransposon; consensus. XX KW Non-LTR Retrotransposon; Transposable Element; LIN3_SM. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-7988 RA Jurka J., Tempel S. and Bao W.; RT "Non-LTR retrotransposon from Schmidtea mediterranea."; RL Repbase Reports 8(2), 161-161 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by CC Washington University School of Medicine: Genome Sequencing CC Center. XX FH Key Location/Qualifiers FT CDS 3247..7026 FT /product="LIN3_SM_2p" FT /translation="MVLSPELTINLIAQNNIYIVDHVAFKQFKGIKIIIGI FT ISNDNYWCLFLCSVCLGFSYIFDVRSSQNINGRLLGIGENITDYLNKFILE FT KTIIFKKSHTLFRFAMFSDDPIHHATTFILFLEEFISGNVVNRDKVIEKLV FT NMQSKMCDEVVVNLNLINCKITENILTNFFINLNLSENFIFLNNTMTTAIL FT DDCTQYLNEFLNIENLMKAKIVFAIIAPPKARQILMIIDYNSDEHYLLNPS FT SLLPNFPYLCTANSLLTKINEIKETPELCKLLGNCPHDIRGSGLVSRILIC FT AFIKHYALDLSLENINLRDIGNVIKSLLPTVQDIIKDSIKNLPDNKTNVKL FT KLDQRILLINENLLTLQNASVNEISNFIINEIRIRKETSVTKISRPYLGNK FT ISKVPPPKTNFKSSFNQNMKTTVYNILVNDEINERPTIKDHISNFSHQDPL FT IKSYDNVLNFGLKSQNTLLLRPIIDKEVIFELKRADNTSPGIDGIKYNDIA FT ILDPEGKLLTLLYNKIITENTMPDVWKSFKTLMIPKPDKGDKYNLISSWRP FT IALLSVVYKIFASILARRLTSWVNRNNLLHIGQKGGSVHDGCVEHNSVLSS FT CLEHSKYSKDTPIIIAFLDIKDAFGSVPHDYMWKILRHIGVGEKFTNTLKL FT LYTETSSYYTCGPIVTPNIPIKQGVKQGCPLSMILFAIAINPVLQAVTLSK FT VKPFKIGESSIQVLAYADDIALIANNTHDLQNILNIAFDVAREIGFEYRPE FT KCAYIQLPNVDTISEISINNIKIKKLLSKEFYQYLGVPIGEQPNQSPYEIL FT NKVVSDAKKLADSDLCGWQKLKAYKTFLHSRLTFPFRTREIKISALSANVC FT NNTTKLRSHLRKMMSLPNNSEISYFYNSIENGGASCTDLLDEYHTQTISHF FT FRLFTANCEFTKRINIDSLKFVTGPRLGNNNPSLQDSFNWINKADIIQKHS FT GKKTRFFRPRTAIRFFEKTHNITISFEVLDDKPILRLTTALKGTIILTEFD FT RRLVSKVLHLALYDSYFTKWKNSNISNSSTFYLSPNINRAIFKGELREDHW FT KFIHRARTNTLANFAKPYIKGENRLCRRCHNQDETLPHVLQNCKVHLTLAL FT NRHNDCLQQIVHHLKSPSLIVVVDHTCSLVSNSKERVDLIITNNEKKTILM FT VDIKCPFDSLVTFETVNKENLAKYDSLKKQIQAAKPSFTVEIFTCIIGSMG FT SVPPASYDLLGKMGVPFEKVAGLVKACAMSNIANSSRKWHYHVTGVLKSQ" FT CDS join(71..2509,2513..3166) FT /product="LIN3_SM_1p" FT /translation="MDLLKSQIVSSEAIMDFISKTSLSTEDISDLMSKKNE FT KNLDGYKHVPSTFEDSISRTNEFILQEIHRLDVSFCNIVNKLDLVISFLTQ FT NLNFNANSKLNLSNKKVNVEDSNLNLNNKLEIAKTLSLIGNAGLNLNNDKV FT TLENDLLISDIPVQVTDCEINNVIGNKSNLGQVYLDNVKLQHNEISNLQLV FT SKTSPNLNTYKKLENSSNKSTKNKVFSSKISNITSNNELKNNSKLFKALCK FT KEQQALFKKFPNLIKLIKIKESTDKSKNLNKELSFITKLINPLEIAGFEFF FT YRNNNICVCIDPFNINKPAVPISNKHTRGISIIKANIVTTCKKGFKFVSAP FT LMIRKFLFALSKKLAETATIKNEKHRGVQINNILKLLKQQNVLPDFLTKSQ FT IKYCDGDFVMLDLFNNHISDAEIKSINNSELTRSQFHGLLISFPELHELCF FT VGNMSSFPDEISFVGGKFLILNKINTSENPNFENSSDIMFNLGLLNFLEMS FT KDDTEFRTLLNKPNTSEKVIDIIFYLNYLNLKNNLNYKTNGTYGDYALNYI FT RNCLLMYSIVDIESTSSLDPDHNETLISEMIDMIPSLSYYNSSCRERLLRL FT RQWKLSGEIDRIHNIISPPTINSVNSFFDLLDKTFTEKSNQIQLLCEISNK FT INNWNPDSSNVDEIKAKRLIYFAIRDKKKFLSSKKQSYRIHQTCRKICTSK FT MSGFISQIIDSLVLFIQEKADVNQVEQFTTINVQTSTESTAPVVQTSLCFI FT NESKNLIVETPIVNIDNKVTETTNNLTTDIDEEPSHTLTIKNIDVKKMTTR FT KDVKLLKTTSKRYVPAPVSTPAALSDETDADCTFENFDFEFWFTDDHIDYY FT FESHIRNPKFAFLKCFVVSILCSNNSKDIFPIPSNIYEARVILCPININNC FT HWILFVYCKVSCNSYFIDPVFKNKRTFENNERAHIINSELNNKFDLQNKII FT DHPYINLQYQNNNFDCGPYICAYAMIVCKEWYYFPVDFILNIRKEIYEIQS FT FHLIHNTKLSGGGISKDHFPQPSKR" XX SQ Sequence 7988 BP; 2805 A; 1238 C; 1284 G; 2658 T; 3 other; tgctctttcg agccaattag atccgctgga gaaacatttc gattttcagc aatcgagttg 60 taaaagattg atggatcttc tcaaatccca aattgtctcc tctgaagcaa ttatggattt 120 tatttcaaaa acctcattat caactgaaga catttccgat ctaatgagta agaagaatga 180 gaagaattta gatggctata aacatgttcc ttccactttt gaagactcaa tatctaggac 240 taatgaattt atactacaag aaattcatag attagacgtt agtttttgta atattgtaaa 300 taagctagat ttagttatat cttttttaac tcaaaattta aactttaatg ctaattctaa 360 attgaattta agtaataaaa aagttaatgt agaagatagt aacttaaatt taaataataa 420 attagaaatc gctaaaacat taagcttgat tggtaatgct ggtttgaatt taaataatga 480 taaagttact ttagaaaatg atttattaat ttctgatatt ccagttcaag taactgattg 540 tgaaattaat aatgtaattg gtaataaatc aaatttaggg caagtttatt tagataatgt 600 aaaattacaa cataatgaaa tttctaattt acagttagtt tctaaaacat cgcctaattt 660 aaatacttac aaaaaattag aaaattcttc caataaatct actaaaaata aagtatttag 720 tagtaaaatt tcgaatataa cttctaataa tgaattaaaa aataattcga aattgtttaa 780 ggctttatgt aaaaaggagc aacaggcttt atttaagaag ttcccaaact taattaagtt 840 aattaaaatt aaagaatcga cagataaatc taaaaattta aataaggaac tttcttttat 900 aactaaatta ataaatccat tagaaattgc tggatttgaa tttttttatc gcaacaacaa 960 tatttgcgtt tgcattgacc catttaacat aaataaacca gctgtcccca tttctaataa 1020 acatacgaga ggaatttcta tcattaaagc taatattgtt acaacttgta aaaagggttt 1080 taaatttgtt tctgctccat taatgatacg gaagtttttg tttgcattat cgaagaaact 1140 tgcagaaaca gctacaatta aaaatgaaaa acatcgaggg gttcaaataa ataacatatt 1200 aaaattattg aagcagcaaa atgtgttgcc tgattttttg acaaaatcac aaataaaata 1260 ttgtgatgga gattttgtca tgctggacct ttttaataac catatatctg atgctgaaat 1320 taaaagtata aataattctg aattaacaag atctcagttt catgggttgc ttatatcctt 1380 tccagaatta cacgagttat gttttgttgg aaacatgtct agctttcctg acgaaatctc 1440 atttgtgggt ggaaaatttc taattcttaa taaaataaat acttcggaaa atccaaattt 1500 tgaaaattcg tccgatataa tgttcaatct tggtttgctt aattttctcg agatgtcaaa 1560 agatgacacg gaatttagaa ctttgcttaa taaaccgaac acttctgaaa aagtaattga 1620 tataatattt tatctcaact acttaaattt aaaaaataat ttaaattata aaacgaatgg 1680 aacttatggt gattacgctc taaactatat tagaaattgt ttacttatgt attcaattgt 1740 tgacattgaa agtacttctt cacttgatcc tgatcataat gaaacactta tttctgaaat 1800 gattgatatg attccatctt tatcttatta caactctagt tgccgtgaac gactcttaag 1860 gctcagacag tggaaactgt caggtgaaat agatagaatt cacaacatca tttctccgcc 1920 aacaataaac agcgtgaata gtttctttga tttgttagac aaaaccttta ctgaaaagtc 1980 taaccagata cagttactat gcgaaataag taataaaatc aataattgga atcctgactc 2040 tagcaatgta gatgagataa aagcaaaacg cttgatttac ttcgctatta gagacaagaa 2100 gaaatttctc tcttcaaaga aacaatcata tcgaatacat caaacttgtc ggaaaatctg 2160 cacaagtaaa atgagtggat tcatttctca aattatagac tcacttgtgt tgtttatcca 2220 agaaaaagca gacgttaacc aagtagaaca atttacaact ataaatgtac aaacttctac 2280 tgaatctacg gctcctgttg tacaaacctc gctctgcttt attaatgaaa gtaaaaactt 2340 aattgttgaa acgcctatag taaatataga taataaagtt actgagacaa caaataatct 2400 aactactgat attgatgagg aaccttcaca tactttaacg ataaaaaata tagatgttaa 2460 aaagatgacg acacgaaagg atgtaaagct gttaaagaca acatccaaat gaaggtacgt 2520 gcctgcacca gtctcaacac ctgctgcact gtctgatgag actgatgcag attgcacttt 2580 tgaaaatttt gattttgaat tctggtttac tgatgatcac atagattatt attttgaaag 2640 tcacattaga aatccaaagt tcgctttcct taaatgtttt gttgtgagta ttttatgctc 2700 aaataattcc aaggatattt ttccaattcc tagcaatata tatgaggctc gagttatctt 2760 atgtccaatt aacattaaca actgccactg gattttattt gtatactgta aagtctcttg 2820 taactcttac tttattgacc ctgtttttaa aaataagaga acgtttgaaa ataatgaaag 2880 agctcatata attaattctg aattaaataa taaatttgac cttcaaaata aaataattga 2940 tcatccctat ataaatctgc aatatcaaaa taataatttt gattgtggac cttacatttg 3000 cgcttatgcg atgattgtgt gtaaagaatg gtactatttt cccgttgatt tcattttgaa 3060 tattaggaaa gagatttacg aaattcaaag ttttcattta attcataata caaaactatc 3120 ggggggtgga atttctaaag atcacttccc ccaaccctca aagcgataga aactataacg 3180 ctgactcaag aaaaagcttg tgtctcgact ctcttaaaaa aatggtttga aaataatctt 3240 gaggtgatgg ttctttctcc agagctaaca attaacttga ttgctcaaaa caatatttat 3300 atagtcgatc atgttgcttt taaacaattt aagggcatta aaataatcat tggtattatc 3360 tccaacgata attattggtg tctatttctt tgttcagttt gtcttggctt cagttacatc 3420 tttgacgttc gatcttctca aaatatcaac ggaaggctgt taggaatagg agagaatata 3480 acagattact tgaataaatt tattcttgaa aaaacaatta tatttaagaa atctcatact 3540 ctctttcgct ttgcaatgtt ctctgatgac cctatacatc acgctactac ttttatacta 3600 tttttagaag aatttatttc tggaaatgtt gtcaatagag ataaagtaat tgaaaagctt 3660 gtaaatatgc aatccaagat gtgcgatgaa gttgtagtaa atttgaattt aataaattgt 3720 aagattacag aaaacatttt aacaaacttt ttcatcaatt tgaacttatc cgaaaatttc 3780 atttttttaa acaatacaat gaccacagca atactggatg attgcactca atacttaaat 3840 gaatttttaa atattgaaaa tttaatgaaa gcaaaaattg tatttgccat tattgctcct 3900 cctaaagctc ggcaaatttt gatgatcatc gactacaact ctgatgaaca ttatctactt 3960 aatccttcat ctttacttcc aaactttcct tatctttgta cagcaaactc tttattgact 4020 aaaatcaatg aaatcaaaga aactccggaa ctttgtaaac tacttggtaa ctgtcctcat 4080 gatattcgtg gctctggtct tgttagtagg attctcatat gtgcatttat caaacactat 4140 gccttagatt tgtctttgga aaatattaat ctccgtgata ttggaaatgt cataaagtct 4200 ttattaccga ctgtacaaga cattataaag gattcgataa agaatttacc tgataataaa 4260 acaaatgtaa agctaaaatt agatcaacgc atacttttga taaatgagaa tttgttgacc 4320 ctacaaaacg caagtgtaaa cgaaatcagt aactttatta ttaatgaaat acgtattcgt 4380 aaagaaacaa gtgtaactaa aatcagtaga ccatatttag gaaataaaat cagtaaagtt 4440 ccaccaccaa aaactaactt caaatcatca tttaatcaaa atatgaaaac gacggtatac 4500 aacattttag ttaacgatga aataaacgag aggccaacaa ttaaagatca cattagtaat 4560 tttagccatc aagatccgct aattaaaagt tatgataatg ttttaaactt tggtctaaaa 4620 tctcaaaata ctcttttact tcgaccaatt atagataaag aggttatttt tgaactgaaa 4680 agagctgata atacatctcc tggaattgat ggaattaaat ataacgatat tgctattctt 4740 gacccagaag gtaaattgtt gactcttctt tacaacaaaa tcatcacgga aaatactatg 4800 cctgacgttt ggaaatcttt taaaactcta atgattccta aacctgataa aggtgataaa 4860 tataatctga tctcttcttg gcgaccaatt gctttgttat ccgttgttta taaaatattt 4920 gcgtcaatat tagctagacg cttaacatcg tgggtaaatc gaaacaatct acttcatatc 4980 ggacaaaaag ggggttctgt gcatgatggt tgtgtagaac acaattctgt actttcatct 5040 tgtcttgagc actcgaaata tagtaaagat acacctatca taattgcatt tcttgacatc 5100 aaagacgctt ttggcagtgt accccatgat tacatgtgga agattttacg acacattgga 5160 gttggtgaaa aattcacaaa cactttgaaa cttctataca ctgaaactag ctcatactat 5220 acatgtggtc caattgttac tccgaatatt cctataaagc agggcgtcaa acaaggatgc 5280 cctctttcaa tgattttgtt tgccattgca ataaatcctg tccttcaagc tgttactctt 5340 tctaaagtaa aacctttcaa aattggtgaa tcttcaatcc aagttcttgc ttatgctgat 5400 gatatcgctc ttattgctaa taatactcat gatttgcaaa atatcttaaa cattgctttt 5460 gatgtagctc gtgaaattgg atttgaatat cgacctgaaa agtgtgcata tattcaactg 5520 ccaaatgtcg acacgattag tgaaatttca attaataata taaaaataaa gaaattattg 5580 agtaaagaat tttatcaata tcttggagtt ccaataggtg agcaacctaa ccaaagtcct 5640 tatgaaatct tgaataaggt tgtcagtgat gcgaaaaagc tagcggattc cgatttgtgc 5700 ggttggcaaa agttgaaagc ttataagacc ttccttcact ctcgcctaac cttcccattt 5760 agaacgagag aaattaaaat tagtgctctt tctgcaaatg tttgtaataa tactacaaaa 5820 cttcgtagtc atttgagaaa aatgatgagt ttacctaaca attctgaaat cagttacttt 5880 tataactcga ttgaaaatgg aggcgcatct tgcactgatc ttttggatga atatcacacg 5940 caaactataa gccacttttt tagacttttt actgctaatt gtgaatttac aaaacgtatt 6000 aatatagatt ctttaaaatt tgtcactgga cctaggcttg ggaataataa cccatcactc 6060 caagacagtt tcaattggat taataaagct gatattattc aaaagcactc tgggaaaaaa 6120 actcgtttct ttcgacctag aacagcgatt cggttttttg aaaaaaccca taatattaca 6180 atatcctttg aagtgttgga tgataagcca attcttcgcc tgacaacagc tttgaaagga 6240 acaataatat tgactgagtt tgatcgtcgt ttggtttcga aagttcttca tcttgctctt 6300 tatgattcat attttactaa atggaaaaac agtaacatta gtaattcatc aacattttat 6360 ctgtccccta atattaatag agcaatattt aaaggagaat tacgagagga tcattggaaa 6420 tttattcatc gtgcccgaac aaacacattg gctaattttg caaaacccta tataaaaggt 6480 gagaatagat tgtgtaggcg ttgccataac caagatgaaa cgttgccaca tgttttgcaa 6540 aattgcaagg ttcatcttac tcttgcactt aatcgtcaca acgattgcct tcaacaaata 6600 gtacatcatt tgaaaagtcc ttcactgatt gttgttgtgg accacacttg ctccttggta 6660 tcaaactcta aagaacgagt cgatttgatt ataactaata atgagaaaaa gacaattctt 6720 atggttgata ttaaatgtcc ttttgattct ttggtcacgt ttgaaactgt gaataaggaa 6780 aatttggcaa aatatgattc tttaaaaaaa caaatacagg ctgccaaacc tagctttact 6840 gttgaaatct ttacttgtat aatcggttct atgggatctg ttccaccagc atcttatgat 6900 ctacttggta agatgggagt tccttttgaa aaagtcgctg gtctggtaaa agcatgcgca 6960 atgagtaaca tcgcaaattc ttcaagaaaa tggcattatc atgtcactgg cgttctaaaa 7020 agccaataaa ttgattttta atttgattaa aaattatttt tataatattc tcaattaaag 7080 ttttattatg aataatttca aattcaaaat caaactattt caaatgatgc tctttcgagc 7140 caattagatc cgctggagaa acatttcgat tttcagcaat cgagttgtaa aagattgatg 7200 gatcttctca aatcccaaat tgtctcctct gaagcaatta tggaytttat ttcaaatatc 7260 tgtcaatctg ctctttatgt caaaccctcg aatatgactt atagcgcaaa gccggatcag 7320 ataaccgatg aaaatcagta aagccagaga aatttcgagg agtggaactt tattgagaca 7380 aatcttttct caaaattaaa tgataataaa ccgaaaattt tatctcgatc gttcaacctc 7440 ctcgattggc tgattccaag cagaatcatt tcaattcata tgaatcaaga acttgttgta 7500 ttaataaaat tttaaatagt actagtgagt gattttgtta agtctacgta atgtggtagg 7560 cgtgtcgatg attgttgaat gtaggtgtgt gganggttct rgaaggtgag tgtgtgtgga 7620 agttggtagg tgtgtggaag gttcaagaag gtaggtgtgt ttggaaggtt ctagaaggta 7680 ggtgtgtgga atgtggtagg tgtgtggaag gttctagaag gtaggtgtgt ggaaggttct 7740 agaagtgtgt tggtccacaa ctgcgtgaca taataagtgt gagttaggta ataagtgtgt 7800 gtggtgtaat aagtgtgtct ccgggtagtg actgaccagt gggcacgtga cataatcgtg 7860 cggtcacctg gtaacagtgg acagaggtca atctccaggt agcaacagcg tggacagagg 7920 tcaaatgtta tgtaataatt acgtaagcca aaggggggtt gtatctgacg ggctcacatc 7980 ctactact 7988 // ID Gypsy-45_AA-I repbase; DNA; INV; 5053 BP. XX AC supercont1.16; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-45_AA_; KW Gypsy-45_AA-LTR; Gypsy-45_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5053 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.16; Positions 995250 1000302. XX CC Positions [2237-2659] - Reverse transcriptase CC Positions [3794-4264] - Integrase core CC 'ATCAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1130..4894 FT /product="Gypsy-45_AA-I_1p" FT /translation="MRTYSPNMVGKSFSKLAQVYDHVKNSHHHVESEEPGR FT RPIKARLGYKQPYRGQFDYKFRNRNQTENRHPRDWRRDDGRQPSRYADLIC FT DFCGIKGHIKRKCFKLKHLQKGAVNLLQTSEPEPSVDRHLSELLDRMNARD FT SDSDSELDEGNYACNLVTFINKISNPCLVDVIVEQKNIKMEVDCGSSVTVM FT NKSQYFSNFEKKLKKSSKQLIVVNGAKLLIEGETSVLVNYRGVESKCKLLV FT LNCSNDFTPLLGRDWLDIFYPNWRNYFITQSMVNNVSEGRCTEYVNEIKQR FT FSNVFTKDFSSPIVGYEADLVLKNDVPIFKKAYTVPYRLKNRVCDYLMKLQ FT KEKVITPVKTSEWASPVIVVMKKNDDIRLVIDCKVSVNKYIIPNTYPLPVA FT QDLFASLAGSKVFCALDLEGAYTQLSLSERSKQFMVINTIKGLFTYNRLPQ FT GASSSASIFQQVMEQVLEGIENVCVYLDDVLIAGKDLEDCKCKLVIVLERL FT EKANIKVNLEKCKFFVENLDYLGHVISNNGLLPCKDKLDTIKNAKNPVNVT FT ELKSFLGLINYYNKFIPHLSSKLYHLYNLLRNGVKFVWDENCQKAFEKCKS FT SLINADFLEFYDPNKPIVIVSDASSYGLGGVIAHIVDGVEKPIGFTSFSLN FT DAQKSYPILHLEALALVCTVKKFHKYLYGQKFQIFTDHKPLVGIFGKSGRN FT SIYVTRLQRFILELSIYDYDIQYRCSAKMGNADFCSRFPLDQNIPAEYDHD FT LVRSINFSKDLPIDFTVIANETKHDKWLQTIISFLKNGWPEKLEKSLIDVF FT SNQHDLELVDGCLLYQDRVVIPQKLQKNILKLIHANHLGMVKMKQVARRSV FT YWFGINTDIEKLVSNCDVCNSMAIVKKPKIESRWIPTVRPFSRIHIDFFHF FT SHHTFLLIVDSYSKWIEVEWMKKGTDCGQVLKQLVAFFARFGLPDVIVSDG FT GPPYNSYAFVNFLEKQGIKVFKSPPYNPPSNGQAERLVRTVKEVLKKFLID FT SDLSHLSLEDQINLFLINYRNTCVTKDDSFPAEKIFTYKPKTLVDLLNPKK FT HFLKQLVPQSDDNDDSVHPLGEGCVSKGSVDPLDALSEGAELWYKNNNPNN FT NLRWIKSHFIKKYSKNVFQIQIGSAVLHAHRNQLRVCKAVGDLPRPNFFVQ FT RESQIQQEGRRGTQQDGRHEKESMVSIRNEGGMVPQANEVRSEEDSSNQAS FT TRCSSAHSNHSRKRKLQESASMPNATPRRSKRKRFLKMDSDFMYT" XX SQ Sequence 5053 BP; 1637 A; 769 C; 1130 G; 1517 T; 0 other; gtggcggcga gataaaggac tacgatttgg acacgtttgg atttcggcta acaggagaaa 60 aaaaggattg gtaaacggac aaggctacga aaaaggatct tctaaggaca acggaatatt 120 tgtgaacgct agtggtaacg ttggagagcc gaagtaacgt ttcccttcag ctaaggattt 180 gtaaaaacgc gtgtggtgaa aagcagaaaa tccgtggata cgcttgaaat ctaccggagt 240 gagcggagtc cagtggttga tccggtgtgg atcgtgttct tgtgtgagct atagagaacg 300 tttttttaaa agaggtccga acaaactgat tttgtaggag cttgcggctc cgagtagcga 360 gaatccttcc cgactggttt caaaggtaca tacgaggtcc cgttcgaggt catcttactg 420 ctgtggtgcc ttttttcact gtagcagaaa tagcgaagat accggtgagt caaaaagttt 480 tttttatcat ttgtttcttt gcctaattga gttgtctttt ctcggttttt ttgtttggaa 540 aataattaca ttgtaaaaca acaattgaaa taattaaaag ggcatagaca atggcttcta 600 tggcagttac tattgagcca taccgtaaag gttgttcctt tgcagactgg gtagaacgct 660 tagaatattc ttttgtcatt aatgcgatta cggatcagaa caaaaaggcg tatctaattg 720 cattaagtgg acctgtaatt ttttcggaac ttcgtttgct ttacccaagg gaaagcttgt 780 ccgaggtctc ttatccggat atggtggcta aattaaaagc ccgattggac aagactgaat 840 cagatttggt tcaacgcatg aaatttaatt ctcgggtgca acaacccgac gaaacggcag 900 aagatttcgt gctttcgtta aaacttcaag ccgaattctg ttcgtttggt gaatttaaac 960 aagtggcaat tagagataga attttggcgg gtttgagaga tggcgcttta agacaacgtc 1020 ttctcaatga agaaagctta actttacaaa gtgcggagaa actcattgcc acttgggaaa 1080 ttgcgggtga aaatatactt caggaaatac ttctggactg attgcatcaa tgcgtaccta 1140 cagtccaaat atggttggga aaagttttag taaattggct caggtttatg accacgtaaa 1200 aaatagtcat catcatgttg agagtgagga gccaggaagg aggccgatca aggctagact 1260 aggctacaag caaccatacc gaggtcaatt tgattacaaa tttaggaaca gaaaccaaac 1320 ggaaaatcgc catccgaggg actggcgcag agatgacgga aggcaacctt caagatatgc 1380 agacttgatc tgtgactttt gtgggatcaa gggtcacatt aagcgtaagt gctttaagct 1440 taaacactta caaaaaggag cggtaaacct tttgcagacg tcggagccgg aaccaagtgt 1500 tgatcgtcat cttagcgaat tactggacag gatgaatgct agggattcgg acagcgacag 1560 tgaattagat gaaggtaatt atgcatgcaa cttagtgacg tttattaata agataagcaa 1620 tccgtgtttg gttgatgtga ttgttgagca aaaaaatata aaaatggagg tggattgcgg 1680 ttcatcagtg actgttatga ataaatctca gtatttctct aattttgaaa agaaattaaa 1740 aaaaagttca aagcaattga tagtggttaa cggagcaaag ttgttgattg agggagaaac 1800 gagtgttttg gtcaactata gaggggttga gagtaaatgc aaattgttag tactgaattg 1860 tagtaatgat ttcactcccc tgctgggacg agactggctt gatatatttt atccaaattg 1920 gagaaattat tttataaccc aatcaatggt taacaatgta agtgaaggac gatgcactga 1980 atatgttaat gaaattaagc agcgattttc gaatgttttt accaaagatt tttcgtcacc 2040 catagtgggg tatgaggcgg atcttgtttt aaaaaatgac gtaccgatat tcaagaaggc 2100 ctatacggtt ccttatcggc ttaaaaatag agtttgtgac tatttgatga agttacagaa 2160 agagaaggtc ataacacctg ttaaaacaag tgagtgggct tcccctgtaa tagtagttat 2220 gaaaaagaat gacgatatac gtttagtcat agattgcaaa gtttctgtta acaagtacat 2280 aataccaaac acatacccat tacctgtagc tcaggattta tttgctagtt tggctggaag 2340 caaagttttt tgtgcattgg atctagaagg ggcttataca caattgagtt tatcagaaag 2400 gtcaaagcaa tttatggtaa taaatacaat taaaggttta tttacatata acagattgcc 2460 acagggagct tcatcaagtg cgtcaatttt tcagcaggtc atggaacaag ttttggaagg 2520 tattgaaaat gtgtgtgttt atttggacga tgtgttaatt gcgggaaaag atttggaaga 2580 ctgcaaatgt aaattggtta ttgttttgga aaggcttgaa aaggcaaaca ttaaggtaaa 2640 cctagagaaa tgtaagtttt ttgtagagaa tcttgattat ttaggccatg ttattagtaa 2700 taatggttta ctaccatgta aagataaatt agacacgatt aaaaatgcaa aaaatcccgt 2760 taatgtgact gaacttaaat cattcctagg tctcattaac tattataata aatttattcc 2820 tcatttatct tccaaacttt atcatttgta caacctgctg agaaatggtg tcaagtttgt 2880 ttgggacgaa aattgtcaga aagcgtttga aaaatgtaaa agttcactga ttaatgctga 2940 ttttctcgag ttttatgacc ctaataaacc aattgttatt gtcagtgacg cttctagtta 3000 tggtttgggg ggagtgattg ctcacattgt ggatggagta gaaaagccaa taggttttac 3060 ttcttttagt ttgaatgatg cccaaaaatc gtaccctata ttacatttag aggctttggc 3120 tttagtgtgc acagttaaaa aatttcataa atatttgtat gggcaaaagt ttcaaatttt 3180 tactgatcac aaaccacttg tgggaatttt tggaaaatct ggcagaaact caatttatgt 3240 cacccgactt caaagattta tcctagagtt gtcaatttat gactatgata ttcaatacag 3300 atgttctgct aaaatgggaa atgcagattt ttgctctagg ttccctttgg atcaaaatat 3360 accggcagaa tatgaccatg acttagtgag gagcataaat ttcagtaagg atttacctat 3420 tgactttaca gtaattgcaa atgagacaaa acatgataaa tggctccaaa caataataag 3480 ttttttaaaa aatggttggc ctgaaaaatt agaaaaaagt cttattgatg ttttctccaa 3540 tcagcacgat ctagaactcg tagacggatg tcttttgtat caggaccggg ttgttattcc 3600 acaaaaacta cagaaaaata ttttgaaatt gatacacgct aatcatttag ggatggtgaa 3660 aatgaagcag gtggcgagac gatcagtata ctggtttggt atcaatacag atattgagaa 3720 gctggtttca aattgtgatg tgtgtaatag catggctata gtgaagaaac caaagattga 3780 gtcacgatgg attccgacag taaggccctt tagtcgaatc catatagatt tttttcattt 3840 ttcccaccat acctttttgt tgattgttga tagttattcg aaatggatcg aggtagaatg 3900 gatgaagaag ggtacggatt gtggacaagt tctaaaacaa ttggttgcat tttttgcacg 3960 atttggttta ccggacgtga ttgtgtcaga tggagggcca ccctacaact cgtatgcgtt 4020 tgtgaacttt ctggaaaaac aaggaattaa agttttcaag agcccaccgt ataacccgcc 4080 tagtaatgga caagcagaac ggcttgtaag aactgtgaaa gaagttttga agaaattttt 4140 aatagactca gatctttctc atttatcgtt ggaggaccaa atcaatcttt ttttgattaa 4200 ttatagaaat acctgcgtga ctaaagacga tagctttcca gctgaaaaaa tatttacata 4260 taaaccaaaa actcttgttg atttgctaaa tcctaagaag cattttttga aacagttagt 4320 accacaaagc gatgataatg atgatagtgt tcacccgtta ggtgagggat gtgtttccaa 4380 gggatcggtt gatccacttg atgctttgtc tgagggagcc gaattgtggt acaaaaataa 4440 caatcctaac aataatcttc gttggataaa atcacatttt ataaaaaaat attcaaaaaa 4500 cgttttccag atacaaattg gaagcgcggt ccttcacgct catcgcaacc aactgagggt 4560 atgtaaggcc gttggcgatt tgccaagacc aaactttttc gtgcaacgcg agagtcaaat 4620 tcaacaagag ggccgaagag gaactcagca agatgggagg catgagaaag agagcatggt 4680 gtctattcgt aatgaagggg ggatggtacc ccaagcaaat gaagtgcgtt cagaagaaga 4740 ttcgtcaaat caagcaagta caaggtgttc aagtgcacat tcgaatcatt ccaggaagag 4800 gaaacttcaa gagtcggcgt caatgccgaa tgctactcca aggagatcga aaaggaaaag 4860 atttttgaaa atggatagtg attttatgta tacatagaat taagatcaaa agagaagttt 4920 taaagtgaat tttgttaggg atctattttg ctaccaagct ttcgaattaa atttgattat 4980 aattattagt agtcattcga gttgtttaat tgcataagtc tgtatgtatc attgatctta 5040 aggatggaga act 5053 // ID BEL-1_DPer-LTR repbase; DNA; INV; 367 BP. XX AC super_2; XX DT 04-MAR-2011 (Rel. 16.03, Created) DT 04-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-1_DPer_; KW BEL-1_DPer-I; BEL-1_DPer-LTR. XX OS Drosophila persimilis OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; obscura group; OC pseudoobscura subgroup. XX RN [1] RP 1-367 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Direct Submission to RU (22-MAR-2010). XX DR Genome; super_2; Positions 2541197 2541563. XX SQ Sequence 367 BP; 94 A; 117 C; 85 G; 71 T; 0 other; tgttcaggcc agcaatggac agtttccggg cactgtgccg gctccgcgcg cctcggtccc 60 ggctcgcgct cgcgcgctcg gaggggcgct cggatccaca atccacgatc cacacggcgc 120 tcacggcagc acaaaggcag cataaaacat agagcataag gcatagagag ttagtgacgt 180 tttcaccccc gcttcaacct gtcgacatac cgcgcgaata ccgaatcaat acctgaacaa 240 tatatatata catatacaca caacgaaacc catcggcctt tcctttctgt ggcgactgct 300 aggagggtta ctggaacata cacctcgccg ttggataatc gatcctcgat ccgccccact 360 acaaaca 367 // ID Copia-1_AC-I repbase; DNA; INV; 4807 BP. XX AC AASC02061324; XX DT 27-JAN-2011 (Rel. 16.02, Created) DT 27-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_AC_; KW Copia-1_AC-LTR; Copia-1_AC-I. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-4807 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02061324; Positions 847 5653. XX CC Positions [1453-1965] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 91..4062 FT /product="Copia-1_AC-I_1p" FT /translation="MRIRGLNAVFKNLHEEIVDVDSDQNEEAYAELVQILD FT DRSLSLIIRYAKDDGRQALKILRNHYLPKGKSRIISLYNELTSLINHSGES FT MTDYIIRAETAANSLKQAGETVSDSLLIAMVLKGLPARYQPFITVTTQREK FT AQTFAEFKESLRNQEDTVTNSSDCSVMNIRSRTQAQRRHQQSPIAAARDQE FT TRPAQSRWCNECKSKTHDTQFCRNRKRWCDHCKVRNHDTKYCRKFKSSSAK FT YVNSSASESNDFTFFVGSNAKFNEIHVADTLLVDSGASSHIINDKNKFVTF FT NDPFMPANHVIELADGRKLTDMAQGQGDAVISLRDTQGIVHKVTLKNALYI FT PTFSQDIFSVPSAVGKSSDTKIEFSKDSGILSSGNSNFEIKKKGSLYYLNS FT LHHHNAVRSLDEWHNILGHCNKQDIISLEKCVNGMTIRDPVVTDENDNCDV FT CLLGKMTQTKNRQPDTRARAILEKVHVDLAGPIGPTAKDNFKYALCCVDDF FT SGLCNVYFLQKKSDTCHAFQKFLAEMSPFGDVKCVRSDGGGEFTSNEFESL FT LVRNKIKHERSSPHSPHQNGTAERSWRTLFEMARCLLISAKLPKELWTFSV FT MYAAYIRNRCFNHRLKCTPFEVFTGKKPDLSKLQEFGRECFAFVQNPRKLD FT DRSERGIFLGFDKGSPAYLVYFHETQKIKKVRIVKFRRSLKVDVKNENDSV FT PFHVTYEEEFIPIPLTKQTLSDVSPAVDNVESTNHEIENSRDESVARSSNS FT QSDIVSDTGDSQDGKNRPKRSITKPKHLNDYYVDDEIDHYLNVTTTHYCYA FT IHDVPKSYKEAISCVSADKWQNAMQEEMRTLEENDTFQLTPLPPGREAVGG FT RWVFAIKEGQSDEAQYKARFVAKGYSQIPDIDYTETFSPTARMTSIRTLLQ FT LSVCNDMNIHQMDVKAAFLNAPIDTETYVEQPEGFQAKGHNGEQLVWKLKK FT SLYGLKQSGRNWNQTLYVHLVEQGFSQSLSDPCVYVKKYSDKKSQEEVRII FT LIVWVDDLLIASQNKIALETVKTNLKTNFQMKDMGEIHNFLGIEFDRQKDT FT MTMSQSHYIEKLLRRFGMEDCKPRLTPSEVNTSKLNNENCSDKEARRYREI FT VGGLIYIMTATRPDLSFIVTKLSQFMAKPDTYHMIMAKHVLRYLKGTVDQK FT LIFRKTDKSLDLVGFCDADWGSSDDRKSITGYGFELSQGGPLISWKSKKQP FT TVALSTCEAEYMALAAATQEGRFLKAVLNDMVDLNVTRFTLHCDNQGAIAL FT AKNPVHHKRSKHIDIRFHFIRNEINNDKLQLIYVQSEENIADVFTKPANRF FT RNDKFIALLMGK" XX SQ Sequence 4807 BP; 1554 A; 854 C; 978 G; 1421 T; 0 other; acaggttatg ggcccaggac tcaaagaccc atcgtattct gcggtgatga acgcagatac 60 gagttatggg agaccaaatt tcttggctac atgagaatcc gtgggttaaa tgctgtgttc 120 aagaaccttc atgaagagat cgtcgatgtt gactccgatc agaatgaaga agcttatgca 180 gaactcgttc aaatattaga tgaccgctct ctttcattaa tcatcagata tgcgaaggat 240 gatgggagac aagctctcaa gattctacga aaccattatc ttcccaaagg caagtcaaga 300 atcatttcat tatataatga acttacttct cttataaatc attctggcga aagcatgaca 360 gactatataa ttcgtgctga aactgctgca aattctttga aacaagccgg tgagactgtc 420 agtgattctc ttctaatcgc catggtcttg aaaggactgc cagcacgtta ccagcctttc 480 attactgtaa cgactcagcg tgagaaggct cagacatttg ctgaattcaa agaaagcctt 540 agaaatcaag aagatacagt caccaacagc agtgactgct cagttatgaa tattcgttcc 600 agaacacaag ctcagagacg acatcaacag tcacccatag cagccgcaag ggatcaagaa 660 accagacccg cacagtcgag gtggtgcaat gagtgtaaat caaaaacgca tgacactcag 720 ttctgccgca atcgaaaaag atggtgcgac cactgcaaag tgaggaatca tgatacaaaa 780 tactgtagga aatttaagtc gtcttctgcc aagtacgtga actccagcgc aagtgagtca 840 aatgacttta cattctttgt tgggagcaac gccaaattta atgaaattca tgtcgccgat 900 acccttctcg tcgacagtgg cgcctcatct cacattatca atgacaaaaa caaatttgtc 960 acattcaacg atccgtttat gcctgcaaat cacgtcatcg aattagcaga tggaaggaaa 1020 ttgacagaca tggcacaggg acaaggtgac gctgtgattt cactccgtga cacccagggt 1080 attgttcaca aagttactct gaaaaatgca ctgtacattc ctacgttttc acaggatata 1140 ttttctgttc catccgctgt tggaaaaagt tctgacacaa aaattgaatt ttccaaagat 1200 tctggaattc tctcgagtgg aaattcgaac tttgaaatca aaaagaaagg aagcctgtac 1260 tacttaaatt cacttcacca ccacaacgct gtacgctcat tggatgagtg gcacaacatt 1320 cttggacact gtaacaagca ggatattatt tctcttgaaa agtgtgttaa tggtatgact 1380 atcagggatc cagttgtaac tgatgaaaat gacaattgtg acgtgtgcct tttaggtaaa 1440 atgactcaga caaaaaaccg acaacctgac actcgtgcta gggcaattct tgaaaaagtt 1500 catgttgact tagctggacc aattggaccg actgctaagg acaatttcaa atatgcactt 1560 tgttgtgttg atgacttttc tggtttatgc aatgtttatt tcttacagaa gaaatctgat 1620 acatgtcatg catttcagaa attcttggca gagatgagcc cttttggtga tgttaaatgt 1680 gttagaagtg atggtggggg ggaattcacc tcaaatgaat ttgaatcact cttggtaaga 1740 aataagataa aacatgagag atcttctcct catagtcctc atcaaaatgg gacagcagag 1800 agatcctggc ggacattgtt tgaaatggca agatgtcttt tgatttctgc caaattgcca 1860 aaagaacttt ggactttttc agtcatgtat gcggcttata tccgcaacag gtgctttaat 1920 caccgtctga aatgcactcc atttgaagtt ttcactggaa agaaacctga cctttcaaaa 1980 ttgcaagaat ttggcaggga atgctttgca tttgttcaaa atcccagaaa acttgatgat 2040 aggagtgaga ggggaatttt ccttggtttt gataagggga gtcccgcata tttggtttat 2100 tttcatgaaa ctcaaaagat taagaaagtg agaattgtca agtttagaag aagtttgaaa 2160 gtggatgtga agaatgagaa tgacagtgta ccttttcatg tcacctatga agaagaattc 2220 attcctatcc ctttgactaa acaaactttg agtgatgtat ccccagctgt tgacaatgtt 2280 gaatcaacaa accatgagat agaaaattct cgtgatgaat ctgtggctag atctagtaat 2340 agtcagtcag atattgtttc agacactggt gacagtcaag atggtaagaa taggccaaaa 2400 agatcaatca ctaagcctaa acatttgaat gactattatg ttgatgatga gattgatcat 2460 tatttgaatg tcacaacaac tcattattgc tatgcaattc atgatgtacc aaaatcgtac 2520 aaagaagcta tttcttgtgt ttcagctgac aaatggcaaa atgcaatgca agaggaaatg 2580 cgtactctgg aagagaatga tacttttcaa ttgactcctt tacctccagg cagggaagca 2640 gtggggggaa gatgggtttt tgctatcaaa gaaggccaat cagatgaagc ccaatataaa 2700 gctagatttg tggctaaggg ttattcccag atacctgata tagattatac agaaacattc 2760 tcacccacag ctcgcatgac ctcgatacgt actttattgc aactgtcagt gtgtaatgat 2820 atgaacatac atcagatgga tgtaaaagct gcatttttaa atgcccccat tgacactgag 2880 acatatgttg aacaacctga aggttttcag gcaaagggtc acaatggtga acaacttgta 2940 tggaaattaa agaagtctct ttatggacta aaacaaagtg gcaggaattg gaaccagact 3000 ttgtatgtac acttggttga acagggtttt tctcaatcac tctctgatcc atgtgtgtat 3060 gtgaaaaagt actcagataa aaaatcccag gaagaagtga gaataattct cattgtatgg 3120 gtagatgatt tactcattgc ctcccaaaat aaaattgcct tggagactgt gaagactaat 3180 ctgaaaacaa actttcaaat gaaagatatg ggtgaaatcc ataattttct tggaattgaa 3240 tttgacagac agaaagacac aatgacaatg tctcaaagtc attacattga aaagttactg 3300 aggagatttg gaatggagga ttgcaaacct aggcttactc ctagtgaagt caatacaagc 3360 aaactcaaca atgaaaattg ctctgataaa gaagcaagaa ggtacagaga aatagttggt 3420 ggcttgatat acataatgac tgctaccaga cctgatctat ctttcattgt aacaaagttg 3480 tcacaattta tggcaaaacc ggacacatac catatgatta tggctaagca tgtgttgaga 3540 taccttaaag gtactgttga tcaaaaactt atcttcagga aaacagacaa atcgcttgac 3600 cttgtaggtt tttgtgatgc tgactggggt agctctgatg accgcaaaag tataactgga 3660 tatggattcg aactctctca aggtggtcct ttgatttcat ggaaatctaa gaagcagccc 3720 acagtggcgc tatcaacatg tgaggcggaa tacatggcct tggctgccgc cacgcaggaa 3780 ggtagatttt tgaaagctgt gttgaatgac atggttgacc ttaatgtcac ccgattcact 3840 ttgcactgtg acaatcaggg agctattgct cttgctaaaa acccagtaca tcacaaaagg 3900 tcaaagcaca ttgatataag atttcatttt atcagaaatg aaataaataa tgacaaattg 3960 caactgattt atgttcagtc agaagaaaac attgctgatg tgttcacaaa gccagcaaac 4020 agattcagaa atgacaaatt tattgctctc ttaatgggaa aatagattct cattaattgt 4080 gtgaactgtg aaatgatttt ttattttctg acatattatg atacatgtat ttgattattt 4140 gttaccttct gttgtccatc aggggttcat aaatgctttg cacaattcag ctatgtttct 4200 gttaggtatc ttcctttgtg ttttcttcat gatatttcca gttaccttca taaactgatt 4260 ttatgcatat atatagtaca aacatgtatt ttttgttcag aataatttgg caggacaaga 4320 cagagcctgt gtatagatct agtactgtac tagtacagca gtgccaacca cagcacagca 4380 atcatgtgga ttgcattgaa ttgaaatcag ttcattatca taatattata agtatgctgt 4440 ctgtgtatgt aggctattgt gtaatgtaca tgtggaaact aagagtagtg agtgacagac 4500 atgtgttcta catgacggtc tgcatggcct tgaagctgtt ctcaactctc tcagtttttg 4560 gtaaataact tacttaaaat tttttttttt ttttaaagtt ttgccattat gtgatgtgat 4620 tgtattcctc ttttgaatac tatgtacaag gggaaagaac tgtgcaaata atttacaact 4680 ctgttaaaat ttgggttatt tctctttaga tacttttgac catgattatc atttcattca 4740 ttgatttaaa ataaataaat gaataaaata aaataagcta agttagtaag ctgagtttaa 4800 gtggggg 4807 // ID RTE_SJ repbase; DNA; INV; 3921 BP. XX AC AY027869; XX DT 14-SEP-2004 (Rel. 9.08, Created) DT 20-JUL-2009 (Rel. 14.08, Last updated, Version 2) XX DE Schistosoma japonicum non-LTR RTE-like retrotransposon. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW endonuclease domain; reverse transcriptase domain; KW RTE-like retrotransposon; SjR2; RTE_SJ. XX NM SjR2. XX OS Schistosoma japonicum OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma. XX RN [1] RP 1-3921 RA Laha T., Brindley J.P., Smout J.M., Verity K.C., McManus P.D. RA and Loukas A.; RT "Reverse transcriptase activity and untranslated region sharing RT of a new RTE-like, non-long terminal repeat retrotransposon from RT the human blood fluke, Schistosoma japonicum."; RL Int. J. Parasitol 32(9), 1163-1174 (2002). XX DR Genbank; AY027869; Positions 1 3921. XX SQ Sequence 3921 BP; 1114 A; 876 C; 875 G; 1056 T; 0 other; ctaccttgac gtggtggtcg ggcttgctta tcgtgacgac ccaactgaac tatattagct 60 ggaacgcatg ttcctctagg tcctaccatt tcaagcaggt ctgttggaga gcgataagac 120 taaaggcagc aacctaagat ctgagggcga agtcgtacag ctagctgcag cgaggtgtga 180 cagcagtaag gtgtttccct cggataacca gcatctgcac tgatgctgcc ttcctaacgg 240 aaggaagggg ttaaaaaagg ttgaccctaa aaatgtcaca cctcacctta ctccacggat 300 ttccgtcttc agcagtaagg ccttctgaag gacggagcta acacctcaaa aactccctat 360 aacaagaccg tgcgtaatcg gcctcaagca attgtccctt gggctttgcg atcgtgctcc 420 aaggtcacta tgaccaatac tagcccagtt tctttttcag gtatctcaac aagtaatatc 480 ctttcatggt gtgggcagcc aggaagtgac aatcgccctc atacttctaa cagcatacaa 540 attcatcttg ttcgaggtca aatcccacca tcaactctta ataactctct tatcaccatg 600 cctaatcctt ctaacataat tttatcgcct tgcgtctcta atgctattgg tttgagttca 660 cgaaacgtta ttcctggtgt cctaaaactg cgcactaagt tatatattgg agctttcaac 720 gttcgaacgt tatatcaagt aggacaacag gcttccttag ctacgactct agaatcccgc 780 tccatcgata tctgctgcat ctctgaaacg cgtatacagg atccaagcac agtcattcgc 840 ttgacttcac cttgtgaaaa taaggaaccg gctcagttca ctcttcgtgt ttccggggac 900 cccaccgcta ccactcgagg ccttgctggt gtaggtatag cactaagttt aaaagctgaa 960 caggctctcc tcgactggat tccaatagac agtcgattgt gcgctgtccg tctaaatgga 1020 acggtaagga ctcgtaaaga tagggacaca cgccgttgtc tctttgttgt ctctgcctat 1080 gctcccactg attgcagctc agatgaaatg aaagatgaat tttacaggaa gctttatgac 1140 cttctatgta aagctaagcg cactgatgta gtaatagtgg caggtgattt taatgcccaa 1200 gtaggtagac tagaggaaac cgaaaggcac ctaggtggat catatggtgt tgaggctcaa 1260 cgaacagaca atggcgaccg gttactacaa ctttgttcag acaagcgttt atttctcgca 1320 agcaccaact ttaagcataa ggaaagacat cggttgacat ggcggcctcc gaaatcaacc 1380 caacgctgga ctcaaataga ccacattgct attagccatc gttggagagg ctcgatagaa 1440 gactgccgct cgttttggag cacatgttta gactcagatc atgccttagt gcgagcgcgt 1500 attagtctgc gactcactgg acgtaaaaga tccactacaa ggatccccca cagagttcaa 1560 tttaatgatg aaaaagtcaa gaatacattc caggaacaac taaaaaacca attagtaaac 1620 ggcgtaaact gttctcaacc tgagctagcc tggaatgata tccaaaaagc cgtcgaaaga 1680 gcagtaatat ctactaataa actagaccta aaggttagga aaactcagtg gatttcagct 1740 gcatctactg aactgataga tcctcggcaa cacatcccgt ctggttctga acatgatgag 1800 caacgtaggc aacttagaca caaactaatc aaaagcttac gcaatgatcg tgagcagtgg 1860 tgggtggcga aagctaaaga gatggaaaag gcagcggcaa taggtaacag taggcaactg 1920 ttcagactta tcaaggaaac gggtatcagg aacccaactg taagtgaaac tatctccgaa 1980 aaagatggct ccattattca ctgtcaatcc agaagattag accgatgggc agaacacttt 2040 agggaacagt tcaactggcc cacggcgtca tccctgctgc ccaccatccc caaacaatct 2100 gaatggcaaa ttaacattgg tcccccaagt ctcagtgaag ttgtgaaggc tattggaaac 2160 ttaaagcgag ggagagcggc aggaccagat ggattgaccc ctgagatatt taaggaaggt 2220 ggtccagtct tagcggcgag attaactgag atcttggcta gaatttggga actagacgta 2280 atcccatctg actggtctcg aacactaacc attccggtct ttaagaaagg acagaagtcc 2340 tcgtgtgaca attaccgagg aatcagttta acgaatatag tgtctaaaat actagcttca 2400 ataatacttc gacgtttaac taaagctcat gaagagcaaa ctcgagaaaa ccaaggtggc 2460 ttcagacccg gacgtggttg tatagaccaa atattcacac ttagacaggt tctggaacat 2520 aggcacactt tcagacgccc aacaatagct gtattccttg accttaaggc ggcgtttgac 2580 tctattgatc gtaaggttat gtggcaatgt ctgtcattga aaggtgtacc ggagaagtac 2640 attaacctta tacaagctct ctactcgaac accacatgtc gtgttagagc ttatggcaga 2700 ttgtcttcgg aattgaccac ttctagtggt gttcgtcagg cttgtccgct atctccattt 2760 ttatttaatt ttatcattga catactttta gagttaacat tgtcatcgtc tgatttccct 2820 ggggttgacc tctttccagg agataaactt actgacttag aatatgccga tgatatagtt 2880 ctactgagtg aagatgctga taaaatgcag gattttttga ccaccttaaa catgaatgta 2940 agcatgcttg ggatgcgatt ctccccatcc aaatgtaaaa tgttacttca ggactggctt 3000 aattcggcac caaagttagt gatagggcgt gaaactattg aatgcgttaa ccgctttact 3060 tatctaggga gcctcatcag tcctaacggt ctggtgtccg atgaaatctc ggcccgtata 3120 cataaggccc gatcggcttt tgccaacctg cgtcacttgt ggcgtaggcg agacattcgt 3180 ttaatgacaa aaggacgcgt ttattgtgca gcagtcagat ccgttttacc ttatggatgt 3240 gaaacatggc ctttaagagt agaggatatt cgtaggatcc tagtattcga tcatagatgt 3300 ctccgaaaca ttgctcgtgt ttgctgggac aaccgagtaa gcaatgcatg ggttaggaat 3360 agagtactag ggaaatacgg taagtctatt gatgaagtag tgaatcttca ccggctgagg 3420 tggttaggac atgtgttgcg tatgcctgac caccgtctac ctcgacgagc aatgttgtct 3480 gttgtgggag taggctggaa gaaagctagg ggtggccaaa cgaaaacatg gcaccaatcc 3540 atgaagtcat tgacgattgg actgagccat gttaatagat gtagactacc tggttggggc 3600 ccacacgatg atcgtaacaa atggttagag actttaggtg atatggctca aaatcgattg 3660 caatggcgca ggtgcatcca ctccttgtct tcctccgaat tctaattttt gagcttctca 3720 taattttctc tttactgaat cactttttat ctcaaatctt atctatgatc tctatgcctt 3780 cccgttagta cttactctgt tactacctcc cctactttgg gatttggtcc gacaatttaa 3840 tctttatctg ctaattgagg catggcaact tgaactgatg tacgcatgta caaagttcta 3900 tgttgtaact gactgactga c 3921 // ID L1-31_AAe repbase; DNA; INV; 5209 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE An L1 non-LTR retrotransposon from Aedes aegypti. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-31_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5209 RA Kojima K.K. and Jurka J.; RT "L1 clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1384-1384 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 12 sequences with >93% CC identity. XX FH Key Location/Qualifiers FT CDS 2002..5175 FT /product="L1-31_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MPMFSYKIATINICNISNHTKIDALTSFVRVSELDVI FT FLQEVQNDRLHLPGYMLFFNVDADQRGTAIAVRDQYNVHNVEKSLDNRIIS FT LRIGTITFINIYAPSGALQRTAREDFFNSAVAHYLHHSTEHVVLGGDFNAV FT VNPKDATGSSSISPMCKRLMNAAKLIDTWEALNGNTVEFSYIRSNTASRID FT RILITSKMKVQLRTAHFAVTAFSDHKAYVVRIVLPHLGTPCGRGVWRLQPH FT VLNDTEVINELSRKWIYWVRARRNYLSWIEWWVQFAKPKVISFLKWKTSIV FT YRDFKDAIELYRSLLKEAYDNYHGHPEQLVTINRIKAQMLRLQRNFSSNFR FT ANNETYISGESTTLFHVAQIHGKRSKTAISSLQTENGTETCDHSQIKETIR FT SYFENLYSSEEAVPSMDFMPIRAIPENNQANENLMQIITQDDVLTAIKGSS FT SRKSPGSDGLPKDFFLKTWQIIGFEFTNVINDALQGKATRQFYDGILVLVK FT KKGSDKSVKGYRPISLLNYDYKVLARILKQRIHCLLPLVLSNNQKCSNGKR FT TIFEATNRIYDKICQLKHLRQNALLVSFDLDHAFDRVSHQFLQQTMIRMNF FT NANFVELLATIWNQSYSKILIDGHLTPELKINRSVRQGDPLSMYLFVLYLQ FT PLLDAISRSFPNAVMNAYADDISMFLENERSLMKVVALFNDFGLTSGAVLN FT KRKTLAVMIGNVDLTTDSEWLHIENYVNILGVKYGDNIKQAQKLNWQSVLN FT GLRTQLWLHHPRKLNLCQKIILINTYITSKLWYMASNIPLTKGFATKIRAE FT LGKFIWHGQTLQRIAFANLTLPKTRGGLNLQCPETKSKALLLNRLMNPADE FT LPFLNSLLQNSSSSIPALFNHASLLQAELALLPERLINTPSSREIYHHYLE FT LKPDPGFVSADQRNWKMIFKNLHNKHLSSAQRSNWFVVMHRKIRHRELLYQ FT RGVVEDPYCETCPGELETTIHKLFRCQRVRNIWRYQRRQITNHVPELGRLE FT PEDFVYPSLRNMNQTTKNYIIKILGLYFSYLVESPENSIDLDDFVFYLNIN FT N" FT CDS join(150..1199,1169..1999) FT /product="L1-31_AAe_1p" FT /translation="MDDYRKDSVYLDFSGMPARPKLDKVHELISKKIQLDM FT IKVNCIQPSMTKARVIIELKSQAYVEELVADHSLKHTVELNNKEYAIPIVP FT YDNAIEVRVADLPSYFSTETIAKHLAPYGEVLSTQEEVWKNFWPGLPTGVR FT LARMRIRKPIPSYVPMTTHTAFISYRNQIRTCRYCVRPLHIGRTCNEARKE FT LGHDINSRLTAAQVVQGIIPPSPAINATPPPPSSPDPENEMQTSDGETNST FT ETTIGTAAAITNRLTATTSKHSSRPSSLSRHPLEQNTQAAGSSRSSSTLSH FT KGIGEKSINTSEFSISDDGEETINAPKRHRSQQSSVIPEAPFQEGKRKSRS FT RTNNTSREQEQNQQHFPIICAMASRIEILPKYINQQHLTENSFTTDITTNE FT GSAFCQVSPSICPAEVHIDTAVILKTIPIAFLPSSLERGNSTELTMKSLQS FT AHRHFPMSQIFNQHSMGKPIHHYRDPALFRYRQRLPRHGQRGLSLCEVSPS FT ICPTEACFDTAALENNALLVESKTPLHTETQPKVHPEKSLFNGCATTNETT FT THSKTALIFTKHQTPRHSCDSGERKLLRSKQNINDDDTEYVLLDILVWFDV FT CTHSIVSISARDRERFQVRCTFRLLLILG" XX SQ Sequence 5209 BP; 1669 A; 1214 C; 1021 G; 1304 T; 1 other; agttgacatt ttgctttcgt accgtacgga cgtgtttgag acaatcgctt ggtgaaaaca 60 tatcagttta tcgtgcctca gggcaaagtg taaaaggcgt tacgccgaaa agtgcggtcc 120 aatcggaatc cggtcattcc atcgctaaaa tggatgacta ccgtaaagat tccgtgtatc 180 tagactttag cggcatgcca gcccgcccta aactagacaa agttcatgag ctaatcagca 240 agaagatcca gctagacatg atcaaagtca actgtatcca gcccagcatg actaaagcac 300 gagtcattat cgagctwaaa tctcaagcat atgtggagga gctggtagca gaccacagct 360 tgaagcatac tgtcgaacta aacaacaagg aatatgctat tccaatcgtc ccatacgaca 420 atgcaattga ggttagagta gctgacctac cgtcctattt ctcgactgaa actattgcta 480 aacacctagc tccgtatggc gaagttcttt ccacgcaaga ggaggtgtgg aaaaatttct 540 ggcccggttt acccacggga gtaagattgg caagaatgag gataaggaag ccgatcccat 600 cttatgttcc gatgacgaca cacaccgctt tcatttctta ccggaaccaa attcgcacct 660 gccgttactg cgtaagacca ctccatattg gccgtacctg taacgaggca cgtaaagaac 720 taggtcacga catcaacagc cgccttactg cggcacaagt agtgcaaggc atcatcccac 780 catcgccggc aatcaatgct acgccaccgc cgcccagctc accagatcca gaaaacgaaa 840 tgcaaacatc cgatggagag acaaacagca cagaaacaac aattggcacg gcagcagcaa 900 taaccaatcg attaacggca acaacctcga agcactccag taggccttct tcgctttccc 960 gccacccact ggagcaaaac actcaagctg ctggatcaag cagaagcagt tcaaccctca 1020 gtcacaaagg aatcggcgaa aaatccatca acacttcaga gttcagcatt tccgacgacg 1080 gcgaagaaac gatcaatgct cctaaaagac accgttctca acaatcttct gttatccctg 1140 aagccccgtt ccaggagggg aaacgtaaga gcaggagcag aaccaacaac acttcccgat 1200 aatatgcgca atggcttcac gaatagaaat acttccaaaa tacatcaacc aacagcatct 1260 cactgaaaac tcattcacaa cggacattac gaccaacgag ggctctgctt tctgtcaggt 1320 aagccctagt atatgtcctg ccgaagttca cattgatact gccgtcattc tcaagacaat 1380 cccaattgct tttctcccat catcacttga acgcggaaac tctacggaac tcaccatgaa 1440 atcactgcaa tcagcgcacc gacacttccc aatgagccaa atcttcaacc aacactcaat 1500 gggaaaaccc attcatcact atcgcgatcc tgcactcttt cggtaccgac aacgactacc 1560 acgacacggt cagcgagggc tttctctctg tgaggtaagc ccaagtatat gtcctaccga 1620 agcttgtttt gatactgccg cactagaaaa taacgcactt cttgtagaat ccaaaacgcc 1680 gctgcacact gaaacacaac cgaaggttca cccggagaaa tcactcttta atggatgtgc 1740 gactacgaat gaaacaacaa cacattcaaa aaccgcacta atattcacca aacaccaaac 1800 tccgcgccat tcatgcgatt caggtgagcg taaactactg cgatcaaaac aaaacatcaa 1860 cgatgacgac accgaatatg tcctacttga catcttagtt tggtttgacg tctgcactca 1920 cagcattgtc agtataagtg ccagagacag ggaacgcttt caagtgcgct gtacgtttag 1980 actgctttta atattagggt gatgccaatg tttagctata aaattgcaac gatcaacatt 2040 tgcaatattt caaaccacac aaaaatcgat gcattaacat ctttcgttcg tgtatcagag 2100 ctagacgtaa tatttctaca agaagtgcaa aacgatcgtt tgcatttgcc aggatatatg 2160 ctgtttttca atgtcgatgc tgatcaacga ggtacagcca tagcagtcag ggatcagtac 2220 aatgttcata acgttgagaa aagcttagat aatcgtatta tttcattgag gattggaaca 2280 ataacgttta ttaacattta tgcgccctct ggtgctcttc aaagaacagc tagagaagat 2340 tttttcaact ctgccgttgc acactatctc catcattcca cggagcatgt ggtgctgggc 2400 ggtgatttca acgcagtagt taatcctaaa gatgcaacag gtagtagcag tataagtcct 2460 atgtgcaaac ggcttatgaa cgcagccaaa ctcattgaca cgtgggaagc gttaaatggg 2520 aacacagtgg agttttcgta tatcagatcc aacacagcgt cgcggatcga ccgaatacta 2580 atcactagta aaatgaaagt ccaactgcgt acggcgcatt ttgcagtaac tgcattctcc 2640 gatcacaaag catacgttgt gcggattgtt ctaccacatc taggaactcc ctgtggacgt 2700 ggggtatggc gtctccaacc acatgtcctg aatgacacgg aagtcatcaa cgagctatca 2760 agaaagtgga tatattgggt gcgagcgaga cgaaattacc tttcgtggat agaatggtgg 2820 gtccagtttg cgaaacccaa ggttatctcc tttttaaagt ggaaaacctc gattgtctat 2880 cgagatttta aagatgcaat tgagctttat cgatctttgt taaaagaagc atacgataac 2940 tatcacggcc atccagaaca acttgtaacg atcaatcgta tcaaggcaca aatgttgcga 3000 ctacagagaa atttttcttc aaactttcgg gccaataacg aaacctacat ttctggagag 3060 tcgactactt tattccacgt tgcacaaatt catgggaaaa ggtccaaaac tgccatctca 3120 agtttgcaga cagaaaatgg caccgaaacc tgcgatcatt cccaaatcaa ggagacaatc 3180 aggtcctatt ttgagaacct gtactcttca gaagaagccg taccttcgat ggatttcatg 3240 ccaatcagag cgatccctga aaataaccaa gcaaacgaaa acctgatgca aataataaca 3300 caagatgatg ttttaactgc aatcaaggga agtagctctc gtaaatctcc tggctccgat 3360 gggctaccca aggatttttt tctgaaaaca tggcagatta ttggctttga attcacaaat 3420 gtcattaatg atgcacttca aggtaaggca acgaggcagt tctatgacgg aatccttgtt 3480 cttgtgaaga agaaaggaag cgataaaagc gttaaaggat atcgaccaat atcacttttg 3540 aattatgact acaaagtttt agcaagaatt ttgaagcaaa gaatacattg tctcctaccg 3600 ttagttctgt caaacaacca aaaatgctca aatggaaaaa gaacgatttt tgaagcaacg 3660 aaccgcatct acgacaaaat ttgtcaacta aaacatcttc gacaaaatgc attgctggtg 3720 tcttttgatc tggatcatgc gtttgatcga gtaagccatc aatttctaca gcaaaccatg 3780 attaggatga atttcaacgc caatttcgtc gagttgcttg ccacgatatg gaaccaatcg 3840 tactccaaga ttctgatcga cggccacttg actccggagc tcaaaatcaa ccgatccgtc 3900 cgacaaggtg acccactgtc aatgtatctg tttgtgctgt atctacagcc gttgttagac 3960 gccatttcaa gaagtttccc taatgctgta atgaacgcct atgcagacga catttccatg 4020 ttcctagaaa acgaacgttc tctgatgaaa gtggtcgctc tcttcaatga ttttggacta 4080 acctcgggtg cagttttgaa taaacggaaa actctagcag taatgattgg caacgttgat 4140 cttacgactg attcagaatg gcttcacatc gaaaactacg ttaacatttt aggagtcaaa 4200 tacggagata acatcaagca agcacagaag ctgaactggc agtccgtttt gaacggattg 4260 cgaacccagc tttggctgca tcatcccagg aaattaaatc tgtgtcagaa aatcattctg 4320 atcaatacgt acatcacctc aaaactttgg tatatggcct ctaatatacc tctcacgaaa 4380 ggatttgcaa caaaaattcg ggcagagctt gggaaattca tttggcacgg ccaaacattg 4440 caaagaatag cttttgctaa tctaacgtta cctaaaacaa gaggtggtct aaatctacaa 4500 tgtcctgaaa caaagtcgaa agctctgctt ttgaatcgat tgatgaatcc agctgatgaa 4560 cttccgttcc tgaacagtct tctacaaaat tctagcagtt ccattccagc gttgttcaat 4620 catgcatcct tactacaagc agaactagcg ctcctaccgg aacgtttaat caacacacct 4680 tcatccagag aaatttatca tcactatcta gaattgaaac ctgaccctgg atttgtgtca 4740 gctgatcaac gaaactggaa aatgatcttc aagaatcttc acaataaaca tttatcatct 4800 gcgcaacgct caaactggtt cgtggtaatg cacagaaaaa taagacatcg agagctactg 4860 tatcaacggg gtgtagtgga ggatccttac tgcgaaacat gtccaggaga actagagacc 4920 actatacata agctgttccg ttgtcaacgt gtcaggaata tctggagata tcaacgaaga 4980 cagataacta atcacgtacc tgaactaggt agactagaac ccgaagattt cgtttatcca 5040 agccttagaa atatgaatca aaccactaag aattatatta taaaaatatt agggctttac 5100 ttcagttatt tagttgaatc tcctgagaac tcaatagatt tagatgattt tgtattttat 5160 ttaaacatta acaactaagc tgcaaatata aggttttaca aaaaaaaaa 5209 // ID Mariner-1_BTe repbase; DNA; INV; 1290 BP. XX AC . XX DT 28-JAN-2011 (Rel. 16.02, Created) DT 28-JAN-2011 (Rel. 16.02, Last updated, Version 2) XX DE Mariner-type sequence: consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_BTe. XX OS Bombus terrestris OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea; OC Apidae; Bombus; Bombus. XX RN [1] RP 1-1290 RA Jurka J.; RT "DNA transposons from the buff-tailed bumblebee."; RL Direct Submission to Repbase Update (31-MAY-2004). XX DR [1] (Consensus) XX CC >93% identical to consensus. XX SQ Sequence 1290 BP; 419 A; 240 C; 247 G; 384 T; 0 other; tattgggttg gcaactaagt gattgcggat tttgtcaata ggtggtctta gcacattctt 60 ttgttttgtc aacgtgttca aagcacctaa cctatattgt tttcttgttg tttggacttc 120 cacctttaat ttagtaaaaa aattgtatcg attagttatt tagtttcgtt tttatcccaa 180 tttaaaaatg gaagaacaag acgcgcattt cagacatatt ttattgtatt atttccgaaa 240 aggcaagaac gcatcgcaag cacacaagaa gttatgtgcc gtatatggga atgaagcctt 300 gaaagaaagg cagtgtcaaa attggtttgc caaatttcgt tctggtgatt tttcactgaa 360 aaatgctcaa cgatctggcc gtccagttga agttgatgag acccatatca aggccattat 420 cgattcagat cgtcatagca caacgcgtga gattgcagag aagctcgatg tatcgcatac 480 atgcattaaa aaaaattaaa acagcttggc tatgtcaaga aactcgattt atgggtccct 540 catcagctta aggaaattca tttgacgcaa cgcattagca tctgcgattc gcttctgaaa 600 cgcaacgaaa ttgatccatt tctgaaacga ctgattactg gcgaccaaaa gtggatagtt 660 tataacaacg ttaatcggaa aagatcgtgg gtgatgcaag atgaaccagc ccagacgaca 720 ccaaaagctg agattcacca aaaaagatta tgctgtcagt ttggtgggat tataaaggaa 780 ttctgtactt tgaactttta ccaagaaacc aaacgattaa ttcaaacgtg tacgttcaac 840 aactcgccaa actgagcgat gcagttcaag aaaagcggcc agaattggca aatcgtaagg 900 gtgttgtttt ccagcatgat aatgcaaagc cccacacatc tttggtcact cgccaaaaat 960 tattggaact aggttgggat gtgttgtcac atccaccata tagccctgac cttgcgcctt 1020 cagattacca tttatttcgt tccatgcaga actccttaaa tggtaaaatt tttaatgacg 1080 ctgatgatgt aaaatcacat ttaattcagt tttttgctgg caaaaatcag aagttttatg 1140 aacatggaat tatgacactg cctgaaagat ggcaaaaggt catcgacaaa aacggacaat 1200 acctaattga ataaagttat atttttaagc aaaaattttg aattttcctt cttatttaaa 1260 atacgcaatc acttagttgc caacccaata 1290 // ID Copia-25_DPu-I repbase; DNA; INV; 4477 BP. XX AC scaffold_34; XX DT 11-MAY-2010 (Rel. 15.05, Created) DT 11-MAY-2010 (Rel. 15.06, Last updated, Version 0) XX DE Copia-like LTR retrotransposon from Daphnia: internal portion. XX KW LTR Retrotransposon; Transposable Element; Copia-25_DPu-I. XX NM Copia-25_DPu-I. XX OS Daphnia pulex OC Eukaryota; Metazoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. XX RN [1] RP 1-4477 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Daphnia."; RL Repbase Reports 10(5), 713-713 (2010). XX DR Genome; scaffold_34; Positions 257574 253098. XX CC Positions [1822-2343] - Integrase core CC 'ACGAC' target site duplication CC LTRs are 98% similar to each other. CC We thank the US Department of Energy Joint Genome Institute CC http://www.jgi.doe.gov/ and the Daphnia Genomics Consortium CC http://daphnia.cgb.indiana.edu CC for making the data for Daphnia pulex available. XX FH Key Location/Qualifiers FT CDS 703..4383 FT /product="Copia-25_DPu-I_1p" FT /translation="MSHISAIENLVEQLKNMGEAPTLLQICTKIIYTLPPH FT LRGFIATWEALPEQEQTIALLTAKILNEESKNEMFQPQDLGYFAASKGYRG FT GSRGGYGGRGGHSGGVRFQDGPPAKRARYTGPPCEFCTTMGTTAAHPTSEC FT RKRTHAAKRDSANPAFINSPTVDFGFPATFSPLARLDLTDWFADSGATAHL FT TNQKNILQNIVTVPVGSWVIQGIHQATANVHAYGDVLFESTVNGKTRNGLL FT KRVLYVPDGGINLISIGAVTALSTQVTFFGEEVSFTKNDVLEVTGRRAGRT FT LYKLNLKAIIQSIQEEQACVARQSATSLINWHFRLGHVNCKTIQRMASTNA FT VDGLVITNHDVPQVCEGCIYGKFSRCPFHSRSEKPIEVGQVIVADVGGPMQ FT ESSLGGANYYVAFKDKCSGHRQVYFLKNKSEVAGKFKCFVSRFQNETGKLI FT KSLRTDNGTEFKGIAWTWAEEAGIIRQYTVAYSPQQNGTSERDNRTIVESS FT RSALYGKRHPVSLQVLLRLWAEAIAYSVYTLNRTLSRTRMVTPYEKYHDKT FT RRKWDKKGEKGMFLGYDDTSTGFRVLILETFKIKVSVDVTFYEGKVPSTKQ FT QNHDTPPKFPFATKQQPDDANQPSDEKDTPSEDNEEASVVPSTQTKANDAQ FT SERDGNADHPRDELVSDKEVNDATSHEEIPVCANRRYPLRDRKSKVIKSMQ FT SIDVHNSEPFEPRTYQEAMSCEEAHLWQPSVQDEYDSHIQNGTWILVPLPP FT GRKAIGCRWIYKVKPGHLSTPARYKSRLVAKGYAQIDYSSVIGYGPLRGIL FT SMCAALNLDMAQLDIKTAFLYGLVEEEIYIQQPEGFIVPGKEHLVGRLVKC FT IYGLKQSPRVWNCKFNDFLLKFGFTRSQHDPCVYHRRREEEILVIVIWVDD FT GLVCSNSKTAIAEVLAFLGENFEMRTLPTDRFVGLEIERIREEKKLFVNQV FT GFIEKILSRFNMSDCNPKSMPANPHVRLSKEMCPSTRTEKAAMETVPYQQA FT VGCLNYLTHATRPDIAFAVNQVSRYCHNPGSQHWEAVKHILAYLKGTTQYG FT ICFNGSSGSSDCALVGYTDSDYAGDLDHYRSTTGYILFFNNGPVSWCSRRQ FT PSTASFNTQAEYQALSDGSREAAFFMWLLQELGVESKRAVSLFCDNNAAIQ FT LASNPVFRSRTKHINVAYHIVRDYVEAHQIKAERVDTKDNLADILTKPLPS FT TDFKRFRELIRVRPFPE" XX SQ Sequence 4477 BP; 1329 A; 1156 C; 990 G; 1002 T; 0 other; ggttatgggc ccagttgtgc aagctgatat tactgaatag aaagaaacaa aaaggaggaa 60 atggcgtgct caaacagaag tttaagccat gtggtcaaat ttgatggtag caacttacca 120 ctctggaagt tgggcctgga cgttgcacta gaagaacatg atgttcagag tgtgaccgat 180 ggcacctgcc tcatgccacc tgaggtatat attcaaaagc ccctctttca tagttaatca 240 caagtttgtc tggacccgct ccacctactt taactcaaga tgtctaacta agactaaatc 300 atttctatct cgttaactca tgtacaaaac actacagatg cgagctgtcc atgaacctgc 360 tgaaggacaa gctgtgatcg atggagctca agtcatgctg ggaccaatcc taaatgcaga 420 tgcaattaag gattggaaaa cgaaagatta tacagcacgt cgaatcttac tctcaacaat 480 tgaggaaaaa ctgcagaata cgttggtcgg atgcaagaca gctttccaaa tatggacaag 540 actctcgtca caacacaaca aatgtgcagc caacaacaga tacgtaattc aacgaaaatt 600 ccttaactac gactaccaac aaggtaattt actcagtctg ttcatacagc aaaccatatt 660 ttttttaatt tgtgcccctc tcgttgtcca ggccatgacg ccatgtccca catctcggcc 720 attgagaacc ttgtcgaaca actcaaaaac atgggtgaag ccccaactct tttgcaaatc 780 tgcaccaaaa ttatctacac tcttccacct cacctgcgtg gattcattgc aacatgggag 840 gctcttccag aacaagaaca gacgattgca ttgctgacag caaagattct gaacgaagag 900 agtaaaaacg aaatgtttca gcctcaagat cttggatact ttgcagctag caaagggtat 960 cgaggtggca gcagaggagg atacggtggt agaggaggcc attcgggcgg tgtcagattt 1020 caagacggcc cacctgccaa aagagccagg tatactggtc caccatgtga attctgtaca 1080 accatgggca ctaccgcagc acatccaacc tcagaatgta gaaaacgaac tcacgcagct 1140 aaacgagact cagcaaaccc cgcctttatc aactcaccga ccgtggactt tggcttccca 1200 gccaccttca gtcccctagc tcgtcttgat ttaaccgatt ggttcgctga ctctggagcc 1260 acggcccatc ttacaaatca gaagaacatt ctacaaaaca tcgtcactgt tccagtaggc 1320 agctgggtca tacaaggcat tcatcaggca actgccaacg tccacgccta cggcgatgtc 1380 ttatttgaat caacagttaa tggaaagaca cggaatggcc tactcaaaag agtactgtat 1440 gttccagacg gcgggatcaa tctgatctcg attggcgctg ttactgctct cagtacacaa 1500 gttacattct ttggtgaaga ggtttcattt acaaaaaacg acgttcttga agtaactggc 1560 cgcagagcag gcagaactct ctataaactt aacctcaaag caatcattca gtcaatccaa 1620 gaagagcaag cctgtgtcgc aagacaatca gctacctcct taatcaactg gcattttcgt 1680 cttggtcacg tcaactgcaa aacaattcaa cgcatggcgt caaccaacgc agtggacgga 1740 ctcgttatca caaaccacga cgtacctcag gtgtgcgagg gatgcatcta tggaaaattc 1800 agccgctgcc catttcactc ccgttctgaa aaaccaattg aagtaggaca agtaatcgta 1860 gcagacgtcg gcggtcccat gcaagagtcg tcccttggtg gtgccaacta ttatgttgca 1920 ttcaaagata aatgtagcgg ccatcgacaa gtctacttcc tcaaaaacaa atcagaagtt 1980 gccggtaaat tcaaatgctt tgtttctcgt tttcaaaacg aaactggtaa actaatcaaa 2040 tcgcttcgaa cagataatgg caccgagttc aaaggcatcg cctggacctg ggctgaagaa 2100 gcaggcatca ttcgacaata caccgttgcc tactcccccc agcaaaatgg aacctcggag 2160 agagacaacc ggacaatcgt agaatcatca cgaagcgccc tgtacgggaa gagacaccct 2220 gtctctctgc aagtgttact acggctctgg gccgaagcaa ttgcgtactc cgtgtacact 2280 cttaatcgta cgctgtcccg caccaggatg gtcacacctt atgaaaaata tcacgataaa 2340 acacggcgca aatgggacaa gaaaggcgaa aagggaatgt tcctgggata tgatgacact 2400 tccactggat tccgtgtctt gattttagaa acctttaaaa tcaaagttag cgtcgacgtc 2460 acattttacg aaggaaaggt tccaagcacg aagcaacaaa accacgacac tcctccaaag 2520 ttcccatttg caaccaaaca gcaaccagat gatgcaaacc aaccctctga cgagaaggac 2580 acaccgtcag aagataacga agaagcctcc gtcgtcccat cgacacaaac gaaggcgaat 2640 gatgctcaat ctgaacgaga cggaaatgct gaccacccaa gagacgaact ggtttccgat 2700 aaggaagtca acgacgcaac cagtcacgaa gaaattccgg tctgcgccaa tcgacggtat 2760 ccgctacgtg atcgtaaatc caaagtgatc aagagcatgc agtcgatcga cgtccataac 2820 tccgagccat tcgagcctcg cacgtaccag gaggctatga gctgtgagga agcgcacctc 2880 tggcagccct ccgttcaaga cgaatacgac tcacacattc agaacggcac atggattctc 2940 gtaccgttac ctcccggacg caaggccatc ggatgtcgct ggatctacaa agtaaagcca 3000 ggccacctga gcacacctgc ccgctataag tcacgtctgg tcgccaaagg gtacgcccaa 3060 atcgactact cctcagtgat tggttacggc cccttacgtg gtatattatc aatgtgcgca 3120 gctctgaatc tagacatggc ccagttggac atcaaaacgg cctttctcta cggcctagtc 3180 gaagaagaaa tttacatcca gcagccggaa gggttcatcg taccaggcaa ggagcatcta 3240 gttggcaggc tagtaaaatg catctacggc ctgaaacagt ccccacgggt gtggaactgt 3300 aaattcaacg acttccttct caaattcggc ttcacccgaa gccaacacga cccctgtgtc 3360 taccaccgtc gccgagagga ggagattttg gtcatcgtga tctgggtcga cgacggcctc 3420 gtgtgcagca atagcaagac agcaatcgct gaagtcttgg ccttccttgg cgaaaatttc 3480 gaaatgagga cactgccaac agaccgcttc gttggactcg agattgaaag gattagggag 3540 gaaaaaaagc tgtttgtcaa tcaagttgga ttcatcgaaa aaattctatc cagattcaac 3600 atgtcagact gcaaccctaa gtcgatgcca gcgaatcccc acgtacgttt atcgaaagaa 3660 atgtgcccct caacgagaac agaaaaggct gcgatggaaa cggtgccata tcaacaagca 3720 gtgggctgcc tgaactatct tacacacgct acgcggccag atattgcgtt tgccgtgaac 3780 caagtctcgc gctactgcca caatcctgga tcccaacact gggaagcggt caaacatatc 3840 ctggcatacc tcaaaggaac cactcagtac ggcatatgct ttaatggaag tagtgggagt 3900 agtgactgtg ccctggtcgg ctacaccgac tctgattacg caggtgacct agatcactat 3960 cgctctacaa caggatacat tctcttcttt aataatggtc cagtcagctg gtgtagcagg 4020 agacaaccgt cgaccgcatc cttcaacact caagccgaat atcaagcact gagcgacggg 4080 tcaagagaag cggccttctt tatgtggctc ctacaggagc taggtgttga gagtaaacga 4140 gccgtttccc tcttctgcga caacaacgcc gcgatacagc tagccagcaa tcctgtgttt 4200 cgatcaagaa cgaaacacat caacgttgca taccacattg tccgtgatta tgtagaggcg 4260 caccagatca aagctgaacg ggtggatacg aaggacaacc tggccgacat cctcactaaa 4320 cctcttccgt caaccgactt caaaagattt cgtgaactca tccgggttcg gccttttcca 4380 gagtaaaagc ccacccgcta aagaagttaa aggggcataa ttgctgcctt gaatgtcgta 4440 tttacctctt gtttcctctt taatttttga gtgggag 4477 // ID Copia-122_AA-LTR repbase; DNA; INV; 206 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-122_AA_; KW Ty1_copia_Ele101; Copia-122_AA-I; Copia-122_AA-LTR. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-206 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX SQ Sequence 206 BP; 53 A; 50 C; 28 G; 75 T; 0 other; tgttagaata ccaatttcct acccctctcc ttaataccac actggcgaac cccccttgat 60 acgctcctga cattaaccat gtgttgttta cttttttcat tacacaactg tacttgagtg 120 aaataaaacg tttatctttt ctttgcatta atcgcggtgt tacgttttat tccggtagag 180 aaaacccgat tgcctaatat ttttca 206 // ID Gypsy-1_AC-LTR repbase; DNA; INV; 207 BP. XX AC AASC02000433; XX DT 18-JAN-2011 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from Aplysia californica (California sea DE hare): long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_AC_; KW Gypsy-1_AC-I; Gypsy-1_AC-LTR. XX OS Aplysia californica OC Eukaryota; Metazoa; Mollusca; Gastropoda; Heterobranchia; OC Euthyneura; Euopisthobranchia; Aplysiomorpha; Aplysioidea; OC Aplysiidae; Aplysia. XX RN [1] RP 1-207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from Aplysia californica (California sea RT hare)."; RL Direct Submission to Repbase Update (02-FEB-2011). XX DR Genome; AASC02000433; Positions 32010 32216. XX SQ Sequence 207 BP; 43 A; 47 C; 45 G; 72 T; 0 other; tgtcacggta gtcaccgtat tgtaccttac tcttttttac tcacgggacg taagcgccct 60 gtatgccatg ggctcttggt taatgttact cacttcgggg atctataaaa agtcagacac 120 tacacccggt ctcattacat ttagaatgct tgctgtcgtg tgtgttcgtc tctagttttg 180 ctgtcgtgtg tgtagactaa cattaca 207 // ID BEL-101_AA-I repbase; DNA; INV; 2594 BP. XX AC supercont1.335; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-101_AA_; KW BEL-101_AA-LTR; BEL-101_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-2594 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; supercont1.335; Positions 1038439 1041032. XX CC 'CGCTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 38..1693 FT /product="BEL-101_AA-I_1p" FT /translation="MSDVGEMMITACSACGTTSEVDERMIGCDNCNRWFHT FT RCVGITAASEKECKWFCPATECQQQYSEYLKKVEKQERKKKNKTNKALGVD FT SDRSSIKSARSVSNPGSELEKRLKELEKEQQAKEREIEIEMILLEKRMEME FT RALKEKRMRLENAIRAKQREDERELLEKALSDEKNHMAELKKMRDSFEAKM FT SSVQKEMNQLQMKGKGSQQKEKVLRTPLRNPSKTVPRQLKNKKRKEKLAQY FT RCEEELDDSEGDSEDEEEEEDSNSEEEESEEKSEEDSSEEEEEEQMKKSKN FT AKKEVANGLGRQRSVPTKMQMAARNGVTKKLPVFTGRPEEWPLFIGSYEAS FT TEACGFTDIENLVRLQESLKGQALESVRGQLLFPKCVPKVVSKLRQLYGRP FT EQLLQCHLEKVHQLEPPKADKLATFIPFGTAVEQLCEHLEAAGLKQHLVNP FT LLLQDLVDKLPSSDKREWVRYKKRKRTVTLRTFTNFLSDIVAEACEANVSI FT EFKPEGNSGSSGFEKGRFHERGAVYNHSATEAVINEPATMMSARMRAKCAS FT ARTIA" XX SQ Sequence 2594 BP; 804 A; 480 C; 795 G; 515 T; 0 other; ttctcaaaaa taaatatcgg acgtaacctc aaaagcgatg tcggatgttg gagagatgat 60 gatcacggcc tgcagcgcct gcggaacaac ttccgaagtg gacgaacgga tgattgggtg 120 cgacaactgc aaccgctggt tccatacgcg gtgtgttggc attacggcgg catcggagaa 180 ggagtgcaaa tggttctgtc cggccaccga gtgtcagcag cagtacagtg agtacttaaa 240 gaaagtagag aagcaggaaa ggaagaagaa gaataagacg aataaagcgt taggggttga 300 ttccgatagg tccagtatta agtcagctag gtcagtcagc aatcccggtt ccgagttgga 360 aaagcgcttg aaagagttag aaaaggaaca gcaagcgaaa gaaagggaaa tagagattga 420 aatgatcctg ttagagaagc gaatggagat ggaaagggct ttgaaggaga aaaggatgag 480 gttggagaac gccatcaggg caaaacagcg agaagatgag agagagcttc tggaaaaggc 540 tttgtcggat gagaagaacc acatggcgga gttgaagaag atgcgtgact cgtttgaagc 600 gaaaatgagt agtgtgcaga aagaaatgaa tcagctacaa atgaagggaa aaggaagtca 660 gcagaaggaa aaggttttga gaacgcctct caggaacccg agcaaaacgg tgccgaggca 720 gctgaagaat aagaaaagga aagagaagct ggcgcagtat cgttgcgagg aagaattgga 780 cgatagtgag ggagactccg aggacgaaga agaagaagaa gactccaact cggaggaaga 840 agaatcagaa gaaaaaagtg aagaagatag ttcggaggaa gaagaagaag agcagatgaa 900 gaaaagcaag aacgcgaaga aagaggtcgc aaacgggctg gggcggcagc gcagcgtgcc 960 gacaaagatg cagatggcag cgaggaatgg agtaacgaag aagttaccgg tattcaccgg 1020 cagaccggag gagtggccgc tgtttatcgg ttcgtacgag gcatccactg aggcgtgcgg 1080 atttacggac atcgaaaatt tagtgcgtct tcaggaaagc ttgaaaggac aagcgttgga 1140 aagcgttcga gggcagctac tatttccgaa atgcgttccg aaagtggtta gcaaacttcg 1200 gcagctatat ggacgtccag agcaactgct tcagtgtcat ctggagaaag tgcaccagct 1260 agagccgccg aaagcagaca agttggcgac cttcattcct ttcgggacag cggttgagca 1320 gctctgcgag cacttggagg cagcaggttt aaagcaacat ctagtgaatc cattgttgtt 1380 acaggacctc gtagataagc tgccgtctag cgataagcgc gaatgggtac gatataagaa 1440 gagaaaaagg acagttacgc ttcgtacatt caccaatttt ttgtcagata tcgttgccga 1500 agcctgcgaa gctaacgtgt ccatcgagtt taaaccggaa gggaattcag gtagctccgg 1560 tttcgagaaa gggaggtttc atgagcgagg tgctgtttac aaccacagcg caacagaggc 1620 agtcataaat gagccggcaa caatgatgtc agcaagaatg cgtgcaaagt gtgcaagcgc 1680 acggaccatc gcctgagatt ttgccaggat ttcaggagca tgacaccggt agatcgcgtg 1740 aacttcgtgc gaaaatggaa gctgtgcggc atgtgcctaa atgatcacgg caatgcagag 1800 tgcaacttca ggatgcggtg tacgataggc gggtgcatgg aaaggcataa gcctttgctg 1860 catgtcggat ccggattagt cgtgatgaac gctcacatcc gattgaacag caatattaaa 1920 taccgaatga ttccagtaaa gctgtcgttt cgtgacaaat ccgtaaccac gttagcgttt 1980 ttagatgagg gagcatcggt cacaatgatc gagaaggaac tggctgatga gttggaagct 2040 gagggcattc cgcagaagct cgaaatcagg tggactggag atgttgcccg tgtcgaggag 2100 gactcgaagc gaataagtct gaagatatcg ggactgggag gagcgcatca gctgctgatg 2160 aatgaggtgt gcaccgttgg agagttggcg ttgcctgagc aatcgctaga tgctcacgag 2220 atggcagagc gatacgacca ccttcgggac ataccggtta catcgtataa taaaggccgc 2280 ccaaggattc ttatcggatt gaataatctt catacgatgg cgccaatcga tgccaggtta 2340 ggcggatcaa gagaaccgat agcggtcaaa tcaccgttgg gatggacaat atacggtccg 2400 agtaatatag ttgaatcgtc caatgctcat gttgttggtt agtttgggca agatagccgg 2460 actttcgtta atgaagcata tgcgaagtag tagtgttctg gcgaaaggta gtcaaccgat 2520 ataagaggaa aaagtgactg gtaggtaatt ccggagtaac cggtgaacgg ctaccggagt 2580 tacgggctgg ggag 2594 // ID Gypsy15-I_Dya repbase; DNA; INV; 5028 BP. XX AC chr3L; XX DT 18-MAY-2009 (Rel. 14.05, Created) DT 18-MAY-2009 (Rel. 14.05, Last updated, Version -1) XX DE LTR retrotransposon from fruit fly: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy15_Dya; KW Gypsy15-LTR_Dya; Gypsy15-I_Dya. XX OS Drosophila yakuba OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora; OC melanogaster group; melanogaster subgroup. XX RN [1] RP 1-5028 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from fruit fly."; RL Repbase Reports 9(5), 1097-1097 (2009). XX DR Genome; chr3L; Positions 9932021 9937048. XX CC Positions [2231-2773] - Reverse transcriptase CC Positions [4109-4588] - Integrase core CC 'TATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(53..1441,1445..4993) FT /product="Gypsy15-I_Dya_1p" FT /translation="MEYSVILKKSKTELIDILKANEVYFGEDATLVQLRNA FT VKSVMAENKNKTIASNLVKTIEADEEPTNVTIPKSEIKTIGGGNENVHEDS FT DEIKIYEKSITVTGDSGSMHREERDNHSVAPSGSVDDEIPGKDDEITTRNE FT AIRKNEMLNEKKHDLSREGEYVQLEMQHLNNLENLYLARMRVVELEADFKL FT MEGGARTVRNDVIGMKDIEALINCFSGDDGYPIRRWINDFEHIMNTYDVPD FT QRRWIFARRMLSGSAKAQMINVNPLTWNTLCKELLAIFEQKVTAWDVIKQL FT EARQKFPDESTLHYYIAMCGISKQATLADSDVIRAIVSGLQDKTGAATSLA FT FCESLDELRKKITIYDKVKISDQRVAAANVDSSSNVKMIKSTNLQQRPTAV FT RCYNCRKLGHIMSECRRPKRPEGACFNCYSQDHKYQDCPRKRGVTTVAAMQ FT DGELSDTDDLAAVQSVKLFLYGEERCITVLSLFDTGSPVSFIQKSLVPNNV FT KLKSQRSKYHGIGGQQILIFAKIECSVYFHKKAYKVSLLVVEDTLLPLPTL FT LGRDSLKIMEVQLVHKKQINAEYNFSNKNINNDSNNKFSASLFTSKYITHY FT NKKTNINNSSCLSKNKLNNQKLNNQKDINSYQQSNQLEVEQFFSNYAGYIE FT EELKELISVGNGFGKDYEEKCSEIVKTHYLSKNLDYKKAPIYQMTIRVTDT FT TPFHCSPRRLSYYEKEKVGEIIDDLLKKNVIRKSKSPYASPIVLIKKKSGE FT LRMCVDYRGLNKRIVKDNFPLPLIEDCLEYLEGKICFSIMDFKSGFHQIGV FT DSQSVPFTSFVTPHGQYEYLRMPFGLKNGSAIFQRFITEVLVDFIQNKQIV FT VYMDDIVIATKTVKEHLEILKKLLDRISENNLEINLKKCEFLKPECEFLGY FT KINRSGITPSEAHLDALKNFPIPQNLNTLHSCLGFFSYFRRFVLSFAKIAK FT PLTDLLKQGGPFPLNEEQLGAFKSLKTVLSNPPVLAIFNPNRETELHTDAS FT SHGFGAVLMQRQDDGKMHPVSFYSGKTNAAESRYHSFELETLAIINALRRF FT RVYLEHRHFKIVTDCNSLVQTLSRKAINPKIARWSLELENYDYSIVHRPGV FT NMPHVDALSRQVQVLDALRENSDNAIESLAKPLMSSEHSDNTVESIAKSIG FT NNRNLESRVGNRRYMPHCESRDETQSNISEDHNFSIEDSKDSWGGVPNELE FT FREKSIVESIKVVGVVGATPRDIDLILQTEQMRDPEIIKIRTAVEEGSCTS FT FELVDGVIYKKNSDESLCLYAPSCMEEQIIRLSHEKLGHLAAEKCVKDLKR FT NYWFPAMSSKVGNFIRNCLPCILHSVPKTIHNRTLHSIPKPCIPFHTIHID FT HLGPLPSIHSKRKHLLVVIDSFTKFVKLFPVNSTSTREAKDSLSRYFDFYS FT RPKRIISDRGSCFTSLEFSTFLQEREIEHIKVATGAPQANGQVERVNRILT FT PMLGKLSEPIEQADWYKLLNKVEFAINNSIHSSTQESPSILLFGVSQKGPI FT VDELSEYIEDKFNLEQKDLDSIRLKANRNIQHSQYKNEQYYAKKHKAPLEY FT KVGDFVAIRNIDTTPGVNKKFAPKYRGPYKVNRVFENDRYEIVDVDDCQLT FT QLPFKSILEPARLKHWKTVVNHTIALLL" XX SQ Sequence 5028 BP; 1827 A; 778 C; 1026 G; 1397 T; 0 other; aaaatcaggt gtggggtata tcccaaaaca gttataataa agcaagacta cgatggaata 60 tagtgttatt ctaaaaaaat caaaaactga gttaattgac attttaaaag caaacgaagt 120 ttattttggc gaagacgcga ctttagttca actgcgcaac gccgttaaga gtgtaatggc 180 tgagaataaa aataaaacca tagcgtcaaa cctagtgaaa accatagagg cagatgaaga 240 accaactaat gtgacgattc ctaaaagtga aattaagacg attggcggtg gcaacgagaa 300 tgtgcacgag gattcggatg aaattaaaat ttacgagaag tcaataactg taaccggtga 360 cagtggcagc atgcacaggg aagaaagaga caaccatagt gtagcgccaa gcggcagcgt 420 ggatgatgaa atacctggaa aagatgatga aataactacg agaaatgagg caataagaaa 480 gaatgaaatg ctaaacgaga aaaagcatga tttatcaaga gagggtgaat atgtacagtt 540 agagatgcag catttaaaca acttggaaaa cttgtaccta gctcgtatgc gtgttgttga 600 gctggaagca gattttaagt taatggaagg aggagcaaga accgtgagaa atgacgtcat 660 tgggatgaaa gacatagaag ccttgattaa ttgtttttcg ggagatgatg gctacccgat 720 acggcgttgg ataaatgact ttgagcatat aatgaacacg tatgacgtac cagatcaacg 780 ccgttggata tttgctaggc gtatgttaag cggatcagca aaagctcaaa tgatcaatgt 840 gaacccttta acatggaata ccctatgcaa ggagctgctg gcgatttttg aacaaaaagt 900 gactgcgtgg gacgtgatta aacagcttga agcgagacaa aaatttcctg atgaaagtac 960 gttgcactat tacatcgcta tgtgtggcat ttcgaagcaa gcaacgctag cggattctga 1020 tgtaattcgt gcaattgtga gtgggctcca agacaaaaca ggagctgcca cttctttagc 1080 cttttgtgaa tcgttggatg aattgcgcaa gaagataact atatatgata aagtaaagat 1140 atctgatcaa cgggtggctg ctgcaaacgt ggatagttcg agtaatgtaa aaatgattaa 1200 atcaacaaac cttcagcaac gacctacagc tgtgcgttgt tacaattgcc gaaaattggg 1260 gcatataatg agcgagtgca gacgcccaaa gagacctgaa ggtgcctgct tcaactgcta 1320 cagccaggac cacaaatacc aggactgccc gaggaagagg ggcgtcacta ccgtagcagc 1380 catgcaagat ggggaactat cggacactga tgacctggca gcggttcaat cggttaagtt 1440 atgatttctt tatggtgaag agaggtgtat tactgtttta agtttatttg acacaggaag 1500 tccagtaagc tttattcaaa aatctttagt tccaaataat gttaaattaa aatctcaaag 1560 aagtaaatac catggaatag gtggacaaca aatattaatt tttgcaaaaa ttgaatgcag 1620 tgtatatttt cataaaaaag cttataaagt tagtttatta gttgtagaag acacattatt 1680 accattacct accttactag ggcgagattc acttaaaata atggaagtcc aattagtaca 1740 taagaaacag ataaatgcgg aatataattt ttcaaataaa aatattaata acgacagtaa 1800 taacaagttt tctgcatctc ttttcaccag taaatatata acccattaca ataagaaaac 1860 taatataaat aattctagtt gtttaagtaa aaataagttg aataaccaaa aactaaataa 1920 tcaaaaggat ataaatagtt accaacaaag taatcaatta gaagtagaac aattttttag 1980 taactatgca ggatatattg aagaagagtt aaaggaacta attagtgtag gaaacggatt 2040 cggaaaagat tatgaagaaa aatgtagtga aatagtaaaa actcattatt taagtaaaaa 2100 tttagattat aaaaaagctc ctatatatca gatgactatt agagttacag atactacacc 2160 ctttcattgt tcacctaggc gtttgtcata ctatgaaaaa gagaaagtag gtgaaataat 2220 tgatgatctt cttaagaaaa atgttattag aaaaagtaaa tctccttatg catctcccat 2280 agttttgatt aaaaagaaat ctggtgaact tagaatgtgc gtagattata gaggacttaa 2340 taagcgaata gttaaagata atttcccatt accgttaatt gaagattgct tagagtactt 2400 agaaggcaag atatgttttt caataatgga ttttaagagt ggcttccacc aaataggcgt 2460 agattcacag agtgtaccgt ttacctcatt tgtaacaccc catgggcaat atgaatattt 2520 aagaatgcca tttggcttaa aaaatggatc agctattttt caaagattta ttacagaagt 2580 attagtggac tttattcaaa acaagcaaat agtagtgtat atggatgata tagtgatagc 2640 taccaaaact gtaaaggaac atttagaaat cctaaagaaa ttattagatc gtattagtga 2700 gaacaaccta gagattaacc taaaaaaatg tgaattttta aaacctgaat gtgagttcct 2760 tggctataaa atcaaccgta gtggaattac acctagtgaa gcacacctag atgctttaaa 2820 aaatttccct ataccgcaaa atttaaacac ccttcattcg tgcttaggat tcttctcata 2880 ttttaggcgt tttgtattat cctttgcaaa aattgcaaaa cctttaacgg atttattaaa 2940 acaaggaggc ccatttccac ttaatgaaga acaactagga gcttttaaat cgttgaaaac 3000 cgtattatcg aatcctccag tattggctat ctttaaccct aatcgtgaga ctgaacttca 3060 tacagatgct agctcgcatg gctttggagc cgtccttatg cagagacaag atgatggcaa 3120 gatgcatcca gtatcgtttt attcaggtaa aactaatgca gctgagtcta ggtatcatag 3180 ttttgaattg gaaaccctag cgattattaa tgcattacgt cgatttagag tttatttaga 3240 gcataggcat tttaaaatag tgacagattg taattcttta gtgcagacac tgagtaggaa 3300 agccataaac cctaagatag cgagatggtc attggaattg gagaattatg attattctat 3360 agttcataga cctggtgtta atatgcctca tgttgatgcg ttgagtagac aagttcaagt 3420 attagatgca cttagagaaa atagtgataa cgccatagag tccctagcga aacccctgat 3480 gagtagtgaa catagtgata ataccgtaga gtccatagct aaatccatag ggaataatag 3540 gaacttagag agtagagtag gaaataggag atacatgccg cattgtgaaa gtagagatga 3600 gactcagtca aatatatcag aagaccataa ttttagcata gaggatagta aagacagctg 3660 gggtggtgtc ccaaatgaat tggagtttag agaaaaatcc atagtagagt cgattaaagt 3720 agtgggggta gtaggagcaa cacctaggga tattgacctt atattgcaaa ctgaacaaat 3780 gcgtgaccct gaaataatta agataagaac agccgtggag gaaggtagtt gcacttcatt 3840 tgaattagta gatggcgtta tttataaaaa gaatagtgac gaatccttat gtctgtatgc 3900 tccgagttgc atggaagaac aaataattag gttatcacat gaaaaattag gccacctagc 3960 tgctgaaaaa tgtgtaaagg atttaaagag aaactattgg ttcccagcta tgagtagtaa 4020 agtaggaaat tttataagaa attgtttacc gtgtatattg cattcagtac cgaaaaccat 4080 tcataatcgt acactccata gcattccaaa gccgtgtata cctttccata ctattcacat 4140 agatcattta ggaccgttac caagcataca ttcaaaaaga aaacacctcc ttgttgtcat 4200 agattcattt acgaaatttg ttaagctatt tccagtaaat agcactagta ctcgtgaagc 4260 aaaagattca cttagtaggt attttgactt ctatagccgt ccaaaacgaa ttatttctga 4320 caggggatct tgttttacct ccctagagtt ttcgaccttt cttcaagaac gtgaaataga 4380 gcatatcaag gtggctacag gcgcacctca agccaatggg caagtagaga gagtgaacag 4440 aatcttaaca cctatgttag gtaaattatc agagccaata gagcaggcgg attggtataa 4500 gttattaaat aaagttgagt ttgccattaa taattccatt catagtagta ctcaagaatc 4560 acctagtatt ctattatttg gtgttagtca aaaaggccct atagttgatg aattatcaga 4620 atacattgag gacaagttta atttagagca aaaagactta gatagtatcc gtttaaaggc 4680 taataggaat atacagcact cacaatataa gaatgagcaa tattatgcaa agaaacataa 4740 agcccctctg gaatataaag tcggagattt tgtagctata cgaaatattg atacaacacc 4800 tggagttaat aagaaattcg cacccaagta tcgcggccct tataaagtaa atagagtatt 4860 tgaaaatgac cgttatgaaa ttgttgatgt agatgattgt caattaaccc aattaccatt 4920 taagagtata ttagaaccag cgagattgaa acattggaaa accgttgtaa accatactat 4980 agcattacta ttgtagatcg aggtcgatca taagtcaggt aggccgaa 5028 // ID hATm-9_HM repbase; DNA; INV; 3577 BP. XX AC . XX DT 22-MAR-2008 (Rel. 13.03, Created) DT 22-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE hAT-type family: consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hATm-9_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-3577 RA Jurka J.; RT "Families of hATm elements from Hydra magnipapillata."; RL Repbase Reports 8(3), 213-213 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX FH Key Location/Qualifiers FT CDS join(575..1219,1191..1835,1839..2432,2484..3041) FT /product="hATm-9_HM_1p" FT /translation="MAKRTRKNSQSCKLSVMIGFASEFLDSELPTLRDVIK FT YGLLLRRDENCKANAKLKDTCKKVFSKVKTQWLKANPEIPIVTDKVGVNKI FT IKVWKKATDIINYRCKPEIISKFNSTLDKLFDIAKCKCDIVSCEEASCNSC FT NFKAHISCQCNRENKIPTIELQFMKFQREKFGTKSSMQMGASDFKETNRQE FT KNNKRKATGKKKPQIEYLSFSFSRKNTYLFHSQENEIKETIDNIANTDDED FT VVSDNSFEYISRQVSKKTLNMEEVALASIRYGVSNRATAAISTLALKAAKE FT AGFLKKKVFDSEIVTDHKKIERSKLKVMKQLKSENDNVVKDGVQCILFDGR FT KDETKCSLMGEDGKFHHRLVQEDHYSVCAEPGGLYLYHFTNDKENRSECNT FT PAEHIAEHIYKWIIEHGVESTLVAVGCDSTNLNTGKLIINTGKYLIINVIH FT IFKHNYFLGWKGGIIHFLEKKLQTKLIWIVCALHTNELPLRHLMSIIDGKT FT TSNNKFSGNIGKLVPIANDLIISTVIQDIDTEINLLDIDEDILKTLSSDQK FT YLYRITKAIENKIMPKNLKEIQVGPHNNARWLNLASRLCRIWCSEHGLNKN FT DVENLKQLVQFCVGVYSVLWFEIKVRNVYMVKHKWTNGPYHIIKQLELVRK FT MNLNIQQAVLPYVCSSAWNAHSESILQALLCSKDKEERKFAVNKILSIRSS FT SSLGITAVRPRNIPNINTNAETIIELIDWSNDIYEPLLTCKIGKEELIKFL FT EVPMEVPEFPVHGQSIERIVKEVTRASSIVYGQERRDGFIKATIAHRELLP FT VLESKQNLLKLF" XX SQ Sequence 3577 BP; 1322 A; 480 C; 623 G; 1150 T; 2 other; tattagggcg ggtcattttt caactttttc cagtaaatgc ttaggccata tgatttttaa 60 atgttttggc ccttatataa aagtaatcca aaaaaaaaaa waaaaatttt atttttaggt 120 tttcaaattt tgacctaaaa aatcgaggtt ttgggtaaaa aacgctatat acgcggtttg 180 ccgcaacttt tgaccggttg ctccaattga ctcaaaaaca tcttttatgg aatcctcgat 240 cccaaatacc cttaattaga cacctagata gtcattttag cttaaaaact tataattata 300 aaagtttaca aaaatcgtta tttttgatat attttgctgg aatatcttgc gtagttacaa 360 taagatgttt tggttaaact taaatttatt tagtagatta tatgaaaaaa cttatgaaat 420 atcattcata agttgctgca tttgaattgc cgcattttaa accgttgctt tagcatctga 480 atttgctgca ttttaaatgc tgcatttgag ttgacacgtt aaatctacaa taaatattgt 540 ttataaacta ttgataactt tgagttcttt atcgatggct aaacgaacac gtaaaaatag 600 tcaaagttgc aaactaagtg ttatgattgg ttttgcaagt gagtttttag attctgagtt 660 acccacatta agagatgtta taaagtatgg tctgcttcta agaagagatg aaaactgcaa 720 agcaaatgca aaacttaaag atacatgcaa aaaggttttc tctaaagtga aaacacaatg 780 gttgaaagcg aatcctgaaa ttcctattgt gacggacaaa gtgggtgtta ataaaataat 840 aaaagtatgg aaaaaagcta ccgatattat taactacagg tgtaaacctg aaattatttc 900 taagtttaac tctactctgg acaagctatt tgatatagca aagtgcaagt gtgacattgt 960 ttcttgtgag gaggcaagtt gtaattcttg taattttaaa gcgcatattt cttgtcaatg 1020 caatagagaa aacaaaattc ctacaataga acttcagttt atgaaatttc aaagagaaaa 1080 atttggaaca aaaagtagta tgcaaatggg tgcttcggat tttaaagaaa ccaatagaca 1140 agaaaaaaat aataaaagaa aggccactgg aaaaaagaag ccacaaatag aatacctatc 1200 tttttcattc tcaagaaaat gaaatcaaag aaacaattga taatattgcg aatactgacg 1260 atgaagatgt tgtttctgac aatagttttg agtacattag tagacaagtt tcgaaaaaga 1320 cattgaatat ggaagaagtt gcattagcat caataagata tggggtatca aatcgagcaa 1380 ctgctgctat ttcaacatta gcactgaaag cagctaaaga agcaggcttt cttaaaaaga 1440 aagtgtttga cagtgaaata gtgacagatc ataagaaaat tgaaagatct aaattgaagg 1500 tgatgaaaca attaaaatct gaaaatgata atgttgtaaa agatggagtt caatgtattt 1560 tatttgatgg aagaaaggat gagacaaaat gttctttgat gggagaagat ggaaaatttc 1620 accacagact tgttcaagaa gaccattatt cggtttgtgc agagccaggt ggtttatatc 1680 tgtaccattt tacaaatgat aaagaaaacc gatctgagtg taatacccca gcagagcaca 1740 tagcagaaca tatatataaa tggattattg aacatggggt agaatcaact ctagttgctg 1800 taggctgtga ttctaccaat cttaatacag gtaaatgatt gataataaat acaggtaaat 1860 atttaataat aaatgtcata catatattta aacacaatta ttttttaggt tggaaaggag 1920 gtatcattca ttttttagaa aaaaaacttc aaacaaaact tatttggata gtttgcgctt 1980 tgcacacaaa tgaactaccc ttgaggcact taatgtcaat tattgacggg aaaacaactt 2040 caaataacaa gttttctggc aatattggta aacttgttcc tattgccaat gatctcataa 2100 tttccactgt gattcaagat atagatactg aaattaattt attagatatt gatgaagata 2160 ttttaaaaac tttatcgagt gatcaaaaat atctataccg aataacaaag gctattgaaa 2220 ataaaataat gcccaaaaat ttaaaagaaa ttcaagttgg tccccacaat aacgcacgct 2280 ggttgaattt ggcaagtaga ctttgcagaa tttggtgttc agagcatgga ctaaataaaa 2340 acgatgttga gaatttaaaa caacttgtac agttttgtgt tggtgtatat tctgtattat 2400 ggtttgaaat taaagtaaga aatgtttata tgtagtttga taatcgtcta tttatttttg 2460 aaaaatgtat tattaatgtt taggtaaaac ataaatggac caatggacca taccatatta 2520 tcaaacagct ggaacttgtc cgaaaaatga acttaaatat tcaacaagca gttttgccct 2580 atgtttgctc ctctgcctgg aatgctcata gtgaatctat tctacaagca ctactatgca 2640 gcaaagataa agaggagagg aagtttgcag tgaataaaat actttcaata aggagttcga 2700 gctccctagg aattactgct gttagaccaa gaaatatccc aaatataaat actaatgctg 2760 aaactattat cgaattgatt gattggagta atgatattta cgaacctttg ctcacatgca 2820 agattggtaa agaagaactt attaagtttt tagaagtccc aatggaagtt ccagaatttc 2880 ctgtacatgg gcaatccata gaaagaattg tgaaagaagt tacaagagct tctagtatag 2940 tttatggaca ggaaagacga gacggattta taaaggcaac aatagctcac agagaacttc 3000 ttcctgtttt agaatctaaa cagaatttgc ttaagctttt ttaagcaata agtttatatt 3060 tattgtatat ttagtaaaat taagtttact cttgcattgt gaatgataaa ttggtaaata 3120 ctgctataaa caataattaa tgatttttca tgtttctaat tggttgtgcc tcgcgttgtg 3180 gttgagaaat cctttgaaac tgttttgctg cattacattc acataacatt gttatggaag 3240 atattacagc aaaatatatc aaaaataacg atttttgtaa acttttataa ttataagttt 3300 ttaagctaaa atgattatct aggtgtctaa ttaggggtat ttgggatcga ggattccata 3360 aaagatgttt ttgagtcaat cggagcaacc ggtcaaaagt tgtggcaaac cgcgtatata 3420 gcgtttttta cccaaaacct cgatttttta ggtcaaaatt tgaaaaccta aaaaataaat 3480 ttttttttgg attactttta tataagggcc aaaacattta aaaatcatat ggcctaagca 3540 tttactggga aaatttgaaa aatgaccygc cctaata 3577 // ID CR1-53_BF repbase; DNA; INV; 4815 BP. XX AC . XX DT 30-JUL-2009 (Rel. 14.07, Created) DT 30-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE Amphioxus CR1-53_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1-53_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-4815 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-4815 RA Kapitonov V. and Jurka J.; RT "Young families of CR1 non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(7), 1624-1624 (2009). XX DR [2] (Consensus) XX SQ Sequence 4815 BP; 1288 A; 1051 C; 932 G; 1544 T; 0 other; aacctgacaa cattcatctc tcaagatggc ggcggagcgt aacatgtgag tttgagctcc 60 cgccataccg ggtcatttga ctccatttct ttcatatttc ggacattttt gtggcgtcca 120 gcgttactgt ctcctcttcc agtcaatgcc gggtaggaag caatccaaga ggaaacgcag 180 acctacttct ggagactctg tcgcgcatga ttcagaggat agcagcatgg cgacgcaggt 240 atctctgagc agtatcgctg cgcacctgag tgaatttgca tctaaaacct ctgcaagctt 300 tgacaagtta aatgcctcta tctatcactt gggtgaagac atgaagtctc tgcgtcagga 360 agttgtttct cttcagcaga gcgtatcctt cgctcacgac gagatcgatg cccttaaaca 420 agaaacttcc actaaaaact ccggcttgga acagcgcgtc atcgccctcg aggagaaact 480 gttgttgtca gagctctatt ccaagaagca aaatcttctc ttgtggggtg tacaggctga 540 ggaagaagaa gatgtggtgg agaaaaccta caaattcatg gaggacaact tgaagatgac 600 caacgtcaaa agtatcgcct ttgtgaacgt ccaccgtctc ccaaagatgc gcggaaatcc 660 gatcatcgcc aagtttgtat ccatggcgga tcgcgacctt gttctgcgcc acgcgtaccg 720 tctccagccg aagtctggca tcggtgtcag tgttcacctt cctgtcccca tccagaaggc 780 gaaggagagg ctaaagccag ccttctatga tgcaagaagc aagggcctta agcccaagtt 840 caagcttgtt ggtgccacta tgcacctctg cgttgcggga ggcaagaaat tcaagacacc 900 tgaggactat ttcaattaga caccttcttc tctgctaagt actttctcta cctaacttat 960 gatggcctgg taagggcaac agcctgagag ttacttgaac taactcatag tgactgtatt 1020 ttatgtgatg ttacttttga aacgtgagta ctgtatttac tttgttcttt gtcttttgtt 1080 caaaacggga tcgtctagct tggtagcaga ggttttatta ctcaaagtct gtttgttagg 1140 gatgtccttc atgacatgcc tataagtact gagcagcata tggcagtgat aggaatgttc 1200 gaagctgatc gtcattttgc cgtattgaaa agatgtaaga cctgtataca ttttgatatc 1260 ttcttactgt ttttcccact tcctgtatta gttgtactta gtctcattgc gggtgacatt 1320 catccaaacc ctgggccgtc aactatgtat acctcattta aatacctaaa cattctacat 1380 gccaatgtaa atagtttagt tgccggctca aagttggacg aactgtcatc tcttgcaatc 1440 caacacaaat tggatgttat cgcaatctct gaatcatggc ttggtgacac cattgacaca 1500 agtaacatcg tattagatgg ctttcaatct ccaattcgcc gagataggaa tagacacggt 1560 ggaggagtcg tagtttacgt ttctgaccag attgccttta acagacgcac agatctagaa 1620 actgccactg ttgaatgcat ttggctggaa ttgtgcactg gtaaattccg cattcttttt 1680 gctgtatact atagaccacc tgcgcaggat ttgtatacaa tcaacgagtt tgttgattcc 1740 tttagtgatt cagttttctc agccgggggt actccacatg atgctctcat tatcactggt 1800 gattttaatg caaaacacca ggcatggtgg ccccgtgacc caatcactac tgccggttta 1860 aaattatttc aagcgtcaca gatgttgaat cttacacagg tgattaagga accgacatgc 1920 gacctctcct tgtcagcttc tttgattgac ctgcttttta ctgacactag tggtattgtt 1980 ttgtccacct ctgttttatc tcctttatcg gggtgccacc attccccaat tgttgctact 2040 tttaaactat ctttacagct tccaaggcca tacactagat cagtctggga ctattccaga 2100 attgatttta ccaagcttac caccttcgta tccgatccca aatggaacga catataccac 2160 tgcaattctg tgaacgaagc tggtagtaag cttactgatt taatatatga ggccaaaaat 2220 atctgtgtac ctcacaagac cattaaaatc agaccacgtg ataaaccgtg gatgctccct 2280 cggctgagac ggctaatgcg acagcgtgac aaacttcaca agagggccaa actgtctaac 2340 aatcctgtgc attgggcgtc atatcgtaga attagaaaca aacttgttgg agatattgct 2400 tctgcaaaat ctgattatca caatcgtcta gttcagtctt tgtcggagcc taatacagca 2460 agtaagaagt ggtggcacat cattaagtat ttttataggt gtaatagtat ttctacaatt 2520 cctcctctaa acctcggcaa tactttcaca actgattcga aagaaaaagc atctttgttt 2580 aatgaatatt ttgctgctca gtcttcagtt gatgatagta atgccgttct tcctgaactc 2640 gattatttga ccgatgctcg tctatacgaa tgtgttactt ctgctgagga agttgaactc 2700 tacatatctc tccttgatgt atctaaggcc catggatttg atagcgtaga taatcgtttt 2760 ttaaaactta ttggtccatt tatatcagac aagattgcct ttgtatttaa cttatcactc 2820 tgccatggta cttttcctca aaattggaaa tgtgcaaatg tagttcccat ttttaaaaag 2880 ggagacccgc aacaagtttc taactaccgc cccgtttctt tacttccatc actctccaaa 2940 attcttgaaa aaattgttta caagcacctt tacaaccatt tgttatcgca aagcctacta 3000 agttcatttc agtcgggctt catccgtggc gactccactg tctgtcaact tgtctgcata 3060 actcacaaaa tactcgaagc acttgacgca aataaagaag tacgagctgt ttatctcgac 3120 ttctctcgtg cttttgacaa agtttggcat gatggactta tttttaagtt acgaagaaac 3180 ggggtggaag gaccactctt gaactggttt caaagctatc ttcatggcag aattcagcgt 3240 gttgtggttg atggccagtg ttctggttgg ctcagtgtat gtgcaggtgt acctcagggc 3300 tccgtactgg gccccttact ttttcttata tatattaacg acatgttgga cgatttatta 3360 actcaaccct ttttgtttgc agacgatagc tccttgatcg atattgttga gaatcccata 3420 cttacatcta taagacttaa ctccgatctt cagcgaattc tttcctggtc caataaatgg 3480 cttatggaac ttaacccatc caaaacagaa gaaatttgta tttcaaagaa aaaggttcca 3540 ccaattcacc ctcctctttt tcttggtaat tcagttattc aaacggtgaa acatcataag 3600 catatcggtg ttattttaac gtccaacatg tcatggagta ctcacatcat tcaaatggtt 3660 gcgaaggtgt ttaaaaaggt atccattatt aaaaacctaa aatttaagct gcctcgcaat 3720 attttggaaa atatttacaa gacttttata agaccgtctc tagaatatgc cgatgttgtt 3780 tggcacagct gctctgattc tgactctaga cttatcgagc ggctgcaata tgaatgctcc 3840 ctcacggtat ctggcgccgt gagggggtca tcatattctt ccgtccttca ggagctcggc 3900 tgggaaaaac tatccgagcg acgttatgtc cattctcttc ttctatttta taagattgtt 3960 aatggccaag ctcgtcagta tttaactgac ttgttaccac cggcagtttc tgaggcaact 4020 tcatacaacc ttcgtaacaa atcaaacatt caaatgccag tttgtactac aaatcgtttt 4080 ttaagatctt ttgtaccata ctcaattcag aaatggaact gtcttgatct aagtatccgc 4140 tcgttgagcc cccatttatt caaaaagtac ttaatcaagt ctgtacgtcc tgttgttcca 4200 acttatttta gttccggtcc aagatacccc tgtgtccttc ttacgcgcct tcgtgttggt 4260 acccatagcc taaatcatag tttatctgtg cgtaacttag ttgataatcc ttcctgtacc 4320 tgcgggtgcc gatgtgaaag tatttctcac ttccttttac actgtcctaa ttatattaat 4380 caacgtttac ttttatttga caggataaaa aatttgaata tttcaactct caatactgat 4440 gccatgtctg aaacagctgt tataaattta ctgattaggg gttcacccct cctctcttcc 4500 actgacaatg ctaatgtact ttatattgcc cagatgtatt tattagaaac caaacgtttt 4560 gtataattgt tttcttttgt tataactcca atactgtatt cattattggc atcatatttg 4620 tattgttaat ttgtattatt aatatgttta ctagtatttc atctctgtta tagtgacgat 4680 tcccgtattt ttgtgtttta gttgtgtatt gttcattgta ttctgttagt ggtgccgtga 4740 acataagcgt ttagcttgag tgcggcacca ctgtgtgtgt aatttgtatg ttctttaata 4800 aaaaaaaaaa aaaaa 4815 // ID Gypsy14-SM_LTR repbase; DNA; INV; 109 BP. XX AC . XX DT 14-OCT-2007 (Rel. 12.1, Created) DT 02-NOV-2007 (Rel. 12.1, Last updated, Version 1) XX DE LTR retrotransposon from Schmidtea mediterranea: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy14-SM_LTR. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-109 RA Jurka J.; RT "LTR retrotransposons from Schmidtea mediterranea."; RL Repbase Reports 7(10), 1066-1066 (2007). XX DR [1] (Consensus) XX SQ Sequence 109 BP; 38 A; 6 C; 15 G; 50 T; 0 other; agttaagata tttaaattat cttgtgttta aataattagt tattttggca cttttatgaa 60 ataaatcttg atggaattta attcttttaa aaatgagttg gatttaact 109 // ID Gypsy-16-LTR_HM repbase; DNA; INV; 136 BP. XX AC . XX DT 06-JAN-2009 (Rel. 14.02, Created) DT 06-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-16-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-136 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 9(2), 405-405 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 136 BP; 41 A; 22 C; 19 G; 54 T; 0 other; tgtaatatcc gtgttcggca taatacacac atgcctcgtg ttattattat tacttgaaat 60 aaatatggcg gtttgcacgc tttaaacttt atgttgtatt taattcttag tataacatta 120 tcctcagaat attaca 136 // ID Jockey-13_AAe repbase; DNA; INV; 4276 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.04, Created) DT 22-MAR-2011 (Rel. 16.04, Last updated, Version -1) XX DE A Jockey non-LTR retrotransposon from Aedes aegypti. XX KW Jockey; Non-LTR Retrotransposon; Transposable Element; KW Jockey-13_AAe. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C. and Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-4276 RA Kojima K.K. and Jurka J.; RT "Jockey clade non-LTR retrotransposons from the yellow fever RT mosquito."; RL Repbase Reports 11(4), 1379-1379 (2011). XX DR [2] (Consensus) XX CC This consensus is generated from 20 sequences with >90% CC identity. XX FH Key Location/Qualifiers FT CDS 71..1381 FT /product="Jockey-13_AAe_1p" FT /translation="MGGKRKKKSLSPNKSQSSPLSKKDKRSSASSGVDFGR FT ELNASQSNMYALLDDISDRDDDQCSIHTSATPNERVPRSQVEARAIVSGTV FT KANSPPMKKKLPPLVVKSLPLEKLKKVMQAINVRAEFRLTGMGIKLIVKSD FT QEFSKAKTYLSKSGAEFFTHDVAAEKPFKAVVRGLSQMDTNEILCELKDVY FT KLQPLAVFNINRRAATSSTKYRDCLYLVHFAKGSATLGALKAVRTLNDCIV FT KWEAYRGPNRSVTQCMRCLNFGHGTRNCNLKPRCNFCSQEHWTENCVLEGA FT CEFRCANCSGQHMSTDKRCPKLEEYQRIRKQATTRNQPNQQKKKKPNLINL FT DEFPELPPPMSSSGWQRSRSPPRAPPGGIPPGFRWGNPNSGSEPVNQGFQQ FT PEALPTSVSSTIAHLAALVAEMQKMMMQMMQMFLSFNIQRQGC" FT CDS 1356..4004 FT /product="Jockey-13_AAe_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="ASTFNVKVVNWNARSLAAKHLEFIEFLEANRIDIAFI FT TETHLKPSVSFYLPEHSVFRLDRPGVQHGGGVAIAVRKGFKCRALPDYRLS FT TIEAVGVELLTPTGPIEIVAAYNPLQSKLSDGSSIRLKNDLQKLTRRMGKF FT ILAGDFNARHSLWGNARNNRNGMVLAEDLQAGHYVVLHPDSPTHFSPAGAG FT STLDIVITNISDNRSNLVAMTELPSDHLPVVFEVEHAINPRQQYRRKNYHR FT ANWPQFQQLIEDRSNENPVLVTADDIDNALRELTDTIGEAESRFIPSTAIV FT SKFTNIDSTTKRIISLRNCIRRQYQRTLNPSRKILYKKLNRIIANRVSKLK FT NXQFNTDLKNLPNYSKPFWRLAKVLKTKPKPIPPLREPSKICLTASEKANA FT ISKQFLASHNLGRDIVGPMEPLVTDSVNRLADQPNALPADKRVTSAELRTS FT IEFTRNMKAPGFDGVFNIVLKNLGPKAINVLVNVFNRCLEIGYFPSSWKLS FT KVVPILKPGKDPTNPASYRPISLLSSISKVFERVVYRRLLDHTEANNILLD FT VQFGFRRGHSTVHQLQRVTNQIRRSKQLSKTTVMALLDIEKAFDNVWHAGL FT THKLIQQNCPAYLVKIISDYLAGRTSQVSVSGTLSSPYPIPAGVPQGSILG FT PILYNLYTSDIPPLPAGGSLSLFADDSAISYTGRVIRALVNKLQKGLDAYV FT EYLKNWKIRVNATKTQVIVFPHRNTDRLKPNGKIKVLGNEVDWVDVVRYLG FT LLMDSKLLYRPHVDDRVLKSTSMLKRLYPMINRRSKVSSTNKLAIFKQVVL FT PMLLYGSPVWMGCARTHIRKLQTVQNKYLKMILKLPRHTRTAEVHRLANLD FT PIAERLDNVASNHRTRAVRSEWPIIRNIYV" XX SQ Sequence 4276 BP; 1224 A; 1131 C; 1023 G; 891 T; 7 other; ttcagttttg accttcgaag tgatcacagt aaaagttcaa gtgtaagttc gaaaagttat 60 cgcgagtgca atgggtggca agaggaagaa gaagagcctc tcacccaata aatcccaatc 120 cagccccttg tcgaagaaag acaagcgatc cagtgcctcc agcggagtcg actttggccg 180 cgagctgaat gccagccagt cgaacatgta cgcgctgttg gatgacatca gcgaccgcga 240 cgatgaccag tgctccatcc acacttcggc caccccgaac gagcgagttc ctcgttccca 300 ggtcgaagcg agagcgatcg tcagcggtac agtgaaggcg aattctcccc cgatgaagaa 360 gaagctgccc ccgttggtgg tgaaatctct tccgctcgag aagctgaaga aagtgatgca 420 agctatcaac gtccgagccg agttccggct caccgggatg ggcattaagc tcatcgtcaa 480 gtcggatcag gaattcagca aggcgaaaac gtacctctcc aagtctggtg cagaattctt 540 cacccacgat gtggcagcgg agaaaccatt caaggccgtc gttcgcggtc tctcgcaaat 600 ggacaccaac gaaatcctgt gcgagttgaa ggatgtgtac aagctacaac cgctggctgt 660 cttcaacatc aacaggaggg cggcaacctc gtcgacgaag tatcgcgatt gcctgtacct 720 ggtacatttc gcgaagggaa gtgccactct cggagcactg aaagcagtgc gcaccctcaa 780 cgactgcatt gtcaagtggg aagcgtaccg tggaccaaat cgcagcgtga cgcagtgcat 840 gcggtgcctc aactttggcc atggcacccg caactgcaac ttgaagccaa ggtgcaactt 900 ctgttcacag gagcattgga ccgaaaactg cgttctcgag ggtgcttgtg agttcagatg 960 tgccaactgt tctggacagc acatgtcgac cgacaagcgg tgcccgaagc tcgaggagta 1020 ccaacgaatt cggaagcaag caacaacaag gaaccagccg aatcagcaga agaagaagaa 1080 gccaaacctc atcaacctcg acgagttccc cgagctacct ccaccgatgt catcctccgg 1140 ttggcaacgt tcacgttcgc cgccaagagc tccccctggt ggcatcccgc ccggattcag 1200 gtggggcaat ccgaatagtg gctcagaacc agtcaaccaa ggcttccagc aaccggaggc 1260 ccttccaaca tctgtgtcca gcacgattgc acatctcgcc gctctcgttg ctgagatgca 1320 gaaaatgatg atgcagatga tgcagatgtt cctgagcttc aacattcaac gtcaaggttg 1380 ttaattggaa cgcccgctcc ctggcggcca aacatctaga gttcatcgag tttctagaag 1440 ccaaccgaat cgacatcgca tttattacgg aaactcatct gaagccgtcc gtaagtttct 1500 acctgccgga gcactccgtc ttcagactgg atcgccctgg tgtacaacat ggtggtggcg 1560 tagccattgc cgttcgcaag ggattcaagt gccgtgcgct gccggattat cgattgagca 1620 ccatcgaggc ggtcggagta gaactgctta caccgaccgg gccaatcgaa atcgttgcag 1680 cctacaatcc actgcagagc aagctctcgg atggttcctc cattcgcttg aaaaacgatc 1740 tccagaagct gactcgacgg atgggtaagt tcattctggc tggcgacttc aatgcccgtc 1800 actcgttgtg gggcaatgcg aggaacaaca ggaacggtat ggtgcttgcc gaggaccttc 1860 aggcagggca ctatgtcgtt ctccatccgg attctccaac acacttctcc ccggccggag 1920 ctggttccac attggacatt gtcatcacga acatctcgga caaccgctcc aacctggtcg 1980 ccatgacaga gctgccatct gaccacctac cggtggtttt cgaggtggaa catgccatca 2040 atccacgaca acagtatcgg aggaaaaact accatcgtgc gaactggccc cagttccagc 2100 agctcatcga ggatcggtcc aatgagaacc cggttctggt gacagcggac gacatcgaca 2160 atgctctccg agaactcact gataccattg gtgaggcgga gtcccgattc atcccatcaa 2220 ctgcaatagt aagtaagttc accaacattg acagtaccac gaaacggata atatccctta 2280 gaaattgcat tagaagacaa tatcaaagaa ccctcaatcc atcccgtaaa attctctaca 2340 aaaaacttaa cagaatcatt gccaatagag ttagcaagct taaaaacmaa cagttcaaca 2400 cagatctcaa aaatttacca aactactcca agcccttctg gcgtcttgca aaagtgttga 2460 aaacaaaacc caaaccaata ccacctctca gagaaccaag caaaatttgc ctaacagcca 2520 gtgaaaaggc aaacgccatt tcgaagcagt ttctagcctc gcataatctg ggtcgggata 2580 tagtcggtcc gatggagccg ttggtaaccg acagcgtcaa ccgattggcc gatcaaccaa 2640 acgcgcttcc agctgacaag agggtaacat cagcggaact gaggactagc atcgagttca 2700 cgagaaacat gaaggctcca gggttcgacg gagtgttcaa catagtcctc aagaaccttg 2760 gccccaaggc gatcaacgtt ctggtcaatg tcttcaacag atgcctcgag ataggatact 2820 tcccttccag ctggaaacta tcgaaggtgg ttcccattct gaagccgggg aaggacccca 2880 ccaatccggc cagctaccga ccaatcagcc tactctcgtc aatcagcaag gtgttcgaga 2940 gggttgtgta ccgtaggctt ctggaccata ccgaagcgaa caatatactg ttggatgtcc 3000 agtttggatt caggcgtggg cactcaacgg tacaccaact gcaaagggtg accaaccaga 3060 tcaggcggtc gaagcagttg tcaaagacaa cagtcatggc gttgctagac attgaaaaag 3120 cttttgataa tgtctggcac gccggactca cacacaagct aattcagcag aactgccccg 3180 cctatctggt gaagatcatc tccgactacc tcgccggaag aacatcccag gtcagcgttt 3240 ccggtacact gtcgagtccg tacccaattc cagccggagt gccacaaggc agtatacttg 3300 gtccgatact atacaacttg tacacttcgg acatcccacc gcttccagct ggaggctccc 3360 tatcgctgtt cgctgatgac tcagcgatta gctacaccgg gcgggtcatc agagcgctag 3420 tgaacaaact ccaaaaagga cttgatgcct acgttgagta cctgaaaaat tggaagatcc 3480 gggtgaacgc caccaagacc caagtcattg ttttcccgca taggaatacg gaccgcctca 3540 agccgaacgg gaaaataaaa gtgcttggca acgaagtaga ctgggttgac gtggtgagat 3600 acctgggtct tctgatggac agcaagctgc tctaccgacc gcatgtcgac gatagggtgc 3660 tcaagagcac atccatgctg aaaaggctgt acccgatgat caatcgccga tcgaaggtct 3720 ccagcaccaa caaattggcc atcttcaaac aggtagtcct tccaatgttg ctgtacggct 3780 ccccggtctg gatgggatgt gcccggaccc acattaggaa gctacaaacc gttcagaaca 3840 agtacctgaa gatgattctg aagctcccac ggcacacaag gacggccgag gtccatcgac 3900 tagcgaatct cgacccgatt gcggaaaggc tggataacgt agcttcgaac catagaacca 3960 gagccgttcg atccgaatgg ccaattataa ggaacatcta tgtttaacat gggttctctg 4020 tacatattgt aaatagtgta aatagttagk agtagtttag atttagattt agtatamtga 4080 aacaaaacaa tgaaatgttt aaaaattaac attaatattc aaacaacaaa acawaaacaa 4140 gttagaaata actaaatgwa aacwwccaaa gcctgaaaag ctctttagct gtatcactta 4200 gaatgtaagg aaaatgattg tttcaacaag aaaaaatgaa atacagacat attaattaaa 4260 aaaaaaaaaa aaaaaa 4276 // ID Gypsy-79_CQ-I repbase; DNA; INV; 4214 BP. XX AC AAWU01022223; XX DT 23-DEC-2010 (Rel. 16.01, Created) DT 23-DEC-2010 (Rel. 16.01, Last updated, Version -1) XX DE LTR retrotransposon from the southern house mosquito: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-79_CQ_; KW Gypsy-79_CQ-LTR; Gypsy-79_CQ-I. XX OS Culex quinquefasciatus OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. XX RN [1] RA Arensburger P., Megy K., Waterhouse R.M. and Abrudan J. et al.; RT "Sequencing of Culex quinquefasciatus establishes a platform for RT mosquito comparative genomics."; RL Science 330(6000), 86-88 (2010). XX RN [2] RP 1-4214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the southern house mosquito."; RL Repbase Reports 11(1), 537-537 (2011). XX DR GenBank; AAWU01022223; Positions 1773 5986. XX CC Positions [3342-3818] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 110..1774 FT /product="Gypsy-79_CQ-I_1p" FT /translation="MATVEELCENFTKIGLVRECERLGLPTNGGKNEMALR FT IIDHQAAANGKDNDDDGSQHDDDDGSVHDEQNDENESDADENDDSHADSDD FT ATVRGNDANDEREARRNTHQRPQVYSFRDMEESIESFGAEDGEDVKVWLQQ FT LETVAESARWTGEHKMIMLRKKLTGTARRFVFSLRDVSSYAKLKKALIDEF FT APFVRASDVHRTLGNRKKKPTETTRDYIYEMQRIALVIDLDEPSLCEYIVD FT GITDDEFHRSLLYEAQTIRHLKEKLLNYEKVTKNSRKKTKPDEKREKPRTE FT NKASVKSDPKKHCFNCGDASHVASECPQKSDGPKCFNCNAFGHLSKECTKN FT KTKQKEKKVNVVEKVNGDKPGVLLNLFGREFSAIVDTGAEISLIRKDLWTD FT LAENGVQMRKSTMKVRGFGGDVRVVCGEVALTATIGEEEFPIRFYVVPNEA FT IDMQVLIGMDFLSGVDYSISPGEVRVKKYRPSDADAKWIRRVAEYVAEDEV FT TVPYQYRDEVLELIENYTPEKPVVAANQLTITLLDNDVVCENPRRLAQLEK FT EVVKKQNQ" FT CDS 1917..4196 FT /product="Gypsy-79_CQ-I_2p" FT /translation="MPNVEEQIDQLSEARVYTTLDLKNSYFHVPVEEKSRK FT YTAFVTSCGQYEFLRAPFGLCTSGSGFGRFISAVLKEFIHDGSVMAFVDDI FT IIPTGSEEDGLKVLKRVLEVAAKAGLLFNWKKCVFLQRRVEYLGYTIYDGR FT VEPAPAKIEKLKQFPQPTTAKQLQRFYGLASYFRKFVPSFAGIARPLSDLL FT KKDKFAQFDDAAVYSFNRIKEILASYPVLRIFRQDGDVEVHTDASKTALAG FT ILMQRAEDDGKFHPCYYFSRLTSGAEKNYHSFELEALAVVESVKKFRCYLL FT GRPFKIITDCMAFKDAVKKKKLNTRIAKYVVALSEFRYEIDHRPGEKLPHV FT DALSRADVLVVSAPIVAKIRAAQQEDDRGKALLTALQQNDSVDGFTTSNGI FT LYEGEAESRRLYVPESMETEIIRSAHEQGHFGVRKTKERIKADYYISGLED FT KIKRCIATCVPCIVGEKKRGKPEGELCPIPKGDVPLDTLHVDHLGPMPSTR FT KSYGHILTVIDAFTKFVWLFATKSTTAEEAVKKLRVITDTFGNPRRIICDR FT GAAFTSGFFTKFCDEEGIELHTIVTGVPRGNGQVERVHRVIIPMITKLSVD FT NPEEWFKHVAVVQKCLNNSWQRAINMTPFELLTGVKMRTKEDAVLHELLLK FT EVQDSFTDDRNELRMAAQQSIQKMQEENRNYYNLRRKPYVEYKIGDMVAIP FT KTQFGVGQKVKPRFFGPYEITGVLPNNRYEVRKLDDETEGPKKTSTAGDQI FT KPWTLPGRK" XX SQ Sequence 4214 BP; 1133 A; 1020 C; 1282 G; 779 T; 0 other; tgggggctcg tccgggaagc gagttaatgt gaaaattacg cgaagtgcgc gaaatcgtta 60 caaaattcgc gaaaaaagac tctgaaagtg acgccagccg aacgacaaga tggcgaccgt 120 cgaagagttg tgtgaaaatt tcaccaagat tggccttgtg cgcgagtgcg aaagactagg 180 cttgccgacg aacggcggca agaacgagat ggcgctacga atcatcgatc atcaagcggc 240 tgcaaacgga aaggacaacg atgacgacgg cagccaacac gacgacgatg atggttcggt 300 ccacgacgag cagaacgacg aaaacgagag cgacgctgat gagaacgacg actcacacgc 360 cgacagtgac gacgccaccg tgagaggaaa cgacgcgaac gatgaaagag aggcacgacg 420 aaacacacac caacgaccgc aggtgtattc gtttcgcgac atggaagaga gcatcgaaag 480 tttcggtgcc gaagacggcg aagatgtgaa agtgtggttg cagcagctcg aaacagttgc 540 ggagtcggct cgttggactg gagagcataa aatgattatg cttcgcaaga aattaaccgg 600 cacggcgcgt cggtttgtgt tctctcttcg ggacgtcagc agctacgcca agttgaaaaa 660 ggcgctcatc gatgagtttg ctccgttcgt gcgggcaagc gacgtccacc gcacactggg 720 aaatcggaag aagaagccga cggagacgac gcgagactac atctacgaga tgcagagaat 780 cgcgctggtc attgacctcg atgagccaag cctgtgcgag tacatcgtgg atggaatcac 840 cgacgacgag tttcaccgct ctctgctgta cgaggcacaa accattcgcc acctgaagga 900 gaagctgctg aactacgaga aggtgacgaa gaactcccgc aagaagacga agccggacga 960 aaagagagag aagccgcgca cggagaacaa agcaagcgtc aaaagtgacc cgaagaagca 1020 ctgcttcaac tgcggcgacg cctcgcacgt cgcatcagag tgcccacaga agagcgacgg 1080 accaaaatgc ttcaactgca acgcttttgg tcatctgtcg aaggagtgca cgaagaataa 1140 gacgaagcag aaggagaaaa aagtgaacgt ggtggagaaa gtgaacggcg acaagccggg 1200 cgttttgctg aacctgttcg ggcgcgagtt ttcggcaatt gtcgacacgg gagcagaaat 1260 ttcgttgatt cggaaagatc tgtggacgga tctcgcggag aacggcgtcc agatgagaaa 1320 gtcgacgatg aaagtgcgcg gattcggcgg tgacgttcgt gttgtgtgtg gtgaggttgc 1380 tctcacagca acaatcggtg aagaagagtt cccgattcgc ttttacgtcg tgccaaacga 1440 agcgatcgat atgcaggtgt tgatcggcat ggactttctc agcggcgttg attattcgat 1500 cagcccggga gaagtgcgtg tgaagaagta tcggcccagc gacgccgacg cgaagtggat 1560 tcgccgcgtt gcggagtacg tcgcggaaga cgaggtgacg gtaccgtacc aataccgcga 1620 tgaagtgcta gaactcattg agaactacac gccggagaag ccagtggtcg ccgcgaatca 1680 gctgacgata acgttgttgg acaacgacgt tgtttgtgaa aacccgcgac gtctggccca 1740 gctggagaag gaagttgtga agaagcagaa tcagtgagct cgaggagggg atcatccagc 1800 agtcggaaag tgaatacgcg agtccagtcg tagtggttcc gaagaaggac ggatcgtacc 1860 gtgtgtgtgt tgattaccgg gagatcaaca agaaagtagt gcgcgataag ttcccgatgc 1920 cgaacgtgga agagcagatc gaccagctgt ccgaagcccg cgtgtacaca acgctggacc 1980 tgaagaactc gtactttcac gtgcctgtcg aagagaaaag ccggaagtac acagcgtttg 2040 tgacaagttg cgggcagtac gagtttttgc gtgcgccgtt cgggttgtgc accagtggaa 2100 gtggatttgg acgattcata agtgcagtgt tgaaagagtt catccacgac ggaagtgtga 2160 tggcgtttgt ggacgacatc ataattccaa cgggatccga ggaagatgga ctaaaagtgc 2220 taaaacgagt gctcgaagtg gctgcgaagg cagggctgct gttcaactgg aagaagtgtg 2280 tgttcctgca acgtcgtgtg gagtacctcg gctacacgat ctacgacggc cgagtggaac 2340 cagcaccagc gaagatcgag aagctaaagc agttcccgca gccgacgacg gcgaagcagt 2400 tgcagcgatt ctacgggctg gccagctact tccgaaagtt cgttccgtcg ttcgctggca 2460 ttgcccgccc cctgtcggat ctcctgaaga aggataagtt tgcccagttc gacgacgccg 2520 cagtttactc gttcaaccgc atcaaagaaa tcctggcaag ctacccagtg ttgaggatct 2580 tccgacaaga cggtgacgtg gaggtgcaca cagacgccag caagacggcc ctcgctggta 2640 tcctgatgca acgcgcagaa gacgatggca agttccaccc gtgctattac ttcagccgac 2700 tgacgagtgg tgcggagaaa aattaccatt cgttcgagtt ggaggcgttg gctgttgtcg 2760 agtcggttaa gaagttccgt tgttacttgc tcggccggcc attcaagatc atcaccgact 2820 gtatggcgtt caaggatgcg gtgaaaaaga agaagctcaa cacacggatc gccaagtacg 2880 tcgttgccct gtctgagttc cgctacgaga tcgatcaccg ccctggggaa aagctgccac 2940 acgttgacgc gttgtcgaga gcggatgttt tggttgtgtc tgcacccatc gtcgccaaaa 3000 tccgtgcagc acaacaagag gatgaccgag gcaaagctct cctgacagcg ctgcagcaga 3060 acgattccgt cgacgggttc actacgagca acggtattct ctacgaaggc gaagctgaat 3120 cgcgacgact gtacgtgccc gaatcgatgg aaacggagat catccggtct gctcacgagc 3180 aaggccattt cggagtccgg aagacgaagg agcggatcaa ggcggactac tacatctctg 3240 gcttggagga caagattaaa cgctgcatcg caacgtgtgt tccctgcatt gtcggcgaga 3300 agaaacgagg aaagccggaa ggtgagctgt gtccgatccc gaaaggagac gttccgctgg 3360 atacgctgca cgtggatcat ctaggaccga tgccctcaac gaggaagtcg tacgggcaca 3420 ttctaacagt cattgacgct ttcacaaagt ttgtctggct gttcgcgacg aagtcaacga 3480 ccgcagagga ggccgtgaag aagttacggg tgatcacaga cacgttcgga aacccacgcc 3540 ggatcatctg tgatcgggga gcagctttca cgtcgggatt cttcaccaag ttctgcgacg 3600 aagaaggcat cgagctacac acgatagtca cgggggttcc tcgtggcaac gggcaagtcg 3660 aacgagtgca tcgcgtcatt atcccgatga tcaccaagtt gtccgtggac aacccggagg 3720 aatggttcaa gcacgtcgcc gtggtccaaa agtgtttgaa caacagctgg cagcgagcga 3780 tcaacatgac cccgttcgaa ctgttgactg gtgttaagat gcgaaccaaa gaagacgccg 3840 tgctccacga gttgctgctg aaggaggtcc aagacagttt caccgacgac cggaacgaac 3900 tgcgaatggc ggcgcagcag agcatccaga agatgcagga ggagaaccgg aattactaca 3960 acctacgccg gaagccgtac gtggagtaca agatcggcga tatggtggct atccccaaga 4020 cacagtttgg agttggccag aaggtcaagc cgcgattctt cggaccgtac gagatcaccg 4080 gtgttctgcc gaacaaccgg tacgaggtca ggaagctgga cgacgaaact gaagggccga 4140 agaagacgtc gactgcagga gaccagatca aaccgtggac acttccgggc cggaagtgat 4200 gtcaggaaag gccg 4214 // ID Gypsy-5-LTR_HM repbase; DNA; INV; 135 BP. XX AC . XX DT 24-DEC-2008 (Rel. 13.12, Created) DT 24-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5-LTR_HM. XX OS Hydra magnipapillata OC Eukaryota; Metazoa; Cnidaria; Hydrozoa; Hydroida; Anthomedusae; OC Hydridae; Hydra. XX RN [1] RP 1-135 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Hydra magnipapillata."; RL Repbase Reports 8(12), 1977-1977 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by TIGR, CC J Craig Venter Institute. XX SQ Sequence 135 BP; 35 A; 13 C; 17 G; 70 T; 0 other; tgttgtgttt tctatttttt gttattgaac tatttgtttt atatatgtgt ctattttctt 60 gtatctttag attagtacaa ttattgttaa ataaagcctt aatcaagagg tcaatctttt 120 tattagttca ttaca 135 // ID R1E_NGi repbase; DNA; INV; 7736 BP. XX AC . XX DT 08-NOV-2010 (Rel. 15.11, Created) DT 08-NOV-2010 (Rel. 15.11, Last updated, Version 2) XX DE 28S rDNA-specific non-LTR retrotransposon R1 in Nasonia giraulti. XX KW R1; Non-LTR Retrotransposon; Transposable Element; R1E_NGi. XX OS Nasonia giraulti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; OC Pteromalidae; Pteromalinae; Nasonia. XX RN [1] RP 1-7736 RA Stage D.E. and Eickbush T.H.; RT "Maintenance of multiple lineages of R1 and R2 retrotransposable RT elements in the ribosomal RNA gene loci of Nasonia."; RL Insect Molecular Biology 19(Suppl 1), 37-48 (2010). XX DR [1] (Consensus) XX CC This family is specifically inserted into 28S ribosomal RNA CC genes. XX FH Key Location/Qualifiers FT CDS 2440..3930 FT /product="R1E_NGi_1p" FT /translation="DIGVLGQYRRETRQYDQLQSMASNKKHTPAILVESEN FT APEHTLNPPTSTDAAANLGFASNTGLQRDRHRSISSVSSIASDENSSAKTI FT GDTFAKRAKRKREEKVPDEVDSGNPLERRNHEIKEVRKTFEQATLELRSIC FT SIEKDKNRCKLNKGDLNKFTSVLSAALQLVEDAITDKYDYGILLQQKNEQL FT KQLREVLSARPRSYADAAGADRRMKQTLAPKRPPISSGANKPRPKNAGRPS FT ATVVVTCGRENGESPKADDIKEKLMSAVNPAAEGIRIKSVRVAKTGKVYVE FT TNSSEDGKKLSELPQLRDAGLIAEVPKGRKPRLILYDVPRTMEPSEVGAAL FT LQMNADVFEGIERSAFEKDCNFLFKTGKRDADVVHWVLETSSELRDRLREK FT ERLYLGWRRCRIQDYISLTRCYKCQGLGHIGKYCKAKEETCGHCGGQGHKK FT ESCPKKDQPAKCALCLRRDKPSDHSVMDPKCPSFKSAYDVYTQSIDYGGPV FT R" FT CDS 3891..7070 FT /product="R1E_NGi_2p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="RVHAKYRLRGPSAVIMASTINVLQHNLGRGKTSTAEL FT RDAAAAVRASVLLVQEPHAREDIGVIGLGSLSNKVLTGAKTGERPWACVVI FT TDRCLDVVVIPHLSSPFCVCAHVSTAVGDFHVVSAYFPPSQPLDSHLRMLG FT RIHDGLRSSKVILGIDVNAWSMRWGSNTTDRRGQEVEEFLATTGWLVVNEP FT GQLPTFSTANGESHIDVTLVGAGLSNLIREWNVREGWTSSDHRVIQITLGS FT PEPRQLIRIPRYNIKRANWEGFRGNVISHLDTLDQAVTCSEHLERAATSLG FT EIFHQASATNNNIPRKRHFIRSVPWWNRELTFWKRRVNRARRAFQGARRSS FT GAPALRRIYSVLKREYNKKVYLAKTGSWREFVTLDGNEDAYGVANKIIRKR FT FTPDTALSSISTHGGATLDWQDTAEKLLDALIPDCPPGDRMDLRELCLSFP FT EGEAEACPHWTAGEVTRVIKSLPRSKAPGGDLIEAAMIKSLANSPEFIEAL FT VGLYNKCLDIGVFPCTWKRGLIRVLLKSVEKDPADPGSYRPICLLPLFGKI FT LEKLIKLRLSGVLNHPEYASANQYGFRAGRSTEDALYELRRNVNSSSKKYA FT LALLYDAKAAFDHLWWPALIRELQQRGCPNNLLMLIGSYLSGRVVELTGSY FT NSLEKSVTRGCPQGSILGPDFWNIVFDTLLKLLEGLNLNIKIIAYADDLLV FT LVESDSRRMLEVGGQEVTDTVQDWFVRNKLILSESKSEMILLKGALPSRDP FT IIKINNRALALAESVKYLGVHVGERFSITKHLLAAGSKCRLVFDLLARSAR FT ANWGLRFRSLDIMYKGIFLAVMCYGAGAWGDLVKTNQHRQTLLTIQRAILL FT RVTKAYRTTSGDALCVIAGAIPLDLVIRERMILYMIRKNLEAGFGRYWFLP FT GSSKVEAKRGLREAILREWQMRWETSSKGRETFAYFPDVAKRVGSDWVKPG FT HYMAQALSGHGDFNAKLHSFRLKESPLCPCGMEETSSHLLFDCELWNDIRS FT EFVAKLEEAGLEWPPQRQRLMDKDVYQIFDGYVTAVLREKAARAAAGQTQA FT ANN" XX SQ Sequence 7736 BP; 1800 A; 1891 C; 2363 G; 1682 T; 0 other; gtttaatggc caaggttcac ttgaaaactc tagggttgaa gtgctccggg gtcttaaaca 60 agcgccaaga tctctgacgt ccacgtcgta ctagatgtcc acaacccccc gggccccagg 120 accctcgggt ccttgtggcg tcgggaacgg ttggctaaca gaggcgggct aaactcgaac 180 gcgtttagat gggacaggtg ggtatatctt tcgagagtag acaaccggcc cctgggttca 240 tccggttaaa atggcgatac tttgtacacg gaccttcttt ccttcttact tctcgaacgc 300 gtacctttcg aaagcgtact cgggcttcgg gcgtcgtctt gtctcctcgc gttcgcgtgt 360 aggaggcttt gcggcgctgc ggggctcccc tgttgcgtgg agagtgctct tcgtgatgcg 420 cctccagttg gctaaccttt ttggctgcta acactttgag actgacaagt tgtcggcgta 480 gaccaggcgg tcggggatac gttgcccccg gccgtttcaa aggactatcc gcaaactcgc 540 ctctctgatt cacaaatcgt gtcgccccac aaggcacgat ggaatgaatc agtaaggcga 600 tgcggtaacg tctgagtctg cgttggtggc ttgttggtcg gcttttgaat gttgtgcgga 660 tcgaggtggt tgtgggactc tcgaaccgat caggctacat ggccctcaac tcctcaaaag 720 ttgcctctcg gactcttgga ctcttgatac cgcccgagag gggtgactgg tttgccgggt 780 ttacctggtc tgctagtcgc tcctcgaagg tggtcgaatc gaaaagcaat gcgtgtgggg 840 aaacttagag atttttcttt cctcttgcgt tgcttaatca gaaaagcaat gcgtgtgggg 900 aaacttaggg atttttcttc ttgcgttgct caatcggaaa gcaatgcgtg tgggcagtgt 960 cgggggtttc gccggggggt cggtctgctg ggcgacttgc acgcggagct atggttaaag 1020 cgggttttgg tgagggaact cgtacgcgcg gggtcgatgt tccctcacgg acttagaata 1080 tcaatagttg taaaagcaat gcgtgtgggg gactttaaaa tcattgtaga tttagaagca 1140 atgcgtgtgg gggactttaa aattattata gatgtagaag caatgcgtgt ggaggatttt 1200 aaaagtcatt atagatgtag aagcaatgcg tgtgggggac tctaaaaatc attatagatg 1260 tagaagcaat gcgtgtgggg gactttaaaa aatcattata gatgtagaag caatgcgtgt 1320 gggggacttt aaaattattg tagatgtaga agcaatgcgt gtggaggact ttaaaaatca 1380 ttatagatgt agaagcaatg cgcgtggggt aaatcagaac ttagattggg agtagaagca 1440 atgtgtgtgg gaaaacttgg cactatgttg ggttaaggca acgcgcgtat agctcgctcc 1500 ggctggccga cccgcctgcg gggcatcccc tcacttacca ctttaatcga ctaggtcatg 1560 gcgcgcggtt acgaatcccc tgcggctagc tcccgtctgg gctctgtccg gcgtgttatg 1620 gggtgctccg cgtagaaatg ctccgcctgc tcggagagct atgcgtggag tctactctat 1680 ccccggggtc cacccggggt cttaggactt atctctgtct cagccgtcaa cccgggtcgg 1740 agcggcgaag actggcctga acggacgtac tttcttctag gtgtgcggag ggcggcaccc 1800 gtcaatcgac ggactgcttg cctgtttgca ctggactctg ggggcgtatg ggctgggttc 1860 gggtaaggtg cgaatcgcgt tatggggagg atcggccgca acggtgccct ggtcgcaagc 1920 ctgccctgac cttggactct ctggttttcc cgttccgctg cgattctttt caagtcatcc 1980 taacatcgag caattgtacg ggaggcatca aaggggggtt cgatcggggt agcgctcggt 2040 ggcttcaggg gggcgccgcg gcgtcgcgga gcctctgggt ttgggggagt ccgaaaacct 2100 ccacaaccct ggcgtggtga ccgtacgcgt caggttcatc aaaaccacgg ccccggtagc 2160 cgccggggca aactttcggg aaagggtgta ctgctcgacg ggcacccagg ctagccggga 2220 agcctcggga aaatcggtgg gaaactctcc gcaaaacacg ctccttaagc tcgatcttcg 2280 caggcgggtg gggcatggga gggtggacat cgacaactcg gggcggaacg gcttgcgcgt 2340 ctcctaaaag gacctcctta acccagaaga cggcagaggc cctgttatta cagggtagcc 2400 tgatgggggc ggagactctt tgagcatggc cgttattagg acatcggagt tttaggtcag 2460 taccggaggg agacgcgtca gtacgatcag ctacagtcca tggcatcaaa taaaaaacat 2520 acacctgcga tcttagtgga gtcggagaat gcgccggagc atactctgaa cccacccacc 2580 agcacagatg cggcggcaaa ccttgggttt gcgtcaaata cgggcctgca gcgcgacagg 2640 caccggagta tctctagcgt ttcgagcata gctagcgacg aaaattcgag cgcgaaaacc 2700 ataggcgata ctttcgcaaa aagggcgaag cggaaacggg aagaaaaggt tcccgacgag 2760 gttgatagtg gcaaccccct cgaaaggcgg aaccacgaga tcaaagaggt ccggaagacc 2820 ttcgagcagg caactctcga gctgagaagt atctgctcga ttgaaaaaga caagaaccga 2880 tgtaaactta acaagggtga cctaaacaag tttacatcgg ttctcagtgc tgcactgcaa 2940 ctggtcgagg atgctatcac tgataagtac gactacggca tcctcttaca gcagaaaaat 3000 gagcagttaa agcagcttag ggaggtccta tcagcgcgcc cgaggtcgta tgcagatgca 3060 gcaggagcag ataggcgaat gaagcagacg ctggcaccca agcgcccccc catcagcagt 3120 ggcgcaaaca aacctaggcc caagaatgca gggaggccga gcgccacggt agtggtgacc 3180 tgtgggagag agaacggcga aagcccaaaa gcggatgaca ttaaagaaaa attgatgtca 3240 gcggtaaatc cggcggccga ggggatccgg atcaaaagcg tccgtgttgc taaaacggga 3300 aaggtctacg tcgagacgaa cagcagcgag gacggtaaga aactctcgga gctgccacag 3360 ctccgggacg cgggtctcat cgccgaggtg ccgaaaggga ggaaacctcg gttgattctc 3420 tacgacgtac cgaggacgat ggagccaagc gaggtaggtg cggcccttct gcaaatgaac 3480 gctgacgtgt tcgaaggaat cgaacgctcg gcgttcgaga aggactgcaa cttcctcttt 3540 aaaacgggga aacgcgacgc ggacgtggtg cactgggtcc tcgagacttc gtcggaactt 3600 agagacaggt tgcgcgaaaa ggagaggctc tacctcgggt ggaggaggtg taggatacag 3660 gactacatct ccctcactcg ttgctataag tgccaaggcc ttggccatat aggaaaatac 3720 tgtaaggcca aggaggagac atgcggacac tgtggtggcc aggggcacaa gaaagaatcg 3780 tgccccaaaa aagatcagcc cgcgaagtgt gctctctgct tgcgcagaga caaacccagc 3840 gaccacagtg tcatggaccc gaagtgtccc tcatttaagt cggcctatga cgtgtacacg 3900 caaagtatcg actacggggg cccagtgcgg taataatggc ctcgacgatt aatgttctcc 3960 agcataactt gggtagaggg aaaacatcga cggcggagtt gcgggatgcc gcggcggccg 4020 tcagggcttc cgtgttgctt gtgcaggagc cccacgcgcg tgaggacatt ggggtgatag 4080 gacttggctc tctgtcgaac aaggttctca cgggcgcgaa aacaggggaa cgtccttggg 4140 cctgtgtagt aatcacggat cgctgcctgg acgtggtggt catcccgcat ctcagcagtc 4200 ccttctgcgt gtgcgcgcat gtgtccacgg cggtgggcga cttccatgtc gtctcagcgt 4260 actttccacc ttcgcaaccg ctggactcgc atttgcgcat gctggggcgc attcacgatg 4320 gtctacgctc gtcaaaggtt atactcggta tagatgtcaa cgcgtggtca atgcggtggg 4380 gctctaacac caccgaccgt agaggtcagg aggtagagga gttcctggcg acaaccggct 4440 ggctggtcgt caatgagcct ggccaactgc cgacattctc aacggcaaac ggcgagagcc 4500 acattgacgt cacgctagta ggagctggat tatcaaatct cattcgcgag tggaacgtca 4560 gggagggttg gacgagcagc gaccatcgtg tgatacagat cacgttgggt tcaccagagc 4620 ctagacaact tatccggata ccgcgctaca acattaaaag agcaaattgg gagggcttca 4680 ggggaaatgt catctcacac ctcgataccc ttgaccaggc cgtcacttgt tcggaacacc 4740 ttgaacgtgc ggccacctcg cttggtgaga tcttccacca agcgagcgcc acgaataaca 4800 acattcctag gaagaggcac tttattaggt ctgtcccctg gtggaacagg gaactcacct 4860 tctggaagag gagggtaaac agggcgcgca gggcatttca gggcgccagg cggtcttctg 4920 gggcgccggc gttaaggagg atttacagcg tattaaaaag ggagtataac aagaaggtgt 4980 atctggccaa gacggggagt tggagagagt tcgtaactct agacgggaac gaggacgcat 5040 acggtgttgc gaataaaatc attagaaaac ggtttactcc ggacacggct ctctccagca 5100 ttagcacgca tgggggagct actctggact ggcaagatac ggccgagaaa ctccttgacg 5160 cccttatccc ggattgcccg ccgggtgacc gaatggactt gcgtgagctg tgcctctctt 5220 tcccggaggg tgaggcggag gcgtgccccc actggactgc cggggaggtg acgagggtca 5280 tcaagtccct acctaggagt aaggcacctg gtggcgacct gatcgaggcc gcaatgatta 5340 aatccctagc gaactctcct gagttcattg aggctctggt gggcctatat aacaaatgtc 5400 tggatattgg agtatttcct tgtacatgga agcgcggact tattcgggta ctcttaaagt 5460 cagtcgagaa agacccggcg gatccgggtt cgtatcgccc catttgccta ttaccactgt 5520 tcggaaaaat actcgagaag ctaataaagc tacggttgag tggggtgtta aatcacccag 5580 aatatgcctc tgccaaccag tacgggttca gggctgggcg ctccacggag gacgcacttt 5640 acgagttaag gaggaacgtg aattcctcca gtaaaaagta tgccctggca cttctttacg 5700 atgccaaggc ggcttttgat catctctggt ggccagcgct tattcgcgag cttcaacaga 5760 gaggctgccc gaataacctc ctgatgctaa tcggaagcta tttgtctggc cgggttgtag 5820 aactcaccgg cagctataac tctctggaga agagtgtgac gaggggctgc ccccagggat 5880 ctatattggg tccagatttc tggaatatcg tcttcgacac tttgttgaaa ctcctggagg 5940 gcctgaacct gaatattaaa atcatcgcgt acgcggacga cctgctggtc ctggtagaat 6000 ctgacagtcg gagaatgtta gaggttggcg ggcaggaggt cacagacacg gtgcaggatt 6060 ggttcgtcag gaataaactg atcctctcag aaagcaaatc agaaatgatt ttgctaaagg 6120 gagctctgcc gtcaagggac cctataatca agattaacaa tagagcacta gcgttggctg 6180 agagtgtaaa gtatcttggc gtccatgttg gtgagagatt ctcgatcact aagcacctgc 6240 tagcggcagg atccaagtgc cggttggtct tcgatctcct agcccgctct gccagggcga 6300 actgggggct acggtttaga tcactcgaca taatgtacaa ggggatattt cttgcggtca 6360 tgtgttacgg agcgggggcc tggggtgacc tcgtaaaaac gaatcagcac aggcagacgc 6420 tccttaccat acaaagggcc atactcctcc gggtcaccaa ggcctatcgg actacctccg 6480 gggacgcact atgtgtaata gctggggcga tccctctgga cttggtcata cgggagagaa 6540 tgatcctgta catgatacgg aagaacttag aggcgggatt cggtaggtac tggttcctgc 6600 cgggttcaag caaagtggaa gcaaagagag ggttgcgtga agcaattctt cgcgaatggc 6660 agatgcgctg ggagacctcc agtaagggca gggaaacctt cgcttacttt ccggacgtcg 6720 cgaaaagggt gggcagtgac tgggtaaaac ccggtcacta tatggcgcaa gccctaagtg 6780 gacacgggga ctttaacgcg aaactgcatt cttttcggtt aaaagaatcc cccctttgcc 6840 cgtgcgggat ggaggaaacc tcgtcgcatc tgctctttga ctgcgaactg tggaacgata 6900 tccgatcgga gtttgtcgcg aagctggagg aggccggact cgaatggccc cctcaacgcc 6960 aacgattgat ggacaaagat gtgtatcaga tctttgatgg ctatgttact gcggttctac 7020 gggaaaaggc agcacgggcg gccgctgggc agacccaggc tgcaaacaac taaaggaatg 7080 tcttatatag cctcagggag ggagggaggg cgcaagtcct ccagcccgac gggcctggcg 7140 aacccggggc ccgagcaagt gtgagggtga tatggaggcc tagcgtcgac taagcccaat 7200 cgcgaatggc agcttggatc tggttagcgg cgagtgagcc ggctcacgcg cgcaatatcc 7260 gccgtaccat cctggcaacc gcgcgcgggc cgagaggtat ggaacgggac cggactaccg 7320 tggtcatgtg taaggcgaaa ttgggactgg cgttataggc agaaggctac ttgcagtcac 7380 cgaggctgtg gttttccggc ctccctgtat tattggaaca gggttatggg cctctgttga 7440 cgactccgcc tccccgaggt ccccgctggt aacgccgttc cccttcgggt ccatcttctt 7500 gggccccaag gggtgcggcg gcagaatgga gtcgcccccg gatggggggg ccctagctac 7560 tctatgatcg ggcacagccc gtgagcgtgg cggtttttcc aagaacgatt ttccttgggg 7620 gtgtttctcg catggatggt ggatgggtaa attctgattt gaagaccggt gatatagggc 7680 tctgcccgag gtgcaccatg cccccgcccc attggcgtgg gtgtcctcac attaca 7736 // ID Crack-28_BF repbase; DNA; INV; 3298 BP. XX AC . XX DT 30-APR-2009 (Rel. 14.04, Created) DT 30-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Amphioxus Crack-28_BF autonomous Non-LTR Retrotransposon - DE consensus. XX KW Crack; Non-LTR Retrotransposon; Transposable Element; KW Crack-28_BF. XX OS Branchiostoma floridae OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. XX RN [1] RP 1-3298 RA Putnam N.H., Butts T., Ferrier D.E., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E. et al.; RT "The amphioxus genome and the evolution of the chordate RT karyotype."; RL Nature 453(7198), 1064-1071 (2008). XX RN [2] RP 1-3298 RA Kapitonov V. and Jurka J.; RT "Young families of Crack non-LTR retrotransposons from the RT amphioxus genome."; RL Repbase Reports 9(4), 833-833 (2009). XX DR [2] (Consensus) XX CC Its 5'-terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 14..3127 FT /product="Crack-28_BF_2p" FT /translation="MWEKATSNVHLQSGTWITQNAYSVKINPGLSGVRLIG FT RSTTCTERTERTLNLLVCATLLLAGDVSPNPGPDTGGLPVWRKGIVYAFYN FT VVSLPRHLDEIQQLLLRNTRIHVLGLNETRLSDSIPDSSVDINGYTLYRTD FT RDRQGGGVGVYVKQTIASQRRCELEQEDLEVCCVEIKPEKARKTLLTCVYR FT PPTSGPDWRNSAESLVHKLNQTAEKENADVAIMGDFNSDLLTSTQAMSSVE FT FLMGLYQLVPVIREPTRITEKTESCIDNIFVSNPDRYKSSASVAWGPSDHN FT LILTCAKAGSEAGAAHRCEYRSYKLYTQQSFIDSLKSVRWDTVFDCTDVSE FT AWNAFKDIFLNVADEHAPLRTKTARENNRPAPWMTDTVKNMMGRRDAARRK FT AIRTKDVQDWDTYRSLRNQTTSIIRKEKKSHFATAVSEAKGDQSLMWKIIN FT SFTGKSKSTKRVQKLLRADNTSMSDPGEMAQEFNDYFTSCASRLTDGMPDS FT EEDPLRHIPDSTTKFSFDCVEETEVLNELQKLKTKKATGLDKIPAKLLKDS FT APVVAKPLAHIFNLSLASGEVPSDWKEAQITPVHKSGSCADVGNYRPVSVL FT SVTSKVMEKLVCNQVTRYLTRCKLLTTHQSGFRRHHSTATAVQKVVEDITS FT GYNCSKVTVALFLDLRKAFDSVNHEIMLSKLKKFGFDSDAMKWFTSYLSER FT LQCTCLQGQYSSKTRVSCGVPQGSVLGPLLFCLYVNDLPNVIQKCSIHMYA FT DDTVLYYSAVSVKVCEETVSMDMKRVVKWLSENRLLLHPDKTKSMLFGLPQ FT KLKHAGTTVNITDGVNVYEQVDSFTYLGITLDPALRWAAHVQKITKKLLSG FT LGAMGRARAFVTNEVLKTMYQTLLLAHLEYCATAWLPSLAQGNKTLMLQLD FT RLVNRAARLITGHKLRDHVTVDNLRAEAGIDSVRKRTEITTLVTVFKTIRG FT KAPAYLASLFKWEAPPTMSVRPTRSEVKRLRDYDPHLLWCPPARVIAFRNS FT LQSYGPFLWNSLPLKQRQLLSLRTFKKFIEN*" XX SQ Sequence 3298 BP; 955 A; 765 C; 773 G; 805 T; 0 other; agccctagtc cctatgtggg aaaaggctac tagtaatgtt cacttacaaa gtggaacttg 60 gattacacaa aatgcttatt ccgtaaaaat caatcctgga ctgtctggag taagactcat 120 tggtcgatca actacatgca cggaaaggac cgaacggacg ttgaacttac ttgtttgtgc 180 aacgttacta ctggctggtg atgtttcgcc taacccggga cctgataccg ggggattacc 240 tgtctggagg aagggcatag tgtatgcttt ctacaacgtg gtctccctcc cacgacacct 300 ggatgaaatc cagcagttac ttcttagaaa cactcgtata catgtattag gactcaatga 360 aacccgttta tcagatagta tccctgactc ttctgtggat atcaatggct acaccttgta 420 caggacggac agagacaggc aaggtggggg cgtaggcgtc tatgtcaagc agacgattgc 480 gtctcagcga cgttgtgagc tcgagcagga agaccttgaa gtttgctgtg tcgagattaa 540 accggagaag gcgagaaaga ctctgctgac ctgcgtctac cgaccgccga catctggacc 600 tgactggaga aactctgctg aatcacttgt gcacaaactc aatcaaacgg cagagaaaga 660 aaacgcagac gtcgctatca tgggagattt caactccgat ttacttacct caactcaagc 720 aatgtcctca gtggaattcc tgatgggttt gtaccagcta gtccctgtca tacgcgaacc 780 aacccgcatt acagagaaaa ccgaatcatg tattgacaac atatttgtct ctaacccgga 840 ccgctacaag tcaagtgcaa gtgtggcctg ggggccatct gatcacaacc tcattctgac 900 atgtgccaaa gctggaagtg aggctggtgc tgctcacagg tgcgaataca ggtcgtacaa 960 gctctacaca cagcagtcct tcatcgacag tttaaaatca gttagatggg atacagtatt 1020 cgactgcacg gatgttagcg aggcctggaa tgcatttaag gacattttcc tgaatgtcgc 1080 cgacgaacat gccccgctcc gtacgaaaac tgctcgggaa aacaaccgcc cagctccatg 1140 gatgactgat acggtgaaaa acatgatggg tcgtcgggac gctgccagac gcaaagccat 1200 cagaacaaaa gatgtacagg actgggatac ataccgttct ctaagaaacc agacaacatc 1260 catcatcaga aaggagaaga aaagtcactt tgccacagct gtgtctgaag ctaagggtga 1320 ccaatctctc atgtggaaaa ttattaatag tttcacaggg aaatcaaaat caacaaagcg 1380 agtccagaaa ttgctacgag ctgacaacac ttccatgtct gatccaggtg aaatggcaca 1440 ggaattcaat gactacttta catcctgcgc ttcacgactt acagatggga tgccagattc 1500 cgaggaagat cctcttcgcc acatccctga ttccacgacg aagttcagct ttgactgtgt 1560 ggaagagact gaagtgttaa atgaacttca aaagttaaaa acgaagaaag caacagggtt 1620 ggacaaaatt ccagccaaac ttctcaaaga ctccgcccca gtggtcgcaa agccattggc 1680 acacatattt aacttatctc tcgcctctgg agaggttcca tctgactgga aggaggcaca 1740 gattacacct gtacacaagt ctggaagctg tgctgatgtc ggaaactatc gccctgtgtc 1800 agttctaagt gttacctcga aggtgatgga aaagcttgta tgcaatcaag tgacacgtta 1860 cctgaccaga tgtaaacttc tcacgactca ccaaagtggg ttcagaagac accatagcac 1920 tgcgacggct gtacagaagg ttgtcgagga cattacatct gggtacaact gctctaaggt 1980 cacagtggca ctatttctgg acctaaggaa ggcattcgat tcggtcaacc acgagattat 2040 gttaagtaaa ctcaagaagt ttggcttcga tagtgacgct atgaaatggt ttacatctta 2100 cctctctgag cgccttcagt gcacgtgtct tcagggtcaa tactcttcaa aaactcgggt 2160 ctcttgtggg gtacctcagg gaagtgtcct aggtccgcta ttattctgtt tgtatgttaa 2220 cgatttaccc aatgtcatcc aaaaatgcag tattcacatg tacgcagatg atacggtcct 2280 gtactattct gctgtttccg tgaaggtatg tgaagagact gtctctatgg acatgaagag 2340 agtggtaaaa tggttaagtg aaaacaggtt actgcttcac ccagacaaga ccaaatcgat 2400 gctcttcggg ctccctcaga aactgaagca tgctgggaca accgtcaaca taacagacgg 2460 tgtgaacgtt tacgaacaag ttgactcgtt cacttacctg ggcattactc ttgacccagc 2520 gctcaggtgg gcagcacatg tacaaaaaat cacgaagaag cttcttagcg gacttggagc 2580 aatgggacgt gccagagcct ttgttacgaa cgaagtgctg aaaaccatgt atcagacact 2640 actactggct catctggagt attgcgccac agcatggcta ccaagccttg cacaaggcaa 2700 caaaactctt atgttacagc tggatagact ggtaaacagg gctgcaaggc tcatcacagg 2760 acacaaactc cgagaccacg tgacagtcga caacttgaga gcagaggcag ggatagactc 2820 tgtgcgtaaa cggacagaaa ttacaactct tgttactgtc tttaagacaa ttcgtggaaa 2880 ggcgccggcg tacttggcat ctttgttcaa gtgggaggct ccaccaacaa tgagtgtacg 2940 ccctactcgg tcagaggtaa aacgtctaag ggactacgac cctcacctac tctggtgtcc 3000 accagcaaga gtcattgcat tccgaaacag tctacagtcc tacggacctt tcctctggaa 3060 ttcactaccg ctaaagcaac ggcaattgct cagcctcagg acttttaaga agttcattga 3120 gaattagaac ttgatctggc tgatgctccg tagtctggac tttgatattg gataatttta 3180 ttatgtattg tatatgtgct atgtgtactt tgtactttta tgtccaggat tacctgaaaa 3240 gcaggccgat ttccggcctg agatgtaatt actggtaaaa taaataaagt gaagtgaa 3298 // ID Copia-111_AA-I repbase; DNA; INV; 4669 BP. XX AC . XX DT 01-NOV-2010 (Rel. 15.1, Created) DT 01-NOV-2010 (Rel. 15.1, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-111_AA_; KW Copia-111_AA-LTR; Ty1_copia_Ele79; Copia-111_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-4669 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to Repbase Update (30-OCT-2010). XX RN [2] RG Nene, V., Wortman, J.R., Lawson, D., Haas, B., Kodira, C., Tu, RG Z., Loftus, B., Xi, Z., Megy, K., Grabherr, M., et al; RT "Genome Sequence of Aedes aegypti, a Major Arbovirus Vector."; RL Science 316(5832), 1718-1723 (2007). XX DR [1] (Consensus) XX CC Positions [2313-2816] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 39..2867 FT /product="Copia-111_AA-I_2p" FT /translation="MDQDESAVVSSRNESAVVSSRKKSATGSNRYESAVGR FT DRDELAVGSSRNESAPGRGRGESAVGRARNESAVGSSRNESAVDSSRNELS FT RSKNESGSSQDRGAVGSSRGRATGAIPKHGGRSRSEDGATSANHSRERAAG FT VSRGESGHTATNRGRDNAAGESRSYIRNDAAGRGINRTVSQNRSRDQDGGV FT HGWDRRDETPGRNRFGRLHDDDRNLGAAGGDWYDEEEGQYDAESSSRQYGR FT VRTENIPSIRVTASDGLRTAEATNGMDNLLVPNAKTVMLIERRSQMVGVFE FT GDEREFTPWKWRLMHHLSRLKLDYTVLRTPRQEPYGQPDPNATPAEEARRQ FT ALYQKRCEDDCEAIDEIIFTIGNKPLQKLMHCQYAKECMDALSNAYQRGGT FT LVAMSLRRQLFRLGDRQFHSLEELFKEHSRLCMELEYADPQGTHLEKIQTL FT LGAVPHRLNHAVAAILMLPEPVLKTKPLDELHRMLLDAETIQRPERQPTNV FT AMVAKPRDGIECFGCHQMGHYRNKCPNETKNKKFIRKKWIRQASSDRKDRE FT HALMVSTRPVKQKRFVIDSGASNHMINDDTMMSDVENLDVNLIVTTAKQGE FT HMKATKKGSVALKSVVGENKIKKLKLYNCLFIPGLQENLISVKRVVAQGKE FT VHFSNKSVVIQEKSGEVIARGHLLNGLYWLDVHGNQPANNAMLCKNNGLVT FT WHKRLGHLGQSSMEKLHRHSMIEGFEIKSSEFDSNTLQCDSCLFGKQTRER FT FKDTEKPRSNRPLELVHSDVCGYFEEPTYEGYRYFVTFIDDFTHFTVVYLL FT QHKSDVLSCFQLYEAMASAHFGKKIACLRSDNGSEYYGNDFIQFCARKGIQ FT MVATVPYTPQQNGVSERMNRTLLDKARTMLHESNSPKELWGEAVYAATYLT FT NRSPTNAMLEHKTPGAAGAGGAAAGPPRGRPPAIRRTKV" FT CDS 2867..4669 FT /product="Copia-111_AA-I_1p" FT /translation="MINDSIDMDNVTISHGGHPEEEIVDHNITINEVEDET FT LDDTFITASDKESSIPEADPVQQTDRIQEQELRRNERTRKQPKWFADYETT FT FLAGGNMQEIPQTVEHLKKRSDWNHWKDAIIQEMNSLDENQTWIPANTLGN FT RKPVKSKWVFTIKDDDRYKARLVAKGCSQKPGLDYHDTFAPVANMNSIRVM FT LSLANEHKLIIHQMDVKTAYLNGELNEEVFMELPADEFGKSKIVKLQKSLY FT GLKQSGRSWNERFDGTMRKLGFVPLSNECCIYKMGQHQLYVVLYVDDVLII FT GSSAKAISWIKNKLSDEFQMKDLKEISHFLGLDIRRDIENQVTEISQTGYI FT ENILARFGMENCKPIATPMDANTNWDTKEVQPTEYPYKELLGCLQYLALSS FT RPDINAAVNILSKYQSRATDVHWSGLKRILRYLKGTKATKIVYKKRQEEPI FT LQGYADADFGNDVEDRKSISGYVFQVFGNMVSWATKRQPTVSLSSTEAELV FT ALCTAAKEGMWLSHLLTELEVDNLPIVIHEDNIPCINFSKEPREHQRMKHI FT DVKFVFIRDLIKKGDMKVIYKQTNDQPADALTKPLSKVQHHKLFGLLNMKI FT EGT" XX SQ Sequence 4669 BP; 1470 A; 910 C; 1216 G; 1073 T; 0 other; ggttatgggc ccaggccata ccaaggcgca acagaacgat ggaccaggac gaatcggctg 60 tggttagcag caggaacgag tcggctgtgg ttagcagccg gaagaagtcg gctaccggta 120 gcaatcggta cgagtcggct gtaggccgag accgggacga gttggctgtg ggcagcagtc 180 ggaatgagtc ggctccaggc agaggccggg gcgagtcggc tgtaggcaga gcccggaacg 240 agtcggctgt cggcagcagt cggaatgagt cggctgtgga tagcagccgg aacgaattga 300 gtcgcagcaa gaacgagtcg ggcagcagcc aggacagagg ggctgtgggt agcagccgag 360 gtcgagcgac aggagcaata cccaaacacg gtggcagaag tcggagcgag gacggagcaa 420 ccagtgcaaa tcatagtcgt gagagagcgg ctggtgttag ccgtggtgaa agtggacata 480 ctgccacaaa tcgtggcaga gataatgcgg ctggtgagag tcgcagttat atcagaaatg 540 atgctgcagg ccgtggcata aacaggacgg ttagtcagaa tcgtagccgc gaccaggacg 600 gtggagtgca cggttgggat cggcgtgacg aaacacctgg cagaaaccga ttcggcagat 660 tgcacgatga tgatcgtaat cttggggcag caggtggcga ttggtacgat gaagaagaag 720 gccagtacga cgccgaatct agtagcagac agtatggcag agtccgtaca gaaaatattc 780 catcgatacg agtgacagca tcggacgggc ttcgtacggc cgaagccacc aatgggatgg 840 ataatctttt ggtgccgaat gcgaagacgg tgatgctgat cgaaagacga agccaaatgg 900 ttggcgtgtt cgagggagac gaacgagaat ttacgccatg gaaatggcgt ctgatgcatc 960 atctgtcacg gttgaagctg gactatactg tgctgcgcac gcctcgacag gagccgtatg 1020 gacaaccaga cccaaatgcc acacctgccg aagaagcacg tcggcaagcg ctttaccaaa 1080 agcgttgcga agatgattgc gaggccatcg atgaaatcat cttcactata ggaaacaagc 1140 ctctccagaa gctcatgcac tgccaatatg cgaaagagtg catggatgct ttgtcaaacg 1200 cataccagag aggcggaaca ctggtggcta tgtcactacg ccgtcagctg ttcaggttgg 1260 gagatcgcca attccactcc ctggaggaat tgttcaagga acattctcgg ctctgtatgg 1320 agctggaata tgctgaccca caaggtacgc atctggagaa gattcaaaca ttgctcggag 1380 cagtaccaca tcgtctgaac catgcggtgg cagcaattct catgctaccc gaaccggttt 1440 tgaagaccaa gcctctcgac gaattacatc gaatgctgtt ggatgccgag acgattcagc 1500 gaccggaacg acaaccgaca aatgtggcta tggtagccaa accgagggat ggcattgaat 1560 gtttcggatg tcaccagatg ggccattacc ggaacaaatg cccgaatgaa acgaagaaca 1620 aaaagttcat ccgaaagaag tggatcaggc aggcaagctc ggacaggaag gatcgagaac 1680 acgcgctgat ggtttccacc agaccagtga agcaaaagcg tttcgtgatc gactcaggag 1740 catcgaacca catgatcaat gatgatacca tgatgagtga tgtggagaat ttggatgtga 1800 atttgattgt tactaccgca aagcagggtg agcacatgaa agcaaccaag aagggatcgg 1860 tagcattgaa gtctgtagtt ggagagaata aaattaagaa gttgaagctg tataactgtc 1920 tttttattcc tgggttacag gaaaatctga tttccgtgaa aagagttgtt gctcaaggga 1980 aggaagtcca tttctccaac aaatccgttg taattcaaga gaaatctggc gaggttatcg 2040 ctagagggca tcttttgaat ggattatatt ggcttgacgt tcatggaaac caaccggcca 2100 ataatgcaat gctttgtaag aataatggtt tggtaacttg gcataagcgc ctcggacatc 2160 ttggacaatc tagcatggag aaattgcatc gacattcaat gattgaagga tttgaaatta 2220 aatcgagtga atttgacagt aacacgttac aatgcgattc atgcttgttt ggaaaacaaa 2280 caagagaaag gttcaaggat acagaaaagc ctcgttcaaa cagaccgctt gaattggtac 2340 actctgacgt ttgcggatat tttgaagaac cgacatacga aggatatcga tattttgtaa 2400 cattcattga cgacttcaca cattttacgg tggtatacct tttgcagcac aaaagtgacg 2460 tattgagttg tttccaacta tacgaagcaa tggcatcagc gcattttgga aagaagatag 2520 cttgtttgcg ttcagataat ggttctgaat actatggaaa cgacttcatt caattctgtg 2580 ccagaaaagg gattcaaatg gtagcgacag tgccatatac tccacaacag aatggggtaa 2640 gcgagagaat gaataggaca cttttggata aggcacgaac tatgttgcac gagagtaatt 2700 cgccaaaaga gctttgggga gaagccgtgt atgctgcaac ttatttgact aatagatctc 2760 caacgaatgc gatgctggag cataaaactc ccggggcggc gggggcgggg ggcgcggccg 2820 ccgggccccc ccggggccgg cccccggcca tacggagaac gaaagtatga ttaatgattc 2880 tattgacatg gataatgtca ctatttccca tggaggtcat cctgaggaag aaattgtgga 2940 tcataatata actatcaatg aagttgaaga cgaaaccctc gatgatacgt ttatcacagc 3000 tagtgataaa gaatctagta ttccggaagc tgatccagtg caacagactg atcgaataca 3060 agaacaagag ctacgaagaa atgaacgaac acgaaaacaa cccaaatggt ttgcggatta 3120 cgagactaca tttcttgcgg gaggaaacat gcaagaaatt ccgcaaacgg ttgaacattt 3180 gaagaaacga tccgattgga atcattggaa agacgccata atacaggaaa tgaattcact 3240 ggatgaaaac caaacatgga ttcctgcaaa tacactgggg aaccgtaaac cagttaaatc 3300 aaaatgggtg tttaccatta aagatgatga tcggtacaag gctagattgg ttgcgaaagg 3360 ttgttcacag aaaccaggtc ttgactatca tgatactttt gcaccagtgg ccaatatgaa 3420 tagcattaga gttatgttgt cattggcaaa cgaacacaag ctaattatcc accaaatgga 3480 cgttaagaca gcttatttga atggagaact aaacgaagaa gtatttatgg aacttccagc 3540 agatgaattt ggaaaatcaa aaatcgtcaa acttcaaaaa tccctttatg gattgaaaca 3600 atctggtaga agttggaatg aacgatttga tggcaccatg agaaaacttg gatttgttcc 3660 attgagcaac gaatgttgca tttataaaat gggtcaacat cagctctacg ttgttcttta 3720 cgtcgatgat gtcctcatta ttggaagttc agctaaggct attagctgga tcaaaaacaa 3780 actttcagat gaatttcaaa tgaaagattt gaaagaaatt tcacactttt tgggattgga 3840 tatcagacga gatatcgaaa accaagttac ggaaatttct caaacgggat acattgagaa 3900 tattctggca agatttggta tggaaaattg taagccaatt gcaacaccga tggatgctaa 3960 taccaattgg gataccaaag aggttcaacc tactgaatat ccatacaaag aacttctcgg 4020 atgtttacag tatttggctt tatcatcgag accagatatc aacgctgccg ttaatatatt 4080 gagtaaatac cagtcacgtg caacggatgt tcactggagt ggactaaagc gtattttgag 4140 atacttgaaa ggaaccaagg ctacgaagat cgtgtataag aaacgccaag aagaaccaat 4200 actgcaagga tacgcggatg ccgactttgg aaacgacgta gaagacagga aatcgatttc 4260 tggatatgtt ttccaagttt ttggaaatat ggtttcgtgg gcaacgaaac gtcaacctac 4320 agtgagtctt tcatctacag aagcagagtt agttgctttg tgtaccgcag ccaaggaagg 4380 aatgtggcta tcacatttgt tgacagaatt ggaagtggat aacttaccaa ttgtcataca 4440 tgaagacaac atcccctgca tcaatttttc gaaggaacca cgagaacatc aacggatgaa 4500 gcacatcgat gttaaatttg ttttcattag ggacttgatt aagaaaggag atatgaaggt 4560 tatttataag caaactaatg atcaaccagc agatgcttta acaaagccgt tatcaaaagt 4620 tcagcatcac aaattgtttg gattgctaaa catgaaaatt gaaggaacg 4669 // ID Gypsy-9_PPc-LTR repbase; DNA; INV; 538 BP. XX AC . XX DT 08-JUL-2010 (Rel. 15.07, Created) DT 08-JUL-2010 (Rel. 15.07, Last updated, Version -1) XX DE LTR retrotransposon from the Pristionchus pacificus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_PPc_; KW Gypsy-9_PPc-I; Gypsy-9_PPc-LTR. XX OS Pristionchus pacificus OC Eukaryota; Metazoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. XX RN [1] RP 1-538 RA Jurka J.; RT "LTR retrotransposons from the Pristionchus pacificus genome."; RL Repbase Reports 10(7), 1011-1011 (2010). XX DR [1] (Consensus) XX SQ Sequence 538 BP; 177 A; 87 C; 158 G; 116 T; 0 other; tgcgactgct ggcttcatgc aaatgatggc aaagatgatg gaagcacaga gaaaacagaa 60 tgaattgatt gaagcgagga tggagagcca acagggactg attgaatctc agaaggaagc 120 gatgaaacag cagaatgagg ccattcaggc actgctgtct gagagaacag agagatcgga 180 tacgaggtcg gagcagtccg agagtttggg acacagagga ccaagtatcg atgctctaga 240 gaaacagatc aggaacttct cgtataaccc agaggatggg gtgacttttg acacgtggat 300 cgatcgtcac agagagatat tcgagaatga tctgaaggat atggctgatg gtgataaggg 360 tagattacta ctgcgtcatg tggatgacag agtgtataag cagttagtag atcacatcag 420 accgaagaag acatctgacc tcaagtttga ggagataatc acgataatgc aggacttgtt 480 tggtgacaaa aagtctgtat tcgagaagag actggatgtg ttcagtctca agatgtca 538 // ID Gypsy-245_AA-I repbase; DNA; INV; 7047 BP. XX AC . XX DT 18-JAN-2011 (Rel. 16.03, Created) DT 18-JAN-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-245_AA_; KW Gypsy-245_AA-LTR; Gypsy-245_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RP 1-7047 RA Jurka J.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Repbase Reports 11(3), 1091-1091 (2011). XX DR [1] (Consensus) XX CC LTRs are 92% similar to each other. XX FH Key Location/Qualifiers FT CDS 351..2351 FT /product="Gypsy-245_AA-I_1p" FT /translation="MEMSVLRAQYSALNTAHLTVDELEHELVIRRMDKLAP FT RASLERALRSRLKEEDSQSNVAYDFAKVNVVEELETCDDKLNSIKAFLENR FT RSKKAPDQVFKSRLFHILFRMERLKDRELSEECMNSLAVVAGECLRLLNTY FT FSATSHLPEVRAAELAIVNESLNRMRDELVEGNQQHEEEIGELVNLDGDEN FT VGEQERRMTQRVDENENAVEGNLEEDENVEKLKKEQEKLSAENKQLLEVVS FT QLLQRIEVLEAKQPKQAIQEQQAKTFNSTHLGETEKVPPKSSQPKPTDFLE FT WLKDRNSSMDSSEQLAEPRKVVVDPSQPKSSEREWSKVNSNRLPVHKWTIR FT YDGMDNGKRLNEFVKEVEFNARSEGFSEDELFQSAHHLFTNKARAWFMEVN FT GDYELGSWKVLVRELKNEFLPIDIDYVYERQAYNRKQGAKEKFQDYYLDMV FT RIFRGMSNPWDDRRKFDVLFRNTREDVRIAMLAANVTDVAKMKEFGKKFDA FT INWQLYQRNERFFNNRPIRVDEINYHRQNFXNRQDRYNKNEPDRRKESGNF FT RQNFNKGNGQKGYWKQRSSGEENQKRDEYKKSEKSDNKTHNPHRQKSPTPG FT PSGSSALQRIVKAYIPIKKGICFNCHENDHSFEECPKEKHMFCFKCGFPGF FT TLKNCPYCQSKNMHETAQ" FT CDS 2279..5503 FT /product="Gypsy-245_AA-I_2p" FT /translation="MWIPWIHIEKLPLLSIKKHARDCSVRQVSLPNNVESP FT HEIVASEPPEILNQLGYEQVLGENSGSEEIATIVTLSNDPRPFAKVEILGI FT SVLGLLDSGAERTVLGVGSRRLISSLKLKIKPTNTNLRTAAGERLEVIGSA FT DIPISFNGETKILPVVIAPKLNRRCILGYDFWRKFGIRPTFSQSVESVDED FT TNRDLEEEEQLSEEQVQQLEEVKKLFLVASPGNFGKTDLIEHRIDFKEEFQ FT GAEPVRKNPYPWSPEIQKKIHNSVDKMLQDNIIEPSESDWALPVVPVKKRD FT SEEVRLCLDARKLNERTKRDAYPLPHQNRILSHLGPFKYLSTIDLSQAFLQ FT VPLGQESRKFTAFSVPGKGLYQFTRLPFGLVNSPATLSKLMDRVLGYGALE FT PFIFVYLDDIVVASHTFADHIQKLRELARRLKEANLYINLEKSNFCCQELP FT YLGYILSRAGLRPNPDRVQAILGFEVPNSVRSLRRFLGMVNYYRRFIDKFS FT ELTAPLTDLLKNKPKRVQWNNAANEAFKAIKERLISAPVMANPDFSLPFCV FT QTDASDTAIAGVLTQVQNGEERVIAYHSEKLKGAELNYHAAEKEGLAALRC FT IDKFRCYIEGTHFILMTDSSALTFIMKAKWRSSSRLSRWSITLQQYDMEVK FT HRKGKDNIVPDALSRSIEALEVEDEDEWYKNLYNAVXENPENYMDFRIDGG FT KLYKLVSARSDVLDYSFQWKECVPESSRPQIMKQEHDDNLHIGFEKCVEKL FT KRRYYWPRMSANLKKYINNCVSCKENKPSTISTAPQMGQQRIATKPFQIIC FT MDYIQSLPRSKQGNAHLLVILDIFSKYCLLVPVKKISSASLCKIVEELWLR FT KLSVPQYLISDNATTFLSKEFQELLKKYEIQHWANARHRSQANPAERLNRT FT INAMIRSYVRSDQRLWDTKVSEMEYVLNNTVHSSTKFSPHRIVFGNEIMTR FT GSDHRLESNEELTDEVRMEKMRGVSKKVWEMVVENLKRTHESTKRQYDLRH FT KRYSPTFDVGQRVYKRSFRQSSAGHDYNAKLGPVYTPCIIVAKKGTSSYEX FT SDMNGKSLGVFSSADLKA" XX SQ Sequence 7047 BP; 2211 A; 1358 C; 1647 G; 1825 T; 6 other; cggagcaaat cctgcaacta tgttacamtt tggcacccaa cgaacgaatt gtctttgcgc 60 attccgtgct agggctcaaa cactttagag cttctgataa gaagcttgaa cttctttgcg 120 gtactaggtc aaattgatct ggtcagcatt tttctaggat tttcaaatta tttagagctt 180 ttgctaagaa gcgtgaacaa attttaggta ctttttcaaa ttaatttagc tttcgaaaaa 240 cgaaagcgtt tacaaaacta ggtttttttt ttcaaatttt acagctttgt gctgaaatag 300 gtttcttttt aatttaattc tactgatttc cggtggatag agctgcaatc atggagatgt 360 cagtgctgcg tgcacagtac agtgcgttga atactgccca cctcaccgtg gatgaactcg 420 aacatgagtt ggtaatccgt cgcatggaca aactcgcacc tcgcgccagt ttggagagag 480 cgttgcgtag tcgtctgaaa gaagaagata gtcagtcaaa tgttgcatac gattttgcca 540 aagtaaatgt tgttgaagag ttagaaacat gtgacgataa attgaattcg atcaaggcgt 600 ttttggagaa cagacgatcg aagaaagctc cagaccaggt tttcaagagc aggttatttc 660 acattctatt tcgtatggaa agattgaaag atcgggagtt gagcgaagag tgtatgaata 720 gcctggctgt agtggctgga gagtgtttgc gmttgttgaa cacatacttt tctgctacct 780 cacatcttcc ggaggttaga gcagccgagt tagcgattgt aaacgaaagt ttaaatcgta 840 tgagagacga gttagttgaa ggaaatcaac agcatgaaga agaaatagga gagctagtca 900 acttagatgg agatgagaat gtaggagaac aggaacgtag aatgacgcaa agagtagatg 960 agaatgaaaa tgccgttgaa ggaaatcttg aggaagacga aaatgtagaa aagttgaaga 1020 aagaacagga aaagctgtcc gccgaaaaca agcaattgtt agaggtagtc agccagttgc 1080 ttcaacgtat cgaagttttg gaggctaagc aaccaaaaca agcgatacaa gaacaacaag 1140 ccaaaacgtt caacagcacc cacttaggcg aaactgagaa ggttccacca aaatcttctc 1200 agccaaagcc gacagatttt ctggagtggt tgaaagacag gaacagttca atggatagtt 1260 cagagcagtt agcagaacca agaaaagtag tggttgatcc tagccaaccg aaatcgtcag 1320 agagagaatg gtcgaaagta aatagcaacc gtttgccggt gcataaatgg acgattcggt 1380 atgacggcat ggacaacggt aaacgcctga atgagtttgt gaaggaggtt gagttcaatg 1440 ctcgttcaga agggttctct gaagatgaac tgtttcagag tgctcatcac ctgttcacta 1500 acaaagccag agcttggttc atggaggtga atggtgatta cgaattaggg tcttggaaag 1560 tgttagttcg cgaacttaaa aacgaattct taccgattga catcgattac gtctacgaaa 1620 gacaggcgta taatcggaaa cagggcgcca aggaaaagtt tcaggactat tacttggaca 1680 tggtccgtat tttccgaggt atgtccaacc cctgggatga taggcgcaag tttgatgtgc 1740 tcttccgcaa tacgcgtgag gatgtgcgta tcgccatgct tgctgccaac gtgaccgacg 1800 tagctaaaat gaaagaattc ggtaagaagt ttgatgcgat maactggcaa ctataccagc 1860 gaaatgaaag attctttaat aataggccaa ttcgggtaga tgagattaat taccaccgac 1920 agaatttcsc aaatagacag gatagataca acaaaaatga accagatagg aggaaggaat 1980 cgggaaattt caggcagaat ttcaataaag gtaatgggca gaaaggttat tggaagcagc 2040 gcagttcggg tgaggaaaac caaaaaaggg atgagtacaa gaaatctgag aaatcagaca 2100 acaaaactca caatccccat agacagaaaa gtccaactcc gggaccgagt ggaagtagtg 2160 ccttacagcg cattgtcaag gcttacattc caattaagaa gggcatttgc ttcaactgcc 2220 atgaaaatga tcacagtttt gaggaatgtc ctaaggagaa acatatgttt tgtttcaaat 2280 gtggattccc tggattcaca ttgaaaaact gcccttactg tcaatcaaaa aacatgcacg 2340 agactgctca gtgaggcaag tcagtctccc aaataacgta gaaagtcctc acgaaattgt 2400 cgcgtcagaa cctccagaaa tactgaacca actcggctat gagcaggtac tcggtgagaa 2460 ctcaggaagt gaagagatcg caaccattgt cacgctgagc aacgatccga gaccatttgc 2520 gaaggttgag attctaggta tttctgtgtt aggtttgtta gacagcggtg cggagaggac 2580 tgtattaggt gtaggaagta ggaggctgat tagttcatta aaactgaaaa ttaaacccac 2640 gaacacaaat ctgagaacag cagctggaga aaggttagag gtgattggtt cagcagacat 2700 tccaattagt ttcaacggtg aaacgaagat cttacccgta gtgattgctc caaaactcaa 2760 tcgtagatgc atcctagggt acgatttttg gcgaaaattt gggattcgtc cgacgtttag 2820 ccagtctgtc gagtccgttg acgaagacac taacagggat ctagaagaag aagaacagtt 2880 gagcgaagag caggttcaac agttagagga ggtgaaaaag ttgttcttgg tagccagtcc 2940 tgggaatttt ggaaagacgg atcttattga acaccgtata gattttaagg aggagtttca 3000 gggagcggaa cctgttcgga agaatccgta cccctggagt ccagaaatac agaagaaaat 3060 ccataattcc gtcgacaaaa tgctccaaga caacatcatc gaaccttctg aatctgactg 3120 ggctctacct gtagtaccag taaagaaacg tgatagcgaa gaagtaaggt tgtgccttga 3180 tgctagaaag ctcaacgagc gcacaaaaag ggatgcgtac cccctccccc accaaaatcg 3240 cattttaagc catttaggac catttaagta tctgtccact atagatctat cccaagcctt 3300 tttacaggtg cccttggggc aagaatctcg taaatttacg gcattctcag tcccagggaa 3360 gggtctatat cagttcacac gtttaccgtt tggactggta aacagcccag ccaccttaag 3420 caaacttatg gaccgtgttt tagggtatgg tgccctggaa cccttcattt ttgtatatct 3480 ggatgatatt gtagtggcca gtcatacgtt tgcggatcat atccaaaaac tgcgagaact 3540 tgcgagacgt ctcaaggagg caaatctcta cataaacctt gagaagtcta acttttgctg 3600 ccaagaactg ccctatttgg gctacatttt atcccgagca ggtcttagac ctaacccaga 3660 tcgagtccaa gcgatattgg gttttgaggt cccaaactcg gtgcgttcat taaggcgatt 3720 cctcggtatg gtcaactact accgtaggtt catcgataaa tttagtgaac tcactgcacc 3780 tctcaccgac ttgcttaaaa acaagccgaa gagagtgcag tggaacaacg cagcaaatga 3840 agcttttaag gccatcaaag agaggctgat atcagcgcct gtgatggcca acccagactt 3900 tagtctgcct ttttgtgtcc aaacagacgc aagtgacacg gcaatcgctg gagttcttac 3960 gcaggtccag aatggggagg aaagggtgat tgcgtaccat tccgaaaagt taaagggcgc 4020 agaactgaac tatcatgcag cggaaaagga gggactagcc gccttacgct gcattgataa 4080 attcaggtgc tacattgagg gcacacactt tattctgatg accgattcct cagctttgac 4140 ctttataatg aaggccaaat ggaggtcatc ttcacgtctg agcaggtgga gtattacgct 4200 acagcaatac gacatggaag tgaagcacag gaaagggaag gacaacattg taccggatgc 4260 cctttctcga tcaatcgaag ctctagaagt agaggatgag gacgagtggt acaaaaattt 4320 gtacaatgct gtcgwagaaa atcccgaaaa ttatatggat ttccgaatag acggtggcaa 4380 attgtacaaa cttgtgtcag cccgatcaga tgtcctggat tacagcttcc agtggaagga 4440 atgcgttcca gaatcatctc gtccacaaat catgaagcag gagcacgatg ataacttgca 4500 catcggcttc gaaaaatgtg tggaaaaact caaaaggcgg tattactggc ctcgaatgag 4560 tgccaaccta aaaaagtata taaacaactg cgtttcgtgc aaagaaaaca aaccgtcgac 4620 aatttctact gcacctcaaa tgggccaaca acggatcgca accaaaccgt tccagatcat 4680 ttgtatggac tacattcagt ctttaccccg gagcaagcag ggaaatgcac acttattggt 4740 cattcttgac attttctcta agtattgcct gcttgtcccg gtgaagaaga tctcgtccgc 4800 tagtctgtgc aagatcgtgg aagagttgtg gctgcgaaag ttgtcggtgc cccaatatct 4860 gatttccgac aatgcaacaa cgttcttgtc gaaagagttc caagaactgt tgaaaaagta 4920 cgagattcaa cactgggcga atgctcgtca caggagtcag gcgaatcctg cggaacgttt 4980 gaaccgtaca atcaacgcga tgatcagatc atacgtccga tcagatcaac gtctgtggga 5040 cactaaagtg tcggagatgg aatacgtgtt gaacaacaca gttcactcgt caaccaaatt 5100 ctctccacat cgaatcgtgt tcggaaatga gatcatgact cgtggatctg accaccgttt 5160 agagagtaac gaggaactca ccgatgaagt tcgtatggag aagatgagag gtgtgtccaa 5220 aaaggtgtgg gagatggttg tagaaaacct aaaaaggaca cacgagagta cgaaaaggca 5280 gtacgattta cggcacaaga ggtactcacc aacattcgat gtaggacagc gagtgtataa 5340 gaggtcattc cgacaatcct cagctgggca cgactacaac gccaagctcg gcccagtcta 5400 cactccctgc atcattgttg ccaaaaaggg taccagctca tacgaastct ccgacatgaa 5460 tggcaaatct ctcggcgtct tttcgtcggc agatttaaag gcgtagtcaa caatttctcc 5520 atggtacgtc ttgttcctga gaatcgagtg gtcgaatcgg ttgacgacag gttttccgta 5580 aaatcctcat caggaagtca cgtcagatac cattcgcata agctcaattc ggggatggaa 5640 caaacggaca cattcaaatg tgtgcagctg cagatgcatt cattcatcaa tgcccatcgt 5700 taacgaacga ggcatcgcat ccattcaaaa atcctaggaa accgtagaat attttaataa 5760 ttttgtgaag taaaatttta tgtgttattt tctctttttg tttttaattt gtttgtttta 5820 aaccgtgttg gaaagtgttg gaaattatag aattaaatat gtattcaaat gcaaagtacg 5880 ttttatcatt tagctagctt ccttgtaaat ggaccagtct aaggttgcta tgttagtggt 5940 agatgagagt gagcatggaa aaaagtatat ccatagagca tgggctaggc aagatcgagc 6000 cacataaact cttaccacaa taattgtagg aaacacatca atagacaaac aaatttagga 6060 taaccaatta atcgctcaca atagtttaac ggtgcacagt acaattaaag gtacttacct 6120 ggaatcatgc ctcatggcaa gcaccacaat gagcttctag tcaggaataa acctggaaaa 6180 ctaggccttt ttatgagaat tgaaccacca aaatgatcat ttggccaaac tttcgtcgac 6240 gaaatgtcat taaaatgata aaaaagctct aaaatgcctg ttttcacaaa tttatcctaa 6300 ctagaaccac attggatact taccacgtcc taacaatagc cacatatgca gtccataagt 6360 atagctcgat cttcccgaac agcacacagt tgatcatgat aacgataacg cgcgtatatg 6420 atagtagatc ggccgatcga cagtgaactc cgacactgag aaatattgca aagtactaaa 6480 cgaaatgttt cgtaccatag tagagggtat ctctgcaacg ttgtttttgt aaataataat 6540 tccggaattt agtttgatta ttttgcatgt agtttcaatt gtagattgtg tagtattttt 6600 aaagttgtat aagcaacatt tttatttggt gataggtatt tttttaattc gttgtaaata 6660 tttttaaatt gtttctcatc aatgttggtc aataaaattt agtatcttgg gtacgttgga 6720 gaccggagtc ttagtacgtt gatggtcagt agaaggttga acctttaaat tgcgtggagt 6780 tggctaagcc aacgatgatg ccttatgacc ccgtcccatg tatatgctaa aataaaatgc 6840 tctcttaccc gggtggattt ttgaaattga aggagtggag cagtgtgtca agattttgtc 6900 tatttcattg taaaaaaata agtgtctgtc aacgaagttt gtgaatttca ggttcagagg 6960 tacgtgttaa gttgtcagag tgttgagtgt gtaaatgaaa aaaatattag ctccactaat 7020 atttttttta cccgagcggg gggagag 7047 // ID BEL-75_AA-I repbase; DNA; INV; 5833 BP. XX AC AAGE02021353; XX DT 23-DEC-2010 (Rel. 16.02, Created) DT 18-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the yellow fever mosquito genome: DE internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-75_AA_; KW BEL-75_AA-LTR; BEL-75_AA-I. XX OS Aedes aegypti OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Aedes; Stegomyia. XX RN [1] RA Nene V., Wortman J.R., Lawson D., Haas B., Kodira C., Tu Z.J. RA et al.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316(5832), 1718-1723 (2007). XX RN [2] RP 1-5833 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the yellow fever mosquito genome."; RL Direct Submission to RU (23-DEC-2010). XX DR Genome; AAGE02021353; Positions 9862 4030. XX CC Positions [4551-5123] - Integrase core CC 'AGTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 255..5528 FT /product="BEL-75_AA-I_1p" FT /translation="MSNERRIKSLKTRQKSLLTSFCLIQKFVDEYTEETCE FT REVSVRLEHAETIWKDFNSVQTELETLDESALDDHLKQRVEFETTYFRVKG FT FLLSVNKHDLPSPRTPSSPHSTAQGPLSSHVRLPDIKLPVFTGNLDQWLNF FT HDLYVSLVHSSNELSSIQKFYYLRSSLSGDALKLVQTVALSANNYLVAWNL FT LLDHYQNPIRLKQSYVDSLFEFPSVKRESAPELHSLVEKFETNVKILKQLG FT EKTEFWDVLLIRMLSIRLDSTTRRDWEEYSSTLNAVSFKELTSFIQRRVNV FT LQTIGKTAENYSPSNSMKKPTQRIVSSHGASPTNQRKCVACSDHHPLYQCG FT VFSKLSVDEKEKEVRRHQLCRNCLRKGHQAKECPSSSTCRKCRGHHHTQLC FT LGATHINTKPADTPLKENTNLPSSENPSASVSASVDDCASYSSAGRGGKRV FT LLATAIVIVVDDNGTEHPARALLDSGSECCFATESFSQMVKAQRKRIHLPI FT AGIGQSSTESRFKFASKIRSRISEFSANVDLLVLPKVAIDLPSATFDSSSW FT DIPSGIQLADPAFYKPNKIDLILGAEVFFDLFKVSGRIQLGPGLPTLINSV FT FGWVVSGKSSHNPTSTPIVANVATIADLHDLVQKFWALEEDNLQPCYSVEE FT SACEEHFRRTVNRNAEGRYIVSLPIKEDVLAKLGDNRRTAIRRFHLLEGKL FT AHNAELDQQYRSFMEEYFQLGHMERVQDHQQNRLPCYHLPHHAVVREDSTT FT TKVRVVFDASCRTTNGPSLNDALMVGPIVQDDLRSITLRSRMHPILINADI FT KQMYRQILTDDRSNSLQHIVWRASPESPLQTYELKTVTYGTASAPFLATRV FT LRQLADDEQIDFPIAADVLRNDFYVDDLFTGANTTEEAIELRTQMESLLKR FT GGFELRKWASNDPVVLQDVPSENRALKPSIDLDREACIKTLGLHWEPGTDV FT LRYKTQIATDSMDGLTKRIALSYIARLFDPLGLVGPIITTAKIFMQAIWAL FT LDEDGKPWSWDKILPPTFRTRWEMYQTQLPNLNQLRVPRCVTFSSPTSLQL FT HVFTDASEHAYGACVYIRSTNNTGEVKINLLTSKSKIAPLKRQSIPRLELC FT GALLGAQLYEKILIALKIKAETFFWVDSTVVLCWLKSSPSAWATFVANRVS FT KIQLATENCSWNHVAGVQNPADLISRGTTTDQLIASKLWWNGPEWLALDPS FT EWPAFHLNMVESNEALNEAKATSNNAVTIAEEPSFIDEYVAIHSNYHRMLR FT ITAYCLRFLRNCCLKSNDRRTGILTTVEINAAETTLIKLVQRQFFGNEITQ FT LQQGNQISTKSRLRWFHPFLDAHQVLRIGGRLSNSLLPFESKHQAILPSSH FT PFSALLVQSIHLRQLHAAPQLLLTILRLRHWVIGARDLAKRTVRNCVTCCR FT NRPKTIQQFMAELPSTRVTASRPFTTTGIDYWGPIQIKQQHRRAAPRKAYV FT AVFVCFSTRAVHIELVADLSTAKFIQALRRFVSRRGICSDIHSDNGRNFVG FT AANELRRLINSKEHQAAVSQECLSNGIKWHFNPPKASHFGGLWEAAIASAQ FT KHFFRVLGPHILCYDDTETLLTQIECCLNSRPIIPISDDPSDYEVLTPGHF FT LVGSSMKAVPDIDHSATPFNRLHQWQQVQKLFQHLWQRWNREYLFTLQQRT FT KWVNPPTIIDVGRLVILKDENAPPMTWQTARIEELHPGADGIVRVVTLRTS FT QGKFTRPVSKICLLPVASNETKTTPSSKTASSSKQLSP" XX SQ Sequence 5833 BP; 1596 A; 1583 C; 1259 G; 1395 T; 0 other; aagtggtcct tcgagccgga tctaccgtcg aactccggag atccccaaac cttctggtag 60 tttccaccac gtgcttggcc aattctgagg tacccgaata acccagcctc catcgaagct 120 tctggagcaa cgtagattca tacaaggcaa aatcttgcct gtgatcaggt aattagtatt 180 cctttctttc tatcccgttc ccgtgtgagc tacgcggtct gtgcattcca gatccagctc 240 aatctctacc agtgatgtcc aacgagcgac gcatcaagtc gttgaagacg aggcagaaga 300 gcttgctgac gtcattctgc ctaattcaga agttcgtcga cgagtacacc gaagaaactt 360 gcgagcgtga agtgtccgtc cgcctagagc atgccgaaac catctggaaa gacttcaact 420 ccgtccaaac tgaactcgag actttggacg agtctgctct cgacgatcac ttgaagcaac 480 gtgttgagtt cgagactacg tacttcagag tgaagggatt cctgctttcg gtcaataaac 540 atgatctgcc gtcacctcgt actccgtcgt ctcctcattc aactgcgcaa ggccctctgt 600 cgtcgcatgt aaggttgccc gacatcaaac ttcccgtttt caccggaaat ctagatcagt 660 ggctgaactt ccatgatctc tacgtttcac ttgtccactc atcgaacgaa ctttctagta 720 tccagaagtt ttactatctg cggtcgtcgt tgtccggaga tgccttgaag ctggtacaaa 780 cagtagcact cagcgccaac aactacttgg ttgcctggaa cttacttctg gatcactatc 840 agaaccccat ccgactcaaa cagtcctacg tcgattcgct attcgaattc ccttcggtca 900 aacgagaatc tgctccagaa ctacactcac tggtagaaaa gtttgaaacg aacgtaaaaa 960 tcctcaagca gctcggggag aaaactgagt tttgggacgt gctgctgatt cgaatgctga 1020 gcatccgcct tgattctacc actagaagag attgggagga atattcgtcc acgttaaatg 1080 ctgtctcgtt caaggaactc accagtttta tccagagaag agtcaacgtt ctacaaacca 1140 ttgggaagac agctgagaac tattcacctt ctaactcaat gaaaaagcct actcaacgaa 1200 tcgtgagcag ccacggagca agtccaacta atcagcgtaa gtgtgttgct tgctcagacc 1260 atcatccgct ttaccaatgt ggggtattct ccaagcttag cgtcgatgag aaagaaaagg 1320 aagtccggcg tcaccagctg tgccggaatt gcctccgcaa aggacaccaa gcaaaggagt 1380 gtccgtcttc gagcacctgc cgcaaatgtc gaggtcacca tcacactcaa ctgtgccttg 1440 gtgcaaccca tatcaacaca aaaccagcag atacgcctct taaagaaaac acaaacttac 1500 catcatcgga aaacccatct gcgtcggtgt cagcatctgt cgacgactgt gcaagctact 1560 catctgctgg tcgtggtggc aaacgtgttc ttcttgccac tgctatcgtc atcgtagtcg 1620 acgacaacgg aaccgaacat ccagctagag ctctacttga ctctggcagc gaatgctgtt 1680 ttgcaacaga atcattttcc cagatggtga aggctcaacg caaacggatc cacctaccta 1740 tcgccggcat tggacagtca tcaactgaat ctcgtttcaa atttgcttca aaaattcgct 1800 cgcgcatctc cgaattttct gccaacgttg atctgctcgt actgccgaag gttgccatcg 1860 atttgccatc agctacattc gactcatcat cttgggatat tccatcggga atccaactcg 1920 cagatccagc attctacaaa ccgaataaga tcgatcttat cctcggtgcc gaagtcttct 1980 ttgatttgtt caaggtttct ggccgaattc aacttggacc tggtcttcct accctcatca 2040 attcagtttt cggatgggtt gtctctggaa agagttccca caacccaaca agcacaccaa 2100 tcgtcgctaa cgtcgccact attgctgatc tgcacgacct ggttcagaaa ttttgggccc 2160 tagaagaaga taatttgcaa ccctgctact cggtggagga atccgcttgc gaagagcact 2220 tccgtcggac agtaaatcga aacgctgagg gccggtacat cgtcagctta cccatcaaag 2280 aggacgtttt agccaaactc ggcgacaacc gtcgcacagc catacgtcgc tttcatttgt 2340 tggaaggcaa gctcgctcat aatgctgagc tcgatcagca atatcgatcg ttcatggagg 2400 aatacttcca gctgggtcat atggagcgcg tacaggatca tcagcaaaat cgtctaccat 2460 gttaccacct cccgcaccac gccgtcgtga gagaggacag caccactaca aaagtacgcg 2520 ttgtctttga cgcatcctgt cgcactacta atggaccttc cttaaacgac gcacttatgg 2580 tgggccctat agtgcaggat gatttacgat caatcacgct tcgatcacga atgcatccaa 2640 tcctcatcaa cgcagatata aaacagatgt accggcaaat cttgacggac gacagaagca 2700 acagtctcca gcacatcgtt tggagagcat caccagaatc ccccttacaa acctacgaac 2760 tcaaaaccgt tacttacggt accgccagtg ccccgtttct cgccactcga gtgctgcgac 2820 agttggccga cgacgaacaa atcgactttc caatcgcagc tgacgttctt cgtaacgatt 2880 tctacgttga tgatttgttc acaggtgcaa acacaaccga ggaagcaatt gaactccgaa 2940 cccaaatgga atcgcttctc aaaagagggg ggttcgaact gcgaaagtgg gcctctaatg 3000 acccagtcgt tctacaagac gttccgtctg agaatagggc acttaagcct tcgatcgacc 3060 tcgatagaga agcatgcatt aaaacgctag gcctccactg ggaacccggc acggacgttc 3120 ttcggtacaa aacccaaatc gccacagatt caatggacgg actcactaaa cgtattgcgc 3180 tgtcgtacat agcgcgacta tttgatccac ttggactcgt tggtccaatt atcaccacag 3240 ccaaaatctt catgcaagca atatgggcct tacttgatga agacgggaaa ccatggagtt 3300 gggacaaaat actacctcca acattcagga cgcggtggga aatgtatcaa acgcaacttc 3360 caaacctcaa ccaactgcga gttccacgct gtgttacttt ctcgtcaccc acctcgctgc 3420 agctgcatgt cttcaccgat gcatcagagc acgcatatgg tgcgtgcgtc tacatccgct 3480 caacaaacaa caccggcgaa gtcaagatta atctgctcac atcgaagtcc aagatagctc 3540 cacttaaaag gcaaagcatc ccgagacttg agctctgcgg cgcgcttctt ggtgctcagc 3600 tgtacgagaa aatccttata gcactcaaaa tcaaggctga aacttttttc tgggtagatt 3660 ctactgtggt tctttgttgg ctcaaatctt ctccgtcggc atgggcaact ttcgttgcaa 3720 atagggtctc caagatacaa ctggccacgg aaaactgctc ttggaatcat gtggcaggtg 3780 tacaaaatcc agcggattta atttcaagag gaacaacaac agaccagctc atcgcaagta 3840 aactctggtg gaacggaccg gagtggctgg cactcgaccc cagcgaatgg ccagcattcc 3900 acctcaacat ggtcgaatcc aacgaagctc ttaatgaagc taaggcaact tctaataatg 3960 ccgtcaccat agcggaggaa ccatcattca ttgatgaata cgtcgcaatc cactctaatt 4020 atcaccgaat gctgagaatc acggcatact gccttcgttt tctacgaaat tgttgcctca 4080 aaagtaacga ccgcagaact gggattctca ctactgtaga aatcaacgca gcagaaacga 4140 ctctaatcaa actcgtccag cgacaattct ttggcaacga aatcacacaa ctacagcagg 4200 gaaaccaaat ctcaactaaa tcacgactgc ggtggtttca cccctttctc gatgctcacc 4260 aggtgcttcg aattggagga cgacttagca actcattact accttttgaa agtaaacatc 4320 aagcgatact tccctcatct catccattct cggcattatt ggtgcaatct atccatttgc 4380 gccaactcca tgcagccccc caacttcttc tcacgattct tcggctgcga cattgggtaa 4440 taggggccag ggatctggcc aaaaggactg ttcgtaattg tgtcacctgt tgtagaaatc 4500 gacctaaaac gatacaacaa ttcatggcgg aattaccctc tacgcgagta acggcaagca 4560 gaccattcac gacgacagga atcgactact ggggaccaat acaaattaaa caacaacatc 4620 gacgggccgc accgagaaaa gcctatgtcg ccgtgttcgt ttgcttcagt acgagggccg 4680 tgcatattga gctcgtggca gaccttagca cggcaaagtt catacaagct ctgcgacgat 4740 ttgtttcccg acggggcatc tgttcagaca tccacagcga caacggcagg aatttcgtcg 4800 gagcagctaa cgagttgcgc cggcttatca atagcaaaga gcaccaagct gcggtgtccc 4860 aggagtgtct ttcgaatggg atcaaatggc acttcaaccc gccaaaagct tctcattttg 4920 gtgggttgtg ggaggcagca atagcttctg cacagaaaca ctttttccgc gtcttaggcc 4980 cacacattct gtgctacgac gacacggaaa ctctcctcac gcaaatagaa tgttgcctga 5040 actccagacc aattattccc attagcgacg acccgagcga ctatgaggtg ctcactcctg 5100 ggcatttcct ggtcggatcc tcaatgaaag cggtaccaga catcgaccat tcagcaactc 5160 cattcaaccg acttcatcaa tggcagcagg tgcaaaagct atttcagcac ttgtggcaac 5220 ggtggaaccg cgagtaccta ttcactctac aacaacgaac gaaatgggtg aatcctccaa 5280 caattattga tgtgggaagg ctggtcattc ttaaagatga aaatgctcct cctatgacct 5340 ggcaaacagc gcgcatcgaa gaactacacc ctggagcgga tggaatcgta cgtgttgtga 5400 ctcttcgaac atcacaagga aaattcacac gaccggtatc gaaaatttgc ttgctgccgg 5460 ttgcatccaa cgaaaccaaa acaacaccgt cttccaaaac agcttcgtcc agcaaacaac 5520 tttctccgta gtttccacat ccatccttca gcaatcagca gcgcggaatg aaaatttaag 5580 gccctttatc caaagggaca aggtgagaac ctgtccatta atcctcaaat ttgcgaaacc 5640 cgctaccggg tctttgtttt ccattttctc gtcagcaacc atggcttctg ttcttttgct 5700 ccggtcgaaa aatctcttga cgtttggacc aaaacaatct tcggcatcgg attggttcgt 5760 catcgaccga gggacaaccc gctagtggaa aattcgttat catccagaaa tgtgatttct 5820 gaaggggcca gga 5833 // ID Sola1-11_AP repbase; DNA; INV; 4188 BP. XX AC ABLF01035036.1; XX DT 28-JAN-2009 (Rel. 14.02, Created) DT 28-JAN-2009 (Rel. 15.12, Last updated, Version 2) XX DE Sola1 DNA transposons from Acyrthosiphon pisum. XX KW Sola; DNA transposon; Transposable Element; Sola1; Sola1-11_AP. XX NM Sola1-11_AP. XX OS Acyrthosiphon pisum OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Hemiptera; Sternorrhyncha; Aphidiformes; OC Aphidoidea; Aphididae; Macrosiphini; Acyrthosiphon. XX RN [1] RP 1-4188 RA Bao W., Jurka M.G., Kapitonov V.V. and Jurka J.; RT "New superfamilies of eukaryotic DNA transposons and their RT internal divisions."; RL Molecular Biology and Evolution 26(5), 983-993 (2009). XX RN [2] RG International Aphid Genomics Consortium; RT "Genome sequence of the pea aphid Acyrthosiphon pisum."; RL PLoS Biol 8(2), e1000313-e1000313 (2010). XX DR [2] (Consensus) XX CC This sequence was derived from sequence data generated by Human CC Genome Sequencing Center at Baylor College of Medicine: Pea Aphid CC Genome Project. XX FH Key Location/Qualifiers FT CDS join(1153..2673,2762..3148) FT /product="Sola1-11_AP_1p" FT /translation="MYHPLDRPILTISDLESMDFRYLDSFGDTVEIQNDGV FT SNEGNNLFHTALMVPVSGTGINTHEIDQTELDYNSIINNSNLQMNNMTVNE FT ASIISESHDAPYEPDEIELNESFSDDNVVGRSGRKKPKKGTTRKVLKNYLI FT NGTGKSVKPNPCAKLYQNKCSKKFDESQREDIFKFFWELDHERRRDYLLSC FT MKRVQIKRKRSIGESRRSVTNEYFLKHNSIEVQVCQQFLLKTLDVPQKLLR FT YTHLNKSPVNTAKKDRRGYNTPINKTSPEQLQIVDKFIRCLPAVSSHYCRN FT KSNKKYLPTEFRNIANLYLIYKDYCTKKNEFPISSLVFRRFFKTKYSIGFH FT LPKKDKCKICEKKNENVNNNEEEIQKYNCHINLKEKAKAMFLSDQKKTESN FT KSFLCISFDLEKVLNTPYGNNMNFYYSRKYSQYNLTVYESVTKRVVCYLWG FT EGDANRGCYEIATCMHDYLMDIDKKKSIKTIALYCDSCCGQNKNHVIIGML FT NKFINKASGLLFSYGKTLMTDNKKVVKITKMKSVQFTKGEGIQVFFDYDEQ FT FENIMIQHSPRIRKSKKANTEVLLEPQQLYSCPRKITSQKYFDLIKLCKQE FT IIPTRFHDEYKNSSHDKNILDMLTETDEEDDVLEEKD*" XX SQ Sequence 4188 BP; 1601 A; 514 C; 607 G; 1466 T; 0 other; cgagccttat taggtaaaag ggcgtaagcg acaattttgg cgaaaatgag atattggtat 60 caaaattttt ctaaaattca gacgcatcga tctgtaaaat ggctaagcga caatataaac 120 ttaagcatgt cattttgcca tttaaagatt ggacgaaagc gacatgtcgt ttaggcaata 180 acttaaaaat tgatcgtaag tcacattaat ttcaaaaaaa atacgtaagc gactttgttg 240 catgaggtaa gcgacaaaat aataataatt atttaatcag gcgtatgcga caatttaatt 300 ggcgattaga aattcgttcg taacatcatg ttggttacga ttgttttcat aatattatat 360 tgtttatcta ttatcttcaa tattattatt aataattact tatagaattt atttaaaacg 420 gtgaaaaatc atttgatgag cttatattgt gttacaacat tatttaattt actttttacc 480 ctgcttttta cctacttctt atatttttac aactgtaatt cgtataataa aaatggaaca 540 tcaggaacat cacgttctcg agtacgaaaa atgttgtttg ctgcgaacaa tttgaataca 600 ataccattaa gtaagatttt gaaataatct ttaatttaat ttttgatttc tagtcatatt 660 attatgatta aactatttgt caaatacttt tagctgatga tagcatttgt attactcatg 720 ataattcaaa tgtacagaat ttgagtacac aagacaatga tgaatgtaat caaaatttgg 780 atcactcagg atactttggt atgttcaaca tgttaacaca aagttaacta actattggca 840 ttgatttgtt ttattataat atattattat gattttattg ttcttaataa tatatctata 900 aagtaaataa cattatatta ttatagttag tacctatttt aaatgtttca tttactattt 960 attttatact gcacttaatg ctaaaatgtt cattaccaaa tccaattatg ttagtctact 1020 tttatattgt tagttataca ctcattattt tttaatgttt cctataatat catgttttgc 1080 ttttattaat tagatgaaaa tgtacatatt attgctgaag agaatgatgc tgtcaatgat 1140 attaataata atatgtatca tcctctggat agaccaatac ttacgataag tgatcttgaa 1200 agcatggatt ttcgctactt agactcattt ggtgatactg ttgaaattca gaatgatggt 1260 gtttccaatg aaggtaataa tttatttcac actgcattga tggtgccagt atcaggtact 1320 ggcattaata ctcatgaaat tgatcagact gagttagact ataatagtat aattaacaac 1380 tcaaatcttc aaatgaataa tatgactgtg aatgaagctt ctataatatc tgaatctcat 1440 gatgccccgt atgaaccaga tgaaatagaa ttgaatgaaa gttttagtga tgacaatgta 1500 gttggaagaa gtggccgtaa aaaacctaaa aaaggtacta ctagaaaagt tttaaaaaac 1560 tacttaataa atggaacagg aaaaagtgta aaacccaacc catgtgcaaa actgtatcaa 1620 aataaatgtt caaaaaagtt tgatgaatca caaagagagg atatttttaa atttttttgg 1680 gaattagatc atgaacgtag aagagattat ttactgtcat gtatgaaacg agtacaaata 1740 aaaagaaaaa ggtcaattgg tgaaagtaga agatcagtga caaatgaata ttttttaaaa 1800 cataattcta ttgaagttca agtatgccag cagtttcttc ttaagacatt agatgtccct 1860 caaaaacttt taagatatac ccatttgaat aagtcacctg tgaacactgc taaaaaagat 1920 agacgtggtt ataatactcc tataaataaa acaagtcctg aacagttaca aatagtggat 1980 aaatttatca ggtgtttacc agctgtttca tcacactatt gtcgcaataa gtcaaacaaa 2040 aaatacctac cgactgagtt tagaaatatt gcaaatttat acttaattta taaagattat 2100 tgtacaaaaa aaaatgaatt tcctatttca tctttagttt ttcgacgttt ttttaaaact 2160 aaatactcta ttggcttcca tttaccaaag aaagacaaat gtaagatttg tgaaaaaaaa 2220 aatgaaaatg taaataacaa cgaagaagag attcaaaaat ataattgtca tatcaattta 2280 aaagaaaaag ctaaagctat gtttttgagt gatcagaaga aaactgagtc aaacaagagc 2340 tttttgtgca tcagttttga tttggaaaaa gttttaaata cgccttatgg caataatatg 2400 aatttttatt acagccgtaa gtactctcaa tataatttaa cagtgtatga atctgttaca 2460 aaaagagtag tatgttatct ttggggagaa ggagatgcta acagaggttg ctatgaaata 2520 gccacttgta tgcatgatta cttaatggat atagacaaaa aaaaatccat taaaactatt 2580 gcactctatt gtgattcctg ctgcggtcaa aataagaacc atgtcattat tggaatgctg 2640 aacaaattca ttaataaagc aagtggccta ctctgatagg taattcaaga aaaaatcctg 2700 gtccacttga agtaaaagtg atgaagttca atgatttctt tgactggaag acatttacta 2760 gttcagttat ggaaagacct taatgactga taataaaaaa gttgttaaaa tcacaaaaat 2820 gaaaagtgtt cagttcacta aaggagaagg aatacaggtt ttttttgatt atgatgaaca 2880 atttgaaaat attatgattc aacattctcc tagaataaga aaatcaaaaa aggcaaatac 2940 tgaagtttta ctggaaccac aacaactcta ttcatgccca agaaaaataa catcacaaaa 3000 gtattttgat ttaataaaac tttgtaaaca agaaattata ccaacaaggt tccatgatga 3060 gtacaaaaat tcaagccatg acaaaaatat tttggatatg ttaacagaaa ctgacgagga 3120 agatgatgtc ttagaagaaa aagattgact ttgaaataag aaaatattaa aaaatatgat 3180 agatttatgt tacagtttta cagttttatt gcaattaatt tattaaaact aagtattaag 3240 tattatattt tatattttaa tcctctaatc taataagtac tttttatttt atgtctgtaa 3300 aatgttcatt ttatgtctgt ttactcattt gaattgtgat attgttattt tttgttttat 3360 taatataact ataatattta taagttaaat gtgttaataa gatacctaca gatgatttat 3420 ataattcact acacatttta acataatata accaaaagac tggtatatat attattataa 3480 ctaagtatta attagattta atttcctatt aataatatta tgaactaaaa agactttaat 3540 tgtgataata ttgttatttt agtattttaa ttttacttac tttaaaaata aaaccaaaaa 3600 gacttggaag cctgttatat ttcttacatt atacctaatt atgtttaaat tctatgtcag 3660 attttatatt acaggtaaat attagagtaa agtttaatag aaaacgtatt ttttgaattt 3720 ctcagtaatt tttgagaaaa tagtgaagac cataatatta atatgaatac ataccaagct 3780 atgcttatgc atgtctgtct tagttgaggg taattaatac caaaaagtaa cagataataa 3840 aatgtataca caaataatat ggttaataga caaacaacca tcatagccat catattaata 3900 tttacgaagg gagggggggg ggaaatatga tagataaaga atatgaaaaa taaaaatttt 3960 agcaccattg agctatattc gaacatttat tttattgtaa gcgacataag ttacacaata 4020 ttaaaaatat tatttttatc ccgtaattgt attttcgaaa taattttttt ataactacac 4080 ttataatttt catgaaaatc catcagatat tcgattttta aagtgttttt tttagttcct 4140 gtaaaataat gaaaatgtcg cttacgccct tttacctaat agggctcg 4188 // ID SMAR17 repbase; DNA; INV; 1270 BP. XX AC . XX DT 03-OCT-2007 (Rel. 12.1, Created) DT 22-OCT-2007 (Rel. 12.1, Last updated, Version 1) XX DE Consensus sequence of Mariner-type family of repeats. XX KW Mariner/Tc1; DNA transposon; Transposable Element; SMAR17. XX OS Schmidtea mediterranea OC Eukaryota; Metazoa; Platyhelminthes; Turbellaria; Seriata; OC Tricladida; Paludicola; Dugesiidae; Schmidtea. XX RN [1] RP 1-1270 RA Jurka J.; RT "Mariner-type element from freshwater planarian (Schmidtea RT mediterranea)."; RL Repbase Reports 7(10), 1075-1075 (2007). XX DR [1] (Consensus) XX CC Individual sequences are <90% identical to consensus. XX FH Key Location/Qualifiers FT CDS 496..1134 FT /product="SMAR17_1p" FT /translation="MKMIQKEGKWIPHELTEANIENRLMVCNSLLTRHRRW FT HFLHQIVTGDEKWIYYDNPKRKKSWVHPGEPSTSTAKRNIHGSKVMLCIWW FT DQKEVVYYELLKPNETITAERYQQQLIDLNHALNEKRPIPANKQRKVILLH FT DNARPHVAKQVKNTLEALKWEILPHAPYSPDCAPSDYYLFRSMQSALQDQH FT FRNHEEIKKWIASKDKQFFRWNP" XX SQ Sequence 1270 BP; 476 A; 231 C; 220 G; 342 T; 1 other; attaggtgta taacttagtt ttttggattt tctaagagat ggcgtaactt ttgtactaaa 60 cattctttta caatgacaca tggcgcttta aagtgttgaa attccgcatt aaaatttagt 120 atgtcattat tcatgaaagc gtaaacaaca tatttaatcc crtttaaaca tgtcgaattt 180 cgtaccaaca aaacagcatt tgcgggaagc attacttttt tgctttaatc aaaagaaaaa 240 gctgctgaaa gttttcgtgt tcttcaagaa acttatggtg aacatgcacc aactcaacag 300 acttgcgaaa actggtttaa acgattcaaa agtggtgatt ttgatgtcag tgacaaagaa 360 cgttctggtc aaccaaaaaa attcgaagat gaagaattgg aagcattatt gaaggaaaac 420 tcagctcaaa cagaaaaaga acttgcaaaa cagttaaatg tcactcaagc agccataagc 480 aaacgtttac atgccatgaa aatgattcaa aaagaaggaa aatggatacc acatgaactg 540 actgaagcca acattgaaaa tcgtttaatg gtttgcaatt ctttgcttac aagacaccga 600 agatggcatt ttttgcatca aattgtgact ggcgatgaaa aatggattta ctacgataac 660 cctaaacgca aaaaatcatg ggttcatcca ggcgaaccat ccacatcaac agcaaaacga 720 aacattcatg gaagtaaggt tatgttgtgc atttggtggg atcagaaaga agtggtgtac 780 tatgagctgc taaaaccgaa cgaaactatt actgctgaac gttaccaaca acaattgata 840 gatttgaacc atgcattgaa cgaaaaacgt cctattccag ccaacaaaca gcgtaaagtg 900 attttgctgc acgacaatgc tcgaccacat gtcgcaaaac aagtaaaaaa tacgttagaa 960 gcacttaaat gggaaattct accgcatgct ccctattcac cagattgtgc cccatcagat 1020 tactacttgt tccgatcaat gcaaagcgct cttcaagacc agcacttcag aaaccacgaa 1080 gaaatcaaaa aatggatcgc atcaaaagac aaacaatttt ttcgatggaa tccataaatt 1140 gccagaaaga tggtcgaaag tcatagttaa cgatggaaaa tattatgatt aaaataaaat 1200 ttaccatttc ttagaaataa acgtgttttt ttttttgcaa aaaatcaaaa aactaagtta 1260 tacccctaat 1270 //